LOCUS LT708304 4349904 bp DNA circular BCT 25-MAY-2020 DEFINITION Mycobacterium bovis AF2122/97 genome assembly, chromosome: Mycobacterium_bovis_AF212297. ACCESSION LT708304 VERSION LT708304.1 DBLINK BioProject:PRJEB15187 BioSample:SAMEA20450668 KEYWORDS . SOURCE Mycobacterium tuberculosis variant bovis AF2122/97 ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97 Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Malone K.M. JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland REFERENCE 2 AUTHORS Malone M K., Farrell D., Malone K. JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine,, University College Dublin, D4, Ireland FEATURES Location/Qualifiers source 1..4349904 /organism="Mycobacterium tuberculosis variant bovis AF2122/97" /chromosome="Mycobacterium_bovis_AF212297" /isolate="AF2122/97" /mol_type="genomic DNA" /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection." /db_xref="taxon:233413" CDS 1..1524 /codon_start=1 /transl_table=11 /gene="dnaA" /locus_tag="BQ2027_MB0001" /product="CHROMOSOMAL REPLICATION INITIATOR PROTEIN DNAA" /note="Mb0001, dnaA, len: 507 aa. Equivalent to Rv0001, len: 507 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 507 aa overlap). dnaA, chromosomal replication initiator protein (see citations below), equivalent to other Mycobacterial CHROMOSOMAL REPLICATION INITIATOR PROTEINS e.g. P46388|DNAA_MYCLE from Mycobacterium leprae (502 aa); Q9L7L7|DNAA_MYCPA from Mycobacterium paratuberculosis (509 aa); P49990|DNAA_MYCAV from Mycobacterium avium (508 aa); P49992|DNAA_MYCSM from Mycobacterium smegmatis (504 aa); etc. Also highly similar to others except in N-terminus e.g. Q9ZH75|DNAA_STRCH CHROMOSOMAL REPLICATION INITIATOR PROTEIN from Streptomyces chrysomallus (624 aa); Q9ZH76|DNAA_STRRE from Streptomyces reticuli (643 aa); DNAA_ECOLI|P03004|B3702 chromosomal replication initiator protein from Escherichia coli strain K12 (467 aa), FASTA scores: opt: 986, E(): 0, (43.2% identity in 389 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS01008 DnaA protein signature. BELONGS TO THE DNAA FAMILY. Note that the first base of this gene has been taken as base 1 of the Mycobacterium bovis genomic sequence. Protein product from Mb0001 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0001 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P49991" /db_xref="InterPro:IPR001957" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR010921" /db_xref="InterPro:IPR013159" /db_xref="InterPro:IPR013317" /db_xref="InterPro:IPR018312" /db_xref="InterPro:IPR020591" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P49991" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98333.1" /translation="MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQ RAWLNLVQPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGHQIQLGVRIAP PATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSYFTERPRNTDS ATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARAYNPLFIWGESGLGKTHLL HAAGNYAQRLFPGMRVKYVSTEEFTNDFINSLRDDRKVAFKRSYRDVDVLLVDDIQFI EGKEGIQEEFFHTFNTLHNANKQIVISSDRPPKQLATLEDRLRTRFEWGLITDVQPPE LETRIAILRKKAQMERLAIPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDK ALAEIVLRDLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYL CRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELTTRIRQRSKR" CDS 2052..3260 /codon_start=1 /transl_table=11 /gene="dnaN" /locus_tag="BQ2027_MB0002" /product="DNA POLYMERASE III (BETA CHAIN) DNAN (DNA NUCLEOTIDYLTRANSFERASE)" /note="Mb0002, dnaN, len: 402 aa. Equivalent to Rv0002, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 402 aa overlap). dnaN, DNA polymerase III (beta chain) (EC 2.7.7.7) (see citations below), equivalent to other Mycobacterial DNA POLYMERASES III BETA CHAIN e.g. NP_301130.1|NC_002677 from Mycobacterium leprae (399 aa); Q9L7L6|DP3B_MYCPA from Mycobacterium avium subsp. paratuberculosis (399 aa); P52851|DP3B_MYCSM from Mycobacterium smegmatis (397 aa); etc. Also highly similar to others e.g. P27903|DP3B_STRCO DNA POLYMERASE III BETA CHAIN from Streptomyces coelicolor (376 aa), FASTA scores: opt: 1189, E(): 0, (52.8% identity in 337 aa overlap); P21174|DP3B_MICLU from Micrococcus luteus (310 aa); P52023|DP3B_SYNP7 from Synechococcus sp. strain PCC 7942 (375 aa); etc. Overlaps and extends CDS in neighbouring cosmid MTCY10H4.01. Protein product from Mb0002 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0002 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O33914" /db_xref="InterPro:IPR001001" /db_xref="InterPro:IPR022634" /db_xref="InterPro:IPR022635" /db_xref="InterPro:IPR022637" /db_xref="UniProtKB/Swiss-Prot:O33914" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98337.1" /translation="MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSG VLLTGSDNGLTISGFDYEVSAEAQVGAEIVSPGSVLVSGRLLSDITRALPNKPVGVHV EGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQVAIAAGRDDT LPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDIEAAVLVPAKTLAEAAKAG IGGSDVRLSLGTGPGVGKDGLLGISGNGKRSTTRLLDAEFPKFRQLLPTEHTAVATMD VAELIEAIKLVALVADRGAQVRMEFADGSVRLSAGADDVGRAEEDLVVDYAGEPLTIA FNPTYLTDGLSSLRSERVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYV YLLMPVRLPG" CDS 3280..4437 /codon_start=1 /transl_table=11 /gene="recF" /locus_tag="BQ2027_MB0003" /product="DNA REPLICATION AND REPAIR PROTEIN RECF (SINGLE-STRAND DNA BINDING PROTEIN)" /note="Mb0003, recF, len: 385 aa. Equivalent to Rv0003, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 385 aa overlap). recF, DNA replication and repair protein (see citations below), equivalent to others Mycobacterial DNA replication and repair proteins e.g. NP_301131.1|NC_002677 from Mycobacterium leprae (385 aa); Q9L7L5|RECF_MYCPA from Mycobacterium avium subsp. paratuberculosis (385 aa); P50916|RECF_MYCSM from Mycobacterium smegmatis (384 aa); etc. Also highly similar to others e.g. P36176|RECF_STRCO DNA REPLICATION AND REPAIR PROTEIN from Streptomyces coelicolor (373 aa); NP_440892.1|NC_000911 from Synechocystis sp. strain PCC 6803 (384 aa); NP_469352.1|NC_003212 from Listeria innocua (370 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00617 RecF protein signature 1, and PS00618 RecF protein signature 2. BELONGS TO THE RECF FAMILY. Protein product from Mb0003 detected using SWATH mass spectrometry. Mb0003 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U314" /db_xref="InterPro:IPR001238" /db_xref="InterPro:IPR003395" /db_xref="InterPro:IPR018078" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR042174" /db_xref="UniProtKB/Swiss-Prot:Q7U314" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98339.1" /translation="MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALW YSTTLGSHRVSADLPLIRVGTDRAVISTIVVNDGRECAVDLEIATGRVNKARLNRSSV RSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAVRAEYERVVRQ RTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAARIDLVNQLAPEVKKAYQL LAPESRSASIGYRASMDVTGPSEQSDTDRQLLAARLLAALAARRDAELERGVCLVGPH RDDLILRLGDQPAKGFASHGEAWSLAVALRLAAYQLLRVDGGEPVLLLDDVFAELDVM RRRALATAAESAEQVLVTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP" CDS 4434..4997 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0004" /product="Zn-ribbon-containing, possibly RNA-binding protein and truncated derivatives" /note="Mb0004, -, len: 187 aa. Equivalent to Rv0004, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 187 aa overlap). Conserved hypothetical protein (see citation below), highly similar, but longer 21 aa in N-terminus, to AAF33696.1|AF222789 unknown protein from Mycobacterium avium subsp. paratuberculosis (166 aa); and highly similar to NP_301132.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (189 aa); S70990 hypothetical protein from Mycobacterium smegmatis (194 aa). Also similar to in C-terminus to C-terminal part of P35925|YREG_STRCO HYPOTHETICAL 19.8 KDA PROTEIN (IN RECF-GYRB INTERGENIC REGION) from Streptomyces coelicolor (190 aa), FASTA scores: opt: 404, E(): 3.9e-18, (40.7% identity in 189 aa overlap). Mb0004 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007922" /db_xref="InterPro:IPR023007" /db_xref="UniProtKB/Swiss-Prot:Q7U313" /protein_id="SIT98341.1" /translation="MTGSVDRPDQNRGERLMKSPGLDLVRRTLDEARAAARARGQDAG RGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQW SAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSL KITGPAAPSWRKGPRHIAGRGPRDTYG" CDS 5123..7267 /codon_start=1 /transl_table=11 /gene="gyrB" /locus_tag="BQ2027_MB0005" /product="DNA GYRASE (SUBUNIT B) GYRB (DNA TOPOISOMERASE (ATP-HYDROLYSING)) (DNA TOPOISOMERASE II) (TYPE II DNA TOPOISOMERASE)" /note="Mb0005, gyrB, len: 714 aa. Equivalent to Rv0005, len: 714 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 714 aa overlap). gyrB, DNA gyrase subunit B (EC 5.99.1.3) (see citations below), equivalent, except in N-terminus, to other Mycobacterial DNA GYRASES SUBUNIT B e.g. T10005 from Mycobacterium leprae (697 aa); Q9L7L3|GYRB_MYCPA from Mycobacterium avium subsp. paratuberculosis (677 aa) (has its N-terminus shorter); P48355|GYRB_MYCSM from Mycobacterium smegmatis (675 aa); etc. Also highly similar to others e.g. T10969 from Streptomyces coelicolor (686 aa); P50075|GYBS_STRSH from Streptomyces spheroides (684 aa); etc. Contains PS00177 DNA topoisomerase II signature. BELONGS TO THE TYPE II TOPOISOMERASE FAMILY. Protein product from Mb0005 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0005 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU15" /db_xref="InterPro:IPR001241" /db_xref="InterPro:IPR002288" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR006171" /db_xref="InterPro:IPR011557" /db_xref="InterPro:IPR013506" /db_xref="InterPro:IPR013759" /db_xref="InterPro:IPR013760" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR018522" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR034160" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XU15" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98343.1" /translation="MGKNEARRSALAPDHGTVVCDPLRRLNRMHATPEESIRIVAAQK KKAQDEYGAASITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGYAT TVNVVLLEDGGVEVADDGRGIPVATHASGIPTVDVVMTQLHAGGKFDSDAYAISGGLH GVGVSVVNALSTRLEVEIKRDGYEWSQVYEKSEPLGLKQGAPTKKTGSTVRFWADPAV FETTEYDFETVARRLQEMAFLNKGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERA AESTAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGKGTGHEVEIAMQWNA GYSESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDRKLLKDKDPNLTGDDIREG LAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEANPTDSKVVVNKAV SSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPRKSELYVVEGDSAGGSAKS GRDSMFQAILPLRGKIINVEKARIDRVLKNTEVQAIITALGTGIHDEFDIGKLRYHKI VLMADADVDGQHISTLLLTLLFRFMRPLIENGHVFLAQPPLYKLKWQRSDPEFAYSDR ERDGLLEAGLKAGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDDAAA ADELFSILMGEDVDARRSFITRNAKDVRFLDV" CDS 7302..9818 /codon_start=1 /transl_table=11 /gene="gyrA" /locus_tag="BQ2027_MB0006" /product="DNA GYRASE (SUBUNIT A) GYRA (DNA TOPOISOMERASE (ATP-HYDROLYSING)) (DNA TOPOISOMERASE II) (TYPE II DNA TOPOISOMERASE)" /note="Mb0006, gyrA, len: 838 aa. Equivalent to Rv0006, len: 838 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 838 aa overlap). gyrA, DNA gyrase subunit A (EC 5.99.1.3) (see citations below), equivalent, except in N-terminus, to other Mycobacterial DNA GYRASES SUBUNIT A e.g. Q57532|GYRA_MYCLE|T10006 from Mycobacterium leprae (1273 aa); P48354|GYRA_MYCSM from Mycobacterium smegmatis (842 aa); etc. Also highly similar to others e.g. P35885|GYRA_STRCO DNA GYRASE SUBUNIT A from Streptomyces coelicolor (864 aa); NP_346654.1|NC_003030 from Clostridium acetobutylicum (830 aa); NP_387888.1|NC_000964 from Bacillus subtilis (821 aa); etc. Contains PS00018 EF-hand calcium-binding domain. Protein product from Mb0006 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0006 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XU07" /db_xref="InterPro:IPR002205" /db_xref="InterPro:IPR005743" /db_xref="InterPro:IPR006691" /db_xref="InterPro:IPR013757" /db_xref="InterPro:IPR013758" /db_xref="InterPro:IPR013760" /db_xref="InterPro:IPR035516" /db_xref="UniProtKB/TrEMBL:A0A1R3XU07" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98345.1" /translation="MTDTTLPPDDSLDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEV RDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDTLVRMAQP WSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRV QEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADAVFWALENHDADEEETLAA VMGRVKGPDFPTAGLIVGSQGTADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPY QVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKH TQLQTSFGANMLAIVDGVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILR GLVKALDALDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQR IIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIAADGDVSDE DLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLKQDDIVAHFFVCSTHDL ILFFTTQGRVYRAKAYDLPEASRTARGQHVANLLAFQPEERIAQVIQIRGYTDAPYLV LATRNGLVKKSKLTAFDSNRSGGIVAVNLRDNDELVGAVLCSADDDLLLVSANGQSIR FSATDEALRPMGRATSGVQGMRFNIDDRLLSLNVVREGTYLLVATSGGYAKRTAIEEY PVQGRGGKGVLTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQT KGVRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN" CDS 9914..10828 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0007" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0007, -, len: 304 aa. Equivalent to Rv0007, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 304 aa overlap). Possible conserved membrane protein, highly similar to Z70722|MLCB1770_7 from Mycobacterium leprae (303 aa), FASTA scores: opt: 812, E(): 1.6e-25, (54.2% identity in 319 aa overlap). C-terminal part highly similar to C-terminus of CAB92992.1|AL357152 putative integral membrane protein from Streptomyces coelicolor (185 aa); and N-terminal part highly similar to C-terminus of NP_302684.1|NC_002677 hypothetical protein from Mycobacterium leprae (123 aa). Protein product from Mb0007 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0007 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XW78" /db_xref="InterPro:IPR021949" /db_xref="UniProtKB/TrEMBL:A0A1R3XW78" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98347.1" /translation="MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPP PWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRT PQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAG SSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMI TVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNVVLMT ALATIGAFVYNLITDLIGGIEVTLADRD" tRNA 10887..10960 /locus_tag="BQ2027_ILET" /product="tRNA-Ile" /note="ileT, len: 74 nt. Equivalent to ileT, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Ile, anticodon gat." tRNA 11112..11184 /locus_tag="BQ2027_ALAT" /product="tRNA-Ala" /note="alaT, len: 73 nt. Equivalent to alaT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Ala, anticodon tgc." CDS complement(11874..12311) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0008C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0008c, -, len: 145 aa. Equivalent to Rv0008c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Possible membrane protein. Protein product from Mb0008c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0008c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59977" /db_xref="InterPro:IPR024245" /db_xref="UniProtKB/Swiss-Prot:P59977" /protein_id="SIT98350.1" /translation="MSEQVETRLTPRERLTRGLAYSAVGPVDVTRGLLELGVGLGLQS ARSTAAGLRRRYREGRLAREVAAAQETLAQELTAAQDVVANLPQALQDARTQRRSKHH LWIFAGIAAAILAGGAVAFSIVRRSSRPEPSPRPPSVEVQPRP" CDS 12468..13016 /codon_start=1 /transl_table=11 /gene="ppiA" /locus_tag="BQ2027_MB0009" /standard_name="cfp22" /product="PROBABLE IRON-REGULATED PEPTIDYL-PROLYL CIS-TRANS ISOMERASE A PPIA (PPIase A) (ROTAMASE A)" /note="Mb0009, ppiA, len: 182 aa. Equivalent to Rv0009, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Probable ppiA (alternate gene name: cfp22), iron-regulated peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8), equivalent to NP_301138.1|NC_002677 putative peptidyl-prolyl cis-trans isomerase from Mycobacterium leprae (182 aa), FASTA score: (90.1% identity in 182 aa overlap). Also highly similar to others e.g. T36725 from Streptomyces coelicolor (177 aa); T43805 from Halobacterium salinarum (180 aa); NP_219383.1|NC_000919 from Treponema pallidum (215 aa); etc. BELONGS TO THE CYCLOPHILIN-TYPE PPIASE FAMILY. Alternative start codon has been suggested. Protein product from Mb0009 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0009 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65763" /db_xref="InterPro:IPR002130" /db_xref="InterPro:IPR020892" /db_xref="InterPro:IPR024936" /db_xref="InterPro:IPR029000" /db_xref="UniProtKB/Swiss-Prot:P65763" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98352.1" /translation="MADCDSVTNSPLATATATLHTNRGDIKIALFGNHAPKTVANFVG LAQGTKDYSTQNASGGPSGPFYDGAVFHRVIQGFMIQGGDPTGTGRGGPGYKFADEFH PELQFDKPYLLAMANAGPGTNGSQFFITVGKTPHLNRRHTIFGEVIDAESQRVVEAIS KTATDGNDRPTDPVVIESITIS" CDS complement(13222..13557) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0010C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0010c, -, len: 111 aa. Equivalent to Rv0010c, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Probable conserved membrane protein, equivalent to NP_301139.1|NC_002677 putative membrane protein from Mycobacterium leprae (137 aa); and similar to Rv1417|P71686|YE17_MYCTU HYPOTHETICAL 16.4 KD PROTEIN from Mycobacterium tuberculosis (154 aa), FASTA scores: opt: 121, E(): 0.097, (29.6% identity in 81 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (g-*) leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (111 aa versus 141 aa). Mb0010c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XUN1" /db_xref="InterPro:IPR019692" /db_xref="UniProtKB/TrEMBL:A0A1R3XUN1" /protein_id="SIT98354.1" /translation="MQQTAWAPRTSGIAGCGAGGVVMAIASVTLVTDTPGRVLTGVAA LGLILFASATWRARPRLAITPDGLAIRGWFRTQLLRHSNIKIIRIDEFRRYGRLVRLL EIETVSGGC" CDS complement(13713..13994) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0011C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0011c, -, len: 93 aa. Equivalent to Rv0011c, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Probable conserved transmembrane protein, equivalent to NP_301140.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (93 aa); and similar to AL079308|SCH69_24 hypothetical protein from Streptomyces coelicolor (84 aa), FASTA scores: opt: 135, E(): 0.0068, (32.6% identity in 92 aa overlap). Protein product from Mb0011c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0011c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67377" /db_xref="InterPro:IPR009619" /db_xref="UniProtKB/Swiss-Prot:P67377" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98356.1" /translation="MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIG LIWLMVFQLAAIGSQAPTALNWMAQLGPWNYAIAFAFMITGLLLTMRWH" CDS 14088..14876 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0012" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0012, -, len: 262 aa. Equivalent to Rv0012, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv (99.2% identity in 262 aa overlap). Probable conserved membrane protein, similar to AL079308|SCH69_23|T36722 hypothetical protein from Streptomyces coelicolor (237 aa), FASTA scores: opt: 506, E(): 1.9e-25, (39.8% identity in 236 aa overlap). Some similarity to BLU0|1958_35A2 DIVIB (fragment) (188 aa), FASTA scores: opt: 204, E(): 8.9e-07, (35.6% identity in 90 aa overlap); and G1129091|DDS cell division and sporulation protein from Bacillus subtilis (231 aa), FASTA scores: opt: 180, E(): 3.8e-05, (30.7% identity in 101 aa overlap). Also similar to Rv1823|MTCY1A11_20 from Mycobacterium tuberculosis FASTA score: (30.1% identity in 246 aa overlap); and MTCY1A11_18 FASTA score: (25.5% identity in 235 aa overlap). Contains probable N-terminal signal sequence. Protein product from Mb0012 detected using SWATH mass spectrometry. Mb0012 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010273" /db_xref="UniProtKB/TrEMBL:A0A1R3XU20" /protein_id="SIT98357.1" /translation="MRLTHPTPCPENGETMIDRRRSAWRFSVPLVCLLAGLLLAATHG VSGGTEIRRSDAPRLVDLVRRAQASVNRLATEREALTTRIDSVHGRSVDTALAAMQRR SAELAGVAAMNPVHGPGLVVTLQDAQRDANGRFPRDASPDDLVVHQQDIEAVLNALWN AGAEAIQMQDQRIIAMSIARCVGNTLLLNGRTYSPPYTIAAIGDAAAMQAALAAAPLV TLYKQYVVRFGLGYREEVHPDLQIVGYADPVRMHFAQPAGPLDY" CDS 14913..15611 /codon_start=1 /transl_table=11 /gene="trpG" /locus_tag="BQ2027_MB0013" /product="POSSIBLE ANTHRANILATE SYNTHASE COMPONENT II TRPG (GLUTAMINE AMIDOTRANSFERASE)" /note="Mb0013, trpG, len: 232 aa. Equivalent to Rv0013, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 232 aa overlap). Possible trpG, anthranilate synthase component II (glutamine amidotransferase) (EC 4.1.3.27), equivalent to NP_301141.1|NC_002677 putative p-aminobenzoate synthase glutamine amidotransferase from Mycobacterium leprae (232 aa). Also highly similar to others e.g. P26922|TRPG_AZOBR Anthranilate synthase component II from Azospirillum brasilense (196 aa), FASTA scores: opt: 703, E(): 8.6e-40, (56.7% identity in 187 aa overlap); T36720 probable glutamine amidotransferase from Streptomyces coelicolor (212 aa); T44524 anthranilate synthase from Nitrosomonas europaea (199 aa); etc. Also similar to E235740 para-aminobenzoate synthase (232 aa), FASTA scores: opt: 1273, E(): 0, (79.7% identity in 232 aa overlap). Contains PS00606 Beta-ketoacyl synthases active site; and PS00442 Glutamine amidotransferases class-I active site. SIMILARITY TO OTHER TYPE-1 GLUTAMINE AMIDOTRANSFERASE DOMAINS. Note that previously known as pabA. Protein product from Mb0013 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0013 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU22" /db_xref="InterPro:IPR006221" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/TrEMBL:A0A1R3XU22" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98358.1" /translation="MRILVVDNYDSFVFNLVQYLGQLGIEAEVWRNDDHRLSDEAAVA GQFDGVLLSPGPGTPERAGASVSMVHACAAAHTPLLGVCLGHQAIGVAFGATVDRAPE LLHGKTSSVFHTNVGVLQGLPDPFTATRYHSLTILPKSLPAVLRVTARTSSGVIMAVQ HTGLPIHGVQFHPESILTEGGHRILANWLTCCGWTQDDTLVRRLENEVLTAISPHFPT STASAGEATGRTSA" CDS complement(15589..17469) /codon_start=1 /transl_table=11 /gene="pknB" /locus_tag="BQ2027_MB0014C" /product="TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE B PKNB (PROTEIN KINASE B) (STPK B)" /note="Mb0014c, pknB, len: 626 aa. Equivalent to Rv0014c, len: 626 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 626 aa overlap). pknB, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), equivalent to MLCB1770_9|T10009 probable serine/threonine-specific protein kinase from Mycobacterium leprae (622 aa), FASTA scores: opt: 3600, E(): 0, (86.4% identity in 626 aa overlap). Also similar (highly similar in N-terminus) to others e.g. T36717 from Streptomyces coelicolor (673 aa); NP_389459.1|NC_000964 from Bacillus subtilis (648 aa); NP_465345.1|NC_003210 from Listeria monocytogenes (655 aa); E235741 protein kinase pknB (315 aa), FASTA scores: opt: 1839, E(): 0, (90.8 identity in 305 aa overlap); etc. Contains PS00107 Protein kinases ATP-binding region signature, and PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation on serine/threonine residues. Protein product from Mb0014c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0014c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5S5" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR005543" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR017441" /db_xref="UniProtKB/Swiss-Prot:P0A5S5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98359.1" /translation="MTTPSHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRA DLARDPSFYLRFRREAQNAAALNHPAIVAVYDTGEAETPAGPLPYIVMEYVDGVTLRD IVHTEGPMTPKRAIEVIADACQALNFSHQNGIIHRDVKPANIMISATNAVKVMDFGIA RAIADSGNSVTQTAAVIGTAQYLSPEQARGDSVDARSDVYSLGCVLYEVLTGEPPFTG DSPVSVAYQHVREDPIPPSARHEGLSADLDAVVLKALAKNPENRYQTAAEMRADLVRV HNGEPPEAPKVLTDAERTSLLSSAAGNLSGPRTDPLPRQDLDDTDRDRSIGSVGRWVA VVAVLAVLTVVVTIAINTFGGITRDVQVPDVRGQSSADAIATLQNRGFKIRTLQKPDS TIPPDHVIGTDPAANTSVSAGDEITVNVSTGPEQREIPDVSTLTYAEAVKKLTAAGFG RFKQANSPSTPELVGKVIGTNPPANQTSAITNVVIIIVGSGPATKDIPDVAGQTVDVA QKNLNVYGFTKFSQASVDSPRPAGEVTGTNPPAGTTVPVDSVIELQVSKGNQFVMPDL SGMFWVDAEPRLRALGWTGMLDKGADVDAGGSQHNRVVYQNPPAGTGVNRDGIITLRF GQ" CDS complement(17466..18761) /codon_start=1 /transl_table=11 /gene="pknA" /locus_tag="BQ2027_MB0015C" /product="TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE A PKNA (PROTEIN KINASE A) (STPK A)" /note="Mb0015c, pknA, len: 431 aa. Equivalent to Rv0015c, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 431 aa overlap). pknA, transmembrane serine/threonine-protein kinase (EC 2.7.1.-), magnesium/manganese dependent (see citations below), equivalent to MLCB1770_10|NP_301143.1|NC_002677 putative serine/threonine protein kinase from Mycobacterium leprae (437 aa), FASTA scores: opt: 1883, E(): 0, (72.1% identity in 434 aa overlap). And also highly similar to other kinases from Mycobacterium leprae e.g. MLCB1770_10 from Mycobacterium leprae (437 aa). Also similar to PKNA_MYCLE protein kinase (253 aa), FASTA scores: opt: 1525, E(): 0, (95.0% identity in 242 aa overlap); etc. Also highly similar in part to others e.g. N-terminus of NP_243370.1|NC_002570 from Bacillus halodurans (664 aa); N-terminus of T36717 from Streptomyces coelicolor (673 aa); etc. Also similar to others from Mycobacterium tuberculosis: MTCY10H4_15, MTV021_9, MTCY28_5, MTCY4C12_28, MTCY50_16, MTCY8D9_8, MTCY49_28, MTCY4C12_30, MTCY28_9, etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. It has been shown that sodium orthovanadate inhibits the activity of the enzyme in vitro. Protein product from Mb0015c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0015c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65727" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/Swiss-Prot:P65727" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98360.1" /translation="MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVL KSEFSSDPEFIERFRAEARTTAMLNHPGIASVHDYGESQMNGEGRTAYLVMELVNGEP LNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILITPTGQVKITDF GIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDVYSLGVVGYEAVSGKRPFA GDGALTVAMKHIKEPPPPLPPDLPPNVRELIEITLVKNPAMRYRSGGPFADAVAAVRA GRRPPRPSQTPPPGRAAPAAIPSGTTARVAANSAGRTAASRRSRPATGGHRPPRRTFS SGQRALLWAAGVLGALAIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDA SPRLNWTERGETRHSGLQSWVVPPTPHSRASLARYEIAQ" CDS complement(18758..20233) /codon_start=1 /transl_table=11 /gene="pbpA" /locus_tag="BQ2027_MB0016C" /product="PROBABLE PENICILLIN-BINDING PROTEIN PBPA" /note="Mb0016c, pbpA, len: 491 aa. Equivalent to Rv0016c, len: 491 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 491 aa overlap). Probable pbpA, penicillin-binding protein, equivalent to NP_301144.1|NC_002677 putative penicillin-binding protein from Mycobacterium leprae (492 aa); and highly similar to MLCB1770_1 penicillin binding protein from Mycobacterium leprae (474 aa), FASTA scores: opt: 2516, E(): 0, (82.4% identity in 472 aa overlap). Also similar to others e.g. T36716 from Streptomyces coelicolor (490 aa); AAF61246.1|AF241575|PbpA from Streptomyces griseus (485 aa); NP_347146.1|NC_003030 from Clostridium acetobutylicum (482 aa); E235825|pbpA penicillin binding protein (325 aa), FASTA scores: opt: 1618, E(): 0, (78.3% identity in 323 aa overlap); etc. And also similar to MTCY270_5 and MTV003_8 from Mycobacterium tuberculosis. Protein product from Mb0016c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0016c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU21" /db_xref="InterPro:IPR001460" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XU21" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98361.1" /translation="MNASLRRISVTVMALIVLLLLNATMTQVFTADGLRADPRNQRVL LDEYSRQRGQITAGGQLLAYSVATDGRFRFLRVYPNPEVYAPVTGFYSLRYSSTALER AEDPILNGSDRRLFGRRLADFFTGRDPRGGNVDTTINPRIQQAGWDAMQQGCYGPCKG AVVALEPSTGKILALVSSPSYDPNLLASHNPEVQAQAWQRLGDNPASPLTNRAISETY PPGSTFKVITTAAALAAGATETEQLTAAPTIPLPGSTAQLENYGGAPCGDEPTVSLRE AFVKSCNTAFVQLGIRTGADALRSMARAFGLDSPPRPTPLQVAESTVGPIPDSAALGM TSIGQKDVALTPLANAEIAATIANGGITMRPYLVGSLKGPDLANISTTVGYQQRRAVS PQVAAKLTELMVGAEKVAQQKGAIPGVQIASKTGTAEHGTDPRHTPPHAWYIAFAPAQ APKVAVAVLVENGADRLSATGGALAAPIGRAVIEAALQGEP" CDS complement(20230..21639) /codon_start=1 /transl_table=11 /gene="rodA" /locus_tag="BQ2027_MB0017C" /product="PROBABLE CELL DIVISION PROTEIN RODA" /note="Mb0017c, rodA, len: 469 aa. Equivalent to Rv0017c, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 469 aa overlap). Probable rodA (alternate gene name: ftsW), cell division protein, integral membrane protein, equivalent to MLCB1770_12|T10012 probable cell division protein from Mycobacterium leprae (465 aa), FASTA scores: opt: 2475, E(): 0, (81.9% identity in 469 aa overlap). Also highly similar to others e.g. T36715|SCH69.16 from Streptomyces coelicolor (479 aa); NP_243432.1|NC_002570 from Bacillus halodurans (366 aa); NP_347145.1|NC_003030 from Clostridium acetobutylicum (400 aa); etc. Also similar to MTCY270_14 from Mycobacterium tuberculosis FASTA score: (32.2% identity in 369 aa overlap). BELONGS TO THE FTSW/RODA/SPOVE FAMILY. Protein product from Mb0017c detected using SWATH mass spectrometry. Mb0017c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63761" /db_xref="InterPro:IPR001182" /db_xref="InterPro:IPR018365" /db_xref="UniProtKB/Swiss-Prot:P63761" /protein_id="SIT98362.1" /translation="MTTRLQAPVAVTPPLPTRRNAELLLLCFAAVITFAALLVVQANQ DQGVPWDLTSYGLAFLTLFGSAHLAIRRFAPYTDPLLLPVVALLNGLGLVMIHRLDLV DNEIGEHRHPSANQQMLWTLVGVAAFALVVTFLKDHRQLARYGYICGLAGLVFLAVPA LLPAALSEQNGAKIWIRLPGFSIQPAEFSKILLLIFFSAVLVAKRGLFTSAGKHLLGM TLPRPRDLAPLLAAWVISVGVMVFEKDLGASLLLYTSFLVVVYLATQRFSWVVIGLTL FAAGTLVAYFIFEHVRLRVQTWLDPFADPDGTGYQIVQSLFSFATGGIFGTGLGNGQP DTVPAASTDFIIAAFGEELGLVGLTAILMLYTIVIIRGLRTAIATRDSFGKLLAAGLS STLAIQLFIVVGGVTRLIPLTGLTTPWMSYGGSSLLANYILLAILARISHGARRPLRT RPRNKSPITAAGTEVIERV" CDS complement(21636..23180) /codon_start=1 /transl_table=11 /gene="pstp" /locus_tag="BQ2027_MB0018C" /product="phosphoserine/threonine phosphatase pstp" /note="Mb0018c, ppp, len: 514 aa. Equivalent to Rv0018c, len: 514 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 514 aa overlap). Possible ppp, serine/threonine phosphatase (EC 3.1.3.16), equivalent to MLCB1770_13|T10013 PUTATIVE PHOSPHOPROTEIN PHOSPHATASE from Mycobacterium leprae (509 aa), FASTA scores: opt: 2517, E(): 0. Also highly similar to others e.g. T36714 probable protein phosphatase from Streptomyces coelicolor (515 aa); CAA10712.1|AJ132604 pppL protein from Lactococcus lactis (258 aa); NP_248765.1|NC_002516 probable phosphoprotein phosphatase from Pseudomonas aeruginosa (242 aa); etc. Also similar to BSUB0009_46 YLOO PROTEIN from Bacillus subtilis (254 aa), FASTA score: (34.0% identity in 250 aa overlap). Protein product from Mb0018c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0018c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XV75" /db_xref="InterPro:IPR001932" /db_xref="InterPro:IPR015655" /db_xref="InterPro:IPR036457" /db_xref="UniProtKB/TrEMBL:A0A1R3XV75" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98363.1" /translation="MALVTLVLRYAARSDRGLVRANNEDSVYAGARLLALADGMGGHA AGEVASQLVIAALAHLDDDEPGGDLLAKLDAAVRAGNSAIAAQVEMEPDLEGMGTTLT AILFAGNRLGLVHIGDSRGYLLRDGELTQITKDDTFVQTLVDEGRITPEEAHSHPQRS LIMRALTGHEVEPTLTMREARAGDRYLLCSDGLSDPVSDETILEALQIPEVAESAHRL IELALRGGGPDNVTVVVADVVDYDYGQTQPILAGAVSGDDDQLTLPNTAAGRASAISQ RKEIVKRVPPQADTFSRPRWSGRRLAFVVALVTVLMTAGLLIGRAIIRSNYYVADYAG SVSIMRGIQGSLLGMSLHQPYLMGCLSPRNELSQISYGQSGGPLDCHLMKLEDLRPPE RAQVRAGLPAGTLDDAIGQLRELAANSLLPPCPAPRATSPPGRPAPPTTSETTEPNVT SSPAAPSPTTSASAPTGTTPAIPTSASPAAPASPPTPWPVTSSPTMAALPPPPPQPGI DCRAAA" CDS complement(23269..23736) /codon_start=1 /transl_table=11 /gene="fhaB" /locus_tag="BQ2027_MB0019C" /product="FHA-domain-containing protein" /note="Mb0019c, -, len: 155 aa. Equivalent to Rv0019c, len: 155 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 155 aa overlap). Conserved hypothetical protein, equivalent to MLCB1770_14|NP_301147.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (155 aa), FASTA scores: opt: 902, E(): 0, (91.0% identity in 155 aa overlap). Also highly similar to T36713|AL079308|SCH69_14 from Streptomyces coelicolor (172 aa), FASTA scores: opt: 389, E(): 6e-21, (46.2% identity in 171 aa overlap); and similar in C-terminus to others e.g. NP_342559.1|NC_002754 Conserved hypothetical protein from Sulfolobus solfataricus (209 aa); etc. C-terminus also highly similar to C-terminal part of AAF07901.1|AF173844_2|AF173844 putative signal transduction protein GarA from Mycobacterium smegmatis (158 aa). Also similar to Rv1827|MTCY 1A11.16c from Mycobacterium tuberculosis (162 aa), FASTA score: (41.2% identity in 85 aa overlap); MTMOAIS_3; MAU66560_1 and MLCB1788_15. Protein product from Mb0019c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0019c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU87" /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR032030" /db_xref="UniProtKB/TrEMBL:A0A1R3XU87" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98364.1" /translation="MQGLVLQLTRAGFLMLLWVFIWSVLRILKTDIYAPTGAVMMRRG LALRGTLLGARQRRHAARYLVVTEGALTGARITLSEQPVLIGRADDSTLVLTDDYAST RHARLSMRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGTPVRIGKTAIELRP" CDS complement(23860..25425) /codon_start=1 /transl_table=11 /gene="fhaA" /locus_tag="BQ2027_MB0020C" /product="FIG00824290: FHA domain protein" /note="Mb0020c, TB39.8, len: 521 aa. Equivalent to Rv0020c, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (97.9% identity in 527 aa overlap). TB39.8, conserved hypothetical protein, identified by proteomic study by the Statens Serum Institute, Denmark (spot TB39.8) (see citation below). Highly similar to NP_301148.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (488 aa); and Z70722|MLCB1770_15|T10015 hypothetical protein from Mycobacterium leprae (463 aa), FASTA scores: opt: 1213, E(): 2.2e-32, (72.3% identity in 506 aa overlap). Alternative start codon in position 24979 has been suggested. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 18 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (521 aa versus 527 aa). Protein product from Mb0020c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0020c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR022128" /db_xref="InterPro:IPR042287" /db_xref="UniProtKB/TrEMBL:A0A1R3XUP2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98365.1" /translation="MGSQKRLVQRVERKLEQTVGDAFARIFGGSIVPQEVEALLRREA ADRIQSLQGNRLLAPNEYIITLGVHDFEKLGADPELKSTGFARDLADYIQEQGWQTYG DVVVRFEQSSNLHTGQFRARGTVNPDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNS SYRGGQGQGRPDEYYDDRYARPQEDPRGGPDPQGGSDPRGGYPPETGGYPPQPGYPRP RHPGQGDYPEQIGYPDQGGYPEQRGYPDQRGHQDQGRGYPDQGQGGYPPPYEQRPPVS PGPAAGYGAPGYDQGYRQSGGCGPSPGGGQPGYGGYGEYGRGPARHEEGSYVPSGPPG PPEQRPAYPDQGGYDQGYQQGATTYGRQDYGGGADYTRYTESPQVPGYAPQGGGYAEP AGRDYDYGQSGAPDYGQPAPGGYSGYGQGGYGSAGTSVTLQLDDGSGRTYQLREGSNI IGRGQDAQFRLPDTGVSRRHLEIRWDGQVALLADLNSTNGTTVNNAPVQEWQLADGDV IRLGHSEIIVRMH" tRNA 25625..25707 /locus_tag="BQ2027_LEUT" /product="tRNA-Leu" /note="leuT, len: 83 nt. Equivalent to leuT, len: 83 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 nt overlap). tRNA-Leu, anticodon cag." CDS complement(25894..26862) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0021C" /product="putative oxidoreductase, nitronate monooxygenase family" /note="Mb0021c, -, len: 322 aa. Equivalent to Rv0021c, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 322 aa overlap). Conserved hypothetical protein, similar to various proteins e.g. NP_464341.1|NC_003210 protein similar to oxidoreductases from Listeria monocytogenes (309 aa); NP_357973.1|NC_003098 Enoyl-acyl carrier protein(ACP) reductase from Streptococcus pneumoniae (324 aa); 2NPD_NEUCR|G726338 2-nitropropane dioxygenase precursor from Neurospora crassa (378 aa), FASTA scores: opt: 383, E(): 1.1e-16, (32.2% identity in 348 aa overlap); etc. Also similar to AE001747_25 from Thermotoga maritima section 59 (314 aa), FASTA scores: opt: 442, E(): 1.5e-19, (30.5% identity in 325 aa overlap). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv3553 (355 aa), FASTA scores: E(): 6.8e-15, (35.3 identity in 235 aa overlap); and Rv1533 (375 aa), FASTA scores: E(): 4.7e-12, (34.4% identity in 262 aa overlap). Mb0021c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU19" /db_xref="InterPro:IPR004136" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3XU19" /protein_id="SIT98366.1" /translation="MVLSTAFSQMFGIDYPIVSAPMDLIAGGELAAAVSGAGGLGLIG GGYGDRDWLARQFDLAAGAPVGCGFITWSLARQPQLLDLALQYEPVAVMLSFGDPAVF ADAIKSAGTRLVCQIQNRTQAERALQVGADVLVAQGTEAGGHGHGPRSTLTLVPEIVD LVTARGTDIPVIAAGGIADGRGLAAALMLGAAGVLVGTRFYATVEALSTPQARDPLLA ATGDDMCRTTIYDQLRRYPWPQGHTMSVLSNALTDQFEDTELDILHREEAMARYWRAV PARDYSIANVTAGQAAGLVNAVLPAADVITGMAQQAARTLTAMRAV" CDS complement(27004..27423) /codon_start=1 /transl_table=11 /gene="whiB5" /locus_tag="BQ2027_MB0022C" /product="probable transcriptional regulatory protein whib-like whib5" /note="Mb0022c, whiB5, len: 139 aa. Equivalent to Rv0022c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Probable whiB5, WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Shows some similarity to O88103|AJ239086|SCO239086_1|WHID|SC6G4.45c|WBLB WHID PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 125, E(): 0.055, (37.1% identity in 97 aa overlap); and slight similarity to G466960|WHIB WHIB PROTEIN (102 aa), FASTA scores: opt: 112, E(): 0.14, (34.3 identity in 67 aa overlap)." /db_xref="GOA:A0A1R3XU30" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3XU30" /protein_id="SIT98368.1" /translation="MAHPCATDPELWFGYPDDDGSDGAAKARAYERSATQARIQCLRR CPLLQQRRCAQHAVEHRVEYGVWAGIKLPGGQYRKREQLAAAHDVLRRIAGGEINSRQ LPDNAALLARNEGLEVTPVPGVVVHLPIAQVGPQPAA" CDS 27576..28346 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0023" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0023, -, len: 256 aa. Equivalent to Rv0023, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Possible transcriptional regulator, equivalent to CAB96432.1|AJ251434 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (146 aa). N-terminus showing similarity with other transcriptional regulators e.g. AE0002|ECAE000240_9 from Escherichia coli strain K12 (178 aa), FASTA scores: opt: 149, E(): 0.0048, (33.3% identity in 84 aa overlap); etc. Contains probable helix-turn helix motif from aa 19 to 40 (Score 1615, +4.69 SD). Protein product from Mb0023 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0023 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67705" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010982" /db_xref="UniProtKB/Swiss-Prot:P67705" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98371.1" /translation="MSRESAGAAIRALRESRDWSLADLAAATGVSTMGLSYLERGARK PHKSTVQKVENGLGLPPGTYSRLLVAADPDAELARLIAAQPSNPTAVRRAGAVVVDRH SDTDVLEGYAEAQLDAIKSVIDRLPATTSNEYETYILSVIAQCVKAEMLAASSWRVAV NAGADSTGRLMEHLRALEATRGALLERMPTSLSARFDRACAQSSLPEAVVAALIGVGA DEMWDIRNRGVIPAGALPRVRAFVDAIEASHDADEGQQ" CDS 28343..29176 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0024" /product="PUTATIVE SECRETED PROTEIN P60-RELATED PROTEIN [FIRST PART]" /note="Mb0024, -, len: 277 aa. Equivalent to 5' end of Rv0024, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 192 aa overlap). Putative secreted protein, p60 homologue, similar in part to others and relatives proteins e.g. P60_LISIV|Q01837 protein p60 precursor (invasion-associated protein) from Listeria ivanovii (524 aa), FASTA scores: opt: 245, E(): 1.5e-08, (37.0% identity in 100 aa overlap); CAB92656.1|AL356832 putative NPL/P60 family secreted protein from Streptomyces coelicolor (347 aa); etc. Similar to Mycobacterium tuberculosis proteins Rv1477, Rv1478, Rv1566c, Rv2190c. And several homologues in Streptomyces coelicolor e.g. AL049497|SC6G10_8|T35517 probable secreted protein (338 aa), FASTA scores: opt: 399, E(): 9.8e-18, (34.9% identity in 292 aa overlap). COULD BELONG TO THE E. COLI NLPC / LISTERIA P60 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0024 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv0024 into 2 parts, Mb0024 and Mb0025. Mb0024 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XU42" /protein_id="SIT98373.1" /translation="MNYSEVELLSRAHQLFAGDSRRPGLDAGTTPYGDLLSRAADLNV GAGQRRYQLAVDHSRAALLSAARTDAAAGAVITGAQRDRAWARRSTGTVLDEARSDTT VTAVMPIAQREAIRRRVARLRAQRAHVLTARRRARRHLAALRALRYRVAHGPGVALAK LRLPSPSGRAGIAVHAALSRLGRPYVWGATGPTSSTVPVWSSGPTPRRVFTWIAPPIN RSTRGSRCRAHRSGRAIWSSRTPGTCSWRSATIWSSRRPMRARRFGSARWATTCRFGD R" CDS 28918..29187 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0025" /note="Mb0025, -, len: 89 aa. Equivalent to 3' end of Rv0024, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Putative secreted protein, p60 homologue, similar in part to others and relatives proteins e.g. P60_LISIV|Q01837 protein p60 precursor (invasion-associated protein) from Listeria ivanovii (524 aa), FASTA scores: opt: 245, E(): 1.5e-08,(37.0% identity in 100 aa overlap); CAB92656.1|AL356832 putative NPL/P60 family secreted protein from Streptomyces coelicolor (347 aa); etc. Similar to Mycobacterium tuberculosis proteins Rv1477, Rv1478, Rv1566c, Rv2190c. And several homologues in Streptomyces coelicolor e.g. AL049497|SC6G10_8|T35517 probable secreted protein (338 aa), FASTA scores: opt: 399, E(): 9.8e-18, (34.9% identity in 292 aa overlap). COULD BELONG TO THE E. COLI NLPC / LISTERIA P60 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0024 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv0024 into 2 parts,Mb0024 and Mb0025.;PUTATIVE SECRETED PROTEIN P60-RELATED PROTEIN [SECOND PART]" CDS 29225..29587 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0026" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0026, -, len: 120 aa. Equivalent to Rv0025, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 120 aa overlap). Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. Rv0739 (268 aa), FASTA score: (37.6% identity in 101 aa overlap), and Rv0026 FASTA score: (35.4% identity in 113 aa overlap); etc. Protein product from Mb0026 detected using SWATH mass spectrometry. Mb0026 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019710" /db_xref="UniProtKB/TrEMBL:A0A1R3XU54" /protein_id="SIT98375.1" /translation="MSEQAGSSVAVIQERQALLARQHDAVAEADRELADVLASAHAAM RESVRRLDAIAAELDRAVPDQDQLAVDTLMGAREFQTFLVAKQREIVAVVAAAHELDR AKSAVLKRLRAQYTEPAR" CDS 29702..31135 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0027" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0027, -, len: 477 aa. Equivalent to 5' end of Rv0026, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (94.4% identity in 449 aa overlap). Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis: Rv0025 FASTA score: (35.4% identity in 113 aa overlap) and Rv0739 (268 aa), FASTA score: (32.4% identity in 142 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 4 bp insertion (*-atcg) leads to a longer protein with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (477 aa versus 448 aa). Mb0027 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019710" /db_xref="UniProtKB/TrEMBL:A0A1R3XU28" /protein_id="SIT98377.1" /translation="MAFDAAMSTHEDLLATIRYVRDRTGDPNAWQTGLTPTEVTAVVT STTRSEQLDAILRKIRQRHSNLYYPAPPDREQGDAARAIADAEAALAHPNSATAQLDL QVVSAILNAHLKTVEGGESLHELQQEIEAAVRIRSDLDTPAGARDFQRFLIGKLKDIR EVVATASLDAASKSALMAAWTSLYDASKGDRGDADDRGPASVGSGGAPARGAGQQPEL PTRAEPDCLLDSLLLEDPGLLADDLQVPGGTSAAIPSASSTPSLPNLGGATMPGGGAT PALVPGVSAPGGLPLSGLLRGVGDEPELTDFDERGQEVRDPADYEHSNEPDERRADDR EGADEDAGLGKSESPPQAPTTVTLPNGETVTAASPQLAAAIKAAASGTPIADAFQQQG IAIPLPGTAVANPVDPARISAGDVGVFTDRHALALGPSKALLDGQIQHISAVRGRNFL GWIHPAATATAPARTEAPTPTRPAAAR" CDS 31173..31490 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0028" /product="conserved hypothetical protein" /note="Mb0028, -, len: 105 aa. Equivalent to Rv0027, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Hypothetical unknown protein. Protein product from Mb0028 detected using SWATH mass spectrometry. Mb0028 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64668" /db_xref="InterPro:IPR022536" /db_xref="UniProtKB/Swiss-Prot:P64668" /protein_id="SIT98378.1" /translation="MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLG PIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIID GSR" CDS 31498..31803 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0029" /product="conserved hypothetical protein" /note="Mb0029, -, len: 101 aa. Equivalent to Rv0028, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Hypothetical unknown protein. Protein product from Mb0029 detected using SWATH mass spectrometry. Mb0029 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024426" /db_xref="UniProtKB/Swiss-Prot:P64670" /protein_id="SIT98380.1" /translation="MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETL AEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR" CDS 32041..33138 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0030" /product="Vegetative cell wall protein gp1 precursor" /note="Mb0030, -, len: 365 aa. Equivalent to Rv0029, len: 365 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 365 aa overlap). Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. C-terminal region of Rv2082|MTCY49_21|E247006 hypothetical 73.6 kDa protein (721 aa), FASTA scores: opt: 453, E(): 1.2e-22, (38.5% identity in 265 aa overlap); Rv3899c|MTY15F10_12 HYPOTHETICAL 40.8 KD PROTEIN (410 aa), FASTA score: (33.7% identity in 252 aa overlap); etc. Protein product from Mb0030 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0030 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR040604" /db_xref="InterPro:IPR040833" /db_xref="UniProtKB/TrEMBL:A0A1R3XU96" /protein_id="SIT98382.1" /translation="MAIFGRWSARQRLRRATRESLTIPTFSSSLDCTTRVIGGLWPAE LSSNTAETATLAEHLKADLHRIVGSANDELMVIWRAGMADSTRRAEEDRVIDRARASA MRRVESAMRELRQITGRVPVEIPRMRGAGGSDLDTTRLMPAVTVVQPADQACTDWPVA AAEDDEARLQRLLAFVARQEPRLNWAVGVNADGTTVLVTDVAHGWIPPGIALPEGVRL LAPARRAGRAPELVGITTCCKTYTPGDSLRRAVDSTAPTSSVQPRALPAIAGLSVELG IATQRHDGLPKIVHAMATAAGNGAAAEEVDLLRVHVDTALHHVLAQYPRVDPALLLNC MLLAATERSVTGDPIAANYHFAWFRELDSRR" CDS 33208..33537 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0031" /product="conserved hypothetical protein" /note="Mb0031, -, len: 109 aa. Equivalent to Rv0030, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Hypothetical unknown protein. Mb0031 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR024296" /db_xref="UniProtKB/Swiss-Prot:P64672" /protein_id="SIT98384.1" /translation="MVSGSDSRSEPSQLSDRDLVESVLRDLSEAADKWEALVTQAETV TYSVDLGDVRAVANSDGRLLELTLHPGVMTGYAHGELADRVNLAITALRDEVEAENRA RYGGRLQ" CDS 33566..33778 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0032" /product="possible remnant of a transposase" /note="Mb0032, -, len: 70 aa. Equivalent to Rv0031, len: 70 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 70 aa overlap). Possible remnant of a transposase, showing partial similarity to mycobacterial transposases in a short overlap, e.g. Rv2791c|MTV002_57 (459 aa), FASTA score: (72.2% identity in 36 aa overlap); Rv2885c, Rv2978c, Rv3827c, etc. Mb0032 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XU29" /protein_id="SIT98386.1" /translation="MLARHFGAGRKAHSRAVATLKADIQAWHPAGIQTPKPRCESDVF ARIGHTSHPSTRKSRVGPGASEAPLA" CDS 34279..36594 /codon_start=1 /transl_table=11 /gene="bioF2" /locus_tag="BQ2027_MB0033" /product="POSSIBLE 8-AMINO-7-OXONONANOATE SYNTHASE BIOF2 (AONS) (8-AMINO-7-KETOPELARGONATE SYNTHASE) (7-KETO-8-AMINO-PELARGONIC ACID SYNTHETASE) (7-KAP SYNTHETASE) (L-ALANINE--PIMELYL CoA LIGASE)" /note="Mb0033, bioF2, len: 771 aa. Equivalent to Rv0032, len: 771 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 771 aa overlap). Probable bioF2, 8-amino-7-oxononanoate synthase (EC 2.3.1.47), with its C-terminal similar to others e.g. BIOF_BACSU|P53556 8-amino-7-oxononanoate synthase from Bacillus subtilis (389 aa), FASTA scores: opt: 775, E(): 0, (37.9% identity in 346 aa overlap); P22806|BIOF_BACSH from Bacillus sphaericus (389 aa); etc. Also similar to BIOF1|Rv1569|MTCY336_35 from Mycobacterium tuberculosis (386 aa), AF041819_4 from Mycobacterium bovis, and BIOF_MYCLE|P45487 from Mycobacterium leprae (385 aa). Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site. BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES." /db_xref="GOA:A0A1R3XU34" /db_xref="InterPro:IPR001917" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR038740" /db_xref="UniProtKB/TrEMBL:A0A1R3XU34" /protein_id="SIT98388.1" /translation="MPTGLGYDFLRPVEDSGINDLKHYYFMADLADGQPLGRANLYSV CFDLATTDRKLTPAWRTTIKRWFPGFMTFRFLECGLLTMVSNPLALRSDTDLERVLPV LAGQMDQLAHDDGSDFLMIRDVDPEHYQRYLDILRPLGFRPALGFSRVDTTISWSSVE EALGCLSHKRRLPLKTSLEFRERFGIEVEELDEYAEHAPVLARLWRNVKTEAKDYQRE DLNPEFFAACSRHLHGRSRLWLFRYQGTPIAFFLNVWGADENYILLEWGIDRDFEHYR KANLYRAALMLSLKDAISRDKRRMEMGITNYFTKLRIPGARVIPTIYFLRHSTDPVHT ATLARMMMHNIQRPTLPDDMSEEFCRWEERIRLDQDGLPEHDIFRKIDRQHKYTGLKL GGVYGFYPRFTGPQRSTVKAAELGEIVLLGTNSYLGLATHPEVVEASAEATRRYGTGC SGSPLLNGTLDLHVSLEQELACFLGKPAAVLCSTGYQSNLAAISALCESGDMIIQDAL NHRSLFDAARLSGADFTLYRHNDMDHLARVLRRTEGRRRIIVVDAVFSMEGTVADLAT IAELADRHGCRVYVDESHALGVLGPDGRGASAALGVLARMDVVMGTFSKSFASVGGFI AGDRPVVDYIRHNGSGHVFSASLPPAAAAATHAALRVSRREPDRRARVLAAAEYMATG LARQGYQAEYHGTAIVPVILGNPTVAHAGYLRLMRSGVYVNPVAPPAVPEERSGFRTS YLADHRQSDLDRALHVFAGLAEDLTPQGAAL" CDS 36591..36854 /codon_start=1 /transl_table=11 /gene="acpA" /locus_tag="BQ2027_MB0034" /standard_name="acpP" /product="PROBABLE ACYL CARRIER PROTEIN ACPA (ACP)" /note="Mb0034, acpA, len: 87 aa. Equivalent to Rv0033, len: 87 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 aa overlap). Probable acpA (alternate gene name: acpP), acyl carrier protein, similar to others e.g. ACP_BACSU|P80643 acyl carrier protein (acp) from Bacillus subtilis (77 aa), FASTA scores: opt: 149, E(): 0.00026, (41.4% identity in 70 aa overlap); NP_224500.1|NC_000922 Acyl Carrier Protein from Chlamydophila pneumoniae (79 aa); NP_228471.1|NC_000853 acyl carrier protein from Thermotoga maritima (81 aa); etc. Also similar to proteins of Mycobacterium tuberculosis Rv1344 and Rv2244 (31.5% identity in 73 aa overlap)." /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/TrEMBL:A0A1R3XU46" /protein_id="SIT98390.1" /translation="MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELE DEFDIAISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA" CDS 36851..37246 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0035" /product="Ketosteroid isomerase-related protein" /note="Mb0035, -, len: 131 aa. Equivalent to Rv0034, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, showing weak similarity to AE001980|AE001980_7 hypothetical protein from Deinococcus radiodurans (120 aa), FASTA scores: opt: 141, E(): 0.0028, (29.3% identity in 123 aa overlap). Protein product from Mb0035 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/Swiss-Prot:P64674" /protein_id="SIT98392.1" /translation="MTDDADLDLVRRTFAAFARGDLAELTQCFAPDVEQFVPGKHALA GVFRGVDNVVACLGDTAAAADGTMTVTLEDVLSNTDGQVIAVYRLRASRAGKVLDQRE AILVTVAGGRITRLSEFYADPAATESFWA" CDS 37243..38931 /codon_start=1 /transl_table=11 /gene="fadD34" /locus_tag="BQ2027_MB0036" /product="PROBABLE FATTY-ACID-COA LIGASE FADD34 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0036, fadD34, len: 562 aa. Equivalent to Rv0035, len: 562 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 562 aa overlap). Probable fadD34, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. MBU75685_1 acyl-CoA synthase from Mycobacterium bovis (582 aa), FASTA scores: opt: 408, E(): 8.2e-20; etc. Also similar to G1171128 SAFRAMYCIN MX1 SYNTHETASE B (1770 aa), FASTA scores: opt: 445, E(): 1.3e-21, (28.1% identity in 573 aa overlap). Also similar to other proteins from Mycobacterium tuberculosis e.g. MTCY02B10.09, FASTA score: (32.3% identity in 468 aa overlap), MTCY349_40, MTCY4D9_17, MTCY338_18, MTV045_3, MTCY409_4, MTCI237_30, MTCY24G1_8, MASC_MYCLE MASC PROTEIN, U00010_6, MTV005_21, MTCY19G5_7, MTCY9F9_39, etc. Protein product from Mb0036 detected using SWATH mass spectrometry. Mb0036 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU59" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XU59" /protein_id="SIT98393.1" /translation="MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAA CIPPLRRLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFMTR LGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRATAQQLADTAT ADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRAMAVGSQFQHGDVMGSWLP LHHDMGLVGSLFAALFNSVSAVFTTPHRFLYDPLGFLRLLTSSGATHTFMPNFALEWL INAYHRRGADIEGIDLHKMRRLIIASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLA EATVAVSMSAPNTGFRTETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVA AKAYVGGKKLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGLQLDELITV RRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSAPKTTRSSLEGAH" CDS complement(39040..39813) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0037C" /product="conserved protein" /note="Mb0037c, -, len: 257 aa. Equivalent to Rv0036c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 257 aa overlap). Conserved hypothetical protein, highly similar to CAB95889.1|AL359988 conserved hypothetical protein from Streptomyces (276 aa). Also some similarity to Rv3099c|MTCY164_10 (283 aa), FASTA scores: E(): 3.3e-05, (25.9 % identity in 205 aa overlap). Protein product from Mb0037c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0037c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64676" /db_xref="InterPro:IPR013917" /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR017518" /db_xref="InterPro:IPR024344" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/Swiss-Prot:P64676" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98395.1" /translation="MADPGPFVADLRAESDDLDALVAHLPADRWADPTPAPGWTIAHQ IGHLLWTDRVALTAVTDEAGFAELMTAAAANPAGFVDDAATELAAVSPAELLTDWRVT RGRLHEELLAVPDGRKLAWFGPPMSAASMATARLMETWAHGLDVADALGVIRPATQRL RSIAHLGVRTRDYAFIVNNLTPPAEPFLVELRGPSGDTWSWGPSDAAQRVTGSAEDFC FLVTQRRALSTLDVNAVGEDAQRWLTIAQAFAGPPGRGR" CDS complement(39861..41186) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0038C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0038c, -, len: 441 aa. Equivalent to Rv0037c, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 441 aa overlap). Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of macrolide, showing some similarity to Rv1258c|MTCY50_24 (419 aa), FASTA score: (25.2% identity in 408 aa overlap); and to AL049826|SCH24_20 from Streptomyces coelicolor (425 aa), FASTA scores: opt: 725, E(): 0, (36.1% identity in 418 aa overlap). Also similarity with several MACROLIDE-EFFLUX PROTEINS e.g. from S. pyogenes (405 aa), FASTA scores: E(): 1.3e-06, (22.8% identity in 416 aa overlap). Protein product from Mb0038c detected using SWATH mass spectrometry." /db_xref="GOA:P0A5C2" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/Swiss-Prot:P0A5C2" /protein_id="SIT98398.1" /translation="MPRVEVGLVIHSRMHARAPVDVWRSVRSLPDFWRLLQVRVASQF GDGLFQAGLAGALLFNPDRAADPMAIAGAFAVLFLPYSLLGPFAGALMDRWDRRWVLV GANTGRLALIAGVGTILAVGAGDVPLLVGALVANGLARFVASGLSAALPHVVPREQVV TMNSVAIASGAVSAFLGANFMLLPRWLLGSGDEGASAIVFLVAIPVSIALLWSLRFGP RVLGPDDTERAIHGSAVYAVVTGWLHGARTVVQLPTVAAGLSGLAAHRMVVGINSLLI LLLVRHVTARAVGGLGTALLFFAATGLGAFLANVLTPTAIRRWGRYATANGALAAAAT IQVAAAGLLVPVMVVCGFLLGVAGQVVKLCADSAMQMDVDDALRGHVFAVQDALFWVS YILSITVAAALIPEHGHAPVFVLFGSAIYLAGLVVHTIVGRRGQPVIGR" CDS 41288..41896 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0039" /product="UPF0301 protein YqgE" /note="Mb0039, -, len: 202 aa. Equivalent to Rv0038, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Conserved hypothetical protein, equivalent to MLCB1770_16|Q50191|Y038_MYCLE hypothetical 22.0 kDa from Mycobacterium leprae (202 aa), FASTA scores: opt: 1194, E(): 0, (88.6% identity in 202 aa overlap). Also highly similar or similar to other hypothetical proteins e.g. CAB72194.1|AL138851|SCE59.07c from Streptomyces coelicolor (193 aa); AAC06288.1|AF050466 from Mycobacterium bovis (82 aa) (similarity in N-terminus); NP_224347.1|NC_000922|YqgE from Chlamydophila pneumoniae (188 aa); YQGE_ECOLI HYPOTHETICAL 20.7 KD PROTEIN (187 aa), FASTA score: (29.5% identity in 166 aa overlap); etc. Protein product from Mb0039 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0039 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003774" /db_xref="UniProtKB/Swiss-Prot:P67758" /protein_id="SIT98399.1" /translation="MVAPHEDPEDHVAPAAQRVRAGTLLLANTDLLEPTFRRSVIYIV EHNDGGTLGVVLNRPSETAVYNVLPQWAKLAAKPKTMFIGGPVKRDAALCLAVLRVGA DPEGVPGLRHVAGRLVMVDLDADPEVLAAAVEGVRIYAGYSGWTIGQLEGEIERDDWI VLSALPSDVLVGPRADLWGQVLRRQPLPLSLLATHPIDLSRN" CDS complement(41988..42335) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0040C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0040c, -, len: 115 aa. Equivalent to Rv0039c, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 115 aa overlap). Possible conserved transmembrane protein, highly similar to NP_301154.1|NC_002677|Z70722|MLCB1770_18 hypothetical protein from Mycobacterium leprae (113 aa), FASTA scores: opt: 492, E(): 7.8e-27, (64.9% identity in 114 aa overlap). Mb0040c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XUB4" /db_xref="UniProtKB/TrEMBL:A0A1R3XUB4" /protein_id="SIT98400.1" /translation="MFLAGVLCMCAAAASALFGSWSLFHTPTADPTALALRAMAPTQL AAAVMLAAGGVVAVAAPGHTALMVVIVCIAGAVGTLAAGSWQSAQYALRRETASPTAN CVGSCAVCTQACH" CDS complement(42417..43349) /codon_start=1 /transl_table=11 /gene="mtc28" /locus_tag="BQ2027_MB0041C" /product="SECRETED PROLINE RICH PROTEIN MTC28 (PROLINE RICH 28 KDA ANTIGEN)" /note="Mb0041c, mtc28, len: 310 aa. Equivalent to Rv0040c, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). mtc28, secreted proline rich 28 kDa antigen protein (has hydrophobic stretch at N-terminus) (see citation below). Highly similar to O33075|PR28_MYCLE|MT10019 Proline rich 28 kDa antigen from Mycobacterium leprae (278 aa), FASTA scores: opt: 1007, E(): 0, (65.0% identity in 257 aa overlap); and Q9CD47|LPQT_MYCLE|NP_301305.1|NC_002677 putative lipoprotein from Mycobacterium leprae (218 aa). C-terminal part very similar to lipoprotein Rv1016c from Mycobacterium tuberculosis. Mb0041c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019674" /db_xref="UniProtKB/Swiss-Prot:P0A5Q7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98401.1" /translation="MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAP VSAPATVPPVQNLTALPGGSSNRFSPAPAPAPIASPIPVGAPGSTAVPPLPPPVTPAI SGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAFVVIADRLGNS VYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTNASMANFDGFPSSIIEGTY RENDMTLNTSRRHVIATSGADKYLVSLSVTTALSQAVTDGPATDAIVNGFQVVAHAAP AQAPAPAPGSAPVGLPGQAPGYPPAGTLTPVPPR" CDS 43546..46455 /codon_start=1 /transl_table=11 /gene="leuS" /locus_tag="BQ2027_MB0042" /product="PROBABLE LEUCYL-tRNA SYNTHETASE LEUS (LEUCINE--tRNA LIGASE) (LEURS)" /note="Mb0042, leuS, len: 969 aa. Equivalent to Rv0041, len: 969 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 969 aa overlap). Probable leucyl-tRNA synthetase (EC 6.1.1.4), equivalent to NP_301156.1|NC_002677|MLCB628_3 leucyl-tRNA synthase from Mycobacterium leprae (972 aa), FASTA score: (83.6% identity in 972 aa overlap); and highly similar to MLCB1770_20 from Mycobacterium leprae (824 aa), FASTA score: (82.8% identity in 824 aa overlap). Also highly similar to others e.g. CAB66249.1|AL136518 leucyl-tRNA synthetase from Streptomyces coelicolor (966 aa); NP_244147.1|NC_002570 leucyl-tRNA synthetase from Bacillus halodurans (806 aa); SYL_BACSU|P36430 leucyl-tRNA synthetase from Bacillus subtilis (804 aa), FASTA scores: opt: 714, E(): 3.1e-38, (43.7% identity in 938 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb0042 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0042 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67511" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002302" /db_xref="InterPro:IPR009008" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR013155" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR015413" /db_xref="InterPro:IPR025709" /db_xref="UniProtKB/Swiss-Prot:P67511" /protein_id="SIT98402.1" /translation="MTESPTAGPGGVPRADDADSDVPRYRYTAELAARLERTWQENWA RLGTFNVPNPVGSLAPPDGAAVPDDKLFVQDMFPYPSGEGLHVGHPLGYIATDVYARY FRMVGRNVLHALGFDAFGLPAEQYAVQTGTHPRTRTEANVVNFRRQLGRLGFGHDSRR SFSTTDVDFYRWTQWIFLQIYNAWFDTTANKARPISELVAEFESGARCLDGGRDWAKL TAGERADVIDEYRLVYRADSLVNWCPGLGTVLANEEVTADGRSDRGNFPVFRKRLRQW MMRITAYADRLLDDLDVLDWPEQVKTMQRNWIGRSTGAVALFSARAASDDGFEVDIEV FTTRPDTLFGATYLVLAPEHDLVDELVAASWPAGVNPLWTYGGGTPGEAIAAYRRAIA AKSDLERQESREKTGVFLGSYAINPANGEPVPIFIADYVLAGYGTGAIMAVPGHDQRD WDFARAFGLPIVEVIAGGNISESAYTGDGILVNSDYLNGMSVPAAKRAIVDRLESAGR GRARIEFKLRDWLFARQRYWGEPFPIVYDSDGRPHALDEAALPVELPDVPDYSPVLFD PDDADSEPSPPLAKATEWVHVDLDLGDGLKPYSRDTNVMPQWAGSSWYELRYTDPHNS ERFCAKENEAYWMGPRPAEHGPDDPGGVDLYVGGAEHAVLHLLYSRFWHKVLYDLGHV SSREPYRRLVNQGYIQAYAYTDARGSYVPAEQVIERGDRFVYPGPDGEVEVFQEFGKI GKSLKNSVSPDEICDAYGADTLRVYEMSMGPLEASRPWATKDVVGAYRFLQRVWRLVV DEHTGETRVADGVELDIDTLRALHRTIVGVSEDFAALRNNTATAKLIEYTNHLTKKHR DAVPRAAVEPLVQMLAPLAPHIAEELWLRLGNTTSLAHGPFPKADAAYLVDETVEYPV QVNGKVRGRVVVAADTDEETLKAAVLTDEKVQAFLAGATPRKVIVVAGRLVNLVI" CDS complement(46565..47191) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0043C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY MARR-FAMILY)" /note="Mb0043c, -, len: 208 aa. Equivalent to Rv0042c, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 208 aa overlap). Possible transcriptional regulatory protein, MarR-family, highly similar except in N-terminus to CAC32228.1|AL583926 putative MarR-family regulatory protein from Mycobacterium leprae (243 aa). Also similar in part to others e.g. AB76343.1|AL158061 putative MarR-family transcriptional regulator from Streptomyces coelicolor (163 aa); NP_384406.1|NC_003047 PUTATIVE TRANSCRIPTION REGULATOR PROTEIN from Sinorhizobium meliloti (164 aa); NP_531782.1|NC_003304 transcriptional regulator, MarR family from Agrobacterium tumefaciens (151 aa); etc. Also some similarity to Mycobacterium tuberculosis proteins Rv2327, Rv0880, and Rv1404. Protein product from Mb0043c detected using shotgun mass spectrometry. Mb0043c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XU39" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XU39" /protein_id="SIT98403.1" /translation="MSVVRSIGKKMQRISGPNALAVKGRPTQVYGHTHVRLDCRFMAD SEFTAPEVTQLAEGLHRALSKLISMLRRGDPNGAAAGDLTLAQLSILVTLLDQGPIRM TDLAAHERVRTPTTTVAIRRLEKIGLVKRSRDPSDLRAVLVDITPQGRAVHGESLANR RAALAALLSQLPRSDLETLRKALAPLERLASGEPASGPASNSPARKRA" CDS complement(47350..48084) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0044C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY GNTR-FAMILY)" /note="Mb0044c, -, len: 244 aa. Equivalent to Rv0043c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Probable transcriptional regulator, GntR family, similar to others e.g. NP_420584.1|NC_002696 transcriptional regulator GntR family from Caulobacter crescentus (221 aa); NP_294539.1|NC_001263 transcriptional regulator GntR family from Deinococcus radiodurans (267 aa); YIN1_STRAM|P32425 hypothetical transcriptional regulatory protein from Streptomyces ambofaciens (236 aa), FASTA scores: opt: 170, E(): 9.8e-05, (27.6% identity in 127 aa overlap); etc. Similar also to SC9B10_7 from Streptomyces coelicolor FASTA score: E():0.00038; and Rv0165c|MTCI28_5 from Mycobacterium tuberculosis (264 aa), FASTA score: (27.7% identity in 130 aa overlap). Mb0044c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67738" /db_xref="InterPro:IPR000524" /db_xref="InterPro:IPR008920" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67738" /protein_id="SIT98404.1" /translation="MPKKYGVKEKDQVVAHILNLLLTGKLRSGDRVDRNEIAHGLGVS RVPIQEALVQLEHDGIVSTRYHRGAFIERFDVATILEHHELDGLLNGIASARAAANPT PRILGQLDAVMRSLRNSKESRAFAECVWEYRRTVNDEYAGPRLHATIRASQNLIPRVF WMTYQNSRDDVLPFYEEENAAIHRREPEAARAACIGRSELMAQTMLAELFRRRVLVPP EGACPGPFGAPIPGFARSYQPSSPVP" CDS complement(48217..49011) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0045C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0045c, -, len: 264 aa. Equivalent to Rv0044c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Possible oxidoreductase (EC 1.-.-.-), highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc. Protein product from Mb0045c detected using shotgun mass spectrometry. Mb0045c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XU55" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR022480" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3XU55" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98405.1" /translation="MTSLVRPDLPVRIGVQLQPQHAPHYRAVRDAVRRCEDIGVDIAF TWDHFFPLYGDPDGPHFECWTVLGAWAEQTSHIEIGALVTCNSYRNPELLADMARTVD HISGGRLILGIGSGWKQKDYDEYGYRFGTAGSRLDDLAAALPRIKARLGKLNPPPTRD IPVLIGGGGERKTLRLVAEYADIWHSFTAGDSYLAKSAVLSTHCSTVGRNPATIERSA AVDGGGLIASAEALAGLGVTLLTVGCDGPDYDLSAAAALCRWRDGR" CDS complement(49027..49923) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0046C" /product="POSSIBLE HYDROLASE" /note="Mb0046c, -, len: 298 aa. Equivalent to Rv0045c, len 298 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 298 aa overlap). Possible hydrolase (EC 3.-.-.-), showing similarity with others eg NP_107230.1|NC_002678 putative hydrolase from Mesorhizobium loti (278 aa); CAB56730.1|AL121600 putative hydrolase from Streptomyces coelicolor (302 aa); NP_438361.1|NC_000907 putative esterase/lipase from Haemophilus influenzae Rd (287 aa); etc. Also similar to Mycobacterium tuberculosis proteins Rv3473c, Rv1123c, Rv1938, Rv3617, Rv3670, etc. Protein product from Mb0046c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0046c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU63" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XU63" /protein_id="SIT98406.1" /translation="MLSDDELTGLDEFALLAENAEQAGVNGPLPEVERVQAGAISALR WGGSAPRVIFLHGGGQNAHTWDTVIVGLGEPALAVDLPGHGHSAWREDGNYSPQLNSE TLAPVLRELAPGAEFVVGMSLGGLTAIRLAAMAPDLVGELVLVDVTPSALQRHAELTA EQRGTVALMHGEREFPSFQAMLDLTIAAAPHRDVKSLRRGVFHNSRRLDNGNWVWRYD AIRTFGDFAGLWDDVDALSAPITLVRGGSSGFVTDQDTAELHRRATHFRGVHIVEKSG HSVQSDQPRALIEIVRGVLDTR" CDS complement(50005..51108) /codon_start=1 /transl_table=11 /gene="ino1" /locus_tag="BQ2027_MB0047C" /standard_name="tbINO" /product="MYO-INOSITOL-1-PHOSPHATE SYNTHASE INO1 (Inositol 1-phosphate synthetase) (D-glucose 6-phosphate cycloaldolase) (Glucose 6-phosphate cyclase) (Glucocycloaldolase)" /note="Mb0047c, ino1, len: 367 aa. Equivalent to Rv0046c, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 367 aa overlap). ino1 (alternate gene name: tbINO), myo-inositol-1-phosphate synthase (EC 5.5.1.4) (see citations below), equivalent to Q57240|Y046_MYCLE|U00015_14|G466956|B1620_F3_113 HYPOTHETICAL 40.3 KDA PROTEIN from Mycobacterium leprae (369 aa), FASTA scores: opt: 2221, E(): 0, (91.8% identity in 366 aa overlap). N-terminus similar to N-terminus of myo-inositol-1-phosphate synthases e.g. INO1_SPIPO|P42803 myo-inositol-1-phosphate synthase (510 aa), FASTA scores: opt: 144, E(): 0.021, (25.2% identity in 365 aa overlap); CAC21218.1|AJ401007 myo-inositol 1P synthase from Thermotoga sp. SG1 (335 aa); etc. Also highly similar to other hypothetical proteins e.g. AL049826|SCH24_21c hypothetical protein from Streptomyces coelicolor (360 aa), FASTA scores: opt: 1790, E(): 0, (77.8% identity in 360 aa overlap); AE000881_1 conserved protein from M. thermoautotrophicus (368 aa); etc. Protein product from Mb0047c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0047c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59967" /db_xref="InterPro:IPR002587" /db_xref="InterPro:IPR013021" /db_xref="InterPro:IPR017815" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P59967" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98407.1" /translation="MSEHQSLPAPEASTEVRVAIVGVGNCASSLVQGVEYYYNADDTS TVPGLMHVRFGPYHVRDVKFVAAFDVDAKKVGFDLSDAIFASENNTIKIADVAPTNVI VQRGPTLDGIGKYYADTIELSDAEPVDVVQALKEAKVDVLVSYLPVGSEEADKFYAQC AIDAGVAFVNALPVFIASDPVWAKKFTDAGVPIVGDDIKSQVGATITHRVLAKLFEDR GVQLDRTMQLNVGGNMDFLNMLERERLESKKISKTQAVTSNLKREFKTKDVHIGPSDH VGWLDDRKWAYVRLEGRAFGDVPLNLEYKLEVWDSPNSAGVIIDAVRAAKIAKDRGIG GPVIPASAYLMKSPPEQLPDDIARAQLEEFIIG" CDS complement(51169..51711) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0048C" /product="Transcriptional regulator, PadR family" /note="Mb0048c, -, len: 180 aa. Equivalent to Rv0047c, len: 180 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 180 aa overlap). Conserved hypothetical protein, equivalent to NP_302717.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (180 aa). Also showing strong similarity to other hypothetical proteins e.g. AL049826|SCH24_22|T36587 from Streptomyces coelicolor (225 aa), FASTA scores: opt: 583, E(): 9e-31, (51.4% identity in 177 aa overlap); etc. Some similarity to Rv1176c from Mycobacterium tuberculosis and to P94443|YFIO from Bacillus subtilis (182 aa), FASTA scores: E(): 0.00066, (24.9% identity in 177 aa overlap). Also some similarity to G1163121 MITHRAMYCIN RESISTANCE DETERMINANT, ATP-BINDING PROTEIN (219 aa), FASTA scores: opt: 143, E(): 0.0091, (29.4% identity in 180 aa overlap). Protein product from Mb0048c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0048c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005149" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC5" /protein_id="SIT98408.1" /translation="MLELAILGLLIESPMHGYELRKRLTGLLGAFRAFSYGSLYPALR RMQADGLIAENAAPAGTPVRRARRVYQLTDKGRRRFGELVADTGPHNYTDDGFGVHLA FFNRTPAEARMRILEGRRRQVEERREGLREAVARASSSFDRYTRQLHQLGLESSEREV KWLNELIAAERAAPNPAEQT" CDS complement(51812..52681) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0049C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0049c, -, len: 289 aa. Equivalent to Rv0048c, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 289 aa overlap). Possible membrane protein. Protein product from Mb0049c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0049c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVA5" /db_xref="InterPro:IPR012551" /db_xref="UniProtKB/TrEMBL:A0A1R3XVA5" /protein_id="SIT98410.1" /translation="MAKWLGAPLARGVSTATRAKDSDRQDACRILDDALRDGELSMEE HRERVSAATKAVTLGDLQRLVADLQVESAPAQMPALKSRAKRTELGLLAAAFVASVLL GVGIGWGVYGNTRSPLDFTSDPGAKPDGIAPVVLTPPRQLHSLGGLTGLLEQTRKRFG DTMGYRLVIYPEYASLDRVDPADDRRVLAYTYRGGWGDATSSAKSIADVSVVDLSKFD AKTAVSIMRGAPETLGLKQSDVKSMYLIVEPAKDPTTPAALSLSLYVSSDYGGGYLVF AGDGTIKHVSYPS" CDS 52815..53228 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0050" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0050, -, len: 137 aa. Equivalent to Rv0049, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Conserved hypothetical protein, only equivalent to AL022118|MLCB1913_20 hypothetical protein from Mycobacterium leprae (138 aa), FASTA scores: opt: 768, E(): 0, (83.9% identity in 137 aa overlap). Mb0050 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR035169" /db_xref="UniProtKB/Swiss-Prot:P64678" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98412.1" /translation="MDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRV ICPICRKEQLTLVSWVFGEHLGAVSGSARTAEELILLATRFSEFAVHVVEVCRTCSWN HLVKSYVLGAARPARPPRGSGGTRTARNGARTASE" CDS 53221..55689 /codon_start=1 /transl_table=11 /gene="ponA1" /locus_tag="BQ2027_MB0051" /product="PROBABLE BIFUNCTIONAL PENICILLIN-BINDING PROTEIN 1A/1B PONA1 (MUREIN POLYMERASE) (PBP1): PENICILLIN-INSENSITIVE TRANSGLYCOSYLASE (PEPTIDOGLYCAN TGASE) + PENICILLIN-SENSITIVE TRANSPEPTIDASE (DD-TRANSPEPTIDASE)" /note="Mb0051, ponA1, len: 680 aa. Equivalent to Rv0050, len: 678 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 680 aa overlap). Probable ponA1, penicillin-binding protein (class A), bienzymatic protein with transglycosylase (EC 2.4.2.-) and transpeptidase (EC 3.4.-.-) activities, highly similar to many e.g. NP_302715.1|NC_002677 penicillin-binding protein from Mycobacterium leprae (708 aa); AAB53123.1|L39923 penicillin binding protein from Mycobacterium leprae (686 aa), FASTA scores: (82.3% identity in 679 aa overlap); Q9F9V7|PONA|AAG13121.1|AF165523_1|AF165523 penicillin-binding protein 1 from Mycobacterium smegmatis (715 aa) (see citation below); CAB88838.1|AL353832 probable penicillin-binding protein from Streptomyces coelicolor (756 aa); etc. Also similar to ponA2|Rv3682|MTV025.030 BIFUNCTIONAL MEMBRANE-ASSOCIATED PENICILLIN-BINDING PROTEIN 1A/1B from Mycobacterium tuberculosis (810 aa). BELONGS TO THE TRANSGLYCOSYLASE FAMILY IN THE N-TERMINAL SECTION, AND TO THE TRANSPEPTIDASE FAMILY IN THE C-TERMINAL SECTION. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, two insertions of 3 bp each (*-tcc, *-cgt) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (680 aa versus 678 aa). Protein product from Mb0051 detected using SWATH mass spectrometry. Mb0051 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XUS2" /db_xref="InterPro:IPR001264" /db_xref="InterPro:IPR001460" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR023346" /db_xref="InterPro:IPR036950" /db_xref="UniProtKB/TrEMBL:A0A1R3XUS2" /protein_id="SIT98414.1" /translation="MNSDGRHHQSSSGAPRGPANPGQRGQVPPDDRLTAILPPVTDDR SAPHADSIEAVKAALDGAPPMPPPRDPLEEVTAALAAPPGKPPRGDQLGGRRRPPGPP GPPGSSGQPAGRLPQPRVDLPRVGQINWKWIRRSLYLTAAVVILLLMVTFTMAYLIVD VPKPGDIRTNQVSTILASDGSEIAKIVPPEGNRVDVNLSQVPMHVRQAVIAAEDRNFY SNPGFSFTGFARAVKNNLFGGDLQGGSTITQQYVKNALVGSAQHGWSGLMRKAKELVI ATKMSGEWSKDDVLQAYLNIIYFGRGAYGISAASKAYFDKPVEQLTVAEGALLAALIR RPSTLDPAVDPEGAHARWNWVLDGMVETKALSPNDRAAQVFPETVPPDLARAENQTKG PNGLIERQVTRELLELFNIDEQTLNTQGLVVTTTIDPQAQRAAEKAVAKYLDGQDPDM RAAVVSIDPHNGAVRAYYGGDNANGFDFAQAGLQTGSSFKVFALVAALEQGIGLGYQV DSSPLTVDGIKITNVEGEGCGTCNIAEALKMSLNTSYYRLMLKLNGGPQAVADAAHQA GIASSFPGVAHTLSEDGKGGPPNNGIVLGQYQTRVIDMASAYATLAASGIYHPPHFVQ KVVSANGQVLFDASTADNTGDQRIPKAVADNVTAAMEPIAGYSRGHNLAGGRDSAAKT GTTQFGDTTANKDAWMVGYTPSLSTAVWVGTVKGDEPLVTASGAAIYGSGLPSDIWKA TMDGALKGTSNETFPKPTEVGGYAGVPPPPPPPPSEVPPSETVIQPTVEIAPGITIPI GPPTTITLAPPPPAPPAATPTPPP" CDS 55686..57368 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0052" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0052, -, len: 560 aa. Equivalent to Rv0051, len:560 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 560 aa overlap). Probable conserved transmembrane protein, equivalent to NP_302714.1|NC_002677 conserved membrane protein from Mycobacterium leprae (564 aa); and highly similar to C-terminus of AAF25828.1|AF187306_1|AF187306 putative transmembrane protein from Mycobacterium smegmatis (692 aa). Also highly similar to MSGDNAB_5|G886306|L222-ORF5 (418 aa), FASTA scores: opt: 2163, E(): 0, (78.4% identity in 412 aa overlap). Also similar to AL049826|SCH24_24|T36589 probable transmembrane protein from Streptomyces coelicolor (502 aa), FASTA scores: opt: 492, E(): 1.4e-23, (35.8% identity in 522 aa overlap). Mb0052 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU43" /db_xref="InterPro:IPR016570" /db_xref="InterPro:IPR018584" /db_xref="UniProtKB/TrEMBL:A0A1R3XU43" /protein_id="SIT98416.1" /translation="MTGALSQSSNISPLPLAADLRSADNRDCPSRTDVLGAALANVVG GPVGRHALIGRTRLMTPLRVMFAIALVFLALGWSTKAACLQSTGTGPGDQRVANWDNQ RAYYQLCYSDTVPLYGAELLSQGKFPYKSSWIETDSNGTPQLRYDGQIAVRYMEYPVL TGIYQYLSMAIAKTYTALSKVAPLPVVAEVVMFFNVAAFGLALAWLTTVWATSGLAGR RIWDAALVAASPLVIFQIFTNFDALATGLATSGLLAWARRRPVLAGVLIGLGSAAKLY PLLFLYPLLLLGIRAGRLNALARTMAAAAATWLLVNLPVMLLFPRGWSEFFRLNTRRG DDMDSLYNVVKSFTGWRGFDPTLGFWEPPLVLNTVVTLLFVLCCAAIAYIALTAPHRP RVAQLTFLTVASFLLVNKVWSPQFSLWLVPLAVLALPHRRILLAWMTIDALVWVPRMY YLYGNPSRSLPEQWFTTTVLLRDIAVMVLCGLVVWQIYRPGRDLVRTGGPGALPACGG VDDPVGGVFANAADAPPGRLPSWLRPRLGDEHARERTPDAGRDRTFSGQHRA" CDS 57400..57963 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0053" /product="Intracellular protease" /note="Mb0053, -, len: 187 aa. Equivalent to Rv0052, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Conserved hypothetical protein, similar to others e.g. AL049587|SC5F2A_30S|T35272 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 531, E(): 3.4e-29, (49.5% identity in 182 aa overlap); NP_420588.1|NC_002696 ThiJ/PfpI family protein from Caulobacter crescentus (267 aa); etc. Some similarity to Escherichia coli G1100872|thiJ (198 aa), FASTA scores: opt: 178, E(): 6.1e-06, (29.9% identity in 137 aa overlap). Also similar to Rv1930c from Mycobacterium tuberculosis (174 aa). May be a membrane protein. Protein product from Mb0053 detected using shotgun mass spectrometry. Mb0053 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002818" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/TrEMBL:A0A1R3XU47" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98418.1" /translation="MPSFDVVFVGHRRGEVRSDNAMLGLLCDAAFDELTRPDVVIFPG GIGTRTLIHDQTVLDWVREAHRHTLLTTSVCTGGLVLAAAGLLNGLTATTHWRVQDLF NSLGARYVPQRVVEHLPERVITAAGVSSGIDMGLRLVELLVSREAAEASQLMIEYDPQ PPVDAGSLAKASPATHRLALEFYQHRL" CDS 58182..58472 /codon_start=1 /transl_table=11 /gene="rpsF" /locus_tag="BQ2027_MB0054" /product="30s ribosomal protein s6 rpsf" /note="Mb0054, rpsF, len: 96 aa. Equivalent to Rv0053, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). Probable 30S ribosomal protein S6, equivalent to RS6_MYCLE|P46389 30s ribosomal protein s6 from Mycobacterium leprae (96 aa), FASTA scores: opt: 570, E(): 1.1e-36, (91.7% identity in 96 aa overlap).Also highly similar to many e.g. Q9X8U2|RS6_STRCO 30S RIBOSOMAL PROTEIN S6 from Streptomyces coelicolor (96 aa); etc. Note that the putative product of this CDS corresponds to spot 6_26 identified in culture supernatant by proteomics at the Max-Planck-Institut fuer Infektionsbiologie (see citations below). Contains PS01048 Ribosomal protein S6 signature. BELONGS TO THE S6P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0054 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0054 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66592" /db_xref="InterPro:IPR000529" /db_xref="InterPro:IPR014717" /db_xref="InterPro:IPR020814" /db_xref="InterPro:IPR020815" /db_xref="InterPro:IPR035980" /db_xref="UniProtKB/Swiss-Prot:P66592" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98421.1" /translation="MRPYEIMVILDPTLDERTVAPSLETFLNVVRKDGGKVEKVDIWG KRRLAYEIAKHAEGIYVVIDVKAAPATVSELDRQLSLNESVLRTKVMRTDKH" CDS 58576..59070 /codon_start=1 /transl_table=11 /gene="ssb" /locus_tag="BQ2027_MB0055" /product="single-strand binding protein ssb (helix-destabilizing protein)" /note="Mb0055, ssb, len: 164 aa. Equivalent to Rv0054, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 164 aa overlap). Probable ssb, single-strand binding protein, equivalent to highly similar to others e.g. SSB_MYCLE|P46390 single-strand binding protein from Mycobacterium leprae (140 aa), FASTA scores: opt: 792, E(): 0, (92.6% identity in 135 aa overlap); and AAK30583.1|AF349434 single-stranded DNA-binding protein from Mycobacterium smegmatis (165 aa). Also highly similar to others e.g. T36594 probable single-strand binding protein from Streptomyces coelicolor (199 aa); etc. Also similar to Rv2478c|MTV008_34c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (161 aa), FASTA score: E (): 1.1e-06. Note that the putative product of this CDS corresponds to spot 3_210 identified in culture supernatant by proteomics at the Max-Planck-Institut fuer Infektionsbiologie (see citations below). BELONGS TO THE SSB FAMILY. Protein product from Mb0055 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0055 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A611" /db_xref="InterPro:IPR000424" /db_xref="InterPro:IPR011344" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/Swiss-Prot:P0A611" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98423.1" /translation="MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQT GEWKDGEALFLRCNIWREAAENVAESLTRGARVIVSGRLKQRSFETREGEKRTVIEVE VDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAPASGSFGGGDD EPPF" CDS 59112..59366 /codon_start=1 /transl_table=11 /gene="rpsR1" /locus_tag="BQ2027_MB0056" /product="30s ribosomal protein s18-1 rpsr1" /note="Mb0056, rpsR1, len: 84 aa. Equivalent to Rv0055, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Probable rpsR1, 30S ribosomal protein S18-1, equivalent to NP_302711.1|NC_002677|O53125|RS18_MYCLE 30S RIBOSOMAL PROTEIN from Mycobacterium leprae (84 aa). Also highly similar to others e.g. Q9X8U4|R18A_STRCO 30S RIBOSOMAL PROTEIN S18-1 from Streptomyces coelicolor (78 aa); RS18_B|ACST|P10806 30s ribosomal protein s18 (bs21) (77 aa), FASTA scores: opt: 220, E(): 4e-10, (52.2% identity in 67 aa overlap); etc. Also similar to MTCY63A_5 from Mycobacterium tuberculosis. BELONGS TO THE S18P FAMILY OF RIBOSOMAL PROTEINS. Note that previously known as rpsR. Protein product from Mb0056 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0056 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P69231" /db_xref="InterPro:IPR001648" /db_xref="InterPro:IPR018275" /db_xref="InterPro:IPR036870" /db_xref="UniProtKB/Swiss-Prot:P69231" /protein_id="SIT98425.1" /translation="MAKSSKRRPAPEKPVKTRKCVFCAKKDQAIDYKDTALLRTYISE RGKIRARRVTGNCVQHQRDIALAVKNAREVALLPFTSSVR" CDS 59399..59857 /codon_start=1 /transl_table=11 /gene="rplI" /locus_tag="BQ2027_MB0057" /product="50s ribosomal protein l9 rpli" /note="Mb0057, rplI, len: 152 aa. Equivalent to Rv0056, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 152 aa overlap). Probable rplI, 50S ribosomal protein L9, equivalent to RL9_MYCLE|P46385 50s ribosomal protein l9 from Mycobacterium leprae (152 aa), FASTA scores: opt: 847, E(): 0, (88.7% identity in 150 aa overlap). Also highly similar to others e.g. Q9X8U5|RL9_STRCO 50S RIBOSOMAL PROTEIN L9 from Streptomyces coelicolor (148 aa); etc. Contains PS00651 Ribosomal protein L9 signature. BELONGS TO THE L9P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0057 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0057 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66316" /db_xref="InterPro:IPR000244" /db_xref="InterPro:IPR009027" /db_xref="InterPro:IPR020069" /db_xref="InterPro:IPR020070" /db_xref="InterPro:IPR020594" /db_xref="InterPro:IPR036791" /db_xref="InterPro:IPR036935" /db_xref="UniProtKB/Swiss-Prot:P66316" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98427.1" /translation="MKLILTADVDHLGSIGDTVEVKDGYGRNFLLPRGLAIVASRGAQ KQADEIRRARETKSVRDLEHANEIKAAIEALGPIALPVKTSADSGKLFGSVTAADVVA AIKKAGGPNLDKRIVRLPKTHIKAVGTHFVSVHLHPEIDVEVSLDVVAQS" CDS 59886..60407 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0058" /product="HYPOTHETICAL PROTEIN" /note="Mb0058, -, len: 173 aa. Equivalent to Rv0057, len: 173 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 173 aa overlap). Hypothetical unknown protein. Mb0058 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64680" /protein_id="SIT98430.1" /translation="MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGL NVRKMCLKANTPGAVTWLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTD VDGYAHAMHSSINSGPLEYLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGAC VGGGESPWRSLMT" CDS 60386..63010 /codon_start=1 /transl_table=11 /gene="dnaB" /locus_tag="BQ2027_MB0059" /product="PROBABLE REPLICATIVE DNA HELICASE DNAB" /note="Mb0059, dnaB, len: 874 aa. Equivalent to Rv0058, len: 874 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 874 aa overlap). Probable dnaB, replicative DNA helicase (EC 3.6.1.-). Contains an intein (position 61630..62838) similar to, and in the same position as, those in Sycnechocystis and Rhodothermus marinus (see citation below). Highly similar to others e.g. DNAB_SYNY3|Q55418 replicative dna helicase (872 aa), FASTA scores: opt: 533, E(): 0, (32.5% identity in 424 aa overlap). Also similar to intein recA|E1173867|AL008967 RECA INTEIN from Mycobacterium tuberculosis (442 aa), FASTA scores: E(): 3.8e-16, (27.0% identity in 426 aa overlap). C-terminal extein (position 62839..63015) similar to many dnaB proteins e.g. NP_302709.1|NC_002677|P46394|DNAB_MYCLE REPLICATIVE DNA HELICASE from Mycobacterium leprae (604 aa); DNAB_ECOLI|P03005 replicative dna helicase from Escherichia coli (471 aa), FASTA scores: opt: 148, E(): 1.5e-07, (37.9% identity in 58 aa overlap); etc. THIS PROTEIN UNDERGOES A PROTEIN SELF SPLICING THAT INVOLVES A POST-TRANSLATIONAL EXCISION OF THE INTERVENING REGION (INTEIN) FOLLOWED BY PEPTIDE LIGATION. BELONGS TO THE HELICASE FAMILY, DNAB SUBFAMILY. IN THE INTEIN SECTION; BELONGS TO THE HOMING ENDONUCLEASE FAMILY. Mb0059 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59966" /db_xref="InterPro:IPR003586" /db_xref="InterPro:IPR003587" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004042" /db_xref="InterPro:IPR004860" /db_xref="InterPro:IPR006141" /db_xref="InterPro:IPR006142" /db_xref="InterPro:IPR007692" /db_xref="InterPro:IPR007693" /db_xref="InterPro:IPR007694" /db_xref="InterPro:IPR016136" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR027434" /db_xref="InterPro:IPR030934" /db_xref="InterPro:IPR036185" /db_xref="InterPro:IPR036844" /db_xref="UniProtKB/Swiss-Prot:P59966" /protein_id="SIT98432.1" /translation="MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDA IADVLERLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRGLLRRIGGAPY LHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGADVAEVVDRAQ AEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLARGVATGFTELDEVTNGLHP GQMVIVAARPGVGKSTLGLDFMRSCSIRHRMASVIFSLEMSKSEIVMRLLSAEAKIKL SDMRSGRMSDDDWTRLARRMSEISEAPLFIDDSPNLTMMEIRAKARRLRQKANLKLIV VDYLQLMTSGKKYESRQVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPML ADLRESGCLTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPS GRKEVFRLRLASGREVEATGSHPFMKFEGWTPLAQLKVGDRIAAPRRVPEPIDTQRMP ESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHSDGAAIRDDYLAARVPS LRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKCVPEAVFRAPNDQVALFLRHLWSAG GSVRWDPTNGQGRVYYGSTSRRLIDDVAQLLLRVGIFSWITHAPKLGGHDSWRLHIHG AKDQVRFLRHVGVHGAEAVAAQEMLRQLKGPVRNPNLDSAPKKVWAQVRNRLSAKQMM DIQLHEPTMWKHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDG TVSGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKHRNGPTK TVTVAHQLHLSRFANMAR" CDS 63190..63882 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0060" /product="HYPOTHETICAL PROTEIN" /note="Mb0060, -, len: 230 aa. Equivalent to Rv0059, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). Hypothetical unknown protein. Protein product from Mb0060 detected using shotgun mass spectrometry. Mb0060 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029494" /db_xref="UniProtKB/TrEMBL:A0A1R3XUE0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98434.1" /translation="MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAG RLLADSAVTPTTEVAYNPVKELRRHKVVAPDSRYPASMASDHVPFYIAARSPMLYVVC KGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQVDTLGTFVDFD LLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCCYNTETMTRVRTLLDPVGG VRKYVIKPGMYY" CDS 63899..64957 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0061" /product="ADP-ribose 1-phosphate phophatase related protein" /note="Mb0061, -, len: 352 aa. Equivalent to Rv0060, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Conserved hypothetical protein, showing weak similarity to NP_104623.1|NC_002678 hypothetical protein from Mesorhizobium loti (155 aa); and AP000062|AP000062_92 hypothetical protein from Aeropyrum pernix (194 aa), FASTA scores: opt: 186, E(): 4.2e-05, (30.9% identity in 165 aa overlap). Protein product from Mb0061 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0061 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002589" /db_xref="UniProtKB/TrEMBL:A0A1R3XUT5" /protein_id="SIT98436.1" /translation="MITYGSGDLLRADTEALVNTVNCVGVMGKGIALQFKRRYPEMFT AYEKACKRGEVTIGKMFVVDTGQLDGPKHIINFPTKKHWRAPSKLAYIDAGLIDLIRV IRELNIASVAVPPLGVGNGGLDWEDVEQRLVSAFQQLPDVDAVIYPPSGGSRAIEGVE GLRMTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFT PGRYGPYSERVRHLLQGMEGAFTVGLGDGTARVLANQPISLTTKGTDAITDYLATDAA ADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPATAAAAVRKWTKRKGR IYSDDRIGVALDRILMTA" CDS complement(65002..65340) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0062A" /product="Hypothetical protein" /note="Mb0062A, len: 112 aa. Equivalent to Rv0061c len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved hypothetical protein supported by RNA-seq data. Similar to MMAR_3839, 76% identity in 112 aa overlap. Replaces questionable ORF Rv0061 (MTV030.04). Protein product from Mb0062A detected using shotgun mass spectrometry. Mb0062A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XU77" /protein_id="SIT98437.1" /translation="MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCP GGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGG AIPSEQPNAP" CDS 65542..66684 /codon_start=1 /transl_table=11 /gene="celA1" /locus_tag="BQ2027_MB0063" /product="POSSIBLE CELLULASE CELA1 (ENDOGLUCANASE) (ENDO-1,4-BETA-GLUCANASE) (FI-CMCASE) (CARBOXYMETHYL CELLULASE)" /note="Mb0063, celA1, len: 380 aa. Equivalent to Rv0062, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 380 aa overlap). Possible celA1, cellulase (EC 3.2.1.4), similar to many e.g. AB65568.1|AL136058 putative secreted endoglucanase (cellulase) from Streptomyces coelicolor (332 aa); P07984|GUNA_CELFI ENDOGLUCANASE A PRECURSOR from Cellulomonas fimi (449 aa); GUN1_STRHA|P33682 endoglucanase 1 precursor (cellulase) from STREPTOMYCES HALSTEDII (321 aa), FASTA scores: opt: 702, E(): 1. 2e-27, (38.9% identity in 319 aa overlap); etc. SEEMS TO BELONG TO CELLULASE FAMILY B (FAMILY 6 OF GLYCOSYL HYDROLASES). Note that previously known as celA. Protein product from Mb0063 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0063 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU64" /db_xref="InterPro:IPR016288" /db_xref="InterPro:IPR036434" /db_xref="UniProtKB/TrEMBL:A0A1R3XU64" /protein_id="SIT98438.1" /translation="MTRRTGQRWRGTLPGRRPWTRPAPATCRRHLAFVELRHYFARVM SSAIGSVARWIVPLLGVAAVASIGVIADPVRVVRAPALILVDAANPLAGKPFYVDPAS AAMIAARNANPPNAELTSVANTPQSYWLDQAFPPATVGGTVARYTGAAQAAGAMPVLT LYGIPHRDCGSYASGGFATGTDYRGWIDAVASGLGSSPATIIVEPDALAMADCLSPDQ RQERFDLVRYAVDTLTRDPAAAVYVDAGHSRWLSAEAMAARLNDVGVGRARGFSLNVS NFYTTDEEIGYGEAISGLTNGSHYVIDTSRNGAGPAPDAPLNWCNPSGRALGAPPTTA TAGAHADAYLWIKRPGESDGTCGRGEPQAGRFVSQYAIDLAHNAGQ" CDS 66913..68352 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0064" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0064, -, len: 479 aa. Equivalent to Rv0063, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to many e.g. HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase from Arthrobacter oxidans (458 aa), FASTA scores: opt: 343, E(): 3.4e-13, (27.4% identity in 467 aa overlap); AAD28454.1|AF127374_9|AF127374|MitR oxidase from Streptomyces lavendulae (514 aa); AAF81732.1|AF254925|EncM putative FAD-dependent oxygenase from Streptomycesmaritimus (464 aa); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3107c, Rv1257c, etc. Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site. Protein product from Mb0064 detected using SWATH mass spectrometry. Mb0064 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU80" /db_xref="InterPro:IPR006093" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR012951" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016167" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3XU80" /protein_id="SIT98439.1" /translation="MAREISRQTFLRGAAGALAAGAVFGSVRATADPAASGWEALSSA LGGKVLQPDDGPQFATAKQVFNTNYNGYTPAVIVTPTSQLDVQKAMAFAAANNLKVAP RGGGHSYVGASTANGAMVLDLRQLPGDINYDATTGRVTVTPATGLYAMHQVLAAAGRG IPTGTCPTVGVAGHALGGGLGANSRHAGLLCDQLTSASVVLPSGQAVTASATDHPDLF WALRGGGGGNFGVTTSLTFATFPSGDLDVVNLNFPPQSFAQVLVGWQNWLRTADRGSW ALADATVDPLGTHCRILATCPAGSGGSVAAAIVSAVGTQPTGTENHTFNYLDLVRYLA VGNLNPSPLGYVGGSDVFTTITPATAQGIASAVDAFPRGAGRMLAIMHALDGALATVS PGATAFPWRRQSALVQWYVETSGSPSEATSWLNTAHQAVRAYSVGGYVNYLEVNQPPA RYFGPNLSRLSAVRQKYDPSRVMFSGLNF" CDS 68610..71549 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0065" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0065, -, len: 979 aa. Equivalent to Rv0064, len: 979 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 979 aa overlap). Probable conserved transmembrane protein, highly similar to NP_301532.1| (NC_002677) putative integral membrane protein from Mycobacterium leprae (983 aa). Also similar to other hypothetical proteins from ARCHAEOGLOBUS FULGIDUS and Synecocystis sp. e.g. P72637|D90899 HYPOTHETICAL 117.2 KD PROTEIN from SYNECHOCYSTIS SP. (1032 aa), FASTA scores: opt: 1004, E(): 3.6e-32, (31.0 % identity in 848 aa overlap); and CAC01334.1|AL390968 putative integral membrane protein (fragment) from Streptomyces coelicolor (815 aa); etc. Also similar to Rv3193c from Mycobacterium tuberculosis (992 aa), FASTA score: (50.3% identity in 985 aa overlap). Contains probable coiled-coil domain from aa 948 to 976. Mb0065 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U2X8" /db_xref="InterPro:IPR005372" /db_xref="UniProtKB/Swiss-Prot:Q7U2X8" /protein_id="SIT98440.1" /translation="METGSPGKRPVLPKRARLLVTAGMGMLALLLFGPRLVDIYVDWL WFGEVGFRSVWITVLLTRLAIVAAVALVVAGIVLAALLLAYRSRPFFVPDEPQRDPVA PLRSAVMRRPRLFGWGIAVTLGVVCGLIASFDWVKVQLFVHGGTFGIVDPEFGYDIGF FVFDLPFYRSVLNWLFVAVVLAFLASLLTHYLFGGLRLTTGRGMLTQAARVQLAVFAG AVVLLKAVAYWLDRYELLSSGRKEPTFTGAGYTDIHAELPAKLVLVAIAVLCAVSFFT AIFLRDLRIPAMAAALLVLSAILVGGLWPLLMEQFSVRPNAADVERPYIQRNIEATRE AYRIGGDWVQYRSYPGIGTKQPRDVPVDVTTIAKVRLLDPHILSRTFTQQQQLKNFFS FAEILDIDRYRIDGELQDYIVGVRELSPKSLTGNQTDWINKHIVYTHGNGFVAAPANR VNAAARDAENISDSNSGYPIYAVSDIASLGSGRQVIPVEQPRVYYGEVIAQADPDYAI VGGAPGSAPREYDTDTSKYTYTGAGGVSIGNWFNRTVFATKFAQHKFLFSREIGSESK VLIHRDPKERVQRVAPWLTTDDNPYPVVVNGRIVWIVDAYTTLDTYPYAQRSSLEGPV TSPTGIVRQGKQVSYVRNSVKATVDAYDGTVTLFQFDRDDPVLRTWMRAFPGTVKSED QIPDELRAHFRYPEDLFEVQRSLLAKYHVDEPREFFTTNAFWSVPSDPTNDANATQPP FYVLVGDQQSAQPSFRLASAMVGYNREFLSAYISAHSDPANYGKLTVLELPTDTLTQG PQQIQNSMISDTRVASERTLLERSNRIHYGNLLSLPIADGGVLYVEPLYTERISTSPS SSTFPQLSRVLVSVREPRTEGGVRVGYAPTLAESLDQVFGPGTGRVATAPGGDAASAP PPGAGGPAPPQAVPPPRTTQPPAAPPRGPDVPPATVAELRETLADLRAVLDRLEKAID AAETPGG" CDS 71616..71855 /codon_start=1 /transl_table=11 /gene="vapB1" /locus_tag="BQ2027_MB0065A" /product="Possible antitoxin VapB1" /note="Mb0065A, len: 79 aa. Equivalent to Rv0064A len: 79 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 79 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB1, antitoxin, part of toxin-antitoxin (TA) operon with Rv0065 (See Arcus et al., 2005; Pandey and Gerdes, 2005). Weakly similar to others in Mycobacterium tuberculosis e.g. Rv0300 (73 aa),Rv1721c (75 aa) Protein product from Mb0065A detected using shotgun mass spectrometry. Mb0065A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XU88" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/TrEMBL:A0A1R3XU88" /protein_id="SIT98441.1" /translation="MATIQVRDLPEDVAETYRRRATAAGQSLQTYMRTKLIEGVRGRD KAEVIEILEQALASTASPGISRETIEASRRELRGG" CDS 71848..72249 /codon_start=1 /transl_table=11 /gene="vapc1" /locus_tag="BQ2027_MB0066" /product="possible toxin vapc1" /note="Mb0066, -, len: 133 aa. Equivalent to Rv0065, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins from Mycobacterium tuberculosis: Rv0960 (127 aa), Rv1720c (129 aa), and Rv0549c (137 aa). Protein product from Mb0066 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0066 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XU79" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XU79" /protein_id="SIT98442.1" /translation="MDECVVDAAAVVDALAGKGASAIVLRGLLKESISNAPHLLDAEV GHALRRAVLSDEISEEQARAALDALPYLIDNRYPHSPRLIEYTWQLRHNVTFYDALYV ALATALDVPLLTGDSRLAAAPGLPCEIKLVR" CDS complement(72301..74538) /codon_start=1 /transl_table=11 /gene="icd2" /locus_tag="BQ2027_MB0067C" /product="PROBABLE ISOCITRATE DEHYDROGENASE [NADP] ICD2 (OXALOSUCCINATE DECARBOXYLASE) (IDH) (NADP+-SPECIFIC ICDH) (IDP)" /note="Mb0067c, icd2, len: 745 aa. Equivalent to Rv0066c, len: 745 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 745 aa overlap). Probable icd2, isocitrate dehydrogenase NADP-dependant (EC 1.1.1.42), equivalent to NP_302705.1|NC_002677 isocitrate dehydrogenase [NADP] from Mycobacterium leprae (746 aa). Also highly similar to many members of the monomeric-type family of IDH e.g. NP_251314.1|NC_002516 isocitrate dehydrogenase from Pseudomonas aeruginosa (741 aa); IDH_AZOVI|P16100 isocitrate dehydrogenase (nadp) from Azotobacter vinelandii (741 aa), FASTA scores: opt: 3106, E(): 0, (61.4% identity in 735 aa overlap); NP_230786.1|NC_002505 isocitrate dehydrogenase [Vibrio cholerae (741 aa); etc. BELONGS TO THE MONOMERIC-TYPE FAMILY OF IDH. Protein product from Mb0067c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0067c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWE0" /db_xref="InterPro:IPR004436" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98443.1" /translation="MSAEQPTIIYTLTDEAPLLATYAFLPIVRAFAEPAGIKIEASDI SVAARILAEFPDYLTEEQRVPDNLAELGRLTQLPDTNIIKLPNISASVPQLVAAIKEL QDKGYAVPDYPADPNTDQEKAIKERYARCLGSAVNPVLRQGNSDRRAPKAVKEYARKH PHSMGEWSMASRTHVAHMRHGDFYAGEKSMTLDRARNVRMELLAKSGKTIVLKPEVPL DDGDVIDSMFMSKKALCDFYEEQMQDAFETGVMFSLHVKATMMKVSHPIVFGHAVRIF YKDAFAKHQELFDDLGVNVNNGLSDLYSKIESLPASQRDEIIEDLHRCHEHRPELAMV DSARGISNFHSPSDVIVDASMPAMIRAGGKMYGADGKLKDTKAVNPESTFSRIYQEII NFCKTNGQFDPTTMGTVPNVGLMAQQAEEYGSHDKTFEIPEDGVANIVDVATGEVLLT ENVEAGDIWRMCIVKDAPIRDWVKLAVTRARISGMPVLFWLDPYRPHENELIKKVKTY LKDHDTEGLDIQIMSQVRSMRYTCERLVRGLDTIAATGNILRDYLTDLFPILELGTSA KMLSVVPLMAGGGMYETGAGGSAPKHVKQLVEENHLRWDSLGEFLALGAGFEDIGIKT GNERAKLLGKTLDAAIGKLLDNDKSPSRKTGELDNRGSQFYLAMYWAQELAAQTDDQQ LAEHFASLADVLTKNEDVIVRELTEVQGEPVDIGGYYAPDSDMTTAVMRPSKTFNAAL EAVQG" CDS complement(74656..75225) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0068C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0068c, -, len: 189 aa. Equivalent to Rv0067c, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 189 aa overlap). Possible transcriptional regulator, highly similar except in N-terminus to T44726 probable transcription regulator from Mycobacterium leprae (189 aa), FASTA scores: opt: 829, E(): 0, (68.3% identity in 189 aa overlap). And similar to others, often many members of the tetR family, e.g. T36918 probable transcription regulator from Streptomyces coelicolor (202 aa); NP_535866.1|NC_003306 transcriptional regulator TetR family from Agrobacterium tumefaciens strain C58 (Dupont) (194 aa); UIDR_ECOLI|Q59431 uid operon repressor (gus operon repressor) from Escherichia coli (196 aa), FASTA scores: opt: 200, E(): 7.2e-06, (24.7% identity in 186 aa overlap); etc. Also similar to MTCY8D5_28 from Mycobacterium tuberculosis cosmid (229 aa), FASTA score: (32.7% identity in 168 aa overlap). Contains probable helix-turn-helix motif from aa 34 to 55 (Score 1523, +4.37 SD). Protein product from Mb0068c detected using SWATH mass spectrometry. Mb0068c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVC8" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3XVC8" /protein_id="SIT98444.1" /translation="MAPTDRRVRADAARNRARVLEVAYQTFAADGLSVPVDEIARRAG VGAGTVYRHFPTKEALFQAVIADRMHRIIDKGHALLKSKHPGDALFAFLRSMVLQWGA TDRGLVEALAGVGIEISSAAPEAEADFLDLLTDLLRAAQRAGTVRPDVDVLEVKTLLV GCQAMQSYNAELAAKVTDVALDGLRANRK" CDS 75328..76239 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0069" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0069, -, len: 303 aa. Equivalent to Rv0068, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 303 aa overlap). Probable oxidoreductase (EC 1.-.-.-), equivalent to NP_301343.1|NC_002677 putative oxidoreductase from Mycobacterium leprae (304 aa). Also highly similar to many e.g. NP_485762.1|NC_003272 probable oxidoreductase from Nostoc sp. PCC 7120 (311 aa); NP_279536.1|NC_002607|YajO1 probable oxidoreductase from Halobacterium sp. NRC-1 (316 aa); OXIR_STRAT|Q03326 probable oxidoreductase from Streptomyces antibioticus (298 aa), FASTA scores: opt: 430, E(): 1.3e-16, (34.9% identity in 295 aa overlap); etc. Also highly similar to MTV037_3 and MTV022_13 from Mycobacterium tuberculosis. Protein product from Mb0069 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XUE4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT98446.1" /translation="MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLA VRNLDKGKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINNAG VMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISSVGHRIRAAIH FDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTTIAVASHPGVSNTELVRNM PRPLVAVAAILAPLMQDAELGALPTLRAATDPAVRGGQYFGPDGFGEIRGYPKVVASS AQSHDEQLQRRLWAVSEELTGVVYPVG" CDS complement(76264..77649) /codon_start=1 /transl_table=11 /gene="sdaA" /locus_tag="BQ2027_MB0070C" /product="PROBABLE L-SERINE DEHYDRATASE SDAA (L-SERINE DEAMINASE) (SDH) (L-SD)" /note="Mb0070c, sdaA, len: 461 aa. Equivalent to Rv0069c, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 461 aa overlap). Probable sdaA, L-serine dehydratase (EC 4.3.1.17), equivalent to NP_302203.1| NC_002677 L-serine dehydratase from Mycobacterium leprae (458 aa). Also highly similar to many e.g. NP_251133.1|NC_002516 L-serine dehydratase from Pseudomonas aeruginosa (458 aa); O86564|SDHL_STRCO L-SERINE DEHYDRATASE from Streptomyces coelicolor (455 aa); SDHL_ECOLI|P16095 L-serine dehydratase 1 from Escherichia coli (454 aa), FASTA scores: opt: 1381, E(): 0, (51.1% identity in 460 aa overlap); etc. BELONGS TO THE IRON-SULFUR DEPENDENT L-SERINE DEHYDRATASE FAMILY. COFACTOR: IRON-SULFUR (4FE-4S) (PROBABLE). Protein product from Mb0070c detected using SWATH mass spectrometry. Mb0070c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66774" /db_xref="InterPro:IPR004644" /db_xref="InterPro:IPR005130" /db_xref="InterPro:IPR005131" /db_xref="InterPro:IPR029009" /db_xref="UniProtKB/Swiss-Prot:P66774" /protein_id="SIT98448.1" /translation="MTISVFDLFTIGIGPSSSHTVGPMRAANQFVVALRRRGHLDDLE AMRVDLFGSLAATGAGHGTMSAILLGLEGCQPETITTEHKERRLAEIAASGVTRIGGV IPVPLTERDIDLHPDIVLPTHPNGMTFTAAGPHGRVLATETYFSVGGGFIVTEQTSGN SGQHPCSVALPYVSAQELLDICDRLDVSISEAALRNETCCRTENEVRAALLHLRDVMV ECEQRSIAREGLLPGGLRVRRRAKVWYDRLNAEDPTRKPEFAEDWVNLVALAVNEENA SGGRVVTAPTNGAAGIVPAVLHYAIHYTSAGAGDPDDVTVRFLLTAGAIGSLFKERAS ISGAEVGCQGEVGSAAAMAAAGLAEILGGTPRQVENAAEIAMEHSLGLTCDPIAGLVQ IPCIERNAISAGKAINAARMALRGDGIHRVTLDQVIDTMRATGADMHTKYKETSAGGL AINVAVNIVEC" CDS complement(77646..78923) /codon_start=1 /transl_table=11 /gene="glyA2" /locus_tag="BQ2027_MB0071C" /product="serine hydroxymethyltransferase glya2 (serine methylase 2) (shmt 2)" /note="Mb0071c, glyA2, len: 425 aa. Equivalent to Rv0070c, len: 425 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 425 aa overlap). Probable glyA2, serine hydroxymethyltransferase (EC 2.1.2.1), equivalent to NP_302318.1|NC_002677 serine hydroxymethyltransferase from Mycobacterium leprae (426 aa). Also highly similar to many e.g. O86565|GLYA_STRCO SERINE HYDROXYMETHYLTRANSFERASE from Streptomyces coelicolor (420 aa); AAK60516.1|AF327063_1|AF327063 serine hydroxymethyltransferase from Corynebacterium glutamicum (434 aa); GLYA_ECOLI|P00477 serine hydroxymethyltransferase from Escherichia coli (417 aa), FASTA scores: opt: 1462, E(): 0, (54.3% identity in 416 aa overlap); etc. Also highly similar to MTV017_46 from Mycobacterium tuberculosis. Contains PS00096 Serine hydroxymethyltransferase pyridoxal-phosphate attachment site. BELONGS TO THE SHMT FAMILY. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb0071c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0071c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U2X3" /db_xref="InterPro:IPR001085" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR019798" /db_xref="InterPro:IPR039429" /db_xref="UniProtKB/Swiss-Prot:Q7U2X3" /protein_id="SIT98450.1" /translation="MNTLNDSLTAFDPDIAALIDGELRRQESGLEMIASENYAPLAVM QAQGSVLTNKYAEGYPGRRYYGGCEFVDGVEQLAIDRVKALFGAEYANVQPHSGATAN AATMHALLNPGDTILGLSLAHGGHLTHGMRINFSGKLYHATAYEVSKEDYLVDMDAVA EAARTHRPKMIIAGWSAYPRQLDFARFRAIADEVDAVLMVDMAHFAGLVAAGVHPSPV PHAHVVTSTTHKTLGGPRGGIILCNDPAIAKKINSAVFPGQQGGPLGHVIAAKATAFK MAAQPEFAQRQQRCLDGARILAGRLTQPDVAERGIAVLTGGTDVHLVLVDLRDAELDG QQAEDRLAAVDITVNRNAVPFDPRPPMITSGLRIGTPALAARGFSHNDFRAVADLIAA ALTATNDDQLGPLRAQVQRLAARYPLYPELHRT" CDS 79513..80229 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0072" /product="POSSIBLE MATURASE" /note="Mb0072, -, len: 238 aa. Equivalent to Rv0071, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 238 aa overlap). Possible maturase, similar to many proteins of the group II intron maturase family e.g. P95451|U77945 MATURASE-RELATED PROTEIN from PSEUDOMONAS ALCALIGENES (297 aa), FASTA scores: opt: 395, E(): 1.7e-20, (43.5% identity in 147 aa overlap); N-terminus of AAD16434.1|AF101076 maturase-related protein from Pseudomonas putida (473 aa); N-terminus of NP_437373.1|NC_003078 putative reverse transcriptasematurase protein from Sinorhizobium meliloti (453 aa); etc. Also similar to MLCL581_1 from Mycobacterium leprae. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp insertion (*-cggtggacc) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (238 aa versus 235 aa)." /db_xref="InterPro:IPR000477" /db_xref="UniProtKB/TrEMBL:A0A1R3XU78" /protein_id="SIT98452.1" /translation="MSSITVSVDPVDPVDPVDPVDPVDPVDAVVAAGSDGLTVARIES EIGALEFLNELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVADRVVQAALKL VLEPIFETDFEPVSYGFRPARRAHDTIAEIHLFGTQEYRWVLDADIKACFDRIDHADL MDRVRHRIKDKRVLRLVNWQRIRHRWNWTDVRRWLTDPTGRWHPISADGITLFNPAAV PIRRYRYRGNTIPTPWTQAV" repeat_region 80272..80586 /rpt_family="REP" /note="REP'-1, len: 315 nt. Equivalent to REP', len: 315 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 315 nt overlap). Probable pseudogene fragment,len: 105 aa; similar to many Mycobacterium tuberculosis proteins inside REP13E12 elements eg. TR:Q50655 (EMBL:Z95390) MTCY13E12.20 (317 aa), FASTA scores; opt: 324 z-score: 432.5 E(): 6.8e-17, 43.4% identity in 99 aa overlap, but no possible startsite." CDS 80660..81709 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0073" /product="PROBABLE GLUTAMINE-TRANSPORT TRANSMEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb0073, -, len: 349 aa. Equivalent to Rv0072, len: 349 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 349 aa overlap). Probable glutamine-transport transmembrane protein ABC-transporter (see citation below), showing weak similarity to NP_465894.1|NC_003210 protein similar to putative ABC-transporter transmembrane subunit from Listeria monocytogenes EGD-e (367 aa); NP_471800.1|NC_003212 protein similar to putative ABC-transporter transmembrane subunit from Listeria innocua (367 aa); E1204111|AJ003195 MEMBRANE SPANNING SUBUNIT DEVC from ANABAENA VARIABILIS (385 aa), FASTA scores: opt: 155, E(): 8.1e-07, (22.0% identity in 381 aa overlap). Also highly similar to Rv2563|Y0A5_MYCTU|Q50735|MTCY9C4.05c from Mycobacterium tuberculosis (388 aa), FASTA scores: E(): 0, (76.2% identity in 349 aa overlap). Note that supposed act with near ORF Rv0073|MTV030.17 ATP-binding protein ABC-transporter. Protein product from Mb0073 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0073 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247775.1" /translation="MLFAALRDMQWRKRRLVITIISTGLIFGMTLVLTGLANGFRVEA RHTVDSMGVDVFVVRSGAAGPFLGSIPFPDVDLARVAAEPGVMAAAPLGSVGTIMKEG TSTRNVTVFGAPEHGPGMPRVSEGRSPSKPDEVAASSTMGRHLGDTVEVGARRLRVVG IVPNSTALAKIPNVFLTTEGLQKLAYNGQPNITSIGIIGMPRQLPEGYQTFDRVGAVN DLVRPLKVAVNSISIVAVLLWIVAVLIVGSVVYLSALERLRDFAVFKAIGTPTRSIMA GLALQALVIALLAAVVGVVLAQVLAPLFPMIVAVPVGAYLALPVAAIVIGLFASVAGL KRVVTVDPAQAFGGP" CDS 81712..82554 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0074" /product="PROBABLE GLUTAMINE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER GLNQ" /note="Mb0074, -, len: 280 aa. Equivalent to 5' end of Rv0073, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (94.9% identity in 275 aa overlap). Probable glutamine-transport ATP-binding protein ABC-transporter (see citation below), similar to many ATP-binding proteins e.g. NP_070646.1|NC_000917 ABC transporter ATP-binding protein from Archaeoglobus fulgidus (231 aa); T34822 ABC-transporter ATP binding protein from Streptomyces coelicolor (230 aa); YBJZ_ECOLI|P75831 hypothetical ABC transporter ATP-binding protein from Escherichia coli (648 aa), FASTA scores: opt: 531, E(): 6.8e-30, (38.6% identity in 233 aa overlap); etc. Also highly similar to Y0A4_MYCT|Q50734|MTCY9C4.04c hypothetical ABC transporter ATP-binding protein from Mycobacterium tuberculosis (330 aa), FASTA scores: E(): 0, (83.3% identity in 330 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature, and PS00889 Cyclic nucleotide-binding domain signature 2. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that supposed act with near ORF Rv0072|MTV030.16 transmembrane ABC-transporter. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0073 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits Rv0073 into 2 parts, Mb0074 and Mb0075. Mb0074 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247776.1" /translation="MGDLSIQNLVVEYYSGGYALRPINGLNLDVAAGSLVMLLGPSGC GKTTLLSCLGGILRPKSGAIKFDEVDITTLQGAELANYRRNKVGIVFQAFNLVPSLTA VENVMVPLRSAGMSRRASRRRAEELLARVNLAERMNHRPGDLSGGQQQRVAVARAIAL DPPLILADEPTAHLDFIQVEEVLRLIRELADGERVVVVATHDSRMLPMADRVVELTPD FAETNRPPETVHLQAGEVLFEQSTMGDLIYVVSEGEFEIVHDWPTAVRNWSRLPGRGI TSAR" CDS 82557..82703 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0075" /product="PROBABLE GLUTAMINE-TRANSPORT PROTEIN ABC TRANSPORTER GLNQ" /note="Mb0075 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing.,Mb0075, -, len: 48 aa. Equivalent to 3' end of Rv0073, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 70 aa overlap). Probable glutamine-transport ATP-binding protein ABC-transporter (see citation below), similar to many ATP-binding proteins e.g. NP_070646.1|NC_000917 ABC transporter ATP-binding protein from Archaeoglobus fulgidus (231 aa); T34822 ABC-transporter ATP binding protein from Streptomyces coelicolor (230 aa); YBJZ_ECOLI|P75831 hypothetical ABC transporter ATP-binding protein from Escherichia coli (648 aa), FASTA scores: opt: 531, E(): 6.8e-30, (38.6% identity in 233 aa overlap); etc. Also highly similar to Y0A4_MYCT|Q50734|MTCY9C4.04c hypothetical ABC transporter ATP-binding protein from Mycobacterium tuberculosis (330 aa), FASTA scores: E(): 0, (83.3% identity in 330 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature, and PS00889 Cyclic nucleotide-binding domain signature 2. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that supposed act with near ORF Rv0072|MTV030.16 transmembrane ABC-transporter. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0073 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits Rv0073 into 2 parts, Mb0074 and Mb0075. Protein product from Mb0075 detected using SWATH mass spectrometry." /protein_id="CAB5247777.1" /translation="MLFHLPRSATVRARSDATAVGYTVQAFRERLGVGGLRDLIEHRA LAND" CDS 82783..84018 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0076" /product="Xaa-Pro dipeptidase (EC" /EC_number="3.4.13.9" /note="Mb0076, -, len: 411 aa. Equivalent to Rv0074, len: 411 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 411 aa overlap). Conserved hypothetical protein, similar to Rv2915c|MTCY338.03c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis, and showing some simlarity to various enzymes or hypothetical proteins from other organisms, eg NP_243801.1|NC_002570 aryldialkylphosphatase from Bacillus halodurans (394 aa); NP_421471.1|NC_002696 putativ Xaa-Pro dipeptidase from Caulobacter crescentus (429 aa); NP_343436.1|NC_002754 Prolidase (Xaa-Pro dipeptidase) (pepQ-like2) from Sulfolobus solfataricus (408 aa); Q50432|M91040 ORGANO PHOSPHATE ACID ANHYDRASE OPAB from MYCOBACTERIUM SP. (409 aa), FASTA scores: opt: 166, E(): 3.9e-11, (31.2% identity in 430 aa overlap); etc. Protein product from Mb0076 detected using SWATH mass spectrometry. Mb0076 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247778.1" /translation="MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRIS AVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRARR HAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLG GVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQ VGLPVTAHAHATAGIAAAVAAGVDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTI PRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSR HGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVW RSGTQVPLQASAVGYNTPS" CDS 84031..85203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0077" /product="PROBABLE AMINOTRANSFERASE" /note="Mb0077, -, len: 390 aa. Equivalent to Rv0075, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 390 aa overlap). Probable aminotransferase (EC 2.6.1.-), similar to many CLASS-II PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES (MALY/PATB SUBFAMILY) e.g. NP_302217.1|NC_002677 aminotransferase from Mycobacterium leprae (402 aa); PATB_BACSU|Q08432 putative aminotransferase b from Bacillus subtilis (387 aa), FASTA scores: opt: 684, E(): 5.4e-33, (31.3% identity in 384 aa overlap); etc. Also similar to several cystathionine beta-lyase (beta C-S lyase) e.g. AAK69425.1|AF276227_1|AF276227 from Corynebacterium glutamicum (368 aa); etc. Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv2294, Rv0858c, etc. Protein product from Mb0077 detected using SWATH mass spectrometry. Mb0077 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247779.1" /translation="MQDSIFNLLTEEQLRGRNTLKWNYFGPDVVPLWLAEMDFPTAPA VLDGVRACVDNEEFGYPPLGEDSLPRATADWCRQRYGWCPRPDWVRVVPDVLKGMEVV VEFLTRPESPVALPVPAYMPFFDVLHVTGRQRVEVPMVQQDSGRYLLDLDALQAAFVR GAGSVIICNPNNPLGTAFTEAELRAIVDIAARHGARVIADEIWAPVVYGSRHVAAASV SEAAAEVVVTLVSASKGWNLPGLMCAQVILSNRRDAHDWDRINMLHRMGASTVGIRAN IAAYHHGESWLDELLPYLRANRDHLARALPELAPGVEVNAPDGTYLSWVDFRALALPS EPAEYLLSKAKVALSPGIPFGAAVGSGFARLNFATTRAILDRAIEAIAAALRDIID" CDS complement(85218..85607) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0078C" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb0078c, -, len: 129 aa. Equivalent to Rv0076c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Probable membrane protein, with membrane-spanning domain at C-terminus. Mb0078c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247780.1" /translation="MPAVTTPSNHWGDERRKLSHQPPVRGQILGRRQARRLSQHFARV GVEAPPKRLQEMLLGAPAADEEWTDVKFALIVTQLNHEKRVAKFHRLQRRATHSLICL GLVLVALNFLICLAYIFFSLTQHAAAL" CDS complement(85671..86501) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0079C" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0079c, -, len: 276 aa. Equivalent to Rv0077c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 276 aa overlap). Possible oxidoreductase (EC 1.-.-.-), weakly similar to others e.g. CAC44600.1|AL596162 putative oxidoreductase from Streptomyces coelicolor (275 aa); P33912|BPA1_STRAU NON-HAEM BROMOPEROXIDASE BPO-A1 (BROMIDE PEROXIDASE) (EC 1.11.1.-) from Streptomyces aureofaciens (275 aa); BPA1_STRAU|P33912 non-haem bromoperoxidase bpo-a1 from Streptomyces aureofaciens (274 aa), FASTA scores: opt: 230, E(): 1.5e-07, (26.1% identity in 249 aa overlap); etc. Also similar to MTCY05A6_35 and MTCY1A11_10 from Mycobacterium tuberculosis. And shows some similarity in part with AAL17935.1|AY054120 putative epoxide hydrolase from Mycobacterium smegmatis (203 aa). Protein product from Mb0079c detected using shotgun mass spectrometry. Mb0079c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247781.1" /translation="MSTIDISAGTIHYEATGPETGRPVVFVHGYMMGGQLWRRVSERL AGRGLRCIAPTWPLGAHPKPLRPGADQTIGGVAGIVADVLAALELKDVVLVGNDTGGV VTQLVAVHYPERLGALVLTSCDAFEHFPPPILKPVILAAKSATLFRAAIQVMRAPAAR NRAYAGLSHHNIDHLTRAWVRPALSNPAIAEDLRQLSLSLRTEVTTAVAARLPEFDKP ALIAWSADDVFFALENGQRLAATIPRARFEVIEGARTFSMVDSPDRLADQLSTVAVRT " CDS 86563..87168 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0080" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0080, -, len: 201 aa. Equivalent to Rv0078, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 201 aa overlap). Probable transcriptional regulator, equivalent to NP_302706.1|NC_002677 putative TetR-family transcriptional regulator from Mycobacterium leprae (236 aa), FASTA scores: opt: 755, E(): 0, (71.4% identity in 175 aa overlap). Also similar to others e.g. NP_103770.1|NC_002678 probable transcriptional regulator from Mesorhizobium loti (208 aa); NP_384275.1|NC_003047 PUTATIVE TRANSCRIPTION REGULATOR PROTEIN from Sinorhizobium meliloti (197 aa); NP_250960.1|NC_002516 probable transcriptional regulator from Pseudomonas aeruginosa (196 aa); etc. Also similar to TETC_ECOLI|P28815 transposon tn10 tetc protein from Escherichia coli (197 aa), FASTA scores: opt: 181, E(): 9.7e-05, (24.8% identity in 165 aa overlap). Contains probable helix-turn-helix motif from aa 35 to 56 (Score 1348, +3.78 SD). Protein product from Mb0080 detected using SWATH mass spectrometry." /protein_id="CAB5247782.1" /translation="MEIKRRTQEERSAATREALITGARKLWGLRGYAEVGTPEIATEA GVTRGAMYHQFADKAALFRDVVEVVEQDVMARMATLVAASGAATPADAIRAAVDAWLE VSGDPEVRQLVLLDAPVVLGWAGFRDVAQRYSLGMTEQLITEAIRAGQLARQPVRPLA QVLIGALDEAAMFIATADDPKRARRETRQVLRRLIDGMLNG" CDS complement(87243..87836) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0081C" /product="HYPOTHETICAL PROTEIN" /note="Mb0081c, -, len: 197 aa. Equivalent to Rv0078A, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Hypothetical unknown protein. Protein product from Mb0081c detected using shotgun mass spectrometry. Mb0081c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247783.1" /translation="MNAVESTLRRVAKDLTGLRQRWALVGGFAVSARSEPRFTRDVDI VVAVANDDAAESLVRQLLTQQYHLLASVEQDAARRLAAVRLGATADTAANVVVDLLFA SCGIEPEIAEAAEEIEILPDLVAPVATTAHLIAMKLLARDDDRRPQDRSDLRALVDAA SPQDIQDARKAIELITLRGFHRDRDLAAEWTRLAAKW" CDS complement(87833..88039) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0081A" /product="Conserved protein" /note="Mb0081A, len: 68 aa. Equivalent to Rv0078B len: 68 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Protein product from Mb0081A detected using SWATH mass spectrometry. Mb0081A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247784.1" /translation="MAVSVAAQKLRLALDMYEVGEQMQRMRLGRERPNADVVEIEAAI DAWRMTRPGAEEGDSAGPTSTRFT" CDS 88239..89060 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0082" /product="Ribosome hibernation promoting factor Hpf" /note="Mb0082, -, len: 273 aa. Equivalent to Rv0079, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Hypothetical unknown protein. Protein product from Mb0082 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0082 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247785.1" /translation="MEPKRSRLVVCAPEPSHAREFPDVAVFSGGRANASQAERLARAV GRVLADRGVTGGARVRLTMANCADGPTLVQINLQVGDTPLRAQAATAGIDDLRPALIR LDRQIVRASAQWCPRPWPDRPRRRLTTPAEALVTRRKPVVLRRATPLQAIAAMDAMDY DVHLFTDAETGEDAVVYRAGPSGLRLARQHHVFPPGWSRCRAPAGPPVPLIVNSRPTP VLTEAAAVDRAREHGLPFLFFTDQATGRGQLLYSRYDGNLGLITPTGDGVADGLA" CDS 89057..89515 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0083" /product="Putative DNA-binding protein" /note="Mb0083, -, len: 152 aa. Equivalent to Rv0080, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 152 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins from Streptomyces coelicolor e.g. SCJ12.26|AL109989|SCJ12_24 from Streptomyces coelicolor cosmid J1 (137 aa), FASTA scores: opt: 291, E(): 4e-13, (46.5% identity in 129 aa overlap); etc. Protein product from Mb0083 detected using SWATH mass spectrometry. Mb0083 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247786.1" /translation="MSPGSRRASPQSAREVVELDRDEAMRLLASVDHGRVVFTRAALP AIRPVNHLVVDGRVIVRTRLTAKVSVAVRSSADAGVVVAYEADDLDPRRRTGWSVVVT GLATEVSDPEQVARYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSRTP" CDS 89610..89954 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0084" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0084, -, len: 114 aa. Equivalent to Rv0081, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Probable transcriptional regulator, highly similar to others e.g. AL078610|SCH35_52|T36657 probable transcription regulator from Streptomyces coelicolor (117 aa), FASTA scores: opt: 404, E(): 4.8e-22, (58.2% identity in 110 aa overlap); AAG02351.1|AF210249_10|AF210249 metal-dependent regulatory protein from Streptomyces verticillus (113 aa); NP_435817.1|NC_003037 Putative transcriptional regulator from Sinorhizobium meliloti (115 aa); etc. Protein product from Mb0084 detected using shotgun mass spectrometry and SWATH mass spectrometry." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247787.1" /translation="MESEPLYKLKAEFFKTLAHPARIRILELLVERDRSVGELLSSDV GLESSNLSQQLGVLRRAGVVAARRDGNAMIYSIAAPDIAELLAVARKVLARVLSDRVA VLEDLRAGGSAT" CDS 89959..90438 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0085" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0085, -, len: 159 aa. Equivalent to Rv0082, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 159 aa overlap). Probable oxidoreductase (EC 1.-.-.-), highly highly similar or similar to other various oxidoreductases e.g. NP_143304.1|NC_000961 NADH-ubiquinone oxidoreductase subunit from Pyrococcus horikoshii (173 aa); NP_126406.1|NC_000868 CO-induced hydrogenase related, subunit L from Pyrococcus abyssi (170 aa); HYCG_ECOLI|P16433 formate hydrogenlyase subunit 7 from Escherichia coli (255 aa), FASTA scores: opt: 442, E(): 8e-29, (43.2% identity in 148 aa overlap); etc. Protein product from Mb0085 detected using SWATH mass spectrometry." /protein_id="CAB5247788.1" /translation="MGWVAKIFRVGRVVEPAAPLPAAIAEPPAGVRGSLQIRHVDAGS CNGCEVEISGAFGPVYDAERFGARLVASPRHADALLVTGVVTHNMAGPLRKTLEATPR PRVVIACGDCALNRGVFADAYGVVGAVGEVVPVDVEIAGCPPTPAAIMAALRSVTGK" CDS 90435..92363 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0086" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0086, -, len: 642 aa. Equivalent to Rv0083, len: 640 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 640 aa overlap). Probable oxidoreductase (EC 1.-.-.-), showing some similarity to other various oxidoreductases e.g. AAK06855.1|AF335723_1|AF335723 hydrogenase-4 component B from Burkholderia pseudomallei (668 aa); HYFB_ ECOLI|P23482 hydrogenase-4 component b from Escherichia coli strain K12 (672 aa), FASTA scores: opt: 995, E(): 0, (32.2% identity in 571 aa overlap); AAF13041.1|AF157639_1|AF157639 putative formate hydrogenlyase integral membrane subunit from Desulfitobacterium dehalogenans (637 aa); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (a-t) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (642 aa versus 640 aa). Protein product from Mb0086 detected using SWATH mass spectrometry." /protein_id="CAB5247789.1" /translation="MTAAPTAGGVVTSGVGVAGVGVGLLGMFGPVRVVHVGWLLPLSG VHIELDRLGGFFMALTGAVAAPVGCYLIGYVRREHLGRVPMAVVPLFVAAMLLVPAAG SVTTFLLAWELMAIASLILVLSEHARPQVRSAGLWYAVMTQLGFIAILVGLVVLAAAG GSDRFAGLGAVCDGVRVAVFMLTLVGFGSKAGLVPLHAWLPRAHPEAPSPVSALMSAA MVNLGIYGIVRFDLQLLGPGPRWWGLALLAVGGTSALYGVLQASVAADLKRLLAYSTT ENMGLITLALGAATLFADTGAYGPASIAAAAAMLHMIAHAAFKSLAFMAAGSVLAATG LRDLDLLGGLARRMPATTVFFGVAALGACGLPLGAGFVSEWLLVQSLIHAAPGHDPIV ALTTPLAVGVVALATGLSVAAMTKAFGIGFLARPRSTQAEAAREAPASMRAGMAIAAG ACLVLAVAPLLVAPMVRRAAATLPAAQAVKFTGLGAVVRLPAMSGSIAPGVIAAAVLA AALAVAVLARWRFRRRPAPARLPLWACGAADLTVRMQYTATSFAEPLQRVFGDVLRPD TDIEVTHTAESRYMAERITYRTAVADAIEQRLYTPVVGAVAAMAELLRRAHTGSVHRY LAYGALGVLIVLVVARCT" CDS 92363..93313 /codon_start=1 /transl_table=11 /gene="hycD" /locus_tag="BQ2027_MB0087" /product="POSSIBLE FORMATE HYDROGENLYASE HYCD (FHL)" /note="Mb0087, hycD, len: 316 aa. Equivalent to Rv0084, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 316 aa overlap). Possible hycD (alternate gene name: hevD), formate hydrogenlyase (EC 1.-.-.-), integral membrane protein, similar to others e.g. HYCD_ECOLI|P16430 formate hydrogenlyase subunit 4 from Escherichia coli (307 aa), FASTA scores: opt: 570, E(): 2.1e-26, (33.8% identity in 305 aa overlap); AAK06856.1|AF335723_2|AF335723 formate hydrogenlyase subunit 4 from Burkholderia pseudomallei (316 aa); NP_457244.1|NC_003198 formate hydrogenlyase subunit 4 from Salmonella enterica subsp. enterica serovar Typhi (307 aa); etc. Also similar to NUOH_ECOLI|P33603 NADH dehydrogenase I chain H from Escherichia coli (325 aa), FASTA scores: opt: 207, E(): 9.5e-06, (26.5% identity in 260 aa overlap). BELONGS TO THE COMPLEX I SUBUNIT 1 FAMILY. Mb0087 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247790.1" /translation="MSYLAGAAQIGGVMVGAPLVIGMTRQVRARWEGRAGAGLLQPWR DLLKQLGKQQITPAGTTIVFAAAPVIVAGTTLLIAAIAPLVATGSPLDPSADLFAVVG LLFLGTVALTLAGIDTGTSFGGMGASREITIAALVEPTILLAVFALSIPAGSANLGAL VASTIDHPGHVVSLAGVLAFVALVIVIVAETGRLPVDNPATHLELTMVHEAMVLEYAG PRLALVEWAAGMRLTVLLALLANLFLPWGIAGAAPTALDVLTGVVAVAAKVAILAVLL ATFEVFLAKLRLFRVPELLAGSFLLALLAVTAANFFTVGA" CDS 93324..93986 /codon_start=1 /transl_table=11 /gene="hycP" /locus_tag="BQ2027_MB0088" /product="POSSIBLE HYDROGENASE HYCP" /note="Mb0088, hycP, len: 220 aa. Equivalent to Rv0085, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Possible hycP, hydrogenase (EC 1.-.-.-), integral membrane protein, weakly similar to P77524|HYFE_ECOLI HYDROGENASE-4 COMPONENT E from Escherichia coli (216 aa), FASTA scores: opt: 204, E():1.2e-07, (25.5% identity in 216 aa overlap). Protein product from Mb0088 detected using SWATH mass spectrometry." /protein_id="CAB5247791.1" /translation="MSNANFSILVDFAAGGLVLASVLIVWRRDLRAIVRLLAWQGAAL AAIPLLRGIRDNDRALIAVGIAVLALRALVLPWLLARAVGAEAAAQREATPLVNTASS LLITAGLTLTAFAITQPVVNLEPGVTINAVPAAFAVVLIALFVMTTRLHAVSQAAGFL MLDNGIAATAFLLTAGVPLIVELGASLDVLFAVIVIGVLTGRLRRIFGDADLDKLREL RD" CDS 93986..95452 /codon_start=1 /transl_table=11 /gene="hycQ" /locus_tag="BQ2027_MB0089" /product="POSSIBLE HYDROGENASE HYCQ" /note="Mb0089, hycQ, len: 488 aa. Equivalent to Rv0086, len: 488 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 488 aa overlap). Possible hycQ, hydrogenase (EC 1.-.-.-), integral membrane protein, weakly similar to P77437|HYFF_ECOLI HYDROGENASE-4 COMPONENT F from Escherichia coli (526 aa), FASTA scores: opt: 948, E(): 0, (35.9% identity in 493 aa overlap); and AAK06855.1|AF335723_1|AF335723 hydrogenase-4 component B from Burkholderia pseudomallei (668 aa). Also similar to d9087711 & NUOL_ECOLI|P33607 NADH dehydrogenase I chain L from Escherichia coli (613 aa), FASTA scores: opt: 360, E():3.2e-13, (27.9% identity in 488 aa overlap); and to NUON_ECOLI|P33608 NADH dehydrogenase I chain N from Escherichia coli (425 aa), FASTA scores: opt: 375, E(): 3.9e-14, (25.0% identity in 432 aa overlap). Mb0089 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247792.1" /translation="MTGLLLAAILAPLAASIASLITGWRRTTATLTALSATTVLACAV AMGFWMGSGAQFGLGGLLRADALTVVMLVVIGIVGTLATAASIGYIDTELAHGHIDGR SARLYGVLTPAFLCATVLAVCANNIGVIWVAIEATTVITAFLVGHRRTRTALEATWKY VVICSVGIAVAFLGTVLLYFAARDSGAAAAGALNLDILAEHAAGLDPGVARLAGGLLL IGYGAKAGLFPFHTWLADAHSQAPAPVSALMSGVLLAVAFSVLIRLRPILDAVSGPAY LRNGLLVVGLATLLVAVLMLTVTGDVKRMLAYSSMEHMGLIAIAAAAGTTLAIAALLL HVLAHGIGKTVLFLAGGQLQAAHDSTAIADITGVMRRSRLIGVSFAVGLIVLLGLPPF AMFASELAIARSLANERLAWVLGAALLLIAIGFTALARNSGRMLLGTPAAGAPAITVP ATAAAALMVGIVVSAALGITAGPLADLLGIAASNVGLP" CDS 95449..96927 /codon_start=1 /transl_table=11 /gene="hycE" /locus_tag="BQ2027_MB0090" /product="possible formate hydrogenase hyce (fhl)" /note="Mb0090, hycE, len: 492 aa. Equivalent to Rv0087, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 492 aa overlap). Possible hycE (alternate gene name: hevE), formate hydrogenlyase (EC 1.-.-.-), similar to others e.g. HYCE_ECOLI|P16431 formate hydrogenlyase subunit 5 from Escherichia coli (569 aa), FASTA scores: opt: 680, E(): 1.8e-38, (31.2% identity in 449 aa overlap); NP_457243.1|NC_003198 formate hydrogenlyase subunit 5 from Salmonella enterica subsp. enterica serovar Typhi (569 aa); NP_275541.1|NC_000916 formate hydrogenlyase subunit 5 from Methanothermobacter thermautotrophicus (370 aa); etc. Also some similarity with NUOD_ECOLI|P33600 NADH dehydrogenase I chain D from Escherichia coli (407 aa), FASTA scores: opt: 245, E(): 8.9e-10, (24.5% identity in 368 aa overlap). BELONGS TO THE COMPLEX I 49 kDa SUBUNIT FAMILY. Protein product from Mb0090 detected using SWATH mass spectrometry." /protein_id="CAB5247793.1" /translation="MMSASWLRHRVSERGLIATAEQLWADSFRLALVAAHDDGDSLRV VYLFLAGYPDRRVELEYVVPADNPEIRSLAYLSFPAGRFEREMADLYGIRPVGHPKPR RLVRHAHWPDWHPMRTDAGPAPEFTDTGAFPFLAVEGPGVYEIPVGPVHAGLIEPGHF RFSVAGETIVRLKARLWFVHRGIEKLFHGRPATAAVDLAERISGDTSAAHALAHSLAI EDALGIELPHEIHRLRALIVELERLYNHAADLGALANDVGYSLANAHAQRIRENLLRR NAAVTGHRLLRGAIRAGGVALRALPDTDELAALAVDLAEVATLTLANSVVYDRFAGTA VLHPDDASALGCLGYVARASGLRSDARVEHPTIVLPITEIGAPDGDVLARYTVRRDEF AASAALAQHIVESHTGPIEYAATLHPVGAPSSGIGIVEGWRGTIVHRVEIDVDGRITR AKVVDPSWFNWPALPVAMADTIVPDFPLANKSFNQSYAGNDL" CDS 96962..97636 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0091" /product="possible polyketide cyclase/dehydrase" /note="Mb0091, -, len: 224 aa. Equivalent to Rv0088, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Hypothetical unknown protein. Protein product from Mb0091 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0091 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247794.1" /translation="MSVYKHAPSRVRLRQTRSTVVKGRSGSLSWRRVRTGDLGLAVWG GREEYRAVKPGTPGIQPKGDMMTVTVVDAGPGRVSRSVEVAAPAAELFAIVADPRRHR ELDGSGTVRGNIKVPAKLVVGSKFSTKMKLFGLPYRITSRVTALKPNELVEWSHPLGH RWRWEFESLSPTLTRVTETFDYHAAGAIKNGLKFYEMTGFAKSNAAGIEATLAKLSDQ YARGRA" CDS 97793..98386 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0092" /product="POSSIBLE METHYLTRANSFERASE/METHYLASE" /note="Mb0092, -, len: 197 aa. Equivalent to Rv0089, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Possible methyltransferase (EC 2.1.1.-), showing some weak similarity to others e.g. NP_299749.1|NC_002488 3-demethylubiquinone-9 3-methyltransferase from Xylella fastidiosa 9a5c (246 aa); CAC44277.1| (AL596030) putative methyltransferase from Streptomyces coelicolor (285 aa); NP_111415.1|NC_002689 Predicted SAM-dependent methyltransferase from Thermoplasma volcanium (245 aa); etc. Also some similarity with many biotin biosynthesis proteins e.g. P12999|BIOC_ECOLI|B0777 BIOTIN SYNTHESIS PROTEIN from Escherichia coli (251 aa), FASTA scores: opt: 202, E(): 4.5e-07, (39.0% identity in 118 aa overlap); etc. BELONGS TO THE METHYLTRANSFERASE SUPERFAMILY. Mb0092 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247795.1" /translation="MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRI PYVTAVDIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIEDTR TALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGKWEHSAPIKWP PPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV" CDS 98515..99285 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0093" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0093, -, len: 256 aa. Equivalent to Rv0090, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Possible membrane protein." /protein_id="CAB5247796.1" /translation="MAKNQNRIRNRWELITCGLGGHVTYAPDDAALAARLRASTGLGE VWRCLRCGDFALGGPQGRGAPEDAPLIMRGKALRQAIIIRALGVERLVRALVLALAAW AVWEFRGARGAIQATLDRDLPVLRAAGFKVDQMTVIHALEKALAAKPSTLALITGMLA AYAVLQAVEGVGLWLLKRWGEYFAVVATSIFLPLEVHDLAKGITTTRVVTFSINVAAV VYLLISKRLFGVRGGRKAYDVERRGEQLLDLERAAMLT" CDS 99719..100486 /codon_start=1 /transl_table=11 /gene="mtn" /locus_tag="BQ2027_MB0094" /product="probable bifunctional mta/sah nucleosidase mtn: 5'-methylthioadenosine nucleosidase (methylthioadenosine methylthioribohydrolase) + s-adenosylhomocysteine nucleosidase (s-adenosyl-l-homocysteine homocysteinylribohydrolase)" /note="Mb0094, mtn, len: 255 aa. Equivalent to Rv0091, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Probable mtn (alternate gene name: pfs), MTA/SAH nucleosidase, including 5'-methylthioadenosine nucleosidase (EC 3.2.2.16) and S-adenosylhomocysteine nucleosidase (EC 3.2.2.9), similar to others e.g. NP_521493.1|NC_003295 PROBABLE BIFUNCTIONAL PROTEIN (MTA/SAH NUCLEOSIDASE) (P46): 5'-METHYLTHIOADENOSINE NUCLEOSIDASE AND S-ADENOSYLHOMOCYSTEINE NUCLEOSIDASE from Ralstonia solanacearum (261 aa); AAC45731.1|U55214 Pfs from Treponema pallidum (249 aa); P96122|MTN_TREPA MTA/SAH NUCLEOSIDASE from Treponema pallidum (269 aa); PFS_ECOLI|P24247 pfs protein (p46) from Escherichia coli (232 aa), FASTA scores: opt: 214, E(): 3.8e-08, (30.5% identity in 246 aa overlap); etc. BELONGS TO THE MTN FAMILY. Protein product from Mb0094 detected using SWATH mass spectrometry. Mb0094 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247797.1" /translation="MAVTVGVICAIPQELAYLRGVLVDAKRQQVAQILFDSGQLDAHR VVLAAAGMGKVNTGLTATLLADRFGCRTIVFTGVAGGLDPELCIGDIVIADRVVQHDF GLLTDERLRPYQPGHIPFIEPTERLGYPVDPAVIDRVKHRLDGFTLAPLSTAAGGGGR QPRIYYGTILTGDQYLHCERTRNRLHHELGGMAVEMEGGAVAQICASFDIPWLVIRAL SDLAGADSGVDFNRFVGEVAASSARVLLRLLPVLTAC" CDS 100618..102903 /codon_start=1 /transl_table=11 /gene="ctpA" /locus_tag="BQ2027_MB0095" /product="cation transporter p-type atpase a ctpa" /note="Mb0095, ctpA, len: 761 aa. Equivalent to Rv0092, len: 761 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 761 aa overlap). Probable ctpA, cation-transporting P-type ATPase A (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. CTPA_MYCLE|P46839 cation-transporting P-type ATPase A from Mycobacterium leprae (780 aa), FASTA scores: opt: 3454, E(): 0, (74.4% identity in 741 aa overlap); CAB66270.1|AL136519 probable cation-transporting P-type ATPase from Streptomyces coelicolor (760 aa); NP_391230.1|NC_000964 protein similar to heavy metal-transporting ATPase from Bacillus subtilis (803 aa); etc. Also highly similar to MTCY251.22c from Mycobacterium tuberculosis, FASTA score: (68.3% identity in 742 aa overlap). Contains PS01047 Heavy-metal-associated domain, and PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Protein product from Mb0095 detected using SWATH mass spectrometry. Mb0095 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247798.1" /translation="MTAAVTGEHHASVQRIQLRISGMSCSACAHRVESTLNKLPGVRA AVNFGTRVATIDTSEAVDAAALCQAVRRAGYQADLCTDDGRSASDPDADHARQLLIRL AIAAVLFVPVADLSVMFGVVPATRFTGWQWVLSALALPVVTWAAWPFHRVAMRNARHH AASMETLISVGITAATIWSLYTVFGNHSPIERSGIWQALLGSDAIYFEVAAGVTVFVL VGRYFEARAKSQAGSALRALAALSAKEVAVLLPDGSEMVIPADELKEQQRFVVRPGQI VAADGLAVDGSAAVDMSAMTGEAKPTRVRPGGQVIGGTTVLDGRLIVEAAAVGADTQF AGMVRLVEQAQAQKADAQRLADRISSVFVPAVLVIAALTAAGWLIAGGQPDRAVSAAL AVLVIACPCALGLATPTAMMVASGRGAQLGIFLKGYKSLEATRAVDTVVFDKTGTLTT GRLQVSAVTAAPGWEADQVLALAATVEAASEHSVALAIAAATTRRDAVTDFRAIPGRG VSGTVSGRAVRVGKPSWIGSSSCHPNMRAARRHAESLGETAVFVEVDGEPCGVIAVAD AVKDSARDAVAALADRGLRTMLLTGDNPESAAAVATRVGIDEVIADILPEGKVDVIEQ LRDRGHVVAMVGDGINDGPALARADLGMAIGRGTDVAIGAADIILVRDHLDVVPLALD LARATMRTVKLNMVWAFGYNIAAIPVAAAGLLNPLVAGAAMAFSSFFVVSNSLRLRKF GRYPLGCGTVGGPQMTAPSSA" CDS complement(102850..103698) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0096C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0096c, -, len: 282 aa. Equivalent to Rv0093c, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 282 aa overlap). Probable conserved membrane protein, equivalent only to CAC30943.1|AL583924 probable integral membrane protein from Mycobacterium leprae (237 aa). Protein product from Mb0096c detected using SWATH mass spectrometry. Mb0096c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247799.1" /translation="MLAQATTAGSFNHHASTVLQGRRGVPAAMWSEPAGAIRRHCATI DGMDCEVAREALSARLDGERAPVPSARVDEHLGECSACRAWFTQVASQAGDLRRLAES RPVVPPVGRLGIRRAPRRQHSPMTWRRWALLCVGIAQIALGTVQGFGLDVGLTHQHPT GAGTHLLNESTSWSIALGVIMVGAALWPSAAAGLAGVLTAFVAILTGYVIVDALSGAV STTRILTHLPVVIGAVLAIMVWRSASGPRPRPDAVAAEPDIVLPDNASRGRRRGHLWP TDGSAA" CDS complement(103745..104464) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0097C" /product="13E12 repeat family protein" /note="Mb0097c, -, len: 239 aa. Equivalent to Rv0094c, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (72% identity in 317 aa overlap). Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap)." /protein_id="CAB5247800.1" /translation="MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGL LAGLRALIASGELGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMTSH AHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTKPGCDAPAYHS QAHHVTGWTSTGRTDITDLTLACDPDNRLAEKGWTTRKNTHGHTEWLPPPHLDHGQPW TNTFHHPERFLHNQDDDDKPD" repeat_region complement(103748..105251) /rpt_family="REP" /note="REP-2, len: 1504 nt. Equivalent to REP, len: 1503 nt, from Mycobacterium tuberculosis strain H37Rv, (97.0% identity in 1504 nt overlap). REP251, member of REP13E12 family." CDS complement(104469..105251) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0098C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0098c, -, len: 260 aa. Equivalent to Rv1588 and Rv0095c, len: 222 aa and 136 aa, from Mycobacterium tuberculosis strain H37Rv, (98.2% identity in 222 aa overlap and 92.5% identity in 134 aa overlap). Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0095c and Rv1588 exist as separate genes in their respective positions. In Mycobacterium bovis, a 73 bp substitution leads to a single combined gene. Mb0098c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247801.1" /translation="MRYLPVSTRRIWVNPLCHFSFTVISGALFVSARRYDSNMLANSR EELVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLECLVRRLPAVGHTLINQLDTQ ASEEELGGTLCCALANRLRITKPDAALRIADAADLGPRRALTGEPLAPQLTATATAQR QGLIGEAHIKVIRALFRPPARRGGCVHPPGRRSRPGRQSRSISSRRAGPLRPAGHGLA TPRRRPHRHRTRPQTRHHPEQPAIRRHVTAKWLPDPPSAGHL" CDS 105360..106751 /codon_start=1 /transl_table=11 /gene="PPE1" /locus_tag="BQ2027_MB0099" /product="ppe family protein ppe1" /note="Mb0099, PPE1, len: 463 aa. Equivalent to Rv0096, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 463 aa overlap). Member of the Mycobacterium tuberculosis PPE family, similar to many e.g. Z46257|MLACEA_3 aceA gene for isocitrate L from M. leprae (438 aa), FASTA scores: opt: 1207, E(): 0, (55.3% identity in 380 aa overlap). Also similar to Z97559|MTCY261_19 from Mycobacterium tuberculosis (473 aa), FASTA score: (40.2% identity in 478 aa overlap); YHS6_MYCTU|P42611 hypothetical 50.6 kd protein (517 aa), FASTA scores: opt: 365, E(): 4.6e-12, (37.6% identity in 178 aa overlap). Also similar to MTCY274.23c from M. tuberculosis FASTA score: (31.1% identity in 383 overlap). Some similarity also to MTCY31.06c and MTCY48.17 and other mycobacterial PPE family proteins." /protein_id="CAB5247802.1" /translation="MAIPPEVHSGLLSAGCGPGSLLVAAQQWQELSDQYALACAELGQ LLGEVQASSWQGTAATQYVAAHGPYLAWLEQTAINSAVTAAQHVAAAAAYCSALAAMP TPAELAANHAIHGVLIATNFFGINTVPIALNEADYVRMWLQAADTMAAYQAVADAATV AVPSTQPAPPIRAPGGDAADTWLDVLSSIGQLIRDILDFIANPYKYFLEFFEQFGFSP AVTVVLALVALQLYDFLWYPYYASYGLLLLPFFTPTLSALTALSALIHLLNLPPAGLL PIAAALGPGDQWGANLAVAVTPATAAVPGGSPPTSNPAPAAPSSNSVGSASAAPGISY AVPGLAPPGVSSGPKAGTKSPDTAADTLATAGAARPGLARAHRRKRSESGVGIRGYRD EFLDATATVDAATDVPAPANAAGSQGAGTLGFAGTAPTTSGAAAGMVQLSSHSTSTTV PLLPTTWTTDAEQ" CDS 106770..107639 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0100" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0100, -, len: 289 aa. Equivalent to Rv0097, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Possible oxidoreductase (EC 1.-.-.-), equivalent to NP_302343.1|NC_002677 putative oxidoreductase from Mycobacterium leprae (289 aa). Also highly similar to BAB69377.1|AB070955 putative oxidoreductase from Streptomyces avermitilis (296 aa); and weakly similar to others e.g. NP_518867.1|NC_003295 PUTATIVE ALPHA-KETOGLUTARATE-DEPENDENT TAURINE DIOXYGENASE OXIDOREDUCTASE PROTEIN from Ralstonia solanacearum (301 aa); NP_286110.1|NC_002655 taurine dioxygenase (2-oxoglutarate-dependent) from Escherichia coli strain O157:H7 (283 aa); NP_252624.1|NC_002516 taurine dioxygenase from Pseudomonas aeruginosa (277 aa); ECAE00014310 (283 aa), FASTA scores: opt: 304, E(): 2.6e-13, (27.8% identity in 288 aa overlap); TFDA_ALCEU|P10088 2,4-dichlorophenoxyacetate monooxygenase from A. eutropha (287 aa), FASTA scores: opt: 188, E(): 3.5e-06, (26.6% identity in 188 aa overlap); etc. Contains PS00077 Cytochrome c oxidase subunit I, copper B binding region signature. Protein product from Mb0100 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0100 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247803.1" /translation="MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKD VHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDYMF MPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIR PSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDP EVLQELMAATGQLDPEYQSPFIHTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRL TMLDGLKTPGYAA" CDS 107636..108187 /codon_start=1 /transl_table=11 /gene="fcot" /locus_tag="BQ2027_MB0101" /product="probable fatty acyl coa thioesterase type iii fcot" /note="Mb0101, -, len: 183 aa. Equivalent to Rv0098, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Conserved hypothetical protein, equivalent to CAC30948.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (183 aa). Also some similarity with BAB69378.1|AB070955 hypothetical protein from Streptomyces avermitilis (172 aa). Protein product from Mb0101 detected using SWATH mass spectrometry. Mb0101 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247804.1" /translation="MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDA QYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVL RGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCV IEFWDENGGAASGEIELAALNIP" CDS 108192..109814 /codon_start=1 /transl_table=11 /gene="fadD10" /locus_tag="BQ2027_MB0102" /product="POSSIBLE FATTY-ACID-COA LIGASE FADD10 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0102, fadD10, len: 540 aa. Equivalent to Rv0099, len: 540 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 540 aa overlap). Possible fadD10, fatty-acid-CoA synthetase (EC 6.2.1.-), equivalent to MLACEA_4|Q50176 LONG CHAIN FATTY ACID-COA LIGASE from Mycobacterium leprae (532 aa), FASTA scores: opt: 2580, E(): 0, (74.6% identity in 531 aa overlap). Also similar to many e.g. BAB69379.1|AB070955 long-chain fatty acid--CoA ligase from Streptomyces avermitilis (518 aa); NP_419782.1|NC_002696 putativ long-chain-fatty-acid--CoA ligase from Caulobacter crescentus (530 aa); NP_435326.1|NC_003037 probable long chain fatty acid CoA ligase from Sinorhizobium meliloti (508 aa); etc. Also similar to ACSA_BACSU|P39062 acetyl-coenzyme A synthetase from Bacillus subtilis (572 aa), FASTA scores: opt: 415, E(): 9.8e-20, (27.1% identity in 539 aa overlap). Contains PS00455 putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb0102 detected using SWATH mass spectrometry. Mb0102 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247805.1" /translation="MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRY RELVAEVGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNLPI AAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRESEHSLDAASL AGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDILQKEGLNWVTWVVGETTY SPLPATHIGGLWWILTCLMHGGLCVTGGENTTSLLEILTTNAVATTCLVPTLLSKLVS ELKSANATVPSLRLVGYGGSRAIAADVRFIEATGVRTAQVYGLSETGCTALCLPTDDG SIVKIEAGAVGRPYPGVDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNP ERTAEVLIDGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPMARPSTIVI VTDIPRTQSGKVMRASLAAAATADKARVVVRG" CDS 109819..110055 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0103" /product="pp-binding family protein Rv0100" /note="Mb0103, -, len: 78 aa. Equivalent to Rv0100, (MTCY251.19), len: 78 aa. Conserved hypothetical protein, equivalent only to CAC30950.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (78 aa). Mb0103 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247806.1" /translation="MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQ LGVNRQSELPSRLAANPSIAGWLRELEAVCTEFG" CDS 110037..117575 /codon_start=1 /transl_table=11 /gene="nrp" /locus_tag="BQ2027_MB0104" /product="PROBABLE PEPTIDE SYNTHETASE NRP (PEPTIDE SYNTHASE)" /note="Mb0104, nrp, len: 2512 aa. Equivalent to Rv0101, len: 2512 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 2512 aa overlap). Probable nrp, peptide synthetase (EC 6.-.-.-), similar to others e.g. AAD44234.1|AF143772_40|PstB peptide synthetase from Mycobacterium avium (2552 aa); 7476034|S77657 cyclic peptide synthetase from Mycobacterium leprae (1401 aa), FASTA scores: opt: 4268, E(): 0, (65.7% identity in 1091 aa overlap); part of CAB55600.1|AJ238027 peptide synthetase from Mycobacterium smegmatis (5990). Also similar to e.g. AAD56240.1|AF184977_1|AF184977 DhbF protein from Bacillus subtilis (2378 aa); SRF1_BACSU|P27206 surfactin synthetase subunit 1 (3587 aa), FASTA scores: opt: 1708, E(): 0, (30.6% identity in 1633 aa overlap): etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), 2 x PS00455 Putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. THOUGHT TO BE NOT INVOLVED IN MYCOBACTIN BIOSYNTHESIS (see citation below). Protein product from Mb0104 detected using shotgun mass spectrometry. Mb0104 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247807.1" /translation="MHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLA ALHATVLDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQSTWCSGILGKPL VRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARYLTTDPAGETPSVGAGLAK LREAHRRETAKVEESRGRLSAVVQRELADEAYHGGHGHSVSDAPGTAAKGVLHESATI CGNAFDAILTLSEAQRVPLNVLVAAAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVA TCLVNSVAQTVRFPPFASVSDVVRTLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVE ALTLNFIREPCAPGLRPFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLP ACKTHPKVAERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPARQPSTPAW FLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDDTDKTIDLLIACHLAGC GYSVCDTADEISVRTNAITEHGDGILVTVVDVAATQLAVVGHDELRKVVDERVTQVTH DALLATKTAYIMPTSGTTGQPKLVRISHGSLAVFCDAISRAYGWGAHDTVLQCAPLTS DISVEEIFGGAACGARLVRSAAMKTGDLAALVDDLVARETTIVDLPTAVWQLLCADGD AIDAIGRSRLRQIVIGGEAIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIV CDQTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVTAADGSR RRAFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRIAEDPAVSDVAVELH SGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVSSFFVVGVPNIPRKPNGKIDSDN LPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAIGPDSSLLGEGIGSLDLIRILPE TRRYLGWRLSLLDLIGADTAANLADYAPTPDAPTGEDRFRPLVAAQRPAAIPLSFAQR RLWFLDQLQRPAPVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPR QLVIEARRADLGCDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIADDEH VLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQREILGD LDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLVVDWPASVQQQVR RIARQHNATSFMVVAAGLAVLLSKLSGSPDVAVGFPIAGRSDPALDNLVGFFVNTLVL RVNLAGDPSFAELLGQVRARSLAAYENQDVPFEVLVDRLKPTRALTHHPLIQVMLAWQ DNPVGQLNLGDLQATPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEA QAIDVLIERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSI PQMLAAQVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPGECVALLFERC APAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLRSRLAGHDLPI IDVVDALAAYPGTPPPMPAAVNLAYILYTSGTTGEPKGVGITHRNVTRLFASLPARLS AAQVWSQCHSYGFDASAWEIWGALLGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQ TPAAVAMLPTQGLESVALVVAGEACPAALVDRWAPGRVMLNAYGPTETTICAAISAPL RPGSGMPPIGVPVSGAALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRF VACPFGGSGARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAG VGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQRLPGYLVPAAVVVIDAL PLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKTVAGIFARVLGLERVGVDDSFFELG GDSLAAMRVIAAINTTLNADLPVRALLHASSTRGLSQLLGRDARPTSDPRLVSVHGDN PTEVHASDLTLDRFIDADTLATAVNLPGPSPELRTVLLTGATGFLGRYLVLELLRRLD VDGRLICLVRAESDEDARRRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGL DQPMWRRLAETVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKPFTYVST ADVGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEVLLREANDLCALPVA VFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIAPRSFYEPDSEGNRQRAHFDGLP VTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGIGLDEYVDWLIEAGYPIRRIDDF AEWLQRFEASLGALPDRQRRHSVLPMLLASNSQRLQPLKPTRGCSAPTDRFRAAVRAA KVGSDKDNPDIPHVSAPTIINYVTNLQLLGLL" CDS 117750..119735 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0105" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0105, -, len: 661 aa. Equivalent to Rv0102, len: 661 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 661 aa overlap). Probable conserved integral membrane protein, highly similar to P53525|Y102_MYCLE|ML1998|NP_302349.1|NC_002677 possible membrane protein from Mycobacterium leprae (659 aa), FASTA scores: opt: 3107, E(): 0, (70.2% identity in 662 aa overlap). Also similar to others e.g. CAC01497.1|AL391017 putative integral membrane protein from Streptomyces coelicolor (316 aa); etc. Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. Protein product from Mb0105 detected using SWATH mass spectrometry. Mb0105 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247808.1" /translation="MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLV SGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAA AFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVF AVAFATLTGLKIAAALAGTTPSRAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFA RLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAP RFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRR GNSWPVGRLIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGP VTLALRVLPVTGDGRPPGAREWLTWLLHSRVTTFLSHPITAFVLFVASPYIVYFTPLF DTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMPFHAFFG IALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWAR QDRRVASREDRHADSDYADDELEAYNAMLRELSRMRR" CDS complement(119951..122209) /codon_start=1 /transl_table=11 /gene="ctpB" /locus_tag="BQ2027_MB0106C" /product="PROBABLE CATION-TRANSPORTER P-TYPE ATPASE B CTPB" /note="Mb0106c, ctpB, len: 752 aa. Equivalent to Rv0103c, len: 752 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 752 aa overlap). Probable ctpB, cation-transporting P-type ATPase B (transmembrane protein) (EC 3.6.3.-), equivalent to CTPB_MYCLE|P46840 cation-transporting P-type ATPase B from Mycobacterium leprae (750 aa), FASTA scores: opt: 3615, E(): 0, (76.5% identity in 752 aa overlap). Also highly similar to others e.g. CAB96031.1|AL360055 putative metal transporter ATPase from Streptomyces coelicolor (753 aa); NP_241423.1|NC_002570 copper-transporting ATPase from Bacillus halodurans (806 aa); etc. Also highly similar to Z46257|MLACEA_7 aceA gene for isocitrate L from Mycobacterium leprae (750 aa), FASTA scores: opt: 3615, E():0, (76.5% identity in 752 aa overlap). And similar to MTCY251.11 from Mycobacterium tuberculosis, FASTA score: (68.3% identity in 742 aa overlap). Contains PS01047 Heavy-metal-associated domain, PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Protein product from Mb0106c detected using SWATH mass spectrometry. Mb0106c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247809.1" /translation="MAAPVVGDADLQSVRRIRLDVSGMSCAACASRVETKLNKIPGVR ASVNFATRVATIDAVGMAADELCGVVEKAGYHAAPHTETTVLDKRTKDPDGAHARRLL RRLLVAAVLFVPLADLSTLFAIVPSARVPGWGYILTALAAPVVTWAAWPFHSVALRNA RHRTTSMETLISVGIVAATAWSLSSVFGDQPPREGSGIWRAILNSDSIYLEVAAGVTV FVLAGRYFEARAKSKAGSALRALAELGAKNVAVLLPDGAELVIPASELKKRQRFVTRP GETIAADGVVVDGSAAIDMSAMTGEAKPVRAYPAASVVGGTVVMDGRLVIEATAVGAD TQFAAMVRLVEQAQTQKARAQRLADHIAGVFVPVVFVIAGLAGAAWLVSGAGADRAFS VTLGVLVIACPCALGLATPTAMMVASGRGAQLGIFIKGYRALETIRSIDTVVFDKTGT LTVGQLAVSTVTMAGSGTSERDREEVLGLAAAVESASEHAMAAAIVAASPDPGPVNGF VAVAGCGVSGEVGGHHVEVGKPSWITRTTPCHDAALVSARLDGESRGETVVFVSVDGV VRAALTIADTLKDSAAAAVAALRSRGLRTILLTGDNRAAADAVAAQVGIDSAVADMLP EGKVDVIQRLREEGHTVAMVGDGINDGPALVGADLGLAIGRGTDVALGAADIILVRDD LNTVPQALDLARATMRTIRMNMIWAFGYNVAAIPIAAAGLLNPLIAGAAMAFSSFFVV SNSLRLRNFGAQ" CDS 122353..123867 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0107" /product="cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases" /note="Mb0107, -, len: 504 aa. Equivalent to Rv0104, len: 504 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 504 aa overlap). Conserved hypothetical protein, showing weak similarity with other cAMP-dependent protein kinases e.g. AAC37564.1|M65066 cAMP-dependent protein kinase RI-beta regulatory subunit from Homo sapiens (380 aa); etc. Mb0107 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247810.1" /translation="MTPVTTFPLVDAILAGRDRNLDGVILIAAQHLLQTTHAMLRSLF RVGLDPRNVAVIGKCYSTHPGVVDAMRADGIYVDDCSDAYAPHESFDTQYTRHVEWFF AESWARLTAGRTARVVLLDDGGSLLAVAGAMLDASADVIGIEQTSAGYAKIVGCALGF PVINIARSSAKLLYESPIIAARVTQTAFERTAGIDSSAAILITGAGAIGTALADVLRP LHDRVDVYDTRSGCMTPIDLPNAIGGYDVIIGATGATSVPASMHELLRPGVLLMSASS SDREFDAVALRRRTTPNPDCHADLRVADGSVDATLLNSGFPVNFDGSPMCGDASMALT MALLAAAVLYASVAVADEMSSDHPHLGLIDQGDIVASFLNIDVPLQALSRLPLLSIDG YRRLQVRSGHTLFRQGERADHFFVIESGELEALVDGKVILRLGAGDHFGEACLLGGMR RIATVRACEPSVLWELDGKAFGDALHGDAAMREIAYGVARTRLMHAGASESLMV" CDS complement(124016..124300) /codon_start=1 /transl_table=11 /gene="rpmB1" /locus_tag="BQ2027_MB0108C" /product="50s ribosomal protein l28-1 rpmb1" /note="Mb0108c, rpmB1, len: 94 aa. Equivalent to Rv0105c, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). Probable rpmB1, 50S ribosomal protein L28-1, highly similar to others e.g. Q9X8K8|R28B_STRCO 50S RIBOSOMAL PROTEIN L28-2 from Streptomyces coelicolor (78 aa); RL28_ECOLI|P02428 50s ribosomal protein l28 from Escherichia coli (77 aa), FASTA scores: opt: 167, E(): 6.2e-06, (40.7% identity in 59 aa overlap); etc. Also similar to MTCY63A_2 from Mycobacterium tuberculosis. BELONGS TO THE L28P FAMILY OF RIBOSOMAL PROTEINS. Mb0108c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247811.1" /translation="MSARCQITGRTVGFGKAVSHSHRRTRRRWPPNIQLKAYYLPSED RRIKVRVSAQGIKVIDRDGHRGRRRAARAGSAPAHFARQAGSSLRTAAIL" CDS 124410..125606 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0109" /product="Metal chaperone, involved in Zn homeostasis" /note="Mb0109, -, len: 398 aa. Equivalent to Rv0106, len: 398 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 398 aa overlap). Conserved hypothetical protein, similar to others e.g. AL049841|SCE9_33 from Streptomyces coelicolor (370 aa), FASTA scores: opt: 282, E(): 2.5e-11, (32.0% identity in 381 aa overlap); etc. Some similarity to P94400 HOMOLOGUE TO NITRILE HYDRATASE REGION from Bacillus subtilis (397 aa), FASTA scores: opt: 226, E(): 5.4e-08, (26.4% identity in 405 aa overlap). Also similar to COBW_PSEDE|P29937 FASTA score: (25.3% identity in 186 aa overlap); and P47K_PSECL|P31521 47 kd protein (p47k) (419 aa), FASTA score: (25.9% identity in 401 aa overlap). Protein product from Mb0109 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0109 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247812.1" /translation="MRTPVILVAGQDHTDEVTGALLRRTGTVVVEHRFDGHVVRRMTA TLSRGELITTEDALEFAHGCVSCTIRDDLLVLLRRLHRRDNVGRIVVHLAPWLEPQPI CWAIDHVRVCVGHGYPDGPAALDVRVAAVVTCVDCVRWLPQSLGEDELPDGRTVAQVT VGQAEFADLLVLTHPEPVAVAVLRRLAPRARITGGVDRVELALAHLDDNSRRGRTDTP HTPLLAGLPPLAADGEVAIVEFSARRPFHPQRLHAAVDLLLDGVVRTRGRLWLANRPD QVMWLESAGGGLRVASAGKWLAAMAASEVAYVDLERRLFADLMWVYPFGDRHTAMTVL VCGADPTDIVNALNAALLSDDEMASPQRWQSYVDPFGDWHDDPCHEMPDAAGEFSAHR NSGESR" CDS complement(125701..130578) /codon_start=1 /transl_table=11 /gene="ctpI" /locus_tag="BQ2027_MB0111C" /product="PROBABLE CATION-TRANSPORTER ATPASE I CTPI" /note="Mb0111c, ctpI, len: 1625 aa. Equivalent to 5' end of Rv0107c, len: 1632 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 1572 aa overlap). Probable ctpI, cation-transporting ATPase I P-type (EC 3.6.3.-), highly similar to NP_302704.1|NC_002677 probable cation transport ATPase from Mycobacterium leprae (1609 aa); and similar to others e.g. CAB69720.1|AL137166 putative transport ATPase from Streptomyces coelicolor (1472 aa); ATA1_SYNY|P37367 cation-transporting ATPase pma1 from Synechocystis sp. (915 aa), FASTA scores: opt: 603, E(): 6.6e-29, (32.4% identity in 710 aa overlap); etc. Also similar to MTCY39.21c and MTCY22G10.22c from Mycobacterium tuberculosis, FASTA score: (34.4% identity in 796 aa overlap). Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-t) leads to a product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb0111c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0111c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247813.1" /translation="MKIPGVATVLGGVTNGVAQTVRAGARLPGSAAAAVQTLASPVLE LTGPVVQSVVQTTGRAIGVRGSHNESPDGMTPPVRWRSGRRVHFDLDPLLPFPRWHEH AAMVEEPVRRIPGVAEAHVEGSLGRLVVELEPDVDSDIAVDEVRDVVSAVAADIFLAG SVSSPNSAPFADPGNPLAILVPLTAAAMDLVAMGATVTGWVARLPAAPQTTRALAALI NHQPRMVSLMESRLGRVGTDIALAATTAAANGLTQSLGTPLLDLVQRSLQISEAAAHR RVWRDREPALASPRRPQAPVVPIISSAGAKSQEPRHSWAAAAAGEASHVVVGGSIDAA IDTAKGSRAGPVEQYVNQAANGSLIAAASALVAGGGTEDAAGAILAGVPRAAHMGRQA FAAVLGRGLANTGQLVLDPGALRRLDRVRVVVIDGAALRGDNRAVLHAQGDEPGWDDD RVYEVADALLHGEQAPEPDPDELPATGARLRWAPAQGPSATPAQGLEHADLVVDGQCV GSVDVGWEVDPYAIPLLQTAHRTGARVVLRHVAGTEDLSASVGSTHPPGTPLLKLVRE LRADRGPVLLITAVHRDFASTDTLAALAIADVGVALDDPRGATPWTADLITGTDLAAA VRILSALPVARAASESAVHLAQGGTTLAGLLLVTGEQDKTTNPASFRRWLNPVNAAAA TALVSGMWSAAKVLRMPDPTPQPLTAWHALDPEIVYSRLAGGSRPLAVEPGIPAWRRI LDDLSYEPVMAPLRGPARTLAQLAVATRHELADPLTPILAVGAAASAIVGSNIDALLV AGVMTVNAITGGVQRLRAEAAAAELFAEQDQLVRRVVVPAVATTRRRLEAARHATRTA TVSAKSLRVGDVIDLAAPEVVPADARLLVAEDLEVDESFLTGESLPVDKQVDPVAVND PDRASMLFEGSTIVAGHARAIVVATGVGTAAHRAISAVADVETAAGVQARLRELTSKV LPMTLAGGAAVTALALLRRASLRQAVADGVAIAVAAVPEGLPLVATLSQLAAAQRLTA RGALVRSPRTIEALGRVDTICFDKTGTLTENRLRVVCALPSSTAAERDPLPQTTDAPS AEVLRAAARASTQPHNGEGHAHATDEAILAAASALAGSLSSQGDSEWVVLAEVPFESS RGYAAAIGRVGTDGIPMLMLKGAPETILPRCRLADPGVDHEHAESVVRHLAEQGLRVL AVAQRTWDNGTTHDDETDADAVDAVAHDLELIGYVGLADTARPSSRPLIEALLDAERN VVLITGDHPITARAIARQLGLPADARVVTGAELAVLDEEAHAKLAADMQVFARVSPEQ KVQIVASLQRCGRVTAMVGDGANDAAAIRMADVGIGVSGRGSSAARGAADIVLTDDDL GVLLDALVEGRSMWAGVRDAVTILVGGNVGEVLFTVIGTAFGAGRAPVGTRQLLLVNL LTDMFPALAVAVTSQFAEPDDAEYPTDDAAERAQREHRRAVLIGPTPSLDAPLLRQIV NRGVVTAAGATAAWAIGRWTPGTERRTATMGLTALVMTQLAQTLLTRRHSPLVIATAL GSAGVLVGIIQTPVISHFFGCTPLGPVAWTGVFSATAGATAVSALAPKWLASTVGVVQ PDERPDDAEDSDAGG" CDS complement(130932..131141) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0112C" /product="HYPOTHETICAL PROTEIN" /note="Mb0112c, -, len: 69 aa. Equivalent to Rv0108c, len: 69 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 69 aa overlap). Hypothetical unknown protein. Protein product from Mb0112c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0112c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247814.1" /translation="MVPVETLHSGDPITDVNGGGQRYIVLESKTVGDSCVVLELESRV NHQLQVIEKSFPAGYHVGRAHHRIL" CDS 131420..132910 /codon_start=1 /transl_table=11 /gene="PE_PGRS1" /locus_tag="BQ2027_MB0113" /product="pe-pgrs family protein pe_pgrs1" /note="Mb0113, PE_PGRS1, len: 496 aa. Equivalent to Rv0109, len: 496 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 496 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. Q50615|Y0DP_MYCTU HYPOTHETICAL GLYCINE-RICH 40.8 KD PROTEIN from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1772, E(): 0, (57.3% identity in 513 aa overlap); etc. Mb0113 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247815.1" /translation="MSLLITSPATVAAAATHLAGIGSALSTANAAAAAPTTALSVAGA DEVSVLIAALFEAYAQEYQALSAQALAFHDQFVQALNMGAVCYAAAETANATPLQALQ TVQQNVLTVVNAPTQALLGRPIIGNGANGLPNTGQDGGPGGLLFGNGGNGGSGGVDQA GGNGGAAGLIGNGGSGGVGGPGIAGSAGGAGGAGGLLFGNGGPGGAGGIGTTGDGGPG GAGGNAIGLFGSGGTGGMGGVGGMGGVGNGGNAGNGGTAGLFGHGGAGGAGGIGSADG GLGGGGGNGRFMGNGGVGGAGGYGASGDGGNAGNGGLGGVFGDGGAGGTGGLGDVNGG LAGIGGNAGFVGNGGAGGNGQLGSGAVSSAGGMGGNGGLVFGNGGPGGLGGPGTSAGN GGMGGNAVGLFGQGGAGGAGGSGFGAGIPGGRGGDGGSGGLIGDGGTGGGAGAGDAAA SAGGNGGNARLIGNGGDGGPGMFGGPGGAGGSGGTIFGFAGTPGPS" CDS 133058..133807 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0114" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0114, -, len: 249 aa. Equivalent to Rv0110, len 249 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 249 aa overlap). Probable conserved integral membrane protein, similar to many e.g. AL079308|SCH69_25 from Streptomyces coelicolor (297 aa), FASTA scores: opt: 552, E(): 6.1e-29, (45.4% identity in 251 aa overlap); P54493|YQGP_BACSU HYPOTHETICAL 56.4 KD PROTEIN from Bacillus subtilis (507 aa), FASTA scores: opt: 320, E(): 4e-15, (32.4% identity in 210 aa overlap); etc. Mb0114 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247816.1" /translation="MRVGPVGHQCAECVREGARAVRQPRTPFGGRQRSATPVVTYTLI SLNALVFVMQVTVMGLERQLALWPPAVASGQTYRLVTSAFLHYGAMHLLLNMWALYVV GPPLEMWLGRLRFGALYAVSALGGSVLVYLIAPLNTATAGASGAVFGLFGATFMVARR LHLDVRWVVALIVINLAFTFLAPAISWQGHVGGLVTGALVAATYVYAPRERRNLIQAT VTITVLVAFVVLIGWRTVDLLALFGGRLNLS" CDS 133988..136045 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0115" /product="POSSIBLE TRANSMEMBRANE ACYLTRANSFERASE" /note="Mb0115, -, len: 685 aa. Equivalent to Rv0111, len: 685 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 685 aa overlap). Possible transmembrane acyltransferase (EC 2.3.1.-), equivalent to AA22904.1|AL035300 putative acyltransferase from Mycobacterium leprae (696 aa). Also similar to others e.g. C69975 acyltransferase homolog yrhL from Bacillus subtilis (634 aa), FASTA scores: opt: 520, E(): 4e-22, (36.4% identity in 382 aa overlap). Very similar to Mycobacterium tuberculosis proteins Rv0228, Rv1254, Rv1565c, etc. Mb0115 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247817.1" /translation="MPARSVPRPRWVAPVRRVGRLAVWDRPERRSGIPALDGLRAIAV ALVLASHGGIPGMGGGFIGVDAFFVLSGFLITSLLLDELGRTGRIDLSGFWIRRARRL LPALVLMVLTVSAARALFPDQALTGLRSDAIAAFLWTANWRFVAQNTDYFTQGAPPSP LQHTWSLGVEEQYYVVWPLLLIGATLLLAARARRRCRRATVGGVRFAAFLIASLGTMA SATAAVAFTSAATRDRIYFGTDTRAQALLIGSAAAALLVRDWPSLNRGWCLIRTRWGR RIARLLPFVGLAGLAVTTHVATGSVGEFRHGLLIVVAGAAVIVVASVAMEQRGAVARI LAWRPLVWLGTISYGVYLWHWPIFLALNGQRTGWSGPALFAARCAATVVLAGASWWLI EQPIRRWRPARVPLLPLAAATVASAAAVTMLVVPVGAGPGLREIGLPPGVSAVAAVSP SPPEASQPAPGPRDPNRPFTVSVFGDSIGWTLMHYLPPTPGFRFIDHTVIGCSLVRGT PYRYIGQTLEQRAECDGWPARWSAQVNRDQPDVALLIVGRWETVDRVNEGRWTHIGDP TFDAYLNAELQRALSIVGSTGVRVMVTTVPYSRGGEKPDGRLYPEDQPERVNKWNAML HNAISQHSNVGMIDLNKKLCPDGVYTAKVDGIKVRSDGVHLTQEGVKWLIPWLEDSVR VAS" CDS 136327..137283 /codon_start=1 /transl_table=11 /gene="gca" /locus_tag="BQ2027_MB0116" /product="POSSIBLE GDP-MANNOSE 4,6-DEHYDRATASE GCA (GDP-D-MANNOSE DEHYDRATASE)" /note="Mb0116, gca, len: 318 aa. Equivalent to Rv0112, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Possible gca, GDP-mannose 4,6-dehydratase (EC 4.2.1.47), similar to others e g. U18320|PAU18320_1 GDP-D-mann from Pseudomonas aeruginosa (323 aa), FASTA scores: opt: 415, E(): 4.4e-21, (27.0% identity in 318 aa overlap). Similar to Rv3634c, Rv3784, etc from Mycobacterium tuberculosis. Contains PS00061 Short-chain dehydrogenases/reductases family signature. SEEMS TO BELONG TO THE GDP-MANNOSE 4,6-DEHYDRATASE FAMILY. COFACTOR: NAD(+). Protein product from Mb0116 detected using SWATH mass spectrometry. Mb0116 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247818.1" /translation="MKVWITGAGGMMGSHLAEMLLAAGHDVYATYCRPTIDPSDLQFN GAEVDITDWCSVYDSIATFRPDAVFHLAAQSYPAVSWARPVETLTTNMVGTAIVFEAL RRVRPHAKIIVAGSSAEYGFVDPSEVPINERRELRPLHPYGVSKAATDMLAYQYHKSY GMHTVVARIFNCTGPRKVGDALSDFVRRCTWLEHHPEQSAIRVGNLKTKRTIVDVRDL NRALMLMLDKGEAGADYNVGGSIAYEMGDVLKQVIAACKRDDIVPEVDPALLRPTDEK IIYGDCSKLAAITGWQQEICLTQTIADMFDYWRSKSESALMV" CDS 137357..137947 /codon_start=1 /transl_table=11 /gene="gmhA" /locus_tag="BQ2027_MB0117" /product="probable sedoheptulose-7-phosphate isomerase gmha (phosphoheptose isomerase)" /note="Mb0117, gmhA, len: 196 aa. Equivalent to Rv0113, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 196 aa overlap). Probable gmhA (alternate gene name: lpcA), phosphoheptose isomerase (EC 5.-.-.-), similar to many e.g. AE0005|HPAE000596_11 from Helicobacter pylori (192 aa), FASTA scores: opt: 451, E(): 1.9e-24, (45.1% identity in 162 aa overlap). BELONGS TO THE SIS FAMILY, LPCA SUBFAMILY. Mb0117 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247819.1" /translation="MCTARTAEEIFVETIAVKTRILNDRVLLEAARAIGDRLIAGYRA GARVFMCGNGGSAADAQHFAAELTGHLIFDRPPLGAEALHANSSHLTAVANDYDYDTV FARALEGSARPGDTLFAISTSGNSMSVLRAAKTARELGVTVVAMTGESGGQLAEFADF LINVPSRDTGRIQESHIVFIHAISEHVEHALFAPRQ" CDS 137979..138551 /codon_start=1 /transl_table=11 /gene="gmhb" /locus_tag="BQ2027_MB0118" /product="possible d-alpha,beta-d-heptose-1,7-biphosphate phosphatase gmhb (d-glycero-d-manno-heptose 7-phosphate kinase)" /note="Mb0118, -, len: 190 aa. Equivalent to Rv0114, len: 190 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 190 aa overlap). Possible dehydratase (EC 4.-.-.-), similar to several hypothetical proteins and to HIS7_ECOLI|P06987 imidazoleglycerol-phosphate dehydratase (355 aa), FASTA scores: opt: 250, E(): 3.6e-11, (34.0 % identity in 141 aa overlap). Mb0118 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247820.1" /translation="MVAERAGHQWCLFLDRDGVINRQVVGDYVRNWRQFEWLPGAARA LKKLRAWAPYIVVVTNQQGVGAGLMSAVDVMVIHRHLQMQLASDGVLIDGFQVCPHHR SQRCGCRKPRPGLVLDWLRRHPDSEPLLSIVVGDSLSDLELAHNVAAAAGACASVQIG GASSGGVADASFDSLWEFAVAVGHARGERG" CDS 138702..139619 /codon_start=1 /transl_table=11 /gene="hdda" /locus_tag="BQ2027_MB0119" /product="possible d-alpha-d-heptose-7-phosphate kinase hdda" /note="Mb0119, -, len: 305 aa. Equivalent to Rv0115, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (98.5% identity in 275 aa overlap). Possible sugar kinase (EC 2.-.-.-), similar to several hypothetical proteins and sugar kinases e.g. AAK27850.1|AF324836_3 D-glycero-D-manno-heptose 7-phosphate kinase from Aneurinibacillus thermoaerophilus (341 aa); AAK80995.1|AE007802_11 Sugar kinase from Clostridium acetobutylicum (364 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-c) leads to shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv." /protein_id="CAB5247821.1" /translation="MRSPFARRTATEPARPRSTIWRLSKKTLPLHVAVYRRVIAEFNG GTPFPLQLATQVDAPPGSGLGSSSALVVAMLLTTCALIGSSPGPYELARLAWEIERVD LGMAGGWQDHYAAAFGGFNFMESRPNGEVVVNPLRIRREVIAELEASLLLYFGGVSRL SSEVIADQQRNVVERDADALAATHSICAEALEMKDLLVVGDIPGFADSLLRGWQAKKR TSTRISNPAIEHAYQVAQSSGMVAGKVSGAGGGGFLMMIVDPRRRIEVARSLERECGG SVAPCLFTKGGAVTWHIPESTAPRKAWSC" CDS complement(140307..141062) /codon_start=1 /transl_table=11 /gene="ldta" /locus_tag="BQ2027_MB0120C" /product="probable l,d-transpeptidase ldta" /note="Mb0120c, -, len: 251 aa. Equivalent to Rv0116c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Possible conserved membrane protein, showing similarity to several hypothetical mycobacterial proteins e.g. Rv1433 from Mycobacterium tuberculosis (271 aa); and Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa); to the C-terminal regions of others like Rv0192 from Mycobacterium tuberculosis (366 aa), FASTA scores: opt: 451, E(): 1.7e-21, (46.7% identity in 270 aa overlap); and Rv0192|Z97050|MTCI28_32 from Mycobacterium tuberculosis cosmid (366 aa), FASTA scores: opt: 699, E(): 0, (45.7% identity in 221 aa overlap). TBparse score is 0.932. Protein product from Mb0120c detected using SWATH mass spectrometry. Mb0120c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247822.1" /translation="MRRVVRYLSVVVAITLMLTAESVSIATAAVPPLQPIPGVASVSP ANGAVVGVAHPVVVTFTTPVTDRRAVERSIRISTPHNTTGHFEWVASNVVRWVPHRYW PPHTRVSVGVQELTEGFETGDALIGVASISAHTFTVSRNGEVLRTMPASLGKPSRPTP IGSFHAMSKERTVVMDSRTIGIPLNSSDGYLLTAHYAVRVTWSGVYVHSAPWSVNSQG YANVSHGCINLSPDNAAWYFDAVTVGDPIEVVG" CDS 141240..142184 /codon_start=1 /transl_table=11 /gene="oxyS" /locus_tag="BQ2027_MB0121" /product="OXIDATIVE STRESS RESPONSE REGULATORY PROTEIN OXYS" /note="Mb0121, oxyS, len: 314 aa. Equivalent to Rv0117, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). OxyS, oxidative stress response protein regulatory protein, LysR family (see citation below). Similar to many transcription regulators and OxyR, the oxidative stress response protein of many bacteria. Contains LysR family signature at N-terminus. Also contains helix-turn-helix motif at aa 16-37 (Score 1543, +4.44 SD). BELONGS TO THE LYSR FAMILY OF TRANSCRIPTIONAL REGULATORS. OXYR IS REQUIRED FOR THE INDUCTION OF A REGULON OF HYDROGEN PEROXIDE INDUCIBLE GENES SUCH AS CATALASE, GLUTATHIONE-REDUCTASE, ETC. Protein product from Mb0121 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0121 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247823.1" /translation="MLFRQLEYFVAVAQERHFARAAEKCYVSQPALSSAIAKLERELN VTLINRGHSFEGLTREGERLVVWAKRILAEHAAFKAEVDAVRSGITGTLRLGTVPTAS TTASLVLSAFCSAHPLAKVQVCSRLAATELYRRLREFELDAVIVHPETQDSDDVDLVP LYEEQYVLLSPADMLPPGTSTLVWRDAAQLPLALLTADMRDRQVIDAAFADHAVSAIP QVETDSVASLFAQVATGNWASIVPHTWLWAMPMSGPTGGEIRAVELVDPVLKAQIALA TNALGPGSPVARALITCAQALALNEFFDTQLRGITRRR" CDS complement(142168..143916) /codon_start=1 /transl_table=11 /gene="oxcA" /locus_tag="BQ2027_MB0122C" /product="PROBABLE OXALYL-COA DECARBOXYLASE OXCA" /note="Mb0122c, oxcA, len: 582 aa. Equivalent to Rv0118c, len: 582 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 582 aa overlap). Probable oxcA, oxalyl-CoA decarboxylase (EC 4.1.1.8), highly similar to many e.g. P78093|OXC_ECOLI|7449483|B65011|YFDU|B2373|Z3637 |ECS325 PROBABLE OXALYL-CoA DECARBOXYLASE from Escherichia coli (564 aa); M77128|OXAOXA_1 oxalyl-CoA decarboxylase from Oxalobacter formigenes (568 aa), FASTA scores: opt: 2124, E():0, (55.6% identity in 568 aa overlap). Also similar to mycobacterial IlvB proteins e.g. MLCB1788.46c unknown TPP-requiring enzyme from Mycobacterium leprae (548 aa); and AL0086|MLCB1788_19 from Mycobacterium leprae (548 aa), FASTA scores: opt: 831, E(): 0, (33.9% identity in 567 aa overlap). Protein product from Mb0122c detected using shotgun mass spectrometry and SWATH mass spectrometry." /protein_id="CAB5247824.1" /translation="MTTRSASPCTVLTDGCHLVVDALKANDVDTIYGVVGIPITDLAR AAQASGIRYIGFRHEASAGNAAAAAGFLTARPGVCLTTSGPGFLNGLPALANATTNCF PMIQISGSSSRPMVDLQRGDYQDLDQLNAARPFVKAAYRIGQVQDIGRGVARAIRTAT SGRPGGVYLDIPGDVLGQAVEASAASGAIWRPVDPAPRLLPAPEAIDRALDVLAQAQR PLLVLGKGAAYAQADNVIREFVEHTGIPFLPMSMAKGLLPDSHPQSAAAARSLAMARA DVVLLVGARLNWLLGNGESPQWSADAKFIQVDIEASEFDSNRPIVAPLTGDIGSVMSA LLEAAADRSSVASAAWTGELADRKARNSAKMRRRLADDHHPMRFYNALGAIRSVLQRN PDVYVVNEGANALDLARNIIDMHLPRHRLDSGTWGVMGIGMGYAIAAAVETGRPVVAI EGDSAFGFSGMEFETICRYRLPVTVVILNNGGVYRGDEATIFRSAAPVWRHDPAPTVL NAHARHELIAEAFGGKGYHVSTPTELESALTDALASNGPSLIDCELDPADGVESGHLA KLNTTSAATPAISGDG" CDS 144089..145666 /codon_start=1 /transl_table=11 /gene="fadD7" /locus_tag="BQ2027_MB0123" /product="PROBABLE FATTY-ACID-COA LIGASE FADD7 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0123, fadD7, len: 525 aa. Equivalent to Rv0119, len: 525 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 525 aa overlap). Probable fadD7, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to 4-coumarate:CoA ligase of many organisms e.g. U39405|PTU39405_1 4-coumarate:CoA ligase from Pinus taedaxylem (537 aa), FASTA scores: opt: 483, E(): 8.3e-22, (28.2% identity in 440 aa overlap). Contains PS00455 Putative AMP-binding domain signature. TBscore is 0.896. Protein product from Mb0123 detected using shotgun mass spectrometry and SWATH mass spectrometry." /protein_id="CAB5247825.1" /translation="MASDFGPRIADLVEVAATRLPEAPALVVTADRIAISHRDLARLV DELAGQLTRSGLLPGDRVALRMGSNAEFVVALLAASRADLVVVPLDPALPITEQRVRS QAAGARVVLIDADGPHDRAEPTTRWWPLTVNVGGDSGPSGGTLSVHLDAATEPNPATS TPEGLRPDDAMIMFTGGTTGLPKMVPWTHANIASSVRAIITGYRLSPRDATVAVMPLY HGHGLIASLLATLASGGAVSLPARGRFSAHTFWDDIKAVGATWYTAVPTIHQILLERS ATEPSGRKPAALRFIRSCSAPLTAQAALALQTEFAAPVVCAFGMTEATHQVTTTQIEG IDQTETPVVSTGLVGRSTGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVRGYLGDPTIT AANFTDGWLRTGDLGSLSAAGDLSIRGRIKELINRGGEKISPERVEGVLASHPNVMEA AVFGVPHQLYGEAVAAVIVPRESAPPTREELVQFCRERLAAFEIPASFQEASGLPHTA KGSLDRRAVAERFGHSV" CDS complement(145667..147460) /codon_start=1 /transl_table=11 /gene="fusA2b" /locus_tag="BQ2027_MB0124C" /standard_name="fus2" /product="PROBABLE ELONGATION FACTOR G FUSA2B [SECOND PART] (EF-G)" /note="Mb0124c, fusA2b, len: 597 aa. Equivalent to 3' end of Rv0120c, len: 714 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 597 aa overlap). Probable fusA2 (alternate gene name: fus2), elongation factor G, highly similar to others e.g. EFG_ECOLI|P02996 elongation factor G (ef-g) from Escherichia coli (703 aa), FASTA scores: opt: 1049, E(): 0, (32.5% identity in 717 aa overlap). Also similar to fusA1|MTCY210.01 from Mycobacterium tuberculosis FASTA score: (39.1% identity in 299 aa overlap); and P30767|EFG_MYCLE ELONGATION FACTOR G (EF-G) from Mycobacterium leprae (701 aa), FASTA score: (31.7% identity in 710 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, EF-G/EF-2 SUBFAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, fusA2 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2bp to 1bp substitution (gc-t), splits fusA2 into 2 parts, fusA2a and fusA2b. Protein product from Mb0124c detected using SWATH mass spectrometry. Mb0124c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247826.1" /translation="MIAANEGVDEPTKSLWQECSQVGMPRAVVITKLDHARANYREAL TAAQDAFGDKVLPLYLPSGDGLIGLLSQALYEYADGKRTTRTPAESDTERIEEARGAL IEGIIEESEDESLMERYLGGETIDESVLIQDLEKAVARGSFFPVIPVCSSTGVGTLEL LEVATRGFPSPMEHPLPEVFTPQGVPHAELACDNDAPLLAEVVKTTSDPYVGRVSLVR VFSGTIRPDTTVHVSGHFSSFFGGGTSNTHPDHDEDERIGVLSFPLGKQQRPAAAVVA GDICAIGKLSRAETGDTLSDKAEPLVLKPWTMPEPLLPIAIAAHAKTDEDKLSVGLGR LAAEDPTLRIEQNQETHQVVLWCMGEAHAGVVLDTLANRYGVSVDTIELRVPLRETFA GNAKGHGRHIKQSGGHGQYGVCDIEVEPLPEGSGFEFLDKVVGGAVPRQFIPSVEKGV RAQMDKGVHAGYPVVDIRVTLLDGKAHSVDSSDFAFQMAGALALREAAAATKVILLEP IDEISVLVPDDFVGAVLGDLSSRRGRVLGTETAGHDRTVIKAEVPQVELTRYAIDLRS LAHGAASFTRSFARYEPMPESAAARVKAGAG" CDS complement(147457..147810) /codon_start=1 /transl_table=11 /gene="fusA2a" /locus_tag="BQ2027_MB0125C" /standard_name="fus2" /product="PROBABLE ELONGATION FACTOR G FUSA2A [FIRST PART] (EF-G)" /note="Mb0125c, fusA2a, len: 117 aa. Equivalent to 5' end of Rv0120c, len: 714 aa, from Mycobacterium tuberculosis strain H37Rv, (98.1% identity in 108 aa overlap). Probable fusA2 (alternate gene name: fus2), elongation factor G, highly similar to others e.g. EFG_ECOLI|P02996 elongation factor G (ef-g) from Escherichia coli (703 aa), FASTA scores: opt: 1049, E(): 0, (32.5% identity in 717 aa overlap). Also similar to fusA1|MTCY210.01 from Mycobacterium tuberculosis FASTA score: (39.1% identity in 299 aa overlap); and P30767|EFG_MYCLE ELONGATION FACTOR G (EF-G) from Mycobacterium leprae (701 aa), FASTA score: (31.7% identity in 710 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, EF-G/EF-2 SUBFAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, fusA2 exists as a single gene. In Mycobacterium bovis, a frameshift due to the substitution of a single base (gc-t), splits fusA2 into 2 parts, fusA2a and fusA2b. Protein product from Mb0125c detected using shotgun mass spectrometry. Mb0125c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247827.1" /translation="MADRVNASQGAAAAPTANGPGGVRNVVLVGPSGGGKTTLIEALL VAAKVLSRPGSVTEGTTVCDFDEAEIRQQRSVGLAVASLAYDGIKVNLVDTPGYADFV GELWPGCGPPIAHCS" CDS complement(147946..148380) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0126C" /product="Pyridoxamine 5-phosphate oxidase" /note="Mb0126c, -, len: 144 aa. Equivalent to Rv0121c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Conserved hypothetical protein, showing some similarity with others proteins from Mycobacterium tuberculosis e.g. Rv1155, Rv1875, Rv2074, etc. Protein product from Mb0126c detected using SWATH mass spectrometry. Mb0126c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247828.1" /translation="MGEFDPKLRFAQSPVARLATSTPDGTPHLVPVVFALGARRPAEA TGADVIYTAVDAKRKTTQRLRRLANLEHNPRASVLVDSYADDWTQLWWVRADGVAAIH RDGEVMRAAYRLLRAKYAQYQSVPLNGPVIAIAVQRWASWHA" CDS 148529..148897 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0127" /product="HYPOTHETICAL PROTEIN" /note="Mb0127, -, len: 122 aa. Equivalent to Rv0122, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Hypothetical unknown protein. Mb0127 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247829.1" /translation="MAGSVSAAAGIGWVGLNVTETNRDQCYRVERTTVDALTHPEYRV HTRGVQRVRVTRNARKHRVSKHRIVAAMRHCGVPVIQEDGSLYYQGRDTSGRLTEVVA VEADDGDLIITHAMPKEWKR" CDS 148894..149262 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0128" /product="unknown protein" /note="Mb0128, -, len: 122 aa. Equivalent to Rv0123, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Hypothetical unknown protein. Protein product from Mb0128 detected using shotgun mass spectrometry and SWATH mass spectrometry." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247830.1" /translation="MTKKPRNPADYVIGDDVEVSDVDLKQEEVYVDGERLTDERVEQM ASESLRLAREREANLIPGGKSLSGGSAHSPAVQVVVSKATHAKLKELARSRKMSVSKL LRPVLDEFVQRETGRILPRR" CDS 149571..151187 /codon_start=1 /transl_table=11 /gene="PE_PGRS2" /locus_tag="BQ2027_MB0129" /product="pe-pgrs family protein pe_pgrs2" /note="Mb0129, PE_PGRS2, len: 538 aa. Similar to Rv0124, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (87.4% identity in 547 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. Y0DP_MYCTU|Q50615 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1730, E(): 0, (60.7% identity in 504 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 180 bp insertion leads to a longer product in the 3' direction compared to its homolog in Mycobacterium tuberculosis strain H37Rv (538 aa versus 487 aa)." /protein_id="CAB5247831.1" /translation="MSFVSVAPEIVVAAATDLAGIGSAISAANAAAAAPTTAVLAAGA DEVSAAIAALFSGHAQAYQALSAQAAAFHQQFVQTLAGGAGAYAAAEAQVEQQLLAAI NAPTQALLGRPLIGNGADGAPGTGQAGGAGGILYGNGGNGGSGAAGQAGGAGGPAGLI GHGGSGGAGGHGGWLWGNGGVGGSGGAGVGAGVAGGHGGAGGAAGLWGAGGGGGNGGN GADANIVSGGDGGLGGAGGGGGWLYGDGGAGGHGGQGAIGLGGGAGGDGGQGGAGRGL WGTGGAGGHGGQGGGTGGPPLPGQAGMGAAGGAGGLIGNGGAGGDGGVGASGGVAGVG GAGGNAMLIGHGGAGGAGGDSSFANGAAGGAGGAGGHLFGNGGSGGHGGAVTAGNTGI GGAGGVGGDARLIGHGGAGGAGGDRAGALVGRDGGPGGNGGAGGQLYGNGGDGGPGGQ GGQAFGANNIGGTGGAGGNGGPAILSGNGGNGGAGGAGGAGGAGGGAGGVGGAGGAPG TGGTLQAAVSGLVTALFGAPGQPGDTGQPG" CDS 151339..152406 /codon_start=1 /transl_table=11 /gene="pepA" /locus_tag="BQ2027_MB0130" /product="probable serine protease pepa (serine proteinase) (mtb32a)" /note="Mb0130, pepA, len: 355 aa. Equivalent to Rv0125, len: 355 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 355 aa overlap). Probable pepA, serine protease (EC 3.4.21.-), highly similar to other proteases e.g. HHOB_ECOLI|P31137 protease hhob precursor (355 aa), FASTA scores: opt: 400, E(): 3.8e-14, (32.4% identity in 346 aa overlap). Also similar to Q50320 34 KDA protein precursor from Mycobacterium tuberculosis (361 aa), FASTA scores: opt: 1689, E(): 0, (70.7% identity in 362 aa overlap). Contains PS00135 Serine proteases, trypsin family, serine active site. Has a possible signal sequence at the N-terminus. Protein product from Mb0130 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0130 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247832.1" /translation="MSNSRRRSLRWSWLLSVLAAVGLGLATAPAQAAPPALSQDRFAD FPALPLDPSAMVAQVGPQVVNINTKLGYNNAVGAGTGIVIDPNGVVLTNNHVIAGATD INAFSVGSGQTYGVDVVGYDRTQDVAVLQLRGAGGLPSAAIGGGVAVGEPVVAMGNSG GQGGTPRAVPGRVVALGQTVQASDSLTGAEETLNGLIQFDAAIQPGDSGGPVVNGLGQ VVGMNTAASDNFQLSQGGQGFAIPIGQAMAIAGQIRSGGGSPTVHIGPTAFLGLGVVD NNGNGARVQRVVGSAPAASLGISTGDVITAVDGAPINSATAMADALNGHHPGDVISVT WQTKSGGTRTGNVTLAEGPPA" CDS 152515..154320 /codon_start=1 /transl_table=11 /gene="treS" /locus_tag="BQ2027_MB0131" /product="TREHALOSE SYNTHASE TRES" /note="Mb0131, treS, len: 601 aa. Equivalent to Rv0126, len: 601 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 601 aa overlap). treS, trehalose synthase (EC 5.4.99.-) (see citation below), highly similar to others e.g. CAA04601.2|AJ001205 putative trehalose synthase from Streptomyces coelicolor (566 aa); S71450|1536814|BAA11303.1|D78198 trehalose synthase maltose-specific from Pimelobacter sp. strain R48 (573 aa). Also similar to MAL1_DROME|P07191 possible maltase precursor (508 aa), FASTA scores: opt: 807, E(): 0, (33.7% identity in 504 aa overlap); and similar to proteins associated with amino-acid transport e.g. Q64319 rat protein which stimulates transport of cystine and dibasic and neutral amino acids (683 aa), FASTA scores: opt: 839, E(): 0, (32.0% identity in 531 aa overlap). Also similar to several other Mycobacterium tuberculosis proteins e.g. Rv2471 FASTA score: (31.7% identity in 164 aa overlap). Protein product from Mb0131 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0131 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247833.1" /translation="MNEAEHSVEHPPVQGSHVEGGVVEHPDAKDFGSAAALPADPTWF KHAVFYEVLVRAFFDASADGSGDLRGLIDRLDYLQWLGIDCIWLPPFYDSPLRDGGYD IRDFYKVLPEFGTVDDFVALVDAAHRRGIRIITDLVMNHTSESHPWFQESRRDPDGPY GDYYVWSDTSERYTDARIIFVDTEESNWSFDPVRRQFYWHRFFSHQPDLNYDNPAVQE AMIDVIRFWLGLGIDGFRLDAVPYLFEREGTNCENLPETHAFLKRVRKVVDDEFPGRV LLAEANQWPGDVVEYFGDPNTGGDECHMAFHFPLMPRIFMAVRRESRFPISEIIAQTP PIPDMAQWGIFLRNHDELTLEMVTDEERDYMYAEYAKDPRMKANVGIRRRLAPLLDND RNQIELFTALLLSLPGSPVLYYGDEIGMGDVIWLGDRDGVRIPMQWTPDRNAGFSTAN PGRLYLPPSQDPVYGYQAVNVEAQRDTSTSLLNFTRTMLAVRRRHPAFAVGAFQELGG SNPSVLAYVRQVAGDDGDTVLCVNNLSRFPQPIELDLQQWTNYTPVELTGHVEFPRIG QVPYLLTLPGHGFYWFQLTTHEVGAPPTCGGERRL" CDS 154423..155790 /codon_start=1 /transl_table=11 /gene="mak" /locus_tag="BQ2027_MB0132" /product="maltokinase mak" /note="Mb0132, -, len: 455 aa. Equivalent to Rv0127, len: 455 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 455 aa overlap). Conserved hypothetical protein, highly similar to various proteins e.g. AJ0012|SCJ001205_4 hypothetical protein from Streptomyces coelicolor A3(2) (464 aa), FASTA scores: opt: 412, E(): 1.1e-19, (40.6% identity in 485 aa overlap); AJ0012|SCJ001206_5 hypothetical protein from Streptomyces coelicolor A3(2) (453 aa), FASTA scores: opt: 403, E(): 4.3 e-19, (36.5% identity in 455 aa overlap). Protein product from Mb0132 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0132 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247834.1" /translation="MTRSDTLATKLPWSDWLPRQRWYAGRNRELATVKPGVVVALRHN LDLVLVDVTYTDGATERYQVLVGWDFEPASEYGTKAAIGVADDRTGFDALYDVAGPQF LLSLIVSSAVCGTSTGEVTFTREPDVELPFAAQPRVCDAEQSNTSVIFDRRAILKVFR RVSSGINPDIELNRVLTRAGNPHVARLLGAYQFGRPNRSPTDALAYALGMVTEYEANA AEGWAMATASVRDLFAEGDLYAHEVGGDFAGESYRLGEAVASVHATLADSLGTAQATF PVDRMLARLSSTVAVVPELREYAPTIEQQFQKLAAEAITVQRVHGDLHLGQVLRTPES WLLIDFEGEPGQPLDERRAPDSPLRDVAGVLRSFEYAAYGPLVDQATDKQLAARAREW VERNRAAFCDGYAVASGIDPRDSALLLGAYELDKAVYETGYETRHRPGWLPIPLRSIA RLTAS" CDS 155858..156637 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0133" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0133, -, len: 259 aa. Equivalent to Rv0128, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Probable conserved transmembrane protein, with some similarity to Rv3064c and other bacterial proteins e.g. AAK85977.1|AE007957|AGR_C_254p from Agrobacterium tumefaciens (206 aa). Mb0133 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247835.1" /translation="MQREIYDGEARLSWVLAALAGILGATAFTHSAGYFVTFMTGNSQ RAVLGLFGDDAWMSVTASLLILFFVAGVVIASVCRRHFWAAHPHGPTVLTTFSLIFAA GVDIMLGGWHESMLDFVPILFVVFGIGALNTSFVKDGEVSVPLSYVTGTLVKMGQGIE RHLAGGKVEDWLGYFLLHASFVLGAAAGGAISMVVTGPQMLAVAAVVCAATTGYTYLH ADRRGLVNQKRPQPGKRLFRALRRGELDSGTSTPATNYGSS" CDS complement(156769..157791) /codon_start=1 /transl_table=11 /gene="fbpC" /locus_tag="BQ2027_MB0134C" /standard_name="mpt45; 85C; fbpC2" /product="SECRETED ANTIGEN 85-C FBPC (85C) (ANTIGEN 85 COMPLEX C) (AG58C) (MYCOLYL TRANSFERASE 85C) (FIBRONECTIN-BINDING PROTEIN C)" /note="Mb0134c, fbpC, len: 340 aa. Equivalent to Rv0129c, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 340 aa overlap). fbpC (alternate gene names: mpt45, 85C, fbpC2), secreted antigen 85c (fibronectin-binding protein C) (mycolyl transferase 85C) (EC 2.3.1.-) (see citations below), also highly similar to other Mycobacterial antigen precursors e.g. A85C_MYCLE|Q05862 antigen 85-c precursor (85c) from Mycobacterium leprae (333 aa), FASTA scores: opt: 1937, E(): 0, (81.4% identity in 333 aa overlap); etc. Protein product from Mb0134c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0134c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247836.1" /translation="MTFFEQVRRLRSAATTLPRRLAIAAMGAVLVYGLVGTFGGPATA GAFSRPGLPVEYLQVPSASMGRDIKVQFQGGGPHAVYLLDGLRAQDDYNGWDINTPAF EEYYQSGLSVIMPVGGQSSFYTDWYQPSQSNGQNYTYKWETFLTREMPAWLQANKGVS PTGNAAVGLSMSGGSALILAAYYPQQFPYAASLSGFLNPSEGWWPTLIGLAMNDSGGY NANSMWGPSSDPAWKRNDPMVQIPRLVANNTRIWVYCGNGTPSDLGGDNIPAKFLEGL TLRTNQTFRDTYAADGGRNGVFNFPPNGTHSWPYWNEQLVAMKADIQHVLNGATPPAA PAAPAA" CDS 158038..158493 /codon_start=1 /transl_table=11 /gene="htdz" /locus_tag="BQ2027_MB0135" /product="probable 3-hydroxyl-thioester dehydratase" /note="Mb0135, -, len: 151 aa. Equivalent to Rv0130, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Conserved hypothetical protein, most similar to AL096811|SCI30A_19 from Streptomyces coelicolor (153 aa), FASTA scores: opt: 639, E(): 0, (60.8% identity in 148 aa overlap). Also similar to NODN_RHILV|P08634 nodulation protein from Rhizobium leguminosarum bv. viciae plasmid pRL1JI (161 aa), FASTA scores: opt: 406, E(): 1e-21, (43.9% identity in 148 aa overlap; and to O30041 MONOAMINE OXIDASE REGULATORY PROTEIN (146 aa), FASTA scores: opt: 219, E(): 1.1e-08, (30.8% identity in 133 aa overlap). Protein product from Mb0135 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0135 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247837.1" /translation="MRTFESVADLAAAAGEKVGQSDWVTITQEEVNLFADATGDHQWI HVDPERAAAGPFGTTIAHGFMTLALLPRLQHQMYTVKGVKLAINYGLNKVRFPAPVPV GSRVRATSSLVGVEDLGNGTVQATVSTTVEVEGSAKPACVAESIVRYVA" CDS complement(158506..159849) /codon_start=1 /transl_table=11 /gene="fadE1" /locus_tag="BQ2027_MB0136C" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE1" /note="Mb0136c, fadE1, len: 447 aa. Equivalent to Rv0131c, len: 447 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 447 aa overlap). Probable fadE1, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. ACDS_HUMAN|P16219 acyl-CoA dehydrogenase short-chain specific precursor (412 aa), FASTA scores: opt: 522, E(): 1.4e-23, (30.1% identity in 425 aa overlap). Also highly similar to MTCI5_28 from Mycobacterium tuberculosis. Protein product from Mb0136c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0136c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247838.1" /translation="MPVRRRAGERLPTVWDFETDPQYQSKLDWVEKFMAEELEPLDLV ALDPYDKKNADTMAILRPLQRQVKDQGLWAAHLRPELGGQGFGQVKLALLNEIIGRSR WAPSAFGCQAPDSGNAEILALFGTDEQKARYLRPLLDGEITSCYSMTEPQGGSDPGLF VTAATRDAAGNGDWIINGEKWFSTNAKHASFFIVMAVTKPEARTYEKMSLFIVPADTP GIEIVRNVGVGAESTRHASHGYIRYHDVRVPADHVLGGEGQAFMIAQTRLGGGRIHHA MRTIALARRAFDMMCERALSRQTRHGRLADLQMTQEKIADSWIQIEQFRLLVLRTAWL IDKHHDYQKVRRDIAAVKVAMPQVLHDVVQRAMHLHGALGVSDEMPFVKMMLAAESLG IADGATELHKMTVARRTLREYQPVTTLFPSQHIPTRRAHAEAWLAQRLEHAIAEF" CDS complement(159891..160973) /codon_start=1 /transl_table=11 /gene="fgd2" /locus_tag="BQ2027_MB0137C" /product="putative f420-dependent glucose-6-phosphate dehydrogenase fgd2" /note="Mb0137c, fgd2, len: 360 aa. Equivalent to Rv0132c, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 360 aa overlap). Putative fgd2, F420-dependent glucose-6-phosphate dehydrogenase (EC 1.-.-.-), highly similar to many from Mycobacteria e.g. AAD38167|g5031431 from Mycobacterium chelonae. Also similar to MJ1534|Q58929 N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE from METHANOCOCCUS JANNASCHII (342 aa), FASTA scores: opt: 285, E(): 7.9e-11, (28.4% identity in 292 aa overlap). And also similar to Rv0953c, Rv0791c, etc from Mycobacterium tuberculosis. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb0137c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247839.1" /translation="MTGISRRTFGLAAGFGAIGAGGLGGGCSTRSGPTPTPEPASRGV GVVLSHEQFRTDRLVAHAQAAEQAGFRYVWASDHLQPWQDNEGHSMFPWLTLALVGNS TSSILFGTGVTCPIYRYHPATVAQAFASLAILNPGRVFLGLGTGKRLNEQAATDTFGN YRERHDRLIEAIVLIRQLWSGERISFTGHYFRTDELKLYDTPAMPPPIFVAASGPQSA TLAGRYGDGWIAQARDINDAKLLAAFAAGAQAAGRDPTTLGKRAELFAVVGDDKAAAR AADLWRFTAGAVDQPNPVEIQRAAESNPIEKVLANWAVGTDPGVHIGAVQAVLDAGAV PFLHFPQDDPITAIDFYRTNVLPELR" CDS 161060..161665 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0138" /product="gcn5-related n-acetyltransferase" /note="Mb0138, -, len: 201 aa. Equivalent to Rv0133, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 201 aa overlap). Putative acetyltransferase (EC 2.3.1.-), highly similar to others e.g. PUAC_STRLP|P13249 puromycyn N-acetyltransferase (199 aa), FASTA scores: opt: 341, E(): 1.8e-16, (33.3% identity in 201 aa overlap). Protein product from Mb0138 detected using SWATH mass spectrometry. Mb0138 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247840.1" /translation="MTPQARPARRADVRELSRTMARAFYDDPFMSWLLSNDNARTARL TRLFATIVRHQHLAGGGVEVARGAAGIGGAALWDPPDRWRESRRQQLAMTPGFLRVFG FRTAKARAALDVMMRVHPEEPHWYLAAIGSDPTVRGQGFGQVLMRSRLDRCDAEHCPA YLESTKPENVPYYQRFGFRVTREIALPDAGPPLWAMWREPR" CDS 161962..162345 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0139-1" /note="Mb0139-1, -, len: 128 aa. Pseudogene part1 of Rv0134 / ephF Protein product from Mb0139-1 detected using SWATH mass spectrometry. Mb0139-1 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." CDS 162345..162452 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0139-2" /note="Mb0139-2, -, len: 36 aa. Pseudogene part2 of Rv0134 / ephF,Mb0139-2 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." CDS 162454..162693 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0139-3" /note="Mb0139-3, -, len: 80 aa. Pseudogene part3 of Rv0134 / ephF,Mb0139-3 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." CDS 162696..162863 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0139-4" /note="Mb0139-4, -, len: 55 aa. Pseudogene part4 of Rv0134 / ephF. Mb0139-4 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." CDS complement(162834..163439) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0140C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0140c, -, len: 201 aa. Equivalent to Rv0135c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Possible transcriptional regulator, weakly similar to others e.g. P32398|YHGD_BACSU HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Bacillus subtilis (191 aa), FASTA scores: opt: 145, E(): 0.0012, (21.0% identity in 162 aa overlap). Protein product from Mb0140c detected using SWATH mass spectrometry. Mb0140c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247845.1" /translation="MTAVAAGALVVETDSFRLRLLDGLVASIGERGYRATTVSDIVRH ARTSKRTFYDRFTSKEQCFLELLLADNETLGNSIRAAVDPNADWHDQIRQAVEAYVTH IESRPAVTLSWIREFPSLGAAAYPVQRRGMEQLTSLLIELSASPGFRRANLPPLNVPL AVILLGGLRELTALTVEDGQPIRNIVEPAVDASIALLGPRS" CDS 163556..164881 /codon_start=1 /transl_table=11 /gene="cyp138" /locus_tag="BQ2027_MB0141" /product="PROBABLE CYTOCHROME P450 138 CYP138" /note="Mb0141, cyp138, len: 441 aa. Equivalent to Rv0136, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 441 aa overlap). Probable cyp138, cytochrome P450 138 (EC 1.14.-.-), similar to others e.g. SLR0574|Q59990 from SYNECHOCYSTIS SP. (444 aa), FASTA scores: opt: 315, E(): 1e-13, (25.7% identity in 416 aa overlap); etc. Also similar to MTV039_6 from Mycobacterium tuberculosis (472 aa), FASTA score: (38.2% identity in 442 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb0141 detected using SWATH mass spectrometry. Mb0141 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247846.1" /translation="MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVR RYGKAFTANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALDGD DHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSMMHITLNAILR AIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRLSPWGRLAEWRRQYDTVID KLIEAERADPNFADRTDVLALMLRSTYDDGSIMSRKDIGDELLTLLAAGHETTAATLG WAFERLSRHPDVLAALVEEVDNGGHELRQAAILEVQRARTVIDFAARRVNPPVYQLGE WVIPRGYSIIINIAQIHGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAA FANMEMDVVLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR" CDS complement(164902..165450) /codon_start=1 /transl_table=11 /gene="msrA" /locus_tag="BQ2027_MB0142C" /product="PROBABLE PEPTIDE METHIONINE SULFOXIDE REDUCTASE MSRA (PROTEIN-METHIONINE-S-OXIDE REDUCTASE) (PEPTIDE MET(O) REDUCTASE)" /note="Mb0142c, msrA, len: 182 aa. Equivalent to Rv0137c, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Probable msrA, peptide methionine sulfoxide reductase (EC 1.8.4.6), equivalent to CAC32179.1|AL583926 putative peptide methionine sulfoxide from Mycobacterium leprae (177 aa). Highly similar to others e.g. CAC18703.1|AL451182 putative peptide methionine sulfoxide reductase from Streptomyces coelicolor (172 aa); PMSR_SCHPO|Q09859 putative peptide methionine sulfoxide reductase from Streptomyces (187 aa), FASTA scores: opt: 468, E(): 9.9e-26, (45.6% identity in 158 aa overlap); etc. BELONGS TO THE MSRA FAMILY. Protein product from Mb0142c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0142c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247847.1" /translation="MTSNQKAILAGGCFWGLQDLIRNQPGVVSTRVGYSGGNIPNATY RNHGTHAEAVEIIFDPTVTDYRTLLEFFFQIHDPTTKDRQGNDRGTSYRSAIFYFDEQ QKRIALDTIADVEASGLWPGKVVTEVSPAGDFWEAEPEHQDYLQRYPNGYTCHFVRPG WRLPRRTAESALRASLSPELGT" CDS 165513..166016 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0143" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0143, -, len: 167 aa. Equivalent to Rv0138, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 167 aa overlap). Conserved hypothetical protein, showing weak similarity to Q10827|YT10_MYCTU HYPOTHETICAL 17.0 KDA PROTEIN from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 131, E(): 0.047, (31.15% identity in 106 aa overlap). Mb0143 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247848.1" /translation="MSASEFSRAELAAAFEKFEKTVARAAATRDWDCWVQHYTPDVEY IEHAAGIMRGRQRVRAWIQETMTTFPGSHMVAFPSLWSVIDESTGRIICELGNPMLDP GDGSVISATNISIITYAGNGQWCRQEDIYNPLRFLRAAMKWCRKAQELGTLDEDAARW MRRHGGP" CDS 166017..167039 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0144" /product="possible oxidoreductase" /note="Mb0144, -, len: 340 aa. Equivalent to Rv0139, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 340 aa overlap). Putative oxidoreductase (EC 1.-.-.-), similar to others e.g. O34285|HPNA HPNA PROTEIN from Zymomonas mobilis (337 aa), FASTA scores: opt: 507, E (): 5.8e-27, (31.1% identity in 328 aa overlap); TRE_STRGR|P29782 dtdp-glucose 4,6-dehydratase (328 aa), FASTA scores: opt: 254, E(): 2.6e-10, (29.0% identity in 307 aa overlap). Protein product from Mb0144 detected using SWATH mass spectrometry. Mb0144 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247849.1" /translation="MNAPKLVIGANGFLGSHVTRQLVADCAPQKGEVRAMVRPAANTR SIDDLPLTRFHGDVFDTATVAEAMAGCDDVYYCVVDTRAWLRDPSPLFRTNVAGLRNV LDVATDASLRRFVFTSSYATVGRRRGHVATEEDRVDTRKVTPYVRSRVAAEDLVLQYA HDAGLPAVAMCVSTTYGGGDWGRTPHGAFIAGAVFGRLPFTMRGIRLEAVGVDDAARA LILAAERGHNGERYLISERMMPLQEVVRIAADEAGVPPPRWSISVPVLYALGALGSLR ARLTGKDTELSLASVRMMRSEADVDHGKAVRELGWQPRPVEESIREAARFWAAMRTVG KDPAAS" CDS 167100..167480 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0145" /product="COGs COG2343" /note="Mb0145, -, len: 126 aa. Equivalent to Rv0140, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Conserved hypothetical protein, similar to others e.g. P74567|D90916_48 HYPOTHETICAL 20.8 KDP ROTEIN from Synechocystis sp. (180 aa), FASTA scores: opt: 229, E(): 4.7e-10, (36.1% identity in 108 aa overlap). Also similar to Rv1056 and Rv1670 from Mycobacterium tuberculosis. Protein product from Mb0145 detected using SWATH mass spectrometry. Mb0145 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247850.1" /translation="MSNRIVLEPSADHPITIEPTNRRVQVRVNGEVVADTAAALCLQE ASYPAVQYIPLADVVQDRLIRTETSTYCPFKGEASYYSVTTDAGDIVDDVMWTYENPY PAVAAIAGHVACYPDKAEISIFPG" CDS complement(167461..167871) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0146C" /product="Ketosteroid isomerase-related protein" /note="Mb0146c, -, len: 136 aa. Equivalent to Rv0141c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Hypothetical unknown protein. Protein product from Mb0146c detected using SWATH mass spectrometry. Mb0146c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247851.1" /translation="MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTR TELDKKTFRRAVERRKQVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCI FETRDGMIISLREYSDTQSLAEVYPVACATPGRC" CDS 167901..168827 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0147" /product="3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase" /note="Mb0147, -, len: 308 aa. Equivalent to Rv0142, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Conserved hypothetical protein, similar, except in N-terminus, to AB88922.1|AL353862 hypothetical protein SCE34.20 from Streptomyces coelicolor (326 aa). Mb0147 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247852.1" /translation="MRSIDVVVEAVVTFAGAAGFAHTLAPLRRGQQDPCFRVPGDGTI WRTSLLPTGPVTARISRAGRDAARCVAWGSGAEEFVDMAPAMLGAADDASDFVPLHPA VAAAHRRLPNLRLGRTGQVLEALIPAVIEQRVPGADAFRSWRLLVSKYGTQAPGPAPP GMRVPPSAEVWRHIPSWEFHRANVDPGRARAVVGCAQRAASLERLVSLPAARAAEALT SLPGVGVWTAAETTQRVFGDADAVSVGDYHIPKMIGWTLVGRPVDDAGMLELLEPMRP HRHRVVRLLEASGLAREPRRGPRLPVQNIRAL" CDS complement(168894..170372) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0148C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0148c, -, len: 492 aa. Equivalent to Rv0143c, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 492 aa overlap). Probable conserved transmembrane protein, CIC family possibly involved in transport of chloride, similar to others and hypothetical proteins e.g. O28857 PUTATIVE CHLORIDE CHANNEL from Archaeoglobus fulgidus (589 aa), FASTA scores: opt: 966, E(): 0, (37.7% identity in 453 aa overlap); YADQ_ECOLI|P37019 hypothetical 46.0 kd protein (436 aa), FASTA scores: opt: 452, E(): 2.4e-20, (28.0% identity in 460 aa overlap). Protein product from Mb0148c detected using SWATH mass spectrometry. Mb0148c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247853.1" /translation="MAPGDWSVFAWHAANLPTMPEAEDIGNEAAGGRLGVSIRSAGYL RKWFLLGITIGVIAGLGAVVFYLALKYTSEFLLGYLADYQIPTPVGEGGHRGSTGFAR PWAIPLVTTGGAVLSALIVAKLAPEATGHGTDEAIESVHGDPRAIRGRAVLVKMVASA LTIGSGGSGGREGPTAQISAGFCSLLTRRLNLSNEDGRTAVALGIGAGIGAIFAAPLG GAALGASIPYRDDFDYRNLLPGFIASGTAYAVLGAFLGFDPLFGYIDAEYRFEKAWPL LWFVVIGLIAAAVGYLYARVFHASVAITRRLPGGPVLKPAIGGLLVGLLGLPIPQILS SGYGWAQLAADRGTLLSIPLWIVIVLPIAKILATSLSIGTGGSGGLFGPGIVIGAFVG AAIWRLGELTELPGVPHEPGIFVVVAMMACFGSVSRAPLAVMIMVAEMTGSFSVVPGA IIAVGIAALLLSRTNVTIYETQRLNRQTAEAERGGSDRPTTA" CDS 170474..171316 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0149" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0149, -, len: 280 aa. Equivalent to Rv0144, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Probable transcriptional regulator, possibly tetR family. Has region similar to others e.g. Q59431|UIDR_ECOLI|GUSR|B1618|Z2623|ECS2326 UID OPERON REPRESSOR (GUS OPERON) from Escherichia coli strains K12 and O157:H7 (196 aa), FASTA scores: opt: 214, E(): 1.1e-06, (26.0% identity in 196 aa overlap). Contains probable helix-turn helix motif from aa 109-130 (Score 1463, +4.17 SD). COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb0149 detected using shotgun mass spectrometry. Mb0149 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247854.1" /translation="MPHSWTPTSVMTPPLVVAAFRPVGHYRLATDRAGGPCSPPATGA KLTSSVASRPTVGTKPQWWHTLVMSMSLTAGRGPGRPPAAKADETRKRILHAARQVFS ERGYDGATFQEIAVRADLTRPAINHYFANKRVLYQEVVEQTHELVIVAGIERARREPT LMGRLAVVVDFAMEADAQYPASTAFLATTVLESQRHPELSRTENDAVRATREFLVWAV NDAIERGELAADVDVSSLAETLLVVLCGVGFYIGFVGSYQRMATITDSFQQLLAGTLW RPPT" CDS 171405..172358 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0150" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0150, -, len: 317 aa. Equivalent to Rv0145, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 317 aa overlap). Conserved hypothetical protein, highly similar to many e.g. CAC32172.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (310 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv0731c, etc. Protein product from Mb0150 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0150 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247855.1" /translation="MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAA TNPLIRDEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFFDE YFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKAGILQSHGAVP TARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPYLPGDAADRLFDMVTALSA PGSQVAVEAFTMNTKGNTQRWNRMRERLGLDIDVQALTYHEPDRSDAAQWLATHGWQV HSVSNREEMARLGRAIPQDLVDETVRTTLLRGRLVTPAQPA" CDS 172401..173333 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0151" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0151, -, len: 310 aa. Equivalent to Rv0146, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Conserved hypothetical protein, highly similar to others e.g. AC30975.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (304 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv0731c, etc. Protein product from Mb0151 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0151 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247856.1" /translation="MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLL VTNAGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFASAVAAG IRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVTPSAGRREVPA DLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRLFTQVGAVSVAGSRIAAET APVHGEERRAEMRARFKKVADVLGIEQTIDVQELVYHDQDRASVADWLTDHGWRARSQ RAPDEMRRVGRWVEGVPMADDPTAFAEFVTAERL" CDS 173428..174948 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0152" /product="probable aldehyde dehydrogenase (nad+) dependent" /note="Mb0152, -, len: 506 aa. Equivalent to Rv0147, len: 506 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 506 aa overlap). Probable aldehyde dehydrogenase (NAD+) dependant (EC 1.2.1.-), similar to others e.g. DHAP_RAT|P11883 aldehyde dehydrogenase (dimeric NADP-preferring) (452 aa), FASTA scores: opt: 1291, E(): 0, (43.9% identity in 453 aa overlap). Also similar to several Mycobacterium tuberculosis aledehyde dehydrogenases e.g. Rv0768, Rv2858c, etc. Contains PS00687 aldehyde dehydrogenases glutamic acid active site, and PS00070 aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Protein product from Mb0152 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0152 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247857.1" /translation="MSDRVKAVAPPDGRTMMTTESVARKTQKSETEAPREPAPVSDEK QTDVAKTVARLRKTFASGRTRSVEWRKQQLRALQKLMDENEDAIAAALAEDLDRNPFE AYLADIATTSAEAKYAAKRVRRWMRRRYLLLEVPQLPGRGWVEYEPYGTVLIIGAWNY PFYLTLGPAVGAIAAGNAVVLKPSEIAAASAHLMTELVYRYLDTEAIAVVQGDGAVSQ ELIAQGFDRVMFTGGTEIGRKVYEGAAPHLTPVTLELGGKSPVIVAADADVDVAAKRI AWIKLLNAGQTCVAPDYVLADATVRDELVSKITAALTKFRSGAPQGMRIVNQRQFDRL SGYLAAAKTDAAADGGGVVVGGDCDASNLRIQPTVVVDPDPDGPLMSNEIFGPILPVV TVKSLDDAIRFVNSRPKPLSAYLFTKSRAVRERVIREVPAGGMMVNHLAFQVSTAKLP FGGVGASGMGAYHGRWGFEEFSHRKSVLTKPTRPDLSSFIYPPYTERAIKVARRLF" CDS 175023..175883 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0153" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb0153, -, len: 286 aa. Equivalent to Rv0148, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable short-chain dehydrogenase (EC 1.-.-.-), similar to others, in particular Estradiol 17 beta-dehydrogenases (EC 1.1.1.62), e.g. DHB4_MOUSE|P51660 estradiol 17 beta-dehydrogenase 4 (735 aa), FASTA scores: opt: 952, E(): 0, (52.5% identity in 276 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb0153 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0153 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247858.1" /translation="MPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARDG TGAGSAMADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAVHGVVSNAGI LRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSYGRVVVATSTSGLFGNFG QTNYGAAKLGLVGLINTLALEGAKYNIHANALAPIAATRMTQDILPPEVLEKLTPEFV APVVAYLCTEECADNASVYVVGGGKVQRVALFGNDGANFDKPPSVQDVAARWAEITDL SGAKIAGFKL" CDS 175890..176858 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0154" /product="possible quinone oxidoreductase (nadph:quinone oxidoreductase) (zeta-crystallin)" /note="Mb0154, -, len: 322 aa. Equivalent to Rv0149, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 322 aa overlap). Putative quinone oxidoreductase (EC 1.6.5.-), similar to others oxidoreductases e.g. Q08257 quinone oxidoreductase (EC 1.6.5.5) (329 aa), FASTA scores: opt: 397, E(): 3.2e-18, (28.4% identity in 328 aa overlap); SCHCOADH_4 from Streptomyces coelicolor. Also similar to many proteins from Mycobacterium tuberculosis. Contains PS01162 Quinone oxidoreductase / zeta-crystallin signature. BELONG TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, QUINONE OXIDOREDUCTASE SUBFAMILY. Protein product from Mb0154 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0154 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247859.1" /translation="MKACVVKELSGPSGMVYTDIDEVSGDGGKVVIDVRAAGVCFPDL LLTKGEYQLKLTPPFVPGMETAGVVRSAPSDAGFHVGERVSAFGVLGGYAEQIAVPVA NVVRSPVELDDAGAVSLLVNYNTMYFALARRAALRPGDTVLVLGAAGGVGTAAVQIAK AMQAGKVIAMVHREGAIDYVASLGADVVLPLTEGWAQQVRDHTYGQGVDIVVDPIGGP TFDDALGVLAIDGKLLLIGFAAGAVPTLKVNRLLVRNISVVGVGWGEYLNAVPGSAAL FAWGLNQLVFLGLRPPPPQRYPLSEAQAALQSLDDGGVLGKVVLEP" CDS complement(176855..177142) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0155C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0155c, -, len: 95 aa. Equivalent to Rv0150c, len: 95 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 95 aa overlap). Conserved hypothetical protein, showing some similarity with C-terminus of O53949|Rv1800|MTV049.22 PPE-FAMILY PROTEIN from Mycobacterium tuberculosis (655 aa), FASTA score: (36.5% identity in 104 aa overlap)." /protein_id="CAB5247860.1" /translation="MLTLPDDRAPTGLPDPGIEALAHTKIASTISTVVADGYAVVLST ADIANSLLANAIGYPIAASVALVTPAAGANSSCWPADPSQHHRIAESRACA" CDS complement(177733..179499) /codon_start=1 /transl_table=11 /gene="PE1" /locus_tag="BQ2027_MB0156C" /product="pe family protein pe1" /note="Mb0156c, PE1, len: 588 aa. Equivalent to Rv0151c, len: 588 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 588 aa overlap). Member of the Mycobacterium tuberculosis PE family, with N-terminal region similar to others e.g. MTV032_2 PE_PGRS family from Mycobacterium tuberculosis (468 aa), FASTA scores: opt: 1125, E(): 0, (46.3% identity in 456 aa overlap); MTCY493_24 from M. tuberculosis FASTA score: (42.5% identity in 558 aa overlap). Also similar to upstream ORF MTCI5.26c FASTA score: (54.7% identity in 464 aa overlap). Also shows similarity to C-terminal part of some PPE family proteins e.g. MTV049_21 from Mycobacterium tuberculosis FASTA score: (41.5% identity in 591 aa overlap). Mb0156c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247861.1" /translation="MAPFGFTPKARHNRGVALRSTYRLDRWVMGPVDKEGWGLSYVFA QPSVLAAAATDLAGIGSAINQATAAVAAPTTGLAAAAADEVSTALATLFGAYGQQFQA ISAQVAAFHNEFTQRLAAAANAFVNAEATNTSALVQEATAGLFKPTSPPVLPPMFNQN TAIIMGGTGSPIPTPSYVNAITTLFIDPVVSNPVVKALVTPEELYPITGVKSLPFQTS VQLGLQILDGAIWEQINAGNHVTVFGYSQSAVIASLEMQHLISLGPNAPSPSQLNFIL IGNEMNPNGGILARIPGLNVTTLGLPFYGATPDNPYPTTTYTLEYDGFADFPRYPLNV LSDINAVFGILTVHTTYADLTPAQIASATQLPTQGTTSNTYYIIETEHLPLLAPLRAI PVIGPPLAALVEPNLEVIVNLGYGDPRFGYSTSPANVPTPFGLFPDVPASVVADALVA GTQQGVNDFMVELPAALNTLPQTPMPAFPPYVPTLLPPPPPPQPATLINIADTFASVV STGYSILLPTADLGLAFVTILPAYDLTLFVNQLAAGNLRAAIELPLAATIGLAALGGM IEFIAIVVTLADITQQLQSFSI" CDS complement(179509..181086) /codon_start=1 /transl_table=11 /gene="PE2" /locus_tag="BQ2027_MB0157C" /product="pe family protein pe2" /note="Mb0157c, PE2, len: 525 aa. Equivalent to Rv0152c, len: 525 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 525 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to ORF downstream Z92770|MTCI5_25 (588 aa), FASTA scores: opt: 1492, E(): 0, (54.7% identity in 464 aa overlap); and to many other PE family type members. Mb0157c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247862.1" /translation="MRCRPPSRNRSAHTARNTRPCSLKSRRFTVRFHQTLAAAANSYA DAEAAIASTRQNQLAVPAAAPTPAAAAMIPPFPANLTTLFFGPTGIPLPPPSMLTPPI RCRSVRRALQAVFTPEELYPLTGVRSLVLNTSVEEGLTILHDAIMVELATTGNAVTVF GWSQSAIIASLEMQRFTAMGGAAPSASDLNFVLVGNEMNPNGGMLARFPDLTLPTLDL TFYGATPSDTIYPTAIYTLEYDGFADFSRYPLNFISDLNAVAGITFVHTKYLDLTPAQ VEGATKLPTSPGYTGVTDYYIIRTENRPLLQPLRAVPVIGDPLADLIQPNLKVIVNLG YGDPNYGYSTSYADVRTPFGLWPNVPPQVIADALAAGTQEGILDFTADLQALSAQPLT LPQIQLPQPADLVAAVAAAPTPAEVVNTLARIISTNYAVLLPTVDIALALVTTLPLYT TQLFVRQLAAGNLINAIGYPLAATVGLGTIDSGRRGIAHPPRGGLGHRSKHRGPRHLT DSRRHRRPPTTVYRPRQ" CDS complement(181345..182175) /codon_start=1 /transl_table=11 /gene="ptbB" /locus_tag="BQ2027_MB0158C" /standard_name="MPtpB" /product="PHOSPHOTYROSINE PROTEIN PHOSPHATASE PTPB (PROTEIN-TYROSINE-PHOSPHATASE) (PTPase)" /note="Mb0158c, ptbB, len: 276 aa. Equivalent to Rv0153c, len: 276 aa, from Mycobacterium tuberculosis strain H27Rv, (99.6% identity in 276 aa overlap). ptbB (alternate gene name: MPtpB), protein-tyrosine-phosphatase (see citation below) (EC 3.1.3.48), showing some similarity to several protein-tyrosine phosphatases, polyketide synthase and aminotransferase e.g. Q05918|IPHP_NOSCO|IPH PROTEIN-TYROSINE-PHOSPHATASE PRECURSOR from Nostoc commune (EC 3.1.3.48) (294 aa), FASTA scores: opt: 150, E(): 0.0096, (26.8% identity in 269 aa overlap); etc. Supposed a secreted protein. Protein product from Mb0158c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0158c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247863.1" /translation="MAVRELPGAWNFRDVADTATALRPGRLFRSSELSRLDDAGRATL RRLGITDVADLRSSREVARRGPGRVPDGIDVHLLPFPDLADDDADDSAPHETAFKRLL TNGGSNGESGESSQSINDAATRYMTDEYRQFPTRNGAQRALHRVVTLLAAGRPVLTHC FAGKDRTGFVVALVLEAVGLDRDVIVADYLRSNDSVPQLRARISEMIQQRFDTELAPE VVTFTKARLSDGVLGVRAEYLAAARQTIDETYGSLGGYLRDAGISQATVNRMRGVLLG " CDS complement(182177..183388) /codon_start=1 /transl_table=11 /gene="fadE2" /locus_tag="BQ2027_MB0159C" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE2" /note="Mb0159c, fadE2, len: 403 aa. Equivalent to Rv0154c, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 403 aa overlap). Probable fadE2, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. C-terminal region of O01590 ACYL-CoA DEHYDROGENASE (974 aa), FASTA scores: opt: 1150, E(): 0, (50.0% identity in 402 aa overlap); ACDS_MEGEL|Q06319 acyl-CoA dehydrogenase (short-chain) (383 aa), FASTA score: (35.0% identity in 306 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb0159c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0159c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247864.1" /translation="MSAKAIDYRTRLSDFMTEHVFGAEADYDDYRRAAGPADHTAPPI IEELKTKAKDRGLWNLFLSAESGLTNLEYAPLAEMTGWSMEIAPEALNCAAPDTGNME ILHMFGTEQQRAQWLRPLLDGKIRSAFSMTEPAVASSDARNIETTISRDGADYVINGR KWWTSGAADPRCKILIVMGRTNPDAAAHQQQSMVLVPIDTPGVTIVRSTPVFGWQDRH GHCEIDYHNVRVPATNLLGEEGSGFAIAQARLGPGRIHHCMRALGAAERALALMVNRV RNRVAFGRPLAEQGVVQQAIAQSRNEIDQARLLCEKAAWTIDQHGNKEARHLVAMIKA VAPRVACDVIDRAIQVHGAAGVSDDTPLARLYGWHRAMRIFDGPDEVHLRSIARAELS REKSTFAAAVT" CDS 183812..184912 /codon_start=1 /transl_table=11 /gene="pntAa" /locus_tag="BQ2027_MB0160" /standard_name="pntAA" /product="PROBABLE NAD(P) TRANSHYDROGENASE (SUBUNIT ALPHA) PNTAa [FIRST PART; CATALYTIC PART] (PYRIDINE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT ALPHA) (NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT ALPHA)" /note="Mb0160, pntAa, len: 366 aa. Equivalent to Rv0155, len: 366 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 366 aa overlap). Probable pntAa, first part of NAD(P) transhydrogenase subunit alpha (EC 1.6.1.2), similar to N-terminus of others e.g. PNTA_ECOLI|P07001|P76888|B1603 NAD (P) transhydrogenase subunit alpha from Escherichia coli strain K12 (510 aa), FASTA scores: opt: 921, E(): 0, (42.1% identity in 361 aa overlap); PROTON-TRANSLOCATING NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT PNTAA (EC 1.6.1.1). Protein product from Mb0160 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0160 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247865.1" /translation="MTDPQTQSTRVGVVAESGPDERRVALVPKAVASLVNRGVAVVVE AGAGERALLPDELYTAVGASIGDAWAADVVVKVAPPTAAEVGRLRGGQTLIGFLAPRN ADNSIGALTQAGVQAFALEAIPRISRAQVMDALSSQANVSGYKAVLLAASESTRFFPM LTTAAGTVKPATVLVLGVGVAGLQALATAKRLGARTTGYDVRPEVADQVRSVGAQWLD LGISASGEGGYARELTDDERAQQQKALEEAISGFDVVITTALVPGRPAPTLVTAAAVE AMKPGSVVVDLAGETGGNCELTEPGRTVVKHDVTIAAPLNLPATMPEHASELYSKNIT ALLDLLIKDGRLAPDFDDEVIAQSCVTRGKDS" CDS 184913..185245 /codon_start=1 /transl_table=11 /gene="pntAb" /locus_tag="BQ2027_MB0161" /standard_name="pntAB" /product="PROBABLE NAD(P) TRANSHYDROGENASE (SUBUNIT ALPHA) PNTAb [SECOND PART; INTEGRAL MEMBRANE PROTEIN] (PYRIDINE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT ALPHA) (NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT ALPHA)" /note="Mb0161, pntAb, len: 110 aa. Equivalent to Rv0156, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 110 aa overlap). Probable pntAb, second part of NAD(P) transhydrogenase subunit alpha, integral membrane protein, similar to C-terminus of others e.g. Q59764 NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT PNTAB (139 aa), FASTA scores: opt: 247, E(): 1.9e-11, (45.5% identity in 88 aa overlap). Protein product from Mb0161 detected using shotgun mass spectrometry. Mb0161 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247866.1" /translation="MCNELLENLAILVLSGFVGFAVISKVPNTLHTPLMSGTNAIHGI VVLGALVVFGEIEHPSLVLQVILFVAVVFGTLNVIGGFIVTDRMLGMFKAKKPAVPAK PDRDEALR" CDS 185242..186669 /codon_start=1 /transl_table=11 /gene="pntB" /locus_tag="BQ2027_MB0162" /product="PROBABLE NAD(P) TRANSHYDROGENASE (SUBUNIT BETA) PNTB [INTEGRAL MEMBRANE PROTEIN] (PYRIDINE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT BETA) (NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT BETA)" /note="Mb0162, pntB, len: 475 aa. Equivalent to Rv0157, len: 475 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 475 aa overlap). Probable pntB, pyridine nucleotide transhydrogenase (nicotinamide nucleotide transhydrogenase) subunit beta (EC 1.6.1.1), integral membrane protein, similar to others e.g. Q59763 PROTON-TRANSLOCATING NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT BETA from HODOSPIRILLUM RUBRUM (464 aa), FASTA scores: opt: 1344, E(): 0, (46.4% identity in 472 aa overlap); P07002|PNTB_ECOLI|P76890|PNTB|B1602|Z2597|ECS2308 NAD(P) TRANSHYDROGENASE SUBUNIT BETA from Escherichia coli strains K12 and O157:H7 (462 aa). Protein product from Mb0162 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0162 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247867.1" /translation="MNLHYLVEILYIISFSLFIYGLMGLTGPKTAVRGNLIAAAGMTI AVAATLVMIRHTSQWPLIIAGLVVGVVLGVPPARLTKMTAMPQLVAFFNGVGGGTVAL IALSEFIDTTGFSAFQHGESPTVHIVVASLFAAIIGSISFWGSIVAFGKLQEIISGRP IGLGKAQQPINLLLLAVAVAAAVVIGLHAHPGSGGVALWWMIGLLVAAGVLGLMVVLP IGGADMPVVISMLNAMTGLSAAAAGLALNNTAMIVAGMIVGASGSILTNLMAKAMNRS IPAIVAGGFGGGGVAPSGGGDDKHVKATSAADAAIQMAYANQVIVVPGYGLAVAQAQH AVKDLATLLEDRGVPVKYAIHPVAGRMPGHMNVLLAEAEVDYDAMKDMDDINDEFART DVTIVIGANDVTNPAARNETSSPIYGMPILNVDKSRSVIVLKRSMNSGFAGIDNPLFY ADGTTMLFGDAKKSVTEVSEELKAL" CDS complement(186685..186813) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0162A" /product="Conserved protein" /note="Mb0162A, len: 42 aa. Equivalent to Rv0157A len: 42 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 42 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein, showing similarity to C-terminal part (aa 186-220) of O53976|Rv1975|MTV051.13 conserved hypothetical protein from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 173, E(): 3e-06, (62.5% identity in 40 aa overlap). Mb0162A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247868.1" /translation="MMDPSPDYDVSDEIEFFFRYLTWGLRGVETGDGYPPPAYPPV" CDS 186975..187619 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0163" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0163, -, len: 214 aa. Equivalent to Rv0158, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Probable transcriptional regulator, possibly TetR family, showing weak similarity to various transcriptional activators and repressors e.g. P32398|YIXD_BACSU|YHGD HYPOTHETICAL TRANSCRIPTIONAL REGULATORY PROTEIN from Bacillus subtilis (191 aa), FASTA scores: opt:172, E(): 2.4e-05, (23.0% identity in 191 aa overlap). Contains helix-turn-helix motif at aa 32-53 (Score 1296, +3.60 SD). COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb0163 detected using SWATH mass spectrometry. Mb0163 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247869.1" /translation="MPSDTSPNGLSRREELLAVATKLFAARGYHGTRMDDVADVIGLN KATVYHYYASKSLILFDIYRQAAEGTLAAVHDDPSWTAREALYQYTVRLLTAIASNPE RAAVYFQEQPYITEWFTSEQVAEVREKEQQVYEHVHGLIDRGIASGEFYECDSHVVAL GYIGMTLGSYRWLRPSGRRTAKEIAAEFSTALLRGLIRDESIRNQSPLGTRKET" CDS complement(187623..189029) /codon_start=1 /transl_table=11 /gene="PE3" /locus_tag="BQ2027_MB0164C" /product="pe family protein pe3" /note="Mb0164c, PE3, len: 468 aa. Equivalent to Rv0159c, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 468 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to many other PE proteins e.g. O06828 from Mycobacterium tuberculosis (528 aa), FASTA scores: opt: 1163, E(): 0, (45.8% identity in 467 aa overlap). Also highly similar to upstream MTV032_3, and to MTCI5_25, MTCI5_26, MTV049_ 21, MTCY1A10_26, etc. Mb0164c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247870.1" /translation="MSYVIAAPEMLATAAADVDGIGSAIRAASASAAGPTTGLLAAAA DEVSSAAAALFSEYARECQEVLKQAAAFHGEFTRALAAAGAAYAQAEASNTAAMSGTA GSSGALGSVGMLSGNPLTALMMGGTGEPILSDRVLAIIDSAYIRPIFGPNNPVAQYTP EQWWPFIGNLSLDQSIAQGVTLLNNGINAELQNGHDVVVFGYSQSAAVATNEIRALMA LPPGQAPDPSRLAFTLIGNINNPNGGVLERYVGLYLTFLDMSFNGATPPDSPYQTYMY TGQYDGYAHNPQYPLNILSDLNAFMGIRWVHNAYPFTAAEVANAVPLPTSPGYTGNTH YYMFLTQDLPLLQPIRAIPFVGTPIAELIQPDLRVLVDLGYGYGYADVPTPASLFAPI NPIAVASALATGTVQGPQAALVSIGLLPQSALPNTYPYLPSANPGLMFNFGQSSVTEL SVLSGALGSVARLIPPIA" CDS complement(189121..190629) /codon_start=1 /transl_table=11 /gene="PE4" /locus_tag="BQ2027_MB0165C" /product="pe family protein pe4" /note="Mb0165c, PE4, len: 502 aa. Equivalent to Rv0160c, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 502 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to many other PE proteins e.g. Z92770|MTCI5_26c from M. tuberculosis (525 aa), FASTA scores: opt: 816, E(): 0, (41.4% identity in 367 aa overlap); C-terminal region of O06801|RV1768|MTCY28.34 from Mycobacterium tuberculosis (618 aa), FASTA scores: opt: 417, E(): 6.7e-18, (53.5% identity in 142 aa overlap). Also highly similar to downstream ORF MTV032_2. Mb0165c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247871.1" /translation="MSHLVTAPDMLATAAAHVDEIASTLRAANAAAAGPTCNLLAAAG DEVSAATAALFSAYGREYQAVVKQAAAFHSEFTRTLEAAGNAYAHAEAANAARVSHAL DTINAPIRTLLGRTPLSPNGSSGAGGLPAIAQLAAESPITALIMGGTNNPLPDPEYVT DINKAFIQTLFPGAVSQGLFTPEQFWPVTPDLGNLTFNQSVTEGVALLNTTVNNQLAL DNKVVAFGYSQSATIINNYINSLMAMGSPNPDDISFVMIGSGNNPVGGLLARFPGFYI PFLDVPFNGATPANSPYPTHIYTAQYDGIAHAPQFPLRILSDINAFMGYFYVHNTYPE LMATQVDNAVPLPTSPGYTGNTQYYMFLTQDLPLLQPIRDIPYAGPPIADLFQPQLRV LVDLGYADYGPGGNYADIPTPAGLFSIPNPFAVTYYLIKGSLQAPYGAIVEIGVEAGL IGPEWFPDSYPWVPSINPGLNFYFGQPQVTLLSLMSGGLGNILHLIPPPVFT" CDS 190797..192146 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0166" /product="possible oxidoreductase" /note="Mb0166, -, len: 449 aa. Equivalent to Rv0161, len 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to hypothetical proteins and various oxidoreductases e.g. AIP2_YEAST|P46681 actin interacting protein 2 (530 aa), FASTA scores: opt: 356, E (): 0, (33.3% identity in 357 aa overlap); DLD1_YEAST|P32891 d-lactate dehydrogenase (cytochrome) (587 aa), FASTA scores: opt: 311, E(): 2.5e-20, (27.9% identity in 366 aa overlap). Also similar to other Mycobacteria proteins e.g. MTCY339.30c from M. tuberculosis FASTA score: (29.4% identity in 357 aa overlap); MLCL622.30c from Mycobacterium tuberculosis (449 aa). Protein product from Mb0166 detected using shotgun mass spectrometry and SWATH mass spectrometry." /protein_id="CAB5247872.1" /translation="MLTSLVSAVGSHHVTTDPDVLAGRSVDHTGRYRGRASALVRPGS AEEVAEVLRVCRDAGAYVTVQGGRTSLVAGTVPEHDDVLLSTERLCVVSDVDTVERRI EIGAGVTLAAVQHAASTAGLVFGVDLSARDTATVGGMASTNAGGLRTVRYGNMGEQVV GLDVALPDGTVLRRHSRVRRDNTGYDLPALFVGAEGTLGVITALDLRLHPTPSHRVTA VCGFAELAALVDAGRMFRDVEGIAALELIDGRAAALTREHLGVRPPVEADWLLLVELA ADHDQTDRLADLLGGARMCGEPAVGVDAAAQQRLWRTRESLAEVLGVYGPPLKFDVSL PLSAISGFARDAVALVHRHVPDSPEALPLLFGHIGEGNLHLNVLRCPPDREPALYAKM MGLIAECGGNVSSEHGVGSRKRAYLGMSRQANDVAAMRRVKAALDPTGYLNAAVLFD" CDS complement(192174..193325) /codon_start=1 /transl_table=11 /gene="adhE1" /locus_tag="BQ2027_MB0167C" /product="probable zinc-type alcohol dehydrogenase (e subunit) adhe1" /note="Mb0167c, adhE1, len: 383 aa. Equivalent to Rv0162c, len: 383 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 383 aa overlap). Probable adhE1, zinc-type alcohol dehydrogenase (EC 1.1.1.1), similar to others e.g. ADH_MACMU|P28469 alcohol dehydrogenase alpha chain (374 aa), FASTA scores: opt: 619, E(): 0, (34.7% identity in 363 aa overlap). Also similar to other alcohol dehydrogenases from Mycobacterium tuberculosis e.g. MTCY369.06c FASTA score: (34.0% identity in 365 aa overlap), MTV022_9 FASTA score: (35.0% identity in 371 aa overlap). Contains PS00059 Zinc-contain ingalcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, CLASS-I SUBFAMILY. COFACTOR: ZINC. Protein product from Mb0167c detected using SWATH mass spectrometry. Mb0167c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247873.1" /translation="MPAVQPWLYSNMPAIRGAVLDQIGVPRPYWRSKPISVVELHLDP PDRGEVLVRIEAAGVCHSDLSVVDGTRVRPVPILLGHEAAGIVEQVGDGVDGVAVGQR VVLVFLPRCGQCAACATDGRTPCEPGSAANKAGTLLGGGIRLSRGGRPVYHHLGVSGF ATHVVVNRASVVPVPHEVPPTVAALLGCAVLTGGGAVLNVGDPQPGQSVAVVGLGGVG MAAVLTALTYTDVRVVAVDQLPEKLSAAKALGAHEIYTPQQATAGGVKAAVVVEAVGH PAALHTAIGLTAPGGRTITVGLPPPDVRISLSPLDFVTEGRSLIGSYLGSAVPSHDIP RFVSLWQSGRLPVESLVTSTIRLDDINEAMDHLADGIAVRQLISFTGDL" CDS 193307..193762 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0168" /product="Propionyl-CoA thioesterase activity" /note="Mb0168, -, len: 151 aa. Equivalent to Rv0163, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Conserved hypothetical protein, similar to others e.g. Q44017 HYPOTHETICAL 16.6 KDA PROTEIN IN GBD 5'REGION (ORF6)from Alcaligenes eutrophus (145 aa), FASTA scores: opt: 155, E(): 0.0002, (26.6% identity in 139 aa overlap). Also weak similary with MTV008.31c|Rv2475c|B70867 from Mycobacterium tuberculosis (138 aa). Mb0168 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247874.1" /translation="MAALPAPEKLLRSDFPVLWPVGTRWADNDMFGHLNNAVYYQLFD TAINAWINTSTGVDPLAMPVLGIVAESGCRYFSELRFPESLMVGLAVTRLGRSSVTYR LGVFKEPDDAGVITALGHWVHVYVDRTSRRPVPIPEAIRSLLSTACVSG" CDS 193816..194301 /codon_start=1 /transl_table=11 /gene="TB18.5" /locus_tag="BQ2027_MB0169" /product="Cyclase/dehydrase family protein" /note="Mb0169, TB18.5, len: 161 aa. Equivalent to Rv0164, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). TB18.5, conserved hypothetical protein, equivalent to CAB08818.1|Z95398 HYPOTHETICAL PROTEIN from Mycobacterium leprae (156 aa) FASTA scores: opt: 762, E(): 0, (76.3% identity in 152 aa overlap). Some similarity to Rv2185c, Rv0854, Rv0857 from Mycobacterium tuberculosis. Alternative start codon has been suggested. 3' part corrected since first submission (-24 aa). Protein product from Mb0169 detected using shotgun mass spectrometry. Mb0169 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247875.1" /translation="MTAISCSPRPRYASRMPVLSKTVEVTADAASIMAIVADIERYPE WNEGVKGAWVLARYDDGRPSQVRLDTAVQGIEGTYIHAVYYPGENQIQTVMQQGELFA KQEQLFSVVATGAASLLTVDMDVQVTMPVPEPMVKMLLNNVLEHLAENLKQRAEQLAA S" CDS complement(194315..195007) /codon_start=1 /transl_table=11 /gene="mce1r" /locus_tag="BQ2027_MB0170C" /product="Probable transcriptional regulatory protein Mce1R (probably GntR-family)" /note="Mb0170c, len: 230 aa. Equivalent to Rv0165c len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (81.2% identity in 218 aa overlap). Probable mce1R,transcriptional regulator, GntR family (See Casali et al.,2006), showing some similarity to several e.g. NTRA_CHELE|P54988 nta operon transcriptional regulator (231 aa), FASTA scores: opt: 154, E(): 0.00058, (32.0% identity in 125 aa overlap); P46833|GNTR_BACLI gluconate operon transcriptional repressor from Bacillus licheniformis (243 aa); GNTR_BACSU gluconate operon repressor from Bacillus subtilis (243 aa). Also similar to Rv0043c from Mycobacterium tuberculosis. Seems to belong to the GntR family of transcriptional regulators. Start changed since first submission (-41 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0165c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp insertion (*-cc) leads to a product with a different COOH terminus. Mb0170c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247876.1" /translation="MNAPLSAKPRSQLPLRRAQLSDEVAGHLRAAIMSGALRSGTFIR LDETAAELGVSVTPVREALLKLRGEGMVGLEPHRGHVVLPLTRQDIDDIFWLQATIAQ ELATSATAHITDVEIDELDRINNALAGAIGSGDAKTIASIEFAFHRVFNKASRRIKLA WFLLNAARYMPAQVFAADPRWGADAVNSHRQLIAALRRRDTAAVIEHTVWQFTDGARR LTEALAETEVFG" CDS 195185..196849 /codon_start=1 /transl_table=11 /gene="fadD5" /locus_tag="BQ2027_MB0172" /product="PROBABLE FATTY-ACID-CoA LIGASE FADD5 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)" /note="Mb0172, fadD5, len: 554 aa. Equivalent to Rv0166, len: 554 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 554 aa overlap). Probable fadD5, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many eg LCFA_ECOLI|P29212 long-chain-fatty-acid--CoA ligase (561 aa), FASTA scores: opt: 612, E(): 0, (29.4% identity in 534 aa overlap). Also similar to many other fatty-acid-CoA ligases from Mycobacterium tuberculosis e.g. MTCY07A7.11c FASTA score: (35.3% identity in 487 aa overlap), MTV013_10, MTY25D10_30, etc. Contains PS00455 putative AMP-binding domain signature. Protein product from Mb0172 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0172 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247877.1" /translation="MTAQLASHLTRALTLAQQQPYLARRQNWVNQLERHAMMQPDAPA LRFVGNTMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNRTEFVESVLAANMIGAI AVPLNFRLTPTEIAVLVEDCAAHVMLTEAALAPVAIGVRNIQPLLSVIVVAGGSSQDS VFGYEDLLNEAGDVHEPVDIPNDSPALIMYTAGTTGRPKGAVLTHANLTGQAMTALYT SGANINSDVGFVGVPLFHIAGIGNMLTGLLLGLPTVIYPLGAFDPGQLLDVLEAEKVT GIFLVPAQWQAVCTEQQARPRDLRLRVLSWGAAPAPDALLRQMSATFPETQILAAFGQ TEMSPVTCMLLGEDAIAKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYRAPTLMS CYWNNPEATAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIISGGENIYCAELENVL ASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRIEDLGEFLTDRLARYKHPKALEI VDALPRNPAGKVLKTELRLRYGACVNVERRSASAGFTERRENRQKL" CDS 197053..197850 /codon_start=1 /transl_table=11 /gene="yrbE1A" /locus_tag="BQ2027_MB0173" /product="conserved integral membrane protein yrbe1a" /note="Mb0173, yrbE1A, len: 265 aa. Equivalent to Rv0167, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). yrbE1A, hypothetical unknown integral membrane protein, part of mce1 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa); O53965|Rv1964|MTV051.02|yrbE3A (265 aa); etc. Also highly similar or similar to conserved hypothetical integral membrane proteins of yrbEA type, e.g. NP_302654.1|NC_002677 conserved membrane protein from Mycobacterium leprae (267 aa); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 328, E(): 1.8e-15, (26.6% identity in 244 aa overlap); etc. Protein product from Mb0173 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0173 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247878.1" /translation="MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWR EFILQCWFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQLG PLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRVLASMLVATLL NGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEVVIATIKAATFGLIAGLVG CYRGLTVRGGSKGLGTAVNETVVLCVIALFAVNVILTTIGVRFGTGR" CDS 197852..198721 /codon_start=1 /transl_table=11 /gene="yrbE1B" /locus_tag="BQ2027_MB0174" /product="conserved integral membrane protein yrbe1b" /note="Mb0174, yrbE1B, len: 289 aa. Equivalent to Rv0168, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). yrbE1B, hypothetical unknown integral membrane protein, part of mce1 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07790|Rv0588|MTCY19H5.34|yrbE2B (295 aa); O53966|Rv1965|MTV051.03|yrbE3B (271 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEB type, e.g. NP_302655.1|NC_002677 conserved membrane protein from Mycobacterium leprae (289 aa); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 223, E(): 7.6e-07, (23.7% identity in 257 aa overlap); etc. Mb0174 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247879.1" /translation="MSTAAVLRARFPRAVANLRQYGGAAARGLDEAGQLTWFALTSIG QIAHALRYYRKETLRLIAQIGMGTGAMAVVGGTVAIVGFVTLSGSSLVAIQGFASLGN IGVEAFTGFFAALINVRIAGPVVTGVALAATVGAGATAELGAMRISEEIDALEVMGIK SISFLASTRIMAGLVVIIPLYALAMIMSFLSPQITTTVLYGQSNGTYEHYFQTFLRPD DVFWSFLEALIITAIVMVSHCYYGYAAGGGPVGVGEAVGRSMRFSLVSVQVVVLFAAL ALYGVDPNFNLTV" CDS 198726..200090 /codon_start=1 /transl_table=11 /gene="mce1A" /locus_tag="BQ2027_MB0175" /standard_name="mce1" /product="MCE-FAMILY PROTEIN MCE1A" /note="Mb0175, mce1A, len: 454 aa. Equivalent to Rv0169, len: 454 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 454 aa overlap). mce1A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa); etc. Also highly similar to others e.g. AAD52105.1|AF113402_1|AF113402 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa); NP_302656.1|NC_002677 putative cell invasion protein from Mycobacterium leprae (441 aa); AAA92845.1|U26018 mce gene product from Mycobacterium avium (88 aa) (similarity on C-terminus); CAC12798.1|AL445327 putative secreted protein from Streptomyces coelicolor (418 aa); etc. Note that equivalent, but longer 22 aa, to P72013|CAA50257.1|X70901 Mcep protein from Mycobacterium tuberculosis (432 aa). Contains a very hydrophobic region around residues 20-35. Note that previously known as mce1. Protein product from Mb0175 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0175 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247880.1" /translation="MTTPGKLNKARVPPYKTAGLGLVLVFALVVALVYLQFRGEFTPK TQLTMLSARAGLVMDPGSKVTYNGVEIGRVDTISEVTRDGESAAKFILDVDPRYIHLI PANVNADIKATTVFGGKYVSLTTPKNPTKRRITPKDVIDVRSVTTEINTLFQTLTSIA EKVDPVKLNLTLSAAAEALTGLGDKFGESIVNANTVLDDLNSRMPQSRHDIQQLAALG DVYADAAPDLFDFLDSSVTTARTINAQQAELDSALLAAAGFGNTTADVFDRGGPYLQR GVADLVPTATLLDTYSPELFCTIRNFYDADPLAKAAAGGGNGYSLRTNSEILSGIGIS LLSPLALATNGAAIGIGLVAGLIASPLAVAANLAGALPGIVGGAPNPYTYPENLPRVN ARGGPGGAPGCWQPITRDLWPAPYLVMDTGASLAPYNHMEVGSPYAVEYVWGRQVGDN TINP" CDS 200087..201127 /codon_start=1 /transl_table=11 /gene="mce1B" /locus_tag="BQ2027_MB0176" /standard_name="mceD" /product="MCE-FAMILY PROTEIN MCE1B" /note="Mb0176, mce1B, len: 346 aa. Equivalent to Rv0170, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 346 aa overlap). mce1B (alternate gene name: mceD); belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07788|Rv0590|MTCY19H5.32c|mce2B (275 aa); O53968|Rv1967|MTV051.05|mce3B (342 aa); etc. Also highly similar to others e.g. NP_302657.1|NC_002677 putative secreted protein from Mycobacterium leprae (346 aa); CAC12797.1|AL445327 putative secreted protein from Streptomyces coelicolor (354 aa); etc. Contains hydrophobic region in N-terminal 30 residues. In Escherichia coli, N-terminal part is functional and directs export of a leaderless beta-lactamase into the periplasm (see seventh citation). Protein product from Mb0176 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0176 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247881.1" /translation="MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFS NVSGLRQGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYSDL IGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVFRALDPAKVNN IANALITVFQGQGGTINDTLDQTAQLTSQIAERDQAIGEVVKNLNIVLDTTVKHRKEF DETVNNLENLITGLRNHSDQLAGGLAHISNGAGTVADLLAENRTLVRKAVSYLDAIQQ PVIDQRVELDDLLHKTPTALTALGRANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVK LFSQPTGRCTPQ" CDS 201124..202671 /codon_start=1 /transl_table=11 /gene="mce1C" /locus_tag="BQ2027_MB0177" /product="MCE-FAMILY PROTEIN MCE1C" /note="Mb0177, mce1C, len: 515 aa. Equivalent to Rv0171, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 515 aa overlap). mce1C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07787|Rv0591|MTCY19H5.31|mce2C (481 aa); O53969|Rv1968|MTV051.06|mce3C (410 aa); etc. Also highly similar to others e.g. NP_302658.1|NC_002677 putative secreted protein from Mycobacterium leprae (519 aa); CAC12796.1|AL445327 putative secreted protein from Streptomyces coelicolor (351 aa); etc. Weakly similar to downstream ORF Rv0172|MTCI28.12|mce1D (530 aa), FASTA score: (24.6% identity in 552 aa overlap). Contains possible signal sequence and highly proline-rich C-terminus. Protein product from Mb0177 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0177 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247882.1" /translation="MRTLEPPNRMRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYG QFTDSGGLHKGDRVRIAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTDTI LGRKVLEIEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWDIETVKRSLNV LSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQANQVASILGDRSDQVDRLL VNAKTLIAAFNERGRAVDALLGNISAFSAQVQNLINDNPNLNHVLEQLRILTDLLVDR KEDLAETLTILGRFSASFGETFASGPYFKVLLANLVPGQILQPFVDAAFKKRGISPED FWRSAGLPAYRWPDPNGTRFPNGAPPPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGL PRPWDPLPCANLTQGPFGGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVP GTPVPIPQEAPPGARTLPLGPAPGPAPPPAAPGPPAPPGPGPQLPAPFINPGGTGGSG VTGGSEN" CDS 202668..204260 /codon_start=1 /transl_table=11 /gene="mce1D" /locus_tag="BQ2027_MB0178" /product="MCE-FAMILY PROTEIN MCE1D" /note="Mb0178, mce1D, len: 530 aa. Equivalent to Rv0172, len: 530 aa, fromMycobacterium tuberculosis strain H37Rv, (100.0% identity in 530 aa overlap). mce1D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07786|Rv0592|MTCY19H5.30c|mce2D (508 aa); O53970|Rv1969|MTV051.07|mce3D (423 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Hydrophobic region at N-terminus. Protein product from Mb0178 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0178 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247883.1" /translation="MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKL TNNTVVAYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPANAS AVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRDSVSHIIDELG PTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALNALNEGRGDFFAVVRSLAL FVNALHQDDQQFVALNKNLAEFTDRLTHSDADLSNAIQQFDSLLAVARPFFAKNREVL THDVNNLATVTTTLLQPDPLDGLETVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFA NPMEFICSSIQAGSRLGYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEI AYSEPRLQPPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPPPPGPDVIP GPVPPTPAPVGAPLPAEAGGGQ" CDS 204257..205429 /codon_start=1 /transl_table=11 /gene="lprK" /locus_tag="BQ2027_MB0179" /standard_name="mce1E" /product="POSSIBLE MCE-FAMILY LIPOPROTEIN LPRK (MCE-FAMILY LIPOPROTEIN MCE1E)" /note="Mb0179, lprK, len: 390 aa. Equivalent to Rv0173, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 390 aa overlap). Possible lprK (alternate gene name: mce1E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07785|LPRL|Rv0593|MTCY19H5.29|mce2E (402 aa); O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa); etc. Also highly similar to others e.g. NP_302660.1|NC_002677 putative lipoprotein from Mycobacterium leprae (392 aa); CAC12794.1|AL445327 putative secreted protein from Streptomyces coelicolor (413 aa); etc. Contains PS00013 prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0179 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0179 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247884.1" /translation="MMSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGG PGTGPGSYTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDVTL PKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPTTEQTLASIAT LLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTDELNQQRDDITRAIDSTNR LLAYVGGRSEVLNRVLTDLPPLIKHFADKQELLINASDAVGRLSQSADQYLSAARGDL HQDLQALQCPLKELRRAAPYLVGALKLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYS AIDNAFLTGTGFSGALRALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC" CDS 205423..206970 /codon_start=1 /transl_table=11 /gene="mce1F" /locus_tag="BQ2027_MB0180" /product="MCE-FAMILY PROTEIN MCE1F" /note="Mb0180, mce1F, len: 515 aa. Equivalent to Rv0174, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 515 aa overlap). mce1F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), similar to Mycobacterium tuberculosis proteins O07784|Rv0594|MTCY19H5.28c|mce2F (516 aa); O53972|Rv1971|MTV051.09|mce3F (437 aa); etc. Also highly similar to others e.g. NP_302661.1|NC_002677 putative secreted protein from Mycobacterium leprae (516 aa); AAF74993.1|AF143400_1|AF143400|996A027a protein from Mycobacterium avium (80 aa) (similarity on C-terminus); CAC12793.1|AL445327 putative secreted protein from Streptomyces coelicolor (433 aa); etc. Has hydrophobic stretch, possibly a signal peptide at the N-terminus. Protein product from Mb0180 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0180 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247885.1" /translation="MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKA DLPASGGLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHSVS AVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAALPTEKIGLLLD ETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIENSGPILDSQVNTGDQIERW ARKLNNLAAQTATRDQNVRSILSQAAPTADEVNAVFSGVRDSLPQTLANLEVVFDMLK RYHAGVEQLLVFLPQGAAIAQTVLTPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSP ADTSPRPLPSGTYCKIPQDAQLQVRGARNIPCVDVPGKRAATPKECRSKDPYVPLGTN PWFGDPNQILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSSTTGDDGWK EMLAPAS" CDS 207006..207647 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0181" /product="PROBABLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN" /note="Mb0181, -, len: 213 aa. Equivalent to Rv0175, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Probable conserved Mce-associated membrane protein, equivalent, but longer in N-terminus, to CAC32127.1|AL583926 possible membrane protein from Mycobacterium leprae (182 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv1973, etc. Contains two 12 residue direct repeats at N-terminus. Protein product from Mb0181 detected using shotgun mass spectrometry. Mb0181 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247886.1" /translation="MKAADSAESDAGADQTGPQVKAADSAESDAGELGEDACPEQALV ERRPSRLRRGWLVGIAATLLALAGGLGAAGYFALRSHQESQSIAREDLAAIEAAKDCV AATQAPDAGAMSASMQKIIECGTGDFGAQASLYTSMLVEAYQAASVHVQVTDMRAAVE RNNNDGSVDVLVALRVKVSNTDSDAHEVGYRLRVRMALDEGRYKIAKLDQVTK" CDS 207644..208612 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0182" /product="PROBABLE CONSERVED MCE ASSOCIATED TRANSMEMBRANE PROTEIN" /note="Mb0182, -, len: 322 aa. Equivalent to Rv0176, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 322 aa overlap). Probable conserved Mce-associated transmembrane protein. Contains short region of similarity to PRA_MYCLE|P41484 proline-rich antigen (36 kDa antigen) from Mycobacterium leprae (249 aa) (outside the proline-rich region), FASTA scores: opt: 165, E(): 2.9e-05, (40.0% identity in 65 aa overlap). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv3493c, etc. Protein product from Mb0182 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0182 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247887.1" /translation="MTVVVEKTPTTLPQATPNGAAPWHVRAGAFAIDVLPGLAVAATM ALTALTVPPGSAWRWLCACLLGLTILLLAVNRLLLPTITGWSLGRALTGIRVVRRDGS AIGPWRLLVRDLAHLVDTLSLFVGWLWPLWDSRRRTFADLLLRTEVRRVEPVQRPAVI RRLTAAVALAAAGACASATAVGAAVVYVNEWQTDHTRAQLATRGPKLVVDVLSYDPET VQRDFERARSLATDRYRPQLSIQQDSVRESGPVRNQYWVTDSAVLSATPAQATMLLFM QGERGTPPNQRYISATVRAIFQKSRGQWRLDDLAVVMKPRQPTGEK" CDS 208609..209163 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0183" /product="PROBABLE CONSERVED MCE ASSOCIATED PROTEIN" /note="Mb0183, -, len: 184 aa. Equivalent to Rv0177, len: 184 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 184 aa overlap). Probable conserved Mce-associated protein, equivalent to CAC32129.1|AL583926 conserved membrane protein from Mycobacterium leprae (184 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv1973, Rv3493c, etc. Protein product from Mb0183 detected using shotgun mass spectrometry. Mb0183 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247888.1" /translation="MSPRRKFEPGEGALLAPQSIEPSRRWGLPLALTASAVVMAAAIS ACALMRISHESHQRAAHKDIVMLSDVRSFMTMFTSPDPFHANEYAERVLSHATGDFAK QYHERANDILIRISGVEPTTGTVLDAGVQRWNEDGSANVLVVTQITSKSADGKRVVSN ANRWLVTAKQEGNEWKISSLLPVI" CDS 209130..209864 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0184" /product="PROBABLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN" /note="Mb0184, -, len: 244 aa. Equivalent to Rv0178, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Probable conserved Mce-associated membrane protein, highly similar in C-terminus to CAC32130.1|AL583926 putative secreted protein from Mycobacterium leprae (184 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv1973, etc. Note that there is a 10 aa overlap with the upstream ORF. Protein product from Mb0184 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0184 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247889.1" /translation="MEDQQSASGDLTQKSVANGESTDTASAATEGHRGEIDAAGEPDE RGAAVADSQADEDDSAATAARGGKTRARRSRGRRLAITVGVAAALFVGSAAFAGATVE PYLSERAVVATKLMVARTAANAITTLWTYTPENMDTLADRAANYLSGDFAAQYRRFVD QIAAANKQAKITNDTEVTGAAVESLSGRDAVAIVYTNTTTTSPVTKNIPALKYLSYRL FMKRYDARWLVTRMTTITSLDLTPQV" CDS complement(209895..211004) /codon_start=1 /transl_table=11 /gene="lprO" /locus_tag="BQ2027_MB0185C" /product="possible lipoprotein lpro" /note="Mb0185c, lprO, len: 369 aa. Equivalent to Rv0179c, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 369 aa overlap). Possible lprO, lipoprotein (visibly not conserved). Contains possible N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0185c detected using SWATH mass spectrometry. Mb0185c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247890.1" /translation="MWIRAERVAVLTPTASLRRLTACYAALAVCAALACTTGQPAARA ADGREMLAQAIATTRGSYLVYNFGGGHPMPLLNAGGHWYEMNNGGHLMIIKNASQRLS PHLLVDTHTGDQARCEHNPGAHTGEGLWQASEIYPPLKAWQRMGRPTIAVNANFFDVR GQKGGSWRSTGCSSPLGAYVDNTRGQGRANQAVTGTVAYAGKQGLSGGNELWSSLTTM ILPVGGAPYVLRPKSRQDYDLATPVIEDLLNKNARFVAVAGIGLLSPGNTGQLHDGGP SAARTALAYAKQKDEMYIFQGGNYTPDNIQDLFRGLGSDTAILLDGGGSSAIVLRRDT GGMWAGAGSPKGSCDTRQVLCDSHERALPSWLAFN" CDS complement(211084..212442) /codon_start=1 /transl_table=11 /gene="Mb0186c" /locus_tag="BQ2027_MB0186C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0186c -, len: 452 aa. Equivalent to Rv0180c, len: 452 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 452 aa overlap). Probable conserved transmembrane protein, equivalent to CAC32132.1|AL583926 probable conserved membrane protein from Mycobacterium leprae (465 aa). Shows some similarity with others membrane proteins e.g. AL096849|SCI11_29 from Streptomyces coelicolor (354 aa), FASTA scores: opt: 190, E(): 0.00067, (25.9% identity in 409 aa overlap). Protein product from Mb0186c detected using shotgun mass spectrometry and SWATH mass spectrometry." /protein_id="CAB5247891.1" /translation="MSQAQPRPAAPNPKRNVKAIRTVRFWMAPIATTLALMSALAALY LGGILNPMTNLRHFPIALVNEDAGPAGQQIVDGLVSGLDKNKFDIRVVSPDEARRLLD TAAVYGSALIPPTFSSQLRDFGASAVTPTRTDRPAITISTNPRAGTLAASIAGQTLTR ALTVVNGKVGERLTAEVAAQTGGVALAGAAAAGLASPIDVKSTAYNPLPNGTGNGLSA FYYALLLLLAGFTGSIVVSTLVDSMLGYVPAEFGPVYRFAEQVNISRFRTLLVKWAVM VVLALLTSGVYLAIAHGLGMPIPLGWQVWLYGVFAIIAVGVTSSSLIAVLGSMGLLVS MLIFVILGLPSAGATVPLEAVPAFFRWLAQFEPMHQVFLGVRSLLYLNGNADAGLSQA LTMTSIGLIIGLLLGGFITHLYDRSSFHRIPGAVEMAIAVEHQAQYQARQSARESSSE QP" CDS complement(212469..213203) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0187C" /product="Pirin" /note="Mb0187c, -, len: 244 aa. Equivalent to Rv0181c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. YHHW_ECOLI|P46852 hypothetical 26.3 kd protein from Escherichia coli (231 aa), FASTA scores: opt: 479, E(): 1.2e-29, (37.3% identity in 233 aa overlap); P73623|SLL1773 HYPOTHETICAL 25.7 KD PROTEIN from Synechocystis sp. strain PCC 6803 (232 aa), FASTA score: (39.1% identity in 233 aa overlap). Protein product from Mb0187c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0187c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247892.1" /translation="MTATVEIRRAADRAVTTTSWLKSRHSFSFGDHYDPDNTHHGLLL VNNDDQMEPASGFDPHPHRDMEIVTWVLRGALRHQDSAGNSGVIYPGLAQRMSAGTGI LHSEMNDSATEPVHFVQMWVIPDATGITASYQQQEIDDELLRAGLVTIASGIPGQDAA LTLHNSSASLHGARLRPGATVSLPCAPFLHLFVAYGRLTLEGGGELADGDAVRFTDAD ARGLTANEPSEVLIWEMHAKLGDSAT" CDS complement(213220..214332) /codon_start=1 /transl_table=11 /gene="sigG" /locus_tag="BQ2027_MB0188C" /product="PROBABLE ALTERNATIVE RNA POLYMERASE SIGMA FACTOR SIGG (RNA POLYMERASE ECF TYPE SIGMA FACTOR)" /note="Mb0188c, sigG, len: 370 aa. Equivalent to Rv0182c, len: 370 aa (start site uncertain; first of several possibles was chosen, but note that this overlaps the upstream ORF), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 370 aa overlap). Probable sigG, alternative RNA polymerase sigma subunit (see citations below), similar to many e.g. Q45585|SIGW_BACSU RNA POLYMERASE SIGMA FACTOR from Bacillus subtilis (187 aa). Also similar to nine other ECF sigma factors from Mycobacterium tuberculosis e.g. Rv1221, Rv0735, etc. Contains PS01063 Sigma-70 factors ECF subfamily signature and probable helix-turn helix motif from aa 205-226 (Score 1181, +3.21 SD). BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. Protein product from Mb0188c detected using SWATH mass spectrometry. Mb0188c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247893.1" /translation="MRTSPMPAKFRSVRVVVITGSVTAAPVRVSETLRRLIDVSVLAE NSGREPADERRGDFSAHTEPYRRELLAHCYRMTGSLHDAEDLVQETLLRAWKAYEGFA GKSSLRTWLHRIATNTCLTALEGRRRRPLPTGLGRPSADPSGELVERREVSWLEPLPD VTDDPADPSTIVGNRESVRLAFVAALQHLSPRQRAVLLLRDVLQWKSAEVADAIGTST VAVNSLLQRARSQLQTVRPSAADRLSAPDSPEAQDLLARYIAAFEAYDIDRLVELFTA EAIWEMPPYTGWYQGAQAIVTLIHQQCPAYSPGDMRLISLIANGQPAAAMYMRAGDVH LPFQLHVLDMAADRVSHVVAFLDTTLFPKFGLPDSL" CDS 214280..215119 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0189" /product="POSSIBLE LYSOPHOSPHOLIPASE" /note="Mb0189, -, len: 279 aa. Equivalent to Rv0183, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 279 aa overlap). Possible lysophospholipase (EC 3.1.-.-), similar to several (especially eukaryotic enzymes, weaker with Escherichia coli), e.g. U67963|HSU67963_1 Human lysophospholipase homolog from Homo sapiens (313 aa), FASTA scores: opt: 569, E(): 2.6e-29, (37.1% identity in 259 aa overlap); P07000|PLDB_ECOLI LYSOPHOSPHOLIPASE L2 from Escherichia coli (165 aa), FASTA scores: opt: 219, E(): 0.00012. Start changed based on similarity to AE001997_8 from Deinococcus radiodurans (282 aa), FASTA scores: opt: 510, E(): 1.4e-25, (34.8% identity in 282 aa overlap). Also shows some similarity to epoxide hydrolases from Mycobacterium tuberculosis e.g. Rv1938 FASTA score: (30.7% identity in 114 aa overlap); and O07214|YR15_MYCTU|Rv2715|MT2788|MTCY05A6.36 (341 aa). Note that the putative product of this CDS corresponds to spot 3_329 identified in culture supernatant by proteomics at the Max-Planck-Institut fuer Infektionsbiologie (see citations below). Protein product from Mb0189 detected using shotgun mass spectrometry. Mb0189 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247894.1" /translation="MTTTRTERNFAGIGDVRIVYDVWTPDTAPQAVVVLAHGLGEHAR RYDHVAQRLGAAGLVTYALDHRGHGRSGGKRVLVRDISEYTADFDTLVGIATREYPGC KRIVLGHSMGGGIVFAYGVERPDNYDLMVLSAPAVAAQDLVSPVVAVAAKLLGVVVPG LPVQELDFTAISRDPEVVQAYNTDPLVHHGRVPAGIGRALLQVGETMPRRAPALTAPL LVLHGTDDRLIPIEGSRRLVECVGSADVQLKEYPGLYHEVFNEPERNQVLDDVVAWLT ERL" CDS 215161..215910 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0190" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0190, -, len: 249 aa. Equivalent to Rv0184, len: 249 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 249 aa overlap). Conserved hypothetical protein, equivalent to CAC32136.1|AL583926 conserved hypothetical protein from Mycobacterium lepra (249 aa); and C-terminus highly similar to CAB08793.1|Z95398 conserved hypothetical protein from Mycobacterium leprae (145 aa), FASTA scores: E(): 0, (75.2 identity in 145 aa overlap). Also similar to 049841|SCE9_39|T36358 hypothetical protein from Streptomyces coelicolor (418 aa), FASTA scores: opt: 231, E(): 8.1e-08, (30.4% identity in 270 aa overlap). Protein product from Mb0190 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0190 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247895.1" /translation="MTNDKMLARIAALLRQAEGTDNPHEADAFMSTAQRLATAASIDL AVARSHAGNRSPAQAPTQRTITIGAAGTRGLRTYVQLFVLIAAANDVRCDVASNSTFV YAYGFAEDIDTSHALYASLVVQMVRASDAYLASGAHRPTPTITARLNFQLAFGARVGQ RLADAREQTRQEATKDRDRPPGTAIALRDKDIELHEYYRRSSKARGAWRASRATAGYS SAARRAGDRAGRQARLGNNPELPGARAALGR" CDS 215907..216416 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0191" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0191, -, len: 169 aa. Equivalent to Rv0185, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 169 aa overlap). Conserved hypothetical protein, equivalent to CAB08794.1|Z95398|MLCL622_2 from Mycobacterium leprae (168 aa), FASTA scores: opt: 861, E(): 0, (76.4% identity in 165 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb0191 detected using SWATH mass spectrometry. Mb0191 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247896.1" /translation="MIGADVPRDSQRARVYAAEAFVRTLFDRVTAHGSPTVEFFGTQL TLPPEGRFGSVASVQRYVDDVLALPAVGQNWPTVSPVRVRARRAATAAHYENHGGTGT IAVPDRHTAGWAMRELVVLHEVAHHLCQVPPPHGPEFVATVCTLTELVMGPEVGHVFR VVYAQEGVR" CDS 216461..218536 /codon_start=1 /transl_table=11 /gene="bglS" /locus_tag="BQ2027_MB0192" /product="PROBABLE BETA-GLUCOSIDASE BGLS (GENTIOBIASE) (CELLOBIASE) (BETA-D-GLUCOSIDE GLUCOHYDROLASE)" /note="Mb0192, bglS, len: 691 aa. Equivalent to Rv0186, len: 691 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 691 aa overlap). Probable bglS, beta-glucosidase (EC 3.2.1.21), highly similar to many e.g. BGLS_AGRTU|P27034 beta-glucosidase from Agrobacterium tumefaciens (818 aa), FASTA scores: opt: 643, E(): 0, (32.5% identity in 842 aa overlap). SEEMS TO BELONG TO FAMILY 3 OF GLYCOSYL HYDROLASES. Protein product from Mb0192 detected using SWATH mass spectrometry. Mb0192 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247897.1" /translation="MTDDERFSLLVGLTGASDLWPVRDERIPQGVPMCAGYVPGIPRL GVPALLMSDAGLGVTNPGYRPGDTATALPAGLALAASFNPVLARSSGKAIGREARSRG FNVQLAGAINLARDPRNGRNFEYLSEDPLLSATMAAESIIGIQQQGVIATTKHFSLNC NETNRHWLDAVIDPDAHRESDLLAFEIVIERSQPGAVMAAYNKVNGDYAAGNDHLLND VLKGAWGYRGWVMSDWGGTPSWECALAGLDQECGAQIDAVLWQSEAFTDRLRAAYADG NLPKGRLSDMVRRILRSMFAVGIDRWKPAPAPDMNAHNEIAAQMARQGIVLLQNRGLL PLAPESAGRIAVIGGYAHLGVPAGYGSSAVTPPGGYAGVIPIGGSGLAAGLRNLYLLP SSPLSELRKRLPNAQFEFDPGINPAEAVLAARRADIAIVFAIRAEGEGFDSADLSLPW GQDALIAAVASANANTVVVLETGNPVTMPWRDSVNAIMQAWYPGQAGGQAVAEIVTGQ VNPSGRLPITFPVDLGQTPRSQPRELGAPWGTSTTIHYTEGADVGYRWFASTNQTPMF AFGHGLSYTSFEYRDLVVTGGHTVHASFSVTNTGDRSGADVPQLYMIAAPGESRLRLL GFERVELEPGQTRRVRIEADPRLLARYDGEARSWRIEPGGYTVAVGASAVALKLAAKV KLAGRGFGR" CDS complement(218582..218743) /codon_start=1 /transl_table=11 /gene="mymT" /locus_tag="BQ2027_MB0192A" /product="Metallothionein, MymT" /note="Mb0192A, len: 53 aa. Equivalent to Rv0186A len: 53 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 53 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). MymT, metallothionein,equivalent to MAV_4993|A0QMH5 hypothetical protein from Mycobacterium avium (strain 104) (51 aa), and MAP_3626c|Q73TU2 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (51 aa), FASTA scores: opt: 312, E(): 4.6e-17, (81.2% identity in 48 aa overlap). Protein product from Mb0192A detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0192A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247898.1" /translation="MRVIRMTNYEAGTLLTCSHEGCGCRVRIEVPCHCAGAGDAYRCT CGDELAPVK" CDS 218897..219559 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0193" /product="PROBABLE O-METHYLTRANSFERASE" /note="Mb0193, -, len: 220 aa. Equivalent to Rv0187, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Probable O-methyltransferase (EC 2.1.1.-), similar to many e.g. AB93458.1|AL357591 putative O-methyltransferase from Streptomyces coelicolor (223 aa); MDMC_STRMY|Q00719 O-methyltransferase from Streptomyces mycarofaciens (221 aa), FASTA scores: opt: 327, E(): 2.4e-17, (35.9% identity in 192 aa overlap). Also similar to Rv1703c, Rv1220c from Mycobacterium tuberculosis. Protein product from Mb0193 detected using shotgun mass spectrometry. Mb0193 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247899.1" /translation="MGMDQQPNPPDVDAFLDSTLVGDDPALAAALAASDAAELPRIAV SAQQGKFLCLLAGAIQARRVLEIGTLGGFSTIWLARGAGPQGRVVTLEYQPKHAEVAR VNLQRAGVADRVEVVVGPALDTLPTLAGGPFDLVFIDADKENNVAYIQWAIRLARRGA VIVVDNVIRGGGILAESDDADAVAARRTLQMMGEHPGLDATAIQTVGRKGWDGFALAL VR" CDS 219678..220109 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0194" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0194, -, len: 143 aa. Equivalent to Rv0188, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Probable conserved transmembrane protein, similar to T35347|4835334|CAB42956.1|AL049863|SC5H1_31 probable membrane protein from Streptomyces coelicolor (147 aa), FASTA scores: opt: 326, E(): 6.5e-15, (36.2% identity in 141 aa overlap); N-terminus of P80185|MTRC_METTH TETRAHYDROMETHANOPTERIN S-METHYLTRANSFERASE SUBUNIT C (EC 2.1.1.86) from Methanobacterium thermoautotrophicum strain Marburg/DSM 2133 (266 aa), FASTA scores: opt: 125, E(): 0.033, (31.6% identity in 98 aa overlap). Also similar to Rv3635 from Mycobacterium tuberculosis. Protein product from Mb0194 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0194 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247900.1" /translation="MSTVHSSIDQHPDLLALRASFDRAAESTIAHFTFGLALLAGLYV AASPWIVGFSATRGLPTCDLIVGIAVAYLAYGFASALDRTHGMTWTLPVLGVWVIFSP WVLPGVAVTAGMMWSHIIAGAVVAVLGFYFGMRTRAAANQG" CDS complement(220188..221915) /codon_start=1 /transl_table=11 /gene="ilvD" /locus_tag="BQ2027_MB0195C" /product="PROBABLE DIHYDROXY-ACID DEHYDRATASE ILVD (DAD)" /note="Mb0195c, ilvD, len: 575 aa. Equivalent to Rv0189c, len: 575 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 575 aa overlap). Probable ilvD, dihydroxy-acid dehydratase (EC 4.2.1.9), similar to many e.g. ILVD_LACLA|Q02139 dihydroxy-acid dehydratase (dad) from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (570 aa), FASTA scores: opt: 1605, E(): 0, (46.0% identity in 561 aa overlap). Also similar to ML2608|MLCL622.06c|O06069|ILVD_MYCLE DIHYDROXY-ACID DEHYDRATASE from Mycobacterium leprae (564 aa). Contains PS00886 Dihydroxy-acid and 6-phosphogluconate dehydratases signature 1. BELONGS TO THE ILVD / EDD FAMILY. COFACTOR: BINDS 1 4FE-4S CLUSTER (POTENTIAL). Protein product from Mb0195c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0195c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247901.1" /translation="MPQTTDEAASVSTVADIKPRSRDVTDGLEKAAARGMLRAVGMDD EDFAKPQIGVASSWNEITPCNLSLDRLANAVKEGVFSAGGYPLEFGTISVSDGISMGH EGMHFSLVSREVIADSVEVVMQAERLDGSVLLAGCDKSLPGMLMAAARLDLAAVFLYA GSILPGRAKLSDGSERDVTIIDAFEAVGACSRGLMSRADVDAIERAICPGEGACGGMY TANTMASAAEALGMSLPGSAAPPATDRRRDGFARRSGQAVVELLRRGITARDILTKEA FENAIAVVMAFGGSTNAVLHLLAIAHEANVALSLQDFSRIGSGVPHLADVKPFGRHVM SDVDHIGGVPVVMKALLDAGLLHGDCLTVTGHTMAENLAAITPPDPDGKVLRALANPI HPSGGITILHGSLAPEGAVVKTAGFDSDVFEGTARVFDGERAALDALEDGTITVGDAV VIRYEGPKGGPGMREMLAITGAIKGAGLGKDVLLLTDGRFSGGTTGLCVGHIAPEAVD GGPIALLRNGDRIRLDVAGRVLDVLADPAEFASRQQDFSPPPPRYTTGVLSKYVKLVS SAAVGAVCG" CDS 222063..222353 /codon_start=1 /transl_table=11 /gene="ricR" /locus_tag="BQ2027_MB0196" /product="Metal-sensitive transcriptional repressor" /note="Mb0196, -, len: 96 aa. Equivalent to Rv0190, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). Conserved hypothetical protein, highly similar to several hypothetical proteins e.g. SYCSLRA_35|Q55554|SLL0176 hypothetical 18.9 KD protein from Synechocystis (167 aa), FASTA scores: opt: 237, E(): 5.8e-16, (39.4% identity in 94 aa overlap). Also highly similar to Z95398|MLCL622_7|O06070 from Mycobacterium leprae (135 aa), FASTA score: (82.6% identity in 92 aa overlap). Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0967, Rv0030, Rv1766 (42.5% identity in 80 aa overlap). Protein product from Mb0196 detected using shotgun mass spectrometry. Mb0196 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247902.1" /translation="MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQ ISAVTSALRSVALNLLDEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS" CDS 222481..223722 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0197" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0197, -, len: 413 aa. Equivalent to Rv0191, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 413 aa overlap). Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug, similar to several hypothetical proteins e.g. YDEA_ECOLI|P31122 hypothetical 42.5 kd protein from Escherichia coli (396 aa), FASTA scores: opt: 475, E(): 4.2e-33, (29.7% identity in 381 aa overlap); and to several chloramphenicol resistance proteins e.g. CMLR_STRLI|P31141 chloramphenicol resistance protein from stremtomyces lividans (392 aa), FASTA scores: opt: 394, E(): 6.7e-12, (28.2% identity in 383 aa overlap). Also similar to SVU09991_1 from Mycobacterium tuberculosis. Protein product from Mb0197 detected using SWATH mass spectrometry. Mb0197 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247903.1" /translation="MTAPTGTSATTTRPWTPRIATQLSVLACAAFIYVTAEILPVGAL SAIARNLRVSVVLVGTLLSWYALVAAVTTVPLVRWTAHWPRRRALVVSLVCLTVSQLV SALAPNFAVLAAGRVLCAVTHGLLWAVIAPIATRLVPPSHAGRATTSIYIGTSLALVV GSPLTAAMSLMWGWRLAAVCVTGAAAAVALAARLALPEMVLRADQLEHVGRRARHHRN PRLVKVSVLTMIAVTGHFVSYTYIVVIIRDVVGVRGPNLAWLLAAYGVAGLVSVPLVA RPLDRWPKGAVIVGMTGLTAAFTLLTALAFGERHTAATALLGTGAIVLWGALATAVSP MLQSAAMRSGGDDPDGASGLYVTAFQIGIMAGALLGGLLYERSLAMMLTASAGLMGVA LFGMTVSQHLFENPTLSPGDG" CDS 223799..224857 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0198" /product="L,D-transpeptidase" /note="Mb0198, -, len: 352 aa. Equivalent to Rv0192 and Rv0192A, len: 366 aa and 100 aa, from Mycobacterium tuberculosis strain H37Rv, (96.5% identity in 318 aa overlap and 91.1% identity in 56 aa overlap). Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375, E(): 3.2e-24, (36.1% identity in 255 aa overlap); YV09_MYCTU|Q11149|cY20G9.09 hypothetical 47.9 kd protein from Mycobacterium tuberculosis (451 aa), FASTA scores: opt: 330, E(): 3.2e-13, (35.1% identity in 271 aa overlap). Also similar to Rv0116c, Rv1433, Rv2518c, Rv0483 from Mycobacterium tuberculosis. Probable N-terminal part of Rv0192, which is member of family P5.17 with Rv0116c, Rv1433, Rv2518c, Rv0483. These are all predicted to be exported/membrane proteins. Rv0192A has typical N-terminal signal peptide which is functional and was identified by PhoA fusion screens: O52054 PGB14T-O1 PRECURSOR (FRAGMENT 45 AA) (see citation below). Since Rv0192 misses a signal peptide this suggests that there is a frameshift in the region of the overlap with Rv0192 but none found on reinspection of sequence. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0192 and Rv0192A exist as 2 genes with an overlap region. In Mycobacterium bovis, a single base insertion (*-g) leads to a single product. Protein product from Mb0198 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0198 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247904.1" /translation="MSRWKQGWTRGSLFAALNIAAVVAVLMLGAGVAVADPDAAPGDP GGPGGPGGTAGPVDPPAVDLLAPPPDPLALPPALDPLAPPPPDPLAPPPPDPLAVPVA AGPVAGQDPTPFVGPPPFRPPTFNPVDGAMVGVAKPIVINFAVPIADRAMAESAIHIS SIPPVPGKFYWMSPTQVRWRPFEFWPANTAVNIDAAGTKSSFRTGDSLVATADDATHQ MTITRNGVVQKTFPMSMGMVSGGHQTPNGTYYVLEKFATVVMDSSTYGVPVNSAQGYK LTVSDAVRIDNSGNFVHSAPWSVADQGKRNVTHGCINLSPANAKWFYDNFGSGDPVVV KNSVGTYNKNDGAQDWQI" CDS complement(224917..226764) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0199C" /product="HYPOTHETICAL PROTEIN" /note="Mb0199c, -, len: 615 aa. Equivalent to Rv0193c, len: 615 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 615 aa overlap). Hypothetical unknown protein." /protein_id="CAB5247905.1" /translation="MIQISRDMSSLGQTATTQALPDNSDGIQLTKFAADDILPLEYAP PIGPELVSQDQLPAAWAYKRFRDLDDKESYRRKLLQELTDALAAQGSEAAEIATAALR DLIDQMAEQGAVVLADIVESDDFLELVKRYDELMAREGSRSFIHRFLDLRRSPGMLTD PAVNGALVHPLMIALISYAVGGPIRMIDARGKDAEPLSVLAQDNMLHIDNTPFNDEYK ILITWRRGTAQGPAGQNFTFLPGTHKLARTCFVNEDGVPWSSENASIFTTPDSIRKVF DAQRQLGGQDHPTVIEVTDSERPLSSVFAAGSLVHHRFRTASGSARSCIILVFHRVAD NPGRMVSDVEDSSDVSLSELLTRGVPDESYQQRFIATLCAAADEIAELLLKWKKTPQR PVSLPLQTKQIDGARFEEWISAATEAPEVREIRNRELTIPYGEVLSAEEFFDLIWRLM RFDKHGPLDLILYHDNREEPRKWARNLIREMSADRLYERLLGWLADIQQPRPADCLRP LQIHALISEVLKTLPLDEDQDPPADWHFDLLGMSHAEAARSVKHLLEDVAEALLRCED MAAYLSTSLFAFWAVDAAYSLDGRRNLVVKDCARRLLRHYTMLSLTCFQ" CDS 227071..230655 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0200" /product="probable transmembrane multidrug efflux pump" /note="Mb0200, -, len: 1194 aa. Equivalent to Rv0194, len: 1194 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 1194 aa overlap). Probable drugs-transport transmembrane protein ATP binding protein ABC transporter (see citation below), highly similar to many e.g. U62129|STU62129_2|T30293 ABC transport protein homolog from Salmonella typhi (1218 aa), FASTA scores: opt: 1116, E(): 0, (36.3% identity in 1209 aa overlap); CAB66302.1|AL136519 ABC transporter protein ATP-binding component from Streptomyces coelicolor (1243 aa); I84547 mdl protein from Escherichia coli (1143 aa); etc. Also similar to MTCY50_9 and MTCY50_10 from Mycobacterium tuberculosis, FASTA score: (33.8% identity in 574 aa overlap). Contains two PS00017 ATP/GTP-binding site motif A (P-loop) and one PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Alternative start possible at 1823 but no RBS. Protein product from Mb0200 detected using shotgun mass spectrometry." /protein_id="CAB5247906.1" /translation="MRTNCWWRLSGYVMRHRRDLLLGFGAALAGTVIAVLVPLVTKRV IDDAVAADHRPLAPWAVVLVAAAGATYLLTYVRRYYGGRIAHLVQHDLRMDAFQALLR WDGRQQDRWSSGQLIVRTTNDLQLVQALLFDVPNVLRHVLTLLLGVAVMTWLSVPLAL LAVLLVPVIGLIAHRSRRLLAAATHCAQEHKAAVTGVVDAAVCGIRVVKAFGQEERET VKLVMASRALYAAQLRVARLNAHFGPLLQTLPALGQMAVFALGGWMAAQGSITVGTFV AFWACLTLLARPACDLAGMLTIAQQARAGAVRVLELIDSRPTLVDGTKPLSLEARLSL EFQRVSFGYVADRPVLREISLSVRAGETLAVVGAPGSGKSTLASLATRCYDVTQGAVR IGGQDVRELTLDSLRSAIGLVPEDAVLFSGTIGANIAYGRPDATPEQIATAARAAHIE EFVNTLPDGYQTAVGARGLTLSGGQRQRIALARALLHQPRLLIMDDPTSAVDAVIECG IQEVLREAIADRTAVIFTRRRSMLTLADRVAVLDSGRLLDVGTPDEVWERCPRYRELL SPAPDLADDLVVAERSPVCRPVAGLGTKAAQHTNVHNPGPHDHPPGPDPLRRLLREFR GPLALSLLLVAVQTCAGLLPPLLIRHGIDVGIRRHVLSALWWAALAGTATVVIRWVVQ WGSAMVAGYTGEQVLFRLRSVVFAHAQRLGLDAFEDDGDAQIVTAVTADVEAIVAFLR TGLVVAVISVVTLVGILVALLAIRARLVLLIFTTMPVLALATWQFRRASNWTYRRARH RLGTVTATLREYAAGLRIAQAFRAEYRGLQSYFAHSDDYRRLGVRGQRLLALYYPFVA LLCSLATTLVLLDGAREVRAGVISVGALVTYLLYIELLYTPIGELAQMFDDYQRAAVA AGRIRSLLSTRTPSSPAARPVGTLRGEVVFDAVHYSYRTREVPALAGINLRIPAGQTV VFVGSTGSGKSTLIKLVARFYDPTHGTVRVDGCDLREFDVDGYRNRLGIVTQEQYVFA GTVRDAIAYGRPDATDAQVERAAREVGAHPMITALDNGYLHQVTAGGRNLSAGQLQLL ALARARLVDPDILLLDEATVALDPATEAVVQRATLTLAARRTTLIVAHGLAIAEHADR IVVLEHGTVVEDGAHTELLAAGGHYSRLWAAHTRLCSPEITQLQCIDA" CDS 231093..231728 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0201" /product="possible two component transcriptional regulatory protein (probably luxr-family)" /note="Mb0201, -, len: 211 aa. Equivalent to Rv0195, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Putative two-component response regulator, luxR family, similar to many e.g. U00008|ECOHU49_15 regulatory protein narP from Escherichia coli strain K12 (225 aa), FASTA scores: opt: 232, E(): 7.3e-09, (29.2% identity in 219 aa overlap). Start chosen by similarity. Contains probable helix-turn-helix motif at aa 166-187 (Score 1164, +3.15 SD)." /protein_id="CAB5247907.1" /translation="MAPVNVISVAVVASDPLTRDGALARLSSHRELDVRAWQAGCETS VLLVLATTITAPLLCQIEDVQKDGPSHAPKLVVVADEFSAEQVFRMIKLGLTGLLYRS QSTFDCIVETIRLSAEGRLRLPERVQRYLVGRIKSTPTAEPDTPCAAALAEREVAVLR LLADGLSTHQVAVQLNYCERTIKNIVHDIVTRLKLRNRTHAVAHALRAGLI" CDS 231841..232425 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0202" /product="possible transcriptional regulatory protein" /note="Mb0202, -, len: 194 aa. Equivalent to Rv0196, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 194 aa overlap). Putative transcriptional regulatory protein, similar to two Bacillus subtilis regulators: P42105|YXAF_BACSU HYPOTHETICAL 21.0 KD PROTEIN (191 aa), FASTA scores: opt: 323, E(): 2.1e-15, (30.9% identity in 181 aa overlap); and Z99105|BSUB0002_9 negative regulator of the lincomycin operon (188 aa), FASTA scores: opt: 255, E(): 1e-10, (25.9 identity in 185 aa overlap). Protein product from Mb0202 detected using SWATH mass spectrometry." /protein_id="CAB5247908.1" /translation="MQGPRERMVVSAALLIRERGAHATAISDVLQHSGAPRGSAYHYF PGGRTQLLCEAVDYAGEHVAAMINEAEGGLELLDALIDKYRQQLLSTDFRAGCPIAAV SVEAGDEQDRERMAPVIARAAAVFDRWSDLTAQRFIADGIPPDRAHELAVLATSTLEG AILLARVRRDLTPLDLVHRQLRNLLLAELPERSR" CDS 232425..234671 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0203" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0203, -, len: 748 aa. Equivalent to Rv0197, len: 762 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 748 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (747 aa), FASTA scores: opt: 617, E(): 9.8e-30, (29.9% identity in 762 aa overlap); BAB04334.1|AP001509 assimilatory nitrate reductase (catalytic subunit) from Bacillus halodurans (743 aa); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (t-g) introduces a stop codon that leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (748 aa versus 762 aa)." /protein_id="CAB5247909.1" /translation="MTSSDWLPTACILCECNCGIVVQVDDRRLARIRGDKAHPGSAGY TCNKALRLDHYQNNRARLSSPMRRRADGTYEEIDWDTAIVEIAEGFKQIRDTHGGDKI FYYGGGGQGNHLGGAYSGAFLKALGSRYRSNALAQEKTGEAWVDFQLYGGHTRGEFEN AEVSVFVGKNPWMSQSFPRARVVLNEIAKDPGRSMIVIDPVVTDTAKMADFHLRVQPG CDAWCLAALAAVLVQENLCNEAFLAAHVHGVDTVRAALQEVPVADYAQRCGVDEELLR AAARRIGTAASVSVFEDLGIQQAPNSTVCSYLNKLLWILTGNFAKKGGQHLHSSFAPL FSQVSGRTPVTGAPIIAGLIPGNVVPEEILTEHPDRFRAMIVESGNPAHSLADSAACR AAFQALELMVVVDVAMTETARLAHYVLPAASQFEKPEATFFNFEFPRNGFQLRRPLFP PLPGTLPEPEIWARLVRALGVVDEADLRPLREAAAQGRQAYTEAFLAAAATNPTVAKL TAYVLYETLGPTLPDGLAGAAALWGLAQKTAMAYPDAVRRAGHADGNALFDAILERPS GVTFTVHNYEDDFALISHPDHKIALEIPEMLAEIRSLTQTPSRLTTPQLPIVLSVGER RAYTANDIFRDPSWRKRDANGALRVSVEDAQALGLADGCLARITTAAGSAEATVEVTE TMLAGHAALPNGFGLDYTGDDGRTVVAGVAPNALTSTRWRDPYAGTPWHKHVPAAIRR ADAESPIW" CDS complement(234712..236703) /codon_start=1 /transl_table=11 /gene="zmp1" /locus_tag="BQ2027_MB0204C" /product="probable zinc metalloprotease zmp1" /note="Mb0204c, -, len: 663 aa. Equivalent to Rv0198c, len: 663 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 663 aa overlap). Probable zinc metalloprotease (EC 3.4.24.-), equivalent to Z95398|MLCL622.12c from Mycobacterium leprae (667 aa), FASTA scores: opt: 3710, E(): 0, (80.8 % identity in 667 aa overlap). Also similar to many other metalloproteases e.g. members of the eukaryotic neprilysin family: P08473|NEP_HUMAN NEPRILYSIN (EC 3.4.24.11) (749 aa), FASTA scores: opt: 872, E(): 0, (31.1% identity in 692 aa overlap); Q07744|PEPO_LACLA NEUTRAL ENDOPEPTIDASE from Lactococcus lactis (626 aa), FASTA scores: opt: 862, E(): 0, (30.0% identity in 654 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M13 (ZINC METALLOPROTEASE); ALSO KNOWN AS THE NEPRILYSIN SUBFAMILY. Protein product from Mb0204c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0204c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247910.1" /translation="MTLAIPSGIDLSHIDADARPQDDLFGHVNGRWLAEHEIPADRAT DGAFRSLFDRAETQVRDLIIQASQAGAAVGTDAQRIGDLYASFLDEEAVERAGVQPLH DELATIDSAADATELAAALGTLQRAGVGGGIGVYVDTDSKDSTRYLVHFTQSGIGLPD ESYYRDEQHAAVLAAYPGHIARMFGLVYGGESRDHAKTADRIVALETKLADAHWDVVK RRDADLGYNLRTFAQLQTEGAGFDWVSWVTALGSAPDAMTELVVRQPDYLVTFASLWA SVNVEDWKCWARWRLIRARAPWLTRALVAEDFEFYGRTLTGAQQLRDRWKRGVSLVEN LMGDAVGKLYVQRHFPPDAKSRIDTLVDNLQEAYRISISELDWMTPQTRQRALAKLNK FTAKVGYPIKWRDYSKLAIDRDDLYGNVQRGYAVNHDRELAKLFGPVDRDEWFMTPQT VNAYYNPGMNEIVFPAAILQPPFFDPQADEAANYGGIGAVIGHEIGHGFDDQGAKYDG DGNLVDWWTDDDRTEFAARTKALIEQYHAYTPRDLVDHPGPPHVQGAFTIGENIGDLG GLSIALLAYQLSLNGNPAPVIDGLTGMQRVFFGWAQIWRTKSRAAEAIRRLAVDPHSP PEFRCNGVVRNVDAFYQAFDVTEDDALFLDPQRRVRIWN" CDS 236746..237405 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0205" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0205, -, len: 219 aa. Equivalent to Rv0199, len: 219 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 219 aa overlap). Probable conserved membrane protein, equivalent to Z95398|MLCL622.13 from Mycobacterium leprae (224 aa), FASTA scores: opt: 920, E(): 0, (67.7% identity in 220 aa overlap). Also some similarity to Mce-associated membrane proteins from Mycobacterium tuberculosis e.g. Rv0178, Rv0175, etc. Protein product from Mb0205 detected using shotgun mass spectrometry. Mb0205 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247911.1" /translation="MPDGEQSQPPAQEDAEDDSRPDAAEAAAAEPKSSAGPMFSTYGI ASTLLGVLSVAAVVLGAMIWSAHRDDSGERTYLTRVMLTAAEWTAVLINMNADNIDAS LQRLHDGTVGQLNTDFDAVVQPYRQVVEKLRTHSSGRIEAVAIDTVHRELDTQSGAAR PVVTTKLPPFATRTDSVLLVATSVSENAGAKPQTVHWNLRLDVSDVDGKLMISRLESI R" CDS 237402..238091 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0206" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0206, -, len: 229 aa. Equivalent to Rv0200, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Possible conserved transmembrane protein, equivalent to Z95398|MLCL622.14 from Mycobacterium leprae (229 aa), FASTA scores: opt: 1147, E(): 0, (74.7% identity in 229 aa overlap). Also some similarity to Rv1973 from Mycobacterium tuberculosis (160 aa); and Rv1362c|Z75555|MTCY02B10_26 (220 aa), FASTA scores: opt: 134, E(): 0.063, (25.8% identity in 159 aa overlap). Protein product from Mb0206 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0206 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247912.1" /translation="MRNAWRLVVFDVLAPLATIAALAAIGVLLGWPLWWVSTCSVLVL LVVEGVAINFWLLRRDSVTVGTDDDAPGLRLAVVFLCAAAISAAVVTGYLRWTTPDRD FNRDSREVVHLATGMAETVASFSPSAPAAAVDRAAAMMVPEHAGGFKEQYAKSSADLA RRGVTAQAATLAAGVEAIGPSAASVAVILRVSQSIPGQPTSQAARALRVTLTKRGSGW LVLDVTPINAR" CDS complement(238088..238591) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0207C" /product="conserved protein" /note="Mb0207c, -, len: 167 aa. Equivalent to Rv0201c, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 167 aa overlap). Conserved hypothetical protein, equivalent to Z95398|MLCL622.15c from Mycobacterium leprae (170 aa), FASTA scores: opt: 646, E(): 0, (63.9% identity in 158 aa overlap). Protein product from Mb0207c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0207c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247913.1" /translation="MTLAAEPHPAPPQQPTVAWSEPDVDRRVEFWPTVAIRSALESGD IATWQRIAAALKRDPYGRTARQVEEVLEGIPATGIANAFWEVLDRARTHLDANERAEV ARQVGLLLDRSGLQRQEFASRIGVTAQDLTAYLDGIVSPSASLMIRMRRLSDRFVRAK SVRAADS" CDS complement(238588..241488) /codon_start=1 /transl_table=11 /gene="mmpL11" /locus_tag="BQ2027_MB0208C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL11" /note="Mb0208c, mmpL11, len: 966 aa. Equivalent to Rv0202c, len: 966 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 966 aa overlap). Probable mmpL11, conserved transmembrane transport protein (see citation below), equivalent to Z95398|MLCL622.16c from Mycobacterium leprae (1014 aa), FASTA scores: opt: 4076, E(): 0, (72.8% identity in 1017 aa overlap). Member of RND superfamily, similar to several putative transport proteins e.g. P96687 from Bacillus subtilis (724 aa), FASTA scores: opt: 594, E(): 9.1e-29, (26.9% identity in 717 aa overlap); etc. BELONGS TO THE MMPL FAMILY. Protein product from Mb0208c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0208c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247914.1" /translation="MMRLSRNLRRCRWLVFTGWLLALVPAVYLAMTQSGNLTGGGFEV AGSQSLLVHDQLDAHYPDRGAPALALVAAPRPDASYQDIDNAVALLRQIASELPGVTE APNPTQRPPQPDRPYVVSLRLDARNAGTSDVAKKLRDRIGVKGDQSGQTANGKVRLYV IGQGALSAAAAANTKHDIANAERWNLPIILMVLVAVFGSLAAAAIPLALAVCTVVITM GLVFVLSMHTTMSVFVTSTVSMFGIALAVDYSLFILMRYREELRCGRRPPDAVDAAMA TSGLAVVLSGMTVIASLTGIYLINTPALRSMATGAILAVAVAMLTSATLTPAVLATFA RAAAKRSALVHWSRRPASTQSWFWSRWVGWVMRRPWITALAASTVLLVMAAPATLMVL GNSLLRQFDSSHEIRTGAAAAAQALGPGALGPVQVLVRFDAGGASAPEHSQTIAAIRH RIAQAPNVVSVAPPRFADDNGSALLSAVLSVDPEDLGARDTITWMRTQLPRVAGAAQV DVGGPTALIKDFDDRVSATQPLVLVFVAVIAFLMLLISIRSVFLAFKGVLMTLLSVAA AYGSLVMVFQWGWARGLGFPALHSIDSTVPPLVLAMTFGLSMDYEIFLLTRIRERFLQ TGQTRDAVAYGVRTSARTITSAALIMIAVFCGFAFAGMPLVAEIGVACAVAIAVDATV VRLVLVPALMAMFDRWNWWLPRWLAHILPSVDFDRPLPKVDLGDVVVIPDDFAAAIPP SADVRMVLKSAAKLKRLAPDAICVTDPLAFTGCGCDGKALDQVQLAYRNGIARAISWG QRPVHPVTVWRKRLAVALDALQTTTWECGGVQTHRAGPGYRRRSPVETTNVALPTGDR LQIPTGAETLRFKGYLIMSRNSSHDYADFADLVDTMAPETAAAVLAGMDRYYSCQAPG RQWMATQLVGRLADPQPSDLGDQSPGADAQAKWEEVRRRCLSVAVAMLEEAR" CDS 241710..242120 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0209" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb0209, -, len: 136 aa. Equivalent to Rv0203, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Possible exported protein (has hydrophobic stretch near N-terminus). Some similarity to part of U02459|LDU02459_1 hypothetical protein from Leishmania donovani (741 aa), FASTA score: opt: 111, E(): 9.1, (30.0% identity in 90 aa overlap). Protein product from Mb0209 detected using SWATH mass spectrometry. Mb0209 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247915.1" /translation="MKTGTATTRRRLLAVLIALALPGAAVALLAEPSATGASDPCAAS EVARTVGSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPKVASDLH ALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR" CDS complement(242172..243455) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0210C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0210c, -, len: 427 aa. Equivalent to Rv0204c, len: 412 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 412 aa overlap). Probable conserved transmembrane protein, equivalent, but has C-terminal extension, to Z95398|MLCL622.17c from Mycobacterium leprae (367 aa), FASTA scores: opt: 2002, E(): 0, (82.4% identity in 374 aa overlap). Some similarity to Rv0585c from Mycobacterium tuberculosis. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, the presence of a more likely start codon leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (427 a versus 412 aa). Mb0210c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247916.1" /translation="MYRDYGATCTVTLKHVSHDAPARNLRQRVGALPRTRVGAPPAEG VPPRGKYWWLRWAVLAIVAIVLAIEVALGWDQLAKAWVSLYRAKWWWLLAAVAAAGAS MHSFAQIQRTLLKSAGVHVKQWRSEAAFYAANSLSTTLPGGPVLSATFLLRQQRIWGA STVVASWQLVMSGVLQAVGLALLGLGGAFFLGAKNNPFSLLFTLGGFVTLLLLAQAVA SRPELIEGIGRRVLSWANSVRGRPADAGLPKWRETLMQLESVSLGRRDLGVAFGWSLF NWIADVACLGFAAYAAGDHASVGGLAVAYAAARAVGTIPLMPGGLLVVEAVLVPGLVS SGMPLPSAISAMLIYRLISWLLIAAIGWVVFFFMFRTESTADSDNDRDPPTDPNLRLV IQPQGTPCDDPVETTPQGPAPTPDLRPEGGETPPR" CDS 243580..244683 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0211" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0211, -, len: 367 aa. Equivalent to Rv0205, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 367 aa overlap). Possible conserved transmembrane protein, similar to hypothetical proteins from many bacteria e.g. AL0209|SC4H8_6 from Streptomyces coelicolor (402 aa), FASTA scores: opt: 436, E(): 1.7e-21, (27.2% identity in 349 aa overlap); Z99117|BSUB0014_221 from Bacillus subtilis (353 aa), FASTA scores: opt: 394, E(): 8.6e-19, (28.7% identity in 324 aa overlap). Protein product from Mb0211 detected using SWATH mass spectrometry. Mb0211 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247917.1" /translation="MSASLDDASVAPLVRKTAAWAWRFLVILAAMVALLWVLNKFEVI VVPVLLALMLSALLVPPVDWLDSRGLPRAVAVTLVLLSGFAVLGGILTFVVSQFIAGL PHLVTEVERSIDSARRWLIEGPAHLRGEQIDNAGNAAIEALRNNQAKLTSGALSTAAT ITELVTAAVLVLFTLIFFLYGGRSIWQYVTKAFPASVRDRVRAAGRAGYASLIGYARA TFLVALTDAAGVGAGLAVMGVPLALPLASLVFFGAFIPLIGAVVAGFLAVVVALLAKG IGYALITVGLLIAVNQLEAHLLQPLVMGRAVSIHPLAVVLAIAAGGVLAGVVGALLAV PTAAFFNNAVQVLLGGNPFADVADVSSDHLTEV" CDS complement(244680..247514) /codon_start=1 /transl_table=11 /gene="mmpL3" /locus_tag="BQ2027_MB0212C" /product="POSSIBLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL3" /note="Mb0212c, mmpL3, len: 944 aa. Equivalent to Rv0206c, len: 944 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 944 aa overlap). Possible mmpL3, conserved transmembrane transport protein (see first citation below), equivalent to Z95398|MLCL622.18c from Mycobacterium leprae (955 aa), FASTA scores: opt: 806, E(): 1.8e-21, (57.2% identity in 243 aa overlap). Member of RND superfamily, similar to others. BELONGS TO THE MMPL FAMILY. TBparse score is 0.928. Protein product from Mb0212c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0212c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247918.1" /translation="MFAWWGRTVYRYRFIVIGVMVALCLGGGVFGLSLGKHVTQSGFY DDGSQSVQASVLGDQVYGRDRSGHIVAIFQAPAGKTVDDPAWSKKVVDELNRFQQDHP DQVLGWAGYLRASQATGMATADKKYTFVSIPLKGDDDDTILNNYKAIAPDLQRLDGGT VKLAGLQPVAEALTGTIATDQRRMEVLALPLVAVVLFFVFGGVIAAGLPVMVGGLCIA GALGIMRFLAIFGPVHYFAQPVVSLIGLGIAIDYGLFIVSRFREEIAEGYDTETAVRR TVITAGRTVTFSAVLIVASAIGLLLFPQGFLKSLTYATIASVMLSAILSITVLPACLG ILGKHVDALGVRTLFRVPFLANWKISAAYLNWLADRLQRTKTREEVEAGIWGKLVNRV MKRPVLFAAPIVIIMILLIIPVGKLSLGGISEKYLPPTNSVRQAQEEFDKLFPGYRTN PLTLVIQTSNHQPVTEAQIADIRSKAMAIGGFIEPDNDPANMWQERAYAVGASKDPSV RVLQNGLINPADASKKLTELRAITPPKGITVLVGGTPALELDSIHGLFAKMPLMVVIL LTTTIVLMFLAFGSVVLPIKATLMSALTLGSTMGILTWIFVDGHFSKWLNFTPTPLTA PVIGLIIALVFGLSTDYEVFLVSRMVEARERGMSTQEAIRIGTAATGRIITAAALIVA VVAGAFVFSDLVMMKYLAFGLMAALLLDATVVRMFLVPSVMKLLGDDCWWAPRWARRL QTRIGLGEIHLPDERKRPVSNGRPARPPVTAGLVAARAAGDPRPPHDPTHPLAESPRP ARSSPASSPELTPALEATAAPAAPSGASTTRMQIGSSTEPPTTRLAAAGRSVQSPAST PPPTPTPPSAPSAGQTRAMPLAANRSTDAAGDPAEPTAALPIIRSDGDDSEAATEQLN ARGTSDKTRQRRRGGGALSAQDLLRREGRL" CDS complement(247580..248308) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0213C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0213c, -, len: 242 aa. Equivalent to Rv0207c, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 242 aa overlap). Conserved hypothetical protein, equivalent to Z95398|MLCL622_19 from Mycobacterium leprae (261 aa), FASTA scores: E(): 0, (60.8 identity in 199 aa overlap). Protein product from Mb0213c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0213c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247919.1" /translation="MSLTEDVTSQTSESLARHSVLAEDLSQDGLTSLGAPGARVLLVW DAPNLDMGLGSILGRRPTALERPRFDALGRWLLARTAEIVAGRPGISTEPEATVFTNI APGSAEVVRPWVDALRNVGFAVFAKPKVDEDSDVDRDMLAHIDERYREGLAALVVASA DGQAFRQPLEAVARSGTPVQVLGFREHASWALASDTLEFVDLEDIAGVFREPLPRIGL DSLPEQGAWLQPFRPLSSLLTSRV" CDS complement(248311..249102) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0214C" /product="HYPOTHETICAL METHLYTRANSFERASE (METHYLASE)" /note="Mb0214c, -, len: 263 aa. Equivalent to Rv0208c, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv (100.0% identity in 263 aa overlap). Hypothetical methyltransferase (EC 2.1.1.-), equivalent to Z95398|MLCL622_20 from Mycobacterium leprae (279 aa), FASTA score: (64.2% identity in 246 aa overlaps). Also similar to others e.g. 10178368|CAC08407.1|AL392177|Q9F305|MT04_STRCO|SCD17A.03c HYPOTHETICAL METHLYTRANSFERASE from Streptomyces coelicolor (271 aa). Could start at aa 7. Protein product from Mb0214c detected using SWATH mass spectrometry. Mb0214c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247920.1" /translation="MVHHGQMHAQPGVGLRPDTPVASGQLPSTSIRSRRSGISKAQRE TWERLWPELGLLALPQSPRGTPVDTRAWFGRDAPVVLEIGSGSGTSTLAMAKAEPHVD VIAVDVYRRGLAQLLCAIDKVGSDGINIRLILGNAVDVLQHLIAPDSLCGVRVFFPDP WPKARHHKRRLLQPATMALIADRLVPSGVLHAATDHPGYAEHIAAAGDAEPRLVRVDP DTELLPISVVRPATKYERKAQLGGGAVIELLWKKHGCSERDLKIR" CDS 249234..250319 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0215" /product="HYPOTHETICAL PROTEIN" /note="Mb0215, -, len: 361 aa. Equivalent to Rv0209, len: 361 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 361 aa overlap). Hypothetical unknown protein. Mb0215 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247921.1" /translation="MRGQGHQIFVDELARFATSSADQRVVAIAQRAAEPLRVAVRGRP GVGCRTVARALQGAGSSSGMTVTPQARAADSDVDLVVYVTVEVVKPEDREAIAATRRP VVAVLNKADLAGPLSGAGPIVMAQARCAQFSTLLGVPMESMIGLLAVAALDDLDDTLR AALRALAAHPDGFDALDRAVAGFLAAALPVPTEVRLRLLDTLDLFGIALGMAAFRPGR PSRTPAQLRTLLRRVSGVDAVIDKVTAAGSEVRYRRLLDAVAELEALAAQAKEIGGPI GEFLRDDDTVLARMAAAVDVALAVGLDVGPLDDPAAHLPRAVRWHRYSLDNGDMHRTC GADIARGSLRLWSLAGGMPLHRYRKSS" CDS 250316..251794 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0216" /product="HYPOTHETICAL PROTEIN" /note="Mb0216, -, len: 492 aa. Equivalent to Rv0210, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 492 aa overlap). Hypothetical unknown protein. Possibly membrane protein; has hydrophobic stretches around aa 333 - 381. Protein product from Mb0216 detected using SWATH mass spectrometry. Mb0216 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247922.1" /translation="MIRAASDDPAGVDELVAAIAPGLAGLGLPVINRREVVLVTGPWL AGVSGVRAALAERLPQRRFVETAELGPGDAPVAVVFVVSAATALTESDCVLLDTAAEH TDAVVAVVSKIDVHRGWRDVLTSNRDRLAARASRYARVPWVGAAAAPELGEPYLDDLV AAIQKQLADPAVARRNMLRAWESRLLMVARRFDGDAQSAGRRARVDALRQQRRTVLRQ GRQSKSEHTIALRAQIQHARVKLSYFARNRCSLLRVELQEHVAGLSRKDIARFAAYTR GRVQEVVAEVGEGAVAHLADVAQLLGVPVQPPVLENLPAVLPTVVAPPLTSRRLEIRL TTLLGAGFGLGIALTLSRLVAGLTPGLAASGMVAGVAIGLAVTAWVVNARALLHDRVV VDRWTGEVTASLRSVVEQLVATRVVAVETLLSTAISERDDAENARVADQVSIIDGELR EHAVAAARAAALRDREMPAVRAALEAVRAELGEPGTPTTGLF" CDS 251978..253798 /codon_start=1 /transl_table=11 /gene="pckA" /locus_tag="BQ2027_MB0217" /standard_name="pckG; pck1" /product="probable iron-regulated phosphoenolpyruvate carboxykinase [gtp] pcka (phosphoenolpyruvate carboxylase) (pepck)(pep carboxykinase)" /note="Mb0217, pckA, len: 606 aa. Equivalent to Rv0211, len: 606 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 606 aa overlap). Probable pckA (alternate gene names: pckG and pck1), phosphoenolpyruvate carboxykinase [GTP] (EC 4.1.1.32), equivalent to Z95398|MLCL622_21 PROBABLE PHOSPHOENOLPYRUVATE CARBOXYKINASE from Mycobacterium leprae (609 aa), FASTA score: (86.1% identity in 605 aa overlap). Also highly similar to others e.g. PPCK_NEOFR|P22130 phosphoenolpyruvate carboxykinase [GTP] (608 aa), FASTA scores: opt: 2287, E(): 0, (55.9% identity in 598 aa overlap). Contains PS00505 Phosphoenolpyruvate carboxykinase (GTP) signature. BELONGS TO THE PHOSPHOENOLPYRUVATE CARBOXYKINASE [GTP] FAMILY. Protein product from Mb0217 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0217 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247923.1" /translation="MTSATIPGLDTAPTNHQGLLSWVEEVAELTQPDRVVFTDGSEEE FQRLCDQLVEAGTFIRLNPEKHKNSYLALSDPSDVARVESRTYICSAKEIDAGPTNNW MDPGEMRSIMKDLYRGCMRGRTMYVVPFCMGPLGAEDPKLGVEITDSEYVVVSMRTMT RMGKAALEKMGDDGFFVKALHSVGAPLEPGQKDVAWPCSETKYITHFPETREIWSYGS GYGGNALLGKKCYSLRIASAMAHDEGWLAEHMLILKLISPENKAYYFAAAFPSACGKT NLAMLQPTIPGWRAETLGDDIAWMRFGKDGRLYAVNPEFGFFGVAPGTNWKSNPNAMR TIAAGNTVFTNVALTDDGDVWWEGLEGDPQHLIDWKGNDWYFRETETNAAHPNSRYCT PMSQCPILAPEWDDPQGVPISGILFGGRRKTTVPLVTEARDWQHGVFIGATLGSEQTA AAEGKVGNVRRDPMAMLPFLGYNVGDYFQHWINLGKHADESKLPKVFFVNWFRRGDDG RFLWPGFGENSRVLKWIVDRIEHKAGGATTPIGTVPAVEDLDLDGLDVDAADVAAALA VDADEWRQELPLIEEWLQFVGEKLPTGVKDEFDALKERLG" CDS complement(253865..254836) /codon_start=1 /transl_table=11 /gene="nadR" /locus_tag="BQ2027_MB0218C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN NADR (PROBABLY ASNC-FAMILY)" /note="Mb0218c, nadR, len: 323 aa. Equivalent to Rv0212c, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). Possible nadR (alternate gene name: nadI), transcriptional regulator, similar to others e.g. NADR_ECOLI|P27278 transcriptional regulator from Escherichia coli (410 aa), FASTA scores: opt: 377, E (): 1e-17, (31.1% identity in 347 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0218c detected using SWATH mass spectrometry. Mb0218c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247924.1" /translation="MTHGMVLGKFMPPHAGHVYLCEFARRWVDELTIVVGSTAAEPIP GAQRVAWMRELFPFDRVVHLANENPQRPWEHPDFWDIWKASLQGVLATRPDFVFGAEP YNADFAQVLGARFVAVDHGRTVVPVTATDIRADPLGHWQHIPRCVRPAFVKRVSIIGP ESTGKTTLAQAVAEKLRTKWVPERAKMLRELNGGSLIGLEWAEIVRGQIASEEALARD ADRVLICDTDPLATTVWAEFLAGGCPQELRDLARRPYDLTLLTTPDVPWDADDGRCVP GARGTFFARCEQALRAAGRSFVVITGGWEERLSVSLRAVEELVRARR" CDS complement(254833..256146) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0219C" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb0219c, -, len: 437 aa. Equivalent to Rv0213c, len: 437 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 437 aa overlap). Possible methyltransferase (EC 2.1.1.-), weakly similar to others methyltransferases e.g. AF127374_30|LINA from Streptomyces lavendulae (611 aa), FASTA scores: opt: 400, E(): 8.1e-19, (27.3% identity in 388 aa overlap); Q50258 fortimicin kl1 methyltransferase (553 aa), FASTA scores: opt: 267, E(): 1.2e-13, (29.3% identity in 351 aa overlap). Mb0219c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247925.1" /translation="MSIKAYAKTQGIAVTSVNGLVAGHGSVQETWLAMQSAAALSGTP RLVGFSCIDTFPEVLWLAQRARQAWDGVRIVIGNAMATLNYERILRQHDCFDYVVVGD GEVAFTKLALALANDAAVDDVPGLARRSEQGQILRTPSSLVDLDELPRPARDELPTVL ADGFAASVFSTRGCPYRCTFCGTGAMSAMLGKDSYRAKSVDAVVDEIDYLVSDYDVNF LSITDDLFISKHPGSQQRAADFANAVLRRGISVNFMVDIRLDSVVDLDLFKHLHRAGL RRVFIGVETGSYEQLRAYRKQILTRGQDAADTINALQQLGIDVIPGTIMFHPTVQPDE LRETVRLLRATKYTVGFKFMSRIVPYPGTPLYQAYSDAGYLTAKWPLGQWEFVDPEAS RVYADVVAKVAPDVGISFDEAEAYFLSRLDEWENVIAGRIAEATS" CDS 256260..257873 /codon_start=1 /transl_table=11 /gene="fadD4" /locus_tag="BQ2027_MB0220" /product="PROBABLE FATTY-ACID-COA LIGASE FADD4 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0220, fadD4, len: 537 aa. Equivalent to Rv0214, len: 537 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 537 aa overlap). Probable fadD4, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. 4CL_PINTA|P41636 4-coumarate--coa ligase (EC 6.2.1.12) (537 aa), FASTA scores: opt: 622, E(): 1e-31, (30.0% identity in 514 aa overlap). Also similar to others from Mycobacterium tuberculosis e.g. MTCY6A4.14 FASTA score: (30.7% identity in 501 aa overlap); MTCY493_27, MTCY07A7_11, MTCI28_6. Contains PS00455 putative AMP-binding domain signature. Protein product from Mb0220 detected using SWATH mass spectrometry. Mb0220 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247926.1" /translation="MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIAADKP AVILYPSGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAARRS GLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGGLPDLLMLAGG GLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPKGIKRELPHVSPDAAPGMM PALLDFWMDADSVYLSPAPMYHTAPSVWTMSALAAGVTTVVMEKFDAEGALDAIQRYR VTHAQFVPAMFVRMLKLPEAVRNSYDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDE YYASSEASGSTLITAEDWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYP FEYLNDPAKTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLRDRLSHFKC PRSIAFEPQLPRTDTGKLYKSGLVEKYSV" CDS complement(257915..259081) /codon_start=1 /transl_table=11 /gene="fadE3" /locus_tag="BQ2027_MB0221C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE3" /note="Mb0221c, fadE3, len: 388 aa. Equivalent to Rv0215c, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 356 aa overlap). Probable fadE3, acyl-dehydrogenase (EC 1.3.99.-), similar to many e.g. ACDB_BACSU|P45857 acyl-CoA dehydrogenase from B. subtilis (EC 1.3.99.-) (379 aa), FASTA scores: opt: 812, E(): 0, (39.5% identity in 354 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 29 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (388 aa versus 357 aa). Protein product from Mb0221c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0221c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247927.1" /translation="MRNELNDDEAMLVATVRAFIDRDVKPTVREVEHANSYPEAWIEQ MKHIGIYGLAIDEQYGGSPVSMPCYVQVTQELARGWMSLAGAMGGHTVVAKLLTLFGT EEQRRTYLPPMASGELRATMALTEPGGGSDLQNMSTTALADGPEGSAGLLINGCKTWI SNARRSGLFAVLCKTDPNATPRHQGMSIVLVEPGPGLTVSRDLPKLGYKGVESCELSF DNLRVPVSAILGGAMGQGFSQMMKGLETGRIQVAARALGVATAALEDSLAYAQQRESF GRPIWQHQAVGNYLADMATKLTAARQLTRYAAERYDSGQRCDMEAGMAKLFASEVAME IALNAVRIHGGYGYSTEYDVERYFRDAPLMILGEGTNEIQRNVIAGQLVARGGI" CDS 259138..260151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0222" /product="double hotdog hydratase" /note="Mb0222, -, len: 337 aa. Equivalent to Rv0216, len: 337 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 337 aa overlap). Conserved hypothetical protein, equivalent to Z95398|MLCL622_22 from Mycobacterium leprae (339 aa), FASTA scores: E(): 0, (73.7 identity in 338 aa overlap). Protein product from Mb0222 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0222 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247928.1" /translation="MASGYGGIRVGGPYFDDLSKGQVFDWAPGVTLSLGLAAAHQSIV GNRLRLALDSDLCAAVTGMPGPLAHPGLVCDVAIGQSTLATQRVKANLFYRGLRFHRF PAVGDTLYTRTEVVGLRANSPKPGRAPTGLAGLRMTTIDRTDRLVLDFYRCAMLPASP DWKPGAVPGDDLSRIGADAPAPAADPTAHWDGAVFRKRVPGPHFDAGIAGAVLHSTAD LVSGAPELARLTLNIAATHHDWRVSGRRLVYGGHTIGLALAQATRLLPNLATVLDWES CDHTAPVHEGDTLYSELHIESAQAHADGGVLGLRSLVYAVSDSASEPDRQVLDWRFSA LQF" CDS complement(260148..261056) /codon_start=1 /transl_table=11 /gene="lipW" /locus_tag="BQ2027_MB0223C" /product="POSSIBLE ESTERASE LIPW" /note="Mb0223c, lipW, len: 302 aa. Equivalent to Rv0217c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Possible esterase (EC 3.1.1.-), showing similarity with others e.g. EST_ACICA|P18773 esterase (303 aa), FASTA scores: opt: 320, E(): 3.2e-13, (29.2% identity in 274 aa overlap). Protein product from Mb0223c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0223c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247929.1" /translation="MSGNEVHPDLRRIAVVTPRQLVGPRTLPVMRALIVVAGLRMSRT PPDIEVLTLESGVGVRLYRPAGSNEPAPALLWIHAGGYVMGTAQQDDRLCLRFSSRLG ITVASVDYRLAPENPYPAALGDCYSALTWLASLPAVDPARVAIGGASAGGGLAAALAL LARDRGGITPAFQLLVYPMLDDRTSIAPANPHYRLWNGRANRFGWRAYLGDADARVAV PGRRDDLGGLAPAWIGVGTHDLLHDEDLAYAERLTAAGVPCQVEVVEGAFHGFDRVAP NVGVSQRFFTSQCNSLRAALALSNRT" CDS 261149..262477 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0224" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0224, -, len: 442 aa. Equivalent to Rv0218, len: 442 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 442 aa overlap). Probable conserved transmembrane protein, some similarity with sulfite oxidases (EC 1.8.3.1) e.g. SUOX_HUMAN|P51687 sulfite oxidase precursor (488 aa), FASTA scores: opt: 153, E(): 0.0087, (28.6% identity in 161 aa overlap); and with some nitrate reductases (EC 1.6.6.3) e.g. NIA_FUSOX|P39863 nitrate reductase (NADPH) (905 aa), FASTA scores: opt: 143, E(): 0.06, (29.3% identity in 92 aa overlap). Also similar to BSUB0017_86 from Mycobacterium tuberculosis. Mb0224 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247930.1" /translation="MSDPARGAEAEDAYGFPAGLWRWLQRHPPPALHRLTRFRSPLRG PWLTSVFGLVLLVALPFVIITGLLSYIAYAPQLGQAIPGDVGWLRLPAFTWPTRPSWL YRLTQGLHVGLGLVIIPVVLAKLWSVIPRLFVWPPARSIAQVLERLSVLMLVGGILFQ IVTGVLNIQYDYIFGFSFYTGHYFGAWVFIAGFLLHIVVKIPHMVTGLRSIPMREVLG TNVADTRAQPCDPDGLVSVNPGEATLSRRGALGLVGAGVLLIGVLTVGQTLGGFTRKA ALLLPRGRVVSPGDFPVNKTAAAAGITAEAIGPDWRLVLRGGPAEVVLDRATLAGLPQ RTARLPLACVEGWSAVRTWSGVPLAELALLAGVPAARSARVTSLQRGGAFGEAKLAAN QIADPDALLALRVDGADLSLNHGYPARIIVPALPGVHNTKWVAGIEFHKR" CDS 262479..263027 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0225" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0225, -, len: 182 aa. Equivalent to Rv0219, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Probable conserved transmembrane protein, showing similarity with CAB76992.1|AL159178 putative lipoprotein from Streptomyces coelicolor (163 aa)." /protein_id="CAB5247931.1" /translation="MFDIATRFKNSYGSGPLHLLAMVSGFALLGYIVATARPSALWNQ ATWWQSIAVWFVAAVVAHDLLLYPLYALADRILARLVGRRDVSAPRRRPELPVRNYIR IPALAAGLTLLVFLPGIIRQGAPTYLDATGQTQEPFLGRWLLLTAVAFGISAAAYAIR LVVAHVRRRRAGCSRVDAIDEE" CDS 263037..264248 /codon_start=1 /transl_table=11 /gene="lipC" /locus_tag="BQ2027_MB0226" /product="PROBABLE ESTERASE LIPC" /note="Mb0226, lipC, len: 403 aa. Equivalent to Rv0220, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 403 aa overlap). Probable esterase (EC 3.1.1.-), similar to others proteins and esterases from various organisms and Mycobacterium tuberculosis e.g. Q50681 (431 aa), FASTA scores: opt: 841, E(): 0, (38.2% identity in 408 aa overlap); Rv1426c, Rv1399c, etc. Contains PS00122 Carboxylesterases type-B serine active site. Protein product from Mb0226 detected using SWATH mass spectrometry. Mb0226 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247932.1" /translation="MNQRRAAGSTGVAYIRWLLRARPADYMLALSVAGGSLPVVGKHL KPLGGVTAIGVWGARHASDFLSATAKDLLTPGINEVRRRDRASTQEVSVAALRGIVSP DDLAVEWPAPERTPPVCGALRHRRYVHRRRVLYGDDPAQLLDVWRRKDMPTKPAPVLI FVPGGAWVHGSRAIQGYAVLSRLAAQGWVCLSIDYRVAPHHRWPRHILDVKTAIAWAR ANVDKFGGDRNFIAVAGCSAGGHLSALAGLTANDPQYQAELPEGSDTSVDAVVGIYGR YDWEDRSTPERARFVDFLERVVVQRTIDRHPEVFRDASPIQRVTRNAPPFLVIHGSRD CVIPVEQARSFVERLRAVSRSQVGYLELPGAGHGFDLLDGARTGPTAHAIALFLNQVH RSRAQFAKEVI" CDS 264292..265065 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0227" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb0227, -, len: 257 aa. Equivalent to 5' end of Rv0221, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 234 aa overlap). Conserved hypothetical protein, similar to others proteins from Mycobacterium tuberculosis e.g. Q50680|Rv2285|MT2343|MTCY339.25c hypothetical 47.7 kDa protein (445 aa), FASTA scores: opt: 455, E(): 8.1e-23, (26.7% identity in 461 aa overlap); Rv3740c, Rv3734c, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 1902 bp deletion leads to a shorter product with a different COOH part, compared to its homolog in Mycobacterium tuberculosis strain H37Rv (257 aa versus 469 aa). It also leads to the deletion of the next protein, echA1 (Rv0222). Protein product from Mb0227 detected using SWATH mass spectrometry. Mb0227 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247933.1" /translation="MKRLSGWDAVLLYSETPNVHMHTLKVAVIELDSDRQEFGVDAFR EVIAGRLHKLEPLGYQLVDVPLKFHHPMWREHCQVDLNYHIRPWRLRAPGGRRELDEA VGEIASTPLNRDHPLWEMYFVEGLANHRIAVVAKIHHALADGVASANMMARGMDLLPG PEVGRYVPDPAPTKRQLLSAAFIDHLRHLGRIPATIRYTTQGLGRVRRSSRKLSPALT MPFTPPPTFMNRIKKPLSKPSGRPPPHTNRAPSSMPLAM" CDS complement(264931..266088) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0228C" /product="PROBABLE ALDEHYDE DEHYDROGENASE" /note="Mb0228c, -, len: 385 aa. Equivalent to Rv0223c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (95.3% identity in 364 aa overlap). Probable aldehyde dehydrogenase (EC 1.2.1.-), similar to others e.g. A75608|6460525|AAF12231.1|AE001862_57 aldehyde dehydrogenase from Deinococcus radiodurans strain R1 (495 aa); Q47943 L-sorbosone dehydrogenase NAD(P) dependent from Gluconobacter oxydans (498 aa), FASTA scores: opt: 1157, E (): 0, (42.1% identity in 482 aa overlap); etc. Also similar to Rv0768, Rv2858c, etc from Mycobacterium tuberculosis. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site; and PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-c) leads to a shorter product, with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (385 aa versus 487 aa). Protein product from Mb0228c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0228c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247934.1" /translation="MSDSATEYDKLFIGGKWTKPSTSDVIEVRCPATGEYVGKVPMAA AADVDAAVAAARAAFDNGPWPSTPPHERAAVIAAAVKMLAERKDLFTKLLAAETGQPP TIIETMHWMGSMGAMNYFAGAADKVTWTETRTGSYGQSIVSRESVGVVGAIVAWNVPL FLAVNKIAPALLAGCTIVLKPAAETPLTANALAEVFAEVGLPEGVLSVVPGGIETGQA LTSNPDIDMFTFTGSSAVGREVGRRAAEMLKPCTLELGGKSAAIILEDVDLAAAIPMM VFSGVMNAGQGCVNQTRILAPRSRYDEIVAAVTNFVTALPVGPPSDPAAQIGPLISEK QRTRVEALHRQGHRGGRSVGVRRRPSRGLGQRLLYPIHERRWRGKWHGQCG" CDS complement(266187..266951) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0229C" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb0229c, -, len: 254 aa. Equivalent to Rv0224c, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 254 aa overlap). Possible methyltransferase (EC 2.1.1.-), showing weak similarity with other methyltransferases e.g. P74388 STEROL-C-METHYLTRANSFERASE (318 aa), FASTA scores: opt: 190, E(): 3.6e-05, (33.3% identity in 114 aa overlap). Equivalent to AL022486|MLCB1883_1 from Mycobacterium leprae (269 aa), FASTA scores: opt: 1456, E(): 0, (82.9% identity in 252 aa overlap). Also some similarity with MTCY21B4.22c from Mycobacterium tuberculosis FASTA score: (30.1% identity in 136 aa overlap). Mb0229c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247935.1" /translation="MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAM IGDLWLATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAFTG RPGMFVRASGMALPLADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKPGGLVVLSYTV WLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSSLFAVSAAEGLRWAAGTGA ALAVFPRYHPRWAWWLTSVPVLREFLVSNLVLVLTP" CDS 266987..268141 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0230" /product="Glycosyltransferase" /note="Mb0230, -, len: 384 aa. Equivalent to Rv0225, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 384 aa overlap). Possible conserved protein involved in LPS biosynthesis, similar to O26275 LPS BIOSYNTHESIS RFBU RELATED PROTEIN (382 aa), FASTA scores: opt: 426, E(): 1.2e-20, (28.2% identity in 394 aa overlap). Some similarity with Rv3032 from Mycobacterium tuberculosis FASTA score: (31.6% identity in 228 aa overlap). Protein product from Mb0230 detected using SWATH mass spectrometry. Mb0230 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247936.1" /translation="MSALRSVLLLCWRDIGHPQGGGSEAYLQRIGAQLAASGIAVTLR TARYPGAPRHELVDGVRISRAGGRYSVYLWALLAMAAARCGLGPLRRVRPDVVVDTQN GWPFVARLLYGRRSLVLVHHCHREQWPVAGRMMGRLGWYVESMLSPRLHRRNQYVTVS LPSARDLIALGVDSERIAVVRNGLDEAPSPTLSGPRAPTPRVVVLSRLVPHKQIEDAL AAVAELQPRIPGLHLDIVGGGWWRQRLVDHVHRLDIADAVTFHGHVDDVTKHHVLQSS WVHLLPSRKEGWGLAVIEAAQHGVPTIGYRSSGGLADSIVDGVTGILVDDRAELVAWL EQLLSDSVLRDQLGAKAQARSGEFSWRQSAEALRSVLEAVQASRFVSGVV" CDS complement(268158..269888) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0231C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0231c, -, len: 576 aa. Equivalent to Rv0226c, len: 576 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 576 aa overlap). Probable conserved transmembrane protein, equivalent, except in N-terminal part, to AC32114.1|AL583926 conserved membrane protein from Mycobacterium leprae (600 aa), FASTA scores: opt: 2086, E(): 0, (70.3% identity in 579 aa overlap). Also similar to AL021411|SC7H1_20 from Streptomyces coelicolor (483 aa), FASTA scores: opt: 180, E(): 0.00028, (26.5 identity in 388 aa overlap). Protein product from Mb0231c detected using SWATH mass spectrometry. Mb0231c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247937.1" /translation="MRWFRPGYALVLVLLLAAPLLRPGYLLLRDAVSTPRSYVSANAL GLTSAPRATPQDFAVALASHLVDGGVVVKALLLLGLWLAGWGAARLVATALPAAGAAG QFVAITLAIWNPYVAERLLQGHWSLLVGYGCLPWVATAMLTMRTTVGAGWFGLFGLAF WVALAGLTPSGLLLAATVAVVCVAMPGAGRPRWQCGVAALGSALVGALPWLTASALGS SLTSHTAANQLGVTAFAPRAEPGLGTLGSLASLGGIWNGEAVPSSRTTLFAVASAVVL LAMVAIGLPTVARRPVAVPLLTLAAVSVMVPAVLATGPGLHALRVVVDAAPGLGVLRD GQKWVALAVPGYTLSGAGTVLTLRRWLRPATAAVVCCLALVLTLPDLAWGVWGKVAPV HYPSGWAAVAAAINADPRTVAVLPAGTMRRFSWSGSAPVLDPLPRWVRADVLTTGDLV ISGVTVPGEDAHARAVQELLLTGPHPSTLAAAGVGWLVVESDSAGDMGAAARTLGRLA AAHRDDELALYRVGGQTSGASSARLKATMLAHWAWLSMLLVGGAGAAGYWVRRHLHHC EDTPASRAQD" CDS complement(269898..271163) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0232C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0232c, -, len: 421 aa. Equivalent to Rv0227c, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 421 aa overlap). Possible conserved membrane protein, equivalent to AL022486|MLCB1883_4 from Mycobacterium leprae (448 aa), FASTA scores: opt: 2148, E(): 0, (76.6% identity in 423 aa overlap). Protein product from Mb0232c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0232c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247938.1" /translation="MLRFAACGAIGLGAALLIAALLLSTYTTRRIAEIPLDIDATLIS DGTGTALDSASLATEHIVVNQDVPLVSQQQVTVESPANADVVTLQVGSSLRRTDKQKD SGLLLAIVDTVTLNRKTAMAVSDDTHTGGAVQKPRGLNDENPPTAIPLRHDGLSYRFP FHTEKKTYPYFDPIAQKAFDANYEGEEDVNGLTTYRFTQNVGYTPEGKLVAPLKYPSL YAGDEDGKVTTSAAMWGLPGDPNEQITMTRYYAAQRTFWVDPVSGTIVKETERANHYF ARDPLKPEVTFADYQVTSTEETVESQVNAARDERDRLALWSRVLPITFTAAGLVALVG GGLFASFSLRTEGALMAASGDRDDHDYRRGGFEEPVPGAEAETEKLPTQRPDFPREPN GSDPPRLGSAQPPPPPDAGHPDPGPPERR" CDS 271379..272602 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0233" /product="PROBABLE INTEGRAL MEMBRANE ACYLTRANSFERASE" /note="Mb0233, -, len: 407 aa. Equivalent to Rv0228, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 407 aa overlap). Probable integral membrane acyltransferase (EC 2.3.1.-), equivalent to 3063875|CAA18555.1|AL022486|T44870 ACYLTRANSFERASE from Mycobacterium leprae (384 aa), FASTA scores: opt: 2004, E(): 0, (79.3% identity in 381 aa overlap). Also similar to others e.g. Q11064 PROBABLE ACYLTRANSFERASE CY50.28C (383 aa), FASTA scores: opt: 372, E(): 2.6e-16, (35.9% identity in 359 aa overlap); Q00718|MDMB_STRMY ACYLTRANSFERASE. Very similar to Rv0111, Rv1254, etc from Mycobacterium tuberculosis. Mb0233 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247939.1" /translation="MGPADESGAPIRPQTPHRHTVLVTNGQVVGGTRGFLPAVEGMRA CAAVGVVVTHVAFQTGHSSGVGGRLFGRFDLAVAVFFAVSGFLLWRGHAAAARDLRSH PRTGPYLRSRVARIMPAYVVAVVVILSLLPDADHASLTVWLANLTLTQIYVPLTLTGG LTQMWSLSVEVAFYAALPVLALLGRRIPVGARVPAIAALAALSWAWGWLPLDAGSGIN PLTWPPAFFSWFAAGMLLAEWAYSPVGLPHRWARRRVAMAVTALLGYLVAASPLAGPE GLVPGTAAQFAVKTAMGSLVAFALVAPLVLDRPDTSHRLLGSPAMVTLGRWSYGLFIW HLAALAMVFPVIGAFPFTGRMPTVLVLTLIFGFAIAAVSYALVESPCREALRRWERRN EPISVGELQADAIAP" CDS complement(272630..273310) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0234C" /product="possible conserved membrane protein with pin domain" /note="Mb0234c, -, len: 226 aa. Equivalent to Rv0229c, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 226 aa overlap). Possible conserved membrane protein, similar to several proteins from Mycobacterium tuberculosis. Other possible start sites and could be shorter as C-terminal region has some similarity with Rv2757c|D70880 from Mycobacterium tuberculosis (138 aa), FASTA scores: E(): 1e-15, (45.3% identity in 137 aa overlap), and Rv0301, Rv2546, etc. Also some similarity with Q48177 virulence associated protein C (132 aa), FASTA scores: opt: 101, E(): 0.6, (24.3% identity in 136 aa overlap). Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. Protein product from Mb0234c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0234c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247940.1" /translation="MRQPRRANAMGLALCIYIGSLLIYTPIHGETSRRHRRAGFKHGS YRIGHDDDQRHRQRGPAASHVSASSTRRRRSRHAGRRTARGPRRSMALKYLLDTSVIK RLSRPAVRRAVEPLAEAGAVARTQITDLEVGYSARNETEWQRLMVALSAFDLIESTAS HHRRALGIQRLLAARSQRGRKIPDLLIAAAGEEHGLVVLHYDADFDLIAAVTGQPCQW IVPAGTID" CDS complement(273307..274287) /codon_start=1 /transl_table=11 /gene="php" /locus_tag="BQ2027_MB0235C" /product="PROBABLE PHOSPHOTRIESTERASE PHP (PARATHION HYDROLASE) (PTE) (ARYLDIALKYLPHOSPHATASE) (PARAOXONASE) (A-ESTERASE) (ARYLTRIPHOSPHATASE) (PARAOXON HYDROLASE)" /note="Mb0235c, php, len: 326 aa. Equivalent to Rv0230c, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 326 aa overlap). Probable php, phosphotriesterase (EC 3.1.8.1), similar to others e.g. AAK42653.1|AE006849 putative aryldialkylphosphatase (phosphotriesterase) (paraoxonase) from Sulfolobus solfataricus (314 aa); PHP_ECOLI|P45548 PHOSPHOTRIESTERASE HOMOLOGY PROTEIN from Escherichia coli (292 aa), FASTA scores: opt: 408, E(): 7.1e-20, (31.1% identity in 305 aa overlap); OPD_FLASP|P16648 parathion hydrolase precursor (365 aa), FASTA scores: opt: 319, E(): 5.1e-14, (34.5% identity in 333 aa overlap); etc. BELONGS TO THE PHOSPHOTRIESTERASE FAMILY. COFACTOR: CONTAINS 2 MOLES OF ZINC PER SUBUNIT. Protein product from Mb0235c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0235c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247941.1" /translation="MPELNTARGPIDTADLGVTLMHEHVFIMTTEIAQNYPEAWGDED KRVAGAIARLGELKARGVDTIVDLTVIGLGRYIPRIARVAAATELNIVVATGLYTYND VPFYFHYLGPGAQLDGPEIMTDMFVRDIEHGIADTGIKAGILKCATDEPGLTPGVERV LRAVAQAHKRTGAPISTHTHAGLRRGLDQQRIFAEEGVELSRVVIGHCGDSTDVGYLE ELIAAGSYLGMDRFGVDVISPFQDRVNIVARMCERGHADKMVLSHDACCYFDALPEEL VPVAMPNWHYLHIHNDVIPALKQHGVTDEQLHTMLVDNPRRIFERQGGYQ" CDS 274382..276088 /codon_start=1 /transl_table=11 /gene="fadE4" /locus_tag="BQ2027_MB0236" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE4" /note="Mb0236, fadE4, len: 568 aa. Equivalent to Rv0231, len: 568 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 568 aa overlap). Probable fadE4, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. O29752 ACYL-COA DEHYDROGENASE (ACD-3) from Archaeoglobus fulgidus (576 aa), FASTA scores: opt: 1788, E(): 0, (51.0% identity in 577 aa overlap); ACDB_BACSU|P45857 acyl-coa dehydrogenase from Bacillus subtilis (379 aa), FASTA scores: opt: 232, E(): 2.2e- 08, (21.6% identity in 291 aa overlap). Protein product from Mb0236 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0236 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247942.1" /translation="MLLNPNHLTRKYPDRRSGEIMAATVDFFESRGKARLKHDDHERI WYSDFLDFVGRERIFASLLTPASYGADDCRWDTYRISEFAEIMGFYGLSYWYPFQVTA LGLGPIWMSANEDAKRKAAAGLEAGEVFAFGLSEQTHGADVYQTDMILTPSDGGWTAN GEKYYIGNANVARMVSTFGKIAGTPESQEYVFFVADSQHERYDLIKNVVNSQNYVANY ALRDYPVTEADILHRGAEAFHAALNTVNVCKYNLGWGAIGMCTHALYESVTHAANRHL YGTVVTDFSHVRRLLTDAYVRLIAMKLVASRASDYMRSASAADRRYLLYSPLTKAKVT SEGERVITALWDVIAAKGVEKDTFFETVAREIGLLPRLEGTVHINIGLLGKFMPNYLF APDSTLPVIPRRDDAADDAFLFAQGPTGGLGKVRFHDWRASFDTCAHLPNVALLREQV DVFAELLASATPDAAQQKDIDFAFGVGQLFANVPYAQLILEEARLSGVDEALIDEIFG VLVRDFNTHAVELHGRSATTAEQARFAMRMVRRPVHDPARYDQIWKDHVLALNGAYQM AP" CDS 276223..276912 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0237" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR/ACRR-FAMILY)" /note="Mb0237, -, len: 229 aa. Equivalent to Rv0232, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Probable transcriptional regulatory protein, tetR/AcrR family, similar to others e.g. YIXD_BACSU|P32398 hypothetical transcriptional regulator (191 aa), FASTA scores: opt: 149, E(): 0.0014, (21.5% identity in 158 aa overlap). Also similar to MTV030_11 from Mycobacterium tuberculosis. Contains PS01081 Bacterial regulatory proteins, tetR family signature, and probable helix-turn helix motif from aa 33-54 (Score 1142, +3.08 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb0237 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247943.1" /translation="MPTVTWARVDPARRAAVVEAAEAEFGAHGFSRGSLNVIARRAGV AKGSLFQYFADKRDLYAFIADIASQRVRSYMEDLIRELDPNRPFFEFLTDLLDGWVAY FAEHPRERALHAAATLEVDTDARISVRSVLHRHYLDVLRPLVRDAHARGDLRADSDTG ALMSLLLLIFPHLALAPYMRGLDPILGLDEPTPEQPALAVRRLVAVLAAAFDAQHPAT NSAQTRSEEIT" CDS 276909..277853 /codon_start=1 /transl_table=11 /gene="nrdB" /locus_tag="BQ2027_MB0238" /product="ribonucleoside-diphosphate reductase (beta chain) nrdb (ribonucleotide reductase small chain)" /note="Mb0238, nrdB, len: 314 aa. Equivalent to Rv0233, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 314 aa overlap). Probable nrdB (alternate gene name: rnrS) ribonucleoside-diphosphate reductase, beta chain (EC 1.17.4.1), similar to others e.g. RIR2_SCHPO|P36603 ribonucleoside-diphosphate reductase (391 aa), FASTA scores: opt: 168, E(): 0.00018, (26.1% identity in 199 aa overlap); etc. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE SMALL CHAIN FAMILY. COFACTOR: BINDS 2 IRON IONS. Protein product from Mb0238 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0238 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247944.1" /translation="MTRTRSGSLAAGGLNWASLPLKLFAGGNAKFWDPADIDFTRDRA DWEKLSDDERDYATRLCTQFIAGEEAVTEDIQPFMSAMRAEGRLADEMYLTQFAFEEA KHTQVFRMWLDAVGISEDLHRYLDDLPAYRQIFYAELPECLNALSADPSPAAQVRASV TYNHIVEGMLALTGYYAWHKICVERAILPGMQELVRRIGDDERRHMAWGTFTCRRHVA ADDANWTVFETRMNELIPLALRLIEEGFALYGDQPPFDLSKDDFLQYSTDKGMRRFGT ISNARGRPVAEIDVDYSPAQLEDTFADEDRRTLAAASA" CDS complement(277929..279464) /codon_start=1 /transl_table=11 /gene="gabD1" /locus_tag="BQ2027_MB0239C" /standard_name="gabD2" /product="succinate-semialdehyde dehydrogenase [nadp+] dependent (ssdh) gabd1" /note="Mb0239c, gabD1, len: 511 aa. Equivalent to Rv0234c, len: 511 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 511 aa overlap). Probable gabD1, succinate-semialdehyde dehydrogenase [NADP+] dependent (EC 1.2.1.16), equivalent to AL022486|MLCB1883_6 PROBABLE ALDEHYDE DEHYDROGENASE from Mycobacterium leprae (457 aa), FASTA scores: opt: 2617, E(): 0, (85.7% identity in 455 aa overlap). Also highly similar to Q55585|GABD|SLR0370 PROBABLE SUCCINATE-SEMIALDEHYDE DEHYDROGENASE from Synechocystis sp. strain PCC 6803 (454 aa), FASTA scores: opt: 1676, E(): 0, (55.8% identity in 455 aa overlap); and similar to others e.g. GABD_ECOLI|P25526 succinate-semialdehyde dehydrogenase from Escherichia coli (482 aa), FASTA scores: opt: 929, E(): 0, (36.5% identity in 452 aa overlap); etc. Note that similar to other cytosolic aldehyde dehydrogenases with EC number: 1.2.1.3. Also similar to Rv0768|aldA semialdehyde dehydrogenase from Mycobacterium tuberculosis (489 aa); and gabD2|Rv1731|MTCY04C12.16 POSSIBLE SUCCINATE-SEMIALDEHYDE DEHYDROGENASE [NADP+] DEPENDANT from Mycobacterium tuberculosis (518 aa). Contains PS00070 aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Could start at different site by homology. Note that previously known as gabD2. Protein product from Mb0239c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0239c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247945.1" /translation="MRSVTCSATLVLPVIEPTPADRRPRHLLLGSAGHVSGRLDTGRF VQTHPAKDVSVPIATINPATGETVKTFTAATDDEVDAAIARAHRRFADYRQTSFAQRA RWANATADLLEAEADQAAAMMTLEMGKTLAAAKAEALKCAKGFRYYAENAEALLADEP ADAAKVGASAAYGRYQPLGVILAVMPWNFPLWQAVRFAAPALMAGNVGLLKHASNVPQ CALYLADVIARGGFPDGCFQTLLVSSGAVEAILRDPRVAAATLTGSEPAGQSVGAIAG NEIKPTVLELGGSDPFIVMPSADLDAAVSTAVTGRVQNNGQSCIAAKRFIVHADIYDD FVDKFVARMAALRVGDPTDPDTDVGPLATEQGRNEVAKQVEDAAAAGAVIRCGGKRLD RPGWFYPPTVITDISKDMALYTEEVFGPVASVFRAANIDEAVEIANATTFGLGSNAWT RDETEQRRFIDDIVAGQVFINGMTVSYPELPFGGVKRSGYGRELSAHGIREFCNIKTV WIA" CDS complement(279490..280938) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0240C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0240c, -, len: 482 aa. Equivalent to Rv0235c, len: 482 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 482 aa overlap). Possible conserved transmembrane protein, highly similar to AL133278|CAB61913.1|SCM11_2 putative integral membrane protein from Streptomyces coelicolor (470 aa), FASTA scores: opt: 2116, E(): 0, (61.8% identity in 474 aa overlap); and similar to hypothetical proteins from other organisms e.g. Q13392|384D8_7 hypothetical protein (579 aa), FASTA scores: opt: 355, E(): 6.9e-17, (28.5% identity in 569 aa overlap). Mb0240c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247946.1" /translation="MGWFSAPEYWLGRLALERGTAIIYLIAFVAAAQQFRPLIGEHGM LPVPRYLAGQSFWRTPSIFHFRYSDRVFAGVCWLGAVLSAAVVAGAASFVPLWATMLI WLTLWVLYLSIVNVGQAWYSFGWESLLLETGFLMIFLGNERTAPPILTLLLARWLLFR VEFGAGLIKMRGDSCWRSLTCLYYHHETQPMPGPLSWFFHHLPKPLHRIEVAGNHFAQ LVVPFGLFTPQPAASIAAAIIVVTQLWLVASGNFSWLNWLTILLACSAIDTSSAAALL PMPAQPALSAPPQWFAGLVVVFTAAVLLLSYWPARNLLSSHQRMNMSFNPFHLVNTYG AFGSICRTRREVVIEGTDESPITEQTVWKAYEFKGKPGDPRRLPRQWAPYHLRLDWLM WFAAISPGYALPWMTPFLNRLLRNDPATLKLLRHNPFPQSPPRYVRAQLYQYRFTTVA ELRRDRAWWHRTLIGRYVPPMSLRKVASPPAD" CDS complement(280973..285175) /codon_start=1 /transl_table=11 /gene="aftd" /locus_tag="BQ2027_MB0241C" /product="possible arabinofuranosyltransferase aftd" /note="Mb0241c, -, len: 1400 aa. Equivalent to Rv0236c, len: 1400 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1400 aa overlap). Probable conserved transmembrane protein, equivalent to AL022486|CAC32102.1|MLCB1883_7 possible integral membrane protein from Mycobacterium leprae (1440 aa), FASTA scores: opt: 7491, E(): 0, (78.8% identity in 1397 aa overlap). Mb0241c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247947.1" /translation="MAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFL ARATNLWNSDLPFGQAQNQAYGYLFPHGTFFVIGHLLGVPGWVTQRLWWAVLLTVGFW GLLRVAEALGVGGPSSRVVGAVAFALSPRVLTTLGSISSETLPMMLAPWVLLPTILAL RGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIWWACHRPNRLWWRYTAWWL LAMALATLWWVMALTQLHGVSPPFLDFIESSGVTTQWSSLVEVLRGTDSWTPFVAPNA TAGAPLVTGSAAILGTCLVAAAGLAGLTSPAMPARGRLVTMLLVGVVLLAVGHRGGLA SPVAHPVQAFLDAAGTPLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAF AHPERDKRVAVAVVALTALVVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAAT PTPGRVLVVPGAPFATQVWGTSHDEPLQVLGDGPWGVRDSIPLTPPQTIRALDSVQRL FAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLHRSIAGSPGLAKLAEFG APVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPANPGAPYFAATDQLARVDGGPEVLLR LDERRRLQGQPPLGPVLMTADARAAGLPVPQVAVTDTPVARETDYGRVDHHSSAIRAP GDARHTYNRVPDYPVPGAEPVVGGWTGGRITVSSSSADATAMPDVAPASAPAAAVDGD PATAWVSNALQAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTL RFDEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGFAHPVQL RHTVLVPGPPPGSAIAGWDLGSELLGRPGCAPGPDGVRCAASMALAPEEPANLSRTLT VPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGDSDLVDILGSAYAAADGDPATAW TAPQRVVQHKTPPTLTLALPRPTVVTGLRLAASRSMLPAHPTVVAINLGDGPQVRQLQ VGELTTLWLHPRVTDTVSVSLLDWDDVIDRNALGFDQLKPPGLAEVVVLGAGGAPIAP ADAARNRARALTVDCDHGPVVAVAGRFVHTSIRTTVGALLDGEPVAALPCEREPIALP AGQQELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESATSRV LVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLTFAPNSLYRASLA IGLALLPLLALLAFWRTGRRQLADRPTPPWRPGAWAAAGVLAAGAVIASIAGVMVMGT ALGVRYALRRRERLRDRVTVGLAAGGLILAGAALSRHPWRSVDGYAGNWASVQLLALI SVSVVAASVVATSESRGQDRMQ" CDS complement(285222..285395) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0242C" /product="SMALL SECRETED PROTEIN" /note="Mb0242c, -, len: 57 aa. Equivalent to Rv0236A, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 57 aa overlap). Small secreted protein. Protein product from Mb0242c detected using SWATH mass spectrometry. Mb0242c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247948.1" /translation="MNRIVAPAAASVVVGLLLGAAAIFGVTLMVQQDKKPPLPGGDPS SSVLNRVEYGNRS" CDS 285510..286676 /codon_start=1 /transl_table=11 /gene="lpqI" /locus_tag="BQ2027_MB0243" /product="PROBABLE CONSERVED LIPOPROTEIN LPQI" /note="Mb0243, lpqI, len: 388 aa. Equivalent to Rv0237, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 388 aa overlap). Probable lpQI, conserved lipoprotein, equivalent to AL022486|MLCB1883_8|T44873 probable secreted hydrolase from Mycobacterium leprae (387 aa), FASTA scores: opt: 1831, E(): 0, (73.3% identity in 390 aa overlap). Also similar to other lipoproteins and various hydrolases e.g. P40406|2126897|YBBD_BACSU|I39839 HYPOTHETICAL 70.6 KDA LIPOPROTEIN from Bacillus subtilis (642 aa); P48823|HEXA_ALTSO BETA-HEXOSAMINIDASE A PRECURSOR from ALTEROMONAS SP. (598 aa), FASTA scores: opt: 415, E(): 5.8e-17, (31.2% identity in 343 aa overlap); PCC6803|P74340 BETA-GLUCOSIDASE from Synechocystis sp. (538 aa), FASTA scores: opt: 414, E(): 6.1e-17, (30.6 identity in 320 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0243 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0243 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247949.1" /translation="MAFPRTLAILAAAAALVVACSHGGTPTGSSTTSGASPATPVAVP VPRSCAEPAGIPALLSPRDKLAQLLVVGVRDAADAQAVVTNYHVGGILIGSDTDLTIF DGALAEIVAGGGPLPLAVSVDEEGGRLSRLRSLIGGTGPSARELAQTRTVQQVRDLAR DRGRQMRKLGITIDFAPVVDVTDAPDDTVIGDRSFGSDPATVTAYAGAYAQGLRDAGV LPVLKHFPGHGRGSGDSHNGGVTTPPLDDLVGDDLVPYRTLVTQAPVGVMVGHLQVPG LTGSEPASLSKAAVNLLRTGTGYGAPPFDGPVFSDDLSGMAAISDRFGVSEAVLRTLQ AGADIALWVTTKEVPAVLDRLEQALRAGELPMSAVDRSVVRVATMKGPNPGCGR" CDS 286752..287366 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0244" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb0244, -, len: 204 aa. Equivalent to Rv0238, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Possible transcriptional regulatory protein, TetR family, equivalent to AL022486|MLCB1883_9|T44874 probable transcription regulator from Mycobacterium leprae (208 aa), FASTA scores: opt: 1029, E(): 0, (80.9% identity in 199 aa overlap). Also similar to others e.g. CAB77290.1|AL160312 putative tetR-family regulatory protein from Streptomyces coelicolor (240 aa). Also similar to Mycobacterium tuberculosis proteins Z95120|Rv3208 (228 aa), FASTA scores: opt: 266, E(): 8.3e-12, (28.1% identity in 196 aa overlap); and Rv1019 (197 aa). Protein product from Mb0244 detected using SWATH mass spectrometry. Mb0244 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247950.1" /translation="MAGGTKRLPRAVREQQMLDAAVQMFSVNGYHETSMDAIAAEAQI SKPMLYLYYGSKEDLFGACLNREMSRFIDALRSSINFDQSPKDLLRNTIVSFLRYIDA NRASWIVMYTQATSSQAFAHTVREGREQIVQLVAELVRAGTRGPLTDAEIEMMAVALV GAGEAVATRLGIGDTDVDEAAEMMINLFWLGLKGAPVDRLETGH" CDS 287428..287661 /codon_start=1 /transl_table=11 /gene="vapb24" /locus_tag="BQ2027_MB0245" /product="possible antitoxin vapb24" /note="Mb0245, -, len: 77 aa. Equivalent to Rv0239, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Conserved hypothetical protein, weakly similar to Rv1839c|Z83859|MTCY359_34 from Mycobacterium tuberculosis (87 aa), FASTA scores: opt: 88, E(): 5, (40.0% identity in 45 aa overlap). Mb0245 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247951.1" /translation="MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPR RDAASDTWQPPTPRRLGPFRASEETWRELANEA" CDS 287669..288106 /codon_start=1 /transl_table=11 /gene="vapc24" /locus_tag="BQ2027_MB0246" /product="possible toxin vapc24. contains pin domain." /note="Mb0246, -, len: 145 aa. Equivalent to Rv0240, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 145 aa overlap). Conserved hypothetical protein, weak similarity with Rv3697c from Mycobacterium tuberculosis (145 aa), FASTA scores: opt: 145, E(): 7.6e-05, (28.0% identity in 143 aa overlap). Mb0246 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247952.1" /translation="MLSIDTNILLYAQNRDCPEHDAAAAFLVECSGRADVAVCELVLM ELYQLLRNPTVVTRPLEGPEAAEVCQTFRRNRRWALLENAPVMNEVWVLAATPRIARR RLFDARLALTLRHHGVDEFATRNINGFTDFGFSRVWDPITSDG" CDS complement(288136..288978) /codon_start=1 /transl_table=11 /gene="htdx" /locus_tag="BQ2027_MB0247C" /product="probable 3-hydroxyacyl-thioester dehydratase htdx" /note="Mb0247c, -, len: 280 aa. Equivalent to Rv0241c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Conserved hypothetical protein, highly similar to MLCB1883.17c|T44876063881|CAA18566.1|AL022486 hypothetical protein from Mycobacterium leprae (280 aa), FASTA scores: opt: 1564, E(): 0, (81.8% identity in 280 aa overlap); and CAC32097.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (300 aa). Also similar to proteins from other organisms e.g. CAB77291.1|AL160312 putative dehydratase from Streptomyces coelicolor (291 aa); part of BAA92930.1|AB032743 fatty acid synthetase beta subunit from Pichia angusta (2060 aa). Protein product from Mb0247c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0247c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247953.1" /translation="MTQPSGLKNLLRAAAGALPVVPRTDQLPNRTVTVEELPIDPANV AAYAAVTGLRYGNQVPLTYPFALTFPSVMSLVTGFDFPFAAMGAIHTENHITQYRPIA VTDAVGVRVRAENLREHRRGLLVDLVTNVSVGNDVAWHQVTTFLHQQRTSLSGEPKPP PQKKPKLPPPAAVLRITPAKIRRYAAVGGDHNPIHTNPIAAKLFGFPTVIAHGMFTAA AVLANIEARFPDAVRYSVRFAKPVLLPATAGLYVAEGDGGWDLTLRNMAKGYPHLTAT VRGL" CDS complement(288989..290353) /codon_start=1 /transl_table=11 /gene="fabG4" /locus_tag="BQ2027_MB0248C" /product="PROBABLE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG4 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)" /note="Mb0248c, fabG4, len: 454 aa. Equivalent to Rv0242c, len: 454 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 454 aa overlap). Probable fabG4, 3-oxoacyl-[acyl-carrier protein] reductase (EC 1.1.1.100), equivalent to 3063883|CAA18568.1|AL022486|MLCB1883_13|T448 78 3-oxoacyl-[acyl-carrier protein] reductase homolog from Mycobacterium leprae (454 aa), FASTA scores: opt: 2486, E(): 0, (84.8% identity in 454 aa overlap). C-terminal part highly similar to many FabG proteins e.g. U39441|VHU3944 1_2 from Vibrio harveyi (244 aa), FASTA scores: opt: 562, E(): 3.4e-28, (40.2% identity in 241 aa overlap); U91631|PAU91631_3 from Pseudomonas aeruginosa (247 aa), FASTA scores: opt: 584, E(): 1.5e-29, (44.4% identity in 241 aa overlap). Has N-terminal extension of ~200 aa and C-terminal part contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb0248c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0248c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247954.1" /translation="MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPL TGSLLIGGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLKGL HEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGKELRRGATTAL VYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADDSTPPADWEKPLDGKVAIV TGAARGIGATIAEVFARDGAHVVAIDVESAAENLAETASKVGGTALWLDVTADDAVDK ISEHLRDHHGGKADILVNNAGITRDKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNG SIGEGGRVIGLSSIAGIAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGF IETQMTAAIPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA MIGA" CDS 290495..291817 /codon_start=1 /transl_table=11 /gene="fadA2" /locus_tag="BQ2027_MB0249" /product="PROBABLE ACETYL-COA ACYLTRANSFERASE FADA2 (3-KETOACYL-COA THIOLASE) (BETA-KETOTHIOLASE)" /note="Mb0249, fadA2, len: 440 aa. Equivalent to Rv0243, len: 440 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 440 aa overlap). Probable fadA2, acetyl-CoA acyltransferase (3-acyl-CoA thiolase) (EC 2.3.1.16), equivalent, but shorter 17 aa, to AL022486|MLCB1883_14T44879 acetyltransferase from Mycobacterium leprae (457 aa), FASTA scores: opt: 250 7, E(): 0, (87.6% identity in 435 aa overlap). Also highly similar to many e.g. G83046|PA478 probable acyl-CoA thiolase from Pseudomonas aeruginosa (425 aa); AB77293.1|AL160312 putative ketoacyl CoA thiolase from Streptomyces coelicolor (428 aa); P76503|7449731|YFCY_ECOLI|D65007|B2342 PROBABLE 3-KETOACYL-COA THIOLASE (ACETYL-COA ACYLTRANSFERASE) (BETA-KETOTHIOLASE) from Escherichia coli strain K-12 (436 aa), FASTA scores: opt: 914, E(): 0, (38.2% identity in 434 aa overlap); P55084|ECHB_HUMAN MITOCHONDRIAL TRIFUNCTONAL ENZYME (474 aa), FASTA scores: opt: 881, E(): 0, (37.7 identity in 451 aa overlap). Contains PS00099 Thiolases active site. BELONGS TO THE THIOLASE FAMILY. Protein product from Mb0249 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0249 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247955.1" /translation="MAPAAKNTSQTRRRVAVLGGNRIPFARSDGAYADASNQDMFTAA LSGLVDRFGLAGERLDMVVGGAVLKHSRDFNLMRECVLGSELSPYTPAFDLQQACGTG LQAAIAAADGIAAGRYEVAAAGGVDTTSDPPIGLGDDLRRTLLKLRRSRSNVQRLKLV GTLPASLGVEIPANSEPRTGLSMGEHAAVTAKQMGIKRVDQDELAAASHRNMADAYDR GFFDDLVSPFLGLYRDDNLRPNSSVEKLATLRPVFGVKAGDATMTAGNSTPLTDGASV ALLASEQWAEAHSLAPLAYLVDAETAAVDYVNGNDGLLMAPTYAVPRLLARNGLSLQD FDFYEIHEAFASVVLAHLAAWESEEYCKRRLGLDAALGSIDRSKLNVNGSSLAAGHPF AATGGRILAQTAKQLAEKKAAKKGGGPLRGLISICAAGGQGVAAILEA" CDS complement(292123..293958) /codon_start=1 /transl_table=11 /gene="fadE5" /locus_tag="BQ2027_MB0250C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE5" /note="Mb0250c, fadE5, len: 611 aa. Equivalent to Rv0244c, len: 611 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 611 aa overlap). Probable fadE5, acyl-CoA dehydrogenase (EC 1.3.99.-), equivalent to AL022486|MLCB1883_15 from Mycobacterium leprae (611 aa), FASTA scores: opt: 3598, E(): 0, (89.4% identity in 611 aa overlap). Also highly similar to AL0211|MTV007.14 from Mycobacterium tuberculosis (609 aa), FASTA scores: opt: 2576, E(): 0, (64.6% identity in 611 aa overlap); and to various other bacterial proteins described as putative acyl-CoA dehydrogenases e.g. AE0010|AE001025_6 from Archaeoglobus fulgidus (387 aa), FASTA scores: opt: 229, E(): 6.8e-08, (29.8% identity in 409 aa overlap); etc. Protein product from Mb0250c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0250c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247956.1" /translation="MSHYRSNVRDQVFNLFEVLGVDKALGHGEFSDVDVDTARDMLAE VSRLAEGPVAESFVEGDRNPPVFDPKTHSVMLPESFKKSVNAMLEAGWDKVGIDEALG GMPMPKAVVWALHEHILGANPAVWMYAGGAGFAQILYHLGTEEQKKWAVLAAERGWGS TMVLTEPDAGSDVGAARTKAVQQADGSWHIDGVKRFITSGDSGDLFENIFHLVLARPE GAGPGTKGLSLYFVPKFLFDVETGEPGERNGVFVTNVEHKMGLKVSATCELAFGQHGV PAKGWLVGEVHNGIAQMFEVIEQARMMVGTKAIATLSTGYLNALQYAKSRVQGADLTQ MTDKTAPRVTITHHPDVRRSLMTQKAYAEGLRALYLYTATFQDAAVAEVVHGVDAKLA VRVNDLMLPVVKGVGSEQAYAKLTESLQTLGGSGFLQDYPIEQYIRDAKIDSLYEGTT AIQAQDFFFRKIVRDKGVALAHVSGQIQAFVDSGAGNGRLKTERALLAKALTDVQGMA AALTGYLMAAQQDVTSLYKVGLGSVRFLMSVGDLIIGWLLQRQAAVAVAALDAGATGD ERSFYEGKVAVASFFAKNFLPLLTSTREVIETLDNDIMELDEAAF" CDS 294330..294818 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0251" /product="possible oxidoreductase" /note="Mb0251, -, len: 162 aa. Equivalent to Rv0245, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Possible oxidoreductase (EC 1.-.-.-), equivalent to AL022486|MLCB1883_17|T44882 probable oxidoreductase from Mycobacterium leprae (162 aa), FASTA scores: opt: 860, E(): 0, (83.4% identity in 157 aa overlap). Also similar to several hypothetical proteins and various oxidoreductases e.g. AAK24246.1|AE005898 NADH:riboflavin 5'-phosphate oxidoreductase from Caulobacter crescentus (174 aa); Q02058|DIM6_STRCO|CAA45048.1 ACTINORHODIN POLYKETIDE DIMERASE from STREPTOMYCES COELICOLOR (177 aa), FASTA scores: opt: 308, E(): 3. 2e-15, (37.8% identity in 143 aa overlap). Also similar to Z84498|Rv1939|MTCY09F9.25c from Mycobacterium tuberculosis (171 aa), FASTA scores: opt: 517, E(): 3.5e-30, (49.4% identity in 158 aa overlap). Protein product from Mb0251 detected using SWATH mass spectrometry. Mb0251 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247957.1" /translation="MNSTNNLTPSSLREAFGHFPTGVVAIAAEVDGVRQGLAASTFVP VSLEPPLVSFCVQNTSTTWPKLTGVPMLGISVLGEAHDAAVRTLAAKTGDRFAGLETV SNDAGAVFIKGTSVWLESAIEQLVPAGDHTIVVLRVNQVKVDPNVAPIVFHRSVLRRL GV" CDS 295134..296444 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0252" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0252, -, len: 436 aa. Equivalent to Rv0246, len: 436 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 436 aa overlap). Probable conserved integral membrane protein, similar to Rv2209|1237062|CAA94252.1|Z70283|Q10398|YM09_MYCTU from Mycobacterium tuberculosis (512 aa), FASTA scores: opt: 712, E(): 0, (33.2% identity in 422 aa overlap). Mb0252 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247958.1" /translation="MAKTSHRVSSADGMSKRILRLIIAQSGFYSAALQLGNVSIVLPF VVAELDAELWIAALIFPAFTAGGAIGNVVAPPAVAAVPRRHRLFIIVSCLAVLAGVNA LCATIGKGSVAGILLVVNVTLIGVVSVISFVAFADLVAAMPSGTARARILLTEVGVGA ALTAVVAATLSFVPDQHPLSRNIHLLWTAAVAMAISAAICRALPHRIVPRVHAAPGLH KLVYVGWTAIRTNGWYRRYLLVQVLFGSVVLGSSFHSIRVAAVPGDQPDEVVAVVLFV CVGLLGGIALWNRVRERFGLVGLFVGSALVSIAAAVLSIAFDLAGAWPNVVAIGLVIA LVSIANQSVFTAGQLWIARDAEPGLRTSLISFGQLVINAGLVGMGLALGLIAQDHDAV WPVMIVLLLNLTAAYSATRFAPAKSVDVRGLPQVSRTSRPKTGG" CDS complement(296441..297187) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0253C" /product="PROBABLE SUCCINATE DEHYDROGENASE [IRON-SULFUR SUBUNIT] (SUCCINIC DEHYDROGENASE)" /note="Mb0253c, -, len: 248 aa. Equivalent to Rv0247c, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 248 aa overlap). Probable succinate dehydrogenase, iron-sulfur subunit (EC 1.3.99.1), highly similar to CAC44313.1|AL596043 putative succinate dehydrogenase iron-sulfur subunit from Streptomyces coelicolor (259 aa); and similar to iron-sulphur protein subunits of fumarate reductase or succinate dehydrogenases from many bacteria e.g. NP_147618.1|7521083|B72691 fumarate reductase iron-sulfur protein from Aeropyrum pernix (305 aa); NP_069516.1|2649932|AAB90556.1|AE001057 succinate dehydrogenase iron-sulfur subunit B (sdhB) from Archaeoglobus fulgidus (236 aa); etc. Also similar to Q10761|FRDB_MYCTU|7431693|F70762 FUMARATE REDUCTASE IRON-SULFUR PROTEIN from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 358, E():1e-16, (31.3% identity in 214 aa overlap). Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature. NOTE THAT SUCCINATE DEHYDROGENASE FORMS GENERALLY PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv0248c ?), AN IRON-SULFUR (Rv0247c ?), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv0249c ?). Protein product from Mb0253c detected using shotgun mass spectrometry. Mb0253c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247959.1" /translation="MTYSASMRVWRGDESCGELREFTVEVNEGEVVLDVILRLQQTQT PDLAVRWNCKAGKCGSCSAEINGKPRLMCMTRMSTFDEDEIVTVTPMRTFPVIRDLVT DVSFNYQKAREIPSFAPPKELQPSEYRMAQVDVARSQEFRKCIECFLCQNVCHVVRDH EENKDAFAGPRFLMRIAELEMHPLDTRDRRSQAQEEHGLGYCNITKCCTEVCPENIKI TDNALIPMKERVADRKYDPVVWLGSKLFRR" CDS complement(297188..299128) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0254C" /product="PROBABLE SUCCINATE DEHYDROGENASE [IRON-SULFUR SUBUNIT] (SUCCINIC DEHYDROGENASE)" /note="Mb0254c, -, len: 646 aa. Equivalent to Rv0248c, len: 646 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 646 aa overlap). Probable succinate dehydrogenase, flavoprotein subunit (EC 1.3.99.1), highly similar to flavoprotein subunit of various succinate dehydrogenases e.g. M88696|RIRSDHA_1 flavoprotein from Rickettsia prowazekii (596 aa), FASTA scores: opt: 651, E(): 0, (34.6 % identity in 598 aa overlap). Also similar to truncated U00022_17 flavoprotein from Mycobacterium leprae (401 aa), FASTA scores: opt: 677, E(): 0, (39.0% identity in 423 aa overlap). NOTE THAT SUCCINATE DEHYDROGENASE FORMS GENERALLY PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv0248c ?), AN IRON-SULFUR (Rv0247c ?), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv0249c ?). Protein product from Mb0254c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0254c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247960.1" /translation="MVEVERHSYDVVVIGAGGAGLRAVIEARERGLKVAVVCKSLFGK AHTVMAEGGCAAAMGNANPKDNWKTHFGDTMRGGKFLNNWRMAELHAKEAPDRVWELE TYGALFDRTDDGRISQRNFGGHTYPRLAHVGDRTGLELIRTLQQKVVSLQQEDHAELG DYEARIKVFAECTITELLKDQGAIAGAFGYWRESGRFIVFEAPAVVLATGGIGKSFKV TSNSWEYTGDGHALALRAGATLINMEFVQFHPTGMVWPPSVKGILVTEGVRGDGGVLK NSENSRFMFDYIPPVFKGQYAETEEEADQWLKDNDSARRTPDLLPRDEVARAINSEVK AGRGTPHGGVYLDIASRLTPAEIKRRLPSMYHQFKELAEVDITTQAMEVGPTCHYVMG GVEVDADTGAATVPGLFAAGECAGGMHGSNRLGGNSLSDLLVFGRRAGLGAADYVRAL SSRPAVSAEAIDAAAQQALSPFEGPKDGSAPENPYALHMDLQYVMNDLVGIIRNADEI SRALTLLAELWSRYHNVLVEGHRQYNPGWNLSIDLRNMLLVSECVARAALQRTESRGG HTRDDHPGMDPNWRRILLVCRATETMGTGGSGSGDSNCHINVTQQLQTPMRPDLLELF EISELEKYYTDEELAEHPGRRG" CDS complement(299159..299980) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0255C" /product="probable succinate dehydrogenase [membrane anchor subunit] (succinic dehydrogenase)" /note="Mb0255c, -, len: 273 aa. Equivalent to Rv0249c, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Probable succinate dehydrogenase, membrane-anchor subunit for succinate dehydrogenase encoded by Rv0247c and Rv0248c. Highly similar to AC44315.1|AL596043 putative integral membrane protein from Streptomyces coelicolor (278 aa). NOTE THAT SUCCINATE DEHYDROGENASE FORMS GENERALLY PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv0248c ?), AN IRON-SULFUR (Rv0247c ?), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv0249c ?). Protein product from Mb0255c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0255c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247961.1" /translation="MSAPTANRPAIGVFTPTRAQIPERTLRTDLWWLPPLLTNLGLLA FICYATTRAFWGSQYWVEKYHYLTPFYSPCVSASCQPGASHLGVWFGHFPGWIPLGAM VLPFLLGFRLTCYYYRKAYYRSVWQSPTSCAVPEPRAHYTGETRLPLIVQNTHRYFFY IAVVVSLINTYDAIAAFHSPSGFGFGLGNVILTINVVLLWAYTISCHSCRHATGGRLK HFSKHPVRYWIWTQVSKLNTRHMQFAWITLGTLALTDFYIMLVASGSITDLRFIG" CDS complement(300060..300353) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0256C" /product="conserved protein" /note="Mb0256c, -, len: 97 aa. Equivalent to Rv0250c, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Conserved hypothetical protein, equivalent to MLCB1883.27c|T44883|3063888|CAA18576.1|AL022486 hypothetical protein from Mycobacterium leprae (98 aa), FASTA scores: opt: 478, E(): 4.4e-28, (72.6% identity in 95 aa overlap). Also similar to C-terminus of AC44316.1|AL596043|SCBAC31E11.05c hypothetical protein from Streptomyces coelicolor (146 aa). Protein product from Mb0256c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0256c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247962.1" /translation="MSTTAELAELHDLVGGLRRCVTALKARFGDNPATRRIVIDADRI LTDIELLDTDVSELDLERAAVPQPSEKIAIPDTEYDREFWRDVDDEGVGGHRY" CDS complement(300498..300977) /codon_start=1 /transl_table=11 /gene="hsp" /locus_tag="BQ2027_MB0257C" /standard_name="hsp 20; hrpA; acr2" /product="HEAT SHOCK PROTEIN HSP (HEAT-STRESS-INDUCED RIBOSOME-BINDING PROTEIN A)" /note="Mb0257c, hsp, len: 159 aa. Equivalent to Rv0251c, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). hsp (alternate gene name: hsp20, hrpA, acr2), heat-stress-induced ribosome-binding protein A (see citations below). Highly similar to AAD39038.1|AF072875_1|AF072875 putative HSP20 from Mycobacterium smegmatis (145 aa), FASTA scores: opt: 479, E(): 2.3e-24, (59.9% identity in 157 aa overlap); and similar to many bacterial and eukaryotic hsp proteins e.g. P12811|HS2C_CHLRE CHLOROPLAST HEAT SHOCK 22KD PROTEIN from CHLAMYDOMONAS REINHARDTII (157 aa), FASTA scores: opt: 184, E(): 1.2e-05, (32.4% identity in 142 aa overlap). Also similar to PCC6803 Spore protein sp21 from Synechocystis sp. (146 aa), FASTA scores: opt: 213, E(): 1.2e-07, (30.3 identity in 145 aa overlap). Also similar to P30223|14KD_MYCTU 14 KDA ANTIGEN (16 KDA ANTIGEN) 19K major membrane protein (HSP 16.3) from Mycobacterium tuberculosis (144 aa). BELONG TO THE SMALL HEAT SHOCK PROTEIN (HSP20) FAMILY. Protein product from Mb0257c detected using shotgun mass spectrometry. Mb0257c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247963.1" /translation="MNNLALWSRPVWDVEPWDRWLRDFFGPAATTDWYRPVAGDFTPA AEIVKDGDDAVVRLELPGIDVDKDVNVELDPGQPVSRLVIRGEHRDEHTQDAGDKDGR TLREIRYGSFRRSFRLPAHVTSEAIAASYDAGVLTVRVAGAYKAPAETQAQRIAITK" CDS 301191..303752 /codon_start=1 /transl_table=11 /gene="nirB" /locus_tag="BQ2027_MB0258" /product="PROBABLE NITRITE REDUCTASE [NAD(P)H] LARGE SUBUNIT [FAD FLAVOPROTEIN] NIRB" /note="Mb0258, nirB, len: 853 aa. Equivalent to Rv0252, len: 853 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 853 aa overlap). Probable nirB (alternate gene name: nasB), nitrite reductase [NAD(P)H] large subunit (EC 1.6.6.4), flavoprotein containing siroheme and a 2FE-2S iron-sulfur centre. Highly similar to many others bacterial enzymes e.g. P08201|NIRB_ECOLI NITRITE REDUCTASE (NAD(P)H) LARGE SUBUNIT from Escherichia coli strain K12 (847 aa), FASTA scores: opt: 2775, E(): 0, (49.8% identity in 840 aa overlap); Q06458|NIRB_KLEPN NITRITE REDUCTASE (NAD(P)H) LARGE SUBUNIT (957 aa), FASTA scores: opt: 2902, E(): 0, (54.2% identity in 827 aa overlap). Contains PS00365 Nitrite and sulfite reductases iron-sulfur/siroheme-binding site. HOMODIMER WHICH ASSOCIATES WITH NIRD|Rv0253. COFACTORS: FAD; Iron; Siroheme. Mb0258 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247964.1" /translation="MPTAGSSRAPAAAREIVVVGHGMVGHRLVEAVRARDADGSLRIT VLAEEGDAAYDRVGLTSYTESWDRALLALPGNDYAGDQRVRLLLNTRVTQIDRATKSV VTAAGQRHRYDTLVLATGSYAFVPPVPGHDLPACHVYRTFDDLDAIRAGAQRTLDGGH TDGGVVIGGGLLGLEAANALRQFGLQTHVVEMMPRLMAQQIDEAGGALLARMIADLGI AVHVGTGTESIESVKHSDGSVWARVRLSDGEVIDAGVVIFAAGIRPRDELARAAGLAI GDRGGVLTDLSCRTSDPDIYAVGEVAAIDGRCYGLVGPGYTSAEVVADRLLDGSAEFP EADLSTKLKLLGVDVASFGDAMGATENCLEVVINDAVKRTYAKLVLSDDATTLLGGVL VGDASSYGVLRPMVGAELPGDPLALIAPAGSGAGAGALGVGALPDSAQICSCNNVTKG ELKCAIADGCGDVPALKSCTAAGTSCGSCVPLLKQLLEAEGVEQSKALCEHFSQSRAE LFEIITATEVRTFSGLLDRFGRGKGCDICKPVVASILASTGSDHILDGEQASLQDSND HFLANIQKNGSYSVVPRVPGGDIKPEHLILIGQIAQDFGLYTKITGGQRIDLFGARVD QLPLIWQRLVDGGMESGHAYGKAVRTVKSCVGSDWCRYGQQDSVQLAIDLELRYRGLR APHKIKLGVSGCARECAEARGKDVGVIATEKGWNLYVAGNGGMTPKHAQLLASDLDKE TLIRYIDRFLIYYIRTADRLQRTAPWVESLGLDHVREVVCEDSLGLAEEFEAAMQRHV ANYKCEWKGVLEDPDKLSRFVSFVNAPDAVDSTVTFTERAGRKVPVSIGIPRVRS" CDS 303778..304134 /codon_start=1 /transl_table=11 /gene="nirD" /locus_tag="BQ2027_MB0259" /product="PROBABLE NITRITE REDUCTASE [NAD(P)H] SMALL SUBUNIT NIRD" /note="Mb0259, nirD, len: 118 aa. Equivalent to Rv0253, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Probable nirD, nitrite reductase [NAD(P)H] small subunit (EC 1.6.6.4), similar to others e.g. P23675|NIRD_ECOLI|B3366|Z4727|ECS4217 from Escherichia coli strains K12 and O157:H7 (108 aa), FASTA scores: opt: 271, E():1.7e-12, (41.9% identity in 105 aa overlap). ASSOCIATES WITH NIRB|Rv0252." /protein_id="CAB5247965.1" /translation="MTLLNDIQVWTTACAYDHLIPGRGVGVLLDDGSQVALFRLDDGS VHAVGNVDPFSGAAVMSRGIVGDRGGRAMVQSPILKQAFALDDGSCLDDPRVSVPVYP ARVTPEGRIQVARVAV" CDS complement(304150..304674) /codon_start=1 /transl_table=11 /gene="cobU" /locus_tag="BQ2027_MB0260C" /product="probable bifunctional cobalamin biosynthesis protein cobu: cobinamide kinase + cobinamide phosphate guanylyltransferase" /note="Mb0260c, cobU, len: 174 aa. Equivalent to Rv0254c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Probable cobU, cobalamin biosynthesis protein including a cobinamide kinase (EC 2.-.-.-) and cobinamide phosphate guanylyltransferase (EC 2.-.-.-). Highly similar to many e.g. Q05599|COBU_SALTY COBINAMIDE KINASE / COBINAMIDE PHOSPHATE GUANYLYLTRANSFERASE from Salmonella typhimurium (181 aa), FASTA scores: opt: 308, E(): 1.1e-14, (38.7% identity in 181 aa overlap); P46886|COBU_ECOLI|B1993|Z3153|ECS2788 Bifunctional cobalamin biosynthesis protein cobU from Escherichia coli strains K12 and O157:H7 (181 aa); part of AL096872|SC5F7_10 from Streptomyces coelicolor (397 aa), FASTA scores: opt: 445, E(): 3.6e-23, (46.0% identity in 176 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0260c detected using SWATH mass spectrometry. Mb0260c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247966.1" /translation="MRILVTGGVRSGKSTHAEALLGDAADVVYVAPGRPAAGSDPDWD ARVALHRARRPPTWLTVETADVATALSEARSPVLVDCLGTWLTAIMDGEALWSAATAD VYAVLEARLDGLCAALTGLPTAIVVTNEVGLGVVPSHSSGVLFRDLLGTINRRVAAVC DEVHLVIAGRVLKL" CDS complement(304699..306183) /codon_start=1 /transl_table=11 /gene="cobQ1" /locus_tag="BQ2027_MB0261C" /product="probable cobyric acid synthase cobq1" /note="Mb0261c, cobQ1, len: 494 aa. Equivalent to Rv0255c, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 494 aa overlap). Probable cobQ1, cobyric acid synthase, similar to many e.g. Z46611|RCBLUGNS_8 COBYRIC ACID SYNTHASE from R.capsulatus (483 aa), FASTA scores: opt: 1239, E(): 0, (47.1% identity in 493 aa overlap); P29932|COBQ_PSEDE COBYRIC ACID SYNTHASE from Pseudomonas denitrificans (484 aa), FASTA scores: opt: 1168, E():0, (44.9% identity in 490 aa overlap); etc. BELONGS TO THE COBB/COBQ FAMILY, COBQ SUBFAMILY. Note that previously known as cobQ. Protein product from Mb0261c detected using SWATH mass spectrometry. Mb0261c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247967.1" /translation="MSGLLVAGTTSDAGKSAVTAGLCRALARRGVRVAPFKAQNMSNN SMVCRGPDGTGVEIGRAQWVQALAARTTPEAAMNPVLLKPASDHRSHVVLMGKPWGEV ASSSWCAGRRALAEAACRAFDALAARYDVVVAEGAGSPAEINLRAGDYVNMGLARHAG LPTIVVGDIDRGGVFAAFLGTVALLAAEDQALVAGFVVNKFRGDSDLLAPGLRDLERV TGRRVYGTLPWHPDLWLDSEDALDLQGRRAAGTGARRVAVVRLPRISNFTDVDALGLE PDLDVVFASDPRALDDADLIVLPGTRATIADLAWLRARDLDRALLVHVAAGKPLLGIC GGFQMLGRVIRDPYGIEGPGGQVTEVEGLGLLDVETAFSPHKVLRLPRGEGLGVPASG YEIHHGRITRGDTAEEFLGGARDGPVFGTMWHGSLEGDALREAFLRETLGLAPSGSCF LAARERRLDLLGDLVERHLDVDALLNLARHGCPPTLPFLAPGAP" CDS complement(306202..307872) /codon_start=1 /transl_table=11 /gene="PPE2" /locus_tag="BQ2027_MB0262C" /product="ppe family protein ppe2" /note="Mb0262c, PPE2, len: 556 aa. Equivalent to Rv0256c, len: 556 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 556 aa overlap). Member of the M. tuberculosis PPE family, similar to many e.g. Rv0280, Rv0286, etc. Equivalent to Z98756|MLCB2492.30 from Mycobacterium leprae (572 aa), FASTA scores: opt: 1837, E(): 0, (62.9% identity in 461 aa overlap). Mb0262c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247968.1" /translation="MTAPIWMASPPEVHSALLSSGPGPGPLLVSAEGWHSLSIAYAET ADELAALLAAVQAGTWDGPTAAVYVAAHTPYLAWLVQASANSAAMATRQETAATAYGT ALAAMPTLAELGANHALHGVLMATNFFGINTIPIALNESDYARMWIQAATTMASYQAV STAAVAAAPQTTPAPQIVKANAPTAASDEPNQVQEWLQWLQKIGYTDFYNNVIQPFIN WLTNLPFLQAMFSGFDPWLPSLGNPLTFLSPANIAFALGYPMDIGSYVAFLSQTFAFI GADLAAAFASGNPATIAFTLMFTTVEAIGTIITDTIALVKTLLEQTLALLPAALPLLA APLAPLTLAPASAAGGFAGLSGLAGLVGIPPSAPPVIPPVAAIAPSIPTPTPTPAPAP APTAVTAPTPPPGPPPPPVTAPPPVTGAGIQSFGYLVGDLNSAAQARKAVGTGVRKKT PEPDSAEAPAAAAAPEEQVQPQRRRRPKIKQLGRGYEYLDLDPETGHDPTGSPQGAGT LGFAGTTHKASPGQVAGLITLPNDAFGGSPRTPMMPGTWDTDSATRVE" CDS 308180..308398 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0263" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0263, -, len: 72 aa. Equivalent to Rv0257, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 aa overlap). Hypothetical protein, orthologue of ML1828A conserved hypothetical protein from Mycobacterium leprae. Replaced Rv0257c (older annotation). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (c-t) leads to a truncated product in the 5' direction compared to Mycobacterium tuberculosis strain H37Rv (72 aa versus 124 aa). Mb0263 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247969.1" /translation="MQSVDLHVERHLPSRGRSHRTVATVTCVTALGDIRSAQLSATGA WPAVLFPSWSWLCGIGGGVDLQKPSCRA" CDS complement(308619..309074) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0264C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0264c, -, len: 151 aa. Equivalent to Rv0258c, len: 151 aa (alternative start possible), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Conserved hypothetical protein, showing some similarity to Rv1685c|MTCI125_6 from Mycobacterium tuberculosis (207 aa), FASTA scores: E(): 9.3e-07, (32.1% identity in 140 aa overlap). Also some similarity with AL049819|SCE7_13|T36295 probable transcription regulator from Streptomyces coelicolor (204 aa), FASTA scores: opt: 158, E(): 0.00052, (27.0% identity in 111 aa overlap). Protein product from Mb0264c detected using SWATH mass spectrometry. Mb0264c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247970.1" /translation="MARSQEPSRGLLDPVAKMLRLPFGTPDFIEKIVTGSVNQVGRRT LYVLITTWDAAGGGPFAASAIATTGLAKTAEIVQSMFIGPVFNPLLKMLGADKIAIRA SLCAAQLVGLGIMRYGVRSEPLHSMSVEMLVDAIGPTMQRYLVGDIGRG" CDS complement(309099..309842) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0265C" /product="Sirohydrochlorin ferrochelatase SirB (EC" /EC_number="4.99.1.4" /note="Mb0265c, -, len: 247 aa. Equivalent to Rv0259c, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 247 aa overlap). Conserved hypothetical protein, showing some similarity to Rv2393|Z81368|MTCY253_28 from Mycobacterium tuberculosis (281 aa), FASTA scores: E(): 9.5e-16, (33.6 % identity in 235 aa overlap). Also some similarity with CAC33938.1|AL589708 putative secreted protein from Streptomyces coelicolor (248 aa)." /protein_id="CAB5247971.1" /translation="MNLILTAHGTRRPSGVAMIADIAAQVSALVDRTVQVAFVDVLGP SPSEVLSALSCRPAIVVPAFLSRGYHVRTDLPAHVAASAHPHVTVTPALGPCREIAQI VTQQLVESGWRPGDSVILAAAGASDRRARADLHTTRTLVSELTGSWVDMGFAGTGGPD VRTAVQRARDRAEANRGARRVVVASFLLAEGLFQERLRASGADVVTRPLGTHPGLAQL VANRFRSAVARQQRLHRWHGTPTPVTLDL" CDS complement(309839..310984) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0266C" /product="possible transcriptional regulatory protein" /note="Mb0266c, -, len: 381 aa. Equivalent to Rv0260c, len: 381 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 381 aa overlap). Possible two-component response regulator, highly similar to CAB72204.1|AL138851 putative transcriptional regulator from Streptomyces coelicolor (395 aa); and similar to O34394|D69851|YJJA conserved hypothetical protein from Bacillus subtilis (270 aa), FASTA scores: opt: 312, E(): 7.4e-14, (25.8% identity in 267 aa overlap). Also some similarity to regulatory proteins at C-terminal region e.g. CUTR_STRLI|Q03756 transcriptional regulatory protein (217 aa), FASTA scores: opt: 138, E(): 0.02, (30.6% identity in 111 aa overlap)." /protein_id="CAB5247972.1" /translation="MAQAHSAPLTGYRIAVTSARRAEELCALLRRQGAEVCSAPAIKM IALPDDDELQNNTEALIADPPDILVAHTGIGFRGWLAAAEGWGLANELLESLSSARII SRGPKATGALRAAGLREEWSPDSESSHEVLEYLLESGVSRTRIAVQLHGAADSWDPFP EFLGGLRFAGAQVVPIRVYRWKPAPLGGVFDHLVTGIARRQFDAVTFTSAPAAAAVLE RSRELDIEDQLLAALRTDVHAMCVGPVTSRPLIRKGVPTSAPERMRLGALARHIAEEL PLLGSCTFKAAGHVIEIRGTSVLVDDSVKPLSPSGMAILRALVHRPGGVVSRGDLLRV LPGDGSDTHAVDTAVLRLRTALGDKNIVATVVKRGYRLAVDSRHDDV" CDS complement(311084..312493) /codon_start=1 /transl_table=11 /gene="narK3" /locus_tag="BQ2027_MB0267C" /product="PROBABLE INTEGRAL MEMBRANE NITRITE EXTRUSION PROTEIN NARK3 (NITRITE FACILITATOR)" /note="Mb0267c, narK3, len: 469 aa. Equivalent to Rv0261c, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 469 aa overlap). Probable nirK3, nitrite extrusion protein, integral membrane protein possibly member of major facilitator superfamily (MFS), equivalent to AAB41700.1|U72744 nitrite extrusion protein from Mycobacterium fortuitum (471 aa); and 2342627|CAB11406.1|Z98741|T44908 nitrite extrusion protein homolog from Mycobacterium leprae (517 aa; longer in N-terminus). Also similar to other nitrite extrusion proteins e.g. NARK_ECOLI|P10903|B1223 nitrite extrusion protein 1 from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 755, E(): 0, (35.0% identity in 466 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS." /protein_id="CAB5247973.1" /translation="MGRSHQISDWDPEDSVAWEAGNKFIARRNLIWSVAAEHVGFSVW SLWSVMVLFMPTSVYGFSAGDKFLLGATATLVGACLRFPYTFATAKFGGRNWTIFSAL VLLIPTVGSILLLANPGLPLWPYLVCGALAGLGGGNFAASMTNINAFFPQRLKGAALA LNAGGGNLGVPMVQLVGLLVIATAGDREPYWVCAIYLVLLAVAGLGAALYMDNLTEYR IELNTMRAVVSEPHTWVISLLYIGTFGSFIGFSFAFGQVLQINFIASGQSTAQASLHA AQIAFLGPLLGSLSRIYGGKLADRIGGGRVTLAAFCAMLLATGILISASTFGDHLAGP MPTATMVGYVIGFTALFILSGIGNGSVYKMIPSIFEARSHSLQISEAERRQWSRSMSG ALIGLAGAVGALGGVGVNLALRESYLTSGTATSAFWAFGVFYLVASVLTWAIYVRRGL KSAGELVPATTAPAGLAYV" CDS complement(312634..313179) /codon_start=1 /transl_table=11 /gene="aac" /locus_tag="BQ2027_MB0268C" /standard_name="aac(2')-IC" /product="AMINOGLYCOSIDE 2'-N-ACETYLTRANSFERASE AAC (AAC(2')-IC)" /note="Mb0268c, aac, len: 181 aa. Equivalent to Rv0262c, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). aac, aminoglycoside 2'-N-acetyltransferase (aac(2')-IC) (EC 2.3.1.-) (see citation below), highly similar to NP_302635.1|NC_002677 aminoglycoside 2'-N-acetyltransferase from Mycobacterium leprae (182 aa); Q49157|AAC2_MYCFO|AAC aminoglycoside 2'-N-acetyltransferase from Mycobacterium fortuitum (195 aa), FASTA scores: opt: 884, E(): 0, (69.1% identity in 181 aa overlap); and P94968|AAC2_MYCSM|AAC aminoglycoside 2'-N-acetyltransferase from Mycobacterium smegmatis (210 aa) (see also citation below). Also similar to Q52424|AAC2_PROST AMINOGLYCOSIDE 2'-N-ACETYLTRANSFERASE from Providencia stuartii (178 aa). BELONGS TO THE AAC(2')-I FAMILY OF ACETYLTRANSFERASES. Note that previously known as aac(2')-IC. Protein product from Mb0268c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0268c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247974.1" /translation="MHTQVHTARLVHTADLDSETRQDIRQMVTGAFAGDFTETDWEHT LGGMHALIWHHGAIIAHAAVIQRRLIYRGNALRCGYVEGVAVRADWRGQRLVSALLDA VEQVMRGAYQLGALSSSARARRLYASRGWLPWHGPTSVLAPTGPVRTPDDDGTVFVLP IDISLDTSAELMCDWRAGDVW" CDS complement(313189..314091) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0269C" /product="Allophanate hydrolase 2 subunit 2 (EC" /EC_number="3.5.1.54" /note="Mb0269c, -, len: 300 aa. Equivalent to Rv0263c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Conserved hypothetical protein, equivalent to NP_302634.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (305 aa). Also similar to others e.g. AL121596|SC51A_21 hypothetical protein from Streptomyces coelicolor (285 aa), FASTA scores: opt: 714, E(): 0, (45.3% identity in 289 aa overlap); NP_233164.1|NC_002506 conserved hypothetical protein from Vibrio cholerae (309 aa); NP_406216.1|NC_003143 conserved hypothetical protein from Yersinia pestis (316 aa); YH30_HAEIN|P44298|hi1730 hypothetical protein from Haemophilus influenzae (309 aa), FASTA scores: opt: 430, E(): 3e-20, (29.6% identity in 284 aa overlap); etc. Also similar to carboxylases eg NP_415240.1|NC_000913|P75745|YBGK_ECOLI putative carboxylase from Escherichia coli strain K12 (310 aa), FASTA score: (34.6% identity in 286 aa overlap); NP_459698.1|NC_003197 putative carboxylase from Salmonella typhimurium (310 aa); and to middle part of NP_420636.1|NC_002696 urea amidolyase-related protein from Caulobacter crescentus (1207 aa). Protein product from Mb0269c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0269c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247975.1" /translation="MTTLEILRSGPLALVEDLGRAGLAHLGVGRSGAADRRSHTLANR LVANPDDWATVEVTFGGFSARVRGGDVDIAVTGADTDPTVNGIMVGTNSIHHVRDGQV ISLGTPRAGLRTYLAVRGGVCVEPVLGSRSYDVMSAIGPSPLRAGDVLPVGEHTDDYP ELDQAPVAAIEEHLVELRVVPGPRDDWLVDPDALVHTIWMASNRSDRVGMRLQGRPLQ HRWPDRQLPGEGVTRGAIQVPPNGLPVILGPDHPITGSYPVVGVITDEDIDKVAQIRP GQYVRLHWARPRSRLPGQGVTQAW" CDS complement(314108..314740) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0270C" /product="Allophanate hydrolase 2 subunit 1 (EC" /EC_number="3.5.1.54" /note="Mb0270c, -, len: 210 aa. Equivalent to Rv0264c, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 210 aa overlap). Conserved hypothetical protein, equivalent to CAC32080.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (222 aa). Also similar to others hypothetical proteins e.g. AL121596|SC51A_20 from Streptomyces coelicolor (252 aa), FASTA scores: opt: 420, E(): 2.7e-20, (41.7% identity in 204 aa overlap); P75744|YBGJ_ECOLI HYPOTHETICAL 23.9 KD PROTEIN from Escherichia coli (218 aa), FASTA scores: E(): 2.1e-14, (35.7% identity in 182 aa overlap); YH31_HAEIN|P44299|hi173 hypothetical protein from Haemophilus influenzae (213 aa), FASTA scores: opt: 252, E(): 8.3e-10, (31.1% identity in 183 aa overlap). Protein product from Mb0270c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0270c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247976.1" /translation="MDAALACTVLDYGDHALMLQCDSTADAMAWTDALRAAALPGVVD IVAASRTVLVKLDAPRYQGVTRQRLRRLRVTPEAVAAADHRCDLVIDVVYDGPDLAEV ARCTGLTTAAVINAHTATGWRAGFSGSAPGFAYLIDGDPSLRVPRRPERRTSMPPGSV ALADGFSAIYPSQAPSDWQIIGHTDAVLWDVDRPQPALLTPGMWVQFRAA" CDS complement(314836..315828) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0271C" /product="PROBABLE PERIPLASMIC IRON-TRANSPORT LIPOPROTEIN" /note="Mb0271c, -, len: 330 aa. Equivalent to Rv0265c, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Probable iron-transport lipoprotein, most similar to T36412|5763945|CAB53324.1|AL109974 probable iron-siderophore binding lipoprotein from Streptomyces coelicolor (350 aa); and (N-terminus may be incorrect) to T14166|3560508|AAC82551.1|AF027770 fxuD protein from Mycobacterium smegmatis (420 aa), FASTA scores: opt: 385, E(): 1.5e-16, (32.3% identity in 232 aa overlap). Also similar to AAB97475.1|U02617 DtxR/iron regulated lipoprotein precursor from Corynebacterium diphtheriae (355 aa); FECB_ECOLI|P15028 iron(III) dicitrate-binding periplasmic protein (300 aa), FASTA scores: opt: 191, E(): 2.3e-05, (26.5% identity in 196 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Note that previously known as fecB2. Protein product from Mb0271c detected using shotgun mass spectrometry. Mb0271c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247977.1" /translation="MRQGCSRRGFLQVAEAAAATGLFAGCSSPKPPPGTPGGAAVTIT HLFGQTVIKEPPKRVVSAGYTEQDDLLAVDVVPIAVTDWFGDQPFAVWPWAAPKLGGA RPAVLNLDNGIQIDRIAALKPDLIVAINAGVDADTYQQLSAIAPTVAQSGGDAFFEPW KDQARSIGQAVFAADRMRSLIEAVDQKFAAVAQRHPRWRGKKALLLQGRLWQGNVVAT LAGWRTDFLNDMGLVIADSIKPFAVDQRGVIPRDHIKAVLDAADVLIWMTESPEDEKA LLADPEIAASQATAQRRHIFTSKEQAGAIAFSSVLSYPVVAEQLPPQISQILGA" CDS complement(315850..319479) /codon_start=1 /transl_table=11 /gene="oplA" /locus_tag="BQ2027_MB0272C" /product="PROBABLE 5-OXOPROLINASE OPLA (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE)" /note="Mb0272c, oplA, len: 1209 aa. Equivalent to Rv0266c, len: 1209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1209 aa overlap). Probable oplA, 5-oxoprolinase (EC 3.5.2.9), highly similar to others or to hypothetical proteins e.g. AAK24340.1|AE005906 hydantoinase/oxoprolinase from Caulobacter crescentus (1196 aa); NP_103129.1|14022305|BAB48915.1|AP002997 5-oxoprolinase from Mesorhizobium loti (1210 aa); CAC48426.1|AL603642 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (1205 aa); S77037|slr0697|1006579|BAA10729.1|D6400 HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC 6803 (1252 aa), FASTA scores: opt: 2016, E(): 0, (51.4% identity in 1247 aa overlap); P97608|OPLA_RAT|T42756|11278797 5-OXOPROLINASE (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE) from Rattus norvegicus (1288 aa); etc. BELONGS TO THE OXOPROLINASE FAMILY. Protein product from Mb0272c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0272c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247978.1" /translation="MVGAGWHFWVDRGGTFTDVVARRPDGRLLTHKLLSDNPARYRDA AVAGIRALLANGEAGTRVDAVRMGTTVATNALLERTGERTLLVITRGFGDALRIAYQN RPRIFDRRIVLPEMLYERVVEVDERVTADGRVLRAPDLEALGEKMRQAHADGIRAVAV VCLHSYLYPGHEREIGTLAQRIGFAQISLSSEVSPLMKLVPRGDTTVVDAYLSPVLRR YINQVADQMRGVRLMFMQSNGGLAQAGHFRGKDAILSGPAGGIVGMVRMSALAGFDHV IGFDMGGTSTDVSHYAGEYERVFTTQVAGVRLRAPMLDIHTVAAGGGSILHFDGSRYR VGPDSAGADPGPACYRGGGPLCVTDANVMLGRIQPTHFPSVFGPSGDQPLDAGTVRRG FTDLAADIAARTGDDRSPEQVAEGYLRIAVANMANAVKKISVQKGHDVTRYALTTFGG AGGQHACAVADALGIRTVLIPPMAGVLSALGIGLADTTAMREQSVEIPLGPAAPQRLA SVAESLERAARAELLDEGVPGERIRVVRRVHLRYEGTDTAIPVQLAEIETMATAFESS HRALYTFLLDRPLIAEAISVEATGLTDQPDLSQLGDQANDTTGSSETVRIYSNGLWRD APLRRREAMRPGDVLTGPAIIAEANATTVVDDGWQATMTETGHLLAQRVVTPPRPDAA TRAGFEAGFEADPVLLEIFNNLFMSIAEQMGFRLEATAQSVNIRERLDFSCALFDPDG NLVANAPHIPVHLGSMGTTVKEVIRRRLSGMKPGDVYAVNDPYHGGTHLPDITVITPV FNTGGEDVLFFVASRGHHAEIGGITPGSMPADSREIHEEGVLFDNWLLAENGRFREAE TRRLLTEAPFGSRNPDTNLADLRAQIAANQKGVDEVGKMIDHFGRDVVAAYMRHVQDN AEEAVRRVIDRLDNGAYRYRMDSGATIAVRITVDRAARSATIDFTGTSAQLDTNFNAP TSVVNAAVLYVFRTLVADDIPLNDGCLRPLRIVVPEGSMLAPTHPAAVVAGNVETSQA ITGALFAALGVQAEGSGTMNNVTFGNERHQYYETVGSGSGAGDGYHGASVVQTHMTNS RLTDPEVLEWRYPVLLREFAVRQGSGGAGRWRGGDGAVRRLEFTEPMTVSTLSGHRRV RPYGMAGGSPGELGRNRVERADGSTVELAGCGSTHVEPGDTLVIETPGGGGYGPASTS ARRRR" CDS 319656..321047 /codon_start=1 /transl_table=11 /gene="narU" /locus_tag="BQ2027_MB0273" /product="probable integral membrane nitrite extrusion protein naru (nitrite facilitator)" /note="Mb0273, narU, len: 463 aa. Equivalent to Rv0267, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 463 aa overlap). Probable narU, nitrite extrusion protein, integral membrane protein possibly member of major facilitator superfamily (MFS), similar to other nitrite extrusion proteins e.g. NARU_ECOLI|P37758 nitrite extrusion protein 2 from Escherichia coli (462 aa), FASTA scores: opt: 630, E(): 4.4e-33, (38.9% identity in 463 aa overlap); and NARK_ECOLI|P10903|B1223 nitrite extrusion protein 1 from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 607, E(): 1.3e-31, (42.0% identity in 457 aa overlap). Also similar to Rv0261c, Rv2329c, Rv1737c, and to MLCB22_25 from Mycobacterium leprae (517 aa), FASTA score: (35.1 identity in 459 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS. Protein product from Mb0273 detected using SWATH mass spectrometry." /protein_id="CAB5247979.1" /translation="MALTTAPAIDYALPRQQDEGDHWIDDWRPEDPVFWETIGRPIAR RNLIFSIFAEHVGFSVWMLWSIVVVQMTAAAPGHPAASGWALSASQALCLVAVPSGVG AFLRLPYTFAIPIFGGRNWTTVSAALLVIPCLLLAWAVSHPSLPFAVLVVIAATAGFG GGNFASSMANISFFYPEKDKGWALGLNAAGGNIGVAVVQKIIPPIVVAGSGVALSRAG LFFVPLAVAAAVCAFLFMNNLTEAKADVKPVWQSLRHADTWIMSLLYIGTFGSFIGYS AAFPTLLKTVFGRGDIALGWAFLGAGIGSLVRPLGGKLADRIGGARITAASFVMLAAG AAAALWSVQSVNLPVFFVSFMFLFVATGIGNGSSYRMISRIFQVKGEVAGGDPETMVN MRRQAAGALGIISSIGAFGGFVVPLAYAWSKVHFGNIEPALHFYVAFFLALLVVTWYC YLRRTTPMGQVGV" CDS complement(321089..321598) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0274C" /product="Antitoxin of toxin-antitoxin stability system" /note="Mb0274c, -, len: 169 aa. Equivalent to Rv0268c, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 169 aa overlap). Hypothetical unknown protein. Mb0274c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247980.1" /translation="MGTRSKSRTRQLKQSNGCTATTSGASDRRRRARRRTAPAWLRED EWLRHHLPHPPRQLSRCLHRRRRSACHHRYSRRTPKGGLPMTSSLVPISEARAHLSRL VRESADDDVVLMNHGRPAAILISAERYESLMEELEDLRDRLSVHEREHVTMPLDKLGA ELGVDIGRV" CDS complement(321663..322856) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0275C" /product="ATP-dependent DNA ligase (EC" /EC_number="6.5.1.1" /note="Mb0275c, -, len: 397 aa. Equivalent to Rv0269c, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 397 aa overlap). Conserved hypothetical protein, highly similar to AL079355|SC4C6_19 hypothetical protein from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1019, E(): 0, (46.5% identity in 344 aa overlap), and similar to other proteins e.g. CAC49016.1|AL603644 putative ATP-dependent DNA ligase protein from Sinorhizobium meliloti (636 aa); O34398 YKOU PROTEIN from Bacillus subtilis (611 aa), FASTA score: (27.2% identity in 283 aa overlap). Also similar to proteins from Mycobacterium tuberculosis e.g. Rv3062, Rv3731 (both DNA ligases), and Rv0938, Rv3730c. Protein product from Mb0275c detected using SWATH mass spectrometry. Mb0275c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247981.1" /translation="MSRMAAPVSLDVHGRQVIVTHPGRVVFPAHNDRKGYTKFDLVRY YLAVAEGAMRGVAGRPMILKRFVKGISAEAVFQKRAPANRPDWVDVAELHYASGRSAA EAVIHDAAGLAWVINLGCVDLNPHPVLAGDLDHPDELRVDLDPMPGVAWQRVVEVALV VREVLEDYGLTAWPKTSGSRGFHVYARIAPCWSFPQVRLAAQTVAREVERRLPDAATS RWWKEEREGVFVDFNQNAKDRTVASAYSVRATPDARVSTPLHWEEVPGCDPAVFTMAT VPSRLADIGDPWAGMDDAVGRLDRLLMLAEELGPPQKAQSAKPLIEIARAKTRAEAMA ALDIWRDRYPGAAALLRPADVLVDGMRGPSSIWYRIRINLQHVPADQRPPQEELIADY SPWPR" CDS 322892..324574 /codon_start=1 /transl_table=11 /gene="fadD2" /locus_tag="BQ2027_MB0276" /product="PROBABLE FATTY-ACID-COA LIGASE FADD2 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0276, fadD2, len: 560 aa. Equivalent to Rv0270, len: 560 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 560 aa overlap). Probable fadD2, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. LCFA_ECOLI|P29212 long-chain-fatty-acid--CoA ligase from Escherichia coli (561 aa), FASTA scores: opt: 544, E(): 2.9e-26, (27.7% identity in 535 aa overlap). Also similar to others from Mycobacterium tuberculosis e.g. MTCY493_2, MTCY8D5_9, MTCY6G11_8, etc. Contains PS00455 Putative AMP-binding domain signature. Protein product from Mb0276 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0276 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247982.1" /translation="MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGL EPPLNYAALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVANG LLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQIKEVSDREGAK VIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDETLAELIAHSSTAPAPKAS RRASIIILTSGTTGTPKGANRNTPPTLAPIGGILSHVPFKAGEVTLLPSPMFHALGYM HAALAMFLGSTLVLRRRFKPALVLEDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDL SSLKIVFVSGSQLGAELATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGP VVKGVTVKILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEFGARLRAFV VKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTGKVLKRELRKL" CDS complement(324591..326786) /codon_start=1 /transl_table=11 /gene="fadE6" /locus_tag="BQ2027_MB0277C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE6" /note="Mb0277c, fadE6, len: 731 aa. Equivalent to Rv0271c, len: 731 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 731 aa overlap). Probable fadE6, acyl-CoA dehydrogenase (EC 1.3.99.-), with C-terminal half similar to many e.g. ACDS_HUMAN|P16219 acyl-CoA dehydrogenase (short-chain) from Homo sapiens (412 aa), FASTA scores: opt: 339, E(): 1.3e-13, (28.1% identity in 288 aa overlap). Protein product from Mb0277c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0277c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247983.1" /translation="MSIAITPEHYELADSVRSLVARVAPSEVLHAALESPVENPPPYW QAAAEQGLQGVHLAESVGGQGFGILELAVVLAEFGYGAVPGPFVPSAIASALIAAHDP QAKVLAELATGAAIAAYALDSGLTATRHGDVLVIRGEVRAVPAAAQASVLVLPVAIES RDEWVVLRNDQLEIEAVKSLDPLRPIAHVRANAVDVSDDALLSNLTMTTAHALMSTLL SAEAVGVARWATDTASAYAKIREQFGRPIGQFQAIKHKCAEMIADTERATAAVWDAAR ALDDAGESSSDVEFAAAVAATLAPATAQRCTQDCIQVHGGIGFTWEHDTNVYYRRALM LAACFGRGSEYPQRVVDTATTAGMRPVDIDLDPSTEKLRAQIRAEVAALKAMPREPRT VAIAEGGWVLPYLPKPWGRAASPVEQIIIAQEFTAGRVKRPQIAIATWIVPSIVAFGT DNQKQRLLPPTFRGDIFWCQLFSEPGAGSDLASLATKATRVDGGWRITGQKIWTTGAQ YSQWGALLARTDPSAPKHNGITYFLLDMKSEGVQVKPLRELTGKEFFNTVYLDDVFVP DELVLGEVNRGWEVSRNTLTAERVSIGGSDSTFLPTLGEFVDFVRDYRFEGQFDQVAR HRAGQLIAEGHATKLLNLRSTLLTLAGGDPMAPAAISKLLSMRTGQGYAEFAVSSFGT DAVIGDTERLPGKWGEYLLASRATTIYGGTSEVQLNIIAERLLGLPRDP" CDS complement(326900..328033) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0278C" /product="unknown protein" /note="Mb0278c, -, len: 377 aa. Equivalent to Rv0272c, len: 377 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 377 aa overlap). Hypothetical unknown protein. Protein product from Mb0278c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0278c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247984.1" /translation="MTGRAATPGVIREFVGLPSRTAGRAAAGGHPCQGLYHHSVGRKP KVALIAAHYQIDFSEHYLAEYMAIRGIGFLGWNTRFRGFESSFLLDHALVDIGVGVRW LREVQGVETVVLLGNSGGGSLMAAYQSQAVDPNVTPLDGMRPAAGVTELPAADAYVAA AAHLGRPDVLTAWMDAAVIDENDPVATDPELDLFDERNGPPYSPEFISRYRSAQVKRN HTITDWAESELKRVRAAGFSDRPFSVMRTWADPRMVDPSIEPTKRRPNQCYAGTPVKA NRSAHGIAAACTLRGWLGMWSLRVAQTRAAPHLARITCPALVLNAEADTGIFPSDAQQ IYDGLASSDKTQVSIDTDHYFTTPGARSEQADTIAKWIAKRWR" CDS complement(328030..328650) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0279C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0279c, -, len: 206 aa. Equivalent to Rv0273c, len: 206 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 206 aa overlap). Possible transcriptional regulator, showing some similarity to hypothetical regulators from Mycobacterium tuberculosis e.g. P96222|Rv3855|MTCY01A6.13c (216 aa); O08377|Rv1534|MTCY07A7A.03 (225 aa), FASTA scores: opt: 123, E(): 3.2e-06, (28.5% identity in 172 aa overlap). Mb0279c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247985.1" /translation="MPDFPTQRGRRTQAAIDAAARTVVVRNGILATTVADITAEAGRS AASFYNYYDSKEAMVRQWALRFRDDANQRALSVIRHGLSDRERAYEAAAAHWYTYRNR LAEAISVSQLAMVSDDFAQYWSEICQIPISFITETVKRAQAHGYCVGDDPQLMAEAIV AMFNQFCYLQLSGKRSRRGQPDDQACIQTLANIYYRAIYSKEDSSN" CDS 328747..329328 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0280" /product="Glyoxalase family protein" /note="Mb0280, -, len: 193 aa. Equivalent to Rv0274, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 193 aa overlap). Conserved hypothetical protein, highly similar to AAK25058.1|AE005973 conserved hypothetical protein from Caulobacter crescentus (174 aa). Shows also some similarity to others hypothetical proteins e.g. AJ002571|BSAJ2571_7 from Bacillus subtilis (316 aa), FASTA scores: opt: 138, E(): 0.033, (27.1% identity in 133 aa overlap). Previous hits with Q56415|M85195 FOSFOMYCIN-RESISTANCE PROTEIN from SERRATIA MARCESCENS (141 aa), FASTA scores: opt: 82, E(): 1.1e -08, (29.1% identity in 151 aa overlap). Contains PS00082 Extradiol ring-cleavage dioxygenases signature near C-terminus. TBparse score is 0.914. Protein product from Mb0280 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0280 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247986.1" /translation="MIKPHNTNTEFELGGINHVALVCSDMARTVDFYSNILGMPLIKA LDLPGGQGQHFFFDAGNGDCVAFFWFADAPDRVPGLSSPVAIPGIGDITSAVSTMNHL AFHVPAERFDAYRQRLKDKGVRVGPVLNHDDSETQVSAVVHPGVYVRSFYFQDPDGIT LEFACWTKEFTTSDAQAVPKTAADRRPPVAADR" CDS complement(329258..329983) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0281C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0281c, -, len: 241 aa. Equivalent to Rv0275c, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 241 aa overlap). Putative transcriptional regulator, showing some similarity with Rv0825c from Mycobacterium tuberculosis (213 aa), FASTA scores: opt: 230, E(): 2.7e-07, (32.6% identity in 190 aa overlap). Belongs to Mycobacterium tuberculosis regulatory protein family with many TetR orthologues. Protein product from Mb0281c detected using SWATH mass spectrometry. Mb0281c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247987.1" /translation="MTRSDRPYRGVEAAERLATRRRQLLSAGLDLLGSDQHDIAELTI RTICRRAGLSVRYFYESFTDKDEFVGRVFDWVVAELVATTQAAVTAVPAREQTRAGMA NIVRTITADARVGRLLFSTQLANAVITRKRAESSALFAMLSGQHAVDTLHAPANDHVK AVAHFAVGGVGQTISAWLAGDVRLDPDQLVDQLAALLDELTDPNLSRPRVAATAAKSG ANDPQPPEVAGQPPSSARPARRS" CDS 330073..330993 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0282" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0282, -, len: 306 aa. Equivalent to Rv0276, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 306 aa overlap). Conserved hypothetical protein, similar to Rv2237|Z70692|MTCY427.18 from Mycobacterium tuberculosis (296 aa), FASTA scores: opt: 874, E(): 0, (49.6% identity in 282 aa overlap). Protein product from Mb0282 detected using SWATH mass spectrometry. Mb0282 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247988.1" /translation="MAISLVAHQPIPHVERPMADPPRLQLARRRRSAAGPGGNEDSLM GVALLAGPANVIMELAMPGVGYGVLESRVESGRLDRHPIKRARTTFTYVAVAVAGSDD QKAAFRRAVNKVHAQVYSTPESPVSYHAFDPELQLWVAACLYKGGVDVYRTFVGEMDD EEADHHYRAGMAMGTTLQVPPQMWPPDRAAFDRYWRQSLDRVHIDDVVRDYLYPIVAL RIRGIALPGPLRRLSEGIALLITTGFLPQRFRDEMRLPWDATKQRRFDALMAVLRTVN RLMPRFVREFPFNLMLWDLDRRMRRGRPLV" CDS complement(331033..331461) /codon_start=1 /transl_table=11 /gene="vapc25" /locus_tag="BQ2027_MB0283C" /product="possible toxin vapc25. contains pin domain." /note="Mb0283c, -, len: 142 aa. Equivalent to Rv0277c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Conserved hypothetical protein, highly similar to Rv0749|H70824|2911023|CAA17516.1|AL021958 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (142 aa); and similar to several other hypothetical Mycobacterium tuberculosis proteins: Rv0277c, Rv2530c, etc. Mb0283c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247989.1" /translation="MFLIDVNVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVW ASFLRLTTNRRIFEIPSPRADAFAFVEAVNAQPHHLPTSPGPRHLVLLRKLCDEADAS GDLIPDAVLGAIAVEHHCAVVSLDRDFARFASVRHIRPPI" CDS complement(331762..332004) /pseudo /codon_start=1 /transl_table=11 /gene="PE_PGRS3b" /locus_tag="BQ2027_MB0284C" /note="Mb0284c, PE_PGRS3b, len: 80 aa. Equivalent to 3' end of Rv0278c, len: 957 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 79 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv1759c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 3849, E(): 0,(67.8% identity in 903 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS3 exists as a single gene. In Mycobacterium bovis, a 2780 bp insertion leads to an extra copy of PE_PGRS3. Also, a frameshift due to single base deletion (t-*), splits this extra copy of PE_PGRS3 into 2 parts, PE_PGRS3a and PE_PGRS3b.;PE-PGRS FAMILY PROTEIN [SECOND PART]" CDS complement(331827..334433) /codon_start=1 /transl_table=11 /gene="PE_PGRS3a" /locus_tag="BQ2027_MB0285C" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb0285c, PE_PGRS3a, len: 868 aa. Similar to 5' end of Rv0278c, len: 957 aa, from Mycobacterium tuberculosis strain H37Rv, (80.7% identity in 888 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv1759c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 3849, E(): 0, (67.8% identity in 903 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS3 exists as a single gene. In Mycobacterium bovis, a 2780 bp insertion leads to an extra copy of PE_PGRS3. Also, a frameshift due to single base deletion (t-*), splits this extra copy of PE_PGRS3 into 2 parts, PE_PGRS3a and PE_PGRS3b. Mb0285c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247991.1" /translation="MSFVIAAPEVIAAAATDLASLESSIAAANAAAAANTTALLAAGA DEVSTAVAALFGAHGQAYQALSAQAQAFHAQFTQALTSGGGAYAAAEAAATSPLLAPI NEFFLANTGRPLIGNGANGAPGTGADGAPGGWLIGNGGAGGSGAANNAVGGTGGTGGA GGASGLLGSGGAGGAGGVATNTGGIGGAGGTGGNAVLFGAGGAGGASTNTTGGAGGAG GDGGNAGLLFGAAGVGGAGGFALATTASGGAGGAGGAGGMFTDGGVGGVGGKGGFGGA GGAGGNGGLFGAGGTGGAGGTIGAGVAGMGGAGGAGGAGGLFGAGGTGGSGGGGATTG GDGGAGGAGGFGRTTGGIGGTGGNAGLLNGSGGAGGAGGAAITGPGGTGGAGGIPGLI GNGGNGGDGGASVTGTGGNGGAGGNGVQIGNGGNGGSGGTGAAAGKAGLGGLGGQLIG LDGSNAPVSTSVHTLQQAALNVVNEPFQTLTGRPLIGNGANGTPGTGAAGGAGGWLFG NGGNGGHGATNTAATATGGAGGAGGILFGTGGNGGTGGIATGAGGIGGAGGAGGVSLL IGSGGTGGNGGNSIGVAGIGGAGGRGGDAGLLFGAAGTGGHGAAGGVPAGVGGAGGNG GLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGGLFGAGGTGG AAGSGGSGITTGGGGHGGNAGLLSLGASGGAGGSGGASSLAGGAGGTGGNGALLFGFG GAGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSGANALGAGTGGTGGD GGHAGVFGNGGDGGAGGFGAGTGGSGGVGGNAVLIGNGGNGGNAGKAGATPGAGGTGG LLLGENGLNGLP" CDS complement(334697..337330) /codon_start=1 /transl_table=11 /gene="PE_PGRS3" /locus_tag="BQ2027_MB0286C" /product="PE-PGRS FAMILY PROTEIN" /note="Mb0286c, PE_PGRS3, len: 877 aa. Equivalent to Rv0278c, len: 957 aa, from Mycobacterium tuberculosis strain H37Rv, (). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv1759c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 3849, E(): 0, (67.8% identity in 903 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, 2 deletions, one of 18 bp and the other of 66 bp, and part of a 2780 bp insertion, leads to a shorter product with a different 3' end compared to its homolog in Mycobacterium tuberculosis strain H37Rv (877 aa versus 957 aa). Mb0286c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247992.1" /translation="MSFVIAAPEVIAAAATDLASLGSSISAANAAAAANTTALMAAGA DEVSTAIAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAAVSPLLDPI NEFFLANTGRPLIGNGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAG GLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGGAGGAGGGVVALTGGAGGFTNGS ALGGAGGAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSF GGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAAGGAGGIGGDGGT LTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQGGPGGNAGTVFGSGGAGGNG GVGQGFAGGIGGAGGTPGLIGNGGNGGNGGASAVTGGNGGIGGTGVLIGNGGNGGSGG IGAGKAGVGGVSGLLLGLDGFNAPASTSPLHTLQQNVLNVVNEPFQTLTGRPLIGNGA NGTPGTGADGGAGGWLFGNGGNGGSGATGTNGGDGGDGGAGGIFFGTGGTGGAGGVGT TGTGGDGGAGGAAFLVGSGGNGGSGGAGLTAGGDGGDGGNAGSFFGAAGTGGAGASTK AGGTGGTGGNGGLFANGGAGGSGGLGGDAGTGGAGGNGGLFGAGGTGGAGGSLGPGAG GAGGNGGLFGAGGTGGSGGHGTPAAVPGGAGGAGGNAGLFSLGASGGAGGSGGSSLTD SGGIGGVGGAGGLLFGYGGAGGAGGYSNIGAGGAGGAGGNAGMLSGSGGSGGTGGASG AAKGGVGGNGGTAGVFGNGGDGGAGGFGAGTGGNGGVGGNAVLIGNGGNGGNGGKAGG TPGAGGTSGLLIGENGLNGLP" CDS complement(337580..340075) /codon_start=1 /transl_table=11 /gene="PE_PGRS4" /locus_tag="BQ2027_MB0287C" /product="pe-pgrs family protein pe_pgrs4" /note="Mb0287c, PE_PGRS4, len: 831 aa. Equivalent to Rv0279c, len: 837 aa, from Mycobacterium tuberculosis strain H37Rv, (97.4% identity in 837 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv0278c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 2677, E(): 0, (64.5% identity in 926 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, 2 deletions each of 9 bp (ccgccggcg-* and cccgccggc-*), leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (831 aa versus 837 aa). Protein product from Mb0287c detected using shotgun mass spectrometry. Mb0287c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5247993.1" /translation="MSFVIAAPEVIAAAATDLASLESSIAAANAAAAANTTALLAAGA DEVSTAVAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAATSPLLAPI NEFFLANTGRPLIGNGANGAPGTGADGAPGGWLIGNGGAGGSGAAGVNGGAGGNGGAG GLIGNGGAGGAGGRASTGTGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGAGGAGGNG GLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSGG LFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSSPDGGGGAGGIGG DGGTLFGSGGAGGVGGLGFDAGGAGGAGGKAGLLSGAGGAGGAGGGSFAGAGGTGGAG GAPGLVGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGGNGGSGGTGAPAGTAGAGGL GGQLLGRDGFNAPASTPLHTLQQQILNAINEPTQALTGRPLIGNGANGTPGTGADGGA GGWLFGNGGNGGHGATGADGGDGGSGGAGGILSGIGGTGGSGGIGTTGQGGTGGTGGA ALLIGSGGTGGSGGFGLDTGGAGGRGGDAGLFLGAAGTGGQAALSQNFIGAGGTAGAG GTGGLFANGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAGGHGGLFGAGGTGG AGGSSGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFG FGGAGGTGGSSGIGSSGGTGGDGGTAGVFGNGGDGGAGGFGTGAGGTGGTGGNAVLIG NGGNGGNGGKAGGTPGAGGTSGLIIGENGLNGL" CDS 340366..341976 /codon_start=1 /transl_table=11 /gene="PPE3" /locus_tag="BQ2027_MB0288" /product="ppe family protein ppe3" /note="Mb0288, PPE3, len: 536 aa. Equivalent to Rv0280, len: 536 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 536 aa overlap). Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. Z80108|MTCY21B4_4|Rv0453 from Mycobacterium tuberculosis (539 aa), FASTA scores: opt: 1131, E(): 0, (51.7% identity in 540 aa overlap). Protein product from Mb0288 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0288 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247994.1" /translation="MTLWMASPPEVHSALLSSGPGPGSVLSAAGVWSSLSAEYAAVAD ELIGLLGAVQTGAWQGPSAAAYVAAHAPYLAWLMRASETSAEAAARHETVAAAYTTAV AAMPTLVELAANHTLHGVLVATNFFGINTIPIALNEADYARMWTQAASTMATYQAVAE AAVASAPQTTPAPPILAAEAADDDHDHDHDHGGEPTPLDYLVAEILRIISGGRLIWDP AEGTMNGIPFEDYTDAAQPIWWVVRAIEFSKDFETFVQELFVNPVEAFQFYFELLLFD YPTHIVQIVEALSQSPQLLAVALGSVISNLGAVTGFAGLSGLAGMQPAAIPALAPVAA APPTLPAVAMAPTMAAPGAAVASAAAPASAPAASTVASATPAPPPAPGAAGFGYPYAI APPGIGFGSGMSASASAQRKAPQPDSAAAAAAAAAVRDQARARRRRRVTRRGYGDEFM DMNIDVDPDWGPPPGEDPVTSTVASDRGAGHLGFAGTARREAVADAAGMTTLAGDDFG DGPTTPMVPGSWDPDRDAPGSAEPGDRG" CDS 342000..342908 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0289" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0289, -, len: 302 aa. Equivalent to Rv0281, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Conserved hypothetical protein; member of Mycobacterium tuberculosis protein family that includes Rv0726c, Rv0731c, Rv3399, Rv1729c, etc. MTCY31_23 (325 aa), FASTA scores: opt: 1386, E(): 0, (69. 1% identity in 301 aa overlap). Contains possible N-terminal signal sequence. Protein product from Mb0289 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0289 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5247995.1" /translation="MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAF CRAVGGSWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQVVI LAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRREIAVDLRDDW PQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDALAGRRSHVAVEDGAPMGP DEYAAKVEEERAAIAEGAEEHPFFQLVYNERCAPAAEWFGERGWTAVATLLNDYLEAV GRPVPGPESEAGPMFARNTLVSAARV" CDS 343132..345027 /codon_start=1 /transl_table=11 /gene="ecca3" /locus_tag="BQ2027_MB0290" /product="esx conserved component ecca3. esx-3 type vii secretion system protein." /note="Mb0290, -, len: 631 aa. Equivalent to Rv0282, len: 631 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 631 aa overlap). Conserved hypothetical protein, similar to Y14967|MLCB628.18c hypothetical protein from Mycobacterium leprae (573 aa), FASTA scores: opt: 916, E(): 0, (38.7% identity in 568 aa overlap). Also similar to Mycobacterium tuberculosis proteins e.g. Z94121|MTY15F10.26 (619 aa), FASTA scores: opt: 743, E(): 0, (29.9% identity in 612 aa overlap). Member of CFXQ, CBXP family - 9 members in Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0290 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0290 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247996.1" /translation="MAGVGAGDSGGVERDDIGMVAASPVASRVNGKVDADVVGRFATC CRALGIAVYQRKRPPDLAAARSGFAALTRVAHDQCDAWTGLAAAGDQSIGVLEAASRT ATTAGVLQRQVELADNALGFLYDTGLYLRFRATGPDDFHLAYAAALASTGGPEEFAKA NHVVSGITERRAGWRAARWLAVVINYRAERWSDVVKLLTPMVNDPDLDEAFSHAAKIT LGTALARLGMFAPALSYLEEPDGPVAVAAVDGALAKALVLRAHVDEESASEVLQDLYA AHPENEQVEQALSDTSFGIVTTTAGRIEARTDPWDPATEPGAEDFVDPAAHERKAALL HEAELQLAEFIGLDEVKRQVSRLKSSVAMELVRKQRGLTVAQRTHHLVFAGPPGTGKT TIARVVAKIYCGLGLLKRENIREVHRADLIGQHIGETEAKTNAIIDSALDGVLFLDEA YALVATGAKNDFGLVAIDTLLARMENDRDRLVVIIAGYRADLDKFLDTNEGLRSRFTR NIDFPSYTSHELVEIAHKMAEQRDSVFEQSALHDLEALFAKLAAESTPDTNGISRRSL DIAGNGRFVRNIVERSEEEREFRLDHSEHAGSGEFSDEELMTITADDVGRSVEPLLRG LGLSVRA" CDS 345024..346640 /codon_start=1 /transl_table=11 /gene="eccb3" /locus_tag="BQ2027_MB0291" /product="esx conserved component eccb3. esx-3 type vii secretion system protein. possible membrane protein." /note="Mb0291, -, len: 538 aa. Equivalent to Rv0283, len: 538 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 538 aa overlap). Possible conserved membrane protein, similar to several hypothetical mycobacterial proteins e.g. Z94121|MTY15F10_16|Rv3895c from Mycobacterium tuberculosis (495 aa), FASTA scores: opt: 698, E(): 0, (37.6% identity in 492 aa overlap); Rv1782; Rv3450c; Rv3869; and Y14967|MLCB628_16|MLCB628.17c from Mycobacterium leprae (481 aa), FASTA scores: opt: 672, E(): 1.5e-31, (37.2% identity in 506 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0291 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0291 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247997.1" /translation="MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTG WRFVMRRIAAGIALHDTRMLVDPLRTQSRAVLMGVLIVITGLIGSFVFSLIRPNGQAG SNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTELDQFPRGNLIG IPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVIAGPLEDTGARAAALGPGQ AVLVDSGAGTWLLWDGKRSPIDLADHAVTSGLGLGADVPAPRIIASGLFNAIPEAPPL TAPIIPDAGNPASFGVPAPIGAVVSSYALKDSGKTISDTVQYYAVLPDGLQQISPVLA AILRNNNSYGLQQPPRLGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSK PVGAATSSLTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPD APGAGSLFWVSDTGVRYGIDNEPQGVAGGGKAVEALGLNPPPVPIPWSVLSLFVPGPT LSRADALLAHDTLVPDSRPARPVSAEGGYR" CDS 346637..350629 /codon_start=1 /transl_table=11 /gene="eccc3" /locus_tag="BQ2027_MB0292" /product="esx conserved component eccc3. esx-3 type vii secretion system protein. possible membrane protein." /note="Mb0292, -, len: 1330 aa. Equivalent to Rv0284, len: 1330 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1330 aa overlap). Possible conserved membrane protein, similar to products of two adjacent Mycobacterium leprae genes, MLCB628.16c (744 aa) and MLCB628.15c (597 aa); and throughout its length to several large Mycobacterium tuberculosis proteins: Rv3447c, Rv3870, Rv1784, etc. Y14967|MLCB628_ 15 (744 aa), FASTA scores: opt: 942, E(): 0, (33.8% identity in 730 aa overlap); Y14967|MLCB628_14 (597 aa), FASTA scores: opt: 613, E(): 3.1e-30, (31.7% identity in 615 aa overlap); Z94121|MTY15F10_17 (1396 aa), FASTA scores: opt: 652, E(): 2.2e-32, (35.4% identity in 1321 aa overlap); Z95389|MTCY77_19 (1236 aa), FASTA scores: opt 652, E(): 2.2e-32, (35.4% identity in 1321 aa overlap). Contains three PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0292 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0292 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247998.1" /translation="MSRLIFEARRRLAPPSSHQGTIIIEAPPELPRVIPPSLLRRALP YLIGILIVGMIVALVATGMRVISPQTLFFPFVLLLAATALYRGNDKKMRTEEVDAERA DYLRYLSVVRDNIRAQAAEQRASALWSHPDPTALASVPGSRRQWERDPHDPDFLVLRA GRHTVPLATTLRVNDTADEIDLEPVSHSALRSLLDTQRSIGDVPTGIDLTKVSRITVL GERAQVRAVLRAWIAQAVTWHDPTVLGVALAARDLEGRDWNWLKWLPHVDIPGRLDAL GPARNLSTDPDELIALLGPVLADRPAFTGQPTDALRHLLIVVDDPDYDLGASPLAVGR AGVTVVHCSASAPHREQYSDPEKPILRVAHGAIERWQTGGWQPYIDAADQFSADEAAH LARRLSRWDSNPTHAGLRSAATRGASFTTLLGIEDASRLDVPALWAPRRRDEELRVPI GVTGTGEPLMFDLKDEAEGGMGPHGLMIGMTGSGKSQTLMSILLSLLTTHSAERLIVI YADFKGEAGADSFRDFPQVVAVISNMAEKKSLADRFADTLRGEVARREMLLREAGRKV QGSAFNSVLEYENAIAAGHSLPPIPTLFVVADEFTLMLADHPEYAELFDYVARKGRSF RIHILFASQTLDVGKIKDIDKNTAYRIGLKVASPSVSRQIIGVEDAYHIESGKEHKGV GFLVPAPGATPIRFRSTYVDGIYEPPQTAKAVVVQSVPEPKLFTAAAVEPDPGTVIAD TDEQEPADPPRKLIATIGEQLARYGPRAPQLWLPPLDETIPLSAALARAGVGPRQWRW PLGEIDRPFEMRRDPLVFDARSSAGNMVIHGGPKSGKSTALQTFILSAASLHSPHEVS FYCLDYGGGQLRALQDLAHVGSVASALEPERIRRTFGELEQLLLSRQQREVFRDRGAN GSTPDDGFGEVFLVIDNLYGFGRDNTDQFNTRNPLLARVTELVNVGLAYGIHVIITTP SWLEVPLAMRDGLGLRLELRLHDARDSNVRVVGALRRPADAVPHDQPGRGLTMAAEHF LFAAPELDAQTNPVAAINARYPGMAAPPVRLLPTNLAPHAVGELYRGPDQLVIGQREE DLAPVILDLAANPLLMVFGDARSGKTTLLRHIIRTVREHSTADRVAFTVLDRRLHLVD EPLFPDNEYTANIDRIIPAMLGLANLIEARRPPAGMSAAELSRWTFAGHTHYLIIDDV DQVPDSPAMTGPYIGQRPWTPLIGLLAQAGDLGLRVIVTGRATGSAHLLMTSPLLRRF NDLQATTLMLAGNPADSGKIRGERFARLPAGRAILLTDSDSPTYVQLINPLVDAAAVS GETQQKGSQS" CDS 350626..350934 /codon_start=1 /transl_table=11 /gene="PE5" /locus_tag="BQ2027_MB0293" /product="pe family protein pe5" /note="Mb0293, PE5, len: 102 aa. Equivalent to Rv0285, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). Member of the Mycobacterium tuberculosis PE family (see first citation below), similar to others e.g. AL0212|MTV012_37 from Mycobacterium tuberculosis (105 aa), FASTA scores: opt: 497, E(): 2.6e-24, (80.4% identity in 102 aa overlap); Z80108|MTCY21B4.03 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 413, E(): 3.7e-19, (66.7% identity in 102 aa overlap); etc. Mb0293 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5247999.1" /translation="MTLRVVPEGLAAASAAVEALTARLAAAHASAAPVITAVVPPAAD PVSLQTAAGFSAQGVEHAVVTAEGVEELGRAGVGVGESGASYLAGDAAAAATYGVVGG " CDS 350937..352478 /codon_start=1 /transl_table=11 /gene="PPE4" /locus_tag="BQ2027_MB0294" /product="ppe family protein ppe4" /note="Mb0294, PPE4, len: 513 aa. Equivalent to Rv0286, len: 513 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 513 aa overlap). Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. AL0212|MTV012_32 from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 958, E(): 0, (43.5% identity in 522 aa overlap). Protein product from Mb0294 detected using SWATH mass spectrometry. Mb0294 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248000.1" /translation="MAAPIWMASPPEVHSALLSNGPGPGSLVAAATAWSQLSAEYAST AAELSGLLGAVPGWAWQGPSAEWYVAAHLPYVAWLTQASADAAGAAAQHEAAAAAYTT ALAAMPTLAELAANHVIHTVLVATNFFGINTIPITLNEADYVRMWLQAAAVMGLYQAA SGAALASAPRTVPAPTVMNPGGGAASTVGAVNPWQWLLALLQQLWNAYTGFYGWMLQL IWQFLQDPIGNSIKIIIAFLTNPIQALITYGPLLFALGYQIFFNLVGWPTWGMILSSP FLLPAGLGLGLAAIAFLPIVLAPAVIPPASTPLAAAAVAAGSVWPAVSMAVTGAGTAG AATPAAGAAPSAGAAPAPAAPATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVP AAAAAAATRGQSRARRRRRSELRDYGDEFLDMDSDSGFGPSTGDHGAQASERGAGTLG FAGTATKERRVRAVGLTALAGDEFGNGPRMPMVPGTWEQGSNEPEAPDGSGRGGGDGL PHDSK" CDS 352527..352820 /codon_start=1 /transl_table=11 /gene="esxG" /locus_tag="BQ2027_MB0295" /standard_name="TB9.8" /product="ESAT-6-like protein EsxG" /note="Mb0295, esxG, len: 97 aa. Equivalent to Rv0287, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). esxG, conserved hypothetical protein. PE-family related protein; distant member of the Mycobacterium tuberculosis PE family, similar to Rv3020c|AL0212|MTV012.34 (97 aa), FASTA scores: opt: 564, E(): 0, (91.8% identity in 97 aa overlap). Contains probable helix-turn-helix motif at aa 14-35 (Score 144, +4.11 SD). SEEMS TO BELONG TO THE ESAT6 FAMILY (see third citation below). Protein product from Mb0295 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0295 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248001.1" /translation="MSLLDAHIPQLVASQSAFAAKAGLMRHTIGQAEQAAMSAQAFHQ GESSAAFQAAHARFVAAAAKVNTLLDVAQANLGEAAGTYVAADAAAASTYTGF" CDS 352850..353140 /codon_start=1 /transl_table=11 /gene="esxH" /locus_tag="BQ2027_MB0296" /standard_name="cfp7; TB10.4" /product="low molecular weight protein antigen 7 esxh (10 kda antigen) (cfp-7) (protein tb10.4)" /note="Mb0296, esxH, len: 96 aa. Equivalent to Rv0288, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). esxH (alternate gene name: TB10.4), low molecular weight protein antigen 7 (10 kDa antigen) (CFP-7) (Protein TB10.4) (see citations below), ala-rich protein; member of mycobacterial protein family containing ESAT-6, very similar to MTV012_33 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 566, E(): 0, (84.4% identity in 96 aa overlap). Alternative start codon possible position 351878 (see second citation). BELONG TO THE ESAT6 FAMILY (see first, sixth and seventh citations below). Protein product from Mb0296 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0296 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248002.1" /translation="MSQIMYNYPAMLGHAGDMAGYAGTLQSLGAEIAVEQAALQSAWQ GDTGITYQAWQAQWNQAMEDLVRAYHAMSSTHEANTMAMMARDTAEAAKWGG" CDS 353151..354038 /codon_start=1 /transl_table=11 /gene="espg3" /locus_tag="BQ2027_MB0297" /product="esx-3 secretion-associated protein espg3" /note="Mb0297, -, len: 295 aa. Equivalent to Rv0289, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Conserved hypothetical protein, equivalent to CAC32061.1|AL583926 possible DNA-binding protein from Mycobacterium leprae (289 aa); and showing some similarity to Rv3866|G70656|CAB06238.1|Z94121|MTCY15F10.23 from Mycobacterium tuberculosis (276 aa), FASTA scores: opt: 149, E(): 0.0035, (27.7% identity in 289 aa overlap). Protein product from Mb0297 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0297 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248003.1" /translation="MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRG AFVDRQRDELTRMGLLSPQGVINPAVADWIKVVCFPDRWLDLRYVGPASADGACELLR GIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGVGLAHRPPARF DEFSLPTRVGARADERLRSGVPLGEVVDYLGIPASARPVVESVFSGPRSYVEIVAGCN RDGRHTTTEVGLSIVDTSAGRVLVSPSRAFDGEWVSTFSPGTPFAIAVAIQTLTACLP DGQWFPGQRVSRDFSTQSS" CDS 354085..355503 /codon_start=1 /transl_table=11 /gene="eccd3" /locus_tag="BQ2027_MB0298" /product="esx conserved component eccd3. esx-3 type vii secretion system protein. probable transmembrane protein." /note="Mb0298, -, len: 472 aa. Equivalent to Rv0290, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 472 aa overlap). Probable conserved transmembrane protein, similar to several others in mycobacteria e.g. Z95389|MTCY77_20|Rv3887c from Mycobacterium tuberculosis (467 aa), FASTA scores: opt: 429, E(): 5.1e-19, (28. 6% identity in 479 aa overlap); Rv3877; Rv1795; Rv3448; and Y14967|MLCB628_9|MLCB628.10c from Mycobacterium leprae (480 aa), FASTA scores: opt: 269, E(): 3.1e-09, (26.0% identity in 503 aa overlap). TBparse score is 0.892. Protein product from Mb0298 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0298 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248004.1" /translation="MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPS AQNGDGGQADSGAAVQLSLAPVGGQPFSLDASLDTVGVVDGDLLVLQPVPAGPAAPGI VEDIADAAMIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRVATGVLAGLLA VAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALALAVPGKFGPAQVLLGAAGV AAWSLIALMIPSAERERVVAFFTAAAVVGASVALAAGAQLLWQLPLLSIGCGLIVAAL LVTIQAAQLSALWARFPLPVIPAPGDPTPSAPPLRLLEDLPRRVRVSDAHQSGFIAAA VLLSVLGSVAIAVRPEALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAG VLLVFYTTTGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAG LDVSLIPVMAYLVGLFAWVLNR" CDS 355500..356885 /codon_start=1 /transl_table=11 /gene="mycp3" /locus_tag="BQ2027_MB0299" /product="probable membrane-anchored mycosin mycp3 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-3)" /note="Mb0299, -, len: 461 aa. Equivalent to Rv0291, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 461 aa overlap). Probable protease precursor (EC 3.4.-.-), similar to several others in mycobacteria e.g. Z94121|MTY15F10_28|Rv1796 from Mycobacterium tuberculosis (446 aa), FASTA scores: opt: 1168, E(): 0, (44.6% identity in 453 aa overlap); Rv3886c; Rv3883c; Rv3449; and Y14967|MLCB628_4|MLCB628.04 from Mycobacterium leprae (446 aa), FASTA scores: opt: 1159, E(): 0, (43.5 identity in 446 aa overlap). Has signal sequence and hydrophobic stretch at C-terminus, followed by short positively charged segment, that could act as membrane anchor. Shows similarity to several members of the subtilase family and contains PS00137 Serine proteases, subtilase family, histidine active site signature. Protein product from Mb0299 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0299 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248005.1" /translation="MIRAAFACLAATVVVAGWWTPPAWAIGPPVVDAAAQPPSGDPGP VAPMEQRGACSVSGVIPGTDPGVPTPSQTMLNLPAAWQFSRGEGQLVAIIDTGVQPGP RLPNVDAGGDFVESTDGLTDCDGHGTLVAGIVAGQPGNDGFSGVAPAARLLSIRAMST KFSPRTSGGDPQLAQATLDVAVLAGAIVHAADLGAKVINVSTITCLPADRMVDQAALG AAIRYAAVDKDAVIVAAAGNTGASGSVSASCDSNPLTDLSRPDDPRNWAGVTSVSIPS WWQPYVLSVASLTSAGQPSKFSMPGPWVGIAAPGENIASVSNSGDGALANGLPDAHQK LVALSGTSYAAGYVSGVAALVRSRYPGLNATEVVRRLTATAHRGARESSNIVGAGNLD AVAALTWQLPAEPGGGAAPAKPVADPPVPAPKDTTPRNVAFAGAAALSVLVGLTAATV AIARRRREPTE" CDS 356882..357877 /codon_start=1 /transl_table=11 /gene="ecce3" /locus_tag="BQ2027_MB0300" /product="esx conserved component ecce3. esx-3 type vii secretion system protein. probable transmembrane protein." /note="Mb0300, -, len: 331 aa. Equivalent to Rv0292, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 331 aa overlap). Probable conserved transmembrane protein (has two hydrophobic segments at N-terminal end), equivalent to CAC32058.1|AL583926 conserved membrane protein from Mycobacterium leprae (339 aa). Protein product from Mb0300 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0300 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248006.1" /translation="MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAV VIGLFGFWRGLYFTTIARRGLAILRRRRRIAEPATCTRTTVLVWVGPPASDTNVLPLT LIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQARSARIPLQETA QVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRGMRHTDSDYVAAYRVSADA ELPDTLPAIRSRPAQETWIALEIAYAAGSSTRYTVAAACALRTDWRPGGTAPVAGLLP QHGNHVPALTALDPRSTRRLDGHTDAPADLLTRLHWPTPTAGAHRAPLTNAVSRT" CDS complement(357864..359066) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0301C" /product="conserved protein" /note="Mb0301c, -, len: 400 aa. Equivalent to Rv0293c, len: 400 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 400 aa overlap). Conserved hypothetical protein, similar in C-terminal part to Rv2627c|B70573|MTCY01A10.05|CAB08637.1|Z95387 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (413 aa), FASTA scores: opt: 394, E(): 2.1e-17, (31.1% identity in 299 aa overlap). TBparse score is 0.922. Protein product from Mb0301c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0301c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248007.1" /translation="MSGTFTADAIGPPVPIPDVPGADAGAEGLPSRSVLSARQRILVE SSAIADVALRTAVASVLSATVTPAVVANALRHVNEGSERSNLNFYAELAAAHDPAKSF PAPTELPKVTSRPASPLTEWVARGTVDNIAFASGFRAINPTMRQRWSALTANNIVHAQ HWRHRDGPRPTLCVIHGFMGSSYLLNGLFFSLPWYYRSGYDVLLYTLPFHGQRAEKFS PFSGFGYFTSGLSGFAEAMAQAVYDFRSIVDYLRHIGVDRIALTGISLGGYTSALLAS VESRLEAVIPNCPVVMPAKLFDEWFPANKLVKLGLRLTNISRDELIAGLAYHGPLNYR PLLPKDRRMIITGLGDRMAPPEHAVTLWKQWDRCALHWFPGSHLLHVSQLDYLRRMTV FLQGLMFD" CDS 359173..359958 /codon_start=1 /transl_table=11 /gene="tam" /locus_tag="BQ2027_MB0302" /product="PROBABLE TRANS-ACONITATE METHYLTRANSFERASE TAM" /note="Mb0302, tam, len: 261 aa. Equivalent to Rv0294, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv (100.0% identity in 261 aa overlap). Probable tam, trans-aconitate methyltransferase (EC 2.1.1.-), similar to others e.g. P76145|TAM_ECOLI|7465793|B64906|B1519 TRANS-ACONITATE METHYLTRANSFERASE from Escherichia coli strain K12 (252 aa), FASTA scores: opt: 649, E(): 0, (39.3 identity in 252 aa overlap). BELONGS TO THE METHYLTRANSFERASE SUPERFAMILY. Protein product from Mb0302 detected using SWATH mass spectrometry. Mb0302 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248008.1" /translation="MWDPDVYLAFSGHRNRPFYELVSRVGLERARRVVDLGCGPGHLT RYLARRWPGAVIEALDSSPEMVAAAAERGIDATTGDLRDWKPKPDTDVVVSNAALHWV PEHSDLLVRWVDELAPGSWIAVQIPGNFETPSHAAVRALARREPYAKLMRDIPFRVGA VVQSPAYYAELLMDTGCKVDVWETTYLHQLTGEHPVLDWITGSALVPVRERLSDESWQ QFRQELIPLLNDAYPPRADGSTIFPFRRLFMVAEVGGARRSGG" CDS complement(359947..360750) /codon_start=1 /transl_table=11 /gene="stf0" /locus_tag="BQ2027_MB0303C" /product="Sulfotransferase" /note="Mb0303c, -, len: 267 aa. Equivalent to Rv0295c, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 267 aa overlap). Conserved hypothetical protein, showing weak similarity with CAC46877.1|AL591790 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (213 aa); and NP_104818.1|14023999|BAB50604.1|AP00300 Protein with weak similarity to NodH from Mesorhizobium loti (257 aa). Protein product from Mb0303c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0303c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248009.1" /translation="MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPST GMAPQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLM WNQTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQ VWRGHPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNL TAIVASVLDAIGQDPKLAPAPMLERQANQRSDEWVDRYRAEAPRLGLPT" CDS complement(360760..362157) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0304C" /product="PROBABLE SULFATASE" /note="Mb0304c, -, len: 465 aa. Equivalent to Rv0296c, len: 465 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 465 aa overlap). Probable sulfatase, possibly an aryl-/steryl-sulfatase (EC 3.1.6.-) or a sulfamidase (sulfohydrolase) (sulphamidase) (EC 3.10.1.-). Similar to various hydrolases e.g. AAG41945.1|AF304053_1|AF304053 heparan N-sulfatase from Mus musculus (502 aa); NP_061292.1|6851181|AAF29460.1|AF153827_1|AF153827 N-sulfoglucosamine sulfohydrolase (sulfamidase) (sulphamidase) from Mus musculus (502 aa); AAG17206.1|AF217203_1|AF217203 heparan sulfate sulfamidase from Canis familiaris (507 aa); P08842|STS_HUMAN|1360652 STERYL-SULFATASE PRECURSOR (EC 3.1.6.2) (STEROID SULFATASE) (STERYL-SULFATE SULFOHYDROLASE) (ARYLSULFATASE C) (ASC) from Homo sapiens (583 aa); ARSB_FELCA|P33727 arylsulfatase B precursor (EC 3.1.6.1) (535 aa), FASTA scores: opt: 231, E(): 1.7e-08, (30.3% identity in 261 aa overlap). Also similarity with 4 others sulfatases in Mycobacterium tuberculosis. Contains sulfatases signature 1 (PS00523). Note that previously known as atsG. Protein product from Mb0304c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0304c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248010.1" /translation="MTSERATGQRENLLIVHWHDLGRYLGVYHHPDVYSPRLDRLAAE GILFTRAHATAPLCTPSRGSLFTGRYPQSNGLVGLAHHGWEYRTGVQTLPQLLSESGW YSALFGMQHETSYPKRLGFDEFDVSNSYCEYVVAKAQDWLHNRVPALDGQRFLLTAGF FETHRPYPHERYRPADSAAVELPDYLPDTPEVRQDVAEFYGSIATADEAVGRLLDTLA DTGLDASTWVVFVTDHGPAFPRAKSTLYDAGTGIALIIRPPTRRAMAPRVYDELFSGV DLVPTLLDLLRLEVPADVEGVSHAPALLAPDTENAAVRDHVYTAKTYHDSFDPIRAIR TKEYSYIENYAPRPLLDLPWDIQESPAGMAVAPLVKAPRPQRELYDLRADPTETNNLL AGDDSTQGVAAIAADLAVRLHDWRQRTADVIPSDFAGSRIAERYTETYLRIHRKTPTG RSAIAADRGIDEHCS" CDS 362336..364156 /codon_start=1 /transl_table=11 /gene="PE_PGRS5" /locus_tag="BQ2027_MB0305" /product="pe-pgrs family protein pe_pgrs5" /note="Mb0305, PE_PGRS5, len: 606 aa. Equivalent to Rv0297, len: 591 aa, from Mycobacterium tuberculosis strain H37Rv, (97.5% identity in 606 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. Y03A_MYCTU|Q10637 from Mycobacterium tuberculosis (603 aa), FASTA scores: opt: 1884, E(): 0, (53.7% identity in 635 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 45 bp in-frame insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (606 aa versus 591 aa). Mb0305 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248011.1" /translation="MSFVIAQPEMIAAAAGELASIRSAINAANAAAAAQTTGVMSAAA DEVSTAVAALFSSHAQAYQAASAQAAAFHAQVVRTLTVDAGAYASAEAANAGPNMLAA VNAPAQALLGRPLIGNGANGAPGTGQAGGDGGLLFGNGGNGGSGAPGQAGGAGGAAGF FGNGGNGGDGGAGANGGAGGTAGWFFGFGGNGGAGGIGVAGINGGLGGAGGDGGNAGF FGNGGNGGMGGAGAAGVNAVNPGLATPVTPAANGGNGLNLVGVPGTAGGGADGANGSA IGQAGGAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGNGGNGGSVEHTGATGS SASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVNTAGAGGKGGNGGTGGAGGPGGHGG SVLSGPVGDSGNGGAGGDGGAGVSATDIAGTGGRGGNGGHGGLWIGNGGDGGAGGVGG VGGAGAAGAIGGHGGDGGSVNTPIGGSEAGDGGKGGLGGDGGDGAAGGDGGAGGDGGG RGIFGQFGAGGAGGAGGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDGGVGGTG AAGGKGGAGGSGGVNGATGADGAKGLDGATGGKGNNGNPG" CDS 364299..364526 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0306" /product="Programmed cell death antitoxin YdcD" /note="Mb0306, -, len: 75 aa. Equivalent to Rv0298, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 75 aa overlap). Hypothetical unknown protein. Protein product from Mb0306 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0306 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248012.1" /translation="MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRV ALRDYTAKTVPALDIDAYAQRVYQANRAAGS" CDS 364523..364825 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0307" /product="Programmed cell death toxin YdcE" /note="Mb0307, -, len: 100 aa. Equivalent to Rv0299, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 100 aa overlap). Hypothetical unknown protein. Equivalent to AAK44536.1 from Mycobacterium tuberculosis strain CDC1551 (49 aa) but longer 51 aa. Protein product from Mb0307 detected using SWATH mass spectrometry. Mb0307 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248013.1" /translation="MIAPGDIAPRRDNEHELYVAVLSNALHRAADTGRVITCPFIPGR VPEDLLAMVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC" CDS 364873..365094 /codon_start=1 /transl_table=11 /gene="vapb2" /locus_tag="BQ2027_MB0308" /product="possible antitoxin vapb2" /note="Mb0308, -, len: 73 aa. Equivalent to Rv0300, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Conserved hypothetical protein, similar to Rv1721c|MTCY04C12.06c|Z81360|MTCY4C12_4 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (75 aa), FASTA scores: opt: 84, E(): 8.3, (39.5% identity in 38 aa overlap). Protein product from Mb0308 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0308 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248014.1" /translation="MSDVLIRDIPDDVLASLDAIAARLGLSRTEYIRRRLAQDAQTAR VTVTAADLRRLRGAVAGLGDPELMRQAWR" CDS 365091..365516 /codon_start=1 /transl_table=11 /gene="vapc2" /locus_tag="BQ2027_MB0309" /product="possible toxin vapc2" /note="Mb0309, -, len: 141 aa. Equivalent to Rv0301, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Conserved hypothetical protein, similar to other hypothetical M. tuberculosis proteins e.g. Rv2757c, Rv0229c, Rv2546, etc. Protein product from Mb0309 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0309 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248015.1" /translation="MTDQRWLIDKSALVRLTDSPDMEIWSNRIERGLVHITGVTRLEV GFSAECGEIARREFREPPLSAMPVEYLTPRIEDRALEVQTLLADRGHHRGPSIPDLLI AATAELSGLTVLHVDKDFDAIAALTGQKTERLTHRPPSA" CDS 365652..366284 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0310" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR/ACRR-FAMILY)" /note="Mb0310, -, len: 210 aa. Equivalent to Rv0302, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 210 aa overlap). Probable transcription regulatory protein, TetR family, with its N-terminus similar to N-terminus of several repressors and regulatory proteins of TetR/AcrR family e.g. ACRR_ECOLI|P34000 potential acraB operon repressor from Escherichia coli (215 aa), FASTA scores: opt: 172, E(): 3.1e-05, (22.7% identity in 194 aa overlap). Also similar in N-terminus to N-terminus of MTCY07A7.24 hypothetical regulator from Mycobacterium tuberculosis FASTA score: (38.7% identity in 62 aa overlap). Contains probable helix-turn helix motif from aa 35-56 (Score 1728, +5.07 SD). Mb0310 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248016.1" /translation="MGVPAKKKQQQGERSRESILDATERLMATKGYAATSISDIRDAC GLAPSSIYWHFGSKEGVLAAMMERGAQRFFAAIPTWDEAHGPVEQRSERQLTELVSLQ SQHPDFLRLFYLLSMERSQDPVVAAVVRRVRNTAIARFRDSITHLLPSDIPPGKADLV VAELTAFAVALSDGVYFAGHLEPDTTDVERMYRRLRQALEALIPVLLEET" CDS 366281..367189 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0311" /product="PROBABLE DEHYDROGENASE/REDUCTASE" /note="Mb0311, -, len: 302 aa. Equivalent to Rv0303, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Possible dehydrogenase/reductase (EC 1.-.-.-), similar to various NADPH dehydrogenases and other NADPH oxidoreductases e.g. O48741|PORC_ARATH|7488284|T00897 PROTOCHLOROPHYLLIDE REDUCTASE C CHLOROPLAST PRECURSOR (EC 1.3.1.33) (NADPH-PROTOCHLOROPHYLLIDE OXIDOREDUCTASE C) from Arabidopsis thaliana (401 aa); Q42850 NADPH DEHYDROGENASE (EC 1.6.99. 1) (395 aa), FASTA scores: opt: 347, E(): 3.8e-16, (35.4% identity in 319 aa overlap). Protein product from Mb0311 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0311 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248017.1" /translation="MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRA AMEELGEPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAFTD DGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMPDPRYTCAADL AHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQGVMVNAFDPGLMPGSGLA RDYPPILRLAYRLLSPMLRVLPFVHSTRVSGEHLAALAVDPRFAGVTGQYFAGAKAIR SSAESYDRAKALDLWETSERLLAQVT" CDS complement(367197..370640) /codon_start=1 /transl_table=11 /gene="PPE5" /locus_tag="BQ2027_MB0312C" /product="ppe family protein ppe5" /note="Mb0312c, PPE5, len: 1147 aa. Equivalent to 3' end of Rv0304c, len: 2204 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1147 aa overlap). Member of the Mycobacterium tuberculosis PE family (PPE, MPTR), similar to others e.g. Z95324|MTY13E10_16 from M. tuberculosis (1443 aa), FASTA scores: E(): 0, (50.6% identity in 1403 aa overlap); Y04H_MYCTU|Q10778 from M. tuberculosis (734 aa), FASTA scores: opt: 989, E(): 0, (42.3% identity in 522 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE5 and PPE6 exist as separate genes. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) leads to a shorter CDS (Mb0312c) equivalent to the 3' end of Rv0304c/PPE5." /protein_id="CAB5248018.1" /translation="MFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTSTGLFNS GDGNTGGFNPGNFNTGNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVSTGAF ISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNIPIDIPITGSFTDLVVDNFTI PIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPIT IPIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLL NVGALGSGVANVGNTISGIYNTSPLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLG NLNVGLANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVGFANSGNYHIGIAN TGSANIGFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNV GIGNTGTANFGIANSGSFNTGLGNTGSTNTSLFNPGNVNTGVGNTGSINTGSFNTGST NTGSFNLGDHNTGSFNSGDYNTGYFNAGDYNTGVANTGNVNTGAFISGNYSNGFFWRG DYQGLIGLSTTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGFQIRVLLGPAA VLVNEMIGPITIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGP SSGFFHTGAGHVSGFGNFGAGNMSGFGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLA NLGNTISGVYNTSTLDLATPAFGSGIANIGANLAGLFLDNTGNLTLNFGVANQGGLNA GIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLGSY NIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTG NGQIGIGSLNSGSNNIGLFNSGSGNIGFFNSGTGNVGIFNAGTGNFGLANSGGFNTGI GNAGSTNTGVFNPGDLNTGSFNPGSFNTGGFNPGSGNTGYLNTGDYNTGVANTGDVDT GAFITGSYSNGFLVSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFV NSGAGLSGYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSGIGNFGTNLAGFFR G" CDS complement(370784..376741) /codon_start=1 /transl_table=11 /gene="PPE6" /locus_tag="BQ2027_MB0313C" /product="ppe family protein ppe6" /note="Mb0313c, PPE6, len: 1985 aa. Equivalent to 5' end of Rv0305c, len: 963 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 809 aa overlap). Member of the Mycobacterium tuberculosis PE family (PPE, MPTR), similar to others e.g. Y04H_MYCTU|Q10778 from M. tuberculosis (734 aa), FASTA scores: opt: 1340, E(): 0, (40.9% identity in 815 aa overlap); Z95324|MTY13E10_16 from Mycobacterium tuberculosis (1443 aa), FASTA scores: E(): 0, (50.6% identity in 1403 aa overlap); Y04H_MYCTU|Q10778 from Mycobacterium tuberculosis (734 aa), FASTA scores: opt: 989, E(): 0, (42.3% identity in 522 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE5 and PPE6 exist as separate genes. In Mycobacterium bovis, a single base deletion (t-*) resulting in the absence of a stop codon leads to a longer product. The second part of this CDS shares homology with the 5' end of Rv0304c/PPE5. Mb0313c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248019.1" /translation="MDFVVSAPEVNSLRMYLGAGSGPMLAAAAAWDGLADELAVAASW FGSVTSGLADAAWRGPAAVAMARAVAPYLGWLISATAQAEQAAAQARVAVATFEAARA ATVHPAIVAANRAVLVSLVSSNLLGFNAPAIAATEAAYERMWAQDVAAMVGYHAGASA AVSALMPFTQQLKKLAGLSERLTSAAAAAAGPPSAAGFNLGLANVGANNVGNGNVGVF NVGFGNLGSYNLGFANLGSDNLGLANLGGHNIGFANTGSNNVGFGNTGSNNVGIGLTG NGQIGFGSFNSGSHNIGLFNSGSGNVGLFNSGTGNFGIGNSGTGNFGLGNTGSTNTGW FNTGDVNTGGFNPGSYNTGNFNTGNYNTGSFNAGNYNTGYFNTGDYNTGVANTGNVNT GAFIAGNYSNGVLWRGDYQGLIGADIALEIPAIPINAQLFSMPIHQVMVMPGSVMTIP GMRLPFTSIVPFVVYYGPVELPQSTLTLPTVTITVGGPTTTIDGNLTGMVGGVSIPLI KIPAAPGFGNSTTSPSSGFFNAGAGTASGFGNFGGGASGFWNLASATSGLSGFGNVGA LGSGVANVGNTISGLYNTSTSNLATPAFNSGLLHHSVGTMTLNFGLANVGGNNVGGAN AGIFNVGLANLGDYNIGFGNLGGDNLGFAHAGSYNIGFANTGSNNLGFANTGDNNIGF ANIGSNNIGIGLTGSGQIGFGSLNSGSHNIGLFNSGDGNIGLFNSGSGNFGIGNAGTG NWGIGNSGAGNFGIGNAGSTNTGLFNSGDLNTGSLNPGSYNTGSVNTGSVNTGGFNAG NYNTGYFNTGDYNTGMANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIP IVSVDVNIPIHIPITASFTDIVYSGLDLPPNTAVTVIFFGPVDIDPFTVPVIRITGPT PVVMVGGPTTAINIGATVGVDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGF GNFGGAASGFMNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNS GLANIGTNIAGLLRDGAGTAAINLGLANHGNLNVGFASLGGFNFGGATIGHNNVGIGN TGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGNDNLGFANAGGGNIGF ANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSGNIGLFNSGSNNIGFFNSGSG NFGIANSGSFNTGIGNTGNTNTGLFNSGDVNTGAFNPGSFNTGSFNTGSFNTGGFNPG NTNTGYLNIGNYNTGIANTGDVDTGAFITGNYSNGLFLSGDYQGLVGLNLVIDMPLPI SLGVNIPIDIPITASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNITVVGPT TTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASGFGNF GGANSGFWNLASATSGASGLLNVGALGSGLANVGTTVSGFYNTSTSDLATPAFNSGLA NISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDYNIGFANLGSANFGSANIGG NNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSYNVGFGNLGNDNLGFANT GSNNIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGNGNVG IGNTGTANFGLGNTGSTNTGFFNSGDVNTGIGNTGSFNTGSFNPGDSNTGDFNPGSYN TGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALTFGVDIPIHIPI NIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSIGITASAGIG SITIPIIDIPATSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYLNVGALGS GVTNVGHTVSGFYNASALDLVTPAFASGLMRDGMGTMTLNLGLANLGSNNAGFGNTGI FDVGVANLGNYNIGFGNFGDDNLALPT" CDS 376944..377615 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0314" /product="PUTATIVE OXIDOREDUCTASE" /note="Mb0314, -, len: 223 aa. Equivalent to Rv0306, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 223 aa overlap). Putative oxidoreductase (EC 1.-.-.-), highly similar to H83485|9947208|AAG04663.1|AE004557_4|AE004557 conserved hypothetical protein from Pseudomonas aeruginosa strain PAO1 (218 aa); and to other putative oxidoreductases e.g. middle part of CAB76073.1|AL157953 putative nitroreductase from Streptomyces coelicolor (1212 aa); Q52685|BLUB protein involved in cobalamin (vitamin B12) synthesis from Rhodobacter capsulatus (206 aa), FASTA scores: opt: 318, E(): 2e-15, (35.6% identity in 191 aa overlap). Mb0314 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248020.1" /translation="MFSAPERRAVYRVIAERRDMRRFVPGGVVSEDVLARLLHAAHAA PSVGLMQPWRFIRITDETLKRRIHALVDDERLLTAEALGAREEEFLALKVEGILDCAE LLVVALCDRRGSYIFGRRTLPQMDLASVSCAIQNLWLAARSEGLGMGWVSLFDPQRLA ALLAMPADAEPVAILCLGPVPEFPDRPALELDGWAYARPLAEFVSENRWSYPSALATD HHHGE" CDS complement(377603..378085) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0315C" /product="unknown protein" /note="Mb0315c, -, len: 160 aa. Equivalent to Rv0307c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Hypothetical unknown protein. Protein product from Mb0315c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0315c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248021.1" /translation="MAVIVRKWFGLGRLPADLRCQVEAEGLIYLAEYVAVTRRFTGVI PGLRASHSIASYVGALAFTEQRVLGTLSMVPKLAGRVVDARWDGPQAGAATAEISPTG LQLDLDVADVDPKFSGQLALHFKATIGEDVLSRLPRRSLAFDVPAEYVNLAVGVTYSP " CDS 378143..378859 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0316" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0316, -, len: 238 aa. Equivalent to Rv0308, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 238 aa overlap). Probable conserved integral membrane protein, with C-terminus highly similar to C-terminus of other integral membrane proteins or phosphatases e.g. AAK25788.1|AF336822_1|13430250|AAK25789. 1|AF336823_1 putative phosphatase from Streptococcus pyogenes (201 aa); Q06074 HYPOTHETICAL 24.9 KD PROTEIN (216 aa), FASTA scores: opt: 209, E(): 2e-07, (27.9% identity in 140 aa overlap). Could be a phosphatase. Protein product from Mb0316 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0316 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248022.1" /translation="MTRPQALLAVSLAFVATAVYAVMWVGHSQDWGWLHSFDWSLLNA AHDIGIKNPAWVRFWDGVSLILGPVVLRPLGLLAAMVALAKRKIRIALLLLACLPLNA IMTIAAKSVAHRPRPATALVSAHSTSFPSGHALEATASVLALLTVLLPMLHSRFTRHI AITVGALCVLTVGVARVALNVHHPTDVVAGWALGYLYFLVCLCVFRPPSIFGAQRASH ALSPPVEVSRQPEPEVDTAR" CDS 378961..379617 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0317" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb0317, -, len: 218 aa. Equivalent to Rv0309, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Possible conserved exported protein (has putative N-terminal signal sequence), equivalent to AC32053.1|AL583926 putative secreted protein from Mycobacterium leprae (218 aa). Also similar to others e.g. AB76092.1|AL157956 putative secreted protein from Streptomyces coelicolor (238 aa). Protein product from Mb0317 detected using SWATH mass spectrometry. Mb0317 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248023.1" /translation="MSRLLALLCAAVCTGCVAVVLAPVSLAVVNPWFANSVGNATQVV SVVGTGGSTAKMDVYQRTAAGWQPLKTGITTHIGSAGMAPEAKSGYPATPMGVYSLDS AFGTAPNPGGGLPYTQVGPNHWWSGDDNSPTFNSMQVCQKSQCPFSTADSENLQIPQY KHSVVMGVNKAKVPGKGSAFFFHTTDGGPTAGCVAIDDATLVQIIRWLRPGAVIAIAK " CDS complement(379687..380178) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0318C" /product="Bile acid 7-alpha dehydratase BaiE (EC" /EC_number="4.2.1.106" /note="Mb0318c, -, len: 163 aa. Equivalent to Rv0310c, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, similar to some bile acid dehydratases e.g. P19412|BAIE_EUBSP|98749|D37844|1381566|A AC45413.1|U57489 BILE ACID-INDUCIBLE OPERON PROTEIN E from Eubacterium sp (166 aa), FASTA scores: opt: 302, E(): 1e-11, (38.8% identity in 134 aa overlap); AAF22847.1|AF210152_4 bile acid 7a-dehydratase from Clostridium sp. (168 aa). Protein product from Mb0318c detected using shotgun mass spectrometry. Mb0318c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248024.1" /translation="MCCNGVVTPGDPADIAAIKQLKYRYLRALDTKHWDDFTDTLAED VTGDYGSSVGTELHFTNRADLVDYLRQALGPGVITEHRVTHPEITVTGDTATGIWYLQ DRVIVAEFNFMLIGAAFYHDQYRRTTDGWRISATGYDRTYEATMSLAGLNFNIRPGRA LAD" CDS 380202..381431 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0319" /product="unknown protein" /note="Mb0319, -, len: 409 aa. Equivalent to Rv0311, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 409 aa overlap). Hypothetical unknown protein. Contains PS00881 Protein splicing signature. Protein product from Mb0319 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0319 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248025.1" /translation="MSQSRYAGLSRSELAVLLPELLLIGQLIDRSGMAWCIQAFGRQE MLQIAIEEWAGASPIYTKRMQKALNFEGDDVPTIFKGLQLDIGAPPQFMDFRFTLHDR WHGEFHLDHCGALLDVEPMGDDYVVGMCHTIEDPTFDATAIATNPRAQVRPIHRPPRK PADRHPHCAWTVIIDESYPEAEGIPALDAVRETKAATWELDNVDASDDGLVDYSGPLV SDLDFGAFSHSALVRMADEVCLQMHLLNLSFAIAVRKRAKADAQLAISVNTRQLIGVA GLGAERIHRAMALPGGIEGALGVLELHPLLNPAGYVLAETSPDRLVVHNSPAHADGAW ISLCTPASVQPLQAIATAVDPHLKVRISGTDTDWTAELIEADAPASELPEVLVAKVSR GSVFQFEPRRSLPLTVK" CDS 381586..383448 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0320" /product="FIG00821990: molecular chaperone" /note="Mb0320, -, len: 620 aa. Equivalent to Rv0312, len: 620 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 620 aa overlap). Conserved hypothetical protein with highly Pro-, Thr-rich C-terminus. Similar to Pro-,Thr-rich region in Rv2264c|AL021925|MTV022_14 from Mycobacterium tuberculosis (592 aa), FASTA scores: opt: 1075, E(): 0, (38.9% identity in 627 aa overlap). Also some similarity with Rv0350|dnaK from Mycobacterium tuberculosis. Possibly membrane protein; has hydrophobic stetch in its middle part. Protein product from Mb0320 detected using SWATH mass spectrometry. Mb0320 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248026.1" /translation="MYDPLGLSIGTTNLVAAGNGGPPVTRRAVLTLYPHCAPKIGVPS QNPNLIEPGALMSGFVERIGDAVALVSPDGSVHDPDLLLVEALDAMVLTAGADASSSE IAIAVPAHWKPGAVHALRNGLRTHVGFVRSGMAPRLVSDAIAALTAVNSELGLPHGGV VGLLDFGGSATYVTLVETKSDSRTSDFQPVSATARYQDFSGSQIDQALLLRVIDQFGY GDDVDPASTAAVGQLGQLREQCRAAKERLSTDVATELFAELAGCSSSIEMTREQLEDL IQDPLTGFIYAFDDMLARHNASWADLAAVVTVGGGANIPLVTQRLSFHTRRPVLTASQ PGCAAAMGALLLANRGGERDSRTRTSIGLATAAAAGTSVIELPAGDVMVIDHEALTDR ELAWSQTDFPSEAPARFEGDSYNEGGPCWSMRLNAVEPPKGPAWRRIRVSQLLIGVSA VVAMTAIGGVALTLTAIERRPSPLPTPIVPGLAPMPPGSVVPSSRAPTPPPPPSTVAP LPSAAPAPTTVAPAPPPPTQVVTTTTAPPVTTTPRPSPTTTTTTAPPSTTTTTELPVT TTSTIPTIPTTTTTVKMTTEWLHVPFLPVPIPVPIPQNPGAGEPQNPFGSLGSG" CDS 383520..383906 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0321" /product="conserved protein" /note="Mb0321, -, len: 128 aa. Equivalent to Rv0313, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Conserved hypothetical protein, equivalent only to CAC32049.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (130 aa). TBparse score is 0.877. Protein product from Mb0321 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0321 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248027.1" /translation="MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTG WSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNT DPKRKVRFLPYGIAVSVLDDPVDEAQ" CDS complement(383909..384571) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0322C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0322c, -, len: 220 aa. Equivalent to Rv0314c, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 220 aa overlap). Possible conserved membrane protein, with hydrophobic stretch from residues ~75-100. Similar in C-terminal part to Mycobacterium tuberculosis proteins Rv0679c and Rv0680c. Mb0322c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248028.1" /translation="MIVVWEHLCMNPEDDPEARIRELERPLADVARASELGGSQSGGY TYPPGPPPPPYSYGGPFGGPSPRSSSGNRAWWILAAVVVVGVLVLVGGIAAFSAQRLS QGNFVVLSPTPSVSRAVPTPTAQPATTLPPAGASLSVSGVNVNRTIACNDSIVSVSGM SNTVVITGHCTSLTVSGMRNSVTADSVDTIEAAGFNNEVTYHSGSPKISNAGGSNSVQ QG" CDS 384632..385516 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0323" /product="POSSIBLE BETA-1,3-GLUCANASE PRECURSOR" /note="Mb0323, -, len: 294 aa. Equivalent to Rv0315, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 294 aa overlap). Possible beta-1,3-glucanase precursor (EC 3.2.1.-) (has hydrophobic stretch in its N-terminal part), similar to others e.g. Q51333|AAC44371.1 BETA-1,3-GLUCANASE II A from Oerskovia xanthineolytica (306 aa), FASTA scores: opt: 76, E(): 3e-14, (34.1% identity in 302 aa overlap); and AAC38290.1|AF052745 beta-1,3-glucanase II from Oerskovia xanthineolytica (435 aa). Contains glycosyl hydrolases family 16 active site signature (PS01034). Protein product from Mb0323 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0323 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248029.1" /translation="MLMPEMDRRRMMMMAGFGALAAALPAPTAWADPSRPAAPAGPTP APAAPAAATGGLLFHDEFDGPAGSVPDPSKWQVSNHRTPIKNPVGFDRPQFFGQYRDS RQNVFLDGNSNLVLRATREGNRYFGGLVHGLWRGGIGTTWEARIKFNCLAPGMWPAWW LSNDDPGRSGEIDLIEWYGNGTWPSGTTVHANPDGTAFETCPIGVDGGWHNWRVTWNP SGMYFWLDYADGIEPYFSVPATGIEDLNEPIREWPFNDPGYTVFPVLNLAVGGSGGGD PATGSYPQEMLVDWVRVF" CDS 385565..386179 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0324" /product="POSSIBLE MUCONOLACTONE ISOMERASE" /note="Mb0324, -, len: 204 aa. Equivalent to Rv0316, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Possible muconolactone isomerase (EC 5.3.3.-), showing weak similarity with some muconolactone isomerases e.g. O33947|CTC1_ACILW MUCONOLACTONE DELTA-ISOMERASE 1 (MIASE 1)(96 aa), FASTA scores: opt: 179, E(): 3.9e-05, (32.6% identity in 92 aa overlap). Protein product from Mb0324 detected using SWATH mass spectrometry. Mb0324 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248030.1" /translation="MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLW RPPLRPGEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGITIA PGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLWALPDGPDGQR TLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDPIRMP" CDS complement(386203..386973) /codon_start=1 /transl_table=11 /gene="glpQ2" /locus_tag="BQ2027_MB0325C" /product="possible glycerophosphoryl diester phosphodiesterase glpq2 (glycerophosphodiester phosphodiesterase)" /note="Mb0325c, glpQ2, len: 256 aa. Equivalent to Rv0317c, len: 256 aa (start uncertain, chosen by homology), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Possible glpQ2, glycerophosphoryl diester phosphodiesterase (EC 3.1.4.46), similar to others e.g. E75317|6459876|AAF11631.1|AE002044_4 glycerophosphoryl diester phosphodiesterase from Deinococcus radiodurans (285 aa); P10908|UGPQ_ECOLI from Escherichia coli (247 aa), FASTA scores: opt: 220, E(): 5.2e-07, (28.0% identity in 250 aa overlap). Also similar to MTCY01A6.27 from Mycobacterium tuberculosis FASTA score: (27.5% identity in 247 aa overlap). Protein product from Mb0325c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0325c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248031.1" /translation="MEFLRHGGRIAMAHRGFTSFRLPMNSMGAFQEAAKLGFRYIETD VRATRDGVAVILHDRRLAPGVGLSGAVDRLDWRDVRKAQLGAGQSIPTLEDLLTALPD MRVNIDIKAASAIEPTVNVIERCNAHNRVLIGSFSERRRRRALRLLTKRVASSAGTGA LLAWLTARPLGSRAYAWRMMRDIDCVQLPSRLGGVPVITPARVRGFHAAGRQVHAWTV DEPDVMHTLLDMDVDGIITDRADLLRDVLIARGEWDGA" tRNA complement(387234..387304) /locus_tag="BQ2027_GLYU" /product="tRNA-Gly" /note="glyU, len: 71 nt. Equivalent to glyU, len: 71 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 nt overlap). tRNA-Gly, anticodon ccc." CDS complement(387335..388129) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0326C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0326c, -, len: 264 aa. Equivalent to Rv0318c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 264 aa overlap). Probable conserved integral membrane protein, with some similarity to C-terminus of GUFA_MYXXA|Q06916 (254 aa), FASTA scores: opt: 157, E (): 0.0032, (28.3% identity in 198 aa overlap). Also similar to O26573 CONSERVED PROTEIN from Methanobacterium thermoauto (259 aa), FASTA scores: opt: 173, E(): 5.2e-05, (32.7% identity in 214 aa overlap). Mb0326c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248032.1" /translation="MSLAVTMFKRARAEIFDRNREVGISNVTTAASLVTFPVLAGILG GVVPSVRTPSAAMVSGVQHFAAGIVMAAVAGEVLPDLRSRGPLWLIVVGFSAGVAVLV ALRRFDGHGEHQDGDDVGELPVGFLTVVAVDLFIDGLLVATGATVSSRTAIIITIALT VEVLFLGLAVALRLAGSGMPRIRAAATTSALSLVIAVGGVSGAVALGRAGNTVLTLVL AFAAAALLWLVVEELLVEAHETPERPWMAVMFFAGFLILYGLGVME" CDS 388178..388846 /codon_start=1 /transl_table=11 /gene="pcp" /locus_tag="BQ2027_MB0327" /product="PROBABLE PYRROLIDONE-CARBOXYLATE PEPTIDASE PCP (5-OXOPROLYL-PEPTIDASE) (PYROGLUTAMYL-PEPTIDASE I) (PGP-I) (PYRASE)" /note="Mb0327, pcp, len: 222 aa. Equivalent to Rv0319, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 222 aa overlap). Probable pcp, pyrrolidone-carboxylate peptidase (EC 3.4.19.3), highly similar to others e.g. PCP_PSEFL|P42673 pyrrolidone-carboxylate peptidase from Pseudomonas fluorescens (213 aa), FASTA scores: opt: 478, E(): 7.5e-25, (40.2% identity in 219 aa overlap). BELONGS TO PEPTIDASE FAMILY C15 (THIOL PROTEASE). Protein product from Mb0327 detected using SWATH mass spectrometry. Mb0327 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248033.1" /translation="MSKVLVTGFGPYGVTPVNPAQLTAEELDGRTIAGATVISRIVPN TFFESIAAAQQAIAEIEPALVIMLGEYPGRSMITVERLAQNVNDCGRYGLADCAGRVL VGEPTDPAGPVAYHATVPVRAMVLAMRKAGVPADVSDAAGTFVCNHLMYGVLHHLAQK GLPVRAGWIHLPCLPSVAALDHNLGVPSMSVQTAVAGVTAGIEAAIRQSADIREPIPS RLQI" CDS 388918..389580 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0328" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb0328, -, len: 220 aa. Equivalent to Rv0320, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Possible conserved exported protein, similar to some hypothetical proteins and to the middle part of a peptidase: NP_066789.1|10657900|AAG21739.1|AF116907 putative peptidase from Rhodococcus equi (546 aa). Also similar to Rv1728c|MTCY04C12.13c from Mycobacterium tuberculosis (256 aa), FASTA scores: opt: 497, E(): 1.2e-26, (41.8% identity in 225 aa overlap). Mb0328 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248034.1" /translation="MGRHELARDRRKSSAVLAAVLAPAAVFFATGGDVSTLAARADAN PVLGDDAPCCVQIVPVAPLAFSSQISGGEIGTGLAASQFASASRWRIVSRYLPVGVAP EQGLQVKTVLTARSISAAFPEIREIGGVRPDALRWHPNGLALDVMVPNPGTAEGIALG NEIVAFVLKNATRFGMQDVIWRGAYYTPNGARTTGAGHYDHIHITTVGGGYPTGEELY IR" CDS 389612..390184 /codon_start=1 /transl_table=11 /gene="dcd" /locus_tag="BQ2027_MB0329" /product="PROBABLE DEOXYCYTIDINE TRIPHOSPHATE DEAMINASE DCD (DCTP DEAMINASE)" /note="Mb0329, dcd, len: 190 aa. Equivalent to Rv0321, len: 190 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 190 aa overlap). Probable dcd (alterrnate gene names: dus or paxA), deoxycytidine triphosphate deaminase (EC 3.5.4.13), equivalent to CAC32024.1|AL583925 probable deoxycytidine triphosphate deaminase from Mycobacterium leprae (190 aa). Also highly similar to others e.g. Q9X8W0|DCD_STRCO|7480599|T36613|SCH35.46 DEOXYCYTIDINE TRIPHOSPHATE DEAMINASE from Streptomyces coelicolor (191 aa); DCD_ECOLI|P28248|DUS|PAXA|B2065 DEOXYCYTIDINE TRIPHOSPHATE DEAMINASE from Escherichia coli strain K12 (193 aa), FASTA scores: opt: 408, E(): 2.7e-21, (43.1% identity in 188 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE DCTP DEAMINASE FAMILY. Protein product from Mb0329 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0329 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248035.1" /translation="MLLSDRDLRAEISSGRLGIDPFDDTLVQPSSIDVRLDCLFRVFN NTRYTHIDPAKQQDELTSLVQPVDGEPFVLHPGEFVLGSTLELFTLPDNLAGRLEGKS SLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPITLWPGMKIGQLCMLRLTSPSEH PYGSSRAGSKYQGQRGPTPSRSCQNFIRST" CDS 390290..391621 /codon_start=1 /transl_table=11 /gene="udgA" /locus_tag="BQ2027_MB0330" /product="PROBABLE UDP-GLUCOSE 6-DEHYDROGENASE UDGA (UDP-GLC DEHYDROGENASE) (UDP-GLCDH) (UDPGDH)" /note="Mb0330, udgA, len: 443 aa. Equivalent to Rv0322, len: 443 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 443 aa overlap). Probable udg (alternate gene name: rkpK), UDP-glucose 6-dehydrogenase (EC 1.1.1.22), highly similar to others e.g. CAC44517.1|AL596138 putative UDP-glucose 6-dehydrogenase from Streptomyces coelicolor (447 aa); Q56812 UDP-GLUCOSE DEHYDROGENASE from Xanthomonas campestris (445 aa), FASTA scores: opt: 713, E(): 0, (41.9% identity in 351 aa overlap); etc. Also similar to several GDP-mannose 6-dehydrogenase. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UDP-GLUCOSE/GDP-MANNOSE DEHYDROGENASES FAMILY. Mb0330 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248036.1" /translation="MRCSVFGTGYLGATHAVGMAQLGHEVVGVDIDPGKVAKLAGGDI PFYEPGLRKLLTDNLAAGRLRFTTDYDMAADFADVHFLGVGTPQKIGEYGADLRHVHA VIDALVPRLVRASILVGKSTVPVGTAAELGHRAGALAPRGVDVEIAWNPEFLREGFAV HDTLNPDRIVLGVQDDSTRAEVAVRELYAPLLAAGVPFLVTDLQTAELVKVSANAFLA TKISFINAISEVCEAAGADVSQLADALGYDPRIGRQCLNAGLGFGGGCLPKDIRAFMA RAGELGADQALTFLREVDSINMRRRTKMVELATTACGGSLLGANIAVLGAAFKPESDD VRDSPALNVAGQLQLNGATVHVYDPKALDNAHRLFPTLNYAVSVAEACERADAVLVLT EWREFIDLEPADLANRVRARVIVDGRNCLDVTRWRRAGWRVFRLGVPRLGH" CDS complement(391610..392281) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0331C" /product="Mycothiol conjugate amidase Mca (Mycothiol S-conjugate amidase)" /note="Mb0331c, -, len: 223 aa. Equivalent to Rv0323c, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 223 aa overlap). Conserved hypothetical protein, similar to others e.g. YPJG_BACSU|P42981 hypothetical 24.8 kd protein from Bacillus subtilis (224 aa), FASTA scores: opt: 182, E(): 1.3e-05, (27.5% identity in 211 aa overlap). Also some similarity to MLU15183_8 from Mycobacterium tuberculosis FASTA score: (32.0% identity in 147 aa overlap). Mb0331c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248037.1" /translation="MNSCNRLPCAHEVLAVFAHPDDESFGLGAVLGDFTAQGTRLRGL CFTHGEASTLGRTDRNLGEVRREELAAAAQVLGVDHVQLLAYPDNGLAQIPLNELTQR VVDALAGADLLLVFDDNGVTGHPDHRRATEAALAAASTPGIPVLAWALPQPIADRLNA EFSASFGGRGHGHLDIMIEVDRSRQLAAIGCHFTQSADNPVLWRRLELLGDREYLRWL RRSVP" CDS 392382..393062 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0332" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY ARSR-FAMILY)" /note="Mb0332, -, len: 226 aa. Equivalent to Rv0324, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 226 aa overlap). Possible transcriptional regulator, arsR family, with its N-terminus similar to the N-terminus of other DNA-binding proteins e.g. P30346|MERR_STRLI probable mercury resistance operon from Streptomyces lividans (125 aa), FASTA scores: opt: 154, E(): 0.002, (32.2% identity in 90 aa overlap)), and its C-terminal part similar to hypothetical bacterial proteins e.g. P54510|YQHL_BACSU hypothetical 14.6 kd protein from Bacillus subtilis (126 aa), FASTA scores: opt: 159, E(): 0.00097, (35.5% identity in 76 aa overlap)). Most similar to AJ005575|SPE005575_2 ORF1 required for antibiotic production from Streptomyces peucetius (226 aa), FASTA scores: opt: 816, E(): 0, (60.7% identity in 211 aa overlap). Also similar in C-terminus to MTCY164.26 molybdopterin biosynthesis moeb protein from Mycobacterium tuberculosis FASTA score: (36.8% identity in 114 aa overlap). Protein product from Mb0332 detected using SWATH mass spectrometry." /protein_id="CAB5248038.1" /translation="MAGQSDRKAALLDQVARVGKALANGRRLQILDLLAQGERAVEAI ATATGMNLTTASANLQALKSGGLVEARREGTRQYYRIAGEDVARLFALVQVVADEHLA DVAVAAADVLGSPEDAITRAELLRRREAGEVTLVDVRPHEEYQAGHIPGAINIPIAEL ADRLAELAGDRDIVAYCRGAYCVMAPDAVRIARDAGREVKRLDDGMLEWRLAGLPVDE GAPVGHGD" CDS 393069..393758 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0333" /product="Methylated-DNA--protein-cysteine methyltransferase (EC" /EC_number="2.1.1.63" /note="Mb0333, -, len: 229 aa. Equivalent to Rv0325 and Rv0326, len: 74 aa and 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 aa overlap and 100.0% identity in 151 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0325 and Rv0326 exist as 2 genes. In Mycobacterium bovis, the absence of a stop codon between Rv0325 and Rv0326 due to a single base transition (t-c) leads to single product. Mb0333 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248039.1" /translation="MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGV YAAEVFNADGVQRVLELAAGHGRDTLYFAGQGFTVVATDFSDVAVAQLRRSAQARGVS ARVQPIVHDLRQPLPVKTGSIDGAFAHMALCMALSTSEIHAVVAEVGRVLRPGGKFIY TVRHTGDAHYGAGQAHGDDIFECAGFAVHFFRRELVARLATGWVLEEVHDFEEGELPR RLWRVTVTKPA" CDS complement(393726..395075) /codon_start=1 /transl_table=11 /gene="cyp135A1" /locus_tag="BQ2027_MB0334C" /product="POSSIBLE CYTOCHROME P450 135A1 CYP135A1" /note="Mb0334c, cyp135A1, len: 449 aa. Equivalent to Rv0327c, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 449 aa overlap). Possible cyp135A1, cytochrome P450 (EC 1.14.-.-), similar to cytochrome P-450 monoxygenases and other cytochrome P-450 related enzymes e.g. FQ12609 PUTATIVE P450 MONOOXYGENASE (EC 1.14.14.1) (506 aa), FASTA scores: opt: 276, E() : 1.7e-11, (27.9% identity in 433 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. MTV039.06|Rv0568 PUTATIVE CYTOCHROME P450 (472 aa); MTCI5.10 cytochrome p450 FASTA score: (30.4% identity in 434 aa overlap). Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. Alternative start possible at 33706 but no RBS." /protein_id="CAB5248040.1" /translation="MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFS LRVPPYADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEYAR MRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALTLDIILRVVFG VTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWKRFFHNQTKIDEILYREIA SRRIDSDLTARTDVLSRLLQTKDTPTKPLTDAELRDQLITLLLAGHETTAAALSWTLW ELAHAPEIQSQVVWAAVGGDDGFLEAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPA GTVVNTSILLAHASEVSHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTE GAVILQEIFRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP" CDS 395141..395743 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0335" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR/ACRR-FAMILY)" /note="Mb0335, -, len: 200 aa. Equivalent to Rv0328, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 200 aa overlap). Possible transcription regulator, tetR/acrR family, similar in part to various hypothetical transcriptional regulators e.g. T36696|4726006|CAB41735.1|AL049731 probable regulatory protein from Streptomyces coelicolor (197 aa). Also some similarity with YX44_MYCTU|Q10829 hypothetical transcriptional regulator from Mycobacterium tuberculosis (195 aa), FASTA scores: opt: 154, E(): 0.00061, (26.7% identity in 202 aa overlap). Contains probable helix-turn helix motif from aa 27-48 (Score 1408, +3.98 SD). SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb0335 detected using SWATH mass spectrometry. Mb0335 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248041.1" /translation="MQQQRTNRDKLLDGALACLRERGYGNTSSRDIARAAGVNIASIN YHFGSKDALLDDALGRCFSTWNQRVQEAFDHSRAAGPAGQILAVLEATVDSFEQIRPA VYACVESYAPALRSEALRERLAAGYADVRQHSVDLAGAALAGTDIAPPENLSTIVSVL MAVIDGLMIQWIADPSATPRSTEVIRALASIGAVVTSQLR" CDS complement(395724..396350) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0336C" /product="SAM-dependent methyltransferase" /note="Mb0336c, -, len: 208 aa. Equivalent to Rv0329c, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 208 aa overlap). Conserved hypothetical protein, showing some similarity with others hypothetical proteins and methyltransferases e.g. MitM|AF127374_14 methyltransferase from Streptomyces lavendulae (283 aa), FASTA scores: opt: 242, E(): 1.8e-08, (37.2% identity in 145 aa overlap); Q48938 from Methanosarcina barkeri (262 aa), FASTA scores: opt: 194, E(): 3.6e-06, (31.1% identity in 119 aa overlap). Protein product from Mb0336c detected using SWATH mass spectrometry." /protein_id="CAB5248042.1" /translation="MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVEL LAPGPGERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLISL YHGDGVTLPVADHSLDKVLGVHNFYFSPDPRASLCDIARALRPGGRLVLTSISDDQPL AARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPATVWFTATAT" CDS complement(396377..397117) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0337C" /product="Transcriptional regulator, AcrR family" /note="Mb0337c, -, len: 246 aa. Equivalent to Rv0330c, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Hypothetical unknown protein." /protein_id="CAB5248043.1" /translation="MARSIPADRFSAIVAASARVFIAHGYQRTQVQDVADALALAKGT LYGYAQGKAALFAAAVRYGDAQEALPLASELPVAAPVAGEIAAVVSARLAGEVTDMRL THALRATLPPGATTGDARAELAGIVTDLYSRLARHRIALKLVDRCAPELPDLAEVWFG TGRNAQVDAVQAYLVHRERAGLLILPGPAPMVARTIVELCALWAVHLHFDPSPEPWSI VQPGVIDDDAIAATLAEFVVRATTASSD" CDS 397231..398397 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0338" /product="POSSIBLE DEHYDROGENASE/REDUCTASE" /note="Mb0338, -, len: 388 aa. Equivalent to Rv0331, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 388 aa overlap). Possible dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases e.g. NP_103779.1|14022957|BAB49565.1|AP002999 flavoprotein reductase from Mesorhizobium loti (377 aa); NP_147681.1 predicted NAD(FAD)-dependent dehydrogenase from Aeropyrum pernix (381 aa); DHSU_CHRVI|Q06530 sulfide dehydrogenase (431 aa), FASTA scores: opt: 347, E(): 6.8e-15, (25.6% identity in 348 aa overlap). TBparse score is 0.906. Protein product from Mb0338 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0338 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248044.1" /translation="MSKTVLILGAGVGGLTTADTLRQLLPPEDRIILVDRSFDGTLGL SLLWVLRGWRRPDDVRVRPTAASLPGVEMVTATVAHIDIAAQVVHTDNSVIGYDALVI ALGAALNTDAVPGLSDALDADVAGQFYTLDGAAELRAKVEALEHGRIAVAIAGVPFKC PAAPFEAAFLIAAQLGDRYATGTVQIDTFTPDPLPMPVAGPEVGEALVSMLKDHGVGF HPRKALARVDEAARTMHFGDGTSEPFDLLAVVPPHVPSAAARSAGLSESGWIPVDPRT LSTSADNVWAIGDATVLTLPNGKPLPKAAVFAEAQAAVVAHGVARHLGYDVAERHFTG TGACYVETGDHQAAKGDGDFFAPSAPSVTLYPPSREFHEEKVAQELAWLTRWKT" CDS 398472..399257 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0339" /product="conserved protein" /note="Mb0339, -, len: 261 aa. Equivalent to Rv0332, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 261 aa overlap). Conserved hypothetical protein, similar to several conserved hypothetical proteins from Streptomyces coelicolor e.g. SC6A9.18c|AL031035|SC6A9_18|T35449 hypothetical protein (266 aa), FASTA scores: opt: 508, E(): 5.7e-27, (36.7% identity in 251 aa overlap). Protein product from Mb0339 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0339 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248045.1" /translation="MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWS LGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDA VEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISE FLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGGWTVRRDERGVTWSHRHGKGA VALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL" CDS 399284..399658 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0340" /product="unknown protein" /note="Mb0340, -, len: 124 aa. Equivalent to Rv0333, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Hypothetical unknown protein. Protein product from Mb0340 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0340 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248046.1" /translation="MTTSEIATVLAWHDALNAADIETLVALSTDDIDIGDAHGAVQGH DALRGWASSLTTTAELGRMYVHHGVVVVEQKITSGEDPGIARTGAAAFRVVQDHVASV FRHEDLASALAATELTEDDLVD" CDS 399688..400554 /codon_start=1 /transl_table=11 /gene="rmlA" /locus_tag="BQ2027_MB0341" /product="alpha-d-glucose-1-phosphate thymidylyltransferase rmla (dtdp-glucose synthase) (dtdp-glucose pyrophosphorylase)" /note="Mb0341, rmlA, len: 288 aa. Equivalent to Rv0334, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 288 aa overlap). Probable rmlA (alternate gene name: rfbA), glucose-1-phosphate thymidylyl-transferase (EC 2.7.7.24), equivalent to CAC32020.1|AL583925 glucose-1-phosphate thymidyltransferase from Mycobacterium leprae (288 aa). Also highly similar to others e.g. AAG29804.1|AF235050 glucose-1-phosphate thymidylyltransferase from Streptomyces rishiriensis (296 aa); RBA1_ECOLI|P37744 glucose-1-phosphate thymidylyltransferase from Escherichia coli strain K12 (293 aa), FASTA scores: opt: 1199, E(): 0, (62.0% identity in 284 aa overlap). BELONGS TO THE GLUCOSE-1-PHOSPHATE THYMIDYLYLTRANSFERASE FAMILY. Protein product from Mb0341 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0341 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248047.1" /translation="MRGIILAGGSGTRLYPITMGISKQLLPVYDKPMIYYPLTTLMMA GIRDIQLITTPHDAPGFHRLLGDGAHLGVNISYATQDQPDGLAQAFVIGANHIGADSV ALVLGDNIFYGPGLGTSLKRFQSISGGAIFAYWVANPSAYGVVEFGAEGMALSLEEKP VTPKSNYAVPGLYFYDNDVIEIARGLKKSARGEYEITEVNQVYLNQGRLAVEVLARGT AWLDTGTFDSLLDAADFVRTLERRQGLKVSIPEEVAWRMGWIDDEQLVQRARALVKSG YGNYLLELLERN" CDS complement(400565..401080) /codon_start=1 /transl_table=11 /gene="PE6" /locus_tag="BQ2027_MB0342C" /product="pe family protein pe6" /note="Mb0342c, PE6, len: 171 aa. Equivalent to Rv0335c, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 171 aa overlap). Member of the Mycobacterium tuberculosis PE family (see first citation below); contains short region of similarity to part of the unique N-terminus of the Mycobacterium tuberculosis PGRS family of Glycine-rich proteins e.g. Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 219, E(): 1.1e-08, (51.5% identity in 66 aa overlap). Mb0342c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248048.1" /translation="MRSMGFLHRACRAPSSLPAPLMARPGRSVLARPAATPPGPLCAT TRPRPPQGNQPPASRISNFPPKRHKTRVLAAAEDEVSAAVAALISAHGRRHHSLNNQA AAFHGQFAQNLNVGAGSCASAETTADAPTQALLGPADRQRRQRRAVRQWLVRWAAHPG RATRGFHNHRQ" CDS 401222..402733 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0343" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb0343, -, len: 503 aa. Equivalent to Rv0336, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 503 aa overlap). Part of Mycobacterium tuberculosis 13E12 repeat family; almost identical to Rv0515|MTCY20G10.05 hypothetical protein from M. tuberculosis FASTA scores: (99.8% identity in 503 aa overlap), possibly due to a recent gene duplication. Also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1148c, Rv1945, etc. Mb0343 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248049.1" /translation="MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAA AQLVALGELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAMRE RLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVARWPSMTKARLA GQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIGGSLLAVDAHALDARLSAL AGTVCEHDPRSREQRRADALGALAGGADRLGCGCGRADCAAGKRPAAPPVVIHLIAEA ATINGTGSAPASQMNADGLITAELVAELAKTATLVPLVHPGDAPPEPGYAPSKALADF VRCRDLTCRWPGCDEPATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQ QLPDGTLILTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDHNDDPPPF" CDS complement(402903..404192) /codon_start=1 /transl_table=11 /gene="aspC" /locus_tag="BQ2027_MB0344C" /product="PROBABLE ASPARTATE AMINOTRANSFERASE ASPC (TRANSAMINASE A) (ASPAT)" /note="Mb0344c, aspC, len: 429 aa. Equivalent to Rv0337c, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 429 aa overlap). Probable aspC, aspartate aminotransferase (transaminase A) (EC 2.6.1.1), equivalent to CAC32019.1|AL583925 probable aspartate aminotransferase from Mycobacterium leprae (437 aa). Also highly similar to many e.g. Q48143|U32823 aspartate aminotransferase (404 aa), FASTA scores: opt: 1646, E(): 0, (57.2% identity in 404 aa overlap). Also some similarity to Rv3565|MTCY06G11.12 from Mycobacterium tuberculosis FASTA score: (27.2% identity in 383 aa overlap). BELONGS TO CLASS-I OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb0344c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0344c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248050.1" /translation="MDNDGTIVDVTTHQLPWHTASHQRQRAFAQSAKLQDVLYEIRGP VHQHAARLEAEGHRILKLNIGNPAPFGFEAPDVIMRDIIQALPYAQGYSDSQGILSAR RAVVTRYELVPGFPRFDVDDVYLGNGVSELITMTLQALLDNGDQVLIPSPDYPLWTAS TSLAGGTPVHYLCDETQGWQPDIADLESKITERTKALVVINPNNPTGAVYSCEILTQM VDLARKHQLLLLADEIYDKILYDDAKHISLASIAPDMLCLTFNGLSKAYRVAGYRAGW LAITGPKEHASSFIEGIGLLANMRLCPNVPAQHAIQVALGGHQSIEDLVLPGGRLLEQ RDIAWTKLNEIPGVSCVKPAGALYAFPRLDPEVYDIDDDEQLVLDLLLSEKILVTQGT GFNWPAPDHLRLVTLPWSRDLAAAIERLGNFLVSYRQ" CDS complement(404223..406871) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0345C" /product="probable iron-sulfur-binding reductase" /note="Mb0345c, -, len: 882 aa. Equivalent to Rv0338c, len: 882 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 882 aa overlap). Probable iron-sulphur-binding reductase (EC 1.-.-.-), possibly membrane-bound, equivalent to CAC32018.1|AL583925 probable iron-sulphur-binding reductase from Mycobacterium leprae (880 aa). Also highly similar to others e.g. T36608|5019323|CAB44376.1|AL078610 probable iron-sulfur-binding reductase from Streptomyces coelicolor (760 aa), FASTA scores: opt: 1658, E(): 0, (49.9% identity in 772 aa overlap); BAB07521.1|AP001520 iron-sulphur-binding reductase from Bacillus halodurans (700 aa). Contains PS00070 Aldehyde dehydrogenases cysteine active site and two of PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. First of several possible start sites chosen. Protein product from Mb0345c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0345c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248051.1" /translation="MTTQTLIRLILGMSMTAVVGVFALRRVWWLYKLVMSGQPASGRT DNLGTRIWTQISEVLGQRRLLKWSIPGLAHFFTMWGFFILLTVYIEAYGLLFEERFHI PVIGRWDALGFLQDFFATAVFLGITTFAIIRILRNPREIGRSSRFYGSHNGGAWLVLL MIFNVIWTYVLVRGSAVNNGTLPYGNGAFLSQLFGAILRPLGQPANEIIETTALLLHI GVMLAFLILVLHSKHLHIFLAPINVTFKRLPDGLGPLLPLEADGKPIDFENPSEDAVF GRGKIEDFTWKGMLDFATCTECGRCQSQCPAWNTGKPLSPKLVIMDLRDHWMAKAPYI LGQKDASAGGEAGHQEHHHVPESGFGRVPGHGPEQATRPLVGTEEQGGVIDPDVLWSC VTCGACVEQCPVDIEHVDHIVDMRRYQVMMESEFPSELSVLFKNLETKGNPWGQNASD RTNWIDEVDFDVPVYGQDVDSFDGYEYLFWVGCAGAYDDKAKKTTKAVAELLAVAGVK YLVLGAGETCNGDSARRSGNEFLFQQLAQQAVETLDGLFEGVETVDRKIVVTCPHCFN TIGKEYRQLGANYTVLHHTQLLNRLVRDKRLVPVTPVSQDITYHDPCYLGRHNKVYEA PRELIGAAGASLTEMPRHADRSFCCGAGGARMWMEEHIGKRINHERVDEALATDATAI ATACPFCRVMVTDGVNDRQEEAGRSGVEVLDVAQVLLGSLDHDKAQLPAKGTAAKQAQ ERAPKAAPKAAAPVTPVEAPAEAPQAPAPAAPAAPVKGLGMAAGAKRPGAKKAAPTPA APAAPAAPVKGLGIAAGAKRPGAKKTPPPAPGLAEPAAQPQPEAKPQPEPAAPPKPQT DGDPAAPAAPVKGLGIARGARPPGKR" CDS complement(406980..409478) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0346C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0346c, -, len: 832 aa. Equivalent to Rv0339c, len: 832 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 832 aa overlap). Possible transcriptional regulator, showing very weak similarity with parts of others. Contains PS00017 ATP/GTP-binding site motif A (P-loop); and probable helix-turn helix motif from aa 778-799 (Score 1041, +2.73 SD). Protein product from Mb0346c detected using SWATH mass spectrometry. Mb0346c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248052.1" /translation="MQHRGCKNRGQAYDASVTDSLTEVPPAARRALLELANAPTVPVK VLITGGIGTGKTTVLAAARDTLRRSGLTVLACPPPDGEPPETALVIDDAQLLTDTELL RLTERVADSRLTVVAAAEAREHHRALRALTMALERDRPRISLGPLPVAEHLRDCTAGL PFLIHAVSARAQAPAQAAKVALIERLRRLDEPTLDTLLMMSLTHELGVSDVAAALGIS VTDARGLVDRAHASGLIESSHTAAFLQSVHDAIAQIVGNAHHHEVETSLLRSQLDISP VSAELALRLAEHGLRDERLADILTRYAADTRDASVRCARLYRAAVHAGAKGLTVRLAD ALARTGDCTAAATLADDLLSSPDATERAAAVRVAASVAVHDGNTGHAAELFGWLGPHP DTMVSSAATIVFAANGDLATARATLRLKDAGPPTMAARCARNLAEGLLLTMDQPYPVA MAKLGQAIATEQSLSQVIPDSPAALVTLAAIHAGDPVRARSVIGRAVRAGADPLFQRR HLLLSGWIKMQEGQLPSASADVAAASAGTHLHRRDALWAAALQTAISRRTGDIGALQQ HWYAAMEALAEYSLDLFALLPLGELWVAAARMRQVDQLQHTLDQALTLLDSLGNPALW SNSLHWAGVHAGILANSPESVAPHGQALGAMVAHSTLAQALSDAGRTWLRVLAENVDA DEVTAAARSLSHVGLTSDATRLAGQAALQTSDARVSGAMLQLARDLKLGNDFGEPPSG AGDTEPASGTPPAPRQPPAGSPLSDREREVAELLLLGMPYRDIGARLFISAKTVEHHV ARIRQRLGAGSRSEMLSMLRAMLAPESLTADERR" CDS 409664..410203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0347" /product="conserved protein" /note="Mb0347, -, len: 179 aa. Equivalent to Rv0340, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 179 aa overlap). Conserved hypothetical protein; MEME-MAST analysis shows similarity to product of downstream gene, Rv0341|iniB. Mb0347 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248053.1" /translation="MANSLLDFVISLVRDPEAAARYAANPERSIAEAHLTDVTRADVN SLIPVVSDSLSMSEPIGAAGGAHAGDRGNVWASGAATAALDAFAPHADAGVVQQHGAV GSVLNQPTPPGPGVTPTDPRPFRAGPHETSALLTSAEIPDTTSEDGGLPTDHPAVWNH PVVDPHTVEPDHHGYDIHG" CDS 410392..411831 /codon_start=1 /transl_table=11 /gene="iniB" /locus_tag="BQ2027_MB0348" /product="ISONIAZID INDUCTIBLE GENE PROTEIN INIB" /note="Mb0348, iniB, len: 479 aa. Equivalent to Rv0341, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). iniB, isoniazid-inducible gene, (see citations below). Protein very Gly-, Ala-rich, similar to cell wall proteins e.g. P27483|GRP_ARATH GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN from A.thaliana (338 aa), FASTA scores: opt: 532, E(): 5.2e-13, (39.3% identity in 321 aa overlap). MEME-MAST analysis shows similarity to product of upstream gene, Rv0340. Protein product from Mb0348 detected using shotgun mass spectrometry. Mb0348 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248054.1" /translation="MTSLIDYILSLFRSEDAARSFVAAPGRAMTSAGLIDIAPHQISS VAANVVPGLNLGAGDPMSGLRQAVAARHGFAQDVANVGFAGDAGAGVASVITTDVGAG LASGLGAGFLGQGGLALAASSGGFGGQVGLAAQVGLGFTAVIEAEVGAQVGAGLGIGT GLGAQAGMGFGGGVGLGLGGQAGGVIGGSAAGAIGAGVGGRLGGNGQIGVAGQGAVGA GVGAGVGGQAGIASQIGVSAGGGLGGVGNVSGLTGVSSNAVLASNASGQAGLIASEGA ALNGAAMPHLSGPLAGVGVGGQAGAAGGAGLGFGAVGHPTPQPAALGAAGVVAKTEAA AGVVGGVGGATAAGVGGAHGDILGHEGAALGSVDTVNAGVTPVEHGLVLPSGPLIHGG TGGYGGMNPPVTDAPAPQVPARAQPMTTAAEHTPAVTQPQHTPVEPPVHDKPPSHSVF DVGHEPPVTHTPPAPIELPSYGLFGLPGF" CDS 411868..413790 /codon_start=1 /transl_table=11 /gene="iniA" /locus_tag="BQ2027_MB0349" /product="ISONIAZID INDUCTIBLE GENE PROTEIN INIA" /note="Mb0349, iniA, len: 640 aa. Equivalent to Rv0342, len: 640 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 640 aa overlap). iniA, isoniazid-inducible gene, (see citations below). Shows slight similarity to some hypothetical bacterial proteins e.g. P40983|YOR6_THER hypothetical protein (402 aa), FASTA scores: opt: 242, E(): 1.4e-07, (22.3% identity in 349 aa overlap). Also some similarity to downstream ORF Rv0343|iniC. Possible transmembrane stretch around residue 490. Alternative translational start at 410824. Contains a phosphopantetheine attachment site motif suggestive of an acyl carrier protein. Note that the iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid. Protein product from Mb0349 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0349 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248055.1" /translation="MVPAGLCAYRDLRRKRARKWGDTVTQPDDPRRVGVIVELIDHTI AIAKLNERGDLVQRLTRARQRITDPQVRVVIAGLLKQGKSQLLSSLLNLPAARVGDDE ATVVITVVSYSAQPSARLVLAAGPDGTTAAVDIPVDDISTDVRRAPHAGGREVLRVEV GAPSPLLRGGLAFIDTPGVGGLGQPHLSATLGLLPEADAVLVVSDTSQEFTEPEMWFV RQAHQICPVGAVVATKTDLYPRWREIVNANAAHLQRARVPMPIIAVSSLLRSHAVTLN DKELNEESNFPAIVKFLSEQVLSRATERVRAGVLGEIRSATEQLAVSLGSELSVVNDP NLRDRLASDLERRKREAQQAVQQTALWQQVLGDGFNDLTADVDHDLRTRFRTVTEDAE RQIDSCDPTAHWAEIGNDVENAIATAVGDNFVWAYQRSEALADDVARSFADAGLDSVL SAELSPHVMGTDFGRLKALGRMESKPLRRGQKMIIGMRGSYGGVVMIGMLSSVVGLGL FNPLSVGAGLILGRMAYKEDKQNRLLRVRSEAKANVRRFVDDISFVVSKQSRDRLKMI QRLLRDHYREIAEEITRSLTESLQATIAAAQVAETERDNRIRELQRQLGILSQVNDNL AGLEPTLTPRASLGRA" CDS 413787..415268 /codon_start=1 /transl_table=11 /gene="iniC" /locus_tag="BQ2027_MB0350" /product="ISONIAZID INDUCTIBLE GENE PROTEIN INIC" /note="Mb0350, iniC, len: 493 aa. Equivalent to Rv0343, len: 493 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 493 aa overlap). iniC, isoniazid-inducible gene, (see citations below). Shows slight similarity to P40983|YOR6_THER8 hypothetical protein (402 aa), FASTA scores: opt: 196, E(): 2.6e-05, (25.9% identity in 228 aa overlap). Also some similarity to upstream ORF Rv0342|iniA. Contains (PS00017) ATP/GTP-binding site motif A (P-loop). Note that the iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid. Protein product from Mb0350 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0350 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248056.1" /translation="MSTSDRVRAILHATIQAYRGAPAYRQRGDVFCQLDRIGARLAEP LRIALAGTLKAGKSTLVNALVGDDIAPTDATEATRIVTWFRHGPTPRVTANHRGGRRA NVPITRRGGLSFDLRRINPAELIDLEVEWPAEELIDATIVDTPGTSSLACDASERTLR LLVPADGVPRVDAVVFLLRTLNAADVALLKQIGGLVGGSVGALGIIGVASRADEIGAG RIDAMLSANDVAKRFTRELNQMGICQAVVPVSGLLALTARTLRQTEFIALRKLAGAER TELNRALLSVDRFVRRDSPLPVDAGIRAQLLERFGMFGIRMSIAVLAAGVTDSTGLAA ELLERSGLVALRNVIDQQFAQRSDMLKAHTALVSLRRFVQTHPVPATPYVIADIDPLL ADTHAFEELRMLSLLPSRATTLNDDEIASLRRIIGGSGTSAAARLGLDPANSREAPRA ALAAAQHWRRRAAHPLNDPFTTRACRAAVRSAEAMVAEFSARR" CDS complement(415411..415971) /codon_start=1 /transl_table=11 /gene="lpqJ" /locus_tag="BQ2027_MB0351C" /product="PROBABLE LIPOPROTEIN LPQJ" /note="Mb0351c, lpqJ, len: 186 aa. Equivalent to Rv0344c, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 186 aa overlap). Probable lipoprotein, without homology. Has an appropriately positioned prokaryotic lipoprotein signature (PS00013). Protein product from Mb0351c detected using SWATH mass spectrometry." /protein_id="CAB5248057.1" /translation="MRLSLIARGMAALLAATALVAGCNTTIDGRPVASPGSGPTEPTF PTPRPTTAPPGTTAPTLPTTPVSPTAPAGAIPLPPDSNGYVFIETKSGMTRCQINRDS VGCEAPFTNSPLRDGEHANGIHITAGGSVQWVLGNLGAIPTVSIDYRTYEAQGWTIDA TTDGTRFTNNRTGHGMFVSIEKVDTF" CDS 416217..416489 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0353" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0353, -, len: 90 aa. Equivalent to 3' end of Rv0345, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. AL13282 4|SCAH10_9 hypothetical protein from Streptomyces coelicolor (207 aa), FASTA scores: opt: 188, E(): 1.5e-05, (41.0% identity in 117 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) leads to a shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb0353 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248058.1" /translation="MVLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADY AVLHVIDTPDVNAKVVARVLGRALVSRSGLAGRGRIPAHSARRRGC" CDS complement(416531..417994) /codon_start=1 /transl_table=11 /gene="ansP2" /locus_tag="BQ2027_MB0354C" /standard_name="aroP2" /product="POSSIBLE L-ASPARAGINE PERMEASE ANSP2 (L-ASPARAGINE TRANSPORT PROTEIN)" /note="Mb0354c, ansP2, len: 487 aa. Equivalent to Rv0346c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 487 aa overlap). Possible ansP2, L-asparagine permease, integral membrane protein belonging to family containing many amino acid permeases, highly similar to G467030|B2126_F2_85|NP_301937.1|NC_002677 probable L-asparagine permease from Mycobacterium leprae (498 aa); and NP_301938.1|NC_002677 probable L-asparagine permease from Mycobacterium leprae (505 aa). Also highly similar to others e.g. P77610|ANSP_ECOLI L-ASPARAGINE PERMEASE from Escherichia coli strain K-12 (499 aa). Also highly similar to ANSP1|Rv2127|MT2186|MTCY261_22|O33261 PROBABLE L-ASPARAGINE PERMEASE from Mycobacterium tuberculosis (489 aa), FASTA score: (72.1% identity in 473 aa overlap). And shows some similarity to MTCY3G12.14 from Mycobacterium tuberculosis. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY). Note that previously known as aroP2. Protein product from Mb0354c detected using SWATH mass spectrometry. Mb0354c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248059.1" /translation="MPPLDITDERLTREDTGYHKGLHSRQLQMIALGGAIGTGLFLGA GGRLASAGPGLFLVYGICGIFVFLILRALGELVLHRPSSGSFVSYAREFYGEKVAFVA GWMYFLNWAMTGIVDTTAIAHYCHYWRAFQPIPQWTLALIALLVVLSMNLISVRLFGE LEFWASLIKVIALVTFLIVGTVFLAGRYKIDGQETGVSLWSSHGGIVPTGLLPIVLVT SGVVFAYAAIELVGIAAGETAEPAKIMPRAINSVVLRIACFYVGSTVLLALLLPYTAY KEHVSPFVTFFSKIGIDAAGSVMNLVVLTAALSSLNAGLYSTGRILRSMAINGSGPRF TAPMSKTGVPYGGILLTAGIGLLGIILNAIKPSQAFEIVLHIAATGVIAAWATIVACQ LRLHRMANAGQLQRPKFRMPLSPFSGYLTLAFLAGVLILMYFDEQHGPWMIAATVIGV PALIGGWYLVRNRVTAVAHHAIDHTKSVAVVHSADPI" CDS 418333..419319 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0355" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0355, -, len: 328 aa. Equivalent to Rv0347, len: 328 aa (alternative start possible), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 328 aa overlap). Probable conserved membrane protein, similar to Rv0831c|AL022004|MTV043_23 from Mycobacterium tuberculosis (271 aa), FASTA scores: E(): 9.6e-21, (33.1% identity in 266 aa overlap). Protein product from Mb0355 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0355 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248060.1" /translation="MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGT RPRWVSFLVIVLVIMNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELAR WTPILEQEEVRQVNLETGEHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFR SIVHAMVTARQDVAPVDGCIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADL KLTTTAQRHVIQCEGPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDID SAWSDPCKGIPALDAHLVDEVAERLHTPIGPLFESLITSELRTKVLQQPGQE" CDS 419322..419975 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0356" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0356, -, len: 217 aa. Equivalent to Rv0348, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Possible transcriptional regulator, showing some similarity to O53334|RV3188|MTV014.32 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (115 aa), FASTA score: (30.0% identity in 100 aa overlap). Contains probable helix-turn helix motif from aa 89-110 (Score 1407, +3.98 SD). Protein product from Mb0356 detected using shotgun mass spectrometry." /protein_id="CAB5248061.1" /translation="MTISFSSSNLRDDATSGNGDYRLDKLPETTPSTSVFDRADVTYR QFTELHGQARDTRREAHVVELESKTGERARCAPMHALEQLADYGFAWRDIARVVGVSV PAITKWRKGAGVTGENRLKIARLLALIDMLSDRFIGEPASWLEMPIQAGVGITRMDLL ERGRYDLVLALASTHTGDGTVEYVLNETDKDWRETVVDNAFESYTAEDGVISIRPKR" CDS 419978..420637 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0357" /product="HYPOTHETICAL PROTEIN" /note="Mb0357, -, len: 219 aa. Equivalent to Rv0349, len: 219 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 219 aa overlap). Hypothetical unknown protein. Protein product from Mb0357 detected using SWATH mass spectrometry. Mb0357 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248062.1" /translation="MPELETPDDPESIYLARLEDVGEHRPTFTGDIYRLGDGRMVMIL QHPCALRHGVDLHPRLLVAPVRPDSLRSNWARAPFGTMPLPKLIDGQDHSADFINLEL IDSPTLPTCERIAVLSQSGVNLVMQRWVYHSTRLAVPTHTYSDSTVGPFDEADLIEEW VTDRVDDGADPQAAEHECASWLDERISGRTRRALLSDRQHASSIRREARSHRKSVKLA D" CDS 420864..422741 /codon_start=1 /transl_table=11 /gene="dnaK" /locus_tag="BQ2027_MB0358" /standard_name="hsp70" /product="PROBABLE CHAPERONE PROTEIN DNAK (HEAT SHOCK PROTEIN 70) (HEAT SHOCK 70 KDA PROTEIN) (HSP70)" /note="Mb0358, dnaK, len: 625 aa. Equivalent to Rv0350, len: 625 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 625 aa overlap). Probable DnaK (alternate gene name: hsp70), 70 kDa heat shock protein (see citations below), equivalent to AAA25362.1|M95576|1924344A|738248 heat shock protein 70 from Mycobacterium leprae (621 aa); and DNAK_MYCPA|Q00488 (623 aa), FASTA scores: opt: 3678, E(): 0, (92.3% identity in 625 aa overlap). Also highly similar to others e.g. Q05558|DNAK_STRCO|453231|CAA54606.1|X77458 CHAPERONE PROTEIN DNAK from Streptomyces coelicolor (618 aa). Has probably an ATPase activity (EC 3.6.1.-). Note that this sequence differs from DNAK_MYCTU|P32723 (609 aa), due to a frameshift near the N-terminus. BELONGS TO THE HEAT SHOCK PROTEIN 70 FAMILY. Protein product from Mb0358 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0358 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248063.1" /translation="MARAVGIDLGTTNSVVSVLEGGDPVVVANSEGSRTTPSIVAFAR NGEVLVGQPAKNQAVTNVDRTVRSVKRHMGSDWSIEIDGKKYTAPEISARILMKLKRD AEAYLGEDITDAVITTPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKG EKEQRILVFDLGGGTFDVSLLEIGEGVVEVRATSGDNHLGGDDWDQRVVDWLVDKFKG TSGIDLTKDKMAMQRLREAAEKAKIELSSSQSTSINLPYITVDADKNPLFLDEQLTRA EFQRITQDLLDRTRKPFQSVIADTGISVSEIDHVVLVGGSTRMPAVTDLVKELTGGKE PNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTRLIERNTTI PTKRSETFTTADDNQPSVQIQVYQGEREIAAHNKLLGSFELTGIPPAPRGIPQIEVTF DIDANGIVHVTAKDKGTGKENTIRIQEGSGLSKEDIDRMIKDAEAHAEEDRKRREEAD VRNQAETLVYQTEKFVKEQREAEGGSKVPEDTLNKVDAAVAEAKAALGGSDISAIKSA MEKLGQESQALGQAIYEAAQAASQATGAAHPGGEPGGAHPGSADDVVDAEVVDDGREA K" CDS 422738..423445 /codon_start=1 /transl_table=11 /gene="grpE" /locus_tag="BQ2027_MB0359" /product="PROBABLE GRPE PROTEIN (HSP-70 COFACTOR)" /note="Mb0359, grpE, len: 235 aa. Equivalent to Rv0351, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 235 aa overlap). Probable grpE protein (HSP-70 COFACTOR), equivalent to CAC32012.1|AL583925 Hsp70 cofactor from Mycobacterium leprae (229 aa). Also highly similar to others eg Q05562|GRPE_STRCO|2127521|PN0643 GRPE PROTEIN from Streptomyces coelicolor (225 aa). Contains grpE protein signature (PS01071). BELONGS TO THE GRPE FAMILY. Protein product from Mb0359 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0359 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248064.1" /translation="MTDGNQKPDGNSGEQVTVTDKRRIDPETGEVRHVPPGDMPGGTA AADAAHTEDKVAELTADLQRVQADFANYRKRALRDQQAAADRAKASVVSQLLGVLDDL ERARKHGDLESGPLKSVADKLDSALTGLGLVAFGAEGEDFDPVLHEAVQHEGDGGQGS KPVIGTVMRQGYQLGEQVLRHALVGVVDTVVVDAAELESVDDGTAVADTAENDQADQG NSADTLGEQAESEPSGS" CDS 423481..424668 /codon_start=1 /transl_table=11 /gene="dnaJ1" /locus_tag="BQ2027_MB0360" /standard_name="dnaJ" /product="PROBABLE CHAPERONE PROTEIN DNAJ1" /note="Mb0360, dnaJ1, len: 395 aa. Equivalent to Rv0352, len: 395 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 395 aa overlap). Probable DnaJ1, chaperone protein, equivalent to AAA25363.1|M95576 DNA J heatshock protein from Mycobacterium leprae (389 aa). Also highly similar to others. Contains both DnaJ signatures (PS00636, and PS00637). BELONGS TO THE DNAJ FAMILY. COFACTOR: BINDS TWO ZINC IONS PER MONOMER. Note that sequence differs from DNAJ_MYCTU|P07881 due to a frameshift at the N-terminus. Note that previously known as dnaJ. Protein product from Mb0360 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0360 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248065.1" /translation="MAQREWVEKDFYQELGVSSDASPEEIKRAYRKLARDLHPDANPG NPAAGERFKAVSEAHNVLSDPAKRKEYDETRRLFAGGGFGGRRFDSGFGGGFGGFGVG GDGAEFNLNDLFDAASRTGGTTIGDLFGGLFGRGGSARPSRPRRGNDLETETELDFVE AAKGVAMPLRLTSPAPCTNCHGSGARPGTSPKVCPTCNGSGVINRNQGAFGFSEPCTD CRGSGSIIEHPCEECKGTGVTTRTRTINVRIPPGVEDGQRIRLAGQGEAGLRGAPSGD LYVTVHVRPDKIFGRDGDDLTVTVPVSFTELALGSTLSVPTLDGTVGVRVPKGTADGR ILRVRGRGVPKRSGGSGDLLVTVKVAVPPNLAGAAQEALEAYAAAERSSGFNPRAGWA GNR" CDS 424668..425048 /codon_start=1 /transl_table=11 /gene="hspR" /locus_tag="BQ2027_MB0361" /product="probable heat shock protein transcriptional repressor hspr (merr family)" /note="Mb0361, hspR, len: 126 aa. Equivalent to Rv0353, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Probable hspR, heat shock regulatory protein, merR family, highly similar to others e.g. HspR|P40183 heat shock regulatory protein from Streptomyces coelicolor (151 aa), FASTA scores: E(): 4.9e-22, (55.7% identity in 140 aa overlap), that binds to three inverted repeats (IR1-IR3) in the promoter region of the dnaK operon. Has possible coiled coil region in C-terminal half. BELONGS TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb0361 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248066.1" /translation="MAKNPKDGESRTFLISVAAELAGMHAQTLRTYDRLGLVSPRRTS GGGRRYSLHDVELLRQVQHLSQDEGVNLAGIKRIIELTSQVEALQSRLQEMAEELAVL RANQRREVAVVPKSTALVVWKPRR" CDS complement(425173..435696) /codon_start=1 /transl_table=11 /gene="PPE8" /locus_tag="BQ2027_MB0362C" /product="ppe family protein ppe7" /note="Mb0362c, PPE8, len: 3507 aa. Equivalent to Rv0355c and Rv0354c, len: 3300 aa and 141 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 3296 aa overlap and 100.0% identity in 125 aa overlap). PPE8, member of the Mycobacterium tuberculosis PPE family, similar to others e.g. AL009198|MTV004_5 from M. tuberculosis (3716 aa), FASTA scores: opt: 2906, E(): 0, (40.9% identity in 3833 aa overlap); MTV004_3 FASTA scores: (39.0% identity in 3531 aa overlap); etc. Gene contains large number of clustered Major Polymorphic Tandem Repeats (MPTR). Related to MTCY13E10.16c, E(): 0; MTCY13E10.17c, E(): 0; MTCY48.17, E(): 0; MTCY98.0034c, E(): 0; MTCY03C7.23 E(): 0; MTCY98.0031c, E(): 0; MTCY31.06c, E(): 5.6e-17; MTCY359.33, E(): 2.3e-16. PPE7, member of the Mycobacterium tuberculosis PPE family, similar to others e.g. MTCY63_9 from Mycobacterium tuberculosis (2411 aa), FASTA scores: E(): 3.6e-11, (47.6% identity in 103 aa overlap). Possible continuation of ORF upstream, but no sequence error apparent. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE7 and PPE8 exist as 2 genes. In Mycobacterium bovis, a 2 bp insertion (*-ta) resulting in the absence of a stop codon between the 2 genes, leads to a single product. Mb0362c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248067.1" /translation="MSFAVLPPEINSARLYVGAGLAPMLDAAAAWDGLADELGSAAAS FSAVTAGLAGSSWLGAASTAMTGAAAPYLGWLSAAAAQAQQAATQTRLAAAAFEAALA ATVHPAIISANRALFVSLVVSNLLGQNAPAIAATEAAYEQMWAQDVAAMFGYHAGASA AVSALTPFGQALPTVAGGGALVSAAAAQVTTRVFRNLGLANVGEGNVGNGNVGNFNLG SANIGNGNIGSGNIGSSNIGFGNVGPGLTAALNNIGFGNTGSNNIGFGNTGSNNIGFG NTGDGNRGIGLTGSGLLGFGGLNSGTGNIGLFNSGTGNVGIGNSGTGNWGIGNSGNSY NTGFGNSGDANTGFFNSGIANTGVGNAGNYNTGSYNPGNSNTGGFNMGQYNTGYLNSG NYNTGLANSGNVNTGAFITGNFNNGFLWRGDHQGLIFGSPGFFNSTSAPSSGFFNSGA GSASGFLNSGANNSGFFNSSSGAIGNSGLANAGVLVSGVINSGNTVSGLFNMSLVAIT TPALISGFFNTGSNMSGFFGGPPVFNLGLANRGVVNILGNANIGNYNILGSGNVGDFN ILGSGNLGSQNILGSGNVGSFNIGSGNIGVFNVGSGSLGNYNIGSGNLGIYNIGFGNV GDYNVGFGNAGDFNQGFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNIASGWNSGTG NSGLFNSGTNNVGIFNAGTGNVGIANSGTGNWGIGNPGTDNTGILNAGSYNTGILNAG DFNTGFYNTGSYNTGGFNVGNTNTGNFNVGDTNTGSYNPGDTNTGFFNPGNVNTGAFD TGDFNNGFLVAGDNQGQIAIDLSVTTPFIPINEQMVIDVHNVMTFGGNMITVTEASTV FPQTFYLSGLFFFGPVNLSASTLTVPTITLTIGGPTVTVPISIVGALESRTITFLKID PAPGIGNSTTNPSSGFFNSGTGGTSGFQNVGGGSSGVWNSGLSSAIGNSGFQNLGSLQ SGWANLGNSVSGFFNTSTVNLSTPANVSGLNNIGTNLSGVFRGPTGTIFNAGLANLGQ LNIGSANLGDFNLGSGNVGSFNVFSGNQGSYNIGPANLGNYNIGFANLGNYNIGFGNA GDFNQGFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGTANIGLFNSGTN NVGIGNSGTGNWGIGNSGSGNTGIGNTGSTNTGFFNTGIVNTGVANAGSYNTGWYNTG DTNTGIANLGDFNTGFYNTGNFSTGFANQGDIATGAFITGDMGNGAFWRGDQQGLFSA GYRVHVPEIPAHVTVEVPVNIPITASFTNTVYSGITLEQINFGFTIDIAGIPLLAGAI SKAVLPPITGTGPAITVNIGDPGGSTAIRIPATASVGPFDVTFVNIAATTGFFNATTD PSSGFFNGGPGTVSGIANIGANISGFQNVANSATSGFNNYGSLQSGLANLGDTVSGVF NTGIGAPANVSGMFNIGSNLAGFFHDQATGMSMFNLGLGNIGQFNVGFSNVGDSNAGL ANIGSFNLGSGNLGSFNVFGGNQGSYNIGPANLGNYNIGLGNLGSYNFGFGNAGDFNL GFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGSGNSGLFNSGTNNIGLF NSGTGNIGIGNSGTGNWGIANTGDTNTGIFNTGDVNTGLLNAGNVNTGIFNTGHYNTG SFNAGSFNTAGFNPGSYNTGYLNTGSYNTGLANSGDVNTGGFITGNYSNGFWWRGDYQ GLAGISQTITVPDTAVPVKLHVPIFLDIPVTGTLGTFTVHGFRFPEITGDIFLIGIPF NAATLDAFSFPNISIVLPNIGINLGSGPDPLIDIAGTGGLLPIKIPLIDIPAAPGFGN STTTPSSGFFNAGTGTVSGVGNVGSNSSGFFDLTSGSSGISGVQNFGELISGGFNFGN TVSGLVNASTLGLSMPANLSGGGNVGATVAGFVNNTQILNLGFGNVGSGNVGHGNIGD SNVGLGNLGNANVGHGNIGSFNVFSGNRGSYNIGPANLGNYNIGLGNLGSYNFGFGNA GDFNLGFANSGSNNIGFANTGNNNIGIGLSGHNQQGFGSWNSGTANTGLFNSGTNNIG LFNSGTGNIGIGNSGIGNTGIGNPGVGNTGLGNSGTGNWGLWNPGTGNMGVANVGTYN TGGYNVGSTNTGIANVGIANTGSYNTGSTNTGSFNDGDFNTGFYNTGDYNTGFYNTGD VNTGAFIGGNFSNGAFWQSDHQGQWGAHYAITVPQIPLLNFSLNIPVNIPIHLDFGTL AVNGFQIPAITLRALGVTHFSVGPIIVPRIAGTLPVIDINIGDPGGSSSIPITITSGA GPVVIPLLDIPPAPGFGNSTTGPSSGFFNSGTGSSSGFGNVGANNSGFWNTAFAGIGN SGLQNFGSLQSGWANLGNTVSGFYNTSAADFATPANLSGLSNVGADLTGVLRGPNGST FNAGLANLGQFNVGSANLGSANLGSANLGNSNVGFGNIGNANIGGANIGDFNVGIANT GPGLTAAVNNIGIGNTGNYNIGVGNTGNYNIGFGNTGNNNIGIGLSGDNQIGFGPLNA GIANMGLFNLGDNNFGMANAGNFNQGIANTGNNNIGLFNTGNNNVGIGLTGDGLSGFS SLNSGAGNTGFFNSGTANTGLFNSGTGNTGLFNSGTGNVGIGNMGTGGFGVGLSGDSQ VGIGGTNSGSFNIGLFNSGTGNVGIGNSGTGNVGIGNTGTGNTGIGNSGNYNTGLLNA GLVNTGIANPGNHNTGLFNIGTFNTGIANPGHYNTGSYNTGSYNTGMANAGDYGTGAF ITGSMNNGLLWRADRQGLLAANYTITIERPAAFLNVDIPVNIPITGDITNVSIPAITF PRIDASGSVDIGILSGTVLAPVGPITLHGGDASAPLDTPIEIDFGPSPAINLNIGKPD GSTVINIVGGAGAGPISIPIIDLRPAPGFFNATTGPSSGFLNWGAGSASGLLNFGNNS GLYNFATSSMGNSGFQNYGSLQSGWANLGNSISGIYNTGLGAPANVSGLLNIGTNLAG WLQNGPTETTFSVGLANLGFWNLGSANIGNYNLGSANIGVYNLGSANIGDFNLGSANI GDFNLGSANIGSSNIGFGNVGPGLTAAIGNIGFGNTGNGNIGIGNTGTGNIGFGNTGN GNIGIGLTGDTMTGFGGWNSGTGNIGLFNSGTGNIGFGNSGTGNWGIGNSGDYNTGIG NTGSTNSGFFNTGLVNTGIGNSGDYNTGLFNAGNTNTGSFNPGDYNTGGFNPGNYNTG YFNPGNSNTGFANSGDVNTGAFNSGNYSNGFFWRGDYQGLGGFAYQSAVSEIPWSYDI GSNIEIPIEGDINAITQDAFTIDEFEIPIKLRVSVCVIYIPFKGCVKHVSVTIPITTE HLGPYEIDASTINPDQPIDTAFTQTLDFAGSGTVGAFPFGFGWQQSPGFFNSTTTPSS GFFNSGAGGASGFLNDAAAAVSGLGNVFTETSGFFNAGGVGNSGFQNFGNLLSGWANL GNTVSGFYNTSMLDLATQALISGFGNHGARLSGILNNGSGP" CDS complement(435847..436491) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0363C" /product="Thioesterase superfamily protein Rv0356c" /note="Mb0363c, -, len: 214 aa. Equivalent to Rv0356c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Conserved hypothetical protein, equivalent to AL023514|MLCB4_12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (218 aa), FASTA scores: opt: 1067, E(): 0, (73.4% identity in 214 aa overlap). Protein product from Mb0363c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0363c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248068.1" /translation="MTDASVHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQ DLAVAADPGDAVWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPWTV TRYGTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRPISRTAFLHVD YRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLAEGNGLMVRLLPGQP" CDS complement(436488..437786) /codon_start=1 /transl_table=11 /gene="purA" /locus_tag="BQ2027_MB0364C" /product="PROBABLE ADENYLOSUCCINATE SYNTHETASE PURA (IMP--ASPARTATE LIGASE) (ADSS) (AMPSASE)" /note="Mb0364c, purA, len: 432 aa. Equivalent to Rv0357c, len: 432 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 432 aa overlap). Probable purA, adenylosuccinate synthase (EC 6.3.4.4), equivalent to AL023514|MLCB4_13 from ADENYLOSUCCINATE SYNTHETASE Mycobacterium leprae (432 aa), FASTA scores: opt: 2555, E(): 0, (87.9% identity in 431 aa overlap). Also highly similar to many bacterial adenylosuccinates synthetases e.g. P12283|PURA_ECOLI adenylosuccinates synthetase from Escherichia coli (431 aa), FASTA scores: E(): 0, (51.1% identity in 425 aa overlap); etc. BELONGS TO THE ADENYLOSUCCINATE SYNTHETASE FAMILY. Protein product from Mb0364c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0364c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248069.1" /translation="MPAIVLIGAQWGDEGKGKATDLLGGRVQWVVRYQGGNNAGHTVV LPTGENFALHLIPSGVLTPGVTNVIGNGVVIDPGVLLNELRGLQDRGVDTAKLLISAD AHLLMPYHIAIDKVTERYMGSKKIGTTGRGIGPCYQDKIARIGIRVADVLDPEQLTHK VEAACEFKNQVLVKIYNRKALDPAQVVDALLEQAEGFKHRIADTRLLLNAALEAGETV LLEGSQGTLLDVDHGTYPYVTSSNPTAGGAAVGSGIGPTRIGTVLGILKAYTTRVGSG PFPTELFDEHGEYLSKTGREFGVTTGRRRRCGWFDAVIARYAARVNGITDYFLTKLDV LSSLESVPVCVGYEIDGRRTRDMPMTQRDLCRAKPVYEELPGWWEDISGAREFDDLPA KARDYVLRLEQLAGAPVSCIGVGPGREQTIVRRDVLQDRP" CDS 437877..438524 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0365" /product="conserved protein" /note="Mb0365, -, len: 215 aa. Equivalent to Rv0358, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 215 aa overlap). Conserved hypothetical protein, highly similar to AL023514|MLCB4_14 from Mycobacterium leprae (229 aa), FASTA scores: opt: 852, E(): 0, (62.9% identity in 229 aa overlap). Protein product from Mb0365 detected using SWATH mass spectrometry. Mb0365 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248070.1" /translation="MYTAENAPGVAVLLSGDADVPGPLTGLPTHQDNLDTVIGRYSRL IVVGADADLGAVLTRLLRTDRLDVEVGYVPRRRSPATRAYRLPAGRRAARRARCGVAR RVPLIRDETGSVIVGRAQWLPAEEQALIHGEAVVDDTVLFDGDVAGVCIEPTLTLPGL RAAVDGAGKWRRWIGGRAAQLGTTGAAVLRDGVAAPRPVRRSTFYRNVEGWLLVR" CDS 438535..439314 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0366" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0366, -, len: 259 aa. Equivalent to Rv0359, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 259 aa overlap). Probable conserved integral membrane protein, highly similar to hypothetical or other membrane proteins e.g. AL133220|SCC75A_6|T50569 probable membrane protein from Streptomyces coelicolor (265 aa), FASTA scores: opt: 642, E(): 0, (43.1% identity in 248 aa overlap); P70995 HYPOTHETICAL 24.7 KD PROTEIN from Bacillus subtilis (219 aa), FASTA scores: E(): 1.5e-12, (31.3% identity in 192 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142). Mb0366 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248071.1" /translation="MSETGQRESVRPSPIFLGLLGLTAVGGALAWLAGETVQPLAYAG VFVMVIAGWLVSLCLHEFGHAFTAWRFGDHDVAVRGYLTLDPRRYSHPMLSLGLPMLF IALGGIGLPGAAVYVHTWFMTTARRTLVSLAGPTVNLALAMLLLAATRLLFDPIHAVL WAGVAFLAFLQLTALVLNLLPIPGLDGYAALEPHLRPETQRALAPAKQFALVFLLVLF LAPTLNGWFFGVVYWLFDLSGVSHRLAAAGSVLTRFWSIWF" CDS complement(439319..439756) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0367C" /product="conserved protein" /note="Mb0367c, -, len: 145 aa. Equivalent to Rv0360c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 145 aa overlap). Conserved hypothetical protein, equivalent to AL023514|MLCB4_16|CAA18948.1|AL023514|MLCB4.27c hypothetical protein from Mycobacterium leprae (137 aa), FASTA scores: opt: 793, E(): 0, (85.4% identity in 137 aa overlap). And similar to AL049754|SCH10_25c|T36537 hypothetical protein from Streptomyces coelicolor (143 aa), FASTA scores: opt: 497, E(): 3.2e-27, (55.8% identity in 138 aa overlap). Protein product from Mb0367c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0367c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248072.1" /translation="MTKRTITPMTSMGDLLGPEPILLPGDSDAEAELLANESPSIVAA AHPSASVAWAVLAEGALADDKTVTAYAYARTGYHRGLDQLRRHGWKGFGPVPYSHQPN RGFLRCVAALARAAAAIGETDEYGRCLDLLDDCDPAARPALGL" CDS 439839..440666 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0368" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0368, -, len: 275 aa. Equivalent to Rv0361, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Probable conserved membrane protein (has hydrophobic stretch from residues 132-156), equivalent to AL023514|MLCB4_17|AA18949.1|AL023514 putative membrane protein from Mycobacterium leprae (292 aa), FASTA scores: opt: 1044, E(): 0, (58.6% identity in 292 aa overlap). Protein product from Mb0368 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0368 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248073.1" /translation="MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETE TVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPP RMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGK HSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSA AKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN" CDS 440888..442270 /codon_start=1 /transl_table=11 /gene="mgtE" /locus_tag="BQ2027_MB0369" /product="POSSIBLE Mg2+ TRANSPORT TRANSMEMBRANE PROTEIN MGTE" /note="Mb0369, mgtE, len: 460 aa. Equivalent to Rv0362, len: 460 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 460 aa overlap). Possible mgtE, magnesium (Mg2+) transport transmembrane protein; C-terminal region is highly similar to MGTE|G780283 putative Mg2+ transporter from Providencia stuarti (314 aa), FASTA scores: E(): 0, (47.2% identity in 307 aa overlap) (N-terminus extends approx. 150 aa further upstream compared to P. stuarti ORF). Also similar in part to others e.g. AAK20879.1|AF334760_1|AF334760 putative Mg2+ transporter from Aeromonas hydrophila (455 aa); NP_231292.1|NC_002505 magnesium transporter from Vibrio cholerae (451 aa); NP_102305.1|NC_002678 Mg2+ transport protein from Mesorhizobium loti (454 aa); etc. Also similar to Rv1232c|MTV006.04c from Mycobacterium tuberculosis (435 aa). Extended hydrophobic segment spanning last 130 residues. BELONG TO THE MGTE FAMILY. Mb0369 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248074.1" /translation="MSIRPAENSTLDIRHVIGIGTPKAVDLWLDVVTELPDRARELGS LSKAELGKLGPLLDGTNAVELFESIDDKLAAEALHAMDPSLAATFLEALDSDHAANIL REFKEPKREALLTLLPLERAMVLRGLLSWPEDCAAAHMVPETLTVRPNMTVSQAVASV RERASGLRSDARTTAYVYVTDADSHLLGVIAFRALVLANPEQRVRELMGDDLIVVSPL TDKELAAQTIMGHNLMAVPVVDADNRLLGIIAEDEAIDIAEEEATEDAERQGGSAPLE VPYLRASPWLLWRKRAVWLLVLFAAEAYTGSVLRAFSDEMEAVIALAFFIPLLIGTGG NTGTQIATTLVRAMATGQVRFRDVPAVLAKELSTGVLVGLTMAAAAVVRAWTLGVGPQ VTLTVALTVAAIVVWSSLVAAVLPPLLKKLRIDPAIVSGPMIATIVDGTGLLIYFLVA HLTLTELHGL" CDS complement(442282..443316) /codon_start=1 /transl_table=11 /gene="fba" /locus_tag="BQ2027_MB0370C" /standard_name="fda" /product="PROBABLE FRUCTOSE-BISPHOSPHATE ALDOLASE FBA" /note="Mb0370c, fba, len: 344 aa. Equivalent to Rv0363c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Probable fba (alternate gene name: fda), fructose bisphosphate aldolase (EC 4.1.2.13), equivalent to AL023514|MLCB4_18|O69600|ALF_MYCLE FRUCTOSE-BISPHOSPHATE ALDOLASE from Mycobacterium leprae (345 aa), FASTA scores: opt: 1995, E(): 0, (87.7% identity in 342 aa overlap). Also highly similar to others. BELONGS TO CLASS II FRUCTOSE-BISPHOSPHATE ALDOLASE FAMILY. COFACTOR: ZINC. Protein product from Mb0370c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0370c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248075.1" /translation="MPIATPEVYAEMLGQAKQNSYAFPAINCTSSETVNAAIKGFADA GSDGIIQFSTGGAEFGSGLGVKDMVTGAVALAEFTHVIAAKYPVNVALHTDHCPKDKL DSYVRPLLAISAQRVSKGGNPLFQSHMWDGSAVPIDENLAIAQELLKAAAAAKIILEI EIGVVGGEEDGVANEINEKLYTSPEDFEKTIEALGAGEHGKYLLAATFGNVHGVYKPG NVKLRPDILAQGQQVAAAKLGLPADAKPFDFVFHGGSGSLKSEIEEALRYGVVKMNVD TDTQYAFTRPIAGHMFTNYDGVLKVDGEVGVKKVYDPRSYLKKAEASMSQRVVQACND LHCAGKSLTH" CDS 443412..444095 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0371" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0371, -, len: 227 aa. Equivalent to Rv0364, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 227 aa overlap). Possible conserved transmembrane protein, equivalent to O69601|Y364_MYCLE|ML0287|CAA18951.1|AL023514|AL023514|MLCB 4_19 HYPOTHETICAL 24.3 KDA PROTEIN from Mycobacterium leprae (222 aa), FASTA scores: opt: 1027, E(): 0, (66.1% identity in 227 aa overlap). Shows strong similarity to DEDA_ECOLI|P09548 DedA PROTEIN protein from Escherichia coli FASTA scores: E(): 1.3e-28, (39.5% identity in 195 aa overlap). Similar also to Mycobacterium tuberculosis DedA protein Rv2637|MTCY441.0. Mb0371 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248076.1" /translation="MSTAVTAMPDILDPMYWLGANGVFGSAVLPGILIIVFIETGLLF PLLPGESLLFTGGLLSASPAPPVTIGVLAPCVALVAVLGDQTAYFIGRRIGPALFKKE DSRFFKKHYVTESHAFFEKYGKWTIILARFVPIARTFVPVIAGVSYMRYPVFLGFDIV GGVAWGAGVTLAGYFLGSVPFVHMNFQLIILALVFVSLLPALVSAARVYRARRNAPQS DPDPLVLPE" CDS complement(444084..445214) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0372C" /product="fructose-bisphosphate aldolase family protein" /note="Mb0372c, -, len: 376 aa. Equivalent to Rv0365c, len: 376 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Conserved hypothetical protein, very similar to G388212|CAA35191.1, a truncated ORF immediately upstream of the Corynebacterium glutamicum fda gene encoding fructose-1,6-biphosphate aldolase (304 aa), FASTA scores: E(): 7.1e-19, (42.2% identity in 296 aa overlap). Protein product from Mb0372c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0372c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248077.1" /translation="MNLANRAASAETAVTQRHLRRLWALPGTQLAVVAWPSTRRDRLF GSWHYWWQAHLLDCLVDAQLRDPQPQRRARINRQVRSHRVRNNFSWLNSYYDDMAWLA LALERADRVAGVRRRRALPKLTNQFVEAWVPEDGGGIPWRKQDQFFNAPANGPAGLFL ARYPDQYGKRLKRAEQMADWIDRTLIDPETHLVFDGIKAGSLVRAQYTYCQGVVLGLE TELAVRTGPAARARHCARVHRLVAAVNEHMAPLGVLRGAGGGDGGLFAGITARYLALV ATTLPGDSADDAAARDTARAIVLASAQSAWDYRQTVDGLPVFGAFWDREAELPTAGGE QARSVRGAVHSSAIAERDLSVQLSGWMLMEAAHSAAAVSSLG" CDS complement(445239..445832) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0373C" /product="Predicted kinase" /note="Mb0373c, -, len: 197 aa. Equivalent to Rv0366c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Conserved hypothetical protein, showing weak similarity to HI1395|P44173|YD95_HAEIN HYPOTHETICAL PROTEIN from Haemophilus influenzae (140 aa), FASTA scores: opt: 152, E(): 0.0015, (27.0% identity in 126 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00850 Glycine radical signature. Mb0373c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248078.1" /translation="MKRLDLVAGPNGAGKSTFVALTLAPLLPGIVFVNADEIAKQRWP DDPTSHAYQAAQVAADTRARLIDLGRPFIAETVFSHPSKLELIRTARTAGYTVVLHVL VIPEGLAVERVRHRVAAGGHDVPETKIRERHRRLAELVAQAITLADGATVYDNSRLAG PRIVAQFSGGGIIGRACWPSWTPPPLMSRWSNRPETA" CDS complement(445861..446250) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0374C" /product="signal transduction" /note="Mb0374c, -, len: 129 aa. Equivalent to Rv0367c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Hypothetical unknown protein. Protein product from Mb0374c detected using SWATH mass spectrometry. Mb0374c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248079.1" /translation="MPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAV SNQHTASRRRVEAALAGHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGI TTVALNDAGDIVEHRPDGTSVVLAATP" CDS complement(446331..447542) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0375C" /product="Carbon monoxide oxidation accessory protein CoxE" /note="Mb0375c, -, len: 403 aa. Equivalent to Rv0368c, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 403 aa overlap). Conserved hypothetical protein, showing some similarity to AJ224684|BJAJ4684_4 cooxS protein from Bradyrhizobium japonicum (422 aa), FASTA scores: opt: 341, E(): 4.3e-13, (27.4% identity in 387 aa overlap); Rv2425c|MTCY428_22 hypothetical protein from Mycobacterium tuberculosis FASTA score: (30.7% identity in 238 aa overlap). Contains PS00213 Lipocalin signature." /protein_id="CAB5248080.1" /translation="MATPALLPGVDLAAFAAALAARLRDAGIPVSASGQASLVQALQQ LVPRTPAALYWGARLTLVSRVDELATFDAVFASLFGVFGSAEPDGANRPPPPIAGPRT PVAGVGHRAKRRSCAAQAQNLPWDTRSLTMASAGQGGPSRTLPDVLPSRIVARADEPF DQFDPDDLRLLGAWLEATMARWPRRRSMRFESSPHGKRIDLRATMNASRSTGWESVLL ARIRPRRRPRRVLLLCDVSRSMQPYAAIYLHLMRAAVLRRAGGHPEVFAFSTSLTRLT SVLSHRSAEMALHRANARVTDRYGGTFIGRSVAALLAPPHGNALRGAVVIIASDGWDS DPPDVLVHALTRVRRRAELLVWLNPRAAHPEFQPRASSMAAALPYCDLFLPAHSLAGL HQLLLALAGAR" CDS complement(447548..448063) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0376C" /product="POSSIBLE MEMBRANE OXIDOREDUCTASE" /note="Mb0376c, -, len: 171 aa. Equivalent to Rv0369c, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 171 aa overlap). Possible membrane protein oxidoreductase (EC 1.-.-.-), similar to ORF 4 of the Pseudomonas thermocarboxydovorans protein of cutA-cutB-cutC gene cluster: X77931|PTC2CUTAC_4 ORF4 from Pseudomonas thermocarboxydovorans (171 aa), FASTA scores: opt: 226, E(): 9.8e-08, (31.3% identity in 166 aa overlap). Also similar to MTV036.05, MTV036.08, MTV036.09, and MTV026.10. Mb0376c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248081.1" /translation="MPGAQLIGHEGDEYLGKVKVKVGPVTSEFSGKVHFVEQDRNQHR AVFDAKGKEARGTGNAAATVAAQLHEVGERTRVTVDTDLKIVGKLAQFGSGMLQQVSE KLLGQFVDSLEAELAAQSSESPQGTPPATEAAPIDLLQLADGGQLKKYGSALLAALTV LLLIWVLRRRR" CDS complement(448164..449060) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0377C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0377c, -, len: 298 aa. Equivalent to Rv0370c, len: 298 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 298 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to many hypothetical proteins, but also similar to ORF4|X82447|OCCOXMSL4_4 Protein of coxMSL gene cluster from Pseudomonas/Oligotropha carboxidovorans (295 aa), FASTA scores: opt: 851, E(): 0, (48.2% identity in 282 aa overlap); AJ224684|BJAJ4684_3 cooxS from Bradyrhizobium japonicum (302 aa), FASTA scores: opt: 881, E(): 0, (47.6% identity in 290 aa overlap). Also highly similar to MTCY428_21 from Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb0377c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248082.1" /translation="MTFASPDDVIRRFDEQNYLLDTGTASAIYLAVTLGRPLLLEGEP GVGKTTAAKTLAVVLDTTLIRLQCYEGLTANEALYDWNYQRQLLSIRLAEARGKGISD ISEADLYTEAYLVDRPILRCVRHRGPTPPVLLIDEIDRADDEFEALLLEFLGESAVTV PELGTFLAECPPIAVLTSNRSRDLHDALRRRCLYHWIDYPEPDRAAAIVRRTVPGATA PLIENATQFVCTARDLDLDKPPGVAETIDWVAALVALGVADLTAADSSPALASLGALA KTPDDRTQIRDAYQAFTECSHA" CDS complement(449057..449650) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0378C" /product="Molybdenum cofactor cytidylyltransferase (EC" /EC_number="2.7.7.76" /note="Mb0378c, -, len: 197 aa. Equivalent to Rv0371c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. AL132824|SCAH10.09c|CAB60163.1|AL132824 hypothetical protein from Streptomyces coelicolor (207 aa), FASTA scores: opt: 247, E(): 4.5e-09, (32.3% identity in 195 aa overlap). Also weak similarity with YURE|D70017|Z99120|BSUB0017_134 hypothetical protein yurE from Bacillus subtilis (197 aa), FASTA scores: opt: 217, E(): 2.5e-08, (27.0% identity in 174 aa overlap). Mb0378c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248083.1" /translation="MTATQITGVVLAAGRSNRLGTPKQLLPYRDTTVLGATLDVARQA GFDQLILTLGGAASAVRAAMALDGTDVVVVEDVERGCAASLRVALARVHPRATGIVLM LGDQPQVAPATLRRIIDVGPATEIMVCRYADGVGHPFWFSRTVFGELARLHGDKGVWK LVHSGRHPVRELAVDGCVPLDVDTWDDYRRLLESVPS" CDS complement(449647..450402) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0379C" /product="Aerobic carbon monoxide dehydrogenase molybdenum cofactor insertion protein CoxF" /note="Mb0379c, -, len: 251 aa. Equivalent to Rv0372c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 251 aa overlap). Conserved hypothetical protein, showing some similarity with CAB76248.1|X82447|COXF CoxF protein from Pseudomonas/Oligotropha carboxidovorans (280 aa); AJ224684|BJAJ4684_6 cooxS from Bradyrhizobium japonicum (176 aa), FASTA scores: opt: 186, E(): 1.6e-05, (41.1% identity in 95 aa overlap). Also similar to upstream ORF Rv0376c from Mycobacterium tuberculosis (380 aa), FASTA scores: E(): 6.8e-07, (31.0% identity in 277 aa overlap). Mb0379c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248084.1" /translation="MSISDRAAQLVAARTPFVRATVVRAQQPTSARPGDEAILLADGT IEGFVGGHCAQNSVRKAAMGVLQAGESVLLRVLPDGDVHFPEAPGACVVVNPCLAGGS LEIFLTPQLPAPLIQIYGETPIADALIELCGLLGYDARRDTDPADTDALPTAIVIASH SGPEAEIIRTALDNGVGYVGLVASTVRGASILDSLDLSDAERARVHTPVGLAIGAKTP AEIAVSIAAELIATLRGGGPRGRKALADENGGA" CDS complement(450421..452820) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0380C" /product="PROBABLE CARBON MONOXYDE DEHYDROGENASE (LARGE CHAIN)" /note="Mb0380c, -, len: 799 aa. Equivalent to Rv0373c, len: 799 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 799 aa overlap). Probable carbon monoxide dehydrogenase, large chain (EC 1.2.99.2), highly similar to others e.g. AAD00363.1| U80806|CUTL carbon monoxide dehydrogenase large subunit CutL protein from Hydrogenophaga pseudoflava (803 aa); S49124|509391|CAA54902.1|X77931|1094915|2107180C|CUTA carbon-monoxide dehydrogenase large chain (EC 1.2.99.2) (cut operon) from Pseudomonas thermocarboxydovorans (842 aa); C56279|809566|CAA57829.1|X82447|OCCOXMSL4_3|COXL carbon-monoxide dehydrogenase large chain (EC 1.2.99.2) (cluster coxMSL) from Pseudomonas/Oligotropha carboxydovorans (809 aa), FASTA scores: opt: 2484, E(): 0, (56.0% identity in 804 aa overlap); etc. Protein product from Mb0380c detected using SWATH mass spectrometry. Mb0380c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248085.1" /translation="MTTIESRPPSPEDLADNAQQPCGHGRMMRKEDPRFIRGRGTYVD DVALPGMLHLAILRSPYAHARIVRIDVTAAQAHPKVKAVVTGADLAAKGLAWMPTLAN DVQAVLATDKTRFQGQEVASVVAEDRYSARDACELVDVDYEPRDPVVDARTALDPSAP VIRTDLEGKSDNHIFDWETGDAAATEAVFAKADVVVQQEIVYPRVHPAPMETCGAVAD LDPVTGKLTLWTTSQAPHAHRTLYALVAGLPEHKIRVISPDIGGGFGNKVPIYPGYVC AIVASLLLDKPVKWMEDRSENLTSTGFARDYIMVGEIAANRDGKILAIRSNVLADHGA FNAQAAPAKYPAGFFGVFTGSYDIEAAYCHMTAVYTNKAPGGVAYACSFRITEAVYFV ERLVDCLAFELKMDPAELRLRNLLRPNQFPYQSKTGWVYDSGDYETTMRKAMNMIGYE ALRAEQKQRRARGELMGIGMSFFTEAVGAGPRKDMDILGLGMADGCELRVHPTGKAVL RLSVQTQGQGHETTFAQIVAEELGIAPDDIEVVHGDTDQTPFGLGTYGSRSTPVSGGA AALVARKVRDKAKIIASGMLEVSVADLQWEKGKFHVKGDPSAAVTIADIAMRAHGAGD LPEGIEGGLDAEVCYNPSNLTYPYGAYFCVVDIDPGTAVVKVRRFLAVDDCGTRINPM IIEGQVHGGIVDGIGMALMEMIAFDEDGNCLGGSLMDYLIPTALEVPHLETGHTVTPS PHHPIGAKGIGESATVGSPPAVVNAVVDALAPFGVRHADMPLTPSRVWEAMQGRATPP I" CDS complement(452817..453296) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0381C" /product="PROBABLE CARBON MONOXYDE DEHYDROGENASE (SMALL CHAIN)" /note="Mb0381c, -, len: 159 aa. Equivalent to Rv0374c, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). Probable carbon monoxide dehydrogenase, small chain (EC 1.2.99.2), highly similar to others e.g. B56279|5822285|X82447|OCCOXMSL4_2|COXS carbon-monoxide dehydrogenase small chain (EC 1.2.99.2) from Pseudomonas/Oligotropha carboxydovorans (166 aa), FASTA scores: opt: 662, E(): 0, (59.3% identity in 150 aa overlap); CAA12063.1|AJ224684 putative carbon monoxide dehydrogenase small subunit from Bradyrhizobium japonicum (161 aa); S49123|509390|CAA54901.1|X77931|CUTC carbon-monoxide dehydrogenase small chain (EC 1.2.99.2) from Pseudomonas thermocarboxydovorans (163 aa); etc." /protein_id="CAB5248086.1" /translation="MQVNMTVNGEPVTAEVEPRMLLVHFLRDQLRLTGTHWGCDTSNC GTCVVEVDGVPVKSCTMLAVMASGHSIRTVEGLAGPDGQLDPVQEGFMRCHGLQCGFC TPGMLITARALLDRNPDPDEQTIREAISGQICRCTGYTTIVRSIQWAAAHQTVKAQS" CDS complement(453311..454171) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0382C" /product="PROBABLE CARBON MONOXYDE DEHYDROGENASE (MEDIUM CHAIN)" /note="Mb0382c, -, len: 286 aa. Equivalent to Rv0375c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable carbon monoxide dehydrogenase, medium chain (EC 1.2.99.2), similar to others e.g. AAD00361.1|U80806|CUTM carbon monoxide dehydrogenase middle subunit from Hydrogenophaga pseudoflava (287 aa); S49122|509389|CAA54900.1|X77931|CUTB carbon-monoxide dehydrogenase medium chain (EC 1.2.99.2) from Pseudomonas thermocarboxydovorans (287 aa); A56279|809564|CAA57827.1|X82447|OCCOXMSL4_1|COXM|CODH carbon-monoxide dehydrogenase medium chain (EC 1.2.99.2) from Pseudomonas/Oligotropha carboxydovorans (288 aa), FASTA scores: opt: 594, E(): 0, (37.5% identity in 277 aa overlap); etc. Protein product from Mb0382c detected using SWATH mass spectrometry. Mb0382c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248087.1" /translation="MDHAIGLLDRLGEGARVVAGGHSLLPMMKLRIANPEYLVDINDL APELGYVVVGGINNPNLVRLGAMTRHREILDSDALAAVCPIFRDAERVIADPVVRNRG TLGGSLCQADPAEDLSTVCTVLDAVCLAKGPSGEREIAIDDFLVGPYETALAHNEVLI EVRIPLRHNTSSAYAKVERRVGDWAITAAGAAVTLDGQTILAARVGLTAVNPDPVALA ELSAGLVGQPATEEVFAEAGRRAAQACTPVTDVRGTAEYKRHLAGELTVRTLRTAAGR VLGAPAAPEA" CDS complement(454247..455389) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0383C" /product="Xanthine and CO dehydrogenases maturation factor, XdhC/CoxF family" /note="Mb0383c, -, len: 380 aa. Equivalent to Rv0376c, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 380 aa overlap). Conserved hypothetical protein, highly similar to T35481|4008539|CAA22508.1|AL034492|SC6C5.10 hypothetical protein from Streptomyces coelicolor (395 aa); and AAK64260.1|AF373840_20 ORF377 hypothetical CoxI from Arthrobacter nicotinovorans (377 aa). And similar to other conserved hypothetical proteins e.g. NP_101963.1|14021136|BAB47749.1|AP002994 hypothetical protein from Mesorhizobium loti (245 aa). Note that C-terminus shows similarity with C-termini of CAB76248.1|X82447|COXF CoxF protein from Pseudomonas/Oligotropha carboxidovorans (280 aa); CAB76250.1|X82447|COXI CoxI protein from Pseudomonas/Oligotropha carboxidovorans (330 aa); and AJ224684|BJAJ4684_6 cooxS from Bradyrhizobium japonicum (176 aa), FASTA scores: E(): 1.9e-17, (47.1% identity in 138 aa overlap). Also some partial similarity with AJ224684|BJAJ4684_5 cooxS from Bradyrhizobium japonicum (107 aa), FASTA scores: opt: 321, E(): 4.2e-14, (53.3% identity in 92 aa overlap); E1184330|Z99120|YURF YURF PROTEIN from Bacillus subtilis (330 aa), FASTA scores: opt: 170, E(): 2.9e- 16, (27.5% identity in 345 aa overlap). Also similar to downstream ORF Rv0372c from Mycobacterium tuberculosis (251 aa), FASTA scores: E(): 2.1e-06, (30.7% identity in 277 aa overlap). Protein product from Mb0383c detected using SWATH mass spectrometry. Mb0383c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248088.1" /translation="MAIWAAGDTAGVATVVRTLRSAPRPPGAAMVVAPDGSVSGSVSG GCVEGAVYELAAEVAQTGIPRLEHYGVSDDTAFAVGLTCGGIIDVFVEPVSRATFPEL GELADDIGAQRPVAIATVIAHPDERRVGRRLVIRPDTKSPVTGSLGSARADAAVIDDA RGLLAVGRSEILEYGPDGQRRGEGMEVFVSSHAPRPRMLVFGAIDFAAALARQGSFLG YRVTVCDARAVFATPARFPTADDVVVAWPHRYLAAQAEAGGIDERTVICVLTHDPKFD VPVLEVALRLGVGYVGAMGSRKTHDDRMDRLRAAGLTDAELSRLSSPIGLDLGARTPE ETAVSIAADIIARRWGGGGRPLADIAGRIHHDAQVAGEFKDYLTRH" CDS 455438..456403 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0384" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LYSR-FAMILY)" /note="Mb0384, -, len: 321 aa. Equivalent to Rv0377, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 321 aa overlap). Probable transcription regulator, lysR family, showing similarity with many hypothetical transcriptional regulators lysR homolog e.g. P32484|YEIE_ECOLI|M89774 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli (293 aa), FASTA scores: opt: 265, E(): 4.9e-11, (28.6% identity in 266 aa overlap). Also similar to Rv2282c from Mycobacterium tuberculosis. Contains PS00044 bacterial regulatory protein lysR family signature. SEEMS TO BELONG TO THE LYSR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb0384 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248089.1" /translation="MTPAQLRAYSAVVRLGSVRAAAAELGLSDAGVSMHVAALRKELD DPLFTRTGAGLAFTPGGLRLASRAVEILGLQQQTAIEVTEAAHGRRLLRIAASSAFAE HAAPGLIELFSSRADDLSVELSVHPTSRFRELICSRAVDIAIGPASESSIGSDGSIFL RPFLKYQIITVVAPNSPLAAGIPMPALLRHQQWMLGPSAGSVDGEIATMLRGLAIPES QQRIFQSDAAALEEVMRVGGATLAIGFAVAKDLAAGRLVHVTGPGLDRAGEWCVATLA PSARQPAVSELVGFISTPRCIQAMIRGSGVGVTRFRPKVHVTLWS" CDS 456654..456875 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0385" /product="CONSERVED HYPOTHETICAL GLYCINE RICH PROTEIN" /note="Mb0385, -, len: 73 aa. Equivalent to Rv0378, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Conserved hypothetical gly-rich protein, showing some similarity to M. tuberculosis PE_PGRS family; also similar to MTCY06H11_16|Z85982 hypothetical glycine-rich 88.5 KD protein (1011 aa), FASTA scores: opt: 237, E(): 0.0032, (58.7% identity in 63 aa overlap); MTV043_25." /protein_id="CAB5248090.1" /translation="MSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDA GASGSINGNAGDPGNSGERGAVGKPGAPG" CDS 456994..457209 /codon_start=1 /transl_table=11 /gene="secE2" /locus_tag="BQ2027_MB0386" /product="POSSIBLE PROTEIN TRANSPORT PROTEIN SECE2" /note="Mb0386, secE2, len: 71 aa. Equivalent to Rv0379, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Possible secE2, protein transport protein, showing similarity with P27340|S61G_SULSO|SECE PREPROTEIN TRANSLOCASE SECE SUBUNIT (PROTEIN TRANSPORT PROTEIN SEC61 GAMMA SUBUNIT HOMOLOG) from Sulfolobus acidocaldarius (65 aa), FASTA scores: opt: 79, E(): 4.7. (30.3% identity in 66 aa overlap); and hypothetical proteins e.g. Q9HPW4|VNG1446H HYPOTHETICAL PROTEIN from Halobacterium sp. strain NRC-1 (77 aa); Q9I794|PA0038 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (71 aa); etc. Also highly similar to U85467|MTU85467_1 hypothetical Mycobacterium tuberculosis protein from a patient isolate (116 aa), FASTA scores: opt: 443, E(): 7.7e-29, (98.6% identity in 71 aa overlap). Note that for Rv0379|MTV036.14, a translation initiation region different to the one in U85467|MTU85467_1 was chosen. COULD BE A PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732. Protein product from Mb0386 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0386 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248091.1" /translation="MSVYKVIDIIGTSPTSWEQAAAEAVQRARDSVDDIRVARVIEQD MAVDSAGKITYRIKLEVSFKMRPAQPR" CDS complement(457285..457836) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0387C" /product="POSSIBLE RNA METHYLTRANSFERASE (RNA METHYLASE)" /note="Mb0387c, -, len: 183 aa. Equivalent to Rv0380c, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Possible RNA methyltransferase (EC 2.1.1.-), equivalent to CAC32002.1|AL583925 possible RNA methyltransferase from Mycobacterium leprae (182 aa). Also some similarity with others methyltransferases e.g. P19396|TRMH_ECOLI|78514|JV0043 TRNA (GUANOSINE-2'-O-)-METHYLTRANSFERASE (TRNA METHYLTRANSFERASE) from Escherichia coli (229 aa), FASTA scores: opt: 227, E(): 1.4e-09, (28.9% identity in 166 aa overlap). Also similar to Rv0881, Rv3579c, Rv1644 from Mycobacterium tuberculosis. Protein product from Mb0387c detected using SWATH mass spectrometry. Mb0387c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248092.1" /translation="MLLRDGDARNVVDAYRYWTREAIIADIDTRRHPLHVAIENFGHD ANIGSVVRTANAFAVHTVHIVGRRRWNRRGAMVTDRYQRLCHHDSTTGLLEFAAGAGL TVVAVDNVPGAARLEQTALPRECLLLFGQEGPGITDDARAGAAVTVSIAQFGSTRSIN AGVAAGIAMHAWIRQHADLGRAW" CDS complement(457932..458840) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0388C" /product="HYPOTHETICAL PROTEIN" /note="Mb0388c, -, len: 302 aa. Equivalent to Rv0381c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Hypothetical unknown protein. Equivalent to AAK44616.1 from Mycobacterium tuberculosis strain CDC1551 (254 aa) but longer 48 aa. Mb0388c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248093.1" /translation="MRILVAWATCGAVVLSGLTGCSGSSHSGRTYGAQSARTGESLAV LGWNMSVSNLRWSGDYVLIDVDASPTDPHAPHAKPEDIRFGLYGALAHPMESAALGSC GDAMAHVRDVVSPLSAPAGRLTGTVCLGPLKERSAVRGVYTYSPRDRIPGTAAAYPAA FPVGMLPTNQNDAGLVVKTTSVSAWRADGMQLGKPQLGDPVAFTGNGYMLLGLEVDAV PDRYRDDSAARGGPMMLLAAPTLPGRGLSPACATYGSSVLILPDALLDAVHISASLCT QGEINEALLYATVATVGTHAALWTSR" CDS complement(458858..459397) /codon_start=1 /transl_table=11 /gene="pyrE" /locus_tag="BQ2027_MB0389C" /product="PROBABLE OROTATE PHOSPHORIBOSYLTRANSFERASE PYRE (OPRT) (OPRTASE)" /note="Mb0389c, pyrE, len: 179 aa. Equivalent to Rv0382c, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 179 aa overlap). Probable pyrE, orotate phosphoribosyltransferase (EC 2.4.2.10), equivalent to CAC32004.1|AL583925 probable purine/pyrimidine phosphoribosyltransferase from Mycobacterium leprae (179 aa). Also highly similar to many others e.g. T36540|4753874|CAB42037.1|AL049754|SCH10.28c probable orotate phosphoribosyltransferase from Streptomyces coelicolor (182 aa); H69115|2622996|AAB86326.1|AE000938_10|MTH1860 probable orotate phosphoribosyltransferase from Methanobacterium thermoautotrophicum (180 aa), FASTA scores: opt: 389, E(): 2.7e-20, (40.7% identity in 172 aa overlap); O08359|PYRE_SULAC|2065444|CAA73352.1|Y12822 OROTATE PHOSPHORIBOSYLTRANSFERASE from Sulfolobus acidocaldarius (197 aa); etc. Note that also similar to other puridine 5'-monophosphate synthases (umpA genes; UMP synthases), generally in N-terminus that corresponds to orotate phosphoribosyltransferase activity. Contains PS00589 PTS HPR component serine phosphorylation site signature. BELONGS TO THE PURINE/PYRIMIDINE PHOSPHORIBOSYLTRANSFERASE FAMILY. Note that previously known as umpA. Protein product from Mb0389c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0389c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248094.1" /translation="MAGPDRAELAELVRRLSVVHGRVTLSSGREADYYVDLRRATLHH RASALIGRLMRELTADWDYSVVGGLTLGADPVATAIMHAPGRPIDAFVVRKSAKAHGM QRLIEGSEVTGQRVLVVEDTSTTGNSALTAVHAVQDVGGEVVGVATVVDRATGAAEAI EAEGLRYRSVLGLADLGLD" CDS complement(459478..460332) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0390C" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb0390c, -, len: 284 aa. Equivalent to Rv0383c, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Possible conserved secreted protein, with hydrophobic stretch in N-terminus and Pro-rich C-terminus. Equivalent to CAC32006.1|AL583925 possible secreted protein from Mycobacterium leprae (286 aa). Protein product from Mb0390c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0390c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248095.1" /translation="MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFD YERESTEILKRWTRGVMSTVGDVAAHNVVLGQIRGEAVYIFDLEEVATVIALHRKVGT NVVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAE IMWNEQNWTLVSMPIASTRAQWDEGLRTVRQFNDLLRVLPPLPQEMPQQTGVGPRGAA PGRPVAPGGPAELPPRRAQPDPATTVLPDPARRAPEPIRRDEGRSEGVRRPPPAGRNG QQATNYQH" CDS complement(460473..463019) /codon_start=1 /transl_table=11 /gene="clpB" /locus_tag="BQ2027_MB0391C" /standard_name="htpM" /product="PROBABLE ENDOPEPTIDASE ATP BINDING PROTEIN (CHAIN B) CLPB (CLPB PROTEIN) (HEAT SHOCK PROTEIN F84.1)" /note="Mb0391c, clpB, len: 848 aa. Equivalent to Rv0384c, len: 848 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 848 aa overlap). Probable clpB (alternate gene name: htpM), endopeptidase ATP-binding protein, chain B (EC 3.-.-.-), equivalent to AC32007.1|AL583925 heat shock protein from Mycobacterium leprae (848 aa). Also highly similar to others e.g. P53532|CLPB_CORGL|1163118|AAB49540.1|U43536|CGU43536_1 CLPB PROTEIN (heat-inducible expression) from Corynebacterium glutamicum (852 aa), FASTA scores: opt: 4113, E(): 0, (74.5% identity in 846 aa overlap); T36551|4753885|CAB42048.1|AL049754|clpB|SCOEDB|SCH10.39c probable ATP-dependent proteinase ATP-binding chain from Streptomyces coelicolor (853 aa); P03815|CLPB_ECOLI|1788943|AAC75641.1|AE000345 CLPB PROTEIN (HEAT SHOCK PROTEIN F84.1) from Escherichia coli strains K12 and O157:H7 (857 aa); etc. Also similar to Rv3596c|ClpC from Mycobacterium tuberculosis. Contains PS00870 and PS00871 Chaperonins clpA/B signatures and two PS000017 ATP/GTP-binding site motives A (P-loop). BELONGS TO THE CLPA/CLPB FAMILY. Contains probable coiled-coil domain from aa 411-503. Protein product from Mb0391c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0391c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248096.1" /translation="MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDG IAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQPQLSRESLAAITTAQQLATELD DEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEG LAQRIVAGDVPESLRDKTIVALDLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITF IDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEYRKHIEKDAALERRF QQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAI DLVDEAASRLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELA DQKEKLAELTTRWQNEKNAIEIVRDLKEQLEALRGESERAERDGDLAKAAELRYGRIP EVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAKLLRMED ELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFL FDDERAMVRIDMSEYGEKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIE KAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTSNLGSGGSAEQVLAAVRATFK PEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGF DPVYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG" CDS 463152..464324 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0392" /product="PROBABLE MONOOXYGENASE" /note="Mb0392, -, len: 390 aa. Equivalent to Rv0385, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 390 aa overlap). Probable monooxygenase (EC 1.-.-.-), similar to T37003|5738846|CAB52917.1|AL109949 probable flavohemoprotein from Streptomyces coelicolor (435 aa); and similar in part (C-termini) to various monooxygenases e.g. P19734|DMPP_PSESP|94993|F37831 PHENOL HYDROXYLASE P5 PROTEIN (PHENOL 2-MONOOXYGENASE P5 COMPONENT) (EC 1.14.13.7) from Pseudomonas putida (353 aa), FASTA scores: opt: 363, E(): 4.2e-16, (31.8% identity in 255 aa overlap); S47292|2120861|pir|S70085 phenol 2-monooxygenase (EC 1.14.13.7) chain mopP from Acinetobacter calcoaceticus (350 aa); P21394|XYLA_PSEPU|94933|B37316 XYLENE MONOOXYGENASE ELECTRON TRANSFER COMPONENT (EC 1.18.1.3) [INCLUDES: FERREDOXIN; FERREDOXIN--NAD(+) REDUCTASE] from Pseudomonas putida plasmid pWW0 (350 aa); AAC38360.1|AF043544|NtnMA|ntnA reductase component of 4-nitrotoluene monooxygenase from Pseudomonas sp. (328 aa); etc. Protein product from Mb0392 detected using SWATH mass spectrometry. Mb0392 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248097.1" /translation="MGLEDRDALRVLQNAFKLDDPELVRRFYAHWFALDASVRDLFPP DMGAQRAAFGQALHWVYGELVAQRAEEPVAFLAQLGRDHRKYGVLPTQYDTLRRALYT TLRDYLGHPSRGAWTDAVDEAAGQSLNLIIGVMSGAADADDAPAWWDGTVVEHIRVSR DLAVARLQLDRPLHYYPGQYVNVHVPQCPRRWRYLSPAIPADPNGRIEFHVRVVPGGL VSNAIVGETRPGDRWRLSGPHGAFRVDRDGGDVLMVAGSTGLAPLRALIIDLSRFAVN PRVHLFFGARYACELYDLPTLWQIAAHNPWLSVSPVSEYNGDPAWAADYPDVSAPRGL HVRQTGRLPDVVSRYGGWGDRQILICGGPAMVRATKAALIAKGAPPERIQHDPLSR" CDS 464428..467685 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0393" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LUXR/UHPA-FAMILY)" /note="Mb0393, -, len: 1085 aa. Equivalent to Rv0386, len: 1085 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1085 aa overlap). Probable regulatory protein, LuxR/uhpA family, highly similar to CAC30706.1|AL583923 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also similar in part to other regulatory proteins e.g. CAB95788.1|AL359949 putative multi-domain regulatory protein from Streptomyces coelicolor (780 aa); N-terminus of CAB92369.1|AL356612 putative AfsR-like regulatory protein from Streptomyces coelicolor (1114 aa); N-terminus of NP_107139.1|14026327|BAB52925.1|AP003009 transcriptional regulator from Mesorhizobium loti (952 aa); AFSR_STRCO|P25941 regulatory protein afsr from Streptomyces coelicolor (993 aa), FASTA scores: opt: 224, E() : 1.1e-06, (26.1% identity in 867 aa overlap); etc. Also similar to many putative Mycobacterium tuberculosis regulatory proteins e.g. AL0212|MTV008_44 (1137 aa), FASTA scores: opt: 3756, E(): 0, (56.7% identity in 1089 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature and probable helix-turn-helix motif at aa 1042-1063 (Score 1025, +2.68 S D). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb0393 detected using SWATH mass spectrometry. Mb0393 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248098.1" /translation="MSKLLPRGTVTLLLADVEGSTWLWETHPDDMGAAVARLDKAVSG VIAAHDGVRPVEQGEGDSFVLAFACASDAVAAALDLQRARLAPIRLRIGVHTGEVALR DEGNYAGPTINRTARLRDLAHGGQTVLSGVTESLVIDRLPDKAWLVDLGTHALRDLSR PERVMQLCHPELRIDFPPLRVANDDVAHGLPVHLTRFVGRGAQITEVHRLVTDNRLVT LTGAGGVGKTRLAAQLAAQIAGEFGRAWFVDLAPITDPDLVPVTVAGALGLHDQPGRS TTDTVLRFLGGRPALVVLDNCEHLLDATAALVLALVKACRGVRLLATCREPLRVEGEV SYRVPSLSLSDEAVEMFCYRAQRVRPDFRLTDDNSAAVTEICKRLDGLPLAIELAAAR LRSMTLDEIIDGLRDRFALLTGGARTAAHRQQTLWASVDWSYTLLTEPERTLFRRLAV FVGCFFVDDAQAVACSGDVQRYQVLDEITLLVDKSLVMADDNSGRTCYRLCETMRHYA LEKLSEAGEVDAVFARHRDYYTALAARVDNPGPSDYSHCLDQAETEIDNLRAAFVWNR ENSDTEGALALASSLLRVWMTRGRIQEGRAWFDSILADENARHLEVAAAVRARALADK ALLDIFVDAAAGMEQAQQALVIAREVDEPALLSRALTACGLIAVAVARADAAASYFAE AIDLARAVDDRWRLAQILTFQAVDAVVAGDPVAARPAAQEARELAAAIGDHSNALWCR WCLGYAQLMRGELAAAAAQFGEVVDEAEASQEVLHKANSLQGLAFALAYQGELSAARA AADAALEAAELGEYFAGMGYSALTTAALAAGDVQTAQHASEAAWRNLSLALPLSAAVQ RAFNAQAALAGGDLSAARRWCDDAVQSMTGHHLAMALATRARIAVAEGKREEAERDAH KALACAAESGAHLDLPDVLECLAGLASDAGTHHAAARLFGAAEAIRQQIGSVRFAIYR SDYVQSVTALRDAMGEKDFAAAWAEGAALSIKETIAYAQRGHSWRKRPATGWESLTPT EIDVVRLVGEGLANKDIATRLFVSPRTVQTHLTHVYTKLGFTSRLQLAQAAARRT" CDS complement(467689..469020) /codon_start=1 /transl_table=11 /gene="PPE9" /locus_tag="BQ2027_MB0394C" /product="conserved hypothetical protein" /note="Mb0394c, PPE9, len: 443 aa. Equivalent to Rv0388c and Rv0387c, len: 180 aa and 244 aa, from Mycobacterium tuberculosis strain H37Rv, (95.1% identity in 164 aa overlap and 100.0% identity in 244 aa overlap). Rv0388c: Member of the Mycobacterium tuberculosis PPE family, highly similar to others e.g. MTCY10G2_10|Z92539 from Mycobacterium tuberculosis (391 aa), FASTA scores: opt: 667, E(): 0, (58.3% identity in 180 aa overlap) but much shorter. Rv0387c: conserved hypothetical protein, showing some similarity to MTCI237.20c, and M17282|HUMEL20_1 Human elastin gene, exon 1, Elastin (687 aa), FASTA scores: opt: 193, E(): 0.35, (34.4% identity in 189 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0388c and Rv0387c exist as 2 separate genes. In Mycobacterium bovis, 3 different base substitutions, the first of 14 bases, the second of 8 bases (tctacagt-gctacagg), and lastly of 28 bases, leads to a longer single product. Mb0394c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248099.1" /translation="MDFGALPPEINSARIYSGPGSRPLMQAAAAWQRLANELTATAAS YSSVISGLTGDDWLGPSALSMAAAAVPYVAWMRATAASAEQAAAQAVAAANAYESAYA ATVPPTVIAANRRTMLSLVKTNVFGQNTPAIATSEAQYGAMWAQDIVAMEGYAGASAA ASQLPPFTPPPATTSGAGSLSDAAATAAQAVVPAAAATDVSLLPTLQSFLPPPFDAIP NPIEDLDVLVAAAVAVAAGSLGVSAAQLGEIYRHDVVDEAQKAPHCPAESDQTPAGAA GDGDLPEVGGRVTSPPQPPVAALTGYSANIGGLSVPHSWNLPPAVRQVAAMFPGATPM YMTGSSDGSYAGLAAAGLAGTGLAGLAARGGSAPTPAAAAPAGAGGAGPAATRPAAQQ TPAVPAAAAGSAIPGLPPGLPPGVVANLAATLAAIPGATIIVVPPSPNANQ" CDS 469354..470613 /codon_start=1 /transl_table=11 /gene="purT" /locus_tag="BQ2027_MB0395" /product="PROBABLE PHOSPHORIBOSYLGLYCINAMIDE FORMYLTRANSFERASE 2 PURT (GART 2) (GAR TRANSFORMYLASE 2) (5'-PHOSPHORIBOSYLGLYCINAMIDE TRANSFORMYLASE 2) (FORMATE-DEPENDENT GAR TRANSFORMYLASE)" /note="Mb0395, purT, len: 419 aa. Equivalent to Rv0389, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 419 aa overlap). Probable purT, phosphoribosylglycinamide formyltransferase 2 (EC 2.1.2.-), similar to others e.g. P33221|PURT_ECOLI|B1849 phosphoribosylglycinamide formyltransferase 2 from Escherichia coli strain K-12 (391 aa), FASTA scores: opt: 481, E(): 1.3e-22, (40.1% identity in 379 aa overlap); etc. BELONGS TO THE PURK / PURT FAMILY. COFACTOR: MAGNESIUM. Protein product from Mb0395 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0395 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248100.1" /translation="MIDGWTEGQHEPTVRHERPAAPQDVRRVMLLGSAEPSRELAIAL QGLGAEVIAVDGYVGAPAHRIADQSVVVTMTDAEELTAVIRRLQPDFLVTVTAAVSVD ALDAVEQADGECTELVPNARAVRCTADREGLRRLAADQLGLPTAPFWFVGSLGELQAV AVHAGFPLLVSPVAGVAGQGSSVVAGPNEVEPAWQRAAGHQVQPQTGGVSPRVCAESV VEIEFLVTMIVVCSQGPNGPLIEFCAPIGHRDADAGELESWQPQKLSTAALDAAKSIA ARIVKALGGRGVFGVELMINGDEVYFADVTVCPAGSAWVTVRSQRLSVFELQARATLG LAVDTLMISPGAARVINPDHTAGRAAVGAAPPADALTGALGVPESDVVIFGRGLGVAL ATAPEVAIARERAREVASRLNVPDSRE" CDS 470610..471032 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0396" /product="Rhodanese-related sulfurtransferase" /note="Mb0396, -, len: 140 aa. Equivalent to Rv0390, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Conserved hypothetical protein, equivalent to AL023514|MLCB4_11|CAA18942.1|AL023514 hypothetical protein from Mycobacterium leprae (147 aa), FASTA scores: opt: 778, E(): 0, (79.0% identity in 138 aa overlap). Also similar to hypothetical proteins from several Rickettsia species. Protein product from Mb0396 detected using shotgun mass spectrometry. Mb0396 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248101.1" /translation="MSYAGDITPLQAWEMLSDNPRAVLVDVRCEAEWRFVGVPDLSSL GREVVYVEWATSDGTHNDNFLAELRDRIPADADQHERPVIFLCRSGNRSIGAAEVATE AGITPAYNVLDGFEGHLDAEGHRGATGWRAVGLPWRQG" CDS 471029..472249 /codon_start=1 /transl_table=11 /gene="metZ" /locus_tag="BQ2027_MB0397" /product="PROBABLE O-SUCCINYLHOMOSERINE SULFHYDRYLASE METZ (OSH SULFHYDRYLASE)" /note="Mb0397, metZ, len: 406 aa. Equivalent to Rv0391, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Probable metZ, O-succinylhomoserine sulfhydrylase (EC 4.2.99.-), equivalent, but shorter 20 aa in N-terminus, to AA18941.1|AL023514 O-succinylhomoserine sulfhydrylase from Mycobacterium leprae (426 aa). Also highly similar to others e.g. METZ_PSEAE|P55218 o-succinylhomoserine sulfhydrylase from Pseudomonas aeruginosa (403 aa), FASTA scores: opt: 1175, E(): 0, (47.2% identity in 392 aa overlap); etc. BELONGS TO THE TRANS-SULFURATION ENZYMES FAMILY. Could also be a cystathionine gamma-synthase (EC 4.2.99.9). Protein product from Mb0397 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0397 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248102.1" /translation="MTDESSVRTPKALPDGVSQATVGVRGGMLRSGFEETAEAMYLTS GYVYGSAAVAEKSFAGELDHYVYSRYGNPTVSVFEERLRLIEGAPAAFATASGMAAVF TSLGALLGAGDRLVAARSLFGSCFVVCSEILPRWGVQTVFVDGDDLSQWERALSVPTQ AVFFETPSNPMQSLVDIAAVTELAHAAGAKVVLDNVFATPLLQQGFPLGVDVVVYSGT KHIDGQGRVLGGAILGDREYIDGPVQKLMRHTGPAMSAFNAWVLLKGLETLAIRVQHS NASAQRIAEFLNGHPSVRWVRYPYLPSHPQYDLAKRQMSGGGTVVTFALDCPEDVAKQ RAFEVLDKMRLIDISNNLGDAKSLVTHPATTTHRAMGPEGRAAIGLGDGVVRISVGLE DTDDLIADIDRALS" CDS complement(472246..473658) /codon_start=1 /transl_table=11 /gene="ndhA" /locus_tag="BQ2027_MB0398C" /product="PROBABLE MEMBRANE NADH DEHYDROGENASE NDHA" /note="Mb0398c, ndhA, len: 470 aa. Equivalent to Rv0392c, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 470 aa overlap). Probable ndhA, membrane NADH dehydrogenase (EC 1.6.99.3), equivalent to many e.g. AF038423|AF038423_1 NADH dehydrogenase from Mycobacterium smegmatis (457 aa), FASTA scores: opt: 1991, E(): 0, (67.9% identity in 458 aa overlap); MLCB1788_3 NADH dehydrogenase from Mycobacterium leprae (466 aa), FASTA score: (62.5% identity in 467 aa overlap). Also similar to others from several organisms e.g. P00393|DHNA_ECOLI|66211|581140|CAA23586.1|V00306 NADH DEHYDROGENASE from Escherichia coli (434 aa); and Rv0392c|ndhB from Mycobacterium tuberculosis. Has hydrophobic stretch in C-terminus. BELONGS TO THE NADH DEHYDROGENASE FAMILY. Protein product from Mb0398c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0398c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248103.1" /translation="MTLSSGEPSAVGGRHRVVIIGSGFGGLNAAKALKRADVDITLIS KTTTHLFQPLLYQVATGILSEGDIAPTTRLILRRQKNVRVLLGEVNAIDLKAQTVTSK LMDMTTVTPYDSLIVAAGAQQSYFGNDEFATFAPGMKTIDDALELRGRILGAFEAAEV STDHAERERRLTFVVVGAGPTGVEVAGQIVELAERTLAGAFRTITPSECRVILLDAAP AVLPPMGPKLGLKAQRRLEKMDAEVQLNAMVTAVDYKGITIKEKDGGERRIECACKVW AAGVAASPLGKMIAEGSDGTEIDRAGRVIVEPDLTVKGHPNVFVVGDLMFVPGVPGVA QGAIQGARYATTVIKHMVKGNDDPANRKPFHYFNKGSMATISRHSAVAQVGKLEFAGY FAWLAWLVLHLVYLVGYRNRIAALFAWGISFMGRARGQMAITSQMIYARLVMTLMEQQ AQGALAAAEQAEHAEQEAAG" CDS 473800..475125 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0399" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb0399, -, len: 441. Equivalent to Rv0393, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 441 aa overlap). Member of Mycobacterium tuberculosis 13E12 repeat family of conserved proteins, similar to many e.g. Rv1148c, Rv1945, Rv3467, Rv0336|MTCY279_3 (503 aa), FASTA scores: E(): 0, (61.1% identity in 347 aa overlap)." /protein_id="CAB5248104.1" /translation="MAVGRCAIPRFDQAASGSAINGGQVHLSDGSTSPARQLPAPWPG DAGAAAEGRAGVCCRGNRLPHVSDVGVSHRFDHRPAGVGAGGCRAGAAGAGLAVDDPG QLAAAIDRIVAVADPDAVRQVRERARDREVSIWNSADGMGEVYAQLYATDAQALDARL NALVATVCAGDPRSTDQRRADALGALAAGADRLACRCDNPDCAAEGRPVSAVVIHVVA EQASVKGHGQAPAALLGGDGLIPAELVAELAKTAGLQPIPVPAGTEPGYRPSVKLAAF VRARDLTCRAPGCDRPATQCDLDHTIAFADGGATHAANLKCLCRLHHLLATFCGWRAQ QLPDGTVIWTLPGNQTYVTTPGSALLFPALCTPTGDPPAPEPARADRRGQRTAMMPRR ASTRTQNRAHCIAAERHRNHQARRIAQAAVIATETHGPPPDPDDDPPPF" CDS complement(475141..475860) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0400C" /product="POSSIBLE SECRETED PROTEIN" /note="Mb0400c, -, len: 239 aa. Equivalent to Rv0394c, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 239 aa overlap). Possible secreted protein, sharing no homology with other proteins. Has hydrophobic stretch at its N-terminus. Protein product from Mb0400c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0400c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248105.1" /translation="MTEPRPVFAVVISAGLSAIPMVGGPLQTVFDAIEERTRHRAETT TREICESVGGADTVLSRIDKNPELEPLLSQAIEAATRTSMEAKRRLLAQAAAAALEDD QKVEPASLIVATLSQLEPVHIHALVRLAKAAKSSPDQDEIQRREVMRAASKVEPVPVL AALIQTGVAIATTTVWHGNGTGTPAEESGHILIHDVSDFGHRLLAYLRAADAGAELLI LPSGGSAPTGDHPTPHPSTSR" CDS 475959..476363 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0401" /product="HYPOTHETICAL PROTEIN" /note="Mb0401, -, len: 134 aa. Equivalent to Rv0395, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 134 aa overlap). Hypothetical unknown protein." /protein_id="CAB5248106.1" /translation="MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEV LDEHLAVRRRGVPAAIGCVPWLSSEAVAETLLALSAFCVVIDKGTSFPSRLRNPDKGF PNVALLRLRDMAPSEHGSRCSSARGRLCLSMS" CDS 476369..476761 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0402" /product="HYPOTHETICAL PROTEIN" /note="Mb0402, -, len: 130 aa. Equivalent to Rv0396, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Hypothetical unknown protein." /protein_id="CAB5248107.1" /translation="MRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFE PRTVWWGSANWTVRAGSHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPN LVQVEFDDAAMAEAMEEMAEPDDDGEDW" CDS 476835..477203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0403" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb0403, -, len: 122 aa. Equivalent to Rv0397, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Part of 13E12 repeat family of conserved Mycobacterium tuberculosis proteins, similar to downstream Rv0393|Z84725|MTCY4D9_5 CONSERVED 13E12 REPEAT FAMILY PROTEIN (441 aa), FASTA scores: E(): 0, (87.7% identity in 122 aa overlap)." /protein_id="CAB5248108.1" /translation="MLATFWGWRAQQLPDGTVIWTLPGDQTYVTTPGSALLFPALCTP TGDPPRPDPARADRRGQRTAMMPRRASTRAQNRAHYIAAERHRNHQARRIAHVVTQTA TTAPETNGPPPDPDDDPPPF" CDS 477413..477661 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0403A" /product="Conserved protein" /note="Mb0403A, len: 82 aa. Equivalent to Rv0397A len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Protein product from Mb0403A detected using SWATH mass spectrometry. Mb0403A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248109.1" /translation="MHALRLVGLAILTAIAPIAVLIGSSPAHADTDIGQPCSPEGAKL WGNPGPIYCERTADGQLQWVSIPAWALCVAFCDRPGGP" CDS complement(477698..478339) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0404C" /product="POSSIBLE SECRETED PROTEIN" /note="Mb0404c, -, len: 213 aa. Equivalent to Rv0398c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Possible secreted protein, sharing no homology with other proteins. Has potential signal sequence with hydrophobic stretch from aa 7-25. Protein product from Mb0404c detected using SWATH mass spectrometry. Mb0404c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248110.1" /translation="MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGP VIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRV LGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAG TPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWPG" CDS complement(478346..479575) /codon_start=1 /transl_table=11 /gene="lpqK" /locus_tag="BQ2027_MB0405C" /product="POSSIBLE CONSERVED LIPOPROTEIN LPQK" /note="Mb0405c, lpqK, len: 409 aa. Equivalent to Rv0399c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 409 aa overlap). Possible lpqK, conserved lipoprotein, showing some similarity to penicillin binding proteins and various peptidases e.g. DAC_STRSQ|P15555 d-alanyl-d-alanine carboxypeptidase protein (406 aa), FASTA scores: opt: 348, E(): 5.6e-16, (29.2% identity in 301 aa overlap). Also similar to other Mycobacterium tuberculosis PBPs and esterases. Has possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Mb0405c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248111.1" /translation="MPVLRRLGCSVLALGLLAGCAPPRTGPASSPTNNGAKADAVIRI VRDFMTQAHLKAVLVRVTVAGKEVVTRAVGDSMTGVPATTAMHFRNGAVAISYVATLL LKLVDEKKLRLDDKLSRWLPDFPHADRVTLGQLAQMTSGYPDYVLGNEAFDAELYANP FRQWTTQELLDQISSRPLLYDPGTNWNYAHTNYLLLGLALEKAAGQDMPTLLQRKVLS PLGLTATANSDTPAIPEPALHAFTSERRAALKIPAGVPFYEESTFWNPSWTITHGAIQ TTTIYDMEATAVGIGSGRLLSADSYKKMVSTELRGKTRAQPGCPTCFEQNDGYSYGLG IVISGHWLLQNPMFAGYAAVEAYLPSQRVAVAVAVTYAPEAFDDQGNYRNQADILFRK IGAEVAPNDAPPMPPGR" CDS complement(479585..480772) /codon_start=1 /transl_table=11 /gene="fadE7" /locus_tag="BQ2027_MB0406C" /product="acyl-coa dehydrogenase fade7" /note="Mb0406c, fadE7, len: 395 aa. Equivalent to Rv0400c, len: 395 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 395 aa overlap). Probable fadE7, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. CAC12923.1|AL445403 putative acyl CoA dehydrogenase from Streptomyces coelicolor (397 aa); G624219 GLUTARYL-COA DEHYDROGENASE PRECURSOR (438 aa), FASTA scores: opt: 1161, E(): 0, (48.1% identity in 391 aa overlap); etc. Protein product from Mb0406c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0406c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248112.1" /translation="MSTPTPPALDRDDPLGLDASLSSDEIAVRDTVRRFCAEHVTPHV AAWFEDGDLPVARDLAKQFGELGLLGMQLHGHGCGGASAVHYGLACRELEAADSGIRS LVSVQGSLAMFAIASFGSDEQKRQWLPGMATGDLLGCFGLTEPDVGSDPAAMKTRARR DGPDWVITGGKMWITNGSVADVAIVWAATDDGIRGFIVPTDTPGFTANTIGHKLSLRA SITSELVLDNVRLPADAMLPGATGLRAPLACLSEARYGIVWGAMGAARSAWQCALDYA RQRTQFGRPIAGFQLTQAKLVDMAVELHKGQLLSLHLGRLKDRVGLRPDQVSFGKLNN TREALKICRTARTILGGNGISLEYPVIRHMVNLESVLTYEGTPEMHQLVLGQAFTGLA AFR" CDS 480808..481179 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0407" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0407, -, len: 123 aa. Equivalent to Rv0401, len: 123 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 123 aa overlap). Probable conserved transmembrane protein, equivalent to AL023514|MLCB4_9 putative integral membrane protein from Mycobacterium leprae (122 aa), FASTA scores: opt: 548, E(): 4.4e-32, (66.9% identity in 121 aa overlap). Mb0407 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248113.1" /translation="MRPRRALAGLAADVVAVLVFCAVGRRSHAEGLSVTGLAATAWPF LTGTGIGWVLARGWRRPTALAPTGVIVWLCTIVVGMVLRKVSSAGVAASFVVVASAVT AVLLLGWRAAVALMAPHRADG" CDS complement(481374..482477) /codon_start=1 /transl_table=11 /gene="mmpL1b" /locus_tag="BQ2027_MB0408C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL1B [SECOND PART]" /note="Mb0408c, mmpL1b, len: 367 aa. Equivalent to 3' end of Rv0402c, len: 958 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 367 aa overlap). Probable mmpL1, conserved transmembrane transport protein (see citation below), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. YV34_MYCTU|Q11171 hypothetical 106.2 kd membrane protein from Mycobacterium tuberculosis (968 aa), FASTA scores: opt: 3551, E(): 0, (55.4% identity in 933aa overlap); YV34_MYCLE|P54881 hypothetical 105.2 kd protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3615, E(): 0, (55.5% identity in 941 aa overlap); etc. Highly similar to many other mycobacterial MmpL proteins from Mycobacterium tuberculosis and M. leprae e.g. Rv0450c, Rv0676c, Rv0507, etc. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mmpL1 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits mmpL1 into 2 parts, mmpL1a and mmpL1b. Mb0408c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248114.1" /translation="MNSMDNVDKLTEDLANLTDDTERMDTTQRQLLAQLDPTIATMQT VKDLAQTLTSAFSGLVTQMEDMTRNATVMGRTFDAANNDDSFYLPPEAFQNPDFQRGL KLFLSPDGTCARFVITHRGDPASAEGISHIDPIMQAADEAVKGTPLQAASIYLAGTSS TYKDIHEGTLYDVMIAVVASLCLIFIIMLGITRSVVASAVIVGTVALSLGSAFGLSVL IWQHILHMPLHWLVLPMAIIVMLAVGSDYNLLLIARFQEEIGAGLKTGMIRAMAGTGR VVTIAGLVFAFTMGSMVASDLRVVGQIGTTIMIGLLFDTLVVRSYMTPALATLLGRWF WWPRRVDRLARQPQVLGPRRTTALSAERAALLQ" CDS complement(482474..484249) /codon_start=1 /transl_table=11 /gene="mmpL1a" /locus_tag="BQ2027_MB0409C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL1A [FIRST PART]" /note="Mb0409c, mmpL1a, len: 591 aa. Equivalent to 5' end of Rv0402c, len: 958 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 568 aa overlap). Probable mmpL1, conserved transmembrane transport protein (see citation below), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. YV34_MYCTU|Q11171 hypothetical 106.2 kd membrane protein from Mycobacterium tuberculosis (968 aa), FASTA scores: opt: 3551, E(): 0, (55.4% identity in 933aa overlap); YV34_MYCLE|P54881 hypothetical 105.2 kd protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3615, E(): 0, (55.5% identity in 941 aa overlap); etc. Highly similar to many other mycobacterial MmpL proteins from Mycobacterium tuberculosis and M. leprae e.g. Rv0450c, Rv0676c, Rv0507, etc. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mmpL1 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits mmpL1 into 2 parts, mmpL1a and mmpL1b. Mb0409c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248115.1" /translation="MRSQRLAGHLSAAARTIHALSLPIILFWVALTIVVNVVAPQLQS VARTHSVALGPHDAPSLIAMKRIGKDFQQFDSDTTAMVLLEGQEKLGDEAHRFYDVLV TKLSQDTTHVQHIENFWGDPLTAAGSQSADGKAAYVQLNLTGDQGGSQANESVAAVQR IVDSVPPPPGIKAYVTGPGPLGADRVVYGDRSLHTITGISIAVIAIMLFIAYRSLSAA LIMLLTVGLELLAVRGIISTFAVNDLMGLSTFTVNVLVALTIAASTDYIIFLVGRYQE ARATGQNREAAYYTMFGGTAHVVLASGLTVAGAMYCLGFTRLPYFNTLASPCAIGLVT VMLASLTLAPAIIAVASRFGLFDPKRATTKRRWRRIGTVVVRWPGPVLAATLLIALIG LLALPKYQTNYNERYYIPSAAPSNIGYLASDRHFPQARMEPEVLMVEADHDLRNPTDM PILDRIAKTVFHTPGIARVQSITRPLGAPIDHSSIPFQLGMQSTMTIENLQNLKDRVA DLSTLTDQLQRMIDITQRTQELTRQLTDATHDMNAHTRQMRDNANELRDRIADFDDFW RPSEVSRTGSATASTFPSAGRCAPC" CDS complement(484246..484674) /codon_start=1 /transl_table=11 /gene="mmpS1" /locus_tag="BQ2027_MB0410C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN MMPS1" /note="Mb0410c, mmpS1, len: 142 aa. Equivalent to Rv0403c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Probable mmpS1, conserved membrane protein (see citation below), highly similar to other Mycobacterial proteins e.g. YV33_MYCLE|P54880 hypothetical 16.9 kd protein from Mycobacterium leprae (154 aa), FASTA scores: opt: 458, E(): 1.6e-26, (46.9% identity in 143 aa overlap); YV33_MYCTU|Q11170 hypothetical 15 .9 kd protein from M. tuberculosis (147 aa), FASTA scores: opt: 362, E(): 1.1e-19, (42.1% identity in 140 aa overlap); etc. Also similar to other MmpS proteins from Mycobacterium tuberculosis e.g. Rv0677c, Rv0451c, etc. BELONGS TO THE MMPS FAMILY. Mb0410c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248116.1" /translation="MFGVAKRFWIPMVIVIVVAVAAVTVSRLHSVFGSHQHAPDTGNL DPIIAFYPKHVLYEVFGPPGTVASINYLDADAQPHEVVNAAVPWSFTIVTTLTAVVAN VVARGDGASLGCRITVNEVIREERIVNAYHAHTSCLVKSA" CDS 484995..486752 /codon_start=1 /transl_table=11 /gene="fadD30" /locus_tag="BQ2027_MB0411" /product="fatty-acid-amp ligase fadd30 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb0411, fadD30, len: 585 aa. Equivalent to Rv0404, len: 585 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 585 aa overlap). Probable fadD30, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. MBU75685_1|AAB52538.1|U75685 acyl-CoA synthase from Mycobacterium bovis (582 aa); MASC_MYCLE|P54200 masc protein from Mycobacterium leprae (372 aa), FASTA scores: opt: 888, E(): 0, (44.2% identity in 342 aa overlap). Also similar to Y06J_MYCTU|Q10976 hypothetical 67.9 kd protein (626 aa), FASTA scores: opt: 1463, E(): 0, (42.4% identity in 568 aa overlap). Protein product from Mb0411 detected using SWATH mass spectrometry. Mb0411 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248117.1" /translation="MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSR VTAVSAYLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWAPVPLPEPLGSLRDK RTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEPSGDNCDLDSQ LSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYFRHEGGAPRLPSSVVSWLP LYHDMGLMVGLFIPLFVGCPVILTSPEAFIRKPARWMQLLAKHQAPFSAAPNFAFDLA VAKTSEEDMAGLDLGHVNTIINGAEQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEA VVYLATTKAGSPPTSTEFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDP DSNIELGPGRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSDDGVEHLVI AAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLLVPPGALPKTTSGKISR AACAKQYGANKLQRVATFP" CDS 486749..488131 /codon_start=1 /transl_table=11 /gene="pks6a" /locus_tag="BQ2027_MB0412" /product="PROBABLE MEMBRANE BOUND POLYKETIDE SYNTHASE PKS6A [FIRST PART]" /note="Mb0412, pks6a, len: 460 aa. Equivalent to 5' end of Rv0405, len: 1402 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 456 aa overlap). Probable pks6, membrane-bound polyketide synthase, highly similar to others e.g. CAC29643.1|AL583917 putative polyketide synthase from Mycobacterium leprae (2103 aa); Y06K_MYCTU|Q10977 probable polyketide synthase (1876 aa), FASTA scores: opt: 2303, E(): 0, (38.7% identity in 1232 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, 2 x PS00017 ATP/GTP-binding site motif A (P-loop), and PS00012 Phosphopantetheine attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pks6 exists as a single gene. In Mycobacterium bovis, a frameshift due to single base insertion (*-g) splits pks6 into 2 parts, pks6a and pks6b. Mb0412 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248118.1" /translation="MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSI DVLAIPGDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQGR GSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNAGTFAESGGFL KDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAGIIPESLRLSRTGVFVGVS STDYVRLVSASAQQKSTIWDNTGGSSSIIANRISYFLDIQGPSIVIDTACSSSLVAVH LACRSLSTWDCDIALVGGTNVLISPEPWGGFREAGILSQTGCCHAFDKSADGMVRGEG CGVIVLQRLSDARLEGRRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARV DPLEIGYVEAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI AGLIKAGVDG" CDS 488118..490958 /codon_start=1 /transl_table=11 /gene="pks6b" /locus_tag="BQ2027_MB0413" /product="PROBABLE MEMBRANE BOUND POLYKETIDE SYNTHASE PKS6B [SECOND PART]" /note="Mb0413, pks6b, len: 946 aa. Equivalent to 3' end of Rv0405, len: 1402 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 945 aa overlap). Probable pks6, membrane-bound polyketide synthase, highly similar to others e.g. CAC29643.1|AL583917 putative polyketide synthase from Mycobacterium leprae (2103 aa); Y06K_MYCTU|Q10977 probable polyketide synthase (1876 aa), FASTA scores: opt: 2303, E(): 0, (38.7% identity in 1232 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, 2 x PS00017 ATP/GTP-binding site motif A (P-loop), and PS00012 Phosphopantetheine attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pks6 exists as a single gene. In Mycobacterium bovis, a frameshift due to single base insertion (*-g), splits pks6 into 2 parts, pks6a and pks6b. Protein product from Mb0413 detected using SWATH mass spectrometry. Mb0413 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248119.1" /translation="MLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGR PRRAGVSSFGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVISGKTASALA AQAGRLGRYVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQTRDELLAGLAGVVAGRP EAGVVCGVGKPAGKTAFVFAGQGSQWLGMGSELYAAYPVFAEALDAVVDELDRHLRYP LRDVIWGHDQDLLNTTEFAQPALFAVEVALYRLLMSWGVRPGLVLGHSVGELAAAHVA GALCLPDAAMLVAARGRLMQALPAGGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVV ISGAHDAVSAIADRLRGQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIP VISNVTGQLVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGGGLTSLI EASLADAQIVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASVFSGYRPKRVELPTY AFQHQKFWLAPAPSVSDPTAAGQIGASDGGAELLASSGFAARLAGRSADEQLAAAIEV VCEHAAAVLGRDGAAGLDAGQAFADSGFNSLSAVELRNRLTAVTAVTLPATAIFDHPT PTELAQYLITQIDGHGSSAAAAANPAERIDALTDLFLQACDAGRDADGWKMVALASNT RERMSSPVRNNVSKNVALLADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYS LTLPGFDSSDALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCSHLSV KHQRNPLGVALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVARMLNRLTATRLT AAATYAAIFQAWEPGRSMAPVLNIVAKDRIATVENLREERINRWRTAAAEAAYSVAEV PGDHFGVMSTSSEAIATEIHDWISGLVRGPHP" CDS complement(490906..491724) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0414C" /product="BETA LACTAMASE LIKE PROTEIN" /note="Mb0414c, -, len: 272 aa. Equivalent to Rv0406c, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Beta-lactamase-like protein, equivalent to AAD38170.1|AF152397_1 beta-lactamase-like protein from Mycobacterium phlei (243 aa); AL023514|MLCB4_8 hypothetical protein from Mycobacterium leprae (251 aa), FASTA scores: opt: 1284, E(): 0, (74.9% identity in 243 aa overlap); and AAD38164.1|AF152394_2 beta-lactamase-like protein from Mycobacterium avium (247 aa), FASTA scores: opt: 1301, E(): 0, (74.2% identity in 244 aa overlap); etc. Also slight similarity to others beta-lactamases and hypothetical proteins e.g. P52700|BLA1_XANMA|628530|S45349 METALLO-BETA-LACTAMASE L1 PRECURSOR (BETA-LACTAMASE, TYPE II) (PENICILLINASE) from Xanthomonas maltophilia (290 aa), FASTA scores: (34.4% identity in 96 aa overlap). Protein product from Mb0414c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0414c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248120.1" /translation="MVATRGTRLAALALAPRLAGMAELVQITDKVHLARGHAVNWVLV TDDTGVLLIDAGYPGDRAEVLASLNKLGYTPGDVRAIVLTHAHIDHLGSAIWFAREHS TPVYCHAEEVGHAKREYRENASVFDVALRSWRPRVAVWGIHLLRRGGLTGDGIPTAQP LTAEAAAGLPGQPMAIFTPGHTSGHCSYVVDGVLASGDALITGHPMLRHRGPQLLPAV FSHSQQNSIRSLAALALLETNILAPGHGELWHGPIRKATDEALERAQKSNHVFR" CDS 491802..492812 /codon_start=1 /transl_table=11 /gene="fgd1" /locus_tag="BQ2027_MB0415" /standard_name="fgd" /product="f420-dependent glucose-6-phosphate dehydrogenase fgd1" /note="Mb0415, fgd1, len: 336 aa. Equivalent to Rv0407, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Probable fgd1, F420-dependent glucose-6-phosphate dehydrogenase (EC 1.-.-.-), equivalent to others from Mycobacteria e.g. AAD38165.1|AF152394_3 from M. avium (336 aa), FASTA scores: opt: 2082, E(): 0, (89.9% identity in 336 aa overlap); AL023514|MLCB 4_7 from Mycobacterium leprae (336 aa), FASTA scores: opt: 2069, E(): 0, (89.0% identity in 336 aa overlap). Also similar to other dehydrogenases e.g. CAA77276.1|Y18730 F420-dependent alcohol dehydrogenase from Methanofollis liminatans (330 aa). Also similar to many proteins from Mycobacterium tuberculosis e.g. Rv0953c, Rv0791c, etc. Note that previously known as fgd. Protein product from Mb0415 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0415 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248121.1" /translation="MAELKLGYKASAEQFAPRELVELAVAAEAHGMDSATVSDHFQPW RHQGGHAPFSLSWMTAVGERTNRLLLGTSVLTPTFRYNPAVIAQAFATMGCLYPNRVF LGVGTGEALNEIATGYEGAWPEFKERFARLRESVGLMRQLWSGDRVDFDGDYYRLKGA SIYDVPDGGVPVYIAAGGPAVAKYAGRAGDGFICTSGKGEELYTEKLMPAVREGAAAA DRSVDGIDKMIEIKISYDPDPELALNNTRFWAPLSLTAEQKHSIDDPIEMEKAADALP IEQIAKRWIVASDPDEAVEKVGQYVTWGLNHLVFHAPGHDQRRFLELFQSDLAPRLRR LG" CDS 492805..494877 /codon_start=1 /transl_table=11 /gene="pta" /locus_tag="BQ2027_MB0416" /product="PROBABLE PHOSPHATE ACETYLTRANSFERASE PTA (PHOSPHOTRANSACETYLASE)" /note="Mb0416, pta, len: 690 aa. Equivalent to Rv0408, len: 690 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 690 aa overlap). Probable pta, phosphate acetyltransferase (EC 2.3.1.8), highly similar to others e.g. PTA_ECOLI|P39184|11279789|JX0357|B2297 phosphate acetyltransferase from Escherichia coli strain K12 (713 aa), FASTA scores: opt: 1303, E(): 0, (38.0% identity in 718 aa overlap); etc. BELONGS TO THE PHOSPHATE ACETYLTRANSFERASE AND BUTYRYLTRANSFERASE FAMILY. Protein product from Mb0416 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0416 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248122.1" /translation="MADSSAIYLAAPESQTGKSTIALGLLHRLTAMVAKVGVFRPITR LSAERDYILELLLAHTSAGLPYERCVGVTYQQLHADRDDAIAEIVDSYHAMADECDAV VVVGSDYTDVTSPTELSVNARIAVNLGAPVLLTVRAKDRTPDQVASVVEVCLAELDTQ RAHTAAVVANRCELSAIPAVTDALRRFTPPSYVVPEEPLLSAPTVAELTQAVNGAVVS GDVALREREVMGVLAAGMTADHVLERLTDGMAVITPGDRSDVVLAVASAHAAEGFPSL SCIVLNGGFQLHPAIAALVSGLRLRLPVIATALGTYDTASAAASARGLVTATSQRKID TALELMDRHVDVAGLLAQLTIPIPTVTTPQMFTYRLLQQARSDLMRIVLPEGDDDRIL KSAGRLLQRGIVDLTILGDEAKVRLRAAELGVDLDGATVIEPCASELHDQFADQYAQL RKAKGITVEHAREIMNDATYFGTMLVHNCHADGMVSGAAHTTAHTVRPALEIIKTVPG ISTVSSIFLMCLPDRVLAYGDCAIIPNPTVEQLADIAICSARTAAQFGIEPRVAMLSY STGDSGKGADVDKVRAATELVRAREPQLPVEGPIQYDAAVEPSVAATKLRDSPVAGRA TVLIFPDLNTGNNTYKAVQRSAGAIAIGPVLQGLRKPVNDLSRGALVDDIVNTVAITA IQAQGVHE" CDS 494870..496027 /codon_start=1 /transl_table=11 /gene="ackA" /locus_tag="BQ2027_MB0417" /product="PROBABLE ACETATE KINASE ACKA (ACETOKINASE)" /note="Mb0417, ackA, len: 385 aa. Equivalent to Rv0409, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 385 aa overlap). Probable ackA, acetate kinase (EC 2.7.2.1), highly similar to others e.g. ACKA_BACSU|P37877 acetate kinase from Bacillus subtilis (395 aa), FASTA scores: opt: 974, E(): 0, (43.5% identity in 393 aa overlap); etc. Contains PS01075 Acetate and butyrate kinases family signature 1, PS00758 ArgE / dapE / ACY1/ CPG2 / yscS family signature 1. BELONGS TO THE ACETOKINASE FAMILY. Protein product from Mb0417 detected using SWATH mass spectrometry. Mb0417 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248123.1" /translation="MSSTVLVINSGSSSLKFQLVEPVAGMSRAAGIVERIGERSSPVA DHAQALHRAFKMLAEDGIDLQTCGLVAVGHRVVHGGTEFHQPTLLDDTVIGKLEELSA LAPLHNPPAVLGIKVARRLLANVAHVAVFDTAFFHDLPPAAATYAIDRDVADRWHIRR YGFHGTSHQYVSERAAAFLGRPLDGLNQIVLHLGNGASASAIARGRPVETSMGLTPLE GLVMGTRSGDLDPGVISYLWRTARMGVEDIESMLNHRSGMLGLAGERDFRRLRLVIET GDRSAQLAYEVFIHRLRKYLGAYLAVLGHTDVVSFTAGIGENDAAVRRDALAGLQGLG IALDQDRNLGPGHGARRISSDDSPIAVLVVPTNEELAIARDCLRVLGGRRA" CDS complement(496081..498333) /codon_start=1 /transl_table=11 /gene="pknG" /locus_tag="BQ2027_MB0418C" /product="SERINE/THREONINE-PROTEIN KINASE PKNG (PROTEIN KINASE G) (STPK G)" /note="Mb0418c, pknG, len: 750 aa. Equivalent to Rv0410c, len: 750 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 750 aa overlap). pknG, serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), equivalent to PKNG_MYCLE|P57993|13092623|CAC29812.1|AL583918 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium leprae (767 aa). Also similar to others e.g. AB76890.1|AL159139 putative serine/threonine protein kinase from Streptomyces coelicolor (774 aa); etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Protein product from Mb0418c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0418c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248124.1" /translation="MAKASETERSGPGTQPADAQTATSATVRPLSTQAVFRPDFGDED NFPHPTLGPDTEPQDRMATTSRVRPPVRRLGGGLVEIPRAPDIDPLEALMTNPVVPES KRFCWNCGRPVGRSDSETKGASEGWCPYCGSPYSFLPQLNPGDIVAGQYEVKGCIAHG GLGWIYLALDRNVNGRPVVLKGLVHSGDAEAQAMAMAERQFLAEVVHPSIVQIFNFVE HTDRHGDPVGYIVMEYVGGQSLKRSKGQKLPVAEAIAYLLEILPALSYLHSIGLVYND LKPENIMLTEEQLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRT LAALTLDLPTRNGRYVDGLPEDDPVLKTYDSYGRLLRRAIDPDPRQRFTTAEEMSAQL TGVLREVVAQDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDGQVHAEKLTANEIV TALSVPLVDPTDVAASVLQATVLSQPVQTLDSLRAARHGALDADGVDFSESVELPLME VRALLDLGDVAKATRKLDDLAERVGWRWRLVWYRAVAELLTGDYDSATKHFTEVLDTF PGELAPKLALAATAELAGNTDEHKFYQTVWSTNDGVISAAFGLARARSAEGDRVGAVR TLDEVPPTSRHFTTARLTSAVTLLSGRSTSEVTEEQIRDAARRVEALPPTEPRVLQIR ALVLGGALDWLKDNKASTNHILGFPFTSHGLRLGVEASLRSLARVAPTQRHRYTLVDM ANKVRPTSTF" CDS complement(498333..499319) /codon_start=1 /transl_table=11 /gene="glnH" /locus_tag="BQ2027_MB0419C" /product="PROBABLE GLUTAMINE-BINDING LIPOPROTEIN GLNH (GLNBP)" /note="Mb0419c, glnH, len: 328 aa. Equivalent to Rv0411c, len: 328 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 328 aa overlap). Probable glnH, glutamine-binding protein, membrane-bound lipoprotein (see citation below), equivalent to AL035159|MLCB1450_15|T44736|4154051|CAA22704.1 glutamine-binding protein homolog from Mycobacterium leprae (325 aa), FASTA scores: opt: 1747, E(): 0, (79.3% identity in 328 aa overlap). Also similar to others e.g. GLNH_BACST|P27676 glutamine-binding protein precursor from Bacillus stearothermophilus (262 aa), FASTA scores: opt: 493, E(): 7.5e-22, (37.8% identity in 193 aa overlap); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, PS01039 Bacterial extracellular solute-binding proteins, family 3 signature. BELONGS TO THE BACTERIAL EXTRACELLULAR SOLUTE-BINDING PROTEIN FAMILY 3. Presumed attached to the membrane by a lipid anchor. Protein product from Mb0419c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0419c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248125.1" /translation="MTRRALLARAAAPLAPLALAMVLASCGHSETLGVEATPTLPLPT PVGMEIMPPQPPLPPDSSSQDCDPTASLRPFATKAEADAAVADIRARGRLIVGLDIGS NLFSFRDPITGEITGFDVDIAGEVARDIFGVPSHVEYRILSAAERVTALQKSQVDIVV KTMSITCERRKLVNFSTVYLDANQRILAPRDSPITKVSDLSGKRVCVARGTTSLRRIR EIAPPPVIVSVVNWADCLVALQQREIDAVSTDDTILAGLVEEDPYLHIVGPDMADQPY GVGINLDNTGLVRFVNGTLERIRNDGTWNTLYRKWLTVLGPAPAPPTPRYVD" CDS complement(499319..500638) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0420C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0420c, -, len: 439 aa. Equivalent to Rv0412c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 439 aa overlap). Possible conserved membrane protein, equivalent to AL035159|MLCB1450_16|T44737 probable membrane protein from Mycobacterium leprae (403 aa), FASTA scores: opt: 2027, E(): 0, (80.4% identity in 403 aa overlap). Also some similarity with CAB71201.1|AL138538 putative secreted protein from Streptomyces coelicolor (429 aa). Protein product from Mb0420c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0420c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248126.1" /translation="MTVELAHPSTEPLGSRSPAEPAHPRRWFISTTPGRIMTIGIVLA ALGVASAFATSTTIEHRQQVLTAVLDHTEPLSFAAGRLYTTLSVADAAAATAFIAQAE PGGVRLRYEQAITDASVAVTRASSGLTDESLVQLLGRINAELAVYTGLVEIARANNRA GNPVGSSYLSEASGLMQSTILPDAQRLYQATSARVDRETTASTQIPAPVILVVATTVV FGAFAHRWLARRTRRRINPGLVVGALGILVMVVWVGTALTISTTASRSAKDTAAESLK TITNLAITAQQARADETLSLIRRGDEEVRKQAFYQRIDAMQRQLNDYMARRHAVDKPD LQGADQLLVRWRQANDRINSYISVGNYRAATQVALGKGEDDATPAFDKLDEALTKAMG QSRTQLRHDILNAHRGLAGAQVGGVVLSLGAAIAVALGLWPRLKEYR" CDS 500732..501385 /codon_start=1 /transl_table=11 /gene="mutT3" /locus_tag="BQ2027_MB0421" /product="POSSIBLE MUTATOR PROTEIN MUTT3 (7,8-DIHYDRO-8-OXOGUANINE-TRIPHOSPHATASE) (8-OXO-DGTPASE) (DGTP PYROPHOSPHOHYDROLASE)" /note="Mb0421, mutT3, len: 217 aa. Equivalent to Rv0413, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Possible mutT3, mutator protein (EC 3.6.1.-), showing some similarity with e.g. MUTT_PROVU|P32090 mutator mutt protein from Proteus vulgaris (112 aa), FASTA scores: opt: 151, E(): 0.0008, (40.7% identity in 59 aa overlap). SEEMS TO BELONG TO THE NUDIX HYDROLASE FAMILY. Protein product from Mb0421 detected using SWATH mass spectrometry. Mb0421 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248127.1" /translation="MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRP DGTPAVLLQHRALWSHQGGTWGLPGGARDSHETPEQTAVRESSEEAGLSAERLEVRAT VVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVADLPLHPGFAAS WQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPGDADQAPSPLGRRISSLL" CDS complement(501369..502037) /codon_start=1 /transl_table=11 /gene="thiE" /locus_tag="BQ2027_MB0422C" /product="thiamine-phosphate pyrophosphorylase thie (tmp pyrophosphorylase) (tmp-ppase) (thiamine-phosphate synthase)" /note="Mb0422c, thiE, len: 222 aa. Equivalent to Rv0414c, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 222 aa overlap). Probable thiE, thiamin phosphate pyrophosphorylase (EC 2.5.1.3), equivalent to Q9ZBL5|AL035159|MLCB1450_17 PROBABLE THIAMINE-PHOSPHATE PYROPHOSPHORYLASE from Mycobacterium leprae (235 aa), FASTA scores: opt: 1095, E(): 0, (78.0% identity in 223 aa overlap). Also similar to others e.g. T34974|5689976|CAB52013.1|AL109663 probable thiamin phosphate pyrophosphorylase from Streptomyces coelicolor (223 aa); THIE_ECOLI|P30137 thie protein from Escherichia coli strain K12 (211 aa), FASTA scores: opt: 275, E(): 7.8e-12, (37.8% identity in 196 aa overlap); etc. BELONGS TO THE TMP-PPASE FAMILY. Protein product from Mb0422c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0422c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248128.1" /translation="MHESRLASARLYLCTDARRERGDLAQFAEAALAGGVDIIQLRDK GSPGELRFGPLQARDELAACEILADAAHRYGALFAVNDRADIARAAGADVLHLGQRDL PVNVARQILAPDTLIGRSTHDPDQVAAAAAGDADYFCVGPCWPTPTKPGRAAPGLGLV RVAAELGGDDKPWFAIGGINAQRLPAVLDAGARRIVVVRAITSADDPRAAAEQLRSAL TAAN" CDS 502167..503189 /codon_start=1 /transl_table=11 /gene="thiO" /locus_tag="BQ2027_MB0423" /product="POSSIBLE THIAMINE BIOSYNTHESIS OXIDOREDUCTASE THIO" /note="Mb0423, thiO, len: 340 aa. Equivalent to Rv0415, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 340 aa overlap). Possible thiO, thiamine biosynthesis oxidoreductase (EC 1.-.-.-), equivalent to T44739|4154054|CAA22708.1|AL035159|MLCB1450. 24 hypothetical protein from Mycobacterium leprae (340 aa), FASTA scores: opt: 1867, E(): 0, (82.0% identity in 338 aa overlap). Shows some similarity to other thiO proteins e.g. THIO_RHIET|O34292 Putative thiamine biosynthesis oxidoreductase from Rhizobium etli plasmid pb (327 aa) (see citation below); AAG31046.1|AF264948_8|THIO putative amino acid oxidase flavoprotein ThiO from Erwinia amylovora (349 aa); NP_106392.1|14025578|BAB52178.1|AP003007|THIO THIAMINE BIOSYNTHESIS OXIDOREDUCTASE THIO from Mesorhizobium loti (333 aa); etc. Protein product from Mb0423 detected using SWATH mass spectrometry. Mb0423 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248129.1" /translation="MASDLHTGSLAVIGGGVIGLSVARRAAQAGWPVRVHRSDERGAS WVAGGMLAPHSEGWPGEERLLRLGLQSLRLWREGSFLDGLGPQLVTAHESLVVAVDRA DVADLRTVADWLSAQGHPVIWESAARDVEPLLAQGIRHGFRAPTELAVDNRALLDVLC RDCERLGVRWSSQVSSLSDVDAHTVVIANGIDAPALWPGLPIRPVKGEVLRLRWRPGC MPLPQRVIRARVRGRQVYLVPRSDGVVVGATQYEHGRDTAPVVSGVRDLLDDACTVLP ALGEYELAECEAGLRPMTPDNLPLVQRLDSRTLVAAGHGRSGFLLAPWTAEQIVSELV SVGAAS" CDS 503186..503392 /codon_start=1 /transl_table=11 /gene="thiS" /locus_tag="BQ2027_MB0424" /product="POSSIBLE PROTEIN THIS" /note="Mb0424, thiS, len: 68 aa. Equivalent to Rv0416, len: 68 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Possible thiS protein, equivalent to T44740|4154055|CAA22709.1|AL035159|MLCB1450. 25 hypothetical protein from Mycobacterium leprae (74 aa), FASTA scores: opt: 303, E(): 2e-18, (71.6% identity in 74 aa overlap). Shows weak similarity with O32583|THIS_ECOLI|THIG1|B3991.1 THIS PROTEIN from Escherichia coli strain K12 (66 aa), FASTA scores: opt: 103, E(): 0.052, (30.9% identity in 68 aa overlap). Mb0424 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248130.1" /translation="MIVVVNEQQVEVDEQTTIAALLDSLGFGDRGIAVALNFSVLPRS DWATKICELRKPVRLEVVTAVQGG" CDS 503385..504143 /codon_start=1 /transl_table=11 /gene="thiG" /locus_tag="BQ2027_MB0425" /product="PROBABLE THIAMIN BIOSYNTHESIS PROTEIN THIG (THIAZOLE BIOSYNTHESIS PROTEIN)" /note="Mb0425, thiG, len: 252 aa. Equivalent to Rv0417, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 252 aa overlap). Probable thiG, thiamin biosynthesis protein, equivalent to AL035159|MLCB1450_20|T44741|THIG probable thiamin biosynthesis protein from Mycobacterium leprae (261 aa), FASTA scores: opt: 1380, E(): 0, (86.8% identity in 250 aa overlap). Also highly similar to others e.g. SCOEDB|SC6E10.03|T35490|THIG probable thiazole biosynthesis protein from Streptomyces coelicolor (264 aa); F82761|9105679|AAF83593.1|AE003919_4|XF0783|THIG thiamin biosynthesis protein thiG from Xylella fastidiosa (275 aa); P30139|THIG_ECOLI|7448315|B65206|409790|AAC43089 .1|U00006 THIG PROTEIN thiamin biosynthesis protein from Escherichia coli strain K-12 (281 aa); etc. BELONGS TO THE THIG FAMILY. Protein product from Mb0425 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0425 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248131.1" /translation="MAESKLVIGDRSFASRLIMGTGGATNLAVLEQALIASGTELTTV AIRRVDADGGTGLLDLLNRLGITPLPNTAGCRSAAEAVLTAQLAREALNTNWVKLEVI ADERTLWPDAVELVRAAEQLVDDGFVVLPYTTDDPVLARRLEDTGCAAVMPLGSPIGT GLGIANPHNIEMIVAGARVPVVLDAGIGTASDAALAMELGCDAVLLASAVTRAADPPA MAAAMAAAVTAGYLARCAGRIPKRFWAQASSPAR" CDS 504515..506017 /codon_start=1 /transl_table=11 /gene="lpqL" /locus_tag="BQ2027_MB0426" /product="PROBABLE LIPOPROTEIN AMINOPEPTIDASE LPQL" /note="Mb0426, lpqL, len: 500 aa. Equivalent to Rv0418, len: 500 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 500 aa overlap). Probable lpqL, lipoprotein aminopeptidase (EC 3.4.11.-), similar to others e.g. B83278|9949035|AAG06327.1|AE004720_3|AE004720| PA2939 probable aminopeptidase from Pseudomonas aeruginosa (536 aa); P80561|APX_STRGR|SGAP|S66427 aminopeptidase (EC 3.4.11.-) from Streptomyces griseus (284 aa) (homology only with C-terminus of Rv0418); P37302|APE3_YEAST|1077010|A54134 aminopeptidase Y (EC 3.4.11.-) from Saccharomyces cerevisiae (537 aa); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0426 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0426 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248132.1" /translation="MVNKSRMMPAVLAVAVVVAFLTTGCIRWSTQSRPVVNGPAAAEF AVALRNRVSTDAMMAHLSKLQDIANANDGTRAVGTPGYQASVDYVVNTLRNSGFDVQT PEFSARVFKAEKGVVTLGGNTVEARALEYSLGTPPDGVTGPLVAAPADDSPGCSPSDY DRLPVSGAVVLVDRGVCPFAQKEDAAAQRGAVALIIADNIDEQAMGGTLGANTDVKIP VVSVTKSVGFQLRGQSGPTTVKLTASTQSFKARNVIAQTKTGSSANVVMAGAHLDSVP EGPGINDNGSGVAAVLETAVQLGNSPHVSNAVRFAFWGAEEFGLIGSRNYVESLDIDA LKGIALYLNFDMLASPNPGYFTYDGDQSLPLDARGQPVVPEGSAGIERTFVAYLKMAG KTAQDTSFDGRSDYDGFTLAGIPSGGLFSGAEVKKSAEQAELWGGTADEPFDPNYHQK TDTLDHIDRTALGINGAGVAYAVGLYAQDLGGPNGVPVMADRTRHLIAKP" CDS 506105..507601 /codon_start=1 /transl_table=11 /gene="lpqM" /locus_tag="BQ2027_MB0427" /product="POSSIBLE LIPOPROTEIN PEPTIDASE LPQM" /note="Mb0427, lpqM, len: 498 aa. Equivalent to Rv0419, len: 498 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 498 aa overlap). Possible lpqM, lipoprotein peptidase (EC 3.4.-.-); has potential N-terminal signal peptide and contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb0427 detected using SWATH mass spectrometry. Mb0427 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248133.1" /translation="MHGRGRYRPLVRCVRPRRVAASVRTPIACLAAVVVIAGCTTVVD GRALSILNDPFRVGGLPATNGPSGARPDAPAASGTVINTNNGAIDKLSLLSVNDIEDY WMAVYSESLKGTFRPVGKLVSYDSNDPSSPIVCHIDTYQLVNAFFSSRCNLIAWDRGV FMAVAQEYFGDMSVNGVLAHEFGHALQVMANLVTRKDPTIVREQQADCFAGVYLWWVA EGKSTRFTLSTADGLDHVLAGIITTRDPVMEADAENDDEHGSALDRVSAFQLGFINGT PACAAIDEDEVERRRGDLPTTLRVDASGNPETGEVGINEETLSTLMELMGKIFSPKNP PTLSYQPAGCPDAKPSPPAAYCPATNTIVVDLPALARMGKVASAAEHSLPQGDDTSLS IVMSRYALAVQHERGLPMQSPWTALRTACLTGVAHRKMAVPIDLPSGQQLVLTAGDLD EAVSGLLTNRMVASDADGVSVPAGFTRIAAFRAGVGGDMDACYARYPG" CDS complement(507580..507990) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0428C" /product="POSSIBLE TRANSMEMBRANE PROTEIN" /note="Mb0428c, -, len: 136 aa. Equivalent to Rv0420c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 136 aa overlap). Possible transmembrane protein; has potential transmembrane domains aa 53-99 and aa 100-122. Mb0428c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248134.1" /translation="MRLHDASAAAPESRMHIARHEEAVNRRQMFIGITGLLLAVIGLM ALWFPVYLDQYDAYGIKVTCGSGWRSNLTQALYADGNDNTQALVTRCDTALLVRRAWA IPSVALGWLLVTGFLVMWVHNDQHQGQSYPGYRA" CDS complement(508151..508780) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0429C" /product="Predicted hydrolase of the alpha/beta-hydrolase fold" /note="Mb0429c, -, len: 209 aa. Equivalent to Rv0421c, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Conserved hypothetical protein, showing similarity with NP_103507.1|14022684|BAB49293.1|AP002998 hypothetical protein from Mesorhizobium loti (214 aa). Protein product from Mb0429c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0429c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248135.1" /translation="MNLDQIAGVAHQPAGPPHGVVVLTHGAGGSRESTLLQQVCAEWT RRGWLAVRYNLPYRRRRPTGPPSGSGSGDRAGIVEAIQLCRGLAEGPLIAGGHSYGGR QTSMVVAAGQAPVDVLTLFSYPVHPPGKPERVRTEHLPGIAVPTVFTHGTADPFGTLA QVRSAAAMVSAPTEVVEITGARHDLGSKTLDVARLAVDAALRLSAGQIA" CDS complement(508777..509574) /codon_start=1 /transl_table=11 /gene="thiD" /locus_tag="BQ2027_MB0430C" /product="PROBABLE PHOSPHOMETHYLPYRIMIDINE KINASE THID (HMP-PHOSPHATE KINASE) (HMP-P KINASE)" /note="Mb0430c, thiD, len: 265 aa. Equivalent to Rv0422c, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). Probable thiD, phosphomethylpyrimidine kinase (EC 2.7.4.7), equivalent to AL035159|MLCB1450_21 PHOSPHOMETHYLPYRIMIDINE KINASE from Mycobacterium leprae (279 aa), FASTA scores: opt: 1386, E(): 0, (77.8% identity in 266 aa overlap). Also highly similar to others e.g. HIU32725_3|P44697|THID_HAEIN PHOSPHOMETHYLPYRIMIDINE KINASE from Haemophilus influenzae (269 aa), FASTA scores: opt: 605, E(): 0, (42.1% identity in 259 aa overlap). BELONGS TO THE THID FAMILY. Protein product from Mb0430c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0430c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248136.1" /translation="MTPPRVLSIAGSDSGGGAGIQADMRTMALLGVHACVAVTAVTVQ NTLGVKDIHEVPNDVVAGQIEAVVTDIGVQAAKTGMLASSRIVATVAATWRRLELSVP LVVDPVCASMHGDPLLAPSALDSLRGQLFPLATLLTPNLDEARLLVDIEVVDAESQRA AAKALHALGPQWVLVKGGHLRSSDGSCDLLYDGVSCYQFDAQRLPTGDDHGGGDTLAT AIAAALAHGFTVPDAVDFGKRWVTECLRAAYPLGRGHGPVSPLFRLS" CDS complement(509601..511244) /codon_start=1 /transl_table=11 /gene="thiC" /locus_tag="BQ2027_MB0431C" /product="PROBABLE THIAMINE BIOSYNTHESIS PROTEIN THIC" /note="Mb0431c, thiC, len: 547 aa. Equivalent to Rv0423c, len: 547 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 547 aa overlap). Probable thiC, thiamin biosynthesis protein, equivalent to Q9ZBL0|THIC_MYCLE|11279601|T44743|AL035159|MLCB1450_22 THIAMINE BIOSYNTHESIS PROTEIN from Mycobacterium leprae (547 aa), FASTA scores: opt: 3283, E(): 0, (90.1% identity in 547 aa overlap). Also highly similar to others e.g. P45740|THIC_BACSU THIAMIN BIOSYNTHESIS PROTEIN from Bacillus subtilis (590 aa), FASTA scores: opt: 2295, E(): 0, (65.2% identity in 580 aa overlap); P30136|THIC_ECOLI THIC PROTEIN from Escherichia coli strain K12 (631 aa), FASTA scores: opt: 2141, E(): 0, (62.1% identity in 568 aa overlap); etc. BELONGS TO THE THIC FAMILY. Protein product from Mb0431c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0431c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248137.1" /translation="MTITVEPSVTTGPIAGSAKAYREIEAPGSGATLQVPFRRVHLST GDHFDLYDTSGPYTDTDTVIDLTAGLPHRPGVVRDRGTQLQRARAGEITAEMAFIAAR EDMSAELVRDEVARGRAVIPANHHHPESEPMIIGKAFAVKVNANIGNSAVTSSIAEEV DKMVWATRWGADTIMDLSTGKNIHETREWILRNSPVPVGTVPIYQALEKVKGDPTELT WEIYRDTVIEQCEQGVDYMTVHAGVLLRYVPLTAKRVTGIVSRGGSIMAAWCLAHHRE SFLYTNFEELCDIFARYDVTFSLGDGLRPGSIADANDAAQFAELRTLGELTKIAKAHG AQVMIEGPGHIPMHKIVENVRLEEELCEEAPFYTLGPLATDIAPAYDHITSAIGAAII AQAGTAMLCYVTPKEHLGLPDRKDVKDGVIAYKIAAHAADLAKGHPRAQERDDALSTA RFEFRWNDQFALSLDPDTAREFHDETLPAEPAKTAHFCSMCGPKFCSMRITQDVREYA AEHGLETEADIEAVLAAGMAEKSREFAEHGNRVYLPITQ" CDS complement(511396..511671) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0432C" /product="HYPOTHETICAL PROTEIN" /note="Mb0432c, -, len: 91 aa. Equivalent to Rv0424c, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 91 aa overlap). Hypothetical unknown protein. Protein product from Mb0432c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0432c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248138.1" /translation="MAEKNTRRATSQREAVAKIREAETIVMNLPICGQVKIPRPEHLA YYGGLAALAALELIDWPVALVIATGHILANNHHNRVLEELGEAMEEA" CDS complement(511721..516340) /codon_start=1 /transl_table=11 /gene="ctpH" /locus_tag="BQ2027_MB0433C" /product="POSSIBLE METAL CATION TRANSPORTING P-TYPE ATPASE CTPH" /note="Mb0433c, ctpH, len: 1539 aa. Equivalent to Rv0425c, len: 1539 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1539 aa overlap). Possible ctpH, metal cation-transporting P-type ATPase (transmembrane protein) (EC 3.6.1.-), showing some similarity with CAA17934.1|AL022118|13093871|CAC32203.1|AL583926 putative cation-transporting ATPase from Mycobacterium leprae (1609 aa). Also similar to others ATPases e.g. AE000873_1 CATION-TRANSPORTING P-ATPASE from Methanobacterium thermoautotrop (844 aa), FASTA score: (30.5% identity in 827 aa overlap); AB69720.1|AL137166 putative transport ATPase from Streptomyces coelicolor (1472 aa); etc. C-terminal region similar to other ATPases from Mycobacterium tuberculosis e.g. Y05Q_MYCTU|Q10900 putative cation-transporting ATPase C (855 aa), FASTA scores: opt: 770, E(): 5.3e-32, (44.9% identity in 820 aa overlap). Protein product from Mb0433c detected using SWATH mass spectrometry. Mb0433c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248139.1" /translation="MPVRAVATGFRATATLTGASITAATAVSATLAKTGVGTGMKVAI IPLRAGAKALSGELSRETLGRNCWRGERRAWIEVRGLRSGGDDELGRVVLNAIQAHPG VGSASLNYPLSRVVVAIDDPDTSLRELCRIVDDAEKAERHRHPDQAADQLAQSPGSLP GDGVLLAVRAVTVAATAAGLGLALGGRALRWPRFPLVIEAAVAAVDHQPLLRRLLEDR IGTAATATVLELAMAAAHTVTLSPAALSVDLTIQALKAAECRAGARAWRRHEPQLALH ADEPADQPQSLWPRPARSTQPVQRSVARFALIQALSAVLVGAGTRDADMAATATLVAT PKASRTTPEAFAAALGQGLADQHAVLPLRPESLRRLDRVDAIVIDPRVLCTDDLRVAR IRGCGADELSTAWNRAQLVLTESGLRPGWHRVPGVSASGSDSAVEALFRPMHDRLASA VVAEAHRTGADLVSVDVDALGELRPVFDDIRPLDDGASGSLDEALARAVAELRQAGRT VAVLSSVGKQALSAADVALGVLPPPGAGAPPWYADVLLPDLGAAWRVLHAIPAARAAR QRGNEISGGASALGALLMLPGVRGLGPGPVTTGAAAGLLSGYLLARKVVDAQAPRPAP AHEWHAMSVEQVRKALPSPDEQAPAKAPPSPYPARALAGGLHTAKRGAQITQAPLNAL WQLTKAVRAELSDPLTPMLALGAMASAVLGSPVDAVMVGSVLTGNSILAASQRLRAES RLNRLLAQQIPPARKVLAGADDQPRYIEVRAEELRPGDIIEVRTHEVVPADARVIEEV DVEVDESALTGESLSVTKQVEPTPGVDLIERRCMLYAGTTVVSGTAVAVVTAVGPDTQ ERRAAELVSGDLSSVGLQHQLSRLTNQAWPVSMTGGALVTGLGLLRRRGLRQAVASGI AVTVAAVPEGMPLVATLAQQASARRLSHFGALVRIPRSVEALGRVDMVCFDKTGTLSE NRLRVAQVRPVAGHSREEVLRCAAHAAPASNGPQVHATDVAIVQAAAAAAASGTDGAE PGAAEPAAHLPFRSGRSFSASVSGTELTVKGAPEVVLAACEGIGSSMDDAVAELAANG LRVIAVAHRQLTAQQAQSVVDDPDEIARLCRDELSLVGFLGLSDTPRAQAAALLADLH EHDLDIRLITGDHPITAAAIAEELGMQVSPEQVISGAEWDALSRKDQERAVAERVIFA RMTPENKVQIVQTLEHSGRVCAMVGDGSNDAAAIRAATVGIGVVAHGSDPARVAADLV LVDGRIESLLPAILEGRQLWQRVQAAVSVLLGGNAGEVAFAIIGSAITGTSPLNTRQL LLVNMLTDALPAAALAVSKPSDPVTPATRGPDQRELWRAVGIRGATTAAAATVAWVMA GFTGLPRRASTVALVALVAAQLGQTLVDSHAWLVVLTALGSLAALATLISIPVVSQLL GCTPLDPLGWAQATAAATAATVAVAVLNRVLTGRDKSGQPNPQPPETDALSRDASPGA PPGPRRRRRATARRKAPVKAPSATRQTTKPKGPPAHRSSSTYPRR" CDS complement(516392..516835) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0434C" /product="POSSIBLE TRANSMEMBRANE PROTEIN" /note="Mb0434c, -, len: 147 aa. Equivalent to Rv0426c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Possible transmembrane protein; has potential transmembrane domains aa 19-41, and aa 61-83. Protein product from Mb0434c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0434c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248140.1" /translation="MSVVGGTVRTVGRTVSGAATATTAAAGAVGGAAVSGIVGGVTGA AKGIQKGLSSGSKSTAAAALAIGAIGVAGLVDWPILLAVGGGALLLRKLNRTPEVAAP PVKAKLAPVPDKPAAAKEAPAKASKTTARKTSGRRAGTAELRSTN" CDS complement(517036..517911) /codon_start=1 /transl_table=11 /gene="xthA" /locus_tag="BQ2027_MB0435C" /product="PROBABLE EXODEOXYRIBONUCLEASE III PROTEIN XTHA (EXONUCLEASE III) (EXO III) (AP ENDONUCLEASE VI)" /note="Mb0435c, xthA, len: 291 aa. Equivalent to Rv0427c, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Probable xthA (alternate gene name: xth), exodeoxyribonuclease III protein (EC 3.1.11.2), similar to others e.g. EX3_ECOLI|P09030 exodeoxyribonuclease III from Escherichia Coli strain K12 (268 aa), FASTA scores: opt: 360, E(): 1.2e-17, (29.3% identity in 270 aa overlap); etc. BELONGS TO THE AP/EXOA FAMILY OF DNA REPAIR ENZYMES. Protein product from Mb0435c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0435c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248141.1" /translation="MPDGTIDGGHPQRPASPRLRSPLLRLATWNVNSIRTRLDRVLDW LGRADVDVLAMQETKCPDGQFPALPLFELGYDVAHVGFDQWNGVAIASRVGLDDVRVG FDGQPSWSGKPEVAATTEARALGATCGGIRVWSLYVPNGRALDDPHYTYKLDWLAALR DTAEGWLRDDPAAPIALMGDWNIAPTDDDVWSTEFFAGCTHVSEPERKAFNAIVDAQF TDVVRPFTPGPGVYTYWDYTQLRFPKKQGMRIDFILGSPALAARVMDAQIVREERKGK APSDHAPVLVDLHAG" CDS complement(517914..518822) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0436C" /product="gcn5-related n-acetyltransferase" /note="Mb0436c, -, len: 302 aa. Equivalent to Rv0428c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Hypothetical unknown protein. Protein product from Mb0436c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0436c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248142.1" /translation="MVSWPGLGTRVTVRYRRPAGSMPPLTDAVGRLLAVDPTVRVQTK TGTIVEFSPVDVVALRVLTDAPVRTAAIRALEHAAAAAWPGVERTWLDGWLLRAGHGA VLAANSAVPLDISAHTNTITEISAWYASRDLQPWLAVPDRLLPLPAGLAGERREQVLV RDVSTGEPDRSVTLLDHPDDTWLRLYHQRLPLDMATPVIDGELAFGSYLGVAVARAAV TDAPDGTRWVGLSAMRAADEQSATGSAGRQLWEALLGWGAGRGATRGYVRVHDTATSV LAESLGFRLHHHCRYLPAQSVGWDTF" CDS complement(518822..519415) /codon_start=1 /transl_table=11 /gene="def" /locus_tag="BQ2027_MB0437C" /product="PROBABLE POLYPEPTIDE DEFORMYLASE DEF (PDF) (FORMYLMETHIONINE DEFORMYLASE)" /note="Mb0437c, def, len: 197 aa. Equivalent to Rv0429c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 197 aa overlap). Probable def, polypeptide deformylase (EC 3.5.1.31), equivalent to CAC30884.1|AL583923 polypeptide deformylase from Mycobacterium leprae (197 aa). Also similar to others e.g. DEF_ECOLI|P27251|95874|S23107 polypeptide deformylase from Escherichia coli (169 aa), FASTA scores: opt: 179, E(): 1.8e-05, (34.6% identity in 162 aa overlap); etc. BELONGS TO THE POLYPEPTIDE DEFORMYLASE FAMILY. COFACTOR: BINDS 1 ZINC ION. Protein product from Mb0437c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0437c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248143.1" /translation="MTVVPIRIVGDPVLHTATTPVTVAADGSLPADLAQLIATMYDTM DAANGVGLAANQIGCSLRLFVYDCAADRAMTARRRGVVINPVLETSEIPETMPDPDTD DEGCLSVPGESFPTGRAKWARVTGLDADGSPVSIEGTGLFARMLQHETGHLDGFLYLD RLIGRYARNAKRAVKSHGWGVPGLSWLPGEDPDPFGH" CDS 519752..520060 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0438" /product="tuberculin secretion" /note="Mb0438, -, len: 102 aa. Equivalent to Rv0430, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). Conserved hypothetical protein, equivalent to AC30882.1|AL583923 conserved hypothetical protein from Mycobacterium leprae (102 aa). Also highly similar to CAB93047.1|SCD95A.20|AL357432 hypothetical protein from Streptomyces coelicolor (84 aa). Protein product from Mb0438 detected using SWATH mass spectrometry. Mb0438 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248144.1" /translation="MDSAMARAIRSGDDAEVADGLTRREHDILAFERQWWKFAGVKEE AIKELFSMSATRYYQVLNALVDRPEALAADPMLVKRLRRLRASRQKARAARRLGFEVT " CDS 520092..520586 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0439" /product="PUTATIVE TUBERCULIN RELATED PEPTIDE" /note="Mb0439, -, len: 164 aa. Equivalent to Rv0431, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 164 aa overlap). Putative tuberculin related peptide; almost identical to D00815|MSGAT103_1 AT103 from Mycobacterium tuberculosis (172 aa), FASTA score: (99.4% identity in 163 aa overlap). Highly similar to to CAC30881.1|AL583923 tuberculin related peptide (AT103) from Mycobacterium leprae (167 aa). Some similarity to G550415|HRPC (282 aa), FASTA scores: opt: 120, E(): 0.36, (33.3% identity in 111 aa overlap). Potential transmembrane domain at aa 19-37. Protein product from Mb0439 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0439 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248145.1" /translation="MLVTVGSMNERVPDSSGLPLRAMVMVLLFLGVVFLLLGWQALGS SPNSEDDSSAISTMTTTTAAPTSTSVKPAAPRAEVRVYNISGAEGAAARTADRLKAAG FTVTDVGNLSLPDVAATTVYYTEVEGERATADAVGRTLGAAVELRLPELSDQPPGVIV VVTG" CDS 520619..521341 /codon_start=1 /transl_table=11 /gene="sodC" /locus_tag="BQ2027_MB0440" /product="periplasmic superoxide dismutase [cu-zn] sodc" /note="Mb0440, sodC, len: 240 aa. Equivalent to Rv0432, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 240 aa overlap). Probable sodC, periplasmic superoxide dismutase [Cu-Zn] (EC 1.15.1.1), equivalent to CAC30880.1|AL583923 superoxide dismutase precursor (Cu-Zn) from Mycobacterium leprae (240 aa); and AAK20038.1|AF326234_1 copper zinc superoxide dismutase from Mycobacterium avium subsp. paratuberculosis (226 aa). Also similar to others e.g. SODC_PHOLE|P00446 superoxide dismutase precursor (cu-zn) from Photobacterium leiognathi (173 aa), FASTA scores: opt: 214, E(): 5.2 e-06, (36.5% identity in 181 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. BELONGS TO THE CU-ZN SUPEROXIDE DISMUTASE FAMILY. Possibly localized in periplasm, membrane-bound. Protein product from Mb0440 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0440 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248146.1" /translation="MPKPADHRNHAAVSTSVLSALFLGAGAALLSACSSPQHASTVPG TTPSIWTGSPAPSGLSGHDEESPGAQSLTSTLTAPDGTKVATAKFEFANGYATVTIAT TGVGKLTPGFHGLHIHQVGKCEPNSVAPTGGAPGNFLSAGGHYHVPGHTGTPASGDLA SLQVRGDGSAMLVTTTDAFTMDDLLSGAKTAIIIHAGADNFANIPPERYVQVNGTPGP DETTLTTGDAGKRVACGVIGSG" CDS 521343..522473 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0441" /product="Carboxylate-amine ligase" /note="Mb0441, -, len: 376 aa. Equivalent to Rv0433, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. P77213|YBDK_ECOLI hypothetical 41.7 KD protein from Escherichia coli strain K12 (372 aa), FASTA scores: opt: 555, E(): 2e-30, (28.2% identity in 365 aa overlap). Protein product from Mb0441 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0441 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248147.1" /translation="MPARRSAARIDFAGSPRPTLGVEWEFALVDSQTRDLSNEATAVI AEIGENPRVHKELLRNTVEIVSGICECTAEAMQDLRDTLGPARQIVRDRGMELFCAGT HPFARWSAQKLTDAPRYAELIKRTQWWGRQMLIWGVHVHVGIRSAHKVMPIMTSLLNY YPHLLALSASSPWWGGEDTGYASNRAMMFQQLPTAGLPFHFQRWAEFEGFVYDQKKTG IIDHMDEIRWDIRPSPHLGTLEVRICDGVSNLRELGALVALTHCLIVDLDRRLDAGET LPTMPPWHVQENKWRAARYGLDAVIILDADSNERLVTDDLADVLTRLEPVAKSLNCAD ELAAVSDIYRDGASYQRQLRVAQQHDGDLRAVVDALVAELVI" CDS 522533..523186 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0442" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0442, -, len: 217 aa. Equivalent to Rv0434, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 217 aa overlap). Conserved hypothetical protein, similar to AE002052_2 from Deinococcus radiodurans (213 aa), FASTA scores: opt: 258, E(): 4e-10, (31.9% identity in 213 aa overlap); SYCSLRB_122|Q55701 hypothetical 24.5 kDa protein from Synechocystis (214 aa), FASTA scores: opt: 156, E(): 0.00041, (28.4% identity in 204 aa overlap); MXABSGA_1|LON2_MYXXA|P36774 ATP-dependent protease la 2 from Myxococcus xanthus (826 aa), FASTA scores: opt: 160, E(): 0.00068, (28.4% identity in 197 aa overlap); etc. Protein product from Mb0442 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0442 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248148.1" /translation="MADFAPVELAMFPLESAPLPDEDLPLHIFEPRYAALVRDCMDTA DPRFGVVLISRGREVGGGDTRCDVGTLARITECADAGSGRYMLRCRVGERIRVCDWLP DDPYPRAKVRFWPDQPGHPVTAAQLLEVEDRVVALFERIAAARGVRLPAREVVLGYPV VDPADTGQRLYALACRVPMGPADRYAVLAAPSAADRLVRLGDALDSVAAMVEFELST" CDS complement(523366..525552) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0443C" /product="PUTATIVE CONSERVED ATPASE" /note="Mb0443c, -, len: 728 aa. Equivalent to Rv0435c, len: 728 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 728 aa overlap). Putative conserved ATPase (EC 3.6.1.-), similar to others e.g. SAV_SULAC|Q07590 sav protein involved in cell division from sulfolobus acidocaldarius (780 aa), FASTA scores: opt: 897, E(): 0, (34.5% identity in 693 aa overlap); NP_148637.1|7435761|B72479 transitional endoplasmic reticulum ATPase from Aeropyrum pernix (699 aa); etc. Also similar to Rv3610c and Rv2115c from Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00674 AAA-protein family signature. Protein product from Mb0443c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0443c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248149.1" /translation="MTHPDPARQLTLTARLNTSAVDSRRGVVRLHPNAIAALGIREWD AVSLTGSRTTAAVAGLAAADTAVGTVLLDDVTLSNAGLREGTEVIVSPVTVYGARSVT LSGSTLATQSVPPVTLRQALLGKVMTVGDAVSLLPRDLGPGTSTSAASRALAAAVGIS WTSELLTVTGVDPDGPVSVQPNSLVTWGAGVPAAMGTSTAGQVSISSPEIQIEELKGA QPQAAKLTEWLKLALDEPHLLQTLGAGTNLGVLVSGPAGVGKATLVRAVCDGRRLVTL DGPEIGALAAGDRVKAVASAVQAVRHEGGVLLITDADALLPAAAEPVASLILSELRTA VATAGVVLIATSARPDQLDARLRSPELCDRELGLPLPDAATRKSLLEALLNPVPTGDL NLDEIASRTPGFVVADLAALVREAALRAASRASADGRPPMLHQDDLLGALTVIRPLSR SASDEVTVGDVTLDDVGDMAAAKQALTEAVLWPLQHPDTFARLGVEPPRGVLLYGPPG CGKTFVVRALASTGQLSVHAVKGSELMDKWVGSSEKAVRELFRRARDSAPSLVFLDEL DALAPRRGQSFDSGVSDRVVAALLTELDGIDPLRDVVMLGATNRPDLIDPALLRPGRL ERLVFVEPPDAAARREILRTAGKSIPLSSDVDLDEVAAGLDGYSAADCVALLREAALT AMRRSIDAANVTAADLATARETVRASLDPLQVASLRKFGTKGDLRS" CDS complement(525549..526409) /codon_start=1 /transl_table=11 /gene="pssA" /locus_tag="BQ2027_MB0444C" /product="PROBABLE CDP-DIACYLGLYCEROL--SERINE O-PHOSPHATIDYLTRANSFERASE PSSA (PS SYNTHASE) (PHOSPHATIDYLSERINE SYNTHASE)" /note="Mb0444c, pssA, len: 286 aa. Equivalent to Rv0436c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 286 aa overlap). Probable pssA, PS synthase (CDP-diacylglycerol--serine O-phosphatidyltransferase) (EC 2.7.8.8) (see citation below), integral membrane protein, equivalent to AL035159|MLCB1450_9|T44730 from Mycobacterium leprae (300 aa), FASTA scores: opt: 1506, E(): 0, (77.9% identity in 285 aa overlap). Also highly similar to others e.g. NP_108059.1|14027250|BAB54204.1|AP003012 phosphatidylserine synthase from Mesorhizobium loti (248 aa); PSS_BACSU|P39823 cdp-diacylglycerol--serine o-phosphatidyltransferase from Bacillus subtilis (177 aa), FASTA scores: opt: 277, E(): 9.9e-12, (33.3% identity in 183 aa overlap); etc. Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY. Protein product from Mb0444c detected using SWATH mass spectrometry. Mb0444c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248150.1" /translation="MIGKPRGRRGVNLQILPSAMTVLSICAGLTAIKFALEHQPKAAM ALIAAAAILDGLDGRVARILDAQSRMGAEIDSLADAVNFGVTPALVLYVSMLSKWPVG WVVVLLYAVCVVLRLARYNALQDDGTQPAYAHEFFVGMPAPAGAVSMIGLLALKMQFG EGWWTSVWFLSFWVTGTSILLVSGIPMKKMHAVSVPPNYAAALLAVLAICAAAAVLAP YLLIWVIIIAYMCHIPFAVRSQRWLAQHPEVWDDKPKQRRAVRRASRRAHPYRPSMAR LGLRKPGRRL" CDS complement(526406..527101) /codon_start=1 /transl_table=11 /gene="psd" /locus_tag="BQ2027_MB0445C" /product="POSSIBLE PHOSPHATIDYLSERINE DECARBOXYLASE PSD (PS DECARBOXYLASE)" /note="Mb0445c, psd, len: 231 aa. Equivalent to Rv0437c, len: 231 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Possible psd, phosphatidylserine decarboxylase (EC 4.1.1.65), equivalent to CAC29819.1|AL583918 conserved hypothetical protein from Mycobacterium leprae (243 aa); and highly similar to MLCB1450.11|T44729|4154044|CAA22695.1|AL035159 hypothetical protein from Mycobacterium leprae (202 aa), FASTA score: (74.6% identity in 197 aa overlap). Also similar to other phosphatidylserine decarboxylases e.g. NP_108058.1|14027249|BAB54203.1|AP003012 phosphatidylserine decarboxylase from Mesorhizobium loti (232 aa); AAK86872|g15156090|AGR_C_1963 phosphatidylserine decarboxylase from Agrobacterium tumefaciens (244 aa); AAG12422.1|AY005137|Psd phosphatidylserine decarboxylase from Chlorobium tepidum (216 aa); etc. Protein product from Mb0445c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0445c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248151.1" /translation="MARRPRPDGPQHLLALVRSAVPPVHPAGRPFIAAGLAIAAVGHR YRWLRGTGLLAAAACAGFFRHPQRVPPTRPAAIVAPADGVICAIDSAAPPAELSMGDT PLPRVSIFLSILDAHVQRAPVSGEVIAVQHRPGRFGSADLPEASDDNERTSVRIRMPN GAEVVAVQIAGLVARRIVCDAHVGDKLAIGDTYGLIRFGSRLDTYLPAGAEPIVNVGQ RAVAGETVLAECR" CDS complement(527162..528379) /codon_start=1 /transl_table=11 /gene="moeA2" /locus_tag="BQ2027_MB0446C" /product="PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEA2" /note="Mb0446c, moeA2, len: 405 aa. Equivalent to Rv0438c, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 405 aa overlap). Probable moeA2, molybdenum cofactor biosynthesis protein, highly similar to many e.g. Y10817|ANY10817_2 from A. nicotinovorans (429 aa), FASTA scores: opt: 786, E(): 0, (39.2% identity in 398 aa overlap); etc. Also similar to MOEA1|Rv0994|MTCI237.08|O05577 PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis (426 aa), FASTA scores: opt: 667, E(): 2e-32, (36.5% identity in 425 aa overlap). Note that previously known as moeA3. Protein product from Mb0446c detected using SWATH mass spectrometry. Mb0446c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248152.1" /translation="MRSVQEHQRVVAEMIRACRPITVPLTQAQGLVLGGDVVAPLSLP VFDNSAMDGYAVRAEDTSGATPQNPVMLPVAEDIPAGRADMLTLQPVTAHRIMTGAPV PTGATAIVPVEATDGGVDSVAIRQQATPGKHIRRSGEDVAAGTTVLHNGQIVTPAVLG LAAALGLAELPVLPRQRVLVISTGSELASPGTPLQPGQIYESNSIMLAAAVRDAGAAV VATATAGDDVAQFGAILDRYAVDADLIITSGGVSAGAYEVVKDAFGSADYRGGDHGVE FVKVAMQPGMPQGVGRVAGTPIVTLPGNPVSALVSFEVFIRPPLRMAMGLPDPYRPHR SAVLTASLTSPRGKRQFRRAILDHQAGTVISYGPPASHHLRWLASANGLLDIPEDVVE VAAGTQLQVWDLT" CDS complement(528398..529333) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0447C" /product="probable dehydrogenase/reductase" /note="Mb0447c, -, len: 311 aa. Equivalent to Rv0439c, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 311 aa overlap). Probable dehydrogenase/reductase (EC 1.-.-.-), equivalent to AL035159|MLCB1450_6|T44727 probable oxidoreductase from Mycobacterium leprae (304 aa), FASTA scores: opt: 1360, E(): 0, (69.2% identity in 302 aa overlap). Also highly similar to various oxidoreductases, generally dehydrogenases/reductases e.g. PA5031|C83017|9951320|AAG08416.1|AE004916_5|AE004916 probable short chain dehydrogenase from Pseudomonas aeruginosa (309 aa); Q03326|OXIR_STRAT PROBABLE OXIDOREDUCTASE from Streptomyces antibioticus (298 aa), FASTA scores: opt: 400, E(): 1.2e-18, (34.6% identity in 298 aa overlap); etc. Protein product from Mb0447c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0447c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248153.1" /translation="MTANDNKTRKWSAADVPDQSGRVVVVTGANTGIGYHTAAVFADR GAHVVLAVRNLEKGNAARARIMAARPGAHVTLQQLDLCSLDSVRAAADALRTAYPRID VLINNAGVMWTPKQVTKDGFELQFGTNHLGHFALTGLVLDHMLPVPGSRVVTVSSQGH RIHAAIHFDDLQWERRYNRVAAYGQAKLANLLFTYELQRRLGEAGKSTIAVAAHPGGS NTELTRNLPRLIRPVATVLGPLLFQSPEMGALPTLRAATDPTTQGGQYYGPDGFGEQR GHPKVVQSSAQSHDKDLQRRLWTVSEELTGVSFGV" CDS 529627..531249 /codon_start=1 /transl_table=11 /gene="groEL2" /locus_tag="BQ2027_MB0448" /standard_name="groL2; groEL-2; hsp65; hsp60" /product="60 KDA CHAPERONIN 2 GROEL2 (PROTEIN CPN60-2) (GROEL PROTEIN 2) (65 KDA ANTIGEN) (HEAT SHOCK PROTEIN 65) (CELL WALL PROTEIN A) (ANTIGEN A)" /note="Mb0448, groEL2, len: 540 aa. Equivalent to Rv0440, len: 540 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 540 aa overlap). groEL2 (alternate gene names: groL2, groEL-2, hsp65, hsp60), 60 kDa chaperonin 2 (see first citation below). PURIFIED 65 kDa ANTIGEN CAN ELICIT A STRONG DELAYED-TYPE HYPERSENSITIVITY REACTION IN EXPERIMENTAL ANIMALS INFECTED WITH M. TUBERCULOSIS. THIS PROTEIN IS ONE OF THE MAJOR IMMUNOREACTIVE PROTEINS OF THE MYCOBACTERIA. THIS ANTIGEN CONTAINS EPITOPES THAT ARE COMMON TO VARIOUS SPECIES OF MYCOBACTERIA. Contains PS00296 Chaperonins cpn60 signature. BELONGS TO THE CHAPERONIN (HSP60) FAMILY. Protein product from Mb0448 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0448 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248154.1" /translation="MAKTIAYDEEARRGLERGLNALADAVKVTLGPKGRNVVLEKKWG APTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKTDDVAGDGTTTATVLAQALVREG LRNVAAGANPLGLKRGIEKAVEKVTETLLKGAKEVETKEQIAATAAISAGDQSIGDLI AEAMDKVGNEGVITVEESNTFGLQLELTEGMRFDKGYISGYFVTDPERQEAVLEDPYI LLVSSKVSTVKDLLPLLEKVIGAGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKA PGFGDRRKAMLQDMAILTGGQVISEEVGLTLENADLSLLGKARKVVVTKDETTIVEGA GDTDAIAGRVAQIRQEIENSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKH RIEDAVRNAKAAVEEGIVAGGGVTLLQAAPTLDELKLEGDEATGANIVKVALEAPLKQ IAFNSGLEPGVVAEKVRNLPAGHGLNAQTGVYEDLLAAGVADPVKVTRSALQNAASIA GLFLTTEAVVADKPEKEKASVPGGGDMGGMDF" CDS complement(531315..531743) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0449C" /product="FMN binding protein" /note="Mb0449c, -, len: 142 aa. Equivalent to Rv0441c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Hypothetical unknown protein. Protein product from Mb0449c detected using SWATH mass spectrometry. Mb0449c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248155.1" /translation="MGAKKVDLKRLAAALPDYPFAYLITVDDGHRVHTVAVEPVLREL PDGPDGPRAVVDVGLIGGRTRQNLAHRSEVTLLWPPSDPSGYSLIVDGRAQASDAGPD DDTARCGVVPIRALLHRDAAPDSPTAAKGCLHDCVVFSVP" CDS complement(531770..533209) /codon_start=1 /transl_table=11 /gene="PPE10" /locus_tag="BQ2027_MB0450C" /product="ppe family protein ppe10" /note="Mb0450c, PPE10, len: 479 aa. Equivalent to Rv0442c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 479 aa overlap). Member of the Mycobacterium tuberculosis PPE family, nearly identical to hypothetical protein from Mycobacterium tuberculosis (strain Erdman) and to AN5S46909_1 protein fragment from Mycobacterium bovis (302 aa); P42611|YHS6_MYCTU HYPOTHETICAL 50.6 KD PROTEIN (517 aa), FASTA scores: opt: 3144, E(): 0, (98.4 identity in 492 aa overlap); and S46909|S46909_1 (302 aa), FASTA scores: opt: 1897, E(): 0, (98.0% identity in 302 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, truncation at the 5' start due to a single base transition (g-a), leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (479 aa versus 487 aa). Mb0450c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248156.1" /translation="MPPEINSALMFAGPGSGPLIAAATAWGELAEELLASIASLGSVT SELTSGAWLGPSAAAMMAVATQYLAWLSTAAAQAEQAAAQAMAIATAFEAALAATVQP AVVAANRGLMQLLAATNWFGQNAPALMDVEAAYEQMWALDVAAMAGYHFDASAAVAQL APWQQVLRNLGIDIGKNGQINLGFGNTGSGNIGNNNIGNNNIGSGNTGTGNIGSGNTG SGNLGLGNLGDGNIGFGNTGSGNIGFGITGDHQMGFGGFNSGSGNIGFGNSGTGNVGL FNSGSGNIGIGNSGSLNSGIGTSGTINAGLGSAGSLNTSFWNAGMQNAALGSAAGSEA ALVSSAGYATGGMSTAALSSGILASALGSTGGLQHGLANVLNSGLTNTPVAAPASAPV GGLDSGNPNPGSGSAAAGSGANPGLRSPGTSYPSFVNSGSNDSGLRNTAVREPSTPGS GIPKSNFYPSPDRESAYASPRIGQPVGSE" CDS 533415..533930 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0451" /product="link to FMN-binding protein" /note="Mb0451, -, len: 171 aa. Equivalent to Rv0443, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 171 aa overlap). Conserved hypothetical protein, highly similar to AL049863|SC5H1_23|T35339 hypothetical protein from Streptomyces coelicolor (171 aa), FASTA scores: opt: 561, E(): 2.3e-32, (49.7% identity in 165 aa overlap); and CAC42482.1|AJ318385 hypothetical protein from Amycolatopsis mediterranei (163 aa). Protein product from Mb0451 detected using shotgun mass spectrometry. Mb0451 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248157.1" /translation="MASTDAAAQELLRDAFTRLIEHVDELTDGLTDQLACYRPTPSAN SIAWLLWHSARVQDIQVAHVAGVEEVWTRDGWVDRFGLDLPRHDTGYGHRPEDVAKVR APADLLSGYYHAVHKLTLEYIAGMTADELSRVVDTSWNPPVTVSARLVSIVDDCAQHL GQAAYLRGIAR" CDS complement(534110..534808) /codon_start=1 /transl_table=11 /gene="rska" /locus_tag="BQ2027_MB0452C" /product="anti-sigma factor rska (regulator of sigma k)" /note="Mb0452c, -, len: 232 aa. Equivalent to Rv0444c, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 232 aa overlap). Conserved hypothetical protein; C-terminus similar to P12752|Y24K_STRGR HYPOTHETICAL 24.7 KD PROTEIN from Streptomyces griseus (238 aa), FASTA scores: opt: 207, E(): 2.2e-05, (32.9% identity in 158 aa overlap). Protein product from Mb0452c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0452c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248158.1" /translation="MTEHTDFELLELATPYALNAVSDDERADIDRRVAAAPSPVAAAF NDEVRAVRETMAVVSAATTAEPPAHLRTAILDATKPEVRRQSRWRTAAFASAAAIAVG LGAFDLGVLTRPSPPPTVAEQVLTAPDVRTVSRPLGAGTATVVFSRDRNTGLLVMNNV APPSRGTVYQMWLLGGAKGPRSAETMGTAAVTPSTTATLTDLGASTALAFTVEPGTGS PQPTGTILAELPLG" CDS complement(534852..535415) /codon_start=1 /transl_table=11 /gene="sigK" /locus_tag="BQ2027_MB0453C" /product="alternative rna polymerase sigma factor sigk" /note="Mb0453c, sigK, len: 187 aa. Equivalent to Rv0445c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Probable sigK, alternative RNA polymerase sigma factor (see citation below), highly similar to others e.g. 5531433|CAB50938.1|AL096849|T36745 probable RNA polymerase sigma factor from Streptomyces coelicolor (185 aa); NP_105607.1|14024791|BAB51393.1|AP003005 RNA polymerase sigma factor from Mesorhizobium loti (179 aa); 1654108|AAB17906.1|U11283|A58883 probable transcription initiation factor sigma E from Rhodobacter phaeroides (168 aa), FASTA scores: opt: 299, E(): 2e-14, (32.7% identity in 168 aa overlap); Q45585|SIGW_BACSU RNA POLYMERASE SIGMA FACTOR SIGW from Bacillus subtilis (187 aa), FASTA scores: opt: 213, E(): 2.9e-08, (26.8% identity in 179 aa overlap); etc. Protein product from Mb0453c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0453c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248159.1" /translation="MTGPPRLSSDLDALLRRVAGHDQAAFAEFYDHTKSRVYGLVMRV LRDTGYSEETTQEIYLEVWRNASEFDSAKGSALAWLLTMAHRRAVDRVRCEQAGNQRE VRYGAANVDPASDVVADLAIAGDERRRVTECLKALTDTQRQCIELAYYGGLTYVEVSR RLAANLSTIKSRMRDALRSLRNCLDVS" CDS complement(535464..536234) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0454C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0454c, -, len: 256 aa. Equivalent to Rv0446c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Possible conserved transmembrane protein, similar at N-terminus to U1740AF|U15183|MLU15183_40 from Mycobacterium leprae (117 aa), FASTA scores: opt: 175, E(): 2.5e-05, (62.5% identity in 40 aa overlap); and at C-terminus to AL021529|SC10A5_3 from Streptomyces coelicolor (226 aa), FASTA scores: opt: 207, E(): 9.8e-07, (34.2% identity in 114 aa overlap). Also similar to others hypothetical proteins e.g. AAK04680.1|AE006291_14|AE006291) HYPOTHETICAL PROTEIN from Lactococcus lactis subsp. lactis (257 aa). Protein product from Mb0454c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0454c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248160.1" /translation="MVTSVSALAVAVVHSVAFAIGRRIGRYNVVDVVWGLGFVAVAVA AATLGHGDPVRRWLLLALVSTWGLRLSWHMYRKTAGQGEDPRYADLLRGATPVQALRK VFGLQGLLTLFVSFPLQLSAVTGPTPKPLLAVGGVGLAVWLVGITFEAVGDWQLWVFK SDPANRGVIMDRGLWAWTRHPNYFGDACVWWGLWLITINDWAPLATVGSPLLMTYLLV DVSGARLTERYLKGRPGFAEYQRRTAYFVPRPPRSARR" CDS complement(536243..537526) /codon_start=1 /transl_table=11 /gene="ufaA1" /locus_tag="BQ2027_MB0455C" /product="probable cyclopropane-fatty-acyl-phospholipid synthase ufaa1 (cyclopropane fatty acid synthase) (cfa synthase)" /note="Mb0455c, ufaA1, len: 427 aa. Equivalent to Rv0447c, len: 427 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 427 aa overlap). Probable ufaA1, cyclopropane-fatty-acyl-phospholipid synthase (EC 2.1.1.79), similar to others e.g. NP_102178.1|14021351|BAB47964.1|AP002994 cyclopropane-fatty-acyl-phospholipid synthase from Mesorhizobium loti (378 aa); B82240|9655593|AAF94281.1|AE004192 cyclopropane-fatty-acyl-phospholipid synthase from Vibrio cholerae (432 aa); P30010|CFA_ECOLI CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Escherichia coli strain K-12 (382 aa); X55704|PPLPD_3 LPD-3 from P.putida (394 aa), FASTA scores: opt: 556, E(): 2.8e-30, (33.3% identity in 387 aa overlap); AE0005|HPAE000557_9 from Helicobacter pylori (389 aa), FASTA scores: opt: 539, E(): 3.9e-29, (34.3% identity in 382 aa overlap). Protein product from Mb0455c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0455c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248161.1" /translation="MTVETSQTPSAAIDSDRWPAVAKVPRGPLAAASAAIANRLLRRT ATHLPLRLVYSDGTATGAADPRAPSLFIHRPDALARRIGRHGLIGFGESYMAGEWSSK ELTRVLTVLAGSVDELVPRSLHWLRPITPTFRPSWPDHSRDQARRNIAVHYDLSNDLF AAFLDETMTYSCAMFTDLLAQPTPAWTELAAAQRRKIDRLLDVAGVQQGSHVLEIGTG WGELCIRAAARGAHIRSVTLSVEQQRLARQRVAAAGFGHRVEIDLCDYRDVDGQYDSV VSVEMIEAVGYRSWPRYFAALEQLVRPGGPVAIQAITMPHHRMLATRHTQTWIQKYIF PGGLLPSTQAIIDITGQHTGLRIVDAASLRPHYAETLRLWRERFMQRRDGLAHLGFDE VFARMWELYLAYSEAGFRSGYLDVYQWTLIREGPP" CDS complement(537523..538188) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0456C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0456c, -, len: 221 aa. Equivalent to Rv0448c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 221 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. Z74841|BOD5A2_1 from B. oleracea (283 aa), FASTA scores: opt: 257, E(): 1.4e-10, (32.0% identity in 197 aa overlap); etc. Some similarity to U15183|MLU15183_38 from Mycobacterium leprae (82 aa), FASTA scores: opt: 134, E(): 0.014, (71.0% identity in 31 aa overlap). Protein product from Mb0456c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0456c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248162.1" /translation="MHHSFAYRSYSWYVDVDNLPQLPWWLRPFARFHADDHFADPFSC PPHSSLRDRLDAFFAARGLAVPDGRITALLQARVLGYVFNPLSIFWCHDRDGQLRHVI AEVHNTYGGRHAYLLPPADLPVVTAKNFYVSPFHQLAGYYLIRAPRPDRELDVTVTLH RDRRQVCPEFTATLRGQRRPATTRQIAMMQIISPLAPMVVAARIRIQGIRLWLRRVPV VPR" CDS complement(538248..539567) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0457C" /product="Amine oxidase, flavin-containing" /note="Mb0457c, -, len: 439 aa. Equivalent to Rv0449c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 439 aa overlap). Conserved hypothetical protein, some similarity with several hypothetical proteins and various enzymes e.g. AAK24569.1|AE005927 amine oxidase, flavin-containing from Caulobacter crescentus (454 aa); BAB02771.1|AB023036 mycolic acid methyl transferase-like protein from Arabidopsis thaliana (842 aa); BAB01742.1|AP000374 protein which contains similarity to cyclopropane fatty acid synthase from Arabidopsis thaliana (793 aa); etc. Has hydrophobic stretch at N-terminus. Protein product from Mb0457c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0457c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248163.1" /translation="MQQSLRRSVAVVGSGVAGLTAAYILSGRDRVTLYEADGRLGGHA HTHYLDNGGGPRGTDVVGVDSAFLVHNDRTYPTLCRLFAELGVATQESEMSMSVRADD IGLEYAGALGARGLFACRQSLRPRYLCMLAEILRFHRAAARLLREETDNAEDKPETLE AFLSRHHFSQYFVDYFITPLVAAVWSCGGADALRYPARYLFVFLDHHGMLSVFGSPTW RTVTGGSANYVQAIAAQLDEVSTRTPVHSLRRLPDGVLVGAGDGPSRRFDAAVVAVHP DQALLLLDEPTPAERAVLGAIAYSTNSAQLHTDESVLPRHHRARASWNYLVTPGQHQV VVSYDISRLMRLDGGRRYLVTLGGHDRVDPSSVIAEMTYSHPLYTPESVAAQRLLPTL GDNRVVFAGAYHGWGFHEDGAASGLRAARRLGADWPAAIPQEAMVAC" CDS complement(539607..542510) /codon_start=1 /transl_table=11 /gene="mmpL4" /locus_tag="BQ2027_MB0458C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL4" /note="Mb0458c, mmpL4, len: 967 aa. Equivalent to Rv0450c, len: 967 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 967 aa overlap). Probable mmpL4, conserved transmembrane transport protein (see citations below), member of RND superfamily, equivalent to U1740V|P54881|YV34_MYCLE HYPOTHETICAL 105.2 kDa PROTEIN from Mycobacterium leprae (959 aa), FASTA scores: opt: 5051, E(): 0, (78.4% identity in 962 aa overlap). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. Z83860|MTCY98.08 (962 aa), FASTA scores: opt: 3917, E(): 0, (61.3% identity in 950 aa overlap), MTCY20G9.34, etc. Contains PS00211 ABC transporters family signature. BELONGS TO THE MMPL FAMILY. TBparse score is 0.948. Protein product from Mb0458c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0458c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248164.1" /translation="MSTKFANDSNTNARPEKPFIARMIHAFAVPIILGWLAVCVVVTV FVPSLEAVGQERSVSLSPKDAPSFEAMGRIGMVFKEGDSDSFAMVIIEGNQPLGDAAH KYYDGLVAQLRADKKHVQSVQDLWGDPLTAAGVQSNDGKAAYVQLSLAGNQGTPLANE SVEAVRSIVESTPAPPGIKAYVTGPSALAADMHHSGDRSMARITMVTVAVIFIMLLLV YRSIITVVLLLITVGVELTAARGVVAVLGHSGAIGLTTFAVSLLTSLAIAAGTDYGIF IIGRYQEARQAGEDKEAAYYTMYRGTAHVILGSGLTITGATFCLSFARMPYFQTLGIP CAVGMLVAVAVALTLGPAVLHVGSRFGLFDPKRLLKVRGWRRVGTVVVRWPLPVLVAT CAIALVGLLALPGYKTSYNDRDYLPDFIPANQGYAAADRHFSQARMKPEILMIESDHD MRNPADFLVLDKLAKGIFRVPGISRVQAITRPEGTTMDHTSIPFQISMQNAGQLQTIK YQRDRANDMLKQADEMATTIAVLTRMHSLMAEMASTTHRMVGDTEEMKEITEELRDHV ADFDDFWRPIRSYFYWEKHCYGIPICWSFRSIFDALDGIDKLSEQIGVLLGDLREMDR LMPQMVAQIPPQIEAMENMRTMILTMHSTMTGIFDQMLEMSDNATAMGKAFDAAKNDD SFYLPPEVFKNKDFQRAMKSFLSSDGHAARFIILHRGDPQSPEGIKSIDAIRTAAEES LKGTPLEDAKIYLAGTAAVFHDISEGAQWDLLIAAISSLCLIFIIMLIITRAFIAAAV IVGTVALSLGASFGLSVLLWQHILAIHLHWLVLAMSVIVLLAVGSDYNLLLVSRFKQE IGAGLKTGIIRSMGGTGKVVTNAGLVFAVTMASMAVSDLRVIGQVGTTIGLGLLFDTL IVRSFMTPSIAALLGRWFWWPLRVRSRPARTPTVPSETQPAGRPLAMSSDRLG" CDS complement(542507..542929) /codon_start=1 /transl_table=11 /gene="mmpS4" /locus_tag="BQ2027_MB0459C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN MMPS4" /note="Mb0459c, mmpS4, len: 140 aa. Equivalent to Rv0451c, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Probable mmpS4, conserved membrane protein (see citations below), equivalent to U1740W|P54880|YV33_MYCLE HYPOTHETICAL 16.9 kDa PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 727, E(): 0, (75.9% identity in 137 aa overlap). Also similar to other Mycobacterial proteins e.g. Z84725|MTCY04D9.16c from Mycobacterium tuberculosis (142 aa), FASTA scores: opt: 451, E(): 3.2e-24, (50.0% identity in 138 aa overlap); etc. BELONGS TO THE MMPS FAMILY. TBparse score is 0.953. Protein product from Mb0459c detected using SWATH mass spectrometry. Mb0459c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248165.1" /translation="MLMRTWIPLVILVVVIVGGFTVHRIRGFFGSENRPSYSDTNLEN SKPFNPKHLTYEIFGPPGTVADISYFDVNSEPQRVDGAVLPWSLHITTNDAAVMGNIV AQGNSDSIGCRITVDGKVRAERVSNEVNAYTYCLVKSA" CDS 543161..543871 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0460" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0460, -, len: 236 aa. Equivalent to Rv0452, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Possible transcriptional regulator, similar to several putative TetR-family transcriptional regulators from Streptomyces coelicolor. Also similar in N-terminus to U1740Y|U15183|MLU15183_33 from Mycobacterium leprae (67 aa), FASTA score: (76.1% identity in 67 aa overlap). Contains probable helix-turn-helix motif at aa 44-65 (Score 1727, +5.07 SD). Protein product from Mb0460 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0460 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248166.1" /translation="MRYPLAVAQLGFQRARTEENKRQRAAALVEAARSLALETGVASV TLTAVAGRAGIHYSAVRRYFTSHKEVLLHLAAEGWARWSGTVCEQLGEPGPMSAPRVA EALANGLAADPLFCDLLANLHLHLEQEVDVDRVIEVKRTSIAAVIALVDAIESALPAL GRSGAFDILLAAYSLAATLWQIANPPERLTDAYAEEPELLPPEWNLDFAAALTRLLTA TLLGLLAGSPCECRSPTR" CDS 544193..545749 /codon_start=1 /transl_table=11 /gene="PPE11" /locus_tag="BQ2027_MB0461" /product="ppe family protein ppe11" /note="Mb0461, PPE11, len: 518 aa. Equivalent to Rv0453, len: 518 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 518 aa overlap). Member of the Mycobacterium tuberculosis PPE family, similar to many e.g. AL0212|MTV012_32 from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 882, E(): 7e-31, (41.8% identity in 514 aa overlap). Mb0461 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248167.1" /translation="MTSALIWMASPPEVHSALLSSGPGPGPVLAAATGWSSLGREYAA VAEELGALLAAVQAGVWQGPSAESFAAACLPYLSWLTQASADCAAAAARLEAVTAAYA AALVAMPTLAELAANHATHGAMVATNFFGINTIPIAVNEADYVRMWLQAATTMATYQA VADSAVRSIPDSVPPPRILKSNAQSQHSSSNNSGGADPVDDFIAEILKIITGGRVIWD PEAGTVNGLPYDAYTNPGTLMWWIARSLELLQDFQEFAKLLFTNPVKAFQFLVDLILF DWPTHMLQLATWLAENPQLLVAALTPAISGLGAVSGLAGLTGLVPQPPVVPAPAPDAV VPTVLPLAGTATPTTAPASAPAAGAAPGPPAGTATATSASVPTSAGGFPPYLVGSGPG IDFDAGTPAGSRRAQPAADNVTAVAAAQVSARHQARRRRRAAAKERGNADEFVDMDSG PAIPPSGERDAWASNSGVGGLGFAGTASNETVAAPAGLTTLADDEFQCGPRMPMLPGA WDLGTWDRGD" CDS 545854..546204 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0462" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0462, -, len: 116 aa. Equivalent to Rv0454, len: 116 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 116 aa overlap). Conserved hypothetical protein, showing similarity with AAA63007.1|U15183 hypothetical protein from Mycobacterium leprae (115 aa), FASTA scores: opt: 151, E(): 0.0019, (31.5% identity in 89 aa overlap). Mb0462 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248168.1" /translation="MKQDFGLDVPQAGNAQNFDGVPEWVQVGVVTFVYRMQMHHVTRP VGAPGSGLAGDSTPVQGRQRVWDLVAGRLTHAPRSSVQAMRPTMFTSAPQRHGIPARG RWWLGYQERSRAWP" CDS complement(546394..546840) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0463C" /product="putative membrane protein" /note="Mb0463c, -, len: 148 aa. Equivalent to Rv0455c, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 148 aa overlap). Conserved hypothetical protein, equivalent to CAC31896.1|AL583925 possible secreted protein from Mycobacterium leprae (153 aa). Has hydrophobic stretch at N-terminus. Protein product from Mb0463c detected using shotgun mass spectrometry. Mb0463c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248169.1" /translation="MSRLSSILRAGAAFLVLGIAAATFPQSAAADSTEDFPIPRRMIA TTCDAEQYLAAVRDTSPVYYQRYMIDFNNHANLQQATINKAHWFFSLSPAERRDYSEH FYNGDPLTFAWVNHMKIFFNNKGVVAKGTEVCNGYPAGDMSVWNWA" CDS complement(546908..547822) /codon_start=1 /transl_table=11 /gene="echA2" /locus_tag="BQ2027_MB0464C" /product="ENOYL-COA HYDRATASE ECHA2 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0464c, echA2, len: 304 aa. Equivalent to Rv0456c, len 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 304 aa overlap). Probable echA2, enoyl-CoA hydratase (EC 4.2.1.17), similar to other enoyl-coA hydratases e.g. Q13011 PEROXISOMAL ENOYL-COA HYDRATASE-LIKE PROTEIN (328 aa), FASTA scores: opt: 209, E(): 5.3e-07, (31.7% identity in 142 aa overlap). Also similar to several other proteins from Mycobacterium tuberculosis e.g. MTCY09F9.29 FASTA score: (32.9% identity in 146 aa overlap); and MTI376.01c. Protein product from Mb0464c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0464c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248170.1" /translation="MPTPDFQTLLYTTAGPVATITLNRPEQLNTIVPPMPDEIEAAIG LAERDQDIKVIVLRGAGRAFSGGYDFGGGFQHWGDAMMTDGRWDPGKDFAMVTARETG PTQKFMAIWRASKPVIAQVHGWCVGGASDYALCADIVIASEDAVIGTPYSRMWGAYLT GMWLYRLSLAKVKWHSLTGRPLTGVQAAEAELINEAVPFERLEARVAEIATELARIPL SQLQAQKLIVNQAYENMGLASTQLLGGILDGLMRNTPDALEFIRTAQTQGVRAAVERR DGPFGDYSQAPPELRPDPTHVITPDGSM" CDS complement(548095..548376) /codon_start=1 /transl_table=11 /gene="mazf1" /locus_tag="BQ2027_MB0465C" /product="possible toxin mazf1" /note="Mb0465c, -, len: 93 aa. Equivalent to Rv0456A, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Conserved hypothetical protein; N-terminus highly similar to N-terminal part of P71650|Rv2801c|MT2869|MTCY16B7.42 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (118 aa), FASTA scores: opt: 303, E(): 1e-14, (60.44% identity in 91 aa overlap). Also some similarity in part with other hypothetical proteins e.g. Q9PHH8|XFA0027 Plasmid maintenance protein from Xylella fastidiosa (108 aa), FASTA scores: opt: 169, E(): 3.9e-05, (50.820% identity in 61 aa overlap). Mb0465c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248171.1" /translation="MLRGEIWQVDLDPARGSAANMRRPAVIVSNDRANAAAIRLDRGV VPVVPVTSNTEKVPIPGVVAGSERWPGRRFEGAGPAGWIRRCATSPLPS" CDS complement(548363..548488) /codon_start=1 /transl_table=11 /gene="mazE1" /locus_tag="BQ2027_MB0466A" /product="Possible antitoxin MazE1" /note="Mb0466A, len: 41 aa. Equivalent to Rv0456B len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 40 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible mazE1, antitoxin, part of toxin-antitoxin (TA) operon with Rv0456A (See Pandey and Gerdes, 2005; Zhu et al., 2006). Mb0466A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248172.1" /translation="MRHEAKRELVYRGRRSIGRMPREWACRRSRRFAANGVDAAR" CDS complement(548605..550635) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0466C" /product="PROBABLE PEPTIDASE" /note="Mb0466c, -, len: 676 aa. Equivalent to Rv0457c, len: 673 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 673 aa overlap). Probable peptidase (EC 3.4.-.-), similar to many e.g. NP_102851.1|14022026|BAB48637.1 probable endopeptidase from Mesorhizobium loti (687 aa); Y4NA_RHISN|P55577 probable peptidase (EC 3.4.21.-) (726 aa), FASTA scores: opt: 1126, E(): 0, (40.9% identity in 491 aa overlap). Also similar to Mycobacterium tuberculosis protein MTCY369.26 FASTA score: (33.8% identity in 299 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a possible RBS upstream leads to an earlier start resulting in a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (676 aa versus 673 aa). Protein product from Mb0466c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0466c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248173.1" /translation="MASMTFEPAPDGADPYLWLEDVTGAEALDWVRARNKPTTAAFCD AEFERMRVEALEVLDTDARIPYVNRRGNYLYNFWRDAANPRGLWRRTTLDSYRTDSPG WDVLIDVDELGRADDQKWVWGGAGVIEPDYTRALIGLSPGGSDASIVREFDMLTREFV EDGFQLPPAKSQITWEDPDTVLLGTDFGGDSLTTSGYPRVIKRWRRGKPLADAETIFE GAGTDVRVNASADRTPGFERTLLGRALDFWNEEVYELRGSELIRIEAPTDASVSIHRD WLLIELRTDWTVATTRYTAGSLLAAEYDEFLAGSAELQVVFEPDEHTALYQYAWTRDR LLIVTLADVASRVEIATPGSWRREPLSGIPAATNTVIVSADSHGDEFFLDSSGFDTPS RLMRGTDDGRLAEIKSAPAFFDAENMAVTQYFATSDDGTSIPYFVVRRTDADNPGPTL LNGYGGFETSRTPTYDGVLGRLWLARGGTYALANIRGGGEYGPGWHTQAMREGRDKVA QDFAAVATDLVTRGITTAEQLGARGGSNGGLLMGIMLTGYPEKFGALVCDVPLLDMKR YHLLLAGASWMAEYGDPDNPDDWKFISEYSPYQNISANRKYPPVLMTTSTRDDRVHPG HARKMTAALQAAGHPVWYYENIEGGHAGAADNAQIAFKSALSFAFLWRMLAG" CDS 550694..552217 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0467" /product="PROBABLE ALDEHYDE DEHYDROGENASE" /note="Mb0467, -, len: 507 aa. Equivalent to Rv0458, len: 507 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 507 aa overlap). Probable aldehyde dehydrogenase (EC 1.2.1.3), highly similar to many, closest to P46369|THCA_RHOER EPTC-INDUCIBLE ALDEHYDE DEHYDROGENASE from Rhodococcus erythropolis (506 aa), FASTA scores: opt: 2767, E(): 0, (79.7% identity in 507 aa overlap); AAC13641.1|AF029733 chloroacetaldehyde dehydrogenase from Xanthobacter autotrophicus (505 aa), FASTA scores: opt: 2563, E(): 0, (75.4% identity in 492 aa overlap); Q9RJZ6|DHAL_STRCO PROBABLE ALDEHYDE DEHYDROGENASE from Streptomyces coelicolor (507 aa). Also similar to other semialdehyde dehydrogenases in M. tuberculosis e.g. Rv0768, Rv2858c. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Protein product from Mb0467 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0467 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248174.1" /translation="MTVFSRPGSAGALMSYESRYQNFIGGQWVAPVHGRYFENPTPVT GQPFCEVPRSDAADIDKALDAAHAAAPGWGKTAPAERAAILNMIADRIDKNAAALAVA EVWDNGKPVREALAADIPLAVDHFRYFAAAIRAQEGALSQIDEDTVAYHFHEPLGVVG QIIPWNFPILMAAWKLAPALAAGNTAVLKPAEQTPASVLYLMSLIGDLLPPGVVNVVN GFGAEAGKPLASSDRIAKVAFTGETTTGRLIMQYASHNLIPVTLELGGKSPNIFFADV LAAHDDFCDKALEGFTMFALNQGEVCTCPSRSLIQADIYDEFLELAAIRTKAVRQGDP LDTETMLGSQASNDQLEKVLSYIEIGKQEGAVIIAGGERAELGGDLSGGYYMQPTIFT GTNNMRIFKEEIFGPVVAVTSFTDYDDAIGIANDTLYGLGAGVWSRDGNTAYRAGRDI QAGRVWVNCYHLYPAHAAFGGYKQSGIGREGHQMMLQHYQHTKNLLVSYSDKALGFF" CDS 552217..552708 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0468" /product="related to oxidoreductase activity" /note="Mb0468, -, len: 163 aa. Equivalent to Rv0459, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins. Note that highly similar to products of unidentified orfs in Xanthobacter autotrophicus, AF029733_2 (139 aa), and Rhodococcus erythropolis, REREUTP BC_1 (186 aa). Like MTV038.03, these ORF's are linked to aldehyde dehydrogenase genes. FASTA scores: AF0297|AF029733_2 (139 aa), opt: 439, E(): 6.2e-24, (50.0% identity in 126 aa overlap); and L24492|REREUTPBC_1 (186 aa), opt: 347, E(): 2.1e-17, (52.7% identity in 169 aa overlap). N-terminus also highly similar to AAA63041.1|U15183 ethanolamine permease (eutP) match from Mycobacterium leprae (53 aa). Protein product from Mb0468 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0468 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248175.1" /translation="MNAPAGVLITAEAAALLAGLQDRHGPVMFHQSGGCCDGSAPMCY PRADFLVGDRDILLGVLDVGEDGVPVWISGPQYQAWKHTQLIIDVVPGRGGGFSLEAP EGVRFLSRGRVFSDAEKAMREAAPVITGAAYECGERPLVRGLVVDLDDPDATPGVCRA SRR" CDS 552768..553007 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0469" /product="CONSERVED HYDROPHOBIC PROTEIN" /note="Mb0469, -, len: 79 aa. Equivalent to Rv0460, len: 79 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 79 aa overlap). Conserved hydrophobic protein, highly similar AAA63024.1|U15183 hypothetical protein from Mycobacterium leprae (56 aa), FASTA scores: opt: 197, E(): 3.7e-09, (63.8% identity in 47 aa overlap). Mb0469 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248176.1" /translation="MLVGNAIGLLAGVACSVLVHARIRPDIVIAMVVGIPSAIGLLVI LFSGRRWVTMLGAFILALAPGWFGVLVAIQVASSG" CDS 553045..553569 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0470" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb0470, -, len: 174 aa. Equivalent to Rv0461, len: 174 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Probable transmembrane protein. Protein product from Mb0470 detected using SWATH mass spectrometry. Mb0470 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248177.1" /translation="MPDFDTGAHSQRFLSLAGQQDRAGKSWPGSTPKPQEDPVGVAPS ASVEVLGSEPAATLAHSVTVPGRYTYLKWWKFVLVVLGVWIGAGEVGLSLFYWWYHTL DKTAAVFVVLVYVVACTVGGLILALVPGRPLITALSLGVMSGPFASVAAAAPLYGYYY CERMSHCLVGVIPY" CDS 553633..555027 /codon_start=1 /transl_table=11 /gene="lpd" /locus_tag="BQ2027_MB0471" /standard_name="TB49.2" /product="dihydrolipoamide dehydrogenase lpdc (lipoamide reductase (nadh)) (lipoyl dehydrogenase) (dihydrolipoyl dehydrogenase) (diaphorase)" /note="Mb0471, lpd, len: 464 aa. Equivalent to Rv0462, len: 464 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 464 aa overlap). lpd (alternate gene name: TB49.2), dihydrolipoamide dehydrogenase (EC 1.8.1.4) (see first citation below), equivalent to AAA63016.1|U15183 lipoamide dehydrogenase from Mycobacterium leprae (467 aa), FASTA scores: opt: 2583, E(): 0, (83.1% identity in 467 aa overlap). Also similar to to many e.g. P50970|DLDH_ZYMMO|X82291|ZMLPD_1 DIHYDROLIPOAMIDE DEHYDROGENASE from Z.mobilis (466 aa), FASTA scores: opt: 1198, E(): 0, (42.4 % identity in 465 aa overlap); etc. BELONG TO THE PYRIDINE NUCLEOTIDE-DISULFIDE OXIDOREDUCTASES CLASS-I. Protein product from Mb0471 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0471 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248178.1" /translation="MTHYDVVVLGAGPGGYVAAIRAAQLGLSTAIVEPKYWGGVCLNV GCIPSKALLRNAELVHIFTKDAKAFGISGEVTFDYGIAYDRSRKVAEGRVAGVHFLMK KNKITEIHGYGTFADANTLLVDLNDGGTESVTFDNAIIATGSSTRLVPGTSLSANVVT YEEQILSRELPKSIIIAGAGAIGMEFGYVLKNYGVDVTIVEFLPRALPNEDADVSKEI EKQFKKLGVTILTATKVESIADGGSQVTVTVTKDGVAQELKAEKVLQAIGFAPNVEGY GLDKAGVALTDRKAIGVDDYMRTNVGHIYAIGDVNGLLQLAHVAEAQGVVAAETIAGA ETLTLGDHRMLPRATFCQPNVASFGLTEQQARNEGYDVVVAKFPFTANAKAHGVGDPS GFVKLVADAKHGELLGGHLVGHDVAELLPELTLAQRWDLTASELARNVHTHPTMSEAL QECFHGLVGHMINF" CDS 555035..555328 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0472" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0472, -, len: 97 aa. Equivalent to Rv0463, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Probable conserved transmembrane protein, highly similar to AAA63017.1|U15183 hypothetical protein from Mycobacterium leprae (101 aa), FASTA scores: opt: 364, E(): 4e-21, (57.9% identity in 95 aa overlap). Mb0472 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248179.1" /translation="MTRRASTDTPQIIMGAIGGVVTGYILWLAAISVGDGLTTVSQWS RVVLLLSVLVAVCGAAGGLRLRSRGKLAWSAFAFSLPIPPVVLTVAVLADIYL" CDS complement(555332..555904) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0473C" /product="conserved protein" /note="Mb0473c, -, len: 190 aa. Equivalent to Rv0464c, len: 190 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 190 aa overlap). Conserved hypothetical protein, highly similar to CAC31982.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (188 aa). Also some similarity with Rv1531|AL022000|MTV045_5|D70820 hypothetical protein from Mycobacterium tuberculosis (188 aa), FASTA scores: E(): 9.6e-10, (30.9% identity in 175 aa overlap). Protein product from Mb0473c detected using shotgun mass spectrometry. Mb0473c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248180.1" /translation="MTGQNGQVARISPGKFRQLGPVNWLVAKLAARAVGAPQMHLFTT LGYRQYLFWTFAIYTGRLLHGRLPGVDTELVILRVAHLRSCEYELQHHRRMARRRGLD ANTQATIFAWPDVPDGDGPRKVLSARQQALLQATDELIKDRTITAGTWERLATHLDPR LLIEFCLLATQYDAIAATITALAIPPDNPQ" CDS complement(555901..557325) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0474C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0474c, -, len: 474 aa. Equivalent to Rv0465c, len: 474 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 474 aa overlap). Probable transcriptional regulator, highly similar to AC44331.1|AL596102 putative DNA-binding protein from Streptomyces coelicolor (489 aa); and similar to several hypothetical proteins and others transcriptional regulators. Some similarity in N-terminal region (1-100 aa) with repressors e.g. P06153|RPC_BPPH1 IMMUNITY REPRESSOR PROTEIN (144 aa), FASTA scores: opt: 130, E(): 0.084,(27.0% identity in 100 aa overlap). Very similar to Rv1129c|Z95585|MTCY22G8.18c from Mycobacterium tuberculosis (486 aa), FASTA scores: opt: 1475, E(): 0, (47.4% identity in 468 aa overlap). Contains probable helix-turn-helix motif at aa 19-40 (1827, +5.41 SD). Protein product from Mb0474c detected using SWATH mass spectrometry. Mb0474c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248181.1" /translation="MSKTYVGSRVRQLRNERGFSQAALAQMLEISPSYLDQIEHDVRP LTVAVLLRITEVFGVDATFFASQDDTRLVAELREVTLDRDLDIAIDPHEVAEMVSAHP GLARAVVNLHRRYRITTAQLAAATEERFSDGSGRGSITMPHEEVRDYFYQRQNYLHAL DTAAEDLTAQMRMHHGDLARELTRRLTEVHGVRINKRIDLGDTVLHRYDPATNTLEIS SHLSPGQQVFKMAAELAYLEFGDLIDAMVTDGKFTSAESRTLARLGLANYFAAATVLP YRQFHDVAENFRYDVERLSAFYSVSYETIAHRLSTLQRPSMRGVPFTFVQVDRAGNMS KRQSATGFHFSSSGGTCPLWNVYETFANPGKILVQIAQMPDGRNYLWVARTVELRAAR YGQPGKTFAIGLGCELRHAHRLVYSEGLDLSGDPNTAATPIGAGCRVCERDNCPQRAF PALGRALDLDEHRSTVSPYLVKQL" CDS 557477..558271 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0475" /product="Acyl-ACP thioesterase" /note="Mb0475, -, len: 264 aa. Equivalent to Rv0466, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Conserved hypothetical protein, equivalent to CAC31980.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (264 aa). Similar to Rv2001|Z74025|MTCY39.17c HYPOTHETICAL 28.7 KDA PROTEIN from Mycobacterium tuberculosis (250 aa), FASTA scores: opt: 592, E(): 0, (38.0% identity in 263 aa overlap). Some similarity to several THIOESTERASES e.g. Q42561|ATACPTE17_1 ACYL-(ACYL CARRIER PROTEIN) THIOESTER from A. thaliana (362 aa), FASTA scores: E(): 0.0092, (24.4% identity in 197 aa overlap). Protein product from Mb0475 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0475 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248182.1" /translation="MSLDKKLMPVPDGHPDVFDREWPLRVGDIDRAGRLRLDAACRHI QDIGQDQLREMGFEETHPLWIVRRTMVDLIRPIEFGDMLRCRRWCSGTSNRWCEMRVR VDGRKGGLIESEAFWIHVNRETEMPARIADDFLAGLHRTTSVDRLRWKGYLKPGSRDD ASEIHEFPVRVTDIDLFDHMNNAVYWSVIEDYLASHAELLRGPLRVTIEHEAPVALGD KLEIISHVHPAGSTEIFGPGLVDRAVTTLTYVVGDEPKAVASLFNL" CDS 558546..559832 /codon_start=1 /transl_table=11 /gene="icl" /locus_tag="BQ2027_MB0476" /product="ISOCITRATE LYASE ICL (ISOCITRASE) (ISOCITRATASE)" /note="Mb0476, icl, len: 428 aa. Equivalent to Rv0467, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 428 aa overlap). icl (previously known as aceA), isocitrate lyase (EC 4.1.3.1) (see citations below), highly similar to many, closest to Z29367|RFISCILY_1 from R. fascians (429 aa), FASTA scores: opt: 2359, E(): 0, (80.7% identity in 429 aa overlap). BELONGS TO THE ISOCITRATE LYASE FAMILY. Protein product from Mb0476 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0476 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248183.1" /translation="MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVE EHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLS GHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGG ALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADV ADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYA PFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQK ELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATK HQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH" CDS 559914..560774 /codon_start=1 /transl_table=11 /gene="fadB2" /locus_tag="BQ2027_MB0477" /product="3-hydroxybutyryl-coa dehydrogenase fadb2 (beta-hydroxybutyryl-coa dehydrogenase) (bhbd)" /note="Mb0477, fadB2, len: 286 aa. Equivalent to Rv0468, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable fadB2, 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157), equivalent to CAC31978.1|AL583925 3-hydroxyacyl-CoA dehydrogenase from Mycobacterium leprae (287 aa). Also similar to many 3-hydroxybutyryl-CoA dehydrogenases e.g. U32229|BJU32229_1 beta-hydroxybutyryl coenzyme A dehydrogenase from Bradyrhizobium japonicum (293 aa), FASTA scores: opt: 771, E(): 0, (45.7% identity in 282 aa overlap). BELONGS TO THE 3-HYDROXYACYL-COA DEHYDROGENASE FAMILY. Protein product from Mb0477 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0477 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248184.1" /translation="MSDAIQRVGVVGAGQMGSGIAEVSARAGVEVTVFEPAEALITAG RNRIVKSLERAVSAGKVTERERDRALGLLTFTTDLNDLSDRQLVIEAVVEDEAVKSEI FAELDRVVTDPDAVLASNTSSIPIMKVAAATKQPQRVLGLHFFNPVPVLPLVELVRTL VTDEAAAARTEEFASTVLGKQVVRCSDRSGFVVNALLVPYLLSAIRMVEAGFATVEDV DKAVVAGLSHPMGPLRLSDLVGLDTLKLIADKMFEEFKEPHYGPPPLLLRMVEAGQLG KKSGRGFYTY" CDS 560907..561767 /codon_start=1 /transl_table=11 /gene="umaA1" /locus_tag="BQ2027_MB0478" /product="possible mycolic acid synthase umaa" /note="Mb0478, umaA1, len: 286 aa. Equivalent to Rv0469, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Possible umaA1, mycolic acid synthase (EC 2.-.-.-), highly similar to CAC30854.1|AL583923 methyl mycolic acid synthase 1 from Mycobacterium leprae (286 aa); and CAC31976.1|AL583925 Mycolic acid synthase from Mycobacterium leprae (295 aa), FASTA scores: opt: 1402, E(): 0, (69.6% identity in 286 aa overlap). Also very similar to mycobacterial methyltransferases e.g. U77466|CmaD|MBU77466_1 (286 aa); MTCY20H10.26c|Z92772|MTY20H10_27 (296 aa); highly similar to CFA1_MYCTU|Q11195|U66108|MTU66108_1 cyclopropane-fatty-acyl-phospholipid synthase 1 (287 aa), FASTA scores: opt: 1360, E(): 0, (67.8% identity in 286 aa overlap) (see citation below); and very similar also to methoxy mycolic acid synthase 1 from Mycobacterium tuberculosis e.g. MTU66108_1 (286 aa). Protein product from Mb0478 detected using shotgun mass spectrometry. Mb0478 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248185.1" /translation="MTELRPFYEESQSIYDVSDEFFSLFLDPTMAYTCAYFEREDMTL EEAQNAKFDLALDKLHLEPGMTLLDIGCGWGGGLQRAIENYDVNVIGITLSRNQFEYS KAKLAKIPTERSVQVRLQGWDEFTDKVDRIVSIGAFEAFKMERYAAFFERSYDILPDD GRMLLHTILTYTQKQMHEMGVKVTMSDVRFMKFIGEEIFPGGQLPAQEDIFKFAQAAD FSVEKVQLLQQHYARTLNIWAANLEANKDRAIALQSEEIYNKYMHYLTGCEHFFRKGI SNVGQFTLTK" CDS complement(561867..562730) /codon_start=1 /transl_table=11 /gene="pcaA" /locus_tag="BQ2027_MB0479C" /product="MYCOLIC ACID SYNTHASE PCAA (CYCLOPROPANE SYNTHASE)" /note="Mb0479c, pcaA, len: 287 aa. Equivalent to Rv0470c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 287 aa overlap). pcaA (previously known as umaA2), mycolic acid synthase (cyclopropane synthase) (EC 2.-.-.-) (see citations below), equivalent to CAC31976.1|AL583925 Mycolic acid synthase from Mycobacterium leprae (295 aa); and highly similar to S72886|B2168_F3_130|467038|AAA17222.1|U00018 hypothetical protein from Mycobacterium leprae (308 aa); Q11195|CFA1_MYCTU CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE 1 (CYCLOPROPANE MYCOLIC ACID SYNTHASE 1) (287 aa) (see second citation below); U27357|MTU27357_1 cyclopropane mycolic acid synthase from Mycobacterium tuberculosis (287 aa), FASTA scores: opt: 1415, E(): 0, (72.8% identity in 287 aa overlap); and related enzymes e.g. MTCY20H10.25c|Z92772|MTY20H10_26 (287 aa), FASTA scores: opt: 1387, E(): 0, (72.5% identity in 287 aa overlap). Protein product from Mb0479c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0479c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248186.1" /translation="MSVQLTPHFGNVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMT LQEAQIAKIDLALGKLNLEPGMTLLDIGCGWGATMRRAIEKYDVNVVGLTLSENQAGH VQKMFDQMDTPRSRRVLLEGWEKFDEPVDRIVSIGAFEHFGHQRYHHFFEVTHRTLPA DGKMLLHTIVRPTFKEGREKGLTLTHELVHFTKFILAEIFPGGWLPSIPTVHEYAEKV GFRVTAVQSLQLHYARTLDMWATALEANKDQAIAIQSQTVYDRYMKYLTGCAKLFRQG YTDVDQFTLEK" CDS complement(562873..563313) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0480C" /product="HYPOTHETICAL PROTEIN" /note="Mb0480c, -, len: 146 aa. Equivalent to Rv0470A, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 146 aa overlap). Hypothetical unknown protein. GC plot suggests CDS for Cys-rich protein, could possibly be continuation of Rv0471c but no frameshift found to allow this. Sequence same in Mycobacterium bovis and CDC1551. Weak hits to Cys-rich region (aa 258-314) of D63395|D63395_1 mRNA for NOTCH4 from Homo sapiens (1095 aa), FASTA scores: opt: 132, E(): 1.1, (39.35% identity in 61 aa overlap). Mb0480c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248187.1" /translation="MGAGGWEVVLASLPYGLLCTTVLMGKHIDKIGYDEPLGIRTLPV LLGETCARTVTLAMMVGFYLLIAVNVMLAAMPWPRCWSPGRCPGWRKCGPISCDGGPS SRHRRFRCGRCGMPRWPGCTCVRPVRCWLWAWRSVPGGAPGDFR" CDS complement(563244..563732) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0481C" /product="HYPOTHETICAL PROTEIN" /note="Mb0481c, -, len: 162 aa. Equivalent to Rv0471c, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Hypothetical unknown protein. Mb0481c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248188.1" /translation="MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLP MTLVSGLVAGLLAIGEPGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARA RYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAP RC" CDS complement(563742..564446) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0482C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0482c, -, len: 234 aa. Equivalent to Rv0472c, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Probable regulatory protein, possibly tetR family, equivalent to CAC31974.1|AL583925 possible TetR-family transcriptional regulator from Mycobacterium leprae (233 aa). Also similar to CAC01492.1|AL391017 putative transcriptional regulatory protein from Streptomyces coelicolor (218 aa); and CAC01371.1|AL390975 putative tetR-family transcriptional regulator from Streptomyces coelicolor (228 aa). Also similar to AL0212|MTV012_65 from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 327, E(): 1.8e-15, (31.0% identity in 232 aa overlap); and Z95120|MTCY07D11.18c (228 aa), FASTA scores: opt: 190, E(): 4.4e-06, (23.1% identity in 186 aa overlap). Contains probable helix-turn-helix doimain at aa 45-66 (Score 1429, +4.05 SD). Protein product from Mb0482c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0482c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248189.1" /translation="MAERIPAVTVKTDGRKRRWHQHKVERRNELVDGTIEAIRRHGRF LSMDEIAAEIGVSKTVLYRYFVDKNDLTTAVMMRFTQTTLIPNMIAALSADMDGFELT REIIRVYVETVAAQPEPYRFVMANSSASKSKVIADSERIIARMLAVMLRRRMQEAGMD TGGVEPWAYLIVGGVQLATHSWMSDPRMSSDELIDYLTMLSWSALCGIVEAGGSLEKF REQPHPSPIVPAWGQV" CDS 564583..565953 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0483" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0483, -, len: 456 aa. Equivalent to Rv0473, len: 456 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 456 aa overlap). Possible conserved transmembrane protein, showing some similarity to hypothetical proteins e.g. NP_102800.1|14021975|BAB48586.1|AP002996 hypothetical protein from Mesorhizobium loti (431 aa); P39385|YJIN_ECOLI|YJIN|B4336 HYPOTHETICAL 48.2 KD PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain K12 (426 aa), FASTA scores: opt: 396, E(): 9.8e-19, (31.8 % identity in 424 aa overlap); etc. Protein product from Mb0483 detected using SWATH mass spectrometry. Mb0483 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248190.1" /translation="MVAHRAEVSGSPPPRLNLSTQPTVARRVRASFAESFAAADPEAD AARRMALRRMKVVAVGFLVGATGVFLACRWAQADGADHAWLGYLGAAAEAGMVGALAD WFAVTALFKHPLGIPIPHTAIIKRKKDQLGEGLGTFVRENFLSPPVVETKLRDAQIPS RLGKWLSEATHAQRVAAETATVLRVLVELLRDEDIQQVIDRMIVRRIAEPQWGPPAGR VLATLLAENRQEAFIQLLADRAFQWSLNAGVVIQRVVERDSPSWSPRFIDHLVGDRIH RELMEFTDKVRRNPDHELRRSATRFLFDFADDLQHDPATVARADAIKEELMARDEIAT AAAAAWKTLKRLVLEGVDDPSSALRTRITDAVIRIGESLRDDADLRDKVDSWTVRAAQ HLVSEYGVEITAIITETIERWDAEEASRRIELHVGRDLQFIRINGTVVGAMAGLAIYA IAQLLF" CDS 566040..566462 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0484" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0484, -, len: 140 aa. Equivalent to Rv0474, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Probable transcriptional regulator, highly similar to others e.g. CAC04034.1|AL391406 putative DNA-binding protein from Streptomyces coelicolor (141 aa); N-terminus of NP_104173.1|14023352|BAB49959.1|AP003000 transcriptional regulator from Mesorhizobium loti (219 aa); N-terminus of A83618|PA0225 probable transcription regulator from Pseudomonas aeruginosa (179 aa); SINR_BACSU|P06533 sinr protein from Bacillus subtilis (111 aa), FASTA scores: opt: 147, E(): 8.9e-06, (30.6% identity in 111 aa overlap). Also similar to other hypothetical proteins e.g. X66407|RRPHAS_1|ORF1 from Rhodococcus ruber (171 aa), FASTA scores: opt: 280, E(): 4.8e-12, (43.6% identity in 117 aa overlap). Also similar to Rv2745c from Mycobacterium tuberculosis. Contains probable helix-turn-helix domain at aa 35-56 (Score 1709, +5.01 SD). Protein product from Mb0484 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0484 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248191.1" /translation="MSSEEKLAAKVSTKASDVASDIGSFIRSQRETAHVSMRQLAERS GVSNPYLSQVERGLRKPSADVLSQIAKALRVSAEVLYVRAGILEPSETSQVRDAIITD TAITERQKQILLDIYASFTHQNEATREECPSDPTPTDD" CDS 566816..567415 /codon_start=1 /transl_table=11 /gene="hbhA" /locus_tag="BQ2027_MB0485" /product="iron-regulated heparin binding hemagglutinin hbha (adhesin)" /note="Mb0485, hbhA, len: 199 aa. Equivalent to Rv0475, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). hbhA, heparin-binding hemagglutinin (see citations below), equivalent to CAC31971.1|AL583925 possible hemagglutinin from Mycobacterium leprae (188 aa). Contains possible N-terminal signal sequence and K-A-rich region at C-terminus: SUBCELLULAR LOCATION: SURFACE ASSOCIATED. Protein product from Mb0485 detected using shotgun mass spectrometry. Mb0485 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248192.1" /translation="MAENSNIDDIKAPLLAALGAADLALATVNELITNLRERAEETRT DTRSRVEESRARLTKLQEDLPEQLTELREKFTAEELRKAAEGYLEAATSRYNELVERG EAALERLRSQQSFEEVSARAEGYVDQAVELTQEALGTVASQTRAVGERAAKLVGIELP KKAAPAKKAAPAKKAAPAKKAAAKKAPAKKAAAKKVTQK" CDS 567527..567790 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0486" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0486, -, len: 87 aa. Equivalent to Rv0476, len: 87 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 aa overlap). Possible conserved transmembrane protein, equivalent to CAC31970.1|AL583925 conserved membrane protein from Mycobacterium leprae (95 aa). Also highly similar to CAC04036.1|AL391406 putative membrane protein from Streptomyces coelicolor (113 aa). Contains PS00606 Beta-ketoacyl synthases active site. Protein product from Mb0486 detected using SWATH mass spectrometry. Mb0486 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248193.1" /translation="MLVLLVAVLVTAVYAFVHAALQRPDAYTAADKLTKPVWLVILGA AVALASILYPVLGVLGMAMSACASGVYLVDVRPKLLEIQGKSR" CDS 567795..568241 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0487" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb0487, -, len: 148 aa. Equivalent to Rv0477, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 148 aa overlap). Possible conserved secreted protein, equivalent to CAC31969.1|AL583925 hypothetical protein from Mycobacterium leprae (123 aa). Also similar to G83406|PA1914 conserved hypothetical protein from Pseudomonas aeruginosa (408 aa). Contains possible N-terminal signal sequence. Mb0487 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248194.1" /translation="MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPR LVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQ FICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF" CDS 568241..568915 /codon_start=1 /transl_table=11 /gene="deoC" /locus_tag="BQ2027_MB0488" /product="PROBABLE DEOXYRIBOSE-PHOSPHATE ALDOLASE DEOC (PHOSPHODEOXYRIBOALDOLASE) (DEOXYRIBOALDOLASE)" /note="Mb0488, deoC, len: 224 aa. Equivalent to Rv0478, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Probable deoC, deoxyribose-phosphate aldolase (EC 4.1.2.4), equivalent to Q9CB45|DEOC_MYCLE DEOXYRIBOSE-PHOSPHATE ALDOLASE from M. leprae (226 aa). Also highly similar to others e.g. DEOC_BACSU|P39121 from Bacillus subtilis (214 aa), FASTA scores: opt: 543, E(): 1.4e-26, (45.9% identity in 209 aa overlap); etc. BELONGS TO THE DEOC/FBAB FAMILY OF ALDOLASES, DEOC SUBFAMILY. Protein product from Mb0488 detected using SWATH mass spectrometry. Mb0488 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248195.1" /translation="MLGQPTRAQLAALVDHTLLKPETTRADVAALVAEAAELGVYAVC VSPSMVPVAVQAGGVRVAAVTGFPSGKHVSSVKAHEAAAALASGASEIDMVIDIGAAL CGDIDAVRSDIEAVRAAAAGAVLKVIVESAVLLGQSNAHTLVDACRAAEDAGADFVKT STGCHPAGGATVRAVELMAETVGPRLGVKASGGIRTAADAVAMLNAGATRLGLSGTRA VLDGLS" CDS complement(568940..569986) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0489C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb0489c, -, len: 348 aa. Equivalent to Rv0479c, len: 348 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 348 aa overlap). Probable conserved membrane protein, equivalent to CAC31967.1|AL583925 possible secreted protein from Mycobacterium leprae (254 aa); and C-terminus highly similar to AAF74996.1|AF143402_1|AF143402 putative multicopper oxidase from Mycobacterium avium (149 aa). Contains hydrophobic domain in centre of protein. Protein product from Mb0489c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0489c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248196.1" /translation="MTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAG HIQEPVSPPTQPEQQPQTEHLAASHAHTRRSGRQAAHQAWDPTGLLAAQEEEPAAVKT KRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACVVKDQATASFG VAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQNVRLKNTPNSRGTIGALDA TITWSSEGIKESVQNAIPILGAFVTSSVVTHPADGTVELKGLLNNITAKPIVAGKGLE LQIINFNTLGFSLPKETVQSTLNEFTSSLTKNYPLGIHADSVQVTSTGVVSRFSTRDA AIPTGIQNPCFSHI" CDS complement(569983..570918) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0490C" /product="possible amidohydrolase" /note="Mb0490c, -, len: 311 aa. Equivalent to Rv0480c, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 311 aa overlap). Conserved hypothetical protein, equivalent, but longer 60 aa in N-terminus, to CAC31966.1|AL583925 putative hydrolase from Mycobacterium leprae (271 aa). Also similar to several hypothetical proteins and hydrolases e.g. AL096822|SCGD3_8 probable hydrolase from Streptomyces coelicolor (264 aa), FASTA scores: opt: 368, E(): 6.1e-15, (34.2% identity in 272 aa overlap); and YAUB_SCHPO|Q10166 hypothetical 35.7 kd protein c26a3.11 from S. pombe (322 aa), FASTA scores: opt: 338, E():1.4e-13, (30.3% identity in 277 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 56 bp deletion results in a different NH2 part and leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (329 aa versus 340 aa). Protein product from Mb0490c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0490c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248197.1" /translation="MRRARRRAQAGLPGSCARRCGALVAGPRLARMRIALAQIRSGTD PAANLQLVGKYAGEAATAGAQLVVFPEATMCRLGVPLRQVAEPVDGPWANGVRRIATE AGITVIAGMFTPTGDGRVTNTLIAAGPGTPNQPDAHYHKIHLYDAFGFTESRTVAPGR EPVVVVVDGVRVGLTVCYDIRFPALYTELARRGAQLIAVCASWGSGPGKLEQWTLLAR ARALDSMSYVAAAGQADPGDARTGVGASSAAPTGVGGSLVASPLGEVVVSAGTQPQLL VADIDVDNVAAARDRIAVLRNQTDFVQIDKAQSRG" CDS complement(570951..571475) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0491C" /product="HYPOTHETICAL PROTEIN" /note="Mb0491c, -, len: 174 aa. Equivalent to Rv0481c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Hypothetical unknown protein. Protein product from Mb0491c detected using SWATH mass spectrometry. Mb0491c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248198.1" /translation="MPRSFDMSADYEGSVEEVHRAFYEADYWKARLAETPVDVATLES IRVGGDSGDDGTIEVVTLQMVRSHNLPGLVTQLHRGDLSVRREETWGPVKEGIATASI AGSIVDAPVNLWGTAVLSPIPESGGSRMTLQVTIQVRIPFIGGKLERLIGTQLSQLVT IEQRFTTLWITNNV" CDS 571502..572611 /codon_start=1 /transl_table=11 /gene="murB" /locus_tag="BQ2027_MB0492" /product="PROBABLE UDP-N-ACETYLENOLPYRUVOYLGLUCOSAMINE REDUCTASE MURB (UDP-N-ACETYLMURAMATE DEHYDROGENASE)" /note="Mb0492, murB, len: 369 aa. Equivalent to Rv0482, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 369 aa overlap). Probable murB, UDP-N-acetylenolpyruvoylglucosamine reductase (EC 1.1.1.158), equivalent to CAC31964.1|AL583925 UDP-N-acetylenolpyruvoylglucosamine reductase from Mycobacterium leprae (367 aa). Also highly similar to others e.g. MURB_ECOLI|P08373 UDP-N-acetylenolpyruvoylglucosamine reductase from Escherichia coli (342 aa), FASTA scores: opt: 292, E(): 6.3e-12, (33.5% identity in 355 aa overlap); etc. BELONGS TO THE MURB FAMILY. COFACTOR: FAD. Protein product from Mb0492 detected using shotgun mass spectrometry and SWATH mass spectrometry." /protein_id="CAB5248199.1" /translation="MKRSGVGSLFAGAHIAEAVPLAPLTTLRVGPIARRVITCTSAEQ VVAALRHLDSAAKTGADRPLVFAGGSNLVIAENLTDLTVVRLANSGITIDGNLVRAEA GAVFDDVVVRAIEQGLGGLECLSGIPGSAGATPVQNVGAYGAEVSDTITRVRLLDRCT GEVRWVSARDLRFGYRTSVLKHADGLAVPTVVLEVEFALDPSGRSAPLRYGELIAALN ATSGERADPQAVREAVLALRARKGMVLDPTDHDTWSVGSFFTNPVVTQDVYERLAGDA ATRKDGPVPHYPAPDGVKLAAGWLVERAGFGKGYPDAGAAPCRLSTKHALALTNRGGA TAEDVVTLARAVRDGVHDVFGITLKPEPVLIGCML" CDS 572673..574028 /codon_start=1 /transl_table=11 /gene="lprQ" /locus_tag="BQ2027_MB0493" /product="PROBABLE CONSERVED LIPOPROTEIN LPRQ" /note="Mb0493, lprQ, len: 451 aa. Equivalent to Rv0483, len: 451 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 451 aa overlap). Probable lprQ, conserved lipoprotein, equivalent to CAC31963.1|AL583925|ML2446 possible lipoprotein from Mycobacterium leprae (441 aa); appears longer than ML2446, so start may be further downstream. Shows also similarity with MLCL383_24|O07707 HYPOTHETICAL 43.6 KD PROTEIN from Mycobacterium leprae; and to Q49706|B1496_F2_81 (271 aa). Similar to others lipoproteins from other organisms. Also similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv0116c, Rv0192, Rv1433, Rv2518c. Contains potential N-terminal signal sequence and appropriately positioned PS00013 prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0493 detected using SWATH mass spectrometry. Mb0493 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248200.1" /translation="MVIRVLFRPVSLIPVNNSSTPQSQGPISRRLALTALGFGVLAPN VLVACAGKVTKLAEKRPPPAPRLTFRPADSAADVVPIAPISVEVGDGWFQRVALTNSA GKVVAGAYSRDRTIYTITEPLGYDTTYTWSGSAVGHDGKAVPVAGKFTTVAPVKTINA GFQLADGQTVGIAAPVIIQFDSPISDKAAVERALTVTTDPPVEGGWAWLPDEAQGARV HWRPREYYPAGTTVDVDAKLYGLPFGDGAYGAQDMSLHFQIGRRQVVKAEVSSHRIQV VTDAGVIMDFPCSYGEADLARNVTRNGIHVVTEKYSDFYMSNPAAGYSHIHERWAVRI SNNGEFIHANPMSAGAQGNSNVTNGCINLSTENAEQYYRSAVYGDPVEVTGSSIQLSY ADGDIWDWAVDWDTWVSMSALPPPAAKPAATQIPVTAPVTPSDAPTPSGTPTTTNGPG G" CDS complement(574009..574764) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0494C" /product="probable short-chain type oxidoreductase" /note="Mb0494c, -, len: 251 aa. Equivalent to Rv0484c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 251 aa overlap). Probable short-chain oxidoreductase (EC 1.-.-.-), highly similar to others e.g. T36118|4678912|CAB41284.1|AL049707 probable oxidoreductase from Streptomyces coelicolor (260 aa); YDFG_HAEIN|P45200|HI1430 hypothetical oxidoreductase (SDR family) from Haemophilus influenzae (252 aa), FASTA scores: opt: 496, E(): 7.9e-25, (35.0 % identity in 243 aa overlap); etc. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. STRONG SIMILARITY, TO BACTERIAL YDFG HOMOLOGS. Protein product from Mb0494c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0494c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248201.1" /translation="MTTIGTRKRVAVVTGASSGIGEATARTLAAQGFHVVAVARRADR ITALANQIGGTAIVADVTDDAAVEALARALSRVDVLVNNAGGAKGLQFVADADLEHWR WMWDTNVLGTLRVTRALLPKLIDSGDGLIVTVTSIAALEVYDGGAGYTAAKHAQGALH RTLRGELLGKPVRLTEIAPGAVETEFSLVRFDGDQQRADAVYAGMTPLVAADVAEVIG FVATRPSHVNLDQIVIRPRDQASASRRATHPVR" CDS 574947..576263 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0495" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0495, -, len: 438 aa. Equivalent to Rv0485, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 438 aa overlap). Possible transcriptional repressor, member of the NAGC/XYLR repressor FAMILY; similar to several e.g. D87820_3|O32446|D82254 NAGC N-acetylglucosamine repressor from Vibrio cholerae (404 aa), FASTA scores: opt: 378, E(): 1.2e-17, (26.9% identity in 350 aa overlap); NAGC_ECOLI|P15301 N-acetylglucosamine repressor from Escherichia coli (406 aa), FASTA scores: opt: 305, E(): 1.8e-12, (21.8% identity in 357 aa overlap); etc. Protein product from Mb0495 detected using SWATH mass spectrometry. Mb0495 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248202.1" /translation="MYSTNRTSQSLSRKPGRKHQLRSHRYVMPPSLHLSDSAAASVFR AVRLRGPVGRDVIAGSTSLSIATVNRQVIALLEAGLLRERADLAVSGAIGRPRVPVEV NHEPFVTLGIHIGARTTSIVATDLFGRTLDTVETPTPRNAAGAALTSLADSADRYLQR WRRRRALWVGVTLGGAVDSATGHVDHPRLGWRQAPVGPVLADALGLPVSVASHVDAMA GAELMLGMRRFAPSSSTSLYVYARETVGYALMIGGRVHCPASGPGTIAPLPVHSEMLG GTGQLESTVSDEAVLAAARRLRIIPGIASRTRTGGSATAITDLLRVARAGNQQAKELL AERARVLGGAVALLRDLLNPDEVVVGGQAFTEYPEAMEQVEAAFTAGSVLAPRDIRVT VFGNRVQEAGAGIVSLSGLYADPLGALRRSGALDARLQDTAPEALA" CDS 576311..577753 /codon_start=1 /transl_table=11 /gene="msha" /locus_tag="BQ2027_MB0496" /product="glycosyltransferase msha" /note="Mb0496, -, len: 480 aa. Equivalent to Rv0486, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 480 aa overlap). Mannosyltransferase (EC 2.4.1.-) (see citations below), highly similar to P54138|Y486_MYCLE|ML2443 possible glycosyl transferase from Mycobacterium leprae (428 aa); and S72892|B2168_C2_201 probable hexosyltransferase (EC 2.4.1.-) from Mycobacterium leprae (409 aa), FASTA scores: opt: 2375, E(): 0, (86.4% identity in 413 aa overlap). Also highly similar to CAC04040.1|AL391406 putative transferase from Streptomyces coelicolor (496 aa); and similar to various transferases e.g. NP_437172.1|NC_003078 putative membrane-anchored glycosyltransferase protein from Sinorhizobium meliloti (416 aa); O26550|U67601_1 LPS BIOSYNTHESIS RELATED PROTEIN from Methanococcus jannaschii (411 aa), FASTA score: (25.3% identity in 387 aa overlap); etc. Also similar to CAC87824.1|AJ316594 putative sucrose-phosphate synthase from Nostoc punctiforme (422 aa). Contains PS00039 DEAD-box subfamily ATP-dependent helicases signature. Protein product from Mb0496 detected using SWATH mass spectrometry. Mb0496 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248203.1" /translation="MAGVRHDDGSGLIAQRRPVRGEGATRSRGPSGPSNRNVSAADDP RRVALLAVHTSPLAQPGTGDAGGMNVYMLQSALHLARRGIEVEIFTRATASADPPVVR VAPGVLVRNVVAGPFEGLDKYDLPTQLCAFAAGVLRAEAVHEPGYYDIVHSHYWLSGQ VGWLARDRWAVPLVHTAHTLAAVKNAALADGDGPEPPLRTVGEQQVVDEADRLIVNTD DEARQVISLHGADPARIDVVHPGVDLDVFRPGDRRAARAALGLPVDERVVAFVGRIQP LKAPDIVLRAAAKLPGVRIIVAGGPSGSGLASPDGLVRLADELGISARVTFLPPQSHT DLATLFRAADLVAVPSYSESFGLVAVEAQACGTPVVAAAVGGLPVAVRDGITGTLVSG HEVGQWADAIDHLLRLCAGPRGRVMSRAAARHAATFSWENTTDALLASYRRAIGEYNA ERQRRGGEVISDLVAVGKPRHWTPRRGVGA" CDS 577750..578301 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0497" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0497, -, len: 183 aa. Equivalent to Rv0487, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Conserved hypothetical protein, highly similar to P54139|Y487_MYCLE|U00018_38|ML2442 HYPOTHETICAL 20.8 KDA PROTEIN from Mycobacterium leprae (184 aa), FASTA scores: opt: 760, E(): 2.4 e-34, (73.0% identity in 159 aa overlap). Also highly similar to CAC04041.1|AL391406 conserved hypothetical protein from Streptomyces coelicolor (168 aa). Protein product from Mb0497 detected using SWATH mass spectrometry. Mb0497 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248204.1" /translation="MTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGER KLKINTILSVGEHSVRVEAFVCRKPDENREDVYRFLLRRNRRLYGVAYTLDNVGDIYL VGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWRLSRGESLQNL QAFAHLRPTTMQSAQRDEKELGG" CDS 578685..579290 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0498" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0498, -, len: 201 aa. Equivalent to Rv0488, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Probable conserved integral membrane protein, LysE family possibly involved in transport of Lysine, similar to others and conserved hypothetical proteins e.g. AB93746.1|AL357613 putative membrane transport protein from Streptomyces coelicolor (204 aa); D83100|PA4365 probable transporter from Pseudomonas aeruginosa (200 aa); YGGA_ECOLI|P11667 hypothetical 21.7 kd protein from Escherichia coli (197 aa), FASTA scores: opt: 382, E(): 1.1e-19, (39.1% identity in 179 aa overlap); CGLYSEG_2 C|P94633 LYSINE EXPORTER PROTEIN (236 aa), FASTA scores: E(): 2.3e-07, (33.3% identity in 219 aa overlap). Also similar to Rv1986 from Mycobacterium tuberculosis. Mb0498 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248205.1" /translation="MMTLKVAIGPQNAFVLRQGIRREYVLVIVALCGIADGALIAAGV GGFAALIHAHPNMTLVARFGGAAFLIGYALLAARNAWRPSGLVPSESGPAALIGVVQM CLVVTFLNPHVYLDTVVLIGALANEESDLRWFFGAGAWAASVVWFAVLGFSAGRLQPF FATPAAWRILDALVAVTMIGVAVVVLVTSPSVPTANVALII" CDS 579447..580196 /codon_start=1 /transl_table=11 /gene="gpm1" /locus_tag="BQ2027_MB0499" /product="PROBABLE PHOSPHOGLYCERATE MUTASE 1 GPM1 (PHOSPHOGLYCEROMUTASE) (PGAM) (BPG-DEPENDENT PGAM)" /note="Mb0499, gpm1, len: 249 aa. Equivalent to Rv0489, len: 249 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 249 aa overlap). Probable gpm1, phosphoglycerate mutase 1 (EC 5.4.2.1), equivalent to P53531|PMGY_MYCLE PHOSPHOGLYCERATE MUTASE from Mycobacterium leprae (247 aa). Also highly similar to others e.g. PMG1_ECOLI|P31217 (249 aa), FASTA scores: opt: 805, E(): 0, (51.4% identity in 245 aa overlap); etc. Contains PS00175 Phosphoglycerate mutase family phosphohistidine signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE PHOSPHOGLYCERATE MUTASE FAMILY. Note that previously known as gpm. Protein product from Mb0499 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0499 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248206.1" /translation="MANTGSLVLLRHGESDWNALNLFTGWVDVGLTDKGQAEAVRSGE LIAEHDLLPDVLYTSLLRRAITTAHLALDSADRLWIPVRRSWRLNERHYGALQGLDKA ETKARYGEEQFMAWRRSYDTPPPPIERGSQFSQDADPRYADIGGGPLTECLADVVARF LPYFTDVIVGDLRVGKTVLIVAHGNSLRALVKHLDQMSDDEIVGLNIPTGIPLRYDLD SAMRPLVRGGTYLDPEAAAAGAAAVAGQGRG" CDS 580370..581602 /codon_start=1 /transl_table=11 /gene="senX3" /locus_tag="BQ2027_MB0500" /product="PUTATIVE TWO COMPONENT SENSOR HISTIDINE KINASE SENX3" /note="Mb0500, senX3, len: 410 aa. Equivalent to Rv0490, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 410 aa overlap). Putative senX3, two-component sensor histidine kinase (EC 2.7.3.-), transmembrane protein (see citations below), equivalent to O07129|SEX3_MYCBO SENSOR-LIKE HISTIDINE KINASE SENX3 from Mycobacterium bovis BCG (410 aa), FASTA scores: E(): 0, (99.5% identity in 410 aa overlap); and highly similar to P54883|SEX3_MYCLE|SENX3 SENSOR-LIKE HISTIDINE KINASE from Mycobacterium leprae (443 aa), FASTA score: (83.8% identity in 408 aa overlap). Also highly similar, except in N-terminus, to CAC31957.1|AL583925 probable two-component system sensor histidine kinase from Mycobacterium leprae (441 aa). Also highly similar to sensor kinase proteins from other organisms e.g. CAB77323.1|AL160331 putative sensor kinase protein from Streptomyces coelicolor (426 aa). Protein product from Mb0500 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0500 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248207.1" /translation="MTVFSALLLAGVLSALALAVGGAVGMRLTSRVVEQRQRVATEWS GITVSQMLQCIVTLMPLGAAVVDTHRDVVYLNERAKELGLVRDRQLDDQAWRAARQAL GGEDVEFDLSPRKRSATGRSGLSVHGHARLLSEEDRRFAVVFVHDQSDYARMEAARRD FVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVLIEANRLGDMVAELIELS RLQGAERLPNMTDVDVDTIVSEAISRHKVAADNADIEVRTDAPSNLRVLGDQTLLVTA LANLVSNAIAYSPRGSLVSISRRRRGANIEIAVTDRGIGIAPEDQERVFERFFRGDKA RSRATGGSGLGLAIVKHVAANHDGTIRVWSKPGTGSTFTLALPALIEAYHDDERPEQA REPELRSNRSQREEELSR" CDS 581960..582643 /codon_start=1 /transl_table=11 /gene="regX3" /locus_tag="BQ2027_MB0501" /product="TWO COMPONENT SENSORY TRANSDUCTION PROTEIN REGX3 (TRANSCRIPTIONAL REGULATORY PROTEIN) (PROBABLY LUXR-FAMILY)" /note="Mb0501, regX3, len: 227 aa. Equivalent to Rv0491, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 227 aa overlap). regX3, response regulator protein (sensory transduction protein) (see citations below), equivalent to O07130|RGX3_MYCBO|REGX3 SENSORY TRANSDUCTION PROTEIN from Mycobacterium bovis BCG (227 aa); AAG09797.1|AF258346_2|AF258346|REGX3 response regulator from Mycobacterium smegmatis (228 aa); equivalent to P54884|RGX3_MYCLE|REGX3 SENSORY TRANSDUCTION PROTEIN from Mycobacterium leprae (198 aa), FASTA scores : E(): 0, (95.4% identity in 197 aa overlap). Also highly similar to other response regulators e.g. AAG43239.1|AF123314_2 |AF123314 putative response regulator from Corynebacterium glutamicum (232 aa). Protein product from Mb0501 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0501 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248208.1" /translation="MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGPAALAEFDRA GADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTARDSEIDKVVGLELGADDYVTKP YSARELIARIRAVLRRGGDDDSEMSDGVLESGPVRMDVERHVVSVNGDTITLPLKEFD LLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEADPANPVHLVTV RGLGYKLEG" CDS complement(582640..584529) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0502C" /product="PROBABLE OXIDOREDUCTASE GMC-TYPE" /note="Mb0502c, -, len: 629 aa. Equivalent to Rv0492c, len: 629 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 629 aa overlap). Probable oxidoreductase GMC type (EC 1.-.-.-), similar to others except in N-terminus e.g. P55582|AE000087_5|Y4NJ_RHISN HYPOTHETICAL GMC-TYPE OXIDOREDUCTASE from Rhizobium sp. (505 aa), FASTA scores: opt: 873, E():0, (34.3% identity in 502 aa overlap); YTH2_RHOER|P46371 HYPOTHETICAL 53.0 kd GMC-TYPE OXIDOREDUCTASE from Rhodococcus erythropolis (493 aa), FASTA score: (25.7% identity in 521 aa overlap); YTH2_RHOSO|P46371 hypothetical 53.0 kd gmc-type oxidoreductase from Rhodococcus erythropolis (493 aa), FASTA score: (25.7% identity in 521 aa overlap); NP_085596.1|NC_002679 probable oxidoreductase from Mesorhizobium loti (507 aa); NP_285451.1|NC_001264 GMC oxidoreductase from Deinococcus radiodurans (722 aa); NP_249055.1|NC_002516 probable oxidoreductase from Pseudomonas aeruginosa (531 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature, and PS00624 GMC oxidoreductases signature 2. BELONGS TO THE GMC OXIDOREDUCTASES FAMILY. COFACTOR: FAD (BY SIMILARITY). Note that start changed since first submission (previously 684 aa). Protein product from Mb0502c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0502c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248209.1" /translation="MSRLADRAKSYPLASFGAALLPPELGGPLPAQFVQRVDRYVTRL PATSRFAVRAGLASLAAASYLTTGRSLPRLHPDERARVLHRIAALSPEVAAAVEGLKA IVLLANGADTYAHELLARAQEHDAARPDAELTVILSADSPSVTRADAVVVGSGAGGAM VARTLARAGLDVVVLEEGRRWTVEEFRSTHPVDRYAGLYRGAGATVALGRPAVVLPMG RAVGGTTVVNSGTCFRPSLAVQRRWRDEFGLGLADPDQLGRRLDDAEQTLRVAPVPLE IMGRNGRLLLQAAKSLGWRAAPIPRNAPGCRGCCQCAIGCPSNAKFGVHLNALPQACA AGARIISWARVERILHRAGRAYGVRARRPDGTTLDVLADAVVVAAGATETPGLLRRSG LGGHPRLGHNLALHPATMLAGLFDDDVFAWRGVLQSAAVHEFHESDGVLIEATSTPPG MGSMVFPGYGAELLRWLDRAPQIATFGAMVADRGVGTVRSVRGETVVRYDIAPGEIAK LRVALQAIGRLFFAAGAVEVLTGIPGAPPMRSLPELQDVLRRANPRSLHLAAFHPTGT AAAGADEQLCPVDATGRLRGVEGVWVADASILPSCPEVNPQLSIMAMALAVADQTVAK VVGVR" CDS complement(584526..584855) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0503C" /product="HYPOTHETICAL PROTEIN" /note="Mb0503c, -, len: 109 aa. Equivalent to Rv0492A, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Hypothetical unknown protein. GC plot suggests CDS. Mb0503c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248210.1" /translation="MSFLLDPPLLFVCGVLIERRLPVDRRDAAEAAALGVFFGASFGL YHNVPGLGMLWRPFRAQNGRDFMWNSGVFSVDVARAEWPLHAMAAAIFATYPFFIKLG RRLGRRI" CDS complement(584852..585841) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0504C" /product="conserved protein" /note="Mb0504c, -, len: 329 aa. Equivalent to Rv0493c, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 329 aa overlap). Conserved hypothetical protein, showing some similarity to U00018_33|B2168_F2_93 from Mycobacterium leprae (167 aa), FASTA scores: opt: 166, E(): 0.00077, (35.9% identity in 131 aa overlap). Protein product from Mb0504c detected using SWATH mass spectrometry. Mb0504c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248211.1" /translation="MGESTTQPAGGAAVDDETRSAALPRWRGAAGRLEVWYATLSDPL TRTGLWVHCETVAPTTGGPYAHGWVTWFPPDAPPGTERFGPQPAQPAAGPAWFDIAGV RMAPAELTGRTRSLAWELSWKDTAAPLWTFPRVAWERELLPGAQVVIAPTAVFAGSLA VGETTHRVDSWRGGVAHIYGHGNAKRWGWIHADLGDGDVLEVVTAVSHKPGLRRLAPL AFVRFRIDGKDWPASPLPSLRMRTTLGVRHWQLEGRIGGREALIRVDQPPERCVSLGY TDPDGAKAVCTNTEQADIHIELGGRHWSVLGTGHAEVGLRGTAAPAIKEGTPA" CDS 585846..586574 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0505" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY GNTR-FAMILY)" /note="Mb0505, -, len: 242 aa. Equivalent to Rv0494, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 242 aa overlap). Probable transcriptional regulator, GntR family, with C-terminal part highly similar to S72893|B2168_C2_205 hypothetical protein from Mycobacterium leprae (105 aa). Also similar to other transcription regulators e.g. PDHR_ECOLI|P06957 pyruvate dehydrogenase complex repressor PDHR or GENA from Escherichia coli (254 aa), FASTA scores: opt: 284, E(): 1.2e-11, (32.6% identity in 224 aa overlap); etc. Contains PS00043 Bacterial regulatory proteins, gntR family signature, and probable helix-turn helix motif from aa 50-71 (Score 1229, +3.37 SD)." /protein_id="CAB5248212.1" /translation="MVEPMNQSSVFQPPDRQRVDERIATTIADAILDGVFPPGSTLPP ERDLAERLGVNRTSLRQGLARLQQMGLIEVRHGSGSVVRDPEGLTHPAVVEALVRKLG PDFLVELLEIRAALGPLIGRLAAARSTPEDAEALCAALEVVQQADTAAARQAADLAYF RVLIHSTRNRALGLLYRWVEHAFGGREHALTGAYDDADPVLTDLRAINGAVLAGDPAA AAATVEAYLNASALRMVKSYRDRA" CDS complement(586575..587465) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0506C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0506c, -, len: 296 aa. Equivalent to Rv0495c, len: 296 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 296 aa overlap). Conserved hypothetical protein, highly similar to S72915|B2168_F1_37 hypothetical protein from Mycobacterium leprae (323 aa), FASTA scores: opt: 1615, E(): 0, (82.7% identity in 271 aa overlap); and P54579|Y495_MYCLE|ML243|13094009|CAC31952.1| AL583925 conserved hypothetical protein from Mycobacterium leprae (277 aa). Also highly similar to Q9X8H2|Y716_STRCO|SCE7.16 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (271 aa). Protein product from Mb0506c detected using SWATH mass spectrometry. Mb0506c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248213.1" /translation="MWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPG QEVELDFAREWVEFYDPDNPEHLIAADLTWLLSRWACVFGTPACQGTVAGRPNDGCCS HGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQPQHRTRKHKGA CIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLPIRRSQEWVTRPDGTEILK TTLTEYDRRGWGSGGADLHWYCTGDPAAHVGTKQVWQSLADELTELLGEKAYGELAAM CKRRSQLGLIAVHPATRAAQ" CDS 587545..588531 /codon_start=1 /transl_table=11 /gene="ppx1" /locus_tag="BQ2027_MB0507" /product="Exopolyphosphatase (EC" /EC_number="3.6.1.11" /note="Mb0507, -, len: 328 aa. Equivalent to Rv0496, len: 328 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 328 aa overlap). Conserved hypothetical protein, highly similar to S72894|467046|AAA17230.1|U00018 exopolyphosphatase (EC 3.6.1.11) ppx from Mycobacterium leprae (406 aa), FASTA scores: opt: 1902, E(): 0, (86.6% identity in 343 aa overlap); and P54882|Y496_MYCLE|ML2434|13094008|CAC31951.1 |AL583925 HYPOTHETICAL 36.2 KDA PROTEIN from Mycobacterium leprae (339 aa). Also highly similar to hypothetical proteins and exopolyphosphatases e.g. Q9X8H1|Y715_STRCO|SCE7.15c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (309 aa). C-terminal region similar to CGU31224_1|Q46054 protein similar to ppx gene product of Mycobacterium leprae from Cornybacterium glutamicum (140 aa), FASTA scores: opt: 615, E(): 2.7e-33, (70.9% identity in 134 aa overlap). Protein product from Mb0507 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0507 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248214.1" /translation="MVDAHRGGHPTPMSSTKATLRLAEATDSSGKITKRGADKLISTI DEFAKIAISSGCAELMAFATSAVRDAENSEDVLSRVRKETGVELQALRGEDESRLTFL AVRRWYGWSAGRILNLDIGGGSLEVSSGVDEEPEIALSLPLGAGRLTREWLPDDPPGR RRVAMLRDWLDAELAEPSVTVLEAGSPDLAVATSKTFRSLARLTGAAPSMAGPRVKRT LTANGLRQLIAFISRMTAVDRAELEGVSADRAPQIVAGALVAEASMRALSIEAVEICP WALREGLILRKLDSEADGTALIESSSVHTSVRAVGGQPADRNAANRSRGSKP" CDS 588528..589460 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0508" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0508, -, len: 310 aa. Equivalent to Rv0497, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Probable conserved transmembrane protein, equivalent (but shorter in C-terminus) to P54580|Y497_MYCLE|ML2433 HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium leprae (355 aa). N-terminus highly similar to S72922|B2168_C1_166|467074 hypothetical protein from Mycobacterium leprae (118 aa), FASTA scores: opt: 350, E(): 1.4e-12, (57.9% identity in 114 aa overlap); and hydrophobic C-terminus, highly similar to S72895|B2168_C2_209|467047 hypothetical protein from Mycobacterium leprae (241 aa), FASTA scores: opt: 473, E(): 8e-19, (53.9% identity in 241 aa). Protein product from Mb0508 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0508 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248215.1" /translation="MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDA ITVAELTGEIPIIRDDHHHAGPDAHASQSPAANGRVQVGEAAPQSPAEPVAEQVAEEP TRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPPSGAEHMSPDP VEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAAAAGTDVEGDGAAEARVAR RALDVVPTLWRGALVVLQSILAVAFGAGLFIAFDQLWRWNSIVALVLSVMVILGLVVS VRAVRKTEDIASTLIAVAVGALITLGPLALLQSG" CDS 589476..590318 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0509" /product="AP endonuclease, family protein 2" /note="Mb0509, -, len: 280 aa. Equivalent to Rv0498, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Conserved hypothetical protein, highly similar to P54581|Y498_MYCLE|ML2432 HYPOTHETICAL 30.5 KDA PROTEIN from Mycobacterium leprae (280 aa); and S72896|B2168_C2_210 hypothetical protein from Mycobacterium leprae (244 aa), FASTA scores: opt: 1486, E():0, (89.3% identity in 244 aa overlap). Also similar to Q9X8H0|Y714_STRCO|SCE7.14c HYPOTHETICAL PROTEIN from Streptomyces coelicolor. Protein product from Mb0509 detected using SWATH mass spectrometry. Mb0509 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248216.1" /translation="MRPAIKVGLSTASVYPLRAEAAFEYADRLGYDGVELMVWGESVS QDIDAVRKLSRRYRVPVLSVHAPCLLISQRVWGANPILKLDRSVRAAEQLGAQTVVVH PPFRWQRRYAEGFSDQVAALEAASTVMVAVENMFPFRADRFFGAGQSRERMRKRGGGP GPAISAFAPSYDPLDGNHAHYTLDLSHTATAGTDSLDMARRMGPGLVHLHLCDGSGLP ADEHLVPGRGTQPTAEVCQMLAGSGFVGHVVLEVSTSSARSANERESMLAESLQFART HLLR" CDS 590334..591209 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0510" /product="TesB-like acyl-CoA thioesterase 3" /note="Mb0510, -, len: 291 aa. Equivalent to Rv0499, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Conserved hypothetical protein, showing some similarity to AL031184|SC2A11_16|T34762 hypothetical protein from Streptomyces coelicolor (340 aa), FASTA scores: opt: 240, E(): 1.8e-07, (28.9% identity in 270 aa overlap). Protein product from Mb0510 detected using SWATH mass spectrometry. Mb0510 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248217.1" /translation="MNALFTTAMALRPLDSDPGNPACRVFEGELNEHWTIGPKVHGGA MVALCANAARTAYGAAGQQPMRQPVAVSASFLWAPDPGTMRLVTSIRKRGRRISVADV ELTQGGRTAVHAVVTLGEPEHFLPGVDGSGGASGTAPLLSANPVVELMAPEPPEGVVP IGPGHQLAGLVHLGEGCDVRPVLSTLRSATDGRPPVIQLWARPRGVAPDALFALLCGD LSAPVTFAVDRTGWAPTVALTAYLRALPADGWLRVLCTCVEIGQDWFDEDHIVVDRLG RIVVQTRQLAMVPAQ" CDS 591234..592121 /codon_start=1 /transl_table=11 /gene="proC" /locus_tag="BQ2027_MB0511" /product="PROBABLE PYRROLINE-5-CARBOXYLATE REDUCTASE PROC (P5CR) (P5C REDUCTASE)" /note="Mb0511, proC, len: 295 aa. Equivalent to Rv0500, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 295 aa overlap). Probable proC, Pyrroline-5-carboxylate reductase (EC 1.5.1.2) (see citation below), equivalent to P46725|PROC_MYCLE PYRROLINE-5-CARBOXYLATE REDUCTASE from Mycobacterium leprae (294 aa), FASTA scores: opt: 1473, E(): 0, (82.4% identity in 295 aa overlap). Also similar to others e.g. P46540|PROC_CORGL PYRROLINE-5-CARBOXYLATE REDUCTASE from Corynebacterium glutamicum (270 aa); T36286|4803683|CAB42663.1|AL049819 pyrroline-5-carboxylate reductase from Streptomyces coelicolor (284 aa); etc. BELONGS TO THE PYRROLINE-5-CARBOXYLATE REDUCTASE FAMILY. Protein product from Mb0511 detected using shotgun mass spectrometry. Mb0511 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248218.1" /translation="MLFGMARIAIIGGGSIGEALLSGLLRAGRQVKDLVVAERMPDRA NYLAQTYSVLVTSAADAVENATFVVVAVKPADVEPVIADLANATAAAENDSAEQVFVT VVAGITIAYFESKLPAGTPVVRAMPNAAALVGAGVTALAKGRFVTPQQLEEVSALFDA VGGVLTVPESQLDAVTAVSGSGPAYFFLLVEALVDAGVGVGLSRQVATDLAAQTMAGS AAMLLERMDQDQGGANGELMGLRVDLTASRLRAAVTSPGGTTAAALRELERGGFRMAV DAAVQAAKSRSEQLRITPE" CDS 592262..592498 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0512" /product="Excisionase-like protein SCO3328" /note="Mb0512, -, len: 78 aa. Equivalent to Rv0500A, len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 78 aa overlap). Conserved hypothetical protein, similar to proteins from Mycobacterium leprae and Streptomyces coelicolor e.g. U00018_25 from Mycobacterium leprae cosmid B2168 (86 aa), FASTA scores: opt: 428, E(): 1.3e-27, (82.6% identity in 86 aa overlap); AL079345|SCE68_26 from Streptomyces coelicolor cosmid E6 (70 aa), FASTA scores: opt: 252, E(): 1.2 e-13, (72.2 identity in 54 aa overlap). Protein product from Mb0512 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0512 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248219.1" /translation="MTSTNGPSARDTGFVEGQQAKTQLLTVAEVAALMRVSKMTVYRL VHNGELPAVRVGRSFRVHAKAVHDMLETSYFDAG" CDS 592626..592727 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0512A" /product="30S small ribosomal subunit" /note="Mb0512A, len: 33 aa. Equivalent to Rv0500B len: 33 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 33 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved hypothetical protein. Basic protein 18 of the 33 aa are Arg or Lys, with strong similarity to AL079345|SCE68_25 protein from Streptomyces coelicolor cosmid E6 (32 aa), FASTA scores: opt: 176, E(): 1e-06, (93.1% identity in 29 aa overlap). Same gene arrangement in both actinomycetes. Mb0512A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248220.1" /translation="MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK" CDS 592805..593935 /codon_start=1 /transl_table=11 /gene="galE2" /locus_tag="BQ2027_MB0513" /product="POSSIBLE UDP-GLUCOSE 4-EPIMERASE GALE2 (GALACTOWALDENASE) (UDP-GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHATE GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHO-GALACTOSE 4-EPIMERASE)" /note="Mb0513, galE2, len: 376 aa. Equivalent to Rv0501, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Possible galE2, UDP-glucose 4-epimerase (EC 5.1.3.2), highly similar (except in N-terminus) to CAC31944.1|AL583925 possible glucose epimerase/dehydratase from Mycobacterium leprae (364 aa). N-terminus highly similar to S72923|B2168_C1_174|467075|AAA17259.1|U00018 hypothetical protein from Mycobacterium leprae (180 aa), FASTA scores: opt: 934, E(): 0, (89.6% identity in 164 aa overlap); and C-terminus highly similar to S72898|467050|AAA17234.1|U00018 hypothetical protein from Mycobacterium leprae (168 aa), FASTA scores: opt: 928, E(): 0, (82.7% identity in 168 aa overlap). Also highly similar to T36274|5123671|CAB45360.1|AL079345 probable epimerase from Streptomyces coelicolor (353 aa); and similar in part to other epimerases e.g. GALE_ECOLI|P09147 UDP-glucose 4-epimerase from Escherichia coli (338 aa), FASTA scores: opt: 241, E(): 6.7e-09, (28.2% identity in 294 aa overlap); etc. BELONGS TO THE SUGAR EPIMERASE FAMILY. COFACTOR: NAD. Note that previously known as galE1. Protein product from Mb0513 detected using SWATH mass spectrometry. Mb0513 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248221.1" /translation="MSSSNGRGGAGGVGGSSEHPQYPKVVLVTGACRFLGGYLTARLA QNPLINRVIAVDAIAPSKDMLRRMGRAEFVRADIRNPFIAKVIRNGEVDTVVHAAAAS YAPRSGGSAALKELNVMGAMQLFAACQKAPSVRRVVLKSTSEVYGSSPHDPVMFTEDS SSRRPFSQGFPKDSLDIEGYVRALGRRRPDIAVTILRLANMIGPAMDTTLSRYLAGPL VPTIFGRDARLQLLHEQDALGALERAAMAGKAGTFNIGADGILMLSQAIRRAGRIPVP VPGFGVWALDSLRRANHYTELNREQFAYLSYGRVMDTTRMRVELGYQPKWTTVEAFDD YFRGRGLTPIIDPHRVRSWEGRAVGLAQRWGSRNPIPWSGLR" CDS 593942..595018 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0514" /product="Phospholipid/glycerol acyltransferase" /note="Mb0514, -, len: 358 aa. Equivalent to Rv0502, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 358 aa overlap). Conserved hypothetical protein, equivalent to P54878|Y502_MYCLE|ML2427 HYPOTHETICAL 40.5 KDA PROTEIN from Mycobacterium leprae (367 aa), FASTA scores: opt: 2042, E(): 0, (84.1% identity in 365 aa overlap). Also similar to T36273|SCE68.23c hypothetical protein from Streptomyces coelicolor (355 aa). C-terminal similar to AL021529|SC10A5_4|T34572 hypothetical protein from Streptomyces coelicolor (295 aa), FASTA score: (57.8% identity in 263 aa overlap); and to hypothetical proteins from Mycobacterium tuberculosis Rv1920|G70808 (287 aa); and Rv1428c|G70914 (275 aa). Protein product from Mb0514 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0514 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248222.1" /translation="MGNVAGETRANVIPLHTNRSRVAARRRAGQRAESRQHPSLLSDP NDRASAEQIAAVVREIDEHRRAAGATTSSTEATPNDLAQLVAAVAGFLRQRLTGDYSV DEFGFDPHFNSAIVRPLLRFFFKSWFRVEVSGVENIPRDGAALVVANHAGVLPFDGLM LSVAVHDEHPAHRDLRLLAADMVFDLPVIGEAARKAGHTMACTTDAHRLLASGELTAV FPEGYKGLGKRFEDRYRLQRFGRGGFVSAALRTKAPIVPCSIIGSEEIYPMLTDVKLL ARLFGLPYFPITPLFPLAGPVGLVPLPSKWRIAFGEPICTADYASTDADDPMVTFELT DQVRETIQQTLYRLLAGRRNIFFG" CDS complement(595022..595930) /codon_start=1 /transl_table=11 /gene="cmaA2" /locus_tag="BQ2027_MB0515C" /product="CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE 2 CMAA2 (CYCLOPROPANE FATTY ACID SYNTHASE) (CFA SYNTHASE) (CYCLOPROPANE MYCOLIC ACID SYNTHASE 2) (MYCOLIC ACID TRANS-CYCLOPROPANE SYNTHETASE)" /note="Mb0515c, cmaA2, len: 302 aa. Equivalent to Rv0503c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). cmaA2 (alternate gene name: cma2), cyclopropane-fatty-acyl-phospholipid synthase 2 (mycolic acid trans-cyclopropane synthetase) (EC 2.1.1.79) (see citations below). Note that this protein has 302 aa and not 322 aa: we have chosen a different initiation codon on the basis of homology). Equivalent to S72886|B2168_F3_130 hypothetical protein from Mycobacterium leprae (308 aa), FASTA score: (78.9% identity in 303 aa overlap); and highly similar to other proteins from Mycobacterium leprae. Also similar to other proteins from Mycobacterium tuberculosis and Mycobacterium bovis BCG e.g. MTV038_14|UMAA2|Rv0470c|MTV038.14 PUTATIVE MYCOLIC ACID SYNTHESIS/MODIFICATION PROTEIN (287 aa) (57.2% identity in 297 aa overlap). Protein product from Mb0515c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0515c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248223.1" /translation="MTSQGDTTSGTQLKPPVEAVRSHYDKSNEFFKLWLDPSMTYSCA YFERPDMTLEEAQYAKRKLALDKLNLEPGMTLLDIGCGWGSTMRHAVAEYDVNVIGLT LSENQYAHDKAMFDEVDSPRRKEVRIQGWEEFDEPVDRIVSLGAFEHFADGAGDAGFE RYDTFFKKFYNLTPDDGRMLLHTITIPDKEEAQELGLTSPMSLLRFIKFILTEIFPGG RLPRISQVDYYSSNAGWKVERYHRIGANYVPTLNAWADALQAHKDEAIALKGQETYDI YMHYLRGCSDLFRDKYTDVCQFTLVK" CDS complement(595953..596453) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0516C" /product="UPF0336 protein" /note="Mb0516c, -, len: 166 aa. Equivalent to Rv0504c, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 166 aa overlap). Conserved hypothetical protein, equivalent to P54879|Y504_MYCLE|ML2425 HYPOTHETICAL 18.7 KDA PROTEIN from Mycobacterium leprae (166 aa), FASTA scores: opt: 884, E(): 0, (83.1% identity in 166 aa overlap); and highly similar to other proteins from Mycobacterium leprae. Also highly similar to CAB77410.1|AL160431|SCD82.07 hypothetical protein from Streptomyces coelicolor (150 aa). Also similar to M. tuberculosis hypothetical proteins Rv0635|H70612 (158 aa); and Rv0637|B70613 (166 aa). Protein product from Mb0516c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0516c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248224.1" /translation="MTVPEEAQTLIGKHYRAPDHFLVGREKIREFAVAVKDDHPTHYS EPDAAAAGYPALVAPLTFLAIAGRRVQLEIFTKFNIPINIARVFHRDQKFRFHRPILA NDKLYFDTYLDSVIESHGTVLAEIRSEVTDAEGKPVVTSVVTMLGEAAHHEADADATV AAIASI" CDS complement(596615..597736) /codon_start=1 /transl_table=11 /gene="serB1" /locus_tag="BQ2027_MB0517C" /product="POSSIBLE PHOSPHOSERINE PHOSPHATASE SERB1 (PSP) (O-PHOSPHOSERINE PHOSPHOHYDROLASE) (PSPASE)" /note="Mb0517c, serB1, len: 373 aa. Equivalent to Rv0505c, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 373 aa overlap). Possible serB1, phosphoserine phosphatase (EC 3.1.3.3), equivalent (but longer ~70 aa in N-terminus) to S72914|serB phosphoserine phosphatase from Mycobacterium leprae (300 aa), FASTA scores: opt: 1570, E(): 0, (83.0% identity in 306 aa overlap). C-terminus highly similar to CAB55344.1|AJ010584 phosphoserine phosphatase from Streptomyces coelicolor (266 aa). Low similarity to SERB_ECOLI|P06862 phosphoserine phosphatase from Escherichia coli strains K12 and O157:H7 (322 aa), FASTA scores: opt: 148, E(): 0.043, (24.0% identity in 150 aa overlap). C-terminus is also similar to O33611|AB004855_1|IMD_STRCN PROTEIN INVOLVED IN INHIBITION OF MORPHOLOGICAL DIFFERENTIATION from Streptomyces cyaneus (277 aa), FASTA score: (37.7% identity in 252 aa overlap). SEEMS TO BELONG TO THE SERB FAMILY. Note that previously known as serB. Protein product from Mb0517c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0517c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248225.1" /translation="MGLTCWPRTAAGRVHDESRCGLANFDTALGLQINPRQPRAPPRI CRIGLITAAASATGQAPRLGVMMVSSHLGSPDQAGHVDLASPADPPPPDASASHSPVD MPAPVAAAGSDRQPPIDLTAAAFFDVDNTLVQGSSAVHFGRGLAARHYFTYRDVLGFL YAQAKFQLLGKENSNDVAAGRRKALAFIEGRSVAELVALGEEIYDEIIADKIWDGTRE LTQMHLDAGQQVWLITATPYELAATIARRLGLTGALGTVAESVDGIFTGRLVGEILHG TGKAHAVRSLAIREGLNLKRCTAYSDSYNDVPMLSLVGTAVAINPDARLRSLARERGW EIRDFRIARKAARIGVPSALALGAAGGALAALASRRQSR" CDS 597910..598353 /codon_start=1 /transl_table=11 /gene="mmpS2" /locus_tag="BQ2027_MB0518" /product="PROBABLE CONSERVED MEMBRANE PROTEIN MMPS2" /note="Mb0518, mmpS2, len: 147 aa. Equivalent to Rv0506, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Probable mmpS2, conserved membrane protein (see citation below), highly similar to other Mycobacterial proteins e.g. C-terminus of AAD44232.1|AF143772_38|AF143772|TmtpA from M. avium (221 aa); P54880|MMS4_MYCLE|MMPS4 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 392, E(): 1.3e-20, (43.7% identity in 151 aa overlap); and the PUTATIVE MEMBRANE PROTEINS from Mycobacterium tuberculosis MTV040_5, MTCY4D9_16, MTV037_15. BELONGS TO THE MMPS FAMILY. Mb0518 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248226.1" /translation="MRMISVSGAVKRMWLLLAIVVVAVVGGLGIYRLHSIFGVHEQPT VMVKPDFDVPLFNPKRVTYEVFGPAKTAKIAYLDPDARVHRLDSVSLPWSVTVETTLP AVSVNLMAQSNADVISCRIIVNGAVKDERSETSPRALTSCQVSSG" CDS 598350..601256 /codon_start=1 /transl_table=11 /gene="mmpL2" /locus_tag="BQ2027_MB0519" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL2" /note="Mb0519, mmpL2, len: 968 aa. Equivalent to Rv0507, len: 968 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 968 aa overlap). Probable mmpL2, conserved transmembrane transport protein (see citation below), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. YV34_MYCLE from Mycobacterium leprae (959 aa), FASTA scores: opt: 3699, E(): 0, (58.3% identity in 940 aa overlap); and the Mycobacterium tuberculosis proteins MTV037_14, MTV040_4, MTCY98_8, MTCY4D9_15, MTCY48_8, MTCY19G5_6, MTV005_19, etc. Also similar to STMACTII_3|SC10A5_9 from Streptomyces coelicolor; and BSUB0|004_12 from Bacillus subtilis. C-terminal half similar to Q50086|U1740AB from Mycobacterium leprae (386 aa), FASTA scores: opt: 1526, E(): 0, (61.5% identity in 371 aa overlap). BELONGS TO THE MMPL FAMILY. Mb0519 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248227.1" /translation="MSERHAALTSLPPILPRLIRRFAVVIVLLWLGFTAFVNLAVPQL EVVGKAHSVSMSPSDAASIQAIKRVGQVFGEFDSDNAVTIVLEGDQPLGGDAHRFYSD LMRKLSADTRHVAHIQHFWGDPLTAAGSQSADDRAAYVVVYLVGNNETEAYDSVHAVR HMVDTTPPPHGVKAYVTGPAALNADQAEAGDKSIAKVTAITSMVIAAMLLVIYRSVIT AVLVLIMVGIDLGAIRGFIALLADHNIFSLSTFATNLLVLMAIAASTDYAIFMLGRYH ESRYAGEDRETAFYTMFHGTAHVILGSGLTIAGAMYCLSFARLPYFETLGAPIAIGML VAVLAALTLGPAVLTVGSFFKLFDPKRRMNTRRWRRVGTAIVRWPGPVLAATCLVASI GLLALPSYRTTYDLRKFMPASMPSNVGDAAAGRHFSRARLNPEVLLIETDHDMRNPVD MLVLDKVAKNIYHSPGIEQVKAITRPLGTTIKHTSIPFIISMQGVNSSEQMEFMKDRI YDILVQVAAMNTSIETMHRMYALMGEVIDNTVDMDHLTHDMSDITATLRDHLADFEDF FRPIRSYFYWEKHCFDVPLCWSIRSIFDMFDSVDQLSEKLEYLVKDMDILITLLPQMR AQMPPMISAMTTMRDMMLIWHGTLGAFYKQQERNNKDPGAMGRVFDAAQIDDSFYLPQ SAFENPDFKRGLKMFLSPDGKAARFVIALEGDPATPEGICRVEPIKREAREAIKGTPL QGAAIYLGGTAATFKDIREGARYDLLIAGVAAISLILIIMMIITRSVVAAVVIVGTVV LSMGASFGLSVLVWQDILGIELYWMVLAMSVILLLAVGSDYNLLLISRLKEEIGAGLN TGIIRAMAGTGGVVTAAGMVFAVTMSLFVFSDLRIIGQIGTTIGLGLLFDTLVVRSFM TPSIAALLGRWFWWPLRVRPRPASQMLRPFAPRRLVRALLLPSGQHPSATGAHE" CDS 601249..601542 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0520" /product="Glutaredoxin-like domain-containing protein PA3033" /note="Mb0520, -, len: 97 aa. Equivalent to Rv0508, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Conserved hypothetical protein, showing similarity with T36269|5123666|CAB45355.1|AL079345 probable redoxin from Streptomyces coelicolor (101 aa), FASTA scores: opt: 160, E(): 3.4e-05, (33.3% identity in 75 aa overlap); and E81943|NMA0966 probable thioredoxin from Neisseria meningitidis group A strain Z2491 (77 aa). Protein product from Mb0520 detected using SWATH mass spectrometry. Mb0520 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248228.1" /translation="MSRPQVELLTRAGCAICVRVAEQLAELSSELGFDMMTIDVDVAA STGNPGLRAEFGDRLPVVLLDGREHSYWEVDEHRLRADIARSTFGSPPDKRLP" CDS 601592..602998 /codon_start=1 /transl_table=11 /gene="hemA" /locus_tag="BQ2027_MB0521" /product="probable glutamyl-trna reductase hema (glutr)" /note="Mb0521, hemA, len: 468 aa. Equivalent to Rv0509, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 468 aa overlap). Probable hemA, glutamyl-tRNA reductase (EC 1.2.1.-), equivalent to HEM1_MYCLE|P46724 GLUTAMYL-TRNA REDUCTASE from Mycobacterium leprae (467 aa), FASTA scores: opt: 2377, E(): 0, (82.3% identity in 463 aa overlap). Also highly similar (sometimes in part) to others e.g. Q9WX15|HEM1_STRCO GLUTAMYL-TRNA REDUCTASE from Streptomyces coelicolor (581 aa); P16618|HEM1_BACSU|HEMA GLUTAMYL-TRNA REDUCTASE from Bacillus subtilis (455 aa); etc. Contains PS00747 Glutamyl-tRNA reductase signature. BELONGS TO THE GLUTAMYL-TRNA REDUCTASE FAMILY. Protein product from Mb0521 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0521 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248229.1" /translation="MSVLLFGVSHRSAPVVVLEQLSIDESDQVKIIDRVLASPLVTEA MVLSTCNRVEVYAVVDAFHGGLSVIGQVLAEHSGMSMGELTKYAYVRYSEAAVEHLFA VASGLDSAVIGEQQVLGQVRRAYAVAESNRTVGRVLHELAQRALSVGKRVHSETAIDA AGASVVSVALGMAERKLGSLAGTTAVVIGAGAMGALSAVHLTRAGVGHIQVLNRSLSR AQRLARRIRESGVPAEALALDRLANVLADADVVVSCTGAVRPVVSLADVHHALAAARR DEATRPLVICDLGMPRDVDPAVARLPCVWVVDVDSVQHEPSAHAAAADVEAARHIVAA EVASYLVGQRMAEVTPTVTALRQRAAEVVEAELLRLDNRLPGLQSVQREEVARTVRRV VDKLLHAPTVRIKQLASAPGGDSYAEALRELFELDQTAVDAVATAGELPVVPSGFDAE SRRGGGDMQSSPKRSPSN" CDS 603008..603937 /codon_start=1 /transl_table=11 /gene="hemC" /locus_tag="BQ2027_MB0523" /product="PROBABLE PORPHOBILINOGEN DEAMINASE HEMC (PBG) (HYDROXYMETHYLBILANE SYNTHASE) (HMBS) (PRE-UROPORPHYRINOGEN SYNTHASE)" /note="Mb0523, hemC, len: 309 aa. Equivalent to Rv0510, len: 309 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 309 aa overlap). Probable hemC, hydroxymethylbilane synthase (porphobilinogen deaminase) (EC 4.3.1.8), equivalent to HEM3B|Q49808|HEM3_MYCLE PORPHOBILINOGEN DEAMINASE from Mycobacterium leprae (315 aa), FASTA scores: opt: 889, E(): 0, (88.1% identity in 159 aa overlap). Also highly similar to others e.g. Q9WX16|HE31_STRCO PROBABLE PORPHOBILINOGEN DEAMINASE from Streptomyces coelicolor (319 aa); Q9L6Q2|HEM3_SALTY PORPHOBILINOGEN DEAMINASE from Salmonella typhimurium (313 aa); etc. BELONGS TO THE HMBS FAMILY. COFACTOR: COVALENTLY BINDS A DIPYRROMETHANE COFACTOR TO WHICH THE PORPHOBILINOGEN SUBUNITS ARE ADDED. Protein product from Mb0523 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0523 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248230.1" /translation="MIRIGTRGSLLATTQAATVRDALIAGGHSAELVTISTEGDRSMA PIASLGVGVFTTALREAMEAGLVDAAVHSYKDLPTAADPRFTVAAIPPRNDPRDAVVA RDGLTLGELPVGSLVGTSSPRRAAQLRALGLGLEIRPLRGNLDTRLNKVSSGDLDAIV VARAGLARLGRLDDVTETLEPVQMLPAPAQGALAVECRAGDSRLVAVLAELDDADTRA AVTAERALLADLEAGCSAPVGAIAEVVESIDEDGRVFEELSLRGCVAALDGSDVIRAS GIGSCGRARELGLSVAAELFELGARELMWGVRH" CDS 603970..605667 /codon_start=1 /transl_table=11 /gene="hemD" /locus_tag="BQ2027_MB0524" /product="PROBABLE UROPORPHYRIN-III C-METHYLTRANSFERASE HEMD (UROPORPHYRINOGEN III METHYLASE) (UROGEN III METHYLASE) (SUMT) (UROGEN III METHYLASE) (UROM)" /note="Mb0524, hemD, len: 565 aa. Equivalent to Rv0511, len: 565 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 565 aa overlap). Probable hemD (alternate gene name: cysG), uroporphyrin-III C-methyltransferase (EC 2.1.1.107), highly similar to others e.g. CAC31936.1|AL583925 possible uroporphyrin-III C-methyltransferase from Mycobacterium leprae (563 aa); and S72909|CYSG from Mycobacterium leprae (472 aa), FASTA scores: opt: 1946, E(): 0, (83.3% identity in 472 aa overlap); T36265|5123662|CAB45351.1|AL079345 probable uroporphyrin-III C-methyltransferase from Streptomyces coelicolor (565 aa); and similar to others e.g. AAK00606.1|AF221100_3|AF221100 from Selenomonas ruminantium subsp. ruminantium (505 aa); etc. Also similar to Rv2071c and Rv2847c from Mycobacterium tuberculosis. Note that previously known as cysG. Protein product from Mb0524 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0524 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248231.1" /translation="MTRGRKPRPGRIVFVGSGPGDPGLLTTRAAAVLANAALVFTDPD VPEPVVALIGTDLPPVSGPAPAEPVAGNGDAAGGGSAQEHGRAASAVVSGGPDIRPAL GDPADVAKTLTAEARSGVDVVRLVAGDPLTVDAVISEVNAVARTHLHIEIVPGLAASS AVPTYAGLPLGSSHTVADVRIDPENTDWDALAAAPGPLILQATASHLAESARSLIDHQ LAESTPCVVTAHGTTCQQRSVETTLQGLTDPAVLGATDPACSANGRDSQAGPLIVTIG KTVTSRAKLNWWESRALYGWTVLVPRTKDQAGEMSERLTSYGALPVEVPTIAVEPPRS PAQMERAVKGLVDGRFQWIVFTSTNAVRAVWEKFGEFGLDARAFSGVKIACVGESTAD RVRAFGISPELVPSGEQSSLGLLDDFPPYDSVFDPVNRVLLPRADIATETLAEGLRER GWEIEDVTAYRTVRAAPPPATTREMIKTGGFDAVCFTSSSTVRNLVGIAGKPHARTII ACIGPKTAETAAEFGLRVDVQPDTAAIGPLVDALAEHAARLRAEGALPPPRKKSRRR" CDS 605753..606742 /codon_start=1 /transl_table=11 /gene="hemB" /locus_tag="BQ2027_MB0525" /product="PROBABLE DELTA-AMINOLEVULINIC ACID DEHYDRATASE HEMB (PORPHOBILINOGEN SYNTHASE) (ALAD) (ALADH)" /note="Mb0525, hemB, len: 329 aa. Equivalent to Rv0512, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 329 aa overlap). Probable hemB, delta-aminolevulinic acid dehydratase (EC 4.2.1.24), equivalent to 46723|HEM2_MYCLE DELTA-AMINOLEVULINIC ACID DEHYDRATASE from Mycobacterium leprae (329 aa). Also highly similar to many e.g. P54919|HEM2_STRCO from Streptomyces coelicolor (330 aa); HEM2_ECOLI|P15002 from Escherichia coli (323 aa), FASTA scores: opt: 942, E(): 0, (47.6% identity in 317 aa overlap); etc. Contains PS00169 Delta-aminolevulinic acid dehydratase active site. BELONGS TO THE ALADH FAMILY. COFACTOR: ZINC. Protein product from Mb0525 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0525 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248232.1" /translation="MSMSSYPRQRPRRLRSTVAMRRLVAQTSLEPRHLVLPMFVADGI DEPRPITSMPGVVQHTRDSLRRAAAAAVAAGVGGLMLFGVPRDQDKDGVGSAGIDPDG ILNVALRDLAKDLGEATVLMADTCLDEFTDHGHCGVLDDRGRVDNDATVARYVELAVA QAESGAHVVGPSGMMDGQVAAIRDGLDAAGYIDVVILAYAAKFASAFYGPFREAVSSS LSGDRRTYQQEPGNAAEALREIELDLDGGADIVMVKPAMGYLDVVAAAADVSPVPVAA YQVSGEYAMIRAAAANNWIDERAAVLESLTGIRRAGADIVLTYWAVDAAGWLT" CDS 606755..607303 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0526" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0526, -, len: 182 aa. Equivalent to Rv0513, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Possible conserved transmembrane protein, with its N-terminus highly similar to S72925|B2168_C1_182 hypothetical protein from Mycobacterium leprae (103 aa), FASTA scores: opt: 217, E(): 8.2e-14, (45.3 % identity in 106 aa overlap). Protein product from Mb0526 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0526 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248233.1" /translation="MTPTGDTKPKLLFYEPGASWYWVLTGPLAAVSVLLLEISSGAGV GLITPAIFLVMVSAFVALQVKAARIHTSVELTHDALRQGTETIRLAEIVKIYPEADGR ETSGEEPAKWQSARTLGELVGVPRGRVGIGLKLTGGRTAQAWARRHQQLRAALTPLVQ ERLGPVDSDVADVNGDDAGPAR" CDS 607300..607599 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0527" /product="POSSIBLE TRANSMEMBRANE PROTEIN" /note="Mb0527, -, len: 99 aa. Equivalent to Rv0514, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Possible transmembrane protein. Mb0527 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248234.1" /translation="MIARYRAGAELFLACAALAGSAASWSRTRSTVAVAPVIDGQPVT LSVVYHPQPLVLTLLLATIAGVLSVVGTARLRRARAGLNAHPDGLNQRPPGGWCH" CDS 607702..609213 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0528" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb0528, -, len: 503 aa. Equivalent to Rv0515, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 503 aa overlap). Part of M. tuberculosis 13E12 repeat family. Almost identical to Rv0336 (99.8% identity in 503 aa overlap), possibly due to a recent gene duplication. Also similar to other M. tuberculosis hypothetical 13E12 repeat proteins e.g. Rv1148c, Rv1945, etc. Mb0528 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248235.1" /translation="MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAA AQLVALGELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAMRE RLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVARWPSMTKARLA GQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIGGSLLAVDAHALDARLSAL AGTVCEHDPRSREQRRADALGALAGGADRLGCGCGRADCAAGKRPAAPPVVIHLIAEA ATINGTGSAPASQMNADGLITAELVAELAKTATLVPLVHPGDAPPEPGYAPSKALADF VRCRDLTCRWPGCDEPATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQ QLPDGTLILTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDHNDDPPPF" CDS complement(609210..609686) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0529C" /product="possible anti-anti-sigma factor" /note="Mb0529c, -, len: 158 aa. Equivalent to Rv0516c, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Conserved hypothetical protein, showing some similarity to Rv1365c|MTCY02B10_29 from Mycobacterium tuberculosis (128 aa), FASTA scores: E(): 0.0012, (27.4% identity in 124 aa overlap). Protein product from Mb0529c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0529c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248236.1" /translation="MTTTIPTSKSACSVTTRPGNAAVDYGGAQIRAYLHHLATVVTIR GEIDAANVEQISEHVRRFSLGTNPMVLDLSELSHFSGAGISLLCILDEDCRAAGVQWA LVASPAVVEQLGGRCDQGEHESMFPMARSVHKALHDLADAIDRRRQLVLPLISRSA" CDS 609897..611207 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0530" /product="POSSIBLE MEMBRANE ACYLTRANSFERASE" /note="Mb0530, -, len: 436 aa. Equivalent to Rv0517, len: 436 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 436 aa overlap). Possible acyltransferase (EC 2.3.1.-), integral membrane protein, equivalent (but longer 26 aa in N-terminus) to AAK44761.1|AE006954 putative acyltransferase from Mycobacterium tuberculosis strain CDC1551 (410 aa). Also similar to many acyltransferases e.g. MDMB_STRMY|Q00718 from Streptomyces mycarofaciens (387 aa), FASTA scores: opt: 200, E(): 1.1e-08, (28.2% identity in 394 aa overlap). And similar to Rv0111, Rv0228, Rv1254, Rv1565c from Mycobacterium tuberculosis. Mb0530 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248237.1" /translation="MAGGMDQPPGQPRRRTRQQSSDGKNGVRAAEITGEIRALTGLRI VAAVWVVLFHFRPMLGDASPGFRDALAPVLDCGAQGVDLFFILSGFVLTWNYLDRMGR SWSVRANLHFLWLRLARVWPVYLVTLHLAAVWVIFTLHVGHVPSPEAGQLTAISYVRQ ILLVQLWFQPYFDGSSWDGPAWSISAEWLAYLLFGLLILVIFRMKHATRARGLMWLAF AASLPPVVLLLASGQFYTPWSWLPRIVTQFAAGALACAAVRRLRPTDRARRIAGYLSV LVGVAIVGILYLLHAHPLAGVEDSGGVVDVLFVPLVISLAIGVGSLPALLSTRLMVFG GQISFCLYMVHELVHTAWGWAVQQYELALQDQPWKWNVVGLLAIALGAAILLYHFVEE PDRRWMRRMVDVKAASARSEPGEPVGSTRYQIDDALEGVSARAV" CDS 611339..612034 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0531" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb0531, -, len: 231 aa. Equivalent to Rv0518, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Possible exported protein; has hydrophobic N-terminus. Mb0531 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248238.1" /translation="MSRPGTYVIGLTLLVGLVVGNPGCPRSYRPLTLDYRLNPVAVIG DSYTTGTDEGGLGSKSWTARTWQMLAARGVRIAADVAAEGRAGYGVPGDHGNVFEDLT ARAVQPDDALVVFFGSRNDQGMDPEDPEMLAEKVRDTFDLARHRAPSASLLVIAPPWP TADVPGPMLRIRDVLGAQARAAGAVFVDPIADHWFVDRPELIGADGVHPNDAGHEYLA DKIAPLISMELVG" CDS complement(612322..613224) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0532C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0532c, -, len: 300 aa. Equivalent to Rv0519c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Possible conserved membrane protein, with hydrophobic region near N-terminus. Could be a lipase (EC 3.1.-.-). Similar to Rv0774c|MTCY369.19c|A70708 from Mycobacterium tuberculosis (312 aa), FASTA scores: opt: 1092, E(): 0, (57.9% identity in 299 aa overlap). Contains PS00120 Lipases, serine active site. Mb0532c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248239.1" /translation="MLRRGCAGNTDRRGIMTPMADLTRRALLRWGAGAGAGAAGVWAF GALVDPLEPQAAPAPFEPPTAGSSLPTRISGSFISAARGGIKTNWVISMPPGQSGQLR PVIALHGKDGNAGMMLDLGVEQGLARLVKEGKPAFAVVGVDGGNTYWHRRSSGGDSGA MVLDELLPMLTSMGMDTSRVGFLGWSMGGYGALLLGARLGPARTAGICAISPALFTSF TGSTPGAFDSYDDYVQHSVLGLPALNSIPLRVDCGTSDRFYFATRQFVNQLHQPPAGS FSPGGHDASYWREQLPGELAWMAS" CDS 613405..613755 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0533" /product="POSSIBLE METHYLTRANSFERASE/METHYLASE (FRAGMENT)" /note="Mb0533, -, len: 116 aa. Equivalent to Rv0520, len: 116 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 116 aa overlap). Possible fragment of methyltransferase (possibly first part) (EC 2.1.1.-), highly similar to part of several methyltransferases e.g. Q43445|U43683 S-ADENOSYL-L-METHIONINE:DELTA24-STEROL-C- METHYLTRANSFERASE from Glycine max (Soybean)(367 aa), FASTA scores: opt: 190, E(): 2.3e-12, (39.2% identity in 74 aa overlap). Also some similarity to MTCY19G5_5 from Mycobacterium tuberculosis. Possibly continues as Rv0521 but we can find no frameshift to account for this. Mb0533 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248240.1" /translation="MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPE PDSGYDVVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPVYD RLNLRDLGSMRFYA" CDS 613748..614053 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0534" /product="POSSIBLE METHYLTRANSFERASE/METHYLASE (FRAGMENT)" /note="Mb0534, -, len: 101 aa. Equivalent to Rv0521, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Possible fragment of methyltransferase (possibly second part) (EC 2.1.1.-), highly similar to C-terminus of several methyltransferases e.g. AAF87203.1|AF216282 sarcosine-dimethylglycine methyltransferase from Halorhodospira halochloris (279 aa). Possibly continuation of Rv0520 but we can find no frameshift to account for this." /protein_id="CAB5248241.1" /translation="MREAAQALGFEVLDQRDLVRNLRTHYSRVFEELEARRLELEGKS SQEYLDKMRVGLKNWVEAADNGHSRVGHPTFPRTRLTPICQLPTAAIDSTAGRRRYR" CDS 614188..615492 /codon_start=1 /transl_table=11 /gene="gabP" /locus_tag="BQ2027_MB0535" /product="PROBABLE GABA PERMEASE GABP (4-AMINO BUTYRATE TRANSPORT CARRIER) (GAMA-AMINOBUTYRATE PERMEASE)" /note="Mb0535, gabP, len: 434 aa. Equivalent to Rv0522, len: 434 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 434 aa overlap). Probable gabP, GABA permease (gamma-aminobutyrate permease), integral membrane protein, highly similar to others e.g. GABP_ECOLI|P25527 gaba permease from Escherichia coli (466 aa), FASTA scores: opt: 1218, E(): 0, (44.3% identity in 424 aa overlap); etc. Also similar to other Mycobacterium tuberculosis permeases e.g. MTCY13E10.06c FASTA score: (34.4% identity in 407 aa overlap). Contains PS00218 Amino acid permeases signature. Overlaps and extends Rv0523c|MTCY25D10.01 from overlapping cosmid. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY). Mb0535 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248242.1" /translation="MIAIGGVIGAGLFVGSGVVIRATGPAAFLTYALCGALIVLVMRM LGEMAAANPSTGAFADYAAKALGGWAGFSVGWLYWYFWVIVVGFEAVAGGKVLTYWID APLWLASLCLMMMMTATNLVSVSSFGEFEFWFAGVKVATIVGFLVLGTAFAFGLLPGH GMDFSNLSAHGGFFPDGVGAVFAAIVVAIFSMTGTEVVTIAAAEAPDPQRAVQRAMST VVARIVIFFVGSVFLLTVILPWNSLELGASPYVAALRHMGIGGADQIMNAVVLTAVLS CLNSGLYTASRMLFVLAARQEAPAQLVKVNRRGVPTFAIMGSSVVGFLCVIMAWVSPA TVFVFLLNSSGAVILFVYLLIALSQIVLRRQTSGQNLGVRMWLFPGLSIVTVTGIVAV LARMAFDYAARSQLWLSLLSWAVVVGCYLVTTLVRRPLNRPW" CDS complement(615476..615871) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0536C" /product="conserved protein" /note="Mb0536c, -, len: 131 aa. Equivalent to Rv0523c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, showing some similarity to M. tuberculosis proteins Rv1598c|MTCY336.06; and Rv1871c|MTCY336_06|O06592 (136 aa), FASTA scores: opt: 197, E(): 5e-08, (38.4% identity in 99 aa overlap). Protein product from Mb0536c detected using shotgun mass spectrometry. Mb0536c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248243.1" /translation="MQLPQWLARFNRYVTNPIQRLWAGWLPAFAILEHVGRRSGKPYR TPLNVFSADVDGRAGVAILLTYGPNRDWLKNITAAGGGRMRRYGKTFGVANPRRLTKA EAAPYVSSRWRPVFARLPFDEAVLLTKAD" CDS 615985..617373 /codon_start=1 /transl_table=11 /gene="hemL" /locus_tag="BQ2027_MB0537" /product="PROBABLE GLUTAMATE-1-SEMIALDEHYDE 2,1-AMINOMUTASE HEML (GSA) (GLUTAMATE-1-SEMIALDEHYDE AMINOTRANSFERASE) (GSA-AT)" /note="Mb0537, hemL, len: 462 aa. Equivalent to Rv0524, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 462 aa overlap). Probable hemL, glutamate-1-semialdehyde 2,1-aminomutase (EC 5.4.3.8), equivalent to P46716|GSA_MYCLE GLUTAMATE-1-SEMIALDEHYDE 2,1-AMINOMUTASE from Mycobacterium leprae (446 aa), FASTA scores: opt: 1532, E(): 0, (82.6% identity in 460 aa overlap). Also highly similar to others e.g. Q9F2S0|GSA_STRCO from Streptomyces coelicolor (438 aa); Q06774|GSA_PROFR from Propionibacterium freudenreichii (441 aa); etc. Contains PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb0537 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0537 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248244.1" /translation="MGSTEQATSRVRGAARTSAQLFEAACSVIPGGVNSPVRAFTAVG GTPRFITEAHGCWLIDADGNRYVDLVCSWGPMILGHAHPAVVEAVAKAAARGLSFGAP TPAETQLAGEIIGRVAPVERIRLVNSGTEATMSAVRLARGFTGRAKIVKFSGCYHGHV DALLADAGSGVATLGLCDDPQRPASPRSQSSRGLPSSPGVTGAAAADTIVLPYNDIDA VQQTFARFGEQIAAVITEASPGNMGVVPPGPGFNAALRAITAEHGALLILDEVMTGFR VSRSGWYGIDPVPADLFAFGKVMSGGMPAAAFGGRAEVMQRLAPLGPVYQAGTLSGNP VAVAAGLATLRAADDAVYTALDANADRLAGLLSEALTDAVVPHQISRAGNMLSVFFGE TPVTDFASARASQTWRYPAFFHAMLDAGVYPPCSAFEAWFVSAALDDAAFGRIANALP AAARAAAQERPA" CDS 617373..617981 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0538" /product="Hypothetical, related to broad specificity phosphatases COG0406" /note="Mb0538, -, len: 202 aa. Equivalent to Rv0525, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Conserved hypothetical protein, equivalent to Q49821|B2168_C3_276|S72912 hypothetical protein from Mycobacterium leprae (202 aa), FASTA scores: opt: 1151, E(): 0, (82.5% identity in 200 aa overlap). Also highly similar to CAC08377.1|AL392176 putative phosphoglycerate mutase from Streptomyces coelicolor (233 aa); and similar to SLL0395|Q55734 hypothetical 23.8 kDa protein from SYNECHOCYSTIS SP. (212 aa), FASTA scores: opt: 207, E(): 5.1e-07, (28.2% identity in 195 aa overlap). Also some similarity to Rv2228c|Y019_MYCTU|Q10512|cy427.09 hypothetical 39.2 kd protein from Mycobacterium tuberculosis (364 aa), FASTA scores: opt: 236, E(): 1.1e-08, (34.3% identity in 198 aa overlap). Protein product from Mb0538 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0538 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248245.1" /translation="MPEETQVHVVRHGEVHNPTGILYGRLPGFHLSATGAAQAAAVAD ALADRDIVAVIASPLQRAQETAAPIAARHDLAVETDPDLIESANFFEGRRVGPGDGAW RDPRVWWQLRNPFTPSWGEPYVDIAARMTTAVDKARVRGAGHEVVCVSHQLPVWTLRL YLTGKRLWHDPRRRDCALASVTSLIYDGDRLVDVVYSQPAAL" CDS 617996..618646 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0539" /product="POSSIBLE THIOREDOXIN PROTEIN (THIOL-DISULFIDE INTERCHANGE PROTEIN)" /note="Mb0539, -, len: 216 aa. Equivalent to Rv0526, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). Possible thioredoxin protein (thiol-disulfide interchange protein) (EC 1.-.-.-), equivalent to Q49816|U2168C|S72901 hypothetical protein from Mycobacterium leprae (216 aa), FASTA scores: opt: 1144, E(): 0, (78.5% identity in 214 aa overlap). C-terminus shows some similarity to C-terminus of thioredoxins e.g. RESA_BACSU|P35160 resa protein from Bacillus subtilis (181 aa), FASTA scores: opt: 200, E(): 7.4e-06, (24.2% identity in 132 aa overlap); etc. Also similar to Mycobacterium tuberculosis thioredoxin-like proteins Rv1470, Rv1471, Rv1677, etc. Contains PS00194 Thioredoxin family active site. SEEMS TO BELONG TO THE THIOREDOXIN FAMILY. Protein product from Mb0539 detected using SWATH mass spectrometry. Mb0539 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248246.1" /translation="MQSRATRRSGALTMRRLVIAAAVSALLLTGCSGRDAVAQGGTFE FVSPGGKTDIFYDPPASRGRPGPLSGPELADPARSVSLDDFPGQVVVVNVWGQWCGPC RAEVSQLQRVYDATRGAGVSFLGIDVRDNNRQAPQDFINDRHVTYPSIYDPAMRTLIA FGGKYPTSVIPSTLVLDRQHRVAAVFLRELLAADLQPVVERVAEEEPSGRAPVGAQ" CDS 618643..619422 /codon_start=1 /transl_table=11 /gene="ccdA" /locus_tag="BQ2027_MB0540" /product="POSSIBLE CYTOCHROME C-TYPE BIOGENESIS PROTEIN CCDA" /note="Mb0540, ccdA, len: 259 aa. Equivalent to Rv0527, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Possible ccdA, cytochrome C-type biogenesis protein, integral membrane protein, equivalent to Q49810|B2168_C1_192|S72890 hypothetical protein from Mycobacterium leprae (262 aa), FASTA scores: opt: 1341, E(): 0, (79.0% identity in 262 aa overlap). Also highly similar to others e.g. CAC08380.1 (253 aa); CCDA_BACSU|P45706 cytochrome C-type biogenesis protein from Bacillus subtilis (235 aa), FASTA scores: opt: 307, E(): 7.4e-13, (30.4% identity in 237 aa overlap); etc. SEEMS TO BELONG TO THE DSBD SUBFAMILY. Note that previously known as ccsA. Mb0540 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248247.1" /translation="MTGFTEIAAVGPLLVAVGVCLLAGLVSFASPCVVPLVPGYLSYL AAVVGVDEQLPAGVVKPPVAARWRVAGSAALFVAGFTTVFVLGTVAVLGMTTTLITNQ LLLQRVGGVLIVVMGLVFVGFIGALQRQARFTPRQLTSVAGAPVLGAVFALGWTPCLG PTLTGVITVASATEGASVARGIVLVIAYCLGLGIPFVLLAFGSAWAVAGLGWLRRHTR AIQIFGGALLIAVGAALVTGVWNDVVSWLRDAFVSDVRLPI" CDS 619455..621044 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0541" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0541, -, len: 529 aa. Equivalent to Rv0528, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 529 aa overlap). Probable conserved transmembrane protein, equivalent (shorter 14 aa in N-terminus) to CAC31926.1|AL583925 conserved membrane protein from Mycobacterium leprae (542 aa). Also highly similar to Q49817|B2168_C2_237|S72902 hypothetical protein from Mycobacterium leprae (364 aa), FASTA scores: opt: 1846, E(): 0, (81.1% identity in 338 aa overlap); and Q49811|B2168_C1_194|S72891 hypothetical protein from Mycobacterium leprae (106 aa), FASTA scores: opt: 506, E(): 3.8e-26, (73.6% identity in 106 aa overlap). Also highly similar to CAC08381.1|AL392176 putative integral membrane protein from Streptomyces coelicolor (574 aa). Protein product from Mb0541 detected using SWATH mass spectrometry. Mb0541 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248248.1" /translation="MWRSLTSMGTALVLLFLLALAAIPGALLPQRGLNAAKVDDYLAA HPLIGPWLDELQAFDVFSSFWFTAIYVLLFVSLVGCLAPRTIEHARSLRATPVAAPRN LARLPKHAHARLAGEPAALAATITGRLRGWRSITRQQGDSVEVSAEKGYLREFGNLVF HFALLGLLVAVAVGKLFGYEGNVIVIADGGPGFCSASPAAFDSFRAGNTVDGTSLHPI CVRVNNFQAHYLPSGQATSFAADIDYQADPATADLIANSWRPYRLQVNHPLRVGGDRV YLQGHGYAPTFTVTFPDGQTRTSTVQWRPDNPQTLLSAGVVRIDPPAGSYPNPDERRK HQIAIQGLLAPTEQLDGTLLSSRFPALNAPAVAIDIYRGDTGLDSGRPQSLFTLDHRL IEQGRLVKEKRVNLRAGQQVRIDQGPAAGTVVRFDGAVPFVNLQVSHDPGQSWVLVFA ITMMAGLLVSLLVRRRRVWARITPTTAGTVNVELGGLTRTDNSGWGAEFERLTGRLLA GFEARSPDMAEAAAGTGRDVD" CDS 621041..622015 /codon_start=1 /transl_table=11 /gene="ccsA" /locus_tag="BQ2027_MB0542" /product="POSSIBLE CYTOCHROME C-TYPE BIOGENESIS PROTEIN CCSA" /note="Mb0542, ccsA, len: 324 aa. Equivalent to Rv0529, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 324 aa overlap). Possible ccsA, cytochrome C-type biogenesis protein, integral membrane protein, equivalent to NP_302558.1|NC_002677|B2168_C3_281 possible cytochrome C biogenesis protein from Mycobacterium leprae (327 aa), FASTA scores: opt: 1779, E(): 0, (82.9% identity in 327 aa overlap). Also highly similar to others e.g. CAC08382.1|AL392176 putative cytochrome biogenesis related protein from Streptomyces coelicolor (380 aa); CCSA_CHLRE|P48269 probable cytochrome c biogenesis protein from Chlamydomonas reinhardtii (353 aa), FASTA scores: opt: 449, E(): 1.3e-23, (34.4% identity in 247 aa overlap); etc. BELONGS TO THE CCMF/CYCK/CCL1/NRFE/CCSA FAMILY. Note that previously known as ccsB. Protein product from Mb0542 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0542 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248249.1" /translation="MNTLHVNVGLARYSDWAFTSAVVALVVALLLLAFEFAQVRGRGL APLAVPAGSVATDSATPGIVADQRHRPFDERVGRGGLAVAYLGIGLLLACVVLRGLAT QRVPWGNMYEFINLTCLSGLIAGAVVLRRARYRPLWVFLLVPVLILLTVSGRWLYANA APVMPALQSYWLPIHVSVVSLGSGVFLVAGVASILFLVRTSRLGEPTGEGALAGMVRR LPDAQTLDGIAYRTTIFAFPVFGFGVIFGAIWAEEAWGRYWGWDPKETVSFVAWVVYA AYLHARSTAGWRDRKAAWINVAGFVAMVFNLFFVNLVTVGLHSYAGVG" CDS 622057..623274 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0543" /product="ATPases involved in chromosome partitioning" /note="Mb0543, -, len: 405 aa. Equivalent to Rv0530, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 405 aa overlap). Conserved hypothetical protein, similar in part to other hypothetical proteins e.g. AL031231|SC3C3_3|CAA20252.1 from Streptomyces coelicolor (1083 aa), FASTA scores: opt: 870, E(): 0, (39.5% identity in 443 aa overlap); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3868, Rv0282, Rv1798, etc. Protein product from Mb0543 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0543 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248250.1" /translation="MLVTEHPRTGVGAPDSGNGGTDHPTVQLPPVPSVGAPPAAAGGE TPTRSVAGFRTQRLDPTAYGAYYSGPDEGPASPAERPPYRLEPVPHTPYPELATTTLL RPVKPPPSEGWRRLLYLLSGRLINAGEGPRAAHLNDLVAQVNRPLRGCYRIAVLSLKG GVGKTTITATLGATFADLRGDRVVAVDANPDRGTLSQKVPLETPATVRHLLRDADGIE RYSDVRGYTSKGPSGLEVLASDSDPASSDAFSADDYTRTLDILERFYGLVLTDCGTGL LHSAMSAVLPRSDVLVVVSSGSIDGARSAAATLDWLQAHGHDDQVRNSIAVVNAVRPR AGKVDVGKVVEHFSRRCRAVRVVPFDPHLEEGAEIALDRLRRETREALTELAAVVAAG FPGDPRRCKPSFT" CDS complement(623271..623432) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0543A" /product="Conserved protein" /note="Mb0543A, len: 53 aa. Equivalent to Rv0530A len: 53 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 53 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Mb0543A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248251.1" /translation="MLYLLLVLILATLIYLGWRAARAQMNRPKTRVIGPDDDPEFLRR LGHGDNNRS" CDS 623479..623796 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0544" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0544, -, len: 105 aa. Equivalent to Rv0531, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Possible conserved membrane protein, highly similar to Y13803|MLB1306_1|CAA74131.1 hypothetical protein from Mycobacterium leprae (86 aa), FASTA scores: E(): 2.1e-24, (74.4% identity in 86 aa overlap); and NP_302557.1|NC_002677 putative membrane protein from Mycobacterium leprae (111 aa). Protein product from Mb0544 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0544 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248252.1" /translation="MSEAPNDKTTRGVVDILVYATARLLLVVAVSAAIFGVARLIGLT EFPVVVATLFGLIIAMPLGIWVFSPLRRRATAALAVAGERRRAERERLRARLRGESLP EEQ" CDS 623943..625688 /codon_start=1 /transl_table=11 /gene="PE_PGRS6a" /locus_tag="BQ2027_MB0545" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb0545, PE_PGRS6a, len: 581 aa. Equivalent to 5' end of Rv0532, len: 594 aa, from Mycobacterium tuberculosis strain H37Rv, (92.7% identity in 564 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others e.g. Y0DP_MYCTU|Q50615 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1703, E(): 0, (58.2% identity in 536 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS6 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits PE_PGRS6 into 2 parts, PE_PGRS6a and PE_PGRS6b, resulting in PE_PGRS6a having a different COOH part. There is also a 84 bp and a 9 bp (*-cggggccgg) insertion in PE_PGRS6a." /protein_id="CAB5248253.1" /translation="MSNLLVTPELVAAAAADLAGIGSAIGAANAAAGAPTMALLAAGA DEVSAAVAAVFSSYAQQYQALSAAAAAFHDQFVRALAAGAGAYAGAEAANVEQQLLNA INAPTLALLGRPLIGNGADGAAGTGQAGGAGGLLYGNGGNGGSGAAGQAGGAGGAAGL IGHGGTGGVGGTGAAGGAGGTGGWLFGNGGAGGTGGAVTGVSTTGGPGGHGGDAGLYG FGGAGGAGGFGQSGAAGGAGGAGGAGGWLYGDGGDGGAGGNGGNESGTGVSGVGGVGG AGGAGGLLFGNGGDGGVGGDGGDGSSTQDSGGDGGAGGAGGAGGWLLGNGGAGGAGGA ASIKVATGGLGGDGGDAGLFGFGGDGGWGGRGVDARFGAAGGAAGAGGAGGWLYGDGG AGGVGGVGGAVFSLSSGDGGAGGAGGGGGWLFGNGGDGGAGGGGGGRFGSGSGAGGDG AVGGAGGAGAWFGNGGAGGVGGGGGRGTTAIGGDGGAGGAGGAGGWLYGDGGAGGAGG GGGRGGTGNDGGDGGDGGRGGDAQLLGNGGDGGAGGAGGPAGFGASPGAGAAGGGGGA GGSLFGSPGTTGPHG" CDS 625594..625821 /codon_start=1 /transl_table=11 /gene="PE_PGRS6b" /locus_tag="BQ2027_MB0546" /product="PE-PGRS FAMILY PROTEIN [SECOND PART]" /note="Mb0546, PE_PGRS6b, len: 75 aa. Equivalent to 3' end of Rv0532, len: 594 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 75 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others e.g. Y0DP_MYCTU|Q50615 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1703, E(): 0, (58.2% identity in 536 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS6 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits PE_PGRS6 into 2 parts, PE_PGRS6a and PE_PGRS6b." /protein_id="CAB5248254.1" /translation="MALPPGPARPAGAAVPAVRCSAAPARPARTADPWLAPIFARSTL RHSHHLGGIAQTGAVADQQGQIAGLGRAGRQ" CDS complement(625717..626724) /codon_start=1 /transl_table=11 /gene="fabH" /locus_tag="BQ2027_MB0547C" /standard_name="mtFabH" /product="3-OXOACYL-[ACYL-CARRIER-PROTEIN] SYNTHASE III FABH (BETA-KETOACYL-ACP SYNTHASE III) (KAS III)" /note="Mb0547c, fabH, len: 335 aa. Equivalent to Rv0533c, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 335 aa overlap). fabH (alternate gene name: mtFabH), 3-oxoacyl-[acyl-carrier protein] synthase III (EC 2.3.1.41) (see citations below), highly similar to others e.g. Q54206|FABH from STREPTOMYCES GLAUCESCENS (333 aa), FASTA scores: opt: 1109, E(): 0, (51.4% identity in 333 aa overlap); FABH_ECOLI|P24249 3-oxoacyl-[acyl-carrier-protein] synthase III (317 aa), FASTA scores: opt: 666, E(): 0, (37.1% identity in 318 aa overlap); etc. BELONGS TO THE FABH FAMILY. Note that previously known as fabH. Protein product from Mb0547c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0547c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248255.1" /translation="MTEIATTSGARSVGLLSVGAYRPERVVTNDEICQHIDSSDEWIY TRTGIKTRRFAADDESAASMATEACRRALSNAGLSAADIDGVIVTTNTHFLQTPPAAP MVAASLGAKGILGFDLSAGCAGFGYALGAAADMIRGGGAATMLVVGTEKLSPTIDMYD RGNCFIFADGAAAVVVGETPFQGIGPTVAGSDGEQADAIRQDIDWITFAQNPSGPRPF VRLEGPAVFRWAAFKMGDVGRRAMDAAGVRPDQIDVFVPHQANSRINELLVKNLQLRP DAVVANDIEHTGNTSAASIPLAMAELLTTGAAKPGDLALLIGYGAGLSYAAQVVRMPK G" CDS complement(626806..627684) /codon_start=1 /transl_table=11 /gene="menA" /locus_tag="BQ2027_MB0548C" /product="1,4-DIHYDROXY-2-NAPHTHOATE OCTAPRENYLTRANSFERASE MENA (DHNA-OCTAPRENYLTRANSFERASE)" /note="Mb0548c, menA, len: 292 aa. Equivalent to Rv0534c, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 292 aa overlap). Probable menA, 1,4-dihydroxy-2-naphthoate octaprenyltransferase (EC 2.5.1.-), integral membrane protein, equivalent to Y13803|MLB1306_2|NP_302556.1 probable 4-dihydroxy-2-naphthoate octaprenyltransferase from Mycobacterium leprae (294 aa), FASTA scores: opt: 1509, E(): 0, (80.2% identity in 288 aa overlap). Also highly similar to others e.g. MENA_ECOLI|P32166|B3930 from Escherichia coli (308 aa), FASTA scores: opt: 495, E(): 2.9e-25, (36.3 identity in 289 aa overlap); etc. BELONGS TO THE MENA FAMILY. Mb0548c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248256.1" /translation="MASFAQWVSGARPRTLPNAIAPVVAGTGAAAWLHAAVWWKALLA LAVAVALVIGVNYANDYSDGIRGTDDDRVGPVRLVGSRLATPRSVLTAAMTSLALGAL AGLVLALLSAPWLIAVGAICIAGAWLYTGGSKPYGYAGFGELAVFVFFGPVAVLGTQY TQALRVDWVGLAQAVATGALSCSVLVANNLRDIPTDARADKITLAVRLGDARTRMLYQ GLLAVAGVLTFVLMLATPWCVVGLVAAPLALRAAGPVRSGRGGRELIPVLRDTGLAML VWALAVAGALAFGQLS" CDS 627701..628495 /codon_start=1 /transl_table=11 /gene="pnp" /locus_tag="BQ2027_MB0549" /product="PROBABLE 5'-METHYLTHIOADENOSINE PHOSPHORYLASE PNP (MTA PHOSPHORYLASE)" /note="Mb0549, pnp, len: 264 aa. Equivalent to Rv0535, len: 264 aa, from Mycobacterium tuberculosis strain H37R, (100.0% identity in 264 aa overlap). Probable pnp, 5'-methylthioadenosine phosphorylase (EC 2.4.2.28), highly similar to others e.g. CAB90972.1|AL355832 putative methylthioadenosine phosphorylase from Streptomyces coelicolor (280 aa); etc. Also similar to Rv3307|deoD PROBABLE PURINE NUCLEOSIDE PHOSPHORYLASE (EC 2.4.2.1) from Mycobacterium tuberculosis (268 aa). BELONGS TO THE PNP/MTAP FAMILY 2 OF PHOSPHORYLASES. Gene name could be inappropriate. Protein product from Mb0549 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0549 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248257.1" /translation="MHNNGRMLGVIGGSGFYTFFGSDTRTVNSDTPYGQPSAPITIGT IGVHDVAFLPRHGAHHQYSAHAVPYRANMWALRALGVRRVFGPCAVGSLDPELEPGAV VVPDQLVDRTSGRADTYFDFGGVHAAFADPYCPTLRAAVTGLPGVVDGGTMVVIQGPR FSTRAESQWFAAAGCNLVNMTGYPEAVLARELELCYAAIALVTDVDAGVAAGDGVKAA DVFAAFGENIELLKRLVRAAIDRVADERTCTHCQHHAGVPLPFELP" CDS 628492..629532 /codon_start=1 /transl_table=11 /gene="galE3" /locus_tag="BQ2027_MB0550" /product="PROBABLE UDP-GLUCOSE 4-EPIMERASE GALE3 (GALACTOWALDENASE) (UDP-GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHATE GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHO-GALACTOSE 4-EPIMERASE)" /note="Mb0550, galE3, len: 346 aa. Equivalent to Rv0536, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Possible galE3, UDP-glucose 4-epimerase (EC 5.1.3.2), highly similar to CAB76986.1|AL159178 putative epimerase from Streptomyces coelicolor (334 aa); and similar to other epimerases e.g. NP_436775.1|NC_003078 putative NDP-glucose dehydrataseepimerase protein from Sinorhizobium meliloti (368 aa); AF143772|AF143772_7 GepiA from Mycobacterium avium strain 2151 (353 aa), FASTA scores: opt: 577, E(): 3.9e-29, (36.6% identity in 352 aa overlap); GALE_METJA|Q57664 putative UDP-glucose 4-epimerase (305 aa), FASTA scores: opt: 300, E(): 1.6e-12, (30.9% identity in 343 aa overlap); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3634c, Rv3784, etc. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY. Note that previously known as galE2. Protein product from Mb0550 detected using SWATH mass spectrometry. Mb0550 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248258.1" /translation="MRVLLTGAAGFIGSRVDAALRAAGHDVVGVDALLPAAHGPNPVL PPGCQRVDVRDASALAPLLAGVDLVCHQAAMVGAGVNAADAPAYGGHNDFATTVLLAQ MFAAGVRRLVLASSMVVYGQGRYDCPQHGPVDPLPRRRADLDNGVFEHRCPGCGEPVI WQLVDEDAPLRPRSLYAASKTAQEHYALAWSEASGGSVVALRYHNVYGPGMPRDTPYS GVAAIFRSAVEKGKPPKVFEDGGQMRDFVHVDDVAAANLAAVHLGEADRDGFTAVNVC SGRPISILQVATAICDARGGSMSPAITGHYRSGDVRHIVADPARAARVLGFRAAVDPG EGLREFAFAPLR" CDS complement(629542..630975) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0551C" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb0551c, -, len: 477 aa. Equivalent to Rv0537c, len: 477 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 477 aa overlap). Probable integral membrane protein, showing weak similarity to YDNK_STRCO|P40180 hypothetical 41.2 kd protein from Streptomyces coelicolor (411 aa), FASTA scores: opt: 122, E(): 0.85, (28.2% identity in 373 aa overlap). Protein product from Mb0551c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0551c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248259.1" /translation="MGLSSDDTRRREVVRDLAAGALLIGALFFPWNLYFGFRIPDSSK TVFGLLLAVTSLSLASLAVTFAGRRSQLRLGLNVPYLLLVLAFVVFDAIQTIRLGGTV HVPGGVGPGGWLGITGALLSAQPALTGATTDEGSHSRWLRATQFLGYASMLGAALSTG FNLSWRVRYALEPAAGASGFGKQNLAVIDTAVVYGVVALAAVLVASRWLLRPTAAEAL STVALGGSTLIAGSIVWSLPIGREIDAFHGIAQNTSTAGVGYEGYLVWAAAAAMCAPL TLFRSPNAPPIDKTVWRAASRNGLLLIAVWCLGSVAMRLTDLVVAVLLNYPFSRYDSM ALAAFDLATAVLAIWLRFNMATEALPARLISSLCGLLCTFTVSRVIVGVVLAPRFQAS SGGSAHPVYGNDLAQQITSTFDVVLCGLALSILAAAIVIGRLRQLPQPPHTPALSRPA GSPRIFRSAGSTHPVRPKIYRPPDHSS" CDS 631284..632930 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0552" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb0552, -, len: 548 aa. Equivalent to Rv0538, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 548 aa overlap). Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus. Protein product from Mb0552 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0552 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248260.1" /translation="MDVALGVAVTDRVARLALVDSAAPGTVIDQFVLDVAEHPVEVLT ETVVGTDRSLAGENHRLVATRLCWPDQAKADELQHALQDSGVHDVAVISEAQAATALV GAAHAGSAVLLVGDETATLSVVGDPDAPPTMVAVAPVAGADATSTVDTLMARLGDQAL APGDVFLVGRSAEHTTVLADQLRAASTMRVQTPDDPTFALARGAAMAAGAATMAHPAL VADATTSLPPAEAGQSGSEGEQLAYSQASDYELLPVDEYEEHDEYGAAADRSAPLSRR SLLIGNAVVAFAVIGFASLAVAVAVTIRPTAASKPVEGHQNAQPGKFMPLLPTQQQAP VPPPPPDDPTAGFQGGTIPAVQNVVPRPGTSPGVGGTPASPAPEAPAVPGVVPAPVPI PVPIIIPPFPGWQPGMPTIPTAPPTTPVTTSATTPPTTPPTTPVTTPPTTPPTTPVTT PPTTPPTTPVTTPPTTVAPTTVAPTTVAPTTVAPTTVAPATATPTTVAPQPTQQPTQQ PTQQMPTQQQTVAPQTVAPAPQPPSGGRNGSGGGDLFGGF" CDS 632987..633619 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0553" /product="PROBABLE DOLICHYL-PHOSPHATE SUGAR SYNTHASE (DOLICHOL-PHOSPHATE SUGAR SYNTHETASE) (DOLICHOL-PHOSPHATE SUGAR TRANSFERASE) (SUGAR PHOSPHORYLDOLICHOL SYNTHASE)" /note="Mb0553, -, len: 210 aa. Equivalent to Rv0539, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 210 aa overlap). Probable dolichol-P-sugar synthase (EC 2.4.1.-), highly similar to CAB76989.1|AL159178 putative glycosyltransferase from Streptomyces coelicolor (242 aa), and similar to various dolichol-P-sugar synthetases and sugar transferases e.g. NP_126257.1|NC_000868 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE RELATED PROTEIN from Pyrococcus abyssi (211 aa); N-terminus of NP_127133.1|NC_000868 DOLICHOL-P-GLUCOSE SYNTHETASE from Pyrococcus abyssi (378 aa); N-terminus of NP_068880.1|NC_000917 putative dolichol-P-glucose synthetase from Archaeoglobus fulgidus (369 aa), FASTA scores: E(): 2.4e-13, (32. 1% identity in 193 aa overlap); Q26732 DOLICHYL-PHOSPHATE-MANNOSE SYNTHASE PRECURSOR from TRYPANOSOMA BRUCEI (267 aa), FASTA scores: opt: 179, E(): 0.0011, (30.7% identity in 205 aa overlap); etc. Also similar to Rv2051c|MTY25D10_18 from Mycobacterium tuberculosis. Contains S00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0553 detected using SWATH mass spectrometry. Mb0553 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248261.1" /translation="MLPCLNEEESLPAVLAAIPAGYRALVVDNNSTDDTATVAARHGA QVVVEPRPGYGSAVHAGVLAATTPIVAVIDADGSMDAGDLPKLVAELDKGADLVTGRR RPVAGLHWPWVARVGTVVMSWRLRTRHRLPVHDIAPMRVARREALLDLGVVDRRSGYP LELLVRAAAAGWRVVELDVSYGPRTGGKSKVSGSLRGSIIAILDFWKVIS" CDS 633616..634278 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0554" /product="cell wall biogenesis" /note="Mb0554, -, len: 220 aa. Equivalent to Rv0540, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Streptomyces coelicolor: CAB76990.1|AL159178 (213 aa); N-terminus of BAA84086.1|AB032065 (446 aa); and CAB61872.1|AL133252|SCE46_21 (210 aa), FASTA scores: opt: 267, E(): 5.3e-10, (32.7% identity in 202 aa overlap). Also some similarity with D90913_63|PCC6803 from Synecho cystis sp (211 aa), FASTA scores: opt: 189, E(): 4.7e-06, (25.3 identity in 194 aa overlap). Protein product from Mb0554 detected using shotgun mass spectrometry. Mb0554 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248262.1" /translation="MSCLPVSVLVVAKAPEPGRVKTRLAAAIGDKVAADIAAAALLDT LDAVAAAPVTARAVALTGDLDSAADSAEIRRRLKSFTVFRQRGDAFADRLANAHVDAA DGYPVLQIGMDTPQVTAELLADCARLLLQIPAVLGLAFDGGWWVLGIRTPTAAECLRA VPMSQPDTGELTLKALRDNGIDVTLVQRLGDFDIVDDIALVRDCCAPGSRFAQATRAA GL" CDS complement(634299..635648) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0555C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0555c, -, len: 449 aa. Equivalent to Rv0541c, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 449 aa overlap). Probable conserved integral membrane protein, highly similar (except first 40 residues) to CAB76994.1|AL159178 putative integral membrane protein from Streptomyces coelicolor (456 aa). Also some similarity to Q13724|GCS1_HUMAN MANNOSYL-OLIGOSACCHARIDE GLUCOSIDASE (834 aa), FASTA scores: opt: 150, E(): 0.013, (27.1% identity in 339 aa overlap). Contains PS00041 Bacterial regulatory proteins, araC family signature. Mb0555c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248263.1" /translation="MRIGRREGLAVAIGFVLVGAAFVLPRLNLGIKPRSDIGLERFAT RAGAAPIFGYWDAHVGWGTAPAVLTAVAVVAWGPVVAHRLPWRVLTLSTWATAAAWAF SLAMIDGWQRGFAGRLTTRDEYLWQVPGIADIPATLRTFTSRILDFQPNSWVTHVSGH PPGALLTFVWLDRIGLRGGGWAGLVCLLVGSSAAAAVLIAVRVLASEQMARRTAPFVA VAPTAIWVAVSADGYFAGVAAWGIALLAVAVHGATRFPALVAAGAGLLLGWGVFLNYG LVLIVLPGMAVLAAADWRPVLRALGPAVLAALVVAVSFAVAGFSWFDGYTLVQQRYWQ GIAKDRPFGYWSWANLACVVCAIGLGSVAGLSRVFDRAAISRRSGCHLLLLAVLAAIA LADLSMLSKAETERIWLPFTIWLTAAPALLPPRSHRLWLAVNAAGALLLNSIIFTNW" CDS complement(635660..636748) /codon_start=1 /transl_table=11 /gene="menE" /locus_tag="BQ2027_MB0556C" /product="POSSIBLE O-SUCCINYLBENZOIC ACID--COA LIGASE MENE (OSB-COA SYNTHETASE) (O-SUCCINYLBENZOATE-COA SYNTHASE)" /note="Mb0556c, menE, len: 362 aa. Equivalent to Rv0542c, len: 362 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 362 aa overlap). Possible menE, O-succinylbenzoic acid-CoA ligase (EC 6.2.1.26), highly similar to Q50170|AAA63145.1|U15187|XCLB 4-Coumarate--CoA ligase from Mycobacterium leprae (352 aa), FASTA scores: opt: 1815, E(): 0, (78.9% identity in 351 aa overlap). Also similar to N-terminus of acid-CoA ligases e.g. NP_471116.1|NC_003212 O-succinylbenzoic acid-CoA ligase from Listeria innocua (469 aa); NP_390957.1|NC_000964 O-succinylbenzoic acid-CoA ligase from Bacillus subtilis (486 aa); MENE_HAEIN|P44565 O-succinylbenzoic acid-CoA ligase from Haemophilus influenzae (452 aa), FASTA scores: opt: 307, E(): 4.6e-12, (25.4% identity in 339 aa overlap); etc. Also some similarity with fadD proteins from Mycobacterium tuberculosis. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb0556c detected using SWATH mass spectrometry. Mb0556c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248264.1" /translation="MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTG PPKGAMLTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVELN VSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELDAVLIGGGPAP RPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLRVLAGGRIAIGGATLAKGY RNPVSPDPFAEPGWFHTDDLGALESGDSGVLTVLGRADEAISTGGFTVLPQPVEAALG THPAVRDCAVFGLADDRLGQRVVAAIVVGDGCPPPTLEALRAHVARTLDVTAAPRELH VVNVLPRRGIGKVDRAALVRRFAGEADQ" CDS complement(636817..637119) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0557C" /product="conserved protein" /note="Mb0557c, -, len: 100 aa. Equivalent to Rv0543c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). Conserved hypothetical protein, equivalent to Q50171|MLU15187_32|NP_302469.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (100 aa), FASTA scores: opt: 493, E(): 6.1e-30, (73.5% identity in 98 aa overlap). Some similarity to Rv3046c|NP_217562.1 from Mycobacterium tuberculosis. Protein product from Mb0557c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0557c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248265.1" /translation="MNRFLTSIVAWLRAGYPEGIPPTDSFAVLALLCRRLSHDEVKAV ANELMRLGDFDQIDIGVVITHFTDELPSPEDVERVRARLAAQGWPLDDVRDREEHA" CDS complement(637179..637457) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0558C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0558c, -, len: 92 aa. Equivalent to Rv0544c, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Possible conserved transmembrane protein, equivalent to NP_302470.1|NC_002677 possible membrane protein from Mycobacterium leprae (96 aa); and shows some similarity to MLU15187_33|Q50172|U296V from Mycobacterium leprae (36 aa), FASTA scores: opt: 151, E(): 2.1e-05, (71.4% identity in 35 aa overlap). Also some similarity with VATL_NEPNO|Q26250 vacuolar ATP synthase 16 kd proteolipid from Nephrops norvegicus (159 aa), FASTA scores: opt: 80, E(): 11, (26.1% identity in 88 aa overlap). Protein product from Mb0558c detected using SWATH mass spectrometry. Mb0558c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248266.1" /translation="MSAWFNYTATLKILIFSLLAGALLPGLFAVGVRLQAAGDGADAT ARRRPLLVAVSWAIFALVLAVVIIGVLYIARDFIAHHTGWAFLGATPK" CDS complement(637454..638707) /codon_start=1 /transl_table=11 /gene="pitA" /locus_tag="BQ2027_MB0559C" /product="PROBABLE LOW-AFFINITY INORGANIC PHOSPHATE TRANSPORTER INTEGRAL MEMBRANE PROTEIN PITA" /note="Mb0559c, pitA, len: 417 aa. Equivalent to Rv0545c, len: 417 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 417 aa overlap). Probable pitA, low-affinity inorganic phosphate transporter, integral membrane protein, equivalent to Q50173|NP_302471.1 pitA from Mycobacterium leprae (414 aa), FASTA scores: opt: 2035, E(): 0, (76.3% identity in 418 aa overlap). Also highly similar to others e.g. CAB59461.1|AL132644 putative low-affinity phosphate transport protein from Streptomyces coelicolor (423 aa); PITA_ECOLI|P37308 low-affinity inorganic phosphate transporter from Escherichia coli (499 aa), FASTA scores: opt: 304, E(): 6.9e-10, (32.5 % identity in 234 aa overlap); etc. BELONGS TO THE PHO-4 FAMILY OF TRANSPORTERS, PIT SUBFAMILY. Mb0559c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248267.1" /translation="MNLQLFLLLIVVVTALAFDFTNGFHDTGNAMATSIASGALAPRV AVALSAVLNLIGAFLSTAVAATIAKGLIDANLVTLELVFAGLVGGIVWNLLTWLLGIP SSSSHALIGGIVGATIAAVGLRGVIWSGVVSKVIVPAVVAALLATLVGAVGTWLVYRT TRGVAEKRTERGFRRGQIGSASLVSLAHGTNDAQKTMGVIFLALMSYGAVSTTASVPP LWVIVSCAVAMAAGTYLGGWRIIRTLGKGLVEIKPPQGMAAESSSAAVILLSAHFGYA LSTTQVATGSVLGSGVGKPGAEVRWGVAGRMVVAWLVTLPLAGLVGAFTYGLVHFIGG YPGAILGFALLWLTATAIWLRSRRAPIDHTNVNADWEGNLTAGLEAGAQPLADQRPPV PAPPAPTPPPNHRAPQFGVTTRNAP" CDS complement(638827..639213) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0560C" /product="Lactoylglutathione lyase and related lyases" /note="Mb0560c, -, len: 128 aa. Equivalent to Rv0546c, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Conserved hypothetical protein, equivalent to AAA63111.1|U15187|Q50174|U296X hypothetical protein from Mycobacterium leprae (144 aa), FASTA scores: opt: 748, E(): 0, (84.2% identity in 133 aa overlap). Also highly similar to CAB95979.1|AL360034 conserved hypothetical protein from Streptomyces coelicolor (130 aa); and similar to AE000854_8|O26852 S-D-LACTOYLGLUTATHIONE METHYLGLYOXAL LYASE from Methanobacterium thermoautotropto (116 aa), FASTA scores: opt: 155, E(): 0.00019, (30.6% identity in 108 aa overlap); YAER_ECOLI hypothetical 14.7 kd protein from Escherichia coli (129 aa), FASTA scores: opt: 104, E(): 0.42, (28.7% identity in 115 aa overlap). Also similar to Rv2068c from Mycobacterium tuberculosis. Protein product from Mb0560c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0560c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248268.1" /translation="MEILASRMLLRPADYQRSLSFYRDQIGLAIAREYGAGTVFFAGQ SLLELAGYGEPDHSRGPFPGALWLQVRDLEATQTELVSRGVSIAREPRREPWGLHEMH VTDPDGITLIFVEVPEGHPLRTDTRA" CDS complement(639276..640160) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0561C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0561c, -, len: 294 aa. Equivalent to Rv0547c, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 294 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases e.g. fatty acyl-CoA reductase from Acinetobacter calcoaceticus (295 aa); NP_280196.1|NC_002607 3-oxoacyl-[acyl-carrier-protein] reductase from Halobacterium sp. NRC-1 (255 aa); NP_349214.1|NC_003030 Short-chain alcohol dehydrogenase family protein from Clostridium acetobutylicum (255 aa); etc. Also similar to several proteins from Mycobacterium tuberculosis e.g. Y04M_MYCTU|Q10783 putative oxidoreductase (341 aa), FASTA scores: opt: 644, E(): 0, (46.1% identity in 258 aa overlap). Protein product from Mb0561c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0561c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248269.1" /translation="MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRIL LTGASSGIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSDME AIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLNYYAPLRLIRG LAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALSAVSRIIETEWGSQGVHST TLYYPLVATPMIAPTKAYDGLPALTAAEAAEWMVTAARTRPVRIAPRVAVAVNALDSI GPRWVNALMQRRNEQLNP" CDS complement(640256..641200) /codon_start=1 /transl_table=11 /gene="menB" /locus_tag="BQ2027_MB0562C" /product="naphthoate synthase menb (dihydroxynaphthoic acid synthetase) (dhna synthetase)" /note="Mb0562c, menB, len: 314 aa. Equivalent to Rv0548c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Probable menB, naphthoate synthase (dihydroxynaphthonic acid synthase) (EC 4.1.3.36), equivalent to NP_302473.1|NC_002677 naphthoate synthase from Mycobacterium leprae (300 aa). Also similar to others e.g. MENB_ECOLI|P27290 naphthoate synthase from Escherichia coli (285 aa), FASTA scores: opt: 599, E(): 9.3e-33, (48.1 identity in 285 aa overlap); etc. BELONGS TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb0562c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0562c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248270.1" /translation="MVAPAGEQGRSSTALSDNPFDAKAWRLVDGFDDLTDITYHRHVD DATVRVAFNRPEVRNAFRPHTVDELYRVLDHARMSPDVGVVLLTGNGPSPKDGGWAFC SGGDQRIRGRSGYQYASGDTADTVDVARAGRLHILEVQRLIRFMPKVVICLVNGWAAG GGHSLHVVCDLTLASREYARFKQTDADVGSFDGGYGSAYLARQVGQKFAREIFFLGRT YTAEQMHQMGAVNAVAEHAELETVGLQWAAEINAKSPQAQRMLKFAFNLLDDGLVGQQ LFAGEATRLAYMTDEAVEGRDAFLQKRPPDWSPFPRYF" CDS complement(641472..641579) /codon_start=1 /transl_table=11 /gene="vapc3" /locus_tag="BQ2027_MB0563C" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb0563c, -, len: 45 aa. Similar to 3' end of Rv0549c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 45 aa overlap). Conserved hypothetical protein, similar to Rv0960, Rv0065, and Rv1720c from Mycobacterium tuberculosis. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0549c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Rv0549c into 2 parts, Mb0563c and Mb0564c. Protein product from Mb0563c detected using SWATH mass spectrometry. Mb0563c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248271.1" /translation="MTDALYVELAETAGLVLLTTDERLARAWPSAHAIG" CDS complement(641576..641884) /codon_start=1 /transl_table=11 /gene="vapc3" /locus_tag="BQ2027_MB0564C" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb0564c, -, len: 102 aa. Similar to 5' end of Rv0549c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Conserved hypothetical protein, similar to Rv0960, Rv0065, and Rv1720c from Mycobacterium tuberculosis. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0549c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Rv0549c into 2 parts, Mb0563c and Mb0564c. Mb0564c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248272.1" /translation="MRASPTSPPEQVVVDASAMVDLLARTSDRCSAVRARLARTAMHA PAHFDAEVLSALGRMQRAGALTVAYVDAALEELRQVPVTRHGLSSLLAERGRAATPSA " CDS complement(641881..642147) /codon_start=1 /transl_table=11 /gene="vapb3" /locus_tag="BQ2027_MB0565C" /product="possible antitoxin vapb3" /note="Mb0565c, -, len: 88 aa. Equivalent to Rv0550c, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Hypothetical unknown protein. Protein product from Mb0565c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0565c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248273.1" /translation="MLSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQ AAISAELENSATDAWLEGLEPRSTGARHDDVLGAIDAARDEFEA" CDS complement(642339..644054) /codon_start=1 /transl_table=11 /gene="fadD8" /locus_tag="BQ2027_MB0566C" /product="PROBABLE FATTY-ACID-COA LIGASE FADD8 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0566c, fadD8, len: 571 aa. Equivalent to Rv0551c, len: 571 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 571 aa overlap). Probable fadD8, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. LCFA_ECOLI|P29212 long-chain-fatty-acid--CoA ligase (561 aa), FASTA scores: opt: 585, E(): 9.5e-30, (28.7% identity in 536 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. Note other possible start sites exist downstream of this start. Protein product from Mb0566c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0566c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248274.1" /translation="MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELL RSPTHNGHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVAVG LLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLIIDPNPMFVER ALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQPQPLVAADLPPDQVIGLTY TGGTTGKPKGVIGTAQSIATMTSIQLAEWEWPANPRFLMCTPLSHAGAAFFTPTVIKG GEMIVLAKFDPAEVLRIIEEQRITATMLVPSMLYALLDHPDSHTRDLSSLETVYYGAS AINPVRLAEAIRRFGPIFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVAL LDEHGKPVKQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTAVVVLRSNA ARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGLGKPDKKAVRARFWEGA GRAVG" CDS 644132..645736 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0567" /product="Exoenzymes regulatory protein AepA precursor" /note="Mb0567, -, len: 534 aa. Equivalent to Rv0552, len: 534 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 534 aa overlap). Conserved hypothetical protein, similar to others from several organisms. Also shows some similarity with regulatory proteins e.g. AEPA_ERWCA|Q06555 exoenzymes regulatory protein aepA [Precursor] from Erwinia carotovora (465 aa), FASTA scores: opt: 278, E(): 7.6e-11, (23.0% identity in 408 aa overlap). Also similar to Z99119|BSUB0016_28 from Bacillus subtilis (529 aa), FASTA scores: opt: 436, E(): 8.3e-20, (23.8% identity in 547 aa overlap). C-terminus is similar to MLRRNOPR_1 HYPOTHETICAL 17.7 KD PROTEIN from Mycobacterium leprae (154 aa), FASTA score: (43.1% identity in 160 aa overlap). Protein product from Mb0567 detected using SWATH mass spectrometry. Mb0567 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248275.1" /translation="MADADLVMTGTVLTVDDARPTAEAIAVADGRVIAVGDRSEVAGL VGANTRVIDLGAGCVMPGFVEAHGHPLLEAVVLSDRFVDIRPVTMRDADDVVAAIRGE VARRGPAGAYLVGWDPLLQSGLGEPTLTWLDSLAPNGPLVIIHNSGHKAYFNSHAAWL NGLTRDTADPKGAKYGRDGNGELDGTAEEIGAILPLLAGVADPSNFGAMLRAECARLN RAGLTTCSEMAFDPGYRPMVEAVRAELTVRLCTYEISNARMCTDATPGQGDDMLRQVG IKIWVDGSPWVGNIDLTFPYLDTPATRAIGVPPGSRGCANYTREQLAEIVGAYFPRGW QIACHVHGDGGVDTILDVYEEALRRNPRDDHRLRLEHVGAIRPDQLRRAAELGVTCSI FVDQIHYWGDVIVDDLFGAQRGSRWMPAGSAVAAGMRISLHNDPPVTPEEPLRNISVA ATRVAPSGRVLAPEERLTVEQAIRAQTIDAAWQLFAEDAIGSLQVGKYADMVVLSADP RTVPPEQIADLAVRATFLAGRQVYRR" CDS 645733..646713 /codon_start=1 /transl_table=11 /gene="menC" /locus_tag="BQ2027_MB0568" /product="PROBABLE MUCONATE CYCLOISOMERASE MENC (CIS,CIS-MUCONATE LACTONIZING ENZYME) (MLE)" /note="Mb0568, menC, len: 326 aa. Equivalent to Rv0553, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 326 aa overlap). Probable menC, muconate cycloisomerase (EC 5.5.1.1), equivalent to NP_302476.1|NC_002677 putative isomerase/racemase from Mycobacterium leprae (334 aa). Also similar to other muconate cycloisomerases e.g. TCBD_PSESP|P27099 chloromuconate cycloisomerase (370 aa), FASTA scores: opt: 249, E(): 7.8e-09, (32.7% identity in 199 aa overlap). Also similar to O-succinylbenzoate-CoA synthases. BELONGS TO THE MANDELATE RACEMASE / MUCONATE LACTONIZING ENZYME FAMILY. Protein product from Mb0568 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0568 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248276.1" /translation="MIPVLPPLEALLDRLYVVALPMRVRFRGITTREVALIEGPAGWG EFGAFVEYQSAQACAWLASAIETAYCAPPPVRRDRVPINATVPAVAAAQVGEVLARFP GARTAKVKVAEPGQSLADDIERVNAVRELVPMVRVDANGGWGVAEAVAAAAALTADGP LEYLEQPCATVAELAELRRRVDVPIAADESIRKAEDPLAVVRAQAADIAVLKVAPLGG ISALLDIAARIAVPVVVSSALDSAVGIAAGLTAAAALPELDHACGLGTGGLFEEDVAE PAAPVDGFLAVARTTPDPARLQALGAPPQRRQWWIDRVKACYSLLVPSFG" CDS 646710..647498 /codon_start=1 /transl_table=11 /gene="bpoC" /locus_tag="BQ2027_MB0569" /product="POSSIBLE PEROXIDASE BPOC (NON-HAEM PEROXIDASE)" /note="Mb0569, bpoC, len: 262 aa. Equivalent to Rv0554, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Possible bpoC, peroxidase (non-haem peroxidase) (EC 1.11.1.-), equivalent to NP_302477.1|NC_002677 putative hydrolase from Mycobacterium leprae (265 aa). Also highly similar or similar to various hydrolases and peroxidases e.g. CAB38877.1|AL035707|T36181 probable hydrolase from Streptomyces coelicolor (272 aa); CAC48368.1|Y16952 putative hydrolase from Amycolatopsis mediterranei (284 aa); P29715|BPA2_STRAU non-haem bromoperoxidase bpo-a2 (bromide peroxidase) (EC 1.11.1.-) from Streptomyces aureofaciens (277 aa), FASTA scores: opt: 325, E(): 2.3e-15, (29.5% identity in 268 aa overlap); O31168|PRXC_STRAU|CPO|CPOT non-heme chloroperoxidase (chloride peroxidase) (EC 1.11.1.10) from Streptomyces aureofaciens (278 aa); etc. Also similar to M. tuberculosis non-heme haloperoxidases and epoxide hydrolases e.g. Rv1938, Rv3617, etc. Protein product from Mb0569 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0569 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248277.1" /translation="MINLAYDDNGTGDPVVFIAGRGGAGRTWHPHQVPAFLAAGYRCI TFDNRGIGATENAEGFTTQTMVADTAALIETLDIAPARVVGVSMGAFIAQELMVVAPE LVSSAVLMATRGRLDRARQFFNKAEAELYDSGVQLPPTYDARARLLENFSRKTLNDDV AVGDWIAMFSMWPIKSTPGLRCQLDCAPQTNRLPAYRNIAAPVLVIGFADDVVTPPYL GREVADALPNGRYLQIPDAGHLGFFERPEAVNTAMLKFFASVKA" CDS 647541..649205 /codon_start=1 /transl_table=11 /gene="menD" /locus_tag="BQ2027_MB0570" /product="PROBABLE BIFUNCTIONAL MENAQUINONE BIOSYNTHESIS PROTEIN MEND : 2-SUCCINYL-6-HYDROXY-2,4-CYCLOHEXADIENE-1- CARBOXYLATE SYNTHASE (SHCHC SYNTHASE) + 2-OXOGLUTARATE DECARBOXYLASE (ALPHA-KETOGLUTARATE DECARBOXYLASE) (KDC)" /note="Mb0570, menD, len: 554 aa. Equivalent to Rv0555, len: 554 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 554 aa overlap). Probable menD, menaquinone biosynthesis protein, including 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase (EC 4.1.3.-) and 2-oxoglutarate decarboxylase (EC 4.1.1.71) activities. Equivalent to NP_302478.1|NC_002677 putative 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1- carboxylate synthase / 2-oxoglutarate decarboxylase from Mycobacterium leprae (556 aa). Also similar to others e.g. MEND_BACSU|P23970 2-succinyl-6-hydroxy-2,4-cyclohexadiene- 1-carboxylate synthase from Bacillus subtilis (548 aa), FASTA scores: opt: 488, E(): 2.3e-21, (34.3% identity in 545 aa overlap); etc. COFACTOR: THIAMINE PYROPHOSPHATE. Protein product from Mb0570 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0570 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248278.1" /translation="MNPSTTQARVVVDELIRGGVRDVVLCPGSRNAPLAFALQDADRS GRIRLHVRIDERTAGYLAIGLAIGAGAPVCVAMTSGTAVANLGPAVVEANYARVPLIV LSANRPYELLGTGANQTMEQLGYFGTQVRASISLGLAEDAPERTSALNATWRSATCRV LAAATGARTANAGPVHFDIPLREPLVPDPEPLGAVTPPGRPAGKPWTYTPPVTFDQPL DIDLSVDTVVISGHGAGVHPNLAALPTVAEPTAPRSGDNPLHPLALPLLRPQQVIMLG RPTLHRPVSVLLADAEVPVFALTTGPRWPDVSGNSQATGTRAVTTGAPRPAWLDRCAA MNRHAIAAVREQLAAHPLTTGLHVAAAVSHALRPGDQLVLGASNPVRDVALAGLDTRG IRVRSNRGVAGIDGTVSTAIGAALAYEGAHERTGSPDSPPRTIALIGDLTFVHDSSGL LIGPTEPIPRSLTIVVSNDNGGGIFELLEQGDPRFSDVSSRIFGTPHDVDVGALCRAY HVESRQIEVDELGPTLDQPGAGMRVLEVKADRSSLRQLHAAIKAAL" CDS 649202..649717 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0571" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0571, -, len: 171 aa. Equivalent to Rv0556, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 171 aa overlap). Probable conserved transmembrane protein, equivalent to NP_302479.1|NC_002677 putative membrane protein from Mycobacterium leprae (175 aa). Protein product from Mb0571 detected using SWATH mass spectrometry. Mb0571 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248279.1" /translation="MISPKPLLHILIHGRSDELPDTRGRIVLRWLRIAVLIVTGLVTL QSVLLVAGAWRNDIAIQRNMGVAQAEVLSAGPRRSTIEFVTPDRITYRPQLGVLYPSE LSTGMRIYVEYNKRDPNLVRVQHRNAGLAIIPAGSIAVVAWLIAAAALVVLAVLDKRL ERRENSASATG" CDS 649779..650915 /codon_start=1 /transl_table=11 /gene="mgta" /locus_tag="BQ2027_MB0572" /product="mannosyltransferase mgta" /note="Mb0572, pimB, len: 378 aa. Equivalent to Rv0557, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 378 aa overlap). pimB (alternate gene name: mtfB), mannosyltransferase (EC 2.4.1.-) (see citation below), similar to other various transferases e.g. NP_243554.1|NC_002570 alpha-D-mannose-alpha(1-6)phosphatidyl myo-inositol monomannoside transferase from Bacillus halodurans (381 aa); NP_249533.1|NC_002516 probable glycosyl transferase from Pseudomonas aeruginosa (406 aa); NP_419573.1|NC_002696 glycosyl transferase, group 1 family protein, from Caulobacter crescentus (455 aa); etc. Also similar to Q55598 hypothetical 44.9 kDa protein from SYNECHOCYSTIS SP (409 aa), FASTA scores: opt: 703, E(): 0, (33.9% identity in 378 aa overlap); GPI3_YEAST|P32363 n-acetylglucosaminyl-phosphatidylinositol biosynthetic protein (452 aa), FASTA scores: opt: 230, E(): 1.1e-07, (23.5% identity in 328 aa overlap). Mb0572 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248280.1" /translation="MCGVRVAIVAESFLPQVNGVSNSVVKVLEHLRRTGHEALVIAPD TPPGEDRAERLHDGVRVHRVPSRMFPKVTTLPLGVPTFRMLRALRGFDPDVVHLASPA LLGYGGLHAARRLGVPTVAVYQTDVPGFASSYGIPMTARAAWAWFRHLHRLADRTLAP STATMESLIAQGIPRVHRWARGVDVQRFAPSARNEVLRRRWSPDGKPIVGFVGRLAPE KHVDRLTGLAASGAVRLVIVGDGIDRARLQSAMPTAVFTGARYGKELAEAYASMDVFV HSGEHETFCQVVQEALASGLPVIAPDAGGPRDLITPHRTGLLLPVGEFEHRLPDAVAH LVHERQRYALAARRSVLGRSWPVVCDELLGHYEAVRGRRTTQAA" CDS 650932..651636 /codon_start=1 /transl_table=11 /gene="menH" /locus_tag="BQ2027_MB0573" /product="PROBABLE UBIQUINONE/MENAQUINONE BIOSYNTHESIS METHYLTRANSFERASE MENH (2-heptaprenyl-1,4-naphthoquinone methyltransferase)" /note="Mb0573, menH, len: 234 aa. Equivalent to Rv0558, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Probable menH (alternate gene name: menG), ubiquinone/menaquinone biosynthesis methlytransferase (2-heptaprenyl-1,4-naphthoquinone methyltransferase) (EC 2.1.1.-), equivalent to NP_302480.1|NC_002677 putative ubiquinone/menaquinone biosynthesis methyltransferase from Mycobacterium leprae (238 aa). Also highly similar to others e.g. CAB44537.1|AL078618|T34630 from Streptomyces coelicolor (231 aa); UBIE_ECOLI|P27851 from Escherichia coli strain K12 (251 aa), FASTA scores: opt: 421, E(): 1.2e-21, (43.2% identity in 227 aa overlap); GRC2_BACSU|P31113 from Bacillus subtilis (233 aa), FASTA scores: opt: 345, E(): 1.4e-16, (34.6% identity in 231 aa overlap); etc. BELONGS TO THE UBIE FAMILY. Note that previously known as ubiE. Protein product from Mb0573 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0573 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248281.1" /translation="MSRAALDKDPRDVASMFDGVARKYDLTNTVLSLGQDRYWRRATR SALRIGPGQKVLDLAAGTAVSTVELTKSGAWCVAADFSVGMLAAGAARKVPKVAGDAT RLPFGDDVFDAVTISFGLRNVANQQAALREMARVTRPGGRLLVCEFSTPTNALFATAY KEYLMRALPRVARAVSSNPEAYEYLAESIRAWPDQAVLAHQISRAGWSGVRWRNLTGG IVALHAGYKPGKQTPQ" CDS complement(651650..651988) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0574C" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb0574c, -, len: 112 aa. Equivalent to Rv0559c, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Possible conserved secreted protein, similar to NP_302481.1|NC_002677 putative secreted protein from Mycobacterium leprae (112 aa). Also similar to Y08B_MYCTU|Q11048 hypothetical 11.6 kd protein FASTA scores: opt: 111, E(): 011, (25.4% identity in 114 aa overlap). Contains possible N-terminal signal sequence. Protein product from Mb0574c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0574c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248282.1" /translation="MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGP QDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEH VGVLQRAGTR" CDS complement(652022..652747) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0575C" /product="POSSIBLE BENZOQUINONE METHYLTRANSFERASE (METHYLASE)" /note="Mb0575c, -, len: 241 aa. Equivalent to Rv0560c, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Possible benzoquinone methyltransferase (EC 2.1.1.-) (see citation below), similar to other hypothetical proteins and methyltransferases e.g. Q54300 METHYLTRANSFERASE (211 aa), FASTA scores: opt: 203, E(): 4.8e-07, (30.9% identity in 136 aa overlap). Similar to Rv3699, Rv1377c, Rv2675c, etc from Mycobacterium tuberculosis. Rv0560c can be induced by salicylate and para-amino-salicylate (PAS). Protein product from Mb0575c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0575c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248283.1" /translation="MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPW SIGEPQPELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELARH EAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYLQSIVRAAAPG ASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIKPARLYARFPAGFAGMPAL LDIREEPNGLQSIGGWLLSAHLG" CDS complement(652772..653998) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0576C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0576c, -, len: 408 aa. Equivalent to Rv0561c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 408 aa overlap). Possible oxidoreductase (EC 1.-.-.-), highly similar (except in first 30 aa) to NP_302482.1|NC_002677 putative FAD-linked oxidoreductase from Mycobacterium leprae (408 aa). Also similar to T34627 probable electron transfer oxidoreductase from Streptomyces coelicolor (430 aa); and some bacteriochlorophyll synthases e.g. NP_069300.1|NC_000917 bacteriochlorophyll synthase from Archaeoglobus fulgidus (410 aa); Q55087 GERANYLGERANYL HYDROGENASE (407 aa), FASTA scores: opt: 208, E(): 1.7e-06, (26.9% identity in 327 aa overlap). Protein product from Mb0576c detected using SWATH mass spectrometry. Mb0576c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248284.1" /translation="MSVDDSADVVVVGAGPAGSAAAAWAARAGRDVLVIDTATFPRDK PCGDGLTPRAVAELHQLGLGKWLADHIRHRGLRMSGFGGEVEVDWPGPSFPSYGSAVA RLELDDRIRKVAEDTGARMLLGAKAVAVHHDSSRRVVSLTLADGTEVGCRQLIVADGA RSPLGRKLGRRWHRETVYGVAVRGYLSTAYSDDPWLTSHLELRSPDGAVLPGYGWIFP LGNGEVNIGVGALSTSRRPADLALRPLISYYTDLRRDEWGFTGQPRAVSSALLPMGGA VSGVAGSNWMLIGDAAACVNPLNGEGIDYGLETGRLAAELLDSRDLARLWPSLLADRY GRGFSVARRLALLLTFPRFLPTTGPITMRSTALMNIAVRVMSNLVTDDDRDWVARVWR GGGQLSRLVDRRPPFS" CDS 654014..655021 /codon_start=1 /transl_table=11 /gene="grcC1" /locus_tag="BQ2027_MB0577" /product="PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE GRCC1 (POLYPRENYL PYROPHOSPHATE SYNTHETASE)" /note="Mb0577, grcC1, len: 335 aa. Equivalent to Rv0562, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 335 aa overlap). Probable grcC1, polyprenyl diphosphate synthetase (EC 2.5.1.-), equivalent to NP_302483.1|NC_002677 polyprenyl diphosphate synthase component from Mycobacterium leprae (330 aa). Also similar to others (generally hepta (EC 2.5.1.30) or hexaprenyl) e.g. GRC3_BACSU|P31114 probable heptaprenyl diphosphate syntetase (348aa), FASTA scores: opt: 599, E(): 4e-31, (33.2% identity in 307 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis proteins Rv0989c|grcC2|NP_215504.1|MTCI237.03c PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE (325 aa); Rv3383c, Rv3398c, etc. Contains PS00444 Polyprenyl synthetases signature 2. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY. Protein product from Mb0577 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0577 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248285.1" /translation="MRTPATVVAGVDLGDAVFAAAVRAGVARVEQLMDTELRQADEVM SDSLLHLFNAGGKRFRPLFTVLSAQIGPQPDAAAVTVAGAVIEMIHLATLYHDDVMDE AQVRRGAPSANAQWGNNVAILAGDYLLATASRLVARLGPEAVRIIADTFAQLVTGQMR ETRGTSENVDSIEQYLKVVQEKTGSLIGAAGRLGGMFSGATDEQVERLSRLGGVVGTA FQIADDIIDIDSESDESGKLPGTDVREGVHTLPMLYALRESGPDCARLRALLNGPVDD DAEVREALTLLRASPGMARAKDVLAQYAAQARHELALLPDVPGRRALAALVDYTVSRH G" CDS 655122..655982 /codon_start=1 /transl_table=11 /gene="htpX" /locus_tag="BQ2027_MB0578" /product="PROBABLE PROTEASE TRANSMEMBRANE PROTEIN HEAT SHOCK PROTEIN HTPX" /note="Mb0578, htpX, len: 286 aa. Equivalent to Rv0563, len: 286 aa (alternative start at position 654006), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable htpX, protease heat shock protein X (EC 3.4.24.-) (transmembrane protein), equivalent to NP_302484.1|NC_002677 putative peptidase from Mycobacterium leprae (287 aa). Also highly similar to others e.g. CAC08262.1|AL392146 putative peptidase from Streptomyces coelicolor (287 aa); NP_387431.1|NC_003047 PUTATIVE PROTEASE TRANSMEMBRANE PROTEIN from Sinorhizobium meliloti (319 aa); NP_105051.1|NC_002678 heat shock protein (htpX) from Mesorhizobium loti (336 aa); NP_248692.1|NC_000909|U67608|MJU67608_8 heat shock protein HtpX, possibly protease (htpX) from Methanococcus jannaschii (284 aa), FASTA scores: opt: 660, E(): 0, (46.5 identity in 245 aa overlap). Continuation of MTCY25D10.42. TBparse score is 0.887. BELONGS TO PEPTIDASE FAMILY M48 (ZINC METALLOPROTEASE). COFACTOR: Zinc. Protein product from Mb0578 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0578 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248286.1" /translation="MTWHPHANRLKTFLLLVGMSALIVAVGALFGRTALMLAALFAVG MNVYVYFNSDKLALRAMHAQPVSELQAPAMYRIVRELATSAHQPMPRLYISDTAAPNA FATGRNPRNAAVCCTTGILRILNERELRAVLGHELSHVYNRDILISCVAGALAAVITA LANMAMWAGMFGGNRDNANPFALLLVALLGPIAATVIRMAVSRSREYQADESGAVLTG DPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPFRAGERIGSLFSTHPPIEDR IRRLEAMARG" CDS complement(656167..657192) /codon_start=1 /transl_table=11 /gene="gpdA1" /locus_tag="BQ2027_MB0579C" /product="PROBABLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE [NAD(P)+] GPDA1 (NAD(P)H-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE) (NAD(P)H-DEPENDENT DIHYDROXYACETONE-PHOSPHATE REDUCTASE)" /note="Mb0579c, gpdA1, len: 341 aa. Equivalent to Rv0564c, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 341 aa overlap). Possible gpdA1(alternate gene names: gpsA, glyC), glycerol-3-phosphate dehydrogenase [NAD(P)+] dependant (EC 1.1.1.94), similar to many other glycerol-3-phosphate dehydrogenases e.g. P46919|GPDA_BACSU from Bacillus subtilis (345 aa), FASTA scores: opt: 731, E(): 0, (37.3% identity in 332 aa overlap); etc. Also similar to Rv2982c|gpdA2|MTCY349.05|Z83018|MTCY349_5 from Mycobacterium tuberculosis (334 aa), FASTA scores: opt: 740, E(): 0, (40.4% identity in 322 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE NAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb0579c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0579c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248287.1" /translation="MAANKREPKVVVLGGGSWGTTVASICARRGPTLQWVRSAVTAQD INDNHRNSRYLGNDVVLSDTLRATTDFTEAANCADVVVMGVPSHGFRGVLVELSKELR PWVPVVSLVKGLEQGTNMRMSQIIEEVLPGHPAGILAGPNIAREVAEGYAAAAVLAMP DQHLATRLSAMFRTRRFRVYTTDDVVGVETAGALKNVFAIAVGMGYSLGIGENTRALV IARALREMTKLGVAMGGKSETFPGLAGLGDLIVTCTSQRSRNRHVGEQLGAGKPIDEI IASMSQVAEGVKAAGVVMEFANEFGLNMPIAREVDAVINHGSTVEQAYRGLIAEVPGH EVHGSGF" CDS complement(657253..658713) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0580C" /product="PROBABLE MONOOXYGENASE" /note="Mb0580c, -, len: 486 aa. Equivalent to Rv0565c, len: 486 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 486 aa overlap). Probable monoxygenase (EC 1.14.-.-), highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-terminus of NP_051574.1|NC_000958 arylesterase/monoxygenase from Deinococcus radiodurans (833 aa); P12015|CYMO_ACISP CYCLOHEXANONE MONOOXYGENASE (EC 1.14.13.22) from Acinetobacter sp. (542 aa), FASTA scores: opt: 354, E(): 2.1e-16, (23.7% identity in 435 aa overlap); etc. Also similar to other putative monoxygenases from Mycobacterium tuberculosis e.g. Rv3854c (489 aa), MTCY01A6.14 (489 aa), MTV013_4 (495 aa), MTCY31.20 (495 aa). Protein product from Mb0580c detected using SWATH mass spectrometry. Mb0580c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248288.1" /translation="MSVTPNAGCVDVVIVGAGISGLGAAYRIIERNPQLTYTILERRA RIGGTWDLFRYPGVRSDSSIFTLSFPYEPWTREEGIADGAHIREYLTDMAHKYGIDRH IEFNSYVRAADWDSSTDTWTVTFEQNGVHKHYRSRFVFFGSGYYNYDEGYTPDFGGIE KFGGAVVHPQHWPEDLDYTGKKIVVIGSGATAVTLIPSLTDRAEKVTMLQRSPTYLIS ASKYSTFAAVVRKALPPKTSHLIVRMYNALLEAVFWFLSRKTPVFVKWLLRRTAIKNL PEGYDIETHFTPRYNPWDQRLCLIPDADLYNAITSGRAEVVTDHIDHFDATGIALKSG GHLDADIIVTATGLQLQALGGAAISLDGVEIDPRDRFVYKAHMLEDVPNLFWCVGYTN ASWTLRADMTARATAKLLAHMAAHGHTRAAPHLGDEPMDEKPSWDIQAGYVKRAPYAL PKSGTKRPWNVRQNYLADAIDYRFDRIEEAMVFGAA" CDS complement(658791..659282) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0581C" /product="UPF0234 protein Yitk" /note="Mb0581c, -, len: 163 aa. Equivalent to Rv0566c, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, similar to others e.g. P77482|YAJQ_ECOLI HYPOTHETICAL 19.0 KDa PROTEIN from Escherichia coli (169 aa), FASTA scores: opt: 422, E(): 5.4e-20, (44.1 identity in 161 aa overlap); etc. Protein product from Mb0581c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0581c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248289.1" /translation="MADSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTKIAWK GDEAVELTSSTEERVKAAVDVFKEKLIRRDISLKAFEAGEPQASGKTYKVTGALKQGI SSENAKKITKLIRDAGPKNVKTQIQGDEVRVTSKKRDDLQAVIAMLKKADLDVALQFV NYR" tRNA 659352..659432 /locus_tag="BQ2027_TYRT" /product="tRNA-Tyr" /note="tyrT, len: 81 nt. Equivalent to tyrT, len: 81 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 nt overlap). tRNA-Tyr, anticodon gta, length = 81" CDS 659564..660583 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0582" /product="PROBABLE METHYLTRANSFERASE/METHYLASE" /note="Mb0582, -, len: 339 aa. Equivalent to Rv0567, len: 339 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 339 aa overlap). Probable methyltransferase (EC 2.1.1.-), similar to several e.g. P39896|TCMO_STRGA TETRACENOMYCIN POLYKETIDE SYNTHESIS 8-O-METHYLTRANSFERASE from Streptomyces glaucescens (339 aa), FASTA scores: opt: 685, E(): 0, (35.8% identity in 335 aa overlap); P10950|HIOM_BOVIN HYDROXYINDOLE O-METHYLTRANSFERASE (EC 2.1.1.4) from Bos taurus (345 aa), FASTA scores: opt: 509, E(): 3.4e-27, (30.7% identity in 332 aa overlap) etc. Protein product from Mb0582 detected using SWATH mass spectrometry. Mb0582 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248290.1" /translation="MELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIA DRLGLLKRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGLLKI WNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFMAAMDAASRRNI ELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCVSLDLPAVTEIARRKLTAEG LGERVQACAGDFLADPLPAADVITMGQILHDWNLDRKQQLVAKAYEALSKEGAFIVIE TLIDDARRENTTGLMMSLNMLIEFGDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSA AVAYK" CDS 660693..662111 /codon_start=1 /transl_table=11 /gene="cyp135B1" /locus_tag="BQ2027_MB0583" /product="POSSIBLE CYTOCHROME P450 135B1 CYP135B1" /note="Mb0583, cyp135B1, len: 472 aa. Equivalent to Rv0568, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 472 aa overlap). Possible cyp135B1, cytochrome P450 (EC 1.14.-.-), similar to putative cytochrome P-450 monoxygenases and other cytochrome P-450 related enzymes e.g. P29980|CPXN_ANASP PROBABLE CYTOCHROME P450 from Anabaena sp. strain PCC 7120 (459 aa), FASTA scores: opt: 525, E(): 7.2e-27, (31.9% identity in 417 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0327c|NP_214841.1|NC_000962|CYP135A1|MT0342|MTCY63.32c PUTATIVE CYTOCHROME P450 (449 aa), FASTA scores: opt: 1080, E(): 0, (40.5% identity in 444 aa overlap); Rv3685c|NP_218202.1|NC_000962 PUTATIVE CYTOCHROME P450 (476 aa); Rv0136|NP_214650.1|NC_000962 PUTATIVE CYTOCHROME P450 (441 aa); etc. Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). Protein product from Mb0583 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0583 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248291.1" /translation="MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFT LHVAGFGHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHRDR RRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITLEVILRTVIGA SDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSRLRRRIEEADALLYAEIAD RRADPDLAARTDTLAMLVRAADEDGRTMTERELRDQLITLLVAGHDTTATGLSWALER LTRHPVTLAKAVQAADASAAGDPAGDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAG YRLPAGVMVVPAIGLVHASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGAT FAMVEMRVVLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA TAQGAGCPAARGGGPSRAVGSQ" CDS 662246..662512 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0584" /product="conserved protein" /note="Mb0584, -, len: 88 aa. Equivalent to Rv0569, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Conserved hypothetical protein. C-terminus highly similar to AAA63065.1|U15184|MLU15184_10 hypothetical protein from Mycobacterium leprae (53 aa), FASTA scores: opt: 140, E(): 0.0046, (64.7% identity in 34 aa overlap). Also similar to T36824|SCI35.11 hypothetical protein from Streptomyces coelicolor (64 aa); and N-terminus of T36956 probable DNA-binding protein from Streptomyces coelicolor (323 aa). Also highly similar to Rv2302|MTCY339.07c|NP_216818.1|NC_000962 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (80 aa), FASTA scores: opt: 300, E(): 1.4e-13, (61.8% identity in 76 aa overlap). Protein product from Mb0584 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0584 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248292.1" /translation="MKAKVGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLET DHVATVIPGPDAVVVTAEEQNAADERAQHRFGAVQSAILHARGT" CDS 662538..664670 /codon_start=1 /transl_table=11 /gene="nrdZ" /locus_tag="BQ2027_MB0585" /product="PROBABLE RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE (LARGE SUBUNIT) NRDZ (RIBONUCLEOTIDE REDUCTASE)" /note="Mb0585, nrdZ, len: 710 aa. Equivalent to 5' end of Rv0570, len: 692 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 631 aa overlap). Probable nrdZ, ribonucleoside-diphosphate reductase, large subunit (EC 1.17.4.-), highly similar to others e.g. NP_070492.1|NC_000917|NRD|AE000988_11 ribonucleotide reductase from Archaeoglobus fulgidus (752 aa), FASTA scores: opt: 2001, E(): 0, (52.5% identity in 562 aa overlap) (N-terminus shorter); U73619|TAU73619_1|T37459 ribonucleotide reductase from Thermoplasma acidophilum (857 aa), FASTA scores: opt: 1678, E(): 0, (43.7% identity in 723 aa overlap); etc. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE LARGE CHAIN FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) leads to a product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb0585 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0585 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248293.1" /translation="MGVSWPAKVRRRDGTLVPFDIARIEAAVTRAAREVACDDPDMPG TVAKAVADALGRGIAPVEDIQDCVEARLGEAGLDDVARVYIIYRQRRAELRTAKALLG VRDELKLSLAAVTVLRERYLLHDEQGRPAESTGELMDRSARCVAAAEDQYEPGSSRRW AERFATLLRNLEFLPNSPTLMNSGTDLGLLAGCFVLPIEDSLQSIFATLGQAAELQRA GGGTGYAFSHLRPAGDRVASTGGTASGPVSFLRLYDSAAGVVSMGGRRRGACMAVLDV SHPDICDFVTAKAESPSELPHFNLSVGVTDAFLRAVERNGLHRLVNPRTGKIVARMPA AELFDAICKAAHAGGDPGLVFLDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGS INLARMLADGRVDWDRLEEVAGVAVRFLDDVIDVSRYPFPELGEAARATRKIGLGVMG LAELLAALGIPYDSEEAVRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSG PRRNAQVTSVAPTGTISLIAGTTAGIEPMFAIAFTRAIVGRHLLEVNPCFDRLARDRG FYRDELIAEIAQRGGVRGYPRLPAEVRAAFPTAAEIAPQWHLRMQAAVQRHVEAAVSK TVNLPATGRSMTSAPSMWPPGRQRSRASRCIATAAGKDRYCPTPRRNRYWRRLTRSSA AAVRAAPASSDGGSHGASRRRIAQNQRF" CDS complement(664729..666060) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0586C" /product="Phosphoribosyl transferase domain protein" /note="Mb0586c, -, len: 443 aa. Equivalent to Rv0571c, len: 443 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 443 aa overlap). Conserved hypothetical protein, highly similar to the products of two adjacent orfs in Mycobacterium leprae: AAA63059.1|U15184|U650S|Q50111 hypothetical protein (258 aa), FASTA scores: opt: 1071, E(): 0, (72.5% identity in 233 aa overlap); and AAA63058.1|U15184|U650T hypothetical protein (86 aa), FASTA scores: opt: 192, E(): 6.4e-06, (70.8% identity in 48 aa overlap). Also similar to others e.g. NP_107072.1|NC_002678 hypothetical protein from Mesorhizobium loti (235 aa); NP_213031.1|NC_000918 hypothetical protein from Aquifex aeolicus (175 aa); etc. And similar to part of hypothetical proteins from Mycobacterium tuberculosis e.g. C-terminus of Rv2143|MTCY270.25c|Z95388|NP_216659.1|NC_000962 (352 aa), FASTA scores: opt: 592, E(): 7e-32, (49.3% identity in 205 aa overlap); N-terminus of Rv2030c|NP_216546.1|NC_000962 (681 aa). Protein product from Mb0586c detected using SWATH mass spectrometry. Mb0586c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248294.1" /translation="MKLFDDRGDAGRQLAQRLAQLSGKAVVVLGLPRGGVPVAFEVAK SLQAPLDVLVVRKLGVPFQPELAFGAIGEDGVRVLNDDVVRGTHLDAAAMDAVERKQL IELQRRAERFRRGRDRIPLTGRIAVIVDDGIATGATAKAACQVARAHGADKVVLAVPI GPDDIVARFAGYADEVVCLATPALFFAVGQGYRNFTQTSDDEVVAFLDRAHRDFAEAG AIDAAADPPLRDEEVQVVAGPVPVAGHLTVPEKPRGIVVFAHGSGSSRHSIRNRYVAE VLTGAGFATLLFDLLTPEEERNRANVFDIELLASRLIDVTGWLATQPDTASLPVGYFG ASTGAGAALVAAADPRVNVRAVVSRGGRPDLAGDSLGSVVAPTLLIVGGRDQVVLELN QRAQAVIPGKCQLTVVPGATHLFEEPGTLEQVAKLACDWFIDHLCGPGPSG" CDS complement(666284..666625) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0587C" /product="HYPOTHETICAL PROTEIN" /note="Mb0587c, -, len: 113 aa. Equivalent to Rv0572c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 113 aa overlap). Hypothetical unknown protein." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248295.1" /translation="MGEHAIKRHMRQRKPTKHPLAQKRGARILVLTDDPRRSVLIVPG CHLDSMRREKNAYYFQDGNALVGMVVSGGTVEYDADDRTYVVQLTDGRHTTESSFEHS SPSRSPQSDDL" CDS complement(666676..666843) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0587CA" /note="unnamed protein product; Mb0587cA, len: 55 aa. No equivalent in M. tuberculosis H37Rv. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions" /protein_id="CAB5248296.1" /translation="MAADPQCTRCKQTIEPGWLYITAHRRGQAGIVDDGAVLIHVPGE CPHPGEHVPRS" CDS complement(667093..668484) /codon_start=1 /transl_table=11 /gene="pncb2" /locus_tag="BQ2027_MB0588C" /product="nicotinic acid phosphoribosyltransferase pncb2" /note="Mb0588c, -, len: 463 aa. Equivalent to Rv0573c, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 463 aa overlap). Conserved hypothetical protein, similar to other conserved hypothetical proteins and some nicotinate phosphoribosyltransferases e.g. NP_213718.1|NC_000918 hypothetical protein from Aquifex aeolicus (426 aa); AL109962|T36953|SCJ1.20 conserved hypothetical protein from Streptomyces coelicolor (438 aa), FASTA scores: opt: 1089, E(): 0, (49.4% identity in 385 aa overlap); P_391053.1|Z99120|BSUB0017_57|NC_000964 protein similar to nicotinate phosphoribosyltransferase from Bacillus subtilis (490 aa), FASTA scores: opt: 955, E():0, (43.5% identity in 356 aa overlap); etc. Also similar to Q10641|Y03F_MYCTU|MTCY130.15c|Rv1330c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (509 aa), FASTA scores: opt: 761, E(): 0, (38.4% identity in 437 aa overlap). Protein product from Mb0588c detected using SWATH mass spectrometry." /protein_id="CAB5248297.1" /translation="MAIRQHVGALFTDLYEVTMAQAYWAERMSGTAVFEIFFRKLPPG RSYIMAAGLADVVEFLEAFRFDEQDLRYLRGLGQFSDEFLRWLAGVRFTGDVWAAPEG TVIFPNEPAVQLIAPIIEAQLVETFVLNQIHLQSVLASKAARVVAAARGRPVVDFGAR RAHGTDAACKVARTSYLAGAAGTSNLLAARQYGIPTFGTMAHSFVQAFDSEVAAFEAF ARLYPATMLLVDTYDTLRGVDHVIELAKRLGNRFDVRAVRLDSGDLDELSKATRARLD TAGLEQVEIFASSGLDENRIAALLAARCPIDGFGVGTQLVVAQDAPALDMAYKLVAYD GSGRTKFSSGKVIYPGRKQVFRKLEHGVFCGDTLGEHGENLPGDPLLVPIMTNGRRIR QHAPTLDGARDWARQQIDALPPELRSLEDTGYSYPVAVSDRIVGELARLRHADTAEAH PGSNVVGAKAKRP" CDS complement(668494..669636) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0589C" /product="Capsule biosynthesis protein CapA" /note="Mb0589c, -, len: 380 aa. Equivalent to Rv0574c, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 380 aa overlap). Conserved hypothetical protein, showing similarity with other hypothetical proteins and polyglutamate synthases (encapsulation proteins) e.g. AAK64444.1|AF377339_5|AF377339 polyglutamate synthase CapA from Myxococcus xanthus (405 aa); M24150|BACCAPABC_3|CapA polyglutamate synthase (encapsulation protein) from B.anthracis (411 aa), FASTA scores: opt: 261, E(): 4.3e-10, (25.8% identity in 287 aa overlap); etc. Mb0589c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248298.1" /translation="MAGKPDVVTVLLGGDVMLGRGVDQILPHPGKPQLRERYMRDATG YVRLAERVNGRIPLPVDWRWPWGEALAVLENTATDVCLINLETTITADGEFADRKPVC YRMHPDNVPALTALRPHVCALANNHILDFGYQGLTDTVAALAGAGIQSVGAGADLLAA RRSALVTVGHERRVIVGSVAAESSGVPESWAARRDRPGVWLIRDPAQRDVADDVAAQV LADKRPGDIAIVSMHWGSNWGYATAPGDVAFAHRLIDAGIDMVHGHSSHHPRPIEIYR GKPILYGCGDVVDDYEGIGGHESFRSELRLLYLTVTDPASGNLISLQMLPLRVSRMRL QRASQTDTEWLRNTIERISRRFGIRVVTRPDNLLEVVPAANLTSKE" CDS complement(669852..670988) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0590C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0590c, -, len: 378 aa. Equivalent to 5' end of Rv0575c, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to many diverse oxidoreductases and monooxygenases e.g. AL109974|SCF34_5|T36404 probable monooxygenase from Streptomyces coelicolor (407 aa), FASTA scores: opt: 786, E(): 0, (38.7% identity in 398 aa overlap); P96555|AB000564 SALICYLATE HYDROXYLASE from SPHINGOMONAS (395 aa), FASTA scores: opt: 267, E():5e-11, (26.4% identity in 390 aa overlap). Also similar to Rv1260|Z77137|MTCY50.22C from Mycobacterium tuberculosis (372 aa), FASTA scores: opt: 762, E(): 0, (40.9% identity in 345 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) leads to a shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv." /protein_id="CAB5248299.1" /translation="MKVAISGAGVAGAALAHWLQRTGHTPTVIERAPKFRTGGYMIDF WGVGYQVAKRMGITDQIAAAGYHMEHVRSVGPTGKVKADLGVDVFRRMVGDDFTSLPR GDLAAAIYTTIEDQVETIFDDSIATIDEHRDGVRLTFERTAPRDFDLVIGADGLHSNV RRLVFGPERDFEHYLGCKVAACVVDGYRPRDERSYVLYNTVDRQLARFALRGDRTMFL FVFRAEHDNPGVAPKDELRDQFGDVGWESRDILAALDDVEDLYFDVVSQIRMDRWSRG RVLLIGDAAGCISLLGGEGTGLAITEAYVLAGELARAGGDHRRAFDAYEKRLRPFIEG KQASAAKFIWFFRHPNPIRPVVSQRCDAHDELRPAGDAVRRQRA" CDS 671091..672395 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0591" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY ARSR-FAMILY)" /note="Mb0591, -, len: 434 aa. Equivalent to Rv0576, len: 434 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 434 aa overlap). Probable transcriptional regulator, ArsR family. N-terminus highly similar to others e.g. NP_102487.1|NC_002678 transcriptional regulator from Mesorhizobium loti (104 aa); NP_242952.1|NC_002570 transcriptional regulator (ArsR family) from Bacillus halodurans (109 aa); etc. C-terminal region (~240-434) shows similarity with D67028_1 from Rhodococcus rhodochrous (112 aa); and Rv0738 from Mycobacterium tuberculosis (182 aa). N-terminus also highly similar to Rv2034 from Mycobacterium tuberculosis (107 aa). Contains helix-turn-helix motif at aa 23-43 (Score 1628, +4.73 SD). Protein product from Mb0591 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0591 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248300.1" /translation="MLEVAAEPTRRRLLQLLAPGERTVTQLASQFTVTRSAISQHLGM LAEAGLVTARKQGRERYYRLDERGVLRLRALMESFWSDELDRLVADAAHYPPSQGDCA MPFEKAVVVPLDPTSTFALITQPDRLRRWMAVAARIELRTGGAYRWTVTPGHSAAGTV IDVDPGKRVVFTWGWEDHGDPPPGGSTVTITLTPVDGGTEVRLVHDGLTAQQAARHAK GWNHFLDRLVVAGQHGDAGPDEWAAAPDPLDELSCAEATLAVLQHVLRGIGASDLTRQ TPCTEYDVSQLADHLLRSLAIIGAAAGAQLAPRDVDAPLETQVADAAQAVMEAWRRRG LAGTVELNSNQVPATVPVGILCLEFLVHAWDFAIATGSQVIASEPVSEYVLAVAGKVI TPATRNSAGFAAPAAVGSFAPVLDRLIAFTGRQPTAGHVSAT" CDS 672409..673194 /codon_start=1 /transl_table=11 /gene="cfp32" /locus_tag="BQ2027_MB0592" /product="27 kDa antigen Cfp30B" /note="Mb0592, TB27.3, len: 261 aa. Equivalent to Rv0577, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). TB27.3, conserved hypothetical protein. Corresponds to O53774|CF30_MYCTU 27 KD ANTIGEN CFP30B from Mycobacterium tuberculosis culture filtrate (260 aa), FASTA scores: opt: 1781, E(): 0, (100.0% identity in 260 aa overlap). Also similar to several hypothetical proteins and hydroxylases from Steptomyces sp. e.g. T35032 probable hydroxylase from Streptomyces coelicolor (263 aa); Q55078 orfA gene product from Streptomyces sp. (275 aa), FASTA scores: E(): 1.5e-1 9, (38.6% identity in 264 aa overlap); D89734_1|P95754 DNA for SgaA SGAA PROTEIN from Streptomyces griseus; and SC9B10_20 from Streptomyces coelicolor (267 aa), FASTA score: (38.9 identity in 252 aa overlap). Also similar to Rv0911|MTCY21C12.05 from Mycobacterium tuberculosis (257 aa), FASTA scores: E(): 1.1e-20, (32.0% identity in 259 aa overlap). Protein product from Mb0592 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0592 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248301.1" /translation="MPKRSEYRQGTPNWVDLQTTDQSAAKKFYTSLFGWGYDDNPVPG GGGVYSMATLNGEAVAAIAPMPPGAPEGMPPIWNTYIAVDDVDAVVDKVVPGGGQVMM PAFDIGDAGRMSFITDPTGAAVGLWQANRHIGATLVNETGTLIWNELLTDKPDLALAF YEAVVGLTHSSMEIAAGQNYRVLKAGDAEVGGCMEPPMPGVPNHWHVYFAVDDADATA AKAAAAGGQVIAEPADIPSVGRFAVLSDPQGAIFSVLKPAPQQ" CDS complement(673239..677159) /codon_start=1 /transl_table=11 /gene="PE_PGRS7" /locus_tag="BQ2027_MB0593C" /product="pe-pgrs family protein pe_pgrs7" /note="Mb0593c, PE_PGRS7, len: 1306 aa. Equivalent to Rv0578c, len: 1306 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1306 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many other PGRS proteins e.g. MTCY493.04|Z95844 from M. tuberculosis (1329 aa), FASTA scores: opt: 3994, E(): 0, (54.6% identity in 1375 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures possibly fortuitously. Mb0593c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248302.1" /translation="MSFVIATPEMLTTAATDLAKIGSTITAANTAAAAVAKVLPASAD EVSVAVAALFGTHAQEYQTVSAQVATFHDRFVQTLSAAASSYVAAEAVNVEQSLLAAV NAPTQALFGRPLIGNGADGSPGTGQAGGPGGILYGNGGNGGSGAPGQRGGAGGAAGLI GNGGNGGAGGVGTTGGAGGHGGAGGWLYGNGGAGGFGGAGAVGGNGGAGGTAGLFGVG GAGGAGGNGIAGVTGTSASTPGGSGTAGGAGGIGGNGGAGGAGGVLMGNGGNGGAGGE GGPGGAGGAGASGAHATNLGADGQAGGNGGNGGAGGTGGVGGPGGGHGLLGLGGSHGA GGAGGSGGDGGAPGDGGNGATGTWGHNLGAGGTGGNGGNPGAGGAGGAGGASVGGSAH GANGAPGTTSTSGGNGGDGGKGADAISSGQTGANGGRGGDGGQVGNGGAGGAGGRGGA GGLGFGSEAPGRPGGAGGTGGAGGNGGTQAGDGGTGGAGGAGGDGGSGGAGSIGFNAS APGAAGSPGGNGGNGGPGGAGGEGGAGGLALAASGQNGSQGAGGDGGAGGNGGTPGNG GHGAAGALGVNGGVGGAGGHGGDPGVGGAGGQGGSGSTPGANGAPGNTPTSGGNGGNG GRGADATGFGQTGASGGRGGDGGLVGNGGAGGAGGNGSKGLPGLGRLGNPGLDGGTGG NGGAGGSGGAWAGNGGTGGAGGTGGVGGTGGSGSDGVNGSSAGADGHPGGTGGVGGTG GKGGDGGDGGAAPNGVAGSQGPGGAGGDGGTGGVGGNGGRGIDGADGATAGARGQDGG AGGAGGKGGRGGTGGPGGAGPAGTTGSQGAGGNGGSGGTGGDPGDGGNGANGSVFTNN GIGGNGGNGGNAGPSGAGGSGGAGSTFGATGSSSSIHVNGGNGGNGGNGDHALSGNGA AGGNGGNGGNGSLRGSGGAGGHGGNGGNASRGMGGDGGTGGAGGNAGQIGNGGAGGNG GDGGTGSDGNPGAITGSGGRGGDGGVGGQGGSVAGDGADGGRGGAGGTGGTGLRGTTG ATGATGTFDAGADGHGGNGGTGGVGGTGGAGGGGGNGGAGGKALSPTGNNGSQGAGGD GGAGGTGGTGGTGGDGGRGAHGTLFSSLAGTGGTGGNGGTGGTGGTGGAGGAGGTGST LGATGATGAAGRAGNGGVGGSGGLGSAFGPGGTGGMGGAGGTSTVSAGGDGGRGGFGG DGLDASSGGNGGDGGHGGDGFRTAGAGGRGGDGGKGADPGGLFPIPGAGGKGGTGGTG GTAHLGPLAIIGQSGQPGQFGSPGADGRGGAGGAGGGGGAGGSF" CDS 677481..678239 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0594" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0594, -, len: 252 aa. Equivalent to Rv0579, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Conserved hypothetical protein, showing some similarity to others e.g. AE001747_4 hypothetical protein from Thermotoga maritima (247 aa), FASTA scores: opt: 612, E(): 0, (39.6% identity in 235 aa overlap); AE001004_2 hypothetical protein from Archaeoglobus fulgidus (159 aa), FASTA scores: opt: 196, E(): 1e-06, (28.3% identity in 159 aa overlap); etc. Mb0594 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248303.1" /translation="MVGYVDVRAYAELNEFVELQARGLTVRRPFRSHQTVKDVLEAMG IPHTEVDLILVNGDPADFSYRPVAGDRIAAYPMFEALDIGSTARLRPAPLRNPRFVVD VNLGQLARLLRLLGFDTRWSSAADDPTLADISLGEQRILLTRDRGLLKRRAITHGLFV HSQHPEEQALEVLRRLDLNGRLAPLSRCLRCNGELAAVSKDEVIGQLEPLTRRYYESF SRCFGCGRIYWPGSHHARLVRLVERLRDQLTTST" CDS complement(678368..678859) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0595C" /product="conserved protein" /note="Mb0595c, -, len: 163 aa. Equivalent to Rv0580c, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, equivalent to AAA90989.1|U20446|MK35 lipoprotein precursor from Mycobacterium kansasii (225 aa). TBparse score is 0.910. Protein product from Mb0595c detected using shotgun mass spectrometry. Mb0595c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248304.1" /translation="MTDQSYAVDIAHPPAALLRLVNPILRSLLHTPLAGPLRTQLMVV SFTGRKTGRHFSIPLSAHVIDNDLYALTEAGWKHNFSDGAAAQVVYDGKTTAMRGELI RDRAVVSELFLRAAQAYGVKRGQRMLGLSFRDRRIPTLEEFAEAVDRLKLVAIRLTPA DNS" CDS 678953..679168 /codon_start=1 /transl_table=11 /gene="vapb26" /locus_tag="BQ2027_MB0596" /product="possible antitoxin vapb26" /note="Mb0596, -, len: 71 aa. Equivalent to Rv0581, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Conserved hypothetical protein, showing weak similarity to several Mycobacterium tuberculosis proteins including P95003|Z83863|Rv2550c|MTCY159_6 CONSERVED HYPOTHETICAL PROTEIN (81 aa), FASTA scores: opt: 93, E(): 3.2, (25.7% identity in 70 aa overlap); Rv2871; Rv1241; etc. Also shows weak similarity to X05648|SGSPH_1 from Streptomyces glaucescens (77 aa), FASTA scores: opt: 92, E(): 3.6, (35.4% identity in 65 aa overlap). Protein product from Mb0596 detected using SWATH mass spectrometry. Mb0596 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248305.1" /translation="MDKTTVYLPDELKAAVKRAARQRGVSEAQVIRESIRAAVGGAKP PPRGGLYAGSEPIARRVDELLAGFGER" CDS 679165..679572 /codon_start=1 /transl_table=11 /gene="vapc26" /locus_tag="BQ2027_MB0597" /product="possible toxin vapc26. contains pin domain." /note="Mb0597, -, len: 135 aa. Equivalent to Rv0582, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Hypothetical unknown protein. Protein product from Mb0597 detected using SWATH mass spectrometry. Mb0597 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248306.1" /translation="MIIDTSALLAYFDAAEPDHAAVSECIDSSADALVVSPYVVAELD YLVATRVGVDAELAVLRELAGGAWELANCGAAEIEQAARIVTKYQDQRIGIADAANVV LADRYRTRTILTLDRRHFSALRPIGGGRFTVIP" CDS complement(679632..680318) /codon_start=1 /transl_table=11 /gene="lpqN" /locus_tag="BQ2027_MB0598C" /product="PROBABLE CONSERVED LIPOPROTEIN LPQN" /note="Mb0598c, lpqN, len: 228 aa. Equivalent to Rv0583c, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). Probable lpqN, conserved lipoprotein, equivalent to AAA90989.1|U20446|MK35|U20446|MKU20446_1 lipoprotein precursor from Mycobacterium kansasii (225 aa), FASTA scores: opt: 945, E(): 0, (62.7% identity in 228 aa overlap); and similar to others from Mycobacteria e.g. Rv0040c and Rv1016c from Mycobacterium tuberculosis. Contains N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0598c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0598c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248307.1" /translation="MKHFTAAVATVALSLALAGCSFNIKTDSAPTTSPTTTSPTTSTT TTSATTSAQAAGPNYTIADYIRDNHIQETPVHHGDPGSPTIDLPVPDDWRLLPESSRA PYGGIVYTQPADPNDPPTIVAILSKLTGDIDPAKVLQFAPGELKNLPGFQGSGDGSAA TLGGFSAWQLGGSYSKNGKLRTVAQKTVVIPSQGAVFVLQLNADALDDETMTLMDAAN VIDEQTTITP" CDS 680472..683105 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0599" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb0599, -, len: 877 aa. Equivalent to Rv0584, len: 877 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 877 aa overlap). Possible conserved exported protein, similar to other hypothetical proteins which are not necessarily secreted e.g. CAB61925.1|AL133278 putative secreted protein from Streptomyces coelicolor (772 aa); AAD51075.1|AF175722_1|AF175722 immunoreactive 89kD antigen PG87 from Porphyromonas gingivalis (781 aa), FASTA scores: opt: 637, E(): 2.1e-30, (29.1% identity in 794 aa overlap); etc. Contains PS00699 Nitrogenases component 1 alpha and beta subunits signature 1. Has potential N-terminal signal peptide. Mb0599 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248308.1" /translation="MRARRLRRALAALLAVAGLFVPFIVGVPTAYDGEPVFVAIPVEH VNTLIGTGTGAAIVGEINNFPGASVPFGMVQYSPDTVDNYAGYDYGNPHSTGFSMTHA SVGCPAFGDISMLPTTTPLGSQPWSAWEEIAHDDTEVGVPGYYTVRFPGTGVIAELTA TTRTGVGRFRYPRNGWPALFHVRSGASLAGNYAATLQIEDNTTITGSATSGGFCGKKN LYTVYFAMKFSQPFSSYGTWDGYAVYPGSHSMNSSYSGGYVGFPAGSVLEVRTALSYV SVDGARANLDAEGGASFDDIRAATSSEWNAALSRIAVAGRGPGDVDTFYTCLYRSLLH PNTFNDVDGRYIGFDGVIHSVASGHTHYANFSDWDTYRSLAPLQGLLFPQRASDMIQS LVTDAEQSGAYPRWALANSATGMMSGDSVVPLIVNLYAFGARDFDLKSALHYMVNAAT QGGVGLDGFLERPGIAAYLRLGYGPQTAEFRANGRIAGASVTLEWSVDDFAISRFADS LGDTATAAVFQNRSQYWQNLFNPTTGYISPRSAAGFFPDGPGFVAYPSGFGQDGYDEG NAEQYLWWVPHNVAGLVTALGGRTAVVKRLDRFTKKLNVGPNEPYLWAGNEPGFGVPW LYNYIGQPWKTQRTVDRVRGLFGPTPGGAPGNDDLGALSSWYVWAALGLYPSTPGTTI LTVNTPLFDRAVIALPTGKSIQITAPGASGRNRLKYIDGLTIDRQPSNQTFLPESIVR TGGDLTFSLAGTPNKVWGTAASAAPPSFGAGSSAVTVNIARPIIGIVPGATGTVTVDA QRMIDGVDDYTVTPTSYVVGIAAEPLSGQFDDDGAVSASVAITVARSVPSGYYPIYVT TSAGDSARTLIVLVVVAEAVE" CDS complement(683128..685515) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0600C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0600c, -, len: 795 aa. Equivalent to Rv0585c, len: 795 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 795 aa overlap). Probable conserved integral membrane protein. C-terminus similar to CAB88984.1|AL353864 putative integral membrane protein from Streptomyces coelicolor (299 aa); and C-terminal region of CAC01311.1|AL390968 putative integral membrane protein from Streptomyces coelicolor (925 aa). Also some similarity with Rv0204 from Mycobacterium tuberculosis. Protein product from Mb0600c detected using SWATH mass spectrometry. Mb0600c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248309.1" /translation="MRVDGRDIGVSGNLLQPLTRRTNDIIRAVLAAIYLVAVITSSLI TRPQWVALEKSISEIVGVLSPSQSDLVYLGYGLAILALPFVILIGLIVSRQWKLLGAY AAAGLMAVLPLSISSSRIAAPRWHFDLSDRLATLLAQFLDDPRWIAILAAVLTVSGPW LPARWRHWWWALLLAFVPIHLVVSAIVPARSLLGLAVGWLVGALVVLVVGTPALEVPL DGAIRALAKRGFAVSGLAVVRPAGPGPLVLSAACEQPNAGACSEALIELYGPHQSGGG ALRQLWLKLTLRGTETAPLQASMRRAVEHRALMAIAFGDLGMANTTVIAVSPLDRGWT LYAHRPARGIGISECTKTTPTAHVWEALRTLHDQQISHGDLCSAEITVDNGAVLFGGF GEAEYGATDAQLQSDLAQLLVTTSALYDAEAAVTAAIDTFGKQAILAASRRLTKSAVP KRIRESITDPNAVIASTRAEVMRQTGADQIKAETITRFSRGQLIQLVLIGALVYVAYP FISTVPTFFSQLRTANWWWALLGLAVSALTYVGAAAALWACADGLVGFWKLSIMQVAN TFAATTTPAGVGGLALSTRFLQKGGLTAVRATAAVALQQSVQVIVHLVLLILFSALAG TSTDLSHFVPNATVLYLIAGVALGIVGTFLFVPKLRRWLATAVRPKLREVTNDLIALA REPKRLALIVLGCAGTTLGAALALWASIEAFGGGTTFVTVTVVTMVGGTLASAAPTPG GVGAVEAALIGGLAAFGVPAALGVPSVLLYRLLTCWLPVFAGWQVMHWLTRHEMI" CDS 685653..686375 /codon_start=1 /transl_table=11 /gene="mce2r" /locus_tag="BQ2027_MB0601" /product="probable transcriptional regulatory protein mce2r (gntr-family)" /note="Mb0601, -, len: 240 aa. Equivalent to Rv0586, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 240 aa overlap). Probable transcriptional regulator, GntR family, similar to many e.g. P33233|LLDR_ECOLI putative L-lactate dehydrogenase operon regulatory protein from Escherichia coli (258 aa), FASTA scores: opt: 225, E(): 9.3e-08, (26.7% identity in 232 aa overlap); etc. Also similar to other M. tuberculosis transcriptional regulators GntR proteins e.g. Rv3060c, Rv0792c, etc. Contains PS00043 Bacterial regulatory proteins, gntR family signature and probable helix-turn helix motif from aa 35-56 (Score 1531, +4.40 SD). Protein product from Mb0601 detected using SWATH mass spectrometry. Mb0601 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248310.1" /translation="MALQPVTRRSVPEEVFEQIATDVLTGEMPPGEALPSERRLAELL GVSRPAVREALKRLSAAGLVEVRQGDVTTVRDFRRHAGLDLLPRLLFRNGELDISVVR SILEARLRNFPKVAELAAERNEPELAELLQDSLRALDTEEDPIVWQRHTLDFWDHVVD SAGSIVDRLMYNAFRAAYEPTLAALTTTMTAAAKRPSDYRKLADAICSGDPTGAKKAA QDLLELANTSLMAVLVSQASRQ" CDS 686372..687169 /codon_start=1 /transl_table=11 /gene="yrbE2A" /locus_tag="BQ2027_MB0602" /product="CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2A" /note="Mb0602, yrbE2A, len: 265 aa. Equivalent to Rv0587, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). yrbE2A, hypothetical unknown integral membrane protein, part of mce2 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa); O53965|Rv1964|MTV051.02|yrbE3A (265 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEA type, e.g. P45392|YRBE_ECOLI hypothetical 27.9 kDa protein from Escherichia coli (260 aa), FASTA scores: opt: 287, E(): 6.1e-12, (21.5% identity in 256 aa overlap); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 311, E(): 1.8e-83, (24.2% identity in 265 aa overlap); NP_302654.1|NC_002677 conserved membrane protein from Mycobacterium leprae (267 aa); etc. Mb0602 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /experiment="experimental evidence, no additional details recorded" /protein_id="CAB5248311.1" /translation="MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWR EAIEQGWFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQLG PLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRVVAATIVAALL NGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEVIISVVKSATFGLIAGLVG CYRGLTTKGGPKGVGTAVNETLVLCVIALFATNVVLTTIGVRFGTGH" CDS 687171..688058 /codon_start=1 /transl_table=11 /gene="yrbE2B" /locus_tag="BQ2027_MB0603" /product="CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2B" /note="Mb0603, yrbE2B, len: 295 aa. Equivalent to Rv0588, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). yrbE2B, hypothetical unknown integral membrane protein, part of mce2 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07413|Rv0168|MTCI28.08|yrbE1B (289 aa); O53966|Rv1965|MTV051.03|yrbE3B (271 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEB type, e.g. P45392|YRBE_ECOLI hypothetical 27.9 kd protein from Escherichia coli (260 aa), FASTA scores: opt: 232, E(): 8.4e-08, (22.1 % identity in 267 aa overlap); P45030|YRBE_HAEIN|HI1086 hypothetical protei from Haemophilus influenzae (261 aa), FASTA scores: opt: 234, E(): 6.3e-08, (24.2% identity in 215 aa overlap); NP_302655.1|NC_002677 conserved membrane protein from Mycobacterium leprae (289 aa); etc. Mb0603 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248312.1" /translation="MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARF TRISVVQIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAIQG FASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGAMRISEEIDAL EVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQVVTTVFYGQSNGTYEHYFR TFLRPEDVGWSVVEVVIIAVVVMITHCYYGYTASGGPVGVGQAVGRSMRFSLVSVVVV VLLAELALYGVDPNFNLTV" CDS 688064..689278 /codon_start=1 /transl_table=11 /gene="mce2A" /locus_tag="BQ2027_MB0604" /product="MCE-FAMILY PROTEIN MCE2A" /note="Mb0604, mce2A, len: 404 aa. Equivalent to Rv0589, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 404 aa overlap). mce2A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa); etc. Also highly similar to others e.g. AAD52105.1|AF113402_1|AF113402 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa); NP_302656.1|NC_002677 putative cell invasion protein from Mycobacterium leprae (441 aa); CAC12798.1|AL445327 putative secreted protein from Streptomyces coelicolor (418 aa); etc. Also highly similar, but longer 21 aa, to P72013|CAA50257.1|X70901|MTCI28.08 Mcep protein from Mycobacterium tuberculosis (432 aa), FASTA scores: opt: 1324, E(): 0, (62.6% identity in 436 aa overlap). Contains a possible N-terminal signal or anchor sequence. Note that previously known as mce2. Mb0604 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248313.1" /translation="MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKT ELTMVASRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISLIP VNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNTLFETITSIAE KVDPIELNATLSAVAQAPDGLGGKFGESIVNGNQILAQLNPRLPQLGYDVRRLADLGE VYVDASPDLWSFLQNALTTARTLTSQQRDLDAALLAATGAGNTGEDVFARGGPYLARA AADLVPTATLLDTYSPELFCMIRNFHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVY PDNLPRVNAHGGPGGRPGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYV WGRQYGENTINP" CDS 689275..690306 /codon_start=1 /transl_table=11 /gene="mce2B" /locus_tag="BQ2027_MB0605" /product="mce-family related protein" /note="Mb0605, mce2B, len: 343 aa. Equivalent to Rv0590 and Rv0590A, len: 275 aa and 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap and 100.0% identity in 84 aa overlap). mce2B: belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07414|Rv0170|MTCI28.10|mce1B (346 aa); O53968|Rv1967|MTV051.05|mce3B (342 aa); etc. Also highly similar to others e.g. NP_302657.1|NC_002677 putative secreted protein from Mycobacterium leprae (346 aa); P45391|YRBD_ECOLI hypothetical 19.6 kd protein from Escherichia coli (183 aa), FASTA scores: opt: 160, E(): 0.00099, (28.3% identity in 166 aa overlap); P45029|YRBD_HAEIN|HI1085 hypothetical protein from Haemophilus influenzae (167 aa), FASTA scores: opt: 135, E():0.035, (25.9% identity in 143 aa overlap); etc. Contains possible N-terminal signal or anchor sequence. Rv0590A: probable continuation of mce2B|Rv0590. Can find no frameshift to account for this. Possible nucleotide G missing at 688793 as there are 5 in Mycobacterium bovis but only 4 in CDC1551. Strong similarity to C-terminus of other Mce proteins e.g. AL583926|AL583926_38 from Mycobacterium leprae strain TN (346 aa), FASTA scores: E(): 1.2e-20, (67.85% identity in 84 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mce2B and Rv0590A are 2 genes, most likely to be linked. In Mycobacterium bovis, an in-frame insertion of a single base (*-g) leads to a single product. Mb0605 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248314.1" /translation="MKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTGYSAVFT HVSGLRAGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLSLDQATTASIRYLNL IGDRYLELGRGHSGQRLAPGATIPLEHTHPALDLDALLGGFRPLFQTLDPDKVNSIAS SIITVFQGQGATINDILDQTASLTATLADRDHAIGEVVNNLNTVLATTVKHQTEFDRT VDKLEVLITGLKNRADPLAAAAAHISSAAGTLADLLGADRPLLHSSFGHLEGIQQPLI DELAELDHVLGKLPDAYRIIGRAGGIYGDFFNFYLCDISLKVNGLQPGGPVRTVKLFG QPTGRCTPQ" CDS 690303..691748 /codon_start=1 /transl_table=11 /gene="mce2C" /locus_tag="BQ2027_MB0606" /product="MCE-FAMILY PROTEIN MCE2C" /note="Mb0606, mce2C, len: 481 aa. Equivalent to Rv0591, len: 481 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 481 aa overlap). mce2C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07415|R0171|MTCI28.11|mce1C (515 aa); O53969|Rv1968|MTV051.06|mce3C (410 aa); etc. Also highly similar to others e.g. NP_302658.1|NC_002677 putative secreted protein from Mycobacterium leprae (519 aa); CAC12796.1|AL445327 putative secreted protein from Streptomyces coelicolor (351 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and may contain N-terminal signal or anchor sequence. Has highly Pro-rich C-terminus. Mb0606 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248315.1" /translation="MRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYA QFADMGGINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAAIRTDTI LGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAATGWDIDAVKRSLNV LSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQLLANANRIARVLGDRSEQVNGLL VNAKTLLAAFKQRSQALRILLTNVSEASAQVSGLITDNPNLNHVLAQLRTVSEELVKR KNELADVAVLLGRYTAALTEAVGSGPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPEN FWRSAGLPEFRWPDPNGTRFPNGAPPAAPPVREGTPKHPGPAVPPGTPCSYTPAAGAL PRPDNPLPCAGATVGPFGGPDFPAPLDVQPSPPNPDGPPPTPGILSAGRPGEPAPAVP GIPMPLPPNAPPGARTQPLEPFPDGTGGSNQ" CDS 691745..693181 /codon_start=1 /transl_table=11 /gene="mce2Da" /locus_tag="BQ2027_MB0607" /product="MCE-FAMILY PROTEIN MCE2DA [FIRST PART]" /note="Mb0607, mce2Da, len: 478 aa. Equivalent to 5' end of Rv0592, len: 508 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 471 aa overlap). mce2D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O53970|Rv1969|MTV051.07|mce3D (423 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Has highly Pro-rich C-terminus and may contain N-terminal signal or anchor sequence. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mce2D exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits mce2D into 2 parts, mce2Da and mce2Db. Mb0607 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248316.1" /translation="MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKL TTTTVVAYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPATAT ASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRDSINGILRQLG PTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALTALNEGRGDFVAITRSLAL FVSALYQNDQQFVALNENLAEFTDWFTKSDHDLADTVERIDDVLGTVRKFVSDNRSVL AADVNNLADATTTLVQPEPRDGLETALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFA NPIQLICSAIQAGSRLGYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEV AYSEERLRPPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP ESLAELLGGPDIAPPAAGNQLARTAECV" CDS 693129..693272 /pseudo /codon_start=1 /transl_table=11 /gene="mce2Db" /locus_tag="BQ2027_MB0608" /note="Mb0608, mce2Db, len: 47 aa. Equivalent to 3' end of Rv0592, len: 508 aa, from Mycobacterium tuberculosis strain H37Rv, (95.7% identity in 47 aa overlap). mce2D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information),highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O53970|Rv1969|MTV051.07|mce3D (423 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Has highly Pro-rich C-terminus and may contain N-terminal signal or anchor sequence. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mce2D exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits mce2D into 2 parts, mce2Da and mce2Db.;MCE-FAMILY PROTEIN MCE2DB [SECOND PART]" CDS 693269..693835 /codon_start=1 /transl_table=11 /gene="lprL" /locus_tag="BQ2027_MB0609" /product="POSSIBLE MCE-FAMILY LIPOPROTEIN LPRL (MCE-FAMILY LIPOPROTEIN MCE2E)" /note="Mb0609, lprL, len: 188 aa. Equivalent to 5' end of Rv0593, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 188 aa overlap). Possible lprL (alternate gene name: mce2E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07417|LPRK|Rv0173|MTCI28.13|mce1E (390 aa); O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa); etc. Also highly similar to others e.g. NP_302660.1|NC_002677 putative lipoprotein from Mycobacterium leprae (392 aa); CAC12794.1|AL445327 putative secreted protein from Streptomyces coelicolor (413 aa); etc. Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (c-t) introducing a stop codon, leads to the formation of a truncated product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (188 aa versus 402 aa). Mb0609 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248318.1" /translation="MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLT SCTWRGIANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISLRN WIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLKSGDTIGLKNS SAYPTVERTLASVALILTGGGIVNLDVI" CDS 694482..696032 /codon_start=1 /transl_table=11 /gene="mce2F" /locus_tag="BQ2027_MB0610" /product="MCE-FAMILY PROTEIN MCE2F" /note="Mb0610, mce2F, len: 516 aa. Equivalent to Rv0594, len: 516 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 516 aa overlap). mce2F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), similar to Mycobacterium tuberculosis proteins O07418|Rv0174|MTCI28.14|mce1F (515 aa); O53972|Rv1971|MTV051.09|mce3F (437 aa); etc. Also highly similar to others e.g. NP_302661.1|NC_002677 putative secreted protein from Mycobacterium leprae (516 aa); AAF74993.1|AF143400_1|AF143400|996A027a protein from Mycobacterium avium (80 aa) (similarity on C-terminus); CAC12793.1|AL445327 putative secreted protein from Streptomyces coelicolor (433 aa); etc. Contains possible N-terminal signal or anchor sequence. Mb0610 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248319.1" /translation="MLTRAIKTQLVLLTVLAVIAVVVLGWYFLRIPSLVGIGRYTLYA ELPRSGGLYRTANVTYRGITIGKVTGVEPTERGARATMSIDNGYQIPTDASANVHSVS AVGEQFVDLVSTRTSGPYLRHGQTITTTTVPSQIGPALDAANRGLAVLPKDRVASVLH EASEAVGGLGSSLNRLIEATQAIAHDVRGSLEDIDDIIERSAPIIDSQVNSGNEIARW AANLNTLAAQTAQTDPAVRSILANAAPTADQVNATFSDVRESLPQTLANLEVVIDMLK RYHNGVEQALVFLPQSGAIAQSVTTEFPGQAGLGVGGLALNQPPPCLTGFLPASEWRS PADTSTAPLPKGTYCRIPMDASNVVRGARNNPCVDVPGKRAATPRECRSNEAYVPGGT NPWYGDPNQMLSCPAPAARCDQPVKPGQVIPAPSVNNGINPLPADQLPGTPPPVNDPL QRPGSGTVQCNGQQPNPCVYTPSTFPTTIYDVQSGKVVAPDGVVYSVEASTHAGADGW KVMLAPTG" CDS complement(696084..696476) /codon_start=1 /transl_table=11 /gene="vapc4" /locus_tag="BQ2027_MB0611C" /product="possible toxin vapc4" /note="Mb0611c, -, len: 130 aa. Equivalent to Rv0595c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Conserved hypothetical protein, similar to other conserved hypothetical proteins e.g. Rv0627 (135 aa) and Rv0665 (112 aa) from Mycobacterium tuberculosis; and STBB_PSESM|Q52562 plasmid stability protein from Pseudomonas syringae (139 aa), FASTA scores: opt: 131, E(): 0.0035, (35.2% identity in 88 aa overlap). Protein product from Mb0611c detected using SWATH mass spectrometry. Mb0611c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248320.1" /translation="MNVRRALADTSVFIGIEATRFDPDRFAGYEWGVSVVTLGELRLG VLQASGPEAAARRLSTYQLAQRFEPLGIDEAVSEAWALLVSKLRAAKLRVPINDSWIA ATAVAHGIAILTQDNDYAAMPDVEVITI" CDS complement(696473..696730) /codon_start=1 /transl_table=11 /gene="vapb4" /locus_tag="BQ2027_MB0612C" /product="possible antitoxin vapb4" /note="Mb0612c, -, len: 85 aa. Equivalent to Rv0596c, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein, highly similar in part to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv0626, Rv3181c, Rv3385c, Rv3407, etc." /protein_id="CAB5248321.1" /translation="MSATIPARDLRNHTAEVLRRVAAGEEIEVLKDNRPVARIVPLKR RRQWLPAAEVIGELVRLGPDTTNLGEELRETLTQTTDDVRW" CDS complement(696913..698148) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0613C" /product="ATPase" /note="Mb0613c, -, len: 411 aa. Equivalent to Rv0597c, len: 411 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 411 aa overlap). Conserved hypothetical protein, highly similar to Rv3179 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (429 aa). Also similar to AAF76191.1|AF271296_1|AF271296 putative ATP/GTP binding protein from Mycobacterium smegmatis (428 aa); Rv2008c|YW09_MYCTU|Q10849 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (441 aa), FASTA scores: opt: 270, E(): 3.6e-11, (30.5% identity in 416 aa overlap) (N-terminus longer). Also similar to other hypothetical proteins e.g. NP_085874.1|NC_002679 hypothetical protein from Mesorhizobium loti (435 aa) (N-terminus longer). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0613c detected using SWATH mass spectrometry. Mb0613c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248322.1" /translation="MGVVERAIAPSVLAALADTPVVVVNGARQVGKTTLVARLDYPGS SEVVSLDDVANRDAARDDPRAFVSRPVDTLVIDEAQLEPGLFRAIKAEVDRDRRPGRF LLTGSARLLSAPDMADALVGRVEIIELWPFSQGERAGIADGFVDALFTAPRELIHGSD MRRADLVDRIATGGFPDIVARSPSRRRAWFDNYLTTATQSVIREISPIERLAEMPRVL RLCAARTGAELNVSALANDLSIPARTTAGYLALLEAAFLIHRVPAWSTNLSRKVIRRP KLVVSDSGLACHLLGVTGATLDRPGRPLGPLLETFVANEIRKQLTWSTERPSLWHFRD RGGAEVDLVLEHPDGRVCGIEVKATSTPRAEDLRGLRYLAERLDDRFQFGVLLTAAPE ATPFGPTLAALPVSTLWAG" CDS complement(698399..698812) /codon_start=1 /transl_table=11 /gene="vapc27" /locus_tag="BQ2027_MB0614C" /product="possible toxin vapc27. contains pin domain." /note="Mb0614c, -, len: 137 aa. Equivalent to Rv0598c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Conserved hypothetical protein; similar to Rv2596|Y0B5_MYCTU|Q50625 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (134 aa), FASTA scores: opt: 254, E(): 8.2e-12, (41.5% identity in 130 aa overlap). Protein product from Mb0614c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0614c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /protein_id="CAB5248323.1" /translation="MKPPLAVDTSVAIPLLVRTHTAHAAVVAWWAHREAALCGHALAE TYSVLTRLPRDLRLAPMDAARLLTERFAAPLLLSSRTTEHLPRVLAQFEITGGAVYDA LVALAAAEHRAELATRDARAKDTYEKIGVHVVVAA" CDS complement(698809..699045) /codon_start=1 /transl_table=11 /gene="vapb27" /locus_tag="BQ2027_MB0615C" /product="possible antitoxin vapb27" /note="Mb0615c, -, len: 78 aa. Equivalent to Rv0599c, len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 78 aa overlap). Conserved hypothetical protein, similar to Rv2595|Y0B6_MYCTU|Q50626 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (81 aa), FASTA scores: opt: 160, E(): 6.2e-07, (35.8% identity in 81 aa overlap). N-terminus shows stong similarity with N-terminus of NP_104908.1|NC_002678 hypothetical protein from Mesorhizobium loti (89 aa). Protein product from Mb0615c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0615c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248324.1" /translation="MKAVVDAAGRIVVPKPLREALGLQPGSTVEISRYGAGLHLIPTG RTARLEEENGVLVATGETTIDDEVVFGLIDSGRK" CDS complement(699149..699655) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0616C" /product="two component sensor kinase [second part]" /note="Mb0616c, -, len: 168 aa. Equivalent to Rv0600c, len: 168 aa (probable partial CDS), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Probable two-component sensor kinase (second part) (EC 2.7.3.-), similar to part (C-termini) of many others e.g. Q04943|AFQ2_STRCO sensor protein afsq2 from Streptomyces coelicolor (535 aa), FASTA scores: opt: 347, E(): 1.9e-12, (33.0% identity in 206 aa overlap); etc. Note that sequence was checked and no errors were detected, which would allow this and the upstream ORF to be joined." /protein_id="CAB5248325.1" /translation="MPITPLLHESVARFAATGADITTRAEPDLFVSIDPDHLRRILTA VLDNAITHGDGEIAVTAHARDGAVDIGVRDHGPGFADHFLPVAFDRFTRADTARGGRG SGLGLAIVAALTTTHGGHANATNHPDGGAELRITLPTPRPPFHEELPRITSSDTKDPN REHDTSDQ" CDS complement(699769..700239) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0617C" /product="two component sensor kinase [first part]" /note="Mb0617c, -, len: 156 aa. Equivalent to Rv0601c, len: 156 aa (probable partial CDS), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Probable two-component sensor kinase (first part) (EC 2.7.3.-), similar to part (N-termini) of others e.g. Q0375|CUTS_STRLI cuts protein from streptomyces lividans (414 aa), FASTA scores: opt: 230, E(): 3.1e-08, (39.1% identity in 115 aa overlap). Note that the sequence was checked and no errors were detected that would allow this and the downstream ORF to be joined." /protein_id="CAB5248326.1" /translation="MALVLAAAGAVTVVQFRDAAHEADPDGALRGLTDDITADLVREL VTILPIVLVIAAVAAYLLSRAALRPVDRIRAAAQTLTTTPHPDTDAPLPVPPTDDEIA WLATTLNTMLTRLQRALAHEQQFVADASHELRTPLALLTTELELRCAGPDPPTS" CDS complement(700283..701044) /codon_start=1 /transl_table=11 /gene="tcrA" /locus_tag="BQ2027_MB0618C" /product="two component dna binding transcriptional regulatory protein tcra" /note="Mb0618c, tcrA, len: 253 aa. Equivalent to Rv0602c, len: 253 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 253 aa overlap). Probable tcrA, two-component DNA-binding response regulator, highly similar to others e.g. NP_107959.1|NC_002678 two-component response regulator from Mesorhizobium loti (239 aa); etc. Also similar to many other Mycobacterium tuberculosis two-component regulators e.g. Q50806|MTCY10G2.16|Rv1033c RESPONSE REGULATOR HOMOLOG TRCR (TCRV) (257 aa), FASTA score: (47.4 identity in 232 aa overlap); etc." /protein_id="CAB5248327.1" /translation="MADETTMRAGRGPGRACGRVSGVRILVIEDEPKMTALLARALTE EGHTVDTVADGRHAVAAVDGGDYDAVVLDVMLPGIDGFEVCARLRRQRVWTPVLMLTA RGAVTDRIAGLDGGADDYLTKPFNLDELFARLRALSRRGPIPRPPTLEAGDLRLDPSE HRVWRADTEIRLSHKEFTLLEALIRRPGIVHTRAQLLERCWDAAYEARSNIVDVYIRY LRDKIDRPFGVTSLETIRGAGYRLRKDGGRHALPR" CDS 701101..701412 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0619" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb0619, -, len: 103 aa. Equivalent to Rv0603, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (98.1% identity in 103 aa overlap). Possible exported protein with hydrophobic stretch at aa 7-29." /protein_id="CAB5248328.1" /translation="MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRAR AAAVQAVPGGTAGEVETETGEGAAAYGVLVTRADGTRVEVHLDRDFRVLDTKPADGDG G" CDS 701484..702434 /codon_start=1 /transl_table=11 /gene="lpqO" /locus_tag="BQ2027_MB0620" /product="PROBABLE CONSERVED LIPOPROTEIN LPQO" /note="Mb0620, lpqO, len: 316 aa. Equivalent to Rv0604, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 316 aa overlap). Probable lpqO, conserved lipoprotein, highly similar to Rv2999|lppY PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (321 aa), FASTA scores: opt: 1153, E(): 0, (53.2% identity in 312 aa overlap). Contains probable N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0620 detected using SWATH mass spectrometry. Mb0620 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="CAB5248329.1" /translation="MIRRRGARMAALLAAAALALTACAGSDDKGEPDDGGDRGASLAT TSDADWKPVADILGRTGKLNDGSVYKIGFARSDLSVQTKGVTVAPALSLGSWVAFART PDGQTMLMGDLVVTEDELASVTDAVQAGGLQQTALHKHLLEQSPPIWWTHIAGHGDAA DLARAVRSALDATDTPPPAPATSGQTSLDLDTAAIDEALGRSGTIAGGVYKFFIARRD PVTMSGMLIPPSMGLATALNFQPTGNGRAAINGDFVMTAAEVQDVVQALRGGGIDIVA IHNHGFDEQPRLFYMHFWAENDAVALARTLRAAVDATAAR" mobile_element 702629..704012 /mobile_element_type="insertion sequence:IS1536" /locus_tag="BQ2027_IS1536" /note="IS1536, len: 1384 nt. Equivalent to IS1536, len: 1384 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1384 nt overlap). Partial copy of IS_1536" gene 702629..704012 /locus_tag="BQ2027_IS1536" CDS 702651..703259 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0621" /product="POSSIBLE RESOLVASE" /note="Mb0621, -, len: 202 aa. Equivalent to Rv0605, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Possible resolvase for IS_Y349 element, similar to several Mycobacterial hypothetical proteins and weakly similar to Q52563 resolvase from Pseudomonas syringae (210 aa), FASTA scores: opt: 99, E(): 3.1, (35.7% identity in 98 aa overlap). Contains PS00397 Site-specific recombinases active site and probable helix-turn helix motif from aa 9-30 (Score 1815, +5.37 SD). Protein product from Mb0621 detected using SWATH mass spectrometry. Mb0621 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVY1" /db_xref="InterPro:IPR006118" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="InterPro:IPR041718" /db_xref="UniProtKB/TrEMBL:A0A1R3XVY1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99217.1" /translation="MACCRNRGMNLAAWAERNGVARVTAYRWFHAGLLPVPARKVGRL ILVDELASEAGAQPKTAVYARVSSADQKSDLDRQVARVTSWATAEQIPVDKVVTEVGS VLNGHRRKFPAVLRDLSVTRIVVEHRDRFCRFGSEYVHAALAAQGRELVVVDSAEVDD DLVWDMTEILTSMCARLYGKRAAQNRAKRAVAAAAVDDHEAA" CDS 703261..704004 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0622" /product="possible transposase (fragment)" /note="Mb0622, -, len: 247 aa. Equivalent to Rv0606, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Possible truncated transposase for IS_1536 element, highly similar to N-terminus of other transposases from Mycobacterium tuberculosis e.g. YX16_MYCTU|Q10809|Rv2885c|MT2953|MTCY274 .16c PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (460 aa), FASTA scores: opt: 1368, E(): 0, (83.5% identity in 237 aa overlap); Rv2978c, Rv0922, Rv3827c, etc. Also similar to N-terminus of MTV002_57|Rv2792 RESOLVASE from M. tuberculosis (193 aa), FASTA score: (87.4% identity in 238 aa overlap). Mb0622 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021027" /db_xref="UniProtKB/TrEMBL:A0A1R3XVV9" /protein_id="SIT99218.1" /translation="MPRLEIPNGWCVQAFRFTLDPTAEQAHALARHFGARRKAYNWTV AQLKADIQAWRATGAQTAKPSLRVLRKRWNTVKDEVCVNAETGTVWWPECSKEAYADG IAGAVDAYWNWQQRRAGKRDGKRMGFPRFKKKGRDADRVSFTTGAMRVEPDRRHLTLP VIGCVRTHENTRRIERLIAKDRARVLAITVRRNGTRLDASVRVLVQRPQQPNVELPES RIGVDVGVRRLATVATADGACCPVLVPDG" CDS 704058..704444 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0623" /product="HYPOTHETICAL PROTEIN" /note="Mb0623, -, len: 128 aa. Equivalent to Rv0607, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Hypothetical unknown protein. Mb0623 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XVV1" /protein_id="SIT99219.1" /translation="MGAWQTADTMGIFQALPDVWGGWRTECWEDRFEEQLIRCNGALR LPELDLAAGMDSAREWLRDRIFQRFSDSPAGQILKLSELLADVGPGLVVSDDAVTNGG ARPNNEEWARFVAACDLVRGAHAESA" CDS 704489..704734 /codon_start=1 /transl_table=11 /gene="vapb28" /locus_tag="BQ2027_MB0624" /product="possible antitoxin vapb28" /note="Mb0624, -, len: 81 aa. Equivalent to Rv0608, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Conserved hypothetical protein, similar to several other Mycobacterium tuberculosis hypothetical short proteins e.g. Rv0623|P96913|MTCY20H10.04 (84 aa), FASTA scores: opt: 159, E(): 1.2e-09, (43.0% identity in 86 aa overlap); Rv2760c (89 aa); Rv1740 (70 aa), etc. Mb0624 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011660" /db_xref="UniProtKB/TrEMBL:A0A1R3XVU9" /protein_id="SIT99220.1" /translation="MALNIKDPSVHQAVKQIAKITGESQARAVATAVNERLARLRSDD LAARLLAIGHKTASRMSPEAKRLDHDALLYDERGLPA" CDS 704731..705132 /codon_start=1 /transl_table=11 /gene="vapc28" /locus_tag="BQ2027_MB0625" /product="possible toxin vapc28. contains pin domain." /note="Mb0625, -, len: 133 aa. Equivalent to Rv0609, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Conserved hypothetical protein, similar to several Mycobacterium tuberculosis hypothetical proteins e.g. YW37_MYCTU|Q10874|Rv1982c|MT2034|MTCY39.37 CONSERVED HYPOTHETICAL PROTEIN (139 aa), FASTA scores: opt: 262, E(): 8.1e-12, (39.1% identity in 128 aa overlap); MTCY20H10.05|Rv0624|MT0652|MTCY20H10.05 CONSERVED HYPOTHETICAL PROTEIN (131 aa), FASTA score: (42.9% identity in 126 aa overlap), Rv0565c, Rv3854c, etc. Mb0625 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67239" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P67239" /protein_id="SIT99221.1" /translation="MIVDTSAIIAILRDEDDAAAYADALANADVRRLSAASYLECGIV LDSQRDPVISRALDELIEEAEFVVEPVTERQARLARAAYADFGRGSGHPAGLNFGDCL SYALAIDRREPLLWKGNDFGHTGVQRALDRR" CDS 705129..705302 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0626" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0626, -, len: 75 aa. Equivalent to Rv0609A, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 75 aa overlap). Conserved hypothetical protein, highly similar to part of upstream ORF Rv0612|MTCY19H5.09c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (201 aa), FASTA scores: opt: 154, E(): 1.8e-05, (74.3% identity in 35 aa overlap). Mb0626 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XY04" /protein_id="SIT99222.1" /translation="MIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPGNA TGSGRLPKPLRHS" CDS complement(705720..705917) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0626CA" /note="unnamed protein product; Mb0626cA, len: 65 aa. No equivalent in M. tuberculosis H37Rv. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions,Mb0626cA found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3XWW3" /protein_id="SIT99223.1" /translation="MTDEKCVRCGGDQLVEGAVVWNAPLRFKREGAGHFNRGTQVNAV ACETCGHIDLYLESRARGSTK" CDS complement(705997..707154) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0627C" /product="HYPOTHETICAL PROTEIN" /note="Mb0627c, -, len: 385 aa. Equivalent to Rv0610c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 385 aa overlap). Hypothetical unknown protein. Mb0627c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XVZ8" /protein_id="SIT99224.1" /translation="MDDELRGLLARYARGELSADDARRAILRYPKWRVAEIDGELETV ALDDGTPMLIAESSASDGREYSGLELVRDIAPLVGGLSFDPDEPWGSAFRPGALPELQ SWARTVELEDAVAKPGPGQRDLLYEGPWWVAVSPGTGRPAVHRADGLDVITIMTAPDA AATFRRTERHRGLDVVRLGPALWGDLAKRSDFDGVRLNPLRPLAQLWPPHVPAMLVAG CDPRPNAEPLPARTVAEIHLWLDQHGARQEKRELSNRATPVGEVTVARAWWNYDRREI AFTRVAPASDTEGLGSVPSRILCAGKLRQSIQSKLAGLPRLTWRADAWHRQRAALAVG WALELEKLVCGERVPFAALRTPEGAHLWHLEPQAFTARAIRKLRDRAASFR" CDS complement(707206..707589) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0628C" /product="HYPOTHETICAL PROTEIN" /note="Mb0628c, -, len: 127 aa. Equivalent to Rv0611c, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 127 aa overlap). Hypothetical unknown protein. Note that first start has been taken although this overlaps slightly with the upstream ORF. Mb0628c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR032568" /db_xref="UniProtKB/TrEMBL:A0A1R3XWB7" /protein_id="SIT99225.1" /translation="MPDRPQHPTASRQSSMVSWNHGAAGWLHCVQCGSATNPTACLDW LPPIHARSGPMYAEHDVVVLTRDVPDKSLIAGDVGAVVGRYAAGGYEVDFTAANGCTV AVVTLAGDDIRPRRRREIPHVREVA" CDS 707569..708174 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0629" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0629, -, len: 201 aa. Equivalent to Rv0612, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Conserved hypothetical protein, highly similar, but in part, to downstream ORF Rv0609A CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (75 aa); and showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis. Note that first start has been taken although this overlaps slightly with the upstream ORF. Mb0629 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XVV6" /protein_id="SIT99226.1" /translation="MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDED RLRKALWNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLAR SGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADG YDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAIPP" CDS complement(708193..710760) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0630C" /product="unknown protein" /note="Mb0630c, -, len: 855 aa. Equivalent to Rv0613c, len: 855 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 855 aa overlap). Hypothetical unknown protein. Contains a very short region with strong similarity to several preprotein translocases e.g. P47847|SECA_LISMO preprotein translocase seca subunit (836 aa), FASTA scores: opt: 138, E(): 0.18, (38.6% identity in 70 aa overlap, and 72.7% identity in 22 aa overlap). Protein product from Mb0630c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0630c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004027" /db_xref="UniProtKB/TrEMBL:A0A1R3XVZ0" /protein_id="SIT99227.1" /translation="MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRA LRLETEWPARQLVDDRWVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEH EEYGRLADGSAARIVLAGYDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLV GVRLTAAGLVLERIGTAGADTSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPV APLREILDQHGLTHEDDWLAPGGFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKL HETMSLLLEATDPDELPRDVLATAAETATETGSDSLVDLLGDIGAALADPLLAELLVA ETVGTDSGGAAALGLLTEMLEPKVPRAARVAVRWLRAVALDRIGDVEAAERELLAAES MDTEWPLPLLDLARIASDRGDAERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRN EACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSGDWTGLLAEVSYERFRYA DSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVF EVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIEP VALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGI QGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLAT LTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIRDYE TSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDADRLRTALGL" CDS 710601..711593 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0631" /product="putative membrane protein" /note="Mb0631, -, len: 330 aa. Equivalent to Rv0614, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Conserved hypothetical protein, similar in part to Mycobacterium tuberculosis hypothetical proteins e.g. YY16_MYCTU|Q10685|Rv2077c|MT2137|MTCY49.16c CONSERVED HYPOTHETICAL PROTEIN (323 aa), FASTA scores: opt: 200, E(): 0.00016, (28.3% identity in 269 aa overlap); MTCY9F9_15 FASTA score: (40.3% identity in 144 aa overlap), Rv1949c, Rv2542, etc. Several start sites are possible; first start has been chosen. Note that this ORF overlaps with the upstream ORF." /db_xref="UniProtKB/TrEMBL:A0A1R3XVW8" /protein_id="SIT99228.1" /translation="MPAIPFQGEARAGRRPGRPRRCPAGVVRCRPRSMGHVRPGFSPR LGSHRTLRPRWPPYAAASRGLTSGTSRWGWPRLGFGVVTAPTRWTLADGRELLFFSLP GPRTSGTAAERVARHAQAQTFAGDIRQRAIQLVVSEQEVASKITAATAGIATTTFPET PSIDDTIIGNDNRDTGVRLVDVKQDGGTSPPPPFAPWDTPDGTPPPGTGLSPTLQQMI LGGDPANLTGQGLADNVQRFVQSLPANDPNTAWLRGQVADLQAHVADIEYARTHCSTN DWIDRTAQFASGAIVFSIGVLTAETGAGVVAAAAGGVGAATAGVSLLQCLVGSK" CDS 711590..711832 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0632" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb0632, -, len: 80 aa. Equivalent to Rv0615, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 80 aa overlap). Probable integral membrane protein. Mb0632 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVW0" /db_xref="UniProtKB/TrEMBL:A0A1R3XVW0" /protein_id="SIT99229.1" /translation="MMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGL LVVTGQTLMAISVAFLVALGGPLVVVNHRRAERSRG" CDS complement(711829..712095) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0633C" /product="HYPOTHETICAL PROTEIN" /note="Mb0633c, -, len: 88 aa. Equivalent to Rv0616c, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XVW1" /protein_id="SIT99230.1" /translation="MRIPGNRQCLLVQVLRQVDGSAHRLILTSLHRDARADAHRYSNG TDHAGRAADEPAETAHEPCWVAARGLASQASRAMSATYRPSSFI" CDS 712027..712254 /codon_start=1 /transl_table=11 /gene="vapB29" /locus_tag="BQ2027_MB0633A" /product="Possible antitoxin VapB29" /note="Mb0633A, len: 75 aa. Equivalent to Rv0616A len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 75 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB29, antitoxin,part of toxin-antitoxin (TA) operon with Rv0617, see Arcus et al. 2005. Similar to many others in M. tuberculosis e.g. Rv2530A (74 aa) 35.9% identity in 78 aa overlap,Mb0633A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XVV4" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/TrEMBL:A0A1R3XVV4" /protein_id="SIT99231.1" /translation="MRTTIDLPQDLHKQALAIARDTHRTLSETVADLMRRGLAANRPT ALSSDPRTGLPLVSVGTVVTSEDVRSLEDEQ" CDS 712251..712652 /codon_start=1 /transl_table=11 /gene="vapc29" /locus_tag="BQ2027_MB0634" /product="possible toxin vapc29. contains pin domain." /note="Mb0634, -, len: 133 aa. Equivalent to Rv0617, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv2494, Rv3320c, Rv0749, Rv0277c, Rv2530c, etc. Protein product from Mb0634 detected using SWATH mass spectrometry. Mb0634 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY12" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XY12" /protein_id="SIT99232.1" /translation="MTVLLDANVLIALVVAEHVHHDAAADWLMASDTGFATCPMTQGS LVRFLVRSGQSAAAARDVVSAVQCTSRHEFWPDALSFAGVEVAGVVGHRQVTDAYLAQ LARSHDGQLATLDSGLAHLHGDVAVLIPTTT" CDS 712781..713965 /codon_start=1 /transl_table=11 /gene="galT" /locus_tag="BQ2027_MB0635" /product="probable galactose-1-phosphate uridylyltransferase galtb [second part]" /note="Mb0635, galT, len: 394 aa. Equivalent to Rv0618 and Rv0619, len: 231 aa (probable partial CDS) and 181 aa (probable partial CDS), from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 219 aa overlap and 99.4% identity in 174 aa overlap). Rv0618: Probable galT', first part of galactose-1-phosphate uridylyltransferase (EC 2.7.7.10), highly similar to N-terminal half of other galT proteins e.g. P13212|GAL7_STRLI galactose-1-phosphate uridylyltransferase from Streptomyces lividans (354 aa), FASTA scores: opt: 296, E(): 1.4e-11, (50.8% identity in 177 aa overlap); etc. Also highly similar to N-terminal half of some UDP glucose--hexose-1-phosphate uridylyltransferases (EC 2.7.7.12). N-terminal 28 aa similar to MTCY20H11.08|Rv0627|MTCY20H11.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (135 aa), FASTA score: (71.4% identity in 28 aa overlap). BELONGS TO THE GALACTOSE-1-PHOSPHATE URIDYLYLTRANSFERASE >FAMILY 1. Rv0619: Probable 'galT, second part of galactose-1-phosphate uridylyltransferase (EC 2.7.7.10), highly similar to C-terminal half of other galT proteins e.g. P13212|GAL7_STRLI galactose-1-phosphate uridylyltransferase from Streptomyces lividans (354 aa), FASTA scores: opt: 416, E(): 5.2e-22, (43.0% identity in 186 aa overlap), etc. BELONGS TO THE GALACTOSE-1-PHOSPHATE URIDYLYLTRANSFERASE FAMILY 1. Cosmid sequence is correct but there may be a frameshift mutation in this region which would allow the two ORFS to be joined. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, galT is split into 2 parts, galT' and 'galT. In Mycobacterium bovis, an in-frame insertion of a single base (*-a) leads to a single product. Protein product from Mb0635 detected using SWATH mass spectrometry. Mb0635 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWX5" /db_xref="InterPro:IPR001937" /db_xref="InterPro:IPR005849" /db_xref="InterPro:IPR005850" /db_xref="InterPro:IPR019779" /db_xref="InterPro:IPR036265" /db_xref="UniProtKB/TrEMBL:A0A1R3XWX5" /protein_id="SIT99233.1" /translation="MSATPPPGGLDASVFIANERGRQLDEALPVGFCVVTAPTRWTLA DGRDLLFFSLPGHVPAPVSDRRPLPERDPAPSRLRFDRATGQWVIVAAQRQDRTYKPP AARCPLCPGPTGLSSEVPAPDYDVVVFENRFPSLAGAGIAPIGAPDGDGFVSAPGHGR CEVICFSADHTGSFAGLDPAHAGLVVHAWRHRTAELTALPGVAQVFCFENRGEEIGVT LTHPHGQIYAYPYLTPRTAAMLRQARRHRKRHGDNLFASLLAREVADGSRIVVRGELF TAFVPFAARWPVEVHIYPNRLVRNLTELNDGELDEFARIYLDVLQRFDRMYSSPLPYM SALHQFSEVQRDGYFHVELMSIRRSATKLKYLAAAESAMDAFIADVIPESVAARLREL GP" CDS 713962..715053 /codon_start=1 /transl_table=11 /gene="galK" /locus_tag="BQ2027_MB0636" /product="PROBABLE GALACTOKINASE GALK (GALACTOSE KINASE)" /note="Mb0636, galK, len: 363 aa. Equivalent to Rv0620, len: 363 aa, from Mycobacterium tuberculosis strain H37RV, (99.7% identity in 362 aa overlap). Probable galK, galactokinase (EC 2.7.1.6), similar to others e.g. P13227|GAL1_STRLI GALACTOKINASE from Streptomyces lividans (397 aa); P06976|GAL1_ECOLI galactokinase from Escherichia coli (381 aa), FASTA scores: opt: 669, E(): 0, (35.9% identity in 365 aa overlap); etc. Contains PS00106 Galactokinase signature and PS00560 Serine carboxypeptidases, histidine active site. BELONGS TO THE GHMP KINASE FAMILY. GALK SUBFAMILY. Protein product from Mb0636 detected using SWATH mass spectrometry." /db_xref="GOA:Q7U1L7" /db_xref="InterPro:IPR000705" /db_xref="InterPro:IPR006204" /db_xref="InterPro:IPR006206" /db_xref="InterPro:IPR013750" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR019539" /db_xref="InterPro:IPR019741" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR022963" /db_xref="InterPro:IPR036554" /db_xref="UniProtKB/Swiss-Prot:Q7U1L7" /protein_id="SIT99234.1" /translation="MTVSYGAPGRVNLIGEHTDYNLGFALPIALPRRTVVTFTPEHTG AITARSDRADGSARIPLDTTPGQVTGWAAYAAGAIWALRGAGHPVPGGAMSITSDVEI GSGLSSSAALIGAVLGAVGAATGTRIDRLERARLAQRAENDYVGAPTGLLDHLAALFG APKTALLIDFRDITVRPVAFDPDACDVVLLLMDSRARHRHAGGEYALRRASCERAAAD LGVSSLRAVQDRGLAALGAIADPIDARRARHVLTENQRVLDFAAALADSDFTAAGQLL TASHESMREDFAITTERIDLIAESAVRAGALGARMTGGGFGGAVIALVPADRARDVAD TVRRAAVTAGYDEPAVSRTYAAPGAAECC" CDS 715448..716605 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0637" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0637, -, len: 385 aa. Equivalent to Rv0621, len: 354 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 354 aa overlap). Possible membrane protein; contains potential membrane spanning regions. Also contains PS00017 ATP/GTP-binding site motif A (P-loop). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (a-g) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (385 aa versus 354 aa). Mb0637 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWC6" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC6" /protein_id="SIT99235.1" /translation="MAGDRGADPGPANVTPGADDHAQHASPTVLCPQGHVNAWDYRFC ERCGSPIGVVPWPSEESGTRQTAPARSFVPLVVLAATLLVVAVVVTAVGYAVTRPARN DREEPSSARGAATTGVPFAQAEAASCPDDPVLEAESIDLTSDGLAVSAAFMSACAGGD VESNSALEVTVADGRRDVAAGSFDFSADPLRIEPGVPARRTLVFPPGMYWRTPDMLSG APALAATRKGRSDRSAARGGSARTTMVAAASAAPAYGSINAVAGAVLVELRDSDFPYV RVGIANRWVPQVSSKRVGLVAAGKTWTSADILRDHLALRQRFGGARLVWSGHWTTFSG PDFWVTVVGPAQPTAAEANRWCDSNGFGADDCFAKFISTLVGAKGTTVYRK" CDS 716616..717563 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0638" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0638, -, len: 315 aa. Equivalent to Rv0622, len: 315 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 315 aa overlap). Possible membrane protein; contains potential membrane spanning region. Shows weak similarity with Mycobacterium tuberculosis hypothetical proteins Rv1804c, Rv1810, etc." /db_xref="GOA:A0A1R3XVW5" /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3XVW5" /protein_id="SIT99236.1" /translation="MSFCVYCGAELADPTRCGACGAYKIGSTWHRTTTPTVGAATTAT GWRPDPTGRHEGRYFVAGQPTDLVREGDAEAVDPLGQQQLDQSGAVGVSPSAVSGWVR SGHRRLWWALAGVVAFLGLVGAGVVGTLFLNRDRESIDDKYLAALRRSGLTGEFNSDA NAIARGKQVCRQLQDGGEQQGMPVDQVAVQYYCPQFSDGFHILETITVTGSFTLKDES PNVYAPAITVSGSGCSGSAGYADIDRGTQVTVKNGQGDILATAFLQAGQGGRFLCTFP FSFEITEGEDRYVVSVSRRGEMSYSFADLKANGLSLVLG" CDS 717656..717910 /codon_start=1 /transl_table=11 /gene="vapb30" /locus_tag="BQ2027_MB0639" /product="possible antitoxin vapb30" /note="Mb0639, -, len: 84 aa. Equivalent to Rv0623, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Conserved hypothetical protein, highly similar to NP_384911.1|NC_003047 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (84 aa). Also similar to several Mycobacterium tuberculosis hypothetical proteins e.g MTCY28_2|Rv1740|MTCY28.02|MTCY04C12.25 CONSERVED HYPOTHETICAL PROTEIN (70 aa), FASTA score: (73.5% identity in 68 aa overlap); MTCY4C12_25|Rv0608|MTCY19H5.14c CONSERVED HYPOTHETICAL PROTEIN (81 aa), FASTA score: (73.5 identity in 68 aa overlap); etc. Mb0639 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011660" /db_xref="UniProtKB/TrEMBL:A0A1R3XW02" /protein_id="SIT99237.1" /translation="MALSIKHPEADRLARALAARTGETLTEAVVTALRERLARETGRA RVVPLRDELAAIRHRCAALPVVDNRSAEAILGYDERGLPA" CDS 717910..718305 /codon_start=1 /transl_table=11 /gene="vapc30" /locus_tag="BQ2027_MB0640" /product="possible toxin vapc30. contains pin domain." /note="Mb0640, -, len: 131 aa. Equivalent to Rv0624, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, highly similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1741, Rv0609, Rv2759c,Rv0565c, Rv3854c, Rv3083, etc. Mb0640 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67241" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P67241" /protein_id="SIT99238.1" /translation="MVIDTSALVAMLSDEPDAERFEAAVEADHIRLMSTASYLETALV IEARFGEPGGRELDLWLHRAAVDLVAVHADQADAARAAYRTYGKGRHRAGLNYGDCFS YGLAKISGQPLLFKGEDFQHTDIATVALP" CDS complement(718399..719139) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0641C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0641c, -, len: 246 aa. Equivalent to Rv0625c, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Probable conserved transmembrane protein, showing similarity with others e.g. CAB61866.1|AL133252 putative integral membrane protein from Streptomyces coelicolor (249 aa). Also similar to Rv1491c|MTCY277_13 from Mycobacterium tuberculosis. Contains potential membrane spanning regions. Mb0641c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67116" /db_xref="InterPro:IPR015414" /db_xref="InterPro:IPR032816" /db_xref="UniProtKB/Swiss-Prot:P67116" /protein_id="SIT99239.1" /translation="MSTHNDSAPTSRRRHIVRLVVFAGFLVGMFYLVAATDVIDVAAV RGAVSATGPAAPLTYVVVSAVLGALFVPGPILAASSGLLFGPLVGVFVTLGATVGTAV VASLVGRRAGRASARALLGGERADRTDALIERCGLWAVVGQRFVPGISDAFASYAFGT FGVPLWQMAVGAFIGSAPRAFAYTALGAAIGDRSPLLASCAIAVWCVTAIIGAFAARH GYRQWRAHARGDGADGGVEDPDREVGAR" CDS 719271..719531 /codon_start=1 /transl_table=11 /gene="vapb5" /locus_tag="BQ2027_MB0642" /product="possible antitoxin vapb5" /note="Mb0642, -, len: 86 aa. Equivalent to Rv0626, len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv0596c, Rv3385c, Rv3407,Rv3181c, etc. Mb0642 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3XVW9" /protein_id="SIT99240.1" /translation="MSEVASRELRNDTAGVLRRVRAGEDVTITVSGRPVAVLTPVRPR RRRWLSKTEFLSRLRGAQADPGLRNDLAVLAGDTTEDLGPIR" CDS 719528..719935 /codon_start=1 /transl_table=11 /gene="vapc5" /locus_tag="BQ2027_MB0643" /product="possible toxin vapc5" /note="Mb0643, -, len: 135 aa. Equivalent to Rv0627, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv0595c and Rv0665. Protein product from Mb0643 detected using SWATH mass spectrometry. Mb0643 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVX8" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XVX8" /protein_id="SIT99241.1" /translation="MSTTPAAGVLDTSVFIATESGRQLDEALIPDRVATTVVTLAELR VGVLAAATTDIRAQRLATLESVADMETLPVDDDAARMWARLRIHLAESGRRVRINDLW IAAVAASRALPVITQDDDFAALDGAASVEIIRV" CDS complement(720007..721158) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0644C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0644c, -, len: 383 aa. Equivalent to Rv0628c, len: 383 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 383 aa overlap). Conserved hypothetical protein, highly similar to Rv0874c|YZ02_MYCTU|Q10536 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (386 aa), FASTA scores: opt: 2082, E(): 0, (81.5% identity in 383 aa overlap). Also some similarity to P72543|SPU62616_1 HYPOTHETICAL PROTEIN from Synechococcus, FASTA scores: E(): 2.8e-28, (36.6 identity in 265 aa overlap). Mb0644c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64730" /db_xref="InterPro:IPR013702" /db_xref="InterPro:IPR016741" /db_xref="InterPro:IPR019494" /db_xref="UniProtKB/Swiss-Prot:P64730" /protein_id="SIT99242.1" /translation="MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSH TDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDF VRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGR RRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGG RPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGA IGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMF GVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD" CDS complement(721251..722978) /codon_start=1 /transl_table=11 /gene="recD" /locus_tag="BQ2027_MB0645C" /product="PROBABLE EXONUCLEASE V (ALPHA CHAIN) RECD (EXODEOXYRIBONUCLEASE V ALPHA CHAIN) (EXODEOXYRIBONUCLEASE V POLYPEPTIDE)" /note="Mb0645c, recD, len: 575 aa. Equivalent to Rv0629c, len: 575 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 575 aa overlap). Probable recD, exonuclease V, alpha chain (exodeoxyribonuclease V, alpha chain) (EC 3.1.11.5), highly similar to other exonucleases e.g. AF157643_3|AAD46809.1|recD Escherichia coli RecD protein homolog from Mycobacterium smegmatis (554 aa); P04993|EX5A_ECOLI|B2819 exodeoxyribonuclease v 67kd polypeptide (EC 3.1.11.5) (EXONUCLEASE V ALPHA CHAIN) from Escherichia coli strain K12 (608 aa), FASTA scores: opt: 512, E(): 1.9e-24, (36.9% identity in 582 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). CONSIST OF THREE SUBUNITS; RECB|Rv0630c, RECC|Rv0631c AND RECD. Protein product from Mb0645c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XWY4" /db_xref="InterPro:IPR006344" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR027785" /db_xref="UniProtKB/TrEMBL:A0A1R3XWY4" /protein_id="SIT99243.1" /translation="MKLTDVDFAVEASGMVRAFNQAGVLDVSDVHVAQRLCALAGESD ERVALAVAVAVRALRAGSVCVDLLSIARVAGHDDLPWPDPADWLAAVRASPLLADPPV LHLYDDRLLYLDRYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGFDEQRRAAEIA LSQGVTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPRIALAAPTGKAAARLAEAV RREMAKLDATDRARLGDLHAVTLHRLLGAKPGARFRQDRQNRLPHNVIVVDETSMVSL TLMARLAEAVRPGARLILVGDADQLASVEAGAVLADLVDGFSVRDDALVAQLRTSHRF GKVIGTLAEAIRAGDGDAVLGLLRSGEERIEFVDDEDPAPRLRAVLVPHALRLREAAL LGASDVALATLDEHRLLCAHRDGPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLV TANDYGLRVYNGDTGVVLAGPTGLRAVISGASGPLDVATGRLGDVETMHAMTIHKSQG SQVDEVTVLMPQEDSRLLTRELLYTAVARAKRKVRVVGSEASVRAAIARRAVRASGLR MRLQSTGCG" CDS complement(722975..723562) /codon_start=1 /transl_table=11 /gene="recBb" /locus_tag="BQ2027_MB0646C" /product="PROBABLE EXONUCLEASE V (BETA CHAIN) RECBB [SECOND PART] (EXODEOXYRIBONUCLEASE V BETA CHAIN)(EXODEOXYRIBONUCLEASE V POLYPEPTIDE)" /note="Mb0646c, recBb, len: 234 aa. Equivalent to 3' end of Rv0630c, len: 1094 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Probable recB, exonuclease V, beta chain (exodeoxyribonuclease V, beta chain) (EC 3.1.11.5), highly similar to other exonucleases e.g. AF157643_2|recB|AAD46808.1 Escherichia coli RecB protein homolog from Mycobacterium smegmatis (1083 aa); P08394|EX5B_ECOLI|RORA|B2820 exodeoxyribonuclease v 135 kd polypeptide (EC 3.1.11.5) (EXONUCLEASE V BETA CHAIN) from Escherichia coli strain K12 (1180 aa), FASTA scores: opt: 289, E(): 4.3e-11, (29.5 identity in 1059 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE HELICASE FAMILY, UVRD SUBFAMILY. CONSIST OF THREE SUBUNITS; RECB, RECC|Rv0631c AND RECD|Rv0629c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, recB exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits recB into 2 parts, recBa and recBb. Protein product from Mb0646c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XW18" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR004586" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR011604" /db_xref="UniProtKB/TrEMBL:A0A1R3XW18" /protein_id="SIT99244.1" /translation="MRQIGVRDRLRELDFEMPLAGGDLRGRSPDVSLADVGELLASHL PGDDPLSPYADRLGSAGLGDQPLRGYLAGSIDVVLRLPGQRYLVVDYKTNHLGDTAAD YGFERLTEAMLHSDYPLQALLYVVVLHRFLRWRQRDYAPARHLGGVLYLFVRGMCGAA TPVTAGHPAGVFTWNPPTALVVALSDLLDRGRLQS" CDS complement(723565..726258) /codon_start=1 /transl_table=11 /gene="recBa" /locus_tag="BQ2027_MB0647C" /product="PROBABLE EXONUCLEASE V (BETA CHAIN) RECBA [FIRST PART] (EXODEOXYRIBONUCLEASE V BETA CHAIN)(EXODEOXYRIBONUCLEASE V POLYPEPTIDE)" /note="Mb0647c, recBa, len: 897 aa. Equivalent to 5' end of Rv0630c, len: 1094 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 868 aa overlap). Probable recB, exonuclease V, beta chain (exodeoxyribonuclease V, beta chain) (EC 3.1.11.5), highly similar to other exonucleases e.g. AF157643_2|recB|AAD46808.1 Escherichia coli RecB protein homolog from Mycobacterium smegmatis (1083 aa); P08394|EX5B_ECOLI|RORA|B2820 exodeoxyribonuclease v 135 kd polypeptide (EC 3.1.11.5) (EXONUCLEASE V BETA CHAIN) from Escherichia coli strain K12 (1180 aa), FASTA scores: opt: 289, E(): 4.3e-11, (29.5 identity in 1059 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE HELICASE FAMILY, UVRD SUBFAMILY. CONSIST OF THREE SUBUNITS; RECB, RECC|Rv0631c AND RECD|Rv0629c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, recB exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits recB into 2 parts, recBa and recBb." /db_xref="GOA:A0A1R3XWD4" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR004586" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR011604" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD4" /protein_id="SIT99245.1" /translation="MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAA TLDEMLLITFNRAASRELRERVRGQIVEALGALQGDAPPSGELVEHLLRGSDAERAQK RSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTDLVTEIVDDRY LANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEPGSKAAVRLRFAAEVLEEL ERRKGRLRAQGFNDLLIRLATALEAADSPARDRMRERWRIVLVDEFQDTDPMQWRVLE RAFSRHSALILIGDPKQAIYGFRGGDIHTYLKAAGTADARYTLGVNWRSDRALVESLQ TVLRDATLGHADIVVRGTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEA LRRHIPDDLAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAI YTGDTDVFASQAAKDWLCLLEAFDAPQRSGLVRAAACTMFFGETAESLAAEGDALTDR VAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERDLTDLAHIAQLLHEAAH RERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDAAAVQIMTVFVAKGLQFPIVYLPFA FNRNVRSDDILLYHDDGTRCLYIGGKDGGAQRRTVEGLNRVEAAHDNLRLTYVALTRA QSQVVAWWAPTFDEVNGGLSRLLRGRRPGQSQVPDRCTPRVTDEQAWAVFAQWEAAGG PSVEESVIGARSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPA AGGRADEVEIAVFAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPDLAAELE AQVPGTRRGGPWTSTTRSWLPNWPERCCRCTTRRWDPPPPH" CDS complement(726258..729551) /codon_start=1 /transl_table=11 /gene="recC" /locus_tag="BQ2027_MB0648C" /product="PROBABLE EXONUCLEASE V (GAMMA CHAIN) RECC (EXODEOXYRIBONUCLEASE V GAMMA CHAIN)(EXODEOXYRIBONUCLEASE V POLYPEPTIDE)" /note="Mb0648c, recC, len: 1097 aa. Equivalent to Rv0631c, len: 1097 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1097 aa overlap). Probable recC, exonuclease V, gamma chain (exodeoxyribonuclease V, gamma chain) (EC 3.1.11.5), highly similar to other exonucleases e.g. AF157643_1|RecC|AAD46807.1 Escherichia coli RecC protein homolog from Mycobacterium smegmatis (1085 aa); P07648|EX5C_ECOLI|B2822 exodeoxyribonuclease v 125 kd polypeptide (EC 3.1.11.5) (EXONUCLEASE V GAMMA CHAIN) from Escherichia coli strain K12 (1122 aa), FASTA scores: opt: 954, E(): 0, (29.2% identity in 1109 aa overlap); etc. CONSIST OF THREE SUBUNITS; RECB|Rv0630c, RECC AND RECD|Rv0629c. Protein product from Mb0648c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XVX6" /db_xref="InterPro:IPR006697" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR013986" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041500" /db_xref="UniProtKB/TrEMBL:A0A1R3XVX6" /protein_id="SIT99246.1" /translation="MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVER WLSQRLSLVLGCGPGRADGVCAGIAFRNPQSLIAEITGTLDDDPWSPEALAWPLLAVI DASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYARQRPGLLAAW LDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIARLRDGPADLPARLSLFGH TRLACTDVQLLDALAVHHDLHLWLPHPSDELWRALAGFQGADGLLPRRQDTSRRAAQH PLLETLGRDVRELQRALPAARATDEFLGATTKPDTLLGWLQADIAGNAPRPAERSLSD ADRSVQVHACHGPARQIDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFG LGEVAGDCHPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPV RAKFGFADDDLDTITTWVRESNIRWGFDPTHRRRYGLDTVVHNTWRFGLDRILTGVAM SEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGLSGARPLVAWLDALATG IDLLTACNDGWQRAQVQREFADVLARAGSRAAPLLRLPDVRALLDAQLAGRPTRANFR TGTLTVCTMVPMRSVPHRVVCLVGLDDGVFPRLSHPDGDDVLAREPMTGERDIRSEDR QLLLDAIGAATQTLVITYTGADERTGQPRPPAVPLAELLDALDQTTSAPVRERILVTH PLQPFDRKNVTPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVT LADLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLRDMLRGL HLDDAAHSEWRRGTLPPGRLGVRRAKEIRNRARDLAAAALAHRDGHGQAHDVDVDLGD GRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGLVTLAAQEPGREWSALCIGRSKT RNHIARRLFVPPPDPVAVLRELVLLYDAGRREPLPLPLKTSCAWAQARRDGQDPYPPA RECWQTNRFRPGDDDAPAHVRAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWL PLLAAEGSV" CDS complement(729828..730523) /codon_start=1 /transl_table=11 /gene="echA3" /locus_tag="BQ2027_MB0649C" /product="PROBABLE ENOYL-COA HYDRATASE ECHA3 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0649c, echA3, len: 231 aa. Equivalent to Rv0632c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Probable echA3, enoyl-CoA hydratase (EC 4.2.1.17), almost identical to the MTU88877_1 enoyl-coA hydratase of Mycobacterium tuberculosis field isolate NTI64719, FASTA score: (92.4% identity in 184 aa overlap). Also similar to others e.g. P24162|ECHH_RHOCA enoyl-CoA hydratase from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (257 aa), FASTA scores: opt: 206, E(): 6.3e-07, (31.5% identity in 232 aa overlap); etc. Protein product from Mb0649c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0649c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW12" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XW12" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99247.1" /translation="MSDPVSYTRKDSIAVISMDDGKVNALGPAMQQALNAAIDNADRD DVGALVITGNGRVFSGGFDLKILTSGEVQPAIDMLRGGFELAYRLLSYPKPVVMACTG HAIAMGAFLLSCGDHRVAAHAYNIQANEVAIGMTIPYAALEIMKLRLTRSAYQQATGL AKTFFGETALAAGFIDEIALPEVVVSRAEEAAREFAGLNQHAHAATKLRSRADALTAI RAGIDGIAAEFGL" CDS complement(730572..731402) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0650C" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb0650c, -, len: 276 aa. Equivalent to Rv0633c, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 279 aa overlap). Possible exported protein; has hydrophobic stretch at aa 23-41. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp deletion (cgggtgcgc-*) leads to shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (276 aa versus 279 aa). Protein product from Mb0650c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0650c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVY6" /db_xref="UniProtKB/TrEMBL:A0A1R3XVY6" /protein_id="SIT99248.1" /translation="MVDSMGWVLSSWHEVTGVDSGTWLAWAALGLGVVALVVTKRQIQ RNRRLAAEQTRPYVAMFMEPHVADWHVIELVVRNFGRTAAYDVRFSFPNPPTVAQYEN AANGYADVVELRLPQELPMLAPGQEWRMVWDSALDRAEIGRGIESRFPGTVTYYDRPE QPRRWRFWRRGRRPLETKVVLDWDALPPVARIELMTTHDLAKREKQKLELLRSLLTYF HYASKETRPDVFRSEIDRINRAAAETQDRWRARQVEVPTEVSQRSEGQGPQPTRIPAG " CDS complement(731556..732269) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0651C" /product="POSSIBLE GLYOXALASE II (HYDROXYACYLGLUTATHIONE HYDROLASE) (GLX II)" /note="Mb0651c, -, len: 237 aa. Equivalent to Rv0634c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 237 aa overlap). Possible glyoxalase II (EC 3.1.2.6), equivalent to NP_302290.1|NC_002677 putative glyoxylase II from Mycobacterium leprae (238 aa); and similar to U00011_3|Y0BK_MYCLE|Q49649 hypothetical 23.9 kd protein from Mycobacterium leprae (218 aa), FASTA scores: opt: 281, E(): 3.9e-12, (31.8% identity in 201 aa overlap). Also similar to other glyoxalases and metallo-beta-lactamase family proteins e.g. NP_386770.1|NC_003047 PUTATIVE HYDROXYACYLGLUTATHIONE HYDROLASE from Sinorhizobium meliloti (256 aa); etc. Also similar to other putative glyoxylases from Mycobacterium tuberculosis e.g. Rv1637c. BELONGS TO THE GLYOXALASE II FAMILY. COFACTOR: BINDS TWO ZINC IONS. Protein product from Mb0651c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0651c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XVX3" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3XVX3" /protein_id="SIT99249.1" /translation="MSKDRLYFRQLLSGRDFAVGDMFATQMRNFAYLIGDRTTGDCVV VDPAYAAGDLLDALESDDMQLSGVLVTHHHPDHVGGSMMGFQLPGLAELLERASVPVH VNTHEALWVSRVTGIPVGDLITHEHGDKVSVGDIDIELLHTPGHTPGSQCFLLDGRLV AGDTLFLEGCGRTDFPGGDSDEMYRSLRQLAELPGDPTVFPGHWYSAEPSASLSEVKR SNYVYRPASLDQWRMLMGG" CDS 732349..732600 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0652" /product="Transcription regulator of the Arc/MetJ class" /note="Mb0652, -, len: 83 aa. Equivalent to Rv0634A, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Hypothetical unknown protein. Protein product from Mb0652 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0652 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3XVX5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99250.1" /translation="MGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVN LALRTLLGEADTAEHGHDDEYDEFSDPNAWVPRRSRDTG" tRNA 732730..732802 /locus_tag="BQ2027_THRT" /product="tRNA-Thr" /note="thrT, len: 73 nt. Equivalent to thrT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Thr, anticodon ggt." tRNA 732839..732912 /locus_tag="BQ2027_METT" /product="tRNA-Met" /note="metT, len: 74 nt. Equivalent to metT, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Met, anticodon cat." CDS 732948..733115 /codon_start=1 /transl_table=11 /gene="rpmG2" /locus_tag="BQ2027_MB0653" /product="50s ribosomal protein l33 rpmg2" /note="Mb0653, rpmG2, len: 55 aa. Equivalent to Rv0634B, len: 55 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 55 aa overlap). Probable rpmG2, 50S ribosomal protein L33. Note that Mycobacterium tuberculosis has a second rpmG gene: P96925|R33H_MYCTU|Rv2057c|MTCY63A.03|rpmG1 PUTATIVE 50S RIBOSOMAL PROTEIN L33 (55 aa), FASTA scores: opt: 391, E(): 2.9e-25, (100.0% identity in 55 aa overlap). BELONGS TO THE L33P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0653 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0653 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5W3" /db_xref="InterPro:IPR001705" /db_xref="InterPro:IPR011332" /db_xref="InterPro:IPR018264" /db_xref="InterPro:IPR038584" /db_xref="UniProtKB/Swiss-Prot:P0A5W3" /protein_id="SIT99251.1" /translation="MASSTDVRPKITLACEVCKHRNYITKKNRRNDPDRLELKKFCPN CGKHQAHRETR" CDS 733166..733642 /codon_start=1 /transl_table=11 /gene="hadA" /locus_tag="BQ2027_MB0654" /product="(3r)-hydroxyacyl-acp dehydratase subunit hada" /note="Mb0654, -, len: 158 aa. Equivalent to Rv0635, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Conserved hypothetical protein, equivalent to NP_302287.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (159 aa); and highly similar to YV31_MYCLE|P54879 conserved hypothetical protein from Mycobacterium leprae (166 aa), FASTA scores: opt: 387, E(): 5.9e-21, (43.4% identity in 145 aa overlap). Also similar CAB77410.1|AL160431|SCD82.07 hypothetical protein from Streptomyces coelicolor (150 aa). And highly similar to two hypothetical proteins from Mycobacterium tuberculosis: Rv0504c|YV31_MYCTU|Q11168 (166 aa), FASTA scores: opt: 405, E(): 3.2e-22, (45.0% identity in 140 aa overlap); and Rv0637|MTY20H10_19 (2 ORFs downstream) (166 aa), FASTA score: (48.7% identity in 150 aa overlap). Protein product from Mb0654 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0654 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016709" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR039569" /db_xref="UniProtKB/Swiss-Prot:Q7U1K4" /protein_id="SIT99252.1" /translation="MALSADIVGMHYRYPDHYEVEREKIREYAVAVQNDDAWYFEEDG AAELGYKGLLAPLTFICVFGYKAQAAFFKHANIATAEAQIVQVDQVLKFEKPIVAGDK LYCDVYVDSVREAHGTQIIVTKNIVTNEEGDLVQETYTTLAGRAGEDGEGFSDGAA" CDS 733629..734057 /codon_start=1 /transl_table=11 /gene="hadB" /locus_tag="BQ2027_MB0655" /product="(3r)-hydroxyacyl-acp dehydratase subunit hadb" /note="Mb0655, -, len: 142 aa. Equivalent to Rv0636, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 142 aa overlap). Conserved hypothetical protein, equivalent to NP_302286.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (142 aa). Also highly similar to CAB77411.1|AL160431|SCD82.08 hypothetical protein from Streptomyces coelicolor (142 aa); and similar to others e.g. U28943|CELE04F6_3 from Caenorhabditis elegans (cosmid E04) (298 aa), FASTA scores: opt: 167, E(): 0.00064, (31.6 identity in 117 aa overlap). Protein product from Mb0655 detected using shotgun mass spectrometry. Mb0655 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002539" /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99253.1" /translation="MALREFSSVKVGDQLPEKTYPLTRQDLVNYAGVSGDLNPIHWDD EIAKVVGLDAAIAHGMLTMGIGGGYVTSWVGDPGAVTEYNVRFTAVVPVPNDGKGAEL VFNGRVKSVDPESKSVTIALTATTGGKKIFGRAIASAKLA" CDS 734061..734561 /codon_start=1 /transl_table=11 /gene="hadC" /locus_tag="BQ2027_MB0656" /product="(3r)-hydroxyacyl-acp dehydratase subunit hadc" /note="Mb0656, -, len: 166 aa. Equivalent to Rv0637, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 166 aa overlap). Conserved hypothetical protein, equivalent to NP_302285.1|NC_002677|YV31_MYCLE|P54879 conserved hypothetical protein from Mycobacterium leprae (166 aa), FASTA scores: opt: 352, E(): 4e-19, (39.2% identity in 148 aa overlap); and highly similar to others from Mycobacterium leprae e.g. NP_302287.1|NC_002677 conserved hypothetical protein (159 aa). Also highly similar to CAB77410.1|AL160431|SCD82.07 hypothetical protein from Streptomyces coelicolor (150 aa); Rv0635|NP_215149.1|NC_000962|MTY20H10_17 conserved hypothetical protein (two ORFs upstream) from Mycobacterium tuberculosis (158 aa), FASTA score: (49.3% identity in 150 aa overlap); and Rv0504c|NP_215018.1|NC_000962|YV31_MYCTU|Q11168 hypothetical protein from Mycobacterium tuberculosis (166 aa), FASTA scores: opt: 380, E(): 3.8e-21, (43.1% identity in 137 aa overlap). Protein product from Mb0656 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0656 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016709" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR039569" /db_xref="UniProtKB/Swiss-Prot:Q7U1K2" /protein_id="SIT99254.1" /translation="MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEA AADLGYDALVAPLTFVTILAKYVQLDFFRHVDVGMETMQIVQVDQRFVFHKPVLAGDK LWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGSARLKWDKESG QVIRTA" tRNA 734760..734832 /locus_tag="BQ2027_TRPT" /product="tRNA-Trp" /note="trpT, len: 73 nt. Equivalent to trpT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Trp, anticodon cca." CDS 734973..735458 /codon_start=1 /transl_table=11 /gene="secE1" /locus_tag="BQ2027_MB0657" /product="PROBABLE PREPROTEIN TRANSLOCASE SECE1" /note="Mb0657, secE1, len: 161 aa. Equivalent to Rv0638, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Probable secE1, preprotein translocase (tail-anchored membrane protein), highly similar at C-terminal half to others e.g. P36690|SECE_STRGR PREPROTEIN TRANSLOCASE SECE SUBUNIT from Streptomyces griseus (86 aa), FASTA scores: opt: 220, E(): 4.6e-06, (35.4% identity in 96 aa overlap); P16920|SECE_ECOLI preprotein translocase sece subunit from Escherichia coli strains K12 and O157:H7 (127 aa), FASTA scores: opt: 122, E(): 0.34, (37.0% identity in 54 aa overlap); etc. Contains PS01067 Protein secE/sec61-gamma signature. BELONGS TO THE SECE/SEC61-GAMMA FAMILY. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732. Note that previously known as secE. Protein product from Mb0657 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0657 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5Z1" /db_xref="InterPro:IPR001901" /db_xref="InterPro:IPR005807" /db_xref="InterPro:IPR038379" /db_xref="UniProtKB/Swiss-Prot:P0A5Z1" /protein_id="SIT99255.1" /translation="MSDEGDVADEAVADGAENADSRGSGGRTALVTKPVVRPQRPTGK RSRSRAAGADADVDVEEPSTAASEATGVAKDDSTTKAVSKAARAKKASKPKARSVNPI AFVYNYLKQVVAEMRKVIWPNRKQMLTYTSVVLAFLAFMVALVAGADLGLTKLVMLVF G" CDS 735490..736206 /codon_start=1 /transl_table=11 /gene="nusG" /locus_tag="BQ2027_MB0658" /product="PROBABLE TRANSCRIPTION ANTITERMINATION PROTEIN NUSG" /note="Mb0658, nusG, len: 238 aa. Equivalent to Rv0639, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 238 aa overlap). Probable nusG, transcription antitermination protein, equivalent to NP_302283.1|NC_002677 transcription antitermination protein nusG from Mycobacterium leprae (228 aa). Also highly similar to others e.g. P36260|NUSG_STRGR from Streptomyces griseus (294 aa), FASTA scores: opt: 845, E(): 0, (55.4% identity in 233 aa overlap); etc. Note that shorter at the N-terminus than other nusG. Contains PS01014 Transcription termination factor nusG signature. BELONGS TO THE NUSG FAMILY. Protein product from Mb0658 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0658 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65590" /db_xref="InterPro:IPR001062" /db_xref="InterPro:IPR006645" /db_xref="InterPro:IPR008991" /db_xref="InterPro:IPR014722" /db_xref="InterPro:IPR015869" /db_xref="InterPro:IPR036735" /db_xref="UniProtKB/Swiss-Prot:P65590" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99256.1" /translation="MTTFDGDTSAGEAVDLTEANAFQDAAAPAEEVDPAAALKAELRS KPGDWYVVHSYAGYENKVKANLETRVQNLDVGDYIFQVEVPTEEVTEIKNGQRKQVNR KVLPGYILVRMDLTDDSWAAVRNTPGVTGFVGATSRPSALALDDVVKFLLPRGSTRKA AKGAASTAAAAEAGGLERPVVEVDYEVGESVTVMDGPFATLPATISEVNAEQQKLKVL VSIFGRETPVELTFGQVSKI" CDS 736258..736686 /codon_start=1 /transl_table=11 /gene="rplK" /locus_tag="BQ2027_MB0659" /product="50s ribosomal protein l11 rplk" /note="Mb0659, rplK, len: 142 aa. Equivalent to Rv0640, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Probable rplK, 50S ribosomal protein L11, equivalent to NP_302282.1|NC_002677 50S ribosomal protein L11 from Mycobacterium leprae (142 aa). Also highly similar to others e.g. P48954|RL11_STRCO|SCD82.19 50s ribosomal protein L11 from Streptomyces coelicolor (144 aa), FASTA scores: opt: 763, E(): 0, (84.6% identity in 143 aa overlap); etc. Contains PS00359 Ribosomal protein L11 signature. BELONGS TO THE L11P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0659 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0659 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66057" /db_xref="InterPro:IPR000911" /db_xref="InterPro:IPR006519" /db_xref="InterPro:IPR020783" /db_xref="InterPro:IPR020784" /db_xref="InterPro:IPR020785" /db_xref="InterPro:IPR036769" /db_xref="InterPro:IPR036796" /db_xref="UniProtKB/Swiss-Prot:P66057" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99257.1" /translation="MAPKKKVAGLIKLQIVAGQANPAPPVGPALGQHGVNIMEFCKAY NAATENQRGNVIPVEITVYEDRSFTFTLKTPPAAKLLLKAAGVAKGSAEPHKTKVAKV TWDQVREIAETKKTDLNANDVDAAAKIIAGTARSMGITVE" CDS 736753..737460 /codon_start=1 /transl_table=11 /gene="rplA" /locus_tag="BQ2027_MB0660" /product="50s ribosomal protein l1 rpla" /note="Mb0660, rplA, len: 235 aa. Equivalent to Rv0641, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv (99.6% identity in 235 aa overlap). Probable rplA, 50S ribosomal protein L1, equivalent to NP_302281.1|NC_002677 50S ribosomal protein L1 from Mycobacterium leprae (235 aa). Also highly similar to others e.g. P3625|RL1_STRGR 50s ribosomal protein L1 from Streptomyces griseus (240 aa), FASTA scores: opt: 1081, E(): 0, (72.2% identity in 230 aa overlap); etc. BELONGS TO THE L1P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0660 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0660 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59790" /db_xref="InterPro:IPR002143" /db_xref="InterPro:IPR005878" /db_xref="InterPro:IPR016095" /db_xref="InterPro:IPR023673" /db_xref="InterPro:IPR023674" /db_xref="InterPro:IPR028364" /db_xref="UniProtKB/Swiss-Prot:P59790" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99258.1" /translation="MSKTSKAYRAAAAKVDRTNLYTPLQAAKLAKETSSTKQDATVEV AIRLGVDPRKADQMVRGTVNLPHGTGKTARVAVFAVGEKADAAVAAGADVVGSDDLIE RIQGGWLEFDAAIAAPDQMAKVGRIARVLGPRGLMPNPKTGTVTADVAKAVADIKGGK INFRVDKQANLHFVIGKASFDEKLLAENYGAAIDEVLRLKPSSSKGRYLKKITVSTTT GPGIPVDPSITRNFAGE" CDS complement(737534..738439) /codon_start=1 /transl_table=11 /gene="mmaA4" /locus_tag="BQ2027_MB0661C" /product="METHOXY MYCOLIC ACID SYNTHASE 4 MMAA4 (METHYL MYCOLIC ACID SYNTHASE 4) (MMA4) (HYDROXY MYCOLIC ACID SYNTHASE)" /note="Mb0661c, mmaA4, len: 301 aa. Equivalent to Rv0642c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 301 aa overlap). mmaA4, methoxy mycolic acid synthase 4 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44876|AAC44876.1|cmaA methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (298 aa); NP_302280.1|NC_002677 methyl mycolic acid synthase 4 from Mycobacterium leprae (298 aa); and highly similar to others from Mycobacteria e.g. downstream ORF P72027|mmaA3|Rv0643c|MTCY20H10.24c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 3 from Mycobacterium tuberculosis (293 aa). Protein product from Mb0661c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0661c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U1K1" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U1K1" /protein_id="SIT99259.1" /translation="MTRMAEKPISPTKTRTRFEDIQAHYDVSDDFFALFQDPTRTYSC AYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDIGCGWGTTMRRAVERLDVNVIGL TLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVDRIVSIEAFEHFGHENYDDFF KRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARFIKFIVTEIFPGGRLPSTE MMVEHGEKAGFTVPEPLSLRPHYIKTLRIWGDTLQSNKDKAIEVTSEEVYNRYMKYLR GCEHYFTDEMLDCSLVTYLKPGAAA" CDS complement(738504..739385) /codon_start=1 /transl_table=11 /gene="mmaA3" /locus_tag="BQ2027_MB0662C" /product="METHOXY MYCOLIC ACID SYNTHASE 3 MMAA3 (METHYL MYCOLIC ACID SYNTHASE 3) (MMA3) (HYDROXY MYCOLIC ACID SYNTHASE)" /note="Mb0662c, mmaA3, len: 293 aa. Equivalent to Rv0643c, len: 293 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 293 aa overlap). mmaA3, methoxy mycolic acid synthase 3 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44875|AAC44875.1|cmaB methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (289 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa). Protein product from Mb0662c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0662c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U1K0" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U1K0" /protein_id="SIT99260.1" /translation="MSDNSTGTTKSRSNVDDVQAHYDLSDAFFALFQDPTRTYSCAYF ERDDMTLHEAQVAKLDLTLGKLGLEPGMTLLDVGCGWGSVMKRAVERYDVNVVGLTLS KNQHAYCQQVLDKVDTNRSHRVLLSDWANFSEPVDRIVTIEAIEHFGFERYDDFFKFA YNAMPADGVMLLHSITGLHVKQVIERGIPLTMEMAKFIRFIVTDIFPGGRLPTIETIE EHVTKAGFTITDIQSLQPHFARTLDLWAEALQAHKDEAIEIQSAEVYERYMKYLTGCA KAFRMGYIDCNQFTLAK" CDS complement(739533..740396) /codon_start=1 /transl_table=11 /gene="mmaA2" /locus_tag="BQ2027_MB0663C" /product="METHOXY MYCOLIC ACID SYNTHASE 2 MMAA2 (METHYL MYCOLIC ACID SYNTHASE 2) (MMA2) (HYDROXY MYCOLIC ACID SYNTHASE)" /note="Mb0663c, mmaA2, len: 287 aa. Equivalent to Rv0644c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 287 aa overlap). mmaA2, methoxy mycolic acid synthase 2 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44874|AAC44874.1|cmaC methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (287 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa). Note that alternative start is at position 739247. Protein product from Mb0663c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0663c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U1J9" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U1J9" /protein_id="SIT99261.1" /translation="MVNDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMT LEEAQIAKIDLALGKLGLQPGMTLLDIGCGWGATMRRAIAQYDVNVVGLTLSKNQAAH VQKSFDEMDTPLDRRVLLAGWEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPP DGVLLLHTITGLTRQQMVDHGLPLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKT GFTLTRRQSLQPHYARTLDLWAEALQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVG YIDVNQFTLAK" CDS complement(740563..741423) /codon_start=1 /transl_table=11 /gene="mmaA1" /locus_tag="BQ2027_MB0664C" /product="METHOXY MYCOLIC ACID SYNTHASE 1 MMAA1 (METHYL MYCOLIC ACID SYNTHASE 1) (MMA1) (HYDROXY MYCOLIC ACID SYNTHASE)" /note="Mb0664c, mmaA1, len: 286 aa. Equivalent to Rv0645c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). mmaA1, methoxy mycolic acid synthase 1 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to NP_302279.1|NC_002677 methyl mycolic acid synthase 1 from Mycobacterium leprae (286 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa). Protein product from Mb0664c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0664c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Q1" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P0A5Q1" /protein_id="SIT99262.1" /translation="MAKLRPYYEESQSAYDISDDFFALFLDPTWVYTCAYFERDDMTL EEAQLAKVDLALDKLNLEPGMTLLDVGCGWGGALVRAVEKYDVNVIGLTLSRNHYERS KDRLAAIGTQRRAEARLQGWEEFEENVDRIVSFEAFDAFKKERYLTFFERSYDILPDD GRMLLHSLFTYDRRWLHEQGIALTMSDLRFLKFLRESIFPGGELPSEPDIVDNAQAAG FTIEHVQLLQQHYARTLDAWAANLQAARERAIAVQSEEVYNNFMHYLTGCAERFRRGL INVAQFTMTK" CDS complement(741470..742375) /codon_start=1 /transl_table=11 /gene="lipG" /locus_tag="BQ2027_MB0665C" /product="PROBABLE LIPASE/ESTERASE LIPG" /note="Mb0665c, lipG, len: 301 aa. Equivalent to Rv0646c, len: 301 aa. from Mtycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Probable lipG, lipase/esterase (EC 3.1.-.-), equivalent to NP_302278.1|NC_002677 probable hydrolase from Mycobacterium leprae (304 aa). Also highly similar to various hydrolases, especially lipases e.g. AA61351.1|X88895 carboxyl esterase from Acinetobacter calcoaceticus (312 aa), FASTA scores: opt: 867, E(): 0, (50.2% identity in 279 aa overlap); etc. Also similar to transferases e.g. P77026 MACROLIDE 2'-PHOSPHOTRANSFERASE II from Escherichia coli (279 aa), FASTA scores: E(): 1.3e-14, (32.5% identity in 286 aa overlap). Similar to M. tuberculosis non-heme bromoperoxidases and epoxide hydrolases. Protein product from Mb0665c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0665c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XX04" /protein_id="SIT99263.1" /translation="MDIRSGTAVSGDVKLYYEDMGDLDHPPVLLIMGLGAQMLLWRTD FCARLVAKGLRVIRYDNRDVGLSTKTERHRPGQPLATRLVRSWLGLPSQAAYTLEDMA ADAAALLDHLDVKHAHVVGASMGGMIAQIFAARFAQRTKTLAVIFSSNNHRFLPPPAP RALLALLTGPPPDSPRDVIVDNAVRVSKIIGSPAYPIPEDQVRAEAAESYDRNFHPWG IAQQFSAILGSGSLLRYDRRIVAPTVVIHGRADKLMRPFGGRAVARAINGARLVLIDG MGHDLPRQLWDRVIGELTRNFSEAG" CDS complement(742387..743853) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0666C" /product="Predicted unusual protein kinase" /note="Mb0666c, -, len: 488 aa. Equivalent to Rv0647c, len: 488 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 488 aa overlap). Conserved hypothetical protein, equivalent to NP_302277.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (448 aa). Also showing similarity to a variety of hypothetical ABC1-LIKE proteins or conserved hypothetical proteins e.g. D90908_28|P73627 ABC1-LIKE PROTEIN from Synechocystis (585 aa), FASTA scores: E(): 1.8e-31, (29.1% identity in 474 aa overlap); Q55884 HYPOTHETICAL6 5.0 KD PROTEIN (567 aa), FASTA scores: opt: 583, E(): 5.7e-30, (28.1% identity in 416 aa overlap); etc. Also similar to Rv3197 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis. Protein product from Mb0666c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0666c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW38" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR004147" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/TrEMBL:A0A1R3XW38" /protein_id="SIT99264.1" /translation="MRAEIGPDFRPHYTFGDAYPASERAHVNWELSAPVWHTAQMGST THREVAKLDRVPLPVEAARVAATGWQVTRTAVRFIGRLPRKGPWQQKVIKELPQTFAD LGPTYVKFGQIIASSPGAFGESLSREFRGLLDRVPPAKTDEVHKLFVEELGDEPARLF ASFEEEPFASASIAQVHYATLRSGEEVVVKIQRPGIRRRVAADLQILKRFAQTVELAK LGRRLSAQDVVADFADNLAEELDFRLEAQSMEAWVSHLHASPLGKNIRVPQVHWDFTT ERVLTMERVHGIRIDNTAAIRKAGFDGVELVKALLFSVFEGGLRHGLFHGDLHAGNLY VDEAGRIVFFDFGIMGRIDPRTRWLLRELVYALLVKKDHAAAGKIVVLMGAVGTMKPE TQAAKDLERFATPLTMQSLGDMSYADIGRQLSALADAYDVKLPRELVLIGKQFLYVER YMKLLAPRWQMMSDPQLTGYFANFMVEVSREHQSDIEV" CDS 744488..748135 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0667" /product="ALPHA-MANNOSIDASE" /note="Mb0667, -, len: 1215 aa. Equivalent to Rv0648, len: 1215 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1215 aa overlap). Alpha-mannosidase (EC 3.2.1.-) (see citation below), showing some similarity to hypothetical proteins and various sugar hydrolases e.g. SYCSLRA_6|Q55528 HYPOTHETICAL 1 20.4 KD PROTEIN from Synechocystis (1042 aa), FASTA scores: opt: 260, E(): 3.6e-08, (23.4% identity in 602 aa overlap); etc. Contains PS00659 Glycosyl hydrolases family 5 signature. Mb0667 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWE5" /db_xref="InterPro:IPR000602" /db_xref="InterPro:IPR011013" /db_xref="InterPro:IPR011330" /db_xref="InterPro:IPR011682" /db_xref="InterPro:IPR015341" /db_xref="InterPro:IPR018905" /db_xref="InterPro:IPR027291" /db_xref="InterPro:IPR028995" /db_xref="InterPro:IPR037094" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99265.1" /translation="MMGGTYNEPNTNLTSPETTIRNLVHGIGFQRDVLGAEPATAWQL DVFGHDPQFPGLAADAGLTSSSWARGPHHQWGPAQGGVDRMQFCSEFEWIAPSGRGLL THYMPAHYSAGWSMDSSTSLADAEAATYALFDQLKKVALTRNVLLPVGTDYTPPNKWV TAIHRDWGARYTWPRFVCALPKEFFAAVRAELAKRGWVPLPQTRDMNPIYTGKDVSYI DTKQANRAAENAVLEAERFAVFAALLTGAEYPQAALAKAWVQLAYGAHHDAITGSESD QVYLDLLTGWRDAWELGRAARDNSLRLLSGAVAASHDRVVVWNPLTQRRTDIVTARVD PPLQAGVRVFDPDGAEVAALVEHDGRSVTWLACDVPSLGWRVYRLVPADEAPGWELVP GTDIANEHYRLAVDPERGGALSSLVQDGRQLIAAGRVANELALYEEYPSHPTQGEGPW HLLPTGPVVCSSACPAQVQAYRGPLGQRLVVRGRIGTLLRYTQTLTLWDGVDRVDCRT SIDEFTGEDRLLRLRWPCPVPGAMPISEVGDAVVGRGFALLHEGPESVDTAQHPWTLD NPAYGWFGLSSAVRVRAGDGVRAVSVAEVVSPTETVSGPMARDLMVALVRAGVTATCS GADKPRYGHLDVDSNLPDARIALGGPDRNTFTKAVLAEAAPAYTAELQRQLAKTGTAR VWVPAANPLARAWLPGADLRAPCALPVLVIDGRDEKHLRAAVASLADDLADAEIVVHQ RAAPQMEPFEDRTVALLNRGVPSFAVDSEGTLHTALMRSCTGWPSGVWIDQPRRTAPD GSNFQLQHWTHHFDYALVCGGGDWRRAGIPARSAQFSHPLLAVAPRRPQGELPAVGSL LHVEPADSVQLGALKAAGNRLAAGSARPVQPAAVALRLVQTTGADTPVTIGCELGKVG ALRPADLLETPLAMARARKSSIDLHGYQVATVLARLDVAADMANVLAADDVALAPHAE TAQPQYARYWLHNRGPAPLGGLPAVAHLHPRRVRGQPGDDVVLRLTAASDCTDSVLGG VVDVVCPLGWPATPARLPFTLGAGAHLQADIALSIPAGAPPGPYPVRAQLRVVDTAVP AAWRQVVEDVCVVTVGADSDLEELVYLVDGPADIELAAGDRARLAVTIGSRAHAELAL DAHSISPWGTWEWIGPPALGAVLPARGMAKLAFDVTPPAWLEPGQWWALVRVGCAGQL VYSPAVKVSVT" CDS 748132..748806 /codon_start=1 /transl_table=11 /gene="fabD2" /locus_tag="BQ2027_MB0668" /product="possible malonyl coa-acyl carrier protein transacylase fabd2 (mct)" /note="Mb0668, fabD2, len: 224 aa. Equivalent to Rv0649, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Possible fabD2, malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39), similar to MTFABD|FABD_MYCTU|Q10501|Rv2243 malonyl CoA-acyl carrier protein transacylase from Mycobacterium tuberculosis (302 aa), FASTA scores: opt: 133, E(): 0.074, (31.3% identity in 147 aa overlap). Mb0668 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR027304" /db_xref="UniProtKB/TrEMBL:A0A1R3XVZ5" /protein_id="SIT99266.1" /translation="MSGRSRLPGSSSRRDAARIVAERVVATVAGVAVAVDEVDAAEAR LRDGPRAAALPASGTSEGRQLRRWLTQLIVTERVVAAEAAARGLTAAGAPAEADLLPD ATARLEIGSVAAAVLADPLARALFAAVTARVAVTDDAVADYHARNPLRFAAPCPGQHG WRAPAAAAPPLDQVRRAITEHLLGAARRRAFRVWLDARRNALVVLAPGYEHPGDPRQP DNTRRH" CDS 748806..749714 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0669" /product="POSSIBLE SUGAR KINASE" /note="Mb0669, -, len: 302 aa. Equivalent to Rv0650, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Possible sugar kinase, highly similar to others e.g. CAB95296.1|AL359779 putative sugar kinase from Streptomyces coelicolor (317 aa); NP_406512.1|NC_003143 putative sugar kinase from Yersinia pestis (290 aa); NP_229269.1|NC_000853 glucokinase from Thermotoga maritima (317 aa); etc.Contains PS01125 ROK family signature. BELONGS TO THE ROK (NAGC/XYLR) FAMILY. Protein product from Mb0669 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XW29" /db_xref="InterPro:IPR000600" /db_xref="UniProtKB/TrEMBL:A0A1R3XW29" /protein_id="SIT99267.1" /translation="MLTLCLDIGGTKIAAGLADPAGTLVHTAQRPTPAYGGAEQVWAA VAEMIADALGVAGGAVGGVGIASAGPIDLHSGRVSPINIGSWGGFPLRDRVAAAVPGV PVRLGGDGVCMALGEHWLGAGRGARFLLGLVVSTGVGGGLVLDGAPCLGRTGNAGHVG HVVVDPDGSPCPCGGRGCVETIASGPSLARWARANGWSAPPGAGAKELAEAAGAGDPV ALRAFRRGAAALAAMIASVGAVCDLDLAVIGGGVAKSGRLLFEPLRAALADHARLDFL AGLRVVPAELGGAAGLVGAARLAAIA" CDS 750045..750581 /codon_start=1 /transl_table=11 /gene="rplJ" /locus_tag="BQ2027_MB0670" /product="50s ribosomal protein l10 rplj" /note="Mb0670, rplJ, len: 178 aa. Equivalent to Rv0651, len: 178 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 178 aa overlap). Probable rplJ, 50S ribosomal protein L10, equivalent to NP_302276.1|NC_002677 50S ribosomal protein L10 from Mycobacterium leprae (177 aa). Also highly similar to others e.g. P36257|RL10_STRGR 50s ribosomal protein L10 from Streptomyces griseus (185 aa), FASTA scores: opt: 633, E(): 0, (59.0 % identity in 173 aa overlap); etc. BELONGS TO THE L10P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0670 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0670 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66045" /db_xref="InterPro:IPR001790" /db_xref="InterPro:IPR002363" /db_xref="InterPro:IPR022973" /db_xref="UniProtKB/Swiss-Prot:P66045" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99268.1" /translation="MARADKATAVADIAAQFKESTATLITEYRGLTVANLAELRRSLT GSATYAVAKNTLIKRAASEAGIEGLDELFVGPTAIAFVTGEPVDAAKAIKTFAKEHKA LVIKGGYMDGHPLTVAEVERIADLESREVLLAKLAGAMKGNLAKAAGLFNAPASQLAR LAAALQEKKACPGPDSAE" CDS 750618..751010 /codon_start=1 /transl_table=11 /gene="rplL" /locus_tag="BQ2027_MB0671" /standard_name="L7|L12" /product="50s ribosomal protein l7/l12 rpll (sa1)" /note="Mb0671, rplL, len: 130 aa. Equivalent to Rv0652, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Probable rplL (alternate gene name: L7|L12), 50S ribosomal protein L7/L12, equivalent to NP_302275.1|NC_002677 50S ribosomal protein L7/L12 from Mycobacterium leprae (130 aa); and P37381|RL7_MYCBO 50s ribosomal protein L7/L12 from Mycobacterium bovis (130 aa). Also highly similar to others e.g. P02396|RL7_STRGR 50S RIBOSOMAL PROTEIN L7/L12 from Streptomyces griseus (127 aa); etc. BELONGS TO THE L12P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0671 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0671 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5V3" /db_xref="InterPro:IPR000206" /db_xref="InterPro:IPR008932" /db_xref="InterPro:IPR013823" /db_xref="InterPro:IPR014719" /db_xref="InterPro:IPR036235" /db_xref="UniProtKB/Swiss-Prot:P0A5V3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99269.1" /translation="MAKLSTDELLDAFKEMTLLELSDFVKKFEETFEVTAAAPVAVAA AGAAPAGAAVEAAEEQSEFDVILEAAGDKKIGVIKVVREIVSGLGLKEAKDLVDGAPK PLLEKVAKEAADEAKAKLEAAGATVTVK" CDS complement(751003..751698) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0672C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb0672c, -, len: 231 aa. Equivalent to Rv0653c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Possible transcriptional regulator, TetR family, similar in N-terminus to others e.g. CAC03642.1|AL391338 putative TetR-family transcriptional regulator from Streptomyces coelicolor (190 aa); Q51597 CAM REPRESSOR from Pseudomonas putida (186 aa), FASTA scores: opt: 150, E(): 0.00085, (27.8% identity in 97 aa overlap); etc. Also some similarity to Mycobacterium tuberculosis hypothetical transcriptional regulators Rv0681 and Rv1816. Contains probable helix-turn helix motif from aa 27-48 (Score 1156, +3.12 SD)." /db_xref="GOA:A0A1R3XVZ4" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR025996" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3XVZ4" /protein_id="SIT99270.1" /translation="MTSQTGVRDELLHAGVRLLDDHGPDALQTRKVAAAAGTSTMAVY THFGGMRGLIAAIAEEGLRQFDVALTVPQTADPVADLLAIGTAYRRYAIERPHMYRLM FGSTSAHGINVPARDVLTLKVAEIEHQHPSFAHVVRAVHRCLLAGRFATALGADDDTA IVATAAQFWSQIHGFVMLELAGFYGDRGAAVEPVLAAMTVNLLVALGDSPERAQCSLR AEQTQKNTLGRAT" CDS 751769..753274 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0673" /product="PROBABLE DIOXYGENASE" /note="Mb0673, -, len: 501 aa. Equivalent to Rv0654, len: 501 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 501 aa overlap). Probable dioxygenase (EC 1.-.-.-), highly similar to others eg AAK06796.1|AF324838_15|AF324838|SimC5 putative dioxygenase (involved in tetraene formation) from Streptomyces antibioticus (456 aa); CAB56138.1| AL117669 putative dioxygenase from Streptomyces coelicolor (503 aa); T51734 neoxanthin cleavage enzyme (9-cis-epoxy-carotenoid dioxygenase) from Arabidopsis thaliana (538 aa); Q53353 LIGNOSTILBENE-ALPHA,BETA-DIOXYGENASE from Pseudomonas paucimobilis (Sphingomonas paucimobilis), FASTA scores: opt: 280, E(): 2.3e-11, (28.5% identity in 523 aa overlap); etc. Also some similarity with Rv0913c|MTCY21C12.07c POSSIBLE DIOXYGENASE from Mycobacterium tuberculosis (501 aa), FASTA score: (29.5% identity in 522 aa overlap). Protein product from Mb0673 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0673 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW04" /db_xref="InterPro:IPR004294" /db_xref="UniProtKB/TrEMBL:A0A1R3XW04" /protein_id="SIT99271.1" /translation="MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRY LRNGPNPVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPISA RPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCDFDGTLHGGYT AHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTVDIEVAGSPMMHSFSLTDN YVVIYDLPVTFDPMQVVPASVPRWLQRPARLVIQSVLGRVRIPDPIAALGNRMQGHSD RLPYAWNPSYPARVGVMPREGGNEDVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVV RYSRMFDRDRRGPGGDSRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHR FAYTVGIEGGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPTT" CDS 753286..754365 /codon_start=1 /transl_table=11 /gene="mkl" /locus_tag="BQ2027_MB0674" /product="POSSIBLE RIBONUCLEOTIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER MKL" /note="Mb0674, mkl, len: 359 aa. Equivalent to Rv0655, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Possible mkl, ribonucleotide-transport ATP-binding protein ABC transporter (see first citation below), equivalent to P30769|MKL_MYCLE|ML1892 POSSIBLE RIBONUCLEOTIDE TRANSPORT ATP-BINDING PROTEIN from Mycobacterium leprae (347 aa), FASTA scores: opt: 2021, E(): 0, (92.2% identity in 335 aa overlap). Also highly similar to many e.g. AB92896.1|AL356992 putative ABC-transporter ATP-binding protein from Streptomyces coelicolor (343 aa); NP_253146.1|NC_002516 probable ATP-binding component of ABC transporter from Pseudomonas aeruginosa (269 aa); P45393|YRBF_ECOLI hypothetical ABC transporter ATP-binding protein from Escherichia coli (269 aa), FASTA scores: opt: 644, E(): 3.4e-33, (38.5% identity in 244 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis ABC transporters e.g. P71747|CYSA|Rv2397c|MTCY253.24 (351 aa), FASTA score: (33.6% identity in 241 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb0674 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0674 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63358" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR030296" /db_xref="UniProtKB/Swiss-Prot:P63358" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99272.1" /translation="MRYSDSYHTTGRWQPRASTEGFPMGVSIEVNGLTKSFGSSRIWE DVTLTIPAGEVSVLLGPSGTGKSVFLKSLIGLLRPERGSIIIDGTDIIECSAKELYEI RTLFGVLFQDGALFGSMNLYDNTAFPLREHTKKKESEIRDIVMEKLALVGLGGDEKKF PGEISGGMRKRAGLARALVLDPQIILCDEPDSGLDPVRTAYLSQLIMDINAQIDATIL IVTHNINIARTVPDNMGMLFRKHLVMFGPREVLLTSDEPVVRQFLNGRRIGPIGMSEE KDEATMAEEQALLDAGHHAGGVEEIEGVPPQISATPGMPERKAVARRQARVREMLHTL PKKAQAAILDDLEGTHKYAVHEIGQ" CDS complement(754753..755136) /codon_start=1 /transl_table=11 /gene="vapc6" /locus_tag="BQ2027_MB0675C" /product="possible toxin vapc6" /note="Mb0675c, -, len: 127 aa. Equivalent to Rv0656c, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 127 aa overlap). Conserved hypothetical protein, showing similarity with proteins from Mycobacterium tuberculosis e.g. Rv2757c, Rv2546, etc. Mb0675c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX14" /db_xref="InterPro:IPR022907" /db_xref="UniProtKB/TrEMBL:A0A1R3XX14" /protein_id="SIT99273.1" /translation="MAAATTTGTHRGLELRAAQRAVGSCEPQRAEFCRSARNADEFDQ MSRMFGDVYPDVPVPKSVWRWIDSAQHRLARAGAVGALSVVDLLICDTAAARGLVVLH DDADYELAERHLPDIRVRRVVSADD" CDS complement(755231..755386) /codon_start=1 /transl_table=11 /gene="vapb6" /locus_tag="BQ2027_MB0676C" /product="possible antitoxin vapb6" /note="Mb0676c, -, len: 51 aa. Equivalent to Rv0657c, len: 51 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 51 aa overlap). Conserved hypothetical protein, showing similarity with hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2009|MT2064.1|MTCY39.08c|YW08_MYCTU|Q10848 (80 aa), FASTA scores: opt: 107, E(): 0.0038, (45.8% identity in 48 aa overlap), Rv2871, Rv1560, etc. Also some similarity with AL020958|SC4H8_7 from Streptomyces coelicolor (66 aa), FASTA score: (41.0% identity in 39 aa overlap). Mb0676c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3XW48" /protein_id="SIT99274.1" /translation="MSVTQIDLDDEALADVMRIAAVHTKKEAVNLAMRDYVERFRRIE ALARSRE" CDS complement(755462..756178) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0677C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0677c, -, len: 238 aa. Equivalent to Rv0658c, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 238 aa overlap). Probable conserved integral membrane protein, similar to P33774|YPRB_ECOLI hypothetical 24.3 kd protein from Escherichia coli (217 aa), FASTA scores: opt: 174, E(): 5.3e-05, (25.6% identity in 223 aa overlap). Also similar to Rv1863c and Rv0804 from Mycobacterium tuberculosis. Mb0677c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWF5" /db_xref="InterPro:IPR003675" /db_xref="UniProtKB/TrEMBL:A0A1R3XWF5" /protein_id="SIT99275.1" /translation="MEAGRADTVAPSHRWGLGAFLVVELVFLVASTSLAVVLTGHGPV SAGVLALALAAPTVVAAGLAILITRLRGNGPRTDLRLRWSWRGLRLGLMFGFGGMLVT IPASLVYTAIVGPEANSAVVRIFGGVRASWPWALVVFLVVVFVAPLCEEIIYRGLLWG AVDRRWGRWAALVVTTVVFALAHLEFARAPLLVVVAIPIALARFYSGGLLASIVTHQV TNLLPGIVLLLGLTGAISLP" CDS complement(756454..756762) /codon_start=1 /transl_table=11 /gene="mazf2" /locus_tag="BQ2027_MB0678C" /product="toxin mazf2" /note="Mb0678c, -, len: 102 aa. Equivalent to Rv0659c, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). Conserved hypothetical protein, weakly similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1942c, Rv1495, etc. Mb0678c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW06" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3XW06" /protein_id="SIT99276.1" /translation="MRRGELWFAATPGGDRPVLVLTRDPVADRIGAVVVVALTRTRRG LVSELELTAVENRVPSDCVVNFDNIHTLPRTAFRRRITRLSPARLHEACQTLRASTGC " CDS complement(756749..756994) /codon_start=1 /transl_table=11 /gene="maze2" /locus_tag="BQ2027_MB0679C" /product="possible antitoxin maze2" /note="Mb0679c, -, len: 81 aa. Equivalent to Rv0660c, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Conserved hypothetical protein, showing some similarity to AF016485_130 from Halobacterium sp (100 aa), FASTA scores: (32.4% identity in 74 aa overlap). Mb0679c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW41" /db_xref="InterPro:IPR002145" /db_xref="UniProtKB/TrEMBL:A0A1R3XW41" /protein_id="SIT99277.1" /translation="MLSFRADDHDVDLADAWARRLHIGRSELLRDALRRHLAALAADQ DVQAYTERPLTDDENALAEIADWGPAEDWADWADAAR" CDS complement(757104..757541) /codon_start=1 /transl_table=11 /gene="vapc7" /locus_tag="BQ2027_MB0680C" /product="possible toxin vapc7" /note="Mb0680c, -, len: 145 aa. Equivalent to Rv0661c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 145 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv2863|MTV003.09|MTV003_7 (126 aa), FASTA scores: E(): 0.00087, (30.4% identity in 125 aa overlap), Rv0749|MTV041.23 (163 aa); Rv0277c, Rv2530c, etc." /db_xref="GOA:A0A1R3XW16" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XW16" /protein_id="SIT99278.1" /translation="MIVLDTTVLVYAKGAEHPLRDPCRDLVAAIADERIAATTTAEVI QEFVHVRARRRDRSDAAALGRVTMPNCSRRYSPSIEATSKRGLTLFETTPGLEACDAV LAAVAASAGATALVSADPAFADLSDVVHVIPDAAGMVSLLGDR" CDS complement(757538..757906) /codon_start=1 /transl_table=11 /gene="vapb7" /locus_tag="BQ2027_MB0681C" /product="possible antitoxin vapb7" /note="Mb0681c, -, len: 122 aa. Equivalent to Rv0662c, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Conserved hypothetical protein, showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2871, Rv1241, Rv2550c, etc." /db_xref="UniProtKB/TrEMBL:A0A1R3XW11" /protein_id="SIT99279.1" /translation="MFLPNTRAYRRYNRSVWAVRGSTRPQWQPPPKFQHAKCMSMRLA HRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM SVPEPRELKQELEALRARRG" CDS 757906..760269 /codon_start=1 /transl_table=11 /gene="atsD" /locus_tag="BQ2027_MB0682" /product="POSSIBLE ARYLSULFATASE ATSD (ARYL-SULFATE SULPHOHYDROLASE) (ARYLSULPHATASE)" /note="Mb0682, atsD, len: 787 aa. Equivalent to Rv0663, len: 787 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 787 aa overlap). Possible atsD, arylsulfatase (EC 3.1.6.1), similar to others e.g. P5169|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 653, E(): 0, (33.1% identity in 544 aa overlap); etc. Also similar to P95059|MTCY210.30|ATSA|Rv0711|MTCY210.30 from Mycobacterium tuberculosis (787 aa), FASTA score: (38.9% identity in 769 aa overlap); and other arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c|ATSB (970 aa), Rv0711, etc. Contains PS00523 Sulfatases signature 1. BELONGS TO THE SULFATASE FAMILY. Protein product from Mb0682 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0682 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW05" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR013320" /db_xref="InterPro:IPR017850" /db_xref="InterPro:IPR024607" /db_xref="UniProtKB/TrEMBL:A0A1R3XW05" /protein_id="SIT99280.1" /translation="MPQPRTHLPIPSAARTGLITYDAKDPDSTYPPIEQLRPPAGAPN VLLILLDDVGFGASSAFGGPCRTSTAELLAGNGLRYNRFHTTALCSPTRQALLTGRNH HSAGMGGITEIATGAPGYSSVLPNTMSPIARTLKLNGYNTAQFGKCHEVPVWQTSPVG PFDAWPSGGGGFEYFYGFIGGEANQWYPSLYEGTTPVEVNRTPEEGYHFMADMTDKAL GWIGQQKALAPDRPFFVYFAPGATHAPHHVPREWADKYRGRFDVGWDALREETFARQK ELGVIPADCQLTARHAEIPAWDDMPEDLKPVLCRQMEVYAGFLEYTDHHVGRLVDGLQ RLGVLDDTLVFYIIGDNGASAEGTINGTYNEMLNFNGLADIETPRFMTDRLDKFGGPE SYNHYSVGWAHAMDTPYQWTKQVASHWGGTRNGTIVHWPNGIAAKGEMRWQFHHVIDV APTILEAAGLPEPLFVNGVQQHPIEGVSMAYSFDDAQAPDRHETQYFEMFGNRGIYHK GWTAVTKHKTPWILVGEQTVAFDDDVWELYDTTKDWSQAKDLAKEMPEKLHELQRLWL IEATRYNVLPLDDDTASRINPDLAGRPVLIRGNTQVLFSNMGRLSENCVLNLKNKSHT VTAEVEVPETGAEGVIVAQGASIGGWSLYANDGKLKYCYNLGGIKHFYAESADPLPAG AHQVRMEFAYAGGGLGKGGEVTLYVDGQQVGEGHVEATLAIVFSADDGCDVGMDSGSP VSPDYAPGSNAFNGRIKGVQLAIAEAAAAAGHLVDPEHAIRIALARQ" CDS 760301..760573 /codon_start=1 /transl_table=11 /gene="vapb8" /locus_tag="BQ2027_MB0683" /product="possible antitoxin vapb8" /note="Mb0683, -, len: 90 aa. Equivalent to Rv0664, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Hypothetical unknown protein. Mb0683 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XW14" /protein_id="SIT99281.1" /translation="MEKSRCHAVAHGGGCAGSAKSHKSGGRCGQGRGAGDSHGTRGAG RRYRAASAPHPLAVGAHLRDELAKRSADPRLTDELNDLAGHTLDDL" CDS 760570..760908 /codon_start=1 /transl_table=11 /gene="vapc8" /locus_tag="BQ2027_MB0684" /product="possible toxin vapc8" /note="Mb0684, -, len: 112 aa. Equivalent to Rv0665, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Conserved hypothetical protein, similar to Rv0627 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (135 aa), and showing similarity with Rv0595c. Mb0684 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XY60" /protein_id="SIT99282.1" /translation="MTEGEVGVGLLDTSVFIARESGGAIADLPERVALSVMTIGELQL GLLNAGDSATRSRRADTLALARTADQIPVSEAVMISLARLVADCRAAGVRRSVKLTDA LIAATAEIKV" CDS 760905..761078 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0685" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0685, -, len: 57 aa. Equivalent to Rv0666, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 57 aa overlap). Possible membrane protein; has hydrophobic stretch at aa 29-47. Mb0685 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX24" /protein_id="SIT99283.1" /translation="MTPRTDEGAAAPCLMPDVTMPVKRGDARGALGVGPALFVVSVSS SLVRARSCRCTAD" CDS 761576..765094 /codon_start=1 /transl_table=11 /gene="rpoB" /locus_tag="BQ2027_MB0686" /product="DNA-DIRECTED RNA POLYMERASE (BETA CHAIN) RPOB (TRANSCRIPTASE BETA CHAIN) (RNA POLYMERASE BETA SUBUNIT)" /note="Mb0686, rpoB, len: 1172 aa. Equivalent to Rv0667, len: 1172 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1172 aa overlap). rpoB, DNA-directed RNA polymerase, beta chain (EC 2.7.7.6) (see first and third citations below), equivalent to P30760|RPOB_MYCLE|ML1891 DNA-directed RNA polymerase beta chain from Mycobacterium leprae (1178 aa). Also highly similar to others e.g. AAF60349.1|AF242549_1|AF242549 DNA-dependent RNA polymerase beta subunit from Amycolatopsis mediterranei (1167 aa); CAB77428.1|AL160431 DNA-directed RNA polymerase beta chain from Streptomyces coelicolor (1161 aa); etc. Start site chosen on basis of RBS but alternative start exists at position 14359. BELONGS TO THE RNA POLYMERASE BETA CHAIN FAMILY. Protein product from Mb0686 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0686 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A681" /db_xref="InterPro:IPR007120" /db_xref="InterPro:IPR007121" /db_xref="InterPro:IPR007641" /db_xref="InterPro:IPR007642" /db_xref="InterPro:IPR007644" /db_xref="InterPro:IPR007645" /db_xref="InterPro:IPR010243" /db_xref="InterPro:IPR014724" /db_xref="InterPro:IPR015712" /db_xref="InterPro:IPR019462" /db_xref="InterPro:IPR037033" /db_xref="InterPro:IPR037034" /db_xref="InterPro:IPR042107" /db_xref="UniProtKB/Swiss-Prot:P0A681" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99284.1" /translation="MADSRQSKTAASPSPSRPQSSSNNSVPGAPNRVSFAKLREPLEV PGLLDVQTDSFEWLIGSPRWRESAAERGDVNPVGGLEEVLYELSPIEDFSGSMSLSFS DPRFDDVKAPVDECKDKDMTYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMMTEKGTF IINGTERVVVSQLVRSPGVYFDETIDKSTDKTLHSVKVIPSRGAWLEFDVDKRDTVGV RIDRKRRQPVTVLLKALGWTSEQIVERFGFSEIMRSTLEKDNTVGTDEALLDIYRKLR PGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKLGLHVGEPITSSTLTEEDVVA TIEYLVRLHEGQTTMTVPGGVEVPVETDDIDHFGNRRLRTVGELIQNQIRVGMSRMER VVRERMTTQDVEAITPQTLINIRPVVAAIKEFFGTSQLSQFMDQNNPLSGLTHKRRLS ALGPGGLSRERAGLEVRDVHPSHYGRMCPIETPEGPNIGLIGSLSVYARVNPFGFIET PYRKVVDGVVSDEIVYLTADEEDRHVVAQANSPIDADGRFVEPRVLVRRKAGEVEYVP SSEVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVGTGM ELRAAIDAGDVVVAEESGVIEEVSADYITVMHDNGTRRTYRMRKFARSNHGTCANQCP IVDAGDRVEAGQVIADGPCTDDGEMALGKNLLVAIMPWEGHNYEDAIILSNRLVEEDV LTSIHIEEHEIDARDTKLGAEEITRDIPNISDEVLADLDERGIVRIGAEVRDGDILVG KVTPKGETELTPEERLLRAIFGEKAREVRDTSLKVPHGESGKVIGIRVFSREDEDELP AGVNELVRVYVAQKRKISDGDKLAGRHGNKGVIGKILPVEDMPFLADGTPVDIILNTH GVPRRMNIGQILETHLGWCAHSGWKVDAAKGVPDWAARLPDELLEAQPNAIVSTPVFD GAQEAELQGLLSCTLPNRDGDVLVDADGKAMLFDGRSGEPFPYPVTVGYMYIMKLHHL VDDKIHARSTGPYSMITQQPLGGKAQFGGQRFGEMECWAMQAYGAAYTLQELLTIKSD DTVGRVKVYEAIVKGENIPEPGIPESFKVLLKELQSLCLNVEVLSSDGAAIELREGED EDLERAAANLGINLSRNESASVEDLA" CDS 765139..769089 /codon_start=1 /transl_table=11 /gene="rpoC" /locus_tag="BQ2027_MB0687" /product="DNA-DIRECTED RNA POLYMERASE (BETA' CHAIN) RPOC (TRANSCRIPTASE BETA' CHAIN) (RNA POLYMERASE BETA' SUBUNIT)." /note="Mb0687, rpoC, len: 1316 aa. Equivalent to Rv0668, len: 1316 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1316 aa overlap). rpoC, DNA-directed RNA polymerase, beta' chain (EC 2.7.7.6) (see first citation below), equivalent to P30761|RPOC_MYCLE|ML1890|S31146 DNA-directed RNA polymerase (EC 2.7.7.6) beta' chain from Mycobacterium leprae (1316 aa), FASTA scores: opt: 8295, E(): 0, (95.6% identity in 1316 aa overlap). Also highly similar to others e.g. CAB77429.1|AL160431 DNA-directed RNA polymerase beta' chain (fragment) from Streptomyces coelicolor (1059 aa); P37871|RPOC_BACSU from Bacillus subtilis (1199 aa), FASTA scores: opt: 2367, E(): 0, (52.9 identity in 1317 aa overlap); etc. BELONGS TO THE RNA POLYMERASE BETA' CHAIN FAMILY. Protein product from Mb0687 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0687 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A675" /db_xref="InterPro:IPR000722" /db_xref="InterPro:IPR006592" /db_xref="InterPro:IPR007066" /db_xref="InterPro:IPR007080" /db_xref="InterPro:IPR007081" /db_xref="InterPro:IPR007083" /db_xref="InterPro:IPR012754" /db_xref="InterPro:IPR038120" /db_xref="InterPro:IPR042102" /db_xref="UniProtKB/Swiss-Prot:P0A675" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99285.1" /translation="MLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKD GLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAPVT HIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDEEMRHNELSTLEAEMAVERK AVEDQRDGELEARAQKLEADLAELEAEGAKADARRKVRDGGEREMRQIRDRAQRELDR LEDIWSTFTKLAPKQLIVDENLYRELVDRYGEYFTGAMGAESIQKLIENFDIDAEAES LRDVIRNGKGQKKLRALKRLKVVAAFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGR FATSDLNDLYRRVINRNNRLKRLIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVT GPGNRPLKSLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKLMALE LFKPFVMKRLVDLNHAQNIKSAKRMVERQRPQVWDVLEEVIAEHPVLLNRAPTLHRLG IQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQMAVHLPLSAEAQAEARILMLSSNNIL SPASGRPLAMPRLDMVTGLYYLTTEVPGDTGEYQPASGDHPETGVYSSPAEAIMAADR GVLSVRAKIKVRLTQLRPPVEIEAELFGHSGWQPGDAWMAETTLGRVMFNELLPLGYP FVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAGFYWATRSGVTVSMADVLVPP RKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKEATDEVGQALREHYPDDNP IITIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSFREGLTVLEYFINTHGA RKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCQTERGIVVELAERAPDGTLIRDP YIETSAYARTLGTDAVDEAGNVIVERGQDLGDPEIDALLAAGITQVKVRSVLTCATST GVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGVGEDITGGLPR VQELFEARVPRGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKISKRQRLRV FKHEDGSERVLSDGDHVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQEVYRAQG VSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEPAAGR PVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPAGT GINRYRNIAVQPTEEARAAAYTIPSYEDQYYSPDFGAATGAAVPLDDYGYSDYR" CDS complement(769453..771366) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0688C" /product="POSSIBLE HYDROLASE" /note="Mb0688c, -, len: 637 aa. Equivalent to Rv0669c, len: 637 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 637 aa overlap). Possible hydrolase (EC 3.-.-.-), highly similar to various hydrolases (N-terminus shorter) e.g. BAA88409.1|AB028646 alkaline ceramidase from Pseudomonas aeruginosa (670 aa,) FASTA scores: opt: 1490, E(): 0, (41.2% identity in 651 aa overlap); NP_063946.1|NM_019893 mitochondrial ceramidase from Homo sapiens (761 aa); P_446098.1|NM_053646 N-acylsphingosine amidohydrolase 2 from Rattus norvegicus (761 aa); BAB09641.1|AB016885 neutral ceramidase from Arabidopsis thaliana (705 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0688c detected using SWATH mass spectrometry. Mb0688c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW15" /db_xref="InterPro:IPR006823" /db_xref="InterPro:IPR031329" /db_xref="InterPro:IPR031331" /db_xref="InterPro:IPR038445" /db_xref="UniProtKB/TrEMBL:A0A1R3XW15" /protein_id="SIT99286.1" /translation="MLSVGRGIADITGEAADCGMLGYGKSDQRTAGIHQRLRSRAFVF RDDSQDGDARLLLIVAELPLPMQNVNEEVLRRLADLYGDTYSEQNTLITATHTHAGPG GYCGYLLYNLTTSGFRPATFAAIVDGIVESVEHAHADVAPAEVPLSHGELYGASINRS PSAFDRNPPADKAFFPKRVDPHTTLVRIDRGEATVGVIHFFATHGTSMTNRNHLISGD NKGFAAYHWERTVGGADYLAGQPDFIAAFAQTNPGDMSPNVDGPLSPEAPPDREFDNT RRTGLCQFEDAFTQLSGATPIGAGIDARFTYVDLGSVLVRGEYTPDGEERRTGRPMFG AGAMAGTDEGPGFHGFRQGRNPFWDRLSRAMYRLARPTAAAQAPKGIVMPARLPNRIH PFVQEIVPVQLVRIGRLYLIGIPGEPTIVAGLRLRRMVASIVGADLADVLCVGYTNAY IHYVTTPEEYLEQRYEGGSTLFGRWELCALMQTVAELAEAMRDGRPVTLGRRPRPTRE LSWVRGAPADAGSFGAVIAEPSATYRPGQAVEAVFVSALPNNDLRRGGTYLEVVRREG ASWVRIADDGDWATSFRWQRQGRAGSHVSIRWDVPGDTTPGQYRIVHHGTARDRNGML TAFSATTREFTVV" CDS 771561..772319 /codon_start=1 /transl_table=11 /gene="end" /locus_tag="BQ2027_MB0689" /product="PROBABLE ENDONUCLEASE IV END (ENDODEOXYRIBONUCLEASE IV) (APURINASE)" /note="Mb0689, end, len: 252 aa. Equivalent to Rv0670, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Probable end (alternate gene name: nfo), endonuclease IV (apurinase) (EC 3.1.21.2), equivalent to END_MYCLE|P30770|NFO|ML1889 probable endonuclease IV (apurinase) from Mycobacterium leprae (252 aa), FASTA scores: opt: 1463, E(): 0, (85.6% identity in 250 aa overlap). Also similar to others e.g. Q9S2N2|END4_STRCO|NFO|SC6E10.05 PROBABLE ENDONUCLEASE IV from Streptomyces coelicolor (294 aa); etc. Contains PS00729 AP endonucleases family 2 signatures 1 and 2 (PS00729, and PS00730). BELONGS TO THE AP ENDONUCLEASES FAMILY 2. COFACTOR: BINDS 3 ZINC IONS. Protein product from Mb0689 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0689 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63536" /db_xref="InterPro:IPR001719" /db_xref="InterPro:IPR013022" /db_xref="InterPro:IPR018246" /db_xref="InterPro:IPR036237" /db_xref="UniProtKB/Swiss-Prot:P63536" /protein_id="SIT99287.1" /translation="MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAA ALKAATLPIYVHAPYLINLASANNRVRIPSRKILQETCAAAADIGAAAVIVHGGHVAD DNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWDVIGDTGIGFC LDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAGSGRDRHANLGSGQIDPDL LVAAVKAAGAPVICETADQGRKDDIAFLRERTGS" CDS 772351..773193 /codon_start=1 /transl_table=11 /gene="lpqP" /locus_tag="BQ2027_MB0690" /product="POSSIBLE CONSERVED LIPOPROTEIN LPQP" /note="Mb0690, lpqP, len: 280 aa. Equivalent to Rv0671, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Possible lpqP, conserved lipoprotein, similar to U00012|B1308_F2_43|Q49658 from Mycobacterium leprae (302 aa), FASTA scores: opt: 449, E(): 2.4e-22, (37.6% identity in 242 aa overlap). Also highly similar to lpqC|Rv3298c|MTCY71.38c PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (304 aa). Also similar to a large variety of proteins including various esterases and poly(3-hydroxyalkanoate) depolymerases, e.g. NP_249234.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (322 aa); C-terminus of AAD45376.1|AF164516_1|AF164516 cinnamoyl ester hydrolase EstA from Piromyces equi (536 aa); part of P52090|PHA1_PSELE POLY(3-HYDROXYALKANOATE) DEPOLYMERASE C PRECURSOR from Pseudomonas lemoignei (414 aa); CAC10310.1|AL442629 putative secreted protein from Streptomyces coelicolor (348 aa); etc. Has a 17 aa signal sequence and contains appropriately positioned (PS00013) Prokaryotic membrane lipoprotein lipid attachment site. Mb0690 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XW27" /db_xref="InterPro:IPR010126" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XW27" /protein_id="SIT99288.1" /translation="MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGR SYRLYKPVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRAWN ANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGAIMSYTLACNT SIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHGGPGAGFARIDGPPVPDLN AFWREVNRCGALDTTTEGPVTTSGATCADNRRVVLLTVDDAGHRWPSFATQTLWRFFA AHFR" CDS 773253..774881 /codon_start=1 /transl_table=11 /gene="fadE8" /locus_tag="BQ2027_MB0691" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE8" /note="Mb0691, fadE8, len: 542 aa. Equivalent to Rv0672, len: 542 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 542 aa overlap). Probable fadE8, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAC33951.1|AL589708 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (557 aa); P33224|AIDB_ECOLI|B4187 aidb protein (ACYL-COA DEHYDROGENASES FAMILY) from Escherichia coli strain K12 (546 aa), FASTA scores: opt: 1369, E(): 0, (44.1% identity in 524 aa overlap); etc. Also similar to several other M. tuberculosis proteins e.g. Rv0154cRv0154c|MTCI5.28c FASTA score: (26.3% identity in 342 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 2 (PS00073). BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb0691 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0691 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW22" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR041504" /db_xref="UniProtKB/TrEMBL:A0A1R3XW22" /protein_id="SIT99289.1" /translation="MSDTHVVTNQVPPLENYNPASSPVLIEALIQEGGQWGLDEVNEV GAISASCQAQRWGELADRNRPILHTHDAYGYRVDEVEYDPAYHELMRTAITHGMHAAP WADDRPGAHVVRAAKTSVWTVEPGHICPISMTYAVVPALRYNSELAAVYEPLLTSREY DPELKPATTKAGITAGMSMTEKQGGSDVRAGTTQATPNADGSYSLTGHKWFTSAPMCD IFLVLAQAPDGLSCFLLPRVLPDGTRNRMFLQRLKDKLGNHANASSEVEYDGAVAWLV GEEGRGVPTIIEMVNLTRLDCALGSATSMRTGLTRAVHHAQHRKAFGAYLIDQPLMRN VLADLAVEAEAATIVAMRMAGATDNAVRGNETEALLRRIGLAAAKYWVCKRSTAHAAE ALECLGGNGYVEDSGMPRLYREAPLMGIWEGSGNVSALDTLRAMATRPACVEVLFDEL ARSAGQDPRLDGHVERLRPQLGDLDTIGYRARKIAEDICLALQGSLLVRHGHPAVAEA FLATRLGGQWGGAYGTMPAGLDLAPILERALVKG" CDS 774892..775830 /codon_start=1 /transl_table=11 /gene="echA4" /locus_tag="BQ2027_MB0692" /product="POSSIBLE ENOYL-COA HYDRATASE ECHA4 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0692, echA4, len: 312 aa. Equivalent to Rv0673, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 312 aa overlap). Possible echA4, enoyl-CoA hydratase (EC 4.2.1.17), showing similarity with others e.g. NP_419216.1|NC_002696 enoyl-CoA hydratase/isomerase family protein from Caulobacter crescentus (256 aa); Q52995|ECHH_RHIME PROBABLE ENOYL-COA HYDRATASE from Sinorhizobium meliloti (257 aa), FASTA scores: opt: 210, E(): 1.2e-06, (27.9% identity in 280 aa overlap); etc. Also similar to other enoyl-CoA hydratases from Mycobacterium tuberculosis e.g. P95279|MTCY09F9.29|ECHA13|Rv1935c|MTCY09F9.29 ENOYL-COA HYDRATASE (318 aa), FASTA score: (27.1% identity in 280 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0692 detected using SWATH mass spectrometry. Mb0692 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW17" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XW17" /protein_id="SIT99290.1" /translation="MTHAIRPVDFDNLKTMTYEVTGRIARITFNRPEKGNAIIADTPL ELSALVERADLDPGVHVILVSGRGEGFCAGFDLSAYAEGSSSTGGGGAYQGTVLDGKT QAVNHLPNQPWDPMIDYQMMSRFVRGFASLMHADKPTVVKIHGYCVAGGTDIALHADQ VIAAADAKIGYPPTRVWGVPAAGLWAHRLGDQRAKRLLFTGDCITGAQAAEWGLAVEA PEPADLDERTERLVARIAALPVNQLIMVKLALNSALLQQGVATSRMVSTVFDGAARHT PEGHAFVADAVEHGFRDAVRRRDEPFGDYGRQASRV" CDS 775833..776555 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0693" /product="Transcriptional regulator, PaaX family" /note="Mb0693, -, len: 240 aa. Equivalent to Rv0674, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 240 aa overlap). Conserved hypothetical protein, highly similar to AC13063.1|AL445503 conserved hypothetical protein from Streptomyces coelicolor (268 aa); and similar to NP_438100.1|NC_003078 putative regulator of phenylacetic acid degradation ArsR family protein from Sinorhizobium meliloti (306 aa) and other proteins e.g. AB011837|AB011837_13 hypothetical protein from Bacillus halodurans (298 aa), FASTA scores: opt: 148, E(): 0.0081, (25.1% identity in 235 aa overlap); etc. Mb0693 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR012906" /db_xref="InterPro:IPR013225" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3XW25" /protein_id="SIT99291.1" /translation="MPAMTARSVVLSVLLGAHPAWATASELIQLTADFGIKETTLRVA LTRMVGAGDLVRSADGYRLSDRLLARQRRQDEAMRPRTRAWHGNWHMLIVTSIGTDAR TRAALRTCMHHKRFGELREGVWMRPDNLDLDLESDVAARVRMLTARDEAPADLAGQLW DLSGWTEAGHRLLGDMAAATDMPGRFVVAAAMVRHLLTDPMLPAELLPADWPGAGLRA AYHDFATAMAKRRDATQLLEVT" CDS 776552..777343 /codon_start=1 /transl_table=11 /gene="echA5" /locus_tag="BQ2027_MB0694" /product="PROBABLE ENOYL-COA HYDRATASE ECHA5 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0694, echA5, len: 263 aa. Equivalent to Rv0675, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 263 aa overlap). Probable echA5, enoyl-CoA hydratase (EC 4.2.1.17), similar to several e.g. NP_252116.1|NC_002516 probable enoyl CoA-hydratase/isomerase from Pseudomonas aeruginosa (256 aa); Q20376 PROTEIN SIMILAR TO ENOYL-COA HYDRATASE from Caenorhabditis elegans (258 aa), FASTA scores: opt: 697, E(): 0, (47.3% identity in 245 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Z92669|MTCY8D5_17 (262 aa), FASTA scores: opt: 493, E(): 3.6e-25, (39.1% identity in 243 aa overlap); etc. Protein product from Mb0694 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0694 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY72" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XY72" /protein_id="SIT99292.1" /translation="MSDLVRVERKGRVTTVILNRPASRNAVNGPTAAALCAAFEQFDR DDAASVAVLWGAGGTFCAGADLKAFGTPEANSVHRTGPGPMGPSRMMLSKPVIAAVSG YAVAGGLELALWCDLRVAEEDAVFGVFCRRWGVPLIDGGTVRLPRLIGHSRAMDMILT GRGVPADEALAMGLANRVVPKGQARQAAEELAAQLAALPQQCLRSDRLSALHQWGLPE SAALDLEFASIARVAGEALEGARRFAAGAGRHGAPAPRAEQGDTL" CDS complement(777355..780249) /codon_start=1 /transl_table=11 /gene="mmpL5" /locus_tag="BQ2027_MB0695C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL5" /note="Mb0695c, mmpL5, len: 964 aa. Equivalent to Rv0676c, len: 964 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 964 aa overlap). Probable mmpL5, conserved transmembrane transport protein (see first citation below), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. MTV037_14, MTCY98_8, MTCY20G9_34, MTCY4D9_15, MTCY48_8, MTCY19G5_6, MTV005_19, etc. Also similar to other Mycobacterial mmpl proteins e.g. P54881|MML4_MYCLE PUTATIVE MEMBRANE PROTEIN MMPL4 from Mycobacterium leprae (959 aa), FASTA scores: opt: 3991, E(): 0, (62.8% identity in 933 aa overlap); etc. BELONGS TO THE MMPL FAMILY. TBparse score is 0.884. Protein product from Mb0695c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0695c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX34" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3XX34" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99293.1" /translation="MIVQRTAAPTGSVPPDRHAARPFIPRMIRTFAVPIILGWLVTIA VLNVTVPQLETVGQIQAVSMSPDAAPSMISMKHIGKVFEEGDSDSAAMIVLEGQRPLG DAAHAFYDQMIGRLQADTTHVQSLQDFWGDPLTATGAQSSDGKAAYVQVKLAGNQGES LANESVEAVKTIVERLAPPPGVKVYVTGSAALVADQQQAGDRSLQVIEAVTFTVIIVM LLLVYRSIITSAIMLTMVVLGLLATRGGVAFLGFHRIIGLSTFATNLLVVLAIAAATD YAIFLIGRYQEARGLGQDRESAYYTMFGGTAHVVLGSGLTIAGATFCLSFTRLPYFQT LGVPLAIGMVIVVAAALTLGPAIIAVTSRFGKLLEPKRMARVRGWRKVGAAIVRWPGP ILVGAVALALVGLLTLPGYRTNYNDRNYLPADLPANEGYAAAERHFSQARMNPEVLMV ESDHDMRNSADFLVINKIAKAIFAVEGISRVQAITRPDGKPIEHTSIPFLISMQGTSQ KLTEKYNQDLTARMLEQVNDIQSNIDQMERMHSLTQQMADVTHEMVIQMTGMVVDVEE LRNHIADFDDFFRPIRSYFYWEKHCYDIPVCWSLRSVFDTLDGIDVMTEDINNLLPLM QRLDTLMPQLTAMMPEMIQTMKSMKAQMLSMHSTQEGLQDQMAAMQEDSAAMGEAFDA SRNDDSFYLPPEVFDNPDFQRGLEQFLSPDGHAVRFIISHEGDPMSQAGIARIAKIKT AAKEAIKGTPLEGSAIYLGGTAAMFKDLSDGNTYDLMIAGISALCLIFIIMLIITRSV VAAAVIVGTVVLSLGASFGLSVLIWQHILGIELHWLVLAMAVIILLAVGADYNLLLVA RLKEEIHAGINTGIIRAMGGSGSVVTAAGLVFAFTMMSFAVSELTVMAQVGTTIGMGL LFDTLIVRSFMTPSIAALLGKWFWWPQVVRQRPVPQPWPSPASARTFALV" CDS complement(780246..780674) /codon_start=1 /transl_table=11 /gene="mmpS5" /locus_tag="BQ2027_MB0696C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN MMPS5" /note="Mb0696c, mmpS5, len: 142 aa. Equivalent to Rv0677c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Possible mmpS5, conserved membrane protein (see first citation below), highly similar to other Mycobacterial proteins e.g. P54880|MMS4_MYCLE PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 443, E(): 1.4e-23, (47.1% identity in 155 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis. BELONGS TO THE MMPS FAMILY. TBparse score is 0.901. Protein product from Mb0696c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0696c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65381" /db_xref="InterPro:IPR008693" /db_xref="InterPro:IPR038468" /db_xref="UniProtKB/Swiss-Prot:P65381" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99294.1" /translation="MIGTLKRAWIPLLILVVVAIAGFTVQRIRTFFGSEGILVTPKVF ADDPEPFDPKVVEYEVSGSGSYVNINYLDLDAKPQRIDGAALPWSLTLKTTAPSAAPN ILAQGDGTSITCRITVDGEVKDERTATGVDALTYCFVKSA" CDS 780759..781256 /codon_start=1 /transl_table=11 /gene="mmpR5" /locus_tag="BQ2027_MB0697" /product="MarR family transcriptional regulator associated with MmpL5/MmpS5 efflux system" /note="Mb0697, -, len: 165 aa. Equivalent to Rv0678, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Conserved hypothetical protein, showing weak similarity with AL049754|SCH10_10 hypothetical protein from Streptomyces coelicolor (152 aa), FASTA scores: opt: 149, E(): 0.0018, (22.9% identity in 140 aa overlap). TBparse score is 0.910. Protein product from Mb0697 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0697 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99295.1" /translation="MSVNDGVDQMGAEPDIMEFVEQMGGYFESRSLTRLAGRLLGWLL VCDPERQSSEELATALAASSGGISTNARMLIQFGFIERLAVAGDRRTYFRLRPNAFAA GERERIRAMAELQDLADVGLRALGDAPPQRSRRLREMRDLLAYMENVVSDALGRYSQR TGEDD" CDS complement(781312..781809) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0698C" /product="conserved threonine rich protein" /note="Mb0698c, -, len: 165 aa. Equivalent to Rv0679c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Conserved hypothetical Thr-rich protein, similar in part to neighboring ORF Rv0680c (124 aa), FASTA score: (35.1% identity in 131 aa overlap); and Rv0314c (220 aa). Contains probable N-terminal signal sequence. TBparse score is 0.894. Protein product from Mb0698c detected using SWATH mass spectrometry. Mb0698c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021417" /db_xref="UniProtKB/TrEMBL:A0A1R3XW26" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99296.1" /translation="MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATT TPATATTTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLNVA GSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGDPTIDNLGAGN RINKE" CDS complement(781811..782185) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0699C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0699c, -, len: 124 aa. Equivalent to Rv0680c, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Possible conserved transmembrane protein, showing similarity with C-terminal part of Rv0314c|Z96800|MTCY63.19c conserved hypothetical protein from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 175, E(): 2.2e-05, (31.4% identity in 102 aa overlap). Also some similarity to upstream ORF Rv0679c|MTV040.07c CONSERVED HYPOTHETICAL THREONINE RICH PROTEIN (124 aa), FASTA score: (35.1% identity in 131 aa overlap). Contains probable N-terminal signal sequence. Mb0699c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW62" /db_xref="InterPro:IPR021417" /db_xref="UniProtKB/TrEMBL:A0A1R3XW62" /protein_id="SIT99297.1" /translation="MKWNTVAASLAAGVITIAVALAAPPPAAHAKNGDTHVTGQGIER TLDCNESTLLVNGTQNIVTALGTCWAVTVMGSSNTVVADTIINDITVYGWDETVFFRN GDPFIWDRGRELGMVNRLQRVG" CDS 782490..783080 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0700" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY TETR-FAMILY)" /note="Mb0700, -, len: 196 aa. Equivalent to Rv0681, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 196 aa overlap). Probable transcription regulator, TetR family, similar to others and especially many tetracycline repressors e.g. T34657 probable transcription regulator from Streptomyces coelicolor (189 aa); AF0278|AF027868_40|NP_389788.1|NC_000964 yobS regulator from Bacillus subtilis (191 aa), FASTA scores: opt: 213, E(): 1.6e-07, (28.8% identity in 153 aa overlap); P09164|TER4_ECOLI TETRACYCLINE REPRESSOR PROTEIN from Escherichia coli (217 aa), FASTA scores: opt: 145, E(): 0.0068, (39.0% identity in 59 aa overlap); etc. Contains helix-turn-helix motif at aa 28-49 (Score 1020, +2.66 SD). Protein product from Mb0700 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0700 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XW36" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR025996" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3XW36" /protein_id="SIT99298.1" /translation="MARPAKLSRESIVEGALTFLDREGWDSLTINALATQLGTKGPSL YNHVDSLEDLRRAVRIRVIDDIITMLNRVGAGRARDDAVLVMAGAYRSYAHHHPGRYS AFTRMPLGGDDPEYTAATRGAAAPVIAVLSSYGLDGEQAFYAALEFWSALHGFVLLEM TGVMDDIDTDAVFTDMVLRLAAGMERRTTHGGTAST" CDS 783329..783703 /codon_start=1 /transl_table=11 /gene="rpsL" /locus_tag="BQ2027_MB0701" /product="30s ribosomal protein s12 rpsl" /note="Mb0701, rpsL, len: 124 aa. Equivalent to Rv0682, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Probable rpsL, 30S ribosomal protein S12 (see citations below), equivalent to others from Mycobacteria e.g. P41195|RS12_MYCSM 30S RIBOSOMAL PROTEIN S12 from Mycobacterium smegmatis (124 aa); P51999|RS12_MYCAV 30S RIBOSOMAL PROTEIN S12 from Mycobacterium avium (124 aa); etc. Also highly similar to others from other organisms e.g. P97222|RS12_STRCO 30S RIBOSOMAL PROTEIN S12 from Streptomyces roseosporus, lividans and coelicolor (123 aa); etc. Contains PS00055 Ribosomal protein S12 signature. BELONGS TO THE S12P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0701 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0701 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q53538" /db_xref="InterPro:IPR005679" /db_xref="InterPro:IPR006032" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/Swiss-Prot:Q53538" /protein_id="SIT99299.1" /translation="MPTIQQLVRKGRRDKISKVKTAALKGSPQRRGVCTRVYTTTPKK PNSALRKVARVKLTSQVEVTAYIPGEGHNLQEHSMVLVRGGRVKDLPGVRYKIIRGSL DTQGVKNRKQARSRYGAKKEKG" CDS 783703..784173 /codon_start=1 /transl_table=11 /gene="rpsG" /locus_tag="BQ2027_MB0702" /product="30s ribosomal protein s7 rpsg" /note="Mb0702, rpsG, len: 156 aa. Equivalent to Rv0683, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Probable rpsG, 30S ribosomal protein S7 (see citation below), equivalent to others from Mycobacteria e.g. P41193|RS7_MYCSM 30S RIBOSOMAL PROTEIN S7 from Mycobacterium smegmatis (156 aa), FASTA scores: opt: 986, E(): 0, (96.2% identity in 156 aa overlap); Q53539|RS7_MYCBO 30S RIBOSOMAL PROTEIN S7 from Mycobacterium bovis (156 aa); etc. Also highly similar to others e.g. Q9L0K4|RS7_STRCO 30S RIBOSOMAL PROTEIN S7 from Streptomyces coelicolor (156 aa); etc. Contains PS00052 Ribosomal protein S7 signature. BELONGS TO THE S7P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0702 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0702 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q53539" /db_xref="InterPro:IPR000235" /db_xref="InterPro:IPR005717" /db_xref="InterPro:IPR020606" /db_xref="InterPro:IPR023798" /db_xref="InterPro:IPR036823" /db_xref="UniProtKB/Swiss-Prot:Q53539" /protein_id="SIT99300.1" /translation="MPRKGPAPKRPLVNDPVYGSQLVTQLVNKVLLKGKKSLAERIVY GALEQARDKTGTDPVITLKRALDNVKPALEVRSRRVGGATYQVPVEVRPDRSTTLALR WLVGYSRQRREKTMIERLANEILDASNGLGASVKRREDTHKMAEANRAFAHYRW" CDS 784254..786359 /codon_start=1 /transl_table=11 /gene="fusA1" /locus_tag="BQ2027_MB0703" /standard_name="fusA" /product="PROBABLE ELONGATION FACTOR G FUSA1 (EF-G)" /note="Mb0703, fusA1, len: 701 aa. Equivalent to Rv0684, len: 701 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 701 aa overlap). Probable fusA1, elongation factor G, equivalent to P30767|EFG_MYCLE|S31150 translation elongation factor EF-G from Mycobacterium leprae (701 aa), FASTA scores: opt: 2521, E(): 0, (88.2% identity in 432 aa overlap). Also highly similar to others e.g. CAB81852.1|AL161691 elongation factor G from Streptomyces coelicolor (708 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00301 GTP-binding elongation factors signature. BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, EF-G/EF-2 SUBFAMILY. Note that previously known as fusA. Protein product from Mb0703 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0703 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A557" /db_xref="InterPro:IPR000640" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR004161" /db_xref="InterPro:IPR004540" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR005517" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR009022" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031157" /db_xref="InterPro:IPR035647" /db_xref="InterPro:IPR035649" /db_xref="InterPro:IPR041095" /db_xref="UniProtKB/Swiss-Prot:P0A557" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99301.1" /translation="MAQKDVLTDLSRVRNFGIMAHIDAGKTTTTERILYYTGINYKIG EVHDGAATMDWMEQEQERGITITSAATTTFWKDNQLNIIDTPGHVDFTVEVERNLRVL DGAVAVFDGKEGVEPQSEQVWRQADKYDVPRICFVNKMDKIGADFYFSVRTMGERLGA NAVPIQLPVGAEADFEGVVDLVEMNAKVWRGETKLGETYDTVEIPADLAEQAEEYRTK LLEVVAESDEHLLEKYLGGEELTVDEIKGAIRKLTIASEIYPVLCGSAFKNKGVQPML DAVVDYLPSPLDVPPAIGHAPAKEDEEVVRKATTDEPFAALAFKIATHPFFGKLTYIR VYSGTVESGSQVINATKGKKERLGKLFQMHSNKENPVDRASAGHIYAVIGLKDTTTGD TLSDPNQQIVLESMTFPDPVIEVAIEPKTKSDQEKLSLSIQKLAEEDPTFKVHLDSET GQTVIGGMGELHLDILVDRMRREFKVEANVGKPQVAYKETIKRLVQNVEYTHKKQTGG SGQFAKVIINLEPFTGEEGATYEFESKVTGGRIPREYIPSVDAGAQDAMQYGVLAGYP LVNLKVTLLDGAYHEVDSSEMAFKIAGSQVLKKAAALAQPVILEPIMAVEVTTPEDYM GDVIGDLNSRRGQIQAMEERAGARVVRAHVPLSEMFGYVGDLRSKTQGRANYSMVFDS YSEVPANVSKEIIAKATGE" CDS 786590..787780 /codon_start=1 /transl_table=11 /gene="tuf" /locus_tag="BQ2027_MB0704" /product="probable iron-regulated elongation factor tu tuf (ef-tu)" /note="Mb0704, tuf, len: 396 aa. Equivalent to Rv0685, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 396 aa overlap). Probable tuf, elongation factor EF-Tu, equivalent to JC2262 translation elongation factor Tu from Mycobacterium leprae (396 aa). Also highly similar to others e.g. P42439|EFTU_CORGL ELONGATION FACTOR TU (EF-TU) from Corynebacterium glutamicum (396 aa); etc. Contains PS00017 ATP/GTP-binding site motif A, and PS00301 GTP-binding elongation factors signature. BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, EF-TU/EF-1A SUBFAMILY. Protein product from Mb0704 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0704 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A559" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR004160" /db_xref="InterPro:IPR004161" /db_xref="InterPro:IPR004541" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR009001" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031157" /db_xref="InterPro:IPR033720" /db_xref="InterPro:IPR041709" /db_xref="UniProtKB/Swiss-Prot:P0A559" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99302.1" /translation="MAKAKFQRTKPHVNIGTIGHVDHGKTTLTAAITKVLHDKFPDLN ETKAFDQIDNAPEERQRGITINIAHVEYQTDKRHYAHVDAPGHADYIKNMITGAAQMD GAILVVAATDGPMPQTREHVLLARQVGVPYILVALNKADAVDDEELLELVEMEVRELL AAQEFDEDAPVVRVSALKALEGDAKWVASVEELMNAVDESIPDPVRETDKPFLMPVED VFTITGRGTVVTGRVERGVINVNEEVEIVGIRPSTTKTTVTGVEMFRKLLDQGQAGDN VGLLLRGVKREDVERGQVVTKPGTTTPHTEFEGQVYILSKDEGGRHTPFFNNYRPQFY FRTTDVTGVVTLPEGTEMVMPGDNTNISVKLIQPVAMDEGLRFAIREGGRTVGAGRVT KIIK" CDS 787918..788715 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0705" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb0705, -, len: 265 aa. Equivalent to Rv0686, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 265 aa overlap). Probable membrane protein, with hydrophobic N-terminus. Protein product from Mb0705 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0705 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX44" /db_xref="UniProtKB/TrEMBL:A0A1R3XX44" /protein_id="SIT99303.1" /translation="MLARYIKMQLLVLLCGGLVGPIFLVVYFTLGLGSLMSWMFYVGL IITVADVLVALALTNYGAKTAAKTAALERSGVLALAQSTGLSETGTRINDQPLVKVHL HISGPGITPFDTEDRVIASVTRLGNLTARKLVVLVNPATQQYLIDWERSALVNGLVPA QFTVAEDNKTYDLSGQTGPLMEILQILKANNVPLNRMVDIRSNPALRQQVQAVVRRAA ERQAPAAEPASQGSIAERLAELESLRASGAVNAAEYESKRAQIISEI" CDS 788868..789695 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0706" /product="probable short-chain type dehydrogenase/reductase" /note="Mb0706, -, len: 275 aa. Equivalent to Rv0687, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases (generally SDR family) e.g. U17129|RSU17129_7 short-chain dehydrogenase from Rhodococcus erythropolis (275 aa), FASTA scores: opt: 1112, E(): 0, (61.2% identity in 268 aa overlap); MMU34072_2 steroid dehydrogenase from Musmus culus (260 aa), FASTA scores: opt: 390, E(): 2.2e-17, (34.1% identity in 267 aa overlap); etc. Also similar to MTV002_16|O33292|Rv2750 DEHYDROGENASE from Mycobacterium tuberculosis (272 aa). Contains PS00061 Short-chain alcohol dehydrogenase family signature. Protein product from Mb0706 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0706 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW77" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR023985" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XW77" /protein_id="SIT99304.1" /translation="MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICA PVSGSVTYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGRLD IVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAGNGGSIVVVSS SAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNSIHPYSVDTPMIEPEAMIQ TFAKHPGYVHSFPPMPLQPKGFMTPDEISDVVVWLAGDGSGALSGNQIPVDKGALKY" CDS 789709..790929 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0707" /product="PUTATIVE FERREDOXIN REDUCTASE" /note="Mb0707, -, len: 406 aa. Equivalent to Rv0688, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Putative ferredoxin reductase (EC 1.-.-.-), highly similar to others e.g. BAB55881.1|AB054975 ferredoxin reductase from Terrabacter sp. DBF63 (410 aa); CAC04223.1|AL391515 putative ferredoxin reductase from Streptomyces coelicolor (420 aa); PPU24215_8|Q51973 P-CUMATE DIOXYGENASE FERREDOXIN REDUCTASE SUBUNIT from Pseudomonas putida (402 aa), FASTA scores: opt: 738, E(): 0, (38.8% identity in 330 aa overlap); etc. Also similar to Rv0253 and Rv1869c from Mycobacterium tuberculosis. COULD BELONG TO THE BACTERIAL TYPE FERREDOXIN FAMILY. Protein product from Mb0707 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0707 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWI4" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR028202" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWI4" /protein_id="SIT99305.1" /translation="MNAHVTSREGVNEFDDGIVIVGGGLAAARTAEQLRRAGYSGRLT IVSDEVHLPYDRPPLSKEVLRSEVDDVALKPREFYDEKDIALRLGSAAVSLDTGEQTV TLADGTVLGYDELVIATGLVPRRIPSLPDLDGIRVLRSFDESMALRKHASAARHAVVV GAGFIGCEVAASLRGLGVDVVLVEPQPAPLASVLGEQIGQLVTRLHRDEGVDVRTGVT VAEVRGKGHVDAVVLTDGTELPADLVVVGIGSTPATEWLEGSGVEVDNGVICDKAGRT SAPNVWALGDVASWRDPMGHQARVEHWSNVADQARVVVPAMLGTDVPTGVVVPYFWSD QYDVKIQCLGEPHATDVVHLVEDDGRKFLAYYERDGVLVGVVGGGMAGKVMKVRGKIA AGAPIAEVLDQTQA" CDS complement(790926..791180) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0708C" /product="HYPOTHETICAL PROTEIN" /note="Mb0708c, -, len: 84 aa. Equivalent to Rv0689c, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XW37" /protein_id="SIT99306.1" /translation="MLGWTVKPGRVADGWQAPGVHLMARCSGPQPASERRADMDGGDI DAAVARVRAAGALAEPSRQPDDMSAECADDQGARCHLGQL" CDS complement(791793..792842) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0709C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0709c, -, len: 349 aa. Equivalent to Rv0690c, len: 349 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 349 aa overlap). Conserved hypothetical protein, showing similarity with NP_386956.1|NC_003047 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (358 aa); NP_356573.1|NC_003063 AGR_L_1570p from Agrobacterium tumefaciens (346 aa); NP_421938.1|NC_002696 conserved hypothetical protein from Caulobacter crescentus (370 aa). Protein product from Mb0709c detected using SWATH mass spectrometry. Mb0709c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011200" /db_xref="UniProtKB/TrEMBL:A0A1R3XW72" /protein_id="SIT99307.1" /translation="MTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFA SILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRT ATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPD RYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNA LSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQ YLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHAR VLGECHPHGPPVTWQ" CDS complement(792839..793435) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0710C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0710c, -, len: 198 aa. Equivalent to Rv0691c, len: 198 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 198 aa overlap). Rv0691c, (MTCY210.08c), len: 198 aa. Probable transcriptional regulator, highly similar to AAC77476.1|U17129 unknown protein from Rhodococcus erythropolis (185 aa); and showing similarity with putative regulatory proteins eg STMTCREP_1|TCMR_STRGA|P39885 tetracenomycin c transcriptional repressor from Streptomyces glaucescens (226 aa), FASTA scores: opt: 178, E(): 8.5e-06, (27.9% identity in 201 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and probable helix-turn helix motifs from aa 34-55 (Score 1100, +2.93 SD) and 151-172 (Score 1124, +3.02 SD). Mb0710c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW46" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023851" /db_xref="InterPro:IPR041347" /db_xref="UniProtKB/TrEMBL:A0A1R3XW46" /protein_id="SIT99308.1" /translation="MPHESRVGRRRSTTPHHISDVAIELFAAHGFTDVSVDDIARAAG IARRTLFRYYASKNAIPWGDFSTHLAQLQGLLDNIDSRIQLRDALRAALLAFNTFDES ETIRHRKRMRVILQTPELQAYSMTMYAGWREVIAKFVARRSGGKTTDFMPQTVAWTML GVALSAYEHWLRDESVSLTEALGAAFDVVGAGLDRLNQ" CDS 793427..793615 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0710A" /product="Mycofactocin precursor protein" /note="Mb0710A, len: 62 aa. Equivalent to Rv0691A len: 62 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 62 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Mycofactocin precursor protein. Mb0710A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023988" /db_xref="UniProtKB/TrEMBL:A0A1R3XW42" /protein_id="SIT99309.1" /translation="MRHHIRPSISALDAILCPDRRIAVETCWRKAIQMDYETDTDTEL VTETLVEEVSIDGMCGVY" CDS 793600..793929 /codon_start=1 /transl_table=11 /gene="mftB" /locus_tag="BQ2027_MB0711" /product="Mycofactocin binding protein MftB" /note="Mb0711, -, len: 109 aa. Equivalent to Rv0692, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Conserved hypothetical protein, highly similar to U17129|RSU17129_3|AAC77477.1 unknown protein from Rhodococcus erythropolis (95 aa), FASTA scores: opt: 393, E(): 8.8e-22, (68.2% identity in 88 aa overlap). Mb0711 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023850" /db_xref="UniProtKB/TrEMBL:A0A1R3XW34" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99310.1" /translation="MWGLLTVPAPAQARRADSSEFDPDRGWRLHPQVAVRPEPFGALL YHFGTRKLSFLKNRTILAVVQTLADYPDIRSACRGAGVDDCDQDPYLHALSVLAGSNM LVPRQTT" CDS 793926..795101 /codon_start=1 /transl_table=11 /gene="pqqE" /locus_tag="BQ2027_MB0712" /product="PROBABLE COENZYME PQQ SYNTHESIS PROTEIN E PQQE (COENZYME PQQ SYNTHESIS PROTEIN III)" /note="Mb0712, pqqE, len: 391 aa. Equivalent to Rv0693, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 391 aa overlap). Probable pqqE (alternate gene name: pqqIII), coenzyme PQQ synthesis protein E, similar to others AE001109_9|O30258|PQQE COENZYME PQQ SYNTHESIS PROTEIN from Archaeoglobus fulgidus (375 aa), FASTA scores: E(): 1.6e-16, (28.1% identity in 377 aa overlap); PQQE_ACICA|P07782 coenzyme pqq synthesis protein e from Acinetobacter calcoaceticus (384 aa), FASTA scores: opt: 302, E(): 1.8e-12, (23.9% identity in 377 aa overlap); etc. Also similar to C-terminus of heme biosynthesis proteins e.g. O28270|AF2009 HEME BIOSYNTHESIS PROTEIN (NIRJ-2) from Archaeoglobus fulgidus (468 aa). Note that also highly similar to U17129|RSU17129_4|AAC77478.1 unknown protein from Rhodococcus erythropolis (405 aa), FASTA scores: opt: 1997, E(): 0, (73.3% identity in 390 aa overlap). COULD BELONG TO THE MOAA / NIFB / PQQE FAMILY. Mb0712 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW47" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR017200" /db_xref="InterPro:IPR023885" /db_xref="InterPro:IPR023913" /db_xref="InterPro:IPR034391" /db_xref="UniProtKB/TrEMBL:A0A1R3XW47" /protein_id="SIT99311.1" /translation="MTSPVPRLIEQFERGLDAPICLTWELTYACNLACVHCLSSSGKR DPGELSTRQCKDIIDELERMQVFYVNIGGGEPTVRPDFWELVDYATAHHVGVKFSTNG VRITPEVATRLAATDYVDVQISLDGATAEVNDAIRGTGSFDMAVRALQNLAAAGFAGV KISVVITRRNVAQLDEFATLASRYGATLRITRLRPSGRGTDVWADLHPTADQQVQLYD WLVSKGERVLTGDSFFHLAPLGQSGALAGLNMCGAGRVVCLIDPVGDVYACPFAIHDH FLAGNVLSDGGFQNVWKNSSLFRELREPQSAGACGSCGHYDSCRGGCMAAKFFTGLPL DGPDPECVQGHSEPALARERHLPRPRADHSRGRRVSKPVPLTLSMRPPKRPCNESPV" CDS 795104..796294 /codon_start=1 /transl_table=11 /gene="lldD1" /locus_tag="BQ2027_MB0713" /product="POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) LLDD1" /note="Mb0713, lldD1, len: 396 aa. Equivalent to Rv0694, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 396 aa overlap). Possible lldD1, L-lactate dehydrogenase (cytochrome) (EC 1.1.2.3), similar to NP_302368.1|NC_002677 L-lactate dehydrogenase from Mycobacterium leprae (414 aa). Also similar to others e.g. NP_384560.1|NC_003047 PUTATIVE L-LACTATE DEHYDROGENASE (CYTOCHROME) PROTEIN from Sinorhizobium meliloti (403 aa); NP_251072.1|NC_002516 L-lactate dehydrogenase from Pseudomonas aeruginosa (383 aa); P33232|LLDD_ECOLI L-lactate dehydrogenase (cytochrome) from Escherichia coli strain K12 (396 aa), FASTA scores: opt: 697, E(): 0, (34.5 identity in 380 aa overlap); etc; and also similar to other oxidoreductases. Note that also highly similar to RSU17129_5|AAC77479.1|U17129 unknown protein from Rhodococcus erythropolis (392 aa), FASTA scores: opt: 2006, E(): 0, (74.1% identity in 386 aa overlap). Also similar to lldD2|Rv1872c|MTCY180.46|MTCY359.01 POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) from Mycobacterium tuberculosis (414 aa). BELONGS TO THE FMN-DEPENDENT ALPHA-HYDROXY ACID DEHYDROGENASES FAMILY. Protein product from Mb0713 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0713 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY91" /db_xref="InterPro:IPR000262" /db_xref="InterPro:IPR012133" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR023989" /db_xref="InterPro:IPR037396" /db_xref="UniProtKB/TrEMBL:A0A1R3XY91" /protein_id="SIT99312.1" /translation="MAEAWFETVAIAQQRAKRRLPKSVYSSLIAASEKGITVADNVAA FSELGFAPHVIGATDKRDLSTTVMGQEVSLPVIISPTGVQAVDPGGEVAVARAAAARG TVMGLSSFASKPIEEVIAANPKTFFQVYWQGGRDALAERVERARQAGAVGLVVTTDWT FSHGRDWGSPKIPEEMNLKTILRLSPEAITRPRWLWKFAKTLRPPDLRVPNQGRRGEP GPPFFAAYGEWMATPPPTWEDIGWLRELWGGPFMLKGVMRVDDAKRAVDAGVSAISVS NHGGNNLDGTPASIRALPAVSAAVGDQVEVLLDGGIRRGSDVVKAVALGARAVMIGRA YLWGLAANGQAGVENVLDILRGGIDSALMGLGHASVHDLSPADILVPTGFIRDLGVPS RRDV" CDS 796484..797239 /codon_start=1 /transl_table=11 /gene="mftE" /locus_tag="BQ2027_MB0714" /product="Cyclic amid hydrolase in mycofactocin cluster" /note="Mb0714, -, len: 251 aa. Equivalent to Rv0695, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Conserved hypothetical protein, similar to many creatinine amidohydrolases or hypothetical proteins e.g. NP_443048.1|NC_000911 creatinine amidohydrolase from Synechococcus sp. PCC 6803 (273 aa); NP_466169.1|NC_003210 protein similar to creatinine amidohydrolase from Listeria monocytogenes (249 aa); T35153|SC5A7.04c hypothetical protein from Streptomyces coelicolor (273 aa); etc. Note that highly similar to RSU17129_10|AAC77474.1|U17129 unknown protein from Rhodococcus erythropolis (230 aa), FASTA scores: opt: 693, E(): 0, (55.7% identity in 237 aa overlap). Mb0714 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003785" /db_xref="InterPro:IPR023871" /db_xref="InterPro:IPR024087" /db_xref="UniProtKB/TrEMBL:A0A1R3XX53" /protein_id="SIT99313.1" /translation="MNSSYHRRVPVVGELGSATSSQLPSTSPSIVIPLGSTEQHGPHL PLDTDTRIATAVARTVTARLHAEDLPIAQEEWLMAPAIAYGASGEHQRFAGTISIGTE ALTMLLVEYGRSAACWARRLVFVNGHGGNVGALTRAVGLLRAEGRDAGWCPCTCPGGD PHAGHTETSVLLHLSPADVRTERWRAGNRAPLPVLLPSMRRGGVAAVSETGVLGDPTT ATAAEGRRIFAAMVDDCVRRVARWMPQPDGMLT" CDS 797288..798700 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0715" /product="PROBABLE MEMBRANE SUGAR TRANSFERASE" /note="Mb0715, -, len: 470 aa. Equivalent to Rv0696, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 470 aa overlap). Probable membrane sugar transferase (EC 2.-.-.-), similar (except in N-terminus) to NP_069157.1|NC_000917 glycosyl transferase from Archaeoglobus fulgidus (324 aa); NP_279985.1|NC_002607 rhamnosyl transferase from Halobacterium sp. NRC-1 (299 aa); NP_059113.1|NM_017417 polypeptide N-acetylgalactosaminyltransferase 8 from (637 aa). Note that also highly similar to P46370|YTH1_RHOER HYPOTHETICAL 55.3 KDA PROTEIN from Rhodococcus erythropolis (513 aa), FASTA scores: opt: 1514, E(): 0, (51.8% identity in 469 aa overlap). Protein product from Mb0715 detected using SWATH mass spectrometry. Mb0715 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XW90" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR023981" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3XW90" /protein_id="SIT99314.1" /translation="MTATRLPDGFAVQVDRRVRVLGDGSALLGGSPTRLLRLAPAARG LLCDGRLKVRDEVSAELARILLDATVAHPRPPSGPSHRDVTVVIPVRNNASGLRRLVT SLRGLRVIVVDDGSACPVESDDFVGAHCDIEVLHHPHSKGPAAARNTGLAACTTDFVA FLDSDVTPRRGWLESLLGHFCDPTVALVAPRIVSLVEGENPVARYEALHSSLDLGQRE APVLPHSTVSYVPSAAIVCRSSAIRDVGGFDETMHSGEDVDLCWRLIEAGARLRYEPI ALVAHDHRTQLRDWIARKAFYGGSAAPLAVRHPDKTAPLVISGGALMAWILMSIGTGL GRLASLVIAVLTGRRIARAMRCAETSFLDVLAVATRGLWAAALQLASAICRHYWPLAL LAAILSRRCRRVVLIAAVVDGVVDWLRRREGADDDAEPIGPLTYLVLKRVDDLAYGAG LWYGVVRERNIGALKPQIRT" CDS 798702..800141 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0716" /product="PROBABLE DEHYDROGENASE" /note="Mb0716, -, len: 479 aa. Equivalent to Rv0697, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). Probable dehydrogenase (EC 1.-.-.-), highly similar to P30772|YTUR_MYCLE HYPOTHETICAL 24 KD PROTEIN from Mycobacterium leprae (220 aa), FASTA scores: opt: 557, E(): 1.7e-28, (46.2% identity in 223 aa overlap). Also highly similar to P46371|YTH2_RHOER HYPOTHETICAL 53.0 KDA GMC-TYPE OXIDOREDUCTASE from Rhodococcus erythropolis (493 aa); and similar to many dehydrogenases e.g. NP_250814.1|NC_002516 probable dehydrogenase from Pseudomonas aeruginosa (545 aa); BAA13145.1|D86622 FAD dependent L-sorbose dehydrogenase from Gluconobacter oxydans (531 aa); etc. Also similar to Rv1279 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis. Protein product from Mb0716 detected using SWATH mass spectrometry. Mb0716 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWJ3" /db_xref="InterPro:IPR000172" /db_xref="InterPro:IPR007867" /db_xref="InterPro:IPR012132" /db_xref="InterPro:IPR023978" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ3" /protein_id="SIT99315.1" /translation="MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLA DPGLLAQTANGLQLPIGAGSPLVERYRTRLTDRPVRHLPIVRGATVGGSGAINGGYFC RGLPSDFDRASIPGWAWSDVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITES FMAAAEDAGFAWIADLNDVGPEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTL LARTRAVRLRFSATTAVGVDAIGPGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEV LRSAGVKVLMALPVGMGCSDHPEWVMPTNWAVAVDRPVLEVLLSTHDGIEIRPYTGGF VAMTGDGTAGHRDWPHIGVALMQPRARGRITLVSSDPQIPVRIEHRYDSEPADVAALR QGSALAHELCGAATRIGPAVWATSQHLCGSAPMGTDDDPRAVVDPRCRVRGIENLWVI DGSVLPSITSRGPHATIVMLGHRAAEFVQ" CDS 800602..800931 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0717" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb0717, -, len: 109 aa. Equivalent to 5' end of Rv0698, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (98.1% identity in 106 aa overlap). Conserved hypothetical protein, highly similar to C-terminus of Rv3639c|MTY15C10.12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (188 aa), FASTA scores: E(): 2.1e-07, (54.8% identity in 73 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0698 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv0698 into 2 parts, Mb0717 and Mb0718. Mb0717 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XW44" /protein_id="SIT99316.1" /translation="MGRRGNRRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGAN GLPLAVCTTTAHTCHTSHTHPSRWTPNPVPATKGVPAGLVQATFIIENLDPGNNDTPT PLHPNCD" CDS 800979..801212 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0718" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb0718, -, len: 77 aa. Equivalent to 3' end of Rv0698, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Conserved hypothetical protein, highly similar to C-terminus of Rv3639c|MTY15C10.12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (188 aa), FASTA scores: E(): 2.1e-07, (54.8% identity in 73 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0698 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv0698 into 2 parts, Mb0717 and Mb0718. Mb0718 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XW82" /protein_id="SIT99317.1" /translation="MLRRKDTSRRCVQADDVRCVQLVQDPRRGRVELGGYRAELTVGR RAAVNCQRPQYGADGWPVRLGCGVGGAARGDQR" CDS 801397..801618 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0719" /product="HYPOTHETICAL PROTEIN" /note="Mb0719, -, len: 73 aa. Equivalent to Rv0699, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XW55" /protein_id="SIT99318.1" /translation="MGDRRVDLLAAKDSEIRRSMGAVPVGAGSSQVATSWASDRCIRC RAAILSADCANLARANSRGGLAVGGSAVS" CDS 802255..802560 /codon_start=1 /transl_table=11 /gene="rpsJ" /locus_tag="BQ2027_MB0720" /standard_name="nusE" /product="30S RIBOSOMAL PROTEIN S10 RPSJ (TRANSCRIPTION ANTITERMINATION FACTOR NUSE)" /note="Mb0720, rpsJ, len: 101 aa. Equivalent to Rv0700, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). rpsJ (alternate gene name: nusE), 30S ribosomal protein S10 (see first citation below), equivalent to RS10_MYCLE P307653 30S ribosomal protein S10 from Mycobacterium leprae (101 aa), FASTA scores: opt: 645, E(): 0, (97.0% identity in 101 aa overlap). Also highly similar to others e.g. CAB82069.1|AL161803 30S ribosomal protein S10 from Streptomyces coelicolor (102 aa); etc. Contains PS00361 Ribosomal protein S10 signature. BELONGS TO THE S10P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0720 detected using shotgun mass spectrometry. Mb0720 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5X1" /db_xref="InterPro:IPR001848" /db_xref="InterPro:IPR018268" /db_xref="InterPro:IPR027486" /db_xref="InterPro:IPR036838" /db_xref="UniProtKB/Swiss-Prot:P0A5X1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99319.1" /translation="MAGQKIRIRLKAYDHEAIDASARKIVETVVRTGASVVGPVPLPT EKNVYCVIRSPHKYKDSREHFEMRTHKRLIDIIDPTPKTVDALMRIDLPASVDVNIQ" CDS 802577..803230 /codon_start=1 /transl_table=11 /gene="rplC" /locus_tag="BQ2027_MB0721" /product="50s ribosomal protein l3 rplc" /note="Mb0721, rplC, len: 217 aa. Equivalent to Rv0701, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Probable rplC, 50S ribosomal protein L3, equivalent to O06044|RL3_MYCBO 50S RIBOSOMAL PROTEIN L3 from Mycobacterium bovis BCG (217 aa); and P30762|RL3_MYCLE 50S RIBOSOMAL PROTEIN L3 from Mycobacterium leprae (217 aa). Also highly similar to others e.g. CAB82070.1|AL161803 50S ribosomal protein L3 from Streptomyces coelicolor (214 aa); P52860|RL3_THETH ribosomal protein l3 from Thermus aquaticus (206 aa), FASTA scores: opt: 717, E(): 0, (55.2% identity in 210 aa overlap); etc. Contains PS00474 Ribosomal protein L3 signature. BELONGS TO THE L3P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0721 detected using shotgun mass spectrometry. Mb0721 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P60441" /db_xref="InterPro:IPR000597" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR019926" /db_xref="InterPro:IPR019927" /db_xref="UniProtKB/Swiss-Prot:P60441" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99320.1" /translation="MARKGILGTKLGMTQVFDESNRVVPVTVVKAGPNVVTRIRTPER DGYSAVQLAYGEISPRKVNKPLTGQYTAAGVNPRRYLAELRLDDSDAATEYQVGQELT AEIFADGSYVDVTGTSKGKGFAGTMKRHGFRGQGASHGAQAVHRRPGSIGGCATPARV FKGTRMAGRMGNDRVTVLNLLVHKVDAENGVLLIKGAVPGRTGGLVMVRSAIKRGEK" CDS 803230..803901 /codon_start=1 /transl_table=11 /gene="rplD" /locus_tag="BQ2027_MB0722" /product="50s ribosomal protein l4 rpld" /note="Mb0722, rplD, len: 223 aa. Equivalent to Rv0702, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 223 aa overlap). Probable rplD, 50S ribosomal protein L4, equivalent to O06045|RL4_MYCBO 50S RIBOSOMAL PROTEIN L4 from Mycobacterium bovis BCG (223 aa); O06114|RL4_MYCSM 50S RIBOSOMAL PROTEIN L4 from Mycobacterium smegmatis (215 aa); and MLCB2492_3 50S ribosomal protein L4 from Mycobacterium leprae (230 aa). Also highly similar to others e.g. CAB82071.1|AL161803 50S ribosomal protein L4 from Streptomyces coelicolor (219 aa); P28601|RL4_BACST 50s ribosomal protein L4 from Bacillus stearothermophilus (207 aa), FASTA scores: opt: 522, E(): 3.5e-26, (42.4% identity in 198 aa overlap); etc. BELONGS TO THE L4P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0722 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0722 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P60728" /db_xref="InterPro:IPR002136" /db_xref="InterPro:IPR013005" /db_xref="InterPro:IPR023574" /db_xref="UniProtKB/Swiss-Prot:P60728" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99321.1" /translation="MAAQEQKTLKIDVKTPAGKVDGAIELPAELFDVPANIALMHQVV TAQRAAARQGTHSTKTRGEVSGGGRKPYRQKGTGRARQGSTRAPQFTGGGVVHGPKPR DYSQRTPKKMIAAALRGALSDRARNGRIHAITELVEGQNPSTKSARAFLASLTERKQV LVVIGRSDEAGAKSVRNLPGVHILAPDQLNTYDVLRADDVVFSVEALNAYIAANTTTS EEVSA" CDS 803901..804203 /codon_start=1 /transl_table=11 /gene="rplW" /locus_tag="BQ2027_MB0723" /product="50s ribosomal protein l23 rplw" /note="Mb0723, rplW, len: 100 aa. Equivalent to Rv0703, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). Probable rplW, 50S ribosomal protein L23, equivalent to O06046|RL23_MYCBO 50S RIBOSOMAL PROTEIN L23 from Mycobacterium bovis BCG (100 aa); and MLCB2492_4 50S RIBOSOMAL PROTEIN L23 from Mycobacterium leprae (100 aa). Also highly similar to others e.g. CAB82072.1|AL161803 50S ribosomal protein L23 from Streptomyces coelicolor (139 aa) (N-terminus longer); P04454|RL23_BACST 50s ribosomal protein L23 from Bacillus stearothermophilus (95 aa), FASTA scores: opt: 275, E(): 1.4e-13, (50.5% identity in 95 aa overlap); etc. Contains PS00050 Ribosomal protein L23 signature. BELONGS TO THE L23P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0723 detected using shotgun mass spectrometry. Mb0723 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:O06046" /db_xref="InterPro:IPR001014" /db_xref="InterPro:IPR012677" /db_xref="InterPro:IPR012678" /db_xref="InterPro:IPR013025" /db_xref="UniProtKB/Swiss-Prot:O06046" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99322.1" /translation="MATLADPRDIILAPVISEKSYGLLDDNVYTFLVRPDSNKTQIKI AVEKIFAVKVASVNTANRQGKRKRTRTGYGKRKSTKRAIVTLAPGSRPIDLFGAPA" CDS 804350..805192 /codon_start=1 /transl_table=11 /gene="rplB" /locus_tag="BQ2027_MB0724" /product="50s ribosomal protein l2 rplb" /note="Mb0724, rplB, len: 280 aa. Equivalent to Rv0704, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Probable rplB, 50S ribosomal protein L2, equivalent to O06047|RL2_MYCBO 50S RIBOSOMAL PROTEIN L2 from Mycobacterium bovis BCG (280 aa); and MLCB2492_5M 50S RIBOSOMAL PROTEIN L2 from Mycobacterium leprae (280 aa). Also highly similar to others e.g. CAB82073.1|AL161803 50S ribosomal protein L2 from Streptomyces coelicolor (278 aa); P42919|RL2_BACSU 50s ribosomal protein l2 (bl2) from Bacillus subtilis (276 aa), FASTA scores: opt: 1179, E(): 0, (61.1% identity in 275 aa overlap); etc. Contains PS00467 Ribosomal protein L2 signature. BELONGS TO THE L2P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0724 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0724 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O06047" /db_xref="InterPro:IPR002171" /db_xref="InterPro:IPR005880" /db_xref="InterPro:IPR008991" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR014722" /db_xref="InterPro:IPR014726" /db_xref="InterPro:IPR022666" /db_xref="InterPro:IPR022669" /db_xref="InterPro:IPR022671" /db_xref="UniProtKB/Swiss-Prot:O06047" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99323.1" /translation="MAIRKYKPTTPGRRGASVSDFAEITRSTPEKSLVRPLHGRGGRN AHGRITTRHKGGGHKRAYRMIDFRRNDKDGVNAKVAHIEYDPNRTARIALLHYLDGEK RYIIAPNGLSQGDVVESGANADIKPGNNLPLRNIPAGTLIHAVELRPGGGAKLARSAG SSIQLLGKEASYASLRMPSGEIRRVDVRCRATVGEVGNAEQANINWGKAGRMRWKGKR PSVRGVVMNPVDHPHGGGEGKTSGGRHPVSPWGKPEGRTRNANKSSNKFIVRRRRTGK KHSR" CDS 805233..805514 /codon_start=1 /transl_table=11 /gene="rpsS" /locus_tag="BQ2027_MB0725" /product="30s ribosomal protein s19 rpss" /note="Mb0725, rpsS, len: 93 aa. Equivalent to Rv0705, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Probable rpsS, 30S ribosomal protein S19, equivalent to S36895 ribosomal protein S19 from Mycobacterium bovis (93 aa), FASTA scores: opt: 623, E(): 0, (98.9% identity in 93 aa overlap); and NP_302261.1|NC_002677 30S ribosomal protein S19 from Mycobacterium leprae (93 aa). Also highly similar to others e.g. CAB82074.1|AL161803 30S ribosomal protein S19 from Streptomyces coelicolor (93 aa); etc. Contains PS00323 Ribosomal protein S19 signature. BELONGS TO THE S19P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0725 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0725 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5X5" /db_xref="InterPro:IPR002222" /db_xref="InterPro:IPR005732" /db_xref="InterPro:IPR020934" /db_xref="InterPro:IPR023575" /db_xref="UniProtKB/Swiss-Prot:P0A5X5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99324.1" /translation="MPRSLKKGPFVDEHLLKKVDVQNEKNTKQVIKTWSRRSTIIPDF IGHTFAVHDGRKHVPVFVTESMVGHKLGEFAPTRTFKGHIKDDRKSKRR" CDS 805511..806104 /codon_start=1 /transl_table=11 /gene="rplV" /locus_tag="BQ2027_MB0726" /product="50s ribosomal protein l22 rplv" /note="Mb0726, rplV, len: 197 aa. Equivalent to Rv0706, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Probable rplV, 50S ribosomal protein L22, equivalent to O06115|RL22_MYCSM 50S RIBOSOMAL PROTEIN L22 from Mycobacterium smegmatis (153 aa); MBS10OPER_7 50S RIBOSOMAL PROTEIN L22 from Mycobacterium bovis BCG; and MLCB2492_7 50S ribosomal protein L22 from Mycobacterium leprae (175 aa). Also highly similar to others e.g. CAB82075.1|AL161803 50S ribosomal protein L22 from Streptomyces coelicolor (125 aa); P42060|RL22_BACSU 50s ribosomal protein L22 from Bacillus subtilis (113 aa), FASTA scores: opt: 368, E(): 2.4e-13, (52.8% identity in 108 aa overlap); etc. Contains PS00464 Ribosomal protein L22 signature, and contains repetitive sequence at C-terminus. BELONGS TO THE L22P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0726 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0726 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P61180" /db_xref="InterPro:IPR001063" /db_xref="InterPro:IPR005727" /db_xref="InterPro:IPR018260" /db_xref="InterPro:IPR036394" /db_xref="UniProtKB/Swiss-Prot:P61180" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99325.1" /translation="MTAATKATEYPSAVAKARFVRVSPRKARRVIDLVRGRSVSDALD ILRWAPQAASGPVAKVIASAAANAQNNGGLDPATLVVATVYADQGPTAKRIRPRAQGR AFRIRRRTSHITVVVESRPAKDQRSAKSSRARRTEASKAASKVGATAPAKKAAAKAPA KKAPASSGVKKTPAKKAPAKKAPAKASETSAAKGGSD" CDS 806104..806928 /codon_start=1 /transl_table=11 /gene="rpsC" /locus_tag="BQ2027_MB0727" /product="30s ribosomal protein s3 rpsc" /note="Mb0727, rpsC, len: 274 aa. Equivalent to Rv0707, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Probable rpsC, 30S ribosomal protein S3, equivalent to O06048|RS3_MYCBO|MBS10OPER_8 30S RIBOSOMAL PROTEIN S3 from Mycobacterium bovis BCG (274 aa); and MLCB2492_8 30S RIBOSOMAL PROTEIN S3 from Mycobacterium leprae (281 aa). Also highly similar to others e.g. CAB82076.1|AL161803 30S ribosomal protein S3 from Streptomyces coelicolor (277 aa); P21465|RS3_BACSU 30s ribosomal protein s3 (bs3) (bs2) from Bacillus subtilis (217 aa), FASTA scores: opt: 794, E(): 0, (52.8% identity in 212 aa overlap); etc. BELONGS TO THE S3P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0727 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0727 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5X7" /db_xref="InterPro:IPR001351" /db_xref="InterPro:IPR004044" /db_xref="InterPro:IPR004087" /db_xref="InterPro:IPR005704" /db_xref="InterPro:IPR009019" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR018280" /db_xref="InterPro:IPR036419" /db_xref="UniProtKB/Swiss-Prot:P0A5X7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99326.1" /translation="MGQKINPHGFRLGITTDWKSRWYADKQYAEYVKEDVAIRRLLSS GLERAGIADVEIERTRDRVRVDIHTARPGIVIGRRGTEADRIRADLEKLTGKQVQLNI LEVKNPESQAQLVAQGVAEQLSNRVAFRRAMRKAIQSAMRQPNVKGIRVQCSGRLGGA EMSRSEFYREGRVPLHTLRADIDYGLYEAKTTFGRIGVKVWIYKGDIVGGKRELAAAA PAGADRPRRERPSGTRPRRSGASGTTATGTDAGRAAGGEEAAPDAAAPVEAQSTES" CDS 806932..807348 /codon_start=1 /transl_table=11 /gene="rplP" /locus_tag="BQ2027_MB0728" /product="50s ribosomal protein l16 rplp" /note="Mb0728, rplP, len: 138 aa. Equivalent to Rv0708, len: 138 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 138 aa overlap). Probable rplP, 50S ribosomal protein L16, equivalent to O06049|RL16_MYCBO|MBS10OPER_9 50S RIBOSOMAL PROTEIN L16 from Mycobacterium bovis BCG (138 aa); and MLCB2492_9 50S RIBOSOMAL PROTEIN L16 from Mycobacterium leprae (138 aa). Also highly similar to others e.g. CAB82077.1|AL161803 50S ribosomal protein L16 from Streptomyces coelicolor (139 aa); P14577|RL16_BACSU 50s ribosomal protein l16 from Bacillus subtilis (144 aa), FASTA scores: opt: 600, E(): 0, (63.2% identity in 136 aa overlap); etc. Contains PS00701 Ribosomal protein L16 signature 2. BELONGS TO THE L16P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0728 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0728 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O06049" /db_xref="InterPro:IPR000114" /db_xref="InterPro:IPR016180" /db_xref="InterPro:IPR020798" /db_xref="InterPro:IPR036920" /db_xref="UniProtKB/Swiss-Prot:O06049" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99327.1" /translation="MLIPRKVKHRKQHHPRQRGIASGGTTVNFGDYGIQALEHAYVTN RQIESARIAINRHIKRGGKVWINIFPDRPLTKKPAETRMGSGKGSPEWWVANVKPGRV LFELSYPNEGVARAALTRAIHKLPIKARIITREEQF" CDS 807348..807581 /codon_start=1 /transl_table=11 /gene="rpmC" /locus_tag="BQ2027_MB0729" /product="50s ribosomal protein l29 rpmc" /note="Mb0729, rpmC, len: 77 aa. Equivalent to Rv0709, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Probable rpmC, 50S ribosomal protein L29, equivalent to O06050|RL29_MYCBO|MBS10OPER_10 50S RIBOSOMAL PROTEIN L29 from Mycobacterium bovis BCG (75 aa); and O32989|RL29_MYCLE|MLCB2492_10 50S RIBOSOMAL PROTEIN L29 from Mycobacterium leprae (80 aa). Also highly similar to others e.g. Q9L0D2|RL29_STRCO 50S RIBOSOMAL PROTEIN L29 from Streptomyces coelicolor (74 aa); P12873|RL29_BACSU 50s ribosomal protein l29 from Bacillus subtilis (66 aa), FASTA scores: opt: 225, E(): 8.3e-11, (58.6% identity in 58 aa overlap); etc. BELONGS TO THE L29P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0729 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0729 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O06050" /db_xref="InterPro:IPR001854" /db_xref="InterPro:IPR018254" /db_xref="InterPro:IPR036049" /db_xref="UniProtKB/Swiss-Prot:O06050" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99328.1" /translation="MAVGVSPGELRELTDEELAERLRESKEELFNLRFQMATGQLNNN RRLRTVRQEIARIYTVLRERELGLATGPDGKES" CDS 807578..807988 /codon_start=1 /transl_table=11 /gene="rpsQ" /locus_tag="BQ2027_MB0730" /product="30s ribosomal protein s17 rpsq" /note="Mb0730, rpsQ, len: 136 aa. Equivalent to Rv0710, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Probable rpsQ, 30S ribosomal protein S17, equivalent to O06051|RS17_MYCBO 30S|MBS10OPER_11 30S RIBOSOMAL PROTEIN S17 from Mycobacterium bovis BCG (136 aa); and MLCB2492_11 30S RIBOSOMAL PROTEIN S17 from Mycobacterium leprae (126 aa). Also highly similar to others e.g. CAB82079.1|AL161803 30S ribosomal protein S17 from Streptomyces coelicolor (95 aa); P12874|RS17_BACSU 30s ribosomal protein s17 (bs 16) from Bacillus subtilis (86 aa), FASTA scores: opt: 305, E(): 1.6e-11, (60.5% identity in 81 aa overlap); etc. Contains PS00056 Ribosomal protein S17 signature. BELONGS TO THE S17P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0730 detected using shotgun mass spectrometry. Mb0730 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O06051" /db_xref="InterPro:IPR000266" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR019979" /db_xref="InterPro:IPR019984" /db_xref="UniProtKB/Swiss-Prot:O06051" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99329.1" /translation="MMAEAKTGAKAAPRVAKAAKAAPKKAAPNDAEAIGAANAANVKG PKHTPRTPKPRGRRKTRIGYVVSDKMQKTIVVELEDRMRHPLYGKIIRTTKKVKAHDE DSVAGIGDRVSLMETRPLSATKRWRLVEILEKAK" CDS 808157..808603 /codon_start=1 /transl_table=11 /gene="atsAa" /locus_tag="BQ2027_MB0731" /product="POSSIBLE ARYLSULFATASE ATSAa [FIRST PART] (ARYL-SULFATE SULPHOHYDROLASE) (ARYLSULPHATASE)" /note="Mb0731, atsAa, len: 148 aa. Equivalent to 5' end of Rv0711, len: 787 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 148 aa overlap). Possible atsA, arylsulfatase (EC 3.1.6.1), similar to others e.g. P51691|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 439, E(): 2.9e-21, (30.8% identity in 552 aa overlap); etc. Also similar to other hypothetical arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c, Rv0663, etc. Contains PS00523 Sulfatases signature 1, and PS00149 Sulfatases signature 2. BELONGS TO THE SULFATASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, atsA exists as a single gene. In Mycobacterium bovis, a single base transversion (g-t), introducing a stop codon, splits atsA into 2 parts, atsAa and atsAb. Mb0731 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW61" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR017850" /db_xref="InterPro:IPR024607" /db_xref="UniProtKB/TrEMBL:A0A1R3XW61" /protein_id="SIT99330.1" /translation="MAPEATEAFNGTIELDIRDSEPDWGPYAAPVAPEHSPNILYLVW DDVGIATWDCFGGLVEMPAMTRVAERGVRLSQFHTTALCSPTRASLLTGRNATTVGMA TIEEFTDGFPNCNGRIPADTALLPEVLAEHGYNTYCVGKWHLTPLE" CDS 808610..810520 /codon_start=1 /transl_table=11 /gene="atsAb" /locus_tag="BQ2027_MB0732" /product="POSSIBLE ARYLSULFATASE ATSAb [SECOND PART] (ARYL-SULFATE SULPHOHYDROLASE) (ARYLSULPHATASE)" /note="Mb0732, atsAb, len: 636 aa. Equivalent to 3' end of Rv0711, len: 787 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 636 aa overlap). Possible atsA, arylsulfatase (EC 3.1.6.1), similar to others e.g. P51691|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 439, E(): 2.9e-21, (30.8% identity in 552 aa overlap); etc. Also similar to other hypothetical arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c, Rv0663, etc. Contains PS00523 Sulfatases signature 1, and PS00149 Sulfatases signature 2. BELONGS TO THE SULFATASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, atsA exists as a single gene. In Mycobacterium bovis, a single base transversion (g-t), introducing a stop codon, splits atsA into 2 parts, atsAa and atsAb. Protein product from Mb0732 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0732 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW65" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR017850" /db_xref="UniProtKB/TrEMBL:A0A1R3XW65" /protein_id="SIT99331.1" /translation="MASTKRHWPTSRGFERFYGFLGGETDQWYPDLVYDNHPVSPPGT PEGGYHLSKDIADKTIEFIRDAKVIAPDKPWFSYVCPGAGHAPHHVFKEWADRYAGRF DMGYERYREIVLERQKALGIVPPDTELSPINPYLDVPGPNGETWPLQDTVRPWDSLSD EEKKLFCRMAEVFAGFLSYTDAQIGRILDYLEESGQLDNTIIVVISDNGASGEGGPNG SVNEGKFFNGYIDTVAESMKLFDHLGGPQTYNHYPIGWAMAFNTPYKLFKRYASHEGG IADPAIISWPNGIAAHGEIRDNYVNVSDITPTVYDLLGMTPPGTVKGIPQKPMDGVSF IAALADPAADTGKTTQFYTMLGTRGIWHEGWFANTIHAATPAGWSNFNADRWELFHIA ADRSQCHDLAAEHPDKLEELKALWFSEAAKYNGLPLADLNLLETMTRSRPYLVSERAS YVYYPDCADVGIGAAVEIRGRSFAVLADVTIDTTGAEGVLFKHGGAHGGHVLFVRDGR LHYVYNFLGERQQLVSSSGPVPSGRHLLGVRYLRTGTVPNSHTPVGDLELFFDENLVG ALTNVLTHPGTFGLAGAAISVGRNGGSAVSSHYEAPFAFTGGTITQVTVDVSGRPFED VESDLALAFSRD" CDS 810568..811467 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0733" /product="Sulfatase modifying factor 1 precursor (C-alpha-formyglycine- generating enzyme 1)" /note="Mb0733, -, len: 299 aa. Equivalent to Rv0712, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). Conserved hypothetical protein, similar to others e.g. NP_106128.1|NC_002678 hypothetical protein from Mesorhizobium loti (372 aa); D90901_33|P72841 HYPOTHETICAL 48.1 KD PROTEIN from Synechocystis sp (410 aa), FASTA scores: E(): 1.1e-07, (28.8% identity in 299 aa overlap); etc. Slight similarity to carboxykinases. Similar to C-terminal part of Rv3703c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (425 aa). Protein product from Mb0733 detected using SWATH mass spectrometry. Mb0733 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005532" /db_xref="InterPro:IPR016187" /db_xref="InterPro:IPR042095" /db_xref="UniProtKB/TrEMBL:A0A1R3XYB4" /protein_id="SIT99332.1" /translation="MLTELVDLPGGSFRMGSTRFYPEEAPIHTVTVRAFAVERHPVTN AQFAEFVSATGYVTVAEQPLDPGLYPGVDAADLCPGAMVFCPTAGPVDLRDWRQWWDW VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYARWAGRRLPTEAEWEYAARGGTT ATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTSPVGRFPANGFGLLDMIGNVW EWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKGGSHLCAPEYCHRYRPAAR SPQSQDTATTHIGFRCVADPVSG" CDS 811768..812709 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0734" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0734, -, len: 313 aa. Equivalent to Rv0713, len: 313 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 313 aa overlap). Probable conserved transmembrane protein, similar to Rv3435c|MTCY77_7|O06252 from Mycobacterium tuberculosis (284 aa), FASTA scores: opt: 557, E(): 2.1e-29, (35.8% identity in 282 aa overlap); MLCB2492_12|O32991 HYPOTHETICAL 10.7 KD PROTEIN from Mycobacterium leprae (95 aa). Protein product from Mb0734 detected using SWATH mass spectrometry. Mb0734 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX72" /db_xref="InterPro:IPR027948" /db_xref="UniProtKB/TrEMBL:A0A1R3XX72" /protein_id="SIT99333.1" /translation="MAGSDPPTGGPASQAGSDAGASPEHKHMSRRKHLVLDVCIILGV LIAYVFSLLGYDWLAHTPGPLPQPDVGTTDDTVVLIRFEELHTVANRLDVKVLVLPDD SMIDHRLQVLTTDTSVRLYPENELGDLQYPVGKLPAQVATTIEAHGNPGAWPFDTYTT DTVQADVLVGAGDNRQYVPARVEVTGSLEGWDISAVRVGESSQTSDRPDNVIITLKRA KGPLVFDLGICLVLITLPTLALFVAIQMITGRRKFQPPFGTWYAAMLFAVVPLRTILP GSPPAGAWIDRAVVIWVLIALAAAMVVYIVAWYRESD" CDS 813195..813563 /codon_start=1 /transl_table=11 /gene="rplN" /locus_tag="BQ2027_MB0735" /product="50s ribosomal protein l14 rpln" /note="Mb0735, rplN, len: 122 aa. Equivalent to Rv0714, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Probable rplN, 50S ribosomal protein L14, equivalent to O32993|MLCB2492_14|ML1849|RL14_MYCLE 50S RIBOSOMAL PROTEIN L14 from Mycobacterium leprae (122 aa). Also highly similar to others e.g. CAB82080.1|AL161803 50S ribosomal protein L14 from Streptomyces coelicolor (122 aa); P33100|RL14_MICLU 50s ribosomal protein L14 from Micrococcus luteus (122 aa), FASTA scores: opt: 674, E(): 0, (85.2% identity in 122 aa overlap); etc. Contains PS00049 Ribosomal protein L14 signature. BELONGS TO THE L14P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0735 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0735 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66070" /db_xref="InterPro:IPR000218" /db_xref="InterPro:IPR005745" /db_xref="InterPro:IPR019972" /db_xref="InterPro:IPR036853" /db_xref="UniProtKB/Swiss-Prot:P66070" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99334.1" /translation="MIQQESRLKVADNTGAKEILCIRVLGGSSRRYAGIGDVIVATVK DAIPGGNVKRGDVVKAVVVRTVKERRRPDGSYIKFDENAAVIIKPDNDPRGTRIFGPV GRELREKRFMKIISLAPEVL" CDS 813564..813881 /codon_start=1 /transl_table=11 /gene="rplX" /locus_tag="BQ2027_MB0736" /product="50s ribosomal protein l24 rplx" /note="Mb0736, rplX, len: 105 aa. Equivalent to Rv0715, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Probable rplX, 50S ribosomal protein L24, equivalent to O32994|MLCB2492_15 50S RIBOSOMAL PROTEIN L24 from Mycobacterium leprae (105 aa). Also highly similar to others e.g. CAB82081.1|AL161803 50S ribosomal protein L24 from Streptomyces coelicolor (107 aa); P12876|RL24_BACSU 50s ribosomal protein L24 (bl23) from Bacillus subtilis (103 aa), FASTA scores: opt: 363, E(): 1.8e-18, (56.7% identity in 104 aa overlap); etc. Contains PS01108 Ribosomal protein L24 signature. BELONGS TO THE L24P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0736 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0736 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P60628" /db_xref="InterPro:IPR003256" /db_xref="InterPro:IPR005824" /db_xref="InterPro:IPR005825" /db_xref="InterPro:IPR008991" /db_xref="InterPro:IPR014722" /db_xref="InterPro:IPR041988" /db_xref="UniProtKB/Swiss-Prot:P60628" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99335.1" /translation="MKVHKGDTVLVISGKDKGAKGKVLQAYPDRNRVLVEGVNRIKKH TAISTTQRGARSGGIVTQEAPIHVSNVMVVDSDGKPTRIGYRVDEETGKRVRISKRNG KDI" CDS 813881..814444 /codon_start=1 /transl_table=11 /gene="rplE" /locus_tag="BQ2027_MB0737" /product="50s ribosomal protein l5 rple" /note="Mb0737, rplE, len: 187 aa. Equivalent to Rv0716, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Probable rplE, 50S ribosomal protein L5, equivalent to MLCB2492_16 50S RIBOSOMAL PROTEIN L5 from Mycobacterium leprae (187 aa). Also highly similar to others e.g. CAB82082.1|AL161803 50S ribosomal protein L5 from Streptomyces coelicolor (185 aa); P33098|RL5_MICLU 50S RIBOSOMAL PROTEIN L5 from Micrococcus luteus (191 aa), FASTA scores: opt: 930, E(): 0, (73.8% identity in 183 aa overlap); etc. BELONGS TO THE L5P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0737 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0737 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P62402" /db_xref="InterPro:IPR002132" /db_xref="InterPro:IPR020930" /db_xref="InterPro:IPR022803" /db_xref="InterPro:IPR031309" /db_xref="InterPro:IPR031310" /db_xref="UniProtKB/Swiss-Prot:P62402" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99336.1" /translation="MTTAQKVQPRLKERYRSEIRDALRKQFGYGNVMQIPTVTKVVVN MGVGEAARDAKLINGAVNDLALITGQKPEVRRARKSIAQFKLREGMPVGVRVTLRGDR MWEFLDRLTSIALPRIRDFRGLSPKQFDGVGNYTFGLAEQAVFHEVDVDKIDRVRGMD INVVTSAATDDEGRALLRALGFPFKEN" CDS 814449..814634 /codon_start=1 /transl_table=11 /gene="rpsN1" /locus_tag="BQ2027_MB0738" /product="30s ribosomal protein s14 rpsn1" /note="Mb0738, rpsN1, len: 61 aa. Equivalent to Rv0717, len: 61 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 61 aa overlap). Probable rpsN1, 30S ribosomal protein S14, equivalent to MLCB2492_17|O32996 RIBOSOMAL PROTEIN S14 from Mycobacterium leprae (61 aa). Also highly similar to others e.g. CAB82083.1|AL161803 30S ribosomal protein S14 from Streptomyces coelicolor (61 aa); P24320|RS14_THETH 30s ribosomal protein S14 from Thermus aquaticus (subsp. thermophilus) (60 aa), FASTA scores: opt: 316, E(): 2e-19,(70.0% identity in 60 aa overlap); etc. Contains PS00527 Ribosomal protein S14 signature. BELONGS TO THE S14P FAMILY OF RIBOSOMAL PROTEINS. Note that previously known as rpsN. Protein product from Mb0738 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0738 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5X3" /db_xref="InterPro:IPR001209" /db_xref="InterPro:IPR018271" /db_xref="InterPro:IPR023053" /db_xref="UniProtKB/Swiss-Prot:P0A5X3" /protein_id="SIT99337.1" /translation="MAKKALVNKAAGKPRFAVRAYTRCSKCGRPRAVYRKFGLCRICL REMAHAGELPGVQKSSW" CDS 814798..815196 /codon_start=1 /transl_table=11 /gene="rpsH" /locus_tag="BQ2027_MB0739" /product="30s ribosomal protein s8 rpsh" /note="Mb0739, rpsH, len: 132 aa. Equivalent to Rv0718, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 132 aa overlap). Probable rpsH, 30S ribosomal protein S8, equivalent to O32997|MLCB2492_18 30S RIBOSOMAL PROTEIN S8 from Mycobacterium leprae (132 aa). Also highly similar to others e.g. CAB82084.1|AL161803 30S ribosomal protein S8 from Streptomyces coelicolor (132 aa); P33106|RS8_MICLU 30s ribosomal protein S8 from Micrococcus luteus (132 aa), FASTA scores: opt: 669, E(): 0, (77.3% identity in 132 aa overlap); etc. Contains PS00053 Ribosomal protein S8 signature. BELONGS TO THE S8P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0739 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0739 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66626" /db_xref="InterPro:IPR000630" /db_xref="InterPro:IPR035987" /db_xref="UniProtKB/Swiss-Prot:P66626" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99338.1" /translation="MTMTDPIADFLTRLRNANSAYHDEVSLPHSKLKANIAQILKNEG YISDFRTEDARVGKSLVIQLKYGPSRERSIAGLRRVSKPGLRVYAKSTNLPRVLGGLG VAIISTSSGLLTDRQAARQGVGGEVLAYVW" CDS 815220..815759 /codon_start=1 /transl_table=11 /gene="rplF" /locus_tag="BQ2027_MB0740" /product="50s ribosomal protein l6 rplf" /note="Mb0740, rplF, len: 179 aa. Equivalent to Rv0719, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 179 aa overlap). Probable rplF, 50S ribosomal protein L6, equivalent to O32998|MLCB2492_19 50S RIBOSOMAL PROTEIN L6 from Mycobacterium leprae (179 aa). Also highly similar to others e.g. P46786|RL6_STRCO|CAB82085.1|AL161803|SCD31.42 50S ribosomal protein L6 from Streptomyces coelicolor (179 aa), FASTA scores: opt: 872, E(): 0, (70.4% identity in 179 aa overlap); etc. Contains PS00525 Ribosomal protein L6 signature 1. BELONGS TO THE L6P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0740 detected using shotgun mass spectrometry. Mb0740 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66312" /db_xref="InterPro:IPR000702" /db_xref="InterPro:IPR002358" /db_xref="InterPro:IPR019906" /db_xref="InterPro:IPR020040" /db_xref="InterPro:IPR036789" /db_xref="UniProtKB/Swiss-Prot:P66312" /protein_id="SIT99339.1" /translation="MSRIGKQPIPVPAGVDVTIEGQSISVKGPKGTLGLTVAEPIKVA RNDDGAIVVTRPDDERRNRSLHGLSRTLVSNLVTGVTQGYTTKMEIFGVGYRVQLKGS NLEFALGYSHPVVIEAPEGITFAVQAPTKFTVSGIDKQKVGQIAANIRRLRRPDPYKG KGVRYEGEQIRRKVGKTGK" CDS 815762..816130 /codon_start=1 /transl_table=11 /gene="rplR" /locus_tag="BQ2027_MB0741" /product="50s ribosomal protein l18 rplr" /note="Mb0741, rplR, len: 122 aa. Equivalent to Rv0720, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Probable rplR, 50S ribosomal protein L18, equivalent to O32999|MLCB2492_20|RL18_MYCLE 50S RIBOSOMAL PROTEIN L18 from Mycobacterium leprae (122 aa). Also highly similar to others e.g. CAB82086.1|AL161803 50S ribosomal protein L18 from Streptomyces coelicolor (127 aa); P33102|RL18_MICLU 50s ribosomal protein L18 from Micrococcus luteus (119 aa), FASTA scores: opt: 447, E(): 8.7e-24, (60.4% identity in 111 aa overlap); etc. BELONGS TO THE L18P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0741 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0741 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66077" /db_xref="InterPro:IPR004389" /db_xref="InterPro:IPR005484" /db_xref="UniProtKB/Swiss-Prot:P66077" /protein_id="SIT99340.1" /translation="MAQSVSATRRISRLRRHTRLRKKLSGTAERPRLVVHRSARHIHV QLVNDLNGTTVAAASSIEADVRGVPGDKKARSVRVGQLIAERAKAAGIDTVVFDRGGY TYGGRIAALADAARENGLSF" CDS 816150..816812 /codon_start=1 /transl_table=11 /gene="rpsE" /locus_tag="BQ2027_MB0742" /product="30s ribosomal protein s5 rpse" /note="Mb0742, rpsE, len: 220 aa. Equivalent to Rv0721, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Probable rpsE, 30S ribosomal protein S5, equivalent to MLCB2492_21 RIBOSOMAL PROTEIN S5 from Mycobacterium leprae (217 aa). Also highly similar to others e.g. P46790|RS5_STRCO 30s ribosomal protein S5 from Streptomyces coelicolor (167 aa), FASTA scores: opt: 889, E(): 0, (82.1% identity in 162 aa overlap); etc. Note N-terminus is extented compared to other rpsE genes. Contains PS00585 Ribosomal protein S5 signature, PTS HPr component phosphorylation sites signature. BELONGS TO THE S5P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0742 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0742 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66575" /db_xref="InterPro:IPR000851" /db_xref="InterPro:IPR005324" /db_xref="InterPro:IPR005712" /db_xref="InterPro:IPR013810" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR018192" /db_xref="InterPro:IPR020568" /db_xref="UniProtKB/Swiss-Prot:P66575" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99341.1" /translation="MAEQPAGQAGTTDNRDARGDREGRRRDSGRGSRERDGEKSNYLE RVVAINRVSKVVKGGRRFSFTALVIVGDGNGMVGVGYGKAKEVPAAIAKGVEEARKSF FRVPLIGGTITHPVQGEAAAGVVLLRPASPGTGVIAGGAARAVLECAGVHDILAKSLG SDNAINVVHATVAALKLLQRPEEVAARRGLPIEDVAPAGMLKARRKSEALAASVLPDR TI" CDS 816815..817012 /codon_start=1 /transl_table=11 /gene="rpmD" /locus_tag="BQ2027_MB0743" /product="50s ribosomal protein l30 rpmd" /note="Mb0743, rpmD, len: 65 aa. Equivalent to Rv0722, len: 65 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 65 aa overlap). Probable rpmD, 50S ribosomal protein L30, equivalent to O33001 RIBOSOMAL PROTEIN L30 from Mycobacterium leprae (71 aa). Also highly similar to others e.g. P46789|RL30_STRCO 50S RIBOSOMAL PROTEIN L30 from Streptomyces coelicolor (60 aa); P02430|RL30_ECOLI 50S ribosomal protein L30 from Escherichia coli (58 aa), FASTA scores: opt: 168, E(): 1.5e-13, (53.7% identity in 54 aa overlap); etc. BELONGS TO THE L30P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0743 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0743 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66182" /db_xref="InterPro:IPR005996" /db_xref="InterPro:IPR016082" /db_xref="InterPro:IPR018038" /db_xref="InterPro:IPR036919" /db_xref="UniProtKB/Swiss-Prot:P66182" /protein_id="SIT99342.1" /translation="MSQLKITQVRSTIGARWKQRESLRTLGLRRIRHSVIREDNAATR GLIAVVRHLVEVEPAQTGGKT" CDS 817012..817452 /codon_start=1 /transl_table=11 /gene="rplO" /locus_tag="BQ2027_MB0744" /product="50s ribosomal protein l15 rplo" /note="Mb0744, rplO, len: 146 aa. Equivalent to Rv0723, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 146 aa overlap). Probable rplO, 50S ribosomal protein L15, equivalent to MLCB2492_23|O33002 50S RIBOSOMAL PROTEIN L15 from Mycobacterium leprae (146 aa). Also highly similar to others e.g. P46787|RL15_STRCO|SCD31.46 50S RIBOSOMAL PROTEIN L15 from Streptomyces coelicolor (151 aa); P19946|RL15_BACSU 50s ribosomal protein L15 from Bacillus subtilis (146 aa), FASTA scores: opt: 419, E(): 6.5e-20, (51.0% identity in 145 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00475 Ribosomal protein L15 signature. BELONGS TO THE L15P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb0744 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0744 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U1E9" /db_xref="InterPro:IPR001196" /db_xref="InterPro:IPR005749" /db_xref="InterPro:IPR021131" /db_xref="InterPro:IPR030878" /db_xref="InterPro:IPR036227" /db_xref="UniProtKB/Swiss-Prot:Q7U1E9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99343.1" /translation="MTLKLHDLRPARGSKTARTRVGRGDGSKGKTAGRGTKGTRARKQ VPVTFEGGQMPIHMRLPKLKGFRNRFRTEYEIVNVGDINRLFPQGGAVGVDDLVAKGA VRKNALVKVLGDGKLTAKVDVSAHKFSGSARAKITAAGGSATEL" CDS 817485..819356 /codon_start=1 /transl_table=11 /gene="sppA" /locus_tag="BQ2027_MB0745" /product="POSSIBLE PROTEASE IV SPPA (ENDOPEPTIDASE IV) (SIGNAL PEPTIDE PEPTIDASE)" /note="Mb0745, sppA, len: 623 aa. Equivalent to Rv0724, len: 623 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 623 aa overlap). Possible sppA, protease IV (endopeptidase IV) (EC 3.4.21.-), equivalent (but longer 23 aa) to MLCB2492_24|O33003 ENDOPEPTIDASE IV from Mycobacterium leprae (602 aa). Also similar to others e.g. NP_419743.1|NC_002696 signal peptide peptidase SppA from Caulobacter crescentus (594 aa); P08395|SPPA_ECOLI|B1766 protease IV (endopeptidase) from Escherichia coli strain K-12 (618 aa), FASTA scores: opt: 582, E(): 8.9e-27, (34.1% identity in 525 aa overlap); etc. BELONGS TO PEPTIDASE FAMILY S49. Protein product from Mb0745 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0745 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWB3" /db_xref="InterPro:IPR002142" /db_xref="InterPro:IPR004634" /db_xref="InterPro:IPR004635" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XWB3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99344.1" /translation="MPIFGGFCVCSRALGGRWVRWVNMVAFLPSIPVVEDLRALVGRV DTARHHGVPNGCVLEFNLRSVPPETTGFDPLTVLTGGGRPMALRDAVAAIHRAAEDPR VAGLIARVQLPPSPAGAVQELREAIAAFSAVKPSLAWAETYPGTLSYYLASAFGEVWM QPSGSVGLVGFATNATFLRDALHKAGIEAQFVARGEYKSAANLFTEDGFTDAHREAVT RMLDSLQDQVWQAVAKSRNIGVDALDELADRAPLLRDDAVTCGLIDRIGFRDQAYARM AELVGVEKGSPESSGSQTSPDEKPPRMYLARYASSARPRLTPPVPSIPGRRSKPTIAV VTLEGPIVNGRGGPQFLPLGPSSAGGDTIAAALREVAADDSVSAIVLRVDSPGGSVTA SETIWREVARARDRGKPVVASMGAVAASGGYYVSMGADAIVANPGTITGSIGVITGKL VVRDLKDRLGVGSDAVRTNANADAWSIDAPFTPDQQAHREAEADLFYSDFVERVAEGR KMTTDAVDVVARGRVWTGADALDRGLVDELGGLRTAVRRAKVLAGLDEDTEVRIVSYP GSSLWDMVRPRPSSRPAAASLPDAMGALLARSIVGIVEQVEQTLSGASVLWLGESRL" CDS complement(819361..820266) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0746C" /product="O-methyltransferase" /note="Mb0746c, -, len: 301 aa. Equivalent to Rv0725c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0726c, Rv0731c, Rv3399, etc, e.g. Y893_MYCTU|Q10552|Rv0893C hypothetical 36.1 kd protein cy31.21c (325 aa), FASTA scores: opt: 600, E(): 3.9e-32, (43.8% identity in 219 aa overlap). Mb0746c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U1E7" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U1E7" /protein_id="SIT99345.1" /translation="MPRAHDDNWDLASSVGATATMVAAGRALATKDPRGLINDPFAEP LVRAVGLDFFTKLIDGELDIATTGNLSPGRAQAMIDGIAVRTKYFDDYFRTATDGGVR QVVILAAGLDARAYRLPWPAGTVVYEIDQPQVIDFKTTTLAGIGAKPTAIRRTVYIDL RADWPAALQAAGLDSTAPTAWLAEGMLIYLPPDPRTGCSTTAPNSVLRAARSLPNLSR ALWISTQAGYEKWRIRFASTAWTSTWRRWCIPANAATSSTTCAPRAGTLRAQCGPTYS GAMVCPFPPHTTTIRSAKSSSSAVV" CDS complement(820359..821462) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0747C" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0747c, -, len: 367 aa. Equivalent to Rv0726c, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 367 aa overlap). Conserved hypothetical protein, highly similar to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Q10552|Y893_MYCTU|Rv0893c|MT0917|MTCY31.21c (325 aa), FASTA scores: opt: 646, E(): 0, (38.3% identity in 329 aa overlap); Rv0731c|MTV041.05c (318 aa), Rv3399, etc. Also similar to proteins from Mycobacterium leprae and other organisms e.g. T35930 hypothetical protein SC9B5.10 from Streptomyces coelicolor (303 aa). Protein product from Mb0747c detected using SWATH mass spectrometry. Mb0747c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U1E6" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U1E6" /protein_id="SIT99346.1" /translation="MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLIND QFAEPLVRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFFMD ATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLAELGATPTADR RVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPEAQDRLLDNVTALSVPDSR FATESIRNFKPHHEERMRERMTILANRWRAYGFDLDMNELVYFGDRNEPASYLSDNGW LLTEIKSQDLLTANGFQPFEDEEVPLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTAC PVNELSKSAAYTMTRSDAHQASTTAPPPPGLTG" CDS complement(821665..822321) /codon_start=1 /transl_table=11 /gene="fucA" /locus_tag="BQ2027_MB0748C" /product="POSSIBLE L-FUCULOSE PHOSPHATE ALDOLASE FUCA (L-FUCULOSE-1-PHOSPHATE ALDOLASE)" /note="Mb0748c, fucA, len: 218 aa. Equivalent to Rv0727c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 218 aa overlap). Possible fucA, L-fuculose-1-phosphate aldolase (EC 4.1.2.17), similar to many e.g. NP_386339.1|NC_003047 PUTATIVE L-FUCULOSE PHOSPHATE ALDOLASE PROTEIN from Sinorhizobium meliloti (222 aa); P11550|FUCA_ECOLI L-FUCULOSE PHOSPHATE ALDOLASE from Escherichia strain K12 (215 aa), FASTA scores: opt: 372, E(): 4.1e-19, (34.6% identity in 185 aa overlap); etc. BELONGS TO THE ALDOLASE CLASS II FAMILY, ARAD/FUCA SUBFAMILY. COFACTOR: BINDS ONE ZINC ION PER MOLECULE. Protein product from Mb0748c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR001303" /db_xref="InterPro:IPR036409" /db_xref="UniProtKB/TrEMBL:A0A1R3XWB0" /protein_id="SIT99347.1" /translation="MNFVDDPESAVLAAAKDMLRRGLVEGTAGNISARRSDGNVVITP SSVDYAEMLLHDLVLVDAGGAVLHAKDGRSPSTELNLHLACYRAFDDIGSVIHSHPVW ATMFAVAHEPIPACIDEFAIYCGGDVRCTEYAASGTPEVGRNAVRALEGRAAALIANH GLVAVGPRPDQVLRVTALVERTAQIVWGARALGGPVPIPEDVCRNFTGVYGYLRANPL " CDS complement(822318..823298) /codon_start=1 /transl_table=11 /gene="serA2" /locus_tag="BQ2027_MB0749C" /product="POSSIBLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE SERA2 (PHOSPHOGLYCERATE DEHYDROGENASE) (PGDH)" /note="Mb0749c, serA2, len: 326 aa. Equivalent to Rv0728c, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 326 aa overlap). Possible serA2, D-3-phosphoglycerate dehydrogenase (EC 1.1.1.95), similar to others e.g. AF0278|AF027868_5|YoaD D-3-phosphoglycerate dehydrogenase from Bacillus subtilis (344 aa), FASTA scores: opt: 594, E(): 3.1e-31, (35.9% identity in 309 aa overlap); etc. Also similar to Rv2996c|MTV012.10|SERA1 D-3-phosphoglycerate dehydrogenase from Mycobacterium tuberculosis (528 aa)." /db_xref="GOA:A0A1R3XW91" /db_xref="InterPro:IPR006139" /db_xref="InterPro:IPR006140" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XW91" /protein_id="SIT99348.1" /translation="MTPRPRALVTAPLRGPGFAQLRRLADVVYDPWIDQRPLRIYSAE QLADRITAVAADVLVVESDSVGGPVFERGLRVVAATRGDPSNVDIPGATAAGIPVLHT PARNADAVAEMTVALLLAVARHLIPADADVRSGNIFRDGTIPYQRFRGAEIAGLTAGL VGLGAVGRAVRWRLSGLGLRVIAHDPYRDDAGHSLDELLAEADIVSMHAAVTDDTIGM IGAQQFAAMRDGAVFLNTARSQLHDTDALVDALRGGKLAAAGLDHFTGEWLPTDHPLV SMPNVVLTPHIGGATWNTEARQARMVADDLGALLSGNRPAHVVNPEVLGS" CDS 823329..824675 /codon_start=1 /transl_table=11 /gene="xylB" /locus_tag="BQ2027_MB0750" /product="POSSIBLE D-XYLULOSE KINASE XYLB (XYLULOKINASE) (XYLULOSE KINASE)" /note="Mb0750, xylB, len: 448 aa. Equivalent to Rv0729, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 448 aa overlap). Possible xylB, D-xylulose-kinase (xylulokinase) (EC 2.7.1.17). C-terminus highly similar to AAD09880.1|U77912 unknown protein from Mycobacterium bovis (102 aa); and N-terminus highly similar to T45387|Z98756|MLCB2492_25 hypothetical protein from Mycobacterium leprae (110 aa), FASTA scores: opt: 427, E(): 1.1e-19, (60.9% identity in 110 aa overlap). Also similar to xylA/xylB genes from various bacterial species e.g. AAC26499.1|AF045245 D-xylulose-kinase from Klebsiella pneumoniae (487 aa); NP_418021.1|NC_000913 xylulokinase from Escherichia coli strain K12 (484 aa), FASTA scores: opt: 260, E(): 7.5e-09, (25.9% identity in 478 aa overlap); etc. Also similar to Rv3696c|glpK PROBABLE GLYCEROL KINASE (EC 2.7.1.30) from Mycobacterium tuberculosis (517 aa). BELONGS TO THE FUCOKINASE / GLUCONOKINASE / GLYCEROKINASE / XYLULOKINASE FAMILY. Protein product from Mb0750 detected using SWATH mass spectrometry. Mb0750 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW81" /db_xref="InterPro:IPR000577" /db_xref="InterPro:IPR018484" /db_xref="InterPro:IPR018485" /db_xref="UniProtKB/TrEMBL:A0A1R3XW81" /protein_id="SIT99349.1" /translation="MSRDDVTIGIDIGTTAVKAVAADDNGRVTARVRIGHQLAVPAPD RLEHDADEAWRRGPLAALDRLVGPDTRALAVAAMVPSLTAVDPAGRPITPGLLYGDAR GRVPNASVARAQSVPSVGETAEFLRWTAGQAPDASGYWPAPAVANYALSGEAVIDYAT AVTTLPLFDGTGWNATACADCGVTVDRMPRVETFGVGVGQVRGTGAVLAVGAVDALCE QIVAGADRDGDVLVLCGATLIVWTTISAARQVPGLWTIPHTAPGKSQIGGASNAGGLF LNWVDRVIGPGDPALADPRRVPVWLPYIRGERTPFHEPDRRAVLDGVDLSQDAASVRR AAYEASGFVVRQLIELSGAPVARIVAAGGGTRIQPWMQAIADATGRPVEVSRVAEGAA LGAAFLGRLAAGLESSIADAARWASTDRIVEPSADWAGPTKERYRRFLALSGSKLA" CDS 824688..825416 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0751" /product="gcn5-related n-acetyltransferase" /note="Mb0751, -, len: 242 aa. Equivalent to Rv0730, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 242 aa overlap). Conserved hypothetical protein, only equivalent to Z98756|MLCB2492_26 HYPOTHETICAL PROTEIN from Mycobacterium leprae (227 aa), FASTA scores: opt: 1180, E(): 0, (83.5% identity in 218 aa overlap). Protein product from Mb0751 detected using shotgun mass spectrometry. Mb0751 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW83" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR013757" /db_xref="InterPro:IPR013760" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3XW83" /protein_id="SIT99350.1" /translation="MHGARTGVSFYAYAMTDHDQTAARREIADALLAALERRHEVADA IVEAANKAAAVEAIVNLLGTSHLAAEAVMSMSFDQLTQDARTKIIAELDDLNKQLSFT VKERPASSGEGLELRPFSPDEDRDIFARRTEEMGAAGDGSGGPAGSVDDEIRAAQKRV DDEEAAWFVAVDSGVKVGMVFGELVHGEVDVRIWIHPDHRKKGYGTAALRKSRSEMAW AFPAVPMVARAPAAQPAQPGSAGR" CDS complement(825505..826461) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0752C" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0752c, -, len: 318 aa. Equivalent to Rv0731c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Conserved hypothetical protein, highly similar to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0726c|MTCY210.45c (367 aa), FASTA score: (60.9% identity in 317 aa overlap); Rv3399, Rv1729c, etc. Protein product from Mb0752c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:O53795" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:O53795" /protein_id="SIT99351.1" /translation="MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVND QFAEPLVRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLDAT RAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGLGAAPTTDRRT VAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQDRLLDQITAQSVPGSQFA TEVLRDINRLNEEELRGRMRRLAERFRRHGLDLDMSGLVYFGDRTDARTYLADHGWRT ASASTTDLLAEHGLPPIDGDDAPFGEVIYVSAELKQKHQDTR" CDS 826622..827947 /codon_start=1 /transl_table=11 /gene="secY" /locus_tag="BQ2027_MB0753" /product="PROBABLE PREPROTEIN TRANSLOCASE SECY" /note="Mb0753, secY, len: 441 aa. Equivalent to Rv0732, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 441 aa overlap). Probable SecY, preprotein translocase (integral membrane protein), equivalent to NP_302243.1|NC_002677 SecY subunit of preprotein translocase from Mycobacterium leprae (438 aa); AAC04389.1|AF047021 preprotein translocase subunit from Mycobacterium smegmatis (438 aa); and U77912|MBU77912_1 preprotein translocase subunit from Mycobacterium bovis (441 aa), FASTA scores: opt: 2802, E(): 0, (99.8% identity in 441 aa overlap). Also highly similar to others e.g. P46785|SECY_STRCO PREPROTEIN TRANSLOCASE SECY SUBUNIT from Streptomyces coelicolor (437 aa); etc. Contains PS00755 and PS00756 protein secY signatures 1 and 2. BELONGS TO THE SECE/SEC61-ALPHA FAMILY. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY. Protein product from Mb0753 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0753 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Z3" /db_xref="InterPro:IPR002208" /db_xref="InterPro:IPR023201" /db_xref="InterPro:IPR026593" /db_xref="InterPro:IPR030659" /db_xref="UniProtKB/Swiss-Prot:P0A5Z3" /protein_id="SIT99352.1" /translation="MLSAFISSLRTVDLRRKILFTLGIVILYRVGAALPSPGVNFPNV QQCIKEASAGEAGQIYSLINLFSGGALLKLTVFAVGVMPYITASIIVQLLTVVIPRFE ELRKEGQAGQSKMTQYTRYLAIALAILQATSIVALAANGGLLQGCSLDIIADQSIFTL VVIVLVMTGGAALVMWMGELITERGIGNGMSLLIFVGIAARIPAEGQSILESRGGVVF TAVCAAALIIIVGVVFVEQGQRRIPVQYAKRMVGRRMYGGTSTYLPLKVNQAGVIPVI FASSLIYIPHLITQLIRSGSGVVGNSWWDKFVGTYLSDPSNLVYIGIYFGLIIFFTYF YVSITFNPDERADEMKKFGGFIPGIRPGRPTADYLRYVLSRITLPGSIYLGVIAVLPN LFLQIGAGGTVQNLPFGGTAVLIMIGVGLDTVKQIESQLMQRNYEGFLK" CDS 827944..828489 /codon_start=1 /transl_table=11 /gene="adk" /locus_tag="BQ2027_MB0754" /product="adenylate kinase adk (atp-amp transphosphorylase)" /note="Mb0754, adk, len: 181 aa. Equivalent to Rv0733, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Probable adk, adenylate kinase (ATP-AMP transphosphorylase) (EC 2.7.4.3), equivalent to Z98756|MLCB24 92_28 probable adenylate kinase from Mycobacterium leprae (181 aa), FASTA scores: opt: 978, E(): 0, (83.6% identity in 177 aa overlap); and AAF86323.1|AF271342 putative adenylate kinase from Mycobacterium marinum (124 aa) (N-terminus shorter). Also highly similar to others e.g. P43414|KAD_STRCO ADENYLATE KINASE from Streptomyces coelicolor (217 aa), FASTA score: (43.0% identity in 186 aa overlap); etc. Contains PS00113 Adenylate kinase signature. BELONGS TO THE ADENYLATE KINASE FAMILY. Protein product from Mb0754 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0754 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P69439" /db_xref="InterPro:IPR000850" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR033690" /db_xref="UniProtKB/Swiss-Prot:P69439" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99353.1" /translation="MRVLLLGPPGAGKGTQAVKLAEKLGIPQISTGELFRRNIEEGTK LGVEAKRYLDAGDLVPSDLTNELVDDRLNNPDAANGFILDGYPRSVEQAKALHEMLER RGTDIDAVLEFRVSEEVLLERLKGRGRADDTDDVILNRMKVYRDETAPLLEYYRDQLK TVDAVGTMDEVFARALRALGK" CDS 828492..829292 /codon_start=1 /transl_table=11 /gene="mapA" /locus_tag="BQ2027_MB0755" /product="methionine aminopeptidase mapa (map) (peptidase m) (metap)" /note="Mb0755, mapA, len: 266 aa. Equivalent to Rv0734, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 266 aa overlap). Probable mapA, methionine aminopeptidase (map) (EC 3.4.11.18), equivalent to Z98756|MLCB2492_29 probable methionine aminopeptidase from Mycobacterium leprae (266 aa), FASTA scores: opt: 1717, E(): 0, (83.4% identity in 265 aa overlap). Also highly similar to many e.g. T35553 methionine aminopeptidase from Streptomyces coelicolor (278 aa); etc. Also similar to Rv2861c|MAPB PROBABLE METHIONINE AMINOPEPTIDASE from Mycobacterium tuberculosis (285 aa). BELONGS TO PEPTIDASE FAMILY M24A; ALSO KNOWN AS THE MAP FAMILY 1. COFACTOR: COBALT; BINDS 2 IONS PER SUBUNIT. Protein product from Mb0755 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0755 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWC1" /db_xref="InterPro:IPR000994" /db_xref="InterPro:IPR001714" /db_xref="InterPro:IPR002467" /db_xref="InterPro:IPR036005" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC1" /protein_id="SIT99354.1" /translation="MRPLARLRGRRVVPQRSAGELDAMAAAGAVVAAALRAIRAAAAP GTSSLSLDEIAESVIRESGATPSFLGYHGYPASICASINDRVVHGIPSTAEVLAPGDL VSIDCGAVLDGWHGDAAITFGVGALSDADEALSEATRESLQAGIAAMVVGNRLTDVAH AIETGTRAAELRYGRSFGIVAGYGGHGIGRQMHMDPFLPNEGAPGRGPLLAAGSVLAI EPMLTLGTTKTVVLDDKWTVTTADGSRAAHWEHTVAVTDDGPRILTLG" CDS 829365..829898 /codon_start=1 /transl_table=11 /gene="sigL" /locus_tag="BQ2027_MB0756" /product="PROBABLE ALTERNATIVE RNA POLYMERASE SIGMA FACTOR SIGL" /note="Mb0756, sigL, len: 177 aa. Equivalent to Rv0735, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap). Probable sigL, alternative RNA polymerase sigma factor (rpoE) (see citation below), highly similar to many proteins of the extracytoplasmatic function (ECF) subfamily e.g. CAB72200.1|AL138851 putative RNA polymerase sigma factor from Streptomyces coelicolor (194 aa); Q06909|CARQ_MYXXA RNA POLYMERASE SIGMA FACTOR CARQ from Myxococcus xanthus (174 aa), FASTA scores: opt: 251, E(): 9.6e-11, (32.9% identity in 161 aa overlap); etc. Also similar to MTCI61_4, MTU87242_1, and MLU15180_30 from Mycobacterium tuberculosis. Contains PS01063 Sigma-70 factors ECF subfamily signature and probable helix-turn helix motif from aa 139-160 (Score 1134, +3.05 SD). BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. Mb0756 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWN4" /db_xref="InterPro:IPR000838" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR007630" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/TrEMBL:A0A1R3XWN4" /protein_id="SIT99355.1" /translation="MARVSGAAAAEAALMRALYDEHAAVLWRYALRLTGDAAQAEDVV QETLLRAWQHPEVIGDTARPARAWLFTVARNMIIDERRSARFRNVVGSTDQSGTPEQS TPDEVNAALDRLLIADALAQLSAEHRAVIQRSYYRGWSTAQIATDLGIAEGTVKSRLH YAVRALRLTLQELGVTR" CDS 829962..830714 /codon_start=1 /transl_table=11 /gene="rsla" /locus_tag="BQ2027_MB0757" /product="anti-sigma factor rsla" /note="Mb0757, -, len: 250 aa. Equivalent to Rv0736, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Probable conserved membrane protein, showing weak similarity with AL133469|SCM10_32 putative membrane protein from Streptomyces coelicolor (216 aa), FASTA scores: opt: 180, E(): 0.00018, (34.3% identity in 216 aa overlap). Mb0757 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XW87" /db_xref="InterPro:IPR027383" /db_xref="UniProtKB/TrEMBL:A0A1R3XW87" /protein_id="SIT99356.1" /translation="MTMPLRGLGPPDDTGVREVSTGDDHHYAMWDAAYVLGALSAADR REFEAHLAGCPECRGAVTELCGVPALLSQLDRDEVAAISESAPTVVASGLSPELLPSL LAAVHRRRRRTRLITWVASSAAAAVLAIGVLVGVQGHSAAPQRAAVSALPMAQVGTQL LASTVSISGEPWGTFINLRCVCLAPPYASHDTLAMVVVGRDGSQTRLATWLAEPGHTA TPAGSISTPVDQIAAVQVVAADTGQVLLQRSL" CDS 831029..831526 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0758" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0758, -, len: 165 aa. Equivalent to Rv0737, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Possible transcriptional regulator, similar to others e.g. BAB69161.1|AB070937 regulator protein from Streptomyces avermitilis (169 aa); NP_419731.1|NC_002696 transcriptional regulator MarR family from Caulobacter crescentus (148 aa) (homology only at C-terminus); etc. Also shows weak similarity to AB0014|AB001488_14 hypothetical protein from Bacillus subtilis (164 aa), FASTA scores: opt: 163, E(): 9.3e-05, (32.8% identity in 116 aa overlap), which is similar to slyY gene of S. typhimurium required for survival in macrophage. Contains possible helix-turn helix motif from aa 73-94 (Score 1138, +3.06 SD). Protein product from Mb0758 detected using SWATH mass spectrometry. Mb0758 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWB9" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XWB9" /protein_id="SIT99357.1" /translation="MASDNRDPIAAARANWERSGWGDVSLGMVAVTSVMRAHQILLAR VETALRPYDLSFSRFELLRLLAFSRIGALPITKASDRLQVHVTSVTHAIRRLEADGLV RRVPHPTDGRTTLVQITELGRSTVEDATVTLNEQVFANVGMGAEESQALVSAVETLRR NAGDF" CDS 831884..832432 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0759" /product="conserved protein" /note="Mb0759, -, len: 182 aa. Equivalent to Rv0738, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Conserved hypothetical protein, showing weak similarity with hypothetical proteins from Mycobacterium tuberculosis: Rv1727|MTCY04C12.12 (189 aa); MTY13D12_7|Z80343 hypothetical protein from Mycobacterium tuberculosis (194 aa), FASTA scores: opt: 172, E(): 0.0004, (24.2% identity in 178 aa overlap); and C-terminus of Rv0576. Protein product from Mb0759 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0759 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XW99" /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR017520" /db_xref="InterPro:IPR024344" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/TrEMBL:A0A1R3XW99" /protein_id="SIT99358.1" /translation="MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHV VGGNEQVGRWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPG QVFIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADE KPCPRERPPADQLAAFLGRTVR" CDS 832637..833485 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0760" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0760, -, len: 282 aa. Equivalent to Rv0739, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 263 aa overlap). Conserved hypothetical protein, showing some similarity to Mycobacterium tuberculosis proteins Rv0026 (448 aa), FASTA score: (37.6% identity in 101 aa overlap)and Rv0025 (120 aa), FASTA score: (32.4% identity in 142 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 2 bp insertion (*-cg) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (282 aa versus 268 aa). Mb0760 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019710" /db_xref="UniProtKB/TrEMBL:A0A1R3XW88" /protein_id="SIT99359.1" /translation="MSDYSLGVPDETGLGADAARAREVALTQHIGVSAETDRAVVPKL RQAYDSLVCGRRRLGAIGAEIENAVAHQRALGLDTPAGARNFSRFLATKAHDITRVLA ATAAESQAGAARLRSLASSYQAVGFGPKPQEPPPDPVPFPPYQPKVWAACRARGQDPD KVVRTFHHAPMSARFRSLPAGDSVLYCGNDKYGLLHIQAKHGRQWHDIADARWPSAGN WRYLADYAIGATLAYPERVEYNQDNDTFAVYRRVSLPDGRYVFTTRVIISARDGKIIT AFPQTT" CDS 833600..834127 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0761" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0761, -, len: 175 aa. Equivalent to Rv0740, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 175 aa overlap). Conserved hypothetical protein; C-terminus (possibly part of truncated IS1557) shows nearly perfect identity to Rv0750|MTV041_24 (81 aa), FASTA score: (92.6% identity in 81 aa overlap). Also shows weak similarity to MTV007_5 hypothetical protein from Mycobacterium tuberculosis (313 aa), FASTA score: (34.5% identity in 110 aa overlap); and MLCL536_27 hypothetical protein from Mycobacterium leprae (315 aa), FASTA score: (34.5% identity in 84 aa overlap). Protein product from Mb0761 detected using SWATH mass spectrometry. Mb0761 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XW86" /protein_id="SIT99360.1" /translation="MLPKNTRPTSETAEEFWDNSLWCSWGDRETGYTRTVTVSICQVA DGEREAEGVRDMMRLECPAGLDLRTPNPEAYEITGQRPGEFVFVLGYLGHVRAIVGNC YIEIMPMGTRVELSKLADVALDIGRSVGCSAYENDFTLPDIPTQWRNQPLGWYTQGLA PYLPGLSDPKDAAEG" mobile_element 834176..834692 /mobile_element_type="insertion sequence:IS1557" /locus_tag="BQ2027_IS1557'-1" /note="IS1557'-1, len: 517 nt. Equivalent to IS1557', len: 517 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 517 nt overlap). Region similar to IS1557 region on MTCY373- (IS1557- 1st copy)." gene 834176..834692 /locus_tag="BQ2027_IS1557'-1" CDS 834358..834672 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0762" /product="probable transposase (fragment)" /note="Mb0762, -, len: 104 aa. Equivalent to Rv0741, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Probable truncated transposase for IS1557, showing similarity to transposases and IS elements e.g. U63997|EFU63997_1 insertion sequence from Enterococcus faecium (424 aa), FASTA score: (31.0% identity in 87 aa overlap). Very high similarity with the C-terminal part of Z73419|MTCY373_3 2 IS1557 from Mycobacterium tuberculosis (444 aa), FASTA score: (86.5% identity in 104 aa overlap). Mb0762 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002560" /db_xref="UniProtKB/TrEMBL:A0A1R3XW94" /protein_id="SIT99361.1" /translation="MFSVKGEEGKQALDRWISWARRCRIPVFVELAGGIVRHRQAIDA ALDHGLWQGLIESTNTKIRLLTRIAFGFRSPEALIALAMLALGGRRPALPGRTKHPRI SQ" CDS 834814..835332 /codon_start=1 /transl_table=11 /gene="PE_PGRS8" /locus_tag="BQ2027_MB0763" /product="pe-pgrs family protein pe_pgrs8" /note="Mb0763, PE_PGRS8, len: 172 aa. Equivalent to Rv0742, len: 172 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 172 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many M. tuberculosis PGRS-type proteins e.g. Z78020|MTCY1A11_25 (498 aa), FASTA scores: opt: 766, E(): 6.1e-25, (73.6% identity in 178 aa overlap). Similarity suggests ORF starts with ATA start codon. Mb0763 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE7" /protein_id="SIT99362.1" /translation="MIAAPEAIAAAATDLASIGSTIGAANAAAAANTTAVLAAGADQV SVAIAAAFGAHGQAYQALSAQAATFHIQFVQALTAGAGSYAAAEAASAASITSPLLDA INAPFLAALGRPLIGNGADGAPGTGAAGGAGGLLFGNGGAGGSGAPGGAGGLLFGNGG AGGPGASGGALG" CDS complement(835710..836267) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0764C" /product="HYPOTHETICAL PROTEIN" /note="Mb0764c, -, len: 185 aa. Equivalent to Rv0743c, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 185 aa overlap). Hypothetical unknown protein. Protein product from Mb0764c detected using SWATH mass spectrometry. Mb0764c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXA2" /protein_id="SIT99363.1" /translation="MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQAT ASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLV SWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP EETDPRIGQRIAAWLNYYGAGNHSS" CDS complement(836264..836770) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0765C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0765c, -, len: 168 aa. Equivalent to Rv0744c, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Possible transcriptional regulator, showing weak similarity with O86661|SC4A2.05 PUTATIVE TWO-COMPONENT SENSOR from Streptomyces coelicolor (436 aa), FASTA scores: opt: 117, E(): 0.88, (37.25% identity in 94 aa overlap); and some putative excisionases or transposases. Also weakly similar to P71902|YN10_MYCTU|Rv2310|MT2372|MTCY3G12.24c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (114 aa); and Q11144|Y477_MYCTU|Rv0477|MT0495|MTCY20G9.03 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (148 aa). Equivalent to AAK45006 from Mycobacterium tuberculosis strain CDC1551 (179 aa) but shorter 11 aa. Contains probable helix-turn helix motif from aa 5-26 (Score 1350, +3.78 SD). Mb0765c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWC8" /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR010093" /db_xref="InterPro:IPR041657" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC8" /protein_id="SIT99364.1" /translation="METLLKTSEAAQILGVSRQHVVNMCDRGEMVCVHVGSHRRVPSS EVERVTSRRLTREEERSLWLHRALLSPLLTEPDTVVSAARENLRRWSGMHRRDGMAGW YFTKWQRVLNDGLDAVMHVLTSPSEDAREMRQNSPFAGILPEATRVAVLRSFKDHWDR EHERAMTE" CDS 836978..837505 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0766" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0766, -, len: 175 aa. Equivalent to Rv0745, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 175 aa overlap). Conserved hypothetical protein; shows high similarity to a 50 aa region of Rv3649|Z95436|MTY15C10_3 CONSERVED HYPOTHETICAL PROTEIN, similar to ATP-dependent helicases, from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 225, E(): 7e-06, (70.0% identity in 50 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3XWP4" /protein_id="SIT99365.1" /translation="MGPPHRSRPPLPSPGPTCQVLPTTAVIHTVTAEALGRIGIDAPR IPGSLDVAAHAAIGLLPLVAGCDRRHRRPVRGARAGRAAQVSLCMTAIRVEPVSSNAV CTGPAAQVGDQSRSPQRDYAHQALQPDVPRRRARRHRPRRCSAKTGSSSSTMRCTCHQ NQCLWSSGVSWALAR" CDS 837525..839918 /codon_start=1 /transl_table=11 /gene="PE_PGRS9" /locus_tag="BQ2027_MB0767" /product="pe-pgrs family protein pe_pgrs9" /note="Mb0767, PE_PGRS9, len: 797 aa. Equivalent to Rv0746, len: 783 aa, from Mycobacterium tuberculosis strain H37Rv, (89.7% identity in 829 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to part of MTCY28.25c|Rv1759c|Z95890 antigen wag22 from M. tuberculosis (914 aa), FASTA scores: opt: 2429, E(): 0, (56.9% identity in 873 aa overlap). Also similar to other PE-PGRS FAMILY PROTEINS e.g. AL0212|MTV008_46 FASTA score: (48.8% identity in 887 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, two 48 bp deletions, and a 138 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (797 aa versus 783 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XW92" /protein_id="SIT99366.1" /translation="MSFVLAMPEVLGSAATDLAALGSVLGAADAAAAATTTGIVAAVQ DEVSAAIAALFSAHGRAYQVASAQAAAVHAQFVEALSAGAGAYASAEAAGAAVLANPA QSVQQDLLAAVNAQSVALTGRPLIGNGANGAPGTGANGAPGGWLLGNGGAGGSAAAGS GLPGGAGGAAGLFGTGGAGGAGGSSTVGDGGAGGAGGSGGWLLGTGGVGGVGGLGAGA GGAGGVGGAGGLLGAGGHGGAGGLGAVTGGVGGAGGAGGLLAGLLAGPGGAGGTGGRG FLNDGGVGGAGGNAGLLFGAGGTGGSGGAGLGGDGGAGGAGGNAGVLFGNAGSGGTGG FGDTDGGAGGAGGDAGWLGSGGVGGAGGFGETGDGGVGGAGGKAGLLIGNGGAGGAGV LIGNGGNAGIGGTGPTAGDTGAGGISGLLLGADGFNAPASASPLHTLKQQALAAINAP TQTLTGRPLIGNGTPGAVGSGATGAPGGWLLGDGGAGGSGAAGSGAPGGAGGAAGLWG TGGAGGAGGWLLGDGGAGGIGGASTVLGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDG GAGGAGGTGGLLAGLIGAGGGHGGTGGLNTNGDGGVGGAGGNAGMLAGPGGAGGAGGD GENLDTGGDGGAGGSAGLLFGSGGAGGAGGFGFLGGDGGAGGNAGLLLSSGGAGGFGG FGTAGGVGGAGGNAGWLGFGGAGGIGGIGGNANGGAGGNGGTGGQLWGSGGAGGEGGA ALSVGDTGGAGGVGGSAGLIGTGGNGGNGGTGANAGSPGTGGAGGLLLGQNGLNGLP" CDS 840317..843046 /codon_start=1 /transl_table=11 /gene="PE_PGRS10" /locus_tag="BQ2027_MB0768" /product="pe-pgrs family protein pe_pgrs10" /note="Mb0768, PE_PGRS10, len: 909 aa. Equivalent to Rv0747, len: 801 aa, from Mycobacterium tuberculosis strain H37Rv, (86.6% identity in 912 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to part of MTCY28.25c|Rv1759c|Z95890 antigen wag22 from M. tuberculosis (914 aa), FASTA scores: opt: 2772, E(): 0, (60.9% identity in 941 aa overlap). Also similar to other PE-PGRS FAMILY PROTEINS e.g. Z95844|MTCY493_2 FASTA score: (50.2% identity in 815 aa overlap). Contains PS00012 Phosphopantetheine attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 168 bp and 105 bp, a 11 bp for 71 bp substitution, and a 9 bp deletion (cggcaacgg-*), leads to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (909 aa versus 801 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC7" /protein_id="SIT99367.1" /translation="MSWVMVSPELVVAAAADLAGIGSAISSANAAAAVNTTGLLTAGA DEVSTAIAALFGAQGQAYQAASAQAAAFYAQFVQALSAGGGAYAAAEAAAVSPLLAPI NAQFVAATGRPLIGNGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAGAGGNGGAGGL FGSGGAGGAGGNAFGAGEVGGAGGAGGNAMLFGAGGAGGAGGAGGNAGMLFGAAGVGG VGGFSNGGATGGAGGAGGAGGLFTTGGVGGAGGAGGDGGDGGAGGLFGAGGTGGAGGF GTAVLAGTGGAGGPGGAGGLFGAGGEGGSGGSGNLTGGAGGAGGNAGTLATGDGGAGG TGGASRSGGFGGAGGAGGDAGMFFGSGGSGGAGGAGGSGGFGLPSGGKGGAGGDAGML FGSGGSGGAGGISRSVGDGAAGGAGGAPGLIGNGGNGGNGGASTGGGDGGPGGAGGTG VLIGNGGSGGTGATLGKAGIGGTGGVLLGLDGFTAPASTSPLHTLQQDVINMVNDPFQ TLTGRPLIGNGANGTPGTGADGGAGGWLFGNGGNGGQGTIGGVNGGAGGAGGAGGILF GTGGTGGSGGPGATGLGGIGGAGGAALLFGSGGAGGSGGAGAVGGNGGAGGNAGALLG AAGAGGAGGAGAVGGNGGAGGNGGLFANGGAGGPGGFGSPAGAGGIGGAGGNGGLFGA GGTGGAGGGSTLAGGAGGAGGNGGLFGAGGTGGAGSHSTAAGVSGGAGGAGGDAGLLS LGASGGAGGSGGSSLTAAGVVGGIGGAGGLLFGSGGAGGSGGFSNSGNGGAGGAGGDA GLLVGSGGAGGAGASATGAATGGDGGAGGKSGAFGLGGDGGAGGATGLSGAFHIGGKG GVGGSAVLIGNGGNGGNGGNSGNAGKSGGAPGPSGAGGAGGLLLGENGLNGLM" CDS 843137..843394 /codon_start=1 /transl_table=11 /gene="vapb31" /locus_tag="BQ2027_MB0769" /product="possible antitoxin vapb31" /note="Mb0769, -, len: 85 aa. Equivalent to Rv0748, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein, N-terminus similar to N-terminal region of NP_436939.1|NC_003078 HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (75 aa). Also similar to Mycobacterium tuberculosis proteins Rv2871 CONSERVED HYPOTHETICAL PROTEIN (75 aa); Rv1241, Rv2132, Rv3321c, etc. Protein product from Mb0769 detected using SWATH mass spectrometry. Mb0769 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWA8" /db_xref="InterPro:IPR002145" /db_xref="UniProtKB/TrEMBL:A0A1R3XWA8" /protein_id="SIT99368.1" /translation="MRTTVSISDEILAAAKRRARERGQSLGAVIEDALRREFAAAHVG GARPTVPVFDGGTGPRRGIDLTSNRALSEVLDEGLELNSRK" CDS 843418..843846 /codon_start=1 /transl_table=11 /gene="vapc31" /locus_tag="BQ2027_MB0770" /product="possible toxin vapc31. contains pin domain." /note="Mb0770, -, len: 142 aa. Equivalent to Rv0749, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0749, Rv0277c, Rv2530c, etc. Mb0770 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWA0" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XWA0" /protein_id="SIT99369.1" /translation="MFLLDANVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVW ASFLRLATNRRIFEIPSPRAEAFAFVEAVTAQPHHLPTNPGPRHLMLLRKLCDEADAS GDLIPDAVLAAIAVGHHCAVVSLDRDFARFASVRHIRPPL" CDS complement(843927..844064) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0771C" /product="Protein involved in DNA integration" /note="Mb0771c, -, len: 45 aa. Equivalent to Rv0749A, len: 45 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 45 aa overlap). Conserved hypothetical protein (probably gene fragment), similar to part (aa 250-292) of Rv2807|Z81331_12 from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 238, E(): 1.9e-13, (79.07% identity in 43 aa overlap). Mb0771c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XWA1" /protein_id="SIT99370.1" /translation="MVRKHAFHWRYDSTEELELLNQLWQLVSLRLNFFTPTKKALGFR P" CDS 844223..844468 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0772" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0772, -, len: 81 aa. Equivalent to Rv0750, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Conserved hypothetical protein, showing almost perfect overlap with C-terminus of Rv0740|MTV041_14 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (175 aa), FASTA scores: (93.8% identity in 81 aa overlap). Possible duplication. Mb0772 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XW98" /protein_id="SIT99371.1" /translation="MRAIVGDCVIHIMPMGTGVELSKLADLALDIGRSVGCSAYENDF TLPDIPTQWRNQPLGWYTQGLAPYLPGLSDPKDAAEG" CDS complement(844537..845421) /codon_start=1 /transl_table=11 /gene="mmsB" /locus_tag="BQ2027_MB0773C" /product="PROBABLE 3-HYDROXYISOBUTYRATE DEHYDROGENASE MMSB (HIBADH)" /note="Mb0773c, mmsB, len: 294 aa. Equivalent to Rv0751c, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 294 aa overlap). Probable mmsB, 3-hydroxyisobutyrate dehydrogenase (EC 1.1.1.31), highly similar to others e.g. NP_102847.1|NC_002678 3-hydroxyisobutyrate dehydrogenase from Mesorhizobium loti (294 aa); NP_420167.1|NC_002696 3-hydroxyisobutyrate dehydrogenase from Caulobacter crescentus (298 aa); A32867 3-hydroxyisobutyrate dehydrogenase from Rattus norvegicus (346 aa); etc. Also similar to methylmalonate semialdehyde dehydrogenases e.g. M84911|PSE MMSRAB_3 methylmalonate semialdehyde dehydrogenase from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 786, E(): 0, (45.8% identity in 297 aa overlap). Also similar to 6-phosphogluconate dehydrogenases from Mycobacterium tuberculosis e.g. Rv1122 and Rv1844c. Contains PS00895 3-hydroxyisobutyrate dehydrogenase signature. BELONGS TO THE 3-HYDROXYISOBUTYRATE DEHYDROGENASE FAMILY. Protein product from Mb0773c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0773c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63936" /db_xref="InterPro:IPR002204" /db_xref="InterPro:IPR006115" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR011548" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR015815" /db_xref="InterPro:IPR029154" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P63936" /protein_id="SIT99372.1" /translation="MTTIAFLGLGNMGAPMSANLVGAGHVVRGFDPAPTAASGAAAHG VAVFRSAPEAVAEADVVITMLPTGEVVRRCYTDVLAAARPATLFIDSSTISVTDAREV HALAESHGMLQLDAPVSGGVKGAAAATLAFMVGGDESTLRRARPVLEPMAGKIIHCGA AGAGQAAKVCNNMVLAVQQIAIAEAFVLAEKLGLSAQSLFDVITGATGNCWAVHTNCP VPGPVPTSPANNDFKPGFSTALMNKDLGLAMDAVAATGATAPLGSHAADIYAKFAADH ADLDFSAVIHTLRARADA" CDS complement(845432..846604) /codon_start=1 /transl_table=11 /gene="fadE9" /locus_tag="BQ2027_MB0774C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE9" /note="Mb0774c, fadE9, len: 390 aa. Equivalent to Rv0752c, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 390 aa overlap). Probable fadE9, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. NP_437985.1|NC_003078 putative acyl-CoA dehydrogenase protein from Sinorhizobium meliloti (380 aa); Z99123|BSUB0020_14 from Bacillus subtilis (379 aa), FASTA scores: opt: 853, E(): 0, (39.8% identity in 384 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, and PS00073 Acyl-Co Adehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb0774c detected using SWATH mass spectrometry. Mb0774c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXB2" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB2" /protein_id="SIT99373.1" /translation="MFVLNDDERVIVETAAAFAGKRLAPHALEWDAAKHFPVDVLREA AELGMAAIYCRDDVGGSGLRRLDGARIFEQLAIADPVTAAFLSIHNMCAWMIDSFGTD EQRKDWIPRLATMGVIASYCLTEPGAGSDAGALSTRAVRHGSGKGGDYVLDGVKQFIS GAAASDVYVVMARTGAEGPRGVSAFVVEKGTPGLSFGAPEAKMGWHAQPTAQVVLDGV RVPAEAMLGGADGEGAGFGIAMSGLNGGRLNIAACSLGGAQAAFDKAGAYVRDRQAFG GSLLDEPTVRFTLADMATGLQTSRMLLWRAASALDDDDADKVELCAMAKRYVTDTCFE VADQALQLHGGYGYLREYGLEKIVRDLRVHRILEGTNEIMRLVIGRAEAARFRATV" CDS complement(846611..848143) /codon_start=1 /transl_table=11 /gene="mmsA" /locus_tag="BQ2027_MB0775C" /product="PROBABLE METHYLMALONATE-SEMIALDEHYDE DEHYDROGENASE MMSA (METHYLMALONIC ACID SEMIALDEHYDE DEHYDROGENASE) (MMSDH)" /note="Mb0775c, mmsA, len: 510 aa. Equivalent to Rv0753c, len: 510 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 510 aa overlap). Probable mmsA, methylmalonic acid semialdehyde dehydrogenase (EC 1.2.1.27), highly similar to others e.g. NP_420115.1|NC_002696 putative methylmalonate-semialdehyde dehydrogenase from Caulobacter crescentus (499 aa); L48550|STMMSDA_1|CAB75315.1|AL139164 methylmalonic acid semialdehyde dehydrogenase from Streptomyces coelicolor (500 aa), FASTA score: (51.6% identity in 498 aa overlap); M84911|PSEMMSRAB_2|NP_252260.1|NC_002516 methylmalonate-semialdehyde dehydrogenase from Pseudomonas aeruginosa (497 aa), FASTA scores: opt: 1127, E(): 0, (47.9% identity in 507 aa overlap); etc. Note that also highly similar to malonic semialdehyde oxidative decarboxylases e.g. NP_104968.1|NC_002678 malonic semialdehyde oxidative decarboxylase from Mesorhizobium loti (498 aa); NP_384832.1|NC_003047 PUTATIVE MALONIC SEMIALDEHYDE OXIDATIVE DECARBOXYLASE PROTEIN from Sinorhizobium meliloti (498 aa); etc. Contains PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Protein product from Mb0775c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0775c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWE3" /db_xref="InterPro:IPR010061" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016160" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99374.1" /translation="MTTQISHFIDGQRTAGQSTRSADVFDPNTGQIQAKVPMAGKSDI DAAVASAVEAQKGWAAWNPQRRARVLMRFIELVNDTIDELAELLSREHGKTLADARGD VQRGIEVIEFCLGIPHLLKGEYTEGAGPGIDVYSLRQPLGVVAGITPFNFPAMIPLWK AGPALACGNAFVLKPSERDPSVPVRLAELFIEAGLPAGVFQVVHGDKEAVDAILHHPD IKAVGFVGSSDIAQYIYAGAAATGKRAQCFGGAKNHMIVMPDADLDQAVDALIGAGYG SAGERCMAISVAVPVGDQTAERLRARLIERINNLRVGHSLDPKADYGPLVTGAALARV RDYIGQGVAAGAELVVDGRDRASDDLTFGLPEGDANLEGGFFIGPTLFDHVAAHMSIY TDEIFGPVLCMVRARDYEEALRLPSEHEYGNGVAIFTRDGDAARDFVSRGQVGMVGVN VPIPVPVAYHTFGGWKRSGFGDLNQHGPAAIQFYTKVKTVTSRWPSGIKDGAEFVIPT MS" CDS 848349..850103 /codon_start=1 /transl_table=11 /gene="PE_PGRS11" /locus_tag="BQ2027_MB0776" /product="pe-pgrs family protein pe_pgrs11" /note="Mb0776, PE_PGRS11, len: 584 aa. Equivalent to Rv0754, len: 584 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 584 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others e.g. AL0212|MTV008_46 from Mycobacterium tuberculosis (1660 aa), FASTA score: (48.7% identity in 345 aa overlap); Z80225|MTCY441_4 from Mycobacterium tuberculosis (778 aa), FASTA score: (41.6% identity in 442 aa overlap); etc. Mb0776 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3XWQ3" /protein_id="SIT99375.1" /translation="MSFVIVARDALAAAAADLAQIGSAVNAGNLAAANPTTAVAAAAA DEVSAALAALFGAHAREYQAAAAQAAAYHEQFVHRLSAAATSYAVTEVTIATSLRGAL GSAPASVSDGFQAFVYGPIHATGQQWINSPVGEALAPIVNAPTNVLLGRDLIGNGVTG TAAAPNGGPGGLLFGDGGAGYTGGNGGSAGLIGNGGTGGAGFAGGVGGMGGTGGWLMG NGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAGGVDGAIGRGGWFIGTGGMATIGG GGNGQSIVIDFVRHGQTPGNAAMLIDTAVPGPGLTALGQQQAQAIANALAAKGPYAGI FDSQLIRTQQTAAPLANLLGMAPQVLPGLNEIHAGIFEDLPQISPAGLLYLVGPIAWT LGFPIVPMLAPGSTDVNGIVFNRAFTGAVQTIYDASLANPVVAADGNITSVAYSSAFT IGVGTMMNVDNPHPLLLLTHPVPNTGAVVVQGNPEGGWTLVSWDGIPVGPASLPTALF VDVRELITAPQYAAYDIWESLFTGDPAAVINAVRDGADEVGAAVVQFPHAVADDVIDA TGHPYLSGLPIGLPSLIP" CDS complement(850293..852230) /codon_start=1 /transl_table=11 /gene="PPE12" /locus_tag="BQ2027_MB0777C" /product="ppe family protein ppe12" /note="Mb0777c, PPE12, len: 645 aa. Equivalent to Rv0755c, len: 645 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 645 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to others e.g. Z82098|MTCY3C7_23 from Mycobacterium tuberculosis (582 aa), FASTA scores: (56.1% identity in 636 aa overlap); Z92774|MTCY6G11_5 from Mycobacterium tuberculosis (552 aa), FASTA scores: (55.8% identity in 590 aa overlap); etc. Mb0777c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XW96" /protein_id="SIT99376.1" /translation="MVGFAWLPPETNSLRMYLGAGSRPLLAAAGAWDGLAEELHAAAS SFGSVTSELAGGAWQGPASAAMANAAGPYASWLTAAGAQAELAARQARAAAGAFEEAL AGVVHPAVVQANRVRTWLLAVSNVFGQNAPAIAAMESTYEQMWAQDVAVMAGYHAASS AAAAQLASWQPALPNINLGVGNIGNLNVGNGNTGDYNLGNGNLGNANFGGGNGSAFHG QISSFNVGSGNIGNFNLGSGNGNVGIGPSSFNVGSGNIGNANVGGGNSGDNNFGFGNF GNANIGIGNAGPNMSSPAVPTPGNGNVGIGNGGNGNFGGGNTGNANIGLGNVGDGNVG FGNSGSYNFGFGNTGNNNIGIGLTGSNQIGFGGLNSGSGNIGFGNSGTGNIGFFNSGS GNFGVGNSGVTNTGVANSGNINTGFGNSGFINTGFGNALSVNTGFGNSGQANTGIGNA GDFNTGNFNGGIINTGSFNSGAFNSGSFNGGDANSGFLNSGLTNTGFANSGNINTGGF NAGNLNTGFGNTTDGLGENSGFGNAGSGNSGFNNSGKGNSGAQNVGNLQISGFANSGQ SVTGYNNSVSVTSGFGNKGTGLFSGFMSGFGNTGFLQSGFGNLEANPDNNSATSGFGN SGKQDSGGFNSIDFVSGFFHR" CDS complement(852532..852717) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0778C" /product="PUTATIVE TRANSPOSASE (FRAGMENT)" /note="Mb0778c, -, len: 61 aa. Equivalent to Rv0755A, len: 61 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 61 aa overlap). Putative transposase (possibly gene fragment), similar to C-terminal part of Q9EZM2|ISMav2|AF286339_1 putative transposase from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 284, E(): 5e-13, (83.02% identity in 53 aa overlap); and to SCJ11.25c|Q9RI80 possible noncomposite transposon transposase from Streptomyces coelicolor (283 aa). Mb0778c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWD0" /db_xref="InterPro:IPR010921" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD0" /protein_id="SIT99377.1" /translation="MKELSVAEQRYQAVLAVISDGLSISQVAEKVGVSRQTLHTWLAR YEAEGLDGLRIGTGTAL" tRNA complement(852826..852897) /locus_tag="BQ2027_THRV" /product="tRNA-Thr" /note="thrV, len: 72 nt. Equivalent to thrV, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Thr, anticodon tgt." CDS complement(852931..853656) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0779C" /product="unknown protein" /note="Mb0779c, -, len: 241 aa. Equivalent to Rv0756c, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Hypothetical unknown protein. Protein product from Mb0779c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0779c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XWB4" /protein_id="SIT99378.1" /translation="MNLGQTLVGIATWPARAGLAAADTGLNMAGAAVDMAKQALGDAG GASGSTSMANMLGIDDTIARANRLARLLDDDMPLGRAIAPNGPMDRMLRPGGVVDLLT QPGGLLDRLTAEGGAMQRALQPGGLADQLLAEDGLIERVLSEDGLADRLLAEGGLIDK ITAKDGPLEQLADVADTLARLTPGMEALEPAIATLQDAVIALTMVVNPLSSIAERIPL PGRRPARRSSSRSVRSQRVVDSE" CDS 853798..854541 /codon_start=1 /transl_table=11 /gene="phoP" /locus_tag="BQ2027_MB0780" /product="POSSIBLE TWO COMPONENT SYSTEM RESPONSE TRANSCRIPTIONAL POSITIVE REGULATOR PHOP" /note="Mb0780, phoP, len: 247 aa. Equivalent to Rv0757, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Possible phoP, two component system response phosphate regulon transcriptional regulator, highly similar to various transcriptional regulators e.g. CAC32360.1|AL583945 putative two component system response regulator from Streptomyces coelicolor (271 aa); T45446 probable two-component response regulator from Mycobacterium leprae (253 aa); and similar to phoP proteins e.g. P13792|PHOP_BACSU alkaline phosphatase synthesis transcription regulatory protein from Bacillus subtilis (240 aa), FASTA scores: opt: 594, E(): 2.3e-33, (41.0% identity in 234 aa overlap); etc. Also highly similar to Rv3765c from Mycobacterium tuberculosis (234 aa), Rv1033c (257 aa), RV0903c|MTCY31.31c|Q10531 (236 aa), FASTA score: (45.4% identity in 229 aa overlap); MTCY10G2_16 and MTU88959_1. Protein product from Mb0780 detected using shotgun mass spectrometry. Mb0780 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWA9" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3XWA9" /protein_id="SIT99379.1" /translation="MRKGVDLVTAGTPGENTTPEARVLVVDDEANIVELLSVSLKFQG FEVYTATNGAQALDRARETRPDAVILDVMMPGMDGFGVLRRLRADGIDAPALFLTARD SLQDKIAGLTLGGDDYVTKPFSLEEVVARLRVILRRAGKGNKEPRNVRLTFADIELDE ETHEVWKAGQPVSLSPTEFTLLRYFVINAGTVLSKPKILDHVWRYDFGGDVNVVESYV SYLRRKIDTGEKRLLHTLRGVGYVLREPR" CDS 854586..856043 /codon_start=1 /transl_table=11 /gene="phoR" /locus_tag="BQ2027_MB0781" /product="POSSIBLE TWO COMPONENT SYSTEM RESPONSE SENSOR KINASE MEMBRANE ASSOCIATED PHOR" /note="Mb0781, phoR, len: 485 aa. Equivalent to Rv0758, len: 485 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 485 aa overlap). Possible phoR, two component system response phosphate sensor kinase membrane-associated (EC 2.7.-.-), highly similar to various sensor kinases e.g. CAC32361.1|AL583945 putative two component system histidine kinase from Streptomyces coelicolor (524 aa); NP_349365.1|NC_003030 Membrane-associated sensory histidine kinase with HAMP domain from Clostridium acetobutylicum (482 aa); and similar to phoP proteins e.g. NP_372216.1|NC_002758 alkaline phosphatase synthesis sensor protein from Staphylococcus aureus (554 aa); P23545|PHOR_BACSU alkaline phosphatase synthesis sensor from Bacillus subtilis (579 aa), FASTA scores: opt: 515, E(): 1.9e-25, (40.0% identity in 230 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. MTCY20G9.16 FASTA scores: (34.5% identity in 264 aa overlap), MTU88959_2 (509 aa), MTCY10G2_17, etc. Protein product from Mb0781 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0781 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWC2" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC2" /protein_id="SIT99380.1" /translation="MARHLRGRLPLRVRLVAATLILVATGLVASGIAVTSMLQHRLTS RIDRVLLEEAQIWAQITLPLAPDPYPIHNPDRPPSRFYVRVISPDGQSYTALNDNTAI PAVPANNDVGRHPTTLPSIGGSKTLWRAVSVRASDGYLTTVAIDLADVRSTVRSLVLL QVGIGSAVLVVLGVAGYAVVRRSLRPLAEFEQTAAAIGAGQLDRRVPQWHPRTEVGRL SLALNGMLAQIQRAVASAESSAEKARDSEDRMRQFITDASHELRTPLTTIRGFAELYR QGAARDVGMLLSRIESEASRMGLLVDDLLLLARLDAHRPLELCRVDLLALASDAAHDA RAMDPKRRITLEVLDGPGTPEVLGDESRLRQVLRNLVANAIQHTPESADVTVRVGTEG DDAILEVADDGPGMSQEDALRVFERFYRADSSRARASGGTGLGLSIVDSLVAAHGGAV TVTTALGEGCCFRVSLPRVSDVDQLSLTPVVPGPP" CDS complement(856015..856347) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0782C" /product="HIT family protein" /note="Mb0782c, -, len: 110 aa. Equivalent to Rv0759c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Conserved hypothetical protein, highly similar (but shorter 45 aa in N-terminus) to P49774|YHIT_MYCLE|ML2237|MLCB5.04c|U296A HYPOTHETICAL HIT-LIKE PROTEIN from Mycobacterium leprae (155 aa), FASTA scores: opt: 766, E(): 0, (78.7% identity in 150 aa overlap). Also highly similar (but N-terminus always shorter) to HIT-like proteins and protein kinase inhibitors e.g. AAF72728.1|AF265258_1|AF265258 HIT-like protein from Rhodococcus sp. (141 aa); NP_212513.1|NC_001318 protein kinase C1 inhibitor (pkcI) from Borrelia burgdorferi (149 aa); P94252|YHIT_BORBU|BB0379 HYPOTHETICAL HIT-LIKE PROTEIN from Borrelia burgdorferi (139 aa); NP_110768.1|NC_002689 HIT (histidine triad) family protein from Thermoplasma volcanium (158 aa); P16436|IPK1_BOVIN protein kinase C inhibitor 1 (pkci-1) from Bos taurus (Bovine) (125 aa), FASTA scores: opt: 195, E(): 5.2e-08, (33.3% identity in 111 aa overlap); etc. Also shows similarity with Rv2613c|MTCY01A10.20A CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (195 aa) and Rv1262c|MTCY50.20 HYPOTHETICAL HIT-LIKE PROTEIN (144 aa). Protein product from Mb0782c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0782c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5B6" /db_xref="InterPro:IPR001310" /db_xref="InterPro:IPR011146" /db_xref="InterPro:IPR019808" /db_xref="InterPro:IPR036265" /db_xref="UniProtKB/Swiss-Prot:P0A5B6" /protein_id="SIT99381.1" /translation="MAFLTIEPMTQGHTLVVPRAEIDHWQNVDPALFGRVMSVSQLIG KAVCRAFSTQRAGMIIAGLEVPHLHIHVFPTRSLSDFGFANVDRNPSPGSLDEAQAKI RAALAQLA" CDS complement(856456..856875) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0783C" /product="Nuclear transport factor 2" /note="Mb0783c, -, len: 139 aa. Equivalent to Rv0760c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein, similar to N-terminal part of Rv2042c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (265 aa), FASTA scores: opt: 150, E(): 4.1e-05, (28.7% identity in 136 aa overlap). Protein product from Mb0783c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0783c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002075" /db_xref="InterPro:IPR032710" /db_xref="UniProtKB/TrEMBL:A0A1R3XYG6" /protein_id="SIT99382.1" /translation="MTQTTQSPALIASQSSWRCVQAHDREGWLALMADDVVIEDPIGK SVTNPDGSGIKGKEAVGAFFDTHIAANRLTVTCEETFPSSSPDEIAHILVLHSEFDGG FTSEVRGVFTYRVNKAGLITNMRGYWNLDMMTFGNQE" CDS complement(856888..858015) /codon_start=1 /transl_table=11 /gene="adhB" /locus_tag="BQ2027_MB0784C" /product="possible zinc-containing alcohol dehydrogenase nad dependent adhb" /note="Mb0784c, adhB, len: 375 aa. Equivalent to Rv0761c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 375 aa overlap). Possible adhB, zinc-containing alcohol dehydrogenase NAD-dependant (EC 1.1.1.1), similar to others e.g. AAC15839.1|AF060871_4 hypothetical alcohol dehydrogenase from Rhodococcus rhodochrous (370 aa), FASTA scores: opt: 1234, E(): 0, (46.8% identity in 370 aa overlap); P80468|ADH2_STRCA ALCOHOL DEHYDROGENASE II from Struthio camelus (Ostrich) (379 aa); Q03505|ADH1_RABIT alcohol dehydrogenase alpha chain from Oryctolagus cuniculus (Rabbit) (374 aa), FASTA scores: opt: 872, E(): 0, (39.1% identity in 379 aa overlap); etc. Also similar to adhD alcohol dehydrogenase from Mycobacterium tuberculosis (368 aa). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY. Protein product from Mb0784c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0784c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U1B9" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR023921" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7U1B9" /protein_id="SIT99383.1" /translation="MKTKGALIWEFNQPWSVEEIEIGDPRKDEVKIQMEAAGMCRSDH HLVTGDIPMAGFPVLGGHEGAGIVTEVGPGVDDFAPGDHVVLAFIPSCGKCPSCQAGM RNLCDLGAGLLAGESVTDGSFRIQARGQNVYPMTLLGTFSPYMVVHRSSVVKIDPSVP FEVACLVGCGVTTGYGSAVRTADVRPGDDVAIVGLGGVGMAALQGAVSAGARYVFAVE PVEWKRDQALKFGATHVYPDINAALMGIAEVTYGLMAQKVIITVGKLDGADVDSYLTI TAKGGTCVLTAIGSLVDTQVTLNLAMLTLLQKNIQGTIFGGGNPHYDIPKLLSMYKAG KLNLDDMVTTAYKLEQINDGYQDMLNGKNIRGVIRYTDDDR" CDS complement(858114..858659) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0785C" /product="intracellular transport" /note="Mb0785c, -, len: 181 aa. Equivalent to Rv0762c, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Conserved hypothetical protein, showing weak similarity to D90907_77|P73575 HYPOTHETICAL 31.3KD PROTEIN from Synechocystis sp, FASTA scores: E(): 0.0012, (30.4% identity in 92 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb0785c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032710" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE9" /protein_id="SIT99384.1" /translation="MAGYPRDELEDVVHRWLQANRTAERRGDWTLLADFYTDDATYGW NVGPNEDVMCVGIDEIRDIALGQEMDGLQGWRYPYQRVVIDEKQGEVVGFWKQVATDA NGAEQEVYGIGGSWFRYAGGGKWNWQRDFFDFGHVSALYLELIKAGKLSPGMQKRIER AVSGNKVPGYYPLGKTPVPLW" CDS complement(858662..858868) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0786C" /product="POSSIBLE FERREDOXIN" /note="Mb0786c, -, len: 68 aa. Equivalent to Rv0763c, len: 68 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Possible ferredoxin, similar to others and related proteins e.g. P18324|FER1_STRGO|SUAB ferredoxin 1 (fd-1) from Streptomyces griseolus (68 aa); AAK31349.1|AF350429_2|AF350429 putative ferredoxin from Nocardioides sp (63 aa); AAK16536.1|AF331043_16|AF331043 phthalate dioxygenase ferredoxin subunit from Arthrobacter keyseri (64 aa); etc. Probably involved in electron transport for cytochrome P-450 system e.g. downstream ORF Rv0764c|MTCY369.09c PROBABLE CYTOCHROME P450 51 from Mycobacterium tuberculosis (451 aa), FASTA scores: opt: 137, E(): 0.00013, (36.4% identity in 66 aa overlap). Also similar to putative ferredoxins Rv3503c and Rv1786 from Mycobacterium tuberculosis. COULD BELONG TO THE BACTERIAL TYPE FERREDOXIN FAMILY." /db_xref="UniProtKB/TrEMBL:A0A1R3XWR1" /protein_id="SIT99385.1" /translation="MGYRVEADRDLCQGHAMCELEAPEYFRVPKRGQVEILDPEPPEE ARGVIKHAVWACPTQALSIRETGE" CDS complement(858871..860226) /codon_start=1 /transl_table=11 /gene="cyp51" /locus_tag="BQ2027_MB0787C" /product="CYTOCHROME P450 51 CYP51 (CYPL1) (P450-L1A1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 14-ALPHA DEMETHYLASE) (P450-14DM)" /note="Mb0787c, cyp51, len: 451 aa. Equivalent to Rv0764c, len: 451 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 451 aa overlap). cyp51, cytochrome P450 51 (sterol 14-alpha demethylase) (EC 1.14.14.-), similar to others e.g. Q16850|CP51_HUMAN CYTOCHROME P450 51 (CYPL1) (P450L1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 14-ALPHA DEMETHYLASE) from Homo sapiens (509 aa), FASTA scores: opt: 848, E(): 0, (33.9% identity in 439 aa overlap); NP_172633.1|NC_003070 putative obtusifoliol 14-alpha demethylase from Arabidopsis thaliana (488 aa); P93596|CP51_WHEAT CYTOCHROME P450 51 (CYPL1) (P450-L1A1) (OBTUSIFOLIOL 14-ALPHA DEMETHYLASE) from Triticum aestivum (453 aa); etc. Also similar to many other Mycobacterium tuberculosis cytochromes P450 e.g. Rv1394c, FASTA score: (22.5% identity in 444 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb0787c detected using SWATH mass spectrometry. Mb0787c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A513" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002403" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P0A513" /protein_id="SIT99386.1" /translation="MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQ LAGKQVVLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEMLHN AALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSACLIGKKFRDQ LDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNGLVALVADIMNGRIANPPT DKSDRDMLDVLIAVKAETGTPRFSADEITGMFISMMFAGHHTSSGTASWTLIELMRHR DAYAAVIDELDELYGDGRSVSFHALRQIPQLENVLKETLRLHPPLIILMRVAKGEFEV QGHRIHEGDLVAASPAISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRH RCVGAAFAIMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG V" CDS complement(860226..861053) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0788C" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0788c, -, len: 275 aa. Equivalent to Rv0765c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar others e.g. P39071|DHBA_BACSU 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase from Bacillus subtilis (261 aa), FASTA scores: opt: 385, E(): 1.8e-17, (30.6% identity in 252 aa overlap); AAF81239.1|AF263012 putative beta-ketoacyl reductase from Streptomyces griseus (274 aa); NP_436514.1|NC_003037 putative oxidoreductase from Sinorhizobium meliloti (240 aa); etc. Also similar to several other oxidoreductases from Mycobacterium tuberculosis e.g. Rv1544|MTCY48.21, FASTA score: (32.6% identity in 267 aa overlap); etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature. Protein product from Mb0788c detected using SWATH mass spectrometry. Mb0788c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWE2" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE2" /protein_id="SIT99387.1" /translation="MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRM DKLAELVDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLPGQ LHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGLRQRPHMGAYG AAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQLSAEQVGPMLADWAKWGQA RHNYFLRPSDLARAIAFVAETPRGCVVVNMEIQPEAPLRDAPAHRQKLVLGEEGMPG" CDS complement(861053..862261) /codon_start=1 /transl_table=11 /gene="cyp123" /locus_tag="BQ2027_MB0789C" /product="PROBABLE CYTOCHROME P450 123 CYP123" /note="Mb0789c, cyp123, len: 402 aa. Equivalent to Rv0766c, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 402 aa overlap). Probable cyp123, cytochrome P-450 (EC 1.14.-.-), similar to others e.g. P33271|CPXK_SACER cytochrome P-450 107B1 from Saccharopolyspora erythraea (405 aa), FASTA scores: opt: 770, E(): 0, (36.9% identity in 406 aa overlap); T36526 probable cytochrome P450 hydroxylase from Streptomyces coelicolor (411 aa); P27632|CPXM_BACSU CYTOCHROME P450 109 from Bacillus subtilis (405 aa); etc. Also similar to several other cytochromes P-450 from Mycobacterium tuberculosis e.g. Rv1256c|MTCY50.26 (405 aa), FASTA score: (35.2% identity in 389 aa overlap); etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb0789c detected using SWATH mass spectrometry. Mb0789c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63708" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63708" /protein_id="SIT99388.1" /translation="MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNF WAVSRHHDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTLVS KGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVISELIGVPDTDR ARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIAEFRRRPANNLTSALLAAE LDGDRLSDQEIMAFLFLMVIAGNETTTKLLANAVYWAAHHPGQLARVFADHSRIPMWV EETLRYDTSSQILARTVAHDLTLYDTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGR EIGCKLVSFGSGAHFCLGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGF AHLPISVQAR" CDS complement(862258..862899) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0790C" /product="Transcriptional regulator, AcrR family" /note="Mb0790c, -, len: 213 aa. Equivalent to Rv0767c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Conserved hypothetical protein, showing weak similarity with AL133220|SCC75A_26 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (215 aa), FASTA scores: opt: 152, E(): 0.0048, (28.4% identity in 204 aa overlap). Protein product from Mb0790c detected using SWATH mass spectrometry. Mb0790c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67433" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="UniProtKB/Swiss-Prot:P67433" /protein_id="SIT99389.1" /translation="MSSDVLVTTPAQRQTEPHAEAVSRNRRQQATFRKVLAAAMATLR EKSYADLTVRLVAARAKVAPATAYTYFSSKNHLIAEVYLDLVRQVPCVTDVNVPMPIR VTSSLRHLALVVADEPEIGAACTAALLDGGADPAVRAVRDRIGAEIHRRITSAIGPGA DPGTVFALEMAFFGALVQAGSGTFTYHEIADRLGYVVGLILAGANEPSTGGSE" CDS 863101..864570 /codon_start=1 /transl_table=11 /gene="aldA" /locus_tag="BQ2027_MB0791" /product="probable aldehyde dehydrogenase nad dependent alda (aldehyde dehydrogenase [nad+])" /note="Mb0791, aldA, len: 489 aa. Equivalent to Rv0768, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 489 aa overlap). Probable aldA, NAD-dependent aldehyde dehydrogenase (EC 1.2.1.-), highly similar to others e.g. AAL14238.1|AY052630 6-oxolauric acid dehydrogenase from Rhodococcus ruber (474 aa); NP_285450.1|NC_001264 aldehyde dehydrogenase from Deinococcus radiodurans (495 aa); NP_241405.1|NC_002570 NADP-dependent aldehyde dehydrogenase from Bacillus halodurans (498 aa); P42757|DHAB_ATRHO betaine-aldehyde dehydrogenase precursor from Atriplex hortensis (Mountain spinach) (502 aa), FASTA scores: opt: 1001, E(): 0, (35.6% identity in 486 aa overlap); etc. Also highly similar to Rv0223c ALDEHYDE DEHYDROGENASE from Mycobacterium tuberculosis (487 aa). Contains PS00687 Aldehyde dehydrogenases glutamic acid active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Protein product from Mb0791 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0791 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWD8" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="InterPro:IPR026460" /db_xref="InterPro:IPR029510" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD8" /protein_id="SIT99390.1" /translation="MALWGDGISALLIDGKLSDGRAGTFPTVNPATEEVLGVAADADA EDMGRAIEAARRAFDSTDWSRNTELRVRCVRQLRDAMQQHVEELRELTISEVGAPRML TASAQLEGPVGDLSFAADTAESYPWKQDLGEASPLGIATRRTLAREAVGVVGAITPWN FPHQINLAKLGPALAAGNTVVLKPAPDTPWCAAALGEIIVEHTDFPPGVVNIVTSSSH ALGALLAKDPRVDMISFTGSTATGRAVMADAAATIKKVFLELGGKSAFVVLDDADLAA ASAVSAFSACMHAGQGCAITTRLVVPRARYEEAVAIAAATMSSIRPGDPNDPGTVCGP LISARQRDRVQGYLDLAVAEGGRFACGGARPADREVGFYIEPTVIAGLTNDARVAREE IFGPVLTVIAHDGDDDAVRIANDSPYGLSGTVYGADPQRAARIASRLRVGTVNVNGGV WYCADAPFGGYKQSGIGREMGLLGFEEYLEAKLIATAAN" CDS 864601..865347 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0792" /product="PROBABLE DEHYDROGENASE/REDUCTASE" /note="Mb0792, -, len: 248 aa. Equivalent to Rv0769, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 248 aa overlap). Probable dehydrogenase/reductase (EC 1.-.-.-), similar to others, especially short-chain type dehydrogenases/reductases and 3-oxoacyl-(acyl-carrier protein) reductases e.g. NP_106890.1|NC_002678 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mesorhizobium loti (374 aa); NP_243357.1|NC_002570 3-oxoacyl-(acyl-carrier protein) reductase from Bacillus halodurans (246 aa); P28643|FABG_CUPLA 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Cuphea lanceolata (320 aa); P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 536, E(): 6.5e-27, (37.7% identity in 247 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. MTCY02B10.14, FASTA score: (33.7% identity in 249 aa overlap); etc. Protein product from Mb0792 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0792 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XWC3" /protein_id="SIT99391.1" /translation="MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAV AKQIVADGGTVIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLLLT VPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYSNFYGLAKVGV NGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELVKNMVQTIPLSRMGTPEDL VGMCLFLLSDSASWITGQIFNVDGGQIIRS" CDS 865445..866332 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0793" /product="PROBABLE DEHYDROGENASE/REDUCTASE" /note="Mb0793, -, len: 295 aa. Equivalent to Rv0770, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 295 aa overlap). Probable dehydrogenase/reductase, 3-hydroxyisobutyrate dehydrogenase family (EC 1.1.1.-), possibly 3-hydroxyisobutyrate dehydrogenase (EC 1.1.1.31) or 2-hydroxy-3-oxopropionate reductase (EC 1.1.1.60), similar to others e.g. P23523|GARR_ECOLI 2-HYDROXY-3-OXOPROPIONATE REDUCTASE (TARTRONATE SEMIALDEHYDE REDUCTASE) (TSAR) from Escherichia coli strain K12 (294 aa), FASTA scores: opt: 469, E(): 6.7e-22, (34.4% identity in 282 aa overlap); P28811|MMSB_PSEAE 3-hydroxyisobutyrate dehydrogenase (HIBADH) from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 439, E(): 4.3e-20, (34.9% identity in 269 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv1122 and Rv1844c. SEEMS TO BELONG TO THE 3-HYDROXYISOBUTYRATE DEHYDROGENASE FAMILY. Mb0793 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYH6" /db_xref="InterPro:IPR002204" /db_xref="InterPro:IPR006115" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR015815" /db_xref="InterPro:IPR029154" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH6" /protein_id="SIT99392.1" /translation="MTAHPETPRLGYIGLGNQGAPMAKRLLDWPGGLTVFDVRVEAMA PFVEGGATAAASVSDVAEADIISITVFDDAQVSSVITADNGLATHAKPGTIVAIHSTI ADTTAVDLAEKLKPQGIHIVDAPGSGGAAAAAKGELAVMVGADDEAFQRIKEPFSRWA SLLIHAGEPGAGTRMKLARNMLTFVSYAAAAEAQRLAEACGLDLVALGKVVRHSDSFT GGAGAIMFRNTTAPMEPADPLRPLLEHTRGLGEKDLSLALALGEVVSVDLPLAQLALQ RLAAGLGVPHPDTEPAKET" CDS 866329..866763 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0794" /product="POSSIBLE 4-CARBOXYMUCONOLACTONE DECARBOXYLASE (CMD)" /note="Mb0794, -, len: 144 aa. Equivalent to Rv0771, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Possible 4-carboxymuconolactone decarboxylase (EC 4.1.1.44), showing similarity with other carboxymuconolactone decarboxylases e.g. AAD39557.1|AF031417 PcaC-like protein from Pseudomonas putida (130 aa); P20370|DC4C_ACICA 4-CARBOXYMUCONOLACTONE DECARBOXYLASE (CMD) from Acinetobacter sp. ADP1 (134 aa), FASTA scores: opt: 174, E(): 0.00075, (31.4% identity in 121 aa overlap); C-terminus of NP_421214.1|NC_002696 3-oxoadipate enol-lactone hydrolase/4-carboxymuconolactone decarboxylase from Caulobacter crescentus (393 aa); C-terminus of T47115 probable 4-carboxymuconolactone decarboxylase / 3-oxoadipate enol-lactone hydrolase from Streptomyces sp (373 aa); NP_407104.1|NC_003143 putative gamma carboxymuconolactone decarboxylase from Yersinia pestis (131 aa); etc. Protein product from Mb0794 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0794 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXC9" /db_xref="InterPro:IPR003779" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC9" /protein_id="SIT99393.1" /translation="MMDELRRTGLDKMNEVYAWDMPDMPGEFFALTVDHLFGRIWTRP GLSMRDRRMAVIAVLTAQGQSDLLEVQVNAVLHNDELTIDELRELAVFITHYVGFPLG SRLNSAIERVAAKRKQAAENGSLPDTKANVAEVLAKESGKSS" CDS 866775..868043 /codon_start=1 /transl_table=11 /gene="purD" /locus_tag="BQ2027_MB0795" /product="PROBABLE PHOSPHORIBOSYLAMINE--GLYCINE LIGASE PURD (GARS) (GLYCINAMIDE RIBONUCLEOTIDE SYNTHETASE) (PHOSPHORIBOSYLGLYCINAMIDE SYNTHETASE) (5'-PHOSPHORIBOSYLGLYCINAMIDE SYNTHETASE)" /note="Mb0795, purD, len: 422 aa. Equivalent to Rv0772, len: 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 422 aa overlap). Probable purD, phosphoribosylamine--glycine ligase (EC 6.3.4.13), equivalent to Q50144|PURD|PUR2_MYCLE|ML2235|MLCB5.08 PHOSPHORIBOSYLAMINE--GLYCINE LIGASE from Mycobacterium leprae (422 aa), FASTA scores: opt: 2272, E(): 0, (81.8% identity in 422 aa overlap). Also highly similar to others e.g. CAB56348.1|AL118514 phosphoribosylamine-glycine ligase from Streptomyces coelicolor (416 aa); P1564|PUR2_ECOLI phosphoribosylamine--glycine ligase from Escherichia coli (429 aa), FASTA scores: opt: 1039, E(): 0, (42.7% identity in 431 aa overlap); etc. BELONGS TO THE GARS FAMILY. Protein product from Mb0795 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0795 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65894" /db_xref="InterPro:IPR000115" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR013815" /db_xref="InterPro:IPR016185" /db_xref="InterPro:IPR020559" /db_xref="InterPro:IPR020560" /db_xref="InterPro:IPR020561" /db_xref="InterPro:IPR020562" /db_xref="InterPro:IPR037123" /db_xref="UniProtKB/Swiss-Prot:P65894" /protein_id="SIT99394.1" /translation="MRVLVIGSGAREHALLLALGKDPQVSGLIVAPGNAGTARIAEQH DVDITSAEAVVALAREVGADMVVIGPEVPLVLGVADAVRAAGIVCFGPGKDAARIEGS KAFAKDVMAAAGVRTANSEIVDSPAHLDAALDRFGPPAGDPAWVVKDDRLAAGKGVVV TADRDVARAHGAALLEAGHPVLLESYLDGPEVSLFCVVDRTVVVPLLPAQDFKRVGED DTGLNTGGMGAYAPLPWLPDNIYREVVSRIVEPVAAELVRRGSSFCGLLYVGLAITAR GPAVVEFNCRFGDPETQAVLALLESPLGQLLHAAATGKLADFGELRWRDGVAVTVVLA AENYPGRPRVGDVVVGSEAEGVLHAGTTRRDDGAIVSSGGRVLSVVGTGADLSAARAH AYEILSSIRLPGGHFRSDIGLRAAEGKISV" CDS complement(868040..869578) /codon_start=1 /transl_table=11 /gene="ggtA" /locus_tag="BQ2027_MB0796C" /product="PROBABLE BIFUNCTIONAL ACYLASE GGTA: CEPHALOSPORIN ACYLASE (GL-7ACA ACYLASE) + GAMMA-GLUTAMYLTRANSPEPTIDASE (GGT)" /note="Mb0796c, ggtA, len: 512 aa. Equivalent to Rv0773c, len: 512 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 512 aa overlap). Probable ggtA, bifunctional acylase including cephalosporin acylase (EC 3.5.1.-), and gamma-glutamyl transpeptidase (EC 2.3.2.2); highly similar to others e.g. NP_295247.1|NC_001263 cephalosporin acylase from Deinococcus radiodurans (535 aa); NP_248854.1|NC_002516 probable gamma-glutamyltranspeptidase from Pseudomonas aeruginosa (538 aa); P15557|PAC1_PSES3 ACYLASE ACY 1 [INCLUDES: CEPHALOSPORIN ACYLASE (GL-7ACA ACYLASE); GAMMA-GLUTAMYLTRANSPEPTIDASE (GGT)] from Pseudomonas sp. strain SE83 (558 aa), FASTA scores: opt: 784, E(): 0, (34.2% identity in 526 aa overlap); NP_391491.1|NC_000964|Z93767|BSZ93767_6|O0521 protein similar to gamma-glutamyltransferase from Bacillus subtilis (525 aa), FASTA scores: opt: 1169, E(): 0, (40.1% identity in 516 aa overlap); etc. Also similar to Rv2394|ggtB from Mycobacterium tuberculosis. Member of GL-7ACA ACYLASES AND TO GGT group. Protein product from Mb0796c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0796c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWS1" /db_xref="InterPro:IPR000101" /db_xref="InterPro:IPR029055" /db_xref="UniProtKB/TrEMBL:A0A1R3XWS1" /protein_id="SIT99395.1" /translation="MPILATNVVCTSQPLAAQAGLRMLADGGNAVDAAVATAITLTVV EPVSNGIGSDAFSIVWDGQKLHGLNASGRSPSAWTPEYFGGNAVPVLGWNSVTVPGAV SAWVELHARFGRLPFETLFEPAISYGRNGFLVSPTVAAQWAAQVPLFASQPGFADAFM PGGRAPKPGELFTFPDHAATLEKIAATNGEEFYRGELAAKLEAHSAANGGVMRADDLA AHRVDWVDTITGTYRGYTIHQIPPNGQGIVALIALGILEHFDMSSWSVDSAESVHVQI EALKLAFADAQACVADIDYMPVHPKRLLDKEYLRQRATLIDPKRAMPAATGIPRGGTV YLAAADAAGMMVSMIQSNYLGFGSGVVVPGTGISLHNRGSDFTVVPRHPNRVGPRKRP YHTIIPGFVTRDGAPVMSFGVMGGMMQPQGHVQVLVRIADYGQNPQAACDGPRFRWVN GMRVSFENGFPDSTLDELRQRGHDLVAVADYSQFGSCQAIWRLDDGYLAASDPRRDGQ AAAC" CDS complement(869629..870540) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0797C" /product="PROBABLE CONSERVED EXPORTED PROTEIN" /note="Mb0797c, -, len: 303 aa. Equivalent to Rv0774c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 303 aa overlap). Possible conserved exported protein with hydrophobic region near N-terminus, highly similar, except in N-terminus, to Rv0519c|Z97831|MTY20G10.09c|O33364 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (300 aa), FASTA scores: opt: 1092, E(): 0, (57.9% identity in 299 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature, and PS00120 Lipases, serine active site. So could be a lipase (EC 3.1.-.-). Protein product from Mb0797c detected using SWATH mass spectrometry. Mb0797c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD7" /protein_id="SIT99396.1" /translation="MMARMPELSRRAVLGLGAGTVLGATSAYAIDMLLQPRTSHAAPA AAIGTNVPLAPTPALDPAPPAQAAPTMSTGSFVSAARAGKMTNWAIARPPGQTQALRP VIALHGLGGSASAVMDGGVEQGLAQAVNAGLPPFAVVSVAGGSSYWHQRASGEDAGAM VLNELIPLLDTQRLDTSRVAFLGWSMGGYGALLLGSRLGPARTAAICAVSPALWLSAG AVAPGSFDGPDDWSANSVFGLPALGSIPIRVDCGNSDPFYAATKQFVAQLPHPPAGGF SPGGHNGGFWSAQLPAELTWFAPLLTG" CDS 870596..871219 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0798" /product="Transcriptional regulator, AcrR family" /note="Mb0798, -, len: 207 aa. Equivalent to Rv0775, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Conserved hypothetical protein, showing some similarity to other proteins e.g. ECAE000186_11|MG1655 HYPOTHETICAL PROTEIN from Escherichia coli strain K-12 (178 aa), FASTA scores: E(): 6.4e-05, (27.2% identity in 147 aa overlap); P41037|BIH_ECOLI hypothetical transcriptional regulator from Escherichia coli (103 aa), FASTA scores: opt: 138, E(): 0.003, (30.9% identity in 97 aa overlap); etc. Protein product from Mb0798 detected using SWATH mass spectrometry. Mb0798 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWE8" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR041583" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE8" /protein_id="SIT99397.1" /translation="MGVTAAVTPKGERRRYALVSAAAELLGEGGFEAVRHRAVARRAG LPLASTTYYFSSLDDLIARAVEHIGMIEVAQLRARVSALSRRRRGPETTAVVLVDLLV GEMSSPGLAEQLISRYERHIACTRLPDLRESMRRSLRQRAEAVAEAIERSGRSAQIEL VCTLICAVDGSVVSALVEGRDPRAAALATVVDLIDVLAPVDQRPVPF" CDS complement(871173..871952) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0799C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0799c, -, len: 259 aa. Equivalent to Rv0776c, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Conserved hypothetical protein, similar (except first 50 aa) to P72737|D90900_57 hypothetical protein from Synechocystis sp. strain PCC 6803 (261 aa), FASTA scores: opt: 337, E(): 1.7e-15, (30.5% identity in 266 aa overlap). Mb0799c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007362" /db_xref="InterPro:IPR008306" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD9" /protein_id="SIT99398.1" /translation="MYFVGVDLAWAGRNPTGVAAVDADGCLVGVGAARDDASVLAALR PYVVGDCLVAFDAPLVVANRTGQRPAEAALNRDFRQFEAGAYPANTEKPEFADVPRAA RLARQLALDMDPLSSATRRAIEVYPHPATVALFRLPRALKYKAKPGRSVDLLKSELLR LMDGVEGLAQAGVRMQVAGQPDWVSLRRQVTVAQRKSDLRAAEDPIDAVVCAYVALYA QRRPADVTIYGDFTTGYIVTPSLPTDFRTAPDAGRRARARR" CDS 872197..873615 /codon_start=1 /transl_table=11 /gene="purB" /locus_tag="BQ2027_MB0800" /product="PROBABLE ADENYLOSUCCINATE LYASE PURB (ADENYLOSUCCINASE) (ASL) (ASASE)" /note="Mb0800, purB, len: 472 aa. Equivalent to Rv0777, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 472 aa overlap). Probable purB, adenylosuccinate lyase (EC 4.3.2.2), equivalent (but shorter 15 aa) to MLCB5.13|Z95151|g2076607|PURB ADENYLOSUCCINATE LYASE from Mycobacterium leprae (487 aa), FASTA scores: opt: 2640, E(): 0, (86.7% identity in 472 aa overlap). More similar to eukaryotic adenylosuccinate lyases than to prokaryotic adenylosuccinate lyases e.g. P54822|PUR8_MOUSE ADENYLOSUCCINATE LYASE from Mus musculus (484 aa), FASTA scores: opt: 762, E(): 0, (32.4% identity in 445 aa overlap); CAB99134.1|AL390188 putative adenylosuccino lyase (fragment) from Streptomyces coelicolor (362 aa); etc. Contains PS00163 Fumarate lyases signature. BELONGS TO THE LYASE 1 FAMILY, ADENYLOSSUCINATE LYASE SUBFAMILY. Protein product from Mb0800 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0800 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWE6" /db_xref="InterPro:IPR000362" /db_xref="InterPro:IPR004769" /db_xref="InterPro:IPR008948" /db_xref="InterPro:IPR019468" /db_xref="InterPro:IPR020557" /db_xref="InterPro:IPR022761" /db_xref="UniProtKB/TrEMBL:A0A1R3XWE6" /protein_id="SIT99399.1" /translation="MSIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELG VAVADSVLADYERVVDDVDLASISARERVLRHDVKARIEEFNALAGHEHVHKGMTSRD LTENVEQLQIRRSLEVIFAHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRF ASAAQEMMIALRRLRELIDRYPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFL GFATVFNSVGQVYPRSLDHDVVSALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVG SSAMPHKMNTRSCERVNGLQVVLRGYASMVAELAGAQWNEGDVFCSVVRRVALPDSFF AVDGQIETFLTVLDEFGAYPAVIGRELDRYLPFLATTKVLMAAVRAGMGRESAHRLIS EHAVATALAMREHGAEPDLLDRLAADPRLPLGRDALEAALADKKAFAGAAGDQVDDVV AMVDALVSRYPDAAKYTPGAIL" CDS 873620..874864 /codon_start=1 /transl_table=11 /gene="cyp126" /locus_tag="BQ2027_MB0801" /product="POSSIBLE CYTOCHROME P450 126 CYP126" /note="Mb0801, cyp126, len: 414 aa. Equivalent to Rv0778, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 414 aa overlap). Possible cyp126, cytochrome P-450 (EC 1.14.-.-), similar to other cytochromes and related proteins e.g. AAG29781.1|AF235050_4|AF235050 cytochrome P-450 from Streptomyces rishiriensis (407 aa); Q59723|PSECYTOCHR_1 cytochrome p-450 linalool 8-monooxygenase (EC 1.14.99.28) (lin C) from Pseudomonas incognita (406 aa), FASTA scores: opt: 769, E(): 0, (37.0% identity in 411 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0766c, Rv2266, Rv3545c, etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. Protein product from Mb0801 detected using SWATH mass spectrometry. Mb0801 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63712" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63712" /protein_id="SIT99400.1" /translation="MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEH TPDGEGFWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDDPR HTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIAAELPMQMICI LLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGSRLYTYALELIAGKRAEPA DDMLSVVANATIDDPDAPALSDAELYLFFHLLFSAGAETTRNSIAGGLLALAENPDQL QTLRSDFELLPTAIEEIVRWTSPSPSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRD PSVFDRADEFDITRKPNPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEP AEWTRSNRHTGIRHLVVELRGG" CDS complement(874861..875481) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0802C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0802c, -, len: 206 aa. Equivalent to Rv0779c, len: 206 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 206 aa overlap). Possible conserved transmembrane protein, equivalent to Z95151|MLCB5_14 O05747 conserved hypothetical protein from Mycobacterium leprae (206 aa), FASTA scores: opt: 902, E(): 0, (67.2% identity in 204 aa overlap). Protein product from Mb0802c detected using SWATH mass spectrometry. Mb0802c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWD3" /db_xref="UniProtKB/TrEMBL:A0A1R3XWD3" /protein_id="SIT99401.1" /translation="MRSRFLPYATTPGRLLAQLISDITVAVWTTLWMLVGLAVHDAIS IIGEAGRQIEIGSHGIAGNLAAAGQDAQRIPVVGDALSNPITAASQAALDIAGAGHNL DTTAGWLAVVLALAVAATPILAVAMPWLFLRLRFCRRKWTVTTLAATPAGRQLLALRA LANRPPGKLAAVSTDPVGAWRREDPATMRALAALELRAAGIPLRGD" CDS 875532..876425 /codon_start=1 /transl_table=11 /gene="purC" /locus_tag="BQ2027_MB0803" /product="PHOSPHORIBOSYLAMINOIMIDAZOLE-SUCCINOCARBOXAMIDE SYNTHASE PURC (SAICAR SYNTHETASE)" /note="Mb0803, purC, len: 297 aa. Equivalent to Rv0780, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 297 aa overlap). purC, phosphoribosylaminoimidazole- succinocarboxamide synthase (EC 6.3.2.6) (see citations below), equivalent to MTU34957_1|PURC phosphoribosylaminoimidazole- succinocarboxamide synthase from Mycobacterium leprae (297 aa), FASTA scores: opt: 1986, E(): 0, (99.3% identity in 297 aa overlap). Also similar to others e.g. CAB56351.1|AL118514 phosphoribosylaminoimidazole-succinocarboxamide synthase from Streptomyces coelicolor (299 aa); etc. Contains PS01058 SAICAR synthetase signature 2. BELONGS TO THE SAICAR SYNTHETASE FAMILY. Protein product from Mb0803 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0803 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5T5" /db_xref="InterPro:IPR001636" /db_xref="InterPro:IPR018236" /db_xref="InterPro:IPR028923" /db_xref="UniProtKB/Swiss-Prot:P0A5T5" /protein_id="SIT99402.1" /translation="MRPALSDYQHVASGKVREIYRVDDEHLLLVASDRISAYDYVLDS TIPDKGRVLTAMSAFFFGLVDAPNHLAGPPDDPRIPDEVLGRALVVRRLEMLPVECVA RGYLTGSGLLDYQATGKVCGIALPPGLVEASRFATPLFTPATKAALGDHDENISFDRV VEMVGALRANQLRDRTLQTYVQAADHALTRGIIIADTKFEFGIDRHGNLLLADEIFTP DSSRYWPADDYRAGVVQTSFDKQFVRSWLTGSESGWDRGSDRPPPPLPEHIVEATRAR YINAYERISELKFDDWIGPGA" CDS 876422..878581 /codon_start=1 /transl_table=11 /gene="ptrB" /locus_tag="BQ2027_MB0804" /product="probable protease ii ptrbb [second part] (oligopeptidase b)" /note="Mb0804, ptrB, len: 719 aa. Equivalent to Rv0781 and Rv0782, len: 236 aa and 552 aa, from Mycobacterium tuberculosis strain H37Rv, (97.6% identity in 206 aa overlap and 100.0% identity in 517 aa overlap). Probable ptrB, protease II (EC 3.4.21.83), equivalent to NP_302455.1|NC_002677 protease II from Mycobacterium leprae (724 aa). Also highly similar to C-termini of many proteases II e.g. P24555|PTRB_ECOLI|TLP|B1845 protease II from Escherichia coli strains K12 and HB101 (707 aa), FASTA scores: opt: 204, E(): 7.4e-07, (29.6% identity in 230 aa overlap); etc. Also highly similar to N-termini of many proteases II e.g. P24555|PTRB_ECOLI|TLP|B1845 protease II from Escherichia coli strains K12 and HB101 (707 aa), FASTA scores: opt: 1251, E(): 0, (42.7% identity in 489 aa overlap); etc. ORFs Rv0782 and Rv0781 appear to be a frameshifted homologues of protease II, but we can find no error in the cosmid sequence to account for this. BELONGS TO PEPTIDASE FAMILY S9A; ALSO KNOWN AS THE PROLYL OLIGOPEPTIDASE FAMILY. Note that previously known as ptrBb. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, ptrB is split into 2 genes, ptrBa and ptrBb, due to a frameshift. In Mycobacterium bovis, a 2 bp insertion (*-gc) leads to a single product. Protein product from Mb0804 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0804 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXD9" /db_xref="InterPro:IPR001375" /db_xref="InterPro:IPR002470" /db_xref="InterPro:IPR023302" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXD9" /protein_id="SIT99403.1" /translation="MMHRTALPSPPVAKRVQTRREHHGDVFVDPYEWLRDKDSPEVIA YLEAENDYTERTTAHLEPLRQKIFHEIKARTKETDLSVPTRRGNWWYYARTFEGKQYG VHCRCPVTDPDDWNPPEFDERTEIPGEQLLLDENVEADGHDFFALGAASVSLDDNLLA YSVDVVGDERYTLRFKDLRTGEQYPDEIAGIGAGVTWAADNRTVYYTTVDAAWRPDTV WRYRLGSGESSERVYHEADDRFWLAVGRTRSNAYLLIAAGSSITSEVRYAHAADPTAQ FSVVLPRRDGVEYSVEHAVIAGQDRFLILHNDGAVNFTLVEAPVEDPARQRTLIAHRD DVRLDAVDALAGHLVVSYRREALPRVQLWPIGPDGNYGEPEEISFDSELMSAGLGPNP NWDSPKLRVGAGSFVTPVRIYDIDLVTGERTLLKEQPVLGGYRREDYVERRDWAYGDD GTRIPVSIVHRADIEFPAPALIYGYGAYEICEDPRFSIARLSLLDRGMVFVVAHVRGG GEMGRLWYENGKLLDKKNTFTDFIAVARHLVDTGLTSQQQLVALGGSAGGLLMGAVAN MAPDLFAGILAQVPFVDPLTTILDPSLPLTVTEWDEWGNPLNDSDVYAYVKSYSPYEN VTAQKYPAILAMTSLNDTRVYYVEPAKWVAALRHAKTDGNSVLLKTQMHAGHGGISGR YERWKETAFQYGWLLATADSDRYGGGQGNDLDGAAPA" CDS complement(879009..880631) /codon_start=1 /transl_table=11 /gene="emrB" /locus_tag="BQ2027_MB0805C" /product="POSSIBLE MULTIDRUG RESISTANCE INTEGRAL MEMBRANE EFFLUX PROTEIN EMRB" /note="Mb0805c, emrB, len: 540 aa. Equivalent to Rv0783c, len: 540 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 540 aa overlap). Possible emrB, integral membrane drug efflux protein, member of major facilitator superfamily (MFS), equivalent to AAL16083.1|AF421382_1|AF421382 EmrB efflux protein from Mycobacterium avium (538 aa). Also similar to other membrane proteins e.g. CAB61606.1|AL133210 putative export protein from Streptomyces coelicolor (496 aa); NP_108371.1|NC_002678 efflux pump protein FarB from Mesorhizobium loti (511 aa); P44927|EMRB_HAEINHI0897| multidrug resistance protein b homologue from Haemophilus influenzae (510 aa), FASTA scores: opt: 706, E(): 1.3e-36, (30.4% identity in 408 aa overlap); etc. Also similar to Rv2333c|MTCY3G12.01 from Mycobacterium tuberculosis (537 aa), FASTA score: (28.2% identity in 408 aa overlap); and Rv1410c|MTCY21B4.27c from Mycobacterium tuberculosis (518 aa), FASTA score: (26.8% identity in 496 aa overlap). BELONGS TO THE MAJOR FACILITATOR FAMILY; ALSO KNOWN AS THE DRUG RESISTANCE TRANSLOCASE FAMILY. Mb0805c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG7" /db_xref="InterPro:IPR001411" /db_xref="InterPro:IPR004638" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG7" /protein_id="SIT99404.1" /translation="MLGNAMVEACPAEGDAPVPITPAGRPRSGQRSYPDRLDVGLLRT AGVCVLASVMAHVDVTVVSVAQRTFVADFGSTQAVVAWTMTGYMLALATVIPTAGWAA DRFGTRRLFMGSVLAFTLGSLLCAVAPNILLLIIFRVVQGFGGGMLTPVSFAILAREA GPKRLGRVMAVVGIPMLLGPVGGPILGGWLIGAYGWRWIFLVNLPVGLSALVLAAIVF PRDRPAASENFDYMGLLLLSPGLATFLFGVSSSPARGTMADRHVLIPAITGLALIAAF VAHSWYRTEHPLIDMRLFQNRAVAQANMTMTVLSLGLFGSFLLLPSYLQQVLHQSPMQ SGVHIIPQGLGAMLAMPIAGAMMDRRGPAKIVLVGIMLIAAGLGTFAFGVARQADYLP ILPTGLAIMGMGMGCSMMPLSGAAVQTLAPHQIARGSTLISVNQQVGGSIGTALMSVL LTYQFNHSEIIATAKKVALTPESGAGRGAAVDPSSLPRQTNFAAQLLHDLSHAYAVVF VIATALVVSTLIPAAFLPKQQASHRRAPLLSA" CDS 880829..881515 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0806" /product="Predicted deacetylase" /note="Mb0806, -, len: 228 aa. Equivalent to Rv0784, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). Conserved hypothetical protein, with some similarity to MLCB5_20|O05752 hypothetical protein from Mycobacterium leprae (193 aa), FASTA scores: opt: 141, E(): 0.0022, (36.0% identity in 114 aa overlap). Also similar to N-terminus of NP_253002.1|NC_002516 conserved hypothetical protein from Pseudomonas aeruginosa (253 aa). Mb0806 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWT5" /db_xref="InterPro:IPR011330" /db_xref="InterPro:IPR018763" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT5" /protein_id="SIT99405.1" /translation="MSVSGIGESTLADVDAFCAEMDARSVPVSLLVAPRMRDDYRLDR DPRTVDWLTGRRAAGDALVLHGYDEAATKRRRGEFAMLRAHEANLRLMAADRVLEHLG LRTRLFAAPGWLVSPGVRTALPANGFRLLADLHGITDLVRLTTVRARVLGIGEGFLAE PWWCRMVVMSAERIARRGGVVRIAVAARHLRKSGPLQAMLDAVDLAMLQGCTPMVYRW RADAAVLDAA" CDS 881531..882106 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0807" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb0807, -, len: 191 aa. Equivalent to 5' end of Rv0785, len: 566 aa, from Mycobacterium tuberculosis strain H37Rv, (97.1% identity in 140 aa overlap). Conserved hypothetical protein, highly similar to other conserved hypothetical proteins e.g. NP_105777.1| NC_002678 hypothetical protein from Mesorhizobium loti (552 aa); SC5F8.14|CAB93742.1|AL357613 conserved hypothetical protein from Streptomyces coelicolor (557 aa); AE001863|AE001863_31 from Deinococcus radiodurans (554 aa), FASTA scores: opt: 2243, E(): 0, (61.1% identity in 550 aa overlap); YEF7_YEAST|P32614 hypothetical 50.8 kd protein (470 aa), FASTA scores: opt: 169, E(): 0.0014, (23.8% identity in 542 aa overlap); etc. Also similar to Rv1817|MTCY1A11.26c from Mycobacterium tuberculosis (487 aa), FASTA score: (26.7% identity in 587 aa overlap). And shows similarity with other dehydrogenases. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0785 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Mb0807 into 2 parts, Mb0807 and Mb0808. Mb0807 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWF0" /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR014614" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWF0" /protein_id="SIT99406.1" /translation="MALTCTDMSDAVAGSDAEGLTADAIVVGAGLAGLVAACELADRG LRVLILDQENRANVGGQAFWSFGGLFLVNSPEQRRLGIRDSHELALQDWLGTAAFDRP EDYWPEQWAHAYVDFAAGEKRSWLRARGLKIFRWWAGPSVVVTTRRGTATRCPVSTSP GVLGRLWSTYSCVSCVIAPRCALRTATRSTN" CDS 882124..883230 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0808" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb0808, -, len: 368 aa. Equivalent to 3' end of Rv0785, len: 566 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 368 aa overlap). Conserved hypothetical protein, highly similar to other conserved hypothetical proteins e.g. NP_105777.1| NC_002678 hypothetical protein from Mesorhizobium loti (552 aa); SC5F8.14|CAB93742.1|AL357613 conserved hypothetical protein from Streptomyces coelicolor (557 aa); AE001863|AE001863_31 from Deinococcus radiodurans (554 aa), FASTA scores: opt: 2243, E(): 0, (61.1% identity in 550 aa overlap); YEF7_YEAST|P32614 hypothetical 50.8 kd protein (470 aa), FASTA scores: opt: 169, E(): 0.0014, (23.8% identity in 542 aa overlap); etc. Also similar to Rv1817|MTCY1A11.26c from Mycobacterium tuberculosis (487 aa), FASTA score: (26.7% identity in 587 aa overlap). And shows similarity with other dehydrogenases. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0785 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Mb0807 into 2 parts, Mb0807 and Mb0808. Protein product from Mb0808 detected using SWATH mass spectrometry. Mb0808 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWF6" /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR014614" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWF6" /protein_id="SIT99407.1" /translation="MTGVRGTVLEPSDEPRGAPSSRKSVGKFEFRASAVIVASGGIGG NHELVRKNWPRRMGRIPKQLLSGVPAHVDGRMIGIAQKAGAAVINPDRMWHYTEGITN YDPIWPRHGIRIIPGPSSLWLDAAGKRLPVPLFPGFDTLGTLEYITKSGHDYTWFVLN AKIIEKEFALSGQEQNPDLTGRRLGQLLRSRAHAGPPGPVQAFIDRGVDFVHANSLRE LVAAMNELPDVVPLDYETVAAAVTARDREVVNKYSKDGQITAIRAARRYRGDRFGRVV APHRLTDPKAGPLIAVKLHILTRKTLGGIETDLDARVLKADGTPLAGLYAAGEVAGFG GGGVHGYRALEGTFLGGCIFSGRAAGRGAAEDIR" CDS complement(883265..883654) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0809C" /product="Predicted Zn-dependent hydrolases of the beta-lactamase fold" /note="Mb0809c, -, len: 129 aa. Equivalent to Rv0786c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, similar to three other hypothetical proteins from Streptomyces coelicolor e.g. SC7H1.08c|T35703 hypothetical protein (202 aa), FASTA scores: opt: 241, E(): 5.1e-10, (41.0% identity in 105 aa overlap); SC3A7.08|T29426 (211 aa). Protein product from Mb0809c detected using SWATH mass spectrometry. Mb0809c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3XWF3" /protein_id="SIT99408.1" /translation="MHVGDELPLAELTVRAVGGCHAVIHPEIPVIENISYLVGDSKHR ARLMHPGDALFVPGEQVDVLATPAAAPWMKISEAVDYLRAVAPARAVPIHQAIVAPDA RGIYYGRLTEMTTTDFQVLPEESAVTF" CDS 883649..884608 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0810" /product="unknown protein" /note="Mb0810, -, len: 319 aa. Equivalent to Rv0787, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 319 aa overlap). Hypothetical unknown protein, equivalent to AAK45053.1 from Mycobacterium tuberculosis strain CDC1551 (242 aa) but longer 77 aa." /db_xref="UniProtKB/TrEMBL:A0A1R3XWF9" /protein_id="SIT99409.1" /translation="MHRPPWLAQLRRRLRIGVQLGSRVVLEQGRQPRDVYVIGVLVGD QDRGQTGDSLEAVRESTGIEEQAGLTELSEEAGMAEMRELHVYDCALMGAFPMRLILA TMLVAGRLLATLMAAPSAQAEPETCPPICDQIPATAWISTHAVPLNSQYRWPAMAGAA VAVTRATPRFGFEQVCATPAFPHDSRDWAVAGRVTVVHPDGQWQLQAQVLHWRGDTAR GGQIAASVFGTAVAALRACQLGAPLQSPSVTDDEPTRMAAVISGPVIMHTYLVAHVSS STISELTLWSSGPPQVPWPTVADSAVLDALTAPLCEAYIGSCP" CDS 884714..884953 /codon_start=1 /transl_table=11 /gene="purS" /locus_tag="BQ2027_MB0811" /product="Phosphoribosylformylglycinamidine synthase, PurS subunit (EC" /EC_number="6.3.5.3" /note="Mb0811, -, len: 79 aa. Equivalent to Rv0787A, len: 79 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 78 aa overlap). Conserved hypothetical protein, equivalent to MLCB5.24 HYPOTHETICAL PROTEIN from Mycobacterium leprae (79 aa), FASTA scores: opt: 434, (84.8% identity in 79 aa overlap). Also similar to P12049|YEXA_BACSU HYPOTHETICAL 9.7 KD PROTEIN from Bacillus subtilis (84 aa), FASTA scores: opt: 172, E(): 4e-06, (44.4% identity in 72 aa overlap). BELONGS TO THE UPF0062 FAMILY. Protein product from Mb0811 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0811 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG2" /db_xref="InterPro:IPR003850" /db_xref="InterPro:IPR036604" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99410.1" /translation="MARVVVHVMPKAEILDPQGQAIVGALGRLGHLGISDVRQGKRFE LEVDDTVDDTTLAEIAESLLANTVIEDWTISRDPQ" CDS 884950..885624 /codon_start=1 /transl_table=11 /gene="purQ" /locus_tag="BQ2027_MB0812" /product="PROBABLE PHOSPHORIBOSYLFORMYLGLYCINAMIDINE SYNTHASE I PURG (FGAM SYNTHASE I)" /note="Mb0812, purQ, len: 224 aa. Equivalent to Rv0788, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Probable purQ, phosphoribosylformylglycinamidine synthase I (EC 6.3.5.3), equivalent to MLCB5_24|Z95151|O05756|PURQ PHOSPHORIBOSYLFORMYLGLYCINAMIDINE SYNTHASE I from Mycobacterium leprae (224 aa), FASTA scores: opt: 1341, E(): 0, (88.7% identity in 222 aa overlap). Also highly similar to others e.g. P12041|PURQ_BACSU PHOSPHORIBOSYLFORMYLGLYCINAMIDINE SYNTHASE I from Bacillus subtilis (227 aa), FASTA scores: opt: 691, E(): 8.6e-39, (47.7% identity in 214 aa overlap); etc. Contains PS00442 Glutamine amidotransferases class-I active site. BELONGS TO TYPE-1 GLUTAMINE AMIDOTRANSFERASES. Protein product from Mb0812 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0812 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65903" /db_xref="InterPro:IPR010075" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/Swiss-Prot:P65903" /protein_id="SIT99411.1" /translation="MTARIGVVTFPGTLDDVDAARAARQVGAEVVSLWHADADLKGVD AVVVPGGFSYGDYLRAGAIARFAPVMDEVVAAADRGMPVLGICNGFQVLCEAGLLPGA LTRNVGLHFICRDVWLRVASTSTAWTSRFEPDADLLVPLKSGEGRYVAPEKVLDELEG EGRVVFRYHDNVNGSLRDIAGICSANGRVVGLMPHPEHAIEALTGPSDDGLGLFYSAL DAVLTG" CDS complement(885641..886240) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0813C" /product="HYPOTHETICAL PROTEIN" /note="Mb0813c, -, len: 199 aa. Equivalent to Rv0789c, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Hypothetical unknown protein. Protein product from Mb0813c detected using shotgun mass spectrometry. Mb0813c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99412.1" /translation="MSRRAIHSGRAAPRRSGNSHLVLRNRVPSSKDSPRRRPHHEFMT ESIGEPLSTNLIERYLRARGRRYFRGHHDAEFFFVANAHLRLHVHLEISPAYRDVFTI RVSPAYFFPATDHTRLAEIVNAWNLQNHEVTAIVHGSSDPHRIGVAAERSLIRDRIRF DDFATFVDNAVSAATELFGQLTAAGLPPTATPPLLRDAG" CDS complement(886262..886990) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0814C" /product="Transglutaminase-like enzymes, putative cysteine proteases" /note="Mb0814c, -, len: 242 aa. Equivalent to Rv0790c, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 242 aa overlap). Hypothetical unknown protein. Protein product from Mb0814c detected using SWATH mass spectrometry. Mb0814c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002931" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE8" /protein_id="SIT99413.1" /translation="MTLANNGTGMDHFLTPTEYLDAGHPLVRTTAATLIRDAVSDTER VRRIYYYVRDVPYDVLASFRYLAQGHHRASDVIGHGVAFCMGKASSFVALCRAAGVPA RIAFQTIDAPDKEFLSPQVRALWGGRTGRPFPWHSLGEAYLGRRWVKLDATIDAPTAA RLGKPYRQEFDGATPIPTVEGTILRENGSYADYPSAVAQWYERIAQSVLKALQSTEVH ALVAADEELWTGPPVELADATHRL" CDS complement(886987..888030) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0815C" /product="Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases" /note="Mb0815c, -, len: 347 aa. Equivalent to Rv0791c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Conserved hypothetical protein, similar (except in N-terminus) to others e.g. CAC44585.1|AL596162 conserved hypothetical protein from Streptomyces coelicolor (307 aa); NP_252643.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (364 aa); etc. Also some similarity with oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); etc. And also similar in part to other proteins from Mycobacterium tuberculosis e.g. Rv1855c|MTCY359.18|Z83859 (307 aa), FASTA scores: opt: 366, E(): 4e-16, (35.0% identity in 226 aa overlap); Rv3079c|MTCY22D7.02|Z83866 CONSERVED HYPOTHETICAL PROTEIN (275 aa), FASTA scores: opt: 342, E(): 1.2e-14, (31.6% identity in 234 aa overlap); Rv0044c POSSIBLE OXIDOREDUCTASE (264 aa). Protein product from Mb0815c detected using SWATH mass spectrometry. Mb0815c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWI0" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019921" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3XWI0" /protein_id="SIT99414.1" /translation="MNAKDDPHFGLMLAATVNGLAVGSYREMVVVSQTAEEYGFDSVW LCDHFLTISPGEYAKVAGIAADTGSATGTETGGAGQCAPSRSLPLLECWTALAALSRD TTKLRLGTSVLCNSYRHPSVLAKMAATLDVISQGRLDLGLGAGWFRRESQAYGIPFPP VGDRVSALAESLQVIKAVWTEPNPTYAGRFYTLDGATCDPPPVQRPHPPLWIGGEGDR VQRIAAKHAQGLNVRWWSPQQVTQRRGFLTQASEAAGRDPDTLRLSVTLLLAPTQSGE EEVRIREEFASIPEPGLIVGTPDRCVERIREYQDRGVGHFLFTIPHVVKSDYLHIIGS DIIPRVKTEVTIP" CDS complement(888027..888836) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0816C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY GNTR-FAMILY)" /note="Mb0816c, -, len: 269 aa. Equivalent to Rv0792c, len: 269 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 269 aa overlap). Probable transcriptional regulator, GntR-family, similar to many others of GntR family e.g. BSUB0018_189|Z99121 from Bacillus subtilis (243 aa), FASTA scores: opt: 367, E(): 1.5e-17, (32.1% identity in 246 aa overlap); P31453|YIDP_ECOLI from Escherichia coli (238 aa), FASTA scores: opt: 236, E(): 8.8e-09, (26.4% identity in 235 aa overlap); etc. Protein product from Mb0816c detected using SWATH mass spectrometry. Mb0816c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWU6" /db_xref="InterPro:IPR000524" /db_xref="InterPro:IPR011663" /db_xref="InterPro:IPR028978" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU6" /protein_id="SIT99415.1" /translation="MTSVKLDLDAADLRISRGSVPASTQLAEALKAQIIQQRLPRGGR LPSERELIDRSGLSRVTVRAAVGMLQRQGWLVRRQGLGTFVADPVEQELSCGVRTITE VLLSCGVTPQVDVLSHQTGPAPQRISETLGLVEVLCIRRRIRTGDQPLALVTAYLPPG VGPAVEPLLSGSADTETTYAMWERRLGVRIAQATHEIHAAGASPDVADALGLAVGSPV LVVDRTSYTNDGKPLEVVVFHHRPERYQFSVTLPRTLPGSGAGIIEKRDFA" CDS 888909..889214 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0817" /product="possible monooxygenase" /note="Mb0817, -, len: 101 aa. Equivalent to Rv0793, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Conserved hypothetical protein, similar to others e.g. NP_250888.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (114 aa); AE 001908|AE001908_7 hypothetical protein from Deinococcus radiodurans (101 aa), FASTA scores: opt: 215, E(): 3.1e-09, (40.4% identity in 99 aa overlap); NP_440966.1|NC_000911|D90908|PCC6803|D9 0908_2 unknown protein from Synechocystis sp. strain PCC 6803 (147 aa), FASTA scores: opt: 194, E(): 4.5e-08, (31.1% identity in 90 aa overlap); etc. Also similar to Rv2749|MTV002.14|AL0089|MTV002_15 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (104 aa), FASTA scores: opt: 143, E(): 0.00026, (26.9% identity in 93 aa overlap)." /db_xref="GOA:A0A1R3XWG1" /db_xref="InterPro:IPR007138" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG1" /protein_id="SIT99416.1" /translation="MTSPVAVIARFMPRPDARSALRALLDAMITPTRAEDGCRSYDLY ESADGGELVLFERYRSRIALDEHRGSPHYLNYRAQVGELLTRPVAVTVLAPLDEASA" CDS complement(889327..889923) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0818C" /product="PROBABLE OXIDOREDUCTASE [SECOND PART]" /note="Mb0818c, -, len: 198 aa. Equivalent to 3' end of Rv0794c, len: 499 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Probable oxidoreductase (EC 1.-.-.-), possibly dihydrolipoamide dehydrogenase (EC 1.8.1.4) or mercuric reductase (EC 1.16.1.1). Highly similar to CAB62675.1|AL133422 probable oxidoreductase from Streptomyces coelicolor (477 aa); and similar to various oxidoreductases e.g. P08663|MERA_STAAU MERCURIC REDUCTASE (HG(II) REDUCTASE) (EC 1.16.1.1) from Staphylococcus aureus (547 aa); AAK70920.1|AC087551_19|AC087551 putative lipoamide dehydrogenase from Oryza sativa (563 aa); NP_437349.1|NC_003078 putative FAD-dependent pyridine nucleotide-disulphide oxidoreductase, similar to mercuric reductases protein from Sinorhizobium meliloti (473 aa); Q04829|DLDH_HALVO DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Haloferax volcanii (475 aa); P08332|MERA_SHIFL MERCURIC REDUCTASE (EC 1.16.1.1) (564 aa), FASTA scores: opt: 522, E(): 3.7e-26, (31.7% identity in 467 aa overlap); P72740|DLDH_SYNY3|Q53395|LPDA|PDHD|SLR1096 DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Synechocystis sp. strain PCC 6803 (474 aa), FASTA scores: opt: 602, E(): 2.3e-31, (31.0% identity in 493 aa overlap); etc. Note that previously known as lpdB. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0794c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits Mb0818c into 2 parts, Mb0818 and Mb0819. Protein product from Mb0818c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0818c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG6" /db_xref="InterPro:IPR004099" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG6" /protein_id="SIT99417.1" /translation="MTPGSWLDVDDTCRVRAVDDGWLYAAGDVNHRALLTHQGKYQAR IAGTAIGARAAGRPLDTTSWGMHATTADHHAVPQAFFTDPEAAAVGLTADQAAQAGHR IKAIDVEIGDVVMGAKLFADGYTGRARMVVDVDRGHLLGVTMVGPGAAELLHSATVAV AGQVPIDRLWHAVPCFPTISELWLRLLESYRDSFYLLV" CDS complement(889926..890825) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0819C" /product="PROBABLE OXIDOREDUCTASE [FIRST PART]" /note="Mb0819c, -, len: 299 aa. Equivalent to 5' end of Rv0794c, len: 499 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 263 aa overlap). Probable oxidoreductase (EC 1.-.-.-), possibly dihydrolipoamide dehydrogenase (EC 1.8.1.4) or mercuric reductase (EC 1.16.1.1). Highly similar to CAB62675.1|AL133422 probable oxidoreductase from Streptomyces coelicolor (477 aa); and similar to various oxidoreductases e.g. P08663|MERA_STAAU MERCURIC REDUCTASE (HG(II) REDUCTASE) (EC 1.16.1.1) from Staphylococcus aureus (547 aa); AAK70920.1|AC087551_19|AC087551 putative lipoamide dehydrogenase from Oryza sativa (563 aa); NP_437349.1|NC_003078 putative FAD-dependent pyridine nucleotide-disulphide oxidoreductase, similar to mercuric reductases protein from Sinorhizobium meliloti (473 aa); Q04829|DLDH_HALVO DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Haloferax volcanii (475 aa); P08332|MERA_SHIFL MERCURIC REDUCTASE (EC 1.16.1.1) (564 aa), FASTA scores: opt: 522, E(): 3.7e-26, (31.7% identity in 467 aa overlap); P72740|DLDH_SYNY3|Q53395|LPDA|PDHD|SLR1096 DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Synechocystis sp. strain PCC 6803 (474 aa), FASTA scores: opt: 602, E(): 2.3e-31, (31.0% identity in 493 aa overlap); etc. Note that previously known as lpdB. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0794c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits Mb0818c into 2 parts, Mb0818a and Mb0818b. Mb0819c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG0" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG0" /protein_id="SIT99418.1" /translation="MTAAQQDQAPMATPGCREGETYDVVVLGAGPVGQNVADRARAGG LRVAVVERELVGGECSYWACVPSKALLRPVIAISDARRVDGAREAVDGSINTAGVFGR RNRYVAHWDDTGQADWVSGIGATLIRGDGRLDGPRRVVVTKSSGESVALTARHAVVIC TGSRPALPDLPGITEARPWTNRQATDNSTVPDRLAIVGAGGVGVEMATAWQGLGASVT LLARGSGLLPRMEPFVGELIGRGLADAGVDVRVGVSVRALGRPNPLAQWSSSWTTVPS CGSTRYSSPPAEHREPTTSAWRQ" CDS 891218..892312 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0820" /product="PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1547" /note="Mb0820, -, len: 364 aa. Equivalent to Rv0797, len 364 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 364 aa overlap). Putative transposase for IS1547; almost identical to (but 20 aa shorter than) Y13470|MTY13470_2 from Mycobacterium tuberculosis (383 aa). Also similar to other transposases e.g. MAIS1110A _1|Q48909 transposase from M. avium (464 aa), FASTA scores: opt: 226, E(): 2.4e-08, (30.7% identity in 199 aa overlap). Also slight similarity to Rv2014|MTCY39.03c from Mycobacterium tuberculosis (222 aa), FASTA score: (24.8% identity in 141 aa overlap)." /db_xref="GOA:A0A1R3XWH1" /db_xref="InterPro:IPR002525" /db_xref="InterPro:IPR003346" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH1" /protein_id="SIT99419.1" /translation="MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWA REQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDPID ALAVARAVLRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPER APAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQ VAPALLEIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLS RSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQA LRTVHQPSSEHTQPAAACHRSYCSRSCLSG" mobile_element 891218..892309 /mobile_element_type="insertion sequence:IS1547" /locus_tag="BQ2027_IS1547-1" /note="IS1547-1, len: 1092 nt. Equivalent to IS1547, len: 1092 nt, from Mycobacterium tuberculosis strain H37Rv,(99.7% identity in 1092 nt overlap)." gene 891218..892309 /locus_tag="BQ2027_IS1547-1" CDS complement(892302..893099) /codon_start=1 /transl_table=11 /gene="cfp29" /locus_tag="BQ2027_MB0821C" /product="29 KDa ANTIGEN CFP29" /note="Mb0821c, cfp29, len: 265 aa. Equivalent to Rv0798c, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). cfp29, 29 kDa antigen (see citations below). Highly similar to Q45296|BLLINM18P_1|CAA63787.1|X93588 linocin M18 from Brevibacterium linens (266 aa), FASTA scores: (58.5% identity in 265 aa overlap). Also shows similarity with NP_228594.1|NC_000853 bacteriocin from Thermotoga maritima (262 aa). Protein product from Mb0821c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0821c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWH2" /db_xref="InterPro:IPR007544" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99420.1" /translation="MNNLYRDLAPVTEAAWAEIELEAARTFKRHIAGRRVVDVSDPGG PVTAAVSTGRLIDVKAPTNGVIAHLRASKPLVRLRVPFTLSRNEIDDVERGSKDSDWE PVKEAAKKLAFVEDRTIFEGYSAASIEGIRSASSNPALTLPEDPREIPDVISQALSEL RLAGVDGPYSVLLSADVYTKVSETSDHGYPIREHLNRLVDGDIIWAPAIDGAFVLTTR GGDFDLQLGTDVAIGYASHDTDTVRLYLQETLTFLCYTAEASVALSH" CDS complement(893096..894103) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0822C" /product="Predicted dye-decolorizing peroxidase (DyP), encapsulated subgroup" /note="Mb0822c, -, len: 335 aa. Equivalent to Rv0799c, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 335 aa overlap). Conserved hypothetical protein, similar to Q50021|U2266C from Mycobacterium leprae (146 aa), FASTA scores: opt: 147, E(): 0.0016, (33.3% identity in 117 aa overlap); Q50020|U2266B from Mycobacterium leprae (27 aa), FASTA scores: opt: 94, E(): 1.3, (56.5% identity in 23 aa overlap). Also highly similar to others e.g. CAC01593.1|AL391041 conserved hypothetical protein from Streptomyces coelicolor (316 aa); AF088897|AF088897_9 hypothetical protein from Zymomonas mobilis (322 aa), FASTA scores: opt: 1132, E(): 0, (56.1% identity in 303 aa overlap); P76536|ECAE000330_8 hypothetical protein from Escherichia coli strain K-12 (308 aa), FASTA scores: E(): 2.2e-30, (37.4% identity in 297 aa overlap); etc. Also similar to some tyrA proteins. Protein product from Mb0822c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0822c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG4" /db_xref="InterPro:IPR006314" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG4" /protein_id="SIT99421.1" /translation="MAVPAVSPQPILAPLTPAAIFLVATIGADGEATVHDALSKISGL VRAIGFRDPTKHLSVVVSIGSDAWDRLFAGPRPTELHPFVELTGPRHTAPATPGDLLF HIRAETMDVCFELAGRILKSMGDAVTVVDEVHGFRFFDNRDLLGFVDGTENPSGPIAI KATTIGDEDRNFAGSCYVHVQKYVHDMASWESLSVTEQERVIGRTKLDDIELDDNAKP ANSHVALNVITDDDGTERKIVRHNMPFGEVGKGEYGTYFIGYSRTPTVTEQMLRNMFL GDPAGNTDRVLDFSTAVTGGLFFSPTIDFLDHPPPLPQAATPTLAAGSLSIGSLKGSP R" CDS 894148..895449 /codon_start=1 /transl_table=11 /gene="pepC" /locus_tag="BQ2027_MB0823" /product="PROBABLE AMINOPEPTIDASE PEPC" /note="Mb0823, pepC, len: 433 aa. Equivalent to Rv0800, len: 433 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 433 aa overlap). Probable pepC, aminopeptidase I (EC 3.4.11.-), highly similar (but shorter 17 aa) to Q50022|PEPX AMINOPEPTIDASE from Mycobacterium leprae (443 aa), FASTA scores: opt: 2237, E(): 0, (78.3% identity in 433 aa overlap). Also highly similar to others from Eukaryotes and bacteria, e.g. T36482 probable aminopeptidase from Streptomyces coelicolor (432 aa), P14904|AMPL_YEAST vacuolar aminopeptidase I precursor from Saccharomyces cerevisiae (514 aa), FASTA scores: opt: 425, E(): 4.8e-21, (31.0% identity in 445 aa overlap); etc. Also similar to hypothetical proteins e.g. P38821|YHR3_YEAST hypothetical 54.2 kd protein from Saccharomyces cerevisiae (490 aa), FASTA scores: opt: 429, E(): 2.5e-21, (34.8% identity in 443 aa overlap); etc. Protein product from Mb0823 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0823 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59951" /db_xref="InterPro:IPR001948" /db_xref="InterPro:IPR022984" /db_xref="InterPro:IPR023358" /db_xref="UniProtKB/Swiss-Prot:P59951" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99422.1" /translation="MAATAHGLCEFIDASPSPFHVCATVAGRLLGAGYRELREADRWP DKPGRYFTVRAGSLVAWNAEQSGHTQVPFRIVGAHTDSPNLRVKQHPDRLVAGWHVVA LQPYGGVWLHSWLDRDLGISGRLSVRDGTGVSHRLVRIDDPILRVPQLAIHLAEDRKS LTLDPQRHINAVWGVGERVESFVGYVAQRAGVAAADVLAADLMTHDLTPSALIGASVN GTASLLSAPRLDNQASCYAGMEALLAVDVDSASSGFVPVLAIFDHEEVGSASGHGAQS DLLSSVLERIVLAAGGTREDFLRRLTTSMLASADMAHATHPNYPDRHEPSHPIEVNAG PVLKVHPNLRYATDGRTAAAFALACQRAGVPMQRYEHRADLPCGSTIGPLAAARTGIP TVDVGAAQLAMHSARELMGAHDVAAYSAALQAFLSAELSEA" CDS 895461..895808 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0824" /product="Glyoxalase family protein" /note="Mb0824, -, len: 115 aa. Equivalent to Rv0801, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Conserved hypothetical protein, similar to many hypothetical proteins from Streptomyces sp. e.g. SCD840A.20|AB81865.1|AL161691 hypothetical protein from Streptomyces coelicolor (145 aa); AF072709|AF072709_8 from Streptomyces lividans (131 aa), FASTA scores: opt: 120, E(): 0.2, (26.3% identity in 118 aa overlap); etc. Protein product from Mb0824 detected using shotgun mass spectrometry. Mb0824 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="InterPro:IPR041581" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF9" /protein_id="SIT99423.1" /translation="MALKVEMVTFDCSDPAKLAGWWAEQFDGTTRELLPGEFVVVART DGPRLGFQKVPDPAPGKNRVHLDFTTKDLDAEVLRLVAAGASEVGRHQVGESFRWVVL ADPEGNAFCVAGQ" CDS complement(895802..896458) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0825C" /product="possible succinyltransferase in the gcn5-related n-acetyltransferase family" /note="Mb0825c, -, len: 218 aa. Equivalent to Rv0802c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Conserved hypothetical protein, showing partial similarity with many acetyltransferases and hypothetical proteins e.g. P96579|BSUB0003_68 PROBABLE ACETYLTRANSFERASE from Bacillus subtilis (183 aa), FASTA scores: E(): 0.0044, (26.4% identity in 110 aa overlap). Protein product from Mb0825c detected using SWATH mass spectrometry. Mb0825c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWJ0" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ0" /protein_id="SIT99424.1" /translation="MSRHWPLFDLRITTPRLQLQLPTEELCDQLIDTILEGVHDPDRM PFSVPWTRASREDLPFNTLSHLWQQLAGFKRDDWSLPLAVLVDGRAVGVQALSSKDFP ITRQVDSGSWLGLRYQGHGYGTEMRAAVLYFAFAELEAQVATSRSFVDNPASIAVSRR NGYRDNGLDRVAREGAMAEALLFRLTRDDWQRHRTVEVRVDGFDRCRPLFGPLEPPRY " CDS 896650..898914 /codon_start=1 /transl_table=11 /gene="purL" /locus_tag="BQ2027_MB0826" /product="PHOSPHORIBOSYLFORMYLGLYCINAMIDINE SYNTHASE II PURL (FGAM SYNTHASE II)" /note="Mb0826, purL, len: 754 aa. Equivalent to Rv0803, len: 754 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 754 aa overlap). purL, phosphoribosylformylglycinamidine synthase II (EC 6.3.5.3) (see citations below), equivalent to NP_302451.1|NC_002677 phosphoribosylformylglycinamidine synthase II from Mycobacterium leprae (754 aa). Also highly similar to others e.g. Q9RKK5|PURL_STRCO from Streptomyces coelicolor (752 aa); P12042|PURL_BACSU from Bacillus subtilis (742 aa), FASTA score: (44.7% identity in 716 aa); etc. Start was chosen by similarity. BELONGS TO THE FGAMS FAMILY. Protein product from Mb0826 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0826 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5T9" /db_xref="InterPro:IPR010074" /db_xref="InterPro:IPR010918" /db_xref="InterPro:IPR016188" /db_xref="InterPro:IPR036676" /db_xref="InterPro:IPR036921" /db_xref="InterPro:IPR041609" /db_xref="UniProtKB/Swiss-Prot:P0A5T9" /protein_id="SIT99425.1" /translation="MLDTVEHAATTPDQPQPYGELGLKDDEYRRIRQILGRRPTDTEL AMYSVMWSEHCSYKSSKVHLRYFGETTSDEMRAAMLAGIGENAGVVDIGDGWAVTFKV ESHNHPSYVEPYQGAATGVGGIVRDIMAMGARPVAVMDQLRFGAADAPDTRRVLDGVV RGIGGYGNSLGLPNIGGETVFDPCYAGNPLVNALCVGVLRQEDLHLAFASGAGNKIIL FGARTGLDGIGGVSVLASDTFDAEGSRKKLPSVQVGDPFMEKVLIECCLELYAGGLVI GIQDLGGAGLSCATSELASAGDGGMTIQLDSVPLRAKEMTPAEVLCSESQERMCAVVS PKNVDAFLAVCRKWEVLATVIGEVTDGDRLQITWHGETVVDVPPRTVAHEGPVYQRPV ARPDTQDALNADRSAKLSRPVTGDELRATLLALLGSPHLCSRAFITEQYDRYVRGNTV LAEHADGGMLRIDESTGRGIAVSTDASGRYTLLDPYAGAQLALAEAYRNVAVTGATPV AVTNCLNFGSPEDPGVMWQFTQAVRGLADGCADLGIPVTGGNVSFYNQTGSAAILPTP VVGVLGVIDDVRRRIPTGLGAEPGETLMLLGDTRDEFDGSVWAQVTADHLGGLPPVVD LAREKLLAAVLSSASRDGLVSAAHDLSEGGLAQAIVESALAGETGCRIVLPEGADPFV LLFSESAGRVLVAVPRTEESRFRGMCEARGLPAVRIGVVDQGSDAVEVQGLFAVSLAE LRATSEAVLPRYFG" CDS 898911..899540 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0827" /product="Integral membrane protein" /note="Mb0827, -, len: 209 aa. Equivalent to Rv0804, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Conserved hypothetical protein, showing similarity with C-terminus of Rv1863c|MTCY359.10 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (256 aa), FASTA scores: opt: 199, E(): 1.2e-05, (33.2% identity in 220 aa overlap); and Rv0658c. Contains PS01151 Fimbrial biogenesis outer membrane usher protein signature. Mb0827 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWG8" /db_xref="InterPro:IPR003675" /db_xref="InterPro:IPR015837" /db_xref="UniProtKB/TrEMBL:A0A1R3XWG8" /protein_id="SIT99426.1" /translation="MSRLRALSLAAGLVGWSLVSPRLPAPWRIPLQAGLGSVLVLVTR ATMGLWPPRLWAGLRLGWAAGAAAATAIAATTPVPMVRLSMSARELPASVPVWLVWHI PGGTVWAEEAAFRGALATIGARAFGRSGGRILQAGAFGLSHIADARATGEPLVLTVLA TGIAGWMFGWLADRSGSLAAPLLTHLAINEAGAVAAVLVQRRSGISTRL" CDS 899661..900617 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0828" /product="class iii cyclic nucleotide phosphodiesterase (cnmp pde)" /note="Mb0828, -, len: 318 aa. Equivalent to Rv0805, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Conserved hypothetical protein, equivalent to Q50024 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (317 aa), FASTA scores: opt: 1713, E(): 0, (82.5% identity in 315 aa overlap). Also shows similarity with hypothetical proteins and icc proteins e.g. SC9B1.22c|T35867 hypothetical protein from Streptomyces coelicolor (305 aa); P36650|ICC_ECOLI icc protein from Escherichia coli (275 aa), FASTA scores: opt: 310, E(): 8.9e-14, (31.3% identity in 214 aa overlap); etc. Mb0828 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWH6" /db_xref="InterPro:IPR004843" /db_xref="InterPro:IPR026575" /db_xref="InterPro:IPR029052" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH6" /protein_id="SIT99427.1" /translation="MHRLRAAEHPRPDYVLLHISDTHLIGGDRRLYGAVDADDRLGEL LEQLNQSGLRPDAIVFTGDLADKGEPAAYRKLRGLVEPFAAQLGAELVWVMGNHDDRA ELRKFLLDEAPSMAPLDRVCMIDGLRIIVLDTSVPGHHHGEIRASQLGWLAEELATPA PDGTILALHHPPIPSVLDMAVTVELRDQAALGRVLRGTDVRAILAGHLHYSTNATFVG IPVSVASATCYTQDLTVAAGGTRGRDGAQGCNLVHVYPDTVVHSVIPLGGGETVGTFV SPGQARRKIAESGIFIEPSRRDSLFKHPPMVLTSSAPRSPVD" CDS complement(900562..902160) /codon_start=1 /transl_table=11 /gene="cpsY" /locus_tag="BQ2027_MB0829C" /product="POSSIBLE UDP-GLUCOSE-4-EPIMERASE CPSY (GALACTOWALDENASE) (UDP-GALACTOSE-4-EPIMERASE) (URIDINE DIPHOSPHATE GALACTOSE-4-EPIMERASE) (URIDINE DIPHOSPHO-GALACTOSE-4-EPIMERASE)" /note="Mb0829c, cpsY, len: 532 aa. Equivalent to Rv0806c, len: 532 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 532 aa overlap). Possible cpsY, UDP-glucose-4-epimerase (EC 5.1.3.2), equivalent to Q50025|CPSY probable UDP-glucose-4-epimerase from Mycobacterium leprae (542 aa), FASTA scores: opt: 2964, E(): 0, (82.3% identity in 530 aa overlap). Also similar to AAC38286.1|AF019760|SACB CpsY homolog (involved in meningococcal capsule biosynthesis) from Neisseria meningitidis serogroup A (545 aa); Q51151 CAPSULE GENE COMPLEX UPD-GLUCOSE-4-EPIMERASE (GALE) from Neisseria meningitidis (373 aa), FASTA scores: opt: 496, E(): 9.5e-27, (29.3% identity in 358 aa overlap); C-terminus of CAB75373.1|AL139298 putative transferase from Streptomyces coelicolor (942 aa); and many hypothetical proteins from Streptomyces coelicolor. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY. Protein product from Mb0829c detected using SWATH mass spectrometry. Mb0829c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U184" /db_xref="InterPro:IPR021520" /db_xref="InterPro:IPR031356" /db_xref="InterPro:IPR031357" /db_xref="InterPro:IPR031358" /db_xref="UniProtKB/Swiss-Prot:Q7U184" /protein_id="SIT99428.1" /translation="MPKISSRDGGRPAQRTVNPIIVTRRGKIARLESGLTPQEAQIED LVFLRKVLNRADIPYLLIRNHKNRPVLAINIELRAGLERALAAACATEPMYAKTIDEP GLSPVLVATDGLSQLVDPRVVRLYRRRIAPGGFRYGPAFGVELQFWVYEETVIRCPVE NSLSRKVLPRNEITPTNVKLYGYKWPTLDGMFAPHASDVVFDIDMVFSWVDGSDPEFR ARRMAQMSQYVVGEGDDAEARIRQIDELKYALRSVNMFAPWIRRIFIATDSTPPPWLA EHPKITIVRAEDHFSDRSALPTYNSHAVESQLHHIPGLSEHFLYSNDDMFFGRPLKAS MFFSPGGVTRFIEAKTRIGLGANNPARSGFENAARVNRQLLFDRFGQVITRHLEHTAV PLRKSVLIEMEREFPEEFARTAASPFRSDTDISVTNSFYHYYALMTGRAVPQEKAKVL YVDTTSYAGLRLLPKLRKHRGYDFFCLNDGSFPEVPAAQRAERVVSFLERYFPIPAPW EKIAADVSRRDFAVPRTSAPSEGA" CDS 902465..902854 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0830" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0830, -, len: 129 aa. Equivalent to Rv0807, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, equivalent to O05761|MLCB5_31 HYPOTHETICAL 14.0 KD PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: E(): 0, (73.4% identity in 128 aa overlap). Also highly similar to BAA89438.1|AB003158|ORF3 HYPOTHETICAL PROTEIN from Corynebacterium ammoniagenes (132 aa); and C-terminus of SCD25.20|CAB56364.1|AL118514 hypothetical protein from Streptomyces coelicolor (202 aa). Protein product from Mb0830 detected using SWATH mass spectrometry. Mb0830 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041629" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH8" /protein_id="SIT99429.1" /translation="MSARDRVDPAKTRQVVLALADWLRDETLPAPDTDVLAAAVRLTA RTLAALAPGASVEVRIPPFAAVQCISGPRHTRGTPPNVVQTDPRTWLLVATGLSGVAQ ARGSGALQLSGSRAGEIEAWLPLVDLG" CDS 902941..904524 /codon_start=1 /transl_table=11 /gene="purF" /locus_tag="BQ2027_MB0831" /product="amidophosphoribosyltransferase purf (glutamine phosphoribosylpyrophosphate amidotransferase) (atase) (gpatase)" /note="Mb0831, purF, len: 527 aa. Equivalent to Rv0808, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 527 aa overlap). Probable purF, amidophosphoribosyltransferase (EC 2.4.2.14), equivalent to MLCB5_32|Q50028|PURF from Mycobacterium leprae (556 aa), FASTA scores: (91.3% identity in 518 aa overlap); and CAB96578.1|AJ278609 phosphoribosyl pyrophosphate amidotransferase from Mycobacterium smegmatis (511 aa). Also highly similar to others e.g. BAA89439.1|AB003158 amidophosphoribosyl transferase from Corynebacterium ammoniagenes (490 aa); P00497|PUR1_BACSU amidophosphoribosyltransferase precursor from Bacillus subtilis (476 aa), FASTA scores: opt: 1412, E(): 0, (46.2% identity in 470 aa overlap); etc. Contains PS00103 Purine/pyrimidine phosphoribosyl transferases signature. BELONGS TO THE PURINE/PYRIMIDINE PHOSPHORIBOSYLTRANSFERASE FAMILY. Protein product from Mb0831 detected using SWATH mass spectrometry. Mb0831 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65830" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR005854" /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR029055" /db_xref="InterPro:IPR029057" /db_xref="InterPro:IPR035584" /db_xref="UniProtKB/Swiss-Prot:P65830" /protein_id="SIT99430.1" /translation="MAVDSDYVTDRAAGSRQTVTGQQPEQDLNSPREECGVFGVWAPG EDVAKLTYYGLYALQHRGQEAAGIAVADGSQVLVFKDLGLVSQVFDEQTLAAMQGHVA IGHCRYSTTGDTTWENAQPVFRNTAAGTGVALGHNGNLVNAAALAARARDAGLIATRC PAPATTDSDILGALLAHGAADSTLEQAALDLLPTVRGAFCLTFMDENTLYACRDPYGV RPLSLGRLDRGWVVASETAALDIVGASFVRDIEPGELLAIDADGVRSTRFANPTPKGC VFEYVYLARPDSTIAGRSVHAARVEIGRRLARECPVEADLVIGVPESGTPAAVGYAQE SGVPYGQGLMKNAYVGRTFIQPSQTIRQLGIRLKLNPLKEVIRGKRLIVVDDSIVRGN TQRALVRMLREAGAVELHVRIASPPVKWPCFYGIDFPSPAELIANAVENEDEMLEAVR HAIGADTLGYISLRGMVAASEQPTSRLCTACFDGKYPIELPRETALGKNVIEHMLANA ARGAALGELAADDEVPVGR" CDS 904555..905649 /codon_start=1 /transl_table=11 /gene="purM" /locus_tag="BQ2027_MB0832" /product="PROBABLE PHOSPHORIBOSYLFORMYLGLYCINAMIDINE CYCLO-LIGASE PURM (AIRS) (PHOSPHORIBOSYL-AMINOIMIDAZOLE SYNTHETASE) (AIR SYNTHASE)" /note="Mb0832, purM, len: 364 aa. Equivalent to Rv0809, len: 364 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 364 aa overlap). Probable purM, 5'-phosphoribosyl-5-aminoimidazole synthetase (EC 6.3.3.1), equivalent to NP_302446.1|NC_002677 5'-phosphoribosyl-5-aminoimidazole synthase from Mycobacterium leprae (364 aa). Also highly similar to many e.g. P12043|PUR5_BACSU PHOSPHORIBOSYLFORMYLGLYCINAMIDINE CYCLO-LIGASE from Bacillus subtilis (346 aa), FASTA scores: opt: 1023, E(): 0, (46.5% identity in 331 aa overlap); U68765|STU68765_2 from Salmonella typhimurium (345 aa), FASTA scores: opt: 1014, E():0, (47.6% identity in 330 aa overlap); etc. Protein product from Mb0832 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0832 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWH4" /db_xref="InterPro:IPR004733" /db_xref="InterPro:IPR010918" /db_xref="InterPro:IPR016188" /db_xref="InterPro:IPR036676" /db_xref="InterPro:IPR036921" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99431.1" /translation="MTDLAKGPGKDPGSRGITYASAGVDIEAGDRAIDLFKPLASKAT RPEVRGGLGGFAGLFTLRGDYREPVLAASSDGVGTKLAIAQAMDKHDTVGLDLVAMVV DDLVVCGAEPLFLLDYIAVGRIVPERLSAIVAGIADGCMRAGCALLGGETAEHPGLIE PDHYDISATGVGVVEADNVLGPDRVKPGDVIIAMGSSGLHSNGYSLVRKVLLEIDRMN LAGHVEEFGRTLGEELLEPTRIYAKDCLALAAETRVRTFCHVTGGGLAGNLQRVIPHG LIAEVDRGTWTPAPVFTMIAQRGRVRRTEMEKTFNMGVGMIAVVAPEDTTRALAVLTA RHLDCWVLGTVCKGGKQGPRAKLVGQHPRF" CDS complement(905735..905917) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0833C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0833c, -, len: 60 aa. Equivalent to Rv0810c, len: 60 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 60 aa overlap). Conserved hypothetical protein, with its N-terminus highly similar to NP_302445.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (62 aa); and AL118514|SCD25_24 hypothetical protein from Streptomyces coelicolor (84 aa), FASTA scores: opt: 180, E(): 5.7e-07, (51.8% identity in 56 aa overlap). TBparse score is 0.876. Protein product from Mb0833c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0833c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021426" /db_xref="UniProtKB/TrEMBL:A0A1R3XYL1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99432.1" /translation="MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSGTGTDRLDGD GPSDDDSWNDEDDWRR" CDS complement(906064..907170) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0834C" /product="tRNA-modifying protein YgfZ" /note="Mb0834c, -, len: 368 aa. Equivalent to Rv0811c, len: 368 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 368 aa overlap). Conserved hypothetical protein, equivalent to U2266F|U15182|MLU15182_13 HYPOTHETICAL PROTEIN from Mycobacterium leprae (366 aa), FASTA scores: opt: 1870, E(): 0, (77.4% identity in 367 aa overlap). Also highly similar to BAA89441.1|AB003158|ORF4 HYPOTHETICAL PROTEIN from Corynebacterium ammoniagenes (359 aa); and CAB94085.1|AL358692 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (321 aa). Protein product from Mb0834c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0834c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXG9" /db_xref="InterPro:IPR006222" /db_xref="InterPro:IPR017703" /db_xref="InterPro:IPR027266" /db_xref="InterPro:IPR028896" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG9" /protein_id="SIT99433.1" /translation="MAAVPAPDPGPDAGAIWHYGDPLGEQRAGQADAVLVDRSHRAVL TLDGGDRQTWLHSISTQHVSDLPEGASTQNLSLDGQGRVEDHWIQTELGGTTYLDTEP WRGEPLLAYLRKMVFWSMVTPRAADMAVLSLLGPRLAEERVLDALGLDVLPAEWLAVP LAGGGIVRRMPDGLAGQIELDVVVKRGDRADWQRRLTQAGVRPAGIWAYEAHRVAHRV PARRPRLGVDTDERTIPHEVGWIGGPGAGAVHLNKGCYRGQETVARVHNLGRPPRMLV LLHLDESVQRPSTGDAVLAGGRTVGRLGTVVEHVELGPVALALLKRGLPGDTALVTGP EAEVAAVIDVDSLPPADDVGAGRRAVERLRGGIR" CDS 907253..908122 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0835" /product="PROBABLE AMINO ACID AMINOTRANSFERASE" /note="Mb0835, -, len: 289 aa. Equivalent to Rv0812, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 289 aa overlap). Probable amino acid aminotransferase (EC 2.6.1.-), similar to other amino acid aminotransferases, generelly CLASS-IV OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES, and especially ILVE proteins and PABC proteins e.g. B76065.1|AL157953 putative aminotransferase from Streptomyces coelicolor (273 aa); NP_069766.1|NC_000917 branched-chain amino acid aminotransferase (ilvE) from Archaeoglobus fulgidus (290 aa); P54692|DAAA_BACLI D-ALANINE AMINOTRANSFERASE from Bacillus licheniformis (283 aa); P28305|PABC_ECOLI 4-AMINO-4-DEOXYCHORISMATE LYASE From Escherichia coli (269 aa), FASTA scores: opt: 165, E(): 0.00064, (26.8% identity in 198 aa overlap); etc. Note that previously known as pabC. Protein product from Mb0835 detected using SWATH mass spectrometry. Mb0835 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWK1" /db_xref="InterPro:IPR001544" /db_xref="InterPro:IPR017824" /db_xref="InterPro:IPR036038" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK1" /protein_id="SIT99434.1" /translation="MVVTLDGEILQPGMPLLHADDLAAVRGDGVFETLLVRDGRACLV EAHLQRLTQSARLMDLPEPDLPRWRRAVEVATQRWVASTADEGALRLIYSRGREGGSA PTAYVMVSPVPARVIGARRDGVSAITLDRGLPADGGDAMPWLMASAKTLSYAVNMAVL RHAARQGAGDVIFVSTDGYVLEGPRSTVVIATDGDQGGGNPCLLTPPPWYPILRGTTQ QALFEVARAKGYDCDYRALRVADLFDSQGIWLVSSMTLAARVHTLDGRRLPRTPIAEV FAELVDAAIVSDR" CDS complement(908168..908848) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0836C" /product="DUF1794" /note="Mb0836c, -, len: 226 aa. Equivalent to Rv0813c, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 226 aa overlap). Conserved hypothetical protein, highly similar to U15182|MLU15182_16 HYPOTHETICAL PROTEIN from Mycobacterium leprae (242 aa), FASTA scores: opt: 1191, E(): 0, (78.3% identity in 226 aa overlap); and NP_302442.1|NC_002677 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (228 aa). Also similar to AB94083.1|AL358692|SCD66.16 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (191 aa); and Rv2717c|MTCY05A6_37 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (164 aa), FASTA score: (30.4% identity in 171 aa overlap). Protein product from Mb0836c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0836c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U178" /db_xref="InterPro:IPR012674" /db_xref="InterPro:IPR014878" /db_xref="InterPro:IPR022939" /db_xref="UniProtKB/Swiss-Prot:Q7U178" /protein_id="SIT99435.1" /translation="MSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFD DLPVPADTANLREGADLNNALLALLPLVGVWRGEGEGRGPDGDYRFGQQIVVSHDGGD YLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLAHSAGYVELFY GRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDLAYVEERVDADGGLVPHLS ARLSRFVG" CDS complement(909011..909313) /codon_start=1 /transl_table=11 /gene="sseC2" /locus_tag="BQ2027_MB0837C" /product="Sulfur metabolism protein SseC" /note="Mb0837c, sseC2, len: 100 aa. Equivalent to Rv0814c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). sseC2, conserved hypothetical protein, highly similar to AAA62972.1|U15182|MLU15182_17 hypothetical protein from Mycobacterium leprae (143 aa), FASTA scores: opt: 545, E(): 0, (84.0% identity in 100 aa overlap); and NP_302441.1|NC_002677|Z95150|MTCY164_29 conserved hypothetical protein from Mycobacterium leprae (100 aa), FASTA scores: opt: 647, E(): 0, (100.0% identity in 100 aa overlap). Also highly similar to M29612|SERCYSA_5 rhodanese-like protein from Saccharopolyspora erythraea (101 aa), FASTA scores: opt: 345, E(): 1.2e-18, (57.1% identity in 98 aa overlap); and similar at the C-terminus to the C-terminus of CAB94069.1|AL358692 conserved hypothetical protein from Streptomyces coelicolor (95 aa). Identical second copy present as Rv3118|MTCY164.28|SSEC1 from Mycobacterium tuberculosis (100 aa) (100.0% identity in 100 aa overlap). Protein product from Mb0837c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0837c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008969" /db_xref="InterPro:IPR010814" /db_xref="UniProtKB/TrEMBL:A0A1R3XWI1" /protein_id="SIT99436.1" /translation="MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLD SSDEFTAEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT" CDS complement(909315..910148) /codon_start=1 /transl_table=11 /gene="cysA2" /locus_tag="BQ2027_MB0838C" /product="PROBABLE THIOSULFATE SULFURTRANSFERASE CYSA2 (RHODANESE-LIKE PROTEIN) (THIOSULFATE CYANIDE TRANSSULFURASE) (THIOSULFATE THIOTRANSFERASE)" /note="Mb0838c, cysA2, len: 277 aa. Equivalent to Rv0815c, len: 277 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 277 aa overlap). Probable cysA2, thiosulfate sulfurtransferase (EC 2.8.1.1), equivalent to Q50036|CYSA|CYSA3|ML2198|THTR_MYCLE PUTATIVE SULFURTRANSFERASE THIOSULFATE from Mycobacterium leprae (277 aa). Also highly similar to other putative thiosulfate sulfurtransferases e.g. P16385|THTR_SACER PUTATIVE THIOSULFATE SULFURTRANSFERASE from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa); NP_293941.1|NC_001263 thiosulfate sulfurtransferase from Deinococcus radiodurans (286 aa); etc. Identical second copy present as Rv3117|MTCY164.27|MT3199|O05793|cysA3 (277 aa) (100.0% identity in 277 aa overlap). Contains PS00683 Rhodanese C-terminal signature at C-terminus. BELONGS TO THE RHODANESE FAMILY. Protein product from Mb0838c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0838c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59989" /db_xref="InterPro:IPR001307" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/Swiss-Prot:P59989" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99437.1" /translation="MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIK LDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTVILYGGNNNWFAAYAYWYFKLYG HEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNL IDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLY ADAGLDNSKETIAYCRIGERSSHTWFVLRELLGHQNVKNYDGSWTEYGSLVGAPIELG S" CDS complement(910441..910863) /codon_start=1 /transl_table=11 /gene="thiX" /locus_tag="BQ2027_MB0839C" /product="PROBABLE THIOREDOXIN THIX" /note="Mb0839c, thiX, len: 140 aa. Equivalent to Rv0816c, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Probable thiX, thioredoxin (EC 1.-.-.-), equivalent to ThiX|U15182|MLU15182_21 thioredoxin from Mycobacterium leprae (172 aa), FASTA scores: opt: 556, E(): 8.8e-31, (63.8% identity in 141 aa overlap); and similar to AAL08576.1|AF418548_2|AF418548 thioredoxin from Mycobacterium avium subsp. paratuberculosis (117 aa). Also similar to other bacterial thioredoxins e.g. CAB95303.1|AL359779 putative thioredoxin from Streptomyces coelicolor (126 aa); P33791|THIO_STRAU|TRX|TRXA THIOREDOXIN from Streptomyces aureofaciens (106 aa); etc. And similar to Rv3914|MT4033|MTV028.05|NP_218431.1|NC_0009 62|trxC THIOREDOXIN (TRX) (MPT46) from Mycobacterium tuberculosis (116 aa). Has hydrophobic stretch at N-terminus. SEEMS TO BELONG TO THE THIOREDOXIN FAMILY." /db_xref="GOA:A0A1R3XWH9" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XWH9" /protein_id="SIT99438.1" /translation="MTTMIVASVATGALATIARWLLTRRSVILREVGPETTPAAPART AELGLSGAGPTVVHFRAPGCAPCDRVRRGVGDVCADLGDVAHIEVDLDSNPQAARRFS VLSLPTTLIFDVDGRQRYRTSGVPKAADLRSALKPLLA" CDS complement(910860..911672) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0840C" /product="PROBABLE CONSERVED EXPORTED PROTEIN" /note="Mb0840c, -, len: 270 aa. Equivalent to Rv0817c, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 270 aa overlap). Probable conserved exported protein, with N-terminal signal sequence, equivalent (but shorter 13 aa) to U15182|MLU15182_22|U2266M probable exported protein from Mycobacterium leprae (283 aa), FASTA scores: opt: 1287, E(): 0, (73.0% identity in 270 aa overlap). Protein product from Mb0840c detected using SWATH mass spectrometry. Mb0840c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021373" /db_xref="UniProtKB/TrEMBL:A0A1R3XWI7" /protein_id="SIT99439.1" /translation="MPMRKVLVGVTGAAIVVAVLIVGAVGADFGASIYAEYRLSTTVR KAANLRSDPFVAILRFPFIPQAMREHYAELEIKAFAVEHAGSGTATLEATMHSIDLSY ASWLIRPDAKLPVGELESRIIIDSMHLGRYLGISDLMVAAPRQESNDATGGTTESGIS GSRGLVFSGTPISANFAHRVSVLVDLSVASDDRATLVITPTAVVTGPDTADQPVPDDK RDAVLHAFASKLPNQKLPFGVVPNTVGARGSDVIIEGITRGVTISLDEFKQS" CDS 911802..912569 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0841" /product="TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0841, -, len: 255 aa. Equivalent to Rv0818, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 255 aa overlap). Probable transcriptional regulatory protein, highly similar to Q05943|GLNR_STRCO|L03213|STMGLNR_1|SCD84.26c TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 945, E(): 0, (61.5 identity in 239 aa overlap); and similar to others from other organisms. Also similar to Rv2884|MTCY274.15|Z74024 from Mycobacterium tuberculosis (252 aa), FASTA scores: opt: 662, E(): 0, (47.8% identity in 226 aa overlap). Protein product from Mb0841 detected using shotgun mass spectrometry. Mb0841 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWJ5" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ5" /protein_id="SIT99440.1" /translation="MLELLLLTSELYPDPVLPALSLLPHTVRTAPAEASSLLEAGNAD AVLVDARNDLSSGRGLCRLLSSTGRSIPVLAVVSEGGLVAVSADWGLDEILLPSTGPA EIDARLRLVVGRRGDLADQESLGKVSLGELVIDEGTYTARLRGRPLDLTYKEFELLKY LAQHAGRVFTRAQLLHEVWGYDFFGGTRTVDVHVRRLRAKLGPEHEALIGTVRNVGYK AVRPARGRPPAADPDDEDADPGRDGMQEPLVDPLRSQ" CDS 912566..913513 /codon_start=1 /transl_table=11 /gene="mshD" /locus_tag="BQ2027_MB0842" /product="gcn5-related n-acetyltransferase, mshd" /note="Mb0842, -, len: 315 aa. Equivalent to Rv0819, len: 315 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 315 aa overlap). Conserved hypothetical protein, equivalent to U2266N|U15182|MLU15182_24 HYPOTHETICAL PROTEIN from Mycobacterium leprae (312 aa), FASTA scores: opt: 1540, E(): 0, (75.2% identity in 314 aa overlap). Also highly similar to CAB88484.1|AL353816 putative acetyltransferase from Streptomyces coelicolor (309 aa). Protein product from Mb0842 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0842 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U173" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR017813" /db_xref="UniProtKB/Swiss-Prot:Q7U173" /protein_id="SIT99441.1" /translation="MTALDWRSALTADEQRSVRALVTATTAVDGVAPVGEQVLRELGQ QRTEHLLVAGSRPGGPIIGYLNLSPPRGAGGAMAELVVHPQSRRRGIGTAMARAALAK TAGRNQFWAHGTLDPARATASALGLVGVRELIQMRRPLRDIPEPTIPDGVVIRTYAGT SDDAELLRVNNAAFAGHPEQGGWTAVQLAERRGEAWFDPDGLILAFGDSPRERPGRLL GFHWTKVHPDHPGLGEVYVLGVDPAAQRRGLGQMLTSIGIVSLARRLGGRKTLDPAVE PAVLLYVESDNVAAVRTYQSLGFTTYSVDTAYALAGTDN" CDS 913556..914332 /codon_start=1 /transl_table=11 /gene="phoT" /locus_tag="BQ2027_MB0843" /product="PROBABLE PHOSPHATE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER PHOT" /note="Mb0843, phoT, len: 258 aa. Equivalent to Rv0820, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 258 aa overlap). Probable phoT, phosphate-transport ATP-binding protein ABC transporter (see citation below), equivalent to PhoT|MLU15182_28|U15182 phosphate transport system ABC transporter from Mycobacterium leprae (258 aa), FASTA scores: opt: 1556, E(): 0, (91.5% identity in 258 aa overlap). Also highly similar to others e.g. CAB88472.1|AL353816 phosphate ABC transport system ATP-binding protein from Streptomyces coelicolor (258 aa); etc. Note that also highly similar to many PstB proteins e.g. AAC15686.1|AF045938|PstB putative ABC transporter nucleotide binding subunit from Mycobacterium smegmatis (258 aa). Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb0843 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0843 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U172" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005670" /db_xref="InterPro:IPR015850" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:Q7U172" /protein_id="SIT99442.1" /translation="MAKRLDLTDVNIYYGSFHAVADVSLAILPRSVTALIGPSGCGKT TVLRTLNRMHEVIPGARVEGAVLLDDQDIYAPGIDPVGVRRAIGMVFQRPNPFPAMSI RNNVVAGLKLQGVRNRKVLDDTAESSLRGANLWDEVKDRLDKPGGGLSGGQQQRLCIA RAIAVQPDVLLMDEPCSSLDPISTMAIEDLISELKQQYTIVIVTHNMQQAARVSDQTA FFNLEAVGKPGRLVEIASTEKIFSNPNQKATEDYISGRFG" CDS complement(914388..915029) /codon_start=1 /transl_table=11 /gene="phoY2" /locus_tag="BQ2027_MB0844C" /product="PROBABLE PHOSPHATE-TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATORY PROTEIN PHOY2" /note="Mb0844c, phoY2, len: 213 aa. Equivalent to Rv0821c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Probable phoY2, phosphate-transport system regulatory protein, highly similar to PhoY|MLU15182_29|U15182 phosphate transport system regulator from Mycobacterium leprae (222 aa), FASTA scores: opt: 1268, E(): 0, (93.0% identity in 213 aa overlap). Also similar to others e.g. NP_384620.1|NC_003047 PROBABLE PHOSPHATE TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATOR PROTEIN from Sinorhizobium meliloti (237 aa); etc. Also highly similar to MTCI418A.03c|Z96070|PhoY1 PROBABLE PHOSPHATE TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATOR PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 937, E(): 0, (63.4% identity in 213 aa overlap). BELONGS TO THE PHOU FAMILY. TBparse score is 0.910. Protein product from Mb0844c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0844c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65721" /db_xref="InterPro:IPR026022" /db_xref="InterPro:IPR028366" /db_xref="InterPro:IPR038078" /db_xref="UniProtKB/Swiss-Prot:P65721" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99443.1" /translation="MRTAYHEQLSELSERLGEMCGLAGIAMERATQALLQADLVLAEQ VISDHEKIATLSARAEESAFVLLALQAPVAGDLRAIVSAIQMVADIDRMGALALHVAK IARRRHPQHALPEEVNGYFAEMGRVAVELGNSAQEVVLSHDPEKAAQIREEDDAMDDL HRHLFTVLMDREWKHGVAAAVDVTLLSRFYERFADHAVEVARRVIFQATGAFP" CDS complement(915087..917141) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0845C" /product="Cell envelope-associated transcriptional attenuator LytR-CpsA-Psr, subfamily A1" /note="Mb0845c, -, len: 684 aa. Equivalent to Rv0822c, len: 684 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 684 aa overlap). Conserved hypothetical protein, highly similar in the region between aa 370 - 580 to U2266O|U15182|MLU15182_30 HYPOTHETICAL PROTEIN from Mycobacterium leprae (222 aa), FASTA scores: opt: 819, E(): 0, (60.6% identity in 221 aa overlap). More extended similarity to Rv3267|Z92771|MTCY71_7 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 434, E(): 2.2e-17, (26.6% identity in 541 aa overlap), and Rv3484. Also similar to various proteins, preferiously putative membrane proteins and membrane-bound regulatory proteins e.g. CAC44512.1|AL596138 putative membrane protein from Streptomyces coelicolor (524 aa); U56901|BSU56901_1 regulatory protein from Bacillus subtilis (391 aa), FASTA scores: opt: 225, E(): 1.3e-05, (24.7% identity in 340 aa overlap). Contains hydrophobic stretch (aa ~ 160-195) and PS00041 Bacterial regulatory proteins, araC family signature. Mb0845c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004474" /db_xref="InterPro:IPR027381" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL2" /protein_id="SIT99444.1" /translation="MSDGESAAPWARLSESAFPDGVDRWITVPPATWVAAQGPRDTQN VGCHATGAVSVADLIARLGPAFPDLPTHRHVAPEPEPSGRGPKVPDDADDQQDTEAIA IPAHSLEFLSELPDLRAANYPRADHARREPELPGKQLTGSARVRPLRIRRTSPAPAKP APNSGRRPMVLAARSLAALFAALALALTGGAWQWSASKNSRLNMVSALDPHSGDIVNP SGQHGDENFLLVGMDSRAGANANIGAGDAEDAGGARSDTVMLVNIPASRERVVAVSFP RDLAITPIQCEAWNPETGKYGPIYDEKTGTMGPRLVYTETKLNSAFSFGGPKCLVKVI QKLSGLSINRFIAIDFVGFARMVEALGGVEVCSTTPLRDYELGTVLEHAGRQVIDGPT ALNYVRARQVTTESNGDYGRIKRQQLFLSSLLRSMISTDTLFNLSRLNNVVNMFIGNS YVDNVKTKDLVELGRSLQHMAAGHVTFVTVPTGITDQNGDEPPRTSDMKALFTAIIDD DPLPLENDHNAQRLGNTPSTPPTTTKKAPQAGLTNEIQHQQVTTTSPKEVTVQVSNST GQAGLATTATDQLKRNGFNVMAPDDYPSSLLATTVFFSPGNEQAAATVAAVFGQSKIE RVTGIGQLVQVVLGQDFSAVRAPLPSGSTVSVQISRNSSSPPTKLPEDLTVTNAADTT CE" CDS complement(917307..918476) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0846C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0846c, -, len: 389 aa. Equivalent to Rv0823c, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 389 aa overlap). Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overlap) (see citation below). Also highly similar to CAB63312.1|AL133471|SCC82.03c hypothetical protein from Streptomyces coelicolor (406 aa); and to many transcriptional regulators members of UPF0034 FAMILY (NIFR3/SMM1) e.g. D26185|BAC180K_143 protein similar to transcriptional regulator (nitrogen regulation protein) from Bacillus subtilis (333 aa), FASTA scores: opt: 609, E(): 1.4e-32, (38.3% identity in 326 aa overlap); NP_349795.1|NC_003030 NifR3 family enzyme from Clostridium acetobutylicum (321 aa); etc. Contains PS01136 Uncharacterized protein family UPF0034 signature. Protein product from Mb0846c detected using SWATH mass spectrometry. Mb0846c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWX6" /db_xref="InterPro:IPR001269" /db_xref="InterPro:IPR004652" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR018517" /db_xref="InterPro:IPR024036" /db_xref="InterPro:IPR035587" /db_xref="UniProtKB/TrEMBL:A0A1R3XWX6" /protein_id="SIT99445.1" /translation="MSRRRAIQPSPALRIGPIELASPVVLAPMAGVTNVAFRALCRQL EQSKVGTVSGLYVCEMVTARALIERHPVTMHMTTFSADESPRSLQLYTVDPDTTYAAA RMIAGEGLADHIDMNFGCPVPKVTKRGCGAALPFKRRLFGQIVAAAVRATEGTDIPVT VKFRIGIDDAHHTHLDAGRIAEAEGAAAVALHARTAAQRYSGTADWEQIARLKQHVRT IPVLGNGDIYDAGDALAMMSTTGCDGVVIGRGCLGRPWLFAELSAAFTGSPAPTPPTL GEVADIIRRHGTLLAAHFGEDKGMRDIRKHIAWYLHGFPAGSALRRALAMVKTLDELD CLLDRLDGTVPFPDSATGARGRQGSPARVALPDGWLTDPDDCRVPEGADAMGSGG" CDS complement(918564..919580) /codon_start=1 /transl_table=11 /gene="desA1" /locus_tag="BQ2027_MB0847C" /standard_name="des" /product="PROBABLE ACYL-[ACYL-CARRIER PROTEIN] DESATURASE DESA1 (ACYL-[ACP] DESATURASE) (STEAROYL-ACP DESATURASE) (PROTEIN DES)" /note="Mb0847c, desA1, len: 338 aa. Equivalent to Rv0824c, len: 338 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 338 aa overlap). Probable desA1 (alternate gene name: des), acyl-[acyl-carrier protein] desaturase (stearoyl-ACP desaturase) (EC 1.14.19.2) (see first citation below), equivalent to U15182|MLU15182_32 acyl-[ACP] desaturase from Mycobacterium leprae (338 aa), FASTA scores: opt: 1880, E(): 0, (79.9% identity in 338 aa overlap); and highly similar in part to fragment CAB96061.1|AJ250019 Steroyl-ACP-desaturase from Mycobacterium avium subsp. paratuberculosis (93 aa). Also similar to other fatty acid desaturases e.g. T35035 probable acyl-[acyl-carrier protein] desaturase from Streptomyces coelicolor (328 aa); Q40731|STAD_ORYSA ACYL-[ACYL-CARRIER PROTEIN] DESATURASE PRECURSOR from Oryza sativa (Rice) (390 aa); etc. Also highly similar to desA2|Rv1094 from Mycobacterium tuberculosis (275 aa). Contains PS00225 Crystallins beta and gamma 'Greek key' motif signature. BELONGS TO THE FATTY ACID DESATURASE FAMILY. COFACTOR: FERREDOXIN, FERREDOXIN NADPH REDUCTASE, AND NADPH. Protein product from Mb0847c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0847c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWJ1" /db_xref="InterPro:IPR005067" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012348" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99446.1" /translation="MSAKLTDLQLLHELEPVVEKYLNRHLSMHKPWNPHDYIPWSDGK NYYALGGQDWDPDQSKLSDVAQVAMVQNLVTEDNLPSYHREIAMNMGMDGAWGQWVNR WTAEENRHGIALRDYLVVTRSVDPVELEKLRLEVVNRGFSPGQNHQGHYFAESLTDSV LYVSFQELATRISHRNTGKACNDPVADQLMAKISADENLHMIFYRDVSEAAFDLVPNQ AMKSLHLILSHFQMPGFQVPEFRRKAVVIAVGGVYDPRIHLDEVVMPVLKKWRIFERE DFTGEGAKLRDELALVIKDLELACDKFEVSKQRQLDREARTGKKVSAHELHKTAGKLA MSRR" CDS complement(919742..920383) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0848C" /product="Transcriptional regulator, AcrR family" /note="Mb0848c, -, len: 213 aa. Equivalent to Rv0825c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Conserved hypothetical protein, highly similar, but in part (between aa ~43-96) to fadD27|Rv0275c|MTV035.03 PUTATIVE FATTY-ACID-COA LIGASE from Mycobacterium tuberculosis (241 aa), FASTA scores: E(): 7.3e-09, (32.6% identity in 190 aa overlap). Also shows similarity with other proteins from Mycobacterium tuberculosis e.g. Rv0078|AL0214|MTV030_22 (201 aa), FASTA scores: opt:118, E(): 0.32, (34.5% identity in 113 aa overlap); etc. Protein product from Mb0848c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0848c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWJ9" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ9" /protein_id="SIT99447.1" /translation="MQTGQNRGRWSGVPLESRHALRRDNLVAAGVQLLGGAGGPALTV RAVCRHAGLTERYFYESFADREHFVRAVYDDVCTRAMATLTSAQTPREAVEQFVELMV DDPVRGRVLLLAPAVEPALTRSGAEWMPNFIELLQRKLSRIVDPVLQKLVATSLIGAL TGLFTAYLNGRLGATRKQFIDYCVNMLLSTAATYAPHRERGESEHSIPAGPHN" CDS 920464..921519 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0849" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0849, -, len: 351 aa. Equivalent to Rv0826, len: 351 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 351 aa overlap). Conserved hypothetical protein, similar to CAB94053.1|AL358672|SC7A12.06 hypothetical protein from Streptomyces coelicolor (300 aa); and NP_421372.1|NC_002696 hypothetical protein from Caulobacter crescentus (299 aa). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv1645c|Z85982|MTCY06H11.09 (351 aa), FASTA scores: opt: 1199, E(): 0, (57.5% identity in 299 aa overlap); Rv2237; Rv0276; etc. Mb0849 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018713" /db_xref="UniProtKB/TrEMBL:A0A1R3XWI6" /protein_id="SIT99448.1" /translation="MTQDTSATCPLTSTVQDSSPVAGQLGRPIGFRGLAGGCPVSPLG YESPPLPLGPDSLTWRYFGDWRGMLQGPWAGSMQNMHPQLGAAVEDHSTFFRGRWPRL LRSLYPIGGVVFDGDRAPVTGVQVRDYHITIKGVDGAGRRYHALNPDVFYWAHATFFV GTLHVAERFCGGLTEAQRRQLFDEHVQWYRMYGMSMRPVPATWEEFQDYWDHMCRNVL ENNFAARAVLDLTELPKPPFAQRVPDWLWAAPRKLLARFFVWLTVGLYDPPVRELMGY RWLRRDEWLHRRFGDIVQLVFALVPFRFRKHPRARAGWDRATGRIPADAPLVQTPARN LPPPDERDNPTHYCPKV" CDS complement(921571..921963) /codon_start=1 /transl_table=11 /gene="kmtr" /locus_tag="BQ2027_MB0850C" /product="metal sensor transcriptional regulator kmtr (arsr-smtb family)" /note="Mb0850c, -, len: 130 aa. Equivalent to Rv0827c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Probable transcriptional regulator, similar to many e.g. CAC42856.1|AL592292 putative regulatory protein from Streptomyces coelicolor (115 aa); NP_301626.1|NC_002677 putative ArsR-family transcriptional regulator from Mycobacterium leprae (140 aa); BSUB0011_75|O31844|Z99114 YOZA PROTEIN from Bacillus subtilis (107 aa), FASTA scores: opt: 208, E(): 3.2e-08, (35.5% identity in 93 aa overlap); etc. Also similar to MTCY27.22c|Z95208 from Mycobacterium tuberculosis (135 aa), FASTA scores: opt: 201, E(): 1.2e-07, (35.7% identity in 98 aa overlap). Contains probable helix-turn helix motif from aa 42-63 (Score 1300, +3.61 SD). Mb0850c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWJ7" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ7" /protein_id="SIT99449.1" /translation="MYADSGPDPLPDDQVCLVVEVFRMLADATRVQVLWSLADREMSV NELAEQVGKPAPSVSQHLAKLRMARLVRTRRDGTTIFYRLENEHVRQLVIDAVFNAEH AGPGIPRHHRAAGGLQSVAKASATKDVG" CDS complement(922021..922443) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0851C" /product="POSSIBLE DEAMINASE" /note="Mb0851c, -, len: 140 aa. Equivalent to Rv0828c, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Possible deaminase (EC 3.5.-.-), with its N-terminus highly similar to middle part of NP_302602.1|NC_002677 possible cytidine/deoxycytidylate deaminase from Mycobacterium leprae (171 aa). Also similar to other deaminases e.g. CAC18715.2|AL451182 putative deaminase from Streptomyces coelicolor (167 aa); NP_251189.1|NC_002516 probable deaminase from Pseudomonas aeruginosa (151 aa); NP_108387.1|NC_002678 nitrogen fixation protein gene from Mesorhizobium loti (149 aa); etc. Also similar to many conserved hypothetical proteins e.g. NP_389200.1|NC_000964 hypothetical protein from Bacillus subtilis (156 aa), FASTA scores: E(): 1.3e-07, (38.9% identity in 95 aa overlap); etc. And similar to Rv3752c possible deaminase from Mycobacterium tuberculosis. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. Mb0851c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWK4" /db_xref="InterPro:IPR002125" /db_xref="InterPro:IPR016192" /db_xref="InterPro:IPR016193" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK4" /protein_id="SIT99450.1" /translation="MPAGMAGFRRWAQTNDPTAHAESLAIRAACTKLGTEHLVGTTLN VLAHPCPMCYGSLYYCSPDEVVFLTSRDAYEPHYVDDRRYFEPATFYDEFAKEWQDRR LPMRQEHRPDIRAGAVDVYRFRQEPNGGERSAIAAPTG" CDS 922405..922695 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0852" /product="possible transposase (fragment)" /note="Mb0852, -, len: 96 aa. Equivalent to Rv0829, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 96 aa overlap). Possible transposase for IS1605' (fragment), similar to C-terminal end of many mycobacterial transposases and hypothetical proteins e.g. Z74024|MTCY274_16 from Mycobacterium tuberculosis (460 aa), FASTA scores: opt: 668, E(): 6.2e-32, (98.9% identity in 93 aa overlap); MTV002_57|O33333 TRANSPOSASE from Mycobacterium tuberculosis; L07627|SERRY1_1 insertion element IS1136 from Saccharopolyspora erythraea (90 aa), FASTA score: (34.9% identity in 83 aa overlap). Mb0852 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR010095" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ4" /protein_id="SIT99451.1" /translation="MGPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDDNAAINLAR YEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKGTGHPAGEQPRDGVLVA" mobile_element 922405..922692 /mobile_element_type="insertion sequence:IS1605" /locus_tag="BQ2027_IS1605'" /note="IS1605', len: 288 nt. Equivalent to IS1605', len: 288 nt, from Mycobacterium tuberculosis strain H37Rv,(99.7% identity in 288 nt overlap)." gene 922405..922692 /locus_tag="BQ2027_IS1605'" CDS 922800..923705 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0853" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0853, -, len: 301 aa. Equivalent to Rv0830, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Conserved hypothetical protein, member of Mycobacterium tuberculosis protein family consisting of the proteins Rv0726c, Rv0731c, Rv3399, Rv1729c|Z81360|MTCY4C12_14c (312 aa), FASTA scores: opt: 1014, E(): 0, (54.1% identity in 292 aa overlap); etc. Protein product from Mb0853 detected using SWATH mass spectrometry. Mb0853 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U163" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U163" /protein_id="SIT99452.1" /translation="MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAP LVRAVGMDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQFV ILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATERRTVAVDLRDD WATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNITALSAPGSRLAFEFVPDTA IFADERWRNYHNRMSELGFDIDLNELVYHGQRGHVLDYLTRDGWQTSALTVTQLYEAN GFAYPDDELATAFADLTYSSATLMR" CDS complement(923724..924539) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0854C" /product="conserved protein" /note="Mb0854c, -, len: 271 aa. Equivalent to Rv0831c, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 271 aa overlap). Conserved hypothetical protein, similar to Rv0347|MTY13E10_7|Z95324 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (328 aa), FASTA scores: opt: 426, E(): 2.6e-21, (33.6% identity in 262 aa overlap). Protein product from Mb0854c detected using shotgun mass spectrometry. Mb0854c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR026349" /db_xref="UniProtKB/TrEMBL:A0A1R3XXI9" /protein_id="SIT99453.1" /translation="MLPETNQDEVQPNAPVALVTVEIRHPTTDSLTESANRELKHLLI NDLPIERQAQDVSWGMTAPGGAPTPVADRFVRYVNRDNTTAASLKNQAIVVETTAYRS FEAFTDVVMRVVDARAQVSSIVGLERIGLRFVLEIRVPAGVDGRITWSNWIDEQLLGP QRFTPGGLVLTEWQGAAVYRELQPGKSLIVRYGPGMGQALDPNYHLRRITPAQTGPFF LLDIDSFWTPSGGSIPEYNRDALVSTFQDLYGPAQVVFQEMITSRLKDELLRQ" tRNA complement(924627..924699) /locus_tag="BQ2027_LYST" /product="tRNA-Lys" /note="lysT, len: 73 nt. Equivalent to lysT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Lys, anticodon ttt." tRNA 924823..924896 /locus_tag="BQ2027_GLUT" /product="tRNA-Glu" /note="gluT, len: 74 nt. Equivalent to gluT, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Glu, anticodon ttc." tRNA 924934..925007 /locus_tag="BQ2027_ASPT" /product="tRNA-Asp" /note="aspT, len: 74 nt. Equivalent to aspT, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Asp, anticodon gtc." tRNA 925037..925110 /locus_tag="BQ2027_PHEU" /product="tRNA-Phe" /note="pheU, len: 74 nt. Equivalent to pheU, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Phe, anticodon gaa." CDS 925781..926194 /codon_start=1 /transl_table=11 /gene="PE_PGRS12" /locus_tag="BQ2027_MB0855" /product="pe-pgrs family protein pe_pgrs12" /note="Mb0855, PE_PGRS12, len: 137 aa. Equivalent to Rv0832, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Member of the Mycobacterium tuberculosis PE family, possibly PGRS subfamily of gly-rich proteins, highly similar to many others e.g. MTCY1A11.25c|Z78020 (498 aa), FASTA scores: opt: 529, E(): 5.2e-22, (61.8% identity in 136 aa overlap); etc. Appears to have incurred frameshift as next ORF should be continuation; sequence has been checked but no error found. Mb0855 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM1" /protein_id="SIT99454.1" /translation="MSYVSVLPATLATAATEVARIGSALSLASAVAAAQTSAVQAAAA DEVSAAIAALFSAHGRDFQALSARAAAFHHEFVQALAAGAGSYAVAEIAAASPLQSLI DVFNAPIQAATGRPLIGNGANGQPGTGAPGGPAGG" CDS 926191..928512 /codon_start=1 /transl_table=11 /gene="PE_PGRS13" /locus_tag="BQ2027_MB0856" /product="pe-pgrs family protein pe_pgrs13" /note="Mb0856, PE_PGRS13, len: 773 aa. Equivalent to Rv0833, len: 749 aa, from Mycobacterium tuberculosis strain H37Rv, (). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, but lacking N-terminal domain (present in preceding ORF), possibly due to frameshift. Similar in part to many others e.g. MTCY28_25|Z95890 (914 aa), FASTA scores: opt: 2726, E(): 0, (60.1% identity in 773 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 84 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (773 aa versus 749 aa). Mb0856 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XWY6" /protein_id="SIT99455.1" /translation="MIGNGGAGGSGAPGAIGGAGGPAGLIGVGGAGGAGGDSAVAGVI GGAGGAGGAALLFGAGGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFASTG TGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGGTGGAGGLFASGGAGGAGGSGGT GGAGGTGGAGGLFGAGGAGGLGGQGNHTGGHGGAGGSAGLLALGDGGAGGAGGAATTG TGGAGGAGGKAGLLFGSGGAGGSGGAAGTFGDTGNSGGAGGAGGKAGLLFGSGGAGGS GGAGGFANGSTGGAGGAGGGAGLIGNGGNGGSGGTSVATGGAGNGGAGGAGGGAGLIG NGGNGGSGGMGDAPGGTGVGGIGGLLLGLDGANAPASTNPLHTAQQQALAAVNAPIQA VTGRPLIGNGANGAPGSGAPGGHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGA GGAGGAVTGTGATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGGPGGAGGLF NGGGAGGAGGSGVSGGAGGEGGAGGAGGLFAGGGIGGAGGFGGFRGGEGGAGGAGGLF AGGGAGGAGGSGNNVGGAGGAGGVGGLFGAGGAGGSGGGGSVAGDGGAGGNAGLLAPG LAGGAGGGGGQGFDTGGAGGPGGDAGLLVGSGGVGGAGGFGLTTGGPGAAGGDAGLLF GSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGGAGGPGGAAFGLGNGGN GGNGGTGTSAGSPGAGGAGGSLIGAEGLPGLLP" CDS complement(928739..931234) /codon_start=1 /transl_table=11 /gene="PE_PGRS14" /locus_tag="BQ2027_MB0857C" /product="pe-pgrs family protein pe_pgrs14" /note="Mb0857c, PE_PGRS14, len: 831 aa. Equivalent to Rv0834c, len: 882 aa, from Mycobacterium tuberculosis strain H37Rv. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many others e.g. MTCY493_4|Z95844 (1329 aa), FASTA scores: opt: 2577, E(): 0, (52.0% identity in 950 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, deletions of 143 bp and 9 bp (cgccgttgc-*), leads to a shorter product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (831 aa versus 882 aa). Mb0857c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK0" /protein_id="SIT99456.1" /translation="MSFVIAAPDLVAMATEDLAGIGASLTAANAAAAVPTSGLLAAAG DEVSAAIAALFSSHGQQYQAMSAQAAAFHARFVQALAGAMGAYAAAEAANASPLQTLE QGLLGAINAPAAALSGRPFIGNGTNGAPGTGEAGGPGGWLLGNGGNGGSGAPGQTGGA GGAAGLLGHGGTGGAGGTGASGGKGGTGGWLWGSGGAGGAGGSGGGSGGAGGNALMFG IGGNGGAGGAASGVGNGGVGGAGGAGGALVAIGGAGGAGGAATTGTGGAGGAGSNALG LFLGLGGSGGQGGDSAMGSGGAGGAGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGG AGGAGGSSGTVFALDLSWGGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAG GLGGAATGAGGTGGTGGVGAGGFAALGVGVGGAGGAGGAATETGGIGGAGGLGVGLLG GAGGAGGPGGAASAGSGGHGGTGGDALGLIGAGIGGVGGVGGAATDTGGNGGAGGSGT GLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGNAGTGVGVNGANGGNGG SATGALAAVGGAGAAGGDATSGTGGFGGAGGSARGLIFALGGAGAAGGDASTGVGGPG GPGGTGTASSPFGIAIAIGGAGAQGGAGTSGATGGAGGDGVFEGIAVLGLGFGGAAGA GGAATGDGATGGAGGFGGAGAGIANFLGFSVLHGGAGGAGGTATGTGGNGGAGGGGGL SSPVILGIGIGGAGGNGGDALGLVGVGGNGGNAGTGFGANTGGNGGDTTIVVNGMLAP STLGYGGNGVNGGAGGTGGKAGVFGAPGQNGLP" CDS 931702..932346 /codon_start=1 /transl_table=11 /gene="lpqQ" /locus_tag="BQ2027_MB0858" /product="POSSIBLE LIPOPROTEIN LPQQ" /note="Mb0858, lpqQ, len: 214 aa. Equivalent to Rv0835, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 214 aa overlap). Possible lpqQ, lipoprotein. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb0858 detected using shotgun mass spectrometry. Mb0858 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR026954" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK8" /protein_id="SIT99457.1" /translation="MCCSTAAKSAVIVCCAAIATTACSFQATSTQPSTAPPTSRVDSL IVSIEDVRRIANYEELAAHFQTDLREPPEADTNVPGPCRVVGSSDRTFGTDWSEFRSA GYHGVTDDLRPGGPVMVETVSQAIALYPDPSTARGVFHRLESSLAECAGLHDPYFDFI LGRPDASTVRIGAAGWSHVYRLKSSVFISVGVLGIEPAEPIANVILQTISDRIQ" CDS complement(932959..933681) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0859C" /product="HYPOTHETICAL PROTEIN" /note="Mb0859c, -, len: 240 aa. Equivalent to Rv0836c, len: 217 aa (start uncertain), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (a-g) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (240 aa versus 217 aa)." /db_xref="InterPro:IPR014513" /db_xref="UniProtKB/TrEMBL:A0A1R3XWJ6" /protein_id="SIT99458.1" /translation="MLVGAQCRDLLHWRFCRGVPPRATNDTDIAGTLNNWDHFEAIRA TFRALGSTGHRFLIADRAVDALPFGEVESPTGTTRHPPGNQLMNVHGCTDAYLRADVL PLPGGLTVHLPQPPNYAVLKLHAWLDRSADHDYKDGPDLALVVHWYAGDLDRLYAKPD QWALRRHDFDLRTAAAALLGHDMRASVSAPEAAVLATRATQADHDLLAQHFAVGRPGW PTTTASRRPLVEALLDQLTPGS" CDS complement(933752..934780) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0860C" /product="HYPOTHETICAL PROTEIN" /note="Mb0860c, -, len: 342 aa. Equivalent to Rv0837c, len: 342 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 342 aa overlap). Hypothetical unknown protein. TBparse score is 0.941." /db_xref="InterPro:IPR016600" /db_xref="InterPro:IPR019238" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99459.1" /translation="MDQIGADLAEAVERHLTEYGVRVLGGLSALNSAHPESLDLEIDA HPLTITALYLPHLSATAALQAWDTAGAGSPLLVVGPRLHPSSAETLRARGLWYIDGAG NAYLRHQGGLLIDVRGRRSAVSAQPGTLGDGLHSDGPRNPFTPKRAQVVCVLLDAPQL VDAPLRAIAASAGVSVGMAKETMDTLRTTGFFEHLGSRRRLVRTDELLDLWAAAYPGG LGRANKLLVASGDIHTWSAPDGLAVAVSGEQALPDEIRNPESLMLYVDTPAPGLPADL LIHNRWHRDPHGSIVIRKLFWRNLPDEQPGLAPTALIYADLLASREPRQVEVAHLMRR QDERLARL" CDS 935469..936245 /codon_start=1 /transl_table=11 /gene="lpqR" /locus_tag="BQ2027_MB0861" /product="PROBABLE CONSERVED LIPOPROTEIN LPQR" /note="Mb0861, lpqR, len: 258 aa. Equivalent to Rv0838, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 258 aa overlap). Probable lpqR, conserved lipoprotein. Similar (except in N-terminus) to hypothetical proteins and D-alanyl-D-alanine dipeptidases e.g. NP_416005.1|NC_000913 hypothetical protein from Escherichia coli strain K12 (193 aa); NP_421076.1|NC_002696 D-alanyl-D-alanine dipeptidase from Caulobacter crescentus (212 aa); Q06241|VANX_ENTFC D-ALANYL-D-ALANINE DIPEPTIDASE from Enterococcus faecium (202 aa), FASTA scores: opt: 198, E(): 1.9e-05, (28.1% identity in 199 aa overlap); etc. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 6bp insertion (*-cggccc) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (258 aa versus 256 aa). Mb0861 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWL7" /db_xref="InterPro:IPR000755" /db_xref="InterPro:IPR009045" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL7" /protein_id="SIT99460.1" /translation="MRLIGRLRLLMVGLVVICGACACDRVSAGRWSESPSATSWPVRP VNTTTPSGPVPPVSEAARAAGLVDVRGVVPDAAIDLRYATANNFTGTQLYPPGARCLV HESMAEGLAAAAAVLRPHGQVLVFWDCYRPHDVQVRMFDVVPNPAWVARPGKYAHSHE AGRSVDVTFASAQRQCPSVRRSGELCLADMGTDFDDFSSRATAFATQGVSAEAQANRA HLRAAMQAGGLTVYSGEWWHFDGPGPGAGVDRPILEVPVD" CDS 936332..937144 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0862" /product="SAM-dependent methyltransferase" /note="Mb0862, -, len: 270 aa. Equivalent to Rv0839, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 270 aa overlap). Conserved hypothetical protein, similar to various hypothetical proteins or methyltransferases from yeast and bacteria e.g. T34740|SC1E6.19c|AL033505|SC1E6_19 hypothetical protein from Streptomyces coelicolor (273 aa), FASTA scores: opt: 1102, E(): 0, (58.6% identity in 263 aa overlap); T38024|Z98598|SPAC1B3.06c hypothetical protein from Schizosaccharomyces pombe (278 aa), FASTA scores: opt: 562, E(): 1.9e-3, (36.4% identity in 269 aa overlap); JC6531 avermectin B 5-O-methyltransferase (EC 2.1.1.-) from Streptomyces avermitilis (283 aa); etc. Also similar to other Mycobacterium tuberculosis hypothetical proteins that may be methyltransferases e.g. Rv1523, Rv2952, Rv1405c, etc. Protein product from Mb0862 detected using SWATH mass spectrometry. Mb0862 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025714" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK5" /protein_id="SIT99461.1" /translation="MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVL DVGCGPGTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHKLD FPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGFIWFPKLPALD RWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTASVWCFATASAREWWGLVWA DRILQSDLAHQLVDSGLATAAQLEEISTAWREWAAAPDGWLAIPHGEILCRA" CDS complement(937212..938072) /codon_start=1 /transl_table=11 /gene="pip" /locus_tag="BQ2027_MB0863C" /product="PROBABLE PROLINE IMINOPEPTIDASE PIP (PROLYL AMINOPEPTIDASE) (PAP)" /note="Mb0863c, pip, len: 286 aa. Equivalent to Rv0840c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Possible pip, proline iminopeptidase (EC 3.4.11.5), similar to many e.g. P46541|PIP_BACCO PROLINE IMINOPEPTIDASE from BACILLUS COAGULANS (288 aa), FASTA scores: opt: 657, E(): 0, (37.6% identity in 282 aa overlap); NP_386922.1|NC_003047 PUTATIVE PROLINE IMINOPEPTIDASE PROTEIN from Sinorhizobium meliloti (296 aa); etc. BELONGS TO PEPTIDASE FAMILY S33. Protein product from Mb0863c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XYP0" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR002410" /db_xref="InterPro:IPR005945" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XYP0" /protein_id="SIT99462.1" /translation="MEGTIAVPGGRVWFQRIGGGPGRPLLVVHGGPGLPHNYLAPLRR LSDEREVIFWDQLGCGNSACPSDVDLWTMNRSVAEMATVAEALALTRFHIFSHSWGGM LAQQYVLDKAPDAVSLTIANSTASIPEFSASLVSLKSCLDVATRSAIDRHEAAGTTHS AEYQAAIRTWNETYLCRTRPWPRELTEAFANMGTEIFETMFGPSDFRIVGNVRDWDVV DRLADIAVPTLLVVGRFDECSPEHMREMQGRIAGSRLEFFESSSHMPFIEEPARFDRV MREFLRLHDI" CDS 938348..938590 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0864" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0864, -, len: 80 aa. Equivalent to Rv0841, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 80 aa overlap). Conserved transmembrane protein, highly similar to C-terminus of next ORF Rv0842|O53854 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis (442 aa), FASTA scores: opt: 246, E(): 3.3e-10, (59.7% identity in 72 aa overlap). Replace previous Rv0841c." /db_xref="GOA:A0A1R3XXJ9" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99463.1" /translation="MVAASIVHHSAAPANRGRYHGIWSMTPVFASVVVPIMASYGPIH GAHLLAAVVVGSAGAALCLPLARALRRPTPSAMTTD" CDS 938867..940159 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0865" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0865, -, len: 430 aa. Equivalent to Rv0842, len: 430 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 430 aa overlap). Probable conserved integral membrane protein, showing similarity with other integral membrane proteins e.g. P28246|BCR_ECOLI BICYCLOMYCIN RESISTANCE PROTEIN from EScherichia coli (396 aa), FASTA scores: opt: 216, E(): 5.4e-07, (23.7% identity in 376 aa overlap); etc." /db_xref="GOA:A0A1R3XWN0" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XWN0" /protein_id="SIT99464.1" /translation="MRYTGPERCSGDGQVRAAGDRYSTVIWLLGGNLLVRSAGFGYPF LAYHVAGRGHGAGAVGAVVAAYGLGWAVGQLLCGWLVDRVGARVTLVSTMLVAAAVLV LMAGLHTVPGLLVGAMIAGLVCDAPRPVLGAVIAELVADPQRRAQLDGWRYGWVLNIG AAITGGVGGVVAGWLDTPVLYWINGIGCAIFAGLAGRCIPADVCRRTESGLRACTAMS KVGYRQALSDKRLVLLAVSGLATLTTLMGFFAAVPMLMSASGLGVGAYGWVQLINALA VVAVTPLLTPWLSKQLALGPRPDILAGAGVWVTLCMAAAGLARTTVGFSVAAAACSPG EIAWFVVAAGIVHRIAPPAHGGRYHGIWSMAVAASSVAAPILAAFNLANGGRLVLAAT TVTVGFFGAALCLPLARVLAAASCGPLSSKEPSRDSYQ" CDS 940143..941147 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0866" /product="PROBABLE DEHYDROGENASE" /note="Mb0866, -, len: 334 aa. Equivalent to Rv0843, len: 334 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 334 aa overlap). Probable dehydrogenase (EC 1.-.-.-), similar to various dehydrogenases e.g. Q46142|Q46142 TPP-DEPENDENT ACETOIN DEHYDROGENASE (326 aa), FASTA scores: opt: 500, E(): 2.4e-26, (32.3% identity in 300 aa overlap); P51267|ODPA_PORPU PYRUVATE DEHYDROGENASE E1 COMPONENT from Porphyra purpurea (344 aa), FASTA scores: opt: 451, E(): 4.7e-23, (29.6% identity in 311 aa overlap); etc. Also similar to Rv2497c|pdhA pyruvate dehydrogenase E1 component from Mycobacterium tuberculosis (367 aa)." /db_xref="GOA:A0A1R3XWZ4" /db_xref="InterPro:IPR001017" /db_xref="InterPro:IPR017596" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ4" /protein_id="SIT99465.1" /translation="MTRTSEGLAAFVVDQLEELYRRMWVLRLLDMALEQLRIEGLING PLQGGFGQEAVSVGAAAALGEGDVIITTHRPHAQHVGTDAPLGPVIADMLGATAGDLE GADEDAHIADPRAGLPAAIRVVKQSPLLAIGHAYALWLRDTGRVTLCVTQDCDVDADA FNEAADLAAVWQLPVVILVENIRGALSVHLGRYTHEPRVYRRAVAYGMPGVSVDGNDV EAVRDCVANAVVRARAGGGPTLVQAITYRTTDFSGSDRGGYRDLAGSEQFLDPLIFAR RRLIAAGTTRGRLDEQERAACQQVADAVAFAKDRARPNGGGPISRPTSGWHQQPKTRF " CDS complement(941211..941861) /codon_start=1 /transl_table=11 /gene="narL" /locus_tag="BQ2027_MB0867C" /product="POSSIBLE NITRATE/NITRITE RESPONSE TRANSCRIPTIONAL REGULATORY PROTEIN NARL" /note="Mb0867c, narL, len: 216 aa. Equivalent to Rv0844c, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 216 aa overlap). Possible narL, nitrate/nitrite response regulator protein, similar to many e.g. CAB44989.1|AJ131854 NarL protein from Pseudomonas stutzeri (218 aa); CAA75536.1|Y15252 nitrate/nitrite regulatory protein from Pseudomonas aeruginosa (216 aa); PCC6803|D64005|SYCSLRG_24 NarL protein from Synechocystis sp. (209 aa), FASTA scores: opt: 438, E(): 1.5e-23, (34.6% identity in 208 aa overlap); etc. Also similar to unidentified regulator e.g. CAB76009.1|AL157916 putative two-component system response regulator from Streptomyces coelicolor (224 aa); etc. Contains probable helix-turn helix motif from aa 170-191 (Score 1124, +3.02 SD). Protein product from Mb0867c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0867c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWL1" /db_xref="InterPro:IPR000792" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR016032" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL1" /protein_id="SIT99466.1" /translation="MSNPQPEKVRVVVGDDHPLFREGVVRALSLSGSVNVVGEADDGA AALELIKAHLPDVALLDYRMPGMDGAQVPAAVRSYELPTRVLLISAHDEPAIVYQALQ QGAAGFLLKDSTRTEIVKAVLDCAKGRDVVAPSLVGGLAGEIRQRAAPVAPVLSARER EVLNRIACGQSIPAIAAELYVAPSTVKTHVQRLYEKLGVSDRAAAVAEAMRQRLLD" CDS 941945..943222 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0868" /product="POSSIBLE TWO COMPONENT SENSOR KINASE" /note="Mb0868, -, len: 425 aa. Equivalent to Rv0845, len: 425 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 425 aa overlap). Possible two-component sensor kinase (EC 2.7.-.-), with its C-terminus similar to C-terminal part of others e.g. NP_294951.1|NC_001263 two-component sensor histidine kinase from Deinococcus radiodurans (469 aa); CAC32293.1|AL583943 putative two component system histidine kinase from Streptomyces coelicolor (404 aa); NP_464546.1|NC_003210 protein similar to two-component sensor histidine kinase from Listeria monocytogenes (352 aa); BSUB0017_193|Z9912 two-component sensor kinase from Bacillus subtilis (360 aa), FASTA scores: opt: 275, E(): 1.6e-11, (30.3% identity in 234 aa overlap); etc. Protein product from Mb0868 detected using SWATH mass spectrometry. Mb0868 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWL8" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL8" /protein_id="SIT99467.1" /translation="MPSYGNLGRLGGRHEYGVLVAMTSSAELDRVRWAHQLRSYRIAS VLRIGVVGLMVAAMVVGTSRSEWPQQIVLIGVYAVAALWALLLAYSASRRFFALRRFR SMGRLEPFAFTAVDVLILTGFQLLSTDGIYPLLIMILLPVLVGLDVSTRRAAVVLACT LVGFAVAVLGDPVMLRAIGWPETIFRFALYAFLCATALMVVRIEERHTRSVAGLSALR EELLAQTMTASEVLQRRIAEAIHDGPLQDVLAARQELIELDAVTPGDERVGRALAGLQ SASERLRQATFELHPAVLEQVGLGPAVKQLAASTAQRSGIKISTDIDYPIRSGIDPIV FGVVRELLSNVVRHSGATTASVRLGITDEKCVLDVADDGVGVTGDTMARRLGEGHIGL ASHRARVDAAGGVLVFLATPRGTHVCVELPLKR" CDS complement(943435..944949) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0869C" /product="PROBABLE OXIDASE" /note="Mb0869c, -, len: 504 aa. Equivalent to Rv0846c, len: 504 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 504 aa overlap). Probable oxidase (EC 1.-.-.-), showing similarity with several oxidases, mainly L-ascorbate oxidases and copper resistance proteins A (precursors) e.g. P24792|ASO_CUCMA L-ASCORBATE OXIDASE PRECURSOR (ASCORBASE) (EC 1.10.3.3) from Cucurbita maxima (Pumpkin) (Winter squash) (579 aa), FASTA scores: opt: 423, E(): 5.8e-18, (28.4% identity in 493 aa overlap); AF010496|AF010496_32 potential multicopper oxidase from Rhodobacter capsulatus (491 aa), FASTA scores: opt: 490, E(): 2.7e-22, (28.8% identity in 510 aa overlap); 47452|PCOA_ECOLI COPPER RESISTANCE PROTEIN A PRECURSOR (BELONGS TO THE FAMILY OF MULTICOPPER OXIDASES) from Escherichia coli strain K12 (605 aa); etc. Contains PS00080 Multicopper oxidases signature 2 at C-terminus. SEEMS TO BELONG TO THE FAMILY OF MULTICOPPER OXIDASES. Protein product from Mb0869c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XWK6" /db_xref="InterPro:IPR001117" /db_xref="InterPro:IPR002355" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR008972" /db_xref="InterPro:IPR011706" /db_xref="InterPro:IPR011707" /db_xref="InterPro:IPR033138" /db_xref="InterPro:IPR034279" /db_xref="UniProtKB/TrEMBL:A0A1R3XWK6" /protein_id="SIT99468.1" /translation="MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGA AGMTAAIDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRATVG DEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRFSVPDPGTYWA HPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTDGIGKSPQQLYGELTDPNK PTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYYLINGRIPVAATSFKAKPGQRIRIRII NSAADTAFRIALAGHSMTVTHTDGYPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVA LAEGKNALARALLSTGAGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLP VTLGGTMAKYDWTINGEPHSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRLDYIL" CDS 945098..945490 /codon_start=1 /transl_table=11 /gene="lpqS" /locus_tag="BQ2027_MB0870" /product="PROBABLE LIPOPROTEIN LPQS" /note="Mb0870, lpqS, len: 130 aa. Equivalent to Rv0847, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 130 aa overlap). Probable lpqS, lipoprotein. Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb0870 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XWL6" /protein_id="SIT99469.1" /translation="MVWMRSAIVAVALGVTVAAVAAACWLPQLHRHVAHPNHPLTTSV GSEFVINTDHGHLVDNSMPPCPERLATAVLPRSATPVLLPDVVAAAPGMTAALTDPVA PAARGPPAAQGSVRTGQDLLTRFCLVRR" CDS 945693..946811 /codon_start=1 /transl_table=11 /gene="cysK2" /locus_tag="BQ2027_MB0871" /product="POSSIBLE CYSTEINE SYNTHASE A CYSK2 (O-ACETYLSERINE SULFHYDRYLASE) (O-ACETYLSERINE (THIOL)-LYASE) (CSASE)" /note="Mb0871, cysK2, len: 372 aa. Equivalent to Rv0848, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 372 aa overlap). Possible cysK2, cysteine synthase A (EC 4.2.99.8), but could be also a cysteine synthase B (EC 4.2.99.8) cysM2-product, similar to many e.g. NP_109408.1|NC_002682 cysteine synthase from Mesorhizobium loti (357 aa); Q44004|CYSM_ALCEU CYSTEINE SYNTHASE from Alcaligenes eutrophus strain CH34 (Ralstonia eutropha) (339 aa), FASTA scores: opt: 511, E(): 1.7e-25, (35.0% identity in 314 aa overlap); etc. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. COFACTOR: PYRIDOXAL PHOSPHATE. Note that previously known as cysM3. Protein product from Mb0871 detected using SWATH mass spectrometry. Mb0871 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWM7" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM7" /protein_id="SIT99470.1" /translation="MRSRQTRDRYRLLPEGYQVTPGRNRHPGTMVGNTPVLWIPELSG TSDPDRGFWAKLEGFNPGGMKDRPALYMVECARARGDIAPGAAIVESTSGTLGLGLAL AGKVYRHPVTLVTDPGLEPIIARMLTAYGAGVDMGTQPHPVGGWQQARKDRVAQLMAE YPGAWNPNQYGNPDNVGAYRSLALELVAQLGRIDVLVCSVGTGGHSAGVARVLREFNP DMRLIGVDTIGSTIFGQPASNRLMRGLGSSIYPRNVDYRAFDEVHWVAPPEAVWACRS LAATHYASGGWSVGAVALVAGWAARNLPADTTIAAVFPDGPQRYFDTIYNDAYCNEHE LLGGQPPTEPDEIASPLDAVVTRWTRSTTVIDPTQVVS" CDS 946811..948070 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0872" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb0872, -, len: 419 aa. Equivalent to Rv0849, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 419 aa overlap). Probable conserved integral membrane transport protein, possibly member of major facilitator superfamily (MFS) involved in transport of drug, showing similarity with others e.g. T35055 probable transport system permease protein from Streptomyces coelicolor (436 aa); NP_295031.1|NC_001263 major facilitator family protein from Deinococcus radiodurans (458 aa); NP_455659.1|NC_003198 putative membrane transporter from Salmonella enterica subsp. enterica serovar Typhi (402 aa); etc. Protein product from Mb0872 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XWL5" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL5" /protein_id="SIT99471.1" /translation="MGARAIFRGFNRPSRVLMINQFGINIGFYMLMPYLADYLAGPLG LAAWAVGLVMGVRNFSQQGMFFVGGTLADRFGYKPLIIAGCLIRTGGFALLVVAQSLP SVLIAAAATGFAGALFNPAVRGYLAAEAGERKIEAFAMFNVFYQSGILLGPLVGLVLL ALDFRITVLAAAGVFGLLTVAQLVALPQHRADSEREKTSILQDWRVVVRNRPFLTLAA AMTGCYALSFQIYLALPMQASILMPRNQYLLIAAMFAVSGLVAVGGQLRITRWFAVRW GAERSLVVGATILAASFIPVAVIPNGQRFGVAVAVMALVLSASLLAVASAALFPFEMR AVVALSGDRLVATHYGFYSTIVGVGVLVGNLAIGSLMSAARRLNTDEIVWGGLILVGI VAVAGLRRLDTFTSGSQNMTGRWAAPR" mobile_element 948066..948396 /mobile_element_type="insertion sequence:IS1606" /locus_tag="BQ2027_IS1606'" /note="IS1606', len: 331 nt. Equivalent to IS1606', len: 331 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 331 nt overlap)." gene 948066..948396 /locus_tag="BQ2027_IS1606'" CDS 948067..948399 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0873" /product="PUTATIVE TRANSPOSASE (FRAGMENT)" /note="Mb0873, -, len: 110 aa. Equivalent to Rv0850, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Putative transposase (fragment), similar in part to others e.g. Q45144|Q4514 TRANSPOSABLE ELEMENT IS31831 (436 aa), FASTA scores: opt: 175, E(): 4.3e-05, (38.6% identity in 57 aa overlap); etc." /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ0" /protein_id="SIT99472.1" /translation="MTRDPHSPDCGREGSYRDTITRPLTDLPVAGYPLVPRVASPRYR CTTPQCGRAVFNQDLANVDQYLVVNQLAHQLIDGSSLIPDADKRWDARRHADMTHHLT SSLKENQS" CDS complement(948396..949223) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0874C" /product="probable short-chain type dehydrogenase/reductase" /note="Mb0874c, -, len: 275 aa. Equivalent to Rv0851c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 275 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to many e.g. Q01198|LIGD_PSEPA C ALPHA-DEHYDROGENASE (EC 1.1.1.-)(SDR FAMILY) from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (305 aa); D11473|PSELIG_1 C alpha-dehydrogenase from P. paucimobilis (305 aa), FASTA scores: opt: 468, E(): 4.9e-23, (30.8% identity in 279 aa overlap); NP_421969.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (278 aa); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Mb0874c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXK9" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK9" /protein_id="SIT99473.1" /translation="MDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLR QAVNHLRAEGFDVHGVMCDVRHREEVTHLADEAFRLLGHFDVVFSNAGIVVGGPIVEM THDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGLVPNAGLGAYGVAK YGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANSERIRGAACAQSSTTGSPGPLP LQDDNLGVDDIAQLTADAILANRLYVLPHAASRASIRRRFERIDRTFDEQAAEGWRH" CDS 949314..950150 /codon_start=1 /transl_table=11 /gene="fadD16" /locus_tag="BQ2027_MB0875" /product="POSSIBLE FATTY-ACID-COA LIGASE FADD16 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb0875, fadD16, len: 278 aa. Equivalent to Rv0852, len: 278 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 278 aa overlap). Possible fadD16, fatty-acid-CoA synthetase (EC 6.2.1.-), similar in part to various CoA ligases e.g. P18163|LCFB_RAT LONG-CHAIN-FATTY-ACID--COA LIGASE from Rattus norvegicus (Rat) (699 aa); D49366|LEP4CCOALA_1 4-coumarate:CoA ligase from Lithospermum erythrorhizon (636 aa), FASTA scores: opt: 134, E(): 0.15, (26.8% identity in 213 aa overlap); orgp|L09229|HUMFACAL_1 long-chain acyl-coenzyme A from homo sapiens (human) (699 aa), FASTA score: (50.0% identity in 40 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. Protein product from Mb0875 detected using SWATH mass spectrometry. Mb0875 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWN9" /db_xref="UniProtKB/TrEMBL:A0A1R3XWN9" /protein_id="SIT99474.1" /translation="MFTIGYSCASRGADSWLIRRCSVVQGCLDDPGATVEAIDDDGWP HTGDPCSPNSAASGKYGERPASVSTGDIHSLVIASDYRVPDPGRVWPLLQRNKSALAD IGAHHVLIYASTHDSGRVLVMIGVRSREPIVELLRSRVFFDWFDAMGVDDIPAVFAGE IVDRFVAAPTTTQSTPRVPGVVVAAFASVNNVSNLTAEVRSAIARFTAAGIRKTWVFQ AFDDAHEVLILQEFADEAGARQWIEHPDAAAEWMSGAGVGAYPPLFVGRFFDMMRIEA LQ" CDS complement(950191..951873) /codon_start=1 /transl_table=11 /gene="pdc" /locus_tag="BQ2027_MB0876C" /product="PROBABLE PYRUVATE OR INDOLE-3-PYRUVATE DECARBOXYLASE PDC" /note="Mb0876c, pdc, len: 560 aa. Equivalent to Rv0853c, len: 560 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 560 aa overlap). Probable pdc, pyruvate or indole-pyruvate decarboxylase (EC 4.1.1.-), equivalent to NP_302424.1|NC_002677 pyruvate (or indolepyruvate) decarboxylase from Mycobacterium leprae (569 aa). Also highly similar to others e.g. AAB06571.1|L80006 indolepyruvate decarboxylase from Pantoea agglomerans (550 aa); Q12629|DCPY_KLULA PYRUVATE DECARBOXYLASE (EC 4.1.1.1) from Kluyveromyces marxianus var. lactis (563 aa); P71323 INDOLEPYRUVATE DECARBOXYLASE (EC 4.1.1.74) from Enterobacter herbicola (550 aa), FASTA scores: opt: 1642, E(): 0, (48.1% identity in 547 aa overlap); P23234|DCIP_ENTCL INDOLE-3-PYRUVATE DECARBOXYLASE (INDOLEPYRUVATE DECARBOXYLASE) from Enterobacter cloacae (552 aa), FASTA scores: opt: 1596, E(): 0, (46.8% identity in 551 aa overlap); etc. Contains PS00187 Thiamine pyrophosphate enzymes signature and PS00017 ATP/GTP-binding site motif A (P-loop). COFACTOR: THIAMINE PYROPHOSPHATE. Protein product from Mb0876c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0876c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U140" /db_xref="InterPro:IPR000399" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR012000" /db_xref="InterPro:IPR012001" /db_xref="InterPro:IPR012110" /db_xref="InterPro:IPR029035" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/Swiss-Prot:Q7U140" /protein_id="SIT99475.1" /translation="MTPQKSDACSDPVYTVGDYLLDRLAELGVSEIFGVPGDYNLQFL DHIVAHPTIRWVGSANELNAGYAADGYGRLRGMSAVVTTFGVGELSVTNAIAGSYAEH VPVVHIVGGPTKDAQGTRRALHHSLGDGDFEHFLRISREITCAQANLMPATAGREIDR VLSEVREQKRPGYILLSSDVARFPTEPPAAPLPRYPGGTSPRALSLFTKAAIELIADH QLTVLADLLVHRLQAVKELEALLAADVVPHATLMWGKSLLDESSPNFLGIYAGAASAE RVRAAIEGAPVLVTAGVVFTDMVSGFFSQRIDPARTIDIGQYQSSVADQVFAPLEMSA ALQALATILTGRGISSPPVVPPPAEPPPAMPARDEPLTQQMVWDRVCSALTPGNVVLA DQGTSFYGMADHRLPQGVTFIGQPLWGSIGYTLPAAVGAAVAHPDRRTVLLIGDGAAQ LTVQELGTFSREGLSPVIVVVNNDGYTVERAIHGETAPYNDIVSWNWTELPSALGVTN HLAFRAQTYGQLDDALTVAAARRDRMVLVEVVLPRLEIPRLLGQLVGSMAPQ" CDS 951938..952381 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0877" /product="Cyclase/Dehydrase" /note="Mb0877, -, len: 147 aa. Equivalent to Rv0854, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Conserved hypothetical protein, similar to several hypothetical protein from Mycobacterium leprae e.g. NP_301674.1|NC_002677 (144 aa); NP_302683.1|NC_002677|Z95398|MLCL622.27c (156 aa), FASTA scores: opt: 193, E(): 1.6e-06, (24.6% identity in 134 aa overlap); NP_301218.1|NC_002677 (146 aa); MTCI28.04|Z97050 (184 aa), FASTA scores: opt: 171, E(): 5.8e-05, (21.5% identity in 135 aa overlap). Also similar to SC6G10.02c|T35511|AL049497|SC6G10_2 hypothetical protein from Streptomyces coelicolor (144 aa), FASTA scores: opt: 344, E(): 6.1e- 17, (37.6% identity in 141 aa overlap). And similar to many proteins from Mycobacterium tuberculosis e.g. downstreams ORFs Rv0856 and Rv0857, etc. Protein product from Mb0877 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0877 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM3" /protein_id="SIT99476.1" /translation="MAIKESRDIVIEASPEEILDVIADFEAMTEWSPAHQSVEILETG DDGRPSKVKMKVKTAGITDEQVVAYSWTDRSVRWTLVSSTQQRSQDGKYELTPKGDNT LVQFEITVDPQVPLPGFVLKRAIKGTIDTATEALRSQVLKVKKGQ" CDS 952387..953466 /codon_start=1 /transl_table=11 /gene="far" /locus_tag="BQ2027_MB0878" /product="PROBABLE FATTY-ACID-COA RACEMASE FAR" /note="Mb0878, far, len: 359 aa. Equivalent to Rv0855, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable far, fatty acid-CoA racemase (EC 5.1.-.-), highly similar to CAB08122.1|Z94723 unknown protein from Mycobacterium leprae (253 aa) (C-terminus shorter). Also similar to many eukaryotic and bacteria racemases e.g. T35425 probable fatty acid CoA racemase from Streptomyces coelicolor (387 aa); P70473|AMAC_RAT ALPHA-METHYLACYL-COA RACEMASE (2-METHYLACYL-COA RACEMASE) (2-ARYLPROPIONYL-COA EPIMERASE) (EC 5.1.99.4) from Rattus norvegicus (Rat) (382 aa); NP_103687.1|NC_002678 probable fatty acid Co-A racemase from Mesorhizobium loti (389 aa); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Rv1143|MTCI65.10|MCR from Mycobacterium tuberculosis (360 aa), FASTA scores: opt: 1373, E(): 0, (56.8% identity in 359 aa overlap), Rv1866|MTCY359.07 (C-terminal half) (778 aa), Rv3272 (360 aa). Protein product from Mb0878 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0878 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWM8" /db_xref="InterPro:IPR003673" /db_xref="InterPro:IPR023606" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM8" /protein_id="SIT99477.1" /translation="MTTGGPLAGVKVIELGGIGPGPHAGMVLADLGADVVRVRRPGGL TMPSEDRDLLHRGKRIVDLDVKTQPQAMLELAAKADVLLDCFRPGTCERLGIGPDDCA SVNPRLIFARITGWGQDGPLASTAGHDINYLSQTGALAAFGYADRPPMPPLNLVADFG GGSMLVLLGIVVALYERERSGVGQVVDAAMVDGVSVLAQMMWTMKGIGSLRDQRESFL LDGGAPFYRCYETSDGKYMAVGAIEPQFFAALLSGLGLSAADVPTQLDVAGYPQMYDI FAERFASRTRDEWTRVFAGTDACVTPVLAWSEAANNDHLKARSTVITAHGVQQAAPAP RFSRTPAGPVRPPPAAATPIDEINW" CDS 953579..953983 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0879" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0879, -, len: 134 aa. Equivalent to Rv0856, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Conserved hypothetical protein, showing weak similarity with NP_301674.1| (NC_002677) conserved hypothetical protein from Mycobacterium leprae (144 aa); and SC6G10.02c|T35511 hypothetical protein from Streptomyces coelicolor (144 aa). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. neighbouring ORF downstream Rv0857 CONSERVED HYPOTHETICAL PROTEIN (126 aa), FASTA scores: E(): 7.4e-27, (62.0% identity in 100 aa overlap); neighbouring ORF Rv0854|MTV043_47 CONSERVED HYPOTHETICAL PROTEIN (147 aa), FASTA scores: E(): 1.6e-15, (36.6% identity in 123 aa overlap), MTCI28.04|Z97050|MTCI28_4 (184 aa), FASTA scores: opt: 127, E(): 0.036, (26.0% identity in 127 aa overlap); and MLCL622.27c|Z95398 (156 aa), FASTA scores: opt: 123, E(): 0.06, (26.4% identity in 125 aa overlap). Protein product from Mb0879 detected using SWATH mass spectrometry. Mb0879 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005031" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XWL4" /protein_id="SIT99478.1" /translation="MEALADVGVLASWSPLHKQVEVIDYYPDGRPHHVRATVKILGLV DKEVLEYHWGPDWVCWDADQTFQQHGQHIEYTVKPEGVDRARVRFDITVEPAGPIPGF IVKRASEHVLDAAAKGLQKLIAGAGDQGNAKS" CDS 954011..954484 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0880" /product="link to Polyketide_cyclase/dehydratase." /note="Mb0880, -, len: 157 aa. Equivalent to Rv0857, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 157 aa overlap). Conserved hypothetical protein, showing weak similarity with SC6G10.02|T35511 hypothetical protein from Streptomyces coelicolor (144 aa). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. upstream ORF Rv0856 (134 aa), FASTA scores: E(): 9.6e-28, (61.6% identity in 99 aa overlap); upstream ORF Rv0854 (147 aa), FASTA scores: E(): 2.8e-18, (42.7% identity in 117 aa overlap); MTCI28.04|Z97050 (184 aa), FASTA scores: opt: 122, E(): 0.031, (29.4% identity in 85 aa overlap); and MLCL622.27c|Z95398 (156 aa), FASTA scores: opt: 114, E(): 0.1, (30.9% identity in 55 aa overlap). Protein product from Mb0880 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0880 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM9" /protein_id="SIT99479.1" /translation="MIANLVAVAIRASREVVIEAPPEVIVEALADMDAVPSWSSVHKR VEVVDTYSDGRPHHVKVTIKVAGIVDTELLEYHWGPDWVVWDAAKTAQQHGQHGEYNL RREDNDKTRVRFTLTVEPSAPLPAFWVNIARKKILHAATEGLRKQVVGRRRFTSG" CDS complement(954481..955674) /codon_start=1 /transl_table=11 /gene="dapc" /locus_tag="BQ2027_MB0881C" /product="probable n-succinyldiaminopimelate aminotransferase dapc (dap-at)" /note="Mb0881c, -, len: 397 aa. Equivalent to Rv0858c, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 397 aa overlap). Probable aminotransferase (EC 2.6.1.-), highly similar to others from Eukaryota and bacteria, especially aspartate aminotransferases (transaminases) (EC 2.6.1.1), e.g. NP_177890.1|NC_003070 putative aminotransferase from Arabidopsis thaliana (440 aa); NP_419555.1|NC_002696 aminotransferase class I from Caulobacter crescentus (385 aa); NP_415133.1|NC_000913|AE0001|ECAE000165_8 putative aminotransferase from Escherichia coli strain K12 (386 aa), FASTA scores: opt: 830, E(): 0, (38.0% identity in 389 aa overlap); X99521|TAX99521_1 aspartate aminotransferase from Thermus aquaticus (383 aa), FASTA scores: opt: 702, E(): 0, (34.9% identity in 393 aa overlap); etc. Also similar to other putative aminotransferases from Mycobacterium tuberculosis e.g. Rv2294, Rv3565, etc. Protein product from Mb0881c detected using SWATH mass spectrometry. Mb0881c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWP0" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3XWP0" /protein_id="SIT99480.1" /translation="MTVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQ AAQDAIAGGVNQYPPGPGSAPLRRAIAAQRRRHFGVDYDPETEVLVTVGATEAIAAAV LGLVEPGSEVLLIEPFYDSYSPVVAMAGAHRVTVPLVPDGRGFALDADALRRAVTPRT RALIINSPHNPTGAVLSATELAAIAEIAVAANLVVITDEVYEHLVFDHARHLPLAGFD GMAERTITISSAAKMFNCTGWKIGWACGPAELIAGVRAAKQYLSYVGGAPFQPAVALA LDTEDAWVAALRNSLRARRDRLAAGLTEIGFAVHDSYGTYFLCADPRPLGYDDSTEFC AALPEKVGVAAIPMSAFCDPAAGQASQQADVWNHLVRFTFCKRDDTLDEAIRRLSVLA ERPAT" CDS 955831..957042 /codon_start=1 /transl_table=11 /gene="fadA" /locus_tag="BQ2027_MB0882" /product="POSSIBLE ACYL-COA THIOLASE FADA" /note="Mb0882, fadA, len: 403 aa. Equivalent to Rv0859, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 403 aa overlap). Possible fadA, acyl-CoA thiolase (EC 2.3.1.-), equivalent to NP_302423.1|NC_002677 putative beta-ketoadipyl CoA thiolase from Mycobacterium leprae (403 aa). Also highly similar to acyl/acetyl-CoA thiolases and beta-ketoadipyl CoA thiolases, e.g. T35428 probable acetyl CoA acetyltransferase (thiolase) from Streptomyces coelicolor (404 aa); NP_250427.1|NC_002516 probable acyl-CoA thiolase from Pseudomonas aeruginosa (401 aa); NP_106253.1|NC_002678 probable acyl-CoA thiolase from Mesorhizobium loti (402 aa); NP_248919.1|NC_002516|PcaF beta-ketoadipyl CoA thiolase PcaF from Pseudomonas aeruginosa (401 aa); etc. Contains PS00098 Thiolases acyl-enzyme intermediate signature, PS00737 Thiolases signature 2 and PS00099 Thiolases active site. Protein product from Mb0882 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0882 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWM6" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020610" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020615" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM6" /protein_id="SIT99481.1" /translation="MSEEAFIYEAIRTPRGKQKNGSLHEVKPLSLVVGLIDELRKRHP DLDENLISDVILGCVSPVGDQGGDIARAAVLASGMPVTSGGVQLNRFCASGLEAVNTA AQKVRSGWDDLVLAGGVESMSRVPMGSDGGAMGLDPATNYDVMFVPQGIGADLIATIE GFSREDVDAYALRSQQKAAEAWSGGYFAKSVVPVRDQNGLLILDHDEHMRPDTTKEGL AKLKPAFEGLAALGGFDDVALQKYHWVEKINHVHTGGNSSGIVDGAALVMIGSAAAGK LQGLTPRARIVATATSGADPVIMLTGPTPATRKVLDRAGLTVDDIDLFELNEAFASVV LKFQKDLNIPDEKLNVNGGAIAMGHPLGATGAMILGTMVDELERRNARRALITLCIGG GMGVATIIERV" CDS 957047..959209 /codon_start=1 /transl_table=11 /gene="fadB" /locus_tag="BQ2027_MB0883" /product="PROBABLE FATTY OXIDATION PROTEIN FADB" /note="Mb0883, fadB, len: 720 aa. Equivalent to Rv0860, len: 720 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 720 aa overlap). Probable fadB, fatty oxidation protein, equivalent to NP_302422.1|NC_002677 putative fatty oxidation complex alpha subunit from Mycobacterium leprae (714 aa). Also highly similar to others and various proteins involved in fatty acid metabolism, e.g. T35429 probable fatty oxidation protein from Streptomyces coelicolor (733 aa); NP_250428.1|NC_002516 probable 3-hydroxyacyl-CoA dehydrogenase from Pseudomonas aeruginosa (714 aa); NP_418895.1|NC_002696 fatty oxidation complex alpha subunit from Caulobacter crescentus (709 aa); P40939|ECHA_HUMAN TRIFUNCTIONAL ENZYME ALPHA SUBUNIT [INCLUDES: LONG-CHAIN ENOYL-COA HYDRATASE (EC 4.2.1.17); LONG CHAIN 3-HYDROXYACYL-COA DEHYDROGENASE (EC 1.1.1.35)] from Homo sapiens (763 aa), FASTA scores: opt: 1176, E(): 0, (32.4% identity in 722 aa overlap); P21177|FADB_ECOLI FATTY OXIDATION COMPLEX ALPHA SUBUNIT [INCLUDES: ENOYL-COA HYDRATASE (EC 4.2.1.17); DELTA(3)-CIS-DELTA(2)-TRANS-ENOYL-COA ISOMERASE (EC 5.3.3.8); 3-HYDROXYACYL-COA DEHYDROGENASE (EC 1.1.1.35); 3-HYDROXYBUTYRYL-COA EPIMERASE (EC 5.1.2.3)] from Escherichia coli strain K12 (729 aa), FASTA scores: opt: 873, E(): 0, (33.6% identity in 693 aa overlap); etc. Protein product from Mb0883 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0883 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYQ9" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR006108" /db_xref="InterPro:IPR006176" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99482.1" /translation="MPDNTIQWDKDADGIVTLTMDDPSGSTNVMNEAYIESMGKAVDR LVAEKDSITGVVVASAKKTFFAGGDVKTMIQARPEDAGDVFNTVETIKRQLRTLETLG KPVVAAINGAALGGGLEIALACHHRIAADVKGSQLGLPEVTLGLLPGGGGVTRTVRMF GIQNAFVSVLAQGTRFKPAKAKEIGLVDELVATVEELVPAAKAWIKEELKANPDGAGV QPWDKKGYKMPGGTPSSPGLAAILPSFPSNLRKQLKGAPMPAPRAILAAAVEGAQVDF DTASRIESRYFASLVTGQVAKNMMQAFFFDLQAINAGGSRPEGIGKTPIKRIGVLGAG MMGAGIAYVSAKAGYEVVLKDVSLEAAAKGKGYSEKLEAKALERGRTTQERSDALLAR ITPTADAADFKGVDFVIEAVFENQELKHKVFGEIEDIVEPNAILGSNTSTLPITGLAT GVKRQEDFIGIHFFSPVDKMPLVEIIKGEKTSDEALARVFDYTLAIGKTPIVVNDSRG FFTSRVIGTFVNEALAMLGEGVEPASIEQAGSQAGYPAPPLQLSDELNLELMHKIAVA TRKGVEDAGGTYQPHPAEAVVEKMIELGRSGRLKGAGFYEYADGKRSGLWPGLRETFK SGSSQPPLQDMIDRMLFAEALETQKCLDEGVLTSTADANIGSIMGIGFPPWTGGSAQF IVGYSGPAGTGKAAFVARTRELAAAYGDRFLPPESLLS" CDS complement(959277..960905) /codon_start=1 /transl_table=11 /gene="ercc3" /locus_tag="BQ2027_MB0884C" /product="dna helicase ercc3" /note="Mb0884c, -, len: 542 aa. Equivalent to Rv0861c, len: 542 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 542 aa overlap). Probable DNA helicase (EC 3.6.1.-), equivalent to NP_302420.1|NC_002677 probable DNA helicase from Mycobacterium leprae (549 aa). Also highly similar to others (shorter than several eukaryotic enzymes) e.g. NP_218820.1|NC_000919|AE001217|AE0 01217_6 putative DNA repair helicase from Treponema pallidum (606 aa), FASTA scores: opt: 1275, E(): 0, (47.5% identity in 592 aa overlap); Q00578|RA25_YEAST DNA REPAIR HELICASE from Saccharomyces cerevisiae (843 aa), FASTA scores: opt: 777, E(): 0, (30.4% identity in 605 aa overlap); P49135|XPB_MOUSE DNA-REPAIR PROTEIN COMPLEMENTING XP-B CELLS from Mus musculus (Mouse) (783 aa), FASTA scores: opt: 761, E(): 0, (36.3% identity in 375 aa overlap); etc. SEEMS TO BELONG TO THE HELICASE FAMILY. Protein product from Mb0884c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0884c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXL8" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR006935" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR032438" /db_xref="InterPro:IPR032830" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL8" /protein_id="SIT99483.1" /translation="MQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPL ALWNARAAGHDAEQVVDALVSYSRYAVPQPLLVDIVDTMARYGRLQLVKNPAHGLTLV SLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGWPAEDLAGYVD GEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGAGKTLVGAAAMAKAGATTL ILVTNIVAARQWKRELVARTSLTENEIGEFSGERKEIRPATISTYQMITRRTKGEYRH LELFDSRDWGLIIYDEVHLLPAPVFRMTADLQSKRRLGLTATLIREDGREGDVFSLIG PKRYDAPWKDIEAQGWIAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAV VKSILAKHPDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEVAT LVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRILRPKADGGGAIFYSVVARD SLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI" CDS complement(961043..963313) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0885C" /product="Putative DNA-binding protein" /note="Mb0885c, -, len: 756 aa. Equivalent to Rv0862c, len: 756 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 756 aa overlap). Conserved hypothetical protein, equivalent to NP_302419.1|NC_002677 possible DNA-binding protein from Mycobacterium leprae (753 aa); and highly similar (except in C-terminus) to MLCB57.01|Z99494|T45333 hypothetical protein from Mycobacterium leprae (>577 aa, truncated), FASTA scores: opt: 3047, E(): 0, (78.9% identity in 578 aa overlap). Also similar in part to SCD12A.03c|AB93395.1|AL357524 hypothetical protein from Streptomyces coelicolor (867 aa). Protein product from Mb0885c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0885c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR032830" /db_xref="UniProtKB/TrEMBL:A0A1R3XWQ0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99484.1" /translation="MTEHTPDIPLGSWLAALPDERLTQLLELRPDLAQPPPGSIAALA ARAQARQSVKAATDELDFLRLAVFDALLVLQADTAPVPIVRLLAVIGDRAAQADVLGA LADLKQRALAWGETAVRVATDAGTALPWHPGQVTLEGSSRSGDQLADLIAGLDPAQRD VLDKLLQGSPVGRTRDAAPGAPSDRPVPRLLAMGLLRRIDAETVILPRHVGQVLRGEQ PGPMELTAPDPVVSTTTPDDADAAAAGAVIDLLREVDVLLENLGATPVAELRSGGLGV REFKRLAKATGIDEPRLGLILEIAAAAGLIASGMPDPEPPHSDGPFWAPTVAADRFAT MSPAERWHLLASAWLDLPGRPALIGTRGPDAKPYGALSDSLFSTAAPLDRRLLLGMLA ELPAGAGVDASRASATLIWRRPRWARRLQPAPIADLLTEGHALGLVGRGAISTPARAL LDEALEPATAPAAAVGVMARALPKPIDHFLVQADLTVVVPGPLQRELADDLTTVATVE SAGTAMVYRVSEQSIRHALDVGKSRDWLQEFFANRSKTPVPQGLTYLIDDVARRHGQL RIGMAASFVRCEDPTLLAQVVAAPEADGLALRALAPTVAVSPAPISEVLVTLRGAGFA PAAEDSTGAVVDVRTRGARVPTPQRRRPYRPPPRPNSEALKAVVAVLREVTAAPFANV RVDPAVTMSLLQRAAKDQATLVISYLDAAGVATQRVVAPITLRGGQLVAFDSSSGRLR DFAIHRITSVVSAHDR" CDS 963300..963581 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0886" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0886, -, len: 93 aa. Equivalent to Rv0863, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Conserved hypothetical protein, highly similar to NP_302418.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (74 aa). Also weakly similar in part to U82598|ECU82598_135 HYPOTHETICAL PROTEIN from Escherichia coli, FASTA scores: (32.4% identity in 71 aa overlap); and M74011|YEPYSCOP_8 HYPOTHETICAL PROTEIN from Yersinia enterocolitica (165 aa), FASTA scores: (38.6 identity in 57 aa overlap). Protein product from Mb0886 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0886 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XX11" /protein_id="SIT99485.1" /translation="MCSVIADQRRPDQPCGVGGCKTCQNGFVADIAEGKARKTRYVDH GWPTTDPDDHAVSELVTDRTGALSPFGELTFPVPSDDLPYIHPVTVINR" CDS 963665..964093 /codon_start=1 /transl_table=11 /gene="moaC2" /locus_tag="BQ2027_MB0888" /product="probable molybdenum cofactor biosynthesis protein c 2 moac2" /note="Mb0888, moaC2, len: 142 aa. Similar to Rv0864, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Probable moaC2, molybdopterin cofactor biosynthesis protein, highly similar to others e.g. CAB59676.1|AL132674 molybdenum cofactor biosynthesis protein from Streptomyces coelicolor (170 aa); NP_418834.1|NC_002696 molybdenum cofactor biosynthesis protein C from Caulobacter crescentus (186 aa); Y10817|ANY10817_3|T44852 molybdopterin co-factor synthesis protein moaC from Arthrobacter nicotinovorans plasmid pAO1 (169 aa), FASTA scores: opt: 491, E(): 2.4e-29, (51.0% identity in 151 aa overlap); etc. Also highly similar to O05788|MOAC1|Rv3111|MTCY164.21 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C from Mycobacterium tuberculosis (170 aa), FASTA scores: opt: 491, E(): 2.4e-29, (54.9% identity in 153 aa overlap); and O53376|Rv3324c|MOAC3|MTV016.24c PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C3 (177 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) leads to a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb0888 detected using SWATH mass spectrometry. Mb0888 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5K7" /db_xref="InterPro:IPR002820" /db_xref="InterPro:IPR023045" /db_xref="InterPro:IPR036522" /db_xref="UniProtKB/Swiss-Prot:P0A5K7" /protein_id="SIT99486.1" /translation="MVDITEKATTKRTAVAAGILRTSAQVVALISTGGLPKGDALATA RVAGIMAAKRTSDLIPLCHQLALTGVDVDFTVGQLDIEITATVRSTDRTGVEMEALTA VSVAALTLYDMIKAVDPGALIDDIRVLHKEGGRRGTWTRR" CDS 964090..964572 /codon_start=1 /transl_table=11 /gene="mog" /locus_tag="BQ2027_MB0889" /product="probable molybdopterin biosynthesis mog protein" /note="Mb0889, mog, len: 160 aa. Equivalent to Rv0865, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Probable mog, molybdopterin biosynthesis MOG protein, highly similar or similar to other molybdenum cofactor biosynthesis proteins e.g. CAB59675.1|AL132674 molybdenum cofactor biosynthesis protein from Streptomyces coelicolor (179 aa); NP_301253.1|NC_002677 putative molybdenum cofactor biosynthesis protein from Mycobacterium leprae (181 aa); CAC39235.1|AJ312124 Mog protein from Eubacterium acidaminophilum (162 aa); P44645|MOG_HAEIN|MOGA|HI0336 MOLYBDOPTERIN BIOSYNTHESIS MOG PROTEIN from Haemophilus influenzae (197 aa), FASTA scores: opt: 306, E(): 9e-13, (39.6% identity in 139 aa overlap); P28694|MOG_ECOLI MOLYBDOPTERIN BIOSYNTHESIS MOG PROTEIN from Escherichia coli (195 aa), FASTA scores: opt: 265, E(): 3.6e-10, (34.2 identity in 146 aa overlap); etc. Also highly similar to Rv0984|MTV044.12|MOAB2 POSSIBLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis (181 aa). Protein product from Mb0889 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0889 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001453" /db_xref="InterPro:IPR036425" /db_xref="UniProtKB/TrEMBL:A0A1R3XWN7" /protein_id="SIT99487.1" /translation="MSTRSARIVVVSSRAAAGVYTDDCGPIIAGWLEQHGFSSVQPQV VADGNPVGEALHDAVNAGVDVIITSGGTGISPTDTTPEHTVAVLDYVIPGLADAIRRS GLPKVPTSVLSRGVCGVAGRTLIINLPGSPGGVRDGLGVLADVLDHALEQIAGGDHPR " CDS 964569..964994 /codon_start=1 /transl_table=11 /gene="moaE2" /locus_tag="BQ2027_MB0890" /product="probable molybdenum cofactor biosynthesis protein e2 moae2 (molybdopterin converting factor large subunit) (molybdopterin [mpt] converting factor, subunit 2)" /note="Mb0890, moaE2, len: 141 aa. Equivalent to Rv0866, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Probable moaE2, molybdopterin converting factor E (molybdopterin converting factor (subunit 2)), similar to others e.g. Y10817|ANY10817_4|T44853 molybdopterin biosynthesis protein E chain from Arthrobacter nicotinovorans plasmid pAO1 (155 aa), FASTA scores: opt: 460, E(): 3.5e-27, (49.3 identity in 146 aa overlap); CAC01331.1|AL390968 moaE-like protein from Streptomyces coelicolor (152 aa); NP_389313.1|NC_000964 molybdopterin converting factor (subunit 2) from Bacillus subtilis (157 aa); etc. Also highly similar to Rv3119|MOAE1|Z95150|MTCY164_30 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN E from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 321, E(): 5.9e-17, (40.9% identity in 132 aa overlap); and O53375|GPHA|Rv3323c|MTV016.23c MOAD-MOAE FUSION PROTEIN from Mycobacterium tuberculosis (221 aa). Protein product from Mb0890 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0890 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWM2" /db_xref="InterPro:IPR003448" /db_xref="InterPro:IPR036563" /db_xref="UniProtKB/TrEMBL:A0A1R3XWM2" /protein_id="SIT99488.1" /translation="MTQVLRAALTDQPIFLAEHEELVSHRSAGAIVGFVGMIRDRDGG RGVLRLEYSAHPSAAQVLADLVAEVAEESSGVRAVAASHRIGVLQVGEAALVAAVAAD HRRAAFGTCAHLVETIKARLPVWKHQFFEDGTDEWVGSV" CDS complement(965012..965995) /codon_start=1 /transl_table=11 /gene="rpfA" /locus_tag="BQ2027_MB0891C" /product="POSSIBLE RESUSCITATION-PROMOTING FACTOR RPFA" /note="Mb0891c, rpfA, len: 327 aa. Equivalent to Rv0867c, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (). Possible rpfA, resuscitation-promoting factor (see citation below). N-terminus highly similar to N-terminal part (1-125 aa) of Z99494|MLCB57_3|NP_302417.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (174 aa), FASTA scores: opt: 785, E(): 1.8e-18, (63.0% identity in 200 aa overlap); and highly similar to C-terminus of NP_301299.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (375 aa); and middle part of NP_302360.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (157 aa). N-terminus also highly similar in part of three secreted proteins from Streptomyces coelicolor e.g. CAC09538.1|AL442120 putative secreted protein (244 aa). Regions highly similar to CAB76321.1|AL158060 putative membrane protein from Streptomyces coelicolor (121 aa); and middle part of CAB09664.1|Z96935 rpf from Micrococcus luteus (220 aa). Also highly similar in part to four resuscitation-promoting factors from Mycobacterium tuberculosis: Rv2450 (172 aa), Rv1009 (362 aa), Rv1884c (176 aa), and Rv2389c (154 aa). Contains a probable secretory signal sequence in N-terminus. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 240 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (327 aa versus 407 aa). Mb0891c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010618" /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3XWP1" /protein_id="SIT99489.1" /translation="MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEW DQVARCESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLASREQQIAVGERV LATQGRGAWPVCGRGLSNATPREVLPASAAMDAPLDAAAVNGEPAPLAPPPADLAPPA PADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPAD LAPPAPAELAPPAPADLAPPAAVNEQTAPGDQPATAPGGPVGLATDLELPEPDPQPAD APPPGDVTEAPAETPQVSNIAYTKKLWQAIRAQDVCGNDALDSLAQPYVIG" CDS complement(966443..966721) /codon_start=1 /transl_table=11 /gene="moaD2" /locus_tag="BQ2027_MB0892C" /product="probable molybdenum cofactor biosynthesis protein d 2 moad2 (molybdopterin converting factor small subunit) (molybdopterin [mpt] converting factor, subunit 1)" /note="Mb0892c, moaD2, len: 92 aa. Equivalent to Rv0868c, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Probable moaD2, molybdenum cofactor biosynthesis protein (molybdopterin converting factor (subunit 1)), similar to CAB88494.1|AL353816 putative molybdopterin converting factor from Streptomyces coelicolor (84 aa); and weakly similar to others MoaD proteins e.g. Z99111|BSUB0008_103 from Bacillus subtilis (77 aa), FASTA scores: opt: 86, E(): 2.8, (22.9% identity in 83 aa overlap); etc. Also some similarity with Rv3112|MOAD1|MTCY164.22 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D from Mycobacterium tuberculosis (83 aa), FASTA scores: opt: 113, E(): 0.024, (31.3% identity in 83 aa overlap). Protein product from Mb0892c detected using SWATH mass spectrometry. Mb0892c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003749" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR016155" /db_xref="UniProtKB/TrEMBL:A0A1R3XWP9" /protein_id="SIT99490.1" /translation="MTQVSDESAGIQVTVRYFAAARAAAGAGSEKVTLRSGATVAELI DGLSVRDVRLATVLSRCSYLRDGIVVRDDAVALSAGDTIDVLPPFAGG" CDS complement(966725..967807) /codon_start=1 /transl_table=11 /gene="moaA2" /locus_tag="BQ2027_MB0893C" /product="probable molybdenum cofactor biosynthesis protein a2 moaa2" /note="Mb0893c, moaA2, len: 360 aa. Equivalent to Rv0869c, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 360 aa overlap). Probable moaA2, molybdenum cofactor biosynthesis protein, highly similar to others e.g. CAB59437.1|AL132644|SCI8_6 molybdenum cofactor biosynthesis protein A from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1336, E(): 0, (61.7% identity in 332 aa overlap); S57490|X78980|ANMOAA_1 molybdopterin cofactor synthesis protein from Arthrobacter nicotinovorans (fragment) (374 aa), FASTA scores: opt: 1059, E(): 0, (49.9% identity in 369 aa overlap); Q44118|MOAA_ARTNI PROBABLE MOLYBDOPTERIN COFACTOR SYNTHESIS PROTEIN A from Arthrobacter nicotinovorans plasmid pAO1 (355 aa); etc. Also similar to Rv3109|MTCY164.19|Z95150|MOAA1 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A from Mycobacterium tuberculosis (359 aa), FASTA scores: opt: 657, E(): 0, (36.6% identity in 309 aa overlap). BELONGS TO THE MOAA / NIFB / PQQE FAMILY. Mb0893c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65385" /db_xref="InterPro:IPR000385" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR010505" /db_xref="InterPro:IPR013483" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:P65385" /protein_id="SIT99491.1" /translation="MTLTALGMPALRSRTNGIADPRVVPTTGPLVDTFGRVANDLRVS LTDRCNLRCSYCMPERGLRWLPGEQLLRPDELARLIHIAVTRLGVTSVRFTGGEPLLA HHLDEVVAATARLRPRPEISLTTNGVGLARRAGALAEAGLDRVNVSLDSIDRAHFAAI TRRDRLAHVLAGLAAAKAAGLTPVKVNAVLDPTTGREDVVDLLRFCLERGYQLRVIEQ MPLDAGHSWRRNIALSADDVLAALRPHFRLRPDPAPRGSAPAELWLVDAGPNTPRGRF GVIASVSHAFCSTCDRTRLTADGQIRSCLFSTEETDLRRLLRGGADDDAIEAAWRAAM WSKPAGHGINAPDFIQPDRPMSAIGG" CDS complement(967804..968193) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0894C" /product="POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0894c, -, len: 129 aa. Equivalent to Rv0870c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 129 aa overlap). Possible conserved integral membrane protein, highly similar to other membrane proteins: putative secreted proteins or hypothetical proteins e.g. CAC08263.1| AL392146 putative integral membrane protein from Streptomyces coelicolor (138 aa); NP_233433.1|NC_002506 conserved hypothetical protein from Vibrio cholerae (143 aa); NP_455572.1|NC_003198 putative membrane protein from Salmonella enterica subsp. enterica serovar Typhi (148 aa); P37065|YCCF_ECOLI HYPOTHETICAL 16.3 KD PROTEIN from Escherichia coli (148 aa), FASTA scores: opt: 183, E(): 1.9e-06, (36.6% identity in 134 aa overlap); etc. Protein product from Mb0894c detected using SWATH mass spectrometry. Mb0894c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYS0" /db_xref="InterPro:IPR005185" /db_xref="InterPro:IPR031308" /db_xref="UniProtKB/TrEMBL:A0A1R3XYS0" /protein_id="SIT99492.1" /translation="MRLILNVIWLVFGGLWLALGYLLASLVCFLLIITIPFGFAALRI ASYALWPFGRTIVEKPTAGTGALISNVIWVLLFGIWLALGHLVSAAAMAVTIIGIPLA LANLKLIPVSLVPLGKDIVGVNSQVPT" CDS 968358..968765 /codon_start=1 /transl_table=11 /gene="cspB" /locus_tag="BQ2027_MB0895" /product="PROBABLE COLD SHOCK-LIKE PROTEIN B CSPB" /note="Mb0895, cspB, len: 135 aa. Equivalent to Rv0871, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Probable cspB, cold shock-like protein B, equivalent to Z99494|MLCB57_7|MLCB57.11 probable cold shock protein from Mycobacterium leprae (136 aa), FASTA scores: opt: 787, E(): 0, (86.0% identity in 136 aa overlap). Also highly similar (but often longer than) to others e.g. CAB93399.1|AL357524 cold shock protein B from Streptomyces coelicolor (127 aa); Q45099|CSPD_BACCE COLD SHOCK-LIKE PROTEIN CSPD from Bacillus cereus (66 aa); Y101 81|LLCSPB_1 cold shock protein from Lactococcus lactis (66 aa), FASTA scores: opt: 220, E(): 2.5e-07, (48.3% identity in 60 aa overlap); etc. SEEMS TO BELONG TO THE COLD-SHOCK DOMAIN (CSD) FAMILY. Protein product from Mb0895 detected using shotgun mass spectrometry. Mb0895 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXM9" /db_xref="InterPro:IPR002059" /db_xref="InterPro:IPR011129" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99493.1" /translation="MPTGKVKWYDPDKGFGFLSQEGGEDVYVRSSALPTGVEALKAGQ RVEFGIASGRRGPQALSLRLIEPPPSLSRPRREPAAEHKHSPDELHGMVEDMITLLES TVQPELRKGRYPDRKTARRVAEVVRAVAREFES" CDS complement(968884..970710) /codon_start=1 /transl_table=11 /gene="PE_PGRS15" /locus_tag="BQ2027_MB0896C" /product="pe-pgrs family protein pe_pgrs15" /note="Mb0896c, PE_PGRS15, len: 608 aa. Equivalent to Rv0872c, len: 606 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 606 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see first citation below), similar to many e.g. MTCY24A1.04c|Z95207 (615 aa), FASTA scores: opt: 2636, E(): 0, (64.6% identity in 619 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp in-frame deletion and a 9 bp in-frame insertion leads to a lightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (608 aa versus 606 aa). Mb0896c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XWQ7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99494.1" /translation="MSYVLATPEMVAAAANNLAQIGSTLSAANAAALAPTTGVLAAGA DEVSAAVASLFSGHAQAYQTLGTQAAAFHERFIQALSTAAGAYGSAEAANASPLQQAL NVINAPTQTLLGRPLIGNGTNGAPGTGQAGGPGGLLYGNGGNGGSGGVGQAGGAGGSA GLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTGVAGVNGGMGAAGGAGGNA YLFGSGGAGGQGGMGAAGADGVNPTPTGTADAGSTGTDQTLGGNAIGGNGGPGDAGDA MTSGGAGGSGGNAVSTVNGDAVGGEGGKGGEGAYGGAGGAGGSAASIGNAAIGGNGGA GGNAQAPGGVGGAGGEGGDAQVGTNSPSNAEAGNGGSGGNGFDSFASGGTGGAGGTGG AGGRGGLLIGDGGAGGAGGVGGTGGSGAPGGGGAGGDGGAANTDSAGSSRKAFGGDGG VGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGGAGGVGGTAGAGGTGGSGGAGGAG GAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGGKGGSGGLIGRAGSDGGVGAGGA GGAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPGSAGQPGQPG" CDS 970971..972923 /codon_start=1 /transl_table=11 /gene="fadE10" /locus_tag="BQ2027_MB0897" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE10" /note="Mb0897, fadE10, len: 650 aa. Equivalent to Rv0873, len: 650 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 650 aa overlap). Probable fadE10, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAB91129.1|AL355913 putative acyl CoA dehydrogenase from Streptomyces coelicolor (658 aa); P50544|ACDV_MOUSE ACYL-COA DEHYDROGENASE from Mus musculus (656 aa); D30647|RATVLCAD_1 very-long-chain Acyl-CoA dehydrogenase from Rattus norvegicus (655 aa), FASTA scores: opt: 675, E(): 0, (33.9% identity in 380 aa overlap); etc. Protein product from Mb0897 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0897 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63430" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/Swiss-Prot:P63430" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99495.1" /translation="MAQQTQVTEEQARALAEESRESGWDKPSFAKELFLGRFPLGLIH PFPKPSDAEEARTEAFLVKLREFLDTVDGSVIERAAQIPDEYVKGLAELGCFGLKIPS EYGGLNMSQVAYNRVLMMVTTVHSSLGALLSAHQSIGVPEPLKLAGTAEQKRRFLPRC AAGAISAFLLTEPDVGSDPARMASTATPIDDGQAYELEGVKLWTTNGVVADLLVVMAR VPRSEGHRGGISAFVVEADSPGITVERRNKFMGLRGIENGVTRLHRVRVPKDNLIGRE GDGLKIALTTLNAGRLSLPAIATGVAKQALKIAREWSVERVQWGKPVGQHEAVASKIS FIAATNYALDAVVELSSQMADEGRNDIRIEAALAKLWSSEMACLVGDELLQIRGGRGY ETAESLAARGERAVPVEQMVRDLRINRIFEGSSEIMRLLIAREAVDAHLTAAGDLANP KADLRQKAAAAAGASGFYAKWLPKLVFGEGQLPTTYREFGALATHLRFVERSSRKLAR NTFYGMARWQASLEKKQGFLGRIVDIGAELFAISAACVRAEAQRTADPVEGEQAYELA EAFCQQATLRVEALFDALWSNTDSIDVRLANDVLEGRYTWLEQGILDQSEGTGPWIAS WEPGPSTEANLARRFLTVSPSSEAKL" CDS complement(973012..974172) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0898C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0898c, -, len: 386 aa. Equivalent to Rv0874c, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 386 aa overlap). Conserved hypothetical protein, highly similar in part to SPU62616_1 hypothetical protein from Synechococcus sp. (280 aa), FASTA scores: E(): 6.3e-26, (35.2% identity in 264 aa overlap); SYCSLLLH_102 from Synechocystis sp. (447 aa), FASTA scores: E(): 1.1e-18, (29.5% identity in 400 aa overlap). Also highly similar to Rv0628c|MTCY20H10_9 from Mycobacterium tuberculosis (383 aa), FASTA scores: E():0, (81.5% identity in 383 aa overlap). Mb0898c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5D4" /db_xref="InterPro:IPR013702" /db_xref="InterPro:IPR016741" /db_xref="InterPro:IPR019494" /db_xref="UniProtKB/Swiss-Prot:P0A5D4" /protein_id="SIT99496.1" /translation="MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAH TDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDF VRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGR RRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGG RPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGS IEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMF GVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALFVDDME" CDS complement(974272..974760) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0899C" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb0899c, -, len: 162 aa. Equivalent to Rv0875c, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Possible conserved exported protein, equivalent to MLCB57_11|O33056 possible exported protein from Mycobacterium leprae (162 aa), FASTA scores: opt: 789, E(): 0, (71.4% identity in 161 aa overlap). Protein product from Mb0899c detected using SWATH mass spectrometry. Mb0899c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024495" /db_xref="UniProtKB/Swiss-Prot:P64732" /protein_id="SIT99497.1" /translation="MKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHG HLTRVGPYLYCNVVDLDDCQTPQAQGELPVSERYPVQLSVPEVISRAPWRLLQVYQDP ANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRDVPHAEWSVRL IF" CDS complement(974757..976403) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0900C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0900c, -, len: 548 aa. Equivalent to Rv0876c, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 548 aa overlap). Possible conserved transmembrane protein, equivalent to MLCB57_12|O33057 possible membrane protein from Mycobacterium leprae (579 aa), FASTA scores: opt: 2850, E(): 0, (81.0% identity in 568 aa overlap). Also highly similar (except in N-terminus) to CAB93403.1|AL357524 putative integral membrane protein from Streptomyces coelicolor (463 aa). Protein product from Mb0900c detected using SWATH mass spectrometry. Mb0900c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWN3" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XWN3" /protein_id="SIT99498.1" /translation="MAPTPGRRTRNGSVNGHPGMANYPPGDANYRRSRRPPPMPSANR YLPPLGEQPEPERSRVPPRTTRAGERITVTRAAAMRSREMGSRMYLLVHRAATADGAD KSGLTALTWPVMANFAVDSAMAVALANTLFFAAASGESKSRVALYLLITIAPFAVIAP LIGPALDRLQHGRRVALALSFGLRTALAVVLIMNYDGATGSFPSWVLYPCALAMMVFS KSFSVLRSAVTPRVMPPTIDLVRVNSRLTVFGLLGGTIAGGAIAAGVEFVCTHLFQLP GALFVVVAITIAGASLSMRIPRWVEVTSGEVPATLSYHRDRGRLRRRWPEEVKNLGGT LRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHEANGWVQLGMLGLIGAAAAVG NFAGNFTSARLQLGRPAVLVVRCTVLVTVLAIAAAVAGSLAATAIATLITAGSSAIAK ASLDASLQHDLPEESRASGFGRSESTLQLAWVLGGAVGVLVYTELWVGFTAVSALLIL GLAQTIVSFRGDSLIPGLGGNRPVMAEQETTRRGAAVAPQ" CDS 976541..977329 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0901" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0901, -, len: 262 aa. Equivalent to Rv0877, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Conserved hypothetical protein, equivalent to MLCB57_13|O33058 conserved hypothetical protein from Mycobacterium leprae (269 aa), FASTA scores: E(): 0, (80.5% identity in 257 aa overlap). Also highly similar (except in C-terminus) to SCD12A.13|CAB93404.1|AL357524 hypothetical protein from Streptomyces coelicolor (308 aa). Protein product from Mb0901 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0901 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021391" /db_xref="UniProtKB/Swiss-Prot:P64734" /protein_id="SIT99499.1" /translation="MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAV GDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALL APDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVM SAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSAD GHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPAES" CDS complement(977352..978668) /codon_start=1 /transl_table=11 /gene="PPE13" /locus_tag="BQ2027_MB0902C" /product="ppe family protein ppe13" /note="Mb0902c, PPE13, len: 438 aa. Equivalent to Rv0878c, len: 443 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 438 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. P4261|YHS6_MYCTU (517 aa), FASTA scores: opt: 1044, E(): 0, (47.4% identity in 397 aa overlap); MTV014_3, MTCI65_2, MTCY98_24, MTCY3C7_23, MTCY48_17, MTV004_5, MTV004_3, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (a-*) leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (438 aa versus 443 aa). Mb0902c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XWQ8" /protein_id="SIT99500.1" /translation="MNFMVLPPEVNSARIYAGAGPAPMLAAAVAWDGLAAELGMAAAS FSLLISGLTAGPGSAWQGPAAAAMAAAAAPYLSWLNAATARAEGAAAGAKAAAAVYEA ARAATAHPALVAANRNQLLSLVLSNLFGQNLPAIAATEASYEQLWAQDVAAMVGYHGG ASTVASQLTPWQQLLSVLPPVVTAAPAGAVGVPAALAIPALGVENIGVGNFLGIGNIG NNNVGSGNTGDYNFGIGNIGNANLGNGNIGNANLGSGNAGFFNFGNGNDGNTNFGSGN AGFLNIGSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDLNTGIGSPVTQGVAN SGFGNTGTGHSGFFNSGNSGSGFQNLGNGSSGFGNASDTSSGFQNAGTALTRASSTWA DSPRAWPIRAPSRLQVWRTRATTARECSIRVIISRVSSTGAPPQKK" CDS complement(978946..979221) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0903C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0903c, -, len: 91 aa. Equivalent to Rv0879c, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 91 aa overlap). Possible conserved transmembrane protein, C-terminus highly similar to C-terminal part of MLCB57_14|O33059 conserved hypothetical protein from Mycobacterium leprae (91 aa), FASTA scores: E(): 1.2e-25, (76.9% identity in 91 aa overlap). Mb0903c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64736" /db_xref="InterPro:IPR019681" /db_xref="UniProtKB/Swiss-Prot:P64736" /protein_id="SIT99501.1" /translation="MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPG LASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQAGLETYLDPK" CDS 979399..979830 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0904" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY MARR-FAMILY)" /note="Mb0904, -, len: 143 aa. Equivalent to Rv0880, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Possible transcriptional regulator, MarR family, equivalent to MLCB57_15|O3306|NP_302411.1|NC_002677 putative MarR-family protein from Mycobacterium leprae (143 aa), FASTA scores: opt: 818, E(): 0, (89.5% identity in 143 aa overlap). Also similar to many others e.g. CAB93410.1|AL357524 putative marR-family protein from Streptomyces coelicolor (145 aa); NP_251757.1|NC_002516 probable transcriptional regulator from Pseudomonas aeruginosa (147 aa); etc. Also similar to Rv2327 from Mycobacterium tuberculosis (163 aa). Protein product from Mb0904 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0904 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67746" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR023187" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67746" /protein_id="SIT99502.1" /translation="MLDSDARLASDLSLAVMRLSRQLRFRNPSSPVSLSQLSALTTLA NEGAMTPGALAIRERVRPPSMTRVIASLADMGFVDRAPHPIDGRQVLVSVSESGAELV KAARRARQEWLAERLATLNRSERDILRSAADLMLALVDESP" CDS 979827..980693 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0905" /product="POSSIBLE RRNA METHYLTRANSFERASE (RRNA METHYLASE)" /note="Mb0905, -, len: 288 aa. Equivalent to Rv0881, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 288 aa overlap). Possible rRNA methyltransferase (EC 2.1.1.-), highly similar to others and hypothetical proteins e.g. CAB76071.1|AL157953 putative rRNA methylase from Streptomyces coelicolor (272 aa); NP_421117.1|NC_002696 spoU rRNA methylase family protein from Caulobacter crescentus (268 aa); D90913_93|P74261 rRNA METHYLASE from Synechocystis sp. (274 aa), FASTA scores: E(): 1.1e-13, (26.3% identity in 278 aa overlap); P18644|TSNR_STRCN rRNA METHYLTRANSFERASE (EC 2.1.1.66) from Streptomyces cyaneus (Streptomyces curacoi) (269 aa), FASTA scores: E(): 3.7e-08, (23.9% identity in 268 aa overlap); etc. Equivalent to AAK45146.1 from Mycobacterium tuberculosis strain CDC1551 (242 aa) but longer 46 aa. Mb0905 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59968" /db_xref="InterPro:IPR001537" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="InterPro:IPR029064" /db_xref="UniProtKB/Swiss-Prot:P59968" /protein_id="SIT99503.1" /translation="MTEGRCAQHPDGLDVQDVCDPDDPRLDDFRDLNSIDRRPDLPTG KALVIAEGVLVVQRMLASRFTPLALFGTDRRLAELKDDLAGVGAPYYRASADVMARVI GFHLNRGVLAAARRVPEPSVAQVVAGARTVAVLEGVNDHENLGSIFRNAAGLSVDAVV FGTGCADPLYRRAVRVSMGHALLVPYARAADWPTELMTLKESGFRLLAMTPHGNACKL PEAIAAVSHERIALLVGAEGPGLTAAALRISDVRVRIPMSRGTDSLNVATAAALAFYE RTRSGHHIGPGT" CDS 980690..980974 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0906" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb0906, -, len: 94 aa. Equivalent to Rv0882, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). Probable transmembrane protein. Mb0906 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64738" /db_xref="InterPro:IPR024244" /db_xref="UniProtKB/Swiss-Prot:P64738" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99504.1" /translation="MNDQRDQAVPWATGLAVAGFVAAVIAVAVVVLSLGLIRVHPLLA VGLNIVAVSGLAPTLWGWRRTPVLRWFVLGAAVGVAGAWLALLALTLGDG" CDS complement(980971..981732) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0907C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0907c, -, len: 253 aa. Equivalent to Rv0883c, len: 253 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 253 aa overlap). Conserved hypothetical protein, equivalent to O3306|MLCB57_16 CONSERVED HYPOTHETICAL PROTEI from Mycobacterium leprae (251 aa), FASTA scores: E(): 0, (79.4% identity in 253 aa overlap). Also highly similar to N_terminus of AL009204|SC9B10_22 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (352 aa), FASTA scores: E(): 6.1e-20, (35.0% identity in 246 aa overlap). Protein product from Mb0907c detected using SWATH mass spectrometry. Mb0907c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021421" /db_xref="UniProtKB/Swiss-Prot:P64740" /protein_id="SIT99505.1" /translation="MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSV QPEQAQLDIEVTNVLSPKEIQARIRAGASVEQVAAASGSDIARIRRFAHPVLLERSRA AELATAAHPVLADGPAVLTMQETVAAALVARGLNPDSLTWDAWRNEDSRWTVQLAWKA GRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLRPLAPVAHLDFDEPEPAQP TLTVPSAQPVSNRRGKPAIPAWEDVLLGVRSGGRR" CDS complement(981889..983019) /codon_start=1 /transl_table=11 /gene="serC" /locus_tag="BQ2027_MB0908C" /product="POSSIBLE PHOSPHOSERINE AMINOTRANSFERASE SERC (PSAT)" /note="Mb0908c, serC, len: 376 aa. Equivalent to Rv0884c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Possible serC, phosphoserine aminotransferase (EC 2.6.1.52), equivalent to MLCB57_17 putative phosphoserine aminotransferase from Mycobacterium leprae (376 aa), FASTA scores: E(): 0, (87.5 identity in 376 aa overlap). Also highly similar to CAC08322.1|AL392149 putative aminotransferase from Streptomyces coelicolor (363 aa); and similar to other phosphoserine aminotransferases e.g. NP_386837.1|NC_003047 PUTATIVE PHOSPHOSERINE AMINOTRANSFERASE PROTEIN from Sinorhizobium meliloti (392 aa); P52878|SERC_METBA PHOSPHOSERINE AMINOTRANSFERASE from Methanosarcina barkeri (370 aa); P10658|SERC_RABIT|RABEPIP_1 PHOSPHOSERINE AMINOTRANSFERASE from Rabbit (370 aa), FASTA scores: opt: 271, E(): 3.5e-11, (24.5% identity in 368 aa overlap); etc. BELONGS TO CLASS-V OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb0908c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0908c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63515" /db_xref="InterPro:IPR000192" /db_xref="InterPro:IPR006272" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR022278" /db_xref="UniProtKB/Swiss-Prot:P63515" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99506.1" /translation="MADQLTPHLEIPTAIKPRDGRFGSGPSKVRLEQLQTLTTTAAAL FGTSHRQAPVKNLVGRVRSGLAELFSLPDGYEVILGNGGATAFWDAAAFGLIDKRSLH LTYGEFSAKFASAVSKNPFVGEPIIITSDPGSAPEPQTDPSVDVIAWAHNETSTGVAV AVRRPEGSDDALVVIDATSGAGGLPVDIAETDAYYFAPQKNFASDGGLWLAIMSPAAL SRIEAIAATGRWVPDFLSLPIAVENSLKNQTYNTPAIATLALLAEQIDWLVGNGGLDW AVKRTADSSQRLYSWAQERPYTTPFVTDPGLRSQVVGTIDFVDDVDAGTVAKILRANG IVDTEPYRKLGRNQLRVAMFPAVEPDDVSALTECVDWVVERL" CDS 983227..984249 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0909" /product="Putative transmembrane protein" /note="Mb0909, -, len: 340 aa. Equivalent to Rv0885, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 340 aa overlap). Conserved hypothetical protein, equivalent to O33063|MLCB57_18 possible transmembrane protein from Mycobacterium leprae (341 aa), FASTA score: (83.9% identity in 341 aa overlap). Also similar except in C-terminus to T35630 probable membrane protein from Streptomyces coelicolor (312 aa). Protein product from Mb0909 detected using SWATH mass spectrometry. Mb0909 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5D6" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR025859" /db_xref="UniProtKB/Swiss-Prot:P0A5D6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99507.1" /translation="MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDI DWESPEFAVTDNDPRWILPATDPLGRHPWYQAQSRERQIEIGMWRQANVAKVGLHFES ILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRVGADVPGLPRRLRWV SPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSLHPIMERVMSIHVAEEARH ISFAHEYLRKRLPRLTRMQRFWISLYFPLTMRSLCNAIVVPPKAFWEEFDIPREVKKE LFFGSPESRKWLCDMFADARMLAHDTGLMNPIARLVWRLCKIDGKPSRYRSEPQRQHL AAAPAA" CDS 984268..985995 /codon_start=1 /transl_table=11 /gene="fprB" /locus_tag="BQ2027_MB0910" /product="PROBABLE NADPH:ADRENODOXIN OXIDOREDUCTASE FPRB (ADRENODOXIN REDUCTASE) (AR) (FERREDOXIN-NADP(+) REDUCTASE)" /note="Mb0910, fprB, len: 575 aa. Equivalent to Rv0886, len: 575 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 575 aa overlap). Probable fprB, ferredoxin/ferredoxin-NADP(+) reductase (NADPH:adrenodoxin oxidoreductase) (EC 1.18.1.2), equivalent to O3306|MLCB57_19 FERREDOXIN/FERREDOXIN--NADP REDUCTASE from Mycobacterium leprae (555 aa), FASTA scores: E(): 0, (76.6 identity in 560 aa overlap). Also highly similar or similar to others e.g. NP_294219.1|NC_001263 putative ferredoxin/ferredoxin--NADP reductase from Deinococcus radiodurans (479 aa) (N-terminus shorter); P22570|ADRO_HUMAN NADPH:adrenodoxin oxidoreductase from homo sapiens (497 aa), FASTA scores: opt: 624, E(): 3e-30, (39.7% identity in 484 aa overlap); P08165|ADRO_BOVIN NADPH:ADRENODOXIN OXIDOREDUCTASE from Bos taurus (492 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv3106, Rv3858c, etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. Mb0910 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65529" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR017900" /db_xref="InterPro:IPR021163" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P65529" /protein_id="SIT99508.1" /translation="MPHVITQSCCNDASCVFACPVNCIHPTPDEPGFATSEMLYIDPV ACVDCGACVTACPVSAIAPNTRLDFEQLPFVEINASYYPKRPAGVKLAPTSKLAPVTP AAEVRVRRQPLTVAVVGSGPAAMYAADELLVQQGVQVNVFEKLPTPYGLVRSGVAPDH QNTKRVTRLFDRIAGHRRFRFYLNVEIGKHLGHAELLAHHHAVLYAVGAPDDRRLTID GMGLPGTGTATELVAWLNGHPDFNDLPVDLSHERVVIIGNGNVALDVARVLAADPHEL AATDIADHALSALRNSAVREVVVAARRGPAHSAFTLPELIGLTAGADVVLDPGDHQRV LDDLAIVADPLTRNKLEILSTLGDGSAPARRVGRPRIRLAYRLTPRRVLGQRRAGGVQ FSVTGTDELRQLDAGLVLTSIGYRGKPIPDLPFDEQAALVPNDGGRVIDPGTGEPVPG AYVAGWIKRGPTGFIGTNKSCSMQTVQALVADFNDGRLTDPVATPTALDQLVQARQPQ AIGCAGWRAIDAAEIARGSADGRVRNKFTDVAEMLAAATSAPKEPLRRRVLARLRDLG QPIVLTVPL" CDS complement(985978..986436) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0911C" /product="Glyoxalase/bleomycin resistance protein/dioxygenase" /note="Mb0911c, -, len: 152 aa. Equivalent to Rv0887c, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 152 aa overlap). Conserved hypothetical protein, highly similar to others e.g. NP_436346.1|NC_003037 Hypothetical protein from Sinorhizobium meliloti (149 aa); AL132644|SCI8_26 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (194 aa), FASTA scores: opt: 220, E(): 1.5e-07, (33.6% identity in 131 aa overlap); etc. Also shows weak similarity with transposases and related proteins. Protein product from Mb0911c detected using SWATH mass spectrometry. Mb0911c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="InterPro:IPR041581" /db_xref="UniProtKB/Swiss-Prot:P64742" /protein_id="SIT99509.1" /translation="MAINVEPALSPHLVVDDAASAIDFYVKAFDAVELGRVPGPDGKL IHAALRINGFTVMLNDDVPQMCGGKSMTPTSLGGTPVTIHLTVTDVDAKFQRALNAGA TVVTALEDQLWGDRYGVVADPFGHHWSLGQPVREVNMDEIQAAMSSQGDG" CDS 987698..989170 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0912" /product="PROBABLE EXPORTED PROTEIN" /note="Mb0912, -, len: 490 aa. Equivalent to Rv0888, len: 490 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 490 aa overlap). Probable exported protein. Equivalent to AAK45157.1 from Mycobacterium tuberculosis strain CDC1551 (507 aa) but shorter 17 aa. Contains possible N-terminal signal sequence. Mb0912 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64744" /db_xref="InterPro:IPR005135" /db_xref="InterPro:IPR036691" /db_xref="UniProtKB/Swiss-Prot:P64744" /protein_id="SIT99510.1" /translation="MDYAKRIGQVGALAVVLGVGAAVTTHAIGSAAPTDPSSSSTDSP VDACSPLGGSASSLAAIPGASVPQVGVRQVDPGSIPDDLLNALIDFLAAVRNGLVPII ENRTPVANPQQVSVPEGGTVGPVRFDACDPDGNRMTFAVRERGAPGGPQHGIVTVDQR TASFIYTADPGFVGTDTFSVNVSDDTSLHVHGLAGYLGPFHGHDDVATVTVFVGNTPT DTISGDFSMLTYNIAGLPFPLSSAILPRFFYTKEIGKRLNAYYVANVQEDFAYHQFLI KKSKMPSQTPPEPPTLLWPIGVPFSDGLNTLSEFKVQRLDRQTWYECTSDNCLTLKGF TYSQMRLPGGDTVDVYNLHTNTGGGPTTNANLAQVANYIQQNSAGRAVIVTGDFNARY SDDQSALLQFAQVNGLTDAWVQVEHGPTTPPFAPTCMVGNECELLDKIFYRSGQGVTL QAVSYGNEAPKFFNSKGEPLSDHSPAVVGFHYVADNVAVR" CDS complement(989205..990326) /codon_start=1 /transl_table=11 /gene="citA" /locus_tag="BQ2027_MB0913C" /product="PROBABLE CITRATE SYNTHASE II CITA" /note="Mb0913c, citA, len: 373 aa. Equivalent to Rv0889c, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 373 aa overlap). Probable citA (alternate gene name: gltA), citrate synthase 2 (EC 4.1.3.7), highly similar to others e.g. CAB95899.1|AL359988 putative citrate synthase from Streptomyces coelicolor (387 aa); P39119|CISY_BACSU citrate synthase II from Bacillus subtilis (366 aa), FASTA scores: opt: 586, E(): 5.8e-30, (33.8% identity in 367 aa overlap); etc. Also similar to Rv0896|MTCY31.24 from Mycobacterium tuberculosis (29.2% identity in 274 aa overlap) and Rv1131. Contains PS00480 Citrate synthase signature. BELONGS TO THE CITRATE SYNTHASE FAMILY. Protein product from Mb0913c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0913c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63778" /db_xref="InterPro:IPR002020" /db_xref="InterPro:IPR016142" /db_xref="InterPro:IPR016143" /db_xref="InterPro:IPR019810" /db_xref="InterPro:IPR036969" /db_xref="UniProtKB/Swiss-Prot:P63778" /protein_id="SIT99511.1" /translation="MTVVPENFVPGLDGVVAFTTEIAEPDKDGGALRYRGVDIEDLVS QRVTFGDVWALLVDGNFGSGLPPAEPFPLPIHSGDVRVDVQAGLAMLAPIWGYAPLLD IDDATARQQLARASVMALSYVAQSARGIYQPAVPQRIIDECSTVTARFMTRWQGEPDP RHIEAIDAYWVSAAEHGMNASTFTARVIASTGADVAAALSGAIGAMSGPLHGGAPARV LPMLDEVERAGDARSVVKGILDRGEKLMGFGHRVYRAEDPRARVLRAAAERLGAPRYE VAVAVEQAALSELRERRPDRAIETNVEFWAAVVLDFARVPANMMPAMFTCGRTAGWCA HILEQKRLGKLVRPSAIYVGPGPRSPESVDGWERVLTTA" CDS complement(990413..993061) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0914C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LUXR-FAMILY)" /note="Mb0914c, -, len: 882 aa. Equivalent to Rv0890c, len: 882 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 882 aa overlap). Probable transcriptional regulatory protein, LuxR family, highly similar (but shorter 238 aa in N-terminus) to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also highly similar (generally in part) to others e.g. T50568 probable multi-domain regulatory protein from Streptomyces coelicolor (1334 aa); P10957|NARL_ECOLI nitrate/nitrite response regulator protein from Escherichia coli (216 aa), FASTA scores: opt: 193, E(): 6e-06, (37.4% identity in 99 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. MTCY02B10_22, MTV008_44, MTV036_21, and MTCY31_24. Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature, and probable helix-turn helix motif from aa 836 to 857 (Score 1559, +4.50 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb0914c detected using SWATH mass spectrometry. Mb0914c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59969" /db_xref="InterPro:IPR000792" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/Swiss-Prot:P59969" /protein_id="SIT99512.1" /translation="MRALLAQNRLVTLCGTGGVGKTRLAIQIASASELRDGLCFVDLA PITESGIVAATAARAVGLPDQPGRSTMDSLRRFIGNRRMLMVLDNCEHLLDACAALVV ELLGACPELTILATSREPIGMAGEITWRVPSMSITDEAVELFADRASRVQPGFTIANH NAAAVGEICRRLDGIPLAIEFAAARVRSMSPLEIADGLDDCFRLLAGGVRGAVQRQQT LRASIDWSHALLTETEQILFRRLAPFVGGFDLAAVRAVAAGSDLDPFSVLDQLTLLVD KSLVVADDCQGRTRYRLLETVRRYALEKLGDSGEADVHARHRDYYTALAASLNTPADN DHQRLVARAETEIDNLRAAFAWSRENGHITEALQLASSLQPIWFGRAHLREGLSWFNS ILEDQRFHRLAVSTAVRARALADKAMLSTWLATSPVGATDIIAPAQQALAMAREVGDP AALVRALTACGCSSGYNAEAAAPYFAEATDLARAIDDKWTLCQILYWRGVGTCISGDP NALRAAAEECRDLADTIGDRFVSRHCSLWLSLAQMWAGNLTEALELSREITAEAEASN DVPTKVLGLYTQAQVLAYCGASAAHAIAGACIAAATELGGVYQGIGYAAMTYAALAAG DVTAALEASDAARPILRAQPDQVTMHQVLMAQLALAGGDAIAARQFANDAVDATNGWH RMVALTIRARVATARGEPELARDDAHAALACGAELHIYQGMPDAMELLAGLAGEVGSH SEGVRLLGAAAALRQQTRQVRFKIWDAGYQASVTALREAMGDEDFDRAWAEGAALSTD EAIAYAQRGRGERKRPARGWGSLTPTERDVVRLVSEGLSNKDIAKRLFVSPRTVQTHL THVYAKLGLASRVQLVDEAARRGSPS" CDS complement(993063..993920) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0915C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb0915c, -, len: 285 aa. Equivalent to Rv0891c, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 285 aa overlap). Possible transcriptional regulator, highly similar in N-terminus to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also highly similar to several Mycobacterium tuberculosis putative transcriptional regulators e.g. Q1102|MTCY02B10_22 PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (1159 aa), FASTA scores: opt: 702, E(): 8.3e-40, (50.6% identity in 247 aa overlap); MTV036_21; MTV008_44; MTCY02B10_23. Also shows similarity with several adenylate cyclases and hydrolases from other organisms. Mb0915c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59970" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/Swiss-Prot:P59970" /protein_id="SIT99513.1" /translation="MLFNAVHNSLPPNIDIDHAILRGEDHPPTCAKCVARGRISALGS LDLRYHSLRCYAAPPDVGRCEFVPPRRRVLIANQGLDVSRLPPTGTVTLLLADVEEST HLWQMCPEDMATAIAHLDHTVSEAITNHGGVQPVKRYEGDSFVAAFTRASDAAACALD LQRTSLAPIRLRIGLHTGEVQLRDELYVGPTINRTARLRDLAHGGQVVLSAATGDLVT GRLPADAWLVDLGRHPLRGLPRPEWVMQLCHPDIREKFPPLRTAKSSPTSILPAQFTT FVGRRAQIS" CDS 994318..995805 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0916" /product="PROBABLE MONOOXYGENASE" /note="Mb0916, -, len: 495 aa. Equivalent to Rv0892, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 495 aa overlap). Probable monooxygenase (EC 1.14.-.-), highly similar to others e.g. NP_250787.1|NC_002516 probable flavin-binding monooxygenase from Pseudomonas aeruginosa (491 aa); CAB59668.1|AL132674 monooxygenase from Streptomyces coelicolor (519 aa); P12015|CYMO_ACIS cyclohexanone monooxygenase from Acinetobacter sp. (542 aa), FASTA scores: opt: 489, E(): 6.8e-26, (30.3% identity in 492 aa overlap); etc. Also highly similar to Rv0565c, Rv3854c, Rv3083, etc from Mycobacterium tuberculosis. Has hydrophobic stretch at N-terminus. Protein product from Mb0916 detected using SWATH mass spectrometry. Mb0916 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64746" /db_xref="InterPro:IPR020946" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P64746" /protein_id="SIT99514.1" /translation="MTGRCPTVAVVGAGMSGMCVAITLLSAGITDVCIYEKADDVGGT WRDNTYPGLTCDVPSRLYQYSFAKNPNWTQMFSRGGEIQDYLRGIAERYGLRHRIRFG ATVVSARFDDGRWVLRTDSGTESTVDFLISATGVLHHPRIPPIAGLDDFRGTVFHSAR WDHTVPLLGRRIAVIGTGSTGVQLVCGLAGVAGKVTMFQRTAQWVLPWPNPRYSKLAR VFHRAFPCLGSLAYKAYSLSFETFAVALSNPGLHRKLVGAVCRASLRRVRDPRLRRAL TPDYEPMCKRLVMSGGFYRAIQRDDVELVTAGIDHVEHRGIVTDDGVLHEVDVIVLAT GFDSHAFFRPMQLTGRDGIRIDDVWQDGPHAHQTVAIPGFPNFFMMLGPHSPVGNFPL TAVAESQAEHIVQWIKRWRHGEFDTMEPKSAATEAYNTVLRAAMPNTVWTTGCDSWYL NKDGIPEVWPFAPAKHRAMLANLHPEEYDLRRYAAVRATSRPQSA" CDS complement(995783..996760) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0917C" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb0917c, -, len: 325 aa. Equivalent to Rv0893c, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 325 aa overlap). Conserved hypothetical protein, belongs in family with P96823|Rv0146|MTCI5.20 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 784, E(): 0, (43.2% identity in 308 aa overlap); Rv0726c, Rv0731c, Rv3399, etc. Also shows some similarity with others e.g. SC9B5.10|T35930 hypothetical protein from Streptomyces coelicolor (303 aa); BSUB0008_141|Q45500 HYPOTHETICAL 34.8 KD PROTEIN from Bacillus subtilis (304 aa), FASTA scores: E(): 0.00033, (26.8% identity in 168 aa overlap); etc. Protein product from Mb0917c detected using SWATH mass spectrometry. Mb0917c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64748" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P64748" /protein_id="SIT99515.1" /translation="MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVF CRAAGGEWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQVVI LAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRRSVAVDLRDEW QIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDTLASPGSHVAVEEATPLDP CEFAAKLERERAANAQGDPRRFFQMVYNERWARATEWFDERGWRATATPLAEYLRRVG RAVPEADTEAAPMVTAITFVSAVRTGLVADPARTSPSSTSIGFKRFEAD" CDS 996989..998170 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0918" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (POSSIBLY LUXR-FAMILY)" /note="Mb0918, -, len: 393 aa. Equivalent to Rv0894, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 393 aa overlap). Possible regulatory protein, LuxR family, highly similar in part to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also similar to others e.g. CAB95788.1|AL359949 putative multi-domain regulatory protein from Streptomyces coelicolor (780 aa); NP_107293.1|NC_002678 transcriptional regulator from Mesorhizobium loti (903 aa); etc. Also similar to other regulatory proteins from Mycobacterium tuberculosis e.g. Rv2488c|MTV008_44 (1137 aa), FASTA score: (53.2% identity in 363 aa overlap); Rv1358|MTCY02B10_22 (1159 aa), FASTA score: (52.3% identity in 365 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="GOA:P64750" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P64750" /protein_id="SIT99516.1" /translation="MPSRATVQEFSDSYPFCHNGFRPIMMPKIVSVQHSTRRHLTSFV GRKAELNDVRRLLSDKRLVTLTGPDGMGKSRLALQIGAQIAHEFTYGRWDCDLATVTD RDCVSISMLNALGLPVQPGLSAIDTLVGVINDARVLLVLDHCEHLLDACAAIIDSLLR SCPRLTILTTSTEAIGLAGELTWRVPPLSLTNDAIELFVDRARRVRSDFAINADTAVT VGEICRRLDGVPLAIELAAARTDTLSPVEILAGLNDRFRLVAGAAGNAVRPEQTLCAT VQWSHALLSGPERALLHRLAVFAGGFDLDGAQAVGANDEDFEGYQTLGRFAELVDKAF VVVENNRGRAGYRLLYSVRQYALEKLSESGEADAVLARYRKHLKQPNQVVRAGSGGVR Y" CDS 998247..999764 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0919" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb0919, -, len: 505 aa. Equivalent to Rv0895, len: 505 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 505 aa overlap). Conserved hypothetical protein; member of family with: Rv3740c, Rv3734c, Rv1425, Rv1760, etc. Shows some similarity with NP_301898.1|NC_002677 conserved membrane protein from Mycobacterium leprae (491 aa)." /db_xref="GOA:P67205" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/Swiss-Prot:P67205" /protein_id="SIT99517.1" /translation="MRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAADALFLWAET PTRPLHVGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWWRRRPHRSLTSLGQWSWR TETEVDLDYHVRLSALPPRAGTAELWALVSELHAGMLDRSRPLWQVDLIEGLPGGRCA VYVKVHHALADGVSVMRLLQRIVTADPHQRQMPTLWEVPAQASVAKHTAPRGSSRPLT LAKGVLGQARGVPGMVRVVADTTWRAAQCRSGPLTLAAPHTPLNEPIAGARSVAGCSF PIERLRQVAEHADATINDVVLAMCGGALRAYLISRGALPGAPLIAMVPVSLRDTAVID VFGQGPGNKIGTLMCSLATHLASPVERLSAIRASMRDGKAAIAGRSRNQALAMSALGA APLALAMALGRVPAPLRPPNVTISNVPGPQGALYWNGARLDALYLLSAPVDGAALNIT CSGTNEQITFGLTGCRRAVPALSILTDQLAHELELLVGVSEAGPGTRLRRIAGRR" CDS 999937..1001232 /codon_start=1 /transl_table=11 /gene="gltA2" /locus_tag="BQ2027_MB0920" /product="PROBABLE CITRATE SYNTHASE I GLTA2" /note="Mb0920, gltA2, len: 431 aa. Equivalent to Rv0896, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8%% identity in 431 aa overlap). Probable gltA2, citrate synthase 1 (EC 4.1.3.7), highly similar to O33066|NP_302405.1|NC_002677 citrate synthase 1 from Mycobacterium leprae (431 aa), FASTA scores: E(): 0, (91.0 identity in 431 aa overlap); and AAF04133.1|AF191033_1|AF191033 citrate synthase from Mycobacterium smegmatis (441 aa). Also highly similar to others e.g. AAF14286.1|AF181118_1|AF181118 citrate synthase from Streptomyces coelicolor (429 aa); P42457|CISY_CORGL CITRATE SYNTHASE from Corynebacterium glutamicum (437 aa), FASTA scores: opt: 1847, E(): 0, (63.0% identity in 433 aa overlap); etc. Also similar to two other Mycobacterium tuberculosis citrate synthases, Rv0889|MTCY31.17c|citA (373 aa), FASTA score: (29.2% identity in 274 aa overlap) and Rv1131|MTCY22G8.20|gltA1 (393 aa). Contains PS00480 Citrate synthase signature. BELONGS TO THE CITRATE SYNTHASE FAMILY. Protein product from Mb0920 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0920 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWQ4" /db_xref="InterPro:IPR002020" /db_xref="InterPro:IPR010953" /db_xref="InterPro:IPR016142" /db_xref="InterPro:IPR016143" /db_xref="InterPro:IPR019810" /db_xref="InterPro:IPR024176" /db_xref="InterPro:IPR036969" /db_xref="UniProtKB/TrEMBL:A0A1R3XWQ4" /protein_id="SIT99518.1" /translation="MADTDDTATLRYPGGEIDLQIVHATEGADGIALGPLLAKTGHTT FDVGFANTAAAKSSITYIDGDAGILRYRGYPIDQLAEKSTFIEVCYLLIYGELPDTDQ LAQFTGRIQRHTMLHEDLKRFFDGFPRNAHPMPVLSSVVNALSAYYQDALDPMDNGQV ELSTIRLLAKLPTIAAYAYKKSVGQPFLYPDNSLTLVENFLRLTFGFPAEPYQADPEV VRALDMLFILHADHEQNCSTSTVRLVGSSRANLFTSISGGINALWGPLHGGANQAVLE MLEGIRDSGDDVSEFVRKVKNREAGVKLMGFGHRVYKNYDPRARIVKEQADKILAKLG GDDSLLGIAKELEEAALTDDYFIERKLYPNVDFYTGLIYRALGFPTRMFTVLFALGRL PGWIAHWREMHDEGDSKIGRPRQIYTGYAERDYVTIDAR" CDS complement(1001273..1002880) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0921C" /product="PROBABLE OXIDOREDUCTASE" /note="Mb0921c, -, len: 535 aa. Equivalent to Rv0897c, len: 535 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 535 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases from diverse organisms e.g. CAB94055.1|AL358672 putative oxidoreductase from Streptomyces coelicolor (540 aa); NP_147877.1|NC_000854 phytoene dehydrogenase from Aeropyrum pernix (549 aa); Q01671|CRTD_RHOSH methoxyneurosporene dehydrogenase from Rhodobacter sphaeroides (495 aa), FASTA scores: opt: 139, E(): 2.6e-06, (23.8% identity in 538 aa overlap); etc. Also similar to Rv1432, Rv2997, and Rv3829c from Mycobacterium tuberculosis. Protein product from Mb0921c detected using SWATH mass spectrometry. Mb0921c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64752" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P64752" /protein_id="SIT99519.1" /translation="MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGG AAVSIQAFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGRSG LLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPLRTREQARRDI VEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATDALIGTFARMHEPSLMQNI CFLYHLVGGGTGVWHVPIGGMGSVTSALATAAARHGAEIVTGADVFALDPDGTVRYHS DGSDGAEHLVRGRFVLVGVTPAVLASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSV TPQQAFAGTFHVNETWSQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAG AQTLTVFGLHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIMLCGSGARR GGAVSGIGGHNAAMAVLACLASRRKSP" CDS complement(1002906..1003169) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0922C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0922c, -, len: 87 aa. Equivalent to Rv0898c, len: 87 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 87 aa overlap). Conserved hypothetical protein, highly similar to CAC01589.1|AL391041 hypothetical protein from Streptomyces coelicolor (87 aa). Also shows some similarity to Rv0709|MTCY210.28|rpmC from Mycobacterium tuberculosis (77 aa), FASTA score: (28.8% identity in 73 aa overlap). Protein product from Mb0922c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0922c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR020311" /db_xref="UniProtKB/Swiss-Prot:P64754" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99520.1" /translation="MGKGRKPTDSETLAHIRDLVAEEKALRAQLRHGGISESEEQQQL RRIEIELDQCWDLLRQRRALRQTGGDPREAVVRPADQVEGYTG" CDS 1003277..1004257 /codon_start=1 /transl_table=11 /gene="ompA" /locus_tag="BQ2027_MB0923" /product="outer membrane protein a ompa" /note="Mb0923, ompA, len: 326 aa. Equivalent to Rv0899, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 326 aa overlap). Probable ompA, outer membrane protein A, C-terminal region similar to C-terminus of many members of the OMPA family of outer membrane proteins e.g. NP_458280.1|NC_003198 putative outer membrane protein from Salmonella enterica subsp. enterica serovar Typhi (220); NP_418008.1|NC_000913 putative outer membrane protein from Escherichia coli strain K12 (219 aa), FASTA scores: opt: 296, E(): 2.2e-11, (45.3% identity in 117 aa overlap); NP_231844.1|NC_002505 outer membrane protein OmpA from Vibrio cholerae (321 aa); Q05146|OMPA_BORAV OUTER MEMBRANE PROTEIN A PRECURSOR from Bordetella avium (194 aa); etc. A signal peptide sequence probably exists at the N-terminus. Contains PS00044 Bacterial regulatory proteins, lysR family signature. BELONGS TO THE OMPA FAMILY. Protein product from Mb0923 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0923 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65594" /db_xref="InterPro:IPR006664" /db_xref="InterPro:IPR006665" /db_xref="InterPro:IPR006690" /db_xref="InterPro:IPR007055" /db_xref="InterPro:IPR036737" /db_xref="UniProtKB/Swiss-Prot:P65594" /protein_id="SIT99521.1" /translation="MASKAGLGQTPATTDARRTQKFYRGSPGRPWLIGAVVIPLLIAA IGYGAFERPQSVTGPTGVLPTLTPTSTRGASALSLSLLSISRSGNTVTLIGDFPDEAA KAALMTALNGLLAPGVNVIDQIHVDPVVRSLDFSSAEPVFTASVPIPDFGLKVERDTV TLTGTAPSSEHKDAVKRAATSTWPDMKIVNNIEVTGQAPPGPPASGPCADLQSAINAV TGGPIAFGNDGASLIPADYEILNRVADKLKACPDARVTINGYTDNTGSEGINIPLSAQ RAKIVADYLVARGVAGDHIATVGLGSVNPIASNATPEGRAKNRRVEIVVN" CDS 1004270..1004422 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0924" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb0924, -, len: 50 aa. Equivalent to Rv0900, len: 50 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 50 aa overlap). Possible membrane protein, with hydrophobic domain from aa 4-26. Mb0924 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64756" /db_xref="UniProtKB/Swiss-Prot:P64756" /protein_id="SIT99522.1" /translation="MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSA AETGAQ" CDS 1004422..1004949 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0925" /product="POSSIBLE CONSERVED EXPORTED OR MEMBRANE PROTEIN" /note="Mb0925, -, len: 175 aa. Equivalent to Rv0901, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 175 aa overlap). Possible conserved exported or membrane protein, with hydrophobic N-terminus at aa 7-25. Shows some similarity in C-terminus to O33070|Z99494|MLCB57.59 HYPOTHETICAL PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 204, E(): 3.2e-12, (44.9% identity in 78 aa overlap) Protein product from Mb0925 detected using SWATH mass spectrometry. Mb0925 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64758" /db_xref="UniProtKB/Swiss-Prot:P64758" /protein_id="SIT99523.1" /translation="MEHVHWWLAGLAFTLGMVLTSTLMVRPVEHQVLVKKSVRGSSAK SKPPTARKPAVKSGTKREESPTAKTKVATESAAEQIPVAGEPAAEPIPVAGEPAARIP VVPYAPYGPGSARAGADGSGPQGWLVKGRSDTRLYYTPEDPTYDPTVAQVWFQDEESA ARAFFTPWRKSTRRT" CDS complement(1004966..1006306) /codon_start=1 /transl_table=11 /gene="prrB" /locus_tag="BQ2027_MB0926C" /product="TWO COMPONENT SENSOR HISTIDINE KINASE PRRB" /note="Mb0926c, prrB, len: 446 aa. Equivalent to Rv0902c, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 446 aa overlap). prrB, two-component sensor histidine kinase (EC 2.7.3.-) (see citations below), transmembrane protein, equivalent to MLCB57_26|NP_302403.1|NC_002677 sensor histidine kinase from Mycobacterium leprae (446 aa); and similar at C-termini to NP_301251.1|NC_002677 putative two-component system sensor kinase from Mycobacterium leprae (519 aa). C-terminus also similar to the C-termini of many sensor-like histidine kinase proteins e.g. P08336|CPXA_ECOLI|ECFB|SSD|EUP|B3911|Z5456|ECS4837 sensor protein from Escherichia coli strain K12 (457 aa), FASTA scores: opt: 364, E(): 1.7e-15, (27.1% identity in 398 aa overlap); CAB89748.1|AL354616 putative two-component histidine kinase from Streptomyces coelicolor (483 aa); CAB82845.1|AJ277081 putative histidine kinase from Amycolatopsis mediterranei (472 aa); etc. Also similar in part to Mycobacterium tuberculosis proteins Rv3764c (475 aa); and Rv0982 (504 aa). Thought to be induced at phagocytosis (see second citation below). Protein product from Mb0926c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0926c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Z9" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/Swiss-Prot:P0A5Z9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99524.1" /translation="MNILSRIFARTPSLRTRVVVATAIGAAIPVLIVGTVVWVGITND RKERLDRRLDEAAGFAIPFVPRGLDEIPRSPNDQDALITVRRGNVIKSNSDITLPKLQ DDYADTYVRGVRYRVRTVEIPGPEPTSVAVGATYDATVAETNNLHRRVLLICTFAIGA AAVFAWLLAAFAVRPFKQLAEQTRSIDAGDEAPRVEVHGASEAIEIAEAMRGMLQRIW NEQNRTKEALASARDFAAVSSHELRTPLTAMRTNLEVLSTLDLPDDQRKEVLNDVIRT QSRIEATLSALERLAQGELSTSDDHVPVDITDLLDRAAHDAARIYPDLDVSLVPSPTC IIVGLPAGLRLAVDNAIANAVKHGGATLVQLSAVSSRAGVEIAIDDNGSGVPEGERQV VFERFSRGSTASHSGSGLGLALVAQQAQLHGGTASLENSPLGGARLVLRLPGPS" CDS complement(1006317..1007027) /codon_start=1 /transl_table=11 /gene="prrA" /locus_tag="BQ2027_MB0927C" /product="TWO COMPONENT RESPONSE TRANSCRIPTIONAL REGULATORY PROTEIN PRRA" /note="Mb0927c, prrA, len: 236 aa. Equivalent to Rv0903c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 236 aa overlap). prrA, two-component response regulator (see citations below), equivalent to Z99494|MLCB57_27|NP_302402.1|NC_002677 two-component response regulator from Mycobacterium leprae (233 aa), FASTA scores: opt: 1414, E(): 0, (95.7% identity in 233 aa overlap); and similar to T45446 probable two-component response regulator from Mycobacterium leprae (253 aa). Also similar to many sensor-like histidine kinase proteins e.g. CAB88489.1|AL353816 putative two-component systen response regulator from Streptomyces coelicolor (248 aa); AAG36759.1|AF119221_1 |AF119221 response regulator from Corynebacterium glutamicum (232 aa); Q02540|COPR_PSESM transcriptional activator protein COPR from Pseudomonas syringae (pv. tomato) (227 aa), FASTA scores: opt: 600, E(): 0, (44.4% identity in 225 aa overlap); etc. Also similar to Rv0981 from Mycobacterium tuberculosis (230 aa), Rv3765c (234 aa), phoP (247 aa), etc. Thought to be induced at phagocytosis (see second citation below). Protein product from Mb0927c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0927c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Z7" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/Swiss-Prot:P0A5Z7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99525.1" /translation="MGGMDTGVTSPRVLVVDDDSDVLASLERGLRLSGFEVATAVDGA EALRSATENRPDAIVLDINMPVLDGVSVVTALRAMDNDVPVCVLSARSSVDDRVAGLE AGADDYLVKPFVLAELVARVKALLRRRGSTATSSSETITVGPLEVDIPGRRARVNGVD VDLTKREFDLLAVLAEHKTAVLSRAQLLELVWGYDFAADTNVVDVFIGYLRRKLEAGG GPRLLHTVRGVGFVLRMQ" CDS complement(1007158..1008645) /codon_start=1 /transl_table=11 /gene="accD3" /locus_tag="BQ2027_MB0928C" /product="PUTATIVE ACETYL-COENZYME A CARBOXYLASE CARBOXYL TRANSFERASE (SUBUNIT BETA) ACCD3 (ACCASE BETA CHAIN)" /note="Mb0928c, accD3, len: 495 aa. Equivalent to Rv0904c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 495 aa overlap). Putative accD3, acetyl-CoA carboxylase carboxyl transferase, beta subunit (carboxyltransferase subunit of acetyl-CoA carboxylase) (EC 6.4.1.2), highly similar in part to AAA63045.1|U15184 zinc finger protein from Mycobacterium leprae (201 aa). Also highly similar to others e.g. CAC42827.1|Y17592 putative carboxyltransferase subunit of acetyl-CoA carboxylase from Corynebacterium glutamicum (491 aa); CAB86110.1|AL163003 putative acetyl CoA carboxylase (alpha and beta subunits) from Streptomyces coelicolor (458 aa); Q54776|ACCD_SYNP7 ACETYL-COENZYME A CARBOXYLASE CARBOXYL TRANSFERASE SUBUNIT BETA from Synechococcus sp. (305 aa); P12217|ACCD_MARPO ACETYL-COENZYME A CARBOXYLASE CARBOXYL TRANSFERASE SUBUNIT BETA from Marchantia polymorpha (316 aa), FASTA scores: opt: 519, E():1.6e-24, (40.2% identity in 219 aa overlap); etc. Also similar to Rv3280, Rv2502c, etc from Mycobacterium tuberculosis. BELONGS TO THE ACCD/PCCB FAMILY. Protein product from Mb0928c detected using SWATH mass spectrometry. Mb0928c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63406" /db_xref="InterPro:IPR000438" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/Swiss-Prot:P63406" /protein_id="SIT99526.1" /translation="MSRITTDQLRHAVLDRGSFVSWDSEPLAVPVADSYARELAAARA ATGADESVQTGEGRVFGRRVAVVACEFDFLGGSIGVAAAERITAAVERATAERLPLLA SPSSGGTRMQEGTVAFLQMVKIAAAIQLHNQARLPYLVYLRHPTTGGVFASWGSLGHL TVAEPGALIGFLGPRVYELLYGDPFPSGVQTAENLRRHGIIDGVVALDRLRPMLDRAL TVLIDAPEPLPAPQTPAPVPDVPTWDSVVASRRPDRPGVRQLLRHGATDRVLLSGTDQ GEAATTLLALARFGGQPTVVLGQQRAVGGGGSTVGPAALREARRGMALAAELCLPLVL VIDAAGPALSAAAEQGGLAGQIAHCLAELVTLDTPTVSILLGQGSGGPALAMLPADRV LAALHGWLAPLPPEGASAIVFRDTAHAAELAAAQGIRSADLLKSGIVDTIVPEYPDAA DEPIEFALRLSNAIAAEVHALRKIPAPERLATRLQRYRRIGLPRD" CDS 1008672..1009403 /codon_start=1 /transl_table=11 /gene="echA6" /locus_tag="BQ2027_MB0929" /product="POSSIBLE ENOYL-COA HYDRATASE ECHA6 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0929, echA6, len: 243 aa. Equivalent to Rv0905, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 243 aa overlap). Possible echA6, enoyl-CoA hydratase (EC 4.2.1.17), highly similar to ML15184|U15184 enoyl-CoA hydratase from Mycobacterium leprae (247 aa), FASTA score: (85.8% identity in 247 aa overlap). Also similar to many e.g. NP_250320.1|NC_002516 probable enoyl-CoA hydratase/isomerase from Pseudomonas aeruginosa (261 aa); NP_415911.1|NC_000913 putative enzyme from Escherichia coli strain K12 (255 aa); P24162 ECHH_RHOCA|FADB1 enoyl-CoA hydratase homolog from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (257 aa), FASTA scores: opt: 404, E():7.8e-21, (37.3% identity in 249 aa overlap); etc. Protein product from Mb0929 detected using shotgun mass spectrometry. Mb0929 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64015" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/Swiss-Prot:P64015" /protein_id="SIT99527.1" /translation="MIGITQAEAVLTIELQRPERRNALNSQLVEELTQAIRKAGDGSA RAIVLTGQGTAFCAGADLSGDAFAADYPDRLIELHKAMDASPMPVVGAINGPAIGAGL QLAMQCDLRVVAPDAFFQFPTSKYGLALDNWSIRRLSSLVGHGRARAMLLSAEKLTAE IALHTGMANRIGTLADAQAWAAEIARLAPLAIQHAKRVLNDDGAIEEAWPAHKELFDK AWGSQDVIEAQVARMEKRPPKFQGA" CDS 1009409..1010527 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0930" /product="Outer membrane protein romA" /note="Mb0930, -, len: 372 aa. Equivalent to Rv0906, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 372 aa overlap). Conserved hypothetical protein, highly similar to others e.g. SC6A5.25|AL049485|T35416 hypothetical protein from Streptomyces coelicolor (370 aa), FASTA scores: opt: 1125, E(): 0, (51.3% identity in 335 aa overlap); NP_242955.1|NC_002570|BH2089 conserved protein from Bacillus halodurans (370 aa); etc. Also shows some similarity to C-terminus of Q48412|ROMA_KLEPN Q48412 outer membrane protein roma (fragment) from Klebsiella pneumoniae (132 aa), FASTA scores: opt: 319, E(): 8.5e-14, (46.2% identity in 104 aa overlap); NP_105215.1|NC_002678 hypothetical protein which contains similarity to outer membrane protein romA from Enterobacter cloacae (350 aa); etc. Protein product from Mb0930 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0930 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64760" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR024884" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/Swiss-Prot:P64760" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99528.1" /translation="MVRRALRLAAGTASLAAGTWLLRALHGTPAALGADAASIRAVSE QSPNYRDGAFVNLDPASMFTLDREELRLIVWELVARHSASRPAAPIPLASPNIYRGDA SRLAVSWFGHSTALLEIDGYRVLTDPVWSDRCSPSDVVGPQRLHPPPVQLAALPAVDA VVISHDHYDHLDIDTVVALVGMQRAPFLVPLGVGAHLRSWGVPQDRIVELDWNQSAQV DELTVVCVPARHFSGRFLSRNTTLWASWAFVGPNHRAYFGGDTGYTKSFTQIGADHGP FDLTLLPIGAYNTAWPDIHMNPEEAVRAHLDVTDSGSGMLVPVHWGTFRLAPHPWGEP VERLLAAAEPEHVTVAVPLPGQRVDPTGPMRLHPWWRL" CDS 1010650..1012200 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0931" /product="Beta-lactamase class C-like and penicillin binding proteins (PBPs) superfamily / DUF3471 domain" /note="Mb0931, -, len: 516 aa. Equivalent to Rv0907, len: 532 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 511 aa overlap). Conserved hypothetical protein, possibly involved in cell wall biosynthesis: similar to many beta-lactamases, penicillin-binding proteins and hypothetical proteins e.g. NP_298910.1|NC_002488 beta-lactamase from Xylella fastidiosa (455 aa); Q06317|PBP4_NOCLA PENICILLIN-BINDING PROTEIN 4 (PBP-4) (381 aa), FASTA scores: opt: 299, E(): 8.8e-05, (28.7% identity in 401 aa overlap); etc. N-terminus highly similar to AAA63047.1|U15184 hypothetical protein from Mycobacterium leprae (58 aa). Related to other putative esterases and penicillin binding proteins in Mycobacterium tuberculosis e.g. Rv1730c|MTCY04C12.15c (517 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base insertion (*-g) leads to a shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb0931 detected using SWATH mass spectrometry. Mb0931 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR021860" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT2" /protein_id="SIT99529.1" /translation="MMLLTLTGCGSTHQALGPPSGLPDASPNERSAIQIPAGRIDDAV AKVDGLVGELMQNTGIPGMAVAIVHGGKTLYAKGFGVRDVGKGGGPDNKVDADTVFQL ASVSKSVGATVVAHAVTDNVVTWDTPVVSKLPWFALRDPYVTGQVTIADLYSHRSGLP DHAGDLLEDLGYDRRQVLQRLKYLPLAPFRISYAYTNFGVTAAAEAVAAAAGQSWEDL SDEVLYRPLGMGSTSSRFTDFLARPNHAVNHVKVADRWEARYQRDPDAQSPAGGVSSS LNDMTHWLAMVLADGVYNGRRITSPEALLLVYTPQVISRHPVSPRARASFYGYGFNVG VTSSGRTEYSHSGAFGLGAAANFVVLPSEDLAIIALTNAGPIGVPETLTAEFMDLVQY GQVREDWAALYKKAFAPLNELAGSLVGKQSPANPAPSRPLNDYVGVYANDYWGPATVT YHDGQLRLSLGPKNQTFDLTHWDGDTFTFTLSTENALPGSISKATFAGDTLNLEYYDA DKLGTFTR" CDS 1012197..1014590 /codon_start=1 /transl_table=11 /gene="ctpE" /locus_tag="BQ2027_MB0932" /product="PROBABLE METAL CATION TRANSPORTER ATPASE P-TYPE CTPE" /note="Mb0932, ctpE, len: 797 aa. Equivalent to Rv0908, len: 797 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 797 aa overlap). Probable ctpE, metal cation-transporting ATPase P-type, transmembrane protein, E1-E2 family, highly similar to many e.g. AB93406.1|AL357524 putative integral membrane ATPase from Streptomyces coelicolor (802 aa); NP_346063.1|NC_003028 cation-transporting ATPase (E1-E2 family) from Streptococcus pneumoniae (778 aa); P37278|ATCL_SYNP7|PACL cation-transporting atpase from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (926 aa), FASTA scores: opt: 257, E(): 4.8e-33, (27.7% identity in 905 aa overlap); etc. Contains E1-E2 ATPases phosphorylation site (PS00154). BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES). Protein product from Mb0932 detected using SWATH mass spectrometry. Mb0932 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A505" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P0A505" /protein_id="SIT99530.1" /translation="MTRSASATAGLTDAEVAQRVAEGKSNDIPERVTRTVGQIVRANV FTRINAILGVLLLIVLATGSLINGMFGLLIIANSVIGMVQEIRAKQTLDKLAIIGQAK PLVRRQSGTRTRSTNEVVLDDIIELGPGDQVVVDGEVVEEENLEIDESLLTGEADPIA KDAGDTVMSGSFVVSGAGAYRATKVGSEAYAAKLAAEASKFTLVKSELRNGINRILQF ITYLLVPAGLLTIYTQLFTTHVGWRESVLRMVGALVPMVPEGLVLMTSIAFAVGVVRL GQRQCLVQELPAIEGLARVDVVCADKTGTLTESGMRVCEVEELDGAGRQESVADVLAA LAAADARPNASMQAIAEAFHSPPGWVVAANAPFKSATKWSGVSFRDHGNWVIGAPDVL LDPASVAARQAERIGAQGLRVLLLAAGSVAVDHAQAPGQVTPVALVVLEQKVRPDARE TLDYFAVQNVSVKVISGDNAVSVGAVADRLGLHGEAMDARALPTGREELADTLDSYTS FGRVRPDQKRAIVHALQSHGHTVAMTGDGVNDVLALKDADIGVAMGSGSPASRAVAQI VLLNNRFATLPHVVGEGRRVIGNIERVANLFLTKTVYSVLLALLVGIECLIAIPLRRD PLLFPFQPIHVTIAAWFTIGIPAFILSLAPNNERAYPGFVRRVMTSAVPFGLVIGVAT FVTYLAAYQGRYASWQEQEQASTAALITLLMTALWVLAVIARPYQWWRLALVLASGLA YVVIFSLPLAREKFLLDASNLATTSIALAVGVVGAATIEAMWWIRSRMLGVKPRVWR" CDS 1015147..1015326 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0933" /product="Antitoxin Rv0909" /note="Mb0933, -, len: 59 aa. Equivalent to Rv0909, len: 59 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 59 aa overlap). Conserved hypothetical protein, equivalent to NP_302399.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (56 aa). Also some similarity with AL022268|SC4H2_10c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (97 aa), FASTA scores: opt: 106, E(): 0.13, (43.2% identity in 37 aa overlap). Protein product from Mb0933 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0933 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR028037" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT0" /protein_id="SIT99531.1" /translation="MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKL QDAASNVVGMSDQQS" CDS 1015332..1015766 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0934" /product="Toxin Rv0910" /note="Mb0934, -, len: 144 aa. Equivalent to Rv0910, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 144 aa overlap). Conserved hypothetical protein, equivalent to NP_302398.1|NC_002677 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 820, E(): 0, (83.9% identity in 143 aa overlap). Also similar to Rv1546|MTCY48.19c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (143 aa). Protein product from Mb0934 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0934 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW4" /protein_id="SIT99532.1" /translation="MAKLSGSIDVPLPPEEAWMHASDLTRYREWLTIHKVWRSKLPEV LEKGTVVESYVEVKGMPNRIKWTIVRYKPPEGMTLNGDGVGGVKVKLIAKVAPKEHGS VVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG" CDS 1015864..1016637 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0935" /product="Putative hydroxylase" /note="Mb0935, -, len: 257 aa. Equivalent to Rv0911, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 257 aa overlap). Conserved hypothetical protein, showing similarity with hydroxylases and hypothetical proteins e.g. T35325 probable hydroxylase from Streptomyces coelicolor (265 aa); Q54242 hypothetical protein from Streptomyces, FASTA scores: opt: 372, E(): 8.8e-18, (32.0% identity in 256 aa overlap); AAD04716.1|U77891 doxorubicin biosynthesis enzyme DnrV from Streptomyces peucetius (275 aa); AAA63051.1|U15184 hypothetical protein from Mycobacterium leprae (94 aa); etc. Also similar to Rv0577 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (261 aa). Protein product from Mb0935 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0935 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="InterPro:IPR041581" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS1" /protein_id="SIT99533.1" /translation="MPTRSSAPLGAPCWIDLTTSDVDRAQDFYGTVFGWAFESAGPDY GGYINAAKGGHPVAGLMANRPEFQSPDGWATYFHTVDIGATVAKLAAAGGSSCLDPME VPGKGFMSLAVDPSGAAFGLWQPLQHHGFEVIGEAGSPVWHQLTTRDYRSVIDFYRQV FGWRTEQISDTDEFCYTTAWFDDQQLLGVMDGSSCLPEGVPSNWTIFFGAEDVDETLR VICDNGGSVVRAAENTPYGRLAAAADPMGVVFNLSSLQA" CDS 1016702..1017151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0936" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0936, -, len: 149 aa. Equivalent to Rv0912, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 149 aa overlap). Probable conserved transmembrane protein, equivalent to Q50121|NP_302397.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (144 aa), FASTA scores: opt: 677, E(): 6.9e-38, (69.5% identity in 141 aa overlap). Mb0936 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWU7" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU7" /protein_id="SIT99534.1" /translation="MTRRLRPGWLVALSAAVIAASTWMPWLTTTVGGGGWVNAIGGTH GSLELPHGFGPGQLIVLLSSTLLVVGAMAGRGLSVKLSSIAALVVSLLIVALTVWYYK LNVNPPVSAEYGLYFGAAGGVCAVGCSLWAAVSAASPGRRRHREVVR" CDS complement(1017683..1019191) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0937C" /product="POSSIBLE DIOXYGENASE" /note="Mb0937c, -, len: 502 aa. Equivalent to Rv0913c, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 502 aa overlap). Possible dioxygenase (EC 1.-.-.-), showing similarity with others e.g. AAK38744.1|AY029525 carotenoid 9,10-9',10' cleavage dioxygenase from Phaseolus vulgaris (543 aa); CAB56138.1|AL117669 putative dioxygenase from Streptomyces coelicolor (503 aa); AAK06796.1|AF324838_15|AF324838 putative dioxygenase SimC5 from Streptomyces antibioticus (456 aa); Q53353|S65040 LIGNOSTILBENE-ALPHA,BETA-DIOXYGENASE (EC 1.13.11.43) from Pseudomonas paucimobilis (485 aa), FASTA scores: opt: 310, E(): 3.4e-20, (28.9% identity in 495 aa overlap); etc. Also some similarity with Rv0654|MTCI376.22 PROBABLE DIOXYGENASE from Mycobacterium tuberculosis (501 aa). Protein product from Mb0937c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0937c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX62" /db_xref="InterPro:IPR004294" /db_xref="UniProtKB/TrEMBL:A0A1R3XX62" /protein_id="SIT99535.1" /translation="MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGE VPADLDGIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFLAE NEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTSFYQCGDLYRI DPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFNYSKQEPYMRYGVVDQNNE LVHYVDVPLPGPRLLHDMAFTENYVILNDFPLFWDPRLLERDVHLPRFYPEIPSRFAV VARRGNDIRWFEADPTFVLHFTNAYEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFL ALDRLQSRLHRWRLNMVTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWF LFDGLVKHDLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAVGL" CDS complement(1019193..1020431) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0938C" /product="POSSIBLE LIPID CARRIER PROTEIN OR KETO ACYL-COA THIOLASE" /note="Mb0938c, -, len: 412 aa. Equivalent to Rv0914c, len: 412 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 412 aa overlap). Possible lipid carrier protein or keto acyl-CoA thiolase (EC 2.3.1.16), highly similar to NP_421905.1|NC_002696 thiolase family protein from Caulobacter crescentus (407 aa); and similar to others e.g. NP_107896.1|NC_002678 3-ketoacyl-CoA thiolase from Mesorhizobium loti (392 aa); NP_385796.1|NC_003047 PUTATIVE 3-KETOACYL-COA THIOLASE PROTEIN from Sinorhizobium meliloti (389 aa); NP_275932.1|NC_000916 lipid-transfer protein (sterol or nonspecific) from Methanothermobacter thermautotrophicus (383 aa); AB55378.1|AL117263 possible 3-ketoacyl-CoA thiolase from Leishmania major (441 aa), FASTA scores: opt: 547, E(): 3.1e-26, (31.0% identity in 435 aa overlap); etc. Also similar to Rv2790c, Rv1627c, Rv0244, etc from Mycobacterium tuberculosis. COULD BELONG TO THE THIOLASE FAMILY. Protein product from Mb0938c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0938c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWT3" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT3" /protein_id="SIT99536.1" /translation="MDDGVWILGGYQSDFARNLSKENRDFADLTREVVDGTLTAAKVD AADLAAAGVVHVANAFGEMFARQGHLGAMPATVCDDLWDTPATRHEAACASGSVATLA AMADLRSGAYRVALVVGLELEKTVPGDTAAEHLSAAAWTGHEGAEARYLWPSMFAQVA DEYDRRYGLDDTHLRAIAQLNFANARRNPNAQTRGWTIPDPITDDDATNPLTEGRLRR FDCSQMTDGGAGLVLVSDAYLRDHRDARPIGRIDGWGHRTVGLGLRQKLDRVAQGDSA PYLLPHVRATVLDALRRARVTLDDLDGIEVHDCFTPSEYLAIDHIGLTGPGESWKAIE NGEIEIGGRLPINPSGGLIGGGHPVGASGVRMLLDAAKQVSGIAGDYQVENAEAFGTL NFGGSTATTVSFVVSTTRGS" CDS complement(1020524..1021795) /codon_start=1 /transl_table=11 /gene="PPE14" /locus_tag="BQ2027_MB0939C" /product="ppe family protein ppe14" /note="Mb0939c, PPE14, len: 423 aa. Equivalent to Rv0915c, len: 423 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 423 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. Rv1807 from Mycobacterium tuberculosis (403 aa), FASTA scores: opt: 966, E(): 4.4e-30, (45.7% identity in 392 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. Mb0939c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT8" /protein_id="SIT99537.1" /translation="MDFGLLPPEVNSSRMYSGPGPESMLAAAAAWDGVAAELTSAAVS YGSVVSTLIVEPWMGPAAAAMAAAATPYVGWLAATAALAKETATQARAAAEAFGTAFA MTVPPSLVAANRSRLMSLVAANILGQNSAAIAATQAEYAEMWAQDAAVMYSYEGASAA ASALPPFTPPVQGTGPAGPAAAAAATQAAGAGAVADAQATLAQLPPGILSDILSALAA NADPLTSGLLGIASTLNPQVGSAQPIVIPTPIGELDVIALYIASIATGSIALAITNTA RPWHIGLYGNAGGLGPTQGHPLSSATDEPEPHWGPFGGAAPVSAGVGHAALVGALSVP HSWTTAAPEIQLAVQATPTFSSSAGADPTALNGMPAGLLSGMALASLAARGTTGGGGT RSGTSTDGQEDGRKPPVVVIREQPPPGNPPR" CDS complement(1021810..1022109) /codon_start=1 /transl_table=11 /gene="PE7" /locus_tag="BQ2027_MB0940C" /product="pe family protein pe7" /note="Mb0940c, PE7, len: 99 aa. Equivalent to Rv0916c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 99 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to many e.g. Rv1788 from Mycobacterium tuberculosis (99 aa), FASTA scores: opt: 321, E(): 1.3e-11, (53.5% identity in 99 aa overlap); etc. Mb0940c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XWS4" /protein_id="SIT99538.1" /translation="MSFVTIQPVVLAAATGDLPTIGTAVSARNTAVCAPTTGVLPPAA NDVSVLTAARFTAHTKHYRVVSKPAALVHGMFVALPAATADAYATTEAVNVVATG" CDS 1022553..1024334 /codon_start=1 /transl_table=11 /gene="betP" /locus_tag="BQ2027_MB0941" /product="POSSIBLE GLYCINE BETAINE TRANSPORT INTEGRAL MEMBRANE PROTEIN BETP" /note="Mb0941, betP, len: 593 aa. Equivalent to Rv0917, len: 593 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 593 aa overlap). Possible betP, glycine betaine transporter, integral membrane protein, highly similar to many transporters, mainly glycine betaine transporters, e.g. P54582|BETP_CORGL glycine betaine transporter from Corynebacterium glutamicum (Brevibacterium flavum) (595 aa), FASTA scores: opt: 1367, E(): 0, (42.7% identity in 504 aa overlap); T35264 probable BCCT family transporter from Streptomyces coelicolor (578 aa); NP_243511.1|NC_002570 glycine betaine transporter from Bacillus halodurans (504 aa); NP_439848.1|NC_000907 high-affinity choline transport protein (betT) from Haemophilus influenzae (669 aa); etc. SEEMS TO BELONG TO THE BCCT (TC 2.33) FAMILY OF TRANSPORTERS. Mb0941 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63696" /db_xref="InterPro:IPR000060" /db_xref="InterPro:IPR018093" /db_xref="UniProtKB/Swiss-Prot:P63696" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99539.1" /translation="MSAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSV AENAFVRLNSAITGGVGWWYILVATGFVVFALYCGISRIGTIRLGRDDELPEFSFWAW LAMLFSAGMGIGLVFYGVAEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAW AIYVVVGLGMAYMTYRRGRPLSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVAT SLGFGITQIASGLEYLGWIRVDNWWMVGMIAAITATATASVVSGVSKGLKWLSNINMA LAAALALFVLLLGPTLFLLQSWVQNLGGYVQSLPQFMLRTAPFSHDGWLGDWTIFYWG WWISWAPFVGMFIARISRGRTIREFIGAVLLVPTVIASLWFTIFGDSALLRQRNNGDM LVNGAVDTNTSLFRLLDGLPIGAITSVLAVLVIVFFFVTSSDSGSLVIDILSAGGELD PPKLTRVYWAVLEGVAAAVLLLIGGAGSLTALRTAAIATALPFSIVMVVACYAMTKAF HFDLAATPRLLHVTVPDVVAAGNRRRHDISATLSGLIAVRDVDSGTYIVHPDTGALTV TAPPDPLDDHVFESDRHVTRRNTTSSR" CDS 1024677..1025153 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0942" /product="link to N-acetyltransferase activity" /note="Mb0942, -, len: 158 aa. Equivalent to Rv0918, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 158 aa overlap). Conserved hypothetical protein, similar in part to Q50116 hypothetical protein from Mycobacterium leprae (44 aa), FASTA scores: opt: 132, E(): 0.0055, (65.6% identity in 32 aa overlap). Also some similarity in C-terminus with other hypothetical proteins e.g. NP_289961.1|NC_002655 hypothetical protein from Escherichia coli strain O157:H7 (94 aa); etc. Mb0942 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWU5" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR014795" /db_xref="InterPro:IPR016547" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99540.1" /translation="MHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSL GGSGLPALNCTPIRHTIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFT VTAALAHARDVLADRRLFVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG" CDS 1025150..1025650 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0943" /product="gcn5-related n-acetyltransferase" /note="Mb0943, -, len: 166 aa. Equivalent to Rv0919, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 166 aa overlap). Conserved hypothetical protein, some similarity to Q50115 hypothetical protein from Mycobacterium leprae (90 aa), FASTA scores: opt: 243, E(): 5.3e-11, (56.5% identity in 85 aa overlap). Mb0943 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWU2" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU2" /protein_id="SIT99541.1" /translation="MSGYSAPRRISDADDVTSFSSGEPSLDDYLRKRALANHVQGGSR CFVTCRDGRVVGFYALASGSVAHADAPGRVRRNMPDPVPVILLSRLAVDRKEQGRGLG SHLLRDAIGRCVQAADSIGLRAILVHALHDEARAFYVHFDFEISPTDPLHLMLLMKDA RALIGD" tRNA complement(1025781..1025853) /locus_tag="BQ2027_ARGT" /product="tRNA-Arg" /note="argT, len: 73 nt. Equivalent to argT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Arg, anticodon cct." mobile_element complement(1025924..1027359) /mobile_element_type="insertion sequence:IS1554" /locus_tag="BQ2027_IS1554" /note="IS1554, len: 1436 nt. Equivalent to IS1554, len: 1436 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 1436 nt overlap). Putative IS element bounded by 15 bp inverted repeats." repeat_region 1025924..1025938 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,ATTCGGTGTAAGTGG, flanking IS element IS1554." gene complement(1025924..1027359) /locus_tag="BQ2027_IS1554" CDS complement(1025963..1027282) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0944C" /product="probable transposase" /note="Mb0944c, -, len: 439 aa. Equivalent to Rv0920c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 439 aa overlap). Probable transposase for IS1554, highly similar to others e.g. MTCY441.35|Q45111 transposase from Mycobacterium tuberculosis (419 aa), FASTA scores: opt: 1113, E(): 0, (43.9% identity in 378 aa overlap); etc. Contains transposases mutator family signature (PS01007). Mb0944c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYX4" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/TrEMBL:A0A1R3XYX4" /protein_id="SIT99542.1" /translation="MDAAQVIEPAHAGQDVDEAAVAARELSGAERALVGDLVRQARAE GVALTGPDGLLKALTKTVLEAALQEEMTEHLGYDRHAAAGRGSGNSRNGSRNKKVITD ACGQVEIAVPRDRNGTFEPVIVGKRKRRVTDVDRVVLSLYAKGLTTGEIAAHFADVYG VSVSKDTISRITDRVIEEMQAWWSRPLEKVYAAVFIDAIMVKIRDGQVRNRPVYAAIG VDLDGHKDILGMWAGEGDGESAKFWLAVLTELRNRGVKDIFFLVCDGLKGLPDSVSAA FPLATVQTCIIHLIRNTFRYASRKYWDKISVDLKPIYTAASAAEARLRYEEFAEKWGK PYPAITRLWDSAWEEFIPFLDYDVEIRRVPCSTNAIESLNARYRRAVRARGHFPNEQS ALKTLYLVTRSLDPKGTGQTKWAVRWKPALNALAITFADRMPAAEER" repeat_region complement(1027345..1027359) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,ATTCGGTGTAAGTGG, flanking IS element IS1554." mobile_element 1027507..1029829 /mobile_element_type="insertion sequence:IS1535" /locus_tag="BQ2027_IS1535" /note="IS1535, len: 2323 nt. Equivalent to IS1535, len: 2300 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 2300 nt overlap). Putative IS element bounded by 16 bp inverted repeats." repeat_region 1027507..1027555 /rpt_type=INVERTED /note="49 bp imperfect inverted repeat, IRL,GGGTTCAGAGCTGTTGCGTGTTGAGTGTGTTTTAGTGTGCGTTAGTGTG, flanking IS element IS1535." gene 1027507..1029829 /locus_tag="BQ2027_IS1535" repeat_region 1027526..1027542 /rpt_type=INVERTED /note="17 bp perfect inverted repeat, IRL,GTTGAGTGTGTTTTAGT, flanking IS element IS1535." CDS 1027570..1028151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0945" /product="POSSIBLE RESOLVASE" /note="Mb0945, -, len: 193 aa. Equivalent to Rv0921, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 193 aa overlap). Possible resolvase for IS1535, highly similar to many bacterial resolvases e.g. MTCY274.17c|YX1C_MYCTU Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 537, E(): 5.7e-29, (51.8% identity in 166 aa overlap). Presents an helix turn helix motif. Mb0945 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXT1" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="InterPro:IPR041718" /db_xref="UniProtKB/TrEMBL:A0A1R3XXT1" /protein_id="SIT99543.1" /translation="MNLADWAESVGVNRHTAYRWFREGTLPVPAERVGRLILVKTAAS ASAAAAGVVLYARVSSHDRRSDLDRQVARLTAWATERDLGVGQVVCEVGSGLNGKRPK LRRILSDPDARVIVVEHRDRLARFGVEHLEAALSAQGRRIVVADPGETTDDLVCDMIE VLTGMCARLYGRRGARNRAMRAVTEAKREPGAG" CDS 1028151..1029803 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0946" /product="possible transposase" /note="Mb0946, -, len: 550 aa. Equivalent to Rv0922, len: 550 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 550 aa overlap). Possible transposase for IS1535, similar to many e.g. YX16_MYCTU|Q10809|MTCY274.16c from Mycobacterium tuberculosis (460 aa), FASTA scores: opt 939, E(): 0, (40.6% identity in 465 aa overlap); etc. Mb0946 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001959" /db_xref="UniProtKB/TrEMBL:A0A1R3XWV7" /protein_id="SIT99544.1" /translation="MIVRMRSCAQAAKVAEATGGVQLAGKPKPDGTPTFSRYVEIGVD FEAHRPVVESVSVLFELYDGDANSYAATGGPGAQLPSGWMVTAAKFEVEWPADPQRAG LVRSHFGARRKAFNWGLAQVKADLDAKAADPAHESVDWDLKSLRWAWNRAKDDVAPWW AENSKECYSSGLADLAQGLANWKAGKNGTRKGRRVGFPRFKSGRRDPGRVRFTTGTMR IEDDRRTITVPVIGPLRAKENTRRVQRHLVSGRAQILNMTLSQRWGRLFVAVCYALRT PTTRSPLTQPTVRAGMDLGVRTLATVATLDTATGEQTIIEYPNPAPLKATLVARRRAG RELSRRIPGSHGHRAVKAKLARLDRRCVHLRREAAHQLTTELAGTYGQVVIEDLDVAA MKRSMRRRAFRRSVSDAAMGLVAPQLAYKTAKCSGVLTVADRWFASSQIHHGCTSPDG TPCRLQGKGRIDKHLLCPVTGEVVDRDRNAALNLRDWPDNASRGPVGTTAPSAPGPTT TVGTGHGADTGSSGAGGASVRPRPRRAGRGEAKTQTPQGDAA" repeat_region complement(1029782..1029829) /rpt_type=INVERTED /note="49 bp imperfect inverted repeat, IRR,CCGTTGAGTGTGTTTTAGTTGCACTCTCATGCGGCGTCCCCTTTGCGGG, flanking IS element IS1535." repeat_region complement(1029811..1029827) /rpt_type=INVERTED /note="17 bp perfect inverted repeat, IRR,GTTGAGTGTGTTTTAGT, flanking IS element IS1535." CDS complement(1029979..1031043) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0947C" /product="Phage-related replication protein" /note="Mb0947c, -, len: 354 aa. Equivalent to Rv0923c, len: 354 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 354 aa overlap). Conserved hypothetical protein, showing similarity with C-terminal part of AF034138|AF034138_7|yjoB HYPOTHETICAL PROTEIN from Bacillus subtilis (200 aa), FASTA scores: opt: 193, E(): 4.2e-05, (32.3% identity in 167 aa overlap). Protein product from Mb0947c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0947c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX69" /db_xref="InterPro:IPR008585" /db_xref="InterPro:IPR013024" /db_xref="InterPro:IPR017939" /db_xref="InterPro:IPR036568" /db_xref="InterPro:IPR038128" /db_xref="UniProtKB/TrEMBL:A0A1R3XX69" /protein_id="SIT99545.1" /translation="MPDRRHPYFAYGSNLCAHQMASRCPDAGAPRPAVLSDHNWLINQ RGVATVEPFAGNKVHGVLWQLSERDLVRLDSAEGVPVRYRRERLTVHTDDTALPAWVY IDHRVMPGRPRPGYLPRVIDGARHHGLPQRWIDYLHRWDPARWPLPVLPSSRSGPAPQ SLSELLSQPGVIETSQLRSRFGFLAIHGGGLEQVTDLIAERSAEAAGASVYLLRHPDN YPHHLPSARFDPAESARLAEFLDHVDVAVSLHGYDRIGRSTQLLAGGRNRALAAHLAR HIQLPGYRVVTDLAAIPEELRGLHPDNPVNRVRDGGTQLELSIRVRGLGPRSTLPGVG GMSPVTATLVQGLVTAARSW" CDS complement(1031044..1033098) /codon_start=1 /transl_table=11 /gene="mntH" /locus_tag="BQ2027_MB0948C" /standard_name="Nramp; Mramp" /product="conserved protein" /note="Mb0948c, mntH, len: 685 aa. Similar to Rv0924c (mntH) and Rv0925c, len: 428 aa and 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 428 aa overlap and 93.2% identity in 237 aa overlap). mntH (alternative gene name: Nramp, Mramp), H+-dependent divalent cation-transport integral membrane protein (see citations below), equivalent to O69443|MNTH_MYCBO PROBABLE MANGANESE TRANSPORT PROTEIN MNTH (BRAMP) from Mycobacterium bovis (415 aa); and NP_302396.1|NC_002677 probable manganese transport protein from Mycobacterium leprae (426 aa). Also similar (but longer 51 aa in N-terminus) to AAA63075.1|U15184 SMF2 protein from Mycobacterium leprae (377 aa), FASTA scores: opt: 1780, E(): 0, (74.5% identity in 376 aa overlap). Also similar to many orthologues of the eukaryotic Nramp (natural resistance-associated macrophage protein), also known as mntH, e.g. NP_456951.1|NC_003198 manganese transport protein MntH from Salmonella enterica subsp. enterica serovar Typhi (413 aa); etc. BELONGS TO THE NRAMP FAMILY. And Conserved hypothetical protein, similar to AL132991|SCF55_19 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (197 aa), FASTA scores: opt: 459, E(): 1.2e-23, (39.3% identity in 201 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0924c and Rv0925c exist as 2 genes. In Mycobacterium bovis, a single base deletion (t-*) leads to a single product. Protein product from Mb0948c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0948c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O69443" /db_xref="InterPro:IPR001046" /db_xref="InterPro:IPR005025" /db_xref="InterPro:IPR029039" /db_xref="UniProtKB/Swiss-Prot:O69443" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99546.1" /translation="MTTTSDQNAAAPPRFDGLRALFINATLKRSPELSHTDGLIERSS GIMREHGVQVDTLRAVDHDIATGVWPDMTEHGWATDEWPALYRRVLDAHILVLCGPIW LGDNSSVMKRVIERLYACSSLLNEDGQYAYYGRAGGCLITGNEDGVKHCAMNVLYSLQ HLGYTIPPQADAGWIGEAGPGPSYLDPGSGGPENDFTNRNTTFMTFNLMHIAQMLRAP AASRPTAINAPSGTPAAGRTSPTPTTDSQRRSRVARWLVAGEFRLLSHLCSRGSKVGE LAQDTRTSLKTSWYLLGPAFVAAIAYVDPGNVAANVSSGAQFGYLLLWVIVAANVMAA LVQYLSAKLGLVTGRSLPEAIGKRMGRPARLAYWAQAEIVAMATDVAEVIGGAIALRI MFNLPLPIGGIITGVVSLLLLTIQDRRGQRLFERVITALLLVIAIGFTASFFVVTPPP NAVLGGLAPRFQGTESVLLAAAIMGATVMPHAVYLHSGLARDRHGHPDPGPQRRRLLR VTRWDVGLAMLIAGGVNAAMLLVAALNMRGRGDTASIEGAYHAVHDTLGATIAVLFAV GLLASGLASSSVGAYAGAMIMQGLLHWSVPMLVRRLITLGPALAILTLGFDPTRTLVL SQVVLSFGIPFAVLPLVKLTGSPAVMGGDTNHRATTWVGWVVAVMVSLLNVMLIYLTV TG" CDS complement(1033175..1034251) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0949C" /product="Oxidoreductase" /note="Mb0949c, -, len: 358 aa. Equivalent to Rv0926c, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 358 aa overlap). Conserved hypothetical protein, similar to Rv1059 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (354 aa). Also shows some similarity to AF170923|AF170923_3 dihydrodipicolinate reductase from Mastigocladus laminosus (278 aa), FASTA scores: opt: 170, E(): 0.00088, (25.7% identity in 276 aa overlap). Protein product from Mb0949c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0949c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU8" /protein_id="SIT99547.1" /translation="MAIPVVQLGTGNVGVHSLRALIADPEFELTGVWVSSDAKAGKDA AELAGLADSTGVRASTDLNAVLATGPRCAVYNAMADNRLPEALEDYRRILAAGINIVG SGPVFLQYPWQVIPDEIIKPLQDAARAGNSSLYVNGIDPGFANDLLPMALAGTCESIE QIRCMEIVDYATYDSAVVMFDVMGFGKPMDQIPMLLQPGVLSLAWGSVVRQLAAGLGI SLDGVEEMYVREPAPEAFNIASGHIPKGSAAALRFEVLGLVDGVPAVVLEHVTRLRAD LCPEWPQPAQPGGSYRIEISGEPCYAMDICLSSRHGDHNHAGLVATAMRIVNAIPAVV AAEPGIRTTLDLPLITGEGRYAAA" CDS complement(1034305..1035096) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0950C" /product="probable short-chain type dehydrogenase/reductase" /note="Mb0950c, -, len: 263 aa. Equivalent to Rv0927c, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 263 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, notably 7-alpha-hydroxysteroid dehydrogenases and glucose 1-dehydrogenases e.g. P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 551, E(): 1e-26, (39.5% identity in 248 aa overlap); NP_252778.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (253 aa); AAC44307.1|U59433 3-ketoacyl-acyl carrier protein reductase from Bacillus subtilis (246 aa); etc. Also similar to other dehydrogenases from Mycobacterium tuberculosis e.g. MTCY09F9.36, E():1.4e-18; MTCY369.14, E():8e-17; MTCY02B10.14, E():2.5e-14; MTCY09F9.23c, E():1.5e-13; MTCY03C7.07, E():1.9e-13. Contains PS00061 Short-chain dehydrogenases/reductases family signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb0950c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0950c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWT1" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XWT1" /protein_id="SIT99548.1" /translation="MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTS SELDAVAEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMPNT LLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMGRLAARGFAAY GTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVVAANDELRAPMEQATPLRR LGDPVDIAAAAVYLASPAGSFLTGKTLEVDGGLTFPNLDLPIPDL" CDS 1035368..1036480 /codon_start=1 /transl_table=11 /gene="pstS3" /locus_tag="BQ2027_MB0951" /standard_name="phoS2" /product="PERIPLASMIC PHOSPHATE-BINDING LIPOPROTEIN PSTS3 (PBP-3) (PSTS3) (PHOS1)" /note="Mb0951, pstS3, len: 370 aa. Equivalent to Rv0928, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 370 aa overlap). pstS3 (previously known as phoS2), phosphate-binding lipoprotein component of inorganic phosphate transport system (see citations below), highly similar to others from Mycobacterium leprae e.g. Q50099|PSTS3|PHOS1 phosphate-binding protein 3 precursor (328 aa), FASTA scores: opt: 1772, E(): 0, (79.6% identity in 328 aa overlap); and highly similar to others e.g. AAF74819.1|AF137360_1|AF137360 periplasmic phosphate permease from Mycobacterium avium (369 aa). Also highly similar to Rv0932c|MTCY08D9.07|pstS2 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN (370 aa); and Rv0934|pstS1 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN (374 aa) from Mycobacterium tuberculosis (Mycobacterium tuberculosis seems to have three PstS-like proteins, others being Rv0932c and Rv0934c). Contains lipoprotein signature (PS00013) at N-terminus. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS. Protein product from Mb0951 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0951 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Y3" /db_xref="InterPro:IPR005673" /db_xref="InterPro:IPR024370" /db_xref="UniProtKB/Swiss-Prot:P0A5Y3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99549.1" /translation="MKLNRFGAAVGVLAAGALVLSACGNDDNVTGGGATTGQASAKVD CGGKKTLKASGSTAQANAMTRFVNVFEQACPGQTLNYTANGSGAGISEFNGNQTDFGG SDVPLSKDEAAAAQRRCGSPAWNLPVVFGPIAVTYNLNSVSSLNLDGPTLAKIFNGSI TQWNNPAIQALNRDFTLPGERIHVVFRSDESGTTDNFQRYLQAASNGAWGKGAGKSFQ GGVGEGARGNDGTSAAAKNTPGSITYNEWSFAQAQHLTMANIVTSAGGDPVAITIDSV GQTIAGATISGVGNDLVLDTDSFYRPKRPGSYPIVLATYEIVCSKYPDSQVGTAVKAF LQSTIGAGQSGLGDNGYIPIPDEFKSRLSTAVNAIA" CDS 1036493..1037467 /codon_start=1 /transl_table=11 /gene="pstC2" /locus_tag="BQ2027_MB0952" /product="PHOSPHATE-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER PSTC2" /note="Mb0952, pstC2, len: 324 aa. Equivalent to Rv0929, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 324 aa overlap). pstC2, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_302394.1|NC_002677 membrane-bound component of phosphate transport from Mycobacterium leprae (319 aa); CAB88474.1|AL353816 phosphate ABC transport system permease protein from Streptomyces coelicolor (336 aa); NP_290359.1| NC_002655 high-affinity phosphate-specific transport system (cytoplasmic membrane component) from Escherichia coli strain O157:H7 (319 aa); etc. Also similar to Rv935|MTCY08D9.04c|PSTC1 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (338 aa). Contains binding-protein-dependent transport systems inner membrane component signature (PS00402). Mb0952 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A631" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR011864" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P0A631" /protein_id="SIT99550.1" /translation="MVTEPLTKPALVAVDMRPARRGERLFKLAASAAGSTIVIAILLI AIFLLVRAVPSLRANHANFFTSTQFDTSDDEQLAFGVRDLFMVTALSSITALVLAVPV AVGIAVFLTHYAPRRLSRPFGAMVDLLAAVPSIIFGLWGIFVLAPKLEPIARFLNRNL GWLFLFKQGNVSLAGGGTIFTAGIVLSVMILPIVTSISREVFRQTPLIQIEAALALGA TKWEVVRMTVLPYGRSGVVAASMLGLGRALGETVAVLVILRSAARPGTWSLFDGGYTF ASKIASAASEFSEPLPTGAYISAGFALFVLTFLVNAAARAIAGGKVNG" CDS 1037464..1038378 /codon_start=1 /transl_table=11 /gene="pstA1" /locus_tag="BQ2027_MB0953" /product="PROBABLE PHOSPHATE-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER PSTA1" /note="Mb0953, pstA1, len: 304 aa. Equivalent to Rv0930, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 304 aa overlap). Probable pstA1, phosphate-transport integral membrane ABC transporter (see citation below), highly similar to others e.g. NP_302393.1|NC_002677 membrane-bound component of phosphate transport from Mycobacterium leprae (304 aa); CAB88473.1|AL353816 phosphate ABC transport system permease protein from Streptomyces coelicolor (354 aa) (N-terminus longer); NP_312689.1|NC_002695 phosphate transport system permease protein PstA from Escherichia coli strain O157:H7 (296 aa); etc. Also similar to Rv0936|MTCY08D9.03c|PSTA2 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (301 aa). Mb0953 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWV3" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR005672" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3XWV3" /protein_id="SIT99551.1" /translation="MSPSTSIEALDQPVKPVVFRPLTLRRRIKNSVATTFFFTSFVVA LIPLVWLLWVVIARGWFAVTRSGWWTHSLRGVLPEQFAGGVYHALYGTLVQAGVAAVL AVPLGLMTAVYLVEYGTGRMSRVTTFTVDVLAGVPSIVAALFVFSLWIATLGFQQSAF AVALALVLLMLPVVVRAGEEMLRLVPDELREASYALGVPKWKTIVRIVAPIAMPGIVS GILLSIARVVGETAPVLVLVGYSHSINLDVFHGNMASLPLLIYTELTNPEHAGFLRVW GAALTLIIVVATINLAAAMIRFVATRRR" CDS complement(1038385..1039494) /codon_start=1 /transl_table=11 /gene="pknDb" /locus_tag="BQ2027_MB0954C" /standard_name="mbk" /product="TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE D PKNDb [SECOND PART](PROTEIN KINASE D) (STPK D)" /note="Mb0954c, pknDb, len: 369 aa. Equivalent to 3' end of Rv0931c, len: 664 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 369 aa overlap). pknD (alternate gene name: mbk), transmembrane serine/threonine protein kinase (EC 2.7.1.-) (see citations below), equivalent to CAB62227.1|AJ250200 putative serine/threonine protein kinase from Mycobacterium bovis BCG (291 aa); and highly similar in N-terminus to P54744|PKNB_MYCLE probable serine/threonine-specific protein kinase (EC 2.7.1.-) from Mycobacterium leprae (622 aa). Also highly similar to others, particularly in N-terminal half e.g. NP_243370.1|NC_002570 serine/threonine protein kinase from Bacillus halodurans (664 aa); NP_268044.1|NC_002662 serine/threonine protein kinase from Lactococcus lactis (627 aa); etc. Also highly similar to other serine/threonine protein kinases from Mycobacterium tuberculosis e.g. pknH (626 aa), FASTA scores: opt: 1398, E: 0, (49.3% identity in 540 aa overlap); pknE (566 aa); pknB (626 aa); Rv3524 (343 aa); etc. Contains Hank's kinase subdomain. Contains two transmembrane segments, which flank a highly repetitive region, suggesting a receptor-like anchoring. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation on a serine residue. Appears to be co-transcribed with Rv0932c|pstS2. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pknD exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits pknD into 2 parts, pknDa and pknDb. Protein product from Mb0954c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0954c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYY6" /db_xref="InterPro:IPR001258" /db_xref="InterPro:IPR011042" /db_xref="InterPro:IPR013017" /db_xref="InterPro:IPR035016" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99552.1" /translation="MLATPADTGLSQSESGIAGAGTGPPTPGAARWSPGDSATVAGPL AADSRGGNWPSQTGHSPAVPNALQASLGHAVPPAGNKRKVWAVVGAAAIVLVAIVAAA GYLVLRPSWSPTQASGQTVLPFTGIDFRLSPSGVAVDSAGNVYVTSEGMYGRVVKLAT GSTGTTVLPFNGLYQPQGLAVDGAGTVYVTDFNNRVVTLAAGSNNQTVLPFDGLNYPE GLAVDTQGAVYVADRGNNRVVKLAAGSKTQTVLPFTGLNDPDGVAVDNSGNVYVTDTD NNRVVKLEAESNNQVVLPFTDITAPWGIAVDEAGTVYVTEHNTNQVVKLLAGSTTSTV LPFTGLNTPLAVAVDSDRTVYVADRGNDRVVKLTS" CDS complement(1039505..1040380) /codon_start=1 /transl_table=11 /gene="pknDa" /locus_tag="BQ2027_MB0955C" /standard_name="mbk" /product="TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE D PKNDa [FIRST PART] (PROTEIN KINASE D) (STPK D)" /note="Mb0955c, pknDa, len: 291 aa. Equivalent to 5' end of Rv0931c, len: 664 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 276 aa overlap). pknD (alternate gene name: mbk), transmembrane serine/threonine protein kinase (EC 2.7.1.-) (see citations below), equivalent to CAB62227.1|AJ250200 putative serine/threonine protein kinase from Mycobacterium bovis BCG (291 aa); and highly similar in N-terminus to P54744|PKNB_MYCLE probable serine/threonine-specific protein kinase (EC 2.7.1.-) from Mycobacterium leprae (622 aa). Also highly similar to others, particularly in N-terminal half e.g. NP_243370.1|NC_002570 serine/threonine protein kinase from Bacillus halodurans (664 aa); NP_268044.1|NC_002662 serine/threonine protein kinase from Lactococcus lactis (627 aa); etc. Also highly similar to other serine/threonine protein kinases from Mycobacterium tuberculosis e.g. pknH (626 aa), FASTA scores: opt: 1398, E: 0, (49.3% identity in 540 aa overlap); pknE (566 aa); pknB (626 aa); Rv3524 (343 aa); etc. Contains Hank's kinase subdomain. Contains two transmembrane segments, which flank a highly repetitive region, suggesting a receptor-like anchoring. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation on a serine residue. Appears to be co-transcribed with Rv0932c|pstS2. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pknD exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits pknD into 2 parts, pknDa and pknDb. Mb0955c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXU1" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR017441" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99553.1" /translation="MSDAVPQVGSQFGPYQLLRLLGRGGMGEVYEAEDTRKHRVVALK LISPQYSDNAVFRARMQREADTAGRLTEPHIVPIHDYGEINGQFFVEMRMIDGTSLRA LLKQYGPLTPARAVAIVRQIAAALDAAHANGVTHRDVKPENILVTASDFAYLVDFGIA RAASDPGLTQTGTAVGTYNYMAPERFTGDEVTYRADIYALACVLGECLTGAPPYRADS VERLIAAHLMDPAPQPSQLRPGRVPPALDQVIAKGMAKNPAERFMSAGDLAIAAHDAL NHIRATPGHDDSAAR" CDS complement(1040402..1041514) /codon_start=1 /transl_table=11 /gene="pstS2" /locus_tag="BQ2027_MB0956C" /product="PERIPLASMIC PHOSPHATE-BINDING LIPOPROTEIN PSTS2 (PBP-2) (PSTS2)" /note="Mb0956c, pstS2, len: 370 aa. Equivalent to Rv0932c, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 370 aa overlap). pstS2, phosphate-binding lipoprotein component of inorganic phosphate transport system (see citations below), highly similar to AAF74819.1|AF137360_1|AF137360 periplasmic phosphate permease from Mycobacterium avium (369 aa); Rv0928|MTCY21C12.22|pstS3 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (370 aa), FASTA scores: opt: 1601, E(): 0, (64.5% identity in 372 aa overlap); and Rv0934|MTCY08D9.05c|pstS1 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (374 aa) (M. tuberculosis seems to have three PstS-like proteins, others being Rv0928 and Rv0934c). Also highly similar to MTCY08D9.05c|P15712|PAB_MYCTU PROTEIN ANTIGEN B PRECURSOR from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 460, E(): 2.7e-20, (31.2% identity in 375 aa overlap). Contains prokaryotic membrane lipoprotein lipid attachment site (PS00013) at N-terminus so the leader peptide of 22 aa is probably removed. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS. Appears to be co-transcribed with Rv0931c|pknD|mbk. Protein product from Mb0956c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0956c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWW7" /db_xref="InterPro:IPR005673" /db_xref="InterPro:IPR024370" /db_xref="UniProtKB/TrEMBL:A0A1R3XWW7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99554.1" /translation="MKFARSGAAVSLLAAGTLVLTACGGGTNSSSSGAGGTSGSVHCG GKKELHSSGSTAQENAMEQFVYAYVRSCPGYTLDYNANGSGAGVTQFLNNETDFAGSD VPLNPSTGQPDRAAERCGSPAWDLPTVFGPIAITYNIKGVSTLNLDGPTTAKIFNGTI TVWNDPQIQALNSGTDLPPTPISVIFRSDKSGTSDNFQKYLDGASNGAWGKGASETFN GGVGVGASGNNGTSALLQTTDGSITYNEWSFAVGKQLNMAQIITSAGPDPVAITTESV GKTIAGAKIMGQGNDLVLDTSSFYRPTQPGSYPIVLATYEIVCSKYPDATTGTAVRAF MQAAIGPGQEGLDQYGSIPLPKSFQAKLAAAVNAIS" CDS 1041730..1041945 /codon_start=1 /transl_table=11 /gene="pstBa" /locus_tag="BQ2027_MB0957" /product="PHOSPHATE-TRANSPORT PROTEIN ABC TRANSPORTER PSTBa [FIRST PART]" /note="Mb0957, pstBa, len: 71 aa. Equivalent to the 5' end of Rv0933, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv (98.4% identity in 64 aa overlap). pstB, phosphate-transport ATP-binding protein ABC transporter (see citations below), thermostable ATPase, highly similar to others e.g. NP_348334.1|NC_003030 ATPase component of ABC-type phosphate transport system from Clostridium acetobutylicum (249 aa); NP_212352.1|NC_001318 phosphate ABC transporter ATP-binding protein (pstB) from Borrelia burgdorferi (260 aa); NP_390375.1|NC_000964 phosphate ABC transporter (ATP-binding protein) from Bacillus subtilis (269 aa), FASTA scores: opt: 762, E(): 0, (47.7% identity in 243 aa overlap); etc. Also similar to other M. tuberculosis ABC transporters e.g. MTCY253.24, E(): 2.5e-15 and MTCY359.14c, E(): 3.4e-15. Contains PS00211 ABC transporters family signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Magnesium or calcium seem to have no influence on the functionality of this enzyme. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pstB exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits pstB into 2 parts, pstBa and pstBb. Mb0957 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U0Z9" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005670" /db_xref="InterPro:IPR015850" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:Q7U0Z9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99555.1" /translation="MACERLGGQSGAADVDAAAPAMAAVNLTLGFAGKTVLDQVSMGF PARAVTSLMGPTGSGKTTFFAHPKPDE" CDS 1041977..1042561 /codon_start=1 /transl_table=11 /gene="pstBb" /locus_tag="BQ2027_MB0958" /product="PHOSPHATE-TRANSPORT PROTEIN ABC TRANSPORTER PSTBb [SECOND PART]" /note="Mb0958, pstBb, len: 213 aa. Equivalent to the 3' end of Rv0933, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). pstB, phosphate-transport ATP-binding protein ABC transporter (see citations below), thermostable ATPase, highly similar to others e.g. NP_348334.1|NC_003030 ATPase component of ABC-type phosphate transport system from Clostridium acetobutylicum (249 aa); NP_212352.1|NC_001318 phosphate ABC transporter ATP-binding protein (pstB) from Borrelia burgdorferi (260 aa); NP_390375.1|NC_000964 phosphate ABC transporter (ATP-binding protein) from Bacillus subtilis (269 aa), FASTA scores: opt: 762, E(): 0, (47.7% identity in 243 aa overlap); etc. Also similar to other M. tuberculosis ABC transporters e.g. MTCY253.24, E(): 2.5e-15 and MTCY359.14c, E(): 3.4e-15. Contains PS00211 ABC transporters family signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Magnesium or calcium seem to have no influence on the functionality of this enzyme. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, pstB exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits pstB into 2 parts, pstBa and pstBb. Protein product from Mb0958 detected using SWATH mass spectrometry. Mb0958 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0Z9" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005670" /db_xref="InterPro:IPR015850" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:Q7U0Z9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99556.1" /translation="MLLGGRSIFNYRDVLEFRRRVGMLFQRPNPFPMSIMDNVLAGVR AHKLVPRKEFRGVAQARLTEVGLWDAVKDRLSDSPFRLSGGQQQLLCLARTLAVNPEV LLLDEPTSALDPTTTEKIEEFIRSLADRLTVIIVTHNLAQAARISDRAALFFDGRLVE EGPTEQLFSSPKHAETARYVAGLSGDVKDAKRGN" CDS 1042582..1043706 /codon_start=1 /transl_table=11 /gene="pstS1" /locus_tag="BQ2027_MB0959" /standard_name="phoS1; phoS" /product="PERIPLASMIC PHOSPHATE-BINDING LIPOPROTEIN PSTS1 (PBP-1) (PSTS1)" /note="Mb0959, pstS1, len: 374 aa. Equivalent to Rv0934, len: 374 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 374 aa overlap). pstS1 (previously known as phoS1 or phoS), phosphate-binding lipoprotein component of inorganic phosphate transport system (see first, fourth, fifth and sixth citations below), highly similar to Rv0932c|MTCY08D9.07|pstS2 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (370 aa), FASTA scores: opt: 460, E(): 5.9e-19, (31.2% identity in 375 aa overlap); and Rv0928|MTCY21C12.22|pstS3 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 435, E():1.1e-17, (30.0% identity in 380 aa overlap) (Mycobacterium tuberculosis seems to have three PstS-like proteins, others being Rv0932c and Rv0928c). Also highly similar to MTCY08D9.05c|P15712|PAB_MYCTU PROTEIN ANTIGEN B PRECURSOR from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 2459, E(): 0, (100% identity in 374 aa overlap). Contains a prokaryotic membrane lipoprotein lipid attachment site (PS00013) at the N-terminus so the 23 aa leader peptide sequence is probably removed. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS. Protein product from Mb0959 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0959 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWV8" /db_xref="InterPro:IPR005673" /db_xref="InterPro:IPR024370" /db_xref="UniProtKB/TrEMBL:A0A1R3XWV8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99557.1" /translation="MKIRLHTLLAVLTAAPLLLAAAGCGSKPPSGSPETGAGAGTVAT TPASSPVTLAETGSTLLYPLFNLWGPAFHERYPNVTITAQGTGSGAGIAQAAAGTVNI GASDAYLSEGDMAAHKGLMNIALAISAQQVNYNLPGVSEHLKLNGKVLAAMYQGTIKT WDDPQIAALNPGVNLPGTAVVPLHRSDGSGDTFLFTQYLSKQDPEGWGKSPGFGTTVD FPAVPGALGENGNGGMVTGCAETPGCVAYIGISFLDQASQRGLGEAQLGNSSGNFLLP DAQSIQAAAAGFASKTPANQAISMIDGPAPDGYPIINYEYAIVNNRQKDAATAQTLQA FLHWAITDGNKASFLDQAHFQPLPPAVVKLSDALIATISS" CDS 1043766..1044782 /codon_start=1 /transl_table=11 /gene="pstC1" /locus_tag="BQ2027_MB0960" /product="PHOSPHATE-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER PSTC1" /note="Mb0960, pstC1, len: 338 aa. Equivalent to Rv0935, len: 338 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 338 aa overlap). pstC1, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_104768.1|NC_002678|pstC phosphate ABC transporter permease protein from Mesorhizobium loti (327 aa); NP_245372.1|NC_002663|PstC PstC protein from Pasteurella multocida (320 aa); P45191|PSTC_HAEIN PHOSPHATE TRANSPORT SYSTEM PERMEASE from Haemophilus influenza (315 aa), FASTA scores: opt: 667, E(): 0, (36.2% identity in 309 aa overlap); etc. Also similar to Rv0929|MTCY21C12.23|PSTC2 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (324 aa), FASTA scores: opt: 487, E(): 4.1e-21, (32.3% identity in 303 aa overlap); and shows slight similarity to MTCY08D9.03c|PSTA2|Rv0936 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (301 aa). Contains binding-protein-dependent transport systems inner membrane comp signature (PS00402). Mb0960 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A629" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR011864" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P0A629" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99558.1" /translation="MLARAGEVGRAGPAIRWLGGIGAVIPLLALVLVLVVLVIEAMGA IRLNGLHFFTATEWNPGNTYGETVVTDGVAHPVGAYYGALPLIVGTLATSAIALIIAV PVSVGAALVIVERLPKRLAEAVGIVLELLAGIPSVVVGLWGAMTFGPFIAHHIAPVIA HNAPDVPVLNYLRGDPGNGEGMLVSGLVLAVMVVPIIATTTHDLFRQVPVLPREGAIA LGMSNWECVRRVTLPWVSSGIVGAVVLGLGRALGETMAVAMVSGAVLGAMPANIYATM TTIAATIVSQLDSAMTDSTNFAVKTLAEVGLVLMVITLLTNVAARGMVRRVSRTALPV GRGI" CDS 1044784..1045689 /codon_start=1 /transl_table=11 /gene="pstA2" /locus_tag="BQ2027_MB0961" /product="PHOSPHATE-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER PSTA2" /note="Mb0961, pstA2, len: 301 aa. Equivalent to Rv0936, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 301 aa overlap). pstA2, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_442269.1|NC_000911|PstA phosphate transport system permease protein from Synechocystis sp. strain PCC 6803 (287 aa); NP_232473.1|NC_002506 phosphate ABC transporter permease protein from Vibrio cholerae (289 aa); P07654|PSTA_ECOLI PHOSPHATE TRANSPORT SYSTEM PERMEASE from Escherichia coli (296 aa), FASTA scores: opt: 464, E(): 6.7e-24, (30.5% identity in 282 aa overlap); etc. Also similar to O86345|MTCY21C12.24|PSTA1|Rv0930 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (304 aa), FASTA scores: opt: 369, E(): 6.1e-15, (32.7% identity in 248 aa overlap). Contains binding-protein-dependent transport systems inner membrane comp signature (PS00402). Protein product from Mb0961 detected using SWATH mass spectrometry. Mb0961 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A627" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR005672" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P0A627" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99559.1" /translation="MGESAESGSRQLPAMSPPRRSVAYRRKIVDALWWAACVCCLAVV ITPTLWMLIGVVSRAVPVFHWSVLVQDSQGNGGGLRNAIIGTAVLAIGVILVGGTVSV LTGIYLSEFATGKTRSILRGAYEVLSGIPSIVLGYVGYLALVVYFDWGFSLAAGVLVL SVMSIPYIAKATESALAQVPTSYREAAEALGLPAGWALRKIVLKTAMPGIVTGMLVAL ALAIGETAPLLYTAGWSNSPPTGQLTDSPVGYLTYPIWTFYNQPSKSAQDLSYDAALL LIVFLLLLIFIGRLINWLSRRRWDV" CDS complement(1045666..1046487) /codon_start=1 /transl_table=11 /gene="mku" /locus_tag="BQ2027_MB0962C" /product="dna end-binding protein, mku" /note="Mb0962c, -, len: 273 aa. Equivalent to Rv0937c, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 273 aa overlap). Conserved hypothetical protein, highly similar to others e.g. SC6G9.24c|T35620|AL079356 hypothetical protein from Streptomyces coelicolor (365 aa), FASTA scores: opt: 648, E(): 0, (36.5% identity in 274 aa overlap); Z99110|BSUB0007_223|NP_389224.1|NC_000964 hypothetical proteins from Bacillus subtilis (311 aa), FASTA scores: opt: 623, E(): 1.1e-31, (33.9% identity in 274 aa overlap); O28548|AE000984|AF1726|NP_070554.1|NC_000917 conserved hypothetical protein from Archaeoglobus fulgidus (286 aa), FASTA scores: opt: 583, E(): 0, (36.6% identity in 262 aa overlap); etc. Protein product from Mb0962c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0962c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWW4" /db_xref="InterPro:IPR006164" /db_xref="InterPro:IPR009187" /db_xref="InterPro:IPR016194" /db_xref="UniProtKB/TrEMBL:A0A1R3XWW4" /protein_id="SIT99560.1" /translation="MRAIWTGSIAFGLVNVPVKVYSATADHDIRFHQVHAKDNGRIRY KRVCEACGEVVDYRDLARAYESGDGQMVAITDDDIASLPEERSREIEVLEFVPAADVD PMMFDRSYFLEPDSKSSKSYVLLAKTLAETDRMAIVHFTLRNKTRLAALRVKDFGKRE VMMVHTLLWPDEIRDPDFPVLDQKVEIKPAELKMAGQVVDSMADDFNPDRYHDTYQEQ LQELIDTKLEGGQAFTAEDQPRLLDEPEDVSDLLAKLEASVKARSKANSNVPTPP" CDS 1046603..1048882 /codon_start=1 /transl_table=11 /gene="ligd" /locus_tag="BQ2027_MB0963" /product="atp dependent dna ligase ligd (atp dependent polydeoxyribonucleotide synthase) (thermostable dna ligase) (atp dependent polynucleotide ligase) (sealase) (dna repair enzyme) (dna joinase)" /note="Mb0963, -, len: 759 aa. Equivalent to Rv0938, len: 759 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 759 aa overlap). Possible ATP-dependent DNA ligase (EC 6.5.1.1), with its C-terminus similar to N-terminal parts of many ATP-dependent DNA ligases e.g. NP_250828.1|NC_002516 probable ATP-dependent DNA ligase from Pseudomonas aeruginosa (840 aa); NP_105436.1|NC_002678 ATP-dependent DNA ligase from Mesorhizobium loti (829 aa); CAB92891.1|AL356932 probable ATP-dependent DNA ligase from Streptomyces coelicolor (326 aa); etc. The N-terminal half shows similarity with hypothetical proteins from Mycobacterium tuberculosis Rv0269c and Rv3730c; and the C-terminal half with the DNA ligases Rv3731 and Rv3062. Protein product from Mb0963 detected using SWATH mass spectrometry. Mb0963 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59971" /db_xref="InterPro:IPR012309" /db_xref="InterPro:IPR012310" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR014144" /db_xref="InterPro:IPR014145" /db_xref="InterPro:IPR014146" /db_xref="InterPro:IPR033649" /db_xref="UniProtKB/Swiss-Prot:P59971" /protein_id="SIT99561.1" /translation="MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHI AGRPATRKRWPNGVDQPAFFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLAWI AQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLAEVARAVRDLL ADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVAQRLEQAMPALVTSTMTKS LRAGKVFVDWSQNSGSKTTIAPYSLRGRTHPTVAAPRTWAELDDPALRQLSYDEVLTR IARDGDLLERLDADAPVADRLTRYRRMRDASKTPEPIPTAKPVTGDGNTFVIQEHHAR RPHYDFRLERDGVLVSWAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAG KVIIWDSGTYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQ KVFEFDNLAPMLATHGTVAGLKASQWAFEGKWDGYRLLVEADHGAVRLRSRSGRDVTA EYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGRDTRVEFWAFDLLYLDG RALLGTRYQDRRKLLETLANATSLTVPELLPGDGAQAFACSRKHGWEGVIAKRRDSRY QPGRRCASWVKDKHWNTQEVVIGGWRAGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTG LSERELANLKEMLAPLHTDESPFDVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLR QSSWRGLRPDKKPSEVVRE" CDS 1048879..1050813 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0964" /product="POSSIBLE BIFUNCTIONAL ENZYME: 2-HYDROXYHEPTA-2,4-DIENE-1,7-DIOATE ISOMERASE (HHDD ISOMERASE) + CYCLASE/DEHYDRASE" /note="Mb0964, -, len: 644 aa. Equivalent to Rv0939, len: 644 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 644 aa overlap). Possible bifunctional enzyme, including 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase activity (EC 5.3.3.-), and cyclase/dehydrase activity (EC undetermined). N-terminal part similar to many isomerases e.g. NP_343861.1|NC_002754 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (hpcE-1) from Sulfolobus solfataricus (318 aa); NP_068932.1|NC_000917 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (hpcE-1) from Archaeoglobus fulgidus (324 aa), FASTA scores: opt: 400, E(): 5.8e-15, (33.9% identity in 289 aa overlap); etc. And C-terminal part highly similar to many cyclases/dehydrases e.g. AAK61721.1|AY033994 cyclase-like protein from Streptomyces aureofaciens (305 aa); CAC44204.1|AL593842 cyclase from Streptomyces coelicolor (297 aa), FASTA scores: opt: 375, E(): 2.7e-26, (35.6% identity in 284 aa overlap); NP_343860.1|NC_002754 putative Cyclase/dehydrase from Sulfolobus solfataricus (308 aa); etc. Also similar to Rv2993c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis. Protein product from Mb0964 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0964 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYZ6" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR011234" /db_xref="InterPro:IPR036663" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3XYZ6" /protein_id="SIT99562.1" /translation="MKWVTYRSDHGERTGVLSGDAIYAMPPDVSLLDLVGRGADGLRT AGERAVRSPAAVVALDEVTLAAPIPRPPSIRDSLCFLDHMRNCQEAMGGGRVLMDTWY RIPAFYFACPSTVLGPYDDAPTAPGSAWQDFELEIAAVIGTSGKDLTVEQAERSIIGY TIFNDWSARDLQMLEGQLRIGQAKGKDSGITLGPYLVTPDELEPYCRGGKLSLRVIAL VNGTVIGSGSTAQMDWSFGEVIAYASRGVTLTPGDVFGSGTVPTCTLVEHLRPPESFP GWLHDGDVVTLQVEGLGETRQTVRTSGTPFPLALRPNPDAEPDRRGVNPAPTRVPFTR GLHEVADRVWAWTLPDGGYGFSNAGLVAGDGASLLVDTLFDLALTREMLAAMKPVTER APITDALITHSNGDHTHGTQLLDRSVRIIAAKGTSEEIEHGPAPEMLARIQTADLGPV ATRYLRDRFGHFDFSGIKLRNADLTFDRDLAIELGGRRVDLLNLGPAHTTADSVVHVA DAGVLFAGDLLFIGCTPIVWAGPIANWVAACDAMIALDAPTVVPGHGPVTGPDGIRAV RGYLAHIAEQAEAAYRKGLSLPEAVETIDLGEYASWLDSERVVVNVYQRYRELDPDTP RQDLLALLVMQAEWAARHCT" CDS complement(1051060..1051926) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0965C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0965c, -, len: 288 aa. Equivalent to Rv0940c, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 288 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to hypothetical proteins and oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); AAG52987.1|AF040570|Rif17 putative alkanal monooxygenase from Amycolatopsis mediterranei (356 aa); etc. Also similar to putative oxidoreductases from Mycobacterium tuberculosis such as Rv0953c|P71557|YT21_MYCTU (282 aa), FASTA scores: opt: 311, E(): 3.7e-08, (31.0% identity in 248 aa overlap), Rv3079c (275 aa), Rv0791c (347 aa), etc. Protein product from Mb0965c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0965c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64762" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019921" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/Swiss-Prot:P64762" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99563.1" /translation="MRFSYAEAMTDFTFYIPLAKAAEAAGYSSMTIPDSIAYPFESDS KYPYTPDGNREFMDGKPFIETFVLTAALGAVTTRLRFNFFVLKLPIRPPALVAKQAGS LAALIGNRVGLGVGTSPWPEDYELMGVPFAKRGKRIDECIEIVRGLTTGDYFEFHGEF YDIPKTKMTPAPTQPIPILVGGHADAALRRAARADGWMHGGGDPDELDRLIARVKRLR EEAGKTSPFEIHVISLDGFTVDGVKRLEDKGVTDVIVGFRVPYTMGPDTEPLQTKIRN LEMFAENVIAKV" CDS complement(1052011..1052784) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0966C" /product="Protein serine/threonine kinase" /note="Mb0966c, -, len: 257 aa. Equivalent to Rv0941c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 257 aa overlap). Conserved hypothetical protein, showing some similarity with parts of several hypothetical proteins from Streptomyces coelicolor e.g. AL035161|SC9C7_20 (860 aa), FASTA scores: opt: 197, E(): 2.6e-05, (34.2% identity in 114 aa overlap). Protein product from Mb0966c detected using SWATH mass spectrometry. Mb0966c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR036513" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XWX7" /protein_id="SIT99564.1" /translation="MVAVSTAAKSPTALAIAVRTQDSVVILTADGALDSSSSALLRDS LTRATLEQPSAVIVNVTELQVAEESAWSVFISARWQADFRADVPVLLVCGHRAGRAAV TRTGVARFMPVYPTEKAASKAIGRLARRNFKRSDAQLPANLNSLRESRQLVREWLTQW SRPGLIPVALVVVNVFVENVLKHTGSDPVMRIESDGPTATIAVSDGSSAPAVRLASPP KGIDVSGLAIVAALSRAWGSSPTSSGKTVWAIIGPENQL" CDS 1052827..1053105 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0967" /product="HYPOTHETICAL PROTEIN" /note="Mb0967, -, len: 92 aa. Equivalent to Rv0942, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 92 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/Swiss-Prot:P64764" /protein_id="SIT99565.1" /translation="MGRSATIAMVPKRRDAMNRHSGPILSSGFIASSSNSCPANSLRM PSALAAETLSFDDRAVRRSTHHPGGGYPQKHAINLQSGLCPAYANASR" CDS complement(1053163..1054203) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0968C" /product="PROBABLE MONOOXYGENASE" /note="Mb0968c, -, len: 346 aa. Equivalent to Rv0943c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 346 aa overlap). Possible monooxygenase (EC 1.-.-.-), similar in part to others e.g. NP_250229.1|NC_002516 probable flavin-containing monooxygenase from Pseudomonas aeruginosa (527 aa); AAC36351.1|AF090329 cyclohexanone monooxygenase homolog from Pseudomonas fluorescens (437 aa); CAB59668.1|AL132674 monooxygenase from Streptomyces coelicolor (519 aa); etc. Also similar to putative monooxygenases from Mycobacterium tuberculosis e.g. Rv1393c|P71662|CY21B4.10C (492 aa). FASTA scores: opt: 129, E(): 8.5e-21, (27.5% identity in 236 aa overlap); Rv0892 (495 aa); Rv3049c (524 aa); etc." /db_xref="InterPro:IPR032371" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P64766" /protein_id="SIT99566.1" /translation="MAGVSEAERRGHRKLVRFQARRAIGPIRPTSAAWDRDFDPAGKR IAVVGTDAAAAHYISRLSESAASVTVFTQAPRRVVTGVPLWTTRAKRWLRRRTGAEHP AVAWATAAIDALTSSGIRTSDGVEHPVDAIIYGTGFAIADQVGDQTLVGAGGVTIRQA WDDGMEPYLGVAVHGFPNYFFITGPDTAAQARCVVECMKLMERTASRRIEVRRSSQQV FNERAQLKPAQPHRQTGGLEAFDLSSAATEDDQTYDGAATLTLAGARFRVRVRLTGHL DPIDGNYHWQGTVFDSLPETSLTHARAATLTIGGRSAPARITEQTPWGTHSVAGVGPP PYARSGPASATT" CDS 1054232..1054708 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0969" /product="POSSIBLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE (FAPY-DNA GLYCOSYLASE)" /note="Mb0969, -, len: 158 aa. Equivalent to Rv0944, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 158 aa overlap). Possible formamidopyrimidine-DNA glycosylase (EC 3.2.2.23), similar to C-terminus of formamidopyrimidine-DNA glycosylases e.g. CAB63194.1|AL133469 putative formamidopyrimidine-DNA glycosylase from Streptomyces coelicolor (287 aa); FPG_LACLA|NP_266509.1|NC_002662 formamidopyrimidine-DNA glycosylase (EC 3.2.2.23) from Lactococcus lactis subsp. lactis (273 aa), FASTA scores: opt: 246, E(): 2.4e-09, (28.9% identity in 142 aa overlap); O50606|FPG_THETH|MUTM|FPG FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Thermus thermophilus (267 aa); etc. Also similar to C-termini of endonucleases or DNA glycosylases of Mycobacterium tuberculosis e.g. Rv3297, Rv2464c, Rv2924c. MAY BE BELONG TO THE FPG FAMILY. Mb0969 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWW9" /db_xref="InterPro:IPR000214" /db_xref="InterPro:IPR010663" /db_xref="InterPro:IPR010979" /db_xref="InterPro:IPR015886" /db_xref="UniProtKB/TrEMBL:A0A1R3XWW9" /protein_id="SIT99567.1" /translation="MAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKVIA GIGNAYSDEILHVAKISPFATAGKLSGAQLTCLHEAMASVLSDAVRRSVGQGAAMLKG EKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQTGGKALADRRMSRLLK" CDS 1054714..1055475 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0970" /product="probable short-chain type dehydrogenase/reductase" /note="Mb0970, -, len: 253 aa. Equivalent to Rv0945, len: 253 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 253 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases e.g. NP_346338.1|NC_003028 oxidoreductase (short chain dehydrogenase/reductase family) from Streptococcus pneumoniae (253 aa); AAB70845.1|AF019986|PksB from Dictyostelium discoideum (260 aa); AAF86624.1|U87786 clavaldehyde dehydrogenase from Streptomyces clavuligerus (247 aa); P37440|UCPA_ECOLI oxidoreductase from Escherichia coli (285 aa), FASTA scores: opt: 275, E(): 1.1e-12, (33.8% identity in 201 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb0970 detected using SWATH mass spectrometry. Mb0970 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWU9" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XWU9" /protein_id="SIT99568.1" /translation="MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRL TELKAELSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGARL GSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVKGVPGVKAAYA ASKAGVRSLGESLRAEYAQGPIRVTVLEPGYIESEMTAKSASTMLMVDNATGVKALVA AIEREPGRAAVPWWPWAPLVRLMWVLPPRLTRRFA" CDS complement(1055491..1057152) /codon_start=1 /transl_table=11 /gene="pgi" /locus_tag="BQ2027_MB0971C" /product="PROBABLE GLUCOSE-6-PHOSPHATE ISOMERASE PGI (GPI) (PHOSPHOGLUCOSE ISOMERASE) (PHOSPHOHEXOSE ISOMERASE) (PHI)" /note="Mb0971c, pgi, len: 553 aa. Equivalent to Rv0946c, len: 553 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 553 aa overlap). Probable pgi, glucose-6-phosphate isomerase (EC 5.3.1.9), equivalent to NP_301236.1|NC_002677 glucose-6-phosphate isomerase from Mycobacterium leprae (554 aa); and P96803|G6PI_MYCSM GLUCOSE-6-PHOSPHATE ISOMERASE from Mycobacterium smegmatis (442 aa). Also highly similar to others e.g. T36015 glucose-6-phosphate isomerase from Streptomyces coelicolor (551 aa); P11537|G6PI_ECOLI|GPI glucose-6-phosphate isomerase from Escherichia coli strains K12 and O157:H7 (549 aa), FASTA scores: opt: 1779, E(): 0, (51.4% identity in 554 aa overlap); etc. Contains PS00765 Phosphoglucose isomerase signature 1, and PS00174 Phosphoglucose isomerase signature 2. BELONGS TO THE GPI FAMILY. Protein product from Mb0971c detected using shotgun mass spectrometry. Mb0971c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64193" /db_xref="InterPro:IPR001672" /db_xref="InterPro:IPR018189" /db_xref="InterPro:IPR023096" /db_xref="InterPro:IPR035476" /db_xref="InterPro:IPR035482" /db_xref="UniProtKB/Swiss-Prot:P64193" /protein_id="SIT99569.1" /translation="MTSAPIPDITATPAWDALRRHHDQIGNTHLRQFFADDPGRGREL TVSVGDLYIDYSKHRVTRETLALLIDLARTAHLEERRDQMFAGVHINTSEDRAVLHTA LRLPRDAELVVDGQDVVTDVHAVLDAMGAFTDRLRSGEWTGATGKRISTVVNIGIGGS DLGPVMVYQALRHYADAGISARFVSNVDPADLIATLADLDPATTLFIVASKTFSTLET LTNATAARRWLTDALGDAAVSRHFVAVSTNKRLVDDFGINTDNMFGFWDWVGGRYSVD SAIGLSLMTVIGRDAFADFLAGFHIIDRHFATAPLESNAPVLLGLIGLWYSNFFGAQS RTVLPYSNDLSRFPAYLQQLTMESNGKSTRADGSPVSADTGEIFWGEPGTNGQHAFYQ LLHQGTRLVPADFIGFAQPLDDLPTAEGTGSMHDLLMSNFFAQTQVLAFGKTAEEIAA DGTPAHVVAHKVMPGNRPSTSILASRLTPSVLGQLIALYEHQVFTEGVVWGIDSFDQW GVELGKTQAKALLPVITGAGSPPPQSDSSTDGLVRRYRTERGRAG" CDS complement(1057767..1057997) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0972C" /product="probable mycolyl transferase, pseudogene" /note="Mb0972c, -, len: 76 aa. Equivalent to Rv0947c, len: 76 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 76 aa overlap). Probable mycolyl transferase pseudogene (EC 2.-.-.-), similar to part of P31953|A85C_MYCTU|fbpC2 antigen 85-c precursor (85c) (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 213, E(): 2e-08, (69.6% identity in 46 aa overlap). Mb0972c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="SIT99570.1" /translation="MGGSFDRAARARRQLDNLVNVVAAGSTHRLMVPSRSMHRLIKVE FQGGGPHAWYLSDGILARDDYNGRDIHLPVFG" CDS complement(1058113..1058430) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0973C" /product="chorismate mutase" /note="Mb0973c, -, len: 105 aa. Equivalent to Rv0948c, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 105 aa overlap). Conserved hypothetical protein, equivalent to NP_301237.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (105 aa). Also similar (except in N-terminus) to SCD63.16c|CAB82023.1|AL161755 hypothetical protein from Streptomyces coelicolor (110 aa); and to N-terminus of two chorismate mutase/prephenate dehydratase. Protein product from Mb0973c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0973c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64768" /db_xref="InterPro:IPR002701" /db_xref="InterPro:IPR010958" /db_xref="InterPro:IPR036263" /db_xref="InterPro:IPR036979" /db_xref="UniProtKB/Swiss-Prot:P64768" /protein_id="SIT99571.1" /translation="MRPEPPHHENAELAAMNLEMLESQPVPEIDTLREEIDRLDAEIL ALVKRRAEVSKAIGKARMASGGTRLVHSREMKVIERYSELGPDGKDLAILLLRLGRGR LGH" CDS 1058727..1061042 /codon_start=1 /transl_table=11 /gene="uvrD1" /locus_tag="BQ2027_MB0974" /product="probable atp-dependent dna helicase ii uvrd1" /note="Mb0974, uvrD1, len: 771 aa. Equivalent to Rv0949, len: 771 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 771 aa overlap). Probable uvrD1, ATP dependent DNA helicase (EC 3.6.1.-), equivalent to P_301239.1|NC_002677 putative ATP-dependent DNA helicase from Mycobacterium leprae (778 aa). Also highly similar to others e.g. CAB92660.1|AL356832 from Streptomyces coelicolor (831 aa) (N-terminus longer); P56255|PCRA_BACST from Bacillus stearothermophilus (724 aa); Q10213|YAY5_SCHPO from Schizosaccharomyces pombe (Fission yeast) (887 aa), FASTA scores: opt: 927, E(): 0, (33.5% identity in 659 aa overlap); etc. Also similar to several other UvrD-like proteins in Mycobacterium tuberculosis e.g. Rv3201c, Rv3198c, Rv3202c. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UVRD SUBFAMILY OF HELICASES. Note that previously known as uvrD. Protein product from Mb0974 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0974 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5A4" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR005751" /db_xref="InterPro:IPR013986" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="UniProtKB/Swiss-Prot:P0A5A4" /protein_id="SIT99572.1" /translation="MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGA GSGKTAVLTRRIAYLMAARGVGVGQILAITFTNKAAAEMRERVVGLVGEKARYMWVST FHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKRYSPRLLANAI SNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLRAANALDFDDLIGETVAVL QAFPQIAQYYRRRFRHVLVDEYQDTNHAQYVLVRELVGRDSNDGIPPGELCVVGDADQ SIYAFRGATIRNIEDFERDYPDTRTILLEQNYRSTQNILSAANSVIARNAGRREKRLW TDAGAGELIVGYVADNEHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEV LIRAGIPYKVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEA CVAVYAENTGVGFGDALVAAAQGKVPMLNTRAEKAIAGFVEMFDELRGRLDDDLGELV EAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDRENAAALGPDDEDVPDT GVLADFLERVSLVADADEIPEHGAGVVTLMTLHTAKGLEFPVVFVTGWEDGMFPHMRA LDNPTELSEERRLAYVGITRARQRLYVSRAIVRSSWGQPMLNPESRFLREIPQELIDW RRTAPKPSFSAPVSGAGRFGSARPSPTRSGASRRPLLVLQVGDRVTHDKYGLGRVEEV SGVGESAMSLIDFGSSGRVKLMHNHAPVTKL" CDS complement(1061123..1062121) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0975C" /product="Phage peptidoglycan binding endopeptidase" /note="Mb0975c, -, len: 332 aa. Equivalent to Rv0950c, len: 332 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 332 aa overlap). Conserved hypothetical protein, highly similar to AL035500|MLCL373.02c|T45433 hypothetical protein from Mycobacterium leprae (343 aa), FASTA scores: opt: 1500, E(): 0, (71.0% identity in 331 aa overlap). C-terminus highly similar to part of various proteins e.g. C-terminal part of NP_441943.1|NC_000911|NlpD lipoprotein from Synechocystis sp (715 aa); N-terminal part of NP_066789.1|NC_002576 putative peptidase from Rhodococcus equi (546 aa); C-terminal part of NP_212396.1|NC_001318 conserved hypothetical protein from Borrelia burgdorferi (417 aa); C-terminal part of P33648|NLPD_ECOLI|nlpd lipoprotein from Escherichia coli (379 aa), FASTA scores: opt: 276, E(): 2e-10, (29.9% identity in 234 aa overlap); etc. Mb0975c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011055" /db_xref="InterPro:IPR016047" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW1" /protein_id="SIT99573.1" /translation="MAAIRTPRDRWPHHHRNEVTEIIPLDGFLDGLALYDELDFAELD DLDLGDDCVFDYEAQLLAAPELDDLDDADDLAPEWLVAPTVVLTPEVTPVSRRVGQHR KQPIGAARGRLLISAMAAGAAAAAAHTAIQQSETPRTETVLTAHASALNEGSGSNPPR GVQVIAAQPAASAAVHNAEFARGVAFAEERAEREARLQRPLYVMPTKGIFTSSFGYRW GVLHAGIDLANAIGTPIYAVSDGVVIDAGPTAGYGMWVKLLHADGTVTLYGHVNTTLV SVGERVMAGDQIATMGSRGFSTGPHLHFEVLLGGTERVDPVPWLAKRGLSVGNYTG" CDS 1062431..1063594 /codon_start=1 /transl_table=11 /gene="sucC" /locus_tag="BQ2027_MB0976" /product="PROBABLE SUCCINYL-COA SYNTHETASE (BETA CHAIN) SUCC (SCS-BETA)" /note="Mb0976, sucC, len: 387 aa. Equivalent to Rv0951, len: 387 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 387 aa overlap). Probable sucC, succinyl-Coa synthetase, beta chain (EC 6.2.1.5), equivalent to AL035500|MLCL373_3|NP_301241.1|NC_002677 succinyl-CoA synthase [beta] chain from Mycobacterium leprae (393 aa), FASTA score: (86.7% identity in 391 aa overlap). Also highly similar to others e.g. AB92671.1|AL356832 succinyl-CoA synthetase beta chain from Streptomyces coelicolor (394 aa); P25126|SUCC_THEFL SUCCINYL-COA SYNTHETASE BETA CHAIN from Thermus aquaticus (378 aa); P07460|SUCC_ECOLI succinyl-CoA synthetase beta chain from Escherichia coli (388 aa), FASTA scores: opt: 933, E(): 0, (41.0% identity in 390 aa overlap); etc. Protein product from Mb0976 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0976 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U0Z1" /db_xref="InterPro:IPR005809" /db_xref="InterPro:IPR005811" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR013650" /db_xref="InterPro:IPR013815" /db_xref="InterPro:IPR016102" /db_xref="InterPro:IPR017866" /db_xref="UniProtKB/Swiss-Prot:Q7U0Z1" /protein_id="SIT99574.1" /translation="MDLFEYQAKELFAKHNVPSTPGRVTDTAEGAKAIATEIGRPVMV KAQVKIGGRGKAGGVKYAATPQDAYEHAKNILGLDIKGHIVKKLLVAEASDIAEEYYL SFLLDRANRTYLAMCSVEGGMEIEEVAATKPERLAKVPVNAVKGVDLDFARSIAEQGH LPAEVLDTAAVTIAKLWELFVAEDATLVEVNPLVRTPDHKILALDAKITLDGNADFRQ PGHAEFEDRAATDPLELKAKEHDLNYVKLDGQVGIIGNGAGLAMSTLDVVAYAGEKHG GVKPANFLDIGGGASAEVMAAGLDVVLGDQQVKSVFVNVFGGITSCDAVATGIVKALG MLGDEANKPLVVRLDGNNVEEGRRILTEANHPLVTLVATMDEAADKAAELASA" CDS 1063607..1064518 /codon_start=1 /transl_table=11 /gene="sucD" /locus_tag="BQ2027_MB0977" /product="PROBABLE SUCCINYL-COA SYNTHETASE (ALPHA CHAIN) SUCD (SCS-ALPHA)" /note="Mb0977, sucD, len: 303 aa. Equivalent to Rv0952, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 303 aa overlap). Probable sucD, succinyl-CoA synthetase, alpha chain (EC 6.2.1.5), equivalent to AL035500|MLCL373_4|NP_301242.1|NC_002677 succinyl-CoA synthase [alpha] chain from Mycobacterium leprae (300 aa), FASTA score: (86.3% identity in 300 aa overlap). Also highly similar to others e.g. CAB92672.1|AL356832 from Streptomyces coelicolor (294 aa); P53591|SUCD_COXBU from Escherichia coli (288 aa), FASTA scores: opt: 855, E(): 0, (53.8% identity in 286 aa overlap); etc. Contains PS00399 ATP-citrate lyase and succinyl-CoA ligases active site, and PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb0977 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0977 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0Z0" /db_xref="InterPro:IPR003781" /db_xref="InterPro:IPR005810" /db_xref="InterPro:IPR005811" /db_xref="InterPro:IPR016102" /db_xref="InterPro:IPR017440" /db_xref="InterPro:IPR033847" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7U0Z0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99575.1" /translation="MTHMSIFLSRDNKVIVQGITGSEATVHTARMLRAGTQIVGGVNA RKAGTTVTHEDKGGRLIKLPVFGSVAEAMEKTGADVSIIFVPPTFAKDAIIEAIDAEI PLLVVITEGIPVQDTAYAWAYNLEAGHKTRIIGPNCPGIISPGQSLAGITPANITGPG PIGLVSKSGTLTYQMMFELRDLGFSTAIGIGGDPVIGTTHIDAIEAFEKDPDTKLIVM IGEIGGDAEERAADFIKTNVSKPVVGYVAGFTAPEGKTMGHAGAIVSGSSGTAAAKQE ALEAAGVKVGKTPSATAALAREILLSL" CDS complement(1064581..1065429) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0978C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb0978c, -, len: 282 aa. Equivalent to Rv0953c, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 282 aa overlap). Possible oxidoreductase (EC 1.-.-.-), equivalent to CAA48222.1|X68102 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (166 aa). Similar to several hypothetical proteins and oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); NP_070025.1|NC_000917 N5,N10-methylenetetrahydromethanopterin reductase (mer-2) from Archaeoglobus fulgidus (348 aa); etc. Also similar to several hypothetical proteins and oxidoreductases from Mycobacterium tuberculosis e.g. Rv2161c|O06216|Z95388|MTCY270.07 (288 aa), FASTA scores: opt: 633, E(): 0, (40.4% identity in 277 aa overlap), Rv3079c (275 aa), Rv0791c (347 aa), etc. Contains PS00201 Flavodoxin signature. Protein product from Mb0978c detected using SWATH mass spectrometry. Mb0978c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64770" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019921" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/Swiss-Prot:P64770" /protein_id="SIT99576.1" /translation="MHYGLVLFTSDRGITPAAAARLAESHGFRTFYVPEHTHIPVKRQ AAHPTTGDASLPDDRYMRTLDPWVSLGAASAVTSRIRLATAVALPVEHDPITLAKSIA TLDHLSHGRVSVGVGFGWNTDELVDHGVPPGRRRTMLREYLEAMRALWTQEEACYDGE FVKFGPSWAWPKPVQPHIPVLVGAAGTEKNFKWIARSADGWITTPRDVDIDEPVKLLQ DIWAAAGRDGLPQIVALDVKPVPDKLARWAELGVTEVLFGMPDRSADDAAAYVERLAA KLACCV" CDS 1065594..1066505 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0979" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb0979, -, len: 303 aa. Equivalent to Rv0954, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 303 aa overlap). Probable conserved transmembrane protein, highly similar to 34KD_MYCPA|Q04959 34 kd antigenic protein from Mycobacterium paratuberculosis (298 aa), FASTA scores: opt: 1023, E(): 7.2e-36, (59.3% identity in 305 aa overlap); AAC69251.1|U82111 34 kDa antigen precursor from Mycobacterium leprae (336 aa); and AL035500|MLCL373.06 hypothetical membrane protein from Mycobacterium leprae (297 aa), FASTA score: (55.6% identity in 315 aa overlap). Protein product from Mb0979 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0979 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65638" /db_xref="InterPro:IPR035166" /db_xref="UniProtKB/Swiss-Prot:P65638" /protein_id="SIT99577.1" /translation="MTYSPGNPGYPQAQPAGSYGGVTPSFAHADEGASKLPMYLNIAV AVLGLAAYFASFGPMFTLSTELGGGDGAVSGDTGLPVGVALLAALLAGVALVPKAKSH VTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVLALLVETGAIT APAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAAGLQSPGPQQSPQPPGYGS QYGGYSSSPSQSGSGYTAQPPAQPPAQSGSQQSHQGPSTPPTGFPSFSPPPPVSAGTG SQAGSAPVNYSNPSGGEQSSSPGGAPV" CDS 1066545..1067912 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0980" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0980, -, len: 455 aa. Equivalent to Rv0955, len: 455 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 455 aa overlap). Probable conserved integral membrane protein, highly similar to AL035500|MLCL373_6 putative membrane protein from Mycobacterium leprae (430 aa), FASTA score: (75.9% identity in 419 aa overlap); and AAL05878.1|AF411607_2|AF411607 unknown protein from Mycobacterium avium subsp. paratuberculosis (409 aa). Mb0980 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64772" /db_xref="UniProtKB/Swiss-Prot:P64772" /protein_id="SIT99578.1" /translation="MNRVSASADDRAAGARPARDLVRVAFGPGVVALGIIAAVTLLQL LIANSDMTGAWGAIASMWLGVHLVPISIGGRALGVMPLLPVLLMVWATARSTARATSP QSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAFTSVLVVHSVG AATGVWSRVGRRALAATALPDWLHDSMRAAAAGVLALLGLSGVVTAGSLVVHWATMQE LYGITDSIFGQFSLTVLSVLYAPNVIVGTSAIAVGSSAHIGFATFSSFAVLGGDIPAL PILAAAPTPPLGPAWVALLIVGASSGVAVGQQCARRALPFVAAMAKLLVAAVAGALVM AVLGYGGGGRLGNFGDVGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPP VELDADESSPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEP PPRAD" CDS 1068028..1068675 /codon_start=1 /transl_table=11 /gene="purN" /locus_tag="BQ2027_MB0981" /product="PROBABLE 5'-PHOSPHORIBOSYLGLYCINAMIDE FORMYLTRANSFERASE PURN (GART) (GAR TRANSFORMYLASE) (5'-PHOSPHORIBOSYLGLYCINAMIDE TRANSFORMYLASE)" /note="Mb0981, purN, len: 215 aa. Equivalent to Rv0956, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 215 aa overlap). Probable purN, 5'-phosphoribosylglycinamide formyltransferase (EC 2.1.2.2), equivalent to AAF05726.1|AF191543_1|AF191543|PurN phosphoribosylglycinamide formyltransferase from Mycobacterium avium subsp. paratuberculosis (209 aa); and AL035500|MLCL373_7 from Mycobacterium leprae (215 aa), FASTA score: (79.4% identity in 214 aa overlap). Also highly similar to others e.g. BAA89443.1|AB003159 from Corynebacterium ammoniagenes (199 aa); NP_241498.1|NC_002570 from Bacillus halodurans (188 aa); P08179|PUR3_ECOLI|B2500 from Escherichia coli strain K12 (212 aa), FASTA scores: opt: 380, E(): 2.4e-18, (36.6% identity in 183 aa overlap); C-terminus of P16340|PUR2_DROPS TRIFUNCTIONAL PURINE BIOSYNTHETIC PROTEIN ADENOSINE-3 from Drosophila pseudoobscura (Fruit fly) (1364 aa); etc. Protein product from Mb0981 detected using shotgun mass spectrometry. Mb0981 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWY5" /db_xref="InterPro:IPR002376" /db_xref="InterPro:IPR004607" /db_xref="InterPro:IPR036477" /db_xref="UniProtKB/TrEMBL:A0A1R3XWY5" /protein_id="SIT99579.1" /translation="MQEPLRVPPSAPARLVVLASGTGSLLRSLLDAAVGDYPARVVAV GVDRECRAAEIAAEASVPVFTVRLADHPSRDAWDVAITAATAAHEPDLVVSAGFMRIL GPQFLSRFYGRTLNTHPALLPAFPGTHGVADALAYGVKVTGATVHLVDAGTDTGPILA QQPVPVLDGDDEETLHERIKVTERRLLVAAVAALATHGVTVVGRTATMGRKVTIG" CDS 1068672..1070243 /codon_start=1 /transl_table=11 /gene="purH" /locus_tag="BQ2027_MB0982" /product="PROBABLE BIFUNCTIONAL PURINE BIOSYNTHESIS PROTEIN PURH: PHOSPHORIBOSYLAMINOIMIDAZOLECARBOXAMIDE FORMYLTRANSFERASE (AICAR TRANSFORMYLASE) (5'-PHOSPHORIBOSYL-5-AMINOIMIDAZOLE-4-CARBOXAMIDE FORMYLTRANSFERASE) + INOSINEMONOPHOSPHATE CYCLOHYDROLASE (IMP CYCLOHYDROLASE) (INOSINICASE) (IMP SYNTHETASE) (ATIC)" /note="Mb0982, purH, len: 523 aa. Equivalent to Rv0957, len: 523 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 523 aa overlap). Probable purH, bifunctional purine biosynthesis protein including 5'-phosphoribosyl-5-aminoimidazole-4-carboxamide formyltransferase (EC 2.1.2.3) and inosine-monophosphate (IMP) cyclohydrolase (EC 3.5.4.10), equivalent to AL035500|MLCL373_8 putative phosphoribosylaminoimidazolecarboxamide formyltransferase from Mycobacterium leprae (527 aa), FASTA score: (88.1% identity in 520 aa overlap); and AF05727.1|AF191543_2|AF191543|PurH from Mycobacterium avium subsp. paratuberculosis (527 aa). Also highly similar to others e.g. CAB92677.1|AL356832 bifunctional purine biosynthesis protein from Streptomyces coelicolor (523 aa); NP_388534.1|NC_000964 phosphoribosylaminoimidazole carboxy formyl formyltransferase + inosine-monophosphate cyclohydrolase from Bacillus subtilis (512 aa); P15639|PUR9_ECOLI phosphoribosylaminoimidazolecarboxamide formyltransferase from Escherichia coli (529 aa), FASTA scores: opt: 1147, E(): 0, (44.8% identity in 533 aa overlap); etc. BELONGS TO THE PURH FAMILY. Protein product from Mb0982 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0982 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67542" /db_xref="InterPro:IPR002695" /db_xref="InterPro:IPR011607" /db_xref="InterPro:IPR016193" /db_xref="InterPro:IPR024051" /db_xref="InterPro:IPR036914" /db_xref="UniProtKB/Swiss-Prot:P67542" /protein_id="SIT99580.1" /translation="MSTDDGRRPIRRALISVYDKTGLVDLAQGLSAAGVEIISTGSTA KTIADTGIPVTPVEQLTGFPEVLDGRVKTLHPRVHAGLLADLRKSEHAAALEQLGIEA FELVVVNLYPFSQTVESGASVDDCVEQIDIGGPAMVRAAAKNHPSAAVVTDPLGYHGV LAALRAGGFTLAERKRLASLAFQHIAEYDIAVASWMQQTLAPEHPVAAFPQWFGRSWR RVAMLRYGENPHQQAALYGDPTAWPGLAQAEQLHGKDMSYNNFTDADAAWRAAFDHEQ TCVAIIKHANPCGIAISSVSVADAHRKAHECDPLSAYGGVIAANTEVSVEMAEYVSTI FTEVIVAPGYAPGALDVLARKKNIRVLVAAEPLAGGSELRPISGGLLIQQSDQLDAHG DNPANWTLATGSPADPATLTDLVFAWRACRAVKSNAIVIAADGATVGVGMGQVNRVDA ARLAVERGGERVRGAVAASDAFFPFPDGLETLAAAGVTAVVHPGGSVRDEEVTEAAAK AGVTLYLTGARHFAH" CDS 1070350..1071729 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0983" /product="POSSIBLE MAGNESIUM CHELATASE" /note="Mb0983, -, len: 459 aa. Equivalent to Rv0958, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 459 aa overlap). Possible magnesium chelatase (EC 4.99.1.-), similar to others (especially in N-terminal parts) e.g. NP_296313.1|NC_001263|AE002088_10 putative magnesium protoporphyrin chelatase from Deinococcus radiodurans (487 aa), FASTA scores: opt: 1148, E(): 0, (42.4% identity in 450 aa overlap); Q44498|CHLI_ANAVA MAGNESIUM-CHELATASE SUBUNIT CHLI from Anabaena variabilis (338 aa); T31460 probable magnesium chelatase (EC 4.99.1.-) chain I bchI from Heliobacillus mobilis (363 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) Protein product from Mb0983 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0983 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWY0" /db_xref="InterPro:IPR002078" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XWY0" /protein_id="SIT99581.1" /translation="MSPSNLPRTVGELRAAGHRERGVKQEIRENLLTALADGDNVWPG ILGFDDTVIPQVERALIAGHDFVLLGERGQGKTRLLRALAGLLDEWTPVIAGAELGEH PYTPITPESIRRAAQLGDDLPVAWKHRSERYTEKLATPDTSVADLVGDVDPIKVAEGR SLGDPETIAYGLIPRAHRGIVAVNELPDLAERIQVSMLNVMEERDIQVRGYTLRLPLD VLVVASANPEDYTNRGRIITPIKDRFGAEIRTHYPLELEAEMGVIVQEAHLSAQVPDY LMQVLARFARYLRESRSIDQRSGVSARFAIAAAETVAAAARHRGAVLGETDPVARVVD LGTVIDVLRGKLEFESGEEGREQAVLEHLLRRATADTASRVLGGIDVGSLVTAVEGGS AVTTGERVSAKDVLAAVPGLPVVDRIARKLGAESEGERAAALELALEALYLAKRVDKV CGEGQTVYG" CDS 1071722..1073740 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0984" /product="FIG019045: long form Mg-chelase associated protein with vWA domain" /note="Mb0984, -, len: 672 aa. Equivalent to Rv0959, len: 672 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 672 aa overlap). Conserved hypothetical protein, similar to AE002069|AE002069_12 hypothetical protein from Deinococcus radiodurans (403 aa), FASTA scores: opt: 395, E(): 1.3e-15, (26.8% identity in 426 aa overlap). Contains a single copy at the N-terminus of a short repeat found three times in the M. tuberculosis ORF O33341|MTV003.05c|AL008883. Protein product from Mb0984 detected using SWATH mass spectrometry. Mb0984 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/Swiss-Prot:P0A5D8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99582.1" /translation="MAKSDGDDPLRPASPRLRSSRRHSLRYSAYTGGPDPLAPPVDLR DALEQIGQDVMAGASPRRALSELLRRGTRNLTGADRLAAEVNRRRRELLRRNNLDGTL QEIKKLLDEAVLAERKELARALDDDARFAELQLDALPASPAKAVQELAEYRWRSGQAR EKYEQIKDLLGRELLDQRFAGMKQALAGATDDDRRRVTEMLDDLNDLLDKHARGEDTQ RDFDEFMTKHGEFFPENPRNVEELLDSLAKRAAAAQRFRNSLSQEQRDELDALAQQAF GSPALMRALDRLDAHLQAARPGEDWTGSQQFSGDNPFGMGEGTQALADIAELEQLAEQ LSQSYPGASMDDVDLDALARQLGDQAAVDARTLAELERALVNQGFLDRGSDGQWRLSP KAMRRLGETALRDVAQQLSGRHGERDHRRAGAAGELTGATRPWQFGDTEPWHVARTLT NAVLRQAAAVHDRIRITVEDVEVAETETRTQAAVALLVDTSFSMVMENRWLPMKRTAL ALHHLVCTRFRSDALQIIAFGRYARTVTAAELTGLAGVYEQGTNLHHALALAGRHLRR HAGAQPVVLVVTDGEPTAHLEDFDGDGTSVFFDYPPHPRTIAHTVRGFDDMARLGAQV TIFRLGSDPGLARFIDQVARRVQGRVVVPDLDGLGAAVVGDYLRFRRR" CDS 1073794..1074015 /codon_start=1 /transl_table=11 /gene="vapB9" /locus_tag="BQ2027_MB0984A" /product="Possible antitoxin VapB9" /note="Mb0984A, len: 73 aa. Equivalent to Rv0959A len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB9, antitoxin, part of toxin-antitoxin (TA) operon with Rv0960 (See Arcus et al., 2005; Pandey and Gerdes, 2005). Weakly similar to others in Mycobacterium tuberculosis e.g. Rv1721c Protein product from Mb0984A detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0984A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXX2" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/TrEMBL:A0A1R3XXX2" /protein_id="SIT99583.1" /translation="MKTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRAD NPALLGDLPDIGIDTTELIGGIDAERAGR" CDS 1074012..1074395 /codon_start=1 /transl_table=11 /gene="vapc9" /locus_tag="BQ2027_MB0985" /product="possible toxin vapc9" /note="Mb0985, -, len: 127 aa. Equivalent to Rv0960, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 127 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv0065|MTV030.08 (133 aa), FASTA scores: E(): 1.5e-14, (38.3% identity in 128 aa overlap), Rv1720c (129 aa), and Rv0549c (137 aa). Protein product from Mb0985 detected using SWATH mass spectrometry. Mb0985 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64774" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P64774" /protein_id="SIT99584.1" /translation="MIVVDASAALAALLNDGQARQLIAAERLHVPHLVDSEIASGLRR LAQRDRLGAADGRRALQTWRRLAVTRYPVVGLFERIWEIRANLSAYDASYVALAEALN CALVTADLRLSDTGQAQCPITVVPR" CDS 1074541..1074888 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0986" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb0986, -, len: 115 aa. Equivalent to Rv0961, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 115 aa overlap). Probable integral membrane protein. Mb0986 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64776" /db_xref="UniProtKB/Swiss-Prot:P64776" /protein_id="SIT99585.1" /translation="MRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAM ATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLA LGLVYVAADAVLH" CDS complement(1074907..1075581) /codon_start=1 /transl_table=11 /gene="lprP" /locus_tag="BQ2027_MB0987C" /product="POSSIBLE LIPOPROTEIN LPRP" /note="Mb0987c, lprP, len: 224 aa. Equivalent to Rv0962c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 224 aa overlap). Possible lprP, lipoprotein. Contains possible N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /db_xref="GOA:P59987" /db_xref="InterPro:IPR032018" /db_xref="UniProtKB/Swiss-Prot:P59987" /protein_id="SIT99586.1" /translation="MKRTSRSLTAALLGIAALLAGCIKPNTFDPYANPGRGELDRRQK IVNGRPDLETVQQQLANLDATIRAMIAKYSPQTRFSTGVTVSHLTNGCNDPFTRTIGR QEASELFFGRPAPTPQQWLQIVTELAPVFKAAGFRPNNSVPGDPPQPLGAPNYSQIRD DGVTINLVNGDNRGPLGYSYNTGCHLPAAWRTAPPPLNMRPANDPDVHYPYLYGSPGG RTRDAY" CDS complement(1075764..1076564) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0988C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0988c, -, len: 266 aa. Equivalent to Rv0963c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 266 aa overlap). Conserved hypothetical protein, similar in part to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: E(): 1.2e-23, (39.0% identity in 254 aa overlap); Rv2542 (403 aa); Rv2079 (656 aa). Also similar in part to AL133423|SC4A7_3 HYPOTHETICAL SECRETED PROTEIN from Streptomyces coelicolor (406 aa), FASTA scores: opt: 231, E(): 6.8e-07, (31.4% identity in 204 aa overlap); and SCH10.21c|T36533 hypothetical protein from Streptomyces coelicolor (329 aa). Mb0988c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR010427" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P64778" /protein_id="SIT99587.1" /translation="MLQRELTRLQNGWLSRDGVWHTDTDKLADLRALRDTLAAHPGTS LILLDTASDPRKVLAAVGVGDVDNAERVGVTMGGLNTRVSSSVGDMVKEAGIQRAKAA ELRERAGWPNYDAVASIAWLGYDAPDGLKDVMHDWSARDAAGPLNRFDKGLAATTNVS DQHITAFGHSYGSLVTSLALQQGAPVSDVVLYGSPGTELTHASQLGVEPGHAFYMIGV NDHVANTIPEFGAFGSAPQDVPGMTQLSVNTGLAPGPLLGDGQLHERA" CDS complement(1076663..1077145) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0989C" /product="HYPOTHETICAL PROTEIN" /note="Mb0989c, -, len: 160 aa. Equivalent to Rv0964c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 160 aa overlap). Hypothetical unknown protein. Equivalent to AAK45241.1 from Mycobacterium tuberculosis strain CDC1551 (138 aa) but longer 22 aa. Mb0989c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P59978" /protein_id="SIT99588.1" /translation="MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAE EVAAIKMRLQVIRDAAREHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNV CWPTPRPPMRIWPRRFDAPPGTCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR " CDS complement(1077245..1077664) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0990C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb0990c, -, len: 139 aa. Equivalent to Rv0965c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 139 aa overlap). Conserved hypothetical protein, showing weak similarity with Rv2798c|MTCY16B7.45 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (108 aa), FASTA scores: E(): 5.6e-12, (38.9% identity in 90 aa overlap). Equivalent to AAK45242.1 from Mycobacterium tuberculosis strain CDC1551 (146 aa) but shorter 7 aa. Mb0990c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99589.1" /translation="MRVNRPQCARVPYSAESLVRVEASWYGRTLRAIPEVLSQVGYQQ ADHGESLLTSHHCCLGAAEGARPGWVGSSAGALSGLLDSWAEASTAHAAHIGDHSYGM HLAAVGFAEMEEHNAAALAAVYPTGGGSARCDGVDVS" CDS complement(1077700..1078302) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0991C" /product="conserved protein" /note="Mb0991c, -, len: 200 aa. Equivalent to Rv0966c, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 200 aa overlap). Conserved hypothetical protein, equivalent to AL035500|MLCL373_12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 1080, E(): 0, (79.5% identity in 200 aa overlap). Also highly similar to SCE6.30c|CAB88834.1|AL353832 hypothetical protein from Streptomyces coelicolor (277 aa). Some similarity to Rv2862c|MTV007.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (194 aa), FASTA scores: E(): 3.1e-06, (31.5% identity in 184 aa overlap). Equivalent to AAK45243.1 from Mycobacterium tuberculosis strain CDC1551 (230 aa) but shorter 30 aa. Note that Rv0966c has been shortened since first entry. Protein product from Mb0991c detected using SWATH mass spectrometry. Mb0991c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012551" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ1" /protein_id="SIT99590.1" /translation="MSNSAQRDARNSRDESARASDTDRIQIAQLLAYAAEQGRLQLTD YEDRLARAYAATTYQELDRLRADLPGAAIGPRRGGECNPAPSTLLLALLGGFERRGRW NVPKKLTTFTLWGSGVLDLRYADFTSTEVDIRAYSIMGAQTILLPPEVNVEIHGHRVM GGFDRKVVGEGTRGAPTVRIRGFSLGGDVGIKRKPRKPRK" CDS 1078442..1078801 /codon_start=1 /transl_table=11 /gene="csor" /locus_tag="BQ2027_MB0992" /product="copper-sensitive operon repressor csor" /note="Mb0992, -, len: 119 aa. Equivalent to Rv0967, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 119 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from several organisms e.g. AE002074|AE002074_11 from Deinococcus radiodurans (102 aa), FASTA scores: opt: 233, E(): 8.6e-10, (47.0% identity in 83 aa overlap); O32222|Z99121|YVGZ from Bacillus subtilis (101 aa), FASTA scores: opt:228, E(): 3.2e-15, (38.0% identity in 92 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0190, and Rv1766. Protein product from Mb0992 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0992 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0Y5" /db_xref="InterPro:IPR003735" /db_xref="InterPro:IPR038390" /db_xref="UniProtKB/Swiss-Prot:Q7U0Y5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99591.1" /translation="MSKELTAKKRAALNRLKTVRGHLDGIVRMLESDAYCVDVMKQIS AVQSSLERANRVMLHNHLETCFSTAVLDGHGQAAIEELIDAVKFTPALTGPHARLGGA AVGESATEEPMPDASNM" CDS 1078858..1079154 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0993" /product="conserved protein" /note="Mb0993, -, len: 98 aa. Equivalent to Rv0968, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 98 aa overlap). Conserved hypothetical protein, similar to NP_301579.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (92 aa). Also highly similar to CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. Rv3269 (93 aa), FASTA score: (51.1% identity in 94 aa overlap); and Rv1993c (90 aa). Protein product from Mb0993 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0993 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009963" /db_xref="UniProtKB/Swiss-Prot:P64780" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99592.1" /translation="MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVA AWGIRLAREAERKAGESAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH" CDS 1079210..1081522 /codon_start=1 /transl_table=11 /gene="ctpV" /locus_tag="BQ2027_MB0994" /product="PROBABLE METAL CATION TRANSPORTER P-TYPE ATPASE CTPV" /note="Mb0994, ctpV, len: 770 aa. Equivalent to Rv0969, len: 770 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 770 aa overlap). Probable ctpV, metal cation transporter P-type ATPase (transmembrane protein) (EC 3.6.3.-), highly similar (except in N-terminus) to others e.g. NP_391230.1|NC_000964 similar to heavy metal-transporting ATPase from Bacillus subtilis (803 aa); P37279|ATCS_SYNP7|PACS cation-transporting ATPase from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (747 aa), FASTA scores: opt: 1851, E(): 0, (52.1% identity in 664 aa overlap); etc. Equivalent to AAK45246.1 from Mycobacterium tuberculosis strain CDC1551 (792 aa) but shorter 22 aa. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES). Protein product from Mb0994 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0994 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXY2" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR027256" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3XXY2" /protein_id="SIT99593.1" /translation="MRVCVTGFNVDAVRAVAIEETVSQVTGVHAVHAYPRTASVVIWY SPELGDTAAVLSAITKAQHVPAELVPARAPHSAGVRGVGVVRKITGGIRRMLSRPPGV DKPLKASRCGGRPRGPVRGSASWPGEQNRRERRTWLPRVWLALPLGLLALGSSMFFGA YPWAGWLAFAATLPVQFVAGWPILRGAVQQARALTSNMDTLIALGTLTAFVYSTYQLF AGGPLFFDTSALIIAFVVLGRHLEARATGKASEAISKLLELGAKEATLLVDGQELLVP VDQVQVGDLVRVRPGEKIPVDGEVTDGRAAVDESMLTGESVPVEKTAGDRVAGATVNL DGLLTVRATAVGADTALAQIVRLVEQAQGDKAPVQRLADRVSAVFVPAVIGVAVATFA GWTLIAANPVAGMTAAVAVLIIACPCALGLATPTAIMVGTGRGAELGILVKGGEVLEA SKKIDTVVFDKTGTLTRARMRVTDVIAGQRRQPNQVLRLAAAVESGSEHPIGAAIVAA AHERGLAIPAANAFTAVAGHGVRAQVNGGPVVVGRRKLVDEQHLVLPDHLAAAAVEQE ERGRTAVFVGQDGQVVGVLAVADTVKDDAADVVGRLHAMGLQVAMITGDNARTAAAIA KQVGIEKVLAEVLPQDKVAEVRRLQDQGRVVAMVGDGVNDAPALVQADLGIAIGTGTD VAIEASDITLMSGRLDGVVRAIELSRQTLRTIYQNLGWAFGYNTAAIPLAALGALNPV VAGAAMGFSSVSVVTNSLRLRRFGRDGRTA" CDS 1081519..1082151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB0995" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb0995, -, len: 210 aa. Equivalent to Rv0970, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 210 aa overlap). Probable conserved integral membrane protein, equivalent to NP_302348.1|NC_002677 probable integral membrane protein from Mycobacterium leprae (210 aa). Mb0995 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64782" /db_xref="InterPro:IPR033458" /db_xref="UniProtKB/Swiss-Prot:P64782" /protein_id="SIT99594.1" /translation="MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNHGLHFA MAVAMAVMAWPWGARVPTTGPAVFFLLAAVWFGATAVVAVRGTATRGLYGYHGLMMLA TAWMYAAMNPRLLPVRSCTEYATEPDGSMPAMDMTAMNMPPNSGSPIWFSAVNWIGTV GFAVAAVFWACRFVMERRQEATQSRLPGSIGQAMMAAGMAMLFFAMLFPV" CDS complement(1082242..1083051) /codon_start=1 /transl_table=11 /gene="echA7" /locus_tag="BQ2027_MB0996C" /product="PROBABLE ENOYL-COA HYDRATASE ECHA7 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb0996c, echA7, len: 269 aa. Equivalent to Rv0971c, len: 269 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 269 aa overlap). Probable echA7, enoyl-CoA hydratase (EC 4.2.1.17), similar to many e.g. CAB95895.1|AL359988 putative enoyl CoA hydratase from Streptomyces coelicolor (247 aa); P24162|ECHH_RHOCA enoyl-CoA hydratase from Rhodobacter capsulatus (257 aa), FASTA scores: opt: 369, E(): 2.6e-15, (33.7% identity in 246 aa overlap); etc. Protein product from Mb0996c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0996c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXB6" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB6" /protein_id="SIT99595.1" /translation="MDSPVDYAGPAACGGPFARLTLNSPHNRNALSSTLVSQLHQGLS AAEADPAVRLVVLGHTGGTFCAGADLSEAGGGGGDPYRMAVARAREMTALLRAIVESP LPVVGAINGHVRAGGFGLVGACDMVVAGPESTFALTEARIGVAPAIISLTLLPKLSPR AAARYYLTGEKFGAREAADIGLITMAADDVDAAVAALVADVGRGSPQGLAASKALTTA AVLEGFDRDAERLTEESARLFVSDEAREGMLAFLQKRPPRWVQPATMRAAD" CDS complement(1083051..1084217) /codon_start=1 /transl_table=11 /gene="fadE12" /locus_tag="BQ2027_MB0997C" /product="acyl-coa dehydrogenase fade12" /note="Mb0997c, fadE12, len: 388 aa. Equivalent to Rv0972c, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 388 aa overlap). Probable fadE12, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAB95893.1|AL359988 putative acyl CoA dehydrogenase from Streptomyces coelicolor (382 aa); P45857|ACDB_BACSU from Bacillus subtilis (379 aa), FASTA scores: opt: 576, E(): 2.3e-26, (29.7% identity in 381 aa overlap); etc. Protein product from Mb0997c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb0997c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0Y2" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/Swiss-Prot:Q7U0Y2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99596.1" /translation="MTDTSFIESEERQALRKAVASWVANYGHEYYLDKARKHEHTSEL WAEAGKLGFLGVNLPEEYGGGGAGMYELSLVMEEMAAAGSALLLMVVSPAINGTIIAK FGTDDQKKRWLPGIADGSLTMAFAITEPDAGSNSHKITTTARRDGSDWIIKGQKVFIS GIDQAQAVLVVGRSEEAKTGKLRPALFVVPTDAPGFSYTPIEMELVSPERQFQVFLDD VRLPADALVGAEDAAIAHLFAGLNPERIMGAASAVGMGRFALGRAVDYVKTRKVWSTP IGAHQGLAHPLAQCHIEVELAKLMTQKAATLYDHGDDFGAAEAANMAKYAAAEASSRA VDQAVQSMGGNGLTKEYGVAAMMTSARLARIAPISREMVLNFVAQTSLGLPRSY" CDS complement(1084214..1086217) /codon_start=1 /transl_table=11 /gene="accA2" /locus_tag="BQ2027_MB0998C" /standard_name="bccA" /product="PROBABLE ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN (ALPHA SUBUNIT) ACCA2: BIOTIN CARBOXYLASE + BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)" /note="Mb0998c, accA2, len: 667 aa. Equivalent to Rv0973c, len: 667 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 667 aa overlap). Probable accA2 (alternate gene name: bccA), acetyl-/propionyl-coenzyme A carboxylase (alpha subunit) [INCLUDES: BIOTIN CARBOXYLASE (EC 6.3.4.14); BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)], highly similar to others e.g. CAB95892.1|AL359988 putative acetyl/propionyl CoA carboxylase alpha subunit from Streptomyces coelicolor (614 aa); NP_250702.1|NC_002516 probable acyl-CoA carboxylase alpha chain from Pseudomonas aeruginosa (655 aa); NP_420971.1|NC_002696 acetyl/propionyl-CoA carboxylase alpha subunit from Caulobacter crescentus (654 aa); NP_251581.1|NC_002516 probable biotin carboxylase/biotin carboxyl carrier protein from Pseudomonas aeruginosa (661 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv2501c|P46401|MTCY07A7.07c|BCCA_MYCTU|ACCA1 PROBABLE ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN (ALPHA SUBUNIT) (654 aa), FASTA scores, opt: 250, E(): 4e-09, (28.6% identity in 182 aa overlap); and Rv3285|MTCY71.25|ACCA3 (600 aa); Z83018|MTCY349_20 (1127 aa), FASTA scores: opt: 838, E(): 0, (40.2% identity in 500 aa overlap). Contains PS00867 Carbamoyl-phosphate synthase subdomain signature 2 and PS00188 Biotin-requiring enzymes attachment site. Protein product from Mb0998c detected using SWATH mass spectrometry. Mb0998c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWZ8" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR001882" /db_xref="InterPro:IPR005479" /db_xref="InterPro:IPR005481" /db_xref="InterPro:IPR005482" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR011764" /db_xref="InterPro:IPR016185" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ8" /protein_id="SIT99597.1" /translation="MGITRVLVANRGEIARRVFATCRRLGLGTVAVYTDPDAAAPHVA EADARVRLPQTTDYLNAEAIIAAAQAAGADAVHPGYGFLSENAEFAAAVQEAGLTWVG PPVDAVRAMGSKIESKKLMAAAGVPVLEELDPDAVTTAQLPVLVKASAGGGGRGMRVV HELSALPAEVEAARREAQSAFGDPTVFCERYLPTGHHVEVQVMADTHGTVWAVGEREC SFQRRHQKIIEEAPSPLVERVPGMRAKLFDAARLAASAIGYTGAGTVEFLADDSPGRE GEFYFLEMNTRLQVEHPVTEETTGLDLVELQLMIADCGRLDTEPPPAQGYSIEARLYA EDPAHGWQPQAGVMHTIEVPGVRAQFDSLGQRTGIRLDSGIVDGSTVSIHYDPMLAKV VSYGATRRQAALVLADALVRARLHGLRTNRELLVNVLRHPAFLDGATDTGFFDTHGMA ELSTPLADTATLRLSAIAAALADAEHNRASAGVFSSIPSGWRNLASGYQVKTYRDDAD TEHRVEYRFTRTGLALPGDPVVQLVSADVDQVVLAQDGVAHGFTVARHGPDVYVDSAR GPVHLVALSRFPEPSSAVEQGSLVAPMPGNVIRIGAEVGDTVTAGQPLIWLEAMKMEH TIAAPADGVLTHVSVNTGQQVEVGAILARVEAPQNGPAEGDSP" CDS complement(1086223..1087812) /codon_start=1 /transl_table=11 /gene="accD2" /locus_tag="BQ2027_MB0999C" /product="PROBABLE ACETYL-/PROPIONYL-COA CARBOXYLASE (BETA SUBUNIT) ACCD2" /note="Mb0999c, accD2, len: 529 aa. Equivalent to Rv0974c, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 529 aa overlap). Probable accD2, acetyl-/propionyl-CoA carboxylase (beta subunit) (EC 6.4.1.-), highly similar to many e.g. CAB95891.1|AL35998 putative acetyl/propionyl CoA carboxylase beta subunit from Streptomyces coelicolor (532 aa); NP_250704.1|NC_002516 probable acyl-CoA carboxyltransferase beta chain from Pseudomonas aeruginosa (535 aa); BAB16296.1|AB039884 acetyl-CoA carboxylase carboxyltransferase from Myxococcus xanthus (538 aa); NP_420973.1|NC_002696 putative propionyl-CoA carboxylase beta subunit from Caulobacter crescentus (530 aa); etc. Also similar to other from Mycobacterium tuberculosis: Rv2502c|ACCD1, Rv3799c|ACCD4, etc. COULD BELONG TO THE ACCD/PCCB FAMILY. Protein product from Mb0999c detected using SWATH mass spectrometry. Mb0999c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XWX8" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/TrEMBL:A0A1R3XWX8" /protein_id="SIT99598.1" /translation="MLQSTLDPNASAYDEAAATMSGKLDEINAELAKALAGGGPKYVD RHHARGKLTPRERIELLVDPDSPFLELSPLAAYGSNFQIGASLVTGIGAVCGVECMIV ANDPTVKGGTSNPWTLRKILRANQIAFENRLPVISLVESGGADLPTQKEIFIPGGQMF RDLTRLSAAGIPTIALVFGNSTAGGAYVPGMSDHVVMIKERSKVFLAGPPLVKMATGE ESDDESLGGAEMHARISGLADYFALDELDAIRIGRRIVARLNWIKQGPAPAPVTEPLF DAEELIGIVPPDLRIPFDPREVIARIVDGSEFDEFKPLYGSSLVTGWARLHGYPLGIL ANARGVLFSEESQKATQFIQLANRADTPLLFLHNTTGYMVGKDYEEGGMIKHGSMMIN AVSNSTVPHISLLIGASYGAGHYGMCGRAYDPRFLFAWPSAKSAVMGGAQLSGVLSIV ARAAAEARGQQVDEAADAAMRAAVEGQIEAESLPLVLSGMLYDDGVIDPRDTRTVLGM CLSAIANGPIKGTSNFGVFRM" CDS complement(1087853..1088962) /codon_start=1 /transl_table=11 /gene="fadE13" /locus_tag="BQ2027_MB1000C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE13" /note="Mb1000c, fadE13, len: 369 aa. Equivalent to the 5' end of Rv0975c, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 349 aa overlap). Probable fadE13, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. T35427 probable acyl-CoA dehydrogenase from Streptomyces coelicolor (382 aa); M74096|HUMACADL_1 Human long chain acyl-CoA dehydrogenase from Homo sapiens (430 aa), FASTA scores: opt: 819, E(): 0, (37.0% identity in 376 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. fadE20|Z98209|MTCY154_4 (386 aa), FASTA scores: (40.3% identity in 375 aa overlap). Contains PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (t-*) leads to a shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb1000c detected using SWATH mass spectrometry. Mb1000c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX05" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XX05" /protein_id="SIT99599.1" /translation="MNIWTTPERQQLRKTVRAFAEREILPHVDEWERIGELPRGLHRL AGAAGLLGAGFPEAVGGGGGDGADPVIICEEMHQAGAPGGVYASLFTCGIAVPHMVAS GDERLIATYVRPTLAGEKIGALAITEPGGGSDVGHLRTSAVRDGDHYVINGAKTYITS GVRADYVVTAVRTGGPGAAGVSLLVVEKDTPGFEVTRKLDKMGWRSSDTAELCYTDVA VPATNLVGAENSGFTQIARAFVSERIGLAAQAYSSAQRCLDLTAQWCRDRETFGRPLI SRQSVQNTLAEMARRIDVARVYAHHVVERQLAGETDLIAQVCFAKNTAVQAGEWVANQ AVQLFGGMGYMAESEANANTGTCESSVSEAAPPKY" CDS complement(1088959..1090641) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1001C" /product="Terpene utilization protein AtuA" /note="Mb1001c, -, len: 560 aa. Equivalent to Rv0976c, len: 560 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 560 aa overlap). Conserved hypothetical protein, highly similar to others e.g. CAB95890.1|AL359988 conserved hypothetical protein from Streptomyces coelicolor (558 aa); P_251576.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (600 aa); etc. N-terminal part highly similar to AL035500|MLCL373_14 probable pseudogene from Mycobacterium leprae (163 aa), FASTA score: (50.0% identity in 122 aa overlap). Protein product from Mb1001c detected using SWATH mass spectrometry. Mb1001c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010839" /db_xref="UniProtKB/TrEMBL:A0A1R3XX00" /protein_id="SIT99600.1" /translation="MRIGNCSGFYGDRLSAMREMLTGGELDYLTGDYLAELTMLILGR DRMKNPDRGYAKTFLAQLEDCLGLAHDRGVRIVTNAGGLNPAGLANAVRALAARLGIP AQVAHVEGDDLQPRAAELGLGTPLTANAYLGAWGIVDCFERGADVVVTGRVTDASVVV GAAAAHFGWGRTDYHRLAGAVVAGHVIECGVQATGGNYAFFTEIGDLTHAGFPLAEIA ADGSSVITKHHGTGGLVSVDTITAQLLYEITGARYANPDVTARMDSVELSPDGPDRVR ISGVIGEPPPPTYKVSLNSIGGFRNAMTFVLTGLDIDAKADLVRRQLEAALTVKPAEL QWTLARTDHPDADTEETASALLTCVARDPDPANVGRQFSSAAVELALASYPGFTATAP PGDGQVYGVFTPGYVDAGKVAHIAVHADGTRTEIPCATETLELAPAHPPALPDPLPAG PTRRVPLGLIAGARSGDKGGSANVGVWVRTDEQWRWLAHTLTVELLKELLPETAGLVV TRHVLPNLRALNFVIEAILGQGVAYQARFDPQAKGLGEWLRSRHVEIPETLL" CDS 1090840..1093611 /codon_start=1 /transl_table=11 /gene="PE_PGRS16" /locus_tag="BQ2027_MB1002" /product="pe-pgrs family protein pe_pgrs16" /note="Mb1002, PE_PGRS16, len: 923 aa. Equivalent to Rv0977, len: 923 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 923 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to other PGRS-type sequences e.g. AL0091|MTV004_1 from Mycobacterium tuberculosis (1125 aa), FASTA score: (45.4% identity in 959 aa overlap); Z80225|MTCY441_4 from Mycobacterium tuberculosis (778 aa), FASTA score: (51.5% identity in 750 aa overlap); etc. Mb1002 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR021109" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ9" /protein_id="SIT99601.1" /translation="MSFVVTAPPVLASAASDLGGIASMISEANAMAAVRTTALAPAAA DEVSAAIAALFSSYARDYQTLSVQVTAFHVQFAQTLTNAGQLYAVVDVGNGVLLKTEQ QVLGVINAPTQTLVGRPLIGDGTHGAPGTGQNGGAGGILWGNGGNGGSGAPGQPGGRG GDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGNG GAGGQGGSGGLGGSGGTGGAGMAAGPAGGTGGIGGIGGIGGAGGVGGHGSALFGHGGI NGDGGTGGMGGQGGAGGNGWAAEGITVGIGEQGGQGGDGGAGGAGGIGGSAGGIGGSQ GAGGHGGDGGQGGAGGSGGVGGGGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNGGRG GAGGMATAGSDGGNGGGGGNGGVGVGSAGGAGGTGGDGGAAGAGGAPGHGYFQQPAPQ GLPIGTGGTGGEGGAGGAGGDGGQGDIGFDGGRGGDGGPGGGGGAGGDGSGTFNAQAN NGGDGGAGGVGGAGGTGGTGGVGADGGRGGDSGRGGDGGNAGHGGAAQFSGRGAYGGE GGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNGGNGGNGGFGGINGTFGTNGAG GTGGLGTLLGGHNGNIGLNGATGGIGSTTLTNATVPLQLVNTTEPVVFISLNGGQMVP VLLDTGSTGLVMDSQFLTQNFGPVIGTGTAGYAGGLTYNYNTYSTTVDFGNGLLTLPT SVNVVTSSSPGTLGNFLSRSGAVGVLGIGPNNGFPGTSSIVTAMPGLLNNGVLIDESA GILQFGPNTLTGGITISGAPISTVAVQIDNGPLQQAPVMFDSGGINGTIPSALASLPS GGFVPAGTTISVYTSDGQTLLYSYTTTATNTPFVTSGGVMNTGHVPFAQQPIYVSYSP TAIGTTTFN" CDS complement(1093828..1094835) /codon_start=1 /transl_table=11 /gene="PE_PGRS17" /locus_tag="BQ2027_MB1003C" /product="pe-pgrs family protein pe_pgrs17" /note="Mb1003c, PE_PGRS17, len: 335 aa. Similar to Rv0978c, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (93.7% identity in 335 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. Z95387|MTCY1A10_19 from Mycobacterium tuberculosis (461 aa), FASTA score: (73.6% identity in 277 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 115 bp to 127 bp substitution leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (335 aa versus 331 aa)." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR001258" /db_xref="InterPro:IPR011964" /db_xref="InterPro:IPR013017" /db_xref="InterPro:IPR015943" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ35" /protein_id="SIT99602.1" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLVAAQD EVSTAIAALFGSHGQHYQAISAQVAAYQERFVLALSQAGSTYAVAEAASATPLQQIEQ ALLGVINTPTEALVGRKLIGDGAHGAPGTGQAGGAGGILWGNGGNGGSGAPGQAGGAG GAAGLIGNGGAGGTGGAVSLARAGTAGGAGRGPVGGIGGAGGVGGAGGAAGAVTTITH ASFNDPHGVAVNPGGNVYVTNFGSGTVSVINPATNTVTGSPITIGNGPSGVAVSPVTG LVFVTNFDSNTVSVIDPTTNTVTGSPITVGTAPTGVAVNPVTGEVYVTNFAGDTVSVI S" CDS complement(1095149..1095343) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1004C" /product="HYPOTHETICAL PROTEIN" /note="Mb1004c, -, len: 64 aa. Equivalent to Rv0979c, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 64 aa overlap). Hypothetical unknown protein. Start codon changed since first submission (-44 aa)" /db_xref="UniProtKB/TrEMBL:A0A1R3XXZ6" /protein_id="SIT99603.1" /translation="MGFRTQVGAATIASTMTWRIPVEDGPAQFRAGVGPGRDRQFTVV APMVVGLWDRNRRPGWQWPS" CDS 1095365..1095538 /codon_start=1 /transl_table=11 /gene="rpmF" /locus_tag="BQ2027_MB1005" /product="50s ribosomal protein l32 rpmf" /note="Mb1005, rpmF, len: 57 aa. Equivalent to Rv0979A, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 57 aa overlap). Probable rpmF, 50S ribosomal protein L32, similar to others e.g. rpmF|Q9RL50 PROBABLE 50S RIBOSOMAL PROTEIN from Streptomyces coelicolor (56 aa), FASTA scores: E(): 5.1e-09, (63.45% identity in 52 aa overlap); etc. BELONGS TO THE L32P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb1005 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1005 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5V9" /db_xref="InterPro:IPR002677" /db_xref="InterPro:IPR011332" /db_xref="UniProtKB/Swiss-Prot:P0A5V9" /protein_id="SIT99604.1" /translation="MAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVAGHAHKVPRRLL KAARLGLIDFDKR" CDS complement(1095557..1096930) /codon_start=1 /transl_table=11 /gene="PE_PGRS18" /locus_tag="BQ2027_MB1006C" /product="pe-pgrs family protein pe_pgrs18" /note="Mb1006c, PE_PGRS18, len: 457 aa. Equivalent to Rv0980c, len: 457 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 457 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. Z95387|MTCY1A10_19 from Mycobacterium tuberculosis (461 aa), FASTA score: (66.7% identity in 405 aa overlap); Z95844|MTCY493_2 from Mycobacterium tuberculosis (741 aa), FASTA score: (53.0% identity in 394 aa overlap); etc." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR001258" /db_xref="InterPro:IPR011045" /db_xref="InterPro:IPR011964" /db_xref="InterPro:IPR013017" /db_xref="InterPro:IPR015943" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99605.1" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAQD EVSTAIAALFGSHGQHYQAISAQVAAYQERFVLALSQASSTYAVAEAASATPLQNVLD AINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG LIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDG GTGGVGGHGGLIGVGGHGGDGGTGGTGGAVSLARAGTAGGAGGGPAGGIGGTGGVGGA GGAAGAVTTITHASFNDPHGVAVNPGGNIYVTNQGSNTVSVIDPVTNTVTGSITDGNG PSGVAVSPVTGLVFVTNFDSNTVSVIDPNTNTVTGSIPVGTGAYGVAVNPGGNIYVTN QFSNTVSVIDPATNTVTGSPIPVGLDPTGVAVNPVTGVVYVTNSLDDTVSVITGEPAR SVCSAAI" CDS 1097295..1097987 /codon_start=1 /transl_table=11 /gene="mprA" /locus_tag="BQ2027_MB1007" /product="Mycobacterial persistence regulator MRPA (two component response transcriptional regulatory protein)" /note="Mb1007, len: 228 aa. Equivalent to Rv0981 len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 228 aa overlap). MprA,mycobacterial persistence regulator, a two-component response regulator whose expression is required for entrance into and maintenance of persistent infection (see citation below), equivalent to NP_301250.1|NC_002677 putative two-component response regulator from Mycobacterium leprae (228 aa); and highly similar to others from Mycobacterium leprae. Also highly similar to others e.g. AAG36759.1|AF119221_1|AF119221 response regulator from Corynebacterium glutamicum (232 aa); CAB88489.1|AL353816 putative two-component system response regulator from Streptomyces coelicolor (248 aa); BJY09666_1 two-component response regulator (ragA, ragB and rpoH3) from B.japonicum (226 aa), FASTA score: (43.8% identity in 224 aa overlap); BSAJ2571_44 two-component response regulator from Bacillus subtilis (228 aa), FASTA score: (46.4% identity in 224 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv1033c (257 aa); Rv0903c (236 aa), FASTA score: (50.7 identity in 225 aa overlap); etc. Contains PS00217 Sugar transport proteins signature 2. Start changed since first submission (-2 aa). MprAB is involved in the regulation of genes in response to environmental stress (See He et al., 2006). Protein product from Mb1007 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1007 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0X4" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/Swiss-Prot:Q7U0X4" /protein_id="SIT99606.1" /translation="MSVRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIAS DRPDALVLDVMMPRLDGLEVCRQLRSTGDDLPILVLTARDSVSERVAGLDAGADDYLP KPFALEELLARMRALLRRTKPEDAAESMAMRFSDLTLDPVTREVNRGQRRISLTRTEF ALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYLRRKTEADGEPRLIHTV RGVGYVLRETPP" CDS 1097987..1099501 /codon_start=1 /transl_table=11 /gene="mprB" /locus_tag="BQ2027_MB1008" /product="two component sensor kinase mprb" /note="Mb1008, mprB, len: 504 aa. Equivalent to Rv0982, len: 504 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 504 aa overlap). Probable mprB, two component sensor kinase, transmembrane protein (EC 2.7.3.-) (see citation below), equivalent to AL035500|MLCL373_16|NP_301251.1|NC_002677 putative two-component system sensor kinase from Mycobacterium leprae (519 aa), FASTA score: (81.0% identity in 521 aa overlap). Also highly similar to others (especially in C-terminal part) e.g. AAG36760.1|AF119221_2|AF119221 sensor kinase from Corynebacterium glutamicum (455 aa); CAB89748.1|AL354616 putative two-component histidine kinase from Streptomyces coelicolor (481 aa); X58793|SLCUTRS_2 sensor kinase from S.lividans (414 aa), FASTA scores: opt: 451, E(): 4.2e-21, (36.0% identity in 303 aa overlap); P30847|BAES_ECOLI SENSOR PROTEIN (EC 2.7.3.-) from Escherichia coli (467 aa), FASTA scores: opt: 412, E(): 1.3e-18, (30.4% identity in 336 aa overlap); etc. Also similar in C-terminal region to C-terminus of Rv0902c|Z73101|MTCY31_33 from Mycobacterium tuberculosis (446 aa), FASTA scores: opt: 423, E(): 2.6e-19, (28.4 identity in 462 aa overlap). Protein product from Mb1008 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1008 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0X3" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/Swiss-Prot:Q7U0X3" /protein_id="SIT99607.1" /translation="MWWFRRRDRAPLRATSSLSLRWRVMLLAMSMVAMVVVLMSFAVY AVISAALYSDIDNQLQSRAQLLIASGSLAADPGKAIEGTAYSDVNAMLVNPGQSIYTA QQPGQTLPVGAAEKAVIRGELFMSRRTTADQRVLAIRLTNGSSLLISKSLKPTEAVMN KLRWVLLIVGGIGVAVAAVAGGMVTRAGLRPVGRLTEAAERVARTDDLRPIPVFGSDE LARLTEAFNLMLRALAESRERQARLVTDAGHELRTPLTSLRTNVELLMASMAPGAPRL PKQEMVDLRADVLAQIEELSTLVGDLVDLSRGDAGEVVHEPVDMADVVDRSLERVRRR RNDIHFDVEVIGWQVYGDTAGLSRMALNLMDNAAKWSPPGGHVGVRLSQLDASHAELV VSDRGPGIPVQERRLVFERFYRSASARALPGSGLGLAIVKQVVLNHGGLLRIEDTDPG GQPPGTSIYVLLPGRRMPIPQLPGATAGARSTDIENSRGSANVISVESQSTRAT" CDS 1099545..1100939 /codon_start=1 /transl_table=11 /gene="pepd" /locus_tag="BQ2027_MB1009" /product="probable serine protease pepd (serine proteinase) (mtb32b)" /note="Mb1009, -, len: 464 aa. Equivalent to Rv0983, len: 464 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 464 aa overlap). Probable secreted or membrane serine protease (EC 3.4.21.-), equivalent (but longer 18 aa in N-terminus) to AL035500|MLCL373_17|T45448 probable serine proteinase (EC 3.4.21.-) from Mycobacterium leprae (452 aa), FASTA score: (74.2% identity in 466 aa overlap); and highly similar to others from Mycobacterium leprae. Also highly similar (except in N-terminus) to other proteases e.g. CAC01350.1|AL390975 putative protease from Streptomyces coelicolor (542 aa); NP_440705.1|NC_000911|HtrA serine protease from Synechocystis sp. (452 aa); NP_346646.1|NC_003028 serine protease from Streptococcus pneumoniae (393 aa); etc. Also similar in part to members of the htrA-antigen family e.g. U87242|MTU87242_3|HtrA serine protease from M. tuberculosis (542 aa), FASTA scores: opt: 846, E(): 2e-28, (40.6% identity in 392 aa overlap); and similar to other hypothetical serine proteases e.g. Rv0983, Rv0125, etc. Protein product from Mb1009 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1009 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XWZ0" /db_xref="InterPro:IPR001478" /db_xref="InterPro:IPR001940" /db_xref="InterPro:IPR009003" /db_xref="InterPro:IPR036034" /db_xref="UniProtKB/TrEMBL:A0A1R3XWZ0" /protein_id="SIT99608.1" /translation="MAKLARVVGLVQEEQPSDMTNHPRYSPPPQQPGTPGYAQGQQQT YSQQFDWRYPPSPPPQPTQYRQPYEALGGTRPGLIPGVIPTMTPPPGMVRQRPRAGML AIGAVTIAVVSAGIGGAAASLVGFNRAPAGPSGGPVAASAAPSIPAANMPPGSVEQVA AKVVPSVVMLETDLGRQSEEGSGIILSAEGLILTNNHVIAAAAKPPLGSPPPKTTVTF SDGRTAPFTVVGADPTSDIAVVRVQGVSGLTPISLGSSSDLRVGQPVLAIGSPLGLEG TVTTGIVSALNRPVSTTGEAGNQNTVLDAIQTDAAINPGNSGGALVNMNAQLVGVNSA IATLGADSADAQSGSIGLGFAIPVDQAKRIADELISTGKASHASLGVQVTNDKDTPGA KIVEVVAGGAAANAGVPKGVVVTKVDDRPINSADALVAAVRSKAPGATVALTFQDPSG GSRTVQVTLGKAEQ" CDS 1100939..1101484 /codon_start=1 /transl_table=11 /gene="moaB2" /locus_tag="BQ2027_MB1010" /product="POSSIBLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE MOAB2 (PHS) (4-ALPHA-HYDROXY-TETRAHYDROPTERIN DEHYDRATASE) (PTERIN-4-A-CARBINOLAMINE DEHYDRATASE) (PHENYLALANINE HYDROXYLASE-STIMULATING PROTEIN) (PHS) (PTERIN CARBINOLAMINE DEHYDRATASE) (PCD)" /note="Mb1010, moaB2, len: 181 aa. Equivalent to Rv0984, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 181 aa overlap). Possible moaB2, pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), highly similar to NP_301253.1|NC_002677 putative molybdenum cofactor biosynthesis protein from Mycobacterium leprae (181 aa), FASTA score: (92.3% identity in 181 aa overlap). Also similar to others e.g. CAB59675.1|AL132674 molybdenum cofactor biosynthesis protein from Streptomyces coelicolor (179 aa); Q56208|MOCB_SYNP7 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN CB from Synechococcus sp. (319 aa), FASTA score: (37.3% identity in 142 aa overlap); C-terminus of NP_197599.1|NC_003076 MOLYBDOPTERIN BIOSYNTHESIS CNX1 PROTEIN from Arabidopsis thaliana (670 aa); etc. Also similar to Rv0865|MOG from Mycobacterium tuberculosis (160 aa); and other mog proteins e.g. CAC39235.1|AJ312124 Mog protein from Eubacterium acidaminophilum (162 aa). COULD BELONG TO THE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE FAMILY. Alternative start codon has been suggested in position 1100508. Protein product from Mb1010 detected using shotgun mass spectrometry. Mb1010 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001453" /db_xref="InterPro:IPR036425" /db_xref="UniProtKB/TrEMBL:A0A1R3XX15" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99609.1" /translation="MKVAAQCSKLGYTVAPMEQRAELVVGRALVVVVDDRTAHGDEDH SGPLVTELLTEAGFVVDGVVAVSADEVEIRNALNTAVIGGVDLVVSVGGTGVTPRDVT PEATRDILDREILGIAEAIRASGLSAGIVDAGLSRGLAGVSGSTLVVNLAGSRYAVRD GMATLNPLAAQIIGQLSSLEI" CDS complement(1101504..1101959) /codon_start=1 /transl_table=11 /gene="mscL" /locus_tag="BQ2027_MB1011C" /product="POSSIBLE LARGE-CONDUCTANCE ION MECHANOSENSITIVE CHANNEL MSCL" /note="Mb1011c, mscL, len: 151 aa. Equivalent to Rv0985c, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 151 aa overlap). Possible mscL, large conductance mechanosensitive ion channel (integral membrane protein), equivalent to AL035500|MLCL373_19|NP_301254.1|NC_002677 putative mechanosensitive channel protein from Mycobacterium leprae (154 aa), FASTA score: (71.0% identity in 155 aa overlap). Also highly similar to others e.g. NP_268999.1|NC_002737 putative large conductance mechanosensitive channel from Streptococcus pyogenes (120 aa); CAB90974.1|AL355832 putative mechanosensitive channel from Streptomyces coelicolor (156 aa); Q9X722|MSCL_CLOHI LARGE-CONDUCTANCE MECHANOSENSITIVE CHANNEL from Clostridium histolyticum (133 aa); Z83337|BSZ83337_6 large conductance mechanosensitive channel from Bacillus subtilis (130 aa), FASTA scores: opt: 248, E(): 8.4e-10, (39.0% identity in 136 aa overlap); U08371|ECU08371_1 large conductance mechanosensitive channel from Escherichia coli strain K-12 (136 aa), FASTA score: (36.6% identity in 134 aa overlap); etc. BELONGS TO THE MSCL FAMILY. Protein product from Mb1011c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1011c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5K9" /db_xref="InterPro:IPR001185" /db_xref="InterPro:IPR019823" /db_xref="InterPro:IPR036019" /db_xref="InterPro:IPR037673" /db_xref="UniProtKB/Swiss-Prot:P0A5K9" /protein_id="SIT99610.1" /translation="MLKGFKEFLARGNIVDLAVAVVIGTAFTALVTKFTDSIITPLIN RIGVNAQSDVGILRIGIGGGQTIDLNVLLSAAINFFLIAFAVYFLVVLPYNTLRKKGE VEQPGDTQVVLLTEIRDLLAQTNGDSPGRHGGRGTPSPTDGPRASTESQ" CDS 1102282..1103028 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1012" /product="PROBABLE ADHESION COMPONENT TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1012, -, len: 248 aa. Equivalent to Rv0986, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 248 aa overlap). Probable ATP-binding protein ABC transporter supposed involved in transport of adhesion component (see citation below), highly similar to many ATP-binding proteins e.g. AE0010|AE001033_8 ABC transporter ATP-binding protein from Archaeoglobus fulgidus (228 aa), FASTA scores: opt: 669, E(): 0, (45.7% identity in 219 aa overlap); CAB81857.1|AL161691 putative ABC-transporter ATP-binding protein from Streptomyces coelicolor (246 aa); X84019|ZMDNAGRP_4 glutamate uptake regulatory protein (grp) from Z.mobilis (232 aa), FASTA score: (44.4% identity in 225 aa overlap); Z99111|BSUB0008_108 from Bacillus subtilis (230 aa), FASTA score: (38.7% identity in 222 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1012 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1012 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX09" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XX09" /protein_id="SIT99611.1" /translation="MNRQPIVQLSNLSWTFREGETRRQVLDHITFDFEPGEFVALLGQ SGSGKSTLLNLISGIEKPTTGDVTINGFAITQKTERDRTLFRRDQIGIVFQFFNLIPT LTVLENITLPQELAGVSQRKAAVVARDLLEKVGMADRERTFPDKLSGGEQQRVAISRA LAHNPMLVLADEPTGNLDSDTGDKVLDVLLDLTRQAGKTLIMATHSPSMTQHADRVVN LQGGRLIPALNRENQTDQPASTILLPTSYE" CDS 1103021..1104292 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1013" /product="PROBABLE ADHESION COMPONENT TRANSPORT TRANSMEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb1013, -, len: 423 aa. Equivalent to 5' end of Rv0987, len: 855 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 423 aa overlap). Probable transmembrane protein ABC transporter supposed involved in transport of adhesion component (see citation below), whose N-terminus shows similarity with hypothetical proteins, generally transmembrane proteins, e.g. CAB96016.1|AL360055 putative ABC transport system integral membrane protein from Streptomyces coelicolor (855 aa); P44252|YCFU_HAEIN|HI1555 HYPOTHETICAL PROTEIN from Haemophilus influenzae (393 aa), FASTA scores: opt: 265, E(): 1.7e-09, (23.6% identity in 402 aa overlap); etc. N-and C-termini respectively show similarity to O32735 ATTF PROTEIN (420 aa), FASTA scores: E(): 1e-09, (26.7% identity in 430 aa overlap), and G2340078 ATTG PROTEIN (359 aa), FASTA scores: E(): 2.7e-08, (27.8% identity in 356 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0987 exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) introduces a stop codon that splits Rv0987 in two parts, Mb1013 and Mb1014. Mb1013 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ45" /db_xref="InterPro:IPR003838" /db_xref="InterPro:IPR025857" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ45" /protein_id="SIT99612.1" /translation="MNDQAPVAYAPLWRTAWRRLRQRPFQYILLVLGIALGVAMIVAI DVSSNSAQRAFDLSAAAITGKSTHRLVSGPAGVDQQLYVDLRRHGYDFSAPVIEGYVL ARGLGNRAMQFMGTDPFAESAFRSPLWSNQNIAELGGFLTRPNGVVLSRQVAQKYGLA VGDRIALQVKGAPTTVTLVGLLTPADEVSNQKLSDLIIADISTAQELFHMPGRLSHID LIIKDEATATRIQQRLPAGVRMETSDTQRDTVKQMTDAFTVNLTALSLIALLVGIFLI YNTVTFNVVQRRPFFAILRCLGVTREQLFWLIMTESLVAGLIGTGLGLLIGIWLGEGL IGLVTQTINDFYFVINVRNVSVSAESLLKGLIIGIFAAMLATLPPAIEAMRTVPASTL RRSSLESKITKLMPWLWVAWFGLGSFGVLML" CDS 1104293..1105588 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1014" /product="PROBABLE ADHESION COMPONENT TRANSPORT TRANSMEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb1014, -, len: 431 aa. Equivalent to 3' end of Rv0987, len: 855 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 431 aa overlap). Probable transmembrane protein ABC transporter supposed involved in transport of adhesion component (see citation below), whose N-terminus shows similarity with hypothetical proteins, generally transmembrane proteins, e.g. CAB96016.1|AL360055 putative ABC transport system integral membrane protein from Streptomyces coelicolor (855 aa); P44252|YCFU_HAEIN|HI1555 HYPOTHETICAL PROTEIN from Haemophilus influenzae (393 aa), FASTA scores: opt: 265, E(): 1.7e-09, (23.6% identity in 402 aa overlap); etc. N-and C-termini respectively show similarity to O32735 ATTF PROTEIN (420 aa), FASTA scores: E(): 1e-09, (26.7% identity in 430 aa overlap), and G2340078 ATTG PROTEIN (359 aa), FASTA scores: E(): 2.7e-08, (27.8% identity in 356 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv0987 exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) introduces a stop codon that splits Rv0987 into two parts, Mb1013 and Mb1014. Protein product from Mb1014 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XY07" /db_xref="InterPro:IPR003838" /db_xref="UniProtKB/TrEMBL:A0A1R3XY07" /protein_id="SIT99613.1" /translation="MPGNNLVVAFVGLFSVLIALALIAPPLTRFVMLRLAPGLGRLLG PIGRMAPRNIVRSLSRTSIAIAALMMAVSLMVGVSISVGSFRQTLANWLEVTLKSDVY VSPPTLTSGRPSGNLPVDAVRNISKWPGVRDAVMARYSSVFAPDWGREVELMAVSGDI SDGKRPYRWIDGNKDTLWPRFLAGKGVMLSEPMVSRQHLQMPPRPITLMTDSGPQTFP VLAVFSDYTSDQGVILMDRASYRAHWQDDDVTTMFLFLASGANSGALIDQLQAAFAGR EDIVIQSTHSVREASMVIFDRSFTITIALQLVATVVAFIGVLSALMSLELDRAHELGV FRAIGMTTRQLWKLMFIETGLMGGMAGLMALPTGCILAWILVRIINVRSFGWTLQMHF ESAHFLRALLVAVVAALAAGMYPAWRLGRMTIRTAIREE" CDS 1105595..1106755 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1015" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb1015, -, len: 386 aa. Equivalent to Rv0988, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 386 aa overlap). Possible conserved exported protein, with potential N-terminal signal sequence, similar (except in N-terminus) to O32737|L63540 ATTH PROTEIN from Agrobacterium tumefaciens (355 aa), FASTA scores: opt: 651, E(): 5.7e-33, (33.4% identity in 344 aa overlap); and NP_231265.1|NC_002505 conserved hypothetical protein from Vibrio cholerae (372 aa). Protein product from Mb1015 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR010791" /db_xref="InterPro:IPR023374" /db_xref="UniProtKB/TrEMBL:A0A1R3XX28" /protein_id="SIT99614.1" /translation="MRKAGLTGVVLVLTLTLVAFWWWQRPRTNAVAADSLVGVLVDEN NAGYSLATVPGAIRFPRDLGPHYDYQTEWWYYTGNLETADGRLFGYQLTFFRRALAPP GEGVAIADASSWRTTQVYMAHFAISDISNRGFDPAEKFSRQALGLAGASSEPYAVWLD DWYARESNNNSVQLFARTQNTVLDLTLTQTLPPILQGNAGLSVKGAQPGNASNYYSLV RQESRGTVSVNGDTFMVSGLSWKDHEYMTSALAPEDVGWDWFGLQFYNGTALMLFQIR QADGSVTRFSSGTFVAGDGGVIPLESSDFRIKTTDRWTSDQSGATYPIAWEIEIERIG LTLRGAALMANQELRLSRTYWEGAVALEGRYQGMPISGRGYVEMTGYVQRLS" CDS complement(1106884..1107861) /codon_start=1 /transl_table=11 /gene="grcC2" /locus_tag="BQ2027_MB1016C" /product="PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE GRCC2 (POLYPRENYL PYROPHOSPHATE SYNTHETASE)" /note="Mb1016c, grcC2, len: 325 aa. Equivalent to Rv0989c, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 325 aa overlap). Probable grcC2, polyprenyl diphosphate synthetase (EC 2.5.1.-), highly similar to NP_302483.1|NC_002677 polyprenyl diphosphate synthase component from Mycobacterium leprae (330 aa). Also similar to others (generally hepta (EC 2.5.1.30) or hexaprenyl e.g. NP_471378.1|NC_003212 protein similar to heptaprenyl diphosphate synthase component II (menaquinone biosynthesis) from Listeria innocua (321 aa); NP_371994.1|NC_002758 heptaprenyl diphosphate syntase component II from Staphylococcus aureus subsp. aureus Mu50 (319 aa); P55785|HEP2_BACST heptaprenyl diphosphate synthase component from Bacillus subtilis (323 aa), FASTA scores: opt: 496, E(): 1.4e-24, (31.4% identity in 306 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis proteins e.g. Rv0562|grcC1|NP_215076.1|MTCY25D10.41 PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE (335 aa); Rv3383, Rv3398c, Rv2173, etc. SEEMS TO BELONG TO THE FPP/GGPP SYNTHETASES FAMILY. Protein product from Mb1016c detected using SWATH mass spectrometry. Mb1016c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXD6" /db_xref="InterPro:IPR000092" /db_xref="InterPro:IPR008949" /db_xref="UniProtKB/TrEMBL:A0A1R3XXD6" /protein_id="SIT99615.1" /translation="MIPAVSLGDPQFTANVHDGIARITELINSELSQADEVMRDTVAH LVDAGGTPFRPLFTVLAAQLGSDPDGWEVTVAGAAIELMHLGTLCHDRVVDESDMSRK TPSDNTRWTNNFAILAGDYRFATASQLASRLDPEAFAVVAEAFAELITGQMRATRGPA SHIDTIEHYLRVVHEKTGSLIAASGQLGAALSGAAEEQIRRVARLGRMIGAAFEISRD IIAISGDSATLSGADLGQAVHTLPMLYALREQTPDTSRLRELLAGPIHDDHVAEALTL LRCSPGIGKAKNVVAAYAAQAREELPYLPDRQPRRALATLIDHAVSACD" CDS complement(1107922..1108578) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1017C" /product="Heat shock protein 22.5 (Hsp22.5)" /note="Mb1017c, -, len: 218 aa. Equivalent to Rv0990c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 218 aa overlap). Hypothetical unknown protein. Mb1017c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR013974" /db_xref="UniProtKB/TrEMBL:A0A1R3XX16" /protein_id="SIT99616.1" /translation="MAESSLNPSLVSRISAFLRPDWTRTVRARRFAAAGLVMLAGVAA LRSNPEDDRAEVVVAAHDLRPGTALTPGDVRLEKRSATTLPDGSQADLDAVVGSTLAS PTRRGEVLTDVRLLGSRLAESTAGPDARIVPLHLADSALVDLVRVGDVVDVLAAPVTD SPAALRLLATDAIVVLVSAQQKAQAADSDRVVLVALPARLANTVAGAALGQTVTLTLH " CDS complement(1108651..1108983) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1018C" /product="conserved serine rich protein" /note="Mb1018c, -, len: 110 aa. Equivalent to Rv0991c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 110 aa overlap). Conserved hypothetical ser-rich protein (especially in C-terminus), highly similar to N-terminus of NP_301255.1|NC_002677 conserved hypothetical protein (Ser-rich C-terminus) from Mycobacterium leprae (99 aa). Also highly similar to SCE22.04|AB90971.1|AL355832 hypothetical protein from Streptomyces coelicolor (110 aa); and similar to others. Protein product from Mb1018c detected using shotgun mass spectrometry. Mb1018c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013429" /db_xref="UniProtKB/TrEMBL:A0A1R3XX18" /protein_id="SIT99617.1" /translation="MPTYSYECTQCANRFDVVQAFTDDALTTCERCSGRLRKLFNAVG VVFKGTGFYRTDSRESGKKSKSQTNGSSTSESTKSSGSSGSSGSSESKASGSTEKSTS STTAAAAV" CDS complement(1109057..1109650) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1019C" /product="5-formyltetrahydrofolate cyclo-ligase (EC" /EC_number="6.3.3.2" /note="Mb1019c, -, len: 197 aa. Equivalent to Rv0992c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 197 aa overlap). Conserved hypothetical protein, equivalent to NP_301256.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (197 aa). Also similar, except in N-terminus, to other hypothetical proteins and ligases e.g. SCE87.34|CAB59679.1|AL132674 hypothetical protein from Streptomyces coelicolor (204 aa); NP_461977.1|NC_003197 putative ligase from Salmonella typhimurium (182 aa); P09160|YGFA_ECOLI HYPOTHETICAL 21.1 kDa PROTEIN from Escherichia coli (182 aa), FASTA scores: opt: 191, E(): 1.1e-09, (29.5% identity in 146 aa overlap); etc. Protein product from Mb1019c detected using shotgun mass spectrometry. Mb1019c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX02" /db_xref="InterPro:IPR002698" /db_xref="InterPro:IPR024185" /db_xref="InterPro:IPR037171" /db_xref="UniProtKB/TrEMBL:A0A1R3XX02" /protein_id="SIT99618.1" /translation="MAIASKSALRDQLLAARRRVADDVRAAEARMLRGHLERMVTSDS TVCAYVPVGGEPGSIEMLDVLLRRAGRVLLPVARTAGGDLPLPLRWGEYRAGGLARAR WGLLEPPEPWLPEAALAQASLVLVPALAVDRQGVRLGRGRGFYDRSLRCRDPHARLVA VVRTVELVDVLPSEPHDVPMTHALTPERGLIALPCGE" CDS 1109751..1110671 /codon_start=1 /transl_table=11 /gene="galU" /locus_tag="BQ2027_MB1020" /product="utp--glucose-1-phosphate uridylyltransferase galu (udp-glucose pyrophosphorylase) (udpgp) (alpha-d-glucosyl-1-phosphate uridylyltransferase) (uridine diphosphoglucose pyrophosphorylase)" /note="Mb1020, galU, len: 306 aa. Equivalent to Rv0993, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 306 aa overlap). Probable galU, UTP--glucose-1-phosphate uridylyltransferase (EC 2.7.7.9), equivalent to AL035500|MLCL373_22 putative UTP-glucose-1-phosphate uridylyltransferase from Mycobacterium leprae (306 aa), FASTA score: (89.7% identity in 302 aa overlap). Also highly similar to others e.g. AB59678.1|AL132674 UTP-glucose-1-phosphate uridylyltransferase from Streptomyces coelicolor (303 aa); NP_244519.1|NC_002570 UTP-glucose-1-phosphate uridylyltransferase from Bacillus halodurans (297 aa); P25520|GALU_ECOLI|B1236|Z2012|ECS17 UTP--glucose-1-phosphate uridylyltransferase from Escherichia coli strains K12 and O157:H7 (301 aa), FASTA scores: opt: 624, E(): 2.4e-33, (38.8% identity in 299 aa overlap); etc. BELONGS TO THE PROKARYOTIC UDPGP FAMILY. Protein product from Mb1020 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1020 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX25" /db_xref="InterPro:IPR005771" /db_xref="InterPro:IPR005835" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3XX25" /protein_id="SIT99619.1" /translation="MSRPEVLTPFTAIVPAAGLGTRFLPATKTVPKELLPVVDTPGIE LVAAEAAAAGAERLVIVTSEGKDGVVAHFVEDLVLEGTLEARGKIAMLAKVRRAPALI KVESVVQAEPLGLGHAIGCVEPTLSPDEDAVAVLLPDDLVLPTGVLETMSKVRASRGG TVLCAIEVAREEISAYGVFDVEPVPDGDYTDDPNVLKVRGMVEKPKAETAPSRYAAAG RYVLDRAIFDALRRIDRGAGGEVQLTDAIALLIAEGHPVHVVVHQGSRHDLGNPGGYL KAAVDFALDRDDYGPDLRRWLVARLGLTEQ" CDS 1110748..1112028 /codon_start=1 /transl_table=11 /gene="moeA1" /locus_tag="BQ2027_MB1021" /product="PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEA1" /note="Mb1021, moeA1, len: 426 aa. Equivalent to Rv0994, len: 426 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 426 aa overlap). Probable moeA1, molybdenum cofactor biosynthesis protein, equivalent to AL035500|MLCL373_23 putative molybdopterin biosynthesis protein from Mycobacterium leprae (424 aa), FASTA score: (88.3% identity in 426 aa overlap). Also highly similar to many e.g. CAB59677.1|AL132674 molybdopterin biosynthesis protein from Streptomyces coelicolor (424 aa); NP_385769.1|NC_003047 PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Sinorhizobium meliloti (406 aa); P12281|MOEA_ECOLI molybdopterin biosynthesis moea protein from Escherichia coli (411 aa), FASTA scores: opt: 519, E(): 1.3e-24, (32.3% identity in 402 aa overlap); etc. Also similar to MOEA2|Rv0438c|MTV037.02c PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis (405 aa). Note that previously known as moeA. Protein product from Mb1021 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1021 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX20" /db_xref="InterPro:IPR001453" /db_xref="InterPro:IPR005110" /db_xref="InterPro:IPR005111" /db_xref="InterPro:IPR036135" /db_xref="InterPro:IPR036425" /db_xref="InterPro:IPR036688" /db_xref="InterPro:IPR038987" /db_xref="UniProtKB/TrEMBL:A0A1R3XX20" /protein_id="SIT99620.1" /translation="MRSVEEQQARISAAAVAPRPIRVAIAEAQGLMCAEEVVTERPMP GFDQAAIDGYAVRSVDVAGVGDTGGVQVFADHGDLDGRDVLTLPVMGTIEAGARTLSR LQPRQAVRVQTGAPLPTLADAVLPLRWTDGGMSRVRVLRGAPSGAYVRRAGDDVQPGD VAVRAGTIIGAAQVGLLAAVGRERVLVHPRPRLSVMAVGGELVDISRTPGNGQVYDVN SYALAAAGRDAGAEVNRVGIVSNDPTELGEIVEGQLNRAEVVVIAGGVGGAAAEAVRS VLSELGEMEVVRVAMHPGSVQGFGQLGRDGVPTFLLPANPVSALVVFEVMVRPLIRLS LGKRHPMRRIVSARTLSPITSVAGRKGYLRGQLMRDQDSGEYLVQALGGAPGASSHLL ATLAEANCLVVVPTGAEQIRTGEIVDVAFLAQHG" CDS 1112091..1112702 /codon_start=1 /transl_table=11 /gene="rimJ" /locus_tag="BQ2027_MB1022" /product="ribosomal-protein-alanine acetyltransferase rimj (acetylating enzyme for n-terminal of ribosomal protein s5)" /note="Mb1022, rimJ, len: 203 aa. Equivalent to Rv0995, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 203 aa overlap). Possible rimJ, ribosomal-protein-alanine acetyltransferase (EC 2.3.1.128), equivalent to AL035500|MLCL373_24 probable acyltransferase from Mycobacterium leprae (218 aa), FASTA scores: (86.0% identity in 200 aa overlap). Also similar to others and many acyltransferases e.g. BAB69252.1|AB070946 possible acyltransferase from Streptomyces avermitilis (156 aa); NP_385025.1|NC_003047 PROBABLE RIBOSOMAL-PROTEIN-ALANINE ACETYLTRANSFERASE from Sinorhizobium meliloti (203 aa); P09454|RIMJ_ECOLI|B1066|Z1703|ECS1444 ribosomal-protein-alanine acetyltransferase from Escherichia coli strains K12 and O157:H7 (194 aa), FASTA scores: opt: 247, E(): 1.5e-10, (26.9% identity in 186 aa overlap). SEEMS TO BELONG TO THE ACETYLTRANSFERASE FAMILY, RIMJ SUBFAMILY. Protein product from Mb1022 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1022 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX19" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3XX19" /protein_id="SIT99621.1" /translation="MAVGPLRVSAGVIRLRPVRMRDGVHWSRIRLADRAHLEPWEPSA DGEWTVRHTVAAWPAVCSGLRSEARNGRMLPYVIELDGQFCGQLTIGNVTHGALRSAW IGYWVPSAATGGGVATGALALGLDHCFGPVMLHRVEATVRPENAASRAVLAKVGFREE GLLRRYLEVDRAWRDHLLMAITVEEVYGSVASTLVRAGHASWP" CDS 1112863..1113939 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1023" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb1023, -, len: 358 aa. Equivalent to Rv0996, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 358 aa overlap). Probable conserved transmembrane protein, equivalent to AL035500|MLCL373_25 putative membrane protein from Mycobacterium leprae (342 aa), FASTA scores: (66.4% identity in 360 aa overlap). Contains possible signal sequence and other hydrophobic domains. Mb1023 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ60" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ60" /protein_id="SIT99622.1" /translation="MPSIPQSLLWISLVVLWLFVLVPMLISKRDAVRRTSDVALATRV LNGGAGARLLKRGGPAAGHRWGYLPPEGQGDDPDWKPEEDWRDDPVEGGFADVEHDID EDQEADDARRRGAVVMKVAAPQTAGADEPDYLDVDVVEEDSEALPVGAGAAVGESADE ADAEAADGVAGHADPEADPVEYEYEYEYVEDTCGLELEEDDQEAPPTVASGTSRRRRF DTKTAAAVSARKYTFRKRALIVMAVILVGSAAAAFELTPVAWWICGSATGVTVLYLAY LRRQTRIEEKVRRRRMQRIARARLGVQNTRDREYDVVPSRLRRPGAVVLEIDDEDPIF THLESAAPIRNYGWPRDLPRAVGQ" tRNA 1113956..1114028 /locus_tag="BQ2027_ALAV" /product="tRNA-Ala" /note="alaV, len: 73 nt. Equivalent to alaV, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Ala, anticodon cgc." CDS 1114340..1114591 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1023A" /note="unnamed protein product; Mb1023A, len: 83 aa. No equivalent in M. tuberculosis H37Rv. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions,Mb1023A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3XY17" /protein_id="SIT99623.1" /translation="MSTKYYLQKVPVEAVQPGFSLAIPHDGDYRLFQVDCTQMCQRSG QPVMIRLMSESVDGGQPWVLEYEAGTAVIRLLGVCQAAS" CDS 1114772..1115203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1024" /product="HYPOTHETICAL PROTEIN" /note="Mb1024, -, len: 143 aa. Equivalent to Rv0997, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 143 aa overlap). Hypothetical unknown protein, equivalent to AAK45276.1 from Mycobacterium tuberculosis strain CDC1551 (87 aa) but longer 56 aa. Mb1024 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX36" /protein_id="SIT99624.1" /translation="MAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECF VAEWHHAGVAADMTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTD IEHSVGAAEVQRHRGAVPLGSGGDAAGKVEGGRTPQPFVQP" CDS 1115227..1116228 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1025" /product="Protein lysine acetyltransferase Pat (EC , Mycobacterial type" /EC_number="2.3.1.-" /note="Mb1025, -, len: 333 aa. Equivalent to Rv0998, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 333 aa overlap). Conserved hypothetical protein, possibly cyclic nucleotide-dependent protein kinase (EC 2.7.-.-), highly similar to NP_301261.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (353 aa); and AL035500|MLCL373.38|T45457 hypothetical protein from Mycobacterium leprae (143 aa), FASTA score: (61.5% identity in 143 aa overlap). Also similar to many hypothetical proteins and cyclic-NMP-dependent protein kinases (generally at C-terminus) e.g. N-terminus of SC9B10.09|T35878 hypothetical protein from Streptomyces coelicolor (1039 aa); P05987|KAPR_DICDI CAMP-DEPENDENT PROTEIN KINASE REGULATORY CHAIN (EC 2.7.1.37) from Dictyostelium discoideum (327 aa), FASTA scores: opt: 177, E(): 0.00036, (32.0% identity in 122 aa overlap); NP_104403.1|NC_002678 hypothetical protein (contains similarity to cAMP-dependent protein kinase regulatory subunit) from Mesorhizobium loti (151 aa); etc. Contains PS00889 Cyclic nucleotide-binding domain signature 2. Protein product from Mb1025 detected using SWATH mass spectrometry. Mb1025 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXE5" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR018488" /db_xref="InterPro:IPR018490" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE5" /protein_id="SIT99625.1" /translation="MDGIAELTGARVEDLAGMDVFQGCPAEGLVSLAASVQPLRAAAG QVLLRQGEPAVSFLLISSGSAEVSHVGDDGVAIIARALPGMIVGEIALLRDSPRSATV TTIEPLTGWTGGRGAFATMVHIPGVGERLLRTARQRLAAFVSPIPVRLADGTQLMLRP VLPGDRERTVHGHIQFSGETLYRRFMSARVPSPALMHYLSEVDYVDHFVWVVTDGSDP VADARFVRDETDPTVAEIAFTVADAYQGRGIGSFLIGALSVAARVDGVERFAARMLSD NVPMRTIMDRYGAVWQREDVGVITTMIDVPGPGELSLGREMVDQINRVARQVIEAVG" CDS 1116246..1117004 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1026" /product="unknown protein" /note="Mb1026, -, len: 252 aa. Equivalent to Rv0999, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 252 aa overlap). Hypothetical unknown protein. Protein product from Mb1026 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1026 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041313" /db_xref="UniProtKB/TrEMBL:A0A1R3XX26" /protein_id="SIT99626.1" /translation="MRPPLAPQFAADLLVKTVSTLRSSGAALGRLTTMRKAVLAVGSV CWLVGCSSGASSTTASTGDIAKVAEVKSGFGPEYTVTDVTPRAIDPGFFSARKLPDGL SFDPANCAQVAAGPQLPTGLQGNMAAVSAEGNGNRFVVIAVETSQPLPAPSPGKDCSK VTFSGTQLRGGIEVVDVPHIDGTQTLGVHRVLQAVVGGSARTGELYDYSARFGDYQVI VIANPLVIPGRPVARVDTQRARDLLVQAVAAVRG" CDS complement(1117010..1117627) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1027C" /product="Alkylated DNA repair protein AlkB" /note="Mb1027c, -, len: 205 aa. Equivalent to Rv1000c, len: 205 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 205 aa overlap). Conserved hypothetical protein, equivalent to ML0190|NP_301263.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (205 aa). Also highly similar to SC5F8.12c|CAB93740.1|AL357613 hypothetical protein from Streptomyces coelicolor (210 aa), FASTA scores: E(): 2.1e-45, (56.8% identity); 9106290|AAF84108.1|AE003963_5|NP_298588.1|NC_002488 protein described as DNA repair system specific for alkylated DNA from Xylella fastidiosa (200 aa), FASTA scores: E(): 3.4e-14, (38.55% identity); and similar in C-terminus to other hypothetical proteins. Note that replaces original Rv1000 predicted on other strand. Mb1027c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX27" /db_xref="InterPro:IPR005123" /db_xref="InterPro:IPR027450" /db_xref="InterPro:IPR032854" /db_xref="InterPro:IPR037151" /db_xref="UniProtKB/TrEMBL:A0A1R3XX27" /protein_id="SIT99627.1" /translation="MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEEL LDALLSTVPWRAERRQMYDRVVDVPRLVSFHDLTIEDPPHPQLARMRRRLNDIYGGEL GEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFALRPRGRGPSL RLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFRPRDVR" CDS 1117664..1118872 /codon_start=1 /transl_table=11 /gene="arcA" /locus_tag="BQ2027_MB1028" /product="PROBABLE ARGININE DEIMINASE ARCA (ADI) (AD) (ARGININE DIHYDROLASE)" /note="Mb1028, arcA, len: 402 aa. Equivalent to Rv1001, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 402 aa overlap). Probable arcA, arginine deiminase (EC 3.5.3.6), similar to e.g. ARCA_PSEAE|P13981 arginine deiminase (417 aa), fasta scores: opt: 581, E(): 1.4e-31, (39.4% identity in 411 aa overlap); also similar to SAGP_STRPY|P16962 streptococcal acid glycoprotein (410 aa), FASTA scores, opt: 823, E():0, (38.3% identity in 402 aa overlap). BELONGS TO THE ARGININE DEIMINASE FAMILY. Protein product from Mb1028 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1028 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63552" /db_xref="InterPro:IPR003876" /db_xref="UniProtKB/Swiss-Prot:P63552" /protein_id="SIT99628.1" /translation="MGVELGSNSEVGALRVVILHRPGAELRRLTPRNTDQLLFDGLPW VSRAQDEHDEFAELLASRGAEVLLLSDLLTEALHHSGAARMQGIAAAVDAPRLGLPLA QELSAYLRSLDPGRLAHVLTAGMTFNELPSDTRTDVSLVLRMHHGGDFVIEPLPNLVF TRDSSIWIGPRVVIPSLALRARVREASLTDLIYAHHPRFTGVRRAYESRTAPVEGGDV LLLAPGVVAVGVGERTTPAGAEALARSLFDDDLAHTVLAVPIAQQRAQMHLDTVCTMV DTDTMVMYANVVDTLEAFTIQRTPDGVTIGDAAPFAEAAAKAMGIDKLRVIHTGMDPV VAEREQWDDGNNTLALAPGVVVAYERNVQTNARLQDAGIEVLTIAGSELGTGRGGPRC MSCPAARDPL" CDS complement(1118907..1120418) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1029C" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1029c, -, len: 503 aa. Equivalent to Rv1002c, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 503 aa overlap). Conserved membrane protein. Similar to AL132674|SCE87.05 hypothetical protein from Streptomyces coelicolor (591 aa), FASTA scores: opt: 666, E(): 0, (39.0% identity in 546 aa overlap); weakly similar to TSCC_PSEAM|P55019 thiazide-sensitive sodium-chloride cotransporter from Pseudopleuronectes americanus (1023 aa), FASTA scores: opt: 44, E(): 4.2e-06, (22.4% identity in 326 aa overlap). Protein product from Mb1029c detected using SWATH mass spectrometry. Mb1029c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX38" /db_xref="InterPro:IPR003342" /db_xref="InterPro:IPR027005" /db_xref="InterPro:IPR032421" /db_xref="UniProtKB/TrEMBL:A0A1R3XX38" /protein_id="SIT99629.1" /translation="MVPVVSPGPLVPVADFGPLDRLRGWIVTGLITLLATVTRFLNLG SLTDAGTPIFDEKHYAPQAWQVLNNHGVEDNPGYGLVVHPPVGKQLIAIGEAIFGYNG FGWRFTGALLGVLLVALVVRIVRRISRSTLVGAIAGVLLICDGVSFVTARTALLDGFL TFFVVAAFGALIVDRDQVRERMHIALLAGRSAATVWGPRVGVRWWRFGAGVLLGLACA TKWSGVYFVLFFGAMALAFDVAARRQYQVQRPWLGTVRRDVLPSGYALGLIPFAVYLA TYAPWFASETAIDRHAVGQAVGRNSVVPLPDAVRSLWHYTAKAFHFHAGLTNSAGNYH PWESKPWTWPMSLRPVLYAIDQQDVAGCGAQSCVKAEMLVGTPAMWWLAVPVLAYAGW RMFVRRDWRYAVVLVGYCAGWLPWFADIDRQMYFFYAATMAPFLVMGISLVLGDILYH PGQGSERRTLGLIVVCCYVALVVTNFAWLYPVLTGLPISQQTWNLEIWLPSWR" CDS 1120501..1121358 /codon_start=1 /transl_table=11 /gene="rsmI" /locus_tag="BQ2027_MB1030" /product="16S rRNA (cytidine(1402)-2'-O)-methyltransferase (EC" /EC_number="2.1.1.198" /note="Mb1030, -, len: 285 aa. Equivalent to Rv1003, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 285 aa overlap). Conserved hypothetical protein, similar to others e.g. AL132674|SCE87.04 Streptomyces coelicolor (286 aa), FASTA scores: opt: 877, E(): 0, (53.2% identity in 280 aa overlap); and YRAL_ECOLI|P45528 hypothetical 31.3 kd protein (286 aa), FASTA scores: opt: 561, E(): 4.4e-27, (36.9% identity in 279 aa overlap). Protein product from Mb1030 detected using SWATH mass spectrometry." /db_xref="GOA:P0A641" /db_xref="InterPro:IPR000878" /db_xref="InterPro:IPR008189" /db_xref="InterPro:IPR014776" /db_xref="InterPro:IPR014777" /db_xref="InterPro:IPR018063" /db_xref="InterPro:IPR035996" /db_xref="UniProtKB/Swiss-Prot:P0A641" /protein_id="SIT99630.1" /translation="MSSGRLLLGATPLGQPSDASPRLAAALATADVVAAEDTRRVRKL AKALDIRIGGRVVSLFDRVEALRVTALLDAINNGATVLVVSDAGTPVISDPGYRLVAA CIDAGVSVTCLPGPSAVTTALVMSGLPAEKFCFEGFAPRKGAARRAWLAELAEERRTC VFFESPRRLAACLNDAVEQLGGARPAAICRELTKVHEEVVRGSLDELAIWAAGGVLGE ITVVVAGAAPHAELSSLIAQVEEFVAAGIRVKDACSEVAAAHPGVRTRQLYDAVLQSR RETGGPAQP" CDS complement(1121368..1122627) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1031C" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1031c, -, len: 419 aa. Equivalent to Rv1004c, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 419 aa overlap). Probable membrane protein. Contains repetitive sequences, which have similarities with elastin, and possible N-terminal signal sequence. Mb1031c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XX29" /protein_id="SIT99631.1" /translation="MSISCRVREGFVMRLAIVGTAAAAAIGGTLAVAPLTLSTPERVA GGTCSAGQQCDRLAAVLMPDTATPSGPAAAEHAVPAPFEPVADTIAPGLVPRPGVPAA AAVPRVGPPAVPGLPNIPGAAGPALPPPPALPNLAAPSVPGVGIPGIGIPGIGIPGIG IPGVPDPITGVNTAAAVVNGVLGVGGTAAGVVTASAVAVTYLVLAVNALESSGILPTA RGTASTVASLLLPGAQSAAAALPAVGLPALPGVTPASLLAMAAAAGLPGVGFPSLPGV SPTDLMAMAAAAGLPTSLPGLAGMSPAELTALVAGGLPMLAAAGLPAGLAGVDPATLA AALPALAAGGLPPGLPALPGVDPAALAAALPALAAGLPALPAGLPPLPAVPALPAPPP LPGPPPLPALPSRLCTPGFGPIGVCIP" CDS complement(1122701..1124137) /codon_start=1 /transl_table=11 /gene="pabB" /locus_tag="BQ2027_MB1032C" /product="Probable para-aminobenzoate synthase component I PABD" /note="Mb1032c, pabB, len: 478 aa. Similar to Rv1005c, len: 458 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 458 aa overlap). Probable PabD, para-aminobenzoate synthase component I (EC 4.1.3.-). Similar to PABB_ECOLI|P05041 para-aminobenzoate synthase component I from Escherichia coli (453 aa), FASTA scores: opt: 589, E(): 1.8e-27, (40.7% identity in 268 aa overlap). Similar to Mycobacterium tuberculosis Rv1609, Rv3215, Rv2386c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (g-a) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (478 aa versus 458 aa). Mb1032c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ71" /db_xref="InterPro:IPR005801" /db_xref="InterPro:IPR005802" /db_xref="InterPro:IPR015890" /db_xref="InterPro:IPR019999" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ71" /protein_id="SIT99632.1" /translation="MALAARTATPPGTETGAGHGSNLAWELSTRTKSPRSHLRCENPQ FCQARTVRIDRLGDLGGAPAVLRAVGRATSRLDLPPPAALTGEWFGALAVIAPSVSIQ PVSGDDVFSGPPGTGGPDATGAVGGGWVGYLSYPDAGADGRPHRIPEAAGGWTDCVLR RDRDGQWWYESLSGAPIADWLASALATTRASVARPAPACRIDWEPADRAAHRDGVLAC LEAIGAGEVYQACVCTQFAGTVTGSPLDFFIDGFGRTAPSRSAFVAGPWGAVASLSPE LFLRRRGSVVTSSPIKGTLPLDAPPSALRASAKEVAENIMIVDLVRNDLGRVAVTGTV TVPELLVVRPAPGVWHLVSTVSARVPLEEPMSALLDAAFPPASVTGTPKLRARQLISQ WERYRRGIYCGTVGLASPVAGCELNVAIRTVEFDTAGNAVLGVGGGITADSDPDAEWA ECLHKAAPIVGLPAATRTTPARLASKVR" CDS 1124193..1125896 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1033" /product="unknown protein" /note="Mb1033, -, len: 567 aa. Equivalent to Rv1006, len: 567 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 567 aa overlap). Hypothetical unknown protein. Protein product from Mb1033 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1033 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3XY25" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99633.1" /translation="MVLRSRKSTLGVVVCLALVLGGPLSGCSSSASHRGPLNAMGSPA IPSTAQEIPNPLRGQYEDLMEPLFPQGNPAQQRYPPWPASYDASLRVSWRQLQPTDPR TLPPDAPDDRKYDFSVIDNALTRLADRGMRLTLRVYAYSSCCKASYPDGTNIAIPDWE RAIASTNTSYPGPATDPSTGVVQVVPNFNDSTYLNDFAQLLAALGRRYDGDERLSVFE FSGYGDFSENHVAYLRDTLGAPGPGPDESVATLGYYSQFRDQNITTASIKQLIAANVS AFPHTQLVTSPANPEIVRELFADEVTNKLAAPVGVRSDCLGVDAPLPAWAESSTSHYV QTKDPVVAALRQRLATAPVITEWCELPTGSSPRAYYEKGLRDVIRYHVSMTSSVNFPD QTATSPMDPALYLVWAQANAAAGYRYSVEAQPGSQALAGKVATISVTWTNYGAAAATE KWVPGYRLVDSTGQVVRTLPAAVDLKTLVSDQRGDRSSDQPTPASVAETVRVDLSGLP AGHYTLRAAIDWQQHKPNGSHVVNYPSMLLSRDGRDDSGFYPVATLDIPRDAQTAVNA S" CDS complement(1125923..1127482) /codon_start=1 /transl_table=11 /gene="metS" /locus_tag="BQ2027_MB1034C" /product="methionyl-trna synthetase mets (metrs) (methionine--trna ligase)" /note="Mb1034c, metS, len: 519 aa. Equivalent to Rv1007c, len: 519 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 519 aa overlap). Probable metS (MetG), methionyl-tRNA synthetase (EC 6.1.1.10), similar to many e.g. SYM_BACSU|P37465 methionyl-tRNA synthetase from Bacillus subtilus (664 aa), FASTA scores: opt: 1506, E(): 0, (44.9% identity in 492 aa overlap); similar to other M. tuberculosis tRNA synthases e.g. Rv2448c, Rv1536, Rv0041. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. STRONG, TO CYSTEINYL-TRNA SYNTHETASE. Protein product from Mb1034c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1034c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59952" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR014758" /db_xref="InterPro:IPR015413" /db_xref="InterPro:IPR023457" /db_xref="InterPro:IPR033911" /db_xref="InterPro:IPR041872" /db_xref="UniProtKB/Swiss-Prot:P59952" /protein_id="SIT99634.1" /translation="MKPYYVTTAIAYPNAAPHVGHAYEYIATDAIARFKRLDGYDVRF LTGTDEHGLKVAQAAAAAGVPTAALARRNSDVFQRMQEALNISFDRFIRTTDADHHEA SKELWRRMSAAGDIYLDNYSGWYSVRDERFFVESETQLVDGTRLTVETGTPVTWTEEQ TYFFRLSAYTDKLLAHYHANPDFIAPETRRNEVISFVSGGLDDLSISRTSFDWGVQVP EHPDHVMYVWVDALTNYLTGAGFPDTDSELFRRYWPADLHMIGKDIIRFHAVYWPAFL MSAGIELPRRIFAHGFLHNRGEKMSKSVGNIVDPVALAEALGVDQVRYFLLREVPFGQ DGSYSDEAIVTRINTDLANELGNLAQRSLSMVAKNLDGRVPNPGEFADADAALLATAD GLLERVRGHFDAQAMHLALEAIWLMLGDANKYFSVQQPWVLRKSESEADQARFRTTLY VTCEVVRIAALLIQPVMPESAGKILDLLGQAPNQRSFAAVGVRLTPGTALPPPTGVFP RYQPPQPPEGK" CDS 1127568..1128362 /codon_start=1 /transl_table=11 /gene="tatD" /locus_tag="BQ2027_MB1035" /product="Probable deoxyribonuclease TatD (YjjV protein)" /note="Mb1035, tatD, len: 264 aa. Equivalent to Rv1008, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 264 aa overlap). Probable tatD (alternate gene name: yjjV), deoxyribonuclease (EC 3.1.21.-), component of twin arginine translocation protein export system (see citation below for more information). Similar to many members of the YBL055C/YJJV family e.g. YCFH_ECOLI|P37346 Putative deoxyribonuclease ycfH (EC 3.1.21.-) (265 aa), fasta scores: opt: 487, E(): 1.4e-24, (36.7% identity in 270 aa overlap). Also similar to P37545|YABD_BACSU Putative deoxyribonuclease yabD (255 aa), FASTA scores: opt: 599, E(): 7.7e-33, (40.1% identity in 262 aa overlap). Contains PS01137 Hypothetical YBL055c/yjjV family signature 1, and PS01091 Hypothetical YBL055c/yjjV family signature 3. Protein product from Mb1035 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XXF5" /db_xref="InterPro:IPR001130" /db_xref="InterPro:IPR015991" /db_xref="InterPro:IPR018228" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF5" /protein_id="SIT99635.1" /translation="MVDAHTHLDACGARDADTVRSLVERAAAAGVTAVVTVADDLESA RWVTRAAEWDRRVYAAVALHPTRADALTDAARAELERLVAHPRVVAVGETGIDMYWPG RLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLRAEGAPDTVILHCF SSDAAMARTCVDAGWLLSLSGTVSFRNARELREAVPLMPVEQLLVETDAPYLTPHPHR GLANEPYCLPYTVRALAELVNRRPEEVALITTSNARRAYGLGWMRQ" CDS 1128570..1129658 /codon_start=1 /transl_table=11 /gene="rpfB" /locus_tag="BQ2027_MB1036" /product="Probable resuscitation-promoting factor rpfB" /note="Mb1036, rpfB, len: 362 aa. Equivalent to Rv1009, len: 362 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 362 aa overlap). Probable rpfB, resuscitation-promoting factor (see citation below), similar to others from Mycobacterium tuberculosis: Rv2450c|MTV008.06c|RPFE PROBABLE RESUSCITATION-PROMOTING FACTOR (172 aa), FASTA scores: E(): 1.9e-19, (42.9% identity in 147 aa overlap); Rv0867c|RPFA, Rv1884c|RPFC, and Rv2389c|RPFD. Possible lipoprotein; contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb1036 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007137" /db_xref="InterPro:IPR010618" /db_xref="InterPro:IPR011098" /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3XX37" /protein_id="SIT99636.1" /translation="MLRLVVGALLLVLAFAGGYAVAACKTVTLTVDGTAMRVTTMKSR VIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVWTTAST VDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVA GLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEM NMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEV PPVIDESIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPRADLATRE EQIAVAEVTRLRQGWGAWPVCAVRAGAR" CDS 1129631..1130584 /codon_start=1 /transl_table=11 /gene="ksgA" /locus_tag="BQ2027_MB1037" /product="PROBABLE DIMETHYLADENOSINE TRANSFERASE KSGA (S-adenosylmethionine-6-N', N'-adenosyl(rRNA) dimethyltransferase) (16S rRNA dimethylase) (High level kasugamycin resistance protein ksgA) (Kasugamycin dimethyltransferase)" /note="Mb1037, ksgA, len: 317 aa. Equivalent to Rv1010, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 317 aa overlap). Probable ksgA, dimethyladenosine transferase (EC 2.1.1.-), similar to many e.g. KSGA_BACSU|P37468 dimethyladenosine transferase from Bacillus subtilus (292 aa), FASTA scores: opt: 524, E(): 1.5e-28, (37.2% identity in 274 aa overlap); similar to Mycobacterium tuberculosis hypothetical protein Rv1988. Contains PS01131 Ribosomal RNA adenine dimethylases signature. Protein product from Mb1037 detected using SWATH mass spectrometry. Mb1037 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66661" /db_xref="InterPro:IPR001737" /db_xref="InterPro:IPR011530" /db_xref="InterPro:IPR020596" /db_xref="InterPro:IPR020598" /db_xref="InterPro:IPR023165" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P66661" /protein_id="SIT99637.1" /translation="MCCTSGCALTIRLLGRTEIRRLAKELDFRPRKSLGQNFVHDANT VRRVVAASGVSRSDLVLEVGPGLGSLTLALLDRGATVTAVEIDPLLASRLQQTVAEHS HSEVHRLTVVNRDVLALRREDLAAAPTAVVANLPYNVAVPALLHLLVEFPSIRVVTVM VQAEVAERLAAEPGSKEYGVPSVKLRFFGRVRRCGMVSPTVFWPIPRVYSGLVRIDRY ETSPWPTDDAFRRRVFELVDIAFAQRRKTSRNAFVQWAGSGSESANRLLAASIDPARR GETLSIDDFVRLLRRSGGSDEATSTGRDARAPDISGHASAS" CDS 1130670..1131590 /codon_start=1 /transl_table=11 /gene="ispE" /locus_tag="BQ2027_MB1038" /product="Probable 4-diphosphocytidyl-2-C-methyl-D- erythritol kinase ISPE (CMK) (4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase)" /note="Mb1038, ispE, len: 306 aa. Equivalent to Rv1011, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 306 aa overlap). Probable ispE, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (EC 2.7.1.-), similar to others e.g. Q9K3R6|ISPE_STRCO Streptomyces coelicolor (299 aa), FASTA scores: opt: 925, E(): 2.7e-49, (54.5% identity in 297 overlap); etc. BELONGS TO THE ISPE FAMILY. Protein product from Mb1038 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1038 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65179" /db_xref="InterPro:IPR004424" /db_xref="InterPro:IPR006204" /db_xref="InterPro:IPR013750" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR036554" /db_xref="UniProtKB/Swiss-Prot:P65179" /protein_id="SIT99638.1" /translation="MPTGSVTVRVPGKVNLYLAVGDRREDGYHELTTVFHAVSLVDEV TVRNADVLSLELVGEGADQLPTDERNLAWQAAELMAEHVGRAPDVSIMIDKSIPVAGG MAGGSADAAAVLVAMNSLWELNVPRRDLRMLAARLGSDVPFALHGGTALGTGRGEELA TVLSRNTFHWVLAFADSGLLTSAVYNELDRLREVGDPPRLGEPGPVLAALAAGDPDQL APLLGNEMQAAAVSLDPALARALRAGVEAGALAGIVSGSGPTCAFLCTSASSAIDVGA QLSGAGVCRTVRVATGPVPGARVVSAPTEV" CDS 1131587..1131901 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1040" /product="HYPOTHETICAL PROTEIN" /note="Mb1040, -, len: 104 aa. Equivalent to the 3' end of Rv1012, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (98.4% identity in 63 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1012 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-g) leads to a different NH2 terminus. Mb1040 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX48" /protein_id="SIT99639.1" /translation="MTEFLGACLGRPGVSARAEAGGSIGWRTSMPAVGPQASALAEVG GAHQSQAQKPYHDATEPLGENLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVT KL" CDS 1132105..1133739 /codon_start=1 /transl_table=11 /gene="pks16" /locus_tag="BQ2027_MB1041" /product="PUTATIVE POLYKETIDE SYNTHASE PKS16" /note="Mb1041, pks16, len: 544 aa. Equivalent to Rv1013, len: 544 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 544 aa overlap). Putative pks16, polyketide synthase, similar to many e.g. N-terminus of Q50857|U24657 SAFRAMYCIN MX1 SYNTHETASE B (1770 aa), FASTA scores: opt: 526, E(): 1.4e-25, (29.3% identity in 542 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1041 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1041 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX39" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR028154" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XX39" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99640.1" /translation="MSRFTEKMFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAG GLAAAGVGLGDVVGVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDTMT VIGMIEAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVGEDDLALMQLT SGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSWLPCFHDMGMVGFLTIPMF FGAELVKVTPMDFLRDTLLWAKLIDKYQGTMTAAPNFAYALLAKRLRRQAKPGDFDLS TLRFALSGAEPVEPADVEDLLDAGKPFGLRPSAILPAYGMAETTLAVSFSECNAGLVV DEVDADLLAALRRAVPATKGNTRRLATLGPLLQDLEARIIDEQGDVMPARGVGVIELR GESLTPGYLTMGGFIPAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPT DIERAAGRVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIEHQVAHEVV AEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT" CDS complement(1133813..1134388) /codon_start=1 /transl_table=11 /gene="pth" /locus_tag="BQ2027_MB1042C" /product="PROBABLE PEPTIDYL-TRNA HYDROLASE PTH" /note="Mb1042c, pth, len: 191 aa. Equivalent to Rv1014c, len: 191 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 191 aa overlap). Probable pth, peptidyl-tRNA hydrolase (EC 3.1.1.29), similar to PTH_ECOLI|P23932 peptidy l-trna hydrolase from Escherichia coli (194 aa), FASTA scores: opt: 472, E(): 2.3e-25, (39.6% identity in 187 aa overlap). BELONGS TO THE PTH FAMILY. Protein product from Mb1042c detected using SWATH mass spectrometry. Mb1042c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65866" /db_xref="InterPro:IPR001328" /db_xref="InterPro:IPR018171" /db_xref="InterPro:IPR036416" /db_xref="UniProtKB/Swiss-Prot:P65866" /protein_id="SIT99641.1" /translation="MAEPLLVVGLGNPGANYARTRHNLGFVVADLLAARLGAKFKAHK RSGAEVATGRSAGRSLVLAKPRCYMNESGRQIGPLAKFYSVAPANIIVIHDDLDLEFG RIRLKIGGGEGGHNGLRSVVAALGTKDFQRVRIGIGRPPGRKDPAAFVLENFTPAERA EVPTICEQAADATELLIEQGMEPAQNRVHAW" CDS complement(1134401..1135048) /codon_start=1 /transl_table=11 /gene="rplY" /locus_tag="BQ2027_MB1043C" /product="50s ribosomal protein l25 rply" /note="Mb1043c, rplY, len: 215 aa. Equivalent to Rv1015c, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 215 aa overlap). Probable rplY, 50s ribosomal protein L25, similar to RL25_ECOLI|P02426 50s ribosomal protein L25 from Escherichia coli (94 aa), FASTA scores: opt: 182, E(): 2.5e-05, (38.4% identity in 86 aa overlap) and to CTC_BACSU|P14194 general stress protein from Bacillus subtilis (203 aa), FASTA scores: opt: 260, E(): 1.4e-09, (28.4% identity in 201 aa overlap). BELONGS TO THE L25P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb1043c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1043c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66122" /db_xref="InterPro:IPR001021" /db_xref="InterPro:IPR011035" /db_xref="InterPro:IPR020056" /db_xref="InterPro:IPR020057" /db_xref="InterPro:IPR029751" /db_xref="InterPro:IPR037121" /db_xref="UniProtKB/Swiss-Prot:P66122" /protein_id="SIT99642.1" /translation="MAKSASNQLRVTVRTETGKGASRRARRAGKIPAVLYGHGAEPQH LELPGHDYAAVLRHSGTNAVLTLDIAGKEQLALTKALHIHPIRRTIQHADLLVVRRGE KVVVEVSVVVEGQAGPDTLVTQETNSIEIEAEALSIPEQLTVSIEGAEPGTQLTAGQI ALPAGVSLISDPDLLVVNVVKAPTAEELEGEVAGAEEAEEAAVEAGEAEAAGESE" CDS complement(1135182..1135940) /codon_start=1 /transl_table=11 /gene="lpqT" /locus_tag="BQ2027_MB1044C" /product="probable conserved lipoprotein lpqt" /note="Mb1044c, lpqT, len: 252 aa. Equivalent to 5' end of Rv1016c, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 190 aa overlap). Probable lpqT, conserved lipoprotein. Similar to several M. tuberculosis hypothetical proteins e.g. Rv0040c|Y0H3_MYCTU|P71697 Proline rich 28 kDA antigen (310 aa), FASTA scores: opt: 329, E(): 2e-17, (32.3% identity in 229 aa overlap); Rv0583c. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a 5 bp deletion (cacgc-*) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb1044c detected using SWATH mass spectrometry. Mb1044c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0V0" /db_xref="InterPro:IPR019674" /db_xref="UniProtKB/Swiss-Prot:Q7U0V0" /protein_id="SIT99643.1" /translation="MAGRRCPQDSVRPLAVAVAVATLAMSAVACGPKSPDFQSILSTS PTTSAVSTTTEVPVPLWKYLESVGVTGEPVAPSSLTDLTVSIPTPPGWAPMKNPNITP NTEMIAKGESYPTAMLMVFKLHRDFDIAEALKHGTADARLSTNFTELDSSTADFNGFP SSMIQGSYDLHGRRLHTWNRIVFPTGAGQAALPGAAHHHESGQRGRQARFRHRGDHRR IRRRGKVIVRTVGAQLSEQLTRIEFPQCSSHGLA" CDS complement(1135976..1136956) /codon_start=1 /transl_table=11 /gene="prsA" /locus_tag="BQ2027_MB1045C" /product="PROBABLE RIBOSE-PHOSPHATE PYROPHOSPHOKINASE PRSA (Phosphoribosyl pyrophosphate synthetase) (PRPP synthetase)" /note="Mb1045c, prsA, len: 326 aa. Equivalent to Rv1017c, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 326 aa overlap). Probable prsA, ribose-phosphate pyrophosphokinase (EC 2.7.6.1), highly similar to others e.g. KPRS_ECOLI|P08330 ribose-phosphate pyrophosphokinase from Escherichia coli (314 aa), FASTA scores: opt: 826, E(): 0, (43.8% identity in 317 aa overlap). Contains PS00103 Purine/pyrimidine phosphoribosyl transferases signature; contains PS00144 Asparaginase / glutaminase active site signature 1. BELONGS TO THE RIBOSE-PHOSPHATE PYROPHOSPHOKINASE FAMILY. COFACTOR: BOTH INORGANIC PHOSPHATE AND MAGNESIUM ION ARE REQUIRED FOR ENZYME STABILITY AND ACTIVITY (by similarity). Protein product from Mb1045c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1045c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65233" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR000842" /db_xref="InterPro:IPR005946" /db_xref="InterPro:IPR029057" /db_xref="InterPro:IPR029099" /db_xref="InterPro:IPR037515" /db_xref="UniProtKB/Swiss-Prot:P65233" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99644.1" /translation="MSHDWTDNRKNLMLFAGRAHPELAEQVAKELDVHVTSQDAREFA NGEIFVRFHESVRGCDAFVLQSCPAPVNRWLMEQLIMIDALKRGSAKRITAVMPFYPY ARQDKKHRGREPISARLIADLLKTAGADRIVTVDLHTDQIQGFFDGPVDHMRGQNLLT GYIRDNYPDGNMVVVSPDSGRVRIAEKWADALGGVPLAFIHKTRDPRVPNQVVSNRVV GDVAGRTCVLIDDMIDTGGTIAGAVALLHNDGAGDVIIAATHGVLSDPAAQRLASCGA REVIVTNTLPIGEDKRFPQLTVLSIAPLLASTIRAVFENGSVTGLFDGDA" CDS complement(1137048..1138535) /codon_start=1 /transl_table=11 /gene="glmU" /locus_tag="BQ2027_MB1046C" /product="Probable UDP-N-acetylglucosamine pyrophosphorylase glmU" /note="Mb1046c, glmU, len: 495 aa. Equivalent to Rv1018c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 495 aa overlap). Probable glmU, UDP-n-acetylglucosamine pyrophosphorylase (EC 2.7.7.23), similar to GCAD_BACSU|P14192 UDP-n-acetylglucosamine pyrophosphorylase (456 aa), FASTA scores: opt: 1150, E(): 0, (40.0% identity in 453 aa overlap). Similar to various Mycobacterium tuberculosis sugar-phosphate transferases e.g. Rv0334, Rv1213, Rv3264c, etc. Protein product from Mb1046c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1046c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VF00" /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR005882" /db_xref="InterPro:IPR011004" /db_xref="InterPro:IPR025877" /db_xref="InterPro:IPR029044" /db_xref="InterPro:IPR038009" /db_xref="UniProtKB/Swiss-Prot:Q7VF00" /protein_id="SIT99645.1" /translation="MTFPGDTAVLVLAAGPGTRMRSDTPKVLHTLAGRSMLSHVLHAI AKLAPQRLIVVLGHDHQRIAPLVGELADTLGRTIDVALQDRPLGTGHAVLCGLSALPD DYAGNVVVTSGDTPLLDADTLADLIATHRAVSAAVTVLTTTLDDPFGYGRILRTQDHG VMAIVEQTDATPSQREIREVNAGVYAFDIAALRSALSRLSSNNAQQELYLTDVIAILR SDGQTVHASHVDDSALVAGVNNRVQLAQLASELNRRVVAAHQLAGVTVVDPATTWIDV DVTIGRDTVIHPGTQLLGRTQIGGRCVVGPDTTLTDVAVGDGASVVRTHGSSSSIGDG AAVGPFTYLRPGTALGADGKLGAFVEVKNSTIGTGTKVPHLTYVGDADIGEYSNIGAS SVFVNYDGTSKRRTTVGSHVRTGSDTMFVAPVTIGDGAYTGAGTVVREDVPPGALAVS AGPQRNIENWVQRKRPGSPAAQASKRASEMACQQPTQPPDADQTP" tRNA complement(1138517..1138588) /locus_tag="BQ2027_GLNT" /product="tRNA-Gln" /note="glnT, len: 72 nt. Equivalent to glnT, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Gln, anticodon ttg." CDS 1138790..1139383 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1047" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb1047, -, len: 197 aa. Equivalent to Rv1019, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 197 aa overlap). Probable transcriptional regulator, similar to many memebers of the tetR family e.g. MTCY7D11.18c (34.4% identity in 189 aa overlap). Helix turn helix motif from aa 27-48 (+5.42 SD). Protein product from Mb1047 detected using shotgun mass spectrometry. Mb1047 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX46" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3XX46" /protein_id="SIT99646.1" /translation="MTGTERRHQLIGIARSLFAERGYDGTSIEEIAQRANVSKPVVYE HFGGKEGLYAVVVDREMSALLDGITSSLTNNRSRVRVERVALALLTYVEERTDGFRIM IRDSPASISSGTYSSLLNDAVSQVSSILAGDFARRGLDPDLAPLYAQALVGSVSMTAQ WWLDAREPKKEVVAAHLVNLVWNGLTHLEADPRLQDE" CDS 1139442..1143146 /codon_start=1 /transl_table=11 /gene="mfd" /locus_tag="BQ2027_MB1048" /product="PROBABLE TRANSCRIPTION-REPAIR COUPLING FACTOR MFD (TRCF)" /note="Mb1048, mfd, len: 1234 aa. Equivalent to Rv1020, len: 1234 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 1234 aa overlap). Probable mfd, transcription-repair coupling factor, similar to many e.g. MFD_ECOLI|P30958 transcription-repair coupling factor from Escherichia coli (1148 aa), FASTA scores: opt: 1900, E(): 0, (37.9% identity in 1107 aa overlap). Also similar to M. tuberculosis Rv2973c and Rv1633. Contains PS00017 ATP/GTP-binding site motif A (P-loop). IN THE N-TERMINAL SECTION; BELONGS TO THE UVRB FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE HELICASE FAMILY. RECG SUBFAMILY. Protein product from Mb1048 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1048 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64327" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR003711" /db_xref="InterPro:IPR004576" /db_xref="InterPro:IPR005118" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036101" /db_xref="InterPro:IPR037235" /db_xref="InterPro:IPR041471" /db_xref="UniProtKB/Swiss-Prot:P64327" /protein_id="SIT99647.1" /translation="MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIA PASARLLVASALARQGPLLVVTATGREADDLAAELRGVFGDAVALLPSWETLPHERLS PGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGMMEPLTLTVGD ESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAPTAEHPVRVEFWGDEITEM RMFSVADQRSIPEIDIHTLVAFACRELLLSEDVRARAAQLAARHPAAESTVTGSASDM LAKLAEGIAVDGMEAVLPVLWSDGHALLTDQLPDGTPVLVCDPEKVRTRAADLIRTGR EFLEASWSVAALGTAENQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDES AIELDVRAAPSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDT PAGMLDPGQAPKPGVVGVLQGPLRDGVIIPGANLVVITETDLTGSRVSAAEGKRLAAK RRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYLVLEYASAKRGGGAKNT DKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWANTKTKARRAVREIAGELVSLYAKRQ ASPGHAFSPDTPWQAELEDAFGFTETVDQLTAIEEVKADMEKPIPMDRVICGDVGYGK TEIAVRAAFKAVQDGKQVAVLVPTTLLADQHLQTFGERMSGFPVTIKGLSRFTDAAES RAVIDGLADGSVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVD VLTMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAALRRELLR DGQAFYVHNRVSSIDAAAARVRELVPEARVVVAHGQMPEDLLETTVQRFWNREHDILV CTTIVETGLDISNANTLIVERADTFGLSQLHQLRGRVGRSRERGYAYFLYPPQVPLTE TAYDRLATIAQNNELGAGMAVALKDLEIRGAGNVLGIEQSGHVAGVGFDLYVRLVGEA LETYRDAYRAAADGQTVRTAEEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAAS SDREVAAVVDELTDRYGALPEPARRLAAVARLRLLCRGSGITDVTAASAATVRLSPLT LPDSAQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLITALAG KPRQHIGITNPSPPGEDGRGRNTTIKERQP" CDS 1143146..1144123 /codon_start=1 /transl_table=11 /gene="mazG" /locus_tag="BQ2027_MB1049" /product="Nucleoside triphosphate pyrophosphohydrolase MazG (EC" /EC_number="3.6.1.8" /note="Mb1049, -, len: 325 aa. Equivalent to Rv1021, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 325 aa overlap). Conserved hypothetical protein, similar to YBL1_STRCI|P33653 hypothetical 26.1 kd protein from Streptomyces cacaoi (242 aa), FASTA scores: opt: 493, E(): 1.1e-23, (42.9% identity in 238 aa overlap). Protein product from Mb1049 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1049 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004518" /db_xref="InterPro:IPR011551" /db_xref="UniProtKB/TrEMBL:A0A1R3XX33" /protein_id="SIT99648.1" /translation="MIVVLVDPRRPTLVPVEAIEFLRGEVQYTEEMPVAVPWSLPAAR SAHAGNDAPVLLSSDPNHPAVITRLAAGARLISAPDSQRGERLVDAVAMMDKLRTAGP WESEQTHDSLRRYLLEETYELLDAVRSGSVDQLREELGDLLLQVLFHARIAEDASQSP FTIDDVADTLMRKLGNRAPGVLAGESISLEDQLAQWEAAKASEKARKSVADDVHTGQP ALALAQKVIQRAQKAGLPAHLIPDEITSVSVSADVDAENTLRTAVLDFIDRLRCAERA IAVARRGSNVAEQLDVTPLGVITEQEWLAHWPTAVNDSRGGSKKRKGMR" CDS 1144211..1144942 /codon_start=1 /transl_table=11 /gene="lpqU" /locus_tag="BQ2027_MB1050" /product="probable conserved lipoprotein lpqu" /note="Mb1050, lpqU, len: 243 aa. Equivalent to Rv1022, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 243 aa overlap). Probable lpqU conserved lipoprotein. Similar to Mycobacterium tuberculosis hypothetical protein Rv1230c|MTV006.02C, FASTA scores: E(): 2.8e-18, (37.9% identity in 240 aa overlap). Similar to AL133423|SC4A7.37 hypothetical protein from Streptomyces coelicolor (421 aa), FASTA scores: opt: 474, E(): 2.7e-21, (42.2% identity in 211 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1050 detected using SWATH mass spectrometry. Mb1050 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023346" /db_xref="InterPro:IPR031304" /db_xref="UniProtKB/TrEMBL:A0A1R3XX58" /protein_id="SIT99649.1" /translation="MSPRRWLRAVAVIGATAMLLASSCTWQLSLFIPDGVPPPPGDPV PPVDTHAGGRPADQLREWAEKRAAALGIPVIALEAYAYAARVAEVENPKCHLAWTTLA GIGRVESHHGTYRGATIAPNGDVSPPIRGVRLDGTGGTLRIVDRDGGGLDGDAAVERA MGPMQFISETWRLYGVAARNDGIANVDNIDDAALSAAGYLCWRGKDLATPRGWITALR AYNNSVIYARAVRDWATAYAAGHPL" CDS 1145039..1146328 /codon_start=1 /transl_table=11 /gene="eno" /locus_tag="BQ2027_MB1051" /product="PROBABLE ENOLASE ENO" /note="Mb1051, eno, len: 429 aa. Equivalent to Rv1023, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 429 aa overlap). Probable eno, enolase (EC 4.2.1.11), highly similar to others e.g. ENO_ECOLI|P08324 enolase from Escherichia coli (431 aa), FASTA scores: opt: 1487, E(): 0, (55.5% identity in 422 aa overlap); etc. MAGNESIUM IS REQUIRED FOR CATALYSIS AND FOR STABILIZING THE DIMER. BELONGS TO THE ENOLASE FAMILY. Protein product from Mb1051 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1051 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U0U6" /db_xref="InterPro:IPR000941" /db_xref="InterPro:IPR020809" /db_xref="InterPro:IPR020810" /db_xref="InterPro:IPR020811" /db_xref="InterPro:IPR029017" /db_xref="InterPro:IPR036849" /db_xref="UniProtKB/Swiss-Prot:Q7U0U6" /protein_id="SIT99650.1" /translation="MPIIEQVGAREILDSRGNPTVEVEVALIDGTFARAAVPSGASTG EHEAVELRDGGDRYGGKGVQKAVQAVLDEIGPAVIGLNADDQRLVDQALVDLDGTPDK SRLGGNAILGVSLAVAKAAADSAELPLFRYVGGPNAHILPVPMMNILNGGAHADTAVD IQEFMVAPIGAPSFVEALRWGAEVYHALKSVLKKEGLSTGLGDEGGFAPDVAGTTAAL DLISRAIESAGLRPGADVALALDAAATEFFTDGTGYVFEGTTRTADQMTEFYAGLLGA YPLVSIEDPLSEDDWDGWAALTASIGDRVQIVGDDIFVTNPERLEEGIERGVANALLV KVNQIGTLTETLDAVTLAHHGGYRTMISHRSGETEDTMIADLAVAIGSGQIKTGAPAR SERVAKYNQLLRIEEALGDAARYAGDLAFPRFACETK" CDS 1146333..1147019 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1052" /product="possible conserved membrane protein" /note="Mb1052, -, len: 228 aa. Equivalent to Rv1024, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 228 aa overlap). Hypothetical unknown protein, hydrophobic region from aa 83-101. Protein product from Mb1052 detected using SWATH mass spectrometry. Mb1052 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007060" /db_xref="UniProtKB/TrEMBL:A0A1R3XX51" /protein_id="SIT99651.1" /translation="MPEAKRPESKRRSPASRPGKAGDSVRGGRATKPSAKPSTPAPHA SRKTTRTPHEHIVEPIKRAITESVEKRSEQRLGFTARRAAILAAVVCVLTLTIARPVR TYFAQRAEMEQLAATEAMLRRQIADLEEQQVKLADPAYIAAQARERLGFVMPGDIPFQ VQLPSTPLAPPQPGSDAATATNNEPWYTALWHTIADDPHLPPAAPPAPEPGRPGPLPP ASPNPEQPGG" CDS 1147036..1147503 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1053" /product="FIG004853: possible toxin to DivIC" /note="Mb1053, -, len: 155 aa. Equivalent to Rv1025, len: 155 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 155 aa overlap). Conserved hypothetical protein, similar to AE001768|AE001768_4 hypothetical protein from Thermotoga maritima (170 aa), FASTA scores: opt: 254, E(): 9.5e-10, (35.7% identity in 143 aa overlap). Protein product from Mb1053 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1053 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007511" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ94" /protein_id="SIT99652.1" /translation="MVTRQLGRAPRGVLAIAYRCPNGEPGVVKTAPRLPDGTPFPTLY YLTHPVLTAAASRLETTGLMREMNRRLGQDAELAAAYRRAHESYLSERDALEPLGTTV SAGGMPDRVKCLHVLIAHSLAKGPGLNPFGDEALALLAAEPRTAATLVAGQWR" CDS 1147494..1148453 /codon_start=1 /transl_table=11 /gene="ppx2" /locus_tag="BQ2027_MB1054" /product="Exopolyphosphatase (EC" /EC_number="3.6.1.11" /note="Mb1054, -, len: 319 aa. Equivalent to Rv1026, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 319 aa overlap). Conserved hypothetical protein. Equivalent to AL023514|MLCB4.02 hypothetical protein from Mycobacterium leprae (317 aa) (77.9% identity in 321 aa overlap). Similar to GPPA_ECOLI|P25552 guanosine-5'-triphosphate, 3'-diphosphate pyrophoshatase from Escherichia coli (494 aa), FASTA scores: opt: 281, E(): 3.2e-11, (30.6% identity in 291 aa overlap). Protein product from Mb1054 detected using SWATH mass spectrometry. Mb1054 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003695" /db_xref="UniProtKB/TrEMBL:A0A1R3XY40" /protein_id="SIT99653.1" /translation="MALTRVAAIDCGTNSIRLLIADVGAGLARGELHDVHRETRIVRL GQGVDATGRFAPEAIARTRTALTDYAELLTFHHAERVRMVATSAARDVVNRDVFFAMT ADVLGAALPGSAAEVITGAEEAELSFRGAVGELGSAGAPFVVVDLGGGSTEIVLGEHE VVASYSADIGCVRLTERCLHSDPPTLQEVSTARRLVRERLEPALRTVPLELARTWVGL AGTMTTLSALAQSMTAYDAAAIHLSRVPGADLLEVCQRLIGMTRKQRAALAPMHPGRA DVIGGGAIVVEELARELRERAGIDQLTVSEHDILDGIALSLAG" CDS complement(1148902..1149582) /codon_start=1 /transl_table=11 /gene="kdpE" /locus_tag="BQ2027_MB1055C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN KDPE" /note="Mb1055c, kdpE, len: 226 aa. Equivalent to Rv1027c, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 226 aa overlap). Probable KdpE, transcriptional regulatory protein, similar to others e.g. KDPE_ECOLI|P21866 kdp operon transcriptional regulatory protein from Escherichia coli strain K12 (225 aa), FASTA scores: opt: 691, E(): 0, (47.8% identity in 224 aa overlap); AL021530|SC2E9.13 from Streptomyces coelicolor (227 aa), FASTA scores: opt: 981, E(): 0, (66.4% identity in 226 aa overlap); etc. Protein product from Mb1055c detected using SWATH mass spectrometry. Mb1055c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX67" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3XX67" /protein_id="SIT99654.1" /translation="MTLVLVIDDEPQILRALRINLTVRGYQVITASTGAGALRAAAEH PPDVVILDLGLPDMSGIDVLGGLRGWLTAPVIVLSARTDSSDKVQALDAGADDYVTKP FGMDEFLARLRAAVRRNTAAAELEQPVIETDSFTVDLAGKKVIKDGAEVHLTPTEWGM LEMLARNRGKLVGRGELLKEVWGPAYATETHYLRVYLAQLRRKLEDDPSHPKHLLTES GMGYRFEA" CDS complement(1149579..1152161) /codon_start=1 /transl_table=11 /gene="kdpD" /locus_tag="BQ2027_MB1056C" /product="PROBABLE SENSOR PROTEIN KDPD" /note="Mb1056c, kdpD, len: 860 aa. Equivalent to Rv1028c, len: 860 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 860 aa overlap). Probable kdpD, sensor protein (EC 2.7.3.-), similar to others e.g. KDPD_ECOLI|P21865 sensor protein from Escherichia coli strain K12 (894 aa), FASTA scores: opt: 1041, E(): 0, (32.3% identity in 888 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb1056c detected using SWATH mass spectrometry. Mb1056c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXH9" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR003852" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR006016" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR025201" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="InterPro:IPR038318" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH9" /protein_id="SIT99655.1" /translation="MTLLFADLCAIFTPYRWMIEHVTTKRGQLRIYLGAAPGVGKTYA MLGEAHRRLERGTDVVAAVVETHGRNKTAKLLEGIEMIPPRYVEYRGARFPELDVEAV LRRHPQVVLVDELAHTNTPGSKNPKRWQDVQEILDAGITVISTVNIQHLEGLNDVVEQ ITGIEQKEKIPDEIVRAADQVELVDITPEALRRRLAHGNVYAAERVDAALSNYFRTGN LTALREIALLWLADQVDAALEKYRADKKITATWEARERVVVAVTGGPESETLVRRASR IASKSSAELMVVHVIRGDDLAGVSAPQLGRVRELATSLGATMHTVVGDDVPTALLDFA REMNATQLVVGTSRRSRWARLFDEGIGARTVQESGGIDVHMVTHPAASRASGWSRVSP RERHIASWLAALVVPSVICAITVAWLDRFMGIGGESALFFIGVLIVALLGGVAPAALS ALLSGMLLNYFLTEPRYTWTIAEPDAAVTEFVLLAMAVAVAVLVDGAASRTREARRAS QEAELLALFAGSVLRGADLATLLQRVRETYSQRAVTMLRVRQGASTGETVACVGTNPC RDVDSADTAIEVGDDEFWMLMAGRKLAARDRRVLTAVATQAAGLVKQRELAEEAGQAE AIARADELRRSLLSAVSHDLRTPLAAAKVAVSSLRTEDVAFSPEDTAELLATIEESID QLTALVANLLDSSRLAAGVIRPQLRRAYLEEAVQRALVSIGKGATGFYRSGIDRVKVD VGDAVAMADAGLLERVLANLIDNALRYAPDCVVRVNAGRVRERVLINVIDEGPGVPRG TEEQLFAPFQRPGDHDNTTGVGLGMSVARGFVEAMGGTISATDTPGGGLTVVIDLAAP EDRP" CDS 1152395..1152487 /codon_start=1 /transl_table=11 /gene="kdpF" /locus_tag="BQ2027_MB1057" /product="Probable membrane protein kdpF" /note="Mb1057, kdpF, len: 30 aa. Equivalent to Rv1028A, len: 30 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 30 aa overlap). Probable kdpF, membrane protein, showing similarity with P36937|KDPF_ECOLI|B0698.1 PROTEIN KDPF from Escherichia coli strain K12 (see citation below) (27% identity); and KdpF protein from Streptomyces coelicolor (51% identity)." /db_xref="GOA:A0A1R3XX55" /db_xref="InterPro:IPR011726" /db_xref="UniProtKB/TrEMBL:A0A1R3XX55" /protein_id="SIT99656.1" /translation="MTTVDNIVGLVIAVALMAFLFAALLFPEKF" CDS 1152487..1154202 /codon_start=1 /transl_table=11 /gene="kdpA" /locus_tag="BQ2027_MB1058" /product="Probable Potassium-transporting ATPase A chain KDPA (Potassium-translocating ATPase A chain) (ATP phosphohydrolase [potassium-transporting] A chain) (Potassium binding and translocating subunit A)" /note="Mb1058, kdpA, len: 571 aa. Equivalent to Rv1029, len: 571 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 571 aa overlap). Probable kdpA, potassium-transporting ATPase A chain (transmembrane protein) (EC 3.6.3.12), similar to others e.g. ATKA_ECOLI|P03959|KDPA|B0698 potassium-transporting ATPase A chain from Escherichia coli strain K12 (557 aa), FASTA scores: opt: 1763, E(): 0, (50.4% identity in 569 aa overlap); etc. BELONGS TO THE KDPA FAMILY." /db_xref="GOA:P65210" /db_xref="InterPro:IPR004623" /db_xref="UniProtKB/Swiss-Prot:P65210" /protein_id="SIT99657.1" /translation="MSGTSWLQFAALIAVLLLTAPALGGYLAKIYGDEAKKPGDRVFG PIERVIYQVCRVDPGSEQRWSTYALSVLAFSVMSFLLLYGIARFQGVLPFNPTDKPAV TDHVAFNAAVSFMTNTNWQSYSGEATMSHFTQMTGLAVQNFVSASAGMCVLAALIRGL ARKRASTLGNFWVDLARTVLRIMFPLSFVVAILLVSQGVIQNLHGFIVANTLEGAPQL IPGGPVASQVAIKQLGTNGGGFFNVNSAHPFENYTPIGNFVENWAILIIPFALCFAFG KMVHDRRQGWAVLAIMGIIWIGMSVAAMSFEAKGNPRLDALGVTQQTTVDQSGGNLEG KEVRFGVGASGLWAASTTGTSNGSVNSMHDSYTPLGGMVPLAHMMLGEVSPGGTGVGL NGLLVMAILAVFIAGLMVGRTPEYLGKKIQATEMKLVTLYILAMPIALLSFAAASVLI SSALASRNNPGPHGLSEILYAYTSGANNNGSAFAGLTASTWSYDTTIGVAMLIGRFFL IIPVLAIAGSLARKGTTPVTAATFPTHKPLFVGLVIGVVLIVGGLTFFPALALGPIVE QLSTQ" CDS 1154199..1156328 /codon_start=1 /transl_table=11 /gene="kdpB" /locus_tag="BQ2027_MB1059" /product="Probable Potassium-transporting P-type ATPase B chain KDPB (Potassium-translocating ATPase B chain) (ATP phosphohydrolase [potassium-transporting] B chain) (Potassium binding and translocating subunit B)" /note="Mb1059, kdpB, len: 709 aa. Equivalent to Rv1030, len: 709 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 709 aa overlap). Probable kdpB, potassium-transporting P-type ATPase B chain (transmembrane protein) (EC 3.6.3.12), similar to others e.g. ATKB_ECOLI|P03960 potassium-transporting ATPase B chain from Escherichia coli strain K12 (682 aa), FASTA scores: opt: 1481, E(): 0, (63.4% identity in 686 aa overlap); etc. Very similar to AL078610|SCH35.47 H+/K+-exchanging ATPase (EC 3.6.1.36) chain B from Streptomyces coelicolor (707 aa), FASTA scores: opt: 2731, E(): 0, (71.6% identity in 676 aa overlap). Contains PS00154 E1-E2 ATPases phosphorylation site. Protein product from Mb1059 detected using shotgun mass spectrometry." /db_xref="GOA:P63682" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR006391" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P63682" /protein_id="SIT99658.1" /translation="MMIARMETSATAAAATSAPRLRLAKRSLFDPMIVRSALPQSLRK LAPRVQARNPVMLVVLVGAVITTLAFLRDLASSTAQENVFNGLVAAFLWFTVLFANFA EAMAEGRGKAQAAALRKVRSETMANRRTAAGNIESVPSSRLDLDDVVEVSAGETIPSD GEIIEGIASVDESAITGESAPVIRESGGDRSAVTGGTVVLSDRIVVRITAKQGQTFID RMIALVEGAARQQTPNEIALNILLAGLTIIFLLAVVTLQPFAIYSGGGQRVVVLVALL VCLIPTTIGALLSAIGIAGMDRLVQHNVLATSGRAVEAAGDVNTLLLDKTGTITLGNR QATEFVPINGVSAEAVADAAQLSSLADETPEGRSIVVLAKDEFGLRARDEGVMSHARF VPFTAETRMSGVDLAEVSGIRRIRKGAAAAVMKWVRDHGGHPTEEVGAIVDGISSGGG TPLVVAEWTDNSSARAIGVVHLKDIVKVGIRERFDEMRRMSIRTVMITGDNPATAKAI AQEAGVDDFLAEATPEDKLALIKREQQGGRLVAMTGDGTNDAPALAQADVGVAMNTGT QAAREAGNMVDLDSDPTKLIEVVEIGKQLLITRGALTTFSIANDVAKYFAIIPAMFVG LYPVLDKLNVMALHSPRSAILSAVIFNALVIVALIPLALRGVRFRAESASAMLRRNLL IYGLGGLVVPFIGIKLVDLVIVALGVS" CDS 1156328..1156897 /codon_start=1 /transl_table=11 /gene="kdpC" /locus_tag="BQ2027_MB1060" /product="Probable Potassium-transporting ATPase C chain KDPC (Potassium-translocating ATPase C chain) (ATP phosphohydrolase [potassium-transporting] C chain) (Potassium binding and translocating subunit C)" /note="Mb1060, kdpC, len: 189 aa. Equivalent to Rv1031, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 189 aa overlap). Probable kdpC, potassium-transporting ATPase C chain (membrane protein) (EC 3.6.3.12), similar to others e.g. ATKC_ECOLI|P03961 potassium-transporting ATPase C chain from Escherichia coli strain K12 (190 aa), FASTA scores: opt: 475, E(): 3.1e-24, (45.7% identity in 186 aa overlap); etc. BELONGS TO THE KDPC FAMILY. Mb1060 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65212" /db_xref="InterPro:IPR003820" /db_xref="UniProtKB/Swiss-Prot:P65212" /protein_id="SIT99659.1" /translation="MRRQLLPALTMLLVFTVITGIVYPLAVTGVGQLFFGDQANGALL ERDGQVIGSAHIGQQFTAAKYFHPRPSSAGDGYDAAASSGSNLGPTNEKLLAAVAERV TAYRKENNLPADTLVPVDAVTGSGSGLDPAISVVNAKLQAPRVAQARNISIRQVERLI EDHTDARGLGFLGERAVNVLRLNLALDRL" CDS complement(1156901..1158430) /codon_start=1 /transl_table=11 /gene="trcS" /locus_tag="BQ2027_MB1061C" /product="TWO COMPONENT SENSOR HISTIDINE KINASE TRCS" /note="Mb1061c, trcS, len: 509 aa. Equivalent to Rv1032c, len: 509 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 509 aa overlap). trcS, two component sensor histidine kinase protein (EC 2.7.3.-) (see citations below), similar to YV16_MYCLE|P54883 probable sensor-like histidine kinase from Mycobacterium leprae (443 aa), FASTA scores: opt: 392, E(): 3.8e-18, (31.7% identity in 334 aa overlap). Note that in vitro autophosphorylation of TrcS requires the presence of Mn2+or Ca2+as a divalent cation cofactor and subsequent transphosphorylation of TrcR is evident in the presence of TrcS-phosphate and Ca2+. Protein product from Mb1061c detected using shotgun mass spectrometry. Mb1061c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX59" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XX59" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99660.1" /translation="MIPDRNTRSRKAPCWRPRSLRQQLLLGVLAVVTVVLVAVGVVSV LSLSGYVTAMNDAELVESLHALNHSYTRYRDSAQTSTPTGNLPMSQAVLEFTGQTPGN LIAVLHDGVVIGSAVFSEDGARPAPPDVIRAIEAQVWDGGPPRVESLGSLGAYQVDSS AAGADRLFVGVSLSLANQIIARKKVTTVALVGAALVVTAALTVWVVGYALRPLRRVAA TAAEVATMPLTDDDHQISVRVRPGDTDPDNEVGIVGHTLNRLLDNVDGALAHRVDSDL RMRQFITDASHELRTPLAAIQGYAELTRQDSSDLPPTTEYALARIESEARRMTLLVDE LLLLSRLSEGEDLETEDLDLTDLVINAVNDAAVAAPTHRWVKNLPDEPVWVNGDHARL HQLVSNLLTNAWVHTQPGVTVTIGITCHRTGPNAPCVELSVTDDGPDIDPEILPHLFD RFVRASKSRSNGSGHGLGLAIVSSIVKAHRGSVTAESGNGQTVFRVRLPMIEQQIATT A" CDS complement(1158438..1159211) /codon_start=1 /transl_table=11 /gene="trcR" /locus_tag="BQ2027_MB1062C" /product="TWO COMPONENT TRANSCRIPTIONAL REGULATOR TRCR" /note="Mb1062c, trcR, len: 257 aa. Equivalent to Rv1033c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 257 aa overlap). trcR, two-component regulatory protein (see citations below), similar to Q50825 TWO COMPONENT RESPONSE REGULATOR from Mycobacterium tuberculosis (234 aa), FASTA scores: opt: 628, E(): 0, (46.0% identity in 226 aa overlap). Note that in vitro autophosphorylation of TrcS requires the presence of Mn2+or Ca2+as a divalent cation cofactor and subsequent transphosphorylation of TrcR is evident in the presence of TrcS-phosphate and Ca2+. Protein product from Mb1062c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1062c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX60" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3XX60" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99661.1" /translation="MTTMSGYTRSQRPRQAILGQLPRIHRADGSPIRVLLVDDEPALT NLVKMALHYEGWDVEVAHDGQEAIAKFDKVGPDVLVLDIMLPDVDGLEILRRVRESDV YTPTLFLTARDSVMDRVTGLTSGADDYMTKPFSLEELVARLRGLLRRSSHLERPADEA LRVGDLTLDGASREVTRDGTPISLSSTEFELLRFLMRNPRRALSRTEILDRVWNYDFA GRTSIVDLYISYLRKKIDSDREPMIHTVRGIGYMLRPPE" CDS complement(1159393..1159782) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1063C" /product="PROBABLE TRANSPOSASE (FRAGMENT)" /note="Mb1063c, -, len: 129 aa. Equivalent to Rv1034c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 129 aa overlap). Probable IS1560 transposase fragment, similar to part of Rv3387|E1202305|MTV004.45 (225 aa) (65.1% identity in 129 aa overlap)." /db_xref="GOA:A0A1R3XZA0" /db_xref="InterPro:IPR002559" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA0" /protein_id="SIT99662.1" /translation="MQQGNPPDAPQLAPAVAWVKKRAGRTPRTVTADRGYGEAAVDQQ LTEVGVKNVLIPRKGKPSQDRRAEEHRKAFRRTIKWRTGCEGRISHLKRGYGWDRGRI GGLEGTRTWVGHGVFAHNLVTISALPA" mobile_element complement(1159396..1160908) /mobile_element_type="insertion sequence:IS1560" /locus_tag="BQ2027_IS1560'-1" /note="IS1560'-1, len: 1513 nt. Equivalent to IS1560, len: 1513 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1513 nt overlap)." gene complement(1159396..1160908) /locus_tag="BQ2027_IS1560'-1" CDS complement(1159850..1160536) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1064C" /product="PROBABLE TRANSPOSASE (FRAGMENT)" /note="Mb1064c, -, len: 228 aa. Equivalent to Rv1035c, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 228 aa overlap). Probable IS1560 transposase fragment, similar to parts of Rv3387|E1202305|MTV004.45 (225 aa) (47.8% identity in 67 aa overlap) and Rv3386|E1202304|MTV004.44 (234 aa) (55.1% identity in 127 aa overlap). Protein product from Mb1064c detected using SWATH mass spectrometry. Mb1064c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XY47" /protein_id="SIT99663.1" /translation="MPHPTTLMKLTTRCGSAAIDGLNEALLAKAAEAKLLGTNRIRAD TTVARANVSYPTDLGLLAKAMRRIAATGKRIQAAGGAVRTRVGDRSRAAGRRAHAVAA KLRSRAELGRDEARAAVLRFTGELAELAQAAAQEAQQLLDNAKQAVLRAKAKAAALAA RGERDAVAGRRCGGLVRAVNDLTELLNATRQIVAQTRQRVAGITSDGASRRVSLHDGD ARPDHQGSAR" CDS complement(1160570..1160908) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1065C" /product="PROBABLE IS1560 TRANSPOSASE (FRAGMENT)" /note="Mb1065c, -, len: 112 aa. Equivalent to Rv1036c, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 112 aa overlap). Probable IS1560 transposase fragment, similar to part of Rv3386|E1202304|MTV004.44 (234 aa) (82.8% identity in 87 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX81" /protein_id="SIT99664.1" /translation="MIPGRMVLNWEDGLNALVAEGIEAIVFRTLGDQCWLWESLLPDE VRRLPEELARVDALLDDPAFFAPFVPFFDPRRGRPSTPMEVYLQLMFVKFRYRLGYES LCREVADSIT" CDS complement(1161019..1161303) /codon_start=1 /transl_table=11 /gene="esxI" /locus_tag="BQ2027_MB1066C" /standard_name="ES6_1; Mtb9.9D" /product="putative esat-6 like protein esxi (esat-6 like protein 1)" /note="Mb1066c, esxI, len: 94 aa. Equivalent to Rv1037c, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 94 aa overlap). esxI, conserved hypothetical protein, member of ESAT-6 family, highly similar to many Mycobacterial hypothetical proteins e.g. Q49946|ES6X_MYCLE|U1756D PUTATIVE ESAT-6 LIKE PROTEIN X from Mycobacterium leprae (95 aa), FASTA scores: opt: 409, E(): 6.3e-23, (64.15% identity in 92 aa overlap); Rv3619c, Rv1198, Rv2346c, etc from Mycobacterium tuberculosis. Strictly identical to P96364|ES61_MYCTU|Rv3619c|MTCY15C10.33|MTCY07H7B.03|MT3721 PUTATIVE ESAT-6 LIKE PROTEIN 1 (94 aa). BELONGS TO THE ESAT6 FAMILY. Mb1066c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59802" /db_xref="InterPro:IPR009416" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P59802" /protein_id="SIT99665.1" /translation="MTINYQFGDVDAHGAMIRALAGSLEAEHQAIISDVLTASDFWGG AGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" CDS complement(1161330..1161626) /codon_start=1 /transl_table=11 /gene="esxJ" /locus_tag="BQ2027_MB1067C" /standard_name="ES6_2,TB11.0,QILSS" /product="esat-6 like protein esxj (esat-6 like protein 2)" /note="Mb1067c, esxJ, len: 98 aa. Equivalent to Rv1038c, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 98 aa overlap). esxJ, putative ESAT-6 like protein 2, similar to Q49945|U1756C, Mycobacterium leprae (100 aa), FASTA scores: opt: 375, E(): 7.7e-21, (58.3% identity in 96 aa overlap), almost identical to Rv1197, Rv1792, Rv2347c and Rv3620c. BELONGS TO THE ESAT6 FAMILY. Protein product from Mb1067c detected using shotgun mass spectrometry. Mb1067c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0DOB0" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P0DOB0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99666.1" /translation="MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" CDS complement(1161772..1162947) /codon_start=1 /transl_table=11 /gene="PPE15" /locus_tag="BQ2027_MB1068C" /product="ppe family protein ppe15" /note="Mb1068c, PPE15, len: 391 aa. Equivalent to Rv1039c, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 391 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv2768c|AL008967|MTV002_33 Mycobacterium tuberculosis H37Rv (394 aa), FASTA scores: opt: 1721, E(): 0, (70.4% identity in 398 aa overlap)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XX65" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99667.1" /translation="MDFGALPPEINSARMYAGAGAGPMMAAGAAWNGLAAELGTTAAS YESVITRLTTESWMGPASMAMVAAAQPYLAWLTYTAEAAAHAGSQAMASAAAYEAAYA MTVPPEVVAANRALLAALVATNVLGINTPAIMATEALYAEMWAQDALAMYGYAAASGA AGMLQPLSPPSQTTNPGGLAAQSAAVGSAAATAAVNQVSVADLISSLPNAVSGLASPV TSVLDSTGLSGIIADIDALLATPFVANIINSAVNTAAWYVNAAIPTAIFLANALNSGA PVAIAEGAIEAAEGAASAAAAGLADSVTPAGLGASLGEATLVGRLSVPAAWSTAAPAT TAGATALEGSGWTVAAEEAGPVTGMMPGMASAAKGTGAYAGPRYGFKPTVMPKQVVV" CDS complement(1163024..1163851) /codon_start=1 /transl_table=11 /gene="PE8" /locus_tag="BQ2027_MB1069C" /product="pe family protein pe8" /note="Mb1069c, PE8, len: 275 aa. Equivalent to Rv1040c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 275 aa overlap). Member of the Mycobacterium tuberculosis PE family, most similar to AL008967|MTV002_34 Mycobacterium tuberculosis H37Rv (275 aa), FASTA scores: opt: 1111, E(): 0, (68.6% identity in 283 aa overlap)." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR022171" /db_xref="UniProtKB/TrEMBL:A0A1R3XX54" /protein_id="SIT99668.1" /translation="MSFLKTVPEELTAAAAQLGTIGAAMAAQNAAAAAPTTAIAPAAL DEVSALQAALFTAYGTFYQQVSAEAQAMHDMFVNTLGISAGTYGVTESLNSSAAASPL SGITGEASAIIQATTGLFPPELSGGIGNILNIGAGNWASATSTLIGLAGGGLLPAEEA AEAASALGGEAALGELGALGAAEAALGEAGIAAGLGSASAIGMLSVPPAWAGQATLVS TTSTLPGAGWTAAAPQAAAGTFIPGMPGVASAARNSAGFGAPRYGVKPIVMPKPATV" mobile_element complement(1165047..1166025) /mobile_element_type="insertion sequence:IS-LIKE" /locus_tag="BQ2027_IS-LIKE-1" /note="IS-LIKE-1, len: 979 nt. Equivalent to IS-LIKE, len: 978 nt, from Mycobacterium tuberculosis strain H37Rv,(99.8% identity in 979 nt overlap). Insertion sequence,ISlike2, region identical to cos mid y348, blast score: 4902 (+1) 9377 10354 EM_NEW:MTAD20 A d000020 M. tuberculosis sequence from clone y348." CDS complement(1165047..1165529) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1070C" /product="probable is like-2 transposase" /note="Mb1070c, -, len: 160 aa. Equivalent to Rv1041c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv. Probable IS like-2 transposase. Similar to Q00430|X53945 insertion element IS869 hypothetical protein from Agrobacterium tumefaciens (186 aa), Similar to Rv1150, C-terminal part of transposase of putative Mycobacterium tuberculosis IS like-1. Mb1070c and Mb1071c are frameshifted with respect to Mycobacterium tuberculosis Q50761 transposase, the 10G2 cosmid sequence appears to be correct. Mb1070c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX79" /db_xref="InterPro:IPR002559" /db_xref="UniProtKB/TrEMBL:A0A1R3XX79" /protein_id="SIT99669.1" /translation="MTTKIHALTDQREAPVRIRLTAGQAGDNPQLLPLLDDYRHASTE YALGSTDFRLLADKAYSHPSTRAALRSKKIKHTIPERQDQIDRRKAKGSAGGRPPAFD AALYGLRNTVERGFHRLKQWRGIATRYDKYALTYLGGVLLACAVIHARVGTPKLGDTP " gene complement(1165047..1166025) /locus_tag="BQ2027_IS-LIKE-1" CDS complement(1165567..1165974) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1071C" /product="probable is like-2 transposase" /note="Mb1071c, -, len: 135 aa. Equivalent to Rv1042c, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 135 aa overlap). Probable IS like-2 transposase, similar to Q50761 TRANSPOSASE from Mycobacterium tuberculosis (308 aa), FASTA scores: opt: 823, E(): 0, (99.1% identity in 117 aa overlap). Second copy is Rv1149. Mb1071c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025161" /db_xref="UniProtKB/TrEMBL:A0A1R3XX71" /protein_id="SIT99670.1" /translation="MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRF RTGSPWRDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKLLS VDSTNVRAHQHSAGACSDTLATGGTVELQEIRR" CDS complement(1166257..1167282) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1072C" /product="Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain" /note="Mb1072c, -, len: 341 aa. Equivalent to Rv1043c, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 341 aa overlap). Conserved hypothetical protein similar to AL096872|SC5F7.08 PUTATIVE LIPOATE-PROTEIN LIGASE from Streptomyces coelicolor (362 aa), FASTA scores: opt: 206, E(): 1.4e-05, (30.3% identity in 201 aa overlap). Weak similarity to P39668|YYXA_BACSU HYPOTHETICAL PROTEASE from Bacillus subtitis (400 aa), FASTA scores: opt: 159, E(): 0.013, (27.1% identity in 210 aa overlap). Protein product from Mb1072c detected using SWATH mass spectrometry. Mb1072c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009003" /db_xref="UniProtKB/TrEMBL:A0A1R3XX73" /protein_id="SIT99671.1" /translation="MCAHQFFGLVHNPVVAAAIGKPEPPPVDSDIGLPTTVPFEPWSV ADFSRYLSTLGLPAAGDAVTLHRILSSMERAGLLLPLGWDPRLPVMGQKYISQGTISK GQRGGNLWLSEVFGAELIIPSYNAVTVQLAGHDDAGNPVDSWGTGLVVDHNHVITNKH VVTGLAGTSAGLSVYPSSNHAEAELVNFSGTAHPHPTLDVAVIKFEMPEGKYIPRLGG MAFRDPDWADEVYVFGYPRVPMTAEMAITVQRGEVVNPAATTIPCRQKIFLYSAIARP GNSGGPIVAQDGRVIGLVVEDSAEAPSTGTGPNAAPFYRGIPSSEVIRALDELDFGGI VEMDTLP" CDS 1167529..1168152 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1073" /product="Predicted transcriptional regulator" /note="Mb1073, -, len: 207 aa. Equivalent to Rv1044, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 207 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein MTCY06G11.02C|P96837 (289 aa), fasta scores: E(): 8.9e-06, (30.7% identity in 150 aa overlap). Some similarity to U36837|LLU36837_1 Lactococcus lactis plasmid pNP40 (287 aa), FASTA scores: opt: 147, E (): 0.0087, (29.7% identity in 91 aa overlap). Protein product from Mb1073 detected using SWATH mass spectrometry. Mb1073 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025159" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99672.1" /translation="MCAKPYLIDTIAHMAIWDRLVEVAAEQHGYVTTRDARDIGVDPV QLRLLAGRGRLERVGRGVYRVPVLPRGEHDDLAAAVSWTLGRGVISHESALALHALAD VNPSRIHLTVPRNNHPRAAGGELYRVHRRDLQAAHVTSVDGIPVTTVARTIKDCVKTG TDPYQLRAAIERAEAEGTLRRGSAAELRAALDETTAGLRARPKRASA" CDS 1168149..1169030 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1074" /product="HYPOTHETICAL PROTEIN" /note="Mb1074, -, len: 293 aa. Equivalent to Rv1045, len: 293 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 293 aa overlap). Hypothetical unknown protein. Protein product from Mb1074 detected using SWATH mass spectrometry. Mb1074 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014942" /db_xref="UniProtKB/TrEMBL:A0A1R3XY56" /protein_id="SIT99673.1" /translation="MTKPYSSPPTNLRSLRDRLTQVAERQGVVFGRLQRHVAMIVVAQ FAATLTDDTGAPLLLVKGGSSLELRRGIPDSRTSKDFDTVARRDIELIHEQLADAGET GGEGFTAIFTAPEEIDVPGMPVKPRRFTAKLSYRGRAFATVPIEVSSVEAGNADQFDT LTSDALGLVGVPAAVAVPCMTIPWQIAQKLHAVTAVLEEPKVNDRAHDLVDLQLLEGL LLDADLMPTRSACIAIFEARAQHPWPPRVATLPHWPLIYAGALEGLDHLELARTVDAA AQAVQRFVARIDRATKR" CDS complement(1169112..1169705) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1075C" /product="HYPOTHETICAL PROTEIN" /note="Mb1075c, -, len: 197 aa. Equivalent to Rv1046c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 174 aa overlap). Hypothetical unknown protein. Start changed since first submission (-65 aa). Mb1075c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX92" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99674.1" /translation="MKVQARVGWNRRQLSAVGGRGQQLFANAPGHIPSTSHRRGTGDI NRKIDESLAGAARPQANANYGATSDPPLTHQPKPGSPTQVGPRSPSPPGLRGLVKQLP EVHQSSLHLDTVASLPSSRPSPHHTPLALRSRSGHFSPDEIRNRRSRKRSQSHMPPRT PPRGRCLRAPESARLGRRSAAHRHSIARNARAIPFVV" mobile_element 1169747..1171181 /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081-1" /note="IS1081-1, len: 1435 nt. Equivalent to IS1081, len: 1450 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1435 nt overlap). Almost identical to Mycobacterium bovis IS1081 (7157 (-1) 60 14 94 EM_BA:MBBIS1081 X84741 Mycobacterium bovis BCG IS1081 DNA 4/96." gene 1169775..1171209 /locus_tag="BQ2027_IS1081-1" repeat_region 1169848..1169862 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS 1169900..1171147 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1076" /product="probable transposase" /note="Mb1076, -, len: 415 aa. Equivalent to Rv1047, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 415 aa overlap). IS1081 transposase, most similar to TRA1_MYCBO|P35882 transposase for insertion sequence element (415 aa), FASTA scores: opt: 2675, E(): 0, (99.8% identity in 415 aa overlap). Contains PS01007 Transposases, Mutator family, signature" /db_xref="GOA:P60231" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/Swiss-Prot:P60231" /protein_id="SIT99675.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" repeat_region complement(1171157..1171171) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS complement(1171515..1172630) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1077C" /product="HYPOTHETICAL PROTEIN" /note="Mb1077c, -, len: 371 aa. Equivalent to Rv1048c, len: 371 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 371 aa overlap). Hypothetical unknown protein. Mb1077c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX74" /protein_id="SIT99676.1" /translation="MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSAL EGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAP TMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRA TLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLI VDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSA SLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQ NLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK" CDS 1172863..1173309 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1078" /product="PROBABLE TRANSCRIPTIONAL REPRESSOR PROTEIN" /note="Mb1078, -, len: 148 aa. Equivalent to Rv1049, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 148 aa overlap). Probable transcriptional repressor protein, similar to many e.g. P74870 NEGATIVE REGULATOR OF EMR LOCUS EMR from Salmonella typhimurium (149 aa), FASTA scores: opt: 146, E(): 0.0011, (31.6% identity in 95 aa overlap). Contains probable helix-turn -helix motif at aa 58-79 (Score 1495, +4.28 SD). Mb1078 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX75" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XX75" /protein_id="SIT99677.1" /translation="MGKGAAFDECACYTTRRAARQLGQAYDRALRPSGLTNTQFSTLA VISLSEGSAGIDLTMSELAARIGVERTTLTRNLEVMRRDGLVRVMAGADARCKRIELT AKGRAALQKAVPLWRGVQAEVTASVGDWPRVRRDIANLGQAAEACR" CDS 1173357..1174262 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1079" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1079, -, len: 301 aa. Equivalent to Rv1050, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 301 aa overlap). Probable oxidoreductase (EC 1.-.-.-) similar to many e.g. Rv1543|MTCY48.22C|Q10783 PUTATIVE OXIDOREDUCTASE CY48.22C (341 aa), FASTA scores: opt: 462, E(): 3e-22, (33.6% identity in 265 aa overlap). Protein product from Mb1079 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XX66" /protein_id="SIT99678.1" /translation="MARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGA LRRVAREIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVGPVD AETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGRKAFARFAGYSS AMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANVDPADMPPPFRSLTPIPVHW VAAAVLDGVARRRARVVVPFQPRLLMVGDAFSPRYGDRVVRLLESKIFGRLIGSYRGS VYRHQPTESAKAQAAQPERGYSSAR" CDS complement(1174421..1175176) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1080C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1080c, -, len: 251 aa. Equivalent to Rv1051c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 251 aa overlap). Conserved hypothetical protein, similar to LLU36837|U36837.1 protein encoded by Lactococcus lactis plasmid pNP40 (298 aa), FASTA scores: opt: 194, E(): 3.5e-06, (30.3% identity in 155 aa overlap). Contains possible helix-turn-helix motif at aa 197-218 (Score 1097, +2.92 SD)." /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR014942" /db_xref="InterPro:IPR041657" /db_xref="UniProtKB/TrEMBL:A0A1R3XX93" /protein_id="SIT99679.1" /translation="MRADVTAEHLTQVVRDIAVIDIDDGVAFNLDTSSVQEIRERADY PGLRVRVAMSVGPWQGIAAWDVSTGEPIAPWPTRVTIDRILGEPITLLGYAPETIIAE KGVTILERGITSTRWRDYVDIVQLDRRGIDDDELLRSARAVAQYRGATLEPVAPHLAG YGAVAQAKWATEHGRCQHCWRHWKPAHVGRRNMDLLDAKQVSEMIGVPVGTLRHWRHS DIGPASFTLGRRVVYRRDEVSRWISKRESATRR" CDS 1176199..1176588 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1081" /product="HYPOTHETICAL PROTEIN" /note="Mb1081, -, len: 129 aa. Equivalent to Rv1052, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 129 aa overlap). Hypothetical unknown protein. Mb1081 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX76" /protein_id="SIT99680.1" /translation="MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTA VQMPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTA VEGSLSLDELLSDHFVRDLHARMFGPV" CDS complement(1176487..1176762) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1082C" /product="Mobile element protein" /note="Mb1082c, -, len: 91 aa. Equivalent to Rv1053c, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 91 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XX80" /protein_id="SIT99681.1" /translation="MDSHKVCMNNNTQLPTGPIIGVHPAVRDGVERVAYLDGDLLRCN TDVEFTSSPPPGPVLYRTKHTRVEIADEMVTEKLIKRQRAFNSRRHQ" CDS 1177404..1177718 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1083" /product="probable integrase (fragment)" /note="Mb1083, -, len: 104 aa. Equivalent to Rv1054, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Probable integrase (fragment), similar to Rv2309c|MTCY3G12_25|Z79702 hypothetical protein (shows similarity to integrases) from Mycobacterium tuberculosis (151 aa), FASTA scores: opt: 273, E(): 8.8e-13, (64.7% identity in 68 aa overlap); and to L39071|MSGINT_1 integrase from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 105, E(): 0.9, (31.8% identity in 85 aaoverlap). This ORF continues in another frame as Rv1055|MTV017.08 but no error can be found to account for frameshift. Length extended since first submission (+36 aa). Mb1083 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZC2" /db_xref="InterPro:IPR011010" /db_xref="InterPro:IPR014417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC2" /protein_id="SIT99682.1" /translation="MTGKGIVESTTKTKRDRHVPVPEPVWRRLHAELPTDPNALVFPG RKGGFLPLGEYRWAFDNAGDQVGIEGWYRTVWGTPRPRWRSAQALTSRSCNGSLDTQQ RR" CDS 1177715..1177849 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1084" /product="POSSIBLE INTEGRASE (FRAGMENT)" /note="Mb1084, -, len: 44 aa. Equivalent to Rv1055, len: 44 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 44 aa overlap). Possible integrase (fragment); first 49 aa similar to Rv2309c|MTCY3G12_25|Z79702 hypothetical protein (shows similarity to integrases) from Mycobacterium tuberculosis (151 aa), FASTA scores: opt: 291, E(): 2.2e-16, (74.3% identity in 70 aa overlap); and to L39071|MSGINT_1 integrase from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 146, E(): 8.3e-05, (52.1% identity in 48 aa overlap); and to many other integrases or transposases. Shortened since first submission (-34 aa). Mb1084 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XY66" /protein_id="SIT99683.1" /translation="MTLDRHGHLLNDDLAVWPMRCAKSSRTLRYHCGMRRRNRVGLRA " tRNA 1177838..1177911 /locus_tag="BQ2027_LEUX" /product="tRNA-Leu" /note="leuX, len: 74 nt. Equivalent to leuX, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Leu, anticodon taa." CDS 1178104..1178868 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1085" /product="conserved protein" /note="Mb1085, -, len: 254 aa. Equivalent to Rv1056, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 254 aa overlap). Conserved hypothetical protein, some similarity in C-terminal region of Rv0140|MTCI5.14|Z92770 Mycobacterium tuberculosis (126 aa), FASTA scores: opt: 254, E(): 1.2e-10, (43.4% identity in 106 aa overlap); and to Rv1670. C-terminal region is similar to AL035569|SC8D9.02 hypothetical protein from Streptomyces coelicolor (113 aa), FASTA scores: opt: 282, E(): 4.5e-12, (48.0% identity in 100 aa overlap). Protein product from Mb1085 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1085 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007361" /db_xref="InterPro:IPR038694" /db_xref="UniProtKB/TrEMBL:A0A1R3XX99" /protein_id="SIT99684.1" /translation="MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVP YYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPV AGTVRFNWDPLRWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLL FETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHY PLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS" CDS 1179872..1181053 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1086" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1086, -, len: 393 aa. Equivalent to Rv1057, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 393 aa overlap). Conserved hypothetical protein, some similarity to X84710|MMSAG_1 surface antigen of Methanosarcina mazeii (491 aa), FASTA scores: opt: 363, E():6.2e-15, (31.3% identity in 294 aa overlap). Mb1086 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011048" /db_xref="InterPro:IPR015943" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK8" /protein_id="SIT99685.1" /translation="MSVMNGREVARESRDAQVFEFGTAPGSAVVKIPVQGGPIGGIAI SRDGSLLVVTNNGTDTVSVVGTDTCRVTQTVTSVNEPFAIAMGNAEANRAYVSTVSSA YDAIAVIDVATNTVLGTHPLALSVSDLTLSPDDKYLYVSRNGTRGADVAVLDTTTGAL IDVVDVSQAPGTTTQCVRMSPDGSVLYVGANGPSGGLLVVITTRAQSDGGRIGSRSRS RQKSSKPRGNQAAAGLRVVATIDIGSSVRDVALSPDGAIAYVASCGSDFGAVVDVIDT RTHQITSSRAISEIGGLVTRVSVSGDADRAYLVSEDRVTVLCTRTHDVIGTIRTGQPS CVVESPDGKYLYIADYSGTITRTAVASTIVSGTEQLALQRRGSMQWFSPELQQYAPAL A" CDS 1181160..1182791 /codon_start=1 /transl_table=11 /gene="fadD14" /locus_tag="BQ2027_MB1087" /product="PROBABLE MEDIUM CHAIN FATTY-ACID-COA LIGASE FADD14 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb1087, fadD14, len: 543 aa. Equivalent to Rv1058, len: 543 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 543 aa overlap). Probable fadD14, medium-chain fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to many e.g. CAC32346.1|AL583945 putative fatty acid CoA ligase from Streptomyces coelicolor (558 aa); N-terminus of NP_419738.1|NC_002696 medium-chain-fatty-acid--CoA ligase from Caulobacter crescentus (1006 aa); Q00594|ALKK_PSEOL MEDIUM-CHAIN-FATTY-ACID--COA LIGASE (EC 6.2.1.-) from Pseudomonas oleovorans (546 aa), FASTA scores: opt: 1468, E(): 0, (41.1% identity in 538 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1087 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1087 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX85" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XX85" /protein_id="SIT99686.1" /translation="MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDV GQRAGQLANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPEQI AYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLREAGKTVLRFAE LIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHRSSFLHTMAACTTNGIGVG SSDKVLPIVPMFHANGWGLPYAALMAGADLVLPDRHLDARSLIHMVETLKPTLAGAVP TIWNDVMHYLEKDPDHDMSSLRLVACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSP LATMAWPPPGTPDDQHWAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPW IAGSYYGGRDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVVRWWLPERW AFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT" CDS 1182867..1183931 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1088" /product="Oxidoreductase" /note="Mb1088, -, len: 354 aa. Equivalent to Rv1059, len: 354 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 354 aa overlap). Conserved hypothetical protein, similar to Rv0926c|MTCY21C12.20c hypothetical protein from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 338, E(): 1.4e-14, (33.1% identity in 363 aa overlap). Protein product from Mb1088 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1088 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX84" /db_xref="InterPro:IPR000846" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XX84" /protein_id="SIT99687.1" /translation="MTMSLRVIQWATGSVGVAAIKGVLQHPELELVGCWVHSAAKSGK DVGEIIGSPPLGVIATNSIDDVLALDADAVIYAPLLPSVDEVAALLRSGKNVVTPLGW FYPSEKEAAPLEVAAQAGNATLHGAGIGPGAVTELFPLLLSVMSTGVTFVRSEEFSDL RSYGAPDVLRYVMGFGGTPDSALTGPMQKILDGGFLQSVRLCVDRLGFAADPQIRTSQ EVAVATAPIDSPIGVIEPGQVAGRRFHWEALVEDTVVVQIAVNWLMGSENLDPPWSFG PAGERYEIEVRGSPDTCVTIKGWQPQTVAAGLKSNPGIVATAAHCVNAIPATCAAPAG IQSFFDLPLITGRAAPGLAR" CDS 1183984..1184457 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1089" /product="unknown protein" /note="Mb1089, -, len: 157 aa. Equivalent to Rv1060, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 157 aa overlap). Hypothetical unknown protein. Protein product from Mb1089 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1089 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XX78" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99688.1" /translation="MAKSVVVEQSRAIPVQSEDAFGGTLAAALPVICSHWYGLIPPIK EVRDQTGAWDSVGQARVITMVGGGRVREELTSVDPPRSFGYTLTDIKGPLAPLVALVE GKWSFAPADTGTTVTWQWTIHPRSALAAPVLPVFARMWRGYARGVLEKLSALLVG" CDS 1184491..1185354 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1090" /product="Glutamine amidotransferases class-II" /note="Mb1090, -, len: 287 aa. Equivalent to Rv1061, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 287 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from various bacteria e.g. D64002|SYCSLRD_75 Synechocystis sp. PCC6803 (304 aa), FASTA scores: opt: 245, E(): 1.2e-09, (27.1% identity in 258 aa overlap). Protein product from Mb1090 detected using SWATH mass spectrometry. Mb1090 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR026869" /db_xref="InterPro:IPR029055" /db_xref="UniProtKB/TrEMBL:A0A1R3XXA0" /protein_id="SIT99689.1" /translation="MCRLFGLHSGTDAVTATFWLLNASDSLAEQSRRNPDGTGLGVFD EHHQPRLHKQPIAAWQDADFATEAHELTGTTFVAHVRYATTGSLDIRNTHPFLQDGRI FAHNGVVEGLDVLDERLREVGADDLVLGQTDSERVFALITASIRARDGNESAGLIDAL RWLAANVPIYAVNVLLSTATDVWALRYPESHELYILDRRGDGAPEFHLRSKRIRAHST HLRERSSVVFATEPMDDNPRWRLLDAGELVHVDAALRVNRSLVLPDPPRHPIRREDLS EPVLHAQHTSA" CDS 1185359..1186216 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1091" /product="Phospholipase, patatin family protein" /note="Mb1091, -, len: 285 aa. Equivalent to Rv1062, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 285 aa overlap). Conserved hypothetical protein, some similarity to AL079356|SC6G9_10 hypothetical protein in Streptomyces coelicolor (289 aa), FASTA scores: opt: 556, E(): 1.2e-27, (39.0% identity in 287 aa overlap), and Z99111|BSUB0008_176 Bacillus subtilis (260 aa), FASTA scores: opt: 163, E(): 0.0013, (27.4% identity in 179aa overlap) Protein product from Mb1091 detected using SWATH mass spectrometry. Mb1091 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX86" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR016035" /db_xref="UniProtKB/TrEMBL:A0A1R3XX86" /protein_id="SIT99690.1" /translation="MTTRRALVLAGGGLAGIAWETGVLRGIADESPAAARLLLDSDVL VGTSAGATVAAQISSGCPLDTLYERQLAETSAEIDPGVDIDAITDLFLTAVTEPHIST RRRLQRIGAVALAVDTVPESVRRQVIAQRLPSHDWPDRVLRVTAIDIATGELVVFHRE SNVALVDAVAASCSVPGAWPPVTIAGRRYMDGGVASSVNLGVADDCDAAVVLVPAGAD APSPFGGGAAAEIAAATGMVFAVFADDDSLAAFGPNPLDPLCRVNSAMAGRQQGRREA QAVARLLGV" CDS complement(1186217..1187299) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1092C" /product="FIG00613342: Bacterial patatin-like phospholipase domain containing protein" /note="Mb1092c, -, len: 360 aa. Equivalent to Rv1063c, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 360 aa overlap). Conserved hypothetical protein, similar to P37053|YCHK_ECOLI hypothetical protein from Escherichia coli (314 aa), FASTA scores: opt: 487, E(): 7.2e-23, (32.7% identity in 321 aa overlap). Also partially similar to Rv3239c|MTCY20B11.14c. BELONGS TO THE UPF0028 (SWS) FAMILY. Protein product from Mb1092c detected using SWATH mass spectrometry. Mb1092c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67099" /db_xref="InterPro:IPR001423" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR016035" /db_xref="UniProtKB/Swiss-Prot:P67099" /protein_id="SIT99691.1" /translation="MPAPAALRVRGSSSPRVALALGSGGARGYAHIGVIQALRERGYD IVGIAGSSMGAVVGGVHAAGRLDEFAHWAKSLTQRTILRLLDPSISAAGILRAEKILD AVRDIVGPVAIEQLPIPYTAVATDLLAGKSVWFQRGPLDAAIRASIAIPGVIAPHEVD GRLLADGGILDPLPMAPIAGVNADLTIAVSLNGSEAGPARDAEPNVTAEWLNRMVRST SALFDVSAARSLLDRPTARAVLSRFGAAAAESDSWSQAPEIEQRPAGPPADREEAADT PGLPKMGSFEVMNRTIDIAQSALARHTLAGYPADLLIEVPRSTCRSLEFHRAVEVIAV GRALATQALEAFEIDDDESAAATIEG" CDS complement(1187380..1187799) /codon_start=1 /transl_table=11 /gene="lpqV" /locus_tag="BQ2027_MB1093C" /product="possible lipoprotein lpqv" /note="Mb1093c, lpqV, len: 139 aa. Equivalent to Rv1064c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 139 aa overlap). Putative lipoprotein LpqV. Has N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb1093c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65311" /db_xref="InterPro:IPR020377" /db_xref="UniProtKB/Swiss-Prot:P65311" /protein_id="SIT99692.1" /translation="MRPSRYAPLLCAMVLALAWLSAVAGCSRGGSSKAGRSSSVAGTL PAGVVGVSPAGVTTRVDAPAESTEEEYYQACHAARLWMDAQPGSGESLIEPYLAVVQA SPSGVAGSWHIRWAALTPARQAAVIVAARAAANAECG" CDS 1187911..1188477 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1094" /product="Cysteine dioxygenase (EC" /EC_number="1.13.11.20" /note="Mb1094, -, len: 188 aa. Equivalent to Rv1065, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 188 aa overlap). Conserved hypothetical protein, some similarity to AL0209|SC4H8_11 hypothetical protein from Streptomyces coelicolor (182 aa), FASTA scores: opt: 156, E(): 0.0011, (31.3% identity in 195 aa overlap). Mb1094 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XY76" /db_xref="InterPro:IPR010300" /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR014710" /db_xref="UniProtKB/TrEMBL:A0A1R3XY76" /protein_id="SIT99693.1" /translation="MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHL LPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYR WDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLT AMSYYEITERNTLRRQRTELTDQPEGSG" CDS 1188474..1188869 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1095" /product="Rhodanese-related sulfurtransferase, 1 domain" /note="Mb1095, -, len: 131 aa. Equivalent to Rv1066, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 131 aa overlap). Conserved hypothetical protein, strong similarity to AL0209|SC4H8.10 hypothetical protein from Streptomyces coelicolor (132 aa), FASTA scores: opt: 429, E(): 5.2e-23, (57.1% identity in 119 aa overlap). Protein product from Mb1095 detected using SWATH mass spectrometry. Mb1095 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/TrEMBL:A0A1R3XXA5" /protein_id="SIT99694.1" /translation="MSRIDRVLEAARRRYRRLAADQVPEAARRGAVLVDIRPQAQRAR EGEVPGALVIERNVLEWRCDPTSDARLPQAVDDDVEWVILCSEGYTSSLAAASLLDLG LHRATDVVGGYRALAAGGVLAELGGAVGG" CDS complement(1188897..1190912) /codon_start=1 /transl_table=11 /gene="PE_PGRS19" /locus_tag="BQ2027_MB1096C" /product="pe-pgrs family protein pe_pgrs19" /note="Mb1096c, PE_PGRS19, len: 671 aa. Equivalent to Rv1067c, len: 667 aa, from Mycobacterium tuberculosis strain H37Rv, (98.5% identity in 673 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to Rv3388|MTV004.46 from Mycobacterium tuberculosis (731 aa), FASTA scores: opt: 2227, E(): 0, (55.6% identity in 710 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1, probably fortuitous. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, 2 deletions each of 3 bp (ccg-* and cgc-*) and a 18 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (671 aa versus 667 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99695.1" /translation="MSFVLVSPSQLMAAAADVAGIGSAISAANAAALAPTSVLAAAGA DEVSAAVAALFSAHAGQYQQLGARAALFHEQFVQALTGAASAYASAEATNVEQQVLGL INAPTQALWGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGLTGGTGGSAGL IGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPAGAIGAPGVAGGAGGAGGT AGLFGNGGVGGVGGDGGQGGNGAGAGASGTKGGDAGAGGAGGAGGWIHGHGGAGGDGG AGGAGGQASPGAPGPPSQPGGAGGAGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAG GAGNGGQFGGDGGTGGTGGTAGAGGNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEH VMATAGKGGTGGVGGDGGGGAGGGGGLLYGNGGAGGNGGAGGAGNSGGDGGTGLNAAL GGNGGGGGVGGNAGAGGTGGSAGWLSGNGGAGGSGGSAGAGGAGGKGGDTPNGLAINP GIGGNGGDTGNAGNGGNGGSAARLFGGGGAGGAGGTGSTAGSGGSGGTNPPTGLQAAG GNGGSGHAGGHGGNGGGAGLLGGGTGGNGGGGGQGGLGAAAGGVDGNGGNGGNGGKGG DAQLVGDGGNGGNGGKGGAGLIAGLDGAGGAGGTRGLIFGNAGTPGQ" CDS complement(1191245..1193536) /codon_start=1 /transl_table=11 /gene="PE_PGRS20" /locus_tag="BQ2027_MB1097C" /product="pe-pgrs family protein pe_pgrs20" /note="Mb1097c, PE_PGRS20, len: 763 aa. Similar to Rv1068c, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (59.8% identity in 766 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to AL021897|MTV017_19 M. tuberculosis H37Rv (667 aa), FASTA scores: opt: 1875, E(): 0, (55.0% identity in 667 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 717 bp and 192 bp, and a 9 bp (tccgctgcc-*) deletion, leads to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (763 aa versus 463 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XX95" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99696.1" /translation="MSYMIAVPDMLSSAAGDLASIGSSINASTRAAAAATTRLLPAAA DEVSAHIAALFSGHGEGYQAIARQMAAFHDQFTLALTSSAGAYASAEATNVEQQVLGL INAPTQALWGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGLTGGTGGSAGL IGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPAGAIGAPGVAGGAGGAGGS AGLFGNGGAGGAGGAGGQGGAGIGGADGTKGGDAGAGGAGGAGGWIHGHGGVGGDGGT GGQGGDGVQGEPGDTGAAGGAGGAGGRGGDGGSAGWLSGNGGDAGTGGGGGNAGAGGE GGIFGGNGGNGGTGGTAGGGGNGGRGAALFGHGGNAGHGGAGGNGAAGGNGADTQLGI SGKGGTGGGGGGAGAGGTGGDGGLLYGNGGAGGNGGNGGAAGKGGIGAPGLSTAQGGD GGNGGSGGNAGNGGNGGRGSVLFGHGGNAGHGGAGGNGAVSGNGGSSITAVGGKGGTG GGGGGGAGGTGGDAGLLYGNGGAGGTGGSGGAGARGGDGGAGSGTAQGGDGGAGGVGG NAGNGGNGGSAGWLSGNGGTGGGGGTAGAGGQGGNGNSGIDPGNGGQGADTGNAGNGG HGGSAAKLFGDGGAGGAGGMGSTGGTGGGGGFGGGTGGNGGNGHAGGAGGSGGTAGLL GSGGTGGDGGNGGLGAGSGAKGNGGNGGDGGKGGDAQLIGNGGNGGNGGKGGTGLMPG INGTGGAGGSRGQISGNPGTPGQ" CDS complement(1193898..1195661) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1098C" /product="Predicted membrane protein (DUF2319)" /note="Mb1098c, -, len: 587 aa. Equivalent to Rv1069c, len: 587 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 587 aa overlap). Conserved hypothetical protein, hydrophobic regions in N-terminal domain. Similar in part to O07136|B1306.04C B1306.04c protein from Mycobacterium leprae (89 aa), FASTA scores: opt: 229, E(): 1.3e-07, (54.2% identity in 72 aa overlap). Protein product from Mb1098c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1098c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XX94" /db_xref="InterPro:IPR012037" /db_xref="InterPro:IPR027787" /db_xref="InterPro:IPR027788" /db_xref="UniProtKB/TrEMBL:A0A1R3XX94" /protein_id="SIT99697.1" /translation="MTEPAAATTTNASDEPATGAEQAVDTAATPQTPEPQPIRSTWWI RHYTFTGTAMGLVFVWFSMTPSLLPRGPLFQGLVSGICGAFGYGLGVFAVWLVRYMRS HNSSPPPPRWAWPPLIAVGAVGMVGMAVQFHVWQDDVRDLMGVEHLRWYDYPLAAALS LVVLFTLVEIGQFIRWLFRFLVGQVDRIAPFRVSAAIVVVLLVVLTITLLNGVVLKFA MNSMNSTFAAVNNEMNPDSAPPKTPLRSGGPGSLVSWESLGHQGRIFVHSGPTIADLT AFNGTPAVEPIRTYAGLNSADGIMATAELAARELARTGGLRRAVVAVATSTGTGWINE AEASALEYMYNGDTAIVSMQYSFLPSWLSFLVDKENARHAGEALFEAVDKLIRQLPES QRPKLVVFGESLGSFGGEAPFMNLNNILARTDGALFSGPTFNNTVWNSLTANRDAGSP QWLPIYDDGRNVRFVARARDLQRPDAPWGRPRVVYLQHASDPIAWWTPRLLFREPDWL REQRGYDVLPQTRWIPVVTFVQVSADMAVATHVPDGHGHRYVATVADGWAAVLSPPGW TQQKTERLQPLLHANAKPFGS" CDS complement(1195658..1196431) /codon_start=1 /transl_table=11 /gene="echA8" /locus_tag="BQ2027_MB1099C" /product="PROBABLE ENOYL-CoA HYDRATASE ECHA8 (ENOYL HYDRASE) (UNSATURATED ACYL-CoA HYDRATASE) (CROTONASE)" /note="Mb1099c, echA8, len: 257 aa. Equivalent to Rv1070c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 257 aa overlap). Probable echA8, enoyl-CoA hydratase (EC 4.2.1.17), equivalent to O07137|B1306.05c putative enoyl-CoA hydratase/isomerase from Mycobacterium leprae (257 aa), FASTA scores: opt: 1417, E(): 0, (86.4% identity in 257 aa overlap). Also highly similar to others e.g. NP_106219.1|NC_002678 enoyl CoA hydratase from Mesorhizobium loti (257 aa); L39265|RHMRPST_2 enoyl-CoA hydratase from Rhizobium melilotii (257 aa), FASTA scores: opt: 1100, E(): 0, (66.9% identity in 257 aa overlap); AAK18173.1|AF290950_5|AF290950|FadB1x enoyl-CoA hydratase from Pseudomonas putida (257 aa); etc. Contains PS00166 Enoyl-CoA hydratase/isomerase signature. BELONGS TO THE ENOYL-CoA HYDRATASE/ISOMERASE FAMILY. TBparse score is 0.881. Protein product from Mb1099c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1099c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64017" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/Swiss-Prot:P64017" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99698.1" /translation="MTYETILVERDQRVGIITLNRPQALNALNSQVMNEVTSAATELD DDPDIGAIIITGSAKAFAAGADIKEMADLTFADAFTADFFATWGKLAAVRTPTIAAVA GYALGGGCELAMMCDVLIAADTAKFGQPEIKLGVLPGMGGSQRLTRAIGKAKAMDLIL TGRTMDAAEAERSGLVSRVVPADDLLTEARATATTISQMSASAARMAKEAVNRAFESS LSEGLLYERRLFHSAFATEDQSEGMAAFIEKRAPQFTHR" CDS complement(1196443..1197480) /codon_start=1 /transl_table=11 /gene="echA9" /locus_tag="BQ2027_MB1100C" /product="POSSIBLE ENOYL-COA HYDRATASE ECHA9 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb1100c, echA9, len: 345 aa. Equivalent to Rv1071c, len: 345 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 345 aa overlap). Possible echA9, enoyl-CoA hydratase (EC 4.2.1.17), equivalent to Y13803|B1306.06c putative enoyl-CoA hydratase/isomerase from Mycobacterium leprae (345 aa), FASTA scores: opt: 1799, E(): 0, (77.7% identity in 345 aa overlap). Also similar to many eukaryotic and prokaryotic enoyl-CoA hydratases e.g. NP_437984.1|NC_003078 putative enoyl-CoA hydratase protein from Sinorhizobium meliloti (356 aa); NP_420165.1|NC_002696 enoyl-CoA hydratase/isomerase family protein from Caulobacter crescentus (350 aa); Q19278 PROTEIN SIMILAR TO ENOYL-COA HYDRATASES from Caenorhabditis elegans (386), FASTA scores: opt: 787, E(): 0, (38.5% identity in 348 aa overlap); etc. Protein product from Mb1100c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1100c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXA9" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR032259" /db_xref="UniProtKB/TrEMBL:A0A1R3XXA9" /protein_id="SIT99699.1" /translation="MTGESHEVLTNVEGGVGFVTLNRPKAINSLNQTMVDLLATVLMS WEHEDAVHAVVLSGAGERGLCAGGDVVAVYHSARKDGVEARRFWRHEYLLNALIGRFA KPYVALMDGIVMGGGVGVSAHANTRVVTDTSKVAMPEVGIGFIPDVGGVYLLSRAPGA LGLHAALTGAPFSGADAIALGFADHFVPHGDLDAFTQKIVTGGVESALAAHAVEPPPS TLAAQRDWIDECYAGDSVADIVAALRKQGGEPAVNASDLIASRSPIALSVTLQAVRRA AKLDTLEDVLIQDYRVSSASLRSHDLVEGIRAQLIDKDRNPNWSPATLDAITAADIEA YFEPVDDDLSF" CDS 1197667..1198503 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1101" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb1101, -, len: 278 aa. Equivalent to Rv1072, len: 278 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 278 aa overlap). Probable conserved transmembrane protein, equivalent to O07139|B1306.07|Y13803 Protein B1306.07 from Mycobacterium leprae (220 aa), FASTA scores: opt:1032, E(): 0, (75.0% identity in 220 aa overlap); and at the C-terminal end to Q50056|U1740D Mycobacterium leprae (96 aa), FASTA scores: opt: 381, E(): 1.2e-18, (71.6% identity in 81 aa overlap). Similar to Q54192|M80628|STMBLDA_1 TRANSFER RNA-LEU (BLDA) GENE AND ORF from Streptomyces griseus (293 aa), FASTA scores: opt:558, E(): 4.7e-30, (41.5% identity in 299 aa overlap). TBparse score is 0.896. Protein product from Mb1101 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1101 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX96" /db_xref="InterPro:IPR010539" /db_xref="UniProtKB/TrEMBL:A0A1R3XX96" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99700.1" /translation="MRETSNPVFRSLPKQRGGYAQFGTGTAQQGFPADPYLAPYREAK ATRPLTIDDVVTKTGLTLAMLAGTAVVSYFLVASNVALAMPLTLVGALGGLALVLVAT FGRKQDNPAIVLSYAALEGLFLGAISFVLANFTVASANAGVLIGEAILGTMGVFFGML VVYKTGAIRVTPKFTRMVVAALFGVLVLMLGNLVLAMFNVGGGEGLGLRSPGPLGIIF SLVCIGIAAFSFLIDFDAADQMIRAGAPEKAAWGVALGLTVTLVWLYIEILRLLSYLQ NE" CDS 1198619..1199470 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1102" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1102, -, len: 283 aa. Equivalent to Rv1073, len: 283 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 283 aa overlap). Conserved hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. Rv1482c|Z79701|MTCY277.03 Mycobacterium tuberculosis (339 aa), FASTA scores: opt: 810, E(): 0, (47.4% identity in 272 aa overlap); Rv3555c|Z92774|MTCY6G11_2 Mycobacterium tuberculosis (289 aa), FASTA scores: opt: 704, E(): 0, (44.4% identity in 259 aa overlap); and Rv3517, etc., and GIR10|AF002133_10 M. avium strain GIR10 (346 aa), FASTA scores: opt: 802, E(): 0, (48.1% identity in 270 aa overlap). Mb1102 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XX98" /protein_id="SIT99701.1" /translation="MGAQPFIGSEALAAGLISWHELGKYYTAIMPNVYLDKRLKPSLR QRVIAAWLWSGRKGVIAGASASALHGAKWVDDHALVELIWRNARAPNGVRTKDELLLD GEVQRLCGLTVTTVERTAFDLGRRPPLGQAITRLDALANATDFKINDVRELARKHPHT RGLRQLDKALDLVDPGAQSPKETWLRLLLINAGFPRPSTQIPLLGVYGHPKYFLDMGW EDIMLAVEYDGEQHRLSRDQFVKDVERLEYIRRAGWTHIRVLADHKGPDVVRRVRQAW DTLTSRR" CDS complement(1199544..1200761) /codon_start=1 /transl_table=11 /gene="fadA3" /locus_tag="BQ2027_MB1103C" /product="PROBABLE BETA-KETOACYL CoA THIOLASE FADA3" /note="Mb1103c, fadA3, len: 405 aa. Equivalent to Rv1074c, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 405 aa overlap). Probable fadA3, beta-ketoacyl CoA thiolase (EC 2.3.1.-), highly similar to many involved in beta-oxidation e.g. CAB89028.1|AL353870 beta-ketoadipyl-CoA thiolase from Streptomyces coelicolor (395 aa); P77525|PAAJ_ECOLI probable beta-ketoadipyl CoA thiolase from Escherichia coli (401 aa), FASTA scores: opt: 1034, E(): 5.4e-56, (43.5% identity in 416 aa overlap) and X97452 acetyl-CoA acetyltransferase (thiolase) from Escherichia coli (401 aa), FASTA scores: opt: 1043, E(): 0, (43.4% identity in 415 aa overlap); Q43935|CATF_ACICA beta-ketoadipyl CoA thiolase from Acinetobacter calcoaceticus (401 aa), FASTA scores: opt: 992, E(): 0, (41.5% identity in 415 aa overlap); etc. Contains PS00737 Thiolases signature 2, and PS00445 FGGY family of carbohydrate kinases signature 2, although this is probably fortuitous. BELONGS TO THE THIOLASE FAMILY. Protein product from Mb1103c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1103c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZD9" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99702.1" /translation="MPEAVIVSTARSPIGRAMKGSLVGMRPDDLAVQMVRAALDKVPA LNPHQIDDLMMGCGLPGGESGFNIARVVAVALGYDFLPGTTVNRYCSSSLQTTRMAFH AIKAGEGDAFISAGVETVSRFAKGNSDSWPDTKNPLFDGAQERSAAAAAGADEWHDPR TDQKLPDIYIAMGQTAENVAIMTGISREEQDRWGVRSQNRAEEAIKNGFFEREITPVT LPDGTTVSTDDGPRPGTTYEKVSELKPAFRPNGTVTAGNACPLNDGAAAVVITSDTKA KELGLTPLARIVSTGVSGLSPEIMGLGPIEASKKALERAGMAITDIDLVEINEAFAVQ VLGSARELGIDEDKLNISGGAIALGHPFGMTGARITTTLLNNLQTYDKTFGLETMCVG GGQGMAMVIERLA" CDS complement(1200814..1201758) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1104C" /product="CONSERVED EXPORTED PROTEIN" /note="Mb1104c, -, len: 314 aa. Equivalent to Rv1075c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 314 aa overlap). Possibly exported protein, as it contains a N-terminal signal sequence, hydrophobic domain from aa 7-25. Similar to U15183|MLU15183_2 Mycobacterium leprae cosmid B1740 (106 aa), FASTA scores: opt: 207, E(): 1.6e-06, (42.6% identity in 101 aa overlap). Also weak similarity to many glyceraldehyde-3-phosphate dehydrogenases e.g. Q41595|G3PC_TAXBA Taxus baccata (340 aa), FASTA scores: opt: 147, E(): 0.027, (27.5% identity in 189 aa overlap). Mb1104c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013830" /db_xref="InterPro:IPR036514" /db_xref="UniProtKB/TrEMBL:A0A1R3XY90" /protein_id="SIT99703.1" /translation="MPRRSTIALATAGALASTGTAYLGARNLLVGQATHARTVIPKSF DAPPRADGVYTRGGGPVQRWRREVPFDVHLMIFGDSTATGYGCASAEEVPGVLIARGL AEQTGKRIRLSTKAIVGATSKGVCGQVDAMFVVGPPPDAAVIMIGANDITALNGIGPS AQRLADCVRRLRTRGAVVVVGTCPDLGVITAIPQPLRALAHTRGVRLARAQTAAVKAA GGVPVPLGHLLAPKFRAMPELMFSADRYHPSAPAYALAADLLFLALRDALTEKLDIPI HETPSRPGTATLEPGHTRHSMMSRLRRPRPARAVPTGG" CDS 1202155..1203048 /codon_start=1 /transl_table=11 /gene="lipU" /locus_tag="BQ2027_MB1105" /product="possible lipase lipu" /note="Mb1105, lipU, len: 297 aa. Equivalent to Rv1076, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 297 aa overlap). Putative lipU, lipase (EC 3.1.-.-), very similar to several Mycobacterium tuberculosis proteins e.g. Z95390|Rv3487c|MTCY13E12.41c (277 aa), FASTA scores: opt: 1225, E(): 0, (76.0% identity in 246 aa overlap); Rv1426c, etc. Also similar to esterases and lipases of around 300 aa e.g. Q44087 ESTERASE PRECURSOR from Acinetobacter lwoffii esterase (303), FASTA scores: opt: 427, E(): 1.9e-21, (32.5% identity in 280 aa overlap). Equivalent to AL035159|MLCB1450 _7 Mycobacterium leprae (335 aa), FASTA scores: opt: 1588, E(): 0, (79.7% identity in 296 aa overlap). Protein product from Mb1105 detected using SWATH mass spectrometry. Mb1105 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXB5" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR033140" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB5" /protein_id="SIT99704.1" /translation="MAVRPVLAVGSYLPHAPWPWGVIDQAARVLLPASTTVRAAVSLP NASAQLVRASGVLPADGTRRAVLYLHGGAFLTCGANSHGRLVELLSKFADSPVLVVDY RLIPKHSIGMALDDCHDGYRWLRLLGYEPEQIVLAGDSAGGYLALALAQRLQEVGEEP AALVAISPLLQLAKEHKQAHPNIKTDAMFPARAFDALDALVASAAARNQVDGEPEELY EPLEHITPGLPRTLIHVSGSEVLLHDAQLAAAKLAAAGVPAEVRVWPGQVHDFQVAAS MLPEAIRSLRQIGEYIREATG" CDS 1203105..1204499 /codon_start=1 /transl_table=11 /gene="cbs" /locus_tag="BQ2027_MB1106" /standard_name="cysM2" /product="Probable cystathionine beta-synthase CBS (Serine sulfhydrase) (Beta-thionase) (Hemoprotein H-450)" /note="Mb1106, cbs, len: 464 aa. Equivalent to Rv1077, len: 464 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 464 aa overlap). Probable cbs (previously cysM2), cystathionine beta-synthase (EC 4.2.1.22), similar throughout its length to many eukaryotic cystathionine beta-synthases e.g. P32232|CBS_RAT CYSTATHIONINE BETA-SYNTHASE (560 aa), FASTA scores: opt: 951, E(): 0, (40.2% identity in 450 aa overlap); also similar in N-terminal domain (aa 1 - 330) to Rv2334|MTCY98.03 CysK Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 855, E(): 0, (46.8% identity in 314 overlap); and other cysteine synthase proteins e.g. Rv1336, Rv0848, etc. Contains PS00217 Sugar transport proteins signature 2 probably spurious. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. Protein product from Mb1106 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1106 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXM8" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR005857" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99705.1" /translation="MRIAQHISELIGGTPLVRLNSVVPDGAGTVAAKVEYLNPGGSSK DRIAVKMIEAAEASGQLKPGGTIVEPTSGNTGVGLALVAQRRGYKCVFVCPDKVSEDK RNVLIAYGAEVVVCPTAVPPHDPASYYSVSDRLVRDIDGAWKPDQYANPEGPASHYVT TGPEIWADTEGKVTHFVAGIGTGGTITGAGRYLKEVSGGRVRIVGADPEGSVYSGGAG RPYLVEGVGEDFWPAAYDPSVPDEIIAVSDSDSFDMTRRLAREEAMLVGGSCGMAVVA ALKVAEEAGPDALIVVLLPDGGRGYMSKIFNDAWMSSYGFLRSRLDGSTEQSTVGDVL RRKSGALPALVHTHPSETVRDAIGILREYGVSQMPVVGAEPPVMAGEVAGSVSERELL SAVFEGRAKLADAVSAHMSPPLRMIGAGELVSAAGKALRDWDALMVVEEGKPVGVITR YDLLGFLSEGAGRR" CDS 1204701..1205423 /codon_start=1 /transl_table=11 /gene="pra" /locus_tag="BQ2027_MB1107" /product="Probable Proline-rich antigen homolog pra" /note="Mb1107, pra, len: 240 aa. Equivalent to Rv1078, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 240 aa overlap). Probable pra, Proline-rich antigen homolog, equivalent to X65546|MLPRAG_1 proline rich antigen from Mycobacterium leprae (249 aa), FASTA scores: opt: 1162, E(): 3.3e-30, (64.8% identity in 253 aa overlap). Has potential hydrophobic domains. Protein product from Mb1107 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1107 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXA6" /db_xref="InterPro:IPR010432" /db_xref="UniProtKB/TrEMBL:A0A1R3XXA6" /protein_id="SIT99706.1" /translation="MTEQPPPGGSYPPPPPPPGPSGGHEPPPAAPPGGSGYAPPPPPS SGSGYPPPPPPPGGGAYPPPPPSAGGYAPPPPGPAIRTMPTESYTPWITRVLAAFIDW APYVVLVGIGWVIMLVTQTSSCVTSISEYDVGQFCVSQPSMIGQLVQWLLSVGGLAYL VWNYGYRQGTTGSSIGKSVLKFKVVSETTGQPIGFGMSVVRQLAHFIDAIICFVGFLF PLWDAKRQTLADKIMTTVCVPI" CDS 1205455..1206621 /codon_start=1 /transl_table=11 /gene="metB" /locus_tag="BQ2027_MB1108" /product="cystathionine gamma-synthase metb (cgs) (o-succinylhomoserine [thiol]-lyase)" /note="Mb1108, metB, len: 388 aa. Equivalent to Rv1079, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 388 aa overlap). Probable metB, cystathionine gamma-synthase (EC 4.2.99.9) (see citation below). P46807|METB_MYCLE CYSTATHIONINE GAMMA-SYNTHASE from Mycobacterium leprae (EC 4.2.1.22) (388 aa), FASTA scores: opt: 2220, E(): 0, (87.3% identity in 387 aa overlap). Also similar to other Mycobacterium tuberculosis enzymes involved in methionine synthesis e.g. Rv0391 and Rv3340. Contains PS00868 Cys/Met metabolism enzymes pyridoxal-phosphate attachment site. BELONGS TO THE TRANS-SULFURATION ENZYMES FAMILY. Protein product from Mb1108 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1108 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66876" /db_xref="InterPro:IPR000277" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P66876" /protein_id="SIT99707.1" /translation="MSEDRTGHQGISGPATRAIHAGYRPDPATGAVNVPIYASSTFAQ DGVGGLRGGFEYARTGNPTRAALEASLAAVEEGAFARAFSSGMAATDCALRAMLRPGD HVVIPDDAYGGTFRLIDKVFTRWDVQYTPVRLADLDAVGAAITPRTRLIWVETPTNPL LSIADITAIAELGTDRSAKVLVDNTFASPALQQPLRLGADVVLHSTTKYIGGHSDVVG GALVTNDEELDEEFAFLQNGAGAVPGPFDAYLTMRGLKTLVLRMQRHSENACAVAEFL ADHPSVSSVLYPGLPSHPGHEIAARQMRGFGGMVSVRMRAGRRAAQDLCAKTRVFILA ESLGGVESLIEHPSAMTHASTAGSQLEVPDDLVRLSVGIEDIADLLGDLEQALG" CDS complement(1206692..1207186) /codon_start=1 /transl_table=11 /gene="greA" /locus_tag="BQ2027_MB1109C" /product="PROBABLE TRANSCRIPTION ELONGATION FACTOR GREA (Transcript cleavage factor greA)" /note="Mb1109c, greA, len: 164 aa. Equivalent to Rv1080c, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 164 aa overlap). Probable greA, transcription elongation factor G, closest to P46808|GREA_MYCLE TRANSCRIPTION ELONGATION FACTOR G from Mycobacterium leprae (202 aa), FASTA scores: opt: 1005, E(): 0, (94.5% identity in 164 aa overlap); and similar to many e.g. P21346|GREA_ECOLI from Escherichia coli (158 aa), FASTA scores: opt: 257, E(): 5.7e-10, (37.2% identity in 148 aa overlap); etc. Contains two PS00829 and one PS00830 Prokaryotic transcription elongation factors signatures 1 and 2, respectively. BELONGS TO THE GREA/GREB FAMILY. Protein product from Mb1109c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1109c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64280" /db_xref="InterPro:IPR001437" /db_xref="InterPro:IPR006359" /db_xref="InterPro:IPR018151" /db_xref="InterPro:IPR022691" /db_xref="InterPro:IPR023459" /db_xref="InterPro:IPR028624" /db_xref="InterPro:IPR036805" /db_xref="InterPro:IPR036953" /db_xref="UniProtKB/Swiss-Prot:P64280" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99708.1" /translation="MTDTQVTWLTQESHDRLKAELDQLIANRPVIAAEINDRREEGDL RENGGYHAAREEQGQQEARIRQLQDLLSNAKVGEAPKQSGVALPGSVVKVYYNGDKSD SETFLIATRQEGVSDGKLEVYSPNSPLGGALIDAKVGETRSYTVPNGSTVSVTLVSAE PYHS" CDS complement(1207372..1207806) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1110C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb1110c, -, len: 144 aa. Equivalent to Rv1081c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 144 aa overlap). Probable conserved membrane protein, with hydrophobic stretch from aa 26 -48, highly similar to NP_302548.1|NC_002677 conserved membrane protein from Mycobacterium leprae. Mb1110c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXC1" /db_xref="InterPro:IPR025443" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC1" /protein_id="SIT99709.1" /translation="MTHTPIPRPDARYGRPRLSRRARRRVAIALGVLVAAAGIVIAVI GYQRISTSAVTGSLVGYRLVDDETASVTISVTRSDPSRPVACIVRVRATNGSETGRRE LLVPPSEATTVQVTTTVKSSQPPVMADVYGCGTEVPSYLRLP" CDS 1207908..1208774 /codon_start=1 /transl_table=11 /gene="mca" /locus_tag="BQ2027_MB1111" /product="Mycothiol conjugate amidase Mca (Mycothiol S-conjugate amidase)" /note="Mb1111, mca, len: 288 aa. Equivalent to Rv1082, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 288 aa overlap). mca, mycothiol conjugate amidase (see citation below), equivalent to NP_302547.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (290 aa), FASTA scores: opt: 1737, E(): 0, (86.4% identity in 287 aa overlap); and similar to Q54358|X79146 lmbE protein from Streptomyces lincolnensis (270 aa). Also similar to Rv1170|MTV005.06|MSHB GlcNAc-Ins deacetylase from Mycobacterium tuberculosis (303 aa), FASTA scores: opt: 411, E(): 9.4e-20, (35.8% identity in 299 aa overlap). Protein product from Mb1111 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1111 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXA8" /db_xref="InterPro:IPR003737" /db_xref="InterPro:IPR017811" /db_xref="InterPro:IPR024078" /db_xref="UniProtKB/TrEMBL:A0A1R3XXA8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99710.1" /translation="MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGER GEILNPAMDLPDVHGRIAEIRRDEMTKAAEILGVEHTWLGFVDSGLPKGDLPPPLPDD CFARVPLEVSTEALVRVVREFRPHVMTTYDENGGYPHPDHIRCHQVSVAAYEAAGDFC RFPDAGEPWTVSKLYYVHGFLRERMQMLQDEFARHGQRGPFEQWLAYWDPDHDFLTSR VTTRVECSKYFSQRDDALRAHATQIDPNAEFFAAPLAWQERLWPTEEFELARSRIPAR PPETELFAGIEP" CDS 1208771..1209037 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1112" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1112, -, len: 88 aa. Equivalent to Rv1083, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 88 aa overlap). Conserved hypothetical protein, similar to U15183|MLU15183_9 hypothetical protein from Mycobacterium leprae (167 aa), FASTA scores: opt: 332, E(): 1.2e-13, (58.4% identity in 101 aa overlap). Hydrophobic domain aa 25-43. Mb1112 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXB1" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB1" /protein_id="SIT99711.1" /translation="MNQILLSVIAEGGPGNTGPDFGKASPVGLLVIVLLVIATLFLVR SMNQQLKKVPKSFDRDHPELDQAADEGTDRDGPARPPGPPHESG" CDS 1209024..1211045 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1113" /product="conserved protein" /note="Mb1113, -, len: 673 aa. Equivalent to Rv1084, len: 673 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 673 aa overlap). Conserved hypothetical protein, similar to P37512|YYAL_BACSU hypothetical protein from Bacillus subtilis (689 aa), FASTA scores: opt: 1063, E(): 0, (36.5% identity in 696 aa overlap); AE0009|AE000983_10 Archaeoglobus fulgidus section 1 (642 aa), FASTA scores: opt: 1018, E(): 0, (37.2% identity in 600 aa overlap). Also similar to AE001938|AE001938_9 Deinococcus radiodurans (690 aa), FASTA scores: opt: 1097, E(): 0, (41.6% identity in 694 aa overlap). Protein product from Mb1113 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1113 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZF1" /db_xref="InterPro:IPR004879" /db_xref="InterPro:IPR008928" /db_xref="InterPro:IPR012341" /db_xref="InterPro:IPR024705" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XZF1" /protein_id="SIT99712.1" /translation="MSPANPSGTNTLALATSPYLRQHADNPVHWQQWTPQALAEAAAR AVPILLSVGYAACHWCHVMAHESFDDDEVAAAMNAGFVCIKVDREERPDIDAVYMNAT VALTGQGGWPMTCFLTPNGRPFFCGTYYPKAAFLQLLSAISETWRERRAEVEQASDHI AAELRSMASGLPGGGPEVAPELCDDAVAGVLREQDTAHGGFGGAPKFPPSALLEALMR HYERTRSPAALEAVARTGNAMARGGIYDQLGGGFARYSVDGAWVVPHFEKMLYDNALL LRAYAHWARRTGDPLARRVAAQTARFLLDELGSKAPADMFTSSLDADADGREGSTYVW TPVQLTEVLGGDDGRWAAEVFGVTEAGTFEHGTSVLQLPADPDDAARLDRVRAALLVA RLARAQPARDDKVVTSWNGLAITALAEASVALDDPALAHAARRCATRLLDLHVVDGRL RRASLGGVVGDSAAILEDHAMLATGLLALYQLTSEGAWLTAATGLLDTAVAHFGDPQR PGRWFDTADDAERLMLRPSDPLDGATPSGASSIAEALLTAGHVVDGARAERYWQLAAD TLRAHAVLLARAPRSAGHWLAVAEAVVRGPLQIAVACDLPRSSLLADARRLAPGGAIV VGGAAGSSALLVGRDRVAGADAAYVCRGRVCDLPVTSAAELATALGVPG" CDS complement(1211144..1211872) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1114C" /product="POSSIBLE HEMOLYSIN-LIKE PROTEIN" /note="Mb1114c, -, len: 242 aa. Equivalent to Rv1085c, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 242 aa overlap). Possible hemolysin-like protein, integral membrane protein, similar to many hemolysins, and hypothetical proteins e.g. U28375|ECU28375_49 Hypothetical protein from Escherichia coli (219 aa), FASTA scores: opt: 308, E(): 7.5e-15, (30.6% identity in 180 aa overlap); AE0011|HIAE001124_2 Hypothetical protein from Borrelia burgdorferi (233 aa), FASTA scores: opt: 305, E(): 1.3e-14, (25.6% identity in 203 aa overlap). Also weakly similar to HLY3_BACCE|P54176 haemolysin from Bacillus cereus (219 aa), FASTA scores: opt: 247, E(): 8.7e-12, (27.5% identity in 171 aa overlap). Also similar to AE002027|AE002027_8 probable hemolysin from Deinococcus radiodurans (219 aa), FASTA scores: opt: 354, E(): 1.8e-16, (31.1% identity in 219 aa overlap). Mb1114c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67158" /db_xref="InterPro:IPR004254" /db_xref="InterPro:IPR005744" /db_xref="UniProtKB/Swiss-Prot:P67158" /protein_id="SIT99713.1" /translation="MSGQADTATTAEARTPAHAAHHLVEGVARVLTKPRFRGWIHVYS AGTAVLAGASLVAVSWAVGSAKAGLTTLAYTAATITMFTVSATYHRVNWKSATARNWM KRADHSMIFVFIAGSYTPFALLALPAHDGRVVLSIVWGGAIAGILLKMCWPAAPRSVG VPLYLLLGWVAVWYTATILHNAGVTALVLLFVGGALYSIGGILYAVRWPDPWPTTFGY HEFFHACTAVAAICHYIAMWFVVF" CDS 1211983..1212771 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1115" /product="SHORT (C15) CHAIN Z-ISOPRENYL DIPHOSPHATE SYNTHASE (Z-FPP SYNTHASE) (Z-FARNESYL DIPHOSPHATE SYNTHASE) (Z-FPP SYNTHETASE) (Z-FARNESYL DIPHOSPHATE SYNTHETASE) (GERANYLTRANSTRANSFERASE) (FARNESYL PYROPHOSPHATE SYNTHETASE)" /note="Mb1115, -, len: 262 aa. Equivalent to Rv1086, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 262 aa overlap). Short (C15) chain Z-isoprenyl diphosphate synthase (EC 2.5.1.10) (see citations below), equivalent to NP_302598.1|NC_002677 possible undecaprenyl pyrophosphate synthetase from Mycobacterium leprae (262 aa), similar to many hypothetical proteins and several potential members of the upp synthase family e.g. NP_296167.1|NC_001263 undecaprenyl diphosphate synthase from Deinococcus radiodurans (339 aa); P20182|YT14_STRFR Hypothetical protein from Streptomyces fradiae (259 aa), FASTA scores: opt: 840, E(): 0, (51.0% identity in 259 aa overlap); and P38118|YARF_CORGL Hypothetical protein from Corynebacterium glutamicicum (234 aa), FASTA scores: opt: 729, E(): 0, (56.0% identity in 209 aa overlap); etc. Also similar to Rv2361c|MTCY27.19 (296 aa) (35.6% identity in 233 aa overlap). Contains PS01066 Uncharacterized protein family UPF0015 signature. SEEMS TO BELONG TO THE UPP SYNTHETASE FAMILY. Protein product from Mb1115 detected using SWATH mass spectrometry. Mb1115 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0P8" /db_xref="InterPro:IPR001441" /db_xref="InterPro:IPR018520" /db_xref="InterPro:IPR036424" /db_xref="UniProtKB/Swiss-Prot:Q7U0P8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99714.1" /translation="MEIIPPRLKEPLYRLYELRLRQGLAASKSDLPRHIAVLCDGNRR WARSAGYDDVSYGYRMGAAKIAEMLRWCHEAGIELATVYLLSTENLQRDPDELAALIE IITDVVEEICAPANHWSVRTVGDLGLIGEEPARRLRGAVESTPEVASFHVNVAVGYGG RREIVDAVRALLSKELANGATAEELVDAVTVEGISENLYTSGQPDPDLVIRTSGEQRL SGFLLWQSAYSEMWFTEAHWPAFRHVDFLRALRDYSARHRRYGR" CDS 1212948..1215272 /codon_start=1 /transl_table=11 /gene="PE_PGRS21" /locus_tag="BQ2027_MB1116" /product="pe-pgrs family protein pe_pgrs21" /note="Mb1116, PE_PGRS21, len: 774 aa. Similar to Rv1087, len: 767 aa, from Mycobacterium tuberculosis strain H37Rv, (96.8% identity in 783 aa overlap). Member of the M. tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to Rv1090|AL021897|MTV017_43 Mycobacterium tuberculosis H37Rv (853 aa), FASTA scores: opt: 2819, E(): 0, (59.8% identity in 860 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1 near C -terminus. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 3 bp (*-gcg) and 45 bp, and deletions of 18 bp and 9 bp (ggtggggcc-*), lead to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (774 aa versus 767 aa). Mb1116 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXN8" /protein_id="SIT99715.1" /translation="MSFVVVAPEVLAAAASDLAGIGSTLAQANAAALAPTTAVLAAGA DEVSAAIASLFGAHGQAYQAVSAQMSAFHAQFMQALTGAGGAYAAAEAVNVSAAQSVE QDLLAAINARFERIFGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTSTTVGMAGGNG GAAGLIGNGGFGGGGGPGAAGGNGGAGGWLFGNGGAGGAGGLGVAPGVPGGAGGAGGA GGVGGPAGLWGHGGAGGAGGAGVAGAGGFEGTIGAGGAGGVGGAGGVGGAGGAGGWLY GDAGAGGDGGVGGAGGTGGLGNRGGAGGAGGAGGVGGAGGAAGLWGGGGAGGVGGTGG GAGLGAQSVTFSSSLSGLSGGDGGAGGAGGAGGAGGTGGWLYGGGGAAGSGGDGGTGG QGGAGGAGVFSLFGSGGGPGGNGGVGGVGGVGGAGGRAGLFGVGGLGGAGGDAGDSGE GGFGGPGLAGGLFGNPGNGGVGGIGGDAAAGGAGGAGGNGGWLFGNGGAGGSGGDGGA AGRGGAGNLGSAGGINAPAGNPGSGSVGIGGAGGAGGTAGLFGDGGAGGAGAAGGFGG ISAATPSAGSEGAMGGAGGVGGNARLLGTGGAGGVGGGGGAGGDGGRGGVATPGGQGG DAGDGGAGGAGGNGGGGASGAGGWLLGTGGAGGAGGNGGNGGKAGFSPGPTNFGLNGA GGGGGVGGNGATGPWLFGDGGAGGGGGAGGIGGDGGPTPGSTGAGAAGGHGGDAQLIG NGGHGGAGGTGVPNGSGGAGGLSGLLFGEPGANG" CDS 1215449..1215769 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1116A" /product="Undecaprenyl diphosphate synthase (EC" /EC_number="2.5.1.31" /note="Mb1116A, -, len: 106 aa. Equivalent to Rv1087A, len: 106 aa (fragment), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 106 aa overlap). Conserved hypothetical protein, highly similar to C-terminus of near ORF O53434|YA86_MYCTU|Rv1086|MT1118|MTV017.39 SHORT (C15) CHAIN Z-ISOPRENYL DIPHOSPHATE SYNTHASE from Mycobacterium tuberculosis (262 aa), FASTA scores: opt: 200, E(): 1.1e-06, (57.9% identity in 76 aa overlap)." /db_xref="GOA:A0A1R3XXB7" /db_xref="InterPro:IPR001441" /db_xref="InterPro:IPR036424" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB7" /protein_id="SIT99716.1" /translation="MPCVGYGDRREFVDAVAVEAICENLNTSGQPDPDLVIRTSGEQR LSGHRGPTGGVSRRRLLRALRDYSTPHASIPYVPPPYRSDGIHASRLAVESVFDALAG RVEL" CDS 1215922..1216356 /codon_start=1 /transl_table=11 /gene="PE9" /locus_tag="BQ2027_MB1117" /product="pe family protein pe9" /note="Mb1117, PE9, len: 144 aa. Equivalent to Rv1088, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 144 aa overlap). Member of Mycobacterium tuberculosis PE family, similar to many others e.g. Z96071|MTCI418B_6 Mycobacterium tuberculosis cosmid (487 aa), FASTA scores: opt: 318, E(): 7.3e-14, (60.9% identity in 87 aa overlap) - except it appears to be frameshifted around codon 84. No error to account for frameshift could be found." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB3" /protein_id="SIT99717.1" /translation="MSYMIATPAALTAAATDIDGIGSAVSVANAAAVAATTGVLAAGG DEVLAAIARLFNANAEEYHALSAQVAAFQTLFVRTLTGGCGVFRRRRGRQCVTAAEHR AAGAGRRQRRRRSGDGQWRLRQQRHFGCGGQPEFRQHSEHRR" CDS 1216373..1216540 /codon_start=1 /transl_table=11 /gene="PE10" /locus_tag="BQ2027_MB1118" /product="pe family protein pe10" /note="Mb1118, PE10, len: 101 aa. Equivalent to Rv1089, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 101 aa overlap). Member of the Mycobacterium tuberculosis PE family of glycine-rich proteins. Partial ORF that appears to be frameshifted, continuation of Rv1088|MTV017.41. Sequence has been checked and appears correct. Similar to Z95555|MTCY06F7_4 Mycobacterium tuberculosis cosmid (401 aa), FASTA scores: opt: 126, E(): 2, (29.6% identity in 125 aa overlap)." /db_xref="GOA:A0A1R3XXB4" /db_xref="InterPro:IPR008965" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB4" /protein_id="SIT99718.1" /translation="MTTASATASSTGVDGGIAATYAVASQWDGGYVANYTITQFGRDF DDRLAVAIHFA" CDS 1216926..1217030 /codon_start=1 /transl_table=11 /gene="celA2a" /locus_tag="BQ2027_MB1119" /product="PROBABLE CELLULASE CELA2A (ENDO-1,4-BETA-GLUCANASE) (ENDOGLUCANASE) (CARBOXYMETHYL CELLULASE)" /note="Mb1119, celA2a, len: 34 aa. Equivalent to Rv1089A, len: 34 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 34 aa overlap). Probable celA2a, first part of cellulase (endoglucanase) (EC 3.2.1.4), similar to N-terminus of others." /db_xref="UniProtKB/TrEMBL:A0A1R3XXC8" /protein_id="SIT99719.1" /translation="MNGAAPTNGAPLSYPSICEGVHWGHLVGGHQPAY" CDS 1217008..1217463 /codon_start=1 /transl_table=11 /gene="celA2b" /locus_tag="BQ2027_MB1120" /product="PROBABLE CELLULASE CELA2B (ENDO-1,4-BETA-GLUCANASE) (ENDOGLUCANASE) (CARBOXYMETHYL CELLULASE)" /note="Mb1120, celA2b, len: 151 aa. Equivalent to Rv1090, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 151 aa overlap). Probable celA2b, second part of cellulase (endoglucanase) (EC 3.2.1.4), similar to C-terminus of others e.g. O08468 cellulase CEL2 from Streptomyces halstedi (377 aa), FASTA scores: opt: 554, E(): 1.2e-30, (52.0% identity in 152 aa overlap); etc. Gene appears to have been inactivated by frameshift mutations but no errors could be found that would account for this. Mb1120 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXB8" /db_xref="InterPro:IPR002594" /db_xref="InterPro:IPR013319" /db_xref="InterPro:IPR013320" /db_xref="UniProtKB/TrEMBL:A0A1R3XXB8" /protein_id="SIT99720.1" /translation="MGTNLPTEVGQILSAPTSIDYNYPTTGVWDASYDICLDSTPKTT GVNQQEIMIWFNHQGSIQPVGSPVGNTTIEGKNFVVWDGSNGMNNAMAYVATEPIEVW SFDVMSFVDHTATMEPITDSWYLTSIRAGLEPWSDGVGLGVDSFSAKVN" CDS 1217878..1220430 /codon_start=1 /transl_table=11 /gene="PE_PGRS22" /locus_tag="BQ2027_MB1121" /product="pe-pgrs family protein pe_pgrs22" /note="Mb1121, PE_PGRS22, len: 850 aa. Equivalent to Rv1091, len: 853 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 853 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to Rv1087|AL021897|MTV017_39 Mycobacterium tuberculosis H37Rv (767 aa), FASTA scores: opt: 2819, E(): 0, (60.0% identity in 860 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp deletion (ccggcggca-*) leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (850 aa versus 853 aa). Mb1121 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC0" /protein_id="SIT99721.1" /translation="MSFVIAAPEALVAVASDLAGIGSALAEANAAALAPTTALLAAGA DEVSAAIAALFGAHGQAYQTVSAQASAFHAQFVQALTGGGGAYAAAEAANVSAAQSTD QRLLDLINGPTQALLGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTSTTAGVAGGNG GAAGLIGNGGAGGGGGAGAAGGNGGAGGWLYGNGGAGGAGGTSVIPGVAGGNGGAGGS AGLWGTGGAGGDGGNGRSGPVNVAGSAGGNGGAGGAAGLFGDAGAGGNGGKGGAGGAA FSINFTAGDGGAGGAGGSGGHALLWGAGGAGGNGGSGGTGGAGGSTAGAGGNGGAGGG GGTGGLLFGNGGAGGHGAAAGNGLAAGNGVSSSGGGGAGGTGGAGGDGGAGGAGGNAR LWGVGGAGGAGGDGGAGGAGGKGGSGLSGNANGGAGGDSGRGGTGGAGGEGGAAGLLV GTGGHGGDGGAGGAAVKGGDGGAAAGTGIAGAGGRGGAGGSGGSGGDGGGGAAGPAGW LFGDGGAGGNGGAAAAGGAGGQAGGGGGNGGNGGNGGNGGNGGNGATGGWLYGNGGAG GQGATAGAGGAGANGVSSTNGGGNGGIGGTGGSGGAGGNAGLLGVGGAGGHGASGGAG DRGGAGGTGFISSDGGAGGDGGDGGNGGAGGTGGLLFGAGGNGGPGGSGGAADIGGNG GAGNGGGTDGNGGNGGSGGGAGSGGDGGGAGGNGAWLFGNGGAGGGGGKGGNGAGGGL GGGSFGLPGLNGSGGDGGDGGNGAPGGVLYGNGGAGGQGSSGGIGGPGATGGAGGKGG DGGDAQLIGDGGNGGNGGAGGTGGTPGPGGPGGSGGLGGLLFGQTGTAGVSP" CDS complement(1220648..1221586) /codon_start=1 /transl_table=11 /gene="coaA" /locus_tag="BQ2027_MB1122C" /product="Probable pantothenate kinase coaA (Pantothenic acid kinase)" /note="Mb1122c, coaA, len: 312 aa. Equivalent to Rv1092c, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 312 aa overlap). Probable coaA, pantothenate kinase (EC 2.7.1.33), similar to many e.g. P15044|COAA_ECOLI Escherichia coli (316 aa), FASTA scores :opt: 1079, E(): 0, (52.7% identity in 311 aa overlap). Equivalent to AL049491|MLCB1222_17 Mycobacterium leprae (312 aa) (93.6% identity in 312 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.912. BELONGS TO THE PANTOTHENATE KINASE FAMILY. Protein product from Mb1122c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1122c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63811" /db_xref="InterPro:IPR004566" /db_xref="InterPro:IPR006083" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63811" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99722.1" /translation="MSRLSEPSPYVEFDRRQWRALRMSTPLALTEEELVGLRGLGEQI DLLEVEEVYLPLARLIHLQVAARQRLFAATAEFLGEPQQNPDRPVPFIIGVAGSVAVG KSTTARVLQALLARWDHHPRVDLVTTDGFLYPNAELQRRNLMHRKGFPESYNRRALMR FVTSVKSGSDYACAPVYSHLHYDIIPGAEQVVRHPDILILEGLNVLQTGPTLMVSDLF DFSLYVDARIEDIEQWYVSRFLAMRTTAFADPESHFHHYAAFSDSQAVVAAREIWRTI NRPNLVENILPTRPRATLVLRKDADHSINRLRLRKL" CDS 1221974..1223254 /codon_start=1 /transl_table=11 /gene="glyA1" /locus_tag="BQ2027_MB1123" /standard_name="glyA" /product="serine hydroxymethyltransferase 1 glya1" /note="Mb1123, glyA1, len: 426 aa. Equivalent to Rv1093, len: 426 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 426 aa overlap). Probable glyA1, serine hydroxymethyltransferase 1 (EC 2.1.2.1), equivalent to AL049491|MLCB1222_16 from Mycobacterium leprae (426 aa), FASTA score: (89.9 % identity in 426 aa overlap). Also similar to many e.g. P34895|GLYA_HYPME HYPHOMICROBIUM METHYLOVORUM (434 aa), FASTA scores: opt: 1492, E(): 0, (56.8% identity in 419 aa overlap); etc. BELONGS TO THE SHMT FAMILY. Note that previously known as glyA. Protein product from Mb1123 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1123 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59953" /db_xref="InterPro:IPR001085" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR019798" /db_xref="InterPro:IPR039429" /db_xref="UniProtKB/Swiss-Prot:P59953" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99723.1" /translation="MSAPLAEVDPDIAELLAKELGRQRDTLEMIASENFAPRAVLQAQ GSVLTNKYAEGLPGRRYYGGCEHVDVVENLARDRAKALFGAEFANVQPHSGAQANAAV LHALMSPGERLLGLDLANGGHLTHGMRLNFSGKLYENGFYGVDPATHLIDMDAVRATA LEFRPKVIIAGWSAYPRVLDFAAFRSIADEVGAKLLVDMAHFAGLVAAGLHPSPVPHA DVVSTTVHKTLGGGRSGLIVGKQQYAKAINSAVFPGQQGGPLMHVIAGKAVALKIAAT PEFADRQRRTLSGARIIADRLMAPDVAKAGVSVVSGGTDVHLVLVDLRDSPLDGQAAE DLLHEVGITVNRNAVPNDPRPPMVTSGLRIGTPALATRGFGDTEFTEVADIIATALAT GSSVDVSALKDRATRLARAFPLYDGLEEWSLVGR" CDS 1223359..1224186 /codon_start=1 /transl_table=11 /gene="desA2" /locus_tag="BQ2027_MB1124" /product="POSSIBLE ACYL-[ACYL-CARRIER PROTEIN] DESATURASE DESA2 (ACYL-[ACP] DESATURASE) (STEAROYL-ACP DESATURASE)" /note="Mb1124, desA2, len: 275 aa. Equivalent to Rv1094, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 275 aa overlap). Possible desA2, acyl-[acyl-carrier protein] desaturase (stearoyl-ACP desaturase) (EC 1.14.19.2), equivalent to AL049491|MLCB1222_15 from Mycobacterium leprae (275 aa), FASTA score: (78.1% identity in 274 aa overlap). Also weakly similar to plant stearoyl-acyl carrier protein desaturases, and very similar to U49839|MTV043.16C|Rv0824c enzyme desA1 from Mycobacterium tuberculosis (338 aa), FASTA scores: opt: 525, E(): 8.5e-30, (32.2% identity in 270 aa overlap); and to U15182|MLU15182_32 acyl-carrier protein desaturase precursor from Mycobacterium leprae (338 aa), FASTA scores: opt: 506, E(): 1.9e-28, (34.1% identity in 261 aa overlap). TBparse score is 0.894. Protein product from Mb1124 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1124 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXD5" /db_xref="InterPro:IPR005067" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012348" /db_xref="UniProtKB/TrEMBL:A0A1R3XXD5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99724.1" /translation="MAQKPVADALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGE NFAFLGGRDWDPSQSTLPRTITDACEILLILKDNLAGHHRELVEHFILEDWWGRWLGR WTAEEHLHAIALREYLVVTREVDPVANEDVRVQHVMKGYRAEKYTQVETLVYMAFYER CGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLVTHCLDYTRDETIAAIAAR AADLDVLGADIEAYRDKLQNVADAGIFGKPQLRQLISDRITAWGLAGEPSLKQFVTG" CDS 1224397..1225698 /codon_start=1 /transl_table=11 /gene="phoH2" /locus_tag="BQ2027_MB1125" /product="PROBABLE PHOH-LIKE PROTEIN PHOH2 (PHOSPHATE STARVATION-INDUCIBLE PROTEIN PSIH)" /note="Mb1125, phoH2, len: 433 aa. Equivalent to Rv1095, len: 433 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 433 aa overlap). Probable phoH2, phoH-like protein (phosphate starvation-induced protein), probably ATP-binding protein. Equivalent to AL049491 MLCB1222_14 Mycobacterium leprae (433 aa) (92.8% identity in 432 aa overlap). Similar to many proteins described as PhoH-like e.g. Z97025|BSZ97025_12 Bacillus subtilis (442 aa), FASTA scores: opt: 605, E(): 0, (40.1% identity in 444 aa overlap); or Mycobacterium tuberculosis Rv2368c|O05830|PHOL_MYCTU Mycobacterium tuberculosis (352 aa), FASTA scores: opt: 390, E(): 4e-19, (31.5% identity in 241 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE PHOH FAMILY. Protein product from Mb1125 detected using SWATH mass spectrometry. Mb1125 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXP8" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR003714" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XXP8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99725.1" /translation="MTDTRTYVLDTSVLLSDPWACSRFAEHDVVVPLVVISELEAKRH HHELGWFARQALRLFDDLRLEHGRLDQPIPVGTQGGTLHVELNHTDPAVLPAGFRTDS NDSRILSCAANLAAEGKRVTLVSKDIPLRVKAAAVGLAADEYHAQDVVVSGWSGMHEL ETASADIDALFADGEIDLVEARDLPCHTGIRLLGGGSHALGRVNAHKRVQLVRGDREA FGLRGRSAEQRVALDLLLDESVGIVSLGGKAGTGKSALALCAGLEAVLERRTHRKVVV FRPLYAVGGQELGYLPGSESEKMGPWAQAVFDTLEGLASPAVLEEVLSRGMLEVLPLT HIRGRSLHDSFVIVDEAQSLERNVLLTVLSRLGTGSRVVLTHDIAQRDNLRVGRHDGV AAVIEKLKGHPLFAHITLLRSERSPIAALVTEMLEEITGPR" CDS 1225785..1226660 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1126" /product="POSSIBLE GLYCOSYL HYDROLASE" /note="Mb1126, -, len: 291 aa. Equivalent to Rv1096, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 291 aa overlap). Possible glycosyl hydrolase (EC 3.-.-.-), possibly a deacetylase or esterase. Equivalent to AL049491|MLCB1222_13 Mycobacterium leprae (291 aa) (81.3% identity in 289 aa overlap). Similar at the C-terminus of enzymes involved in carbohydrate degradation including Z99110|BSUB0007_92 endo-1,4-beta-xylanase homolog yjeA from Bacillus subtilis (467 aa), FASTA scores: opt: 418, E(): 2.6e-17, (38.6% identity in 184 aa overlap); M64552|STMXLNB_2 acetyl-xylan esterase from Streptomyces lividans (335 aa), FASTA scores: opt: 371, E(): 1.1e-14, (31.6% identity in 237 aa overlap); NP_345933.1|NC_003028 peptidoglycan N-acetylglucosamine deacetylase A from Streptococcus pneumoniae (463 aa); etc. Has possible N-terminal signal sequence with TMhelix at aa 13-31. Protein product from Mb1126 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1126 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXC7" /db_xref="InterPro:IPR002509" /db_xref="InterPro:IPR011330" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC7" /protein_id="SIT99726.1" /translation="MPKRPDNQTWRYWRTVTGVVVAGAVLVVGGLSGRVTRAENLSCS VIKCVALTFDDGPGPYTDRLLHILTDNDAKATFFLIGNKVAANPAGARRIADAGMEIG SHTWEHPNMTTIPPEDIPGQFSRANDVIAAATGRTPTLYRPAGGLSNDAVRQAAAKVG QAEILWDVIPFDWINDSNTAATRHMLMTQIKPGSVVLFHDTYSSTVDVVYQFIPVLKA NGYRLVTVSELLGPRAPGSSYGSRENGPPVNELRDIPASEIPPLPNTSSPKPMSNFPI TDIAGQNSGGPNNGA" CDS complement(1226663..1227544) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1127C" /product="PROBABLE MEMBRANE GLYCINE AND PROLINE RICH PROTEIN" /note="Mb1127c, -, len: 293 aa. Equivalent to Rv1097c, len: 293 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 293 aa overlap). Probable membrane Gly-, Pro-rich protein, similar to Mycobacterium tuberculosis Rv2507|MTCY07A7. 13|Z95556 (273 aa), FASTA scores: opt: 219, E(): 0.023, (30.5% identity in 266 aa overlap); and Rv2507. Contains potential membrane spanning region (aa ~68-92). TBparse score is 0.912. Protein product from Mb1127c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1127c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXC2" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99727.1" /translation="MTVPPAGPYGNYPYGPNTYGQDLYWGGQPQGGSYPPAYPPQQYP PGWPAGPYPPGPPPPGPGSKTPWLILAGLAVLGVILLVVILVIGLRGDNKSTTATSPA TSAPTSQPFSQQTATGCTPNVSGGVQPIGDSISAGKLSFPTSAAPGWSAFSDDQNPNL IDAVGVGHEVAGADQWMMQAEVAITNFVTTMDVAAQASKLMQCVADGPGYAGSSPTLG PTKTSSITVDGVRAARVDADITIADSSRNVKGDSVTIIAVDTKPVTVFLGATPIGDAT SRATVERVIEALKVNKS" CDS complement(1227541..1228965) /codon_start=1 /transl_table=11 /gene="fum" /locus_tag="BQ2027_MB1128C" /product="PROBABLE FUMARASE FUM (Fumarate hydratase)" /note="Mb1128c, fum, len: 474 aa. Equivalent to Rv1098c, len: 474 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 474 aa overlap). Probable fum, fumarase (EC 4.2.1.2). Equivalent to AL049491|MLCB1222_11 Mycobacterium leprae (474 aa) (89.5 % identity in 467 aa overlap). Similar to many e.g. P14408|FUMH_RAT FUMARATE HYDRATASE, MITOCHONDRIAL PRECURSOR from Rattus norvegicus (507 aa), FASTA scores: opt: 1427, E(): 0, (52.3% identity in 461 aa overlap); and P05042|FUMC_ECOLI Fumarate hydratase class II from Escherichia coli (467 aa), FASTA scores: opt: 1355, E(): 0, (50.2% identity in 444 aa overlap). Contains PS00163 Fumarate lyases signature. TBparse score is 0.886. Protein product from Mb1128c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1128c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0N6" /db_xref="InterPro:IPR000362" /db_xref="InterPro:IPR005677" /db_xref="InterPro:IPR008948" /db_xref="InterPro:IPR018951" /db_xref="InterPro:IPR020557" /db_xref="InterPro:IPR022761" /db_xref="InterPro:IPR024083" /db_xref="UniProtKB/Swiss-Prot:Q7U0N6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99728.1" /translation="MAVDADSANYRIEHDTMGEVRVPAKALWRAQTQRAVENFPISGR GLERTQIRALGLLKGACAQVNSDLGLLAPEKADAIIAAAAEIADGQHDDQFPIDVFQT GSGTSSNMNTNEVIASIAAKGGVTLHPNDDVNMSQSSNDTFPTATHIAATEAAVAHLI PALQQLHDALAAKALDWHTVVKSGRTHLMDAVPVTLGQEFSGYARQIEAGIERVRACL PRLGELAIGGTAVGTGLNAPDDFGVRVVAVLVAQTGLSELRTAANSFEAQAARDGLVE ASGALRTIAVSLTKIANDIRWMGSGPLTGLAEIQLPDLQPGSSIMPGKVNPVLPEAVT QVAAQVIGNDAAIAWGGANGAFELNVYIPMMARNILESFKLLTNVSRLFAQRCIAGLT ANVEHLRRLAESSPSIVTPLNSAIGYEEAAAVAKQALKERKTIRQTVIDRGLIGDRLS IEDLDRRLDVLAMAKAEQLDSDRL" CDS complement(1228996..1230084) /codon_start=1 /transl_table=11 /gene="glpx" /locus_tag="BQ2027_MB1129C" /product="fructose 1,6-bisphosphatase glpx" /note="Mb1129c, -, len: 328 aa. Equivalent to Rv1099c, len: 328 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 328 aa overlap). Conserved hypothetical protein, highly similar to P44811|GLPX_HAEIN GLPX PROTEIN HOMOLOG (believed to be involved in glycerol metabolism) (333 aa), FASTA scores: opt: 763, E():0, (46.2% identity in 327 aa overlap); and Q03224|YWJI_BACSU hypothetical protein from Bacillus subtilis (321aa), FASTA scores: opt: 1092, E(): 0, (52.1% identity in 313 aa overlap). Equivalent to AL049491|MLCB1222_10 Mycobacterium leprae (355 aa), (93.0% identity in 328 aa overlap). Protein product from Mb1129c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1129c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U0N5" /db_xref="InterPro:IPR004464" /db_xref="UniProtKB/Swiss-Prot:Q7U0N5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99729.1" /translation="MTAEGSGSSTAAVASHDPSHTRPSRREAPDRNLAMELVRVTEAG AMAAGRWVGRGDKEGGDGAAVDAMRELVNSVSMRGVVVIGEGEKDHAPMLYNGEEVGN GDGPECDFAVDPIDGTTLMSKGMTNAISVLAVADRGTMFDPSAVFYMNKIAVGPDAAH VLDITAPISENIRAVAKVKDLSVRDMTVCILDRPRHAQLIHDVRATGARIRLITDGDV AGAISACRPHSGTDLLAGIGGTPEGIIAAAAIRCMGGAIQAQLAPRDDAERRKALEAG YDLNQVLTTEDLVSGENVFFCATGVTDGDLLKGVRYYPGGCTTHSIVMRSKSGTVRMI EAYHRLSKLNEYSAIDFTGDSSAVYPLP" CDS 1230083..1230784 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1130" /product="conserved protein" /note="Mb1130, -, len: 233 aa. Equivalent to Rv1100, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 233 aa overlap). Conserved hypothetical protein, slightly similar to Rv1906c|MTCY180.12 hypothetical protein from Mycobacterium tuberculosis (156 aa), FASTA scores: opt: 122, E(): 6.9, (27.4% identity in 135 aa overlap). Equivalent to AL049491|MLCB1222_9 Mycobacterium leprae (257 aa) (63.8% identity in 257 aa overlap). Protein product from Mb1130 detected using shotgun mass spectrometry. Mb1130 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXC6" /db_xref="InterPro:IPR025339" /db_xref="UniProtKB/TrEMBL:A0A1R3XXC6" /protein_id="SIT99730.1" /translation="MVGDCPRSRTVRWSWDTGHVTAEPQPTPRPAKPRLLQDGRDMFW SLAPLVVGCILLAGLVGMCSFQLGGTKRGPIPSYDAAQALRADAKTLGFPIRLPQLPG GWTPNSGGRGGIENGRADPATGQRRNAATSIVGFISPTGRYLSLTQSNADEDKLVGSI HPSMYPTGTVDVGGTRWVVYEGSDENGAVEPVWTTRLTGPGGATQLAITGAGSIDQFR TLASATQSQPPLPAR" CDS complement(1230791..1231819) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1131C" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1131c, -, len: 342 aa. Equivalent to 3' end of Rv1101c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 342 aa overlap). Conserved membrane protein, shows some similarity to other bacterial proteins e.g. P77406|PERM_ECOLI PUTATIVE PERMEASE PERM from Escherichia coli (353 aa), FASTA scores: opt: 287, E(): 8.8e-12, (24.9% identity in 349 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (t-*) leads to a shorter product with a different amino part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb1131c detected using SWATH mass spectrometry. Mb1131c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXD0" /db_xref="InterPro:IPR002549" /db_xref="UniProtKB/TrEMBL:A0A1R3XXD0" /protein_id="SIT99731.1" /translation="MFTPLFKWFTKRFNTGLSAACTLLSALAAVVVPVGALVGLAIVQ IARMVDSVADWVRTTDLSTLGDKILQFVNGLFDRVPFLHITVTADALRKAMISVAQNV GEWLLHFLRDAAGSLAGVITSAIIFVYVFVALLVNREKLRTLIGQLNPLGEDVTDLYL QKMGSMVRGTVNGQFVIAACQGVAGAASIYIAGFHHGFFIFAIVLTALSIIPLGGGIV TIPFGIGMIFYGNIAGGIFVLLWHLLVVTNIDNVLRPILVPRDARLNSALMLLSVFAG ITMFGPWGIIIGPVLMILIVTTIDVYLAVYKGVELEQFDAPPVRRRWLPRRGPATSRN APPPSTAE" CDS complement(1232059..1232370) /codon_start=1 /transl_table=11 /gene="mazf3" /locus_tag="BQ2027_MB1132C" /product="toxin mazf3" /note="Mb1132c, -, len: 103 aa. Equivalent to Rv1102c, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 103 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protens e.g. Rv1942c|MTCY9F9_22 (109 aa), FASTA scores: opt: 158, E(): 3.6e-05, (33.3% identity in 93 aa overlap); Rv0659c|MTCI376_17 (102aa), opt: 140, E(): 0.00072, (40.6% identity in 69aa overlap); and Rv1495. Mb1132c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZG6" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3XZG6" /protein_id="SIT99732.1" /translation="MRPIHIAQLDKARPVLILTREVVRPHLTNVTVAPITTTVRGLAT EVPVDAVNGLNQPSVVSCDNIQTIPVCDLGRQIGYLLASQEPALAEAIGNAFDLDWVV A" CDS complement(1232370..1232690) /codon_start=1 /transl_table=11 /gene="maze3" /locus_tag="BQ2027_MB1133C" /product="possible antitoxin maze3" /note="Mb1133c, -, len: 106 aa. Equivalent to Rv1103c, len: 106 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 106 aa overlap). Conserved hypothetical protein, similar to part of Mycobacterium tuberculosis hypothetical protein Rv2472|AL021246|MTV008_27 Mycobacterium tuberculosis (97 aa), FASTA score: opt: 135, E(): 0.0091, (45.8% identity in 72 aa overlap). TBparse score is 0.916. Protein product from Mb1133c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1133c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYC0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99733.1" /translation="MYLPWGVVLAGGANGFGAGAYQTGTICEVSTQIAVRLPDEIVAF IDDEVRGQHARSRAAVVLRALERERRRRLAERDAEILATNTSATGDLDTLAGHCARTA LDID" CDS 1232700..1233389 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1134" /product="POSSIBLE PARA-NITROBENZYL ESTERASE (FRAGMENT)" /note="Mb1134, -, len: 229 aa. Equivalent to Rv1104, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 229 aa overlap). Possible para-nitrobenzyl esterase (fragment; possibly first part) (EC 3.1.1.-). Similar to the N-terminal domain of many e.g. P37967|PNBA_BACSU Bacillus subtilis (489 aa), FASTA scores: opt: 715, E(): 0, (53.4% identity in 191 aa overlap). Gene may be inactivated as a frameshift is required to obtain a product continuing in MTV017.58|Rv1105. Mb1134 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002018" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE4" /protein_id="SIT99734.1" /translation="MVVDSCVAESRYGPVRGADDGRVKVWKGIRYAAPPLGDLRFRTP EPPERWTEVADATTFGPACPQPAIPNMPLDLGASQSEDCWSLNIWAPADTEPGDGKPV MVWLHGGAYILGSGSQPLYNGRRLAASGDVVVVTVNYRLGALGFLDLSSFNTSRRRFD SNIGLRDVLAVLRWVADNIAVFGGDPEKVTLFGESARESSRPCSPPRRPRVCSRRRSP RAHRRHRSTTR" CDS 1233710..1234225 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1135" /product="POSSIBLE PARA-NITROBENZYL ESTERASE (FRAGMENT)" /note="Mb1135, -, len: 171 aa. Equivalent to Rv1105, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 171 aa overlap). Possible para-nitrobenzyl esterase (fragment; possibly second part) (EC 3.1.1.-). Similar to C-terminal domain of many e.g. P71048 PARA-NITROBENZYL ESTERASE from Bacillus subtilis (489 aa), FASTA scores: opt: 248, E(): 2.7e-10, (32.3% identity in 167 aa overlap). Gene may be inactivated as a frameshift is required to obtain a product continuing from MTV017.57|Rv1104. Start changed since first submission." /db_xref="InterPro:IPR002018" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ8" /protein_id="SIT99735.1" /translation="MFTQIAAEQPDLQVPTEEQIGSAYSRWRRKARSLSMATDVGFRM PSVWLAEGHSGVAPVYLYRFDYSTPLLKLLLVRAAHATELPYVWGNLGGSQDPALKLG DAKAAIAVSRRVRTRWINFATRGKPTGPDGEPDWPCYEEAHRACLIIGRRDAVVHDVD AHIRATWGSKW" CDS complement(1234243..1235355) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1136C" /product="3-beta-hydroxysteroid dehydrogenase" /note="Mb1136c, -, len: 370 aa. Equivalent to Rv1106c, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 370 aa overlap). Probable cholesterol dehydrogenase (EC 1.1.1.-). Equivalent to AL049491|MLCB1222_7 Mycobacterium leprae (376 aa) (75.5% identity in 375 aa overlap). Highly similar to Q03704 NAD(P)-dependent cholesterol dehydrogenase from Nocardia sp. (364 aa), FASTA scores: opt: 1789, E(): 0, (74.5% identity in 361 aa overlap). Also similar to U32426|MCU32426_1 3-beta-hydroxy-Delta5-steroid dehydrogenase from Molluscum contagiosum virus (354 aa), FASTA scores: opt: 432, E(): 1.7e-22, (34.6% identity in 347 aa overlap). Also similar to series of Mycobacterium tuberculosis hypothetical proteins described as sugar epimerases or dehydratases e.g. Rv3634c, Rv3784, Rv3464, etc. Protein product from Mb1136c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1136c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXD8" /db_xref="InterPro:IPR002225" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XXD8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99736.1" /translation="MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSF DRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTIFHTAAIIELMGGASVTDEYRQR SFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSA RLDNSYVHNLIHGFILAAAHLVPDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWP KMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYFSIAKARRDLGYEPL FTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP" CDS complement(1235365..1235622) /codon_start=1 /transl_table=11 /gene="xseB" /locus_tag="BQ2027_MB1137C" /product="Probable exodeoxyribonuclease VII (small subunit) xseB (Exonuclease VII small subunit)" /note="Mb1137c, xseB, len: 85 aa. Equivalent to Rv1107c, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 85 aa overlap). Probable xseB, exonuclease VII small subunit (EC 3.1.11.6). Equivalent to AL049491|MLCB1222_6 Mycobacterium leprae (87 aa) (77.9% identity in 68 aa overlap). Similar to P43914|EX7S_HAEIN EXODEOXYRIBONUCLEASE SMALL SUBUNIT from H. influenzae (84 aa), FASTA scores: opt: 126, E(): 0.006, (37.3% identity in 67 aa overlap); and P22938|EX7S_ECOLI EXODEOXYRIBONUCLEASE SMALL SUBUNIT from Escherichia coli (79 aa), FASTA scores: opt: 125, E(): 0.0067, (39.7% identity in 58 aa overlap). BELONGS TO THE XSEB FAMILY. Protein product from Mb1137c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1137c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67457" /db_xref="InterPro:IPR003761" /db_xref="InterPro:IPR037004" /db_xref="UniProtKB/Swiss-Prot:P67457" /protein_id="SIT99737.1" /translation="MVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLD LDASLRLWERGEQLAKRCEEHLAGARQRVSDVLAGDEAQNG" CDS complement(1235612..1236859) /codon_start=1 /transl_table=11 /gene="xseA" /locus_tag="BQ2027_MB1138C" /product="PROBABLE EXODEOXYRIBONUCLEASE VII (LARGE SUBUNIT) XSEA (EXONUCLEASE VII LARGE SUBUNIT)" /note="Mb1138c, xseA, len: 415 aa. Equivalent to Rv1108c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 415 aa overlap). Probable xseA, exodeoxyribonuclease VII large subunit (EC 3.1.11.6) (see first citation below). Equivalent to AL049491|MLCB1222_5 Mycobacterium leprae (428 aa) (81.5% identity in 411 aa overlap). Similar to many e.g. P04994|EX7L_ECOLI exodeoxyribonuclease large subunit from Escherichia coli (456 aa), FASTA scores: opt: 581, E(): 1.6 e-30, (30.8% identity in 425 aa overlap); also similar to the exodeoxyribonuclease in Bacillus subtilis, H. influenzae and H. pylori. TBparse score is 0.890. BELONGS TO THE XSEA FAMILY. Protein product from Mb1138c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1138c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67448" /db_xref="InterPro:IPR003753" /db_xref="InterPro:IPR020579" /db_xref="InterPro:IPR025824" /db_xref="UniProtKB/Swiss-Prot:P67448" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99738.1" /translation="MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDA KTVFMVLRDPAADMSLTVTCSRDLVLSAPVKLAEGVQVVVCGKPSFYTGRGTFSLRLS EIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGRASAAERDVTT VASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDVDVIVLARGGGSVEDLLPF SDETLCRAIAACRTPVVSAVGHEPDNPLCDLVVDLRAATPTDAAKKVVPDTAAEQRLI DDLRRRSAQALRNWVSREQRAVAQLRSRPVLADPMTMVSVRAEEVHRARSTLRRNLTL MVAAETERIGHLAARLATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEG TKLRVRVADGALAAVSEGQTNGL" CDS complement(1236856..1237494) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1139C" /product="conserved protein" /note="Mb1139c, -, len: 212 aa. Equivalent to Rv1109c, len: 212 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 212 aa overlap). Conserved hypothetical protein, N-terminal domain is hydrophobic, C-terminal half is very rich in Arg. Equivalent to AL049491|MLCB1222_2 hypothetical protein from Mycobacterium leprae (379 aa) (46.0% identity in 374 aa overlap). Start changed since first submission. TBparse score is 0.934. Protein product from Mb1139c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1139c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXE9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99739.1" /translation="MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVV MRFQQGLAELVIKGDNTLETLFPPKDEKPEWATFDEDLPDALEGTSIPLLGLSDASEA KNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPTVPTPAVAAEL DYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQTLLANRITRATAK" CDS 1237584..1238591 /codon_start=1 /transl_table=11 /gene="lytB2" /locus_tag="BQ2027_MB1140" /product="PROBABLE LYTB-RELATED PROTEIN LYTB2" /note="Mb1140, lytB2, len: 335 aa. Equivalent to Rv1110, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 335 aa overlap). Probable lytB2, LytB-related protein, equivalent to AL049491|MLCB1222_3 from Mycobacterium leprae (335 aa), FASTA score: (82.9% identity in 333 aa overlap). Also similar to LytB proteins from many bacteria (appears to have N-terminal extension) e.g. P22565|LYTB_ECOLI|B0029|Z0034|ECS0032 LYTB PROTEIN from Escherichia coli strains K12 and O157:H7 (316 aa), FASTA scores: opt: 1041, E():0, (52.4% identity in 309 aa overlap); etc. Also very similar to another LytB-related protein from Mycobacterium tuberculosis: LytB1|Rv3382c|MTV004.40c (329 aa), FASTA scores: opt: 975, E(): 0, (51.3% identity in 312 aa overlap). BELONGS TO THE LYTB FAMILY. Protein product from Mb1140 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1140 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5I1" /db_xref="InterPro:IPR003451" /db_xref="UniProtKB/Swiss-Prot:P0A5I1" /protein_id="SIT99740.1" /translation="MVPTVDMGIPGASVSSRSVADRPNRKRVLLAEPRGYCAGVDRAV ETVERALQKHGPPVYVRHEIVHNRHVVDTLAKAGAVFVEETEQVPEGAIVVFSAHGVA PTVHVSASERNLQVIDATCPLVTKVHNEARRFARDDYDILLIGHEGHEEVVGTAGEAP DHVQLVDGVDAVDQVTVRDEDKVVWLSQTTLSVDETMEIVGRLRRRFPKLQDPPSDDI CYATQNRQVAVKAMAPECELVIVVGSRNSSNSVRLVEVALGAGARAAHLVDWADDIDS AWLDGVTTVGVTSGASVPEVLVRGVLERLAECGYDIVQPVTTANETLVFALPRELRSP R" CDS complement(1238608..1239591) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1141C" /product="Rhomboid family protein" /note="Mb1141c, -, len: 327 aa. Equivalent to Rv1111c, len: 327 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 327 aa overlap). Conserved hypothetical protein, N-terminal domain is hydrophobic, C-terminal half is very rich in Arg. Equivalent to AL049491|MLCB1222_2 hypothetical protein from Mycobacterium leprae (379 aa) (46.0% identity in 374 aa overlap). Start changed since first submission. Protein product from Mb1141c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1141c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXE0" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE0" /protein_id="SIT99741.1" /translation="MSAQRARSAVQASHRSIHPHIPGVPWWAAILIAVTATAIGYAID AGSGHKALTLVFTGCYIAGCVGAVLAVRQSDLFTALVQPPLILFCAVPGAYWLFHGGT IGKFKDLLINCGYSLIERFPLMLGTAAGVLLIGLVRWYLGTALFDSIARKLSSLMTGD SDDDGGRRSAQRPARTRSRHARPPSEDNREPIAERRSRRRPRPQNDPHPRRNAHERPA PRSSRFDSYRSYQPSEPSGPAEPVNRYERRGARYQPYARYEPTYEPQRRRARPSEPTN PTHHPISQVRYRGSATRDARRDNYREEQRFDRRDRSRAPRRPPAESWEYDV" CDS 1239654..1240727 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1142" /product="Probable GTP binding protein" /note="Mb1142, -, len: 357 aa. Equivalent to Rv1112, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 357 aa overlap). Probable GTP binding protein, similar to YCHF_HAEIN|P44681 probable gtp-binding protein (362 aa), FASTA scores: opt: 1189, E(): 0, (52.7% identity in 357 aa overlap). Equivalent to AL049491|MLCB1222_1 hypothetical protein from Mycobacterium leprae (356 aa) (85.9% identity in 354 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb1142 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZH6" /db_xref="InterPro:IPR004396" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR012676" /db_xref="InterPro:IPR013029" /db_xref="InterPro:IPR023192" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031167" /db_xref="InterPro:IPR041706" /db_xref="UniProtKB/TrEMBL:A0A1R3XZH6" /protein_id="SIT99742.1" /translation="MSLSLGIVGLPNVGKSTLFNALTRNNVVAANYPFATIEPNEGVV SLPDPRLDKLAELFGSQRVVPAPVTFVDIAGLIKGASEGAGLGNKFLAHIRECDAICQ VVRVFVDDDVTHVTGRVDPQSDIEVVETELILADLQTLERATGRLEKEARTNKARKPV YDAALRAQQVLDAGKTLFAAGVDAAALRELNLLTTKPFLYVFNADEAVLTDPARVGEL RALVAPADAVFLDAAIESELTELDDESAAELLESIGQSERGLDALARAGFHTLKLQTF LTAGPKEARAWTIHQGDTAPKAAGVIHSDFEKGFIKAEIVSYDDLVAAGSMAAAKAAG KVRIEGKDYVMADGDVVEFRFNV" CDS 1240815..1241012 /codon_start=1 /transl_table=11 /gene="vapb32" /locus_tag="BQ2027_MB1143" /product="possible antitoxin vapb32" /note="Mb1143, -, len: 65 aa. Equivalent to Rv1113, len: 65 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 65 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv2758c|AL00896 7|MTV002.23 (88 aa) FASTA scores: opt: 97, E(): 0.86, (33.3% identity in 69 aa overlap). Part of family including Rv2871, Rv1241, Rv2132, Rv3321c, etc. Protein product from Mb1143 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1143 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3XYD1" /protein_id="SIT99743.1" /translation="MRTTVTVDDALLAKAAELTGVKEKSTLLREGLQTLVRVESARRL AALGGTDPQATAAPRRRTSPR" CDS 1241009..1241383 /codon_start=1 /transl_table=11 /gene="vapc32" /locus_tag="BQ2027_MB1144" /product="possible toxin vapc32. contains pin domain." /note="Mb1144, -, len: 124 aa. Equivalent to Rv1114, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 124 aa overlap). Conserved hypothetical protein, slight similarity to Mycobacterium tuberculosis hypothetical proteins MTCY159.08c (33.0% identity in 115 aa overlap); Rv1561 and Rv2010. Protein product from Mb1144 detected using shotgun mass spectrometry. Mb1144 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXF6" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF6" /protein_id="SIT99744.1" /translation="MILVDTSVWIEHLRAADARLVELLGDDEAGCHPLVIEELALGSI KQRDVVLDLLANLYQFPVVTHDEVLRLVGRRRLWGRGLGAVDANLLGSVALVGGARLW TRDKRLKAACAESGVALAEEVS" CDS 1241586..1242284 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1145" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb1145, -, len: 232 aa. Equivalent to Rv1115, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 232 aa overlap). Possible exported protein, contains possible N-terminal signal sequence. Mb1145 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXR8" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR8" /protein_id="SIT99745.1" /translation="MISTTRIDFLWILSVAFAPMIALATLLTLINQVVGTPYIPGGDS PAGTDCSELASWVSNAATARPVFGDRFNTGNEEAALAARGFQQGTAPNALVIGWNGHH TAVTLPDGTPVSSGEGGGVRVGGGGAYQPKFTHHMYLPMDVDAGEDQPPAPDEPVTAV DDVEPEMPAPCPTQRPPVTPRHNLCNKLRTMPGALSAALAAAAPVWPAPISGCRGFST SLLAKRNHPVIVGK" CDS 1242402..1242587 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1146" /product="HYPOTHETICAL PROTEIN" /note="Mb1146, -, len: 61 aa. Equivalent to Rv1116, len: 61 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 61 aa overlap). Hypothetical unknown protein. Mb1146 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XXE7" /protein_id="SIT99746.1" /translation="MCSRMADEPRLEAGAHPFEEGRDKAPELRATQMDHVRFTEGRRE RNRDRLERSQQFRQPGR" CDS complement(1242514..1242789) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1147C" /product="PE family protein" /note="Mb1147c, -, len: 91 aa. Equivalent to Rv1116A, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 91 aa overlap). Conserved hypothetical protein (possibly gene fragment), similar to C-terminal part of Rv1646|Z85982_9 from Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 301, E(): 9.3e-13, (68.05% identity in 72 aa overlap). Also overlaps gene on other strand, Rv1116, at 3'-end. Mb1147c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXE2" /protein_id="SIT99747.1" /translation="MGALGTVRGLQDSNTAFVGALHSGNLLGATGAVLQAPGNAVNGF LFGQTSISQSIDVSPEYGYELVAVSDPVGGTAGSARAGHGYVHADLR" CDS 1243032..1243355 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1148" /product="conserved protein" /note="Mb1148, -, len: 107 aa. Equivalent to Rv1117, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 107 aa overlap). Conserved hypothetical protein, some similarity to P94425|D50453 hypothetical protein from Bacillus subtilis (95 aa), fasta scores: opt: 128, E(): 5.1e-06, (28.3% identity in 92 aa overlap); and AL117322|SCF1.02 Streptomyces coelicolor (109 aa), FASTA scores: opt: 437, E(): 1.6e-25, (57.5% identity in 106 aa overlap). Protein product from Mb1148 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1148 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007138" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE3" /protein_id="SIT99748.1" /translation="MIFIVVKFETKPEWTERWPDLVASFTAATRAEEGNLWFEWSRSL DDPAEYVLVESFRDGEAGGVHVNSDHFRQAMRELPKALASTPKIISQTIDATGWSAMG EMTVG" CDS complement(1243370..1244230) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1149C" /product="conserved protein" /note="Mb1149c, -, len: 286 aa. Equivalent to Rv1118c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 286 aa overlap). Conserved hypothetical protein, similar to pseudogene ML0942 in Mycobacterium leprae. Protein product from Mb1149c detected using shotgun mass spectrometry. Mb1149c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF8" /protein_id="SIT99749.1" /translation="MQSGPHLVGRVGTSFPLIARHQGATRDDAGDTGQPDPLPHVAHP DRLYPPMVHGVDPSTLALDRALNETRTGDLWLFRGRSRPDRAIQTLTNAPVNHVGMTV AIDDLPPLIWHAELGDKLLDVWTGTNHRGVQLNDARQVVQQWAGRYRQRCWLRQLTPH ANRDQEDKLLRVIARMNGTPFPTTARLTGRWLRGRLPTLNDWLRGIPVLDRKVREQTQ RRKQQQRTMGLATAYCAETVAITYEEMGLLVTDKDAHWFDPGKFWSGDSLPLAPGYRL GHEIAVDVGG" CDS complement(1244263..1244412) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1150C" /product="Adenylate cyclase (EC" /EC_number="4.6.1.1" /note="Mb1150c, -, len: 49 aa. Equivalent to Rv1119c, len: 49 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 49 aa overlap). Hypothetical unknown protein." /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3XXE6" /protein_id="SIT99750.1" /translation="MTARVAGQAVGGQILVGEPVHDAVSDCADIRFGSYRLFSLDAAP GPDLD" CDS complement(1244409..1244903) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1151C" /product="Adenylate cyclase (EC" /EC_number="4.6.1.1" /note="Mb1151c, -, len: 164 aa. Equivalent to Rv1120c, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 164 aa overlap). Conserved hypothetical protein, some similarity at C-terminus to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1890c|MTCY180.28 (462 aa), FASTA scores: opt: 187, E(): 2.2e-05, (36.6% identity in 93 aa overlap) and Rv2488c|YZ19_MYCTU|Q10551 (285 aa), FASTA scores: opt: 156, E(): 0.00074, (32.7% identity in 107 aa overlap)." /db_xref="GOA:A0A1R3XXF0" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF0" /protein_id="SIT99751.1" /translation="MLSGGREAVKTVWQTANLVRKEGFGAAVRSSIEDPADWAEVERP DLARVTPDARVVILFSDIEESTALDERIGDRTWVKLIGAHDKLVHELVRRWSGHMVTS QGDGFMIAFARAEQAVRCGIDIQDALRNSAKRKRNQGIRVRIGTTWGARCGTVTICSA ATSQ" CDS 1245106..1246506 /codon_start=1 /transl_table=11 /gene="zwf1" /locus_tag="BQ2027_MB1152" /product="PROBABLE GLUCOSE-6-PHOSPHATE 1-DEHYDROGENASE ZWF1 (G6PD)" /note="Mb1152, zwf1, len: 466 aa. Equivalent to Rv1121, len: 466 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 466 aa overlap). Probable zwf1, glucose-6-phosphate 1-dehydrogenase (EC 1.1.1.49), highly similar to many e.g. G6PD_E COLI|P22992 Escherichia coli (491 aa), FASTA scores: opt: 642, E(): 0, (35.8% identity in 478 aa overlap). Mycobacterium tuberculosis has two genes for ZWF, this one is highly divergent. BELONGS TO THE GLUCOSE-6-PHOSPHATE DEHYDROGENASE FAMILY. Note that previously known as zwf. Protein product from Mb1152 detected using SWATH mass spectrometry. Mb1152 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A587" /db_xref="InterPro:IPR001282" /db_xref="InterPro:IPR019796" /db_xref="InterPro:IPR022674" /db_xref="InterPro:IPR022675" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A587" /protein_id="SIT99752.1" /translation="MVDGGGGASDLLVIFGITGDLARKMTFRALYRLERHQLLDCPIL GVASDDMSVGQLVKWARESIGRTEKIDDAVFDRLAGRLSYLHGDVTDSQLYDSLAELI GSACRPLYYLEMPPALFAPIVENLANVRLLERARVAVEKPFGHDLASALELNARLRAV LGEDQILRVDHFLGKQPVVELEYLRFANQALAELWDRNSISEIHITMAEDFGVEDRGK FYDAVGALRDVVQNHLLQVLALVTMEPPVGSSADDLNDKKAEVFRAMAPLDPDRCVRG QYLGYTEVAGVASDSATETYVALRTEIDNWRWAGVPIFVRAGKELPAKVTEVRLFLRR VPALAFLPNRRPAEPNQIVLRIDPDPGMRLQISAHTDDSWRDIHLDSSFAVDLGEPIR PYERLLYAGLVGDHQLFAREDSIEQTWRIVQPLLDNPGEIHRYDRGSWGPEAAQSLLR GHRGWQSPWLPRGTDA" CDS 1246528..1247550 /codon_start=1 /transl_table=11 /gene="gnd2" /locus_tag="BQ2027_MB1153" /product="probable 6-phosphogluconate dehydrogenase,decarboxylating gnd2" /note="Mb1153, gnd2, len: 340 aa. Equivalent to Rv1122, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 340 aa overlap). Probable gnd2, 6-phosphogluconate dehydrogenase, decarboxylating (EC 1.1.1.44), highly similar to Q53917 6-PHOSPHOGLUCONATE DEHYDROGENASE from Streptomyces coelicolor (291 aa), fasta scores: opt: 431, E(): 2.2e-20, (44.5% identity in 335 aa overlap). Also similar to Rv1844c|MTCY359.29|gnd1 PROBABLE 6-PHOSPHOGLUCONATE DEHYDROGENASE from Mycobacterium tuberculosis (485 aa), FASTA score: (33.0% identity in 351 aa overlap). Note that Rv1844c|MTCY359.29|gnd1 is most similar to gnd's from Gram negative organisms, while gnd2 is most similar to gnd's from Gram positive organisms. BELONGS TO THE 6-PHOSPHOGLUCONATE DEHYDROGENASE FAMILY. Protein product from Mb1153 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1153 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYE0" /db_xref="InterPro:IPR004849" /db_xref="InterPro:IPR006114" /db_xref="InterPro:IPR006115" /db_xref="InterPro:IPR006183" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE0" /protein_id="SIT99753.1" /translation="MQLGMIGLGRMGANIVRRLAKGGHDCVVYDHDPDAVKAMAGEDR TTGVASLRELSQRLSAPRVVWVMVPAGNITTAVIEELANTLEAGDIVIDGGNTYYRDD LRHEKLLFKKGIHLLDCGTSGGVWGRERGYCLMIGGDGDAFARAEPIFATVAPGVAAA PRTPGRDGEVAPSEQGYLHCGPCGSGHFVKMVHNGIEYGMMASLAEGLNILRNADVGT RVQHGDAETAPLPNPECYQYDFDIPEVAEVWRRGSVIGSWLLDLTAIALRESPDLAEF SGRVSDSGEGRWTAIAAIDEGVPAPVLTTALQSRFASRDLDDFANKALSAMRKQFGGH AEKPAN" CDS complement(1247543..1248451) /codon_start=1 /transl_table=11 /gene="bpoB" /locus_tag="BQ2027_MB1154C" /product="POSSIBLE PEROXIDASE BPOB (NON-HAEM PEROXIDASE)" /note="Mb1154c, bpoB, len: 302 aa. Equivalent to Rv1123c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 302 aa overlap). Possible bpoB, peroxidase (non-haem peroxidase) (EC 1.11.1.-), with some similarity to a range of enzymes from several organisms including: DEH1_MORSP|Q01398 haloacetate dehalogenase (EC 3.8.1.3) from Moraxella sp. (294 aa), FASTA scores: opt: 201, E(): 2.1e-06, (35.8% identity in 134 aa overlap); and BPA1_STRAU|P33912 non-haem bromoperoxidase bpo-a1 (EC 1.11.1.-) from Streptomyces aureofaciens (274 aa), FASTA scores: opt: 187, E(): 1.6e-05, (23.1% identity in 281 aa overlap). Similar to several other Mycobacterium tuberculosis proteins, probable epoxide hydrolases and non-heme bromoperoxidases e.g. Rv1938, Rv3617, Rv3473c, Rv3171c, etc. Contains PS00216 Sugar transport proteins signature 1. Protein product from Mb1154c detected using SWATH mass spectrometry. Mb1154c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXG7" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG7" /protein_id="SIT99754.1" /translation="MTIWRVPSKVTSGPVSAVSSSPQAVAFSGARGITLVADEWNRGA AAADRPTILMLHGGGQNRFSWKNTGQILADEGHHVVALDTRGPGDSDRAPGADYAVET PTTDVLHVVEAIGRRVVVVEASMGGLTGILVAERAGPQTVNGLVLVDVVPRYEKEGNA RIRDFMLGNIDGFGSLEEAADAVAEYLPHRDKPRSPEGLKRNLRLRDGRWHWHWDPAM MTAPGHDPQLRTENFERAAMGLTIPVLLIRGKLSDVVSSDGARDFLAKVPNAEFVELS NAGRTAAGDDNDAFTDVVVDFVRRLS" CDS 1248526..1249476 /codon_start=1 /transl_table=11 /gene="ephC" /locus_tag="BQ2027_MB1155" /product="PROBABLE EPOXIDE HYDROLASE EPHC (EPOXIDE HYDRATASE)" /note="Mb1155, ephC, len: 316 aa. Equivalent to Rv1124, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 316 aa overlap). Probable ephC, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to Q42566 epoxide hydrolase from Arabidopsis thaliana (321 aa), FASTA scores: opt: 298, E(): 8.2e-13, (27.6% identity in 333 aa overlap). Similar to other Mycobacterium tuberculosis epoxide hydrolases and non-heme bromoperoxidases e.g. Rv1938, Rv3617, Rv3670, Rv3473c, etc. Protein product from Mb1155 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1155 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXS8" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS8" /protein_id="SIT99755.1" /translation="MRAGRGERESTWRTTMAEPHWIDVKGPNGDLKALTWGPAGAPVA LCLHGFPDTAYGWRKVAPRLAESGWHVVAPFMRGYAPSSIPADGSYHVGALMHDALRV RSAAGGTERDVIIGHDWGAIAATGLAAMPDSPFAKAVIMSVPPSAAFRPLGRVPERGR LLRELPHQLLRSWYILYFQLPWLPERSASWVVPLLWRRWSPGYHAEEDLRHVDAAIGT PEGRRAALGPYRATMRNTRAPADYADLNRLWTEAPKLPVLYLHGHDDGCATSAFTHWT ARVLPAGSEVAVVEHAGHFLQLEQPDKIAELIVAFIGSPG" CDS 1249481..1250725 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1156" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1156, -, len: 414 aa. Equivalent to Rv1125, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 414 aa overlap). Conserved hypothetical protein. Similar to AL133278|SCM11.13 hypothetical protein from Streptomyces coelicolor (446 aa), FASTA scores: opt: 182, E(): 0.0005, (28.1% identity in 437 aa overlap). Protein product from Mb1156 detected using SWATH mass spectrometry. Mb1156 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXF7" /db_xref="InterPro:IPR009721" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF7" /protein_id="SIT99756.1" /translation="MAGHRMAAVDAQFYWMSAKVPNDQFLLYAFDGEPTDLERAVAQV YRRARGCPGLGMRVQDRGALAYPQWVPTPVQRDQLVCHDLADRSWQGCLAAVVGLAGK QLDMRRMPWRLHVFTPVHDVPGVSGLGTVAVMQFAHALGDGARASAMAAWLFGRPAAV PEIARSRAGFLPWRAAHAARAHLRLVRDTNAGLVAPGVGSRPPLSTNARPEGVRAVRT LLRRRSQLAGPTVTVTVLAAVSTGLLGLLGGDVDTLGAEVPMAKPGVPRSYNHFGNVV VGLYPRLEPDERVRRIATDLANARRRFEHPAMLSADRAFAAVPAALLRWGVSQFDAEV RPVRVAGNTVVSSVYRGAADLSFGDAPVVLTAGYPALSPAMGLTHGVHGIGDTVAISV HAAESAVSDIDAYMRLLDAALQ" CDS complement(1250729..1251334) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1157C" /product="conserved protein" /note="Mb1157c, -, len: 201 aa. Equivalent to Rv1126c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 201 aa overlap). Conserved hypothetical protein, similar in N-terminus to O05567|MLCB33.17 hypothetical protein from Mycobacterium leprae (141 aa), FASTA scores: opt: 332, E(): 1.4e-23, (58.4% identity in 101 aa overlap). Protein product from Mb1157c detected using shotgun mass spectrometry. Mb1157c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF2" /protein_id="SIT99757.1" /translation="MSELTVLQAVRLKGRVITTDLAQTLGEDLADVAATVDRLTAAGL LVDATPLRISPSGRMRLDDLLAEERNRADSTVLAAAYRDFRSVNADFKRLVTDWQLKG EKPNTHDDAEYDAAVLSRLDGVHRRVGPIIGTVAMQLPRLSRYPVKLRAALDKVKAGD IAWLTRPLIDSYHTVWFELHEELIQAVGLTRDEAAKSGDAQ" CDS complement(1251331..1252803) /codon_start=1 /transl_table=11 /gene="ppdK" /locus_tag="BQ2027_MB1158C" /product="PROBABLE PYRUVATE, PHOSPHATE DIKINASE PPDK" /note="Mb1158c, ppdK, len: 490 aa. Equivalent to Rv1127c, len: 490 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 490 aa overlap). Probable ppdK, Pyruvate, phosphate dikinase (EC 2.7.9.1). Equivalent (but shorter) to Z94723|MLCB33_16 ppdK from Mycobacterium leprae (601 aa) (71.8% identity in 478 aa overlap). Highly similar to N-terminus of PODK_CLOSY|P22983 pyruvate, phosphate dikinase from Clostridium symbiosum (873 aa), FASTA scores: opt: 786, E(): 0, (37.4% identity in 514 aa overlap). Protein product from Mb1158c detected using SWATH mass spectrometry. Mb1158c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXF3" /db_xref="InterPro:IPR002192" /db_xref="InterPro:IPR008279" /db_xref="InterPro:IPR010121" /db_xref="InterPro:IPR013815" /db_xref="InterPro:IPR036637" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF3" /protein_id="SIT99758.1" /translation="MTRITRANGCPDGTLENAVVALDGGANYPREILGNKGHGIDMMR RHHLPVPPAFCITTEVGVRYLAAPESTIAAIWDDVLDRMSWLETETSCTFGRGPNPLL VSVRSGATQSMPGMMDTILDVGMTDAVERVLARPGAADFAHDTRRRFTSMYRRIVGSA GPITDDPYAQLRASIEAVFASWNSPRAVAYRDHHGLDDQGGTAVVVQAMVFGNLTANS GAGVLSSRNPITGANEPFGEWLPGGQGDDVVSGLVAVAPITALRDQQPAVYDQLMAAA RSLERMAGDVQEIEFTVEDSQLWLLQTRGAERSAQAAVRLALQLHHEGLIDDTETLRR VTPTHIETLLRPSLQPETRLAAPLLAKGLPACPGVVSGTAYTEVDEALDAADRGEPVI LVRDHTRPEDVMGMLAAQGIVTEVGGAASHAAVVSRELGRVAVVGCGPGVAAALAGKE ITVDGYEGEVRQGVLALSAWSESDTPELRELADIAQRISS" repeat_region complement(1253016..1254371) /rpt_family="REP" /note="REP-3, len: 1356 nt. Equivalent to REP, len: 1325 nt, from Mycobacterium tuberculosis strain H37RV, (99.9% identity in 1325 nt overlap). REP22G8, member of REP13E12 family." CDS complement(1253016..1254371) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1159C" /product="13E12 repeat family protein" /note="Mb1159c, -, len: 451 aa. Equivalent to Rv1128c, len: 451 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 451 aa overlap). Conserved hypothetical protein, in REP13E12 degenerate repeat, highly similar to several Mycobacterium tuberculosis proteins in REP13E12 repeats e.g. Rv1148c, Rv1945, Rv3467, etc. Mb1159c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99759.1" /translation="MCSTREEITEAFASLATALSRVLGLTFDALTTPERLALLEHCET ARRQLPSVEHTLINQIGEQSTEEELGGKLGLTLADRLRITRSEAKRRVAEAADLGQRR ALTGEPLPPLLTATAKAQRHGLIGDGHVEVIRAFVHRLPSWVDLKTLEKAERDLAKQA TQYRPDQLAKLAARIMDCLNPDGDYTDEDRARRRGLTLGKQDVDGMSRLSGYVTPELR ATIEAVWAKLAAPGMCNPEQKAPCVNGAPSKEQARRDTRSCPQRNHDALNAGLRSLLT SGNLGQHNGLPASIIVTTTLKDLEAAAGAGLTGGGTILPISDVIRLARHANHYLAIFD RGKALALYHTKRLASPAQRIMLYAKDSGCSAPGCDVPGYYCEVHHVTPYAQCRNTDVN DLTLGCGGHHPLAERGWTTRKNAHGDTEWLPPPHLDHGQPRVNTFHHPEKLLADDEGD P" CDS complement(1254473..1255933) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1160C" /product="PROBABLE TRANSCRIPTIONAL REGULATOR PROTEIN" /note="Mb1160c, -, len: 486 aa. Equivalent to Rv1129c, len: 486 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 486 aa overlap). Possible transcriptional regulator protein, similar to Rv0465c|MTV038.09c Mycobacterium tuberculosis (474 aa), FASTA scores: E(): 0, (47.4% identity in 468 aa overlap). Helix turn helix motif present from aa 32-53. Protein product from Mb1160c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XXF4" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010359" /db_xref="InterPro:IPR010982" /db_xref="InterPro:IPR018653" /db_xref="InterPro:IPR026281" /db_xref="UniProtKB/TrEMBL:A0A1R3XXF4" /protein_id="SIT99760.1" /translation="MTRSNVLPVARTYSRTFSGARLRRLRQERGLTQVALAKALDLST SYVNQLENDQRPITVPVLLLLTERFDLSAQYFSSDSDARLVADLSDVFTDIGVEHAVS GAQIEEFVARMPEVGHSLVAVHRRLRAATEELEGYRSRATAETELPPARPMPFEEVRD FFYDRNNYIHDLDMAAERMFTESGMRTGGLDIQLAELMRDRFGISVVIDDNLPDTAKR RYHPDTKVLRVAHWLMPGQRAFQIATQLALVGQSDLISSIVATDDQLSTEARGVARIG LANYFAGAFLLPYREFHRAAEQLRYDIDLLGRRFGVGFETVCHRLSTLQRPRQRGIPF IFVRTDKAGNISKRQSATAFHFSRVGGSCPLWVVHDAFAQPERIVRQVAQMPDGRSYF WVAKTTAADGLGYLGPHKNFAVGLGCDLAHAHKLVYSTGVVLDDPSTEVPIGAGCKIC NRTSCAQRAFPYLGGRVAVDENAGSSLPYSSTEQSV" CDS 1255954..1257534 /codon_start=1 /transl_table=11 /gene="prpD" /locus_tag="BQ2027_MB1161" /product="possible methylcitrate dehydratase prpD" /note="Mb1161, -, len: 526 aa. Equivalent to Rv1130, len: 526 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 526 aa overlap). Conserved hypothetical protein, some similarity to AP000063|AP000063_192 hypothetical protein from Aeropyrum pernix (479 aa), FASTA scores: opt: 717, E(): 0, (34.3% identity in 443 a a overlap), and to PRPD_ECOLI|P77243 prpd protein from Escherichia coli (483aa), FASTA scores: opt: 234, E(): 3.3e-08, (27.0% identity in 429 aa overlap). Protein product from Mb1161 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1161 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXG0" /db_xref="InterPro:IPR005656" /db_xref="InterPro:IPR036148" /db_xref="InterPro:IPR042183" /db_xref="InterPro:IPR042188" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99761.1" /translation="MPGQDTKVRLFRVFCWCPVLRMVRIMLMHAVRAWRSADDFPCTE HMAYKIAQVAADPVDVDPEVADMVCNRIIDNAAVSAASMVRRPVTVARHQALAHPVRH GAKVFGVEGSYSADWAAWANGVAARELDFHDTFLAADYSHPADNIPPLVAVAQQLGVC GAELIRGLVTAYEIHIDLTRGICLHEHKIDHVAHLGPAVAAGIGTMLRLDQETIYHAI GQALHLTTSTRQSRKGAISSWKAFAPAHAGKVGIEAVDRAMRGEGSPAPIWEGEDGVI AWLLAGPEHTYRVPLPAPGEPKRAILDSYTKQHSAEYQSQAPIDLACRLRERIGDLDQ IASIVLHTSHHTHVVIGTGSGDPQKFDPDASRETLDHSLPYIFAVALQDGCWHHERSY APERARRSDTVALWHKISTVEDPEWTRRYHCADPAKKAFGARAEVTLHSGEVIVDELA VADAHPLGTRPFERKQYVEKFTELADGVVEPVEQQRFLAVVESLADLESGAVGGLNVL VDPRVLDKAPVIPPGIFR" CDS 1257531..1258712 /codon_start=1 /transl_table=11 /gene="prpc" /locus_tag="BQ2027_MB1162" /product="probable methylcitrate synthase prpc" /note="Mb1162, gltA1, len: 393 aa. Equivalent to Rv1131, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 393 aa overlap). Probable gltA1, citrate synthase 1 (EC 4.1.3.7), highly similar to CISY_MYCSM|P26491 citrate synthase from Mycobacterium smegmatis (375 aa), FASTA scores: opt:1942, E(): 0, (80.0% identity in 375 aa overlap). Also similar to two other M. tuberculosis citrate synthases, Rv0896c|MTCY31.24|gltA2 (431 aa), FASTA score: (33.1% identity in 381 aa overlap) and Rv0889|MTCY31.17c|citA (373 aa), FASTA score: (31.8% identity in 371 aa overlap). Contains PS00480 Citrate synthase signature. BELONGS TO THE CITRATE SYNTHASE FAMILY. Protein product from Mb1162 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1162 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZI5" /db_xref="InterPro:IPR002020" /db_xref="InterPro:IPR011278" /db_xref="InterPro:IPR016142" /db_xref="InterPro:IPR016143" /db_xref="InterPro:IPR019810" /db_xref="InterPro:IPR024176" /db_xref="InterPro:IPR036969" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99762.1" /translation="MTGPLAAARSVAATKSMTAPTVDERPDIKKGLAGVVVDTTAISK VVPQTNSLTYRGYPVQDLAARCSFEQVAFLLWRGELPTDAELALFSQRERASRRVDRS MLSLLAKLPDNCHPMDVVRTAISYLGAEDPDEDDAAANRAKAMRMMAVLPTIVAIDMR RRRGLPPIAPHSGLGYAQNFLHMCFGEVPETAVVSAFEQSMILYAEHGFNASTFAARV VTSTQSDIYSAVTGAIGALKGRLHGGANEAVMHDMIEIGDPANAREWLRAKLARKEKI MGFGHRVYRHGDSRVPTMKRALERVGTVRDGQRWLDIYQVLAAEMASATGILPNLDFP TGPAYYLMGFDIASFTPIFVMSRITGWTAHIMEQATANALIRPLSAYCGHEQRVLPGT F" CDS 1258724..1260454 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1163" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1163, -, len: 576 aa. Equivalent to Rv1132, len: 576 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 576 aa overlap). Conserved membrane protein, similar to O06827|Rv1431|MTCY493.23C membrane protein from Mycobacterium tuberculosis (589 aa), fasta scores: opt: 1811, E(): 0, (48.2% identity in 585 aa overlap). Protein product from Mb1163 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1163 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYE9" /db_xref="InterPro:IPR021941" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99763.1" /translation="MGFLQPRLPDIDLAEWSQGSRSQKIRPMAQHWAEVGFGTPVLLH LFYVAKILLYVLVGWLIVLTTKGIDGFTDAAAWYAEPIVFEKVVLYTMLFEVIGLGCG FGPLNNRFFPPMGSILYWMRFGTIRLPPWPDRVPWTRGTKRKPVDVALYALLVMMLLS ALFTDGAGPIPELGTMVGLLPAWQIVLILLLLGVLGLRDKVIFLAARGEVYATLTVTF LFGRLNGIDMIVAAKLVFLVIWIGAATSKLNRHFPFVISTMMSNNPLFRPRFIKRMFF KKFPGDLRPGLLSRIVAHVSTVIEMCVPVVLFVAHGGWPTVVAATIMVCFHLGILTAI PMGVPLEWNVFMIFGVLSLFVGHACLGLADVKNPVPLAILIAVVAGIVIAGNVFPRKI SFLAAMRYYAGNWDTTLWCIKPSAEDKINRGIVAIASMPAAQLERFYGKDRAQIPMYL GYAFRAMNSHGRALFTLAHRAMAGHDEDDYVITDGERVCSTAVGWNFGDGHLHNEQLI AAMQQRCGFQPGEVRVVLLDAQPIHRQTQEYRLVDAATGEFERGYVRVADMVNRQPWD DDVPVHVLPG" CDS complement(1260466..1262745) /codon_start=1 /transl_table=11 /gene="metE" /locus_tag="BQ2027_MB1164C" /product="PROBABLE 5-METHYLTETRAHYDROPTEROYLTRIGLUTAMATE-- HOMOCYSTEINE METHYLTRANSFERASE METE (methionine synthase, vitamin-B12 independent isozyme)" /note="Mb1164c, metE, len: 759 aa. Equivalent to Rv1133c, len: 759 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 759 aa overlap). Probable metE, 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase (EC 2.1.1.14), highly similar to others e.g. METE_ECOLI|P25665 Escherichia coli (752 aa), FASTA scores: opt: 2251, E(): 0, (48.1% identity in 756 aa overlap). Equivalent to Z94723|MLCB33_14 metE from M. leprae (760 aa) (85.3% identity in 755 aa overlap). BELONGS TO THE VITAMIN-B12 INDEPENDENT METHIONINE SYNTHASE FAMILY. Protein product from Mb1164c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1164c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65341" /db_xref="InterPro:IPR002629" /db_xref="InterPro:IPR006276" /db_xref="InterPro:IPR013215" /db_xref="InterPro:IPR038071" /db_xref="UniProtKB/Swiss-Prot:P65341" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99764.1" /translation="MTQPVRRQPFTATITGSPRIGPRRELKRATEGYWAGRTSRSELE AVAATLRRDTWSALAAAGLDSVPVNTFSYYDQMLDTAVLLGALPPRVSPVSDGLDRYF AAARGTDQIAPLEMTKWFDTNYHYLVPEIGPSTTFTLHPGKVLAELKEALGQGIPARP VIIGPITFLLLSKAVDGAGAPIERLEELVPVYSELLSLLADGGAQWVQFDEPALVTDL SPDAPALAEAVYTALCSVSNRPAIYVATYFGDPGAALPALARTPVEAIGVDLVAGADT SVAGVPELAGKTLVAGVVDGRNVWRTDLEAALGTLATLLGSAATVAVSTSCSTLHVPY SLEPETDLDDALRSWLAFGAEKVREVVVLARALRDGHDAVADEIASSRAAIASRKRDP RLHNGQIRARIEAIVASGAHRGNAAQRRASQDARLHLPPLPTTTIGSYPQTSAIRVAR AALRAGEIDEAEYVRRMRQEITEVIALQERLGLDVLVHGEPERNDMVQYFAEQLAGFF ATQNGWVQSYGSRCVRPPILYGDVSRPRAMTVEWITYAQSLTDKPVKGMLTGPVTILA WSFVRDDQPLADTANQVALAIRDETVDLQSAGIAVIQVDEPALRELLPLRRADQAEYL RWAVGAFRLATSGVSDATQIHTHLCYSEFGEVIGAIADLDADVTSIEAARSHMEVLDD LNAIGFANGVGPGVYDIHSPRVPSAEEMADSLRAALRAVPAERLWVNPDCGLKTRNVD EVTASLHNMVAAAREVRAG" CDS 1263321..1263557 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1165" /product="HYPOTHETICAL PROTEIN" /note="Mb1165, -, len: 78 aa. Equivalent to Rv1134, len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 78 aa overlap). Hypothetical unknown protein. Mb1165 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XXT7" /protein_id="SIT99765.1" /translation="MAAYQKFGQEHAAAIRGGAVLHPTATATTVRVTGARGGDVVTGD GPYEAADLDEQGPFPMETVYLWEDGPNGTTRMTL" CDS complement(1263671..1265527) /codon_start=1 /transl_table=11 /gene="PPE16" /locus_tag="BQ2027_MB1166C" /product="ppe family protein ppe16" /note="Mb1166c, PPE16, len: 618 aa. Equivalent to Rv1135c, len: 618 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 618 aa overlap). Member of the M. tuberculosis PPE family of glycine-rich proteins. Similar to Rv2356c (59.6% identity in 627 aa overlap); etc. Mb1166c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG8" /protein_id="SIT99766.1" /translation="MSFLVLPPEVNSALMFAGAGSGPTLAAAAAWDGLAAELGQAANS FSSATAALADTAWQGPAATAMAAAAAPYASWLSTAATRALSAAAQAKAAAAVYEAARA ATVDPLLVAANRHQLVSLVLSNLFGQNAPAIAATEAAYEQLWAADVAAMVSYHSGASA VAAQLAPWAQAVRALPNPTAPALASGPAALAIPALGIGNTGIGNIFSIGNIGDYNLGN GNTGNANLGSGNTGQANLGSGNTGFFNFGSGNTANTNFGSGNLGNLNLGSGNDGNGNF GLGNIGDGNRGSGNVGSFNFGTANAGSFNVGSANHGSPNVGFANLGNNNLGIANLGNN NLGIANLGNNNIGIGLTGDNMIGIGALNSGIGNLGFGNSGNNNIGLFNSGNNNIGFFN SGDSNFGFFNSGDTNTGFGNAGFTNTGFGNAGSGNFGFGNAGNNNFGFGNSGFENMGV GNSGAYNTGSFNSGTLNTGDLNSGDFNTGWANSGDINTGGFHSGDLNTGFGSPVDQPV MNSGFGNIGTGNSGFNNSGDANSGFQNTNTGAFFIGHSGLLNSGGGQHVGISNSGTGF NTGLFNTGFNNTGIGNSATNAAFTTTSGVANSGDNSSGGFNAGNDQSGFFDG" CDS 1265713..1265955 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1167" /product="POSSIBLE ACETYL-COA ACETYLTRANSFERASE (ACETOACETYL-COA THIOLASE)" /note="Mb1167, -, len: 80 aa. Equivalent to Rv1135A, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 80 aa overlap). Possible acetyl-CoA acetyltransferase (EC 2.3.1.9) (possible gene fragment), highly similar to other acetyl-CoA acetyltransferases e.g. C-terminal part of Rv3556c|Z92774|MTCY6G11_2|MTCY06G11.03| fadA6 ACETYL-COA ACETYLTRANSFERASE from Mycobacterium tuberculosis (386 aa), FASTA scores: opt: 219, E(): 5.7e-09, (63.6% identity in 55 aa overlap)." /db_xref="GOA:A0A1R3XXG1" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG1" /protein_id="SIT99767.1" /translation="MQLGNQNTMRFAGRPQRFRQSAYPLFNPNSAIALGHPFGGSGAR LMTTVLHHMPDKGIRYGLQTMCEGRGQANATIVELL" CDS 1266005..1266346 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1168" /product="POSSIBLE ENOYL-COA HYDRATASE" /note="Mb1168, -, len: 113 aa. Equivalent to Rv1136, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 113 aa overlap). Probable enoyl-CoA hydratase (possible gene fragment) (EC 5.-.-.-). Some similarity to N-terminus of carnitine racemases and enoyl-CoA hydratases (but much shorter) e.g. I41014 carnitine racemase from Escherichia coli (297 aa), FASTA scores: opt: 258, E(): 2.5e-11, (44.5% identity in 110 aa overlap); and Rv0222 putative enoyl-CoA hydratase from M. tuberculosis (262 aa). Mb1168 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXG3" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG3" /protein_id="SIT99768.1" /translation="MVITINRPEARNAVNGAVSIVVGDALEEAHDNPDVRAVVITGAG DKSLCAGADLKAIARRENPYHPHHGEWGIAGYRHHFIDKPTSAAVSGTALDDGAEPAL ASDLVVADEHT" CDS complement(1266486..1266854) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1169C" /product="HYPOTHETICAL PROTEIN" /note="Mb1169c, -, len: 122 aa. Equivalent to Rv1137c, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 122 aa overlap). Hypothetical unknown protein. Mb1169c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XXH5" /protein_id="SIT99769.1" /translation="MLSARCHIRHIGSPGKDARCAHLSATLRPGIGISPTNVGNATVL ADGTPAKPIQGAETMQRARHTGSCFSANARGPAISSGNPSRAGCGVPSSTTTPSSTPQ AIRLLACTDSDALTVTRTAR" CDS complement(1266871..1267887) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1170C" /product="possible oxidoreductase" /note="Mb1170c, -, len: 338 aa. Equivalent to Rv1138c, len: 338 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 338 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to Q9EWQ8 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (343 aa). Also similar to many Mycobacterium tuberculosis hypothetical proteins e.g. Rv1751|P72008|MTCY04C12.35 (412 aa), fasta scores: opt: 89, E(): 4.5e-09, (24.6% identity in 358 aa overlap). Mb1170c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXG4" /db_xref="InterPro:IPR002938" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XXG4" /protein_id="SIT99770.1" /translation="MTSYDTDLLVVGGGPGGLATALHARARGLSVIVAEPRENPIDKA CGEGLMPGGLAELTSLGVDPVGLPFHGIAYVGEHRRVQARFRTGPGRGVRRTTLHAAL AARAKEQDTEWIRSRVATIQQDAHGVTAAGVRAKWLVAADGLHSAVRRAVGIKATAGT PRRYGVRWHYRLPVWSDFVEVHWSRWGEAYVTPVEPDLVGVAILSRQRPELAWFPSLA HHLQDASRGHARGCGPLRQVVSRRVAGRVLLVGDAAGYEDALTGEGISLAVKQAAAAV SAIVDDTPASYEAAWHRITRDYRLVTRGLVLASTPRAARRAIVPLCALLPTAFRYGVN ILAY" CDS complement(1267884..1268384) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1171C" /product="CONSERVED HYPOTHETICAL MEMBRANE PROTEIN" /note="Mb1171c, -, len: 166 aa. Equivalent to Rv1139c, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 166 aa overlap). Conserved hypothetical membrane protein. Highly similar to P54158|YBPQ_BACSU hypothetical Bacillus subtilis protein, YBPQ (168 aa), FASTA scores: opt: 446, E(): 2.2e-26, (38.4% identity in 164 aa overlap). Some similarity to Mycobacterium tuberculosis hypothetical proteins, Rv0740, Rv0750. Protein product from Mb1171c detected using SWATH mass spectrometry. Mb1171c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXH0" /db_xref="InterPro:IPR007269" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH0" /protein_id="SIT99771.1" /translation="MYYLLILAVVFERLAELVVAQRNARWSFAQGGKEFGRPHYVVMV ILHTALLLGCVVEPWALHRPFIPWLGWPMLAVVVASQGLRWWCVKSLGKRWNTRVIVL PHATLVRRGPYRWMRHPNYVAVVAEGFALPLVHTAWLTALVFTLANATLLTVRLRVEN SVLGYI" CDS 1268746..1269594 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1172" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb1172, -, len: 282 aa. Equivalent to Rv1140, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 282 aa overlap). Probable integral membrane protein. Weak similarity in C-terminus to hypothetical Escherichia coli proteins YPRA and YPRB, possibly membrane-bound e.g. YPRA_ECOLI HYPOTHETICAL 24.3 KD PROTEIN (URF 1) (217 aa), FASTA scores: opt: 166, E(): 0.00062, (31.0% identity in 158 aa overlap). Protein product from Mb1172 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1172 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZK2" /db_xref="InterPro:IPR003675" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK2" /protein_id="SIT99772.1" /translation="MPRDYTAPRWAHAWAGEPRPARWHPANQPAHPDHSNRESPACMS QSTTPYRSSVLAEFRRAITNVAVPHHEPPGIVRRRRVVVGVTLVIGAVMLGFSLRRTP GESSFYWLTLALAAVWIAGALMSGPLHLGGICWRGRNQRPVITGTTVGLLLAGIFGVG AMIVRAIPGAAEPIARVLQFAHQGTLLPILLITLINGIAEEMFFRGALYTALGRRYPV TISTVLYVGATMASANLMLGFAAIFVGTVCALERRASGGVLAPILTHFVWGLIMVFAL PPLFAV" CDS complement(1269602..1270408) /codon_start=1 /transl_table=11 /gene="echA11" /locus_tag="BQ2027_MB1173C" /product="PROBABLE ENOYL-COA HYDRATASE ECHA11 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb1173c, echA11, len: 268 aa. Equivalent to Rv1141c, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 268 aa overlap). Probable echA11, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. P24162|ECHH_RHOCA PROBABLE ENOYL-COA HYDRATASE from Rhodobacter capsulatus (257 aa); CAA66096.1|X97452 enoyl-CoA isomerase from Escherichia coli (262 aa), FASTA scores: opt: 513, E(): 1e-25, (36.1% identity in 249 aa overlap); etc. Also similarity with naphthoate synthases. Also highly similar to downstream ORF Rv1142c|MTCI65.09|echA10 PROBABLE ENOYL-COA HYDRATASE from Mycobacterium tuberculosis (268 aa), FASTA scores: opt: 1225, E(): 0, (72.3% identity in 267 aa overlap). Protein product from Mb1173c detected using SWATH mass spectrometry. Mb1173c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYF7" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF7" /protein_id="SIT99773.1" /translation="MPDSGIAALTPVTGLNVTLTDRVLSVRINRPSSLNSLTVPILTG IADTLERAAADPVVKVVRLGGVGRGFSSGVSMSVDDVWGGGPPTAIVEEANRAVRAVA ALPHPVVAVVQGPAVGVAVSLALACDFILASDSAFFMLANTKVALMPDGGASALVAAA TGRIRAMRLALLAEQLPAREALAWGLISAVYPDSDFEAEVDKVISRLLAGPALAFAQA KNAINAAALTELEPTFARELDGQEVLLRTHDFAEGAAAFLQRRTPNFTGS" CDS complement(1270551..1271357) /codon_start=1 /transl_table=11 /gene="echA10" /locus_tag="BQ2027_MB1174C" /product="PROBABLE ENOYL-CoA HYDRATASE ECHA10 (ENOYL HYDRASE) (UNSATURATED ACYL-CoA HYDRATASE) (CROTONASE)" /note="Mb1174c, echA10, len: 268 aa. Equivalent to Rv1142c, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 268 aa overlap). Probable echA10, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. CAA66096.1|X97452 enoyl-CoA isomerase from Escherichia coli (262 aa), FASTA scores: opt: 525, E(): 1.3e-26, (35.1% identity in 251 aa overlap); NP_420658.1|NC_002696 enoyl-CoA hydratase/isomerase family protein from Caulobacter crescentus (267 aa); NP_438092.1|NC_003078 putative enoyl-CoA hydratase protein from Sinorhizobium meliloti (263 aa); etc. Also similarity with naphthoate synthases. Also highly similar to upstream ORF Rv1141c|MTCI65.08c|echA11 PROBABLE ENOYL-CoA HYDRATASE from Mycobacterium tuberculosis (268 aa), FASTA score: opt: 1225, E(): 0, (72.3% identity in 267 aa overlap). TBparse score is 0.891. Protein product from Mb1174c detected using SWATH mass spectrometry. Mb1174c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXI6" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XXI6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99774.1" /translation="MSNYRIDTRTIVPGLAVTLADGVLSVTIDRPESLNSLTKPVLAG MADAIEGAATDPRVKVVRLGGAGRGFSSGGAISVDDVWASGPPTDTVAEANRTVRAIV ALPQPVVAVVQGPTVGCGVSLALACDLVLASDNAFFMLAHTNVGLMPDGGASALVQAA IGRIRAMHMALLPDRVPAAEALSWGLVSAVYPAADFDAEVDKLISRLLAGPALAIAKT KNAINAATLTELAPTLLRELDGQALLLRTDDFAEGATAFQQRRTPMFTGR" CDS 1271461..1272543 /codon_start=1 /transl_table=11 /gene="mcr" /locus_tag="BQ2027_MB1175" /product="PROBABLE ALPHA-METHYLACYL-COA RACEMASE MCR (2-methylacyl-CoA racemase) (2-arylpropionyl-CoA epimerase)" /note="Mb1175, mcr, len: 360 aa. Equivalent to Rv1143, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 360 aa overlap). Probable mcr, alpha-methylacyl-CoA racemase (EC 5.1.99.4). Strong similarity to other alpha-methylacyl-CoA racemases and also some similarity to L-carnitine dehydratase (EC 4.2.1.89) e.g. U89905|g1552373 methylacyl-CoA racemase alpha from Norway rat (361 aa), FASTA scores: opt: 1035, E():0, (47.2% identity in 339 aa overlap). Equivalent to (but longer than) Z94723|MLCB33_13 Mycobacterium leprae (253 aa) (85.3% identity in 245 aa overlap). Also similar to Mycobacterium tuberculosis putative racemases Rv0855, Rv1866, Rv3272. Protein product from Mb1175 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1175 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXU6" /db_xref="InterPro:IPR003673" /db_xref="InterPro:IPR023606" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU6" /protein_id="SIT99775.1" /translation="MAGPLSGLRVVELAGIGPGPHAAMILGDLGADVVRIDRPSSVDG ISRDAMLRNRRIVTADLKSDQGLELALKLIAKADVLIEGYRPGVTERLGLGPEECAKV NDRLIYARMTGWGQTGPRSQQAGHDINYISLNGILHAIGRGDERPVPPLNLVGDFGGG SMFLLVGILAALWERQSSGKGQVVDAAMVDGSSVLIQMMWAMRATGMWTDTRGANMLD GGAPYYDTYECADGRYVAVGAIEPQFYAAMLAGLGLDAAELPPQNDRARWPELRALLT EAFASHDRDHWGAVFANSDACVTPVLAFGEVHNEPHIIERNTFYEANGGWQPMPAPRF SRTASSQPRPPAATIDIEAVLTDWDG" CDS 1272555..1273307 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1176" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb1176, -, len: 250 aa. Equivalent to Rv1144, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 250 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases e.g. NP_104056.1|NC_002678 3-hydroxyacyl-CoA dehydrogenase type II from Mesorhizobium loti (253 aa); NP_251244.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (255 aa); AAK15008.1|AF233685_1|AF233685 short chain L-3-hydroxyacyl-CoA dehydrogenase from Mus musculus (261 aa); HSU73514|g1778354|XH98G2 human short-chain alcohol dehydrogenase from Homo sapiens (261 aa), FASTA scores: opt: 875, E(): 0, (60.1% identity in 253 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1176 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1176 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXH8" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99776.1" /translation="MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVG GLGDRARFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPLAA FRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQIGQAAYSASK GGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAEAKASLGQQVPHPSRLGNP DEYGALVLHIIENPMLNGEVIRLDGAIRMAPR" CDS 1273822..1276167 /codon_start=1 /transl_table=11 /gene="mmpL13" /locus_tag="BQ2027_MB1177" /product="probable conserved transmembrane transport protein mmpl13b" /note="Mb1177, mmpL13, len: 781 aa. Equivalent to Rv1145 and Rv1146, len: 303 aa and 470 aa, from Mycobacterium tuberculosis strain H37Rv, (97.9% identity in 284 aa overlap and 100% identity in 470 aa overlap). Probable mmpL13A, conserved transmembrane transport protein (see citation below), member of RND superfamily, showing some similarity to putative Mycobacterial and Streptomyces membrane proteins e.g. MTCY987|g1781238 from Mycobacterium tuberculosis (962 aa), FASTA scores: opt: 213, E(): 1.9e-06, (28.0% identity in 296 aa overlap); etc. Strong similarity to U92075|MMU92075_5 hypothetical protein from Mycobacterium marinum (256 aa), FASTA scores: opt: 957, E(): 0, (57.6% identity in 257 aa overlap). Should continue as mmpL13B|Rv1146, but frameshift required. Sequence has been checked and is identical in M. tuberculosis strain CDC1551, and Mycobacterium bovis strain AF2122/97 ????. BELONGS TO THE MMPL FAMILY. Probable mmpL13B, conserved transmembrane transport protein (see citation below), member of RND superfamily, showing some similarity to putative Mycobacterial and Streptomyces membrane proteins e.g. Q53902|C40046 antibiotic transport-associated protein from Streptomyces coelicolor (711 aa), FASTA scores: opt: 193, E(): 2.1e-05, (28.9% identity in 394 aa overlap); etc. Could be in frame with previous ORF mmpL13A|Rv1145, but no sequence error apparent to account for this; sequence is identical in Mycobacterium tuberculosis strain CDC1551. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1145 and Rv1146 exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-a) leads to a single product. Mb1177 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXH2" /db_xref="InterPro:IPR000731" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH2" /protein_id="SIT99777.1" /translation="MLQRIARLAIAAPRRIIGFAVFVFIAAAVFGVPVADSLSPGGFQ DPRSESARAIEVLTDKFGQSGQKMLIVVTAAAGADSPPAREVGTDIVEVLRRSPLVYN VTSPWTVPPTAAADLLSTDGKSGLIVVNVKGGENDAQNHAQTLSDEVAHDRDGVTVRA GGSAMEYAQINRQNKDDLLVMELIAIPLSFLVLIWVFGGLLAAGLPMAQAVLAVVGSM AVLRLVTFATEVSTFALNLSTALGLALAIDYTLLIVSRYRDELAEGSDRDEALIRTMA TSGRTVLFSAVTVALSMSATALFPMYFLKSFAYAGVATVAFVATASIVITPAAIVLLG PRLDALDVRRLVRRLLGRPDPVHKPVKQLFWYRSSKFVMRRWLPVGTAVVALLVLLGL PFLSVKWGFPDDRVLPRSASARQVGDILRDDFGHDPATQIPIVVPDARGLGPVELDSY AAELSRVPDVSAVAAPTGTFVDGSWVGTPRGATGLAEGSAFLTVSSTAPLFSRASDIQ LKRLHQVAGPAGRSVVMAGVAQVNRDSVDAVTDRLPMVLGLIAAITYVLLFLLTGSVV LPAKALVCNVLSLTAAFGALVWIFQEGHFGALGTTPSGTLVANMPVLLFCIAFGLSMD YEVFLVSRIREYWLESGAARPARRSVAEVHAANDESVALGVARTGRVITAAALVMSMS FAALIAAHVSFMRMFGLGLTLAVAADATLVRMVVVPAFMHVTGRWNWWAPRPLAWLHE RFGVSEAAEPVSRRRSHAGGLGKIAGRSDGQTIPASLTRNG" CDS 1276300..1276950 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1178" /product="SAM-dependent methyltransferase" /note="Mb1178, -, len: 216 aa. Equivalent to Rv1147, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 216 aa overlap). Conserved hypothetical protein, similar to many conserved hypothetical proteins, and some similarity to several methyltransferases e.g. Q05197|PMTA_RHOSH phosphatidylethanolamine N-methyltransferase (EC 2.1.1.17) from R. sphaeroides (203 aa), FASTA scores: opt: 156, E(): 0.00073, (27.6% identity in 156 aa overlap). Protein product from Mb1178 detected using SWATH mass spectrometry. Mb1178 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH3" /protein_id="SIT99778.1" /translation="MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLS GRVLEVGAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVEEF RDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAGARGRVQRFVD ATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAWVPLPVSELALGRAHRT" repeat_region complement(1277700..1279148) /rpt_family="REP" /note="REP-4, len: 1362 nt. Equivalent to REP, len: 1362 nt, from Mycobacterium tuberculosis strain H37Rv, (96.4% identity in 1362 nt overlap). REP165, member of REP13E12 family." CDS complement(1277700..1279148) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1179C" /product="13E12 repeat family protein" /note="Mb1179c, -, len: 482 aa. Equivalent to Rv1148c, len: 482 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 482 aa overlap). Conserved hypothetical ORF in REP13E12 degenerate repeat, nearly identical to other hypothetical Mycobacterium tuberculosis proteins in REP13E12 repeats, although similarity extends upstream past proposed f-Met start. Very similar to other REP13E12 proteins e.g. Rv1945, Rv3467, Rv0094c, Rv1128c etc. Mb1179c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/Swiss-Prot:P0A5E0" /protein_id="SIT99779.1" /translation="MSETFCLTDHSEPMTARFLSVVLRRIRGMRSDTREEISAALDAY HASLSRVLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGGTL RTALANRLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQREGKIGREHIK EIQAFFKELSAAVDLGIREAAEAQLAELATSRRPDHLHGLATQLMDWLHPDGNFSDQE RARKRGITMGKQEFDGMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQTPLVDDTP DADAVRRDTRSQAQRNHDAFLAALRGLLASGELGQHKGLPVTIVVSTTLKELEAATGK GVTGGGSRVPMSDLIRMASHANHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGC SRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACGPDNRLVEKGWKTRKNAHGDTEW LPPPHLDHGQPRINRYHHPAKILCEQDDDEPH" mobile_element 1279215..1280192 /mobile_element_type="insertion sequence:IS-LIKE" /locus_tag="BQ2027_IS-LIKE-2" /note="IS-LIKE-2, len: 978 nt. Equivalent to IS-LIKE, len: 978 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 978 nt overlap). IS element ISlike2." repeat_region 1279243..1279246 /rpt_type=DIRECT /note="4 bp direct repeat, CTAG, generated by IS element on insertion. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139:1767-1772. Note that as the motif is palindromic it could be part of the inverted repeat itself." gene 1279243..1280220 /locus_tag="BQ2027_IS-LIKE-2" repeat_region 1279247..1279263 /rpt_type=INVERTED /note="17 bp imperfect inverted repeat, IRL,GGCGTGTCTCCCAAATT, flanking putative IS element. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139: 1767-1772." CDS 1279293..1279700 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1180" /product="POSSIBLE TRANSPOSASE" /note="Mb1180, -, len: 135 aa. Equivalent to Rv1149, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 135 aa overlap). Possible transposase. Identical to 117 aa N-terminal region of S21394|X65618 transposase of Mycobacterium tuberculosis (308 aa), FASTA scores: opt: 823, E(): 0, (99.1% identity in 117 aa overlap). Second copy is Rv1042c|MTCY10G2.07. TBparse score is 0.926. Mb1180 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025161" /db_xref="UniProtKB/TrEMBL:A0A1R3XXH4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99780.1" /translation="MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRF RTGSPWRDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKLLS VDSTNVRAHQHSAGACSDTLATGGTVGLQEIRR" CDS 1279738..>1280217 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1181" /product="POSSIBLE TRANSPOSASE (FRAGMENT)" /note="Mb1181, -, len: 160 aa. Equivalent to Rv1150, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv. Possible fragment of transposase (pseudogene). Identical to C-terminal part of S21394 transposase of putative Mycobacterium tuberculosis IS element (308 aa), FASTA scores: opt: 959, E(): 0, (99.3% identity in 145 aa overlap). The transposase described here may be made by a -1 frame shifting mechanism during translation that fuses Rv1149|MTCI65.16 and Rv1150|MTCI65.17. No evidence found to account for discrepancy with previously published sequence. Second copy is Rv1041c|MTCY10G2.08. TBparse score is 0.914. Mb1181 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XX79" /db_xref="InterPro:IPR002559" /db_xref="UniProtKB/TrEMBL:A0A1R3XX79" /protein_id="SIT99781.1" /translation="MTTKIHALTDQREAPVRIRLTAGQAGDNPQLLPLLDDYRHASTE YALGSTDFRLLADKAYSHPSTRAALRSKKIKHTIPERQDQIDRRKAKGSAGGRPPAFD AALYGLRNTVERGFHRLKQWRGIATRYDKYALTYLGGVLLACAVIHARVGTPKLGDTP " repeat_region complement(1280201..1280216) /rpt_type=INVERTED /note="17 bp imperfect inverted repeat, IRR,GGCGTGTCTCCCAATTT, flanking putative IS element. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139: 1767-1772." repeat_region 1280217..1280220 /rpt_type=DIRECT /note="4 bp direct repeat, CTAG, generated by IS element on insertion. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139 :1767-1772. Note that as motif palindromic could be part of inverted repeat itself." CDS complement(1280304..1281017) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1182C" /product="transcriptional regulatory protein" /note="Mb1182c, -, len: 237 aa. Equivalent to Rv1151c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 237 aa overlap). Probable transcriptional regulatory protein, similar to others AE000776|AE000776_10 Aquifex aeolicus (239 aa), FASTA scores: opt: 725, E(): 0, (46.4% identity in 237 aa overlap); ECAE0002125|g1787358 Escherichia coli (279 aa), FASTA scores: opt: 464, E(): 1.3e-23, (36.7% identity in 240 aa overlap). Protein product from Mb1182c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1182c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66814" /db_xref="InterPro:IPR003000" /db_xref="InterPro:IPR026590" /db_xref="InterPro:IPR026591" /db_xref="InterPro:IPR027546" /db_xref="InterPro:IPR029035" /db_xref="UniProtKB/Swiss-Prot:P66814" /protein_id="SIT99782.1" /translation="MRVAVLSGAGISAESGVPTFRDDKNGLWARFDPYELSSTQGWLR NPERVWGWYLWRHYLVANVEPNDGHRAIAAWQDHAEVSVITQNVDDLHERAGSGAVHH LHGSLFEFRCARCGVPYTDALPEMPEPAIEVEPPVCDCGGLIRPDIVWFGEPLPEEPW RSAVEATGSADVMVVVGTSAIVYPAAGLPDLALARGTAVIEVNPEPTPLSGSATISIR ESASQALPGLLERLPALLK" CDS 1281055..1281420 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1183" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1183, -, len: 121 aa. Equivalent to Rv1152, len: 121 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 121 aa overlap). Start uncertain. Probable transcriptional regulatory protein, some similarity to others e.g. YHCF_BACSU HYPOTHETICAL TRANSCRIPTIONAL REGULATOR (121 aa), FASTA scores: opt: 187, E(): 1.9e-06, (34.9% identity in 106 aa overlap). TBparse score is 0.876. Helix turn helix motif from aa 42-63 (+3.10 SD). Protein product from Mb1183 detected using SWATH mass spectrometry. Mb1183 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYG5" /db_xref="InterPro:IPR000524" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XYG5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99783.1" /translation="MELRDWLRVDVKAGKPLFDQLRTQVIDGVRAGALPPGTRLPTVR DLAGQLGVAANTVARAYRELESAAIVETRGRFGTFISRFDPTDAAMAAAAKEYVGVAR ALGLTKSDAMRYLTHVPDD" CDS complement(1281398..1282246) /codon_start=1 /transl_table=11 /gene="omt" /locus_tag="BQ2027_MB1184C" /product="PROBABLE O-METHYLTRANSFERASE OMT" /note="Mb1184c, omt, len: 282 aa. Equivalent to Rv1153c, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 282 aa overlap). Probable omt, O-methyltransferase (EC 2.1.1-), similar to TCMP_STRGA|P39887 Tetracenomycin polyketide synthesis O-methyltransferase tcmP (EC 2.1.1.-) from Streptomyces glaucescens (270 aa), FASTA scores: opt: 368, E(): 1.7e-17, (31.3% identity in 233 aa overlap). Protein product from Mb1184c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1184c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXJ6" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR016874" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ6" /protein_id="SIT99784.1" /translation="MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDTIIDDPMAV ALVESIDFGFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWRLD VAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSVDPAGGVFITA EGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGWSRLGLRTSLRYKVPRMPF SMSVAQAADLVNKVPGVVAVRDLRVPPGRGLWVNMALSTVYRLPVFDPLRPCLTLLEF SRPARG" CDS complement(1282243..1282884) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1185C" /product="HYPOTHETICAL PROTEIN" /note="Mb1185c, -, len: 213 aa. Equivalent to Rv1154c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 213 aa overlap). Hypothetical unknown protein, start uncertain. Protein product from Mb1185c detected using SWATH mass spectrometry. Mb1185c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012545" /db_xref="UniProtKB/TrEMBL:A0A1R3XXV6" /protein_id="SIT99785.1" /translation="MEFPLITANSLSSKTWRAMPRAYVAVASFSGGLVQSGMAKFAAF LRGVNVGGVNLKMAEVATALTDAGFCNVRTILASGNVLLESTCGAAEVREKTEATLRE RFGYDAWALIYDVDTVRTIVAAYPFECELEGYQSYVTFVADAAILDELSALADTAGPD ENISRGPDPLGVLYWQVPKGSTLDSTIGQTMGKKRYKSSTTTRNLRTLAKVLR" CDS 1282829..1283272 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1186" /product="possible pyridoxamine 5'-phosphate oxidase (pnp/pmp oxidase) (pyridoxinephosphate oxidase) (pnpox) (pyridoxine 5'-phosphate oxidase)" /note="Mb1186, -, len: 147 aa. Equivalent to Rv1155, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 147 aa overlap). Conserved hypothetical protein. Similar to other hypothetical proteins e.g. AL079356|SC6G9.20 Streptomyces coelicolor (144 aa), FASTA scores: opt: 478, E(): 2.8e-26, (55.7% identity in 140 aa overlap); and Mycobacterium tuberculosis proteins Rv1875, Rv0121c, Rv2074. Protein product from Mb1186 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1186 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXI7" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019920" /db_xref="UniProtKB/TrEMBL:A0A1R3XXI7" /protein_id="SIT99786.1" /translation="MARQVFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRK LLIQVSIAEPRAKTRNLRRDPRASILVDADDGWSYAVAEGTAQLTPPAAAPDDDTVEA LIALYRNIAGEHPDWDDYRQAMVTDRRVLLTLPISHVYGLPPGMR" CDS 1283430..1283618 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1186A" /note="unnamed protein product; Mb1186A, len: 62 aa. No equivalent in M. tuberculosis H37Rv. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions,Mb1186A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="InterPro:IPR035172" /db_xref="UniProtKB/TrEMBL:A0A1R3XXI2" /protein_id="SIT99787.1" /translation="MGESKSPQESSSEGETKRKFREALDRKMAQSSSGSDHKDGGGKQ SRAHGPVASRREFRRKSG" CDS 1283706..1284293 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1187" /product="Base excision DNA repair protein" /note="Mb1187, -, len: 195 aa. Equivalent to Rv1156, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 195 aa overlap). Conserved hypothetical protein, highly similar to CAC32318.1|AL583944 conserved hypothetical protein from Streptomyces coelicolor (197 aa). Protein product from Mb1187 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1187 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXI5" /db_xref="InterPro:IPR003265" /db_xref="InterPro:IPR011257" /db_xref="InterPro:IPR017658" /db_xref="UniProtKB/TrEMBL:A0A1R3XXI5" /protein_id="SIT99788.1" /translation="MPNLQLVQEPAADALLNANPFALLVGMLLDQQVPMETAFAGPKK IADRMGSFDAGDIADYDPDKFVALCSERPAIHRFPGSMAKRIQALAQIIVDRYDGDAA ALWTAGEPDGNELLRRLKGLPGFGEQKARIFLALLGKQYGVTPKGWQVAAGEFGQPGT YLSVADIVDAGSLGQVRSHKRQRKAAAKAEGKAPT" CDS complement(1284456..1285571) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1188C" /product="conserved ala-, pro-rich protein" /note="Mb1188c, -, len: 371 aa. Equivalent to Rv1157c, len: 371 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 371 aa overlap). Conserved hypothetical Ala-, Pro-rich protein, similar to other proline rich proteins and extensins e.g. GBU04267|g451543 sea-island cotton proline-rich protein of cotton fiber (214 aa), FASTA scores: opt: 305, E(): 3.9e-05, (35.7% identity in 182 aa overlap). Has hydrophobic stretch at N-terminus suggestive of secretion signal. First start taken. Mb1188c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXJ5" /db_xref="InterPro:IPR003882" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ5" /protein_id="SIT99789.1" /translation="MRRLTNTEHRENTTVASTWSVCKGLAAVVITSAAAFALCPNAAA DPATPQPNPTQQLPGLPALAQLSPIIQQAAMNPAQATQLLMAAASAFAGNPAVPTESK NVASSVNQFVAEPTNPDSAALGVPAPHGVALPEAIPVPHVPPLGAEPGVQAHLPTGID PSHAAGPAPAVAPTVTPPVAAPPASAPAPAPDAAQPVAVPGPPPAPPAPRAAAPAPAS AAPAPAAAPAPASGFGADAPPTQDFMYPSIGPNCVADGSNSIATALSVAGPAKIPLPG PGPGQTAYVFTAVGTPGPADVQRLPLNVTWVNLTTGKSGSATLRPRSDINPDGPTTLT VIADTGSGSIMSTIFGQVTTKDRQCQFMPTIGSTVVP" CDS complement(1285579..1286262) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1189C" /product="Conserved alanine and proline rich protein" /note="Mb1189c, -, len: 227 aa. Equivalent to Rv1158c, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 227 aa overlap). Conserved hypothetical Ala-, Pro-rich protein, similar to other proline rich proteins and extensins e.g. MMSAP62|g633250 house mouse (485 aa), FASTA scores: opt: 367, E(): 1.2e-08, (36.3% identity in 212 aa overlap). Has hydrophobic stretch at N-terminus suggestive of secretion signal. Mb1189c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XXI3" /protein_id="SIT99790.1" /translation="MPTIWTFVRAAAVLVGSSAALLTGGIAHADPAPAPAPAPNIPQQ LISSAANAPQILQNLATALGATPPLSAPKVAEPAPAAPGITATFPGLTPAAPAAAAAP ALTPSIPGVNAPIPGITPAAPALPVTAPAAAPTIPGVNAPIPGITAPAPAAAAVPASV PGVPSAKVDLPQLPYLPLQVPQQLSLPADLPALASGVIPAAPIAPTPPAPGAPALPPG PPSLLAALP" CDS 1286392..1287687 /codon_start=1 /transl_table=11 /gene="pime" /locus_tag="BQ2027_MB1190" /product="mannosyltransferase pime" /note="Mb1190, -, len: 431 aa. Equivalent to Rv1159, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 431 aa overlap). Conserved transmembrane protein, similar to others in Mycobacterium tuberculosis e.g. Rv2181|MTCY21D4.13 (560 aa), FASTA scores: opt: 172; E(): 0.00035, (25.0% identity in 332 aa overlap). Mb1190 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXJ0" /db_xref="InterPro:IPR018584" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ0" /protein_id="SIT99791.1" /translation="MCRTLIDGPVRSAIAKVRQIDTTSSTPAAARRVTSPPARETRAA VLLLVLSVGARLAWTYLAPNGANFVDLHVYVSGAASLDHPGTLYGYVYADQTPDFPLP FTYPPFAAVVFYPLHLVPFGLIALLWQVVTMAALYGAVRISQRLMGGTAETGHFAAML WTAIAIWIEPLRSTFDYGQINVLLMLAALWAVYTPRWWLSGLLVGVASGVKLTPAITA VYLVGVRRLHAAAFSVVVFLATVGVSLLVVGDEARYYFTDLLGDAGRVGPIATSFNQS WRGAISRILGHDAGFGPLVLAAIASTAVLAILAWRALDRSDRLGKLLVVELFGLLLSP ISWTHHWVWLVPLMIWLIDGPARERPGARILGWGWLVLTIVGVPWLLSFAQPSIWQIG RPWYLAWAGLVYVVATLATLGWIAASERYVRIRPRRMAN" CDS complement(1287684..1287968) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1191C" /product="Pterin-4-alpha-carbinolamine dehydratase (EC" /EC_number="4.2.1.96" /note="Mb1191c, -, len: 94 aa. Equivalent to Rv1159A, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 94 aa overlap). Hypothetical unknown protein. Protein product from Mb1191c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1191c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5S3" /db_xref="InterPro:IPR001533" /db_xref="InterPro:IPR036428" /db_xref="UniProtKB/Swiss-Prot:P0A5S3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99792.1" /translation="MAVLTDEQVDAALHDLNGWQRAGGVLRRSIKFPTFMAGIDAVRR VAERAEEVNHHPDIDIRWRTVTFALVTHAVGGITENDIAMAHDIDAMFGA" CDS 1287995..1288420 /codon_start=1 /transl_table=11 /gene="mutT2" /locus_tag="BQ2027_MB1192" /product="PROBABLE MUTATOR PROTEIN MUTT2 (7,8-dihydro-8-oxoguanine-triphosphatase) (8-OXO-DGTPASE)" /note="Mb1192, mutT2, len: 141 aa. Equivalent to Rv1160, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 141 aa overlap). Probable mutT, mutator protein or homolog (EC 3.6.1.-). More similar to D908197|g1742860 MutT homolog from Escherichia coli (135 aa), FASTA scores: opt: 226, E():1.1e-08, (39.7% identity in 116 aa overlap); than to MUTT_ECOLI|P08337 MUTATOR MUTT PROTEIN from Escherichia coli (129 aa), FASTA scores: opt: 180, E(): 1.2e-05, (27.1% identity in 129 aa overlap). Contains PS00893 mutT domain signature. Protein product from Mb1192 detected using SWATH mass spectrometry. Mb1192 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYH8" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="InterPro:IPR020476" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH8" /protein_id="SIT99793.1" /translation="MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGET ERAALARELAEELGLEVADLAVGDRVGDDIALNGTTTLRAYRVHLLGGEPRARDHRAL CWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC" CDS 1288728..1292426 /codon_start=1 /transl_table=11 /gene="narG" /locus_tag="BQ2027_MB1193" /product="respiratory nitrate reductase (alpha chain) narg" /note="Mb1193, narG, len: 1232 aa. Equivalent to Rv1161, len: 1232 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1232 aa overlap). Probable narG, respiratory nitrate reductase alpha chain (EC 1.7.99.4). Similar to others e.g. NARG_BACSU NITRATEREDUCTASE ALPHA CHAIN from Bacillus subtilis (1228 aa), FASTA scores: opt: 4218, E(): 0, (50.3% identity in 1229 aa overlap); etc. Also highly similar to N-terminal part of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE from Mycobacterium tuberculosis (85.1% identity in 281 aa overlap). Contains prokaryotic molybdopterin oxidoreductase signatures 1 and 2 (PS00551, PS00490). BELONGS TO THE PROKARYOTIC MOLYBDOPTERIN-CONTAINING OXIDOREDUCTASE FAMILY. Protein product from Mb1193 detected using SWATH mass spectrometry. Mb1193 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXK6" /db_xref="InterPro:IPR006468" /db_xref="InterPro:IPR006655" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006657" /db_xref="InterPro:IPR006963" /db_xref="InterPro:IPR009010" /db_xref="InterPro:IPR027467" /db_xref="InterPro:IPR037943" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK6" /protein_id="SIT99794.1" /translation="MTVTPHVGGPLEELLERSGRFFTPGEFSADLRTVTRRGGREGDV FYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDGIITWETQQTDYPSVGPDRPEYEPRG CPRGASFSWYSYSPTRVRYPYARGVLVEMYREAKTRLGDPVLAWADIQADPERRRRYQ QARGKGGLVRVSWAEASEMVAAAHVHTIKTYGPDRVAGFSPIPAMSMVSHAAGSRFVE LIGGVMTSFYDWYADLPVASPQVFGDQTDVPESGDWWDASYLVMWGSNVPITRTPDAH WMAEARYRGAKVVVVSPDYADNTKFADEWVRCAAGTDTALAMAMGHVILSECYVRNQV PFFVDYVRRYTDLPFLIKLEKRGDLLVPGKFLTAADIGEESENAAFKPALLDELTNTV VVPQGSLGFRFGEDGVGKWNLDLGSVVPALSVEMDKAVNGDRSAELVTLPSFDTIDGH GETVSRGLPVRRAGKHLVCTVFDLMLAHYGVARAGLPGEWPTGYHDRTQQNTPAWQES ITGVPAAQAIRFAKEFARNATESGGRSMIIMGGGICHWFHSDVMYRSVLALLMLTGSM GRNGGGWAHYVGQEKVRPLTGWQTMAMATDWSRPPRQVPGASYWYAHTDQWRYDGYGA DKLASPVGRGRFAGKHTMDLLTSATAMGWSPFYPQFDRSSLDVADEARAAGRDVGDYV AEQLAQHKLKLSITDPDNPVNWPRVLTVWRANLIGSSGKGGEYFLRHLLGTDSNVQSD PPTDGVHPRDVVWDSDIPEGKLDLIMSIDFRMTSTTLVSDVVLPAATWYEKSDLSSTD MHPYVHSFSPAIDPPWETRSDFGAFAAIARAFSALAKRHLGTRTDVVLTALQHDTPDE MAYPDGTERDWLATGEVPVPGRTMSKLTVVERDYTAIYDKWLTLGPLIDQFGMTTKGY TVHPFREVSELAANFGVMNSGVAVGRPAITTAKRMADVILALSGTCNGRLAVEGFLEL EKRTGQRLAHLAEGSEERRITYADTQARPVPVITSPEWSGSESGGRRYAPFTINIEHL KPFHTLTGRMHFYLAHDWVEELGEQLPVYRPPLDMARLFNQPELGPTDDGLGLTVRYL TPHSKWSFHSTYQDNLYMLSLSRGGPTMWMSPGDAAKINVRDNDWVEAVNANGIYVCR AIVSHRMPEGVVFVYHVQERTVDTPRTETNGKRGGNHNALTRVRIKPSHLAGGYGQHA FAFNYLGPTGNQRDEVTVVRRRSQEVRY" CDS 1292465..1294141 /codon_start=1 /transl_table=11 /gene="narH" /locus_tag="BQ2027_MB1194" /product="PROBABLE RESPIRATORY NITRATE REDUCTASE (BETA CHAIN) NARH" /note="Mb1194, narH, len: 558 aa. Equivalent to Rv1162, len: 558 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 558 aa overlap). Probable narH, respiratory nitrate reductase beta chain (EC 1.7.99.4). Similar to others e.g. NARH_BACSU|P42176 NITRATE REDUCTASE BETA CHAIN from Bacillus subtilis (487 aa), FASTA scores: opt: 2049, E(): 0, (56.8% identity in 488 aa overlap); etc. Contains PS00190 cytochrome c family heme-binding site signature. Protein product from Mb1194 detected using SWATH mass spectrometry. Mb1194 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXW6" /db_xref="InterPro:IPR006547" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR029263" /db_xref="InterPro:IPR038262" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW6" /protein_id="SIT99795.1" /translation="MKVMAQMAMVMNLDKCIGCHTCSVTCKQAWTNRSGTEYVWFNNV ETRPGVGYPRTYEDQERWRGGWVRDKKGRLRLRDGGRIHKLLRIFANPKLPTIGDYYE PWTYDYENLTSAPAGDTFPTAAPRSLISGNPMKVSWGSNWDDNLAGSPEIVPNDPVLK KVNQVNQEVKLKLEETFMFYLPRICEHCLNPSCVASCPSGAMYKRTEDGIVLVDQDRC RGWRMCVSGCPYKKVYFNHKTGKAEKCTLCYPRIEVGLPTVCSETCVGRLRYLGLVLY DVDQVLQAASVESDTDLYEAQRRILLDPHDPRVIAGARAEGIADEWIEAAQRSPVYAL INTYRVALPLHPEYRTMPMVWYIPPLSPVVDAVSRDGHDGEDLGNLFGALDALRIPIA YLAELFTAGDTEVVAGVLRRLAAMRCYMRDINLGRETQPHIPESVGMTEEQIYQMYRL LAVAKYEERYVIPTSYAGELPAAAMTDDMGCSLSVDGGPGMYESGPFGQGSPTPVPIA VESFHALQHAGSAATGGAGRSRVNLLNWDPNGAAAGLFPEPQPSKDVVQR" CDS 1294198..1294803 /codon_start=1 /transl_table=11 /gene="narJ" /locus_tag="BQ2027_MB1195" /product="PROBABLE RESPIRATORY NITRATE REDUCTASE (DELTA CHAIN) NARJ" /note="Mb1195, narJ, len: 201 aa. Equivalent to Rv1163, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 201 aa overlap). Probable narJ, respiratory nitrate reductase delta chain (EC 1.7.99.4). Similar to others e.g. P42178|NARJ_BACSU NITRATE REDUCTASE DELTA CHAIN from Bacillus subtilis (184 aa), FASTA scores: opt: 254, E(): 1.9e-10, (31.8% identity in 179 aa overlap); etc. Strong similarity to region from aa 260 -410 of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE from Mycobacterium tuberculosis (64.8% identity in 159 aa overlap). Protein product from Mb1195 detected using SWATH mass spectrometry. Mb1195 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXJ7" /db_xref="InterPro:IPR003765" /db_xref="InterPro:IPR020945" /db_xref="InterPro:IPR036411" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ7" /protein_id="SIT99796.1" /translation="MWQSASLLLAYPDDGLAERLHMVDALRAHQTGPAAALLGRTVAE LRALAPMAAAAQYVETFDMRRRSTMYLTYWTAGDTRNRGREMLAFATAYRDAGVKPPR TEAPDYLPVVLEFAATVDPEAGRRLLTEHRVPIDVLRGALADAKSPYEYTVAAICETL PAATNQEVRRAQRLAQSGPPAEAVGLQPFTLTVPPKRAEGA" CDS 1294806..1295546 /codon_start=1 /transl_table=11 /gene="narI" /locus_tag="BQ2027_MB1196" /product="PROBABLE RESPIRATORY NITRATE REDUCTASE (GAMMA CHAIN) NARI" /note="Mb1196, narI, len: 246 aa. Equivalent to Rv1164, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 246 aa overlap). Probable narI, respiratory nitrate reductase gamma chain (EC 1.7.99.4). Similar to others e.g. NARI_BACSU|P42177 NITRATE REDUCTASE GAMMA CHAIN from Bacillus subtilis (223 aa), FASTA scores: opt: 652, E(): 0; (41.6% identity in 221 aa overlap); etc. Highly similar to C-terminal part of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE (GAMMA CHAIN) from Mycobacterium tuberculosis (68.6% identity in 239 aa overlap). Mb1196 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXJ3" /db_xref="InterPro:IPR003816" /db_xref="InterPro:IPR023234" /db_xref="InterPro:IPR036197" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ3" /protein_id="SIT99797.1" /translation="MAVLDLVEIFWDAAPYVVVAIAVVGTWWRYRYDKFGWTTRSSQL YESRLLSIGSPMFHFGSLLVIMGHVMGLFIPDSWTRAFGMSDHLYHLQALLLGAPAGF ATLLGIGLLIYRRRIQTPVWLATTRNDKLMYLVLVCAIVAGLACTLMGATHEGDMHDY RRSVSVWFRSIWMLAPRGDLMAQATLYYQVHVLIALALFVLWPFTRLVHAFSAPIAYL FRPYIVYRSREVAAKHELIGSAPRRRGW" CDS 1295568..1297454 /codon_start=1 /transl_table=11 /gene="typA" /locus_tag="BQ2027_MB1197" /product="POSSIBLE GTP-BINDING TRANSLATION ELONGATION FACTOR TYPA (TYROSINE PHOSPHORYLATED PROTEIN A) (GTP-BINDING PROTEIN)" /note="Mb1197, typA, len: 628 aa. Equivalent to Rv1165, len: 628 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 628 aa overlap). Possible typA (alternate gene name: bipA), GTP-binding translation elongation factor, similar to several e.g. P32132|TYPA_ECOLI|BIPA|B387 Escherichia coli (591 aa); YIHK_SYNY3|P72749 GTP-binding protein TYPA/BIPA homolog from synechocystis sp. (597 aa), FASTA scores: E(): 0, (46.9% identity in 610 aa overlap); and to elongation factor EF-G from many organims e.g. EFG_MICLU|P09952 micrococcus luteus (701 aa), FASTA scores: E(): 3e-24, (29.8% identity in 500 aa overlap). BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, TYPA SUBFAMILY. Protein product from Mb1197 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1197 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXJ4" /db_xref="InterPro:IPR000640" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR004161" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR006298" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR035647" /db_xref="InterPro:IPR035651" /db_xref="InterPro:IPR042116" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ4" /protein_id="SIT99798.1" /translation="MPFRNVAIVAHVDHGKTTLVDAMLRQSGALRERGELQERVMDTG DLEREKGITILAKNTAVHRHHPDGTVTVINVIDTPGHADFGGEVERGLSMVDGVLLLV DASEGPLPQTRFVLRKALAAHLPVILVVNKTDRPDARIAEVVDASHDLLLDVASDLDD EAAAAAEHALGLPTLYASGRAGVASTTAPPDGQVPDGTNLDPLFEVLEKHVPPPKGEP DAPLQALVTNLDASTFLGRLALIRIYNGRIRKGQQVAWIRQVDGQQTVTTAKITELLA TEGVERKPTDAAVAGDIVAVAGLPEIMIGDTLAASANPVALPRITVDEPAISVTIGTN TSPLAGKVGGHKLTARMVRSRLDAELVGNVSIRVVDIGAPDAWEVQGRGELALAVLVE QMRREGFELTVGKPQVVTKTIDGTLHEPFESMTVDCPEEYIGAVTQLMAARKGRMVEM ANHTTGWVRMDFVVPSRGLIGWRTDFLTETRGSGVGHAVFDGYRPWAGEIRARHTGSL VSDRAGAITPFALLQLADRGQFFVEPGQQTYEGMVVGINPRPEDLDINVTREKKLTNM RSSTADVIETLAKPLQLDLERAMELCAPDECVEVTPEIVRIRKVELAAAARARSRART KARG" CDS 1297552..1299459 /codon_start=1 /transl_table=11 /gene="lpqW" /locus_tag="BQ2027_MB1198" /product="probable conserved lipoprotein lpqw" /note="Mb1198, lpqW, len: 635 aa. Equivalent to Rv1166, len: 635 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 635 aa overlap). Probable Lipoprotein LpqW, almost identical in part to G2384665|AF009358 Mycobacterium tuberculosis gene fragment ORFA2-898 (FRAGMENT) (59 aa) (93.9% identity in 49 aa overlap) (see citation below). Also similar to Rv1280c and Rv2585c. Contains possible N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1198 detected using SWATH mass spectrometry. Mb1198 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000914" /db_xref="InterPro:IPR039424" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK3" /protein_id="SIT99799.1" /translation="MGVPSPVRRVCVTVGALVALACMVLAGCTVSPPPAPQSTDTPRS TPPPPRRPTQIIMGIDWIGPGFNPHLLSDLSPVNAAISALVLPSAFRPIPDPNTPTGS RWEMDPTLLVSADVTNNHPFTVTYKIRPEAQWTDNAPIAADDFWYLWQQMVTQPGVVD PAGYHLITSVQSLEGGKQAVVTFAQPYPAWRELFTDILPAHIVKDIPGGFASGLARAL PVTGGQFRVENIDPQRDEILIARNDRYWGPPSKPGIILFRRAGAPAALADSVRNGDTQ VAQVHGGSAAFAQLSAIPDVRTARIVTPRVMQFTLRANVPKLADTQVRKAILGLLDVD LLAAVGAGTDNTVTLDQAQIRSPSDPGYVPTAPPAMSSAAALGLLEASGFQVDTNTSV SPAPSVPDSTTTSVSTGPPEVIRGRISKDGEQLTLVIGVAANDPTSVAVANTAADQLR DVGIAATVLALDPVTLYHDALNDNRVDAIVGWRQAGGNLATLLASRYGCPALQATTVP AANAPTTAPSAPIGPTPSAAPDTATPPPTAPRRPSDPGALVKAPSNLTGICDRSIQSN IDAALNGTKNINDVITAVEPRLWNMSTVLPILQDTTIVAAGPSVQNVSLSGAVPVGIV GDAGQWVKTGQ" CDS complement(1299487..1300092) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1199C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1199c, -, len: 201 aa. Equivalent to Rv1167c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 201 aa overlap). Probable transcriptional regulator, similar to several e.g. D1022772|D85417 hemR from Propionibacterium freudenreichii (243 aa), FASTA scores: opt: 268, E(): 5.4e-16, (35.9% identity in 198 aa overlap) and AL022268|SC4H2.32 Streptomyces coelicolor (111 aa), FASTA scores: opt: 274, E(): 5e-11, (55.1% identity in 89 aa overlap). Protein product from Mb1199c detected using SWATH mass spectrometry. Mb1199c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXJ2" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR011075" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3XXJ2" /protein_id="SIT99800.1" /translation="MTVSAPAKANPYRRRGEVLERALYDATLAELESAGYGGLTMEGI AARAQTGKAALYRRWAGKRELVLAAVQYALPPVPEPRADRSARENLLAVFTANCEILA GKTALPSMEIVSQLLHEPELRAIFINSVWAPRLRIVESILQAGVRSGEIDPATLTPMT ARIGPALIHQHVLFTGSPPDREQLTRIIDAMILTTGERRES" CDS complement(1300164..1300661) /codon_start=1 /transl_table=11 /gene="PPE17b" /locus_tag="BQ2027_MB1200C" /product="PPE FAMILY PROTEIN [SECOND PART]" /note="Mb1200c, PPE17b, len: 165 aa. Equivalent to 3' end of Rv1168c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. E332789|Z98268|MTCI125.27C (385 aa), FASTA scores: opt: 504, E(): 0, (36.6% identity in 388 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE17 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits PPE17 into 2 parts, PPE17a and PPE17b. Mb1200c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR022171" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK0" /protein_id="SIT99801.1" /translation="MIGSVSETVGSFAAPATKNLPSKLWTLLTKGTYPLTAARISSIP VEYVLAFVEGSNMGQMMGNLAMRSLTPTLKGPLELLPNAVRPAVSATLGNADTIGGLS VPPSWVADKSITPLAKAVPTSAPGGPSGTSWAQLGLASLAGGAVGAVAARTRSGVILR SPAAG" CDS complement(1300663..1301205) /codon_start=1 /transl_table=11 /gene="PPE17a" /locus_tag="BQ2027_MB1201C" /product="PPE FAMILY PROTEIN [FIRST PART]" /note="Mb1201c, PPE17a, len: 180 aa. Similar to 5' end of Rv1168c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (97.1% identity in 174 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. E332789|Z98268|MTCI125.27C (385 aa), FASTA scores: opt: 504, E(): 0, (36.6% identity in 388 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE17 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits PPE17 into 2 parts, PPE17a and PPE17b. Mb1201c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN0" /protein_id="SIT99802.1" /translation="MDFTIFPPEFNSLNIQGSARPFLVAANAWKNLSNELSYAASRFE SEINGLITSWRGPSSTIMAAAVAPFRAWIVTTASLAELVADHISVVAGAYEAAHAAHV PLPVIETNRLTRLALATTNIFGIHTPAIFALDALYAQYWSQDGEAMNLYATMAAAAAR LTPFSPPGADRQPGRAGQTL" CDS complement(1301223..1301525) /codon_start=1 /transl_table=11 /gene="lipx" /locus_tag="BQ2027_MB1202C" /product="pe family protein. possible lipase lipx." /note="Mb1202c, PE11, len: 100 aa. Equivalent to Rv1169c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 100 aa overlap). Member of the Mycobacterium tuberculosis PE family of proteins (see second citation below), e.g. O05297|Z93777|MTCI364.07 (99 aa), FASTA scores: opt: 209, E(): 1.6e-15, (37.4% identity in 99 aa overlap). Also simlar to the N-terminus of P77909|U76006 ESTERASE/LIPASE (EC 3.1.1.3) from Mycobacterium tuberculosis (437 aa), FASTA scores: opt: 193, E(): 4.4e-14, (37.2% identity in 94 aa overlap). Contains a helix-turn-helix motif from aa 88-109 (+2.76 SD). Mb1202c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYI5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99803.1" /translation="MSFVTTRPDSIGETAANLHEIGVTMSAHDDGVTPLITNVESPAH DLVSIVTSMLFSMHGELYKAIARQAHVIHESFVQTLQTSKTSYWLTELANRAGTST" CDS 1301705..1302616 /codon_start=1 /transl_table=11 /gene="mshB" /locus_tag="BQ2027_MB1203" /product="N-Acetyl-1-D-myo-Inosityl-2-Amino-2-Deoxy-alpha- D-Glucopyranoside Deacetylase mshB (GlcNAc-Ins deacetylase)" /note="Mb1203, mshB, len: 303 aa. Equivalent to Rv1170, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 303 aa overlap). mshB, N-Acetyl-1-D-myo-Inosityl-2-Amino-2-Deoxy-alpha-D- Glucopyra noside Deacetylase (GlcNAc-Ins deacetylase) (see citation below), similar to Q54358|X79146 lmbE gene from Streptomyces lincolnensis (270 aa), FASTA scores: opt: 308, E(): 1.2e-15, (32.0% identity in 278 aa overlap). Also similar to Rv1082|MCA Mycothiol conjugate amidase from Mycobacterium tuberculosis (288 aa). Protein product from Mb1203 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1203 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0H2" /db_xref="InterPro:IPR003737" /db_xref="InterPro:IPR017810" /db_xref="InterPro:IPR024078" /db_xref="UniProtKB/Swiss-Prot:Q7U0H2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99804.1" /translation="MSETPRLLFVHAHPDDESLSNGATIAHYTSRGAQVHVVTCTLGE EGEVIGDRWAQLTADHADQLGGYRIGELTAALRALGVSAPIYLGGAGRWRDSGMAGTD QRSQRRFVDADPRQTVGALVAIIRELRPHVVVTYDPNGGYGHPDHVHTHTVTTAAVAA AGVGSGTADHPGDPWTVPKFYWTVLGLSALISGARALVPDDLRPEWVLPRADEIAFGY SDDGIDAVVEADEQARAAKVAALAAHATQVVVGPTGRAAALSNNLALPILADEHYVLA GGSAGARDERGWETDLLAGLGFTASGT" CDS 1302708..1303148 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1204" /product="putative membrane protein" /note="Mb1204, -, len: 146 aa. Equivalent to Rv1171, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 146 aa overlap). Conserved hypothetical protein, possibly transmembrane protein. Start has been changed since first submission. Mb1204 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXX5" /db_xref="UniProtKB/TrEMBL:A0A1R3XXX5" /protein_id="SIT99805.1" /translation="MGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAG ALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFG GPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG" CDS complement(1303156..1304082) /codon_start=1 /transl_table=11 /gene="PE12" /locus_tag="BQ2027_MB1205C" /product="pe family protein pe12" /note="Mb1205c, PE12, len: 308 aa. Equivalent to Rv1172c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 308 aa overlap). Member of the Mycobacterium tuberculosis PE family of proteins e.g. P71748|Z81368|MTCY253.25C (361 aa), FASTA scores: opt: 483, E(): 7.8e-22, (46.4% identity in 192 aa overlap). Protein product from Mb1205c detected using shotgun mass spectrometry. Mb1205c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK7" /protein_id="SIT99806.1" /translation="MSFVFAAPEALAAAAADMAGIGSTLNAANVVAAVPTTGVLAAAA DEVSTQVAALLSAHAQGYQQLSRQMMTAFHDQFVQALRASADAYATAEASAAQTMVNA VNAPARALLGHPLISADASTGGGSNALSRVHSMFLGTGGSSALGGSAAANAAASGALQ LQPTGGASGLSAVGALLPRAGAAAAAALPALAAESIGNAIKNLYNAVEPWVQYGFNLT AWAVGWLPYIGILAPQINFFYYLGEPIVQAVLFNAIDFVDGTVTFSQALTNIETATAA SINQFINTEINWIRGFLPPLPPISPPGFPSLP" CDS 1304332..1306902 /codon_start=1 /transl_table=11 /gene="fbiC" /locus_tag="BQ2027_MB1206" /product="PROBABLE F420 BIOSYNTHESIS PROTEIN FBIC" /note="Mb1206, fbiC, len: 856 aa. Equivalent to Rv1173, len: 856 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 856 aa overlap). Probable fbiC, F420 biosynthesis protein, equivalent to AAL91922|FBIC F420 biosynthesis protein fbiC from Mycobacterium bovis BCG (856 aa) (see citation below). The N-terminus (aa 80-420) is similar to Y446_METJA|Q57888 hypothetical protein mj0446 from methanococcus jannaschii (361 aa), FASTA scores: opt: 801, E(): 0, (41.2% identity in 337 aa overlap); and the C-terminus region (aa 530-856) is similar to e.g. YE31_METJA|Q58826 hypothetical protein mj1431 from methanococcus jannaschii (359 aa), FASTA scores: opt: 1089, E(): 0, (48.7% identity in 337 aa overlap). Protein product from Mb1206 detected using SWATH mass spectrometry. Mb1206 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0G9" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR019939" /db_xref="InterPro:IPR019940" /db_xref="InterPro:IPR020050" /db_xref="InterPro:IPR034405" /db_xref="UniProtKB/Swiss-Prot:Q7U0G9" /protein_id="SIT99807.1" /translation="MPQPVGRKSTALPSPVVPPQANASALRRVLRRARDGVTLNVDEA AIAMTARGDELADLCASAARVRDAGLVSAGRHGPSGRLAISYSRKVFIPVTRLCRDNC HYCTFVTVPGKLRAQGSSTYMEPDEILDVARRGAEFGCKEALFTLGDRPEARWRQARE WLGERGYDSTLSYVRAMAIRVLEQTGLLPHLNPGVMSWSEMSRLKPVAPSMGMMLETT SRRLFETKGLAHYGSPDKDPAVRLRVLTDAGRLSIPFTTGLLVGIGETLSERADTLHA IRKSHKEFGHIQEVIVQNFRAKEHTAMAAFPDAGIEDYLATVAVARLVLGPGMRIQAP PNLVSGDECRALVGAGVDDWGGVSPLTPDHVNPERPWPALDELAAVTAEAGYDMVQRL TAQPKYVQAGAAWIDPRVRGHVVALADPATGLARDVNPVGMPWQEPDDVASWGRVDLG AAIDTQGRNTAVRSDLASAFGDWESIREQVHELAVRAPERIDTDVLAALRSAERAPAG CTDGEYLALATADGPALEAVAALADSLRRDVVGDEVTFVVNRNINFTNICYTGCRFCA FAQRKGDADAYSLSVGEVADRAWEAHVAGATEVCMQGGIDPELPVTGYADLVRAVKAR VPSMHVHAFSPMEIANGVTKSGLSIREWLIGLREAGLDTIPGTAAEILDDEVRWVLTK GKLPTSLWIEIVTTAHEVGLRSSSTMMYGHVDSPRHWVAHLNVLRDIQDRTGGFTEFV PLPFVHQNSPLYLAGAARPGPSHRDNRAVHALARIMLHGRISHIQTSWVKLGVRRTQV MLEGGANDLGGTLMEETISRMAGSEHGSAKTVAELVAIAEGIGRPARQRTTTYALLAA " CDS complement(1306946..1307278) /codon_start=1 /transl_table=11 /gene="tb8.4" /locus_tag="BQ2027_MB1207C" /product="low molecular weight t-cell antigen tb8.4" /note="Mb1207c, -, len: 110 aa. Equivalent to Rv1174c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 110 aa overlap). Hypothetical unknown protein. Mb1207c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXK5" /db_xref="InterPro:IPR016572" /db_xref="InterPro:IPR032407" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK5" /protein_id="SIT99808.1" /translation="MRLSLTALSAGVGAVAMSLTVGAGVASADPVDAVINTTCNYGQV VAALNATDPGAAAQFNASPVAQSYLRNFLAAPPPQRAAMAAQLQAVPGAAQYIGLVES VAGSCNNY" CDS complement(1307479..1309503) /codon_start=1 /transl_table=11 /gene="fadH" /locus_tag="BQ2027_MB1208C" /product="PROBABLE NADPH DEPENDENT 2,4-DIENOYL-COA REDUCTASE FADH (2,4-dienoyl coenzyme A reductase) (4-enoyl-CoA reductase)" /note="Mb1208c, fadH, len: 674 aa. Equivalent to Rv1175c, len: 674 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 674 aa overlap). Probable fadH, NADPH-dependent 2,4-dienoyl-CoA reductase (EC 1.3.1.34), highly similar to others e.g. NP_251782.1|NC_002516 2,4-dienoyl-CoA reductase FadH1 from Pseudomonas aeruginosa (679 aa); CAC01564.1|AL391039 2,4-dienoyl-CoA reductase [NADPH] from Streptomyces coelicolor (671 aa); P42593|FADH_ECOLI 2,4-dienoyl-CoA reductase from Escherichia coli (671 aa), FASTA scores: opt: 2344, E(): 0, (53.1% identity in 671 aa overlap); etc. Also similar to Rv3359|MTV004.16 PUTATIVE OXIDOREDUCTASE from Mycobacterium tuberculosis (396 aa). Protein product from Mb1208c detected using SWATH mass spectrometry. Mb1208c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXL3" /db_xref="InterPro:IPR001155" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL3" /protein_id="SIT99809.1" /translation="MTNPYPNLLSPLDLGFTTLRNRVVMGSMHTGLEDRARHIDRLAD YFAERARGGVGLIITGGYAPNRTGWLLPFASELVTSAQARRHRRINRAVHDSGAKILL QILHAGRYAYHPLAVSASPIKAPITPFRPRALSARGVEATIADFARCAQLARDAGYDG VEIMGSEGYLLNQFLAPRTNKRTDSWGGTPANRRRFPVEIIRRSRAAVGSDFIICYRL SMADYVAEGQSWDEIVALATEVEGAGATIINSGFGWHEARVPTIVTSVPGGAFVDISS AVAEHVTIPVVASNRINMPQAAERILAETQVRLISMARPMLSDPDWVLKAQSNRVDEI NTCISCNQACLDHAFARKTVSCLLNPRAGRETQLVLSPTRRARSVAVVGAGPAGLATA ANAAQRGHRVTLFEANDFIGGQFDMARRIPGKEEFSETIRYFSTILAKHGVEVRLGTR VAAQELTGYDEVVLATGVAPRIPAIPGIDHPMVLTYAEAITGVRPVGRTVAVVGAGGI GFDVTELLVTDSSPTLNLKEWKAEWGVADPREARGALTTPLPAPPAREVYLLQRTKGP QGKRLGKTTGWVHRASLKAKGVHQLSGVNYEQINDDGLHISFGPKRRRPQLLAVDNVV VCAGQEPVRDLESELRRHGINPHISGGAAVAAELDAKRAIKQGTELAARL" CDS complement(1309500..1310069) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1209C" /product="Transcriptional regulator, PadR family" /note="Mb1209c, -, len: 189 aa. Equivalent to Rv1176c, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 189 aa overlap). Conserved hypothetical protein, some similarity to P94443|D78508 hypothetical protein from Bacillus subtilis (182 aa), FASTA scores: opt: 219, E(): 1.7e-15, (25.1% identity in 183 aa overlap). Similar to Mycobacterium tuberculosis hypothetical protein Rv0047c. Mb1209c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005149" /db_xref="InterPro:IPR018309" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XXK1" /protein_id="SIT99810.1" /translation="MALPHAILVSLCEQASSGYELARRFDRSIGYFWTATHQQIYRTL RVMENNNWVRATTVLQHGRPDKKVYAISDSGRAELARWIAEPLSPTRPGRGSALTDSS TRDIAVKLRGAGYGDVAALYTQVTALRAERVKSLDTYRGIEKRTFADPSALDGAALHQ YLVLRGGIRAEESAIDWLDEVAEALQEKR" CDS 1310282..1310608 /codon_start=1 /transl_table=11 /gene="fdxC" /locus_tag="BQ2027_MB1210" /product="PROBABLE FERREDOXIN FDXC" /note="Mb1210, fdxC, len: 108 aa. Equivalent to Rv1177, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 108 aa overlap). Probable fdxC, ferredoxin (EC 1.-.-.-), equivalent to NP_302047.1|NC_002677 ferredoxin from Mycobacterium leprae (108 aa); P00215|FER_MYCSM FERREDOXIN from Mycobacterium smegmatis (106 aa), FASTA scores: opt: 705, E(): 0, (87.7% identity in 106 aa overlap). Also highly similar to many e.g. JH0239 ferredoxin precursor from Saccharopolyspora erythraea (105 aa); P24496|FER_SACER FERREDOXIN from Saccharopolyspora erythraea (106 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. BELONGS TO THE BACTERIAL TYPE FERREDOXIN FAMILY. COFACTOR: BINDS 1 4FE-4S CLUSTER AND A 3FE-4S CLUSTER. Protein product from Mb1210 detected using shotgun mass spectrometry. Mb1210 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXL0" /db_xref="InterPro:IPR000813" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR017900" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL0" /protein_id="SIT99811.1" /translation="MTYTIAEPCVDIKDKACIEECPVDCIYEGARMLYIHPDECVDCG ACEPVCPVEAIFYEDDVPEQWSHYTQINADFFAELGSPGGAAKVGMTENDPQAVKDLA PQSEDA" CDS 1310641..1311729 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1211" /product="PROBABLE AMINOTRANSFERASE" /note="Mb1211, -, len: 362 aa. Equivalent to Rv1178, len: 362 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 362 aa overlap). Probable aminotransferase (EC 2.6.1.-), weak similarity to many aspartate aminotransferases e.g. Q55679|D64000 SLL0006 aspartate aminotransferase from Synechocystis sp. (394 aa), FASTA scores: opt: 218, E(): 1.3e-25, (32.5% identity in 379 aa overlap). Contains PS00105 Aminotransferases class-I pyridoxal-phosphate attachment site. Also similar to Mycobacterium tuberculosis aminotransferases Rv2294, Rv0075, etc. Protein product from Mb1211 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1211 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZP9" /db_xref="InterPro:IPR004838" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR019880" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP9" /protein_id="SIT99812.1" /translation="MSASLPVFPWDTLADAKALAGAHPDGIVDLSVGTPVDPVAPLIQ EALAAASAAPGYPATAGTARLRESVVAALARRYGITRLTEAAVLPVIGTKELIAWLPT LLGLGGADLVVVPELAYPTYDVGARLAGTRVLRADALTQLGPQSPALLYLNSPSNPTG RVLGVDHLRKVVEWARGRGVLVVSDECYLGLGWDAEPVSVLHPSVCDGDHTGLLAVHS LSKSSSLAGYRAGFVVGDLEIVAELLAVRKHAGMMVPAPVQAAMVAALDDDAHERQQR ERYAQRRAALLPALGSAGFAVDYSDAGLYLWATRGEPCRDSVAWLAQRGILVAPGDFY GPGGAQHVRVALTATDERVAAAVGRLTC" CDS complement(1311757..1314576) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1212C" /product="DNA or RNA helicases of superfamily II" /note="Mb1212c, -, len: 939 aa. Equivalent to Rv1179c, len: 939 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 939 aa overlap). Hypothetical unknown protein. Protein product from Mb1212c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1212c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYJ4" /db_xref="InterPro:IPR006935" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ4" /protein_id="SIT99813.1" /translation="MDPHRDLESRAFAGNWRVYQQQALDAFDADVAAGDNRAYLVLPP GAGKTMIGLEAARRLGRRSLVLVPNTAVQAQWAAAWDNSFPSSDRSASKCGTERGLAS AMNVLTYQSLAVIDAETDSTVRREVLRNRDQQALLDLLHPNGRAVIERAATLGPWTLV LDECHHLLATWGALVSALASVLGAQTALIGLTATPATELTAWQHTLHDELFGTADFVI PTPALVREGDLAPYQELVYLTQPTPEEQAWIGTHRARFADLMLALIDQKVGSMSLAAW LHTRIVDRATREGNQIAWSTFERAEPDLACSGLRFAYDGLIPLPDGVRLREQHRIAPD AQDWVNVLTDFSVGHLQQSADPRDAHALTAIKRVLPGLGYRLTSRGVRVATSPVDRLC ALSESKIAATAHILDTEDAVLGARLRALVLCDFESMTGALPTSLKGAPVSEQSGSAQL VAAMLAASDHRRRTPLHALLVTGQTFACPAAIEDDLIAFCAERGALVTAEPLDAHPSL RVMRGTGGFTPRTWVALATEYFLAGRARVLVGTRSLLGEGWDCAAVNVNIDLTSATTQ AAITQMRGRAIRNDPSDGHKVADNWSVCCIATEHPRGDADYLRLVRKHDGYYAATPQG LIESGVTHCDPSLSPYGPPVTDTHAITARALQRVAERAQARSWWRIGEPYEGVDVATI RVRSRQPLGVAAPRIPASALTPPVPGQFSPVRLARGAVAAVSVVGASTATAVASANLG MLAGAGTAGAIVAAGVGLVATAAAAESRRLDHAPNALEQLAAVVADALYAAGGAQRGS AALRLASDPEGWIRCQLDGVPTEQSLRFTAALDELLAPLAEPRYLIGRKILTPPARPV ARRLFAVRAVVGLSLPGTVAWHAVPRWFARNKDRRQHLAQAWRKHIGPPRQLPADSPQ GQAILDLFRGDNPLSVTTQLRTTWR" CDS 1315003..1321260 /codon_start=1 /transl_table=11 /gene="pks3" /locus_tag="BQ2027_MB1213" /product="probable polyketide beta-ketoacyl synthase pks4" /note="Mb1213, pks3, len: 2085 aa. Equivalent to Rv1180 and Rv1181, len: 488 aa and 1582 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 488 aa overlap and 99.9% identity in 1582 aa overlap). Probable polyketide beta-ketoacyl synthase (EC 2.3.1.-), similar to the N-terminus of many polyketide synthases e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from mycobacterium bovis (2110 aa), FASTA scores: opt: 2115, E(): 0, (66.5% identity in 472 aa overlap). Also similar to, and same length as P96284|Z83858|MTCY24G1.02 M. tuberculosis (496 aa), FASTA scores: opt: 1424, E(): 0, (50.9% identity in 444 aa overlap). Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site, also PS00606 Beta-ketoacyl synthases active site. BELONGS TO THE BETA-KETOACYL-ACP SYNTHASES FAMILY. Probable polyketide synthase, similar to many e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from mycobacterium bovis (2110 aa), FASTA scores: opt: 3518, E(): 0, (59.7% identity in 1614 aa overlap). Note that this similarity extends upstream of the first initiation codon into the upstream MTV005.16; however the stop codon at the end of MTV005.16 is present in at least 4 independent clones (BAC, cosmid and pUC) from the genome. The two CDS's may represent separate modules of the polyketide synthase. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1180 and Rv1181 exist as 2 genes. In Mycobacterium bovis, a single base transversion (a-c) leads to a single product (similar to other organisms). Protein product from Mb1213 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1213 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXM6" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM6" /protein_id="SIT99814.1" /translation="MRTATATSVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPA DRWDANVYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGLTEREATAIDPQHRLLLEV SWEAIEHAGVDPATLAESQTGVFVGLTHGDYELLSADCGAAEGPYGFTGTSNSFASGR VAYTLGLHGPAVTVDTACSSGLTAVHQACRSLDDGESDLALAGGVVVTLEPRKSVSGS LQGMLSPTGRCHAFDEAADGFVSGEGCVVLLLKRLPDAVRDGDRVLAIVRGTAANQDG RTVNIAAPSAQAQIAVYQQALAAAGVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGT EGPCALTSVKTNFGHLQSASGPLGLMKTILALRHGVVPQNLHFCRLPDQLAEIDTELF VPQANTSWPDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEPAASGPELTPEAGGLAL FPVSATSAEQLHVTAARLADWVDQNGNAGSRVSMRDLGYTLSCRRAHRPVRTVVTASS FDELSAALRDVAGDQIPYQPAVGHDDRGPVWVFSGQGSQWPGMGTELLVAEPVFAATV AAMEPVIARESGFSVTEAMSAPQTVSGIDRVQPTIFAVQVALAAALKSYGVRPGAIIG HSLGEAAAAVVAGALSLHDGLRVICRRSRLMSRIAGSGAMASVELPGQQVLSELAIRG ISDVVLSVVASPTSTVVGGATQSIRDLVAAWEQQDVLAREVAVDVASHTPQVDPILDE LLEVLAEVDPTAPEIPYYSATLWDPRERPSFTGEYWVENLRYTVRFAAAVQAALKDGY RVFGELAPHPLLTYAVEQNAASLDMPIATLAAMRRGEQLPFGLRGFVADVHNAGAKVD FSVQYPDGRLVDAPLPSWTHRTLMLSREDSHRSHTGAVQAVHPLLGAHVHLLEEPERH VWQAGVGTGAHPWLGDHRIHNVAAFPGAAYCEMALAAARTTLGELSEVRDIKFEQTLL LDEQTVVSSAATIAAPGILQFAVESHQEGEPARRASAMLHALEEMPQPPGYDTNALTA AHESSMSGEELRKMFNSLGIQYGPAFSGLVAVHTARGAVTTVLAEVALPGAIRSQQSA YASHPALLDACFQSVLVHPEVQKATVGGLMLPVGVRRLRNYHSTRSAHYCLARVTSSS RAGECEADLDVFDQAGTVLLTVEGLRLAAGISEHERANRVFDERLLTIEWERGELPEV PQIDAGSWLLLSASEADPLTAQLADALNAVGAQSTSVASASDVAQLRSLLGGRLTGVV VVTGPPTGGLTQCGRDYVSQLVGIARELAELPGEPPRLFVVTRSAASVLPSDLANLEQ AGLRGLMRVIDSEHPHLGATAIDVDNDETVAALVASQLQSGSQEDETAWRNGIWYTAR LRPGPLRPAERRTAVVEYRRDGMRLQIRTPGDLESLEFVTFDRVAPGPGEIEVAVTAS SVNFADVLVAFGRYPTFEGYRQQLGIDFAGVVTAVGPDVTEHRIGDHVGGMSANGCWS TFVRCDARLAVTLPPELPVAAAAAVPTASATAWYALHDLARICSDDKVLIHSGTGGVG QAAIAIARAAGCEIFATAGSAQRRQLLHDMGVEHVYDSRSTEFAEQIRGDTDGYGVDV VLNSLPGAAQRAGIELLAFGGRFVEIGKRDIYGDTRLGLFPFRRNLSLYAVDLALLTH SHPHTVRRLLKTVYQHTVEGTLPVPQTTHYPIHDAAVAIRLVGGAGHTGKVVLDVPRT GEGVAVVPPEQVRTSRPDGAYLVTGGLGGLGLFLAGELAAAGCGRIVLNSRSTPSPHA TRVIERLRAAGADIQVECGDIADAATAHRVVAVATASGLPVRGVLHAAAVVEDATLAN VTDELIDRCWAPKVHGAWNIHRATAAQPLEWFCLFSSAAALVGSPGQGAYAAANSWLD AFAHWRRAQGLPATSIAWGAWAEIGRATALAEGTGAAIAPAEGARAFQTLLRYGRAYS GYAPIMGTPWLTAFAQRSRFAEAFHATGQNQPATGKFLAELGSLPREEWPRTVRRLVS DQISLLLRRTIDPDRPLSDYGLDSLGNLELRTRIETETGIRVSPTKITTVRGLAEHVC DELAAAQSAPV" CDS 1321313..1322731 /codon_start=1 /transl_table=11 /gene="papA3" /locus_tag="BQ2027_MB1214" /product="PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA3" /note="Mb1214, papA3, len: 472 aa. Equivalent to Rv1182, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 472 aa overlap). Probable papA3, conserved polyketide synthase (PKS) associated protein, similar to other Mycobacterial hypothetical proteins e.g. Q49618|U00010 B1170_C1_180 from Mycobacterium leprae (471 aa), FASTA scores: opt: 2526, E(): 0, (75.6% identity in 471 aa overlap). Similar to other Mycobacterium tuberculosis hypothetical papA proteins; Rv3824c, Rv3820c, Rv1528c. Protein product from Mb1214 detected using SWATH mass spectrometry. Mb1214 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3XXY4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99815.1" /translation="MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVP VSYMQAQHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYRSW FQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQWGCFRFGIVQG CDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAPLELPPAGSYDDFCRRQHT FSSTLTVESPQVRAWTKFAEGTNGSFPDFPLPLGDPSKPSDADIVTVMMLDEEQTAQF ESVCTAAGARFIGGVLACCGLAEHELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIP ITVPIAGSAFGDAARAAQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAG AAPLSVLLTAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR YLATLKSVFQRVAESGQQQNVA" CDS 1322798..1325806 /codon_start=1 /transl_table=11 /gene="mmpL10" /locus_tag="BQ2027_MB1215" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL10" /note="Mb1215, mmpL10, len: 1002 aa. Equivalent to Rv1183, len: 1002 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 1002 aa overlap). Probable mmpL10, conserved transmembrane transport protein (see first citation below), member of RND superfamily, similar to many Mycobacterial hypothetical membrane proteins e.g. Q49619|U00010 from Mycobacterium leprae (1008 aa), FASTA scores: opt: 4545, E(): 0, (70.6% identity in 978 aa overlap); etc. BELONGS TO THE MMPL FAMILY. Mb1215 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65373" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/Swiss-Prot:P65373" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99816.1" /translation="MVGCWVALALVLPMAVPSLAEMAQRHPVAVLPADAPSSVAVRQM AEAFHESGSENILVVLLTDEKGLGAADENVYHTLVDRLRNDAKDVVMLQDFLTTPPLR EVLGSKDGKAWILPIGLAGDLGTPKSYHAYTDVERIVKRTVAGTTLTANVTGPAATVA DLTDAGARDRASIELAIAVMLLVILMVIYRNPVTMLLPLVTIGASLMTAQALVAGVSL VGGLAVSNQAIVLLSAMIAGAGTDYAVFLISRYHEYVRLGEHPERAVQRAMMSVGKVI AASAATVGITFLGMRFAKLGVFSTVGPALAIGIAVSFLAAVTLLPAILVLASPRGWVA PRGERMATFWRRAGTRIVRRPKAYLGASLIGLVALASCASLAHFNYDDRKQLPPSDPS SVGYAAMEHHFSVNQTIPEYLIIHSAHDLRTPRGLADLEQLAQRVSQIPGVAMVRGVT RPNGETLEQARATYQAGQVGNRLGGASRMIDERTGDLNRLASGANLLADNLGDVRGQV SRAVAGVRSLVDALAYIQNQFGGNKTFNEIDNAARLVSNIHALGDALQVNFDGIANSF DWLDSVVAALDTSPVCDSNPMCGNARVQFHKLQTARDNGTLDKVVGLARQLQSTRSPQ TVSAVVNDLGRSLNSVVRSLKSLGLDNPDAARARLISMQNGANDLASAGRQVADGVQM LVDQTKNMGIGLNQASAFLMAMGNDASQPSMAGFNVPPQVLKSEEFKKVAQAFISPDG HTVRYFIQTDLNPFSTAAMDQVNTIIDTAKGAQPNTSLADASISMSGYPVMLRDIRDY YERDMRLIVAVTVVVVILILMALLRAIVAPLYLVGSVVISYMSAIGLGVVVFQVFLGQ ELHWSVPGLAFVVLVAVGADYNMLLASRLRDESALGVRSSVIRTVRCTGGVITAAGLI FAASMSGLLFSSIGTVVQGGFIIGVGILIDTFVVRTITVPAMATLLGRASWWPGHPWQ RCAPEEGQMSARMSARTKTVFQAVADGSKR" CDS complement(1325810..1326889) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1216C" /product="POSSIBLE EXPORTED PROTEIN" /note="Mb1216c, -, len: 359 aa. Equivalent to Rv1184c, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 359 aa overlap). Possible exported protein with potential N-terminal signal sequence. Similar to several Mycobacterial hypothetical proteins e.g. Q49633|U00010) Protein B1170_F3_112 from M. leprae (391 aa), FASTA scores: opt: 1422, E(): 0, (62.7% identity in 338 aa overlap). Also similar to Rv3822, Rv3539, Rv1430, Rv0151c, etc. Protein product from Mb1216c detected using SWATH mass spectrometry. Mb1216c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013228" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99817.1" /translation="MKRVIAGAFAVWLVGWAGGFGTAIAASEPAYPWAPGPPPSPSPV GDASTAKVVYALGGARMPGIPWYEYTNQAGSQYFPNAKHDLIDYPAGAAFSWWPTMLL PPGSHQDNMTVGVAVKDGTNSLDNAIHHGTDPAAAVGLSQGSLVLDQEQARLANDPTA PAPDKLQFTTFGDPTGRHAFGASFLARIFPPGSHIPIPFIEYTMPQQVDSQYDTNHVV TAYDGFSDFPDRPDNLLAVANAAIGAAIAHTPIGFTGPGDVPPQNIRTTVNSRGATTT TYLVPVNHLPLTLPLRYLGMSDAEVDQIDSVLQPQIDAAYARNDNWFTRPVSVDPVRG LDPLTAPGSIVEGARGLLGSPAFGG" CDS complement(1327054..1328790) /codon_start=1 /transl_table=11 /gene="fadD21" /locus_tag="BQ2027_MB1217C" /product="probable fatty-acid-amp ligase fadd21 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb1217c, fadD21, len: 578 aa. Equivalent to Rv1185c, len: 578 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 578 aa overlap). Probable fadD21, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to several from Mycobacteria e.g. NP_301895.1|NC_002677 possible acyl-CoA synthase from Mycobacterium leprae (579 aa); P71495|U75685 ACYL-COA SYNTHASE from Mycobacterium bovis (582 aa), FASTA scores: opt: 2388, E(): 0, (61.8% identity in 579 aa overlap); etc. SEEMS TO BELONG TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1217c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1217c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63524" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:P63524" /protein_id="SIT99818.1" /translation="MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEV FRRTRIVAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSHDE RVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLTGNSPSFRVKD LPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFGDRNGVAPPDTTIVSWLPF YHDMGLVLGIIAPILGGYRSELTSPLAFLQRPARWLHSLANGSPSWSAAPNFAFELAV RKTTDADIEGLDLGNVLGITSGAERVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEAT LYVASRNSGDKPEVVYFEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCI ECPAGTIGEIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEKLVTVIELK LLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGSIPTTTSGKIRRAACVE QYRLQQFTRLDG" CDS complement(1328967..1330583) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1218C" /product="Regulator of polyketide synthase expression" /note="Mb1218c, -, len: 538 aa. Equivalent to Rv1186c, len: 538 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 538 aa overlap). Conserved hypothetical protein, similar to AL117385|SC5G9.24 hypothetical protein from Streptomyces coelicolor (555 aa), FASTA scores: opt: 485, E(): 2.3e-23, (32.6% identity in 568 aa overlap). Contains helix turn helix motif from aa 488-509 (+2.81 SD). Mb1218c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99819.1" /translation="MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDV RLGLAAAAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRAGS AVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLADRIHGMISIED AQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWGIFDALRAGREVVRVAERP ELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQQGSQPLADDAEEMLRGAAVLAARIMS RLATQPNTHALRVQQLLGLAELNATTAPVDVSTIARELGVAAEGNATLIGFDTAENRD TAVRHVRLVDVMALSASAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRA ELGVALRAAIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHPNTVRYRIR RIEQLLSTSLGDPDVRLLFSLGLRAMERTA" CDS 1330668..1332299 /codon_start=1 /transl_table=11 /gene="rocA" /locus_tag="BQ2027_MB1219" /product="PROBABLE PYRROLINE-5-CARBOXYLATE DEHYDROGENASE ROCA" /note="Mb1219, rocA, len: 543 aa. Equivalent to Rv1187, len: 543 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 543 aa overlap). Probable rocA, pyrroline-5-carboxylate dehydrogenase (EC 1.5.1.12), similar to many e.g. PUT2_HUMAN|P30038 human delta-1-pyrroline-5-carboxylate dehydrogenase (563 aa), FASTA scores: opt: 1596, E():0, (46.0% identity in 531 aa overlap). Also similar to other Mycobacterium tuberculosis hypothetical dehydrogenases e.g. Rv0768, Rv2858c, etc. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site and PS00070 Aldehyde dehydrogenases cysteine active site. Protein product from Mb1219 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1219 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXL2" /db_xref="InterPro:IPR005931" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016160" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="UniProtKB/TrEMBL:A0A1R3XXL2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99820.1" /translation="MDAITQVPVPANEPVHDYAPKSPERTRLRTELASLADHPIDLPH VIGGRHRMGDGERIDVVQPHRHAARLGTLTNATHADAAAAVEAAMSAKSDWAALPFDE RAAVFLRAADLLAGPWREKIAAATMLGQSKSVYQAEIDAVCELIDFWRFNVAFARQIL EQQPISGPGEWNRIDYRPLDGFVYAITPFNFTSIAGNLPTAPALMGNTVIWKPSITQT LAAYLTMQLLEAAGLPPGVINLVTGDGFAVSDVALADPRLAGIHFTGSTATFGHLWQW VGTNIGRYHSYPRLVGETGGKDFVVAHASARPDVLRTALIRGAFDYQGQKCSAVSRAF IAHSVWQRMGDELLAKAAELRYGDITDLSNYGGALIDQRAFVKNVDAIERAKGAAAVT VAVGGEYDDSEGYFVRPTVLLSDDPTDESFVIEYFGPLLSVHVYPDERYEQILDVIDT GSRYALTGAVIADDRQAVLTALDRLRFAAGNFYVNDKPTGAVVGRQPFGGARGSGTND KAGSPLNLLRWTSARSIKETFVAATDHIYPHMAVD" CDS 1332299..1333288 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1220" /product="PROBABLE PROLINE DEHYDROGENASE" /note="Mb1220, -, len: 329 aa. Equivalent to Rv1188, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 329 aa overlap). Possible putA, proline dehydrogenase (EC 1.5.99.8), similar to part of Q52711|X78346 proline dehydrogenase from Rhodobacter capsulatus (1127 aa), FASTA scores: opt: 194, E(): 1.5e-07, (31.2% identity in 349 aa overlap). Also similar to two Bacillus subtilis proline dehydrohenases E1184363|Z99120 (302 aa), FASTA scores: opt: 509, E(): 0, (37.1% identity in 313 aa overlap); and E1182272|Z99105 (303 aa), FASTA scores: opt: 513, E(): 0, (32.5% identity in 311 aa overlap). Highly similar to AL035569|SC8D9.31 Streptomyces coelicolor (308 aa), FASTA scores: opt: 984, E(): 0, (50.0% identity in 312 aa overlap). Protein product from Mb1220 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1220 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXM0" /db_xref="InterPro:IPR002872" /db_xref="InterPro:IPR008219" /db_xref="InterPro:IPR015659" /db_xref="InterPro:IPR029041" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM0" /protein_id="SIT99821.1" /translation="MAGWFAHTLRPAMLAAGRSDRLGRIVERSPLTRGVVRRFVPGDT LDDVVDIVTALRDSGRYLSIDYLGENVTDADDAAAAVRAYLGLLDVLGRRGDIACDGV RPLEVSLKLSALGQALDRDGQKIALDNARAICERAERVGAWVTVDAEDHTTTDSTLSI SGDLRVDFPWLGTVVQAYLRRTLADCAELAAVGARVRLCKGAYDEPASVAYRDAAQVT DSYLRCLRVLTAGRGYPMVATHDPVIIAAVPGITRESGRSQGDFEYQMLYGVRDDEQR RLTGAGNHVRVYVPFGTRWYGYFLRRLAERPANLAFFLRALTDRRRARGCAER" CDS 1333370..1334242 /codon_start=1 /transl_table=11 /gene="sigI" /locus_tag="BQ2027_MB1221" /product="POSSIBLE ALTERNATIVE RNA POLYMERASE SIGMA FACTOR SIGI" /note="Mb1221, sigI, len: 290 aa. Equivalent to Rv1189, len: 290 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 290 aa overlap). Possible sigI, alternative RNA polymerase sigma factor (see citation below), similar to several e.g. O05767|U87307 extracytoplasmic function alternative sigma factor (sigE) from M. smegmatis (204 aa), FASTA scores: opt: 239, E(): 1.3e-09, (32.9% identity in 167 aa overlap). Mb1221 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZQ9" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3XZQ9" /protein_id="SIT99822.1" /translation="MSQHDPVSAAWRAHRAYLVDLAFRMVGDIGVAEDMVQEAFSRLL RAPVGDIDDERGWLIVVTSRLCLDHIKSASTRRERPQDIAAWHDGDASVSSVDPADRV TLDDEVRLALLIMLERLGPAERVVFVLHEIFGLPYQQIATTIGSQASTCRQLAHRARR KINESRIAASVEPAQHRVVTRAFIEACSNGDLDTLLEVLDPGVAGEIDARKGVVVVGA DRVGPTILRHWSHPATVLVAQPVCGQPAVLAFVNRALAGVLALSIEAGKITKIHVLVQ PSTLDPLRAELGGG" CDS 1334258..1335136 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1222" /product="Hydrolase, alpha/beta fold family" /note="Mb1222, -, len: 292 aa. Equivalent to Rv1190, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 292 aa overlap). Conserved hypothetical protein, similar to Rv1833c|Y0DA_MYCTU|Q50600 hypothetical 32.2 kd protein cy1a11.10 (286 aa), fasta scores: opt: 331, E(): 1.4e-15, (29.0% identity in 272 aa overlap), also YU14_MYCTU|Q50670 putative haloalkane dehalogenase (300 aa), FASTA scores: opt: 239, E(): 2.2e-09, (29.9% identity in 298 aa overlap). Protein product from Mb1222 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK6" /protein_id="SIT99823.1" /translation="MTMKSLAALDRPSWLSSSAWPWQPYLLSHHQGGIAVTDIGDGPA VLFVHVGSWSFVWRDVLLRLANDFRCVAIDAPGCGLSDRLSTPPTLAQAADAITSVID ALQLRDLTLVAHDLGGPAGFLAAARRGDRVAALAAVNCFAWRPTGPLFRGMLAAMGSA PVRELDAAINALARATSTRFGAGRHWSRADRAAFRAGIDAPARRAWHAYFRDARRAHA LYTDVDAALRGGLADRPLLTIFGQFNDPLRFQPRWKELFPTARQLQVRRGNHFPMCDD PDLVAGALTSFVQRST" CDS 1335209..1336123 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1223" /product="Putative hydrolase" /note="Mb1223, -, len: 304 aa. Equivalent to Rv1191, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 304 aa overlap). Conserved hypothetical protein, similar to Q54528 RDMC from Streptomyces purpurascens (298 aa), FASTA scores: opt: 196, E(): 1.5e-05, (27.5% identity in 269 aa overlap); Rv0134|MTCI5.08 (300 aa), FASTA scores: opt: 197, E(): 6.6e-06, (26.4% identity in 299 aa overlap), some similarity to PIP_NEIGO|P42786 proline iminopeptidase (EC 3.4.11.5) (310 aa), FASTA scores: opt: 196, E(): 1.3e-05, (32.2% identity in 152 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature. Protein product from Mb1223 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1223 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXN6" /protein_id="SIT99824.1" /translation="MAVAIARPKLEGNIAVGEDRRIGFAEFGAPQGRAVFWLHGTPGA RRQIPTEARVYAEHHNIRLIGVDRPGIGASTPHQYETILAFADDLRTIADTLGIDKMA VVGLSGGGPYTLACAAGLPDRVVAAGVLGGVAPTRGPDAISGGLMRLGSAVAPLLQVG GTPLRLGASLLIRAARPVASPALDLYGLLSPRADRHLLARPEFKAMFLDDLLNGSRKQ LAAPFADVIAFARDWGFRLDEVKVPVRWWHGDHDHIVPFSHGEHVVSRLPDAKLLHLP GESHLAGLGRGEEILSTLMQIWDRDLRK" CDS 1336205..1337032 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1224" /product="unknown protein" /note="Mb1224, -, len: 275 aa. Equivalent to Rv1192, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 275 aa overlap). Hypothetical unknown protein, contains PS00120 lipases, serine active site. Protein product from Mb1224 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1224 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXZ3" /protein_id="SIT99825.1" /translation="MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRM LPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDL LDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWS FNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRS SHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA" CDS 1337072..1338493 /codon_start=1 /transl_table=11 /gene="fadD36" /locus_tag="BQ2027_MB1225" /product="PROBABLE FATTY-ACID-COA LIGASE FADD36 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb1225, fadD36, len: 473 aa. Equivalent to Rv1193, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 473 aa overlap). Probable fadD36, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to Q50017|U15181 4-coumarate-CoA ligase from Mycobacterium leprae (476 aa), FASTA scores: opt: 2594, E(): 0, (81.3% identity in 476 aa overlap). Also highly similar to others e.g. CAB86109.1|AL163003 putative fatty acid synthase from Streptomyces coelicolor (485 aa); LCFA_ECOLI|P29212 long-chain-fatty-acid--CoA ligase from Escherichia coli (561 aa), FASTA scores: opt: 605, E(): 8.4e-30, (33.0% identity in 364 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1225 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1225 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXM7" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99826.1" /translation="MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVA GAHRVAVLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGPLP DDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLSRRAIAADLDA LAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFVHTGKPTPAGYAQACYEAH GTLFFGVPTVWSRVAADQAAAGALKPARLLVSGSAALPVPVFDKLVQLTGHRPVERYG ASESLITLSTRADGERRPGWVGLPLAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLF DGYLNQPDATAAAFDADSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIE TVLLGHPDVAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV RIVDALPRNALGKVLKKQLLSEG" CDS complement(1338526..1339791) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1226C" /product="Possible regulatory protein Trx" /note="Mb1226c, -, len: 421 aa. Equivalent to Rv1194c, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 421 aa overlap). Conserved hypothetical protein, highly similar to Q50018 possible transcriptional activator from Mycobacterium leprae (517 aa), FASTA scores: opt: 1960, E(): 0, (69.8% identity in 421 aa overlap). Also similar to Mycobacterium tuberculosis Rv2370c|MTCY27.10, (62.0% identity in 421 aa overlap) and Rv1453|MTCY493.01c. Protein product from Mb1226c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1226c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR041522" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM4" /protein_id="SIT99827.1" /translation="MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIA NDPVLAKVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVAFN IYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTGIAAQVQSEHD ELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAHTAAIIWSDELDGDHSYLD RAADLFCHAVGSTRPLTVVAGAASRWAWVTDADGLDIDTVQAAVDNAPGARIAIGTTA NGVEGFRRSHLEALITQRTLSRLRSTQRVAFFADVKMVALISQNPDAASEFITSTLGD LESASPDLQTALLTFINEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHV AVALEALQWRGNKAHALSSPGRRSNSVPA" CDS 1340281..1340580 /codon_start=1 /transl_table=11 /gene="PE13" /locus_tag="BQ2027_MB1227" /product="pe family protein pe13" /note="Mb1227, PE13, len: 99 aa. Equivalent to Rv1195, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 99 aa overlap). Member of Mycobacterium tuberculosis PE family (see first citation below), e.g. Y0DP_MYCTU|Q50615 hypothetical glycine-rich 40.8 kd protein (498 aa), FASTA scores: opt: 307, E(): 1.4e-12, (56.3% identity in 96 aa overlap), similar to MTCY21C12.10c (99 aa), FASTA scores: opt:295, E(): 1.9e-11, (51.5% identity in 97 aa overlap). Mb1227 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXM5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99828.1" /translation="MSFVMAYPEMLAAAADTLQSIGATTVASNAAAAAPTTGVVPPAA DEVSALTAAHFAAHAAMYQSVSARAAAIHDQFVATLASSASSYAATEVANAAAAS" CDS 1340627..1341799 /codon_start=1 /transl_table=11 /gene="PPE18" /locus_tag="BQ2027_MB1228" /standard_name="mtb39a" /product="ppe family protein ppe18" /note="Mb1228, PPE18, len: 390 aa. Equivalent to Rv1196, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 391 aa overlap). PPE18 (alternate gene name: mtb39a). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to others e.g. Y07P_MYCTU|Q11031 hypothetical 40.0 kDa protein cy02b10.25c (396 aa), FASTA scores: opt: 2124, E(): 0, (85.1% identity in 397 aa overlap). Note that expression of Rv1196 was demonstrated in lysates by immunodetection (see first citation below). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 14 bp to 11 bp substitution leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (390 aa versus 391 aa). Protein product from Mb1228 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1228 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XXN5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99829.1" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAAS AFQSVVWGLTVGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYETAY GLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAAAMFGYAAATA TATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQLMNNVPQALQQLAQPTQGT TPSSKLGGLWKTVSPHLSPISNMVSMANNHVSMTNSGVSMTNTLSSMLKGFAPAAAQA VETAAQNGVRAMSSLGSSLGSSGLGGGVAANLGRAASVGSLSVPQAWAAANQAVTPAA RALPLTSLTSAAERGPGQMLGGLPVGQMGARAGGGLSGVLRVPPRPYVMPHSPAAG" CDS 1341934..1342230 /codon_start=1 /transl_table=11 /gene="esxK" /locus_tag="BQ2027_MB1229" /standard_name="ES6_3; TB11.0; QILSS" /product="esat-6 like protein esxk (esat-6 like protein 3)" /note="Mb1229, esxK, len: 98 aa. Equivalent to Rv1197, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 98 aa overlap). esxK, putative ESAT-6 like protein 3. Member of M. tuberculosis hypothetical QILSS protein family with Rv1038c, etc. Almost identical to MTCY98.023c (98 aa) (99.0% identity in 98 aa overlap) and MTCY10G2.11 (98 aa), FASTA scores: opt: 643, E(): 0, (99.0% identity in 98 aa overlap); highly similar to Q49945|U1756C from Mycobacterium leprae (100 aa), FASTA scores: opt: 377, E(): 8e-21, (58.3% identity in 96 aa overlap). BELONGS TO THE ESAT6 FAMILY. Mb1229 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0DOB1" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P0DOB1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99830.1" /translation="MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" CDS 1342281..1342565 /codon_start=1 /transl_table=11 /gene="esxL" /locus_tag="BQ2027_MB1230" /standard_name="ES6_4; Mtb9.9C" /product="putative esat-6 like protein esxl (esat-6 like protein 4)" /note="Mb1230, esxL, len: 94 aa. Equivalent to Rv1198, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 94 aa overlap). esxL, putative ESAT-6 likeprotein 4. Member of the ESAT-6 family with Rv3619c, Rv1037c, etc. Almost identical to MTCY10G2.12 (94 aa) (97.9% identity in 94 aa overlap) and MTCY98.022c (94 aa) (94.7% identity in 94 aa overlap). Highly similar to Q49946|U1756D Mycobacterium leprae (95 aa), FASTA scores: opt: 403, E(): 1.1e-22, (64.1% identity in 92 aa overlap). Protein product from Mb1230 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1230 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59804" /db_xref="InterPro:IPR009416" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P59804" /protein_id="SIT99831.1" /translation="MTINYQFGDVDDHGAMIRAQAGLLEAEHQAIIRDVLTASDFWGG AGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" repeat_region 1342563..1342570 /rpt_type=DIRECT /note="8 bp direct repeat, TGACACCA, flanking IS element IS1081." mobile_element complement(1342571..1344005) /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081-2" /note="IS1081-2, len: 1435 nt. Equivalent to IS1081, len: 1450 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 1435 nt overlap). Almost identical to IS1081" gene complement(1342571..1344005) /locus_tag="BQ2027_IS1081-2" repeat_region 1342609..1342623 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS complement(1342633..1343880) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1231C" /product="POSSIBLE TRANSPOSASE" /note="Mb1231c, -, len: 415 aa. Equivalent to Rv1199c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 415 aa overlap). Possible transposase for IS1081, identical to TRA1_MYCBO|P35882 transposase for insertion sequence element (415 aa); region identical to MTCY441.35 (100.0% identity in 261 aa overlap); and almost identical to MTCY10G2.02c (415 aa) (99.8% identity in 415 aa overlap). Contains PS01007 Transposases, Mutator family, signature, PS00435 Peroxidases proximal heme-ligand signature." /db_xref="GOA:P60231" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/Swiss-Prot:P60231" /protein_id="SIT99832.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" repeat_region complement(1343918..1343932) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." repeat_region 1344006..1344013 /rpt_type=DIRECT /note="8 bp direct repeat, TGACACCA, flanking IS element IS1081." CDS 1344217..1345494 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1232" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb1232, -, len: 425 aa. Equivalent to Rv1200, len: 425 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 425 aa overlap). Probable conserved integral membrane transport protein, possibly member of major facilitator superfamily (MFS), similar to others e.g. YHJE_ECOLI|P37643 hypothetical metabolite transport protein from Escherichia coli (440 aa), FASTA scores: opt: 1047, E(): 0, (39.1% identity in 427 aa overlap); etc. Contains PS00217 Sugar transport proteins signature 2. Protein product from Mb1232 detected using SWATH mass spectrometry. Mb1232 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYL7" /db_xref="InterPro:IPR004736" /db_xref="InterPro:IPR005828" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XYL7" /protein_id="SIT99833.1" /translation="MKRVALACLVGSAIEFYDFLIYGTAAALVFPTVFFPHLDPTVAA VASMGTFAVAFLSRPFGAAVFGYFGDRLGRKKTLVATLLIMGLATVTVGLVPTTVAIG AAAPLILTTMRLLQGFAVGGEWAGSALLSAEYAPASKRGWYGMFTVVGGGIALVLTSL TFLGVNYTIGESSPTFMQWGWRIPFLVSAALIAVALYVRFNIDETPVFARERADEKTR LGPAETPIAQVLRRQRREIVLAAGSAVCCFGFVYLASTYLASYAQTRLGYSRGSILFD SVLGGLLCIVFTALSSALCDQLGRRRVLLAGWAVALPWSLLVMPLIDSGSPSLFAVAV VGMYAIGGFGFGPTASFIPELFATSYRYTGSALAANLAGVAGGALPPVIAGALVATYG SWAIGVMLAILALISLVCTYRLPETAGSALVSR" CDS complement(1345491..1346444) /codon_start=1 /transl_table=11 /gene="dapd" /locus_tag="BQ2027_MB1233C" /product="tetrahydrodipicolinate n-succinyltransferase dapd" /note="Mb1233c, -, len: 317 aa. Equivalent to Rv1201c, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 317 aa overlap). Probable transferase (EC 2.-.-.-). Highly similar to Q49948|U1756F Mycobacterium leprae (317 aa), FASTA scores: opt: 1776, E(): 0, (84.9% identity in 317 aa overlap), also Q46064 ORF3 protein from CORYNEBACTERIUM GLUTAMICUM (316 aa), FASTA scores: opt: 864, E(): 0, (44.1% identity in 311 aa overlap). Protein product from Mb1233c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1233c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0E7" /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR011004" /db_xref="InterPro:IPR019875" /db_xref="InterPro:IPR026586" /db_xref="InterPro:IPR032784" /db_xref="InterPro:IPR038361" /db_xref="UniProtKB/Swiss-Prot:Q7U0E7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99834.1" /translation="MSTVTGAAGIGLATLAADGSVLDTWFPAPELTESGTSATSRLAV SDVPVELAALIGRDDDRRTETIAVRTVIGSLDDVAADPYDAYLRLHLLSHRLVAPHGL NAGGLFGVLTNVVWTNHGPCAIDGFEAVRARLRRRGPVTVYGVDKFPRMVDYVVPTGV RIADADRVRLGAHLAPGTTVMHEGFVNYNAGTLGASMVEGRISAGVVVGDGSDVGGGA SIMGTLSGGGTHVISIGKRCLLGANSGLGISLGDDCVVEAGLYVTAGTRVTMPDSNSV KARELSGSSNLLFRRNSVSGAVEVLARDGQGIALNEDLHAN" CDS 1346535..1347599 /codon_start=1 /transl_table=11 /gene="dapE" /locus_tag="BQ2027_MB1234" /product="PROBABLE SUCCINYL-DIAMINOPIMELATE DESUCCINYLASE DAPE" /note="Mb1234, dapE, len: 354 aa. Equivalent to Rv1202, len: 354 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 354 aa overlap). Probable dapE, succinyl-diaminopimelate desuccinylase (EC 3.5.1.18), similar to DAPE_CORGL|Q59284 succinyl-diaminopimelate desuccinylase from Corynebacterium glutamicum (369 aa), FASTA scores: opt: 1301, E(): 0, (55.7% identity in 359 aa overlap), highly similar to Q49949|U1756G (400 aa), FASTA scores: opt: 2045, E(): 0, (87.0% identity in 354 aa overlap). Protein product from Mb1234 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1234 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXZ9" /db_xref="InterPro:IPR001261" /db_xref="InterPro:IPR002933" /db_xref="InterPro:IPR010174" /db_xref="InterPro:IPR011650" /db_xref="InterPro:IPR036264" /db_xref="UniProtKB/TrEMBL:A0A1R3XXZ9" /protein_id="SIT99835.1" /translation="MLDLRGDPIELTAALIDIPSESRKEARIADEVEAALRAQASGFE IIRNGNAVLARTKLNRSSRVLLAGHLDTVPVAGNLPSRRENDQLHGCGAADMKSGDAV FLHLAATLAEPTHDLTLVFYDCEEIDSAANGLGRIQRELPDWLSADVAILGEPTAGCI EAGCQGTLRVVLSVTGTRAHSARSWLGDNAIHKLGAVLDRLAVYRARSVDIDGCTYRE GLSAVRVAGGVAGNVIPDAASVTINYRFAPDRSVAAALQHVHDVFDGLDVQIEQTDAA AGALPGLSEPAAKALVEAAGGQVRAKYGWTDVSRFAALGIPAVNYGPGDPNLAHCRDE RVPVGNITAAVDLLRRYLGG" CDS complement(1347596..1348180) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1235C" /product="AAA-ATPase, domain of unknown function, and LuxR DNA-binding domain" /note="Mb1235c, -, len: 194 aa. Equivalent to Rv1203c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 194 aa overlap). Hypothetical unknown protein. Mb1235c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XXN7" /protein_id="SIT99836.1" /translation="MLLAYVLITKGEFGAAASMLEPAAATLERTGYSWGPLSLMLLAT AIAQQGHIAESAKTLQRAEARHGTKSALFAPELGLARAWTRAAAQDMTGAIAAAREAA RTAERAGQAAVALCAWHNAVRLGDIRAVDPVTRLAAEIDCTVGNILVKHARGLADGDA AELTAVAEELAGIGMAAAAADATKAAARLGPQQR" CDS complement(1348211..1349899) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1236C" /product="AAA-ATPase, domain of unknown function, and LuxR DNA-binding domain" /note="Mb1236c, -, len: 562 aa. Equivalent to Rv1204c, len: 562 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 562 aa overlap). Conserved hypothetical protein, some similarity to Q55103 CHO-ORF2 from STREPTOMYCES SP. (642 aa), FASTA scores: opt: 215, E(): 3.6e-06, (26.4% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A. Mb1236c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XXN3" /protein_id="SIT99837.1" /translation="MRVWKHVEAAVDSPDRCGVVLVGPHGVGKTLLAQLAAEQVMSED GRSGRARWVVGTAPGRAIPFGAFRHLISLPASGADIGRPAALLRAARSSLTGDAGDLL LVVDDAHNLDPLSATLVYQLARAGAARLVVTVASEAEPPDAIAALWSDDLLTRVAIEP LDRAQTAAFVESALDATLDVADADELFRRSLGNPLYLRHLIDGGGLEHVDGRWRCRDE DRRPLSGVIDEYLCALPEPARAVVDYLAIAEPLARTDLVALVGGEQLDTLGQAEAAGA VRVGPDSDTSEIFVGHPLYADRARAVLTAEHAHALRVSLVAQLAKHPSDHVSDQLRLS SLAIDVPASATPAAVTDAATAAGQALRLGDVRLAERLARAALDRSDALAARLPLAYAL GWQGRGREADAVLAAVNPAELTETELMAWAIPRAANRFWMLNEPERATAFLQTTRSRV TEPTARSTLDALAATFAMNSGNLPRAITLATEVLSGPAADDMAVAWAASAAALSSARM GRFGDVDRLAERASAAEHPGLLRFTVGLAQITSLLLAGDVAPAQELAKRFTDFA" CDS 1349994..1350557 /codon_start=1 /transl_table=11 /gene="log" /locus_tag="BQ2027_MB1237" /product="Phosphoribohydrolase involved in Mycobacterial cytokinins production, homolog of plant cytokinin-activating enzyme LOG" /note="Mb1237, -, len: 187 aa. Equivalent to Rv1205, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 187 aa overlap). Conserved hypothetical protein, similar to Q49952 cosmid B1756 from Mycobacterium leprae (187 aa), FASTA scores: opt: 865, E(): 0, (72.4% identity in 174 aa overlap), also similar to FAS6_RHOFA|P46378 hypothetical 21.1 kd protein in fasciation locus (ORF6) (198 aa), FASTA scores: opt: 368, E(): 1.3e-17, (37.4% identity in 174 aa overlap). Some similarity to YJL055W Hypothetical protein in BTN1-PEP8 intergenic region from Saccharomyces cerevisiae and P48636 HYPOTHETICAL protein in AZU 5'REGION from Pseudomonas AERUGINOSA. Protein product from Mb1237 detected using SWATH mass spectrometry. Mb1237 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXN4" /db_xref="InterPro:IPR005269" /db_xref="InterPro:IPR031100" /db_xref="UniProtKB/TrEMBL:A0A1R3XXN4" /protein_id="SIT99838.1" /translation="MSAKIDITGDWTVAVYCAASPTHAELLELAAEVGAAIAGRGWTL VWGGGHVSAMGAVASAARACGGWTVGVIPKMLVYRELADHDADELIVTDTMWERKQIM EDRSDAFIVLPGGVGTLDELFDAWTDGYLGTHDKPIVMVDPWGHFDGLRAWLNGLLDT GYVSPTAMERLVVVDNVKDALRACAPS" CDS 1350607..1352400 /codon_start=1 /transl_table=11 /gene="fadD6" /locus_tag="BQ2027_MB1238" /product="PROBABLE FATTY-ACID-COA LIGASE FADD6 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb1238, fadD6, len: 597 aa. Equivalent to Rv1206, len: 597 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 597 aa overlap). Probable fadD6, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to several e.g. NP_251583.1|NC_002516 probable very-long-chain acyl-CoA synthetase from Pseudomonas aeruginosa (608 aa); Q60714 mouse fatty acid transport protein fatp (646 aa), FASTA scores: opt:712, E(): 0, (36.8% identity in 600 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1238 detected using shotgun mass spectrometry. Mb1238 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXP6" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR030310" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XXP6" /protein_id="SIT99839.1" /translation="MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPN SKASIGTVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVVGI MLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIAESDLVSAVAE CGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKDTAFYIFTSGTTGFPKASV MTHHRWLRALAVFGGMGLRLKGSDTLYSCLPLYHNNALTVAVSSVINSGATLALGKSF SASRFWDEVIANRATAFVYIGEICRYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFT TRFGVARVCEFYAASEGNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRV RRVPDGEPGLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGRAGMAAITL RAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKSRKVELRNQAYGADIED PLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG" CDS 1352466..1353422 /codon_start=1 /transl_table=11 /gene="folP2" /locus_tag="BQ2027_MB1239" /product="dihydropteroate synthase 2 folp2 (dhps 2) (dihydropteroate pyrophosphorylase 2)" /note="Mb1239, folP2, len: 318 aa. Equivalent to Rv1207, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 318 aa overlap). Probable folP2, Dihydropteroate synthase 2 (EC 2.5.1.15), similar to many e.g. DHPS_ECOLI|P26282 Escherichia coli (282 aa), FASTA scores: opt: 480, E(): 1.9e-22, (34.4% identity in 270 aa overlap). Contains PS00792 dihydropteroate synthase signature 1, PS00793 dihydropteroate synthase signature 2. Protein product from Mb1239 detected using SWATH mass spectrometry. Mb1239 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64140" /db_xref="InterPro:IPR000489" /db_xref="InterPro:IPR006390" /db_xref="InterPro:IPR011005" /db_xref="UniProtKB/Swiss-Prot:P64140" /protein_id="SIT99840.1" /translation="MRSTPPASAGRSTPPALAGHSTPPALAGHSTLCGRPVAGDRALI MAIVNRTPDSFYDKGATFSDAAARDAVHRAVADGADVIDVGGVKAGPGERVDVDTEIT RLVPFIEWLRGAYPDQLISVDTWRAQVAKAACAAGADLINDTWGGVDPAMPEVAAEFG AGLVCAHTGGALPRTRPFRVSYGTTTRGVVDAVISQVTAAAERAVAAGVAREKVLIDP AHDFGKNTFHGLLLLRHVADLVMTGWPVLMALSNKDVVGETLGVDLTERLEGTLAATA LAAAAGARMFRVHEVAATRRVLEMVASIQGVRPPTRTVRGLA" CDS 1353419..1354393 /codon_start=1 /transl_table=11 /gene="gpgs" /locus_tag="BQ2027_MB1240" /product="probable glucosyl-3-phosphoglycerate synthase gpgs" /note="Mb1240, -, len: 324 aa. Equivalent to Rv1208, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 324 aa overlap). Conserved hypothetical protein, similar to Q49955|U1756L Mycobacterium leprae (318 aa), FASTA scores, opt: 1621, E(): 0, (80.5% identity in 318 aa overlap). Protein product from Mb1240 detected using SWATH mass spectrometry. Mb1240 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0E1" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="PDB:5JUD" /db_xref="UniProtKB/Swiss-Prot:Q7U0E1" /protein_id="SIT99841.1" /translation="MTASELVAGDLAGGRAPGALPLDTTWHRPGWTIGELEAAKAGRT ISVVLPALNEEATIESVIDSISPLVDGLVDELIVLDSGSTDDTEIRAIASGARVVSRE QALPEVPVRPGKGEALWRSLAATSGDIVVFIDSDLINPHPLFVPWLVGPLLTGEGIQL VKSFYRRPLQVSDVTSGVCATGGGRVTELVARPLLAALRPELGCVLQPLSGEYAASRE LLTSLPFAPGYGVEIGLLIDTFDRLGLDAIAQVNLGVRAHRNRPLDELGAMSRQVIAT LLSRCGIPDSGVGLTQFLPGGPDDSDYTRHTWPVSLVDRPPMKVMRPR" CDS 1354432..1354800 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1241" /product="conserved protein" /note="Mb1241, -, len: 122 aa. Equivalent to Rv1209, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 122 aa overlap). Conserved hypothetical protein, containing a hydrophobic N-terminus. Similar to Q49956|U1756M hypothetical protein from Mycobacterium leprae (114 aa), FASTA scores: opt: 524, E(): 8.9e-29, (78.6% identity in 112 aa overlap). Protein product from Mb1241 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1241 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZT7" /db_xref="InterPro:IPR019933" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT7" /protein_id="SIT99842.1" /translation="MALVLVYLVVLVLVAIVLFAAASLLFGRGEQLPPLPRATTATTL PAFGVTRADVDAVKFTQVLRGYKTSEVDWVLERLGRELEALRSQLGAIHASSEDAEAE SDASNPSRGETVVHYRSDPA" CDS 1354797..1355411 /codon_start=1 /transl_table=11 /gene="tagA" /locus_tag="BQ2027_MB1242" /product="probable dna-3-methyladenine glycosylase i taga (tag i) (3-methyladenine-dna glycosylase i, constitutive) (dna-3-methyladenine glycosidase i)" /note="Mb1242, tagA, len: 204 aa. Equivalent to Rv1210, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 204 aa overlap). Probable tagA, DNA-3-methyladenine glycosidase I (EC 3.2.2.20), similar to several e.g. 3MG1_ECOLI|P05100 DNA-3-methyladenine glycosidase I from Escherichia coli (187 aa), FASTA scores: opt: 530, E(): 1.3e-27, (44.2% identity in 190 aa overlap). Also similar to Q49957 Mycobacterium leprae cosmid B1756 (192 aa), FASTA scores: opt: 1042, E(): 0, (80.2% identity in 192 aa overlap). Protein product from Mb1242 detected using SWATH mass spectrometry. Mb1242 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYM7" /db_xref="InterPro:IPR004597" /db_xref="InterPro:IPR005019" /db_xref="InterPro:IPR011257" /db_xref="UniProtKB/TrEMBL:A0A1R3XYM7" /protein_id="SIT99843.1" /translation="MSGDGLVRCPWAEVRPGPDAQLYRDYHDNEWGRPLYGRVALFER MSLEAFQSGLSWLIILRKRENFRRAFSGFDIDKIARYTDTDVRRLLADDGIVRNRAKI EATIANARAAADLGSSEDLSELLWSFAPPPRPRPVDGSEIPSVSTESKAMSRELKRRG FRFVGPTTAYALMQATGMVDDHIQACWVPTERPFDQPGCPMAAR" CDS 1355518..1355745 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1243" /product="conserved protein" /note="Mb1243, -, len: 75 aa. Equivalent to Rv1211, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 75 aa overlap). Conserved hypothetical protein, similar to Q49958|U1756N Mycobacterium leprae (75 aa), FASTA scores: opt: 460, E(): 0, (90.7% identity in 75 aa overlap). Protein product from Mb1243 detected using shotgun mass spectrometry. Mb1243 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021465" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99844.1" /translation="MLGADQARAGGPARIWREHSMAAMKPRTGDGPLEATKEGRGIVM RVPLEGGGRLVVELTPDEAAALGDELKGVTS" CDS complement(1355773..1356936) /codon_start=1 /transl_table=11 /gene="glga" /locus_tag="BQ2027_MB1244C" /product="putative glycosyl transferase glga" /note="Mb1244c, -, len: 387 aa. Equivalent to Rv1212c, len: 387 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 387 aa overlap). Putative glycosyl transferase (EC 2.-.-.-), highly similar to AJ243803|SCO243803_2 Putative glycosyl transferase from Streptomyces coelicolor (387 aa), FASTA scores: opt: 1344, E(): 0, (54.9% identity in 388 aa overlap). Also similar to MJ1607 probable hexosyltransferase (EC 2.4.1.-) from Methanococcus jannaschii (390 aa), FASTA scores: opt: 445, E(): 7.8e-23, (27.9% identity in 401 aa overlap). The region from aa 267-355 highly similar to Q49959 COSMID B1756 from Mycobacterium leprae (91 aa), FASTA scores, opt: 471, E(): 4.8e-25, (80.9% identity in 89 aa overlap). Similar to Mycobacterium tuberculosis hypothetical protein, Rv3032. Protein product from Mb1244c detected using SWATH mass spectrometry. Mb1244c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY10" /db_xref="InterPro:IPR001296" /db_xref="InterPro:IPR011875" /db_xref="InterPro:IPR028098" /db_xref="UniProtKB/TrEMBL:A0A1R3XY10" /protein_id="SIT99845.1" /translation="MRVAMLTREYPPEVYGGAGVHVTELVAYLRRLCAVDVHCMGAPR PGAFAYRPDPRLGSANAALSTLSADLVMANAASAATVVHSHTWYTALAGHLAAILYDI PHILTAHSLEPLRPWKKEQLGGGYQVSTWVEQTAVLAANAVIAVSSAMRNDMLRVYPS LDPNLVHVIRNGIDTETWYPAGPARTGSVLAELGVDPNRPMAVFVGRITRQKGVVHLV TAAHRFRSDVQLVLCAGAADTPEVADEVRVAVAELARNRTGVFWIQDRLTIGQLREIL SAATVFVCPSVYEPLGIVNLEAMACATAVVASDVGGIPEVVADGITGSLVHYDADDAT GYQARLAEAVNALVADPATAERYGHAGRQRCIQEFSWAYIAEQTLDIYRKVCA" CDS 1357111..1358325 /codon_start=1 /transl_table=11 /gene="glgC" /locus_tag="BQ2027_MB1245" /product="glucose-1-phosphate adenylyltransferase glgc (adp-glucose synthase) (adp-glucose pyrophosphorylase)" /note="Mb1245, glgC, len: 404 aa. Equivalent to Rv1213, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 404 aa overlap). Probable glgC, glucose-1-phosphate adenylyltransferase (EC 2.7.7.27), similar to many e.g. GLGC_ECOLI|P00584 Escherichia coli (430 aa), FASTA scores: opt: 1075, E(): 0, (40.3% identity in 407 aa overlap); highly similar to Q49961 GLGC from Mycobacterium leprae (419 aa), FASTA scores: opt: 2532, E(): 0, (92.6% identity in 404 aa overlap). BELONGS TO THE BACTERIAL AND PLANTS GLUCOSE-1-PHOSPHATE ADENYLYLTRANSFERASE FAMILY. Protein product from Mb1245 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1245 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64242" /db_xref="InterPro:IPR005835" /db_xref="InterPro:IPR005836" /db_xref="InterPro:IPR011004" /db_xref="InterPro:IPR011831" /db_xref="InterPro:IPR023049" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P64242" /protein_id="SIT99846.1" /translation="MREVPHVLGIVLAGGEGKRLYPLTADRAKPAVPFGGAYRLIDFV LSNLVNARYLRICVLTQYKSHSLDRHISQNWRLSGLAGEYITPVPAQQRLGPRWYTGS ADAIYQSLNLIYDEDPDYIVVFGADHVYRMDPEQMVRFHIDSGAGATVAGIRVPRENA TAFGCIDADDSGRIRSFVEKPLEPPGTPDDPDTTFVSMGNYIFTTKVLIDAIRADADD DHSDHDMGGDIVPRLVADGMAAVYDFSDNEVPGATDRDRAYWRDVGTLDAFYDAHMDL VSVHPVFNLYNKRWPIRGESENLAPAKFVNGGSAQESVVGAGSIISAASVRNSVLSSN VVVDDGAIVEGSVIMPGTRVGRGAVVRHAILDKNVVVGPGEMVGVDLEKDRERFAISA GGVVAVGKGVWI" CDS complement(1358521..1358895) /codon_start=1 /transl_table=11 /gene="PE14" /locus_tag="BQ2027_MB1246C" /product="pe family protein pe14" /note="Mb1246c, PE14, len: 124 aa. Equivalent to 5' end of Rv1214c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 68 aa overlap). Member of Mycobacterium tuberculosis PE family, appears to be frameshifted but sequence appears to be correct. The 5'-end is atypical as first 9 aa appear to be missing. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE14 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 5 bp deletion (cttgt-*) leads to a diffferent COOH terminus. Mb1246c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXP3" /protein_id="SIT99847.1" /translation="MLASAATDLAGIGSALSAANAAAAAPTTAMLAACADEVSAVVAS LFARHAQAYQALSLQATAFHQQFVPDRRWRGLCGCRSRQRCCGAERAARRAECDQRSH PGTVRSVTAPMADRAKTAGPGG" CDS complement(1359029..1360714) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1247C" /product="Cocaine esterase (EC" /EC_number="3.1.1.-" /note="Mb1247c, -, len: 561 aa. Equivalent to Rv1215c, len: 561 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 561 aa overlap). Conserved hypothetical protein, low similarity to Rv1835c|Y0D8_MYCTU|Q50598 hypothetical 69.9 kd protein cy1a11.08 (628 aa), FASTA scores: opt: 257, E(): 1.3e-09, (34.1% identity in 185 aa overlap). Protein product from Mb1247c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1247c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXP5" /db_xref="InterPro:IPR000383" /db_xref="InterPro:IPR005674" /db_xref="InterPro:IPR008979" /db_xref="InterPro:IPR013736" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XXP5" /protein_id="SIT99848.1" /translation="MARNPSPALDRPWRRPGALRYALERVRGVAKPPITVTDPPADVV IERDVEVPTRDGTLLRINVFRSAEGGARPVIASIHPYGKDALPRRRGNRWTFSPQYRM LRQPKPLTFSALTGWEAPDPAWWTAQGFVVVNADSRGCGRSDGTGDLLSHQEAEDTYD LVGWLADQSWSDGRVVMLGVSYLAISQYAVAALQPPALRAICPWEGFTDAYRDLAFPG GIRESGFTRLWSRGVRRRTRQTYDMEQMQEAHPLRDDFWRSRVPDLSAIKVPMLVCGS FSDNNLHSRGSIRAFTRSGCGHARLYTHRGGKWETFYSATALSEQLKFLRDALAGSSG SRSVRLEVREDRDTITAVREETQWPLAGTRWRPMYLAGPGLLATEPPPTAGSIRFQTR SRAAAFNWTIPEDIELTGPMAARLWVQLDGCDDANLFVGVEKWRDGQFVAFEGSYGWG RDRVTTGWQRVSLRELDPELSQPWEPVPACARPRPVTAGEVVAVDVALGPSATLFRAG EQLRLVVGGRWLSPRNPLTGQFPAAYPRPPRGRVTLHWGPRYDAHLLIPEVPG" CDS complement(1360742..1361416) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1248C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb1248c, -, len: 224 aa. Equivalent to Rv1216c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 224 aa overlap). Probable conserved integral membrane protein, C-terminal region similar to Q49963|U1756P from Mycobacterium leprae (134 aa), FASTA scores: opt: 311, E(): 3.3e-15, (52.2% identity in 113 aa overlap). Protein product from Mb1248c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1248c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXQ5" /db_xref="InterPro:IPR007318" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ5" /protein_id="SIT99849.1" /translation="MHIGLKIFIWGVLGLVVFGALLFGPAGTFDYWQAWVFLAAFVST TIGPTIYLARNDPAALQRRMRSGPLAEGRTIQKFIVIGAFLGFFAMMVLSACDHRYGW SSVPAAVCVIGDVLVMTGLGIAMLVVIQNRYAASTVRVEAGQILASDGLYKIVRHPMY AGNVVMMTGIPLALGSYWAMFILVPGTLVLVFRILDEEKLLTQELSGYREYRQLVRYR LVPYVW" CDS complement(1361425..1363071) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1249C" /product="PROBABLE TETRONASIN-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb1249c, -, len: 548 aa. Equivalent to Rv1217c, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 548 aa overlap). Probable tetronasin-transport integral membrane ABC transporter (see citation below), similar to many e.g. AL049754|SCH10_12 probable ABC-type transport system membrane-spanning protein from Streptomyces coelicolor (539 aa), FASTA scores: opt: 1309, E(): 0, (40.9% identity in 550 aa overlap); Q54407|X73633 TnrB3 protein from Streptomyces longisporoflavus (337 aa), FASTA scores: opt: 692, E(): 0, (39.5% identity in 324 aa overlap); etc. Also has regions similar to Mycobacterium leprae proteins Q49964|U1756Q (109 aa), FASTA scores: opt: 431, E(): 3.1e-20, (64.8% identity in 105 aa overlap) and Q49965|U1756R (82 aa), FASTA scores: opt:154, E(): 0.0028, (61.0% identity in 41 aa overlap). Protein product from Mb1249c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1249c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXP2" /db_xref="UniProtKB/TrEMBL:A0A1R3XXP2" /protein_id="SIT99850.1" /translation="MSSTVIDRARPAGHRAPHRGSGFTGTLGLLRLYLRRDRVSLPLW VLLLSVPLATVYIASVETVYPDRSARAAAAAAIMASPAQRALYGPVYNDSLGAVGIWK AGMFHTLIAVAVILTVIRHTRADEESGRAELIDSTVVGRYTNLTGALLLSFGASIATG AIGALGLLATDVAPAGSVAFGVALAASGMVFTAVAAVAAQLSPSARFTRAVAFAVLGT AFALRAIGDAGSGTLSWCSPLGWSLQVRPYAGERWWVLLLSLATAAVLTVLAYRLRAG RDVGAGLIAERPGAGTAGPMLSEPFGLAWRLNRGSLLLWTVGLCLYGLVMGSVVHGIG DQLGDNTAVRDIVTRMGGTGALEQAFLALAFTMIGMVAAAFAVSLTLRLHQEETGLRA ETLLAGAVSRTHWLASHLAMALAGSAVATLISGVAAGLAYGMTVGDVGGKLPTVVGTA AVQLPAVWLLSAVTVGLFGLAPRFTPVAWGVLVGFIALYLLGSLAGFPQMLLNLEPFA HIPRVGGGDFTAVPLLWLLAIDAALITLGAMAFRRRDVRC" CDS complement(1363068..1364003) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1250C" /product="PROBABLE TETRONASIN-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1250c, -, len: 311 aa. Equivalent to Rv1218c, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 311 aa overlap). Probable tetronasin-transport ATP-binding protein ABC transporter (see citation below), similar to many e.g. Q54406|X73633|TNRB2 TNRB2 PROTEIN from Streptomyces longisporoflavus (300 aa), FASTA scores: opt: 1133, E(): 0, (60.8% identity in 291 aa overlap); etc. Also similar to others in Mycobacterium tuberculosis e.g. MTCY19H9.04 (30.0% identity in 297 aa overlap); etc. Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS) Protein product from Mb1250c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1250c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXQ1" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR025302" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ1" /protein_id="SIT99851.1" /translation="MSADNHQVPIEIRGLTKHFGSVRALDGLDLTVREGEVHGFLGPN GAGKSTTLRILLGLVKADGGSVRLLGGDPWTDAVDLHRHIAYVPGDVTLWPSLTGGET IDLLARMRGGIDNARRAELIERFGLDPTKKARTYSKGNRQKVSLISALSSHATLLLLD EPSSGLDPLMENVFQQCIGEARQRGVTVLLSSHILAETEALCEKVTIIRAGKTVESGS LDALRHLSRTSIKAEMIGDPGDLSRIKGVEDISIEGTTVRAQVDSESLRELIQVLGHA GVRSLVSQPPTLEELFLRHYSLGPEVAAEQQVATP" CDS complement(1363993..1364631) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1251C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1251c, -, len: 212 aa. Equivalent to Rv1219c, len: 212 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 212 aa overlap). Probable transcriptional regulatory protein, some similarity in N-terminus to YBIH_ECOLI|P41037 hypothetical transcriptional regulator from Escherichia coli (103 aa), FASTA scores: opt: 143, E(): 8.9e-06, (39.7% identity in 63 aa overlap); Helix turn helix motif from aa 28-49. Protein product from Mb1251c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1251c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZU0" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="InterPro:IPR041484" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU0" /protein_id="SIT99852.1" /translation="MRSADLTAHARIREAAIEQFGRHGFGVGLRAIAEAAGVSAALVI HHFGSKEGLRKACDDFVAEEIRSSKAAALKSNDPTTWLAQMAEIESYAPLMAYLVRSM QSGGELAKMLWQKMIDNAEEYLDEGVRAGTVKPSRDPRARARFLAITGGGGFLLYLQM HENPTDLRAALRDYAHDMVLPSLEVYTEGLLADRAMYEAFLAEAQQGEAHVG" CDS complement(1364773..1365420) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1252C" /product="PROBABLE METHYLTRANSFERASE" /note="Mb1252c, -, len: 215 aa. Equivalent to Rv1220c, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 215 aa overlap). Possible methyltransferase (EC 2.1.1.-), some similarity to MDMC_STRMY|Q00719 o-methyltransferase from Streptomyces mycarofaciens (221 aa), FASTA scores; opt: 289, E(): 1.3e-07, (30.0% identity in 203 aa overlap). Also similar to Mycobacterium tuberculosis methyltransferases Rv0187|MTCI28.26 (32.9% identity in 222 aa overlap) and Rv1703c. Start site chosen by homology; other possible start sites exist upstream. Protein product from Mb1252c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1252c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0D0" /db_xref="InterPro:IPR002935" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7U0D0" /protein_id="SIT99853.1" /translation="MPGQPAPSRGESLWAHAEGSISEDVILAGARERATDIGAGAVTP AVGALLCLLAKLSGGKAVAEVGTGAGVSGLWLLSGMRDDGVLTTIDIEPEHLRLARQA FAEAGIGPSRTRLISGRAQEVLTRLADASYDLVFIDADPIDQPDYVAEGVRLLRSGGV IVVHRAALGGRAGDPGARDAEVIAVREAARLIAEDERLTPALVPLGDGVLAAVRD" CDS 1365683..1366456 /codon_start=1 /transl_table=11 /gene="sigE" /locus_tag="BQ2027_MB1253" /product="ALTERNATIVE RNA POLYMERASE SIGMA FACTOR SIGE" /note="Mb1253, sigE, len: 257 aa. Equivalent to Rv1221, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 257 aa overlap). sigE, alternative sigma factor of extracytoplasmic function (ECF) family (see citations below). Similar to many e.g. RPOE_HAEIN|P44790 RNA polymerase sigma-e factor from Haemophilus influenzae (189 aa), FASTA scores: opt: 247, E(): 3.4e-06, (28.5% identity in 186 aa overlap); etc. Also similar to MTCY07D11.03 rpoE from Mycobacterium tuberculosis (35.2% identity in 159 aa overlap). BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. Note that in Mycobacterium bovis BCG, the sigE gene is transcribed from two promoters, P1 and P2, and that these promoters were expressed at temperatures from 30-50 C. Protein product from Mb1253 detected using SWATH mass spectrometry. Mb1253 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXR2" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99854.1" /translation="MELLGGPRVGNTESQLCVADGDDLPTYCSANSEDLNITTITTLS PTSMSHPQQVRDDQWVEPSDQLQGTAVFDATGDKATMPSWDELVRQHADRVYRLAYRL SGNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRARIRMEALP EDYDRVPADEPNPEQIYHDARLGPDLQAALASLPPEFRAAVVLCDIEGLSYEEIGATL GVKLGTVRSRIHRGRQALRDYLAAHPEHGECAVHVNPVR" CDS 1366614..1367078 /codon_start=1 /transl_table=11 /gene="rsea" /locus_tag="BQ2027_MB1254" /product="anti-sigma factor rsea" /note="Mb1254, -, len: 154 aa. Equivalent to Rv1222, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 154 aa overlap). Conserved hypothetical protein. Identical to O06290|MTU87242 (but shorter due to different start site chosen by proximity of RBS). Equivalent to O05736|U87308|MAU87308_2 hypothetical protein from Mycobacterium avium (133 aa), FASTA scores: opt: 644, E(): 7e-32, (86.2% identity in 109 aa overlap). Protein product from Mb1254 detected using SWATH mass spectrometry. Mb1254 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XY24" /protein_id="SIT99855.1" /translation="MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSI EAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGL LSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR" CDS 1367147..1368733 /codon_start=1 /transl_table=11 /gene="htrA" /locus_tag="BQ2027_MB1255" /product="PROBABLE SERINE PROTEASE HTRA (DEGP PROTEIN)" /note="Mb1255, htrA, len: 528 aa. Equivalent to Rv1223, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 528 aa overlap). Probable htrA (alternate gene name: degP), serine protease precursor (EC 3.4.21.-), equivalent to U15180|MLU15180_31|Q49972|ML1078|HTRA POSSIBLE SERINE PROTEASE from Mycobacterium leprae (533 aa), FASTA scores: opt: 2777, E(): 4.1e-141, (81.6% identity in 533 aa overlap). Also similar to many others e.g. HTRA_ECOLI|P09376 protease do precursor from Escherichia coli (EC 3.4.21.-) (474 aa), FASTA scores: opt: 581, E(): 9.1e-27, (36.3% identity in 278 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Start changed since first submission (-21 aa). Protein product from Mb1255 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1255 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXQ7" /db_xref="InterPro:IPR001478" /db_xref="InterPro:IPR001940" /db_xref="InterPro:IPR009003" /db_xref="InterPro:IPR036034" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ7" /protein_id="SIT99856.1" /translation="MDTRVDTDNAMPARFSAQIQNEDEVTSDQGNNGGPNGGGRLAPR PVFRPPVDPASRQAFGRPSGVQGSFVAERVRPQKYQDQSDFTPNDQLADPVLQEAFGR PFAGAESLQRHPIDAGALAAEKDGAGPDEPDDPWRDPAAAAALGTPALAAPAPHGALA GSGKLGVRDVLFGGKVSYLALGILVAIALVIGGIGGVIGRKTAEVVDAFTTSKVTLST TGNAQEPAGRFTKVAAAVADSVVTIESVSDQEGMQGSGVIVDGRGYIVTNNHVISEAA NNPSQFKTTVVLNDGKEVPANLVGRDPKTDLAVLKVDNVDNLTVARLGDSSKVRVGDE VLAVGAPLGLRSTVTQGIVSALHRPVPLSGEGSDTDTVIDAIQTDASINHGNSGGPLI DMDAQVIGINTAGKSLSDSASGLGFAIPVNEMKLVANSLIKDGKIVHPTLGINTRSVS NAIASGAQVANVKAGSPAQKGGILENDVIVKVGNRAVADSDEFVVAVRQLAIGQDAPI EVVREGRHVTLTVKPDPDST" CDS 1368735..1369130 /codon_start=1 /transl_table=11 /gene="tatB" /locus_tag="BQ2027_MB1256" /product="Probable protein TatB" /note="Mb1256, tatB, len: 131 aa. Equivalent to Rv1224, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 131 aa overlap). Probable tatB, component of twin-arginine translocation protein export system (see citation below for more information). Possible exported protein with hydrophobic stretch at N-terminus. Highly similar to Q49973|U15180 hypothetical protein U1756Y from Mycobacterium leprae (120 aa), FASTA scores: opt: 601, E(): 0, (73.3% identity in 131 aa overlap). Mb1256 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VEZ5" /db_xref="InterPro:IPR003369" /db_xref="InterPro:IPR018448" /db_xref="UniProtKB/Swiss-Prot:Q7VEZ5" /protein_id="SIT99857.1" /translation="MFANIGWGEMLVLVMVGLVVLGPERLPGAIRWAASALRQARDYL SGVTSQLREDIGPEFDDLRGHLGELQKLRGMTPRAALTKHLLDGDDSLFTGDFDRPTP KKPDAAGSAGPDATEQIGAGPIPFDSDAT" CDS complement(1369163..1369993) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1257C" /product="HAD-superfamily subfamily IIA hydrolase, hypothetical 2" /note="Mb1257c, -, len: 276 aa. Equivalent to Rv1225c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 276 aa overlap). Conserved hypothetical protein, some similarity to other hypothetical proteins e.g. AE001078|AE001078_2 Archaeoglobus fulgidus (265 aa), FASTA scores: opt: 339, E(): 5.1e-15, (27.1% identity in 262 aa overlap), and to NAGD_ECOLI|P15302 nagd protein from Escherichia coli (250 aa), FASTA scores: opt: 167, E(): 6.4e-12, (24.8% identity in 258 aa overlap). Also weakly similar to Mycobacterium tuberculosis hypothetical protein Rv3400|MTCY78.28c (29.1% identity in 251 aa overlap). Protein product from Mb1257c detected using SWATH mass spectrometry. Mb1257c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXQ2" /db_xref="InterPro:IPR006355" /db_xref="InterPro:IPR006357" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ2" /protein_id="SIT99858.1" /translation="MDVAHLMAAAVLFDIDGVLVLSWRAIPGAAETVRQLTHRGIACA YLTNTTTRTRRQIAEALGAAGIPVAADDVITAGVLTAEYLHGAYPGARCFLVNNGDIT EDLPGIDVVLSTEIGPEDCPEAPDVVVLGSAGPQFDHRTLSRVYGWMLDGVPVVAMHR NMTWNTTDGLRIDTGMYLTGMEQACGKTATAIGKPAAEGFLAAADRVGVDPQQMVMIG DDLHNDVLAAQAVGMTGVLVRTGKFRQQTLDRWLAGASATRPHHVIDSVAGLPPLLGC " CDS complement(1370104..1371567) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1258C" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb1258c, -, len: 487 aa. Equivalent to Rv1226c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 487 aa overlap). Probable transmembrane protein. Some similarity to AL049841|SCE9.01 Streptomyces coelicolor (436 aa), FASTA scores: opt: 203, E(): 1.2e-05, (29.8% identity in 346 aa overlap). Protein product from Mb1258c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XXR6" /db_xref="InterPro:IPR005182" /db_xref="InterPro:IPR014529" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR6" /protein_id="SIT99859.1" /translation="MTDRPHDWHRLSPRMLLVHPVHEMLRQLPVLIGSVVLGSATGNP VWPLAALGVTVVFGVLRWFFTTYRIDDENVSLRTGILSRRAVSVPRNRIRSVQTEARL LHRLLGLTVLRVGTGQEARGEAAFELDAVDSARVPRLRALLLAESLAPVEPTGRVLAR WQSSWLRYAPLSFSGLVMIGAVIGLGYQTGLAVRLPESGFARSAVDAAQRAGVVLVVA VTVLLVVGVSALLAVLFSWLTYGNLLLRRGGSGQEGVLHLRHGLLRVREHTYDMRRLR GATLREPLLVRLLRGARLDAVMTGVHGEGQSSMLLPPCPFETATAVLTDLIDNTDAAA GPLRRHGPAAARRRWTRALLVPTLAGVALIAAAPILGVPGWAWTLWAVLTAGCAGLAV DRVRSLGHRVADGWLVARAGSLQRRRDCIACTGIIGWTVRQTLFQRRAGVATLVAATA AGRKGYQVLDVPAELAWSVAGAASPWVADSVWLRHGS" CDS complement(1371564..1372097) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1259C" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb1259c, -, len: 177 aa. Equivalent to Rv1227c, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 177 aa overlap). Possible transmembrane protein, similar to P96615 hypothetical protein ydbS from Bacillus subtilis (159 aa), fasta scores: E(): 3.6e-07, (30.1% identity in 163 aa overlap). Protein product from Mb1259c detected using SWATH mass spectrometry. Mb1259c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXQ6" /db_xref="InterPro:IPR005182" /db_xref="UniProtKB/TrEMBL:A0A1R3XXQ6" /protein_id="SIT99860.1" /translation="MDHARNVPSATGPQRNHLALAEPAHRPSSQAPVMWALSASLGWI LPVIAQLVWWAVHPQPPWPHLAAAALTAVAMVVHIGVVPLWRYRVHRWEISPQAVFTR TGWLVQERRITPISRVQTVDTYRGPMDRLFGLANVTVTTASSAGAVHIEALDTDVADR VVAQLTDIAALRGEDAT" CDS 1372192..1372749 /codon_start=1 /transl_table=11 /gene="lpqX" /locus_tag="BQ2027_MB1260" /product="probable lipoprotein lpqx" /note="Mb1260, lpqX, len: 185 aa. Equivalent to Rv1228, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 185 aa overlap). Probable lipoprotein LpqX. Contains possible signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1260 detected using SWATH mass spectrometry. Mb1260 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXR3" /protein_id="SIT99861.1" /translation="MSRQWHWLAATLLLITTAACSRPGTEEPDCPTKITLPPGATPTT TLDPRCIVRATTTGTADGDAASRWTGTVRIAGFYASICNAVWDGNVSLAGKDELTGKA TLILVETSCPGKVVAGELVLKGNVGSDSLAITWAHPELPQRAFDLGAGQGTIRRSGDR AEGTFNSDMGGGTEFFLTWSLTMRN" CDS complement(1373049..1374221) /codon_start=1 /transl_table=11 /gene="mrp" /locus_tag="BQ2027_MB1261C" /product="PROBABLE MRP-RELATED PROTEIN MRP" /note="Mb1261c, mrp, len: 390 aa. Equivalent to Rv1229c, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 390 aa overlap). Probable Mrp protein, similar to others e.g. MRP_ECOLI|P21590 mrp protein from Escherichia coli (379 aa), FASTA scores: E(): 0, (34.1% identity in 355 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS01215 MRP Prosite domain. BELONGS TO THE MRP/NBP35 FAMILY OF ATP-BINDING PROTEINS. Protein product from Mb1261c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1261c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65442" /db_xref="InterPro:IPR000808" /db_xref="InterPro:IPR002744" /db_xref="InterPro:IPR019591" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR033756" /db_xref="InterPro:IPR034904" /db_xref="UniProtKB/Swiss-Prot:P65442" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99862.1" /translation="MPSRLHSAVMSGTRDGDLNAAIRTALGKVIDPELRRPITELGMV KSIDTGPDGSVHVEIYLTIAGCPKKSEITERVTRAVADVPGTSAVRVSLDVMSDEQRT ELRKQLRGDTREPVIPFAQPDSLTRVYAVASGKGGVGKSTVTVNLAAAMAVRGLSIGV LDADIHGHSIPRMMGTTDRPTQVESMILPPIAHQVKVISIAQFTQGNTPVVWRGPMLH RALQQFLADVYWGDLDVLLLDLPPGTGDVAISVAQLIPNAELLVVTTPQLAAAEVAER AGSIALQTRQRIVGVVENMSGLTLPDGTTMQVFGEGGGRLVAERLSRAVGADVPLLGQ IPLDPALVAAGDSGVPLVLSSPDSAIGKELHSIADGLSTRRRGLAGMSLGLDPTRR" CDS complement(1374234..1375469) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1262C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1262c, -, len: 411 aa. Equivalent to Rv1230c, len: 411 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 411 aa overlap). Possible membrane protein with two hydrophobic stretches near N-terminus. Some similarity to Rv1022|MTCY10G2.27c|Z92539 probable lpqU protein from Mycobacterium tuberculosis (243 aa), FASTA score: opt: 408, E(): 1e-11, (43.6% identity in 172 aa overlap). Similar to AL133423|SC4A7.37 hypothetical protein from Streptomyces coelicolor (421 aa), FASTA score: opt: 679, E(): 5.1e-23, (36.4% identity in 398 aa overlap). Mb1262c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYP6" /db_xref="InterPro:IPR001827" /db_xref="InterPro:IPR023346" /db_xref="InterPro:IPR031304" /db_xref="UniProtKB/TrEMBL:A0A1R3XYP6" /protein_id="SIT99863.1" /translation="MHIGGRWGARPAVAAVRRGACRLTRAPAFGVAAIAPLVFASAVG GAAPVFPGRTAPVHAVITPVAAVAASGIDLSGPVVIAMKRPPTSFRVAVATIPAPPPP MIVNSPGALGIPAMALSAYRNAELKMAAAAPGCGVSWNLLAGIGRIESMHANGGATDA RGTAIQPIYGPTLDGTLPGNEIIIQSSVGNRVTYARAMGPMQFLPGTWARYATDGDDD GVADPQNLFDSTLAAARYLCSGGLNLRDPAQVMAALLRYNNSMPYAQNVLGWAAGYAT GVFPVDLPPITGPPPPLGDAHLENPEGLGPGLPINVNGLTADGPMAHLPLIDLTPRQA ALNPPPMFPWMAPDPSAPMPGCTLICIGSHGPPVGAPPFPPTAPPPPFLPAAPPPPDP LAGPPGDAGLAPPAPAPAG" CDS complement(1375594..1376136) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1263C" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1263c, -, len: 180 aa. Equivalent to Rv1231c, len: 180 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 180 aa overlap). Probable membrane protein, similar to others e.g. AL390975 Streptomyces coelicolor (198 aa). Protein product from Mb1263c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1263c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXS0" /db_xref="InterPro:IPR010406" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS0" /protein_id="SIT99864.1" /translation="MSKPFAPRRLYTPRTSRTLAPRLDPEAVGRTTESIARFFGTGRY LLVQTLLVLTWIVLNLFAVGLRWDPYPFILLNLAFSTQASYAAPLILLAQNRQEKRDR AVFEEDRRRAAQTKADTEYNARELAALRLAIGEVPTRDYLRHELDSLRALLAELQPTD PDVAQPRVADEAEQHAKKSG" CDS complement(1376133..1377440) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1264C" /product="Mg/Co/Ni transporter MgtE, CBS domain-containing" /note="Mb1264c, -, len: 435 aa. Equivalent to Rv1232c, len: 435 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 435 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. AB013374|AB013374_2 Bacillus halodurans C-125 mamX (449 aa), FASTA scores: opt: 381, E(): 1e-16, (29.9% identity in 251 aa overlap). Some similarity in N-terminus to U15180|MLU1518033 hypothetical Mycobacterium leprae protein u1756u (329 aa), FASTA scores: opt: 300, E(): 4.1e-12, (69.3% identity in 75 aa overlap). Protein product from Mb1264c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1264c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XY33" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR006668" /db_xref="InterPro:IPR006669" /db_xref="InterPro:IPR011033" /db_xref="InterPro:IPR038076" /db_xref="UniProtKB/TrEMBL:A0A1R3XY33" /protein_id="SIT99865.1" /translation="MGSVNRVYLARLSRMSVLGPLGESFGRVRDVVISISIVRQQPRV LGLVVDLATRRKIFIPILRVAAIEPHAVTLSTGNVSLHRFEQRPGEALALGQVLDTLV KVNDPALPELAGVDVVVTDLGVEQTRSRDWMVTRVAVRTQRRLRRRGPVHVVDWHNVA GLTPSALAMPGQDVAQLLDQFEGWKAVDVADAIRGLPPKRRHEVFKALHDKRLADVLQ ELPELDQAEVLSQLGTERAADVLEEMDPDDAADLLAVLNPTEAELLLTRMDPGDSGQV RRLLTHSPDTAGGLMTSDPVVLTPDTSIAEALARVRDPDLTPALASMVFVARPPTATP TGHYLGCVHLQRLLRDPPAELVGGVVDTDLLTLTPETPLAAVTRYFAAYNLVCGPVVD DENHLLGAVTVDDLLDHLLPHDWRVDMPELDPSGAPDRPGGPR" CDS complement(1377502..1378098) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1265C" /product="CONSERVED HYPOTHETICAL MEMBRANE PROTEIN" /note="Mb1265c, -, len: 198 aa. Equivalent to Rv1233c, len: 198 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 198 aa overlap). Conserved hypothetical membrane protein, N-terminus is highly proline rich, C-terminus has two hydrophobic stretches. Proline-rich N-terminus has some similarity to CBPA_DICDI calcium binding protein from Dictyostelium discoideum (467 aa), FASTA scores: E(): 4.8e-06, (35.5% identity in 183 aa overlap). Both sequences share multiple copies of a Tyr-Pro-Pro motif. Mb1265c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXR7" /db_xref="InterPro:IPR025241" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR7" /protein_id="SIT99866.1" /translation="MTAPSGSSGESAHDAAGGPPPVGERPPEQPIADAPWAPPASSPM ADHPPPAYPPSGYPPAYQPGYPTDYPPPMPPGGYAPPGYPPPGTSSAGYGDIPYPPMP PPYGGSPGGYYPEPGYLDGYGPSQPGMNTMALVSLISALVGVLCCIGSIVGIVFGAIA INQIKQTREEGYGLAVAGIVIGIATLLVYMIAGIFAIP" CDS 1378248..1378775 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1266" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb1266, -, len: 175 aa. Equivalent to Rv1234, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 175 aa overlap). Possible transmembrane protein with two TM helices. Protein product from Mb1266 detected using shotgun mass spectrometry. Mb1266 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXR1" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99867.1" /translation="MTSPFQPRQVPGSTPAAAGAGRRGVPALPTPPKGWPVGSYPTYA EAQRAVDYLSEQQFPVQQVTIVGVDLMQVERVTGRLTWPKVLGGGVLSGAWLGLFIGL VLGFFSPNPWSALVTGLVAGVFFGLITSAVPYAMARGTRDFSSTMQLVAGRYDVLCDP QNAEKARDLLARLAI" CDS 1378796..1380202 /codon_start=1 /transl_table=11 /gene="lpqY" /locus_tag="BQ2027_MB1267" /product="PROBABLE SUGAR-BINDING LIPOPROTEIN LPQY" /note="Mb1267, lpqY, len: 468 aa. Equivalent to Rv1235, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 468 aa overlap). Probable lpqY, sugar-binding lipoprotein component of sugar transport system (see citation below), equivalent to MLU1518034 protein u1756v from Mycobacterium leprae (469 aa), FASTA scores: opt: 2442, E(): 0, (77.4% identity in 470 aa overlap). Also similar to P18815|MALE_ENTAE MALTOSE-BINDING PERIPLASMIC PROTEIN from Enterobacter aerogenes (396 aa), FASTA scores: opt: 193, E(): 2.3e-05, (24.2% identity in 297 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1267 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1267 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR006059" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR4" /protein_id="SIT99868.1" /translation="MVMSRGRIPRLGAAVLVALTTAAAACGADSQGLVVSFYTPATDG ATFTAIAQRCNQQFGGRFTIAQVSLPRSPNEQRLQLARRLTGNDRTLDVMALDVVWTA EFAEAGWALPLSDDPAGLAENDAVADTLPGPLATAGWNHKLYAAPVTTNTQLLWYRPD LVNSPPTDWNAMIAEAARLHAAGEPSWIAVQANQGEGLVVWFNTLLVSAGGSVLSEDG RHVTLTDTPAHRAATVSALQILKSVATTPGADPSITRTEEGSARLAFEQGKAALEVNW PFVFASMLENAVKGGVPFLPLNRIPQLAGSINDIGTFTPSDEQFRIAYDASQQVFGFA PYPAVAPGQPAKVTIGGLNLAVAKTTRHRAEAFEAVRCLRDQHNQRYVSLEGGLPAVR ASLYSDPQFQAKYPMHAIIRQQLTDAAVRPATPVYQALSIRLAAVLSPITEIDPESTA DELAAQAQKAIDGMGLLP" CDS 1380199..1381122 /codon_start=1 /transl_table=11 /gene="sugA" /locus_tag="BQ2027_MB1268" /product="PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER SUGA" /note="Mb1268, sugA, len: 307 aa. Equivalent to Rv1236, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 307 aa overlap). Probable sugA, sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to U15180|MLU1518035 protein malFM from Mycobacterium leprae (310 aa), FASTA scores: opt: 1566, E(): 0, (81.8% identity in 292 aa overlap). Also similar to numerous bacterial sugar transport system components. Also similar to Rv2316|MTCY3G12.18c from Mycobacterium tuberculosis (290 aa), FASTA scores: opt: 514, E(): 7.3e-27, (33.2% identity in 283 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Protein product from Mb1268 detected using SWATH mass spectrometry. Mb1268 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXS7" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS7" /protein_id="SIT99869.1" /translation="MTSVEQRTATAVFSRTGSRMAERRLAFMLVAPAAMLMVAVTAYP IGYALWLSLQRNNLATPNDTAFIGLGNYHTILIDRYWWTALAVTLAITAVSVTIEFVL GLALALVMHRTLIGKGLVRTAVLIPYGIVTVVASYSWYYAWTPGTGYLANLLPYDSAP LTQQIPSLGIVVIAEVWKTTPFMSLLLLAGLALVPEDLLRAAQVDGASAWRRLTKVIL PMIKPAIVVALLFRTLDAFRIFDNIYVLTGGSNNTGSVSILGYDNLFKGFNVGLGSAI SVLIFGCVAVIAFIFIKLFGAAAPGGEPSGR" CDS 1381127..1381951 /codon_start=1 /transl_table=11 /gene="sugB" /locus_tag="BQ2027_MB1269" /product="PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER SUGB" /note="Mb1269, sugB, len: 274 aa. Equivalent to Rv1237, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 274 aa overlap). Probable sugB, sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to U15180|MLU1518036 protein MalGM from Mycobacterium leprae (296 aa), FASTA scores: opt: 1571, E(): 0, (89.8% identity in 274 aa overlap). Also similar to numerous bacterial sugar transport protein. Related to Rv2834c|MTCY16B7.08 from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 370, E(): 2.4e-17, (26.8% identity in 269 aa overlap). Protein product from Mb1269 detected using SWATH mass spectrometry. Mb1269 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXR5" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3XXR5" /protein_id="SIT99870.1" /translation="MGARRATYWAVLDTLVVGYALLPVLWIFSLSLKPTSTVKDGKLI PSTVTFDNYRGIFRGDLFSSALINSIGIGLITTVIAVVLGAMAAYAVARLEFPGKRLL IGAALLITMFPSISLVTPLFNIERAIGLFDTWPGLILPYITFALPLAIYTLSAFFREI PWDLEKAAKMDGATPGQAFRKVIVPLAAPGLVTAAILVFIFAWNDLLLALSLTATKAA ITAPVAIANFTGSSQFEEPTGSIAAGAIVITIPIIVFVLIFQRRIVAGLTSGAVKG" CDS 1381956..1383137 /codon_start=1 /transl_table=11 /gene="sugC" /locus_tag="BQ2027_MB1270" /product="PROBABLE SUGAR-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER SUGC" /note="Mb1270, sugC, len: 393 aa. Equivalent to Rv1238, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 393 aa overlap). Probable sugC, sugar-transport ATP-binding protein ABC transporter (see citation below). Highly similar to U15180 protein ugpC from Mycobacterium leprae (392 aa), FASTA score: opt: 2007, E(): 0, (79.9% identity in 389 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1270 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1270 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXS2" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008995" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR040582" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS2" /protein_id="SIT99871.1" /translation="MAEIVLDHVNKSYPDGHTAVRDLNLTIADGEFLILVGPSGCGKT TTLNMIAGLEDISSGELRIAGERVNEKAPKDRDIAMVFQSYALYPHMTVRQNIAFPLT LAKMRKADIAQKVSETAKILDLTNLLDRKPSQLSGGQRQRVAMGRAIVRHPKAFLMDE PLSNLDAKLRVQMRGEIAQLQRRLGTTTVYVTHDQTEAMTLGDRVVVMYGGIAQQIGT PEELYERPANLFVAGFIGSPAMNFFPARLTAIGLTLPFGEVTLAPEVQGVIAAHPKPE NVIVGVRPEHIQDAALIDAYQRIRALTFQVKVNLVESLGADKYLYFTTESPAVHSVQL DELAEVEGESALHENQFVARVPAESKVAIGQSVELAFDTARLAVFDADSGANLTIPHR A" CDS complement(1383214..1384314) /codon_start=1 /transl_table=11 /gene="corA" /locus_tag="BQ2027_MB1271C" /product="POSSIBLE MAGNESIUM AND COBALT TRANSPORT TRANSMEMBRANE PROTEIN CORA" /note="Mb1271c, corA, len: 366 aa. Equivalent to Rv1239c, len: 366 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 366 aa overlap). Possible corA, magnesium and cobalt transport transmembrane protein, highly similar to U15180 corA protein from Mycobacterium leprae (373 aa), FASTA scores: opt: 1985, E(): 0, (79.1% identity in 369 aa overlap). Also similar to various CorA proteins of Gram negative bacteria e.g. P27841|CORA_ECOLI|B3816|Z5333|ECS4746 Magnesium and cobalt transport protein from Escherichia coli strains K12 and O157:H7 (316 aa), FASTA scores: opt: 236, E(): 8e-08, (24.5% identity in 306 aa overlap); etc. SEEMS TO BELONG TO THE MIT FAMILY. Protein product from Mb1271c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1271c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZW9" /db_xref="InterPro:IPR002523" /db_xref="InterPro:IPR004488" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW9" /protein_id="SIT99872.1" /translation="MFPGFDALPEVLRPVARPQPPNAHPVAQPPAQALVDCGVYVCGQ RLPGKYTYAAALREVREIELTGQEAFVWIGLHEPDENQMQDVADVFGLHPLAVEDAVH AHQRPKLERYDETLFLVLKTVNYVPHESVVLAREIVETGEIMIFVGKDFVVTVRHGEH GGLSEVRKRMDADPEHLRLGPYAVMHAIADYVVDRYLEVTNLMETDIDSIEEVAFAPG RKLDIEPIYLLKREVVELRRCVNPLSTAFQRMQTESKDLISKEVRRYLRDVADHQTEA ADQIASYDDMLNSLVQAALARVGMQQNMDMRKISAWAGIIAVPTMIAGIYGMNFHFMP ELDSRWGYPTVIGGMVLICLFLYHVFRNRNWL" CDS 1384485..1385474 /codon_start=1 /transl_table=11 /gene="mdh" /locus_tag="BQ2027_MB1272" /product="PROBABLE MALATE DEHYDROGENASE MDH" /note="Mb1272, mdh, len: 329 aa. Equivalent to Rv1240, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 329 aa overlap). Probable mdh, Malate dehydrogenase (EC 1.1.1.37). Most similar to P50917|MDH_MYCLE MALATE DEHYDROGENASE from Mycobacterium leprae (329 aa), FASTA scores: opt: 1887, E(): 0, (89.1% identity in 329 aa overlap). Contains PS00068 Malate dehydrogenase active site signature. BELONGS TO THE LDH FAMILY. MDH SUBFAMILY. Protein product from Mb1272 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1272 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5J7" /db_xref="InterPro:IPR001236" /db_xref="InterPro:IPR001252" /db_xref="InterPro:IPR001557" /db_xref="InterPro:IPR010945" /db_xref="InterPro:IPR015955" /db_xref="InterPro:IPR022383" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A5J7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99873.1" /translation="MSASPLKVAVTGAAGQIGYSLLFRLASGSLLGPDRPIELRLLEI EPALQALEGVVMELDDCAFPLLSGVEIGSDPQKIFDGVSLALLVGARPRGAGMERSDL LEANGAIFTAQGKALNAVAADDVRVGVTGNPANTNALIAMTNAPDIPRERFSALTRLD HNRAISQLAAKTGAAVTDIKKMTIWGNHSATQYPDLFHAEVAGKNAAEVVNDQAWIED EFIPTVAKRGAAIIDARGASSAASAASATIDAARDWLLGTPADDWVSMAVVSDGSYGV PEGLISSFPVTTKGGNWTIVSGLEIDEFSRGRIDKSTAELADERSAVTELGLI" CDS 1385550..1385810 /codon_start=1 /transl_table=11 /gene="vapb33" /locus_tag="BQ2027_MB1273" /product="possible antitoxin vapb33" /note="Mb1273, -, len: 86 aa. Equivalent to Rv1241, len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 86 aa overlap). Conserved hypothetical protein, member of family of 16 hypothetical M. tuberculosis proteins including: Rv2871|Q10799|YS71_MYCTU HYPOTHETICAL 13.2 KD PROTEIN CY2 (124 aa), FASTA scores: opt: 172, E(): 9.5e-06, (37.2% identity in 86 aa overlap); Rv2132, Rv3321c, etc. Protein product from Mb1273 detected using SWATH mass spectrometry. Mb1273 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXT0" /protein_id="SIT99874.1" /translation="MRTTLTLDDDVVRLVEDAVHRERRPMKQVINDALRRALAPPVKR QEQYRLEPHESAVRSGLDLAGFNKLADELEDEALLDATRRAR" CDS 1385807..1386238 /codon_start=1 /transl_table=11 /gene="vapc33" /locus_tag="BQ2027_MB1274" /product="possible toxin vapc33. contains pin domain." /note="Mb1274, -, len: 143 aa. Equivalent to Rv1242, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 143 aa overlap). Conserved hypothetical protein, member of family of 14 hypothetical M. tuberculosis proteins including: Rv2872|Q10800|YS72_MYCTU (147 aa), FASTA scores: opt: 226, E(): 2.7e-09, (32.1% identity in 137 aa overlap); Rv0749, Rv0277c, Rv2530c, etc. Protein product from Mb1274 detected using SWATH mass spectrometry. Mb1274 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY37" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XY37" /protein_id="SIT99875.1" /translation="MIIPDINLLLYAVITGFPQHRRAHAWWQDTVNGHTRIGLTYPAL FGFLRIATSARVLAAPLPTADAIAYVREWLSQPNVDLLTAGPRHLDIALGLLDKLGTA SHLTTDVQLAAYGIEYDAEIHSSDTDFARFADLKWTDPLRE" CDS complement(1386261..1387949) /codon_start=1 /transl_table=11 /gene="PE_PGRS23" /locus_tag="BQ2027_MB1275C" /product="pe-pgrs family protein pe_pgrs23" /note="Mb1275c, PE_PGRS23, len: 562 aa. Equivalent to Rv1243c, len: 562 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 562 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99876.1" /translation="MEYLIAAQDVLVAAAADLEGIGSALAAANRAAEAPTTGLLAAGA DEVSAAIASLFSGNAQAYQALSAQAAAFHQQFVRALSSAAGSYAAAEAANASPMQAVL DVVNGPTQLLLGRPLIGDGANGGPGQNGGDGGLLYGNGGNGGSSSTPGQPGGRGGAAG LIGNGGAGGAGGPGANGGAGGNGGWLYGNGGLGGNGGAATQIGGNGGNGGHGGNAGLW GNGGAGGAGAAGAAGANGQNPVSHQVTHATDGADGTTGPDGNGTDAGSGSNAVNPGVG GGAGGIGGDGTNLGQTDVSGGAGGDGGDGANFASGGAGGNGGAAQSGFGDAVGGNGGA GGNGGAGGGGGLGGAGGSANVANAGNSIGGNGGAGGNGGIGAPGGAGGAGGNANQDNP PGGNSTGGNGGAGGDGGVGASADVGGAGGFGGSGGRGGLLLGTGGAGGDGGVGGDGGI GAQGGSGGNGGNGGIGADGMANQDGDGGDGGNGGDGGAGGAGGVGGNGGTGGAGGLFG QSGSPGSGAAGGLGGAGGNGGAGGGGGTGFNPGAPGDPGTQGATGANGQHGLNG" CDS 1388129..1388989 /codon_start=1 /transl_table=11 /gene="lpqZ" /locus_tag="BQ2027_MB1276" /product="probable lipoprotein lpqz" /note="Mb1276, lpqZ, len: 286 aa. Equivalent to Rv1244, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 286 aa overlap). Probable lipoprotein lpqZ, equivalent to U15180|MLU1518042 protein u1756x from Mycobacterium leprae (228 aa), FASTA scores: opt: 1039, E(): 0, (72.5% identity in 229 aa overlap). Similar to M. tuberculosis hypothetical protein Rv3759c. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1276 detected using SWATH mass spectrometry. Mb1276 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXS3" /db_xref="InterPro:IPR007210" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS3" /protein_id="SIT99877.1" /translation="MRITRILALLLAVLLAVSGVAGCSADTGDRHPELVVGSTPDSEA MLLAAIYVAALRSYGFAAHAETAADPVAKLDSGAFTVVPAFTGQMLQTLQPDASVRSD AQVYRAIVSALPEGIAAGDYTTAAEDKPALVVTQSTAKAWGGGDLSELPSHCRGLLVG RVAGAHTPAAVGPCRLPAPREFRNDATMFAALRAGQLVAAWTTTADPDIPADLIMLTD GKPALIRAENIVPLYRRNALTERKLLAVNEVAGVLDTTALIGMRRQVAAGADPAAVAA GWLAEHPLGR" CDS complement(1389070..1389900) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1277C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb1277c, -, len: 276 aa. Equivalent to Rv1245c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 276 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to NP_301801.1|NC_002677 short chain alcohol dehydrogenase from Mycobacterium leprae (277 aa). Also highly similar to various dehydrogenases and oxidoreductases e.g. NP_250228.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (295 aa); NP_421969.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (278 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv3085|MTV013.06 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE (276 aa), FASTA scores: opt: 368, E(): 1.2e-16, (35.3% identity in 224 aa overlap); Rv3057c|MTCY22D7.24 PUTATIVE SHORT CHAIN ALCOHOL DEHYDROGENASE/REDUCTASE (287 aa), FASTA scores: opt: 471, E(): 1.3e-21, (32.4% identity in 281 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1277c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1277c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXS4" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99878.1" /translation="MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLA DTEHRLKAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIEVS QFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAPGQAAYNSAKF AVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATAAEGLDQAELAETFDKRVA HLSPQRAAQIILTGVAKNKARVLVGVDAKVLDLVVRLTGSGYQRIFPIITGRLIPRPR " CDS complement(1389957..1390250) /codon_start=1 /transl_table=11 /gene="relE" /locus_tag="BQ2027_MB1278C" /product="toxin relE" /note="Mb1278c, -, len: 97 aa. Equivalent to Rv1246c, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 97 aa overlap). Conserved hypothetical protein, highly similar to Rv2866|MTV003.12 hypothetical Mycobacterium tuberculosis protein (87 aa), FASTA scores: opt: 290, E(): 3.9e-24, (54.1% identity in 85 aa overlap). Protein product from Mb1278c detected using SWATH mass spectrometry. Mb1278c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007712" /db_xref="InterPro:IPR035093" /db_xref="UniProtKB/TrEMBL:A0A1R3XXT8" /protein_id="SIT99879.1" /translation="MSDDHPYHVAITATAARDLQRLPEKIAAACVEFVFGPLLNNPHR LGKPLRNDLEGLHSARRGDYRVVYAIDDGHHRVEIIHIARRSASYRMNPCRPR" CDS complement(1390247..1390516) /codon_start=1 /transl_table=11 /gene="relB" /locus_tag="BQ2027_MB1279C" /product="antitoxin relb" /note="Mb1279c, -, len: 89 aa. Equivalent to Rv1247c, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 89 aa overlap). Conserved hypothetical protein, some similarity to hypothetical proteins including Mycobacterium tuberculosis proteins Rv2865|MTV003.11 (93 aa), FASTA scores: opt: 249, E(): 5.4e-13, (44.2% identity in 86 aa overlap); Rv0268|Z86089|P95225 (169 aa) opt: 125, E(): 0.0089, (41.8% identity in 55 aa overlap); etc. and AE000293|ECAE0002933 from Escherichia coli (92 aa), FASTA scores: opt: 127, E(): 0.0038, (29.3% identity in 82 aa overlap). Protein product from Mb1279c detected using SWATH mass spectrometry. Mb1279c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3XXS5" /protein_id="SIT99880.1" /translation="MAVVPLGEVRNRLSEYVAEVELTHERITITRHGHPAAVLISADD LASIEETLEVLRTPGASEAIREGLADVAAGRFVSNDEIRNRYTAR" CDS complement(1390629..1394324) /codon_start=1 /transl_table=11 /gene="sucA" /locus_tag="BQ2027_MB1280C" /product="Multifunctional alpha-ketoglutarate metabolic enzyme" /note="Mb1280c, len: 1231 aa. Equivalent to Rv1248c len: 1231 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1231 aa overlap). Multifunctional alpha-ketoglutarate metabolic enzyme, highly similar to D84102 Corynebacterium glutamicum (1257 aa), FASTA scores: opt: 4418, E(): 0, (59.4% identity in 1223 aa overlap). Cofactor: thiamine diphosphate. Start changed since first submission (+17 aa). Protein product from Mb1280c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1280c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U0A6" /db_xref="InterPro:IPR001017" /db_xref="InterPro:IPR001078" /db_xref="InterPro:IPR005475" /db_xref="InterPro:IPR011603" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR031717" /db_xref="InterPro:IPR032106" /db_xref="InterPro:IPR042179" /db_xref="UniProtKB/Swiss-Prot:Q7U0A6" /protein_id="SIT99881.1" /translation="MANISSPFGQNEWLVEAMYRKFRDDPSSVDPSWHEFLVDYSPEP TSQPAAEPTRVTSPLVAERAAAAAPQAPPKPADTAAAGNGVVAALAAKTAVPPPAEGD EVAVLRGAAAAVVKNMSASLEVPTATSVRAVPAKLLIDNRIVINNQLKRTRGGKISFT HLLGYALVQAVKKFPNMNRHYTEVDGKPTAVTPAHTNLGLAIDLQGKDGKRSLVVAGI KRCETMRFAQFVTAYEDIVRRARDGKLTTEDFAGVTISLTNPGTIGTVHSVPRLMPGQ GAIIGVGAMEYPAEFQGASEERIAELGIGKLITLTSTYDHRIIQGAESGDFLRTIHEL LLSDGFWDEVFRELSIPYLPVRWSTDNPDSIVDKNARVMNLIAAYRNRGHLMADTDPL RLDKARFRSHPDLEVLTHGLTLWDLDRVFKVDGFAGAQYKKLRDVLGLLRDAYCRHIG VEYAHILDPEQKEWLEQRVETKHVKPTVAQQKYILSKLNAAEAFETFLQTKYVGQKRF SLEGAESVIPMMDAAIDQCAEHGLDEVVIGMPHRGRLNVLANIVGKPYSQIFTEFEGN LNPSQAHGSGDVKYHLGATGLYLQMFGDNDIQVSLTANPSHLEAVDPVLEGLVRAKQD LLDHGSIDSDGQRAFSVVPLMLHGDAAFAGQGVVAETLNLANLPGYRVGGTIHIIVNN QIGFTTAPEYSRSSEYCTDVAKMIGAPIFHVNGDDPEACVWVARLAVDFRQRFKKDVV IDMLCYRRRGHNEGDDPSMTNPYMYDVVDTKRGARKSYTEALIGRGDISMKEAEDALR DYQGQLERVFNEVRELEKHGVQPSESVESDQMIPAGLATAVDKSLLARIGDAFLALPN GFTAHPRVQPVLEKRREMAYEGKIDWAFGELLALGSLVAEGKLVRLSGQDSRRGTFSQ RHSVLIDRHTGEEFTPLQLLATNSDGSPTGGKFLVYDSPLSEYAAVGFEYGYTVGNPD AVVLWEAQFGDFVNGAQSIIDEFISSGEAKWGQLSNVVLLLPHGHEGQGPDHTSARIE RFLQLWAEGSMTIAMPSTPSNYFHLLRRHALDGIQRPLIVFTPKSMLRHKAAVSEIKD FTEIKFRSVLEEPTYEDGIGDRNKVSRILLTSGKLYYELAARKAKDNRNDLAIVRLEQ LAPLPRRRLRETLDRYENVKEFFWVQEEPANQGAWPRFGLELPELLPDKLAGIKRISR RAMSAPSSGSSKVHAVEQQEILDEAFG" CDS complement(1394466..1395254) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1281C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1281c, -, len: 262 aa. Equivalent to Rv1249c, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 262 aa overlap). Possible membrane protein. Start uncertain. Protein product from Mb1281c detected using SWATH mass spectrometry. Mb1281c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZY1" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY1" /protein_id="SIT99882.1" /translation="MSARRIRSWKRFDNRSANAAEPDPQLAGTGGRPKVSTRALAQVI ERSSRIQGPAAQAYVARLRRAHPGASPAKIVAKLEKRFLSVVTASGAAVGTAATLPGI GTLAAWFAAAGEVVVFLEATALFVLALASVHAIPLDHRERRRALVLAVLVGDNTTAVA DLLGPGRTSGGWVSETMASLPLPAISSLNSRMLKYVVKRFALKRGALMFGKLVPMGIG AIIGAIGNRLVGKKLVRNARSAFGTPPARWPVTLHVLPTVRDAS" CDS 1395451..1397190 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1282" /product="PROBABLE DRUG-TRANSPORT INTEGRAL MEMBRANE PROTEIN" /note="Mb1282, -, len: 579 aa. Equivalent to Rv1250, len: 579 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 579 aa overlap). Probable drug-transport integral membrane protein, member of major facilitator superfamily (MFS), highly similar to several including P39886|TCMA_STRGA TETRACENOMYCIN C RESISTANCE PROTEIN from Streptomyces glaucescens (538 aa), FASTA scores: opt: 847, E(): 0, (32.9% identity in 517 aa overlap); etc. Also similar to MTCY20B11.14c|Rv3239C from Mycobacterium tuberculosis (1048 aa), FASTA scores: opt: 629, E(): 6.7e-13, (31.9% identity in 423 aa overlap). TBparse score is 0.921. Protein product from Mb1282 detected using SWATH mass spectrometry. Mb1282 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYR8" /db_xref="InterPro:IPR004638" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XYR8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99883.1" /translation="MTTAIRRAAGSSYFRNPWPALWAMMVGFFMIMLDSTVVAIANPT IMAQLRIGYATVVWVTSAYLLAYAVPMLVAGRLGDRFGPKNLYLIGLGVFTVASLGCG LSSGAGMLIAARVVQGVGAGLLTPQTLSTITRIFPAHRRGVALGAWGTVASVASLVGP LAGGALVDSMGWEWIFFVNVPVGVIGLILAAYLIPALPHHPHRFDWFGVGLSGAGMFL IVFGLQQGQSANWQPWIWAVIVGGIGFMSLFVYWQARNAREPLIPLEVFNDRNFSLSN LGIAIIAFAGTGMMLPVTFYAQAVCGLSPTHTAVLFAPTAIVGGVLAPFVGMIIDRSH PLCVLGFGFSVLAIAMTWLLCEMAPGTPIWRLVLPFIALGVAGAFVWSPLTVTATRNL RPHLAGASSGVFNAVRQLGAVLGSASMAAFMTSRIAAEMPGGVDALTGPAGQDATVLQ LPEFVREPFAAAMSQSMLLPAFVALFGIVAALFLVDFTGAAVAKEPLPESDGDADDDD YVEYILRREPEEDCDTQPLRASRPAAAAASRSGAGGPLAVSWSTSAQGMPPGPPGRRA WQADTESTAPSAL" CDS complement(1397093..1400512) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1283C" /product="Superfamily I DNA and RNA helicases and helicase subunits" /note="Mb1283c, -, len: 1139 aa. Equivalent to Rv1251c, len: 1139 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 1139 aa overlap). Conserved hypothetical protein, showing some similarity in C-terminal region with other proteins from eukaryotes and bacteria e.g. NP_142121.1 hypothetical protein from Pyrococcus horikoshii (1188 aa); and some similarity to GTP-binding proteins e.g. P23249|MV10_MOUSE PUTATIVE GTP-BINDING PROTEIN (1004 aa), FASTA scores: opt: 228, E(): 1.7e-06, (27.7% identity in 560 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb1283c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019993" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR038720" /db_xref="InterPro:IPR041679" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU0" /protein_id="SIT99884.1" /translation="MFVTGDSIVYSASDLAAAARCQYALLREFDAKLGRGPAVAVDDE LMARAAVLGSAHEGRRLDQLRHEFGDAVAIIGRPAYTPAGLAAAADATRRAIANHAPV VYQAAMFDGRFVGFADFLIRDGHRYRVADTKLARSPTVTALLQLAAYADALVHSGVPV AADAELELGDGTIVRYRVGELIPVYRSQRALLQRLLDGHYTAGTAVRWDDERVQACFR CPQCTERLRASDDLLLVGGMRVRQRDKLLEAGITTIAELADHTAPVPGLTTNALGKLT AQAKLQIRQRDTGAPQFEIVDPRPLTLLPEPNPGDLFFDFEGDPLWTADGKQWGLEYL FGVLEAGRAGVFRPLWAHDRTAERQALTDFLAIVARRRRRHPNMHIYHYAPYEKTALL RLVGRYGIGEDDVDDLLRNGVLVDLYPLVRKSIRVGTDSFSLKALEPLYLGTQPRSGD VTTAADSINSYARYCELRAAGRIDEAATVLKEIEGYNHYDCRSTRALRDWLLMRAWEA GVTPIGAQPVPDADPIDDGDSLASVLSKFTGDAAAGERTPEQTAVALLAAARGYHRRE DKPFWWAHFDRLNYPVDEWSDSTDVFLASEASVTVDWHMPPRARKPQRRVRLTGELAR GDLNGNVFALYEPPAPPGMTDNPDRRAAGPAAVVETDDPTVPTEVVIVERTGSDGNTF QQLPFALAPGPPVPTTALRESIESTAAAVASGSPQLPSTALMDVLLRRPPRTRSGAAL PRSSDPVTDIAAAALDLDSSYLAVHGPPGTGKTYTAARVIAELVTEHAWRIGVVAQSH ATVENLLEGVISAGLDPGQVAKKPHDHTAGRWQSIDGSQYTEFIRDTAGCVIGGTAWD FANGNRVPKASLDLLVIDEAGQFCLANTIAVAPAATNLLLLGDPQQLPQVSQGTHPEP VDTSALSWLVDGQHTLPDERGYFLDRSYRMHPAVCAAVSALSYEGRLCSHTERTAVRR LDGYPPGVHTRGVHHKGNSIESPEEAEAILAELRQLLGSPWTDEHGTRPLAASDVLVL APYNAQVALVRRRLASAGLGGADGVRVGTVDKFQGGQAPVVFISMTASSADDVPRGIS FLLNRNRLNVAVSRAQYAAVIVRSELLTQYLPATPDGLVDLGAFLGLTSTS" CDS complement(1400568..1401176) /codon_start=1 /transl_table=11 /gene="lprE" /locus_tag="BQ2027_MB1284C" /product="PROBABLE LIPOPROTEIN LPRE" /note="Mb1284c, lprE, len: 202 aa. Equivalent to Rv1252c, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 202 aa overlap). Probable lipoprotein lprE, some similarity to Mycobacterium tuberculosis protein Rv3483c|MTCY13E12.36C (220 aa), FASTA scores: E(): 7e-05, (29.5% identity in 200 aa overlap). Contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Mb1284c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65313" /db_xref="InterPro:IPR025971" /db_xref="UniProtKB/Swiss-Prot:P65313" /protein_id="SIT99885.1" /translation="MPGVWSPPCPTTPRVGVVAALVAATLTGCGSGDSTVAKTPEATP SLSTAHPAPPSSEPSPPSATAAPPSNHSAAPVDPCAVNLASPTIAKVVSELPRDPRSE QPWNPEPLAGNYNECAQLSAVVIKANTNAGNPTTRAVMFHLGKYIPQGVPDTYGFTGI DTSQCTGDTVALTYASGIGLNNVVKFRWNGGGVELIGNTTGG" CDS 1401242..1402933 /codon_start=1 /transl_table=11 /gene="deaD" /locus_tag="BQ2027_MB1285" /product="PROBABLE COLD-SHOCK DEAD-BOX PROTEIN A HOMOLOG DEAD (ATP-dependent RNA helicase deaD homolog)" /note="Mb1285, deaD, len: 563 aa. Equivalent to Rv1253, len: 563 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 563 aa overlap). Probable Dead, Cold-shock DEAD-box protein A homolog, similar to many e.g. DEAD_ECOLI|P23304 Escherichia coli (646 aa), FASTA scores: opt: 1490, E(): 0, (46.7% identity in 578 aa overlap); similar to Mycobacterium tuberculosis Rv3211. Contains PS00017 ATP/GTP-binding site motif A, PS00039 DEAD-box subfamily ATP-dependent helicases signature. BELONGS TO THE DEAD BOX FAMILY HELICASE. Protein product from Mb1285 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1285 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXT6" /db_xref="InterPro:IPR000629" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR005580" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR012677" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR014014" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR028618" /db_xref="InterPro:IPR034415" /db_xref="UniProtKB/TrEMBL:A0A1R3XXT6" /protein_id="SIT99886.1" /translation="MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATI PALMAGSDVVGLAQTGTGKTAAFAIPMLSKIDITSKVPQALVLVPTRELALQVAEAFG RYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRVIDHLERATLDLSRVDFL VLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPAIRKLSAKYLHDPFEVTCK AKTAVAENISQSYIQVARKMDALTRVLEVEPFEAMIVFVRTKQATEEIAEKLRARGFS AAAISGDVPQAQRERTITALRDGDIDILVATDVAARGLDVERISHVLNYDIPHDTESY VHRIGRTGRAGRSGAALIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKF ADSITNALGGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSR RNRDQRRDRPQRPKRRPDLTTYRVAVGKRHKIGPGAIVGAIANEGGLHRSDFGQIRIG PDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAARRHNGGKPRRKHVG" CDS 1402930..1404081 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1286" /product="PROBABLE ACYLTRANSFERASE" /note="Mb1286, -, len: 383 aa. Equivalent to Rv1254, len: 383 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 383 aa overlap). Probable Acyltransferase (EC 2.3.1.-), similar to G927228 midecamycin 4-0-propionyl transferase (fragment) (388 aa), FASTA scores, opt: 305, E(): 5.6e-14, (28.4% identity in 377 aa overlap). Also similar to other Mycobacterium tuberculosis acyltransferases e.g. Rv0111, Rv0228, etc. Contains PS00881 Protein splicing signature. Protein product from Mb1286 detected using SWATH mass spectrometry. Mb1286 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXT3" /db_xref="InterPro:IPR002656" /db_xref="UniProtKB/TrEMBL:A0A1R3XXT3" /protein_id="SIT99887.1" /translation="MTLPKERAAQGGLERIAHVDRVASLTGIRAVAALLVVGTHAAYT TGKYTHGYWGLMSSRMEIGVPIFFVLSGFLLFRPWVKSAATGGPPPSLSRYAWHRVRR IMPAYTVTVLLAYLVYHFRTAGPNPGHTWVGLFRNLTLTQIYTDGYLGAFLHQGLTQM WSLAVEVAFYLALPALAYLLLVLVCRRRWQPRLLLATMAGLTMISPAWLILVHNTHWM PDGARLWLPTYLAWFVGGMMLAVLAAMGVRCYAFVAIPLAVICYFIVSTPIAGAPTTS PTALAEALVKTAFYAVIAVLAVAPLALGDQGWYAQLLASRPMVFLGEISYEIFLIHLV TMEIAMVDVLGYRVYTSSMVNLCLVTLVLTIPLAWLLHRFTRVQGDRPS" CDS complement(1404050..1404349) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1287C" /product="probable transcriptional regulatory protein" /note="Mb1287c, -, len: 99 aa. Equivalent to 5' end of Rv1257c and 3' end of Rv1255c, len: 455 aa and 202 aa, from Mycobacterium tuberculosis strain H37Rv, (89.5% identity in 57 aa overlap and 83.6% identity in 67 aa overlap). Rv1257c: Probable oxidoreductase (EC 1.-.-.-), similar to e.g. GLCD_ECOLI|P52075 glycolate oxidase subunit glcd (499 aa), FASTA scores: E(): 0, (38.9% identity in 458 aa overlap). Similar to Mycobacterium tuberculosis oxidoreductases e.g. Rv3107c. Rv1255c: Possible regulatory protein, similar to others e.g. ACRR_ECOLI|P34000 potential acrab operon repressor from E. coli (215 aa), FASTA scores: opt: 128, E(): 0.25, (42.1% identity in 57 aa overlap). Helix turn helix motif present at aa 36-57 (+5.48 SD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3007 bp deletion (RD13) leads to the fusion of Rv1257c and Rv1255c, and the deletion of Rv1256c, compared to Mycobacterium tuberculosis strain H37Rv. Protein product from Mb1287c detected using SWATH mass spectrometry. Mb1287c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XXT4" /protein_id="SIT99888.1" /translation="MNTDVLAGLMAELPEGMVVTDPAVTDGYRQDRAFDPSAGKPLAI IRPRRARWVVRMLTSLLMFPGRDEADERAMIAEFVVPIVTPASAAARKAGHPGPE" CDS complement(1404346..1405605) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1288C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb1288c, -, len: 419 aa. Equivalent to Rv1258c, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 419 aa overlap). Probable conserved integral membrane transport (efflux) protein, possibly member of major facilitator superfamily (MFS), highly similar to O32859|TAP PROTEIN multidrug-resistance efflux pump from Mycobacterium fortuitum (409 aa), FASTA scores: E(): 0, (68.4% identity in 408 aa overlap). Contains PS00216 Sugar transport proteins signature 1. Mb1288c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64784" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/Swiss-Prot:P64784" /protein_id="SIT99889.1" /translation="MRNSNRGPAFLILFATLMAAAGDGVSIVAFPWLVLQREGSAGQA SIVASATMLPLLFATLVAGTAVDYFGRRRVSMVADALSGAAVAGVPLVAWGYGGDAVN VLVLAVLAALAAAFGPAGMTARDSMLPEAAARAGWSLDRINGAYEAILNLAFIVGPAI GGLMIATVGGITTMWITATAFGLSILAIAALQLEGAGKPHHTSRPQGLVSGIAEGLRF VWNLRVLRTLGMIDLTVTALYLPMESVLFPKYFTDHQQPVQLGWALMAIAGGGLVGAL GYAVLAIRVPRRVTMSTAVLTLGLASMVIAFLPPLPVIMVLCAVVGLVYGPIQPIYNY VIQTRAAQHLRGRVVGVMTSLAYAAGPLGLLLAGPLTDAAGLHATFLALALPIVCTGL VAIRLPALRELDLAPQADIDRPVGSAQ" CDS 1405604..1406503 /codon_start=1 /transl_table=11 /gene="udgb" /locus_tag="BQ2027_MB1289" /product="probable uracil dna glycosylase, udgb" /note="Mb1289, -, len: 299 aa. Equivalent to Rv1259, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 299 aa overlap). Conserved hypothetical protein. Similar to AL109732|SC7H2.04 hypothetical protein from Streptomyces coelicolor (237 aa), FASTA scores: opt: 870, E(): 0, (57.1% identity in 231 aa overlap). Protein product from Mb1289 detected using SWATH mass spectrometry." /db_xref="GOA:P64786" /db_xref="InterPro:IPR005122" /db_xref="InterPro:IPR036895" /db_xref="UniProtKB/Swiss-Prot:P64786" /protein_id="SIT99890.1" /translation="MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPV EPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREEVA VVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTGDRSGDQLYAA LHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPTPAERLTCSPWLNAEWRLV SDHIRAIVALGGFAWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPSQQNMF TGRLTPTMLDDIFREAKKLAGIE" CDS 1406505..1407329 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1290" /product="PROBABLE OXIDOREDUCTASE [FIRST PART]" /note="Mb1290, -, len: 274 aa. Equivalent to 5' end of Rv1260, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 260 aa overlap). Probable oxidoreductase (EC 1.-.-.-), highly similar to E1245747|AL021411 putative oxidoreductase SC7H1.18 from Streptomyces coelicolor (397 aa), FASTA scores: E(): 1.4e-29, (45.9% identity in 355 aa overlap); also some similarity to G912582 FAD binding protein homologue from Pseudomonas aeruginosa (286 aa), FASTA scores: opt: 245, E(): 2e-09, (27.5% identity in 251 aa overlap); PCPB_FLASP|P42535 pentachlorophenol 4-monooxygenase (537 aa), FASTA scores: opt: 219, E(): 1.7e-07, (23.3% identity in 360 aa overlap); TETX_BACFR|Q01911 tetracycline resistance protein (388 aa), FASTA scores: opt: 183, E(): 3e-05, (22.8% identity in 373 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0575c and Rv1751. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1260 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits Rv1260 into 2 parts, Mb1290 and Mb1291. Mb1290 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXU2" /db_xref="InterPro:IPR002938" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU2" /protein_id="SIT99891.1" /translation="MKTVVVSGASVAGTAAAYWLGRHGYSVTMVERHPGLRPGGQAID VRGPALDVLERMGLLAAAQEHKTRIRGASFVDRDGNELFRDTESTPTGGPVNSPDIEL LRDDLVELLYGATQPSVEYLFDDSISTLQDDGDSVRVTFERAAAREFDLVIGADGLHS NVRRLVFGPEEQFVKRLGTHAAIFTVPNFLELDYWQTWHYGDSTMAGVYSARNNTEAR AALAFMDTELRIDYRDTEAQFAELQRRMAEDGWVRAQLLHYIAAHRISISTKCRRS" CDS 1407329..1407622 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1291" /product="PROBABLE OXIDOREDUCTASE [SECOND PART]" /note="Mb1291, -, len: 102 aa. Equivalent to 3' end of Rv1260, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 102 aa overlap). Probable oxidoreductase (EC 1.-.-.-), highly similar to E1245747|AL021411 putative oxidoreductase SC7H1.18 from Streptomyces coelicolor (397 aa), FASTA scores: E(): 1.4e-29, (45.9% identity in 355 aa overlap); also some similarity to G912582 FAD binding protein homologue from Pseudomonas aeruginosa (286 aa), FASTA scores: opt: 245, E(): 2e-09, (27.5% identity in 251 aa overlap); PCPB_FLASP|P42535 pentachlorophenol 4-monooxygenase (537 aa), FASTA scores: opt: 219, E(): 1.7e-07, (23.3% identity in 360 aa overlap); TETX_BACFR|Q01911 tetracycline resistance protein (388 aa), FASTA scores: opt: 183, E(): 3e-05, (22.8% identity in 373 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0575c and Rv1751. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1260 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits Rv1260 into 2 parts, Mb1290 and Mb1291. Protein product from Mb1291 detected using SWATH mass spectrometry. Mb1291 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZY9" /db_xref="InterPro:IPR002938" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY9" /protein_id="SIT99892.1" /translation="MDRWSRGRVALVGDAGYCCSPLSGQGTSVALLGAYILAGELKAA GDDYQLGFANYHAEFHGFVERNQWLVSDNIPGGAPIPQEEFERIVHSITIKDY" CDS complement(1407748..1408197) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1292C" /product="AclJ" /note="Mb1292c, -, len: 149 aa. Equivalent to Rv1261c, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 149 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1558|MTCY48.07c (39.2% identity in 125 aa overlap); Rv3547 and Rv3178. Protein product from Mb1292c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1292c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64788" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/Swiss-Prot:P64788" /protein_id="SIT99893.1" /translation="MDISRWLERHVGVQLLRLHDAIYRGTNGRIGHRIPGAPPSLLLH TTGAKTSQPRTTSLTYARDGDAYLIVASKGGDPRSPGWYHNLKANPDVEINVGPKRFG VTAKPVQPHDPDYARLWQIVNENNANRYTNYQSRTSRPIPVVVLTRR" CDS complement(1408202..1408636) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1293C" /product="HYPOTHETICAL HIT-LIKE PROTEIN" /note="Mb1293c, -, len: 144 aa. Equivalent to Rv1262c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 144 aa overlap). Hypothetical HIT-like protein, similar to Q04344|HIT_YEAST hit1 protein (orf u) (144 aa), FASTA scores: opt: 306, E(): 3e-14, (35.9 % identity in 142 aa overlap); also similar to YHIT_MYCGE|P47378 hypothetical 15.6 kd protein (141 aa), FASTA scores: opt: 250, E(): 1.6e-10, (35.5% identity in 107 aa overlap); and YHIT_MYCLE|P49774 hypothetical 17.0 kd protein hit-like (155 aa), FASTA scores: opt: 196, E(): 7e-07, (30.6% identity in 144 aa overlap). Similar to other proteins from Mycobacterium tuberculosis e.g. Rv2613c, Rv0759c. Contains PS00892 HIT family signature. BELONGS TO THE HIT FAMILY. Protein product from Mb1293c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1293c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXU9" /db_xref="InterPro:IPR001310" /db_xref="InterPro:IPR011146" /db_xref="InterPro:IPR019808" /db_xref="InterPro:IPR036265" /db_xref="InterPro:IPR039384" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU9" /protein_id="SIT99894.1" /translation="MPCVFCAIIAGEAPAIRIYEDGGYLAILDIRPFTRGHTLVLPKR HTVDLTDTPPEALADMVAIGQRIARAARATKLADATHIAINDGRAAFQTVFHVHLHVL PRRNGDKLSVAKGMMLRRDPDREATGRILREALAQQDAAAQD" CDS 1408695..1410083 /codon_start=1 /transl_table=11 /gene="amiB2" /locus_tag="BQ2027_MB1294" /product="PROBABLE AMIDASE AMIB2 (AMINOHYDROLASE)" /note="Mb1294, amiB2, len: 462 aa. Equivalent to Rv1263, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 462 aa overlap). Probable amiB2, amidase (EC 3.5.1.4). Similar to G1001278 hypothetical 54.3 kDa protein (506 aa), FASTA scores: opt: 767, E(): 7.6e-40, (32.8% identity in 461 aa overlap), also similar to G580673 rhodococcus enantiose lective amidase gene (462 aa), FASTA scores, opt: 668, E(): 7.4e-34, (33.5% identity in 484 aa overlap); and to NYLA_PSES8|P13398 6-aminohexanoate-cyclic-dimer hydrolase (492 aa), FASTA scores opt: 543, E(): 3.1e-26, (33.5% identity in 493 aa overlap). Also similar to MTCY274.19c (33.5% identity in 427 aa overlap). Similar to other putative amidases in M. tuberculosis e.g. Rv2363, Rv2888c, etc. Contains PS00017 ATP/GTP-binding site motif A. BELONGS TO THE AMIDASE FAMILY. Protein product from Mb1294 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P63493" /db_xref="InterPro:IPR000120" /db_xref="InterPro:IPR020556" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/Swiss-Prot:P63493" /protein_id="SIT99895.1" /translation="MDPTDLAFAGAAAQARMLADGALTAPMLLEVYLQRIERLDSHLR AYRVVQFDRARAEAEAAQQRLDAGERLPLLGVPIAIKDDVDIAGEVTTYGSAGHGPAA TSDAEVVRRLRAAGAVIIGKTNVPELMIMPFTESLAFGATRNPWCLNRTPGGSSGGSA AAVAAGLAPVALGSDGGGSIRIPCTWCGLFGLKPQRDRISLEPHDGAWQGLSVNGPIA RSVMDAALLLDATTTVPGPEGEFVAAAARQPGRLRIALSTRVPTPLPVRCGKQELAAV HQAGALLRDLGHDVVVRDPDYPASTYANYLPRFFRGISDDADAQAHPDRLEARTRAIA RLGSFFSDRRMAALRAAEVVLSSRIQSIFDDVDVVVTPGAATGPSRIGAYQRRGAVST LLLVVQRVPYFQVWNLTGQPAAVVPWDFDGDGLPMSVQLVGRPYDEATLLALAAQIES ARPWAHRRPSVS" CDS 1410158..1411351 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1295" /product="ADENYLYL CYCLASE (ATP PYROPHOSPHATE-LYASE) (ADENYLATE CYCLASE)" /note="Mb1295, -, len: 397 aa. Equivalent to Rv1264, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 397 aa overlap). Adenylate cyclase (EC 4.6.1.1) (function proven experimentally: see first citation below), showing some similarity to other adenylate cyclases e.g. CYAA_BRELI|P27580 (403 aa), FASTA scores, opt: 270, E(): 1.3e-10, (29.3% identity in 317 aa overlap); etc. Similar to other putative cyclases in M. tuberculosis e.g. Rv2212, Rv1647. BELONG TO THE ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY. The C terminus seems to code for a catalytic domain belonging to a subfamily of adenylyl cyclase isozymes (mostly found in Gram-positive bacteria). The N terminus seems to be a potential novel regulator of adenylyl cyclase activity (autoinhibitory domain). Protein product from Mb1295 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1295 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXU7" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029787" /db_xref="InterPro:IPR032026" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99896.1" /translation="MTDHVREADDANIDDLLGDLGGTARAERAKLVEWLLEQGITPDE IRATNPPLLLATRHLVGDDGTYVSAREISENYGVDLELLQRVQRAVGLARVDDPDAVV HMRADGEAAARAQRFVELGLNPDQVVLVVRVLAEGLSHAAEAMRYTALEAIMRPGATE LDIAKGSQALVSQIVPLLGPMIQDMLFMQLRHMMETEAVNAGECAAGKPLPGARQVTV AFADLVGFTQLGEVVSAEELGHLAGRLAGLARDLTAPPVWFIKTIGDAVMLVCPDPAP LLDTVLKLVEVVDTDNNFPRLRAGVASGMAVSRAGDWFGSPVNVASRVTGVARPGAVL VADSVREALGDAPEADGFQWSFAGPRRLRGIRGDVRLFRVRRGATRTGSGGAAQDDDL AGSSP" CDS 1411524..1412204 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1296" /product="unknown protein" /note="Mb1296, -, len: 226 aa. Equivalent to Rv1265, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 226 aa overlap). Hypothetical unknown protein (see citation below). Protein product from Mb1296 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1296 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64790" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99897.1" /translation="MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMH GRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLES PEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVP VMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLR AQGPDLPA" CDS complement(1412224..1414014) /codon_start=1 /transl_table=11 /gene="pknH1" /locus_tag="BQ2027_MB1297C" /note="Mb1297c, PknH1, len: 596 aa. Protein product from Mb1297c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1297c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U095" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR017441" /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/Swiss-Prot:Q7U095" /protein_id="SIT99898.1" /translation="MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAV KLMTAEFSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMFLEMRLVEGTDLD SVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRDDFAYLVDFGI ASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYALACVLHECLTGAPPYRAD SAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVVARGMAKKPEDRYASAGDLALAAHEAL SDPDQDHAADILRRSQESTLPGTAAVTAQPPTMPTVTPPPIQAAPTGQPSWAPNSGPM PASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTNPWPLVAGAAAVVLVLVLGAIGI WIANRPKPVQPPQPVAEERLSALLLNSSEVNAVMGSSSMQPGKPITSMDSSPVTVSLP DCQGALYTSQDPVYAGTGYTAINGLISSEPGDNYEHWVNQAVVAFPTADKARAFVQTS ADKWKNCAGKTVTVTNKAKTYRWTFADVKGSPPTITVIDTQEGAEGWECQRAMSVANN VVVDVNACGYQITNQAGQIAAKIVDKVNKE" CDS complement(1414258..1416654) /codon_start=1 /transl_table=11 /gene="tbd2" /locus_tag="BQ2027_MB1297CA" /note="Mb1297cA, Tbd2, putative intergral membrane, ATP-binding, ABC transporter. Part of RD900 first identified in M. africanum GM041182 (Bentley et al. 2012, PLoS Negl Top Dis. http://www.ncbi.nlm.nih.gov/pubmed/ 22389744). 797/798 (99%) AA identity with MAF_12860. Misidentified in original M. bovis AF2122/97 genome assembly. Mb1297cA found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXW0" /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR013525" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW0" /protein_id="SIT99899.1" /translation="MTDTADMITPNAPRLELRAAGRTWHAVAGREWSIGRASEADIRL DNPRVSRQHAVLEATPEGWVLVNLSTNGTFVDGQRVERLTVRQPITIFLGSASSGQRV QLYPVAQSPTPTPASHPPAPPRPATPKPAQRQGETTVARPPTAFHAIDQLVVTIGRAP ENTVVLNDLLVSRRHAILRRTGNRWELSDNASANGTYVNGHRISRAVIGPTDIVGIGH QLLHLSADRLVEYVDTGDISYQASNLRVVTNKGRVLLADVSFVLPQRSLLAVVGPSGA GKSTLLGALTGFRPAGNGTVRYDERDLYDNYAELRHRIGFVPQDDILHTPLTVRRALN YAARLRFPQDVSVDERNQRIEEVLVELGLSTQADQRIDSLSGGQRKRTSVALELLTKP SLLFLDEPTSGLDPGYEKSVMQTLRKLADDGRSVVVVTHNIAHLNMCDRLLILAPGGR LAYFGPPQQALGYFNCTDFADLFTLLEHDTSTDWTGRFNASPLREALIGHPAMRPARP AAARHARPVAQQSAFAQFAILCRRYLAVIAADRQYAVFLLVLPLLLSLFAHAVPGQAG LSLAKAIELKSTQPSQLLVLLIIGGALMGCAASIREIVKERAIYRREHGIGLSRGAYL ASKLVVLTALTSLQALILGFLGVALLPPPDQSVILPWPSVEVAVAVVAVTVVSMMIGL LISAMIGNADRGMPLLVLVVMAQLVLCGGMFGVSGRPPLEQLSWLSPSRWAYAMAAAT VDLNDLRRTAGGDQDPLWDYNVGSWLMAAGACAVQALVLVILIALQLKRIEPQRKARK " CDS complement(1416665..1418431) /codon_start=1 /transl_table=11 /gene="pknH2" /locus_tag="BQ2027_MB1297CB" /note="Mb1297cB, PknH2, putative membrane spanning serine/threonine protein kinase, part of RD900 first identified in M. africanum GM041182(Bentley et al. 2012, PLoS Negl Top Dis. http://www.ncbi.nlm.nih.gov/pubmed/ 22389744). 555/567 (96%) identity with MAF_12870" /db_xref="GOA:A0A1R3XXU5" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR017441" /db_xref="UniProtKB/TrEMBL:A0A1R3XXU5" /protein_id="SIT99900.1" /translation="MSAAAGWLGGSAMSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYE AEHTVKEWTVAVKLMTAEFSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMF LEMRLVEGTDLDSVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILIT RDDFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADSLRAGLRAA RMLDRGPAVSRRHAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVVARGMAKKPEDRYAS AGDLALAAHEALSDPDQDHAADILRRSQESTLPGTAAVTAQPPTMPTVTPPPIQAAPT GQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTNPWPFVAVAAA VVLVLVLGAIGIWIANRPDDNPKRNIATSPGTPTTTATTSLPATTTPTTAPASDPQTR LLSMLPSGYPTGTCKPTTPKPNSIWVNAVAMVDCGQNTNQGGPSRAIYGLFANPDKLK QAFNDDIAAVELMNCPGEGPSPDGWHYNQTPDVTAGMIACGTYKNRPNVIWSNEAKLT LSDVFGDPATIEDLHNWWAKYG" CDS complement(1418736..1419902) /codon_start=1 /transl_table=11 /gene="embR" /locus_tag="BQ2027_MB1298C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN EMBR" /note="Mb1298c, embR, len: 388 aa. Equivalent to Rv1267c, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 388 aa overlap). Probable embR, regulatory protein (see citation below), similar to many e.g. AFSR_STRCO|P25941 regulatory protein AfsR from Streptomyces coelicolor (993 aa), FASTA scores: opt: 489, E(): 1e-25, (33.5% identity in 361 aa overlap); etc. BELONGS TO THE AFSR/DNRI/REDD FAMILY OF REGULATORS. Protein product from Mb1298c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P66800" /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR005158" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/Swiss-Prot:P66800" /protein_id="SIT99901.1" /translation="MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVI NRNRPVGVDALITALWEEWPPSGARASIHSYVSNLRKLLGGAGIDPRVVLAAAPPGYR LSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLS DRQSDALGAYRRVKTTLADDLGIDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTV LDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVLDSANVSRHHAVIVD TGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT" CDS complement(1420213..1420911) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1299C" /product="Cysteine-Type endopeptidase" /note="Mb1299c, -, len: 232 aa. Equivalent to Rv1268c, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 232 aa overlap). Hypothetical unknown protein, probably secreted protein : contains possible signal peptide sequence (score 7.9 at residue 28)." /db_xref="InterPro:IPR025660" /db_xref="InterPro:IPR039564" /db_xref="UniProtKB/Swiss-Prot:P64792" /protein_id="SIT99902.1" /translation="MTTSKIATAFKTATFALAAGAVALGLASPADAAAGTMYGDPAAA AKYWRQQTYDDCVLMSAADVIGQVTGREPSERAIIKVAQSTPSVVHPGSIYTKPADAE HPNSGMGTSVADIPTLLAHYGVDAVITDEDHATATGVATGMAALEQYLGSGHAVIVSI NAEMIWGQPVEETDSAGNPRSDHAVVVTGVDTENGIVHLNDSGTPTGRDEQIPMETFV EAWATSHDFMAVTT" CDS complement(1421134..1421508) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1300C" /product="CONSERVED PROBABLE SECRETED PROTEIN" /note="Mb1300c, -, len: 124 aa. Equivalent to Rv1269c, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 124 aa overlap). Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap). Protein product from Mb1300c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1300c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR025240" /db_xref="UniProtKB/Swiss-Prot:P0A5E2" /protein_id="SIT99903.1" /translation="MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAY SGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTL AAAMKDALTKLGGGYIDTWACN" CDS complement(1421569..1422303) /codon_start=1 /transl_table=11 /gene="lprA" /locus_tag="BQ2027_MB1301C" /product="possible lipoprotein lpra" /note="Mb1301c, lprA, len: 244 aa. Equivalent to Rv1270c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 244 aa overlap). Putative lipoprotein lprA (Precursor). Similar to O32852|AJ000500 lipoprotein from Mycobacterium bovis (236 aa), fasta scores: E(): 5.2e-23, (35.1% identity in 245 aa overlap). Similar to M. tuberculosis lipoproteins: Rv1368, Rv1411c, Rv2945c. Contains probable N-terminal signal sequence. Protein product from Mb1301c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1301c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U094" /db_xref="InterPro:IPR009830" /db_xref="InterPro:IPR029046" /db_xref="UniProtKB/Swiss-Prot:Q7U094" /protein_id="SIT99904.1" /translation="MKHPPCSVVAAATAILAVVLAIGGCSTEGDAGKASDTAATASNG DAAMLLKQATDAMRKVTGMHVRLAVTGDVPNLRVTKLEGDISNTPQTVATGSATLLVG NKSEDAKFVYVDGHLYSDLGQPGTYTDFGNGTSIYNVSVLLDPNKGLANLLANLKDAS VAGSQQADGVATTKITGNSSADDIATLAGSRLTSEDVKTVPTTVWIASDGSSHLVQIQ IAPTKDTSVTLTMSDWGKQVTATKPV" CDS complement(1422516..1422857) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1302C" /product="CONSERVED HYPOTHETICAL SECRETED PROTEIN" /note="Mb1302c, -, len: 113 aa. Equivalent to Rv1271c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 113 aa overlap). Conserved hypothetical exported protein with potential N-terminal signal sequence. Similar to Mycobacterium tuberculosis hypothetical proteins Rv1804c, Rv1810, Rv0622, etc. Protein product from Mb1302c detected using SWATH mass spectrometry. Mb1302c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/Swiss-Prot:P64794" /protein_id="SIT99905.1" /translation="MLSPLSPRIIAAFTTAVGAAAIGLAVATAGTAGANTKDEAFIAQ MESIGVTFSSPQVATQQAQLVCKKLASGETGTEIAEEVLSQTNLTTKQAAYFVVDATK AYCPQYASQLT" CDS complement(1422965..1424860) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1303C" /product="PROBABLE DRUGS-TRANSPORT TRANSMEMBRANE ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1303c, -, len: 631 aa. Equivalent to Rv1272c, len: 631 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 631 aa overlap). Probable drugs-transport transmembrane ATP-binding protein ABC transporter (see citation below), similar to e.g. Y015_MYCGE|P47261 hypothetical ABC transporter mg015m from Mycoplasma genitalium (589 aa), FASTA scores: opt: 1054, E(): 0, (34.3% identity in 522 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), MSBA SUBFAMILY. Mb1303c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63398" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="InterPro:IPR039421" /db_xref="UniProtKB/Swiss-Prot:P63398" /protein_id="SIT99906.1" /translation="MTAPPGARPRAASPPPNMRSRDFWGSAARLVKRLAPQRRLSIAV ITLGIAGTTIGVIVPRILGHATDLLFNGVIGRGLPGGITKAQAVASARARGDNTFADL LSGMNVVPGQGVDFAAVERTLALALALYLAAALMIWAQARLLNLTVQKTMVRLRTDVE DKVHRLPLSYFDGQQRGELLSRVTNDIDNLQSSLSMTISQLVTSILTMVAVLAMMVSI SGLLALITLLTVPLSLLVTRAITRRSQPLFVAHWTSTGRLNAHLEETYSGFTVVKTFG HQAAARERFHELNDDVYQAGFGAQFLSGLVQPATAFIGNLGYVAVAVAGGLQVATGQI TLGSIQAFIQYIRQFNMPLSQLAGMYNALQSGVASAERVFDVLDEPEESPEPEPELPN LTGRVEFEHVNFAYLPGTPVIRDLSLVAEPGSTVAIVGPTGAGKTTLVNLLMRFYEIG SGRILIDGVDIASVSRQSLRSRIGMVLQDTWLYDGTIAENIAYGRPEATTDEIVEAAR AAHVDRFVNTLPAGYQTRVSGDGGSISVGEKQLITIARAFLARPQLLILDEATSSVDT RTELLIQRAMRELRRDRTSFIIAHRLSTIRDADHILVVQTGQIVERGNHAELLARRGV YYQMTRA" CDS complement(1424857..1426605) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1304C" /product="PROBABLE DRUGS-TRANSPORT TRANSMEMBRANE ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1304c, -, len: 582 aa. Equivalent to Rv1273c, len: 582 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 582 aa overlap). Probable drug-transport transmembrane ATP-binding protein ABC transporter (see citation below), similar to e.g. YWJA_BACSU|P45861 hypothetical abc transporter from Bacillus subtilis (575 aa), FASTA scores: opt: 810, E(): 0, (27.0% identity in 578 aa overlap); etc. Contains PS00136 Serine proteases, subtilase family, aspartic acid active site; 2 x PS00211 ABC transporters family signature; and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), MSBA SUBFAMILY. Protein product from Mb1304c detected using SWATH mass spectrometry. Mb1304c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4W5" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="InterPro:IPR039421" /db_xref="UniProtKB/Swiss-Prot:P0A4W5" /protein_id="SIT99907.1" /translation="MLLALLRQHIRPYRRLVAMLMMLQLVSTLASLYLPTVNAAIVDD GVAKGDTATIVRLGAVMLGVTGLQVLCAIGAVYLGSRTGAGFGRDLRSAMFEHIITFS ERETARFGAPTLLTRSTNDVRQILFLVQMTATVLVTAPIMCVGGIIMAIHQEAALTWL LLVSVPILAVANYWIISHMLPLFRRMQSLIDGINRVMRDQLSGVRVVRAFTREGYERD KFAQANTALSNAALSAGNWQALMLPVTTLTINASSVALIWFGGLRIDSGQMQVGSLIA FLSYFAQILMAVLMATMTLAVLPRASVCAERITEVLSTPAALGNPDNPKFPTDGVTGV VRLAGATFTYPGADCPVLQDISLTARPGTTTAIVGSTGSGKSTLVSLICRLYDVTAGA VLVDGIDVREYHTERLWSAIGLVPQRSYLFSGTVADNLRYGGGPDQVVTEQEMWEALR VAAADGFVQTDGLQTRVAQGGVNFSGGQRQRLAIARAVIRRPAIYVFDDAFSALDVHT DAKVHASLRQVSGDATIIVVTQRISNAAQADQVIVVDNGKIVGTGTHETLLADCPTYA EFAASQSLSATVGGVG" CDS 1426752..1427309 /codon_start=1 /transl_table=11 /gene="lprB" /locus_tag="BQ2027_MB1305" /product="possible lipoprotein lprb" /note="Mb1305, lprB, len: 185 aa. Equivalent to Rv1274, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 185 aa overlap). Possible lipoprotein lprB, contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013) . Some similarity to Rv1275. Protein product from Mb1305 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1305 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U093" /db_xref="InterPro:IPR024520" /db_xref="UniProtKB/Swiss-Prot:Q7U093" /protein_id="SIT99908.1" /translation="MRRKVRRLTLAVSALVALFPAVAGCSDSGDNKPGATIPSTPANA EGRHGPFFPQCGGVSDQTVTELTRVTGLVNTAKNSVGCQWLAGGGILGPHFSFSWYRG SPIGRERKTEELSRASVEDINIDGHSGFIAIGNEPSLGDSLCEVGIQFSDDFIEWSVS FSQKPFPPPCDIAKELTRQSIANSK" CDS 1427306..1427848 /codon_start=1 /transl_table=11 /gene="lprC" /locus_tag="BQ2027_MB1306" /product="possible lipoprotein lprc" /note="Mb1306, lprC, len: 180 aa. Equivalent to Rv1275, len: 180 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 180 aa overlap). Possible lipoprotein lprC, contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Some similarity to Rv1274. Protein product from Mb1306 detected using shotgun mass spectrometry. Mb1306 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024520" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW9" /protein_id="SIT99909.1" /translation="MRRVLVGAAALITALLVLTGCTKSISGTAVKAGGAGVPRNNNSQ ERYPNLLKECEVLTTDILAKTVGADPLDIQSTFVGAICRWQAANPAGLIDITRFWFEQ GSLSNERKVAEGLKYQVETRAIQGVDSIVMRTGDPNGACGVASDAAGVVGWWVNPQAP GIDACGQAIKLMELTLATNA" CDS complement(1427993..1428469) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1307C" /product="Phosphohistidine phosphatase SixA" /note="Mb1307c, -, len: 158 aa. Equivalent to Rv1276c, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 158 aa overlap). Conserved hypothetical protein, similar to AL096844|SCI28.03 hypothetical protein from Streptomyces coelicolor (172 aa), FASTA scores: opt: 385, E(): 3.3e-19, (43.5% identity in 161 aa overlap). Some similarity to P76502|SIXA_ECOLI PHOSPHOHISTIDINE PHOSPHATASE SIXA (161 aa), FASTA scores: opt: 146, E(): 0.0047, (31.9% identity in 116 aa overlap). BELONGS TO THE SIXA FAMILY OF PHOSPHATASES. Protein product from Mb1307c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3XXV5" /protein_id="SIT99910.1" /translation="MRHAKSAYPDGIADHDRPLAPRGIREAGLAGGWLRANLPAVDAV LCSTATRARQTLAHTGIDAPARYAERLYGAAPGTVIEEINRVGDNVTSLLVVGHEPTT SALAIVLASISGTDAAVAERISEKFPTSGIAVLRVAGHWADVEPGCAALVGFHVPR" CDS 1428719..1429972 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1308" /product="DNA double-strand break repair protein Mre11" /note="Mb1308, -, len: 417 aa. Equivalent to Rv1277, len: 417 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 417 aa overlap). Conserved hypothetical protein, some similarity to 3914967|O68033|SBCD_RHOCA EXONUCLEASE SBCD HOMOLOG from Rhodobacter capsulatus (405 aa). May be sbcD protein (see first citation below) Protein product from Mb1308 detected using shotgun mass spectrometry. Mb1308 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXW2" /db_xref="InterPro:IPR004843" /db_xref="InterPro:IPR014577" /db_xref="InterPro:IPR029052" /db_xref="InterPro:IPR041796" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99911.1" /translation="MSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQ LGMTRHFLAGDAQPRYSAARRDAVAGLKALAADVGAEFVVVAGDVFEHNQLAPQIVGQ SLEAMRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAGVHEVRPGVQI VAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDALDPDHDKPSLIRLAALDDA LTRQAIHYVALGDKHSLTQVGSSGRVWYSGAPEVTNFDDVEPDPGHVLVVDIDESDPR HPVTVDARRIGRWRFVTLHHQVDTSRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTD RAALDTCLDKYARLFAWLGLWERHTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARG GDDESAVDAQAALALLLRLADRGAA" CDS 1429969..1432596 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1309" /product="DNA double-strand break repair Rad50 ATPase" /note="Mb1309, -, len: 875 aa. Equivalent to Rv1278, len: 875 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 875 aa overlap). Hypothetical unknown protein, possible coiled-coil regions, contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb1309 detected using SWATH mass spectrometry. Mb1309 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041685" /db_xref="UniProtKB/Swiss-Prot:P64796" /protein_id="SIT99912.1" /translation="MKLHRLALTNYRGIAHRDVEFPDHGVVVVCGANEIGKSSMVEAL DLLLEYKDRSTKKEVKQVKPTNADVGSEVIAEISSGPYRFVYRKRFHKRCETELTVLA PRREQLTGDEAHERVRTMLAETVDTELWHAQRVLQAASTAAVDLSGCDALSRALDLAA GDDAALSGTESLLIERIEAEYARYFTPTGRPTGEWSAAVSRLAAAEAAVADCAAAVAE VDDGVRRHTELTEQVAELSQQLLAHQLRLEAARVAAEKIAAITDDAREAKLIATAAAA TSGASTAAHAGRLGLLTEIDTRTAAVVAAEAKARQAADEQATARAEAEACDAALTEAT QVLTAVRLRAESARRTLDQLADCEEADRLAARLARIDDIEGDRDRVCAELSAVTLTEE LLSRIERAAAAVDRGGAQLASISAAVEFTAAVDIELGVGDQRVSLSAGQSWSVTATGP TEVKVPGVLTARIVPGATALDFQAKYAAAQQELADALAAGEVADLAAARSADLCRREL LSRRDQLTATLAGLCGDEQVDQLRSRLEQLCAGQPAELDLVSTDTATARAELDAVEAA RIAAEKDCETRRQIAAGAARRLAETSTRATVLQNAAAAESAELGAAMTRLACERASVG DDELAAKAEADLRVLQTAEQRVIDLADELAATAPDAVAAELAEAADAVELLRERHDEA IRALHEVGVELSVFGTQGRKGKLDAAETEREHAASHHARVGRRARAARLLRSVMARHR DTTRLRYVEPYRAELHRLGRPVFGPSFEVEVDTDLRIRSRTLDDRTVPYECLSGGAKE QLGILARLAGAALVAKEDAVPVLIDDALGFTDPERLAKMGEVFDTIGADGQVIVLTCS PTRYGGVKGAHRIDLDAIQ" CDS 1432617..1434203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1310" /product="PROBABLE DEHYDROGENASE FAD flavoprotein GMC oxidoreductase" /note="Mb1310, -, len: 528 aa. Equivalent to Rv1279, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 528 aa overlap). Probable dehydrogenase, FAD flavoprotein GMC oxidoreductase (EC 1.1.-.-), similar to several e.g. dBETA_ECOLI|P17444 choline dehydrogenase from Escherichia coli (556 aa), FASTA scores, opt: 1047, E(): 0, (37.7% identity in 541 aa overlap). Similar to Rv0697 putative Mycobacterium tuberculosis GMC oxidoreductase. Contains PS00623 GMC oxidoreductases signature 1, and PS00624 GMC oxidoreductases signature 2. BELONGS TO THE GMC OXIDOREDUCTASES FAMILY. Protein product from Mb1310 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1310 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64264" /db_xref="InterPro:IPR000172" /db_xref="InterPro:IPR007867" /db_xref="InterPro:IPR012132" /db_xref="InterPro:IPR027424" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P64264" /protein_id="SIT99913.1" /translation="MDTQSDYVVVGTGSAGAVVASRLSTDPATTVVALEAGPRDKNRF IGVPAAFSKLFRSEIDWDYLTEPQPELDGREIYWPRGKVLGGSSSMNAMMWVRGFASD YDEWAARAGPRWSYADVLGYFRRIENVTAAWHFVSGDDSGVTGPLHISRQRSPRSVTA AWLAAARECGFAAARPNSPRPEGFCETVVTQRRGARFSTADAYLKPAMRRKNLRVLTG ATATRVVIDGDRAVGVEYQSDGQTRIVYARREVVLCAGAVNSPQLLMLSGIGDRDHLA EHDIDTVYHAPEVGCNLLDHLVTVLGFDVEKDSLFAAEKPGQLISYLLRRRGMLTSNV GEAYGFVRSRPELKLPDLELIFAPAPFYDEALVPPAGHGVVFGPILVAPQSRGQITLR SADPHAKPVIEPRYLSDLGGVDRAAMMAGLRICARIAQARPLRDLLGSIARPRNSTEL DEATLELALATCSHTLYHPMGTCRMGSDEASVVDPQLRVRGVDGLRVADASVMPSTVR GHTHAPSVLIGEKAADLIRS" CDS complement(1434220..1435995) /codon_start=1 /transl_table=11 /gene="oppA" /locus_tag="BQ2027_MB1311C" /product="PROBABLE PERIPLASMIC OLIGOPEPTIDE-BINDING LIPOPROTEIN OPPA" /note="Mb1311c, oppA, len: 591 aa. Equivalent to Rv1280c, len: 591 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 591 aa overlap). Probable oppA, oligopeptide-binding lipoprotein component of peptide transport system (see citation below), sharing some similarity to other periplasmic solute binding proteins e.g. OPPA_SALTY|P06202 periplasmic oligopeptide-binding protein from Salmonella typhimurium (542 aa), FASTA scores: E(): 5.1e-05, (22.1% identity in 458 aa overlap); etc. Also similar to Rv1166 and Rv2585c from Mycobacterium tuberculosis. Has possible N-terminal signal sequence and prokaryotic lipoprotein lipid attachment site (PS00013). BELONGS TO THE BACTERIAL EXTRACELLULAR SOLUTE-BINDING PROTEIN FAMILY 5. Protein product from Mb1311c detected using SWATH mass spectrometry. Mb1311c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66772" /db_xref="InterPro:IPR000914" /db_xref="InterPro:IPR030678" /db_xref="InterPro:IPR039424" /db_xref="UniProtKB/Swiss-Prot:P66772" /protein_id="SIT99914.1" /translation="MADRGQRRGCAPGIASALRASFQGKSRPWTQTRYWAFALLTPLV VAMVLTGCSASGTQLELAPTADRRAAVGTTSDINQQDPATLQDGGNLRLSLTDFPPNF NILHIDGNNAEVAAMMKATLPRAFIIGPDGSTTVDTNYFTSIELTRTAPQVVTYTINP EAVWSDGTPITWRDIASQIHAISGADKAFEIASSSGAERVASVTRGVDDRQAVVTFAK PYAEWRGMFAGNGMLLPASMTATPEAFNKGQLDGPGPSAGPFVVSALDRTAQRIVLTR NPRWWGARPRLDSITYLVLDDAARLPALQNNTIDATGVGTLDQLTIAARTKGISIRRA PGPSWYHFTLNGAPGSILADKALRLAIAKGIDRYTIARVAQYGLTSDPVPLNNHVFVA GQDGYQDNSGVVAYNPEQAKRELDALGWRRSGAFREKDGRQLVIRDLFYDAQSTRQFA QIAQHTLAQIGVKLELQAKSGSGFFSDYVNVGAFDIAQFGWVGDAFPLSSLTQIYASD GESNFGKIGSPQIDAAIERTLAELDPGKARALANQVDELIWAEGFSLPLTQSPGTVAV RSTLANFGATGLADLDYTAIGFMRR" CDS complement(1435988..1437826) /codon_start=1 /transl_table=11 /gene="oppD" /locus_tag="BQ2027_MB1312C" /product="PROBABLE OLIGOPEPTIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER OPPD" /note="Mb1312c, oppD, len: 612 aa. Equivalent to Rv1281c, len: 612 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 612 aa overlap). Probable oppD, oligopeptide-transport ATP-binding protein ABC transporter (see citation below), similar to others e.g. DPPD_BACSU|P26905 dipeptide transport ATP-binding protein from Bacillus subtilis (335 aa), FASTA scores: opt: 983, E(): 0, (48.6% identity in 319 aa overlap); etc. Contains 2 x PS00017 ATP/GTP-binding site motif A (P-loop); 2 x PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1312c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1312c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63396" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR013563" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63396" /protein_id="SIT99915.1" /translation="MSPLLEVTDLAVTFRTDGDPVTAVRGISYRVEPGEVVAMVGESG SGKSAAAMAVVGLLPEYAQVRGSVRLQGTELLGLADNAMSRFRGKAIGTVFQDPMSAL TPVYTVGDQIAEAIEVHQPRVGKKAARRRAVELLDLVGISQPQRRSRAFPHELSGGER QRVVIAIAIANDPDLLICDEPTTALDVTVQAQILDVLKAARDVTGAGVLIITHDLGVV AEFADRALVMYAGRVVESAGVNDLYRDRRMPYTVGLLGSVPRLDAAQGTRLVPIPGAP PSLAGLAPGCPFAPRCPLVIDECLTAEPELLDVATDHRAACIRTELVTGRSAADIYRV KTEARPAALGDASVVVRVRHLVKTYRLAKGVVLRRAIGEVRAVDGISLELRQGRTLGI VGESGSGKSTTLHEILELAAPQSGSIEVLGTDVATLGTAERRSLRRDIQVVFQDPVAS LDPRLPVFDLIAEPLQANGFGKNETHARVAELLDIVGLRHGDASRYPAEFSGGQKQRI GIARALALQPKILALDEPVSALDVSIQAGIINLLLDLQEQFGLSYLFVSHDLSVVKHL AHQVAVMLAGTVVEQGDSEEVFGNPKHEYTRRLLGAVPQPDPARRG" CDS complement(1437823..1438698) /codon_start=1 /transl_table=11 /gene="oppC" /locus_tag="BQ2027_MB1313C" /product="PROBABLE OLIGOPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER OPPC" /note="Mb1313c, oppC, len: 291 aa. Equivalent to Rv1282c, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 291 aa overlap). Probable oppC, oligopeptide-transport integral membrane protein ABC transporter (see citation below), similar to other integral membrane proteins e.g. OPPC_ECOLI|P77664 oligopeptide transport system permease from Escherichia coli (302 aa), FASTA scores: E(): 4.6e-33, (40.7% identity in 275 aa overlap); etc. Also similar to Rv3664c|DPPC probable peptide-transport integral membrane protein from Mycobacterium tuberculosis. Mb1313c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66965" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR025966" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P66965" /protein_id="SIT99916.1" /translation="MTEFASRRTLVVRRFLRNRAAVASLAALLLLFVSAYALPPLLPY SYDDLDFNALLQPPGTKHWLGTNALGQDLLAQTLRGMQKSMLIGVCVAVISTGIAATV GAISGYFGGWRDRTLMWVVDLLLVVPSFILIAIVTPRTKNSANIMFLVLLLAGFGWMI SSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNVASILIIDAALNVAAAILA ETGLSFLGFGIQPPDVSLGTLIADGTASATAFPWVFLFPASILVLILVCANLTGDGLR DALDPASRSLRRGVR" CDS complement(1438695..1439672) /codon_start=1 /transl_table=11 /gene="oppB" /locus_tag="BQ2027_MB1314C" /product="PROBABLE OLIGOPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER OPPB" /note="Mb1314c, oppB, len: 325 aa. Equivalent to Rv1283c, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 325 aa overlap). Probable oppB, oligopeptide-transport integral membrane protein ABC transporter (see citation below), similar to other integral membrane proteins e.g. DPPB_ECOLI|P37316 dipeptide transport system permease protein from Escherichia coli (339 aa), FASTA scores: opt: 402, E(): 3.4e-20, (31.0% identity in 345 aa overlap); etc. Also similar to Rv3665c|DppB probable peptide-transport integral membrane protein from Mycobacterium tuberculosis. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Protein product from Mb1314c detected using SWATH mass spectrometry. Mb1314c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66967" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P66967" /protein_id="SIT99917.1" /translation="MTRYLARRLLNYLVLLALASFLTYCLTSLAFSPLESLMQRSPRP PQAVIDAKAHDLGLDRPILARYANWVSHAVRGDFGTTITGQPVGTELGRRIGVSLRLL VVGSVFGTVAGVVIGAWGAIRQYRLSDRVMTTLALLVLSTPTFVVANLLILGALRVNW AVGIQLFDYTGETSPGVAGGVWDRLGDRLQHLILPSLTLALAAAAGFSRYQRNAMLDV LGQDFIRTARAKGLTRRRALLKHGLRTALIPMATLFAYGVAGLVTGAVFVEKIFGWHG MGEWMVRGISTQDTNIVAAITVFSGAVVLLAGLLSDVIYAALDPRVRVS" CDS 1439879..1440370 /codon_start=1 /transl_table=11 /gene="canA" /locus_tag="BQ2027_MB1315" /product="beta-carbonic anhydrase" /note="Mb1315, -, len: 163 aa. Equivalent to Rv1284, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 163 aa overlap). Conserved hypothetical protein, similar to AL109663|SC4A10.26 hypothetical protein from Streptomyces coelicolor (167 aa), FASTA scores: opt: 567, E(): 1.5e-32, (53.4% identity in 163 aa overla); shows some similarity to hypothetical protein from Methanobacterium thermoautotrophicum. Weak similarity to carbonic anhydrases e.g. U51624|MTU516242|P17582 Methanothermobacter thermautotrophicus (171 aa), FASTA score: opt: 305, E(): 1 .2e-14, (35.2% identity in 165 aa overlap). Protein product from Mb1315 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1315 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64798" /db_xref="InterPro:IPR001765" /db_xref="InterPro:IPR036874" /db_xref="UniProtKB/Swiss-Prot:P64798" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99918.1" /translation="MTVTDDYLANNVDYASGFKGPLPMPPSKHIAIVACMDARLDVYR MLGIKEGEAHVIRNAGCVVTDDVIRSLAISQRLLGTREIILLHHTDCGMLTFTDDDFK RAIQDETGIRPTWSPESYPDAVEDVRQSLRRIEVNPFVTKHTSLRGFVFDVATGKLNE VTP" CDS 1440464..1441462 /codon_start=1 /transl_table=11 /gene="cysD" /locus_tag="BQ2027_MB1316" /product="PROBABLE SULFATE ADENYLYLTRANSFERASE SUBUNIT 2 CYSD" /note="Mb1316, cysD, len: 332 aa. Equivalent to Rv1285, len: 332 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 332 aa overlap). Probable cysD, sulfate adenylyltransferase subunit 2 (EC 2.7.7.4) (see first citation below), homology suggests start site at aa 24 or 28, similar to e.g. CYSD_ECOLI|P21156 sulfate adenylate transferase subunit 2 from Escherichia coli (302 aa), FASTA score: opt: 973, E():0, (52.5% identity in 303 aa overlap). Also similar to Mycobacterium tuberculosis Rv2392, 3'-phosphoadenylylsulfate reductase. BELONGS TO THE PAPS REDUCTASE FAMILY. CYSD SUBFAMILY. Thought to be differentially expressed within host cells (see third citation below). Protein product from Mb1316 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1316 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65671" /db_xref="InterPro:IPR002500" /db_xref="InterPro:IPR011784" /db_xref="InterPro:IPR014729" /db_xref="UniProtKB/Swiss-Prot:P65671" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99919.1" /translation="MAITINMVNPTGFIRYEDVEQEAMTSDVTVGPAPGQYQLSHLRL LEAEAIHVIREVAAEFERPVLLFSGGKDSIVMLHLALKAFRPGRLPFPVMHVDTGHNF DEVIATRDELVAAAGVRLVVASVQDDIDAGRVVETIPSRNPIQTVTLLRAIRENQFDA AFGGARRDEEKARAKERVFSFRDEFGQWDPKAQRPELWNLYNGRHHKGEHIRVFPLSN WTEFDIWSYIGAEQVRLPSIYFAHRRKVFQRDGMLLAVHRHMQPRADEPVFEATVRFR TVGDVTCTGCVESSASTVAEVIAETAVARLTERGATRADDRISEAGMEDRKRQGYF" CDS 1441462..1443306 /codon_start=1 /transl_table=11 /gene="cysN" /locus_tag="BQ2027_MB1317" /product="PROBABLE BIFUNCTIONAL ENZYME CYSN/CYSC: SULFATE ADENYLTRANSFERASE (SUBUNIT 1) + ADENYLYLSULFATE KINASE" /note="Mb1317, cysN, len: 614 aa. Equivalent to Rv1286, len: 614 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 614 aa overlap). Probable cysN/cysC bifunctional enzyme, sulfate adenylyltransferase subunit 1 (EC 2.7.7.4) and Adenylylsulfate kinase (EC 2.7.1.25) (see first citation below), similar to CYSN_ECOLI|P23845 sulfate adenylate transferase subunit 1 from Escherichia coli (475 aa), FASTA scores: opt: 1291, E():0, (50.2% identity in 428 aa overlap). Contains 2 x PS00017 ATP/GTP-binding site motif A, PS00301 GTP-binding elongation factors signature. Protein product from Mb1317 detected using shotgun mass spectrometry. Mb1317 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXW4" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR002891" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR009001" /db_xref="InterPro:IPR011779" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031157" /db_xref="InterPro:IPR041757" /db_xref="UniProtKB/TrEMBL:A0A1R3XXW4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99920.1" /translation="MTTLLRLATAGSVDDGKSTLIGRLLYDSKAVMEDQWASVEQTSK DRGHDYTDLALVTDGLRAEREQGITIDVAYRYFATPKRKFIIADTPGHIQYTRNMVTG ASTAQLVIVLVDARHGLLEQSRRHAFLASLLGIRHLVLAVNKMDLLGWDQEKFDAIRD EFHAFAARLDVQDVTSIPISALHGDNVVTKSDQTPWYEGPSLLSHLEDVYIAGDRNMV DVRFPVQYVIRPHTLEHQDHRSYAGTLASGVMRSGDEVVVLPIGKTTRITAIDGPNGP VAEAFPPMAVSVRLADDIDISRGDMIARTHNQPRITQEFDATVCWMADNAVLEPGRDY VVKHTTRTVRARIAGLDYRLDVNTLHRDKTATALKLNELGRVSLRTQVPLLLDEYTRN ASTGSFILIDPDTNGTVAAGMVLRDVSARTPSPNTVRHRSLVTAQDRPPRGKTVWFTG LSGSGKSSVAMLVERKLLEKGISAYVLDGDNLRHGLNADLGFSMADRAENLRRLSHVA TLLADCGHLVLVPAISPLAEHRALARKVHADAGIDFFEVFCDTPLQDCERRDPKGLYA KARAGEITHFTGIDSPYQRPKNPDLRLTPDRSIDEQAQEVIDLLESSS" CDS 1443360..1443845 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1318" /product="Predicted transcriptional regulator of sulfate adenylyltransferase, Rrf2 family" /note="Mb1318, -, len: 161 aa. Equivalent to Rv1287, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 161 aa overlap). Conserved hypothetical protein, similar to VJEB family of proteins e.g. FASTA score: P44675|Y379_HAEIN HYPOTHETICAL PROTEIN HI0379 (150 aa), FASTA scores: opt: 213, E(): 2.5e-08, (30.0% identity in 130 aa overlap) and YJEB_ECOLI|P21498 hypothetical 15.6 kd protein in pura-vacb (141 aa), opt: 167, E(): 9.5e-06, (25.0% identity in 136 aa overlap). BELONGS TO THE UPF0074 (RFF2) FAMILY. Protein product from Mb1318 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1318 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67160" /db_xref="InterPro:IPR000944" /db_xref="InterPro:IPR030489" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67160" /protein_id="SIT99921.1" /translation="MRMSAKAEYAVRAMVQLATAASGTVVKTDDLAAAQGIPPQFLVD ILTNLRTDRLVRSHRGREGGYELARPGTEISIADVLRCIDGPLASVRDIGLGDLPYSG PTTALTDVWRALRASMRSVLEETTLADVAGGALPEHVAQLADDYRAQESTRHGASRHG D" CDS 1443903..1445120 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1319" /product="Putative esterase" /note="Mb1319, -, len: 405 aa. Similar to Rv1288, len: 456 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 403 aa overlap). Conserved hypothetical protein, some similarity to A85B_MYCTU|P31952 antigen 85-b precursor (85b) (325 aa), FASTA scores: opt: 199, E(): 2.7e-06, (24.7% identity in 279 aa overlap). Also similar to Q01377|CSP1_CORGL PS1 PROTEIN PRECURSOR (related to antigen 85 complex) from Corynebacterium glutamicum (657 aa), FASTA scores: opt: 280, E(): 1.9e-10, (26.4% identity in 352 aa overlap). SEEMS TO CONTAIN 3 LYSM REPEATS. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 153 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (405 aa versus 456 aa). Protein product from Mb1319 detected using SWATH mass spectrometry. Mb1319 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR018392" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR036779" /db_xref="UniProtKB/TrEMBL:A0A1R3Y024" /protein_id="SIT99922.1" /translation="MVSTHAVVAGETLSALALRFYGDAELYRLIAAASGIADPDVVNV GQRLIMPDFTRYTVVAGDTLSALAARFYGDASLYPLIAAVNGIADPGVIDVGQVLVIF IGRSDGFGLRIVDRNENDPRLWYYRFQTSAIGWNPGVNVLLPDDYRTSGRTYPVLYLF HGGGTDQDFRTFDFLGIRDLTAGKPIIIVMPDGGHAGWYSNPVSSFVGPRNWETFHIA QLLPWIEANFRTYAEYDGRAVAGFSMGGFGALKYAAKYYGHFASASSHSGPASLRRDF GLVVHWANLSSAVLDLGGGTVYGAPLWDQARVSADNPVERIDSYRNKRIFLVAGTSPD PANWFDSVNETQVLAGQREFRERLSNAGIPHESHEVPGGHVFRPDMFRLDLDGIVARL RPASIGAAAERAD" CDS 1445169..1445801 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1320" /product="unknown protein" /note="Mb1320, -, len: 210 aa. Equivalent to Rv1289, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 210 aa overlap). Hypothetical unknown protein. Protein product from Mb1320 detected using shotgun mass spectrometry. Mb1320 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64800" /protein_id="SIT99923.1" /translation="MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLG VGTRFRTALRDSLDIYGVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWL HGHADESSVEFEVSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPP DYQLSWYDHVFFISVWWGWQDHFREIVNVDRASLVALDFGDLWNGWTPVG" CDS complement(1445940..1447505) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1321C" /product="Predicted membrane protein" /note="Mb1321c, -, len: 521 aa. Equivalent to Rv1290c, len: 521 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 521 aa overlap). Conserved hypothetical protein, similar to AL031013|SC8A6.09 hypothetical protein from Streptomyces coelicolor (443 aa), FASTA scores: opt: 371, E(): 9.5e-17, (28.3% identity in 446 aa overlap). Protein product from Mb1321c detected using SWATH mass spectrometry. Mb1321c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5E4" /db_xref="InterPro:IPR018723" /db_xref="UniProtKB/Swiss-Prot:P0A5E4" /protein_id="SIT99924.1" /translation="MLQRSLGVNGRKLAMSARSAKRERKNASTAASKCYVVPPSARGW VHAYSVTATSMLNRRKAILDYLQGAVWVLPTFGVAIGLGSGAVLSMIPVKSGTLIDKL MFQGTPGDARGVLIVVSATMITTIGIVFSLTVLSLQIASSQFSVRLLRTFLRDVPNQV VLAIFACTFAYSTGGLHTVGEHRDGGAFIPKVAVTGSLALAFVSIAALIYFLHHLMHS IQIDTIMDKVRLRTLGLVDQLYPESDTADRQVETPPSPPADAVPLLAPHSGYLQTVDV DDIAELAAASRYTALLVTFVGDYVTAGGLLGWCWRRGTAPGAPGSDFPQRCLRHVHIG FERTLQQDIRFGLRQMVDIALRALSPALNDPYTAIQVVHHLSAVESVLASRALPDDVR RDRAGELLFWLPYPSFATYLHVGCAQIRRYGSREPLVLTALLQLLSAVAQNCVDPSRR VAVQTQIALVVRAAQREFADESDRAMVLGAAARATEVVERPGTLAPPPSTFGQVAAAQ AAASTIRSADRDG" CDS 1447516..1447830 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1322" /product="HYPOTHETICAL PROTEIN" /note="Mb1322, -, len: 104 aa. Equivalent to Rv1290A, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 104 aa overlap). Hypothetical unknown protein, equivalent to AAK45590 from Mycobacterium tuberculosis strain CDC1551 (122 aa) but shorter 18 aa." /db_xref="UniProtKB/TrEMBL:A0A1R3XY94" /protein_id="SIT99925.1" /translation="MLALHGLSEGVSGSGGSGGRWGAGEVLEGARIGVIADGVSCFPT KADCRRIRGVPVFDGYTRMVARLMGSLAVLRSVSIPKGYRDFGFGSLRAVAPKNCPDV SG" CDS complement(1447957..1448292) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1323C" /product="CONSERVED HYPOTHETICAL SECRETED PROTEIN" /note="Mb1323c, -, len: 111 aa. Equivalent to Rv1291c, len: 111 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 111 aa overlap). Conserved hypothetical secreted protein, similar to others in Mycobacterium tuberculosis e.g. Rv1271c|Q11048|YC71_MYCTU HYPOTHETICAL 11.6 KD PROTEIN (113 aa), FASTA score: opt: 246, E(): 1.7e-09, (40.0% identity in 110 aa overlap); Rv1804c, Rv1810, Rv0622, etc." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/Swiss-Prot:P0A5E6" /protein_id="SIT99926.1" /translation="MFTRRFAASMVGTTLTAATLGLAALGFAGTASASSTDEAFLAQL QADGITPPSAARAIKDAHAVCDALDEGHSAKAVIKAVAKATGLSAKGAKTFAVDAASA YCPQYVTSS" tRNA complement(1448651..1448723) /locus_tag="BQ2027_ARGV" /product="tRNA-Arg" /note="argV, len: 73 nt. Equivalent to argV, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Arg, anticodon ccg." CDS 1448837..1450489 /codon_start=1 /transl_table=11 /gene="argS" /locus_tag="BQ2027_MB1324" /product="PROBABLE ARGINYL-TRNA SYNTHETASE ARGS (ARGRS) (Arginine--tRNA ligase)" /note="Mb1324, argS, len: 550 aa. Equivalent to Rv1292, len: 550 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 550 aa overlap). Probable argS, Arginyl-tRNA synthetase (EC 6.1.1.19), highly similar to SYR_MYCLE|P45840 Mycobacterium leprae (550 aa), FASTA scores: opt: 3115, E(): 0, (84.9% identity in 550 aa overlap). Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb1324 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1324 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67570" /db_xref="InterPro:IPR001278" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR005148" /db_xref="InterPro:IPR008909" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR035684" /db_xref="InterPro:IPR036695" /db_xref="UniProtKB/Swiss-Prot:P67570" /protein_id="SIT99927.1" /translation="MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGD YASNLAMQLAKKVGTNPRELAGWLAEALTKVDGIASAEVAGPGFINMRLETAAQAKVV TSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGGTRWAAVGDALGRLLTTQGAD VVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQKAPDALSLP DAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGN IYEKDGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGAD HHGYIARLKAAAAAFGDDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAI GVDAARYSLIRSSVDTAIDIDLALWSSASNENPVYYVQYAHARLSALARNAAELALIP DTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVCRYLEDLAGDYHRFYDSCR VLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM" CDS 1450486..1451829 /codon_start=1 /transl_table=11 /gene="lysA" /locus_tag="BQ2027_MB1325" /product="diaminopimelate decarboxylase lysa (dap decarboxylase)" /note="Mb1325, lysA, len: 447 aa. Equivalent to Rv1293, len: 447 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 447 aa overlap). Probable lysA, diaminopimelate decarboxylase (EC 4.1.1.20) (see citation below), almost identical to DCDA_MYCTU|P31848. Contains PS00878 Orn/DAP/Arg decarboxylases family 2 pyridoxal-P attachment site, PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2. BELONGS TO FAMILY 2 OF ORNITHINE, DAP, AND ARGININE DECARBOXYLASES. Protein product from Mb1325 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1325 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5M5" /db_xref="InterPro:IPR000183" /db_xref="InterPro:IPR002986" /db_xref="InterPro:IPR009006" /db_xref="InterPro:IPR022643" /db_xref="InterPro:IPR022644" /db_xref="InterPro:IPR022653" /db_xref="InterPro:IPR022657" /db_xref="InterPro:IPR029066" /db_xref="UniProtKB/Swiss-Prot:P0A5M5" /protein_id="SIT99928.1" /translation="MNELLHLAPNVWPRNTTRDEVGVVCIAGIPLTQLAQEYGTPLFV IDEDDFRSRCRETAAAFGSGANVHYAAKAFLCSEVARWISEEGLCLDVCTGGELAVAL HASFPPERITLHGNNKSVSELTAAVKAGVGHIVVDSMTEIERLDAIAGEAGIVQDVLV RLTVGVEAHTHEFISTAHEDQKFGLSVASGAAMAAVRRVFATDHLRLVGLHSHIGSQI FDVDGFELAAHRVIGLLRDVVGEFGPEKTAQIATVDLGGGLGISYLPSDDPPPIAELA AKLGTIVSDESTAVGLPTPKLVVEPGRAIAGPGTITLYEVGTVKDVDVSATAHRRYVS VDGGMSDNIRTALYGAQYDVRLVSRVSDAPPVPARLVGKHCESGDIIVRDTWVPDDIR PGDLVAVAATGAYCYSLSSRYNMVGRPAVVAVHAGNARLVLRRETVDDLLSLEVR" CDS 1451833..1453158 /codon_start=1 /transl_table=11 /gene="thrA" /locus_tag="BQ2027_MB1326" /product="PROBABLE HOMOSERINE DEHYDROGENASE THRA" /note="Mb1326, thrA, len: 441 aa. Equivalent to Rv1294, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 441 aa overlap). Probable thrA (hom), homoserine dehydrogenase (EC 1.1.1.3), highly similar to DHOM_MYCLE|P46806 from Mycobacterium leprae (441 aa), FASTA scores: opt: 2437, E():0, (89.5% identity in 438 aa overlap). Contains PS00017 ATP/GTP-binding site motif A; PS01042 Homoserine dehydrogenase signature. BELONGS TO THE HOMOSERINE DEHYDROGENASE FAMILY. Protein product from Mb1326 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1326 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63630" /db_xref="InterPro:IPR001342" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR005106" /db_xref="InterPro:IPR016204" /db_xref="InterPro:IPR019811" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P63630" /protein_id="SIT99929.1" /translation="MPGDEKPVGVAVLGLGNVGSEVVRIIENSAEDLAARVGAPLVLR GIGVRRVTTDRGVPIELLTDDIEELVAREDVDIVVEVMGPVEPSRKAILGALERGKSV VTANKALLATSTGELAQAAESAHVDLYFEAAVAGAIPVIRPLTQSLAGDTVLRVAGIV NGTTNYILSAMDSTGADYASALADASALGYAEADPTADVEGYDAAAKAAILASIAFHT RVTADDVYREGITKVTPADFGSAHALGCTIKLLSICERITTDEGSQRVSARVYPALVP LSHPLAAVNGAFNAVVVEAEAAGRLMFYGQGAGGAPTASAVTGDLVMAARNRVLGSRG PRESKYAQLPVAPMGFIETRYYVSMNVADKPGVLSAVAAEFAKREVSIAEVRQEGVVD EGGRRVGARIVVVTHLATDAALSETVDALDDLDVVQGVSSVIRLEGTGL" CDS 1453155..1454237 /codon_start=1 /transl_table=11 /gene="thrC" /locus_tag="BQ2027_MB1327" /product="threonine synthase thrc (ts)" /note="Mb1327, thrC, len: 360 aa. Equivalent to Rv1295, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 360 aa overlap). Probable thrC, threonine synthase (EC 4.2.3.1) (see first citation below), highly similar to THRC_MYCLE|P45837 Mycobacterium leprae (360 aa), FASTA scores: opt: 2202, E(): 0, (93.9% identity in 359 aa overlap). Contains PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site. Protein product from Mb1327 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1327 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66903" /db_xref="InterPro:IPR000634" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR004450" /db_xref="InterPro:IPR026260" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/Swiss-Prot:P66903" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99930.1" /translation="MTVPPTATHQPWPGVIAAYRDRLPVGDDWTPVTLLEGGTPLIAA TNLSKQTGCTIHLKVEGLNPTGSFKDRGMTMAVTDALAHGQRAVLCASTGNTSASAAA YAARAGITCAVLIPQGKIAMGKLAQAVMHGAKIIQIDGNFDDCLELARKMAADFPTIS LVNSVNPVRIEGQKTAAFEIVDVLGTAPDVHALPVGNAGNITAYWKGYTEYHQLGLID KLPRMLGTQAAGAAPLVLGEPVSHPETIATAIRIGSPASWTSAVEAQQQSKGRFLAAS DEEILAAYHLVARVEGVFVEPASAASIAGLLKAIDDGWVARGSTVVCTVTGNGLKDPD TALKDMPSVSPVPVDPVAVVEKLGLA" CDS 1454455..1455405 /codon_start=1 /transl_table=11 /gene="thrB" /locus_tag="BQ2027_MB1328" /product="PROBABLE HOMOSERINE KINASE THRB" /note="Mb1328, thrB, len: 316 aa. Equivalent to Rv1296, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 316 aa overlap). Probable thrB, homoserine kinase (EC 2.7.1.39) (see citation below), highly similar to KHSE_MYCLE|P45836 from Mycobacterium leprae (314 aa), FASTA scores, opt: 1657, E(): 0, (82.0% identity in 311 aa overlap). Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, and PS00627 GHMP kinases putative ATP-binding domain. Protein product from Mb1328 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1328 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65225" /db_xref="InterPro:IPR000870" /db_xref="InterPro:IPR006203" /db_xref="InterPro:IPR006204" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR036554" /db_xref="UniProtKB/Swiss-Prot:P65225" /protein_id="SIT99931.1" /translation="MVTQALLPSGLVASAVVAASSANLGPGFDSVGLALSLYDEIIVE TTDSGLTVTVDGEGGDQVPLGPEHLVVRAVQHGLQAAGVSAAGLAVRCRNAIPHSRGL GSSAAAVVGGLAAVNGLVVQTDSSPSSDAELIQLASEFEGHPDNAAAAVLGGAVVSWT DHSGDRPNYSAVSLRLHPDIRLFTAIPEQRSSTAETRVLLPAQVSHDDARFNVSRAAL LVVALTERPDLLMAATEDLLHQPQRAAAMTASAEYLRLLRRHNVAAALSGAGPSLIAL STDSELPTDAVEFGAAKGFAVTELTVGEAVRWSPTVRVPG" CDS 1455662..1457470 /codon_start=1 /transl_table=11 /gene="rho" /locus_tag="BQ2027_MB1329" /product="PROBABLE TRANSCRIPTION TERMINATION FACTOR RHO HOMOLOG" /note="Mb1329, rho, len: 602 aa. Equivalent to Rv1297, len: 602 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 602 aa overlap). Probable rho, transcription termination factor homolog, highly similar to many e.g. RHO_MYCLE|P45835 Mycobacterium leprae (610 aa), FASTA scores: (81.5% identity in 612 aa overlap). CONTAINS 1 RNA RECOGNITION MOTIF (RRM). Protein product from Mb1329 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1329 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66029" /db_xref="InterPro:IPR000194" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004665" /db_xref="InterPro:IPR011112" /db_xref="InterPro:IPR011113" /db_xref="InterPro:IPR011129" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036269" /db_xref="InterPro:IPR041703" /db_xref="UniProtKB/Swiss-Prot:P66029" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99932.1" /translation="MTDTDLITAGESTDGKPSDAAATDPPDLNADEPAGSLATMVLPE LRALANRAGVKGTSGMRKNELIAAIEEIRRQANGAPAVDRSAQEHDKGDRPPSSEAPA TQGEQTPTEQIDSQSQQVRPERRSATREAGPSGSGERAGTAADDTDNRQGGQQDAKTE ERGTDAGGDQGGDQQASGGQQARGDEDGEARQGRRGRRFRDRRRRGERSGDGAEAELR EDDVVQPVAGILDVLDNYAFVRTSGYLPGPHDVYVSMNMVRKNGMRRGDAVTGAVRVP KEGEQPNQRQKFNPLVRLDSINGGSVEDAKKRPEFGKLTPLYPNQRLRLETSTERLTT RVIDLIMPIGKGQRALIVSPPKAGKTTILQDIANAITRNNPECHLMVVLVDERPEEVT DMQRSVKGEVIASTFDRPPSDHTSVAELAIERAKRLVEQGKDVVVLLDSITRLGRAYN NASPASGRILSGGVDSTALYPPKRFLGAARNIEEGGSLTIIATAMVETGSTGDTVIFE EFKGTGNAELKLDRKIAERRVFPAVDVNPSGTRKDELLLSPDEFAIVHKLRRVLSGLD SHQAIDLLMSQLRKTKNNYEFLVQVSKTTPGSMDSD" CDS 1457621..1457863 /codon_start=1 /transl_table=11 /gene="rpmE" /locus_tag="BQ2027_MB1330" /product="50s ribosomal protein l31 rpme" /note="Mb1330, rpmE, len: 80 aa. Equivalent to Rv1298, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 80 aa overlap). Probable rpmE, 50s ribosomal protein L31, highly similar to many e.g. RL31_MYCLE|P45834 50s ribosomal protein L31 from Mycobacterium leprae (84 aa), FASTA scores: opt: 490, E(): 5.5e-28, (89.6% identity in 77 aa overlap). Contains PS01143 Ribosomal protein L31 signature. BELONGS TO THE L31P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb1330 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1330 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66188" /db_xref="InterPro:IPR002150" /db_xref="InterPro:IPR027491" /db_xref="InterPro:IPR034704" /db_xref="InterPro:IPR042105" /db_xref="UniProtKB/Swiss-Prot:P66188" /protein_id="SIT99933.1" /translation="MKSDIHPAYEETTVVCGCGNTFQTRSTKPGGRIVVEVCSQCHPF YTGKQKILDSGGRVARFEKRYGKRKVGADKAVSTGK" CDS 1457953..1459026 /codon_start=1 /transl_table=11 /gene="prfA" /locus_tag="BQ2027_MB1331" /product="PROBABLE PEPTIDE CHAIN RELEASE FACTOR 1 PRFA (RF-1)" /note="Mb1331, prfA, len: 357 aa. Equivalent to Rv1299, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 357 aa overlap). Probable prfA, peptide chain release factor 1 (rf-1), highly similar to many e.g. RF1_MYCLE|P45833 peptide chain release factor 1 (rf-1) from Mycobacterium leprae (357 aa), FASTA scores: opt: 2047, E(): 0, (89.3% identity in 356 aa overlap); also similar to Mycobacterium tuberculosis Rv3105c, prfB peptide chain release factor 2. Contains PS00745 Prokaryotic-type class I peptide chain release factors signature. BELONGS TO THE PROKARYOTIC AND MITOCHONDRIAL RELEASE FACTORS FAMILY. Protein product from Mb1331 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1331 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66017" /db_xref="InterPro:IPR000352" /db_xref="InterPro:IPR004373" /db_xref="InterPro:IPR005139" /db_xref="UniProtKB/Swiss-Prot:P66017" /protein_id="SIT99934.1" /translation="MTQPVQTIDVLLAEHAELELALADPALHSNPAEARRVGRRFARL APIVATHRKLTSARDDLETARELVASDESFAAEVAALEARVGELDAQLTDMLAPRDPH DADDIVLEVKSGEGGEESALFAADLARMYIRYAERHGWAVTVLDETTSDLGGYKDATL AIASKADTPDGVWSRMKFEGGVHRVQRVPVTESQGRVHTSAAGVLVYPEPEEVGQVQI DESDLRIDVFRSSGKGGQGVNTTDSAVRITHLPTGIVVTCQNERSQLQNKTRALQVLA ARLQAMAEEQALADASADRASQIRTVDRSERIRTYNFPENRITDHRIGYKSHNLDQVL DGDLDALFDALSAADKQSRLRQS" CDS 1459023..1460000 /codon_start=1 /transl_table=11 /gene="hemK" /locus_tag="BQ2027_MB1332" /product="PROBABLE HEMK PROTEIN HOMOLOG HEMK" /note="Mb1332, hemK, len: 325 aa. Equivalent to Rv1300, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 325 aa overlap). Probable hemK protein homolog (EC 2.1.1.-), homology suggests translation may start at aa 22, highly similar to many e.g. HEMK_MYCLE|P45832 Mycobacterium leprae (288 aa), FASTA scores: opt: 936, E(): 0, (76.7% identity in 189 aa overlap). BELONGS TO THE HEMK FAMILY OF MODIFICATION METHYLASES. Mb1332 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYA7" /db_xref="InterPro:IPR002052" /db_xref="InterPro:IPR004556" /db_xref="InterPro:IPR019874" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR040758" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3XYA7" /protein_id="SIT99935.1" /translation="MTSAPATMRWGNLPLAGESGTMTLRQAIDLAAALLAEAGVDSAR CDAEQLAAHLAGTDRGRLPLFEPPGDEFFGRYRDIVTARARRVPLQHLIGTVSFGPVV LHVGPGVFVPRPETEAILAWATAQSLPARPLIVDACTGSGALAVALAQHRANLGLKAR IIGIDDSDCALDYARRNAAGTPVELVRADVTTPCLLPELDGQVDLMVSNPPYIPDAAV LEPEVAQHDPHHALFGGPDGMTVISAVVGLAGRWLRPGGLFAVEHDDTTSSSTVDLVS STKLFVDVQARKDLAGRPRFVTAMRWGHLPLAGENGAIDPRQRRCRAKR" CDS 1460016..1460669 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1333" /product="Threonylcarbamoyl-AMP synthase (EC" /EC_number="2.7.7.87" /note="Mb1333, -, len: 217 aa. Equivalent to Rv1301, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 217 aa overlap). Conserved hypothetical protein, highly similar to YRFE_MYCLE|P45831 hypothetical 22.7 kd protein in rfe-hemk intergenic region, (220 aa), FASTA scores: opt: 1168, E(): 0, (82.8% identity in 215 aa overlap). Contains PS01147 Hypothetical SUA5/yciO/yrdC family signature. BELONGS TO THE SUA5/YRDC/YCIO/YWLC FAMILY. Protein product from Mb1333 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1333 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXY6" /db_xref="InterPro:IPR006070" /db_xref="InterPro:IPR017945" /db_xref="UniProtKB/TrEMBL:A0A1R3XXY6" /protein_id="SIT99936.1" /translation="MTETFDCADPEQRSRGIVSAVGAIKAGQLVVMPTDTVYGIGADA FDSSAVAALLSAKGRGRDMPVGVLVGSWHTIEGLVYSMPDGARELIRAFWPGALSLVV VQAPSLQWDLGDAHGTVMLRMPLHPVAIELLREVGPMAVSSANISGHPPPVDAEQARS QLGDHVAVYLDAGPSEQQAGSTIVDLTGATPRVLRQGPVSTERIAEVLGVDAASLFG" CDS 1460753..1461967 /codon_start=1 /transl_table=11 /gene="rfe" /locus_tag="BQ2027_MB1334" /product="probable undecapaprenyl-phosphate alpha-n-acetylglucosaminyltransferase rfe (udp-glcnac transferase)" /note="Mb1334, rfe, len: 404 aa. Equivalent to Rv1302, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 404 aa overlap). Putative rfe, undecaprenyl-phosphate alpha-N-acetylglucosaminyltransferase (EC 2.4.1.-), equivalent to RFE_MYCLE|P45830 Mycobacterium leprae (398 aa), FASTA scores, opt: 2285, E(): 0, (89.2% identity in 398 aa overlap). Protein product from Mb1334 detected using SWATH mass spectrometry. Mb1334 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XXY8" /db_xref="InterPro:IPR000715" /db_xref="UniProtKB/TrEMBL:A0A1R3XXY8" /protein_id="SIT99937.1" /translation="MQYGLEVSSDVAGVAGGLLALSYRGAGVPLRELALVGLTAAIIT YFATGPVRMLASRLGAVAYPRERDVHVTPTPRMGGLAMFLGIVGAVFLASQLPALTRG FVYSTGMPAVLVAGAVIMGIGLIDDRWGLDALTKFAGQITAASVLVTMGVAWSVLYIP VGGVGTIVLDQASSILLTLALTVSIVNAMNFVDGLDGLAAGLGLITALAICMFSVGLL RDHGGDVLYYPPAVISVVLAGACLGFLPHNFHRAKIFMGDSGSMLIGLMLAAASTTAA GPISQNAYGARDVFALLSPFLLVVAVMFVPMLDLLLAIVRRTRAGRSAFSPDKMHLHH RLLQIGHSHRRVVLIIYLWVGIGAFGAASSIFFNPRDTAAVMLGAIVVAGVATLIPLL RRGDDYYDPDLD" CDS 1462224..1462709 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1335" /product="ATP synthase protein I" /note="Mb1335, -, len: 161 aa. Equivalent to Rv1303, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 161 aa overlap). Conserved hypothetical transmembrane protein, highly similar to P53431|Y02N_MYCLE hypothetical Mycobacterium leprae protein (153 aa), FASTA score: opt: 636, E():0, (69.8% identity in 149 aa overlap). Protein product from Mb1335 detected using SWATH mass spectrometry. Mb1335 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64802" /db_xref="InterPro:IPR005598" /db_xref="UniProtKB/Swiss-Prot:P64802" /protein_id="SIT99938.1" /translation="MTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLT VGMFLGLGLLLGLLNALLVRRSAESITAKEHPLKRSMALNSASRLAIITILGLIIAYI FRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQTGGSEGRSASD D" CDS 1462702..1463454 /codon_start=1 /transl_table=11 /gene="atpB" /locus_tag="BQ2027_MB1336" /product="PROBABLE ATP SYNTHASE A CHAIN ATPB (PROTEIN 6)" /note="Mb1336, atpB, len: 250 aa. Equivalent to Rv1304, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 250 aa overlap). Probable atpB, ATP synthase A chain (EC 3.6.3.14), highly similar to ATP6_MYCLE|P45829 Mycobacterium leprae (251 aa), FASTA scores: opt: 1382, E(): 0, (84.0% identity in 250 aa overlap). Contains PS00449 ATP synthase A subunit signature. SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE A CHAIN FAMILY. Protein product from Mb1336 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1336 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63655" /db_xref="InterPro:IPR000568" /db_xref="InterPro:IPR023011" /db_xref="InterPro:IPR035908" /db_xref="UniProtKB/Swiss-Prot:P63655" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99939.1" /translation="MTETILAAQIEVGEHHTATWLGMTVNTDTVLSTAIAGLIVIALA FYLRAKVTSTDVPGGVQLFFEAITIQMRNQVESAIGMRIAPFVLPLAVTIFVFILISN WLAVLPVQYTDKHGHTTELLKSAAADINYVLALALFVFVCYHTAGIWRRGIVGHPIKL LKGHVTLLAPINLVEEVAKPISLSLRLFGNIFAGGILVALIALFPPYIMWAPNAIWKA FDLFVGAIQAFIFALLTILYFSQAMELEEEHH" CDS 1463503..1463748 /codon_start=1 /transl_table=11 /gene="atpE" /locus_tag="BQ2027_MB1337" /product="PROBABLE ATP SYNTHASE C CHAIN ATPE (LIPID-BINDING PROTEIN) (DICYCLOHEXYLCARBODIIMIDE-BINDING PROTEIN)" /note="Mb1337, atpE, len: 81 aa. Equivalent to Rv1305, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 81 aa overlap). Probable atpE, ATP synthase C chain (EC 3.6.3.14), highly similar to P45828|ATPL_MYCLE Mycobacterium leprae (92.6% identity in 81 aa overlap). Contains PS00605 ATP synthase C subunit signature. SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE C CHAIN FAMILY. Protein product from Mb1337 detected using shotgun mass spectrometry. Mb1337 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63692" /db_xref="InterPro:IPR000454" /db_xref="InterPro:IPR002379" /db_xref="InterPro:IPR005953" /db_xref="InterPro:IPR020537" /db_xref="InterPro:IPR035921" /db_xref="InterPro:IPR038662" /db_xref="UniProtKB/Swiss-Prot:P63692" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99940.1" /translation="MDPTIAAGALIGGGLIMAGGAIGAGIGDGVAGNALISGVARQPE AQGRLFTPFFITVGLVEAAYFINLAFMALFVFATPVK" CDS 1463779..1464294 /codon_start=1 /transl_table=11 /gene="atpF" /locus_tag="BQ2027_MB1338" /product="PROBABLE ATP SYNTHASE B CHAIN ATPF" /note="Mb1338, atpF, len: 171 aa. Equivalent to Rv1306, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 171 aa overlap). Probable atpF, ATP synthase B chain (EC 3.6.3.14), highly similar to ATPF_MYCLE P45827 (170 aa), FASTA scores, opt: 802, E(): 0, (79.5% identity in 171 aa overlap). SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE -AND CF(0) -THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE B CHAIN FAMILY. Protein product from Mb1338 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1338 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63657" /db_xref="InterPro:IPR002146" /db_xref="InterPro:IPR028987" /db_xref="UniProtKB/Swiss-Prot:P63657" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99941.1" /translation="MGEVSAIVLAASQAAEEGGESSNFLIPNGTFFVVLAIFLVVLAV IGTFVVPPILKVLRERDAMVAKTLADNKKSDEQFAAAQADYDEAMTEARVQASSLRDN ARADGRKVIEDARVRAEQQVASTLQTAHEQLKRERDAVELDLRAHVGTMSATLASRIL GVDLTASAATR" CDS 1464301..1465641 /codon_start=1 /transl_table=11 /gene="atpH" /locus_tag="BQ2027_MB1339" /product="PROBABLE ATP SYNTHASE DELTA CHAIN ATPH" /note="Mb1339, atpH, len: 446 aa. Equivalent to Rv1307, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 446 aa overlap). Probable atpH, ATP synthase delta chain (EC 3.6.3.14). This protein is much longer than that of other bacterial delta chains, the C-terminal region is homologous to delta chains while the N-terminal region is similar to B/B' subunits e.g. ATPD_STRLI|P50008 ATP synthase delta chain from Streptomyces lividans (273 aa), FASTA scores: opt: 505, E(): 5.4e-23, (35.0% identity in 277 aa overlap); and ATPF_HAEIN|P43720 ATP synthase B chain (EC 3.6.1.34) from Haemophilus influenzae (156 aa), FASTA scores: opt: 216, E(): 1.2e-06, (26.1% identity in 153 aa overlap). SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE DELTA CHAIN FAMILY. Protein product from Mb1339 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1339 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A501" /db_xref="InterPro:IPR000711" /db_xref="InterPro:IPR002146" /db_xref="InterPro:IPR005864" /db_xref="InterPro:IPR028987" /db_xref="UniProtKB/Swiss-Prot:P0A501" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99942.1" /translation="MSTFIGQLFGFAVIVYLVWRFIVPLVGRLMSARQDTVRQQLADA AAAADRLAEASQAHTKALEDAKSEAHRVVEEARTDAERIAEQLEAQADVEAERIKMQG ARQVDLIRAQLTRQLRLELGHESVRQARELVRNHVADQAQQSATVDRFLDQLDAMAPA TADVDYPLLAKMRSASRRALTSLVDWFGTMAQDLDHQGLTTLAGELVSVARLLDREAV VTRYLTVPAEDATPRIRLIERLVSGKVGAPTLEVLRTAVSKRWSANSDLIDAIEHVSR QALLELAERAGQVDEVEDQLFRFSRILDVQPRLAILLGDCAVPAEGRVRLLRKVLERA DSTVNPVVVALLSHTVELLRGQAVEEAVLFLAEVAVARRGEIVAQVGAAAELSDAQRT RLTEVLSRIYGHPVTVQLHIDAALLGGLSIAVGDEVIDGTLSSRLAAAEARLPD" CDS 1465686..1467335 /codon_start=1 /transl_table=11 /gene="atpA" /locus_tag="BQ2027_MB1340" /product="PROBABLE ATP SYNTHASE ALPHA CHAIN ATPA" /note="Mb1340, atpA, len: 549 aa. Equivalent to Rv1308, len: 549 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 549 aa overlap). Probable atpA, ATP synthase alpha chain (EC 3.6.3.14), highly similar to ATPA_MYCLE|P45825 from Mycobacterium leprae (558 aa), FASTA scores: opt: 3233, E(): 0, (90.3% identity in 547 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, PS00152 ATP synthase alpha and beta subunits signature. SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE ALPHA/BETA CHAINS FAMILY. Protein product from Mb1340 detected using shotgun mass spectrometry. Mb1340 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63674" /db_xref="InterPro:IPR000194" /db_xref="InterPro:IPR000793" /db_xref="InterPro:IPR004100" /db_xref="InterPro:IPR005294" /db_xref="InterPro:IPR020003" /db_xref="InterPro:IPR023366" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR033732" /db_xref="InterPro:IPR036121" /db_xref="InterPro:IPR038376" /db_xref="UniProtKB/Swiss-Prot:P63674" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99943.1" /translation="MAELTIPADDIQSAIEEYVSSFTADTSREEVGTVVDAGDGIAHV EGLPSVMTQELLEFPGGILGVALNLDEHSVGAVILGDFENIEEGQQVKRTGEVLSVPV GDGFLGRVVNPLGQPIDGRGDVDSDTRRALELQAPSVVHRQGVKEPLQTGIKAIDAMT PIGRGQRQLIIGDRKTGKTAVCVDTILNQRQNWESGDPKKQVRCVYVAIGQKGTTIAA VRRTLEEGGAMDYTTIVAAAASESAGFKWLAPYTGSAIAQHWMYEGKHVLIIFDDLTK QAEAYRAISLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDDLGGGSLTGLPIIETKA NDISAYIPTNVISITDGQCFLETDLFNQGVRPAINVGVSVSRVGGAAQIKAMKEVAGS LRLDLSQYRELEAFAAFASDLDAASKAQLERGARLVELLKQPQSQPMPVEEQVVSIFL GTGGHLDSVPVEDVRRFETELLDHMRASEEEILTEIRDSQKLTEEAADKLTEVIKNFK KGFAATGGGSVVPDEHVEALDEDKLAKEAVKVKKPAPKKKK" CDS 1467342..1468259 /codon_start=1 /transl_table=11 /gene="atpG" /locus_tag="BQ2027_MB1341" /product="PROBABLE ATP SYNTHASE GAMMA CHAIN ATPG" /note="Mb1341, atpG, len: 305 aa. Equivalent to Rv1309, len: 305 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 305 aa overlap). Probable atpG, ATP synthase gamma chain (EC 3.6.3.14), highly similar to ATPG_MYCLE|P45824 ATP synthase gamma chain from Mycobacterium leprae (298 aa), FASTA scores: opt: 1579, E():0, (83.9% identity in 305 aa overlap). Contains PS00153 ATP synthase gamma subunit signature. SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE GAMMA CHAIN FAMILY. Protein product from Mb1341 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1341 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63672" /db_xref="InterPro:IPR000131" /db_xref="InterPro:IPR023632" /db_xref="InterPro:IPR035968" /db_xref="UniProtKB/Swiss-Prot:P63672" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99944.1" /translation="MAATLRELRGRIRSAGSIKKITKAQELIATSRIARAQARLESAR PYAFEITRMLTTLAAEAALDHPLLVERPEPKRAGVLVVSSDRGLCGAYNANIFRRSEE LFSLLREAGKQPVLYVVGRKAQNYYSFRNWNITESWMGFSEQPTYENAAEIASTLVDA FLLGTDNGEDQRSDSGEGVDELHIVYTEFKSMLSQSAEAHRIAPMVVEYVEEDIGPRT LYSFEPDATMLFESLLPRYLTTRVYAALLESAASELASRQRAMKSATDNADDLIKALT LMANRERQAQITQEISEIVGGANALAEAR" CDS 1468299..1469759 /codon_start=1 /transl_table=11 /gene="atpD" /locus_tag="BQ2027_MB1342" /product="PROBABLE ATP SYNTHASE BETA CHAIN ATPD" /note="Mb1342, atpD, len: 486 aa. Equivalent to Rv1310, len: 486 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 486 aa overlap). Probable atpD, ATP synthase beta chain (EC 3.6.3.14), highly similar to ATPB_MYCLE|P45823 Mycobacterium leprae (485 aa), FASTA score: opt: 2916, E(): 0, (92.6% identity in 484 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, PS00152 ATP synthase alpha and beta subunits signature. SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE ALPHA/BETA CHAINS FAMILY. Protein product from Mb1342 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1342 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63678" /db_xref="InterPro:IPR000194" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004100" /db_xref="InterPro:IPR005722" /db_xref="InterPro:IPR020003" /db_xref="InterPro:IPR024034" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036121" /db_xref="UniProtKB/Swiss-Prot:P63678" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99945.1" /translation="MTTTAEKTDRPGKPGSSDTSGRVVRVTGPVVDVEFPRGSIPELF NALHAEITFESLAKTLTLEVAQHLGDNLVRTISLQPTDGLVRGVEVIDTGRSISVPVG EGVKGHVFNALGDCLDEPGYGEKFEHWSIHRKPPAFEELEPRTEMLETGLKVVDLLTP YVRGGKIALFGGAGVGKTVLIQEMINRIARNFGGTSVFAGVGERTREGNDLWVELAEA NVLKDTALVFGQMDEPPGTRMRVALSALTMAEWFRDEQGQDVLLFIDNIFRFTQAGSE VSTLLGRMPSAVGYQPTLADEMGELQERITSTRGRSITSMQAVYVPADDYTDPAPATT FAHLDATTELSRAVFSKGIFPAVDPLASSSTILDPSVVGDEHYRVAQEVIRILQRYKD LQDIIAILGIDELSEEDKQLVNRARRIERFLSQNMMAAEQFTGQPGSTVPVKETIEAF DRLCKGDFDHVPEQAFFLIGGLDDLAKKAESLGAKL" CDS 1469773..1470138 /codon_start=1 /transl_table=11 /gene="atpC" /locus_tag="BQ2027_MB1343" /product="PROBABLE ATP SYNTHASE EPSILON CHAIN ATPC" /note="Mb1343, atpC, len: 121 aa. Equivalent to Rv1311, len: 121 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 121 aa overlap). Probable atpC, ATP synthase epsilon chain (EC 3.6.3.14), highly similar to ATPE_MYCLE|P45822 Mycobacterium leprae (124 aa), FASTA scores: opt: 682, E(): 5.4e-40, (87.6% identity in 121 aa overlap). SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE - AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS: ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN SUBUNITS: A, B AND C. BELONGS TO THE ATPASE EPSILON CHAIN FAMILY. Protein product from Mb1343 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1343 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63663" /db_xref="InterPro:IPR001469" /db_xref="InterPro:IPR020546" /db_xref="InterPro:IPR036771" /db_xref="UniProtKB/Swiss-Prot:P63663" /protein_id="SIT99946.1" /translation="MAELNVEIVAVDRNIWSGTAKFLFTRTTVGEIGILPRHIPLVAQ LVDDAMVRVEREGEKDLRIAVDGGFLSVTEEGVSILAESAEFESEIDEAAAKQDSESD DPRIAARGRARLRAVGAID" CDS 1470146..1470580 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1344" /product="CONSERVED HYPOTHETICAL SECRETED PROTEIN" /note="Mb1344, -, len: 144 aa. Equivalent to Rv1312, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 142 aa overlap). Conserved hypothetical secreted protein with potential N-terminal signal sequence. Highly similar to P53432|Y02W_MYCLE hypothetical Mycobacterium leprae protein (147 aa), FASTA score: opt: 884, E(): 0, (88.4% identity in 147 aa overlap). N-terminus hydrophobic. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 2 bp deletion (cg-*) at the 3' end leads to a slightly shorter product compared to Mycobacterium tuberculosis strain H37Rv (144 aa versus 147 aa). Protein product from Mb1344 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1344 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XXZ8" /db_xref="InterPro:IPR019675" /db_xref="UniProtKB/TrEMBL:A0A1R3XXZ8" /protein_id="SIT99947.1" /translation="MSVPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAV GGHGWRHGVIRYRGGEAAFYRLSSLRLWPDRRLSRRGVEIISRRAPRGDEFDIMTDEI VVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRAPP" mobile_element complement(1470598..1472097) /mobile_element_type="insertion sequence:IS1557" /locus_tag="BQ2027_IS1557-2" /note="IS1557-2, len: 1498 nt. Equivalent to IS1557, len: 1509 nt, from Mycobacterium tuberculosis strain H37Rv,(99.4% identity in 785 nt overlap). IS1557-2nd copy." repeat_region 1470598..1470617 /rpt_type=INVERTED /note="20 bp imperfect inverted repeat, IRR,AGCAGACGCAAAAGCCCCCA, flanking IS element IS1557." gene complement(1470598..1472097) /locus_tag="BQ2027_IS1557-2" CDS complement(1470627..1471310) /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1345C" /note="Mb1345c, -, len: 243 aa. Equivalent to 3' end of Rv1313c, len: 444 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 243 aa overlap). Possible IS1557 transposase, similar to several transposases e.g. U57649|DBU57649 ORF1 from dibenzofuran-degrading bacterium DPO360 (163 aa), FASTA scores: opt: 767, E(): 0, (67.3% identity in 168 aa overlap); TNPA_BORPA|Q06126 transposase for insertion sequence element IS1001 from Bordetella parapertussis (406 aa), FASTA scores: opt: 254, E(): 3.3e-10, (24.9% identity in 402 aa overlap). Also similar to putative Mycobacterium tuberculosis transposases, Rv3798 and Rv0741. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1313c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 12 bp to 1 bp substitution (cttgtcgtggcc-t) splits Rv1313c into 2 parts, Mb1345c and Mb1346c.;POSSIBLE TRANSPOSASE [SECOND PART]" CDS complement(1471333..1471950) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1346C" /product="POSSIBLE TRANSPOSASE [FIRST PART]" /note="Mb1346c, -, len: 205 aa. Equivalent to 5' end of Rv1313c, len: 444 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 195 aa overlap). Possible IS1557 transposase, similar to several transposases e.g. U57649|DBU57649 ORF1 from dibenzofuran-degrading bacterium DPO360 (163 aa), FASTA scores: opt: 767, E(): 0, (67.3% identity in 168 aa overlap); TNPA_BORPA|Q06126 transposase for insertion sequence element IS1001 from Bordetella parapertussis (406 aa), FASTA scores: opt: 254, E(): 3.3e-10, (24.9% identity in 402 aa overlap). Also similar to putative Mycobacterium tuberculosis transposases, Rv3798 and Rv0741. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1313c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 12 bp to 1 bp substitution (cttgtcgtggcc-t) splits Rv1313c into 2 parts, Mb1345c and Mb1346c. Mb1346c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002560" /db_xref="InterPro:IPR029261" /db_xref="InterPro:IPR032877" /db_xref="UniProtKB/TrEMBL:A0A0G2QBZ2" /protein_id="SIT99949.1" /translation="MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSA VLRRCGRCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVPWA RHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADTEKRIDRFANL RRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPSHPGLVLRCPGR" repeat_region complement(1472078..1472097) /rpt_type=INVERTED /note="20 bp imperfect inverted repeat, IRL,AGCAGACGCGAAAGCCCCCA, flanking IS element IS1557." CDS complement(1472116..1472697) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1347C" /product="ATP:Cob(I)alamin adenosyltransferase (EC" /EC_number="2.5.1.17" /note="Mb1347c, -, len: 193 aa. Equivalent to Rv1314c, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 193 aa overlap). Conserved hypothetical protein, highly similar to P53523|Y02Y_MYCLE hypothetical Mycobacterium leprae protein (191 aa), FASTA score: opt:1019, E(): 0, (81.2% identity in 191 aa overlap). Some similarity with YDHW_CITFR|P45515 hypothetical 19.8 kd protein in dhar-dhat intergenic region (176 aa), FASTA scores: opt: 297, E(): 1.6e-13, (37.6% identity in 178 aa overlap). Also similar to hypothetical protein AE002007|AE002007_3 Deinococcus radiodurans (185 aa), FASTA score: opt: 386, E(): 7.7e-19, (42.4% identity in 172 aa overlap). Protein product from Mb1347c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1347c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64804" /db_xref="InterPro:IPR016030" /db_xref="InterPro:IPR029499" /db_xref="InterPro:IPR036451" /db_xref="UniProtKB/Swiss-Prot:P64804" /protein_id="SIT99950.1" /translation="MAVHLTRIYTRTGDDGTTGLSDMSRVAKTDARLVAYADCDEANA AIGAALALGHPDTQITDVLRQIQNDLFDAGADLSTPIVENPKHPPLRIAQSYIDRLEG WCDAYNAGLPALKSFVLPGGSPLSALLHVARTVVRRAERSAWAAVDAHPEGVSVLPAK YLNRLSDLLFILSRVANPDGDVLWRPGGDRTAS" CDS 1472766..1474022 /codon_start=1 /transl_table=11 /gene="murA" /locus_tag="BQ2027_MB1348" /product="PROBABLE UDP-N-ACETYLGLUCOSAMINE 1-CARBOXYVINYLTRANSFERASE MURA" /note="Mb1348, murA, len: 418 aa. Equivalent to Rv1315, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 418 aa overlap). Probable murA, UDP-N-acetylglucosamine 1-carboxyvinyltransferase (EC 2.5.1.7), highly similar to many e.g. MURA_MYCLE|P45821 (418 aa), FASTA scores: opt: 2495, E(): 0, (96.2% identity in 396 aa overlap). BELONGS TO THE EPSP SYNTHASE FAMILY. MURA SUBFAMILY. Protein product from Mb1348 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1348 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5L3" /db_xref="InterPro:IPR001986" /db_xref="InterPro:IPR005750" /db_xref="InterPro:IPR013792" /db_xref="InterPro:IPR036968" /db_xref="UniProtKB/Swiss-Prot:P0A5L3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99951.1" /translation="MAERFVVTGGNRLSGEVAVGGAKNSVLKLMAATLLAEGTSTITN CPDILDVPLMAEVLRGLGATVELDGDVARITAPDEPKYDADFAAVRQFRASVCVLGPL VGRCKRARVALPGGDAIGSRPLDMHQAGLRQLGAHCNIEHGCVVARAETLRGAEIQLE FPSVGATENILMAAVVAEGVTTIHNAAREPDVVDLCTMLNQMGAQVEGAGSPTMTITG VPRLHPTEHRVIGDRIVAATWGIAAAMTRGDISVAGVDPAHLQLVLHKLHDAGATVTQ TDASFRVTQYERPKAVNVATLPFPGFPTDLQPMAIALASIADGTSMITENVFEARFRF VEEMIRLGADARTDGHHAVVRGLPQLSSAPVWCSDIRAGAGLVLAGLVADGDTEVHDV FHIDRGYPLFVENLVSLGAEIERVCC" rRNA 1474291..1475827 /gene="rrs" /locus_tag="BQ2027_RRS" /product="ribosomal RNA 16S" /note="rrs, len: 1537 nt. Equivalent to rrs, len: 1537 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1537 nt overlap). 16s rRNA gene." rRNA 1476103..1479240 /gene="rrl" /locus_tag="BQ2027_RRL" /product="ribosomal RNA 23S" /note="rrl, len: 3138 nt. Equivalent to rrl, len: 3138 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 3138 nt overlap). 23S rRNA gene (approximate coordinates)." rRNA 1479344..1479458 /gene="rrf" /locus_tag="BQ2027_RRF" /product="ribosomal RNA 5S" /note="rrf, len: 115 nt. Equivalent to rrf, len: 115 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 nt overlap). 5S rRNA gene. Identical to Em_ba:MT5SRR, D10035 M.tuberculosis 5S rRNA, len: 116 nt." CDS complement(1479578..1480075) /codon_start=1 /transl_table=11 /gene="ogt" /locus_tag="BQ2027_MB1349C" /product="methylated-dna--protein-cysteine methyltransferase ogt (6-o-methylguanine-dna methyltransferase) (o-6-methylguanine-dna- alkyltransferase)" /note="Mb1349c, ogt, len: 165 aa. Equivalent to Rv1316c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 165 aa overlap). Probable ogt, methylated-dna--protein-cysteine methytransferase (EC 2.1.1.63), similar to many e.g. OGT_HAEIN|P44687 Haemophilus influenzae (190 aa), FASTA scores: opt: 405, E(): 6.5e-20, (41.9% identity in 155 aa overlap). Contains PS00374 Methylated-DNA--protein-cysteine methyltransferase active site. Protein product from Mb1349c detected using SWATH mass spectrometry. Mb1349c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A697" /db_xref="InterPro:IPR001497" /db_xref="InterPro:IPR008332" /db_xref="InterPro:IPR014048" /db_xref="InterPro:IPR023546" /db_xref="InterPro:IPR036217" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036631" /db_xref="UniProtKB/Swiss-Prot:P0A697" /protein_id="SIT99952.1" /translation="MIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTPDP GAFSGAVDQLNAYFAGELTEFDVELDLRGTDFQQRVWKALLTIPYGETRSYGEIADQI GAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGGINRKRALLELEKSRAPAD LTLFD" CDS complement(1480072..1481322) /codon_start=1 /transl_table=11 /gene="alkAb" /locus_tag="BQ2027_MB1350C" /product="PUTATIVE ADA REGULATORY PROTEIN ALKAb [SECOND PART] (Regulatory protein of adaptative response) : Methylated-DNA--protein-cysteine methyltransferase (O-6-methylguanine-DNA alkyltransferase)" /note="Mb1350c, alkAb, len: 416 aa. Equivalent to the 3' end of Rv1317c, len: 496 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 416 aa overlap). Putative alkA (alternate gene name: ada), regulatory protein (EC 2.1.1.63), similar to 3MG2_ECOLI|P04395 dna-3-methyladenine glycosidase II from Escherichia coli (282 aa), FASTA scores, opt: 437, E(): 8.6e-22, (32.8% identity in 293 aa overlap), also similar to other ada proteins e.g. ADA_SALTY|P26189 Salmonella typhimurium (352 aa), FASTA scores: E(): 5.3e-08, (35.9% identity in 156 aa overlap). Contains PS00041 Bacterial regulatory proteins, araC family signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, alkA exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) introduces a premature stop codon that splits alkA into 2 parts, alkAa and alkAb. Mb1350c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y043" /db_xref="InterPro:IPR003265" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR010316" /db_xref="InterPro:IPR011257" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR018062" /db_xref="InterPro:IPR023170" /db_xref="InterPro:IPR037046" /db_xref="UniProtKB/TrEMBL:A0A1R3Y043" /protein_id="SIT99953.1" /translation="MRSDVVARAMRLIADGTVDRDGVSGLAAQLGYTIRQLERLLQAV VGAGPLALARAQRMQTARVLIETTNLPFGDVAFAAGFSSIRQFNDTVRLACDGTPTAL RARAAARFESATASAGTVSLRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLR LPWGNGIVSLTPAPDHVRCLLVLDDFRDLMTATARCRRLLDLDADPEAIVEALGADPD LRAVVGKAPGQRIPRTVDEAEFAVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGA LTHTFPSIEQLAEIDPGHLAVPKARQRTINALVASLADKSLVLDAGCDWQRARGQLLA LPGVGPWTAEVIAMRGLGDPDAFPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRS YATQHLWTTLEHPVNQWPPQEKIA" CDS complement(1481326..1481562) /codon_start=1 /transl_table=11 /gene="alkAa" /locus_tag="BQ2027_MB1351C" /product="PUTATIVE ADA REGULATORY PROTEIN ALKAa [FIRST PART] (Regulatory protein of adaptative response) : Methylated-DNA--protein-cysteine methyltransferase (O-6-methylguanine-DNA alkyltransferase)" /note="Mb1351c, alkAa, len: 78 aa. Equivalent to the 5' end of Rv1317c, len: 496 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 78 aa overlap). Putative alkA (alternate gene name: ada), regulatory protein (EC 2.1.1.63), similar to 3MG2_ECOLI|P04395 dna-3-methyladenine glycosidase II from Escherichia coli (282 aa), FASTA scores, opt: 437, E(): 8.6e-22, (32.8% identity in 293 aa overlap), also similar to other ada proteins e.g. ADA_SALTY|P26189 Salmonella typhimurium (352 aa), FASTA scores: E(): 5.3e-08, (35.9% identity in 156 aa overlap). Contains PS00041 Bacterial regulatory proteins, araC family signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, alkA exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) introduces a premature stop codon that splits alkA into 2 parts, alkAa and alkAb. Mb1351c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYZ1" /db_xref="InterPro:IPR004026" /db_xref="InterPro:IPR035451" /db_xref="UniProtKB/TrEMBL:A0A1R3XYZ1" /protein_id="SIT99954.1" /translation="MHDDFERCYRAVQSKDARFDGWFVVAVLTTGVYCRPSCPVRPPF ARNVRFLPTAAAAQGEGFRACKRCRPDASPGSPE" CDS complement(1481643..1483268) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1352C" /product="POSSIBLE ADENYLATE CYCLASE (ATP PYROPHOSPHATE-LYASE) (ADENYLYL CYCLASE)" /note="Mb1352c, -, len: 541 aa. Equivalent to Rv1318c, len: 541 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 541 aa overlap). Possible adenylate cyclase (EC 4.6.1.1). Some similarity at the c-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores, opt: 270, E(): 2.5e-11, (28.8% identity in 184 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 2505, E(): 0, (71.0% identity in 534 aa overlap), also similar to Rv1320c|MTCY130.05c (567 aa), FASTA scores, opt: 2423, E(): 0, (68.7% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY. Mb1352c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63528" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/Swiss-Prot:P63528" /protein_id="SIT99955.1" /translation="MSAKKSTAQRLGRVLETVTRQSGRLPETPAYGSWLLGRVSESQR RRRVRIQVMLTALVVTANLLGIGVALLLVTIAIPEPSIVRDTPRWLTFGVVPGYVLLA LALGSYALTRQTVQALRWAIEGRKPTREEERRTFLAPWRVAVGHLMFWGVGTALLTTL YGLINNAFIPRFLFAVSFCGVLVATATYLHTEFALRPFAAQALEAGPPPRRLAPGILG RTMVVWLLGSGVPVVGIALMAMFEMVLLNLTRMQFATGVLIISMVTLVFGFILMWILA WLTATPVRVVRAALRRVERGELRTNLVVFDGTELGELQRGFNAMVAGLRERERVRDLF GRHVGREVAAAAERERSKLGGEERHVAVVFIDIVGSTQLVTSRPPADVVKLLNKFFAI VVDEVDRHHGLVNKFEGDASLTIFGAPNRLPCPEDKALAAARAIADRLVNEMPECQAG IGVAAGQVIAGNVGARERFEYTVIGEPVNEAARLCELAKSRPGKLLASAQAVDAASEE ERARWSLGRHVKLRGHDQPVRLAKPVGLTKPRR" CDS complement(1483338..1484945) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1353C" /product="POSSIBLE ADENYLATE CYCLASE (ATP PYROPHOSPHATE-LYASE) (ADENYLYL CYCLASE)" /note="Mb1353c, -, len: 535 aa. Equivalent to Rv1319c, len: 535 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 535 aa overlap). Possible adenylate cyclase (EC 4.6.1.1). Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 254, E(): 2.4e-10, (33.3% identity in 144 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1318c|MTCY130.03c (541 aa), FASTA scores: opt: 2505, E(): 0, (71.0% identity in 534 aa overlap); Rv1320c|MTCY130.05c (567 aa), FASTA scores: opt: 2354, E(): 0, (66.3% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY. Protein product from Mb1353c detected using SWATH mass spectrometry. Mb1353c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Y3" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/Swiss-Prot:P0A4Y3" /protein_id="SIT99956.1" /translation="MPAKKTMAQRLGQALETMTRQCGQLPETPAYGSWLLGRVSESPS RRWVRIKRIVTVYIMTANLTGIVVALLVVTFAFPVPSIYTDAPWWVTFGVAPAYATLA LAIGTYWITTRIVRASIRWAIEERAPSQADGRNTLLLPFRVAAVHLILWDIGGALLAT LYGLANRVFVTIILFSVTICGVLVATNCYLFTEFALRPVAAKALEAGRPPRRFAPGIM GRTMTVWSLGSGVPVTGIATTALYVLLVHNLTETQLASAVLILSITTLIFGFLVMWIL AWLTAAPVRVVRAALKRVEQGDLRGDLVVFDGTELGELQRGFNAMVNGLRERERVRDL FGRHVGREVAAAAERERPQLGGEDRHAAVVFVDIVGSTQLVDNQPAAHVVKLLNRFFA IVVNEVDRHHGLINKFAGDAALAIFGAPNRLDRPEDAALAAARAIADRLANEMPEVQA GIGVAAGQIVAGNVGAKQRFEYTVVGKPVNQAARLCELAKSHPARLLASSDTLHAASE TERAHWSLGETVTLRGHEQPTRLAVPT" CDS complement(1484958..1486661) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1354C" /product="POSSIBLE ADENYLATE CYCLASE (ATP PYROPHOSPHATE-LYASE) (ADENYLYL CYCLASE)" /note="Mb1354c, -, len: 567 aa. Equivalent to Rv1320c, len: 567 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 567 aa overlap). Possible adenylate cyclase (EC 4.6.1.1) (see second citation below). Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 277, E(): 2e-12, (34.0% identity in 156 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1318c|MTCY130.03c (541 aa), FASTA scores: opt: 2423, E(): 0, (68.7% identity in 534 aa overlap); Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 2354, E(): 0, (66.3% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY. Protein product from Mb1354c detected using SWATH mass spectrometry. Mb1354c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59972" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/Swiss-Prot:P59972" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99957.1" /translation="MPSEKATTRHLPGAVETLSPRTGRRPETPAYGSWLLGRVSESPR MRRVRIQGMLTVAILVTNVIGLIVGAMLLTVAFPKPSVILDAPHWVSFGIVPGYCVLA FILGTYWLTRQTARALRWAIEERTPSHDEARSAFLVPLRVALAVLFLWGAAAALWTII YGLANRLFIPRFLFSMGVIGVVAATSCYLLTEFALRPMAAQALEVGATPRSLVRGIVG RTMLVWLLCSGVPNVGVALTAIFDDTFWELSNDQFMITVLILWAPLLIFGFILMWILA WLTATPVRVVREALNRVEQGDLSGDLVVFDGTELGELQRGFNRMVEGLRERERVRDLF GRHVGREVAAAAERERPKLGGEERHVAVVFVDIVGSTQLVTSRPAAEVVMLLNRFFTV IVDEVNHHRGLVNKFQGDASLAVFGAPNRLSHPEDAALATARAIADRLASEMPECQAG IGVAAGQVVAGNVGAHERFEYTVIGEPVNEAARLCELAKSYPSRLLASSQTLRGASEN ECARWSLGETVTLRGHDQPIRLASPVQQLQMPAQSADIVGGALGDHQTHTIYRGAHPT D" CDS 1486723..1487403 /codon_start=1 /transl_table=11 /gene="nucS" /locus_tag="BQ2027_MB1355" /product="Endonuclease NucS" /note="Mb1355, -, len: 226 aa. Equivalent to Rv1321, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 226 aa overlap). Conserved hypothetical protein. Equivalent to P53524|YD21_MYCLE hypothetical protein from Mycobacterium leprae (201 aa), FASTA scores: opt: 1144, E(): 0, (87.6% identity in 193 aa overlap). Some similarity to hypothetical proteins from other organisms e.g. Y225_METJA|Q57678 Methanococcus jannaschii (263 aa), FASTA scores: E(): 6.5e-05, (25.0% identity in 212 aa overlap). Protein product from Mb1355 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1355 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59979" /db_xref="InterPro:IPR002793" /db_xref="InterPro:IPR011856" /db_xref="UniProtKB/Swiss-Prot:P59979" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99958.1" /translation="MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADD RAYKPLNWMSPPCWLTEESGGQAPVWVVENKAGEQLRITIEGIEHDSSHELGVDPGLV KDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCRDERGGSVAVEIKRRGE IDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILATDRGIRCLTLDYDTMRGM DSGEYRLF" CDS 1487426..1487722 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1356" /product="ATP/GTP-binding protein" /note="Mb1356, -, len: 98 aa. Equivalent to Rv1322, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 98 aa overlap). Conserved hypothetical protein. Mb1356 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64806" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99959.1" /translation="MARRRKPLHRQRPEPPSWALRRVEAGPDGHEYEVRPVAAARAVK TYRCPGCDHEIRSGTAHVVVWPTDLPQAGVDDRRHWHTPCWANRATRGPTRKWT" CDS complement(1487757..1488215) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1357C" /product="Methylmalonyl-CoA epimerase (EC @ Ethylmalonyl-CoA epimerase" /EC_number="5.1.99.1" /note="Mb1357c, -, len: 152 aa. Equivalent to Rv1322A, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 152 aa overlap). Conserved hypothetical protein, similar to proteins from Mycobacterium leprae and Streptomyces coelicolor e.g. AL583921_2|ML1157 from M. leprae strain TN (155 aa), FASTA scores: opt: 771, E(): 5.1e-43, (75.3% identity in 154 aa overlap); and AL137242_2 from S. coelicolor (146 aa), FASTA scores: opt: 404, E(): 2e-19, (43.165% identity in 139 aa overlap). Protein product from Mb1357c detected using shotgun mass spectrometry. Mb1357c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017515" /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="UniProtKB/TrEMBL:A0A1R3XY23" /protein_id="SIT99960.1" /translation="MMTTDQVHARHMLATSLVTGLDHVGIAVADLDVAIEWYHDHLGM ILVHEEINDDQGIREALLAVPGSAAQIQLMAPLDESSVIAKFLDKRGPGIQQLACRVS DLDAMCRRLRSQGVRLVYETARRGTANSRINFIHPKDAGGVLIELVEPAP" CDS 1488306..1489475 /codon_start=1 /transl_table=11 /gene="fadA4" /locus_tag="BQ2027_MB1358" /product="PROBABLE ACETYL-COA ACETYLTRANSFERASE FADA4 (ACETOACETYL-COA THIOLASE)" /note="Mb1358, fadA4, len: 389 aa. Equivalent to Rv1323, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 389 aa overlap). Probable fadA4, acetyl-CoA acetyltransferase (EC 2.3.1.9), equivalent to THIL_MYCLE|P46707 possible acetyl-CoA C-acetyltransferase from Mycobacterium leprae (393 aa), FASTA scores: opt: 2218, E(): 0, (87.0% identity in 392 aa overlap). Also highly similar to others e.g. CAB70629.1|AL137242 probable acetoacetyl-coA thiolase from Streptomyces coelicolor (401 aa); T51772 acetyl-CoA C-acetyltransferase (EC 2.3.1.9) [validated] from Alcaligenes latus (392 aa); etc. Some homologies indicate ATA start codon. Contains PS00098 Thiolases acyl-enzyme intermediate signature, PS00737 Thiolases signature 2, and PS00099 Thiolases active site. BELONGS TO THE THIOLASE FAMILY. Protein product from Mb1358 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1358 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66927" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020610" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020615" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/Swiss-Prot:P66927" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99961.1" /translation="MIVAGARTPIGKLMGSLKDFSASELGAIAIKGALEKANVPASLV EYVIMGQVLTAGAGQMPARQAAVAAGIGWDVPALTINKMCLSGIDAIALADQLIRARE FDVVVAGGQESMTKAPHLLMNSRSGYKYGDVTVLDHMAYDGLHDVFTDQPMGALTEQR NDVDMFTRSEQDEYAAASHQKAAAAWKDGVFADEVIPVNIPQRTGDPLQFTEDEGIRA NTTAAALAGLKPAFRGDGTITAGSASQISDGAAAVVVMNQEKAQELGLTWLAEIGAHG VVAGPDSTLQSQPANAINKALDREGISVDQLDVVEINEAFAAVALASIRELGLNPQIV NVNGGAIAVGHPLGMSGTRITLHAALQLARRGSGVGVAALCGAGGQGDALILRAG" CDS 1489605..1490519 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1359" /product="POSSIBLE THIOREDOXIN" /note="Mb1359, -, len: 304 aa. Equivalent to Rv1324, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 304 aa overlap). Possible thioredoxin (EC 1.-.-.-), similar to several e.g. U00014|Q49716 TRXA from Mycobacterium leprae (255 aa), FASTA scores: opt: 1014, E(): 0, (69.7% identity in 228 aa overlap); THIO_RHOSH|P08058 TrxA from Rhodobacter sphaeroides (105 aa), FASTA scores: opt 196, E(): 1.9e-06, (33.0% identity in 103 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. Protein product from Mb1359 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1359 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64808" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:P64808" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99962.1" /translation="MTRPRPPLGPAMAGAVDLSGIKQRAQQNAAASTDADRALSTPSG VTEITEANFEDEVIVRSDEVPVVVLLWSPRSEVCVDLLDTLSGLAAAAKGKWSLASVN VDVAPRVAQIFGVQAVPTVVALAAGQPISSFQGLQPADQLSRWVDSLLSATAGKLKGA ASSEESTEVDPAVAQARQQLEDGDFVAARKSYQAILDANPGSVEAKAAIRQIEFLIRA TAQRPDAVSVADSLSDDIDAAFAAADVQVLNQDVSAAFERLIALVRRTSGEERTRVRT RLIELFELFDPADPEVVAGRRNLANALY" CDS complement(1490598..1492409) /codon_start=1 /transl_table=11 /gene="PE_PGRS24" /locus_tag="BQ2027_MB1360C" /product="pe-pgrs family protein pe_pgrs24" /note="Mb1360c, PE_PGRS24, len: 603 aa. Equivalent to Rv1325c, len: 603 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 603 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of ala-, gly-rich proteins, similar to many e.g. YQ04_MYCTU|P71933 hypothetical 63.1 kd glycine-rich protein (778 aa), FASTA scores: E(): 0, (52.3% identity in 724 aa overlap). Mb1360c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y056" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99963.1" /translation="MSFVIAAPETLVRAASDLANIGSTLGAANAAALGPTTELLAAGA DEVSAAIASLFAAHGQAYQAVSAQMSAFHAQFVQTFTAGAGAYASAEAAAAAPLEGLL NIVNTPTQLLLGRPLIGNGANGAPGTGQAGGAGGLLYGNGGAGGSGAPGQAGGPGGAA GLFGNGGAGGAGGDGPGNGAAGGAGGAGGLLFGSGGAGGPGGVGNTGTGGLGGDGGAA GLFGAGGIGGAGGPGFNGGAGGAGGRSGLFEVLAAGGAGGTGGLSVNGGTGGTGGTGG GGGLFSNGGAGGAGGFGVSGSAGGNGGTGGDGGIFTGNGGTGGAGGTGTGNQLVGGEG GAGGAGGNAGILFGAGGIGGTGGTGLGAPDPGGTGGKGGVGGIGGAGALFGPGGAGGT GGFGASSADQMAGGIGGSGGSGGAAKLIGDGGAGGTGGDSVRGAAGSGGTGGTGGLIG DGGAGGAGGTGIEFGSVGGAGGAGGNAAGLSGAGGAGGAGGFGETAGDGGAGGNAGLL NGDGGAGGAGGLGIAGDGGNGGKGGKAGMVGNGGDGGAGGASVVANGGVGGSGGNATL IGNGGNGGNGGVGSAPGKGGAGGTAGLLGLNGSPGLS" CDS complement(1492561..1494756) /codon_start=1 /transl_table=11 /gene="glgB" /locus_tag="BQ2027_MB1361C" /product="1,4-alpha-glucan branching enzyme glgb (glycogen branching enzyme)" /note="Mb1361c, glgB, len: 731 aa. Equivalent to Rv1326c, len: 731 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 731 aa overlap). Probable glgB, 1,4-alpha-glucan branching enzyme (EC 2.4.1.18), similar to others e.g. GLGB_ECOLI|P07762 Escherichia coli (728 aa), FASTA scores: opt: 2330, E(): 0, (48.7% identity in 719 aa overlap). Similar to other Mycobacterium tuberculosis putative alpha-glucan branching enzymes Rv1562c, Rv1563c. BELONGS TO FAMILY OF 13 GLYCOSYL HYDROLASES, ALSO KNOWN AS THE ALPHA-AMYLASE FAMILY. Protein product from Mb1361c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1361c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59816" /db_xref="InterPro:IPR004193" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR006048" /db_xref="InterPro:IPR006407" /db_xref="InterPro:IPR013780" /db_xref="InterPro:IPR013783" /db_xref="InterPro:IPR014756" /db_xref="InterPro:IPR017853" /db_xref="InterPro:IPR037439" /db_xref="UniProtKB/Swiss-Prot:P59816" /protein_id="SIT99964.1" /translation="MSRSEKLTGEHLAPEPAEMARLVAGTHHNPHGILGAHEYGDHTV IRAFRPHAVEVVALVGKDRFSLQHLDSGLFAVALPFVDLIDYRLQVTYEGCEPHTVAD AYRFLPTLGEVDLHLFAEGRHERLWEVLGAHPRSFTTADGVVSGVSFAVWAPNAKGVS LIGEFNGWNGHEAPMRVLGPSGVWELFWPDFPCDGLYKFRVHGADGVVTDRADPFAFG TEVPPQTASRVTSSDYTWGDDDWMAGRALRNPVNEAMSTYEVHLGSWRPGLSYRQLAR ELTDYIVDQGFTHVELLPVAEHPFAGSWGYQVTSYYAPTSRFGTPDDFRALVDALHQA GIGVIVDWVPAHFPKDAWALGRFDGTPLYEHSDPKRGEQLDWGTYVFDFGRPEVRNFL VANALYWLQEFHIDGLRVDAVASMLYLDYSRPEGGWTPNVHGGRENLEAVQFLQEMNA TAHKVAPGIVTIAEESTSWPGVTRPTNIGGLGFSMKWNMGWMHDTLDYVSRDPVYRSY HHHEMTFSMLYAFSENYVLPLSHDEVVHGKGTLWGRMPGNNHVKAAGLRSLLAYQWAH PGKQLLFMGQEFGQRAEWSEQRGLDWFQLDENGFSNGIQRLVRDINDIYRCHPALWSL DTTPEGYSWIDANDSANNVLSFMRYGSDGSVLACVFNFAGAEHRDYRLGLPRAGRWRE VLNTDATIYHGSGIGNLGGVDATDDPWHGRPASAVLVLPPTSALWLTPA" CDS complement(1494764..1496869) /codon_start=1 /transl_table=11 /gene="glgE" /locus_tag="BQ2027_MB1362C" /product="PROBABLE GLUCANASE GLGE" /note="Mb1362c, glgE, len: 701 aa. Equivalent to Rv1327c, len: 701 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 701 aa overlap). Probable glgE, glucanase, similar to AF172946|AF172946_2 putative glucanase GlgE from Mycobacterium smegmatis (697 aa), FASTA scores: opt: 3816, E(): 0, (78.5% identity in 692 aa overlap). Similar to putative alpha-amylases e.g. Q9L1K2 Streptomyces coelicolor (675 aa), FASTA scores: opt: 2243, E(): 7.4e-132, (54.2% identity in 684 aa overlap). Start changed since original submission (-36) based on similarity to GlgE of Mycobacterium smegmatis; previous start at position 1494531. Protein product from Mb1362c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1362c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63532" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR013780" /db_xref="InterPro:IPR013783" /db_xref="InterPro:IPR017853" /db_xref="InterPro:IPR021828" /db_xref="InterPro:IPR026585" /db_xref="UniProtKB/Swiss-Prot:P63532" /protein_id="SIT99965.1" /translation="MSGRAIGTETEWWVPGRVEIDDVAPVVSCGVYPAKAVVGEVVPV SAAVWREGHEAVAATLVVRYLGVRYPHLTDRPRARVLPTPSEPQQRVKPLLIPMTSGQ EPFVFHGQFTPDRVGLWTFRVDGWGDPIHTWRHGLIAKLDAGQGETELSNDLLVGAVL LERAATGVPRGLRDPLLAAAAALRTPGDPVTRTALALTPEIEELLADYPLRDLVTRGE QFGVWVDRPLARFGAWYEMFPRSTGGWDDDGNPVHGTFATAAAELPRIAGMGFDVVYL PPIHPIGKVHRKGRNNSPTAAPTDVGSPWAIGSDEGGHDTVHPSLGTIDDFDDFVSAA RDLGMEVALDLALQCAPDHPWAREHRQWFTELPDGTIAYAENPPKKYQDIYPLNFDND PEGLYDEVLRVVQHWVNHGVKFFRVDNPHTKPPNFWAWLIAQVKTVDPDVLFLSEAFT PPARQYGLAKLGFTQSYSYFTWRTTKWELTEFGNQIAELADYRRPNLFVNTPDILHAV LQHNGPGMFAIRAVLAATMSPAWGMYCGYELFEHRAVREGSEEYLDSEKYELRPRDFA SALDQGRSLQPFITRLNIIRRLHPAFQQLRTIHFHHVDNDALLAYSKFDPATGDCVLV VVTLNAFGPEEATLWLDMAALGMEDYDRFWVRDEITGEEYQWGQANYIRIDPARAVAH IINMPAVPYESRNTLLRRR" CDS 1497008..1499599 /codon_start=1 /transl_table=11 /gene="glgP" /locus_tag="BQ2027_MB1363" /product="PROBABLE GLYCOGEN PHOSPHORYLASE GLGP" /note="Mb1363, glgP, len: 863 aa. Equivalent to Rv1328, len: 863 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 863 aa overlap). Probable glgP, glycogen phosphorylase (EC 2.4.1.1), similar to many e.g. PHSG_HAEIN|P45180 glycogen phosphorylase from Haemophilus influenzae (821 aa), FASTA scores: E(): 6.9e-08, (25.6% identity in 675 aa overlap). BELONGS TO THE GLYCOGEN PHOSPHORYLASE FAMILY. Protein product from Mb1363 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1363 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U078" /db_xref="InterPro:IPR000811" /db_xref="InterPro:IPR011834" /db_xref="InterPro:IPR024517" /db_xref="InterPro:IPR035090" /db_xref="UniProtKB/Swiss-Prot:Q7U078" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99966.1" /translation="MKALRRFTVRAHLPERLAALDQLSTNLRWSWDKPTQDLFAAIDP ALWEQCGHDPVALLGAVNPARLDELALDAEFLGALDELAADLNDYLSRPLWYQEQQDA GVAAQALPTGIAYFSLEFGVAEVLPNYSGGLGILAGDHLKSASDLGVPLIAVGLYYRS GYFRQSLTADGWQHETYPSLDPQGLPLRLLTDANGDPVLVEVALGDNAVLRARIWVAQ VGRVPLLLLDSDIPENEHDLRNVTDRLYGGDQEHRIKQEILAGIGGVRAIRAYTAVEK LTPPEVFHMNEGHAGFLGIERIRELVTDAGLDFDTALTVVRSSTVFTTHTPVPAGIDR FPLEMVQRYVNDQRGDGRSRLLPGLPADRIVALGAEDDPAKFNMAHMGLRLAQRANGV SLLHGRVSRAMFNELWAGFDPDEVPIGSVTNGVHAPTWAAPQWLQLGRELAGSDSLRE PVVWQRLHQVDPAHLWWIRSQLRSMLVEDVRARLRQSWLERGATDAELGWIATAFDPN VLTVGFARRVPTYKRLTLMLRDPGRLEQLLLDEQRPIQLIVAGKSHPADDGGKALIQQ VVRFADRPQFRHRIAFLPNYDMSMARLLYWGCDVWLNNPLRPLEACGTSGMKSALNGG LNLSIRDGWWDEWYDGENGWEIPSADGVADENRRDDLEAGALYDLLAQAVAPKFYERD ERGVPQRWVEMVRHTLQTLGPKVLASRMVRDYVEHYYAPAAQSFRRTAGAQFDAAREL ADYRRRAEEAWPKIEIADVDSTGLPDTPLLGSQLTLTATVRLAGLRPNDVTVQGVLGR VDSGDVLMDPVTVEMAHTGTGDGGYEIFSTTTPLPLAGPVGYTVRVLPRHPMLAASNE LGLVTLA" CDS complement(1499639..1501633) /codon_start=1 /transl_table=11 /gene="dinG" /locus_tag="BQ2027_MB1364C" /product="probable atp-dependent helicase ding" /note="Mb1364c, dinG, len: 664 aa. Equivalent to Rv1329c, len: 664 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 664 aa overlap). Probable dinG, ATP-dependent helicase homolog, similar to several e.g. DING_HAEIN|P44680 probable ATP-dependent helicase ding from Haemophilus influenzae (640 aa), FASTA scores: opt: 685, E(): 2.3e-38, (32.8% identity in 644 aa overlap). Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb1364c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1364c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64315" /db_xref="InterPro:IPR006555" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR014013" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P64315" /protein_id="SIT99967.1" /translation="MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEH LVVQAGTGTGKSLAYLVPAIIRALCDDAPVVVSTATIALQRQLVDRDLPQLVDSLTNA LPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTALGRDVQRLTA WASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPFGSECFSERARGAAGLADV VVTNHALLAIDAVAESAVLPEHRLLVVDEAHELADRVTSVAAAELTSATLGMAARRIT RLVDPKVTQRLQAASATFSSAIHDARPGRIDCLDDEMATYLSALRDAASAARSAIDTG SDTTTASVRAEAGAVLTEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRV APLSVAELLATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQH AKSGILYVAAHLPPPGRDGSGSAEQLTEIAELITAAGGRTLGLFSSMRAARAATEAMR ERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDVPGPSLSLVLIDRIPFP RPDDPLLSARQRAVAARGGNGFMTVAASHAALLLAQGSGRLLRRVTDRGVVAVLDSRM ATARYGEFLRASLPPFWQTTNATQVRAALRRLARADAKAH" CDS complement(1501657..1503003) /codon_start=1 /transl_table=11 /gene="pncB1" /locus_tag="BQ2027_MB1365C" /product="nicotinic acid phosphoribosyltransferase pncB1" /note="Mb1365c, -, len: 448 aa. Equivalent to Rv1330c, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 448 aa overlap). Conserved hypothetical protein, similar to others and also several nicotinate phosphoribosyltransferases e.g. O32090 YUEK PROTEIN from Bacillus subtilis (490 aa), FASTA scores: E(): 8.6e-22, (37.9% identity in 369 aa overlap). Also similar to M. tuberculosis Rv0573c|MTV039.11c (38.0% identity in 437 aa overlap). Start changed since original submission based on similarity; previous start at position 1500740 (-61 aa). Protein product from Mb1365c detected using SWATH mass spectrometry. Mb1365c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY18" /db_xref="InterPro:IPR002638" /db_xref="InterPro:IPR006405" /db_xref="InterPro:IPR007229" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR036068" /db_xref="InterPro:IPR040727" /db_xref="UniProtKB/TrEMBL:A0A1R3XY18" /protein_id="SIT99968.1" /translation="MGPPPAARRREGEPDNQDPAGLLTDKYELTMLAAALRDGSANRP TTFEVFARRLPTGRRYGVVAGTGRLLEALPQFRFDADACELLAQFLDPATVRYLREFR FRGDIDGYAEGELYFPGSPVLSVRGSFAECVLLETLVLSIFNHDTAIASAAARMVSAA GGRPLIEMGSRRTHERAAVAAARAAYIAGFAASSNLAAQRRYGVPAHGTAAHAFTMLH AQHGGPTELAERAAFRAQVEALGPGTTLLVDTYDVTTGVANAVAAAGAELGAIRIDSG ELGVLARQAREQLDRLGATRTRIVVSGDLDEFSIAALRGEPVDSYGVGTSLVTGSGAP TANMVYKLVEVDGVPVQKRSSYKKSPGGRKEALRRSRATGTITEELVHPAGRPPVIVE PHRVLTLPLVRAGQPVADTSLAAARQLVASGLRSLPADGLKLAPGEPAIPTRTIPA" CDS 1503105..1503410 /codon_start=1 /transl_table=11 /gene="clpS" /locus_tag="BQ2027_MB1366" /product="ATP-dependent Clp protease adaptor protein ClpS" /note="Mb1366, -, len: 101 aa. Equivalent to Rv1331, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 101 aa overlap). Conserved hypothetical protein, highly similar to U00014|ML014 B1549_C2_207 from Mycobacterium leprae (94 aa), FASTA scores: opt: 573, E(): 2.9e-40, (90.3% identity in 93 aa overlap). Similar to AL096852|SCE19A_16 hypothetical protein from Streptomyces coelicolor (105 aa), FASTA scores: opt: 377, E(): 2.9e-22, (60.0% identity in 105 aa overlap). Protein product from Mb1366 detected using SWATH mass spectrometry. Mb1366 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67648" /db_xref="InterPro:IPR003769" /db_xref="InterPro:IPR014719" /db_xref="InterPro:IPR022935" /db_xref="UniProtKB/Swiss-Prot:P67648" /protein_id="SIT99969.1" /translation="MAVVSAPAKPGTTWQRESAPVDVTDRAWVTIVWDDPVNLMSYVT YVFQKLFGYSEPHATKLMLQVHNEGKAVVSAGSRESMEVDVSKLHAAGLWATMQQDR" CDS 1503370..1504026 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1367" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1367, -, len: 218 aa. Equivalent to Rv1332, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 218 aa overlap). Possible regulatory protein, high similarity to ML014|U00014 from Mycobacterium leprae B1549_C3_236 (222 aa), FASTA scores: opt: 1158, E(): 0, (75.6% identity in 221 aa overlap). Helix turn helix motif fram aa 8-29 (+3.03 SD). Protein product from Mb1367 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1367 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018561" /db_xref="UniProtKB/Swiss-Prot:P64810" /protein_id="SIT99970.1" /translation="MPPVCGRRCSRTGEIRGYSGSIVRRWKRVETRDGPRFRSSLAPH EAALLKNLAGAMIGLLDDRDSSSPSDELEEITGIKTGHAQRPGDPTLRRLLPDFYRPD DLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTVPDNGGRLELTESDAN AWIAAVNDLRLALGVMLEIGPRGPERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMGSR " CDS 1504043..1505077 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1368" /product="PROBABLE HYDROLASE" /note="Mb1368, -, len: 344 aa. Equivalent to Rv1333, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 344 aa overlap). Possible hydrolase (EC 3.-.-.-), similar to Q57326|D26094 endo-type 6-aminohexanoate oligomer hydrolase (355 aa), fasta scores: E(): 1.4e-10, (31.9% identity in 339 aa overlap). Equivalent to P53425|YD33_MYCLE HYPOTHETICAL 36.1 KD PROTEIN B154 Mycobacterium leprae (362 aa), FASTA scores: opt: 1735, E(): 0, (76.7% identity in 352 aa overlap). Protein product from Mb1368 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1368 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64812" /db_xref="InterPro:IPR005321" /db_xref="InterPro:IPR016117" /db_xref="UniProtKB/Swiss-Prot:P64812" /protein_id="SIT99971.1" /translation="MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGA VDCRGGAPGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVAMD SGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVGVGARAGALKG GVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADLVGEFALRAPPAEQIAALA QLSSPLGAFNTPFNTTIGVIACDAALSPAACRRIAIAAHDGLARTIRPAHTPLDGDTV FALATGAVAVPPEAGVPAALSPETQLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTY RDMFPGAFGS" CDS 1505085..1505525 /codon_start=1 /transl_table=11 /gene="mec" /locus_tag="BQ2027_MB1369" /product="possible hydrolase" /note="Mb1369, -, len: 146 aa. Equivalent to Rv1334, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 146 aa overlap). Conserved hypothetical protein, similar to AL096852|SCE19A_13 hypothetical protein from Streptomyces coelicolor (140 aa), Fasta scores: opt: 579, E(): 0, (65.0% identity in 140 aa overlap); and Q54330|M29166 MEC+ from Streptomyces kasugaensis (115 aa), FASTA scores; E(): 7.6e-33, (56.9% identity in 109 aa overlap). Protein product from Mb1369 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1369 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64814" /db_xref="InterPro:IPR000555" /db_xref="InterPro:IPR028090" /db_xref="InterPro:IPR037518" /db_xref="UniProtKB/Swiss-Prot:P64814" /protein_id="SIT99972.1" /translation="MLLRKGTVYVLVIRADLVNAMVAHARRDHPDEACGVLAGPEGSD RPERHIPMTNAERSPTFYRLDSGEQLKVWRAMEDADEVPVVIYHSHTATEAYPSRTDV KLATEPDAHYVLVSTRDPHRHELRSYRIVDGAVTEEPVNVVEQY" CDS 1505547..1505828 /codon_start=1 /transl_table=11 /gene="cysO" /locus_tag="BQ2027_MB1370" /product="sulfur carrier protein cysO" /note="Mb1370, -, len: 93 aa. Equivalent to Rv1335, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 93 aa overlap). 9.5 kDa culture filtrate antigen cfp10A (see citation below). Similar to hypothetical proteins from other organisms e.g. P74060|D90911 Synechocystis (109 aa), FASTA scores: E(): 2.3e-20, (49.5% identity in 93 aa overlap). Protein product from Mb1370 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1370 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A647" /db_xref="InterPro:IPR003749" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR016155" /db_xref="UniProtKB/Swiss-Prot:P0A647" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99973.1" /translation="MNVTVSIPTILRPHTGGQKSVSASGDTLGAVISDLEANYSGISE RLMDPSSPGKLHRFVNIYVNDEDVRFSGGLATAIADGDSVTILPAVAGG" CDS 1505838..1506809 /codon_start=1 /transl_table=11 /gene="cysM" /locus_tag="BQ2027_MB1371" /product="cysteine synthase b cysm (csase b) (o-phosphoserine sulfhydrylase b) (o-phosphoserine (thiol)-lyase b)" /note="Mb1371, cysM, len: 323 aa. Equivalent to Rv1336, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 323 aa overlap). Probable cysM, cysteine synthase B (EC 4.2.99.8), similar to many e.g. CYSM_ECOLI|P16703 Escherichia coli (303 aa), FASTA scores: opt: 720, E(): 4.6e-40, (41.1% identity in 302 aa overlap). Also similar to other Mycobacterium tuberculosis cysteine synthase subunits e.g. Rv1077, Rv2334, Rv0848, etc. Contains PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. Protein product from Mb1371 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1371 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63874" /db_xref="InterPro:IPR001216" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR005856" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/Swiss-Prot:P63874" /protein_id="SIT99974.1" /translation="MTRYDSLLQALGNTPLVGLQRLSPRWDDGRDGPHVRLWAKLEDR NPTGSIKDRPAVRMIEQAEADGLLRPGATILEPTSGNTGISLAMAARLKGYRLICVMP ENTSVERRQLLELYGAQIIFSAAEGGSNTAVATAKELAATNPSWVMLYQYGNPANTDS HYCGTGPELLADLPEITHFVAGLGTTGTLMGTGRFLREHVANVKIVAAEPRYGEGVYA LRNMDEGFVPELYDPEILTARYSVGAVDAVRRTRELVHTEGIFAGISTGAVLHAALGV GAGALAAGERADIALVVADAGWKYLSTGAYAGSLDDAETALEGQLWA" CDS 1506800..1507522 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1372" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb1372, -, len: 240 aa. Equivalent to Rv1337, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 240 aa overlap). Probable integral membrane protein. Highly similar to P53426 hypothetical protein B1549_C3_240 from M.leprae (251); and P74553|D90916 hypothetical protein from Synechocystis sp. (198 aa), FASTA scores: E(): 2.3e-25, (43.6% identity in 181 aa overlap). Mb1372 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64816" /db_xref="InterPro:IPR022764" /db_xref="InterPro:IPR035952" /db_xref="UniProtKB/Swiss-Prot:P64816" /protein_id="SIT99975.1" /translation="MGMTPRRKRRGGAVQITRPTGRPRTPTTQTTKRPRWVVGGTTIL TFVALLYLVELIDQLSGSRLDVNGIRPLKTDGLWGVIFAPLLHANWHHLMANTIPLLV LGFLMTLAGLSRFVWATAIIWILGGLGTWLIGNVGSSCGPTDHIGASGLIFGWLAFLL VFGLFVRKGWDIVIGLVVLFVYGGILLGAMPVLGQCGGVSWQGHLSGAVAGVVAAYLL SAPERKARALKRAGARSGHPKL" CDS 1507519..1508334 /codon_start=1 /transl_table=11 /gene="murI" /locus_tag="BQ2027_MB1373" /product="PROBABLE GLUTAMATE RACEMASE MURI" /note="Mb1373, murI, len: 271 aa. Equivalent to Rv1338, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 271 aa overlap). Probable murI, glutamate racemase (EC 5.1.1.3), highly similar to many e.g. MURI_MYCLE|P46705 (272 aa), FASTA scores: opt: 1559, E(): 0, (88.9% identity in 271 aa overlap). Contains PS00924 Aspartate and glutamate racemases signature 2. Protein product from Mb1373 detected using SWATH mass spectrometry. Mb1373 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63636" /db_xref="InterPro:IPR001920" /db_xref="InterPro:IPR004391" /db_xref="InterPro:IPR015942" /db_xref="InterPro:IPR018187" /db_xref="InterPro:IPR033134" /db_xref="UniProtKB/Swiss-Prot:P63636" /protein_id="SIT99976.1" /translation="MNSPLAPVGVFDSGVGGLTVARAIIDQLPDEDIVYVGDTGNGPY GPLTIPEIRAHALAIGDDLVGRGVKALVIACNSASSACLRDARERYQVPVVEVILPAV RRAVAATRNGRIGVIGTRATITSHAYQDAFAAARDTEITAVACPRFVDFVERGVTSGR QVLGLAQGYLEPLQRAEVDTLVLGCTHYPLLSGLIQLAMGENVTLVSSAEETAKEVVR VLTEIDLLRPHDAPPATRIFEATGDPEAFTKLAARFLGPVLGGVQPVHPSRIH" CDS 1508361..1509182 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1374" /product="Metal-dependent hydrolases of the beta-lactamase superfamily III" /note="Mb1374, -, len: 273 aa. Equivalent to Rv1339, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 273 aa overlap). Conserved hypothetical protein, highly similar to Y211_MYCLE|P50474 hypothetical protein b1549_c2_211 from Mycobacterium leprae (284 aa), FASTA scores: opt: 1672, E(): 0, (86.2% identity in 276 aa overlap). Also similar to AL096852|SCE19A.08 hypothetical protein from Streptomyces coelicolor (250 aa), FASTA scores: opt: 630, E(): 0, (42.2% identity in 256 aa overlap). Similar to Mycobacterium tuberculosis hypothetical proteins Rv3796, Rv2407. Protein product from Mb1374 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1374 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/Swiss-Prot:P66874" /protein_id="SIT99977.1" /translation="MRRCIPHRCIGHGTVVSVRITVLGCSGSVVGPDSPASGYLLRAP HTPPLVIDFGGGVLGALQRHADPASVHVLLSHLHADHCLDLPGLFVWRRYHPSRPSGK ALLYGPSDTWSRLGAASSPYGGEIDDCSDIFDVHHWADSEPVTLGALTIVPRLVAHPT ESFGLRITDPSGASLAYSGDTGICDQLVELARGVDVFLCEASWTHSPKHPPDLHLSGT EAGMVAAQAGVRELLLTHIPPWTSREDVISEAKAEFDGPVHAVVCDETFEVRRAG" CDS 1509199..1509978 /codon_start=1 /transl_table=11 /gene="rphA" /locus_tag="BQ2027_MB1375" /product="PROBABLE RIBONUCLEASE RPHA (RNase PH) (tRNA nucleotidyltransferase)" /note="Mb1375, rphA, len: 259 aa. Equivalent to Rv1340, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 259 aa overlap). Probable rphA, Ribonuclease ph (EC 2.7.7.56), highly similar to others e.g. RNPH_MYCLE|P37939 from Mycobacterium leprae (259 aa), FASTA scores: opt: 1524, E(): 0, (88.8% identity in 259 aa overlap). BELONGS TO THE RNASE PH FAMILY. Protein product from Mb1375 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1375 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U076" /db_xref="InterPro:IPR001247" /db_xref="InterPro:IPR002381" /db_xref="InterPro:IPR015847" /db_xref="InterPro:IPR018336" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR027408" /db_xref="InterPro:IPR036345" /db_xref="UniProtKB/Swiss-Prot:Q7U076" /protein_id="SIT99978.1" /translation="MSKREDGRLDHELRPVIITRGFTENPAGSVLIEFGHTKVLCTAS VTEGVPRWRKATGLGWLTAEYAMLPSATHSRSDRESVRGRLSGRTQEISRLISRSLRA CIDLAALGENTIAIDCDVLQADGGTRTAAITGAYVALADAVTYLSAAGKLSDPRPLSC AIAAVSVGVVDGRIRVDLPYEEDSRAEVDMNVVATDTGTLVEIQGTGEGATFARSTLD KLLDMALGACDTLFAAQRDALALPYPGVLPQGPPPPKAFGT" CDS 1510017..1510631 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1376" /product="Nucleoside 5-triphosphatase RdgB (dHAPTP, dITP, XTP-specific) (EC" /EC_number="3.6.1.66" /note="Mb1376, -, len: 204 aa. Equivalent to Rv1341, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 204 aa overlap). Conserved hypothetical protein, equivalent to ML014|U00014 hypothetical protein B1549_C2_213 from Mycobacterium leprae (285 aa), FASTA scores: opt: 1073, E(): 0, (83.0% identity in 206 aa overlap). Some similarity to P52061|YGGV_ECOLI HYPOTHETICAL PROTEIN yggV (197 aa), FASTA scores: opt: 521, E(): 7.9e-27, (46.0% identity in 200 aa overlap). Protein product from Mb1376 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1376 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64308" /db_xref="InterPro:IPR002637" /db_xref="InterPro:IPR020922" /db_xref="InterPro:IPR029001" /db_xref="UniProtKB/Swiss-Prot:P64308" /protein_id="SIT99979.1" /translation="MALVTKLLVASRNRKKLAELRRVLDGAGLSGLTLLSLGDVSPLP ETPETGVTFEDNALAKARDAFSATGLASVADDSGLEVAALGGMPGVLSARWSGRYGDD AANTALLLAQLCDVPDERRGAAFVSACALVSGSGEVVVRGEWPGTIAREPRGDGGFGY DPVFVPYGDDRTAAQLSPAEKDAVSHRGRALALLLPALRSLATG" CDS complement(1510628..1510990) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1377C" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1377c, -, len: 120 aa. Equivalent to Rv1342c, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 120 aa overlap). Conserved membrane protein. Highly similar to G466926|P54133 hypothetical protein B1549_F2_59 from Mycobacterium leprae (119 aa), FASTA scores, opt: 544, E(): 1.9e-29, (68.3 % identity in 120 aa overlap). Protein product from Mb1377c detected using SWATH mass spectrometry. Mb1377c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5E8" /db_xref="InterPro:IPR023845" /db_xref="UniProtKB/Swiss-Prot:P0A5E8" /protein_id="SIT99980.1" /translation="MTAPETPAAQHAEPAIAVERIRTALLGYRIMAWTTGLWLIALCY EIVVRYVVKVDNPPTWIGVVHGWVYFTYLLLTLNLAVKVRWPLGKTAGVLLAGTIPLL GIVVEHFQTKEIKARFGL" CDS complement(1510987..1511367) /codon_start=1 /transl_table=11 /gene="lprD" /locus_tag="BQ2027_MB1378C" /product="PROBABLE CONSERVED LIPOPROTEIN LPRD" /note="Mb1378c, lprD, len: 126 aa. Equivalent to Rv1343c, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 126 aa overlap). Probable lprD, conserved lipoprotein, highly similar to G466928 Mycobacterium leprae protein B1549_F3_106 (126 aa), FASTA scores, opt: 704, E(): 7.5e-36, (78.4 % identity in 125 aa overlap). Has N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein attachment site. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb1378c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY27" /db_xref="UniProtKB/TrEMBL:A0A1R3XY27" /protein_id="SIT99981.1" /translation="MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLG YALQWPLFAWFCVYAYRNFVRYEETPPQPPTGGAAAAIPAGLLPERPKPAQQPPDDPV LREYNAYLAELAKDDARKQNRTTA" CDS 1511412..1511732 /codon_start=1 /transl_table=11 /gene="mbtl" /locus_tag="BQ2027_MB1379" /product="acyl carrier protein (acp) mbtl" /note="Mb1379, -, len: 106 aa. Equivalent to Rv1344, len: 106 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 106 aa overlap). Possible acyl carrier protein, similar to others e.g. ACP_RHIME|P19372 Rhizobium meliloti (77 aa), FASTA scores: opt: 117, E(): 0.03, (29.9% identity in 67 aa overlap) and ACP_SYNY3|P20804 acyl carrier protein (acp) from Synechocystis sp (77 aa), FASTA scores: E(): 7.1e-05, (34.8% identity in 66 aa overlap). Also similar to Rv2244 and Rv0033 from Mycobacterium tuberculosis. Protein product from Mb1379 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1379 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63453" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/Swiss-Prot:P63453" /protein_id="SIT99982.1" /translation="MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNI DLTRVTPDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIAAK YRDE" CDS 1511725..1513290 /codon_start=1 /transl_table=11 /gene="mbtm" /locus_tag="BQ2027_MB1380" /product="probable fatty acyl-amp ligase mbtm" /note="Mb1380, fadD33, len: 521 aa. Equivalent to Rv1345, len: 521 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 521 aa overlap). Possible fadD33, polyketide synthase, similar to N-terminus of T34918 polyketide synthase from Streptomyces coelicolor (2297 aa); and PKSJ_BACSU|P40806 putative polyketide biosynthesis protein from Bacillus subtilis (557 aa), FASTA scores: opt: 537, E(): 8.2e-27, (27.1% identity in 468 aa overlap). Also similar to other proteins from Mycobacterium tuberculosis eg Rv1013|MTCI237.30|MTCY10G2.36c|pks16 PUTATIVE POLYKETIDE SYNTHASE (544 aa); etc. Protein product from Mb1380 detected using SWATH mass spectrometry. Mb1380 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4X9" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:P0A4X9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99983.1" /translation="MSELAAVLTRSMQASAGDLMVLDRETSLWCRHPWPEVHGLAESV AAWLLDHDRPAAVGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADATLT RFLGIGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVASEGPAVLQGTA GSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWLPLYHDMGLAFVLSAALAG APLWLAPTTAFTASPFRWLSWLSDSGATMTAAPNFAYNLIGKYARRVSEVDLGALRVT LNGGEPVDCDGLTRFAEAMAPFGFDAGAVLPSYGLAESTCAVTVPVPGIGLLADRVID GSGAHKHAVLGNPIPGMEVRISCGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPD DWFATGDLGYLGAGGLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGT GDRSTRPGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSLPRTSSGKL RRLAVRRSLEMAD" CDS 1513290..1514450 /codon_start=1 /transl_table=11 /gene="mbtn" /locus_tag="BQ2027_MB1381" /product="acyl-coa dehydrogenase mbtn" /note="Mb1381, fadE14, len: 386 aa. Equivalent to Rv1346, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 386 aa overlap). Possible fadE14, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. NP_251579.1|NC_002516 probable acyl-CoA dehydrogenase from Pseudomonas aeruginosa (386 aa); NP_036951.1|NM_012819|ACDL_RAT|P15650 acyl Coenzyme A dehydrogenase (long chain) from Rattus norvegicus (430 aa), FASTA scores: opt: 414, E(): 1.2e-18, (26.1% identity in 376 aa overlap); etc. Protein product from Mb1381 detected using SWATH mass spectrometry. Mb1381 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63432" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/Swiss-Prot:P63432" /protein_id="SIT99984.1" /translation="MTAGSDLDDFRGLLAKAFDERVVAWTAEAEAQERFPRQLIEHLG VCGVFDAKWATDARPDVGKLVELAFALGQLASAGIGVGVSLHDSAIAILRRFGKSDYL RDICDQAIRGAAVLCIGASEESGGSDLQIVETEIRSRDGGFEVRGVKKFVSLSPIADH IMVVARSVDHDPTSRHGNVAVVAVPAAQVSVQTPYRKVGAGPLDTAAVCIDTWVPADA LVARAGTGLAAISWGLAHERMSIAGQIAASCQRAIGITLARMMSRRQFGQTLFEHQAL RLRMADLQARVDLLRYALHGIAEQGRLELRTAAAVKVTAARLGEEVISECMHIFGGAG YLVDETTLGKWWRDMKLARVGGGTDEVLWELVAAGMTPDHDGYAAVVGASKA" CDS complement(1514417..1515049) /codon_start=1 /transl_table=11 /gene="mbtk" /locus_tag="BQ2027_MB1382C" /product="lysine n-acetyltransferase mbtk" /note="Mb1382c, -, len: 210 aa. Equivalent to Rv1347c, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 210 aa overlap). Conserved hypothetical protein, some similarity to the C-terminus of malonyl-coenzyme A carboxylases e.g. G545170 malonyl-coenzyme A carboxylase (417 aa), FASTA scores: opt: 392, E(): 4.9 e-20, (35.6% identity in 174 aa overlap). Protein product from Mb1382c detected using SWATH mass spectrometry. Mb1382c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64820" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR019432" /db_xref="UniProtKB/Swiss-Prot:P64820" /protein_id="SIT99985.1" /translation="MTKPTSAGQADDALVRLARERFDLPDQVRRLARPPVPSLEPPYG LRVAQLTDAEMLAEWMNRPHLAAAWEYDWPASRWRQHLNAQLEGTYSLPLIGSWHGTD GGYLELYWAAKDLISHYYDADPYDLGLHAAIADLSKVNRGFGPLLLPRIVASVFANEP RCRRIMFDPDHRNTATRRLCEWAGCKFLGEHDTTNRRMALYALEAPTTAA" tRNA 1515172..1515255 /locus_tag="BQ2027_LEUW" /product="tRNA-Leu" /note="leuW, len: 84 nt. Equivalent to leuW, len: 84 nt, from Mycobacterium tuberculosis strain H37RV, (100.0% identity in 84 nt overlap). tRNA-Leu, anticodon tag." CDS 1515491..1518070 /codon_start=1 /transl_table=11 /gene="irta" /locus_tag="BQ2027_MB1383" /product="iron-regulated transporter irta" /note="Mb1383, -, len: 859 aa. Equivalent to Rv1348, len: 859 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 859 aa overlap). Probable drugs-transport transmembrane protein ATP binding protein ABC transporter (see citation below), similar to HMT1_SCHPO|Q02592 heavy metal tolerance protein precursor from Schizosaccharomyces pombe (830 aa), FASTA scores: opt: 806, E(): 5.1e-39, (32.9% identity in 504 aa overlap); etc. Also similar to MTCY02B10.13 from Mycobacterium tuberculosis, FASTA score: (31.9% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1383 detected using SWATH mass spectrometry. Mb1383 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63392" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR007037" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR013113" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/Swiss-Prot:P63392" /protein_id="SIT99986.1" /translation="MARGLQGVMLRSFGARDHTATVIETISIAPHFVRVRMVSPTLFQ DAEAEPAAWLRFWFPDPNGSNTEFQRAYTISEADPAAGRFAVDVVLHDPAGPASSWAR TVKPGATIAVMSLMGSSRFDVPEEQPAGYLLIGDSASIPGMNGIIETVPNDVPIEMYL EQHDDNDTLIPLAKHPRLRVRWVMRRDEKSLAEAIENRDWSDWYAWATPEAAALKCVR VRLRDEFGFPKSEIHAQAYWNAGRAMGTHRATEPAATEPEVGAAPQPESAVPAPARGS WRAQAASRLLAPLKLPLVLSGVLAALVTLAQLAPFVLLVELSRLLVSGAGAHRLFTVG FAAVGLLGTGALLAAALTLWLHVIDARFARALRLRLLSKLSRLPLGWFTSRGSGSIKK LVTDDTLALHYLVTHAVPDAVAAVVAPVGVLVYLFVVDWRVALVLFGPVLVYLTITSS LTIQSGPRIVQAQRWAEKMNGEAGSYLEGQPVIRVFGAASSSFRRRLDEYIGFLVAWQ RPLAGKKTLMDLATRPATFLWLIAATGTLLVATHRMDPVNLLPFMFLGTTFGARLLGI AYGLGGLRTGLLAARHLQVTLDETELAVREHPREPLDGEAPATVVFDHVTFGYRPGVP VIQDVSLTLRPGTVTALVGPSGSGKSTLATLLARFHDVERGAIRVGGQDIRSLAADEL YTRVGFVLQEAQLVHGTAAENIALAVPDAPAEQVQVAAREAQIHDRVLRLPDGYDTVL GANSGLSGGERQRLTIARAILGDTPVLILDEATAFADPESEYLVQQALNRLTRDRTVL VIAHRLHTITRADQIVVLDHGRIVERGTHEELLAAGGRYCRLWDTGQGSRVAVAAAQD GTR" CDS 1518067..1519806 /codon_start=1 /transl_table=11 /gene="irtb" /locus_tag="BQ2027_MB1384" /product="iron-regulated transporter irtb" /note="Mb1384, -, len: 579 aa. Equivalent to Rv1349, len: 579 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 579 aa overlap). Probable drugs-transport transmembrane ATP binding protein ABC transporter (see citation below), most similar to YWJA_BACSU|P45861 hypothetical ABC transporter from Bacillus subtilis (575 aa), FASTA scores: opt: 721, E(): 1.8e-35, (28.9% identity in 567 aa overlap); etc. Also similar to MTCY02B10.12 from Mycobacterium tuberculosis, FASTA score: (31.9% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Mb1384 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63394" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="UniProtKB/Swiss-Prot:P63394" /protein_id="SIT99987.1" /translation="MIRTWIALVPNDHRARLIGFALLAFCSVVARAVGTVLLVPLMAA LFGEAPQRAWLWLGWLSAATVAGWVLDAVTARIGIELGFAVLNHTQHDVADRLPVVRL DWFTAENTATARQAIAATGPELVGLVVNLVTPLTSAILLPAVIALALLPISWQLGVAA LAGVPLLLGALWASAAFARRADTAADKANTALTERIIEFARTQQALRAARRVEPARSL VGNALASQHTATMRLLGMQIPGQLLFSIASQLALIVLAGTTAALTITGTLTVPEAIAL IVVMVRYLEPFTAVSELAPALESTRATLGRIGSVLTAPVMVAGSGTWRDGAVVPRIEF DDVAFGYDGGSGPVLDGVSFCLQPGTTTAIVGPSGCGKSTILALIAGLHQPTRGRVLI DGTDVATLDARAQQAVCSVVFQHPYLFHGTIRDNVFAADPGASDDQFAQAVRLARVDE LIARLPDGANTIVGEAGSALSGGERQRVSIARALLKAAPVLLVDEATSALDAENEAAV VDALAADPRSRTRVIVAHRLASIRHADRVLFVDDGRVVEDGSISELLTAGGRFSQFWR QQHEAAEWQILAE" CDS 1519935..1520678 /codon_start=1 /transl_table=11 /gene="fabG2" /locus_tag="BQ2027_MB1385" /product="probable 3-oxoacyl-[acyl-carrier protein] reductase fabg2 (3-ketoacyl-acyl carrier protein reductase)" /note="Mb1385, fabG2, len: 247 aa. Equivalent to Rv1350, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 247 aa overlap). Probable fabG2, 3-oxoacyl-[acyl-carrier protein] reductase (EC 1.1.1.100), highly similar to many e.g. NP_350157.1|NC_003030 3-ketoacyl-acyl carrier protein reductase from Clostridium acetobutylicum (249 aa); NP_229523.1|NC_000853 3-oxoacyl-(acyl carrier protein) reductase from Thermotoga maritima (246 aa); AAC44307.1|U59433 3-ketoacyl-acyl carrier protein reductase from Bacillus subtilis (246 aa); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1385 detected using SWATH mass spectrometry. Mb1385 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66782" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P66782" /protein_id="SIT99988.1" /translation="MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEAT EVAAKRLGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMRTM TEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMVGQTNYSAAKA GIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRIWDQKLAEVPMGRAGEPSE VASVAVFLASDLSSYMTGTVLDVTGGRFI" CDS 1520675..1521004 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1386" /product="HYPOTHETICAL PROTEIN" /note="Mb1386, -, len: 109 aa. Equivalent to Rv1351, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 109 aa overlap). Hypothetical unknown protein. Mb1386 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64822" /protein_id="SIT99989.1" /translation="MTPRSLPRYGNSSRRKSFPMHRPSNVATATRKKSSIGWVLLACS VAGCKGIDTTEFILGRAGAFELAVRAAQHRHRYLTMVNVGRAPPRRCRTVCMAATDTP RNIRLNG" CDS 1521207..1521578 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1387" /product="conserved protein" /note="Mb1387, -, len: 123 aa. Equivalent to Rv1352, len: 123 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 123 aa overlap). Conserved hypothetical protein, some similarity to Rv1906c|MTCY180.12 hypothetical protein from Mycobacterium tuberculosis (156 aa), FASTA scores: E(): 4e-05, (36.2% identity in 116 aa overlap). Protein product from Mb1387 detected using SWATH mass spectrometry. Mb1387 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64824" /protein_id="SIT99990.1" /translation="MARTLALRASAGLVAGMAMAAITLAPGARAETGEQFPGDGVFLV GTDIAPGTYRTEGPSNPLILVFGRVSELSTCSWSTHSAPEVSNENIVDTNTSMGPMSV VIPPTVAAFQTHNCKLWMRIS" CDS complement(1521644..1522429) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1388C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1388c, -, len: 261 aa. Equivalent to Rv1353c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 261 aa overlap). Probable transcriptional regulatory protein, similar to TER1_ECOLI|P03038 tetracycline repressor protein class A from Escherichia coli (216 aa), FASTA scores, opt: 231, E(): 1.6e-08, (31.3% identity in 211 aa overlap). Helix turn helix motif present at aa 3859 (+3.59 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /db_xref="GOA:P67435" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR004111" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/Swiss-Prot:P67435" /protein_id="SIT99991.1" /translation="MQTTPGKRQRRQRGSINPEDIISGAFELAQQVSIDNLSMPLLGK HLGVGVTSIYWYFRKKDDLLNAMTDRALSKYVFATPYIEAGDWRETLRNHARSMRKTF ADNPVLCDLILIRAALSPKTARLGAQEMEKAIANLVTAGLSLEDAFDIYSAVSVHVRG SVVLDRLSRKSQSAGSGPSAIEHPVAIDPATTPLLAHATGRGHRIGAPDETNFEYGLE CILDHAGRLIEQSSKAAGEVAVRRPTATADAPTPGARAKAVAR" CDS complement(1522449..1524320) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1389C" /product="Sensory box/GGDEF family protein" /note="Mb1389c, -, len: 623 aa. Equivalent to Rv1354c, len: 623 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 623 aa overlap). Conserved hypothetical protein, similar to many hypothetical proteins e.g. the C-terminus of G1001455 Synechocystis sp. (1244 aa), FASTA scores: opt: 933, E(): 0, (36.8% identity in 462 aa overlap); also similar to Rv1357c|MTCY02B10.21c (34.0% identity in 253 aa overlap)." /db_xref="InterPro:IPR000160" /db_xref="InterPro:IPR001633" /db_xref="InterPro:IPR003018" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR029787" /db_xref="InterPro:IPR035919" /db_xref="UniProtKB/Swiss-Prot:P64826" /protein_id="SIT99992.1" /translation="MCNDTATPQLEELVTTVANQLMTVDAATSAEVSQRVLAYLVEQL GVDVSFLRHNDRDRRATRLVAEWPPRLNIPDPDPLRLIYFADADPVFALCEHAKEPLV FRPEPATEDYQRLIEEARGVPVTSAAAVPLVSGEITTGLLGFIKFGDRKWHEAELNAL MTIATLFAQVQARVAAEARLRYLADHDDLTGLHNRRALLQHLDQRLAPGQPGPVAALF LDLDRLKAINDYLGHAAGDQFIHVFAQRIGDALVGESLIARLGGDEFVLIPASPMSAD AAQPLAERLRDQLKDHVAIGGEVLTRTVSIGVASGTPGQHTPSDLLRRADQAALAAKH AGGDSVAIFTADMSVSGELRNDIELHLRRGIESDALRLVYLPEVDLRTGDIVGTEALV RWQHPTRGLLAPGCFIPVAESINLAGELDRWVLRRACNEFSEWQSAGLGHDALLRINV SAGQLVTGGFVDFVADTIGQHGLDASSVCLEITENVVVQDLHTARATLARLKEVGVHI AIDDFGTGYSAISLLQTLPIDTLKIDKTFVRQLGTNTSDLVIVRGIMTLAEGFQLDVV AEGVETEAAARILLDQRCYRAQGFLFSRPVPGEAMRHMLSARRLPPTCIPATDPALS" CDS complement(1524329..1526476) /codon_start=1 /transl_table=11 /gene="moeY" /locus_tag="BQ2027_MB1390C" /product="POSSIBLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEY" /note="Mb1390c, moeY, len: 715 aa. Equivalent to Rv1355c, len: 715 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 715 aa overlap). Possible moeY, Molybdopterin biosynthesis protein, very weak similarity to MOEB_ECOLI|P12282 molybdopterin biosynthesis moeb protein (249 aa), FASTA scores, opt: 180, E(): 8.5e-05, (29.3% identity in 174 aa overlap). Mb1390c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y090" /db_xref="InterPro:IPR000415" /db_xref="InterPro:IPR000594" /db_xref="InterPro:IPR035985" /db_xref="UniProtKB/TrEMBL:A0A1R3Y090" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99993.1" /translation="MTIPHEGGSTGILVLRDDDHDDVLVLDRLRSDPSIEFVDRFAEQ LAGVRRLLPQPDPDLLEEAKRWAYYPWRRMVVAILGPRGFRAVRLDRNRHLITAEEQR ALHALRVGVVGLSAGHAIAYTLAAEGACGTLRLADFDKIELSNLNRVPVGVFDIGLNK AMIAARRIAELDPYLAVDLVTSGLSPESVDEFLDGLDVVIEECDSLDIKVILRQAACA RGVPVLMATSDRGLVDVERYDVEPGRPIFHGLLGDIDADKLCGLTTKDKVPHVLNILD CQELSARCAASMIEVDQTLWGWPQLAGDIWVGAATVAEAVRRIGLGEPLESGRVRVDV SAALDRLDQPPMPSRGNGWLLESVPPTAPAEPQPTSEIVAQAAIRAPSGGNVQPWHVV AKQHSLTIRLAPEHTSAMDIAFRGSAVAVGAAMFNARVAAAAHRVLGSVEFDESQPDS PLQATMHFGRGDDPSLAALYRPMLLRTTNRHHGMPGHVHPATVELLTNTAAAEGARLQ LLLSRNEIDRAATILAAADRIRYLTPRLHEEMMSELRWPGDPSLDAGIDVRSLELDSG ELRVLDILRRSDVVARLAQWDCGTALEDNTNERVSASSALAIVYVDGATLTDFARGGS AMQAVWIVAQQHGLAVQPMSPIFLYARGRHDLDQASPHFAAQLHRLQLDFRELVKPGK EGHEVLIFRLFHAPPPSVCSRRRVRHAIPEPHR" CDS complement(1526473..1527264) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1391C" /product="HYPOTHETICAL PROTEIN" /note="Mb1391c, -, len: 263 aa. Equivalent to Rv1356c, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 263 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/Swiss-Prot:P64828" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99994.1" /translation="MLIAGYLTDWRIMTTAQLRPIAPQKLHFSENLSVWVSDAQCRLV VSQPALDPTLWNTYLQGALRAYSKHGVECTLDLDAISDGSDTQLFFAAIDIGGDVVGG ARVIGPLRSADDSHAVVEWAGNPGLSAVRKMINDRAPFGVVEVKSGWVNSDAQRSDAI AAALARALPLSMSLLGVQFVMGTAAAHALDRWRSSGGVIAARIPAAAYPDERYRTKMI WWDRRTLANHAEPKQLSRMLVESRKLLRDVEALSATTAATAGAEQ" CDS complement(1527737..1528660) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1392C" /product="Sensory box/GGDEF family protein" /note="Mb1392c, -, len: 307 aa. Equivalent to Rv1357c, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 307 aa overlap). Conserved hypothetical protein, similar to members of the YEGE/YHJK/YJCC family e.g. Y4LL_RHISN|P55552 hypothetical protein Y4ll from Rhizobium sp. (827 aa), FASTA scores: E(): 0, (37.7% identity in 257 aa overlap), also similar to Rv1354c|MTCY02B10.18c (34.0% identity in 253 aa overlap). BELONGS TO THE YEGE/YHDA/YHJK/YJCC FAMILY." /db_xref="GOA:P64830" /db_xref="InterPro:IPR001633" /db_xref="InterPro:IPR035919" /db_xref="UniProtKB/Swiss-Prot:P64830" /protein_id="SIT99995.1" /translation="MDRCCQRATAFACALRPTKLIDYEEMFRGAMQARAMVANPDQWA DSDRDQVNTRHYLSTSMRVALDRGEFFLVYQPIIRLADNRIIGAEALLRWEHPTLGTL LPGRFIDRAENNGLMVPLTAFVLEQACRHVRSWRDHSTDPQPFVSVNVSASTICDPGF LVLVEGVLGETGLPAHALQLELAEDARLSRDEKAVTRLQELSALGVGIAIDDFGIGFS SLAYLPRLPVDVVKLGGKFIECLDGDIQARLANEQITRAMIDLGDKLGITVTAKLVET PSQAARLRAFGCKAAQGWHFAKALPVDFFRE" CDS 1529056..1532535 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1393" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1393, -, len: 1159 aa. Equivalent to Rv1358, len: 1159 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1159 aa overlap). Probable transcriptional regulatory protein, some similarity to AFSR_STRCO|P25941 regulatory protein afsr from Streptomyces coelicolor (993 aa), FASTA scores: opt: 210, E(): 5.5e-06, (27.5% identity in 739 aa overlap). Similar also to Rv0890C|MTCY31.18c (65.5% identity in 884 aa overlap) and to Rv1359|MTCY02B10.23 (43.7% identity in 197 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, PS00622 Bacterial regulatory proteins, luxR family signature. Helix turn helix motif present at aa 1116-1137, (Score 1291, +3.59 SD)." /db_xref="GOA:A0A1R3XYI6" /db_xref="InterPro:IPR000792" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029787" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3XYI6" /protein_id="SIT99996.1" /translation="MFLSAPAFRVEPTRSRHSALRWARHRRFADVPRWQMLRSLQIAD QIARTGHMPVRRLDLIWISARNAARRELDLGVAALVEAVTLLTADVEGSTRLSQTRLN ELAADYPTLDQNISEAVAAHGGVTRPVDQEVGSGLVVAFLRAGDAIACALELQLSTLA PMRPRVGVHTGDVRLRGDGTITGSAINESACLRDLAHEGQTLLSAATGDLVIDQLPAN TWLTDVGKYPLRGLHRQERVIQLCHRDLRNEFPPLRMSVGNRSSLPAQFTTFVGRDAQ INEVQEVLTNYRLVTLRGEGGVGKTRLAIQIAAASEFRDGLCFVDLAPIADPGMVSTT AAHALGLIDRPGSSTFDTLSHAIGNCHMLMVLDNCEHVLDACAELVVELLGACPELSI LATSRESIGVTGEVTWVVPSLSPANEAIQLFTERARLVQPNFEIVADNFAAVSEICRR LDGMPLAIELAAARLRSLSPNEIANSLDDRFRLLTGGARSTVQRQQTLRASMDWSYAL LTDTERILFRRLAVFVGGFDLTAASEVAAAGGDDFVERYSVLDQLTLLVDKSLVVAEE SRGSTRYRLLETVRQYALEKLNESEEIDGVRARHRTHYATMAAGLNVPASTDYEQRLL QAEAEIDNLRAAFTWSRGNGDIAAALQLASALQPLWSQGRMREGLAWLESILEREGDN HLVPAGVWARALAEKVILKAWPATSPMGAPDIVAQAHHALALARDAGDCAVLARALVA CGCGSGCDTEAAQPYFAEAIELARAINDEWTLSQIDYWQVVGIFISGQPIPLRAAAEQ ARELADSIGNRFVSRQCRLFACLAQIWEGDANGALALSRDVTAEAEVANDVVTKVLGL YVEAMALSYIGDSAARTIAGAALEAATELGGIYQDLGYGAITRAALAAGDVAAIEASE ASWDLRNQHNVVTAHHELMAQAALVRGDVTTARRFADEAVLASTGWHLMMALIARARV AIAQDELGKARDDAHAAVACGVGVQTYLAMPDALELLAGLAGEAGNHGQAVRLFGAAA AQRQRTGEVRHKIWDAGYEAATAALRDAMGDEDFTAAWAEGAAAPLDEAIAYAQRGRG ERKRPSNGWDALTPAEHKIVKLVTEGLVTKDIAARLFVSPRTVQTHLTHIYTKLDVTS RVQLVQEAAQHST" CDS 1532617..1533369 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1394" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1394, -, len: 250 aa. Equivalent to Rv1359, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 250 aa overlap). Probable transcriptional regulatory protein, similar to Rv0891c|MTCY31.19c, (48.5% identity in 204 aa overlap) and to Rv1358|MTCY02B10.22 (43.7% identity in 197 aa overlap)." /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3XY50" /protein_id="SIT99997.1" /translation="MFMALRAPMLERMNGLHTDDAPVNWLERRGGRLTSRRRVTLLHA GVEHPMRLWGVQSEAITAAMVLSRKVSAIIAGHCGVRLVDQGVGDGFVAAFAHASDAV ACALELHQAPLSPIVLRIGIHTGEAQLVDERIYAGATMNLAAELRDLAHGGQTVMSGA TEDAVLGRLPMRAWLIGLRPMEGSPERHNFPQSQRIAQLCHPNLRNTFPPLRMRIADA SGIPYVGRILVNVQVVPHWEGGCAAAGMVLAG" CDS 1533792..1534814 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1395" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1395, -, len: 340 aa. Equivalent to Rv1360, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 340 aa overlap). Probable oxidoreductase (EC 1.-.-.-). Similar to Q49598|G1002714 coenzyme F420-dependent n5, n10-methylenetetrahydromethanopterin reductase from Methanopyrus kandleri (349 aa), FASTA scores: opt: 264, E(): 4.4e-11, (26.3% identity in 323 aa overlap). Protein product from Mb1395 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1395 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64832" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019919" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/Swiss-Prot:P64832" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99998.1" /translation="MGGARRLKLDGSIPNQLARAADAAVALERNGFDGGWTAEASHDP FLPLLLAAEHTSRLELGTNIAVAFARNPMIVANVGWDLQTYSKGRLILGLGTQIRPHI EKRFSMPWGHPARRMREFVAALRAIWLAWQDGTKLCFEGEFYTHKIMTPMFTPEPQPY PVPRVFIAAVGEAMTEMCGEVADGHLGHPMVSKRYLTEVSVPALLRGLARSGRDRSAF EVSCEVMVATGADDAELAAACTATRKQIAFYGSTPAYRKVLEQHGWGDLHPELHRLSK LGEWEAMGGLIDDEMLGAFAVVGPVDTIAGALRNRCEGVVDRVLPIFMAASQECINAA LQDFRR" CDS complement(1534887..1536077) /codon_start=1 /transl_table=11 /gene="PPE19" /locus_tag="BQ2027_MB1396C" /standard_name="mtb39b" /product="ppe family protein ppe19" /note="Mb1396c, PPE19, len: 396 aa. Similar to Rv1361c, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (92.9% identity in 396 aa overlap). PPE19 (alternate gene name: mtb39b). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to many e.g. Rv1196|MTCI364.08|PPE18, FASTA scores: E(): 0, (84.9% identity in 397 aa overlap); MTCY274.23c (42.3% identity in 416 aa overlap); etc. Contains PS00501 Signal peptidases I serine active site. Note that expression of Rv1361c was demonstrated in lysates by immunodetection (see first citation below). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, the PPE19 gene contains nine substitutions compared to Mycobacterium tuberculosis strain H37Rv. Mb1396c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XY45" /experiment="experimental evidence, no additional details recorded" /protein_id="SIT99999.1" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAAS AFQSVVWGLTVGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYETAY GLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAAAMFGYAAATA TATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQLMNNVPQALQQLAQPTQGT TPSSKLGGLWKTVSPHLSPISNIVSMLNNHVSMTNSGVSMTNTLHSMLKGFAPAAAQA VETAAQNGVQAMSSLGSQLGSSLGSSGLGAGVAANLGRAASVGSLSVPPAWAAANQAV TPAARALPLTSLTSAAQTAPGHMLGGLPLGQLTNSGGGFGGVSNALRMPPRAYVMPRV PAAG" CDS complement(1536392..1537054) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1397C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1397c, -, len: 220 aa. Equivalent to Rv1362c, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 220 aa overlap). Possible membrane protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1362c|MTCY02B10.27c (25.9% identity in 216 aa overlap), Rv0177, Rv1973, Rv1972, etc. Protein product from Mb1397c detected using SWATH mass spectrometry. Mb1397c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY64" /db_xref="UniProtKB/TrEMBL:A0A1R3XY64" /protein_id="SIU00000.1" /translation="MTDDVRDVNTETTDATEVAEIDSAAGEAGDSATEAFDTDSATES TAQKGQRHRDLWRMQVTLKPVPVILILLMLISGGATGWLYLEQYRPDQQTDSGAARAA GAAASDGTIALLSYSPDTLDQDFATARSHLAGDFLSYYDQFTQQIVAPAAKQKSLKTT AKVVRAAVSELHPDSAVVLVFVDQSTTSKDSPNPSMAASSVMVTLAKVDGNWLITKFT PV" CDS complement(1537051..1537836) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1398C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1398c, -, len: 261 aa. Equivalent to Rv1363c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 261 aa overlap). Possible membrane protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv1362c|MTCY02B10.26c (25.9% identity in 216 aa overlap); Rv1972|MTV051.10 and Rv0177 etc. Protein product from Mb1398c detected using SWATH mass spectrometry. Mb1398c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY48" /db_xref="UniProtKB/TrEMBL:A0A1R3XY48" /protein_id="SIU00001.1" /translation="MAETTEPPSDAGTSQADAMALAAEAEAAEAEALAAAARARARAA RLKREALAMAPAEDENVPEEYADWEDAEDYDDYDDYEAADQEAARSASWRRRLRVRLP RLSTIAMAAAVVIICGFTGLSGYIVWQHHEATERQQRAAAFAAGAKQGVINMNSLDFN KAKEDVARVIDSSTGEFRDDFQQRAADFTKVVEQSKVVTEGTVNATAVESMNEHSAVV LVAATSRVTNSAGAKDEPRAWRLKVTVTEEGGQYKMSKVEFVP" CDS complement(1538127..1540088) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1399C" /product="possible sigma factor regulatory protein" /note="Mb1399c, -, len: 653 aa. Equivalent to Rv1364c, len: 653 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 653 aa overlap). Conserved hypothetical protein, some similarity to RSBU_BACSU|P40399 sigma factor sibg regulation protein from Bacillus subtilis (335 aa), FASTA scores: opt: 224, E(): 2e-07, (25.8% identity in 244 aa overlap). Protein product from Mb1399c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1399c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY52" /db_xref="InterPro:IPR000014" /db_xref="InterPro:IPR000700" /db_xref="InterPro:IPR001932" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR013656" /db_xref="InterPro:IPR035965" /db_xref="InterPro:IPR036457" /db_xref="InterPro:IPR036513" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3XY52" /protein_id="SIU00002.1" /translation="MAAEMDWDKTVGAAEDVRRIFEHIPAILVGLEGPDHRFVAVNAA YRGFSPLLDTVGQPAREVYPELEGQQIYEMLDRVYQTGEPQSGSEWRLQTDYDGSGVE ERYFDFVVTPRRRADGSIEGVQLIVDDVTSRVRARQAAEARVEELSERYRNVRDSATV MQQALLAASVPVVPGADIAAEYLVAAEDTAAGGDWFDALALGDRLVLVVGDVVGHGVE AAAVMSQLRTALRMQISTGYTVVEALEAVDRFHKQVPGSKSATMCVGSLDFTSGEFQY CTAGHPPPLLVTADASARYVEPTGAGPLGSGTGFPVRSEVLNIGDAILFYTDGLIERP GRPLEASTAEFADLAASIASGSGGFVLDAPARPIDRLCSDTLELLLRSTGYNDDVTLL AMQRRAPTPPLHITLDATINAARTVRAQLREWLAEIGADHSDIADIVHAISEFVENAV EHGYATDVSKGIVVEAALAGDGNVRASVIDRGQWKDHRDGARGRGRGLAMAEALVSEA RIMHGAGGTTATLTHRLSRPARFVTDTMVRRAAFQQTIDSEFVSLVESGRIVVRGDVD STTAATLDRQIAVESRSGIAPVTIDLSAVTHLGSAGVGALAAACDRARKQGTECVLVA PPGSPAHHVLSLVQLPVVGADTEDIFAQE" CDS complement(1540227..1540613) /codon_start=1 /transl_table=11 /gene="rsfA" /locus_tag="BQ2027_MB1400C" /product="anti-anti-sigma factor rsfA (anti-sigma factor antagonist) (regulator of sigma f a)" /note="Mb1400c, -, len: 128 aa. Equivalent to Rv1365c, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 128 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis proteins e.g. Rv2638|MTCY441.08 (148 aa), FASTA scores: E(): 0, (53.6% identity in 125 aa overlap); Rv1904, Rv3687c. Weak similarity to putative anti-anti-sigma factors e.g. AF134889|AF134889_1 Streptomyces coelicolor (113 aa), FASTA scores: opt: 137, E(): 0.004, (26.0% identity in 100 aa overlap). Mb1400c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y099" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR003658" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3Y099" /protein_id="SIU00003.1" /translation="MNPTQAGSFTTPVSNALKATIQHHDSAVIIHARGEIDAANEHTW QDLVTKAAAATTAPEPLVVNLNGLDFMGCCAVAVLAHKAERCRRRGVDVRLVSRDRAV ARIIHACGYGDVLPVHPTTESALSAT" CDS 1540834..1541655 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1401" /product="GTP pyrophosphokinase (EC" /EC_number="2.7.6.5" /note="Mb1401, -, len: 273 aa. Equivalent to Rv1366, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 273 aa overlap). Hypothetical unknown protein. Protein product from Mb1401 detected using SWATH mass spectrometry. Mb1401 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64834" /db_xref="InterPro:IPR007685" /db_xref="UniProtKB/Swiss-Prot:P64834" /protein_id="SIU00004.1" /translation="MVVALVGSAIVDLHSRPPWSNNAVRRLGVALRDGVDPPVDCPSY AEVMLWHADLAAEVQDRIEGRSWSASELLVTSRAKSQDTLLAKLRRRPYLQLNTIQDI AGVRIDADLLLGEQTRLAREIADHFGADQPAIHDLRDHPHAGYRAVHVWLRLPAGRVE IQIRTILQSLWANFYELLADAYGRGIRYDERPEQLAAGVVPAQLQELVGVMQDASADL AMHEAEWQHCAEIEYPGQRAMALGEASKNKATVLATTKFRLERAINEAESAGGGG" CDS 1541624..1541884 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1401A" /product="Conserved protein" /note="Mb1401A, len: 86 aa. Equivalent to Rv1366A len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Protein product from Mb1401A detected using SWATH mass spectrometry. Mb1401A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XY61" /protein_id="SIU00005.1" /translation="MRPSRQGEVGEVAGYVVEYNRRTHVRRITEFATPQEAMEHRLKL EAERTDSNIEIVALVSKSLGTLKQTHSRYFTGEELNVGNGAR" CDS complement(1541956..1543089) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1402C" /product="Beta-lactamase class C-like and penicillin binding proteins (PBPs) superfamily" /note="Mb1402c, -, len: 377 aa. Equivalent to Rv1367c, len: 377 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 377 aa overlap). Conserved hypothetical protein. Some similarity to penicillin binding proteins e.g. PBPE_BACSU|P32959 penicillin-binding protein 4* (pbp 4*) from Bacillus subtilis (451 aa), FASTA scores: E(): 6.9e-06, (23.6% identity in 373 aa overlap). Similar to AL031107|SC5A7.06 hypothetical protein from Streptomyces coelicolor (409 aa), FASTA scores: opt: 675, E(): 0, (40.4% identity in 339 aa overlap). Protein product from Mb1402c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1402c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ6" /protein_id="SIU00006.1" /translation="MVWQREKLLQVNEIGYRDIDAGVPMQRDTLFRIASMTKPVTVAA AMSLVDEGKLALRDPITRWAPELCKVAVLDDAAGPLDRTHPARRAILIEDLLTHTSGL AYGFSVSGPISRAYQRLPFGQGPDVWLAALATLPLVHQPGDRVTYSHAIDVLGVIVSR IEDAPLYQVIDERVLGPAGMTDTGFYVSADAQRRAATMYRLDEQDRLRHDVMGPPHVT PPSFCNAGGGLWSTADDYLRFVRMLLGDGTVDGVRVLSPESVRLMRTDRLTDEQKRHS FLGAPFWVGRGFGLNLSVVTDPAKSRPLFGPGGLGTFSWPGAYGTWWQADPSADLILL YLIQHCPDLSVDAAAAVAGNPSLAKLRTAQPKFVRRTYRALGL" CDS 1543464..1544249 /codon_start=1 /transl_table=11 /gene="lprF" /locus_tag="BQ2027_MB1403" /product="probable conserved lipoprotein lprf" /note="Mb1403, lprF, len: 261 aa. Equivalent to Rv1368, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 261 aa overlap). Probable lipoprotein lprF, similar to Mycobacterium tuberculosis hypothetical lipoproteins e.g. Rv1270c|Y08C_MYCTU|Q11049 hypothetical 26.4 kd protein cy50.12. (257 aa), FASTA scores: opt: 286, E(): 5.3e-11, (26.3% identity in 270 aa overlap), also Rv1411c|MTCY21B4.28c, (32.8% identity in 253 aa overlap) and Rv2945c. Contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). BELONGS TO THE LPPX/LPRAFG FAMILY OF LIPOPROTEINS. Protein product from Mb1403 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1403 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65315" /db_xref="InterPro:IPR009830" /db_xref="InterPro:IPR029046" /db_xref="PDB:4QA8" /db_xref="UniProtKB/Swiss-Prot:P65315" /protein_id="SIU00007.1" /translation="MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPT TASSPSPGSPSPEAQQILQDSSKATKGLHSVHVVVTVNNLSTLPFESVDADVTNQPQG NGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIYDPGIILDKDR GLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPIVPQLGKGGGRLPITLWIV DTNASTPAPAANLVRMVIDKDQGNVDITLSNWGAPVTIPNPAG" CDS 1544445..1545398 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1404" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb1404 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing.,Mb1404, -, len: 317 aa. Similar to 5' end of Rv1371, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (90.9% identity in 166 aa overlap). Probable membrane protein. Weak similarity to delta 5 fatty acid desaturases e.g. AB022097|AB022097_1 Dictyostelium discoideum (467 aa), FASTA score: opt: 173, E(): 0.00052, (22.4% identity in 438 aa overlap); and Homo sapiens. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1371 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-a) splits Rv1371 into 2 parts, Mb1404 and Mb1405, with the latter being the more likely product." /db_xref="InterPro:IPR001199" /db_xref="InterPro:IPR036400" /db_xref="UniProtKB/TrEMBL:A0A1R3XY65" /protein_id="SIU00008.1" /translation="MTNDLPDVRERDGGPRPAPPAGGPRLSDVWVYNGRAYDLSEWIS KHPGGAFFIGRTKNRDITAIVKSYHRDPAIVERILQRRYALGRDATPRDIHPKHNAPA FLFKDDFNSWRDTPKYRFDDPNDLLHRVKARLAEPALAARIKRMDTLYQRHRCSTGRG LFRGSGCAVGGTELDAAVGLRDCDGSAAQFVGRVRSLRTAPRATRPQPGFQQCLRSQL CGLVLSHRRRTHPAAPPVYPERGGHQEERVHDDDAATVVVSRSRTYDSQIWPHAQRHG DPDRRRLQDHAQGRCRGILRKLACRASTLPWIGRGALASGE" CDS 1545400..1545915 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1405" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb1405, -, len: 331 aa. Equivalent to 3' end of Rv1371, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 331 aa overlap). Probable membrane protein. Weak similarity to delta 5 fatty acid desaturases e.g. AB022097|AB022097_1 Dictyostelium discoideum (467 aa), FASTA score: opt: 173, E(): 0.00052, (22.4% identity in 438 aa overlap); and Homo sapiens. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1371 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-a) splits Rv1371 into 2 parts, Mb1404 and Mb1405, with the latter being the more likely product." /db_xref="UniProtKB/TrEMBL:A0A1R3XY54" /protein_id="SIU00009.1" /translation="MVVFAIAGDFWPWALQFVATLWVSTFLVVASHEFEDDTQGGAVN GEDWGIDQLEHANDLTVIGNRYVDCFLSAGLSSHRVHHVLPFQRSGFANIVTEDVLRE EAAKFGVEWLPAKGFITDRLPRLCRKYLLTPSRQAKERHWGFVREHCSPAALKASASY VVAGFVGIGSV" CDS 1545912..1547093 /codon_start=1 /transl_table=11 /gene="pks18" /locus_tag="BQ2027_MB1406" /product="Naringenin-chalcone synthase (EC" /EC_number="2.3.1.74" /note="Mb1406, -, len: 393 aa. Equivalent to Rv1372, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 393 aa overlap). Conserved hypothetical protein, similar to several chalcone synthases e.g. CHS2_GERHY|P48391 chalcone synthase 2 from gerbra hybrid (402 aa), FASTA scores: opt: 511, E(): 7e-26, (28.4% identity in 380 aa overlap). Also similar to M. tuberculosis hypothetical chalcone synthases, Rv1665, Rv1660. Mb1406 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U064" /db_xref="InterPro:IPR001099" /db_xref="InterPro:IPR011141" /db_xref="InterPro:IPR012328" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/Swiss-Prot:Q7U064" /protein_id="SIU00010.1" /translation="MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPR RVVNQSDAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPATI RDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPGVDVAIVKELG LSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVVCIELCSVNAVFADDINDV VIHSLFGDGCAALVIGASQVQEKLEPGKVVVRSSFSQLLDNTEDGIVLGVNHNGITCE LSENLPGYIFSGVAPVVTEMLWDNGLQISDIDLWAIHPGGPKIIEQSVRSLGISAELA AQSWDVLARFGNMLSVSLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIR R" CDS 1547099..1547896 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1407" /product="GLYCOLIPID SULFOTRANSFERASE [FIRST PART]" /note="Mb1407, -, len: 265 aa. Equivalent to the 5' end of Rv1373, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 154 aa overlap). Glycolipid sulfotransferase (EC 2.8.2.-) (see citation below); slight similarity to sulfotransferases e.g. SUOE_CAVPO|P49887 estrogen sulfotransferase from Cavia porcellus (Guinea pig) (EC 2.8.2.4) (296 aa), FASTA scores, opt: 165, E():0.00054, (24.5% identity in 294 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1373 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits Rv1373 into 2 parts, Mb1407 and Mb1408. Mb1407 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XY57" /db_xref="InterPro:IPR000863" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XY57" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00011.1" /translation="MNSEHPMTDRVVYRSLMADNLRWDALQLRDGDIIISAPSKSGLT WTQRLVSLLVFDGPDLPGPLSTVSPWLDQTIRPIEEVVATLDAQQHRRFIKTHTPLDG LVLDDRVSYICVGRDPRDAAVSMLYQSANMNEDRMRILHEAVVPFHERIAPPVCGTRS CAQPDRGVPGLDGGAESASPWHRFHTSEGDRHSGQHPAPARHGMGPPSPTQRGLVSLR RLPGGLGGRAAPAGKGPRYRRDPRSSPGPGAVRHAGCDALPRVRNRS" CDS 1547562..1548080 /pseudo /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1408" /EC_number="2.8.2.-" /note="Mb1408, -, len: 172 aa. Equivalent to the 3' end of Rv1373, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 172 aa overlap). Glycolipid sulfotransferase (EC 2.8.2.-) (see citation below); slight similarity to sulfotransferases e.g. SUOE_CAVPO|P49887 estrogen sulfotransferase from Cavia porcellus (Guinea pig) (EC 2.8.2.4) (296 aa), FASTA scores, opt: 165, E():0.00054, (24.5% identity in 294 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1373 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits Rv1373 into 2 parts, Mb1407 and Mb1408.;GLYCOLIPID SULFOTRANSFERASE [SECOND PART]" /experiment="experimental evidence, no additional details recorded" CDS complement(1548160..1548618) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1409C" /product="HYPOTHETICAL PROTEIN" /note="Mb1409c, -, len: 152 aa. Equivalent to Rv1374c, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 152 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XY59" /protein_id="SIU00012.1" /translation="MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPAR PNAPIGARSFAVGRKICRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREV GNYAQRRVGRFAFFEQTFVRHALTPRCSRTDSKASYTQLNRICKFPPHWV" CDS 1548920..1550239 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1410" /product="YcaO-like protein" /note="Mb1410, -, len: 439 aa. Equivalent to Rv1375, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 439 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from several organisms e.g. Q52871|U39409 Rhizobium leguminosarum (420 aa), FASTA scores: E(): 2e-30, (34.4% identity in 378 aa overlap). Mb1410 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003776" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0B5" /protein_id="SIU00013.1" /translation="MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWP SRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITRVADVTWLDCLGIPTVQ AVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYD PAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDT TGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDA GDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSPAITEAAQSRITA ISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATA VANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE" CDS 1550236..1551729 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1411" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1411, -, len: 497 aa. Equivalent to Rv1376, len: 497 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 497 aa overlap). Conserved hypothetical protein, some similarity to hypothetical proteins from several organisms e.g. Q52872|U39409 Rhizobium leguminosarum (247 aa), FASTA scores: E(): 2.1e-12, (34.7% identity in 219 aa overlap). Mb1411 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012924" /db_xref="InterPro:IPR016845" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ59" /protein_id="SIU00014.1" /translation="MTACGRIVVTAGPTISAADIRSVVPDAEVAPPIAFGQALSYDLR SGDTLLIVDGLFFQQPSVRHKELLTLMADGVRVVGSSSMGALRAAELHPFGMEGYGWV FESYRDGVLEADDEVGVVHGDADDGYPVFVDALVNMRHTLARAVATGVVCSELAERII ETARATPFTMRTWARLLSEVGAPDQRGLAAQLRSLRVDVKHADALLALRQLGQRPRVE PLRPGPPPTVWSRRWRQPWAPPTSVAASADHGESFVDVTDLEVLSFLSVSSVDYWAYR PALQQVAAWYWTLKHPEQSGSVGERAARAVAEVASEGYGRALEFIAYRYALATGIIDE TGFPEAVAAHWLTTEERHGLGNDPISISARVITRTLFVVRLLPAIDHFLDLLRKDSRL PRWRAMAAHALCKRDDLARQKPHLNLGRPDPTQLKRLFGARWGTQVNRIELARRGLMT EDAFYAAATPFAVAAVDDQLPRIEVGTLGPAPLSADVPERHFDFGSV" CDS complement(1551667..1552305) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1412C" /product="PUTATIVE TRANSFERASE" /note="Mb1412c, -, len: 212 aa. Equivalent to Rv1377c, len: 212 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 212 aa overlap). Putative transferase (EC 2.-.-.-), similar to YQEM_BACSU|P54458 hypothetical 28.3 kd protein from Bacillus subtilis (247 aa), FASTA scores: opt: 221, E(): 7.6e-08, (30.6% identity in 144 aa overlap); some similarity to methyltransferases, also similar to Mycobacterium tuberculosis hypothetical proteins Rv0560c, Rv3699, and Rv2675c (~ 39.1% identity in 197 aa overlap). Protein product from Mb1412c detected using shotgun mass spectrometry. Mb1412c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY67" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3XY67" /protein_id="SIU00015.1" /translation="MPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNVIGWHTGGW VHGDVLDIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRRASDAGVDVKFAVGDAT KLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVHRATRPGATLLLSCFSNAMPPDEEW PRSTVSEQTLRDVLGGAGWDIESLEPATVRRELDGTEVEMAFWNVRAQRRGS" CDS complement(1552316..1553743) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1413C" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb1413c, -, len: 475 aa. Equivalent to Rv1378c, len: 475 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 475 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv3074|MTCY22D7.07C (424 aa), FASTA scores: E(): 0, (73.0% identity in 429 aa overlap). Mb1413c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK4" /protein_id="SIU00016.1" /translation="MGNLDLLLHLSGRIVKGCRPLGSVALARCGPAVRWPWWPRPAIL EHMFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAG VPARRRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATL IVRESACLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARA ETERTVTIRPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVER VTGQPAEAAQPVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSR ATLRRLYRHPRSGALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPH HRGGPTTATNGLGSCERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPPL PGPLEIDVSQVEARIGVALTHLHAA" CDS 1553742..1554323 /codon_start=1 /transl_table=11 /gene="pyrR" /locus_tag="BQ2027_MB1414" /product="PROBABLE PYRIMIDINE OPERON REGULATORY PROTEIN PYRR" /note="Mb1414, pyrR, len: 193 aa. Equivalent to Rv1379, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 193 aa overlap). Probable pyrR, pyrimidine operon regulatory protein, similar to PYRR_BACCL|P41007 pyrimidine operon regulatory protein from Bacillus caldolyticus (179 aa), FASTA scores: opt: 544, E(): 1.1e-30, (54.2% identity in 179 aa overlap). Protein product from Mb1414 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1414 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65942" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR023050" /db_xref="InterPro:IPR029057" /db_xref="UniProtKB/Swiss-Prot:P65942" /protein_id="SIU00017.1" /translation="MGAAGDAAIGRESRELMSAADVGRTISRIAHQIIEKTALDDPVG PDAPRVVLLGIPTRGVTLANRLAGNITEYSGIHVGHGALDITLYRDDLMIKPPRPLAS TSIPAGGIDDALVILVDDVLYSGRSVRSALDALRDVGRPRAVQLAVLVDRGHRELPLR ADYVGKNVPTSRSESVHVRLREHDGRDGVVISR" CDS 1554320..1555279 /codon_start=1 /transl_table=11 /gene="pyrB" /locus_tag="BQ2027_MB1415" /product="PROBABLE ASPARTATE CARBAMOYLTRANSFERASE PYRB (ATCase) (Aspartate transcarbamylase)" /note="Mb1415, pyrB, len: 319 aa. Equivalent to Rv1380, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 319 aa overlap). Probable pyrB, aspartate carbamoyltransferase (EC 2.1.3.2), similar to many e.g. PYRB_BACCL|P41008 aspartate carbamoyltransferase from Bacillus caldolyticus (308 aa), FASTA scores, opt: 639, E(): 7.3e-36, (39.5% identity in 311 aa overlap). Contains PS00097 Aspartate and ornithine carbamoyltransferases signature. BELONGS TO THE ATCASES/OTCASES FAMILY. Protein product from Mb1415 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1415 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65614" /db_xref="InterPro:IPR002082" /db_xref="InterPro:IPR006130" /db_xref="InterPro:IPR006131" /db_xref="InterPro:IPR006132" /db_xref="InterPro:IPR036901" /db_xref="UniProtKB/Swiss-Prot:P65614" /protein_id="SIU00018.1" /translation="MTPRHLLTAADLSRDDATAILDDADRFAQALVGRDIKKLPTLRG RTVVTMFYENSTRTRVSFEVAGKWMSADVINVSAAGSSVGKGESLRDTALTLRAAGAD ALIIRHPASGAAHLLAQWTGAHNDGPAVINAGDGTHEHPTQALLDALTIRQRLGGIEG RRIVIVGDILHSRVARSNVMLLDTLGAEVVLVAPPTLLPVGVTGWPATVSHDFDAELP AADAVLMLRVQAERMNGGFFPSVREYSVRYGLTERRQAMLPGHAVVLHPGPMVRGMEI TSSVADSSQSAVLQQVSNGVQVRMAVLFHVLVGAQDAGKEGAA" CDS 1555276..1556568 /codon_start=1 /transl_table=11 /gene="pyrC" /locus_tag="BQ2027_MB1416" /product="PROBABLE DIHYDROOROTASE PYRC (DHOase)" /note="Mb1416, pyrC, len: 430 aa. Equivalent to Rv1381, len: 430 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 430 aa overlap). Probable pyrC, dihydroorotase (EC 3.5.2.3), similar to many e.g. PYRC_BACCL|P46538 (40.5% identity in 395 aa overlap). Contains PS00483 Dihydroorotase signature 2. BELONGS TO THE DHOASE FAMILY. SUBFAMILY 2. Protein product from Mb1416 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1416 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U057" /db_xref="InterPro:IPR002195" /db_xref="InterPro:IPR004722" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR011059" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:Q7U057" /protein_id="SIU00019.1" /translation="MSVLIRGVRPYGEGERVDVLVDDGQIAQIGPDLAIPDTADVIDA TGHVLLPGFVDLHTHLREPGREYAEDIETGSAAAALGGYTAVFAMANTNPVADSPVVT DHVWHRGQQVGLVDVHPVGAVTVGLAGAELTEMGMMNAGAAQVRMFSDDGVCVHDPLI MRRALEYATGLGVLIAQHAEEPRLTVGAFAHEGPMAARLGLAGWPRAAEESIVARDAL LARDAGARVHICHASAAGTVEILKWAKDQGISITAEVTPHHLLLDDARLASYDGVNRV NPPLREASDAVALRQALADGIIDCVATDHAPHAEHEKCVEFAAARPGMLGLQTALSVV VQTMVAPGLLSWRDIARVMSENPACIARLPDQGRPLEVGEPANLTVVDPDATWTVTGA DLASRSANTPFESMSLPATVTATLLRGKVTARDGKIRA" CDS 1556565..1557062 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1417" /product="PROBABLE EXPORT OR MEMBRANE PROTEIN" /note="Mb1417, -, len: 165 aa. Equivalent to Rv1382, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 165 aa overlap). Possible exported or membrane protein, hydrophobic domain at N-terminus. Protein product from Mb1417 detected using SWATH mass spectrometry. Mb1417 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY77" /db_xref="UniProtKB/TrEMBL:A0A1R3XY77" /protein_id="SIU00020.1" /translation="MNSGTLAGSLIFAAVLVMLIAVLARLMMRGWRRRSERQAELLGD LPDVPEHVSSATVTTRGLYVGATLSPAWNERVTVGDLGYRSKAVLTRYPSGIMVERAR AQPIWIPTESIAAIRMERGVAGKVVAGIGILAIRWRLPSGTEIDVGFRADNRDEYQEW LEEPV" CDS 1557059..1558189 /codon_start=1 /transl_table=11 /gene="carA" /locus_tag="BQ2027_MB1418" /product="PROBABLE CARBAMOYL-PHOSPHATE SYNTHASE SMALL CHAIN CARA (Carbamoyl-phosphate synthetase glutamine chain)" /note="Mb1418, carA, len: 376 aa. Equivalent to Rv1383, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 376 aa overlap). Probable carA, Carbamoyl-phosphate synthase small chain (EC 6.3.5.5), similar to many e.g. CARA_ECOLI|P00907 carbamoyl-phosphate synthase small chain from Escherichia coli (382 aa), FASTA scores: opt: 796, E(): 0, (45.5% identity in 382 aa overlap). Contains PS00442 Glutamine amidotransferases class-I active site. THE GATASE DOMAIN BELONGS TO TYPE-1 GLUTAMINE AMIDOTRANSFERASES. SUBUNIT: COMPOSED OF TWO CHAINS; THE SMALL (OR GLUTAMINE) CHAIN PROMOTES THE HYDROLYSIS OF GLUTAMINE TO AMMONIA, WHICH IS USED BY THE LARGE (OR AMMONIA) CHAIN TO SYNTHESIZE CARBAMOYL PHOSPHATE. Protein product from Mb1418 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1418 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U055" /db_xref="InterPro:IPR002474" /db_xref="InterPro:IPR006274" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR029062" /db_xref="InterPro:IPR035686" /db_xref="InterPro:IPR036480" /db_xref="UniProtKB/Swiss-Prot:Q7U055" /protein_id="SIU00021.1" /translation="MSKAVLVLEDGRVFTGRPFGATGQALGEAVFSTGMSGYQETLTD PSYHRQIVVATAPQIGNTGWNGEDSESRGERIWVAGYAVRDPSPRASNWRATGTLEDE LIRQRIVGIAGIDTRGVVRHLRSRGSMKAGVFSDGALAEPADLIARVRAQQSMLGADL AGEVSTAEPYVVEPDGPPGVSRFTVAALDLGIKTNTPRNFARRGIRCHVLPASTTFEQ IAELNPHGVFLSNGPGDPATADHVVALTREVLGAGIPLFGICFGNQILGRALGLSTYK MVFGHRGINIPVVDHATGRVAVTAQNHGFALQGEAGQSFATPFGPAVVSHTCANDGVV EGVKLVDGRAFSVQYHPEAAAGPHDAEYLFDQFVELMAGEGR" CDS 1558189..1561536 /codon_start=1 /transl_table=11 /gene="carB" /locus_tag="BQ2027_MB1419" /product="PROBABLE CARBAMOYL-PHOSPHATE SYNTHASE LARGE CHAIN CARB (Carbamoyl-phosphate synthetase ammonia chain)" /note="Mb1419, carB, len: 1115 aa. Equivalent to Rv1384, len: 1115 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1115 aa overlap). Probable carB, Carbamoyl-phosphate synthase large chain (EC 6.3.5.5), similar to many e.g. CARB_ECOLI|P00968 E. coli (1072 aa), FASTA scores: E(): 0, (52.3% identity in 1118 aa overlap). Contains two PS00867 Carbamoyl-phosphate synthase subdomain signature 2 and PS00866 Carbamoyl-phosphatesynthase subdomain signature 1. SUBUNIT: COMPOSED OF TWO CHAINS; THE SMALL (OR GLUTAMINE) CHAIN PROMOTES THE HYDROLYSIS OF GLUTAMINE TO AMMONIA, WHICH IS USED BY THE LARGE (OR AMMONIA) CHAIN TO SYNTHESIZE CARBAMOYL PHOSPHATE. Protein product from Mb1419 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1419 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U054" /db_xref="InterPro:IPR005479" /db_xref="InterPro:IPR005480" /db_xref="InterPro:IPR005483" /db_xref="InterPro:IPR006275" /db_xref="InterPro:IPR011607" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR016185" /db_xref="InterPro:IPR033937" /db_xref="InterPro:IPR036897" /db_xref="InterPro:IPR036914" /db_xref="UniProtKB/Swiss-Prot:Q7U054" /protein_id="SIU00022.1" /translation="MPRRTDLHHVLVIGSGPIVIGQACEFDYSGTQACRVLRAEGLQV SLVNSNPATIMTDPEFADHTYVEPITPAFVERVIAQQAERGNKIDALLATLGGQTALN TAVALYESGVLEKYGVELIGADFDAIQRGEDRQRFKDIVAKAGGESARSRVCFTMAEV RETVAELGLPVVVRPSFTMGGLGSGIAYSTDEVDRMAGAGLAASPSANVLIEESIYGW KEFELELMRDGHDNVVVVCSIENVDPMGVHTGDSVTVAPAMTLTDREYQRMRDLGIAI LREVGVDTGGCNIQFAVNPRDGRLIVIEMNPRVSRSSALASKATGFPIAKIAAKLAIG YTLDEIVNDITGETPACFEPTLDYVVVKAPRFAFEKFPGADPTLTTTMKSVGEAMSLG RNFVEALGKVMRSLETTRAGFWTAPDPDGGIEEALTRLRTPAEGRLYDIELALRLGAT VERVAEASGVDPWFIAQINELVNLRNELVAAPVLNAELLRRAKHSGLSDHQIASLRPE LAGEAGVRSLRVRLGIHPVYKTVDTCAAEFEAQTPYHYSSYELDPAAETEVAPQTERP KVLILGSGPNRIGQGIEFDYSCVHAATTLSQAGFETVMVNCNPETVSTDYDTADRLYF EPLTFEDVLEVYHAEMESGSGGPGVAGVIVQLGGQTPLGLAHRLADAGVPIVGTPPEA IDLAEDRGAFGDLLSAAGLPAPKYGTATTFAQARRIAEEIGYPVLVRPSYVLGGRGME IVYDEETLQGYITRATQLSPEHPVLVDRFLEDAVEIDVDALCDGAEVYIGGIMEHIEE AGIHSGDSACALPPVTLGRSDIEKVRKATEAIAHGIGVVGLLNVQYALKDDVLYVLEA NPRASRTVPFVSKATAVPLAKACARIMLGATIAQLRAEGLLAVTGDGAHAARNAPIAV KEAVLPFHRFRRADGAAIDSLLGPEMKSTGEVMGIDRDFGSAFAKSQTAAYGSLPAQG TVFVSVANRDKRSLVFPVKRLADLGFRVLATEGTAEMLRRNGIPCDDVRKHFEPAQPG RPTMSAVDAIRAGEVNMVINTPYGNSGPRIDGYEIRSAAVAGNIPCITTVQGASAAVQ GIEAGIRGDIGVRSLQELHRVIGGVER" CDS 1561533..1562357 /codon_start=1 /transl_table=11 /gene="pyrF" /locus_tag="BQ2027_MB1420" /product="PROBABLE OROTIDINE 5'-PHOSPHATE DECARBOXYLASE PYRF (OMP decarboxylase) (OMPdecase)" /note="Mb1420, pyrF, len: 274 aa. Equivalent to Rv1385, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 274 aa overlap). Probable pyrF, orotidine 5'-phosphate decarboxylase (EC 4.1.1.23), identical to DCOP_MYCBO|P42610 Mycobacterium bovis (274 aa). Contains PS00156 Orotidine 5'-phosphate decarboxylase active site. BELONGS TO THE OMP DECARBOXYLASE FAMILY. Protein product from Mb1420 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1420 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5M7" /db_xref="InterPro:IPR001754" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR011995" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR018089" /db_xref="UniProtKB/Swiss-Prot:P0A5M7" /protein_id="SIU00023.1" /translation="MTGFGLRLAEAKARRGPLCLGIDPHPELLRGWDLATTADGLAAF CDICVRAFADFAVVKPQVAFFESYGAAGFAVLERTIAELRAADVLVLADAKRGDIGAT MSAYATAWVGDSPLAADAVTASPYLGFGSLRPLLEVAAAHGRGVFVLAATSNPEGAAV QNAAADGRSVAQLVVDQVGAANEAAGPGPGSIGVVVGATAPQAPDLSAFTGPVLVPGV GVQGGRPEALGGLGGAASSQLLPAVAREVLRAGPGVPELRAAGERMRDAVAYLAAV" CDS 1562552..1562860 /codon_start=1 /transl_table=11 /gene="PE15" /locus_tag="BQ2027_MB1421" /product="pe family protein pe15" /note="Mb1421, PE15, len: 102 aa. Equivalent to Rv1386, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 102 aa overlap). Member of Mycobacterium tuberculosis PE family (see first citation below), similar to many e.g. G913039 ORF 3' OF PGRS TANDEM REPEAT (polymorphic GC-rich sequence) (100 aa), FASTA scores: opt: 149, E(): 0.0013, (31.5% identity in 92 aa overlap); also similar to Q49943|U1756A (99 aa) (34.7% identity in 95 aa overlap) and G466937|U1620K (100 aa) (36.2% identity in 69 aa overlap). Protein product from Mb1421 detected using SWATH mass spectrometry. Mb1421 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A683" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A683" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00024.1" /translation="MTLRVVPESLAGASAAIEAVTARLAAAHAAAAPFIAAVIPPGSD SVSVCNAVEFSVHGSQHVAMAAQGVEELGRSGVGVAESGASYAARDALAAASYLSGGL " CDS 1562857..1564476 /codon_start=1 /transl_table=11 /gene="PPE20" /locus_tag="BQ2027_MB1422" /product="ppe family protein ppe20" /note="Mb1422, PPE20, len: 539 aa. Equivalent to Rv1387, len: 539 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 539 aa overlap). Member of Mycobacterium tuberculosis PPE family of proteins, similar to many e.g. Y05F_MYCTU|Q10892 hypothetical 46.9 kd protein cy251.15 (463 aa), FASTA scores: E(): 4.2e-26, (37.7% identity in 531 aa overlap); similar also to MTCY274.23c (37.5% identity in 168 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. Protein product from Mb1422 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1422 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XY78" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00025.1" /translation="MTEPWIAFPPEVHSAMLNYGAGVGPMLISATQNGELSAQYAEAA SEVEELLGVVASEGWQGQAAEAFVAAYMPFLAWLIQASADCVEMAAQQHAVIEAYTAA VELMPTQVELAANQIKLAVLVATNFFGINTIPIAINEAEYVEMWVRAATTMATYSTVS RSALSAMPHTSPPPLILKSDELLPDTGEDSDEDGHNHGGHSHGGHARMIDNFFAEILR GVSAGRIVWDPVNGTLNGLDYDDYVYPGHAIWWLARGLEFFQDGEQFGELLFTNPTGA FQFLLYVVVVDLPTHIAQIATWLGQYPQLLSAALTGVIAHLGAITGLAGLSGLSAIPS AAIPAVVPELTPVAAAPPMLAVAGVGPAVAAPGMLPASAPAPAAAAGATAAGPTPPAT GFGGFPPYLVGGGGPGIGFGSGQSAHAKAAASDSAAAESAAQASARAQARAARRGRSA AKARGHRDEFVTMDMGFDAAAPAPEHQPGARASDCGAGPIGFAGTVRKEAVVKAAGLT TLAGDDFGGGPTMPMMPGTWTHDQGVFDEHR" CDS 1564782..1565354 /codon_start=1 /transl_table=11 /gene="mihF" /locus_tag="BQ2027_MB1423" /product="PUTATIVE INTEGRATION HOST FACTOR MIHF" /note="Mb1423, mihF, len: 190 aa. Equivalent to Rv1388, len: 190 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 190 aa overlap). Putative mihF, integration host factor. Almost identical to, but longer than, P96802|U75344 Mycobacterium smegmatis integration host factor (mIHF) for mycobacteriophage L5 (105 aa), FASTA scores: E(): 0, (96.1% identity in 102 aa overlap). Protein product from Mb1423 detected using shotgun mass spectrometry. Mb1423 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYL4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00026.1" /translation="MLGNTIHVPCQPCRHGHGAPSRGLRGRPADRWPVARATPTLHVC PQNQGVGLDFVRKPEYGRLRWPAYPAGTNNDRLISMRDGGIVALPQLTDEQRAAALEK AAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEI MTELEIAPTRRLRGLGDRQRKALLEKFGSA" CDS 1565489..1566115 /codon_start=1 /transl_table=11 /gene="gmk" /locus_tag="BQ2027_MB1424" /product="PROBABLE GUANYLATE KINASE GMK" /note="Mb1424, gmk, len: 208 aa. Equivalent to Rv1389, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 208 aa overlap). Probable gmk, guanylate kinase (EC 2.7.4.8), similar to e.g. KGUA_ECOLI|P24234 guanylate kinase from Escherichia coli (207 aa), FASTA scores: opt: 424, E(): 6.6e-20, (35.9% identity in 184 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00856 Guanylate kinase signature. BELONGS TO THE GUANYLATE KINASE FAMILY. Protein product from Mb1424 detected using SWATH mass spectrometry. Mb1424 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5I5" /db_xref="InterPro:IPR008144" /db_xref="InterPro:IPR008145" /db_xref="InterPro:IPR017665" /db_xref="InterPro:IPR020590" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P0A5I5" /protein_id="SIU00027.1" /translation="MSVGEGPDTKPTARGQPAAVGRVVVLSGPSAVGKSTVVRCLRER IPNLHFSVSATTRAPRPGEVDGVDYHFIDPTRFQQLIDQGELLEWAEIHGGLHRSGTL AQPVRAAAATGVPVLIEVDLAGARAIKKTMPEAVTVFLAPPSWQDLQARLIGRGTETA DVIQRRLDTARIELAAQGDFDKVVVNRRLESACAELVSLLVGTAPGSP" CDS 1566181..1566513 /codon_start=1 /transl_table=11 /gene="rpoZ" /locus_tag="BQ2027_MB1425" /product="PROBABLE DNA-DIRECTED RNA POLYMERASE (OMEGA CHAIN) RPOZ (TRANSCRIPTASE OMEGA CHAIN) (RNA POLYMERASE OMEGA SUBUNIT)" /note="Mb1425, rpoZ, len: 110 aa. Equivalent to Rv1390, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 110 aa overlap). Probable rpoZ, DNA-directed RNA polymerase omega chain (EC 2.7.7.6). BELONGS TO THE RNA POLYMERASE OMEGA CHAIN FAMILY. Protein product from Mb1425 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1425 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66722" /db_xref="InterPro:IPR003716" /db_xref="InterPro:IPR006110" /db_xref="InterPro:IPR012293" /db_xref="InterPro:IPR036161" /db_xref="UniProtKB/Swiss-Prot:P66722" /protein_id="SIU00028.1" /translation="MSISQSDASLAAVPAVDQFDPSSGASGGYDTPLGITNPPIDELL DRVSSKYALVIYAAKRARQINDYYNQLGEGILEYVGPLVEPGLQEKPLSIALREIHAD LLEHTEGE" CDS 1566529..1567785 /codon_start=1 /transl_table=11 /gene="dfp" /locus_tag="BQ2027_MB1426" /product="PROBABLE DNA/PANTOTHENATE METABOLISM FLAVOPROTEIN HOMOLOG DFP" /note="Mb1426, dfp, len: 418 aa. Equivalent to Rv1391, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 418 aa overlap). Probable dfp, DNA/pantothenate metabolism flavoprotein homolog, similar to many e.g. DFP_ECOLI|P24285 Escherichia coli (430 aa), FASTA scores: opt: 763, E(): 0, (40.2% identity in 408 aa overlap). Protein product from Mb1426 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1426 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67734" /db_xref="InterPro:IPR003382" /db_xref="InterPro:IPR005252" /db_xref="InterPro:IPR007085" /db_xref="InterPro:IPR035929" /db_xref="InterPro:IPR036551" /db_xref="UniProtKB/Swiss-Prot:P67734" /protein_id="SIU00029.1" /translation="MVDHKRIPKQVIVGVSGGIAAYKACTVVRQLTEASHRVRVIPTE SALRFVGAATFEALSGEPVCTDVFADVPAVPHVHLGQQADLVVVAPATADLLARAAAG RADDLLTATLLTARCPVLFAPAMHTEMWLHPATVDNVATLRRRGAVVLEPATGRLTGA DSGAGRLPEAEEITTLAQLLLERHDALPYDLAGRKLLVTAGGTREPIDPVRFIGNRSS GKQGYAVARVAAQRGADVTLIAGHTAGLVDPAGVEVVHVSSAQQLADAVSKHAPTADV LVMAAAVADFRPAQVATAKIKKGVEGPPTIELLRNDDVLAGVVRARAHGQLPNMRAIV GFAAETGDANGDVLFHARAKLRRKGCDLLVVNAVGEGRAFEVDSNDGWLLASDGTESA LQHGSKTLMASRIVDAIVTFLAGCSS" CDS 1567913..1569124 /codon_start=1 /transl_table=11 /gene="metK" /locus_tag="BQ2027_MB1427" /product="PROBABLE S-ADENOSYLMETHIONINE SYNTHETASE METK (MAT) (AdoMet synthetase) (Methionine adenosyltransferase)" /note="Mb1427, metK, len: 403 aa. Equivalent to Rv1392, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 403 aa overlap). Probable metK, S-adenosylmethionine synthetase (EC 2.5.1.6), similar to many e.g. METK_STAAU|P50307 Staphylococcus aureus (397 aa), FASTA scores: opt: 1484, E(): 0, (58.0% identity in 400 aa overlap). Contains PS00376 S-adenosylmethionine synthetase signature 1, PS00377 S-adenosylmethionine synthetase signature 2. BELONGS TO THE ADOMET SYNTHETASE FAMILY. Protein product from Mb1427 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1427 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U051" /db_xref="InterPro:IPR002133" /db_xref="InterPro:IPR022628" /db_xref="InterPro:IPR022629" /db_xref="InterPro:IPR022630" /db_xref="InterPro:IPR022631" /db_xref="InterPro:IPR022636" /db_xref="UniProtKB/Swiss-Prot:Q7U051" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00030.1" /translation="MSEKGRLFTSESVTEGHPDKICDAISDSVLDALLAADPRSRVAV ETLVTTGQVHVVGEVATSAKEAFADITNTVRARILEIGYDSSDKGFDGATCGVNIGIG AQSPDIAQGVDTAHEARVEGAADPLDSQGAGDQGLMFGYAINATPELMPLPIALAHRL SRRLTEVRKNGVLPYLRPDGKTQVTIAYEDNVPVRLDTVVISTQHAADIDLEKTLDPD IREKVLNTVLDDLAHETLDASTVRVLVNPTGKFVLGGPMGDAGLTGRKIIVDTYGGWA RHGGGAFSGKDPSKVDRSAAYAMRWVAKNVVAAGLAERVEVQVAYAIGKAAPVGLFVE TFGTETEDPVKIEKAIGEVFDLRPGAIIRDLNLLRPIYAPTAAYGHFGRTDVELPWEQ LDKVDDLKRAI" CDS complement(1569197..1570675) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1428C" /product="PROBABLE MONOXYGENASE" /note="Mb1428c, -, len: 492 aa. Equivalent to Rv1393c, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 492 aa overlap). Probable monooxygenase (EC 1.14.13.-), similar to others e.g. CYMO_ACISP|P12015 cyclohexanone monooxygenase (EC 1.14.13.22) from Acinetobacter sp. (542 aa), FASTA scores: E(): 0, (33.0% identity in 473 aa overlap); also to Rv3083|MTCY31.20|E241788 hypothetical 55.0 kDa protein from Mycobacterium tuberculosis (495 aa) (36.3% identity in 490 aa overlap); and Rv0565c, Rv3854c, Rv3049c, Rv0892. Protein product from Mb1428c detected using SWATH mass spectrometry. Mb1428c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XY79" /protein_id="SIU00031.1" /translation="MMPDYHALIVGAGFSGIGAAIKLDRAGFSDYLVVEAGDGVGGTW HWNTYPGIAVDIPSFSYQFSFEQSRHWSRTYAPGHELKAYAEHCVDKYGIRSRIRLNT KVLAAEFDDEHSLWRVQTDPGGEITARFLISACGILTVPKLPDIDGVDSFEGVTMHTA RWDHTQDLTGKRVGIIGTGASAVQVIPEMAPIVSHLTVFQRTPIWCFPKFDVPLPTAV RWAMRIPGGKAVHRLLSQAFVEATFPIAAHYFAVFPLAKHMESAGRRYLRQQVHDPVV REQLTPRYAVGCKRPGFHNTYLSTFNRDNVRLVTEPIDKITPTAVATTDGASHEIDVL VLATGFKVLDTDSIPTYAVTGTGGASLSRFWDEHRLQAYEGVSVPGYPNFFTVFGPYG YVGSSYFALIETQAHHIIRCLKRARRTGATRIEVTEEANARYFAEVMRRRHRQVFWQD SCRLANSYYFDKNGDVPLRPTTTVEAYWRSRRFDLGDYRISS" CDS complement(1570672..1572057) /codon_start=1 /transl_table=11 /gene="cyp132" /locus_tag="BQ2027_MB1429C" /product="PROBABLE CYTOCHROME P450 132 CYP132" /note="Mb1429c, cyp132, len: 461 aa. Equivalent to Rv1394c, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 461 aa overlap). Probable cyp132, cytochrome P450 132 (EC 1.14.-.-). Some similarity to others e.g. CP4B_HUMAN|P13584 human cytochrome p450 (511 aa), FASTA scores: opt: 486, E(): 7.4e-21, (28.6% identity in 423 aa overlap); etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. MAY BELONG TO THE CYTOCHROME P450 FAMILY. Mb1429c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59954" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002401" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P59954" /protein_id="SIU00032.1" /translation="MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGS DITRFRCAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDSWA RHRGALNSTFARRHLRGLVGLMIDPIADVTAALVPGAQFDMHQSMVETTLRVVANALF SQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYDTLIWCIYSGVHLPPPLRE MQEITLTLDRAINSVIDRRLAEPTNSADLLNVLLSADGGIWPRQRVRDEALTFMLAGH ETTANAMSWFWYLMALNPQARDHMLTELDDVLGMRRPTADDLGKLAWTTACLQESQRY FSSVWIIAREAVDDDIIDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPT DRPRCAYLPFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP KHGVHVIGRRR" CDS 1572135..1573169 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1430" /product="transcriptional regulatory protein" /note="Mb1430, -, len: 344 aa. Equivalent to Rv1395, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 344 aa overlap). Probable transcriptional regulatory protein, similar to many e.g. URER_PROMI|Q02458 urease operon transcriptional activator from Proteus mirabilis (293 aa), FASTA scores: E():1.5e-08, (41.7% identity in 84 aa overlap); YHIX_ECOLI|P37639 hypothetical transcriptional regulatory protein from Escherichia coli (274 aa), FASTA scores: opt: 238, E(): 3.5e-09, (27.3% identity in 249 aa overlap); and G296916|X68281 POSSIBLE VIRULENCE-REGULATING protein from Mycobacterium tuberculosis (339 aa), FASTA scores: opt: 228, E(): 1.9e-08, (27.0% identity in 278 aa overlap). Helix turn helix motif present, aa 261-282 (+4.68 SD). BELONGS TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. 3' part corrected since first submission (-14 aa). Mb1430 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P68912" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR020449" /db_xref="InterPro:IPR032687" /db_xref="UniProtKB/Swiss-Prot:P68912" /protein_id="SIU00033.1" /translation="MGHLPPPAEVRHPVYATRVLCEVANERGVPTADVLAGTAIEPAD LDDPDAVVGALDEITAVRRLLARLPDDAGIGIDVGSRFALTHFGLFGFAVMSCGTLRE LLTIAMRYFALTTMHVDITLFETADDCLVELDASHLPADVRGFFIERDIAGIIATTTS FALPLAAKYADQVSAELAVDAELLRPLLELVPVHDVAFGRAHNRVHFPRAMFDEPLPQ ADRHTLEMCIAQCDVLMQRNERRRGITALVRSKLFRDSGLFPTFTDVAGELDMHPRTL RRRLAEEGTSFRALLGEARSTVAVDLLRNVGLTVQQVSTRLGYTEVSTFSHAFKRWYG VAPSEYSRRG" CDS complement(1573215..1574573) /codon_start=1 /transl_table=11 /gene="PE_PGRS25" /locus_tag="BQ2027_MB1431C" /product="pe-pgrs family protein pe_pgrs25" /note="Mb1431c, PE_PGRS25, len: 452 aa. Similar to Rv1396c, len: 576 aa, from Mycobacterium tuberculosis strain H37Rv, (78.1% identity in 576 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, strong similarity to many e.g. glycine rich protein MTCY130.10C|E245019 (603 aa), FASTA scores: opt: 1945, E(): 0, (57.5% identity in 619 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, similar to other PGRS-type sequences. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 372 bp deletion leads to a shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (452 aa versus 576 aa). Mb1431c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ81" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00034.1" /translation="MSFLFAQPEMLGAAATDLASIGSAISTANAAAAAATTRVLAAGA DEVSAAVAALFSGHAQTYQALSTQAAAFHQQIVQTLTSTAGAYASAEAANVEQQLLGA INAPTMALLGRPLIGHGADGAPGTGQAGGAGGILYGNGGNGGSGATGQAGGAGGAAGL IGHGGAGGLGGTGASGGAGGAGGWLWGNGGAGGNGGVGVAGDPGGVGGAGGAGGAAGL WGSGGSGGTGGQGGVGGGKSGDGGTGGIGGAGGGGGWLHGDGGAGGHGGQGGTGVSSG GNGGAGGTGGDGRGLSGSGGAGGHGGQTGVGGKVGENNFGGAGGAGGTGGLIGNGGAG GTGGKGGDGFGVFGKGGAGGTGGRGGAAGLIGDAGTGGTGGKGGTAGEDGTGGNGGTG GNGGAAVLIGNGGGGGAGGNGGAGNDGTPGNGGGGGVGGTGGTLFGQPGQPGPPGQPG PA" CDS complement(1574828..1575229) /codon_start=1 /transl_table=11 /gene="vapc10" /locus_tag="BQ2027_MB1432C" /product="possible toxin vapc10" /note="Mb1432c, -, len: 133 aa. Equivalent to Rv1397c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 133 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis protein MTCY159.08C|Rv2548 (125 aa), FASTA scores: E(): 2.3e-14, (42.4% identity in 125 aa overlap). Protein product from Mb1432c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1432c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY88" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XY88" /protein_id="SIU00035.1" /translation="MILVDSDVLIAHLRGVVAARDWLVSARKDGPLAISVVSTAELIG GMRTAERREVWRLLASFRVQPATEVIARRAGDMMRRYRRSHNRIGLGDYLIAATADVQ GLQLATLNVWHFPMFEQLKPPFAVPGHRPRA" CDS complement(1575226..1575483) /codon_start=1 /transl_table=11 /gene="vapb10" /locus_tag="BQ2027_MB1433C" /product="possible antitoxin vapb10" /note="Mb1433c, -, len: 85 aa. Equivalent to Rv1398c, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 85 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis proteins Rv2547|MTCY159.09C (85 aa), FASTA scores: E(): 0.0035, (37.1% identity in 62 aa overlap); Rv0581, Rv2871, Rv1241, etc. Protein product from Mb1433c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1433c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64836" /db_xref="InterPro:IPR002145" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR013321" /db_xref="UniProtKB/Swiss-Prot:P64836" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00036.1" /translation="MKRTNIYLDEEQTASLDKLAAQEGVSRAELIRLLLNRALTTAGD DLASDLQAINDSFGTLRHLDPPVRRSGGREQHLAQVWRATS" CDS complement(1575566..1576525) /codon_start=1 /transl_table=11 /gene="nlhh" /locus_tag="BQ2027_MB1434C" /product="probable non lipolytic carboxylesterase nlhh" /note="Mb1434c, lipH, len: 319 aa. Equivalent to Rv1399c, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 319 aa overlap). Possible LipH, lipase (EC 3.1.-.-), most similar to G695278 lipase like enzyme from Ralstonia eutropha (364 aa), FASTA scores: opt: 648, E(): 4.4e-34, (37.3% identity in 327 aa ov erlap), similar to Mycobacterium tuberculosis hypothetical lipases e.g. Rv2284, Rv2485c, Rv1426c, etc. Protein product from Mb1434c detected using SWATH mass spectrometry. Mb1434c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XY93" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XY93" /protein_id="SIU00037.1" /translation="MTEPTVARPDIDPVLKMLLDTFPVTFTAADGVEVARARLRQLKT PPELLPELRIEERTVGYDGLTDIPVRVYWPPVVRDNLPVVVYYHGGGWSLGGLDTHDP VARAHAVGAQAIVVSVDYRLAPEHPYPAGIDDSWAALRWVGENAAELGGDPSRIAVAG DSAGGNISAVMAQLARDVGGPPLVFQLLWYPTTMADLSLPSFTENADAPILDRDVIDA FLAWYVPGLDISDHTMLPTTLAPGNADLSGLPPAFIGTAEHDPLRDDGACYAELLTAA GVSVELSNEPTMVHGYVNFALVVPAAAEATGRGLAALKRALHA" CDS complement(1576550..1577512) /codon_start=1 /transl_table=11 /gene="lipI" /locus_tag="BQ2027_MB1435C" /product="PROBABLE LIPASE LIPH" /note="Mb1435c, lipI, len: 320 aa. Equivalent to Rv1400c, len: 320 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 320 aa overlap). Possible lipI, lipase (EC 3.1.-.-), most similar to G695278 lipase like enzyme (364 aa), FASTA sscores: opt: 611, E(): 3.5e-30, (36.6% identity in 352 aa overlap); similar to Mycobacterium tuberculosis hypothetical lipases e.g. Rv1399c|MTCY21B4.16c (58.1% identical in 315 aa overlap); Rv1426c, Rv2284, etc. Protein product from Mb1435c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1435c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY96" /db_xref="InterPro:IPR002168" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR033140" /db_xref="UniProtKB/TrEMBL:A0A1R3XY96" /protein_id="SIU00038.1" /translation="MPSLDNTADEKPAIDPILLKVLDAVPFRLSIDDGIEAVRQRLRD LPRQPVHPELRVVDLAIDGPAGPIGTRIYWPPTCPDQAEAPVVLYFHGGGFVMGDLDT HDGTCRQHAVGADAIVVSVDYRLAPEHPYPAAIEDAWAATRWVAEHGRQVGADLGRIA VAGDSAGGTIAAVIAQRARDMGGPPIVFQLLWYPSTLWDQSLPSLAENADAPILDVKA IAAFSRWYAGEIDLHNPPAPMAPGRAENLADLPPAYIAVAGYDPLRDDGIRYGELLAA AGVPVEVHNAQTLVHGYVGYAGVVPAATEATNRGLVALRVVLHG" CDS 1577646..1578248 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1436" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1436, -, len: 200 aa. Equivalent to Rv1401, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 200 aa overlap). Possible membrane protein. Mb1436 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64838" /db_xref="InterPro:IPR012506" /db_xref="UniProtKB/Swiss-Prot:P64838" /protein_id="SIU00039.1" /translation="MLQPAFKASMAVLLAAAAVAHPIGRERRWLVPALLLSATGDWLL AIPWWTWAFVFGLGAFLLAHLCFIGALLPLARQAAPSRGRVAAVVAMCVASAGLLVWF WPHLGKDNLTIPVTVYIVALSAMVCTALLARLPTIWTAVGAVCFAASDSMIGIGRFIL GNEALAVPIWWSYAAAEILITAGFFFGREVPDNAAAPTDS" CDS 1578329..1580296 /codon_start=1 /transl_table=11 /gene="priA" /locus_tag="BQ2027_MB1437" /product="PUTATIVE PRIMOSOMAL PROTEIN N' PRIA (Replication factor Y)" /note="Mb1437, priA, len: 655 aa. Equivalent to Rv1402, len: 655 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 655 aa overlap). Putative priA, primosomal protein N'. Similar to e.g. PRIA_ECOLI|P17888 primosomal protein N' (replication factor Y) (732 aa), FASTA scores, opt: 386, E(): 1.3e-16, (27.6% identity in 711 aa overlap). Compared to other bacterial priA, it has a very divergent helicase domain. BELONGS TO THE HELICASE FAMILY. PRIA SUBFAMILY. Protein product from Mb1437 detected using SWATH mass spectrometry. Mb1437 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5A6" /db_xref="InterPro:IPR005259" /db_xref="InterPro:IPR041222" /db_xref="InterPro:IPR042115" /db_xref="UniProtKB/Swiss-Prot:P0A5A6" /protein_id="SIU00040.1" /translation="MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLE RRSDSDHHGKLGWLDRVVSPEPVLTTEIRRLVDAVAARYAGTRQDVLRLAVPARHARV EREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALPGELWADRFAE AAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVALSAGLGPEARYRRWLAAL RGSARLVIGTRSAVFAPLSELGLVMVWADADDSLAEPRAPYPHAREVAMLRAHQARCA ALIGGYARTAEAHALVRSGWAHDVVAPRPEVRARSPRVVALDDSGYDDARDPAARTAR LPSIALRAARSALQSGAPVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSP GAVCRWCGRVDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQ LDAGPALVVATPGAEPRAPGGYGAALLLDSWALLGRQDLRAAEDALWRWMTAAALVRP RGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGLPPSVHIAALDGPAGTV TALLEAARLPDPDRLQADLLGPVDLPPGVRRPAGIPADAPVIRMLLRVCREQGLELAA SLRRGIGVLSARQTRQTRSLVRVQIDPLHIG" CDS complement(1580314..1581138) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1438C" /product="PUTATIVE METHYLTRANSFERASE" /note="Mb1438c, -, len: 274 aa. Equivalent to Rv1403c, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 274 aa overlap). Putative methyltransferase (EC 2.1.1.-), similar to PMTA_RHOSH|Q05197 phosphatidylethanolamine m-methyltransferase (203 aa), FASTA scores: opt: 217, E(): 1.1e-07, (37.1% identity in 105 aa overlap); similar to Rv1405c|MTCY21B4.22c (59.3% identity in 273 aa overlap) and to Rv1523, Rv2952, etc." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/Swiss-Prot:P64840" /protein_id="SIU00041.1" /translation="MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVST SGIRRGDRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGWRE ANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTLNWTPEGFYGK LLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIRTRRGSLTVDRFGCPDECR DYFKNFYGPAINAYRSIADSPECVATLDAEITELCREYLCDGVMQWEYLIFTARKC" CDS 1581307..1581789 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1439" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1439, -, len: 160 aa. Equivalent to Rv1404, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 160 aa overlap). Probable transcriptional regulatory protein, some similarity to MARR_ECOLI|P27245 multiple antibiotic resistance protein from Escherichia coli (125 aa), FASTA scores: opt: 136, E(): 0.004, (35.1% identity in 74 aa overlap). Protein product from Mb1439 detected using shotgun mass spectrometry. Mb1439 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XY87" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR023187" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XY87" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00042.1" /translation="MMPTEYPATAEESVDVITDALLTASRLLVAISAHSIAQVDENIT IPQFRTLVILSNHGPINLATLATLLGVQPSATGRMVDRLVGAELIDRLPHPTSRRELL AALTKRGRDVVRQVTEHRRTEIARIVEQMAPAERHGLVRALTAFTEAGGEPDARYEIE " CDS complement(1581861..1582685) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1440C" /product="PUTATIVE METHYLTRANSFERASE" /note="Mb1440c, -, len: 274 aa. Equivalent to Rv1405c, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 274 aa overlap). Putative methyltransferase (EC 2.1.1.-), most similar to PMTA_RHOSH|Q05197 phosphatidylethanolamine m-methyltransferase (203 aa), FASTA scores: opt: 219, E(): 2.6e-07, (29.9% identity in 144 aa overlap); similar to Rv1403c|MTCY21B4.20c (59.3% identity in 273 aa overlap), Rv1523, Rv2952, etc. Protein product from Mb1440c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P64842" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P64842" /protein_id="SIU00043.1" /translation="MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAA AGIGPGVRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQYQE ANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVISWTCEGFFGR MLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLKTARGLLEVKRFDTAQAVH DYFKNNYGPTIEAYAHIGDNAVLAAELDRQLVELAAQYLSDGVMEWEYLLLTAEKR" CDS 1582882..1583820 /codon_start=1 /transl_table=11 /gene="fmt" /locus_tag="BQ2027_MB1441" /product="PROBABLE METHIONYL-TRNA FORMYLTRANSFERASE FMT" /note="Mb1441, fmt, len: 312 aa. Equivalent to Rv1406, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 312 aa overlap). Probable fmt, methionyl-tRNA formyltransferase (EC 2.1.2.9), similar to many e.g. FMT_ECOLI|P23882 Escherichia coli (314 aa), FASTA scores: opt: 616, E(): 6.7e-31, (39.3% identity in 303 aa overlap). BELONGS TO THE FMT FAMILY. Protein product from Mb1441 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1441 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64135" /db_xref="InterPro:IPR002376" /db_xref="InterPro:IPR005793" /db_xref="InterPro:IPR005794" /db_xref="InterPro:IPR011034" /db_xref="InterPro:IPR036477" /db_xref="InterPro:IPR037022" /db_xref="InterPro:IPR041711" /db_xref="UniProtKB/Swiss-Prot:P64135" /protein_id="SIU00044.1" /translation="MRLVFAGTPEPALASLRRLIESPSHDVIAVLTRPDAASGRRGKP QPSPVAREAAERGIPVLRPSRPNSAEFVAELSDLAPECCAVVAYGALLGGPLLAVPPH GWVNLHFSLLPAWRGAAPVQAAIAAGDTITGATTFQIEPSLDSGPIYGVVTEVIQPTD TAGDLLKRLAVSGAALLSTTLDGIADQRLTPRPQPADGVSVAPKITVANARVRWDLPA AVVERRIRAVTPNPGAWTLIGDLRVKLGPVHLDAAHRPSKPLPPGGIHVERTSVWIGT GSEPVRLGQIQPPGKKLMNAADWARGARLDLAARAT" CDS 1583817..1585190 /codon_start=1 /transl_table=11 /gene="fmu" /locus_tag="BQ2027_MB1442" /product="PROBABLE FMU PROTEIN (SUN PROTEIN)" /note="Mb1442, fmu, len: 457 aa. Equivalent to Rv1407, len: 457 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 457 aa overlap). Probable fmu protein, similar to SUN_ECOLI|P36929 sun protein (fmu protein) from Escherichia coli (429 aa), FASTA scores: E(): 2.5e-20, (30.6% identity in 451 aa overlap). Protein product from Mb1442 detected using SWATH mass spectrometry. Mb1442 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XY95" /db_xref="InterPro:IPR001678" /db_xref="InterPro:IPR006027" /db_xref="InterPro:IPR018314" /db_xref="InterPro:IPR023267" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR035926" /db_xref="UniProtKB/TrEMBL:A0A1R3XY95" /protein_id="SIU00045.1" /translation="MTPRSRGPRRRPLDPARRAAFETLRAVSARDAYANLVLPALLAQ RGIGGRDAAFATELTYGTCRARGLLDAVIGAAAERSPQAIDPVLLDLLRLGTYQLLRT RVDAHAAVSTTVEQAGIEFDSARAGFVNGVLRTIAGRDERSWVGELAPDAQNDPIGHA AFVHAHPRWIAQAFADALGAAVGELEAVLASDDERPAVHLAARPGVLTAGELARAVRG TVGRYSPFAVYLPRGDPGRLAPVRDGQALVQDEGSQLVARALTLAPVDGDTGRWLDLC AGPGGKTALLAGLGLQCAARVTAVEPSPHRADLVAQNTRGLPVELLRVDGRHTDLDPG FDRVLVDAPCTGLGALRRRPEARWRRQPADVAALAKLQRELLSAAIALTRPGGVVLYA TCSPHLAETVGAVADALRRHPVHALDTRPLFEPVLAGLGEGPHVQLWPHRHGTDAMFA AALRRLT" CDS 1585215..1585913 /codon_start=1 /transl_table=11 /gene="rpe" /locus_tag="BQ2027_MB1443" /product="PROBABLE RIBULOSE-PHOSPHATE 3-EPIMERASE RPE (PPE) (R5P3E) (Pentose-5-phosphate 3-epimerase)" /note="Mb1443, rpe, len: 232 aa. Equivalent to Rv1408, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 232 aa overlap). Probable rpe, ribulose-phosphate 3-epimerase (EC 5.1.3.1), similar to many e.g. CXEC_ALCEU|P40117 (241 aa), FASTA scores: opt: 638, E(): 1.5e-34, (48.3% identity in 234 aa overlap); and RPE_ECOLI|P32661 ribulose-phosphate 3-epimerase (225 aa), FASTA scores: E(): 0, (46.2% identity in 221 aa overlap). Contains PS01085 Ribulose-phosphate 3-epimerase family signature 1. BELONGS TO THE RIBULOSE-PHOSPHATE 3-EPIMERASE FAMILY. Protein product from Mb1443 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1443 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65761" /db_xref="InterPro:IPR000056" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR026019" /db_xref="UniProtKB/Swiss-Prot:P65761" /protein_id="SIU00046.1" /translation="MSLMAGSTGGPLIAPSILAADFARLADEAAAVNGADWLHVDVMD GHFVPNLTIGLPVVESLLAVTDIPMDCHLMIDNPDRWAPPYAEAGAYNVTFHAEATDN PVGVARDIRAAGAKAGISVKPGTPLEPYLDILPHFDTLLVMSVEPGFGGQRFIPEVLS KVRAVRKMVDAGELTILVEIDGGINDDTIEQAAEAGVDCFVAGSAVYGADDPAAAVAA LRRQAGAASLHLSL" CDS 1585910..1586929 /codon_start=1 /transl_table=11 /gene="ribG" /locus_tag="BQ2027_MB1444" /product="PROBABLE BIFUNCTIONAL riboflavin biosynthesis protein RIBG : Diaminohydroxyphosphoribosylaminopyrimidine deaminase (Riboflavin-specific deaminase) + 5-amino-6-(5-phosphoribosylamino) uracil reductase (HTP reductase)" /note="Mb1444, ribG, len: 339 aa. Equivalent to Rv1409, len: 339 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 339 aa overlap). Probable ribG (alternate gene name: ribD), bifunctional riboflavin biosynthesis protein, including diaminohydroxyphosphoribosylaminopyrimidine deaminase and 5-amino-6-(5-phosphoribosylamino) uracil reductase (EC 3.5.4.26 and 1.1.1.193), similar to many e.g. RIBD_ECOLI|P25539 riboflavin-specific deaminase from Escherichia coli (367 aa), FASTA scores: E(): 0, (39.8% identity in 364 aa overlap); etc. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. IN THE N-TERMINAL SECTION; BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE HTP REDUCTASE FAMILY. Protein product from Mb1444 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1444 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYA3" /db_xref="InterPro:IPR002125" /db_xref="InterPro:IPR002734" /db_xref="InterPro:IPR004794" /db_xref="InterPro:IPR016192" /db_xref="InterPro:IPR016193" /db_xref="InterPro:IPR024072" /db_xref="UniProtKB/TrEMBL:A0A1R3XYA3" /protein_id="SIU00047.1" /translation="MNVEQVKSIDEAMGLAIEHSYQVKGTTYPNPPVGAVIVDPNGRI VGAGGTEPAGGDHAEVVALRRAGGLATGAIVVVTMEPCNHYGKTPPCVNALIEARVGT VVYAVADPNGIAGGGAGRLSAAGLQVRSGVLAEQVAAGPLREWLHKQRTGLPHVTWKY ATSIDGRSAAADGSSQWISSEAARLDLHRRRAIADAILVGTGTVLADDPALTARLADG SLAPQQPLRVVVGKRDIPPEARVLNDEARTMMIRTHEPMEVLRALSDRTDVLLEGGPT LAGAFLRAGAINRILAYVAPILLGGPVTAVDDVGVSNITNALRWQFDSVEKVGPDLLL SLVAR" CDS complement(1586926..1588482) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1445C" /standard_name="P55" /product="AMINOGLYCOSIDES/TETRACYCLINE-TRANSPORT INTEGRAL MEMBRANE PROTEIN" /note="Mb1445c, -, len: 518 aa. Equivalent to Rv1410c, len: 518 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 518 aa overlap). Aminoglycoside/tetracycline-transport integral membrane protein (see citation below), member of major facilitator superfamily (MFS), similar to others e.g. AC22_STRCO|P46105 probable actinorhodin transporter from Streptomyces coelicolor (578 aa), FASTA scores: opt: 442, E(): 4.9e-21, (28.5% identity in 466 aa overlap); etc. Contains PS00216 Sugar transport proteins signature 1. Could be termed P55. Note that the Rv1410c-Rv1411c operon seems transcribed from two promoters in M. bovis BCG (see second citation). Protein product from Mb1445c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1445c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U042" /db_xref="InterPro:IPR001411" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/Swiss-Prot:Q7U042" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00048.1" /translation="MRAGRRVAISAGSLAVLLGALDTYVVVTIMRDIMNSVGIPINQL HRITWIVTMYLLGYIAAMPLLGRASDRFGRKLMLQVSLAGFIIGSVVTALAGHFGDFH MLIAGRTIQGVASGALLPITLALGADLWSQRNRAGVLGGIGAAQELGSVLGPLYGIFI VWLLHDWRDVFWINVPLTAIAMVMIHFSLPSHDRSTEPERVDLVGGLLLALALGLAVI GLYNPNPDGKHVLPDYGAPLLVGALVAAVAFFGWERFARTRLIDPAGVHFRPFLSALG ASVAAGAALMVTLVDVELFGQGVLQMDQAQAAGMLLWFLIALPIGAVTGGWIATRAGD RAVAFAGLLIAAYGYWLISHWPVDLLADRHNILGLFTVPAMHTDLVVAGLGLGLVIGP LSSATLRVVPSAQHGIASAAVVVARMTGMLIGVAALSAWGLYRFNQILAGLSAAIPPN ASLLERAAAIGARYQQAFALMYGEIFTITAIVCVFGAVLGLLISGRKEHADEPEVQEQ PTLAPQVEPL" CDS complement(1588488..1589198) /codon_start=1 /transl_table=11 /gene="lprG" /locus_tag="BQ2027_MB1446C" /standard_name="P27" /product="conserved lipoprotein lprg" /note="Mb1446c, lprG, len: 236 aa. Equivalent to Rv1411c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 236 aa overlap). Probable lprG (alternate gene name: P27), conserved lipoprotein, similar to Mycobacterium tuberculosis hypothetical lipoproteins e.g. Rv1270c|MTCY50.12 (35.1% identity in 245 aa overlap); Rv1368, Rv2945c. Contains N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Note that the Rv1410c-Rv1411c operon seems transcribed from two promoters in M. bovis BCG (see second citation). Protein product from Mb1446c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1446c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5I9" /db_xref="InterPro:IPR009830" /db_xref="InterPro:IPR029046" /db_xref="UniProtKB/Swiss-Prot:P0A5I9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00049.1" /translation="MRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPL VEEATAQTKALKSAHMVLTVNGKIPGLSLKTLSGDLTTNPTAATGNVKLTLGGSDIDA DFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLANFADAKAEGRDTI NGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQETGDHQLAQAQLDRGSGNSVQ MTLSKWGEKVQVTKPPVS" CDS 1589283..1589888 /codon_start=1 /transl_table=11 /gene="ribC" /locus_tag="BQ2027_MB1447" /product="PROBABLE RIBOFLAVIN SYNTHASE ALPHA CHAIN RIBC (RIBE)" /note="Mb1447, ribC, len: 201 aa. Equivalent to Rv1412, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 201 aa overlap). Probable ribC (ribE), Riboflavin synthase alpha chain (EC 2.5.1.9), strong similarity to others e.g. RISA_ACTPL|P50854 (215 aa), FASTA scores: opt: 586, E(): 1.8e-33, (50.8% identity in 197 aa overlap). Contains 2 x PS00693 Riboflavin synthase alpha chain family signature. Protein product from Mb1447 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1447 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65328" /db_xref="InterPro:IPR001783" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR023366" /db_xref="InterPro:IPR026017" /db_xref="UniProtKB/Swiss-Prot:P65328" /protein_id="SIU00050.1" /translation="MFTGIVEERGEVTGREALVDAARLTIRGPMVTADAGHGDSIAVN GVCLTVVDVLPDGQFTADVMAETLNRSNLGELRPGSRVNLERAAALGSRLGGHIVQGH VDATGEIVARCPSEHWEVVRIEMPASVARYVVEKGSITVDGISLTVSGLGAEQRDWFE VSLIPTTRELTTLGSAAVGTRVNLEVDVVAKYVERLMRSAG" CDS 1590102..1590617 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1448" /product="METAL-ACTIVATED PYRIDOXAL ENZYME" /note="Mb1448, -, len: 171 aa. Equivalent to Rv1413, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 171 aa overlap). Conserved hypothetical protein, similar to part of AB010956|AB010956_1 metal-activated pyridoxal enzyme from Arthrobacter sp. (379 aa), FASTA scores: opt: 187, E(): 0.00026, (29.0% identity in 162 aa overlap). Mb1448 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001608" /db_xref="InterPro:IPR029066" /db_xref="UniProtKB/Swiss-Prot:P64844" /protein_id="SIU00051.1" /translation="MATIGEVEVFVDHGADDVFITYPLWIGTRQADRLRQLADRARIA VGAGTAEGASNTGARLADAAGAIDVLIEIDSGHHRSGVRAEQVLEVAHAVGEAGLHLV GVFTFPGHSYAPGKPGEAGEQERRALNDAANALVAVGFPISCRSGGSTPTALLTAADG ASETSRRLCAR" CDS 1590607..1591008 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1449" /product="Predicted amino acid aldolase or racemase" /note="Mb1449, -, len: 133 aa. Equivalent to Rv1414, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 133 aa overlap). Conserved hypothetical protein, similar to C-terminal part of AB010956|AB010956_1 novel metal-activated pyridoxal enzyme from Arthrobacter sp. (379 aa), FASTA scores: opt: 163, E(): 0.00063, (32.1% identity in 112 aa overlap). Rv1413 is similar to N-terminal part of same enzyme suggesting possible frameshift. Sequence has been checked and no errors found, it is identical in Mycobacterium tuberculosis CDC1551. Mb1449 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR026956" /db_xref="InterPro:IPR042208" /db_xref="UniProtKB/Swiss-Prot:P64846" /protein_id="SIU00052.1" /translation="MLGDAQQLELGRCAPADIALTVAATVVSRQDCRSGLRRIVLDCG SKILGSDRPAWATGFGRLIDHADARIAALSEHHATVVWPDDAPLPPVGTRLRVIPNHV CLTTNLVDDVAVVRDATLIDRWKVAARGKNH" CDS 1591113..1592390 /codon_start=1 /transl_table=11 /gene="ribA2" /locus_tag="BQ2027_MB1450" /product="PROBABLE RIBOFLAVIN BIOSYNTHESIS PROTEIN RIBA2 : GTP cyclohydrolase II + 3,4-dihydroxy-2-butanone 4-phosphate synthase (DHBP synthase)" /note="Mb1450, ribA2, len: 425 aa. Equivalent to Rv1415, len: 425 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 425 aa overlap). Probable ribA2, Riboflavin biosynthesis protein (EC 3.5.4.25), similar to many e.g. GCH2_BACSU|P17620 from Bacillus subtilis (398 aa), FASTA scores: opt: 1388, E(): 0, (55.4% identity in 399 aa overlap). Also similar to second Mycobacterium tuberculosis gtp cyclohydrolase Rv1940|ribA1 (353 aa). IN THE N-TERMINAL SECTION; BELONGS TO THE DHBP SYNTHASE FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE GTP CYCLOHYDROLASE II FAMILY. Protein product from Mb1450 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1450 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5V1" /db_xref="InterPro:IPR000422" /db_xref="InterPro:IPR000926" /db_xref="InterPro:IPR016299" /db_xref="InterPro:IPR017945" /db_xref="InterPro:IPR032677" /db_xref="InterPro:IPR036144" /db_xref="UniProtKB/Swiss-Prot:P0A5V1" /protein_id="SIU00053.1" /translation="MTRLDSVERAVADIAAGKAVIVIDDEDRENEGDLIFAAEKATPE MVAFMVRYTSGYLCVPLDGAICDRLGLLPMYAVNQDKHGTAYTVTVDARNGIGTGISA SDRATTMRLLADPTSVADDFTRPGHVVPLRAKDGGVLRRPGHTEAAVDLARMAGLQPA GAICEIVSQKDEGSMAHTDELRVFADEHGLALITIADLIEWRRKHEKHIERVAEARIP TRHGEFRAIGYTSIYEDVEHVALVRGEIAGPNADGDDVLVRVHSECLTGDVFGSRRCD CGPQLDAALAMVAREGRGVVLYMRGHEGRGIGLMHKLQAYQLQDAGADTVDANLKLGL PADARDYGIGAQILVDLGVRSMRLLTNNPAKRVGLDGYGLHIIERVPLPVRANAENIR YLMTKRDKLGHDLAGLDDFHESVHLPGEFGGAL" CDS 1592405..1592869 /codon_start=1 /transl_table=11 /gene="ribH" /locus_tag="BQ2027_MB1451" /product="PROBABLE RIBOFLAVIN SYNTHASE BETA CHAIN RIBH (6,7-dimethyl-8-ribityllumazine synthase) (DMRL synthase) (Lumazine synthase)" /note="Mb1451, ribH, len: 154 aa. Equivalent to Rv1416, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 154 aa overlap). Probable ribH, riboflavin synthase beta chain (EC 2.5.1.9), similar to many e.g. RISB_ECOLI|P25540 Escherichia coli (156 aa), FASTA scores: opt: 330, E(): 1.8e-15, (44.1% identity in 145 aa overlap). Note alternative GTG start possible overlapping the stop codon of Rv1415|MTCY21B4.33. BELONGS TO THE DMRL SYNTHASE FAMILY. Protein product from Mb1451 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1451 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66035" /db_xref="InterPro:IPR002180" /db_xref="InterPro:IPR034964" /db_xref="InterPro:IPR036467" /db_xref="UniProtKB/Swiss-Prot:P66035" /protein_id="SIU00054.1" /translation="MPDLPSLDASGVRLAIVASSWHGKICDALLDGARKVAAGCGLDD PTVVRVLGAIEIPVVAQELARNHDAVVALGVVIRGQTPHFDYVCDAVTQGLTRVSLDS STPIANGVLTTNTEEQALDRAGLPTSAEDKGAQATVAALATALTLRELRAHS" CDS 1592866..1593330 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1452" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb1452, -, len: 154 aa. Equivalent to Rv1417, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 154 aa overlap). Possible conserved membrane protein, similar to others e.g. AL133213|SC6D7_2 Streptomyces coelicolor (156 aa), FASTA scores: opt: 212, E(): 4.4e-07, (32.4% identity in 136 aa overlap). Protein product from Mb1452 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1452 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64848" /db_xref="InterPro:IPR019692" /db_xref="UniProtKB/Swiss-Prot:P64848" /protein_id="SIU00055.1" /translation="MTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSS GVVFQTADQVAMGALGLVLAGAVLLFARPRLRVGSAGLSVRNLLGDRIVGWSEVIGVS FPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAAMDTVRSLLARYRPDLCAR" CDS 1593355..1594041 /codon_start=1 /transl_table=11 /gene="lprH" /locus_tag="BQ2027_MB1453" /product="PROBABLE LIPOPROTEIN LPRH" /note="Mb1453, lprH, len: 228 aa. Equivalent to Rv1418, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 228 aa overlap). Probable lprH, lipoprotein. Contains N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Mb1453 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65317" /db_xref="UniProtKB/Swiss-Prot:P65317" /protein_id="SIU00056.1" /translation="MACLGRPGCRGWAGASLVLVVVLALAACTESVAGRAMRATDRSS GLPTSAKPARARDLLLQDGDRAPFGQVTQSRVGDSYFTSAVPPECSAALLFKGSPLRP DGSSDHAEAAYNVTGPLPYAESVDVYTNVLNVHDVVWNGFRDVSHCRGDAVGVSRAGR STPMRLRYFATLSDGVLVWTMSNPRWTCDYGLAVVPHAVLVLSACGFKPGFPMAEWAS KRRAQLDSQV" CDS 1594221..1594694 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1454" /product="unknown protein" /note="Mb1454, -, len: 157 aa. Equivalent to Rv1419, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 157 aa overlap). Hypothetical unknown protein. Protein product from Mb1454 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1454 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64850" /db_xref="InterPro:IPR000772" /db_xref="InterPro:IPR035992" /db_xref="UniProtKB/Swiss-Prot:P64850" /protein_id="SIU00057.1" /translation="MGELRLVGGVLRVLVVVGAVFDVAVLNAGAASADGPVQLKSRLG DVCLDAPSGSWFSPLVINPCNGTDFQRWNLTDDRQVESVAFPGECVNIGNALWARLQP CVNWISQHWTVQPDGLVKSDLDACLTVLGGPDPGTWVSTRWCDPNAPDQQWDSVP" CDS 1594758..1596698 /codon_start=1 /transl_table=11 /gene="uvrC" /locus_tag="BQ2027_MB1455" /product="probable excinuclease abc (subunit c-nuclease) uvrc" /note="Mb1455, uvrC, len: 646 aa. Equivalent to Rv1420, len: 646 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 646 aa overlap). Probable uvrC, excinuclease ABC subunit C, similar to many e.g. UVRC_PSEFL|P32966 Pseudomonas fluorescens (607 aa), fasta scores: opt: 738, E(): 8.4e-39, (36.6% identity in 629 aa overlap). BELONGS TO THE UVRC FAMILY. Protein product from Mb1455 detected using SWATH mass spectrometry. Mb1455 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67427" /db_xref="InterPro:IPR000305" /db_xref="InterPro:IPR001162" /db_xref="InterPro:IPR001943" /db_xref="InterPro:IPR003583" /db_xref="InterPro:IPR004791" /db_xref="InterPro:IPR010994" /db_xref="InterPro:IPR035901" /db_xref="InterPro:IPR036876" /db_xref="InterPro:IPR038476" /db_xref="InterPro:IPR041663" /db_xref="UniProtKB/Swiss-Prot:P67427" /protein_id="SIU00058.1" /translation="MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLT SYFADVASLAPRTRQLVTTAAKVEWTVVGTEVEALQLEYTWIKEFDPRFNVRYRDDKS YPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRVFPARTCSAGV FKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFCDFLSGKTDRFARALEQQM NAAAEQLDFERAARLRDDLSALKRAMEKQAVVLGDGTDADVVAFADDELEAAVQVFHV RGGRVRGQRGWIVEKPGEPGDSGIQLVEQFLTQFYGDQAALDDAADESANPVPREVLV PCLPSNAEELASWLSGLRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDF NARSAALQSIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGI REAAGQGRSDDVACIAEVTRRRFLRHLRDQSDPDLLSPERKSRRFAYPPNLYVVDGGA PQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMPRNSEGLYLLQRVRDEA HRFAITYHRSKRSTRMTASALDSVPGLGEHRRKALVTHFGSIARLKEATVDEITAVPG IGVATATAVHDALRPDSSGAAR" CDS 1596695..1597600 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1456" /product="RNase adapter protein RapZ" /note="Mb1456, -, len: 301 aa. Equivalent to Rv1421, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 301 aa overlap). Conserved hypothetical protein, similar to many hypothetical proteins e.g. YHBJ_ECOLI|P33995 hypothetical 32.5 kd protein from Escherichia coli (284 aa), FASTA scores: opt: 648, E(): 6.3e-36, (38.7% identity in 282aa overlap). Protein product from Mb1456 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1456 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67107" /db_xref="InterPro:IPR005337" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P67107" /protein_id="SIU00059.1" /translation="MMNHARGVENRSEGGGIDVVLVTGLSGAGRGTAAKVLEDLGWYV ADNLPPQLITRMVDFGLAAGSRITQLAVVMDVRSRGFTGDLDSVRNELATRAITPRVV FMEASDDTLVRRYEQNRRSHPLQGEQTLAEGIAAERRMLAPVRATADLIIDTSTLSVG GLRDSIERAFGGDGGATTSVTVESFGFKYGLPMDADMVMDVRFLPNPHWVDELRPLTG QHPAVRDYVLHRPGAAEFLESYHRLLSLVVDGYRREGKRYMTIAIGCTGGKHRSVAIA EALMGLLRSDQQLSVRALHRDLGRE" CDS 1597597..1598625 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1457" /product="FIG002813: LPPG:FO 2-phospho-L-lactate transferase like, CofD-like" /note="Mb1457, -, len: 342 aa. Equivalent to Rv1422, len: 342 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 342 aa overlap). Conserved hypothetical protein, similar to many hypothetical proteins e.g. YAMB_THETU|P38541 Thermoanaerobacterium thermosulfurigenes (323 aa), FASTA scores: opt: 519, E(): 1.6e-25, (33.1% identity in 320 aa overlap); and AF106003|AF106003_3 Streptomyces coelicolor (363 aa), FASTA scores: opt: 1047, E(): 0, (54.5% identity in 308 aa overlap). Protein product from Mb1457 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1457 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYB6" /db_xref="InterPro:IPR002882" /db_xref="InterPro:IPR010119" /db_xref="InterPro:IPR038136" /db_xref="UniProtKB/TrEMBL:A0A1R3XYB6" /protein_id="SIU00060.1" /translation="MTDGIVALGGGHGLYATLSAARRLTPYVTAVVTVADDGGSSGRL RSELDVVPPGDLRMALAALASDSPHGRLWATILQHRFGGSGVLAGHPIGNLMLAGLSE VLADPVAALDELGRILGVKGRVLPMCPVALQIEADVSGLEADPRMFRLIRGQVAIATT PGKVRRVRLLPTDPPATRQAVDAIMAADLVVLGPGSWFTSVIPHVLVPGLAAALRATS ARRALVLNLVAEPGETAGFSVERHLHVLAQHAPGFTVHDIIIDAERVPSEREREQLRR TATMLQAEVHFADVARPGTPLHDPGKLAAVLDGVCARDVGASEPPVAATQEIPIDGGR PRGDDAWR" CDS 1598622..1599599 /codon_start=1 /transl_table=11 /gene="whiA" /locus_tag="BQ2027_MB1458" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIA" /note="Mb1458, whiA, len: 325 aa. Equivalent to Rv1423, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 325 aa overlap). Putative whiA, transcriptional regulator, probably equivalent to AL035591|SCC54.10 whiA protein from Streptomyces coelicolor (328 aa), FASTA scores: opt: 1505, E(): 0, (70.4% identity in 324 aa overlap). Also some similarity to O06975|YVCL hypothetical protein from Bacillus subtilis (316 aa), FASTA scores: E(): 1.8e-0 8, (25.7% identity in 304 aa overlap). Protein product from Mb1458 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1458 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U040" /db_xref="InterPro:IPR003802" /db_xref="InterPro:IPR018478" /db_xref="InterPro:IPR023054" /db_xref="InterPro:IPR027434" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039518" /db_xref="UniProtKB/Swiss-Prot:Q7U040" /protein_id="SIU00061.1" /translation="MTTDVKDELSRLVVKSVSARRAEVTSLLRFAGGLHIVGGRVVVE AELDLGSIARRLRKEIFELYGYTAVVHVLSASGIRKSTRYVLRVANDGEALARQTGLL DMRGRPVRGLPAQVVGGSIDDAEAAWRGAFLAHGSLTEPGRSSALEVSCPGPEAALAL VGAARRLGVGAKAREVRGADRVVVRDGEAIGALLTRMGAQDTRLVWEERRLRREVRAT ANRLANFDDANLRRSARAAVAAAARVERALEILGDTVPEHLASAGKLRVEHRQASLEE LGRLADPPMTKDAVAGRIRRLLSMADRKAKVDGIPDTESVVTPDLLEDA" CDS complement(1599609..1600370) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1459C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1459c, -, len: 253 aa. Equivalent to Rv1424c, len: 253 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 253 aa overlap). Possible membrane protein, contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Protein product from Mb1459c detected using SWATH mass spectrometry. Mb1459c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64852" /protein_id="SIU00062.1" /translation="MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPP AEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDS KLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYC YPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGA GGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK" CDS 1600374..1601753 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1460" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb1460, -, len: 459 aa. Equivalent to Rv1425, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 459 aa overlap). Conserved hypothetical protein, similar to many Mycobacterium tuberculosis hypothetical proteins e.g. Rv3740c, Rv3734c, Rv1760, etc. Protein product from Mb1460 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1460 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0E4" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E4" /protein_id="SIU00063.1" /translation="MKRLSSVDAAFWSAETAGWHMHVGALAICDPSDAPEYSFQRLRE LIIERLPEIPQLRWRVTGAPLGLDRPWFVEDEELDIDFHIRRIGVPAPGGRRELEELV GRLMSYKLDRSRPLWELWVIEGVEGGRIATLTKMHHAIVDGVSGAGLGEILLDITPEP RPPQQETVGFVGFQIPGLERRAIGALINVGIMTPFRIVRLLEQTVRQQIAALGVAGKP ARYFEAPKTRFNAPVSPHRRVTGTRVELARAKAVKDAFGVKLNDVVLALVAGAARQYL QKRDELPAKPLIAQIPVSTRSEETKADVGNQVSSMTASLATHIEDPAKRLAAIHESTL SAKEMAKALSAHQIMGLTETTPPGLLQLAARAYTASGLSHNLAPINLVVSNVPGPPFP LYMAGARLDSLVPLGPPVMDVALNITCFSYQDYLDFGLVTTPEVANDIDEMADAIEPA LAELERAAE" CDS complement(1601775..1603037) /codon_start=1 /transl_table=11 /gene="lipO" /locus_tag="BQ2027_MB1461C" /product="PROBABLE ESTERASE LIPO" /note="Mb1461c, lipO, len: 420 aa. Equivalent to Rv1426c, len: 420 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 420 aa overlap). Possible Lipo, esterase (EC 3.1.-.-), similar to several Mycobacterium tuberculosis hypothetical lipases and esterases e.g. Rv1399c, Rv2284, etc. Also similar in central region to AAAD_HUMAN|P22760 human arylacetamide deacetylase (398 aa), FASTA scores: opt:210, E(): 7.6e-07, (29.3% identity in 191 aa overlap). Protein product from Mb1461c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1461c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZA8" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA8" /protein_id="SIU00064.1" /translation="MRFRRMARPRPLTRAAVELLNAANGLRPLSGSGYSTVLAFWLGW PTSEVPGVYLGASVLDALRRGRRGDFGGLKGKAALALTAAAWVILAVIRYRGATTPGP VLEAGLTEQLGPDYAKELATLPTEPMRSRGRNLPLRTAMARRRYVETTNVVCYGPYGR ANLADIWRRRDLPRDAKAPVLVQVPGGAWVLGWRRPQAYPLMSHLAARGWVCVSLNYR VSPRHTWPDHIVDVKRALAWVKENIAAYGGDPNFVAISGGSAGGHLCALAALTPNDPR FQPGFEQVDTSVAAAVPVYGRYDWFTTDAPGRREFVGLLETFVVKRKFSTHRDIFVDA SPIHHVRADAPPFFVLHGRHDSLIPVAEAHAFVEELRAVSKSPVAYADLPHAQHAFDV FGSPRAHHTAEAVARFLSWVYATNPPAT" CDS complement(1603037..1604644) /codon_start=1 /transl_table=11 /gene="fadD12" /locus_tag="BQ2027_MB1462C" /product="POSSIBLE LONG-CHAIN-FATTY-ACID--COA LIGASE FADD12 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb1462c, fadD12, len: 535 aa. Equivalent to Rv1427c, len: 535 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 535 aa overlap). Possible fadD12, long-chain-fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. NP_302632.1|NC_002677 acyl-CoA synthase from Mycobacterium leprae (548 aa); AAD01929.2|AF031419 putative long-chain-fatty-acid--CoA ligase from Pseudomonas putida (565 aa); NP_419782.1|NC_002696 putative long-chain-fatty-acid--CoA ligase from Caulobacter crescentus (530 aa); PC60_YEAST|P38137 yeast peroxisomal-coenzyme A synthetase (543 aa), FASTA scores: opt: 507, E(): 2.9e-25, (30.4% identity in 365 aa overlap). Also similar to many M. tuberculosis proteins e.g. MTCY06A4.14 (44.8% identity in 525 aa overlap). Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1462c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1462c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYB5" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XYB5" /protein_id="SIU00065.1" /translation="MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAG FAGAARRCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHRGF VDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATVDRALAEKPQA TRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLTSGTTGTPKGARHSGGGIG TLKAILDRTPWRAEEVTVIVAPMFHAWGFSQLVLASSLACTIVTRRRFDPEATLDLID RHHATGLVVVPVMFDRIMDLPAEIRNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDV IYNNYNATEAGMIATATPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVR NDSQFDGYTSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDNLANYKVPR DIAVLDELPRGITGKILRTELQSRVGS" CDS complement(1604648..1605475) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1463C" /product="Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like" /note="Mb1463c, -, len: 275 aa. Equivalent to Rv1428c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 275 aa overlap). Conserved hypothetical protein, some similarity to hypothetical proteins from M. tuberculosis e.g. Rv0502|YV29_MYCTU|Q11167 (358 aa), FASTA scores: opt: 355, E(): 5e-16, (32.6% identity in 273 aa overlap); and Rv1920. Protein product from Mb1463c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1463c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYQ3" /db_xref="InterPro:IPR002123" /db_xref="InterPro:IPR016676" /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ3" /protein_id="SIU00066.1" /translation="MSETDSPGNGDDAGIGDIGKFDPGLTQRLISVLRPVLKTYHRSQ VHGLDSFPPGGALVVANHSGGMFPMDVPVFSVDFYDKFGYDRPVYTLSHDILFMGLTG DLFRRTGYIRATRENAAKALRSGGVVVVFPGGDYDAYRPTFAENVIDFNGRKGYVSTA VEAGVPIVPAVSIGGQESQLYLSRGTWLARRLGLKRLLRSDILPISFGFPFGFSAAIP PNLPLPAKIVMQVLDPINLTKQFGEDPDVDAVDEHVRSVMQQALNDLAAKRRFPILG" CDS 1605594..1606862 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1464" /product="Possible regulatory protein Trx" /note="Mb1464, -, len: 422 aa. Equivalent to Rv1429, len: 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 422 aa overlap). Conserved hypothetical protein, some similarity to transcriptional regulator proteins e.g. CDAR_ECOLI|P37047 Carbohydrate diacid regulator from Escherichia coli (391 aa), FASTA scores: opt: 210, E(): 3e-06, (27.7% identity in 296 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv2370c, Rv1194c, Rv1453, Rv2242, and Rv1186c. Protein product from Mb1464 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1464 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR041522" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/TrEMBL:A0A1R3XYC5" /protein_id="SIU00067.1" /translation="MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARM ADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLG HARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRS GLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRC LLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFESAGIRARLAC GRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDDVAPVALLADDLEELRRFV TDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQ NLDDPDAAFRVQMALEVCRWMAPAVLRAKQ" CDS 1607102..1608688 /codon_start=1 /transl_table=11 /gene="PE16" /locus_tag="BQ2027_MB1465" /product="pe family protein pe16" /note="Mb1465, PE16, len: 528 aa. Equivalent to Rv1430, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 528 aa overlap). Member of the M. tuberculosis PE family of proteins e.g. Y0D4_MYCTU|Q50594 (55.9% identity in 127 aa overlap). The C-terminus shows similarity to Q49633|LEPB1170_F3_112 hypothetical Mycobacterium leprae protein (391 aa), FASTA scores: opt: 342, E(): 1.2e-13, (29.8% identity in 292 aa overlap). Possible TMhelix aa 500-522. Mb1465 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYC9" /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR013228" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XYC9" /protein_id="SIU00068.1" /translation="MSFVFAVPEMVAATASDLASLGAALSEATAAAAIPTTQVLAAAA DEVSAAIAELFGAHGQEFQALSAQASAFHDRFVRALSAAAGWYVDAEAANAALVDTAA TGASELGSGGRTALILGSTGTPRPPFDYMQQVYDRYIAPHYLGYAFSGLYTPAQFQPW TGIPSLTYDQSVAEGAGYLHTAIMQQVAAGNDVVVLGFSQGASVATLEMRHLASLPAG VAPSPDQLSFVLLGNPNNPNGGILARFPGLYLQSLGLTFNGATPDTDYATTIYTTQYD GFADFPKYPLNILADVNALLGIYYSHSLYYGLTPEQVASGIVLPVSSPDTNTTYILLP NEDLPLLQPLRGIVPEPLLDLIEPDLRAIIELGYDRTGYADVPTPAALFPVHIDPIAV PPQIGAAIGGPLTALDGLLDTVINDQLNPVVTSGIYQAGAELSVAAAGYGAPAGVTNA IFIGQQVLPILVEGPGALVTADTHYLVDAIQDLAAGDLSGFNQNLQLIPATNIALLVF AAGIPAVAAVAILTGQDFPV" CDS 1608799..1610568 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1466" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1466, -, len: 589 aa. Equivalent to Rv1431, len: 589 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 589 aa overlap). Conserved membrane protein, shows strong similarity to another M. tuberculosis hypothetical protein Rv1132|MTCY22G8.21 (48.2% identity in 585 aa overlap). Protein product from Mb1466 detected using SWATH mass spectrometry. Mb1466 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYB1" /db_xref="InterPro:IPR021941" /db_xref="UniProtKB/TrEMBL:A0A1R3XYB1" /protein_id="SIU00069.1" /translation="MGFLKPDLPDVDHDTWLTQPRRTRLQVVTRDWVEHGFGTPYAVY LLYLTKIAVYVAAGAAIISLTPGLGGLSRIGDWWTQPIVYQKVIVFTLLFEVLGFGCG SGPLTGRFWPPIGGFLYWLRPNTIRLPAWPDKVPFTQGDTRTVVDVALYAIVLIGGVW ALLSPGSPGPGGTPVTAAGDVGLINPVLVVPTIVALGVLGLRDKTIFLAARGEHYWLK LFVFFFPFTDQIAAFKIIMLCLWWGAATSKLNHHFPYVVAVMTSNNALLRSRVFNPIK HLLYRDHANDLRPSWLPKLMAHGGGTTAEFLVPGILVLVADGHPWRWFLIGFMVLFHL NILSNLPMGVPLEWNVFFIFSLCYLFGHYGAITATDLRSPLLLAIVIAVVAVVIMGNL LPEKISFLPAMRYYAGNWATSIWCFRGDAEATMETSVVKSSALVVNQLAKLYDGATAE IMTDQVAAFRAMHTHGRALNGLLPRALDDEAHYRIREGEIVAGPLVGWNFGEGHLHNE QLVAAVQRRCNFADGDLRVIILEGQPIHVQKQWYRIVDAKTGLFEAGYVTVEDMLSRQ PWPEPGDEFPVHVTTQRGTPSKP" CDS 1610565..1611986 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1467" /product="PROBABLE DEHYDROGENASE" /note="Mb1467, -, len: 473 aa. Equivalent to Rv1432, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 473 aa overlap). Probable dehydrogenase (EC 1.-.-.-), shows strong simlarity to P49_STRLI|P06108 p49 protein from Streptomyces lividans (469 aa), FASTA scores: opt: 1362, E(): 0, (44.9% identity in 474 aa overlap); and weak simlarity to other dehydrogenases. Protein product from Mb1467 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1467 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XYC4" /protein_id="SIU00070.1" /translation="MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGE LTVPGVIHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLYRS IEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLARFGPRAALPA TAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMILASGHRHGWPVARGGSGSI TKALAAALDAYGGTVATGVTVTSRRDIPDADIVMLDLSPAAVLGIYGDVMPTRINRSY RRYRAGSSAFKVDFAIEGDVGWTNPDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRP FVLVGQQYLADPSRSVGNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVAT VSTSTTELQTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG AGIHGLCGYHAAESALRWLRKRR" CDS 1612150..1612965 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1468" /product="POSSIBLE CONSERVED EXPORTED PROTEIN" /note="Mb1468, -, len: 271 aa. Equivalent to Rv1433, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 271 aa overlap). Possible exported protein with N-terminal signal sequence, highly similar to Q49706 hypothetical protein from Mycobacterium leprae (271 aa), FASTA scores: opt: 1341, E(): 0, (68.3% identity in 271 aa overlap). Also shows similarity to Mycobacterium tuberculosis lipoprotein Rv2518c|MTV009.03c lppS (408 aa) (40.0% identity in 230 aa overlap); and others e.g. Rv0116c, Rv0192, Rv2518c, Rv0483. Mb1468 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYB7" /db_xref="InterPro:IPR005490" /db_xref="InterPro:IPR038063" /db_xref="InterPro:IPR041280" /db_xref="UniProtKB/TrEMBL:A0A1R3XYB7" /protein_id="SIU00071.1" /translation="MRAVFGCAIAVVGIAGSVVAGPADIHLVAAKQSYGFAVASVLPT RGQVVGVAHPVVVTFSAPITNPANRHAAERAVEVKSTPAMTGKFEWLDNDVVQWVPDR FWPAHSTVELSVGSLSSDFKTGPAVVGVASISQHTFTVSIDGVEEGPPPPLPAPHHRV HFGEDGVMPASMGRPEYPTPVGSYTVLSKERSVIMDSSSVGIPVDDPDGYRLSVDYAV RITSRGLYVHSAPWALPALGLENVSHGCISLSREDAEWYYNAVDIGDPVIVQE" CDS 1612972..1613109 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1469" /product="HYPOTHETICAL PROTEIN" /note="Mb1469, -, len: 45 aa. Equivalent to Rv1434, len: 45 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 45 aa overlap). Hypothetical unknown protein. Mb1469 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYB2" /protein_id="SIU00072.1" /translation="MRASPAERVDGAYAGAGPHTQSVLEEDQRQRAPAGAEAEGPGRT G" CDS complement(1613058..1613687) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1470C" /product="Probable conserved Proline, Glycine, Valine-rich secreted protein" /note="Mb1470c, -, len: 209 aa. Highly similar to Rv1435c, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (96.2% identity in 209 aa overlap). Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa overlap). Shows some similarity to N-terminus of CPN_DROME|Q02910 calphotin. drosophila melanogaster (865 aa), FASTA scores: opt: 266, E(): 2.5e-05, (37.2% identity in 191 aa overlap). Contains at least five 7 aa imperfect repeats. Also shows similarity to other Mycobacterium tuberculosis proteins e.g. MTCI237.20c (34.7% identity in 193 aa overlap), MTCI65.25c (36.9% identity in 160 aa overlap) and MTCI65.24c (34.2% identity in 196 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame insertion of 21 bp leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (209 aa versus 202 aa). Mb1470c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0G3" /protein_id="SIU00073.1" /translation="MTLMAIVNRFNIKVIAGAGLFAAAIALSPDAAADPLMTGGYACI QGMAGDAPVAAGDPVAAGGPAATGACSAALTDMAGVPFVAPGPVPAAAPVPIGAPVPI PGAPVPIPGAPVPIPGAPVPIPGGPVPIPGAPVPVPAVPAPVIPVGTPLIALGPVLAG APGDGVVSAPIIGMSGVKDALTDPAPAGGPVPGQPVLPGPSASAPAGAR" CDS 1614044..1615063 /codon_start=1 /transl_table=11 /gene="gap" /locus_tag="BQ2027_MB1471" /product="PROBABLE GLYCERALDEHYDE 3-PHOSPHATE DEHYDROGENASE GAP (GAPDH)" /note="Mb1471, gap, len: 339 aa. Equivalent to Rv1436, len: 339 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 339 aa overlap). Probable gap, Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12), highly similar to many e.g. G3P_MYCLE|P46713 Mycobacterium leprae (339 aa), FASTA scores: opt: 1933, E():0, (89.1% identity in 339 aa overlap). Contains PS00071 Glyceraldehyde 3-phosphate dehydrogenase active site. BELONGS TO THE GLYCERALDEHYDE 3-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb1471 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1471 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64179" /db_xref="InterPro:IPR006424" /db_xref="InterPro:IPR020828" /db_xref="InterPro:IPR020829" /db_xref="InterPro:IPR020830" /db_xref="InterPro:IPR020831" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P64179" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00074.1" /translation="MTVRVGINGFGRIGRNFYRALLAQQEQGTADVEVVAANDITDNS TLAHLLKFDSILGRLPCDVGLEGDDTIVVGRAKIKALAVREGPAALPWGDLGVDVVVE STGLFTNAAKAKGHLDAGAKKVIISAPATDEDITIVLGVNDDKYDGSQNIISNASCTT NCLAPLAKVLDDEFGIVKGLMTTIHAYTQDQNLQDGPHKDLRRARAAALNIVPTSTGA AKAIGLVMPQLKGKLDGYALRVPIPTGSVTDLTVDLSTRASVDEINAAFKAAAEGRLK GILKYYDAPIVSSDIVTDPHSSIFDSGLTKVIDDQAKVVSWYDNEWGYSNRLVDLVTL VGKSL" CDS 1615066..1616304 /codon_start=1 /transl_table=11 /gene="pgk" /locus_tag="BQ2027_MB1472" /product="PROBABLE PHOSPHOGLYCERATE KINASE PGK" /note="Mb1472, pgk, len: 412 aa. Equivalent to Rv1437, len: 412 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 412 aa overlap). Probable pgk, Phosphoglycerate kinase (EC 2.7.2.3), highly similar to many e.g. PGK_MYCLE|P46712 Mycobacterium leprae (416 aa), FASTA scores: opt: 2153, E(): 0, (80.4% identity in 414 aa overlap). Contains PS00111 Phosphoglycerate kinase signature. BELONGS TO THE PHOSPHOGLYCERATE KINASE FAMILY. Protein product from Mb1472 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1472 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65701" /db_xref="InterPro:IPR001576" /db_xref="InterPro:IPR015824" /db_xref="InterPro:IPR015911" /db_xref="InterPro:IPR036043" /db_xref="UniProtKB/Swiss-Prot:P65701" /protein_id="SIU00075.1" /translation="MSVANLKDLLAEGVSGRGVLVRSDLNVPLDEDGTITDAGRIIAS APTLKALLDADAKVVVAAHLGRPKDGPDPTLSLAPVAVALGEQLGRHVQLAGDVVGAD ALARAEGLTGGDILLLENIRFDKRETSKNDDDRRALAKQLVELVGTGGVFVSDGFGVV HRKQASVYDIATLLPHYAGTLVADEMRVLEQLTSSTQRPYAVVLGGSKVSDKLGVIES LATKADSIVIGGGMCFTFLAAQGFSVGTSLLEDDMIEVCRGLLETYHDVLRLPVDLVV TEKFAADSPPQTVDVGAVPNGLMGLDIGPGSIKRFSTLLSNAGTIFWNGPMGVFEFPA YAAGTRGVAEAIVAATGKGAFSVVGGGDSAAAVRAMNIPEGAFSHISTGGGASLEYLE GKTLPGIEVLSREQPTGGVL" CDS 1616301..1617086 /codon_start=1 /transl_table=11 /gene="tpi" /locus_tag="BQ2027_MB1473" /product="PROBABLE TRIOSEPHOSPHATE ISOMERASE TPI (TIM)" /note="Mb1473, tpi, len: 261 aa. Equivalent to Rv1438, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 261 aa overlap). Probable tpi (tpiA), Triosephosphate isomerase (EC 5.3.1.1), highly similar to many e.g. TPIS_MYCLE|P46711 Mycobacterium leprae (261 aa), FASTA scores: opt: 1456, E(): 0, (83.9% identity in 261 aa overlap). Contains PS00171 Triosephosphate isomerase active site. BELONGS TO THE TRIOSEPHOSPHATE ISOMERASE FAMILY. Protein product from Mb1473 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1473 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66941" /db_xref="InterPro:IPR000652" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR020861" /db_xref="InterPro:IPR022896" /db_xref="InterPro:IPR035990" /db_xref="UniProtKB/Swiss-Prot:P66941" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00076.1" /translation="MSRKPLIAGNWKMNLNHYEAIALVQKIAFSLPDKYYDRVDVAVI PPFTDLRSVQTLVDGDKLRLTYGAQDLSPHDSGAYTGDVSGAFLAKLGCSYVVVGHSE RRTYHNEDDALVAAKAATALKHGLTPIVCIGEHLDVREAGNHVAHNIEQLRGSLAGLL AEQIGSVVIAYEPVWAIGTGRVASAADAQEVCAAIRKELASLASPRIADTVRVLYGGS VNAKNVGDIVAQDDVDGGLVGGASLDGEHFATLAAIAAGGPLP" CDS complement(1617698..1618123) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1474C" /product="unknown protein" /note="Mb1474c, -, len: 141 aa. Equivalent to Rv1439c, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 141 aa overlap). Hypothetical unknown protein. Protein product from Mb1474c detected using SWATH mass spectrometry. Mb1474c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/TrEMBL:A0A1R3XYD4" /protein_id="SIU00077.1" /translation="MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRH GLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLL RGDKAVERVSYFDSLPLLIKVVKHPSAWRGWLTTMRSRA" CDS 1618574..1618807 /codon_start=1 /transl_table=11 /gene="secG" /locus_tag="BQ2027_MB1475" /product="probable protein-export membrane protein (translocase subunit) secg" /note="Mb1475, secG, len: 77 aa. Equivalent to Rv1440, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 77 aa overlap). Probable protein-export membrane protein secG, similar to many e.g. P38388|SECG_MYCLE PROBABLE PROTEIN-EXPORT MEMBRANE (77 aa), FASTA scores: opt: 450, E(): 6.7e-24, (96.1% identity in 77 aa overlap). Start changed since original submission (-40 aa). PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG AND SECY|Rv0732. Mb1475 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66792" /db_xref="InterPro:IPR004692" /db_xref="UniProtKB/Swiss-Prot:P66792" /protein_id="SIU00078.1" /translation="MELALQITLIVTSVLVVLLVLLHRAKGGGLSTLFGGGVQSSLSG STVVEKNLDRLTLFVTGIWLVSIIGVALLIKYR" CDS complement(1618946..1620421) /codon_start=1 /transl_table=11 /gene="PE_PGRS26" /locus_tag="BQ2027_MB1476C" /product="pe-pgrs family protein pe_pgrs26" /note="Mb1476c, PE_PGRS26, len: 491 aa. Equivalent to Rv1441c, len: 491 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 491 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to Y0DP_MYCTU|Q50615 hypothetical glycine-rich 40.8 kd protein (498 aa), fasta scores: opt: 1625, E(): 0, (55.2% identity in 518 aa overlap). Mb1476c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYC1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00079.1" /translation="MSNVMVVPGMLSAAAADVASIGAALSAANGAAAPTTAGVLAAGA DEVSAAIASLFSGYARDYQALSAQMARFHQQFVQALTASVGSYAAAEAANASPLQALE QQVLAAINAPTQTLLGRPLIGNGADGLPGQNGGAGGLLWGNGGNGGAGDAAHPNGGNG GDAGMFGNGGAGGAGYSPAAGTGAAGGAGGAGGAGGWLSGNGGAGGNGGTGASGADGG GGLPPVPASPGGNGGGGGAGGAAGMFGTGGAGGTGGDGGAGGAGDSPNSGANGARGGD GGNGAAGGAGGRLFGNGGAGGNGGTAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAG IGGSSAGAGGAGGDGGAGGNGGGSSMIGGKGGTGGNGGVGGTGGASALTIGNGSSAGA GGAGGAGGTGGTGGYIESLDGKGQAGNGGNGGNGAAGGAGGGGTGAGGNGGAGGNGGD GGPSQGGGNPGFGGDGGTGGPGGVGVPDGIGGANGAQGKHG" CDS 1620528..1622828 /codon_start=1 /transl_table=11 /gene="bisC" /locus_tag="BQ2027_MB1477" /product="PROBABLE BIOTIN SULFOXIDE REDUCTASE BISC (BDS reductase) (BSO reductase)" /note="Mb1477, bisC, len: 766 aa. Equivalent to Rv1442, len: 766 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 766 aa overlap). Probable bisC, Biotin sulfoxide reductase (EC 1.-.-.-), similar to BISC_ECOLI|P20099 biotin sulfoxide reductase from Escherichia coli (739 aa), FASTA scores: opt: 1271, E():0, (40.2% identity in 744 aa overlap). Protein product from Mb1477 detected using SWATH mass spectrometry. Mb1477 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYD3" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006657" /db_xref="InterPro:IPR006658" /db_xref="InterPro:IPR009010" /db_xref="InterPro:IPR041460" /db_xref="InterPro:IPR041954" /db_xref="UniProtKB/TrEMBL:A0A1R3XYD3" /protein_id="SIU00080.1" /translation="MQVYTSATHWGVFTARVHGGDIAAVAALASDTNPAPQLQNLPGA VRHRSRIANPAVRRGWLQHGPGPSSARGAEEFVEVSWDELIELLASELRRTVDRYGNE AIYGSSYGWASAGRFHHAQSQVHRFLNMLGGYTASRHSYSAGASEVIFPHIVGAALFE ALAETTTWDVIVDHTALLVAFGGLPVKNTAVMPGGTTAHPDRDYVGRYRARGGRLVSV SPLRDDIAAIAGPLDDRCRWLAPVPGTDVAIMLGLAYVLATESLADRAFLGRYCTGYE RFERYLLGLDDGIPKTPEWAAALSGLAAGDLRDLARRMAEHRTLITTSLSLQRIEHGE QTVWMAATLAAMLGQIGLPGGGFGHGYSSNGVGNPPLACGLPALPQGNNPVSTFIPVA AISELLQRPGQRLAYNGRLLELPDIKCVYWAGGNPFHHHQNLPRLRRALSRVDTIVVH EQYWTAMAKHADIVVPTTTSFERDDFAASKTNPTLIAMPAMVPPYANARDDYHTFSAL AHRLGFGKQFTEGRSAREWLEHMYDKWSAELDFPVPSFAEFWRTGRLELPTRTGLTWL ADFRADPAAHPLGTPSGRIEIFSDTVDAFALPDCAGHPTWYEPSEWLGGPRAARYPLH LIANQPRTRLHSQLDHGGASMASKIRGREPIRIHPDDAAARELTDGDIVRVFNDRGAC LAGVVIDDGLRPKVVQLSTGAWFDPADPRDPDSMCVHGNPNALSNDSGTSSLAHGSTG QHVLVQIERFTGELPPVRAHEPPRLA" CDS complement(1622944..1623429) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1478C" /product="unknown protein" /note="Mb1478c, -, len: 161 aa. Equivalent to Rv1443c, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 161 aa overlap). Hypothetical unknown protein. Protein product from Mb1478c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1478c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XYC2" /protein_id="SIU00081.1" /translation="MVGYAEPVLIERQSVVAAPAEQVWQRVVTPEGINDELRPWMTMS VPRGAKGMTVDTVPIGAPIGRAWLRLFGVLPFDYDRLSIAELEPGRRFREDSTMLSMR QWQHERTVTPEGDTKTIVRDRITFQTRAGLRFAAPLIAAGLRALFGHRHRRLQRHFAQ G" CDS complement(1624043..1624453) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1479C" /product="unknown protein" /note="Mb1479c, -, len: 136 aa. Equivalent to Rv1444c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 136 aa overlap). Hypothetical unknown protein. Protein product from Mb1479c detected using shotgun mass spectrometry. Mb1479c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYC3" /protein_id="SIU00082.1" /translation="MTVMADRSGRPAPVRRRMKTLTQAALNADKTVEQVEDVLDGLGK TMAELNSSLSQLNSTVERLEDGLDHLEGTLHSLDDLAKRLIVLVEPVEAIVDRIDYIV SLGETVMSPLSVTEHAVRGVLDRLRNRTVHEPTN" CDS complement(1624470..1625213) /codon_start=1 /transl_table=11 /gene="devB" /locus_tag="BQ2027_MB1480C" /product="PROBABLE 6-PHOSPHOGLUCONOLACTONASE DEVB (6PGL)" /note="Mb1480c, devB, len: 247 aa. Equivalent to Rv1445c, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 247 aa overlap). Possible devB (PGL), 6-phosphogluconolactonase (EC 3.1.1.31), belongs to a different family to the upstream gene zwf2. Similar to e.g. DEVB_ANASP|P46016 putative glucose-6-phosphate 1-dehydrogenase (239 aa), FASTA scores: opt: 439, E(): 2.6e-20, (34.0% identity in 247 aa overlap). BELONGS TO THE GLUCOSAMINE/GALACTOSAMINE-6-PHOSPHATE ISOMERASE FAMILY. 6-PHOSPHOGLUCONOLACTONASE SUBFAMILY. Protein product from Mb1480c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1480c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63339" /db_xref="InterPro:IPR005900" /db_xref="InterPro:IPR006148" /db_xref="InterPro:IPR037171" /db_xref="InterPro:IPR039104" /db_xref="UniProtKB/Swiss-Prot:P63339" /protein_id="SIU00083.1" /translation="MSSSIEIFPDSDILVAAAGKRLVGAIGAAVAARGQALIVLTGGG NGIALLRYLSAQAQQIEWSKVHLFWGDERYVPEDDDERNLKQARRALLNHVDIPSNQV HPMAASDGDFGGDLDAAALAYEQVLAASAAPGDPAPNFDVHLLGMGPEGHINSLFPHS PAVLESTRMVVAVDDSPKPPPRRITLTLPAIQRSREVWLLVSGPGKADAVAAAIGGAD PVSVPAAGAVGRQNTLWLLDRDAAAKLPS" CDS complement(1625210..1626121) /codon_start=1 /transl_table=11 /gene="opcA" /locus_tag="BQ2027_MB1481C" /product="PUTATIVE OXPP CYCLE PROTEIN OPCA" /note="Mb1481c, opcA, len: 303 aa. Equivalent to Rv1446c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 303 aa overlap). Putative opcA, OxPP cycle protein. Highly similar to S72774 B1496_F1_30 protein from Mycobacterium leprae (265 aa), FASTA scores: opt: 1056, E(): 0, (70.3% identity in 239 aa overlap). Also similar to OPCA_NOSS2|P48971 putative oxppcycle protein opca from Nostoc punctiforme (465 aa), fasta scores: opt: 177, E(): 7.3e-05, (23.4% identity in 321 aa overlap). AIDS IN G6PD ACTIVITY. Protein product from Mb1481c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1481c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004555" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC7" /protein_id="SIU00084.1" /translation="MIVDLPDTTTTAVNKKLDELREKIGAVAMGRVLTLIIAPDSEAM LEESIEAANDASHEHPSRIIVTMRGDPYADRPRLDAQLRVGADAGAGEFVVLRLSGPL AGHADSVVIPFLLPDIPVVAWWPDIAPAVPAQDALGKLAIRRITDATNAIDPLSAIKS RLAGYGAGDTDLAWSRITYWRALLTSAVDQPPHEPIESALVSGLKTEPALDVLAGWLA SRIEGPVRRAVGELKVELVRNSETIVLSRPQEGITATLTRTGKPDALVPLARRVTGEC LAEDLRRLDPDEIYCAALEGIKKVQYR" CDS complement(1626174..1627718) /codon_start=1 /transl_table=11 /gene="zwf2" /locus_tag="BQ2027_MB1482C" /product="PROBABLE GLUCOSE-6-PHOSPHATE 1-DEHYDROGENASE ZWF2 (G6PD)" /note="Mb1482c, zwf2, len: 514 aa. Equivalent to Rv1447c, len: 514 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 514 aa overlap). Probable zwf2 (ZWF), Glucose-6-phosphate 1-dehydrogenase (EC 1.1.1.49), highly similar to many e.g. G6PD_SYNY3|P73411 Synechocystis sp. (509 aa), FASTA scores: opt: 1578, E(): 0, (46.8% identity in 509 aa overlap). Also similar to Mycobacterium tuberculosis Rv1121, zwf glucose-6-phosphate 1-dehydrogenase. Contains PS00069 Glucose-6-phosphate dehydrogenase active site. M. tuberculosis has two genes for ZWF. This one looks like a classical ZWF. BELONGS TO THE GLUCOSE-6-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb1482c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1482c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A585" /db_xref="InterPro:IPR001282" /db_xref="InterPro:IPR019796" /db_xref="InterPro:IPR022674" /db_xref="InterPro:IPR022675" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A585" /protein_id="SIU00085.1" /translation="MKPAHAAASWRNPLRDKRDKRLPRIAGPCGMVIFGVTGDLARKK VMPAVYDLANRGLLPPTFSLVGFARRDWSTQDFGQVVYNAVQEHCRTPFRQQNWDRLA EGFRFVPGTFDDDDAFAQLAETLEKLDAERGTGGNHAFYLAIPPKSFPVVCEQLHKSG LARPQGDRWSRVVIEKPFGHDLASARELNKAVNAVFPEEAVFRIDHYLGKETVQNILA LRFANQLFDPIWNAHYVDHVQITMAEDIGLGGRAGYYDGIGAARDVIQNHLMQLLALT AMEEPVSFHPAALQAEKIKVLSATRLAEPLDQTTSRGQYAAGWQGGEKVVGLLDEEGF AEDSTTETFAAITLEVDTRRWAGVPFYLRTGKRLGRRVTEIALVFRRAPHLPFDATMT DELGTNAMVIRVQPDEGVTLRFGSKVPGTAMEVRDVNMDFSYGSAFAEDSPEAYERLI LDVLLGEPSLFPVNAEVELAWEILDPALEHWAAHGTPDAYEAGTWGPESSLEMLRRTG REWRRP" CDS complement(1627715..1628836) /codon_start=1 /transl_table=11 /gene="tal" /locus_tag="BQ2027_MB1483C" /product="PROBABLE TRANSALDOLASE TAL" /note="Mb1483c, tal, len: 373 aa. Equivalent to Rv1448c, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 373 aa overlap). Probable tal, Transaldolase (EC 2.2.1.2), highly similar to many e.g. TAL_MYCLE|P55193 transaldolase from Mycobacterium leprae (375 aa), FASTA scores: opt: 1891, E(): 0, (78.6% identity in 370 aa overlap). BELONGS TO THE TRANSALDOLASE FAMILY. Protein product from Mb1483c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1483c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59955" /db_xref="InterPro:IPR001585" /db_xref="InterPro:IPR004732" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR018225" /db_xref="UniProtKB/Swiss-Prot:P59955" /protein_id="SIU00086.1" /translation="MTAQNPNLAALSAAGVSVWLDDLSRDRLRSGNLQELIDTKSVVG VTTNPSIFQKALSEGHTYDAQIAELAARGADVDATIRTVTTDDVRSACDVLVPQWEDS DGVDGRVSIEVDPRLAHETEKTIQQAIELWKIVDRPNLFIKIPATKAGLPAISAVLAE GISVNVTLIFSVQRYREVMDAYLTGMEKARQAGHSLSKIHSVASFFVSRVDTEIDKRL DRIGSRQALELRGQAGVANARLAYAAYREVFEDSDRYRSLKVDGARVQRPLWASTGVK NPDYSDTLYVTELVAPHTVNTMPEKTIDAVADHGVIQGDTVTGTASDAQAVFDQLGAI GIDLTDVFAVLEEEGVRKFEASWNELLQETRAHLDTAAQ" CDS complement(1628853..1630955) /codon_start=1 /transl_table=11 /gene="tkt" /locus_tag="BQ2027_MB1484C" /product="transketolase tkt (tk)" /note="Mb1484c, tkt, len: 700 aa. Equivalent to Rv1449c, len: 700 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 700 aa overlap). Probable tkt, Transketolase (EC 2.2.1.1). Highly similar to several e.g. TKT_MYCLE|P46708 transketolase (tk) from Mycobacterium leprae (699 aa), FASTA scores: opt: 4216, E(): 0, (89.1% identity in 700 aa overlap). Start site chosen by homology. Contains PS00801 Transketolase signature 1. BELONGS TO THE TRANSKETOLASE FAMILY. Protein product from Mb1484c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1484c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59956" /db_xref="InterPro:IPR005474" /db_xref="InterPro:IPR005475" /db_xref="InterPro:IPR005478" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR020826" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR033247" /db_xref="InterPro:IPR033248" /db_xref="UniProtKB/Swiss-Prot:P59956" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00087.1" /translation="MTTLEEISALTRPRHPDDWTEIDSAAVDTIRVLAADAVQKVGNG HPGTAMSLAPLAYTLFQRTMRHDPSDTHWLGRDRFVLSAGHSSLTLYIQLYLGGFGLE LSDIESLRTWGSKTPGHPEFRHTPGVEITTGPLGQGLASAVGMAMASRYERGLFDPDA EPGASPFDHYIYVIASDGDIEEGVTSEASSLAAVQQLGNLIVFYDRNQISIEDDTNIA LCEDTAARYRAYGWHVQEVEGGENVVGIEEAIANAQAVTDRPSFIALRTVIGYPAPNL MDTGKAHGAALGDDEVAAVKKIVGFDPDKTFQVREDVLTHTRGLVARGKQAHERWQLE FDAWARREPERKALLDRLLAQKLPDGWDADLPHWEPGSKALATRAASGAVLSALGPKL PELWGGSADLAGSNNTTIKGADSFGPPSISTKEYTAHWYGRTLHFGVREHAMGAILSG IVLHGPTRAYGGTFLQFSDYMRPAVRLAALMDIDTIYVWTHDSIGLGEDGPTHQPIEH LSALRAIPRLSVVRPADANETAYAWRTILARRNGSGPVGLILTRQGVPVLDGTDAEGV ARGGYVLSDAGGLQPGEEPDVILIATGSEVQLAVAAQTLLADNDILARVVSMPCLEWF EAQPYEYRDAVLPPTVSARVAVEAGVAQCWHQLVGDTGEIVSIEHYGESADHKTLFRE YGFTAEAVAAAAERALDN" CDS complement(1631394..1635620) /codon_start=1 /transl_table=11 /gene="PE_PGRS27" /locus_tag="BQ2027_MB1485C" /product="pe-pgrs family protein pe_pgrs27" /note="Mb1485c, PE_PGRS27, len: 1408 aa. Similar to Rv1450c, len: 1329 aa, from Mycobacterium tuberculosis strain H37Rv, (92.1% identity in 1417 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), fasta scores: opt: 2112, E(): 0, (56.5% identity in 630 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 27 bp, 207 bp and 27 bp, substitutions of 60 bp to 63 bp and 11 bp, and a 27 bp deletion, leads to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (1408 aa versus 1329 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE8" /protein_id="SIU00088.1" /translation="MSLVIVAPETVAAAALDVARIGSSIGVANSAAAGSTTSVLAAGA DEVSAAIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLATLE HNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGGSGAPGQVGGA GGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAG LFGVGGTGGPGGPGGPGGPGGPGGPGGVGGTGGAGGLGGTLYGAGGHGGAGGPGPIGG VGGHGGVGGAAGLLGVGGHGGAGGHGAAGIAGAAGKGGIFPDGSNGGAGGDAGDGGTG GRGGWLAGAGGAGGDGGIGGTGGAGGAGFSRALIVAGDNGGDGGNGGMGGAGGAGGPG GAGGLISLLGGQGAGGDGGDGGAGGVGGDRGAGANGNQAVNAGAGGAGGHGGDPGAGG AGGTGGAGSTIGAHGAAGASPTSGGNGGAGGNGAHFSSGGKAGGNGGAGGAGGLVGNG GAGGAGGNGAPGAPPSGGDPNGGGGGAGGAGGKGGDGGAQAGDGGAGGAGGKGGNGGN GATGATGLNGLGAGADGTDGGKGGNGGAGGGGGAGGQGGKALAATHQDGSMGAGGAGG NGGAGGMGGDGGNGAKGTFDNGGDGVGGNGGNGGSRGIGGAGGIGGAGSTAGADGARG ATPTSGGNGGTGGNGANATVAGGAGGAGGKGGNGGLVGNGGAGGKGGDGMAGVAGSSP TTAGESGTSGQNGGAGGAGGAGGRGGDFGGDGGTGGAGGNGADGGAGGNGANGANATT PGAKGGDGGHGGPGAQGGNGGQGGPGGLAGNLFGQNGIQGVGGSGGKGGAGGLAGDGG NGANGNFAFGDGNGGHGGNGGNPGAGGQGGSGGAGSTPGAKGAHGFTPTSGGDGGDGG NGGHSQVVGGNGGDGGNGGNGGSAGTGGNGGRGGDGAFGGMSANATNPGENGPNGNPG GNGGAGGAGGAGLNGGNGGAGGNGGLGGFGGNGAAGANGVAVGAPGQPGGAGGHGGAG GNGGAGGNGGQGVVSDGAGGAGGAGGDGGAPGDGANGGNGQGAGAFAGGGGGRGGDGG NAGNAGAGGPGGTGSTAGKAGPAGSILHDGGNGGHGGHGAASGGNGGPGGHGGNGGNG GTGANGGNGGIGGTGGAGSTGAKGVLGTNEGDGGDGGRGGNGGRGGNGGQGLTGAGGN GGTGGTPGNGGNGGNGASGDLVTSPGDGGGGGRGGDAGRGGDAGLGGSSGPGGTPGDW GTGGTGGTGGTGGQGANGGLTGGRGGTGGNGGAGGTGGTGHNGSQPGMGGNGGAGGFG GNGFAGVGGRGGMGGSGGTGGTGDAGPFGTGTGGTGGHGGQGGGGGFSILLGLGGLGG LGSPGSIATGTAGGAGGGGGFGGLGGGEFV" CDS 1636022..1636948 /codon_start=1 /transl_table=11 /gene="ctaB" /locus_tag="BQ2027_MB1486" /product="PROBABLE CYTOCHROME C OXIDASE ASSEMBLY FACTOR CTAB" /note="Mb1486, ctaB, len: 308 aa. Equivalent to Rv1451, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 308 aa overlap). Probable ctaB, cytochrome C oxidase assembly factor, and integral membrane protein. Highly similar to several Mycobacterium leprae proteins e.g. Q49685 CYOE cytochrome O ubiquinol oxidase assembly factor (300 aa), FASTA scores: opt: 1636, E(): 0, (82.7% identity in 307 aa overlap); NP_301495.1|NC_002677 putative protoheme IX farnesyltransferase (321 aa); NP_301495.1|NC_002677 putative protoheme IX farnesyltransferase (321 aa). Mb1486 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7U021" /db_xref="InterPro:IPR000537" /db_xref="InterPro:IPR006369" /db_xref="UniProtKB/Swiss-Prot:Q7U021" /protein_id="SIU00089.1" /translation="MNVRGRVAPRRVTGRAMSTLLAYLALTKPRVIELLLVTAIPAML LADRGAIHPLLMLNTLVGGMMAATGANTLNCVADADIDKVMKRTARRPLAREAVPTRN ALALGLTLTVISFFWLWCATNLLAGVLALVTVAFYVFVYTLWLKRRTSQNVVWGGAAG CMPVMIGWSAITGTIAWPALAMFAIIFFWTPPHTWALAMRYKQDYQVAGVPMLPAVAT ERQVTKQILIYTWLTVAATLVLALATSWLYGAVALVAGGWFLTMAHQLYAGVRAGEPV RPLRLFLQSNNYLAVVFCALAVDSVIALPTLH" CDS complement(1636997..1639384) /codon_start=1 /transl_table=11 /gene="PE_PGRS28" /locus_tag="BQ2027_MB1487C" /product="pe-pgrs family protein pe_pgrs28" /note="Mb1487c, PE_PGRS28, len: 795 aa. Highly similar to Rv1452c, len: 741 aa, from Mycobacterium tuberculosis strain H37Rv, (89.1% identity in 795 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), fasta scores: opt: 2090, E(): 0, (56.3% identity in 641 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, substitutions of 3 bp to 144 bp, 40 bp to 49 bp, 31 bp to 31 bp, 4 bp to 4 bp (gggc-ttgg), 12 bp to 12 bp, 11 bp to 11 bp, 60 bp to 63 bp, 13 bp to 13 bp and 2 bp to 2 bp (tt-cc), and a 9 bp insertion (*-cccgccggc), leads to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (795 aa versus 741 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE4" /protein_id="SIU00090.1" /translation="MSLVIVAPETVAAAALDVARIGSSIGVANSAAAGSTTSVLAAGA DEVSAAIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLATLE HNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGGSGAPGQVGGA GGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAG LFGVGGTGGPGGPGGPGGVGGTGGAGGLGGTLYGAGGHGGAGGPGPIGGVGGHGGVGG AAGLLGVGGHGGAGGHGAAGIAGAAGKGGIFPDGSNGGAGGDAGDGGTGGRGGWLAGA GGAGGDGGIGGTGGAGGAGFSRALIVAGDNGGDGGNGGMGGAGGAGGPGGAGGLISLL GGQGAGGDGGDGGAGGVGGDRGAGANGNQAVNAGAGGAGGHGGDPGAGGAGGTGGAGS TIGAHGAAGASPTSGGNGGAGGNGAHFSSGGKAGGNGGAGGAGGHGGLVGNGGAGGAG GNGANGAAGTNASDSGAVGGKGNSGGNGGQGGAGGDGGTLAGNGGAGGTGGRGADGGL GGSGAEGANATTAGERGQDGGKGGNGGVGGTGGNAVAPGANGGHGGNGGNPGFSGAGG LGGLSGDGVTRAAQGATPDFADTGGKGGNGGNGANAVAPGGTGASGGAGGNAGAGGKG GENIIGDGGGGGNGGAGGQGGDGTAGAGGDGGAGGKGGDGGDGGSDPTEGRGFGGLGG AGGAGGKGGAGTLLGLTVFGDNGGAGVLGDSTDPDGSGGAGGAGGAGGAGGDPTI" CDS 1639536..1640801 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1488" /product="POSSIBLE TRANSCRIPTIONAL ACTIVATOR PROTEIN" /note="Mb1488, -, len: 421 aa. Equivalent to Rv1453, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 421 aa overlap). Possible transcriptional activator, similar to Q50018 putative transcriptional activator trx from Mycobacterium leprae (517 aa), FASTA scores: opt: 1719, E(): 0, (54.0% identity in 500 aa overlap). Also highly similar to Mycobacterium tuberculosis proteins Rv2370c, Rv1194c, Rv2242, Rv1186c, and to the further upstream ORF's Rv1429|MTCY493.25c (28.1% identity in 335 aa overlap). Start changed since first submission (-11 aa)." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR041522" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/TrEMBL:A0A1R3XYD2" /protein_id="SIU00091.1" /translation="MALRETSPRIHELIREAARIALNPTQEWLDEFDRAILAANPSIA ADPALATVVKRSNRAHLIHFAAANLRNPGAPVPANLGPEPLRMARDLVRVGLDALALD IYRIGQNVAWRRWTDIAFGLTSDPDELHELLDVPFRTANEFVDTTLAGITTEMQLERD KLTRDVPAERRKIVQLLIDGAPISREHAEARLGYPLDRSHTAAVIWGDQAQGDHSHLD RVADAFGHAGGCPHPLVVVAGAATRWVWVKDAPGFDIDLIHEVLHDIPDARIAIGATA PGIEGFRRSHRDALTTARMIIRLESPHRVAFFTDVEMVALLTENAEGADDFIQRTLGN LESASPALKTTLLTFINQQCNASRAARLLFTHRNTLMNRLETAQRLLPRPLADTTIHV AVALEAQQWREKQTSDPPAKKESNGTKMR" CDS complement(1640829..1641815) /codon_start=1 /transl_table=11 /gene="qor" /locus_tag="BQ2027_MB1489C" /product="PROBABLE QUINONE REDUCTASE QOR (NADPH:quinone reductase) (Zeta-crystallin homolog protein)" /note="Mb1489c, qor, len: 328 aa. Equivalent to Rv1454c, len: 328 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 328 aa overlap). Probable qor, quinone oxidoreductase (EC 1.6.5.5), simiar to U87282|RCU87282_2 quinone oxidoreductase from Rhodobacter capsulatus (323 aa), FASTA scores: opt: 849, E(): 0, (44.7% identity in 329 aa overlap). Also similar to MTCY180.06 Hypothetical protein from Mycobacterium tuberculosis (334 aa), FASTA scores: opt: 430, E(): 2e-14, (32.3% identity in 350 aa overlap). Contains PS01162 Quinone oxidoreductase / zeta-crystallin signature. Protein product from Mb1489c detected using shotgun mass spectrometry. Mb1489c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYD6" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XYD6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00092.1" /translation="MHAIEVTETGGPGVLRHVDQPQPQPGHGELLIKAEAIGVNFIDT YFRSGQYPRELPFVIGSEVCGTVEAVGPGVTAADTAISVGDRVVSASANGAYAEFCTA PASLTAKVPDDVTSEVAASALLKGLTAHYLLKSVYPVKRGDTVLVHAGAGGVGLILTQ WATHLGVRVITTVSTAEKAKLSKDAGADVVLDYPEDAWQFAGRVRELTGGTGVQAVYD GVGATTFDASLASLAVRGTLALFGAASGPVPPVDPQRLNAAGSVYLTRPSLFHFTRTG EEFSWRAAELFDAIGSEAITVAVGGRYPLADALRAHQDLEARKTVGSVVLLP" CDS 1641835..1642698 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1490" /product="conserved protein" /note="Mb1490, -, len: 287 aa. Equivalent to Rv1455, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 287 aa overlap). Conserved hypothetical protein, some similarity from aa 80-160 to Z99125|MLCL536.35c hypothetical Mycobacterium leprae protein (101 aa), FASTA scores: opt: 238, E(): 1.8e-08, (51.3% identity in 78 aa overlap). Protein product from Mb1490 detected using SWATH mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I3" /protein_id="SIU00093.1" /translation="MKLARPDVFHPRVVLAGWPQQPAGDGDDAGLVAALRHRGLHAGW LSWDDPEIVHADLVILRATRDYPARLDEFLAWTTRVANLLNSRPVVAWNVERRYLRDL MDRGVPTVPGEVYVPGEPVRLPRKGQVFVGPTIGTGTRRCSARFAAEFVAQLHAAGQA VLVQPGGSGDETVLVFLGGEPSHAFTKQADTWRQTEPDFEIWDVGAAAVAGAAAQVGV DPGELLYARAHITGGSRDPRLLELQLVDPSLGWQWLDPDIRNLAQRDFALCVQSALER LGLGPFSHRRP" CDS complement(1642648..1643580) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1491C" /product="PROBABLE UNIDENTIFIED ANTIBIOTIC-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER" /note="Mb1491c, -, len: 310 aa. Equivalent to Rv1456c, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 310 aa overlap). Possible unidentified antibiotic-transport integral membrane protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.34 from Mycobacterium leprae (311 aa), FASTA scores: opt: 1607, E(): 0, (83.3% identity in 300 aa overlap). Mb1491c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZD6" /db_xref="InterPro:IPR003780" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD6" /protein_id="SIU00094.1" /translation="MPYDRAVSPSLRVQRVIAAIVILTQGGIAVTGAIVRVTASGLGC PTWPQCFPGSFTPVVVAEVPRVHQAVEFGNRMVTFAVVIAAALAVLVVTRARRRTEVL AYAWLMPVSTVVQAMIGGITVRTGLLWWTVAIHLLASMTMVWLAVLLYVKIGQPDDGV VHELVVSPLRALTALSALNLAAVLVTGTLVTAAGPHAGDRSPSRTVPRLKVEITTLVH MHSSLLVAYLALLIGLGFGLLAVGATRAILVRLAVLLALVATQAAVGTTQYFTGVPAA LVAIHVAGAAAVTAATAALWASMGERAQPQPLQR" CDS complement(1643692..1644477) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1492C" /product="PROBABLE UNIDENTIFIED ANTIBIOTIC-TRANSPORT INTEGRAL MEMBRANE ABC TRANSPORTER" /note="Mb1492c, -, len: 261 aa. Equivalent to Rv1457c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 261 aa overlap). Possible unidentified antibiotic-transport integral membrane protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.32 from Mycobacterium leprae (265 aa), FASTA scores: opt: 1415, E(): 0, (83.1% identity in 260 aa overlap). Mb1492c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYE5" /db_xref="InterPro:IPR000412" /db_xref="InterPro:IPR004377" /db_xref="InterPro:IPR013525" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE5" /protein_id="SIU00095.1" /translation="MTQTNRPAFPAGTFSPDPRPNAVPLMLAAQFSLELKLLLRNGEQ LLLTMFIPITLLVGLTLLPMGSFGHNRAATFVPVIMALAVISTAFTGQAIAVAFDRRY GALKRLGATPLPVWGIIAGKSLAVVAVVFLQAIILGAIGFALGWRPALTALTLGAGII ALGTAGFAALGLLLGGTLRAEIVLAVANLMWFVFAGFGALTLESNVIPTAFKWVARVT PSGALTEALSQAMTVSVDWFGIVVLAVWGALAALAALRWFRFT" CDS complement(1644474..1645415) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1493C" /product="PROBABLE UNIDENTIFIED ANTIBIOTIC-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1493c, -, len: 313 aa. Equivalent to Rv1458c, len: 313 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 313 aa overlap). Possible unidentified antibiotic-transport ATP-binding protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.31 from Mycobacterium leprae (315 aa), FASTA scores: opt: 1812, E(): 0, (88.0% identity in 308 aa overlap). Similar to AF027770|AF027770_7 ABC-type transporter in FxbA region in Mycobacterium smegmatis (284 aa), FASTA scores: opt: 1412, E(): 0, (85.1% identity in 248 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1493c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1493c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYT3" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XYT3" /protein_id="SIU00096.1" /translation="MNRAPDTPEVVLRLRGVCKRYGSITAVSNLDLDVHDAEVMALLG PNGAGKTTTVEMCEGFVRPDAGSIEVLGLDPITDNARLRARIGVMLQGGGGYPAARAG EMLDLVASYAANPLDPHWLLDTLGLTEAARTTYRRLSGGQQQRLALACALVGRPQLVF LDEPTAGMDAHARVLVWELIDALRRDGVTVVLTTHHLKEAEELADRLVIIDHGVTVAA GTPAELMRSGAKDQLRFTAPPRLDLSLLASALPEGYQATELTPGEYLVEGPVDPQVLA TVTAWCAQIDVLATDMRVEQRSLEDVFLDLTGRKLRQ" CDS complement(1645518..1647293) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1494C" /product="POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb1494c, -, len: 591 aa. Equivalent to Rv1459c, len: 591 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 591 aa overlap). Possible conserved integral membrane protein, equivalent to MLCL536.30|Z99125 hypothetical protein from Mycobacterium leprae (593 aa), FASTA scores: opt: 1670, E(): 0, (78.6% identity in 585 aa overlap). Also similar to Mycobacterium tuberculosis protein Rv2174|MTV021.07 (33.1% identity in 523 aa overlap). Protein product from Mb1494c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1494c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYF9" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF9" /protein_id="SIU00097.1" /translation="MAARHHTLSWSIASLHGDEQAVGAPLTTTELTALARTRLFGATG TVLMAIGALGAGARPVVQDPTFGVRLLNLPSRIQTVSLTMTTTGAVMMALAWLMLGRF TLGRRRMSRGELDRTLLLWMLPLLIAPPMYSKDVYSYLAQSEIGRDGLDPYRVGPASG LGLGHVFTLSVPSLWRETPAPYGPLFLWIGRGISSLTGENIVAAVLCHRLVVLIGVTL IVWATPRLAQRCGVAEVSALWLGAANPLLIMHLVAGIHNEALMLGLMLTGVEFALRGL DMANTPRPSPETWRLGPATIRASRRPELGASPRAGASRAVKPRPEWGPLAMLLAGSIL ITLSSQVKLPSLLAMGFVTTVLAYRWGGNLRALLLAAAVMASLTLAIMAILGWASGLG FGWINTLGTANVVRSWMSPPTLLALGTGHVGILLGLGDHTTAVLSLTRAIGVLIITVM VCWLLLAVLRGRLHPIGGLGVALAVTVLLFPVVQPWYLLWAIIPLAAWATRPGFRVAA ILATLIVGIFGPTANGDRFALFQIVDATAASAIIVILLIALTYTRLPWRPLAAEQVVT AAESASKTPATRRPTAAPDAYADST" CDS 1647341..1648147 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1495" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1495, -, len: 268 aa. Equivalent to Rv1460, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 268 aa overlap). Probable transcriptional regulatory protein. Equivalent to Z99125|MLCL536.29c hypothetical protein from Mycobacterium leprae (254 aa), FASTA scores: opt: 1273, E(): 0, (79.6% identity in 250 aa overlap). Possible helix-turn-helix motif between aa 68 - 89. Start changed since original submission. Protein product from Mb1495 detected using SWATH mass spectrometry. Mb1495 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF5" /protein_id="SIU00098.1" /translation="MTSTTLPHRASLVDRSTEFCHTDVVKIPAVSTTVPAAVSDGHTR RAIVRLLLESGSITAGEIGDRLGLSAAGVRRHLDALIEAGDAEASAAAPWQQVGRGRP AKRYRLTAAGRAKLDHSYDDLASAAMRQLREIGGEEAVRTFARRRIDAILADVAPADG PDDAALEAAAERIATALSKAGYVATTTRVGGPIHGVQICQHHCPVSHVAEEFPELCET EQQAMAEVLGTHVQRLATIVNGDCACTTHVPLSPAPSPRPPATSTEGVSR" CDS 1648144..1650684 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1496" /product="Iron-sulfur cluster assembly protein SufB" /note="Mb1496, -, len: 846 aa. Equivalent to Rv1461, len: 846 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 846 aa overlap). Conserved hypothetical protein. Equivalent of spliced protein from Mycobacterium leprae MLCL536.28c len: 869. Residues 1-253 represent N-extein, and 613-846 the C-extein. The intein present from residues 254 - 612 is different in sequence and site of the insertion from the one present in MLCL536.28c. FASTA scores: Z99125|MLCL536_23 Mycobacterium leprae cosmid L536 (869 aa), opt: 1498 E(): 0, (54.1% identity in 917 aa overlap). The mature protein is similar to Z99120|BSUB0017_150 hypothetical Bacillus subtilis protein (465 aa), FASTA scores: opt:1053, E(): 0, (34.8% identity in 821 aa overlap). The intein shows some similarity to inteins from U67548|MJU67548_6 Methanococcus jannaschii (895 aa), FASTA scores: opt: 181, E(): 0.00023, (25.2% identity in 274 aa overlap). Protein product from Mb1496 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1496 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67126" /db_xref="InterPro:IPR000825" /db_xref="InterPro:IPR003586" /db_xref="InterPro:IPR003587" /db_xref="InterPro:IPR004042" /db_xref="InterPro:IPR006141" /db_xref="InterPro:IPR006142" /db_xref="InterPro:IPR010231" /db_xref="InterPro:IPR027434" /db_xref="InterPro:IPR030934" /db_xref="InterPro:IPR036844" /db_xref="InterPro:IPR037284" /db_xref="UniProtKB/Swiss-Prot:P67126" /protein_id="SIU00099.1" /translation="MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGA NAQRGLSEAVVRDISAKKNEPDWMLQSRLKALRIFDRKPIPKWGSNLDGIDFDNIKYF VRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVVYHQIREDLEA QGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTAVWSGGSFIYVPPGVHVDI PLQAYFRINTENMGQFERTLIIADEGSYVHYVEGCLPAGELITTADGDLRPIESIRVG DFVTGHDGRPHRVTAVQVRDLDGELFTFTPMSPANAFSVTAEHPLLAIPRDEVRVMRK ERNGWKAEVNSTKLRSAEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYY LAEGHACLTNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVY TKAGYAAMRDNVGIGSSNKKLSDLLMRQDETFLRELVDAYVNGDGNVTRRNGAVWKRV HTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVRKDIYQVQWTEGGRGPK QARDCGDYFAVPIKKRAVREAHEPVYNLDVENPDSYLAYGFAVHNCTAPIYKSDSLHS AVVEIIVKPHARVRYTTIQNWSNNVYNLVTKRARAEAGATMEWIDGNIGSKVTMKYPA VWMTGEHAKGEVLSVAFAGEDQHQDTGAKMLHLAPNTSSNIVSKSVARGGGRTSYRGL VQVNKGAHGSRSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFY LMSRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG" CDS 1650681..1651874 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1497" /product="Iron-sulfur cluster assembly protein SufD" /note="Mb1497, -, len: 397 aa. Equivalent to Rv1462, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 397 aa overlap). Conserved hypothetical protein. Equivalent to MLCL536.27c|Z99125 hypothetical protein from Mycobacterium leprae (392 aa), FASTA scores: opt: 2059, E(): 0, (80.4% identity in 392 aa overlap). Also similar to nearby Mycobacterium tuberculosis hypothetical protein Rv1461. Protein product from Mb1497 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1497 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59973" /db_xref="InterPro:IPR000825" /db_xref="InterPro:IPR011542" /db_xref="InterPro:IPR037284" /db_xref="UniProtKB/Swiss-Prot:P59973" /protein_id="SIU00100.1" /translation="MTAPGLTAAVEGIAHNKGELFASFDVDAFEVPHGRDEIWRFTPL RRLRGLHDGSARATGSATITVSERPGVYTQTVRRGDPRLGEGGVPTDRVAAQAFSSFN SATLVTVERDTQVVEPVGITVTGPGEGAVAYGHLQVRIEELGEAVVVIDHRGGGTYAD NVEFVVDDAARLTAVWIADWADDTVHLSAHHARIGKDAVLRHVTVMLGGDVVRMSAGV RFCGAGGDAELLGLYFADDGQHLESRLLVDHAHPDCKSNVLYKGALQGDPASSLPDAH TVWVGDVLIRAQATGTDTFEVNRNLVLTDGARADSVPNLEIETGEIVGAGHASATGRF DDEQLFYLRSRGIPEAQARRLVVRGFFGEIIAKIAVPEVRERLTAAIEHELEITESTE KTTVS" CDS 1651871..1652671 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1498" /product="PROBABLE CONSERVED ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1498, -, len: 266 aa. Equivalent to Rv1463, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 266 aa overlap). Probable conserved ATP-binding protein ABC transporter, equivalent to Z99125|MLCL536.26c putative ABC transporter ATP-binding protein from Mycobacterium leprae (260 aa), FASTA scores: opt: 1444, E(): 0, (86.0% identity in 267 aa overlap). Very similar to U38804|PPU38804_55 ATP-DEPENDENT TRANSPORTER YCF16 from PORPHYRA PURPUREA chloroplast (251 aa), FASTA scores: opt: 822, E(): 0, (52.4% identity in 248 aa overlap); and similar to others. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1498 detected using shotgun mass spectrometry. Mb1498 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYE2" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR010230" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XYE2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00101.1" /translation="MTILEIKDLHVSVENPAEADHEIPILRGVDLTVKSGETHALMGP NGSGKSTLSYAIAGHPKYHVTSGTITLDGADVLAMSIDERARAGLFLAMQYPVEVPGV SMSNFLRSAATAIRGEPPKLRHWVKEVKAAMAALDIDPAFAERSVNEGFSGGEKKRHE ILQLELLKPKIAILDETDSGLDVDALRVVSEGVNRYAESQHGGILLITHYTRILRYIH PEYVHVFVGGRIVESGGSELADELDQNGYVRFSPASGRYPHQPAPTGA" CDS 1652673..1653926 /codon_start=1 /transl_table=11 /gene="csd" /locus_tag="BQ2027_MB1499" /product="PROBABLE CYSTEINE DESULFURASE CSD" /note="Mb1499, csd, len: 417 aa. Equivalent to Rv1464, len: 417 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 417 aa overlap). Probable csd, cysteine desulfurase (EC 4.4.1.-). Equivalent to Q49690|MLCL536.25C cysteine desulfurase from Mycobacterium leprae (418 aa), FASTA scores: opt: 2333, E(): 0, (85.4% identity in 417 aa overlap); and similar to cysteine desulfurase from other organisms. Also similar to M. tuberculosis proteins Rv3025c|ISCS and Rv3778c. Contains PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site. BELONGS TO CLASS-V OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. CSD SUBFAMILY. Protein product from Mb1499 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1499 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63517" /db_xref="InterPro:IPR000192" /db_xref="InterPro:IPR010970" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR020578" /db_xref="UniProtKB/Swiss-Prot:P63517" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00102.1" /translation="MTASVNSLDLAAIRADFPILKRIMRGGNPLAYLDSGATSQRPLQ VLDAEREFLTASNGAVHRGAHQLMEEATDAYEQGRADIALFVGADTDELVFTKNATEA LNLVSYVLGDSRFERAVGPGDVIVTTELEHHANLIPWQELARRTGATLRWYGVTDDGR IDLDSLYLDDRVKVVAFTHHSNVTGVLTPVSELVSRAHQSGALTVLDACQSVPHQPVD LHELGVDFAAFSGHKMLGPNGIGVLYGRRELLAQMPPFLTGGSMIETVTMEGATYAPA PQRFEAGTPMTSQVVGLAAAARYLGAIGMAAVEAHERELVAAAIEGLSGIDGVRILGP TSMRDRGSPVAFVVEGVHAHDVGQVLDDGGVAVRVGHHCALPLHRRFGLAATARASFA VYNTADEVDRLVAGVRRSRHFFGRA" CDS 1653923..1654411 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1500" /product="POSSIBLE NITROGEN FIXATION RELATED PROTEIN" /note="Mb1500, -, len: 162 aa. Equivalent to Rv1465, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 162 aa overlap). Possible nitrogen fixation related protein. Equivalent to Z99125|MLCL536.24c nitrogen fixation protein NIFU from Mycobacterium leprae (165 aa), FASTA scores: opt: 870, E(): 0, (81.8% identity in 165 aa overlap). Also similar to O32163|Z99120|NIFU_BACSU NifU-like protein from Bacillus subtilis (147 aa), FASTA scores: opt: 354, E(): 4.1e-17, (38.3% identity in 141 aa overlap) and to AL096839|SCC22.02 hypothetical protein from Streptomyces coelicolor (156 aa), FASTA scores: opt: 569, E(): 1.2e-31, (56.3% identity in 158 aa overlap). Protein product from Mb1500 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1500 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0J3" /db_xref="InterPro:IPR002871" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J3" /protein_id="SIU00103.1" /translation="MTLRLEQIYQDVILDHYKHPQHRGLREPFGAQVYHVNPICGDEV TLRVALSEDGTRVTDVSYDGQGCSISQAATSVLTEQVIGQRVPRALNIVDAFTEMVSS RGTVPGDEDVLGDGVAFAGVAKYPARVKCALLGWMAFKDALAQASEAFEEVTDERNQR TG" CDS 1654386..1654733 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1501" /product="PaaD-like protein (DUF59) involved in Fe-S cluster assembly" /note="Mb1501, -, len: 115 aa. Equivalent to Rv1466, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 115 aa overlap). Conserved hypothetical protein. Equivalent to Z99125|MLCL536.23c hypothetical protein from Mycobacterium leprae (115 aa), FASTA scores: opt: 648, E(): 0, (81.7% identity in 115 aa overlap). Similar to ORF's downstream of sigma factors in Streptococcus mutans and Streptococcus pneumoniae e.g. O06451 ORF3 downstream of RpoD (SPDNAGCPO) (109 aa). Alternative TTG start possible at 13757 then avoids overlap with MTV007.12. Protein product from Mb1501 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1501 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002744" /db_xref="InterPro:IPR034904" /db_xref="UniProtKB/TrEMBL:A0A1R3XZE4" /protein_id="SIU00104.1" /translation="MSETSAPAEELLADVEEAMRDVVDPELGINVVDLGLVYGLDVQD GDEGTVALIDMTLTSAACPLTDVIEDQSRSALVGSGLVDDIRINWVWNPPWGPDKITE DGREQLRALGFTV" CDS complement(1654828..1656657) /codon_start=1 /transl_table=11 /gene="fadE15" /locus_tag="BQ2027_MB1502C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE15" /note="Mb1502c, fadE15, len: 609 aa. Equivalent to Rv1467c, len: 609 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 609 aa overlap). Probable fadE15, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to NP_302639.1|NC_002677 acyl-CoA dehydrogenase from Mycobacterium leprae (611 aa). Also highly similar to many e.g. T36481 probable acyl-CoA dehydrogenase (fragment) from Streptomyces coelicolor (491 aa) (has its N-terminus very shorter); NP_384640.1|NC_003047 PUTATIVE ACYL-COA DEHYDROGENASE PROTEIN from Sinorhizobium meliloti (598 aa); ACDS_MEGEL|Q06319 acyl-CoA dehydrogenase (short-chain specific) from Megasphaera elsdenii (383 aa), FASTA scores: E(): 2e-12, (25.4% identity in 410 aa overlap); etc. Also highly similar to fadE5|Rv0244c|MTV034.10c ACYL-COA DEHYDROGENASE from M. tuberculosis (611 aa); and similar to other proteins from Mycobacterium tuberculosis. Protein product from Mb1502c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1502c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYF3" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR020953" /db_xref="InterPro:IPR025878" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF3" /protein_id="SIU00105.1" /translation="MGHYIANVRDLEFNLLEVLDIGAVLGTGRYSDLDVDTVRTILAE AARLAEGPIAESFGYADRNPPVFDPNTHSISVPDELAKTVQAIKEAGWWRLGLAEEIG GMPAPPPLAWAVNEMIYCANPSACFFNLGPVLAQSLYIEGNDEQRRWAAEGVQRGWQA TMVLTEPDAGSDVGAGRTKAFEQPDGTWHIEGVKRFISGGDVGNTAENIFHLVLARPE GAGPGTKGLSLFYVPNYLFDPDTFELGARNGVYVTGLEHKMGLKSSPTCELTFGGADV PAVGYLVGGVHNGIAQMFTVIEHARMTIGVKSAGTLSTGYLNALAFAKERVQGADLTQ MTDKTAPRVTIMHHPDVRRSLMTQKAYAEGLRALYLYAAAHQDDAVAQRVSGADHDMA HRVDDLLLPIVKGVGSERAYEILTESLQTLGGSGFLVDYPLEQYIRDAKIDSLYEGTT AIQALDFFFRKIVRDHGKALQFVLAQVTHTVENIDPSLKPQAELLRTALDDITAMTGA LTGYLMSAAQHSSDIYKVGLGSVRYLLAVGDLLIGWRLLVLAGVAHAALADGPSQNDE AFYRGKIAVAAFFAKNMLPKLTGVRSVIENIDDDIMRVPEDAF" CDS complement(1656764..1657876) /codon_start=1 /transl_table=11 /gene="PE_PGRS29" /locus_tag="BQ2027_MB1503C" /product="pe-pgrs family protein pe_pgrs29" /note="Mb1503c, PE_PGRS29, len: 370 aa. Equivalent to Rv1468c, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 370 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Mb1503c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU6" /protein_id="SIU00106.1" /translation="MSFVVANTEFVSGAAGNLARLGSMISAANSAAAAQTTAVAAAGA DEVSAAVAALFGAHGQTYQVLSAQAAAFHSQFVQALSGGAQAYAAAEATNFGPLQPLF DVINAPTLALLNRPLIGNGADGTAANPNGQAGGLLIGNGGNGFSPAAGPGGNGGAAGL LGHGGNGGVGALGANGGAGGTGGWLFGNGGAGGNSGGGGGAGGIGGSAVLFGAGGAGG ISPNGMGAGGSGGNGGLFFGNGGAGASSFLGGGGAGGRAFLFGDGGAGGAALSAGSAG RGGDAGFFYGNGGAGGSGAGGASSAHGGAGGQAGLFGNGGEGGDGGALGGNGGNGGNA QLIGNGGDGGDGGGAGAPGLGGRGGLLLGLPGANGT" CDS 1658118..1660091 /codon_start=1 /transl_table=11 /gene="ctpD" /locus_tag="BQ2027_MB1504" /product="PROBABLE CATION TRANSPORTER P-TYPE ATPASE D CTPD" /note="Mb1504, ctpD, len: 657 aa. Equivalent to Rv1469, len: 657 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 657 aa overlap). Probable ctpD, cation-transporting P-type ATPase D (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. T35947 probable cation-transporting ATPase from Streptomyces coelicolor (638 aa); NP_442633.1|NC_000911 cation-transporting ATPase (E1-E2 ATPase) from Synechocystis sp. strain PCC 6803 (642 aa), FASTA scores: opt: 1438, E(): 0, (41.9% identity in 592 aa overlap); NP_389268.1|NC_000964 protein similar to heavy metal-transporting ATPase from Bacillus subtilis (637 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv3743c|MTV025.091c|CTPJ (660 aa). Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Mb1504 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63686" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR027256" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P63686" /protein_id="SIU00107.1" /translation="MTLTACEVTAAEAPFDRVSKTIPHPLSWGAALWSVVSVRWATVA LLLFLAGLVAQLNGAPEAMWWTLYLACYLAGGWGSAWAGAQALRNKALDVDLLMIAAA VGAVAIGQIFDGALLIVIFATSGALDDIATRHTAESVKGLLDLAPDQAVVVQGDGSER VVAASELVVGDRVVVRPGDRIPADGAVLSGASDVDQRSITGESMPVAKARGDEVFAGT VNGSGVLHLVVTRDPSQTVVARIVELVADASATKAKTQLFIEKIEQRYSLGMVAATLA LIVIPLMFGADLRPVLLRAMTFMIVASPCAVVLATMPPLLSAIANAGRHGVLVKSAVV VERLADTSIVALDKTGTLTRGIPRLASVAPLDPNVVDARRLLQLAAAAEQSSEHPLGR AIVAEARRRGIAIPPAKDFRAVPGCGVHALVGNDFVEIASPQSYRGAPLAELAPLLSA GATAAIVLLDGVAIGVLGLTDQLRPDAVESVAAMAALTAAPPVLLTGDNGRAAWRVAR NAGITDVRAALLPEQKVEVVRNLQAGGHQVLLVGDGVNDAPAMAAARAAVAMGAGADL TLQTADGVTIRDELHTIPTIIGLARQARRVVTVNLAIAATFIAVLVLWDLFGQLPLPL GVVGHEGSTVLVALNGMRLLTNRSWRAAASAAR" CDS 1660135..1660509 /codon_start=1 /transl_table=11 /gene="trxA" /locus_tag="BQ2027_MB1505" /product="PROBABLE THIOREDOXIN TRXA" /note="Mb1505, trxA, len: 124 aa. Equivalent to Rv1470, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 124 aa overlap). Probable trxA, thioredoxin (EC 1.-.-.-), similar to many e.g. P12243|THI1_SYNP7 THIOREDOXIN 1 from Synechococcus sp. (106 aa), FASTA scores: opt: 201, E(): 9.2e-08, (35.4% identity in 99 aa overlap); etc. Highly similar to downstream ORF Rv1471|trxB1 probable thioredoxin from M. tuberculosis (123 aa), FASTA scores: opt: 402, E(): 0, (54.4% identity in 114 aa overlap). Warning: note that Rv3914|MT4033|MTV028.05|trxC can be alternatively named trxA. Protein product from Mb1505 detected using SWATH mass spectrometry. Mb1505 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYG2" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XYG2" /protein_id="SIU00108.1" /translation="MTTRDLTAAYFQQTISANSNVLVYFWAPLCAPCDLFTPTYEASS RKHFDVVHGKVNIETEKDLASIAGVKLLPTLMAFKKGKLVFKQAGIANPAIMDNLVQQ LRAYTFKSPAGEGIGPGTKTSS" CDS 1660525..1660896 /codon_start=1 /transl_table=11 /gene="trxB1" /locus_tag="BQ2027_MB1506" /standard_name="trxB" /product="PROBABLE THIOREDOXIN TRXB1" /note="Mb1506, trxB1, len: 123 aa. Equivalent to Rv1471, len: 123 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 123 aa overlap). Probable trxB1, thioredoxin (EC 1.-.-.-), similar to many bacterial thioredoxins e.g. P33636|THI2_ECOLI from Escherichia coli (139 aa), FASTA scores: opt: 290, E(): 1.8e-13, (44.3% identity in 97 aa overlap); etc. Highly similar to Rv1470|TrxA probable thioredoxin from Mycobacterium tuberculosis (124 aa), FASTA scores: opt: 402, E(): 1.2e-32, (54.4% identity in 114 aa overlap). Contains PS00194 Thioredoxin family active site. BELONGS TO THE THIOREDOXIN FAMILY. Note that previously known as trxB. Protein product from Mb1506 detected using shotgun mass spectrometry. Mb1506 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYF1" /db_xref="InterPro:IPR005746" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR017937" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF1" /protein_id="SIU00109.1" /translation="MTTRDLTAAQFNETIQSSDMVLVDYWASWCGPCRAFAPTFAESS EKHPDVVHAKVDTEAERELAAAAQIRSIPTIMAFKNGKLLFNQAGALPPAALESLVQQ LKAYEVEAGEATTQNGRAQQA" CDS 1660918..1661775 /codon_start=1 /transl_table=11 /gene="echA12" /locus_tag="BQ2027_MB1507" /product="POSSIBLE ENOYL-CoA HYDRATASE ECHA12 (ENOYL HYDRASE) (UNSATURATED ACYL-CoA HYDRATASE) (CROTONASE)" /note="Mb1507, echA12, len: 285 aa. Equivalent to Rv1472, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 285 aa overlap). Possible echA12, enoyl-CoA hydratase (EC 4.2.1.17), highly similar to P53526|ECHH_MYCLE|NP_301896.1|NC_002677 possible enoyl-CoA hydratase/isomerase from Mycobacterium leprae (294 aa), FASTA scores: opt: 1265, E(): 0, (72.0% identity in 271 aa overlap). Also similar to others e.g. CAA66096.1|X97452 enoyl-CoA isomerase from Escherichia coli strain K12 (262 aa); CAC44593.1|AL596162 putative enoyl-CoA hydratase from Streptomyces coelicolor (275 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. ECHA16|Rv2831|MTCY16B7.11c (249 aa), FASTA scores: opt: 232, E(): 1.3e-15, (33.8% identity in 204 aa overlap); etc. TBparse score is 0.916. Protein product from Mb1507 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1507 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7U004" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/Swiss-Prot:Q7U004" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00110.1" /translation="MPHRCAAQVVAGYRSTVSLVLVEHPRPEIAQITLNRPERMNSMA FDVMVPLKEALAQVSYDNSVRVVVLTGAGRGFSSGADHKSAGVVPHVENLTRPTYALR SMELLDDVILMLRRLHQPVIAAVNGPAIGGGLCLALAADIRVASSSAYFRAAGINNGL TASELGLSYLLPRAIGSSRAFEIMLTGRDVSAEEAERIGLVSRQVPDEQLLDACYAIA ARMAGFSRPGIELTKRTLWSGLDAASLEAHMQAEGLGQLFVRLLTANFEEAVAARAEQ RAPVFTDDT" CDS 1661811..1663439 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1508" /product="PROBABLE MACROLIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1508, -, len: 542 aa. Equivalent to Rv1473, len: 542 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 542 aa overlap). Possible macrolide-transport ATP-binding protein ABC transporter (see citation below), possibly in EF-3 subfamily. Similar to many ABC-transporters e.g. D90909_48|YHES_HAEIN from Synechocystis sp. strain PCC6803 (574 aa), FASTA scores: opt: 870, E(): 0, (33.3% identity in 525 aa overlap); P44808|YHES_HAEIN from Haemophilus influenzae (638 aa), FASTA scores: opt: 706, E(): 0, (33.7% identity in 517 aa overlap); etc. Contains two PS00017 ATP/GTP-binding site motif A (P-loop), and two PS00211 ABC transporter family signatures. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1508 detected using SWATH mass spectrometry. Mb1508 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYF2" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR032781" /db_xref="UniProtKB/TrEMBL:A0A1R3XYF2" /protein_id="SIU00111.1" /translation="MITATDLEVRAGARILLAPDGPDLRVQPGDRIGLVGRNGAGKTT TLRILAGEVEPYAGSVTRAGEIGYLPQDPKVGDLDVLARDRVLSARGLDVLLTDLEKQ QALMAEVADEDERDRAIRRYGQLEERFVALGGYGAESEAGRICASLGLPERVLTQRLR TLSGGQRRRVELARILFAASESGAGNSTTLLLDEPTNHLDADSLGWLRDFLRLHTGGL VVISHNVDLVADVVNKVWFLDAVRGQVDVYNMGWQRYVDARATDEQRRIRERANAERK AAALRAQAAKLGAKATKAVAAQNMLRRADRMMAALDEERVADKVARIKFPTPAACGRT PLVANGLGKTYGSLEVFTGVDLAIDRGSRVVILGLNGAGKTTLLRLLAGVEQPDTGVL EPGYGLRIGYFAQEHDTLDNDATVWENVRHAAPDAGEQDLRGLLGAFMFTGPQLEQPA GTLSGGEKTRLALAGLVASTANVLLLDEPTNNLDPASREQVLDALRSYRGAVVLVTHD PGAAAALGPQRVVLLPDGTEDYWSDEYRDLIELA" CDS 1663536..1663727 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1509" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1509, -, len: 63 aa. Equivalent to Rv1473A, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (98.4% identity in 63 aa overlap). Possible transcriptional regulator, CDS predicted by GC plot. Similar to SCI8.24c|AL132644_24 putative transcriptional regulator from Streptomyces coelicolor (73 aa), FASTA scores: opt: 210, E(): 1.5e-08, (56.15% identity in 57 aa overlap). Protein product from Mb1509 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1509 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYF8" /protein_id="SIU00112.1" /translation="MRKSKKTRDQLLRELRNAYEGGASIRNLAATTGRSYGSIHSMLR ESGTTMRGRGGPNRRPRPR" CDS complement(1663796..1664359) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1510C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1510c, -, len: 187 aa. Equivalent to Rv1474c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 187 aa overlap). Probable transcription regulator, equivalent to AF0021|AF002133_1 transcriptional regulator from Mycobacterium avium strain GIR10 (82 aa), FASTA scores: opt: 490, E(): 6.7e-26, (92.5% identity in 80 aa overlap). Also similar to Q59431|UIDR_ECOLI UID OPERON REPRESSOR (GUS OPERON) from Escherichia coli (196 aa), FASTA scores: opt: 192, E(): 5.8e-06, (28.5% identity in 172 aa overlap). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Helix turn helix motif predicted at aa 33-54 (+3.40 SD). Protein product from Mb1510c detected using SWATH mass spectrometry. Mb1510c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0K1" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0K1" /protein_id="SIU00113.1" /translation="MPKVSEDHLAARRRQILDGARRCFAEYGYDKATVRRLEQAIGMS RGAIFHHFRDKDALFFALAREDTERMAAVASREGLIGVMRDMLAAPDQFDWLATRLEI ARKLRNDPDFSRGWAERSAELAAATTDRLRRQKQANRVRDDVPSDVLRCYLDLVLDGL LARLASGEDPQRLAAVLDLVENSVRRS" CDS complement(1664370..1667201) /codon_start=1 /transl_table=11 /gene="acn" /locus_tag="BQ2027_MB1511C" /product="probable iron-regulated aconitate hydratase acn (citrate hydro-lyase) (aconitase)" /note="Mb1511c, acn, len: 943 aa. Equivalent to Rv1475c, len: 943 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 943 aa overlap). Probable acn, aconitate hydratase (EC 4.2.1.3), similar to many e.g. P70920|ACON_BRAJA ACONITATE HYDRATASE from Bradyrhizobium japonicum (906 aa), FASTA scores: opt:1912, E(): 0, (54.8% identity in 958 aa overlap); closest to AF0021|AF002133_2 Mycobacterium avium strain GIR10 (961 aa), FASTA scores: opt: 5072, E(): 0, (82.8% identity in 943 aa overlap). NOTE ACONITASE HAS AN ACTIVE (4FE-4S) AND AN INACTIVE (3FE-4S) FORMS. THE ACTIVE (4FE-4S) CLUSTER IS PART OF THE CATALYTIC SITE THAT INTERCONVERTS CITRATE, CIS-ACONITASE, AND ISOCITRATE. Protein product from Mb1511c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1511c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZG0" /db_xref="InterPro:IPR000573" /db_xref="InterPro:IPR001030" /db_xref="InterPro:IPR006249" /db_xref="InterPro:IPR015928" /db_xref="InterPro:IPR018136" /db_xref="InterPro:IPR036008" /db_xref="UniProtKB/TrEMBL:A0A1R3XZG0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00114.1" /translation="MTSKSVNSFGAHDTLKVGEKSYQIYRLDAVPNTAKLPYSLKVLA ENLLRNEDGSNITKDHIEAIANWDPKAEPSIEIQYTPARVVMQDFTGVPCIVDLATMR EAIADLGGNPDKVNPLAPADLVIDHSVIADLFGRADAFERNVEIEYQRNGERYQFLRW GQGAFDDFKVVPPGTGIVHQVNIEYLASVVMTRDGVAYPDTCVGTDSHTTMVNGLGVL GWGVGGIEAEAAMLGQPVSMLIPRVVGFRLTGEIQPGVTATDVVLTVTEMLRQHGVVG KFVEFYGEGVAEVPLANRATLGNMSPEFGSTAAIFPIDEETIKYLRFTGRTPEQVALV EAYAKAQGMWHDPKHEPEFSEYLELNLSDVVPSIAGPKRPQDRIALAQAKSTFREQIY HYVGNGSPDSPHDPHSKLDEVVEETFPASDPGQLTFANDDVATDETVHSAAAHADGRV SNPVRVKSDELGEFVLDHGAVVIAAITSCTNTSNPEVMLGAALLARNAVEKGLTSKPW VKTTIAPGSQVVNDYYDRSGLWPYLEKLGFYLVGYGCTTCIGNSGPLPEEISKAVNDN DLSVTAVLSGNRNFEGRINPDVKMNYLASPPLVIAYALAGTMDFDFQTQPLGQDKDGK NVFLRDIWPSQQDVSDTIAAAINQEMFTRNYADVFKGDDRWRNLPTPSGNTFEWDPNS TYVRKPPYFEGMTAKPEPVGNISGARVLALLGDSVTTDHISPAGAIKPGTPAARYLDE HGVDRKDYNSFGSRRGNHEVMIRGTFANIRLRNQLLDDVSGGYTRDFTQPGGPQAFIY DAAQNYAAQHIPLVVFGGKEYGSGSSRDWAAKGTLLLGVRAVIAESFERIHRSNLIGM GVIPLQFPEGKSASSLGLDGTEVFDITGIDVLNDGKTPKTVCVQATKGDGATIEFDAV VRIDTPGEADYYRNGGILQYVLRNILKSG" CDS 1667359..1667919 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1512" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb1512, -, len: 186 aa. Equivalent to Rv1476, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 186 aa overlap). Possible membrane protein, TMhelix 138-60. Protein product from Mb1512 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1512 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYG3" /db_xref="UniProtKB/TrEMBL:A0A1R3XYG3" /protein_id="SIU00115.1" /translation="MTGPYFPQTIPFLPSYIPQDVDMTAVKAEVAALGVSAPPAATPG LLEVVQHARDEGIDLKIVLLDHNPPNDTPLRDIATVVGADYSDATVLVLSPNYVGSYS TQYPRVTLEAGEDHSKTGNPVQSAQNFVHELSTPEFPWSALTIVLLIGVLAAAVGARL MQLRGRRSATSTDAAPGAGDDLNQGV" CDS 1668145..1669563 /codon_start=1 /transl_table=11 /gene="ripa" /locus_tag="BQ2027_MB1513" /product="peptidoglycan hydrolase" /note="Mb1513, -, len: 472 aa. Equivalent to Rv1477, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 472 aa overlap). Hypothetical Invasion protein. Possibly exported protein with unusually long signal sequence. The last 277 residues are nearly identical to those of AF0060|AF006054_1 hypothetical invasion protein INV1 from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 1833, E(): 0, (98.2% identity in 277 aa overlap); also very similar to AF0021|AF002133_4 invasin 1 protein from Mycobacterium avium (273 aa), FASTA scores: opt: 1452, E(): 0, (78.1% identity in 279 aa overlap). Similar to Rv1566c|MTCY336.37|Z95586 Mycobacterium tuberculosis cosmid (230 aa), FASTA scores: opt: 528, E(): 4.4e-20, (52.0% identity in 150 aa overlap); and weakly similar to p60 proteins of Listeria spp throughout its length e.g. M80351|LISIAPB_1 Listeria monocytogenes iap-related protein (478 aa), FASTA scores: opt: 251, E(): 8e-06, (24.4% identity in 487 aa overlap). C-terminal domain highly similar to next ORF Rv1478|MTV007.25. Mb1513 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYV5" /db_xref="InterPro:IPR000064" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XYV5" /protein_id="SIU00116.1" /translation="MRRNRRGSPARPAARFVRPAIPSALSVALLVCTPGLATADPQTD TIAALIADVAKANQRLQDLSDEVQAEQESVNKAMVDVETARDNAAAAEDDLEVSQRAV KDANAAIAAAQHRFDTFAAATYMNGPSVSYLSASSPDEIIATVTAAKTLSASSQAVMA NLQRARTERVNTESAARLAKQKADKAAADAKASQDAAVAALTETRRKFDEQREEVQRL AAERDAAQARLQAARLVAWSSEGGQGAPPFRMWDPGSGPAGGRAWDGLWDPTLPMIPS ANIPGDPIAVVNQVLGISATSAQVTANMGRKFLEQLGILQPTDTGITNAPAGSAQGRI PRVYGRQASEYVIRRGMSQIGVPYSWGGGNAAGPSKGIDSGAGTVGFDCSGLVLYSFA GVGIKLPHYSGSQYNLGRKIPSSQMRRGDVIFYGPNGSQHVTIYLGNGQMLEAPDVGL KVRVAPVRTAGMTPYVVRYIEY" CDS 1669574..1670299 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1514" /product="possible invasion protein" /note="Mb1514, -, len: 241 aa. Equivalent to Rv1478, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 241 aa overlap). Hypothetical Invasion protein. Possibly exported protein, nearly identical to AF0060|AF006054_2 hypothetical invasion protein INV2 of M. tuberculosis (240 aa), FASTA scores: opt: 1509, E(): 0, (95.0% identity in 241 aa overlap); very similar to AF0021|AF002133_5 hypothetical invasion protein INV2 from Mycobacterium avium (244 aa), FASTA scores: opt: 1269, E():0, (78.0% identity in 246 aa overlap). Also similar to Mycobacterium tuberculosis protein MTCY336.37 and weakly similar to C-terminal segment of p60 proteins of Listeria spp.e.g. Q01836|P60_LISIN PROTEIN P60 PRECURSOR (481 aa), FASTA scores: opt: 241, E():4e-07, (37.7% identity in 122 aa overlap). Highly similar to C-terminal domain of preceeding ORF Rv1477|MTV007.24 (472 aa), FASTA scores: opt: 864, E(): 0, (60.1% identity in 213 aa overlap). Mb1514 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000064" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH7" /protein_id="SIU00117.1" /translation="MRHTRFHPIKLAWITAVVAGLMVGVATPADAEPGQWDPTLPALV SAGAPGDPLAVANASLQATAQATQTTLDLGRQFLGGLGINLGGPAASAPSAATTGASR IPRANARQAVEYVIRRAGSQMGVPYSWGGGSLQGPSKGVDSGANTVGFDCSGLVRYAF AGVGVLIPRFSGDQYNAGRHVPPAEAKRGDLIFYGPGGGQHVTLYLGNGQMLEASGSA GKVTVSPVRKAGMTPFVTRIIEY" CDS 1670438..1671571 /codon_start=1 /transl_table=11 /gene="moxR1" /locus_tag="BQ2027_MB1515" /standard_name="moxR" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN MOXR1" /note="Mb1515, moxR1, len: 377 aa. Equivalent to Rv1479, len: 377 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 377 aa overlap). Probable moxR1, transcriptional regulatory protein, similar to X96434|BBGIDBMOX_2 moxR regulator from Borrelia burgdorferi (329 aa), FASTA scores: opt: 850, E():0, (43.5% identity in 317 aa overlap); and P. denitrificans. Highly similar to MoxR homologs of Mycobacterium tuberculosis and M. avium (but these both differ at C-terminus) e.g. Rv3692, Rv3164c, and AF0021|AF002133_6 Mycobacterium avium strain GIR10 (309 aa), FASTA scores: opt: 1181, E(): 0, (83.7% identity in 227 aa overlap). Also similar to O33173|AF006054 MoxR fragment from Mycobacterium tuberculosis (211 aa), FASTA scores: opt: 1305, E(): 0, (94.3% identity in 212 aa overlap). Note that previously known as moxR. Protein product from Mb1515 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1515 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYH3" /db_xref="InterPro:IPR011703" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041628" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00118.1" /translation="MTSAGGFPAGAGGYQTPGGHSASPAHEAPPGGAEGLAAEVHTLE RAIFEVKRIIVGQDQLVERMLVGLLSKGHVLLEGVPGVAKTLAVETFARVVGGTFSRI QFTPDLVPTDIIGTRIYRQGREEFDTELGPVVANFLLADEINRAPAKVQSALLEVMQE RHVSIGGRTFPMPSPFLVMATQNPIEHEGVYPLPEAQRDRFLFKINVGYPSPEEEREI IYRMGVTPPQAKQILSTGDLLRLQEIAANNFVHHALVDYVVRVVFATRKPEQLGMNDV KSWVAFGASPRASLGIIAAARSLALVRGRDYVIPQDVIEVIPDVLRHRLVLTYDALAD EISPEIVINRVLQTVALPQVNAVPQQGHSVPPVMQAAAAASGR" CDS 1671568..1672521 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1516" /product="conserved protein" /note="Mb1516, -, len: 317 aa. Equivalent to Rv1480, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 317 aa overlap). Conserved hypothetical protein, last 110 aa residues correspond to first 110 aa of YS01_MYCAV|O07394 hypothetical 18.7 KD Mycobacterium avium protein MAV169 (169 aa), FASTA scores: opt: 642, E(): 0, (84.2% identity in 114 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv3163c and Rv3693. Protein product from Mb1516 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1516 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002881" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/Swiss-Prot:P64854" /protein_id="SIU00119.1" /translation="MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVL HGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVV DMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQH QHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAI AARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREFSIDPALRDDFARAAAAHRAD VARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALAGHQ" CDS 1672532..1673539 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1517" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1517, -, len: 335 aa. Equivalent to Rv1481, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 335 aa overlap). Probable membrane protein, highly similar to YS02_MYCAV|O07395 hypothetical 36.1 kd protein mav335 from M. avium (335 aa), FASTA scores: opt: 1904, E(): 0, (89.0% identity in 337 aa overlap). Similar to AF116251|AF116251_1 BatA protein from Bacteroides fragilis (327 aa), FASTA scores: opt: 317, E(): 2e-12, (26.5% identity in 340 aa overlap). Protein product from Mb1517 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1517 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64856" /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR022933" /db_xref="InterPro:IPR024163" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/Swiss-Prot:P64856" /protein_id="SIU00120.1" /translation="MTLPLLGPMTLSGFAHSWFFLFLFVVAGLVALYILMQLARQRRM LRFANMELLESVAPKRPSRWRHVPAILLVLSLLLFTIAMAGPTHDVRIPRNRAVVMLV IDVSQSMRATDVEPSRMVAAQEAAKQFADELTPGINLGLIAYAGTATVLVSPTTNREA TKNALDKLQFADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLFSDGKETMPTN PDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGG NSYNAATLAELRAVYSSLQQQIGYETIKGDASVGWLRLGALALALAALAALLINRRLP T" CDS complement(1673612..1674454) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1518C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1518c, -, len: 280 aa. Equivalent to Rv1482c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 280 aa overlap). Conserved hypothetical protein, highly similar to O07396|AF002133 Mycobacterium avium protein MAV346 (346 aa), FASTA scores: E(): 0, (65.2% identity in 342 aa overlap); slight similarity to GRPE_ECOLI|P09372 heat shock protein from E. coli (197 aa), FASTA scores: opt: 139, E(): 0.012, (28.3% identity in 159 aa overlap). Similar to Mycobacterium tuberculosis hypothetical proteins Rv3517, Rv3555c, Rv3714c, Rv1073, etc. Start changed since first submission (-59 aa)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYG0" /protein_id="SIU00121.1" /translation="MTDPFLGSEALAAGVLTPYELRSRYVALHKDVYVPQGVELTAQL RAKALWLRSRRRGVLAGYSASAFHGAKWIDADLPAAIIDTNRRRAPGLQVWEERIEPD EICVIEGMRVTTPERTALDLTSRFPLDPAVAAVDALIQATDLKVADVEPLIERYRGRR GMKAARAALDLVDGGAQSPKETWLRLLLIRAGFPRPQTQIAVRNEWGWAEAHLDMGWQ DIKVAAEYDGDHHLTSRYHYRKDILRHEKVQHRYGWIVVRVVAEDHPADIIRRVGEAR AFRA" CDS 1674595..1675338 /codon_start=1 /transl_table=11 /gene="fabG1" /locus_tag="BQ2027_MB1519" /standard_name="mabA" /product="3-oxoacyl-[acyl-carrier protein] reductase fabg1 (3-ketoacyl-acyl carrier protein reductase) (mycolic acid biosynthesis a protein)" /note="Mb1519, fabG1, len: 247 aa. Equivalent to Rv1483, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 247 aa overlap). Probable fabG1 (alternate gene name: mabA), 3-oxoacyl-[acyl-carrier protein] reductase (EC 1.1.1.100), equivalent to O07399|FABG_MYCAV 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Mycobacterium avium (255 aa); P71534|FABG_MYCSM 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Mycobacterium smegmatis (255 aa); and NP_302228.1|NC_002677 3-oxoacyl-[ACP] reductase (aka MabA) from Mycobacterium leprae (253 aa). Also highly similar to many e.g. T36779 probable 3-oxacyl-(acyl-carrier-protein) reductase from Streptomyces coelicolor (234 aa); FABG_ECOLI|P25716|NP_415611.1|NC_000913 3-oxoacyl-[acyl-carrier-protein] reductase from Escherichia coli strain K12 (244 aa), FASTA scores: opt: 664, E(): 6.8e-35, (44.4% identity in 241 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1519 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1519 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5Y5" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A5Y5" /protein_id="SIU00122.1" /translation="MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAV THRGSGAPKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRMTE EKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQANYAASKAGV IGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQGALQFIPAKRVGTPAEVA GVVSFLASEDASYISGAVIPVDGGMGMGH" CDS 1675357..1676166 /codon_start=1 /transl_table=11 /gene="inhA" /locus_tag="BQ2027_MB1520" /product="NADH-DEPENDENT ENOYL-[ACYL-CARRIER-PROTEIN] REDUCTASE INHA (NADH-DEPENDENT ENOYL-ACP REDUCTASE)" /note="Mb1520, inhA, len: 269 aa. Equivalent to Rv1484, len: 269 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 269 aa overlap). inhA, NADH-dependent enoyl-[acyl-carrier-protein] reductase (EC 1.3.1.9) (see citations below). Identical to INHA_MYCTU|P46533 enoyl-[acyl-carrier-protein] reductase from Mycobacterium tuberculosis and G1155270 Mycobacterium bovis enoyl acp reductase. SOME SIMILARITY TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1520 detected using shotgun mass spectrometry. Mb1520 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Y7" /db_xref="InterPro:IPR014358" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A5Y7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00123.1" /translation="MTGLLDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFDRL RLIQRITDRLPAKAPLLELDVQNEEHLASLAGRVTEAIGAGNKLDGVVHSIGFMPQTG MGINPFFDAPYADVSKGIHISAYSYASMAKALLPIMNPGGSIVGMDFDPSRAMPAYNW MTVAKSALESVNRFVAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLL EEGWDQRAPIGWNMKDATPVAKTVCALLSDWLPATTGDIIYADGGAHTQLL" CDS 1676172..1677206 /codon_start=1 /transl_table=11 /gene="hemZ" /locus_tag="BQ2027_MB1521" /product="FERROCHELATASE HEMZ (PROTOHEME FERRO-LYASE) (HEME SYNTHETASE)" /note="Mb1521, hemZ, len: 344 aa. Equivalent to Rv1485, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 344 aa overlap). hemZ, ferrochelatase (EC 4.99.1.1) (see citation below), similar to many e.g. HEMZ_BACSU|P32396 ferrochelatase from Bacillus subtilus (310 aa), FASTA scores: opt:490, E(): 2e-24, (30.2% identity in 295 aa overlap); etc. BELONGS TO THE FERROCHELATASE FAMILY. Protein product from Mb1521 detected using SWATH mass spectrometry. Mb1521 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A577" /db_xref="InterPro:IPR001015" /db_xref="InterPro:IPR019772" /db_xref="InterPro:IPR033644" /db_xref="InterPro:IPR033659" /db_xref="UniProtKB/Swiss-Prot:P0A577" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00124.1" /translation="MQFDAVLLLSFGGPEGPEQVRPFLENVTRGRGVPAERLDAVAEH YLHFGGVSPINGINRTLIAELEAQQELPVYFGNRNWEPYVEDAVTAMRDNGVRRAAVF ATSAWSGYSSCTQYVEDIARARRAAGRDAPELVKLRPYFDHPLFVEMFADAITAAAAT VRGDARLVFTAHSIPTAADRRCGPNLYSRQVAYATRLVAAAAGYCDFDLAWQSRSGPP QVPWLEPDVTDQLTGLAGAGINAVIVCPIGFVADHIEVVWDLDHELRLQAEAAGIAYA RASTPNADPRFARLARGLIDELRYGRIPARVSGPDPVPGCLSSINGQPCRPPHCVASV SPARPSAGSP" CDS complement(1677172..1678038) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1522C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1522c, -, len: 288 aa. Equivalent to Rv1486c, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 288 aa overlap). Conserved hypothetical protein, highly similar to YS07_MYCAV|O07402 hypothetical 33.5 kd protein mav321 from Mycobacterium avium (320 aa), FASTA scores: opt: 1217, E(): 0, (71.1% identity in 315 aa overlap). Weak similarity to AL079332|SCI5.07 hypothetical protein from Streptomyces coelicolor (259 aa), FASTA scores: opt: 131, E(): 0.29, (32.3% identity in 279 aa overlap). Start changed since original submission. Protein product from Mb1522c detected using SWATH mass spectrometry. Mb1522c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P59980" /protein_id="SIU00125.1" /translation="MWCPSVSLSIWANAWLAGKAAPDDVLDALSLWAPTQSVAAYDAV AAGHTGLPWPDVHDAGTVSLLQTLRAAVGRRRLRGTINVVLPVPGDVRGLAAGTQFEH DALAAGEAVIVANPEDPGSAVGLVPEFSYGDVDEAAQSEPLTPELCALSWMVYSLPGA PVLEHYELGDAEYALRSAVRSAAEALSTIGLGSSDVANPRGLVEQLLESSRQHRVPDH APSRALRVLENAAHVDAIIAVSAGLSRLPIGTQSLSDAQRATDALRPLTAVVRSARMS AVTAILHSAWPD" CDS 1678096..1678530 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1523" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1523, -, len: 144 aa. Equivalent to Rv1487, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 144 aa overlap). Conserved membrane protein. Highly similar to O07404|AF002133 MAV145 from Mycobacterium avium (145 aa), FASTA scores: opt: 667, E(): 0, (72.5% identity in 142 aa overlap). Also similar to AL079332|SCI5.05 hypothetical protein from Streptomyces coelicolor (143 aa), FASTA scores: opt: 344, E(): 1.3e-15, (44.8% identity in 134 aa overlap). Protein product from Mb1523 detected using SWATH mass spectrometry. Mb1523 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYW5" /db_xref="InterPro:IPR002810" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW5" /protein_id="SIU00126.1" /translation="MPVALIWLIAALVLVGAEALTGDMFLLMLGGGALAASVSSWLLA WPMWADGAVFLLVSVLLLVLVRPAVRRRLTQTKGVQLGIEALEGKKAVVLGRVARDGG QVKLDGQVWTARPLNDGDVFEPGDSVTVVQIDGATAVVFKDV" CDS 1678552..1679697 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1524" /product="Protein QmcA (possibly involved in integral membrane quality control)" /note="Mb1524, -, len: 381 aa. Equivalent to Rv1488, len: 381 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 381 aa overlap). Possible exported conserved protein; contains possible N-terminal signal sequence. Similar to YBBK_ECOLI|P77367 hypothetical protein ybbK from Escherichia coli (305 aa), FASTA scores: opt: 716, E(): 0, (37.1% identity in 307 aa overlap). Similar to stomatin-like proteins e.g. AF065260|AF065260_1 Clostridium difficile (320 aa), FASTA scores: opt: 767, E(): 0, (42.3% identity in 307 aa overlap). Protein product from Mb1524 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1524 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63694" /db_xref="InterPro:IPR001107" /db_xref="InterPro:IPR001972" /db_xref="InterPro:IPR018080" /db_xref="InterPro:IPR036013" /db_xref="UniProtKB/Swiss-Prot:P63694" /protein_id="SIU00127.1" /translation="MQGAVAGLVFLAVLVIFAIIVVAKSVALIPQAEAAVIERLGRYS RTVSGQLTLLVPFIDRVRARVDLRERVVSFPPQPVITEDNLTLNIDTVVYFQVTVPQA AVYEISNYIVGVEQLTTTTLRNVVGGMTLEQTLTSRDQINAQLRGVLDEATGRWGLRV ARVELRSIDPPPSIQASMEKQMKADREKRAMILTAEGTREAAIKQAEGQKQAQILAAE GAKQAAILAAEADRQSRMLRAQGERAAAYLQAQGQAKAIEKTFAAIKAGRPTPEMLAY QYLQTLPEMARGDANKVWVVPSDFNAALQGFTRLLGKPGEDGVFRFEPSPVEDQPKHA ADGDDAEVAGWFSTDTDPSIARAVATAEAIARKPVEGSLGTPPRLTQ" CDS 1679707..1680063 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1525" /product="conserved protein" /note="Mb1525, -, len: 118 aa. Equivalent to Rv1489, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 118 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium avium subsp. paratuberculosis and Streptomyces coelicolor e.g. AJ250017_1 insertion sequence IS900, Locus 3, putative invasion protein from M. paratuberculosis (138 aa), FASTA scores: opt: 120, E(): 0.26, (34.375% identity in 96 aa overlap); SCD6.11c|AL353815_11 possible integral membrane protein from Streptomyces coelicolor (136 aa), FASTA scores: opt: 106, E(): 2.2, (35.9% identity in 103 aa overlap). ORF predicted by GC plot. Replaces previous Rv1489c on other strand. Protein product from Mb1525 detected using shotgun mass spectrometry. Mb1525 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYI2" /db_xref="InterPro:IPR032808" /db_xref="UniProtKB/TrEMBL:A0A1R3XYI2" /protein_id="SIU00128.1" /translation="MSGLTSPKTYAVLAALQAGDAVACAIPLPPIARLLDDLDVPVSV RPVLPVVKAASAVGLLSVTRFPALARLTTAMLTLYFILAVGAHVRVRDRVVNAIPAAS FLTLFALMTAKGPERT" CDS 1680097..1680327 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1526" /product="Methylmalonyl-CoA mutase large subunit, MutB (EC" /EC_number="5.4.99.2" /note="Mb1526, -, len: 76 aa. Equivalent to Rv1489A, len: 76 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 76 aa overlap). Conserved hypothetical protein, similar to part of alpha subunit of many methylmalonyl-CoA mutases (~750 aa). Size difference suggests possible gene fragment although Mycobacterium tuberculosis has intact methylmalonyl-CoA mutase gene. P71774|MUTB_MYCTU PROBABLE METHYLMALONYL-COA MUTASE from Mycobacterium tuberculosis (750 aa), FASTA scores: opt: 258, E(): 3.2e-10, (73.35% identity in 60 aa overlap). ORF predicted by GC plot. Mb1526 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYH1" /db_xref="InterPro:IPR006099" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH1" /protein_id="SIU00129.1" /translation="MSVGEVEVLKVENSRVRAEQLAKLYELRSSRDRVRVDAALAELS RAAAARGCAGTSGLGNNLMAPGPPHSLLGRDR" CDS 1680477..1681784 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1527" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1527, -, len: 435 aa. Equivalent to Rv1490, len: 435 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 435 aa overlap). Probable membrane protein. Protein product from Mb1527 detected using SWATH mass spectrometry." /db_xref="GOA:P64858" /db_xref="UniProtKB/Swiss-Prot:P64858" /protein_id="SIU00130.1" /translation="MSQCFAVKGIGGADQATLGSAEILVKYAQLADKRARVYVLVSTW LVVWGIWHVYFVEAVFPNAILWLHYYAASYEFGFVRRGLGGELIRMLTGDHFFAGAYT VLWTSITVWLIALAVVVWLILSTGNRSERRIMLALLVPVLPFAFSYAIYNPHPELFGM TALVAFSIFLTRAHTSRTRVILSTLYGLTMAVLALIHEAIPLEFALGAVLAIIVLSKN ATGATRRICTALAIGPGTVSVLLLAVVGRRDIADQLCAHIPHGMVENPWAVATTPQRV LDYIFGRVESHADYHDWVCEHVTPWFNLDWITSAKLVAVVGFRALFGAFLLGLLFFVA TTSMIRYVSAVPVRTFFAELRGNLALPVLASALLVPLFITAVDWTRWWVMITLDVAIV YILYAIDRPEIEQPPSRRNVQVFVCVVLVLAVIPTGSANNIGR" CDS complement(1682363..1683121) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1528C" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1528c, -, len: 252 aa. Equivalent to Rv1491c, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 252 aa overlap). Conserved membrane protein. Similar to hypothetical proteins from many organisms e.g. YDJZ_ECOLI|P76221 Escherichia coli (235 aa), FASTA scores: opt: 223, E():6.7e-07, (31.7% identity in 145 aa overlap); AL133252|SCE46.15 Streptomyces coelicolor (249 aa), FASTA scores: opt: 378, E(): 1.5e-17, (39.1% identity in 169 aa overlap). Also similar to M. tuberculosis hypothetical protein Rv0625c. Mb1528c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67118" /db_xref="InterPro:IPR015414" /db_xref="InterPro:IPR032816" /db_xref="UniProtKB/Swiss-Prot:P67118" /protein_id="SIU00131.1" /translation="MTAPAICNTTETVHGIATSLGAVARQASLPRIVGTVVGITVLVV VALLVPVPTAVELRDWAKSLGAWFPLAFLLVHTVVTVPPFPRTAFTLAAGLLFGSVVG VFIAVVGSTASAVIAMLLVRATGWQLNSLVRRRAINRLDERLRERGWLAILSLRLIPV VPFAAINYAAGASGVRILSFAWATLAGLLPGTAAVVILGDAFAGSGSPLLILVSVCTG ALGLTGLVYEIRNYRRQHRRMPGYDDPVREPALI" CDS 1683312..1685159 /codon_start=1 /transl_table=11 /gene="mutA" /locus_tag="BQ2027_MB1529" /product="PROBABLE METHYLMALONYL-COA MUTASE SMALL SUBUNIT MUTA (MCM)" /note="Mb1529, mutA, len: 615 aa. Equivalent to Rv1492, len: 615 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 615 aa overlap). Probable mutA, Methylmalonyl-coa mutase small-subunit (EC 5.4.99.2), strong similarity to e.g. MUTA_STRCM|Q05064 methylmalonyl-coa mutase beta-subunit from Streptomyces cinnamonensis (616 aa), FASTA scores: opt: 1512, E(): 0, (45.9% identity in 628 aa overlap). Contains PS00213 Lipocalin signature, PS00544 Methylmalonyl-CoA mutase signature. BELONGS TO THE METHYLMALONYL-COA MUTASE FAMILY. Protein product from Mb1529 detected using shotgun mass spectrometry. Mb1529 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65486" /db_xref="InterPro:IPR004608" /db_xref="InterPro:IPR006099" /db_xref="InterPro:IPR016176" /db_xref="InterPro:IPR036724" /db_xref="UniProtKB/Swiss-Prot:P65486" /protein_id="SIU00132.1" /translation="MSIDVPERADLEQVRGRWRNAVAGVLSKSNRTDSAQLGDHPERL LDTQTADGFAIRALYTAFDELPEPPLPGQWPFVRGGDPLRDVHSGWKVAEAFPANGAT ADTNAAVLAALGEGVSALLIRVGESGVAPDRLTALLSGVYLNLAPVILDAGADYRPAC DVMLALVAQLDPGQRDTLSIDLGADPLTASLRDRPAPPIEEVVAVASRAAGERGLRAI TVDGPAFHNLGATAATELAATVAAAVAYLRVLTESGLVVSDALRQISFRLAADDDQFM TLAKMRALRQLWARVAEVVGDPGGGAAVVHAETSLPMMTQRDPWVNMLRCTLAAFGAG VGGADTVLVHPFDVAIPGGFPGTAAGFARRIARNTQLLLLEESHVGRVLDPAGGSWFV EELTDRLARRAWQRFQAIEARGGFVEAHDFLAGQIAECAARRADDIAHRRLAITGVNE YPNLGEPALPPGDPTSPVRRYAAGFEALRDRSDHHLARTGARPRVLLLPLGPLAEHNI RTTFATNLLASGGIEAIDPGTVDAGTVGNAVADAGSPSVAVICGTDARYRDEVADIVQ AARAAGVSRVYLAGPEKALGDAAHRPDEFLTAKINVVQALSNLLTRLGA" CDS 1685160..1687412 /codon_start=1 /transl_table=11 /gene="mutB" /locus_tag="BQ2027_MB1530" /product="PROBABLE METHYLMALONYL-COA MUTASE LARGE SUBUNIT MUTB (MCM)" /note="Mb1530, mutB, len: 750 aa. Equivalent to Rv1493, len: 750 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 750 aa overlap). Probable mutB, Methylmalonyl-coa mutase large-subunit (EC 5.4.99.2), strong similarity to e.g. MUTB_STRCM|Q05065 methylmalonyl-coa mutase alpha-subunit from Streptomyces cinnamonensis (733 aa), FASTA scores: opt: 3562, E(): 0, (75.8% identity in 730 aa overlap). Contains PS00544 Methylmalonyl-CoA mutase signature. BELONGS TO THE METHYLMALONYL-COA MUTASE FAMILY. Protein product from Mb1530 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1530 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65488" /db_xref="InterPro:IPR006098" /db_xref="InterPro:IPR006099" /db_xref="InterPro:IPR006158" /db_xref="InterPro:IPR006159" /db_xref="InterPro:IPR016176" /db_xref="InterPro:IPR036724" /db_xref="UniProtKB/Swiss-Prot:P65488" /protein_id="SIU00133.1" /translation="MTTKTPVIGSFAGVPLHSERAAQSPTEAAVHTHVAAAAAAHGYT PEQLVWHTPEGIDVTPVYIAADRAAAEAEGYPLHSFPGEPPFVRGPYPTMYVNQPWTI RQYAGFSTAADSNAFYRRNLAAGQKGLSVAFDLATHRGYDSDHPRVQGDVGMAGVAID SILDMRQLFDGIDLSTVSVSMTMNGAVLPILALYVVAAEEQGVAPEQLAGTIQNDILK EFMVRNTYIYPPKPSMRIISDIFAYTSAKMPKFNSISISGYHIQEAGATADLELAYTL ADGVDYIRAGLNAGLDIDSFAPRLSFFWGIGMNFFMEVAKLRAGRLLWSELVAQFAPK SAKSLSLRTHSQTSGWSLTAQDVFNNVARTCIEAMAATQGHTQSLHTNALDEALALPT DFSARIARNTQLVLQQESGTTRPIDPWGGSYYVEWLTHRLARRARAHIAEVAEHGGMA QAISDGIPKLRIEEAAARTQARIDSGQQPVVGVNKYQVPEDHEIEVLKVENSRVRAEQ LAKLQRLRAGRDEPAVRAALAELTRAAAEQGRAGADGLGNNLLALAIDAARAQATVGE ISEALEKVYGRHRAEIRTISGVYRDEVGKAPNIAAATELVEKFAEADGRRPRILIAKM GQDGHDRGQKVIATAFADIGFDVDVGSLFSTPEEVARQAADNDVHVIGVSSLAAGHLT LVPALRDALAQVGRPDIMIVVGGVIPPGDFDELYAAGATAIFPPGTVIADAAIDLLHR LAERLGYTLD" CDS 1687426..1687728 /codon_start=1 /transl_table=11 /gene="maze4" /locus_tag="BQ2027_MB1531" /product="possible antitoxin maze4" /note="Mb1531, -, len: 100 aa. Equivalent to Rv1494, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 100 aa overlap). Hypothetical unknown protein. Mb1531 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P0A5F0" /protein_id="SIU00134.1" /translation="MPFLVALSGIISGVRDHSMTVRLDQQTRQRLQDIVKGGYRSANA AIVDAINKRWEALHDEQLDAAYAAAIHDNPAYPYESEAERSAARARRNARQQRSAQ" CDS 1687725..1688042 /codon_start=1 /transl_table=11 /gene="mazf4" /locus_tag="BQ2027_MB1532" /product="possible toxin mazf4" /note="Mb1532, -, len: 105 aa. Equivalent to Rv1495, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 105 aa overlap). Conserved hypothetical protein, some similarity to Rv1942c|MTCY09F9.22 hypothetical protein from Mycobacterium tuberculosis (109 aa) (0.7% identity in 101 aa overlap) and Rv0659c, Rv1102c. Protein product from Mb1532 detected using shotgun mass spectrometry. Mb1532 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64860" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/Swiss-Prot:P64860" /protein_id="SIU00135.1" /translation="MNAPLRGQVYRCDLGYGAKPWLIVSNNARNRHTADVVAVRLTTT RRTIPTWVAMGPSDPLTGYVNADNIETLGKDELGDYLGEVTPATMNKINTALATALGL PWP" CDS 1688039..1689043 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1533" /product="Possible transport system kinase" /note="Mb1533, -, len: 334 aa. Equivalent to Rv1496, len: 334 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 334 aa overlap). Possible transport system kinase (EC 2.7.-.-). Equivalent to NP_302220.1|NC_002677 putative kinase from Mycobacterium leprae (327 aa). Highly similar to several transport system kinases and NTPase transporters e.g. P27254|ARGK_ECOLI|B2918 LAO/AO transport system kinase (EC 2.7.-.-) from Escherichia coli K12 (331 aa) (see citation below); NP_311815.1|NC_002695 ATPase component of two convergent arginine transporter from Escherichia coli O157:H7 (331 aa); etc. Also similar to YPLE_CAUCR|P37895 hypothetical 34.6 kd protein in Caulobacter crescentus (326 aa), FASTA scores, opt: 1125, E(): 0, (55.7% identity in 316 aa overlap). Protein product from Mb1533 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1533 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63578" /db_xref="InterPro:IPR005129" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63578" /protein_id="SIU00136.1" /translation="MMAASHDDDTVDGLATAVRGGDRAALPRAITLVESTRPDHREQA QQLLLRLLPDSGNAHRVGITGVPGVGKSTAIEALGMHLIERGHRVAVLAVDPSSTRTG GSILGDKTRMARLAVHPNAYIRPSPTSGTLGGVTRATRETVVLLEAAGFDVILIETVG VGQSEVAVANMVDTFVLLTLARTGDQLQGIKKGVLELADIVVVNKADGEHHKEARLAA RELSAAIRLIYPREALWRPPVLTMSAVEGRGLAELWDTVERHRQVLTGAGEFDARRRD QQVDWTWQLVRDAVLDRVWSNPTVRKVRSELERRVRAGELTPALAAQQILEIANLTDR " CDS 1689096..1690385 /codon_start=1 /transl_table=11 /gene="lipL" /locus_tag="BQ2027_MB1534" /product="PROBABLE ESTERASE LIPL" /note="Mb1534, lipL, len: 429 aa. Equivalent to Rv1497, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 429 aa overlap). Probable LipL, esterase (EC 3.1.-.-), very similar to Mycobacterium tuberculosis hypothetical esterases and penicillin binding proteins e.g. Rv1923, Rv2463, Rv3775, etc. Also similar to G151214|M68491 esterase estA from Pseudomonas sp (389 aa), FASTA scores: opt: 604, E(): 1e-31, (34.4% identity in 389 aa overlap). Protein product from Mb1534 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1534 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ7" /protein_id="SIU00137.1" /translation="MMVDTGVDHRAVSSHDGPDAGRRVFGAADPRFACVVRAFASMFP GRRFGGGALAVYLDGQPVVDVWKGWADRAGWVPWSADSAPMVFSATKGMTATVIHRLA DRGLIDYEAPVAEYWPAFGANGKATLTVRDVMRHQAGLSGLRGATQQDLLDHVVMEER LAAAVPGRLLGKSAYHALTFGWLMSGLARAVTGKDMRLLFREELAEPLDTDGLHLGRP PADAPTRVAEIIMPQDIAANAVLTCAMRRLAHRFSGGFRSMYFPGAIAAVQGEAPLLD AEIPAANGVATARALARMYGAIANGGEIDGIRFLSRELVTGLTRNRRQVLPDRNLLVP LNFHLGYHGMPIGNVMPGFGHVGLGGSIGWTDPETGVAFALVHNRLLSPLVMTDHAGF VGIYHLIRQAAAQARKRGYQPVTPFGAPYSEPGAAAG" CDS complement(1690458..1691231) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1535C" /product="PROBABLE METHYLTRANSFERASE" /note="Mb1535c, -, len: 205 aa. Equivalent to Rv1498c, len: 205 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 205 aa overlap). Probable methyltransferase (EC 2.1.1.-). Similar to G2792343|AF040571 METHYLTRANSFERASE from AMYCOLATOPSIS MEDITERRANEI (272 aa), FASTA scores: E(): 5.1e-11, (32.3% identity in 124 aa overlap). Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb1535c detected using SWATH mass spectrometry. Mb1535c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYJ0" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00138.1" /translation="MTRSKRGSADGGSAEALPPKSLRQFVGGAYKEVGAEFVGYLVDL CGLQPDEAVLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQ FEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEHYLDEISRV LKPGGRCLCTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIHKKRPEEAIGLPETFVR DVYGKFGLAVHEPLHYGSWSGREPHLSFQDIVIATKTAS" CDS complement(1691289..1691480) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1536C" /product="Dodecin, a flavin storage/sequestration protein" /note="Mb1536c, -, len: 63 aa. Equivalent to Rv1498A, len: 70 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 63 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. from Streptomyces coelicolor, Sinorhizobium meliloti and Pseudomonas aeruginosa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (a-g) at the 5' start leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (63 aa versus 70 aa). Protein product from Mb1536c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1536c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009923" /db_xref="InterPro:IPR025543" /db_xref="InterPro:IPR036694" /db_xref="UniProtKB/TrEMBL:A0A1R3XYI0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00139.1" /translation="MIEIVGTSPDGVDAAIQGGLARAAQTMRALDWFEVQSIRGHLVD GAVAHFQVTMKVGFRLEDS" CDS 1691562..1691960 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1537" /product="HYPOTHETICAL PROTEIN" /note="Mb1537, -, len: 132 aa. Equivalent to Rv1499, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 132 aa overlap). Hypothetical unknown protein, was initially longer but has been shortened owing to overlap with Rv1498A." /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ5" /protein_id="SIU00140.1" /translation="MPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIA TFDQKRPAVGVDEHDPGGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPA LRPTKAAATTAATTWIERVQNRRGRHSALV" CDS 1692005..1693033 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1538" /product="PROBABLE GLYCOSYLTRANSFERASE" /note="Mb1538, -, len: 342 aa. Equivalent to Rv1500, len: 342 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 342 aa overlap). Probable glycosyltransferase (EC 2.-.-.-), hydrophobic domain near C-terminus. Some similarity to putative glycosyl-transferases from Bacillus subtilis e.g. O34319|YKCC_BACSU (323 aa), opt: 490, E(): 6.1e-25, (28.85% identity in 312 aa overlap) and to N-acetyl glucosamine transferases. Also similar to G1001347 hypothetical 36.7 kDa protein (318 aa), FASTA scores: opt: 523, E(): 7.2e-26, (30.6% identity in 307 aa overlap). Protein product from Mb1538 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1538 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYH5" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3XYH5" /protein_id="SIU00141.1" /translation="MRLSIVTTMYMSEPYVLEFYRRARAAADKITPDVEIIFVDDGSP DAALQQAVSLLDSDPCVRVIQLSRNFGHHKAMMTGLAHATGDLVFLIDSDLEEDPALL EPFYEKLISTGADVVFGCHARRPGGWLRNFGPKIHYRASALLCDPPLHENTLTVRLMT ADYVRSLVQHQERELSIAGLWQITGFYQVPMSVNKAWKGTTTYTFRRKVATLVDNVTS FSNKPLVFIFYLGAAIFIISSSAAGYLIIDRIFFRALQAGWASVIVSIWMLGGVTIFC IGLVGIYVSKVFIETKQRPYTIIRRIYGSDLTTREPSSLKTAFPAAHLSNGKRVTSEP EGLATGNR" CDS 1693045..1693866 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1539" /product="Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin" /note="Mb1539, -, len: 273 aa. Equivalent to Rv1501, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 273 aa overlap). Conserved hypothetical protein, some similarity to O06374|Rv3633|MTCY15C10.19C hypothetical protein from Mycobacterium tuberculosis, FASTA scores: E(): 3.9e-10, (27.5% identity in 280 aa overlap). Protein product from Mb1539 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1539 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008775" /db_xref="UniProtKB/Swiss-Prot:P67771" /protein_id="SIU00142.1" /translation="MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMY RVQERILTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAILH LQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFTRDTGATLVVP GSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAGRNTSGKDRLAINHQFTRS FFKQQIDYVRALGDAVVLEQPARTQQLLGWYSRVVTNLDEYYQPPDKRLYRKGQG" CDS 1694079..1694429 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1540" /product="HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb1540, -, len: 116 aa. Equivalent to the 5' end of Rv1502, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (94.7% identity in 114 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1502 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Rv1502 into 2 parts, Mb1540 and Mb1541. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1502 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), splits Rv1502 into 2 parts, Mb1540 and Mb1541. Mb1540 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023296" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Q2" /protein_id="SIU00143.1" /translation="MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRD GQNRSSIGSVIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYY TGWNSLSPCPGKTP" CDS 1694432..1694977 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1541" /product="HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb1541, -, len: 189 aa. Equivalent to the 3' end of Rv1502, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 189 aa overlap). Hypothetical unknown protein. Protein product from Mb1541 detected using SWATH mass spectrometry. Mb1541 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR023296" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI4" /protein_id="SIU00144.1" /translation="MAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYRMW YGSNLGWGEGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPCVVRDAGV YRMWFCARGAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQR FMLYSGDGYGRTGFGLAVLEN" CDS complement(1695150..1696298) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1542C" /product="dTDP-4-amino-4,6-dideoxygalactose transaminase (EC" /EC_number="2.6.1.59" /note="Mb1542c, -, len: 382 aa. Equivalent to Rv1503c and Rv1504c, len: 182 aa and 199 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 182 aa overlap and 100% identity in 199 aa overlap). Conserved hypothetical protein, similar to C-terminal region of P27833|RFFA_ECOLI LIPOPOLYSACCHARIDE BIOSYNTHESIS PROTEIN from Escherichia coli (376 aa), FASTA scores: opt: 565, E(): 0, (49.4% identity in 170 aa overlap) and similar to N-terminal region of P27833|RFFA_ECOLI LIPOPOLYSACCHARIDE BIOSYNTHESIS PROTEIN from Escherichia coli (376 aa), FASTA scores: opt: 863, E(): 0, (68.0% identity in 194 aa overlap); Rv1503c and Rv1504c are both similar to RFFA_ECOLI but are separated by a stop codon, sequence appears to be correct so possible pseudogene. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1503c and Rv1504c exist as 2 genes. In Mycobacterium bovis, a single base transversion (t-g) leads to a single product. Mb1542c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYJ3" /db_xref="InterPro:IPR000653" /db_xref="InterPro:IPR012749" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ3" /protein_id="SIU00145.1" /translation="MSDHKVPFNRPYMTGRELAYIAEAHSCGHLAGDGPFTRRSHAWL EQQTGCRKALLTPSCTAALEMMALLLDIEEGDEVILPSYTFVSTANAFVLRGGVPVFV DIRPDTLNIDETRIVDAITPRTKAIVPVHYAGVACEMDAIMKIATHHNLAVVEDAAQG AMASYRGRALGSIGDLGALSFHETKNVISGEGGALLVNSEDFLLRAEILREKGTNRSR FLRNEVDKYTWQDKGSSYLPSELVAAFLWAQFEEAERITRIRLDLWNRYHESFESLEQ RGLLRRPIIPQGCSHNAHMYYVLLAPSADREEVLARLTSKGIGAVFHYVPLHDSPAGR RYGRTNGNLTVTNDVASRLIRLPMWVGLQEVDQSRVVEALTRILTLRA" CDS complement(1696435..1697100) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1543C" /product="Putative transferase" /note="Mb1543c, -, len: 221 aa. Equivalent to Rv1505c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 221 aa overlap). Conserved hypothetical protein, some similarity to hypothetical proteins and glycosylases e.g. P71063|O08181 HYPOTHETICAL 22.5 KD PROTEIN YVFD from Bacillus subtilis (216 aa), FASTA scores: E(): 2.4e-08, (25.5% identity in 196 aa overlap). Protein product from Mb1543c detected using shotgun mass spectrometry. Mb1543c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR011004" /db_xref="InterPro:IPR020019" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY4" /protein_id="SIU00146.1" /translation="MTKPLVIFGSGDIAQLAHYYFTRDSEYEVVAFTVDRDYASVSEF CGLPLVAFDEVAQRFPPESHAMFVALAYAKLNGVRKEKYLAAKALGYELASYVSSHAT VLNDGRIGENVFLLEDNTIQPFVSIGNNVTLWSGNHIGHHSTIHDHCFLASHIVVSGG VVIEEQSFIGVNATLRDHITIGSRCVVGAGALLLGDADADGVYIGTKTERRPVPSTEL RKI" CDS 1697293..1698057 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1544" /product="CONSERVED HYPOTHETICAL TRANSMEMBRANE PROTEIN" /note="Mb1544, -, len: 254 aa. Equivalent to Rv1517, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 254 aa overlap). Conserved hypothetical transmembrane protein, similar to G466802|LEPB1170_F2_64 from Mycobacterium leprae (230 aa), FASTA scores: opt: 282, E(): 2.2e-11, (34.1% identity in 255 aa overlap). Also similar to Mycobacterium tuberculosis Rv3821|MTCY409.09c (237 aa) (36.3% identity in 256 aa overlap); and Rv3481c. Mb1544 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYK5" /db_xref="InterPro:IPR021315" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK5" /protein_id="SIU00147.1" /translation="MWTMVLLLGLGMAIDPARLGLAVVMLSRRRPMLNLFAFWVGGMV AGVGIALAVLVFMRDVALAAIQGVVSAANEFREAVGILAGGRLHIVIGVIMLLLAARM VARARAQVGVPVGPVGVADGGMSALALAQRPPGLVARLEVRTQQMLQGDVVWPAFVVG VASSAPPFESVVALTVIMASGAEIGTQFGAFVVFTLLVLAVIEIPLVAYLAIPQQTQQ VMLRFQDWVRSNRRQISLTILIGVGFLFLYQGVTSL" CDS 1698066..1699025 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1545" /product="Glycosyltransferase PglI (EC" /EC_number="2.4.1.-" /note="Mb1545, -, len: 319 aa. Equivalent to Rv1518, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 319 aa overlap). Conserved hypothetical protein, possibly a glycosyl transferase involved in exopolysaccharide synthesis, similar to several hypothetical proteins and glycosyl transferases from diverse organisms e.g. P73996|D90911 from SYNECHO CYSTIS sp. (309 aa), Fasta scores: opt: 300, E(): 1.8e-13, (29.5% identity in 241 aa overlap). Mb1545 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK2" /protein_id="SIU00148.1" /translation="MVPGDASSVVSVNPAKPLISVCIPMYNNGATIERCLRSILEQEG VEFEIVVVDDDSSDDCAAIAATMLRPGDRLLRNEPRLGLNRNHNKCLEVARGGLIQFV HGDDRLLPGALQTLSRRFEDPSVGMAFAPRRVESDDIKWQQRYGRVHTRFRKLRDRNH GPSLVLQMVLHGAKENWIGEPTAVMFRRQLALDAGGFRTDIYQLVDVDFWLRLMLRSA VCFVPHELSVRRHTAATETTRVMATRRNVLDRQRILTWLIVDPLSPNRVRSAAALWWI PAWLAMIVEVAVLGPQRRTHLKALAPAPFREFAHARRQLPLAD" CDS 1699155..1699424 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1546" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1546, -, len: 89 aa. Equivalent to Rv1519, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 89 aa overlap). Conserved hypothetical protein, high similarity to C-terminus of Q50723|MTCY78.26|Rv3402c (412 aa) (58.1% identity in 74 aa overlap). Mb1546 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64862" /db_xref="InterPro:IPR000653" /db_xref="InterPro:IPR015422" /db_xref="UniProtKB/Swiss-Prot:P64862" /protein_id="SIU00149.1" /translation="MRCGCLACDGVLCANGPGRPRRPALTCTAVATRTLHSLATNAEL VESADLTVTEDICSRIVSLPVHDHMAIADVARVVAPFGEGLARGG" CDS 1699450..1700490 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1547" /product="probable sugar transferase" /note="Mb1547, -, len: 346 aa. Equivalent to Rv1520, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 346 aa overlap). Probable sugar transferase (EC 2.-.-.-), similar to several e.g. AB010970|AB010970_6 Streptococcus mutans glycosyltransferase (465 aa), FASTA scores: opt: 381, E(): 1.2e-18, (31.7% identity in 240 aa overlap); O34234|Y07786 SUGAR TRANSFERASE from Vibrio cholerae (337 aa), FASTA scores: opt: 214, E(): 8.4e-05, (25.9% identity in 212 aa overlap). Also strongly similar to Mycobacterium tuberculosis probable sugar transferase Rv1516c. Mb1547 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P64864" /protein_id="SIU00150.1" /translation="MSIVSISYNQEEYIREALDGFAAQRTEFPVEVIIADDASTDATP RIIGEYAARYPQLFRPILRQTNIGVHANFKDVLSAARGEYLALCEGDDYWTDPLKLSK QVKYLDRHPETTVCFHPVRVIYEDGAKDSEFPPLSWRRDLSVDALLARNFIQTNSVVY RRQPSYDDIPANVMPIDWYLHVRHAVGGEIAMLPETMAVYRRHAHGIWHSAYTDRRKF WETRGHGMAATLEAMLDLVHGHREREAIVGEVSAWVLREIGKTPGRQGRALLLKSIAD HPRMTMLSLQHRWAQTPWRRFKRRLSTELSSLAALAYATRRRALEGRDGGYRETTSPP TGRGRNVRGSHA" CDS 1700724..1702475 /codon_start=1 /transl_table=11 /gene="fadD25" /locus_tag="BQ2027_MB1548" /product="probable fatty-acid-amp ligase fadd25 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb1548, fadD25, len: 583 aa. Equivalent to Rv1521, len: 583 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 583 aa overlap). Probable fadD25, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to many e.g. P71495|U75685 ACYL-CoA SYNTHASE from Mycobacterium bovis (582 aa), FASTA scores: opt: 2486, E(): 0, (63.4% identity in 584 aa overlap); NP_301232.1|NC_002677 acyl-CoA synthetase from Mycobacterium leprae (579 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. fadD24 (584 aa); fadD28 (580 aa); etc. Protein product from Mb1548 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1548 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYJ1" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00151.1" /translation="MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWS QLYRRTLNLAAQLREHGSTGDRALILAPQILDYVVSFIASLQAGIVAVPLSIPQGGAH DERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLDLDARPSSGSR SAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIMTSYYGVYGKVAPPGSTVV SWLPFYHDMGFVLGLILPILAGIPAVLTSPIGFLQRPARWIQMLASNTLAFTAAPNFA FDLASRKTKDEDMEGLDLGGVHGILNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYG MAEATVYVATRKAGQPPKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVD PDTGIERPAGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSEHGAEKLVA IIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVLVSPGSIPITTSGKIRR AQCVELYRQDEFTRLDA" CDS complement(1702711..1706034) /codon_start=1 /transl_table=11 /gene="mmpL12" /locus_tag="BQ2027_MB1549C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL12" /note="Mb1549c, mmpL12, len: 1107 aa. Equivalent to Rv1522c, len: 1146 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 1107 aa overlap). Probable mmpL12, conserved transmembrane transport protein (see first citation below), member of RND superfamily. Strong similarity to many Mycobacterial membrane proteins e.g. Q49619|G466786 putative transport protein B1170_C1_181 from Mycobacterium leprae (1008 aa), FASTA scores: opt: 2418, E(): 0, (51.0% identity in 1006 aa overlap); etc. Also highly similar to MmpL8|MTCY48.08c|Rv3823c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis, FASTA score: (34.3% identity in 376 aa overlap); and some similarity to MmpL10|MTCY20G9|Rv1183 PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN, FASTA score: (27.2% identity in 1011 aa overlap). BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (c-a) omitting a stop codon, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1107 aa versus 1146 aa). Protein product from Mb1549c detected using shotgun mass spectrometry. Mb1549c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYJ9" /db_xref="InterPro:IPR000731" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00152.1" /translation="MARHDEAKAGGLFDRIGNFVVRWPLIVIGCWIAVAAALTLLLPT LQAQAAKREQAPLPPGAPSMVLQKEMSAAFQEKIETSALLLVLLTNENGLGPADEAVY RKLIENLRADTQDKISVQDFLAVPEMKELLASKDNKAWNLPITFAGDAASPETQAAFK RVAAIVKQTVAGTSLTVHLSGPIATVADLTELGEKDVRIIEIGAAVSVLIILILVYRN LVTMLVPLATIGASVVTAQGTLSGLAEFGLAVNMQAIVFMSAVMIGAGTDYAVFLISR YHDYVRHGEKSDMAVKKALMSIGKVITASAATVAVTFLAMVFTKLEVFSAVGPAIAVA ITVSLLGAVTLLPAILTLTGRRGWIKPRRDLTSRMWRRSGVRIVRRPTIHLVGSLIVL VALAGCTLLIRFNYDDLKTVPQHVESVKGYEAMNRHFPMNAMTPMVLFIKSPRDLRTP GALADIEMMSREIAELPNIVMVRGLTRPNGEPLKETKVSFQAGEVGGKLDEATTLLEE HGGELDQLTGGAHQLADALAQIRNEINGAVASSSGIVNTLQAMMDLMGGDKTIRQLEN ASQYVGRMRALGDNLSGTVTDAEQIATWASPMVNALNSSPVCNSDPACRTSRAQLAAI VQAQDDGLLRSIRALAVTLQQTQEYQTLARTVSTLDGQLKQVVSTLKAVDGLPTKLAQ MQQGANALADGSAALAAGVQELVDQVKKMGSGLNEAADFLLGIKRDADKPSMAGFNIP PQIFSRDEFKKGAQIFLSADGHAARYFVQSALNPATTEAMDQVNDILRVADSARPNTE LEDATIGLAGVPTALRDIRDYYNSDMKFIVIATIVIVFLILVILLRALVAPIYLIGSV LISYLSALGIGTLVFQLILGQEMHWSLPGLSFILLVAIGADYNMLLISRIRDESPHGI RIGVIRTVGSTGGVITSAGLIFAASMFGLVGANINTMAQAGFTIGIGIVLDTFLVRTV TVPALTTMIGRANWWPSELGRDPSTPPTKADRWLRRVKGHRRKAPIPAPKPPHTKVVR NTNGHASKAATKSVPNGKPADLAEGNGEYLIDHLRRHSLPLFGYAAMPAYDVVDGVSK PNGDGAHIGKEPVDHLLGH" CDS 1706075..1707118 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1550" /product="Probable methyltransferase" /note="Mb1550, -, len: 347 aa. Equivalent to Rv1523, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 347 aa overlap). Probable methyltransferase (EC 2.1.1.-), similar to G560513|U0002O Mycobacterium leprae (270 aa), FASTA scores: opt: 965, E(): 0, (60.3% identity in 247 aa overlap). Also similar to many e.g. Q54303|X86780 METHYLTRANSFERASE RAPM from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 323, E(): 1e-15, (41.2% identity in 136 aa overlap). And similar to Mycobacterium tuberculosis hypothetical proteins Rv2952, Rv1405c, Rv1403c, Rv0839. Start uncertain. Protein product from Mb1550 detected using SWATH mass spectrometry. Mb1550 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0R2" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R2" /protein_id="SIU00153.1" /translation="MTITALTVTLPLLWRRLTTAGVKYADQGHFVGSAGVPAADAGGR DAASEQIARWTQTCTVVLVCGHGPAKWAFRSWCTSRSCDTLPVALRYRLQSNPLVGKL TTKYFLPLGTRQVGDHVVFFNFGYEEDPPMALPLSESDEPNRYCIQLYHQTASQVDLT GKEVLEVSCGAGGGASYIARNLGPASYTGLDLNPASIDLCRAKHRLPGLQFVQGDAQN LPFPDESFDAVVNVEASHQYPDFRGFLAEVARVLRPGGHFLYTDSRRNPVVAEWEAAL ADAPLRTISQRDIGAQAKRGLDANTARSQEAIGRRAPVLLAGLTRCAVRVLDWDLRRG GGFSYRIYLFAKD" CDS 1707148..1708392 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1551" /product="Probable glycosyltransferase" /note="Mb1551, -, len: 414 aa. Equivalent to Rv1524, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 414 aa overlap). Probable glycosyltransferase (EC 2.4.1.-), similar to many e.g. P96559|U84349 GLYCOSYLTRANSFERASE GTFB from Amycolatopsis orientalis (407 aa), FASTA scores: opt: 363, E(): 6.2e-23, (28.8% identity in 430 aa overlap); also high similarity to Rv1526c|MTCY19G5.02 Mycobacterium tuberculosis hypothetical protein (58.7% identity in 416 aa overlap); and AF143772|AF143772_15 glycosyltransferase gtfB from Mycobacterium avium strain 215 (418 aa), FASTA scores: opt: 1801, E(): 0, (65.2% identity in 417 aa overlap). Protein product from Mb1551 detected using SWATH mass spectrometry. Mb1551 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64866" /db_xref="InterPro:IPR004276" /db_xref="UniProtKB/Swiss-Prot:P64866" /protein_id="SIU00154.1" /translation="MKFVVASYGTRGDIEPCAAVGLELQRRGHDVCLAVPPNLIGFVE TAGLSAVAYGSRDSQEQLDEQFLHNAWKLQNPIKLLREAMAPVTEGWAELSAMLTPVA AGADLLLTGQIYQEVVANVAEHHGIPLAALHFYPVRANGEIAFPARLPAPLVRSTITA IDWLYWRMTKGVEDAQRRELGLPKASTPAPRRMAVRGSLEIQAYDALCFPGLAAEWGG RRPFVGALTMESATDADDEVASWIAADTPPIYFGFGSMPIGSLADRVAMISAACAELG ERALICSGPSDATGIPQFDHVKVVRVVSHAAVFPTCRAVVHHGGAGTTAAGLRAGIPT LILWVTSDQPIWAAQIKQLKVGRGRRFSSATKESLIADLRTILAPDYVTRAREIASRM TKPAASVTATADLLEDAARRAR" CDS 1708439..1709224 /codon_start=1 /transl_table=11 /gene="wbbL2" /locus_tag="BQ2027_MB1552" /product="POSSIBLE RHAMNOSYL TRANSFERASE WBBL2" /note="Mb1552, wbbL2, len: 261 aa. Equivalent to Rv1525, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 261 aa overlap). Possible wbbL2, rhamnosyl transferase (EC 2.-.-.-), showing weak similarity to several rhamnosyl transferases. Similar to AF105060|AF105060_1 Riftia pachyptila endosymbiont (746 aa), FASTA scores: opt: 183, E(): 0.00013, (35.2% identity in 105 aa overlap). Protein product from Mb1552 detected using SWATH mass spectrometry. Mb1552 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P64868" /protein_id="SIU00155.1" /translation="MYAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDY PRIGTERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRL PADAGMVGPMFDVGFPFAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGG MDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGAS AAMIRGLRRTHGWPAAMGILREMGMAHHRKWHKSFPLTCPASC" CDS complement(1709202..1710482) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1553C" /product="Probable glycosyltransferase" /note="Mb1553c, -, len: 426 aa. Equivalent to Rv1526c, len: 426 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 426 aa overlap). Probable glycosyltransferase (EC 2.4.1.-), highly similar to G467196 Protein L518_C2_147 from Mycobacterium leprae (421 aa), FASTA scores, opt: 1497, E(): 0, (55.0% identity in 424 aa overlap); similar to G452504 rhamnosyltransferase (24.7% identity in 433 aa overlap); and P96565|U84350 GLYCOSYLTRANSFERASE GTFE from Amycolatopsis orientalis (408 aa), E(): 3.4e-24, (28.4% identity in 429 aa overlap), also high similarity to Rv1524|MTCY19G5.04c (58.7 % identity in 416 aa overlap). Protein product from Mb1553c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1553c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64870" /db_xref="InterPro:IPR004276" /db_xref="UniProtKB/Swiss-Prot:P64870" /protein_id="SIU00156.1" /translation="MKFVLAVHGTRGDVEPCAAVGVELRRRGHAVHMAVPPNLIEFVE SAGLTGVAYGPDSDEQINTVAAFVRNLTRAQNPLNLARAVKELFVEGWAEMGTTLTTL ADGADLVMTGQTYHGVAANVAEYYDIPAAALHHFPMQVNGQIAIPSIPTPATLVRATM KVSWRLYAYVSKDADRAQRRELGLPPAPAPAVRRLAERGAPEIQAYDPVFFPGLAAEW SDRRPFVGPLTMELHSEPNEELESWIAAGTPPIYFGFGSTPVQTPVQTLAMISDVCAQ LGERALIYSPAANSTRIRHADHVKRVGLVNYSTILPKCRAVVHHGGAGTTAAGLRAGM PTLILWDVADQPIWAGAVQRLKVGSAKRFTNITRGSLLKELRSILAPECAARAREIST RMTRPTAAVTAAADLLEATARQTPGSTPSSSPGR" CDS complement(1710505..1716831) /codon_start=1 /transl_table=11 /gene="pks5" /locus_tag="BQ2027_MB1554C" /product="Probable polyketide synthase pks5" /note="Mb1554c, pks5, len: 2108 aa. Equivalent to Rv1527c, len: 2108 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 2108 aa overlap). Probable pks5, polyketide synthase, highly similar to many e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from Mycobacterium bovis (2110 aa), FASTA scores: opt: 6270, E(): 0, (63.6% identity in 2126 aa overlap). Protein product from Mb1554c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1554c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYL5" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:A0A1R3XYL5" /protein_id="SIU00157.1" /translation="MGKERTKTVDRTRVTPVAVIGMGCRLPGGIDSPDRLWEALLRGD DLVTEIPADRWDIDEYYDPEPGVPGRTDCKWGAYLDNVGDFDPEFFGIGEKEAIAIDP QHRLLLETSWEAMEHGGLTPNQMASRTGVFVGLVHTDYILVHADNQTFEGPYGNTGTN ACFASGRVAYAMGLQGPAITVDTACSSGLTAIHLACRSLHDGESDIALAGGVYVMLEP RRFASGSALGMLSATGRCHAFDVSADGFVSGEGCVMLALKRLPDALADGDRILAVIRG TAANQDGHTVNIATPSRSAQVAAYREALDVAGVDPATVGMVEAHGPGTPVGDPIEYAS LAEVYGNDGPCALASVKTNFGHTQSAAGALGLMKAVLALQHGVVPQNLHFTALPDKLA AIETNLFVPQEITPWPGADQETPRRAAVSSYGMTGTNVHAIVEQAPVPAPESGAPGDT PATPGIDGALLFALSASSQDALRQTAARLADWVDAQGPELAPADLAYTLARRRGHRPV RTAVLAATTAELTEALREVATGETPYPPAVGQDDRGPVWVFSGQGSQWAGMGADLLAT EPVFAATIAAIEPLIAAESGFSVTEAMTAPEVVTGIDRVQPTLFAMQVALAATMKSYG VAPGAVIGHSLGESAAAVVAGALCLEDGVRVICRRSALMTRIAGAGAMASVELPAQQV LSELMARGVNDAVVAVVASPQSTVIGGATQTVRDLVAAWEQRDVLAREVAVDVASHSP QVDPILDELAEALAEISPLQPEIPYYSATSFDPREEPYCDAYYWVDNLRHTVRFAAAV QAALEDGYRVFTELTPHPLLTHAVDQTARSLDMSAAALAGMRREQPLPHGLRALAGDL YAAGAAVDFAVLYPTGRLINAPLPTWNHRRLLLDDTTRRIAHANTVAVHPLLGSHVRL PEEPERHVWQGEVGTVTQPWLADHQIHGAAALPGAAYCEMALAAARAVLGEASEVRDI RFEQMLLLDDETPIGVTATVEAPGVVPLTVETSHDGRYTRQLAAVLHVVREADDAPDQ PPQKNIAELLASHPHKVDGAEVRQWLDKRGHRLGPAFAGLVDAYIAEGAGDTVLAEVN LPGPLRSQVKAYGVHPVLLDACFQSVAAHPAVQGMADGGLLLPLGVRRLRSYGSARHA RYCCTTVTACGVGVEADLDVLDEHGAVVLAVRGLQLGTGASQASERARVLGERLLSIE WHERELPENSHAEPGAWLLISTCDATDLVAAQLTDALKVHDAQCTTMSWPQRADHAAQ AARLRDQLGTGGFTGVFVLTAPQTGDPDAESPVRGGELVKHVVRIAREIPEITAQEPR LYVLTHNAQAVLSGDRPNLEQGGMRGLLRVIGAEHPHLKASYVDVDEQTGAESVARQL LAASGEDETAWRNDQWYTARLCPAPLRPEERQTTVVDHAEAGMRLQIRTPGDLQTLEF AALDRVPPGPGEIEVAVTASSINFADVLVTFGRYQTLDGRQPQLGTDFAGVVSAVGPG VSELKVGDRVGGMSPNGCWATFVTCDARLATRLPEGLTDAQAAAVTTASATAWYGLQD LARIKAGDKVLIHSATGGVGQAAIAIARAAGAQIYATAGNEKRRDLLRDMGIEHVYDS RSVEFAEQIRRDTAGYGVDIVLNSVTGAAQLAGLKLLALGGRFIEIGKRDIYSNTRLE LLPFRRNLAFYGLDLGLMSVSHPAAVRELLSTVYRLTVEGVLPMPQSTHYPLAEAATA IRVMGAAEHTGKLILDVPHAGRSSVVLPPEQARVFRSDGSYIITGGLGGLGLFLAEKM ANAGAGRIVLSSRSQPSQKALETIELVRAIGSDVVVECGDIAQPDTADRLVTAATATG LPLRGVLHAAAVVEDATLANITDELIERDWAPKAYGAWQLHRATADQPLDWFCSFSSA AALVGSPGQGAYAAANSWLDTFTHWRRAQDLPATSIAWGAWGQIGRAIAFAEQTGDAI APEEGAYAFETLLRHNRAYSGYAPVIGSPWLTAFAQHSPFAEKFQSLGQNRSGTSKFL AELVDLPREEWPDRLRRLLSKQVGLILRRTIDTDRLLSEYGLDSLSSQELRARVEAET GIRISATEINTTVRGLADLMCDKLAADRDAPAPA" CDS complement(1717375..1717872) /codon_start=1 /transl_table=11 /gene="papA4" /locus_tag="BQ2027_MB1555C" /product="PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA4" /note="Mb1555c, papA4, len: 165 aa. Equivalent to Rv1528c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 165 aa overlap). Probable papA4, conserved polyketide synthase (PKS) associated protein; shows some similarity to C-terminal part of hypothetical proteins from Mycobacterium tuberculosis and Mycobacterium leprae e.g. Z97188|MTCY409_10 Mycobacterium tuberculosis cosmid (468) (37.9% identity in 66 aa overlap); or U00010_11 Mycobacterium leprae cosmid B1170 (35.7% identity in 84 aa overlap). Also similar to Mycobacterium tuberculosis PKS-associated proteins Rv1182, Rv3824c, Rv3820c. Mb1555c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYL0" /protein_id="SIU00158.1" /translation="MTQLPQPTWRWWQQRETEQVQSSHIDGEIVGALIPDLAVLHSED ASRAAVGREKHRCSLDPLGGGFRSRRASMPAGALLLSAVIAIQLDRMNARVFGDGWIG AQACMWVNKFHEESTVTALSPSSPIAQGSIARHPETMQSAYVRIAEGGSRDVAPAAQL QRRRP" CDS 1717924..1719678 /codon_start=1 /transl_table=11 /gene="fadD24" /locus_tag="BQ2027_MB1556" /product="probable fatty-acid-amp ligase fadd24 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb1556, fadD24, len: 584 aa. Equivalent to Rv1529, len: 584 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 584 aa overlap). Probable fadD24, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to many e.g. MBU75685_1|AAB52538.1|U75685 acyl-CoA synthase from Mycobacterium bovis (582 aa), FASTA score: (65.6% identity in 582 aa overlap); and many other fatty-acid-CoA synthetases from Mycobacteria e.g. fadD25|MTCY19G5_7 from Mycobacterium tuberculosis (583 aa), FASTA score: (68.7% identity in 584 aa overlap); fadD28|MTCY24G1_8 from Mycobacterium tuberculosis (580 aa), FASTA score: (66.0% identity in 582 aa overlap); NP_301232.1|NC_002677|U00010_6 from Mycobacterium leprae (372 aa), FASTA score: (57.6% identity in 342 aa overlap); FADD23|Rv3826|MTCY409.04c from Mycobacterium tuberculosis (584 aa), FASTA score: (63.2% identity in 584 aa overlap); etc. Protein product from Mb1556 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1556 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYJ8" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XYJ8" /protein_id="SIU00159.1" /translation="MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQL YRRMLNVAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAHDE RTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLDSRQRSRSPGA RPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQIVADFFAPEGGVVPPDLTV VSWLPLYHDMGLLLGAIMPILAGVPTVLTSPVGFLQRPARWIQLLARNGRTISAGPNF AFELAVRKTSDDDMDGLDLAGVHTILNGSERVHPATLKRFAERFGRFNFAAAALRPAY GMAEATVYIATRNVNEPPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIV DPDTCIECPQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVPDHGTEKLV AIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLVLVSPGSIPITTSGKIR RAQCVQLYRRREFTRLDA" CDS 1719795..1720898 /codon_start=1 /transl_table=11 /gene="adh" /locus_tag="BQ2027_MB1557" /product="Probable alcohol dehydrogenase adh" /note="Mb1557, adh, len: 367 aa. Equivalent to Rv1530, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 367 aa overlap). Probable adh, alcohol dehydrogenase (EC 1.1.1.1), zinc-dependent, similar to many e.g. AE0009|AE000958_23 Archaeoglobus fulgidus section 1 (402 aa), FASTA scores: opt: 423, E(): 1.8e-19, (31.7% identity in 341 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. Protein product from Mb1557 detected using SWATH mass spectrometry. Mb1557 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYL6" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XYL6" /protein_id="SIU00160.1" /translation="MSDGAVVRALVLEAPRRLVVRQYRLPRIGDDDALVRVEACGLCG TDHEQYTGELAGGFAFVPGHETVGTIAAIGPRAEQRWGVSAGDRVAVEVFQSCRQCAN CRGGEYRRCVRHGLADMYGFIPVDREPGLWGGYAEYQYLAPDSMVLRVAGDLSPEVAT LFNPLGAGIRWGVTIPETKPGDVVAVLGPGIRGLCAAAAAKGAGAGFVMVTGLGPRDA DRLALAAQFGADLAVDVAIDDPVAALTEQTGGLADVVVDVTAKAPAAFAQAIALARPA GTVVVAGTRGVGSGAPGFSPDVVVFKELRVLGALGVDATAYRAALDLLVSGRYPFASL PRRCVRLEGAEDLLATMAGERDGVPPIHGVLTP" CDS 1720895..1721461 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1558" /product="conserved protein" /note="Mb1558, -, len: 188 aa. Equivalent to Rv1531, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 188 aa overlap). Conserved hypothetical protein, similar to Rv0464c|MTV038.08c (190 aa), FASTA scores: E(): 4.8e-10, (30.9% identity in 175 aa overlap). Protein product from Mb1558 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1558 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYK1" /db_xref="InterPro:IPR003779" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK1" /protein_id="SIU00161.1" /translation="MTTSRVPLLPVDEAKAAADEAGVPDYMAELSIFQVLLNHPRLAR TFNDLLATMLWHGTLDSRLRELVIMRIGWLTDCDYEWTQHWRVASGLGVSADDLLGVR DWQGYNGFGPAEQAVLAATDDVVREGAVSAQSWSACERELHCDKVVLIELVTVISAWR MVASILHSLEVPLEDGVSSWPPDGLSPR" CDS complement(1721538..1721972) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1559C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1559c, -, len: 144 aa. Equivalent to Rv1532c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 144 aa overlap). Conserved hypothetical protein, similar to P20378|YPHR_HALHA Hypothetical 15.6 kd protein from Halobacterium halobium (151 aa), FASTA scores: opt: 152, E():4.5e-05, (30.1% identity in 103 aa overlap). Protein product from Mb1559c detected using SWATH mass spectrometry. Mb1559c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003736" /db_xref="InterPro:IPR006683" /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3XYK8" /protein_id="SIU00162.1" /translation="MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVI RLPFRTDLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGAAK RCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV" CDS 1722032..1723159 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1560" /product="putative oxidoreductase, nitronate monooxygenase family" /note="Mb1560, -, len: 375 aa. Equivalent to Rv1533, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 375 aa overlap). Conserved hypothetical protein. Similar to 2NPD_NEUCR|Q01284 2-nitropropane dioxygenase precursor (378 aa), fasta scores: opt: 279, E(): 9.1e-11, (31.3% identity in 256 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1894c, Rv0021c, Rv3553, Rv2781c. Protein product from Mb1560 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1560 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0S2" /db_xref="InterPro:IPR004136" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0S2" /protein_id="SIU00163.1" /translation="MRTRVAELLGAEFPICAFSHCRDVVAAVSNAGGFGILGAVAHSP KRLESELTWIEEHTGGKPYGVDVLLPPKYIGAEQGGIDAQQARELIPEGHRTFVDDLL VRYGIPAVTDRQRSSSAGGLHISPKGYQPLLDVAFAHDIRLIASALGPPPPDLVERAH NHDVLVAALAGTAQHARRHAAAGVDLIVAQGTEAGGHTGEVATMVLVPEVVDAVSPTP VLAAGGIARGRQIAAALALGAEGVWCGSVWLTTEEAETPPVVKDKFLAATSSDTVRSR SLTGKPARMLRTAWTDEWDRPDSPDPLGMPLQSALVSDPQLRINQAAGQPGAKARELA TYFVGQVVGSLDRVRSARSVVLDMVEEFIDTVGQLQGLVQR" CDS 1723156..1723833 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1561" /product="Probable transcriptional regulator" /note="Mb1561, -, len: 225 aa. Equivalent to Rv1534, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 225 aa overlap). Probable transcriptional regulator, similar to YCDC_ECOLI|P75899 hypothetical transcriptional regulator from Escherichia coli (212 aa), FASTA scores: opt: 166, E(): 9.8e-05, (24.2% identity in 219 aa overlap). Contains PS01081 Bacterial regulatory proteins, tetR family signature and helix turn helix motif (aa 41-62). Protein product from Mb1561 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1561 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZJ7" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="InterPro:IPR041669" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ7" /protein_id="SIU00164.1" /translation="MSRASARRRRAVSDEDKSQRRDEILAAAKIVFAHKGFHATTVAD IAKQAGLAYGLIYWYFDSKDDLFHALMAGEEEALRAHVAAELARVGGSTEAPLRALLQ AAVQATFEFFETDKATVKLLFRDAYALGGRFEEHLGGIYERFIDDIEAVVVAAQRRGE VVEAPSRMAAYTLAALVGQLAHRRLNTDDNVTAAQVADFVVSLVLDGLRPRALAVGAR GGRAART" CDS 1724398..1724634 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1562" /product="unknown protein" /note="Mb1562, -, len: 78 aa. Equivalent to Rv1535, len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 78 aa overlap). Hypothetical unknown protein. Mb1562 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYL3" /protein_id="SIU00165.1" /translation="MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGT GGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQA" CDS 1724941..1728066 /codon_start=1 /transl_table=11 /gene="ileS" /locus_tag="BQ2027_MB1563" /product="isoleucyl-tRNA synthetase ileS" /note="Mb1563, ileS, len: 1041 aa. Equivalent to Rv1536, len: 1041 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1041 aa overlap). ileS, Isoleucyl-tRNA synthetase (EC 6.1.1.5), similar to several e.g. SYIC_YEAST P09436 isoleucyl-tRNA synthetase (1072 aa), FASTA scores: opt: 1447, E(): 0, (37.8% identity in 1072 aa overlap); contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb1563 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1563 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VEZ0" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002300" /db_xref="InterPro:IPR002301" /db_xref="InterPro:IPR009008" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR013155" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR023586" /db_xref="InterPro:IPR033709" /db_xref="UniProtKB/Swiss-Prot:Q7VEZ0" /protein_id="SIU00166.1" /translation="MTDNAYPKLAGGAPDLPALELEVLDYWSRDDTFRASIARRDGAP EYVFYDGPPFANGLPHYGHLLTGYVKDIVPRYRTMRGYKVERRFGWDTHGLPAELEVE RQLGITDKSQIEAMGIAAFNDACRASVLRYTDEWQAYVTRQARWVDFDNDYKTLDLAY MESVIWAFKQLWDKGLAYEGYRVLPYCWRDETPLSNHELRMDDDVYQSRQDPAVTVGF KVVGGQPDNGLDGAYLLVWTTTPWTLPSNLAVAVSPDITYVQVQAGDRRFVLAEARLA AYARELGEEPVVLGTYRGAELLGTRYLPPFAYFMDWPNAFQVLAGDFVTTDDGTGIVH MAPAYGEDDMVVAEAVGIAPVTPVDSKGRFDVTVADYQGQHVFDANAQIVRDLKTQSG PAAVNGPVLIRHETYEHPYPHCWRCRNPLIYRSVSSWFVRVTDFRDRMVELNQQITWY PEHVKDGQFGKWLQGARDWSISRNRYWGTPIPVWKSDDPAYPRIDVYGSLDELERDFG VRPANLHRPYIDELTRPNPDDPTGRSTMRRIPDVLDVWFDSGSMPYAQVHYPFENLDW FQGHYPGDFIVEYIGQTRGWFYTLHVLATALFDRPAFKTCVAHGIVLGFDGQKMSKSL RNYPDVTEVFDRDGSDAMRWFLMASPILRGGNLIVTEQGIRDGVRQVLLPLWNTYSFL ALYAPKVGTWRVDSVHVLDRYILAKLAVLRDDLSESMEVYDIPGACEHLRQFTEALTN WYVRRSRSRFWAEDADAIDTLHTVLEVTTRLAAPLLPLITEIIWRGLTRERSVHLTDW PAPDLLPSDADLVAAMDQVRDVCSAASSLRKAKKLRVRLPLPKLIVAVENPQLLRPFV DLIGDELNVKQVELTDAIDTYGRFELTVNARVAGPRLGKDVQAAIKAVKAGDGVINPD GTLLAGPAVLTADEYNSRLVAADPESTAALPDGAGLVVLDGTVTAELEAEGWAKDRIR ELQELRKSTGLDVSDRIRVVMSVPAEREDWARTHRDLIAGEILATDFEFADLADGVAI GDGVRVSIEKT" CDS 1728278..1729669 /codon_start=1 /transl_table=11 /gene="dinX" /locus_tag="BQ2027_MB1564" /product="probable dna polymerase iv dinx (pol iv 1) (dna nucleotidyltransferase (dna-directed))" /note="Mb1564, dinX, len: 463 aa. Equivalent to Rv1537, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 463 aa overlap). Probable dinX, DNA polymerase IV. Similar to umuC, mucB, samb, and impb (uv protection and mutation) e.g. IMPB_SALTY|P18642 impb protein from Salmonella typhimurium (424 aa), FASTA scores, opt: 386, E(): 1.7e-17, (27.5% identity in 415 aa overlap). Also similar to Mycobacterium tuberculosis Rv3056. Mb1564 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63986" /db_xref="InterPro:IPR001126" /db_xref="InterPro:IPR017961" /db_xref="InterPro:IPR022880" /db_xref="InterPro:IPR024728" /db_xref="InterPro:IPR036775" /db_xref="UniProtKB/Swiss-Prot:P63986" /protein_id="SIU00167.1" /translation="MLHLDMDAFFASVEQLTRPTLRGRPVLVGGLGGRGVVAGASYEA RAYGARSAMPMHQARRLIGVTAVVLPPRGVVYGIASRRVFDTVRGLVPVVEQLSFDEA FAEPPQLAGAVAEDVETFCERLRRRVRDETGLIASVGAGSGKQIAKIASGLAKPDGIR VVRHAEEQALLSGLPVRRLWGIGPVAEEKLHRLGIETIGQLAALSDAEAANILGATIG PALHRLARGIDDRPVVERAEAKQISAESTFAVDLTTMEQLHEAIDSIAEHAHQRLLRD GRGARTITVKLKKSDMSTLTRSATMPYPTTDAGALFTVARRLLPDPLQIGPIRLLGVG FSGLSDIRQESLFADSDLTQETAAAHYVETPGAVVPAAHDATMWRVGDDVAHPELGHG WVQGAGHGVVTVRFETRGSGPGSARTFPVDTGDISNASPLDSLDWPDYIGQLSVEGSA GASAPTVDDVGDR" CDS complement(1729634..1730614) /codon_start=1 /transl_table=11 /gene="ansA" /locus_tag="BQ2027_MB1565C" /product="Probable L-aparaginase ansA" /note="Mb1565c, ansA, len: 326 aa. Equivalent to Rv1538c, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 326 aa overlap). Probable ansA, L-aparaginase, most similar to ASPG_BACLI|P30363 L-asparaginase (322 aa), FASTA scores: opt: 417, E(): 8.8e-19, (30.9% identity in 314 aa overlap). Contains PS00917 Asparaginase / glutaminase active site signature 2. Protein product from Mb1565c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1565c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63628" /db_xref="InterPro:IPR004550" /db_xref="InterPro:IPR006034" /db_xref="InterPro:IPR020827" /db_xref="InterPro:IPR027473" /db_xref="InterPro:IPR027474" /db_xref="InterPro:IPR027475" /db_xref="InterPro:IPR036152" /db_xref="InterPro:IPR037152" /db_xref="InterPro:IPR040919" /db_xref="UniProtKB/Swiss-Prot:P63628" /protein_id="SIU00168.1" /translation="MGANHVRNDPIMARLTVITTGGTISTTAGPDGVLRPTHCGATLI AGLDMDSDIEVVDLMALDSSKLTPADWDRIGAAVQEAFRGGADGVVITHGTDTLEETA LWLDLTYAGSRPVVLTGAMLSADAPGADGPANLRDALAVAADPAARDLGVLVSFGGRV LQPLGLHKVANPDLCGFAGESLGFTSGGVRLTRTKTRPYLGDLGAAVAPRVDIVAVYP GSDAVAMDACVAAGARAVVLEALGSGNAGAAVIEGVRRHCRDGSDPVVIAVSTRVAGA RVGAGYGPGHDLVEAGAVMVPRLPPSQARVLLMAALAANSPVADVIDRWG" CDS 1730666..1731274 /codon_start=1 /transl_table=11 /gene="lspA" /locus_tag="BQ2027_MB1566" /product="Probable lipoprotein signal peptidase lspA" /note="Mb1566, lspA, len: 202 aa. Equivalent to Rv1539, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 202 aa overlap). Probable lspA, lipoprotein signal peptidase (EC 3.4.23.36), similar to several e.g. LSPA_PSEFL|P17942 (170 aa), FASTA scores: opt: 299, E(): 2.6e-12, (38.3% identity in 167 aa overlap). Mb1566 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65263" /db_xref="InterPro:IPR001872" /db_xref="UniProtKB/Swiss-Prot:P65263" /protein_id="SIU00169.1" /translation="MPDEPTGSADPLTSTEEAGGAGEPNAPAPPRRLRMLLSVAVVVL TLDIVTKVVAVQLLPPGQPVSIIGDTVTWTLVRNSGAAFSMATGYTWVLTLIATGVVV GIFWMGRRLVSPWWALGLGMILGGAMGNLVDRFFRAPGPLRGHVVDFLSVGWWPVFNV ADPSVVGGAILLVILSIFGFDFDTVGRRHADGDTVGRRKADG" CDS 1731267..1732193 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1567" /product="LSU rRNA pseudouridine(1911/1915/1917) synthase (EC" /EC_number="5.4.99.23" /note="Mb1567, -, len: 308 aa. Equivalent to Rv1540, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 308 aa overlap). Member of the yabO/yceC/yfiI family of hypothetical proteins, similar to P44445|YFII_HAEIN hypothetical protein HI0176 from Haemophilus influenzae (324 aa), FASTA scores: opt: 437, E(): 1.2e-22, (33.2% identity in 322 aa overlap). Equivalent to AL049478|MLCL458_13 hypothetical protein from Mycobacterium leprae (308 aa), (89.3% identity in 307 aa overlap). Contains PS01129 hypothetical yabO/yceC/yfiI family signature. Protein product from Mb1567 detected using SWATH mass spectrometry. Mb1567 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5T3" /db_xref="InterPro:IPR002942" /db_xref="InterPro:IPR006145" /db_xref="InterPro:IPR006224" /db_xref="InterPro:IPR006225" /db_xref="InterPro:IPR020103" /db_xref="InterPro:IPR036986" /db_xref="UniProtKB/Swiss-Prot:P0A5T3" /protein_id="SIU00170.1" /translation="MADRSMPVPDGLAGMRVDTGLARLLGLSRTAAAALAEEGAVELN GVPAGKSDRLVSGALLQVRLPEAPAPLQNTPIDIEGMTILYSDDDIVAVDKPAAVAAH ASVGWTGPTVLGGLAAAGYRITTSGVHERQGIVHRLDVGTSGVMVVAISERAYTVLKR AFKYRTVDKRYHALVQGHPDPSSGTIDAPIGRHRGHEWKFAITKNGRHSLTHYDTLEA FVAASLLDVHLETGRTHQIRVHFAALHHPCCGDLVYGADPKLAKRLGLDRQWLHARSL AFAHPADGRRVEIVSPYPADLQHALKILRGEG" CDS complement(1732200..1732793) /codon_start=1 /transl_table=11 /gene="lprI" /locus_tag="BQ2027_MB1568C" /product="Possible lipoprotein lprI" /note="Mb1568c, lprI, len: 197 aa. Equivalent to Rv1541c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 197 aa overlap). Possible lipoprotein lprI, contains appropriately positioned prokaryotic membrane lipoprotein lipid attachment site (PS0013)." /db_xref="GOA:P65319" /db_xref="InterPro:IPR009739" /db_xref="InterPro:IPR018660" /db_xref="InterPro:IPR036328" /db_xref="UniProtKB/Swiss-Prot:P65319" /protein_id="SIU00171.1" /translation="MRWIGVLVTALVLSACAANPPANTTSPTAGQSLDCTKPATIVQQ LVCHDRQLTSLDHRLSTAYQQALAHRRSAALEAAQSSWTMLRDACAQDTDPRTCVQEA YQTRLVQLAIADPATATPPVLTYRCPTQDGPLTAQFYNQFDPKTAVLNWKGDQVIVFV ELSGSGARYGRQGIEYWEHQGEVRLDFHGATFVCRTS" CDS complement(1732848..1733258) /codon_start=1 /transl_table=11 /gene="glbN" /locus_tag="BQ2027_MB1569C" /product="hemoglobin glbn" /note="Mb1569c, glbN, len: 136 aa. Equivalent to Rv1542c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 136 aa overlap). Probable glbN, hemoglobin. Belongs to the protozoan/cyanobacterial globin family. Similar to myoglobins e.g. GLB_PARCA|P15160 myoglobin (hemoglobin) paramecium (116 aa), FASTA scores, opt: 284, E(): 2.1e -13, (35.7% identity in 115 aa overlap). Similar to Mycobacterium tuberculosis hypothetical globin, Rv2470." /db_xref="GOA:P0A593" /db_xref="InterPro:IPR001486" /db_xref="InterPro:IPR009050" /db_xref="InterPro:IPR012292" /db_xref="InterPro:IPR016339" /db_xref="InterPro:IPR019795" /db_xref="UniProtKB/Swiss-Prot:P0A593" /protein_id="SIU00172.1" /translation="MGLLSRLRKREPISIYDKIGGHEAIEVVVEDFYVRVLADDQLSA FFSGTNMSRLKGKQVEFFAAALGGPEPYTGAPMKQVHQGRGITMHHFSLVAGHLADAL TAAGVPSETITEILGVIAPLAVDVTSGESTTAPV" CDS 1733486..1734511 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1570" /product="POSSIBLE FATTY ACYL-COA REDUCTASE" /note="Mb1570, -, len: 341 aa. Equivalent to Rv1543, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 341 aa overlap). Possible fatty-acyl CoA reductase (EC 1.2.1.-), highly similar to P94129|U77680 FATTY ACYL-COA REDUCTASE ACR1 from Acinetobacter calcoaceticus (295 aa), FASTA scores: opt: 899, E(): 0, (48.5% identity in 293 aa overlap). Also highly similar to acrA1|Rv3391|MTV004.49|NP_217908.1|NC_000962 fatty acyl-CoA reductase from Mycobacterium tuberculosis (650 aa). Also highly similar to many oxidoreductases short-chain family. Protein product from Mb1570 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1570 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66780" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P66780" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00173.1" /translation="MNLGDLTNFVEKPLAAVSNIVNTPNSAGRYRPFYLRNLLDAVQG RNLNDAVKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLENVANDIRAIRG NGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNAGRSIRRSLELSYDRIHDYQ RTMQLNYLGAVQLILKFIPGMRERHFGHIVNVSSVGVQTRAPRFGAYIASKAALDSLC DALQAETVHDNVRFTTVHMALVRTPMISPTTIYDKFPTLTPDQAAGVITDAIVHRPRR ASSPFGQFAAVADAVNPAVMDRVRNRAFNMFGDSSAAKGSESQTDTSELDKRSETFVR ATRGIHW" CDS 1734516..1735319 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1571" /product="Possible ketoacyl reductase" /note="Mb1571, -, len: 267 aa. Equivalent to Rv1544, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 267 aa overlap). Possible ketoacyl reductase (EC 1.3.1.-), highly similar to Z97179|MLCL383_26 putative oxidoreductase from Mycobacterium leprae (268 aa), FASTA score: (43.0% identity in 270 aa overlap). Also highly similar to others e.g. T29125 ketoacyl reductase homolog from Streptomyces coelicolor (276 aa); NP_470957.1|NC_003212 protein similar to ketoacyl reductases from Listeria innocua (253 aa); HETN_ANASP|P37694 ketoacyl reductase from Anabaena sp. strain PCC 7120 (287 aa), FASTA scores: opt: 379, E(): 7.5e-18, (31.6% identity in 250 aa overlap); etc. And highly similar to many oxidoreductases short-chain family. Also highly similar to Rv2509 from Mycobacterium tuberculosis (268 aa). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1571 detected using shotgun mass spectrometry. Mb1571 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZJ9" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00174.1" /translation="MSLPKPNNQTTVVITGASSGIGVELARGLAGRGFPLMLVARRRE RLDELADQLRQEHCVGVEVLPLDLADTQARAQLADRLRSDAIAGLCNSAGFGTSGRFW ELPFARESEEVVLNALALMELTHAALPGMVKRGAGAVLNIASIAGFQPIPYMAVYSAT KAFVLTFSEAVQEELHGTGVSVTALCPGPVPTEWAEIASAERFSIPLAQVSPHDVAEA AIAGMLSGKRTVVPGIVPKFVSTSGRFAPRSLLLPAIRIGNRLRGGPSR" CDS 1735341..1735568 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1572" /product="HYPOTHETICAL PROTEIN" /note="Mb1572, -, len: 75 aa. Equivalent to Rv1545, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 75 aa overlap). Hypothetical unknown protein. Mb1572 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64872" /protein_id="SIU00175.1" /translation="MPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVA CREEVGITTILAGRDECGVCDKTAGLDGAAP" CDS 1735617..1736048 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1573" /product="conserved protein" /note="Mb1573, -, len: 143 aa. Equivalent to Rv1546, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 143 aa overlap). Conserved hypothetical protein, similar to O05902|Rv0910|MTCY21C12.04 Hypothetical protein from Mycobacterium tuberculosis (144 aa), FASTA scores: E(): 5e-30, (37.3% identity in 142 aa overlap). Protein product from Mb1573 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1573 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/Swiss-Prot:P64874" /protein_id="SIU00176.1" /translation="MASVELSADVPISPQDTWDHVSELSELGEWLVIHEGWRSELPDQ LGEGVQIVGVARAMGMRNRVTWRVTKWDPPHEVAMTGSGKGGTKYGVTLTVRPTKGGS ALGLRLELGGRALFGPLGSAAARAVKGDVEKSLKQFAELYG" CDS 1736116..1739670 /codon_start=1 /transl_table=11 /gene="dnaE1" /locus_tag="BQ2027_MB1574" /product="probable dna polymerase iii (alpha chain) dnae1 (dna nucleotidyltransferase)" /note="Mb1574, dnaE1, len: 1184 aa. Equivalent to Rv1547, len: 1184 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 1184 aa overlap). Probable dnaE1, DNA polymerase III, alpha chain (EC 2.7.7.7), similar to e.g. DP3A_ECOLI|P10443 dna polymerase III, alpha chain (1160 aa), FASTA scores: opt: 1789, E(): 0, (36.5% identity in 1193 aa overlap). Also similar to M. tuberculosis, DnaE2|Rv3370c. Protein product from Mb1574 detected using SWATH mass spectrometry. Mb1574 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63978" /db_xref="InterPro:IPR003141" /db_xref="InterPro:IPR004013" /db_xref="InterPro:IPR004805" /db_xref="InterPro:IPR011708" /db_xref="InterPro:IPR016195" /db_xref="InterPro:IPR029460" /db_xref="InterPro:IPR040982" /db_xref="InterPro:IPR041931" /db_xref="UniProtKB/Swiss-Prot:P63978" /protein_id="SIU00177.1" /translation="MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVG MTDHGNMFGASEFYNSATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVSGS GSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPS GEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALN IPPLATNDCHYVTRDAAHNHEALLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWD DEVPGACDSTLLIAERVQSYADVWTPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFP AGPPDGYRERAAYEIDVICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAY ALGITDIDPIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQV ITFGTIKTKAALKDSARIHYGQPGFAIADRITKALPPAIMAKDIPLSGITDPSHERYK EAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPLTEAIPLWKRPQD GAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDAIDNVRANRGIDLDLESVPLDDKAT YELLGRGDTLGVFQLDGGPMRDLLRRMQPTGFEDVVAVIALYRPGPMGMNAHNDYADR KNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRIAQKVASYSLARADILRKAMG KKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWT AYLKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNFASVGQD IRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISACNKKVTESLIKAGAF DSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTADPVFTIKVPDDE WEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPNDAQVRVGGIL ASVNRRVNKNGMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRD DRIALIANDLTVPDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTSQVHLRL ISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS" CDS complement(1739719..1741755) /codon_start=1 /transl_table=11 /gene="PPE21" /locus_tag="BQ2027_MB1575C" /product="ppe family protein ppe21" /note="Mb1575c, PPE21, len: 678 aa. Equivalent to Rv1548c, len: 678 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 678 aa overlap). Member of the M. tuberculosis PPE family, similar to several e.g. YHS6_MYCTU|P42611 hypothetical 50.6 kd protein in hsp65 3' region (517 aa), FASTA scores: opt:1142, E(): 0, (40.6% identity in 616 aa overlap); also similar to MTCY31.06c (54.9% identity in 381 aa overlap)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XYN6" /protein_id="SIU00178.1" /translation="MNFSVLPPEINSALMFAGAGPGPMLAAASAWTGLAGDLGSAAAS FSAVTSQLATGSWQGPASAAMTGAAASYARWLTTAAAQAEQAAGQAQAAVSAFEAALA ATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASA VALSLTPFTPSPSAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPG SANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPGSGNTGTLNWGSGNIGSYN LGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGD TNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFG NSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQ LSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTG SFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTG TNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSV PTITGTANISGFVNAGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASGWIH" CDS 1741912..1743855 /codon_start=1 /transl_table=11 /gene="fadD11" /locus_tag="BQ2027_MB1576" /product="probable fatty-acid-coa ligase fadd11 (fatty-acid-coa synthetase) (fatty-acid-coa synthase)" /note="Mb1576, fadD11, len: 647 aa. Equivalent to Rv1549 (fadD11') and Rv1550 (fadD11), len: 175 aa and 571 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 167 aa overlap and 99.8% identity in 470 aa overlap). Possible fadD11', fatty-acid-CoA synthetase (EC 6.2.1.-), similar to the N-terminus of many fatty-acid CoA synthetases e.g. NP_147860.1|NC_000854 long-chain-fatty-acid--CoA ligase from Aeropyrum pernix (651 aa); P31685|4CL2_SOLTU 4-coumarate--CoA ligase 2 (EC 6.2.1.12) from Solanum tuberosum (Potato) (545 aa), FASTA scores: opt: 168, E(): 4.4e-06, (30.4% identity in 112 aa overlap); etc. Possible frameshift with respect to next ORF Rv1550|MTCY48.15c but we can find no sequence error to account for this. Probable fadD11, fatty-acid-CoA synthetase (EC 6.2.1.-), similar, except in N-terminus, to many e.g. SC6A5.39|T35430 probable long-chain-fatty-acid--CoA ligase (EC 6.2.1.3) from Streptomyces coelicolor (612 aa); NP_301672.1|NC_002677 putative long-chain-fatty-acid-CoA ligase from Mycobacterium leprae (600 aa); P44446|LCFH_HAEIN putative long-chain-fatty-acid-CoA ligase from Haemophilus influenzae (607 aa), FASTA scores: opt: 762, E(): 2.3e-38, (34.4% identity in 436 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1549 and Rv1550 exist as 2 genes. In Mycobacterium bovis, a two single base insertions (*-c and *-c) lead to a single product. Protein product from Mb1576 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1576 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYM2" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XYM2" /protein_id="SIU00179.1" /translation="MGRHGSRWSRPPCFRVLRLWTYAHRCDLGHTDPLSRRTEMTTTE RPTTMCEAFQRTAVMDPDAVALRTPGGNQTMTWRDYAAQVRRVAAGLAGLGVRRGDTV SLMMANRIEFYPLDVGAQHVGATSFSVYNTLPAEQLTYVFDNAGTKVVICEQQYVDRV RASGVPIEHIVCVDGAPPGTLSLTDLYAAASGDFFDFESTWRAVQPEDIVTLIYTSGT TGNPKGVEMTHANLLFEGYAIDEVLGIRFGDRVTSFLPSAHIADRMTGLYLQEMFGTQ VTAVADARTIAAALPDVRPTVWGAVPRVWEKLKAGIEFTVARETDEMKRQALAWAMSV AGKRANALLAGESMSDQLVAEWAKADESVLSKLRERLGFGELRWALSGAAPIPKETLA FFAGIGIPIAEIWGMSELSCVATASHPRDGRLGTVGKLLPGLQGKIAEDGEYLVRGPL VMKGYRKEPAKTAEAIDSDGWLHTGDVFDIDSDGYLRVVDRKKELIINAAGKNMSPAN IENTILAACPMVGVMMAIGDGRTYNTALLVFDADSLGPYAAQRGLDASPAALAADPEV IARIAAGVAEGNAKLSRVEQIKRFRILPTLWEPGGDEITLTMKLKRRRIAAKYSAEIE ELYASELRPQVYEPAAVPSTQPA" CDS 1743869..1745734 /codon_start=1 /transl_table=11 /gene="plsB1" /locus_tag="BQ2027_MB1577" /product="Possible acyltransferase plsB1" /note="Mb1577, plsB1, len: 621 aa. Equivalent to Rv1551, len: 621 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 621 aa overlap). Possible plsB1, acyltransferase (EC 2.-.-.-), similar to PLSB_HAEIN|P44857 glycerol-3-phosphate acyltransferase from Haemophilus influenzae (810 aa), FASTA scores: opt: 434, E(): 6.2e-22, (27.6% identity in 395 aa overlap). Also similar to Rv2482c|plsB2 Probable glycerol-3-phosphate acyltransferase from M.tuberculosis (789 aa). Protein product from Mb1577 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1577 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65735" /db_xref="InterPro:IPR002123" /db_xref="InterPro:IPR022284" /db_xref="InterPro:IPR028354" /db_xref="InterPro:IPR041728" /db_xref="UniProtKB/Swiss-Prot:P65735" /protein_id="SIU00180.1" /translation="MTAREVGRIGLRKLLQRIGIVAESMTPLATDPVEVTQLLDARWY DERLRALADELGRDPDSVRAEAAGYLREMAASLDERAVQAWRGFSRWLMRAYDVLVDE DQITQLRKLDRKATLAFAFSHRSYLDGMLLPEAILANRLSPALTFGGANLNFFPMGAW AKRTGAIFIRRQTKDIPVYRFVLRAYAAQLVQNHVNLTWSIEGGRTRTGKLRPPVFGI LRYITDAVDEIDGPEVYLVPTSIVYDQLHEVEAMTTEAYGAVKRPEDLRFLVRLARQQ GERLGRAYLDFGEPLPLRKRLQEMRADKSGTGSEIERIALDVEHRINRATPVTPTAVV SLALLGADRSLSISEVLATVRPLASYIAARNWAVAGAADLTNRSTIRWTLHQMVASGV VSVYDAGTEAVWGIGEDQHLVAAFYRNTAIHILVDRAVAELALLAAAETTTNGSVSPA TVRDEALSLRDLLKFEFLFSGRAQFEKDLANEVLLIGSVVDTSKPAAAADVWRLLESA DVLLAHLVLRPFLDAYHIVADRLAAHEDDSFDEEGFLAECLQVGKQWELQRNIASAES RSMELFKTALRLARHRELVDGADATDIAKRRQQFADEIATATRRVNTIAELARRQ" CDS 1746105..1747856 /codon_start=1 /transl_table=11 /gene="frdA" /locus_tag="BQ2027_MB1578" /product="PROBABLE FUMARATE REDUCTASE [FLAVOPROTEIN SUBUNIT] FRDA (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb1578, frdA, len: 583 aa. Equivalent to Rv1552, len: 583 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 583 aa overlap). Probable frdA, fumarate reductase, flavoprotein subunit (EC 1.3.99.1), highly similar to others e.g. P00363|FRDA_ECOLI fumarate reductase flavoprotein subunit from Escherichia coli strain K12 (601 aa), FASTA scores: opt: 2102, E(): 0, (54.7% identity in 585 aa overlap); NP_232284.1|NC_002505 fumarate reductase, flavoprotein subunit from Vibrio cholerae (602 aa); frdA|NP_438995.1|NC_000907 fumarate reductase, flavoprotein subunit from Haemophilus influenzae (599 aa); etc. Contains PS00504 Fumarate reductase / succinate dehydrogenase FAD-binding site. NOTE THAT FUMARATE REDUCTASE FORMS PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv1552|frdA), AN IRON-SULFUR (Rv1553|frdB), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv1554|frdC and Rv1555|frdD). Protein product from Mb1578 detected using SWATH mass spectrometry. Mb1578 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64175" /db_xref="InterPro:IPR003952" /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR005884" /db_xref="InterPro:IPR014006" /db_xref="InterPro:IPR015939" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR037099" /db_xref="UniProtKB/Swiss-Prot:P64175" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00181.1" /translation="MTAQHNIVVIGGGGAGLRAAIAIAETNPHLDVAIVSKVYPMRSH TVSAEGGAAAVTGDDDSLDEHAHDTVSGGDWLCDQDAVEAFVAEAPKELVQLEHWGCP WSRKPDGRVAVRPFGGMKKLRTWFAADKTGFHLLHTLFQRLLTYSDVMRYDEWFATTL LVDDGRVCGLVAIELATGRIETILADAVILCTGGCGRVFPFTTNANIKTGDGMALAFR AGAPLKDMEFVQYHPTGLPFTGILITEAARAEGGWLLNKDGYRYLQDYDLGKPTPEPR LRSMELGPRDRLSQAFVHEHNKGRTVDTPYGPVVYLDLRHLGADLIDAKLPFVRELCR DYQHIDPVVELVPVRPVVHYMMGGVHTDINGATTLPGLYAAGETACVSINGANRLGSN SLPELLVFGARAGRAAADYAARHQKSDRGPSSAVRAQARTEALRLERELSRHGQGGER IADIRADMQATLESAAGIYRDGPTLTKAVEEIRVLQERFATAGIDDHSRTFNTELTAL LELSGMLDVALAIVESGLRREESRGAHQRTDFPNRDDEHFLAHTLVHRESDGTLRVGY LPVTITRWPPGERVYGR" CDS 1747859..1748983 /codon_start=1 /transl_table=11 /gene="frdBC" /locus_tag="BQ2027_MB1579" /product="probable fumarate reductase [membrane anchor subunit] frdc (fumarate dehydrogenase) (fumaric hydrogenase)" /note="Mb1579, frdBC, len: 374 aa. Equivalent to Rv1553 (frdB) and Rv1554 (frdC), len: 247 aa and len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 247 aa overlap and 99.2% identity in 126 aa overlap). Probable frdB, fumarate reductase, iron-sulfur subunit (EC 1.3.99.1), highly similar to others e.g. P00364|FRDB_ECOLI fumarate reductase iron-sulfur protein from Escherichia coli strain K12 (243 aa), FASTA scores: opt: 846, E(): 0, (50.0% identity in 242 aa overlap); P20921|FRDB_PROVU FUMARATE REDUCTASE IRON-SULFUR PROTEIN from Proteus vulgaris (245 aa); G64097 fumarate reductase (EC 1.3.99.1) iron-sulfur protein from Haemophilus influenzae (276 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. And probable frdC, fumarate reductase, membrane-anchor subunit (EC 1.3.99.1), highly similar to others e.g. P03805|FRDC_ECOLI fumarate reductase 15 kDa hydrophobic protein from Escherichia coli strain K12 (131 aa), FASTA scores, opt: 268, E(): 3.9e-10, (31.1% identity in 122 aa overlap); NP_458780.1|NC_003198 fumarate reductase complex subunit C; membrane anchor polypeptide from Salmonella enterica subsp. enterica serovar Typhi (131 aa); P20923|FRDC_PROVU FUMARATE REDUCTASE 15 KD HYDROPHOBIC PROTEIN from Proteus vulgaris (131 aa); etc. NOTE THAT FUMARATE REDUCTASE FORMS PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv1552|frdA), AN IRON-SULFUR (Rv1553|frdB), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv1554|frdC and Rv1555|frdD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1553 and Rv1554 exist as 2 genes. In Mycobacterium bovis, a 4 bp insertion (*-gggg) leads to a single product. Mb1579 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYM8" /db_xref="InterPro:IPR003510" /db_xref="InterPro:IPR004489" /db_xref="InterPro:IPR009051" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR017900" /db_xref="InterPro:IPR025192" /db_xref="InterPro:IPR034804" /db_xref="InterPro:IPR036010" /db_xref="UniProtKB/TrEMBL:A0A1R3XYM8" /protein_id="SIU00182.1" /translation="MMDRIVMEVSRYRPEIESAPTFQAYEVPLTREWAVLDGLTYIKD HLDGTLSFRWSCRMGICGSSGMTINGDPKLACATFLADYLPGPVRVEPMRNFPVIRDL VVDISDFMAKLPSVKPWLVRHDEPPVEDGEYRQTPAELDAFKQFSMCINCMLCYSACP VYALDPDFLGPAAIALGQRYNLDSRDQGAADRRDVLAAADGAWACTLVGECSTACPKG VDPAGAIQRYKLTAATHALKKLLFPWGGGRMSAYRQPVERYWWARRRSYLRFMLREIS CIFVAWFVLYLVLVLRAVGAGGNSYQRFLDFSANPVVVVLNVVALSFLLLHAVTWFGS APRAMVIQVRGRRVPARAVLAGHYAAWLVVSVIVAWMVLS" CDS 1748980..1749357 /codon_start=1 /transl_table=11 /gene="frdD" /locus_tag="BQ2027_MB1580" /product="PROBABLE FUMARATE REDUCTASE [MEMBRANE ANCHOR SUBUNIT] FRDD (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb1580, frdD, len: 125 aa. Equivalent to Rv1555, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 125 aa overlap). Probable frdD, fumarate reductase, membrane-anchor subunit (EC 1.3.99.1), similar to others e.g. P03806|FRDD_ECOLI fumarate reductase 13 kDa hydrophobic protein from Escherichia coli strain K12 (119 aa), FASTA scores: opt: 212, E(): 4.4e-08, (36.8% identity in 106 aa overlap); etc. NOTE THAT FUMARATE REDUCTASE FORMS PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv1552|frdA), AN IRON-SULFUR (Rv1553|frdB), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv1554|frdC and Rv1555|frdD). Protein product from Mb1580 detected using shotgun mass spectrometry. Mb1580 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67644" /db_xref="InterPro:IPR003418" /db_xref="InterPro:IPR034804" /db_xref="UniProtKB/Swiss-Prot:P67644" /protein_id="SIU00183.1" /translation="MTPSTSDARSRRRSAEPFLWLLFSAGGMVTALVAPVLLLLFGLA FPLGWLDAPDHGHLLAMVRNPITKLVVLVLVVLALFHAAHRFRFVLDHGLQLGRFDRV IALWCYGMAVLGSATAGWMLLTM" CDS 1749425..1750033 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1581" /product="Possible regulatory protein" /note="Mb1581, -, len: 202 aa. Equivalent to Rv1556, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 202 aa overlap). Possible regulatory protein, similar to X86780|SHGCPIR2|g987088 orfY, regulator of antibiotic transport complexes from Streptomyces hygroscopicus (204 aa), FASTA score: opt: 251, E(): 1.7e-10, (33.8% identity in 201 aa overlap) and others. Protein product from Mb1581 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1581 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67437" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR011075" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/Swiss-Prot:P67437" /protein_id="SIU00184.1" /translation="MVGAVTQIADRPTDPSPWSPRETELLAVTLRLLQEHGYDRLTVD AVAASARASKATVYRRWPSKAELVLAAFIEGIRQVAVPPNTGNLRDDLLRLGELICRE VGQHASTIRAVLVEVSRNPALNDVLQHQFVDHRKALIQYILQQAVDRGEISSAAISDE LWDLLPGYLIFRSIIPNRPPTQDTVQALVDDVILPSLTRSTG" CDS 1750172..1750618 /codon_start=1 /transl_table=11 /gene="mmpS6" /locus_tag="BQ2027_MB1582" /product="PROBABLE CONSERVED MEMBRANE PROTEIN MMPS6" /note="Mb1582, mmpS6, len: 148 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv. Probable mmpS6, conserved membrane protein (see citations below), highly similar to other Mycobacterial proteins e.g. P54880|MMS4_MYCLE|ML2377|U1740W|MMPS4 Putative membrane protein from Mycobacterium leprae (154 aa), FASTA scores: opt: 521, E(): 4.7e-29, (53.06% identity in 147 aa overlap); P95212|MMPS1|MMS1_MYCTU|Rv0403c|MT0415|MTCY04D9. 16c PROBABLE CONSERVED MEMBRANE PROTEIN from Mycobacterium tuberculosis (142 aa), FASTA scores: opt: 518, E(): 7.2e-29, (56.75% identity in 141 aa overlap); O53736|MMS4_MYCTU|MMPS4|Rv0451c|MT0467|MTV037.15c PROBABLE CONSERVED MEMBRANE PROTEIN from Mycobacterium tuberculosis (140 aa), FASTA scores: opt: 498, E(): 1.8e-27, (52.85% identity in 140 aa overlap); etc. BELONGS TO THE MMPS FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 2153 bp insertion leads to a new protein with no equivalent in Mycobacterium tuberculosis strain H37Rv. Belongs to the TbD1 region. Mb1582 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYN3" /db_xref="InterPro:IPR008693" /db_xref="InterPro:IPR038468" /db_xref="UniProtKB/TrEMBL:A0A1R3XYN3" /protein_id="SIU00185.1" /translation="MQGISVTGLVKRGWMVLVAVAVVAVAGFSVYRLHGIFGSHDTTS TAGGVANDIKPFNPKQVTLEVFGAPGTVATINYLDVDATPRQVLDTTLPWSYTITTTL PAVFANVVAQGDSNSIGCRITVNGVVKDERIVNEVRAYTFCLDKSS" CDS 1750615..1753518 /codon_start=1 /transl_table=11 /gene="mmpL6" /locus_tag="BQ2027_MB1583" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL6" /note="Mb1583, mmpL6, len: 967 aa. Equivalent to 3' end of Rv1557, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 384 aa overlap). Probable mmpL6, conserved transmembrane transport protein (see citations below). Member of RND superfamily, with strong similarity to other members of large Mycobacterial membrane protein family belonging to RND superfamily e.g. Q11171|MML2_MYCTU|MMPL2|Rv0507|MT0528|MTCY20G9.34 PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis (968 aa), FASTA scores: opt: 4142, E(): 0, (64.4% identity in 947 aa overlap); O53735|MML4_MYCTU|MMPL4|Rv0450c|MT0466|MTV037.14c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis (967 aa), FASTA scores: opt: 4035, E(): 0, (61.5% identity in 948 aa overlap); P54881|MML4_MYCLE|MMPL4|ML2378|U1740V Putative membrane protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3961, E(): 0, (60.95% identity in 945 aa overlap); etc. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 2153 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Belongs to the TbD1 region. Mb1583 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ18" /db_xref="InterPro:IPR000731" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ18" /protein_id="SIU00186.1" /translation="MSNHHRPRPWLPHTIRRLSLPILLFWVGVAAITNAAVPQLEVVG EAHNVAQSSPDDPSLQAMKRIGKVFHEFDSDSAAMIVLEGDKPLGNDAHRFYDTLLRN LSNDTKHVEHVQDFWGDPLTAAGSQSTDGKAAYVQVYLAGNQGEALSIESVDAVRDIV AHTPPPAGVKAYVTGAAPLMADQFQVGSKGTAKVTGITLVVIAVMLLFVYRSVVTMVL VLITVLIELAAARGIVAFLGNAGVIGLSTYSTNLLTLLVIAAGTDYAIFVLGRYHEAR YAAQDRETAFYTMYRGTAHVVLGSGLTVAGAVYCLSFTRLPYFQSLGIPASIGVMIAL AAALSLAPSVLILGSRFGCFEPKRRMRTRGWRRIGTAIVRWPGPILAVACAIAVVGLL ALPGYKTSYDARYYMPATAPANIGYMAAERHFPQARLNPELLMIETDHDMRNPADMLI LDRIAKAVFHLPGIGLVQAMTRPLGTPIDHSSIPFQISMQSVGQIQNLKYQRDRAADL LKQAEELGKTIEILQRQYALQQELAAATHEQAESFHQTIATVKELRDRIANFDDFFRP IRSYFYWEKHCYDIPSCWALRSVFDTIDGIDQLGEQLASVTVTLDKLAAIQPQLVALL PDEIASQQINRELALANYATMSGIYAQTAALIENAAAMGQAFDAAKNDDSFYLPPEAF DNPDFQRGLKLFLSADGKAARMIISHEGDPATPEGISHIDAIKQAAHEAVKGTPMAGA GIYLAGTAATFKDIQDGATYDLLIAGIAALSLILLIMMIITRSLVAALVIVGTVALSL GASFGLSVLVWQHLLGIQLYWIVLALAVILLLAVGSDYNLLLISRFKEEIGAGLNTGI IRAMAGTGGVVTAAGLVFAATMSSFVFSDLRVLGQIGTTIGLGLLFDTLVVRAFMTPS IAVLLGRWFWWPQRVRPRPASRMLRPYGPRPVVRELLLREGNDDPRTQVATHR" CDS 1753528..1753974 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1584" /product="F420H(2)-dependent quinone reductase Rv1558 (Fqr) (EC" /EC_number="1.1.98.-" /note="Mb1584, -, len: 148 aa. Equivalent to Rv1558, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 148 aa overlap). Conserved hypothetical protein, similar to other Mycobacterial tuberculosis proteins e.g. P71854|MTCY03C7.09c|Rv3547 (151 aa), FASTA scores opt: 330, E(): 9.1e-17, (39.7% identity in 151 aa overlap); also Q11057|Rv1261c (149 aa), and O53328|Rv3178 (119 aa). Similar also to AF072709|AF072709_5 Hypothetical protein with a new amplifiable element AUD4 from Streptomyces lividans (149 aa), FASTA scores: opt: 695, E(): 0, (69.1% identity in 149 aa overlap). Protein product from Mb1584 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1584 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64876" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/Swiss-Prot:P64876" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00187.1" /translation="MPLSGEYAPSPLDWSREQADTYMKSGGTEGTQLQGKPVILLTTV GAKTGKLRKTPLMRVEHDGQYAIVASLGGAPKNPVWYHNVVKNPRVELQDGTVTGDYD AREVFGDEKAIWWQRAVAVWPDYASYQTKTDRQIPVFVLTPVRAGG" CDS 1754009..1755298 /codon_start=1 /transl_table=11 /gene="ilvA" /locus_tag="BQ2027_MB1585" /product="Probable threonine dehydratase ilvA" /note="Mb1585, ilvA, len: 429 aa. Equivalent to Rv1559, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 429 aa overlap). Probable ilvA, threonine dehydratase (EC 4.3.1.19), biosynthetic protein, similar to several e.g. THD1_CORGL|Q04513 threonine dehydratase biosynthetic (436 aa), FASTA scores: opt: 1694, E(): 0, (61.9% identity in 415 aa overlap). Contains PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site. Protein product from Mb1585 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1585 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66898" /db_xref="InterPro:IPR000634" /db_xref="InterPro:IPR001721" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR011820" /db_xref="InterPro:IPR036052" /db_xref="InterPro:IPR038110" /db_xref="UniProtKB/Swiss-Prot:P66898" /protein_id="SIU00188.1" /translation="MSAELSQSPSSSPLFSLSGADIDRAAKRIAPVVTPTPLQPSDRL SAITGATVYLKREDLQTVRSYKLRGAYNLLVQLSDEELAAGVVCSSAGNHAQGFAYAC RCLGVHGRVYVPAKTPKQKRDRIRYHGGEFIDLIVGGSTYDLAAAAALEDVERTGATL VPPFDDLRTIAGQGTIAVEVLGQLEDEPDLVVVPVGGGGCIAGITTYLAERTTNTAVL GVEPAGAAAMMAALAAGEPVTLDHVDQFVDGAAVNRAGTLTYAALAAAGDMVSLTTVD EGAVCTAMLDLYQNEGIIAEPAGALSVAGLLEADIEPGSTVVCLISGGNNDVSRYGEV LERSLVHLGLKHYFLVDFPQEPGALRRFLDDVLGPNDDITLFEYVKRNNRETGEALVG IELGSAADLDGLLARMRATDIHVEALEPGSPAYRYLL" CDS 1755336..1755554 /codon_start=1 /transl_table=11 /gene="vapb11" /locus_tag="BQ2027_MB1586" /product="possible antitoxin vapb11" /note="Mb1586, -, len: 72 aa. Equivalent to Rv1560, len: 72 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 72 aa overlap). Conserved hypothetical protein, part of a Mycobacterial tuberculosis family of proteins e.g. Q10848|Rv2009|MTCY39.08c (80 aa), FASTA score: (54.4% identity in 68 aa overlap); Q10799|Rv2871|MTCY274.02 (85 aa); O50456|Rv1241|MTV006.13 (86 aa), O06243|Rv2132|MTCY270.36C (76 aa); etc. Mb1586 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/Swiss-Prot:P64878" /protein_id="SIU00189.1" /translation="MYRWCMSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVGS PLSREFLLGLEGVGWEGDLDDLRSDRPD" CDS 1755560..1755964 /codon_start=1 /transl_table=11 /gene="vapc11" /locus_tag="BQ2027_MB1587" /product="possible toxin vapc11" /note="Mb1587, -, len: 134 aa. Equivalent to Rv1561, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 134 aa overlap). Conserved hypothetical protein, similar to others from Mycobacterium tuberculosis e.g. Q10847|Rv2010|MTCY39.07c (132 aa), FASTA scores: (37.0% identity in 127 aa overlap); and O06566|Rv1114|MTCY22G8.03 (124 aa). Protein product from Mb1587 detected using SWATH mass spectrometry. Mb1587 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64880" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P64880" /protein_id="SIU00190.1" /translation="MILIDTSAWVEYFRATGSIAAVEVRRLLSEEAARIAMCEPIAME ILSGALDDNTHTTLERLVNGLPSLNVDDAIDFRAAAGIYRAARRAGETVRSINDCLIA ALAIRHGARIVHRDADFDVIARITNLQAASFR" CDS complement(1755981..1757723) /codon_start=1 /transl_table=11 /gene="treZ" /locus_tag="BQ2027_MB1588C" /standard_name="glgZ" /product="Maltooligosyltrehalose trehalohydrolase TreZ" /note="Mb1588c, treZ, len: 580 aa. Equivalent to Rv1562c, len: 580 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 580 aa overlap). treZ (previously called glgZ), Maltooligosyltrehalose trehalohydrolase, confirmed biochemically (see citation below). Similar to Q44316|D63343 TREZ MALTOOLIGOSYL TREHALOSE TREHALOHYDROLASE from ARTHROBACTER SP (598 aa), FASTA scores: opt: 2071, E(): 0, (52.2% identity in 582 aa overlap); also similar to 1,4-alpha-glucan branching enzymes e.g. GLGB_BACST|P30538 (639 aa), FASTA scores: opt: 313, E(): 3.8e-13, (27.5% identity in 462 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv1326c|glgB, and Rv1563c treY (previously glgY). Protein product from Mb1588c detected using SWATH mass spectrometry. Mb1588c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYM9" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR012768" /db_xref="InterPro:IPR013783" /db_xref="InterPro:IPR014756" /db_xref="InterPro:IPR017853" /db_xref="InterPro:IPR022567" /db_xref="UniProtKB/TrEMBL:A0A1R3XYM9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00191.1" /translation="MPEFRVWAPKPALVRLDVNGAVHAMTRSADGWWHTTVAAPADAR YGYLLDDDPTVLPDPRSARQPDGVHARSQRWEPPGQFGAARTDTGWPGRSVEGAVIYE LHIGTFTTAGTFDAAIEKLDYLVDLGIDFVELMPVNSFAGTRGWGYDGVLWYSVHEPY GGPDGLVRFIDACHTRRLGVLIDAVFNHLGPSGNYLPRFGPYLSSASNPWGDGINIAG ADSDEVRHYIIDCALRWMRDFHADGLRLDAVHALVDTTAVHVLEELANATRWLSGQLG RPLSLIAETDRNDPRLITRPSHGGYGITAQWNDDIHHAIHTAVSGERQGYYADFGSLA TLAYTLRNGYFHAGTYSSFRRRRHGRALDTSAIPATRLLAYTCTHDQVGNRALGDRPS QYLTGGQLAIKAALTLGSPYTAMLFMGEEWGASSPFQFFCSHPEPELAHSTVAGRKEE FAEHGWAADDIPDPQDPQTFQRCKLNWAEAGSGEHARLHRFYRDLIALRHNEADLADP WLDHLMVDYDEQQRWVVMRRGQLMIACNLGAEPTCVPVSGELVLAWESPIIGDNSTEL AAYSLAILRAAEPA" CDS complement(1757716..1758450) /codon_start=1 /transl_table=11 /gene="treYb" /locus_tag="BQ2027_MB1589C" /standard_name="glgY" /product="Maltooligosyltrehalose synthase TreYb [SECOND PART]" /note="Mb1589c, treYb, len: 244 aa. Equivalent to the 3' end of Rv1563c, len: 765 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 244 aa overlap). treY (previously called glgY), maltooligosyl trehalose synthase, confirmed biochemically (see citation below). Strong similarity to Q44315|63343 TREY MALTOOLIGOSYL TREHALOSE SYNTHASE from ARTHROBACTER SP (775 aa), fasta scores: opt: 1953, E(): 0; (46.0% identity in 789 aa overlap). Some similarity to alpha-amylases and to MTCY48.03 (30.2% identity in 215 aa overlap). May catalyse conversion of maltodextrins to maltooligosyl trehaloses. Also similar to Mycobacterium tuberculosis glgB (Rv1326c), treZ (Rv1562c). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1563c/treY exists as a single gene. In Mycobacterium bovis, a large deletion of 806 bp splits Rv1563c into two parts, treYa and treYb. Protein product from Mb1589c detected using SWATH mass spectrometry. Mb1589c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3XYN8" /protein_id="SIU00192.1" /translation="MTGQFLWQNVFGVWPVSGEVSAALRGRLHTYAEKAIREAAWHTS WHNPNRAFEDDVHGWLDLVLDGPLASELTGLVAHLNSHAESDALAAKLLALTVPGVPD VYQGSELWDDSLVDPDNRRPVDYGTRRVALKALQHPKIRVLAAALRLRRTHPESFLGG AYHPVFAAGPAADHVVAFRRGDDILVAVTRWTVRLQQTGWDHTVLPLPDGSWTDALTG FTASGHTPAVELFADLPVVLLVRDNA" CDS complement(1758455..1759207) /codon_start=1 /transl_table=11 /gene="treYa" /locus_tag="BQ2027_MB1590C" /standard_name="glgY" /product="Maltooligosyltrehalose synthase TreYa [FIRST PART]" /note="Mb1590c, treYa, len: 250 aa. Equivalent to the 5' end of Rv1563c, len: 765 aa, from Mycobacterium tuberculosis strain H37Rv, (97.4% identity in 191 aa overlap). treY (previously called glgY), maltooligosyl trehalose synthase, confirmed biochemically (see citation below). Strong similarity to Q44315|63343 TREY MALTOOLIGOSYL TREHALOSE SYNTHASE from ARTHROBACTER SP (775 aa), fasta scores: opt: 1953, E(): 0; (46.0% identity in 789 aa overlap). Some similarity to alpha-amylases and to MTCY48.03 (30.2% identity in 215 aa overlap). May catalyse conversion of maltodextrins to maltooligosyl trehaloses. Also similar to Mycobacterium tuberculosis glgB (Rv1326c), treZ (Rv1562c). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1563c/treY exists as a single gene. In Mycobacterium bovis, a large deletion of 806 bp splits Rv1563c in two parts, treYa and treYb. Mb1590c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0U5" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U5" /protein_id="SIU00193.1" /translation="MAFPVISTYRVQMRGRSNGFGFTFADAENLLDYLDDLGVSHLYL SPILTAVGGSTHGYDVTDPTTVSPELGGSDGLARLSAAARSRGMGLIVDIVPSHVGVG KPEQNAWWWDVLKFGRSSAYAEFFDIDWELGDGRIILPLLGSDSDVANLRVDGDLLRL GDLALPVAPGSGDGTGPAVHDRQHSVWCGRRGVSSPGRHPCSVVATVHDDTVHPRHQT RRGRACPHRRAVPSAVAVGQVHRPRPSHCARP" CDS complement(1759211..1761376) /codon_start=1 /transl_table=11 /gene="treX" /locus_tag="BQ2027_MB1591C" /standard_name="glgX" /product="probable maltooligosyltrehalose synthase trex" /note="Mb1591c, treX, len: 721 aa. Equivalent to Rv1564c, len: 721 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 721 aa overlap). Probable treX (previously called glgX), Maltooligosyltrehalose synthase. Strong similarity to D83245|g1890053 treX, glycogen debranching enzyme (glgX) from Sulfolobus acidocaldarius (713 aa), FASTA score: opt: 2396, E(): 0, (48.4% identity in 709 aa overlap); similar to GLGX_HAEIN|P45178 glycogen operon protein glgx (659 aa), FASTA scores: opt: 1512, E(): 0, (42.3% identity in 645 aa overlap). Protein product from Mb1591c detected using SWATH mass spectrometry. Mb1591c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Y5" /db_xref="InterPro:IPR004193" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR011837" /db_xref="InterPro:IPR013780" /db_xref="InterPro:IPR013783" /db_xref="InterPro:IPR014756" /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/Swiss-Prot:P0A4Y5" /protein_id="SIU00194.1" /translation="MSSNNAGESDGTGPALPTVWPGNAYPLGATYDGAGTNFSLFSEI AEKVELCLIDEDGVESRIPLDEVDGYVWHAYLPNITPGQRYGFRVHGPFDPAAGHRCD PSKLLLDPYGKSFHGDFTFGQALYSYDVNAVDPDSTPPMVDSLGHTMTSVVINPFFDW AYDRSPRTPYHETVIYEAHVKGMTQTHPSIPPELRGTYAGLAHPVIIDHLNELNVTAV ELMPVHQFLHDSRLLDLGLRNYWGYNTFGFFAPHHQYASTRQAGSAVAEFKTMVRSLH EAGIEVILDVVYNHTAEGNHLGPTINFRGIDNTAYYRLMDHDLRFYKDFTGTGNSLNA RHPHTLQLIMDSLRYWVIEMHVDGFRFDLASTLARELHDVDRLSAFFDLVQQDPVVSQ VKLIAEPWDVGEGGYQVGNFPGLWTEWNGKYRDTVRDYWRGEPATLGEFASRLTGSSD LYEATGRRPSASINFVTAHDGFTLNDLVSYNDKHNEANGENNRDGESYNRSWNCGVEG PTDDPDILALRARQMRNMWATLMVSQGTPMIAHGDEIGRTQYGNNNVYCQDSELSWMD WSLVDKNADLLAFARKATTLRKNHKVFRRRRFFEGEPIRSGDEVRDIAWLTPSGREMT HEDWGRGFDRCVAVFLNGEAITAPDARGERVVDDSFLLCFNAHDHDVEFVMPHDGYAQ QWTGELDTNDPVGDIDLTVTATDTFSVPARSLLVLRKTL" CDS complement(1761415..1763604) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1592C" /product="O-antigen acetylase" /note="Mb1592c, -, len: 729 aa. Equivalent to Rv1565c, len: 729 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 729 aa overlap). Conserved hypothetical membrane protein, some similarity to O05402 HYPOTHETICAL 72.2 KD PROTEIN from Bacillus subtilis (634 aa), FASTA results: opt: 384, E(): 4.8e-17, (29.1% identity in 378 aa overlap); and to Y392_HAEIN|P43993 hypothetical protein hi0392 from H. influenzae (245 aa), FASTA results: opt: 265, E(): 5.5e-10, (28.3% identity in 247 aa overlap). C-terminal half equivalent to AL049478|MLCL458_19 (274 aa) (78.5% identity in 274 aa overlap). Also similar to M. tuberculosis hypothetical proteins Rv0111, Rv0228, Rv1254, Rv0517. N-terminal half hydrophobic. Protein product from Mb1592c detected using SWATH mass spectrometry. Mb1592c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYP2" /db_xref="InterPro:IPR002656" /db_xref="UniProtKB/TrEMBL:A0A1R3XYP2" /protein_id="SIU00195.1" /translation="MLTLSPPRPPALTPEPALPPVTMGTRTTGFYRHDLDGLRGVAIA LVAVFHVWFGRVSGGVDVFLALSGFFFGGKILRAALNPDLSLSPIAEVIRLIRRLLPA LVVVLAGCALLTIAIQPQTRWEAFANQSLASLGYYQNWELASTVSNYLRAGEAVSPLQ HIWSMSVQGQFYLAFLLLVAGCAYLLRRLFRGPRAPYLRTMFVVLLSTLTLASFIYAI VAHHAYQATAYYNTFARAWELLAGALVGAVVPHVRWPMWLRTAVATAALAAILSCGAL IDGVKEFPGPWALVPVGATMLMILAGANRQGHPGTRDRLPLPNRLLATAPLVALGAMA YSWYLWHWPLLIFWLSYTGHRHANFVEGAAVLLVSGLLAYLTTRLVEDPLRYRAPAGV RSPAAVPPIPWRLRLRRPTIVLGSVVALLGVALTATSFTWREHVIVQRAAGKELSGLS SRDYPGARALIDHVRVPKLRMRPTVLEVRHDLPTSTKDGCISDFVNPAIINCTYGDVD APRTIALAGGSHAEHWLTALDLLGRMHHFKVVTYLKMGCPLSTEEVPLIMGNNAPYPQ CHQWVQAAMAKLVADHPDYVFTTSTRPWNIKPGDVMPATYVGIWQTFADNNIPVLAMR DTPWLVKDGQPFIPADCLAKGGNPQSCGIARSKVLVDRNPTLDFVARFPLLKPLDMSD AICRTDTCRAVEGNVLVYRDSHHLTPTYMRTMTSELGRQIAANTDWW" CDS complement(1763703..1764395) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1593C" /product="Possible inv protein" /note="Mb1593c, -, len: 230 aa. Equivalent to Rv1566c, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 230 aa overlap). Possible inv protein, probably exported as has QQAPV repeats at C-terminus. Similar to Q49634 inv protein from Mycobacterium leprae (246 aa), FASTA scores: opt: 957, E(): 0, (70.0% identity in 207 aa overlap); also to putative invasins 1,2 (O07390, O07391) from M. avium. Slightly similar to C-terminus of P60_LISMO|P21171 Listeria invasion-associated protein p60 precursor. Also similar to Mycobacterium tuberculosis p60 homologues Rv1477, Rv1478, Rv0024, Rv2190c. Protein product from Mb1593c detected using SWATH mass spectrometry. Mb1593c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000064" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ27" /protein_id="SIU00196.1" /translation="MKRSMKSGSFAIGLAMMLAPMVAAPGLAAADPATRPVDYQQITD VVIARGLSQRGVPFSWAGGGISGPTRGTGTGINTVGFDASGLIQYAYAGAGLKLPRSS GQMYKVGQKVLPQQARKGDLIFYGPEGTQSVALYLGKGQMLEVGDVVQVSPVRTNGMT PYLVRVLGTQPTPVQQAPVQPAPVQQAPVQQAPVQQAPVQQAPVQQAPVQQAPVQQAP VQPPPFGTARSR" CDS complement(1764635..1764919) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1594C" /product="probable hypothetical membrane protein" /note="Mb1594c, -, len: 94 aa. Equivalent to Rv1567c, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 94 aa overlap). Probable membrane protein. Mb1594c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYQ4" /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ4" /protein_id="SIU00197.1" /translation="MVTMTSWPSRLFAFTDNVCPPDACPLVPFGVNYYIYPVMWGGIG AAIATAVIGPFVSMLKGWYMSFWPIISIAVITVTSIAGYAIAGFSERYWH" CDS 1765167..1766480 /codon_start=1 /transl_table=11 /gene="bioA" /locus_tag="BQ2027_MB1595" /product="adenosylmethionine-8-amino-7-oxononanoate aminotransferase bioa" /note="Mb1595, bioA, len: 437 aa. Equivalent to Rv1568, len: 437 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 437 aa overlap). Probable bioA, adenosylmethionine-8-amino-7-oxononanoate aminotransferase (EC 2.6.1.62). Highly similar to BIOA_MYCLE|P4548 from M. leprae (436 aa), FASTA results: opt: 2534, E(): 0, (85.1% identity in 436 aa overlap). Also similar to other M. tuberculosis proteins e.g. MTCY227.12c (449 aa), FASTA score: E(): 3.5e-16, (29.5% identity in 421 aa overlap). Contains aminotransferases class-III pyridoxal-phosphate attachment site (PS00600). BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Protein product from Mb1595 detected using SWATH mass spectrometry. Mb1595 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4X7" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR005815" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="PDB:5KGS" /db_xref="PDB:5KGT" /db_xref="PDB:5TE2" /db_xref="UniProtKB/Swiss-Prot:P0A4X7" /protein_id="SIU00198.1" /translation="MAAATGGLTPEQIIAVDGAHLWHPYSSIGREAVSPVVAVAAHGA WLTLIRDGQPIEVLDAMSSWWTAIHGHGHPALDQALTTQLRVMNHVMFGGLTHEPAAR LAKLLVDITPAGLDTVFFSDSGSVSVEVAAKMALQYWRGRGLPGKRRLMTWRGGYHGD TFLAMSICDPHGGMHSLWTDVLAAQVFAPQVPRDYDPAYSAAFEAQLAQHAGELAAVV VEPVVQGAGGMRFHDPRYLHDLRDICRRYEVLLIFDEIATGFGRTGALFAADHAGVSP DIMCVGKALTGGYLSLAATLCTADVAHTISAGAAGALMHGPTFMANPLACAVSVASVE LLLGQDWRTRITELAAGLTAGLDTARALPAVTDVRVCGAIGVIECDRPVDLAVATPAA LDRGVWLRPFRNLVYAMPPYICTPAEITQITSAMVEVARLVGSLP" CDS 1766477..1767637 /codon_start=1 /transl_table=11 /gene="bioF1" /locus_tag="BQ2027_MB1596" /product="PROBABLE 8-AMINO-7-OXONONANOATE SYNTHASE BIOF1 (AONS) (8-AMINO-7-KETOPELARGONATE SYNTHASE) (7-KETO-8-AMINO-PELARGONIC ACID SYNTHETASE) (7-KAP SYNTHETASE) (L-ALANINE--PIMELYL CoA LIGASE)" /note="Mb1596, bioF1, len: 386 aa. Equivalent to Rv1569, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 386 aa overlap). Probable bioF1, 8-amino-7-oxononanoate synthase (EC 2.3.1.47), highly similar to BIOF_MYCLE|P45487 from Mycobacterium leprae (385 aa), FASTA results: opt: 1971, E(): 0, (80.1% identity in 381 aa overlap). Also similar to BIOF2|Rv0032|MTCY10H4.32 POSSIBLE 8-AMINO-7-OXONONANOATE SYNTHASE from Mycobacterium tuberculosis (771 aa), FASTA score: E(): 5.5e-29, (37.4% identity in 393 aa overlap). Contains aminotransferases class-II pyridoxal-phosphate attachment site (PS00599). BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Protein product from Mb1596 detected using SWATH mass spectrometry. Mb1596 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4X5" /db_xref="InterPro:IPR001917" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P0A4X5" /protein_id="SIU00199.1" /translation="MKAATQARIDDSPLAWLDAVQRQRHEAGLRRCLRPRPAVATELD LASNDYLGLSRHPAVIDGGVQALRIWGAGATGSRLVTGDTKLHQQFEAELAEFVGAAA GLLFSSGYTANLGAVVGLSGPGSLLVSDARSHASLVDACRLSRARVVVTPHRDVDAVD AALRSRDEQRAVVVTDSVFSADGSLAPVRELLEVCRRHGALLLVDEAHGLGVRGGGRG LLYELGLAGAPDVVMTTTLSKALGSQGGVVLGPTPVRAHLIDAARPFIFDTGLAPAAV GAARAALRVLQAEPWRPQAVLNHAGELARMCGVAAVPDSAMVSVILGEPESAVAAAAA CLDAGVKVGCFRPPTVPAGTSRLRLTARASLNAGELELARRVLTDVLAVARR" CDS 1767634..1768314 /codon_start=1 /transl_table=11 /gene="bioD" /locus_tag="BQ2027_MB1597" /product="dethiobiotin synthetase biod" /note="Mb1597, bioD, len: 226 aa. Equivalent to Rv1570, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 226 aa overlap). Probable bioD, dethiobiotin synthetase (EC 6.3.3.3). Similar to many e.g. BIOD_MYCLE|P45486 from Mycobacterium leprae (223 aa), FASTA results: opt: 1059, E(): 0, (74.8% identity in 222 aa overlap). BELONGS TO THE DETHIOBIOTIN SYNTHETASE FAMILY. Protein product from Mb1597 detected using SWATH mass spectrometry. Mb1597 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O52587" /db_xref="InterPro:IPR004472" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:O52587" /protein_id="SIU00200.1" /translation="MTILVVTGTGTGVGKTVVCAALASAARQAGIDVAVCKPVQTGTA RGDDDLAEVGRLAGVTQLAGLARYPQPMAPAAAAEHAGMALPARDQIVRLIADLDRPG RLTLVEGAGGLLVELAEPGVTLRDVAVDVAAAALVVVTADLGTLNHTKLTLEALAAQQ VSCAGLVIGSWPDPPGLVAASNRSALARIATVRAALPAGAASLDAGDFAAMSAAAFDR NWVAGLVG" CDS 1768314..1768823 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1598" /product="conserved protein" /note="Mb1598, -, len: 169 aa. Equivalent to Rv1571, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 169 aa overlap). Conserved hypothetical protein, similar at N-terminal region to Q49625|LEPB1170_C3_227 hypothetical protein from Mycobacterium leprae (104 aa), FASTA results: opt: 473, E(): 3.9e-24, (74.5% identity in 102 aa overlap). Identical to O06619|AF041819|AF041819_6 Mycobacterium bovis BCG (169 aa). Protein product from Mb1598 detected using SWATH mass spectrometry. Mb1598 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009097" /db_xref="UniProtKB/TrEMBL:A0A1R3XYN9" /protein_id="SIU00201.1" /translation="MVHSIELVFDSDTEAAIRRIWAGLAAAGIPSQAPASRPHVSLAV AERIAPEVDEPLGAVARRLPLDCVIGAPVLFGRANVVFTRLVVPTSELLALHAEVHRL CGPHLAPAPMANSLPGQWTAHVTLARRVGGHQLGRALRIAGRPSRIDGRFAGLRRWDG NTRAEYLLG" CDS complement(1768969..1769073) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1598A" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1598A, -, len: 34 aa. Equivalent to Rv1572c (MTCY336.31B), len: 34 aa, from Mycobacterium tuberculosis strain H37Rv (100.0% identity in 34 aa overlap). Partial ORF, part of REP13E12 repeat element; 3' end of Rv1613c (Rv1587c) (MTCY336.17) after phage-like element (see citation below). Similar to C-terminal ends of other REP13E12 repeat elements e.g. Rv1148, Rv1945, Rv3467, etc. Length extended since first submission (+7 aa). Mb1598A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYP9" /protein_id="SIU00202.1" /translation="MECSSAVHGQPRTNTFHHHEKLLRHNDEDNHDDP" CDS 1769089..1769499 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1599" /product="Probable phiRV1 phage protein" /note="Mb1599, -, len: 136 aa. Equivalent to Rv1573, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 136 aa overlap). Probable phiRv1 phage protein (see citation below). Mb1599 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V5" /protein_id="SIU00203.1" /translation="MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLE VREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINR QLGLAGDDEPDGDDTPPWSRMIGLGGGSPAEDER" CDS 1769705..1770016 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1600" /product="Probable phiRV1 phage related protein" /note="Mb1600, -, len: 103 aa. Equivalent to Rv1574, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 103 aa overlap). Probable phiRV1 phage related protein (see citation below); some similarity to Rv1575|MTCY441.17, E() 1.5e-06; and Rv2647|MTCY336.29c phiRV2 phage protein, E(): 3.5e-05. Belongs to phage phiRv1 proteins. Helix turn helix motif present at aa 14-35 (+3.61 SD). Mb1600 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZM7" /protein_id="SIU00204.1" /translation="MGYKPESERHSTKTDTAIGAALGISAGTYRRLKRIDNATHSDDK EIRRFAEKQMAPLVAGSPSWNARKPRSANARVVASVHRSPMPALVPWNQSRLSATLTR R" CDS 1770121..1770474 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1601" /product="Probable phiRV1 phage protein" /note="Mb1601, -, len: 117 aa. Similar to Rv1575 and Rv1575A, len: 117 aa and 65 aa, from Mycobacterium tuberculosis strain H37Rv, (66.9% identity in 124 aa overlap). Probable phiRV1 phage protein (see citation below). Similarity in N-terminal part to Rv1574|MTCY336.30c (103 aa), FASTA score: E(): 0.00022, (44.0% identity in 50 aa overlap); and Rv2647 phiRV2 phage protein. Similarity suggests should continue as Rv1575A. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1575 and Rv1575A exist as 2 genes with an overlap region between them. In Mycobacterium bovis, a single base deletion (c-*) and a single base insertion (*-g) lead to a single product. Mb1601 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ2" /protein_id="SIU00205.1" /translation="MAPLAAGSPSWNGRKPSSGNRKAATMAARLDILAWGPWAQARIG ASFDENRHCYRRSPRHLRRHLPAAQTNRQRNPQRVGGVGGPAPLSRGRPRLALSYLRG SLHLQNSKRVAHQHI" CDS complement(1770418..1771839) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1602C" /product="Probable phiRV1 phage protein" /note="Mb1602c, -, len: 473 aa. Equivalent to Rv1576c, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 473 aa overlap). Probable phiRV1 phage protein (capsid subunit) (see citation below). Highly similar to hypothetical Mycobacterium tuberculosis protein Rv2650c|MTCY441.19 phiRV2 phage related protein, FASTA scores: opt: 2782, E(): 0, (89.1% identity in 468 aa overlap). Protein product from Mb1602c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1602c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024455" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ36" /protein_id="SIU00206.1" /translation="MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRH AEELRAEQRRRGREAEEELRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDS CVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHT VWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVAR VVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGD AASFVGEIGKILADSVEQLQTAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVA ADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL EVSHMDTVDSAVTATNHPLVLGDWKQFLIVDRVGSMVELVPHLFGPNRRPTGQRGFFA WFRVGSDVLVRNAFRVLKVETTA" CDS complement(1771847..1772359) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1603C" /product="Probable phiRv1 phage protein" /note="Mb1603c, -, len: 170 aa. Equivalent to Rv1577c, len: 170 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 170 aa overlap). Probable phiRv1 phage protein (prohead protease) (see citation below). Highly similar to hypothetical protein Rv2651c|MTCY441.20c phiRV2 prohead protease, FASTA scores: E(): 0, (89.3% identity in 169 aa overlap). Some similarity to VP4_BPHK7|P49860 putative bacteriophage HK97 prohead protease (gp4) (225 aa), FASTA results: opt: 176, E(): 1.3e-05, (27.3% identity in 165 aa overlap). Mb1603c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006433" /db_xref="UniProtKB/TrEMBL:A0A1R3XYR4" /protein_id="SIU00207.1" /translation="MAELRSGEGRTVHGTIVPYNEATTVRDFDGEFQEMFAPGAFRRS IAERGHKLKLLVSHDARTRYPVGRAVELREEPHGLFGAFEIADTPDGDEALANVKAGV VDSFSVGFRPIRDRREGDVLVRVEAALLEVSLTGVPAYSGAQIAGVRAESLTVVSRST AEAWLSLLDW" CDS complement(1772533..1773003) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1604C" /product="Probable phiRv1 phage protein" /note="Mb1604c, -, len: 156 aa. Equivalent to Rv1578c, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 156 aa overlap). Probable phiRv1 phage protein (terminase) (see citation below), highly similar to Rv2652c|MTCY441.21c phiRV2 phage protein from M. tuberculosis, FASTA scores: E(): 4.8e-22, (48.1% identity in 156 aa overlap). Also similar to X65555|ARP3COS_1 hypothetical protein (cos site) - actinophage RP3 (210 aa), FASTA scores: opt: 373, E(): 6.5e-17, (50.0% identity in 114 aa overlap). Contains MIP family signature (PS00221). Protein product from Mb1604c detected using SWATH mass spectrometry. Mb1604c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006448" /db_xref="InterPro:IPR022357" /db_xref="UniProtKB/TrEMBL:A0A1R3XYR7" /protein_id="SIU00208.1" /translation="MPRPPKPARLKLVEGRSPGRDSGGRKVPESPKFIRQAPDAPDWL DAEALAEWRRVAPTLERLDLLKPEDRALLSAYCETWSVYVAAVQRVRAEGLTITSPKS GVVHRNPAVTVAETARMHLLRLASEFGLTPAAEQRLAVAPGDDGDGLNPFAPDR" CDS complement(1773084..1773398) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1605C" /product="Probable phiRv1 phage protein" /note="Mb1605c, -, len: 104 aa. Equivalent to Rv1579c, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 104 aa overlap). Probable phiRv1 phage protein (see citation below). Protein product from Mb1605c detected using shotgun mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3XYR0" /protein_id="SIU00209.1" /translation="MTPINRPLTNDERQLMHELAVQVVCSQTGCSPDAAVEALESFAK DGTLILRGDTENAYLEAGGNVLVHADRDWLAFHASYPGNDPLRDARPIEQDDDQGAGS PS" CDS complement(1773395..1773667) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1606C" /product="Probable phiRv1 phage protein" /note="Mb1606c, -, len: 90 aa. Equivalent to Rv1580c, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 90 aa overlap). Probable phiRv1 phage protein (see citation below). Protein product from Mb1606c detected using SWATH mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3XYR2" /protein_id="SIU00210.1" /translation="MAETPDHAELRRRIADMAFNADVGMATCKRCGDAVPYIILPNLQ TGEPVMGVADNKWKRANCPVDVGKPCPFLIAEGVADSTDDTIEVDQ" CDS complement(1773681..1774076) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1607C" /product="Probable phiRv1 phage protein" /note="Mb1607c, -, len: 131 aa. Equivalent to Rv1581c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 131 aa overlap). Probable phiRv1 phage protein (see citation below)." /db_xref="InterPro:IPR036869" /db_xref="UniProtKB/TrEMBL:A0A1R3XYP8" /protein_id="SIU00211.1" /translation="MTAVAITPASGGRHSVRFAYDSAIVSLIKSTIPAYARSWSAHTR CWFIDADWTPLLAAELRYHGHTVTGPADPAQQQCTDWAKALFRAVGPQRTPAVYRALS KVLHPDAPTGCPILQQQLNAARTALTNPA" CDS complement(1774272..1775687) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1608C" /product="Probable phiRv1 phage protein" /note="Mb1608c, -, len: 471 aa. Equivalent to Rv1582c, len: 471 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 471 aa overlap). Probable phiRv1 phage protein (see citation below). N-terminus is similar to C-terminus of Q38030 ORF9 Bacteriophage phi-C31 (519 aa), FASTA scores: opt: 331, E(): 6.5e-15, (28.5% identity in 235 aa overlap); and C-terminus to whole of Q38031 ORF10 of Bacteriophage phi-C31 (202 aa), FASTA scores: opt: 353, E(): 1e-16, (31.1% identity in 190 aa overlap). Also similar to part of AB016282|AB016282_42 Bacteriophage phi-105 (806 aa), FASTA scores: opt: 790, E(): 0, (32.7% identity in 459 aa overlap). Similarity to other phage proteins described as putative DNA-polymerase or DNA-primase. Also slightly similar to MTCY441.24c, FASTA scores: E(): 0.0055, (36.0% identity in 75 aa overlap). Mb1608c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006500" /db_xref="InterPro:IPR014015" /db_xref="InterPro:IPR014818" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XYQ8" /protein_id="SIU00212.1" /translation="MADIPYGTDYPDAPWIDRDGHVLIDDGGKPTQVHRGQARIAYRL AERYQDKLLHVAGIGWHSWDGRRWAADDRGEAKRAVLAELRQALSDSLNDKELRADVR KCESASGVAGVLDLAAALVPFAATLADLDSDPHLLNVANGTLDLHTLKLRPHAPADRI TKICRGAYQSDTESPLWQAFLTRVLPDEGVRGFVQRLAGVGLLGTVREHVLAILIGVG ANGKSVFDKAIRYALGDYACTAEPDLFMHRENAHPTGEMDLRGVRWVAVSESEKDRRL AESTIKRLTGGDTIRARKMRQDFVEFTPSHTPLLITNHLPRVPGDDTAIWRRIRVVPF EVVIPADEQDRELDARLQLEADSILSWAVAGWSDYQRIGLSQPDAVLAATSNYREDSD TIKRFIDDECVTSSPVLKATTTHLFEAWQRWRVQEGVPEISRKAFGQSLDTHGYPVTD KARDGRWRAGIAVRGADDFDD" CDS complement(1775687..1776085) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1609C" /product="Probable phiRv1 phage protein" /note="Mb1609c, -, len: 132 aa. Equivalent to Rv1583c, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 132 aa overlap). Probable phiRv1 phage protein (see citation below), highly similar to Rv2656c|MTCY441.25c phiRV2 phage protein (130 aa), FASTA score: E(): 1.3e-33, (81.7% identity in 131 aa overlap). Mb1609c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024384" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W3" /protein_id="SIU00213.1" /translation="MTAGAGGSPPTRRCSATEDRAPATVATPSSADPTASRAVSWWSV HEHVAPVLDAAGSWPMAGTPAWRQLDDADPRKWAAICDAARHWALRVETCQEAMAQAS RDVSAAADWPGIAREIVRRRGVYIPRAGVA" CDS complement(1776082..1776303) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1610C" /product="Possible phiRv1 phage protein" /note="Mb1610c, -, len: 73 aa. Equivalent to Rv1584c, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 73 aa overlap). Possible phiRv1 phage protein (putative excisionase) (see citation below). Mb1610c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZN2" /protein_id="SIU00214.1" /translation="MSTIYHHRGRVAALSRSRASDDPEFIAAKTDLVAANIADYLIRT LAAAPPLTDEQRTRLAELLRPVRRSGGAR" CDS complement(1776359..1776874) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1611C" /product="Possible phage phiRv1 protein" /note="Mb1611c, -, len: 171 aa. Equivalent to Rv1585c, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 171 aa overlap). Possible phage phiRv1 protein (see citation below). Protein product from Mb1611c detected using SWATH mass spectrometry. Mb1611c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYR1" /protein_id="SIU00215.1" /translation="MSRHHNIVIVCDHGRKGDGRIEHERCDLVAPIIWVDETQGWLPQ APAVATLLDDDNQPRAVIGLPPNESRLRPEMRRDGWVRLHWEFACLRYGAAGVRTCEQ RPVRVRNGDLQTLCENVPRLLTGLAGNPDYAPGFAVQSDAVVVAMWLWRTLCESDTPN KLRATPTRGSC" CDS complement(1776871..1777935) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1612C" /product="Probable phiRv1 integrase" /note="Mb1612c, -, len: 469 aa. Equivalent to Rv1586c, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 469 aa overlap). Probable phiRv1 integrase, possibly member of the serine family of recombinases (see citation below), similar to several bacteriophage integrases e.g. Q37839 ORF469 PROTEIN from Bacteriophage R4 (469 aa), FASTA scores: opt: 623, E(): 1.6e-29, (31.1% identity in 482 aa overlap); and Bacteriophage TP901-1. Protein product from Mb1612c detected using SWATH mass spectrometry. Mb1612c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ46" /db_xref="InterPro:IPR011109" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ46" /protein_id="SIU00216.1" /translation="MATPQGRLVARLKGSVAAHETEHKKARQRRAARQKAERGHPNWS KAFGYLPGPNGPEPDPRTAPLVKQAYADILAGASLGDVCRQWNDAGAFTITGRPWTTT TLSKFLRKPRNAGLRAYKGARYGPVDRDAIVGKAQWSPLVDEATFWAAQAVLDAPGRA PGRKSVRRHLLTGLAGCGKCGNHLAGSYRTDGQVVYVCKACHGVAILADNIEPILYHI VAERLAMPDAVDLLRREIHDAAEAETIRLELETLYGELDRLAVERAEGLLTARQVKIS TDIVNAKITKLQARQQDQERLRVFDGIPLGTPQVAGMIAELSPDRFRAVLDVLAEVVV QPVGKSGRIFNPERVQVNWR" CDS complement(1777937..1778938) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1613C" /product="Partial REP13E12 repeat protein" /note="Mb1613c, -, len: 333 aa. Equivalent to Rv1587c, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (97.2% identity in 217 aa overlap). Partial REP13E12 repeat protein (see citation below), nearly identical (but has been interrupted by phiRv1 prophage) to Q50655|MTCY251.13c HYPOTHETICAL 34.6 KD PROTEIN from M. tuberculosis (317 aa), FASTA results: opt: 2020, E(): 0, (98.0% identity in 302 aa overlap); Rv0094c|MTCY251.13c (317 aa), FASTA results: E(): 0, (100.0% identity in 217 aa overlap). Codon usage suggests that translation may involve frameshifting of Rv1588c mRNA in poly_C stretch into reading frame of Rv1587c. 3' end found in Rv1572c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv and Mycobacterium bovis, Rv1587 possibly exists as a pseudogene that has been disrupted when the phage entered. Mb1613c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3XYS5" /protein_id="SIU00217.1" /translation="MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGL LAGLRALIASGELGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMTSH AHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTKPGCDAPAYHS QAHHVTGWTSTGRTDITDLTLACDPDNRLAEKGWTTRKNTHGHTEWLPPPHLDHGQPW TCEIHYTCACCCLPPNLRRPLRRTARRGPPTRGLPKAVRAAKMGARRVPRQRRQRINR QAPPRLRADVGRHHRRQDRRRGGLGPGPAPSPSHRAGSLHVISRREAAGPGHRRRRR" repeat_region complement(1778282..1779644) /rpt_family="REP" /note="REP-5, len: 1363 nt. Equivalent to REP, len: 1298 nt, from Mycobacterium tuberculosis strain H37Rv, (96.6% identity in 1363 nt overlap). REP336, member of REP13E12 family." CDS complement(1778943..1779611) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1614C" /product="Partial REP13E12 repeat protein" /note="Mb1614c, -, len: 222 aa. Equivalent to Rv1588c, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (98.2% identity in 222 aa overlap). Partial REP13E12 repeat protein (see citation below), nearly identical to ORF's in other Rep13E12 repeats, including Rv0095c|MTCY251.14c|Y05E_MYCTU|Q10891 hypothetical 15.4 kd protein cy251.14 from Mycobacterium tuberculosis (136 aa), FASTA results: opt: 613, E(): 9.9e-29, (86.5% identity in 111 aa overlap). Mb1614c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/Swiss-Prot:P0A5F2" /protein_id="SIU00218.1" /translation="MLANSREELVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLE CLVRRLPAVGHTLINQLDTQASEEELGGTLCCALANRLRITKPDAALRIADAADLGPR RALTGEPLAPQLTATATAQRQGLIGEAHIKVIRALFRPPARRGGCVHPPGRRSRPGRQ SRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPEQPAIRRHVTAKWLPDPPS AGHL" CDS 1780059..1781108 /codon_start=1 /transl_table=11 /gene="bioB" /locus_tag="BQ2027_MB1615" /product="probable biotin synthetase biob" /note="Mb1615, bioB, len: 349 aa. Equivalent to Rv1589, len: 349 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 349 aa overlap). bioB, biotin synthetase (EC 2.8.1.-) O06601. Highly similar to BIOB_MYCLE|P46715 BioB from Mycobacterium leprae (345 aa), FASTA results: opt: 1982, E(): 0, (86.5% identity in 349 aa overlap). Protein product from Mb1615 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1615 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A507" /db_xref="InterPro:IPR002684" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR010722" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR024177" /db_xref="UniProtKB/Swiss-Prot:P0A507" /protein_id="SIU00219.1" /translation="MTQAATRPTNDAGQDGGNNSDILVVARQQVLQRGEGLNQDQVLA VLQLPDDRLEELLALAHEVRMRWCGPEVEVEGIISLKTGGCPEDCHFCSQSGLFASPV RSAWLDIPSLVEAAKQTAKSGATEFCIVAAVRGPDERLMAQVAAGIEAIRNEVEINIA CSLGMLTAEQVDQLAARGVHRYNHNLETARSFFANVVTTHTWEERWQTLSMVRDAGME VCCGGILGMGETLQQRAEFAAELAELGPDEVPLNFLNPRPGTPFADLEVMPVGDALKA VAAFRLALPRTMLRFAGGREITLGDLGAKRGILGGINAVIVGNYLTTLGRPAEADLEL LDELQMPLKALNASL" CDS 1781109..1781348 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1616" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1616, -, len: 79 aa. Equivalent to Rv1590, len: 79 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 79 aa overlap). Conserved hypothetical protein, similar to Q49616|LEPB1170_C1_162|YF90_MYCLE from Mycobacterium leprae (80 aa), FASTA scores: opt: 368, E(): 1.7e-21, Smith-Waterman score: 368, (67.1% identity in 73 aa overlap). Protein product from Mb1616 detected using SWATH mass spectrometry. Mb1616 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64882" /protein_id="SIU00220.1" /translation="MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFC AQCGRRMVVQVRPDGWWARCSRHGQVDSADLATQR" CDS 1781345..1782010 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1617" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb1617, -, len: 221 aa. Equivalent to Rv1591, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 221 aa overlap). Probable transmembrane protein, similar to Q49626|LEPB1170_C3_229|YF91_MYCLE Hypothetical Mycobacterium leprae protein (198 aa), FASTA results: opt: 802, E(): 0, (63.8% identity in 188 aa overlap). Protein product from Mb1617 detected using SWATH mass spectrometry. Mb1617 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5F4" /db_xref="InterPro:IPR021213" /db_xref="UniProtKB/Swiss-Prot:P0A5F4" /protein_id="SIU00221.1" /translation="MTEPPGFGGPSEPSGAPRTSRTRAVLFVMLGLSATGVLVGGLWA WIAPPIHAVVAITRAGERVHEYLGSESQNFFIAPFMLLGLLSVLAVVASALMWQWREH RGPQMVAGLSIGLTTAAAIAAGVGALVVRLRYGALDFDTVPLSRGDHALTYVTQAPPV FFARRPLQIALTLMWPAGIASLVYALLAAGTARDDLGGYPAVDPSSNARTEALETPQA PVS" CDS complement(1782175..1783515) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1618C" /product="Lipase 1 (EC" /EC_number="3.1.1.3" /note="Mb1618c, -, len: 446 aa. Equivalent to Rv1592c, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 446 aa overlap). Conserved hypothetical protein, some similarity to Q49629|B1170_F1_46 from Mycobacterium leprae (132 aa), FASTA results: opt: 332, E(): 4.5e-14, (56.3% identity in 87 aa overlap). Nearly identical to truncated Mycobacterium bovis BCG protein (148 aa) AF041819|AF041819_11. Protein product from Mb1618c detected using SWATH mass spectrometry. Mb1618c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYR5" /db_xref="InterPro:IPR005152" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XYR5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00222.1" /translation="MVEPGNLAGATGAEWIGRPPHEELQRKVRPLLPSDDPFYFPPAG YQHAVPGTVLRSRDVELAFMGLIPQPVTATQLLYRTTNMYGNPEATVTTVIVPAELAP GQTCPLLSYQCAIDAMSSRCFPSYALRRRAKALGSLTQMELLMISAALAEGWAVSVPD HEGPKGLWGSPYEPGYRVLDGIRAALNSERVGLSPATPIGLWGYSGGGLASAWAAEAC GEYAPDLDIVGAVLGSPVGDLGHTFRRLNGTLLAGLPALVVAALQHSYPGLARVIKEH ANDEGRQLLEQLTEMTTVDAVIRMAGRDMGDFLDEPLEDILSTPEVSHVFGDTKLGSA VPTPPVLIVQAVHDYLIDVSDIDALADSYTAGGANVTYHRDLFSEHVSLHPLSAPMTL RWLTDRFAGKPLTDHRVRTTWPTIFNPMTYAGMARLAVIAAKVITGRKLSRRPL" CDS complement(1783772..1784482) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1619C" /product="Nudix-related transcriptional regulator NrtR" /note="Mb1619c, -, len: 236 aa. Equivalent to Rv1593c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 236 aa overlap). Conserved hypothetical protein, highly similar to Q49628|B1170_F1_44 from Mycobacterium leprae (286 aa), FASTA scores: opt: 1304, E (): 0, (85.4% identity in 233 aa overlap); similar to several putative DNA hydrolases e.g. Q9S233|SCI51.07C from Streptomyces coelicolor (239 aa), FASTA scores: opt: 415, E(): 4.6e-20, (34.8% identity in 221 aa overlap); also similar to P74291|SLR1690 hypothetical protein from synechocystis (261 aa), FASTA scores: opt: 228, E(): 1.4e-17, (31.5% identity in 213 aa overlap). TBparse score is 0.922 Protein product from Mb1619c detected using shotgun mass spectrometry. Mb1619c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0X0" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00223.1" /translation="MAHGSTAHEVLAVVFQVRGVGMSRGAAKPQLNVLLWQRAKEPQR GAWSLPGGRLRNDEDMTSSVRRQLAEKVDLRELAHLEQLAVFSDPHRLPGIRMIASTY LGVVPSPATPELPADTRWHPVSSLPPMAFDHGPMVTHARTRLIAKMSYTNIGFALAPK EFALSTLRDIYGAALGYQVDATNLQRVLARRRVITQTGTIAQSGRSGGRPAALYRFTD SQLRVTDEFAALRPPGQL" CDS 1784531..1785580 /codon_start=1 /transl_table=11 /gene="nadA" /locus_tag="BQ2027_MB1620" /product="Probable quinolinate synthetase nadA" /note="Mb1620, nadA, len: 349 aa. Equivalent to Rv1594, len: 349 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 349 aa overlap). Probable nadA, quinolinate synthetase. Similar to many e.g. Q49622 NADA from Mycobacterium leprae (368 aa), FASTA results: opt: 1994, E(): 0, (84.4% identity in 352 aa overlap). Protein product from Mb1620 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1620 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65498" /db_xref="InterPro:IPR003473" /db_xref="InterPro:IPR023066" /db_xref="InterPro:IPR036094" /db_xref="UniProtKB/Swiss-Prot:P65498" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00224.1" /translation="MTVLNRTDTLVDELTADITNTPLGYGGVDGDERWAAEIRRLAHL RGATVLAHNYQLPAIQDVADHVGDSLALSRVAAEAPEDTIVFCGVHFMAETAKILSPH KTVLIPDQRAGCSLADSITPDELRAWKDEHPGAVVVSYVNTTAAVKALTDICCTSSNA VDVVASIDPDREVLFCPDQFLGAHVRRVTGRKNLHVWAGECHVHAGINGDELADQARA HPDAELFVHPECGCATSALYLAGEGAFPAERVKILSTGGMLEAAHTTRARQVLVATEV GMLHQLRRAAPEVDFRAVNDRASCKYMKMITPAALLRCLVEGADEVHVDPGIAASGRR SVQRMIEIGHPGGGE" CDS 1785580..1787163 /codon_start=1 /transl_table=11 /gene="nadB" /locus_tag="BQ2027_MB1621" /product="Probable L-aspartate oxidase nadB" /note="Mb1621, nadB, len: 527 aa. Equivalent to Rv1595, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 527 aa overlap). Probable nadB, L-aspartate oxidase (EC 1.4.3.16). Similar to many e.g. Q49617 L-ASPARTATE OXIDASE (QUINOLINATE SYNTHETASE) from Mycobacterium leprae (424 aa), FASTA results: opt: 2152, E(): 0, (82.0% identity in 400 aa overlap). Also shows some similarity to Rv1552 frdA from Mycobacterium tuberculosis (583 aa), FASTA results: E(): 1e-10, (35.3% identity in 566 aa overlap). HETERODIMER. THE QUINOLINATE SYNTHETASE COMPLEX CONSISTS OF THE TWO ENZYMES QUINOLINATE SYNTHETASE A AND B. TBparse score is 0.896. Protein product from Mb1621 detected using SWATH mass spectrometry. Mb1621 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65500" /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR005288" /db_xref="InterPro:IPR015939" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR037099" /db_xref="UniProtKB/Swiss-Prot:P65500" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00225.1" /translation="MAGPAWRDAADVVVIGTGVAGLAAALAADRAGRSVVVLSKAAQT HVTATHYAQGGIAVVLPDNDDSVDAHVADTLAAGAGLCDPDAVYSIVADGYRAVTDLV GAGARLDESVPGRWALTREGGHSRRRIVHAGGDATGAEVQRALQDAAGMLDIRTGHVA LRVLHDGTAVTGLLVVRPDGCGIISAPSVILATGGLGHLYSATTNPAGSTGDGIALGL WAGVAVSDLEFIQFHPTMLFAGRAGGRRPLITEAIRGEGAILVDRQGNSITAGVHPMG DLAPRDVVAAAIDARLKATGDPCVYLDARGIEGFASRFPTVTASCRAAGIDPVRQPIP VVPGAHYSCGGIVTDVYGQTELLGLYAAGEVARTGLHGANRLASNSLLEGLVVGGRAG KAAAAHAAAAGRSRATSSATWPEPISYTALDRGDLQRAMSRDASMYRAAAGLHRLCDS LSGAQVRDVACRRDFEDVALTLVAQSVTAAALARTESRGCHHRAEYPCTVPEQARSIV VRGADDANAVCVQALVAVC" CDS 1787163..1788020 /codon_start=1 /transl_table=11 /gene="nadC" /locus_tag="BQ2027_MB1622" /product="Probable nicotinate-nucleotide pyrophosphatase nadC" /note="Mb1622, nadC, len: 285 aa. Equivalent to Rv1596, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 285 aa overlap). Probable nadC, nicotinate-nucleotide pyrophosphatase (EC 2.4.2.19) O06594. Similar to many e.g. ADC_MYCLE|P46714 from Mycobacterium leprae (284 aa), FASTA results: opt: 1418, E(): 0,(79.2% identity in 283 aa overlap). BELONGS TO THE NADC/MODD FAMILY. Protein product from Mb1622 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1622 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ55" /db_xref="InterPro:IPR002638" /db_xref="InterPro:IPR004393" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR022412" /db_xref="InterPro:IPR027277" /db_xref="InterPro:IPR036068" /db_xref="InterPro:IPR037128" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ55" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00226.1" /translation="MGLSDWELAAARAAIARGLDEDLRYGPDVTTLATVPASATTTAS LVTREAGVVAGLDVALLTLDEVLGTNGYRVLDRVEDGARVPPGEALMTLEAQTRGLLT AERTMLNLVGHLSGIATATAAWVDAVRGTKAKIRDTRKTLPGLRALQKYAVRTGGGVN HRLGLGDAALIKDNHVAAAGSVVDALRAVRNAAPDLPCEVEVDSLEQLDAVLPEKPEL ILLDNFAVWQTQTAVQRRDSRAPTVMLESSGGLSLQTAATYAETGVDYLAVGALTHSV RVLDIGLDM" CDS 1788069..1788827 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1623" /product="SAM-dependent methyltransferase" /note="Mb1623, -, len: 252 aa. Equivalent to Rv1597, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 252 aa overlap). Hypothetical unknown protein. Protein product from Mb1623 detected using SWATH mass spectrometry. Mb1623 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYT4" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XYT4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00227.1" /translation="MARTFEDLVAEAASASVGGWDFSWLDGRATEERPSWGYQRQLSQ RLANATAALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVITGD KPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLWDLREHFLGPR EHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGAVIYFLRKVIWFLPDFTVE GYHDRLRALHERIQAEGPFVTYSTRALIEARKPS" CDS complement(1788848..1789258) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1624C" /product="conserved protein" /note="Mb1624c, -, len: 136 aa. Equivalent to Rv1598c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 136 aa overlap). Conserved hypothetical protein, some similarity to O06389|Rv0523c|MTCY25D10.02 from Mycobacterium tuberculosis (131 aa), FASTA scores: E(): 2.2e-09, (38.4% identity in 99 aa overlap); and P95144|MTCY359.02|Rv1871c (129 aa). Protein product from Mb1624c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1624c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYT8" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3XYT8" /protein_id="SIU00228.1" /translation="MSAKDHPNNAPGVPMVFPLWLERLQVKYINRALKPIARYLPGTA TIEHRGRKSGKPYQTIVTAYRKDGVLAIALAHGKTDWVKNVLAAGEADVHFARGVVHV INPRIVPAGSDGQGLPRMARLQLRRIGVFVGDIA" CDS 1789358..1790674 /codon_start=1 /transl_table=11 /gene="hisD" /locus_tag="BQ2027_MB1625" /product="Probable histidinol dehydrogenase HisD (HDH)" /note="Mb1625, hisD, len: 438 aa. Equivalent to Rv1599, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 438 aa overlap). Probable hisD, histidinol dehydrogenase (EC 1.1.1.23) (see citation below) O08396. Similar to many e.g. HISX_MYCSM|P28736 from Mycobacterium smegmatis (445 aa), FASTA results: opt: 2356, E(): 0, (83.1% identity in 437 aa overlap). Contains histidinol dehydrogenase signature (PS00611). Protein product from Mb1625 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1625 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63951" /db_xref="InterPro:IPR001692" /db_xref="InterPro:IPR012131" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR022695" /db_xref="UniProtKB/Swiss-Prot:P63951" /protein_id="SIU00229.1" /translation="MLTRIDLRGAELTAAELRAALPRGGADVEAVLPTVRPIVAAVAE RGAEAALDFGASFDGVRPHAIRVPDAALDAALAGLDCDVCEALQVMVERTRAVHSGQR RTDVTTTLGPGATVTERWVPVERVGLYVPGGNAVYPSSVVMNVVPAQAAGVDSLVVAS PPQAQWDGMPHPTILAAARLLGVDEVWAVGGAQAVALLAYGGTDTDGAALTPVDMITG PGNIYVTAAKRLCRSRVGIDAEAGPTEIAILADHTADPVHVAADLISQAEHDELAASV LVTPSEDLADATDAELAGQLQTTVHRERVTAALTGRQSAIVLVDDVDAAVLVVNAYAA EHLEIQTADAPQVASRIRSAGAIFVGPWSPVSLGDYCAGSNHVLPTAGCARHSSGLSV QTFLRGIHVVEYTEAALKDVSGHVITLATAEDLPAHGEAVRRRFER" CDS 1790671..1791813 /codon_start=1 /transl_table=11 /gene="hisC1" /locus_tag="BQ2027_MB1626" /standard_name="hisC" /product="Probable histidinol-phosphate aminotransferase hisC1" /note="Mb1626, hisC1, len: 380 aa. Equivalent to Rv1600, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 380 aa overlap). Probable hisC1, histidinol-phosphate aminotransferase (EC 2.6.1.9) O06591. Similar to many e.g. HIS8_STRCO|P16246 from Streptomyces coelicolor (369 aa), FASTA results: opt: 1353, E(): 0, (59.0% identity in 356 aa overlap). Some similarity to other Mycobacterium tuberculosis aminotransferases e.g. Rv3772|MTCY13D12.06, FASTA results: E(): 7.4e-25, (33.7% identity in 365 aa overlap). Contains aminotransferases class-II pyridoxal-phosphate attachment site (PS00599). BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Note that previously known as hisC. Protein product from Mb1626 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1626 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A679" /db_xref="InterPro:IPR001917" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR005861" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P0A679" /protein_id="SIU00230.1" /translation="MTRSGHPVTLDDLPLRADLRGKAPYGAPQLAVPVRLNTNENPHP PTRALVDDVVRSVREAAIDLHRYPDRDAVALRADLAGYLTAQTGIQLGVENIWAANGS NEILQQLLQAFGGPGRSAIGFVPSYSMHPIISDGTHTEWIEASRANDFGLDVDVAVAA VVDRKPDVVFIASPNNPSGQSVSLPDLCKLLDVAPGIAIVDEAYGEFSSQPSAVSLVE EYPSKLVVTRTMSKAFAFAGGRLGYLIATPAVIDAMLLVRLPYHLSSVTQAAARAALR HSDDTLSSVAALIAERERVTTSLNDMGFRVIPSDANFVLFGEFADAPAAWRRYLEAGI LIRDVGIPGYLRATTGLAEENDAFLRASARIATDLVPVTRSPVGAP" CDS 1791810..1792442 /codon_start=1 /transl_table=11 /gene="hisB" /locus_tag="BQ2027_MB1627" /product="Probable imidazole glycerol-phosphate dehydratase hisB" /note="Mb1627, hisB, len: 210 aa. Equivalent to Rv1601, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 210 aa overlap). Probable hisB, imidazole glycerol-phosphate dehydratase (EC 4.2.1.19). Similar to many e.g. HIS7_STRCO|P16247 from Streptomyces coelicolor (197 aa),FASTA results: opt: 763, E(): 0, (57.4% identity in 202 aa overlap). BELONGS TO THE IMIDAZOLEGLYCEROL-PHOSPHATE DEHYDRATASE FAMILY. Protein product from Mb1627 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1627 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64369" /db_xref="InterPro:IPR000807" /db_xref="InterPro:IPR020565" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR038494" /db_xref="UniProtKB/Swiss-Prot:P64369" /protein_id="SIU00231.1" /translation="MTTTQTAKASRRARIERRTRESDIVIELDLDGTGQVAVDTGVPF YDHMLTALGSHASFDLTVRATGDVEIEAHHTIEDTAIALGTALGQALGDKRGIRRFGD AFIPMDETLAHAAVDLSGRPYCVHTGEPDHLQHTTIAGSSVPYHTVINRHVFESLAAN ARIALHVRVLYGRDPHHITEAQYKAVARALRQAVEPDPRVSGVPSTKGAL" CDS 1792439..1793059 /codon_start=1 /transl_table=11 /gene="hisH" /locus_tag="BQ2027_MB1628" /product="Probable amidotransferase hisH" /note="Mb1628, hisH, len: 206 aa. Equivalent to Rv1602, len: 206 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 206 aa overlap). Probable hisH, amidotransferase (EC 2.4.2.-). Similar to many e.g. HIS5_STRCO|P16249 from Streptomyces coelicolor (222 aa), FASTA results: opt: 872, E():0, (61.0% identity in 210 aa overlap). Contains glutamine amidotransferases class-I active site (PS00442). BELONGS TO THE HISH FAMILY. Protein product from Mb1628 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1628 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59957" /db_xref="InterPro:IPR010139" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/Swiss-Prot:P59957" /protein_id="SIU00232.1" /translation="MTAKSVVVLDYGSGNLRSAQRALQRVGAEVEVTADTDAAMTADG LVVPGVGAFAACMAGLRKISGERIIAERVAAGRPVLGVCVGMQILFACGVEFGVQTPG CGHWPGAVIRLEAPVIPHMGWNVVDSAAGSALFKGLDVDARFYFVHSYAAQRWEGSPD ALLTWATYRAPFLAAVEDGALAATQFHPEKSGDAGAAVLSNWVDGL" CDS 1793069..1793806 /codon_start=1 /transl_table=11 /gene="hisA" /locus_tag="BQ2027_MB1629" /product="PROBABLE PHOSPHORIBOSYLFORMIMINO-5- AMINOIMIDAZOLE CARBOXAMIDE RIBOTIDE ISOMERASE HISA" /note="Mb1629, hisA, len: 245 aa. Equivalent to Rv1603, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 245 aa overlap). Probable hisA, PHOSPHORIBOSYLFORMIMINO-5-AMINOIMIDAZOLE CARBOXAMIDE RIBOTIDE ISOMERASE (EC 5.3.1.16), similar to many e.g. HIS4_STRCO|P16250 phosphoribosylformimino-5-aminoimidaz from Streptomyces coelicolor (240 aa), FASTA scores: opt: 1081, E(): 0, (69.0% identity in 239 aa overlap). Protein product from Mb1629 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1629 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P60579" /db_xref="InterPro:IPR006062" /db_xref="InterPro:IPR010188" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR023016" /db_xref="UniProtKB/Swiss-Prot:P60579" /protein_id="SIU00233.1" /translation="MMPLILLPAVDVVEGRAVRLVQGKAGSQTEYGSAVDAALGWQRD GAEWIHLVDLDAAFGRGSNHELLAEVVGKLDVQVELSGGIRDDESLAAALATGCARVN VGTAALENPQWCARVIGEHGDQVAVGLDVQIIDGEHRLRGRGWETDGGDLWDVLERLD SEGCSRFVVTDITKDGTLGGPNLDLLAGVADRTDAPVIASGGVSSLDDLRAIATLTHR GVEGAIVGKALYARRFTLPQALAAVRD" CDS 1793814..1794626 /codon_start=1 /transl_table=11 /gene="impA" /locus_tag="BQ2027_MB1630" /product="PROBABLE INOSITOL-MONOPHOSPHATASE IMPA (IMP)" /note="Mb1630, impA, len: 270 aa. Equivalent to Rv1604, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 270 aa overlap). Probable impA, inositol monophosphatase (EC 3.1.3.25), similar to many e.g. AF0059|AF005905_2 inositol monophosphate phosphatase from Mycobacterium smegmatis (276 aa), FASTA scores: opt: 1241, E(): 0, (70.5% identity in 261 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv3137 and Rv2701c. Mb1630 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZP5" /db_xref="InterPro:IPR000760" /db_xref="InterPro:IPR020550" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP5" /protein_id="SIU00234.1" /translation="MHLDSLVAPLVEQASAILDAATALFLVGHRADSAVRKKGNDFAT EVDLAIERQVVAALVAATGIEVHGEEFGGPAVDSRWVWVLDPIDGTINHAAGSPLAAI LLGLLHDGVPVAGLTWMPFTDQRYTAVAGGPLIKNGVPQPPLADAELANVLVGVGTFS ADSRGQFPGRYRLAVLEKLSRVSSRLRMHGSTGIDLVFVADGILGGAISFGGHVWDHA AGVALVRAAGGVVTDLAGQPWTPASRSALAGPLRVHAQILEILGSIGEPEDY" CDS 1794628..1795431 /codon_start=1 /transl_table=11 /gene="hisF" /locus_tag="BQ2027_MB1631" /product="Probable cyclase hisF" /note="Mb1631, hisF, len: 267 aa. Equivalent to Rv1605, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 267 aa overlap). Probable hisF, cyclase involved in histidine biosynthetic pathway, similar to many e.g. AF0304|AF030405_1 Corynebacterium glutamicum cyclase (257 aa), FASTA scores: opt: 1201, E(): 0, (71.9% identity in 256 aa overlap). BELONGS TO THE HISA / HISF FAMILY. Protein product from Mb1631 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1631 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VEW8" /db_xref="InterPro:IPR004651" /db_xref="InterPro:IPR006062" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:Q7VEW8" /protein_id="SIU00235.1" /translation="MYADRDLPGAGGLAVRVIPCLDVDDGRVVKGVNFENLRDAGDPV ELAAVYDAEGADELTFLDVTASSSGRATMLEVVRRTAEQVFIPLTVGGGVRTVADVDS LLRAGADKVAVNTAAIACLDLLADMARQFGSQCIVLSVDARTVPVGSAPTPSGWEVTT HGGRRGTGMDAVQWAARGADLGVGEILLNSMDADGTKAGFDLALLRAVRAAVTVPVIA SGGAGAVEHFAPAVAAGADAVLAASVFHFRELTIGQVKAALAAEGITVR" CDS 1795428..1795775 /codon_start=1 /transl_table=11 /gene="hisI" /locus_tag="BQ2027_MB1632" /product="Probable phosphoribosyl-AMP 1,6 cyclohydrolase hisI" /note="Mb1632, hisI, len: 115 aa. Equivalent to Rv1606, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 115 aa overlap). Probable hisI, phosphoribosyl-AMP 1,6 cyclohydrolase (EC 3.5.4.19), similar to several e.g. X82010|RSHISI_2 HISI from Rhodobacter sphaeroides (119 aa), FASTA scores: opt: 378, E(): 2.8e-21, (52.3% identity in 109 aa overlap); etc. Protein product from Mb1632 detected using SWATH mass spectrometry. Mb1632 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5B4" /db_xref="InterPro:IPR002496" /db_xref="InterPro:IPR026660" /db_xref="InterPro:IPR038019" /db_xref="UniProtKB/Swiss-Prot:P0A5B4" /protein_id="SIU00236.1" /translation="MTLDPKIAARLKRNADGLVTAVVQERGSGDVLMVAWMNDEALAR TLQTREATYYSRSRAEQWVKGATSGHTQHVHSVRLDCDGDAVLLTVDQVGGACHTGDH SCFDAAVLLEPDD" CDS 1795956..1797038 /codon_start=1 /transl_table=11 /gene="chaA" /locus_tag="BQ2027_MB1633" /product="Probable ionic transporter integral membrane protein chaA" /note="Mb1633, chaA, len: 360 aa. Equivalent to Rv1607, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 360 aa overlap). Probable chaA, ionic transporter integral membrane protein, putative calcium/proton antiporter, similar to many e.g. P31801|CHAA_ECOLI CALCIUM/PROTON ANTIPORTER from Escherichia coli (366 aa), FASTA scores: opt: 736, E(): 0, (35.9% identity in 351 aa overlap). Equivalent to Mycobacterium leprae AL049913|MLCB1610_21 (77.7% identity in 364 aa overlap). SEEMS TO BELONG TO THE CaCA FAMILY. Protein product from Mb1633 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1633 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYU2" /db_xref="InterPro:IPR004837" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU2" /protein_id="SIU00237.1" /translation="MLKRVPWTVVLPSLAFVALVLTWGKQIGPVVGLLAAVLLAGAVL AAVNHAEVVAARVGEPFGSLVLAVAVTTIEVALIVALMVSGGDDAATLARDTVFAAVM ITTNGIAGLSLLLGSLRYGVTLFNPHGSGAALATVTTLATLSLVLPTFTTSQSGPELS PGQLIFAGAASLGLYVLFLFTQTVRHRDFFLPVAQKGAVEDDSHADPPSTRAALLSLG LLLVALVAVVGLAKVESPVIEEVVSAAGFPQSFVGVVIATLVLLPETLAAARAARQGR LQTSLNLAYGSAMASIGLTIPTIALASLWLSGPLQLGLGAIQLVLLVLTVVVSVLTVV PGRATRLQGEVHLVLLAAYLFLAVVP" CDS complement(1797073..1797537) /codon_start=1 /transl_table=11 /gene="bcpB" /locus_tag="BQ2027_MB1634C" /product="Probable peroxidoxin bcpB" /note="Mb1634c, bcpB, len: 154 aa. Equivalent to Rv1608c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 154 aa overlap). Probable bcpB, peroxidoxin or bacterioferritin comigratory protein, similar to many, e.g. AE0003|ECAE000335_4 bacterioferritin comigratory protein from Escherichia coli K-12 MG1655 (156 aa), FASTA scores: opt: 329, E(): 1.2e-16, (38.2% identity in 152 aa overlap); Z97179|MLCL383_22 Mycobacterium leprae cosmid L383 (161 aa) (40.2% identity in 132 aa overlap). Also similar to Rv2428 AhpC, alkyl hydroperoxide reductase from Mycobacterium tuberculosis; and other Mycobacterium tuberculosis putative peroxidoxins Rv2521, Rv2238c, Rv1932. Protein product from Mb1634c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1634c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYU8" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR024706" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU8" /protein_id="SIU00238.1" /translation="MKTGDTVADFELPDQTGTPRRLSVLLSDGPVVLFFYPAAMTPGC TKEACHFRDLAKEFAEVRASRVGISTDPVRKQAKFAEVRRFDYPLLSDAQGTVAAQFG VKRGLLGKLMPVKRTTFVIDTDRKVLDVISSEFSMDAHADKALATLRAIRSG" CDS 1797678..1799228 /codon_start=1 /transl_table=11 /gene="trpE" /locus_tag="BQ2027_MB1635" /product="anthranilate synthase component i trpe (glutamine amidotransferase)" /note="Mb1635, trpE, len: 516 aa. Equivalent to Rv1609, len: 516 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 516 aa overlap). Probable trpE, anthranilate synthase component I (EC 4.1.3.27). FASTA best: TRPE_CLOTM|P14953 anthranilate synthase component I from Clostridium thermocellum (494 aa), E(): 0, (42.6% identity in 498 aa overlap). Some similarity to Rv2386c|MTCY253.35, E(): 6.3e-17; and Rv3215|MTCY07D11.11c, E(): 5.7e-15. BELONGS TO THE ANTHRANILATE SYNTHASE COMPONENT I FAMILY. Protein product from Mb1635 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1635 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67002" /db_xref="InterPro:IPR005256" /db_xref="InterPro:IPR005801" /db_xref="InterPro:IPR006805" /db_xref="InterPro:IPR015890" /db_xref="InterPro:IPR019999" /db_xref="UniProtKB/Swiss-Prot:P67002" /protein_id="SIU00239.1" /translation="MHADLAATTSREDFRLLAAEHRVVPVTRKVLADSETPLSAYRKL AANRPGTFLLESAENGRSWSRWSFIGAGAPTALTVREGQAVWLGAVPKDAPTGGDPLR ALQVTLELLATADRQSEPGLPPLSGGMVGFFAYDMVRRLERLPERAVDDLCLPDMLLL LATDVAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVARLDVMTAALGQPLPSTVA TFSRPEPRHRAQRTVEEYGAIVEYLVDQIAAGEAFQVVPSQRFEMDTDVDPIDVYRIL RVTNPSPYMYLLQVPNSDGAVDFSIVGSSPEALVTVHEGWATTHPIAGTRWRGRTDDE DVLLEKELLADDKERAEHLMLVDLGRNDLGRVCTPGTVRVEDYSHIERYSHVMHLVST VTGKLGEGRTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAG NADFAIAIRTALMRNGTAYVQAGGGVVADSNGSYEYNEARNKARAVLNAIAAAETLAA PGANRSGC" CDS 1799218..1799925 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1636" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb1636, -, len: 235 aa. Equivalent to Rv1610, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 235 aa overlap). Possible conserved membrane protein. Equivalent to AL049913|MLCB1610_23 hypothetical protein from Mycobacterium leprae (264 aa), FASTA score: (65.8% identity in 231 aa overlap). Protein product from Mb1636 detected using SWATH mass spectrometry. Mb1636 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYU4" /db_xref="InterPro:IPR011746" /db_xref="InterPro:IPR019051" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU4" /protein_id="SIU00240.1" /translation="MAANAGSVRPNRRARPMIGIAQLLLVVAAGALWMAARLPWVVIG SFDELGPPKEVTLTGASWSTALLPLALLMLAAAVAALAVRGWPLRALAVLLAAASFAV GYLGISLWVVPDVAARGADLAHVPVVTLVGSARHYWGAVAAVLAAVCALLAAVFLMSS AAIRGSAGEDMARYAAPRARRSIARRQHSNAAGRAAPQDDGPDMGPRMSERMIWEALD EGRDPTDREQESDTEGR" CDS 1800015..1800833 /codon_start=1 /transl_table=11 /gene="trpC" /locus_tag="BQ2027_MB1637" /product="Probable indole-3-glycerol phosphate synthase trpC" /note="Mb1637, trpC, len: 272 aa. Equivalent to Rv1611, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 272 aa overlap). Probable trpC, indole-3-glycerol phosphate synthase (EC 4.1.1.48). Similar to Q55508|SLR0546 HYPOTHETICAL 33.0 KD PROTEIN from SYNECHOCYSTIS SP (295 aa), FASTA score: opt: 26, E(): 7.6e-32, (44.2% identity in 265 aa overlap); also similar to TRPC_AZOBR|P26938 indole-3-glycerol-phosphate synthaseindole-3-glycerol-phosphate synthase from Azospirillum brasilense (262 aa), FASTA score: opt: 596, E(): 4.8e-30, (43.8% identity in 258 aa overlap). Equivalent to AL0499 13|MLCB1610_24 from Mycobacterium leprae (272 aa) (90.8% identity in 272 aa overlap). Contains indole-3-glycerol phosphate synthase signature (PS00614). BELONGS TO THE TRPC FAMILY. Protein product from Mb1637 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1637 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A633" /db_xref="InterPro:IPR001468" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR013798" /db_xref="UniProtKB/Swiss-Prot:P0A633" /protein_id="SIU00241.1" /translation="MSPATVLDSILEGVRADVAAREASVSLSEIKAAAAAAPPPLDVM AALREPGIGVIAEVKRASPSAGALATIADPAKLAQAYQDGGARIVSVVTEQRRFQGSL DDLDAVRASVSIPVLRKDFVVQPYQIHEARAHGADMLLLIVAALEQSVLVSMLDRTES LGMTALVEVHTEQEADRALKAGAKVIGVNARDLMTLDVDRDCFARIAPGLPSSVIRIA ESGVRGTADLLAYAGAGADAVLVGEGLVTSGDPRAAVADLVTAGTHPSCPKPAR" CDS 1800902..1802134 /codon_start=1 /transl_table=11 /gene="trpB" /locus_tag="BQ2027_MB1638" /product="tryptophan synthase, beta subunit trpb" /note="Mb1638, trpB, len: 410 aa. Equivalent to Rv1612, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 410 aa overlap). Probable trpB, tryptophan synthase beta chain (EC 4.2.1.20). Equivalent to AL049913|MLCB1610_25 from Mycobacterium leprae (340 aa) (88.5% identity in 331 aa overlap). Similar to others e.g. TRPB_CAUCR|P12290 tryptophan synthase beta chain from Caulobacter crescentus (406 aa), FASTA scores: opt: 1662, E(): 0, (60.6% identity in 404 aa overlap). BELONGS TO THE TRPB FAMILY. TETRAMER OF TWO ALPHA AND TWO BETA CHAINS. Protein product from Mb1638 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1638 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66985" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR006653" /db_xref="InterPro:IPR006654" /db_xref="InterPro:IPR023026" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/Swiss-Prot:P66985" /protein_id="SIU00242.1" /translation="MSAAIAEPTSHDPDSGGHFGGPSGWGGRYVPEALMAVIEEVTAA YQKERVSQDFLDDLDRLQANYAGRPSPLYEATRLSQHAGSARIFLKREDLNHTGSHKI NNVLGQALLARRMGKTRVIAETGAGQHGVATATACALLGLDCVIYMGGIDTARQALNV ARMRLLGAEVVAVQTGSKTLKDAINEAFRDWVANADNTYYCFGTAAGPHPFPTMVRDF QRIIGMEARVQIQGQAGRLPDAVVACVGGGSNAIGIFHAFLDDPGVRLVGFEAAGDGV ETGRHAATFTAGSPGAFHGSFSYLLQDEDGQTIESHSISAGLDYPGVGPEHAWLKEAG RVDYRPITDSEAMDAFGLLCRMEGIIPAIESAHAVAGALKLGVELGRGAVIVVNLSGR GDKDVETAAKWFGLLGND" CDS 1802134..1802946 /codon_start=1 /transl_table=11 /gene="trpA" /locus_tag="BQ2027_MB1639" /product="Probable tryptophan synthase, alpha subunit trpA" /note="Mb1639, trpA, len: 270 aa. Equivalent to Rv1613, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 270 aa overlap). Probable trpA, tryptophan synthase alpha chain (EC 4.2.1.20). FASTA best: O68906|TRPA_MYCIT TRYPTOPHAN SYNTHASE ALPHA CHAIN from Mycobacterium intracellulare (271 aa), opt: 1442, E(): 0, (85.3% identity in 265 aa overlap). Protein product from Mb1639 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1639 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66981" /db_xref="InterPro:IPR002028" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR018204" /db_xref="UniProtKB/Swiss-Prot:P66981" /protein_id="SIU00243.1" /translation="MVAVEQSEASRLGPVFDSCRANNRAALIGYLPTGYPDVPASVAA MTALVESGCDIIEVGVPYSDPGMDGPTIARATEAALRGGVRVRDTLAAVEAISIAGGR AVVMTYWNPVLRYGVDAFARDLAAAGGLGLITPDLIPDEAQQWLAASEEHRLDRIFLV APSSTPERLAATVEASRGFVYAASTMGVTGARDAVSQAAPELVGRVKAVSDIPVGVGL GVRSRAQAAQIAQYADGVIVGSALVTALTEGLPRLRALTGELAAGVRLGMSA" CDS 1802946..1804352 /codon_start=1 /transl_table=11 /gene="lgt" /locus_tag="BQ2027_MB1640" /product="Possible prolipoprotein diacylglyceryl transferases Lgt" /note="Mb1640, lgt, len: 468 aa. Equivalent to Rv1614, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 468 aa overlap). Possible lgt, prolipoprotein diacylglyceryl transferases (EC 2.4.99.-), similar to many prolipoprotein diacylglyceryl transferases. FASTA scores: LGT_STAAU|P52282 prolipoprotein diacylglyceryl transferase from Staphylococcus aureus subsp. (279 aa), opt: 289, E():3.6e-09, (31.5% identity in 257 aa overlap); AL096884|SC4G6_3 cosmid 4G6 from Streptomyces coelicolor (343 aa), opt: 735, E(): 4e-32, (46.5% identity in 391 aa overlap). Protein product from Mb1640 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1640 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEW5" /db_xref="InterPro:IPR001640" /db_xref="UniProtKB/Swiss-Prot:Q7VEW5" /protein_id="SIU00244.1" /translation="MRMLPSYIPSPPRGVWYLGPLPVRAYAVCVITGIIVALLIGDRR LTARGGERGMTYDIALWAVPFGLIGGRLYHLATDWRTYFGDGGAGLAAALRIWDGGLG IWGAVTLGVMGAWIGCRRCGIPLPVLLDAVAPGVVLAQAIGRLGNYFNQELYGRETTM PWGLEIFYRRDPSGFDVPNSLDGVSTGQVAFVVQPTFLYELIWNVLVFVALIYIDRRF IIGHGRLFGFYVAFYCAGRFCVELLRDDPATLIAGIRINSFTSTFVFIGAVVYIILAP KGREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVK AEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAE AASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPDGIRRQ DDFSSRRRRWWRLRRRRQ" CDS 1805028..1805468 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1641" /product="probable membrane protein" /note="Mb1641, -, len: 146 aa. Equivalent to Rv1615, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 146 aa overlap). Probable membrane protein. Protein product from Mb1641 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1641 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYU1" /db_xref="InterPro:IPR007829" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU1" /protein_id="SIU00245.1" /translation="MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPY PQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGL GIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS" CDS 1805458..1805856 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1642" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1642, -, len: 132 aa. Equivalent to Rv1616, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 132 aa overlap). Conserved membrane protein, with some similarity to other hypothetical proteins e.g. AL096884|SC4G6_9 from Streptomyces coelicolor cosmid 4G6 (148 aa), FASTA scores: opt: 245, E(): 1.7e-1 0, (36.7% identity in 128 aa overlap); Q55401|SLL0543 HYPOTHETICAL 16.5 KD PROTEIN from SYNECHOCYSTIS SP (148 aa), FASTA scores: opt: 225, E(): 6.5e-10, (35.9% identity in 117 aa overlap). Has cysteine cluster and contains a rubredoxin signature (PS00202). Mb1642 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ74" /db_xref="InterPro:IPR021215" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ74" /protein_id="SIU00246.1" /translation="MEASGRQRRYAAAGSVVLLAGALGYIGLVDPHNSNSLYPPCLFK LLTGWNCPACGGLRMIHDLLHGELAASINDNVFLLVGVPVLASWVLLRRRHGDLALPI PVMIAVAVAVIAWTVLRNLPGFPLVPTISG" CDS 1805964..1807382 /codon_start=1 /transl_table=11 /gene="pykA" /locus_tag="BQ2027_MB1643" /product="Probable pyruvate kinase pykA" /note="Mb1643, pykA, len: 472 aa. Equivalent to Rv1617, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 472 aa overlap). Probable pykA, pyruvate kinase (EC 2.7.1.40). FASTA best: Q46078 PYRUVATE KINASE from CORYNEBACTERIUM GLUTAMICUM (475 aa), opt: 2221, E(): 0, (72.2% identity in 468 aa overlap). BELONGS TO THE PYRUVATE KINASE FAMILY. Protein product from Mb1643 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1643 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYV1" /db_xref="InterPro:IPR001697" /db_xref="InterPro:IPR011037" /db_xref="InterPro:IPR015793" /db_xref="InterPro:IPR015795" /db_xref="InterPro:IPR015806" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR036918" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/TrEMBL:A0A1R3XYV1" /protein_id="SIU00247.1" /translation="MTRRGKIVCTLGPATQRDDLVRALVEAGMDVARMNFSHGDYDDH KVAYERVRVASDATGRAVGVLADLQGPKIRLGRFASGATHWAEGETVRITVGACEGSH DRVSTTYKRLAQDAVAGDRVLVDDGKVALVVDAVEGDDVVCTVVEGGPVSDNKGISLP GMNVTAPALSEKDIEDLTFALNLGVDMVALSFVRSPADVELVHEVMDRIGRRVPVIAK LDKPEAIDNLEAIVLAFDAVMVARGDLGVELPLEEVPLVQKRAIQMARENAKPVIVAT QMLDSMIENSRPTRAEASDVANAVLDGADALMLSGETSVGKYPLAAVRTMSRIICAVE ENSTAAPPLTHIPRTKRGVISYAARDIGERLDAKALVAFTQSGDTVRRLARLHTPLPL LAFTAWPEVRSQLAMTWGTETFIVPKMQSTDGMIRQVDKSLLELARYKRGDLVVIVAG APPGTVGSTNLIHVHRIGEDDV" CDS 1807390..1808292 /codon_start=1 /transl_table=11 /gene="tesB1" /locus_tag="BQ2027_MB1644" /product="Probable acyl-CoA thioesterase II tesB1" /note="Mb1644, tesB1, len: 300 aa. Equivalent to Rv1618, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 300 aa overlap). Probable tesB1, acyl-CoA thioesterase II (EC 3.1.2.-), similar to other acyl-CoA thioesterases e.g. TESB_ECOLI|P23911 acyl-coa thioesterase II from Escherichia coli (285 aa), FASTA scores: opt: 495, E(): 2.9e-27, (32.5% identity in 283 aa overlap); etc. Also similar to Rv2605c|tesB2 from M. tuberculosis. Protein product from Mb1644 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1644 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYV7" /db_xref="InterPro:IPR003703" /db_xref="InterPro:IPR025652" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR042171" /db_xref="UniProtKB/TrEMBL:A0A1R3XYV7" /protein_id="SIU00248.1" /translation="MPDGKPMSDFDELLAVLDLNAVASDLFTGSHPSKNPLRTFGGQL MAQSFVASSRTLTRHHLPPSAFSVHFINGGDTAKDIEFQVIRLRDERRFANRRVDAVQ DGTLLSSAMVSYMAGGRGLEHALDPPQVAEPHTRPPIGELLRGYEETVPHFVNALQPI EWRYANDPAWIMRDKGDRLAYNRVWVKALGEMPDDPVLHTATLLYSSDTTVLDSVITT HGLSWGFDRIFAASANHSVWFHRQVNFDDWVLYSTSSPVAADSRGLGSGHFFDRSGKL IATVVQEGVLKYFPATPDSAAGRS" CDS 1808350..1809804 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1645" /product="CONSERVED MEMBRANE PROTEIN" /note="Mb1645, -, len: 484 aa. Equivalent to Rv1619, len: 484 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 484 aa overlap). Conserved membrane protein. Some similarity to N-terminus of P94974|Rv1640c|MTCY06H11.04c PROBABLE LYSYL-TRNA SYNTHETASE 2 (EC 6.1.1.6) from Mycobacterium tuberculosis (1172 aa), FASTA scores: E(): 1.4e-16, (28.0% identity in 410 aa overlap); and similar in part to O69916| SC3C8.03C Putative intergral membrane protein from Streptomyces coelicolor cosmid 3C8 (589 aa), FASTA scores: opt: 453 E(): 8.4e-22, (31.3% identity in 313 aa overlap). Mb1645 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYU9" /db_xref="InterPro:IPR024320" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU9" /protein_id="SIU00249.1" /translation="MVAAAGEPLNCQRANPEVTVKLPSADVVPRLRGRQRVVVHVDSR TARCVGALALVCAACWLIALLAGDYRHAQWAVAGRLGWSLTVLAAVAFIARGIFLGRP VTAMHATAAGLFLLAGLAAHVLVADLLGEILIAGSGWALMWPTSAHPRPEDLPRVWAL INATRADSLAPFAMQAGKSHHFSAAGTAALAYRTRIGYAVVSGDPIGDEAQFPQLVAD FAAMCHMHGWRIVVVGCSERRLGLWSDPMVVGQSLRPIPIGRDVVIDVSNFEMTGRRF RNLRQAVKRTHNFGVTTEIVAEQQLDDQRQAELAEVLAASPSGARTDRGFCMNLDGVL EGRYPGIQLIIARDASGRVQGFHRYATAGGGSDMSLDVPWRRRGAPNGIDERLSADMI AAAKDAGVQRLSLAFAAFPDLFGANQLGRLQRVCRALIHILDPLIALESLYRYLRKFH ALDERRYVLISMTQVFALALVLLSLEFVPRRRHL" CDS complement(1809738..1811468) /codon_start=1 /transl_table=11 /gene="cydC" /locus_tag="BQ2027_MB1646C" /product="PROBABLE 'COMPONENT LINKED WITH THE ASSEMBLY OF CYTOCHROME' TRANSPORT TRANSMEMBRANE ATP-BINDING PROTEIN ABC TRANSPORTER CYDC" /note="Mb1646c, cydC, len: 576 aa. Equivalent to Rv1620c, len: 576 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 576 aa overlap). Probable cydC, transmembrane ATP-binding protein ABC transporter involved in transport of component linked with the assembly of cytochrome (see citation below), similar to others e.g. CYDC_ECOLI|P23886 transport ATP-binding protein from Escherichia coli (573 aa), FASTA scores: opt: 631, E(): 1.6e-30, (28.5% identity in 569 aa overlap); C-terminal part of AL034355|SCD78_14 from Streptomyces coelicolor (1172 aa), FASTA scores: opt: 956, E(): 0, (38.8% identity in 554 aa overlap); etc. Contains (PS00211) ABC transporters family signature, and (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Mb1646c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYV3" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR014223" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="UniProtKB/TrEMBL:A0A1R3XYV3" /protein_id="SIU00250.1" /translation="MNRPSAVSRRQRDLLAASGLLGPRLPRILAAVALGVLSLGSALA LAGVSAWLITRAWQMPPVLDLSVAVVAVRAFAISRGVLHYCERLATHDTALRAAGRAR TLIYHRLAHGPAAAAVGLHSGDLAARVGADVDELANMLVRALVPIAVAAVLAVAATAV VAAVSVPAAVVLAVCLLVAGVVAPWLAGRTAAAQEAIARQHRGMRDTSAMIALEHAPE LRVAGALRNVIADSQRRQHAWADALDAAARTGAIAEAMPTAAIGASLLGAVVAGIGMA PTVAPTTLAILMLLPLSAFEATVALPAAAVQLTRSRIAAARLLDLTGSNRVRETESTV SARLPVGTGVLAADVCCGHQEAQSIRVTIDLPPGARLAVTGASGAGKTTLLMTLAGLL PPVHGRVLLDGTNLSDFDEDELRSAVSFFAEDAHIFATTVRDNLLTARGDCPDDELIE ALDRVGLCGWLAGLPEGLSTVLIGGAQAVSAGQRRRLLLARAVLSPARIVLLDEPVEH LDAANADLLRDLLAPNSGIMSAMRTVVVATHHLPNDIQCAELSIATDQRCRRRGTNSS DNNTNASAKT" CDS complement(1811465..1813048) /codon_start=1 /transl_table=11 /gene="cydD" /locus_tag="BQ2027_MB1647C" /product="PROBABLE 'COMPONENT LINKED WITH THE ASSEMBLY OF CYTOCHROME' TRANSPORT TRANSMEMBRANE ATP-BINDING PROTEIN ABC TRANSPORTER CYDD" /note="Mb1647c, cydD, len: 527 aa. Equivalent to Rv1621c, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 527 aa overlap). Probable cydD, transmembrane ATP-binding protein ABC transporter involved in transport of component linked with the assembly of cytochrome (see citation below), similar to others e.g. P94366|CYDC_BACSU TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (567 aa), FASTA scores: opt: 784, E(): 0, (30.1% identity in 535 aa overlap); N-terminal part of AL034355|SCD78_14 from Streptomyces coelicolor (1172 aa), FASTA scores: opt: 1295, E(): 0, (44.6% identity in 534 aa overlap); etc. Also similar to Q11019|Y07D_MYCTU from Mycobacterium tuberculosis (579 aa), FASTA scores: opt: 530, E(): 6.9e-25, (29.1% identity in 530 aa overlap). Contains (PS00211) ABC transporters family signature, and (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Mb1647c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYT5" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR014216" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="InterPro:IPR039421" /db_xref="UniProtKB/TrEMBL:A0A1R3XYT5" /protein_id="SIU00251.1" /translation="MACGVGISGCAIGSAIVLASIVAGVIDPANPGMAGLRRWLGPLS ILLVLWGLRASIQWLQARLAQRGASAVIADLSGQVLTAVTARRPSQLAAQRDAAAVLI TRGLDGLRPYFTGYLPTLLLAAILTPATVAVIGLYDLKSMAIVVITLPLIPIFMVLIG LATTNPSAAALAAMTAVQARLLDLIAGIPTLRALGRASGPEQRIAELSADHRRSAMAT LRIAFLSALVLELLATLGVALVAVGIGLRLVFGEMSLTAGLTVLLLAPEVYWPLRRVG VQFHAAADGRTAADKAFALLGESPSPTPGRRTVTARGGVIRLERLSVRGRDGRAPYDL TADIEPGRVTVLTGRNGAGKSTTLQAIAGLTAPSSGRITVAGVDVTNLAPAAWWRQLS WLPQRPVLVPGTVRHNLVLLGPVDDLERACAAAGFDAVLDELPRGLDTVLGRGGVGLS LGQRQRLGLARALGSPAAVLLLDEPTAHLDARTEQHVLGAIVERARAGATVLVVAHRQ QVAAAGDRVVEVNSDGFRR" CDS complement(1813135..1814175) /codon_start=1 /transl_table=11 /gene="cydB" /locus_tag="BQ2027_MB1648C" /product="Probable integral membrane cytochrome D ubiquinol oxidase (subunit II) cydB (Cytochrome BD-I oxidase subunit II)" /note="Mb1648c, cydB, len: 346 aa. Equivalent to Rv1622c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 346 aa overlap). Probable cydB, cytochrome D ubiquinol oxidase subunit II (EC 1.10.3.-), integral membrane protein, similar to others e.g. P11027|CYDB_ECOLI CYTOCHROME D UBIQUINOL OXIDASE SUBUNIT II from Escherichia coli strain K12 (379 aa), FASTA scores: opt: 519, E(): 0, (32.3% identity in 372 aa overlap); P94365|CYDB_BACSU CYTOCHROME D UBIQUINOL OXIDASE SUBUNIT II from Bacillus subtilis (338 aa), FASTA scores: opt: 824, E(): 0, (39.5% identity in 337 aa overlap); etc. Protein product from Mb1648c detected using SWATH mass spectrometry. Mb1648c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYU5" /db_xref="InterPro:IPR003317" /db_xref="UniProtKB/TrEMBL:A0A1R3XYU5" /protein_id="SIU00252.1" /translation="MVLQELWFGVIAALFLGFFILEGFDFGVGMLMAPFAHVGMGDPE THRRTALNTIGPVWDGNEVWLITAGAAIFAAFPGWYATVFSALYLPLLAILFGMILRA VAIEWRGKIDDPKWRTGADFGIAAGSWLPALLWGVAFAILVRGLPVDANGHVALSIPD VLNAYTLLGGLATAGLFSLYGAVFIALKTSGPIRDDAYRFAVWLSLPVAGLVAGFGLW TQLAYGKDWTWLVLAVAGCAQAAATVLVWRRVSDGWAFMCTLIVVAAVVVLLFGALYP NLVPSTLNPQWSLTIHNASSTPYTLKIMTWVTAFFAPLTVAYQTWTYWVFRQRISAER IPPPTGLARRAP" CDS complement(1814205..1815662) /codon_start=1 /transl_table=11 /gene="cydA" /locus_tag="BQ2027_MB1649C" /standard_name="appC" /product="Probable integral membrane cytochrome D ubiquinol oxidase (subunit I) cydA (Cytochrome BD-I oxidase subunit I)" /note="Mb1649c, cydA, len: 485 aa. Equivalent to Rv1623c, len: 485 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 485 aa overlap). Probable cydA (previously known as appC, but renamed cydA to conform with Mycobacterium smegmatis nomenclature), cytochrome D ubiquinol oxidase subunit I (EC 1.10.3.-), integral membrane protein, similar to others e.g. P26459|APPC_ECOLI|CYXA|CBDA|B0978 CYTOCHROME BD-II OXIDASE SUBUNIT I from Escherichia coli strain K12 (514 aa), FASTA scores: opt: 870, E(): 0, (35.9% identity in 485 aa overlap); AL034355|SCD78_12 from Streptomyces coelicolor (501 aa), FASTA scores: opt: 1099, E(): 0, (48.6% identity in 510 aa overlap); etc. Protein product from Mb1649c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1649c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Z3" /db_xref="InterPro:IPR002585" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z3" /protein_id="SIU00253.1" /translation="MNVVDISRWQFGITTVYHFIFVPLTIGLAPLIAVMQTLWVVTDN PAWYRLTKFFGKLFLINFAIGVATGIVQEFQFGMNWSEYSRFVGDVFGAPLAMEGLAA FFFESTFIGLWIFGWNRLPRLVHLACIWIVAIAVNVSAFFIIAANSFMQHPVGAHYNP TTGRAELSSIVVLLTNNTAQAAFTHTVSGALLTAGTFVAAVSAWWLVRSSTTHADSDT QAMYRPATILGCWVALAATAGLLFTGDHQGKLMFQQQPMKMASAESLCDTQTDPNFSV LTVGRQNNCDSLTRVIEVPYVLPFLAEGRISGVTLQGIRDLQQEYQQRFGPNDYRPNL FVTYWSFRMMIGLMAIPVLFALIALWLTRGGQIPNQRWFSWLALLTMPAPFLANSAGW VFTEMGRQPWVVVPNPTGDQLVRLTVKAGVSDHSATVVATSLLMFTLVYAVLAVIWCW LLKRYIVEGPLEHDAEPAAHGAPRDDEVAPLSFAY" CDS complement(1815773..1816360) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1650C" /product="Probable conserved membrane protein" /note="Mb1650c, -, len: 195 aa. Equivalent to Rv1624c, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 195 aa overlap). Probable membrane protein, first start taken. Some similarity to Rv3155 nuoK, NADH dehydrogenase chain K from Mycobacterium tuberculosis. Also similar to AAK72093.1|AF196488 hypothetical protein from Mycobacterium smegmatis (205 aa). Identities = 117/195 (60%). Mb1650c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZR5" /db_xref="InterPro:IPR005325" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR5" /protein_id="SIU00254.1" /translation="MCHTAPMEPSPVVSPLPRLLPHLWKSTLASGILSLILGVLVLAW PGISILVAAMAFGVYLLITGVAQVAFAFSLHVSAGGRILLFISGAASLILAVLAFRHF GDAVLLLAIWIGIGFIFRGVATTVSAISDPMLPGRGWSIFVGVISLIAGIVVMASPFE SIWILALVVGIWLVVIGACEIASSFAIRKASQTLG" CDS complement(1816389..1817645) /codon_start=1 /transl_table=11 /gene="cya" /locus_tag="BQ2027_MB1651C" /product="MEMBRANE-ANCHORED ADENYLYL CYCLASE CYA (ATP PYROPHOSPHATE-LYASE) (ADENYLATE CYCLASE)" /note="Mb1651c, cya, len: 418 aa. Equivalent to Rv1625c, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 418 aa overlap). cya, membrane-anchored adenylyl cyclase (EC 4.6.1.1) (see citations below). C-terminal half is similar to region in numerous eukaryotic adenylate and guanylate cyclases. N-terminal half hydrophobic. FASTA score: CYG2_RAT|P22717 guanylate cyclase soluble, beta-2 chain (682 aa), FASTA scores: opt: 552, E(): 2.7e-26, (40.3% identity in 226 aa overlap). Some similarity to Rv2435c|MTCY428.11 from Mycobacterium tuberculosis (730 aa), E(): 7e-19. Start changed since first submission (+25 aa). BELONGS TO ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY. Protein product from Mb1651c detected using SWATH mass spectrometry. Mb1651c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A4Y1" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR018297" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/Swiss-Prot:P0A4Y1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00255.1" /translation="MRTQARAPTQHYAESVARRQRVLTITAWLAVVVTGSFALMQLAT GAGGWYIALINVFTAVTFAIVPLLHRFGGLVAPLTFIGTAYVAIFAIGWDVGTDAGAQ FFFLVAAALVVLLVGIEHTALAVGLAAVAAGLVIALEFLVPPDTGLQPPWAMSVSFVL TTVSACGVAVATVWFALRDTARAEAVMEAEHDRSEALLANMLPASIAERLKEPERNII ADKYDEASVLFADIVGFTERASSTAPADLVRFLDRLYSAFDELVDQHGLEKIKVSGDS YMVVSGVPRPRPDHTQALADFALDMTNVAAQLKDPRGNPVPLRVGLATGPVVAGVVGS RRFFYDVWGDAVNVASRMESTDSVGQIQVPDEVYERLKDDFVLRERGHINVKGKGVMR TWYLIGRKVAADPGEVRGAEPRTAGV" tRNA complement(1817790..1817863) /locus_tag="BQ2027_LEUV" /product="tRNA-Leu" /note="leuV, len: 74 nt. Equivalent to leuV, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Leu, anticodon caa." CDS 1817955..1818572 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1652" /product="Probable two-component system transcriptional regulator" /note="Mb1652, -, len: 205 aa. Equivalent to Rv1626, len: 205 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 205 aa overlap). Probable two-component response system transcriptional regulator, similar to many e.g. CHEY_BACSU|P24072 chemotaxis protein chey homolog (119 aa), FASTA scores: opt: 283, E(): 1.6e-16, (43.0% identity in 114 aa overlap). Also similar to AL109732|SC7H2_27 hypothetical protein from Streptomyces coelicolor (218 aa), opt: 880, E(): 0, (69.4% identity in 196 aa overlap). Protein product from Mb1652 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1652 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ84" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR005561" /db_xref="InterPro:IPR008327" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ84" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00256.1" /translation="MTGPTTDADAAVPRRVLIAEDEALIRMDLAEMLREEGYEIVGEA GDGQEAVELAELHKPDLVIMDVKMPRRDGIDAASEIASKRIAPIVVLTAFSQRDLVER ARDAGAMAYLVKPFSISDLIPAIELAVSRFREITALEGEVATLSERLETRKLVERAKG LLQTKHGMTEPDAFKWIQRAAMDRRTTMKRVAEVVLETLGTPKDT" CDS complement(1818640..1819848) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1653C" /product="Probable nonspecific lipid-transfer protein" /note="Mb1653c, -, len: 402 aa. Equivalent to Rv1627c, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 402 aa overlap). Probable nonspecific lipid-transfer protein, similar to many lipid carrier proteins e.g. Q51797 ACETYL CoA SYNTHASE from Pyrococcus furiosus (388 aa), FASTA scores: opt: 400, E(): 3.2e-18, (34.4% identity in 407 aa overlap); etc. Also some similarity to Mycobacterium tuberculosis proteins Rv3523, Rv3540c, Rv0244, Rv2790c, Rv1323, etc. Protein product from Mb1653c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1653c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYW1" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020616" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW1" /protein_id="SIU00257.1" /translation="MRMSAPEPVYILGAGMHPWGKWGNDFTEYGVVAARAALRDAGVD WRHVQLVAGADTIRNGYPGFVAGATFAQKLGWTGVPVSSSYAACASGSQALQSARAQI LAGFCDVALVIGADTTPKGFFAPVGGERKGDPDWQRFHLIGATNTVYFALLARRRMDL YGATVEDFAQVKVKNSRHGLDNPNARYRKENSIDDVLASPVVSDPLRLLDICATSDGA AALIVASKSFTEKHLGSVAGVPSVRAISTVTPKYPQHLPELPDIATDSTAAVPAPERV FKDQILDAAYAEAGIGPEDLSLAEVYDLSTALELDWYEHLGLCPKGEAEALLRSGATT LGGRVPVNPSGGLACFGEAIPAQAIAQVCELTWQLRGQATGRQVADAKVGVTANQGLF GHGSSVIVAR" CDS complement(1819845..1820336) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1654C" /product="Predicted nucleic-acid-binding protein containing a Zn-ribbon" /note="Mb1654c, -, len: 163 aa. Equivalent to Rv1628c, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 163 aa overlap). Conserved hypothetical protein, some similarity to others e.g. Q51796 ACAC PROTEIN in Pyrococcus furiosus (136 aa), FASTA scores: opt: 199, E(): 4.6e-06, (34.7% identity in 121 aa overlap). Protein product from Mb1654c detected using shotgun mass spectrometry. Mb1654c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002878" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR022002" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW6" /protein_id="SIU00258.1" /translation="MPEVTREEPAIDGWFTTDKAGNPHLLGGKCPQCGTYVFPPRADN CPNPACGSDTLESVGLSTRGKLWSYTENRYAPPPPYPAPDPFEPFAVAAVELADEGLI VLGKVVDGTLAADLKVGMEMELTTMPLFADDDGVQRIVYAWRIPSRAGDDAERSDAEE RRR" CDS 1820440..1823154 /codon_start=1 /transl_table=11 /gene="polA" /locus_tag="BQ2027_MB1655" /product="probable dna polymerase i pola" /note="Mb1655, polA, len: 904 aa. Equivalent to Rv1629, len: 904 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 904 aa overlap). polA, DNA polymerase I (EC 2.7.7.7). Has DNA polymerase family A signature (PS00447) at C-terminal end. FASTA best: DPO1_MYCTU|Q07700 DNA polymerase I from Mycobacterium tuberculosis (904 aa). Some similarity to Rv2090|MTCY49.30 (393 aa), E(): 2.2e-18, (38.7% identity in 292 aa overlap). BELONGS TO DNA POLYMERASE TYPE-A FAMILY. Protein product from Mb1655 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1655 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A551" /db_xref="InterPro:IPR001098" /db_xref="InterPro:IPR002298" /db_xref="InterPro:IPR002421" /db_xref="InterPro:IPR002562" /db_xref="InterPro:IPR008918" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR018320" /db_xref="InterPro:IPR019760" /db_xref="InterPro:IPR020045" /db_xref="InterPro:IPR020046" /db_xref="InterPro:IPR029060" /db_xref="InterPro:IPR036279" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/Swiss-Prot:P0A551" /protein_id="SIU00259.1" /translation="MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGL TTNAVYGFTAMLINLLRDEAPTHIAAAFDVSRQTFRLQRYPEYKANRSSTPDEFAGQI DITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRDALQLVSDDVT VLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDPSDNLPGIPGVGEKTAAKW IAEYGSLRSLVDNVDAVRGKVGDALRANLASVVRNRELTDLVRDVPLAQTPDTLRLQP WDRDHIHRLFDDLEFRVLRDRLFDTLAAAGGPEVDEGFDVRGGALAPGTVRQWLAEHA GDGRRAGLTVVGTHLPHGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAK PKALHEAKAAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA ETPQQQQLSLLDDDDTDAETIQTTILRARAVIDLADALDAELARIDSTALLGEMELPV QRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIGKQINLGSPKQLQVVLF DELGMPKTKRTKTGYTTDADALQSLFDKTGHPFLQHLLAHRDVTRLKVTVDGLLQAVA ADGRIHTTFNQTIAATGRLSSTEPNLQNIPIRTDAGRRIRDAFVVGDGYAELMTADYS QIEMRIMAHLSGDEGLIEAFNTGEDLHSFVASRAFGVPIDEVTGELRRRVKAMSYGLA YGLSAYGLSQQLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRR YLPELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASRMLLQVH DELLFEIAPGERERVEALVRDKMGGAYPLDVPLEVSVGYGRSWDAAAH" CDS 1823317..1824762 /codon_start=1 /transl_table=11 /gene="rpsA" /locus_tag="BQ2027_MB1656" /product="30s ribosomal protein s1 rpsa" /note="Mb1656, rpsA, len: 481 aa. Equivalent to Rv1630, len: 481 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 481 aa overlap). Probable rpsA, ribosomal protein S1. FASTA best: RS1_MYCLE|P46836 30s ribosomal protein S1 from Mycobacterium leprae (482 aa), opt: 2655, E(): 0, (87.2% identity in 483 aa overlap). Protein product from Mb1656 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1656 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYW3" /db_xref="InterPro:IPR003029" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR022967" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00260.1" /translation="MPSPTVTSPQVAVNDIGSSEDFLAAIDKTIKYFNDGDIVEGTIV KVDRDEVLLDIGYKTEGVIPARELSIKHDVDPNEVVSVGDEVEALVLTKEDKEGRLIL SKKRAQYERAWGTIEALKEKDEAVKGTVIEVVKGGLILDIGLRGFLPASLVEMRRVRD LQPYIGKEIEAKIIELDKNRNNVVLSRRAWLEQTQSEVRSEFLNNLQKGTIRKGVVSS IVNFGAFVDLGGVDGLVHVSELSWKHIDHPSEVVQVGDEVTVEVLDVDMDRERVSLSL KATQEDPWRHFARTHAIGQIVPGKVTKLVPFGAFVRVEEGIEGLVHISELAERHVEVP DQVVAVGDDAMVKVIDIDLERRRISLSLKQANEDYTEEFDPAKYGMADSYDEQGNYIF PEGFDAETNEWLEGFEKQRAEWEARYAEAERRHKMHTAQMEKFAAAETAGRGADDQSS ASSAPSEKTAGGSLASDAQLAALREKLAGSA" CDS 1824788..1826011 /codon_start=1 /transl_table=11 /gene="coaE" /locus_tag="BQ2027_MB1657" /product="Probable dephospho-CoA kinase coaE (dephosphocoenzyme A kinase)" /note="Mb1657, coaE, len: 407 aa. Equivalent to Rv1631, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 407 aa overlap). Probable coaE, dephospho-CoA kinase (EC 2.7.1.24), similar to many e.g. Q50178|ML1383|COAE_MYCLE DEPHOSPHO-COA KINASE from Mycobacterium leprae (410 aa), FASTA scores: E(): 0, (77.5% identity in 409 aa overlap). Has ATP/GTP-binding site motif A (P-loop, PS00017) at N-terminus. IN THE N-TERMINAL SECTION; BELONGS TO THE COAE FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE UPF0157 (GRPB) FAMILY. Protein product from Mb1657 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1657 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63827" /db_xref="InterPro:IPR001977" /db_xref="InterPro:IPR007344" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63827" /protein_id="SIU00261.1" /translation="MLRIGLTGGIGAGKSLLSTTFSQCGGIVVDGDVLAREVVQPGTE GLASLVDAFGRDILLADGALDRQALAAKAFRDDESRGVLNGIVHPLVARRRSEIIAAV SGDAVVVEDIPLLVESGMAPLFPLVVVVHADVELRVRRLVEQRGMAEADARARIAAQA SDQQRRAVADVWLDNSGSPEDLVRRARDVWNTRVQPFAHNLAQRQIARAPARLVPADP SWPDQARRIVNRLKIACGHKALRVDHIGSTAVSGFPDFLAKDVIDIQVTVESLDVADE LAEPLLAAGYPRLEHITQDTEKTDARSTVGRYDHTDSAALWHKRVHASADPGRPTNVH LRVHGWPNQQFALLFVDWLAANPGAREDYLTVKCDADRRADGELARYVTAKEPWFLDA YQRAWEWADAVHWRP" CDS complement(1826162..1826605) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1658C" /product="HYPOTHETICAL PROTEIN" /note="Mb1658c, -, len: 147 aa. Equivalent to Rv1632c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 147 aa overlap). Hypothetical unknown protein. Protein product from Mb1658c detected using SWATH mass spectrometry. Mb1658c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007295" /db_xref="InterPro:IPR014465" /db_xref="InterPro:IPR035930" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW0" /protein_id="SIU00262.1" /translation="MRAVDEYTVHPWGLYLARPTPGRAQFHYLESWLLPSLGLRATVF HFNPSHKRDHDYYLDVGEYTPGPSVWRSEDHYLDIEVRTGGGAELADVDELLDAVRHG LLTPTVAEQAVRHAVDAVEGLARNGYDLTRWLATKGMELTWRSGS" CDS 1826850..1828946 /codon_start=1 /transl_table=11 /gene="uvrB" /locus_tag="BQ2027_MB1659" /product="probable excinuclease abc (subunit b-helicase) uvrb" /note="Mb1659, uvrB, len: 698 aa. Equivalent to Rv1633, len: 698 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 698 aa overlap). uvrB, Excinuclease abc subunit B, has ATP/GTP-binding site motif A (P-loop; PS00017) near N-terminus. FASTA best: UVRB_MICLU|P10125 from Micrococcus luteus (709 aa), opt: 3268, E(): 0, (71.3% identity in 704 aa overlap). Also similar to M. tuberculosis Rv2973c (recG); and Rv1020 (mfd). BELONGS TO THE UVRB FAMILY. Protein product from Mb1659 detected using SWATH mass spectrometry. Mb1659 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67423" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR001943" /db_xref="InterPro:IPR004807" /db_xref="InterPro:IPR006935" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR024759" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036876" /db_xref="InterPro:IPR041471" /db_xref="UniProtKB/Swiss-Prot:P67423" /protein_id="SIU00263.1" /translation="MRAGGHFEVVSPHAPAGDQPAAIDELERRINAGERDVVLLGATG TGKSATTAWLIERLQRPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQPEA YIAQTDTYIEKDSSINDDVERLRHSATSALLSRRDVVVVASVSCIYGLGTPQSYLDRS VELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGDTVEIIPSYEELAVRIEFF GDEIEALYYLHPLTGEVIRQVDSLRIFPATHYVAGPERMAHAVSAIEEELAERLAELE SQGKLLEAQRLRMRTNYDIEMMRQVGFCSGIENYSRHIDGRGPGTPPATLLDYFPEDF LLVIDESHVTVPQIGGMYEGDISRKRNLVEYGFRLPSACDNRPLTWEEFADRIGQTVY LSATPGPYELSQTGGEFVEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRV LVTTLTKKMAEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDVLVGINLLR EGLDLPEVSLVAILDADKEGFLRSSRSLIQTIGRAARNVSGEVHMYADKITDSMREAI DETERRRAKQIAYNEANGIDPQPLRKKIADILDQVYREADDTAVVEVGGSGRNASRGR RAQGEPGRAVSAGVFEGRDTSAMPRAELADLIKDLTAQMMAAARDLQFELAARFRDEI ADLKRELRGMDAAGLK" CDS 1828943..1830358 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1660" /product="Possible drug efflux membrane protein" /note="Mb1660, -, len: 471 aa. Equivalent to Rv1634, len: 471 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 471 aa overlap). Possible drug efflux membrane protein of major facilitator superfamily (MFS), similar to many antibiotic resistance (efflux) proteins. FASTA best: Q56175 TU22 DTDP-GLUCOSE DEHYDRTATASE (GRAE) from Streptomyces violaceoruber (557 aa), opt: 415, E(): 1.7e-17, (26.7% identity in 446 aa overlap). Relatives in Mycobacterium tuberculosis: MTCY369.27c, E(): 4.8e-12; MTCY20B11.14c, E(): 2.9e-10. Mb1660 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZR8" /db_xref="InterPro:IPR001411" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR8" /protein_id="SIU00264.1" /translation="MTETASETGSWRELLSRYLGTSIVLAGGVALYATNEFLTISLLP STIADIGGSRLYAWVTTLYLVGSVVAATTVNTMLLRVGARSSYLMGLAVFGLASLVCA AAPSMQILVAGRTLQGIAGGLLAGLGYALINSTLPKSLWTRGSALVSAMWGVATLIGP ATGGLFAQLGLWRWAFGVMTLLTALMAMLVPVALGAGRVGPGGETPVGSTHKVPVWSL LLMGAAALAISVAALPNYLVQTAGLLAAAALLVAVFVVVDWRIHAAVLPPSVFGSGPL KWIYLTMSVQMIAAMVDTYVPLFGQRLGHLTPVAAGFLGAALAVGWTVGEVASASLNS ARVIGHVVAAAPLVMASGLALGAVTQRADAPVGIIALWALALLIIGTGIGIAWPHLTV RAMDSVADPAESSAAAAAINVVQLISGAFGAGLAGVVVNTAKGGEVAAARGLYMAFTV LAAAGVIASYQATHRDRRLPR" CDS complement(1830347..1832017) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1661C" /product="probable mannosyltransferase. probable conserved transmembrane protein." /note="Mb1661c, -, len: 556 aa. Equivalent to Rv1635c, len: 556 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 556 aa overlap). Membrane protein equivalent to CAC31770.1|AL583921 Mycobacterium leprae membrane protein (527 aa), Identities = 332/527 (62%). Mb1661c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYV8" /db_xref="InterPro:IPR038731" /db_xref="UniProtKB/TrEMBL:A0A1R3XYV8" /protein_id="SIU00265.1" /translation="MHASRPGAPPHAGLPSRRTAGDQDHRADPKVTRIMSASTLEQPA AAHVDELVARMRGRLLDPLAIAVLAAVISGAWASRPSLWFDEGATISASASRTLPELW SLLGHIDAVHGLYYLLMHGWFAIFPPTELWSRLPSCLAIGAAAAGVVVFAKQFSGRTT AVCAGAVFAILPRVTWAGIEARSSALSVAAAVWLTVLLVAAVRCNTQRRWLLYALVLM LSILVSINLALLVPAYATMVPLLASGKSRKSPVIWWTVVTAAALGAMTPFILFAHGQV WQVGWIAGLNRNIILDVIHRQYFDHSVPFAILAGLIVAAGIAAHLAGARGPGGDTHRL VLVSAAWIVVPTAVVLIYSATVEPIYYPRYLILTAPAAAVILAVCVVTIARKPWLIAG VVFLLAAAAFPNYFFTQRGPYAKEGWDYSQVADVISAHAKPGDCLLVDNTAGWRPGPI RALLATRPAAFRSLIDVERGTYGPKVGTLWDGHVAVWLTTAKIDKCPTLWTIANRDKS LPDHQVGEMLSPGTGFGRTPVYRFPSYLGFRIVERWQFHYSQVVKSTR" CDS 1832226..1832666 /codon_start=1 /transl_table=11 /gene="TB15.3" /locus_tag="BQ2027_MB1662" /product="iron-regulated universal stress protein family protein tb15.3" /note="Mb1662, TB15.3, len: 146 aa. Equivalent to Rv1636, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 146 aa overlap). TB15.3, conserved hypothetical protein (see citations below), similar to other hypothetical proteins from diverse organisms e.g. Q57951|MJ0531|Y531_METJA from Methanococcus jannaschii (170 aa), FASTA scores: opt: 188, E(): 6e-06, (32.2% identity in 149 aa overlap); also P42297|YXIE_BACSU hypothetical 15.9 kd protein in bglh-wapa intergenic region precursor from Bacillus subtilis (148 aa), FASTA scores: opt: 162, E(): 0.00025, (30.8% identity in 156 aa overlap). Part of family of Mycobacterium tuberculosis hypothetical proteins (but lacks C-terminal region) including Rv2005c, Rv2623, Rv2026c, Rv1996, etc. Protein product from Mb1662 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1662 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="InterPro:IPR014729" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ92" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00266.1" /translation="MSAYKTVVVGTDGSDSSMRAVDRAAQIAGADAKLIIASAYLPQH EDARAADILKDESYKVTGTAPIYEILHDAKERAHNAGAKNVEERPIVGAPVDALVNLA DEEKADLLVVGNVGLSTIAGRLLGSVPANVSRRAKVDVLIVHTT" CDS complement(1832673..1833467) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1663C" /product="MBL-fold metallo-hydrolase superfamily" /note="Mb1663c, -, len: 264 aa. Equivalent to Rv1637c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 264 aa overlap). Conserved hypothetical protein, some similarity to others e.g. P05446|GLO2_RHOBL PROBABLE HYDROXYACYLGLUTATHIONE HYDROLASE (EC 3.1.2.6) (255 aa), FASTA scores: opt: 252, E(): 2e-09, (39.0% identity in 146 aa overlap). Also similar to Q9Z505|AL035591|SCC54.20 putative hydrolase from Streptomyces coelicolor (218 aa), FASTA scores: opt: 732, E(): 0, (52.3% identity in 220 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins and putative glyoxylases e.g. Rv0634c, Rv3677c, Rv2581c, Rv2260. Protein product from Mb1663c detected using shotgun mass spectrometry. Mb1663c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3XYW9" /protein_id="SIU00267.1" /translation="MLCARTDNHQGTGNVVTSAHMTRANDDDAGAAGIGAVAHMTTVD DNYTGHVERGKAARRFLPGATILKASVGPMDNNAYLVTCSATGETLLIDAANDAEVLI DLVRRYAPKLALIVTSHQHFDHWQALQAVAAATGAPTAAHPIDADPLPVKPDRLLTHG DSVRIGELTFDVIHLRGHTPGSIALALGGPVTGGVTQLFTGDCLFPGGVGKTWQPADF TQLLDDVTTRVFDVYADSTVIYPGHGDDTELGAERPSLSEWRARGW" CDS 1833516..1836434 /codon_start=1 /transl_table=11 /gene="uvrA" /locus_tag="BQ2027_MB1664" /product="probable excinuclease abc (subunit a-dna-binding atpase) uvra" /note="Mb1664, uvrA, len: 972 aa. Equivalent to Rv1638, len: 972 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 972 aa overlap). uvrA, Excinuclease ABC subunit A, similar to many e.g. UVRA_ECOLI|P07671 excinuclease abc subunit a from Escherichia coli (940 aa), FASTA scores: opt: 2573, E(): 0, (56.2% identity in 951 aa overlap). Contains 2x PS00017 ATP/GTP-binding site motif A, PS00211 ABC transporters family signature, PS00211 ABC transporters family signature. CONSISTS OF THREE SUBUNITS; UVRA, UVRB AND UVRC. BELONGS TO THE ABC TRANSPORTER FAMILY. UVRA SUBFAMILY. Protein product from Mb1664 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1664 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63381" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR004602" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041102" /db_xref="InterPro:IPR041552" /db_xref="UniProtKB/Swiss-Prot:P63381" /protein_id="SIU00268.1" /translation="MADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLAFD TIFAEGQRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTNRNPRSTVGTI TEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLAMPEGTRFLVLAPVVRTRK GEFADLFDKLNAQGYSRVRVDGVVHPLTDPPKLKKQEKHDIEVVVDRLTVKAAAKRRL TDSVETALNLADGIVVLEFVDHELGAPHREQRFSEKLACPNGHALAVDDLEPRSFSFN SPYGACPECSGLGIRKEVDPELVVPDPDRTLAQGAVAPWSNGHTAEYFTRMMAGLGEA LGFDVDTPWRKLPAKARKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMS QTESEQMKERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSIAEVCELSI ADCADFLNALTLGPREQAIAGQVLKEIRSRLGFLLDVGLEYLSLSRAAATLSGGEAQR IRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIETLTRLRDLGNTLIVVEHDEDTIE HADWIVDIGPGAGEHGGRIVHSGPYDELLRNKDSITGAYLSGRESIEIPAIRRSVDPR RQLTVVGAREHNLRGIDVSFPLGVLTSVTGVSGSGKSTLVNDILAAVLANRLNGARQV PGRHTRVTGLDYLDKLVRVDQSPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQ PGRFSFNVKGGRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVHYKGKTV SEVLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPTLSGGEAQRVKLASE LQKRSTGRTVYILDEPTTGLHFDDIRKLLNVINGLVDKGNTVIVIEHNLDVIKTSDWI IDLGPEGGAGGGTVVAQGTPEDVAAVPASYTGKFLAEVVGGGASAATSRSNRRRNVSA " CDS complement(1836491..1836748) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1665C" /product="Phage shock protein A (IM30), suppresses sigma54-dependent transcription" /note="Mb1665c, -, len: 85 aa. Equivalent to Rv1638A, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 85 aa overlap). Conserved hypothetical protein, similar to C-terminal part of P31511|35KD_MYCTU 35kd immunogenic protein from Mycobacterium tuberculosis (270 aa), FASTA scores: opt: 159, E(): 0.002, (50.90% identity in 55 aa overlap); and to Mycobacterium leprae ML0981 possible pseudogene, an orthologue of 35kd immunogenic protein from M. tuberculosis. Size difference suggests possible gene fragment. Protein product from Mb1665c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1665c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XYX0" /protein_id="SIU00269.1" /translation="MPDEPTPPEATTPNSESDPRYDSAGVPTFESVREKIETRYGTAL GATELDAESPQGRRLEDQYAQRQRAAAERLAQIRESMHTDE" CDS complement(1836764..1838233) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1666C" /product="CONSERVED HYPOTHETICAL MEMBRANE PROTEIN" /note="Mb1666c, -, len: 489 aa. Equivalent to Rv1639c, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 489 aa overlap). Conserved hypothetical membrane protein. Some similarity to P35866|YLI2_CORGL Hypothetical 45.7 kd protein from Corynebacterium glutamicum (426 aa), FASTA scores: opt: 511, E(): 2.4e-23, (28.9% identity in 370 aa overlap). Contains PS00904 protein phenyltransferases alpha subunit repeat signature. Mb1666c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYX3" /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XYX3" /protein_id="SIU00270.1" /translation="MAQNELVTASTPPAATQPLAVGHTSLMHGWVPLAVQVVTAVVLV LAAGWRSRHWQRRWLPTAAAIGATLAWGTRWYVTGNGLANERPPSTLWIWVALTGAAA TVLILGWRSARWWRRGASLLAVPLCLLSATLTLNLWVGYFPTVQTAWNQLTSGPLPDQ ADQAAVAALAHSGVRPSHGTLLPVVIPSDASHFKHRGELVYLPPAWFDREHRSENPPP PQLPTVMMIGGQFNTPADWARAGNAVKTLDDFAAAHSGNAPVVVFVDSGGAFNNDTEC VNGRRGNAADHLTKDVVPYMVSKFGVSPEQTSWGIVGWSMGGTCAVDLTVMHPTLFSA FVDIAGDFYPNAGNKTQTIVRLFGGNEDAWSAFDPTTVITRHGSYTGLSGWFAISSPG PPSPDNAVADTTTMRLAGRDAAANPGNQAAAANALCALGRANGIYCAVVPQPGKHDWP FADRVFAAALPWLAGQLATPGVPKIPLPGTTQQIAGTGR" CDS complement(1838292..1841810) /codon_start=1 /transl_table=11 /gene="lysX" /locus_tag="BQ2027_MB1667C" /product="lysyl-trna synthetase 2 lysx" /note="Mb1667c, lysX, len: 1172 aa. Equivalent to Rv1640c, len: 1172 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1172 aa overlap). Probable two domain protein, possible lysyl-tRNA synthetase 2 (EC 6.1.1.6). N-terminal part (bases 1850153 to 1852033) is similar to AL023861|SC3C8_3 hypothetical membrane protein from Streptomyces coelicolor (589 aa), Fasta scores: opt: 1426, E(): 0, (44.6% identity in 585 aa overlap). The C-terminal part is similar to SYK_CRILO|P37879 lysyl-tRNA synthetases (EC 6.1.1.6) from Cricetulus longicaudatus (Long-tailed hamster) (597 aa), Fasta scores, opt: 985, E(): 0, (36.8% identity in 524 aa overlap). Contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1, PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. This may indicate a frame shift but sequence has been checked and no error found. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb1667c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1667c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEV7" /db_xref="InterPro:IPR002313" /db_xref="InterPro:IPR004364" /db_xref="InterPro:IPR004365" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR018149" /db_xref="InterPro:IPR024320" /db_xref="InterPro:IPR031553" /db_xref="UniProtKB/Swiss-Prot:Q7VEV7" /protein_id="SIU00271.1" /translation="MGLHLTVPGLRRDGRGVQSNSHDTSSKTTADISRCPQHTDAGLQ RAATPGISRLLGISSRSVTLTKPRSATRGNSRYHWVPAAAGWTVGVIATLSLLASVSP LIRWIIKVPREFINDYLFNFPDTNFAWSFVLALLAAALTARKRIAWLVLLANMVLAAV VNAAEIAAGGNTAAESFGENLGFAVHVVAIVVLVLGYREFWAKVRRGALFRAAAVWLA GAVVGIVASWGLVELFPGSLAPDERLGYAANRVVGFALADPDLFTGRPHVFLNAIFGL FGAFALIGAAIVLFLSQRADNALTGEDESAIRGLLDLYGKDDSLGYFATRRDKSVVFA SSGRACITYRVEVGVCLASGDPVGDHRAWPQAVDAWLRLCQTYGWAPGVMGASSQGAQ TYREAGLTALELGDEAILRPADFKLSGPEMRGVRQAVTRARRAGLTVRIRRHRDIAED EMAQTITRADSWRDTETERGFSMALGRLGDPADSDCLLVEAIDPHNQVLAMLSLVPWG TTGVSLDLMRRSPQSPNGTIELMVSELALHAESLGITRISLNFAVFRAAFEQGAQLGA GPVARLWRGLLVFFSRWWQLETLYRSNMKYQPEWVPRYACYEDARVIPRVGVASVIAE GFLVLPFSRRNRVHTGHHPAVPERLAATGLLHHDGSAPDVSGLRQVGLTNGDGVERRL PEQVRVRFDKLEKLRSSGIDAFPVGRPPSHTVAQALAADHQASVSVSGRIMRIRNYGG VLFAQLRDWSGEMQVLLDNSRLDQGCAADFNAATDLGDLVEMTGHMGASKTGTPSLIV SGWRLIGKCLRPLPNKWKGLLDPEARVRTRYLDLAVNAESRALITARSSVLRAVRETL FAKGFVEVETPILQQLHGGATARPFVTHINTYSMDLFLRIAPELYLKRLCVGGVERVF ELGRAFRNEGVDFSHNPEFTLLEAYQAHAGYLEWIDGCRELIQNAAQAANGAPIAMRP RTDKGSDGTRHHLEPVDISGIWPVRTVHDAISEALGERIDADTGLTTLRKLCDAAGVP YRTQWDAGAVVLELYEHLVECRTEQPTFYIDFPTSVSPLTRPHRSKRGVAERWDLVAW GIELGTAYSELTDPVEQRRRLQEQSLLAAGGDPEAMELDEDFLQAMEYAMPPTGGLGM GIDRVVMLITGRSIRETLPFPLAKPH" CDS 1842047..1842652 /codon_start=1 /transl_table=11 /gene="infC" /locus_tag="BQ2027_MB1668" /product="PROBABLE INITIATION FACTOR IF-3 INFC" /note="Mb1668, infC, len: 201 aa. Equivalent to Rv1641, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 201 aa overlap). Probable infC, initiation factor IF-3, similar to many e.g. IF3_BACST|P03000 initiation factor IF-3 from Bacillus stearothermophilus (171 aa), FASTA scores: opt: 560, E(): 1.9e-27, (50.6% identity in 166 aa overlap). Note that an AUC initiation codon has been used, the Bacillus (IF3_BACSU) and Escherichia coli (IF3_ECOLI) proteins use an AUU initiation codon, and the Myxococcus xanthus (DSG_MYXXA) homolog uses a AUC. BELONGS TO THE IF-3 FAMILY. Protein product from Mb1668 detected using shotgun mass spectrometry. Mb1668 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65136" /db_xref="InterPro:IPR001288" /db_xref="InterPro:IPR019813" /db_xref="InterPro:IPR019814" /db_xref="InterPro:IPR019815" /db_xref="InterPro:IPR036787" /db_xref="InterPro:IPR036788" /db_xref="UniProtKB/Swiss-Prot:P65136" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00272.1" /translation="MSTETRVNERIRVPEVRLIGPGGEQVGIVRIEDALRVAADADLD LVEVAPNARPPVCKIMDYGKYKYEAAQKARESRRNQQQTVVKEQKLRPKIDDHDYETK KGHVVRFLEAGSKVKVTIMFRGREQSRPELGYRLLQRLGADVADYGFIETSAKQDGRN MTMVLAPHRGAKTRARARHPGEPAGGPPPKPTAGDSKAAPN" CDS 1842702..1842896 /codon_start=1 /transl_table=11 /gene="rpmI" /locus_tag="BQ2027_MB1669" /product="50S ribosomal protein L35 rpmI" /note="Mb1669, rpmI, len: 64 aa. Equivalent to Rv1642, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 64 aa overlap). rpmI, 50S ribosomal protein L35, similar to several e.g. RL35_SYNY3|P48959 from Synechocystis sp. (67 aa), fasta scores: opt: 179, E(): 2.7e-08, (51.6% identity in 64 aa overlap). BELONGS TO THE L35P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb1669 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1669 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66272" /db_xref="InterPro:IPR001706" /db_xref="InterPro:IPR018265" /db_xref="InterPro:IPR021137" /db_xref="InterPro:IPR037229" /db_xref="UniProtKB/Swiss-Prot:P66272" /protein_id="SIU00273.1" /translation="MPKAKTHSGASKRFRRTGTGKIVRQKANRRHLLEHKPSTRTRRL DGRTVVAANDTKRVTSLLNG" CDS 1842958..1843347 /codon_start=1 /transl_table=11 /gene="rplT" /locus_tag="BQ2027_MB1670" /product="50S ribosomal protein L20 rplT" /note="Mb1670, rplT, len: 129 aa. Equivalent to Rv1643, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 129 aa overlap). rplT, 50S ribosomal protein L20, similar to several e.g. RL20_ECOLI|P02421 from Escherichia coli (117 aa), FASTA scores: opt: 438, E(): 5.8e-24, (60.3% identity in 116 aa overlap). Contains PS00937 Ribosomal protein L20 signature. Protein product from Mb1670 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1670 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66106" /db_xref="InterPro:IPR005813" /db_xref="InterPro:IPR035566" /db_xref="UniProtKB/Swiss-Prot:P66106" /protein_id="SIU00274.1" /translation="MARVKRAVNAHKKRRSILKASRGYRGQRSRLYRKAKEQQLHSLN YAYRDRRARKGEFRKLWIARINAAARLNDITYNRLIQGLKAAGVEVDRKNLADIAISD PAAFTALVDVARAALPEDVNAPSGEAA" CDS 1843380..1844162 /codon_start=1 /transl_table=11 /gene="tsnR" /locus_tag="BQ2027_MB1671" /product="Possible 23S rRNA methyltransferase tsnR" /note="Mb1671, tsnR, len: 260 aa. Equivalent to Rv1644, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 260 aa overlap). Possible tsnR, 23S rRNA methyltransferase (EC 2.1.1.-), similar to several e.g. TSNR_STRLU|P52393 from Streptomyces laurentii (270 aa), FASTA scores: opt: 276, E(): 3.6e-11, (27.6% identity in 261 aa overlap). Also similar to M. tuberculosis hypothetical proteins Rv0881, Rv3579c, and Rv0380c. Protein product from Mb1671 detected using SWATH mass spectrometry. Mb1671 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYX1" /db_xref="InterPro:IPR001537" /db_xref="InterPro:IPR013123" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="InterPro:IPR029064" /db_xref="UniProtKB/TrEMBL:A0A1R3XYX1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00275.1" /translation="MLTERSARVATAVKLHRHVGRRRAGRFLAEGPNLVAAALARGLV REVFVTEVAARRHELLLAAHEASVHLVTERAAKALSDTVTPAGLVAVCDLPATRLEDV LAGSPQLIAVTVEIREPGNAGTVIRIADAMGAAAVILAGRSVDPYNGKCLRASTGSIF AIPVVVAPDVGAAIADLRAAGLQVLATAVDGEMALDDADRLLAEPTAWLFGPEAHGLS AEIAALADHRVHIPMSGGAESLNVAAAAAICLYESARALGRR" CDS complement(1844173..1845228) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1672C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1672c, -, len: 351 aa. Equivalent to Rv1645c, len: 351 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 351 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O53837|Rv0826|MTV043.18 (351 aa), FASTA scores: (57.5% identity in 299 aa overlap); Q10519|Rv2237|YM37_MYCTU (255 aa), O53682|Rv0276 (306 aa). Mb1672c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR018713" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00276.1" /translation="MTVASRTSADPLGPDSLTWKYFGDLRTGMMGVWIGAIQNMYPEL GAGVEEHSILLREPLQRVARSVYPIMGVVYDGDRAAQTGQQIKGYHRTIKGVDAEGRR YHALNPDTFYWAHATFFMLVIKVAEYFCGGLTEAEKHQLFEEHVRWYRMYGMSMRPVP KSWEDFQDYWDRVCRDKLEINQATVDILQMRIPKPRFVLMPTPIWDQLFKPLIAGQRW IAAGLFDPAVREKAGMHWTPGDEVLLRVFGKVVELAFLAVPDEIRLHPRALAAYRRAA GRTRHDAPLVQAPGFMAPPRDRQGLPMHYFPPRSHRFTRSALDPAKALMERAGALVHS TLSLAGVRPARGPSRAA" CDS 1845538..1846470 /codon_start=1 /transl_table=11 /gene="PE17" /locus_tag="BQ2027_MB1673" /product="pe family protein pe17" /note="Mb1673, PE17, len: 310 aa. Equivalent to Rv1646, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 310 aa overlap). Member of the M. tuberculosis PE family of proteins, similar to many e.g. YW36_MYCTU|Q10873 hypothetical 53.7 kd protein cy39.36c (558 aa), FASTA scores, opt: 411, E(): 1.3e-15, (34.4% identity in 320 aa overlap). Mb1673 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYX8" /protein_id="SIU00277.1" /translation="MSFLTVAPDMVTAAAGNLESVGSALNEAAAAAAPATVGLAAPAA DRVSAVVAAMLGAYARDFQGISAQIAGFHNQFVGALRGGAAAYASAEAANVQQTVVNA VNAPAQALLGHPLIGPETVGSSAAAVSFGFGPLLLAGSDPLLAVPFSYPASLPTPFGP VTMTLNGSFDPLTQQVVFDSGSLTAPAPFVYGLGAVGPALTTMTALQNSGTAFSGAVQ SGNLLGAAGALLQAPGNAVTGFLFGQTAISQSIPGPSNLGYESVGISVPVGGLLAPLQ PVTVTLTPTSGMPTAIQLSGTQFGGLLPALLNGF" CDS 1846548..1847498 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1674" /product="adenylate cyclase (atp pyrophosphate-lyase) (adenylyl cyclase)" /note="Mb1674, -, len: 316 aa. Equivalent to Rv1647, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 316 aa overlap). Conserved hypothetical protein, some similarity to other Mycobacterium tuberculosis hypothetical proteins e.g. Q11055|Rv1264|YC64_MYCTU Hypothetical 42.2 kd protein (397 aa), FASTA scores: opt: 197, E(): 9.4e-06, (27.1% identity in 181 aa overlap) and Q10400|Rv2212|YM12_MYCTU (378 aa). Protein product from Mb1674 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1674 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYY5" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY5" /protein_id="SIU00278.1" /translation="MPGSARTTYPCHVEVGPQDSESGAPDETATAMASPVPRQRSALR WLRTVNRSPGLVSFIHRARRLLPGDPEFGDPLSTAGEGGPRAAARAADRLLRDRDAAS REVGLSVLQVWQALTEAVSRRPANPEVTLVFTDLVGFSTWSLHAGDDATLTLLRQVAR AVESPLLDAGGHIVKRLGDGIMAVFRNPTVALRAVLVAQDAVKSLEVQGYTPRMRIGI HTGRPQRLAADWLGVDVNIAARVMERATKGGIMISQPTLDLIPQSELDALGVVARRVR KPVFASKPTGIPPDLAIYRIKTVSESTAADNFDEMSPDAQ" CDS 1847505..1848311 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1675" /product="Probable transmembrane protein" /note="Mb1675, -, len: 268 aa. Equivalent to Rv1648, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 268 aa overlap). Probable transmembrane protein, some similarity to Rv3434c|MTCY77.06C (237 aa), FASTA scores: E(): 0.00039, (31.4% identity in 194 aa overlap). Mb1675 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYY0" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY0" /protein_id="SIU00279.1" /translation="MIYRVACLLARIRFTVGYVAALASVSTTILMHGPQVHAQVIRHA STNLHNLAHGHLGTLWNSAFVIDEGPLYFWLPCLACLLAVAELQLRSLRLTVAFVVGH IGATLLVAAVLAGAIEIGWLPWSISRVSDVGMSYGALAALGALTAAIPGRWRPAWIGW WVSLGLATATIGGGFTDAGHTVALLLGMLVTACFTRPARWTLGRCALLAVASGFCLVL LAHSWWSLVSGSALGLLGALGAAGFARWTRARATSLPPGALAIPQPALSR" CDS 1848507..1849532 /codon_start=1 /transl_table=11 /gene="pheS" /locus_tag="BQ2027_MB1676" /product="probable phenylalanyl-trna synthetase, alpha chain phes" /note="Mb1676, pheS, len: 341 aa. Equivalent to Rv1649, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 341 aa overlap). pheS, Phenylalanyl-tRNA synthetase alpha chain (EC 6.1.1.20), similar to several e.g. SYFA_ECOLI|P08312 from Escherichia coli (327 aa), FASTA scores: opt: 978, E(): 0, (46.5% identity in 331 aa overlap). Homology suggests this start site, but there is a potential rbs upstream of a gtg 30 bp upstream; contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. PHE-TRNA SYNTHETASE ALPHA CHAIN SUBFAMILY 1. Protein product from Mb1676 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1676 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEV4" /db_xref="InterPro:IPR002319" /db_xref="InterPro:IPR004188" /db_xref="InterPro:IPR004529" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR010978" /db_xref="InterPro:IPR022911" /db_xref="UniProtKB/Swiss-Prot:Q7VEV4" /protein_id="SIU00280.1" /translation="MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALA RQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLATLRAERDAAVLVAEGIDVTLPST RVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEG LAVDRGLSMAHLRGTLDAFARAEFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGAD WVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGIPDMRDMVEGDVRFS LPFGVGA" CDS 1849532..1852027 /codon_start=1 /transl_table=11 /gene="pheT" /locus_tag="BQ2027_MB1677" /product="probable phenylalanyl-trna synthetase, beta chain phet" /note="Mb1677, pheT, len: 831 aa. Equivalent to Rv1650, len: 831 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 831 aa overlap). pheT, Phenylalanyl-tRNA synthetase beta chain (EC 6.1.1.20), similar to several e.g. SYFB_ECOLI|P07395 from Escherichia coli (795 aa), FASTA scores: opt: 995, E(): 0, (31.8% identity in 847 aa overlap). BELONGS TO THE PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN FAMILY - SUBFAMILY 1. Protein product from Mb1677 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1677 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEV3" /db_xref="InterPro:IPR002547" /db_xref="InterPro:IPR004532" /db_xref="InterPro:IPR005121" /db_xref="InterPro:IPR005146" /db_xref="InterPro:IPR005147" /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR020825" /db_xref="InterPro:IPR033714" /db_xref="InterPro:IPR036690" /db_xref="InterPro:IPR041616" /db_xref="UniProtKB/Swiss-Prot:Q7VEV3" /protein_id="SIU00281.1" /translation="MRLPYSWLREVVAVGASGWDVTPGELEQTLLRIGHEVEEVIPLG PVDGPVTVGRVADIEELTGYKKPIRACAVDIGDRQYREIICGATNFAVGDLVVVALPG ATLPGGFTISARKAYGRNSDGMICSAAELNLGADHSGILVLPPGAAEPGADGAGVLGL DDVVFHLAITPDRGYCMSVRGLARELACAYDLDFVDPASNSRVPPLPIEGPVWPLTVQ PETGVRRFALRPVIGIDPAAVSPWWLQRRLLLCGIRATCPAVDVTNYVMLELGHPMHA HDRNRISGTLGVRFARSGETAVTLDGIERKLDTADVLIVDDAATAAIGGVMGAASTEV RADSTDVLLEAAIWDPAAVSRTQRRLHLPSEAARRYERTVDPAISVAALDRCARLLAD IAGGEVSPTLTDWRGDPPCDDWSPPPIRMGVDVPDRIAGVAYPQGTTARRLAQIGAVV THDGDTLTVTPPSWRPDLRQPADLVEEVLRLEGLEVIPSVLPPAPAGRGLTAGQQRRR TIGRSLALSGYVEILPTPFLPAGVFDLWGLEADDSRRMTTRVLNPLEADRPQLATTLL PALLEALVRNVSRGLVDVALFAIAQVVQPTEQTRGVGLIPVDRRPTDDEIAMLDASLP RQPQHVAAVLAGLREPRGPWGPGRPVEAADAFEAVRIIARASRVDVTLRPAQYLPWHP GRCAQVFVGESSVGHAGQLHPAVIERSGLPKGTCAVELNLDAIPCSAPLPAPRVSPYP AVFQDVSLVVAADIPAQAVADAVRAGAGDLLEDIALFDVFTGPQIGEHRKSLTFALRF RAPDRTLTEDDASAARDAAVQSAAERVGAVLRG" CDS complement(1852121..1855177) /codon_start=1 /transl_table=11 /gene="PE_PGRS30" /locus_tag="BQ2027_MB1679C" /product="pe-pgrs family protein pe_pgrs30" /note="Mb1679c, PE_PGRS30, len: 1018 aa. Equivalent to Rv1651c, len: 1011 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 1018 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. Q10637|Y03A_MYCTU hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 1757, E(): 0, (50.8% identity in 714aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame insertion of 21 bp leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1018 aa versus 1011 aa). Mb1679c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XYX7" /protein_id="SIU00282.1" /translation="MSFLLVEPDLVTAAAANLAGIRSALSEAAAAASTPTTALASAGA DEVSAAVSRLFGAYGQQFQALNARAATFHAEFVSLLNGGAAAYTGAEAASVSSMQALL DAVNAPTQTLLGRPLIGNGADGVAGTGSNAGGNGGPGGILYGNGGNGGAGGPDGGAGG NGGAAGLIGNGGAGGAGGVGGAGGAGGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNA LLFGNGGNGGSGASGGAAGHAGTIFGNGGNAGAGSGLAGADGGLFGNGGDGGSSTSKA GGAGGNALFGNGGDGGSSTVAAGGAGGNTLVGNGGAGGAGGTSGLTGSGVAGGAGGSV GLWGSGGAGGDGGAATSLLGVGMNAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLF DSGVGGAGGAGGNASLFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGLLYGN GGNGGAGGIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVGG AGGSGGTALLLGSGGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGLIWGNGGAGG NGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGTGGDHSGGNGIGGKGGAS GNGGNAGQVFGDGGTGGTGGAGGAGSGTKAGGTGSDGGHGGNATLIGNGGDGGAGGAG GAGSPAGAPGNGGTGGTGGVLFGQSGSSGPPGAAALAFPSLSSSVPILGPYEDLIANT VANLASIGNTWLADPAPFLQQYLANQFGYGQLTLTALTDATRDFAIGLAGIPPSLQSA LQALAAGDVSGAVTDVLGAVVKVFVSGVDASDLSNILLLGPVGDLFPILSIPGAMSQN FTNVVMTVTDTTIAFSIDTTNLTGVMTFGLPLAMTLNAVGSPITTAIAFAESTTAFVS AVQAGNLQAAAAALVGAPANVANGFLNGEARLPLALPTSATGGIPVTVEVPVGGILAP LQPFQATAVIPVIGPVTVTLEGTPAGGIVPALVNYAPTQLAQAIAP" CDS 1855371..1856429 /codon_start=1 /transl_table=11 /gene="argC" /locus_tag="BQ2027_MB1680" /product="probable n-acetyl-gamma-glutamyl-phoshate reductase argc" /note="Mb1680, argC, len: 352 aa. Equivalent to Rv1652, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 352 aa overlap). Probable argC, N-acetyl-gamma-glutamyl-phosphate reductase (EC 1.2.1.38), similar to many e.g. ARGC_STRCL|P54896 from Streptomyces clavuligerus (340 aa), FASTA scores: opt: 1119, E(): 0, (56.9% identity in 350 aa overlap); etc. BELONGS TO THE NAGSA DEHYDROGENASE FAMILY. Protein product from Mb1680 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1680 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63563" /db_xref="InterPro:IPR000534" /db_xref="InterPro:IPR000706" /db_xref="InterPro:IPR012280" /db_xref="InterPro:IPR023013" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P63563" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00283.1" /translation="MQNRQVANATKVAVAGASGYAGGEILRLLLGHPAYADGRLRIGA LTAATSAGSTLGEHHPHLTPLAHRVVEPTEAAVLGGHDAVFLALPHGHSAVLAQQLSP ETLIIDCGADFRLTDAAVWERFYGSSHAGSWPYGLPELPGARDQLRGTRRIAVPGCYP TAALLALFPALAADLIEPAVTVVAVSGTSGAGRAATTDLLGAEVIGSARAYNIAGVHR HTPEIAQGLRAVTDRDVSVSFTPVLIPASRGILATCTARTRSPLSQLRAAYEKAYHAE PFIYLMPEGQLPRTGAVIGSNAAHIAVAVDEDAQTFVAIAAIDNLVKGTAGAAVQSMN LALGWPETDGLSVVGVAP" CDS 1856426..1857640 /codon_start=1 /transl_table=11 /gene="argJ" /locus_tag="BQ2027_MB1681" /product="Probable Glutamate n-acetyltransferase argJ" /note="Mb1681, argJ, len: 404 aa. Equivalent to Rv1653, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 404 aa overlap). Probable argJ, Glutamate n-acetyltransferase (EC 2.3.1.35), similar to ARGJ_BACSU|P36843 from Bacillus subtilis (406 aa), fasta scores: opt: 727, E(): 0, (36.3% identity in 410 a a overlap). Protein product from Mb1681 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1681 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63572" /db_xref="InterPro:IPR002813" /db_xref="InterPro:IPR016117" /db_xref="InterPro:IPR042195" /db_xref="UniProtKB/Swiss-Prot:P63572" /protein_id="SIU00284.1" /translation="MTDLAGTTRLLRAQGVTAPAGFRAAGVAAGIKASGALDLALVFN EGPDYAAAGVFTRNQVKAAPVLWTQQVLTTGRLRAVILNSGGANACTGPAGFADTHAT AEAVAAALSDWGTETGAIEVAVCSTGLIGDRLPMDKLLAGVAHVVHEMHGGLVGGDEA AHAIMTTDNVPKQVALHHHDNWTVGGMAKGAGMLAPSLATMLCVLTTDAAAEPAALER ALRRAAAATFDRLDIDGSCSTNDTVLLLSSGASEIPPAQADLDEAVLRVCDDLCAQLQ ADAEGVTKRVTVTVTGAATEDDALVAARQIARDSLVKTALFGSDPNWGRVLAAVGMAP ITLDPDRISVSFNGAAVCVHGVGAPGAREVDLSDADIDITVDLGVGDGQARIRTTDLS HAYVEENSAYSS" CDS 1857637..1858521 /codon_start=1 /transl_table=11 /gene="argB" /locus_tag="BQ2027_MB1682" /product="Probable Acetylglutamate kinase argB" /note="Mb1682, argB, len: 294 aa. Equivalent to Rv1654, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 294 aa overlap). Probable argB, Acetylglutamate kinase (EC 2.7.2.8), similar to ARGB_CORGL|Q59281 (294 aa), FASTA scores: opt: 1209, E(): 0, (64.4% identity in 270 aa overlap). BELONGS TO THE ACETYLGLUTAMATE KINASE FAMILY. Protein product from Mb1682 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1682 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Y7" /db_xref="InterPro:IPR001048" /db_xref="InterPro:IPR001057" /db_xref="InterPro:IPR004662" /db_xref="InterPro:IPR036393" /db_xref="InterPro:IPR037528" /db_xref="InterPro:IPR041727" /db_xref="UniProtKB/Swiss-Prot:P0A4Y7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00285.1" /translation="MSRIEALPTHIKAQVLAEALPWLKQLHGKVVVVKYGGNAMTDDT LRRAFAADMAFLRNCGIHPVVVHGGGPQITAMLRRLGIEGDFKGGFRVTTPEVLDVAR MVLFGQVGRELVNLINAHGPYAVGITGEDAQLFTAVRRSVTVDGVATDIGLVGDVDQV NTAAMLDLVAAGRIPVVSTLAPDADGVVHNINADTAAAAVAEALGAEKLLMLTDIDGL YTRWPDRDSLVSEIDTGTLAQLLPTLESGMVPKVEACLRAVIGGVPSAHIIDGRVTHC VLVELFTDAGTGTKVVRG" CDS 1858518..1859720 /codon_start=1 /transl_table=11 /gene="argD" /locus_tag="BQ2027_MB1683" /product="Probable Acetylornithine aminotransferase argD" /note="Mb1683, argD, len: 400 aa. Equivalent to Rv1655, len: 400 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 400 aa overlap). Probable argD, Acetylornithine aminotransferase (EC 2.6.1.11), similar to ARGD_ECOLI|P18335 (406 aa), FASTA scores: opt: 958, E(): 0, (38.6% identity in 404 aa overlap), contains PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Protein product from Mb1683 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1683 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63569" /db_xref="InterPro:IPR004636" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P63569" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00286.1" /translation="MTGASTTTATMRQRWQAVMMNNYGTPPIALASGDGAVVTDVDGR TYIDLLGGIAVNVLGHRHPAVIEAVTRQMSTLGHTSNLYATEPGIALAEELVALLGAD QRTRVFFCNSGAEANEAAFKLSRLTGRTKLVAAHDAFHGRTMGSLALTGQPAKQTPFA PLPGDVTHVGYGDVDALAAAVDDHTAAVFLEPIMGESGVVVPPAGYLAAARDITARRG ALLVLDEVQTGMGRTGAFFAHQHDGITPDVVTLAKGLGGGLPIGACLAVGPAAELLTP GLHGSTFGGNPVCAAAALAVLRVLASDGLVRRAEVLGKSLRHGIEALGHPLIDHVRGR GLLLGIALTAPHAKDAEATARDAGYLVNAAAPDVIRLAPPLIIAEAQLDGFVAALPAI LDRAVGAP" CDS 1859717..1860640 /codon_start=1 /transl_table=11 /gene="argF" /locus_tag="BQ2027_MB1684" /product="Probable Ornithine carbamoyltransferase, anabolic ArgF" /note="Mb1684, argF, len: 307 aa. Equivalent to Rv1656, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 307 aa overlap). Probable argF, ornithine carbamoyltransferase, anabolic (EC 2.1.3.3) (see citation below), almost identical to OTCA_MYCBO|Q02095 ornithine carbamoyltransferase, anabolic from Mycobacterium bovis (307 aa), FASTA scores: opt: 1980, E(): 0, (99.0% identity in 307 aa overlap); contains PS00097 Aspartate and ornithine carbamoyltransferases signature. BELONGS TO THE ATCASES/OTCASES FAMILY. Protein product from Mb1684 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1684 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5M9" /db_xref="InterPro:IPR002292" /db_xref="InterPro:IPR006130" /db_xref="InterPro:IPR006131" /db_xref="InterPro:IPR006132" /db_xref="InterPro:IPR024904" /db_xref="InterPro:IPR036901" /db_xref="UniProtKB/Swiss-Prot:P0A5M9" /protein_id="SIU00287.1" /translation="MIRHFLRDDDLSPAEQAEVLELAAELKKDPVSRRPLQGPRGVAV IFDKNSTRTRFSFELGIAQLGGHAVVVDSGSTQLGRDETLQDTAKVLSRYVDAIVWRT FGQERLDAMASVATVPVINALSDEFHPCQVLADLQTIAERKGALRGLRLSYFGDGANN MAHSLLLGGVTAGIHVTVAAPEGFLPDPSVRAAAERRAQDTGASVTVTADAHAAAAGA DVLVTDTWTSMGQENDGLDRVKPFRPFQLNSRLLALADSDAIVLHCLPAHRGDEITDA VMDGPASAVWDEAENRLHAQKALLVWLLERS" CDS 1860637..1861149 /codon_start=1 /transl_table=11 /gene="argR" /locus_tag="BQ2027_MB1685" /standard_name="ahrC" /product="Probable Arginine repressor argR (AHRC)" /note="Mb1685, argR, len: 170 aa. Equivalent to Rv1657, len: 170 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 170 aa overlap). Probable argR, Arginine repressor (alternate gene name: ahrC). Similar to AHRC_BACSU|P17893 arginine hydroximate resistance protein from Bacillus subtilis (149 aa), FASTA scores: opt: 283, E(): 1.8e-11, (34.5% identity in 142 aa overlap); and ARGR_ECOLI|P15282 arginine repressor from Escherichia coli (156 aa), FASTA scores: opt: 194, E(): 6.4e-06, (30.8% identity in 146 aa overlap). BELONGS TO THE ARGR FAMILY. Protein product from Mb1685 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1685 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A4Y9" /db_xref="InterPro:IPR001669" /db_xref="InterPro:IPR020899" /db_xref="InterPro:IPR020900" /db_xref="InterPro:IPR036251" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P0A4Y9" /protein_id="SIU00288.1" /translation="MSRAKAAPVAGPEVAANRAGRQARIVAILSSAQVRSQNELAALL AAEGIEVTQATLSRDLEELGAVKLRGADGGTGIYVVPEDGSPVRGVSGGTDRMARLLG ELLVSTDDSGNLAVLRTPPGAAHYLASAIDRAALPQVVGTIAGDDTILVVAREPTTGA QLAGMFENLR" CDS 1861158..1862354 /codon_start=1 /transl_table=11 /gene="argG" /locus_tag="BQ2027_MB1686" /product="Probable Argininosuccinate synthase argG" /note="Mb1686, argG, len: 398 aa. Equivalent to Rv1658, len: 398 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 398 aa overlap). Probable argG, Argininosuccinate synthase (EC 6.3.4.5), similar to ASSY_STRCL|P50986 argininosuccinate synthase from Streptomyces clavuligerus (397 aa), FASTA scores: opt: 1873, E(): 0, (67.8% identity in 397 aa overlap); contains PS00564 Argininosuccinate synthase signature 1, PS00565 Argininosuccinate synthase signature 2. BELONGS TO THE ARGININOSUCCINATE SYNTHASE FAMILY. Protein product from Mb1686 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1686 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63643" /db_xref="InterPro:IPR001518" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR018223" /db_xref="InterPro:IPR023434" /db_xref="InterPro:IPR024074" /db_xref="UniProtKB/Swiss-Prot:P63643" /protein_id="SIU00289.1" /translation="MSERVILAYSGGLDTSVAISWIGKETGREVVAVAIDLGQGGEHM DVIRQRALDCGAVEAVVVDARDEFAEGYCLPTVLNNALYMDRYPLVSAISRPLIVKHL VAAAREHGGGIVAHGCTGKGNDQVRFEVGFASLAPDLEVLAPVRDYAWTREKAIAFAE ENAIPINVTKRSPFSIDQNVWGRAVETGFLEHLWNAPTKDIYAYTEDPTINWGVPDEV IVGFERGVPVSVDGKPVSMLAAIEELNRRAGAQGVGRLDVVEDRLVGIKSREIYEAPG AMVLITAHTELEHVTLERELGRFKRQTDQRWAELVYDGLWYSPLKAALEAFVAKTQEH VSGEVRLVLHGGHIAVNGRRSAESLYDFNLATYDEGDSFDQSAARGFVYVHGLSSKLA ARRDLR" CDS 1862434..1863846 /codon_start=1 /transl_table=11 /gene="argH" /locus_tag="BQ2027_MB1687" /product="Probable Argininosuccinate lyase argH" /note="Mb1687, argH, len: 470 aa. Equivalent to Rv1659, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 470 aa overlap). Probable argH, argininosuccinate lyase (EC 4.3.2.1), similar to ARLY_ECOLI|P11447 argininosuccinate lyase from Escherichia coli (457 aa), FASTA scores: opt: 1091, E(): 0, (42.5% identity in 461 aa overlap); contains PS00017 ATP/GTP-binding site motif A, PS00163 Fumarate lyases signature. BELONGS TO THE LYASE 1 FAMILY. ARGININOSUCCINATE LYASE SUBFAMILY. Protein product from Mb1687 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1687 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Z1" /db_xref="InterPro:IPR000362" /db_xref="InterPro:IPR008948" /db_xref="InterPro:IPR009049" /db_xref="InterPro:IPR020557" /db_xref="InterPro:IPR022761" /db_xref="InterPro:IPR024083" /db_xref="InterPro:IPR029419" /db_xref="UniProtKB/Swiss-Prot:P0A4Z1" /protein_id="SIU00290.1" /translation="MSTNEGSLWGGRFAGGPSDALAALSKSTHFDWVLAPYDLTASRA HTMVLFRAGLLTEEQRDGLLAGLDSLAQDVADGSFGPLVTDEDVHAALERGLIDRVGP DLGGRLRAGRSRNDQVAALFRMWLRDAVRRVATGVLDVVGALAEQAAAHPSAIMPGKT HLQSAQPILLAHHLLAHAHPLLRDLDRIVDFDKRAAVSPYGSGALAGSSLGLDPDAIA ADLGFSAAADNSVDATAARDFAAEAAFVFAMIAVDLSRLAEDIIVWSSTEFGYVTLHD SWSTGSSIMPQKKNPDIAELARGKSGRLIGNLAGLLATLKAQPLAYNRDLQEDKEPVF DSVAQLELLLPAMAGLVASLTFNVQRMAELAPAGYTLATDLAEWLVRQGVPFRSAHEA AGAAVRAAEQRGVGLQELTDDELAAISPELTPQVREVLTIEGSVSARDCRGGTAPGRV AEQLNAIGEAAERLRRQLVR" CDS 1863955..1865016 /codon_start=1 /transl_table=11 /gene="pks10" /locus_tag="BQ2027_MB1688" /product="chalcone synthase pks10" /note="Mb1688, pks10, len: 353 aa. Equivalent to Rv1660, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 353 aa overlap). Possible pks10, chalcone synthase (EC 2.3.1.74), similar to BCSA_BACSU|P54157 putative chalcone synthase from B. subtilis (365 aa), FASTA scores: opt: 701, E(): 0, (33.1% identity in 362 aa overlap). Also similar to M. tuberculosis Rv1665|pks11 polyketide synthase (chalcone synthase); and Rv1372|pks18 polyketide synthase. Other upstream initiation sites are possible but homology suggests this start. Protein product from Mb1688 detected using SWATH mass spectrometry. Mb1688 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEV2" /db_xref="InterPro:IPR001099" /db_xref="InterPro:IPR011141" /db_xref="InterPro:IPR012328" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/Swiss-Prot:Q7VEV2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00291.1" /translation="MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHAS AKVNSRHLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDVLI TATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHDYLRGAPDGVA ALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGVKRAQDIGADGPDILDSRS HLYPDSLRTMGYDVGSAGFELVLSRDLAAVVEQYLGNDVTTFLASHGLSTTDVGAWVT HPGGPKIINAITETLDLSPQALELTWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPG LMIAMGPGFCSELVLLRWH" CDS 1865099..1871479 /codon_start=1 /transl_table=11 /gene="pks7" /locus_tag="BQ2027_MB1689" /product="Probable polyketide synthase pks7" /note="Mb1689, pks7, len: 2126 aa. Equivalent to Rv1661, len: 2126 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 2126 aa overlap). Probable pks7, polyketide synthase, similar to many e.g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 (3567 aa), FASTA scores: E(): 0, (48.8% identity in 2131 aa overlap); also similar to Mycobacterium tuberculosis pks12. Contains PS00606 Beta-ketoacyl synthases active site, PS00012 Phosphopantetheine attachment site. Protein product from Mb1689 detected using SWATH mass spectrometry. Mb1689 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYY8" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR041314" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY8" /protein_id="SIU00292.1" /translation="MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGC RYPGGVDSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFLTD VAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQTGVFAGVFHG SYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAVSVDTACSSSLVALHLAVQ SLRLGECDLALVGGVTVMATPAMFIEFSRQRALSADGRCKAYAGAADGTAFAEGAGVL VLARLADARRLGHPVLALVRGSAVNQDGASNGLATPNGPAQQRVITAALASARLGVAD VDVVEGHGTGTTLGDPIEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIK MVQAMRHGVLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVGADENVRPL DVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGADVVAGRAQTVGKTAFVF PGQGAQWLGMGAQLCATAPVFAEHIHRCERALREHVEWSLLDVLRGAPGAPGLDRVDV VQPALWAVMVSLAELWRSVGVVPDAVIGHSQGEIAAAYVAGALSLWDAAAVVALRSRL LVRLGGAGGMVSLACGQPQAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRC EAEGIRARRIDVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGV NAEYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDRGATGEP IVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRVELPTYAFARQRFWL DGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVVLTGRISVVAAPWLADHAVGPVV LFPGTGFVELALRAGDEVGCSVLQELTLQAPLVLPADGVRVQVVVGGVEQSGTRNVWV YSAAGQADSSPGWTLHAQGVLGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYG YGPAFRGLQALWRRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTML PFSWQGVCLHASGAARVRVRLAPVGRGAVPVELADPQGLPVLSVRQLMVRPVSAAALS RSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVLAAVYRGVHEVLE VLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVWGLVRSAQAEHPGRVVLVDTD GSVAVEDAVGFGARSGEPQLVVRRGRVYAARLAPVAAGLTLPSASAGGWRLVAGGGGT LADVVVAPVAPVELATGQVRVAVGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVG PGVTGLAVGDRVMGLLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSV LAEVAAGQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHISD SRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELGKTDIRDGQTV AERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLARLPVKTFDARCAPAAYRFV SQARHIGKVVLTIPDGPGGQSGLAGGTVVVTGGTGMAGSAVATHLVRRHGVANLVLVS RSGEQADRAAEVAALLREGGAQVAVVSCDVADRDALAALLAGLDPRYPLKGVFHAAGV LDDAVITGLTPDRVDTVLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNY AAANAFLDGLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRIDAADTAVS MSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINAHKAFQDLGFDSLTAVE LRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSRLVTASGSDQQSLSDRVDDITRELV VLLDQPDLSANVKAHLRTRLQTMLTSLTTEDDDIAAATESQLFAILDEELGS" CDS 1871499..1876307 /codon_start=1 /transl_table=11 /gene="pks8" /locus_tag="BQ2027_MB1690" /product="Probable polyketide synthase pks8" /note="Mb1690, pks8, len: 1602 aa. Equivalent to Rv1662, len: 1602 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 1602 aa overlap). Probable pks8, polyketide synthase, similar to many polyketide synthases e.g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 from Saccharopolyspora erythraea (Streptomyces erythraeus) (3567 aa), FASTA scores: opt: 3319, E(): 0, (45.8% identity in 1619 aa overlap). Also similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks7 and pks12. Contains PS00606 Beta-ketoacyl synthases active site and PS01162 Quinone oxidoreductase/zeta-crystallin signature. Note that the similarity extends into the downstream ORF Rv1663 (MTCY275.02), and this could be accounted for by a frameshift, although the sequence has been checked and no discrepancy was found." /db_xref="GOA:A0A1R3Y136" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR002364" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR015083" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:A0A1R3Y136" /protein_id="SIU00293.1" /translation="MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCR YPGGVDSPETLWELVAQGRDAVSDFPADRGWDVYGLFDPDPDACGKMYTRRGTFLEHA GDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSATGVFAGVIHAG YGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVDTACSSSLVALHLAVQSLR SGECDLALAGGVTVMATPAAFVEFSRQRALARDGRCKVYAGAADGTAWSEGAGVLVVE RLVDARRLGHPVLALVRGSAVNQDGASNGLTAPNGPSQQRVIRAALASARLRAVEVDV VEGHGTGTMLGDPIEAQALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQA MRHGVMPKTLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAANPDLDPIDV GWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVAVGRARSVGKTVFVFPG QGAQWVGMGAQLYAELPLFALAFDAVAEELDRHLRLPLRNVLWEGDEALLTSTEFAQP ALFAIEVALATLLQHWGISPDFLIGHSVGEIAAAHLAGVLSLTDAAGLVAARGRLMAE LPAGGVMVVVAASEEEVLPVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRR VHRLAVSHAFHSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWV EHARRPVRFVEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMMRREHPE VSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYWLPPTSAGSADISGV GLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWLADHVIAGVVLLAGAAFVELALR AADQVDCGVVEELTVVTPLVLPTVGGVQLQVVVGVGEMGQRPVSIYSRNAESDSGWVL HARGVLGAKAVAPAADLSVWPPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRE SELFADVAVPDDVDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAG ASRVRARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAAGRGL LEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQSWLAQERAGGLV VLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVDSDGSMDVGDVIGCGEEQLMI RNGTAYAARLAQLRPQPILQLPDTNSGWRLVAGGAGTLEDLTLASCPAKELAPGQVRI EVRALGVNFRDVLVALGIYPGAAELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGS EAVVDARLVVKLPNRWPLTDAAGVPVVFLTAYCALRVLAQVQPGESVLVHAAAGGVGM AAVQLARLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGVDV VLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGLST" CDS 1876307..1877815 /codon_start=1 /transl_table=11 /gene="pks17" /locus_tag="BQ2027_MB1691" /product="Probable polyketide synthase pks17" /note="Mb1691, pks17, len: 502 aa. Equivalent to Rv1663, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 502 aa overlap). Probable pks17, polyketide synthase, similar to other polyketide synthases e g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 (3567 aa) from Saccharopolyspora erythraea (Streptomyces erythraeus), FASTA scores: opt: 1207, E(): 0, (43.9% identity in 531 aa overlap). Also similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks7 and pks1. Note that the similarity extends into the upstream ORF Rv1662 (MTCY275.01) and this could be accounted for by a frameshift, although the sequence has been checked and no discrepancy was found. Contains PS00012 Phosphopantetheine attachment site. Mb1691 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZU8" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU8" /protein_id="SIU00294.1" /translation="MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFL SQARHVGKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGEHT ESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGL TPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDG LAAYRRSRGLAALSVAWGLWEQASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAA LLADRPVVVAARLDRAALAGAELPALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLT PEQRHRELTELVCSNAAIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTL PPTLIFDYPTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDELGP" CDS 1877821..1880874 /codon_start=1 /transl_table=11 /gene="pks9" /locus_tag="BQ2027_MB1692" /product="Probable polyketide synthase pks9" /note="Mb1692, pks9, len: 1017 aa. Equivalent to Rv1664, len: 1017 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1017 aa overlap). Probable pks9, polyketide synthase, similar to OL56_STRAT|Q07017 oleandomycin polyketide synthase, modules 5 and 6 from Streptomyces antibioticus (3519 aa), FASTA scores: opt: 1767, E(): 0, (41.6% identity in 919 aa overlap). Similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks6, pks8, etc. Contains PS00012 Phosphopantetheine attachment site. Mb1692 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYZ5" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/TrEMBL:A0A1R3XYZ5" /protein_id="SIU00295.1" /translation="MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREATGSIDNVADF DADFFNLSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDYAV LTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVAVHLACESVRT GEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDERADGYVPGDGGGLVLLKP VQAALDDGDRIHAIIRGSAVGNAGHSATGLTVPSVAGQVDVIRRAMSGAGVDCHQVHY VEAHGTGTKIGDPIEARALGEIFAARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLA IENAVIPPSLNYVGAPIDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVIL EQGPTQSPEIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSVGKTVFVFP GQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQVMWGADAGLLESTEFAQ PALFVVQVALAALLQDWGVLPDLVMGHSVGEIAAAYVAGALSLVDAARVVAARGRLMQ ALPAGGVMVAVAASEDEVAPLLTEGVCIAAVNAPESVVISGEQAAVGVVVDRLVGLGR RVRRLAVSHAFHSVLMDPMVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYW VEHVRKPVRFFDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRL FAEGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGAIARLQS LAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSMSGVELRNRLQMAIG LPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSDDESIWQLLKNIPIHQLRRTGLL DKLLLLAGQPEESLAGRTVSDEVIDSLSPEALIGLALDEDENDIR" CDS 1881021..1882082 /codon_start=1 /transl_table=11 /gene="pks11" /locus_tag="BQ2027_MB1693" /product="chalcone synthase pks11" /note="Mb1693, pks11, len: 353 aa. Equivalent to Rv1665, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 353 aa overlap). Probable pks11, chalcone synthase (EC 2.3.1.74), some similarity to BCSA_BACSU|P54157 putative chalcone synthase from Bacillus subtilis (365 aa), FASTA scores: opt: 615, E(): 6.2e-32, (33.4% identity in 308 aa overlap); and to many plant chalcone synthases e.g. CHS_VIGUN|P51089 chalcone synthase (EC 2.3.1.74) (388 aa), FASTA scores: opt: 391, E(): 7.8e-18, (27.2% identity in 349 aa overlap). Highly similar to upstream ORF Rv1660|MTCY06H11.25 pks10 (72.7% identity in 308 aa overlap); and Rv1372 pks18. Protein product from Mb1693 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1693 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEU7" /db_xref="InterPro:IPR001099" /db_xref="InterPro:IPR011141" /db_xref="InterPro:IPR012328" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/Swiss-Prot:Q7VEU7" /protein_id="SIU00296.1" /translation="MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAA AKVNGRHLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDMIA TATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRDYLRGAPDDVA VLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGDRRAEQVRAGGPDILDSRS SLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIERYLANDVTTFLDAHRLTKDDIGAWVS HPGGPKVIDAVATSLALPPEALELTWRSLGEIGNLSSASILHILRDTIEKRPPSGSAG LMLAMGPGFCTELVLLRWR" CDS complement(1882065..1883357) /codon_start=1 /transl_table=11 /gene="cyp139" /locus_tag="BQ2027_MB1694C" /product="Probable cytochrome P450 139 CYP139" /note="Mb1694c, cyp139, len: 430 aa. Equivalent to Rv1666c, len: 430 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 430 aa overlap). Probable cyp139, cytochrome P450 (EC 1.14.-.-), similar to many e.g. U38537|APU38537_7 from Anabaena sp. (459 aa), FASTA scores: opt: 516, E(): 1.7e-26, (25.8% identity in 418 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb1694c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1694c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63720" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002403" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63720" /protein_id="SIU00297.1" /translation="MRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFAN ADAFSWSQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYVATMVSNID TVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFLGEQLQPLLDLTRRPPQ VMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADARTAPRPDDHMLTTLISGCSEEGTT LSDNEIRDSIVSLITAGYETTSGALAWAIYALLTVPGTWESAASEVARVLGGRVPAAD DLSALTYLNGVVHETLRLYSPGVISARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPE IWPEPTEFRPLRWDPNAADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVAR AMLQLPAQRTHRIRAANFAALRPWPGLTVEIRKSAPAQ" CDS complement(1883372..1885147) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1695C" /product="probable second part of macrolide-transport atp-binding protein abc transporter" /note="Mb1695c, -, len: 591 aa. Equivalent to Rv1668c and Rv1667c, len: 372 aa and 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 347 aa overlap and 100% identity in 217 aa overlap). Macrolide-transport ATP-binding protein ABC transporter (see citation below), similar to many ATP-binding proteins ABC transporter e.g. X80735|SEABCT_1|Q54072 Saccharopolyspora erythraea ertX gene (481 aa), FASTA scores: opt: 938, E(): 0, (45.6% identity in 353 aa overlap); etc. Similarity to other NBD components of ABC transporters suggests that Rv1667c and Rv1668c should be contiguous. However, sequence has been checked and no error found, also same sequence in Mycobacterium tuberculosis CSU93 and Mycobacterium bovis. Contains PS00211 ABC transporters family signature and two times PS00017 ATP/GTP-binding site motif A. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1668c and Rv1667c exist as 2 genes with a small overlap between them. In Mycobacterium bovis, a 10 bp insertion (*-tcttgccgcg) leads to a single product. Protein product from Mb1695c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1695c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ10" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ10" /protein_id="SIU00298.1" /translation="MAHLLGAEAVHLAYPTQVVFEAVTLGVNDGARIGIVGRNGDGKS SLLGLLTGQLRPDSGRVTRRSGLRVNALSQTDTLDPNRTVGWTLIGDQPEHQWAGNPR IRDVVAGLVSDIAWDTPVSTLSGGQRRRVQLASLLVGEWDVIALDEPTNHLDIQGITW LADHLRRRWARNTGGLLVVTHDRWFLDEVATTTWEVHDGIVEPFEGGYAAYVLQRVER DRLTAAAEAKRQNLLRKELAWLRRGAPARTCKPKFRIEAANQLIADVPPPRNTVELAK LAAARLGKDVVDLLGVSVSYQPSGGRPVLRDIEWRIGPGERIGIVGANGAGKSTLLGL IAGTVQPGVGRVKRGKTVRLAVLDQHGDDLAPFADDRIADVLGRLRGGYQVEGREVTP TQLLERLGFRRDQLSARVDDLSGGQRRRLQLMLTLLSEPNVLLLDEPTNDVDTEMLTA TEDLLDSWAGTLIVVSHDRYLLERVTDQQYAILDDRLRHLPGGIDEYLQLAARVSAPA PAERPAPPAMSGAQRRATEKELAAVDRQLARLADRVAAKHTELAEHDQSDHVGITRLT QQLRVLQDHVAAMENRWLELSEMLE" CDS 1885530..1885892 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1696" /product="HYPOTHETICAL PROTEIN" /note="Mb1696, -, len: 120 aa. Equivalent to Rv1669, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 120 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XZ00" /protein_id="SIU00299.1" /translation="MSRRPGYSNGRAGASRQAARGGSAGASSVAFSSQPNCGLTESVL GHQVTGICLGTIHLDAMQWPWSSAYRLEPAVATTLIGISAWWANGSVKQYAGDLTDRV ATMTVCRRTPAPRVHYRQ" CDS 1885925..1886272 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1697" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1697, -, len: 115 aa. Equivalent to Rv1670, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 115 aa overlap). Conserved hypothetical protein, highly similar to D90908|D90908_87 Hypothetical protein of Synechocystis sp. PCC6803 complete (94 aa), FASTA scores opt: 378, E(): 3.5e-2, (55.2% identity in 96 aa overlap); also shows some similarity to M.tuberculosis hypothetical proteins e.g. C-terminal region of O53404|Rv1056 (254 aa), and P96817|Rv0140 (126 aa). Mb1697 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007361" /db_xref="InterPro:IPR038694" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ03" /protein_id="SIU00300.1" /translation="MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSI CPWKGLAHYYNVVVDGPYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESR HGLARRVVAWLGK" CDS 1886280..1886672 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1698" /product="probable membrane protein" /note="Mb1698, -, len: 130 aa. Equivalent to Rv1671, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 130 aa overlap). Probable membrane protein. Weak similarity to mercuric transport proteins. Mb1698 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYY1" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY1" /protein_id="SIU00301.1" /translation="MPTVGPADHAAGLDRRATPDQLPIWRIGIISGLVGMLCCVGPTI LALVGIISAATAFAWANDLYDNYAWWFRVSGLAVLAILVWWALRHRNRCSVNAIRRLR WRLMAVLAIAVGTYGVLSAVTTWFGTFV" CDS complement(1886681..1888012) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1699C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb1699c, -, len: 443 aa. Equivalent to Rv1672c, len: 443 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 443 aa overlap). Probable conserved integral membrane transport protein, major facilitator superfamily, similar to several phthalate transporters or tartrate transporters e.g. U25634|AVU25634_2 Agrobacterium vitis plasmid pTrAB (433 aa), FASTA scores: opt: 914, E(): 0, (37.1% identity in 426 aa overlap); etc. Protein product from Mb1699c detected using SWATH mass spectrometry. Mb1699c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XYZ8" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XYZ8" /protein_id="SIU00302.1" /translation="MATIAASPTHNALGKAARRLLPLLFVLYVINFVDRANISVAALA MNADLRLSATAYGTAAGVFFLGYVLFQVPANAALARFGAGRTLTAVVLAWGVCSAATA LVTSAHTLYLARFALGVAEGGFFPGVIAYLTVWFPCAQRARAVATFLLAIPVANTVGL PLSGLIVGHVHMAGLPGWRAMFVIEALPALLLAPLLRRLLPDNPQRASWLTPEERAEL SARLTEDTPAPTGRSSGAGWDLVLFAVVYGGLYFALYALQFFLPQLVASLAHGTATLT AATLAALPYGVAALAMLAWSHRSIDRSGAQAGHITLPTTAAGSAALGAALSPMSPIVT LSWLTIAVAGILAAMPAFWSRCTAALAGPRVAVAIATVNAVASLASFAGPYATGHLKD ATGTYHLALLTVAAVLAAAAACSLLLRHAGRTVCANDSEIMLHPSPATPFV" CDS complement(1888105..1889037) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1700C" /product="Transglutaminase-like enzymes, putative cysteine proteases" /note="Mb1700c, -, len: 310 aa. Equivalent to Rv1673c, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 310 aa overlap). Conserved hypothetical protein, shows weak similarity to P44103|YA48_HAEIN Hypothetical protein HI10 48 precursor (369 aa), FASTA scores: E(): 8.3e-11, (26.1% identity in 330 aa overlap)." /db_xref="InterPro:IPR002931" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3Y145" /protein_id="SIU00303.1" /translation="MTITDPAVSAHADATIGLFEITDHITIDSTQGAHTVEMWCPVIG DGAFQRVLDVEVTSEDPYDLTREPEFGNLMLYSRLRLATAASWSIRYVVERRAIGHAP DPARARPLATAQLFSRALIPEAHVDVDERTRTLAQDVVGPETNPLEQARRIYDYVTGA MDYDATKQSFLGSTEHALTCSVGNCNDIHALFVSLCRSVDIPARFVLGQALELPQPGA QDCEVCGYHCWAEFFVAGLGWLPADASCATKYGTHGLFANLQANHIAWSIGRDILLAP PQRAGRSLFFAGPYAEIDGETHPAQRQIRFTAMT" CDS complement(1889065..1889721) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1701C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1701c, -, len: 218 aa. Equivalent to Rv1674c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 218 aa overlap). Probable transcriptional regulatory protein. Highly similar to AJ005575|SPE005575_2 Streptomyces peucetius (226 aa), FASTA scores: opt: 662, E(): 0, (50.0% identity in 208 aa overlap). Similar to Rv0324|Z96800|MTCY63.29 M. tuberculosis cosmid (226 aa), FASTA scores: opt: 579, E(): 0, (45.3% identity in 214 aa overlap). N-terminus is similar to transcriptional activators e.g. MERR_STRLI|P30346 probable mercury resistance operon regulator (125 aa), FASTA scores: opt: 183, E(): 1.9e-06, (35.6% identity in 90 aa overlap). Contains PS00380 Rhodanese signature 1. Protein product from Mb1701c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZU9" /db_xref="InterPro:IPR001307" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU9" /protein_id="SIU00304.1" /translation="MSGAKKLIFEQFALVGQALSSGHRLELLDLLVQGERSVDALARA SGLTFANASQHLLQLRRAGLVTSRRDGKRVIYALSDPQVWDVVRAVRAVAERNLASVG SLVRQYYTDRDSLEPISRDELQARVAAGSVLVLDVRPAMEYAAGHLPGAVSIPLDELA ERLDELPSGIDIVACCRGPYCVYAYDALELLRPNGFSARRLDGGFSEWLAADLPVVRT " CDS complement(1890046..1890780) /codon_start=1 /transl_table=11 /gene="cmr" /locus_tag="BQ2027_MB1702C" /product="probable transcriptional regulatory protein cmr" /note="Mb1702c, -, len: 244 aa. Equivalent to Rv1675c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 244 aa overlap). Probable transcriptional regulatory protein, weak similarity to D00496|LBATRP_7 trp operon from Lactobacillus casei (219 aa), FASTA scores: opt: 172, E(): 0.00011, (26.9% identity in 186 aa overlap)." /db_xref="GOA:A0A1R3XZ04" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR012318" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR018490" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ04" /protein_id="SIU00305.1" /translation="MADRSVRPLRHLVHAVTGGQPPSEAQVRQAAWIARCVGRGGSAP LHRDDVSALAETLQAKEFAPGAVVFHADQTADGVWIVRHGLIELAVGSRRRRAVVNIL HPGDVDGDIPLLLEMPMVYTGRALTQATCLFLDRQAFERLLATHPAIARRWLSSVAQR VSTAQIRLMGMLGRPLPAQVAQLLLDEAIDARIELAQRTLAAMLGAQRPSINKILKEF ERDRLITVGYAVIEITDQHGLRARAQ" CDS 1890852..1891556 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1703" /product="Peroxiredoxin" /note="Mb1703, -, len: 234 aa. Equivalent to Rv1676, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 234 aa overlap). Hypothetical unknown protein. Protein product from Mb1703 detected using shotgun mass spectrometry. Mb1703 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZC9" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC9" /protein_id="SIU00306.1" /translation="MACPEWEISRSKRTRKPVLRPRHSVSTLTNRFLAEFCHRYGIGV PTRLARGATVPTRRLQDINDQPVDVPAATGRTHLQFRRFAACPICHLHLRSFANRHQE VADSGITEVVFFHSAADALRGYQSLLPFAVIADPDRVQYREFGVEKSLGAITHPRALW AAVRGSAAMLHRNDPERAGVGFGDGTTHLGLPADFLLDADGTVAAVHYGRHADDQWSV DQLIDINRSLGGKGTQ" CDS 1891553..1892101 /codon_start=1 /transl_table=11 /gene="dsbF" /locus_tag="BQ2027_MB1704" /product="PROBABLE CONSERVED LIPOPROTEIN DSBF" /note="Mb1704, dsbF, len: 182 aa. Equivalent to Rv1677, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 182 aa overlap). Probable dsbF, conserved lipoprotein possibly involved in thiol:disulfide interchange. Highly similar to C-terminus of Z74024|MTCY274.09 mpt53 soluble secreted antigen precursor from Mycobacterium tuberculosis (173 aa), FASTA scores: opt: 482, E(): 3.6e-23, (52.8% identity in 142 aa overlap) . Also some similarity to P52237|TIPB_PSEFL THIOL:DISULFIDE INTERCHANGE PROTEIN TIPB PRECURSOR from Pseudomonas fluorescens (178 aa), FASTA scores: opt: 190, E(): 4.4e-05, (28.5% identity in 151 aa overlap); and P33926|DSBE_ECOLI THIOL:DISULFIDE INTERCHANGE PROTEIN from Escherichia coli (185 aa), FASTA scores: opt: 194, E(): 2.6e-05, (29.1% identity in 175 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site and PS00194 Thioredoxin family active site. Protein product from Mb1704 detected using shotgun mass spectrometry. Mb1704 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ07" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR017937" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ07" /protein_id="SIU00307.1" /translation="MTHSRLIGALTVVAIIVTACGSQPKSQPAVAPTGDAAAATQVPA GQTVPAQLQFSAKTLDGHDFHGESLLGKPAVLWFWAPWCPTCQGEAPVVGQVAASHPE VTFVGVAGLDQVPAMQEFVNKYPVKTFTQLADTDGSVWANFGVTQQPAYAFVDPHGNV DVVRGRMSQDELTRRVTALTSR" CDS 1892202..1893104 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1705" /product="probable integral membrane protein" /note="Mb1705, -, len: 300 aa. Equivalent to Rv1678, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 300 aa overlap). Probable integral membrane protein. Protein product from Mb1705 detected using SWATH mass spectrometry. Mb1705 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ23" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ23" /protein_id="SIU00308.1" /translation="MARVRRGTELLLSPQSPPATGGLIVLTGLRLLAGLIWLYNVVWK VPPDFGERGRRDLYHFTHLAVEHPVFTPFSWVIEHAVLPYFTAFGWGVLFAESALAVL LLTGTAVRLAALIGIGQSVAIGLSVAESPGEWPWAYAMLLGIHVVLLFTCSTRYAAVD AVRAAATGSAARTAAQRLLAGWGIVLGLIGLVAVWRGLGDDRPAYVGIRALEFSLGEY NLRGALALIAIALAMLAAAKRGWRTVALVAAVVAVAAAAAIYLQVGRTAVWLGGTNTT AAVFVCAAVVSLATEFRIGRVEGA" CDS 1893104..1894225 /codon_start=1 /transl_table=11 /gene="fadE16" /locus_tag="BQ2027_MB1706" /product="POSSIBLE ACYL-COA DEHYDROGENASE FADE16" /note="Mb1706, fadE16, len: 373 aa. Equivalent to Rv1679, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 373 aa overlap). Possible fadE16, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to acyl/butyryl-CoA dehydrogenases e.g. NP_244665.1|NC_002570 acyl-CoA dehydrogenase from Bacillus halodurans (380 aa); NP_000008.1|NM_000017 acyl-Coenzyme A dehydrogenase from Homo sapiens (412 aa); Z99113|BSUB0010_119 from Bacillus subtilis (380 aa), FASTA scores: opt: 439, E(): 3.4e-20, (29.6% identity in 287 aa overlap); etc. Weakly similar to many dehydrogenases and to P31571|CAIA_ECOLI probable carnitine operon oxidoreductase from Escherichia coli (380 aa), FASTA scores: opt: 109, E(): 0.0066, (28.6% identity in 98 aa overlap). Protein product from Mb1706 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1706 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ09" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ09" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00309.1" /translation="MATPGVVQEVVSVAAEHAERVDTDCAFPAEAVDALRKTGLLGLV LPREIGGMGSGPVEFTEVVAQLSAACGSTAMIYLMHMAAAVTVAASPPPGLPDLLADM ASGKQLGTLAFSEPGSRSHFWAPVSTASADGDGIAVRADKSWVTSAGFADVYVVSVGS ADGAAGDVDLYAVPADTPGLRVAGTFTGMGLRGNASAPMAVDIRIPDSYRLGEAGGGF GIMMQTVLPWFNLGNAAVSLGLATAATGAAVKHVGTARLEHLGGSLAELPTIRAQIAR MGTTLAAQKAYLEVAANSVSSPDDTTLTHVLGVKASVNDAALTITESAMRVCGGAAFS KHLPIERAFRDARAGSVMAPTADALYDFYGRAVTGLPLF" CDS 1894234..1895058 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1707" /product="ABC transporter, substrate-binding protein (cluster 12, methionine/phosphonates)" /note="Mb1707, -, len: 274 aa. Equivalent to Rv1680, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 274 aa overlap). Hypothetical unknown protein. Protein product from Mb1707 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1707 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZ13" /protein_id="SIU00310.1" /translation="MSTEPLVVGAVAYTPNVVPIWEGIRGYFQDSESPDTQMDFVLYS NYARLVDSLIAGHIDIAWNTNLAYVRTVLQTGGRCTPLAQRDTDVDYTTVFVAHAGSD LHGAKDIAGKRLALGSADSAHAAILPLYYLRRAGIAESDLQVIRFDTDIGKHGDTGRS ELDAVDAVLAGEADVAAIGSSTWAAMGAAELMGESLTEVWRTDGYCHCMFTALDTLPA ERYQPWLDRLLGMSWDDSEHRKILELEGLRRWVPPHLDGYKPLFEAVQEQGIDPRW" CDS 1895055..1896047 /codon_start=1 /transl_table=11 /gene="moeX" /locus_tag="BQ2027_MB1708" /product="POSSIBLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEX" /note="Mb1708, moeX, len: 330 aa. Equivalent to Rv1681, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 330 aa overlap). Possible moeX, Molybdopterin biosynthesis protein, has weak similarity to MOAA_ECOLI|P30745 molybdenum cofactor biosynthesis protein (329 aa), FASTA scores: opt: 162, E(): 0.00081, (27.7% identity in 224 aa overlap) and to Rv3109|MTCY164.19 MoaA from Mycobacterium tuberculosis (28.5% identity in 165 aa overlap). Mb1708 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XYY9" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3XYY9" /protein_id="SIU00311.1" /translation="MIIELMRRVVGLAQGATAEVAVYGDRDRDLAERWCANTGNTLVR ADVDQTGVGTLVVRRGHPPDPASVLGPDRLPGVRLWLYTNFHCNLCCDYCCVSSSPST PHRELGAERIGRIVGEAARWGVRELFLTGGEPFLLPDIDTIIATCVKQLPTTVLTNGM VFKGRGRRALESLPRGLALQISLDSATPELHDAHRGAGTWVKAVAGIRLALSLGFRVR VAATVASPAPGELTAFHDFLDGLGIAPGDQLVRPIALEGAASQGVALTRESLVPEVTV TADGVYWHPVAATDERALVTRTVEPLTPALDMVSRLFAEQWTRAAEEAALFPCA" CDS 1896208..1897125 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1709" /product="Probable coiled-coil structural protein" /note="Mb1709, -, len: 305 aa. Equivalent to Rv1682, len: 305 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 305 aa overlap). Probable coiled-coil structural protein, weakly similar to many paramyosins, kinesins and plectins e.g. MYSP_ONCVO|Q02171 paramyosin from onchocerca volvulus (879 aa), fasta scores: opt: 180, E():2.6e-08, (24.4% identity in 234 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical coiled-coil proteins (wag31 antigen 84) Rv2145c and Rv2927c. Mb1709 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007793" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ06" /protein_id="SIU00312.1" /translation="MLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDP VAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRL AVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRR AREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQ ECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLEN IPALLESARHSETEPLQSINGAVAELRAI" CDS 1897343..1900342 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1710" /product="possible bifunctional enzyme; long-chain acyl-coa synthase and lipase." /note="Mb1710, -, len: 999 aa. Equivalent to Rv1683, len: 999 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 999 aa overlap). Possible long-chain acyl-CoA synthase. Equivalent to Z95117|MLCB1351_21 possible long-chain acyl-CoA synthase from Mycobacterium leprae (1002 aa) (85.6% identity in 1002 aa overlap). Weakly similar to FATP_MOUSE|Q60714 long-chain fatty acid transport protein (646 aa), fasta scores: opt: 331, E(): 5e-08, (24.8% identity in 630 aa overlap). Also similar to O35488|AF033031 Mouse VERY-LONG-CHAIN ACYL-COA SYNTHETASE (620 aa), fasta scores: opt: 435, E(): 2.2e-12, (24.8% identity in 545 aa overlap). Weakly similar to M. tuberculosis protein MTCI364.18 (27.4% identity in 583 aa overlap) . Contains PS00120 Lipases, serine active site. Protein product from Mb1710 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1710 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y153" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y153" /protein_id="SIU00313.1" /translation="MVDLNFSMVTRPIERLVATAQNGLEVLRLGGLETGSVPSPSQIV ESVPMYKLRRYFPPDNRPGQPPVGPPVLMVHPMMMSADMWDVTREDGAVGILHASGLD PWVIDFGSPDEVEGGMRRNLADHIVALSEAVDTVKDATGHDVHFVGYSQGGMFCYQAA AYRRSKDIASVVAFGSPVDTLAALPMGIPANMGAAVADFMADHVFNRLDIPSWMARMG FQMMDPLKTAKARVDFVRQLHDREALLPREQQRRFLESEGWIAWSGPAISELLKQFIA HNRMMTGGFAISGQMVTLTDITCPILAFVGEVDDIGQPASVRGIRRAAPNSEVYECLI RAGHFGLVVGSRAAQQSWPTVADWVRWISGDGTKPENIHLMADQPAEHTDSGVAFSSR VAHGIGEVSEAALALARGAADAVVAANRSVRTLAVETVRTLPRLARLGQLNDHTRISL GRIIDEQAHDAPKGEFLLFDGRVHTYEAVNRRINNVVRGLIAVGVRQGDRVGVLMETR PSALVAIAALSRLGAVAVVMRPDTDLSASVRLGRVTEILTDPTNLDAARQLPGQVLVL GGGESRDLDLPADALEQGQVIDMEKIDPDAVELPAWYRPNPGLARDLAFIAFSSADGD LVAKQITNYRWAVSAFGTASTAALGRRDTVYCLTPLHHESALLVSLGGAVVGGTRIAL SRGLRPDRFVAEVRQYGVTVVSYTWAMLRDVVDDPAFVLHGNHPVRLFIGSGMPTGLW ERVVEAFAPAHVVEFFATTDGQAVLANVAGAKIGSKGRPLPGAGRVELGAYDAEHDLI LENDRGFVQVAGVNQVGVLLAQSRGPIDPTASVKRGVFAPADTWISTDYLFWRDDDGD YWLAGGRGSVVRTARGMVYTEPVTNALGLITGVDLAVTYGVLVRGRHVAVSAVTLLPG ATITAADLTEAVASMPVGLGPDIVHVVPQLTLSGTYRPTVSALRANGIPKAGRQAWYF NSGGNEYRRLTPAVRTELTGQHRRGNA" CDS 1900335..1900559 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1710A" /product="FIG002473: Protein YcaR in KDO2-Lipid A biosynthesis cluster" /note="Mb1710A, -, len: 74 aa. Equivalent to Rv1684, len: 74 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 74 aa overlap). Conserved hypothetical protein, similar to P75844|YCAR_ECOLI Protein YCAR from Escherichia coli (60 aa), FASTA scores: opt: 108, E(): 0.00022, (39.0% identity in 59 aa overlap). Protein product from Mb1710A detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1710A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR005651" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV4" /protein_id="SIU00314.1" /translation="MLDEALLAILVCPADRGPLVLVEDGDIQVLYNPRLRRAYRIEDG IPVLLVDEAREVDEDEHARLMARGRPAAPQ" CDS complement(1900525..1901148) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1711C" /product="Transcriptional regulator, AcrR family" /note="Mb1711c, -, len: 207 aa. Equivalent to Rv1685c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 207 aa overlap). Conserved hypothetical protein, some similarity to other Mycobacterium tuberculosis hypothetical regulatory proteins e.g. Q10774|Rv1556|YF56_MYCTU (202 aa), FASTA scores: opt: 111, E(): 1.7e-05, (24.1% identity in 195 aa overlap); and P95215|Rv0258c|MTCY06A4.02c (151 aa) FASTA scores: (32.9% identity in 140 aa overlap); also similar to Q9X8G9|SCE7.13C|AL049819 putative Streptomyces coelicolor transcriptional regulator (204 aa), FASTA scores: opt: 480, E(): 6.4e-25, (40.4% identity in 203 aa overlap). Protein product from Mb1711c detected using shotgun mass spectrometry." /db_xref="GOA:A0A1R3XZ14" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="InterPro:IPR041678" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ14" /protein_id="SIU00315.1" /translation="MAAPDNSRRRPGRPAGSSDTRERILSSARELFAHNGIDRTSIRA VAAKAGVDAALVHHYFGTKQQLFAAAIHIPIDPMVIIGPIREAPVEELGYKLPSLLLP IWDSELGAGLIATLRSLISGSDVGLARSFLEEVVTVELGSRVDNPPGTGKIRTQFVAS QLMGVVMARYIVRIEPFASLPAEQIVQTIAPNLQRYLTGELPDDLAP" CDS complement(1901150..1901830) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1712C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb1712c, -, len: 226 aa. Equivalent to Rv1686c, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 226 aa overlap). Probable conserved integral membrane protein ABC transporter (see citation below), similar to AL049819|SCE7.05 putative integral membrane protein from Streptomyces coelicolor (266 aa), FASTA sacores: opt: 661, E(): 0, (45.1% identity in 226 aa overlap); and Q53627|U43537 MEMBRANE PROTEIN INVOLVED IN MITHRAMYCIN RESISTANCE from STREPTOMYCES ARGILLACEUS (233 aa), FASTA scores: opt: 222, E(): 5.4e-10, (28.7% identity in 216 aa overlap)." /db_xref="GOA:A0A1R3XZE1" /db_xref="InterPro:IPR000412" /db_xref="InterPro:IPR004377" /db_xref="InterPro:IPR013525" /db_xref="UniProtKB/TrEMBL:A0A1R3XZE1" /protein_id="SIU00316.1" /translation="MILLVPILIITLMYFMFENFPHRPGTPSGFNTACLVLLGLFPLF VMFVITAITMQRERASGTLERILTTPLRRLDLLAGYGTAFSIAAAAQATLACIVAFWF LGFDTAGSPVWVFAIAIVNAVLGVGLGLLCSAFARTEFQAVQFIPLVMVPQLLLAGII VPRALMPTWLEWISNVMPASYALEALQQVGAHPELTGIAVRDVVVVLSFAVASLCLAA VTLRRRTS" CDS complement(1901902..1902669) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1713C" /product="PROBABLE CONSERVED ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1713c, -, len: 255 aa. Equivalent to Rv1687c, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 255 aa overlap). Probable conserved ATP-binding protein ABC transporter (see citation below), similar to many ABC-type transporters e.g. P55476|NODI_RHISN nodulation ATP-binding protein I from Rhizobium sp. (343 aa), FASTA scores: opt: 479, E(): 3.7e-23, (34.6% identity in 243 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis ABC-type transporters e.g. MTCY19H9.04 (34.5% identity in 238 aa overlap). Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Also contains PS00039 DEAD-box subfamily ATP-dependent helicases signature, though this may be spurious. Protein product from Mb1713c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZ19" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ19" /protein_id="SIU00317.1" /translation="MMISSSDELLRDGADPAVIIDQLRVIRGKRLALQDVSVRVACGT ITGLLGPSGSGKTTLIRCIVGSQIIASGSVSVLGQPAGSAELRHRVGYMPQDPTIYND LRVIDNIRYFAELCGVDRQAADEVIEAVDLRDHRTARCANLSGGQRARVSLACALVGR PDLLVLDEPTIGLDPVLRVELWDRFTALARRGTTLLVSSHVMDEADRCGDLLLLRQGQ LLAHTTPHRLRKETGCTSLEEAFLSIVRRTTTVPAAG" CDS 1902728..1903339 /codon_start=1 /transl_table=11 /gene="mpg" /locus_tag="BQ2027_MB1714" /product="possible 3-methyladenine dna glycosylase mpg" /note="Mb1714, -, len: 203 aa. Equivalent to Rv1688, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 203 aa overlap). Possible 3-methyladenine DNA glycosylase (EC 3.2.2.-), similar to several eukaryotic 3-methylpurine DNA glycosylases and 3-methyladenine DNA glycosylases e.g. Q39147|X76169 3-METHYLADENINE GLYCOSYLASE from Arabidobsis thaliana (254 aa), FASTA scores: opt: 297, E(): 8.3e-15, (31.8% identity in 198 aa overlap) and P29372|3MG_HUMAN dna-3-methyladenine glycosidase (298 aa), FASTA scores: opt: 220, E(): 7.2e-05, (36.4% identity in 184 aa overlap). BELONGS TO THE MPG FAMILY OF DNA GLYCOSYLASES. Protein product from Mb1714 detected using SWATH mass spectrometry." /db_xref="GOA:P65413" /db_xref="InterPro:IPR003180" /db_xref="InterPro:IPR011034" /db_xref="InterPro:IPR036995" /db_xref="UniProtKB/Swiss-Prot:P65413" /protein_id="SIU00318.1" /translation="MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGP WPDAAAHSYRGRNGRNDVMFGPPGRLYTYRSHGIHVCANVACGPDGTAAAVLLRAAAI EDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSPVRLRLNDTHR ARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARGASD" CDS 1903351..1904625 /codon_start=1 /transl_table=11 /gene="tyrS" /locus_tag="BQ2027_MB1715" /product="Probable Tyrosyl-tRNA synthase tyrS (TYRRS)" /note="Mb1715, tyrS, len: 424 aa. Equivalent to Rv1689, len: 424 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 424 aa overlap). Probable tyrS, Tyrosyl-tRNA synthase (EC 6.1.1.1), highly similar to many e.g. SYY_ECOLI|P00951 Escherichia coli (EC 6.1.1.1) (423 aa), FASTA scores: opt: 1271, E(): 0, (47.3% identity in 419 aa overlap). Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb1715 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1715 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67612" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002305" /db_xref="InterPro:IPR002307" /db_xref="InterPro:IPR002942" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR024088" /db_xref="InterPro:IPR024107" /db_xref="InterPro:IPR036986" /db_xref="UniProtKB/Swiss-Prot:P67612" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00319.1" /translation="MSGMILDELSWRGLIAQSTDLDTLAAEAQRGPMTVYAGFDPTAP SLHAGHLVPLLTLRRFQRAGHRPIVLAGGATGMIGDPRDVGERSLNEADTVAEWTERI RGQLERFVDFDDSPMGAIVENNLEWTGSLSAIEFLRDIGKHFSVNVMLARDTIRRRLA GEGISYTEFSYLLLQANDYVELHRRHGCTLQIGGADQWGNIIAGVRLVRQKLGATVHA LTVPLVTAADGTKFGKSTGGGSLWLDPQMTSPYAWYQYFVNTADADVIRYLRWFTFLS ADELAELEQATAQRPQQRAAQRRLASELTVLVHGEAATAAVEHASRALFGRGELARLD EATLAAALRETTVAELKPGSPDGIVDLLVASGLSASKGAARRTIHEGGVSVNNIRVDN EEWVPQSSDFLHGRWLVLRRGKRSIAGVERIG" CDS 1905242..1905661 /codon_start=1 /transl_table=11 /gene="lprJ" /locus_tag="BQ2027_MB1716" /product="PROBABLE LIPOPROTEIN LPRJ" /note="Mb1716, lprJ, len: 139 aa. Equivalent to Rv1690, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 124 aa overlap). Probable lprJ, lipoprotein, contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Weakly similar to other Mycobacterium tuberculosis hypothetical proteins with conserved cysteines e.g. Rv1804c, Rv1810, Rv3354, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 2 bp insertion (*-ac) at the 5' end, leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (139 aa versus 127 aa). Mb1716 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ22" /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ22" /protein_id="SIU00320.1" /translation="MTQVRQQREAGDDGTHTHDGTRTWRTGRQATTLLALLAGVFGGA ASCAAPIQADMMGNAFLTALTNAGIAYDQPATTVALGRSVCPMVVAPGGTFESITSRM AEINGMSRDMASTFTIVAIGTYCPAVIAPLMPNRLQA" CDS 1905700..1906452 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1717" /product="TPR-repeat-containing protein" /note="Mb1717, -, len: 250 aa. Equivalent to Rv1691, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 250 aa overlap). Conserved hypothetical protein, similar to Q9S210|SCI51.30C|AL109848 Hypothetical protein from Streptomyces coelicolor (210 aa), FASTA score: opt: 556, E(): 6.4e-27, (50.6% identity in 180 aa overlap). Protein product from Mb1717 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1717 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011990" /db_xref="UniProtKB/TrEMBL:A0A1R3XYZ9" /protein_id="SIU00321.1" /translation="MVDDRQGRRGGRRPRSAAADNRPAFRDGPAIPPGIHARQLAPEI RRELSTLDRATADAVACHLVAAGELIDDDPEAALRHARAARVRASRIAAVREAVGIAA YRCGDWAQALAELRAARRMGSKSPLLALIADCERGLGRPQRAIELARGSEAVELSGDA ADELRIVAAGARADLGQLEQALTVLSTPQLDPGRTGSTAARLFYAYAEILLALGRGDE ALQWFLRSAAADIDGVTDAEDRVDELGAREQK" CDS 1906449..1907510 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1718" /product="PROBABLE PHOSPHATASE" /note="Mb1718, -, len: 353 aa. Equivalent to Rv1692, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 353 aa overlap). Probable phosphatase (EC 3.1.-.-), some similarity to others e.g. PNPP_SCHPO|Q00472 4-nitrophenylphosphatase (269 aa), FASTA scores: opt: 214, E(): 1.3e-10, (29.5% identity in 241 aa overlap); and to NAGD_ECOLI|P15302 nagd protein from Escherichia coli (250 aa), FASTA scores: opt: 314, E(): 9.8e-08, (28.2% identity in 245 aa overlap). Also similar to AL109848|SCI51.28 hypothetical protein from Streptomyces coelicolor (343 aa), FASTA scores: opt: 768, E(): 0, (44.8% identity in 315 aa overlap). Protein product from Mb1718 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1718 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006357" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="InterPro:IPR041065" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ15" /protein_id="SIU00322.1" /translation="MKSIAQEHDCLLIDLDGTVFCGRQPTGGAVQSLSQVRSRKLFVT NNASRSADEVAAHLCELGFTATGEDVVTSAQSAAHLLAGQLAPGARVLIVGTEALANE VAAVGLRPVRRFEDRPDAVVQGLSMTTGWSDLAEAALAIRAGALWVAANVDPTLPTER GLLPGNGSMVAALRTATGMDPRVAGKPAPALMTEAVARGDFRAALVVGDRLDTDIEGA NAAGLPSLMVLTGVNSAWDAVYAEPVRRPTYIGHDLRSLHQDSKLLAVAPQPGWQIDV GGGAVTVCANGDVDDLEFIDDGLSIVRAVASAVWEARAADLHQRPLRIEAGDERARAA LQRWSLMRSDHPVTSVGTQ" CDS 1907507..1907683 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1719" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1719, -, len: 58 aa. Equivalent to Rv1693, len: 58 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 58 aa overlap). Conserved hypothetical protein, shows some similarity to AL583921 hypothetical protein from Mycobacterium leprae (61 aa). Probable coiled-coil from aa 30 to 58. Protein product from Mb1719 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1719 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y163" /protein_id="SIU00323.1" /translation="MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEA HEVLLAALESAEKG" CDS 1907691..1908497 /codon_start=1 /transl_table=11 /gene="tlyA" /locus_tag="BQ2027_MB1720" /product="2'-o-methyltransferase tlya" /note="Mb1720, tlyA, len: 268 aa. Equivalent to Rv1694, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 268 aa overlap). tlyA, cytotoxin/haemolysin homologue (see citations below), almost identical to NP_301968.1|NC_002677 cytotoxin/haemolysin homologue TlyA from Mycobacterium leprae (269 aa). TlyA homologues were also identified by PCR in M. avium, M. bovis BCG, but appeared absent in M. smegmatis, M. vaccae, M. kansasii, M. chelonae and M. phlei (see first citation below). Also highly similar to CAB83047.1|AJ271681 putative haemolysin from Mycobacterium ulcerans (281 aa); and similar to HLYA_TREHY|Q06803 pore-forming haemolysin/cytotoxin virulence determinant from Treponema hyodysenteriae (240 aa), FASTA scores: opt: 514, E():3e-30, (37.3% identity in 236 aa overlap). Mb1720 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZV6" /db_xref="InterPro:IPR002877" /db_xref="InterPro:IPR002942" /db_xref="InterPro:IPR004538" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR036986" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00324.1" /translation="MARRARVDAELVRRGLARSRQQAAELIGAGKVRIDGLPAVKPAT AVSDTTALTVVTDSERAWVSRGAHKLVGALEAFAIAVAGRRCLDAGASTGGFTEVLLD RGAAHVVAADVGYGQLAWSLRNDPRVVVLERTNARGLTPEAIGGRVDLVVADLSFISL ATVLPALVGCASRDADIVPLVKPQFEVGKGQVGPGGVVHDPQLRARSVLAVARRAQEL GWHSVGVKASPLPGPSGNVEYFLWLRTQTDRALSAKGLEDAVHRAISEGP" CDS 1908497..1909420 /codon_start=1 /transl_table=11 /gene="ppnK" /locus_tag="BQ2027_MB1721" /product="inorganic polyphosphate/atp-nad kinase ppnk (poly(p)/atp nad kinase)" /note="Mb1721, ppnK, len: 307 aa. Equivalent to Rv1695, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 307 aa overlap). Probable ppnK, inorganic polyphosphate/ATP-NAD kinase (EC 2.7.1.23), equivalent to Q49897|MLC1351.13C|Z95117|PPNK_MYCLE INORGANIC POLYPHOSPHATE/ATP-NAD KINASE from Mycobacterium leprae (311 aa) (87.9% identity in 305 aa overlap). Also similar to many e.g. P37768|PPNK_ECOLI PROBABLE INORGANIC POLYPHOSPHATE/ATP-NAD KINASE (292 aa), FASTA scores: opt: 384, E(): 1.7e-23, (33.5% identity in 233 aa overlap); etc. BELONGS TO THE NAD KINASE FAMILY. Protein product from Mb1721 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1721 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5S7" /db_xref="InterPro:IPR002504" /db_xref="InterPro:IPR016064" /db_xref="InterPro:IPR017437" /db_xref="InterPro:IPR017438" /db_xref="UniProtKB/Swiss-Prot:P0A5S7" /protein_id="SIU00325.1" /translation="MTAHRSVLLVVHTGRDEATETARRVEKVLGDNKIALRVLSAEAV DRGSLHLAPDDMRAMGVEIEVVDADQHAADGCELVLVLGGDGTFLRAAELARNASIPV LGVNLGRIGFLAEAEAEAIDAVLEHVVAQDYRVEDRLTLDVVVRQGGRIVNRGWALNE VSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGSTAYAFSAGGPVLWPDLEAI LVVPNNAHALFGRPMVTSPEATIAIEIEADGHDALVFCDGRREMLIPAGSRLEVTRCV TSVKWARLDSAPFTDRLVRKFRLPVTGWRGK" CDS 1909434..1911197 /codon_start=1 /transl_table=11 /gene="recN" /locus_tag="BQ2027_MB1722" /product="Probable DNA repair protein recN (Recombination protein N)" /note="Mb1722, recN, len: 587 aa. Equivalent to Rv1696, len: 587 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 587 aa overlap). Probable recN, DNA repair protein, similar to many e.g. RECN_ECOLI|P05824 dna repair protein recN (553 aa), FASTA scores: opt: 508, E(): 1.9e-33, (31.5% identity in 587 aa overlap). Equivalent to Z95117|MLCB1351_12 recN from Mycobacterium leprae (587 aa), FASTA scores: (76.1% identit y in 589 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb1722 detected using SWATH mass spectrometry. Mb1722 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5U7" /db_xref="InterPro:IPR003395" /db_xref="InterPro:IPR004604" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P0A5U7" /protein_id="SIU00326.1" /translation="MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHL LGGARADATRVRSGADRAVVEGRFTTTDLDDATVAGLQAVLDSSGAERDEDGSVIALR SISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQRGALDRFAAAG EAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFALNEIDTVDPQPGEDVALV ADIARLSELDTLREAATTARATLCGTPDADAFDRGAVDSLGRARAALQSSDDAALRGL AEQVGEALTVVVDAVAELGAYLDELPADASALDAKLARQAQLRTLTRKYAADIDGVLR WADEARARLAQLDVSEEGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAE LSALAMADAEFTIGVTTELADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTV LPLAKSASGGELSRVMLSLEVVLATSRKQAAGTTMVFDEIDAGVGGWAAVQIGRRLAR LARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTSEDRVAELARMLAGLGD SDSGRAHARELLETAQNDELT" CDS 1911293..1912474 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1723" /product="FIG005773: conserved membrane protein ML1361" /note="Mb1723, -, len: 393 aa. Equivalent to Rv1697, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 393 aa overlap). Conserved hypothetical protein, highly similar to Q49895|MLC1351.11C|U00021 Hypothetical protein of Mycobacterium leprae from cosmid L247 (430 aa), FASTA scores: opt: 2345, E(): 0, (90.6% identity in 393 aa overlap). Protein product from Mb1723 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1723 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ32" /db_xref="InterPro:IPR022215" /db_xref="InterPro:IPR036759" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ32" /protein_id="SIU00327.1" /translation="MRMSALLSRNTSRPGLIGIARVDRNIDRLLRRVCPGDIVVLDVL DLDRITADALVEAEIAAVVNASSSVSGRYPNLGPEVLVTNGVTLIDETGPEIFKKVKD GAKVRLYEGGVYAGDRRLIRGTERTDHDIADLMREAKSGLVAHLEAFAGNTIEFIRSE SPLLIDGIGIPDVDVDLRRRHVVIVADEPSGPDDLKSLKPFIKEYQPVLVGVGTGADV LRKAGYRPQLIVGDPDQISTEVLKCGAQVVLPADADGHAPGLERIQDLGVGAMTFPAA GSATDLALLLADHHGAALLVTAGHAANIETFFDRTRVQSNPSTFLTRLRVGEKLVDAK AVATLYRNHISGGAIALLALTMLIAIIVALWVSRTDGVVLHWIIDYWNRFSLWVQHLV S" CDS 1912496..1913440 /codon_start=1 /transl_table=11 /gene="mctb" /locus_tag="BQ2027_MB1724" /product="outer membrane protein mctb" /note="Mb1724, -, len: 314 aa. Equivalent to Rv1698, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 314 aa overlap). Conserved hypothetical protein, possibly exported protein with potential N-terminal signal sequence. Equivalent to Q49894|MLC1351.10C|Z95117 Hypothetical protein from Mycobacterium leprae (317 aa), FASTA scores: (77.0% identity in 317 aa overlap). Probable coiled-coil from aa 31 to 67. Protein product from Mb1724 detected using SWATH mass spectrometry. Mb1724 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64884" /db_xref="InterPro:IPR021522" /db_xref="UniProtKB/Swiss-Prot:P64884" /protein_id="SIU00328.1" /translation="MISLRQHAVSLAAVFLALAMGVVLGSGFFSDTLLSSLRSEKRDL YTQIDRLTDQRDALREKLSAADNFDIQVGSRIVHDALVGKSVVIFRTPDAHDDDIAAV SKIVGQAGGAVTATVSLTQEFVEANSAEKLRSVVNSSILPAGSQLSTKLVDQGSQAGD LLGIALLSNADPAAPTVEQAQRDTVLAALRETGFITYQPRDRIGTANATVVVTGGALS TDAGNQGVSVARFAAALAPRGSGTLLAGRDGSANRPAAVAVTRADADMAAEISTVDDI DAEPGRITVILALHDLINGGHVGHYGTGHGAMSVTVSQ" CDS 1913580..1915340 /codon_start=1 /transl_table=11 /gene="pyrG" /locus_tag="BQ2027_MB1725" /product="probable ctp synthase pyrg" /note="Mb1725, pyrG, len: 586 aa. Equivalent to Rv1699, len: 586 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 586 aa overlap). pyrG, CTP synthase (EC 6.3.4.2) highly similar to many e.g. PYRG_ECOLI|P08398 ctp synthase from Escherichia coli (544 aa), FASTA scores: opt: 1786, E():0, (51.8% identity in 548 aa overlap). Contains PS00442 Glutamine amidotransferases class-I active site. Protein product from Mb1725 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1725 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5U3" /db_xref="InterPro:IPR004468" /db_xref="InterPro:IPR017456" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029062" /db_xref="InterPro:IPR033828" /db_xref="UniProtKB/Swiss-Prot:P0A5U3" /protein_id="SIU00329.1" /translation="MRKHPQTATKHLFVSGGVASSLGKGLTASSLGQLLTARGLHVTM QKLDPYLNVDPGTMNPFQHGEVFVTEDGAETDLDVGHYERFLDRNLPGSANVTTGQVY STVIAKERRGEYLGDTVQVIPHITDEIKRRILAMAQPDADGNRPDVVITEIGGTVGDI ESQPFLEAARQVRHYLGREDVFFLHVSLVPYLAPSGELKTKPTQHSVAALRSIGITPD ALILRCDRDVPEALKNKIALMCDVDIDGVISTPDAPSIYDIPKVLHREELDAFVVRRL NLPFRDVDWTEWDDLLRRVHEPHETVRIALVGKYVELSDAYLSVAEALRAGGFKHRAK VEICWVASDGCETTSGAAAALGDVHGVLIPGGFGIRGIEGKIGAIAYARARGLPVLGL CLGLQCIVIEAARSVGLTNANSAEFDPDTPDPVIATMPDQEEIVAGEADLGGTMRLGS YPAVLEPDSVVAQAYQTTQVSERHRHRYEVNNAYRDKIAESGLRFSGTSPDGHLVEFV EYPPDRHPFVVGTQAHPELKSRPTRPHPLFVAFVGAAIDYKAGELLPVEIPEIPEHTP NGSSHRDGVGQPLPEPASRG" CDS 1915333..1915956 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1726" /product="nudix hydrolase" /note="Mb1726, -, len: 207 aa. Equivalent to Rv1700, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 207 aa overlap). Conserved hypothetical protein, equivalent to Q49891|MLC1351.08C|Z95117 Hypothetical protein from Mycobacterium leprae (177 aa), FASTA scores: (66.7% identity in 171 aa overlap); also similar to Q9S225|SCI51.15C|AL109848 Hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 508, E(): 1.2e-27, (43.1% identity in 197 aa overlap); similar to P54570|ADPP_BACSU ADP-RIBOSE PYROPHOSPHATASE (EC 3.6.1.13) (185 aa), FASTA scores: opt: 313, E(): 1.1e-06, (42.7% identity in 124 aa overlap). Protein product from Mb1726 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1726 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ31" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ31" /protein_id="SIU00330.1" /translation="MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHLG AVAIVAMDDNGNIPMVYQYRHTYGRRLWELPAGLLDVAGEPPHLTAARELREEVGLQA STWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWYPIAEAARRVL RGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAFATRRAER" CDS 1915953..1916888 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1727" /product="PROBABLE INTEGRASE/RECOMBINASE" /note="Mb1727, -, len: 311 aa. Equivalent to Rv1701, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 311 aa overlap). Probable integrase/recombinase, similar to many e.g. XERD_ECOLI|P21891 integrase/recombinase xerd (298 aa), FASTA scores: opt: 583, E(): 0, (41.8% identity in 311 aa overlap). Also similar to other Mycobacterium tuberculosis integrase/recombinase proteins RV2894c|MTCY274.25c (43.1% identity in 304 aa overlap); and Rv2646|MTCY441.16 phiRv2 integrase (31.1% identity in 161 aa overlap). Equivalent to Z95117|MLCB1351_7 from Mycobacterium leprae (316 aa) (85.4% identity in 316 aa overlap). Protein product from Mb1727 detected using SWATH mass spectrometry. Mb1727 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67637" /db_xref="InterPro:IPR002104" /db_xref="InterPro:IPR004107" /db_xref="InterPro:IPR010998" /db_xref="InterPro:IPR011010" /db_xref="InterPro:IPR011932" /db_xref="InterPro:IPR013762" /db_xref="InterPro:IPR023009" /db_xref="UniProtKB/Swiss-Prot:P67637" /protein_id="SIU00331.1" /translation="MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERG ITDLAKVGEHDVSEFLVALRRGDPDSGTAALSAVSAARALIAVRGLHRFAAAEGLAEL DVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAVLELLYSTGAR ISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHALDAYLVRGRPDLARRGRG TAAIFLNARGGRLSRQSAWQVLQDAAERAGITAGVSPHMLRHSFATHLLEGGADVRVV QELLGHASVTTTQIYTLVTVHALREVWAGAHPRAR" repeat_region complement(1916962..1918340) /rpt_family="REP" /note="REP-6, len: 1379 nt. Equivalent to REP, len: 1362 nt, from Mycobacterium tuberculosis strain H37RV, (65.3% identity in 1364 nt overlap). REPI125, member of REP13E12 family." CDS complement(1916962..1918326) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1728C" /product="13E12 repeat family protein" /note="Mb1728c, -, len: 454 aa. Equivalent to Rv1702c, len: 454 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 454 aa overlap). Conserved hypothetical ORF in REP13E12 degenerate repeat. Similar to other hypothetical proteins inside REP13E12 elements (often in two parts) e.g. Rv0094c|Q50655|MTCY251.13c (317 aa), FASTA scores: opt: 1284, E(): 0, (59.7% identity in 315 aa overlap); and Rv1128c, Rv1945, Rv1148c, etc. Mb1728c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/Swiss-Prot:P64886" /protein_id="SIU00332.1" /translation="MYSSSREEAVAAFDNLDTALNRVLKVSPDDLTIPECLAMLQRCE KIRRRLPAAEHPFINKLADQTDQTELGGKLPFALAERLHISRGEASRRIHEAADLGPR RTLTGQPLPPLLTATAAAQRAGHLGPAHVQVIRCFLHQLPHHVDLPTREKAEAELATL GGRFRPDQLHKLATKLADCLNPDGNYNDTDRARRRSIILGNQGPDGMSAISGYLTPEA RATVDAVLAKLAAPGMANPADDTPCLAGTPSQAAIEADTRSAGQRHHDGLLAALRALL CSGELGQHNGLPAAIIVSTSLTELQSRAGHALTGGGTLLPMSDVIRLASHANHYLRIF DHGRELALYHTKRLASPGQRIVLYAKDRGCSFPNCDVPGYLTEVHHVTDFAQCQETDI NELTQGCGPHHQLATTGGWITRKRKDGTTEWLPPAHLDHGQPRTNSYFHPEKLLHDSD EDDP" CDS complement(1918881..1919471) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1729C" /product="Probable catechol-o-methyltransferase" /note="Mb1729c, -, len: 196 aa. Equivalent to Rv1703c, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 196 aa overlap). Probable catechol-o-methyltransferase (EC 2.1.1.6), most similar to COMT_HUMAN|P21964 soluble form of mammalian catechol o-methyltransferase (271 aa), FASTA scores: opt: 405, E(): 7 .8e-29, (38.9% identity in 190 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical methyltransferases Rv0187, Rv1220c. Protein product from Mb1729c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1729c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y172" /db_xref="InterPro:IPR002935" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y172" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00333.1" /translation="MLATIDKFAYEKSMLINVGDEKGTLLDAAVRRADPALALELGTY LGYGALRIARAAPEARVYSVELAEANASNARRIWAHAGVDDRVVCVVGTIGDGGRTLD ALTEHGFATGTLDFVFLDHDKKAYLPDLQSILDRGWLHPGSIVVADNVRVPGAPKYRA YMRRQQGMSWNTIEHKTHLEYQTLVPDLVLESEYLG" CDS complement(1919536..1921206) /codon_start=1 /transl_table=11 /gene="cycA" /locus_tag="BQ2027_MB1730C" /product="PROBABLE D-SERINE/ALANINE/GLYCINE TRANSPORTER PROTEIN CYCA" /note="Mb1730c, cycA, len: 556 aa. Equivalent to Rv1704c, len: 556 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 556 aa overlap). Probable cycA, D-serine/D-alanine/glycine transporter, highly similar to P39312|CYCA_ECOLI d-serine/d-alanine/glycine transporter from Escherichia coli (470 aa), FASTA scores: opt: 1906, E(): 0, (59.3% identity in 459 aa overlap); etc. Also similar to other Mycobacterium tuberculosis amino-acid permeases e.g. Rv2127, Rv0346c, etc. Contains PS00218 amino acid permeases signature. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY). Protein product from Mb1730c detected using SWATH mass spectrometry. Mb1730c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZW7" /db_xref="InterPro:IPR002293" /db_xref="InterPro:IPR004840" /db_xref="InterPro:IPR004841" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW7" /protein_id="SIU00334.1" /translation="MPDDIAAADPTDTQPHLRRDLANRHIQLIAIGGAIGTGLFMGSG RTISLAGPAVMVVYGIIGFFVFFVLRAMGELLLSNLNYKSFVDFAADLLGPAAGFFVG WSYWFAWVVTGIADLVAITGYARFWWPGLPIWVPALVTVALILAVNLFSVRHFGELEF WFALIKVAAIVCLIAVGAILVATNFVSPHGVHATIENLWNDNGFFPTGFLGVVSGFQI AFFAYIGVELVGTAAAETADPRRTLPRAINAVPLRVAVFYIGALLAILAVVPWRQFAS GESPFVTMFSLAGLAAAASVVNFVVVTAAASSANSGFFSTGRMLFGLADEGHAPAAFH QLNRGGVPAPALLLTAPLLLTSIPLLYAGRSVIGAFTLVTTVSSLLFMFVWAMIIISY LVYRRRHPQRHTDSVYKMPGGVVMCWAVLVFFAFVIWTLTTETETATALAWFPLWFVL LAVGWLVTQRRQSRRSFGFHCQVVGVRQQLGRGMARLAMKIHARPKLRSAVVVEPVSA GEPGARRSAKSVRKLASDDSQSAHCPVAVVGLADGGRDPQYHHDGPDR" CDS complement(1921247..1922404) /codon_start=1 /transl_table=11 /gene="PPE22" /locus_tag="BQ2027_MB1731C" /product="ppe family protein ppe22" /note="Mb1731c, PPE22, len: 385 aa. Equivalent to Rv1705c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 385 aa overlap). Member of the M. tuberculosis PPE family of glycine-rich proteins, similar to many e.g. YX23_MYCTU|Q10813 hypothetical 41.1 kd protein cy274.2 3 (404 aa), fasta scores: opt: 819, E(): 0, (46.2% identity in 413 aa overlap)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ34" /protein_id="SIU00335.1" /translation="MDFGALPPEVNSGRMYCGPGSAPMVAAASAWNGLAAELSVAAVG YERVITTLQTEEWLGPASTLMVEAVAPYVAWMRATAIQAEQAASQARAAAAAYETAFA AIVPPPLIAANRARLTSLVTHNVFGQNTASIAATEAQYAEMWAQDAMAMYGYAGSSAT ATKVTPFAPPPNTTSPSAAATQLSAVAKAAGTSAGAAQSAIAELIAHLPNTLLGLTSP LSSALTAAATPGWLEWFINWYLPISQLFYNTVGLPYFAIGIGNSLITSWRALGWIGPE AAEAAAAAPAAVGAAVGGTGPVSAGLGNAATIGKLSVPPNWAGASPSLAPTVGSASAP LVSDIVEQPEAGAAGNLLGGMPLAGSGTGTGGAGPRYGFRVTVMSRPPFAG" CDS complement(1922444..1923628) /codon_start=1 /transl_table=11 /gene="PPE23" /locus_tag="BQ2027_MB1732C" /product="ppe family protein ppe23" /note="Mb1732c, PPE23, len: 394 aa. Equivalent to Rv1706c, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 394 aa overlap). Member of the M. tuberculosis PPE family of glycine-rich proteins, similar to many e.g. YX23_MYCTU|Q10813 hypothetical 41.1 kd protein cy274.23 (404 aa), fasta scores: opt: 841, E(): 3.9e-31, (46.8% identity in 408 aa overlap). Mb1732c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZG1" /protein_id="SIU00336.1" /translation="MTLDVPVNQGHVPPGSVACCLVGVTAVADGIAGHSLSNFGALPP EINSGRMYSGPGSGPLMAAAAAWDGLAAELSSAATGYGAAISELTNMRWWSGPASDSM VAAVLPFVGWLSTTATLAEQAAMQARAAAAAFEAAFAMTVPPPAIAANRTLLMTLVDT NWFGQNTPAIATTESQYAEMWAQDAAAMYGYASAAAPATVLTPFAPPPQTTNATGLVG HATAVAALRGQHSWAAAIPWSDIQKYWMMFLGALATAEGFIYDSGGLTLNALQFVGGM LWSTALAEAGAAEAAAGAGGAAGWSAWSQLGAGPVAASATLAAKIGPMSVPPGWSAPP ATPQAQTVARSIPGIRSAAEAAETSVLLRGAPTPGRSRAAHMGRRYGRRLTVMADRPN VG" CDS complement(1924232..1924399) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1733C" /product="Permeases of the major facilitator superfamily" /note="Mb1733c, -, len: 55 aa. Equivalent to Rv1706A, len: 55 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 55 aa overlap). Conserved hypothetical protein, similar to part of several probable export proteins e.g. Rv0783c|Z80226_28 from Mycobacterium tuberculosis (540 aa), FASTA scores: opt: 125, E(): 0.011, (52.85% identity in 53 aa overlap). Size difference suggests possible gene fragment." /db_xref="UniProtKB/TrEMBL:A0A1R3XZ39" /protein_id="SIU00337.1" /translation="MGSLAAFKLGWLLSAMAPNVVLLTAFRVPQGLTMLTVFATGQAG QHRCRTFHVTP" CDS 1924632..1926092 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1734" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb1734, -, len: 486 aa. Equivalent to Rv1707, len: 486 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 486 aa overlap). Probable conserved transmembrane protein, possibly involved in transport of sulfate, similar to several hypothetical proteins belonging to the sulfate permease family e.g. P40877|YCHM_ECOLI hypothetical 58.4 kd protein in pth-prsa intergenic region from Escherichia coli (550 aa), FASTA scores: opt: 486, E(): 0, (33.1% identity in 492 aa overlap). Also similar to many other Mycobacterium tuberculosis membrane proteins eg. Rv3273, Rv1739c. SEEMS TO BELONG TO THE SULP FAMILY. Protein product from Mb1734 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1734 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ51" /db_xref="InterPro:IPR001902" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR011547" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ51" /protein_id="SIU00338.1" /translation="MLQRIARELLSGVAVAIVALPLAIAFGITATGTSQGALIGLYGA IFAGFFAAVFGGTPGQVTGPTGPITVVATATIAEHGLEGAFFAFILAGVFQILFGACR LGSLIRYVPHPVISGFMGGIAILIIMTQLDQVRSSSLLVLVTVVLLLASGRFIKAIPP SLLVLVLVSSVLPLAAPWLRDLRAGPVSINRTVDYIGEIPQAMPSFDFPQVANSTMLQ VLLSAVAIALLGSLDSLLTSLVMDNIRGTRHRSNKELIGQGIGNIAAGLFGGLAGAGA TVRSVVNVRNGGQTALSAATHSVVLFVFVAGLGAVVQYIPLAVLSGILILVAVGMFDW HAMRKAHVSPRGDVIVMFTTMIITVVVDLTIAVMVGIALSLLVHRLRSRQRKAKVTQD DTGTYRIDGPLSFLSVDGVFGSLRDGREDVSLDLQHVTYLDTSGAQALLYFIDHSEKD GVAVSIKRIPPRLESQLTALADNEQRDKLRTVLESA" CDS 1926110..1927066 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1735" /product="PUTATIVE INITIATION INHIBITOR PROTEIN" /note="Mb1735, -, len: 318 aa. Equivalent to Rv1708, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 318 aa overlap). Putative initiation inhibitor protein, a soj-related protein probably involved in cell process, highly similar to many sporulation initiation inhibitor proteins soj e.g. P37522|SOJ_BACSU Soj protein from Bacillus subtilis (253 aa), FASTA scores: opt: 745, E(): 0, (46.0% identity in 248 aa overlap), and more weakly to various repA/para/incC proteins from various organisms e.g. Y4CK_RHISN|P55393 putative replication protein A from Rhizobium sp. (407 aa), FASTA scores: opt: 205, E(): 4e-13, (29.0% identity in 252 aa overlap). Also similar to Mycobacterium tuberculosis hyothetical proteins Rv3213c and Rv3918c. Protein product from Mb1735 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1735 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025669" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ37" /protein_id="SIU00339.1" /translation="MPAGLPGQASVAVRLSCDVPPDARHHEPRPGMTDHPDTGNGIGL TGRPPRAIPDPTPRSSHGPAKVIAMCNQKGGVGKTTSTINLGAALGEYGRRVLLVDMD PQGALSAGLGVPHYELDKTIHNVLVEPRVSIDDVLIHSRVKNMDLVPSNIDLSAAEIQ LVNEVGREQTLARALYPVLDRYDYVLIDCQPSLGLLTVNGLACTDGVIIPTECEFFSL RGLALLTDTVDKVRDRLNPKLDISGILITRYDPRTVNSREVMARVVERFGDLVFDTVI TRTVRFPETSVAGEPITTWAPKSAGALAYRALARELIDRFGM" CDS 1927063..1927899 /codon_start=1 /transl_table=11 /gene="scpa" /locus_tag="BQ2027_MB1736" /product="possible segregation and condensation protein scpa" /note="Mb1736, -, len: 278 aa. Equivalent to Rv1709, len: 278 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 278 aa overlap). Conserved hypothetical protein, similar to others e.g. P35154|YPUG_BACSU from Bacillus subtilis (251 aa), FASTA scores: opt: 271, E(): 8.2e-10, (27.0% identity in 248 aa overlap); Q9S230|SCI51.10C|AL109848 from Streptomyces coelicolor (264 aa), FASTA scores: opt: 855, E(): 0, (56.8% identity in 257 aa overlap). Equivalent to Q49888|MLC1351.05C|Z95117 from Mycobacterium leprae (268 aa), FASTA scores: (78.9% identity in 251 aa overlap). Protein product from Mb1736 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1736 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ42" /db_xref="InterPro:IPR003768" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ42" /protein_id="SIU00340.1" /translation="MNGLQNSLANGGTAPENGYSAGFRVRLTNFEGPFDLLLQLIFAH QLDVTEVALHQVTDDFIAYTKAIGARLELEETTAFLVIAATLLDLKAARLLPAGQVDD EEDLALLEVRDLLFARLLQYRAFKHVAEMFAELEATALRSYPRAVSLEDGFVGLLPEV MLGVDAHRFAEIAAIALTPRPAPTVATEHLHELMVSVPEQAEHLLAMLKARGSGQWAS FSELVADCTAPIEIVGRFLALLELYRTRAVAFEQSEPLGALQVSWTGDDAERSDEKER RL" CDS 1927896..1928591 /codon_start=1 /transl_table=11 /gene="scpb" /locus_tag="BQ2027_MB1737" /product="possible segregation and condensation protein scpb" /note="Mb1737, -, len: 231 aa. Equivalent to Rv1710, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 231 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins e.g. P35155|YPUH_BACSU from Bacillus subtilis (197 aa), FASTA scores: opt: 339, E(): 1.3e-09, (36.0% identity in 186 aa overlap); Q9S231|SCI51.09C|AL109848 from Streptomyces coelicolor (223 aa), FASTA scores: opt: 626, E(): 0, (51.0% identity in 192 aa overlap). Equivalent to O05669|MLC1351.04C|Z95117 Hypothetical protein from Mycobacterium leprae (231 aa), FASTA scores: (77.9% identity in 231 aa overlap). Protein product from Mb1737 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1737 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ21" /db_xref="InterPro:IPR005234" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ21" /protein_id="SIU00341.1" /translation="MTEHMPEHDPSYGIPDIAEPAELDADELKRVLEALLLVIDTPVT ADALAAATEQPVYRVAAKLQLMADELTGRDSGIDLRHTSEGWRMYTRARFAPYVEKLL LDGARTKLTRAALETLAVVAYRQPVTRARVSAVRGVNVDAVMRTLLARGLITEVGTDA DTGAVTFATTELFLERLGLTSLSELPDIAPLLPDVDTIDDLSESLDSEPRFIKLTGEL ASEQTLSFDVDRD" CDS 1928588..1929352 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1738" /product="LSU rRNA pseudouridine(2605) synthase (EC" /EC_number="5.4.99.22" /note="Mb1738, -, len: 254 aa. Equivalent to Rv1711, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 254 aa overlap). Conserved hypothetical protein, highly similar to a large family of hypothetical proteins e.g. P37765|YCIL_ECOLI from Escherichia coli (291 aa), FASTA scores: opt: 496, E(): 1.1e-29, (41.6% identity in 250 aa overlap); 9S232|SCI51.08C|AL109848 PUTATIVE PSEUDOURIDINE SYNTHASE from Streptomyces coelicolor (371 aa), FASTA scores: opt: 818, E(): 0, (53.1% identity in 245 aa overlap). Equivalent to O05668|MLCB1351.03C|Z95117 Hypothetical protein from Mycobacterium leprae (256 aa), (80.5% identity in 256 aa overlap). Contains PS01149 Hypothetical yciL/yejD/yjbC family signature. Protein product from Mb1738 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1738 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65843" /db_xref="InterPro:IPR000748" /db_xref="InterPro:IPR002942" /db_xref="InterPro:IPR006145" /db_xref="InterPro:IPR018496" /db_xref="InterPro:IPR020103" /db_xref="InterPro:IPR036986" /db_xref="InterPro:IPR042092" /db_xref="UniProtKB/Swiss-Prot:P65843" /protein_id="SIU00342.1" /translation="MMAEPEESREPRGIRLQKVLSQAGIASRRAAEKMIVDGRVEVDG HVVTELGTRVDPQVAVVRVDGARVVLDDSLVYLALNKPRGMHSTMSDDRGRPCIGDLI ERKVRGTKKLFHVGRLDADTEGLMLLTNDGELAHRLMHPSHEVPKTYLATVTGSVPRG LGRTLRAGIELDDGPAFVDDFAVVDAIPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEA LVRTDIGAVSLGKQRPGSVRALRSNEIGQLYQAVGL" CDS 1929349..1930041 /codon_start=1 /transl_table=11 /gene="cmk" /locus_tag="BQ2027_MB1739" /product="cytidylate kinase cmk (cmp kinase) (cytidine monophosphate kinase) (ck)" /note="Mb1739, cmk, len: 230 aa. Equivalent to Rv1712, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 230 aa overlap). Probable cmk, cytidylate kinase (EC 2.7.4.14), highly similar to many e.g. KCY_ECOLI|P23863 cytidylate kinase from Escherichia coli (227 aa), FASTA scores: opt: 534, E (): 0, (40.3% identity in 221 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Equivalent to Z95117|MLCB1351_2 from Mycobacterium leprae (223 aa) (73.5% identity in 226 aa overlap). BELONGS TO THE CYTIDYLATE KINASE FAMILY, SUBFAMILY 1. Protein product from Mb1739 detected using SWATH mass spectrometry. Mb1739 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63804" /db_xref="InterPro:IPR003136" /db_xref="InterPro:IPR011994" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63804" /protein_id="SIU00343.1" /translation="MSRLSAAVVAIDGPAGTGKSSVSRRLARELGARFLDTGAMYRIV TLAVLRAGADPSDIAAVETIASTVQMSLGYDPDGDSCYLAGEDVSVEIRGDAVTRAVS AVSSVPAVRTRLVELQRTMAEGPGSIVVEGRDIGTVVFPDAPVKIFLTASAETRARRR NAQNVAAGLADDYDGVLADVRRRDHLDSTRAVSPLQAAGDAVIVDTSDMTEAEVVAHL LELVTRRSEAVR" CDS 1930038..1931429 /codon_start=1 /transl_table=11 /gene="engA" /locus_tag="BQ2027_MB1740" /product="PROBABLE GTP-BINDING PROTEIN ENGA" /note="Mb1740, engA, len: 463 aa. Equivalent to Rv1713, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 463 aa overlap). Probable engA, GTP-binding protein. Equivalent to Q49884|MLCB1351.01|U00021_5 PROBABLE GTP-BINDING PROTEIN ENGA from Mycobacterium leprae (461 aa), (88.6% identity in 463 aa overlap). And similar to many e.g. P50743|ENGA_BACSU PROBABLE GTP-BINDING PROTEIN ENGA from Bacillus subtilus (436 aa), FASTA scores: opt: 1077, E(): 0, (40.6% identity in 434 aa overlap). Contains two PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ERA/TRME FAMILY OF GTP-BINDING PROTEINS. ENGA SUBFAMILY. Protein product from Mb1740 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1740 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64058" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR016484" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031166" /db_xref="InterPro:IPR032859" /db_xref="UniProtKB/Swiss-Prot:P64058" /protein_id="SIU00344.1" /translation="MTQDGTWVDESDWQLDDSEIAESGAAPVVAVVGRPNVGKSTLVN RILGRREAVVQDIPGVTRDRVCYDALWTGRRFVVQDTGGWEPNAKGLQRLVAEQASVA MRTADAVILVVDAGVGATAADEAAARILLRSGKPVFLAANKVDSEKGESDAAALWSLG LGEPHAISAMHGRGVADLLDGVLAALPEVGESASASGGPRRVALVGKPNVGKSSLLNK LAGDQRSVVHEAAGTTVDPVDSLIELGGDVWRFVDTAGLRRKVGQASGHEFYASVRTH AAIDSAEVAIVLIDASQPLTEQDLRVISMVIEAGRALVLAYNKWDLVDEDRRELLQRE IDRELVQVRWAQRVNISAKTGRAVHKLVPAMEDALASWDTRIATGPLNTWLTEVTAAT PPPVRGGKQPRILFATQATARPPTFVLFTTGFLEAGYRRFLERRLRETFGFDGSPIRV NVRVREKRAGKRR" CDS 1931603..1932415 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1741" /product="Probable oxidoreductase" /note="Mb1741, -, len: 270 aa. Equivalent to Rv1714, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 270 aa overlap). Probable oxidoreductase (EC 1.-.-.-) similar to many e.g. AE0010|AE001021_4 Archaeoglobus fulgidus section 79 (281 aa), FASTA scores: opt: 578, E(): 3.3e-31, (38.9% identity in 265 aa overlap). Also similar to several other Mycobacterium tuberculosis oxidoreductases e.g. Rv1544, etc. Mb1741 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ44" /protein_id="SIU00345.1" /translation="MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALAD AGARLTLAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDGVL VASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGGSVVLVSSVRG GLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALAPTVFRSAVTEWMFTDDPK GRATREAMLARIPLRRFAEPEDFVGALIYLLSDASSFYTGQVMYLDGGYTAC" mobile_element 1932189..1932363 /mobile_element_type="insertion sequence:IS1540" /locus_tag="BQ2027_IS1540'-1" /note="IS1540'-1, len: 175 nt. Equivalent to a region of IS1540, len: 1163 nt, from Mycobacterium tuberculosis strain H37Rv, (97.7% identity in 176 nt overlap)." CDS 1932409..1932972 /codon_start=1 /transl_table=11 /gene="fadB3a" /locus_tag="BQ2027_MB1742" /product="PROBABLE 3-HYDROXYBUTYRYL-COA DEHYDROGENASE FADB3a [FIRST PART] (BETA-HYDROXYBUTYRYL-COA DEHYDROGENASE) (BHBD)" /note="Mb1742, fadB3a, len: 187 aa. Equivalent to the 5' end of Rv1715, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 187 aa overlap). Probable fadB3, 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157), highly similar to many e.g. NP_107236.1|NC_002678 3-hydroxybutyryl-CoA dehydrogenase from Mesorhizobium loti (309 aa); NP_250319.1|NC_002516 probable 3-hydroxyacyl-CoA dehydrogenase from Pseudomonas aeruginosa (509 aa); P45856|HBD_BACSU PROBABLE 3-HYDROXYBUTYRYL-COA DEHYDROGENASE from Bacillus subtilis (287 aa), FASTA scores: opt: 488, E(): 1.5e-24, (38.7% identity in 279 aa overlap); etc. COULD BELONG TO THE 3-HYDROXYACYL-COA DEHYDROGENASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1715/fadB3 exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) splits fadB3 into 2 parts, fadB3a and fadB3b. Mb1742 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZG8" /db_xref="InterPro:IPR006176" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZG8" /protein_id="SIU00346.1" /translation="MLTSHGFSRAAVVGAGLMGRRIAGVLASAGLDVAITDTNAEILH AAAVEAARVAGAGRGSVAAAADLAAAIPDADLVIEAVVENLAVKQELFERLATLAPDA VLATNTSVLPIGAVTERVEDGSRVIGTHFWNPPDLIPVVEVVPSARTAPDTADRVVAL LTQVGKLPVRVGRDVPGFIGNRLQHAL" CDS 1933030..1933323 /codon_start=1 /transl_table=11 /gene="fadB3b" /locus_tag="BQ2027_MB1743" /product="PROBABLE 3-HYDROXYBUTYRYL-COA DEHYDROGENASE FADB3b [SECOND PART] (BETA-HYDROXYBUTYRYL-COA DEHYDROGENASE) (BHBD)" /note="Mb1743, fadB3b, len: 97 aa. Equivalent to the 3' end of Rv1715, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 97 aa overlap). Probable fadB3, 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157), highly similar to many e.g. NP_107236.1|NC_002678 3-hydroxybutyryl-CoA dehydrogenase from Mesorhizobium loti (309 aa); NP_250319.1|NC_002516 probable 3-hydroxyacyl-CoA dehydrogenase from Pseudomonas aeruginosa (509 aa); P45856|HBD_BACSU PROBABLE 3-HYDROXYBUTYRYL-COA DEHYDROGENASE from Bacillus subtilis (287 aa), FASTA scores: opt: 488, E(): 1.5e-24, (38.7% identity in 279 aa overlap); etc. COULD BELONG TO THE 3-HYDROXYACYL-COA DEHYDROGENASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1715/fadB3 exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) splits fadB3 into 2 parts, fadB3a and fadB3b. Mb1743 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ49" /db_xref="InterPro:IPR006108" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013328" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ49" /protein_id="SIU00347.1" /translation="MVRNTIGLRLATLGPLENADYIGLDLTLAIHDAVIPSLNHDPHP SPLLRELVAAGQLGARTGHGFLDWPAGAREATTARLAQHIAAQLQANEKGRGT" CDS 1933326..1934156 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1744" /product="Kynurenine formamidase, bacterial (EC" /EC_number="3.5.1.9" /note="Mb1744, -, len: 276 aa. Equivalent to Rv1716, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 276 aa overlap). Conserved hypothetical protein, shows high similarity with AF1200|O29068|AE001021_11A conserved protein of Archaeoglobus fulgidus, gp fulgidus section 7 (278 aa), FASTA scores: E(): 0, (61.8% identity in 251 a a overlap); also weak similarity to several polyketide cyclases e.g. O68500|AF048833|DPSY from Streptomyces peucetius (272 aa), FASTA scores: opt: 194, E(): 1.7e-05, (29.6% identity in 223 aa overlap)." /db_xref="GOA:A0A1R3XZ58" /db_xref="InterPro:IPR007325" /db_xref="InterPro:IPR037175" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ58" /protein_id="SIU00348.1" /translation="MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGM AKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMV TAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVG TDTQALDHPLATAIAPHGPAEAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILS QGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKAA " CDS 1934156..1934506 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1745" /product="Cupin 2 barrel protein" /note="Mb1745, -, len: 116 aa. Equivalent to Rv1717, len: 116 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 116 aa overlap). Conserved hypothetical protein, similar to O29060|AF1208|AE001021 Hypothetical protein from Arecheoglobus fulgidus (114 aa), FASTA scores: opt: 254, E(): 3.3e-09, (37.7% identity in 114 aa overlap)." /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR013096" /db_xref="InterPro:IPR014710" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ53" /protein_id="SIU00349.1" /translation="MKLTRASQAPRYVAPAHHEVSTMRLQGREAGRTERFWVGLSVYR PGGTAEPAPTREETVYVVLDGELVVTVDGAETVLGWLDSVHLAKGELRSIHNRTDRQA LLLVTVAHPVAEVA" CDS 1934559..1935182 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1746" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1746, -, len: 207 aa. Equivalent to the 5' end of Rv1718, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 193 aa overlap). Conserved hypothetical protein, similar to O29058|AF1210|AE001021 Hypothetical protein from Archeoglobus (313 aa), FASTA scores: opt: 301, E(): 8e-23, (31.6% identity in 301 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1718 exists as a single gene. In Mycobacterium bovis, a single base deletion (c-*) leads to a shorter product with a different COOH terminus." /db_xref="GOA:A0A1R3XZ47" /db_xref="InterPro:IPR008567" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ47" /protein_id="SIU00350.1" /translation="MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVA HIHLRDENERPTADPNIARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRM ATLNPCSMSFGAGEFRNPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLL AEPLQFSIVLGVRGGMAATADNLLTMVRRLPPGRSGKSSRSVRPTWN" CDS 1935390..1936169 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1748" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1748, -, len: 259 aa. Equivalent to Rv1719, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 259 aa overlap). Probable transcriptional regulatory protein, similar to YIAJ_ECOLI|P37671 hypothetical transcriptional regulator from Escherichia coli (282 aa), FASTA scores: opt: 353, E(): 3.2e-15, (31.1% identity in 235 aa overlap). Similar to Mycobacterium tuberculosis hypothetical IclR-family transcriptional regulators Rv2989, Rv1773c. Helix-turn-helix motif from aa 34-55 (+6.94 SD). Protein product from Mb1748 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZ29" /db_xref="InterPro:IPR005471" /db_xref="InterPro:IPR012318" /db_xref="InterPro:IPR014757" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ29" /protein_id="SIU00351.1" /translation="MSAEEQDTRSGGIQVIARAAELLRVLQAHPGGLSQAEIGERVGM ARSTVSRILNALEDEGLVASRGARGPYRLGPEITRMATTVRLGVVTEMHPFLTELSRE LDETVDLSILDGDRADVVDQVVPPQRLRAVSAVGESFPLYCCANGKALLAALPPERQA RALPSRLAPLTANTITDRAALRDELNRIRVDGVAYDREEQTEGICAVGAVLRGVSVEL VAVSVPVPAQRFYGREAELAGALLAWVSKVDAWFNGTEDRK" tRNA 1936362..1936435 /locus_tag="BQ2027_PROT" /product="tRNA-Pro" /note="proT, len: 74 nt. Equivalent to proT, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Pro, anticodon ggg." gene 1936598..1936772 /locus_tag="BQ2027_IS1540'-1" CDS complement(1936779..1937168) /codon_start=1 /transl_table=11 /gene="vapc12" /locus_tag="BQ2027_MB1749C" /product="possible toxin vapc12" /note="Mb1749c, -, len: 129 aa. Equivalent to Rv1720c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 129 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O53610|Rv0065|MTV030.08 (133 aa), FASTA scores: E(): 1.5e-10, (39.1% identity in 128 aa overlap); P71550|Rv0960|MTCY10D7.14C (129 aa) and O06415|Rv0549c|MTCY25D10.28C (137 aa). Mb1749c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ41" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ41" /protein_id="SIU00352.1" /translation="MIVLDASAAVELMLTTPAGAAVARRLRGETVHAPAHFDVEVIGA IRQAVVRQLISDHEGLVVVVNFLSLPVRRWPLKPFTQRAYQLRSTHTVADGAYVALAE GLGVPLITCDGRLAQSHGHNAEIELVA" CDS complement(1937165..1937392) /codon_start=1 /transl_table=11 /gene="vapb12" /locus_tag="BQ2027_MB1750C" /product="possible antitoxin vapb12" /note="Mb1750c, -, len: 75 aa. Equivalent to Rv1721c, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 75 aa overlap). Conserved hypothetical protein, similar to Rv0300|MTCY63.05|O07227 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (73 aa). Start changed since original submission. Protein product from Mb1750c detected using SWATH mass spectrometry. Mb1750c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y196" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/TrEMBL:A0A1R3Y196" /protein_id="SIU00353.1" /translation="MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEP ALDDVLDRLAALPRRDLGASAAELVDEARSE" CDS 1937610..1939094 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1751" /product="POSSIBLE CARBOXYLASE" /note="Mb1751, -, len: 494 aa. Equivalent to Rv1722, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 494 aa overlap). Possible carboxylases. Weak similarity to several e.g. ACCC_BACSU|P49787 biotin carboxylase from Bacillus subtilis (448 aa), fasta scores: opt: 171, E(): 0.00021, (22.8% identity in 237 aa overlap). Protein product from Mb1751 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1751 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZY7" /db_xref="InterPro:IPR011761" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY7" /protein_id="SIU00354.1" /translation="MIVPAREPEPQPRRVLNGLSDVRAFFHNNTMPLYFISPTPFNLL GIYRWIRNFFYLTYYDSFEGEHSRVFVPRRRDRRDFDGMGDVCNHLLRDPETLEFIKN RGPGGKACFVMLDEETQALARQAGLEVMHPPAELRHRLESKIVMTRLADEAGVPSVPH VIGRVSSYDELSALAHGAGLGDDLVVEAAYGNAGSATFFVRGLRDWDQCAGGIVGQPE IKVMKRIRNVEVCIEATVTRHGTVIGPAMTSLVGYPELTPYRGAWCGNDVWRGALPPA QTRAAREMVAKLGDVLSREGYRGYFEVDLLHDLDADELYLGEVNPRLSGASPMTNLTT EAYADMPLFLFHLLEYMDVDYELDIEAINSRWERGYGEDEVWGQLIMSETSPDLELFT ATPRTGMWRLNHDGRVSFARQGNDWATMLDESEAFYMRVAAPGDLRCEGAQLGVLVTR GHLQTDDYQLTERGRRWIDGLKAQFASTPLTPAAPIVSRLVARA" CDS 1939091..1940338 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1752" /product="PROBABLE HYDROLASE" /note="Mb1752, -, len: 415 aa. Equivalent to Rv1723, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 415 aa overlap). Possible hydrolase (EC 3.-.-.-), similar to others e.g. NYLB_FLASP|P07061 6-aminohexanoate-dimer hydrolase from Flavobacterium sp. (392 aa), FASTA scores: opt: 717, E(): 0, (35.1% identity in 396 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical esterases and penicillin binding proteins e.g. Rv1923, Rv1497, Rv2463, etc." /db_xref="GOA:A0A1R3XZ54" /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ54" /protein_id="SIU00355.1" /translation="MSGGVPAGLALDNWLSSPYSHWAFQHVEDFMPTTVIARGTEPVV TLPADNAPIADIGLTSTDGIATTVGAVMAATATDGWAVAHRGALVAEQYLDGLGPRTR HLLFSVSKSLVAAVVGALHGAGAIELDAPVTAYVPALADCGYAGATVRHLLDMRSGVA FSENYDDPAAEIHVREQVIGWAPKRGPDLPATLRDYLLTLRRKSAHGGPFEYRSCETD VLGWICEAAAGQPMPELMSELLWSRIGAQCDATIALDVAGAAGTGIFDGGISACLTDM IRFGSLYLRDGVSLAGQQVVPAAWIADTFDGGPDSRQAFAASPDDNPMPGGMYRNQVW FPYPGSNVALCVGMCGQLIYVNRAAEVVAAKLSTQPHSHEPHMLDTLRAFDAVAHELS GIRSSSTNDPQRPSPPAQEASPG" CDS complement(1940381..1940800) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1753C" /product="HYPOTHETICAL PROTEIN" /note="Mb1753c, -, len: 139 aa. Equivalent to Rv1724c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 139 aa overlap). Hypothetical unknown protein. Mb1753c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZH9" /protein_id="SIU00356.1" /translation="MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPV PHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYN AVRNAGRAIENEQAALDHKLAEVRKRRMDTWDESYFR" CDS complement(1940790..1941500) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1754C" /product="Transcriptional regulator, HxlR family / Domain of unknown function" /note="Mb1754c, -, len: 236 aa. Equivalent to Rv1725c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 236 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins from diverse organisms e.g. P70885|U44893 ORF108 from BUTYRIVIBRIO FIBRISOLVENS, (108 aa), FASTA scores: opt: 223, E(): 2e-09, (39.1% identity in 92 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical transcriptional regulator, O05774|Rv3095|YU95_MYCTU (158 aa). Protein product from Mb1754c detected using SWATH mass spectrometry. Mb1754c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002577" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR036527" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ57" /protein_id="SIU00357.1" /translation="MQPYGQYCPVARAAELLGDRWTLLIVRELLFGPLRFTEIERGLP GISRSVLAQRLRRLQHDRIIEAVPEHTGGGYRFTVAGEELRPVLQTLGDWVSRWLMAD PTPAECDPELLTLWISRRVNTEALPGRRVVVEFRYHGERPLWAWLVLEPGDISVCLHD PCLPVDLTVRGHPRDLYRVYSGRSTLAAEISAERIELDGLPAMRRAFPSWMAWSPFAP AMRQAVVSVDQMPEAHGG" CDS 1941601..1942986 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1755" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1755, -, len: 461 aa. Equivalent to Rv1726, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 461 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase (458 aa), FASTA scores: opt: 678, E(): 0, (29.5% identity in 465 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical dehydrogenases e.g. Rv3107c, Rv1257c, etc." /db_xref="GOA:A0A1R3XZ69" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR012951" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016167" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ69" /protein_id="SIU00358.1" /translation="MTATLTKTLGSLDDFRGTLCVPGDPDYPRVRAIWNGQVAREPAL IATCHDACDVRTVLRRPVDAGMVTAVRGGGHNVAGTALCDGGVVIDLSAMRAVSLDPA TGRVRVQGGATLADLDHATVPFARVAPAGIVTTTGVGGLTLGGGVGWTTRRFGLSCDN LVAVRLVTAAGDYLSVDDERDPELMWGLRGGGGNFGIVTEFEFATHPFGPVAVAGFVV YRLDDGPAVLRGYRQFAAAAPEEVTTIVVLRHAPPAPWIPVDQRGKPVVMIGAVHTGS IQTGIEALRPVKSLARPVADTVWPTPFLAHQAVLDASNPAGHRYYWKSDYLAELNDEA IDLLVEQTAQLSSPDSLIGIFQLGGAAARGGERSCFPSRHARFMVNYATHWTEAREDD LHRQWTRDAIEALAPYGLGTAYVNFTADDAPMHVETLYSTTEFSRLVTLKNRLDPDNV FRNNHNIRPSA" CDS 1943019..1943588 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1756" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1756, -, len: 189 aa. Equivalent to Rv1727, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 189 aa overlap). Similar to Mycobacterium tuberculosis hypothetical proteins P72040|Rv3773c|MTCY13D12.07C (194 aa), FASTA scores: opt: 176, E(): 2.7e-08, (31.1% identity in 180 aa overlap); and O53801|Rv0738 (182 aa)." /db_xref="GOA:A0A1R3XZ63" /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR017520" /db_xref="InterPro:IPR024344" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ63" /protein_id="SIU00359.1" /translation="MDLYSNLVEAEQRLVALVSSIEADSYSSPTPCDRWDVRALLSHA LASIDAFAAAVDGAPGPDMAQVFSGADIVGDDPLGATQRITRRSQAAWSTVRDLNAEL STFIGVMPAGQALAIITFSTVVHGWDLAVATGQAGELPEHLAEAAQQVAAELVPVLRP RGLFAHDVDLAGEATPTQRLVALTGRKPR" CDS complement(1943613..1944383) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1757C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1757c, -, len: 256 aa. Equivalent to Rv1728c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 256 aa overlap). Conserved hypothetical protein, some similarity to O07246|Rv0320|MTCY63.25 possible exported protein from Mycobacterium tuberculosis (220 aa), FASTA scores: E(): 1.3e-31, (42.3% identity in 220 aa overlap). C-terminal region similar to Q9ZX60|AF068845|AF068845_17 segment of gp17 of Mycobacteriophage TM4 (1229 aa), FASTA scores: opt: 385, E(): 4.3e-17, (44.6% identity in 139 aa overlap). Mb1757c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZ56" /protein_id="SIU00360.1" /translation="MSVNGLPGAHNAGLQPIDSKGCHTRRTRHTKVLFVSKGVLANGR GRWLAIAASLVVSAAILYAQGAEHTCCRETPAAIPTGPDSAPANAPRIASPTEADLLA ASAPVAAQQFQFALPAGVASEEGLQVKTIWVARAVSVLFPQITNIFGYRQDPLKWHPN GLAIDVMIPNHHSDEGIQLGNQVAGLALANAKRWGVLHVIWRQGYYPGIGAPSWTADY GSETLNHYDHVHIATDGGGYPTGRETYYVGSMSPTPPE" CDS complement(1944380..1945318) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1758C" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb1758c, -, len: 312 aa. Equivalent to Rv1729c, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 312 aa overlap). Conserved hypothetical protein, similar to many Mycobacterium tuberculosis hypothetical proteins e.g. Q50726|Rv3399|YX99_MYCTU (348 aa), FASTA scores: opt: 1019, E(): 0, (55.7% identity in 296 aa overlap); P95074|Rv0726c (367 aa), O53795|Rv0731c (318 aa), and O53841|Rv0830 (301 aa), etc. Protein product from Mb1758c detected using SWATH mass spectrometry. Mb1758c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZP5" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TZP5" /protein_id="SIU00361.1" /translation="MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEP LVRAVGLDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGGIR QVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPSAIRRAVPIDL RADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFDNITALSAPGSMVATEFVT GIADFSAERARTISNPFRCHGVDVDLASLVYTGPRNHVLDYLAAKGWQPEGVSLAELF RRSGLDVRAADDDTIFISGCLTDHSSISPPTAAGWR" CDS complement(1945498..1947051) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1759C" /product="POSSIBLE PENICILLIN-BINDING PROTEIN" /note="Mb1759c, -, len: 517 aa. Equivalent to Rv1730c, len: 517 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 517 aa overlap). Possible penicillin-binding protein, similar to others e.g. PBP4_NOCLA|Q06317 penicillin-binding protein 4 (pbp-4) from Nocardia lactamdurans (381 aa), FASTA scores: opt: 643, E(): 3.8e-32, (33.8% identity in 370 aa overlap); etc. Also similar to other Mycobacterium tuberculosis hypothetical penicillin binding proteins and esterases e.g. Rv1923, Rv1497, etc. Protein product from Mb1759c detected using SWATH mass spectrometry. Mb1759c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ52" /protein_id="SIU00362.1" /translation="MCPPIILSSATPTGTRCGTRHGRAVVTEYVRALDRLPHEIATAV VETVNCADPGAAFDELDAKINAGMKAYAIPGVAVAVWAGGQEYVKGYGVTNVDHPMPV DGDTVFRIGSTTKTFTGTVMMRLVERGKVDLDSPVRRYIPDFAVADESASATVTVRQL LNHTAGWDGRNGQDFGRGDDAVALYVKAMTRLPQLTPPGTAFAYNNSGLVVAGRIIEL VAGTTYESTVQRLLLDPLQLAHTRYFSDQIIGLNVAASHSVVDGKPIAVTDFWTFPRS CNPTGGLMSTARDQLRYAQFHLGDGRAPNGEQILSRQSLKAMRSNPGAGGTLWVELTG MGVTWMLRPSAENVTIVEHGGTWKGQRSGFVMVPDRNFAMTVLTNSDGGFHMINDLFA SDWALQRFAGLSNLPATPQRLGAVDLAPYEGRYIAKQVAQNGDLETTVIDFRARDGQL AGSMSTDDANPDGQNSANLGLAFYRPDYGLDLGPDNKPTGSRSNFVRGPDGNIAWFCS QHGRLFRRQ" CDS 1947483..1949039 /codon_start=1 /transl_table=11 /gene="gabD2" /locus_tag="BQ2027_MB1760" /standard_name="gabD1" /product="possible succinate-semialdehyde dehydrogenase [nadp+] dependent (ssdh) gabd2" /note="Mb1760, gabD2, len: 518 aa. Equivalent to Rv1731, len: 518 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 518 aa overlap). Possible gabD2, succinate-semialdehyde dehydrogenase [NADP+] dependent (EC 1.2.1.16), similar to others e.g. GABD_ECOLI|P25526 succinate-semialdehyde dehydrogenase from Escherichia coli (482 aa), FASTA scores: opt: 870, E(): 0, (34.7% identity in 449 aa overlap); etc. Also similar to gabD1|Rv0234c|MTCY08D5.30c PROBABLE SUCCINATE-SEMIALDEHYDE DEHYDROGENASE [NADP+] DEPENDANT from Mycobacterium tuberculosis (511 aa); and other semialdehyde dehydrogenases e.g. Rv0768|aldA (489 aa), Rv2858c|aldC (455 aa), etc. Contains PS00216 Sugar transport proteins signature 1, PS00687 Aldehyde dehydrogenases glutamic acid active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Note that previously known as gabD1. Protein product from Mb1760 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1760 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZP3" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="InterPro:IPR029510" /db_xref="UniProtKB/Swiss-Prot:Q7TZP3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00363.1" /translation="MPAPSAEVFDRLRNLAAIKDVAARPTRTIDEVFTGKPLTTIPVG TAADVEAAFAEARAAQTDWAKRPVIERAAVIRRYRDLVIENREFLMDLLQAEAGKARW AAQEEIVDLIANANYYARVCVDLLKPRKAQPLLPGIGKTTVCYQPKGVVGVISPWNYP MTLTVSDSVPALVAGNAVVLKPDSQTPYCALACAELLYRAGLPRALYAIVPGPGSVVG TAITDNCDYLMFTGSSATGSRLAEHAGRRLIGFSAELGGKNPMIVARGANLDKVAKAA TRACFSNAGQLCISIERIYVEKDIAEEFTRKFGDAVRNMKLGTAYDFSVDMGSLISEA QLKTVSGHVDDATAKGAKVIAGGKARPDIGPLFYEPTVLTNVAPEMECAANETFGPVV SIYPVADVDEAVEKANDTDYGLNASVWAGSTAEGQRIAARLRSGTVNVDEGYAFAWGS LSAPMGGMGLSGVGRRHGPEGLLKYTESQTIATARVFNLDPPFGIPATVWQKSLLPIV RTVMKLPGRR" CDS complement(1949049..1949597) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1761C" /product="Alkyl hydroperoxide reductase and/or thiol-specific antioxidant family (AhpC/TSA) protein" /note="Mb1761c, -, len: 182 aa. Equivalent to Rv1732c, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 182 aa overlap). Conserved hypothetical protein, highly similar to hypothetical proteins from several organisms e.g. P73178|SLL1289|D90904 from Synechocystis (194 aa), FASTA scores: opt: 663, E(): 0, (53.1% identity in 179 aa overlap); etc. Protein product from Mb1761c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1761c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZZ8" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00364.1" /translation="MAVESSMLALGTPAPSFTLPQPATGATVSLDELTGPALVVTFIC NHCPYVQHVAAGLATLGRDLADQGVPMVGISSNDVVTYPQDGPDQMVAEARRHGWTFP YLYDETQDVARAFSAACTPDTFVFDGQRRLVYRGQLDDSRPGNGRPVTAADVRAAVDA LLAGRPVNPDQRPSIGCGIKWR" CDS complement(1949661..1950293) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1762C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb1762c, -, len: 210 aa. Equivalent to Rv1733c, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 210 aa overlap). Probable conserved transmembrane protein. Similar to AL109962|SCJ1_26 hypothetical protein from Streptomyces coelicolor (193 aa), FASTA scores: opt: 287, E(): 3.8e-11, (35.2% identity in 182 aa overlap). Mb1762c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ65" /db_xref="InterPro:IPR039708" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ65" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00365.1" /translation="MIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAV VMLLAVTVSLLTIPFAAAAGTAVHDSRSHVYAHQAQTRHPATATVIDHEGVIDSNTTA TSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQLVDEPAPPARA IADAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQHDIDSLFCTQR" CDS complement(1950580..1950822) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1763C" /product="Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes" /note="Mb1763c, -, len: 80 aa. Equivalent to Rv1734c, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 80 aa overlap). Conserved hypothetical protein, similar to C-terminal region Q9Z8N2|CP0452|AE001615 Dihydrolipoamide Acetyltransferase from Chlamydia pneumoniae (429 aa), FASTA scores: opt: 138, E(): 0.0012, (26.9% identity in 78 aa overlap)." /db_xref="GOA:A0A1R3XZI3" /db_xref="InterPro:IPR001078" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00366.1" /translation="MTNVGDQGVDAVFGVIYPPQVALVSFGKPAQRVCAVDGAIHVMT TVLATLPADHGCSDDHRGALFFLSINELTRCAAVAG" CDS complement(1951097..1951594) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1764C" /product="HYPOTHETICAL MEMBRANE PROTEIN" /note="Mb1764c, -, len: 165 aa. Equivalent to Rv1735c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 165 aa overlap). Hypothetical membrane protein, similar to part of O58614|PH0884|AP000004 Hypothetical malic acid transport protein from Pyrococcus horikoshii (330 aa), FASTA scores: opt: 167, E(): 0.0003, (29.2% identity in 120 aa overlap)." /db_xref="GOA:A0A1R3XZ67" /db_xref="InterPro:IPR004695" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ67" /protein_id="SIU00367.1" /translation="MGATAITVLAGAHIVEMADAPMAIVTSGLVAGASVVFWAFGPWL IPPLVAASIWKHVVHRVPLRYEATLWSVVFPLGMYGVGAYRLGLAAHLPIVESIGEFE GWVALAVWTITFVAMLHHLAATIGRSGRSSHAIGAADDTHAIICRPPRSFDHQVRAFR RNQPM" CDS complement(1952034..1953992) /codon_start=1 /transl_table=11 /gene="narX" /locus_tag="BQ2027_MB1765C" /product="Probable nitrate reductase NarX" /note="Mb1765c, narX, len: 652 aa. Equivalent to Rv1736c, len: 652 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 652 aa overlap). Probable narX, nitrate reductase (EC 1.7.99.4). Contains three domains: N-terminus (250 aa) is similar to e.g. N-terminus of NARG_ECOLI|P09152 respiratory nitrate reductase 1 alpha chain from Escherichia coli (1246 aa), FASTA scores: E(): 0, (58.6% identity in 251 aa overlap); and Rv1161|MTCI65.28|NARG PROBABLE RESPIRATORY NITRATE REDUCTASE (ALPHA CHAIN) from Mycobacterium tuberculosis (1232 aa). Central region (260-410 aa) is similar to Rv1163|O06561|NARJ PROBABLE RESPIRATORY NITRATE REDUCTASE (DELTA CHAIN) from Mycobacterium tuberculosis (201 aa), FASTA scores: E(): 0, (64.2% identity in 159 aa overlap). C-terminus (420 aa-) is similar to Rv1164|O06562|NARI PROBABLE RESPIRATORY NITRATE REDUCTASE (GAMMA CHAIN) from M. tuberculosis (246 aa), FASTA scores: E(): 0, (68.6% identity in 239 aa overlap). Contains PS00551 Prokaryotic molybdopterin oxidoreductases signature 1. Mb1765c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ78" /db_xref="InterPro:IPR003765" /db_xref="InterPro:IPR003816" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006963" /db_xref="InterPro:IPR020945" /db_xref="InterPro:IPR023234" /db_xref="InterPro:IPR027467" /db_xref="InterPro:IPR036197" /db_xref="InterPro:IPR036411" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ78" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00368.1" /translation="MTVTPRTGSRIEELLARSGRFFIPGEISADLRTVTRRGGRDGDV FYRDRWSHHKVVRSTHGVNCTGSCSWKIYVKDGIITWETQETDYPSVGPDRPEYEPRG CPRGAAFSWYTYSPTRVRHPYARGVLVEMYREAKARLGDPVAAWADIQADPRRRRRYQ RARGKGGLVRVSWAEATEMIAAAHVHTISTYGPDRVAGFSPIPAMSMVSHAAGSRFVE LIGGVMTSFYDWYADLPVASPQVFGDQTDVPESGDWWDVVWQCASVLLTYPNSRQLGT AEELLAHIDGPAADLLGRTVSELRRADPLTAATRYVDTFDLRGRATLYLTYWTAGDTR NRGREMLAFAQTYRSTDVAPPRGETPDFLPVVLEFAATVDPEAGRRLLSGYRVPIAAL CNALTEAALPYAHTVAAVCRTGDMMGELFWTVVPYVTMTIVAVGSWWRYRYDKFGWTT RSSQLYESRLLRIASPMFHFGILVVIVGHGIGLVIPQSWTQAAGLSEGAYHVQAVVLG SIAGITTLAGVTLLIYRRRTRGPVFMATTVNDKVMYLVLVAAIVAGLGATALGSGVVG EAYNYRETVSVWFRSVWVLQPRGDLMAEAPLYYQIHVLIGLALFALWPFTRLVHAFSA PIGYLFRPYIIYRSREELVLTRPRRRGW" CDS complement(1953989..1955176) /codon_start=1 /transl_table=11 /gene="narK2" /locus_tag="BQ2027_MB1766C" /product="POSSIBLE NITRATE/NITRITE TRANSPORTER NARK2" /note="Mb1766c, narK2, len: 395 aa. Equivalent to Rv1737c, len: 395 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 395 aa overlap). Possible narK2, nitrate/nitrite-transport integral membrane protein (see first citation below), possibly member of major facilitator superfamily (MFS), similar to P46907|NARK_BACSU nitrite extrusion protein from Bacillus subtilis (395 aa), FASTA scores: opt: 742, E(): 0, (33.6% identity in 375 aa overlap); and to AL109989|SCJ12.23 hypothetical nitrate/nitrite transporter from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1181, E(): 0, (49.4% identity in 389 aa overlap). Protein product from Mb1766c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZ73" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ73" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00369.1" /translation="MRGQAANLVLATWISVVNFWAWNLIGPLSTSYARDMSLSSAEAS LLVATPILVGALGRIVTGPLTDRFGGRAMLIAVTLASILPVLAVGVAATMGSYALLVF FGLFLGVAGTIFAVGIPFANNWYQPARRGFSTGVFGMGMVGTALSAFFTPRFVRWFGL FTTHAIVAAALASTAVVAMVVLRDAPYFRPNADPVLPRLKAAARLPVTWEMSFLYAIV FGGFVAFSNYLPTYITTIYGFSTVDAGARTAGFALAAVLARPVGGWLSDRIAPRHVVL ASLAGTALLAFAAALQPPPEVWSAATFITLAVCLGVGTGGVFAWVARRAPAASVGSVT GIVAAAGGLGGYFPPLVMGATYDPVDNDYTVGLLLLVATALVACTYTALHAREPVSEE ASR" CDS 1955463..1955747 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1767" /product="conserved protein" /note="Mb1767, -, len: 94 aa. Equivalent to Rv1738, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 94 aa overlap). Conserved hypothetical protein, similar to P71931|Rv2632c|YQ32_MYCTU Hypothetical 10.1 kd protein from Mycobacterium tuberculosis (93 aa), FASTA scores: opt: 319, E(): 2.6e-27, (53.9% identity in 89 aa overlap). Protein product from Mb1767 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1767 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR015057" /db_xref="InterPro:IPR038070" /db_xref="UniProtKB/Swiss-Prot:P64888" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00370.1" /translation="MCGDQSDHVLQHWTVDISIDEHEGLTRAKARLRWREKELVGVGL ARLNPADRNVPEIGDELSVARALSDLGKRMLKVSTHDIEAVTHQPARLLY" CDS complement(1955761..1957443) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1768C" /product="PROBABLE SULPHATE-TRANSPORT TRANSMEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb1768c, -, len: 560 aa. Equivalent to Rv1739c, len: 560 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 560 aa overlap). Probable sulphate-transport transmembrane protein ABC transporter, similar to several e.g. P53392|G607186 high affinity sulphate transporter from Stylosanthes hamata (662 aa), FASTA scores: opt: 382, E(): 1.6e-16, (28.0% identity in 564 aa overlap); U59234.1|AAB88215.1 biotin carb. from Synechococcus sp. PCC 7942 (574 aa), FASTA scores: opt: 1838, E(): 0, (50.0% identity in 550 aa overlap); etc. Contains PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), AND SEEMS TO BELONG TO THE SULP FAMILY. Protein product from Mb1768c detected using SWATH mass spectrometry. Mb1768c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ50" /db_xref="InterPro:IPR001902" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR011547" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ50" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00371.1" /translation="MIPTMTSAGWAPGVVQFREYQRRWLRGDVLAGLTVAAYLIPQAM AYATVAGLPPAAGLWASIAPLAIYALLGSSRQLSIGPESATALMTAAVLAPMAAGDLR RYAVLAATLGLLVGLICLLAGTARLGFLASLLSRPVLVGYMAGIALVMISSQLGTITG TSVEGNEFFSEVHSFATSVTRVHWPTFVLAMSVLALLTMLTRWAPRAPGPIIAVLAAT MLVAVMSLDAKGIAIVGRIPSGLPTPGVPPVSVEDLRALIIPAAGIAIVTFTDGVLTA RAFAARRGQEVNANAELRAVGACNIAAGLTHGFPVSSSSSRTALADVVGGRTQLYSLI ALGLVVIVMVFASGLLAMFPIAALGALVVYAALRLIDLSEFRRLARFRRSELMLALAT TAAVLGLGVFYGVLAAVALSILELLRRVAHPHDSVLGFVPGIAGMHDIDDYPQAKRVP GLVVYRYDAPLCFANAEDFRRRALTVVDQDPGQVEWFVLNAESNVEVDLTALDALDQL RTELLRRGIVFAMARVKQDLRESLRAASLLDKIGEDHIFMTLPTAVQAFRRR" CDS 1957511..1957723 /codon_start=1 /transl_table=11 /gene="vapb34" /locus_tag="BQ2027_MB1769" /product="possible antitoxin vapb34" /note="Mb1769, -, len: 70 aa. Equivalent to Rv1740, len: 70 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 70 aa overlap). Conserved hypothetical protein, highly similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P96913|Rv0623|MTCY20H10.04 (84 aa), (73.5% identity in 68 aa overlap); P71998|Rv1740 (70 aa), and O07770|Rv0608 (81 aa). Mb1769 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011660" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ61" /protein_id="SIU00372.1" /translation="MELAARMGETLTQAVVVAVREQLARRTGRTRSISLREELAAIGR RCAALPVLDTRAADTILGYDERGLPA" CDS 1957723..1957971 /codon_start=1 /transl_table=11 /gene="vapc34" /locus_tag="BQ2027_MB1770" /product="possible toxin vapc34. contains pin domain." /note="Mb1770, -, len: 82 aa. Equivalent to Rv1741, len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 82 aa overlap). Conserved hypothetical protein, very similar in N-terminus to other M. tuberculosis hypothetical proteins e.g. P96914|Rv0624|MTCY20H10.05 (131 aa), (80.4% identity in 56 aa overlap); P71999|Rv1741 (82 aa) and O07769|Rv0609 (133 aa). Mb1770 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B6" /protein_id="SIU00373.1" /translation="MVIDTSALVAMLNDEPEAQRFEIAVAADHVWLMSTASYPEMATV IETRFGEPGGREPKVSGQPLLYTGDDFACIDIRAVLAG" CDS 1958052..1958717 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1771" /product="unknown protein" /note="Mb1771, -, len: 221 aa. Equivalent to the 3' end of Rv1742, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 220 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-c) leads to shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb1771 detected using SWATH mass spectrometry. Mb1771 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y006" /protein_id="SIU00374.1" /translation="MGRVRTGGLLLRTRVPGNRFADYRITVHVQQARTVLDPFPRDGY RGVFESGQVRIESHDGAVISSRAHPRAAFFGRSGLRRNIRWDPLDSVYFAGYAMWNYL TTPYLLTREGVAVEEGAPWQQEGETWRRLIVSFPPDIDTHSPRQTFYVDASGLLRRHD YVPEVVGHWARAAHYCADPVDVDGFVFPTCRWVHPIGPGNRSLPFPTLVSILLTDIRV ETD" CDS 1958811..1960511 /codon_start=1 /transl_table=11 /gene="pknE" /locus_tag="BQ2027_MB1772" /product="PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE E PKNE (PROTEIN KINASE E) (STPK E)" /note="Mb1772, pknE, len: 566 aa. Equivalent to Rv1743, len: 566 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 566 aa overlap). Probable pknE, transmembrane serine/threonine protein kinase (EC 2.7.1.-) (see citation below), similar to PKN1_MYXXA|P33973 serine/threonine-protein kinase pkn1 (693 aa), fasta scores: opt: 542, E(): 1.1e-19, (35.8% identity in 302 aa overlap). Also highly similar to K08G_MYCTU|Q11053 probable serine/threonine-protein kinase (626 aa) (59.8% identity in 381 aa overlap). Contains PS00107 Protein kinases ATP-binding region signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Protein product from Mb1772 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1772 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZN3" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR012336" /db_xref="InterPro:IPR017441" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:Q7TZN3" /protein_id="SIU00375.1" /translation="MDGTAESREGTQFGPYRLRRLVGRGGMGDVYEAEDTVRERIVAL KLMSETLSSDPDFRTRMQREARTAGRLQEPHVVPIHDFGEIDGQLYVDMRLINGVDLA AMLRRQGPLAPPRAVAIVRQIGSALDAAHAAGATHRDVKPENILVSADDFAYLVDFGI ASATTDEKLTQLGNTVGTLYYMAPERFSESHATYRADIYALTCVLYECLTGSPPYQGD QLSVMGAHINQAIPRPSTVRPGIPVAFDAVIARGMAKNPEDRYVTCGDLSAAAHAALA TADQDRATDILRRSQVAKLPVPSTHPVSPGTRWPQPTPWAGGAPPWGPPSSPLPRSAR QPWLWVGVAVAVVVALAGGLGIALAHPWRSSGPRTSAPPPPPPADAVELRVLNDGVFV GSSVAPTTIDIFNEPICPPCGSFIRSYASDIDTAVADKQLAVRYHLLNFLDDQSHSKN YSTRAVAASYCVAGQNDPKLYASFYSALFGSDFQPQENAASDRTDAELAHLAQTVGAE PTAISCIKSGADLGTAQTKATNASETLAGFNASGTPFVWDGSMVVNYQDPSWLARLIG " CDS complement(1960796..1961197) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1773C" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1773c, -, len: 133 aa. Equivalent to Rv1744c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 133 aa overlap). Probable membrane protein, contains four imperfect 10 aa repeats, some similarity to Q25946 (MSA-2) (FRAGMENT) from Plasmodium falciparum (205 aa), FASTA scores: opt: 145, E(): 0.048, (52.4% identity in 63 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ4" /protein_id="SIU00376.1" /translation="MVINRSIASIDSIAVAGSAATTGAVAVAGSVATAGSVAVAGSVA TAGSVAIAGAAATAGSVGIIGSLLTVLCVAVRQCVACLACITCTRCVACIGCVRCTDC VGCLWCVNCSGLRNVVGAQNLRVGNLGRVSN" CDS complement(1961268..1961798) /codon_start=1 /transl_table=11 /gene="idi" /locus_tag="BQ2027_MB1774C" /product="Probable isopentenyl-diphosphate delta-isomerase IDI (IPP isomerase) (Isopentenyl pyrophosphate isomerase)" /note="Mb1774c, idi, len: 176 aa. Equivalent to Rv1745c, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 176 aa overlap). Probable idi, isopentenyl-diphosphate delta-isomerase (EC 5.3.3.2), similar to Q46822|ORF_O182 from Escherichia coli (182 aa), FASTA scores: opt: 465, E(): 4.7e-25, (46.9% identity in 162 aa overlap), and to IPPI_SCHPO|Q10132 isopentenyl-diphosphate delta-isomerase from Schizosaccharomyces pombe (227 aa), FASTA scores: opt: 185, E(): 5.4e-06, (30.3% identity in 152 aa overlap). BELONGS TO THE IPP ISOMERASE TYPE 1 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, truncation at the 3' end due to a single base tranversion (g-t), leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (176 aa versus 203 aa)." /db_xref="GOA:Q7VEU0" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR011876" /db_xref="InterPro:IPR015797" /db_xref="UniProtKB/Swiss-Prot:Q7VEU0" /protein_id="SIU00377.1" /translation="MTRSYRPAPPIERVVLLNDRGDATGVADKATVHTGDTPLHLAFS SYVSDLHDQLLITRRAATKRTWPAVWTNSCCGHPLPGESLPGAIRRRLAAELGLTPDR VDLILPGFRYRAAMADGTVENEICPVYRVQVDQQPRPNSDEVDAIRWLSWEQFVRDVT AGVIAPVSPWCRSQLG" CDS 1961945..1963375 /codon_start=1 /transl_table=11 /gene="pknF" /locus_tag="BQ2027_MB1775" /product="ANCHORED-MEMBRANE SERINE/THREONINE-PROTEIN KINASE PKNF (PROTEIN KINASE F) (STPK F)" /note="Mb1775, pknF, len: 476 aa. Equivalent to Rv1746, len: 476 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 476 aa overlap). pknF, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), highly similar to KY28_MYCTU|Q10697 probable serine/threonine-protein kinase from Mycobacterium tuberculosis (589 aa), FASTA scores: opt: 870, E(): 0, (41.6% identity in 406 aa overlap). Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation. Start site chosen by homology, may extend further upstream. Protein product from Mb1775 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1775 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZN1" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/Swiss-Prot:Q7TZN1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00378.1" /translation="MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLR ADVSADGEYRARFNREADAAASLWHPHIVAVHDRGEFDGQLWIDMDFVDGTDTVSLLR DRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDSPDRRIMLADF GIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRADQYALAATAFHLLTGSPPF QHANPAVVISQHLSASPPAIGDRVPELTPLDPVFAKALAKQPKDRYQRCVDFARALGH RLGGAGDPDDTRVSQPVAVAAPAKRSLLRTAVIVPAVLAMLLVMAVAVTVREFQRADD ERAAQPARTRTTTSAGTTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALC FPLGSTGTTKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESP IRVCMQQTGQTRRECREEIRRSNGWP" CDS 1963437..1966034 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1776" /product="PROBABLE CONSERVED TRANSMEMBRANE ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb1776, -, len: 865 aa. Equivalent to Rv1747, len: 865 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 865 aa overlap). Probable conserved transmembrane ATP-binding protein ABC transporter (see citation below), similar to others e.g Q55956 ABC transporter from Synechocystis sp. (790 aa), FASTA scores: opt: 738, E(): 6.3e-26, (31.6% identity in 632 aa overlap); etc. Also similar to other Mycobacterium tuberculosis ABC-type transporters e.g. Rv2397c|MTCY253.24, FASTA score: (35.2% identity in 213 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1776 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1776 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ83" /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR013525" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ83" /protein_id="SIU00379.1" /translation="MPMSQPAAPPVLTVRYEGSERTFAAGHDVVVGRDLRADVRVAHP LISRAHLLLRFDQGRWVAIDNGSLNGLYLNNRRVPVVDIYDAQRVHIGNPDGPALDFE VGRHRGSAGRPPQTTSIRLPNLSAGAWPTDGPPQTGTLGSGQLQQLPPATTRIPAAPP SGPQPRYPTGGQQLWPPSGPQRAPQIYRPPTAAPPPAGARGGTEAGNLATSMMKILRP GRLTGELPPGAVRIGRANDNDIVIPEVLASRHHATLVPTPGGTEIRDNRSINGTFVNG ARVDAALLHDGDVVTIGNIDLVFADGTLARREENLLETRVGGLDVRGVTWTIDGDKTL LDGISLTARPGMLTAVIGPSGAGKSTLARLVAGYTHPTDGTVTFEGHNVHAEYASLRS RIGMVPQDDVVHGQLTVKHALMYAAELRLPPDTTKDDRTQVVARVLEELEMSKHIDTR VDKLSGGQRKRASVALELLTGPSLLILDEPTSGLDPALDRQVMTMLRQLADAGRVVLV VTHSLTYLDVCDQVLLLAPGGKTAFCGPPTQIGPVMGTTNWADIFSTVADDPDAAKAR YLARTGPTPPPPPVEQPAELGDPAHTSLFRQFSTIARRQLRLIVSDRGYFVFLALLPF IMGALSMSVPGDVGFGFPNPMGDAPNEPGQILVLLNVGAVFMGTALTIRDLIGERAIF RREQAVGLSTTAYLIAKVCVYTVLAVVQSAIVTVIVLVGKGGPTQGAVALSKPDLELF VDVAVTCVASAMLGLALSAIAKSNEQIMPLLVVAVMSQLVFSGGMIPVTGRVPLDQMS WVTPARWGFAASAATVDLIKLVPGPLTPKDSHWHHTASAWWFDMAMLVALSVIYVGFV RWKIRLKAC" CDS 1966407..1967138 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1777" /product="unknown protein" /note="Mb1777, -, len: 243 aa. Equivalent to Rv1748, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 243 aa overlap). Hypothetical unknown protein. Possibly exported protein, hydrophobic domain, TM helix aa 23-45. Protein product from Mb1777 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1777 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ76" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ76" /protein_id="SIU00380.1" /translation="MPGGVCSGRPWGRPWWHPGLVGLLIRLAELLVVMLPLIGVLYVG IKALSSFTRRLGEASGDLASDSPAMPRPTTVENDAARWRAITRAVEAHERTDARWLEY ELDAAKLLDFPVMTDMRDPLTTAFHKAKLQADFHKPLRAEDLLDDPDAAGHYLDAVRD YVTAFDTAEAEAMRRRRTGFSREEQQRLARAQSLLRVASDAGATAQERERAYRLARTE LDGLIVLPDRTRAGIERGIAGELDD" CDS complement(1967135..1967692) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1778C" /product="POSSIBLE INTEGRAL MEMBRANE PROTEIN" /note="Mb1778c, -, len: 185 aa. Equivalent to Rv1749c, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 185 aa overlap). Possible integral membrane protein, similar to O27914|AE000940 hypothetical protein MTH1892 from Methanobacterium thermoautotrophicum (168 aa), fasta scores: E(): 9.3e-16, (37.4% identity in 123 aa overlap). Protein product from Mb1778c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1778c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ62" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ62" /protein_id="SIU00381.1" /translation="MLRAVNEIRQHDGTLKLGKGVGMFTIVGVIVALIGAFVQSRRHR HRPAADIHMLWWMVLIVGVVSIIGAGYHVFDGERTAELIGYTRGDGGFQWENAMGDLA IGVVGLMAYRFRGHFWLATIVVLTIQYVGDAAGHIYYWVVENNTNPYNIGVPLWTDIL LPIVMWALYAWSWHSNGDAVPKGQP" CDS complement(1967776..1969374) /codon_start=1 /transl_table=11 /gene="fadD1" /locus_tag="BQ2027_MB1779C" /product="POSSIBLE FATTY-ACID-COA LIGASE FADD1 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb1779c, fadD1, len: 532 aa. Equivalent to Rv1750c, len: 532 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 532 aa overlap). Possible fadD1, fatty-acid-CoA synthetase (EC 6.2.1.-), similar in part to others e.g. O35488|VLCS_MOUSE VERY-LONG-CHAIN ACYL-COA SYNTHETASE from Mus musculus (620 aa); NP_113924.1|NM_031736 solute carrier family 27 (fatty acid transporter) member 2 from Rattus norvegicus (620 aa); NP_459076.1|NC_003197 crotonobetaine/carnitine-CoA ligase from Salmonella typhimurium (517 aa); CAIC_ECOLI|P31552 probable crotonobetaine/carnitine-coa ligase from Escherichia coli (522 aa), FASTA scores: opt: 448, E(): 1.9e-21, (25.1% identity in 502 aa overlap); etc. Also highly similar to fadD17|Rv3506|MTV023.13 PROBABLE FATTY-ACID-COA LIGASE from Mycobacterium tuberculosis (502 aa); and similar to others from Mycobacterium tuberculosis e.g. fadD6|MTCI364.18|Rv1206|O05307 PROBABLE FATTY-ACID-COA LIGASE (597 aa), FASTA score: (28.3% identity in 519 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb1779c detected using SWATH mass spectrometry. Mb1779c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ72" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR030310" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ72" /protein_id="SIU00382.1" /translation="MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALI TIADPQRPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADCQI VVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTMDPFMMIFTSG TSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPLFHSNAVVAGWAPAVVSGA AIAPATFSATGFLDDVRRYHATYMNYVGKPLAYILATPERDDDADNPLRVAFGNEAND KDIEEFSRRFGVQVEDGFGSTENAVIVIREPGTPPGSIGRGAHGVAVYNGETVTECAV ARFDAHGALTNADEAIGELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSE GWIYLAGRTADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLIDEGTAVGK ADTLWVREPRGSAYHHASGPAKAI" CDS 1969428..1970810 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1780" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1780, -, len: 460 aa. Equivalent to Rv1751, len: 460 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 460 aa overlap). Probable oxidoreductase (EC 1.-.-.-), possibly a monooxygenase or hydroxylase, similar to MHPA_ECOLI|P77397 3-(3-hydroxy-phenyl) propionate hydroxylase (554 aa), FASTA scores: opt: 239, E(): 2e-08, (24.6% identity in 435 aa overlap); and AJ007932|SAR7932.13 oxygenase from Streptomyces argillaceus (436 aa), FASTA scores: opt: 587, E(): 8.6e-30, (32.3% identity in 359 aa overlap). Contains PS00075 Dihydrofolate reductase signature. Also similar to Mycobacterium tuberculosis hypothetical oxidoreductases Rv1260 and Rv0575c. Protein product from Mb1780 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1780 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1C5" /db_xref="InterPro:IPR002938" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C5" /protein_id="SIU00383.1" /translation="MIATMPSMARRSRHDNKITTPAVDCLTIERLDSPASGAPQVTPY ARALMGETTTCAIIGGGPAGMVLGLLLARAGVQVTLLEKHGDFLRDFRGDTVHPTTMR LLDELGLWERFAALPYSEVRTATLHSNGRAVTYIDFERLHQPYPYVAMVPQWDLLNLL AEAAQAEPSFTLRMKTEVTGLLREGGKVTGVRYQGAEGPGELRAELTVACDGRWSIAR HEAGLKAREFPVNFDVWWFKLPREGDAEFSFLPRFSPGKGLGVIPREGYFQIAYLGPK GTDAQLRERGIEEFRRDVSELLPEATASVAALASTDEVKHLNVKVNRLRRWHIDGLLC IGDAAHAMSPVAGVGINLAVQDAVAAATILAEPLREHRVSSRHLAAVRRRRAFPTAVT QAVQRVLHRRLLGPLLQGRDPTPPAALLGLVERLPWLSAVPAYFVGVGVRPEHAPAFA RRGPGNRKGP" CDS 1970937..1971386 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1781" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1781, -, len: 149 aa. Equivalent to Rv1752, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 149 aa overlap). Conserved hypothetical protein, similar to C-terminal half of Q9TV68|AB021930|CAN2DD Dihydrodiol dehydrogenase (EC 1.3.1.20) from Canis familiaris (335 aa), FASTA score, opt: 168, E(): 0.00015, (31.3% identity in 112 aa overlap). Mb1781 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y027" /protein_id="SIU00384.1" /translation="MDAGCYAVHMAHTFGGATPEVVSAQAKLRDPAVDRAMTAELKFP GGHTGGIRCSMRSSDLLNVSARVVGDRGELRVLNPVVPQLFHRLPPLACVSARRFRCR SAARASGQDDAQGRGREHERDPRDLSGRRAPIAQPELNMVAASGSAA" CDS complement(1971421..1974576) /codon_start=1 /transl_table=11 /gene="PPE24" /locus_tag="BQ2027_MB1782C" /product="ppe family protein ppe24" /note="Mb1782c, PPE24, len: 1051 aa. Similar to Rv1753c, len: 1053 aa, from Mycobacterium tuberculosis strain H37Rv, (90.7% identity in 1103 aa overlap). Member of the Mycobacterium tuberculosis PPE family of Gly-, Asn-rich proteins, similar to many e.g. YF48_MYCTU|Q10778 hypothetical protein cy48.17 (678 aa), FASTA scores: opt: 1360, E(): 0, (48.9% identity in 550 aa overlap). Note that the Gly-, Asn-rich sequence is interrupted by six near-perfect 26 aa repeats, a unique region, and another, more degenerate region of five 25 aa repeats before resuming at the C-terminus. The end of the first Gly-, Asn-rich region and the start of the first set of repeats shows some similarity to Q50577|AT10S from Mycobacterium tuberculosis (170 aa) (40.2% identity in 189 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 150 bp insertion and a 156 bp deletion, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1051 aa versus 1053 aa)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ91" /protein_id="SIU00385.1" /translation="MNFSVLPPEINSALIFAGAGPEPMAAAATAWDGLAMELASAAAS FGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAEQTAAQAAAMIAEFEAVKT AVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASA IASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGN VGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLGN LNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFF NSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMG DFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRGVGQ GSLQFSITTPDLTLPPLQIPGISVPAFSLPAITLPSLTIPAATTPANITVGAFSLPGL TLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLN IPAATTPANITVSGFQLPPLSIPSVAIPPVTVPPITVGAFNLPPLQIPEVTIPQLTIP AGITIGGFSLPAIHTQPITVGQIGVGQFGLPSIGWDVFLSTPRITVPAFGIPFTLQFQ TNVPALQPPGGGLSTFTNGALIFGEFDLPQLVVHPYTLTGPIVIGSFFLPAFNIPGID VPAINVDGFTLPQITTPAITTPEFAIPPIGVGGFTLPQITTQEIITPELTINSIGVGG FTLPQITTPPITTPPLTIDPINLTGFTLPQITTPPITTPPLTIDPINLTGFTLPQITT PPITTPPLTIDPINLTGFTLPQITTPPITTPPLTIDPINLTGFTLPQITTPPITTPPL TIEPIGVGGFTTPPLTVPGIHLPSTTIGAFAIPGGPGYFNSSTAPSSGFFNSGAGGNS GFGNNGSGLSGWFNTNPAGLLGGSGYQNFGGLSSGFSNLGSGVSGFANRGILPFSVAS VVSGFANIGTNLAGFFQGTTS" CDS complement(1974780..1976471) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1783C" /product="conserved protein" /note="Mb1783c, -, len: 563 aa. Equivalent to Rv1754c, len: 563 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 563 aa overlap). Conserved hypothetical protein, has proline-rich central region. Some similarity in central region to other Mycobacterium tuberculosis proline-rich proteins e.g. O06555|Rv1157c|MTCI65.24c (371 aa), (32.5% identity in 191 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb1783c detected using SWATH mass spectrometry. Mb1783c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZK5" /db_xref="InterPro:IPR025442" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK5" /protein_id="SIU00386.1" /translation="MYRYQVRVQQRRSEMNRWVATRSRRHTYQWITDHKSPRDHYRHI SELRTSIATSSPGRCDMSPIPRIVSVSLAWAAAIGLMVPIGLAPPAMAAPCSGDAANA PPPPSAIVTDPGATALGPVRPGHGPIPTGRKPRGANDRAPLPKLGPLISALLNPGARN AAPLQQQALVPRANPGPNPAPNPPATGPQPPNATQLTPNPAPAPDPAPAAAPDPGATL AGATTSLAEWVTGPDSPNKTLERFGISGTDLGIPWDNGDPANRQVLMIFGDTFGYCAV DGHQWRYNTLFRSQDRDLGNGVHVTSGDASNRYSGSPVRQPGFSKQLINSIKWARDET GIIPTAGIAVGKTQYVNFMSIRNWGRDGEWTTNYSGIAVSKDNGQTWGVFPGTIRASG PDSGGKARFVPGNENFQMGAYLKSNDGYLYSFGTPPGRGGSAYLARVPQRFVPDLTKY QYWNGDSNSWVPNKPDAATPVIPGPVGEMSVQYNTYLKQYLALYTNGMNDVVARTAPA PQGPWSAEQMLVSSWQMPGGIYAPMMHPWSTGKDVYFNLSLWSAYNVMLMHTVLP" CDS complement(1976655..1978199) /codon_start=1 /transl_table=11 /gene="plcD" /locus_tag="BQ2027_MB1784C" /product="PROBABLE PHOSPHOLIPASE C 4 PLCD" /note="Mb1784c, plcD, len: 514 aa. Equivalent to MT1799, len: 514 aa, from Mycobacterium tuberculosis strain CDC1551, (100% identity in 514 aa overlap). Probable plcD, phospholipase C 4 (EC 3.1.4.3) (see citation below), highly similar to other phospholipases e.g. Q04001|PHLA_MYCTU|Rv2351c|PLCA|MTP40|MT2416|MTCY98.20c PROBABLE MEMBRANE-ASSOCIATED PHOSPHOLIPASE C 1 PLCA from Mycobacterium tuberculosis (512 aa), FASTA score: opt: 2657, E(): 4.9e-156, (71.1% identity in 512 aa overlap); P95246|PHLB_MYCTU|PLCB|MPCB|Rv2350c|MT2415|MTCY98.19c PROBABLE MEMBRANE-ASSOCIATED PHOSPHOLIPASE C 2 PLCB from Mycobacterium tuberculosis (512 aa), FASTA score: opt: 2638, E(): 7.3e-155, (70.9% identity in 512 aa overlap); etc. BELONGS TO THE BACTERIAL PHOSPHOLIPASE C FAMILY. REMARK-M.bovis-M.tuberculosis: No equivalent in Mycobacterium tuberculosis strain H37Rv. Belong to RvD2 region." /db_xref="GOA:P0A5R9" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR007312" /db_xref="InterPro:IPR017850" /db_xref="UniProtKB/Swiss-Prot:P0A5R9" /protein_id="SIU00387.1" /translation="MSQSHIGGVSRREFLAKVAAGGAGALMSFAGPVIEKAYGAGPCS GHLTDIEHFVFFMQENRSFDHYFGTLSGTDGFNTVSPLFQQKGWNPMTQALDATGVTM PYRFDTTRGPFLDGACVNDPDHSWVAMHESWNGGVNDNWLPAQAKTRSAAHTPTVMGY YTRQDIPIHYLLADAFTVCDRYFCSVLGPTLPNRLYWLSATIDPDGQNGGPELQSPTF QPVRRFGWRIMPQNLSDAGVSWKVYRNKTLGPISSVLTYGSLVTSFKQSADPRSDLVR FGVAPSYPASFAADVLANRLPRVSWVIPNVLESEHPAVPAAAGAFAIVNILRILLANP AVWEKTALIVSYDENGGFFDHVVPATAPAGTPGEYVTVPDIDQVPGSGGIRGPIGLGF RVPCFVISPYSRGPQMVHDTFDHTSQLRLLETRFGVPVPNLTAWRRSVTGDMTSTFNF AVPPNSSWPNLDYPGLHALSTVPQCVPNAALGTINRGIPYRVPDPQIMPTQETTPTRG IPSGPC" CDS complement(1978328..1979512) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1785C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1785c, -, len: 394 aa. Similar to MT1800 (also known as RVD2-ORF1), len: 381 aa, from Mycobacterium tuberculosis strain CDC1551, (99.71% identity in 345 aa overlap). Conserved hypothetical protein (see citation below), showing similarity with glycosyl transferases, sulfolipid sulfoquinovosyldiacylglycerol synthases, and hypothetical proteins, e.g. Q9R6U1|SQDX SQDX PROTEIN (required for biosynthesis of the sulfolipid sulfoquinovosyldiacylglycerol) from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (377 aa), FASTA scores: opt: 348, E(): 6.7e-15, (30.40% identity in 296 aa overlap); Q8YUR9|SQDX|ALR2265 SULFOLIPID SULFOQUINOVOSYLDIACYLGLYCEROL BIOSYNTHESIS PROTEIN from Anabaena sp. strain PCC 7120 (378 aa), FASTA scores: opt: 336, E(): 4e-14, (27.832% identity in 309 aa overlap); Q9AA50|CC0756 GLYCOSYL TRANSFERASE (GROUP 1 FAMILY PROTEIN) from Caulobacter crescentus (455 aa), FASTA scores: opt: 333, E(): 7.1e-14, (29.5% identity in 315 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: No equivalent in Mycobacterium tuberculosis strain H37Rv. Belongs to the RvD2 region." /db_xref="InterPro:IPR001296" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ97" /protein_id="SIU00388.1" /translation="MSLTAHDGRFCSRYACCSGRQLLWTSLGRTPHRGGPAGRGILRQ RTRGVLIVPGARTERHLLRTGVVRITLPAKHIPYTGGYRAVMPGAVRTVLETLRPDTL EVSDRLTLRSLGRWGREHGVTTVMISHERLDRFAGQLLPRRAAQKFADFANARTAANY DTVVCTTGFAREEFDRIGATNTVTVPLGVDLKTFHPRRRCARVRQHWATPTQILLVHC GRLSVEKHADRSIDALAALCDAGVDARLVIAGEGPLRARLERKATGLPIDFTGFISDR HAVAGLLASADVALAPGPHETFGLAALESLACGTPAVVSRTSALTEIITADSGACADN RPEAIAHAVRTIVSRPERHRRRCARRRAEIFTWQRAAASMLATLGAMAVSTRCGDTQD TA" CDS 1979662..1980801 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1786" /product="POSSIBLE SULFITE OXIDASE" /note="Mb1786, -, len: 379 aa. Equivalent to MT1801, len: 379 aa, from Mycobacterium tuberculosis strain CDC1551, (99.75% identity in 379 aa overlap). See citations below. Possible sulfite oxidase (EC 1.8.3.1), showing similarity with others e.g. P51687|SUOX_HUMAN Sulfite oxidase, mitochondrial precursor from Homo sapiens (488 aa), FASTA scores: opt: 952, E(): 6.1e-52, (43.7% identity in 357 aa overlap); Q07116|SUOX_RAT Sulfite oxidase, mitochondrial precursor from Rattus norvegicus (488 aa), FASTA scores: opt: 952, E(): 6.1e-52, (43.8% identity in 356 aa overlap); Q9VWP4|CG7280 PROBABLE SULFITE OXIDASE PRECURSOR from Drosophila melanogaster (Fruit fly) (573 aa), FASTA scores: opt: 764, E(): 4e-40, (38.4% identity in 362 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: No equivalent in Mycobacterium tuberculosis strain H37Rv. Belong to RvD2 region." /db_xref="GOA:A0A1R3XZ93" /db_xref="InterPro:IPR000572" /db_xref="InterPro:IPR005066" /db_xref="InterPro:IPR008335" /db_xref="InterPro:IPR014756" /db_xref="InterPro:IPR022407" /db_xref="InterPro:IPR036374" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ93" /protein_id="SIU00389.1" /translation="MTDRGENRTTIAMLETAGLWGKRADMIVRGCLPYNAEPPPAVLA GSDITPINAFYVRNHGPVPDIAPQHWRLTVGGLVDNPLTVTYERLTTEFDQHCVVATL ACAGNRRAELLRVRQIPGKEPWAHGAISTAQWCGVRLADILQAADVHIDEGLHVAFDA PDVAEEARPIQPYGSSIPLSKALSPEVLLAWQMNSEPLPRAHGGPVRVVVPGFIGARS VKWVTAITVQPGASQNYFQALDYRILPADADADIVGPGEGISLSSLALNCDILDPTDG DDVPAGALTIRGYGMAGDGRSVERVDVSVDDGLTWQQADLHAAPSQWSWRPWSLTVDV EPGPLGITARAWDDTGALQPESAVSLWNPRGYGNNAWARVALRVS" CDS 1981002..1983839 /codon_start=1 /transl_table=11 /gene="mmpL14" /locus_tag="BQ2027_MB1787" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL14" /note="Mb1787, mmpL14, len: 945 aa. Equivalent to MT1802 (also known as RVD2-ORF3), len: 945 aa, from Mycobacterium tuberculosis strain CDC1551, (99.7% identity in 945 aa overlap). Probable mmpL14, conserved transmembrane transport protein (see citation below). Member of RND superfamily, similar to several putative transport proteins e.g. O53784|MML5_MYCTU|MMPL5|Rv0676c|MT0705|MTV04 0.04c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (964 aa), FASTA scores: opt: 1171, E(): 2.5e-61, (32.10% identity in 947 aa overlap); P95211|MML1_MYCTU|MMPL1|Rv0402c|MT0412|MT CY04D9.15c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (958 aa), FASTA scores: opt: 1170, E(): 2.8e-61, (33.1% identity in 940 aa overlap); etc. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: No equivalent in Mycobacterium tuberculosis strain H37Rv. Belong to RvD2 region. Mb1787 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZ87" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ87" /protein_id="SIU00390.1" /translation="MTAIGRLIHRYAIWIVGVWALAAIIGNNFAPPLEQVITAEDQPF SPAGTATSRAVERSAAAFSQAPGDNIGYLVLERNGVLNDQDRAYYDALVVALRRDSRH VIEVVDWWGTPAIAEVARSDDHHAVTAALRFGGMVGTSQAGESITAARSIVTQLHPPD GLHVFVTGPGATIVDEFAAIDRQTQLITATTIVVLLILLLIVYRSAITATVPLLSVVV SLAVAKPIVSVLVDRDFIGISLFSLGLSVAVVVGAGTGFAMFLIGRYHERRRQHIAPA AALADAYRGVAPAIAGATFIVVTSLGAVGWLSLARIGMFATTGILCSIGVLAVGLAAL TLTPALVALASRANLLKPPQHKRIQRQFRRLGTHVARWPAPILVASGVFVLIMMIALP RVPIGWDEAAATPSAAESNRGYRAADRHFAPNQLLPTQVMIETDHDIRNPAGLTAIER ITAAIMAIGGVRMVQSASHPNGMVSKQAALTASAGNLGDQLDEFSDQLTSRQATFTNL EAAVRDVVSALDLVQAGIRQDGYGLGQVSLAVRLMQQAITKLQGSAGDVFDIFDPLRR FVAAIPECRANPVCSVAQEVVQWANTVTESCAKLADAAGQLARGIADVASATSGVSGL PNALDGIGGQLAQVRESAAGVQELLNNVGAAPLRELPDYLRELAAVSQSAPGVDLYAA RRILTDPNMRAVLDYFVSPNGHATRLLVYGDGSEWGDDGAQRARAIVTAVAEETDEGT LRPTAVELTGVGPATRDLQDLVGSDLTLLAVITLAVIFAIAALLLRSPLAGLVVVGTI ATSYICALGASVVIWKHILGDNLHWSVLPIAFVLLISVGSAYNLLFALRIREESPAGP RTSVIRAFAATGMVVTAAGIVFGTTMFALAASTSLSVAQIGVTVGMGLLLDALVIRGF VLPALMVLLGRWLWWPRRSVSNRQVPEPSPA" CDS 1984160..1984816 /codon_start=1 /transl_table=11 /gene="cut1" /locus_tag="BQ2027_MB1788" /product="PROBABLE CUTINASE CUT1" /note="Mb1788, -, len: 218 aa. Similar to Rv1758, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (98.25% identity in 172 aa overlap). Probable cut1, serine esterase, cutinase family (EC 3.1.1.-), similar to Rv2301|CUT2_MYCTU|Q50664 probable cutinase cy339.08c precursor from Mycobacterium tuberculosis (219 aa), FASTA scores: opt: 369, E(): 1. 1e-16, (39.1% identity in 179 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical cutinases Rv3452, Rv1984c, Rv3451 and Rv3724. CDS has been interrupted by IS6110 insertion element and 5'-end deleted. BELONGS TO THE CUTINASE FAMILY. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD2 region. In Mycobacterium tuberculosis strain H37Rv, Rv1758 is interrupted by IS6110 insertion element and the 5'-end is deleted." /db_xref="GOA:A0A1R3XZ70" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ70" /protein_id="SIU00391.1" /translation="MVTTWALLFAPVPAASADPPDPTVSDGACPDVEVVFARGTGEPP GVGGIGEDFIDALRSKIGEKSMGVYGVDYPATTDFPTAMAGIYDAGTHVEQTAANCPQ SKLVLGGFSQGAAVMGFVTAAAIPDGAPLDAPRPMPPEVADHVAAVTLFGMPSVAFMH SIGAPPIVIGPLYAEKTIQLCAPGDPVCSSGGNWAAHNGYADDGMVEQAAVFAAGRLG " CDS complement(1985083..1987545) /codon_start=1 /transl_table=11 /gene="wag22b" /locus_tag="BQ2027_MB1789C" /product="PE-PGRS FAMILY PROTEIN WAG22B [SECOND PART]" /note="Mb1789c, wag22b, len: 820 aa. Equivalent to 3' end of Rv1759c, len: 914 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 820 aa overlap). wag22, antigen member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. MT1367|Q10637 hypothetical glycine-rich 49.6 kd protein from Mycobacterium tuberculosis (603 aa), FASTA scores: opt: 2010, E(): 0, (53.0% identity in 724 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1759c exists as a single gene. In Mycobacterium bovis, a single base deletion (c-*) splits wag22 into 2 parts, wag22a and wag22b." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A687" /protein_id="SIU00392.1" /translation="MTPLLNSINAPVLAATGRPLIGNGANGAPGTGANGGDAGWLIGN GGAGGSGAKGANGGAGGPGGAAGLFGNGGAGGAGGTATANNGIGGAGGAGGSAMLFGA GGAGGAGGAATSLVGGIGGTGGTGGNAGMLAGAAGAGGAGGFSFSTAGGAGGAGGAGG LFTTGGVGGAGGQGHTGGAGGAGGAGGLFGAGGMGGAGGFGDHGTLGTGGAGGDGGGG GLFGAGGDGGAGGSGLTTGGAAGNGGNAGTLSLGAAGGAGGTGGAGGTVFGGGKGGAG GAGGNAGMLFGSGGGGGTGGFGFAAGGQGGVGGSAGMLSGSGGSGGAGGSGGPAGTAA GGAGGAGGAPGLIGNGGNGGNGGESGGTGGVGGAGGNAVLIGNGGEGGIGALAGKSGF GGFGGLLLGADGYNAPESTSPWHNLQQDILSFINEPTEALTGRPLIGNGDSGTPGTGD DGGAGGWLFGNGGNGGAGAAGTNGSAGGAGGAGGILFGTGGAGGAGGVGTAGAGGAGG AGGSAFLIGSGGTGGVGGAATTTGGVGGAGGNAGLLIGAAGLGGCGGGAFTAGVTTGG AGGTGGAAGLFANGGAGGAGGTGSTAGGAGGAGGAGGLYAHGGTGGPGGNGGSTGAGG TGGAGGPGGLYGAGGSGGAGGHGGMAGGGGGVGGNAGSLTLNASGGAGGSGGSSLSGK AGAGGAGGSAGLFYGSGGAGGNGGYSLNGTGGDGGTGGAGQITGLRSGFGGAGGAGGA SDTGAGGNGGAGGKAGLYGNGGDGGAGGDGATSGKGGAGGNAVVIGNGGNGGNAGKAG GTAGAGGAGGLVLGRDGQHGLT" CDS complement(1987542..1987826) /codon_start=1 /transl_table=11 /gene="wag22a" /locus_tag="BQ2027_MB1790C" /product="PE-PGRS FAMILY PROTEIN WAG22A [FIRST PART]" /note="Mb1790c, wag22a, len: 94 aa. Equivalent to 5' end of Rv1759c, len: 914 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 84 aa overlap). wag22, antigen member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. MT1367|Q10637 hypothetical glycine-rich 49.6 kd protein from Mycobacterium tuberculosis (603 aa), FASTA scores: opt: 2010, E(): 0, (53.0% identity in 724 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1759c exists as a single gene. In Mycobacterium bovis, a single base deletion (c-*) splits wag22 into 2 parts, wag22a and wag22b. Mb1790c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A687" /protein_id="SIU00393.1" /translation="MSFVIAVPETIAAAATDLADLGSTIAGANAAAAANTTSLLAAGA DEISAAIAALFGAHGRAYQAASAEAAAFHGRFVQALTTGGAPMRPPRPPP" CDS 1988402..1989931 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1791" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb1791, -, len: 509 aa. Equivalent to the 5' end of Rv1760, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 442 aa overlap). Conserved hypothetical protein, similar to several other Mycobacterium tuberculosis hypothetical proteins e.g. Q10554|Y895_MYCTU|MTCY31.23 (505 aa), FASTA scores: opt: 692, E(): 0, (31.7% identity in 477 aa overlap). Member of family with at least 15 other members e.g. Rv3740c, Rv3734c, Rv1425, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb1791 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y029" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/TrEMBL:A0A1R3Y029" /protein_id="SIU00394.1" /translation="MPRGCAGARFACNACLNFLAGLGISEPISPGWAAMERLSGLDAF FLYMETPSQPLNVCCVLELDTSTMPGGYTYGRFHAALEKYVKAAPEFRMKLADTELNL DHPVWVDDDNFQIRHHLRRVAMPAPGGRRELAEICGYIAGLPLDRDRPLWEMWVIEGG ARSDTVAVMLKVHHAVVDGVAGANLLSHLCSLQPDAPAPQPVRGTGGGNVLQIAASGL VGFASRPVRLATVVPATVLTLVRTLLRAREGRTMAAPFSAPPTPFNGPLGRLRNIAYT QLDMRDVKRVKDRFGVTINDVVVALCAGALRRFLLEHGVLPEAPLVATVPVSVHDKSD RPGRNQATWMFCRVPSQISDPAQRIRTIAAGNTVAKDHAAAIGPTLLHDWIQFGGSTM FGAAMRILPHISITHSPAYNLILSNVPGPQAQLYFLGCRMDSMFPLGPSLATRASTSP SCPSTGNWVSALSPAPTCCRTCGAWQTGFPRRSKSCWSAVMTSRKAATTRTPESYVQN R" CDS complement(1989919..1990302) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1792C" /product="possible exported protein" /note="Mb1792c, -, len: 127 aa. Equivalent to Rv1761c, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 127 aa overlap). Possibly exported protein with hydrophobic stretch or TMhelix at aa 15-37. Protein product from Mb1792c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1792c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZA7" /db_xref="InterPro:IPR031816" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA7" /protein_id="SIU00395.1" /translation="MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRG FFRSNPERIQIGDWRYEVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVS RYGATVIPNINAAIEVLGTGTDYRF" CDS complement(1990302..1991090) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1793C" /product="unknown protein" /note="Mb1793c, -, len: 262 aa. Equivalent to Rv1762c, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 262 aa overlap). Hypothetical unknown protein. Protein product from Mb1793c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1793c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002765" /db_xref="InterPro:IPR035439" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL4" /protein_id="SIU00396.1" /translation="MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELV MGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNW RHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAF VMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVV GVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMDS" CDS complement(1992108..1993364) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1794C" /product="HNH endonuclease domain protein" /note="Mb1794c, -, len: 418 aa. Equivalent to Rv1765c, len: 365 aa, from Mycobacterium tuberculosis strain H37Rv, (97.8% identity in 364 aa overlap). Conserved hypothetical protein, highly similar to O53461|Rv2015c|MTV018.02c CONSERVED HYPOTHETICAL PROTEIN (418 aa), (97.8% identity in 364 aa overlap). BLAST hits with non-IS part of sequence submitted under MTU78639. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 121 bp to 848 bp substitution affects the COOH part of Mb1794c leading to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (418 aa versus 365 aa). Mb1794c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ95" /db_xref="InterPro:IPR002711" /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ95" /protein_id="SIU00397.1" /translation="MSSTATSGAAVVSPAERVEVLFEELAELAGQRNAIDGRIVEIVA ELDRDGLWGVTGARSVAGLVAWKMGCSSGNAHTIATVARRLPEFPRCARGMREGRLSL DQVGVIAGRAGEGSDAHYAQLAGVATVNQLRTALKLEPRPEPEPDFRPEPRPSITRSA DEQFSCWRIKLPHVEAAKFDAALQSHLDALIAEYKRDHDNSDGVSDQRPPLPGNVEAF LRLVEAGWDAEVARRPHGQHTTVVMHLDVQERAAGLHLGPLLSESERRYLLCDATFEA WFERDGQVIGCGRTTRQINRRLRRALEHRDRTCVVPGCGATRGLHAHHIRHWQDGGAT ELANLVLVCPYHHRAHHRGLITITGPADNLTVADSAGRPLSAGSLARASTKPPPAVAP WPGPTGERADWWWYEPFQPQPPPISN" mobile_element complement(1993433..1994662) /mobile_element_type="insertion sequence:ISB9" /locus_tag="BQ2027_ISB9'" /note="ISB9', len: 1230 nt. Equivalent to ISB9', len: 1230 nt, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1230 nt overlap). Sequence ISB9, nearly identical to EM_BA:MTU78639. Note that this sequence shows several differences to EM_BA: MTU78639, and the transposase ORFs are extensively frameshi fted. Our sequence has been checked and is thought to be correct; the sequence in EM_BA:MTU78639 is from a different isolate of Mycobacterium tuberculosis." repeat_region 1993433..1993446 /rpt_type=INVERTED /note="14 bp imperfect inverted repeat, IRR,ATCACCCCGCAAAG, flanking IS element ISB9'." gene complement(1993433..1994662) /locus_tag="BQ2027_ISB9'" CDS complement(1993991..1994206) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1795C" /product="PUTATIVE TRANSPOSASE (FRAGMENT)" /note="Mb1795c, -, len: 71 aa. Equivalent to Rv1765A, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 71 aa overlap). Putative transposase (fragment), similar to part of many transposase genes including IS6110 e.g. P19774|TRA9_MYCTU PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (278 aa), FASTA scores: opt: 231, E(): 4.7e-11, (45.35% identity in 75 aa overlap). Mb1795c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012337" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA2" /protein_id="SIU00398.1" /translation="MWVADITFVRTWQGFCYTAFVTDVCTRKIVVRAVSATMRTEDLP VQVFNHAVWQSNSDLSELVHHSDPGSQ" CDS 1994586..1994855 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1795A" /product="Metal-sensitive transcriptional repressor" /note="Mb1795A, -, len: 89 aa. Equivalent to Rv1766, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Conserved hypothetical protein, highly similar to P54431|YRKD_BACSU Hypothetical 7.0 kDa protein in bltr-spoIIIC intergenic region from Bacillus subtilis (63 aa), FASTA scores: opt: 151, E(): 1.5e-05, (53.3% identity in 45 aa overlap). Also similar to Q9RD62|SCF56.04C|AL133424 Hypothetical protein from Streptomyces coelicolor (92 aa), FASTA scores: opt: 239, E(): 1.3e-11, (62.5% identity in 64 aa overlap). Also some similarity to other Mycobacterium tuberculosis hypothetical proteins e.g. O07434|Rv0190|MTCI28.29 (96 aa), (35.5% identity in 62 aa overlap); P71543|Rv0967 (119 aa), and P71600|Rv0030 (109 aa). Start changed since original submission. Protein product from Mb1795A detected using SWATH mass spectrometry. Mb1795A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZA6" /db_xref="InterPro:IPR003735" /db_xref="InterPro:IPR038390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA6" /protein_id="SIU00399.1" /translation="MIGDQDSIAAVLNRLRRAQGQLAGVISMIEQGRDCRDVVTQLAA VSRALDRAGFKIVAAGLKECVSGATASGAAPLSAAELEKLFLALA" repeat_region complement(1994649..1994662) /rpt_type=INVERTED /note="14 bp imperfect inverted repeat, IRL,ATCACCCCGGCAAG, flanking IS element ISB9'." CDS 1994923..1995282 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1796" /product="Possible carboxymuconolactone decarboxylase family protein (EC" /EC_number="4.1.1.44" /note="Mb1796, -, len: 119 aa. Equivalent to Rv1767, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Conserved hypothetical protein, similar to Q57498|YA53_HAEIN HYPOTHETICAL PROTEIN HI1053 from Haemophilus influenzae (113 aa), FASTA scores: opt: 233, E(): 6.4e-10, (40.0% identity in 90 aa overlap). Protein product from Mb1796 detected using SWATH mass spectrometry. Mb1796 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZ98" /db_xref="InterPro:IPR003779" /db_xref="InterPro:IPR004675" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ98" /protein_id="SIU00400.1" /translation="MSDQPRHHQVLDDLLPQHRALRHQIPQVYQRFVALGDAALTDGA LSRKVKELVALAIAVVQGCDGCVASHAQAAVRAGATAQEAAEAIGVTILMHGGPATIH GARAYAAFCEFADTTPS" CDS 1995463..1997337 /codon_start=1 /transl_table=11 /gene="PE_PGRS31" /locus_tag="BQ2027_MB1797" /product="pe-pgrs family protein pe_pgrs31" /note="Mb1797, PE_PGRS31, len: 624 aa. Equivalent to Rv1768, len: 618 aa, from Mycobacterium tuberculosis strain H37Rv, (98.88% identity in 624 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to Q50615 HYPOTHETICAL 40.8 KD PROTEIN (498 aa), FASTA scores: opt: 1703, E(): 0, (57.4% identity in 566 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 18 bp insertion, leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (624 aa versus 618 aa). Mb1797 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ75" /protein_id="SIU00401.1" /translation="MSYLVVVPELVAAAATDLANIGSSISAANAAAAAPTTALVAAGG DEVSAAIAALFGAHARAYQALSAQAAMFHEQFVRALAAGGNSYAVAEAATAQSVQQDL LNLINAPTQALLGRPLIGNGANGLPGTGQNGGDGGILYGNGGNGGSGGVNQAGGNGGN AGLWGNGGSGGAGGNATTAGRNGFNGGAGGSGGLLWGNGGAGGAGGHGGPAPLVGGVG TTGGAGGNGGGAGLFYGFGGAGGNGGMGGVAPSTGPSMGILPAGGVGGPGGSGGASAL AFGSGGVGGAGGLGGPTDGTVQGVGGFGGQGGNGGQSGLLFGNAGAGGAGAAGGAGTG DTESFGGHGGAGGDGGAVGLIGNGGGGGNGGAGGTGSPGAVVGGNGGVGGLGGAGSPG GLLYGTGGAGGNGGPGGDGGTGATVGFAGSGGFGGAGGIAQLFGTGGMGGSGGGIGAG TTTVVPPDVAPVGGTGGNGGRAGLLLGVGGMGGNGGATSVGGTLYAAGGNGGDGGLVW GNGGTGGSGGAGGAGSVGNGGAGGNAALLFGNGGAGGAGGAGGIGAGGAGGFGAVLFG NGGAGGSGAPGGIGAGGNGGNALLVGNGGNGGAGTGGAAGGAGGSGGLLFGQNGMPGP " CDS 1997493..1998737 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1798" /product="L-gulono-1,4-lactone oxidase (EC" /EC_number="1.1.3.8" /note="Mb1798, -, len: 414 aa. Equivalent to Rv1769, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 414 aa overlap). Conserved hypothetical protein, similar to O88066|SCI35.31|AL031541 hypothetical protein from Streptomyces coelicolor (402 aa), FASTA scores: opt: 1341, E(): 0, (53.8% identity in 398 aa overlap). Protein product from Mb1798 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1798 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001608" /db_xref="InterPro:IPR029066" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ90" /protein_id="SIU00402.1" /translation="MHEVAAREQRSDGPMRLDAQGRLQRYEEAFADYDAPFAFVDLDA MWGNADQLLARAGDKPIRVASKSLRCRPLQREILDASERFDGLLTFTLTETLWLAGQG FSNLLLAYPPTDRAALRALGELTAKDPDGAPIVMVDSVEHLDLIERTTDKPVRLCLDF DAGYWRAGGRIKIGSKRSPLHTPEQARALAVEIARRPALTLAALMCYEAHIAGLGDNV AGKRVHNAIIRRMQRMSFEELRERRARAVELVREVADIKIVNAGGTGDLQLVAQEPLI TEATAGSGFYAPTLFDSYSTFTLQPAAMFALPVCRRPGAKTVTALGGGYLASGVGAKD RMPTPYLPVGLKLNALEGTGEVQTPLSGDAARRLKLGDKVYFRHTKAGELCERFDHLH LVRGAEVVDTVPTYRGEGRTFL" CDS 1998745..2000031 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1799" /product="Putative aminopeptidase" /note="Mb1799, -, len: 428 aa. Equivalent to Rv1770, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 428 aa overlap). Conserved hypothetical protein, highly similar in N-terminus to Q49882 Hypothetical protein from Mycobacterium leprae from cosmid L247 (83 aa), FASTA scores: opt: 301, E(): 1e-12, (56.5% identity in 85 aa overlap). Protein product from Mb1799 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1799 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007484" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E3" /protein_id="SIU00403.1" /translation="MDEAHPAHPADAGRPGGPIQGARRGAAMTPITALPTELAAMREV VETLAPIERAAGEPGEHKAAEWIVERLRTAGAQDARIEEEQYLDGYPRLHLKLSVIGV AAGVAGLLSRRLRIPAALAGVGAGLAIADDCANGPRIVRKRTETPRTTWNAVAEAGDP AGQLTVVVCAHHDAAHSGKFFEAHIEEVMVELFPGIVERIDTQLPNWWGPILAPALAG VGALRGSRPMMIAGTVGSALAAALFADIARSPVVPGANDNLSAVALLVALAERLRERP VKGVRVLLVSLGAEETLQGGIYGFLARHKPELDRDRTYFLNFDTIGSPELIMLEGEGP TVMEDYFYRPFRDLVIRAAERADAPLRRGIRSRNSTDAVLMSRAGYPTACFVSINRHK SVANYHLMSDTPENLCYETVSHAVTVAESVIRELAR" CDS 2000028..2001314 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1800" /product="l-gulono-1,4-lactone dehydrogenase" /note="Mb1800, -, len: 428 aa. Equivalent to Rv1771, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 428 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to e.g. GGLO_RAT|P10867 l-gulonolactone oxidase (ec 1.1.3.8) (439 aa), FASTA scores: opt: 862, E(): 0, (34.1% identity in 434 aa overlap). Also shows slight similarity to Mycobacterium tuberculosis oxidoreductase Rv1726|MTCY04C12.11 (22.9% identity in 441 aa overlap) and others e.g. Rv3107c, Rv1257c, Rv2251, etc. Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site. Protein product from Mb1800 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1800 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y038" /db_xref="InterPro:IPR006093" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR007173" /db_xref="InterPro:IPR010031" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016167" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR016171" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y038" /protein_id="SIU00404.1" /translation="MSPIWSNWPGEQVCAPSAIVRPTSEAELADVIAQAAKRGERVRA VGSGHSFTDIACTDGVMIDMTGLQRVLDVDQPTGLVTVEGGAKLRALGPQLAQRRLGL ENQGDVDPQSITGATATATHGTGVRFQNLSARIVSLRLVTAGGEVLSLSEGDDYLAAR VSLGALGVISQVTLQTVPLFTLHRHDQRRSLAQTLERLDEFVDGNDHFEFFVFPYADK ALTRTMHRSDEQPKPTPGWQRMVGENFENGGLSLICQTGRRFPSVAPRLNRLMTNMMS SSTVQDRAYKVFATQRKVRFTEMEYAIPRENGREALQRVIDLVRRRSLPIMFPIEVRF SAPDDSFLSTAYGRDTCYIAVHQYAGMEFESYFRAVEEIMDDYAGRPHWGKRHYQTAA TLRERYPQWDRFAAVRDRLDPDRVFLNDYTRRVLGP" CDS 2001503..2001814 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1801" /product="HYPOTHETICAL PROTEIN" /note="Mb1801, -, len: 103 aa. Equivalent to Rv1772, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Hypothetical unknown protein. Mb1801 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZB7" /db_xref="InterPro:IPR005561" /db_xref="InterPro:IPR024189" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00405.1" /translation="MGSTGGSQPMTANRGPAAISSGSNSGRVLDTARGILIALRRCPA ETAFDELHNAAQRHRLPVFEIAWALVHLAVEGSTPCRSFVDAQSAARREWGQLFAHAA A" CDS complement(2001887..2002633) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1802C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1802c, -, len: 248 aa. Equivalent to Rv1773c, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 248 aa overlap). Probable transcriptional regulator belonging to IclR family, similar to ICLR_ECOLI|P16528 acetate operon repressor from Escherichia coli (274 aa), FASTA scores: opt: 261, E(): 3.3e-10, (26.9% identity in 249 aa overlap). Also similar to Mycobacterium tuberculosis protein Rv1719|MTCY04C12.04 (40.2% identity in 244 aa overlap); and Rv2989. Start site chosen by homology, but may extend further upstream. Contains possible helix-turn-helix motif at aa 37-58 (+3.24 SD). Protein product from Mb1802c detected using SWATH mass spectrometry. Mb1802c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZM4" /db_xref="InterPro:IPR005471" /db_xref="InterPro:IPR014757" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM4" /protein_id="SIU00406.1" /translation="MPPTEGKSTTNRDEGIQVLRRAVAALDEIAAEPGHLRLVDLCER LGLAKSTTRRLLVGLVEVGLVSVDSHGRFALGERLLGFGSVTGAHIAAAFRPTVERVA RATDGETVDLSVLRGQRMWFVDQIESSYRLRAVSAVGLRFPLNGTANGKAALAALDDA DAEAALCRLDPMVAEGLRREIVEIRRTGIAFDRNEHTPGISAAAIARRALGDNVIAIS VPAPTARFLEKEQRIIAALRAAADSPDWTR" CDS 2002699..2004039 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1803" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1803, -, len: 446 aa. Equivalent to Rv1774, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 446 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to several e.g. HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase (458 aa), FASTA scores: opt: 417, E(): 6e-20, (28.4% identity in 462 aa overlap). Also some similarity to Mycobacterium tuberculosis oxidoreductase MTCY04C12.11 (24.1% identity in 444 aa overlap). Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site. Protein product from Mb1803 detected using SWATH mass spectrometry. Mb1803 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZA5" /db_xref="InterPro:IPR006093" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016167" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA5" /protein_id="SIU00407.1" /translation="MRALPAGRHFFRGSDGYEAARRGTVWHRRVPDRYPEVIVQAVSA DDIVSAIRYATVNGHKVSVVSGGHSFAASHLRDGAVLLDVSRIDHASIDADKGRAVVG PGKGGSVLMAELEAQGLFFPGGHCRGVCLGGYLLQGGYGWNSRIYGPACESVIGLDVI TADGAQIHCDADNHADLYWAARGAGPGFFGVVTSFYLKLYPRPATCGTSVYVYPFDLA DEVFTWARAVSAEVDPRVELQALASRGEPSMGIDVPVISLASPAFADSPEEAEQALAL FGTCPVVEQALVKVPYMPTDLPAWYDVAMTHYLSDHHYAVDNMWTSASAEDLLPGIRS ILDTLPPHPAHFLWLNWGPCPPRQDMAYSIEADIYLALYGSWKDPADEAKYADWARSH MAAMSHLAVGIQLADENLGARPARFASDAAMAKLDRVRAEYDPDGLFNSWMGRI" CDS 2004039..2004857 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1804" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1804, -, len: 272 aa. Equivalent to Rv1775, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Conserved hypothetical protein, similar to O28806|AF1466 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (255 aa), FASTA scores: opt: 364, E(): 1e-17, (29.2% identity in 267 aa overlap). Protein product from Mb1804 detected using SWATH mass spectrometry. Mb1804 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041526" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB6" /protein_id="SIU00408.1" /translation="MASDLYLGYRNDDADTPFGKFFKPEMAPLPQHVVVALQHGPQAG MALLAFDDAASIVDEGYQQTENGYGILGDGSMQVSVRTDMPGVTPAMWAWWFGWHGSD TRRYKLWHPRAHLSARWKDGDQDSGAGRRGAQRYVGRWSMISEYIGSTKLGAAIQFVE PAAMGLPDDSDDTVSICARLGSADAPVDAGWFVHQVRSTPGGSEMRSRFWMGGPHIAV RKAPEVASKAVRPIASKLIGVSESTARNLLVYCAQEMNHLAGFLADLWESFGDE" CDS complement(2004862..2005422) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1805C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1805c, -, len: 186 aa. Equivalent to Rv1776c, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 186 aa overlap). Possible regulatory protein, some similarity to Mycobacterium tuberculosis Rv1255c|Q11063 hypothetical transcriptional regulator (202 aa), FASTA scores: opt: 270, E(): 9.7e-09, (28.3% identity in 191 aa overlap) . Contains possible helix-turn-helix motif at aa 37-58 (+3.49 SD). Mb1805c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZB2" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB2" /protein_id="SIU00409.1" /translation="MPGNDWIVGGNRRTIAAERIYAAATDLITRYGLNALDIDKLARE VHCSRATIYRRAGGKAQIRDVVLTRAAARIADGVRSDVETLRGRERVVAAILLSLQRI RSDPLGKLMFGSIHGGAGELAWLTESPLLADFATELTGIAGGDPQGAKWVVRVVLSLM YWPAENDEAERRLVEKYVAPAFAEQS" CDS 2005523..2006827 /codon_start=1 /transl_table=11 /gene="cyp144" /locus_tag="BQ2027_MB1806" /product="Probable cytochrome p450 144 CYP144" /note="Mb1806, cyp144, len: 434 aa. Equivalent to Rv1777, len: 434 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 434 aa overlap). Probable cyp144, cytochrome p450 (EC 1.14.-.-), similar to CPXM_BACME|Q06069 cytochrome p450 (meg) (EC 1.14.99.-) (410 aa), FASTA scores: opt: 435 E(): 2.3e-16, (28.8% identity in 372 aa overlap). Also similar to several other Mycobacterium tuberculosis p450 genes including Rv0766c, Rv2266, etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /db_xref="GOA:A0A1R3XZA3" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA3" /protein_id="SIU00410.1" /translation="MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAE SVQDPYPLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYTAE GTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQFTVQAADRLWV DGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKWGYAATQLLEGLVENDQLV AAGVALMELSGYIFEQFDRAAADPRDNLLGELATACASGELDTLTAQVMMVTLFAAGG ESTAALLGSAVWILATRPDIQQQVRANPELLGAFIEETLRYEPPFRGQYRHVRNATTL DGTELPADSHLLLLWGAANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALAR LEARIVLRLLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ" CDS complement(2006948..2007397) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1807C" /product="unknown protein" /note="Mb1807c, -, len: 149 aa. Equivalent to Rv1778c, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 149 aa overlap). Hypothetical unknown protein. Protein product from Mb1807c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1807c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZ85" /protein_id="SIU00411.1" /translation="MRVSLFLSDAAQADAQSGKVHALGLGWRQCQRPTPPFALVLFLD IDWDETNKQHQLKCQLLTADGDPVVVPGPHGPQRILFEAAAEAGRAPGAIHGTSVRMP LTLNIPAGIPLEPGIYEWRVEVEGYERATAVEAFIVAGGGHPPASCG" CDS complement(2007553..2009346) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1808C" /product="possible integral membrane protein" /note="Mb1808c, -, len: 597 aa. Equivalent to Rv1779c, len: 597 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 597 aa overlap). Possible integral membrane protein. Protein product from Mb1808c detected using SWATH mass spectrometry. Mb1808c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZA4" /db_xref="InterPro:IPR025519" /db_xref="UniProtKB/TrEMBL:A0A1R3XZA4" /protein_id="SIU00412.1" /translation="MCAHEYAEQRSAVSGIEGLLTWLGGGHWRELGERHERSTHAVAG VIVAVGAALAGLLASLAVSEAAQGPISSPIGAASLALVLGLLVGAVTRGTASGPARGR AGVTGRASVAVAVGFVVGELAALVMFSGAIDRRLDEQAMHSADATPAAVQASASLQQA RNARTALDSAVERARGRLDDALVVARCEYHPTPACPQTRITGVPGRGPETRTANQLLA DAQRELDNALAARDHQAPALDAKMAHDEQALAEVRQAVVADAGRGLGSRWVAMNDLTL ASAGALTARMLAIAFFALLYLLPLILRLWRGDTTHDRHAAARAERERAELEADTAIAI KRAEVRRAAEIMWAEHQLTQTRLAIEAQAEIDREQQRRRVVEALEGPVRASSERTLQP VEDEVYLPIAAETEAASRTVAQLPAGAAHHRPGIAKNLPAQVQPEGAVEPREKRATPV IRSIPDATKAAARWIRPLVPPFVARMLDNTTAPLRTARQVFEEVEEIAFSFKRTHKVT VNAEGSDPNDQPPLESHSPAAPAESNPIASSDSARRSRLATNDDHPPLAQVPPRDLAS LSVGSTGELTQREGPHELRSPDGPRQLPPPR" CDS 2009566..2010129 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1809" /product="conserved protein" /note="Mb1809, -, len: 187 aa. Equivalent to Rv1780, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Conserved hypothetical protein, equivalent to Q49881|ML1380|U00021_2 cosmid L247 from Mycobacterium leprae (187 aa), FASTA scores: opt: 1000, E(): 0, (82.4% identity in 187 aa overlap). Protein product from Mb1809 detected using shotgun mass spectrometry. Mb1809 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F0" /protein_id="SIU00413.1" /translation="MQNHDYVTYEEFGRRFFEVAVTPDRVAAAFADIAGSEFAMEPIS QGPGGIAKVSANVKIREPRVTRKLGDLITFVIHIPLSIDLLLDLRLDKQRFMVAGDIA LRATARAAEPLLLIVDVAKPRPSDITVNVSSKSIRGEVLRILAGVDGEIRRFIAQYVS AEIDSPKSQAAQVINVAEQLDSTWSGP" CDS complement(2010169..2012343) /codon_start=1 /transl_table=11 /gene="malQ" /locus_tag="BQ2027_MB1810C" /product="PROBABLE 4-ALPHA-GLUCANOTRANSFERASE MALQ (Amylomaltase) (Disproportionating enzyme) (D-enzyme)" /note="Mb1810c, malQ, len: 724 aa. Equivalent to Rv1781c, len: 724 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 724 aa overlap). Probable malQ, 4-ALPHA-GLUCANOTRANSFERASE (EC 2.4.1.25), similar to many, e.g. P15977|MALQ_ECOLI 4-ALPHA-GLUCANOTRANSFERASE (694 aa), FASTA scores: opt: 964, E(): 0, (31.8% identity in 694 aa overlap). BELONGS TO THE DISPROPORTIONATING ENZYME FAMILY. Protein product from Mb1810c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1810c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65337" /db_xref="InterPro:IPR003385" /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/Swiss-Prot:P65337" /protein_id="SIU00414.1" /translation="MTELAPSLVELARRFGIATEYTDWTGRQVLVSEATLVAALAALG VPAQTEQQRNDALAAQLRSYWARPLPATIVMRAGEQTQFRVHVTDGAPADVWLQLEDG TTRAEVVQVDNFTPPFDLDGRWIGEASFVLPADLPLGYHRVNLRSGDSQASAAVVVTP DWLGLPDKLAGRRAWGLAVQLYSVRSRQSWGIGDLTDLANLALWSASAHGAGYVLVNP LHAATLPGPAGRSKPIEPSPYLPTSRRFVNPLYLRVEAIPELVDLPKRGRVQRLRTNV QQHADQLDTIDRDSAWAAKRAALKLVHRVPRSAGRELAYAAFRTREGRALDDFATWCA LAETYGDDWHRWPKSLRHPDASGVADFVDKHADAVDFHRWLQWQLDEQLASAQSQALR AGMSLGIMADLAVGVHPNGADAWALQDVLAQGVTAGAPPDEFNQLGQDWSQPPWRPDR LAEQEYRPFRALIQAALRHAGAVRIDHIIGLFRLWWIPDGAPPTQGTYVRYDHDAMIG IVALEAHRAGAVVVGEDLGTVEPWVRDYLLLRGLLGTSILWFEQDRDCGPAGTPLPAE RWREYCLSSVTTHDLPPTAGYLAGDQVRLRESLGLLTNPVEAELESARADRAAWMAEL RRVGLLADGAEPDSEEAVLALYRYLGRTPSRLLAVALTDAVGDRRTQNQPGTTDEYPN WRVPLTGPDGQPMLLEDIFTDRRAATLAEAVRAATTSPMSCW" CDS 2012607..2014127 /codon_start=1 /transl_table=11 /gene="eccb5" /locus_tag="BQ2027_MB1811" /product="esx conserved component eccb5. esx-5 type vii secretion system protein. probable membrane protein." /note="Mb1811, -, len: 506 aa. Equivalent to Rv1782, len: 506 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 506 aa overlap). Probable conserved membrane protein, similar to four other Mycobacterium tuberculosis hypothetical membrane proteins e.g. O05449|Rv3895c|MTCY15F10.17|Z94121 (495 aa), FASTA scores: opt: 1106, E(): 0, (41.2% identity in 485 aa overlap); Rv0283, Rv3450c, and Rv3869, all located near ESAT-6 family genes. Also similar to O33088|MLCB628.17C|Y14967 cosmid B628 from Mycobacterium leprae (481 aa), (32.7% identity in 486 aa overlap); and equivalent to Q9Z5I3|MLCB596.27|AL035472 hypothetical protein from Mycobacterium leprae (506 aa) (82.6% identity in 506 aa overlap). Has hydrophobic stretch from aa 54-76. Protein product from Mb1811 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1811 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZC8" /db_xref="InterPro:IPR007795" /db_xref="InterPro:IPR042485" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC8" /protein_id="SIU00415.1" /translation="MAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWHVRM EIEPGRRQTLAVVASVSAALVICLGALLWSFISPSGQLNESPIIADRDSGALYVRVGD RLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFSPKSPPASSWL VCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDAVVLRYGGDAWVIREGRRS RIEPTNRAVLLPLGLTPEQVSQARPMSRALFDALPVGPELLVPEVPNAGGPATFPGAP GPIGTVIVTPQISGPQQYSLVLGDGVQTLPPLVAQILQNAGSAGNTKPLTVEPSTLAK MPVVNRLDLSAYPDNPLEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEM NKVVSLVKADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVED SKEARDALGLTLTPSLAPWVALRLLPQGPTLSRADALVEHDTLPMDMTPAELVVPK" CDS 2014124..2018299 /codon_start=1 /transl_table=11 /gene="eccc5" /locus_tag="BQ2027_MB1812" /product="esx conserved component eccc5. esx-5 type vii secretion system protein." /note="Mb1812, -, len: 1391 aa. Similar to Rv1783 and Rv1784, len: 435 aa and 932 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 435 aa overlap and 99.9% identity in 932 aa overlap). Rv1783: Probable conserved membrane protein. Member of family of Mycobacterium tuberculosis hypothetical proteins including O05450|Rv3894c|MTY15F10.18|Z94121 (1396 aa), FASTA scores: opt: 542, E(): 1.5e-26, (31.4% identity in 440 aa overlap); Rv3447c, Rv0284, Rv3870, Rv1784, and Rv3871, all linked to ESAT-6 family gene. Similar to N-terminal part of Rv3894c (1396 aa), Rv1784 is similar to remainder of Rv3894c. Also similar to O33087|MLCB628.16C|Y14967 Hypothetical protein from Mycobacterium leprae (744 aa), (30.0% identity in 437 aa overlap) and equivalent to N-terminal part of Q9Z5I2|MLCB596.28|AL035472 hypothetical protein from Mycobacterium leprae (1345 aa), (86.4% identity in 397 aa overlap). Rv1784: Conserved hypothetical protein, member of family of Mycobacterium tuberculosis hypothetical proteins including Rv3447c, Rv0284, Rv3870, Rv1783, Rv3871, Rv3894c, all linked to ESAT-6 family genes. Probably ATP-binding membrane proteins. Similar to C-terminal region of 006264|Rv3447c (1236 aa), (36.2% identity in 930 aa overlap). Equivalent to C-terminal region of Mycobacterium leprae hypothetical protein Q9Z512|MLCB596.28|AL035472 (1345 aa), (87.8% identity in 932 aa overlap); also similar to other hypothetical proteins e.g. MLCB628.14 from Mycobacterium leprae, (32.0% identity in 600 aa overlap); MLCB628.15 from Mycobacterium leprae, (35.0% identity in 280 aa overlap); and O86653|SC3C3.20|AL031231 ATP/GTP binding protein from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 618, E(): 4.6e-30, (34.3% identity in 937 aa overlap). Contains two times PS00017 ATP/GTP-binding site motif A (P-loop). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1783 and Rv1784 exist as 2 genes. In Mycobacterium bovis, a single base transversion (a-t) leads to a single product. Protein product from Mb1812 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1812 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZN3" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR023836" /db_xref="InterPro:IPR023837" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN3" /protein_id="SIU00416.1" /translation="MKRGFARPTPEKPPVIKPENIVLSTPLSIPPPEGKPWWLIVVGV VVVGLLGGMVAMVFASGSHVFGGIGSIFPLFMMVGIMMMMFRGMGGGQQQMSRPKLDA MRAQFMLMLDMLRETAQESADSMDANYRWFHPAPNTLAAAVGSPRMWERKPDGKDLNF GVVRVGVGMTRPEVTWGEPQNMPTDIELEPVTGKALQEFGRYQSVVYNLPKMVSLLVE PWYALVGEREQVLGLMRAIICQLAFSHGPDHVQMIVVSSDLDQWDWVKWLPHFGDSRR HDAAGNARMVYTSVREFAAEQAELFAGRGSFTPRHASSSAQTPTPHTVIIADVDDPQW EYVISAEGVDGVTFFDLTGSSMWTDIPERKLQFDKTGVIEALPRDRDTWMVIDDKAWF FALTDQVSIAEAEEFAQKLAQWRLAEAYEEIGQRVAHIGARDILSYYGIDDPGNIDFD SLWASRTDTMGRSRLRAPFGNRSDNGELLFLDMKSLDEGGDGPHGVMSGTTGSGKSTL VRTVIESLMLSHPPEELQFVLADLKGGSAVKPFAGVPHVSRIITDLEEDQALMERFLD ALWGEIARRKAICDSAGVDDAKEYNSVRARMRARGQDMAPLPMLVVVIDEFYEWFRIM PTAVDVLDSIGRQGRAYWIHLMMASQTIESRAEKLMENMGYRLVLKARTAGAAQAAGV PNAVNLPAQAGLGYFRKSLEDIIRFQAEFLWRDYFQPGVSIDGEEAPALVHSIDYIRP QLFTNSFTPLEVSVGGPDIEPVVAQPNGEMLESDDIEGGEDEDEEGVRTPKVGTVIID QLRKIKFEPYRLWQPPLTQPVAIDDLVNRFLGRPWHKEYGSACNLVFPIGIIDRPYKH DQPPWTVDTSGPGANVLILGAGGSGKTTALQTLICSAALTHTPQQVQFYCLAYSSTAL TTVSRIPHVGEVAGPTDPYGVRRTVAELLALVRERKRSFLECGIASMEMFRRRKFGGE AGPVPDDGFGDVYLVIDNYRALAEENEVLIEQVNVIINQGPSFGVHVVVTADRESELR PPVRSGFGSRIELRLAAVEDAKLVRSRFAKDVPVKPGRGMVAVNYVRLDSDPQAGLHT LVARPALGSTPDNVFECDSVVAAVSRLTSAQAPPVRRLPARFGVEQVRELASRDTRQG VGAGGIAWAISELDLAPVYLNFAENSHLMVTGRRECGRTTTLATIMSEIGRLYAPGAS SAPPPAPGRPSAQVWLVDPRRQLLTALGSDYVERFAYNLDGVVAMMGELAAALAGREP PPGLSAEELLSRSWWSGPEIFLIVDDIQQLPPGFDSPLHKAVPFVNRAADVGLHVIVT RTFGGWSSAGSDPMLRALHQANAPLLVMDADPDEGFIRGKMKGGPLPRGRGLLMAEDT GVFVQVAATEVRR" CDS complement(2018314..2019495) /codon_start=1 /transl_table=11 /gene="cyp143" /locus_tag="BQ2027_MB1813C" /product="PROBABLE CYTOCHROME P450 143 CYP143" /note="Mb1813c, cyp143, len: 393 aa. Equivalent to Rv1785c, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 393 aa overlap). Probable cyp143, cytochrome P450 (1.14.-.-), similar to many e.g. AE0001|RZAE000101_4 Rhizobium sp. NGR234 (414 aa), FASTA scores: opt: 663, E(): 0, (32.4% identity in 413 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb1813c detected using SWATH mass spectrometry. Mb1813c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63724" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63724" /protein_id="SIU00417.1" /translation="MTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFM NGWYYLTRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYF SPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRL IGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPLS EIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRVFIEEIVRLE PSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWG FGGGPHRCLGSHLARLELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRW S" CDS 2019695..2019898 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1814" /product="probable ferredoxin" /note="Mb1814, -, len: 67 aa. Equivalent to Rv1786, len: 67 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Probable ferredoxin (EC 1.-.-.-), similar to others e.g. X63601|FERS_STRGR FERREDOXIN from Streptomyces griseus (65 aa), FASTA scores: opt: 140, E(): 0.001, (38.1% identity in 63 aa overlap); T50943 probable ferredoxin DitA from Pseudomonas abietaniphila (78 aa); BAA84714.1|AB017795 ferredoxin from Nocardioides sp. (69 aa); etc. Also similar to Rv0763c|MTCY369.08 from Mycobacterium tuberculosis (68 aa), FASTA score: (30.6% identity in 62 aa overlap); and Rv0763c. Mb1814 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZC4" /protein_id="SIU00418.1" /translation="MKVRLDPSRCVGHAQCYAVDPDLFPIDDSGNSILAEHEVRPEDM QLTRDGVAACPEMALILEEDDAD" CDS 2020168..2021262 /codon_start=1 /transl_table=11 /gene="PPE25" /locus_tag="BQ2027_MB1815" /product="ppe family protein ppe25" /note="Mb1815, PPE25, len: 364 aa. Equivalent to Rv1787, len: 365 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 365 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to Z74024|MTCY274.24 Mycobacterium tuberculosis cosmid (404 aa), FASTA scores: opt: 837, E(): 0, (52.0% identity in 406 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 8 bp to 5 bp substitution (agcccggt-ccggg), leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (364 aa versus 365 aa)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC3" /protein_id="SIU00419.1" /translation="MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATG YASVIAELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAAFV MTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVAMYGYAAASAS ASRLIPFAAPPKTTNSAGVVAQVAAVAAMPGLLQRLSSAASVSWSNPNDWWLVRLLGS ITPTERTTIVRLLGQSYFATGMAQFFASIAQQLTFGPGGTTAGSGGAWYPTPQFAGLG ASRAVSASLARANKIGALSVPPSWVKTTALTEPGAHAVSANPTVGSSHGPHGLLRGLP LGSRITRRSGAFAHRYGFRHSVVARPPSAG" CDS 2021341..2021640 /codon_start=1 /transl_table=11 /gene="PE18" /locus_tag="BQ2027_MB1816" /product="pe family protein pe18" /note="Mb1816, PE18, len: 99 aa. Equivalent to Rv1788, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Member of the Mycobacterium tuberculosis PE family of gly-, ala-rich proteins, similar to Z93777|MTCI364.07 Mycobacterium tuberculosis cosmid (99 aa), FASTA scores: opt: 414, E(): 3.6e-20, (72.4% identity in 98 aa overlap). Mb1816 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB3" /protein_id="SIU00420.1" /translation="MSFVTTQPEALAAAAGSLQGIGSALNAQNAAAATPTTGVVPAAA DEVSALTAAQFAAHAQIYQAVSAQAAAIHEMFVNTLQMSSGSYAATEAANAAAAG" CDS 2021654..2022835 /codon_start=1 /transl_table=11 /gene="PPE26" /locus_tag="BQ2027_MB1817" /product="ppe family protein ppe26" /note="Mb1817, PPE26, len: 393 aa. Equivalent to Rv1789, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 393 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to others e.g.Z98268|MTCI125.26 Mycobacterium tuberculosis cosmid (385 aa), FASTA score: opt: 1283, E(): 0, (62.7% identity in 408 aa overlap). Mb1817 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZ96" /protein_id="SIU00421.1" /translation="MDFGALPPEVNSVRMYAGPGSAPMVAAASAWNGLAAELSSAATG YETVITQLSSEGWLGPASAAMAEAVAPYVAWMSAAAAQAEQAATQARAAAAAFEAAFA ATVPPPLIAANRASLMQLISTNVFGQNTSAIAAAEAQYGEMWAQDSAAMYAYAGSSAS ASAVTPFSTPPQIANPTAQGTQAAAVATAAGTAQSTLTEMITGLPNALQSLTSPLLQS SNGPLSWLWQILFGTPNFPTSISALLTDLQPYASFFYNTEGLPYFSIGMGNNFIQAAK TLGLIGSAAPAAVAAAGDAAKGLPGLGGMLGGGPVAAGLGNAASVGKLSVPPVWSGPL PGSVTPGAAPLPVSTVSAAPEAAPGSLLGGLPLAGAGGAGAGPRYGFRPTVMARPPFA G" CDS 2023289..2024341 /codon_start=1 /transl_table=11 /gene="PPE27" /locus_tag="BQ2027_MB1818" /product="ppe family protein ppe27" /note="Mb1818, PPE27, len: 350 aa. Equivalent to Rv1790, len: 350 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 350 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich protein, similar to Z74024|MTCY274.24 Mycobacterium tuberculosis cosmid (404 aa), FASTA scores: opt: 849, E(): 0, (50.0% identity in 406 aa overlap). Mb1818 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZB1" /protein_id="SIU00422.1" /translation="MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATG YASVIAELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAAFV MTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVAMYGYAAASAS ASRLIPFAAPPKTTNSAGVVAQAVASVSWSNPNDWWLVRLLGSITPTERTTIVRLLGQ SYLATGMARFLTSIAQQLTFGPGGTTAGSGGAWYPTPQFAGLGAGPAVSASLARAEPV GRLSVPPSWAVAAPAFAEKPEAGTPMSVIGEASSCGQGGLLRGIPLARAGRRTGAFAH RYGFRHSVITRSPSAG" CDS 2024768..2025067 /codon_start=1 /transl_table=11 /gene="PE19" /locus_tag="BQ2027_MB1819" /product="pe family protein pe19" /note="Mb1819, PE19, len: 99 aa. Equivalent to Rv1791, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Member of the Mycobacterium tuberculosis PE family, but no glycine rich C-terminus, highly similar to Z93777|MTCI364.07 M.tuberculosis cosmid (99 aa) opt: 430 E(): 2.4e-21, (75.5% identity in 98 aa overlap). Mb1819 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H4" /protein_id="SIU00423.1" /translation="MSFVTTQPEALAAAAANLQGIGTTMNAQNAAAAAPTTGVVPAAA DEVSALTAAQFAAHAQMYQTVSAQAAAIHEMFVNTLVASSGSYAATEAANAAAAG" CDS 2025211..2025507 /codon_start=1 /transl_table=11 /gene="esxM" /locus_tag="BQ2027_MB1820" /standard_name="TB11.0; QILSS" /product="esat-6 like protein esxm" /note="Mb1820, esxM, len: 98 aa. Equivalent to Rv1792, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (98.0% identity in 98 aa overlap). esxM, conserved hypothetical protein, member of Mycobacterium tuberculosis QILSS familyof proteins, genes linked to those of ESAT-6 family. Has in-frame stop codon at 18074, no error could be found to account for this. Identical (apart from stop codon) to P96363|Rv1038c|MTCY10G2.11 PUTATIVE ESAT-6 LIKE PROTEIN 2 (98 aa), FASTA scores: opt: 389, E(): 5.8e-26, (100.0% identity in 58 aa overlap). Also identical to Rv1038c, Rv1197, and almost identical to Rv3620c and Rv2347c. Similar protein present in Mycobacterium leprae e.g. Q49946|MLCB1701.06C|AL049191 PUTATIVE ESAT-6 LIKE PROTEIN X (95 aa), FASTA scores: opt: 343, E(): 1.6e-17, (57.6% identity in 92 aa overlap). Mb1820 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P59805" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00424.1" /translation="MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGQAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" CDS 2025558..2025842 /codon_start=1 /transl_table=11 /gene="esxN" /locus_tag="BQ2027_MB1821" /standard_name="ES6_5; Mtb9.9A" /product="putative esat-6 like protein esxn (esat-6 like protein 5)" /note="Mb1821, esxN, len: 94 aa. Equivalent to Rv1793, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). esxN, putative ESAT-6 like protein, conserved hypothetical protein, almost identical to several hypothetical mycobacterial proteins of the ESAT-6-like family including P95242|Rv2346c|MTCY98.15C|Z83860 PUTATIVE ESAT-6 LIKE PROTEIN 6 (94 aa), FASTA scores: opt: 610, E(): 0, (97.9 % identity in 94 aa overlap); Rv3619c, Rv1037c, and Rv1198, etc. Also present in Mycobacterium leprae. Protein product from Mb1821 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1821 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A571" /db_xref="InterPro:IPR009416" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P0A571" /protein_id="SIU00425.1" /translation="MTINYQFGDVDAHGAMIRAQAASLEAEHQAIVRDVLAAGDFWGG AGSVACQEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" CDS 2025930..2026832 /codon_start=1 /transl_table=11 /gene="espG5" /locus_tag="BQ2027_MB1822" /product="ESX-5 secretion-associated protein EspG5" /note="Mb1822, -, len: 300 aa. Equivalent to Rv1794, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Conserved hypothetical protein, slight similarity to Mycobacterium tuberculosis O53694|Rv0289|MTV035.17, (295 aa), FASTA scores: opt: 172, E(): 0.00083, (25.7% identity in 261 aa overlap). Equivalent to Mycobacterium leprae hypothetical protein Q9Z5I1|MLCB596.31|AL035472 (300 aa), (88.0% identity in 300 aa overlap). Contains PS00211 ABC transporters family signature. Protein product from Mb1822 detected using shotgun mass spectrometry. Mb1822 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025734" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00426.1" /translation="MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSN DWLNEHPGMAVMREQGIVVNDAVNEQVAARMKVLAAPDLEVVALLSRGKLLYGVIDDE NQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVTVSDSASIAALVMDGLES IHHADPAAINAVNVPMEEMLEATKSWQESGFNVFSGGDLRRMGISAATVAALGQALSD PAAEVAVYARQYRDDAKGPSASVLSLKDGSGGRIALYQQARTAGSGEAWLAICPATPQ LVQVGVKTVLDTLPYGEWKTHSRV" CDS 2027104..2028615 /codon_start=1 /transl_table=11 /gene="eccd5" /locus_tag="BQ2027_MB1823" /product="esx conserved component eccd5. esx-5 type vii secretion system protein. probable membrane protein." /note="Mb1823, -, len: 503 aa. Equivalent to Rv1795, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 503 aa overlap). Conserved hypothetical membrane protein, has a hydrophilic stretch from ~1-130 then very hydrophobic. Similar to several other mycobacterial proteins, all linked to ESAT-6 family e.g. Rv3887c|MTY15F10.24|Z94121 (509 aa), FASTA scores: opt: 360, E(): 1.6e-15, (26.7% identity in 514 aa overlap); Rv3448, and Rv0290. Protein product from Mb1823 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1823 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZC1" /db_xref="InterPro:IPR006707" /db_xref="InterPro:IPR024962" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC1" /protein_id="SIU00427.1" /translation="MTAVADAPQADIEGVASPQAVVVGVMAGEGVQIGVLLDANAPVS VMTDPLLKVVNSRLRELGEAPLEATGRGRWALCLVDGAPLRATQSLTEQDVYDGDRLW IRFIADTERRSQVIEHISTAVASDLSKRFARIDPIVAVQVGASMVATGVVLATGVLGW WRWHHNTWLTTIYTAVIGVLVLAVAMLLLMRAKTDADRRVADIMLMSAIMPVTVAAAA APPGPVGSPQAVLGFGVLTVAAALALRFTGRRLGIYTAIVIIDALTMLAALARMVAAT SAVTLLSSLLLICVVAYHAAPALSRRLAGIRLPVFPSATSRWVFEARPDLPTTVVVSG GSAPVLEGPSSVRDVLLQAERARSFLSGLLTGLGVMVVVCMTSLCDPHTGQRWLPLIL AGFTSGFLLLRGRSYVDRWQSITLAGTAVIIAAAVCVRYALELSSPLAVSIVAAILVL LPAAGMAAAAHVPHTIYSPLFRKFVEWIEYLCLMPIFPLALWLMNVYAAIRYR" CDS 2028593..2030350 /codon_start=1 /transl_table=11 /gene="mycp5" /locus_tag="BQ2027_MB1824" /product="probable proline rich membrane-anchored mycosin mycp5 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-5)" /note="Mb1824, -, len: 585 aa. Equivalent to Rv1796, len: 585 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 585 aa overlap). Conserved hypothetical Pro-rich protease. Member of family with four other Mycobacterium tuberculosis hypothetical proteases including Rv3886c|O05458|MTCY15F10.26|Z94121 (550 aa), FASTA scores: opt: 1173, E(): 0, (47.9% identity in 578 aa overlap); Rv0291, Rv3883c, and Rv3449. Genes all linked to those of ESAT-6 family. Has possible N-terminal signal peptide and hydrophobic anchor-like stretch at C-terminus. Contains two serine protease, subtilase family active site motifs: a aspartic acid active site motif (PS00136); and a histidine active site motif (PS00137). Protein product from Mb1824 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1824 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZD8" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR023827" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD8" /protein_id="SIU00428.1" /translation="MQRFGTGSSRSWCGRAGTATIAAVLLASGALTGLPPAYAISPPT IDPGALPPDGPPGPLAPMKQNAYCTEVGVLPGTDFQLQPKYMEMLNLNEAWQFGRGDG VKVAVIDTGVTPHPRLPRLIPGGDYVMAGGDGLSDCDAHGTLVASMIAAVPANGAVPL PSVPRRPVTIPTTETPPPPQTVTLSPVPPQTVTVIPAPPPEEGVPPGAPVPGPEPPPA PGPQPPAVDRGGGTVTVPSYSGGRKIAPIDNPRNPHPSAPSPALGPPPDAFSGIAPGV EIISIRQSSQAFGLKDPYTGDEDPQTAQKIDNVETMARAIVHAANMGASVINISDVMC MSARNVIDQRALGAAVHYAAVDKDAVIVAAAGDGSKKDCKQNPIFDPLQPDDPRAWNA VTTVVTPSWFHDYVLTVGAVDANGQPLSKMSIAGPWVSISAPGTDVVGLSPRDDGLIN AIDGPDNSLLVPAGTSFSAAIVSGVAALVRAKFPELSAYQIINRLIHTARPPARGVDN QVGYGVVDPVAALTWDVPKGPAEPPKQLSAPLVVPQPPAPRDMVPIWVAAGGLAGALL IGGAVFGTATLMRRSRKQQ" CDS 2030347..2031567 /codon_start=1 /transl_table=11 /gene="ecce5" /locus_tag="BQ2027_MB1825" /product="esx conserved component ecce5. esx-5 type vii secretion system protein. probable membrane protein." /note="Mb1825, -, len: 406 aa. Equivalent to Rv1797, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Conserved hypothetical protein, some similarity to Mycobacterium tuberculosis O05462|Rv3882c|MTCY15F10.30|Z94121 (462 aa), FASTA scores: opt: 181, E(): 9.2e-05, (25.4% identity in 283 aa overlap). Has hydrophobic stretch near N-terminus. Protein product from Mb1825 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1825 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZD1" /db_xref="InterPro:IPR021368" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD1" /protein_id="SIU00429.1" /translation="MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVA WWVGVGVAAVVTLLSVVSYHGITVISGLATWVRDWSADPGTTLGAGCTPAIDHQRRFG RDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADGLRQFDIHLDG IDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTVADRRRTWLVLRMNPQRNV AAVACRDSLASTLVAATERLVQDLDGQSCAARPVTADELTEVDSAVLADLEPTWSRPG WRHLKHFNGYATSFWVTPSDITSETLDELCLPDSPEVGTTVVTVRLTTRVGSPALSAW VRYHSDTRLPKEVAAGLNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPV GQELEHATSSFVGQ" CDS 2031564..2033396 /codon_start=1 /transl_table=11 /gene="ecca5" /locus_tag="BQ2027_MB1826" /product="esx conserved component ecca5. esx-5 type vii secretion system protein." /note="Mb1826, -, len: 610 aa. Equivalent to Rv1798, len: 610 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 610 aa overlap). Conserved hypothetical protein, similar to several mycobacterial proteins e.g. O05460|MTCY15F10.28|Rv3884c|Z94121 from M. tuberculosis (619 aa), FASTA scores: opt: 669, E(): 0, (31.0% identity in 549 aa overlap); and O33089|MLCB628.18c|Y14967 from Mycobacterium leprae (573 aa), FASTA scores: opt: 723, E(): 0, (32.4% identity in 568 aa overlap). Also very similar to Rv0282. May belong to the CBXX/CFQX family as last ~320 aa domain very similar to several family members. Contains ATP/GTP-binding site motif A (P-loop; PS00017). Protein product from Mb1826 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1826 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63745" /db_xref="InterPro:IPR000641" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR023835" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041627" /db_xref="UniProtKB/Swiss-Prot:P63745" /protein_id="SIU00430.1" /translation="MTRPQAAAEDARNAMVAGLLASGISVNGLQPSHNPQVAAQMFTT ATRLDPKMCDAWLARLLAGDQSIEVLAGAWAAVRTFGWETRRLGVTDLQFRPEVSDGL FLRLAITSVDSLACAYAAVLAEAKRYQEAAELLDATDPRHPFDAELVSYVRGVLYFRT KRWPDVLAQFPEATQWRHPELKAAGAAMATTALASLGVFEEAFRRAQEAIEGDRVPGA ANIALYTQGMCLRHVGREEEAVELLRRVYSRDAKFTPAREALDNPNFRLILTDPETIE ARTDPWDPDSAPTRAQTEAARHAEMAAKYLAEGDAELNAMLGMEQAKKEIKLIKSTTK VNLARAKMGLPVPVTSRHTLLLGPPGTGKTSVARAFTKQLCGLTVLRKPLVVETSRTK LLGRYMADAEKNTEEMLEGALGGAVFFDEMHTLHEKGYSQGDPYGNAIINTLLLYMEN HRDELVVFGAGYAKAMEKMLEVNQGLRRRFSTVIEFFSYTPQELIALTQLMGRENEDV ITEEESQVLLPSYTKFYMEQSYSEDGDLIRGIDLLGNAGFVRNVVEKARDHRSFRLDD EDLDAVLASDLTEFSEDQLRRFKELTREDLAEGLRAAVAEKKTK" CDS 2034023..2034214 /codon_start=1 /transl_table=11 /gene="lppT" /locus_tag="BQ2027_MB1827" /product="PROBABLE LIPOPROTEIN LPPT" /note="Mb1827, lppT, len: 63 aa. Equivalent to Rv1799, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Probable lppT lipoprotein, has possible signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb1827 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZB0" /protein_id="SIU00431.1" /translation="MSVKSKNGRLAARVLVALAALFAMIALTGSACLAEGPPLGRNPQ GAPAPVGGTVIVAPMHSGV" CDS 2034317..2036284 /codon_start=1 /transl_table=11 /gene="PPE28" /locus_tag="BQ2027_MB1828" /product="ppe family protein ppe28" /note="Mb1828, PPE28, len: 655 aa. Equivalent to Rv1800, len: 655 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 655 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, C-terminal very similar to parts of PE proteins e.g. Z92770|MTCI5.25|Rv0151c (588 aa), FASTA scores: opt: 1269, E(): 0, (41.5% identity in 591 aa overlap). Mb1828 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR013228" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC0" /protein_id="SIU00432.1" /translation="MLPNFAVLPPEVNSARVFAGAGSAPMLAAAAAWDDLASELHCAA MSFGSVTSGLVVGWWQGSASAAMVDAAASYIGWLSTSAAHAEGAAGLARAAVSVFEEA LAATVHPAMVAANRAQVASLVASNLFGQNAPAIAALESLYEWMWAQDAAAMAGYYVGA SAVATQLASWLQRLQSIPGAASLDARLPSSAEAPMGVVRAVNSAIAANAAAAQTVGLV MGGSGTPIPSARYVELANALYMSGSVPGVIAQALVTPQGLYPVVVIKNLTFDSSVAQG AVILESAIRQQIAAGNNVTVFGYSQSATISSLVMANLAASADPPSPDELSFTLIGNPN NPNGGVATRFPGISFPSLGVTATGATPHNLYPTKIYTIEYDGVADFPRYPLNFVSTLN AIAGTYYVHSNYFILTPEQIDAAVPLTNTVGPTMTQYYIIRTENLPLLEPLRSVPIVG NPLANLVQPNLKVIVNLGYGDPAYGYSTSPPNVATPFGLFPEVSPVVIADALAAGTQQ GIGDFAYDVSHLELPLPADGSTMPSTAPGSGTPVPPLSIDSLIDDLQVANRNLANTIS KVAATSYATVLPTADIANAALTIVPSYNIHLFLEGIQQALKGDPMGLVNAVGYPLAAD VALFTAAGGLQLLIIISAGRTIANDISAIVP" CDS 2036865..2038136 /codon_start=1 /transl_table=11 /gene="PPE29" /locus_tag="BQ2027_MB1829" /product="ppe family protein ppe29" /note="Mb1829, PPE29, len: 423 aa. Equivalent to Rv1801, len: 423 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 423 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to AL022021|MTV049.29|Rv1808 (409 aa), FASTA scores: opt: 1229, E(): 0, (55.2% identity in 422 aa overlap). TBparse score is 0.927." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00433.1" /translation="MDFGLLPPEINSGRMYTGPGPGPMLAAATAWDGLAVELHATAAG YASELSALTGAWSGPSSTSMASAAAPYVAWMSATAVHAELAGAQARLAIAAYEAAFAA TVPPPVIAANRAQLMVLIATNIFGQNTPAIMMTEAQYMEMWAQDAAAMYGYAGSSATA SRMTAFTEPPQTTNHGQLGAQSSAVAQTAATAAGGNLQSAFPQLLSAVPRALQGLALP TASQSASATPQWVTDLGNLSTFLGGAVTGPYTFPGVLPPSGVPYLLGIQSVLVTQNGQ GVSALLGKIGGKPITGALAPLAEFALHTPILGSEGLGGGSVSAGIGRAGLVGKLSVPQ GWTVAAPEIPSPAAALQATRLAAAPIAATDGAGALLGGMALSGLAGRAAAGSTGHPIG SAAAPAVGAAAAAVEDLATEANIFVIPAMDD" CDS 2038248..2039639 /codon_start=1 /transl_table=11 /gene="PPE30" /locus_tag="BQ2027_MB1830" /product="ppe family protein ppe30" /note="Mb1830, PPE30, len: 463 aa. Equivalent to Rv1802, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 463 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to AL022021|MTV049.30|Rv1809 (468 aa), FASTA scores: opt: 1238, E(): 0, (51.0% identity in 471 aa overlap)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/Swiss-Prot:P0A693" /protein_id="SIU00434.1" /translation="MDFGVLPPEINSGRMYAGPGSGPMLAAAAAWDGLATELQSTAAD YGSVISVLTGVWSGQSSGTMAAAAAPYVAWMSATAALAREAAAQASAAAAAYEAAFAA TVPPPVVAANRAELAVLAATNIFGQNTGAIAAAEARYAEMWAQDAAAMYGYAGSSSVA TQVTPFAAPPPTTNAAGLATQGVAVAQAVGASAGNARSLVSEVLEFLATAGTNYNKTV ASLMNAVTGVPYASSVYNSMLGLGFAESKMVLPANDTVISTIFGMVQFQKFFNPVTPF NPDLIPKSALGAGLGLRSAISSGLGSTAPAISAGASQAGSVGGMSVPPSWAAATPAIR TVAAVFSSTGLQAVPAAAISEGSLLSQMALASVAGGALGGAAARATGGFLGGGRVTAV KKSLKDSDSPDKLRRVVAHMMEKPESVQHWHTDEDGLDDLLAELKKKPGIHAVHMAGG NKAEIAPTISESG" CDS complement(2039787..2041409) /codon_start=1 /transl_table=11 /gene="PE_PGRS32b" /locus_tag="BQ2027_MB1831C" /product="PE-PGRS FAMILY PROTEIN [SECOND PART]" /note="Mb1831c, PE_PGRS32b, len: 540 aa. Equivalent to 3' end of Rv1803c, len: 639 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 540 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Most similar to Rv1768|MTCY28.34|Z95890 (618 aa), FASTA scores: opt: 1827, E(): 0, (53.5% identity in 664 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS32 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits PE_PGRS32 into 2 parts, PE_PGRS32a and PE_PGRS32b, with PE_PGRS32a being truncated. Mb1831c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZE5" /protein_id="SIU00435.1" /translation="MQIVGQTALDAINSPVQTLTGRPLIGNGANGVAGTGQNGGDGGW LYGNGGNGGSGGTGQNGGNGGSAGLWGSGGNGGQGGAGANGAAGQPGKAGGSGGNGGA GGWIYGHGGHGGAGGNGGNATAPGGASAGFDGGAGGNGGSGGRGGLLFGNGGNGSVGG MGGQGTNDTAGDSAGSGGLGGNGGNGAQGGWLIGNGGQGGDSGAGGGTDSTQTGVMNG ASGGSAGIAGNGGDAGLVGNGGAGGNGGNGAAGSALGTTIFGGSGGVGGSGGDGGNGG WLFGSGASGGNGGQGGDAGTNGFAGFGGSAGGGGWVGAVNFGPISVQGFGLFGHGGDG GNGGDVGAGSLSIQFGASGGDGGQGGVLYGNGGNGGNAGSGGGTGFEGSAGQGGAAIL IGNGGAGGNGATGGTGVGNIIQEAGGDGSDGGAGGSGGLLFGSGGAGGIGGAGGVGGS GNDGGNGGDGGQGGASGLGIGNGGPGGSGGTGGAGGTGGSAGTGGAGGDGGNAALLIG TGGDGGDGVPPAPGGQGGKGGLIGLPGQNGQP" CDS complement(2041443..2041706) /codon_start=1 /transl_table=11 /gene="PE_PGRS32a" /locus_tag="BQ2027_MB1832C" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb1832c, PE_PGRS32a, len: 87 aa. Equivalent to 5' end of Rv1803c, len: 639 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Most similar to Rv1768|MTCY28.34|Z95890 (618 aa), FASTA scores: opt: 1827, E(): 0, (53.5% identity in 664 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS32 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits PE_PGRS32 into 2 parts, PE_PGRS32a and PE_PGRS32b, with PE_PGRS32a being truncated. Mb1832c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZQ3" /protein_id="SIU00436.1" /translation="MWTSQMIVAPAFVDAAAKDLATIGSAISRANAEALVPITALLPA GADDVSAAIAALFATHGQAYQELSAHAVAFHEQFVQLMSAGAA" CDS complement(2041887..2042213) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1833C" /product="conserved protein" /note="Mb1833c, -, len: 108 aa. Equivalent to Rv1804c, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap). Conserved hypothetical protein, similar to several hypothetical Mycobacterium tuberculosis proteins that may be exported (hydrophobic stretch at N-terminus) e.g. O07222|Rv1810|MTCY16F9.04C|Z96073 (118 aa), FASTA scores: opt: 361, E(): 2.3e-19, (53.5% identity in 101 aa overlap); Rv0622, Rv1690, and Rv3067, etc. Mb1833c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD2" /protein_id="SIU00437.1" /translation="MRVVSTLLSIPLMIGLAVPAHAGPSGDDAVFLASLERAGITYSH PDQAIASGKAVCALVESGESGLQVVNELRTRNPGFSMDGCCKFAAISAHVYCPHQITK TSVSAK" CDS complement(2042551..2042898) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1834C" /product="HYPOTHETICAL PROTEIN" /note="Mb1834c, -, len: 115 aa. Equivalent to Rv1805c, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3XZF2" /protein_id="SIU00438.1" /translation="MTASVVATSRERHSHKAAKQRACEITDFEPEGRFRVRKRRRGRI GTKRSSISDTDYRRDSFRSHLLTAGAHGDADAQHKGMTAQQTTELGTPLVRALAPHGV SGRSSRKPLGLNP" CDS 2042936..2043235 /codon_start=1 /transl_table=11 /gene="PE20" /locus_tag="BQ2027_MB1835" /product="pe family protein pe20" /note="Mb1835, PE20, len: 99 aa. Equivalent to Rv1806, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Member of the Mycobacterium tuberculosis PE family of gly-, ala-rich proteins, most similar to Rv1788|MTV049.10|AL022021 (99 aa), FASTA scores: opt: 334, E(): 4.7 e-15, (59.8% identity in 97 aa overlap). Mb1835 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZE2" /protein_id="SIU00439.1" /translation="MAFVLVCPDALAIAAGQLRHVGSVIAARNAVAAPATAELAPAAA DEVSALTATQFNFHAAMYQAVGAQAIAMNEAFVAMLGASADSYAATEAANIIAVS" CDS 2043250..2044461 /codon_start=1 /transl_table=11 /gene="PPE31" /locus_tag="BQ2027_MB1836" /product="ppe family protein ppe31" /note="Mb1836, PPE31, len: 403 aa. Equivalent to Rv1807, len: 399 aa, from Mycobacterium tuberculosis strain H37Rv, (99% identity in 399 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv1789|MTV049.11|AL022021 (393 aa), FASTA scores: opt: 1169, E(): 0, (49.5% identity in 399 aa overlap). Start site changed since original genome assembly submission" /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZE3" /protein_id="SIU00440.1" /translation="MTAALDFATLPPEINSARMYSGAGSAPMLAAASAWHGLSAELRA SALSYSSVLSTLTGEEWHGPASASMTAAAAPYVAWMSVTAVRAEQAGAQAEAAAAAYE AAFAATVPPPVIEANRAQLMALIATNVLGQNAPAIAATEAQYAEMWSQDAMAMYGYAG ASAAATQLTPFTEPVQTTNASGLAAQSAAIAHATGASAGAQQTTLSQLIAAIPSVLQG LSSSTAATSASGPSGLLGILGSGSSWLDKLWALLDPNSNFWNTIASSGLFLPSNTIAP FLGLLGGVAAADAAGDVLGEATSGGLGGALVAPLGSAGGLGGTVAAGLGNAATVGTLS VPPSWTAAAPLASPLGSALGGTPMVAPPPAVAAGMPGMPFGTMGGQGFGRAVPQYGFR PNFVARPPAAG" CDS 2044785..2046014 /codon_start=1 /transl_table=11 /gene="PPE32" /locus_tag="BQ2027_MB1837" /product="ppe family protein ppe32" /note="Mb1837, PPE32, len: 409 aa. Equivalent to Rv1808, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 409 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv1800|MTV049.22|AL022021 (655 aa), FASTA scores: opt: 1225, E(): 0, (55.1% identity in 423 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. TBparse score is 0.919. Protein product from Mb1837 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1837 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZC6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00441.1" /translation="MDFGALPPEINSGRMYAGPGSGPLLAAAAAWDALAAELYSAAAS YGSTIEGLTVAPWMGPSSITMAAAVAPYVAWISVTAGQAEQAGAQAKIAAGVYETAFA ATVPPPVIEANRALLMSLVATNIFGQNTPAIAATEAHYAEMWAQDAAAMYGYAGSSAT ASQLAPFSEPPQTTNPSATAAQSAVVAQAAGAAASSDITAQLSQLISLLPSTLQSLAT TATATSASAGWDTVLQSITTILANLTGPYSIIGLGAIPGGWWLTFGQILGLAQNAPGV AALLGPKAAAGALSPLAPLRGGYIADITPLGGGATGGIARAIYVGSLSVPQGWAEAAP VMRAVASVLPGTGAAPALAAEAPGALFGEMALSSLAGRALAGTAVRSGAGAARVAGGS VTEDVASTTTIIVIPAD" CDS 2046146..2046709 /codon_start=1 /transl_table=11 /gene="PPE33a" /locus_tag="BQ2027_MB1838" /product="PPE FAMILY PROTEIN [FIRST PART]" /note="Mb1838, PPE33a, len: 187 aa. Equivalent to 5' end of Rv1809, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to RV1802AL022021|MTV049.23 (463 aa), FASTA scores: opt: 1238, E(): 0, (51.2% identity in 471 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE33 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits PPE33 into 2 parts, PPE33a and PPE33b, with PPE33a being truncated. Mb1838 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD0" /protein_id="SIU00442.1" /translation="MDFGLQPPEITSGEMYLGPGAGPMLAAAVAWDGLAAELQSMAAS YASIVEGMASESWLGPSSAGMAAAAAPYVTWMSGTSAQAKAAADQARAAVVAYETAFA AVVPPPQIAANRSQLISLVATNIFGQNTAAIAATEAEYGEMWAQDTMAMFGYASSSAT ASRLTPFTAPPQTTNPSGLAGQAAATG" CDS 2046710..2047558 /pseudo /codon_start=1 /transl_table=11 /gene="PPE33b" /locus_tag="BQ2027_MB1839" /note="Mb1839, PPE33b, len: 282 aa. Equivalent to 3' end of Rv1809, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to RV1802AL022021|MTV049.23 (463 aa), FASTA scores: opt: 1238, E(): 0, (51.2% identity in 471 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE33 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t), splits PPE33 into 2 parts, PPE33a and PPE33b, with PPE33a being truncated. Mb1839 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing.;PPE FAMILY PROTEIN [SECOND PART]" CDS 2047797..2048153 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1840" /product="conserved protein" /note="Mb1840, -, len: 118 aa. Equivalent to Rv1810, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Conserved hypothetical protein, similar to several hypothetical Mycobacterium tuberculosis proteins that may be exported (possible N-terminal signal sequence) e.g. O53953|Rv1804c|MTV049.26c|AL022021 (108 aa), FASTA scores: opt: 361, E(): 9.6e-17, (53.5% identity in 101 aa overlap); Rv0622, and Rv1690, etc. Protein product from Mb1840 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1840 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J6" /protein_id="SIU00444.1" /translation="MQLQRTMGQCRPMRMLVALLLSAATMIGLAAPGKADPTGDDAAF LAALDQAGITYADPGHAITAAKAMCGLCANGVTGLQLVADLRDYNPGLTMDSAAKFAA IASGAYCPEHLEHHPS" CDS 2048307..2049011 /codon_start=1 /transl_table=11 /gene="mgtC" /locus_tag="BQ2027_MB1841" /product="POSSIBLE Mg2+ TRANSPORT P-TYPE ATPASE C MGTC" /note="Mb1841, -, len: 234 aa. Equivalent to Rv1811, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Possible mgtC, magnesium (Mg2+) transport P-type ATPase C (transmembrane protein) (EC 3.6.3.1), highly similar to many e.g. NP_442124.1|NC_000911 Mg2+ transport ATPase from Synechocystis sp. strain PCC 6803 (234 aa); NP_251248.1|NC_002516 probable transport protein from Pseudomonas aeruginosa (230 aa); P22037|ATMC_SALTY|STM3764 magnesium transport ATPase protein C from Salmonella typhimurium (231 aa), FASTA scores: opt: 545, E(): 4.1e-30, (42.3% identity in 220 aa overlap); N-terminus of NP_213315.1|NC_000918 Mg(2+) transport ATPase from Aquifex aeolicus (225 aa); etc. BELONGS TO THE MGTC / SAPB FAMILY,Mb1841 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y076" /db_xref="InterPro:IPR003416" /db_xref="UniProtKB/TrEMBL:A0A1R3Y076" /protein_id="SIU00445.1" /translation="MQTLTVADFALRLAVGVGCGAIIGLERQWRARMAGLRTNALVAT GATLFVLYAVATEDSSPTRVASYVVSGIGFLGGGVILREGFNVRGLNTAATLWCSAAV GVLAASGHLVFTLIGTGTIVAVHLLGRPLGRLVDRDNAVEDEGLQPYQVRVICRPKAE TYVRAHIVQRTSSNDITLRGIRTGPAGDDNITLTAHLLMVGHTPAKLERLVAELSLQP GVYAVHWYAGEHAQAE" CDS complement(2049021..2050223) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1842C" /product="PROBABLE DEHYDROGENASE" /note="Mb1842c, -, len: 400 aa. Equivalent to Rv1812c, len: 400 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 400 aa overlap). Probable dehydrogenase (EC 1.-.-.-), similar to other dehydrogenases/oxidases e.g. AE001947|AE001947_10 NADH dehydrogenase II of Deinococcus radiodurans (379 aa), FASTA scores: opt: 404, E(): 3.4e-18, (26.4% identity in 363 aa overlap) and DHNA_HAEIN|P44856 nadh dehydrogenase (EC 1.6.99.3) (444 aa), FASTA scores: opt: 200, E(): 8.5e-06, (23.3% identity in 258 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical dehydrogenases Rv0392c, and Rv1854c|MTCY359.19 ndh probable NADH dehydrogenase (31.5% identity in 321 aa overlap). Protein product from Mb1842c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1842c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZF5" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XZF5" /protein_id="SIU00446.1" /translation="MTRVVVIGSGFAGLWAALGAARRLDELAVPAGTVDVMVVSNKPF HDIRVRNYEADLSACRIPLGDVLGPAGVAHVTAEVTAIDADGRRVTTSTGASYSYDRL VLASGSHVVKPALPGLAEFGFDVDTYDGAVRLQQHLQGLAGGPLTSAAATVVVVGAGL TGIETACELPGRLHALFARGDGVTPRVVLIDHNPFVGSDMGLSARPVIEQALLDNGVE TRTGVSVAAVSPGGVTLSSGERLAAATVVWCAGMRASRLTEQLPVARDRLGRLQVDDY LRVIGVPAMFAAGDVAAARMDDEHLSVMSCQHGRPMGRYAGCNVINDLFDQPLLALRI PWYVTVLDLGSAGAVYTEGWERKVVSQGAPAKTTKQSINTRRIYPPLNGSRADLLAAA APRVQPRP" CDS complement(2050545..2050976) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1843C" /product="putative membrane protein" /note="Mb1843c, -, len: 143 aa. Equivalent to Rv1813c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein. Possibly a exported protein with potential N-terminal signal sequence. Similar to Q11050|Rv1269c|MTCY50.13 hypothetical protein from Mycobacterium tuberculosis (124 aa), (42.7% identity in 143 aa overlap). Protein product from Mb1843c detected using SWATH mass spectrometry. Mb1843c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64890" /db_xref="InterPro:IPR025240" /db_xref="UniProtKB/Swiss-Prot:P64890" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00447.1" /translation="MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMM SEIAGLPIPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTR CGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN" CDS 2051385..2052287 /codon_start=1 /transl_table=11 /gene="erg3" /locus_tag="BQ2027_MB1844" /product="MEMBRANE-BOUND C-5 STEROL DESATURASE ERG3 (STEROL-C5-DESATURASE)" /note="Mb1844, erg3, len: 300 aa. Equivalent to Rv1814, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). erg3, transmembrane C-5 sterol desaturase (EC 1.3.-.-) (see *), weak similarity to several e.g. ERG3_YEAST|P32353 c-5 sterol desaturase (365 aa), FASTA scores: opt: 154, E(): 0.0011, (22.9% identity in 288 aa overlap). BELONGS TO THE STEROL DESATURASE FAMILY. [* note work of Jackson, C.J., Lamb, D.C., Kelly, D.E., Kelly, S.L., Characterization of a sterol delta 5,6-desaturase homolog in Mycobacterium bovis (BCG). Submitted (JUN-2000) to the EMBL/GenBank/DDBJ databases]. Mb1844 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P68434" /db_xref="InterPro:IPR006694" /db_xref="UniProtKB/Swiss-Prot:P68434" /protein_id="SIU00448.1" /translation="MRDPVLFAIPCFLLLLILEWTAARKLESIETAATGQPRPASGAY LTRDSVASISMGLVSIATTAGWKSLALLGYAAIYAYLAPWQLSAHRWYTWVIAIVGVD LLYYSYHRIAHRVRLIWATHQAHHSSEYFNFATALRQKWNNSGEILMWVPLPLMGLPP WMVFCSWSLNLIYQFWVHTERIDRLPRWFEFVFNTPSHHRVHHGMDPVYLDKNYGGIL IIWDRLFGSFQPELFRPHYGLTKRVDTFNIWKLQTREYVAIVRDWRSATRLRDRLGYV FGPPGWEPRTIDKSNAAASLVTSR" CDS 2052392..2053057 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1845" /product="conserved protein" /note="Mb1845, -, len: 221 aa. Equivalent to Rv1815, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 221 aa overlap). Conserved hypothetical protein, similar to G473456 hypothetical protein from Mycobacterium fortuitum (255 aa), FASTA scores: opt: 182, E(): 3.2e-05, (29.6% identity in 230 aa overlap). Mb1845 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009003" /db_xref="UniProtKB/Swiss-Prot:P59981" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00449.1" /translation="MVRLVPRAFAATVALLAAGFSPATASADPVLVFPGMEIRQDNHV CTLGYVDPALKIAFTAGHCRGGGAVTSRDYKVIGHLRAFRDNTPSGSTVATHELIADY EAIVLADDVTASNILPSGRALESRPGVVLHPGQAVCHFGVSTGETCGTVESVNNGWFT MSHGVLSEKGDSGGPVYLAPDGGPAQIVGIFNSVWGGFPAAVSWRSTSEQVHADLGVT PLA" CDS 2053120..2053824 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1846" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1846, -, len: 234 aa. Equivalent to Rv1816, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Possible transcriptional regulatory protein. MEME analysis suggests similarity to putative Mycobacterium tuberculosis transcriptional regulators, Rv0653c, Rv0681. Contains helix-turn-helix motif at aa 38-59 (+4.30 SD). Protein product from Mb1846 detected using SWATH mass spectrometry. Mb1846 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67439" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR025996" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/Swiss-Prot:P67439" /protein_id="SIU00450.1" /translation="MCQTCRVGKRRDAREQIEAKIVELGRRQLLDHGAAGLSLRAIAR NLGMVSSAVYRYVSSRDELLTLLLVDAYSDLADTVDRARDDTVADSWSDDVIAIARAV RGWAVTNPARWALLYGSPVPGYHAPPDRTAGVATRVVGAFFDAIAAGIATGDIRLTDD VAPQPMSSDFEKIRQEFGFPGDDRVVTKCFLLWAGVVGAISLEVFGQYGADMLTDPGV VFDAQTRLLVAVLAEH" CDS 2054305..2054385 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1847" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1847, -, len: 26 aa. Equivalent to Rv1816A, len: 26 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 26 aa overlap). Conserved hypothetical protein. Mb1847 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZE9" /protein_id="SIU00451.1" /translation="MSRAGDDAVGVPPACGGRSDDEERRQ" CDS 2054459..2055922 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1848" /product="POSSIBLE FLAVOPROTEIN" /note="Mb1848, -, len: 487 aa. Equivalent to Rv1817, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 487 aa overlap). Possible flavoprotein, similar to G746486 flavoprotein subunit of fumarate reductase fad domain homologue (474 aa), FASTA scores: opt: 223, E(): 5.7e-07, (24.1% identity in 489 aa overlap); and AJ236923|SFR236923_3 soluble fumarate reductase of Shewanella frigidimarina ifcA (588 aa), FASTA scores: opt: 310, E(): 2.5e-11, (27.3% identity in 484 aa overlap). Mb1848 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD7" /protein_id="SIU00452.1" /translation="MSTDIPATVSAETVTSWSDDVDVTVIGFGIAGGCAAVSAAAAGA RVLVLERAAAAGGTTALAGGHFYLGGGTTVQLATGHPDSPEEMYKYLVAVSREPDHDK IRAYCDGSVEHFNWLEGLGFQFERSYFPGKAVIQPNTEGLMFTGNEKVWPFLELAVPA PRGHKVPVPGDTGGAAMVIDLLLKRAASLGIQIRYETGATELIVDGTGKVTGVMWKRF SETGAIKAKSVIIAAGGFVMNPDMVAKYTPKLAEKPFVLGNTYDDGLGIRLGVSAGGA TQHMDQMFITAPPYPPSILLTGIIVNKLGQRFVAEDSYHSRTAGFIMEQPDSAAYLIV DEAHLEHPKMPLVPLIDGWETVVEMEAALGIPPGNLAATLDRYNAYAARGADPDFHKQ PEFLAAQDNGPWGAFDMSLGKAMYAGFTLGGLATSVDGQVLRDDGAVVAGLYAVGACA SNIAQDGKGYASGTQLGEGSFFGRRAGAHAAARAQGM" CDS complement(2056042..2057547) /codon_start=1 /transl_table=11 /gene="PE_PGRS33" /locus_tag="BQ2027_MB1849C" /product="pe-pgrs family protein pe_pgrs33" /note="Mb1849c, PE_PGRS33, len: 501 aa. Equivalent to Rv1818c, len: 498 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 501 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see first citation), similar to many. Contains 2 x PS00583 pfkB family of carbohydrate kinases signature 1. Supposed localised to the cell surface. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp insertion (*-gccgccggc), leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (501 aa versus 498 aa). Mb1849c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3XZD5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00453.1" /translation="MSFVVTIPEALAAVATDLAGIGSTIGTANAAAAVPTTTVLAAAA DEVSAAMAALFSGHAQAYQALSAQAALFHEQFVRALTAGAGSYAAAEAASAAPLEGVL DVINAPALALLGRPLIGNGANGAPGTGANGGDGGILIGNGGAGGSGAAGMPGGNGGAA GLFGNGGAGGAGGNVASGTAGFGGAGGAGGLLYGAGGAGGAGGRAGGGVGGIGGAGGA GGNGGLLFGAGGAGSVGGLAADAGDGGAGGDGGLFFGVGGAGGAGGTGTNVTGGAGGA GGNGGLLFGAGGVGGVGGDGVAFLGTAPGGPGGAGGAGGLFGVGGAGGAGGIGLVGNG GAGGSGGSALLWGDGGAGGAGGVGSTTGGAGGAGGNAGLLVGAGGAGGAGALGGGATG VGGAGGNGGTAGLLFGAGGAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTG AKGGDGGAGGGAILVGNGGNGGNAGSGTPNGSAGTGGAGGLLGKNGMNGLP" CDS complement(2057682..2059601) /codon_start=1 /transl_table=11 /gene="baca" /locus_tag="BQ2027_MB1850C" /product="probable drug-transport transmembrane atp-binding protein abc transporter baca" /note="Mb1850c, -, len: 639 aa. Equivalent to Rv1819c, len: 639 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 639 aa overlap). Probable drugs-transport transmembrane ATP-binding protein ABC transporter (see citation below), equivalent to AL008609|MLCB1788.47 hypothetical ABC transporter from Mycobacterium leprae (638 aa), (74.9% identity in 634 aa overlap). Also similar to other transmembrane ATP-binding proteins e.g. Q57335|Y036_HAEIN hypothetical ABC transporter ATP-binding protein from Haemophilus influenzae (592 aa), FASTA scores: opt: 1235, E(): 2.8e-61, (40.8% identity in 623 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb1850c detected using SWATH mass spectrometry. Mb1850c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1K9" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011527" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036640" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1K9" /protein_id="SIU00454.1" /translation="MGPKLFKPSIDWSRAFPDSVYWVGKAWTISAICVLAILVLLRYL TPWGRQFWRITRAYFVGPNSVRVWLMLGVLLLSVVLAVRLNVLFSYQGNDMYTALQKA FEGIASGDGTVKRSGARGFWMSIGVFSVMAVLHVTRVMADIYLTQRFIIAWRVWLTHH LTQDWLDGRAYYRDLFIDETIDNPDQRIQQDVDIFTAGAGGTPNAPSNGTASTLLFGA VQSIISVISFTAILWNLSGTLNIFGVSIPRAMFWTVLVYVFVATVISFIIGRPLIWLS FRNEKLNAAFRYALVRLRDAAEAVGFYRGERVEGTQLQRRFTPVIDNYRRYVRRSIAF NGWNLSVSQTIVPLPWVIQAPRLFAGQIDFGDVGQTATSFGNIHDSLSFFRNNYDAFA SFRAAIIRLHGLVDANEKGRALPAVLTRPSDDESVELNDIEVRTPAGDRLIDPLDVRL DRGGSLVITGRSGAGKTTLLRSLAELWPYASGTLHRPGGENETMFLSQLPYVPLGTLR DVVCYPNSAAAIPDATLRDTLTKVALAPLCDRLDEERDWAKVLSPGEQQRVAFARILL TKPKAVFLDESTSALDTGLEFALYQLLRSELPDCIVVSVSHRPALERLHENQLELLGG GQWRLAPVEAAPAEV" CDS 2059672..2061315 /codon_start=1 /transl_table=11 /gene="ilvG" /locus_tag="BQ2027_MB1851" /product="Probable Acetolactate synthase ilvG (Acetohydroxy-acid synthase)(ALS)" /note="Mb1851, ilvG, len: 547 aa. Equivalent to Rv1820, len: 547 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 547 aa overlap). Probable ilvG, acetolactate synthase (EC 4.1.3.18). Equivalent to AL008609|MLCB1788.46c ilvG from Mycobacterium leprae (548 aa) (86.1% identity in 548 aa overlap). Similar to ILVB_KLEPN|P27696 (559 aa), FASTA scores: opt: 660, E(): 2.9e-34, (29.1% identity in 549 aa overlap). Also similar to other Mycobacterium tuberculosis Ilv proteins e.g. Rv3003c (ilvB), etc. Contains PS00187 Thiamine pyrophosphate enzymes signature. Protein product from Mb1851 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1851 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66947" /db_xref="InterPro:IPR000399" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR012000" /db_xref="InterPro:IPR012001" /db_xref="InterPro:IPR029035" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/Swiss-Prot:P66947" /protein_id="SIU00455.1" /translation="MSTDTAPAQTMHAGRLIARRLKASGIDTVFTLSGGHLFSIYDGC REEGIRLIDTRHEQTAAFAAEGWSKVTRVPGVAALTAGPGITNGMSAMAAAQQNQSPL VVLGGRAPALRWGMGSLQEIDHVPFVAPVARFAATAQSAENAGLLVDQALQAAVSAPS GVAFVDFPMDHAFSMSSDNGRPGALTELPAGPTPAGDALDRAAGLLSTAQRPVIMAGT NVWWGHAEAALLRLVEERHIPVLMNGMARGVVPADHRLAFSRARSKALGEADVALIVG VPMDFRLGFGGVFGSTTQLIVADRVEPAREHPRPVAAGLYGDLTATLSALAGSGGTDH QGWIEELATAETMARDLEKAELVDDRIPLHPMRVYAELAALLERDALVVIDAGDFGSY AGRMIDSYLPGCWLDSGPFGCLGSGPGYALAAKLARPQRQVVLLQGDGAFGFSGMEWD TLVRHNVAVVSVIGNNGIWGLEKHPMEALYGYSVVAELRPGTRYDEVVRALGGHGELV SVPAELRPALERAFASGLPAVVNVLTDPSVAYPRRSNLA" CDS 2061330..2063756 /codon_start=1 /transl_table=11 /gene="secA2" /locus_tag="BQ2027_MB1852" /product="possible preprotein translocase atpase seca2" /note="Mb1852, secA2, len: 808 aa. Equivalent to Rv1821, len: 808 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 808 aa overlap). Possible secA2, preprotein translocase, component of secretion apparatus, similar to several preprotein translocases e.g. P28366|SECA_BACSU preprotein translocase secA subunit from Bacillus subtilis (841 aa), FASTA scores: opt: 1424, E(): 0, (35.9% identity in 786 aa overlap). Equivalent to AL008609|MLCB1788.45 Preprotein translocase SecA 2 from Mycobacterium leprae (778 aa) (87.1% identity in 780 aa overlap). Also similar to Rv3240c|MTCY20B11.15c secA preprotein translocase from Mycobacterium tuberculosis (949 aa). COULD BE PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732. Protein product from Mb1852 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1852 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66786" /db_xref="InterPro:IPR000185" /db_xref="InterPro:IPR011115" /db_xref="InterPro:IPR011116" /db_xref="InterPro:IPR011130" /db_xref="InterPro:IPR014018" /db_xref="InterPro:IPR020937" /db_xref="InterPro:IPR026389" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036266" /db_xref="InterPro:IPR036670" /db_xref="UniProtKB/Swiss-Prot:P66786" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00456.1" /translation="MNVHGCPRIAACRCTDTHPRGRPAFAYRWFVPKTTRAQPGRLSS RFWRLLGASTEKNRSRSLADVTASAEYDKEAADLSDEKLRKAAGLLNLDDLAESADIP QFLAIAREAAERRTGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYALAG RHVHVVTINDYLARRDAEWMGPLLDAMGLTVGWITADSTPDERRTAYDRDVTYASVNE IGFDVLRDQLVTDVNDLVSPNPDVALIDEADSVLVDEALVPLVLAGTTHRETPRLEII RLVAELVGDKDADEYFATDSDNRNVHLTEHGARKVEKALGGIDLYSEEHVGTTLTEVN VALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETTETG EVLDTITVQALINRYATVCGMTGTALAAGEQLRQFYQLGVSPIPPNKPNIREDEADRV YITTAAKNDGIVEHITEVHQRGQPVLVGTRDVAESEELHERLVRRGVPAVVLNAKNDA EEARVIAEAGKYGAVTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVVGTGRHHTE RLDNQLRGRAGRQGDPGSSVFFSSWEDDVVAANLDHNKLPMATDENGRIVSPRTGSLL DHAQRVAEGRLLDVHANTWRYNQLIAQQRAIIVERRNTLLRTVTAREELAELAPKRYE ELSDKVSEERLETICRQIMLYHLDRGWADHLAYLADIRESIHLRALGRQNPLDEFHRM AVDAFASLAADAIEAAQQTFETANVLDHEPGLDLSKLARPTSTWTYMVNDNPLSDDTL SALSLPGVFR" CDS 2063953..2064582 /codon_start=1 /transl_table=11 /gene="pgsA2" /locus_tag="BQ2027_MB1853" /product="PROBABLE CDP-DIACYLGLYCEROL--GLYCEROL-3- PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE PGSA2 (PGP SYNTHASE) (PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE) (3-PHOSPHATIDYL-1'-GLYCEROL-3'PHOSPHATE SYNTHASE)" /note="Mb1853, pgsA2, len: 209 aa. Equivalent to Rv1822, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Probable pgsA2, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyl-transferase (EC 2.7.8.5) (see citation below), integral membrane protein, equivalent to AL008609|MLCB1788_17 phosphatidyltransferase from Mycobacterium leprae (206 aa), FASTA score: (76.6% identity in 205 aa overlap). Also highly similar or similar to others e.g. CAB88885.1|AL353861 putative CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyl-transferase from Streptomyces coelicolor (215 aa); AAC44003.1|U29587 phosphatidylglycerol phosphate synthase from Rhodobacter sphaeroides (227 aa); NP_405431.1|NC_003143 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase from Yersinia pestis (182 aa); P06978|PGSA_ECOLI CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase from Escherichia coli (181 aa), FASTA scores: opt: 252, E(): 2.8e-09, (29.7% identity in 175 aa overlap); etc. Also similar to Rv2746c|PGSA3|MTV002.11c CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE (PGP SYNTHASE) from Mycobacterium tuberculosis (209 aa). Contains PS00379 CDP-alcohol phosphatidyltransferases signature; and PS00075 Dihydrofolate reductase signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY. Protein product from Mb1853 detected using SWATH mass spectrometry. Mb1853 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63754" /db_xref="InterPro:IPR000462" /db_xref="InterPro:IPR004570" /db_xref="UniProtKB/Swiss-Prot:P63754" /protein_id="SIU00457.1" /translation="MEPVLTQNRVLTVPNMLSVIRLALIPAFVYVVLSAHANGWGVAI LVFSGVSDWADGKIARLLNQSSRLGALLDPAVDRLYMVTVPIVFGLSGIVPWWFVLTL LTRDALLAGTLPLLWSRGLSALPVTYVGKAATFGFMVGFPTILLGQCDPLWSHVLLAC GWAFLIWGMYAYLWAFVLYAVQMTMVVRQMPKLKGRAHRPAAQNAGERG" CDS 2064575..2065498 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1854" /product="UPF0749 protein Rv1823" /note="Mb1854, -, len: 307 aa. Equivalent to Rv1823, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 aa overlap). Conserved hypothetical protein, similar to P71582|MTCY10H4.12|RV0012 hypothetical protein CY10H4.12 from Mycobacterium tuberculosis (262 aa), FASTA scores: opt: 304, E(): 1.5e-12, (30.1% identity in 246 aa overlap). Protein product from Mb1854 detected using SWATH mass spectrometry. Mb1854 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64892" /db_xref="InterPro:IPR010273" /db_xref="UniProtKB/Swiss-Prot:P64892" /protein_id="SIU00458.1" /translation="MAESDRLLGGYDPNAGYSAHAGAQPQRIPVPSLLRALLSEHLDA GYAAVAAERERAAAPRCWQARAVSWMWQALAATLVAAVFAAAVAQARSVAPGVRAAQQ LLVASVRSTQAAATTLAQRRSTLSAKVDDVRRIVLADDAEGQRLLARLDVLSLAAASA PVVGPGLTVTVTDPGASPNLSDVSKQRVSGSQQIILDRDLQLVVNSLWESGAEAISID GVRIGPNVTIRQAGGAILVDNNPTSSPYTILAVGPPHAMQDVFDRSAGLYRLRLLETS YGVGVSVNVGDGLALPAGATRDVKFAKQIGP" CDS 2065527..2065892 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1855" /product="Small basic protein Sbp" /note="Mb1855, -, len: 121 aa. Equivalent to Rv1824, len: 121 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 121 aa overlap). Conserved hypothetical membrane protein similar to P28265|SBP_BACSU sbp protein from Bacillus subtilis (121 aa), FASTA scores: opt: 261, E(): 1.9e-12, (38.9% identity in 113 aa overlap). Mb1855 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64894" /db_xref="InterPro:IPR009709" /db_xref="UniProtKB/Swiss-Prot:P64894" /protein_id="SIU00459.1" /translation="MGSDTAWSPARMIGIAALAVGIVLGLVFHPGVPEVIQPYLPIAV VAALDAVFGGLRAYLERIFDPKVFVVSFVFNVLVAALIVYVGDQLGVGTQLSTAIIVV LGIRIFGNTAALRRRLFGA" CDS 2065909..2066787 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1856" /product="UPF0749 protein Rv1825" /note="Mb1856, -, len: 292 aa. Equivalent to Rv1825, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 292 aa overlap). Conserved hypothetical protein, weak similarity to Mycobacterium tuberculosis hypothetical proteins Q50610|MTCY1A11.20C|Rv1823|Z78020 (307 aa), FASTA scores: opt: 182, E(): 0.00044, (29.9% identity in 204 aa overlap); and Rv0012. Has a hydrophobic stretch, TMhelix from aa 67 to 85 Protein product from Mb1856 detected using SWATH mass spectrometry. Mb1856 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64896" /db_xref="InterPro:IPR010273" /db_xref="UniProtKB/Swiss-Prot:P64896" /protein_id="SIU00460.1" /translation="MSENRPEPVAAETSAATTARHSQADAGAHDAVRRGRHELPADHP RSKVGPLRRTRLTEILRGGRSRLVFGTLAILLCLVLGVAIVTQVRQTDSGDSLETARP ADLLVLLDSLRQREATLNAEVIDLQNTLNALQASGNTDQAALESAQARLAALSILVGA VGATGPGVMITIDDPGPGVAPEVMIDVINELRAAGAEAIQINDAHRSVRVGVDTWVVG VPGSLTVDTKVLSPPYSILAIGDPPTLAAAMNIPGGAQDGVKRVGGRMVVQQADRVDV TALRQPKQHQYAQPVK" CDS 2066825..2067229 /codon_start=1 /transl_table=11 /gene="gcvH" /locus_tag="BQ2027_MB1857" /product="PROBABLE GLYCINE CLEAVAGE SYSTEM H PROTEIN GCVH" /note="Mb1857, gcvH, len: 134 aa. Equivalent to Rv1826, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 134 aa overlap). Probable gcvH, glycine cleavage system H protein, highly similar to GCSH_ECOLI|P23884 glycine cleavage system H protein from Escherichia coli (129 aa), FASTA scores: opt: 428, E(): 2.2e-22, (47.8% identity in 134 aa overlap). Equivalent to MLCB1788.37c gcvH from Mycobacterium leprae (78.4% identity in 134 aa overlap). Contains PS00189 2-oxo acid dehydrogenases acyltransferase component lipoyl binding site. BELONGS TO THE GCVH FAMILY. Protein product from Mb1857 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1857 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZG8" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR002930" /db_xref="InterPro:IPR003016" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR017453" /db_xref="InterPro:IPR033753" /db_xref="UniProtKB/Swiss-Prot:Q7TZG8" /protein_id="SIU00461.1" /translation="MSDIPSDLHYTAEHEWIRRSGDDTVRVGITDYAQSALGDVVFVQ LPVIGTAVTAGETFGEVESTKSVSDLYAPISGKVSAVNSDLDGTPQLVNSDPYGAGWL LDIQVDSSDVAALESALTTLLDAEAYRGTLTE" CDS 2067469..2067957 /codon_start=1 /transl_table=11 /gene="garA" /locus_tag="BQ2027_MB1858" /product="Glycogen accumulation regulator GarA" /note="Mb1858, cfp17, len: 162 aa. Equivalent to Rv1827, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). cfp17, conserved hypothetical protein (see citation below), equivalent to O32919|MLCB1788.36c hypothetical protein from Mycobacterium leprae (162 aa), FASTA scores: opt: 888, E(): 0, (87.0% identity in 161 aa overlap). Protein product from Mb1858 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1858 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR008984" /db_xref="PDB:2KKL" /db_xref="UniProtKB/Swiss-Prot:P64898" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00462.1" /translation="MTDMNPDIEKDQTSDEVTVETTSVFRADFLSELDAPAQAGTESA VSGVEGLPPGSALLVVKRGPNAGSRFLLDQAITSAGRHPDSDIFLDDVTVSRRHAEFR LENNEFNVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPKQGEDDGSTG GP" CDS 2067954..2068697 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1859" /product="Transcriptional regulator, MerR family" /note="Mb1859, -, len: 247 aa. Equivalent to Rv1828, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Conserved hypothetical protein, equivalent to O32918|MLCB1788.35c|AL008609 hypothetical protein from Mycobacterium leprae (251 aa), FASTA scores: opt: 1397, E(): 0, (87.6% identity in 251 aa overlap). Protein product from Mb1859 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1859 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67670" /db_xref="InterPro:IPR000551" /db_xref="InterPro:IPR009061" /db_xref="UniProtKB/Swiss-Prot:P67670" /protein_id="SIU00463.1" /translation="MSAPDSPALAGMSIGAVLDLLRPDFPDVTISKIRFLEAEGLVTP RRASSGYRRFTAYDCARLRFILTAQRDHYLPLKVIRAQLDAQPDGELPPFGSPYVLPR LVPVAGDSAGGVGSDTASVSLTGIRLSREDLLERSEVADELLTALLKAGVITTGPGGF FDEHAVVILQCARALAEYGVEPRHLRAFRSAADRQSDLIAQIAGPLVKAGKAGARDRA DDLAREVAALAITLHTSLIKSAVRDVLHR" CDS 2068816..2069310 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1860" /product="conserved protein" /note="Mb1860, -, len: 164 aa. Equivalent to Rv1829, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 164 aa overlap). Conserved hypothetical protein, equivalent to O32917|MLCB1788.34|AL008609 Hypothetical protein from Mycobacterium leprae (164 aa), FASTA scores: opt: 1011, E(): 0, (95.1% identity in 164 aa overlap). Also present in Aquifex aeolicus, etc. Protein product from Mb1860 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1860 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1L8" /db_xref="InterPro:IPR003729" /db_xref="InterPro:IPR036104" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1L8" /protein_id="SIU00464.1" /translation="MGEVRVVGIRVEQPQNQPVLLLREANGDRYLPIWIGQSEAAAIA LEQQGVEPPRPLTHDLIRDLIAALGHSLKEVRIVDLQEGTFYADLIFDCNIKVSARPS DSVAIALRVGVPIYVEEAVLAQAGLLIPDESDEEATTAVREDEVEKFKEFLDSVSPDD FKAT" CDS 2069602..2070279 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1861" /product="Predicted transcriptional regulators" /note="Mb1861, -, len: 225 aa. Equivalent to Rv1830, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 225 aa overlap). Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical protein MLCB1788.33c|AL008609|O32916 (231 aa), FASTA scores: opt: 1307, E(): 0, (89.6% identity in 231 aa overlap). Protein product from Mb1861 detected using SWATH mass spectrometry. Mb1861 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67672" /db_xref="InterPro:IPR000551" /db_xref="InterPro:IPR009061" /db_xref="UniProtKB/Swiss-Prot:P67672" /protein_id="SIU00465.1" /translation="MTQLVTRARSARGSTLGEQPRQDQLDFADHTGTAGDGNDGAAAA SGPVQPGLFPDDSVPDELVGYRGPSACQIAGITYRQLDYWARTSLVVPSIRSAAGSGS QRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVQDLANITLFSDGTTVYEC TSAEEVVDLLQGGQGVFGIAVSGAMRELTGVIADFHGERADGGESIAAPEDELASRRK HRDRKIG" CDS 2070332..2070589 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1862" /product="HYPOTHETICAL PROTEIN" /note="Mb1862, -, len: 85 aa. Equivalent to Rv1831, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Hypothetical unknown protein. Mb1862 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64900" /protein_id="SIU00466.1" /translation="MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTA RDYDASGKRWRPLAVLTRRWGKAIHLTVDRVAESLRRLACR" CDS 2070638..2073463 /codon_start=1 /transl_table=11 /gene="gcvB" /locus_tag="BQ2027_MB1863" /product="Probable glycine dehydrogenase gcvB (Glycine decarboxylase) (Glycine cleavage system P-protein)" /note="Mb1863, gcvB, len: 941 aa. Equivalent to Rv1832, len: 941 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 941 aa overlap). Probable gcvB, glycine dehydrogenase [decarboxylating] (EC 1.4.4.2), highly similar to GCSP_ECOLI|P33195 glycine dehydrogenase (decarboxylating) from Escherichia coli (957 aa), FASTA scores: opt: 2194, E(): 0, (55.4% identity in 961 aa overlap). THE GLYCINE CLEAVAGE SYSTEM IS COMPOSED OF FOUR PROTEINS: P, T, L, AND H. Protein product from Mb1863 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1863 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VET8" /db_xref="InterPro:IPR003437" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR020581" /db_xref="UniProtKB/Swiss-Prot:Q7VET8" /protein_id="SIU00467.1" /translation="MSDHSTFADRHIGLDSQAVATMLAVIGVDSLDDLAVKAVPAGIL DTLTDTGAAPGLDSLPPAASEAEALAELRALADANTVAVSMIGQGYYDTHTPPVLLRN IIENPAWYTAYTPYQPEISQGRLEALLNFQTLVTDLTGLEIANASMLDEGTAAAEAMT LMHRAARGPVKRVVVDADVFTQTAAVLATRAKPLGIEIVTADLRAGLPDGEFFGAIAQ LPGASGRITDWSALVQQAHDRGALVAVGADLLALTLIAPPGEIGADVAFGTTQRFGVP MGFGGPHAGYLAVHAKHARQLPGRLVGVSVDSDGTPAYRLALQTREQHIRRDKATSNI CTAQVLLAVLAAMYASYHGAGGLTAIARRVHAHAEAIAGALGDALVHDKYFDTVLARV PGRADEVLARAKANGINLWRVDADHVSVACDEATTDTHVAVVLDAFGVAAAAPAHADI ATRTSEFLTHPAFTQYRTETSMMRYLRALADKDIALDRSMIPLGSCTMKLNAAAEMES ITWPEFGRQHPFAPASDTAGLRQLVADLQSWLVLITGYDAVSLQPNAGSQGEYAGLLA IHEYHASRGEPHRDICLIPSSAHGTNAASAALAGMRVVVVDCHDNGDVDLDDLRAKVG EHAERLSALMITYPSTHGVYEHDIAEICAAVHDAGGQVYVDGANLNALVGLARPGKFG GDVSHLNLHKTFCIPHGGGGPGVGPVAVRAHLAPFLPGHPFAPELPKGYPVSSAPYGS ASILPITWAYIRMMGAEGLRAASLTAITSANYIARRLDEYYPVLYTGENGMVAHECIL DLRGITKLTGITVDDVAKRLADYGFHAPTMSFPVAGTLMVEPTESESLAEVDAFCEAM IGIRAEIDKVGAGEWPVDDNPLRGAPHTAQCLLASDWDHPYTREQAAYPLGTAFRPKV WPAVRRIDGAYGDRNLVCSCPPVEAFA" CDS complement(2073690..2074550) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1864C" /product="Possible haloalkane dehalogenase" /note="Mb1864c, -, len: 286 aa. Equivalent to Rv1833c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Possible haloalkane dehalogenase (EC 3.8.1.5). Similar to several haloalkane dehalogenase e.g. CAB45532.1|AJ243259 from Mycobacterium bovis (300 aa); also similar to LINB_PSEPA|P51698 1,3,4,6-tetrachloro-1,4-cyclohexadien from Pseudomonas paucimobilis (295 aa), FASTA scores: opt: 314, E(): 1.5e-13, (33.1% identity in 281 aa overlap). Protein product from Mb1864c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1864c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64304" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR023489" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P64304" /protein_id="SIU00468.1" /translation="MSIDFTPDPQLYPFESRWFDSSRGRIHYVDEGTGPPILLCHGNP TWSFLYRDIIVALRDRFRCVAPDYLGFGLSERPSGFGYQIDEHARVIGEFVDHLGLDR YLSMGQDWGGPISMAVAVERADRVRGVVLGNTWFWPADTLAMKAFSRVMSSPPVQYAI LRRNFFVERLIPAGTEHRPSSAVMAHYRAVQPNAAARRGVAEMPKQILAARPLLARLA REVPATLGTKPTLLIWGMKDVAFRPKTIIPRLSATFPDHVLVELPNAKHFIQEDAPDR IAAAIIERFG" CDS 2074591..2075457 /codon_start=1 /transl_table=11 /gene="lipz" /locus_tag="BQ2027_MB1865" /product="Probable hydrolase" /note="Mb1865, -, len: 288 aa. Equivalent to Rv1834, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 288 aa overlap). Probable hydrolase (EC 3.-.-.-), some similarity to haloalkane dehalogenases and D16262 hypothetical 38.9 kd protein (335 aa), FASTA scores: opt: 507, E(): 7.6e-28, (33.0% identity in 300 aa overlap). Mb1865 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZJ0" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ0" /protein_id="SIU00469.1" /translation="MTSPSVREWRDGGRWLPTAVGKVFVRSGPGDTPTMLLLHGYPSS SFDFRAVIPHLTGQAWVTMDFLGFGLSDKPRPHRYSLLEQAHLVETVVAHTVTGAVVV LAHDMGTSVTTELLARDLDGRLPFDLRRAVLSNGSVILERASLRPIQKVLRSPLGPVA ARLVSRGGFTRGFGRIFSPAHPLSAQEAQAQWELLCYNDGNRIPHLLISYLDERIRHA QRWHGAVRDWPKPLGFVWGLDDPVATTNVLNGLRELRPSAAVVELPGLGHYPQAEAPK AYAEAALSLLVD" CDS complement(2075462..2077348) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1866C" /product="Predicted acyl esterases" /note="Mb1866c, -, len: 628 aa. Equivalent to Rv1835c, len: 628 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 628 aa overlap). Conserved hypothetical protein, some similarity to putative acylases e.g. G216374 glutaryl 7-aca acylase precursor (634 aa) FASTA scores, opt: 202, E(): 3.5e-06, (25.1% identity in 669 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv2800 and Rv1215c. Protein product from Mb1866c detected using SWATH mass spectrometry. Mb1866c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5F6" /db_xref="InterPro:IPR000383" /db_xref="InterPro:IPR005674" /db_xref="InterPro:IPR008979" /db_xref="InterPro:IPR013736" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0A5F6" /protein_id="SIU00470.1" /translation="MTRRGGSDAAWYSAPDQRSAYPRYRGMRYSSCYVTMRDGVRIAI DLYLPAGLTSAARLPAILHQTRYYRSLQLRWPLRMLLGGKPLQHIAADKRRRRRFVAS GYAWVDVDVRGSGASFGARVCEWSSDEIRDGAEIVDWIVRQPWCNGTVAALGNSYDGT SAELLLVNQHPAVRVIAPCFSLFDVYTDIAFPGGIHAAWFTDTWGRYNEALDRNALHE VVGWWAKLPVTGMQPVQEDRDRSLRDGAIAAHRGNYDVHQIAGSLTFRDDVSASDPYR GQPDARLEPIGTPIESGSINLISPHNYWRDVQASGAAIYSYSGWFDGGYAHAAIKRFL TVSTPGSHLILGPWNHTGGWRVDPLRGLSRPDFDHDGELLRFIDHHVKGADTGIGSEP PVHYFTMVENRWKSADTWPPPATTQSYYLSADRQLRPDAPDCDSGADEYVVDQTAGTG ERSRWRSQVGIGGHVCYPDRKAQDAKLLTYTSAPLDHPLEVTGHVVVTLFITSTSSDG TFFVYLEDVDPRGRVAYITEGQLRAIHRRLSDGPPPYRQVVPYRTFASGDAWPLVPGE IARLTFDLLPTSYLFQPGHRIRIAIAGADASHFAILPGCAPTVRVYRSRMHASRIDLP VIQP" CDS complement(2077364..2079397) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1867C" /product="Von Willebrand factor (vWF) type A domain-containing protein" /note="Mb1867c, -, len: 677 aa. Equivalent to Rv1836c, len: 677 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 677 aa overlap). Conserved hypothetical protein. Equivalent to MLCB1788.28|AL008609 hypothetical protein from Mycobacterium leprae (710 aa), FASTA scores: opt: 2938, E(): 0, (66.0% identity in 714 aa overlap). Contains PS00036 bZIP transcription factors basic domain signature. Protein product from Mb1867c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1867c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZG5" /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/TrEMBL:A0A1R3XZG5" /protein_id="SIU00471.1" /translation="MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDD GPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDW QAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWCFFGDALSNRSHTAAARCVGGKDT VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG QPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPG LQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVR TLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGS WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKP PSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLS NVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQ YSSGGGAVSFTTLRLIYQEMLANYHVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSA DPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS" CDS complement(2079517..2081742) /codon_start=1 /transl_table=11 /gene="glcB" /locus_tag="BQ2027_MB1868C" /product="malate synthase g glcb" /note="Mb1868c, glcB, len: 741 aa. Equivalent to Rv1837c, len: 741 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 741 aa overlap). Probable glcB, malate synthase G (EC 4.1.3.2) (see citations below), highly similar to MASY_CORGL|P42450 malate synthase (738 aa), FASTA score: opt: 2961, E(): 0, (61.3% identity in 724 aa overlap). BELONGS TO THE MALATE SYNTHASE G FAMILY. Protein product from Mb1868c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1868c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5J5" /db_xref="InterPro:IPR001465" /db_xref="InterPro:IPR006253" /db_xref="InterPro:IPR011076" /db_xref="InterPro:IPR023310" /db_xref="UniProtKB/Swiss-Prot:P0A5J5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00472.1" /translation="MTDRVSVGNLRIARVLYDFVNNEALPGTDIDPDSFWAGVDKVVA DLTPQNQALLNARDELQAQIDKWHRRRVIEPIDMDAYRQFLTEIGYLLPEPDDFTITT SGVDAEITTTAGPQLVVPVLNARFALNAANARWGSLYDALYGTDVIPETDGAEKGPTY NKVRGDKVIAYARKFLDDSVPLSSGSFGDATGFTVQDGQLVVALPDKSTGLANPGQFA GYTGAAESPTSVLLINHGLHIEILIDPESQVGTTDRAGVKDVILESAITTIMDFEDSV AAVDAADKVLGYRNWLGLNKGDLAAAVDKDGTAFLRVLNRDRNYTAPGGGQFTLPGRS LMFVRNVGHLMTNDAIVDTDGSEVFEGIMDALFTGLIAIHGLKASDVNGPLINSRTGS IYIVKPKMHGPAEVAFTCELFSRVEDVLGLPQNTMKIGIMDEERRTTVNLKACIKAAA DRVVFINTGFLDRTGDEIHTSMEAGPMVRKGTMKSQPWILAYEDHNVDAGLAAGFSGR AQVGKGMWTMTELMADMVETKIAQPRAGASTAWVPSPTAATLHALHYHQVDVAAVQQG LAGKRRATIEQLLTIPLAKELAWAPDEIREEVDNNCQSILGYVVRWVDQGVGCSKVPD IHDVALMEDRATLRISSQLLANWLRHGVITSADVRASLERMAPLVDRQNAGDVAYRPM APNFDDSIAFLAAQELILSGAQQPNGYTEPILHRRRREFKARAAEKPAPSDRAGDDAA R" CDS complement(2082018..2082413) /codon_start=1 /transl_table=11 /gene="vapc13" /locus_tag="BQ2027_MB1869C" /product="possible toxin vapc13" /note="Mb1869c, -, len: 131 aa. Equivalent to Rv1838c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein. Part of 14-membered Mycobacterium tuberculosis protein family with Rv2863|MTV003.09|AL008883 (126 aa), FASTA scores: opt: 293, E(): 1.5e-14, (38.2% identity in 123 aa overlap); Rv0749, Rv0277c, Rv2530c, etc. Also similar to AJ248288|CNSPAX06_181 Pyrococcus abyssi complete genome (136 aa), FASTA scores: opt: 197, E(): 2.2e-07, (33. 1% identity in 133 aa overlap). Protein product from Mb1869c detected using SWATH mass spectrometry. Mb1869c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64902" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P64902" /protein_id="SIU00473.1" /translation="MILVDSNIPMYLVGASHPHKLDAQRLLESALSGGERLVTDAEVL QEICHRYVAIKRREAIQPAFDAIIGVVDEVLPIERTDVEHARDALLRYQTLSARDALH IAVMAHHDITRLMSFDRGFDSYPGIKRLA" CDS complement(2082410..2082673) /codon_start=1 /transl_table=11 /gene="vapb13" /locus_tag="BQ2027_MB1870C" /product="possible antitoxin vapb13" /note="Mb1870c, -, len: 87 aa. Equivalent to Rv1839c, len: 87 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 aa overlap). Conserved hypothetical protein. Some similarity to G217008 CHO-ORF1 (279 aa), FASTA scores: opt: 86, E(): 13, (38.7% identity in 62 aa overlap). Mb1870c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1M5" /protein_id="SIU00474.1" /translation="MSKRLQVLLDPDEWEELREIARRHRTTVSEWVRRTLREAREREP RGDLDMKLRSVRAAARHEFPTADVEQMLEEIERGRGAEREGSR" CDS complement(2082732..2084132) /codon_start=1 /transl_table=11 /gene="PE_PGRS34" /locus_tag="BQ2027_MB1871C" /product="pe-pgrs family protein pe_pgrs34" /note="Mb1871c, PE_PGRS34, len: 466 aa. Similar to Rv1840c, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (90.485% identity in 515 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to many e.g. Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 1693, E(): 0, (53.1% identity in 612 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame deletion of 147 bp leads to a longer protein compared to its homolog in Mycobacterium tuberculosis strain H37Rv (466 aa versus 518 aa). Mb1871c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0B4" /protein_id="SIU00475.1" /translation="MSFVVAAPEVVVAAASDLAGIGSAIGAANAAAAVPTMGVLAAGA DEVSAAVADLFGAHAQAYQALSAQAALFHEQFVHAMTAGAGAYAGAEAADAAALDVLN GPFQALFGRPLIGDGANGAPGQPGGPGGLLYGNGGNGGNGGIGQPGGAGGDAGLIGNG GNGGIGGPGATGLAGGAGGLGKAGFAGGAGGTGGTGGLLYGNGGNGGNVPSGAADGGA GGDARLIGNGGDGGSVGAAPTGIGNGGNGGNGGWLYGDGGSGGSTLQGFSDGGTGGNA GMFGDGGNGGFSFFDGNGGDGGTGGTLIGNGGDGGNSVQTDGFLRGHGGDGGNAVGLI GNGGAGGAGSAGTGVFAPGGGSGGNGGNGALLVGNGGAGGSGGPTQIPSVAVPVTGAG GTGGNGGTAGLIGNGGNGGAAGVSGDGTPGTGGNGGYAQLIGDGGDGGPGDSGGPGGS GGTGGTLAGQNGSPGG" CDS complement(2084295..2085344) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1872C" /product="Magnesium and cobalt efflux protein CorC" /note="Mb1872c, -, len: 349 aa. Equivalent to Rv1841c, len: 345 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 349 aa overlap). Conserved hypothetical membrane protein. Some similarity to O07585|YHDP_BACSU HYPOTHETICAL 49.9 KD PROTEIN from Bacillus subtilis (444 aa), FASTA scores: opt: 620, E(): 0, (31.1% identity in 350 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. Rv1842c, Rv2366c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 12 bp in-frame insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (349 aa versus 345 aa). Protein product from Mb1872c detected using SWATH mass spectrometry. Mb1872c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZI0" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR002550" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI0" /protein_id="SIU00476.1" /translation="MDVLSAVLLALLLIGANAFFVGAEFALISARRDRLEALAEQGKA TAVTVIRAGEQLPAMLTGAQLGVTVSSILLGRVGEPAVVKLLQLSFGLSGVPPALLHT LSLAVALAIVVALHVLLGEMVPKNIALAGPERTAMLLVPPYLVYVRLARPFIAFYNNC ANAILRLVGVQPKDELDIAVSTAELSEMIAESLSEGLLDHEEHTRLTRALRIRTRLVA DVAVPLVNIRAVQVSAVGSGPTIGGVEQALAQTGYSRFPVVDRGGRFIGYLHIKDVLT LGDNPQTVIDLAVVRPLPRVPQSLPLADALSRMRRINSHLALVTADNGSVVGMVALED VVEDLVGTMRDGTHR" CDS complement(2085344..2086711) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1873C" /product="UPF0053 protein Rv1842c/MT1890" /note="Mb1873c, -, len: 455 aa. Equivalent to Rv1842c, len: 455 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 455 aa overlap). Conserved hypothetical membrane protein. Similar to Z99109|0O7589 Potential integral membrane protein from Bacillus subtilis (461 aa), FASTA scores: opt: 723, E(): 0, (31.2% identity in 449 aa overlap). Similar to other Mycobacterium tuberculosis putative integral membrane proteins e.g. Rv2366c, Rv1841c. Protein product from Mb1873c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1873c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZW2" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR002550" /db_xref="InterPro:IPR005170" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW2" /protein_id="SIU00477.1" /translation="MNLTDTVATILAILALTAGTGVFVAAEFSLTALDRSTVEANARG GTSRDRFIQRAHHRLSFQLSGAQLGISITTLATGYLTEPLVAELPHPGLVAVGMSDRV ADGLITFFALVIVTSLSMVFGELVPKYLAVARPLRTARSVVAGQVLFSLLLTPAIRLT NGAANWIVRRLGIEPAEELRSARTPQELVSLVRSSARSGALDDATAWLMRRSLQFGAL TAEELMTPRSKIVALQTDDTIADLVAAAAASGFSRFPVVEGDLDATVGIVHVKQVFEV PPGDRAHTLLTTVAEPVAVVPSTLDGDAVMAQVRASALQTAMVVDEYGGTAGMVTLED LIEEIVGDVRDEHDDATPDVVAAGNGWRVSGLLRIDEVASATGYRAPDGPYETIGGLV LRELGHIPVAGETVELTALDQDGLPDDSMRWLATVIQMDGRRIDSLELIKMGGHADPG SGRGR" CDS complement(2086885..2087679) /codon_start=1 /transl_table=11 /gene="guaB1" /locus_tag="BQ2027_MB1874C" /product="PROBABLE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE GUAB1(IMP DEHYDROGENASE) (IMPDH) (IMPD)" /note="Mb1874c, guaB1, len: 479 aa. Equivalent to Rv1843c, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). Probable guaB1, inosine-5'-monophosphate dehydrogenase (EC 1.1.1.205). Similar to others e.g. IMDH_BACSU|P21879 from Bacillus subtilis (513 aa), FASTA score: opt: 904, E(): 0, (37.8% identity in 471 aa overlap). Similar to other Mycobacterium tuberculosis proteins e.g. guaB2, Rv3411c. Protein product from Mb1874c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1874c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65173" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR001093" /db_xref="InterPro:IPR005990" /db_xref="InterPro:IPR005991" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:P65173" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00478.1" /translation="MRIGAAVGINGDVGAKARALAEAGVDVLVIDTAHGHQVKTLDAI KAVSALDLGLPLAAGNVVSAEGTRDLLKAGANVVKVGVGPGAMCTTRMMTGVGRPQFS AVLECASAARQLGGHIWADGGIRHPRDVALALAAGASNVMIGSWFAGTYESPGDLMRD RDDQPYKESYGMASKRAVVARTGADNPFDRARKALFEEGISTSRMGLDPDRGGVEDLI DHITSGVRSTCTYVGASNLAELHERAVVGVQSGAGFAEGHPLPAGW" CDS complement(2087690..2089747) /codon_start=1 /transl_table=11 /gene="gnd1" /locus_tag="BQ2027_MB1875C" /product="PROBABLE 6-PHOSPHOGLUCONATE DEHYDROGENASE GND1" /note="Mb1875c, gnd1, len: 705 aa. Similar to 5' end of Rv1844c, len: 485 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 404 aa overlap). Probable gnd1, 6-phosphogluconate dehydrogenase (EC 1.1.1.44). Similar to others e.g. 6PGD_ECOLI|P00350 from Escherichia coli (468 aa), FASTA scores: opt: 1661, E(): 0, (53.6% identity in 466 aa overlap); etc. Also similar to Rv1122|MTCY22G8.11|gnd2 PROBABLE 6-PHOSPHOGLUCONATE DEHYDROGENASE, DECARBOXYLATING from Mycobacterium tuberculosis (340 aa), FASTA score: (33.0% identity in 351 aa overlap). Note that Rv1844c is most similar to gnd's from Gram negative organisms, while Rv1122|MTCY22G8.11|gnd2 is most similar to gnd's from Gram positive organisms. BELONGS TO THE 6-PHOSPHOGLUCONATE DEHYDROGENASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (c-*) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (705 aa versus 485 aa). Mb1875c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZK7" /db_xref="InterPro:IPR006113" /db_xref="InterPro:IPR006114" /db_xref="InterPro:IPR006115" /db_xref="InterPro:IPR006183" /db_xref="InterPro:IPR006184" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK7" /protein_id="SIU00479.1" /translation="MGSNIARNFARHGYTVAVHNRSVAKTDALLKEHSSDGKFVRSET IPEFLAALEKPRRVLIMVKAGEATDAVINELADAMEPGDIIIDGGNALYTDTMRREKA MRERGLHFVGAGISGGEEGALNGPSIMPGGPAESYQSLGPLLEEISAHVDGVPCCTHI GPDGSGHFVKMVHNGIEYSDMQLIGEAYQLMRDGLGLTAPAIADVFTEWNNGDLDSYL VEITAEVLRQTDAKTGKPLVDVIVDRAEQKGTGRWTVKSALDLGVPVTGIAEAVFARA LSGSVGQRSAASGLASGKLGEQPADPATFTEDVRQALYASKIVAYAQGFNQIQAGSAE FGWDITPGDLATIWRGGCIIRAKFLNHIKEAFDASPNLASLIVAPYFRAPSNRRSTVG GVWCRRRPNWVSRPRDSRRPCRITTRCAPRGCPLHSPRPSATSSAHTPTAGSTNQASS THYGVQTAPKYRCSGLELKGGKGVSDEISRRAPTRVRPDIQRRVHRSEPIRGRVALRR RFVHRRRLGHHHSGSGRQYDRGSRAADGRDGRPPRWHRNPAAGSADPGGKADGGVRQK PGPGARHPSDAGTRRFGVRRHGAHPQARTWRRGGHPRGSPDRIGARIVLPGRGSLHPG ARYRRDGLCDRSSGNRATQDLRPAGARPGRRCGADRRRRHVGGSAKPHRGYPRRYLHP GHR" CDS complement(2089837..2090787) /codon_start=1 /transl_table=11 /gene="blar" /locus_tag="BQ2027_MB1876C" /product="possible sensor-transducer protein blar" /note="Mb1876c, -, len: 316 aa. Equivalent to Rv1845c, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 316 aa overlap). Conserved hypothetical transmembrane protein. Equivalent to MLCB1788.18|AL008609 Hypothetical protein from Mycobacterium leprae (316 aa), FASTA scores: opt: 1762, E(): 0, (87.6% identity in 314 aa overlap). Similar to proteins in Streptomyces coelicolor e.g. SC10A7.04|AL078618.1. Mb1876c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZH7" /db_xref="InterPro:IPR001915" /db_xref="UniProtKB/TrEMBL:A0A1R3XZH7" /protein_id="SIU00480.1" /translation="MSALAFTILAVLLAGPTPALLARATWPLRAPRAAMVLWQAIALA AVLSSFSAGIAIASRLLMPGPDGRPTTSFVGAAGRLGWPLWAAYITVFALTVLVGARL AVAVVRVATATRRRRAHHRMVVDLVGVGHNGALAQPCARARDLRVLDVAQPLAYCLPG VRSRVVVSEGTLTALADAEVAAILTHERAHLRARHDLVLEAFTAVHAAFPRLVRSANA LGAVQLLVELLADDAAVRAAGRTPLARALVACASGRAPSGALAVGGPSTVLRVRRLSG RGNSAVLSAAAYLAAAAVLVVPTVALAVPWLTQLQRLFIA" CDS complement(2090802..2091218) /codon_start=1 /transl_table=11 /gene="blai" /locus_tag="BQ2027_MB1877C" /product="transcriptional repressor blai" /note="Mb1877c, -, len: 138 aa. Equivalent to Rv1846c, len: 138 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 138 aa overlap). Possible transcriptional regulatory protein. Equivalent to MLCB1788.17|AL008609 hypothetical protein from Mycobacterium leprae (142 aa), FASTA scores: opt: 736 E(): 0, (95.1% identity in 123 aa overlap). Also similar to BLAI_BACLI|P06555 penicillinase repressor (128 aa), fasta scores: opt: 114, E(): 0.12, (23.7% identity in 131 aa overlap). Protein product from Mb1877c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1877c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZH1" /db_xref="InterPro:IPR005650" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZH1" /protein_id="SIU00481.1" /translation="MAKLTRLGDLERAVMDHLWSRTEPQTVRQVHEALSARRDLAYTT VMTVLQRLAKKNLVLQIRDDRAHRYAPVHGRDELVAGLMVDALAQAEYSGSRQAALVH FVERVGADEADALRRALAELEAGHGNRPPAGAATET" CDS 2091496..2091918 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1878" /product="Putative esterase" /note="Mb1878, -, len: 140 aa. Equivalent to Rv1847, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 140 aa overlap=. Conserved hypothetical protein, possible thioesterase, some similarity to YBDB proteins of Escherichia coli and H. influenzae e.g. P15050|YBDB_ECOLI HYPOTHETICAL 15.0 KD PROTEIN IN ENTA-CSTA INTERGENIC REGION (137 aa), FASTA scores: opt: 232, E(): 6.6e-10, (35.8% identity in 106 aa overlap); C48956|G142208 thioesterase from Arthrobacter sp (151 aa), FASTA score: opt: 254, E(): 1.7e-11, (33.3% identity in 138 aa overlap). Also similar to AF064959|AF064959_1 hypothetical protein from Coxiella burnetii (148 aa), FASTA score: opt: 264, E(): 9.3e- 12, (36.8% identity in 117 aa overlap). Protein product from Mb1878 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1878 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003736" /db_xref="InterPro:IPR006683" /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3XZH4" /protein_id="SIU00482.1" /translation="MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLL QLTGVVHGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFLRSISSGMVYGTA EPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP" CDS 2091967..2092269 /codon_start=1 /transl_table=11 /gene="ureA" /locus_tag="BQ2027_MB1879" /product="Urease gamma subunit ureA (Urea amidohydrolase)" /note="Mb1879, ureA, len: 100 aa. Equivalent to Rv1848, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). ureA, urease gamma subunit (EC 3.5.1.5). Similar to URE3_MYCTU|P50043 from Mycobacterium tuberculosis (100 aa), FASTA scores: opt: 630, E(): 1.3e-36, (99.0% identity in 100 aa overlap). BELONGS TO THE UREASE GAMMA SUBUNIT FAMILY. Protein product from Mb1879 detected using SWATH mass spectrometry. Mb1879 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A677" /db_xref="InterPro:IPR002026" /db_xref="InterPro:IPR012010" /db_xref="InterPro:IPR036463" /db_xref="UniProtKB/Swiss-Prot:P0A677" /protein_id="SIU00483.1" /translation="MRLTPHEQERLLLSYAAELARRRRARGLRLNHPEAIAVIADHIL EGARDGRTVAELMASGREVLGRDDVMEGVPEMLAEVQVEATFPDGTKLVTVHQPIA" CDS 2092266..2092580 /codon_start=1 /transl_table=11 /gene="ureB" /locus_tag="BQ2027_MB1880" /product="urease beta subunit ureb (urea amidohydrolase)" /note="Mb1880, ureB, len: 104 aa. Equivalent to Rv1849, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). ureB, urease beta subunit (EC 3.5.1.5). Identical to URE2_MYCTU|P50048 urease beta subunit from Mycobacterium tuberculosis (100 aa). BELONGS TO THE UREASE GAMMA SUBUNIT FAMILY. Mb1880 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A663" /db_xref="InterPro:IPR002019" /db_xref="InterPro:IPR036461" /db_xref="UniProtKB/Swiss-Prot:P0A663" /protein_id="SIU00484.1" /translation="MIPGEIFYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLP QANRALSFDRATAHGYRLDIPAATAVRFEPGIPQIVGLVPLGGRREVPGLTLNPPGRL DR" CDS 2092580..2094313 /codon_start=1 /transl_table=11 /gene="ureC" /locus_tag="BQ2027_MB1881" /product="Urease alpha subunit ureC (Urea amidohydrolase)" /note="Mb1881, ureC, len: 577 aa. Equivalent to Rv1850, len: 577 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 577 aa overlap). ureC, urease alpha subunit (EC 3.5.1.5). Similar to URE1_MYCTU|P50042 from M. tuberculosis (577 aa), FASTA scores: opt: 3794, E(): 0, (98.3% identity in 577 aa overlap). Contains PS00145 Urease active site motif. BELONGS TO THE UREASE FAMILY. Protein product from Mb1881 detected using SWATH mass spectrometry. Mb1881 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A661" /db_xref="InterPro:IPR005848" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR011059" /db_xref="InterPro:IPR011612" /db_xref="InterPro:IPR017950" /db_xref="InterPro:IPR017951" /db_xref="InterPro:IPR029754" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:P0A661" /protein_id="SIU00485.1" /translation="MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAG DEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVIIDYWGIIKADIGIRDGRIVGIG KAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTT IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRG GASGFKLHEDWGSTPAAIDTCLAVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIH AYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHLDMLMVCHHLNPRIP EDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARR GALEGDPSGSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFF GVRPHVVLKGGAIAWAAMGDANASIPTPQPVLPRPMFGAAAATAAATSVHFVAPQSID ARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVWQPQPAA ELPMTQRYFLF" CDS 2094313..2094948 /codon_start=1 /transl_table=11 /gene="ureF" /locus_tag="BQ2027_MB1882" /product="Urease accessory protein uref" /note="Mb1882, ureF, len: 211 aa. Equivalent to Rv1851, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 211 aa overlap). ureF, urease accessory protein. Identical to UREF_MYCTU|P50050 from M. tuberculosis. Mb1882 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VET6" /db_xref="InterPro:IPR002639" /db_xref="InterPro:IPR038277" /db_xref="UniProtKB/Swiss-Prot:Q7VET6" /protein_id="SIU00486.1" /translation="MTSLAVLLTLADSRLPTGAHVHSGGIEEAIAAGLVTGLATLEAF LKRRVRTHGLLTASIAAAVHRGELAVDDADRETDARTPAPAARHASRSQGRGLIRLAR RVWPDSGWEELGPRPHLAVVAGRVGALSGLAPEHNALHLVYITMTGSAIAAQRLLALD PAEVTVVTFQLSELCEQIAQEATAGLADLSDPLLDTLAQRHDERVRPLFVS" CDS 2094959..2095633 /codon_start=1 /transl_table=11 /gene="ureG" /locus_tag="BQ2027_MB1883" /product="Urease accessory protein ureG" /note="Mb1883, ureG, len: 224 aa. Equivalent to Rv1852, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). ureG, urease accessory protein. Identical to UREG_MYCTU|P50051 from M. tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UREG FAMILY. Protein product from Mb1883 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1883 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A665" /db_xref="InterPro:IPR003495" /db_xref="InterPro:IPR004400" /db_xref="InterPro:IPR012202" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P0A665" /protein_id="SIU00487.1" /translation="MATHSHPHSHTVPARPRRVRKPGEPLRIGVGGPVGSGKTALVAA LCRQLRGELSLAVLTNDIYTTEDADFLRTHAVLPDDRIAAVQTGGCPHTAIRDDITAN LDAIDELMAAHDALDLILVESGGDNLTATFSSGLVDAQIFVIDVAGGDKVPRKGGPGV TYSDLLVVNKTDLAALVGADLAVMARDADAVRDGRPTVLQSLTEDPAASDVVAWVRSQ LAADGV" CDS 2095641..2096267 /codon_start=1 /transl_table=11 /gene="ureD" /locus_tag="BQ2027_MB1884" /product="Probable urease accessory protein ureD" /note="Mb1884, ureD, len: 208 aa. Equivalent to Rv1853, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 208 aa overlap). ureD, probable urease accessory protein. Similar to URED_YEREN|P42868 Urease operon ureD protein from Yersinia enterocolitica (325 aa), Fasta scores: opt: 114, E(): 0.37, (25.2% identity in 119 aa overlap). Mb1884 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZI1" /db_xref="InterPro:IPR002669" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI1" /protein_id="SIU00488.1" /translation="MVASPNRLPRIDCRGGVQARRTAPDTVHLVSAAATPLGGDTMRI RVIVERGAQLRLRSAAATVALPGVDTLTSHAHWEIDVTGTLDVDLEPTVVAASARHLS HATLRLHDDGRVRLRERVQIGRCNEREGFWSSSLQADRHGRPLLRHRVELGAGSLADD VIAAPRATISELRYPATAFTDAIDARSTVLALAGGGTLSTWQADRLPG" CDS complement(2096270..2097661) /codon_start=1 /transl_table=11 /gene="ndh" /locus_tag="BQ2027_MB1885C" /product="PROBABLE NADH DEHYDROGENASE NDH" /note="Mb1885c, ndh, len: 463 aa. Equivalent to Rv1854c, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 463 aa overlap). Probable ndh, NADH dehydrogenase (EC 1.6.99.3) (see citations below), similar to several e.g. S74826 NADH dehydrogenase from Synechocystis sp. (445 aa), FASTA score: opt: 1228, E(): 0, (46.3% identity in 432 aa overlap). Highly similar to Rv0392c|Z84725|g1817703 from Mycobacterium tuberculosis (470 aa), FASTA scores: opt: 1911, E(): 0, (64.7% identity in 459 aa overlap); and Rv1812c. Protein product from Mb1885c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1885c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZL7" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL7" /protein_id="SIU00489.1" /translation="MSPQQEPTAQPPRRHRVVIIGSGFGGLNAAKKLKRADVDIKLIA RTTHHLFQPLLYQVATGIISEGEIAPPTRVVLRKQRNVQVLLGNVTHIDLAGQCVVSE LLGHTYQTPYDSLIVAAGAGQSYFGNDHFAEFAPGMKSIDDALELRGRILSAFEQAER SSDPERRAKLLTFTVVGAGPTGVEMAGQIAELAEHTLKGAFRHIDSTKARVILLDAAP AVLPPMGAKLGQRAAARLQKLGVEIQLGAMVTDVDRNGITVKDSDGTVRRIESACKVW SAGVSASWLGRDLAEQSRVELDRAGRVQVLPDLSIPGYPNVFVVGDMAAVEGVPGVAQ GAIQGAKYVASTIKAELAGANPAEREPFQYFDKGSMATVSRFSAVAKIGPVEFSGFIA WLIWLVLHLAYLIGFKTKITTLLSWTVTFLSTRRGQLTITDQQAFARTRLEQLAELAA EAQGSAASAKVAS" CDS complement(2097802..2098725) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1886C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb1886c, -, len: 307 aa. Equivalent to Rv1855c, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 aa overlap). Possible oxidoreductase (EC 1.-.-.-), possibly a monooxygenase. Contains PS00217 Sugar transport proteins signature 2, probably fortuitously. Similar to G487716 (78-11) LINCOMYCIN PRODUCTION GENES (29.2% identity in 154 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. Rv0953c, Rv0791c, Rv0132c, Rv2951c, etc. Protein product from Mb1886c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1886c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZJ1" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019952" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ1" /protein_id="SIU00490.1" /translation="MTIRLGLQIPNFSYGTGVEKLFPSVIAQAREAEAAGYDSLFVMD HFYQLPMLGTPDQPMLEAYTALGALATATERLQLGALVTGNTYRSPTLLAKIITTLDV VSAGRAILGIGAGWFELEHRQLGFEFGTFSDRFNRLEEALQILEPMVKGERPTFFGDW YTTESAMAEPRYRDRIPILIGGGGEKKTFAIAARFADHLNIVAAVDELPRKMRALAAR CDEAGRDRSTLQTSLLLTVMIDETLSPDAIPAEMSGRVVVGSPAQIADQIQAKVLDAG VDGLIINLAPHGYLPGVITTAAEALRPLLGV" CDS complement(2098764..2099441) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1887C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb1887c, -, len: 225 aa. Equivalent to Rv1856c, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 225 aa overlap). Possible oxidoreductase (EC 1.-.-.-). Equivalent to MLCB1788.11c|AL008609 OXIDOREDUCTASE from Mycobacterium leprae (224 aa), FASTA scores: opt: 1211, E(): 0; (80.4% identity in 224 aa overlap). Some similarity to dehydrogenases of short-chain dehydrogenase/reductase family and fatty-acyl CoA reductases e.g. P16543|DHK2_STRVN GRANATICIN POLYKETIDE SYNTHASE P (249 aa), FASTA score: opt: 194, E(): 1.1e-05, (32.5% identity in 237 aa overlap). Protein product from Mb1887c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1887c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZI6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00491.1" /translation="MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAK ELDVDAVVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAWRN ALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALSNWIAGQAAVF GTRGITINTVACGRSVQTGYEGLSHTPAPVAAEIARLALFLTTPAARHITGQTLHVSH GALAHFG" CDS 2099603..2100388 /codon_start=1 /transl_table=11 /gene="modA" /locus_tag="BQ2027_MB1888" /product="PROBABLE MOLYBDATE-BINDING LIPOPROTEIN MODA" /note="Mb1888, modA, len: 261 aa. Equivalent to Rv1857, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Probable modA, molybdate-binding protein attached to membrane by lipid-modified N-terminal cysteine (contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site), component of molybdate transport system (see citation below). Shows strong similarity to precursors of periplasmic molybdate/sulphate binding proteins e.g. O31229|Y10817|ANY108174 ModA from Arthrobacter nicotinovorans (260 aa), FASTA score: opt: 725, E(): 0, (47.8% identity in 249 aa overlap). Mb1888 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Y1" /db_xref="InterPro:IPR005950" /db_xref="UniProtKB/Swiss-Prot:P0A5Y1" /protein_id="SIU00492.1" /translation="MRWIGLSTGLVSAMLVAGLVACGSNSPASSPAGPTQGARSIVVF AAASLQSAFTQIGEQFKAGNPGVNVNFAFAGSSELATQLTQGATADVFASADTAQMDS VAKAGLLAGHPTNFATNTMVIVAAAGNPKKIRSFADLTRPGLNVVVCQPSVPCGSATR RIEDATGIHLNPVSEELSVTDVLNKVITGQADAGLVYVSDALSVATKVTCVRFPEAAG VVNVYAIAVLKRTSQPALARQFVAMVTAAAGRRILDQSGFAKP" CDS 2100391..2101185 /codon_start=1 /transl_table=11 /gene="modB" /locus_tag="BQ2027_MB1889" /product="probable molybdenum-transport integral membrane protein abc transporter modb" /note="Mb1889, modB, len: 264 aa. Equivalent to Rv1858, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Probable modB, molybdenum-transport integral membrane protein ABC transporter (see citation below), similar to others e.g. Y10817|ANY108175 ModB from Arthrobacter (239 aa), FASTA scores: opt: 937, E(): 0, (67.8% identity in 230 aa overlap); etc. Similar to other Mycobacterium tuberculosis transport proteins e.g. Rv2039c, Rv2316, etc. Mb1889 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A625" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR006469" /db_xref="InterPro:IPR011867" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/Swiss-Prot:P0A625" /protein_id="SIU00493.1" /translation="MHPPTDLPRWVYLPAIAGIVFVAMPLVAIAIRVDWPRFWALITT PSSQTALLLSVKTAAASTVLCVLLGVPMALVLARSRGRLVRSLRPLILLPLVLPPVVG GIALLYAFGRLGLIGRYLEAAGISIAFSTAAVVLAQTFVSLPYLVISLEGAARTAGAD YEVVAATLGARPGTVWWRVTLPLLLPGVVSGSVLAFARSLGEFGATLTFAGSRQGVTR TLPLEIYLQRVTDPDAAVALSLLLVVVAALVVLGVGARTPIGTDTR" CDS 2101192..2102301 /codon_start=1 /transl_table=11 /gene="modC" /locus_tag="BQ2027_MB1890" /product="PROBABLE MOLYBDENUM-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER MODC" /note="Mb1890, modC, len: 369 aa. Equivalent to Rv1859, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 369 aa overlap). Probable modC, molybdenum-transport ATP-binding protein ABC transporter (see citation below), similar to others e.g. Y10817|ANY108176 ModC from Arthrobacter (349 aa), FASTA scores: opt: 895, E(): 0, (46.0% identity in 361 aa overlap); etc. Shows similarity to other Mycobacterium tuberculosis ABC-transporter proteins e.g. Rv0073, Rv1238, Rv2564, etc. Contains both PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signatures involved in molybdate uptake. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Mb1890 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1P5" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005116" /db_xref="InterPro:IPR008995" /db_xref="InterPro:IPR015852" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P5" /protein_id="SIU00494.1" /translation="MSKLQLRAVVADRRLDVEFSVSAGEVLAVLGPNGAGKSTALHVI AGLLRPDAGLVRLGDRVLTDTEAGVNVATHDRRVGLLLQDPLLFPHLSVAKNVAFGPQ CRRGMFGSGRARTRASALRWLREVNAEQFADRKPRQLSGGQAQRVAIARALAAEPDVL LLDEPLTGLDVAAAAGIRSVLRSVVARSGCAVVLTTHDLLDVFTLADRVLVLESGTIA EIGPVADVLTAPRSRFGARIAGVNLVNGTIGPDGSLRTQSGAHWYGTPVQDLPTGHEA IAVFPPTAVAVYPEPPHGSPRNIVGLTVAEVDTRGPMVLVRGHDQPGGAPGLAACITV DAATELRVAPGSRVWFSVKAQEVALHPAPHQHASS" CDS 2102354..2103331 /codon_start=1 /transl_table=11 /gene="apa" /locus_tag="BQ2027_MB1891" /standard_name="mpt32; modD" /product="ALANINE AND PROLINE RICH SECRETED PROTEIN APA (FIBRONECTIN ATTACHMENT PROTEIN) (Immunogenic protein MPT32) (Antigen MPT-32) (45-kDa glycoprotein) (45/47 kDa antigen)" /note="Mb1891, apa, len: 325 aa. Equivalent to Rv1860, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 325 aa overlap). apa (alternate gene names: mpt32, modD), Ala-, Pro-rich 45/47 kDa secreted protein, very similar to P46842|N43L_MYCLE from Mycobacterium leprae (287 aa), FASTA scores: opt: 1166, E(): 0, (66.4% identity in 298 aa overlap). Known to be glycosylated fibronectin-binding protein (see some citations). TBparse score is 0.924. Protein product from Mb1891 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1891 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:O30620" /db_xref="InterPro:IPR010801" /db_xref="UniProtKB/Swiss-Prot:O30620" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00495.1" /translation="MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPA PPVPTTAASPPSTAAAPPAPATPVAPPPPAAANTPNAQPGDPNAAPPPADPNAPPPPV IAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHLDYGSALLSKTTGDPPFPGQPPPVA NDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPYPGTRINQETVSLDANGVS GSASYYEVKFSDPSKPNGQIWTGVIGSPAANAPDAGPPQRWFVVWLGTANNPVDKGAA KALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAPTPTTPTPQRTLPA" CDS 2103783..2104088 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1892" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb1892, -, len: 101 aa. Equivalent to Rv1861, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Probable conserved transmembrane protein, showing weak similarity to AE002069|AE002069_10 hypothetical protein from Deinococcus radiodurans (146 aa), FASTA scores: opt: 154, E(): 0.0027, (30.8% identity in 104 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb1892 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZK0" /db_xref="InterPro:IPR007341" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK0" /protein_id="SIU00496.1" /translation="MDITATTEFSAMNLDGKTGIGWLGYIVIGGIAGWLASKIVKGGG SGILMNVVIGVVGAFGAGLVLNALGVDVNHGGYWFTFFVALGGAVVLLWIVGMVRKT" CDS 2104163..2105203 /codon_start=1 /transl_table=11 /gene="adhA" /locus_tag="BQ2027_MB1893" /product="Probable alcohol dehydrogenase adhA" /note="Mb1893, adhA, len: 346 aa. Equivalent to Rv1862, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap=. Probable adhA, alcohol dehydrogenase (EC 1.1.1.1), similar to ADH2_BACST|P42327 alcohol dehydrogenase (339 aa), FASTA scores: opt: 630, E(): 2.4e-32 (34.4% identity in 320 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. Protein product from Mb1893 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1893 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZX7" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR014187" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZX7" /protein_id="SIU00497.1" /translation="MVSPATTATMSAWQVRRPGPMDTGPLERVTTRVPRPAPSELLVA VHACGVCRTDLHVTEGDLPVHRERVIPGHEVVGEVIEVGSAVGAAAGGEFDRGDRVGI AWLRHTCGVCKYCRRGSENLCPQSRYTGWDADGGYAEFTTVPAAFAHHLPSGYSDSEL APLLCAGIIGYRSLLRTELPPGGRLGLYGFGGSAHITAQVALAQGAEIHVMTRGARAR KLALQLGAASAQDAADRPPVPLDAAILFAPVGDLVLPALEALDRGGILAIAGIHLTDI PDLNYQQHLFQERQIRSVTSNTRADARAFFDFAAQHHIEVTTPEYPLGQADRALGDLS AGRIAGAAVLLI" CDS complement(2105210..2105980) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1894C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb1894c, -, len: 256 aa. Equivalent to Rv1863c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Probable conserved integral membrane protein, similar to Rv0804|Z95618|MTCY7H7A.05 Hypothetical protein from Mycobacterium tuberculosis (209 aa), FASTA scores: opt: 199, E(): 1e-06, (33.2% identity in 220 aa overlap); and Rv0658c. Mb1894c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZJ3" /db_xref="InterPro:IPR003675" /db_xref="InterPro:IPR015837" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ3" /protein_id="SIU00498.1" /translation="MSDHLTACAAVHPGPLVSHLSVMHRFRIYVDIAVVVLVLVLTNL IAHFTTPWASIATVPAAAVGLVILVRSRGLGWAELGLSRQHWKSGLVYALAAVALVVA VISVGVLLPITRPMFMNHHYATISGAVIASMVMIPLQTVIPEELAFRGVLHGALNRAW GFRGVAVAGSVLFGLWHIATSLGLTSSNVGFTRLFGGGIIGLVAGVMLAVLATGVAGF VFSWLRRRSGSLIAPIALHWSLNGMGALAAALVWHLST" CDS complement(2105973..2106728) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1895C" /product="conserved protein" /note="Mb1895c, -, len: 251 aa. Equivalent to Rv1864c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Conserved hypothetical protein. Similar to other hypothetical proteins e.g. AL031317|SC6G4.43 from Streptomyces coelicolor cosmid 6G (233 aa), FASTA scores: opt: 716, E(): 0, (54.4% identity in 215 aa overlap); also P43976|YIIM_HAEIN hypothetical protein hi0278 (221 aa), FASTA scores: opt: 223, E(): 3.8e-08, (29.5% identity in 173 aa overlap). Protein product from Mb1895c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1895c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZM6" /db_xref="InterPro:IPR005302" /db_xref="InterPro:IPR011037" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM6" /protein_id="SIU00499.1" /translation="MTVAPRRLAWTNARQSYPVRVAHVLSVNLARVRANPDPRAQSKL TGIDKVAASEAVMVRAPGSMHAGVGSGLVGDTVGNPKLHGGDDQAVYAYAREDLDAWE TQLHRTLHNGMFGENLTTSGVDVTYARIGERWRIGSDGLVLEVSAPRIPCRTFAAFLD LRYWIKTFTRAAKPGAYLRVIAPGTVRAGDTITVDYRPEHNVTVGLVFRARTSESELL PQLLAADALAAELKAYARERTPSPPPVDSADDV" CDS complement(2106725..2107585) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1896C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE" /note="Mb1896c, -, len: 286 aa. Equivalent to Rv1865c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable short-chain dehydrogenase (EC 1.-.-.-), highly similar to C-terminus of NP_301650.1|NC_00267 putative oxidoreductase from Mycobacterium leprae (596 aa). Also similar to various dehydrogenases, generally belonging to short-chain family, e.g. AAG02168.1|AF212041_24|AF212041 3-oxoacyl-(acylcarrier protein) reductase from Zymomonas mobilis (251 aa); P50198|LINX_PSEPA 2,5-DICHLORO-2,5-CYCLOHEXADIENE-1,4-DIOL DEHYDROGENASE from Sphingomonas paucimobilis (250 aa); NP_105680.1|NC_002678 sorbitol dehydrogenase (also similar to acetoin reductase) from Mesorhizobium loti (256 aa); etc. And highly similar to C-terminus of ephD|Rv2214c|MTCY190.25c from Mycobacterium tuberculosis (592 aa); and many other oxidoreductases from Mycobacterium tuberculosis e.g. Y00P_MYCTU|Q10402 putative oxidoreductase (650 aa), FASTA scores: opt: 439, E(): 8.9e-20, (32.5% identity in 280 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1896c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1896c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZJ8" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ8" /protein_id="SIU00500.1" /translation="MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKV AIGDIDEAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGIMP VGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASLAGEIYAVGVA TYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELIAGTGGIKGFKNAEPADIA DAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPRQVSEGLNRLLGGEHVFTDDVDMEKRR TYEARARGEE" CDS 2107759..2110095 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1897" /product="CaiB/BaiF family protein" /note="Mb1897, -, len: 778 aa. Equivalent to Rv1866, len: 778 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 778 aa overlap). Conserved hypothetical protein, N-terminal region similar to fatty acyl-CoA racemases e.g. Rv0855, Rv1143, and C-terminal region (from aa 370) similar to L-carnitine dehydratases, racemases, and Rv3272|MTCY71.12 Mycobacterium tuberculosis (394 aa), FASTA score: opt: 472, E(): 2.1e-21, (29.9% identity in 388 aa overlap). Also similar to P31572|CAIB_ECOLI L-CARNITINE DEHYDRATASE (EC 4.2.1.89) (405 aa), FASTA score: opt: 306, E(): 2.1e-11, (23.3% identity in 424 aa overlap). Protein product from Mb1897 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZJ5" /db_xref="InterPro:IPR003673" /db_xref="InterPro:IPR023606" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ5" /protein_id="SIU00501.1" /translation="MVTRLLADLGADVLKVQPPGGSPGRHVRPTLAGTSIGFAMHNAN KRSAVLNPLDESDRRRFLDLAASADIVVDCGLPGQAAAYGASCAELADRYRHLVALSI TDFGAAGPRSSWRATDPVLYAMSGALSRSGPTAGTPVLPPDGIASATAAVQAAWAVLV AYFNRLRCGTGDYIDFSRFDAVVMALDPPFGAHGQVAAGIRSTGRWRGRPKNQDAYPI YPCRDGYVRFCVMAPRQWRGLRRWLGEPEDFQDPKYDVIGARLAAWPQISVLVAKLCA EKTMKELVAAGQALGVPITAVLTPSRILASEHFQAVGAITDAELVPGVRTGVPTGYFV VDGKRAGFRTPAPAAGQDEPRWLADPAPVPPPSGRVGGYPFEGLRILDLGIIVAGGEL SRLFGDLGAEVIKVESADHPDGLRQTRVGDAMSESFAWTHRNHLALGLDLRNSEGKAI FGRLVAESDAVFANFKPGTLTSLGFSYDVLHAFNPRIVLAGSSAFGNRGPWSTRMGYG PLVRAATGVTRVWTSDEAQPDNSRHPFYDATTIFPDHVVGRVGALLALAALIHRDRTG GGAHVHISQAEVVVNQLDTMFVAEAARATDVAEIHPDTSVHAVYPCAGDDEWCVISIR SDDEWRRATSVFGQPELANDPRFGASRSRVANRSELVAAVSAWTSTRTPVQAAGALQA AGVAAGPMNRPSDILEDPQLIERNLFRDMVHPLIARPLPAETGPAPFRHIPQAPQRPA PLPGQDSVQICRKLLGMTADETERLINERVMFGPAVTA" CDS 2110383..2111867 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1898" /product="Acetyl-CoA acetyltransferase" /note="Mb1898, -, len: 494 aa. Equivalent to Rv1867, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 494 aa overlap). Conserved hypothetical protein, some similarity to acetyl CoA synthase and to lipid carriers. FASTA best: E155295 acetyl CoA synthase (388 aa), opt: 213, E(): 4.5e-07, (23.2% identity in 423 aa overlap). Protein product from Mb1898 detected using SWATH mass spectrometry. Mb1898 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZK6" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR040771" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK6" /protein_id="SIU00502.1" /translation="MPVDPRTPVLIGYGQVNHRGDIDAEKQSIEPVDLMAAAARKAAD STVLEAVDSIRVVHMLSAHYRNPGQLLGERIKARTFTTGYSGVGGNMPQSLVNRACLD IQRGRAGVVLLAGAETWRTRTGLRAKGSKLEWTVQDESVPLPDMAGDDVPMAGAAELR INLDRPAYVYPIFEQALRIAYGESIENHRKRIGELWARFSAVAADNPHAWIRNPVTAD EIWQPGPQNRMVSWPYTKLMNSNNMVDQGAALLLTSVERATRLRIPAERWVYPQAGTD AHDTPAVADRHRLHRSTAIRIAGARALELAGLGLDDIEYVDLYSCFPSAVQVAAIELG LDTDDPARPLTVTGGLTFAGGPWSNYVTHSIATMAELLAANPGRRGLITANGGYLTKH SFGVYGTEPPSEFRWEDMQPAVDREPTGDGLVEWEGIGTVEAWTTPVNRDGQPEKAFL AVRTPDGSRSLAVITDPASVQATVREDIAGVKVAVAPDGTATLR" CDS 2111966..2114065 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1899" /product="Phosphoenolpyruvate synthase (EC" /EC_number="2.7.9.2" /note="Mb1899, -, len: 699 aa. Equivalent to Rv1868, len: 699 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 699 aa overlap). Conserved hypothetical protein, similar to products of three consecutive ORFS in Mycobacterium leprae MLCB2052.18|Z98604|B2052 (257 aa), FASTA scores: opt: 314, E(): 9.9e-12, (35.2% identity in 213 aa overlap); MLCB2052.17, and MLCB2052.16. Also similar to M. tuberculosis hypothetical protein Rv2047c. Protein product from Mb1899 detected using SWATH mass spectrometry. Mb1899 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016040" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZJ6" /protein_id="SIU00503.1" /translation="MQILVTDATGAVGRSVTRQLIAAGHTVSGIAQHPHDALDPRVDY VCASLRNPVLQELAGEADAVIHLAPVDTSAPGGVGITGLAHVANAAARAGARLLFVSQ AAGRPELYRQAETLVSTGWAPSLVIRIAPPVGRQLDWMVCRTVATLLRSKVSARPIRV LHLDDLVRFLVLALNTDRNGVVDLATPDTTNVVTAWRLLRSVDPHLRTRRVRSWEQLI PEVDIAAVQEDWNFEFGWQATEAIVDTGRGLVGRRLHPAGATNGSGQLALPVEAPPRS VPSHGEPLGSAAPEGLEGEFDDRIDERFPVFSSASLAEALPGPLTPMTLDVQLSGLRA AGRAMGRVLALGGVVADEWERRAIAVFGHRPYIGVSANIVAAAQLPGWDAQAVARRAL GEQPQVTELLPFGRPQLAGGPLGSVAKVVVTARSLALLRHLRSDTHHYVAAADAEHLA AGQLASLPDAGLEVRIRLLRDRIHQGWILTVLWVIDTGVTAATLEHTRAGSAVSGGGM IMESGRIGAEIAPLAAVLRADPPLCALANDGNLASIRALSAPAAAAVDAVIARIGHRG LGEAELANLTFADDPALLLKTAAEIAARPAGPAHPATLIQRLAAGTRSARELAHDTTI RFTHELRMTLRELGSRRVAADVIDVVDDVFYLTCDELITTPADARLRIKRRRAERERL QAQRPPDVIDHAWVPVE" CDS complement(2114079..2115314) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1900C" /product="Probable reductase" /note="Mb1900c, -, len: 411 aa. Equivalent to Rv1869c, len: 411 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 411 aa overlap). Probable reductase (1.-.-.-). Similar to several reductases e.g. CAC04223.1|AL391515 putative ferredoxin reductase from Streptomyces coelicolor (420 aa); THCD_RHOSO|P43494 rhodocoxin reductase (426 aa), FASTA scores: opt: 904, E(): 0, (40.8% identity in 370 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv0688 (406 aa) (39.9% identity in 391 aa overlap); and Rv0253 (nitrite reductase subunit). Protein product from Mb1900c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1900c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1Q6" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR028202" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q6" /protein_id="SIU00504.1" /translation="MASSTTFVIVGGGLAGAKAVEALRRSDFGGRIILFGDEEHLPYD RPPLSKEFLAGKKSLSDFTIQTSDWYRDHDVDVRLGVRVSSLDRSAHTVELPDGAAVR YDKLLLATGSAPRRPPIPGSDAAGVHYLRSYNDAVALNSVLVQGSSLAVVGAGWIGLE VAASARQRGVDVTVVETAIQPLLAALGEAVGKVFADLHRDQGVDLRLQTQLEEITAAD GKATGLKMRDGSTVAADAVLVAVGAKPNVELAQQAGLAMGEGGVLVDASLRTSDPDIY AVGDIAAAEHPLLGTRVRTEHWANALKQPAVAAAGMLGRPGEYAELPYLFTDQYDLGM EYVGHAPSCDRVVFRGNVAGREFLSFWLDGDSRVLAGMNVNVWDVVDDVKGLIRSGNP VDVDRLVDPQWPLADLTTN" CDS complement(2115381..2116049) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1901C" /product="Protein involved in DNA repair" /note="Mb1901c, -, len: 222 aa. Equivalent to Rv1870c, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Conserved hypothetical protein. Some similarity to SC6F7.17c hypothetical protein from Streptomyces coelicolor (216 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (t-a) and a single base transition (c-t) lead to a slightly longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (222 aa versus 211 aa). Mb1901c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0D9" /db_xref="InterPro:IPR011257" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D9" /protein_id="SIU00505.1" /translation="MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMP LFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRY DESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLRE VQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSNALLAAALVRVALDDELRL QVTG" CDS complement(2116114..2116503) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1902C" /product="DNA-repair" /note="Mb1902c, -, len: 129 aa. Equivalent to Rv1871c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Q11057|Rv1261|MTCY50.21 (149 aa), FASTA score: opt: 125, E(): 0.019, (32.6% identity in 89 aa overlap); Rv0523c, and Rv1598c. Protein product from Mb1902c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1902c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZL5" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL5" /protein_id="SIU00506.1" /translation="MNAAMNLKREFVHRVQRFVVNPIGRQLPMTMLETIGRKTGQPRR TAVGGRVVDNQFWMVSEHGEHSDYVYNIKANPAVRVRIGGRWRSGTAYLLPDDDPRQR LRGLPRLNSAGVRAMGTDLLTIRVDLD" CDS complement(2116526..2117770) /codon_start=1 /transl_table=11 /gene="lldD2" /locus_tag="BQ2027_MB1903C" /product="POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) LLDD2" /note="Mb1903c, lldD2, len: 414 aa (start uncertain). Equivalent to Rv1872c, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 414 aa overlap). Possible lldD2, L-lactate dehydrogenase (cytochrome) (EC 1.1.2.3), similar to other lactate dehydrogenases and other oxidases e.g. LLDD_ECOLI|P33232 l-lactate dehydrogenase (cytochrome) from Escherichia coli strain K12 (396 aa), FASTA results: opt: 674, E(): 1.1e-37, (40.5% identity in 279 aa overlap); Q51135 LACTATE DEHYDROGENASE from Neisseria meningitidis (390 aa), FASTA results: opt: 309, E(): 4.1e-15, (42.5% identity in 113 aa overlap); etc. Also shows similarity with Rv0694|lldD1|MTCY210.11 POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) from Mycobacterium tuberculosis (396 aa). Contains PS00557 FMN-dependent alpha-hydroxy acid dehydrogenases active site. BELONGS TO THE FMN-DEPENDENT ALPHA-HYDROXY ACID DEHYDROGENASES FAMILY. Protein product from Mb1903c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1903c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZY8" /db_xref="InterPro:IPR000262" /db_xref="InterPro:IPR008259" /db_xref="InterPro:IPR012133" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR037396" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY8" /protein_id="SIU00507.1" /translation="MAVNRRVPRVRDLAPLLQFNRPQFDTSKRRLGAALTIQDLRRIA KRRTPRAAFDYADGAAEDELSIARARQGFRDIEFHPTILRDVTTVCAGWNVLGQPTVL PFGIAPTGFTRLMHTEGEIAGARAAAAAGIPFSLSTLATCAIEDLVIAVPQGRKWFQL YMWRDRDRSMALVRRAAAAGFDTMLVTVDVPVAGARLRDVRNGMSIPPALTLRTVLDA MGHPRWWFDLLTTEPLAFASLDRWPGTVGEYLNTVFDPSLTFDDLAWIKSQWPGKLVV KGIQTLDDARAVVDRGVDGIVLSNHGGRQLDRAPVPFHLLPHVARELGKHTEILVDTG IMSGADIVAAIALGARCTLIGRAYLYGLMAGGEAGVNRAIEILQTGVIRTMRLLGVTC LEELSPRHVTQLRRLGPIGAPT" CDS 2117793..2118230 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1904" /product="NTP pyrophosphohydrolases including oxidative damage repair enzymes" /note="Mb1904, -, len: 145 aa. Equivalent to Rv1873, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 145 aa overlap). Conserved hypothetical protein. Some similarity to AL591783 hypothetical protein from Sinorhizobium meliloti. Mb1904 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014937" /db_xref="InterPro:IPR036287" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK4" /protein_id="SIU00508.1" /translation="MKSASDPFDLKRFVYAQAPVYRSVVEELRAGRKRGHWMWFVFPQ LRGLGSSPLAVRYGISSLEEAQAYLQHDLLGPRLHECTGLVNQVQGRSIEEIFGPPDN LKLCSSMTLFARATDANQDFVALLAKYYGGGEDRRTVALLAVT" CDS 2118303..2118989 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1905" /product="link to sulfotransferase activity" /note="Mb1905, -, len: 228 aa. Equivalent to Rv1874, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). Hypothetical unknown protein. Protein product from Mb1905 detected using shotgun mass spectrometry. Mb1905 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009799" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN4" /protein_id="SIU00509.1" /translation="MLMRPEPDDDWCARQRAQVADALLGLGVAGLSINVRDSTVRDSL MTLTTLYPPVAAVVSLWTQQCYGEQVAAALRLLAQECDELGAYLVTESVPLTFPSLVE SGSRTPGLANIALLRRPDGLDQATWLTRWQRDHTQVAIEAQATFGYTQNWVVRALTPE APGIAGIVEELFPVAATTDLKAFFGAADDNDLRNRISRMVASTSAFGANQNIDTVPTS RYVFRTPFKD" CDS 2119000..2119443 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1906" /product="probable F420-dependent enzyme" /note="Mb1906, -, len: 147 aa. Equivalent to Rv1875, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Conserved hypothetical protein. Some similarity to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1155|MTCI65.22|Z95584 (147 aa), FASTA scores: opt: 178, E(): 7.4e-06, (26.9% identity in 130 aa overlap); Rv0121c and Rv2074. Also similar to AL079356|SC6G9.21 hypothetical protein from Streptomyces coelicolor (144 aa), FASTA scores: opt: 239, E(): 3.1 e-09, (38.7% identity in 137 aa overlap). Protein product from Mb1906 detected using shotgun mass spectrometry. Mb1906 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZL6" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019920" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00510.1" /translation="MTTLNEAAALAAAERGLAVVSTVRADGTVQASLVNVGLLPHPVS GEPSLGFTTYGKVKLGNLRARPQLAVTFRNGWQWATVEGRAQLVGPDDPRPWLVDGER LRLLLREVFTAAGGTHDDWDEYDRVMAQEQRAVVLITPTRIYSNG" CDS 2119959..2120438 /codon_start=1 /transl_table=11 /gene="bfrA" /locus_tag="BQ2027_MB1907" /standard_name="bfr" /product="PROBABLE BACTERIOFERRITIN BFRA" /note="Mb1907, bfrA, len: 159 aa. Equivalent to Rv1876, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). Probable bfrA, bacterioferritin, similar to BFR_MYCLE|P43315 bacterioferritin (bfr) from Mycobacterium leprae (159 aa), FASTA results: opt: 958, E(): 0, (90.6% identity in 159 aa overlap). Also similar to Rv3841|MTCY01A6.28c|bfrB POSSIBLE BACTERIOFERRITIN from Mycobacterium tuberculosis (181 aa). BELONGS TO THE BACTERIOFERRITIN FAMILY. Protein product from Mb1907 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1907 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63698" /db_xref="InterPro:IPR002024" /db_xref="InterPro:IPR008331" /db_xref="InterPro:IPR009040" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012347" /db_xref="UniProtKB/Swiss-Prot:P63698" /protein_id="SIU00511.1" /translation="MQGDPDVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAAHT RAESFDEMRHAEEITDRILLLDGLPNYQRIGSLRIGQTLREQFEADLAIEYDVLNRLK PGIVMCREKQDTTSAVLLEKIVADEEEHIDYLETQLELMDKLGEELYSAQCVSRPPT" CDS 2120523..2122058 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1908" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN [FIRST PART]" /note="Mb1908, -, len: 511 aa. Similar to 5' end of Rv1877, len: 687 aa, from Mycobacterium tuberculosis strain H37Rv, (93.2% identity in 292 aa overlap). Probable conserved integral membrane protein, part of major facilitator superfamily (MFS), similar to many antibiotic and drug efflux proteins. Similar to e.g. Q56175 TU22 DTDP-GLUCOSE DEHYDRTATASE from Streptomyces violaceoruber (557 aa), FASTA scores: opt: 895, E(): 0, (34.7% identity in 528 aa overlap). Also similar to Mycobacterium tuberculosis relatives protein, include Rv3728, Rv3239c, Rv2846c, etc. Contains PS00217 Sugar transport proteins signature 2 (PS00217). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1877 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c), splits Rv1877 into 2 parts, Mb1908 and Mb1909. Mb1908 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZL3" /db_xref="InterPro:IPR001411" /db_xref="InterPro:IPR001958" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL3" /protein_id="SIU00512.1" /translation="MAGPTAPTTAPTAIRAGGPLLSPVRRNIIFTALVFGVLVAATGQ TIVVPALPTIVAELGSTVDQSWAVTSYLLGGTVVVVVAGKLGDLLGRNRVLLGSVVVF VVGSVLCGLSQTMTMLAISRALQGVGAGAISVTAYALAAEVVPLRDRGRYQGVLGAVF GVNTVTGPLLGGWLTDYLSWRWAFWINVPVSIAVLTVAATAVPALARPPKPVIDYLGI LVIAVATTALIMATSWGGTTYAWGSATIVGLLIGAAVALGFFVWLEGRARCGHPAAQA VWQPSICRVLRPVLRGRIRDAGCTDLRTDLSGVRGRRVGDRVRSAHVADGDRPADRLD RDGCPGRPDGPLQDLPGRGDGADGGCVPADVADGRVDATAAAIAVPGRPRCRHRIVHA GARSHRAEHVVFRRPRRRNIGCDLLPGGRRLVWYRNIRCVVRKLPGPKTRFRADVGRR ACPGSAISGCLASAAPEHGRPDRAGICRVAHPGVPLRGLGHGGRFHPGAVAARGTAHR HPR" CDS 2122087..2122587 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1909" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN [SECOND PART]" /note="Mb1909, -, len: 404 aa. Equivalent to 3' end of Rv1877, len: 687 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 404 aa overlap). Probable conserved integral membrane protein, part of major facilitator superfamily (MFS), similar to many antibiotic and drug efflux proteins. Similar to e.g. Q56175 TU22 DTDP-GLUCOSE DEHYDRTATASE from Streptomyces violaceoruber (557 aa), FASTA scores: opt: 895, E(): 0, (34.7% identity in 528 aa overlap). Also similar to Mycobacterium tuberculosis relatives protein, include Rv3728, Rv3239c, Rv2846c, etc. Contains PS00217 Sugar transport proteins signature 2 (PS00217). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1877 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c), splits Rv1877 into 2 parts, Mb1908 and Mb1909. Mb1909 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3XZK3" /protein_id="SIU00513.1" /translation="MPRAESPEDVLEIAVRRMLPNGVRLRDIATQPGCGLGVAELWAL LRIYQYQRLFEAVRLTDIGRHLHVPYQVFEPVFDRLVQTGYAARDGDILTLTPSGHRQ VDSLAVLIRQWLLDHLAVAPGLKRQPDHQFEAALQHVTDAVLVQRDWYEDLGDLSESR QLAATT" CDS 2122642..2123994 /codon_start=1 /transl_table=11 /gene="glnA3" /locus_tag="BQ2027_MB1910" /product="PROBABLE GLUTAMINE SYNTHETASE GLNA3 (GLUTAMINE SYNTHASE) (GS-I)" /note="Mb1910, glnA3, len: 450 aa. Equivalent to Rv1878, len: 450 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 450 aa overlap). Probable glnA3, glutamine synthetase class I (EC 6.3.1.2), similar to many e.g. GLNA_BACCE|P19064 from Bacillus cereus (443 aa), FASTA results: opt: 497, E(): 5.2e-23, (29.0% identity in 331 aa overlap); etc. Also similar to C-terminus of FLUG_EMENI|P38094 flug protein from emericella nidulans (865 aa), FASTA scores: opt: 227, E (): 6.4e-13, (29.9% identity in 394 aa overlap). Note that the downstream ORF MTCY180.39c is similar to the N-terminus. Also similar to three other potential glutamine synthases in M. tuberculosis: Q10378|GLN2_MYCTU|GLNA2|Rv2222c|MT2280|MTCY1 90.33c|MTCY427 .03c; Rv2860c|MTV003.06c|glnA4 and Rv2220|glnA1. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY. Protein product from Mb1910 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1910 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1R5" /db_xref="InterPro:IPR008146" /db_xref="InterPro:IPR014746" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R5" /protein_id="SIU00514.1" /translation="MTATPLAAAAIAQLEAEGVDTVIGTVVNPAGLTQAKTVPIRRTN TFANPGLGASPVWHTFCIDQCSIAFTADISVVGDQRLRIDLSALRIIGDGLAWAPAGF FEQDGTPVPACSRGTLSRIEAALADAGIDAVIGHEVEFLLVDADGQRLPSTLWAQYGV AGVLEHEAFVRDVNAAATAAGIAIEQFHPEYGANQFEISLAPQPPVAAADQLVLTRLI IGRTARRHGLRVSLSPAPFAGSIGSGAHQHFSLTMSEGMLFSGGTGAAGMTSAGEATV AGVLRGLPDAQGILCGSIVSGLRMRPGNWAGIYACWGTENREAAVRFVKGGAGSAYGG NVEVKVVDPSANPYLASAAILGLALDGMKTKAVLPSETTVDPTQLSDVDRDRAGILRL AADQADAIAVLDSSKLLRCILGDPVVDAVVAVRQLEHERYGDLDPAQLADKFRMAWSV " CDS 2123997..2125133 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1911" /product="Amidohydrolase" /note="Mb1911, -, len: 378 aa. Equivalent to Rv1879, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 378 aa overlap). Conserved hypothetical protein, similar to SCC22.14c|AL096839 hypothetical protein from Streptomyces coelicolor (368 aa), FASTA results: opt: 772, E(): 0 (40.3% identity in 372 aa overlap); and to N-terminal half of nodulin/glutamate-ammonia ligase-like protein. Some similarity to N-terminus of AL132958|ATT4D2_11 Arabidopsis thaliana (845 aa), FASTA results: opt: 354, E(): 3.1e-16, (29.2% identity in 383 aa overlap); and to P38094|FLUG_EMENI Flug protein of Emericella nidulans (865 aa), FASTA results: opt: 306, E(): 6.2e-13, (26.5% identity in 415 aa overlap). Note that the upstream ORF Rv1878|MTCY18 0.40c is similar to the C-terminus. Mb1911 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0F1" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F1" /protein_id="SIU00515.1" /translation="MADSAGSDLTRHTAEVPLIDQHVHGCWLTEGNRRRFENALNEAN TEPLADFDSGFDSQLGFAVRNHCAPILGLPRHVDPQTYWDRRSQFSEAELARRFLQAA GVTDWLVETGIGYDVSGMASVAGLGELSGSHAHEVVRLEQVAEQAVQASGDYASAFNE ILRRRAATAVATKSILAYRGGFDGDLTEPPAAQVAEAAKRWRDRGGVRLQDRVLLRFG LHQALRLGKPLQFHVGFGDRDADLHKANPLYLLDFLRQSGNTPIVLLHCYPYEREAGY LAQAFNNVYLDGGLSVHYLGARSPAFIGRLLELAPFRKIVYSSDGFGPAELHFLGATL WRSGIQRVLRGFVERDDWCETDALRVVDLIAHGTAARIYRLGDR" CDS complement(2125161..2126477) /codon_start=1 /transl_table=11 /gene="cyp140" /locus_tag="BQ2027_MB1912C" /product="Probable cytochrome p450 140 CYP140" /note="Mb1912c, cyp140, len: 438 aa. Equivalent to Rv1880c, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 438 aa overlap). Probable cyp140, cytochrome p450 (EC 1.14.-.-). Similar to Q00441|CPXJ_SACER 6-deoxyerythronolide beta hydroxylase (404 aa), FASTA scores: opt: 775, E(): 0, (44.2% identity in 319 aa overlap); and other members of the cytochrome P450 family. Related to Mycobacterium tuberculosis proteins include: Rv0766c, Rv2266, Rv0778, etc. Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb1912c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1912c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63722" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63722" /protein_id="SIU00516.1" /translation="MKDKLHWLAMHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPF YDEVRSHGALVRNRANYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRGDQ LHPLREPSLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLLDRFAEQPGIV DVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSLDIGIPWRQYLRVQQGIRG FDCWLEGHLQQLRHAPGDDLMSQLIQIAESGDNETQLDETELRAIAGLVLVAGFETTV NLLGNGIRMLLDTPEHLATLRQHPELWPNTVEEILRLDSPVQLTARVACRDVEVAGVR IKRGEVVVIYLAAANRDPAVFPDPHRFDIERPNAGRHLAFSTGRHFCLGAALARAEGE VGLRTFFDRFPDVRAAGAGSRRDTRVLRGWSTLPVTLGPARSMVSP" CDS complement(2126527..2126949) /codon_start=1 /transl_table=11 /gene="lppE" /locus_tag="BQ2027_MB1913C" /product="POSSIBLE CONSERVED LIPOPROTEIN LPPE" /note="Mb1913c, lppE, len: 140 aa. Equivalent to Rv1881c, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap=. Possible lppE, lipoprotein, showing some similarity to L12238|MSG18S19K_1 19K antigen from Mycobacterium intracellulare (162 aa), FASTA scores: opt: 137, E(): 0.0069, (27.6% identity in 156 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1913c detected using SWATH mass spectrometry. Mb1913c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZZ9" /db_xref="InterPro:IPR008691" /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ9" /protein_id="SIU00517.1" /translation="MCNRLVTVTGVAMVVAAGLSACGQAQTVPRKAARLTIDGVTHTT RPATCSQEHSYRTIDIRNHDSTVQAVVLLSGDRVIPQWVKIRNVDGFNGSFWHGGVGN ARADRARNTYTVAGSAYGISSKKPNTVVSTDFNILAEC" CDS complement(2126990..2127823) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1914C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb1914c, -, len: 277 aa. Equivalent to Rv1882c, len: 277 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 277 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, generally belonging to SDR family, e.g. NP_250789.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (251 aa); NP_421760.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (270 aa); NP_107167.1|NC_002678 oxidoreductase (short chain dehydrogenase/reductase family) from Mesorhizobium loti (253 aa); P50197|LINC_PSEPA 2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (250 aa), FASTA scores: opt: 301, E(): 2.3e-12, (30.0% identity in 223 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Rv3057c, Rv1245, etc. Contains possible helix-turn-helix motif at aa 246-267 (+4.32 SD). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1914c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1914c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZL1" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL1" /protein_id="SIU00518.1" /translation="MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQ LGAERLWARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYEAA VRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVYSATKHAVKGL TEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPYTISAEQIRAAAPKKGMFR LMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWIDRLKGVSPEFVRRHIAKSLATLEPKR K" CDS complement(2127851..2128327) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1915C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1915c, -, len: 158 aa. Equivalent to Rv1883c, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (96.8% identity in 158 aa overlap). Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08, (34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB from Streptomyces actuosus (173 aa), FASTA score: opt: 207, E(): 1.8e-07, (40.2% identity in 102 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 15 bp in-frame insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (158 aa versus 153 aa). Protein product from Mb1915c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1915c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN9" /protein_id="SIU00519.1" /translation="MCLDQVMEGSATVHMAAPPDKIWTLIADVRNTGRFSPETFEAEW LDGATGPALGARFRGHVRRNGIGPVYWTVCEVTACEPGREFGFAVLLGDRPVNNWHYR LTPTADGTEVTESFRLPPSVLTTVYYRVFGGWLRQRRNIRDMTKTLQRIKDLVEAG" CDS complement(2128366..2128896) /codon_start=1 /transl_table=11 /gene="rpfC" /locus_tag="BQ2027_MB1916C" /product="PROBABLE RESUSCITATION-PROMOTING FACTOR RPFC" /note="Mb1916c, rpfC, len: 176 aa. Equivalent to Rv1884c, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Probable rpfC, resuscitation promoting factor (see citation below), similar to Z96935|MLRPF_1 resusicitation-promoting factor from Micrococcus luteus (220 aa), FASTA score: opt: 287, E() : 3.3e-11, (40.0% identity in 120 aa overlap). Also similar to others from Mycobacterium tuberculosis: Rv2389c|MTCY253.32|RPFD PROBABLE RESUSCITATION-PROMOTING FACTOR (154 aa), FASTA score: opt: 382, E(): 7.1e-17, (55.4% identity in 101 aa overlap); Rv0867c|RPFA (N-terminal part), Rv2450c|RPFE, and Rv1009|RPFB (C-terminal part). Mb1916c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010618" /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM2" /protein_id="SIU00520.1" /translation="MHPLPADHGRSRCNRHPISPLSLIGNASATSGDMSSMTRIAKPL IKSAMAAGLVTASMSLSTAVAHAGPSPNWDAVAQCESGGNWAANTGNGKYGGLQFKPA TWAAFGGVGNPAAASREQQIAVANRVLAEQGLDAWPTCGAASGLPIALWSKPAQGIKQ IINEIIWAGIQASIPR" CDS complement(2128908..2129507) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1917C" /product="chorismate mutase" /note="Mb1917c, -, len: 199 aa. Equivalent to Rv1885c, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Conserved hypothetical protein, some similarity to P42517|CHMU_ERWHE MONOFUNCTIONAL CHORISMATE MUTASE (181 aa), FASTA score: opt: 181, E(): 0.00017, (28.6% identity in 133 aa overlap). Protein product from Mb1917c detected using SWATH mass spectrometry. Mb1917c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZL2" /db_xref="InterPro:IPR002701" /db_xref="InterPro:IPR008240" /db_xref="InterPro:IPR036263" /db_xref="UniProtKB/TrEMBL:A0A1R3XZL2" /protein_id="SIU00521.1" /translation="MLTRPREIYLATAVSIGILLSLIAPLGPPLARADGTSQLAELVD AAAERLEVADPVAAFKWRAQLPIEDSGRVEQQLAKLGEDARSQHIDPDYVTRVFDDQI RATEAIEYSRFSDWKLNPASAPPEPPDLSASRSAIDSLNNRMLSQIWSHWSLLSAPSC AAQLDRAKRDIVRSRHLDSLYQRALTTATQSYCQALPPA" CDS complement(2129525..2130502) /codon_start=1 /transl_table=11 /gene="fbpB" /locus_tag="BQ2027_MB1918C" /standard_name="mpt59; 85B" /product="secreted antigen 85-B fbpB (85B) (antigen 85 complex B) (Mycolyl transferase 85B) (Fibronectin-binding protein B) (Extracellular alpha-antigen)" /note="Mb1918c, fbpB, len: 325 aa. Equivalent to Rv1886c, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv (100.0% identity in 325 aa overlap). fbpB (alternate gene names: mpt59, 85B), precursor of the 85-B antigen (fibronectin-binding protein B) (mycolyl transferase 85B) (EC 2.3.1.-) (see citations below), highly similar to other Mycobacterial antigen precursors e.g. P12942|A85B_MYCBO ANTIGEN 85-B PRECURSOR from Mycobacterium bovis (323 aa); P21160|A85B_MYCKA ANTIGEN 85-B PRECURSOR from Mycobacterium kansasii (325 aa); etc. Also highly similar to Mycobacterium tuberculosis antigen precursors: Rv3804c|fbpA (338 aa), Rv0129c|fbpC2 (340 aa), and Rv3803c|fbpC1 (299 aa). Protein product from Mb1918c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1918c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0C2T2" /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0C2T2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00522.1" /translation="MTDVSRKIRAWGRRLMIGTAAAVVLPGLVGLAGGAATAGAFSRP GLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLDGLRAQDDYNGWDINTPAFEWYY QSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVKPTGS AAIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAAD MWGPSSDPAWERNDPTQQIPKLVANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSS NLKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQSSLGAG" CDS 2130893..2132035 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1919" /product="putative membrane protein" /note="Mb1919, -, len: 380 aa. Equivalent to Rv1887, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 380 aa overlap). Hypothetical unknown protein; contains eukaryotic thiol (cysteine) proteases histidine active site at N-terminus (PS00639) and Pro-rich region near C-terminus. Protein product from Mb1919 detected using SWATH mass spectrometry. Mb1919 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZK9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00523.1" /translation="MDTVLGLSITPTTLGWVLAEGHGADGAILDRNELELHSGRNAQA IHTAEQLAAEVLLAHEVAAAGDHRLRVIGVTWNAEASAQAALLVESLTGAGFDNVVPV RRLRAIETLAQAIAPVIGYEQIAVCVLEHESATVVMVDTHDGKTQIAVKHVCRGLSGL TSWLTGMFGRDAWRPAGVVVVGSDSEVSEFSWQLERVLPVPVFAQTMAQVTVARGAAL AAAQSTEFTDAQLVADSVSQPTVAPRRSRHYAGAAAALAAAAVTFVASLSLAVGIQLA PHNDTGTAKHGAHKPTPRIAKAVAPAVPPPPTVTPPVPARAPRPAAQHEPPARVTSGE ALTEPNPPEEQPNASAPQQDRNDSQPITRVLEHIPGAYGDSAPPAE" CDS complement(2132005..2132724) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1920C" /product="POSSIBLE TRANSMEMBRANE PROTEIN" /note="Mb1920c, -, len: 239 aa. Similar to Rv1888c, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 186 aa overlap). Possible transmembrane protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 10 bp insertion (*-tccgatcacc) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis H37Rv (239 aa versus 186 aa). Protein product from Mb1920c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1920c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1S2" /db_xref="InterPro:IPR025498" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1S2" /protein_id="SIU00524.1" /translation="MQPDAYPVRVRGDLDPALSRWQWLVKWFLAIPHYIVLFFLHVAA VVVTVIAFFAILFTGRYPRTLFDFNVGVMRWRWRVAFYALSALGTDRYPPFSLQTKAE YPADLEVDYPERLSRGLVLIKWWLLAIPHYLILAVFLSSGWRVFLIDPHDRVGIMWPS LLVILLLVAVVALLFTGRYPIGLYNLVIGVNRWALRVRAYTTLMRDEYPPLRLDMGPR EQVSQPATAASDYSAGGAESP" CDS complement(2133089..2133262) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1921C" /product="O-methyltransferase" /note="Mb1921c, -, len: 57 aa. Equivalent to Rv1888A, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 57 aa overlap). Conserved hypothetical protein. Possibly continuation of Rv1889c, part of large family of Mycobacterium tuberculosis proteins with conserved N-terminal domain of ~ 120 aa. Includes: C-terminus of Rv0726c|P95074 CONSERVED HYPOTHETICAL PROTEIN (367 aa), FASTA scores: opt: 295, E(): 3.1e-15, (73.684% identity in 57 aa overlap); C-terminus of Rv3399|Q50726|MTCY78.29c CONSERVED HYPOTHETICAL PROTEIN (348 aa), FASTA scores: opt: 504, E(): 7.3e-29, (64.2% identity in 120 aa overlap); C-terminus of Rv0731c; etc. Mb1921c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0F9" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F9" /protein_id="SIU00525.1" /translation="MVPVDLRRDWPTPLRQAGFDPNQPSAWLAEGLLAFLPPDAQDRL LDNITALSAPGSR" CDS complement(2133306..2133662) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1922C" /product="O-methyltransferase" /note="Mb1922c, -, len: 118 aa. Equivalent to Rv1889c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Conserved hypothetical protein. Part of large family of Mycobacterium tuberculosis proteins with conserved N-terminal domain of ~ 120 aa. Includes: Rv3399|Q50726|MTCY78.29C hypothetical 38.1 kd protein (348 aa), FASTA results: opt: 504, E(): 7.3e-29, (64.2% identity in 120 aa overlap); Rv0726c, Rv0731c, etc. Mb1922c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZN6" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN6" /protein_id="SIU00526.1" /translation="MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEP LVRAVGIDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSAGI RQAVILASGLDARAYR" CDS complement(2133721..2134332) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1923C" /product="HYPOTHETICAL PROTEIN" /note="Mb1923c, -, len: 203 aa. Equivalent to Rv1890c, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Hypothetical unknown protein. Protein product from Mb1923c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR007372" /db_xref="InterPro:IPR036761" /db_xref="UniProtKB/TrEMBL:A0A1R3Y011" /protein_id="SIU00527.1" /translation="MAHKTRREGRAGRSSEYSRGVSDAVWTLDASDGELVLRTGVVGR AARLGHRLTIAMTRWQALVNWSGTDPVAGELVAEVDSFEVMRGEGGVKGLSEPEKALV RANALKTLNASRFPHIRFTTEAIAQTGNGYRLTGKLHIRGKSREHVIDLHTEDLGAAW RISADTTVRQSNYGVKPYSLLMGSIRVADEVSVAFTAVRAKDD" CDS 2134386..2134793 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1924" /product="putative membrane protein" /note="Mb1924, -, len: 135 aa. Equivalent to Rv1891, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Conserved hypothetical protein. Equivalent to MLCB561.09|AL049571 hypothetical protein from Mycobacterium leprae (134 aa), FASTA scores: opt: 800, E(): 0, (79.7% identity in 133 aa overlap). Mb1924 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZL9" /protein_id="SIU00528.1" /translation="MIRELVTTAAITGAAIGGAPVAGADPQRYDGDVPGMNYDASLGA PCSSWERFIFGRGPSGQAEACHFPPPNQFPPAETGYWVISYPLYGVQQVGAPCPKPQA AAQSPDGLPMLCLGARGWQPGWFTGAGFFPPEP" CDS 2134810..2135121 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1925" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1925, -, len: 103 aa. Equivalent to Rv1892, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Probable membrane protein. Mb1925 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZP7" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP7" /protein_id="SIU00529.1" /translation="MIMCEGRPTESPIPRWLRFVLTSDRAGSAWYIGAGFFFAPVLAV LSPWPTITAVLWWIIGLAGLWLGLLGIAMAVGLARVLRSGAEIPEAYWRTLVDYRSAN E" CDS 2135131..2135349 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1926" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1926, -, len: 72 aa. Equivalent to Rv1893, len: 72 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 aa overlap). Conserved hypothetical protein. Equivalent to MLCB561.11|AL049571 hypothetical protein from Mycobacterium leprae (74 aa), FASTA scores: opt: 317, E(): 4.6e-15, (69.4% identity in 72 aa overlap). Protein product from Mb1926 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1926 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZN1" /protein_id="SIU00530.1" /translation="MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASG IVKDSIDIATHAVDRTKEVFTGKTDDEG" CDS complement(2135384..2136514) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1927C" /product="putative oxidoreductase, nitronate monooxygenase family" /note="Mb1927c, -, len: 376 aa. Equivalent to Rv1894c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Conserved hypothetical protein, weak similarity to some oxidoreductases e.g. Q01284 2-NITROPROPANE DIOXYGENASE PRECURSOR (378 aa), FASTA results: opt: 204, E(): 5.8e-06, (34.3% identity in 140 aa overlap). Similar to hypothetical Mycobacterium tuberculosis proteins e.g. Rv3553|MTCY03C7.02c (355 aa), FASTA results: opt: 296, E(): 1.6e-10, (32.9% identity in 167 aa overlap); Rv1533 (375 aa) (48.1% identity in 376 aa overlap); Rv0021c, Rv2781c. Protein product from Mb1927c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1927c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZM5" /db_xref="InterPro:IPR004136" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM5" /protein_id="SIU00531.1" /translation="MHTAICDELGIEFPIFAFTHCRDVVVAVSKAGGFGVLGAVGFTP EQLEIELNWIDEHIGDHPYGVDIVIPNKYEGMDSQLSADELAKTLRSMVPQEHLDFAR KILADHGVPVEDADEDSLQLLGWTEATATPQVDAALKHPKMTMVANALGTPPADMIKH IHDSGRKVAALCGSPSQARKHADAGVDIIIAQGGEAGGHCGEVGSIVLWPQVVKEVAP VPVLAAGGIGSGQQIAAALALGTQGAWTGSQWLMVEEAANTAVQQAAYVKATSRDTVR SRSFTGKPARMLRNDWTEAWEQPESPKPLGMPLQYMVSGMAVKATHKYPNETVDVAFN PVGQVVGQFTKVEKTATVIERWVQEYLEATARLDALNAAASV" CDS 2137166..2137489 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1928" /product="POSSIBLE DEHYDROGENASE [FIRST PART]" /note="Mb1928, -, len: 107 aa. Equivalent to 5' end of Rv1895, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). Possible dehydrogenase (EC 1.1.-.-), similar to various sorbitol and alcohol dehydrogenases, and to putative glutathione-dependent aldehyde dehydrogenase e.g DHSO_BACSU|Q06004 Sorbitol dehydrogenase (EC 1.1.1.14) from Streptomyces coelicolor (352 aa), FASTA results: opt: 506, E(): 7.2e-24, (30.6% identity in 350 aa overlap); and AL109962|SCJ1.28 PUTATIVE ZINC-CONTAINING DEHYDROGENASE from Streptomyces coelicolor (356 aa), FASTA results: opt: 634, E(): 2.9e-30, (34.7% identity in 357 aa overlap). Also similar to other Mycobacterium tuberculosis dehydrogenases. Note that there is a substantial (134 bp) overlap at the C-terminus with the C-terminus of the downstream ORF, although both appear to be true coding regions. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1895 exists as a single gene. In Mycobacterium bovis, two frameshifts due to a single base deletion (a-*) and a single base insertion (*-t), consecutively, splits Rv1895 into 2 main parts, Mb1928 and Mb1929. Mb1928 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZP1" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP1" /protein_id="SIU00532.1" /translation="MRAVVIDGAGSVRVNTQPDPALPGPDGVVVAVTAAGICGSDLHF YEGEYPFTEPVALGHEAVGTIVEAGPQVRTVGVGDLVMVSSVAGCGVCPGCEPMIQSC ASPAR" CDS 2137486..2138190 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1929" /product="POSSIBLE DEHYDROGENASE [SECOND PART]" /note="Mb1929, -, len: 234 aa. Equivalent to middle part of Rv1895, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (97.9% identity in 234 aa overlap). Possible dehydrogenase (EC 1.1.-.-), similar to various sorbitol and alcohol dehydrogenases, and to putative glutathione-dependent aldehyde dehydrogenase e.g DHSO_BACSU|Q06004 Sorbitol dehydrogenase (EC 1.1.1.14) from Streptomyces coelicolor (352 aa), FASTA results: opt: 506, E(): 7.2e-24, (30.6% identity in 350 aa overlap); and AL109962|SCJ1.28 PUTATIVE ZINC-CONTAINING DEHYDROGENASE from Streptomyces coelicolor (356 aa), FASTA results: opt: 634, E(): 2.9e-30, (34.7% identity in 357 aa overlap). Also similar to other Mycobacterium tuberculosis dehydrogenases. ****Note that there is a substantial (134 bp) overlap at the C-terminus with the C-terminus of the downstream ORF, although both appear to be true coding regions. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv1895 exists as a single gene. In Mycobacterium bovis, two frameshifts due to a single base deletion (a-*) and a single base insertion (*-t), consecutively, splits Rv1895 into 2 main parts, Mb1928 and Mb1929." /db_xref="GOA:A0A1R3XZM8" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM8" /protein_id="SIU00533.1" /translation="MIFGAGVLGGAQADLLAVPAADFQVLKIPEGITTEQALLLTDNL ATGWAAAQRADISFGSAVAVIGLGAVGLCALRSAFIHGAATVFAVDRVKGRLQRAATW GATPIPSPAAETILAATRGRGADSVIDAVGTDASMSDALNAVRPGGTVSVVGVHDLQP FPLPALTCLLRSITLRMTMAPVQRTWPELIPLLQSGRLDVDGIFTTTLPLDEAAKGYA TARARSGEELKVLLTP" CDS complement(2138180..2139091) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1931C" /product="O-methyltransferase" /note="Mb1931c, -, len: 303 aa. Equivalent to Rv1896c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 303 aa overlap). Conserved hypothetical protein. Similar to several (14) hypothetical Mycobacterium tuberculosis proteins e.g. Rv0145|MTCI5.19 (317 aa), FASTA results: opt: 720, E(): 0, (41.6% identity in 308 aa overlap); Q10552|YZ21_MYCTU (325 aa), opt: 689, E(): 0, (40.5% identity in 304 aa overlap); Rv0726c, Rv0731c, Rv3399, etc. and to related proteins in other actinomycetes. Note that there is a substantial (134 bp) overlap at the C-terminus with the C-terminus of the downstream ORF, although both appear to be true coding regions. Protein product from Mb1931c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1931c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZC0" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TZC0" /protein_id="SIU00534.1" /translation="MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLV QDEYAKHFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGIRQ AVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKAHRVAVPADLR TDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFARIDELCAPGSRVALGALGS RLDHEQLAALETAHPGVNMSGDVNFSALTYDDKTDPVEWLVEHGWAVDPVRSTLELQV GYGLTPPDVDVKIDSFMRSQYITAVRA" CDS complement(2139096..2139527) /codon_start=1 /transl_table=11 /gene="dtd" /locus_tag="BQ2027_MB1932C" /product="D-aminoacyl-tRNA deacylase (EC" /EC_number="3.1.1.96" /note="Mb1932c, -, len: 143 aa. Equivalent to Rv1897c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein. Some similarity to D63706|Q54235 ORF2 from Streptomyces griseus (149 aa), FASTA results: opt: 509, E(): 1.2e-28, (57.3% identity in 150 aa overlap); and Q45303 ORF1 PROTEIN from Corynebacterium glutamicum (144 aa), FASTA results: opt: 460, E(): 5.5e-23, (49.7% identity in 143 aa overlap). Mb1932c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63996" /db_xref="InterPro:IPR003732" /db_xref="InterPro:IPR023509" /db_xref="UniProtKB/Swiss-Prot:P63996" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00535.1" /translation="MRVLVQRVSSAAVRVDGRVVGAIRPDGQGLVAFVGVTHGDDLDK ARRLAEKLWNLRVLADEKSASDMHAPILVISQFTLYADTAKGRRPSWNAAAPGAVAQP LIAAFAAALRQLGAHVEAGVFGAHMQVELVNDGPVTVMLEG" CDS 2139585..2139893 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1933" /product="ExtraCellular Mutant; Ecm15p" /note="Mb1933, -, len: 102 aa. Equivalent to Rv1898, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). Conserved hypothetical protein, some similarity to other hypothetical proteins e.g. Q58452 from METHANOCOCCUS JANNASCH II (100 aa), FASTA results: opt: 152, E(): 9.1e-05, (31.5% identity in 92 aa overlap); and AE000771|AE000771_2 from Aquifex aeolicus (157 aa), FASTA results: opt: 246, E(): 3.2e-11, (39.0% identity in 100 aa overlap). Protein product from Mb1933 detected using SWATH mass spectrometry. Mb1933 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002767" /db_xref="InterPro:IPR029756" /db_xref="UniProtKB/Swiss-Prot:P67120" /protein_id="SIU00536.1" /translation="MSVLVAFSVTPLGVGEGVGEIVTEAIRVVRDSGLPNQTDAMFTV IEGDTWAEVMAVVQRAVEAVAARAPRVSAVIKVDWRPGVTDAMTQKVATVERYLLRPE " CDS complement(2139859..2140935) /codon_start=1 /transl_table=11 /gene="lppD" /locus_tag="BQ2027_MB1934C" /product="POSSIBLE LIPOPROTEIN LPPD" /note="Mb1934c, lppD, len: 358 aa. Similar to Rv1899c, len: 343 aa, from Mycobacterium tuberculosis strain H37Rv, (95.5% identity in 358 aa overlap). Possible lipoprotein; contains appropriately localized lipoprotein lipid attachment site (PS00013). Some similarity to C-terminal part of AE000717|AE000717_4 hypothetical protein from Aquifex aeolicus section 49 (165 aa), FASTA results: opt: 372, E(): 2.3e-14, (43.5% identity in 147 aa overlap); and Q44020 4-hydroxybutyrate dehydrogenase (173 aa), FASTA results: opt: 272, E(): 4.7e-09, (35.8% identity in 165 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 45 bp in-frame insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (358 aa versus 343 aa). Protein product from Mb1934c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1934c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002589" /db_xref="UniProtKB/Swiss-Prot:Q7TZB9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00537.1" /translation="MSRAAGLPRLSWFAGLTWFAGGSTGAGCAAHPALAGLTAGARCP AYAAISASTARPAATALPAVAASTARPAATAGTTPATGASGSARPTDAAGMADLARPG VVATHAVRTLGTTGSRAIGLCPCQPLDCPRSPQATPNLGSMGRSLDGPQWRRARVRLC GRWWRRSNTTRGASPRPPSTCRGDNVSMIELEVHQADVTKLELDAITNAANTRLRHAG GVAAAIARAGGPELQRESTEKAPIGLGEAVETTAGDMPARYVIHAATMELGGPTSGEI ITAATAATLRKADELGCRSLALVAFGTGVGGFPLDDAARLMVGAVRRHRPGSLQRVVF AVHGDAAERAFSAAIQAGEDTARR" CDS complement(2140935..2142323) /codon_start=1 /transl_table=11 /gene="lipJ" /locus_tag="BQ2027_MB1935C" /product="PROBABLE LIGNIN PEROXIDASE LIPJ" /note="Mb1935c, lipJ, len: 462 aa. Equivalent to Rv1900c, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 462 aa overlap). Probable lipJ, lignin peroxidase, with some similarity to esterases, hydrolases and hypothetical Mycobacterium tuberculosis proteins e.g. Q43936 BETA-KETOADIPATE ENOL-LACTONE HYDROLASE from Acinetobacter calcoaceticus (267 aa), FASTA results: opt: 217, E(): 1.7e-07, (29.2% identity in 260 aa overlap). Also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv2212|Q10400|YM12_MYCTU (378 aa), FASTA results: opt: 216, E(): 6.7e-07, (27.7% identity in 285 aa overlap). Protein product from Mb1935c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1935c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZM9" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3XZM9" /protein_id="SIU00538.1" /translation="MAQAPHIHRTRYAKCGDMDIAYQVLGDGPTDLLVLPGPFVPIDS IDDEPSLYRFHRRLASFSRVIRLDHRGVGLSSRLAAITTLGPKFWAQDAIAVMDAVGC EQATIFAPSFHAMNGLVLAADYPERVRSLIVVNGSARPLWAPDYPVGAQVRRADPFLT VALEPDAVEQGFDVLSIVAPTVAGDDVFRAWWDLAGNRAGPPSMARAVSKVIAEADVR DVLGHIEAPTLILHRVGSTYIPVGHGRYLAEHIAGSRLVELPGTDTLYWVGDTGPMLD EIEEFITGVRGGADAERMLATIMFTDIVGSTQHAAALGDDRWRDLLDNHDTIVCHEIQ RFGGREVNTAGDGFVATFTSPSAAIACADDIVDAVAALGIEVRIGIHAGEVEVRDASH GTDVAGVAVHIGARVCALAGPSEVLVSSTVRDIVAGSRHRFAERGEQELKGVPGRWRL CVLMRDDATRTR" CDS 2142352..2143644 /codon_start=1 /transl_table=11 /gene="cinA" /locus_tag="BQ2027_MB1936" /product="PROBABLE CINA-LIKE PROTEIN CINA" /note="Mb1936, cinA, len: 430 aa. Equivalent to Rv1901, len: 430 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 430 aa overlap). Probable cinA-like protein, strong similarity to competence damage proteins CinA of Bacillus subtilis and S. pneumoniae. FASTA results: Q55760 HYPOTHETICAL 44.7 KD PROTEIN (416 aa) opt: 755, E(): 0, (36.0% identity in 433 aa overlap). Protein product from Mb1936 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1936 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001453" /db_xref="InterPro:IPR008135" /db_xref="InterPro:IPR008136" /db_xref="InterPro:IPR036425" /db_xref="InterPro:IPR036653" /db_xref="UniProtKB/Swiss-Prot:P63776" /protein_id="SIU00539.1" /translation="MAVSARAGIVITGTEVLTGRVQDRNGPWIADRLLELGVELAHIT ICGDRPADIEAQLRFMAEQGVDLIVTSGGLGPTADDMTVEVVARYCGRELVLDDELEN RIANILKKLMGRNPAIEPANFDSIRAANRKQAMIPAGSQVIDPVGTAPGLVVPGRPAV MVLPGPPRELQPIWSKAIQTAPVQDAIAGRTTYRQETIRIFGLPESSLADTLRDAEAA IPGFDLVEITTCLRRGEIEMVTRFEPNAAQVYTQLARLLRDRHGHQVYSEDGASVDEL VAKLLTGRRIATAESCTAGLLAARLTDRPGSSKYVAGAVVAYSNEAKAQLLGVDPALI EAHGAVSEPVAQAMAAGALQGFGADTATAITGIAGPSGGTPEKPVGTVCFTVLLDDGR TTTRTVRLPGNRSDIRERSTTVAMHLLRRTLSGIPGSP" CDS complement(2143696..2144964) /codon_start=1 /transl_table=11 /gene="nanT" /locus_tag="BQ2027_MB1937C" /product="PROBABLE SIALIC ACID-TRANSPORT INTEGRAL MEMBRANE PROTEIN NANT" /note="Mb1937c, nanT, len: 422 aa. Equivalent to Rv1902c, len: 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 422 aa overlap). Probable nanT, sialic acid-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), similar to others e.g. Q48076 SIALIC ACID TRANSPORTER (407 aa), FASTA results: opt: 443, E(): 5.4e-22, (26.7% identity in 389 aa overlap); etc. Some similarity to MTCI364.12|O05301 conserved hypothetical protein from Mycobacterium tuberculosis (425 aa), FASTA results: opt: 251, E(): 1.1e-09, (23.5% identity in 417 aa overlap). Contains sugar transport proteins signature 2 (PS00217). Mb1937c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZN8" /db_xref="InterPro:IPR004742" /db_xref="InterPro:IPR005828" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN8" /protein_id="SIU00540.1" /translation="MAAPRLTGDQRNAFMASFLGWTMDAFDYFLVVLVYADIATTFHH TKTDVAFLTTATLAMRPVGALLFGLWADRVGRRVPLMVDVSFYSVIGFLCAFAPNFTV LVILRLLYGIGMGGEWGLGAALSMEKVPAERRGVFSGLLQEGYAFGYLLASVAALVVM NWLGLSWRWLFGLSIIPALISLIIRYRVKESEVWEAAQDRMRLTKTRIRDVLGNPAIV RRFVYLVLLMTAFNWMSHGTQDVYPTFLTATTDHGAGLSSLTARWIVVIYNIGAIIGG LAFGTLSQRFSRRYTIVFCAALGLPIVPLFAYSRTAAMLCLGSFLMQVFVQGAWGVIP AHLTEMSPDAIRGVYPGVTYQLGNLLAAFNLPIQERLAESHGYPFALAATIVPVLLVV AVLTAIGKDATGIRFGTTETAFLVRHRNRH" CDS 2145054..2145458 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1938" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb1938, -, len: 134 aa. Equivalent to Rv1903, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Probable conserved membrane protein, similar to Q53868|YPT3_STRCO hypothetical 15.9 kd protein from Streptomyces coelicolor (148 aa) opt: 323, E(): 1.3e-16, (42.9% identity in 126 aa overlap); and equivalent to AJ000521|MLCOSL672_3 from Mycobacterium leprae (139 aa), FASTA results: opt: 680, E(): 0, (80.6% identity in 129 aa overlap). Protein product from Mb1938 detected using SWATH mass spectrometry. Mb1938 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZN5" /db_xref="InterPro:IPR007165" /db_xref="UniProtKB/TrEMBL:A0A1R3XZN5" /protein_id="SIU00541.1" /translation="MVPFLMRAAVTGFALWVVTLFVPGMRFAGGDTTLQRVAIIFVVA VIFGLVNAFIKPIVQILSIPLYILTLGLFHVVVNASMLWLTAWITEHTTHWGLQIDHF WWTAIWAAILLSIVSWILSLLARDFRRVTRAH" CDS 2145644..2146075 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1939" /product="Anti-sigma B factor antagonist RsbV" /note="Mb1939, -, len: 143 aa. Equivalent to Rv1904, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 143 aa overlap). Conserved hypothetical protein, some similarity to other hypothetical Mycobacterium tuberculosis proteins e.g. Rv2638|MTCY441.08|P71937 (148 aa), FASTA results: opt: 456, E(): 2.7e-23, (52.8% identity in 125 aa overlap); Rv1365|Q11035 (128 aa), FASTA results: opt: 393, E(): 1.4e-19, (48.8% identity in 123 aa overlap); and Rv3687c. Also weak similarity to Q9WVX8|RSBV_STRCO ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa). Protein product from Mb1939 detected using SWATH mass spectrometry. Mb1939 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZQ1" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR003658" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3XZQ1" /protein_id="SIU00542.1" /translation="MRTVAIGPGAGPSSTRPSSQPSDLHSGLRAVTECTGSAVVVHVG GDIDASNEVAWQRLVSKSAAIAIAPGPFVIDIRDLDFMGSCAYAVLAQESVRCRRRGV NMRLVSNQPIVARTIAACGLRRLIPLYAMVETALAPPPSAH" CDS complement(2146123..2147085) /codon_start=1 /transl_table=11 /gene="aao" /locus_tag="BQ2027_MB1940C" /product="probable d-amino acid oxidase aao" /note="Mb1940c, aao, len: 320 aa. Equivalent to Rv1905c, len: 320 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 320 aa overlap). Probable aao, D-amino acid oxidase (EC 1.4.3.3), similar to many. Equivalent to AJ000521|MLCOSL672.02|O33145 Mycobacterium leprae (320 aa), FASTA results: opt: 1541, E(): 0, (71.7% identity in 315 aa overlap); also similar to OXDD_BOVIN|P31228 d-aspartate oxidase (EC 1.4.3.1) from bos taurus (338 aa), FASTA results: opt: 461, E(): 1.1e-21, (31.8% identity in 321 aa overlap). Protein product from Mb1940c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1940c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZP0" /db_xref="InterPro:IPR006076" /db_xref="InterPro:IPR023209" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP0" /protein_id="SIU00543.1" /translation="MAIGEQQVIVIGAGVSGLTSAICLAEAGWPVRVWAAALPQQTTS AVAGAVWGPRPKEPVAKVRGWIEQSLHVFRDLAKDPATGVRMTPALSVGDRIETGAMP PGLELIPDVRPADPADVPGGFRAGFHATLPMIDMPQYLDCLTQRLAATGCEIETRPLR SLAEAAEAAPIVINCAGLGARELAGDATVWPRFGQHVVLTNPGLEQLFIERTGGSEWI CYFAHPQRVVCGGISIPGRWDTTPEPEITERILQRCRRIQPRLAEAAVIETITGLRPD RPSVRVEAEPIGRALCIHNYGHGGDGVTLSWGCAREVVNLVGGG" CDS complement(2147115..2147585) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1941C" /product="conserved protein" /note="Mb1941c, -, len: 156 aa. Equivalent to Rv1906c, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Conserved hypothetical protein, possibly exported protein, equivalent to Mycobacterium leprae AJ000521|MLCOSL672.01 (153 aa), FASTA scores: opt: 637, E(): 2.6e-28, (63.2% identity in 155 aa overlap). Also similar to M. tuberculosis hypothetical exported protein, Rv1352. Protein product from Mb1941c detected using SWATH mass spectrometry. Mb1941c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T0" /protein_id="SIU00544.1" /translation="MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAI DSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTD KAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP" CDS complement(2147925..2148572) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1942C" /product="HYPOTHETICAL PROTEIN" /note="Mb1942c, -, len: 215 aa. Equivalent to Rv1907c, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 215 aa overlap). Hypothetical unknown protein. Similar to Q50763 Ethyl methane sulphonate resistance protein from Mycobacterium tuberculosis (168 aa), FASTA scores: opt: 638, E(): 0, (69.7% identity in 152 aa overlap). Downstream of a cloned katG gene (EMBL:MTKATG). Differences are due to frameshift errors in the EMBL sequence and the use of an earlier start codon. Mb1942c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR025358" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H4" /protein_id="SIU00545.1" /translation="MIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARR DGDDETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTV GLTRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTH PDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA" CDS complement(2148579..2150801) /codon_start=1 /transl_table=11 /gene="katG" /locus_tag="BQ2027_MB1943C" /product="CATALASE-PEROXIDASE-PEROXYNITRITASE T KATG" /note="Mb1943c, katG, len: 740 aa. Equivalent to Rv1908c, len: 740 aa, from Mycobacterium tuberculosis strain H37Rv (99.9% identity in 740 aa overlap). katG, catalase-peroxidase-peroxynitritase T (EC 1.11.1.6) (see citations below), HPI. FASTA results: Q57215 CATALASE-PEROXIDASE from Mycobacterium tuberculosis (740 aa) opt: 5081, E(): 0, (100% identity in 740 aa overlap). Contains peroxidases active site signature (PS00436) and ATP/GTP-binding site motif A (P-loop; PS00017). Cosmid sequence was corrected to agree with a sequencing read from the H37Rv genome. DELETIONS OR DEFECTS IN KATG GENE CAUSE ISONIAZID (INH) RESISTANCE. BELONGS TO THE PEROXIDASE FAMILY. BACTERIAL PEROXIDASE/CATALASE SUBFAMILY. KATG TRANSCRIPTION SEEMS TO BE REGULATED BY FURA|Rv1909c PRODUCT. The catalase-peroxidase activity is associated with the amino-terminal domain but no definite function has been assigned to the carboxy-terminal domain. Protein product from Mb1943c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1943c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P46817" /db_xref="InterPro:IPR000763" /db_xref="InterPro:IPR002016" /db_xref="InterPro:IPR010255" /db_xref="InterPro:IPR019793" /db_xref="InterPro:IPR019794" /db_xref="UniProtKB/Swiss-Prot:P46817" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00546.1" /translation="MPEQHPPITETTTGAASNGCPVVGHMKYPVEGGGNQDWWPNRLN LKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDIEEVMTTSQPWWPADYGHYGPLF IRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKARRLLWPVKKKYGKKLS WADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWLGDERYSGKRDLEN PLAAVQMGLIYVNPEGPNGNPDPMAAAVDIRETFRRMAMNDVETAALIVGGHTFGKTH GAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVVWTNTPTKWDNSFLE ILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLRVDPIYER ITRRWLEHPEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAVSHDLV GEAEIASLKSQILASGLTVSQLVSTAWAAASSFRGSDKRGGANGGRIRLQPQVGWEVN DPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAGHNITVP FTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEM TVLVGGLRVLGANYKRLPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKD GSGKVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKFVQDFVAAWDKVMNLDRFDVR " CDS complement(2150839..2151282) /codon_start=1 /transl_table=11 /gene="furA" /locus_tag="BQ2027_MB1944C" /product="Ferric uptake regulation protein FurA (fur)" /note="Mb1944c, len: 147 aa. Equivalent to Rv1909c len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). FurA, Ferric uptake regulation protein, similar to Q48835 legionella pneumophila 130B (wadsworth) ferric uptake regulation (136 aa), FASTA results: opt: 230, E(): 2.5e-09, (32.3% identity in 133 aa overlap). Also similar to Mycobacterium tuberculosis zur zinc uptake regulatory protein, Rv2359. Belongs to the fur family. Start changed since original submission (-3 aa). Mb1944c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A583" /db_xref="InterPro:IPR002481" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P0A583" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00547.1" /translation="MSSIPDYAEQLRTADLRVTRPRVAVLEAVNAHPHADTETIFGAV RFALPDVSRQAVYDVLHALTAAGLVRKIQPSGSVARYESRVGDNHHHIVCRSCGVIAD VDCAVGEAPCLTASDHNGFLLDEAEVIYWGLCPDCSISDTSRSHP" CDS complement(2151396..2151989) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1945C" /product="PROBABLE EXPORTED PROTEIN" /note="Mb1945c, -, len: 197 aa. Equivalent to Rv1910c, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 197 aa overlap). Possible exported protein, very similar to upstream ORF MTCY180.07 (201 aa), FASTA score: E(): 0, (64.0% identity in 200 aa overlap). Also similar to Q9Z729|Y877_CHLPN PROTEIN CPN0877 from Chlamydophila pneumoniae (150 aa). Protein product from Mb1945c detected using SWATH mass spectrometry. Mb1945c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR005247" /db_xref="InterPro:IPR008914" /db_xref="InterPro:IPR036610" /db_xref="UniProtKB/Swiss-Prot:P67223" /protein_id="SIU00548.1" /translation="MAHAFHRFALAILGLALPVALVAYGGNGDSRKAAPLAPKAAALG RSMPETPTGDVLTISSPAFADGAPIPEQYTCKGANIAPPLTWSAPFGGALVVDDPDAP REPYVHWIVIGIAPGAGSTADGETPGGGISLPNSSGQPAYTGPCPPAGTGTHHYRFTL YHLPAVPPLAGLAGTQAARVIAQAATMQARLIGTYEG" CDS complement(2152072..2152677) /codon_start=1 /transl_table=11 /gene="lppC" /locus_tag="BQ2027_MB1946C" /product="PROBABLE LIPOPROTEIN LPPC" /note="Mb1946c, lppC, len: 201 aa. Equivalent to Rv1911c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Probable lipoprotein lppC, contains appropriately positioned prokaryotic membrane lipoprotein lipid attachment site (PS00013). Very similar to downstream ORF MTCY180.08 (204 aa) (although this lacks lipoprotein motif), FASTA score: opt: 831, E(): 0, (64.0% identity in 200 aa overlap). Also similar to Q9Z729|Y877_CHLPN HYPOTHETICAL PROTEIN CPN0877 from Chlamydia pneumoniae (strain CWL029) (150 aa). Protein product from Mb1946c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1946c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67225" /db_xref="InterPro:IPR005247" /db_xref="InterPro:IPR008914" /db_xref="InterPro:IPR036610" /db_xref="UniProtKB/Swiss-Prot:P67225" /protein_id="SIU00549.1" /translation="MTSTLHRTPLATAGLALVVALGGCGGGGGDSRETPPYVPKATTV DATTPAPAAEPLTIASPMFADGAPIPVQFSCKGANVAPPLTWSSPAGAAELALVVDDP DAVGGLYVHWIVTGIAPGSGSTADGQTPAGGHSVPNSGGRQGYFGPCPPAGTGTHHYR FTLYHLPVALQLPPGATGVQAAQAIAQAASGQARLVGTFEG" CDS complement(2152777..2153781) /codon_start=1 /transl_table=11 /gene="fadB5" /locus_tag="BQ2027_MB1947C" /product="POSSIBLE OXIDOREDUCTASE FADB5" /note="Mb1947c, fadB5, len: 334 aa. Equivalent to Rv1912c, len: 334 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 334 aa overlap). Possible fadB5, oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases: 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), quinone oxidoreductases (EC 1.6.5.5), and polyketide synthases, e.g. NP_104067.1|NC_002678 probable oxidoreductase from Mesorhizobium loti (308 aa); NP_464140.1|NC_003210 protein similar to oxidoreductase from Listeria monocytogenes (313 aa); NP_193889.1|NC_003075 putative NADPH quinone oxidoreductase from Arabidopsis thaliana (325 aa); NP_001880.2|NM_001889 crystallin, zeta; quinone oxidoreductase; NADPH:quinone reductase from Homo sapiens (329 aa); part 2983 to 3197 of T17410 polyketide synthase type I from Streptomyces venezuelae (3739 aa); Q53927|SCBAC20F6.16 HYDROXYACYL-COA DEHYDROGENASE from Streptomyces coelicolor (329 aa), FASTA scores: opt: 621, E(): 2e-30, (39.5% identity in 349 aa overlap); etc. Also similar to many hypothetical Mycobacterium tuberculosis proteins including: MTCY24G1.09, MTCY13D12.11, MTCY19H9.01, MTCY24G1.03, MTCY03A2.17c, etc. Contains quinone oxidoreductase/zeta-crystallin signature (PS01162). Protein product from Mb1947c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1947c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZP6" /db_xref="InterPro:IPR002364" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP6" /protein_id="SIU00550.1" /translation="MRAVVITKHGDPSVLQVRQRPDPPPPGPGQLRVAVRAAGVNFAD HLARVGLYPDAPKLPAVVGYEVAGTVEAVGDGVDPNRVGERVLAGTRFGGYCEIVNVA ATDSVVLPDALSFEQGAAVPVNYATAWAALHGYGSLRAGERVLIHAAAGGVGIAAVQF AKAAKAEVHGTASPQKHQKLAEFGVDRAIDYRRDGWWQGLGPYDVVLDALGGTSLRRS YTLLRPGGRLVGYGISNMQHGEKRSMRRVAPHALSMLRGFNLMKQLEESKTVIGLNML RLWDDRRTLEPWIAPLTKALNDGTILPIVHAIVPFAEAPEAHRILAARENVGKVVLVP " CDS 2153881..2154633 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1948" /product="MBL-fold metallo-hydrolase superfamily" /note="Mb1948, -, len: 250 aa. Equivalent to Rv1913, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Conserved hypothetical protein, slight similarity to dehydrase and beta-lactamase precursors e.g. Q02057 DEHYDRASE from Streptomyces coelicolor (297 aa), FASTA scores: opt: 184, E(): 4.3e-05, (31.6% identity in 215 aa overlap). Protein product from Mb1948 detected using SWATH mass spectrometry. Mb1948 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP2" /protein_id="SIU00551.1" /translation="MHFDWERLTDSVHRCRLPFCDVTVGLVRGRTGILLVDTGTTLGE ATAIAADVKQIAGCQVTHVVLTHKHFDHVLGSSVFDQAEVFCAPEVVEYLRSATDRLR EDALSYGADTAEVDRAIAALKPPQHGIYDAAVDLGDRTVTITHPGSGHTTADLVVVAP ATGHADGPTVVFTGDLVEESADPDIDADSDLAAWPATLDRVLAIGGPDASYVPGHGKV VDAQFVRRQRAWLRTRASRQPRETPATLPCKR" CDS complement(2154611..2155018) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1949C" /product="unknown protein" /note="Mb1949c, -, len: 135 aa. Equivalent to Rv1914c, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Hypothetical unknown protein. Protein product from Mb1949c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1949c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZS2" /protein_id="SIU00552.1" /translation="MVLSRTSTGRVILVPTQLRFDRWFLPLAVPLGLGPKNSELWVGA GSLHVKMGWAFAADIPLTSITKAEATNARVYAAGVHFGFGRWLVNGSRKGLVALTIDP PEQAKMWKKSMTVRELWVSVTDPDALVTACTAK" CDS 2155153..2157453 /codon_start=1 /transl_table=11 /gene="aceA" /locus_tag="BQ2027_MB1950" /product="probable isocitrate lyase aceab [second part] (isocitrase) (isocitratase) (icl)" /note="Mb1950, aceA, len: 766 aa. Similar to Rv1915 and Rv1916, len: 367 aa and 398 aa, from Mycobacterium tuberculosis strain H37Rv, (89.1% identity in 339 aa overlap and 100.0% identity in 398 aa overlap). Probable aceA, isocitrate lyase (EC 4.1.3.1). Contains PS00161 Isocitrate lyase signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, aceAa and aceAb exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-t) leads to a single product. Protein product from Mb1950 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1950 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZA8" /db_xref="InterPro:IPR006254" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR018523" /db_xref="InterPro:IPR039556" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/Swiss-Prot:Q7TZA8" /protein_id="SIU00553.1" /translation="MAIAETDTEVHTPFEQDFEKDVAATQRYFDSSRFAGIIRLYTAR QVVEQRGTIPVDHIVAREAAGAFYERLRELFAARKSITTFGPYSPGQAVSMKRMGIEA IYLGGWATSAKGSSTEDPGPDLASYPLSQVPDDAAVLVRALLTADRNQHYLRLQMSER QRAATPAYDFRPFIIADADTGHGGDPHVRNLIRRFVEVGVPGYHIEDQRPGTKKCGHQ GGKVLVPSDEQIKRLNAARFQLDIMRVPGIIVARTDAEAANLIDSRADERDQPFLLGA TKLDVPSYKSCFLAMVRRFYELGVKELNGHLLYALGDSEYAAAGGWLERQGIFGLVSD AVNAWREDGQQSIDGIFDQVESRFVAAWEDDAGLMTYGEAVADVLEFGQSEGEPIGMA PEEWRAFAARASLHAARAKAKELGADPPWDCELAKTPEGYYQIRGGIPYAIAKSLAAA PFADILWMETKTADLADARQFAEAIHAEFPDQMLAYNLSPSFNWDTTGMTDEEMRRFP EELGKMGFVFNFITYGGHQIDGVAAEEFATALRQDGMLALARLQRKMRLVESPYRTPQ TLVGGPRSDAALAASSGRTATTKAMGKGSTQHQHLVQTEVPRKLLEEWLAMWSGHYQL KDKLRVQLRPQRAGSEVLELGIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAE LRQKRLMTLIHLWLVHRFKAQAVHYVTPTDDNLYQTSKMKSHGIFTEVNQEVGEIIVA EVNHPRIAELLTPDRVALRKLITKEA" CDS complement(2157623..2163355) /codon_start=1 /transl_table=11 /gene="PPE34" /locus_tag="BQ2027_MB1951C" /product="ppe family protein ppe34" /note="Mb1951c, PPE34, len: 1910 aa. Similar to Rv1917c, len: 1459 aa, from Mycobacterium tuberculosis strain H37Rv. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, MPTR subfamily (see citation below). Similar to MTCY28.16, MTCY13E10.17, MTCY63.10, MTV004.05, MTCY98.24, MTCY6G11.05, etc. C-terminus is identical to Q50471. Unknown Mycobacterium tuberculosis protein (693 aa), FASTA results: opt: 2635, E(): 0, (99.7% identity in 391 aa overlap). Start changed since original submission (+23 aa). Thougth to be surface exposed, cell-wall associated. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, deletions of 12 bp and 69 bp, insertions of 483 bp and 375 bp, and a large substitution of 52 bp to 628 bp, leads to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (1910 aa versus 1459 aa)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00554.1" /translation="MNFSTLPPEINSALIFGGAGSEPMSAAAVAWDQLAMELASAAAS FNSVTSGLVGESWLGPSSAAMAAAVAPYLGWLAAAAAQAQRSATQAAALVAEFEAVRA AMVQPALVAANRSDLVSLVFSNFFGQNAPAIAAIEAAYEQMWAIDVSVMSAYHAGASA VASALTPFTAPPQNLTDLPAQLAAAPAAVVTAAITSSKGVLANLSLGLANSGFGQMGA ANLGILNLGSLNPGGNNFGLGNVGSNNVGLGNTGNGNIGFGNTGNGNIGFGLTGDNQQ GFGGWNSGTGNIGLFNSGTGNIGIGNTGTGNFGIGNSGTSYNTGIGNTGQANTGFFNA GIANTGIGNTGNYNTGSFNLGSFNTGDFNTGSSNTGFFNPGNLNTGVGNTGNVNTGGF NSGNYSNGFFWRGDYQGLIGFSGTLTIPAAGLDLNGLGSVGPITIPSITIPEIGLGIN SSGALVGPINVPPITVPAIGLGVSSSGALVGPINVPPITVPAIGLGVSSSGALVGPIN VPPITVPAIGLGVSSSGALVGPINVPPITVPAIGLGVSSSGALVGPINVPPITVPAIG LGVSSSGALVGPINVPPITVPAIGLGVSSSGALVGPINVPPITVPAIGLGVSSSGALV GPINVPPITVPAIGLGVSSSGALVGPINVPPITVPAIGLGVSSSGALVGPINVPPITL NSIGLELSAFQVINVGSISIPASPLAIGLFGVNPTVGSIGPGSISIQLGTPEIPAIPP FFPGFPPDYVTVSGQIGPITFLSGGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPL GIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGL GPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPD GYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIP LGIDVGGAIGPLTTPPITIPAIPLGIDVSGSLGPINIPIEIAGTPGFGNSTTTPSSGF FNSGTGGTSGFGNVGSGGSGFWNIAGNLGNSGFLNVGPLTSGILNFGNTVSGLYNTST LGLATSAFHSGVGNTDSQLAGFMRNAAGGTLFNFGFANDGTLNLGNANLGDYNVGSGN VGSYNFGSGNIGNGSFGFGNIGSNNFGFGNVGSNNLGFANTGPGLTEALHNIGFGNIG GNNYGFANIGNGNIGFGNTGTGNIGIGLTGDNQVGFGALNSGSGNIGFFNSGNGNIGF FNSGNGNVGIGNSGNYNTGLGNVGNANTGLFNTGNVNTGIGNAGSYNTGSYNAGDTNT GDLNPGNANTGYLNLGDLNTGWGNIGDLNTGALISGSYSNGILWRGDYQGLIGYSDTL SIPAIPLSVEVNGGIGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNAL GGVGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVGPITVPGVP ISRIPLTINIRIPVNITLNELPFNVAGIFTGYIGPIPLSTFVLGVTLAGGTLESGIQG FSVNPFGLNIPLSGATNAVTIPGFAINPFGLNVPLSGGTSPVTIPGFAINPFGLDVPL SGGTNAVTIPGFAINPFGLDVPLSGGTSPVTIPGFAINPFGLDVPLSGGTNAVTIPGF AINPFGLDVPLSGGTNAVTIPGFAINPFGLDVPLSGGTSPVTIPGFAINPFDLNVPLS GGTNAVTIPGFAINPFGLNVPLSGGTSPVTIPGFAINPFGLNVPLSGGTSPVTIPGFT IPGSPLNLTANGGLGPINITSAPGFGNSTTTPSSGFFNSGDGSASGFGNVGPGISGLW NQVPNALQGGVSGIYNVGQLASGVANLGNTVSGFNNTSTVGHLTAAFNSGVNNIGQML LGFFSPGAGP" CDS complement(2163693..2164697) /codon_start=1 /transl_table=11 /gene="PPE35b" /locus_tag="BQ2027_MB1952C" /product="PPE FAMILY PROTEIN" /note="Mb1952c, PPE35b, len: 334 aa. Equivalent to 3' end of Rv1918c, len: 987 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 334 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins. Similar to MTCY28.16|Z95890 M. tuberculosis cosmid (1053 aa), FASTA scores: opt: 3404, E(): 0, (65.6% identity in 1058 aa overlap). Also similar to MTV004.05, MTY13E10.17, MTV014.03, MTCY3C7.23, MTCY6G11.05, MTCY48.17, MTV004.03, MTCY31.07, MTCY4C12.36, MTCY180.01, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE35 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits PPE35 into 2 parts, PPE35a and PPE35b. Mb1952c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I4" /protein_id="SIU00555.1" /translation="MVVFIPNNITALQTNMPGVFPQIGGFANTPPAFINTGTITVGGG QINGVGFSIGAINVTPFTLPNVVIQPWSLGGISVDGFTLPEISTQEFTTPALTISPIG VGALSLPDIITQQFTTPELTIDPITLGGFTLPQLSIPAITTPAFTIDPIALGGFTLPQ IMTPEITTPPFAIDPIGLSGFTLPQVNIPEITTPEFTIQPVGLAAFTTPALTIARIHL PSTTMGGFAIPAGPGYFNSSATPSSGFFNAGIGGNSGFGNSGSGLSGWFNTSPVGLLA GSGYQNYGGLISGFSNLGSGISGFANTGTLPFAVTSLVSGLANIGNNLSGLFFQSTTP " CDS complement(2164780..2166657) /codon_start=1 /transl_table=11 /gene="PPE35a" /locus_tag="BQ2027_MB1953C" /product="PPE FAMILY PROTEIN" /note="Mb1953c, PPE35a, len: 625 aa. Equivalent to 5'end of Rv1918c, len: 987 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 599 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins. Similar to MTCY28.16|Z95890 M. tuberculosis cosmid (1053 aa), FASTA scores: opt: 3404, E(): 0, (65.6% identity in 1058 aa overlap). Also similar to MTV004.05, MTY13E10.17, MTV014.03, MTCY3C7.23, MTCY6G11.05, MTCY48.17, MTV004.03, MTCY31.07, MTCY4C12.36, MTCY180.01, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE35 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits PPE35 into 2 parts, PPE35a and PPE35b. Mb1953c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR2" /protein_id="SIU00556.1" /translation="MHYSVLPPEINSALIFAGAGSGPMLAAASAWDGLATELASAAVS FGSVTAGLVGGSWQGRSSVAMAAAAAPYAGWLAAAATQAEQAATQAQVMVAEFEAVRL AMVQPALVAANRSGLISLVISNLFGQNAPAIAAAEAAYEEMWALDVSAMAAYHSGASA VAVALPAFALPLRLPAGLAAGPAAVVTALTTAVGMPTFAGRAIAASLGLANVGGGNLG NANNGLGNIGNANLGNNNLGSGNFGSFNIGSANLGGNNIGIGNAGANNFGLANLGNLN TGFANAGIGNFGIANTGNNNIGNGLTGNNQIGIGGLNSGNGNVGLFNAGSANIGFFNS GNGNFGIGNSGNFSTGLFNPGHGNTGFLNAGSFNTGMFDVGNANTGSFNVGHYNFGAF NPGPSNTGTFNTGGANTGWFNTGSINTGAFNIGDMNNGLFNTGDMNNGVFYRGVGQGS LQFAITSPDLTLPSLEIPGISVPAFSLPAITLPSLTIPAVTTPANVTVGAFDLPGLTV PSLTIPAAMTPANITVGAFDLPGLTVPSLTIPATTTPANITVGAFNLPQLSIPSVTVP PITIPAGTALGAFNLPTLSIPSVTVPPITIPAGNHCRRIYATHDTHPVNKYTPNKYRR L" CDS complement(2167106..2167570) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1954C" /product="conserved protein" /note="Mb1954c, -, len: 154 aa. Equivalent to Rv1919c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Conserved hypothetical protein, shows weak similarity to several major pollen antigens e.g. Z72431|BVGC25_1 MAJOR ALLERGEN BET V 1 from Betula verrucosa (160 aa), FASTA scores: opt: 133, E(): 0.012, (26.8% identity in 149 aa overlap). Also shows some similarity to Rv2574|MTCY227.27C Hypothetical protein from Mycobacterium tuberculosis (167 aa), (27.4% identity in 124 aa overlap). Protein product from Mb1954c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1954c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y050" /protein_id="SIU00557.1" /translation="MSGRKFSFEVTKTSSAPAATLFRLVTDGGNWATWAKPIVAQSSW ARRGDPAPGGIGAIRKLGMWPVFVQEETVEYEQDRRHVYKLVGARTPVQDYFGEVVLT PNASGGTDLRWSGSFTEKVRGTGPVMRAALGGAVRFFAGQLVKAAEREAVRR" CDS 2167668..2168531 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1955" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb1955, -, len: 287 aa. Equivalent to Rv1920, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 287 aa overlap). Probable membrane protein, similar to AL0215|SC10A5.04 putative membrane protein from Streptomyces coelicolor cosmid 10A5 (295 aa), FASTA scores: opt: 292, E(): 3.6e-13, (31.3% identity in 243 aa overlap). Also weakly similar to several Mycobacterial putative proteins with unknown function e.g. Rv0502, Rv1428c, U00018_22 Mycobacterium leprae cosmid B2168. Protein product from Mb1955 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1955 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZR3" /db_xref="InterPro:IPR002123" /db_xref="InterPro:IPR016676" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR3" /protein_id="SIU00558.1" /translation="MFPRWPQQAHNHEVSRADTVSVPRAPTQAEVAAVLRIMTPLRKV IKPKVYGIENVPTERALLVGNHNTLGLVDAPLLAAELWERGRIVRSLGDHAHFKIPGW RDALTRTGVVEGTREITSELMRRGELVIVFPGGAREVNKRKNERYKLVWKNRLGFARL AIQHGYPIVPFASVGAEHGIDIVLDNESPLLAPVQFLAEKLLGTKDGPALVRGVGLTP VPRPERQYYWFGEPIDTTEFMGQQADDNAARRVRERAAAAIEHGIELMLAERAADPNR SLVGRLLRSDA" CDS complement(2168569..2169840) /codon_start=1 /transl_table=11 /gene="lppF" /locus_tag="BQ2027_MB1956C" /product="PROBABLE CONSERVED LIPOPROTEIN LPPF" /note="Mb1956c, lppF, len: 423 aa. Equivalent to Rv1921c, len: 423 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 423 aa overlap). Probable lppF, conserved lipoprotein, similar to G403173 lipoprotein precursor (fragment) from Rhodococcus erythropolis (225 aa), fasta scores: opt: 364, E(): 9.2e-19, (41.9% identity in 148 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS0" /protein_id="SIU00559.1" /translation="MVRLIPSLLAMATVLGGVIGCSAHQPPTPASGCRQLDAFLKWHH GVREFLQSAIDANSRCTGTADGSARKVAIFDWDNTVVKNDIGYATNYYMLQHSLVLQP ANQDWHAASRYLTDAAANALSVACGKVVPAGKPLPTGSNALCANEILSLLDGETTTGQ PAFVGNNVRRLAGPYAWSNALSAGYTAEELAGFADQAKKQNLAADVGATQQVGTQQVD GYIRVYPQMKDLIGTLQAHGIDTWVVSASPEPIVKVWAGEVGLDDQHVVGVRSVADQS GKLTAHLVGCGGVRDGDDSVMTYLDGKRCWANQVIFGVTGPQAFNQLAADRRQVLAAG DSNSDATFVGDATVVSLVINRNQDDLMCRAYDGLFTRGGKWAINPMFIDPLPQHAPYV CGEAFINPDGSKQPVLRNDGTPIPDQVDSVF" CDS 2170112..2171227 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1957" /product="PROBABLE CONSERVED LIPOPROTEIN" /note="Mb1957, -, len: 371 aa. Equivalent to Rv1922, len: 371 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 371 aa overlap). Probable conserved lipoprotein, possibly peptidase (EC 3.4.-.-) similar to many peptidases, e.g. P15555|DAC_STRSQ D-alanyl-D-alanine carboxypeptidase from Streptomyces sp. (406 aa), FASTA scores: opt: 382, E(): 3.1e-17, (28.0% identity in 379 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1497, Rv2463, Rv3775, etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb1957 detected using SWATH mass spectrometry. Mb1957 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XZQ7" /protein_id="SIU00560.1" /translation="MDSTVTASIRRMLGLLAATLLLGGCTGQHTTRTAASTTYTPHIK ASSQDVLDGAINADEPGCSAAVGVEGKVIWSGVRGIADLASGAKITTDTVFDIASVSK QFTATAILLLVEAGKLTLDDPISQYVPELPDWAQTVTVEQLMHQTSGIPDYVALLAAR GYQVSDRTIEAEARQALAAAPELQFKPGTRFDYSNSNYLLLGEIVHRASGQPLPEFLS AEIFQPLGLAMVVDPVGKVPNKAVSYEKGTGGNRSEYRVGNPAWEQIGDGGIQTTPSQ LARWADNYRTGSVGGLKLLEAQLAGAVETEPGGGDRYGAGIVSRADGTLDHAGAWAGF VTAFHISSDRRTSVAISCNTDKPDPVAMADALGRLWM" CDS 2171218..2172558 /codon_start=1 /transl_table=11 /gene="lipD" /locus_tag="BQ2027_MB1958" /product="PROBABLE LIPASE LIPD" /note="Mb1958, lipD, len: 446 aa. Equivalent to Rv1923, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 446 aa overlap). Probable lipD, hydrolase lipase (EC 3.1.-.-), similar to esterases and beta-lactamases e.g. G151214 esterase, (389 aa), fasta scores: opt: 569, E(): 5.4e-29, (33.7% identity in 401 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1497, Rv2463, Rv3775, etc. Protein product from Mb1958 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1958 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3XZP8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00561.1" /translation="MDVAGLPRLAAGTQAAIIHGMAQPPSLLTTDNGLPFGVQGACDS RFTGVIRAFAGLYPGRKFGGGALSVYIDDRQVVDVWTGWSDRQGKVPWTADTGAMVFS ATKGLAATVIHRLVDRGLLSYDAPVAEYWPEFGANGKSEVTVSDVLRHRSGLAHLKGV DKDEVMDHLLMEQKLAAAPLNRQHGKLAYHAVTYGWLLSGLARAVTGKGMRELFREEL ARPLNTDGIHLGRPPADSPTKAAQTLLPQAKVPTPLLDFIAPKVAGLSFSGLLGAVYF PGILSLLQDDMPFLDGEVPAVNGVVTARALAKTYGALANDGVIDGTRLLSSQAVRGLT GKSELWPDLNLGLPFTYHQGYQSSPVPGLLEGYGHIGLGGTIGWADPETGSAFGYVHN RLLTLLLFDIGSFAGLAALLNSAVVAARRDDPLEVPHFGAPYSEPRHEQAASGA" CDS complement(2172595..2172975) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1959C" /product="unknown protein" /note="Mb1959c, -, len: 126 aa. Equivalent to Rv1924c, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Hypothetical unknown protein. Protein product from Mb1959c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1959c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZT1" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT1" /protein_id="SIU00562.1" /translation="MDPADVINPTSTRDAALARVLAYRQRVRARPLLIRATLAVVGGG LFVVSLPMIVLLPELGIPALLVAFRLLAVEAQWAVRAYAWTDWRFTQLREWFHRQVLV TRAAILVGLFLAAVALVWLLVYEF" CDS 2173132..2174994 /codon_start=1 /transl_table=11 /gene="fadD31" /locus_tag="BQ2027_MB1960" /product="PROBABLE ACYL-COA LIGASE FADD31 (ACYL-COA SYNTHETASE) (ACYL-COA SYNTHASE)" /note="Mb1960, fadD31, len: 620 aa. Equivalent to Rv1925, len: 620 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 620 aa overlap). Probable fadD31, acyl-CoA synthetase (EC 6.2.1.-), highly similar to others from Mycobacterium leprae e.g. NP_301198.1|NC_002677 putative acyl-CoA synthetase (635 aa); NP_302537.1|NC_002677 probable acyl-CoA synthase (583 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. fadD32 (637 aa); fadD21 (578 aa); fadD29 (619 aa); fadD26|FD26_MYCTU|Q10976 (626 aa), FASTA scores: opt: 945, E(): 0, (39.8% identity in 598 aa overlap); etc. Also similar to N-terminus of G1171128 SAFRAMYCIN MX1 SYNTHETASE B from Myxococcus xanthus (1770 aa), FASTA scores: opt: 845, E(): 0, (37.4% identity in 593 aa overlap); N-terminus of T34918 polyketide synthase from Streptomyces coelicolor (2297 aa); etc. Protein product from Mb1960 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1960 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZQ8" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3XZQ8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00563.1" /translation="MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNI KYVGDLVAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAPQG IDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTAAAKNAVEGFL NNVPRLRKPTVLVIDQIPDREGELFVPVELDIDAVSHLQYTSGSTRPPVGVEITHRAV GTNLVQMILSIDLLNRNTHGVSWLPLYHDMGLSMIGFPAVYGGHSTLMSPTAFVRRPL RWIQALSEGSRTGRVVTAAPNFAYEWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAV TTFNKAFAPYGLPRTAFKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVA PDAPNAVVHVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGRIADLLTID GRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRLVIIAERAAGTSRSDPR PALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTSGKLARQACRAQYLSGRLGVH" CDS complement(2175002..2175481) /codon_start=1 /transl_table=11 /gene="mpt63" /locus_tag="BQ2027_MB1961C" /standard_name="mpb63" /product="immunogenic protein mpt63 (antigen mpt63/mpb63) (16 kda immunoprotective extracellular protein)" /note="Mb1961c, mpt63, len: 159 aa. Equivalent to Rv1926c, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). mpt63 (alternate gene name: mpb63), immunogenic protein (see citations below), identical to MPT63|MPB63 from Mycobacterium bovis (159 aa). Exported protein containing a N-terminal signal sequence: see notes below about proteomics. Protein product from Mb1961c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1961c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5Q3" /db_xref="InterPro:IPR015250" /db_xref="InterPro:IPR029050" /db_xref="UniProtKB/Swiss-Prot:P0A5Q3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00564.1" /translation="MKLTTMIKTAVAVVAMAAIATFAAPVALAAYPITGKLGSELTMT DTVGQVVLGWKVSDLKSSTAVIPGYPVAGQVWEATATVNAIRGSVTPAVSQFNARTAD GINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMNNGMEDLLIWEP" CDS 2175718..2176491 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1962" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1962, -, len: 257 aa. Equivalent to Rv1927, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 257 aa overlap). Conserved hypothetical protein, similar to SCG11A.10c|AL133210 hypothetical protein from Streptomyces coelicolor (252 aa), FASTA scores: opt: 729, E(): 0, (48.3% identity in 238 aa overlap). Slight similarity with P54543|YQJF_BACSU hypothetical 23.9 kd protein from Bacillus subtilis (209 aa), FASTA scores, opt: 230, E(): 2.8e-08, (28.0% identity in 164 aa overlap). Protein product from Mb1962 detected using SWATH mass spectrometry. Mb1962 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018644" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J4" /protein_id="SIU00565.1" /translation="MTAIPGPSGAEPGESRALAGYPVTPPALPRPVIFDQRWTDLTFI HWPVLPESVAGSYPPGTRPDVFADGMTYVGLVPFRMSSTKLGTALPIPYVGTFPETNV RLYSIDNAGRHGVLFRSLETARLTVVPLTRIGLGIPYAWSRMRMMRSGKHITYHSVRR WPRRGLRSLLTITIGDLVEPTPLEVWLTARWGAHTRKAGRTWWVPNEHKPWPLRAAEI AELNDELIDASGVQPTGDRLRALFSPGVHARFGRPCVVQ" CDS complement(2176495..2177262) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1963C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb1963c, -, len: 255 aa. Equivalent to Rv1928c, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (261 aa); P87219|SOU1_CANAL SORBITOL UTILIZATION PROTEIN (SDR FAMILY) from Candida albicans (281 aa); P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 541, E(): 1.2e-27, (37.5% identity in 251 aa overlap); etc. Also similar to many mycobacterial tuberculosis proteins e.g. Rv1350, Rv0927c, Rv2002, Rv0769, Rv2766c, etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb1963c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1963c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZS7" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS7" /protein_id="SIU00566.1" /translation="MSVLDLFDLHGKRALITGASTGIGKRVALAYVEAGAQVAIAARH LDALEKLADEIGTSGGKVVPVCCDVSQHQQVTSMLDQVTAELGGIDIAVCNAGIITVT PMLDMPLEEFQRLQNTNVTGVFLTAQAAAKAMVKQGQGGVIINTASMSGHIINVPQQV SHYCASKAAVIHLTKAMAVELAPHKIRVNSVSPGYILTELVEPYTEYQPLWEPKIPLG RLGRPEELAGLYLYLASEASSYMTGSDIVIDGGYTCP" CDS complement(2177307..2177951) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1964C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1964c, -, len: 214 aa. Equivalent to Rv1929c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (99.533% identity in 214 aa overlap). Conserved hypothetical protein, similar to SC4G6.14|AL096884 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 416, E(): 2.4e-22, (39.8% identity in 206 aa overlap). Protein product from Mb1964c detected using SWATH mass spectrometry. Mb1964c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR017519" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/TrEMBL:A0A1R3Y060" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00567.1" /translation="MADVPLDAQERLELCDLLEELGPAVATLIEGWTAHDLAAHIVLR ERDLVAGLCIVLPGPFQRFAERRRARLAQSKDFTWLVARIRSGPPMGFFRIGWVRTLA NLNEFFVHHEDVRRASGRGPRSLTPEMDAALWRNVRRGSHFLSRRLHGCGLEIEWVGT GKRVRVRSGEPTARLTGPPGELLLYVFGRRAVARVEVSGPLEAIAAVHRTHFGM" CDS complement(2177963..2178487) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1965C" /product="Putative intracellular protease/amidase" /note="Mb1965c, -, len: 174 aa. Equivalent to Rv1930c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 174 aa overlap). Conserved hypothetical protein, similar to SC5F2A.30|AL049587 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 307, E(): 2.8e-13, (54.8% identity in 84 aa overlap). Some similarity to M. tuber culosis hypothetical protein Rv0052|MTCY21D4.15 (43% identity in 93 aa overlap)." /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR9" /protein_id="SIU00568.1" /translation="MTQIAFVAYPGVTALDVVGPYEVLRNLPHAQVRFVWLRGRRATS HWLTLPALKAFGAIPVADERIVHQDNIVTSAGVSAGLDLALWLAGQLGGEARAKAIQL AIEYDPQPPFDSGHMSKASPTTKAAATALLSKDSAKPANLTAATLLAWERALAAVQSR RRKRQPVGAQARRP" CDS complement(2178505..2179284) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1966C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb1966c, -, len: 259 aa. Equivalent to Rv1931c, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Probable transcriptional regulatory protein. Similarity in C-terminal half to transcriptional activators e.g. Q43970 ARAC-LIKE PROTEIN (227 aa), FASTA scores: opt: 238, E(): 7.1e-07, (42.4% identity in 92 aa overlap). Similar to many probable transcription regulators in Streptomyces e.g. AL049587|SC5F2A.29 Streptomyces coelicolor (325 aa), FASTA scores: opt: 387, E(): 3.2e-16, (34.4% identity in 259 aa overlap). Mb1966c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZS8" /db_xref="InterPro:IPR002818" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS8" /protein_id="SIU00569.1" /translation="MVIVGFPGDPVDTVILPGGAGVDAARSEPALIDWVKAVSGTARR VVTVCTGAFLAAEAGLLGRTPSDDALGLCRTFRPRISGRSGRCRPDLHAQFAEGVDRG WSHRRHRPRAGTGRRRPRHRDCPDGCPLARPVSAPTRWADPVRGSGVDATRQTDLDPP GAGGHRGRAGGAHRIGELAQRAAMSPRHFTRVFSDEVGEAPGRYVERIRTEAARRQLE ETHDTVVAIAARCGFGTAETMRRSFIRRVGISPDQYRKAFA" CDS 2179417..2179914 /codon_start=1 /transl_table=11 /gene="tpx" /locus_tag="BQ2027_MB1967" /standard_name="cfp20" /product="PROBABLE THIOL PEROXIDASE TPX" /note="Mb1967, tpx, len: 165 aa. Equivalent to Rv1932, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Probable tpx (alternate gene name: cfp20), thiol peroxidase (EC 1.11.1.-) similar to TPX_ECOLI|P37901 thiol peroxidase (EC 1.11.1.-) (p20) from Escherichia coli (167 aa), fasta scores: opt: 535, E(): 7.3e-25, (52.4% identity in 164 aa overlap). There are four other related enzymes in M. tuberculosis: Rv2428, Rv2521, Rv2238c, Rv1608c. Protein product from Mb1967 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1967 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66953" /db_xref="InterPro:IPR002065" /db_xref="InterPro:IPR013740" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR018219" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:P66953" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00570.1" /translation="MAQITLRGNAINTVGELPAVGSPAPAFTLTGGDLGVISSDQFRG KSVLLNIFPSVDTPVCATSVRTFDERAAASGATVLCVSKDLPFAQKRFCGAEGTENVM PASAFRDSFGEDYGVTIADGPMAGLLARAIVVIGADGNVAYTELVPEIAQEPNYEAAL AALGA" CDS complement(2179911..2181002) /codon_start=1 /transl_table=11 /gene="fadE18" /locus_tag="BQ2027_MB1968C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE18" /note="Mb1968c, fadE18, len: 363 aa. Equivalent to Rv1933c, len: 363 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 363 aa overlap). Probable fadE18, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. CAB61609.1|AL133210 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (362 aa); NP_421282.1|NC_002696 acyl-CoA dehydrogenase family protein from Caulobacter crescentus (344 aa); ACDS_RAT|P15651 short-chain specific acyl-CoA dehydrogenase from Rattus norvegicus (Rat) (412 aa), fasta scores: opt: 239, E(): 2.1e-08, (28.4% identity in 331 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. N-terminus of fadE22 (721 aa); fadE33 (318 aa); N-terminus of fadE34 (711 aa); etc. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /db_xref="GOA:A0A1R3XZR1" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR1" /protein_id="SIU00571.1" /translation="MDFRYSTEQDDFRASLRGFLGRGAPVREMAAADGSDRRLWQRLC TELELPALHVPPEHGGLGATLVETAIAFAELGRALTPIPFAATVFAIEAILRMGDDEQ RKRLLAGLLTGARIGTIAVSGHDVASATTVRAVRRDGRPALTGECTPVLHGHVADLFV VPAVADGSIVLHVVAADAPGVTVTPLPSFDITRPVATLRLAGSPAEPLTAGTPDDMER VLDVARVLLAAEMLGGAEACLDLAVQYAGRRTQFDRPIGSFQAVKHACADMMIEIDAT RATVMFAAMSAANGDELQTVAPLAKAQTAETFVLCAGSALQIHGAIAFTWEHDLHLYY RRAKTTEALFGSSARNRALLAERAGLVKA" CDS complement(2181004..2182233) /codon_start=1 /transl_table=11 /gene="fadE17" /locus_tag="BQ2027_MB1969C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE17" /note="Mb1969c, fadE17, len: 409 aa. Equivalent to Rv1934c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 409 aa overlap). Probable fadE17, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to ACD_MYCLE|P46703 acyl-CoA dehydrogenase from Mycobacterium leprae (389 aa), FASTA scores: opt: 414, E(): 2.6e-19, (28.3% identity in 407 aa overlap). Also similar to many e.g. NP_249713.1|NC_002516 probable acyl-CoA dehydrogenase from Pseudomonas aeruginosa (381 aa); NP_420614.1|NC_002696 acyl-CoA dehydrogenase family protein from Caulobacter crescentus (355 aa); CAB61610.1|AL133210 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (393 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. fadE30 (385 aa); fadE31 (377 aa); C-terminus of fadE34 (711 aa); etc. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /db_xref="GOA:A0A1R3XZU2" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU2" /protein_id="SIU00572.1" /translation="MDVSYPPEAEAFRDRIREFVAEHLPPGWPGPGALPPHEREEFAR HWRRALAGAGLVAVSWPTEYGGGGLSPMEQVVLAEEFARAGAPERAENDLLGIDLLGN TLIALGSEAQKRHFLPRILSGEHRWCQGFSEPEAGSDLASVRTRGVLDGDEWVINGHK IWTSAGTTANWIFLLARTDPSAAKHRGLSFLLVPMDQPGVVVRPIVNAAGHSSFSEVF LTDARTSAGNVVGRVGDGWSTAMTLLGFERGSHIATAAIDFERDLQRLCELARDRGLH TDPRVRDGLAWCYARVQIMRYRGYRDLTLALTGRPPGAEAAITKVIWSEYFRRYTDLA VEILGLEALGPRGPGNGGARLVPEAGTPNSPACWMDELLYARAATIYAGSSQIQRNVI GERLLGLPKEPRPEVLC" CDS complement(2182248..2183204) /codon_start=1 /transl_table=11 /gene="echA13" /locus_tag="BQ2027_MB1970C" /product="POSSIBLE ENOYL-COA HYDRATASE ECHA13 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb1970c, echA13, len: 318 aa. Equivalent to Rv1935c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Possible echA13, enoyl-CoA hydratase (EC 4.2.1.17), similar to others and various enzymes e.g. CAC48381.1|Y16952 putative enoyl-CoA-isomerase from Amycolatopsis mediterranei (269 aa); AAK18173.1|AF290950_5|AF290950|FadB1x enoyl-CoA hydratase from Pseudomonas putida (257 aa); AAF78820.1|AF042490 4-chlorobenzoyl CoA dehalogenase from Arthrobacter sp. TM1 (276 aa); ECHM_RAT|P14604 enoyl-coa hydratase mitochondrial precursor from Rattus norvegicus (Rat) (290 aa), FASTA scores: opt: 228, E(): 1.2e-08, (31.0% identity in 258 aa overlap); etc." /db_xref="GOA:A0A1R3XZR7" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3XZR7" /protein_id="SIU00573.1" /translation="MFVGRVGPVDRRSDGERSRRPREFEYIRYETIDDGRIAAITLDR PKQRNAQTRGMLVELGAAFELAEADDTVRVVILRAAGPAFSAGHDLGSADDIRERSPG PDQHPSYRCNGATFGGVESRNRQEWHYYFENTKRWRNLRKITIAQVHGAVLSAGLMLA WCCDLIVASEDTVFADVVGTRLGMCGVEYFGHPWEFGPRKTKELLLTGDCIGADEAHA LGMVSKVFPADELATSTIEFARRIAKVPTMAALLIKESVNQTVDAMGFSAALDGCFKI HQLNHAHWGEVTGGKLSYGTVEYGLEDWRAAPQIRPAIKQRP" CDS 2183429..2184538 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1971" /product="POSSIBLE MONOOXYGENASE" /note="Mb1971, -, len: 369 aa. Equivalent to Rv1936, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 369 aa overlap). Possible monooxygenase (EC 1.-.-.-), similar to LXA2_PHOLU|P23146 alkanal monooxygenase alpha chain (362 aa), FASTA scores: opt: 196, E(): 6.3e-06, (22.3% identity in 373 aa overlap). Also similar to many other Mycobacterium tuberculosis hypothetical oxidoreductases and monooxygenases e.g. Rv0953c, Rv0791c, Rv0132c, etc. Protein product from Mb1971 detected using SWATH mass spectrometry. Mb1971 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1V8" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1V8" /protein_id="SIU00574.1" /translation="MEIGIFLMPAHPPERTLYDATRWDLDVIELADQLGYVEAWVGEH FTVPWEPICAPDLLLAQALLRTQQIKLAPGAHLLPYHHPVELAHRVAYFDHLAQGRFM LGVGASGIPGDWALYDVDGKNGEHREMTREALEIMLRIWTEDEPWEHRGKYWNANGIA PMFEGLMRRHIKPYQKPHPPIGVTGFSAGSETLKLAGERGYIPMSLDLNTEYVATHWD AVEEGALRSGRTPDRRDWRLVREVLVAETDEQAFRYAVDGTMGRAMREYVLPTFRMFG MTKFYKHNPSVPDDEVTPEYLAENTFVVGSVQTVVDKLEATYDQVGGFGHLLILGFDY SDNPGPWKESLRLLAHEVMPRLNARLATKPATAVV" CDS 2184541..2187060 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1972" /product="POSSIBLE OXYGENASE" /note="Mb1972, -, len: 839 aa. Equivalent to Rv1937, len: 839 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 839 aa overlap). Possible oxygenase (EC 1.-.-.-), similar in N-terminus to N-terminal part (approx. 350 aa) of dioxygenases (including ring-hydroxylating dioxygenase electron transfer components) and monooxygenases, e.g. AAC34815.1|AF071556 anthranilate dioxygenase reductase from Acinetobacter sp. (343 aa); AAK52291.1|AY026914|AntC putative anthranilate dioxygenase reductase from Pseudomonas putida (340 aa); AAF63450.1|AF218267_7|AF218267 benzoate dioxygenase / ferredoxin reductase from Pseudomonas putida (336 aa); P23101|XYLZ_PSEPU toluate 1,2-dioxygenase electron transfer component [INCLUDES: FERREDOXIN; FERREDOXIN--NAD(+) REDUCTASE (EC 1.18.1.3)] from Pseudomonas putida plasmid TOL pWW0 (336 aa), FASTA scores: opt: 700, E(): 0, (34.3% identity in 335 aa overlap); S23479 probable benzoate 1,2-dioxygenase (EC 1.14.12.10) reductase component benC from Acinetobacter calcoaceticus (338 aa); AAC45294.1|U81594 soluble methane monooxygenase protein C from Methylocystis sp. (343 aa); P22868|MEMC_METCA METHANE MONOOXYGENASE COMPONENT C from Methylococcus capsulatus (348 aa); etc. Also similar in part to Mycobacterium tuberculosis hypothetical electron transfer proteins Rv3554, Rv3571, etc. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature." /db_xref="GOA:A0A1R3Y0K2" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001433" /db_xref="InterPro:IPR006058" /db_xref="InterPro:IPR008333" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR036010" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0K2" /protein_id="SIU00575.1" /translation="MAVRQVTVGYSDGTHKTMPVRCDQTVLDAAEEHGVAIVNECQSG ICGTCVATCTAGRYQMGRTEGLSDVERAARKILTCQTFVTSDCRIELQYPVDDNAALL VTGDGVVTAVELVSPSTAILRVDTSGMAGALRYRAGQFAQLQVPGTNVWRNYSYAHPA DGRGECEFIIRLLPDGVMSNYLRDRAQPGDHIALRCSKGSFYLRPIVRPVILVAGGTG LSAILAMAQSLDADVAHPVYLLYGVERTEDLCKLDELTELRRRVGRLEVHVVVARPDP DWDGRTGLVTDLLDERMLASGDADVYLCGPVAMVDAARTWLDHNGFHRVGLYYEKFVA SGAARRRTPARLDYAGVDIAEVCRRGRGTAVVIGGSIAGIAAAKMLSETFDRVIVLEK DGPHRRREGRPGAAQGWHLHHLLTAGQIELERIFPGIVDDMVREGAFKVDMAAQYRIR LGGTWKKPGTSDIEIVCAGRPLLEWCVRRRLDDEPRIDFRYESEVADLAFDRANNAIV GVAVDNGDADGGDGLQVVPAEFVVDASGKNTRVPEFLERLGVGAPEAEQDIINCFYST MQHRVPPERRWQDKVMVICYAYRPFEDTYAAQYYTDSSRTILSTSLVAYNCYSPPRTA REFRAFADLMPSPVIGENIDGLEPASPIYNFRYPNMLRLRYEKKRNLPRALLAVGDAY TSADPVSGLGMSLALKEVREMQALLAKYGAGHRDLPRRYYRAIAKMADTAWFVIREQN LRFDWMKDVDKKRPFYFGVLTWYMDRVLELVHDDLDAYREFLAVVHLVKPPSALMRPR IASRVLGKWARTRLSGQKTLIARNYENHPIPAEPADQLVNA" CDS 2187072..2188142 /codon_start=1 /transl_table=11 /gene="ephB" /locus_tag="BQ2027_MB1973" /product="PROBABLE EPOXIDE HYDROLASE EPHB (EPOXIDE HYDRATASE)" /note="Mb1973, ephB, len: 356 aa. Equivalent to Rv1938, len: 356 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 356 aa overlap). Probable ephB, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to many e.g. G1109600 ATSEH (EC 3.3.2.3) (321 aa), FASTA scores: opt: 442, E(): 1.2e-21 (33.1% identity in 356 aa overlap); etc. Also similar to many other M. tuberculosis hypothetical epoxide hydrolases e.g. Rv3617, Rv3670, Rv0134, etc. Protein product from Mb1973 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZT6" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT6" /protein_id="SIU00576.1" /translation="MSQVHRILNCRGTRIHAVADSPPDQQGPLVVLLHGFPESWYSWR HQIPALAGAGYRVVAIDQRGYGRSSKYRVQKAYRIKELVGDVVGVLDSYGAEQAFVVG HDWGAPVAWTFAWLHPDRCAGVVGISVPFAGRGVIGLPGSPFGERRPSDYHLELAGPG RVWYQDYFAVQDGIITEIEEDLRGWLLGLTYTVSGEGMMAATKAAVDAGVDLESMDPI DVIRAGPLCMAEGARLKDAFVYPETMPAWFTEADLDFYTGEFERSGFGGPLSFYHNID NDWHDLADQQGKPLTPPALFIGGQYDVGTIWGAQAIERAHEVMPNYRGTHMIADVGHW IQQEAPEETNRLLLDFLGGLRP" CDS 2188139..2188873 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1974" /product="PROBABLE OXIDOREDUCTASE" /note="Mb1974, -, len: 244 aa. Similar to Rv1939, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 167 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to NP_302637.1|NC_002677 probable oxidoreductase from Mycobacterium leprae (162 aa) Also similar to NTAB_CHELE|P54990 nitrilotriacetate monooxygenase component from Chelatobacter heintzii (322 aa), fasta scores: opt: 269, E(): 5.3e-11, (33.1% identity in 151 aa overlap). And similar to Mycobacterium tuberculosis probable monooxygenase components Rv0246, Rv3567, and to a lesser extent, Rv3007c. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (t-*) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis H37Rv (244 aa versus 171 aa)." /db_xref="GOA:A0A1R3Y070" /db_xref="InterPro:IPR002563" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y070" /protein_id="SIU00577.1" /translation="MSCTFDMVPETVDHLDEVGLRRVFGCFPCGVIAVCAMVDDQPVG MAASSFTSVSVDPPLVSICVQNCSTTWPKLRDRPRLGVSVLAEGHDAACMSLSRKEGN RFAGVFWSELSSGGVVIAGAGAWLDCRPYAEIPAGDHLIALLEICAVRADPETPPLVF HGSRFRRWSLDEDDRCAGTSCDHGDGGRSRRGPDRRPQWRWLSRLRRPGRDAAAGCLC GPAHLGLFARRAAGRRMRATAPAAHV" CDS 2188650..2189711 /codon_start=1 /transl_table=11 /gene="ribA1" /locus_tag="BQ2027_MB1975" /standard_name="ribA" /product="Probable Riboflavin biosynthesis protein ribA1 (GTP cyclohydrolase II)" /note="Mb1975, ribA1, len: 353 aa. Equivalent to Rv1940, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 353 aa overlap). Probable ribA1, Riboflavin biosynthesis protein (EC 3.5.4.25), similar to GCH2_BACSU|P17620 gtp cyclohydrolase ii (EC 3.5.4.25) (398 aa), FASTA scores: opt: 682, E(): 0, (37.7% identity in 363 aa overlap), also similar to Rv1415|MTCY21B4.33|ribA2 (428 aa) (45.4% identity in 368 aa overlap). Note that previously known as ribA." /db_xref="GOA:A0A1R3XZS4" /db_xref="InterPro:IPR000422" /db_xref="InterPro:IPR017945" /db_xref="InterPro:IPR032677" /db_xref="InterPro:IPR036144" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS4" /protein_id="SIU00578.1" /translation="MKTTDVRVRRAITAMAGGHAVVLTGDPNGDGYLVFAAQAATPRL VAFAVRHTSGYLRVALPGAECERLHLPPMCDRDTTHCVSVDVRGTGTGISASDRAWTI AALASATSVAADFQRPGHVVPVQAQADGVLGRRGPAEAAVDLARLAERRPAAALCEIV SPDNPVQMAHHAESVEFAVEHGLAMVSIGELVAYRRRIEPQVVRFTAATLPTWAGASR VIGFRDVYDLGEHLAVIVGAVGAGVPVPLHVHIECLTGDVFGSTACRCGEELNGALAR MSAQGSGVVLYLRPPGPAQACGLFARGDAATDVMPETVTWILRDLGVYAIRLSDDVPG FGLVMFGAIREASTLAAAG" CDS 2189708..2190478 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1976" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb1976, -, len: 256 aa. Equivalent to Rv1941, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, generally belonging to SDR family, e.g. NP_299015.1|NC_002488 2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase from Xylella fastidiosa (255 aa); NP_250340.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (253 aa); NP_106890.1|NC_002678 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mesorhizobium loti (374 aa) (has its N-terminus longter); P50197|LINC_PSEPA 2,5-dichloro-2,5-cyclohexadiene-1,4- dehydrogenase from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (250 aa), FASTA scores: opt: 529, E(): 5.7e-25, (40.6% identity in 251 aa overlap); etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Mb1976 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZV8" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV8" /protein_id="SIU00579.1" /translation="MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDA ADAAATKIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLIDTT VEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGGTGAYGMSKAG IIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFDGALGAGGARSMIARLQGR MAAPEEMAGIVVFLLSDDASMITGTTQIADGGTIAALW" CDS complement(2190688..2191017) /codon_start=1 /transl_table=11 /gene="mazf5" /locus_tag="BQ2027_MB1977C" /product="possible toxin mazf5" /note="Mb1977c, -, len: 109 aa. Equivalent to Rv1942c, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Conserved hypothetical protein, shows some similarity to Q10867|MTCY39.28|Rv1991 hypothetical 12.3 kd protein (114 aa), FASTA scores: opt: 117, E(): 0.021, (24. 5% identity in 110 aa overlap) also P33645|CHPA_ECOLI pemk-like protein 1 (mazf protein) from Escherichia coli (111 aa), FASTA scores: opt: 104, E(): 0.18, (29.1% identity in 110 aa overlap). Also similar to Mycobacterium tuberculosis Rv0659c (102 aa) (32.7% identity in 101 aa overlap); Rv1102c (33.3% identity in 93 aa overlap) and Rv1495. Protein product from Mb1977c detected using SWATH mass spectrometry. Mb1977c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZS9" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS9" /protein_id="SIU00580.1" /translation="MTALPARGEVWWCEMAEIGRRPVVVLSRDAAIPRLRRALVAPCT TTIRGLASEVVLEPGSDPIPRRSAVNLDSVESVSVAVLVNRLGRLADIRMRAICTALE VAVDCSR" CDS complement(2191014..2191391) /codon_start=1 /transl_table=11 /gene="maze5" /locus_tag="BQ2027_MB1978C" /product="possible antitoxin maze5" /note="Mb1978c, -, len: 125 aa. Equivalent to Rv1943c, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). Conserved hypothetical protein, showing some similarity with Rv1946c|MTCY09F9.18|lppG possible conserved lipoprotein from Mycobacterium tuberculosis (150 aa), FASTA score: (71.4% identity in 28 aa overlap). Protein product from Mb1978c detected using SWATH mass spectrometry. Mb1978c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZS3" /protein_id="SIU00581.1" /translation="MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWH SSGMNRIRLSTTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYD KHPVDEPDEWGDLASWRRAAGDS" CDS complement(2191388..2191978) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1979C" /product="conserved protein" /note="Mb1979c, -, len: 196 aa. Equivalent to Rv1944c, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 196 aa overlap). Conserved hypothetical protein, similar to C-terminal part of SCE20.29|AL136058|CAB65585.1 hypothetical protein from Streptomyces coelicolor (338 aa), BLASTP scores, Identities = 37/131 (28%), Positives = 51/131 (38%). Protein product from Mb1979c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1979c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR004027" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV1" /protein_id="SIU00582.1" /translation="MISDTEDFAHGDKAAPPRLRASYAACGGDAAGCWTMSDNGASRV PPVDETPAAESAEPITAVSLAWLPAGDYERALDLWPDFAGSDLVTGPDGPVAHPLYCR RMQQKLVEFAEAGFPGLAVAAIRVAPFAAWCAEQGQEPDSPEARAEYAAYLTAHGDHD VMAWPPGRNQQCWCGSGHKYKKCCAAASFIDTEPAP" CDS 2192033..2193397 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1980" /product="13E12 repeat family protein" /note="Mb1980, -, len: 454 aa. Equivalent to Rv1945, len: 454 aa, from Mycobacterium tuberculosis strain H37Rv, (99.780% identity in 454 aa overlap). Member of Mycobacterium tuberculosis REP13E12 repeat family. Similar to several others, best with Rv1148c|Z95584|MTCI65.15 (482 aa), FASTA score: opt: 2954, E(): 0, (97.1% identity in 454 aa overlap). Contains possible helix-turn-helix motif at aa 74-95 (+2.90 SD). Mb1980 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3XZS1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00583.1" /translation="MRSDTREEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLE VERRRQGAAEHALINQLAGQACEEELGGTLRTALANRLHITPGEASRRIAEAEDLGER RALTGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAEL ATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPEL RATIEAVLAKLAAPGACNPDDQTPLVDDTPDADAVRRDTRSQAQRHHDGLLAGLRGLL ASGELGQHRGLPVTVVVSTTLKELEAATGKGVTGGGSRVPMSDLIRMASNAHHYLALF DGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDI NDLTLACGPDNRLVEKGWKTRKNAKGDTEWLPPAHLDHGQPRINRYHHPEKILCEPDD DEPH" repeat_region 2192033..2193394 /rpt_family="REP" /note="REP-7, len: 1362 nt. Equivalent to REP, len: 1362 nt, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1362 nt overlap). REP09F9, member of the REP13E12 family." CDS complement(2193552..2194004) /codon_start=1 /transl_table=11 /gene="lppG" /locus_tag="BQ2027_MB1981C" /product="POSSIBLE LIPOPROTEIN" /note="Mb1981c, lppG, len: 150 aa. Equivalent to Rv1946c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 150 aa overlap). Possible lppG, conserved lipoprotein, showing some similarity to Rv1943c|MTCY09F9.21 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (125 aa), FASTA score: (71.4% identity in 28 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W8" /protein_id="SIU00584.1" /translation="MIRGSAVSGLLMPSVNGGTAGSVACVQCLFLPKVAVDLINLSGI QCFARIEHVAHAQAHPFVVLVGKPAQHGARIGAVAGAILTGDVIVSHDGELYRTVTAL RQNGPRPHASRRLHAPALCSARSRRGHLRPSCWLPPPRFAGRQSLVAR" CDS 2194068..2194469 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1982" /product="HYPOTHETICAL PROTEIN" /note="Mb1982, -, len: 133 aa. Equivalent to Rv1947, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Hypothetical unknown protein,Mb1982 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0K9" /protein_id="SIU00585.1" /translation="MDRYNDQASGRALIEIRLCNERATPMPIPIGLWMFQTKLHVNAG GADVFLPVCDVLEQDLAERDEEVRQLNLQYRNRLEYAIGRTCSAAWSVNGSRRPSAVW TTWLPVAETPHTRARSVENALLSMDSRGGVT" CDS complement(2194758..2195108) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1983C" /product="HYPOTHETICAL PROTEIN" /note="Mb1983c, -, len: 116 aa. Equivalent to Rv1948c, len: 116 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 116 aa overlap). Hypothetical unknown protein" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU5" /protein_id="SIU00586.1" /translation="MTVFRIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFD IDGVQQRIVRESGTADMELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLN SPAPTLMISVDEYA" CDS complement(2195119..2196021) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1984C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1984c, -, len: 300 aa. Equivalent to Rv1949c, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 300 aa overlap). Conserved hypothetical protein, partial ORF. Rv1949c and Rv1950c|MTCY09F9.14 are similar but frameshifted with respect to Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kd protein (323 aa), FASTA scores: opt: 459, E(): 2.8e-16, (54.8% identity in 157 aa overlap). Cosmid sequence appears to be correct, genomic sequence is also frameshifted in Mycobacterium bovis strain AF2122/97. Similar to M. tuberculosis hypothetical proteins: Rv2542, Rv2077c, Rv2797c, Rv0963c, etc." /db_xref="UniProtKB/TrEMBL:A0A1R3Y079" /protein_id="SIU00587.1" /translation="MRQASGLAREGAGTIGAAQRRVIYAVQDAHNAGFNVEEDLSVTD TRTSRTFAEQAARQAQAQALAGDIRQRATQLIGVEHEVAAKIATATAPLNTVGFHEPP IAPSLPTPVPHNEKPQIHAVDRSWKQDPPSPMPGDPKDMTAVQARAAWDAVNADIARY NARCGRTFVLPNEQAAYDACIADKGSLLERQAAIRARLGELGVPVEGEPPPAPDPAGP QPNEGLPPPGVSPPAESNLTVGPPSRPIQQARGGESLWDENGGEWRYFPGDNYRYPHW DYNPHDSPTARWQNIPIGDLPTHK" CDS complement(2196042..2196233) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1985C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1985c, -, len: 63 aa. Equivalent to Rv1950c, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Conserved hypothetical protein, partial ORF. Highly similar to N-terminus of Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kd protein (323 aa), FASTA scores: opt: 280, E(): 1.2 e-16, (71.7% identity in 53 aa overlap) but homology continues in different frame ie MTCY09F9.15, cosmid sequence appears to be correct, genomic sequence is also frameshifted in Mycobacterium bovis strain AF2122/97. Mb1985c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZT4" /protein_id="SIU00588.1" /translation="MLPTLSHIHAWDTEHLIEAAYYWTKVADQWEDVFLEMRNRSHFI AWEGAGGDGCDSEPALTYR" CDS complement(2196234..2196530) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1986C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb1986c, -, len: 98 aa. Equivalent to Rv1951c, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 98 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv2541 (135 aa) (40.9% identity in 88 aa overlap). Mb1986c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZX0" /protein_id="SIU00589.1" /translation="MKAGELRVNIQQVAATASQWSGRSTELSVLAPPPLGQPFQPTTA AVGGAHAAVGLAVAAFTARTHATASAVEAAAAEYANNEAAAAAEMAAVPQTRLV" CDS 2196770..2196985 /codon_start=1 /transl_table=11 /gene="vapb14" /locus_tag="BQ2027_MB1987" /product="possible antitoxin vapb14" /note="Mb1987, -, len: 71 aa. Equivalent to Rv1952, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Conserved hypothetical protein. Some similarity to P55510|Y4JJ_RHISN PUTATIVE PLASMID STABILITY PROTEIN (85 aa), FASTA scores: opt: 127, E(): 0.00096, (42.5% identity in 73 aa overlap). Mb1987 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZT8" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR013321" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT8" /protein_id="SIU00590.1" /translation="MIRNLPEGTKAALRVRAARHHHSVEAEARAILTAGLLGEEVPMP VLLAADSGHDIDFEPERLGLIARTPQL" CDS 2196982..2197293 /codon_start=1 /transl_table=11 /gene="vapc14" /locus_tag="BQ2027_MB1988" /product="possible toxin vapc14" /note="Mb1988, -, len: 103 aa. Equivalent to Rv1953, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Conserved hypothetical protein. Some similarity to O33827 PLASMID STABILITY-LIKE PROTEIN from Thiobacillus ferrooxidans (143 aa), FASTA scores: opt: 170, E(): 3.5e-06, (45.3% identity in 75 aa overlap). Mb1988 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT3" /protein_id="SIU00591.1" /translation="MTYVLDTNVVSALRVPGRHPAVAAWADSVQVAEQFVVAITLAEI ERGVIAKERTDPTQSEHLRRWFDDKVLRIFVFARRGTNLIMQPLAGHIGYSLYSGISW F" CDS complement(2197267..2197788) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1989C" /product="HYPOTHETICAL PROTEIN" /note="Mb1989c, -, len: 173 aa. Equivalent to Rv1954c, len: 173 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 173 aa overlap). Hypothetical unknown protein, end overlaps next ORF upstream, Rv1955 (MTCY09F9.09c)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZV7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00592.1" /translation="MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPP RRCDTHPDGTSSAAAALVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRR SRLTRGRSFTSHLITSCPRLDDHQHRHPTRCRAEHAGCTVATCIPNARDPAPGHQTPR WGPFRLKPAYTRI" CDS 2197321..2197623 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1988A" /product="Hypothetical protein" /note="Mb1988A, len: 100 aa. Equivalent to Rv1954A len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 100 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Hypothetical unknown protein. Protein product from Mb1988A detected using SWATH mass spectrometry. Mb1988A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZT2" /protein_id="SIU00593.1" /translation="MARGRVACIGDAGCDCTPGVFRATAGGMPVLVVIESGTGGDQMA RKATSPGKPAPTSGQYRPVGGGNEVTVPKGHRLPPSPKPGQKWVNVDPTKNKSGRG" CDS 2197628..2198140 /codon_start=1 /transl_table=11 /gene="higb" /locus_tag="BQ2027_MB1990" /product="possible toxin higb" /note="Mb1990, -, len: 170 aa. Equivalent to Rv1955, len: 170 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 170 aa overlap). Hypothetical unknown protein, start overlaps another ORF, Rv1954c (MTCY09F9.10). Mb1990 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009241" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00594.1" /translation="MPSGWVSHRLGGSPKCISALSLPSGTVGAPSKPDNDATRGRTRP TVPPPDPAAMGTWKFFRASVDGRPVFKKEFDKLPDQARAALIVLMQRYLVGDLAAGSI KPIRGDILELRWHEANNHFRVLFFRWGQHPVALTAFYKNQQKTPKTKIETALDRQKIW KRAFGDTPPI" CDS 2198182..2198631 /codon_start=1 /transl_table=11 /gene="higa" /locus_tag="BQ2027_MB1991" /product="possible antitoxin higa" /note="Mb1991, -, len: 149 aa. Equivalent to Rv1956, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 149 aa overlap). Possible transcriptional regulatory protein, contains probable helix-turn-helix motif at aa 52-73 (+4.78 SD). Protein product from Mb1991 detected using SWATH mass spectrometry. Mb1991 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0L8" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010982" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0L8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00595.1" /translation="MSIDFPLGDDLAGYIAEAIAADPSFKGTLEDAEEARRLVDALIA LRKHCQLSQVEVAKRMGVRQPTVSGFEKEPSDPKLSTLQRYARALDARLRLVLEVPTL REVPTWHRLSSYRGSARDHQVRVGADKEILMQTNWARHISVRQVEVA" CDS 2198628..2199173 /codon_start=1 /transl_table=11 /gene="secBL" /locus_tag="BQ2027_MB1992" /product="SecB-like chaperone" /note="Mb1992, -, len: 181 aa. Equivalent to Rv1957, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Hypothetical unknown protein. Protein product from Mb1992 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb1992 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR035958" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV0" /protein_id="SIU00596.1" /translation="MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPA QGLTYDLEFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATAD FEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLE ILSRPMPVSPGAQWPATRGTP" CDS complement(2199062..2199676) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1993C" /product="HYPOTHETICAL PROTEIN" /note="Mb1993c, -, len: 204 aa. Equivalent to Rv1958c, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Hypothetical unknown protein. Mb1993c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y088" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00597.1" /translation="MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNP RRLSMNPGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVF ENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVG QLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARRPVRS" CDS complement(2199725..2200021) /codon_start=1 /transl_table=11 /gene="pare1" /locus_tag="BQ2027_MB1994C" /product="possible toxin pare1" /note="Mb1994c, -, len: 98 aa. Equivalent to Rv1959c, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 98 aa overlap). Conserved hypothetical protein, similar to other hypothetical plasmid proteins e.g. AL117189|YPCD1.08 from Yersinia pestis (99 aa), FASTA scores: opt: 162, E(): 7.3e-05, (33.0% identity in 91 aa overlap); also some similarity to E145339 hypothetical protein (103 aa), FASTA scores: opt: 142, E(): 0.0003, (33.0% identity in 91 aa overlap). Protein product from Mb1994c detected using SWATH mass spectrometry. Mb1994c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007712" /db_xref="InterPro:IPR028344" /db_xref="InterPro:IPR035093" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU3" /protein_id="SIU00598.1" /translation="MSSRYLLSPAAQAHLEEIWDCTYDRWGVDQAEQYLRELQHAIDR AAANPRIGRACDEIRPGYRKLSAGSHTLFYRVTGEGTIDVVRVLHQRMDVDRNL" CDS complement(2200018..2200269) /codon_start=1 /transl_table=11 /gene="pard1" /locus_tag="BQ2027_MB1995C" /product="possible antitoxin pard1" /note="Mb1995c, -, len: 83 aa. Equivalent to Rv1960c, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Conserved hypothetical protein, similar to O85269|AF102990|AF102990_51 hypothetical protein of Yersinia enterocolitica (80 aa), FASTA scores: opt: 149, E(): 0.00037, (42 .1% identity in 57 aa overlap). Mb1995c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67299" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR022789" /db_xref="InterPro:IPR038296" /db_xref="UniProtKB/Swiss-Prot:P67299" /protein_id="SIU00599.1" /translation="MGKNTSFVLDEHYSAFIDGEIAAGRYRSASEVIRSALRLLEDRE TQLRALREALEAGERSGSSTPFDFDGFLGRKRADASRGR" CDS 2200256..2200750 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB1996" /product="Helicase, C-terminal" /note="Mb1996, -, len: 164 aa. Equivalent to Rv1961, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 164 aa overlap). Hypothetical unknown protein. Mb1996 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3XZW1" /protein_id="SIU00600.1" /translation="MFLPTNAQYQLLVVGVSPWDTPSPSGRISWGSAWPHQARRAQTC QRVRRHWMIDTTEAAYRLTYQPDGTSITVRENLVDILARELLGPIRGPQEVLPFSPRS QYLVGHLAPVKLTGAALIDDNAVQARANAEALAEGGGVPAYAADETTPTPTTTPKTAH PSRA" CDS complement(2200910..2201317) /codon_start=1 /transl_table=11 /gene="vapc35" /locus_tag="BQ2027_MB1997C" /product="possible toxin vapc35. contains pin domain." /note="Mb1997c, -, len: 135 aa. Equivalent to Rv1962c, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv3408|MTCY78.20c (133 aa) (36.2% identity in 138 aa overlap); and Rv3384c (130 aa) (43.1% identity in 130 aa overlap) Protein product from Mb1997c detected using SWATH mass spectrometry. Mb1997c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZT9" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3XZT9" /protein_id="SIU00601.1" /translation="MIYLETSALVKLIRIEVESDALADWLDDRTELRWITSALTEVEL SRAIRAVSPEGLPAVPSVLARLDRFEIDAVIRSTAAAYPNPALRSLDAIHLATAQTAG SVAPLTALVTYDNRLKEAAEALSLAVVAPGQAR" CDS complement(2201321..2201593) /codon_start=1 /transl_table=11 /gene="vapB35" /locus_tag="BQ2027_MB1997A" /product="Possible antitoxin VapB35" /note="Mb1997A, len: 90 aa. Equivalent to Rv1962A len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB35, antitoxin,part of toxin-antitoxin (TA) operon with Rv1962c, see Arcus et al. 2005. Similar to others in M. tuberculosis e.g. Rv3385c, Rv3407, Rv0626,Mb1997A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW6" /protein_id="SIU00602.1" /translation="MNEVSIRTLNQETSKVLARVKRGEEINLTERGKVIARIIPASAG PLDSLISTGSVQPARVHGPAPRPTIPMRGGLDSGTLLERMRAEERY" CDS complement(2201626..2202846) /codon_start=1 /transl_table=11 /gene="mce3r" /locus_tag="BQ2027_MB1998C" /product="probable transcriptional repressor (probably tetr-family) mce3r" /note="Mb1998c, -, len: 406 aa. Equivalent to Rv1963c, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Probable transcriptional regulatory protein, similar to several e.g. AL049485|SC6A5.30 Streptomyces coelicolor cosmid 6 A (404 aa), FASTA scores: opt: 319, E(): 6.4e-13, (29.5% identity in 373 aa overlap); and Z84498|MTCY9F9_1 (259 aa), FASTA scores: opt: 208, E(): 1.6e-07, (100.0% identity in 32 aa overlap). Contains probable helix-turn-helix at aa 36-57 (+4.23 SD). Protein product from Mb1998c detected using SWATH mass spectrometry. Mb1998c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZU1" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU1" /protein_id="SIU00603.1" /translation="MASVAQPVRRRPKDRKKQILDQAVGLFIERGFHSVKLEDIAEAA GVTARALYRHYDNKQALLAEAIRTGQDQYQSARRLTEGETEPTPRPLNADLEDLIAAA VASRALTVLWQREARYLNEDDRTAVRRRINAIVAGMRDSVLLEVPDLSPQHSELRAWA VSSTLTSLGRHSLSLPGEELKKLLYQACMAAARTPPVCELPPLPAGDAARDEADVLFS RYETLLAAGARLFRAQGYPAVNTSEIGKGAGIAGPGLYRSFSSKQAILDALIRRLDEW RCLECIRALRANQQAAQRLRGLVQGHVRISLDAPDLVAVSVTELSHASVEVRDGYLRN QGDREAVWIDLIGKLVPATSVAQGRLLVAAAISFIEDVARTWHLTRYAGVADEISGLA LAILTSGAGNLLRA" CDS 2203745..2204149 /codon_start=1 /transl_table=11 /gene="yrbE3A" /locus_tag="BQ2027_MB1999" /product="CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE3A" /note="Mb1999, yrbE3A, len: 134 aa. Similar to 5' end of Rv1964, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). yrbE3A, hypothetical unknown integral membrane protein, part of mce3 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa), O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa), Rv3501c|MTV023.08c|yrbE4A (254 aa), etc. Also highly similar to conserved hypothetical integral membrane proteins of yrbEA type, e.g. AAD24544.1|AF116213|YrbE1A from Mycobacterium leprae (112 aa); P45392|YRBE_ECOLI from Escherichia coli (260 aa), FASTA scores: opt: 893, E(): 0, (51.4% identity in 253 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large deletion of 12719 bp (RD7) leads to the loss of the COOH part of yrbE3A, the entire mce3 operon and the following genes up to Mb2000 compared to Mycobacterium tuberculosis strain H37Rv. Mb1999 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1Y6" /db_xref="InterPro:IPR030802" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y6" /protein_id="SIU00604.1" /translation="MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAW REYLLQCWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVNQS RGTGSLERGRFIGPQDHRVAAALEVTAPLLRS" CDS 2204234..2205082 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2000" /product="SAM-dependent methyltransferases" /note="Mb2000, -, len: 282 aa. Equivalent to Rv1978, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 282 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins and methyltransferases e.g. X86780|SHGCPIR.15 methyltransferase from S. hygroscopicus (211 aa), FASTA scores: opt: 151, E(): 0.0072, (30.6% identity in 121 aa overlap). Protein product from Mb2000 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2000 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0M7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00605.1" /translation="MGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDRDDVPDE VKQKIIGVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAGHGKLSAKILELHPT ATVTISDLDPTSVANIAAGELGTHPRARTQVIDATAIDGHGHSYDLAVFALAFHHLPP TVACKAIAEATRVGKRFLIIDLKRQKPLSFTLSSVLLLPLHLLLLPWSSMRSSMHDGF ISALRAYSPSALQTLARAADPGMQVEILPAPTRLFPPSLAVVFSRSSSAPTESSECSA DRQPGE" CDS complement(2205045..2206490) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2001C" /product="POSSIBLE CONSERVED PERMEASE" /note="Mb2001c, -, len: 481 aa. Equivalent to Rv1979c, len: 481 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 480 aa overlap). Possible permease, APC family possibly involved in transport of amino acid, showing some similarity to other permeases. Also similar to MTCY39.19 from Mycobacterium tuberculosis (28.2% identity in 277 aa overlap). Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site. Mb2001c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TZ67" /db_xref="InterPro:IPR002293" /db_xref="UniProtKB/Swiss-Prot:Q7TZ67" /protein_id="SIU00606.1" /translation="MVGPRTRGYAIHKLGFCSVVMLGINSIIGAGIFLTPGEVIGLAG PFAPMAYVLAGIFAGVVAIVFATAARYVRTNGASYAYTTAAFGRRIGIYVGVTHAITA SIAWGVLASFFVSTLLRVAFPDKAWADAEQLFSVKTLTFLGFIGVLLAINLFGNRAIK WANGTSTVGKAFALSAFIVGGLWIITTQHVNNYATAWSAYSATPYSLLGVAEIGKGTF SSMALATIVALYAFTGFESIANAAEEMDAPDRNLPRAIPIAIFSVGAIYLLTLTVAML LGSNKIAASGDTVKLAAAIGNATFRTIIVVGALISMFGINVAASFGAPRLWTALADSG VLPTRLSRKNQYDVPMVSFAITASLALAFPLALRFDNLHLTGLAVIARFVQFIIVPIA LIALARSQAVEHAAVRRNAFTDKVLPLVAIVVSVGLAVSYDYRCIFLVRGGPNYFSIA LIVITFIVVPAMAYLHYYRIIRRVGDRPSTR" CDS complement(2206669..2207355) /codon_start=1 /transl_table=11 /gene="mpt64" /locus_tag="BQ2027_MB2002C" /standard_name="mpb64" /product="IMMUNOGENIC PROTEIN MPT64 (ANTIGEN MPT64/MPB64)" /note="Mb2002c, -, len: 228 aa. Equivalent to Rv1980c, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). mpt64 (alternate gene name: mpb64), immunogenic protein (alternate gene name: mpb64) (see citations below), identical to MPT64|MPB64 from Mycobacterium bovis (228 aa). Similar to Rv3036c|MTV012.51c from Mycobacterium tuberculosis. Exported protein containing a N-terminal signal sequence: see notes below about proteomics. Protein product from Mb2002c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2002c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Q5" /db_xref="InterPro:IPR021729" /db_xref="InterPro:IPR037126" /db_xref="UniProtKB/Swiss-Prot:P0A5Q5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00607.1" /translation="MRIKIFMLVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQ MSDPAYNINISLPSYYPDQKSLENYIAQTRDKFLSAATSSTPREAPYELNITSATYQS AIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQADTDPLPVVFP IVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPGELLPEAAGPTQVLV PRSAIDSMLA" CDS complement(2207546..2208514) /codon_start=1 /transl_table=11 /gene="nrdF1" /locus_tag="BQ2027_MB2003C" /standard_name="nrdF" /product="RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE (BETA CHAIN) NRDF1 (RIBONUCLEOTIDE REDUCTASE SMALL SUBUNIT) (R2F PROTEIN)" /note="Mb2003c, nrdF1, len: 322 aa. Equivalent to Rv1981c, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 322 aa overlap). nrdF1, ribonucleoside-diphosphate reductase, beta chain (EC 1.17.4.1) (see citation below), highly similar to others e.g. RIR4_SALTY|P17424 ribonucleoside-diphosphate reductase (319 aa), FASTA scores: opt: 1402, E(): 0, (66.0% identity in 315 aa overlap); etc. Also similar to Rv3048c|MTV012.63c from Mycobacterium tuberculosis. Contains PS00368 Ribonucleotide reductase small subunit signature. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE SMALL CHAIN FAMILY. COFACTOR: BINDS 2 IRON IONS (BY SIMILARITY). Note that previously known as nrdF. Protein product from Mb2003c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2003c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZV2" /db_xref="InterPro:IPR000358" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012348" /db_xref="InterPro:IPR026494" /db_xref="InterPro:IPR030475" /db_xref="InterPro:IPR033909" /db_xref="UniProtKB/TrEMBL:A0A1R3XZV2" /protein_id="SIU00608.1" /translation="MTGKRVERVHAINWNRLLDAKDLQVWERLTGNFWLPEKIPLSND LASWQTLSSTEQQTTIRVFTGLTLLDTAQATVGAVAMIDDAVTPHEEAVLTNMAFMES VHAKSYSSIFSTLCSTKQIDDAFDWSEQNPYLQRKAQIIVDYYRGDDALKRKASSVML ESFLFYSGFYLPMYWSSRGKLTNTADLIRLIIRDEAVHGYYIGYKCQRGLADLTDAER ADHREYTCELLHTLYANEIDYAHDLYDELGWTDDVLPYMRYNANKALANLGYQPAFDR DTCQVNPAVRAALDPGAGENHDFFSGSGSSYVMGTHQPTTDTDWDF" CDS complement(2208739..2209158) /codon_start=1 /transl_table=11 /gene="vapc36" /locus_tag="BQ2027_MB2004C" /product="possible toxin vapc36. contains pin domain." /note="Mb2004c, -, len: 139 aa. Equivalent to Rv1982c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein. BELONGS TO THE UPF0110 FAMILY. Similar to Rv0624|Z92772|MTY20H10.05 from Mycobacterium tuberculosis (131 aa), FASTA scores: opt: 288, E(): 4.1e-14, (40.2% identity in 127 aa overlap); also similar to Rv0624, Rv2759c, and Rv0609,Mb2004c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A653" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P0A653" /protein_id="SIU00609.1" /translation="MIVDTSAVVALVQGERPHATLVAAALAGAHSPVMSAPTVAECLI VLTARHGPVARTIFERLRSEIGLSVSSFTAEHAAATQRAFLRYGKGRHRAALNFGDCM TYATAQLGHQPLLAVGNDFPQTDLEFRGVVGYWPGVA" CDS complement(2209167..2209427) /codon_start=1 /transl_table=11 /gene="vapB36" /locus_tag="BQ2027_MB2004A" /product="Possible antitoxin VapB36" /note="Mb2004A, len: 86 aa. Equivalent to Rv1982A len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB36, antitoxin,part of toxin-antitoxin (TA) operon with Rv1982c, see Arcus et al. 2005. Similar to others in Mycobacterium tuberculosis e.g. Rv0623, Rv2760c, Rv0608 Protein product from Mb2004A detected using SWATH mass spectrometry. Mb2004A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011660" /db_xref="UniProtKB/TrEMBL:A0A1R3XZX5" /protein_id="SIU00610.1" /translation="MALNIKDPEVDRLAAELADRLHTSKTAAIRHALSAQLAFLESRA GDREAQLLDILRTEIWPLLADRSPITKLEREQILGYDPATGV" CDS 2209570..2211246 /codon_start=1 /transl_table=11 /gene="PE_PGRS35" /locus_tag="BQ2027_MB2005" /product="pe-pgrs family protein pe_pgrs35" /note="Mb2005, PE_PGRS35, len: 558 aa. Equivalent to Rv1983, len: 558 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 558 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to other PE proteins e.g. Rv0977, etc. Contains PS00141 Eukaryotic and viral aspartyl proteases active site. Mb2005 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR021109" /db_xref="UniProtKB/TrEMBL:A0A1R3XZU6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00611.1" /translation="MSFLVVVPEFLTSAAADVENIGSTLRAANAAAAASTTALAAAGA DEVSAAVAALFARFGQEYQAVSAQASAFHQQFVQTLNSASGSYAAAEATIASQLQTAQ HDLLGAVNAPTETLLGRPLIGDGAPGTATSPNGGAGGLLYGNGGNGYSATASGVGGGA GGSAGLIGNGGAGGAGGPNAPGGAGGNGGWLLGNGGIGGPGGASSIPGMSGGAGGTGG ATGLLGWGANGGAGGLGDGVGVDRGTGGAGGRGGLLYGGYGVSGPGGDGRTVPLEIIH VTEPTVHANVNGGPTSTILVDTGSAGLVVSPEDVGGILGVLHMGLPTGLSISGYSGGL YYIFATYTTTVDFGNGIVTAPTAVNVVLLSIPTSPFAISTYFSALLADPTTTPFEAYF GAVGVDGVLGVGPNAVGPGPSIPTMALPGDLNQGVLIDAPAGELVFGPNPLPAPNVEV VGSPITTLYVKIDGGTPIPVPSIIDSGGVTGTIPSYVIGSGTLPANTNIEVYTSPGGD RLYAFNTNDYRPTVISSGLMNTGFLPFRFQPVYIDYSPSGIGTTVFDHPA" CDS complement(2211234..2211887) /codon_start=1 /transl_table=11 /gene="cfp21" /locus_tag="BQ2027_MB2006C" /product="PROBABLE CUTINASE PRECURSOR CFP21" /note="Mb2006c, cfp21, len: 217 aa. Equivalent to Rv1984c, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). cfp21, probable cutinase precursor with N-terminal signal sequence (EC 3.1.1.-), similar to P41744|CUTI_ALTBR cutinase precursor from Alternaria brassicicola (209 aa), FASTA scores: opt: 283, E(): 2.2e-11, (32.6% identity in 193 aa overlap). Also similar to Mycobacterium tuberculosis proteins e.g. Rv3452, Rv3451, Rv2301, Rv1758, Rv3724. BELONGS TO THE CUTINASE FAMILY. Protein product from Mb2006c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2006c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63880" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR011150" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P63880" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00612.1" /translation="MTPRSLVRIVGVVVATTLALVSAPAGGRAAHADPCSDIAVVFAR GTHQASGLGDVGEAFVDSLTSQVGGRSIGVYAVNYPASDDYRASASNGSDDASAHIQR TVASCPNTRIVLGGYSQGATVIDLSTSAMPPAVADHVAAVALFGEPSSGFSSMLWGGG SLPTIGPLYSSKTINLCAPDDPICTGGGNIMAHVSYVQSGMTSQAATFAANRLDHAG" CDS complement(2212317..2213228) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2007C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LYSR-FAMILY)" /note="Mb2007c, -, len: 303 aa. Equivalent to Rv1985c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 303 aa overlap). Probable transcriptional regulatory protein, LysR family member. Similar to many regulatory proteins, especially ICIA_ECOLI|P24194 chromosome initiation inhibitor from Escherichia coli (297 aa), FASTA scores: opt: 520, E(): 1.1e-28, (35.8% identity in 285 aa overlap); and P94632|LYSG_CORGL LYSINE EXPORT REGULATOR PROTEIN (290 aa), FASTA scores: opt: 705, E(): 0, (42.7% identity in 288 aa overlap); etc. Contains PS00044 Bacterial regulatory proteins, lysR family signature. Also contains helix-turn-helix motif at aa 22-43,(+5.52 SD). BELONGS TO THE LYSR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb2007c detected using SWATH mass spectrometry. Mb2007c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67666" /db_xref="InterPro:IPR000847" /db_xref="InterPro:IPR005119" /db_xref="InterPro:IPR017685" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67666" /protein_id="SIU00613.1" /translation="MVDPQLDGPQLAALAAVVELGSFDAAAERLHVTPSAVSQRIKSL EQQVGQVLVVREKPCRATTAGIPLLRLAAQTALLESEALAEMGGNASLKRTRITIAVN ADSMATWFSAVFDGLGDVLLDVRIEDQDHSARLLREGVAMGAVTTERNPVPGCRVHPL GEMRYLPVASRPFVQRHLSDGFTAAAAAKAPSLAWNRDDGLQDMLVRKAFRRAITRPT HFVPTTEGFTAAARAGLGWGMFPEKLAASPLADGSFVRVCDIHLDVPLYWQCWKLDSP IIARITDTVRAAASGLYRGQQRRRRPG" CDS 2213337..2213936 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2008" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2008, -, len: 199 aa. Equivalent to Rv1986, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Probable conserved integral membrane protein, LysE family possibly involved in transport of Lysine, similar to P11667|YGGA_ECOLI hypothetical 23.2 kd protein in sbm-fba intergenic region (211 aa), FASTA scores: opt: 379, E(): 1.5e-19, (37.3% identity in 185 aa overlap); and Q11154|Rv0488 HYPOTHETICAL 20.9 KD PROTEIN from Mycobacterium tuberculosis (201 aa), FASTA scores: opt: 784, E(): 0, (63.4% identity in 186 aa overlap). BELONGS TO THE LYSE/YGGA FAMILY. Mb2008 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64904" /db_xref="InterPro:IPR001123" /db_xref="InterPro:IPR004777" /db_xref="UniProtKB/Swiss-Prot:P64904" /protein_id="SIU00614.1" /translation="MNSPLVVGFLACFTLIAAIGAQNAFVLRQGIQREHVLPVVALCT VSDIVLIAAGIAGFGALIGAHPRALNVVKFGGAAFLIGYGLLAARRAWRPVALIPSGA TPVRLAEVLVTCAAFTFLNPHVYLDTVVLLGALANEHSDQRWLFGLGAVTASAVWFAT LGFGAGRLRGLFTNPGSWRILDGLIAVMMVALGISLTVT" CDS 2214352..2214780 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2009" /product="POSSIBLE CHITINASE" /note="Mb2009, -, len: 142 aa. Equivalent to Rv1987, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Possible chitinase (EC 3.2.1.14), similar to several e.g. P36909|CHIT_STRLI chitinase c precursor (619 aa) FASTA scores, opt: 324, E(): 1.2e-14, (39.5% identity in 129 aa overlap). Protein product from Mb2009 detected using SWATH mass spectrometry. Mb2009 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64906" /db_xref="InterPro:IPR001919" /db_xref="InterPro:IPR008965" /db_xref="InterPro:IPR012291" /db_xref="UniProtKB/Swiss-Prot:P64906" /protein_id="SIU00615.1" /translation="MAGLNIYVRRWRTALHATVSALIVAILGLAITPVASAATARATL SVTSTWQTGFIARFTITNSSTAPLTDWKLEFDLPAGESVLHTWNSTVARSGTHYVLSP ANWNRIIAPGGSATGGLRGGLTGSYSPPSSCLLNGQYPCT" CDS 2215006..2215545 /codon_start=1 /transl_table=11 /gene="erm(37)" /locus_tag="BQ2027_MB2010" /product="probable 23s rrna methyltransferase erm(37)" /note="Mb2010, -, len: 179 aa. Equivalent to Rv1988, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 179 aa overlap). Probable methyltransferase (EC 2.1.1.-), similar to ERME_SACER|P07287 rrna adenine n-6-methyltransferase (370 aa), FASTA scores: opt: 259, E(): 2e-11, (35.1% identity in 171 aa overlap); contains PS00092 N-6 Adenine-specific DNA methylases signature. Also similar to Mycobacterium tuberculosis Rv1010 ksgA 16S rRNA dimethyltransferase. Mb2010 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZX2" /db_xref="InterPro:IPR001737" /db_xref="InterPro:IPR020598" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3XZX2" /protein_id="SIU00616.1" /translation="MSALGRSRRAWGWHRLHDEWAARVVSAAAVRPGELVFDIGAGEG ALTAHLVRAGARVVAVELHPRRVGVLRERFPGITVVHADAASIRLPGRPFRVVANPPY GISSRLLRTLLAPNSGLVAADLVLQRALVCKFASRNARRFTLTVGLMLPRRAFLPPPH VDSAVLVVRRRKCGDWQGR" CDS complement(2216065..2216625) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2011C" /product="HYPOTHETICAL PROTEIN" /note="Mb2011c, -, len: 186 aa. Equivalent to Rv1989c, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 186 aa overlap). Hypothetical unknown protein. Protein product from Mb2011c detected using SWATH mass spectrometry. Mb2011c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64908" /db_xref="InterPro:IPR014914" /db_xref="UniProtKB/Swiss-Prot:P64908" /protein_id="SIU00617.1" /translation="MSDALDEGLVQRIDARGTIEWSETCYRYTGAHRDALSGEGARRF GGRWNPPLLFPAIYLADSAQACMVEVERAAQAASTTAEKMLEAAYRLHTIDVTDLAVL DLTTPQAREAVGLENDDIYGDDWSGCQAVGHAAWFLHMQGVLVPAAGGVGLVVTAYEQ RTRPGQLQLRQSVDLTPALYQELRAT" CDS complement(2216622..2216963) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2012C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb2012c, -, len: 113 aa. Equivalent to Rv1990c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 113 aa overlap). Probable transcriptional regulatory protein, similar to Mycobacterium tuberculosis Rv3188|AL021646|MTV014.32 (115 aa), FASTA scores: opt: 184, E(): 8.2e-07, (28.4% identity in 109 aa overlap). Contains probable helix-turn-helix motif at aa 20-44 (+4.22 SD). Protein product from Mb2012c detected using SWATH mass spectrometry. Mb2012c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024467" /db_xref="UniProtKB/Swiss-Prot:P64910" /protein_id="SIU00618.1" /translation="MGVNVLASTVSGAIERLGLTYEEVGDIVDASPRSVARWTAGQVV PQRLNKQRLIELAYVADALAEVLPRDQANVWMFSPNRLLEHRKPADLVRDGEYQRVLA LIDAMAEGVFV" CDS complement(2217207..2217542) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2013C" /product="POSSIBLE DEHYDROGENASE (FRAGMENT)" /note="Mb2013c, -, len: 111 aa. Equivalent to Rv1990A, len: 111 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 111 aa overlap). Possible dehydrogenase (fragment) (EC 1.-.-.-), similar to N-terminal part of several dehydrogenases and hypothetical proteins, e.g. Rv2750|MTV002.15|AL008967 from Mycobacterium tuberculosis (272 aa), FASTA scores: opt: 151, E(): 0.0045, (47.45% identity in 78 aa overlap), but lacks C-terminal part. Maybe a pseudogene. Also similar to U17129|RSU17129_7 putative short-chain alcohol dehydrogenase from Rhodococcus erythropolis (275 aa), FASTA scores: opt: 142, E(): 0.018, (54.15% identity in 48 aa overlap). Mb2013c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y001" /protein_id="SIU00619.1" /translation="MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGA LVGEVEVWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAGCA ARRSAAGSQ" CDS complement(2217631..2217975) /codon_start=1 /transl_table=11 /gene="mazf6" /locus_tag="BQ2027_MB2014C" /product="toxin mazf6" /note="Mb2014c, -, len: 114 aa. Equivalent to Rv1991c, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Conserved hypothetical protein, showing some similarity to P13976|PEMK_ECOLI pemk protein (133 aa), FASTA scores: opt: 113, E(): 0.043, (29.2% identity in 113 aa overlap); and P96622|YDCE PROTEIN from Bacillus subtilis (116 aa), FASTA scores: opt: 227, E(): 6.9e-09, (37.4% identity in 115 aa overlap). Also similar to Mycobacterium tuberculosis Rv2801c, and Rv0659c. Mb2014c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64912" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/Swiss-Prot:P64912" /protein_id="SIU00620.1" /translation="MVISRAEIYWADLGPPSGSQPAKRRPVLVIQSDPYNASRLATVI AAVITSNTALAAMPGNVFLPATTTRLPRDSVVNVTAIVTLNKTDLTDRVGEVPASLMH EVDRGLRRVLDL" CDS complement(2217969..2218217) /codon_start=1 /transl_table=11 /gene="mazE6" /locus_tag="BQ2027_MB2014A" /product="Antitoxin MazE6" /note="Mb2014A, len: 82 aa. Equivalent to Rv1991A len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). MazE6, antitoxin, part of toxin-antitoxin (TA) operon with Rv1991c. Similar to ChpI of L. interrogans, FASTA scores: opt: 134, E(): 0.024,29.762% identity (65.476% similar) in 84 aa overlap. Note that Pandey and Gerdes, 2005 predicts a different N-terminus, adding 10 amino acids. Mb2014A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0CL58" /db_xref="InterPro:IPR002145" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/Swiss-Prot:P0CL58" /protein_id="SIU00621.1" /translation="MKTAISLPDETFDRVSRRASELGMSRSEFFTKAAQRYLHELDAQ LLTGQIDRALESIHGTDEAEALAVANAYRVLETMDDEW" CDS complement(2218317..2220632) /codon_start=1 /transl_table=11 /gene="ctpG" /locus_tag="BQ2027_MB2015C" /product="PROBABLE METAL CATION TRANSPORTER P-TYPE ATPASE G CTPG" /note="Mb2015c, ctpG, len: 771 aa. Equivalent to Rv1992c, len: 771 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 771 aa overlap). Probable ctpG, metal cation-transporting P-type ATPase G (transmembrane protein) (EC 3.6.3.-), similar to others, especially cadmium-transporting ATPases (EC 3.6.3.3), e.g. NP_244904.1|NC_002570 cadmium-transporting ATPase from Bacillus halodurans (707 aa); P30336|CADA_BACFI PROBABLE CADMIUM-TRANSPORTING ATPASE from Bacillus firmus (723 aa); BAB47609.1|AB037671 cadmium resistance protein B from Staphylococcus aureus (804 aa); 3121832|Q60048|CADA_LISMO PROBABLE CADMIUM-TRANSPORTING ATPase from Listeria monocytogenes (707 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0969|MTCY10D7.05c|ctpV PUTATIVE CATION TRANSPORTER P-TYPE ATPASE V (770 aa); Rv1469; Rv0092; etc. Contains PS00435 Peroxidases proximal heme-ligand signature and PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Protein product from Mb2015c detected using SWATH mass spectrometry. Mb2015c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63690" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR027256" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P63690" /protein_id="SIU00622.1" /translation="MTTVVDAEVQLTVVSDAAGRMRVQATGFQFDAGRAVAIEDTVGK VAGVQAVHAYPRTASIVIWYSRAICDTAAILSAIIDAETVPAAAVPAYASRSASNRKA GVVQKIIDWSTRTLSGVRRDVAAQPSGETSDACCDGEDNEDREPEQLWQVAKLRRAAF SGVLLTASLVAAWAYPLWPVVLGLKALALAVGASTFVPSSLKRLAEGRVGVGTLMTIA ALGAVALGELGEAATLAFLFSISEGLEEYATARTRRGLRALLSLVPDQATVLREGTET IVASTELHVGDQMIVKPGERLATDGIIRAGRTALDVSAITGESVPVEVGPGDEVFAGS INGLGVLQVGVTATAANNSLARIVHIVEAEQVRKGASQRLADCIARPLVPSIMIAAAL IAGTGSVLGNPLVWIERALVVLVAAAPCALAIAVPVTVVASIGAASRLGVLIKGGAAL ETLGTIRAVALDKTGTLTANRPVVIDVATTNGATREEVLAVAAALEARSEHPLAVAVL AATQATTAASDVQAVPGAGLIGRLDGRVVRLGRPGWLDAAELADHVACMQQAGATAVL VERDQQLLGAIAVRDELRPEAAEVVAGLRTGGYQVTMLTGDNHATAAALAAQAGIEQV HAELRPEDKAHLVAQLRARQPTAMVGDGVNDAPALAAADLGIAMGAMGTDVAIETADV ALMGQDLRHLPQALDHARRSRQIMVQNVGLSLSIITVLMPLALFGILGLAAVVLVHEF TEVIVIANGVRAGRIKPLAGPPKTPDRTIPG" CDS complement(2220629..2220901) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2016C" /product="conserved protein" /note="Mb2016c, -, len: 90 aa. Equivalent to Rv1993c, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Conserved hypothetical protein, very similar to Rv3269|Z92771|MTCY71.09 hypothetical protein from Mycobacterium tuberculosis (93 aa), FASTA results: opt: 309, E(): 3.2e-16, (63.3% identity in 79 aa overlap). Also similar to Rv0968 (98 aa) (51.1% identity in 94 aa overlap). Protein product from Mb2016c detected using SWATH mass spectrometry. Mb2016c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009963" /db_xref="UniProtKB/Swiss-Prot:P64914" /protein_id="SIU00623.1" /translation="MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVM EWGLRGTRRAEAAAESARLTVADVVAEARGRIGEEAPLPAGARVDE" CDS complement(2220954..2221310) /codon_start=1 /transl_table=11 /gene="cmtr" /locus_tag="BQ2027_MB2017C" /product="metal sensor transcriptional regulator cmtr (arsr-smtb family)" /note="Mb2017c, -, len: 118 aa. Equivalent to Rv1994c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 118 aa overlap). Probable transcription regulator, similar to MERR_STRLI|P30346 probable mercury resistance operon repressor (125 aa), FASTA scores: opt: 199, E(): 3e-08, (36.3% identity in 102 aa overlap). Contains probable helix-turn-helix motif at aa 36-57 (+3.78 SD). Mb2017c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67732" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67732" /protein_id="SIU00624.1" /translation="MLTCEMRESALARLGRALADPTRCRILVALLDGVCYPGQLAAHL GLTRSNVSNHLSCLRGCGLVVATYEGRQVRYALADSHLARALGELVQVVLAVDTDQPC VAERAASGEAVEMTGS" CDS 2221467..2222234 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2018" /product="Repair of Iron Centers di-iron protein" /note="Mb2018, -, len: 255 aa. Equivalent to Rv1995, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Hypothetical unknown protein. Protein product from Mb2018 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR012312" /db_xref="UniProtKB/Swiss-Prot:P64916" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00625.1" /translation="MVASGAATKGVTVMKQTPPAAVGRRHLLEISASAAGVIALSACS GSPPEPGKGRPDTTPEQEVPVTAPEDLMREHGVLKRILLIYREGIRRLQADDQSPAPA LNESAQIIRRFIEDYHGQLEEQYVFPKLEQAGKLTDITSVLRTQHQRGRVLTDRVLAA TTAAAAFDQPARDTLAQDMAAYIRMFEPHEAREDTVVFPALRDVMSAVEFRDMAETFE DEEHRRFGEAGFQSVVDKVADIEKSLGIYDLSQFTPS" CDS 2222330..2223283 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2019" /product="universal stress protein family protein" /note="Mb2019, -, len: 317 aa. Identical to Rv1996, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 317 aa overlap). Conserved hypothetical protein. Similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv2005c|Q10851|YK05_MYCTU (295 aa), FASTA scores: opt: 775, E(): 0, (50.3% identity in 316 aa overlap); Rv2026c (294 aa) (47.9% identity in 311 aa overlap); and Rv2623, etc. Also similar to SCJ1.30c|AL109962 hypothetical protein from Streptomyces coelicolor (328 aa). Protein product from Mb2019 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2019 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5F8" /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/Swiss-Prot:P0A5F8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00626.1" /translation="MSAQQTNLGIVVGVDGSPCSHTAVEWAARDAQMRNVALRVVQVV PPVITAPEGWAFEYSRFQEAQKREIVEHSYLVAQAHQIVEQAHKVALEASSSGRAAQI TGEVLHGQIVPTLANISRQVAMVVLGYRGQGAVAGALLGSVSSSLVRHAHGPVAVIPE EPRPARPPHAPVVVGIDGSPTSGLAAEIAFDEASRRGVDLVALHAWSDMGPLDFPRLN WAPIEWRNLEDEQEKMLARRLSGWQDRYPDVVVHKVVVCDRPAPRLLELAQTAQLVVV GSHGRGGFPGMHLGSVSRAVVNSGQAPVIVARIPQDPAVPA" CDS 2223485..2226202 /codon_start=1 /transl_table=11 /gene="ctpF" /locus_tag="BQ2027_MB2020" /product="PROBABLE METAL CATION TRANSPORTER P-TYPE ATPASE A CTPF" /note="Mb2020, ctpF, len: 905 aa. Equivalent to Rv1997, len: 905 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 905 aa overlap). Probable ctpF, metal cation-transporting P-type ATPase F (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. NP_250120.1|NC_002516 probable cation-transporting P-type ATPase from Pseudomonas aeruginosa (902 aa); NP_441217.1|NC_000911 cation-transporting ATPase (E1-E2 ATPase) from Synechocystis sp. strain PCC 6803 (905 aa); NP_404093.1|NC_003143 putative cation-transporting P-type ATPase from Yersinia pestis (908 aa); P37367|ATA1_SYNY3 cation-transporting ATPase pma1 from Synechocystis sp. (915 aa), FASTA scores: opt: 2392, E(): 0, (46.5% identity in 852 aa overlap); etc. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Was frame-shifted in original cosmid sequence. Protein product from Mb2020 detected using SWATH mass spectrometry. Mb2020 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63688" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR004014" /db_xref="InterPro:IPR006068" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P63688" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00627.1" /translation="MSASVSATTAHHGLPAHEVVLLLESDPYHGLSDGEAAQRLERFG PNTLAVVTRASLLARILRQFHHPLIYVLLVAGTITAGLKEFVDAAVIFGVVVINAIVG FIQESKAEAALQGLRSMVHTHAKVVREGHEHTMPSEELVPGDLVLLAAGDKVPADLRL VRQTGLSVNESALTGESTPVHKDEVALPEGTPVADRRNIAYSGTLVTAGHGAGIVVAT GAETELGEIHRLVGAAEVVATPLTAKLAWFSKFLTIAILGLAALTFGVGLLRRQDAVE TFTAAIALAVGAIPEGLPTAVTITLAIGMARMAKRRAVIRRLPAVETLGSTTVICADK TGTLTENQMTVQSIWTPHGEIRATGTGYAPDVLLCDTDDAPVPVNANAALRWSLLAGA CSNDAALVRDGTRWQIVGDPTEGAMLVVAAKAGFNPERLATTLPQVAAIPFSSERQYM ATLHRDGTDHVVLAKGAVERMLDLCGTEMGADGALRPLDRATVLRATEMLTSRGLRVL ATGMGAGAGTPDDFDENVIPGSLALTGLQAMSDPPRAAAASAVAACHSAGIAVKMITG DHAGTATAIATEVGLLDNTEPAAGSVLTGAELAALSADQYPEAVDTASVFARVSPEQK LRLVQALQARGHVVAMTGDGVNDAPALRQANIGVAMGRGGTEVAKDAADMVLTDDDFA TIEAAVEEGRGVFDNLTKFITWTLPTNLGEGLVILAAIAVGVALPILPTQILWINMTT AIALGLMLAFEPKEAGIMTRPPRDPDQPLLTGWLVRRTLLVSTLLVASAWWLFAWELD NGAGLHEARTAALNLFVVVEAFYLFSCRSLTRSAWRLGMFANRWIILGVSAQAIAQFA ITYLPAMNMVFDTAPIDIGVWVRIFAVATAITIVVATDTLLPRIRAQPP" CDS complement(2226271..2227047) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2021C" /product="PEP phosphonomutase and related enzymes" /note="Mb2021c, -, len: 258 aa. Equivalent to Rv1998c, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 258 aa overlap). Conserved hypothetical protein, showing some similarity with other hypothetical proteins e.g. U82823|SEU82823.03 Saccharopolyspora erythraea (266 aa), FASTA results: opt: 654, E(): 0, (43.8% identity in 249 aa overlap); and AL034446|SC1A9.07 Streptomyces coelicolor (251 aa), FASTA scores: opt: 592, E(): 1.5e-31, (43.4% identity in 251 aa overlap). Protein product from Mb2021c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2021c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3XZW5" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR039556" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00628.1" /translation="MSFHDLHHQGVPFVLPNAWDVPSALAYLAEGFTAIGTTSFGVSS SGGHPDGHRATRGANIALAAALAPLQCYVSVDIEDGYSDEPDAIADYVAQLSTAGINI EDSSAEKLIDPALAAAKIVAIKQRNPEVFVNARVDTYWLRQHADTTSTIQRALRYVDA GADGVFVPLANDPDELAELTRNIPCPVNTLPVPGLTIADLGELGVARVSTGSVPYSAG LYAAAHAARAVRDGEQLPRSVPYAELQARLVDYENRTSTT" CDS complement(2227142..2228464) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2022C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2022c, -, len: 440 aa. Equivalent to Rv1999c, len: 440 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 440 aa overlap). Probable conserved integral membrane protein, possibly transporter of cationic amino acid, similar to many transporters, especially amino acid transporters, e.g. CAC08265.1|AL392146 putative amino acid transporter from Streptomyces coelicolor (414 aa); P39277|YJEH_ECOLI hypothetical 44.8 kd protein from Escherichia coli (418 aa), FASTA scores, opt: 343, E(): 6.6e-15, (27.2% identity in 408 aa overlap); etc. Also similar to Rv1979c from Mycobacterium tuberculosis, FASTA score: (28.2% identity in 277 aa overlap); Rv2127, Rv0346c, Rv0522, etc. SEEMS TO BELONG TO THE APC FAMILY." /db_xref="GOA:P63350" /db_xref="InterPro:IPR002293" /db_xref="UniProtKB/Swiss-Prot:P63350" /protein_id="SIU00629.1" /translation="MRRPLDPRDIPDELRRRLGLLDAVVIGLGSMIGAGIFAALAPAA YAAGSGLLLGLAVAAVVAYCNAISSARLAARYPASGGTYVYGRMRLGDFWGYLAGWGF VVGKTASCAAMALTVGFYVWPAQAHAVAVAVVVALTAVNYAGIQKSAWLTRSIVAVVL VVLTAVVVAAYGSGAADPARLDIGVDAHVWGMLQAAGLLFFAFAGYARIATLGEEVRD PARTIPRAIPLALGITLAVYALVAVAVIAVLGPQRLARAAAPLSEAMRVAGVNWLIPV VQIGAAVAALGSLLALILGVSRTTLAMARDRHLPRWLAAVHPRFKVPFRAELVVGAVV AALAATADIRGAIGFSSFGVLVYYAIANASALTLGLDEGRPRRLIPLVGLIGCVVLAF ALPLSSVAAGAAVLGVGVAAYGVRRIITRRARQTDSGDTQRSGHPSAT" CDS 2228535..2230148 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2023" /product="unknown protein" /note="Mb2023, -, len: 537 aa. Equivalent to Rv2000, len: 537 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 537 aa overlap). Hypothetical unknown protein. Protein product from Mb2023 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P64918" /protein_id="SIU00630.1" /translation="MRPGFVGLGFGQWPVYVVRWPKLHLTPRQRKRVLHRRRLLTDRP ISLSQIPIRTGGPMNDPWPRPTQGPAKTIETDYLVIGAGAMGMAFTDTLITESGARVV MIDRACQPGGHWTTAYPFVRLHQPSAYYGVNSRALGNNTIDLVGWNQGLNELAPVGEI CAYFDAVLQQQLLPTGRVDYFPMSEYLGDGRFRTLAGTEYVVTVNRRIVDATYLRAVV PSMRPAPYSVAPGVDCVAPNELPKLGTRDRYVVVGAGKTGMDVCLWLLRNDVCPDKLT WIMPRDSWLIDRATLQPGPTFVRQFRESYGATLEAIGAATSTDDLFDRLETAGTLLRI DPSVRPSMYRCATVSHLELEQLRRIRDIVRMGHVQRIEPTTIVLDGGSVPATPTALYI DCTADGAPQRPAKPVFDADHLTLQAVRGCQQVFSAAFIAHVEFAYEDDAVKNELCTPI PHPDCDLDWMRLMHSDLGNFQRWLNDPDLTDWLSSARLNLLADLLPPLSHKPRVRERV VSMFQKRLGTAGDQLAKLLDAATATTEQR" CDS 2230158..2230910 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2024" /product="Acyl-ACP thioesterase" /note="Mb2024, -, len: 250 aa. Equivalent to Rv2001, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 250 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0466, AL021933|MTV038_10 (264 aa), FASTA scores: opt: 592, E():0, (38.0% identity in 263 aa overlap). Protein product from Mb2024 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZW3" /db_xref="InterPro:IPR002864" /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3XZW3" /protein_id="SIU00631.1" /translation="MHHNRDVDLALVERPSSGYVYTTGWRLATTDIDEHQQLRLDGVA RYIQEVGAEHLADAQLAEVHPHWIVLRTVIDVINPIELPSDITFHRWCAALSTRWCSM RVQLQGSAGGHIETEGFWICVNKDTLTPSRLTDDCIARFGSTTENHRLKWRPWLTGPN IDGTETPFPLRRTDIDPFEHVNNTIYWHGVHEILCQIPTLTAPYRAVLEYRSPIKSGE PLTIRYEQHDDVVRMHFVVGDDVRAAALLRRL" CDS 2230986..2231768 /codon_start=1 /transl_table=11 /gene="fabG3" /locus_tag="BQ2027_MB2025" /product="POSSIBLE 20-BETA-HYDROXYSTEROID DEHYDROGENASE FABG3 (Cortisone reductase) ((R)-20-hydroxysteroid dehydrogenase)" /note="Mb2025, fabG3, len: 260 aa. Equivalent to Rv2002, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). Possible fabG3, 20-beta-hydroxysteroid dehydrogenase (EC 1.1.1.53), similar to e.g. 2BHD_STREX|P19992 20-beta-hydroxysteroid dehydrogenase (255 aa), FASTA scores: opt: 718, E(): 2e-38, (49.8% identity in 243 aa overlap), and many mycobacterial proteins. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb2025 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2025 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P69166" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P69166" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00632.1" /translation="MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEG KAVAAELADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDYAL TEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVACHGYTATKFAV RGLTKSTALELGPSGIRVNSIHPGLVKTPMTDWVPEDIFQTALGRAAEPVEVSNLVVY LASDESSYSTGAEFVVDGGTVAGLAHNDFGAVEVSSQPEWVT" CDS complement(2231889..2232746) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2026C" /product="Methyltransferase type 11" /note="Mb2026c, -, len: 285 aa. Equivalent to Rv2003c, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 285 aa overlap). Conserved hypothetical protein. Some similarity with Methanococcus jannaschii 67555|U67555_3 (205 aa), FASTA scores: opt: 357, E(): 3.2e-17, (33.8% identity in 204 aa overlap). Protein product from Mb2026c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2026c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64920" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P64920" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00633.1" /translation="MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADC CWNQLAVTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQPRL EIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVSRHFGAVLMAF TLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLYALRAARGQPGYRDARFYT AAELEQLLADSGFRVIARRCTLHQPPGLARYDIEAAHDGIQAGAGFVAISAVDQAHEP KDDHPLESE" CDS complement(2232804..2234300) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2027C" /product="Predicted kinase" /note="Mb2027c, -, len: 498 aa. Equivalent to Rv2004c, len: 498 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 498 aa overlap). Conserved hypothetical protein similar to several e.g. >pir||T36945 hypothetical protein SCJ1.12 (508 aa) - Streptomyces coelicolor >gi|5748625|emb|CAB53130.1| (AL109962). Smith-Waterman score: 7e-94, Identities = 199/468 (42%). Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb2027c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2027c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5G0" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P0A5G0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00634.1" /translation="MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKP VVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRD KQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAEL RHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGE PALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLR DFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTG KSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALR KARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARA GGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI" CDS complement(2234322..2235209) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2028C" /product="universal stress protein family protein" /note="Mb2028c, -, len: 295 aa. Equivalent to Rv2005c, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Conserved hypothetical protein, similar to MTCY39.23c, (50.3% identity in 316 aa overlap), C-terminus shows some similarity with YXIE_BACSU P42297 hypothetical 15.9 kd protein in bglh- (148 aa), FASTA scores, opt: 124, E(): 0.038, (28.5% identity in 144 aa overlap), also similar to Rv2623 (294 aa), (52.7% identity in 296 aa overlap) and other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1996, Rv2624c, Rv2028c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family. Protein product from Mb2028c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2028c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64922" /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/Swiss-Prot:P64922" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00635.1" /translation="MSKPRKQHGVVVGVDGSLESDAAACWGATDAAMRNIPLTVVHVV NADVATWPPMPYPETWGVWQEDEGRQIVANAVKLAKEAVGADRKLSVKSELVFSTPVP TMVEISNEAEMVVLGSSGRGALARGLLGSVSSSLVRRAGCPVAVIHSDDAVIPDPQHA PVLVGIDGSPVSELATAVAFDEASRRGVELIAVHAWSDVEVVELPGLDFSAVQQEAEL SLAERLAGWQERYPDVPVSRVVVCDRPARKLVQKSASAQLVVVGSHGRGGLTGMLLGS VSNAVLHAARVPVIVARQS" CDS 2235328..2239311 /codon_start=1 /transl_table=11 /gene="otsB1" /locus_tag="BQ2027_MB2029" /standard_name="otsB" /product="PROBABLE TREHALOSE-6-PHOSPHATE PHOSPHATASE OTSB1 (TREHALOSE-PHOSPHATASE) (TPP)" /note="Mb2029, otsB1, len: 1327 aa. Equivalent to Rv2006, len: 1327 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1327 aa overlap). Probable otsB1, trehalose-6-phosphate phosphatase (EC 3.1.3.12) (see citations below); strong similarity in central domain to OTSB_ECOLI P31678 trehalose-phosphatase (266 aa) and M. leprae TREHALOSE-PHOSPHATASE Q49734 (429 aa). Belongs to Glycosyl hydrolases family 65 (http://www.expasy.ch/cgi-bin/lists?glycosid.txt). FASTA scores, sp|Q49734|Q49734 PUTATIVE TREHALOSE-PHOSPHATASE (429 aa) opt: 1283 E(): 0; 51.7% identity in 420 aa overlap opt: 278, E(): 3.6e-11, (29.4% identity in 255 aa overlap). Note that previously known as otsB. Protein product from Mb2029 detected using SWATH mass spectrometry. Mb2029 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZZ3" /db_xref="InterPro:IPR003337" /db_xref="InterPro:IPR005194" /db_xref="InterPro:IPR005195" /db_xref="InterPro:IPR005196" /db_xref="InterPro:IPR006379" /db_xref="InterPro:IPR008928" /db_xref="InterPro:IPR011013" /db_xref="InterPro:IPR012341" /db_xref="InterPro:IPR023198" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="InterPro:IPR037018" /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ3" /protein_id="SIU00636.1" /translation="MRCGIVVNVTGPPPTIDRRYHDAVIVGLDNVVDKATRVHAAAWT KFLDDYLTRRPQRTGEDHCPLTHDDYRRFLAGKPDSVADFLAARGIRLPPGSPTDLTD DTVYGLQNLERQTFLQLLNTGVPEGKSIASFARRLQVAGVRVAAHTSHRNYGHTLDAT GLAEVFAVFVDGAVTAELGLPAEPNPAGLIETAKRLGANPGRCVVIDSCQTGLRAGRN GGFALVIAVDAHGDAENLLSSGADAVVADLAAVTVGSGDAAISTIPDALQVYSQLKRL LTGRRPAVFLDFDGTLSDIVERPEAATLVDGAAEALRALAAQCPVAVISGRDLADVRN RVKVDGLWLAGSHGFELVAPDGSHHQNAAATAAIDGLAEAAAQLADALREIAGAVVEH KRFAVAVHYRNVADDSVDNLIAAVRRLGHAAGLRVTTGRKVVELRPDIAWDKGKALDW IGERLGPAEVGPDLRLPIYIGDDLTDEDAFDAVRFTGVGIVVRHNEHGDRRSAATFRL ECPYTVCQFLSQLACDLQEAVQHDDPWTLVFHGYDPGQERLREALCAVGNGYLGSRGC APESAESEAHYPGTYVAGVYNQLTDHIEGCTVDNESLVNLPNWLSLTFRIDGGAWFNV DTVELLSYRQTFDLRRATLTRSLRFRDAGGRVTTMTQERFASMNRPNLVALQTRIESE NWSGTVDFRSLVDGGVHNTLVDRYRQLSSQHLTTAEIEVLADSVLLRTQTSQSGIAIA VAARSTLWRDGQRVDAQYRVARDTNRGGHDIQVTLSAGQSVTLEKVATIFTSRDAATL TAAISAQRCLGEAGRYAELCQQHVRAWARLWERCAIDLTGNTEELRLVRLHLLHLLQT ISPHTAELDAGVPARGLNGEAYRGHVFWDALFVAPVLSLRMPKVARSLLDYRYRRLPA ARRAAHRAGHLGAMYPWQSGSDGSEVSQQLHLNPRSGRWTPDPSDRAHHVGLAVAYNA WHYYQVTGDRQYLVDCGAELLVEIARFWVGLAKLDDSRGRYLIRGVIGPDEFHSGYPG NEYDGIDNNAYTNVMAVWVILRAMEALDLLPLTDRRHLIEKLGLTTQERDQWDDVSRR MFVPFHDGVISQFEGYSELAELDWDHYRHRYGNIQRLDRILEAEGDSVNNYQASKQAD ALMLLYLLSSDELIGLLARLGYRFAPTQIPGTVDYYLARTSDGSTLSAVVHAWVLARA NRSNAMEYFRQVLRSDIADVQGGTTQEGIHLAAMAGSIDLPQRCYSGLELRDDRLVLS PQWPEALGPLEFPFVYRRHQLSLRISGRSATLTAESGDAEPIEVECRGHVQRLRCGHT IEVGCSR" mobile_element 2239359..2240586 /mobile_element_type="insertion sequence:IS1607" /locus_tag="BQ2027_IS1607" /note="IS1607, len: 1228 nt. Equivalent to IS1607, len: 1228 nt, from Mycobacterium tuberculosis strain H37Rv,(99.8% identity in 1084 nt overlap)." CDS complement(2239410..2239754) /codon_start=1 /transl_table=11 /gene="fdxA" /locus_tag="BQ2027_MB2030C" /product="ferredoxin fdxa" /note="Mb2030c, fdxA, len: 114 aa. Equivalent to Rv2007c, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Probable fdxA, ferredoxin, similar to e.g. FER_MYCSM P00215 ferredoxin, Mycobacterium smegmatis (106 aa), FASTA scores, opt: 448, E(): 1 .6e-21, (58.7% identity in 109 aa overlap), also similar to Rv0886|MTCY31.14, (34.2% identity in 117 aa overlap) and fdxC|Rv1177. Protein product from Mb2030c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2030c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64123" /db_xref="InterPro:IPR000813" /db_xref="InterPro:IPR017896" /db_xref="UniProtKB/Swiss-Prot:P64123" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00637.1" /translation="MTYVIGSECVDVMDKSCVQECPVDCIYEGARMLYINPDECVDCG ACKPACRVEAIYWEGDLPDDQHQHLGDNAAFFHQVLPGRVAPLGSPGGAAAVGPIGVD TPLVAAIPVECP" CDS complement(2239943..2241268) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2031C" /product="ATPase, AAA family" /note="Mb2031c, -, len: 441 aa. Equivalent to Rv2008c, len: 441 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 441 aa overlap). Conserved hypothetical protein. Contains PS00017 ATP/GTP-binding site motif A, PS00501 Signal peptidases I serine active site. Also contains helix-turn-helix motif at aa 258-279. Similar to several conserved hypothetical proteins e.g. NP_085874.1|14028123|dbj|BAB54715.1 hypothetical protein from Mesorhizobium loti (435 aa). Smith-Waterman score: 1e-74, Identities = 158/359 (44%) Protein product from Mb2031c detected using SWATH mass spectrometry." /db_xref="GOA:P64924" /db_xref="InterPro:IPR025420" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041682" /db_xref="UniProtKB/Swiss-Prot:P64924" /protein_id="SIU00638.1" /translation="MDEIESLIGLRPTPLTWPVVIAGDFLGVWDPPPSLPGAANHEIS APTARISCMLIERRDAAARLRRALHRAPVVLLTGPRQAGKTTLSRLVGKSAPECTFDA ENPVDATRLADPMLALSGLSGLITIDEAQRIPDLFPVLRVLVDRPVMPARFLILGSAS PDLVGLASESLAGRVELVELSGLTVRDVGSSAADRLWLRGGLPPSFTARSNEDSAAWR DGYITTFLERDLAQLGVRIPAATMRRAWTMLAHYHGQLFSGAELARSLDVAQTTARRY LDALTDALVVRQLTPWFANIGKRQRRSPKIYIRDTGLLHRLLGIDDRLALERNPKLGA SWEGFVLEQLAALLAPNPLYYWRTQQDAELDLYVELSGRPYGFEIKRTSTPSISRSMR SALVDLQLARLAIVYPGEHRFPLSDTVVAVPADQILTTGSVDELLALLK" CDS 2241356..2241598 /codon_start=1 /transl_table=11 /gene="vapb15" /locus_tag="BQ2027_MB2032" /product="antitoxin vapb15" /note="Mb2032, -, len: 80 aa. Equivalent to Rv2009, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 80 aa overlap). Conserved hypothetical protein, very similar to Rv1560|MTCY48.05c (54.4% identity in 68 aa overlap). Mb2032 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3Y019" /protein_id="SIU00639.1" /translation="MYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE PLGRDEALALQGSGFDFSDDEIESFSDTDRKLADES" CDS 2241599..2241997 /codon_start=1 /transl_table=11 /gene="vapc15" /locus_tag="BQ2027_MB2033" /product="toxin vapc15" /note="Mb2033, -, len: 132 aa. Equivalent to Rv2010, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 132 aa overlap). Conserved hypothetical protein, similar to Rv1561|MTCY48.04c, (38.1% identity in 126 aa overlap) Protein product from Mb2033 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2033 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64926" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P64926" /protein_id="SIU00640.1" /translation="MIVDTSVWIAYLSTSESLASRWLADRIAADSTVIVPEVVMMELL IGKTDEDTAALRRRLLQRFAIEPLAPVRDAEDAAAIHRRCRRGGDTVRSLIDCQVAAM ALRIGVAVAHRDRDYEAIRTHCGLRTEPLF" CDS complement(2242180..2242611) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2034C" /product="Transcriptional regulator, MarR family" /note="Mb2034c, -, len: 143 aa. Equivalent to Rv2011c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein, some similarity to putative regulatory proteins e.g. putative marR-family regulatory protein from Streptomyces coelicolor A3(2) (157 aa), emb|CAB63189.1| (AL133469) 34% identity in 110 aa overlap. Low similarity to PETP_RHOCA P31078 petp protein. Rhodobacter capsulatus (166 aa), FASTA scores, opt: 101, E(): 0 .36, (31.8% identity in 88 aa overlap)" /db_xref="GOA:P64928" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P64928" /protein_id="SIU00641.1" /translation="MSDEIARLVADVFELAGLLRRSGEVVAAREGHTQARWQLLSVVS DRALTVPQAARRLGVTRQGVQRVANDLVVCGLAELRHNPDHRTSPLLVLTENGRRVLQ AITERAIVVNNRLADAVDPAALQATRDSLRRMIVALKAERP" CDS 2242652..2243146 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2035" /product="Phage envelope protein" /note="Mb2035, -, len: 164 aa. Equivalent to Rv2012, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 164 aa overlap). Conserved hypothetical protein, similar to AAK04358.1|AE006263_5 hypothetical protein from Lactococcus lactis (137 aa), (48% identity in 129 aa overlap). Protein product from Mb2035 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR009833" /db_xref="InterPro:IPR036696" /db_xref="UniProtKB/Swiss-Prot:P64930" /protein_id="SIU00642.1" /translation="MLSKSKRSCRRRETLRIGEKMSAPITNLQAAQRDAIMNRPAVNG FPHLAETLRRAGVRTNTWWLPAMQSLYETDYGPVLDQGVPLIDGVAEVPAFDRTALVT ALRADQAGQTSFREFAAAAWRAGVLRYVVDLENRTCTYFGLHDQTYMEHYAAVEPSGG APTS" gene 2243769..2244996 /locus_tag="BQ2027_IS1607" CDS 2243850..2244470 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2036" /product="transposase" /note="Mb2036, -, len: 206 aa. Similar to Rv2013, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv. Possible transposase: shows similarity to N-terminal part of transposase and insertion element hypothetical proteins eg sp|Q53198|Y4UE_RHISN PUTATIVE TRANSPOSASE Y4UE (359 aa) opt: 383, E(): 1.3e-18; 35.1% identity in 225 aa overlap; sp|P 14707|YM3_STRCO MINI-CIRCLE HYPOTHETICAL 45.7 KD P (414 aa) opt: 302, E(): 4.2e-13; 33.3% identity in 207 aa overlap; and YI90_MYCPA P14322 insertion element is900 hypothetical protein (399 aa), FASTA scores, opt: 146, E(): 0.0021, (26.9% identity in 145 aa overlap). Length changed since first submission (no clear start apparent). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (c-t) at the 5' start of Mb2036, leads to a longer product with a different NH2 part compared toits homolog in Mycobacterium tuberculosis strain H37Rv (206 aa versus 159 aa)." /db_xref="GOA:A0A1R3XZY3" /db_xref="InterPro:IPR002525" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY3" /protein_id="SIU00643.1" /translation="MSIVDARGREVRRATIEHNAAGLRELLELLSRAGAREVAIERPD GPVVDTLLEAGITVVVISPNQLKNLRGRYGSAGNKDDRFDAFVLADTLRTDRSRLRPL LPDTPATATLRRTCRPRKDLVAHRVALANQLRAHLRVVFPGVVGLFADLDSPISLAFL TFLPRFDCQDRADWLSVKRLAGWLAAAGYCGRAPRPAHRCPARRHR" CDS 2244424..2245014 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2037" /product="transposase" /note="Mb2037, -, len: 196 aa. Equivalent to Rv2014, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (). Possible transposase, similar to insertion elements e.g. sp|P14707|YM3_STRCO MINI-CIRCLE HYPOT HETICAL 45.7 KD P (414 aa) opt: 249 z-score: 307.0 E(): 1.4e-09; 33.1% identity in 169 aa overlap; and YI90_MYCPA P14322 insertion element is900 hypothetical protein (399 a a), FASTA scores, opt: 242, z-score: 299.9, E(): 3.7e-10, (3 2.5% identity in 163 aa overlap); possibly made by frameshifting with respect to upstream ORF. Length changed since first submission." /db_xref="GOA:A0A1R3Y237" /db_xref="InterPro:IPR003346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y237" /protein_id="SIU00644.1" /translation="MLHDRLTGAPRGATGDEGAANAHITRAMVAALTSVATQIKTLDA QIAEQLSLHADAHIFTSLPRSGTVRAARLLAEIGDCRARFPTPESLACLAGVAPSTRQ SGKVKHVGFRWAADKQLRDAVCDFAGDSRRANLWAADRYNRAIARGHDHPHAVRILAR AWLYAIWHCWQDGAAYHPANHRALQALLNQDQDRAA" CDS complement(2245142..2246398) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2038C" /product="HNH endonuclease domain protein" /note="Mb2038c, -, len: 418 aa. Equivalent to Rv2015c, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 418 aa overlap). Conserved hypothetical protein. Nearly identical to Mycobacterium tuberculosis Rv1765c|MTCY28.31c, (378 aa), an ORF starting next to ISB9, and ending in IS6110. Different N-terminus chosen and C-terminus differs as that of Rv1765c has been truncated by IS6110. Does NOT show similarities with transposases. BLAST hits with non-IS part of MTU78639. FASTA scores: Z95890|MTCY28_31 (378 aa) opt: 2417, E(): 0, (97.8% identity in 364 aa overlap). Mb2038c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Q9" /db_xref="InterPro:IPR002711" /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Q9" /protein_id="SIU00645.1" /translation="MSSTATSGAAVVSPAERVEVLFEELAELAGQRNAIDGRIVEIVA ELDRDGLWGVTGARLVAGLVAWKMGCSSGNAHTIATVARRLPEFPRCARGMREGRLSL DQVGVIAGRAGEGSDAHYAQLAGVATVNQLRTALKLEPRPEPEPDFRPEPRPSITRSA DEQFSCWRIKLPHVEAAKFDAALQSHLDALIAEYKRDHDNSDGVSDQRPPLPGNVEAF LRLVEAGWDAEVARRPHGQHTTVVMHLDVQERAAGLHLGPLLSESERRYLLCDATFEA WFERDGQVIGCGRTTRQINRRLRRALEHRDRTCVVPGCGATRGLHAHHIRHWQDGGAT ELANLVLVCPYHHRAHHRGLITITGPADNLTVADSAGRPLSAGSLARASTKPPPAVAP WPGPTGERADWWWYEPFQPQPPPISN" CDS 2246752..2247327 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2039" /product="HYPOTHETICAL PROTEIN" /note="Mb2039, -, len: 191 aa. Equivalent to Rv2016, len: 191 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 191 aa overlap). Hypothetical protein. Protein product from Mb2039 detected using SWATH mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3Y002" /protein_id="SIU00646.1" /translation="MTELGDKFLAALVGTIRDTRFDIADMRNWRPGWFPTMHSRCLSN LIHDRIWAHLVTLIASNPGTSIKDKGATREIVVGAHLRLRIKRHHAGDEISTYPTRTA IEFWQQGSQPAFPGLEEVRIAVGYRWDPDTREIGAPLLSLRDGKDHVIWVVELDEPAA GVKITWTPIEPTLPSIDFGDLGEDSGASGER" CDS 2247324..2248364 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2040" /product="transcriptional regulatory protein" /note="Mb2040, -, len: 346 aa. Equivalent to Rv2017, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 346 aa overlap). Hypothetical regulatory protein, shows similarity at N-terminal end to several transcriptional regulators e.g. Bacillus subtilis BSUB0012_44 (108 aa). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature in C-terminal half, may be fortuitous. FASTA scores: Z99115|BSUB0012_44 Bacillus subtilis (108 aa) opt: 154, E(): 0.0012; 35.5% identity in 62 aa overlap. Contains probable helix-turn-helix motif at aa 18-39 (Score 2243, +6.83 SD)" /db_xref="GOA:A0A1R3Y0C4" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010359" /db_xref="InterPro:IPR010982" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0C4" /protein_id="SIU00647.1" /translation="MNGLGDVLAVARKARGLTQIELAELVGLTQPAINRYESGDRDPD QHIVAKLAEILGVTDDLLIHGNRFRGALAVDAHMRRHKTTKASAWRQLEARLNLLRVH ASFLFEEVAINSEQHVPAFDPEFTAAEDAARLVRAQWRMPMGPVVNLTRWMEAAGCLV FEEDFATQRIDGLSQWVDDYPVMLINANAAPDRKRLTLAHELGHLVLHSTNPTENMET EATAFAAEFLMPESEIRPELRRLDLGKLLELKREWGVSMQALLERAYRMGLVSAEART KLYKAMNARGWKTKEPGIESIVREKPSLPAHIGMTLRSRGFTDQQAAAIAGYANPADN PFRPEGGRLHAI" CDS 2248606..2249325 /codon_start=1 /transl_table=11 /gene="vapB45" /locus_tag="BQ2027_MB2041" /product="Putative antitoxin VapB45" /note="Mb2041, -, len: 239 aa. Equivalent to Rv2018, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 239 aa overlap). Conserved hypothetical protein, similar to Rv2308|MTCY339.01c (238 aa). FASTA scores: Z77163|MTCY339_1 Mycobacterium tuberculosis cosmid (238 aa) opt: 142, E(): 0.029; (24.8% identity in 250 aa overlap). Contains probable helix-turn-helix motif at aa 215-236 (Score 1175, +3.19 SD). Protein product from Mb2041 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2041 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZX8" /db_xref="InterPro:IPR007367" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR017277" /db_xref="UniProtKB/TrEMBL:A0A1R3XZX8" /protein_id="SIU00648.1" /translation="MAGDQELELRFDVPLYTLAEASRYLVVPRATLATWADGYERRPA NAPAVQGQPIITALPHPTGSHARLPFVGIAEAYVLNAFRRAGVPMQRIRPSLDWLIKN VGPHALASQDLCTGGAEVLWRFAERSGEGSPDDLVVRGLIVPRSGQYVFKEIVEHYLQ QISFADDNLASMIRLPQYGDANVVLDPRRGYGQPVFDGSGVRVADVLGPLRAGATFQA VADDYGVTPDQLRDALDAIAA" CDS 2249315..2249731 /codon_start=1 /transl_table=11 /gene="vapC45" /locus_tag="BQ2027_MB2042" /product="Putative ribonuclease VapC45 (RNase VapC45) (EC (Toxin VapC45)" /EC_number="3.1.-.-" /note="Mb2042, -, len: 138 aa. Equivalent to Rv2019, len: 138 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 138 aa overlap). Hypothetical protein. Protein product from Mb2042 detected using SWATH mass spectrometry. Mb2042 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041375" /db_xref="UniProtKB/TrEMBL:A0A1R3Y032" /protein_id="SIU00649.1" /translation="MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYG ETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAE QVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL" CDS complement(2249747..2250046) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2043C" /product="Putative helicase" /note="Mb2043c, -, len: 99 aa. Equivalent to Rv2020c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Conserved hypothetical protein, nearly identical to C-terminal part of hypothetical protein RvD1-Rv2024c' from Mycobacterium bovis BCG (1606 aa) emb|CAB44655.1| (Y18605). Corresponds to deletion region RvD1 so probably truncated protein. Mb2043c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR041635" /db_xref="UniProtKB/TrEMBL:A0A1R3Y013" /protein_id="SIU00650.1" /translation="MAPGMKWAAKTDHLAIVLLPRHHRRHSRRGRALPARSRSALGWI IERYRVTTDKASGIVNDPNDWCDEHDDPTYIVDLIKKVTTVSVETMKIVDGLAGG" CDS complement(2250131..2250436) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2044C" /product="transcriptional regulatory protein" /note="Mb2044c, -, len: 101 aa. Equivalent to Rv2021c, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Unknown, possible regulatory protein similar to Mycobacterium tuberculosis hypothetical protein Rv3183, MTV014.27 (EMBL; AL021646). FASTA scores; TR:E12487 74 (109 aa) opt: 214 E(): 1.2e-09, 43.0% identity in 107 aa overlap. Contains probable helix-turn-helix at aa 45-66 (Score 1472, +4.20 SD) Protein product from Mb2044c detected using SWATH mass spectrometry. Mb2044c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3XZX6" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010982" /db_xref="InterPro:IPR039554" /db_xref="UniProtKB/TrEMBL:A0A1R3XZX6" /protein_id="SIU00651.1" /translation="MAMTLRDMDAVRPVNREAVDRHKARMRDEVRAFRLRELRAAQSL TQVQVAALAHIRQSRVSSIENGDIGSAQVNTLRKYVSALGGELDITVRLGDETFTLA" CDS complement(2250445..2251050) /codon_start=1 /transl_table=11 /gene="higB2" /locus_tag="BQ2027_MB2045C" /product="Toxin HigB" /note="Mb2045c, -, len: 201 aa. Equivalent to Rv2022c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv3182, MTV014.26 (EMBL:AL 021646). FASTA scores; TR:E1248773 (114 aa) opt: 335, E(): 3e-22, 53.8% identity in 106 aa overlap and to hypothetical proteins from Yersinia pestis (115 aa) e.g. emb|CAB53172.1| (AL109969), 41% identity in 108 aa overlap. Protein product from Mb2045c detected using shotgun mass spectrometry. Mb2045c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009241" /db_xref="UniProtKB/TrEMBL:A0A1R3Y021" /protein_id="SIU00652.1" /translation="MNVPWENAHGGALYCLIRGDEFSAWHRLLFQRPGCAESVLACRH FLDGSPVARCSYPEEYHPCVISRIALLCDSVGWTADVERISAWLNGLDRETYELVFAA IEVLEEEGPALGCPLVDTVRGSRHKNMKELRPGSQGRSEVRILFAFDPARQAIMLAAG NKAGRWTQWYDEKIKAADEMFAEHLAQFEDTKPKRRKRKKG" CDS complement(2251075..2251434) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2046C" /product="HYPOTHETICAL PROTEIN" /note="Mb2046c, -, len: 119 aa. Equivalent to Rv2023c, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Hypothetical protein, alternative upstream start possible. Mb2046c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ5" /protein_id="SIU00653.1" /translation="MAARHARAGRWAAQPRPMLGSGAVRYEVGANIDATGFGGIAAVH RLVTRLGLVTRLGLVERVDAHSRFSSSNLPKSSRRISGRVSLSGMSNSAAKVVASTSS SPWGQPLSVGLRRRWRS" CDS complement(2251594..2252271) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2047C" /product="HYPOTHETICAL PROTEIN" /note="Mb2047c, -, len: 225 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but equivalent to MT2080, len: 225 aa, from Mycobacterium tuberculosis strain CDC1551, (100.0% identity in 225 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD1 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb2047c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y244" /protein_id="SIU00654.1" /translation="MVQRYPFRMVQRTPAMTSVAQLEHYLEEHLTKELAWLLRAATEW HAQHCMNLGIDGYSMQVYALDSTVLHARTLFEFFTQNTSVGQNANYYNCTVYKVPLIG SILYQFHWRRPIHSHMMHAQDRRPVTQLPTYDDHAQTKPLNEMPVDFAKEIVRLWRVF VKDLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQRQIAVGKETSRLEPNFSI PQIEWPA" CDS complement(2252666..2253562) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2048C" /product="HYPOTHETICAL PROTEIN" /note="Mb2048c, -, len: 298 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but equivalent to MT2081, len: 311 aa, from Mycobacterium tuberculosis strain CDC1551, (99.66% identity in 298 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD1 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb2048c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R8" /protein_id="SIU00655.1" /translation="MTLIQTVTTDDLVIQVADRRLSRPDGSVFDDDYTKLVCWNTSFT VGFTGLARIDPAQKKSTSEWLAETLCDYASFEDGVDALRYWASGQIGQLPTGKGWEDK RLGIIIAGFDRRRIPLVAEISNFDPEAPIPANQNEFKCYRIRRAPGHSASFRITGAAL TEKMYANILLRRVPRMLKQQDGITRAARLMVALQRRISEDNPGVGRHAMAVAIPRERT MPAVLSNLDAPSLNTMNSNFCYFDDAGFNYKQLGPHMAGGGWAWADFVAEADPSNPDM QKVGGRVLKCPQPPPQAESTGC" CDS complement(2253746..2258566) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2049C" /product="Putative helicase" /note="Mb2049c, RvD1-Rv2024c, len: 1606 aa. Equivalent to Rv2024c and similar to Rv2020c, len: 515 aa and 99 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 506 aa overlap and 67.0% identity in 97 aa overlap). Rv2024c: Conserved hypothetical protein. Identical to N-terminal part of much larger hypothetical protein, RvD1-Rv2024c' (1606 aa), from Mycobacterium bovis BCG: CAB44655.1|Y18605|13881753|AAK46361.1|AE007059 so probably truncated. Part of RvD1 chromosomal deletion region. Also similar to hypothetical protein from Helicobacter pylori. FASTA scores: AE0005|HPAE000580_2 Helicobacter pylori (607 aa) opt: 64, E(): 0, (36.2% identity in 464 aa overlap). Rv2020c: Conserved hypothetical protein, nearly identical to C-terminal part of hypothetical protein RvD1-Rv2024c' from Mycobacterium bovis BCG (1606 aa) emb|CAB44655.1| (Y18605). Corresponds to deletion region RvD1 so probably truncated protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, a large deletion region (RvD1) exists in between Rv2023 and Rv2024. In Mycobacterium bovis a 5000 bp insertion at this region results in Mb2049c being a much larger product with a different COOH part (1606 aa versus 515 aa). Protein product from Mb2049c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2049c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y012" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR002052" /db_xref="InterPro:IPR006935" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR039442" /db_xref="InterPro:IPR041635" /db_xref="UniProtKB/TrEMBL:A0A1R3Y012" /protein_id="SIU00656.1" /translation="MGSVHDVIEAFRKAPSNAERGTKFEQLMVRYFELDPTMAQQYDA VWRWIDWPERRGRTDTGIDLVARERDTGNYTAIQCKFYEPTHTLAKGDIDSFFTASGK TGFTNRVIISTTDRWGRNAEDALADQLVPVQRIGMAEIAESPIDWDIAWPAGDLQVNL TPAKRHELRPHQQQAIDAVFRGFAVGNDRGKLIMACGTGKTFTALKIAERIAADNGGS ARILLLVPSISLLSQTLREWTAQSELDVRAFAVCSDTKVSRSAEDYHVHDVPIPVTTD ARVLLHEMAHRRCAQGLTVVFCTYQSLPTVAKAQRLGVDEFDLVMCDEAHRTTGVTLA GDDESNVVRVHDGQYLKAARRLYMTATPRIFTESIKDRADQHSAELVSMDDELTFGPE FHRLSFGEAVERGLLTDYKVMVLTVDQGVIAPRLQQELSGVSGELMLDDASKIVGCWN GLAKRSGTGIVAGEPPMRRAVAFAKDIKTSKQVAELFPKVVEAYRELVDDGPGLACSV RHVDGTFNALVRNEQLAWLKGVVAEDECRILSNARCLSEGVDVPALDAVLFLNPRNSI VDVVQSVGRVMRKSPGKDYGYVILPVAVPEGVEPSAALADNKRFKVVWQVLNALRSHD ERFDAMVNSIALNVKPTKTGEGSDKLLGGHIGPTSDEAGPAVAEQLAMFSLSQWQEAI YARIVDKVGTRTYWEQWAADVADIAATLTTRIHALLGGADATAAAAFEQFLAGLRDNL NDSITPDDAISMLSQHLITKPVFDALFAGHDFASHNPVSRAMQKMVDTVGGAGLEAET ARLEGFYESVRRRAGEVTSAEGKQQVIAELYEKFFRIGFKKQAEALGIVYTPVEVVDF IVRAADFVSRKHFGRGLTDEGVHILDGFAGTGTFITRLLQSDLITAADLTRKYSQELH ANEIMLLAYYIAAVNIESTYHALAGKTADADAYEPFPGMALADTFQISEAGDSMDAIM FPYNNARILRQLATPISVIIGNPPYSVGQSSANDLNANVKYPTLDGRIEQTYAKRSTA QLKNSLYDSYIRAFRWATDRIGDNGVVGFVSNGGYIDGNTADGMRLSLADDYAAVYVY NLRGNQRTAGELSRQEGGKVFGGGSRNTVAIFLGIKDPKHSGPCDVLYRDIGDYLSRE EKLRIVGDGYLDTVEWQTVTPNLHGDWVNQRDDAFSAWPVIGDKKAALDVTRVFANYS AGLKTSRDAWCYNFSRGALEANIGRTIDFYNSEVDRINEIRGRDAKTPPVDALITVDS AKFSWDRINKRQVAQGIRIEFAPAGMRLGTYRPFTKEHAYLDPNQQLNNCTYQLPSMF PTPEHGNVGYYVVGMGSDKPFSCLMLNAIPDLAFWGSSNGQFFPRWTYEKTEPRDGEL DFESTTNAEVDDHGYRRVDNITGVILKLYRDTIGDQVTKDDIFYYVYGLLHDPAYRTK YAADLKKMLPHIPTPETRERFDQLASAGRKLADLHVGYESVKPYPLDVQLKPGADPED RETWRVEKMKWKSKQDHSTIIYNSRVTIAGIPDEAERYLLGSRSALGWIIDRYRVTTD KASGIVNDPNDWCDEHANPTYIVDLIKKVTTVSVETMKIVDSIVALASAGSDST" CDS complement(2259075..2260073) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2050C" /product="conserved membrane protein" /note="Mb2050c, -, len: 332 aa. Equivalent to Rv2025c, len: 332 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 332 aa overlap). Possible conserved transmembrane protein, CDF family possibly involved in transport of metal ions, similar to several hypothetical bacterial proteins e. g. Methanobacterium thermoautotrophicum AE000941_1 (298 aa; described as cation efflux system protein) and Archaeoglob us fulgidus AE001111_5 (384 aa). FASTA scores: AE000941_1 M ethanobacterium thermoautotrophicum (298 aa) opt: 452 E(): 3.3e-24; 30.8% identity in 266 aa overlap and AE001111_5 Archaeoglobus fulgidus section 16 (384 aa) opt: 371 E(): 1.7e-18; 27.7% identity in 267 aa overlap. TBparse score is 0.897 Protein product from Mb2050c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y0D1" /db_xref="InterPro:IPR002524" /db_xref="InterPro:IPR027469" /db_xref="InterPro:IPR027470" /db_xref="InterPro:IPR036837" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00657.1" /translation="MTHDHAHSRGVPAMIKEIFAPHSHDAADSVDDTLESTAAGIRTV KISLLVLGLTALIQIVIVVMSGSVALAADTIHNFADALTAVPLWIAFALGAKPATRRY TYGFGRVEDLAGSFVVAMITMSAIIAGYEAIARLIHPQQIEHVGWVALAGLVGFIGNE WVALYRIRVGHRIGSAALIADGLHARTDGFTSLAVLCSAGGVALGFPLADPIVGLLIT AAILAVLRTAARDVFRRLLDGVDPAMVDAAEQALAARPGVQAVRSVRMRWIGHRLHAD AELDVDPALDLAQAHRIAHDAEHELTHTVPKLTTALIHAYPAEHGSSIPDRGRTVE" CDS complement(2260188..2261072) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2051C" /product="universal stress protein family protein" /note="Mb2051c, -, len: 294 aa. Equivalent to Rv2026c, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 294 aa overlap). Conserved hypothetical protein, very similar to Mycobacterium tuberculosis hypothetical proteins Rv2005c, Rv2623, Rv1996, Rv2624c, Rv2028c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family. Protein product from Mb2051c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/TrEMBL:A0A1R3XZY6" /protein_id="SIU00658.1" /translation="MSAATAKYGILVGVDGSAQSNAAVAWAAREAVMRQLPITLLHIV APVVVGWPVGQLYANMTEWQKDNAQQVIEQAREALTNSLGESKPPQVHTELVFSNVVP TLIDASQQAWLMVVGSQGMGALGRLLLGSISTALLHHARCPVAIIHSGNGATPDSDAP VLVGIDGSPASEAATALAFDEASRRRVDLVALHAWTDLGMFPVLGMDWREREKREAEV LAERLAGWQEQYPDVRVHRSLVCDKPARWLLEHSEQAQLVVVGSHGRGGFSGMLLGSV SSAVAHSVRIPVIVVRPS" CDS complement(2261112..2262833) /codon_start=1 /transl_table=11 /gene="dost" /locus_tag="BQ2027_MB2052C" /product="two component sensor histidine kinase dost" /note="Mb2052c, -, len: 573 aa. Equivalent to Rv2027c, len: 573 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 573 aa overlap). Membrane protein related to histidine kinase response regulators. Highly similar to Mycobacterium tuberculosis protein Rv3132c, MTCY03A2.2 6. FASTA scores: Z83867|MTCY3A2_26 (578 aa) opt: 2330, E(): 0; 62.5% identity in 560 aa overlap. Protein product from Mb2052c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2052c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y037" /db_xref="InterPro:IPR003018" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR011712" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3Y037" /protein_id="SIU00659.1" /translation="MTHPDRANVNPGSPPVRETLSQLRLRELLLEVQDRIEQIVEGRD RLDGLIDAILAITSGLKLDATLRAIVHTAAELVDARYGALGVRGYDHRLVEFVYEGID EETRHLIGSLPEGRGVLGALIEEPKPIRLDDISRHPASVGFPLHHPPMRTFLGVPVRI RDEVFGNLYLTEKADGQPFSDDDEVLVQALAAAAGIAVDNARLFEESRTREAWIEATR DIGTQMLAGADPAMVFRLIAEEALTLMAGAATLVAVPLDDEAPACEVDDLVIVEVAGE ISPAVKQMTVAVSGTSIGGVFHDRTPRRFDRLDLAVDGPVEPGPALVLPLRAADTVAG VLVALRSADEQPFSDKQLDMMAAFADQAALAWRLATAQRQMREVEILTDRDRIARDLH DHVIQRLFAVGLTLQGAAPRARVPAVRESIYSSIDDLQEIIQEIRSAIFDLHAGPSRA TGLRHRLDKVIDQLAIPALHTTVQYTGPLSVVDTVLANHAEAVLREAVSNAVRHANAT SLAINVSVEDDVRVEVVDDGVGISGDITESGLRNLRQRADDAGGEFTVENMPTGGTLL RWSAPLR" CDS complement(2262894..2263733) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2053C" /product="universal stress protein family protein" /note="Mb2053c, -, len: 279 aa. Equivalent to Rv2028c, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 279 aa overlap). Conserved hypothetical protein, highly similar to Mycobacterium tuberculosis proteins Rv2005c, Rv2623, Rv1996, Rv2624c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family. Rv2624c|MTCY01A10.08 (272 aa) and Rv3134c|MTCY03A2.24 (268 aa). FASTA scores: Z95387|MTCY1A10_8 (272 aa) opt: 563, E(): 2.5e-31, (36.8% identity in 266 aa overlap) and Z83867|MTCY3A2_24 (268 aa) opt: 562, E(): 2.9e-31, (40.7% identity in 273 aa overlap). Protein product from Mb2053c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/TrEMBL:A0A1R3Y020" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00660.1" /translation="MNQSHKPPSIVVGIDGSKPAVQAALWAVDEAASRDIPLRLLYAI EPDDPGYAAHGAAARKLAAAENAVRYAFTAVEAADRPVKVEVEITQERPVTSLIRASA AAALVCVGAIGVHHFRPERVGSTAAALALSAQCPVAIVRPHRVPIGRDAAWIVVEADG SSDIGVLLGAVMAEARLRDSPVRVVTCRQSGVGDTGDDVRASLDRWLARWQPRYPDVR VQSAAVHGELLDYLAGLGRSVHMVVLSASDQEHVEQLVGAPGNAVLQEAGCTLLVVGQ QYL" CDS complement(2263730..2264749) /codon_start=1 /transl_table=11 /gene="pfkB" /locus_tag="BQ2027_MB2054C" /product="6-phosphofructokinase pfkb (phosphohexokinase) (phosphofructokinase)" /note="Mb2054c, pfkB, len: 339 aa. Equivalent to Rv2029c, len: 339 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 339 aa overlap). Probable pfkB, phosphofructokinase (EC 2.7.1.-), similar to others eg P06999|K6P2_ECOLI 6-PHOSPHOFRUCTOKINASE I SOZYME 2 from E. coli (309 aa), FASTA scores: opt: 705, E(): 0; (41.4% identity in 304 aa overlap); and LACC_STRMU phosphotagatosekinase (310 aa); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. Protein product from Mb2054c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3XZZ1" /db_xref="InterPro:IPR002173" /db_xref="InterPro:IPR011611" /db_xref="InterPro:IPR017583" /db_xref="InterPro:IPR029056" /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00661.1" /translation="MTEPAAWDEGKPRIITLTMNPALDITTSVDVVRPTEKMRCGAPR YDPGGGGINVARIVHVLGGCSTALFPAGGSTGSLLMALLGDAGVPFRVIPIAASTRES FTVNESRTAKQYRFVLPGPSLTVAEQEQCLDELRGAAASAAFVVASGSLPPGVAADYY QRVADICRRSSTPLILDTSGGGLQHISSGVFLLKASVRELRECVGSELLTEPEQLAAA HELIDRGRAEVVVVSLGSQGALLATRHASHRFSSIPMTAVSGVGAGDAMVAAITVGLS RGWSLIKSVRLGNAAGAAMLLTPGTAACNRDDVERFFELAAEPTEVGQDQYVWHPIVN PEASP" CDS complement(2264766..2265005) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2055C" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb2055c, -, len: 79 aa. Equivalent to 3' end of Rv2030c, len: 681 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 79 aa overlap). Conserved hypothetical protein that corresponds to products of two adjacent ORF's described previously MSGTUBDWN_4 (390 aa) and MSGTUBDWN_1 (385 aa). Also similar to C-terminal two-thirds of Mycobacterium tuberculosis protein Rv2143 (MTCY270.25c; 352 aa) and to Rv0571c (443 aa) and Mycobacterium leprae protein U650s MLU15184_16 (258 aa). FASTA scores: M93129|MSGTUBDWN_4 (390 aa) opt: 2530 E(): 0; 97.7% identity in 385 aa overlap and M93129|MSGTUBDWN_1 (385 aa) opt: 1983 E(): 0; 99.0% identity in 309 aa overlap. Z95388| MTCY270_25 (352 aa) opt: 882 E(): 0; 61.1 % identity in 226 aa overlap. U15184|MLU15184_16 (258 aa) opt: 549 E(): 9.8e-29; 43.8% identity in 219 aa overlap. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2030c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv2030c into 2 parts, Mb2055c and Mb2056c. Protein product from Mb2055c detected using SWATH mass spectrometry. Mb2055c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y033" /db_xref="InterPro:IPR007815" /db_xref="UniProtKB/TrEMBL:A0A1R3Y033" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00662.1" /translation="MSARLSRDAEAPLDVVRLGRAIGVVYLPATERQSHYLHVRPADQ FDAMIHIDQTRALEPLEVTSRWIAGENPETYPTGL" CDS complement(2264990..2266810) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2056C" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb2056c, -, len: 606 aa. Equivalent to 5' end of Rv2030c, len: 681 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 587 aa overlap). Conserved hypothetical protein that corresponds to products of two adjacent ORF's described previously MSGTUBDWN_4 (390 aa) and MSGTUBDWN_1 (385 aa). Also similar to C-terminal two-thirds of Mycobacterium tuberculosis protein Rv2143 (MTCY270.25c; 352 aa) and to Rv0571c (443 aa) and Mycobacterium leprae protein U650s MLU15184_16 (258 aa). FASTA scores: M93129|MSGTUBDWN_4 (390 aa) opt: 2530 E(): 0; 97.7% identity in 385 aa overlap and M93129|MSGTUBDWN_1 (385 aa) opt: 1983 E(): 0; 99.0% identity in 309 aa overlap. Z95388| MTCY270_25 (352 aa) opt: 882 E(): 0; 61.1 % identity in 226 aa overlap. U15184|MLU15184_16 (258 aa) opt: 549 E(): 9.8e-29; 43.8% identity in 219 aa overlap. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2030c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits Rv2030c into 2 parts, Mb2055c and Mb2056c. Protein product from Mb2056c detected using shotgun mass spectrometry. Mb2056c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y005" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR007815" /db_xref="InterPro:IPR014622" /db_xref="InterPro:IPR029057" /db_xref="UniProtKB/TrEMBL:A0A1R3Y005" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00663.1" /translation="MLMTAAADVTRRSPRRVFRDRREAGRVLAELLAAYRDQPDVIVL GLARGGLPVAWEVAAALHAPLDAFVVRKLGAPGHDEFAVGALASGGRVVVNDDVVRGL RITPQQLRDIAEREGRELLRRESAYRGERPPTDITGKTVIVVDDGLATGASMFAAVQA LRDAQPAQIVIAVPAAPESTCREFAGLVDDVVCATMPTPFLAVGESFWDFRQVTDEEV RRLLATPTAGPSLRRPAASTAADVLRRVAIDAPGGVPTHEVLAELVGDARIVLIGESS HGTHEFYQARAAMTQWLIEEKGFGAVAAEADWPDAYRVNRYVRGLGEDTNADEALSGF ERFPAWMWRNTVVRDFVEWLRTRNQRYESGALRQAGFYGLDLYSLHRSIQEVISYLDK VDPRAAARARARYACFDHACADDGQAYGFAAAFGAGPSCEREAVEQLVDVQRNALAYA RQDGLLAEDELFYAQQNAQTVRDAEVYYRAMFSGRVTSWNLRDQHMAQTLGSLLTHLD RHLDAPPARIVVWAHNSHVGDARATEVWADGQLTLGQIVRERYGDESRSIGFSTYTGT VTAASEWGGIAQRKAVRPALHAVSRSSSTRLQTVSWCQRG" CDS complement(2266822..2267256) /codon_start=1 /transl_table=11 /gene="hspX" /locus_tag="BQ2027_MB2057C" /standard_name="acr" /product="heat shock protein hspx (alpha-crystallin homolog) (14 kda antigen) (hsp16.3)" /note="Mb2057c, hspX, len: 144 aa. Equivalent to Rv2031c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). hspX, heat shock protein localized in the inner membrane (see citations below). Identical to P30223|14KD_MYCTU 14 KD ANTIGEN (16 kDa ANTIGEN) (HSP 16.3) of Mycobacterium tuberculosis (143 aa), FASTA scores: opt: 933, E(): 0, (100.0% identity in 143 aa overlap). BELONGS TO THE SMALL HEAT SHOCK PROTEIN (HSP20) FAMILY. Also known as alpha-crystallin and gene as acr (see some citations below). TBparse score is 0.897. Protein product from Mb2057c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2057c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5B8" /db_xref="InterPro:IPR002068" /db_xref="InterPro:IPR008978" /db_xref="UniProtKB/Swiss-Prot:P0A5B8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00664.1" /translation="MATTLPVQRHPRSLFPEFSELFAAFPSFAGLRPTFDTRLMRLED EMKEGRYEVRAELPGVDPDKDVDIMVRDGQLTIKAERTEQKDFDGRSEFAYGSFVRTV SLPVGADEDDIKATYDKGILTVSVAVSEGKPTEKHIQIRSTN" CDS 2267453..2268448 /codon_start=1 /transl_table=11 /gene="acg" /locus_tag="BQ2027_MB2058" /product="Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2" /note="Mb2058, -, len: 331 aa. Equivalent to Rv2032, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 331 aa overlap). acg (for acr-coregulated gene), conserved hypothetical protein possibly member of a superfamily of classical nitroreductases (see first citation below), similar to hypothetical mycobacterial proteins Rv3127|MTCY164.37 (344 aa) and Rv3131|MTCY03A2.27c (332 aa). FASTA scores: Z95150|MTCY164_38 Mycobacterium tuberculosis cosmid (344 aa) opt: 1208, E(): 0, (56.4% identity in 321 aa overlap); Z83867| MTCY3A2_27 Mycobacterium tuberculosis cosmid (332 aa) opt: 568, E(): 8.6e-30, (36.8% identity in 321 aa overlap). Similar to proteins SCJ1.11 (330 aa; AL109962) and SCJ12.27c (335 aa; AL109989) in Streptomyces coelicolor. Protein product from Mb2058 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2058 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0T1" /db_xref="InterPro:IPR000415" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0T1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00665.1" /translation="MPDTMVTTDVIKSAVQLACRAPSLHNSQPWRWIAEDHTVALFLD KDRVLYATDHSGREALLGCGAVLDHFRVAMAAAGTTANVERFPNPNDPLHLASIDFSP ADFVTEGHRLRADAILLRRTDRLPFAEPPDWDLVESQLRTTVTADTVRIDVIADDMRP ELAAASKLTESLRLYDSSYHAELFWWTGAFETSEGIPHSSLVSAAESDRVTFGRDFPV VANTDRRPEFGHDRSKVLVLSTYDNERASLLRCGEMLSAVLLDATMAGLATCTLTHIT ELHASRDLVAALIGQPATPQALVRVGLAPEMEEPPPATPRRLIDEVFHVRAKDHR" CDS complement(2268564..2269406) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2059C" /product="intermediary metabolism and respiration" /note="Mb2059c, -, len: 280 aa. Equivalent to Rv2033c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Conserved hypothetical protein, similar to hypothetical protein SCC77.24 (274 aa) from Streptomyces coelicolor A3(2) CAB66235.1|AL13650) (50% identity in 261 aa overlap). Mb2059c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021447" /db_xref="UniProtKB/TrEMBL:A0A1R3Y017" /protein_id="SIU00666.1" /translation="MLDRYGTDVLAAGGRRRPRSVEHPVELGMVVEDAETGYVGAVVR VEYGRIDLEDRYGKTRGFPLGPGYLLDGLPVILTAPRCAAAAGPRRTASGSVAVPGAR ARVARASRIYVEGRHDAELIAAVWGADLRIEGVVVEHLGGVDDLVEIVAKFRPGPRRR LGVLVDHLVAGSKEARIAEVVRRGPGGSDTLVVGHPYVDIWQAVKPQRVGLAAWPRVP RHIEWKHGVCDALGWPHADQADIAAAWRRIRSQVRDWTDLEPALIGRVEELIDFVTQP AGDE" CDS 2269618..2269941 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2060" /product="arsr repressor protein" /note="Mb2060, -, len: 107 aa. Equivalent to Rv2034, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 107 aa overlap). Probable repressor protein similar to several belonging to the ARSR FAMILY e.g. Q53040 (112 aa). FASTA scores: sptr|Q53040|Q53040 NITRILE HYDRATASE REGULATAR 2 (112 aa) opt: 167, E(): 6.7e-06; 44.7% identity in 76 aa overlap. TBparse score is 0.905. Contains probable helix-turn-helix at aa 32-53 (S core 1350, +3.78 SD),Mb2060 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0E0" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00667.1" /translation="MSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSR PAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTRALTGYAQLIDS EGDDT" CDS 2269938..2270426 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2061" /product="Ligand-binding SRPBCC domain CalC" /note="Mb2061, -, len: 162 aa. Equivalent to Rv2035, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical protein (156 aa) from Sinorhizobium meliloti CAC46569.1|AL591789 (34% identity in 146 aa overlap). Protein product from Mb2061 detected using shotgun mass spectrometry. Mb2061 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013538" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3XZZ7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00668.1" /translation="MTRPRTDAIHHHVVVNAPIERAFAVFTTRFGDFKPREHNLLAIP ITETVFECHAGGHIYDRGVDGSVCKWARVLVYEPPSRVLFTWDIGPTWRPETDLAKTS EVEVRFTAQSAETTRVDLEHRHLDRHGPGWESVADGVDSEAGWPLYLRRYTDLLCIQV QP" CDS 2270423..2271064 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2062" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2062, -, len: 213 aa. Equivalent to Rv2036, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 213 aa overlap). Conserved hypothetical protein; slight similarity to Streptomyces lincolnensis protein involved in lincomycin production Q54375 (238 aa). FASTA scores: sptr|Q54375|Q54375 (78-11) LINCOMYCIN PRODUCTION GENES (238 aa) opt: 119, E(): 0.97; 31.3% identity in 99 aa overlap. TBparse score is 0.934,Mb2062 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y046" /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR024344" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/TrEMBL:A0A1R3Y046" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00669.1" /translation="MIAADDDTEKSMMDMARAERAELAAFLTTLTLQQWETPSLCAGW SVKEVVAHMISYEDLGVFGLLKRFAKGRIVRANEVGVDEFAGLSPQELVDYVGRHLQP RGLTAGFGGMIALVDGMIHHQDIRRPLGQPRTIPAQRLDRVLRLMPKNPRLRARPRIK GLRLRATDLDWTIGTGPEVTGPGEALLMAMAGRPAAVSDLSGPGKPTLAGRLG" CDS complement(2271071..2272045) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2063C" /product="conserved transmembrane protein" /note="Mb2063c, -, len: 324 aa. Equivalent to Rv2037c, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 324 aa overlap). Possible conserved transmembrane protein, similar to hypothetical proteins from Mycobacterium leprae MLCB2052.31 (329 aa) and Bacillus subtilis P54513|YQHO_BACSU (291 aa). FASTA scores: Z98604|MLCB2052_1 6 Mycobacterium leprae cosmid B205 (329 aa) opt: 1764, E(): 0; 80.5% identity in 323 aa overlap and sp|P54513|YQHO_BACSU HYPOTHETICAL 32.9 KD PROTEIN IN G (291 aa) opt: 328, E(): 8.8e-14; 36.6% identity in 306 aa overlap. TBparse score is 0.919 Protein product from Mb2063c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2063c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y031" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR016035" /db_xref="UniProtKB/TrEMBL:A0A1R3Y031" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00670.1" /translation="MALVSTARVDLVCEGGGVRGIGLVGAVDALADAGYRFPRVAGSS AGAIVASLVAALQTAGEPVTRLAEMMRSIDYPKFLDRNLIGHVPLIGGGLSLLLSDGV YRGAYLEQLLGGLLADLGVHTFGDLRTGEAPEQFAWSLVVTASDLSRRRLVRIPWDLD SYGIHPDDFSVARAVHASSAIPFVFEPVRVRGATWVDGGLLSNFPVALFDRTDAEPRW PTFGIRLSARPGIPPTRPVQGPVSLGIAAIETLVSNQDNAYIDDPCTVRRTIFVPAHD VSPIDFDITAEQREALYQRGFQAGQKFLANWNYADYLADCGGPFTPSL" CDS complement(2272047..2273120) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2064C" /product="Probable sugar-transport ATP-binding protein ABC transporter" /note="Mb2064c, -, len: 357 aa. Equivalent to Rv2038c, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 357 aa overlap). Probable sugar-transport ATP-binding protein ABC transporter (see citation below), equivalent to MLCB2052.30|Z98604|MLCB2052_15 from Mycobacterium leprae (356 aa), FASTA scores: opt: 1866, E(): 0, (79.7% identity in 355 aa overlap). Also similar to multiple sugar import proteins e.g. Y08921|SRMSIK_1 msiK protein from Streptomyces reticuli (377 aa), FASTA scores: opt: 1336, E(): 0, (62.6% identity in 377 aa overlap); etc. Also similar to several proteins from Mycobacterium tuberculosis e.g. Rv2832c, Rv1238, Rv2397c, Rv3758c. Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Mb2064c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y000" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008995" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR040582" /db_xref="UniProtKB/TrEMBL:A0A1R3Y000" /protein_id="SIU00671.1" /translation="MASVSFEQATRRYPGTDRPALDRLDLIVGDGEFVVLVGPSGCGK TTSLRMVAGLETLDCGRIRIGERDVTEVDPKDRDVAMVFQNYALYPHMTVAQNMGFAL KVAKIGKAEIRERVLAAAKLLDLQSYLDRKPKDLSGGQRQRVAMGRAIVRRPQVFLMD EPLSNLDAKLRGQTRNQIAALQRQLGTTTVYVTHDQVEAMTMGDRVAVLSDGVLQQCA SPRELYRNPGNVFVAGFIGSPAMNLFRLSIADSTVSLGDWQILLPRAVVGTAAEVIIG VRPEHLELGGAGIEMDVDMVEELGADAYLYGRIVSGGCEMDQSIVARVDGRGPPERGS RVRLCPTPGHLHFFAVDGRRIPG" CDS complement(2273123..2273965) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2065C" /product="Probable sugar-transport integral membrane protein ABC transporter" /note="Mb2065c, -, len: 280 aa. Equivalent to Rv2039c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 280 aa overlap). Probable sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to MLCB2052.29|Z98604|MLCB2052_14 from Mycobacterium leprae (283 aa), FASTA scores: opt: 1593, E(): 0, (79.2% identity in 283 aa overlap). Also similar to maltose and lactose transport proteins e.g. X66092|CPMALGHOM_1 from C. perfringens (275 aa), FASTA scores: opt: 695, E(): 0, (41.2% identity in 228 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Also contains possible helix-turn-helix motif at aa 171-192, although this is probably fortuitous." /db_xref="GOA:A0A1R3Y041" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y041" /protein_id="SIU00672.1" /translation="MGWADRIVHRHFIRGLALYAGLIGIAWCALFPIIWALSGSLKAD GEVTEPTLFPSHPQWSNYREVFALMPFWRMFFNTVLYAGCVTAGQVFFCSLAGYAFAR LQFRGRDTLFVLYLSTLMVPLTVTVIPQFILMRIVGWVDTPWAMIVPGLFGSAFGTYL MRQFFRTLPTDLEEAAILDGCSPWQIYWRILLPHSRPAVLVLGVLTWVNVWNDFLWPL LMIQRNSLATLTLGLVRLRGEYVARWPVLMAASMLMLVPLVILYAVAQRSFVRGIAVT GLGG" CDS complement(2273952..2274854) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2066C" /product="Probable sugar-transport integral membrane protein ABC transporter" /note="Mb2066c, -, len: 300 aa. Equivalent to Rv2040c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Probable sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to MLCB2052.28|Z98604|MLCB2052_13 from Mycobacterium leprae (319 aa), FASTA scores: opt: 1606, E(): 0, (81.6% identity in 293 aa overlap). Also similar to many diverse sugar transport proteins. Mb2066c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y015" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y015" /protein_id="SIU00673.1" /translation="MTRRRGRRAWAGRMFVAPNLAAVVVFMLFPLGFSLYMSFQKWDL FTHATFVRLDNFRNLFTSDPLFLIAVVNTAVYTVGTVVPTVIVSLVVAAFLNRKIKGI SLFRTVVFLPLAISSVVMAVVWQFVFNTDNGLLNIMLGWLGIGPIPWLIEPRWAMVSL CLVSVWRSVPFATVVLLAAMQGVPETVYEAARIDGAGEIRQFVSITVPLIRGALSFVV VISIIHAFQAFDLVYVLTGANGGPETATYVLGIMLFQHAFSFLEFGYASALAWVMFAI LLVLTVLQLRITHRRSWEASRGLG" CDS complement(2274851..2276170) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2067C" /product="Probable sugar-binding lipoprotein" /note="Mb2067c, -, len: 439 aa. Equivalent to Rv2041c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 439 aa overlap). Probable sugar-binding lipoprotein component of sugar transport system, equivalent to Z98604|MLCB2052_1|MLCB2052.27 from Mycobacterium leprae (445 aa), FASTA scores: opt: 2324, E(): 0, (77.4% identity in 446 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb2067c detected using SWATH mass spectrometry. Mb2067c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006059" /db_xref="InterPro:IPR006311" /db_xref="UniProtKB/TrEMBL:A0A1R3Y269" /protein_id="SIU00674.1" /translation="MVNKPFERRSLLRGAGALTAASLAPWAAGCAADDDDALTFFFAA NPDELRPRMRVVNEFQRRYPDIKVRALLSGPGVMQQLATFCAGGKCPDVLMAWELTYA ELADRGVLLDLNTLLARDQVFAAELKSDSIGALYETFTFNGGQYAFPEQWSGNFLFYN KQLFDDAGVPPPPGSWERPWSFAEFLDAAQALTKQGRSGRDRQWGFVNAWVSFYAAGL FAMNNGVPWSVPRMNPTHLNFDHDGFLEAVQFYADLTNKHKVAPSAAEQQSMSTADLF SVGKAGIALAGHWRYQTFDRADGLDFDVAPLPIGPRGRAACSDIGVTGLAIAATSRRK DQAWEFVKFATGPVGQALIGESRLFVPVLRSAINSHGFANAHRRVGNLAVLSEGPAYS EGLPVTPAWEKIAALMDRYFGPVLRGSRPATSLTGLSQAVDEVLRNP" CDS complement(2276208..2277005) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2068C" /product="Nuclear transport factor 2" /note="Mb2068c, -, len: 265 aa. Equivalent to Rv2042c, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). Conserved hypothetical protein,similar in N-terminal part to hypothetical proteins MLCB2052.24 (95 aa) and Rv0760c|MTCY369.05 (139 aa). FASTA scores: Z98604|MLCB2052_9 Mycobacterium leprae cosmid B2052 (95 aa) opt: 269, E(): 2.9e-12, (55.4% identity in 92 aa overlap) and Z80226|MTCY369_5 Mycobacterium tuberculosis cosmid (139 aa) opt: 150, E(): 0.001, (28.7% identity in 136 aa overlap). Protein product from Mb2068c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2068c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002075" /db_xref="InterPro:IPR032710" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U1" /protein_id="SIU00675.1" /translation="MAPPNRDELLAAVERSPQAAAAHDRAGWVGLFTGDARVEDPVGS QPQVGHEAIGRFYDTFIGPRDITFHRDLDIVSGTVVLRDLELEVAMDSAVTVFIPAFL RYDLRPVTGEWQIAALRAYWELPAMMLQFLRTGSGATRPALQLSRALLGNQGLGGTAG FLTGFRRAGRRHKKLVETFLNAASRADKSAAYHALSRTATMTLGEDELLDIVELFEQL RGASWTKVTGAGSTVAVSLASDHRRGIMFADVPWRGNRINRIRYFPA" CDS complement(2277005..2277565) /codon_start=1 /transl_table=11 /gene="pncA" /locus_tag="BQ2027_MB2069C" /product="pyrazinamidase/nicotinamidase pnca (pzase)" /note="Mb2069c, pncA, len: 186 aa. Equivalent to Rv2043c, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 186 aa overlap). pncA, pyrazinamidase/nicotinamidase (EC 3.5.1.-) (see citations below). Identical to PYRAZINAMIDASE/NICOTINAMIDASE involved in susceptibility or resistance to antituberculous drug pyrazinamide. FASTA scores: sptr|Q50575|Q50575 PYRAZINAMIDASE/NICOTINAMIDASE. (186 aa) opt: 1236, E(): 0; 100.0% identity in 186 aa overlap. Protein product from Mb2069c detected using SWATH mass spectrometry. Mb2069c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y026" /db_xref="InterPro:IPR000868" /db_xref="InterPro:IPR036380" /db_xref="UniProtKB/TrEMBL:A0A1R3Y026" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00676.1" /translation="MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHV VATKDFHIDPGDDFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYTGA YSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNGLATRVLVDLT AGVSADTTVAALEEMRTASVELVCSS" CDS complement(2277606..2277923) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2070C" /product="putative membrane protein" /note="Mb2070c, -, len: 105 aa. Equivalent to Rv2044c, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical protein PA3386 (121 aa) from Pseudomonas aeruginosa |E83221 conserved hypothetical protein PA3386 [imported] -Pseudomonas aeruginosa (strain PAO1) 9949522|gb|AAG06774.1|AE004760_2 (AE004760). (46% identity in 92 aa overlap). Mb2070c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0F0" /db_xref="InterPro:IPR021218" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F0" /protein_id="SIU00677.1" /translation="MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCP LTWVERWARTKAAMTPLSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWL PRR" CDS complement(2278009..2279544) /codon_start=1 /transl_table=11 /gene="lipT" /locus_tag="BQ2027_MB2071C" /product="carboxylesterase lipt" /note="Mb2071c, lipT, len: 511 aa. Equivalent to Rv2045c, len: 511 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 511 aa overlap). Probable lipT, carboxylesterase similar to many e.g. O08472 (489 aa) and P37967|PNBA_ BACSU (489 aa). PARA-NITROBENZYL ESTERASE (EC 3.1.1.-). Contains PS00941 Carboxylesterases type-B signature 2. Contains PS00122 Carboxylesterases type-B serine active site. FASTA scores: sptr|O08472|O08472 INTRACELLULAR ESTERASE B (489 aa) opt: 849, E(): 0, (36.2% identity in 489 aa overlap) and sp|P37967|PNBA_BACSU PARA-NITROBENZYL ESTERASE (489 aa) opt: 838, E(): 0, (36.0% identity in 489 aa overlap). TBparse score is 0 .918 Protein product from Mb2071c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2071c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y007" /db_xref="InterPro:IPR002018" /db_xref="InterPro:IPR002168" /db_xref="InterPro:IPR019819" /db_xref="InterPro:IPR019826" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y007" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00678.1" /translation="MALESATVGSMHERTVRARTATGIVEGFTRDGVHRWRSIPYARA PVGSLRFRAPQPAQPWPGVRHCHTFANCAPQQRRYTVMGIGRYQTRSEDCLTLNVVTP EEPATQPLPVMVFIHGGGYILGSSATPIYDGAALARRGCVYVSVNYRLGALGCLDLSS LSTPQITLDSNVYLRDLVLALRWVHDNIAEFGGDPGNVTIFGESAGAHITATLLAVPA AKGLFARAISESPAAGMVRSREVAAEFAARFANLIGARTQDAANALMQASPAQLVEAQ HHLIRQGMRKRLGAFPIGPVFGDDYLPMDPVEAMRSGRVHAVPLIVGTNAEEGRLFTR FLGMLPTNEPMVEELLSGMKPADRERITAAYPNYPAPSACIQLGGDFAFSSAAWQIAE AHGANAPTYLYRYDYAPRTLRWSGFGATHATELFAVFDIYRTRFGALLTAAADRRAAL RVSNEVQRRWRCFSQIGVPGDDWPAYTQDDRAVLVFDRRCRIEFDPHQHRRIAWDGFS LAN" CDS 2279593..2280249 /codon_start=1 /transl_table=11 /gene="lppI" /locus_tag="BQ2027_MB2072" /product="Probable lipoprotein lppI" /note="Mb2072, lppI, len: 218 aa. Equivalent to Rv2046, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 218 aa overlap). Probable lppI, lipoprotein contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb2072 detected using SWATH mass spectrometry. Mb2072 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y052" /protein_id="SIU00679.1" /translation="MRIAALVAVSLLIAGCPREVGGDVGQSQTIAPPAPAPSAAPSTP PAAGAPITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDARH TSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGPFVYGNGPELA NGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPFGCLKPAPPPDGVGVAFGC " CDS complement(2280286..2282850) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2073C" /product="Phosphoenolpyruvate synthase (EC" /EC_number="2.7.9.2" /note="Mb2073c, -, len: 854 aa. Equivalent to Rv2047c, len: 854 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 854 aa overlap). Conserved hypothetical protein, similar to hypothetical protein from Mycobacterium tuberculosis Rv1868|MTCY359.05c (699 aa) and three possible pseudogene fragments from Mycobacterium leprae MLCB2052.16 (251 aa), MLCB2052.17 (120 aa), MLCB2052.18 (257 aa). FASTA scores: gp|Z98604|MLCB2052_7 (257 aa) opt: 1248, E(): 0, (78.6% identity in 248 aa overlap); and Z98604|MLCB2052_5 (251 aa) opt: 674, E(): 0, (50.0% identity in 250 aa overlap); and Z98604|MLCB2052_6 (120 aa) opt: 608 E() : 3.6e-30, (84.0% identity in 106 aa overlap); and Rv1868 Z83859|MTCY359_5 (699 aa) opt: 521 E(): 3e-24; (33.0% identity in 730 aa overlap). Protein product from Mb2073c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2073c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65685" /db_xref="InterPro:IPR001509" /db_xref="InterPro:IPR008279" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036637" /db_xref="UniProtKB/Swiss-Prot:P65685" /protein_id="SIU00680.1" /translation="MRIAVTGASGVLGRGLTARLLSQGHEVVGIARHRPDSWPSSADF IAADIRDATAVESAMTGADVVAHCAWVRGRNDHINIDGTANVLKAMAETGTGRIVFTS SGHQPRVEQMLADCGLEWVAVRCALIFGRNVDNWVQRLFALPVLPAGYADRVVQVVHS DDAQRLLVRALLDTVIDSGPVNLAAPGELTFRRIAAALGRPMVPIGSPVLRRVTSFAE LELLHSAPLMDVTLLRDRWGFQPAWNAEECLEDFTLAVRGRIGLGKRTFSLPWRLANI QDLPAVDSPADDGVAPRLAGPEGANGEFDTPIDPRFPTYLATNLSEALPGPFSPSSAS VTVRGLRAGGVGIAERLRPSGVIQREIAMRTVAVFAHRLYGAITSAHFMAATVPFAKP ATIVSNSGFFGPSMASLPIFGAQRPPSESSRARRWLRTLRNIGVFGVNLVGLSAGSPR DTDAYVADVDRLERLAFDNLATHDDRRLLSLILLARDHVVHGWVLASGSFMLCAAFNV LLRGLCGRDTAPAAGPELVSARSVEAVQRLVAAARRDPVVIRLLAEPGERLDKLAVEA PEFHSAVLAELTLIGHRGPAEVEMAATSYADNPELLVRMVAKTLRAVPAPQPPTPVIP LRAKPVALLAARQLRDREVRRDRMVRAIWVLRALLREYGRRLTEAGVFDTPDDVFYLL VDEIDALPADVSGLVARRRAEQRRLAGIVPPTVFSGSWEPSPSSAAALAAGDTLRGVG VCGGRVRGRVRIVRPETIDDLQPGEILVAEVTDVGYTAAFCYAAAVVTELGGPMSHAA VVAREFGFPCVVDAQGATRFLPPGALVEVDGATGEIHVVELASEDGPALPGSDLSR" CDS complement(2282855..2295310) /codon_start=1 /transl_table=11 /gene="pks12" /locus_tag="BQ2027_MB2074C" /product="polyketide synthase pks12" /note="Mb2074c, pks12, len: 4151 aa. Equivalent to Rv2048c, len: 4151 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 4151 aa overlap). Probable pks12, polyketide synthase similar to many polyketide synthases e.g. the second and third modules of polyketide synthase from S. erythraea (3567 aa), many other Streptomyces enzymes and putative Mycobacterium tuberculosis polyketide synthases, e.g. Z85982|MTCY06H11.26 (2126 aa), FASTA scores: opt: 6668, E(): 0 (61.2% identity in 2058 aa overlap); and Q03132|ERY2_SACER ERYTHRONOLIDE SYNTHASE, MODULES 3 from S. erythraea (3567 aa), FASTA scores: opt: 5309, E(): 0, (40.5% identity in 4141 aa overlap). Contains 2x PS00012 Phosphopantetheine attachment site, 2x PS00606 Beta-ketoacyl synthases active site, and PS00343 Gram-positive cocci surface proteins 'anchor ing' hexapeptide. Protein product from Mb2074c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2074c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y009" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:A0A1R3Y009" /protein_id="SIU00681.1" /translation="MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSC RFPGGVDSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFVDG VADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSATGVFAGLIVG GYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLR SGECDLALAGGVTVNATPTVFVEFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQ RLSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGLSAAEVDV VEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKM VLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDDGLDVADVG WSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQG SQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDWSLVDVLRGAPGAPGLDRVDVVQPV LFAVMVSLAELWKSVAVHPDAVIGHSQGEIAAAYVAGALSLRDAARVVTLRSKLLAGL AGPGGMVSIACGADQARDLLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKE LRTRRIEVDYASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADY WYRNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSEAIVVPT LGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAFDKRRFWLSAEGSGA DVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPNVQPWLADHAVSDVVLFPGTGFV ELAIRAGDEVGCSVLDELTLAAPLLLPATGSVAVQVVVDAGRDSNSRGVSIFSRADAQ AGWLLHAEGILRPGSVEPGADLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTA MWARGEEIFAEVRLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSL HATGASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSGSGPD RLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQALAAVQSWLTDHES GVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHPGRIVLVDSDAATDDAAIAMA LATGEPQVVLRGGQVYTARVRGSRAADAILVPPGDGPWRLGLGSAGTFENLRLEPVPN ADAPLGPGQVRVAMRAIAANFRDIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVG DSVFGFFPDGSGTLVAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQ RVLIHAGTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFED KFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVR YRAFDLFEAGPDRIAQILAELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGK VVMLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVA ELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRV DVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHR RAHGLPAISLGWGLWDQASAMTSGLDAADLARLGREGVLALSTAEALELFDTAMIVDE PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALAHRLHGLPE AEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLTAVEMRNRLKSATGLSL SPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPAVRTTSEDPIAIVGMACRYPGGVNS PDDMWDMLIQGRDVLSEFPADRGWDLAGLYNPDPDAAGACYTRTGGFVDGVGDFDPAF FGVGPSEALAMDPQQRMLLELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAE PVEGFRLTGQLSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLA LAGGVTVNATPDIFVEFSRWRGLSPDGRCKAFAAAADGTGFSEGGGMLVLQRLSDARR LGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTG TTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHE LLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTNAHVIIEA VPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRS VFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMG MGLHAGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAVEVAL FRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRLMQALPAGGAMVA VQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAAVADQLRADGRRVHQLAVSHA FHSPLMDPMIDEFAAVAAGIAIGRPTIGVISNVTGQLAGDDFGSAAYWRRHIRQAVRF ADSVRFAQAAGGSRFLEVGPSGGLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVA QGFVTGMDLDWRAVVGEAQFVELPTYAFQRRRFWLSGDGVAADAASLGLAASEHALLG AVIDLPASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGVVD ELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLHAEGALRAGSA EPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGLTAMWRRGDEVFAEVALPA DAGVSVTGFGVHPVLLDAALHAVVLSAESAERGQGSVLVPFSWQGVSLHAAGASAVRA RIAPVGPSAVSIELADGLGLPVLSVASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQ PSAAVEPLPVCAWGTTEDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAG VLVVMTRGAVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENLRLELIPDA DAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEACGVVIETSLNKGSFAVG DRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAAATTSVVFATAHYALVDLADVQPGQ RVLIHAGTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFED KFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVR YRAFDLFEAGPDRIAQILAELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGK VVMLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVA ELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRV DVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHR RAHGLPAISLGWGLWDQASAMTSGLATVDFKRFARDGIVAMSSADALQLFDTAMIVDE PFMLPAHIDFAALKVKFDGGTLPPMFVDLINAPTRRQVDDSLAAAKSKSALAHRLHGL PEDEQHAVLLDLVRSHIATVLGSASPEAIDPDRAFQDLGFDSLTAVEMRNRLKSATGL SLSPTLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRLRQAG VLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDDE" CDS complement(2295617..2295841) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2075C" /product="conserved hypothetical protein" /note="Mb2075c, -, len: 74 aa. Equivalent to Rv2049c, len: 74 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 aa overlap). Hypothetical protein. Mb2075c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y049" /protein_id="SIU00682.1" /translation="MLTRGEVRALPADAVVLSADDAADLSDRVYQVRCAAEDVVTALD EGAAATELRDLCDELIRAARAADGWRRAGA" CDS 2296145..2296480 /codon_start=1 /transl_table=11 /gene="rbpA" /locus_tag="BQ2027_MB2076" /product="RNAP Holo/RbpA/Fidaxomicin/upstream fork DNA" /note="Mb2076, -, len: 111 aa. Equivalent to Rv2050, len: 111 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 111 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium leprae, MLCB2052.03c (113 aa), and Streptomyces coelicolor A3(2), SC6D7.18c (124 aa). FASTA scores: Z98604|MLCB2052_3 Mycobacterium leprae cosmid B2052 (113 aa) opt: 737, E(): 0, (97.3% identity in 111 aa overlap) and (55% identity in 85 aa overlap) with emb|CAB61670.1|AL133213 hypothetical protein SC6D7.18c. TBparse score is 0.884 Protein product from Mb2076 detected using shotgun mass spectrometry. Mb2076 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y022" /db_xref="InterPro:IPR025182" /db_xref="InterPro:IPR038638" /db_xref="UniProtKB/TrEMBL:A0A1R3Y022" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00683.1" /translation="MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPF ADDAEIPGTWLCRNGMEGTLIEGDLPEPKKVKPPRTHWDMLLERRSIEELEELLKERL ELIRSRRRG" CDS complement(2296455..2299079) /codon_start=1 /transl_table=11 /gene="ppm1" /locus_tag="BQ2027_MB2077C" /product="Polyprenol-monophosphomannose synthase Ppm1" /note="Mb2077c, ppm1, len: 874 aa. Equivalent to Rv2051c, len: 874 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 874 aa overlap). ppm1, Polyprenol-monophosphomannose synthase. Transfers mannose from GDP-Mannose to all endogenous polyprenol-phosphates in Mycobacterium tuberculosis, proven experimentally (A. Baulard, Institut Pasteur de Lille: see citation below). Very similar to polyprenol-phosphate-mannose synthases from Mycobacterium smegmatis (594 aa). Two-domain protein similar to products of two adjacent ORFs in Mycobacterium leprae MLCB2052.01 (644 aa), probable membrane protein and MLCB2052.02 (277 aa). First domain (aa 1 - 590) corresponds to membrane protein with similarity to P23930|LNT_ECOLI apolipoprotein n-acyltransferase (512 aa) while second domain (aa 591 - 874) is similar to Schizosaccharomyces pombe dolichol monophosphate mannose synthase (236 aa) and to Mycobacterium tuberculosis Rv0539. FASTA scores: Z 98604|MLCB2052_1 (644 aa) opt: 2725 E(): 0; 67.7% identity in 601 aa overlap; and Z98604|MLCB2052_2 (277 aa) opt: 1449 E(): 0; 78.9% identity in 275 aa overlap; and gp|AF0078|AF007873_1 Schizosaccharomyces pombe dolichocholmonophosphate mannose synthase (236 aa) opt: 456 E(): 7.8e-19; 34.5% identity in 223 aa overlap and sp|P23930|LNT_ECOLI APOLIPOPROTEIN N-ACYLTRANSFERASE (512 aa) opt: 330 E(): 1.9e-11; 26.9% identity in 539 aa overlap; and polyprenol-phosphate-mannose synthases from Mycobacterium smegmatis (594 aa). CAC15462.1|AJ294477 putative polyprenol-phosphate-mannose synthase 2 (Ppm2): (55% identity in 533 aa overlap). Protein product from Mb2077c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2077c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y276" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR003010" /db_xref="InterPro:IPR004563" /db_xref="InterPro:IPR029044" /db_xref="InterPro:IPR036526" /db_xref="InterPro:IPR039528" /db_xref="UniProtKB/TrEMBL:A0A1R3Y276" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00684.1" /translation="MKLGAWVAAQLPTTRTAVRTRLTRLVVSIVAGLLLYASFPPRNC WWAAVVALALLAWVLTHRATTPVGGLGYGLLFGLVFYVSLLPWIGELVGPGPWLALAT TCALFPGIFGLFAVVVRLLPGWPIWFAVGWAAQEWLKSILPFGGFPWGSVAFGQAEGP LLPLVQLGGVALLSTGVALVGCGLTAIALEIEKWWRTGGQGDAPPAVVLPAACICLVL FAAIVVWPQVRHAGSGSGGEPTVTVAVVQGNVPRLGLDFNAQRRAVLDNHVEETLRLA ADVHAGLAQQPQFVIWPENSSDIDPFVNPDAGQRISAAAEAIGAPILIGTLMDVPGRP RENPEWTNTAIVWNPGTGPADRHDKAIVQPFGEYLPMPWLFRHLSGYADRAGHFVPGN GTGVVRIAGVPVGVATCWEVIFDRAPRKSILGGAQLLTVPSNNATFNKTMSEQQLAFA KVRAVEHDRYVVVAGTTGISAVIAPDGGELIRTDFFQPAYLDSQVRLKTRLTPATRWG PILQWILVGAAAAVVLVAMRQNGWFPRPRRSEPKGENDDSDAPPGRSEASGPPALSES DDELIQPEQGGRHSSGFGRHRATSRSYMTTGQPAPPAPGNRPSQRVLVIIPTFNEREN LPVIHRRLTQACPAVHVLVVDDSSPDGTGQLADELAQADPGRTHVMHRTAKNGLGAAY LAGFAWGLSREYSVLVEMDADGSHAPEQLQRLLDAVDAGADLAIGSRYVAGGTVRNWP WRRLVLSKTANTYSRLALGIGIHDITAGYRAYRREALEAIDLDGVDSKGYCFQIDLTW RTVSNGFVVTEVPITFTERELGVSKMSGSNIREALVKVARWGIEGRLSRSDHARARPD IARPGAGGSRVSRADVTE" CDS complement(2299237..2300841) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2078C" /product="Exoenzymes regulatory protein AepA in lipid-linked oligosaccharide synthesis cluster" /note="Mb2078c, -, len: 534 aa. Equivalent to Rv2052c, len: 534 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 534 aa overlap). Conserved hypothetical protein, very similar to hypothetical protein SC6D7.15 (536 aa) from Streptomyces coelicolor A3(2). Smith-Waterman scores >emb|CAB61667.1| (AL133213) hypothetical protein SC6D7.15 [Streptomyces coelicolor A3(2)] Expect = e-113 Identities = 247/533 (46%) Protein product from Mb2078c detected using SWATH mass spectrometry. Mb2078c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0U8" /db_xref="InterPro:IPR011059" /db_xref="InterPro:IPR013108" /db_xref="InterPro:IPR032466" /db_xref="InterPro:IPR033932" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U8" /protein_id="SIU00685.1" /translation="MSQIPVKLLVNGRVYSPTHPEATAMAVRGDVVAWLGSDDVGRDQ FPDADVQDLDGRFVAPGFVDSHIHLTATGLMLSGLDLRPATSRAQCLRMVADYAADHP GQPLWGHGWDESAWPENAAPSTADLDAVLGDCPAYLARIDSHSALVSSGLRRLVPELA AATGYTAQRPLTGDAHHLARAAARYLLTDVQLADARAVALQAIAAAGVVAVHECAGPE IGGLDDWLRLRALEHGVEVIGYWGEAVATPAQARDLVTETGARGLAGDLFVDGALGSR TAWLHEPYADAPDCIGTCHLDVDGIEAHVRACTKAEVTAGFHVIGDAAVSAAVAAFER VVADLGVVAVARCGHRLEHVEMVTADQAAKLGAWGVIASVQPNFDELWGGGDGMYARR LGAQRGSELNPLALLASQGVPLALGSDAPVTGFDPWASVRAAVNHRTPGSGVSARAAF AAATRGGWRAGGVRDGRIGTLVPGAPASYAIWDAGDFDVDAPRDAVQRWSTDPRSRVP ALPRLGPTDALPRCRQTVHRGAVIYG" CDS complement(2300846..2301373) /codon_start=1 /transl_table=11 /gene="fxsa" /locus_tag="BQ2027_MB2079C" /product="probable transmembrane protein fxsa" /note="Mb2079c, -, len: 175 aa. Equivalent to Rv2053c, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 175 aa overlap). Probable transmembrane protein,Mb2079c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y030" /db_xref="InterPro:IPR007313" /db_xref="UniProtKB/TrEMBL:A0A1R3Y030" /protein_id="SIU00686.1" /translation="MSRLLLSYAVVELAVVFALAATIGFGWTLLVLLATFVLGFGLLA PLGGWQLGRRLLWLRSGLAEPRSALSDGALVTVASVLVLVPGLVTTTMGLLLLVPPIR ALARPGLTAIAVRGFLRNVPLTADAAANMAGAFGESGTDPDFIDGEVIDVIDVEPLTL QPPRVAAEPPSPGSN" CDS 2301449..2302162 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2080" /product="Dienelactone hydrolase family protein" /note="Mb2080, -, len: 237 aa. Equivalent to Rv2054, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 237 aa overlap). Conserved hypothetical protein, some similarity to various carboxymethylenebutenolidases e.g. sp|O67988|CLCD_RHOOP CARBOXYMETHYLENEBUTENOLIDASE (DIENELACTONE HYDROLASE) (DLH) >gi|2935034|gb|AAC38252.1| (AF003948) dienelactone hydrolase [Rhodococcus opacus] Smith-Waterman scores: Length = 252, Expect = 4e-08 Identities = 62/217 (28%). Also similar to Rv2765. Protein product from Mb2080 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2080 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0F5" /db_xref="InterPro:IPR002925" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F5" /protein_id="SIU00687.1" /translation="MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKL ISERIARAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMPEC SGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACPIVASFGTRDP LGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQPLVRIAGFGYNEAATEDAW RRVFEFFGQHLRAGSPGEP" CDS complement(2302411..2302677) /codon_start=1 /transl_table=11 /gene="rpsR2" /locus_tag="BQ2027_MB2081C" /product="30s ribosomal protein s18 rpsr2" /note="Mb2081c, rpsR2, len: 88 aa. Equivalent to Rv2055c, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Probable rpsR2, ribosomal protein S18, similar to others e.g. RR18_ODOSI|P49505 chloroplast 30S ribosomal protein S18 (72 aa), FASTA scores: opt: 209, E(): 4.7e-09, (51.6% identity in 64 aa overlap); etc. Also similar to rpsR|Rv0055|MTCY21D4.18 from Mycobacterium tuberculosis (50.0% identity in 84 aa overlap). Protein product from Mb2081c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2081c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66466" /db_xref="InterPro:IPR001648" /db_xref="InterPro:IPR018275" /db_xref="InterPro:IPR036870" /db_xref="UniProtKB/Swiss-Prot:P66466" /protein_id="SIU00688.1" /translation="MAAKSARKGPTKAKKNLLDSLGVESVDYKDTATLRVFISDRGKI RSRGVTGLTVQQQRQVAQAIKNAREMALLPYPGQDRQRRAALCP" CDS complement(2302678..2302983) /codon_start=1 /transl_table=11 /gene="rpsN2" /locus_tag="BQ2027_MB2082C" /product="30s ribosomal protein s14 rpsn2" /note="Mb2082c, rpsN2, len: 101 aa. Equivalent to Rv2056c, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Probable rpsN2, ribosomal protein S14, similar to others e.g. RS14_ECOLI|P02370 30S ribosomal protein S14 from Escherichia coli (100 aa), FASTA scores: opt: 290; E(): 1.7e- 13; (46.0% identity in 100 aa overlap); etc. Also similar to rpsN|Rv0717|MTCY210.36 from Mycobacterium tuberculosis, (50.0% identity in 62 aa overlap). Mb2082c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66406" /db_xref="InterPro:IPR001209" /db_xref="InterPro:IPR023036" /db_xref="UniProtKB/Swiss-Prot:P66406" /protein_id="SIU00689.1" /translation="MAKKSKIVKNQRRAATVARYASRRTALKDIIRSPSSAPEQRSTA QRALARQPRDASPVRLRNRDAIDGRPRGHLRKFGLSRVRVRQLAHDGHLPGVRKASW" CDS complement(2302985..2303149) /codon_start=1 /transl_table=11 /gene="rpmG1" /locus_tag="BQ2027_MB2083C" /product="50s ribosomal protein l33 rpmg1" /note="Mb2083c, rpmG1, len: 54 aa. Equivalent to Rv2057c, len: 54 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 54 aa overlap). Probable rpmG1, ribosomal protein L33. FASTA results: RL33_ECOLI P02436 50S ribosomal protein L33 (54 aa) opt: 183; E(): 1.6e-09; 51.0% identity in 49 aa overlap. Note that previously known as rpmG. Protein product from Mb2083c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2083c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5W1" /db_xref="InterPro:IPR001705" /db_xref="InterPro:IPR011332" /db_xref="InterPro:IPR018264" /db_xref="InterPro:IPR038584" /db_xref="UniProtKB/Swiss-Prot:P0A5W1" /protein_id="SIU00690.1" /translation="MARTDIRPIVKLRSTAGTGYTYTTRKNRRNDPDRLILRKYDPIL RRHVDFREER" CDS complement(2303149..2303385) /codon_start=1 /transl_table=11 /gene="rpmB2" /locus_tag="BQ2027_MB2084C" /product="50s ribosomal protein l28 rpmb2" /note="Mb2084c, rpmB2, len: 78 aa. Equivalent to Rv2058c, len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 78 aa overlap). Probable rpmB2, ribosomal protein L28, very similar to rL28 of M. tuberculosis. FASTA results: RL28_MYCTU Q10879 50S ribosomal protein L28. mycobacter (94 aa) opt: 338; E(): 9.8e-19; 64.9% identity in 77 aa overlap. Also similar to rpmB (Rv0105c) of Mycobacterium tuberculosis. Protein product from Mb2084c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2084c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66149" /db_xref="InterPro:IPR001383" /db_xref="InterPro:IPR026569" /db_xref="InterPro:IPR034704" /db_xref="InterPro:IPR037147" /db_xref="UniProtKB/Swiss-Prot:P66149" /protein_id="SIU00691.1" /translation="MSAHCQVTGRKPGFGNTVSHSHRRSRRRWSPNIQQRTYYLPSEG RRIRLRVSTKGIKVIDRDGIEAVVARLRRQGQRI" CDS 2303498..2305033 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2085" /product="Zinc ABC transporter, substrate-binding protein ZnuA" /note="Mb2085, -, len: 511 aa. Equivalent to Rv2059, len: 511 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 511 aa overlap). Conserved hypothetical protein. Some similarity to EWLA protein gp|U52850|ERU52850_1 Erysipelothrix rhusiopathiae 36 k (304 aa), FASTA score, opt: 287 E(): 6.9e-09; 27.2% identity in 228 aa overlap. There appears to be a frameshift in this ORF around position 3315980 that causes an overlap with next ORF. C-terminal end of protein may be wrong. No error can be found to account for this. Mb2085 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y059" /db_xref="InterPro:IPR006127" /db_xref="UniProtKB/TrEMBL:A0A1R3Y059" /protein_id="SIU00692.1" /translation="MATPVILVTGHEGTAAVTADLLGLLTDHGTATLRSVAPGSVRRA DPRPRCHRREQRRRHRASMKSAIHPDHHPRRLPRCPVLRRDQVVLEMIVITMVGRPSG PGERKWDVWGSVARAVTGGHVPVKSILTGAHADPHSYQASPADAAAIVDAELVIYNGG GYDPWVDQVLAGHPGVQAVDAYSLLGAVGDDDAPNEHVFYDPNVAKAVAATIADRLAD LDPSNSGNYRANAAEFSRGADAIAISEHAIATTYPDAAVIATEPVVHYLLAAAGLKNR TPATFIAANENGNDPTPADMAAVLDMIAGREVAALLVNPQTPTAATDELQVAARRAGV PITELTETLPSGTDRDQFCAADRPDRRGRSLRADHADRGLSARGHRVGDLLPTALVCH RRSGGRGRPRRASARPGNCVRRTDGRGSRPGCPDRRGTPRDVFADHPRRGGRPGRGCP GRRDRDLGGLRRGFRRRRHPAVAGAWSPGVGVRGHHLVCDLPDLLVAPAAPLTSRSRF RPL" CDS 2304603..2305004 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2086" /product="Possible conserved integral membrane protein" /note="Mb2086, -, len: 133 aa. Equivalent to Rv2060, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv,(100.000% identity in 133 aa overlap). Possible conserved integral membrane protein smaller than but similar to several hypothetical bacterial proteins e.g. >emb|CAC29843.1| (AL583918) putative ABC-transporter transmembrane protein [Mycobacterium leprae] Length = 286 and P44691|YEBI_HAEIN (261 aa). FASTA scores: P44691|YEBI_HAEIN HYPOTHETICAL PROTEIN HI0407 (261 aa) opt: 218, E(): 4.2e-08; 31.1% identity in 122 aa overlap. Maybe frameshift upstream at position 3315980 but no error can be found to account for this." /db_xref="GOA:A0A383WQC6" /db_xref="InterPro:IPR001626" /db_xref="InterPro:IPR037294" /db_xref="UniProtKB/TrEMBL:A0A383WQC6" /protein_id="SZX79525.1" /translation="MLTVVCLLVVTVLAICYRPLLFATVDPEVAAARGVPVRALGIVF AALMGVVAAQAVQIVGALLVMSLLITPAAAAARVVVAPVAAIATSVVFAEVSAVGGIL LSLAPGVPVSVFVATISFVIYLICWLLRRRR" CDS complement(2305005..2305409) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2087C" /product="F420-dependent enzyme Rv2061c" /note="Mb2087c, -, len: 134 aa. Equivalent to Rv2061c, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Conserved hypothetical protein. Similar to conserved hypothetical proteins from Mycobacterium leprae (128 aa) and Streptomyces coelicolor (153 aa). Smith-Waterman scores: >emb|CAC30396.1| (AL583922) [Mycobacterium leprae], Expect = 7e-47, Identities = 92/131 (70%); >emb|CAC14932.1| (AL449216) [Streptomyces coelicolor], Expect = 6e-19 Identities = 48/124 (38%). Protein product from Mb2087c detected using shotgun mass spectrometry. Mb2087c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y034" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019965" /db_xref="UniProtKB/TrEMBL:A0A1R3Y034" /protein_id="SIU00693.1" /translation="MTPTFSDLAEAQYLLLTTFTKDGRPKPVPIWAALDTDRGDRLLV ITEKKSWKVKRIRNTPRVTLATCTLRGRPTSEAVEATAAILDESQTGAVYDAIVKRYG IQGKLFTFVSKLRGGMRNNIGLELKVAESETG" CDS complement(2305493..2309077) /codon_start=1 /transl_table=11 /gene="cobN" /locus_tag="BQ2027_MB2088C" /product="cobalamin biosynthesis protein cobn" /note="Mb2088c, cobN, len: 1194 aa. Equivalent to Rv2062c, len: 1194 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1194 aa overlap). Probable cobN, cobalamin biosynthesis protein - very similar to COBN_PSEDE P29929 cobn protein. Pseudomonas denitrifica (1275 aa), FASTA scores, opt: 831, E(): 0, (37.5% identity in 983 aa overlap). Also similar to several Mg2+-chelatases e.g. H64479 magnesium chelatase subunit homolog (1226 aa)opt: 962 E(): 0; (27.3% identity in 846 aa overlap) Protein product from Mb2088c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2088c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y298" /db_xref="InterPro:IPR003672" /db_xref="InterPro:IPR011953" /db_xref="UniProtKB/TrEMBL:A0A1R3Y298" /protein_id="SIU00694.1" /translation="MPEPTVLLLSTSDTDLISARSSGKNYRWANPSRLSDLELTDLLA EASIVVIRILGGYRAWQSGIDTVIAGGVPAALVSGEQAADAELTDRSTVAAGTALQAH IYLAHGGVDNLRELHAFLCDTVLMTGFGFTPPVATPTWGVLERPDAGKTGPTIAVLYY RAQHLAGNTGYVEALCRAIEDAGGRPLPLYCASLRTAEPRLLERLGGADAMVVTVLAA GGVKPAAASAGGDDDSWNVEHLAALDIPILQGLCLTSPRDQWCANDDGLSPLDVASQV AVPEFDGRIITVPFSFKEIDDDGLISYVADPERCARVAGLAVRHARLRQVAPADKRVA LVFSAYPTKHARIGNAVGLDTPASAVALLQAMRQRGYRVGDLPGVESNDGDALIHALI ECGGHDPDWLTEGQLAGNPIRVSAKEYRDWFATLPAELTDVVTAYWGPPPGELFVDRS HDPDGEIVIAALRAGNLVLMVQPPRGFGENPVAIYHDPDLPPSHHYLAAYRWLDTGFS NGFGAHAVVHLGKHGNLEWLPGKTLGMSASCGPDAALGDLPLIYPFLVNDPGEGTQAK RRAHAVLVDHLIPPMARAETYGDIARLEQLLDEHASVAALDPGKLPAIRQQIWTLIRA AKMDHDLGLTERPEEDSFDDMLLHVDGWLCEIKDVQIRDGLHILGQNPTGEQELDLVL AILRARQLFGGAHAIPGLRQALGLAEDGTDERATVDQTEAKARELVAALQATGWDPSA ADRLTGNADAAAVLRFAATEVIPRLAGTATEIEQVLRALDGRFIPAGPSGSPLRGLVN VLPTGRNFYSVDPKAVPSRLAWEAGVALADSLLARYRDEHGRWPRSVGLSVWGTSAMR TAGDDIAEVLALLGVRPVWDDASRRVIDLAPMQPAELGRPRIDVTVRISGFFRDAFPH VVTMLDDAVRLVADLDEAAEDNYVRAHAQADLAHHGDQRRATTRIFGSKPGTYGAGLL QLIDSRSWRDDADLAQVYTAWGGFAYGRDLDGREAIDDMNRQYRRIAVAAKNTDTREH DIADSDDYFQYHGGMVATVRALTGQAPAAYIGDNTRPDAIRTRTLSEETTRVFRARVV NPRWMAAMRRHGYKGAFEMAATVDYLFGYDATAGVMADWMYEQLTQRYVLDAQNRTFM TESNPWALHGMAERLLEAAGRGLWAQPAPETLDGLRQVLLETEGDLEA" CDS 2309155..2309388 /codon_start=1 /transl_table=11 /gene="maze7" /locus_tag="BQ2027_MB2089" /product="antitoxin maze7" /note="Mb2089, -, len: 77 aa. Equivalent to Rv2063, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Conserved hypothetical protein, showing some similarity to other conserved hypothetical proteins e.g. AL109974_2|SCF34.02c hypothetical protein from Streptomyces coelicolor (133 aa), FASTA scores: opt: 102, E(): 1.7, (34.35% identity in 67 aa overlap); and AE005182_1 from Escherichia coli strain O157:H7 (77 aa), FASTA scores: opt: 95, E(): 3.3, (34.85% identity in 66 aa overlap). This ORF replaces previous Rv2063c on other strand. Protein product from Mb2089 detected using SWATH mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V7" /protein_id="SIU00695.1" /translation="MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAI FRAEREASHAETTTQAVRDEDREWEGTVGDGLG" CDS 2309381..2309791 /codon_start=1 /transl_table=11 /gene="mazF7" /locus_tag="BQ2027_MB2089A" /product="Possible toxin MazF7" /note="Mb2089A, len: 136 aa. Equivalent to Rv2063A len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible mazF7 toxin, part of toxin-antitoxin (TA) operon with Rv2063 (See Pandey and Gerdes, 2005). Protein product from Mb2089A detected using SWATH mass spectrometry. Mb2089A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y048" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3Y048" /protein_id="SIU00696.1" /translation="MAEPRRGDLWLVSLGAARAGEPGKHRPAVVVSVDELLTGIDDEL VVVVPVSSSRSRTPLRPPVAPSEGVAADSVAVCRGVRAVARARLVERLGALKPATMRA IENALTLILGLPTGPERGEAATHSPVRWTGGRDP" CDS 2309775..2310866 /codon_start=1 /transl_table=11 /gene="cobG" /locus_tag="BQ2027_MB2090" /product="precorrin-3b synthase cobg" /note="Mb2090, cobG, len: 363 aa. Equivalent to Rv2064, len: 363 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 363 aa overlap). Possible cobG, cobalamin biosynthesis protein. Some similarity to COBG_PSEDE P21637 cobg protein. pseudomonas (459 aa) FASTA scores, opt: 240, E(): 1.3e-08, (27.5% identity in 407 aa overlap); contains PS01156 TonB-dependent receptor proteins signature 2 Protein product from Mb2090 detected using SWATH mass spectrometry. Mb2090 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0G9" /db_xref="InterPro:IPR005117" /db_xref="InterPro:IPR012798" /db_xref="InterPro:IPR036136" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0G9" /protein_id="SIU00697.1" /translation="MAGTRDADACPGALRPHQAADGALARIRLPGGMITAAQLATLAS VASDFGSATLELTARGNVQLRGIRDVAAVADAVAKAGLLPSATHERVRNIVASPLSGR AGGLADVRAWVGELDAAIRAEPRLAELGGRFWFGLDDGRADVSGLGADVGVQVFPDGP RLLLTGRDTGVRVADVAETLIEVALRFVKIRETAWRVTELADIGELQSGVELGPSVRP VTKTPVGWIPQDDSRVTLGAAVPLGVLPARVAECLAAIEAPLVITPWRSVLICDLDDA TADAALRVLAPLGLVFDENSPWLNISACTGSPGCAHSAADVRADAARSLNVESAGHRH FVGCERACGSPPAGEVLVATGGGYRRLRP" CDS 2310876..2311502 /codon_start=1 /transl_table=11 /gene="cobH" /locus_tag="BQ2027_MB2091" /product="precorrin-8x methylmutase cobh (aka precorrin isomerase)" /note="Mb2091, cobH, len: 208 aa. Equivalent to Rv2065, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 208 aa overlap). Probable cobH, precorrin-8X methylmutase (aka precorrin isomerase) (EC 5.4.1.2), similar to COBH_PSEDE P21638 precorrin isomerase (210 aa), FASTA scores: opt: 750, E(): 0, (55.4% identity in 202 aa overlap). Protein product from Mb2091 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2091 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63840" /db_xref="InterPro:IPR003722" /db_xref="InterPro:IPR036588" /db_xref="UniProtKB/Swiss-Prot:P63840" /protein_id="SIU00698.1" /translation="MLDYLRDAAEIYRRSFAVIRAEADLARFPADVARVVVRLIHTCG QVDVAEHVAYTDDVVARAGAALAAGAPVLCDSSMVAAGITTSRLPADNQIVSLVADPR ATELAARRQTTRSAAGVELCAERLPGAVLAIGNAPTALFRLLELVDEGAPPPAAVLGG PVGFVGSAQAKEELIERPRGMSYLVVRGRRGGSAMAAAAVNAIASDRE" CDS 2311499..2313025 /codon_start=1 /transl_table=11 /gene="cobI" /locus_tag="BQ2027_MB2092" /product="Probable bifunctional protein, CobI-CobJ fusion protein: S-adenosyl-L-methionine-precorrin-2 methyl transferase + precorrin-3 methylase" /note="Mb2092, cobI, len: 508 aa. Equivalent to Rv2066, len: 508 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 508 aa overlap). Probable CobI-CobJ fusion protein, S-adenosyl-L-methionine-precorrin-2 methyl transferase and precorrin-3 methylase (EC 2.1.1.-). Similar in N-terminal half (aa 1-240) to COBI_PSEDE|P21639, S-adenosyl-L-methionine-precorrin-2 methyl transferase (244 aa), FASTA scores: opt: 759, E(): 4.4e-34, (49.2% identity in 238 aa overlap); and in C-terminal half (aa 240-508) to P21640|COBJ_PSEDE PRECORRIN-3 METHYLASE (EC 2.1.1.-) (254 aa), FASTA scores: opt: 695, E(): 0, (45.3% identity in 258 aa overlap). Protein product from Mb2092 detected using SWATH mass spectrometry. Mb2092 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66878" /db_xref="InterPro:IPR000878" /db_xref="InterPro:IPR003043" /db_xref="InterPro:IPR006363" /db_xref="InterPro:IPR006364" /db_xref="InterPro:IPR012382" /db_xref="InterPro:IPR014776" /db_xref="InterPro:IPR014777" /db_xref="InterPro:IPR035996" /db_xref="UniProtKB/Swiss-Prot:P66878" /protein_id="SIU00699.1" /translation="MSARGTLWGVGLGPGDPELVTVKAARVIGEADVVAYHSAPHGHS IARGIAEPYLRPGQLEEHLVYPVTTEATNHPGGYAGALEDFYADATERIATHLDAGRN VALLAEGDPLFYSSYMHLHTRLTRRFNAVIVPGVTSVSAASAAVATPLVAGDQVLSVL PGTLPVGELTRRLADADAAVVVKLGRSYHNVREALSASGLLGDAFYVERASTAGQRVL PAADVDETSVPYFSLAMLPGGRRRALLTGTVAVVGLGPGDSDWMTPQSRRELAAATDL IGYRGYLDRVEVRDGQRRHPSDNTDEPARARLACSLADQGRAVAVVSSGDPGVFAMAT AVLEEAEQWPGVRVRVIPAMTAAQAVASRVGAPLGHDYAVISLSDRLKPWDVIAARLT AAAAADLVLAIYNPASVTRTWQVGAMRELLLAHRDPGIPVVIGRNVSGPVSGPNEDVR VVKLADLNPAEIDMRCLLIVGSSQTRWYSVDSQDRVFTPRRYPEAGRATATKSSRHSD " CDS complement(2312971..2314194) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2093C" /product="SAM-dependent methyltransferase" /note="Mb2093c, -, len: 407 aa. Equivalent to Rv2067c, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 407 aa overlap). Conserved hypothetical protein, some similarity to YAT1_SYNP6 P08442 atp synthase subunits region ORF 1. (417 aa), FASTA scores, opt: 373, E(): 4.9e-18, (27.7% identity in 358 aa overlap) Protein product from Mb2093c detected using SWATH mass spectrometry. Mb2093c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025714" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P0A5G2" /protein_id="SIU00700.1" /translation="MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSH RILWPDREYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDKHG LANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLRRDGVVAAMLY GKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTYHPLRNYLTKARDLLSDSA LVDTFLHGRQRSYTVEECVDLVTSAGLVFQGWFHKAPYYPHDFFVPNSEFYAAVNTLP EVKAWSVMERLETLNATHLFMACRRDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTD MFWPGWRMAPSPAQLAFLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQS LWRLDFVAVALPASG" CDS complement(2314210..2315133) /codon_start=1 /transl_table=11 /gene="blaC" /locus_tag="BQ2027_MB2094C" /product="CLASS A BETA-LACTAMASE BLAC" /note="Mb2094c, blaC, len: 307 aa. Equivalent to Rv2068c, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 aa overlap). blaC, class A beta-lactamase (EC 3.5.2.6) (see citation below), similar to e.g. BLAC_NOCLA Q06316 beta-lactamase precursor (302 aa), FASTA scores, opt: 860, E(): 0, (50.2% identity in 283 aa overlap); eyc. Contains PS00013 Prokaryotic lipid attachment site near N-terminus, and PS00146 Beta-lactamase class-A active site. BELONGS TO THE CLASS-C BETA-LACTAMASE FAMILY. Protein product from Mb2094c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2094c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5I7" /db_xref="InterPro:IPR000871" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR023650" /db_xref="UniProtKB/Swiss-Prot:P0A5I7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00701.1" /translation="MRNRGFGRRELLVAMAMLVSVTGCARHASGARPASTTLPAGADL ADRFAELERRYDARLGVYVPATGTTAAIEYRADERFAFCSTFKAPLVAAVLHQNPLTH LDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANLLLADLGGPGGG TAAFTGYLRSLGDTVSRLDAEEPELNRDPPGDERDTTTPHAIALVLQQLVLGNALPPD KRALLTDWMARNTTGAKRIRAGFPADWKVIDKTGTGDYGRANDIAVVWSPTGVPYVVA VMSDRAGGGYDAEPREALLAEAATCVAGVLA" CDS 2315268..2315825 /codon_start=1 /transl_table=11 /gene="sigC" /locus_tag="BQ2027_MB2095" /product="rna polymerase sigma factor, ecf subfamily, sigc" /note="Mb2095, sigC, len: 185 aa. Equivalent to Rv2069, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 185 aa overlap). Probable sigC, RNA polymerase sigma factor (see citation below), with similarity to SIGX_BACSU|P35165 probable RNA polymerase sigma factor from Bacillus subtilis (194 aa), FASTA scores: opt: 218, E(): 4.1e-07, (32.6% identity in 129 aa overlap). Belongs to ECF subfamily. Mb2095 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66810" /db_xref="InterPro:IPR000838" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/Swiss-Prot:P66810" /protein_id="SIU00702.1" /translation="MTATASDDEAVTALALSAAKGNGRALEAFIKATQQDVWRFVAYL SDVGSADDLTQETFLRAIGAIPRFSARSSARTWLLAIARHVVADHIRHVRSRPRTTRG ARPEHLIDGDRHARGFEDLVEVTTMIADLTTDQREALLLTQLLGLSYADAAAVCGCPV GTIRSRVARARDALLADAEPDDLTG" CDS complement(2315815..2316510) /codon_start=1 /transl_table=11 /gene="cobK" /locus_tag="BQ2027_MB2096C" /product="precorrin-6x reductase cobk" /note="Mb2096c, cobK, len: 231 aa. Similar to Rv2070c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 227 aa overlap). Probable cobK, precorrin-6x reductase (EC 1.3.1.54), similar to e.g. L21196|g347169|RERCOBLMK3 RERCOBLMKN from Rhodococcus sp. NI86/21 (248 aa), FASTA scores: opt: 792, E(): 0, (53.6% identity in 250 aa overlap). Also similarity to CBIJ_SALTY|Q05591 cbij protein from Salmonella typhimurium (263 aa), FASTA scores: opt: 166, E(): 9e-0 5, (26.7% identity in 258 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-a) leads to a shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (231 aa versus 244 aa). Protein product from Mb2096c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y042" /db_xref="InterPro:IPR003723" /db_xref="UniProtKB/TrEMBL:A0A1R3Y042" /protein_id="SIU00703.1" /translation="MRWRKSCNPHVEIVSSLAGRVPNPALPIGPVRIGGFGGVEGLRG WLREERIDAVVDATHPFAVTITAHAAQVCGELGLPYLVLARPPWDPGTAIIAVSDIEA ADVVAEQGYSRVFLTTGRSGIAAFANSDAWFLIRVVTAPDGTALPRRHKLVLSRGPYG YHDEFALLREQRIDALVTKNSGGKMTRAKLDAAAALGISVVMIARPLLPAGVAAVDSV HRAAMWVAGLPSR" CDS complement(2316547..2317302) /codon_start=1 /transl_table=11 /gene="cobM" /locus_tag="BQ2027_MB2097C" /product="precorrin-3 methylase cobm (precorrin-4 c11-methyltransferase)" /note="Mb2097c, cobM, len: 251 aa. Equivalent to Rv2071c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 251 aa overlap). Probable cobM, precorrin-3 methylase (EC 2.1.1.133), similar to e.g. L21196|g347169|RERCOBLMK2 RERCOBLMK from Rhodococococcus sp. NI86/21 (249 aa), FASTA scores: opt: 992, E(): 0, (62.4% identity in 245 aa overlap) and to COBM_ PSEDE|P21922 precorrin-3 methylase (253 aa), FASTA scores: opt: 863, E(): 0, (54.6% identity in 249 aa overlap). Contains PS00839 Uroporphyrin-III C-methyltransferase signature 1, and PS00840 Uroporphyrin-III C-methyltransferase signature 2." /db_xref="GOA:A0A1R3Y2A3" /db_xref="InterPro:IPR000878" /db_xref="InterPro:IPR003043" /db_xref="InterPro:IPR006362" /db_xref="InterPro:IPR014776" /db_xref="InterPro:IPR014777" /db_xref="InterPro:IPR035996" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A3" /protein_id="SIU00704.1" /translation="MTVYFIGAGPGAADLITVRGQRLLQRCPVCLYAGSIMPDDLLAQ CPPGATIVDTGPLTLEQIVRKLADADADGRDVARLHSGDPSLYSALAEQCRELDALGI GYEIVPGVPAFAAAAAALKRELTVPGVAQTVTLTRVATLSTPMPPGEDLAALARSRAT LVLHLAAAQIDAIVPRLLDGGYRPETPVAVVAFASWPQQRTLRGTLADIAARMHDAKI TRTAVIVVGDVLTAEGFTDSYLYSVARHGRYAQ" CDS complement(2317299..2318132) /codon_start=1 /transl_table=11 /gene="cobLb" /locus_tag="BQ2027_MB2098C" /product="Probable precorrin-6y methyltransferase CobLb [SECOND PART]" /note="Mb2098c, cobLb, len: 294 aa. Equivalent to 3' end of Rv2072c, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 294 aa overlap). Probable cobL, methyl transferase (EC 2.1.1.132), similar to L21196|g347169|RERCOBLMK1 from Rhodocococcus sp. NI86/21 (447 aa), FASTA scores: opt: 892; E(): 0; (50.1% identity in 369 aa overlap), and to COBL_PSEDE|P21921 precorrin-6y methylase (413 aa), FASTA scores: opt: 830, E(): 0, (40.6% identity in 404 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 2029 bp deletion (H37Rv2.2330073-2332101-*)(RD9) leads to the loss of the NH2 part of cobL, the entire Rv2073 and Rv2074, and the COOH part of Mb2100c. In addition, while cobL exists as a single gene in Mycobacterium tuberculosis strain H37Rv, in Mycobacterium bovis a frameshift due to a single base insertion (*-t) splits cobL into 2 parts, cobLa and cobLb." /db_xref="GOA:A0A1R3Y0W7" /db_xref="InterPro:IPR014008" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR035996" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W7" /protein_id="SIU00705.1" /translation="MYDTEVISLVTAQPHTAVRRGGRAIVLSGDRSTPQALAVLLTEH GRGDSKFSVLEQLGGPAERRRDGTARAWACDPPLDVDELNVIAVRYLPDERTSWAPDE AFAHDGQITKHPIRVLTLAALAPRPGQRLWDVGAGSGAIAVQWCRSWPGCTAVAFERD ERRRRNIGFNAAAFGVSVDVRGDAPDAFDDAARPSVIFLGGGVTQPGLLEACLDSLPA GGNLVANAVTVESEAALAHAYSRLGGELRRFQHYLGEPLGGFTGWRPQLPVTQWSVTK R" CDS complement(2318125..2318313) /codon_start=1 /transl_table=11 /gene="cobLa" /locus_tag="BQ2027_MB2099C" /product="Probable precorrin-6y methyltransferase CobLa [FIRST PART]" /note="Mb2099c, cobLa, len: 62 aa. Similar to 5' end of Rv2072c, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (97.727% identity in 44 aa overlap). Probable cobL, methyl transferase (EC 2.1.1.132), similar to L21196|g347169|RERCOBLMK1 from Rhodocococcus sp. NI86/21 (447 aa), FASTA scores: opt: 892; E(): 0; (50.1% identity in 369 aa overlap), and to COBL_PSEDE|P21921 precorrin-6y methylase (413 aa), FASTA scores: opt: 830, E(): 0, (40.6% identity in 404 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large deletion of 2029 bp (H37Rv2.2330073-2332101-*) (RD9) leads to the loss of the NH2 part of cobL, the entire Rv2073 and Rv2074, and the COOH part of Mb2100c. In addition, while cobL exists as a single gene in Mycobacterium tuberculosis strain H37Rv, in Mycobacterium bovis a frameshift due to a single base insertion (*-t) splits cobL into 2 parts, cobLa and cobLb. Mb2099c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y058" /db_xref="InterPro:IPR035996" /db_xref="UniProtKB/TrEMBL:A0A1R3Y058" /protein_id="SIU00706.1" /translation="MLPAVQGLSPDGADLHVVASGDPLLHGIGSTLIRLFGHDNVTVF AARVRGDVGVRPDGLERV" CDS complement(2318388..2319176) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2100C" /product="Possible hypothetical exported or envelope protein" /note="Mb2100c, -, len: 262 aa. Equivalent to 5' end of Rv2075c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Possibly exported or envelope protein; has potential signal peptide at N-terminus and hydrophobic stretch around residue 430. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 2029 bp deletion (RD9) leads to the loss of the COOH part of Mb2100c, the entire Rv2074 and Rv2073, and the NH2 part of cobL compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb2100c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0I2" /db_xref="InterPro:IPR017946" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I2" /protein_id="SIU00707.1" /translation="MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCD VISPVAIPCVALGKFADAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTA RFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELD LHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVIL LYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRGPQ" CDS complement(2319334..2319585) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2101C" /product="conserved hypothetical protein" /note="Mb2101c, -, len: 83 aa. Equivalent to Rv2076c, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Unknown, questionable ORF,Mb2101c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64932" /db_xref="UniProtKB/Swiss-Prot:P64932" /protein_id="SIU00708.1" /translation="MVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIAN AVKGSLWWDIWSLGLLVLIPAVVYGKMRRSRRISSDQDR" CDS complement(2319620..2320591) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2102C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2102c, -, len: 323 aa. Equivalent to Rv2077c, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). Possible conserved transmembrane protein. Part of Mycobacterium tuberculosis protein family with Rv2542, Rv2079, Rv2797c, Rv0963c, Rv1949c. Hydrophobic stretches at C-terminus." /db_xref="GOA:P64934" /db_xref="UniProtKB/Swiss-Prot:P64934" /protein_id="SIU00709.1" /translation="MLATLSQIRAWSTEHLIDAAGYWTETADRWEDVFLQMRNQAHAI AWNGAGGDGLRQRTRADFSTVSGIADQLRRAATIARNGAGTIDAAQRRVMYAVEDAQD AGFNVGEDLSVTDTKTTQPAAVQAARLAQAQALAGDIRLRVGQLVAAENEVSGQLAAT TGDVGNVRFAGAPVVAHSAVQLVDFFKQDGPTPPPPGAPHPSGGADGPYSDPITSMML PPAGTEAPVSDATKRWVDNMVNELAARPPDDPIAVEARRLAFQALHRPCNSAEWTAAV AGFAGSSAGVVGTALAIPAGPADWALLGAALLGVGGSGAAVVNCATK" CDS complement(2320592..2320891) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2103C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2103c, -, len: 99 aa. Equivalent to Rv2077A, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Conserved hypothetical protein, similar to P95263|Rv1951c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (137 aa), FASTA scores: opt: 271, E(): 1.5e-11, (51.04% identity in 97 aa overlap); and some similarity with P95012|Rv2541 HYPOTHETICAL ALANINE RICH PROTEIN from Mycobacterium tuberculosis (135 aa), FASTA scores: opt: 140, E(): 0.014, (32.95% identity in 88 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y068" /protein_id="SIU00710.1" /translation="MGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATT VAVSGINAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV" CDS 2321356..2321670 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2104" /product="conserved hypothetical protein" /note="Mb2104, -, len: 104 aa. Equivalent to Rv2078, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 104 aa overlap). Unknown" /db_xref="InterPro:IPR022534" /db_xref="UniProtKB/Swiss-Prot:P59982" /protein_id="SIU00711.1" /translation="MFVDVGLLHSGANESHYAGEHAHGGADQLSRGPLLSGMFGTFPV AQTFHDAVGAAHAQQMRNLHAHRQALITVGEKARHAATGFTDMDDGNAAELKAVVCSC AT" CDS 2321652..2323622 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2105" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2105, -, len: 656 aa. Equivalent to Rv2079, len: 656 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 656 aa overlap). Conserved hypothetical protein; part of Mycobacterium tuberculosis protein family with Rv2542, Rv2077c, Rv2797c, Rv0963c, Rv1949c. Contains PS00120 Lipases, serine active site,Mb2105 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010427" /db_xref="UniProtKB/TrEMBL:A0A1R3Y081" /protein_id="SIU00712.1" /translation="MQLRHINIRALIAEAGGDPWAIEHSLHAGRPAQIAELAEAFHAA GRCTAEANAAFEEARRRFEASWNRENGEHPINDSAEVQRVTAALGVQSLQLPKIGVDL ENIAADLAEAQRAAAGRIATLESQLQRIDDQLDQALELEHDPRLAAAERSELDALITC LEQDAIDDTASALGQLQSIRAGYSDHLQQSLAMLRADGYDGAGLQGLDAPQSPVKLEE PIQIPPPGTGAPEVHRWWTSLTSEERQRLIAEHPEQIGNLNGVPVSARSDANIAVMTR DLNRVRDIATRYRTSVDDVLGDPAKYGLSAGDITRYRNADETKKGLDHNARNDPRNPS PVYLFAYDPMAFGGKGRAAIAIGNPDTAKHTAVIVPGTSSSVKGGWLHDNHDDALNLF NQAKAADPNNPTAVIAWMGYDAPNDFTDPRIATPMLARIGGAALAEDVNGLWVTHLGV GQNVTVLGHSYGSTTVADAFALGGMHANDAVLLGCPGTDLAHSAASFHLDGGRVYVGA ASTDPISMLGQLDSLSQYVNRGNLAGQLQGLAVGLGTDPAGDGFGSVRFRAEVPNSDG INPHDHSYYYHRGSEALRSMADIASGHGDALASDGMLAQPRHQPGVEIDIPGLGSVEI DIPGTPASIDPEWSRPPGSITDDHVFDAPLHR" CDS 2323603..2324166 /codon_start=1 /transl_table=11 /gene="lppJ" /locus_tag="BQ2027_MB2106" /product="lipoprotein lppj" /note="Mb2106, lppJ, len: 187 aa. Equivalent to Rv2080, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 187 aa overlap). Possible lppJ, lipoprotein; contains prokayotic lipoprotein modification site (PS00013) and signal sequence at N-terminus. Mb2106 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VER1" /db_xref="UniProtKB/Swiss-Prot:Q7VER1" /protein_id="SIU00713.1" /translation="MPHSTADRRLRLTRQALLAAAVAPLLAGCALVMHKPHSAGSSNP WDDSAHPLTDDQAMAQVVEPAKQIVAAADLQAVRAGFSFTSCNDQGDPPYQGTVRMAF LLQGDHDAYFQHVRAAMLSHGWIDGPPPGQYFHGITLHKNGVTANMSLALDHSYGEMI LDGECRNTTDHHHDDETTNITNQLVQP" CDS complement(2324362..2324805) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2107C" /product="conserved transmembrane protein" /note="Mb2107c, -, len: 147 aa. Equivalent to Rv2081c, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 147 aa overlap). Possible transmembrane unknown protein. Hydrophobic stretch from aa 32-54. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp insertion (*-ggg) leads to a slightly longer product compared t its homolog in Mycobacterium tuberculosis strain H37Rv (147 aa versus 146 aa). Mb2107c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2B4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B4" /protein_id="SIU00714.1" /translation="MFANAGLSPFVAIWTARAASLYTSHNFWCAAAVSAAVYVGSAVV PAAVAGPLFVGRVSATIKAAAPSTTAAIATLATAANGQLRERGGAGGWVGVHCPVVGG GGGVGHPRKAIAAAVSVHSTCMPAAFGGHLGLGDRSRSVSLSGTP" mobile_element 2324842..2326310 /mobile_element_type="insertion sequence:IS1556" /locus_tag="BQ2027_IS1556" /note="IS1556, len: 1469 nt. Equivalent to IS1556, len: 1469 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1469 nt overlap). Possible IS-like region." CDS 2325009..2327174 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2108" /product="Vegetative cell wall protein gp1 precursor" /note="Mb2108, -, len: 721 aa. Equivalent to Rv2082, len: 721 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 721 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0029, and to Rv3899c and Rv3900c which may be frameshifted. Protein product from Mb2108 detected using SWATH mass spectrometry. Mb2108 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR040604" /db_xref="InterPro:IPR040833" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X4" /protein_id="SIU00715.1" /translation="MAGDLPPGRWSALLVGAWWPARPDAPMAGVTYWRKAAQLKRNEA NDLRNERSLLAVNQGRTADDLLERYWRGEQRLATIAHQCEVKSDQSEQVADAVNYLRD RLTEIAQSGNQQINQILAGKGPIEAKVAAVNAVIEQSNAMADHVGATAMSNIIDATQR VFDETIGGDAHTWLRDHGVSLDAPARPRPVTAEDMTSMTANSPAGSPFGAAPSAPSHS TTTSGPPTAPTPTSPFGTAPMVLSSSSTSSGPPTAPTPTSPFGTAPMPPGPPPPGTVS PPLPPSAPAVGVGGPSVPAAGMPPAAAAATAPLSPQSLGQSFTTGMTTGTPAAAGAQA LSAGALHAATEPLPPPAPPPTTPTVTTPTVATATTAGIPHIPDSAPTPSPAPIAPPTT DNASAMTPIAPMVANGPPASPAPPAAAPAGPLPAYGADLRPPVTTPPATPPTPTGPIS GAAVTPSSPAAGGSLMSPVVNKSTAPATTQAQPSNPTPPLASATAAATTGAAAGDTSR RAAEQQRLRRILDTVARQEPGLSWAAGLRDNGQTTLLVTDLASGWIPPHIRLPAHITL LEPAPRRRHATVTDLLGTTTVAAAHHPHGYLSQPDPDTPALTGDRTARIAPTIDELGP TLVETVRRHDTLPRIAQAVVVAATRNYGVPDNETDLLHHKTTEIHQAVLTTYPNHDIA TVVDWMLLAAINALIAGDQSGANYHLAWAIAAISTRRSR" CDS 2327171..2328115 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2109" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2109, -, len: 314 aa. Equivalent to Rv2083, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 314 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv3898c (110 aa) and Rv3897c (210 aa) Protein product from Mb2109 detected using SWATH mass spectrometry. Mb2109 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y069" /protein_id="SIU00716.1" /translation="MTSIESHPEQYWAAAGRPGPVPLALGPVHPGGPTLIDLLMALFG LSTNADLGGTNADIEGDDTDRRAHAADAARKFSANEANAAEQMQGVGAQGMAQMASGI GGALSGALGGVMGPLTQLPQQAMQAGQGAMQPLMSAMQQAQGADGLAAVDGARLLDSI GGEPGLGSGAGGGDVGGGGAGGTTPTGYLGPPPVPTSSPPTTPAGAPTKSATMPPPGG ASPASAHMGAAGMPMVPPGAMGARGEGSGQEKPVEKRVTAPAVPNGQPVKGRLTVPPS APTTKPTDGKPVVRRRILLPEHKDFGRIAPDEKTDAGE" CDS 2328108..2329103 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2110" /product="HYPOTHETICAL PROTEIN" /note="Mb2110, -, len: 331 aa. Similar to 5' end of Rv2084, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 284 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2084 exists as a single gene. In Mycobacterium bovis, a frameshift due to an 11 bp insertion (*-ggcgtacacac), splits Rv2084 into 2 parts, Mb2110 and Mb2111. Mb2110 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J2" /protein_id="SIU00717.1" /translation="MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLD TQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLI DVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIH ALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGE LSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTA ATIQQAYTRAERRAMAAAVVAKIRGDAMGLDAQRDAVHRAAADALHALQSVGIHQ" CDS 2329148..2329255 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2111" /product="HYPOTHETICAL PROTEIN" /note="Mb2111 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing.,Mb2111, -, len: 110 aa. Similar to 3' end of Rv2084, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (97.0% identity in 100 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis stain H37Rv, Rv2084 exists as a single gene. In Mycobacterium bovis, a frameshift due to an 11 bp insertion (*-ggcgtacacac), splits Rv2084 into 2 parts, Mb2110 and Mb2111. Protein product from Mb2111 detected using SWATH mass spectrometry." /db_xref="UniProtKB/TrEMBL:A0A1R3Y045" /protein_id="SIU00718.1" /translation="MPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC" gene 2329253..2330721 /locus_tag="BQ2027_IS1556" CDS 2329338..2329643 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2112" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2112, -, len: 101 aa. Equivalent to Rv2085, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Conserved hypothetical protein, similar to YI32_MYCTU P19772 insertion element IS986 hypothetical 6.6 kda protein (59 aa), FASTA scores, opt: 119, E(): 0.002 9, (36.4% identity in 55 aa overlap); ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Contains possible helix-turn-helix motif at aa 33 to 54,(+3.11 SD). Mb2112 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64936" /protein_id="SIU00719.1" /translation="MSDMCDVVSFVGAAERVLRARFRPSPESGPPVHARRCGWSLGIS AETLRRWAGQAEVDSGVVAGVSASRSGSVKTSELEQTIEILKVATSFFARKCDPRHR" CDS 2329622..2330227 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2113" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2113, -, len: 201 aa. Equivalent to Rv2086, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Conserved hypothetical protein: low similarity to transposases; ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Start changed since first submission (-16 aa)." /db_xref="InterPro:IPR025948" /db_xref="UniProtKB/Swiss-Prot:P64938" /protein_id="SIU00720.1" /translation="MRPATPLICAFGDKHKHTYGVTPICRALAVHGVQIASRTYFADR AAAPSKRALWDTTITEILAGYYEPDAEGKRPPECLYGSLKMWAHLQRQGFRWPSATVK TIMRANGWRGVPLAAHITHHRTRPGRGPGPRPGGSAMAGFSNEPAGSGRLHLRADDVE FRLHRVRGRRLRRCDRGLGMLADQRRSVRRTRITPRPSRLT" CDS 2330305..2330535 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2114" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2114, -, len: 76 aa. Equivalent to Rv2087, len: 76 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 76 aa overlap). Conserved hypothetical protein, with low similarity to transposases; ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Start changed since first submission (-45 aa). Mb2114 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y063" /protein_id="SIU00721.1" /translation="MLAGLRPSIGIVGDALDNALCETTTGPHRTECSHGSPFRSGPIR TLADLEDIASAWVEHTCHTQQGVRIPGRLQPA" CDS 2330722..2332491 /codon_start=1 /transl_table=11 /gene="pknJ" /locus_tag="BQ2027_MB2115" /product="transmembrane serine/threonine-protein kinase j pknj (protein kinase j) (stpk j)" /note="Mb2115, pknJ, len: 589 aa. Equivalent to Rv2088, len: 589 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 589 aa overlap). Probable pknJ, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), similar to other serine/threonine-protein kinases e.g. PKWA_THECU|P49695 putative serine/threonine-protein kinase (742 aa), FASTA scores: opt: 457, E(): 2.7e-15, (26.0% identity in 578 aa overlap); etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation. Protein product from Mb2115 detected using SWATH mass spectrometry. Mb2115 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65733" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/Swiss-Prot:P65733" /protein_id="SIU00722.1" /translation="MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKV LAAELSRDLDFRARFVREADVAAGLDHPNIVAVHQRGQFEGRLWIAMQFVDGGNAEDA LRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGGDERVLLSDFG IARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLYSLGCALFRLLTGEAPFAA GAGAAVAVVAGHLHQPPPTVSDRVPGLSAAMDAVIATAMAKDPMRRFTSAGEFAHAAA AALYGGATDGWVPPSPAPHVISQGAVPGSPWWQHPVGSVTALATPPGHGWPPGLPPLP RRPRRYRRGVAAVAAVMVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPP IVTRSRLPGLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSA YDLGTVIGFYGLTIDEPPTGTAPNQVGQLIVAFRDAATAQRHLADLASIWRRCGGRTV TLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDHAIAAKNNVLVDVDIMT PDTSRGQQAVIGITNYILAKIPG" CDS complement(2332508..2333635) /codon_start=1 /transl_table=11 /gene="pepE" /locus_tag="BQ2027_MB2116C" /product="dipeptidase pepe" /note="Mb2116c, pepE, len: 375 aa. Equivalent to Rv2089c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 375 aa overlap). Probable pepE, dipeptidase, similar to e.g. PEPQ_LACDL P46545, xaa-pro dipeptidase (368 aa), FASTA scores, opt: 617, E(): 5.1 e-32, (34.7% identity in 363 aa overlap); contains PS00491 Aminopeptidase P and proline dipeptidase signature. Also similar to Mycobacterium tuberculosis peptidases Rv2861c, Rv0734, Rv2535c. Protein product from Mb2116c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2116c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65811" /db_xref="InterPro:IPR000587" /db_xref="InterPro:IPR000994" /db_xref="InterPro:IPR001131" /db_xref="InterPro:IPR029149" /db_xref="InterPro:IPR036005" /db_xref="UniProtKB/Swiss-Prot:P65811" /protein_id="SIU00723.1" /translation="MGSRRFDAEVYARRLALAAAATADAGLAGLVITPGYDLCYLIGS RAETFERLTALVLPAAGAPAVVLPRLELAALKQSAAAELGLRVCDWVDGDDPYGLVSA VLGGAPVATAVTDSMPALHMLPLADALGVLPVLATDVLRRLRMVKEETEIDALRKAGA AIDRVHARVPEFLVPGRTEADVAADIAEAIVAEGHSEVAFVIVGSGPHGADPHHGYSD RELREGDIVVVDIGGTYGPGYHSDSTRTYSIGEPDSDVAQSYSMLQRAQRAAFEAIRP GVTAEQVDAAARDVLAEAGLAEYFVHRTGHGIGLCVHEEPYIVAGNDLVLVPGMAFSI EPGIYFPGRWGARIEDIVIVTEDGAVSVNNCPHELIVVPVS" CDS 2333684..2334808 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2117" /product="probable 5'-3' exonuclease" /note="Mb2117, -, len: 374 aa. Equivalent to Rv2090, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (94.7% identity in 393 aa overlap). Probable 5'-3' exonuclease, similar to exonuclease part of DNA polymerase, e.g. DPO1_MYCTU Q07700 DNA polymerase I (EC 2.7.7.7) (pol i) (904 aa), FASTA scores, opt: 461, E(): 1.2e-17, (38.7% identity in 292 aa overlap). BELONGS TO FAMILY A OF DNA POLYMERASES. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 57 bp in-frame deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (374 aa versus 393 aa). Protein product from Mb2117 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y2B9" /db_xref="InterPro:IPR002421" /db_xref="InterPro:IPR008918" /db_xref="InterPro:IPR020045" /db_xref="InterPro:IPR020046" /db_xref="InterPro:IPR029060" /db_xref="InterPro:IPR036279" /db_xref="InterPro:IPR038969" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B9" /protein_id="SIU00724.1" /translation="MPAPDPMRGDPPHPAPPRLRSPLDPTSGDPLHPAPPRLRSPLDP TSGDPLHPAPPRLRSPLVLLDGASMWFRSFFGVPSSITAPDGRPVNAVRGFIDSMAVV IIQQRPNRLAVCLDLDWRPQFRVDLIPSYKAHRVAEPEPNGQPDVEEVPDELTPQVDM IMELLDAFGIAMAGAPGFEADDVLGTLATRERRDPVIVVSGDRDLLQVVADDPVPVRV LYLGRGLAKATLFGPAEVAERYGLPAHRAGAAYAELALLRGDPSDGLPGVPGVGEKTA ATLLARHGSLDQIMAAADDRKTTMAKGLRTKLLAASAYIKAADRVVRVATDAPVTLST PTDRLPLVAADPERTAELATRFGVESSIARLQKALDTLPG" CDS complement(2334812..2335546) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2118C" /product="Probable membrane protein" /note="Mb2118c, -, len: 244 aa. Equivalent to Rv2091c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Probable membrane protein; contains potential transmembrane region. Repetitive ORF. Protein product from Mb2118c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2118c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64940" /db_xref="InterPro:IPR025637" /db_xref="UniProtKB/Swiss-Prot:P64940" /protein_id="SIU00725.1" /translation="MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATW QAPAYTPQYQQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQ PGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFW APGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV SIDGTSKRVTVTFQDNKGTYEVGRPQ" CDS complement(2335588..2338308) /codon_start=1 /transl_table=11 /gene="helY" /locus_tag="BQ2027_MB2119C" /product="atp-dependent dna helicase hely" /note="Mb2119c, helY, len: 906 aa. Equivalent to Rv2092c, len: 906 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 906 aa overlap). Probable helY, DNA helicase (EC 3.6.1.-), with similarity to YJF0_YEAST P47047 hypothetical helicase in tdh1-gyp6 intergenic region, (1073 aa), FASTA scores, opt: 1004, E(): 0, (29.0% identity in 970 aa o verlap); contains PS00017 ATP/GTP-binding site motif A, PS00402 Binding-protein-dependent transport systems inner membrane comp signature. BELONGS TO THE SKI2 SUBFAMILY OF HELICASES. Protein product from Mb2119c detected using SWATH mass spectrometry. Mb2119c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y080" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR012961" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y080" /protein_id="SIU00726.1" /translation="MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAG KTVVGEFAVHLALAAGSKCFYTTPLKALSNQKHTDLTARYGRDQIGLLTGDLSVNGNA PVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVILQLPDDVRVV SLSATVSNAEEFGGWIQMVRGDTTVVVDEHRPVPLWQHVLVGKRMFDLFDYRIGEAEG QPQVNRELLRHIAHRREADRMADWQPRRRGSGRPGFYRPPGRPEVIAKLDAEGLLPAI TFVFSRAGCDAAVTQCLRSPLRLTSEEERARIAEVIDHRCGDLADSDLAVLGYYEWRE GLLRGLAAHHAGMLPAFRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKF NGEQHMPLTPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSS FAPSYNMTINLVHRMGPQQAHRLLEQSFAQYQADRSVVGLVRGIERGNRILGEIAAEL GGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAALRRGDIITITHGRRGG LAVVLESARDRDDPRPLVLTEHRWAGRISSADYSGTTPVGSMTLPKRVEHRQPRVRRD LASALRSAAAGLVIPAARRVSEAGGFHDPELESSREQLRRHPVHTSPGLEDQIRQAER YLRIERDNAQLERKVAAATNSLARTFDRFVGLLTEREFIDGPATDPVVTDDGRLLARI YSESDLLVAECLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQA LTQTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADVNGSGSP LLAGDFVRWCRQVLDLLDQVRNAAPNPELRATAKRAIGDIRRGVVAVDAG" CDS complement(2338357..2339283) /codon_start=1 /transl_table=11 /gene="tatC" /locus_tag="BQ2027_MB2120C" /product="sec-independent protein translocase transmembrane protein tatc" /note="Mb2120c, tatC, len: 308 aa. Equivalent to Rv2093c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Probable tatC, transmembrane protein, component of twin-arginine translocation protein export system (see citation below for more information), equivalent to U00017|U00017_1 from Mycobacterium leprae (317 aa), FASTA scores: opt: 1722, E(): 0, (84.5% identity in 310 aa overlap). Similarity to others e.g. P27857|TATC_ECOLI|MTTB|B3839|Z5360|ECS4768 Sec-independent protein translocase protein from E. coli strain K12 and O157:H7 (258 aa), FASTA scores: opt: 344, E(): 6e-16, (32.5% identity in 265 aa overlap). BELONGS TO THE TATC FAMILY. Protein product from Mb2120c detected using SWATH mass spectrometry. Mb2120c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66896" /db_xref="InterPro:IPR002033" /db_xref="InterPro:IPR019820" /db_xref="UniProtKB/Swiss-Prot:P66896" /protein_id="SIU00727.1" /translation="MRAAGLLKRLNPRNRRSRVNPDATMSLVDHLTELRTRLLISLAA ILVTTIFGFVWYSHSIFGLDSLGEWLRHPYCALPQSARADISADGECRLLATAPFDQF MLRLKVGMAAGIVLACPVWFYQLWAFITPGLYQRERRFAVAFVIPAAVLFVAGAVLAY LVLSKALGFLLTVGSDVQVTALSGDRYFGFLLNLLVVFGVSFEFPLLIVMLNLAGLLT YERLKSWRRGLIFAMFVFAAIFTPGSDPFSMTALGAALTVLLELAIQIARVHDKRKAK REAAIPDDEASVIDPPSPVPAPSVIGSHDDVT" CDS complement(2339300..2339551) /codon_start=1 /transl_table=11 /gene="tatA" /locus_tag="BQ2027_MB2121C" /product="sec-independent protein translocase membrane-bound protein tata" /note="Mb2121c, tatA, len: 83 aa. Equivalent to Rv2094c, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Probable tatA, membrane-bound protein, component of twin-arginine translocation protein export system (see citation below for more information), equivalent to U00017_2 from Mycobacterium leprae (88 aa), FASTA scores: opt: 392, E(): 2e-20, (68.2% identity in 88 aa overlap). Similarity to others e.g. P27856|O65938|TATA_ECOLI SEC-INDEPENDENT PROTEIN TRANSLOCASE PROTEIN from E. coli strains K12 and O157:H7 (261 aa), FASTA scores: opt: 111, E(): 0.25, (28.0 % identity in 75 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE TATA/E FAMILY. Protein product from Mb2121c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2121c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66890" /db_xref="InterPro:IPR003369" /db_xref="InterPro:IPR006312" /db_xref="UniProtKB/Swiss-Prot:P66890" /protein_id="SIU00728.1" /translation="MGSLSPWHWAILAVVVIVLFGAKKLPDAARSLGKSLRIFKSEVR ELQNENKAEASIETPTPVQSQRVDPSAASGQDSTEARPA" CDS complement(2339619..2340569) /codon_start=1 /transl_table=11 /gene="pafc" /locus_tag="BQ2027_MB2122C" /product="proteasome accessory factor c pafc" /note="Mb2122c, -, len: 316 aa. Equivalent to Rv2095c, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 316 aa overlap). Conserved hypothetical protein. Highly similar to ML1330 P54075|YY35_MYCLE HYPOTHETICAL 27.0 KD PROTEIN (247 aa) opt: 1127 E(): 0, (78.4% identity in 227 aa overlap). Also similar to ORF11(1) of Rhodococcus erythropolis. FASTA score: Z82004|REZ820043 REZ82004 NID: g1666179 - Rhodococcus (326 aa) opt: 624 E(): 1.1e-30; (56.7% identity in 319 aa overlap). Contains possible helix-turn-helix motif at aa 25-46, (+2.92 SD) Protein product from Mb2122c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2122c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0A3" /db_xref="InterPro:IPR026881" /db_xref="InterPro:IPR028349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0A3" /protein_id="SIU00729.1" /translation="MSALSTRLVRLLNMVPYFQANPRITRAEAAAELGVTAKQLEEDL NQLWMCGLPGYSPGDLIDFEFCGDTIEVTFSAGIDRPLKLTSPEATGLLVALRALADI PGVVDPQAARSAIAKIAAAAGAVAAVAEQAPTESPAAAAVRAAVRNSRALTIDYYAAS HDTLTTRIVDPIRVLLIGGHSYLEAWSREAEGVRLFRFDRIVDAAELGEPAVPPESAR QAPPDTSLFDGDLSLPSATLRVAPSASWMLEYYPIRELRQLPDGSCEVVMTYASEDWM TRLLLGFGSDVRVLAPESLAQRVRDAATAALDAYQAAAPP" CDS complement(2340566..2341564) /codon_start=1 /transl_table=11 /gene="pafb" /locus_tag="BQ2027_MB2123C" /product="proteasome accessory factor b pafb" /note="Mb2123c, -, len: 332 aa. Equivalent to Rv2096c, len: 332 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 332 aa overlap). Conserved hypothetical protein. Highly similar to ML1329, P54076|YY36_MYCLE HYPOTHETICAL 35.4 KD PROTEIN B21 (331 aa) opt: 1676 E(): 0; (80.2% identity in 329 aa overlap) and to ORF10(1) of Rhodococcus erythropolis, Z82004|REZ820042 REZ 82004 NID: g1666179 (330 aa) opt: 1232, E(): 0; 59.9% identity in 332 aa overlap Protein product from Mb2123c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2123c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR026881" /db_xref="UniProtKB/Swiss-Prot:P64942" /protein_id="SIU00730.1" /translation="MATSKVERLVNLVIALLSTRGYITAEKIRSSVAGYSDSPSVEAF SRMFERDKNELRDLGIPLEVGRVSALEPTEGYRINRDAYALSPVELTPDEAAAVAVAT QLWESPELITATQGALLKLRAAGVDVDPLDTGAPVAIASAAAVSGLRGSEDVLGILLS AIDSGQVVQFSHRSSRAEPYTVRTVEPWGVVTEKGRWYLVGHDRDRDATRVFRLSRIG AQVTPIGPAGATTVPAGVDLRSIVAQKVTEVPTGEQATVWVAEGRATALRRAGRSAGP RQLGGRDGEVIELEIRSSDRLAREITGYGADAIVLQPGSLRDDVLARLRAQAGALA" CDS complement(2341573..2342931) /codon_start=1 /transl_table=11 /gene="pafa" /locus_tag="BQ2027_MB2124C" /product="proteasome accessory factor a pafa" /note="Mb2124c, -, len: 452 aa. Equivalent to Rv2097c, len: 452 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 452 aa overlap). Conserved hypothetical protein. Similarity to YTH6_ RHOSO P43484 hypothetical protein in thcr 5' region (333 aa), FASTA scores opt: 738, E(): 0, (38.5% identity in 330 aa overlap). Also highly similar to Mycobacterium leprae protein ML1328, P54077|YY37_MYCLE HYPOTHETICAL 38.1 KD PROTEIN (336 aa) opt: 1985 E(): 0; (96.4% identity in 307 aa overlap) Protein product from Mb2124c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2124c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64944" /db_xref="InterPro:IPR004347" /db_xref="InterPro:IPR022279" /db_xref="UniProtKB/Swiss-Prot:P64944" /protein_id="SIU00731.1" /translation="MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSS NVFLRNGARLYLDVGSHPEYATAECDSLVQLVTHDRAGEWVLEDLLVDAEQRLADEGI GGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQLICGAGKVLQT PKAATYCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADAEKYRRLHVIVGDSNMSET TTMLKVGTAALVLEMIESGVAFRDFSLDNPIRAIREVSHDVTGRRPVRLAGGRQASAL DIQREYYTRAVEHLQTREPNAQIEQVVDLWGRQLDAVESQDFAKVDTEIDWVIKRKLF QRYQDRYDMELSHPKIAQLDLAYHDIKRGRGIFDLLQRKGLAARVTTDEEIAEAVDQP PQTTRARLRGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRAVDERVKRLIA SM" CDS complement(2342983..2344458) /codon_start=1 /transl_table=11 /gene="pe21" /locus_tag="BQ2027_MB2125C" /product="pe-pgrs family protein pe_pgrs36" /note="Mb2125c, PE_PGRS36, len: 491 aa. Equivalent to Rv2099c (PE21) and Rv2098c (PE_PGRS36), len: 58 aa and 433 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 58 aa overlap and 99.8% identity in 433 aa overlap). Rv2098c|PE_PGRS36: Member of Mycobacterium tuberculosis PE-family, PGRS sub-family. Frameshifted near N-terminus. Rv2099c|PE21: 5'-end of Rv2098c (MTCY49.38c), then frameshifts. Sequence has been checked, no errors found. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS36 and PE21 exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-c) leads to a single product more similar to PE_PGRS36. There is also a 3 bp in-frame deletion (ggc-*)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A689" /protein_id="SIU00732.1" /translation="MSFVIASPEALLAAATDLAAIRSTIRAANAAAAVPTTGALAPAA DEVSAGIAALFGAQAQSYQAVSAQAAAFHDRFVQLLNAGGGSYASAEIANAQQNLLNA VNAPTQTLLGRPLVGDGADGASGPVGQPGGDGGILWGNGGNGGDSTSPGVAGGAGGSA GLIGNGGRGGNGAPGGAGGNGGLGGLLLGNGGAGGVGGTGDNGVGDLGAGGGGGDGGL GGRAGLIGHGGAGGNGGDGGHGGSGKAGGSGGSGGFGQFGGAGGLLYGNGGAAGSGGN GGDAGTGVSSDGFAGLGGSGGRGGDAGLIGVGGGGGNGGDPGLGARLFQVGSRGGDGG VGGWLYGDGGGGGDGGNGGLPFIGSTNAGNGGSARLIGNGGAGGSGGSGAPGSVSSGG VGGAGNPGGSGGNGGVWYGNGGAGGAAGQGGPGMNTTSPGGPGGVGGHGGTAILFGDG GAGGAGAAGGPGTPDGAAGPGGSGGTGGLLFGVPGPSGPDG" CDS 2344641..2346293 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2126" /product="13E12 repeat family protein, HNH endonuclease domain" /note="Mb2126, -, len: 550 aa. Equivalent to Rv2100, len: 550 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 550 aa overlap). Conserved hypothetical protein. Member of Mycobacterium tuberculosis 13E12 repeat family with Rv1148c, Rv1945, Rv3467, Rv0094c, Rv1128c, Rv1587c, Rv1702c, Rv3466, Rv1588c." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/Swiss-Prot:P64946" /protein_id="SIU00733.1" /translation="MAGALFEPSFAAAHPAGLLRRPVTRTVVLSVAATSIAHMFEISL PDPTELCRSDDGALVAAIEDCARVEAAASARRLSAIAELTGRRTGADQRADWACDFWD CAAAEVAAALTISHGKASGQMHLSLALNRLPQVAALFLAGHLGARLFSIIAWRTYLVR DPHALSLLDAALAEHAGAWGPLSAPKLEKAIDSWIDRYDPGALRRSRISARTRDLCIG DPDEDAGTAALWGRLYATDAAMLDRRLTEMAHGVCEDDPRTLAQRRADALGALAAGAD HLACGCGKPDCPSGAGNDERAAGVVIHVVADASALDAQPDPHLSGDEPPSRPLTPETT LFEALTPDPEPDPPATHAPAELITTGGGVVPAPLLAELIRGGATISQVRHPGDLAAEP HYRPSAKLAEFVRMRDLTCRFPGCDVPAEFCDIDHSAPWPLGPTHPSNLKCACRKHHL LKTFWTGWRDVQLPDGTVIWTAPNGHTYTTHPGSRIFFPTWHTTTAELPQTSTAAVNV DARGLMMPRRRRTRAAELAHRINAERALNDAYMAERNKPPSF" CDS 2346492..2349533 /codon_start=1 /transl_table=11 /gene="helZ" /locus_tag="BQ2027_MB2127" /product="PROBABLE HELICASE HELZ" /note="Mb2127, helZ, len: 1013 aa. Equivalent to Rv2101, len: 1013 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 1013 aa overlap). Probable helZ, helicase (EC 3.6.-.-), similar to many e.g. PCC6803|P74552|SLL1366 HELICASE OF THE SNF2/RAD54 FAMILY from Synechocystis sp. strain PCC 6803 (1039 aa), FASTA scores: opt: 2015, E(): 0, (38.4% identity in 1063 aa overlap); etc. Protein product from Mb2127 detected using SWATH mass spectrometry. Mb2127 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2C1" /db_xref="InterPro:IPR000330" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR022138" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR038718" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C1" /protein_id="SIU00734.1" /translation="MLVLHGFWSNSGGMRLWAEDSDLLVKSPSQALRSARPHPFAAPA DLIAGIHPGKPATAVLLLPSLRSAPLDSPELIRLAPRPAARTDPMLLAWTVPVVDLDP TAALAAFDQPAPDVRYGASVDYLAELAVFARELVERGRVLPQLRRDTHGAAACWRPVL QGRDVVAMTSLVSAMPPVCRAEVGGHDPHELATSALDAMVDAAVRAALSPMDLLPPRR GRSKRHRAVEAWLTALTCPDGRFDAEPDELDALAEALRPWDDVGIGTVGPARATFRLS EVETENEETPAGSLWRLEFLLQSTQDPSLLVPAEQAWNDDGSLRRWLDRPQELLLTEL GRASRIFPELVPALRTACPSGLELDADGAYRFLSGTAAVLDEAGFGVLLPSWWDRRRK LGLVLSAYTPVDGVVGKASKFGREQLVEFRWELAVGDDPLSEEEIAALTETKSPLIRL RGQWVALDTEQLRRGLEFLERKPTGRKTTAEILALAASHPDDVDTPLEVTAVRADGWL GDLLAGAAAASLQPLDPPDGFTATLRPYQQRGLAWLAFLSSLGLGSCLADDMGLGKTV QLLALETLESVQRHQDRGVGPTLLLCPMSLVGNWQQEAARFAPNLRVYAHHGGARLHG EALRDHLERTDLVVSTYTTATRDIDELSEYEWNRVVLDEAQAVKNSLSRAAKAVRRLR AAHRVALTGTPMENRLAELWSIMDFLNPGLLGSSERFRTRYAIPIERHGHTEPAERLR ASTRPYILRRLKTDPAIIDDLPEKIEIKQYCQLTTEQASLYQAVVADMMEKIENTEGI ERRGNVLAAMAKLKQVCNHPAQLLHDRSPVGRRSGKVIRLEEILEEILAEGDRVLCFT QFTEFAELLVPHLAARFGRAARDIAYLHGGTPRKRRDEMVARFQSGDGPPIFLLSLKA GGTGLNLTAANHVVHLDRWWNPAVENQATDRAFRIGQRRTVQVRKFICTGTLEEKIDE MIEEKKALADLVVTDGEGWLTELSTRDLREVFALSEGAVGE" CDS 2349526..2350359 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2128" /product="SWF/SNF family helicase Rv2102" /note="Mb2128, -, len: 238 aa. Equivalent to Rv2102, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 238 aa overlap). Conserved hypothetical protein, similar to part of hypothetical protein D90916_18 (289 aa) from Synechocystis sp. PCC6803. Contains PS00017 ATP/GTP-binding site motif A(P-loop). FASTA scores: D90916|D90916_18 Synechocystis sp.PCC6803 complete (289 aa) opt: 498, E(): 1.9e-25; 46.7% identity in 167 aa overlap. Protein product from Mb2128 detected using SWATH mass spectrometry. Mb2128 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Z5" /db_xref="InterPro:IPR007527" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z5" /protein_id="SIU00735.1" /translation="MSSTWYPPPSRPRPVEGGIKARSTRGAIAQTWWSERFIAVLEDI GLGNRLQRGRSYARKGQVISLQVDAGLVTALVQGSRARPYRIRIGIPAFGKSQWAHVE RTLAENAWYAAKLLSGEMPEDIEDVFAGLGLSLFPGTARELSLDCSCPDYAVPCKHLA ATFYLLAESFDEDPFAILAWRGREREDLLANLAAARADGAAPAADHAEQVAQPLTDCL DRYYARQADINVPSPPATPSTALLDQLPDTGLSARGRPLTELLRPAYHALTHHHNSAG G" CDS complement(2350338..2350772) /codon_start=1 /transl_table=11 /gene="vapc37" /locus_tag="BQ2027_MB2129C" /product="possible toxin vapc37. contains pin domain." /note="Mb2129c, -, len: 144 aa. Equivalent to Rv2103c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Conserved hypothetical protein, similar to hypothetical mycobacterial proteins belonging to family, includes Rv0749, Rv0277c, Rv2530c, Rv3320c, Rv2494, Rv2872, Rv0617, Rv1242 etc. FASTA scores: sptr|Q49793|Q49793 B2126_C3_261 (97 aa) opt: 331, E(): 4.8e-18; 59.4% identity in 96 aa overlap and gp|Z74024|MTCY274_3 Mycobacterium tuberculosis cosmid (147 aa) opt: 234, E(): 1.2e-10; 34.8% identity in 141 aa overlap. Protein product from Mb2129c detected using shotgun mass spectrometry. Mb2129c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y089" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y089" /protein_id="SIU00736.1" /translation="MKIVDANVLLYAVNTTSEHHKPSLRWLDGALSGADRVGFAWVPL LAFVRLATKVGLFPRPLPREAAITQVADWLAAPSAVLVNPTVRHADILARMLTYVGTG ANLVNDAHLAALAVEHRASIVSYDSDFGRFEGVRWDQPPALL" CDS complement(2350779..2351033) /codon_start=1 /transl_table=11 /gene="vapb37" /locus_tag="BQ2027_MB2130C" /product="possible antitoxin vapb37" /note="Mb2130c, -, len: 84 aa. Equivalent to Rv2104c, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Conserved hypothetical protein, similar to members of a family of hypothetical mycobacterial proteins including Rv2871, Rv1241, Rv2132, Rv3321c, Rv1113, Rv0657, Rv1560, etc. FASTA scores: sptr|Q49787|Q49787 B2126_C2_217 (97 aa) opt: 197, E(): 2e-07; 57.1% identity in 56 aa overlap and Z95388|MTCY270_36 Mycobacterium tuberculosis cosmid (76 aa) opt: 142, E(): 0.0011; 41.8% identity in 55 aa overlap. Mb2130c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0M6" /protein_id="SIU00737.1" /translation="MRTTVTLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAP SHFSTRTADLGVPAVNLDRALQLAADLEDEELVRRQRRGS" CDS 2352253..2352549 /codon_start=1 /transl_table=11 /gene="PE22" /locus_tag="BQ2027_MB2131" /product="pe family protein pe22" /note="Mb2131, PE22, len: 98 aa. Equivalent to Rv2107, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 98 aa overlap). Member of mycobacterial PE family e.g. Y03A_MYCTU Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores; opt: 214 E(): 1.3e-14, 39.8% identity in 93 aa overlap" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y062" /protein_id="SIU00738.1" /translation="MSFVNVDPFGMLAAAATLESLGSHMAVSNAAVASVTTKVPPPAA DYVSKKLSLFFSSHGQQYQVQAARGTAFHRKLVRTLANGALAYEEVEIANNEGF" CDS 2352605..2353336 /codon_start=1 /transl_table=11 /gene="PPE36" /locus_tag="BQ2027_MB2132" /product="ppe family protein ppe36" /note="Mb2132, PPE36, len: 243 aa. Equivalent to Rv2108, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 243 aa overlap). N-terminus is similar to N-terminal region of Mycobacterium tuberculosis PPE family proteins eg. YX23_MYCTU Q10813 hypothetical 41.1 kd protein cy274.23 (404 aa), FASTA scores; opt: 431, E(): 3.9e-32, 44.0% identity in 166 aa overlap" /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0A7" /protein_id="SIU00739.1" /translation="MPNFWALPPEINSTRIYLGPGSGPILAAAQGWNALASELEKTKV GLQSALDTLLESYRGQSSQALIQQTLPYVQWLTTTAEHAHKTAIQLTAAANAYEQARA AMVPPAMVRANRVQTTVLKAINWFGQFSTRIADKEADYEQMWFQDALVMENYWEAVQE AIQSTSHFEDPPEMADDYDEAWMLNTVFDYHNENAKEEVIHLVPDVNKERGPIELVTK VDKEGTIRLVYDGEPTFSYKEHPKF" CDS complement(2353876..2354622) /codon_start=1 /transl_table=11 /gene="prcA" /locus_tag="BQ2027_MB2133C" /product="proteasome alpha subunit prca; assembles with beta subunit prcb." /note="Mb2133c, prcA, len: 248 aa. Equivalent to Rv2109c, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 248 aa overlap). prcA, proteasome alpha-type subunit 1, highly similar to TR:Q53080 (EMBL:U26421) proteasome alpha-type subunit 1 from Rhodococcus (259 aa), FASTA scores; opt: 1035, E(): 0, 67.2% identity in 247 aa overlap. Protein product from Mb2133c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2133c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZ14" /db_xref="InterPro:IPR001353" /db_xref="InterPro:IPR022296" /db_xref="InterPro:IPR023332" /db_xref="InterPro:IPR029055" /db_xref="UniProtKB/Swiss-Prot:Q7TZ14" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00740.1" /translation="MSFPYFISPEQAMRERSELARKGIARAKSVVALAYAGGVLFVAE NPSRSLQKISELYDRVGFAAAGKFNEFDNLRRGGIQFADTRGYAYDRRDVTGRQLANV YAQTLGTIFTEQAKPYEVELCVAEVAHYGETKPPELYRITYDGSIADEPHFVVMGGTT EPIANALKESYAENASLTDALGIAVAALRAGSADTSGGDQPTLGVASLEVAVLDANRP RRAFRRITGSALQALLVDQESPQSDGESSG" CDS complement(2354619..2355494) /codon_start=1 /transl_table=11 /gene="prcB" /locus_tag="BQ2027_MB2134C" /product="proteasome beta subunit prcb; assembles with alpha subunit prca." /note="Mb2134c, prcB, len: 291 aa. Equivalent to Rv2110c, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). prcB, proteasome beta-type subunit 2, highly similar to eg. TR:Q53083 (EMBL:U264 22) proteasome beta-type subunit 2 from Rhodococcus (292 aa), FASTA scores; opt: 1103, E(): 0, 64.5% identity in 262 aa overlap. Protein product from Mb2134c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2134c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TZ13" /db_xref="InterPro:IPR001353" /db_xref="InterPro:IPR022483" /db_xref="InterPro:IPR023333" /db_xref="InterPro:IPR029055" /db_xref="UniProtKB/Swiss-Prot:Q7TZ13" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00741.1" /translation="MTWPLPDRLSINSLSGTPAVDLSSFTDFLRRQAPELLPASISGG APLAGGDAQLPHGTTIVALKYPGGVVMAGDRRSTQGNMISGRDVRKVYITDDYTATGI AGTAAVAVEFARLYAVELEHYEKLEGVPLTFAGKINRLAIMVRGNLAAAMQGLLALPL LAGYDIHASDPQSAGRIVSFDAAGGWNIEEEGYQAVGSGSLFAKSSMKKLYSQVTDGD SGLRVAVEALYDAADDDSATGGPDLVRGIFPTAVIIDADGAVDVPESRIAELARAIIE SRSGADTFGSDGGEK" CDS complement(2355491..2355685) /codon_start=1 /transl_table=11 /gene="pup" /locus_tag="BQ2027_MB2135C" /product="prokaryotic ubiquitin-like protein pup" /note="Mb2135c, -, len: 64 aa. Equivalent to Rv2111c, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 64 aa overlap). Conserved hypothetical protein. Highly similar to a hypothetical protein TR:Q53078 (EMBL:U26422) (64 aa) upstream of Rhodococcus proteasome beta-type subunit 1, FASTA scores; opt: 349, E(): 7.3e-25, 84.4% identity in 64 aa overlap Protein product from Mb2135c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2135c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TZ12" /db_xref="InterPro:IPR008515" /db_xref="UniProtKB/Swiss-Prot:Q7TZ12" /protein_id="SIU00742.1" /translation="MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEID DVLEENAEDFVRAYVQKGGQ" CDS complement(2355798..2357405) /codon_start=1 /transl_table=11 /gene="dop" /locus_tag="BQ2027_MB2136C" /product="deamidase of pup dop" /note="Mb2136c, -, len: 535 aa. Similar to Rv2112c, len: 554 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 532 aa overlap). Conserved hypothetical protein. Highly similar to a hypothetical protein TR:Q53081 (EMBL:U26422) (499 aa) upstream of Rhodococcus proteasome beta-type subunit 1, FASTA scores opt: 2832 E(): 0, 85.3% identity in 502 aa overlap. Also some similarity to Mycobacterium tuberculosis hypothetical protein Rv2097c (MTCY49.37c, 38.2% identity in 419 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 57 bp in-frame deletion at the NH2 part, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (535 aa versus 554 aa). Protein product from Mb2136c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2136c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y085" /db_xref="InterPro:IPR004347" /db_xref="InterPro:IPR022366" /db_xref="UniProtKB/TrEMBL:A0A1R3Y085" /protein_id="SIU00743.1" /translation="MFWVGGRCLMPASSAARCAARIVGGPRLYGMQRIIGTEVEYGIS SPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDYEVESPLRDARGFDLSRSAGPPP VVDADEVGAANMILTNGARLYVDHAHPEYSAPECTDPLDAVIWDKAGERVMEAAARHV ASVPGAAKLQLYKNNVDGKGASYGSHENYLMSRQTPFSAIITGLTPFLVSRQVVTGSG RVGIGPSGDEPGFQLSQRSDYIEVEVGLETTLKRGIINTRDEPHADADRYRRLHVIIG DANLAETSTYLKLGTTALVLDLIEEGPAHAIDLTDLALARPVHAVHAISRDPSLRATV ALADGRELTGLALQRIYLDRVAKLVDSRDPDPRAADIVETWAHVLDQLERDPMDCAEL LDWPAKLRLLDGFRQRENLSWSAPRLHLVDLQYSDVRLDKGLYNRLVARGSMKRLVTE HQVLSAVENPPTDTRAYFRGECLRRFGADIAAASWDSVIFDLGGDSLVRIPTLEPLRG SKAHVGALLDSVDSAVELVEQLTAEPR" CDS 2357466..2358659 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2137" /product="Probable integral membrane protein" /note="Mb2137, -, len: 397 aa. Equivalent to Rv2113, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 397 aa overlap). Probable integral membrane protein. Protein product from Mb2137 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2137 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2C2" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C2" /protein_id="SIU00744.1" /translation="MSLSVRRPPAARAAAIVEAESWFLKRGLPSVLTMRGRCRRLWPR SAPMLAAWAVVEGCLMAVFFVTDGGEVFISATPTTAQWVILALLAVALPLASLVGWLV SQISSGRGQAAVATMAVAFAAASDVIESGPIQLLRTAVVVGLVLLQTGCGVGSVLGWA VRMTLEHLATVGTLAVRALPIVLLTALVFFNTYVWLMAANINGERLPLAMVFLLAIAG AFVVSKTVERVRPLLRSTTVMPQGSQSLAGTPFATMGDPSPGFPLTRAERLNVVFLLA ALQLVEILVVASVGAAIYLVLGMIILTPPLLREWTHYDSMTTTVLGMTFPAPDSLIRM CLFLGALTFMYISARAVDDAEYRAMFLDPLIDDLHTALLARNRYRNNVVTAPCAGVDA GHVDD" CDS 2358670..2359293 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2138" /product="conserved protein" /note="Mb2138, -, len: 207 aa. Equivalent to Rv2114, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Unknown Protein product from Mb2138 detected using shotgun mass spectrometry. Mb2138 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016792" /db_xref="UniProtKB/TrEMBL:A0A1R3Y103" /protein_id="SIU00745.1" /translation="MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAE LWSALDPQALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLL SSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPK LGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTNSYSLVTA" CDS complement(2359297..2361126) /codon_start=1 /transl_table=11 /gene="mpa" /locus_tag="BQ2027_MB2139C" /product="mycobacterial proteasome atpase mpa" /note="Mb2139c, -, len: 609 aa. Equivalent to Rv2115c, len: 609 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 609 aa overlap). Probable ATPase (EC 3.6.1.-), similar to e.g. YB56_METJA Q58556 cell division cycle protein 48 homolog (903 aa), FASTA scores; opt: 423, E(): 8.1e-32, 45.8% identity in 249 aa overlap. Contains PS00674 AAA-protein family signature and PS00017 ATP/GTP-binding site motif A (P-loop). Also some similarity to other Mycobacterium tuberculosis ATPases eg. Rv0435c and Rv3610c. Equivalent to Mycobacterium leprae U00 017|U00017_18 (609 aa), FASTA scores; opt: 3670 E(): 0; 92.9% identity in 609 aa overlap. Protein product from Mb2139c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2139c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63346" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR003960" /db_xref="InterPro:IPR022482" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR032501" /db_xref="InterPro:IPR041626" /db_xref="UniProtKB/Swiss-Prot:P63346" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00746.1" /translation="MGESERSEAFGIPRDSPLSSGDAAELEQLRREAAVLREQLENAV GSHAPTRSARDIHQLEARIDSLAARNSKLMETLKEARQQLLALREEVDRLGQPPSGYG VLLATHDDDTVDVFTSGRKMRLTCSPNIDAASLKKGQTVRLNEALTVVEAGTFEAVGE ISTLREILADGHRALVVGHADEERVVWLADPLIAEDLPDGLPEALNDDTRPRKLRPGD SLLVDTKAGYAFERIPKAEVEDLVLEEVPDVSYADIGGLSRQIEQIRDAVELPFLHKE LYREYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKMAEVRGDDAHEAKSYFLNIKG PELLNKFVGETERHIRLIFQRAREKASEGTPVIVFFDEMDSIFRTRGTGVSSDVETTV VPQLLSEIDGVEGLENVIVIGASNREDMIDPAILRPGRLDVKIKIERPDAEAAQDIYS KYLTEFLPVHADDLAEFDGDRSACIKAMIEKVVDRMYAEIDDNRFLEVTYANGDKEVM YFKDFNSGAMIQNVVDRAKKNAIKSVLETGQPGLRIQHLLDSIVDEFAENEDLPNTTN PDDWARISGKKGERIVYIRTLVTGKSSSASRAIDTESNLGQYL" CDS 2361407..2361976 /codon_start=1 /transl_table=11 /gene="lppK" /locus_tag="BQ2027_MB2140" /product="conserved lipoprotein lppk" /note="Mb2140, lppK, len: 189 aa. Equivalent to Rv2116, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 189 aa overlap). Probable lppK, conserved lipoprotein, similar to Mycobacterium leprae B2126_F3_115 TR:Q49803 (194 aa), FASTA scores; opt: 624, E(): 3.1e-31, 51.6% identity in 190 aa overlap. Contains N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Some similarity to Rv2376c. Mb2140 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65301" /db_xref="UniProtKB/Swiss-Prot:P65301" /protein_id="SIU00747.1" /translation="MRRNIRVTLGAATIVAALGLSGCSHPEFKRSSPPAPSLPPVTSS PLEAAPITPLPAPEALIDVLSRLADPAVPGTNKVQLIEGATPENAAALDRFTTALRDG SYLPMTFAANDIAWSDNKPSDVMATVVVTTAHPDNREFTFPMEFVSFKGGWQLSRQTA EMLLAMGNSPDSTPSATSPAPAPSPTPPG" CDS 2361984..2362277 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2141" /product="YlxP-like protein" /note="Mb2141, -, len: 97 aa. Equivalent to Rv2117, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Conserved hypothetical protein. Similar to hypothetical proteins from Mycobacterium leprae TR:Q49798 U2126J (97 aa), FASTA scores; opt: 554, E(): 0, 85.6% identity in 97 aa overlap, and Bacillus subtilis YLXP_BACSU P32730 hypothetical 10.7 kd protein (92 aa), FASTA scores; opt: 173, E(): 1.4e-11, 34.1% identity in 82 aa overlap Protein product from Mb2141 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR007546" /db_xref="InterPro:IPR036746" /db_xref="UniProtKB/TrEMBL:A0A1R3Y074" /protein_id="SIU00748.1" /translation="MWIGWLEFDVLLGDVRSLKQKRSVTRPLVAELQRKFSVSAAETG SHDLYRRAGIGVAVVSGDRSHAVDVLDNAERLVAAHPEFELLSVRRGLHRTDD" CDS complement(2362306..2363148) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2142C" /product="rna methyltransferase" /note="Mb2142c, -, len: 280 aa. Equivalent to Rv2118c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Possible S-adenosyl-l-methionine-dependent RNA methyltransferase (EC 2.1.1.-) (see citation below); corresponds to Mycobacterium leprae B2126_C1_165, similar to hypothetical proteins from several organisms e.g. Y134_METJA Q57598 hypothetical protein mj0134 (282 aa), FASTA scores; opt: 256, E(): 1e-13, FASTA scores; 30.2% identity in 285 aa overlap. The larger catalytic C-terminal domain binds the cofactor S-adenosyl-l-methionine (AdoMet) and is involved in the transfer of methyl group from AdoMet to the substrate. Protein product from Mb2142c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2142c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0B7" /db_xref="InterPro:IPR014816" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0B7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00749.1" /translation="MSATGPFSIGERVQLTDAKGRRYTMSLTPGAEFHTHRGSIAHDA VIGLEQGSVVKSSNGALFLVLRPLLVDYVMSMPRGPQVIYPKDAAQIVHEGDIFPGAR VLEAGAGSGALTLSLLRAVGPAGQVISYEQRADHAEHARRNVSGCYGQPPDNWRLVVS DLADSELPDGSVDRAVLDMLAPWEVLDAVSRLLVAGGVLMVYVATVTQLSRIVEALRA KQCWTEPRAWETLQRGWNVVGLAVRPQHSMRGHTAFLVATRRLAPGAVAPAPLGRKRE GRDG" CDS 2363222..2364058 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2143" /product="RecB family exonuclease" /note="Mb2143, -, len: 278 aa. Equivalent to Rv2119, len: 278 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 278 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium leprae hypothetical protein TR:Q49799 U2126V (212 aa), FASTA scores; opt: 1153, E(): 0, 83.6% identity in 195 aa overlap. Orthologs present in Rhodococcus erythropolis (gb|AAC68687.1|(AF088800) and Streptomyces emb|CAB59506.1|(AL132648),Mb2143 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR011604" /db_xref="InterPro:IPR038726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0B1" /protein_id="SIU00750.1" /translation="MADQPDPPTPRPALSPSRATDFKQCPLLYRFRAIDRLPEATSAA QLRGSVVHAALEQLYGLPAGLRSPDTARSLVQRAWDQMVAAEPELAGELDPGQPTQLL EDARALVSGYYRLEDPTRFDPQCCEQRVEVELADGTLLRGYIDRIDVAATGELRVVDY KTGKAPPAARALAEFKAMFQMKFYAVALFRSRGVPPTRLRLIYLADGQLLDYSPDRDE LLRFEKTLMAIWRAIQSAGETGDFRPNPSRLCDWCPHQQRCPAFGGTPPPYPGWPTEP AA" CDS complement(2364081..2364563) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2144C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2144c, -, len: 160 aa. Equivalent to Rv2120c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Probable conserved integral membrane protein, similar to hypothetical protein from Mesorhizobium loti (153 aa). Smith-Waterman scores: NP_104030.1 hypothetical protein [Mesorhizobium loti] >gi|14023209|dbj|BAB49816.1| (AP003000) Identities = 50/135 (37%). Mb2144c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y094" /db_xref="InterPro:IPR008816" /db_xref="UniProtKB/TrEMBL:A0A1R3Y094" /protein_id="SIU00751.1" /translation="MTHVLVLLLALLIGVVAGLRSLTAPAVVSWAAFLGWINLHGTWA SWMGNFVTVVIVSVLAVAELVNDKRPKTPPRTVTPVFAVRIILGAFAGAVIGTAWGYR WGGLGAGVIGAVLGTMGGYQARTRLVAARGGHDLPIALLEDSVAVLGGFAIVAAAAAL " CDS complement(2364642..2365496) /codon_start=1 /transl_table=11 /gene="hisG" /locus_tag="BQ2027_MB2145C" /product="atp phosphoribosyltransferase hisg" /note="Mb2145c, hisG, len: 284 aa. Equivalent to Rv2121c, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Probable hisG, ATP phosphoribosyltransferase (EC 2.4.2.17) (see citation below), similar to others e.g. HIS1_ECOLI|P10366 ATP phosphoribosyltransferase from Escherichia coli (299 aa), FASTA scores: opt: 351, E(): 4.5e-20, (31.8% identity in 289 aa overlap); etc. Protein product from Mb2145c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P60760" /db_xref="InterPro:IPR001348" /db_xref="InterPro:IPR011322" /db_xref="InterPro:IPR013115" /db_xref="InterPro:IPR013820" /db_xref="InterPro:IPR015867" /db_xref="InterPro:IPR018198" /db_xref="InterPro:IPR020621" /db_xref="UniProtKB/Swiss-Prot:P60760" /protein_id="SIU00752.1" /translation="MLRVAVPNKGALSEPATEILAEAGYRRRTDSKDLTVIDPVNNVE FFFLRPKDIAIYVGSGELDFGITGRDLVCDSGAQVRERLALGFGSSSFRYAAPAGRNW TTADLAGMRIATAYPNLVRKDLATKGIEATVIRLDGAVEISVQLGVADAIADVVGSGR TLSQHDLVAFGEPLCDSEAVLIERAGTDGQDQTEARDQLVARVQGVVFGQQYLMLDYD CPRSALKKATAITPGLESPTIAPLADPDWVAIRALVPRRDVNGIMDELAAIGAKAILA SDIRFCRF" CDS complement(2365499..2365780) /codon_start=1 /transl_table=11 /gene="hisE" /locus_tag="BQ2027_MB2146C" /standard_name="irg1" /product="phosphoribosyl-amp pyrophosphatase hise" /note="Mb2146c, hisE, len: 93 aa. Equivalent to Rv2122c, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Probable hisE (alternate gene name: irg1), phosphoribosyl-AMP cyclohydrolase (EC 3.6.1.31) (see citation below), similar to N-terminus of e.g. HIS2_SYNY3 P74755 phosphoribosyl-AMP cyclohydrolase (230 aa), FASTA scores; opt: 150, E(): 4e-08, (37.9% identity in 87 aa overlap); etc. Note that previously misnamed hisI. Protein product from Mb2146c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P0A5B2" /db_xref="InterPro:IPR008179" /db_xref="InterPro:IPR021130" /db_xref="UniProtKB/Swiss-Prot:P0A5B2" /protein_id="SIU00753.1" /translation="MQQSLAVKTFEDLFAELGDRARTRPADSTTVAALDGGVHALGKK LLEEAGEVWLAAEHESNDALAEEISQLLYWTQVLMISRGLSLDDVYRKL" CDS 2365907..2367301 /codon_start=1 /transl_table=11 /gene="PPE37" /locus_tag="BQ2027_MB2147" /standard_name="irg2" /product="ppe family protein ppe37" /note="Mb2147, PPE37, len: 464 aa. Equivalent to Rv2123, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (97.9% identity in 473 aa overlap). PPE37 (alternate gene name: irg2), member of the Mycobacterium tuberculosis PPE family of proteins but the C-terminus is not repetitive. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 27 bp in-frame deletion leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (464 aa versus 473 aa)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D4" /protein_id="SIU00754.1" /translation="MTFPMWFAVPPEVPSAWLSTGMGPGPLLAAQYTEIATELASVLA AVQASSWQGPSADRFVVAHQPFRYWLTHAATVATAAAAAHETAAAGYTSALGGMPTLA ELAANHAMHGALVTTNFFGVNTIPIALNEADYLRMWIQAATVMSHYQAVAHESVAATP STPPAPQIVTSAASSAASISFPDPTKLILQLLKDFLELLRYLAVELLPGPLGDLIAQV LDWFISFVSGPVFTFLAYLVLDPLIYFGPFAPLTSPVLLPAGLTGLAGLGAVSGPAGP MVERVHSDGPSRQSWPAATGVTLVGTNPAALVTTPAPAPTTSAAPTAPSTPGSSAAQG LYAVGGPDGEGFNPIAKTTALAGVTTDAAAPAAKLPGDQAQSSASKATRLRRRLRQHR FEFLADDGRLTMPNTPEMADVAAGNRGLDALGFAGTIPKSAPGSATGLTHLGGGFADV LSQPMLPHTWDGSD" CDS complement(2367298..2370876) /codon_start=1 /transl_table=11 /gene="metH" /locus_tag="BQ2027_MB2148C" /product="5-methyltetrahydrofolate--homocystein methyltransferase meth (methionine synthase, vitamin-b12 dependent isozyme) (ms)" /note="Mb2148c, metH, len: 1192 aa. Equivalent to Rv2124c, len: 1192 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1192 aa overlap). Probable metH, methionine synthase (EC 2.1.1.13), similar to many e.g. METH_ECOLI|P13009 5-methyltetrahydrofolate--homocystein methyltransferase from Escherichia coli (1226 aa), FASTA scores: opt: 1446, E(): 0, (32.1% identity in 1223 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO THE VITAMIN-B12 DEPENDENT METHIONINE SYNTHASE FAMILY. Protein product from Mb2148c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2148c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y112" /db_xref="InterPro:IPR000489" /db_xref="InterPro:IPR003726" /db_xref="InterPro:IPR003759" /db_xref="InterPro:IPR004223" /db_xref="InterPro:IPR006158" /db_xref="InterPro:IPR011005" /db_xref="InterPro:IPR011822" /db_xref="InterPro:IPR033706" /db_xref="InterPro:IPR036589" /db_xref="InterPro:IPR036594" /db_xref="InterPro:IPR036724" /db_xref="InterPro:IPR037010" /db_xref="UniProtKB/TrEMBL:A0A1R3Y112" /protein_id="SIU00755.1" /translation="MTAADKHLYDTDLLDVLSQRVMVGDGAMGTQLQAADLTLDDFRG LEGCNEILNETRPDVLETIHRNYFEAGADAVETNTFGCNLSNLGDYDIADRIRDLSQK GTAIARRVADELGSPDRKRYVLGSMGPGTKLPTLGHTEYAVIRDAYTEAALGMLDGGA DAILVETCQDLLQLKAAVLGSRRAMTRAGRHIPVFAHVTVETTGTMLLGSEIGAALTA VEPLGVDMIGLNCATGPAEMSEHLRHLSRHARIPVSVMPNAGLPVLGAKGAEYPLLPD ELAEALAGFIAEFGLSLVGGCCGTTPAHIREVAAAVANIKRPERQVSYEPSVSSLYTA IPFAQDASVLVIGERTNANGSKGFREAMIAEDYQKCLDIAKDQTRDGAHLLDLCVDYV GRDGVADMKALASRLATSSTLPIMLDSTETAVLQAGLEHLGGRCAINSVNYEDGDGPE SRFAKTMALVAEHGAAVVALTIDEEGQARTAQKKVEIAERLINDITGNWGVDESSILI DTLTFTIATGQEESRRDGIETIEAIRELKKRHPDVQTTLGLSNISFGLNPAARQVLNS VFLHECQEAGLDSAIVHASKILPMNRIPEEQRNVALDLVYDRRREDYDPLQELMRLFE GVSAASSKEDRLAELAGLPLFERLAQRIVDGERNGLDADLDEAMTQKPPLQIINEHLL AGMKTVGELFGSGQMQLPFVLQSAEVMKAAVAYLEPHMERSDDDSGKGRIVLATVKGD VHDIGKNLVDIVLSNNGYEVVNIGIKQPIATILEVAEDKSADVVGMSGLLVKSTVVMK ENLEEMNTRGVAEKFPVLLGGAALTRSYVENDLAEIYQGEVHYARDAFEGLKLMDTIM SAKRGEAPDENSPEAIKAREKEAERKARHQRSKRIAAQRKAAEEPVEVPERSDVAADI EVPAPPFWGSRIVKGLAVADYTGLLDERALFLGQWGLRGQRGGEGPSYEDLVETEGRP RLRYWLDRLSTDGILAHAAVVYGYFPAVSEGNDIVVLTEPKPDAPVRYRFHFPRQQRG RFLCIADFIRSRELAAERGEVDVLPFQLVTMGQPIADFANELFASNAYRDYLEVHGIG VQLTEALAEYWHRRIREELKFSGDRAMAAEDPEAKEDYFKLGYRGARFAFGYGACPDL EDRAKMMALLEPERIGVTLSEELQLHPEQSTDAFVLHHPEAKYFNV" CDS 2371102..2371980 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2149" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2149, -, len: 292 aa. Equivalent to Rv2125, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 292 aa overlap). Conserved hypothetical protein. Corresponds to Mycobacterium leprae hypothetical protein e.g. TR:Q49797 B2126_F1_36 (317 aa), FASTA scores; opt: 1648, E(): 0, 84.1% identity in 290 aa overlap. Very similar to Mycobacterium tuberculosis hypothetical protein Rv2714,Mb2149 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR008492" /db_xref="InterPro:IPR019151" /db_xref="InterPro:IPR038389" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D5" /protein_id="SIU00756.1" /translation="MTPSEGNAPLPELHNTVVVAAFEGWNDAGDAASDAVAHLAASWQ ALPIVEIDDEAYYDYQVNRPVIRQVDGVTRELQWPAMRISHCRPPGSDRDVVLMCGVE PNMRWRTFCDELLAVIDKLNVDTVVILGALLADTPHTRPVPVSGAAYSAASARQFGLQ ETRYEGPTGIAGVFQSACVGAGIPAVTFWAAVPHYVSHPPNPKATIALLRRVEDVLDV EVPLADLPAQAEAWEREITETIAEDHELAEYVQTLEQHGDAAVDMNEALGNIDGDALA AEFERYLRRRRPGFGR" CDS complement(2372011..2373078) /codon_start=1 /transl_table=11 /gene="PE_PGRS37" /locus_tag="BQ2027_MB2150C" /product="pe-pgrs family protein pe_pgrs37" /note="Mb2150c, -, len: 256 aa. Equivalent to Rv2126c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 256 aa overlap). Possible PE_PGRS pseudogene fragment, similar to the Gly-rich C-terminus of many members of the Mycobacterium tuberculosis PGRS family e.g. MTCY441.04c (778 aa), FASTA scores; opt: 935, E(): 4.4e-18, 56.1% identity in 271 aa overlap,Mb2150c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0P0" /protein_id="SIU00757.1" /translation="MAGIGSAISAANALVAGPTTALAADRRRRGVDGYRGAVRRECAG IPTDQRAGGRVSRAVCRGPNLGWRVVRGRRDRQRHVGTRSAQCRQCAHPGIVGRPLIG DGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIGG AGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGG LLIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGG TGGAGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGT NPGKAGAPGTQGDSGDPGPPG" CDS 2373425..2374894 /codon_start=1 /transl_table=11 /gene="ansP1" /locus_tag="BQ2027_MB2151" /product="l-asparagine permease ansp1" /note="Mb2151, ansP1, len: 489 aa. Equivalent to Rv2127, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 489 aa overlap). Probable ansP1, L-asparagine permease, integral membrane protein highly similar to many eg. ANSP_ECOLI P77610 L-asparagine permease (L-asparagine transport protein) (516 aa), FASTA scores: opt: 1880, E(): 0, (60.3% identity in 463 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis permeases Rv0346c|MTCY13E10.06c, (72.1% identity in 473 aa overlap) and Rv1704c|MTCI125.26c|cycA. Contains PS00218 Amino acid permeases signature. SEEMS TO BELONG TO THE APC FAMILY. Protein product from Mb2151 detected using SWATH mass spectrometry. Mb2151 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEQ4" /db_xref="InterPro:IPR002293" /db_xref="InterPro:IPR004840" /db_xref="InterPro:IPR004841" /db_xref="UniProtKB/Swiss-Prot:Q7VEQ4" /protein_id="SIU00758.1" /translation="MSAASQRVDAFGEEAGYHKGLKPRQLQMIGIGGAIGTGLFLGAS GRLAKAGPGLFLVYGVCGVFVFLILRALGELVLHRPSSGSFVSYAREFFGEKAAYAVG WMYFLHWAMTSIVDTTAIATYLQRWTIFTVVPQWILALIALTVVLSMNLISVEWFGEL EFWAALIKVLALMAFLVVGTVFLAGRYPVDGHSTGLSLWNNHGGLFPTSWLPLLIVTS GVVFAYSAVELVGTAAGETAEPEKIMPRAINSVVARIAIFYVGSVALLALLLPYTAYK AGESPFVTFFSKIGFHGAGDLMNIVVLTAALSSLNAGLYSTGRVMHSIAMSGSAPRFT ARMSKSGVPYGGIVLTAVITLFGVALNAFKPGEAFEIVLNMSALGIIAGWATIVLCQL RLHKLANAGIMQRPRFRMPFSPYSGYLTLLFLLVVLVTMASDKPIGTWTVATLIIVIP ALTAGWYLVRKRVMAVARERLGHTGPFPAVANPPVRSRD" CDS 2374894..2375097 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2152" /product="conserved transmembrane protein" /note="Mb2152, -, len: 67 aa. Equivalent to Rv2128, len: 67 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Probable conserved transmembrane protein. Mb2152 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0C7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0C7" /protein_id="SIU00759.1" /translation="MLRRGESIIRNRYASKPPLYGMAMVFLAMAVVAVTAYFRMGWWS IIGYAAAAIIGVIGFALAFRDLS" CDS complement(2375117..2375998) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2153C" /product="Probable oxidoreductase" /note="Mb2153c, -, len: 293 aa. Equivalent to Rv2129c, len: 293 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 293 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to many e.g. FABG_SYNY3|P73826 3-oxoacyl-[acyl-carrier protein] reductase (240 aa), FASTA scores: opt: 241, E(): 5.1e-17, (32.7% identity in 196 aa overlap); etc. Also similar to a number of other Mycobacterium tuberculosis oxidoreductases e.g. MTCY210.04 (34.1% identity in 217 aa overlap). Protein product from Mb2153c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2153c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0C0" /protein_id="SIU00760.1" /translation="MTSLQGKVVFITGAARGIGAEVARRLHNKGAKLVLTDLSKSELA VMGAELGGDDRLLTVVADVRDLPAMQAAAETAVERFGGIDVVVANAGIASYGSVLKVD PQAFRRVLDVNLLGNFHTVRATLPALIDRRGYVLIVSSLAAFAAPPGMAPYNMSKAGN EHFANALRLEVAHLGVSVGSAHMSWIDTALVRDTKADLPAFAELLARLPWPLNKTTSV NKCAAAFVNGIEGRKDRVYCPGWVALFRWLKPLLSTRVGQRPIRNTVAKLMPQMDAEV AALGRFASAYTESLENS" CDS complement(2376024..2377268) /codon_start=1 /transl_table=11 /gene="mshc" /locus_tag="BQ2027_MB2154C" /product="cysteine:1d-myo-inosityl 2-amino-2-deoxy--d-glucopyranoside ligase mshc" /note="Mb2154c, cysS2, len: 414 aa. Equivalent to Rv2130c, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 414 aa overlap). Probable cysS2, cysteinyl-tRNA synthetase, similar to many e.g. SYC_ECOLI|P21888 cysteinyl-tRNA synthetase from Escherichia coli (461 aa), FASTA scores: opt: 535, E(): 0, (37.0% identity in 370 aa overlap); etc. Also similar to Mycobacterium tuberculosis cysS|Rv3580c|MTCY06G11.27c, (35.8% identity in 372 aa overlap). Protein product from Mb2154c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2154c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67018" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR017812" /db_xref="InterPro:IPR024909" /db_xref="InterPro:IPR032678" /db_xref="UniProtKB/Swiss-Prot:P67018" /protein_id="SIU00761.1" /translation="MQSWYCPPVPVLPGRGPQLRLYDSADRQVRPVAPGSKATMYVCG ITPYDATHLGHAATYVTFDLIHRLWLDLGHELHYVQNITDIDDPLFERADRDGVDWRD LAQAEVALFCEDMAALRVLPPQDYVGATEAIAEMVELIEKMLACGAAYVIDREMGEYQ DIYFRADATLQFGYESGYDRDTMLRLCEERGGDPRRPGKSDELDALLWRAARPGEPSW PSPFGPGRPGWHVECAAIALSRIGSGLDIQGGGSDLIFPHHEFTAAHAECVSGERRFA RHYVHAGMIGWDGHKMSKSRGNLVLVSALRAQDVEPSAVRLGLLAGHYRADRFWSQQV LDEATARLHRWRTATALPAGPAAVDVVARVRRYLADDLDTPKAIAALDGWVTDAVEYG GHDAGAPKLVATAIDALLGVDL" CDS complement(2377326..2378129) /codon_start=1 /transl_table=11 /gene="cysQ" /locus_tag="BQ2027_MB2155C" /product="monophosphatase cysq" /note="Mb2155c, cysQ, len: 267 aa. Equivalent to Rv2131c, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 267 aa overlap). cysQ, equivalent to Mycobacterium leprae CYSQ_MYCLE P46726 cysQ protein homolog (289 aa). Contains inositol monophosphatase family signature 1 (PS00629), significance uncertain. FASTA best: CYSQ_MYCLE P4672 6 cysq protein homolog (289 aa) opt: 1374, E(): 0; (77.3% identity in 264 aa overlap) Protein product from Mb2155c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2155c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65164" /db_xref="InterPro:IPR000760" /db_xref="InterPro:IPR020583" /db_xref="UniProtKB/Swiss-Prot:P65164" /protein_id="SIU00762.1" /translation="MVSPAAPDLTDDLTDAELAADLAADAGKLLLQVRAEIGFDQPWT LGEAGDRQANSLLLRRLQAERPGDAVLSEEAHDDLARLKSDRVWIIDPLDGTREFSTP GRDDWAVHIALWRRSSNGQPEITDAAVALPARGNVVYRTDTVTSGAAPAGVPGTLRIA VSATRPPAVLHRIRQTLAIQPVSIGSAGAKAMAVIDGYVDAYLHAGGQWEWDSAAPAG VMLAAGMHASRLDGSPLRYNQLDPYLPDLLMCRAEVAPILLGAIADAWR" CDS 2378220..2378450 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2156" /product="Antitoxin to Toxin 1, PIN domain" /note="Mb2156, -, len: 76 aa. Equivalent to Rv2132, len: 76 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 76 aa overlap). Conserved hypothetical protein. Function unknown but belongs to Mycobacterium tuberculosis protein family including Rv2871, Rv1241, Rv3321c, Rv1113, Rv0657c, Rv1560, Rv2104c, etc. Similarity to Mycobacterium tuberculosis protein Rv2871 (AL021924|MTV020_4, 84 aa). FASTA score: opt: 142, E(): 0.00036; 41.8% identity in 55 aa overlap Protein product from Mb2156 detected using SWATH mass spectrometry. Mb2156 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0A2" /db_xref="InterPro:IPR002145" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0A2" /protein_id="SIU00763.1" /translation="MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVA NRFQQQTYDMGEGIDYSNIGDAIETLDGPASG" CDS complement(2378660..2379448) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2157C" /product="Phosphatidylinositol 3- and 4-kinase" /note="Mb2157c, -, len: 262 aa. Equivalent to Rv2133c, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49774. FASTA best: Q49774 B2126_C1_150 (262 aa) opt: 1447, E(): 0; (79.0% identity in 262 aa overlap),Mb2157c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR022292" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F0" /protein_id="SIU00764.1" /translation="MLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERP LWDFPDGTLAGRELSAYLVSTQLGWNLVPHTIIRDGPAGIGMLQLWVQQPGDAVDSDP LPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIRLRRMAVFDVLINNADR KGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPIDDQILQAVAGLADALGGP LAEALAGRIAAAEIGALRRRAQSLLDQPVMPGPNGHRPIPWPAF" CDS complement(2379459..2380046) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2158C" /product="conserved protein" /note="Mb2158c, -, len: 195 aa. Equivalent to Rv2134c, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 195 aa overlap). Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49789. FASTA best: Q49789 B2126_C3_228, opt: 1192, E(): 0 (91.1% identity in 192 aa overlap) Protein product from Mb2158c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2158c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021441" /db_xref="UniProtKB/TrEMBL:A0A1R3Y123" /protein_id="SIU00765.1" /translation="MARAIHVFRTPDRFVAGTVGQPGNRTFYLQAVHDSRVVSVVLEK QQVAVLAERIGALLFEVNRRFGTPVPPEPTEIDDLSPLIMPVDAEFRVGTMGLGWDSE AQSVVVELLAVTDAEFDASVVLDDTEEGPDAVRVFLTPESARQFATRSYRVISAGRPP CPLCDEPLDPEGHICARTNGYRRDVLLGSGDDPAG" CDS complement(2380110..2380820) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2159C" /product="Phosphoglycerate mutase family protein" /note="Mb2159c, -, len: 236 aa. Equivalent to Rv2135c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49773. FASTA best: Q49773 B2126_C1_148 opt: 1183, E() : 0; (74.8% identity in 250 aa overlap), also similar in C-terminus to PMG2_ECOLI P36942 probable phosphoglycerate mutase 2 (215 aa), FASTA scores; opt: 212, E(): 2.5e-07 27.9% identity in 190 aa overlap; and to Rv2228 and Rv2419c Protein product from Mb2159c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2159c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR022492" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E8" /protein_id="SIU00766.1" /translation="MTVILLRHARSTSNTAGVLAGRSGVDLDEKGREQATGLIDRIGD LPIRAVASSPMLRCQRTVEPLAEALCLEPLIDDRFSEVDYGEWTGRKIGDLVDEPLWR VVQAHPSAAVFPGGEGLAQVQTRAVAAVREHDRRLADQHGHDVLWLACTHGDVIKAVI ADAFGMHLDSFQRITADPGSVSVVRYTQLRPFVLHVNHTGARLAPALQAAASAQGASP EPNAAVPPGDAVIGGSTD" CDS complement(2380817..2381647) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2160C" /product="Possible conserved transmembrane protein" /note="Mb2160c, -, len: 276 aa. Equivalent to Rv2136c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 276 aa overlap). Possible conserved transmembrane protein, very similar to hypothetical Mycobacterium leprae protein Q49783. FASTA best: Q49783 B2126_C2_190 opt: 1023, E(): 0; (82.4% identity in 187 aa over lap) similar to BACA_ECOLI P31054 bacitracin resistance protein (273 aa) opt: 477, E(): 7e-26, (35.6% identity in 267 aa overlap),Mb2160c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEQ2" /db_xref="InterPro:IPR003824" /db_xref="UniProtKB/Swiss-Prot:Q7VEQ2" /protein_id="SIU00767.1" /translation="MSWWQVIVLAAAQGLTEFLPVSSSGHLAIVSRIFFSGDAGASFT AVSQLGTEAAVVIYFARDIVRILSAWVHGLVVKAHRNTDYRLGWYVIIGTIPICILGL FFKDDIRSGVRNLWVVVTALVVFSGVIALAEYVGRQSRHIERLTWRDAVVVGIAQTLA LVPGVSRSGSTISAGLFLGLDRELAARFGFLLAIPAVFASGLFSLPDAFHPVTEGMSA TGPQLLVATLIAFVLGLTAVAWLLRFLVRHNMYWFVGYRVLVGTGMLVLLATGTVAAT " CDS complement(2381711..2382124) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2161C" /product="secretion/lipid metabolism" /note="Mb2161c, -, len: 137 aa. Equivalent to Rv2137c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Conserved hypothetical protein. C-terminus is very similar to hypothetical Mycobacterium leprae protein B2126_C2_188 (150 aa). FASTA best: Q49782 B2126_C2_188. (150 aa) opt: 469, E(): 9.6e-28; (77.2% identity in 101 aa overlap) Protein product from Mb2161c detected using SWATH mass spectrometry. Mb2161c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y093" /protein_id="SIU00768.1" /translation="MRNMKSTSHESESGKLLSISSCRPREMVLQRYSLGMTVTADRHL ADKREEFAVEDISTGIFASGYGQVGDGRSFSFHIEHRSLVVEIYRPRVAGPVPQAEDV VAMAVRGLVDIDLTDERSLAAAVRDSVASAAPVSR" CDS 2382139..2383215 /codon_start=1 /transl_table=11 /gene="lppL" /locus_tag="BQ2027_MB2162" /product="Probable conserved lipoprotein LppL" /note="Mb2162, lppL, len: 358 aa. Equivalent to Rv2138, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 358 aa overlap). Probable lppL, conserved lipoprotein, with appropriately placed lipoprotein signature (PS00013) strongly similar to hypothetical Mycobacterium leprae protein, Q49806. FASTA best: Q49806 B2126_F3_142. (298 aa) opt: 1495, E(): 0; (75.3% identity in 300 aa overlap). Protein product from Mb2162 detected using SWATH mass spectrometry. Mb2162 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR015943" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D3" /protein_id="SIU00769.1" /translation="MLTGNKPAVQRRFIGLLMLSVLVAGCSSNPLANFAPGYPPTIEP AQPAVSPPTSQDPAGAVRPLSGHPRAALFDNGTRQLVALRPGADSAAPASIMVFDDVH VAPRVIFLPGPAAALTSDDHGTAFLAARGGYFVADLSSGHTARVNVADAAHTDFTAIA RRSDGKLVLGSADGAVYTLAKNPAVDPASGAATVASRTKIFARVDALVTQGNTTVVLD RGQTSVTTIGADGHAQQALRAGQGATTMAADPLGRVLIADTRGGQLLVYGVDPLILRQ AYPVRQAPYGLAGSRELAWVSQTASNTVIGYDLTTGIPVEKVRYPTVQQPNSLAFDET SDTLYVVSGSGAGVQVIEHAAGTR" CDS 2383529..2384602 /codon_start=1 /transl_table=11 /gene="pyrD" /locus_tag="BQ2027_MB2163" /product="probable dihydroorotate dehydrogenase pyrd" /note="Mb2163, pyrD, len: 357 aa. Equivalent to Rv2139, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 357 aa overlap). Probable pyrD, dihydroorotate dehydrogenase (EC 1.3.3.1); contains dihydroorotate dehydrogenase signatures 1 and 2 (PS00911, PS00912). FASTA best: PYRD_MYCLE P46727 dihydroorotate dehydrogenase (309 aa) opt: 1653, E(): 0; (82.6% identity in 304 aa overlap) Protein product from Mb2163 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2163 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65909" /db_xref="InterPro:IPR001295" /db_xref="InterPro:IPR005719" /db_xref="InterPro:IPR005720" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:P65909" /protein_id="SIU00770.1" /translation="MYPLVRRLLFLIPPEHAHKLVFAVLRGVAAVAPVRRLLRRLLGP TDPVLASTVFGVRFPAPLGLAAGFDKDGTALSSWGAMGFGYAEIGTVTAHPQPGNPAP RLFRLADDRALLNRMGFNNHGARALAIRLARHRPEIPIGVNIGKTKKTPAGDAVNDYR ASARMVGPLASYLVVNVSSPNTPGLRDLQAVESLRPILSAVRAETSTPVLVKIAPDLS DSDLDDIADLAVELDLAGIVATNTTVSRDGLTTPGVDRLGPGGISGPPLAQRAVQVLR RLYDRVGDRLALISVGGIETADDAWERITAGASLLQGYTGFIYGGERWAKDIHEGIAR RLHDGGFGSLHEAVGSARRRQPS" CDS complement(2384607..2385137) /codon_start=1 /transl_table=11 /gene="TB18.6" /locus_tag="BQ2027_MB2164C" /product="Phospholipid-binding protein" /note="Mb2164c, TB18.6, len: 176 aa. Equivalent to Rv2140c, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). TB18.6, conserved hypothetical protein; shows good similarity to hypothetical proteins from Streptomyces coelicolor (177 aa; 58% identity) >emb|CAC32358.1| (AL583945) and to 17.1 kd Escherichia coli protein YbhB. FASTA best: YBHB_ECOLI P12994 hypothetical 17.1 kd protein (158 aa) opt: 465 E(): 2e-23; (46.2% identity in 156 aa overlap). Protein product from Mb2164c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2164c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR005247" /db_xref="InterPro:IPR008914" /db_xref="InterPro:IPR036610" /db_xref="UniProtKB/Swiss-Prot:P67227" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00771.1" /translation="MTTSPDPYAALPKLPSFSLTSTSITDGQPLATPQVSGIMGAGGA DASPQLRWSGFPSETRSFAVTVYDPDAPTLSGFWHWAVANLPANVTELPEGVGDGREL PGGALTLVNDAGMRRYVGAAPPPGHGVHRYYVAVHAVKVEKLDLPEDASPAYLGFNLF QHAIARAVIFGTYEQR" CDS complement(2385185..2386531) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2165C" /product="YscS-like amidohydrolase" /note="Mb2165c, -, len: 448 aa. Equivalent to Rv2141c, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 448 aa overlap). Conserved hypothetical protein. Shows some similarity to conserved hypothetical proteins and to acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase and contains ArgE/dapE/ACY1/CPG2/yscS family signature 1 (PS00758). FASTA best: CBPS_YEAST P27614 carboxypeptidases precursor (576 aa) opt: 234, E(): 4.3e-08; (24.3% identity in 412 aa overlap). Previously named dapE2 Protein product from Mb2165c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2165c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0E1" /db_xref="InterPro:IPR001261" /db_xref="InterPro:IPR002933" /db_xref="InterPro:IPR011650" /db_xref="InterPro:IPR036264" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E1" /protein_id="SIU00772.1" /translation="MTDETGASSDHSDDVAQVVSRLIRFDTTNSGEPGTTKGEAECAR WVAEQLAEVGYQPEYVESGAPGRGNVFARLAGADSSRGALLIHGHLDVVPAEPAEWSV HPFSGAIEDGYVWGRGAVDMKDMVGMMIVVARHLRQAAIVPPRDLVFAFVADEEHGGK YGSHWLVDNRPDLFDGITEAIGEVGGFSLTVPRHDGGERRLYLIETAEKGIQWMRLTA RGRAGHGSMVHDQNAVTAVCEAVARLGRHQFPLVCTDTVAQFLAVVGEETGLAFDLDS PDLAGTIDKLGPMARMLKAVLHDTANPTMLKAGYKANVVPATAEAVVDCRVLPGRRAA FEAEVDALIGPDVTREWVSDLPSYETTFDGDLVAAMNAAVLAVDPDGRTVPYMLSGGT DAKAFARLGIRCFGFSPLRLPPDLDFTSLFHGVDERVPIDGLRFGTEVLTHLLTHC" tRNA 2386912..2386997 /locus_tag="BQ2027_LEUU" /product="tRNA-Leu" /note="leuU, len: 86 nt. Equivalent to leuU, len: 86 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 nt overlap). leu-tRNA, anticodon gag." CDS complement(2387118..2387435) /codon_start=1 /transl_table=11 /gene="pare2" /locus_tag="BQ2027_MB2166C" /product="possible toxin pare2" /note="Mb2166c, -, len: 105 aa. Equivalent to Rv2142c, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Hypothetical unknown protein. Mb2166c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007712" /db_xref="InterPro:IPR035093" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0A6" /protein_id="SIU00773.1" /translation="MTRRLRVHNGVEDDLFEAFSYYADAAPDQIDRLYNLFVDAVTKR IPQAPNAFAPLFKHYRHIYLRPFRYYVAYRTTDEAIDILAVRHGMENPNAVEAEISGR TFE" CDS complement(2387432..2387647) /codon_start=1 /transl_table=11 /gene="parD2" /locus_tag="BQ2027_MB2166A" /product="Possible antitoxin ParD2" /note="Mb2166A, len: 71 aa. Equivalent to Rv2142A len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible parD2, antitoxin, part of toxin-antitoxin (TA) operon with Rv2142c (See Pandey and Gerdes, 2005). Protein product from Mb2166A detected using SWATH mass spectrometry. Mb2166A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013406" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F4" /protein_id="SIU00774.1" /translation="MVVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALI EARANDTDDAHWSTIDDFDKRIRARLG" CDS 2387902..2388960 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2167" /product="Phosphoribosyl transferase domain protein" /note="Mb2167, -, len: 352 aa. Equivalent to Rv2143, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Conserved hypothetical protein, strongly similar to two hypothetical mycobacterial proteins Rv2030c 2.1e-50 and Rv0571c from position 120 (Q50819; Q50111). FASTA best: Q50819 opt: 882, E() 0; (61.1% identity in 226 aa overlap). Also similar to AL021942|MTV039_9 (443 aa), FASTA scores: opt: 592, E(): 5e-30; 46.9% identity in 224 aa overlap,Mb2167 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y135" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR029057" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3Y135" /protein_id="SIU00775.1" /translation="MEAPPYAGDPTFERLRRSFQPADLLPELQAAGVHYTIAVEAADD PAENESLLATARHHDWIARVIGWVPLADPDEVTESSTHGRHRPDASWRRDLRCPGLLP PGCHQPVLVVGLVGQQPEMRPMNPPSGFLRRTPTRRFRDRRDAGRVLADELASYRGRD RLLVLGLARGGVPVGWEVASALGAELDVFLVRKLGVPQWRELAMGALASGGGVVMNDD VVSSLRITDQQVRAAIDSETAELQRRELAYRGGRPVVDPRARIVILVDDGIATGASML AAVRTIRATGPESIVVAVPVGPATACRELAAEADDVVCATMPAAFEAVGQVYNDFHQV TDDEVRELLATPTTGAAT" CDS complement(2389090..2389446) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2168C" /product="Probable transmembrane protein" /note="Mb2168c, -, len: 118 aa. Equivalent to Rv2144c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Probable transmembrane protein. Mb2168c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0F4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F4" /protein_id="SIU00776.1" /translation="MLIIALVLALIGLLALVFAVVTSNQLVAWVCIGASVLGVALLIV DALRERQQGGADEADGAGETGVAEEADVDYPEEAPEESQAVDAGVIGSEEPSEEASEA TEESAVSADRSDDSAK" CDS complement(2389541..2390323) /codon_start=1 /transl_table=11 /gene="wag31" /locus_tag="BQ2027_MB2169C" /standard_name="ag84" /product="diviva family protein wag31" /note="Mb2169c, wag31, len: 260 aa. Equivalent to Rv2145c, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). wag31 (alternate gene name: ag84). Function unknown but corresponds to antigen 84 of Mycobacterium tuberculosis (wag31) (see first citation below). Predicted to contain significant amount of coiled coil structure. Some similarity to Rv1682 and Rv2927c. FASTA best: AG84_MYCTU P46816 antigen 84. Protein product from Mb2169c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2169c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5N3" /db_xref="InterPro:IPR007793" /db_xref="InterPro:IPR019933" /db_xref="UniProtKB/Swiss-Prot:P0A5N3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00777.1" /translation="MPLTPADVHNVAFSKPPIGKRGYNEDEVDAFLDLVENELTRLIE ENSDLRQRINELDQELAAGGGAGVTPQATQAIPAYEPEPGKPAPAAVSAGMNEEQALK AARVLSLAQDTADRLTNTAKAESDKMLADARANAEQILGEARHTADATVAEARQRADA MLADAQSRSEAQLRQAQEKADALQADAERKHSEIMGTINQQRAVLEGRLEQLRTFERE YRTRLKTYLESQLEELGQRGSAAPVDSNADAGGFDQFNRGKN" CDS complement(2390591..2390881) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2170C" /product="Possible conserved transmembrane protein" /note="Mb2170c, -, len: 96 aa. Equivalent to Rv2146c, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). Possible conserved transmembrane protein, orthologs present in Mycobacterium leprae, ML0921 (96 aa) and Streptomyces coelicolor. Second start taken GTG alternative upstream but much less probable in TBparse. FASTA best: Q44935 SIMILAR TO A HYPOTHETICAL INTEGRAL MEMBRANE PROT EIN (97 aa) opt: 105, E(): 0.093; (25.3% identity in 87 aa overlap). >emb|CAC31302.1| (AL583920) possible membrane protein ML0921 [Mycobacterium leprae] E(): 5e-32 (76% identity in 96 aa overlap),Mb2170c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0A1" /db_xref="InterPro:IPR003425" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0A1" /protein_id="SIU00778.1" /translation="MVVFFQILGFALFIFWLLLIARVVVEFIRSFSRDWRPTGVTVVI LEIIMSITDPPVKVLRRLIPQLTIGAVRFDLSIMVLLLVAFIGMQLAFGAAA" CDS complement(2391043..2391768) /codon_start=1 /transl_table=11 /gene="sepF" /locus_tag="BQ2027_MB2171C" /product="SepF, FtsZ-interacting protein related to cell division" /note="Mb2171c, -, len: 241 aa. Equivalent to Rv2147c, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical proteins in Mycobacterium leprae ML0920 (210 aa) and Streptomyces coelicolor. FASTA scores: >emb|CAC31301.1| (AL583920) hypothetical protein ML0920 hypothetical protein (210 aa) opt: 1242, E(): 5.7e-74; 83.486% identity in 218 aa overlap Protein product from Mb2171c detected using SWATH mass spectrometry. Mb2171c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYZ5" /db_xref="InterPro:IPR007561" /db_xref="InterPro:IPR023052" /db_xref="InterPro:IPR038594" /db_xref="UniProtKB/Swiss-Prot:Q7TYZ5" /protein_id="SIU00779.1" /translation="MNSHCSHTFITDNRSPRARRGHAMSTLHKVKAYFGMAPMEDYDD EYYDDRAPSRGYARPRFDDDYGRYDGRDYDDARSDSRGDLRGEPADYPPPGYRGGYAD EPRFRPREFDRAEMTRPRFGSWLRNSTRGALAMDPRRMAMMFEDGHPLSKITTLRPKD YSEARTIGERFRDGSPVIMDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLS PADVDVSPEERRRIAETGFYAYQ" CDS complement(2391765..2392541) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2172C" /product="Pyridoxal phosphate-containing protein YggS" /note="Mb2172c, -, len: 258 aa. Equivalent to Rv2148c, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 258 aa overlap). Conserved hypothetical protein; should belong to the YGGS/YBL036C/F09E5.8 family. FASTA best: AB003132|AB003132_5 Corynebacterium glutamicum gene (221 aa) opt: 440, E(): 2.3e-23; 42.8% identity in 236 aa overlap; and YPI1_VIBAL P52055 hypothetical protein in pilt-proc intergenic region in Vibrio alginolyticus. opt: 266, E(): 1.8e-11; 27.9% identity in 244 aa overlap. Protein product from Mb2172c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2172c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67084" /db_xref="InterPro:IPR001608" /db_xref="InterPro:IPR011078" /db_xref="InterPro:IPR029066" /db_xref="UniProtKB/Swiss-Prot:P67084" /protein_id="SIU00780.1" /translation="MAADLSAYPDRESELTHALAAMRSRLAAAAEAAGRNVGEIELLP ITKFFPATDVAILFRLGCRSVGESREQEASAKMAELNRLLAAAELGHSGGVHWHMVGR IQRNKAGSLARWAHTAHSVDSSRLVTALDRAVVAALAEHRRGERLRVYVQVSLDGDGS RGGVDSTTPGAVDRICAQVQESEGLELVGLMGIPPLDWDPDEAFDRLQSEHNRVRAMF PHAIGLSAGMSNDLEVAVKHGSTCVRVGTALLGPRRLRSP" CDS complement(2392547..2393299) /codon_start=1 /transl_table=11 /gene="yfiH" /locus_tag="BQ2027_MB2173C" /product="FIG00003370: Multicopper polyphenol oxidase" /note="Mb2173c, yfiH, len: 250 aa. Equivalent to Rv2149c, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). yfiH; corresponds to hypothetical 25.3 kDa YfiH protein in ftsZ 3' region of Streptomyces griseus, and to YfiH proteins in other bacteria. Belongs to UPF0124 Family. FASTA best: YFIH_STRGR P45496, (246 aa) opt: 722, E(): 1.9e-37; (49.4% identity in 245 aa overlap) Protein product from Mb2173c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2173c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67257" /db_xref="InterPro:IPR003730" /db_xref="InterPro:IPR011324" /db_xref="InterPro:IPR038371" /db_xref="UniProtKB/Swiss-Prot:P67257" /protein_id="SIU00781.1" /translation="MLASTRHIARGDTGNVSVRIRRVTTTRAGGVSAPPFDTFNLGDH VGDDPAAVAANRARLAAAIGLPGNRVVWMNQVHGDRVELVDQPRNTALDDTDGLVTAT PRLALAVVTADCVPVLMADARAGIAAAVHAGRAGAQRGVVVRALEVMLSLGAQVRDIS ALLGPAVSGRNYEVPAAMADEVEAALPGSRTTTAAGTPGVDLRAGIACQLRDLGVESI DVDPRCTVADPTLFSHRRDAPTGRFASLVWME" CDS complement(2393310..2394449) /codon_start=1 /transl_table=11 /gene="ftsZ" /locus_tag="BQ2027_MB2174C" /product="cell division protein FtsZ" /note="Mb2174c, ftsZ, len: 379 aa. Equivalent to Rv2150c, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 379 aa overlap). ftsZ, cell division protein (see first citation below). Contains FtsZ protein signature 2 (PS01135). FASTA best: FTSZ_STRCO P45500 cell division protein FtsZ (399 aa) opt: 1674, E(): 0; (77.3% identity in 339 aa overlap). Protein product from Mb2174c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2174c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64171" /db_xref="InterPro:IPR000158" /db_xref="InterPro:IPR003008" /db_xref="InterPro:IPR008280" /db_xref="InterPro:IPR018316" /db_xref="InterPro:IPR020805" /db_xref="InterPro:IPR024757" /db_xref="InterPro:IPR036525" /db_xref="InterPro:IPR037103" /db_xref="UniProtKB/Swiss-Prot:P64171" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00782.1" /translation="MTPPHNYLAVIKVVGIGGGGVNAVNRMIEQGLKGVEFIAINTDA QALLMSDADVKLDVGRDSTRGLGAGADPEVGRKAAEDAKDEIEELLRGADMVFVTAGE GGGTGTGGAPVVASIARKLGALTVGVVTRPFSFEGKRRSNQAENGIAALRESCDTLIV IPNDRLLQMGDAAVSLMDAFRSADEVLLNGVQGITDLITTPGLINVDFADVKGIMSGA GTALMGIGSARGEGRSLKAAEIAINSPLLEASMEGAQGVLMSIAGGSDLGLFEINEAA SLVQDAAHPDANIIFGTVIDDSLGDEVRVTVIAAGFDVSGPGRKPVMGETGGAHRIES AKAGKLTSTLFEPVDAVSVPLHTNGATLSIGGDDDDVDVPPFMRR" CDS complement(2394622..2395566) /codon_start=1 /transl_table=11 /gene="ftsQ" /locus_tag="BQ2027_MB2175C" /product="POSSIBLE CELL DIVISION PROTEIN FTSQ" /note="Mb2175c, ftsQ, len: 314 aa. Equivalent to Rv2151c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Possible ftsQ, cell division protein, with some homology to FTSQ_STRGR|P45503 cell division protein ftsq homolog from Streptomyces griseus (208 aa), FASTA scores: opt: 204, E(): 4e-05; (30.6% identity in 193 aa overlap). Protein product from Mb2175c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2175c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64169" /db_xref="InterPro:IPR005548" /db_xref="InterPro:IPR013685" /db_xref="InterPro:IPR026579" /db_xref="InterPro:IPR034746" /db_xref="UniProtKB/Swiss-Prot:P64169" /protein_id="SIU00783.1" /translation="MTEHNEDPQIERVADDAADEEAVTEPLATESKDEPAEHPEFEGP RRRARRERAERRAAQARATAIEQARRAAKRRARGQIVSEQNPAKPAARGVVRGLKALL ATVVLAVVGIGLGLALYFTPAMSAREIVIIGIGAVSREEVLDAARVRPATPLLQIDTQ QVADRVATIRRVASARVQRQYPSALRITIVERVPVVVKDFSDGPHLFDRDGVDFATDP PPPALPYFDVDNPGPSDPTTKAALQVLTALHPEVASQVGRIAAPSVASITLTLADGRV VIWGTTDRCEEKAEKLAALLTQPGRTYDVSSPDLPTVK" CDS complement(2395563..2397047) /codon_start=1 /transl_table=11 /gene="murC" /locus_tag="BQ2027_MB2176C" /product="probable udp-n-acetylmuramate-alanine ligase murc" /note="Mb2176c, murC, len: 494 aa. Equivalent to Rv2152c, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 494 aa overlap). Probable murC, UDP-N-acetylmuramate-alanine ligase (EC 6.3.2.8). FASTA best: MURC_ECOLI P17952 (491 aa) opt: 764, E(): 0; (36.9% identity in 474 aa overlap) Protein product from Mb2176c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2176c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65473" /db_xref="InterPro:IPR000713" /db_xref="InterPro:IPR004101" /db_xref="InterPro:IPR005758" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR036565" /db_xref="InterPro:IPR036615" /db_xref="UniProtKB/Swiss-Prot:P65473" /protein_id="SIU00784.1" /translation="MSTEQLPPDLRRVHMVGIGGAGMSGIARILLDRGGLVSGSDAKE SRGVHALRARGALIRIGHDASSLDLLPGGATAVVTTHAAIPKTNPELVEARRRGIPVV LRPAVLAKLMAGRTTLMVTGTHGKTTTTSMLIVALQHCGLDPSFAVGGELGEAGTNAH HGSGDCFVAEADESDGSLLQYTPHVAVITNIESDHLDFYGSVEAYVAVFDSFVERIVP GGALVVCTDDPGGAALAQRATELGIRVLRYGSVPGETMAATLVSWQQQGVGAVAHIRL ASELATAQGPRVMRLSVPGRHMALNALGALLAAVQIGAPADEVLDGLAGFEGVRRRFE LVGTCGVGKASVRVFDDYAHHPTEISATLAAARMVLEQGDGGRCMVVFQPHLYSRTKA FAAEFGRALNAADEVFVLDVYGAREQPLAGVSGASVAEHVTVPMRYVPDFSAVAQQVA AAASPGDVIVTMGAGDVTLLGPEILTALRVRANRSAPGRPGVLG" CDS complement(2397044..2398276) /codon_start=1 /transl_table=11 /gene="murG" /locus_tag="BQ2027_MB2177C" /product="probable upd-n-acetylglucosamine-n- acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol-n-acetylglucosamine transferase murg" /note="Mb2177c, murG, len: 410 aa. Equivalent to Rv2153c, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 410 aa overlap). Probable MURG PROTEIN (UPD-N-acetylglucosamine-N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol-N-acetylglucosamine transferase. FASTA score: MURG_BACSU P37585 murg protein (363 aa) opt: 494, E(): 1.1e-20; (27.9% identity in 365 aa overlap) Protein product from Mb2177c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2177c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEP8" /db_xref="InterPro:IPR004276" /db_xref="InterPro:IPR006009" /db_xref="InterPro:IPR007235" /db_xref="UniProtKB/Swiss-Prot:Q7VEP8" /protein_id="SIU00785.1" /translation="MKDTVSQPAGGRGATAPRPADAASPSCGSSPSADSLSVVLAGGG TAGHVEPAMAVADALVALDPRVRITALGTPRGLETRLVPQRGYHLELITAVPMPRKPG GDLARLPSRVWRAVREARDVLDDVDADVVVGFGGYVALPAYLAARGLPLPPRRRRRIP VVIHEANARAGLANRVGAHTADRVLSAVPDSGLRRAEVVGVPVRASIAALDRAVLRAE ARAHFGFPDDARVLLVFGGSQGAVSLNRAVSGAAADLAAAGVCVLHAHGPQNVLELRR RAQGDPPYVAVPYLDRMELAYAAADLVICRAGAMTVAEVSAVGLPAIYVPLPIGNGEQ RLNALPVVNAGGGMVVADAALTPELVARQVAGLLTDPARLAAMTAAAARVGHRDAAGQ VARAALAVATGAGARTTT" CDS complement(2398273..2399847) /codon_start=1 /transl_table=11 /gene="ftsW" /locus_tag="BQ2027_MB2178C" /product="FtsW-like protein FtsW" /note="Mb2178c, ftsW, len: 524 aa. Equivalent to Rv2154c, len: 524 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 524 aa overlap). Probable ftsW, cell division protein, related to MTCY10H4.17c, 3.2e-17. FASTA best: SP5E_BACSU P07373 stage V sporulation protein E (366 aa) opt: 755, E(): 1.6e-33; (38.4% identity in 357 aa overlap),Mb2178c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63763" /db_xref="InterPro:IPR001182" /db_xref="InterPro:IPR013437" /db_xref="InterPro:IPR018365" /db_xref="UniProtKB/Swiss-Prot:P63763" /protein_id="SIU00786.1" /translation="MLTRLLRRGTSDTDGSQTRGAEPVEGQRTGPEEASNPGSARPRT RFGAWLGRPMTSFHLIIAVAALLTTLGLIMVLSASAVRSYDDDGSAWVIFGKQVLWTL VGLIGGYVCLRMSVRFMRRIAFSGFAITIVMLVLVLVPGIGKEANGSRGWFVVAGFSM QPSELAKMAFAIWGAHLLAARRMERASLREMLIPLVPAAVVALALIVAQPDLGQTVSM GIILLGLLWYAGLPLRVFLSSLAAVVVSAAILAVSAGYRSDRVRSWLNPENDPQDSGY QARQAKFALAQGGIFGDGLGQGVAKWNYLPNAHNDFIFAIIGEELGLVGALGLLGLFG LFAYTGMRIASRSADPFLRLLTATTTLWVLGQAFINIGYVIGLLPVTGLQLPLISAGG TSTAATLSLIGIIANAARHEPEAVAALRAGRDDKVNRLLRLPLPEPYLPPRLEAFRDR KRANPQPAQTQPARKTPRTAPGQPARQMGLPPRPGSPRTADPPVRRSVHHGAGQRYAG QRRTRRVRALEGQRYG" CDS complement(2399859..2401319) /codon_start=1 /transl_table=11 /gene="murD" /locus_tag="BQ2027_MB2179C" /product="probable udp-n-acetylmuramoylalanine-d-glutamate ligase murd" /note="Mb2179c, -, len: 486 aa. Equivalent to Rv2155c, len: 486 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 486 aa overlap). Probable murD, UDP-N-acetylmuramoylalanine-D-glutamate ligase (EC 6.3.2.9). FASTA best: MURD_BACSU Q03522 (451 aa) opt: 534, E(): 2.7e-25; (28.8% identity in 483 aa overlap); contains PS01011 Folylpolyglutamate synthase signature 1 Protein product from Mb2179c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2179c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VEP7" /db_xref="InterPro:IPR004101" /db_xref="InterPro:IPR005762" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR036565" /db_xref="InterPro:IPR036615" /db_xref="UniProtKB/Swiss-Prot:Q7VEP7" /protein_id="SIU00787.1" /translation="MLDPLGPGAPVLVAGGRVTGQAVAAVLTRFGATPTVCDDDPVML RPHAERGLPTVSSSDAVQQITGYALVVASPGFSPATPLLAAAAAAGVPIWGDVELAWR LDAAGCYGPPRSWLVVTGTNGKTTTTSMLHAMLIAGGRRAVLCGNIGSAVLDVLDEPA ELLAVELSSFQLHWAPSLRPEAGAVLNIAEDHLDWHATMAEYTAAKARVLTGGVAVAG LDDSRAAALLDGSPAQVRVGFRLGEPAAGELGVRDAHLVDRAFSDDLTLLPVASIPVP GPVGVLDALAAAALARSVGVPAGAIADAVTSFRVGRHRAEVVAVADGITYVDDSKATN PHAARASVLAYPRVVWIAGGLLKGASLHAEVAAMASRLVGAVLIGRDRAAVAEALSRH APDVPVVQVVAGEDTGMPATVEVPVACVLDVAKDDKAGETVGAAVMTAAVAAARRMAQ PGDTVLLAPAGASFDQFTGYADRGEAFATAVRAVIR" CDS complement(2401321..2402400) /codon_start=1 /transl_table=11 /gene="murX" /locus_tag="BQ2027_MB2180C" /product="probable phospho-n-acetylmuramoyl- pentappeptidetransferase murx" /note="Mb2180c, murX, len: 359 aa. Equivalent to Rv2156c, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable murX, phospho-N-acetylmuramoyl-pentappeptidetransferase (EC 2.7.8 .13). FASTA best: MRAY_ECOLI P15876 (360 aa) opt: 572 z-sco re: 651.6 E(): 2.7e-29; (35.8% identity in 344 aa overlap) Protein product from Mb2180c detected using SWATH mass spectrometry. Mb2180c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64260" /db_xref="InterPro:IPR000715" /db_xref="InterPro:IPR003524" /db_xref="InterPro:IPR018480" /db_xref="UniProtKB/Swiss-Prot:P64260" /protein_id="SIU00788.1" /translation="MRQILIAVAVAVTVSILLTPVLIRLFTKQGFGHQIREDGPPSHH TKRGTPSMGGVAILAGIWAGYLGAHLAGLAFDGEGIGASGLLVLGLATALGGVGFIDD LIKIRRSRNLGLNKTAKTVGQITSAVLFGVLVLQFRNAAGLTPGSADLSYVREIATVT LAPVLFVLFCVVIVSAWSNAVNFTDGLDGLAAGTMAMVTAAYVLITFWQYRNACVTAP GLGCYNVRDPLDLALIAAATAGACIGFLWWNAAPAKIFMGDTGSLALGGVIAGLSVTS RTEILAVVLGALFVAEITSVVLQILTFRTTGRRMFRMAPFHHHFELVGWAETTVIIRF WLLTAITCGLGVALFYGEWLAAVGA" CDS complement(2402397..2403929) /codon_start=1 /transl_table=11 /gene="murF" /locus_tag="BQ2027_MB2181C" /product="probable udp-n-acetylmuramoylalanyl-d-glutamyl- 2, 6-diaminopimelate-d-alanyl-d-alanyl ligase murf" /note="Mb2181c, murF, len: 510 aa. Equivalent to Rv2157c, len: 510 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 510 aa overlap). Probable murF, UDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate-D -alanyl-D-alanyl ligase (EC 6.3.2.15) (UDP-MURNAC-PENTAPEPTIDE SYNTHETASE) also related to other Mycobacterium tuberculosis mur gene products. FASTA best: MURF_ECOLI P11880 (452 aa) opt: 515, E(): 2.6e-24, (31.9% identity in 511 aa overlap); deleted EC number 6.3.2.15 Protein product from Mb2181c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2181c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5L5" /db_xref="InterPro:IPR000713" /db_xref="InterPro:IPR004101" /db_xref="InterPro:IPR005863" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR035911" /db_xref="InterPro:IPR036565" /db_xref="InterPro:IPR036615" /db_xref="UniProtKB/Swiss-Prot:P0A5L5" /protein_id="SIU00789.1" /translation="MIELTVAQIAEIVGGAVADISPQDAAHRRVTGTVEFDSRAIGPG GLFLALPGARADGHDHAASAVAAGAAVVLAARPVGVPAIVVPPVAAPNVLAGVLEHDN DGSGAAVLAALAKLATAVAAQLVAGGLTIIGITGSSGKTSTKDLMAAVLAPLGEVVAP PGSFNNELGHPWTVLRATRRTDYLILEMAARHHGNIAALAEIAPPSIGVVLNVGTAHL GEFGSREVIAQTKAELPQAVPHSGAVVLNADDPAVAAMAKLTAARVVRVSRDNTGDVW AGPVSLDELARPRFTLHAHDAQAEVRLGVCGDHQVTNALCAAAVALECGASVEQVAAA LTAAPPVSRHRMQVTTRGDGVTVIDDAYNANPDSMRAGLQALAWIAHQPEATRRSWAV LGEMAELGEDAIAEHDRIGRLAVRLDVSRLVVVGTGRSISAMHHGAVLEGAWGSGEAT ADHGADRTAVNVADGDAALALLRAELRPGDVVLVKASNAAGLGAVADALVADDTCGSV RP" CDS complement(2403926..2405533) /codon_start=1 /transl_table=11 /gene="murE" /locus_tag="BQ2027_MB2182C" /product="Probable UDP-N-acetylmuramoylalanyl-D-glutamate- 2,6-diaminopimelate ligase MurE" /note="Mb2182c, murE, len: 535 aa. Equivalent to Rv2158c, len: 535 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 535 aa overlap). Probable murE, UDP-N-acetylmuramoylalanyl-D-glutamate-2,6-diaminopimelate ligase (EC 6.3.2.13; UDP-N-ACETYLMURAMYL-TRIPEPTIDE SYNTHETASE) also related to other Mycobacterium tuberculosis mur gene products. FASTA best: MURE_BACSU Q03523 (494 aa) opt: 1020 z- score: 1110.1 E(): 0; (40.1% identity in 476 aa overlap) Protein product from Mb2182c detected using shotgun mass spectrometry. Mb2182c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65478" /db_xref="InterPro:IPR000713" /db_xref="InterPro:IPR004101" /db_xref="InterPro:IPR005761" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR035911" /db_xref="InterPro:IPR036565" /db_xref="InterPro:IPR036615" /db_xref="UniProtKB/Swiss-Prot:P65478" /protein_id="SIU00790.1" /translation="MSSLARGISRRRTEVATQVEAAPTGLRPNAVVGVRLAALADQVG AALAEGPAQRAVTEDRTVTGVTLRAQDVSPGDLFAALTGSTTHGARHVGDAIARGAVA VLTDPAGVAEIAGRAAVPVLVHPAPRGVLGGLAATVYGHPSERLTVIGITGTSGKTTT TYLVEAGLRAAGRVAGLIGTIGIRVGGADLPSALTTPEAPTLQAMLAAMVERGVDTVV MEVSSHALALGRVDGTRFAVGAFTNLSRDHLDFHPSMADYFEAKASLFDPDSALRART AVVCIDDDAGRAMAARAADAITVSAADRPAHWRATDVAPTDAGGQQFTAIDPAGVGHH IGIRLPGRYNVANCLVALAILDTVGVSPEQAVPGLREIRVPGRLEQIDRGQGFLALVD YAHKPEALRSVLTTLAHPDRRLAVVFGAGGDRDPGKRAPMGRIAAQLADLVVVTDDNP RDEDPTAIRREILAGAAEVGGDAQVVEIADRRDAIRHAVAWARPGDVVLIAGKGHETG QRGGGRVRPFDDRVELAAALEALERRA" CDS complement(2405556..2406590) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2183C" /product="Peroxidase" /note="Mb2183c, -, len: 344 aa. Equivalent to Rv2159c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Conserved hypothetical protein; some similarity to hypothetical protein from Streptomyces coelicolor SC1A6.09c (337 aa, 29% identity). Smith-Waterman scores: >pir||T28690 hypothetical protein -Streptomyces coelicolor >gi|3127841|emb|CAA18907.1| (AL023496) Expect = 2e-18 Protein product from Mb2183c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y0C9" /db_xref="InterPro:IPR003779" /db_xref="InterPro:IPR004675" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0C9" /protein_id="SIU00791.1" /translation="MKFVNHIEPVAPRRAGGAVAEVYAEARREFGRLPEPLAMLSPDE GLLTAGWATLRETLLVGQVPRGRKEAVAAAVAASLRCPWCVDAHTTMLYAAGQTDTAA AILAGTAPAAGDPNAPYVAWAAGTGTPAGPPAPFGPDVAAEYLGTAVQFHFIARLVLV LLDETFLPGGPRAQQLMRRAGGLVFARKVRAEHRPGRSTRRLEPRTLPDDLAWATPSE PIATAFAALSHHLDTAPHLPPPTRQVVRRVVGSWHGEPMPMSSRWTNEHTAELPADLH APTRLALLTGLAPHQVTDDDVAAARSLLDTDAALVGALAWAAFTAARRIGTWIGAAAE GQVSRQNPTG" CDS complement(2406587..2407207) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2184C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2184c, -, len: 206 aa. Similar to 5' end of Rv2160A and 3' end of Rv2160c, len: 211 aa and 112 aa, from Mycobacterium tuberculosis strain H37Rv, (96.3% identity in 107 aa overlap and 92.0% identity in 112 aa overlap). Conserved hypothetical protein, possibly a tetR-family transcriptional regulator, similar to N-terminal half of AL512667_12|Q9AD73|SCK31.01c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (200 aa), FASTA scores: opt: 285, E(): 1.4e-08, (51.042% identity in 96 aa overlap). Next gene, Rv2160c, is similar to C-terminal half of 2SCK31.01c suggesting possible frameshift near 2421978 but sequence of this region has been checked and is also identical in strain CDC1551. Conserved hypothetical protein, possibly a tetR-family transcriptional regulator, similar to C-terminal half of AL512667_12|Q9AD73|SCK31.01c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (200 aa), while Rv2160A is similar to the N-terminal half of 2SCK31.01c. This suggests possible frameshift near 2421978 but sequence of this region has been checked and is also identical in strain CDC1551. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2160A and Rv2160c exist as 2 genes with an overlap region between them. In Mycobacterium bovis, a 4 bp insertion (*-ggaa) leads to a single product." /db_xref="GOA:A0A1R3Y0F7" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F7" /protein_id="SIU00792.1" /translation="MPSADVGRQTRAQILRAAMDIASVKGLSGLSIGELAGRLGMSKS GLFRHFGAKEQLQLATVEAAVSVFEAEVVAPAMAAPPGVDRVRALMHAWVGYLERDVF PGGCFFAAAAADVDSQPGPVRDRIAATGRAGIAAITADVETAQRRGEIRADIEARQLA FELHAYAMEANWALLLLDDDGAGERARTAIDAALARVGTTQEGVES" CDS complement(2407200..2408066) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2185C" /product="Probable F420-dependent oxidoreductase family protein" /note="Mb2185c, -, len: 288 aa. Equivalent to Rv2161c, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 288 aa overlap). Conserved hypothetical protein; shows some similarity to protein involved in lincomycin production and to other M. tuberculosis proteins e.g. Rv0953c, Rv0791c, Rv0132c, Rv2951c, Rv1855c. FASTA best: Q54379 (78-11) LINCOMYCIN PRODUCTION GENES (295 aa) opt: 243, E(): 2.4e-09; (29.5% identity in 285 aa overlap). Protein product from Mb2185c detected using SWATH mass spectrometry. Mb2185c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0D6" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019921" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00793.1" /translation="MLVSLMQFVTDLTPPPQLVAVWAEERGFAGLYVPEKTHVPISRS TPWPGGELPDWYRRCYDPVVALAAAAAVTTRLRVGTGACLVAVHDPILLAKQIASLCA MSGERFVLGVGFGWNVEELADHGVPFADRIAVTVDKLAAMRALWAAEPVHYEGTHASV PPSWAWPKPAVAPPVLFGCRPSARAFEVIARHGDGWQPIEGYGELLGALPMLHAAFER AGRDPATAQVCVYSSAGDPATLHEYRRAGVAEVALALPSAGRDQVLAALDRSAPLVDA FAGDDREVKSHA" CDS complement(2408169..2409641) /codon_start=1 /transl_table=11 /gene="PE_PGRS38" /locus_tag="BQ2027_MB2186C" /product="pe-pgrs family protein pe_pgrs38" /note="Mb2186c, PE_PGRS38, len: 490 aa. Similar to Rv2162c, len: 532 aa, from Mycobacterium tuberculosis strain H37Rv, (92.105% identity in 532 aa overlap). Member of Mycobacterium tuberculosis PE_PGRS family. FASTA score: Y03A_MYCTU Q 10637 hypothetical glycine-rich 49.6 kd protein (603 aa) op t: 1798 z-score: 1220.0 E(): 0; (55.4% identity in 590 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, in-frame deletions of 108 bp and 18 bp leads to a shorter product than in Mycobacterium tuberculosis strain H37Rv (490 aa versus 532 aa). Mb2186c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00794.1" /translation="MSFVIAAPEVMAAAATDLANIGSSISAASAAAAGPTMGILAAGA DEVSVAISALFGSHAQGYQTLSAQLAAYHNQFVRALNAGAGSYASAEAANVQQTLLNA INAPTQTLLGRPLIGNGADGGPGQNGGPGGLLYGNGGNGGAGDTANPNGGNGGSAGLI GNGGAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAGGVSPAGGAGGAAGLWGHG GAGGAGGSASGAPGAGGAGGDGGRGGLLYGDGGAGGAGGNGSNGVTGVHGGNGGAGGA AGLIGNGGAGGDGGNGGLSNTGASGGAGGAGGAALIGNGGDGGHGGNGGHGNSGGAGG AGGAGGHVGLIGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGNGGVGGR GGNGGQAPTPGNAGDGGAGGNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGNGGNAV VIGNGGNGGAGGFGIPVGSGGAGGSRGVLFGTPGANGADG" CDS complement(2409851..2411890) /codon_start=1 /transl_table=11 /gene="pbpB" /locus_tag="BQ2027_MB2187C" /product="Probable penicillin-binding membrane protein pbpB" /note="Mb2187c, pbpB, len: 679 aa. Equivalent to Rv2163c, len: 679 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 679 aa overlap). Probable pbpB, penicillin-binding membrane protein, similar to many bacterial PBP2 proteins e.g. P11882|PBP2_NEIME|PENA|NMA2072|NMB0413 penicillin-binding protein 2 (pbp-2) from Neisseria meningitidis (serogroups A and B) (581 aa), FASTA scores: opt: 665, E(): 1.6e-31, (33.2% identity in 591 aa overlap); etc. Also similar to Rv0016c and Rv2864c from Mycobacterium tuberculosis (2.8e-10). Contains PS00017 possible ATP/GTP-binding site motif A (P-loop) near C-terminus. FASTA best: PBP2_NEIME P11882 penicillin-binding protein 2 (pbp-2). (581 aa) opt: 665, E(): 1.6e-31; (33 .2% identity in 591 aa overlap),Mb2187c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y155" /db_xref="InterPro:IPR001460" /db_xref="InterPro:IPR005311" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR036138" /db_xref="UniProtKB/TrEMBL:A0A1R3Y155" /protein_id="SIU00795.1" /translation="MSRAAPRRASQSQSTRPARGLRRPPGAQEVGQRKRPGKTQKARQ AQEATKSRPATRSDVAPAGRSTRARRTRQVVDVGTRGASFVFRHRTGNAVILVLMLVA ATQLFFLQVSHAAGLRAQAAGQLKVTDVQPAARGSIVDRNNDRLAFTIEARALTFQPK RIRRQLEEARKKTSAAPDPQQRLRDIAQEVAGKLNNKPDAAAVLKKLQSDETFVYLAR AVDPAVASAICAKYPEVGAERQDLRQYPGGSLAANVVGGIDWDGHGLLGLEDSLDAVL AGTDGSVTYDRGSDGVVIPGSYRNRHKAVHGSTVVLTLDNDIQFYVQQQVQQAKNLSG AHNVSAVVLDAKTGEVLAMANDNTFDPSQDIGRQGDKQLGNPAVSSPFEPGSVNKIVA ASAVIEHGLSSPDEVLQVPGSIQMGGVTVHDAWEHGVMPYTTTGVFGKSSNVGTLMLS QRVGPERYYDMLRKFGLGQRTGVGLPGESAGLVPPIDQWSGSTFANLPIGQGLSMTLL QMTGMYQAIANDGVRVPPRIIKATVAPDGSRTEEPRPDDIRVVSAQTAQTVRQMLRAV VQRDPMGYQQGTGPTAGVPGYQMAGKTGTAQQINPGCGCYFDDVYWITFAGIATADNP RYVIGIMLDNPARNSDGAPGHSAAPLFHNIAGWLMQRENVPLSPDPGPPLVLQAT" CDS complement(2411887..2413041) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2188C" /product="PROBABLE CONSERVED PROLINE RICH MEMBRANE PROTEIN" /note="Mb2188c, -, len: 384 aa. Equivalent to Rv2164c, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 384 aa overlap). Probable pro- rich conserved membrane protein, equivalent to ML0907|AL022602 putative conserved membrane protein from Mycobacterium leprae (377 aa) (AL022602), FASTA scores: opt: 1495, E(): 1.7e-56, (62.217% identity in 397 aa overlap). Protein product from Mb2188c detected using SWATH mass spectrometry. Mb2188c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0H5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H5" /protein_id="SIU00796.1" /translation="MRAKREAPKSRSSDRRRRADSPAAATRRTTTNSAPSRRIRSRAG KTSAPGRQARVSRPGPQTSPMLSPFDRPAPAKNTSQAKARAKARKAKAPKLVRPTPME RLAARLTSIDLRPRTLANKVPFVVLVIGSLGVGLGLTLWLSTDAAERSYQLSNARERT RMLQQHKEALERDVREAASAPALAEAARRQGMIPTRDTAHLAQDPDGNWVVVGTPKPA DGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVRVTPGPDDPAPPARSGPEVLVRTPDGT ATLGGATHLPTQAGPQLPGPVPIPGAPGPMPAPPLGAVPSPAPAENPVPLQVGAAPPA GLPGPAPVAATPGLSGGSQPMVAPPAPVPANGEQFGPVTAPVPTAPGAPR" CDS complement(2413038..2414228) /codon_start=1 /transl_table=11 /gene="rsmH" /locus_tag="BQ2027_MB2189C" /product="16S rRNA (cytosine(1402)-N(4))-methyltransferase (EC" /EC_number="2.1.1.199" /note="Mb2189c, -, len: 396 aa. Equivalent to Rv2165c, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 396 aa overlap). Conserved hypothetical protein; shows strong similarity to several hypothetical bacterial proteins but has extra 80 aa residues at N-terminus FASTA best: YLXA_BACSU Q07876 hypothetical 35.3 kd protein in ftsl (311 aa) opt: 781, E(): 0; (45.6% identity in 296 aa overlap), BELONGS TO THE YABC (E.COLI), YLXA (B.SUBTILIS) FAMILY Protein product from Mb2189c detected using SWATH mass spectrometry. Mb2189c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65430" /db_xref="InterPro:IPR002903" /db_xref="InterPro:IPR023397" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P65430" /protein_id="SIU00797.1" /translation="MQTRAPWSLPEATLAYFPNARFVSSDRDLGAGAAPGIAASRSTA CQTWGGITVADPGSGPTGFGHVPVLAQRCFELLTPALTRYYPDGSQAVLLDATIGAGG HAERFLEGLPGLRLIGLDRDPTALDVARSRLVRFADRLTLVHTRYDCLGAALAESGYA AVGSVDGILFDLGVSSMQLDRAERGFAYATDAPLDMRMDPTTPLTAADIVNTYDEAAL ADILRRYGEERFARRIAAGIVRRRAKTPFTSTAELVALLYQAIPAPARRVGGHPAKRT FQALRIAVNDELESLRTAVPAALDALAIGGRIAVLAYQSLEDRIVKRVFAEAVASATP AGLPVELPGHEPRFRSLTHGAERASVAEIERNPRSTPVRLRALQRVEHRAQSQQWATE KGDS" CDS complement(2414230..2414661) /codon_start=1 /transl_table=11 /gene="mraZ" /locus_tag="BQ2027_MB2190C" /product="Transcriptional regulator MraZ" /note="Mb2190c, -, len: 143 aa. Equivalent to Rv2166c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein; shows strong similarity to several hypothetical bacterial proteins such as YLLB_BACSU P55343. Is equivalent to Mycobacterium leprae hypothetical protein ML0905 (143 aa, 92% identity) MLCB268.11c >sp|O69561|YL66_MYCLE HYPOTHETICAL 16.1 KDA PROTEIN ML0905 >gi|3080482|emb|CAA18677.1|(AL022602) >gi|13092975|emb|CAC31286.1|(AL583920). FASTA scores: ML0905|ML0905 conserved hypothetical protein (143 aa) opt: 873, E(): 3.1e-52; 92.254% identity in 142 aa overlap; YLLB_BACSU P55343 hypothetical 16.6 kd protein (143 aa) opt: 340, E(): 3.6e-17; (35.0% identity in 143 aa overlap). BELONGS TO THE YABB (E.COLI), YLLB (B.SUBTILIS), MG221 (M.GENITALIUM) FAMILY Protein product from Mb2190c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2190c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65437" /db_xref="InterPro:IPR003444" /db_xref="InterPro:IPR007159" /db_xref="InterPro:IPR020603" /db_xref="InterPro:IPR035642" /db_xref="InterPro:IPR035644" /db_xref="InterPro:IPR037914" /db_xref="InterPro:IPR038619" /db_xref="UniProtKB/Swiss-Prot:P65437" /protein_id="SIU00798.1" /translation="MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYP RAAFEQLARRASKAPRSNPEARAFLRNLAAGTDEQHPDSQGRITLSADHRRYASLSKD CVVIGAVDYLEIWDAQAWQNYQQIHEENFSAASDEALGDIF" CDS complement(2415010..2415414) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2191C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2191c, -, len: 134 aa. Equivalent to Rv2169c, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Probable conserved transmembrane protein, with orthologs in M. leprae, ML0904 probable membrane protein (134 aa), and Streptomyces coelicolor. FASTA scores with ML0904, opt: 767, E(): 5.1e-43; 86.567% identity in 134 aa overlap. emb|CAA18678.1| (AL022602) >gi|13092974|emb|CAC31285.1| (AL583920). TBparse score is 0.934 Protein product from Mb2191c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2191c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0G1" /db_xref="InterPro:IPR021401" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0G1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00799.1" /translation="MPLSDHEQRMLDQIESALYAEDPKFASSVRGGGFRAPTARRRLQ GAALFIIGLGMLVSGVAFKETMIGSFPILSVFGFVVMFGGVVYAITGPRLSGRMDRGG SAAGASRQRRTKGAGGSFTSRMEDRFRRRFDE" CDS 2415680..2416300 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2192" /product="gcn5-related n-acetyltransferase" /note="Mb2192, -, len: 206 aa. Equivalent to Rv2170, len: 206 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 206 aa overlap). Conserved hypothetical protein, equivalent to hypothetical protein ML0903 (210 aa) from Mycobacterium leprae. FASTA scores: ML0903 conserved hypothetical protein (210 aa) opt: 1045, E(): 9.1e-57; 77.143% identity in 210 aa overlap. >emb|CAA18679.1| (AL022602) >gi|13092973|emb|CAC31284.1| (AL583920). Protein product from Mb2192 detected using SWATH mass spectrometry. Mb2192 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0H2" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR013653" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H2" /protein_id="SIU00800.1" /translation="MAIFLIDLPPSDMERRLGDALTVYVDAMRYPRGTETLRAPMWLE HIRRRGWQAVAAVEVTAAEQAEAADTTALPSAAELSNAPMLGVAYGYPGAPGQWWQQQ VVLGLQRSGFPRLAIARLMTSYFELTELHILPRAQGRGLGEALARRLLAGRDEDNVLL STPETNGEDNRAWRLYRRLGFTDIIRGYHFAGDPRAFAILGRTLPL" CDS 2416396..2417079 /codon_start=1 /transl_table=11 /gene="lppM" /locus_tag="BQ2027_MB2193" /product="Probable conserved lipoprotein lppM" /note="Mb2193, lppM, len: 227 aa. Equivalent to Rv2171, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 227 aa overlap). Probable lppM, conserved lipoprotein; contains putative signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Has hydrophobic stretch at C-terminus and also contains PS00225 Crystallins beta and gamma 'Greek key' motif signature. Unknown but equivalent to Mycobacterium leprae lipoprotein ML0902 (239 aa). FASTA scores: opt: 1083, E(): 2.4e-56; 75.446% identity in 224 aa overlap (5-227:16-239) >emb|CAA18680.1| (AL022602) >gi|13092972|emb|CAC31283.1| (AL583920). Protein product from Mb2193 detected using shotgun mass spectrometry. Mb2193 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0D8" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0D8" /protein_id="SIU00801.1" /translation="MARTRRRGMLAIAMLLMLVPLATGCLRVRASITISPDDLVSGEI IAAAKPKNSKDTGPALDGDVPFSQKVAVSNYDSDGYVGSQAVFSDLTFAELPQLANMN SDAAGVNLSLRRNGNIVILEGRADLTSVSDPDADVELTVAFPAAVTSTNGDRIEPEVV QWKLKPGVVSTMSAQARYTDPNTRSFTGAGIWLGIAAFAAAGVVAVLAWIDRDRSPRL TASGDPPTS" CDS complement(2417076..2417981) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2194C" /product="conserved protein" /note="Mb2194c, -, len: 301 aa. Equivalent to Rv2172c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Conserved hypothetical protein, equivalent to Mycobacterium leprae conserved hypothetical protein ML0901 (304 aa). FASTA scores: opt: 1656, E(): 7.7e-98; 81.271% identity in 299 aa overlap (1-299:1-299) CAA18681.1|AL022602|13092971|CAC31282.1|AL583920. TBparse score is 0.905 Protein product from Mb2194c detected using shotgun mass spectrometry. Mb2194c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0G5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00802.1" /translation="MTLNTIALELVPPNLEGGKERAIEDARKVVQYSAASGLDGRIRH VMMPGMIAEDDDRPIPMQPKLDVLDFWSIIKPELAGVHGLCTQVTAFMDEPSLHRRLV DLSDAGMEGIVFVGVPRTMQDGEGSGVAPTDALSLYRQLVANRGVIVIPTRDGEQGRL NFKCSRGATYGMTQLLYSDAIVGFLREFARTTEHRPEILLSFGFVPKVETRIGLINWL IQDPGNAAVADEQAFVQKLAGSEPARRRRLMVDLYKRVLDGVADLGFPLSIHLEATYG VSAAAFETFAEMLAYWSPAEPGKPD" CDS 2418292..2419350 /codon_start=1 /transl_table=11 /gene="idsA2" /locus_tag="BQ2027_MB2195" /product="PROBABLE GERANYLGERANYL PYROPHOSPHATE SYNTHETASE IDSA2 (GGPPSASE) (GGPP SYNTHETASE) (GERANYLGERANYL DIPHOSPHATE SYNTHASE)" /note="Mb2195, idsA2, len: 352 aa. Equivalent to Rv2173, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Probable idsA2, geranylgeranyl pyrophosphate synthase (EC 2.5.1.-), similar to many e.g. Q54193 geranylgeranyl pyrophosphate synthase from Streptomyces griseus (425 aa). Contains PS00723 and PS00444Polyprenyl synthetases signature 1 and 2. FASTA scores: sptr|Q54193|Q54193 GERANYLGERANYL PYROPHOSPHATE SYNTHASE (425 aa) opt: 744, E(): 0; 39.2% identity in 352 aa overlap. Protein product from Mb2195 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2195 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0E9" /db_xref="InterPro:IPR000092" /db_xref="InterPro:IPR008949" /db_xref="InterPro:IPR033749" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E9" /protein_id="SIU00803.1" /translation="MAGAITDQLRRYLHGRRRAAAHMGSDYDGLIADLEDFVLGGGKR LRPLFAYWGWHAVASREPDPDVLLLFSALELLHAWALVHDDLIDRSATRRGRPTAQLR YAALHRDRDWRGSPDQFGMSAAILLGDLAQVWADDIVSKVCQSALAPDAQRRVHRVWA DIRNEVLGGQYLDIVAEASAAESIESAMNVATLKTACYTVSRPLQLGTAAAADRSDVA AIFEHFGADLGVAFQLRDDVLGVFGDPAVTGKPSGDDLKSGKRTVLVAEAVELADRSD PLAAKLLRTSIGTRLTDAQVRELRTVIEAVGARAAAESRIAALTQRALATLASAPINA TAKAGLSELAMMAANRSA" CDS 2419354..2420904 /codon_start=1 /transl_table=11 /gene="mpta" /locus_tag="BQ2027_MB2196" /product="alpha(1->6)mannosyltransferase. possible conserved integral membrane protein." /note="Mb2196, -, len: 516 aa. Equivalent to Rv2174, len: 516 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 516 aa overlap). Possible conserved integral membrane protein, similar to some hypothetical mycobacterial proteins e.g. Mycobacterium leprae ML0899 probable integral-membrane protein (505 aa) and MLCL536_26 (593 aa). FASTA scores: ML0899 opt: 2715; 78.884% identity in 502 aa overlap and gp|Z99125|MLCL536_26 Mycobacterium leprae cosmid L536. (593 aa) opt: 552, E(): 7.1e-30; 31.6% identity in 513 aa overlap. Also similar to Rv1459c. Mb2196 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2J0" /db_xref="InterPro:IPR017822" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2J0" /protein_id="SIU00804.1" /translation="MTTPSHAPAVDLATAKDAVVQHLSRLFEFTTGPQGGPARLGFAG AVLITAGGLGAGSVRQHDPLLESIHMSWLRFGHGLVLSSILLWTGVGVMLLAWLGLGR RVLAGEATEFTMRATTVIWLAPLLLSVPVFSRDTYSYLAQGALLRDGLDPYAVGPVGN PNALLDDVSPIWTITTAPYGPAFILVAKFVTVIVGNNVVAGTMLLRLCMLPGLALLVW ATPRLASHLGTHGPTALWICVLNPLVLIHLMGGVHNEMLMVGLMTAGIALTVQGRNVA GIILITVAIAVKATAGIALPFLVWVWLRHLRERRGYRPVQAFLAAAAISLLIFVAVFA VLSAVAGVGLGWLTALAGSVKIINWLTVPTGAANVIHALGRGLFTVDFYTLLRITRLI GIVIIAVSLPLLWWRFRRDDRAALTGVAWSMLIVVLFVPAALPWYYSWPLAVAAPLAQ SRRAIAAIAGLSTWVMVIFKPDGSHGMYSWLHFWIATACALTAWYVLYRSPDRRGVQA ATPVVNTP" CDS complement(2420891..2421331) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2197C" /product="conserved regulatory protein" /note="Mb2197c, -, len: 146 aa. Equivalent to Rv2175c, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 146 aa overlap). Conserved hypothetical protein, possibly involved in regulation. Contains possible helix-turn-helix domain at aa 31-52 (Score 1042, +2.74 SD). Equivalent to Mycobacterium leprae ML0898 putative DNA-binding protein (134 aa). FASTA scores: opt: 747; 82.090% identity in 134 aa overlap (AL022602) >gi|13092969|emb|CAC31279.1| (AL583920) Protein product from Mb2197c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2197c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041098" /db_xref="UniProtKB/TrEMBL:A0A1R3Y164" /protein_id="SIU00805.1" /translation="MPGRAPGSTLARVGSILAGDDVLDPDEPTYDLPRVAELLGVPVS KVAQQLREGHLVAVRRAGGVVIPQVFFTNSGQVVKSLPGLLTILHDGGYRDTEIMRWL FTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQAMAY" CDS 2421386..2422585 /codon_start=1 /transl_table=11 /gene="pknL" /locus_tag="BQ2027_MB2198" /product="PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE L PKNL (PROTEIN KINASE L) (STPK L)" /note="Mb2198, pknL, len: 399 aa. Equivalent to Rv2176, len: 399 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 399 aa overlap). Probable pknL, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), similar to many e.g. MLCB1770_9 (622 aa). Lacks C-terminal domain and ends with putative transmembrane segment. Contains PS00108 Serine/Threonine protein kinases active-site signature. FASTA scores: Z70722|MLC B1770_9 Mycobacterium leprae cosmid B1770 (622 aa) opt: 732, E(): 5.9e-23; 44.4% identity in 266 aa overlap. Also similar to several Mycobacterium tuberculosis STPK proteins e.g. Rv0014c|PKNB, Rv0015c|PKNA, Rv1743|PKNE, Rv1266c|PKNH etc. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Protein product from Mb2198 detected using SWATH mass spectrometry. Mb2198 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYY6" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR008271" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/Swiss-Prot:Q7TYY6" /protein_id="SIU00806.1" /translation="MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRP VALKVMDARYAGDEQFLTRFRLEARAVARLNNRALVAVYDQGKDGRHPFLVMELIEGG TLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILISDDGDVKLAD FGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSDVYSVGVLVYELLTGHTPF TGDSALSIAYQRLDADVPRASAVIDGVPPQFDELVACATARNPADRYADAIAMGADLE AIAEELALPEFRVPAPRNSAQHRSAALYRSRITQQGQLGAKPVHHPTRQLTRQPGDCS EPASGSEPEHEPITGQFAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIG SNLSGLL" mobile_element complement(2422590..2423393) /mobile_element_type="insertion sequence:IS1558" /locus_tag="BQ2027_IS1558'-1" /note="IS1558'-1, len: 804 nt. Equivalent to IS1558', len: 804 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 804 nt overlap). Nearly identical to complement of region 24105 24908 in EM_BA:MTCY428 Z81451 Mycobacterium tuberculosis cosmid Y428." gene complement(2422590..2423393) /locus_tag="BQ2027_IS1558'-1" CDS complement(2422727..2423392) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2199C" /product="POSSIBLE TRANSPOSASE" /note="Mb2199c, -, len: 221 aa. Equivalent to Rv2177c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 221 aa overlap). Possible IS1558 transposase (see citation below), similar to several IS element proteins and transposases but nearly identical to last 221 residues of MTCY428_23 (333 aa). FASTA scores: Z81451|MTCY428_23 Mycobacterium tuberculosis cosmid (333 aa) opt: 1491, E() : 0; 98.6% identity in 221 aa overlap. Mb2199c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0U4" /db_xref="InterPro:IPR003346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U4" /protein_id="SIU00807.1" /translation="MRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQ IEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPGNH ESAGKRHHGARRKGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFGGFRSPAANKK AIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKERRRLVAKLEAQGLGVTLE PAA" CDS complement(2423777..2425165) /codon_start=1 /transl_table=11 /gene="aroG" /locus_tag="BQ2027_MB2200C" /product="3-deoxy-d-arabino-heptulosonate 7-phosphate synthase arog (dahp synthetase, phenylalanine-repressible)" /note="Mb2200c, aroG, len: 462 aa. Equivalent to Rv2178c, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 462 aa overlap). Probable aroG, 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase similar to many, especially those from plants. FASTA scores: Y15113|M C3DDAH7P_1Morinda citrifolia mRNA for 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase (535 aa) opt: 1421, E(): 0; 48.3% identity in 443 aa overlap. Protein product from Mb2200c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2200c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0C5" /db_xref="InterPro:IPR002480" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0C5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00808.1" /translation="MNWTVDIPIDQLPSLPPLPTDLRTRLDAALAKPAAQQPTWPADQ ALAMRTVLESVPPVTVPSEIVRLQEQLAQVAKGEAFLLQGGDCAETFMDNTEPHIRGN VRALLQMAVVLTYGASMPVVKVARIAGQYAKPRSADIDALGLRSYRGDMINGFAPDAA AREHDPSRLVRAYANASAAMNLVRALTSSGLASLHLVHDWNREFVRTSPAGARYEALA TEIDRGLRFMSACGVADRNLQTAEIYASHEALVLDYERAMLRLSDGEDGEPQLFDLSA HTVWIGERTRQIDGAHIAFAQVIANPVGVKLGPNMTPELAVEYVERLDPHNKPGRLTL VSRMGNHKVRDLLPPIVEKVQATGHQVIWQCDPMHGNTHESSTGFKTRHFDRIVDEVQ GFFEVHRALGTHPGGIHVEITGENVTECLGGAQDISETDLAGRYETACDPRLNTQQSL ELAFLVAEMLRD" CDS complement(2425256..2425762) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2201C" /product="3'-5' exoribonuclease Rv2179c (EC" /EC_number="3.1.13.-" /note="Mb2201c, -, len: 168 aa. Equivalent to Rv2179c, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Conserved hypothetical protein, equivalent to conserved hypothetical protein from Mycobacterium leprae ML0895 conserved hypothetical protein (171 aa). FASTA scores: opt: 977, E(): 1.4e-58; 82.530% identity in 166 aa overlap (AL022602). Protein product from Mb2201c detected using shotgun mass spectrometry. Mb2201c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0H1" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR030853" /db_xref="InterPro:IPR033390" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H1" /protein_id="SIU00809.1" /translation="MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAG SWVRTHVLPKLPPPASQLWRSRQQIRLDLEEFLRIDGTDSIELWAWVGAYDHVALCQL WGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQLRRFRLITSTD DAGRGAAR" CDS complement(2425772..2426659) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2202C" /product="Probable conserved integral membrane protein" /note="Mb2202c, -, len: 295 aa. Equivalent to Rv2180c, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 295 aa overlap). Probable conserved integral membrane protein, similar to pir||T35292 probable integral membrane protein from Streptomyces coelicolor >gi|5578858|emb|CAB51260.1| (AL096872) (246 aa) (36% identity in 249 aa overlap). Mb2202c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0H9" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H9" /protein_id="SIU00810.1" /translation="MEVFHWLQHDIVDRGRLPLLCCLVAFVLTFLVTRSFVRFIHRRA ADGRPARWWQPRNVHIGSVHIHHVAFGVVLVMISGLTLVTLSVDGREPEFTIAASIFG VGAALVLDEYALILHLSDVYWEEDGRTSVDAVFAAVAVAGLLIMGLHPLIFFLTVRQG ANWVVLQTTLIAGLVLTLPLAVVVLLKGKVWTGLLGMFVVVLLVVGAVRLSRPHAPWA RWRYTRHPEKMRRALQRERTWRRPVVRIKLWLQYVIAGTPRMPDERAVDAQLDQDVRP APPPERTAPILISGSVWSD" CDS 2426747..2428030 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2203" /product="alpha(1->2)mannosyltransferase" /note="Mb2203, -, len: 427 aa. Equivalent to Rv2181, len: 427 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 427 aa overlap). Probable conserved integral membrane protein, similar to others in Mycobacterium tuberculosis e.g. Rv1159 (MTCI65.26, 431 aa). Start uncertain. FASTA scores: Z95584|MTCI65_26 (431 aa) opt: 428, E(): 8e-22; 31.2% identity in 407 aa overlap. Protein product from Mb2203 detected using SWATH mass spectrometry. Mb2203 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0E6" /db_xref="InterPro:IPR018584" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0E6" /protein_id="SIU00811.1" /translation="MSAWRAPEVGSRLGRRVLWCLLWLLAGVALGYVAWRLFGHTPYR IDIDIYQMGARAWLDGRPLYGGGVLFHTPIGLNLPFTYPPLAAVLFSPFAWLQMPAAS VAITVLTLVLLIASTAIVLTGLDAWPTSRLVPAPARLRRLWLAVLIVAPATIWLEPIS SNFAFGQINVVLMTLVIVDCFPRRTPWPRGLMLGLGIALKLTPAVFLLYFLLRRDGRA ALTALASFAVATLLGFVLAWRDSWEYWTHTLHHTDRIGAAALNTDQNIAGALARLTIG DDERFALWVAGSLLVLAATIWAMRRVLRAGEPTLAVICVALFGLVVSPVSWSHHWVWM LPAVLVIGLLGWRRRNVALAMLSLAGVVLMRWTPIDLLPQHRETTAVWWRQLAGMSYV WWALAVIVVAGLTVTARMTPQRSLTRGLTPAPTAS" CDS complement(2428031..2428774) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2204C" /product="1-acylglycerol-3-phosphate O-acyltransferase" /note="Mb2204c, -, len: 247 aa. Equivalent to Rv2182c, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Probable 1-acylglycerol-3-phosphate O-acyltransferase, similar to many e.g. in Streptomyces. Contains PS00017 ATP/GTP-binding site motif A (P-loop). FASTA scores: pir||T35503 1-acylglycerol-3-phosphate O-acyltransferase (EC 2.3.1.51) homolog SC6E10.16c - Streptomyces coelicolor >gi|5689932|emb|CAB51970.1| (AL109661) hypothetical protein [Streptomyces coelicolor A3(2)] Length = 262, Expect = 6e-61 (54% identity in 215 aa overlap). Protein product from Mb2204c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2204c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0H6" /db_xref="InterPro:IPR002123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H6" /protein_id="SIU00812.1" /translation="MWYYLFKYIFMGPLFTLLGRPKVEGLEYIPSSGPAILASNHLAV ADSFYLPLVVRRRIWFLAKSEYFTGTGLKGWINRWFYSVSGQVPIDRTNADSAQGALQ TAVVLLGQGKLLGMYPEGTRSPDGRLYKGKTGLARLALHTGVPVIPVAMIGTNVVNPP GRKMLRFGRVTVRFGKPMDFSRFEGLAGNHFIERAVTDEVIYELMGLSGQEYVDIYAA SVKDGRNAGGAGANPNSTDAARIPETAAG" CDS complement(2428860..2429255) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2205C" /product="link to cyclase activity" /note="Mb2205c, -, len: 131 aa. Equivalent to Rv2183c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical protein ML0891 (MLCB268.25c, 130 aa). FASTA scores: opt: 558, E(): 8.3e-28; 61.832% identity in 131 aa overlap >gi|13092963|emb|CAC31272.1| (AL583920) (AL022602). Protein product from Mb2205c detected using SWATH mass spectrometry. Mb2205c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0F8" /protein_id="SIU00813.1" /translation="MSGAHTDVRPELRKLAQAILDGIDPAVRVAAAMASGGGPGTGKC QQVWCPLCALAALVTGEQHPLLTVIADHSLALLEVIRAIVDDIDRSAKPPPEGPPGGG QTGASGGENTNGEGSMKSHYQAIPVTIEE" CDS complement(2429252..2430391) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2206C" /product="anion-transporting ATPase" /note="Mb2206c, -, len: 379 aa. Equivalent to Rv2184c, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 379 aa overlap). Conserved hypothetical protein, equivalent to hypothetical protein ML0890 (415 aa) from Mycobacterium leprae and also shows some similarity to other hypothetical proteins. FASTA scores: ML0890 opt: 1949; 79.630% identity in 378 aa overlap >emb|CAA18692.1| (AL022602) >gi|13092962|emb|CAC31271.1| (AL583920) and sptr|Q55794|Q55794 HYPOTHETICAL 44.6 KD PROTEIN. (396 aa) opt: 251, E(): 3.3e-09; 25.5% identity in 384 aa overlap. Protein product from Mb2206c detected using SWATH mass spectrometry. Mb2206c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2K1" /db_xref="InterPro:IPR008978" /db_xref="InterPro:IPR016300" /db_xref="InterPro:IPR025723" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR040612" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2K1" /protein_id="SIU00814.1" /translation="MVVSTDQAHSLGDVLGIAVPPTGQGDPVRVLAYDPEAGGGFLDA LALDTLALLEGRWLHVVETLDRRFPGSELSSIAPEELCALPGIQEVLGLHAVGELAAA RRWDRIVVDCASTADALRMLTLPATFGLYVERAWPRHRRLSIGADDGRSAVLAELLER IRASVERLSTLLTDGALVSAHLVLTPERVVAAEAVRTLGSLALMGVRVEELLVNQLLV QDENYEYRSLPDHPAFHWYAERIGEQRAVLDDLDATIGDVALVLVPHLAGEPIGPKAL GGLLDSARRRQGSAPPGPLQPIVDLESGSGLASIYRLRLALPQLDPGTLTLGRADDDL IVSAGGMRRRVRLASVLRRCTVLDAHLRGGELTVRFRPNPEVWPT" CDS complement(2430511..2430945) /codon_start=1 /transl_table=11 /gene="TB16.3" /locus_tag="BQ2027_MB2207C" /product="Cyclase/Dehydrase" /note="Mb2207c, TB16.3, len: 144 aa. Equivalent to Rv2185c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). TB16.3, conserved hypothetical protein, similar to other hypothetical actinomycete proteins and equivalent to Mycobacterium leprae ML0889 (144 aa). Some similarity to Mycobacterium tuberculosis Rv0854, Rv0856, Rv0857, Rv0164 and other Mycobacterium leprae proteins. FASTA scores : ML0889 opt: 811; 85.417% identity in 144 aa overlap (AL022602). Protein product from Mb2207c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2207c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y175" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00815.1" /translation="MADKTTQTIYIDADPGEVMKAIADIEAYPQWISEYKEVEILEAD DEGYPKRARMLMDAAIFKDTLIMSYEWPEDRQSLSWTLESSSLLKSLEGTYRLAPKGS GTEVTYELAVDLAVPMIGMLKRKAERRLIDGALKDLKKRVEG" CDS complement(2431050..2431439) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2208C" /product="link to cyclase activity" /note="Mb2208c, -, len: 129 aa. Equivalent to Rv2186c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, equivalent to hypothetical Mycobacterium leprae protein ML0888 (135 aa). FASTA scores: ML0888 opt: 704, E(): 2.9e-43; 80.000% identity in 130 aa overlap CAA18694.1| (AL022602). Protein product from Mb2208c detected using shotgun mass spectrometry. Mb2208c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J5" /protein_id="SIU00816.1" /translation="MNSIQIADETYVAADAARVSAAVADRCSWRRWWPDLRLQVTEDR ADKGIRWTVTGALTGTMEIWLEPSMDGVLLHYFLHAEPTGVAAWQLARMNLARMTHHR RVAGKKMAFEVKTVLERSRPIGVSPVT" CDS 2431605..2432117 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_FADD15" /product="HYPOTHETICAL PROTEIN" /note="Mb2209, -, len: 170 aa. Equivalent to 5' end of Rv2187, len: 600 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Probable fadD15, long-chain-fatty-acid-CoA ligase (EC 6.2.1.3), similar to several e.g. P44446|LCFH_HAEIN PUTATIVE LONG-CHAIN-FATTY-ACID--CoA LIGASE from Haemophilus influenzae (607 aa), FASTA scores: (607 aa) opt: 992, E(): 0, (31.5% identity in 578 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, fadD15 exists as a single gene. In Mycobacterium bovis, a frameshift due to single base deletion (t-*) splits fadD15 into 2 parts, Mb2209 and fadD15." /db_xref="GOA:A0A1R3Y0V3" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V3" /protein_id="SIU00817.1" /translation="MREISVPAPFTVGEHDNVAAMVFEHERDDPDYVIYQRLIDGVWT DVTCAEAANQIRAAALGLISLGVQAGDRVVIFSATRYEWAILDFAIRLWVRSPYRSTR PRQRSRCAGFYKTPKRWCCSPKPTHTRQWSPNSPAACPPCGRYCRSPVRVPTRSIGSR RRAPRSTRPS" CDS 2432204..2433406 /codon_start=1 /transl_table=11 /gene="fadD15" /locus_tag="BQ2027_MB2210" /product="Probable long-chain-fatty-acid-CoA ligase fadD15 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)" /note="Mb2210, fadD15, len: 508 aa. Equivalent to 3' end of Rv2187, len: 600 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 508 aa overlap). Probable fadD15, long-chain-fatty-acid-CoA ligase (EC 6.2.1.3), similar to several e.g. P44446|LCFH_HAEIN PUTATIVE LONG-CHAIN-FATTY-ACID--CoA LIGASE from Haemophilus influenzae (607 aa), FASTA scores: (607 aa) opt: 992, E(): 0, (31.5% identity in 578 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. TBparse score is 0.902. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to single base deletion (t-*) leads to a shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb2210 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2210 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYX8" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TYX8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00818.1" /translation="MTQSNLVHEIKGARAYHPTLLRKGERLLVFLPLAHVLARAISMA AFHSKVTVGFTSDIKNLLPMLAVFKPTVVVSVPRVFEKVYNTAEQNAANAGKGRIFAI AAQTAVDWSEACDRGGPGLLLRAKHAVFDRLVYRKLRAALGGNCRAAVSGGAPLGARL GHFYRGAGLTIYEGYGLSETSGGVAISQFNDLKIGTVGKPVPGNSLRIADDGELLVRG GVVFSGYWRNEQATTEAFTDGWFKTGDLGAVDEDGFLTITGRKKEIIVTAGGKNVAPA VLEDQLRAHPLISQAVVVGDAKPFIGALITIDPEAFEGWKQRNSKTAGASVGDLATDP DLIAEIDAAVKQANLAVSHAESIRKFRILPVDFTEDTGELTPTMKVKRKVVAEKFASD IEAIYNKE" CDS complement(2433437..2434594) /codon_start=1 /transl_table=11 /gene="pimb" /locus_tag="BQ2027_MB2211C" /product="mannosyltransferase pimb" /note="Mb2211c, -, len: 385 aa. Equivalent to Rv2188c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 385 aa overlap). Conserved hypothetical protein, possibly glycosyl transferase similar to several putative glycosyl transferases and hypothetical proteins e.g. P73369. Equivalent to Mycobacterium leprae ML0886 putative glycosyl transferase (384 aa). FASTA scores: ML0886 (CAA18697.1| (AL022602)) opt: 2113, E(): 1.8e-106; 81.462% identity in 383 aa overlap; sptr|P73369|P73369 HYPOTHETICAL 46.2 KD PROTEIN (404 aa) opt: 379, E(): 2.2e-18; 27.5% identity in 397 aa overlap. Start changed since first submission, now 14 aa shorter. Protein product from Mb2211c detected using SWATH mass spectrometry. Mb2211c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0H8" /db_xref="InterPro:IPR001296" /db_xref="InterPro:IPR028098" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0H8" /protein_id="SIU00819.1" /translation="MSRVLLVTNDFPPRRGGIQSYLGEFVGRLVGSRAHAMTVYAPQW KGADAFDDAARAAGYRVVRHPSTVMLPGPTVDVRMRRLIAEHDIETVWFGAAAPLALL APRARLAGASRVLASTHGHEVGWSMLPVARSVLRRIGDGTDVVTFVSSYTRSRFASAF GPAASLEYLPPGVDTDRFRPDPAARAELRKRYRLGERPTVVCLSRLVPRKGQDTLVTA LPSIRRRVDGAALVIVGGGPYLETLRKLAHDCGVADHVTFTGGVATDELPAHHALADV FAMPCRTRGAGMDVEGLGIVFLEASAAGVPVIAGNSGGAPETVQHNKTGLVVDGRSVD RVADAVAELLIDRDRAVAMGAAGREWVTAQWRWDTLAAKLADFLRGDDAAR" CDS complement(2434691..2435464) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2212C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2212c, -, len: 257 aa. Equivalent to Rv2189c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 257 aa overlap). Conserved hypothetical protein; some similarity to hypothetical protein SC6G10.07c (385 aa) from Streptomyces coelicolor A3(2). Smith-Waterman scores: pir||T35516 hypothetical protein SC6G10.07c -Streptomyces coelicolor >gi|4539203|emb|CAB39861.1| (AL049497) Expect = 2e-08; 30% identity in 245 aa overlap. TBparse score is 0.908,Mb2212c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00820.1" /translation="MRDGPAAPAQVVAPADGFVALRVADDRTVRLLSLGGAATDRLLS RIAAGIDAAVDEVVAFWGTDWSHDIFVVAAGSDEQFHAAAGGGLASQWADIAAITVVD RVDPARRTVVGQRIVFAPGAAHMSPAALRIVLGHELFHYAARADTALDAPRWLAEGVA DFVARPKTPPPADAVSVALSLPSDTDLDTPGPQRSLAYDRAWWFARFVAAAYGTAKLR ELYLATCGVGHFDLATAAHDVLGIDAAGLLARWQRWLMG" CDS complement(2435559..2436716) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2213C" /product="Cell wall-associated hydrolases (invasion-associated proteins)" /note="Mb2213c, -, len: 385 aa. Equivalent to Rv2190c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 385 aa overlap). Conserved hypothetical protein; similar to other hypothetical mycobacterial proteins, including Rv1477, Rv1478, Rv1566c, Rv0024, that are similar to protein p60 precursors from Listeria eg Q018 38|P60_LISSE protein p60 precursor (invasion-associated protein) (524 aa). FASTA scores: gp|Z80233|MTCY10H4_25 (281 a a) opt: 290, E(): 6.9e-05; 37.0% identity in 127 aa overlap and sp|Q01838|P60_LISSE PROTEIN P60 PRECURSOR (523 aa) opt: 268, E(): 0.00071; 38.5% identity in 104 aa overlap. Protein product from Mb2213c detected using SWATH mass spectrometry. Mb2213c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67474" /db_xref="InterPro:IPR000064" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/Swiss-Prot:P67474" /protein_id="SIU00821.1" /translation="MRLDQRWLIARVIMRSAIGFFASFTVSSGVLAANVLADPADDAL AKLNELSRQAEQTTEALHSAQLDLNEKLAAQRAADQKLADNRTALDAARARLATFQTA VNKVAAATYMGGRTHGMDAILTAESPQLLIDRLSVQRVMAHQMSTQMARFKAAGEQAV KAEQAAAKSAADARSAAEQAAAVRANLQHKQSQLQVQIAVVKSQYVALTPEERTALAD PGPVPAVAAIAPGAPPAALPPGAPPGDGPAPGVAPPPGGMPGLPFVQPDGAGGDRTAV VQAALTQVGAPYAWGGAAPGGFDCSGLVMWAFQQAGIALPHSSQALAHGGQPVALSDL QPGDVLTFYSDASHAGIYIGDGLMVHSSTYGVPVRVVPMDSSGPIYDARRY" CDS 2437263..2439200 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2214" /product="DNA polymerase III epsilon subunit-related protein MSMEG4261" /note="Mb2214, -, len: 645 aa. Equivalent to Rv2191, len: 645 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 645 aa overlap). Conserved hypothetical protein, similar to SW:DP3A_B ACSU P13267 DNA polymerase III, alpha chain (31.3% identity in 249 aa overlap) and SW:UVRC_ECOLI P07028 excinuclease ABC subunit C (25.7% identity in 230 aa overlap). Also similar to M. tuberculosis Rv3711c (dnaQ DNA polymerase III e chain) and Rv1420 (uvrC excinuclease ABC subunit C),Mb2214 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0I7" /db_xref="InterPro:IPR000305" /db_xref="InterPro:IPR006054" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR013520" /db_xref="InterPro:IPR035901" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I7" /protein_id="SIU00822.1" /translation="MQGPNVAAMGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFV VVDLETTGGRTTGNDATPPDAITEIGAVKVCGGAVLGEFATLVNPQHSIPPQIVRLTG ITTAMVGNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCDITWPQPQVLC TMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDARATVDVLHALIERVGNQG VHTYAELRSYLPNVTQAQRCKRVLAETLPHRPGVYLFRGPSGEVLYVGTAADLRRRVS QYFNGTDRRKRMTEMVMLASSIDHVECAHPLEAGVRELRMLSTHAPPYNRRSKFPYRW WWVALTDEAFPRLSVIRAPRHDRVVGPFRSRSKAAETTALLARCTGLRTCTTRLTRSA RHGPACPELEVSACPAARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERR RYESAARLRDHLATAIEALWHGQRLRALAALPELIAAKPDGPREGGYQLAVIRHGQLA AAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIALIARWLAEPGVRIVGVS NDAAGLASPVRSAGPWAAWAATARSAQLAGEQLSRGWQSDLPTEPHPSREQLFGRTGV DCRTGPPQPLLPGRQPFSTAG" CDS complement(2439075..2440187) /codon_start=1 /transl_table=11 /gene="trpD" /locus_tag="BQ2027_MB2215C" /product="Probable anthranilate phosphoribosyltransferase TrpD" /note="Mb2215c, trpD, len: 370 aa. Equivalent to Rv2192c, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 370 aa overlap). Probable trpD, anthranilate phosphoribosyltransferase (EC 2.4.2.18) (see citation below), similar to e.g. TRPD_LACCA|P17170, (43.2% identity in 308 aa overlap). Initiation codon uncertain, gtg at 4086 in MTCY190 favoured by homology but this has no clear ribosome binding site. Protein product from Mb2215c detected using SWATH mass spectrometry. Mb2215c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66993" /db_xref="InterPro:IPR000312" /db_xref="InterPro:IPR005940" /db_xref="InterPro:IPR017459" /db_xref="InterPro:IPR035902" /db_xref="InterPro:IPR036320" /db_xref="UniProtKB/Swiss-Prot:P66993" /protein_id="SIU00823.1" /translation="MALSAEGSSGGSRGGSPKAEAASVPSWPQILGRLTDNRDLARGQ AAWAMDQIMTGNARPAQIAAFAVAMTMKAPTADEVGELAGVMLSHAHPLPADTVPDDA VDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSLSGGADTLEALGVRIDL GPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPTVFNLLGPLTNPARPRAGL IGCAFADLAEVMAGVFAARRSSVLVVHGDDGLDELTTTTTSTIWRVAAGSVDKLTFDP AGFGFARAQLDQLAGGDAQANAAAVRAVLGGARGPVRDAVVLNAAGAIVAHAGLSSRA EWLPAWEEGLRRASAAIDTGAAEQLLARWVRFGRQI" CDS 2440345..2440956 /codon_start=1 /transl_table=11 /gene="ctaE" /locus_tag="BQ2027_MB2216" /product="PROBABLE CYTOCHROME C OXIDASE (SUBUNIT III) CTAE" /note="Mb2216, ctaE, len: 203 aa. Equivalent to Rv2193, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Probable ctaE, cytochrome c oxidase polypeptide III (cox3) (EC 1.9.3.1), with strong similarity to others e.g. COX3_SYNY3|Q06475 (29.8% identity in 225 aa overlap). Protein product from Mb2216 detected using SWATH mass spectrometry. Mb2216 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63857" /db_xref="InterPro:IPR000298" /db_xref="InterPro:IPR013833" /db_xref="InterPro:IPR024791" /db_xref="InterPro:IPR035973" /db_xref="UniProtKB/Swiss-Prot:P63857" /protein_id="SIU00824.1" /translation="MTSAVGTSGTAITSRVHSLNRPNMVSVGTIVWLSSELMFFAGLF AFYFSARAQAGGNWPPPPTELNLYQAVPVTLVLIASSFTCQMGVFAAERGDIFGLRRW YVITFLMGLFFVLGQAYEYRNLMSHGTSIPSSAYGSVFYLATGFHGLHVTGGLIAFIF LLVRTGMSKFTPAQATASIVVSYYWHFVDIVWIALFTVIYFIR" CDS 2440997..2441839 /codon_start=1 /transl_table=11 /gene="qcrC" /locus_tag="BQ2027_MB2217" /product="Probable Ubiquinol-cytochrome C reductase QcrC (cytochrome C subunit)" /note="Mb2217, qcrC, len: 280 aa. Equivalent to Rv2194, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Probable qcrC, Ubiquinol-cytochrome C reductase cytochrome C subunit (cyoA), shows similarity to cytochrome c family; contains 2 X PS00190 Cytochrome c family heme-binding site signature. Protein product from Mb2217 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2217 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63888" /db_xref="InterPro:IPR009056" /db_xref="InterPro:IPR009152" /db_xref="InterPro:IPR036909" /db_xref="UniProtKB/Swiss-Prot:P63888" /protein_id="SIU00825.1" /translation="MTKLGFTRSGGSKSGRTRRRLRRRLSGGVLLLIALTIAGGLAAV LTPTPQVAVADESSSALLRTGKQLFDTSCVSCHGANLQGVPDHGPSLIGVGEAAVYFQ VSTGRMPAMRGEAQAPRKDPIFDEAQIDAIGAYVQANGGGPTVVRNPDGSIATQSLRG NDLGRGGDLFRLNCASCHNFTGKGGALSSGKYAPDLAPANEQQILTAMLTGPQNMPKF SNRQLSFEAKKDIIAYVKVATEARQPGGYLLGGFGPAPEGMAMWIIGMVAAIGLALWI GARS" CDS 2441836..2443125 /codon_start=1 /transl_table=11 /gene="qcrA" /locus_tag="BQ2027_MB2218" /product="Probable Rieske iron-sulfur protein QcrA" /note="Mb2218, qcrA, len: 429 aa. Equivalent to Rv2195, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 429 aa overlap). Probable qcrA, Ubiquinol-cytochrome C reductase iron-sulfur subunit (cyoB), shows some similarity to cytochrome B6-F complex iron-sulphur subunits (Rieske iron-sulfur protein); contains PS00200 Rieske iron-sulfur protein signature 2 Protein product from Mb2218 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2218 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYX4" /db_xref="InterPro:IPR014349" /db_xref="InterPro:IPR017941" /db_xref="InterPro:IPR036922" /db_xref="UniProtKB/Swiss-Prot:Q7TYX4" /protein_id="SIU00826.1" /translation="MSRADDDAVGVPPTCGGRSDEEERRIVPGPNPQDGAKDGAKATA VPREPDEAALAAMSNQELLALGGKLDGVRIAYKEPRWPVEGTKAEKRAERSVAVWLLL GGVFGLALLLIFLFWPWEFKAADGESDFIYSLTTPLYGLTFGLSILSIAIGAVLYQKR FIPEEISIQERHDGASREIDRKTVVANLTDAFEGSTIRRRKLIGLSFGVGMGAFGLGT LVAFAGGLIKNPWKPVVPTAEGKKAVLWTSGWTPRYQGETIYLARATGTEDGPPFIKM RPEDIDAGGMETVFPWRESDGDGTTVESHHKLQEIAMGIRNPVMLIRIKPSDLGRVVK RKGQESFNFGEFFAFTKVCSHLGCPSSLYEQQSYRILCPCHQSQFDALHFAKPIFGPA ARALAQLPITIDTDGYLVANGDFVEPVGPAFWERTTT" CDS 2443122..2444771 /codon_start=1 /transl_table=11 /gene="qcrB" /locus_tag="BQ2027_MB2219" /product="Probable Ubiquinol-cytochrome C reductase QcrB (cytochrome B subunit)" /note="Mb2219, qcrB, len: 549 aa. Equivalent to Rv2196, len: 549 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 549 aa overlap). Probable qcrB, Ubiquinol-cytochrome C reductase cytochrome B subunit (cytB), integral membrane protein, low similarity in amino-terminal half to cytochrome b subunits, highly similar at C-terminus to SW:12KD_MYCLE P15878 12 KD protein PIR:S08427 (86.9% identity in 153 aa overlap). FASTA scores: sp|Q45658|QCRB_BACST MENAQUINOL-CYTOCHROME C REDUCTASE (224 aa) opt: 341, E(): 6.8e-15; 28.0% identity in 207 aa overlap Protein product from Mb2219 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2219 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63886" /db_xref="InterPro:IPR005797" /db_xref="InterPro:IPR016174" /db_xref="InterPro:IPR027387" /db_xref="UniProtKB/Swiss-Prot:P63886" /protein_id="SIU00827.1" /translation="MSPKLSPPNIGEVLARQAEDIDTRYHPSAALRRQLNKVFPTHWS FLLGEIALYSFVVLLITGVYLTLFFDPSMVDVTYNGVYQPLRGVEMSRAYQSALDISF EVRGGLFVRQIHHWAALMFAAAIMVHLARIFFTGAFRRPRETNWVIGSLLLILAMFEG YFGYSLPDDLLSGLGLRAALSSITLGMPVIGTWLHWALFGGDFPGTILIPRLYALHIL LLPGIILALIGLHLALVWFQKHTQFPGPGRTEHNVVGVRVMPVFAFKSGAFFAAIVGV LGLMGGLLQINPIWNLGPYKPSQVSAGSQPDFYMMWTEGLARIWPPWEFYFWHHTIPA PVWVAVIMGLVFVLLPAYPFLEKRFTGDYAHHNLLQRPRDVPVRTAIGAMAIAFYMVL TLAAMNDIIALKFHISLNATTWIGRIGMVILPPFVYFITYRWCIGLQRSDRSVLEHGV ETGIIKRLPHGAYIELHQPLGPVDEHGHPIPLQYQGAPLPKRMNKLGSAGSPGSGSFL FADSAAEDAALREAGHAAEQRALAALREHQDSIMGSPDGEH" CDS complement(2445062..2445706) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2220C" /product="Probable conserved transmembrane protein" /note="Mb2220c, -, len: 214 aa. Equivalent to Rv2197c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 214 aa overlap). Probable conserved transmembrane protein, equivalent to ML0878 conserved hypothetical protein (212 aa) of Mycobacterium leprae. FASTA scores: opt: 858; 62.559% identity in 211 aa overlap CAC31259.1|(AL583920) Protein product from Mb2220c detected using SWATH mass spectrometry. Mb2220c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0G0" /db_xref="InterPro:IPR024381" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0G0" /protein_id="SIU00828.1" /translation="MVSRYSAYRRGPDVISPDVIDRILVGACAAVWLVFTGVSVAAAV ALMDLGRGFHEMAGNPHTTWVLYAVIVVSALVIVGAIPVLLRARRMAEAEPATRPTGA SVRGGRSIGSGHPAKRAVAESAPVQHADAFEVAAEWSSEAVDRIWLRGTVVLTSAIGI ALIAVAAATYLMAVGHDGPSWISYGLAGVVTAGMPVIEWLYSRQLRRVVAPQSS" CDS complement(2445706..2446605) /codon_start=1 /transl_table=11 /gene="mmpS3" /locus_tag="BQ2027_MB2221C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN MMPS3" /note="Mb2221c, mmpS3, len: 299 aa. Equivalent to Rv2198c, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). Probable mmpS3, conserved membrane protein (see citation below), equivalent to ML0877|mmpS3 putative membrane protein from Mycobacterium leprae (293 aa), FASTA scores: opt: 1089, E(): 1.2e-43, (69.80% identity in 308 aa overlap). Also similar to other proteins e.g. Rv3209 from Mycobacterium tuberculosis. Contains PS00499 C2 domain signature, a hydrophobic region, and a repetitive proline and threonine rich region. BELONGS TO THE MMPS FAMILY. Protein product from Mb2221c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2221c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65379" /db_xref="InterPro:IPR008693" /db_xref="InterPro:IPR038468" /db_xref="UniProtKB/Swiss-Prot:P65379" /protein_id="SIU00829.1" /translation="MSGPNPPGREPDEPESEPVSDTGDERASGNHLPPVAGGGDKLPS DQTGETDAYSRAYSAPESEHVTGGPYVPADLRLYDYDDYEESSDLDDELAAPRWPWVV GVAAIIAAVALVVSVSLLVTRPHTSKLATGDTTSSAPPVQDEITTTKPAPPPPPPAPP PTTEIPTATETQTVTVTPPPPPPPATTTAPPPATTTTAAAPPPTTTTPTGPRQVTYSV TGTKAPGDIISVTYVDAAGRRRTQHNVYIPWSMTVTPISQSDVGSVEASSLFRVSKLN CSITTSDGTVLSSNSNDGPQTSC" CDS complement(2446791..2447210) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2222C" /product="Possible conserved integral membrane protein" /note="Mb2222c, -, len: 139 aa. Equivalent to Rv2199c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Possible conserved integral membrane protein, similar to hypothetical membrane proteins in Actinomycetes and equivalent to Mycobacterium leprae, ML0876, putative membrane protein (139 aa) FASTA scores: opt: 866, E(): 1.1e-43; 91.367% identity in 139 aa overlap CAC31257.1| (AL583920),Mb2222c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64948" /db_xref="InterPro:IPR021050" /db_xref="UniProtKB/Swiss-Prot:P64948" /protein_id="SIU00830.1" /translation="MHIEARLFEFVAAFFVVTAVLYGVLTSMFATGGVEWAGTTALAL TGGMALIVATFFRFVARRLDSRPEDYEGAEISDGAGELGFFSPHSWWPIMVALSGSVA AVGIALWLPWLIAAGVAFILASAAGLVFEYYVGPEKH" CDS complement(2447218..2448309) /codon_start=1 /transl_table=11 /gene="ctaC" /locus_tag="BQ2027_MB2223C" /product="PROBABLE TRANSMEMBRANE CYTOCHROME C OXIDASE (SUBUNIT II) CTAC" /note="Mb2223c, ctaC, len: 363 aa. Equivalent to Rv2200c, len: 363 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 363 aa overlap). Probable ctaC, transmembrane cytochrome C oxidase (subunit II), COX2, similar e.g. to JT0964 cytochrome-c oxidase chain II (23.0% identity in 317 aa overlap); etc. Contains PS00078 Cytochrome c oxidase subunit II, copper A binding region signature. BELONGS TO THE CYTOCHROME C OXIDASE SUBUNIT 2 FAMILY. Protein product from Mb2223c detected using shotgun mass spectrometry. Mb2223c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63855" /db_xref="InterPro:IPR001505" /db_xref="InterPro:IPR002429" /db_xref="InterPro:IPR008972" /db_xref="InterPro:IPR036257" /db_xref="UniProtKB/Swiss-Prot:P63855" /protein_id="SIU00831.1" /translation="MTPRGPGRLQRLSQCRPQRGSGGPARGLRQLALAAMLGALAVTV SGCSWSEALGIGWPEGITPEAHLNRELWIGAVIASLAVGVIVWGLIFWSAVFHRKKNT DTELPRQFGYNMPLELVLTVIPFLIISVLFYFTVVVQEKMLQIAKDPEVVIDITSFQW NWKFGYQRVNFKDGTLTYDGADPERKRAMVSKPEGKDKYGEELVGPVRGLNTEDRTYL NFDKVETLGTSTEIPVLVLPSGKRIEFQMASADVIHAFWVPEFLFKRDVMPNPVANNS VNVFQIEEITKTGAFVGHCAEMCGTYHSMMNFEVRVVTPNDFKAYLQQRIDGKTNAEA LRAINQPPLAVTTHPFDTRRGELAPQPVG" CDS 2448555..2450513 /codon_start=1 /transl_table=11 /gene="asnB" /locus_tag="BQ2027_MB2224" /product="Probable asparagine synthetase AsnB" /note="Mb2224, asnB, len: 652 aa. Equivalent to Rv2201, len: 652 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 652 aa overlap). Probable asnB, asparagine synthetase, similar to e.g. SW:ASNH_BACSU P42113 putative asparagine synthetase (26.0% identity in 438 aa overlap) Protein product from Mb2224 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2224 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64248" /db_xref="InterPro:IPR001962" /db_xref="InterPro:IPR006426" /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR029055" /db_xref="InterPro:IPR033738" /db_xref="UniProtKB/Swiss-Prot:P64248" /protein_id="SIU00832.1" /translation="MCGLLAFVAAPAGAAGPEGADAASAIARASHLMRHRGPDESGTW HAVDGASGGVVFGFNRLSIIDIAHSHQPLRWGPPEAPDRYVLVFNGEIYNYLELRDEL RTQHGAVFATDGDGEAILAGYHHWGTEVLQRLRGMFAFALWDTVTRELFCARDPFGIK PLFIATGAGGTAVASEKKCLLDLVELVGFDTEIDHRALQHYTVLQYVPEPETLHRGVR RLESGCFARIRADQLAPVITRYFVPRFAASPITNDNDQARYDEITAVLEDSVAKHMRA DVTVGAFLSGGIDSTAIAALAIRHNPRLITFTTGFEREGFSEIDVAVASAEAIGARHI AKVVSADEFVAALPEIVWYLDEPVADPALVPLFFVAREARKHVKVVLSGEGADELFGG YTIYREPLSLRPFDYLPKPLRRSMGKVSKPLPEGMRGKSLLHRGSLTLEERYYGNARS FSGAQLREVLPGFRPDWTHTDVTAPVYAESAGWDPVARMQHIDLFTWLRGDILVKADK ITMANSLELRVPFLDPEVFAVASRLPAGAKITRTTTKYALRRALEPIVPAHVLHRPKL GFPVPIRHWLRAGELLEWAYATVGSSQAGHLVDIAAVYRMLDEHRCGSSDHSRRLWTM LIFMLWHAIFVEHSVVPQISEPQYPVQL" CDS complement(2450611..2451585) /codon_start=1 /transl_table=11 /gene="adok" /locus_tag="BQ2027_MB2225C" /product="adenosine kinase" /note="Mb2225c, cbhK, len: 324 aa. Equivalent to Rv2202c, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 324 aa overlap). Probable cbhK, carbohydrate kinase (but not ribose) (EC 2.7.-.-), similar to several e.g. AE000915_1 Methanobacterium thermoautotrop (309 aa) FASTA score: opt: 370, E(): 3.3e-18; 31.2% identity in 276 aa overlap. Low similarity to carbohydrate kinases, e.g. SW:RBSK_BACSU P36945 ribokinase (23.9% identity in 272 aa overlap); contains PS00583 pfkB family of carbohydrate kinases signature 1 Protein product from Mb2225c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2225c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P83736" /db_xref="InterPro:IPR002173" /db_xref="InterPro:IPR011611" /db_xref="InterPro:IPR029056" /db_xref="PDB:4UBE" /db_xref="UniProtKB/Swiss-Prot:P83736" /protein_id="SIU00833.1" /translation="MTIAVTGSIATDHLMRFPGRFSEQLLPEHLHKVSLSFLVDDLVM HRGGVAGNMAFAIGVLGGEVALVGAAGADFADYRDWLKARGVNCDHVLISETAHTARF TCTTDVDMAQIASFYPGAMSEARNIKLADVVSAIGKPELVIIGANDPEAMFLHTEECR KLGLAFAADPSQQLARLSGEEIRRLVNGAAYLFTNDYEWDLLLSKTGWSEADVMAQID LRVTTLGPKGVDLVEPDGTTIHVGVVPETSQTDPTGVGDAFRAGFLTGRSAGLGLERS AQLGSLVAVLVLESTGTQEWQWDYEAAASRLAGAYGEHAAAEIVAVLA" CDS 2451789..2452481 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2226" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2226, -, len: 230 aa. Equivalent to Rv2203, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). Possible conserved membrane protein; has single hydrophobic stretch from aa 75 to 97 and is equivalent to Mycobacterium leprae ML0872 putative membrane protein (171 aa). FASTA scores: opt: 821, E(): 3.4e-42; 72.353% identity in 170 aa overlap -CAC31253.1| (AL583920). 2468411. Protein product from Mb2226 detected using SWATH mass spectrometry. Mb2226 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64950" /db_xref="UniProtKB/Swiss-Prot:P64950" /protein_id="SIU00834.1" /translation="MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFA PGPADDAALPPAAYPGVPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGA NTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDA FRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVL VCSYVLRTAGSY" CDS complement(2452489..2452845) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2227C" /product="Iron-sulfur cluster insertion protein SCO2161" /note="Mb2227c, -, len: 118 aa. Equivalent to Rv2204c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Conserved hypothetical protein. Similar to conserved hypothetical proteins in Actinomycetes and equivalent to Mycobacterium leprae ML0871|ML0871 conserved hypothetical protein (118 aa) and to sp|P45344|YADR_HAEIN HYPOTHETICAL PROTEIN HI1723 (114 aa). FASTA score: ML0871 opt: 720, E(): 8.4e-45; 92.373% identity in 118 aa overlapCAC31252.1| (AL583920); and P45344 opt: 346, E(): 1.8e-18; 45.6% identity in 103 aa overlap. Contains PS01152 Hypothetical hesB/y yadR/yfhF family signature Protein product from Mb2227c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2227c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5B0" /db_xref="InterPro:IPR000361" /db_xref="InterPro:IPR016092" /db_xref="InterPro:IPR017870" /db_xref="InterPro:IPR035903" /db_xref="UniProtKB/Swiss-Prot:P0A5B0" /protein_id="SIU00835.1" /translation="MTVQNEPSAKTHGVILTEAAAAKAKSLLDQEGRDDLALRIAVQP GGCAGLRYNLFFDDRTLDGDQTAEFGGVRLIVDRMSAPYVEGASIDFVDTIEKQGFTI DNPNATGSCACGDSFN" CDS complement(2452945..2454021) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2228C" /product="Glycerate kinase (EC" /EC_number="2.7.1.31" /note="Mb2228c, -, len: 358 aa. Equivalent to Rv2205c, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 358 aa overlap). Conserved hypothetical protein. Very similar to YHAD_ECOLI|P23524 hypothetical protein (YHAD (E.coli) / YXAA (S14A) (B.subtilis) family) (41.6% identity in 154 aa overlap), and to other members of the glycerate kinase family. Start changed since first submission; protein now 122 aa shorter, owing to extension of Rv2206. Mb2228c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64289" /db_xref="InterPro:IPR004381" /db_xref="InterPro:IPR018193" /db_xref="InterPro:IPR018197" /db_xref="InterPro:IPR036129" /db_xref="UniProtKB/Swiss-Prot:P64289" /protein_id="SIU00836.1" /translation="MRVLVAPDCYGDSLSAVEAAAAIATGWTRSRPGDSFIVAPQSDG GPGFVEVLGSRLGETRRLRVCGPLNTVVNAAWVFDPGSATAYLECAQACGLGLLGGPP TPETALAAHSKGVGQLIAAALRAGAARIVVGLGGSACTDGGKGMIAELGGLDAARRQL ADVEVIAASDVEYPLLGPWGTARVFAPQKGADMATVAVLEGRLAAWAIELDAAAGRGV SAEPGAGAAGGIGAGLLAVGGRYQSGAAIIAEHTHFADDLADAELIVTGEGRFDEQSL HGKVVGAIAAAARPLAIPVIVLAGQVSLDKSALRSAGIMAALSIAEYAGSVRLALADA ANQLMGLASQVAARLGNSGPSGYR" CDS 2454180..2454890 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2229" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2229, -, len: 236 aa. Equivalent to Rv2206, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Probable conserved transmembrane protein. Equivalent to hypothetical protein ML0869 (247 aa) of Mycobacterium leprae gZ98741|MLCB22_2 (247 aa), FASTA scores: opt: 1052, (67.5% identity in 237 aa overlap). Two hydrophobic stretches in C-terminal part. Start changed since original submission (+112 aa). Protein product from Mb2229 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2229 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64952" /db_xref="InterPro:IPR021403" /db_xref="UniProtKB/Swiss-Prot:P64952" /protein_id="SIU00837.1" /translation="MKLLGHRKSHGHQRADASPDAGSKDGCRPDSGRTSGSDTSRGSQ TTGPKGRPTPKRNQSRRHTKKGPVAPAPMTAAQARARRKSLAGPKLSREERRAEKAAN RARMTERRERMMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPSALTLLFVMFA VPQVQFYLSPAMLILLALMTIDAIILGRKVGRLVDTKFPSNTESRWRLGLYAAGRASQ IRRLRAPRPQVERGGDVG" CDS 2454969..2456054 /codon_start=1 /transl_table=11 /gene="cobT" /locus_tag="BQ2027_MB2230" /product="Probable nicotinate-nucleotide- dimethylbenzimidazol phosphoribosyltransferase CobT" /note="Mb2230, cobT, len: 361 aa. Equivalent to Rv2207, len: 361 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 361 aa overlap). Probable cobT, phosphoribosyltransferase, similar to many e.g. SW:COBT_ECOLI P36562 nicotinate-nucleotide--dimethylbenzimidazol phosphoribosyltransferase (34.6% identity in 341 aa overlap) Protein product from Mb2230 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2230 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63842" /db_xref="InterPro:IPR003200" /db_xref="InterPro:IPR017846" /db_xref="InterPro:IPR023195" /db_xref="InterPro:IPR036087" /db_xref="UniProtKB/Swiss-Prot:P63842" /protein_id="SIU00838.1" /translation="MIGFAPVSTPDAAAEAAARARQDSLTKPRGALGSLEDLSVWVAS CQQRCPPRQFERARVVVFAGDHGVARSGVSAYPPEVTAQMVANIDAGGAAINALADVA GATVRVADLAVDADPLSERIGAHKVRRGSGNIATEDALTNDETAAAITAGQQIADEEV DAGADLLIAGDMGIGNTTAAAVLVAALTDAEPVAVVGFGTGIDDAGWARKTAAVRDAL FRVRPVLPDPVGLLRCAGGADLAAIAGFCAQAAVRRTPLLLDGVAVTAAALVAERLAP GAHRWWQAGHRSSEPGHGLALAALGLDPIVDLHMRLGEGTGAAVALMVLRAAVAALSS MATFTEAGVSTRSVDGVDRTAPPAVSP" CDS 2456051..2456800 /codon_start=1 /transl_table=11 /gene="cobS" /locus_tag="BQ2027_MB2231" /product="Probable cobalamin 5'-phosphate synthase CobS" /note="Mb2231, cobS, len: 249 aa. Equivalent to Rv2208, len: 249 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 249 aa overlap). Probable cobS, cobalamin 5'-phosphate synthase; similarity to SW:COBS_ECOLI P36561 cobalamin (5'-phosphate) synthase (28.0% identity in 243 aa overlap),Mb2231 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEN6" /db_xref="InterPro:IPR003805" /db_xref="UniProtKB/Swiss-Prot:Q7VEN6" /protein_id="SIU00839.1" /translation="MMRSLATAFAFATVIPTPGSATTPMGRGPMTALPVVGAALGALA AAIAWAGAQVFGPSSPLSGMLTVAVLLVVTRGLHIDGVADTADGLGCYGPPQRALAVM RDGSTGPFGVAAVVLVIALQGLAFATLTTVGIAGITLAVLSGRVTAVLVCRRSVPAAH GSTLGSRVAGTQPAPVVAAWLAVLLAVSVPAGPRPWQGPIAVLVAVTAGAALAAHCVH RFGGVTGDVLGSAIELSTTVSAVTLAGLARL" CDS 2456958..2458496 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2232" /product="Probable conserved integral membrane protein" /note="Mb2232, -, len: 512 aa. Equivalent to Rv2209, len: 512 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 512 aa overlap). Probable conserved integral membrane protein, similar to but longer than Rv0246 gp|AL021929|MTV 034_12 Mycobacterium tuberculosis (436 aa). FASTA score: opt: 712, E(): 2.8e- 32; 33.4% identity in 422 aa overlap,Mb2232 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64954" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/Swiss-Prot:P64954" /protein_id="SIU00840.1" /translation="MPASRLVRQVSAPRNLFGRLVAQGGFYTAGLQLGSGAVVLPVIC AHQGLTWAAGLLYPAFCIGAILGNSLSPLILQRAGQLRHLLMAAISATAAALVVCNAA VPWTGVGVAAVFLATTGAGGVVTGVSSVAYTDMISSMLPAVRRGELLLTQGAAGSVLA TGVTLVIVPMLAHGNEMARYHDLLWLGAAGLVCSGIAALFVGPMRSVSVTTATRMPLR EIYWMGFAIARSQPWFRRYMTTYLLFVPISLGTTFFSLRAAQSNGSLHVLVILSSIGL VVGSMLWRQINRLFGVRGLLLGSALLNAAAALLCMVAESCGQWVHAWAYGTAFLLATV AAQTVVAASISWISVLAPERYRATLICVGSTLAAVEATVLGVALGGIAQKHATIWPVV VVLTLAVIAAVASLRAPTRIGVTADTSPQAATLQAYRPATPNPIHSDERSTPPDHLSV RRGQLRHVWDSRRPAPPLNRPSCRRAARRPAPGKPAAALPQPRHPAVGVREGAPLDAG QRIA" CDS complement(2458422..2459528) /codon_start=1 /transl_table=11 /gene="ilvE" /locus_tag="BQ2027_MB2233C" /product="branched-chain amino acid transaminase ilve" /note="Mb2233c, ilvE, len: 368 aa. Equivalent to Rv2210c, len: 368 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 368 aa overlap). Probable ilvE, Branched-chain-amino-acid transaminase, highly similar to many e.g. YWAA_BACSU|P39576 from Bacillus subtilis (48.4% identity in 339 aa overlap); etc. Protein product from Mb2233c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2233c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0I1" /db_xref="InterPro:IPR001544" /db_xref="InterPro:IPR005786" /db_xref="InterPro:IPR018300" /db_xref="InterPro:IPR033939" /db_xref="InterPro:IPR036038" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I1" /protein_id="SIU00841.1" /translation="MTSGSLQFTVLRAVNPATDAQRESMLRAPGFGKYHTDHMVSIDY AEGRGWHNARVIPYGPIELDPSAIVLHYAQEVFEGLKAYRWADGSIVSFRADANAARL RSSARRLAIPELPDAVFIESLRQLIAVDKAWVPGAGGEEALYLRPFIFATEPGLGVRP ATQYRYLLIASPAGAYFKGGIAPVSVWVSTEYVRACPGGTGAAKFGGNYAASLLAQAE AAENGCDQVVWLDAVERRYIEEMGGMNIFFVLGSGGSARLVTPELSGSLLPGITRDSL LQLAIDAGFAVEERRIDIDEWQKKAAAGEITEVFACGTAAVITPVARVRHGASEFRIA DGQPGEVTMALRDTLTGIQRGTFADTHGWMARLG" CDS complement(2459600..2460739) /codon_start=1 /transl_table=11 /gene="gcvT" /locus_tag="BQ2027_MB2234C" /product="Probable aminomethyltransferase GcvT (Glycine cleavage system T protein)" /note="Mb2234c, gcvT, len: 379 aa. Equivalent to Rv2211c, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 379 aa overlap). Probable gcvT, aminomethyltransferase (EC 2.1.2.10), similar to many e.g. GCST_ECOLI|P27248 for Escherichia coli (38.2% identity in 364 aa overlap); etc. BELONGS TO THE GCVT FAMILY. Protein product from Mb2234c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2234c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64221" /db_xref="InterPro:IPR006222" /db_xref="InterPro:IPR006223" /db_xref="InterPro:IPR013977" /db_xref="InterPro:IPR022903" /db_xref="InterPro:IPR027266" /db_xref="InterPro:IPR028896" /db_xref="InterPro:IPR029043" /db_xref="UniProtKB/Swiss-Prot:P64221" /protein_id="SIU00842.1" /translation="MCQQGRPLGWDAVSDVPELIHGPLEDRHRELGASFAEFGGWLMP VSYAGTVSEHNATRTAVGLFDVSHLGKALVRGPGAAQFVNSALTNDLGRIGPGKAQYT LCCTESGGVIDDLIAYYVSDDEIFLVPNAANTAAVVGALQAAAPGGLSITNLHRSYAV LAVQGPCSTDVLTALGLPTEMDYMGYADASYSGVPVRVCRTGYTGEHGYELLPPWESA GVVFDALLAAVSAAGGEPAGLGARDTLRTEMGYPLHGHELSLDISPLQARCGWAVGWR KDAFFGRAALLAEKAAGPRRLLRGLRMVGRGVLRPGLAVLVGDETVGVTTSGTFSPTL QVGIGLALIDSDAGIEDGQQINVDVRGRAVECQVVCPPFVAVKTR" CDS 2460748..2461884 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2235" /product="adenylyl cyclase (atp pyrophosphate-lyase) (adenylate cyclase)" /note="Mb2235, -, len: 378 aa. Equivalent to Rv2212, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 378 aa overlap). Conserved hypothetical protein. Some similarity to adenylate cyclases, e.g. SW:CYAA_STRCO P40135 (29.2% identity in 291 aa overlap); ttg at 24614 in MTCY190 has a better rbs. Contains possible helix-turn-helix motif at aa 64- 85, (+2.72 SD). Also similar to Rv1264 and Rv1647 Protein product from Mb2235 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2235 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64266" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR029787" /db_xref="InterPro:IPR032026" /db_xref="UniProtKB/Swiss-Prot:P64266" /protein_id="SIU00843.1" /translation="MYDSLDFDALEAAGIANPRERAGLLTYLDELGFTVEEMVQAERR GRLFGLAGDVLLWSGPPIYTLATAADELGLSADDVARAWSLLGLTVAGPDVPTLSQAD VDALATWVALKALVGEDGAFGLLRVLGTAMARLAEAESTMIRAGSPNIQMTHTHDELA TARAYRAAAEFVPRIGALIDTVHRHHLASARTYFEGVIGDTSASVTCGIGFADLSSFT ALTQALTPAQLQDLLTEFDAAVTDVVHADGGRLVKFIGDAVMWVSSSPERLVRAAVDL VDHPGARAAELQVRAGLAYGTVLALNGDYFGNPVNLAARLVAAAAPGQILAAAQLRDM LPDWPALAHGPLTLKGFDAPVMAFELHDNPRARDADTPSPAASD" CDS 2461896..2463443 /codon_start=1 /transl_table=11 /gene="pepB" /locus_tag="BQ2027_MB2236" /product="Probable aminopeptidase PepB" /note="Mb2236, pepB, len: 515 aa. Equivalent to Rv2213, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 515 aa overlap). Probable pepB, leucine aminopeptidase, similar to many e.g. SW:AMPA_ECOLI P11648 aminopeptidase A/I, (41.4% identity in 309 aa overlap). Equivalent to Z98741|MLCB22_6 Mycobacterium leprae cosmid B22; Am (524 aa), FASTA scores: opt: 2793, E(): 0; 83.1% identity in 522 aa overlap. Contains PS00631 Cytosol aminopeptidase signature, NTDAEGRL Protein product from Mb2236 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2236 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEN5" /db_xref="InterPro:IPR000819" /db_xref="InterPro:IPR008283" /db_xref="InterPro:IPR011356" /db_xref="InterPro:IPR023042" /db_xref="UniProtKB/Swiss-Prot:Q7VEN5" /protein_id="SIU00844.1" /translation="MTTEPGYLSPSVAVATSMPKRGVGAAVLIVPVVSTGEEDRPGAV VASAEPFLRADTVAEIEAGLRALDATGASDQVHRLAVPSLPVGSVLTVGLGKPRREWP ADTIRCAAGVAARALNSSEAVITTLAELPGDGICSATVEGLILGSYRFSAFRSDKTAP KDAGLRKITVLCCAKDAKKRALHGAAVATAVATARDLVNTPPSHLFPAELAKRAKTLS ESVGLDVEVIDEKALKKAGYGGVIGVGQGSSRPPRLVRLIHRGSRLAKNPQKAKKVAL VGKGITFDTGGISIKPAASMHHMTSDMGGAAAVIATVTLAARLRLPIDVIATVPMAEN MPSATAQRPGDVLTQYGGTTVEVLNTDAEGRLILADAIVRACEDKPDYLIETSTLTGA QTVALGTRIPGVMGSDEFRDRVAAISQRVGENGWPMPLPDDLKDDLKSTVADLANVSG QRFAGMLVAGVFLREFVAESVDWAHIDVAGPAYNTGSAWGYTPKGATGVPTRTMFAVL EDIAKNG" CDS complement(2463481..2465259) /codon_start=1 /transl_table=11 /gene="ephD" /locus_tag="BQ2027_MB2237C" /product="Possible short-chain dehydrogenase EphD" /note="Mb2237c, ephD, len: 592 aa. Equivalent to Rv2214c, len: 592 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 592 aa overlap). Possible ephD, short-chain dehydrogenase (EC 1.-.-.-) (see citation below), equivalent to Z98741|MLCB22_8 Mycobacterium leprae cosmid B22; (596 aa), FASTA score: opt: 3262, E(): 0; 80.4% identity in 596 aa overlap. C-terminus similar to short-chain alcohol dehydrogenase family, similar to SW:LIGD_PSEPA Q01198 c alpha-dehydrogenase (30.7% identity in 241 aa overlap); contains PS00061 Short-chain alcohol dehydrogenase family signature, PS00697 ATP-dependent DNA ligase AMP-binding site. N-terminus corresponds to several epoxide hydrolases of plants and Mycobacterium tuberculosis e.g. MTCY9F925 Protein product from Mb2237c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2237c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66778" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P66778" /protein_id="SIU00845.1" /translation="MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVL WDGVVPLLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPVHV LAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRPWRPRTFLRAI SQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIPVDQIHHSETLARDAAHSV KTYPANYFRSFSSSRRGRAIPIVDVPVQLIVNSQDPYVRPYGYDQTARWVPRLWRRDI KAGHFSPMSHPQVMAAAVHDFADLADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGS GIGRETALAFAREGAEIVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFA ERVSAEHGVPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLTTICPGVID TNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVADAIVSAVKKKKPIRPVA PEAYALYGISRVLPQALRSTARLRVI" CDS 2465523..2467184 /codon_start=1 /transl_table=11 /gene="dlat" /locus_tag="BQ2027_MB2238" /product="dlat, dihydrolipoamide acyltransferase, e2 component of pyruvate dehydrogenase" /note="Mb2238, sucB, len: 553 aa. Equivalent to Rv2215, len: 553 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 553 aa overlap). Probable sucB, dihydrolipoamide acetyltransferase component (E2), similar to e.g. SW:O PD2_ACHLA P35489 dihydrolipoamide acetyltransferase component (E2) of pyruvate dehydrogenase complex (35.3% identity in 552 aa overlap); contains PS00189 2-oxo acid dehydrogenases acyltransferase component lipoyl binding site. Protein product from Mb2238 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2238 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65634" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR001078" /db_xref="InterPro:IPR003016" /db_xref="InterPro:IPR004167" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR014276" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR036625" /db_xref="UniProtKB/Swiss-Prot:P65634" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00846.1" /translation="MAFSVQMPALGESVTEGTVTRWLKQEGDTVELDEPLVEVSTDKV DTEIPSPAAGVLTKIIAQEDDTVEVGGELAVIGDAKDAGEAAAPAPEKVPAAQPESKP APEPPPVQPTSGAPAGGDAKPVLMPELGESVTEGTVIRWLKKIGDSVQVDEPLVEVST DKVDTEIPSPVAGVLVSISADEDATVPVGGELARIGVAADIGAAPAPKPAPKPVPEPA PTPKAEPAPSPPAAQPAGAAEGAPYVTPLVRKLASENNIDLAGVTGTGVGGRIRKQDV LAAAEQKKRAKAPAPAAQAAAAPAPKAPPAPAPALAHLRGTTQKASRIRQITANKTRE SLQATAQLTQTHEVDMTKIVGLRARAKAAFAEREGVNLTFLPFFAKAVIDALKIHPNI NASYNEDTKEITYYDAEHLGFAVDTEQGLLSPVIHDAGDLSLAGLARAIADIAARARS GNLKPDELSGGTFTITNIGSQGALFDTPILVPPQAAMLGTGAIVKRPRVVVDASGNES IGVRSVCYLPLTYDHRLIDGADAGRFLTTIKHRLEEGAFEADLGL" CDS 2467184..2468089 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2239" /product="Cell division inhibitor SulA" /note="Mb2239, -, len: 301 aa. Equivalent to Rv2216, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Conserved hypothetical protein, equivalent to Mycobacterium leprae ML0860 (307 aa), Z98741|MLCB22_10 Mycobacterium leprae cosmid B22; H (307 aa). FASTA score: opt: 1656, E(): 0; 84.2% identity in 297 aa overlap. Also gp|AE000319|ECAE000319_8 Escherichia coli strain K12 MG1655 (297 aa) opt: 640, E(): 0; 39.5% identity in 294 aa overlap. Protein product from Mb2239 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2239 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67233" /db_xref="InterPro:IPR001509" /db_xref="InterPro:IPR010099" /db_xref="InterPro:IPR013549" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P67233" /protein_id="SIU00847.1" /translation="MANAVVAIAGSSGLIGSALTAALRAADHTVLRIVRRAPANSEEL HWNPESGEFDPHALTDVDAVVNLCGVNIAQRRWSGAFKQSLRDSRITPTEVLSAAVAD AGVATLINASAVGYYGNTKDRVVDENDSAGTGFLAQLCVDWETATRPAQQSGARVVLA RTGVVLSPAGGMLRRMRPLFSVGLGARLGSGRQYMSWISLEDEVRALQFAIAQPNLSG PVNLTGPAPVTNAEFTTAFGRAVNRPTPLMLPSVAVRAAFGEFADEGLLIGQRAIPSA LERAGFQFHHNTIGEALGYATTRPG" CDS 2468142..2468834 /codon_start=1 /transl_table=11 /gene="lipB" /locus_tag="BQ2027_MB2240" /product="Probable lipoate biosynthesis protein B LipB" /note="Mb2240, lipB, len: 230 aa. Equivalent to Rv2217, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 230 aa overlap). Probable lipB, similar to SW:LIPB_ECOLI P30976 liopate biosynthesis protein B (33.8% identity in 160 aa overlap). Equivalent to gp|Z98741| MLCB22_11 Mycobacterium leprae (235 aa). FASTA score: opt: 1124, E(): 0; 78.4% identity in 218 aa overlap Protein product from Mb2240 detected using SWATH mass spectrometry. Mb2240 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7VEN4" /db_xref="InterPro:IPR000544" /db_xref="InterPro:IPR004143" /db_xref="InterPro:IPR020605" /db_xref="UniProtKB/Swiss-Prot:Q7VEN4" /protein_id="SIU00848.1" /translation="MTGSIRSKLSAIDVRQLGTVDYRTAWQLQRELADARVAGGADTL LLLEHPAVYTAGRRTETHERPIDGTPVVGTDRGGKITWHGPGQLVGYPIIGLAEPLDV VNYVRRLEESLIQVCADLGLHAGRVDGRSGVWLPGRPARKVAAIGVRVSRATTLHGFA LNCDCDLAAFTAIVPCGISDAAVTSLSAELGRTVTVDEVRATVAAAVCAALDGVLPVG DRVPSHAVPSPL" CDS 2468831..2469766 /codon_start=1 /transl_table=11 /gene="lipA" /locus_tag="BQ2027_MB2241" /product="Probable lipoate biosynthesis protein A LipA" /note="Mb2241, lipA, len: 311 aa. Equivalent to Rv2218, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 311 aa overlap). Probable lipA, lipoic acid synthetase, similar to e.g. SW:LIPA_HAEIN P44463 (42 .6% identity in 291 aa overlap). Equivalent to Z98741|MLCB2 2_12 Mycobacterium leprae cosmid B22; (314 aa). FASTA score : opt: 1836, E(): 0; 86.8% identity in 310 aa overlap,Mb2241 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65284" /db_xref="InterPro:IPR003698" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR031691" /db_xref="UniProtKB/Swiss-Prot:P65284" /protein_id="SIU00849.1" /translation="MSVAAEGRRLLRLEVRNAQTPIERKPPWIKTRARIGPEYTELKN LVRREGLHTVCEEAGCPNIFECWEDREATFLIGGDQCTRRCDFCQIDTGKPAELDRDE PRRVADSVRTMGLRYATVTGVARDDLPDGGAWLYAATVRAIKELNPSTGVELLIPDFN GEPTRLAEVFESGPEVLAHNVETVPRIFKRIRPAFTYRRSLGVLTAARDAGLVTKSNL ILGLGETSDEVRTALGDLRDAGCDIVTITQYLRPSARHHPVERWVKPEEFVQFARFAE GLGFAGVLAGPLVRSSYRAGRLYEQARNSRALASR" CDS 2469793..2470545 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2242" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2242, -, len: 250 aa. Equivalent to Rv2219, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 250 aa overlap). Probable conserved transmembrane protein. Equivalent to hypothetical membrane protein ML0857 (250 aa) from Mycobacterium leprae Z98741 |MLCB22_13 Mycobacterium leprae cosmid B22; H (250 aa) opt : 1328, E(): 0; 80.8% identity in 250 aa overlap. Protein product from Mb2242 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2242 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0L2" /db_xref="InterPro:IPR025445" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0L2" /protein_id="SIU00850.1" /translation="MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDK RLLPYMIGAFLLIVGASVGVGVWAGGFTMLTMIPLGVLLGALVAFVIFGRRAQRTVYR KAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVGEGSAARVKPL LAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTRLPANITVKQMDTVESRLA ALGSRAGAGVMPKGPLPTTAKMRSVQRTVRRK" CDS complement(2470552..2470974) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2243C" /product="probable conserved membrane protein" /note="Mb2243c, -, len: 140 aa. Equivalent to Rv2219A, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 140 aa overlap). Probable membrane protein, similar to SC3H12.05c|AL355740_5 possible integral membrane protein from Streptomyces coelicolor (155 aa), FASTA scores: opt: 327, E(): 7.5e-14, (46.6% identity in 133 aa overlap), also linked to glnA. Protein product from Mb2243c detected using SWATH mass spectrometry. Mb2243c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0I9" /db_xref="InterPro:IPR010432" /db_xref="InterPro:IPR016795" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0I9" /protein_id="SIU00851.1" /translation="MTAKSPPDYPGKTLGLPDTGPGSLAPMGRRLAALLIDWLIAYGL ALLGVEFGVWSTPMLSTVVLVIWLLLGVAAVRLFGFTPGQLMLGLVVVAVGGRRPVGI GRLVVRGLLIGLVVPPLFTDSDGRGLHDRLTATAVVRR" CDS 2471173..2472609 /codon_start=1 /transl_table=11 /gene="glnA1" /locus_tag="BQ2027_MB2244" /standard_name="glnA" /product="GLUTAMINE SYNTHETASE GLNA1 (GLUTAMINE SYNTHASE) (GS-I)" /note="Mb2244, glnA1, len: 478 aa. Equivalent to Rv2220, len: 478 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 478 aa overlap). glnA1, glutamine synthetase class I (EC 6.3.1.2) (see first citation below), similar to many e.g. GLNA_STRCO|P15106 from Streptomyces coelicolor, FASTA score: (71.4% identity in 475 aa overlap); etc. Also similar to three other potential glutamine synthetases in Mycobacterium tuberculosis: Rv2222c|glnA2, Rv2860c|glnA4, and Rv1878|glnA3. Contains PS00180 Glutamine synthetase signature 1, PS00181 Glutamine synthetase putative ATP-binding region signature, and PS00182 Glutamine synthetase class-I adenylation site. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY. Protein product from Mb2244 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2244 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A591" /db_xref="InterPro:IPR001637" /db_xref="InterPro:IPR004809" /db_xref="InterPro:IPR008146" /db_xref="InterPro:IPR008147" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR027302" /db_xref="InterPro:IPR027303" /db_xref="InterPro:IPR036651" /db_xref="UniProtKB/Swiss-Prot:P0A591" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00852.1" /translation="MTEKTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDK SVFDDGLAFDGSSIRGFQSIHESDMLLLPDPETARIDPFRAAKTLNINFFVHDPFTLE PYSRDPRNIARKAENYLISTGIADTAYFGAEAEFYIFDSVSFDSRANGSFYEVDAISG WWNTGAATEADGSPNRGYKVRHKGGYFPVAPNDQYVDLRDKMLTNLINSGFILEKGHH EVGSGGQAEINYQFNSLLHAADDMQLYKYIIKNTAWQNGKTVTFMPKPLFGDNGSGMH CHQSLWKDGAPLMYDETGYAGLSDTARHYIGGLLHHAPSLLAFTNPTVNSYKRLVPGY EAPINLVYSQRNRSACVRIPITGSNPKAKRLEFRSPDSSGNPYLAFSAMLMAGLDGIK NKIEPQAPVDKDLYELPPEEAASIPQTPTQLSDVIDRLEADHEYLTEGGVFTNDLIET WISFKRENEIEPVNIRPHPYEFALYYDV" CDS complement(2472927..2475911) /codon_start=1 /transl_table=11 /gene="glnE" /locus_tag="BQ2027_MB2245C" /product="GLUTAMATE-AMMONIA-LIGASE ADENYLYLTRANSFERASE GLNE (Glutamine-synthetase adenylyltransferase)" /note="Mb2245c, glnE, len: 994 aa. Equivalent to Rv2221c, len: 994 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 994 aa overlap). glnE, glutamate-ammonia-ligase adenylyltransferase (EC 2.7.7.42) (see citations below), similar to others e.g. GLNE_ECOLI|P30870 glutamate-ammonia-ligase adenylyltransferase from Escherichia coli, FASTA score: (24.4% identity in 721 aa overlap); GLNE_HAEIN|P44419 Glutamate-ammonia-ligase adenylyltransferase from Haemophilus influenzae (981 aa), FASTA score: (28.1% identity in 199 aa overlap); etc. Note that initiation codon uncertain. Protein product from Mb2245c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2245c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P69941" /db_xref="InterPro:IPR005190" /db_xref="InterPro:IPR013546" /db_xref="InterPro:IPR023057" /db_xref="UniProtKB/Swiss-Prot:P69941" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00853.1" /translation="MVVTKLATQRPKLPSVGRLGLVDPPAGERLAQLGWDRHEDQAHV DLLWSLSRAPDADAALRALIRLSENPDTGWDELNAALLRERSLRGRLFSVLGSSLALG DHLVAHPQSWKLLRGKVTLPSHDQLQRSFVECVEESEGMPGSLVHRLRTQYRDYVLML AALDLAATVEDEPVLPFTVVAARLADAADAALAAALRVAEASVCGEHPPPRLAVIAMG KCGARELNYVSDVDVIFVAERSDPRNARVASEMMRVASAAFFEVDAALRPEGRNGELV RTLESHIAYYQRWAKTWEFQALLKARPVVGDAELGERYLTALMPMVWRACEREDFVVE VQAMRRRVEQLVPADVRGRELKLGSGGLRDVEFAVQLLQLVHARSDESLRVASTVDAL AALGEGGYIGREDAANMTASYEFLRLLEHRLQLQRLKRTHLLPDPEDEEAVRWLARAA HIRPDGRNDAAGVLREELKKQNVRVSKLHTKLFYQPLLESIGPTGLEIAHGMTLEAAG RRLAALGYEGPQTALKHMSALVNQSGRRGRVQSVLLPRLLDWMSYAPDPDGGLLAYRR LSEALATESWYLATLRDKPAVAKRLMHVLGTSAYVPDLLMRAPRVIQQYEDGPAGPKL LETEPAAVARALIASASRYPDPERAIAGARTLRRRELARIGSADLLGLLEVTEVCRAL TSVWVAVLQAALDVMIRASLPDDDRAPAAIAVIGMGRLGGAELGYGSDADVMFVCEPA TGVDDARAVKWSTSIAERVRALLGTPSVDPPLELDANLRPEGRNGPLVRTLGSYAAYY EQWAQPWEIQALLRAHAVAGDAELGQRFLRMVDKTRYPPDGVSADSVREIRRIKARIE SERLPRGADPNTHTKLGRGGLADIEWTVQLLQLQHAHQVPALHNTSTLQSLDVIAAAD LVPAADVELLRQAWLTATRARNALVLVRGKPTDQLPGPGRQLNAVAVAAGWRNDDGGE FLDNYLRVTRRAKAVVRKVFGS" CDS complement(2475960..2477300) /codon_start=1 /transl_table=11 /gene="glnA2" /locus_tag="BQ2027_MB2246C" /product="PROBABLE GLUTAMINE SYNTHETASE GLNA2 (GLUTAMINE SYNTHASE) (GS-II)" /note="Mb2246c, glnA2, len: 446 aa. Equivalent to Rv2222c, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 446 aa overlap). Probable glnA2, glutamine synthetase class II (EC 6.3.1.2), similar to others. Also similar to three other potential glutamine synthetases in Mycobacterium tuberculosis: Rv2220|glnA1, Rv2860c|glnA4, and Rv1878|glnA3. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY. Protein product from Mb2246c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2246c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64246" /db_xref="InterPro:IPR008146" /db_xref="InterPro:IPR008147" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR027303" /db_xref="InterPro:IPR036651" /db_xref="UniProtKB/Swiss-Prot:P64246" /protein_id="SIU00854.1" /translation="MDRQKEFVLRTLEERDIRFVRLWFTDVLGFLKSVAIAPAELEGA FEEGIGFDGSSIEGFARVSESDTVAHPDPSTFQVLPWATSSGHHHSARMFCDITMPDG SPSWADPRHVLRRQLTKAGELGFSCYVHPEIEFFLLKPGPEDGSVPVPVDNAGYFDQA VHDSALNFRRHAIDALEFMGISVEFSHHEGAPGQQEIDLRFADALSMADNVMTFRYVI KEVALEEGARASFMPKPFGQHPGSAMHTHMSLFEGDVNAFHSADDPLQLSEVGKSFIA GILEHACEISAVTNQWVNSYKRLVQGGEAPTAASWGAANRSALVRVPMYTPHKTSSRR VEVRSPDSACNPYLTFAVLLAAGLRGVEKGYVLGPQAEDNVWDLTPEERRAMGYRELP SSLDSALRAMEASELVAEALGEHVFDFFLRNKRTEWANYRSHVTPYELRTYLSL" CDS complement(2477395..2478957) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2247C" /product="Probable exported protease" /note="Mb2247c, -, len: 520 aa. Equivalent to Rv2223c, len: 520 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 520 aa overlap). Probable exported protease (EC 3.4.-.-); has signal sequence. Very similar to three proteases/peptidases from Streptomyces spp.: L42758, L42759, L27466. FASTA score: L42758|STMSLPD STMSLPD NID: g940302 - Streptomyces (539 aa) opt: 1032 E(): 0, (37.5% identity in 533 aa overlap). Also similar to hypothetical proteins SW:YZZE _ECOLI P34211 (25.4% identity in 406 aa overlap) and PIR:B36944 in ompP 3' region (27.5% identity in 218 aa overlap). Highly similar to Rv2224c and Rv2672 (49.3% identity in 507 aa overlap); contains PS00120 Lipases, serine active site,Mb2247c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65822" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR013595" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P65822" /protein_id="SIU00855.1" /translation="MAAMWRRRPLSSALLSFGLLLGGLPLAAPPLAGATEEPGAGQTP GAPVVAPQQSWNSCREFIADTSEIRTARCATVSVPVDYDQPGGTQAKLAVIRVPATGQ RFGALLVNPGGPGASAVDMVAAMAPAIADTDILRHFDLVGFDPRGVGHSTPALRCRTD AEFDAYRRDPMADYSPAGVTHVEQVYRQLAQDCVDRMGFSFLANIGTASVARDMDMVR QALGDDQINYLGYSYGTELGTAYLERFGTHVRAMVLDGAIDPAVSPIEESISQMAGFQ TAFNDYAADCARSPACPLGTDSAQWVNRYHALVDPLVQKPGKTSDPRGLSYADATTGT INALYSPQRWKYLTSGLLGLQRGSDAGDLLVLADDYDGRDADGHYSNDQDAFNAVRCV DAPTPADPAAWVAADQRIRQVAPFLSYGQFTGSAPRDLCALWPVPATSTPHPAAPAGA GKVVVVSTTHDPATPYQSGVDLARQLGAPLITFDGTQHTAVFDGNQCVDSAVMHYFLD GTLPPTSLRCAP" CDS complement(2479019..2480581) /codon_start=1 /transl_table=11 /gene="caea" /locus_tag="BQ2027_MB2248C" /product="probable carboxylesterase caea" /note="Mb2248c, -, len: 520 aa. Equivalent to Rv2224c, len: 520 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 520 aa overlap). Probable exported protease (EC 3.4.-.-); has signal sequence and lipoprotein motif at N-terminal end. Very similar to three proteases/peptidases from Streptomyces spp.: L42758, L42759, L27466. FASTA score: L4 2758|STMSLPD STMSLPD NID: g940302 - Streptomyces (539 aa) opt: 1032 E(): 0, (37.5% identity in 533 aa overlap). Similar to hypothetical protein SW:YZZE_ECOLI P34211 (27.7% identity in 412 aa overlap) and highly similar to Rv2224c and Rv2672 (49.3% identity in 507 aa overlap); contains PS00013, Prokaryotic membrane lipoprotein lipid attachment site, and PS00120 Lipases, serine active site. Protein product from Mb2248c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2248c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65824" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P65824" /protein_id="SIU00856.1" /translation="MGMRLSRRDKIARMLLIWAALAAVALVLVGCIRVVGGRARMAEP KLGQPVEWTPCRSSNPQVKIPGGALCGKLAVPVDYDRPDGDVAALALIRFPATGDKIG SLVINPGGPGESGIEAALGVFQTLPKRVHERFDLVGFDPRGVASSRPAIWCNSDADND RLRAEPQVDYSREGVAHIENETKQFVGRCVDKMGKNFLAHVGTVNVAKDLDAIRAALG DDKLTYLGYSYGTRIGSAYAEEFPQRVRAMILDGAVDPNADPIEAELRQAKGFQDAFN NYAADCAKNAGCPLGADPAKAVEVYHSLVDPLVDPDNPRISRPARTKDPRGLSYSDAI VGTIMALYSPNLWQHLTDGLSELVDNRGDTLLALADMYMRRDSHGRYNNSGDARVAIN CVDQPPVTDRDKVIDEDRRAREIAPFMSYGKFTGDAPLGTCAFWPVPPTSQPHAVSAP GLVPTVVVSTTHDPATPYKAGVDLANQLRGSLLTFDGTQHTVVFQGDSCIDEYVTAYL IGGTTPPSGAKC" CDS 2481300..2482145 /codon_start=1 /transl_table=11 /gene="panB" /locus_tag="BQ2027_MB2249" /product="3-methyl-2-oxobutanoate hydroxymethyltransferase panb" /note="Mb2249, panB, len: 281 aa. Equivalent to Rv2225, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 281 aa overlap). Probable panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase (EC 2.1.2.11), similar to PANB_ECOLI|P31057 3-methyl-2-oxobutanoate hydroxymethyltransferase from Escherichia coli (45.9% identity in 257 aa overlap). Protein product from Mb2249 detected using SWATH mass spectrometry. Mb2249 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0C2T4" /db_xref="InterPro:IPR003700" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/Swiss-Prot:P0C2T4" /protein_id="SIU00857.1" /translation="MSEQTIYGANTPGGSGPRTKIRTHHLQRWKADGHKWAMLTAYDY STARIFDEAGIPVLLVGDSAANVVYGYDTTVPISIDELIPLVRGVVRGAPHALVVADL PFGSYEAGPTAALAAATRFLKDGGAHAVKLEGGERVAEQIACLTAAGIPVMAHIGFTP QSVNTLGGFRVQGRGDAAEQTIADAIAVAEAGAFAVVMEMVPAELATQITGKLTIPTV GIGAGPNCDGQVLVWQDMAGFSGAKTARFVKRYADVGGELRRAAMQYAQEVAGGVFPA DEHSF" CDS 2482390..2483931 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2250" /product="CHAD domain containing protein" /note="Mb2250, -, len: 513 aa. Equivalent to Rv2226, len: 513 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 513 aa overlap). Conserved hypothetical protein, similar to hypothetical secreted protein (510 aa) from Streptomyces coelicolor A3(2) emb|CAB59601.1| (AL132662) hypothetical secreted protein [Streptomyces coelicolor. Smith-Waterman scores Expect = 5e-44 Identities = 166/506 (32%) Protein product from Mb2250 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2250 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007899" /db_xref="InterPro:IPR023577" /db_xref="InterPro:IPR033469" /db_xref="InterPro:IPR038186" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0J1" /protein_id="SIU00858.1" /translation="MPVEAPRPARHLEVERKFDVIESTVSPSFEDIAAVVRVEQSPTQ QLDAVYFDTPSHDLARNQITLRRRTGGADAGWHLKLPAGPDKRTEMRAPLSASGDAVP AELLDVVLAIVRDQPVQPVARISTHRESQILYGAGGDALAEFCNDDVTAWSAGAFHAA GAADNGPAEQQWREWELELVTTDGTADTKLLDRLANRLLDAGAAPAGHGSKLARVLGA TSPGELPNGPQPPADPVHRAVSEQVEQLLLWDRAVRADAYDAVHQMRVTTRKIRSLLT DSQESFGLKESAWVIDELRELANVLGVARDAEVLGDRYQRELDALAPELVRGRVRERL VDGARRRYQTGLRRSLIALRSQRYFRLLDALDALVSERAHATSGEESAPVTIDAAYRR VRKAAKAAKTAGDQAGDHHRDEALHLIRKRAKRLRYTAAATGADNVSQEAKVIQTLLG DHQDSVVSREHLIQQAIAANTAGEDTFTYGLLYQQEADLAERCREQLEAALRKLDKAV RKARD" gene complement(2484003..2484309) /gene="rnpB" misc_RNA complement(2484003..2484309) /gene="rnpB" /product="ribonuclease P RNA" /note="rnpB, len: 307 nt. Equivalent to rnpB, len: 307 nt,from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 nt overlap). rna component of RNase P." CDS 2484489..2484809 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2251" /product="conserved hypothetical protein [FIRST PART]" /note="Mb2251, -, len: 106 aa. Equivalent to 5' end of Rv2227, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (93.8% identity in 80 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical proteins from various bacteria e.g. gb|AAK22693.1| (AE005746) conserved hypothetical protein from Caulobacter crescentus (234 aa) Smith-Waterman score = 109 bits (429), Expect = 1e-41 Identities = 83/167 (49%). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2227 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 1 bp to 2 bp substitution (t-cc) splits Rv2227 in two parts, Mb2251 and Mb2252." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0L3" /protein_id="SIU00859.1" /translation="MGQTRRLRRLGRHRCRGQRVRWRTATSADHPRRGRPAAQAVRRR RPVSLDGRYGIQAVRRRAVSIFPCPLSRADRASQAGAVSQTAADSAQLVGQTGPGGAL ARQP" CDS 2484817..2485191 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2252" /product="conserved hypothetical protein [SECOND PART]" /note="Mb2252, -, len: 124 aa. Equivalent to 3' end of Rv2227, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 124 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical proteins from various bacteria e.g. gb|AAK22693.1| (AE005746) conserved hypothetical protein from Caulobacter crescentus (234 aa) Smith-Waterman score = 109 bits (429), Expect = 1e-41 Identities = 83/167 (49%). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2227 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 1 bp to 2 bp substitution (t-cc) splits Rv2227 in two parts, Mb2251 and Mb2252." /db_xref="InterPro:IPR018655" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0M4" /protein_id="SIU00860.1" /translation="MASCHAAGQTRSTALMLKYGTNDWNALHQDLYGELVFPLQVVIN LSDPETDYTGGEFLLVEQRPRAQSRGTAMQLPQGHGYVLTTRDRPVRTSRGWSASPVR HGLSTIRSGERYAMGLIFHDAA" CDS complement(2485203..2486297) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2253C" /product="multifunctional protein. has rnase h,alpha-ribazole phosphatase, and acid phosphatase activities." /note="Mb2253c, -, len: 364 aa. Equivalent to Rv2228c, len: 364 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 364 aa overlap). Conserved hypothetical protein. Some similarity to phosphoglycerate mutase and ribonuclease H. Similar to CAB88177.1|AL352972 putative bifunctional protein (ribonuclease H/phosphoglycerate mutase) from Streptomyces coelicolor A3(2) (497 aa); Smith-Waterman scores: 107 bits (424), Expect = 4e-41 Identities = 160/485 (32%). Also similar in C-terminal part to Rv2419c and Rv2135c. Protein product from Mb2253c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2253c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64956" /db_xref="InterPro:IPR002156" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR014636" /db_xref="InterPro:IPR029033" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/Swiss-Prot:P64956" /protein_id="SIU00861.1" /translation="MKVVIEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRAT NNVAEYRGLIAGLDDAVKLGATEAAVLMDSKLVVEQMSGRWKVKHPDLLKLYVQAQAL ASQFRRINYEWVPRARNTYADRLANDAMDAAAQSAAADADPAKIVATESPTSPGWTGA RGTPTRLLLLRHGQTELSEQRRYSGRGNPGLNEVGWRQVGAAAGYLARRGGIAAVVSS PLQRAYDTAVTAARALALDVVVDDDLVETDFGAWEGLTFAEAAERDPELHRRWLQDTS ITPPGGESFDDVLRRVRRGRDRIIVGYEGATVLVVSHVTPIKMLLRLALDAGSGVLYR LHLDLASLSIAEFYADGASSVRLVNQTGYL" CDS complement(2486294..2487031) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2254C" /product="Zn-ribbon protein, possibly nucleic acid-binding" /note="Mb2254c, -, len: 245 aa. Equivalent to Rv2229c, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 245 aa overlap). Conserved hypothetical protein; probable coiled-coil protein similar to conserved hypothetical proteins in Actinomycetes. Equivalent to Mycobacterium leprae ML1638 (232 aa), FASTA scores: opt: 868 E(): 4.4e-43; 60.870% identity in 230 aa overlap emb|CAC30589.1| (AL583922) Protein product from Mb2254c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2254c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003743" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0L9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00862.1" /translation="MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEH NAANDRMAALRIAAEDLDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHE LDSLQRRQASLEDALLEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEID QARHQHSSQRDMLTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQ ISAAAEDEVVRCPECGAILLQLEGFEE" CDS complement(2487028..2488167) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2255C" /product="GTP cyclohydrolase 1 type 2 homolog YbgI" /note="Mb2255c, -, len: 379 aa. Equivalent to Rv2230c, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 379 aa overlap). Conserved hypothetical protein. Equivalent to Mycobacterium leprae, ML1639, conserved hypothetical protein (385 aa). Similar to hypothetical proteins from B. subtilis, P54472, and L. monocytogenes, P53434. FASTA score: ML1639 (MLCB1243.36) opt: 2088, E(): 4e-107; 79.481% identity in 385 aa overlap same as >pir||T44719 hypothetical protein MLCB1243.36 [imported] - Mycobacterium leprae >gi|3150237|emb|CAA19217.1| (AL023635); P54472|YQFO_BACSU HYPOTHETICAL 30. 7 KD PROTEIN IN (279 aa) opt: 604; E(): 2.2e-30; 38.8% identity in 258 aa overlap. P53434|YRP2_LISMO HYPOTHETICAL 41.4 KD PROTEIN (373 aa) opt: 595, E(): 1e-29; 30.7% identity in 326 aa overlap Protein product from Mb2255c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2255c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A657" /db_xref="InterPro:IPR002678" /db_xref="InterPro:IPR015867" /db_xref="InterPro:IPR017221" /db_xref="InterPro:IPR036069" /db_xref="UniProtKB/Swiss-Prot:P0A657" /protein_id="SIU00863.1" /translation="MSVRLADVIDVLDQAYPPRLAQSWDSVGLVCGDPDDVVDSVTVA VDATPAVVDQVPQAGLLLVHHPLLLRGVDTVAANTPKGVLVHRLIRTGRSLFTAHTNA DSASPGVSDALAHAVGLTVDAVLDPVPGAADLDKWVIYVPRENSEAVRAAVFEAGAGH IGDYSHCSWSVAGTGQFLAHDGASPAIGSVGTVERVAEDRVEVVAPARARAEVLAAMR AAHPYEEPAFDIFALVPPPVGSGLGRIGRLPKPEPLRTFVARLEAALPPTATGVRAAG DPDLLVSRVAVCGGAGDSLLATVAAADVQAYVTADLRHHPADEHCRASQVALIDVAHW ASEFPWCGQAAEVLRSHFGASLPVRVCTICTDPWNLDHETGRDQA" CDS complement(2488164..2489258) /codon_start=1 /transl_table=11 /gene="cobC" /locus_tag="BQ2027_MB2256C" /product="Possible aminotransferase CobC" /note="Mb2256c, cobC, len: 364 aa. Equivalent to Rv2231c, len: 364 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 364 aa overlap). Possible cobC, aminotransferase. Note that initiation codon uncertain. Similar to CobC aminotransferases e.g. sp|P21633|COBC_PSEDE COBC PROTEIN (333 aa) opt: 277, E(): 1.7e-11; 28.8% identity in 313 aa overlap and also to e.g. SW:HIS8_ECOLI P06986 histidinol-phosphate aminotransferase (27.0% identity in 289 aa overlap), contains PS00105 aminotransferases class-I pyridoxal-phosphate attachment site. Real Mycobacterium tuberculosis histidinol-phosphate aminotransferase, hisC, is Rv1600 (MTCY336.04c). Protein product from Mb2256c detected using SWATH mass spectrometry. Mb2256c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63501" /db_xref="InterPro:IPR004838" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P63501" /protein_id="SIU00864.1" /translation="MLWILGPHTGPLLFDAVASLDTSPLAAARYHGDQDVAPGVLDFA VNVRHDRPPEWLVRQLAALLPELARYPSTDDVHRAQDAVAERHGRTRDEVLPLVGAAE GFALLHNLSPVRAAIVVPAFTEPAIALSAAGITAHHVVLKPPFVLDTAHVPDDADLVV VGNPTNPTSVLHLREQLLELRRPGRILVVDEAFADWVPGEPQSLADDSLPDVLVLRSL TKTWSLAGLRVGYALGSPDVLARLTVQRAHWPLGTLQLTAIAACCAPRAVAAAAADAV RLTALRAEMVAGLRSVGAEVVDGAAPFVLFNIADADGLRNYLQSKGIAVRRGDTFVGL DARYLRAAVRPEWPVLVAAIAEWAKRGGRR" CDS complement(2489295..2489720) /codon_start=1 /transl_table=11 /gene="vapC16" /locus_tag="BQ2027_MB2256A" /product="Possible toxin VapC16" /note="Mb2256A, len: 141 aa. Equivalent to Rv2231A len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapC16, toxin, part of toxin-antitoxin (TA) operon with Rv2231B (See Pandey and Gerdes, 2005). Nucleotide position 2505919 in the genome sequence has been corrected, A:G resulting in A81A." /db_xref="GOA:A0A1R3Y1C4" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR041705" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C4" /protein_id="SIU00865.1" /translation="MTMACTACPTIWTLRCQTTCSNAFTGEALPHRHPRLAADAVNET RAIVQDVRNSILLSAASAWEIAINYRLGKLPPPEPSASYVPDRMRRCGTSPLSVDHAH TAHRRASGSPSTSIRPCAHRPGTAAWPDDHHRRRPVSCL" CDS complement(2489766..2489942) /codon_start=1 /transl_table=11 /gene="vapB16" /locus_tag="BQ2027_MB2256B" /product="Possible antitoxin VapB16" /note="Mb2256B, len: 58 aa. Equivalent to Rv2231B len: 58 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 58 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB16, antitoxin,part of toxin-antitoxin (TA) operon with Rv2231A (See Pandey and Gerdes, 2005). Mb2256B found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0S0" /protein_id="SIU00866.1" /translation="MALWYQAMIAKFGEQVVDAKVWAPAKRVGVHEAKTRLSELLRLV YGGQRLRLPAAASR" CDS 2489837..2490712 /codon_start=1 /transl_table=11 /gene="ptka" /locus_tag="BQ2027_MB2257" /product="protein tyrosine kinase transcriptional regulatory protein ptka" /note="Mb2257, -, len: 291 aa. Equivalent to Rv2232, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Conserved hypothetical protein, similar to members of haloacid dehalogenase-like family from several bacteria and to putative phosphatases e.g. Q9I767 and AAK78398. Contains N-terminal extension. FASTA scores: Q9I767 HYPOTHETICAL PROTEIN PA0065 (221 aa) opt: 439 E(): 3.2e-18; 38.679% identity (40.196% ungapped) in 212 aa overlap; >>tr|AAK78398 Predicted phosphatase, HAD family (216 aa) opt: 427, E(): 1.5e-17; 34.762% identity (35.437% ungapped) in 210 aa overlap. Replaces previous Rv2232 and Rv2233. Mb2257 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023198" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="InterPro:IPR041492" /db_xref="UniProtKB/Swiss-Prot:P68910" /protein_id="SIU00867.1" /translation="MSSPRERRPASQAPRLSRRPPAHQTSRSSPDTTAPTGSGLSNRF VNDNGIVTDTTASGTNCPPPPRAAARRASSPGESPQLVIFDLDGTLTDSARGIVSSFR HALNHIGAPVPEGDLATHIVGPPMHETLRAMGLGESAEEAIVAYRADYSARGWAMNSL FDGIGPLLADLRTAGVRLAVATSKAEPTARRILRHFGIEQHFEVIAGASTDGSRGSKV DVLAHALAQLRPLPERLVMVGDRSHDVDGAAAHGIDTVVVGWGYGRADFIDKTSTTVV THAATIDELREALGV" CDS 2490705..2491196 /codon_start=1 /transl_table=11 /gene="ptpA" /locus_tag="BQ2027_MB2258" /standard_name="MPtpA" /product="PHOSPHOTYROSINE PROTEIN PHOSPHATASE PTPA (PROTEIN-TYROSINE-PHOSPHATASE) (PTPase) (LMW PHOSPHATASE)" /note="Mb2258, ptpA, len: 163 aa. Equivalent to Rv2234, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). ptpA (alternate gene name: MPtpA), low molecular weight protein-tyrosine-phosphatase (see citations below) (EC 3.1.3.48), similar to other phosphotyrosine protein phosphatases e.g. P53433|PTPA_STRCO LOW MOLECULAR WEIGHT PROTEIN-TYROSINE PHOSPHATASE from Streptomyces coelicolor (164 aa), FASTA scores: opt: 455, E(): 3.3e -25, (49.7% identity in 155 aa overlap); PA1S_HUMAN|P24667 red cell acid phosphatase 1, FASTA score: (37.7% identity in 138 aa overlap); etc. Contains a phosphatase catalytic site domain located in N-terminal part. Activity proven biochemically. Supposed a secreted protein. Protein product from Mb2258 detected using SWATH mass spectrometry. Mb2258 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65717" /db_xref="InterPro:IPR017867" /db_xref="InterPro:IPR023485" /db_xref="InterPro:IPR036196" /db_xref="UniProtKB/Swiss-Prot:P65717" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00868.1" /translation="MSDPLHVTFVCTGNICRSPMAEKMFAQQLRHRGLGDAVRVTSAG TGNWHVGSCADERAAGVLRAHGYPTDHRAAQVGTEHLAADLLVALDRNHARLLRQLGV EAARVRMLRSFDPRSGTHALDVEDPYYGDHSDFEEVFAVIESALPGLHDWVDERLARN GPS" CDS 2491196..2492011 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2259" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2259, -, len: 271 aa. Equivalent to Rv2235, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 271 aa overlap). Probable conserved transmembrane protein (see first citation below); hydrophobic regions near N- and C-terminus. Similar to conserved membrane proteins in other Actinomycetes. Equivalent to Mycobacterium leprae. ML1644 (270 aa). FASTA scores: opt: 1357, E(): 1.2e-72; 74.170% identity in 271 aa overlap T44717|3150235|CAA19213.1|AL023635 13093419|CAC30595.1|AL583922. Protein product from Mb2259 detected using SWATH mass spectrometry. Mb2259 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66884" /db_xref="InterPro:IPR002994" /db_xref="UniProtKB/Swiss-Prot:P66884" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00869.1" /translation="MPRLAFLLRPGWLALALVVVAFTYLCFTVLAPWQLGKNAKTSRE NQQIRYSLDTPPVPLKTLLPQQDSSAPDAQWRRVTATGQYLPDVQVLARLRVVEGDQA FEVLAPFVVDGGPTVLVDRGYVRPQVGSHVPPIPRLPVQTVTITARLRDSEPSVAGKD PFVRDGFQQVYSINTGQVAALTGVQLAGSYLQLIEDQPGGLGVLGVPHLDPGPFLSYG IQWISFGILAPIGLGYFAYAEIRARRREKAGSPPPDKPMTVEQKLADRYGRRR" CDS complement(2491993..2492934) /codon_start=1 /transl_table=11 /gene="cobD" /locus_tag="BQ2027_MB2260C" /product="probable cobalamin biosynthesis transmembrane protein cobd" /note="Mb2260c, cobD, len: 313 aa. Equivalent to Rv2236c, len: 313 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 313 aa overlap). Probable cobD, conserved membrane protein, similar to PIR:S52223 Rhodobacter capsulatus 945 protein BluD (39.0% identity in 287 aa overlap) involved in cobinamide synthesis, and to SW:COBD_PSEDE Pseudomonas dentrificans cobD protein (37.5% identity in 269 aa overlap), also SW:CBIB_SALTY Salmonella typhimurum cbiB protein (35.5% identity in 304 aa overlap)" /db_xref="GOA:Q7VEN1" /db_xref="InterPro:IPR004485" /db_xref="UniProtKB/Swiss-Prot:Q7VEN1" /protein_id="SIU00870.1" /translation="MFASIWQTRAVGVLIGCLLDVVFGDPKRGHPVALFGRAAAKLEQ ITYRDGRVAGAVHVGLLVGAVGLLGAALQRLPGRCWPVAATATATWAALGGTSLARTG RQISDLLERDDVEAARRLLPSLCGRDPAQLGGPGLTRAALESVAENTADAQVVPLLWA ASSGVPAVLGYRAINTLDSMIGYRSPRYLRFGWAAARLDDWANYVGARATAVLVVICA PVVGGSPRGAVRAWRRDAARHPSPNAGVVEAAFAGALDVRLGGPTRYHHELQIRPTLG DGRSPKVADLRRAVVLSRVVQAGAAVLAVMLVYRRRP" CDS 2493048..2493815 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2261" /product="conserved protein" /note="Mb2261, -, len: 255 aa. Equivalent to Rv2237, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis hypothetical proteins Rv0276, Rv0826, Rv1645c. FASTA score: Rv0276 gp|AL021930|MTV035_4 (306 aa) opt: 874, E(): 0; 49.6% identity in 282 aa overlap Protein product from Mb2261 detected using SWATH mass spectrometry. Mb2261 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64958" /db_xref="InterPro:IPR018713" /db_xref="UniProtKB/Swiss-Prot:P64958" /protein_id="SIU00871.1" /translation="MLLPAANVIMQLAVPGVGYGVLESPVDSGNVYKHPFKRARTTGT YLAVATIGTESDRALIRGAVDVAHRQVRSTASSPVSYNAFDPKLQLWVAACLYRYFVD QHEFLYGPLEDATADAVYQDAKRLGTTLQVPEGMWPPDRVAFDEYWKRSLDGLQIDAP VREHLRGVASVAFLPWPLRAVAGPFNLFATTGFLAPEFRAMMQLEWSQAQQRRFEWLL SVLRLADRLIPHRAWIFVYQLYLWDMRFRARHGRRIV" CDS complement(2493910..2494146) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2261A" /product="Conserved protein" /note="Mb2261A, len: 78 aa. Equivalent to Rv2237A len: 78 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 78 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Protein product from Mb2261A detected using SWATH mass spectrometry. Mb2261A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0N1" /protein_id="SIU00872.1" /translation="MLRCRRGAGYGSVVVVGERPGFQSDSAARQTAPPVRPMTSDQLP ATKADLYAAVDAMRADMRELLEQISTLIREATQK" tRNA complement(2494157..2494228) /locus_tag="BQ2027_VALV" /product="tRNA-Val" /note="valV, len: 72 nt. Equivalent to valV, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Val, anticodon tac." CDS complement(2494274..2494735) /codon_start=1 /transl_table=11 /gene="ahpE" /locus_tag="BQ2027_MB2262C" /product="probable peroxiredoxin ahpe" /note="Mb2262c, ahpE, len: 153 aa. Equivalent to Rv2238c, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 153 aa overlap). ahpE, peroxiredoxin. Similarity to many members of AHPC/TSA family e.g. sp|Q96291|BAS1_ARATH 2-CYS PEROXIREDOXIN BAS1 PRECURSOR (265 aa). FASTA score: opt: 275, E(): 2.7e-12; 35.0% identity in 143 aa overlap Protein product from Mb2262c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2262c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65689" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR024706" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:P65689" /protein_id="SIU00873.1" /translation="MLNVGATAPDFTLRDQNQQLVTLRGYRGAKNVLLVFFPLAFTGI CQGELDQLRDHLPEFENDDSAALAISVGPPPTHKIWATQSGFTFPLLSDFWPHGAVSQ AYGVFNEQAGIANRGTFVVDRSGIIRFAEMKQPGEVRDQRLWTDALAALTA" CDS complement(2494735..2495211) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2263C" /product="cell redox homeostasis" /note="Mb2263c, -, len: 158 aa. Equivalent to Rv2239c, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Conserved hypothetical protein, similar to conserved hypothetical proteins from Mycobacterium leprae (ML1649, 140 aa) and Streptomyces coelicolor A3(2) (SCC8A.28c, 159 aa). Equivalent to ML1649 conserved hypothetical protein (140 aa). FASTA scores: ML1649 conserved hypothetical protein (140 aa) opt: 846, E(): 6.5e-45; 86.429% identity in 140 aa overlap (tr|O69479|O69479 HYPOTHETICAL 15.2 KDA PROTEIN (140 aa); and opt: 447, E(): 1.2e-21; 50.355% identity (51.825% ungapped) in 141 aa overlap. Similarity with ML1649 suggests alternative start at 251198. Protein product from Mb2263c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2263c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR021412" /db_xref="UniProtKB/Swiss-Prot:P64960" /protein_id="SIU00874.1" /translation="MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWG WDEDTDDDIRAAIEEACGGELLDEDTDEVIDVVLLWWRDGDGDLVDTLMDAIGPLAED GVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQPKSRAGKR" CDS complement(2495249..2495839) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2264C" /product="unknown protein" /note="Mb2264c, len: 196 aa. Equivalent to Rv2240c len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 196 aa overlap). Unknown protein. Start changed since first submission (-69 aa). Protein product from Mb2264c detected using SWATH mass spectrometry. Mb2264c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1D4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D4" /protein_id="SIU00875.1" /translation="MLIGWRAVPRRHGGELPRRGALALGCIALLLMGIVGCTTVTDGT AMPDTNVAPAYRSSVSASVSASAATSSIRESQRQQSLTTKAIRTSCDALAATSKDAID KVNAYVAAFNQGRNTGPTEGPAIDALNNSASTVSGSLSAALSAQLGDALNAYVDAARA VANAIGAHASTAEFNRRVDRLNDTKTKALKMCVAAF" CDS 2496098..2498803 /codon_start=1 /transl_table=11 /gene="aceE" /locus_tag="BQ2027_MB2265" /product="pyruvate dehydrogenase e1 component acee (pyruvate decarboxylase) (pyruvate dehydrogenase) (pyruvic dehydrogenase)" /note="Mb2265, aceE, len: 901 aa. Equivalent to Rv2241, len: 901 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 901 aa overlap). Probable aceE, pyruvate dehydrogenase E1 component (EC 1.2.4.1), similar to others e.g. ODP1_ECOLI|P06958 pyruvate dehydrogenase E1 component from Escherichia coli, FASTA score: (51.2% identity in 891 aa overlap); etc. Protein product from Mb2265 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2265 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0S8" /db_xref="InterPro:IPR004660" /db_xref="InterPro:IPR005474" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR035807" /db_xref="InterPro:IPR041621" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0S8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00876.1" /translation="MASYLPDIDPEETSEWLESFDTLLQRCGPSRARYLMLRLLERAG EQRVAIPALTSTDYVNTIPTELEPWFPGDEDVERRYRAWIRWNAAIMVHRAQRPGVGV GGHISTYASSAALYEVGFNHFFRGKSHPGGGDQVFIQGHASPGIYARAFLEGRLTAEQ LDGFRQEHSHVGGGLPSYPHPRLMPDFWEFPTVSMGLGPLNAIYQARFNHYLHDRGIK DTSDQHVWCFLGDGEMDEPESRGLAHVGALEGLDNLTFVINCNLQRLDGPVRGNGKII QELESFFRGAGWNVIKVVWGREWDALLHADRDRALVNLMNTTPDGDYQTYKANDGGYV RDHFFGRDPRTKALVENMSDQDIWNLKRGGHDYRKVYAAYRAAVDHKGQPTVILAKTI KGYALGKHFEGRNATHQMKKLTLEDLKEFRDTQRIPVSDAQLEENPYLPPYYHPGLNA PEIRYMLDRRRALGGFVPERRTKSKALTLPGRDIYAPLKKGSGHQEVATTMATVRTFK EVLRDKQIGPRIVPIIPDEARTFGMDSWFPSLKIYNRNGQLYTAVDADLMLAYKESEV GQILHEGINEAGSVGSFIAAGTSYATHNEPMIPIYIFYSMFGFQRTGDSFWAAADQMA RGFVLGATAGRTTLTGEGLQHADGHSLLLAATNPAVVAYDPAFAYEIAYIVESGLARM CGENPENIFFYITVYNEPYVQPPEPENFDPEGVLRGIYRYHAATEQRTNKAQILASGV AMPAALRAAQMLAAEWDVAADVWSVTSWGELNRDGVAIETEKLRHPDRPAGVPYVTRA LENARGPVIAVSDWMRAVPEQIRPWVPGTYLTLGTDGFGFSDTRPAARRYFNTDAESQ VVAVLEALAGDGEIDPSVPVAAARQYRIDDVAAAPEQTTDPGPGA" CDS 2498863..2500107 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2266" /product="Transcriptional regulator, CdaR-family" /note="Mb2266, -, len: 414 aa. Equivalent to Rv2242, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 414 aa overlap). Conserved hypothetical protein. Equivalent to ML1652 conserved hypothetical protein from Mycobacterium leprae (414 aa), and ortholog in Streptomyces coelicolor A3(2). FASTA scores: ML1652 opt: 2369, E(): 4.2e-128; 88.406% identity in 414 aa overlap (AL023635)(AL583922). some similarity at 3' end with S25203 srmR protein - Streptomyces ambofaciens (604 aa) opt: 188 E(): 9e-05; (26.4% identity in 277 aa overlap) and with SW:YAEG_HAEIN P44509 hypothetical protein HI0093 (42.3% identity in 52 aa overlap). Contains possible helix-turn-helix motif at aa 360-381 (+3.52 SD) Protein product from Mb2266 detected using SWATH mass spectrometry. Mb2266 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR041522" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/Swiss-Prot:P63750" /protein_id="SIU00877.1" /translation="MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSA MQERLPFFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLTRR IALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFTAATAYADAAE ARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAPATVLVGTPAPGPNGSNSD GDSERASQDVRDTAARHGRAALTDVHGTWLVAIVSGQLSPTEKFLKDLLAAFADAPVV IGPTAPMLTAAHRSASEAISGMNAVAGWRGAPRPVLARELLPERALMGDASAIVALHT DVMRPLADAGPTLIETLDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQ PRDAYVLRVAATVGQLNYPTPH" CDS 2500346..2501254 /codon_start=1 /transl_table=11 /gene="fabD" /locus_tag="BQ2027_MB2267" /standard_name="mtFabD" /product="MALONYL CoA-ACYL CARRIER PROTEIN TRANSACYLASE FABD (Malonyl CoA:AcpM acyltransferase) (MCT)" /note="Mb2267, fabD, len: 302 aa. Equivalent to Rv2243, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). fabD (alternate gene name: mtFabD), malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39) (see citations below), highly similar to e.g. A57356 acyl-CoA carrier protein malonyltransferase from Streptomyces coelicolor (316 aa), FASTA score: opt: 955, E(): 0, (52.6% identity in 304 aa overlap); FABD_HAEIN|P43712 malonyl CoA-acyl carrier protein transacylase from Haemophilus influenzae, FASTA score: (30.5% identity in 308 aa overlap); and FABD_ECOLI|P25715 from Escherichia coli, FASTA score: (31.4% identity in 309 aa overlap). Protein product from Mb2267 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2267 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63459" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR020801" /db_xref="UniProtKB/Swiss-Prot:P63459" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00878.1" /translation="MIALLAPGQGSQTEGMLSPWLQLPGAADQIAAWSKAADLDLARL GTTASTEEITDTAVAQPLIVAATLLAHQELARRCVLAGKDVIVAGHSVGEIAAYAIAG VIAADDAVALAATRGAEMAKACATEPTGMSAVLGGDETEVLSRLEQLDLVPANRNAAG QIVAAGRLTALEKLAEDPPAKARVRALGVAGAFHTEFMAPALDGFAAAAANIATADPT ATLLSNRDGKPVTSAAAAMDTLVSQLTQPVRWDLCTATLREHTVTAIVEFPPAGTLSG IAKRELRGVPARAVKSPADLDELANL" CDS 2501330..2501677 /codon_start=1 /transl_table=11 /gene="acpM" /locus_tag="BQ2027_MB2268" /product="MEROMYCOLATE EXTENSION ACYL CARRIER PROTEIN ACPM" /note="Mb2268, acpM, len: 115 aa. Equivalent to Rv2244, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). acpM, acyl carrier protein, meromycolate precursor transport, involved in meromycolate extension (see citations below). Highly similar to others e.g. L43074|STMFABD2|STMFABD|g870805 acyl carrier protein from Streptomyces glaucescens (82 aa), FASTA scores: opt: 298, E(): 8.4e-13, (56.6% identity in 76 aa overlap); and ACP_ECOLI|P02901 acyl carrier protein from Escherichia coli, FASTA score: (37.3% identity in 67 aa overlap); etc. Protein product from Mb2268 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2268 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4W7" /db_xref="InterPro:IPR003231" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/Swiss-Prot:P0A4W7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00879.1" /translation="MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSM VEIAVQTEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENPDA VANVQARLEAESK" CDS 2501674..2502924 /codon_start=1 /transl_table=11 /gene="kasA" /locus_tag="BQ2027_MB2269" /product="3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 1 KASA (BETA-KETOACYL-ACP SYNTHASE) (KAS I)" /note="Mb2269, kasA, len: 416 aa. Equivalent to Rv2245, len: 416 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 416 aa overlap). kasA, beta-ketoacyl-ACP synthase (EC 2.3.1.41), involved in meromycolate extension (see citations below): belongs to the FAS-II system, which utilizes primarily palmitoyl-ACP rather than short-chain acyl-ACP primers. Highly similar to others e.g. L43074|STMFABD3|g870805 beta-ketoacyl-ACP synthase from Streptomyces glaucescens (423 aa), FASTA scores: opt: 1105, E(): 0, (44.6% identity in 417 aa overlap); FABF_ECOLI|P39435 3-oxoacyl-[acyl-carrier-protein] synthase II from Escherichia coli, FASTA score: (39.4% identity in 254 aa overlap); FABB_HORVU|P23902 3-oxoacyl-[acyl-carrier-protein] synthase I, FASTA score: (33.4% identity in 413 aa overlap); etc. Strongest similarity to downstream ORF kasB|Rv2246|MTCY427.27 3-oxoacyl-[acyl-carrier-protein] synthase 2 from Mycobacterium tuberculosis (438 aa), FASTA score: (66.3% identity in 409 aa overlap). BELONGS TO THE BETA-KETOACYL-ACP SYNTHASES FAMILY. Protein product from Mb2269 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2269 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63455" /db_xref="InterPro:IPR000794" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020841" /db_xref="UniProtKB/Swiss-Prot:P63455" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00880.1" /translation="MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIH ALEDEFVTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGSPE VDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGAAAVIGLQLGA RAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEGPIEALPIAAFSMMRAMST RNDEPERASRPFDKDRDGFVFGEAGALMLIETEEHAKARGAKPLARLLGAGITSDAFH MVAPAADGVRAGRAMTRSLELAGLSPADIDHVNAHGTATPIGDAAEANAIRVAGCDQA AVYAPKSALGHSIGAVGALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYG DYRYAVNNSFGFGGHNVALAFGRY" CDS 2502955..2504271 /codon_start=1 /transl_table=11 /gene="kasB" /locus_tag="BQ2027_MB2270" /product="3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 2 KASB (BETA-KETOACYL-ACP SYNTHASE) (KAS I)" /note="Mb2270, kasB, len: 438 aa. Equivalent to Rv2246, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 438 aa overlap). kasB, beta-ketoacyl-ACP synthase (EC 2.3.1.41), involved in meromycolate extension (see citations below). Highly similar or similar to others e.g. L43074|STMFABD3|g870805 beta-ketoacyl-ACP synthase from Streptomyces glaucescens (423 aa), FASTA scores: opt: 1091, E(): 0, (44.7% identity in 416 aa overlap); FABF_ECOLI|P39435 3-oxoacyl-[acyl-carrier-protein] synthase II from Escherichia coli, FASTA score: (37.0% identity in 411 aa overlap); FABB_HORVU|P23902 3-oxoacyl-[acyl-carrier-protein] synthase I, FASTA score: (32.5% identity in 415 aa overlap); etc. Strongest similarity to upstream ORF Rv2245|kasA|MTCY427.26 3-oxoacyl-[acyl-carrier-protein] synthase 1 from Mycobacterium tuberculosis (416 aa), FASTA score: (66.3% identity in 409 aa overlap). BELONGS TO THE BETA-KETOACYL-ACP SYNTHASES FAMILY. Protein product from Mb2270 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2270 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63457" /db_xref="InterPro:IPR000794" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020841" /db_xref="UniProtKB/Swiss-Prot:P63457" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00881.1" /translation="MGVPPLAGASRTDMEGTFARPMTELVTGKAFPYVVVTGIAMTTA LATDAETTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDHQLTRIELRRMGY LQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELVFSYDDMRARGMKAVSPL TVQKYMPNGAAAAVGLERHAKAGVMTPVSACASGAEAIARAWQQIVLGEADAAICGGV ETRIEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFGEGGALLLIETEEHAK ARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGLAPGDIDHVNAHAT GTQVGDLAEGRAINNALGGNRPAVYAPKSALGHSVGAVGAVESILTVLALRDQVIPPT LNLVNLDPEIDLDVVAGEPRPGNYRYAINNSFGFGGHNVAIAFGRY" CDS 2504302..2505723 /codon_start=1 /transl_table=11 /gene="accD6" /locus_tag="BQ2027_MB2271" /product="ACETYL/PROPIONYL-CoA CARBOXYLASE (BETA SUBUNIT) ACCD6" /note="Mb2271, accD6, len: 473 aa. Equivalent to Rv2247, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 473 aa overlap). accD6, Acetyl/Propionyl CoA Carboxylase, beta subunit (EC 6.4.1.3) (see citations below), highly similar to e.g. PCCB_RHOSO|Q06101 propionyl-CoA carboxylase beta chain, FASTA score: (75.1% identity in 437 aa overlap). Similar to many other Acetyl/Propionyl CoA Carboxylases from Mycobacterium tuberculosis. BELONGS TO THE ACCD / PCCB FAMILY. Protein product from Mb2271 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2271 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63408" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/Swiss-Prot:P63408" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00882.1" /translation="MTIMAPEAVGESLDPRDPLLRLSNFFDDGSVELLHERDRSGVLA AAGTVNGVRTIAFCTDGTVMGGAMGVEGCTHIVNAYDTAIEDQSPIVGIWHSGGARLA EGVRALHAVGQVFEAMIRASGYIPQISVVVGFAAGGAAYGPALTDVVVMAPESRVFVT GPDVVRSVTGEDVDMASLGGPETHHKKSGVCHIVADDELDAYDRGRRLVGLFCQQGHF DRSKAEAGDTDIHALLPESSRRAYDVRPIVTAILDADTPFDEFQANWAPSMVVGLGRL SGRTVGVLANNPLRLGGCLNSESAEKAARFVRLCDAFGIPLVVVVDVPGYLPGVDQEW GGVVRRGAKLLHAFGECTVPRVTLVTRKTYGGAYIAMNSRSLNATKVFAWPDAEVAVM GAKAAVGILHKKKLAAAPEHEREALHDQLAAEHERIAGGVDSALDIGVVDEKIDPAHT RSKLTEALAQAPARRGRHKNIPL" CDS 2505831..2506646 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2272" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2272, -, len: 271 aa. Equivalent to Rv2248, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 271 aa overlap). Conserved hypothetical protein. Very similar to hypothetical Mycobacterium tuberculosis proteins Rv3517, Rv1482c, Rv3555c, Rv3714c, Rv1073. FASTA score: MTCY06G11.02c MTCY6G11 NID: g1877284 -(289 aa) opt: 366 E(): 5.3e-18; (32.1% identity in 249 aa overlap). Some similarity to M. avium protein AF002133|AF0021 339 AF002133 NID: g2183254 (346 aa) opt: 308 E(): 5.2e-14; (28.3% identity in 254 aa overlap),Mb2272 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011335" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0M0" /protein_id="SIU00883.1" /translation="MTRQQLDVQVKNGGLVRVWYGVYAAQEPDLLGRLAALDVFMGGH AVACLGTAAALYGFDTENTVAIHMLDPGVRMRPTVGLMVHQRVGARLQRVSGRLATAP AWTAVEVARQLRRPRALATLEAALRSMRCARSEIENAVAEQRGRRGIVAARELLPFAD GRAESAMESEARLVMIDHGLPLPELQYPIHGHGGEMWRVDFAWPDMRLAAEYESIEWH AGPAEMLRDKTRWAKLQELGWTIVPIVVDDVRREPGRLAARIARHLDRARMAG" CDS complement(2506715..2508265) /codon_start=1 /transl_table=11 /gene="glpD1" /locus_tag="BQ2027_MB2273C" /product="PROBABLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE GLPD1" /note="Mb2273c, glpD1, len: 516 aa. Equivalent to Rv2249c, len: 516 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 516 aa overlap). Probable glpD1, glycerol-3-phosphate dehydrogenase, similar to SW:GLPD_ECOLI P13035 aerobic glycerol-3-phosphate dehydrogenase (30.0% identity in 486 aa overlap) and SW:GLPA_ECOLI P13032 anaerobic glycerol-3-phosphate dehydrogenase (28.2% identity in 504 aa overlap). Also similar to Rv3302c|glpD2 glycerol-3-phosphate dehydrogenase. COFACTOR: FAD (BY SIMILARITY). BELONGS TO THE FAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb2273c detected using SWATH mass spectrometry." /db_xref="GOA:P64183" /db_xref="InterPro:IPR000447" /db_xref="InterPro:IPR006076" /db_xref="InterPro:IPR031656" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR038299" /db_xref="UniProtKB/Swiss-Prot:P64183" /protein_id="SIU00884.1" /translation="MLMPHSAALNAARRSADLTALADGGALDVIVIGGGITGVGIALD AATRGLTVALVEKHDLAFGTSRWSSKLVHGGLRYLASGNVGIARRSAVERGILMTRNA PHLVHAMPQLVPLLPSMGHTKRALVRAGFLAGDALRVLAGTPAATLPRSRRIPASRVV EIAPTVRRDGLDGGLLAYDGQLIDDARLVMAVARTAAQHGARILTYVGASNVTGTSVE LTDRRTRQSFALSARAVINAAGVWAGEIDPSLRLRPSRGTHLVFDAKSFANPTAALTI PIPGELNRFVFAMPEQLGRIYLGLTDEDAPGPIPDVPQPSSEEITFLLDTVNTALGTA VGTKDVIGAYAGLRPLIDTGGAGVQGRTADVSRDHAVFESPSGVISVVGGKLTEYRYM AEDVLNRAITLRHLRAAKCRTRNLPLIGAPANPGPAPGSGAGLPESLVARYGAEAANV AAAATCERPTEPVADGIDVTRAEFEYAVTHEGALDVDDILDRRTRIGLVPRDRERVVA VAKEFLSR" CDS complement(2508259..2508828) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2274C" /product="Possible transcriptional regulatory protein" /note="Mb2274c, -, len: 189 aa. Equivalent to Rv2250c, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 189 aa overlap). Possible transcriptional regulatory protein, TetR family. Start unclear; ORF has been shortened since first submission to avoid overlap with Rv2251 (-30 aa). Contains probable helix-turn-helix motif (Score 2243, +6.70 SD) Protein product from Mb2274c detected using SWATH mass spectrometry." /db_xref="GOA:Q7VEN0" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/Swiss-Prot:Q7VEN0" /protein_id="SIU00885.1" /translation="MLSMSNDRADTGGRILRAAASCVVDYGVDRVTLAEIARRAGVSR PTVYRRWPDTRSIMASMLTSHIAAVLREVPLDGDDREALVKQIVAVADRLRGDDLIMS VMHSELARVYITERLGTSQQVLIEGLAARLTVAQRSGSVRSGDARRLATMVLLIAQST IQSADIVDSILDSAALATELTHALNGYLC" CDS 2508876..2510465 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2275" /product="POSSIBLE FLAVOPROTEIN" /note="Mb2275, -, len: 529 aa. Similar to 5' end of Rv2250A and 3' end of Rv2251, len: 139 aa and 475 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap and 98.2% identity in 433 aa overlap). Rv2250A: Conserved hypothetical protein, possibly flavoprotein. Similar to N-terminus of SCF91.28c|AL132973_28 possible flavoprotein from Streptomyces coelicolor (530 aa), FASTA scores: opt: 240, E(): 1.1e-07, (39.25% identity in 107 aa overlap). Possible frameshift between nt 2525723 to 2525727. The sequences of CDC 1551 and Mycobacterium bovis are missing a single G base. Rv2251: Possible flavoprotein, probably continuation of Rv2250A, similar to MTCY164.18 from Mycobacterium tuberculosis and to several ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASES (e.g. O00116). Also some similarity to D-lactate dehydrogenases. FASTA scores: sptr|O05784|O05784 HYPOTHETICAL 56.5 KD PROTEIN. (527 aa) opt: 1019 E(): 0; (38.6% identity in 487 aa overlap) and sp|O00116|ADAS_HUMAN ALKYLDIHYDROXYACETON EPHOSPHATE SYNTHASE PRECURSOR (EC 2.5.1.26) (658 aa) opt: 558 E(): 6.2e-27; (31.3% identity in 447 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, Rv2250A and Rv2251 exist as 2 genes with an overlap region between them. In Mycobacterium bovis, a single base deletion (g-*) leads to a single product. Protein product from Mb2275 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2275 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0U3" /db_xref="InterPro:IPR004113" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016164" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016171" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U3" /protein_id="SIU00886.1" /translation="MKWDAWGDPAAAKPLSDGVRSLLKQVVGLADSEQPELDPAQVQL RPSALSGADHDALARIVGTEYFRTADRDRLLHAGGKSTPDLLRRKDTGVQDAPDAVLL PGGPNGEDAVADILHYCSDHGIAVVPFGGGTSVVGGLDPVRNDFRAVISLDMRRFDRL HRIDEVSGEAELEAGVTGPEAERLLGEHGFSLGHFPQSFEFATIGGFAATRSSGQDSA GYGRFNDMILGLRMITPVGVLDLGRVPASAAGPDLRQLAIGSEGVFGVITRVRLRVHR IPESTRYEAWSFPDFATGVAALRTITQTGTGPTVVRLSDEAETGVNLATTEAIGETQI TGGCLGITVFEGTQEHTESRHAETRALLAARGGTSLGEGPARAWERGRFAAPYLRDSL LAAGALCETLETATVWSNTPVLKAAVTEALTTSLAASGTPALVMCHVSHVYPTGASLY FTVVAGQRGDPIEQWLAAKKAASDAIMATGGTITHHHAVGSDHRPWMRAEVGDLGVTL LRTIKATLDPAGILNPGKLIP" CDS 2510462..2511391 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2276" /product="diacylglycerol kinase" /note="Mb2276, -, len: 309 aa. Equivalent to Rv2252, len: 309 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 309 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Bacillus subtilis (e.g. BSUB0004_120), Streptomyces coelicolor A3(2) >emb|CAB61184.1| (AL132973) hypothetical protein SCF91.27c (293 aa) and P39074. FASTA scores: Z99107|BSUB0004_120 Bacillus subtilis complete genome (303 aa) opt: 397, E(): 1.7e-19; (26.4% identity in 299 aa overlap) and P390 74|BMRU_BACSU BMRU PROTEIN (297 aa) opt: 309, E(): 1.3e-13; (25.0% identity in 284 aa overlap). Protein product from Mb2276 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2276 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y113" /db_xref="InterPro:IPR001206" /db_xref="InterPro:IPR005218" /db_xref="InterPro:IPR016064" /db_xref="InterPro:IPR017438" /db_xref="UniProtKB/TrEMBL:A0A1R3Y113" /protein_id="SIU00887.1" /translation="MSAGQLRRHEIGKVTALTNPLSGHGAAVKAAHGAIARLKHRGVD VVEIVGGDAHDARHLLAAAVAKGTDAVMVTGGDGVVSNALQVLAGTDIPLGIIPAGTG NDHAREFGLPTKNPKAAADIVVDGWTETIDLGRIQDDNGIEKWFGTVAATGFDSLVND RANRMRWPHGRMRYYIAMLAELSRLRPLPFRLVLDGTEEIVADLTLADFGNTRSYGGG LLICPNADHSDGLLDITMAQSDSRTKLLRLFPTIFKGAHVELDEVSTTRAKTVHVECP GINVYADGDFACPLPAEISAVPAALQVLRPRHG" CDS 2511457..2511960 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2277" /product="Possible secreted unknown protein" /note="Mb2277, -, len: 167 aa. Equivalent to Rv2253, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 167 aa overlap). Possible secreted protein; has potential N-terminal signal peptide. Mb2277 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0L7" /protein_id="SIU00888.1" /translation="MSGHRKKAMLALAAASLAATLAPNAVAAAEPSWNGQYLVTLSAN AKTGTSMAANRPEYPHKANYTFSSRCASDVCIATVVDAPPPKNEFIPRPIEYTWNGTQ WVREISWQWDCLLPDGTIEYAPAKSITAYTPGQYGILTGVFHTDIASGTCKGNVDMPV SAKPIVG" CDS complement(2511993..2512448) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2278C" /product="Probable integral membrane protein" /note="Mb2278c, -, len: 151 aa. Equivalent to Rv2254c, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 151 aa overlap). Probable integral membrane protein. Mb2278c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0P4" /db_xref="InterPro:IPR001123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0P4" /protein_id="SIU00889.1" /translation="MRYRDLETVAAPTINVLRVWPEIVGAIVLLVIAAMGIGHGLRPS PEPVPAPQKQLGCVRFALIFGLTVINPATFVYFTAVAVTLARALRATTAIAVVVGVAL ASLLWQLLLVSAGAFLRSRATARVRRMTVLAGNAVIAAFGAVLVVHAFA" CDS complement(2512453..2512647) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2279C" /product="HYPOTHETICAL PROTEIN" /note="Mb2279c, -, len: 64 aa. Equivalent to Rv2255c, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 64 aa overlap). Hypothetical unknown protein. Mb2279c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Q3" /protein_id="SIU00890.1" /translation="MDGIVDRGVRARPCQKVVAVLRRSKSHIDKRLDAATGNAFLGKQ VLSAAGVVEYRPPRRSPLST" CDS complement(2512814..2513347) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2280C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2280c, -, len: 177 aa. Equivalent to Rv2256c, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 177 aa overlap). Conserved hypothetical protein, similar to Streptomyces glaucescens ORF5 (164 aa) and Streptomyces coelicolor hypothetical protein SC4A7.19c (164 aa; emb|CAB62723.1|AL133423). FASTA scores: sptr|Q54209|Q54209 FABD, FABH, FABC, FABB, AND ORF5 (164 aa) opt: 504, E(): 3.9e-27; (44.4% identity in 162 aa overlap). Protein product from Mb2280c detected using SWATH mass spectrometry. Mb2280c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021491" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0N9" /protein_id="SIU00891.1" /translation="MEPKEQQMRASNQFADVTSGVVYIHGSPAAVCPHVEWALSSTLQ AKANLVWTPQPALPPQLRAVTNWVGPVGTGARLANALRSWSVLRFEVTEDPSPGVDGQ RFSHTPQLGLWSGAMSANGDIMVGEMRLRAMMAQGADTLAAELDSVLGTAWDQALEVY RDGGDAGEVTWLSRGVG" CDS complement(2513477..2514295) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2281C" /product="Beta-lactamase class C-like and penicillin binding proteins (PBPs) superfamily" /note="Mb2281c, -, len: 272 aa. Equivalent to Rv2257c, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Conserved hypothetical protein, similar to hypothetical protein SC4A7.08 from Streptomyces coelicolor (273 aa; 58% identity in 243 aa overlap). Also similar to several putative esterases and penicillin-binding proteins in M. tuberculosis e.g. Rv1923, Rv1497, Rv2463, Rv3775, Rv1922, Rv1730c. Protein product from Mb2281c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2281c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0P6" /protein_id="SIU00892.1" /translation="MTALEVLGGWPVPAAAAAVIGPAGVLATHGDTARVFALASVTKP LVARAAQVAVEEGVVNLDTPAGPPGSTVRHLLAHTSGLAMHSDQALARPGTRRMYSNY GFTVLAESVQRESGIEFGRYLTEAVCEPLGMVTTRLDGGPAAAGFGATSTVADLAVFA GDLLRPSTVSAQMHADATTVQFPGLDGVLPGYGVQRPNDWGLGFEIRNSKSPHWTGEC NSTRTFGHFGQSGGFIWVDPKADLALVVLTARDFGDWALDLWPAISDAVLAEYT" CDS complement(2514309..2515370) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2282C" /product="Possible transcriptional regulatory protein" /note="Mb2282c, -, len: 353 aa. Equivalent to Rv2258c, len:353 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 353 aa overlap). Possible transcriptional regulatory protein, similar to several hypothetical proteins from C. elegans. FASTA scores: sptr|O01593|O01593 CODED FOR BY C. ELEGANS CDNA YK102 F (365 aa) opt: 577, E(): 6.4e-31; (30.5% identity in 341 aa overlap). Contains possible helix-turn helix motif at aa 47-68 (+3.65 SD) Protein product from Mb2282c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2282c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025714" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0N3" /protein_id="SIU00893.1" /translation="MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLP PATSMEIAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGPDN LAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVFDAALIDVVLP LVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTGIDFSDEAVAAGTEEAARL GLANATFERHDLAELDKVGAYDVITVFDAIHDQAQPARVLQNIYRALRPGGVLLMVDI KASSQLEDNVGVPLSTYLYTTSLMHCMTVSLALDGAGLGTVWGRQLATSMLADAGFTD VTVAEIESDVLNNYYIARK" CDS 2515612..2516697 /codon_start=1 /transl_table=11 /gene="mscr" /locus_tag="BQ2027_MB2283" /product="s-nitrosomycothiol reductase mscr" /note="Mb2283, adhE2, len: 361 aa. Equivalent to Rv2259, len: 361 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 361 aa overlap). Probable adhE2, zinc-containing alcohol dehydrogenase, similar to several, especially mycothiol-dependent formaldehyde dehydrogenase from Amycolatopsis methanolica P80094 (360 aa). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. FASTA scores: >sp|P80094|FADH_AMYME NAD/MYCOTHIOL-DEPENDENT FORMALDEHYDE DEHYDROGENASE (MD-FALDH) Length = 360, Expect = e-156, Identities = 268/358 (74%). Also similar to Rv0162c, (MTCI28.02c, 35.0% identity in 371 aa overlap). Protein product from Mb2283 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2283 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2U2" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR017816" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2U2" /protein_id="SIU00894.1" /translation="MSQTVRGVIARQKGEPVELVNIVVPDPGPGEAVVDVTACGVCHT DLTYREGGINDEYPFLLGHEAAGIIEAVGPGVTAVEPGDFVILNWRAVCGQCRACKRG RPRYCFDTFNAEQKMTLTDGTELTAALGIGAFADKTLVHSGQCTKVDPAADPAVAGLL GCGVMAGLGAAINTGGVTRDDTVAVIGCGGVGDAAIAGAALVGAKRIIAVDTDDTKLD WARTFGATHTVNAREVDVVQAIGGLTDGFGADVVIDAVGRPETYQQAFYARDLAGTVV LVGVPTPDMRLDMPLVDFFSHGGALKSSWYGDCLPESDFPTLIDLYLQGRLPLQRFVS ERIGLEDVEEAFHKMHGGKVLRSVVML" CDS 2516697..2517332 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2284" /product="Putative hydrolase in cluster with formaldehyde/S-nitrosomycothiol reductase MscR" /note="Mb2284, -, len: 211 aa. Equivalent to Rv2260, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins Rv0634c, Rv1637c, Rv3677c, Rv2581c from Mycobacterium tuberculosis and to various hydrolases. FASTA scores: sptr|O06154|O06154 HYPOTHETICAL 21.3 KD PROTEIN (200 aa) opt: 355, E(): 4e-15; (37.4% identity in 198 aa overlap). Protein product from Mb2284 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2284 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H3" /protein_id="SIU00895.1" /translation="MAAIERVITHGTFELDGGSWEVDNNIWLVGDDSEVVVFDAAHHA APIIDAVGGRKVVAVICTHGHNDHVTVAPELGTALDAPVLMHPGDAVLWRMTHPDKSF RAVSDGDAVRVGGTELRALHTPGHSPGSVCWYAPELGPGTGTVFSGDTLFAGGPGATG RSYSDFPTILRSISGRLGALPGDTVVHTGHGDSTTIGDEIVHYEEWVARGH" CDS complement(2517409..2518917) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2285C" /product="Apolipoprotein N-acyltransferase" /note="Mb2285c, -, len: 502 aa. Equivalent to Rv2262c and Rv2261c, len: 360 aa and 140 aa, from Mycobacterium tuberculosis strain H37Rv, (94.7% identity in 357 aa overlap and 100.0% identity in 140 aa overlap). Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|CUTE|B0657 APOLIPOPROTEIN N-ACYLTRANSFERASE (EC 2.3.1.-) from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity in 359 aa overlap). Note that neighboring ORF shows similarity to N -terminal part of PCC6803 apolipoprotein N-acyltransferase from Synechocystis sp., suggesting possibility of frameshift. Sequence of clones from two sources has been checked but no error found. Appear to be two extra bases at position 1876970 compared to CDC1551 strain. Conserved hypothetical protein, with function unknown but some similarity to C-terminal end of PCC6803 apolipoprotein N-acyltransferase from Synechocystis sp. Note that next ORF shows similarity to N-terminal part of P74055 APOLIPOPROTEIN N-ACYLTRANSFERASE from Escherichia coli (519 aa), FASTA scores: opt: 142, E(): 0.007, (29.9% identity in 117 aa overlap), suggesting possible frameshift. Sequence of clones from two sources has been checked but no error found. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2262c and Rv2261c exist as 2 genes. In Mycobacterium bovis, a 2 bp deletion (ct-*) results in a single product which is more similar to Rv2262c." /db_xref="GOA:A0A1R3Y0V4" /db_xref="InterPro:IPR003010" /db_xref="InterPro:IPR004563" /db_xref="InterPro:IPR036526" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V4" /protein_id="SIU00896.1" /translation="MALRAGARRQPVIGCAAALVFGGLPALAFPAPSWWWLAWFGLVP LLLVVRAAPTSWEGALRAWTGMGGFVLATQYWLVTSAGPMLVLLAAGLGVLWLPAGWL AHRLLSVPVTTCRVGAALVVVPSAWVAAEAVRSWQSLGGPWALLGASQWSQPVTLASA SLGGVWLTSFLLVATNTAIASVLVCRATGGRLVALGCVIGCAGLGPASYLLGSVPVGG PTVRVALVQAGDIADAAARLAAGEEFTAAVADQRPDLVVWGESSVGQDLTRHPDVLAR LAELSQRVGADLLVNVDAPAPDGGIYKSAVLVGAHEAVGSYRKTRLVPFGEYVPLRPL FGWITRYSKAAAKDRQRGAGPVVLAVNSLHIAPLISYEMTFSDLTRHAARLGAALLVY QSSTSTFQGSWAQPQLAAQPAVRAVEAGIPAVHASLSGDSSAFDTRGRRLAWCSAEFN GAIVVNVPLASNVTLYLRLGDWVPVTAFVVMGAGFAVFLRRSLARVSDCADK" CDS 2519006..2519959 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2286" /product="Possible oxidoreductase" /note="Mb2286, -, len: 317 aa. Equivalent to Rv2263, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 317 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to several oxidoreductases. Similarity suggests alternative GTG start at 10154 but then no rbs. FASTA scores: sptr|Q544 05|Q54405 PROBABLY AN NADP-DEPENDENT OXIDOREDUCTASE (297 aa) opt: 487, E(): 1.1e-23; (36.1% identity in 299 aa overlap). Also similar to M. tuberculosis Rv0068, and Rv0439c. Protein product from Mb2286 detected using SWATH mass spectrometry. Mb2286 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y124" /protein_id="SIU00897.1" /translation="MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMA IRNRAKGEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINNAG VMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLSSLAARRGRIH FDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWGIISNAAHPGLTKTNLQIA GPSHGRDKPALMERLYKTSWRFAPFLWQEIEEGILPALYAAATPQADGGAFYGPRGRY EVAGGGVREAKVPAAARNDADSKRLWEVSEQLTGVSYPKSR" CDS complement(2519937..2521715) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2287C" /product="FIG00821990: molecular chaperone" /note="Mb2287c, -, len: 592 aa. Equivalent to Rv2264c, len: 592 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 592 aa overlap). Conserved hypothetical Pro-rich protein, similar to hypothetical proteins Rv0312 (MTCY63.17, 620 aa and Rv0350) that has highly Pro-, Thr-rich C-terminus. Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. FASTA scores: Z96800|MTCY63_17 Mycobacterium tuberculosis cosmid (620 aa) opt: 1075, E(): 8.8e-24; (38.9% identity in 627 aa overlap). Protein product from Mb2287c detected using SWATH mass spectrometry. Mb2287c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0M5" /db_xref="InterPro:IPR004753" /db_xref="InterPro:IPR013126" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0M5" /protein_id="SIU00898.1" /translation="MATGARPALGLSIGVTNLAAVAADHSITRKPVLTLYRQRPPEVG VPSENPRLDEPGLVITDFVDRVGDSVGIVAADGSVYRSEALVADALLALAYTATGGRA LPGSVTVTYPAHWGPAAVAALDSALRRASEWSHGTSSTAQPLSLLPDAAAALYAIRAD PGIPARGIVAVCDFGGSGTGITLVDAADEYRPVAATVRHQAFSGDLIDQSLLSYVMSE LPGTGAFDPAGTSAIGSLTKLRIECRKAKERLSSSTVTTLTDALGGDIRLTRNELEDT IRDSLDSVGRALEQTLARSGIRTAELVAIVSVGGGANIPAVTTTLSGRFCVPVVRTPR PQLTAAFGGALWAARRPGDTSATVLTAVTSATATAPADAPASVLQPALAWSEADEDSH IGPAPGYTAARPSLSFDHDAHAEPEPKSPPIPWYRLPAVIITGTTVAVLLVGAAVAIG LSTGDQPTAPGTPQRPGVTTTAAPPPSPAPASDGPTTEPAPPVQAPATGGPAPPLQQP LPPPPTTTNTQPAVTTDVITPAPTTPASAPPATTQPPATTQPPATTSPSPPPIPPIPP IPEIPQLPPGIPQVPGIGQFSAISGS" CDS 2522065..2523294 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2288" /product="Possible conserved integral membrane protein" /note="Mb2288, -, len: 409 aa. Equivalent to Rv2265, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 409 aa overlap). Possible conserved integral membrane protein, with some similarity to others e.g. M. thermoauto. sp|O26855|O26855 CONSERVED PROTEIN (383 aa), FASTA score: opt: 898 z-score: 1023.5 E(): 0; 38.0% identity in 384 aa overlap; Q58713 HYPOTHETICAL 44.1 KD PROTEIN 1 317 (398 aa), FASTA scores, opt: 305 E(): 1.2e-11; 22.8% identity in 382 aa overlap; also KGTP_ECOLI P17448 alpha-ketoglutarate permease (432 aa), FASTA scores, opt: 156, E(): 0.006, (24.8% identity in 416 aa overlap) Protein product from Mb2288 detected using SWATH mass spectrometry." /db_xref="GOA:P64962" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/Swiss-Prot:P64962" /protein_id="SIU00899.1" /translation="MGANGDVALSRIGATRPALSAWRFVTVFGVVGLLADVVYEGARS ITGPLLASLGATGLVVGVVTGVGEAAALGLRLVSGPLADRSRRFWAWTIAGYTLTVVT VPLLGIAGALWVACALVIAERVGKAVRGPAKDTLLSHAASVTGRGRGFAVHEALDQVG AMIGPLTVAGMLAITGNAYAPALGVLTLPGGAALALLLWLQRRVPRPESYEDCPVVLG NPSAPRPWALPAQFWLYCGFTAITMLGFGTFGLLSFHMVSHGVLAAAMVPVVYAAAMA ADALTALASGFSYDRYGAKTLAVLPILSILVVLFAFTDNVTMVVIGTLVWGAAVGIQE STLRGVVADLVASPRRASAYGVFAAGLGAATAGGGALIGWLYDISIGTLVVVVIALEL MALVMMFAIRLPRVAPS" CDS 2523469..2524755 /codon_start=1 /transl_table=11 /gene="cyp124" /locus_tag="BQ2027_MB2289" /product="Probable cytochrome P450 124 CYP124" /note="Mb2289, cyp124, len: 428 aa. Equivalent to Rv2266, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 428 aa overlap). Probable cyp124, cytochrome P450 (EC 1.14.-.-), similar to e.g. G405543 cytochrome P450 (406 aa), FASTA scores, opt: 763,E(): 0, (35.4% identity in 393 aa overlap), similar to e.g. MTCY50.26, 33.8% identity in 370 aa overlap Protein product from Mb2289 detected using SWATH mass spectrometry. Mb2289 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A517" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P0A517" /protein_id="SIU00900.1" /translation="MGLNTAIATRVNGTPPPEVPIADIELGSLDFWALDDDVRDGAFA TLRREAPISFWPTIELPGFVAGNGHWALTKYDDVFYASRHPDIFSSYPNITINDQTPE LAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVSSMIANNPDRQ ADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGFGDPDLATDFDEFMQVSAD IGAYATALAEDRRVNHHDDLTSSLVEAEVDGERLSSREIASFFILLVVAGNETTRNAI THGVLALSRYPEQRDRWWSDFDGLAPTAVEEIVRWASPVVYMRRTLTQDIELRGTKMA AGDKVSLWYCSANRDESKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVA FDELRRQMPDVVATEEPARLLSQFIHGIKTLPVTWS" CDS complement(2525009..2526175) /codon_start=1 /transl_table=11 /gene="stf3" /locus_tag="BQ2027_MB2290C" /product="Sulfotransferase" /note="Mb2290c, -, len: 388 aa. Equivalent to Rv2267c, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 388 aa overlap). Conserved hypothetical protein; some similarity to Mycobacterium tuberculosis Rv3529c; gp|Z82098|MTCY3C7_27 (384 aa) FASTA score: opt: 261, E(): 3.6e-10; 27.3% identity in 253 aa overlap" /db_xref="GOA:P64964" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P64964" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00901.1" /translation="MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSR WHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVV DDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQ GLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKN PTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKV VSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRL RQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG" CDS complement(2526172..2527641) /codon_start=1 /transl_table=11 /gene="cyp128" /locus_tag="BQ2027_MB2291C" /product="PROBABLE CYTOCHROME P450 128 CYP128" /note="Mb2291c, cyp128, len: 489 aa. Equivalent to Rv2268c, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 489 aa overlap). Probable cyp128, cytochrome P450 (EC 1.14.-.-), similar to (but longer than) cytochrome p-450 e.g. CPXK_SACER P3 3271 cytochrome p-450 107b1 (405 aa), FASTA scores, opt: 620, E(): 8.3e-33, (31.8% identity in 406 aa overlap); contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature, similar to MTCY50.26, 32.7% identity in 382 aa overlap" /db_xref="GOA:P63714" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63714" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00902.1" /translation="MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTR RRASSGGIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLTDF DPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNHDTLSSARGVT FSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMVDQLARELVGGLLTQTPAD VVSTVAAPMPMRAITSVLGVDGPDEAAFCRLSNQAVRITDVALSASGLISLVQGFAGF RRLRALFTHRRDNGLLRECTVLGKLATHAEQGRLSDDELFFFAVLLLVAGYESTAHMI STLFLTLADYPDQLTLLAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPA GSLVLLAWGAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP" CDS complement(2527654..2527986) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2292C" /product="HYPOTHETICAL PROTEIN" /note="Mb2292c, -, len: 110 aa. Equivalent to Rv2269c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Unknown protein." /db_xref="UniProtKB/Swiss-Prot:P64966" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00903.1" /translation="MANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRY GGRAGIGRSETVTDHGAVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPL PCDCSTPL" CDS 2528063..2528590 /codon_start=1 /transl_table=11 /gene="lppN" /locus_tag="BQ2027_MB2293" /product="Probable lipoprotein lppN" /note="Mb2293, lppN, len: 175 aa. Equivalent to Rv2270, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 175 aa overlap). Probable lppN, lipoprotein; has appropriately positioned prokaryotic membrane lipoprotein attachment site PS00013. Mb2293 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7VEM3" /db_xref="UniProtKB/Swiss-Prot:Q7VEM3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00904.1" /translation="MRLPGRHVLYALSAVTMLAACSSNGARGGIASTNMNPTNPPATA ETATVSPTPAPQSARTETWINLQVGDCLADLPPADLSRITVTIVDCATAHSAEVYLRA PVAVDAAVVSMANRDCAAGFAPYTGQSVDTSPYSVAYLIDSHQDRTGADLTPSTVICL LQPANGQLLTGSARR" CDS 2528697..2528996 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2294" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2294, -, len: 99 aa. Equivalent to Rv2271, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Conserved hypothetical protein; some similarity to hypothetical protein AAK01340.1|AF265275_3 (AF265275) from uncultured organism Pu8 (104 aa) E= 4e-10, (34% identity in 91 aa overlap) Protein product from Mb2294 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2294 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024248" /db_xref="UniProtKB/Swiss-Prot:P64968" /protein_id="SIU00905.1" /translation="MTTPPDKARRRFLRDAYKNAERVARTALLTIDQDQLEQLLDYVD ERLGEQPCDHTARHAQRWAQSHRIEWETLAEGLQEFGGYCDCEIVMNVEPEAIFG" CDS 2529102..2529470 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2295" /product="probable conserved transmembrane protein" /note="Mb2295, -, len: 122 aa. Equivalent to Rv2272, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Probable conserved transmembrane PROTEIN, similar to YIDH_ECOLI P31445 hypothetical 12.8 kd protein (115 aa), FASTA scores, opt: 291, E(): 2.9e-14, (45.6% identity in 103 aa overlap), similar to MTCY339.37c, (35.0% identity in 100 aa overlap). Mb2295 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64970" /db_xref="InterPro:IPR003807" /db_xref="UniProtKB/Swiss-Prot:P64970" /protein_id="SIU00906.1" /translation="MADDSNDTATDVEPDYRFTLANERTFLAWQRTALGLLAAAVALV QLVPELTIPGARQVLGVVLAILAILTSGMGLLRWQQADRAMRRHLPLPRHPTPGYLAV GLCVVGVVALALVVAKAITG" CDS 2529467..2529796 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2296" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2296, -, len: 109 aa. Equivalent to Rv2273, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Probable conserved transmembrane protein, similar to Rv2272 (MTCY339.38c), (35.0% identity in 100 aa overlap). Mb2296 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64972" /db_xref="InterPro:IPR003807" /db_xref="UniProtKB/Swiss-Prot:P64972" /protein_id="SIU00907.1" /translation="MNRHSTAASDRGLQAERTTLAWTRTAFALLVNGVLLTLKDTQGA DGPAGLIPAGLAGAAASCCYVIALQRQRALSHRPLPARITPRGQVHILATAVLVLMVV TAFAQLL" CDS complement(2529853..2530170) /codon_start=1 /transl_table=11 /gene="mazf8" /locus_tag="BQ2027_MB2297C" /product="possible toxin mazf8" /note="Mb2297c, -, len: 105 aa. Equivalent to Rv2274c, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 105 aa overlap). Unknown protein; questionable ORF,Mb2297c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0N0" /protein_id="SIU00908.1" /translation="MSIARSAQPIGWISCPPKGGSSCCRCGGGYTHMFCVSAWTGLVV DLQAEQVRSVVTERLRRRIGRGAPILAGTLAPGVGLAAQNREFRQFTGRSAPPSATIA FGE" CDS complement(2530204..2530452) /codon_start=1 /transl_table=11 /gene="mazE8" /locus_tag="BQ2027_MB2297A" /product="Possible antitoxin MazE8" /note="Mb2297A, len: 82 aa. Equivalent to Rv2274A len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible mazE8, antitoxin, part of toxin-antitoxin (TA) operon with Rv2274c (See Pandey and Gerdes, 2005). Mb2297A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R0" /protein_id="SIU00909.1" /translation="MAEPETLPGRWLPECACLAETVSWEQSRLWSRLLCRPHFRHALP GLTGGSASRPSARSARLVRQPRMTLFSLDHRDGVDARC" CDS 2530248..2531117 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2298" /product="Cyclo(L-tyrosyl-L-tyrosyl) synthase (EC" /EC_number="2.3.2.21" /note="Mb2298, -, len: 289 aa. Equivalent to Rv2275, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Conserved hypothetical protein. Some similarity to Bacillus subtilis sp|O34351|O34351 YVMC (248 aa), FASTA score: opt: 280, E(): 2.7e -11; 28.2% identity in 227 aa overlap,Mb2298 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0S3" /db_xref="InterPro:IPR030903" /db_xref="InterPro:IPR038622" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0S3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00910.1" /translation="MSYVAAEPGVLISPTDDLQSPRSAPAAHDENADGITGGTRDDSA PNSRFQLGRRIPEATAQEGFLVRPFTQQCQIIHTEGDHAVIGVSPGNSYFSRQRLRDL GLWGLTNFDRVDFVYTDVHVAESYEALGDSAIEARRKAVKNIRGVRAKITTTVNELDP AGARLCVRPMSEFQSNEAYRELHADLLTRLKDDEDLRAVCQDLVRRFLSTKVGPRQGA TATQEQVCMDYICAEAPLFLDTPAILGVPSSLNCYHQSLPLAEMLYARGSGLRASRNQ GHAIVTPDGSPAE" CDS 2531114..2532304 /codon_start=1 /transl_table=11 /gene="cyp121" /locus_tag="BQ2027_MB2299" /product="Cytochrome P450 121 CYP121" /note="Mb2299, cyp121, len: 396 aa. Equivalent to Rv2276, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 396 aa overlap). cyp121, cytochrome P450 (EC 1.14.-.-) (see citation below), similar to e.g. G303644 (397 aa) opt: 675, z-score: 776.4, E(): 2.7e-36, (33.7% identity in 407 aa overlap); contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature, similar to MTCY339.42, 29.2% identity in 298 aa overlap. Protein product from Mb2299 detected using SWATH mass spectrometry. Mb2299 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A515" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="PDB:5EDT" /db_xref="UniProtKB/Swiss-Prot:P0A515" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00911.1" /translation="MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWL VSSYALCTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMKAI TPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVLGIPQEDGPKL FRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITTGLMGELSRLRKDPAYSHV SDELFATIGVTFFGAGVISTGSFLTTALISLIQRPQLRNLLHEKPELIPAGVEELLRI NLSFADGLPRLATADIQVGDVLVRKGELVLVLLEGANFDPEHFPNPGSIELDRPNPTS HLAFGRGQHFCPGSALGRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERL PVLW" CDS complement(2532489..2533388) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2300C" /product="Possible glycerolphosphodiesterase" /note="Mb2300c, -, len: 299 aa. Equivalent to Rv2277c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (99.661% identity in 295 aa overlap). Possible glycerolphosphodiesterase, similar to e.g. UGPQ_ECOLI P10908 glycerophosphoryldiester phosphodiesterase (cytosolic) (247 aa), FASTA scores, opt: 149, E(): 0.0061, (27.2% identity in 195 aa overlap). Start of protein uncertain, encoded by neighbouring IS6110 as given, is intact in Mycobacterium tuberculosis CDC1551. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 1358 bp deletion containing an IS6110 sequence prior to the start of Mb2300c disrupts the 5' start of Rv2277c resulting in a slightly shorter product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (299 aa versus 301 aa). Mb2300c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0R7" /db_xref="InterPro:IPR017946" /db_xref="InterPro:IPR030395" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R7" /protein_id="SIU00912.1" /translation="MLGAVALVIALGGTCGVADALPLGQTDDPMIVAHRAGTRDFPEN TVLAITNAVAAGVDGMWLTVQVSSDGVPVLYRPSDLATLTDGAGPVNSKTVQQLQQLN AGWNFTTPGVEGHPYRQRATPIPTLEQAIGATPPDMTLFLDPKQTPPQPLVSAVAQVL TRTGAAGRSIVYSTNADITAAASRQEGLQVAESRDVTRQRLFNMALNHHCDPQPDPGK WAGFELHRDVTVTEEFTLGSGISAVNAELWDEASVDCFRSQSGMKVMGFAVKTVDDYR LAHKIGLDAVLVDSPLAAQQWRH" CDS 2533567..2534946 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2301" /product="Probable dehydrogenase" /note="Mb2301, -, len: 459 aa. Equivalent to Rv2280, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 459 aa overlap). Probable dehydrogenase. Similar to D-lactate dehydrogenase (cytochrome) precursor e.g. G1061264 (587 aa), FASTA scores, opt: 645,E(): 1.3e-31, (28.0% identity in 478 aa overlap), similar to MTCY50.25, 36.5% identity in 447 aa overlap Protein product from Mb2301 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2301 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0P8" /db_xref="InterPro:IPR004113" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016164" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016171" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0P8" /protein_id="SIU00913.1" /translation="MSEMTARFSEIVGNANLLTGDAIPEDYAHDEELTGPPQKPAYAA KPATPEEVAQLLKAASENGVPVTARGSGCGLSGAARPVEGGLLISFDRMNKVLEVDTA NQVAVVQPGVALTDLDAATADTGLRYTVYPGELSSSVGGNVGTNAGGMRAVKYGVARH NVLGLQAVLPTGEIIRTGGRMAKVSTGYDLTQLIIGSEGTLALVTEVIVKLHPRLDHN ASVLAPFADFDQVMAAVPKILASGLAPDILEYIDNTSMAALISTQNLELGIPDQIRDS CEAYLLVALENRIADRLFEDIQTVGEMLMELGAVDAYVLEGGSARKLIEAREKAFWAA KALGADDIIDTVVPRASMPKFLSTARGLAAAADGAAVGCGHAGDGNVHMAIACKDPEK KKKLMTDIFALAMELGGAISGEHGVGRAKTGYFLELEDPVKISLMRRIKQSFDPAGIL NPGVVFGDT" CDS 2535180..2536838 /codon_start=1 /transl_table=11 /gene="pitB" /locus_tag="BQ2027_MB2302" /product="Putative phosphate-transport permease PitB" /note="Mb2302, pitB, len: 552 aa. Equivalent to Rv2281, len: 552 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 552 aa overlap). Putative pitB, phosphate-transport permease, integral membrane protein, similar to YG04_HAEIN P45268 putative phosphate permease hi1604 (420 aa). FASTA scores, opt: 484, E(): 5e-23, (33.5% identity in 498 aa overlap) also to G399598 amphotropic murine retrovirus receptor (656 aa) FASTA scores, opt: 453, E(): 5.8e-21, (26.8% identity in 645 aa overlap). Also similar to Rv0545c|pitA from M. tuberculosis. BELONGS TO THE PIT SUBFAMILY. Mb2302 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65713" /db_xref="InterPro:IPR001204" /db_xref="UniProtKB/Swiss-Prot:P65713" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00914.1" /translation="MSDNAKHHRDGHLVASGLQDRAARTPQHEGFLGPDRPWHLSFSL LLAGSFVLFSWWAFDYAGSGANKVILVLATVVGMFMAFNVGGNDVANSFGTSVGAGTL TMKQALLVAAIFEVSGAVIAGGDVTETIRSGIVDLSGVSVDPRDFMNIMLSALSAAAL WLLFANRMGYPVSTTHSIIGGIVGAAIALGMVSGQGGAALRMVQWDQIGQIVVSWVLS PVLGGLVSYLLYGVIKRHILLYNEQAERRLTEIKKERIAHRERHKAAFDRLTEIQQIA YTGALARDAVAANRKDFDPDELESDYYRELHEIDAKTSSVDAFRALQNWVPLVAAAGS MIIVAMLLFKGFKHMHLGLTTMNNYFIIAMVGAAVWMATFIFAKTLRGESLSRSTFLM FSWMQVFTASGFAFSHGSNDIANAIGPFAAILDVLRTGAIEGNAAVPAAAMVTFGVAL CAGLWFIGRRVIATVGHNLTTMHPASGFAAELSAAGVVMGATVLGLPVSSTHILIGAV LGVGIVNRSTNWGLMKPIVLAWVITLPSAAILASVGLVALRAIF" CDS complement(2536945..2537883) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2303C" /product="Probable transcription regulator (lysR family)" /note="Mb2303c, -, len: 312 aa. Equivalent to Rv2282c, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 312 aa overlap). Probable transcriptional regulator, lysR family, similar to others e.g. YC30_CYAPA|P48271 hypothetical transcriptional regulator YCF30 (324 aa), FASTA scores: opt: 292, E(): 4e-12, (27.6% identity in 286 aa overlap); etc. Also similar to Rv0377|MTCY39.34 from Mycobacterium tuberculosis, FASTA score: (25.4% identity in 268 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature, and contains helix-turn-helix motif at aa 24 -45 (+4.93 SD). Protein product from Mb2303c detected using SWATH mass spectrometry." /db_xref="GOA:P67668" /db_xref="InterPro:IPR000847" /db_xref="InterPro:IPR005119" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67668" /protein_id="SIU00915.1" /translation="MPLSSRMPGLTCFEIFLAIAEAGSLGGAARELGLTQQAVSRRLA SMEAQIGVRLAIRTTRGSQLTPAGIVVAEWAARLLEVADEIDAGLGSLRTEGRQRIRV VASQTIAEQLMPHWMLSLRAADMRRGGTVPEVILTATNSEHAIAAVRDGIADLGFIEN PCPPTGLGSVVVARDELVVVVPPGHKWARRSRVVSARELAQTPLVTREPNSGIRDSLT AALRDTLGEDMQQAPPVLELSSAAAVRAAVLAGAGPAAMSRLAIADDLAFGRLLAVDI PALNLRRQLRAIWVGGRTPPAGAIRDLLSHITSRST" CDS 2537948..2538142 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2304" /product="HYPOTHETICAL PROTEIN" /note="Mb2304, -, len: 64 aa. Equivalent to Rv2283, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 64 aa overlap). Unknown protein; questionable ORF" /db_xref="UniProtKB/Swiss-Prot:P64974" /protein_id="SIU00916.1" /translation="MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHA AGPLGSTSRAGHRSYGGVAS" CDS 2538152..2539447 /codon_start=1 /transl_table=11 /gene="lipM" /locus_tag="BQ2027_MB2305" /product="Probable esterase LipM" /note="Mb2305, lipM, len: 431 aa. Equivalent to Rv2284, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 431 aa overlap). Probable lipM, esterase (EC 3.1.-.-), similar to others e.g. gp|Z95844|MTCY493_28 from Mycobacterium tuberculosis cosmid (420 aa), FASTA scores: opt: 1266, E(): 0, (50.1% identity in 411 aa overlap). Some similarity to G537514 arylacetamide deacetylase (399 aa), FASTA scores: opt: 190, E(): 5.9e-05, (30.4% identity in 138 aa overlap). Protein product from Mb2305 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2305 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y144" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y144" /protein_id="SIU00917.1" /translation="MGAPRLIHVIRQIGALVVAAVTAAATINAYRPLARNGFASLWSW FIGLVVTEFPLPTLASQLGGLVLTAQRLTRPVRAVSWLVAAFSALGLLNLSRAGRQAD AQLTAALDSGLGPDRRTASAGLWRRPAGGGTAKTPGPLRMLRIYRDYAHDGDISYGEY GRANHLDIWRRPDLDLTGTAPVLFQIPGGAWTTGNKRGQAHPLMSHLAELGWICVAIN YRHSPRNTWPDHIIDVKRALAWVKAHISEYGGDPDFIAITGGSAGGHLSSLAALTPND PRFQPGFEEADTRVQAAVPFYGVYDFTRLQDAMHPMMLPLLERMVVKQPRTANMQSYL DASPVTHISADAPPFFVLHGRNDSLVPVQQARGFVDQLRQVSKQPVVYAELPFTQHAF DLLGSARAAHTAIAVEQFLAEVYATQHAGSEPGPAVAIP" CDS 2539480..2540817 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2306" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb2306, -, len: 445 aa. Equivalent to Rv2285, len: 445 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 445 aa overlap). Conserved hypothetical protein, member of Mycobacterium tuberculosis 15-membered protein family including Rv3740c, Rv3734c, Rv1425, Rv1760, Rv0895, Rv3480c. FASTA scores: gp|Z95844|MTCY493_29 Mycobacterium tuberculosis cosmid (459 aa) opt: 640, E(): 0; 33.4% identity in 470 aa overlap. Protein product from Mb2306 detected using SWATH mass spectrometry. Mb2306 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67207" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/Swiss-Prot:P67207" /protein_id="SIU00918.1" /translation="MKLLSPLDQMFARMEAPRTPMHIGAFAVFDLPKGAPRRFIRDLY EAISQLAFLPFPFDSVIAGGASMAYWRQVQPDPSYHVRLSALPYPGTGRDLGALVERL HSTPLDMAKPLWELHLIEGLTGRQFAMYFKAHHCAVDGLGGVNLIKSWLTTDPEAPPG SGKPEPFGDDYDLASVLAAATTKRAVEGVSAVSELAGRLSSMVLGANSSVRAALTTPR TPFNTRVNRHRRLAVQVLKLPRLKAVAHATDCTVNDVILASVGGACRRYLQELGDLPT NTLTASVPVGFERDADTVNAASGFVAPLGTSIEDPVARLTTISASTTRGKAELLAMSP NALQHYSVFGLLPIAVGQKTGALGVIPPLFNFTVSNVVLSKDPLYLSGAKLDVIVPMS FLCDGYGLNVTLVGYTDKVVLGFLGCRDTLPHLQRLAQYTGAAFEELETAALP" CDS complement(2540884..2541147) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2307C" /product="conserved hypothetical protein [SECOND PART]" /note="Mb2307c, -, len: 87 aa. Equivalent to 3' end of Rv2286c, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis hypothetical protein, Rv2466c, AL021246|MTV008_22 (207 aa). FASTA score: opt: 324, E(): 8.9e-15; . REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2286c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits Rv2286c into 2 parts, Mb2307c and Mb2308c. Protein product from Mb2307c detected using SWATH mass spectrometry. Mb2307c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R9" /protein_id="SIU00919.1" /translation="MPTLFLDGQCLFGPVLVDPPAGPAALNLWSVVTGMAGLPHVYEL QRPKSPADVELIAQQLRPYLDGRDWVSINRGEIVDIDRLAGRS" CDS complement(2541175..2541576) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2308C" /product="conserved hypothetical protein [FIRST PART]" /note="Mb2308c, -, len: 133 aa. Equivalent to 5' end of Rv2286c, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Conserved hypothetical protein. Similar to Mycobacterium tuberculosis hypothetical protein, Rv2466c, AL021246|MTV008_22 (207 aa). FASTA score: opt: 324, E(): 8.9e-15; 30.4% identity in 194 aa overlap. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2286c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits Rv2286c into 2 parts, Mb2307c and Mb2308c. Mb2308c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0T3" /db_xref="InterPro:IPR001853" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0T3" /protein_id="SIU00920.1" /translation="MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEIN LVAGKKHPWERDWSYGWSLMRIGALLRRTNMSLLDRWYAAIGHELHTLGGKPHDPAVA RRLLCDVGVNAAILDAALDDPTTHDDVRADH" CDS 2541710..2543338 /codon_start=1 /transl_table=11 /gene="yjcE" /locus_tag="BQ2027_MB2309" /product="Probable conserved integral membrane transport protein YjcE" /note="Mb2309, yjcE, len: 542 aa. Equivalent to Rv2287, len: 542 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 542 aa overlap). Probable yjcE, conserved integral membrane transport protein, similar to eukaryote NA+/H+ exchangers e.g. YJCE_ECOLI|P32703|B4065 Putative Na(+)/H(+) exchanger from Escherichia coli (549 aa), FASTA scores: opt: 436, E(): 5.6e-21, (29.4% identity in 555 aa overlap); etc. SEEMS TO BELONG TO CPA1 FAMILY (NA(+)/H(+) EXCHANGER FAMILY)." /db_xref="GOA:P65527" /db_xref="InterPro:IPR004705" /db_xref="InterPro:IPR006153" /db_xref="InterPro:IPR018422" /db_xref="UniProtKB/Swiss-Prot:P65527" /protein_id="SIU00921.1" /translation="MNGRRTIGEDGLVFGLVVIVALVAAVVVGTVLGHRYRVGPPVLL ILSGSLLGLIPRFGDVQIDGEVVLLLFLPAILYWESMNTSFREIRWNLRVIVMFSIGL VIATAVAVSWTARALGMESHAAAVLGAVLSPTDAAAVAGLAKRLPRRALTVLRGESLI NDGTALVLFAVTVAVAEGAAGIGPAALVGRFVVSYLGGIMAGLLVGGLVTLLRRRIDA PLEEGALSLLTPFAAFLLAQSLKCSGVVAVLVSALVLTYVGPTVIRARSRLQAHAFWD IATFLINGSLWVFVGVQIPGAIDHIAGEDGGLPRATVLALAVTGVVIATRIAWVQATT VLGHTVDRVLKKPTRHVGFRQRCVTSWAGFRGAVSLAAALAVPMTTNSGAPFPDRNLI IFVVSVVILVTVLVQGTSLPTVVRWARMPEDVAHANELQLARTRSAQAALDALPTVAD ELGVAPDLVKHLEKEYEERAVLVMADGADSATSDLAERNDLVRRVRLGVLQHQRQAVT TLRNQNLIDDIVLRELQAAMDLEEVQLLDPADAE" CDS 2543335..2543712 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2310" /product="HYPOTHETICAL PROTEIN" /note="Mb2310, -, len: 125 aa. Equivalent to Rv2288, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). Unknown hypothetical protein,Mb2310 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64976" /protein_id="SIU00922.1" /translation="MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAP MRRWCDGDVDGRKLLPPARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPG WAPFGWLHEPSGARCPKADGQSV" CDS 2543682..2544464 /codon_start=1 /transl_table=11 /gene="cdh" /locus_tag="BQ2027_MB2311" /product="Probable CDP-diacylglycerol pyrophosphatase Cdh (CDP-diacylglycerol diphosphatase) (CDP-diacylglycerol phosphatidylhydrolase)" /note="Mb2311, cdh, len: 260 aa. Equivalent to Rv2289, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). Probable cdh, CDP-diacylglycerol pyrophosphatase (EC 3.6.1.26), similar to CDH_SALTY|P26219 cdp-diacylglycerol pyrophosphatase (251 aa), FASTA scores: opt: 395, E(): 5.9e-20, (33.5% identity in 221 aa overlap). Protein product from Mb2311 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2311 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63752" /db_xref="InterPro:IPR003763" /db_xref="InterPro:IPR036265" /db_xref="InterPro:IPR038433" /db_xref="UniProtKB/Swiss-Prot:P63752" /protein_id="SIU00923.1" /translation="MPKSRRAVSLSVLIGAVIAALAGALIAVTVPARPNRPEADREAL WKIVHDRCEFGYRRTGAYAPCTFVDEQSGTALYKADFDPYQFLLIPLARITGIEDPAL RESAGRNYLYDAWAARFLVTARLNNSLPESDVVLTINPKNARTQDQLHIHISCSSPTT SAALRNVDTSEYVGWKQLPIDLGGRRFQGLAVDTKAFESRNLFRDIYLKVTADGKKME NASIAVANVAQDQFLLLLAEGTEDQPVAAETLQDHDCSITKS" CDS 2544606..2544761 /codon_start=1 /transl_table=11 /gene="lppOa" /locus_tag="BQ2027_MB2312" /product="Probable conserved lipoprotein lppOa" /note="Mb2312, lppOa, len: 51 aa. Equivalent to 5' end of Rv2290, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 39 aa overlap). Probable lppO, conserved lipoprotein, similar to Rv3763, 19KD_MYCTU P11572 19 kd lipoprotein antigen precursor (159 aa) FASTA scores, opt: 119, E (): 1.3, (25.6% identity in 164 aa overlap). ???Contains appropriately positioned PS00013 lipoprotein motif (with one mismatch). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, lppO exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*), splits lppO into 2 parts, lppOa and lppOb. Protein product from Mb2312 detected using shotgun mass spectrometry. Mb2312 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X6" /protein_id="SIU00924.1" /translation="MTDPRHTVRIAVGATALGVSALGATLPACSAHSGPGSPPVRRQL PRPRPSW" CDS 2544755..2545120 /codon_start=1 /transl_table=11 /gene="lppOb" /locus_tag="BQ2027_MB2313" /product="Probable conserved lipoprotein lppOb" /note="Mb2313, lppOb, len: 121 aa. Equivalent to 3' end of Rv2290, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 121 aa overlap). Probable lppO, conserved lipoprotein, similar to Rv3763, 19KD_MYCTU P11572 19 kd lipoprotein antigen precursor (159 aa) FASTA scores, opt: 119, E (): 1.3, (25.6% identity in 164 aa overlap). Contains appropriately positioned PS00013 lipoprotein motif (with one mismatch). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, lppO exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*), splits lppO into 2 parts, lppOa and lppOb. Protein product from Mb2313 detected using SWATH mass spectrometry. Mb2313 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1J8" /db_xref="InterPro:IPR008691" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J8" /protein_id="SIU00925.1" /translation="MVEGHTHTISGAVECRTSPAVRTATPSESGTQTTRVNAHDDSAS VTLSLSDSTPPDVNGFGISLKIGSVDYQMPYQPVQSPTQVEATRQGKSYTLTGTGHAV IPGQTGMRELPFGVHVTCP" CDS 2545262..2546032 /codon_start=1 /transl_table=11 /gene="sseB" /locus_tag="BQ2027_MB2314" /product="Probable thiosulfate sulfurtransferase SseB" /note="Mb2314, sseB, len: 256 aa. Equivalent to 3' end of Rv2291, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 256 aa overlap). Probable sseB, thiosulfate sulfurtransferase. Very similar to thiosulfate sulfurtransferas/rhodanese from Streptomyces coelicolor AL00920 4|SC9B10_21 (283 aa) opt: 765, E(): 0; Smith-Waterman score: 765; 46.9% identity in 286 aa overlap, similar to THTR_ECOLI P31142 putative thiosulfate sulfurtransferase (280 aa), FASTA scores, opt: 478, E(): 1e-23, (35.1% identity in 265 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, truncation at the 5' start due to a 2 bp deletion (tg-*), leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (256 aa versus 284 aa). Protein product from Mb2314 detected using SWATH mass spectrometry. Mb2314 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Y9" /db_xref="InterPro:IPR001307" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Y9" /protein_id="SIU00926.1" /translation="MRWRLDEPDGHAAYLQGHLPGAVFVSLEDELSDHTIAGRGRHPL PSGASLQATVRRCGIRHDVPVVVYDDWNRAGSARAWWVLTAAGIANVRILDGGLPAWR SAGGSIETGQVSPQLGNVTVLHDDLYAGQRLTLTAQQAGAGGVTLLDARVPERFRGDV EPVDAVAGHIPGAINVPSGSVLADDGTFLGNGALNALLSDHGIDHGGRVGVYCGSGVS AAVIVAALAVIGQDAALFPGSWSEWSSDPTRPVGRGTA" CDS complement(2546033..2547037) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2315C" /product="5'-methylthioadenosine nucleosidase (EC @ S-adenosylhomocysteine nucleosidase (EC" /EC_number="3.2.2.16" /EC_number="3.2.2.9" /note="Mb2315c, -, len: 334 aa. Similar to Rv2293c and Rv2292c, len: 246 aa and 74 aa, from Mycobacterium tuberculosis strain H37Rv, (92.2% identity in 245 aa overlap and 100.0% identity in 74 aa overlap). Conserved hypothetical protein; some similarity to hypothetical protein (299 aa) AAK24237.1| (AE005897) belonging to phosphorylase family [Caulobacter crescentus] (33% identity in 131 aa overlap). Possible lipoprotein: signal peptide at N-terminus. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2293c and Rv2292c exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-g) results in a single product which is more similar to Rv2293c. Mb2315c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64978" /db_xref="InterPro:IPR000845" /db_xref="InterPro:IPR035994" /db_xref="UniProtKB/Swiss-Prot:P64978" /protein_id="SIU00927.1" /translation="MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSA FPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFA RFTCASSIAIAAVMFSGVAGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTL SVVLDNINTLGNPVCLCRNVPVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGG SVFGCQPCSAPDRSLGYTGNFFQAAGPWLKNALISNLNIVSTVNPGFDAVDQETAAAQ AVADAHGVPFLGIRGMSDGPGDPLHLPGFPVQFFVYKQIAANNAARVTEAFLQNWAGV " CDS 2547332..2548555 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2316" /product="Probable aminotransferase" /note="Mb2316, -, len: 407 aa. Equivalent to Rv2294, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 407 aa overlap). Probable aminotransferase (EC 2.6.1.-), similar to others in M. tuberculosis e.g. MTV030_19, also similar to PATB_BACSU|Q08432 putative aminotransferase b from Bacillus subtilis (387 aa), FASTA scores: opt: 563, E(): 2 .8e-29, (31.4% identity in 408 aa overlap); and to MALY_ECOLI|P23256 maly protein from Escherichia coli (390 aa), FASTA scores: opt: 530, E(): 3.6e-27, (31.3% identity in 384 aa overlap). BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Protein product from Mb2316 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2316 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63503" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P63503" /protein_id="SIU00928.1" /translation="MIPNPLEELTLEQLRSQRTSMKWRAHPADVLPLWVAEMDVKLPP TVADALRRAIDDGDTGYPYGTEYAEAVREFACQRWQWHDLEVSRTAIVPDVMLGIVEV LRLITDRGDPVIVNSPVYAPFYAFVSHDGRRVIPAPLRGDGRIDLDALQEAFSSARAS SGSSGNVAYLLCNPHNPTGSVHTADELRGIAERAQRFGVRVVSDEIHAPLIPSGARFT PYLSVPGAENAFALMSASKAWNLGGLKAALAIAGREAAADLARMPEEVGHGPSHLGVI AHTAAFRTGGNWLDALLRGLDHNRTLLGALVDEHLPGVQYRWPQGTYLAWLDCRELGF DDAASDEMTEGLAVVSDLSGPARWFLDHARVALSSGHVFGIGGAGHVRINFATSRAIL IEAVSRMSRSLLERR" CDS 2548778..2549416 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2317" /product="Colicin E2 tolerance protein CbrC-like protein" /note="Mb2317, -, len: 212 aa. Equivalent to Rv2295, len: 212 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 212 aa overlap). Conserved hypothetical protein, cysteine-rich protein, similar to YIEJ_ECOLI P31469 hypothetical 22.5 kd protein in tnab-bglb intergenic region (195 aa), opt: 270, E(): 3.4e-11, (36.4% identity in 198 aa overlap). Alternative start suggested by similarity 26 codons further downstream,Mb2317 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005363" /db_xref="UniProtKB/Swiss-Prot:P67310" /protein_id="SIU00929.1" /translation="MDQSANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHP DPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATF TDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPD ALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA" CDS 2549510..2550412 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2318" /product="Probable haloalkane dehalogenase" /note="Mb2318, -, len: 300 aa. Equivalent to Rv2296, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Probable haloalkane dehalogenase (EC 3.8.1.5), similar to e.g. HALO_XANAU P22643, haloalkane dehalogenase, (310 aa), opt: 510 z-score: 577.7 E(): 3.1e-25 (39.0% identity in 315 aa overlap). Protein product from Mb2318 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2318 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64302" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR023489" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P64302" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00930.1" /translation="MDVLRTPDSRFEHLVGYPFAPHYVDVTAGDTQPLRMHYVDEGPG DGPPIVLLHGEPTWSYLYRTMIPPLSAAGHRVLAPDLIGFGRSDKPTRIEDYTYLRHV EWVTSWFENLDLHDVTLFVQDWGSLIGLRIAAEHGDRIARLVVANGFLPAAQGRTPLP FYVWRAFARYSPVLPAGRLVNFGTVHRVPAGVRAGYDAPFPDKTYQAGARAFPRLVPT SPDDPAVPANRAAWEALGRWDKPFLAIFGYRDPILGQADGPLIKHIPGAAGQPHARIK ASHFIQEDSGTELAERMLSWQQAT" CDS 2550444..2550896 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2319" /product="unknown protein" /note="Mb2319, -, len: 150 aa. Equivalent to Rv2297, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 150 aa overlap). Unknown protein; contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide Protein product from Mb2319 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2319 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P64980" /protein_id="SIU00931.1" /translation="MAMEMAMMGLLGTVVGASAMGIGGIAKSIAEAYVPGVAAAKDRR QQMNVDLQARRYEAVRVWRSGLCSASNAYRQWEAGSRDTHAPNVVGDEWFEGLRPHLP TTGEAAKFRTAYEVRCDNPTLMVLSLEIGRIEKEWMVEASGRTPKHRG" CDS 2551088..2552059 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2320" /product="Aldo/keto reductase" /note="Mb2320, -, len: 323 aa. Equivalent to Rv2298, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). Conserved hypothetical protein. Similar to SLR0545 Synechocystis sp, Q55493 hypothetical 34.6 kDa protein (314 aa), FASTA scores, opt: 427, E(): 1.7e-20, (39.3% identity in 303 aa overlap) and to YZAE_BACSU P46905 hypothetical protein in natb 3'region (268 aa) FASTA scores, opt: 370, E(): 6.1e-17, (31.4% identity in 264 aa overlap) Protein product from Mb2320 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2320 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63485" /db_xref="InterPro:IPR023210" /db_xref="InterPro:IPR036812" /db_xref="UniProtKB/Swiss-Prot:P63485" /protein_id="SIU00932.1" /translation="MKYLDVDGIGQVSRIGLGTWQFGSREWGYGDRYATGAARDIVKR ARALGVTLFDTAEIYGLGKSERILGEALGDDRTEVVVASKVFPVAPFPAVIKNRERAS ARRLQLNRIPLYQIHQPNPVVPDSVIMPGMRDLLDSGDIGAAGVSNYSLARWRKADAA LGRPVVSNQVHFSLAHPDALEDLVPFAELENRIVIAYSPLAQGLLGGKYGLENRPGGV RALNPLFGTENLRRIEPLLATLRAIAVDVDAKPAQVALAWLISLPGVVAIPGASSVEQ LEFNVAAADIELSAQSRDALTDAARAFRPVSTGRFLTDMVREKVSRR" CDS complement(2552065..2554008) /codon_start=1 /transl_table=11 /gene="htpG" /locus_tag="BQ2027_MB2321C" /product="PROBABLE CHAPERONE PROTEIN HTPG (HEAT SHOCK PROTEIN) (HSP90 FAMILY PROTEIN) (HIGH TEMPERATURE PROTEIN G)" /note="Mb2321c, htpG, len: 647 aa. Equivalent to Rv2299c, len: 647 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 647 aa overlap). htpG, probable chaperone, HSP90 familyHEAT SHOCK PROTEIN HSP90 FAMILY. Similar to HTPG_BACSU|P46208 heat shock protein htpG homologue from Bacillus subtilis (626 aa), FASTA scores: opt: 1551, E(): 0, (39.6% identity in 631 aa overlap). Contains possible helix-turn-helix motif at aa 519-540 (+3.77 SD). BELONGS TO THE HEAT SHOCK PROTEIN 90 FAMILY. Protein product from Mb2321c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2321c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64412" /db_xref="InterPro:IPR001404" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR019805" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR020575" /db_xref="InterPro:IPR036890" /db_xref="InterPro:IPR037196" /db_xref="UniProtKB/Swiss-Prot:P64412" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00933.1" /translation="MNAHVEQLEFQAEARQLLDLMVHSVYSNKDAFLRELISNASDAL DKLRIEALRNKDLEVDTSDLHIEIDADKAARTLTVRDNGIGMAREEVVDLIGTLAKSG TAELRAQLREAKNAAASEELIGQFGIGFYSSFMVADKVQLLTRKAGESAATRWESSGE GTYTIESVEDAPQGTSVTLHLKPEDAEDDLHDYTSEWKIRNLVKKYSDFIAWPIRMDV ERRTPASQEEGGEGGEETVTIETETLNSMKALWARPKEEVSEQEYKEFYKHVAHAWDD PLEIIAMKAEGTFEYQALLFIPSHAPFDLFDRDAHVGIQLYVKRVFIMGDCDQLMPEY LRFVKGVVDAQDMSLNVSREILQQDRQIKAIRRRLTKKVLSTIKDVQSSRPEDYRTFW TQFGRVLKEGLLSDIDNRETLLGISSFVSTYSEEEPTTLAEYVERMKDGQQQIFYATG ETRQQLLKSPHLEAFKAKGYEVLLLTDPVDEVWVGMVPEFDGKPLQSVAKGEVDLSSE EDTSEAEREERQKEFADLLTWLQETLSDHVKEVRLSTRLTESPACLITDAFGMTPALA RIYRASGQEVPVGKRILELNPSHPLVTGLRQAHQDRADDAEKSLAETAELLYGTALLA EGGALEDPARFAELLAERLARTL" CDS complement(2554082..2555014) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2322C" /product="MBL-fold metallo-hydrolase superfamily" /note="Mb2322c, -, len: 310 aa. Equivalent to Rv2300c, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Conserved hypothetical protein, similar to others e.g. Q9RXY2|DR0172 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (271 aa), FASTA scores: opt: 306, E(): 1.3e-12, (34.6% identity in 229 aa overlap); Q9HZH1|PA3037 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (288 aa), FASTA scores: opt: 248, E(): 7.9e-09, (31.5% identity in 238 aa overlap); Q9PDL8|XF1361 HYPOTHETICAL PROTEIN from Xylella fastidiosa (279 aa), FASTA scores: opt: 236, E(): 4.6e-08, (29.7% identity in 249 aa overlap); U70053|XCU70053_3 GumP PROTEIN from Xanthomonas campestris (282 aa), FASTA scores: opt: 222, E(): 3.7e-07, (30.1% identity in 248 aa overlap); etc. Mb2322c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64982" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/Swiss-Prot:P64982" /protein_id="SIU00934.1" /translation="MVATRGRPCPTNFSRPQRPRVAGNGTKSQRCRGRLTTSMLGVAP EAKGPPVKVHHLNCGTMNAFGIALLCHVLLVETDDGLVLVDTGFGIQDCLDPGRVGLF RHVLRPAFLQAETAARQIEQLGYRTSDVRHIVLTHFDFDHIGGIADFPEAHLHVTAAE ARGAIHAPSLRERLRYRRGQWAHGPKLVEHGPDGEPWRGFASAKPLDSIGTGVVLVPM PGHTRGHAAVAVDAGHRWVLHCGDAFYHRGTLDGRFRVPFVMRAEEKLLSYNRNQLRD NQARIVELHRRHDPDLLIVCAHDPDLYQLARDTA" CDS 2555021..2555713 /codon_start=1 /transl_table=11 /gene="cut2" /locus_tag="BQ2027_MB2323" /standard_name="cfp25" /product="PROBABLE CUTINASE CUT2" /note="Mb2323, cut2, len: 230 aa. Equivalent to Rv2301, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). Probable cut2 (alternate gene name: cfp25), cutinase (EC 3.1.1.-), highly similar to others from Mycobacteria tuberculosis e.g. MTCY13E12.04|Rv3451|O06318|CUT3_MYCTU (247 aa), FASTA scores: opt: 569, E(): 2.3e-27, (45.3% identity in 223 aa overlap); MT2037|MTCY39.35|RV1984C|Q10837|CUT1_MYCTU (217 aa), FASTA scores: opt: 383, E(): 3.4e-16 (42.9% identity in 217 aa overlap); O69691|Rv3724|MTV025.072 PUTATIVE CUTINASE PRECURSOR (187 aa), FASTA scores: opt: 248, E(): 4.3e-08, (41.85% identity in 172 aa overlap); etc. Also similar to few others from other organisms e.g. Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 391, E(): 1.1e-16, (39.15% identity in 235 aa overlap); etc. Contains PS00095 C-5 cytosine-specific DNA methylases C-terminal signature. BELONGS TO THE CUTINASE FAMILY. Start changed since first submission (+11 aa). Protein product from Mb2323 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2323 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63882" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR011150" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P63882" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00935.1" /translation="MNDLLTRRLLTMGAAAAMLAAVLLLTPITVPAGYPGAVAPATAA CPDAEVVFARGRFEPPGIGTVGNAFVSALRSKVNKNVGVYAVKYPADNQIDVGANDMS AHIQSMANSCPNTRLVPGGYSLGAAVTDVVLAVPTQMWGFTNPLPPGSDEHIAAVALF GNGSQWVGPITNFSPAYNDRTIELCHGDDPVCHPADPNTWEANWPQHLAGAYVSSGMV NQAADFVAGKLQ" CDS 2555819..2556061 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2324" /product="conserved protein" /note="Mb2324, -, len: 80 aa. Equivalent to Rv2302, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 80 aa overlap). Conserved hypothetical protein, highly similar to others: O53766|AL021942|Rv0569|MTV039.07 HYPOTHETICAL 9.5 KDA PROTEIN from Mycobacterium tuberculosis (88 aa), FASTA scores: opt: 300, E(): 1.4e-14, (61.85% identity in 76 aa overlap); O88049|SCI35.11 HYPOTHETICAL 7.1 KDA PROTEIN from Streptomyces coelicolor (64 aa), FASTA scores: opt: 169, E(): 1.5e-05, (46.55% identity in 58 aa overlap) (has its C-terminus shorter); Q9XCD1 HYPOTHETICAL 12.0 KDA PROTEIN (FRAGMENT) from Thermomonospora fusca (106 aa), FASTA scores: opt: 126, E(): 0.023, (50.0% identity in 34 aa overlap) (similarity in part for this one). Also weakly similar to U650M|G699303|Q50105 HYPOTHETICAL 5.7 KDA PROTEIN from Mycobacterium leprae (53 aa), FASTA scores: opt: 89, E(): 0.66, (45.5% identity in 33 aa overlap); and weakly similar to N-terminus of Q9RIZ1|SCJ1.23c putative DNA-binding protein from Streptomyces coelicolor (323 aa), FASTA scores: opt: 182, E(): 7.3e-06, (42.25% identity in 71 aa overlap). Protein product from Mb2324 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2324 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR015035" /db_xref="UniProtKB/Swiss-Prot:P64984" /protein_id="SIU00936.1" /translation="MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVN GHETTVYPGSDAVVVTATEHAEAEKRAAARAGHAAT" CDS complement(2556102..2557025) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2325C" /product="PROBABLE ANTIBIOTIC-RESISTANCE PROTEIN" /note="Mb2325c, -, len: 307 aa. Equivalent to Rv2303c, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 aa overlap). Probable antibiotic-resistance protein, with some similarity to Q54229|G153373 macrotetrolide antibiotic-resistance protein (NONR) from Streptomyces griseus (347 aa) (see the first citation below), FASTA scores: opt: 438, E(): 3.1e-21, (33.2% identity in 226 aa overlap); and other hypothetical proteins e.g. P95886 ORF C02006 from Sulfolobus solfataricus (269 aa), FASTA scores: opt: 252, E(): 3.5e-09, (25.5% identity in 286 aa overlap). Also similar to Mycobacterium tuberculosis Rv3510c|O53555|MTV023.17. Note that the protein Q9XDF3|NONC from Streptomyces griseus subsp. griseus (317 aa) is equivalent to Q54229|G153373|NONR however the N-terminal end is shorter (30 aa) owing to a changed start codon (see the second citation below). Protein product from Mb2325c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y165" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR032465" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3Y165" /protein_id="SIU00937.1" /translation="MTAPEPRVPVIDMWAPFVPSAEVIDDLREGFPVELLSYFEVFTK TTISAEQFGAYAESLRRTDDQILDSLDDAGITRSLITGFDERSTCGVTFVHNASVAAV AARYPDRFLPFAGADILAGDSAVDEFERWVVEHGFRGLSLRPFMIGRPASDPAYFPCY AKCVELGVPVSIHTSADWTRTRLSDLGHPRHIDDVACRFPELTILMSHGGYPWVLQAC LIAWKHPNVYLELAAHRPKYFASPGAGWEPLMRFGQTTIRNKIVYGTGGFLINRPYLQ LCDEMRALPVPREVLEDWLWRNATRVLRLDT" CDS complement(2557022..2557231) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2326C" /product="HYPOTHETICAL PROTEIN" /note="Mb2326c, -, len: 69 aa. Equivalent to Rv2304c, len: 69 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 69 aa overlap). Hypothetical unknown protein. Mb2326c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64986" /protein_id="SIU00938.1" /translation="MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFE QAFAVERDAGFDDFLHGPVGPRSTP" CDS 2557815..2559104 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2327" /product="NRPS condensation (elongation) domain containing protein" /note="Mb2327, -, len: 429 aa. Equivalent to Rv2305, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 429 aa overlap). Hypothetical unknown protein. Protein product from Mb2327 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2327 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00939.1" /translation="MTQTLRLTALDEMFITDDIDIVPSVQIEARVSGRFDLDRLAAAL RAAVAKHALARARLGRASLTARTLYWEVPDRADHLAVEITDEPVGEVRSRFYARAPEL HRSPVFAVAVVRETVGDRLLLNFHHAAFDGMGGLRLLLSLARAYADEPDEVGGPPIEE ARNLKGVAGSRDLFDVLIRARGLAKPAIDRKRTTRVAPDGGSPDGPRFVFAPLTIESD EMATAVARRPEGATVNDLAMAALALTILQWNRTHDVPAADSVSVNMPVNFRPTAWSTE VISNFASYLAIVLRVDEVTDLEKATAIVAGITGPLKQSGAAGWVVDLLEGGKVLPAML KRQLQLLLPLVEDRFVESVCLSNLGRVDVPAFGGEAGDTTEVWFSPTAAMSVMPIGVG LVGFGGTLRAMFRGDGRTIGGEALGRFAALYRDTLLT" CDS 2559114..2559707 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2328" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2328, -, len: 197 aa. Equivalent to Rv2306A, len: 197 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 197 aa overlap). Possible conserved membrane protein, similar to several hypothetical membrane proteins from Mycobacterium tuberculosis and Streptomyces coelicolor, e.g. Rv0625c|P96915|Y625_MYCTU HYPOTHETICAL 25.2 KDA PROTEIN from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 410, E(): 2.7e-17, (53.25% identity in 139 aa overlap). First 140 aa show high similarity, this then decreases but continues in next ORF Rv2306B, suggesting a frameshift near nt 2577473. However the sequence has been checked and no error found. The sequence is identical in CDC1551 and Mycobacterium bovis. Replaces original Rv2306c on other strand." /db_xref="GOA:A0A1R3Y0V2" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V2" /protein_id="SIU00940.1" /translation="MTDNECPADSRRRHVLRLALFAGILLGLFYLVAVARVIHVDGVR SAVVVATGPIAPLAYVVVSAALGALFVPGPILAAGSGVLFGPLLDTFVTLPAFSAGAQ AGMTPRRCWVSIAPIASMHRSNGADCGRWSVSASSPASRMRWPRTPSGRSEFRCGRWS LGRSSGRRHGCSSTPRWARRSPTCRRRWFTRRSRCGA" CDS 2559494..2559928 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2329" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2329, -, len: 144 aa. Equivalent to Rv2306B, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Possible conserved membrane protein, similar to C-terminal part of several hypothetical membrane proteins from Mycobacterium tuberculosis and Streptomyces coelicolor e.g. P96915|Y625_MYCTU|RV0625c HYPOTHETICAL 25.2 KDA PROTEIN from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 480, E(): 5e-24, (77.15% identity in 92 aa overlap). Could be a continuation of Rv2306A suggesting there may be a frameshift near nt 2577473. The C-terminal part is longer than Rv0625c and the 3'-end of gene overlaps Rv2307c, so maybe a further framehift. However, sequence has been checked and no error found. Also same sequence as strain CDC1551 and Mycobacterium bovis. Replaces original Rv2306c on other strand." /db_xref="GOA:A0A1R3Y0T8" /db_xref="InterPro:IPR015414" /db_xref="InterPro:IPR032816" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0T8" /protein_id="SIU00941.1" /translation="MWAVVGQRFVPGISDALASYTFGAFGVPLWQMVVGSFIGSAPRV FVYTALGASITNLSSPLVYSAIAVWCVTAIIGAFAARRWYRKWRARPRRRCGLAQLTT GSQQRHTSHRTPAGVVMPGSLSEHRRLRQEAPDRIEHHPPIE" CDS complement(2559857..2560702) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2330C" /product="Hydrolase, alpha/beta fold family" /note="Mb2330c, -, len: 281 aa. Equivalent to Rv2307c, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 281 aa overlap). Conserved hypothetical protein, similar to many other hypothetical proteins and BEM1/BUD5 suppressors e.g. P77538 HYPOTHETICAL PROTEIN from Escherichia coli (293 aa), FASTA scores: opt: 421, E(): 2.4e-18, (32.1% identity in 268 aa overlap) (alias AAG57647|Z3802|BAB36823|ECS3400 Putative enzyme (3.4.-) from Escherichia coli (293 aa), FASTA scores: opt: 425, E(): 1.7e-18, (32.1% identity in 268 aa overlap));P54069|BE46_SCHPO|BEM46|SPBC32H8.03|PI020 BEM46 PROTEIN from Schizosaccharomyces pombe (Fission yeast) (352 aa), FASTA scores: opt: 355, E(): 3.3e-14, (30.45% identity in 279 aa overlap); O76462|BEM46 BEM46 PROTEIN from Drosophila melanogaster (338 aa), FASTA scores: opt: 404, E(): 2.8e-17, (32.75% identity in 281 aa overlap); etc. Equivalent (but with few differences) to AAK46650|MT2364 protein from Mycobacterium tuberculosis strain CDC1551 (281 aa). Mb2330c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR022742" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V0" /protein_id="SIU00942.1" /translation="MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPS ASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALH GLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAA VAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVL VIAGGSDDIVPATLSEWLVAAAAEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTET AVLGQ" CDS complement(2561234..2561425) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2331C" /product="HYPOTHETICAL GLYCINE RICH PROTEIN" /note="Mb2331c, -, len: 63 aa. Equivalent to Rv2307A, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Hypothetical unknown protein. Mb2331c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0S7" /protein_id="SIU00943.1" /translation="MAFVDLRYPWCRGDGWISPPVVAVALGWAMRRKPFSRFNEYVGS ASNTCWFARALELRTLLIR" CDS complement(2561510..2561941) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2332C" /product="HYPOTHETICAL GLYCINE RICH PROTEIN" /note="Mb2332c, -, len: 143 aa. Equivalent to Rv2307B, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Hypothetical unknown Gly- rich protein. Equivalent to AAK46653 from Mycobacterium tuberculosis strain CDC1551 (133 aa) but longer 10 aa. Mb2332c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z9" /protein_id="SIU00944.1" /translation="MEEVPTGPPAMGHRACGGQKAAFPTRMNSGVEKMYKNSIAIAIG TLTMAVEFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGYCDGIRYPDGS YWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGGGA" CDS complement(2562034..2562216) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2333C" /product="HYPOTHETICAL PROTEIN" /note="Mb2333c, -, len: 60 aa. Equivalent to Rv2307D, len: 60 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 60 aa overlap). Hypothetical unknown protein. Mb2333c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1L6" /protein_id="SIU00945.1" /translation="MWRHLWLMQPQRRYPRGSGTTRTARRDAGVAPLYGVSRVTVLAS TTATTAPPVKSFPDLL" CDS 2562425..2563141 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2334" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2334, -, len: 238 aa. Equivalent to Rv2308, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 238 aa overlap). Conserved hypothetical protein, sharing similarity with O53464|Rv2018|MTV018.05 from Mycobacterium tuberculosis (239 aa), FASTA scores: opt: 142, E(): 0.034, (24.8% identity in 250 aa overlap). As contains possible helix-turn-helix motif at aa 16-37 (Sequence: YVYAEVDKLIGLPAGTAKRWIN) (Score 1169, +3.17 SD), may be a transcriptional regulator. Mb2334 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y108" /db_xref="InterPro:IPR007367" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR017277" /db_xref="UniProtKB/TrEMBL:A0A1R3Y108" /protein_id="SIU00946.1" /translation="MRADMSVTSMLDREVYVYAEVDKLIGLPAGTAKRWINGYERGVK DHPPILRVTPGATPWVTWGEFVETRMLAEYRDRRKVPIVRQRAAIEELRARFNLRYPL AHLRPFLSTHERDLTMGGEEIGLPDAEVTIRTGQALLGDARWLASIATPGRDEVGEAV IVELPVDKAFPEIVINPSRYSGQPTFVGRRVSPVTIAQMVDGGEEREDLAADYGLSLK QIQDAIDYTKKYRLARLVAA" tRNA complement(2563770..2563843) /locus_tag="BQ2027_METV" /product="tRNA-Met" /note="metV, len: 74 nt. Equivalent to metV, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Met, anticodon cat." CDS complement(2563849..2564304) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2335C" /product="POSSIBLE INTEGRASE (FRAGMENT)" /note="Mb2335c, -, len: 151 aa. Equivalent to Rv2309c, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Possible integrase (fragment), similar to others e.g. Q48908 INTEGRASE (FRAGMENT) from Mycobacterium paratuberculos (191 aa), FASTA scores: opt: 279, E(): 3.2e-11, (40.4% identity in 136 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv1055|MTV017.08 INTEGRASE (FRAGMENT) (78 aa) (72.85% identity in 70 aa overlap); and Rv1054|MTV017.07 INTEGRASE (FRAGMENT). COULD BELONG TO THE 'PHAGE' INTEGRASE FAMILY. Mb2335c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y177" /db_xref="InterPro:IPR002104" /db_xref="InterPro:IPR011010" /db_xref="InterPro:IPR013762" /db_xref="InterPro:IPR014417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y177" /protein_id="SIU00947.1" /translation="MTGAGIVETTTNRVRHVPVPEPVSERLRDELPTEPNALVFPSYR GGHLPIEEYRRAFDKGCKAVGIADLVPHGLRHTTASLAISAGANVKVVQRLLGHATAA MTLDRHGHLLSDDLAGVAGLLVQAIKSAAASLRYSDPDSVAVENISAAS" CDS 2565051..2565338 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2336" /product="HYPOTHETICAL PROTEIN" /note="Mb2336, -, len: 95 aa. Equivalent to Rv2309A, len: 95 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 95 aa overlap). Hypothetical unknown protein. Equivalent to AAK46663 from Mycobacterium tuberculosis strain CDC1551 (95 aa) but longer 13 aa." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0R5" /protein_id="SIU00948.1" /translation="MATSSDDITINRHPPLNCAVNRHDESRRSPLRRGLLANGLRERQ AGALFERYKSQFDSFGYIEKVRYRGSGYRVEDVYARADSGPSAGAELPVGP" CDS 2565441..2565785 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2337" /product="POSSIBLE EXCISIONASE" /note="Mb2337, -, len: 114 aa. Equivalent to Rv2310, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Possible excisionase, showing some similarity to others e.g. Q9LCU5 PUTATIVE EXCISIONASE from Arthrobacter sp. TM1 (174 aa) FASTA scores: opt: 341, E(): 6.6e-15, (48.2% identity in 110 aa overlap); O85865 PUTATIVE EXCISIONASE from Sphingomonas aromaticivorans (152 aa), FASTA scores: opt: 205, E(): 2.2e-06, (41.25% identity in 80 aa overlap); etc. Also similar to Rv3750c|O69717 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (130 aa), FASTA scores: opt: 228, E(): 6.9e-08, (43.9% identity in 82 aa overlap). Contains possible helix-turn-helix motif at aa 20-41 (Score 2181, +6.62 SD)." /db_xref="GOA:P64988" /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR010093" /db_xref="InterPro:IPR041657" /db_xref="UniProtKB/Swiss-Prot:P64988" /protein_id="SIU00949.1" /translation="MVAALHAGKAVTIAPQSMTLTTQQAADLLGVSRPTVVRLIKSGE LAAERIGNRHRLVLDDVLAYREARRQRQYDALAESAMDIDADEDPEVICEQLREARRV VAARRRTERRRA" CDS 2565890..2566414 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2338" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2338, -, len: 174 aa. Equivalent to Rv2311, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Conserved hypothetical protein, with similarity (in part) to transfer proteins homologous TRAA e.g. Q9EUN8|TRAA TRANSFER PROTEIN HOMOLOG TRAA from Corynebacterium glutamicum (1160 aa), FASTA scores: opt: 221, E(): 2.9e-07, (36.8% identity in 136 aa overlap); Q9ETQ3|TRAA CONJUGAL TRANSFER PROTEIN (TRAA-LIKE PROTEIN) from Corynebacterium equii (1367 aa), FASTA scores: opt: 188, E(): 5.5e-05, (33% identity in 106 aa overlap); P55418|TRAA_RHISN|Y4DS PROBABLE CONJUGAL TRANSFER PROTEIN from Rhizobium sp. strain NGR234 (1102 aa), FASTA scores: opt: 145, E(): 0.035, (29.08% identity in 141 aa overlap); etc." /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P64990" /protein_id="SIU00950.1" /translation="MAPTGQAVDVAVREGAGDVGYSVERENLPADDPVRNGNRWRVIA VDTEHHRIAARRLGDGARAAFSGDYLHEHITHGYAITVHASQGTTAHSTHAVLGDNTS RATLYVAMTPARESNTAYLCERTAGEGARVDLAGWDLWVSGKAEAMSDEKSASPVWCR VGARCDHRGKRSCW" CDS 2566492..2566761 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2339" /product="HYPOTHETICAL PROTEIN" /note="Mb2339, -, len: 89 aa. Equivalent to Rv2312, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Hypothetical unknown protein. Mb2339 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P64992" /protein_id="SIU00951.1" /translation="MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPIN TPGPGRTKQFMEELSQLASAPGPDIDGGIDLTDDEFQAFLQAARS" CDS complement(2567058..2567912) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2340C" /product="HYPOTHETICAL PROTEIN" /note="Mb2340c, -, len: 284 aa. Equivalent to Rv2313c, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Hypothetical unknown protein. Protein product from Mb2340c detected using SWATH mass spectrometry. Mb2340c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/Swiss-Prot:P64994" /protein_id="SIU00952.1" /translation="MPAPVSVRDDLCRLVALSPGDGRIAGLVRQVCARALSLPSLPCE VAVNEPESPAEAVVAEFAEQFSVDVSAITGEQRSLLWTHLGEDAFGAVVAMYIADFVP RVRAGLEALGVGKEYLGWVTGPISWDHNTDLSAAVFNGFLPAVARMRALDPVTSELVR LRGAAQHNCRVCKSLREVSALDAGGSETLYGEIERFDTSVLLDVRAKAALRYADALIW TPAHLAVDVAVEVRSRFSDDEAVELTFDIMRNASNKVAVSLGADAPRVQQGTERYRIG LDGQTVFG" CDS complement(2567923..2569296) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2341C" /product="TldE/PmbA family protein, Actinobacterial subgroup" /note="Mb2341c, -, len: 457 aa. Equivalent to Rv2314c, len: 457 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 457 aa overlap). Conserved hypothetical protein, highly similar to Q9RJ51|SCI8.02 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (464 aa) FASTA scores: opt: 1485, E(): 5.2e-83, (53.5% identity in 454 aa overlap); similar to AAK24788|CC2824 TldD/PmbA family protein from Caulobacter crescentus (441 aa), FASTA scores: opt: 364, E(): 8.3e-15, (29.8% identity in 460 aa overlap); and showing similarity with Q9HJZ6|TA0814 HYPOTHETICAL PROTEIN from Thermoplasma acidophilum (430 aa), FASTA scores: opt: 220, E(): 4.7e-06, (21.85% identity in 348 aa overlap). Protein product from Mb2341c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2341c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0T7" /db_xref="InterPro:IPR002510" /db_xref="InterPro:IPR036059" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0T7" /protein_id="SIU00953.1" /translation="MIEPQHAVNIVLKEAARSGRADETMVLVTEKVEATLRWAGNSMT TNGVSHSRNVTVISIVRRGDSAFVGSVVSAEVDPSVLPGLVVSSQDAARSAPEAGDAA PLLADTGEPDDWDAPVPGTGAGVFTGIAGSLSRGFRGADRLYGYAHRSVSTTFLASST GLRRRYTQPTGAIEINAKRGDASAWVGIGTPDFVEVPIDLMLERLSTRLRWAQRTVEL PAGRYQTIMPPSTVADMMIYLGWSMAGRGAQEGRTAFSAPGGGTRVGERLTELPLTLF TDPAAPGLACTPFVAVSNSSETQSVFDNGMEISQVDWIRSGVINALAYPRATAAKFDA PVAVAADNLIMTGGSADLADMIAGTERGLLLTTLWYIREVDPTTLLLTGLTRDGVYLV EDGEVSAAVNNFRFNESPLDLLRRATEAGVSEPTLPREWSDWVTRTAMPPLRIPDFHM SSVSQAQ" CDS complement(2569293..2570810) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2342C" /product="TldD family protein, Actinobacterial subgroup" /note="Mb2342c, -, len: 515 aa. Equivalent to Rv2315c, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 505 aa overlap). Conserved hypothetical protein, highly similar to Q9S273|SCI28.10 HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (435 aa), FASTA scores: opt: 1768, E():5.6e-101, (63.2% identity in 432 overlap); and similar to others e.g. AAK24787|CC2823 hypothetical protein (TldD/PmbA family) from Caulobacter crescentus (543 aa), FASTA scores: opt: 876, E():3.1e-46, (42.8% identity in 505 overlap); O58578|PH0848 HYPOTHETICAL 54.4 KDA PROTEIN from Pyrococcus horikoshii (481 aa), FASTA scores: opt: 661, E(): 4.3e-33, (29.95% identity in 484 aa overlap); Q9UZ95|PAB1547 HYPOTHETICAL 53.6 KDA PROTEIN from Pyrococcus abyssi (473 aa), FASTA scores: opt: 656, E(): 8.6e-33, (29.1% identity in 481 aa overlap); etc. Protein product from Mb2342c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2342c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y309" /db_xref="InterPro:IPR002510" /db_xref="InterPro:IPR035068" /db_xref="InterPro:IPR036059" /db_xref="UniProtKB/TrEMBL:A0A1R3Y309" /protein_id="SIU00954.1" /translation="MTPNRGIDEDFLDLPRQQLADAALSAAATAGASHADLRVHRIST EIIQLRDGELETAVISRELGLAVRVIVAGTWGFASHAELAPDVAAATARHAVHVATVL AALNTERVRLAPEPVYTDAEWVSNYRIDPFGVPASEKIAVLRDYSGRLLDADGIDHVS ASLNAVKEQTFYADTFGSSITQQRVRLLPCLDAVAVDSAAGNFESMRTLAPPTARGWE VVAGDEIWNWTDELAQLPSLLAEKVRAPSVMPGPTDLVIDPTNLWLTIHESIGHATEY DRAIGYEAAYAGTSFATPDKLGTLRYGSPVMNVTADRTAEFGLATVGYDDEGVAAQSW DLVRDGVFVGYQLDRAFAPRLGEPRSNGCSYADSPHHVPIQRMANISLQPGIEDLSTA DLIGRVDDGIYIVGDKSWSIDMQRYNFQFTGQRFFRIRGGQLYGQLRDVAYQSSTTDF WNAMEAVGGPSTWRMGGAINCGKAQPGQVAAVSHGCPSALFRGVNVLNTRTEGGR" CDS 2570844..2571716 /codon_start=1 /transl_table=11 /gene="uspA" /locus_tag="BQ2027_MB2343" /product="PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER USPA" /note="Mb2343, uspA, len: 290 aa. Equivalent to Rv2316, len: 290 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 290 aa overlap). Probable uspA, sugar-transport integral membrane protein ABC transporter (see citation below), most similar to Q9CBN8|USPA|ML1768 SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (328 aa), FASTA scores: opt: 1593, E(): 1.9e-93, (82.35% identity in 289 aa overlap); and similar to O32940|ML1426|MLCB2052.28 POSSIBLE SUGAR TRANSPORT PROTEIN (PROBABLE ABC-TRANSPORT PROTEIN, INNER MEMBRANE COMPONENT) from Mycobacterium leprae (319 aa), FASTA scores: opt: 600, E(): 9.2e-31, (34.25% identity in 295 aa overlap). Also similar to other proteins involved in transport e.g. Q9X860|SCE134.05c PUTATIVE BINDING PROTEIN DEPENDENT TRANSPORT PROTEIN from Streptomyces coelicolor (327 aa), FASTA scores: opt: 639, E(): 3.2e-33, (40.45% identity in 272 aa overlap); Q9K6N9|BH3689 SUGAR TRANSPORT SYSTEM (PERMEASE) from Bacillus halodurans (300 aa), FASTA scores: opt: 590, E(): 3.7e-30, (35.65% identity in 289 aa overlap); etc." /db_xref="GOA:A0A1R3Y1M8" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1M8" /protein_id="SIU00955.1" /translation="MRDAPRRRTALAYALLAPSLVGVVAFLLLPILVVVWLSLHRWDL LGPLRYVGLTNWRSVLTDSGFADSLVVTAVFVAIVVPAQTVLGLLAASLLARRLPGTG LFRTLYVLPWICAPLAIAVMWRWILAPTDGAISTVLGHRIEWLTDPGLALPVVSAVVV WTNVGYVSLFFLAGLMAIPQDIHNAARTDGASAWQRFWRITLPMLRPTMFFVLVTGII SAAQVFDTVYALTGGGPQGSTDLVAHRIYAEAFGAAAIGRASVMAVVLFVILVGATVV QHLYFRRRISYELT" CDS 2571703..2572527 /codon_start=1 /transl_table=11 /gene="uspB" /locus_tag="BQ2027_MB2344" /product="PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER USPB" /note="Mb2344, uspB, len: 274 aa. Equivalent to Rv2317, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Probable uspB, sugar-transport integral membrane protein ABC transporter (see citation below), most similar to Q9CBN7|USPE|ML1769 SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (274 aa), FASTA scores: opt: 1522, E(): 3.4e-89, (85.0% identity in 274 aa overlap); and similar to O32941|ML1425|MLCB2052.29 PROBABLE ABC-TRANSPORT PROTEIN, INNER MEMBRANE COMPONENT from Mycobacterium leprae (283 aa), FASTA scores: opt: 630, E(): 8.4e-33, (36.55% identity in 268 aa overlap). Also similar to other integral membrane proteins e.g. P73854|LACG|SLR1723 LACTOSE TRANSPORT SYSTEM PERMEASE PROTEIN from Synechocystis sp. strain PCC 6803 (270 aa), FASTA scores: opt: 605, E(): 3.1e-31, (36.0% identity in 264 aa overlap); Q9F3B8|SC5F1.11 PUTATIVE SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (307 aa), FASTA scores: opt: 582, E(): 9.7e-30, (34.45% identity in 264 aa overlap); etc. Also similar to O53483|Rv2039c|MTV018.26c SUGAR TRANSPORT PROTEIN from Mycobacterium tuberculosis (280 aa), FASTA scores: opt: 630, E(): 8.3e-89, (37.7% identity in 268 aa overlap)." /db_xref="GOA:A0A1R3Y122" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y122" /protein_id="SIU00956.1" /translation="MSSPSRVSNTAVYAVLTIGAVITLSPFLLGLLTSFTSAHQFATG TPLQLPRPPTLANYADIADAGFRRAAVVTALMTAVILLGQLTFSVLAAYAFARLQFRG RDALFWVYVATLMVPGTVTVVPLYLMMAQLGLRNTFWALVLPFMFGSPYAIFLLREHF RLIPDDLINAARLDGANTLDVIVHVVIPSSRPVLAALAMITVVSQWNNFMWPLVITSG HKWRVLTVATADLQSRFNDQWTLVMAATTVAIVPLIALFVTFQRHIVASIVVSGLK" CDS 2572524..2573846 /codon_start=1 /transl_table=11 /gene="uspC" /locus_tag="BQ2027_MB2345" /product="PROBABLE PERIPLASMIC SUGAR-BINDING LIPOPROTEIN USPC" /note="Mb2345, uspC, len: 440 aa. Equivalent to Rv2318, len: 440 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 440 aa overlap). Probable uspC, sugar-binding lipoprotein component of sugar transport system (see citation below), most similar to Q9CBN6|USPC|ML1770 SUGAR TRANSPORT PERIPLASMIC BINDING PROTEIN from Mycobacterium leprae (446 aa), FASTA scores: opt: 2294, E(): 8.1e-135, (74.7% identity in 446 aa overlap). Also similar to other substrate-binding proteins e.g. Q9RK89|SCF1.15 PUTATIVE SUBSTRATE BINDING PROTEIN (EXTRACELLULAR) (BINDING-PROTEIN-DEPENDENT TRANSPORT) (FRAGMENT) from Streptomyces coelicolor (221 aa), FASTA scores: opt: 377, E(): 3e-16, (32.25% identity in 217 aa overlap); Q9K6N8|BH3690 SUGAR TRANSPORT SYSTEM (SUGAR-BINDING PROTEIN) from Bacillus halodurans (420 aa), FASTA scores: opt: 227, E(): 1e-06, (25.00% identity in 452 aa overlap); etc. Also similar to O53485|Rv2041c|MTV018.28C LIPOPROTEIN COMPONENT OF SUGAR TRANSPORT SYSTEM from Mycobacterium tuberculosis (439 aa), FASTA scores: opt: 246, E(): 7e-08, (26.75% identity in 325 aa overlap). Contains a hydrophobic stretch (possible signal peptide) at N-terminal end. Mb2345 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006059" /db_xref="UniProtKB/TrEMBL:A0A1R3Y187" /protein_id="SIU00957.1" /translation="MTRPRQSTLVATALVLVAILLGVTAVLLGLSAEPRGGKIVVTVR LWDEPIAAAYRQSFAAFTRSHPDIEVRTNLVAYSTYFETLRTDVAGGSADDIFWLSNA YFAAYADSGRLMKIQTDAADWEPAVVDQFTRSGVLWGVPQLTDAGIAVFYNADLLAAA GVDPTQVDNLRWSRGDDDTLRPMLARLTVDADGRTANTPGFDARRVRQWGYNAANDPQ AIYLNYIGSAGGVFQRDGKFAFDNPGAIEAFRYLVGLINDDHVAPPASDTNDNGDFSR NQFLAGKMALFQSGTYSLAPVARDALFHWGVAMLPAGPAGRVSVTNGIAAAGNSASKH PDAVRQVLAWMGSTEGNSYVGRHGAAIPAVLSAQPVYFDYWSARGVDVTPFFAVLNGP RIAAPGGAGFAAGQQALEPYFDEMFLGRGDVTTTLRQAQAAANAATQR" CDS complement(2573854..2574732) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2346C" /product="universal stress protein family protein" /note="Mb2346c, -, len: 292 aa. Equivalent to Rv2319c, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 292 aa overlap). Hypothetical unknown protein. Mb2346c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/Swiss-Prot:P64996" /protein_id="SIU00958.1" /translation="MTIVVGYLAGKVGPSALHLAVRVARMHKTSLTVATIVRRHWPTP SLARVDAEYELWSEQLAAASAREAQRYLRRLADGIEVSYHHRAHRSVSAGLLDVVEEL EAEVLVLGSFPSGRRARVLIGSTADRLLHSSPVPVAITPRRYRCYTDRLTRLSCGYSA TSGSVDVVRRCGHLASRYGVPMRVITFAVRGRTMYPPEVGLHAEASVLEAWAAQAREL LEKLRINGVVSEDVVLQVVTGNGWAQALDAADWQDGEILALGTSPFGDVARVFLGSWS GKIIRYSPVPVLVLPG" CDS complement(2574729..2576159) /codon_start=1 /transl_table=11 /gene="rocE" /locus_tag="BQ2027_MB2347C" /product="PROBABLE CATIONIC AMINO ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN ROCE" /note="Mb2347c, rocE, len: 476 aa. Equivalent to Rv2320c, len: 476 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 476 aa overlap). Probable rocE, cationic amino acid (especially arginine and ornithine) transporter (permease), highly similar to other amino acid transporters e.g. Q9L100|SCL6.16C PUTATIVE AMINO ACID TRANSPORTER from Streptomyces coelicolor (496 aa), FASTA scores: opt: 1485, E(): 9.4e-82, (48.4% identity in 477 aa overlap); O06479|YFNA PUTATIVE AMINO ACID TRANSPORTER from Bacillus subtilis (462 aa), FASTA scores: opt: 1271, E(): 6.1e-69, (41.9% identity in 463 aa overlap); Q9PG94|XF0408 AMINO ACID TRANSPORTER from Xylella fastidiosa (509 aa), FASTA scores: opt: 1128, E(): 2.5e-60, (39.5% identity in 481 aa overlap); etc. Also some similarity with Z99108.1|BSUB0005 from Bacillus subtilis (461 aa), FASTA scores: opt: 1271, E(): 0, (41.9% identity in 463 aa overlap); and G403170 ETHANOLAMINE PERMEASE (488 aa), FASTA scores: opt: 468, E(): 1e-23, (28.1% identity in 462 aa overlap). SEEMS TO BELONG TO THE APC FAMILY. Mb2347c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0V6" /db_xref="InterPro:IPR002293" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V6" /protein_id="SIU00959.1" /translation="MPTTSMSLRELMLRRRPVSGAPVASGASGNLKRSFGTFQLTMFG VGATIGTGIFFVLAQAVPEAGPGVIVSFIIAGIAAGLAAICYAELASAVPISGSAYSY AYTTLGEAVAMVVAACLLLEYGVATAAVAVGWSGYVNKLLSNLFGFQMPHVLSAAPWD THPGWVNLPAVILIGLCALLLIRGASESARVNAIMVLIKLGVLGMFMIIAFSAYSADH LKDFVPFGVAGIGSAAGTIFFSYIGLDAVSTAGDEVKDPQKTMPRALIAALVVVTGVY VLVALAALGTQPWQDFAEQETAGLAIILDNVTHGEWASTILAAGAVVSIFTVTLVTMY GQTRILFAMGRDGLLPARFAKVNPRTMTPVHNTVIVAIFASTLAAFIPLDSLADMVSI GTLTAFSVVAVGVIVLRVREPDLPRGFKVPGYPVTPVLSVLACGYILASLHWYTWLAF SGWVAVAVIFYLMWGRHHSALNEEVP" CDS complement(2576160..2576705) /codon_start=1 /transl_table=11 /gene="rocD2" /locus_tag="BQ2027_MB2348C" /product="PROBABLE ORNITHINE AMINOTRANSFERASE (C-terminus part) ROCD2 (ORNITHINE--OXO-ACID AMINOTRANSFERASE)" /note="Mb2348c, rocD2, len: 181 aa. Equivalent to Rv2321c, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Probable rocD2, ornithine aminotransferase (EC: 2.6.1.13), highly similar to C-terminal region of other ornithine aminotransferases, e.g. Q9FC90|ROCD from Streptomyces coelicolor (407 aa), FASTA scores: opt: 628, E(): 1.2e-32, (55.35% identity in 168 aa overlap); P3802|OAT_BACSU|ROCD from Bacillus subtilis (401 aa), FASTA scores: opt: 477, E(): 4.3e-23, (42.1% identity in 178 aa overlap); BAB42057|ROCD|SA0818 from Staphylococcus aureus subsp. aureus N315 (396 aa), FASTA scores: opt: 437, E(): 1.5e-20, (41.3% identity in 170 aa overlap); etc. Contains PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Rv2322c|MTCY3G12.12 (upstream ORF) and Rv2321c|MTCY3G12.13 appear to be an ornithine aminotransferase homologue but are frameshifted - we can find no sequence error in the cosmid to account for this. Mb2348c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0W4" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR034757" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W4" /protein_id="SIU00960.1" /translation="MIADEIQSGLACTGYPFACDHGGVLPDIYLLGKTLGGGAVPLSA MVADREIFGVVHPGEHGSTFGGNPLAAAIGTPVVSMVVWGECQARSAKLGAHLHQRLA DLIGDGAVALRGLGWWADVDIERALAIGTDMSMRLADRGVLLKDTYGAALRFAPPLVI TAQEIDCAVRRFADALWEAGS" CDS complement(2576705..2577370) /codon_start=1 /transl_table=11 /gene="rocD1" /locus_tag="BQ2027_MB2349C" /product="PROBABLE ORNITHINE AMINOTRANSFERASE (N-terminus part) ROCD1 (ORNITHINE--OXO-ACID AMINOTRANSFERASE)" /note="Mb2349c, rocD1, len: 221 aa. Equivalent to Rv2322c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 221 aa overlap). Probable rocD1, ornithine aminotransferase (EC: 2.6.1.13), highly similar to N-terminal region of other ornithine aminotransferases, e.g. Q9FC90|ROCD from Streptomyces coelicolor (407 aa), FASTA scores: opt: 770, E(): 8.7e-40, (55.7% identity in 201 aa overlap); BAB42057|ROCD|SA0818 from Staphylococcus aureus subsp. aureus N315 (396 aa) FASTA scores: opt: 632, E(): 2.2e-31, (46.1% identity in 208 aa overlap); P38021|OAT_BACSU|ROCD from Bacillus subtilis (401 aa), FASTA scores: opt: 626, E(): 5.1e-31, (43.1% identity in 218 aa overlap); etc. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Rv2322c|MTCY3G12.12 and Rv2321c|MTCY3G12.13 (upstream ORF) appear to be an ornithine aminotransferase homologue but are frameshifted - we can find no sequence error in the cosmid to account for this. Mb2349c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0W6" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR034757" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00961.1" /translation="MTNLADATQATMALVERHAAHNYSPLPVVAASAEGAWIADIDGL RYLDWLAAYSAVNLGHRNPASTATAHAQVDTVTLLNRALHADRLGPLGAALAQLCGKD VVLPMNSDAEAVESGLRVARKWGADVNGLPAGRHDIILANNNFHGHTSSVVSFSSDPA AGSGVEPSTPGLRSVPFGDAAAPAQTIDDNTVADLLEPIPGQAGIIVPADDYLPAASS TTC" CDS complement(2577367..2578275) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2350C" /product="NG,NG-dimethylarginine dimethylaminohydrolase 1 (EC" /EC_number="3.5.3.18" /note="Mb2350c, -, len: 302 aa. Equivalent to Rv2323c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Conserved hypothetical protein, highly similar to others eg Q9FC91|2SCG58.22 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (288 aa), FASTA scores: opt: 561, E(): 7.3e-28, (46.95% identity in 279 aa overlap); P74535|SLL1336 HYPOTHETICAL 78.3 KDA PROTEIN from Synechocystis sp. (705 aa), FASTA scores: opt: 555, E(): 2.1e-27, (37.75% identity in 265 aa overlap); etc. Also similar to various hydrolases e.g. Q53797 BETA-HYDROXYLASE (BLEOMYCIN/PHLEOMYCIN BINDING PROTEIN, ANKYRIN HOMOLOGUE, BLEOMYCIN AND TRANSPORT PROTEIN) from Streptomyces verticillus (326 aa), FASTA scores: opt: 211, E(): 4.5e-06, (26.75% identity in 303 aa overlap); Q9X7M4|DDAH_STRCO|SC5F2A.01c NG,NG-dimethylarginine dimethylaminohydrolase (EC 3.5.3.18) (Dimethylargininase) (Dimethylarginine dimethylaminohydrolase) (258 aa), FASTA scores: opt: 209, E(): 4.9e-06, (27.15% identity in 243 aa overlap); G434715 beta-hydroxylase (bleomicin/phleomycin binding protein) from Streptomyces verticillus (326 aa), FASTA scores: opt: 211, E(): 4.5e-06, (26.75% identity in 303 aa overlap); etc. Mb2350c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00962.1" /translation="MENTQRPSFDCEIRAKYRWFMTDSYVAAARLGSPARRTPRTRRY AMTPPAFFAVAYAINPWMDVTAPVDVQVAQAQWEHLHQTYLRLGHSVDLIEPISGLPD MVYTANGGFITHDIAVVARFRFPERAGESRAYASWMSSVGYRPVTTRHVNEGQGDLLM VGERVLAGYGFRTDQRAHAEIAAVLGLPVVSLELVDPRFYHLDTALAVLDDHTIAYYP PAFSTAAQEQLSALFPDAIVVGSADAFVFGLNAVSDGLNVVLPVAAMGFAAQLRAAGF EPVGVDLSELLKGGGSVKCCTLEIHP" CDS 2578340..2578786 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2351" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ASNC-FAMILY)" /note="Mb2351, -, len: 148 aa. Equivalent to Rv2324, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 148 aa overlap). Probable transcriptional regulatory protein, asnC-family, similar to other PUTATIVE ASNC-FAMILY REGULATORY PROTEINS e.g. Q9L101|SCL6.15C from Streptomyces coelicolor (150 aa) FASTA scores: opt: 466, E(): 2.4e-24, (52.8% identity in 142 aa overlap); Q9RKY4|SC6D7.14 PUTATIVE ASNC-FAMILY TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (165 aa), FASTA scores: opt: 266, E(): 5.5e-11, (32.4% identity in 145 aa overlap); Q9ZEP1|LRPA|SCE94.12c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (150 aa), FASTA scores: opt: 249, E(): 6.9e-10, (33.35% identity in 147 aa overlap); etc. Also similar to P96896|Rv3291c|MTCY71.31c from Mycobacterium tuberculosis (150 aa), FASTA scores: opt: 261, E(): 1.1e-10, (36.4% identity in 143 aa overlap). Mb2351 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0U9" /db_xref="InterPro:IPR000485" /db_xref="InterPro:IPR011008" /db_xref="InterPro:IPR019887" /db_xref="InterPro:IPR019888" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U9" /protein_id="SIU00963.1" /translation="MDRLDDTDERILAELAEHARATFAEIGHKVSLSAPAVKRRVDRM LESGVIKGFTTVVDRNALGWNTEAYVQIFCHGRIAPDQLRAAWVNIPEVVSAATVTGT SDAILHVLAHDMRHLEAALERIRSSADVERSESTVVLSNLIDRMPP" CDS complement(2579015..2579863) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2352C" /product="Transmembrane component of energizing module of ECF transporters in Mycobacteria" /note="Mb2352c, -, len: 282 aa. Equivalent to Rv2325c, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 282 aa overlap). Conserved hypothetical protein, equivalent to O32970|MLCB22.37c|ML0849 hypothetical protein from Mycobacterium leprae (283 aa), FASTA scores: opt: 1405, E(): 1.8e-78, (77.7% identity in 282 aa overlap). Also some similarity to other proteins e.g. Q9Z9J1|YBAF|BH0166 YBAF PROTEIN (BH0166 PROTEIN) (HYPOTHETICAL PROTEIN) from Bacillus halodurans (265 aa), FASTA scores: opt: 288, E(): 2.8e-10, (25.8% identity in 264 aa overlap); P70972|YBAF YBAF PROTEIN (HYPOTHETICAL PROTEIN) from Bacillus subtilis (265 aa), FASTA scores: opt: 259, E(): 1.5e-08, (25.45% identity in 224 aa overlap); AAK34821|SPY2193|Q99X13 Conserved hypothetical protein from Streptococcus pyogenes (266 aa), FASTA scores: opt: 232, E(): 6.5e-07, (25.1% identity in 267 aa overlap); etc. Protein product from Mb2352c detected using SWATH mass spectrometry. Mb2352c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64998" /db_xref="InterPro:IPR003339" /db_xref="UniProtKB/Swiss-Prot:P64998" /protein_id="SIU00964.1" /translation="MTTTSAPARNGTRRPSRPIVLLIPVPGSSVIHDLWAGTKLLVVF GISVLLTFYPGWVTIGMMAALVLAAARIAHIPRGALPSVPRWLWIVLAIGFLTAALAG GTPVVAVGGVQLGLGGALHFLRITALSVVLLALGAMVSWTTNVAEISPAVATLGRPFR VLRIPVDEWAVALALALRAFPMLIDEFQVLYAARRLRPKRMPPSRKARRQRHARELID LLAAAITVTLRRADEMGDAITARGGTGQLSAHPGRPKLADWVTLAITAMASGTAVAIE SLILHS" CDS complement(2579860..2581953) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2353C" /product="possible transmembrane atp-binding protein abc transporter" /note="Mb2353c, -, len: 697 aa. Equivalent to Rv2326c, len: 697 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 697 aa overlap). Putative transmembrane ATP-binding protein ABC transporter (see citation below). Equivalent to Q9CCF9|ML0848 ABC TRANSPORTER from Mycobacterium leprae (724 aa), FASTA scores: opt: 3482, E(): 2.8e-182, (76.9% identity in 697 aa overlap) and also to O32971|MLCB22.38c ABC-TYPE TRANSPORTER from Mycobacterium leprae (726 aa), FASTA scores: opt: 3482, E(): 2.8e-182, (76.9% identity in 697 aa overlap). Similar in part to other ABC TRANSPORTERS e.g. Q9WY65|TM0222 from Thermotoga maritima (266 aa), FASTA scores: opt: 407, E(): 4.2e-15, (38.0% identity in 213 aa overlap); etc. Contains 2 X PS00017 ATP/GTP-binding site motif A (P-loop); and 2 x PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb2353c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2353c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63400" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63400" /protein_id="SIU00965.1" /translation="MCCAVCGPEPGRIGEVTPLGPCPAQHRGGPLRPSELAQASVMAA LCAVTAIISVVVPFAAGLALLGTVPTGLLAYRYRLRVLAAATVAAGMIAFLIAGLGGF MGVVHSAYIGGLTGIVKRRGRGTPTVVVSSLIGGFVFGAAMVGMLAAMVRLRHLIFKV MTANVDGIAATLARMHMQGAAADVKRYFAEGLQYWPWVLLGYFNIGIMIVSLIGWWAL SRLLERMRGIPDVHKLDPPPGDDVDALIGPVPVRLDKVRFRYPRAGQDALREVSLDVR AGEHLAIIGANGSGKTTLMLILAGRAPTSGTVDRPGTVGLGKLGGTAVVLQHPESQVL GTRVADDVVWGLPLGTTADVGRLLSEVGLEALAERDTGSLSGGELQRLALAAALAREP AMLIADEVTTMVDQQGRDALLAVLSGLTQRHRTALVHITHYDNEADSADRTLSLSDSP DNTDMVHTAAMPAPVIGVDQPQHAPALELVGVGHEYASGTPWAKTALRDINFVVEQGD GVLIHGGNGSGKSTLAWIMAGLTIPTTGACLLDGRPTHEQVGAVALSFQAARLQLMRS RVDLEVASAAGFSASEQDRVAAALTVVGLDPALGARRIDQLSGGQMRRVVLAGLLARA PRALILDEPLAGLDAASQRGLLRLLEDLRRARGLTVVVVSHDFAGMEELCPRTLHLRD GVLESAAASEAGGMS" CDS 2581994..2582485 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2354" /product="Transcriptional regulator Rv2327, MarR family" /note="Mb2354, -, len: 163 aa. Equivalent to Rv2327, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, similar to Z80775|MTCY21D4.05c|Rv0042c from Mycobacterium tuberculosis (208 aa), FASTA scores: opt: 242, E(): 5e-08, (43.0% identity in 107 aa overlap). Also slight similarity to putative transcriptional regulatory proteins belonging to the MARR-FAMILY e.g. Q9CCY2/ML2696 from Mycobacterium leprae (243 aa), FASTA scores: opt: 245, E(): 3.7e-08, (35.35% identity in 150 aa overlap); Q9L135|SC6D11.20 from Streptomyces coelicolor (155 aa), FASTA scores: opt: 242, E(): 3.9e-08, (34.75% identity in 141 aa overlap); etc. Protein product from Mb2354 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2354 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y131" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y131" /protein_id="SIU00966.1" /translation="MSPSPAAANRSEVGGPLPGLGADLLAVVARLNRLATQRIQMPLP AAQARLLATIEAQGEARIGDLAAVDHCSQPTMTTQVRRLEDAGLVTRTADPGDARAVR IRITPEGIRTLTAVRADRAAAIEPQLALLPPADRRVLADAVDVLRRLLDHAATTPGRA TRQ" CDS 2582737..2583885 /codon_start=1 /transl_table=11 /gene="PE23" /locus_tag="BQ2027_MB2355" /product="pe family protein pe23" /note="Mb2355, PE23, len: 382 aa. Equivalent to Rv2328, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 382 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to others e.g. Q9L8K5|MAG24-1 PE-PGRS HOMOLOG from Mycobacterium marinum (638 aa), FASTA scores: opt: 495, E(): 6.6e-18, (34.65% identity in 401 aa overlap); etc. Mb2355 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A685" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A685" /protein_id="SIU00967.1" /translation="MQFLSVIPEQVESAAQDLAGIRSALSASYAAAAGPTTAVVSAAE DEVSTAIASIFGAYGRQCQVLSAQASAFHDEFVNLLKTGATAYRNTEFANAQSNVLNA VNAPARSLLGHPSAAESVQNSAPTLGGGHSTVTAGLAAQAGRAVATVEQQAAAAVAPL PSAGAGLAQVVNGVVTAGQGSAAKLATALQSAAPWLAKSGGEFIVAGQSALTGVALLQ PAVVGVVQAGGTFLTAGTSAATGLGLLTLAGVEFSQGVGNLALASGTAATGLGLLGSA GVQLFSPAFLLAVPTALGGVGSLAIAVVQLVQGVQHLSLVVPNVVAGIAALQTAGAQF AQGVNHTMLAAQLGAPGIAVLQTAGGHFAQGIGHLTTAGNAAVTVLIS" CDS complement(2583920..2585467) /codon_start=1 /transl_table=11 /gene="narK1" /locus_tag="BQ2027_MB2356C" /product="PROBABLE NITRITE EXTRUSION PROTEIN 1 NARK1 (NITRITE FACILITATOR 1)" /note="Mb2356c, narK1, len: 515 aa. Equivalent to Rv2329c, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 515 aa overlap). Probable narK1, nitrite extrusion protein, possibly member of major facilitator superfamily (MFS). Equivalent to O32974|MLCB22.41c|NARK|ML0844 PUTATIVE NITRITE EXTRUSION PROTEIN from Mycobacterium leprae (517 aa), FASTA scores: opt: 2224, E(): 1.9e-129, (69.3% identity in 488 aa overlap). Also highly similar to others e.g. P94933 NITRITE EXTRUSION PROTEIN from Mycobacterium fortuitum (471 aa), FASTA scores: opt: 1969, E(): 8.6e-114, (62.1% identity in 459 aa overlap); P37758|NARU_ECOLI NITRITE EXTRUSION PROTEIN 2 from Escherichia coli strain K12 (462 aa), FASTA scores: opt: 792, E(): 2.3e-41, (36.95% identity in 476 aa overlap); P10903|NARK_ECOLI nitrite extrusion protein (nitrite facilitator 1) from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 784, E(): 7e-41, (35.3% identity in 468 aa overlap); etc. Also similar to RV0261c|Z86089|MTCY6A4_5 from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 2000, E(): 1.1e-115, (62.6% identity in 470 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS." /db_xref="GOA:A0A1R3Y0T4" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0T4" /protein_id="SIU00968.1" /translation="MEQHTLLQREESPRSPAAPSLRRLGGSRHITHWDPEDLGAWEAG NKGIARRNLLWSVVTVHLGYSVWTLWPVLELLMPQDVYGFSTSDKFLLGTIATLFGAF LRMPYALASAIFGGRNWATFSAIVLLIPAIGTTVLLTHPGLPLWPYLVCAALTGLGGG NFASSMSNANAFYPHRLKGSALGIAGGVGNLGVPAIQLVGLLAIATVGERKPYLVCAL YVVLVAIAVIGVSLFMNNVEQHRVQVNRLRPIVSAVLSTRDTWLLSLLYLGTFGSFIG FSFVFGQVLQTNFLACGQSPARATLHAVELAFVGPLLAAVARIYGGRLADRVGGSRLT LIVFVAMTLAAGLLISASTLEGRHVGQHRGATMVGYFVCFVALFVLSGLGNGSVYKMI PTIFEACSRSLDLSEAERRDWSRIISGVVIGFVAAFGALGGVGINMALRESYLSTGSG TDAFWIFMMCYAAAAVLTWKVYDRRTVTDMGMLQAALVRQPASTPAELIGPRTQSDRF SGCSISA" CDS complement(2585701..2586228) /codon_start=1 /transl_table=11 /gene="lppP" /locus_tag="BQ2027_MB2357C" /product="PROBABLE LIPOPROTEIN LPPP" /note="Mb2357c, lppP, len: 175 aa. Equivalent to Rv2330c, len: 175 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 175 aa overlap). Probable lppP, lipoprotein. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb2357c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:P65303" /db_xref="InterPro:IPR025971" /db_xref="UniProtKB/Swiss-Prot:P65303" /protein_id="SIU00969.1" /translation="MRRQRSAVPILALLALLALLALIVGLGASGCAWKPPTTRPSPPN TCKDSDGPTADTVRQAIAAVPIVVPGSKWVEITRGHTRNCRLHWVQIIPTIASQSTPQ QLLFFDRNIPLGSPTRNPKPYITVLPAGDDTVTVQYQWQIGSDQECCPTGIGTVRFHI GSDGKLEALGSIPHQ" CDS 2586303..2586689 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2358" /product="Assimilatory nitrate reductase large subunit (EC" /EC_number="1.7.99.4" /note="Mb2358, -, len: 128 aa. Equivalent to Rv2331, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Hypothetical unknown protein; shortened version of MTCY3G12.03c to eliminate overlap with MTCY3G12.04. Mb2358 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P0A5G4" /protein_id="SIU00970.1" /translation="MPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIG QPNTMGDEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLH SISFIFKGRPGEVYLTVTGYNFRGVP" CDS 2586746..2587084 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2359" /product="HYPOTHETICAL PROTEIN" /note="Mb2359, -, len: 112 aa. Equivalent to Rv2331A, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 112 aa overlap). Hypothetical unknown protein. Mb2359 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X3" /protein_id="SIU00971.1" /translation="MKGHLATFGHPALPTYRGSWLSREPGSPYRLPAGAGRDRGDACR RLPRRTGSGTLLRPGQRCTFAANADPMAKGVDRALCEIVAERRQLDLDLAKAQVRSAL ANQRYHRDVH" CDS 2587114..2588760 /codon_start=1 /transl_table=11 /gene="mez" /locus_tag="BQ2027_MB2360" /product="PROBABLE [NAD] DEPENDENT MALATE OXIDOREDUCTASE MEZ (MALIC ENZYME) (NAD-MALIC ENZYME) (MALATE DEHYDROGENASE (OXALOACETATE DECARBOXYLATING)) (PYRUVIC-MALIC CARBOXYLASE) (NAD-ME)" /note="Mb2360, mez, len: 548 aa. Equivalent to Rv2332, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 548 aa overlap). Probable mez, malate oxidoreductase [NAD] dependant (EC 1.1.1.38) (malic enzyme), highly similar to others e.g. O34389|MALS PUTATIVE MALOLACTIC ENZYME [INCLUDES: MALIC ENZYME (EC 1.1.1.-); L-LACTATE DEHYDROGENASE (EC 1.1.1.27)] from Bacillus subtilis (566 aa), FASTA scores: opt: 1927, E(): 5.5e-111, (52.9% identity in 539 aa overlap); P45868|MAO2_BACSU|YWKA PROBABLE NAD-DEPENDENT MALIC ENZYME from Bacillus subtilis (582 aa), FASTA scores: opt: 1849, E(): 3.6e-106, (50.45% identity in 543 aa overlap); Q48796|MLES_OENOE MALOLACTIC ENZYME from Oenococcus oeni (541 aa), FASTA scores: opt: 1540, E(): 3.6e-87, (44.2% identity in 536 aa overlap); etc. BELONGS TO THE MALIC ENZYMES FAMILY. N-terminus shortened since first submission (previously 652 aa). Protein product from Mb2360 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y0Y2" /db_xref="InterPro:IPR001891" /db_xref="InterPro:IPR012301" /db_xref="InterPro:IPR012302" /db_xref="InterPro:IPR015884" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR037062" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Y2" /protein_id="SIU00972.1" /translation="MSDARVPRIPAALSAPSLNRGVGFTHAQRRRLGLTGRLPSAVLT LDQQAERVWHQLQSLATDLGRNLLLEQLHYRHEVLYFKVLADHLPELMPVVYTPTVGE AIQRFSDEYRGQRGLFLSIDEPDEIEEAFNTLGLGPEDVDLIVCTDAEAILGIGDWGV GGIQIAVGKLALYTAGGGVDPRRCLAVSLDVGTDNEQLLADPFYLGNRHARRRGREYD EFVSRYIETAQRLFPRAILHFEDFGPANARKILDTYGTDYCVFNDDMQGTGAVVLAAV YSGLKVTGIPLRDQTIVVFGAGTAGMGIADQIRDAMVADGATLEQAVSQIWPLDRPGL LFDDMDDLRDFQVPYAKNRHQLGVAVGDRVGLSDAIKIASPTILLGCSTVYGAFTKEV VEAMTASCKHPMIFPLSNPTSRMEAIPADVLAWSNGRALLATGSPVAPVEFDETTYVI GQANNVLAFPGIGLGVIVAGARLITRRMLHAAAKAIAHQANPTNPGDSLLPDVQNLRA ISTTVAEAVYRAAVQDGVASRTHDDVRQAIVDTMWLPAYD" CDS complement(2588757..2590328) /codon_start=1 /transl_table=11 /gene="stp" /locus_tag="BQ2027_MB2361C" /product="integral membrane drug efflux protein stp" /note="Mb2361c, -, len: 523 aa. Equivalent to Rv2333c, len: 537 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 508 aa overlap). Probable conserved integral membrane transport protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug, highly similar to many e.g. Q9RL22|C5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 1031, E(): 4e-55, (37.4% identity in 412 aa overlap); Q9L0L9|SCD82.12 PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (490 aa), FASTA scores: opt: 883, E(): 3.8e-46, (36.35% identity in 407 aa overlap); Q9ZBW5|SC4B5.03c PUTATIVE INTEGRAL MEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (504 aa), FASTA scores: opt: 899, E(): 4.1e-47, (37.4% identity in 415 aa overlap); P39886|TCMA_STRGA tetracenomycin C resistance and export protein from Streptomyces glaucescens (538 aa), FASTA scores: opt: 839, E(): 1.9e-43, (32.3% identity in 489 aa overlap); etc. Also highly similar to Rv2459|O53186|MTV008.15 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (508 aa), FASTA scores: opt: 1385, E(): 1.5e-76, (44.05% identity in 504 aa overlap); and AAK46834|MT2534 DRUG TRANSPORTER from Mycobacterium tuberculosis strain CDC1551 (523 aa), FASTA scores: opt: 1385, E(): 1.5e-76, (44.4% identity in 504 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-g) leads to a slightly longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (523 aa versus 537 aa). Mb2361c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0V9" /db_xref="InterPro:IPR004638" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0V9" /protein_id="SIU00973.1" /translation="MNRTQLLTLIATGLGLFMIFLDALIVNVALPDIQRSFAVGEDGL QWVVASYSLGMAVFIMSAATLADLYGRRRWYLIGVSLFTLGSIACGLAPSIAVLTTAR GAQGLGAAAVSVTSLALVSAAFPEAKEKARAIGIWTAIASIGTTTGPTLGGLLVDQWG WRSIFYVNLPMGALVLFLTLCYVEESCNERARRFDLSGQLLFIVAVGALVYAVIEGPQ IGWTSVQTIVMLWTAAVGCALFVWLERRSSNPMMDLTLFRDTSYALAIATICTVFFAV YGMLLLTTQFLQNVRGYTPSVTGLMILPFSAAVAIVSPLVGHLVGRIGARVPILAGLC MLMLGLLMLIFSEHRSSALVLVGLGLCGSGVALCLTPITTVAMTAVPAERAGMASGIM SAQRAIGSTIGFAVLGSVLAAWLSATLEPHLERAVPDPVQRHVLAEIIIDSANPRAHV GGIVPRRHIEHRDPVAIAEEDFIEGIRVALLVATATLAVVFLAGWRWFPRDVQTAGSD AKREAAYSDDRRVRG" CDS 2590803..2591735 /codon_start=1 /transl_table=11 /gene="cysK1" /locus_tag="BQ2027_MB2362" /product="cysteine synthase a cysk1 (o-acetylserine sulfhydrylase a) (o-acetylserine (thiol)-lyase a) (csase a)" /note="Mb2362, cysK1, len: 310 aa. Equivalent to Rv2334, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Probable cysK1, cysteine synthase A (EC 4.2.99.8), equivalent to O32978|CYSK_MYCLE|ML0839|MLCB22.47 CYSTEINE SYNTHASE A from Mycobacterium leprae (310 aa), FASTA scores: opt: 1756, E(): 8.6e-96, (85.8% identity in 310 aa overlap). Also highly similar to other CYSTEINE SYNTHASES e.g. Q9JQL6|CYSK|NMA0974|NMB0763 PUTATIVE CYSTEINE SYNTHASE from Neisseria meningitidis (serogroup A and B) (310 aa), FASTA scores: opt: 1368, E(): 4.6e-73, (66.45% identity in 310 aa overlap); P73410|CYSK_SYNY3|SLR1842 from Synechocystis sp (312 aa), FASTA scores: opt: 1310, E(): 1.2e-69, (64.65% identity in 311 aa overlap); Q43725|CYSM_ARATH|OASC|ACS1|AT3G59760|F24G16.30 CYSTEINE SYNTHASE (MITOCHONDRIAL PRECURSOR) from Arabidopsis thaliana (Mouse-ear cress) (424 aa), FASTA scores: opt: 1253, E(): 3.2e-66, (59.2% identity in 309 aa overlap) (has its N-terminus longer 104 aa); etc. Contains PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. Note that previously known as cysK. Protein product from Mb2362 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2362 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A535" /db_xref="InterPro:IPR001216" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR005856" /db_xref="InterPro:IPR005859" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/Swiss-Prot:P0A535" /protein_id="SIU00974.1" /translation="MSIAEDITQLIGRTPLVRLRRVTDGAVADIVAKLEFFNPANSVK DRIGVAMLQAAEQAGLIKPDTIILEPTSGNTGIALAMVCAARGYRCVLTMPETMSLER RMLLRAYGAELILTPGADGMSGAIAKAEELAKTDQRYFVPQQFENPANPAIHRVTTAE EVWRDTDGKVDIVVAGVGTGGTITGVAQVIKERKPSARFVAVEPAASPVLSGGQKGPH PIQGIGAGFVPPVLDQDLVDEIITVGNEDALNVARRLAREEGLLVGISSGAATVAALQ VARRPENAGKLIVVVLPDFGERYLSTPLFADVAD" CDS 2591739..2592428 /codon_start=1 /transl_table=11 /gene="cysE" /locus_tag="BQ2027_MB2363" /product="PROBABLE SERINE ACETYLTRANSFERASE CYSE (SAT)" /note="Mb2363, -, len: 229 aa. Equivalent to Rv2335, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Probable cysE, serine acetyltransferase (EC 2.3.1.30), equivalent to O32979|CYSE|ML0838 SERINE ACETYLTRANSFERASE from Mycobacterium leprae (227 aa), FASTA scores: opt: 1152, E(): 9.6e-62, (76.4% identity in 229 aa overlap). Also highly similar, except in C-terminal part, to others e.g. Q9HXI6|CYSE|PA3816 O-ACETYLSERINE SYNTHASE from Pseudomonas aeruginosa (258 aa), FASTA scores: opt: 737, E(): 6e-37, (61.3% identity in 168 aa overlap); P23145|NIFP_AZOCH PROBABLE SERINE ACETYLTRANSFERASE from Azotobacter chroococcum mcd 1 (269 aa), FASTA scores: opt: 718, E(): 8.4e-36, (55.45% identity in 220 aa overlap); Q06750|CYSE_BACSU SERINE ACETYLTRANSFERASE from Bacillus subtilis (217 aa), FASTA scores: opt: 640, E(): 3.1e-31, (48.0% identity in 200 aa overlap); etc. Contains PS00101 Bacterial hexapeptide-repeat containing-transferases signature. BELONGS TO THE CYSE/LACA/LPXA/NODL FAMILY OF ACETYLTRANSFERASES. COMPOSED OF MULTIPLE REPEATS OF [LIV]-G-X(4). Protein product from Mb2363 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2363 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1P8" /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR005881" /db_xref="InterPro:IPR011004" /db_xref="InterPro:IPR018357" /db_xref="InterPro:IPR042122" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P8" /protein_id="SIU00975.1" /translation="MLTAMRGDIRAARERDPAAPTALEVIFCYPGVHAVWGHRLAHWL WQRGARLLARAAAEFTRILTGVDIHPGAVIGARVFIDHATGVVIGETAEVGDDVTIYH GVTLGGSGMVGGKRHPTVGDRVIIGAGAKVLGPIKIGEDSRIGANAVVVKPVPPSAVV VGVPGQVIGQSQPSPGGPFDWRLPDLVGASLDSLLTRVARLEALGGGPQAAGVIRPPE AGIWHGEDFSI" CDS 2592844..2593812 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2364" /product="HYPOTHETICAL PROTEIN" /note="Mb2364, -, len: 322 aa. Equivalent to Rv2336, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 322 aa overlap). Hypothetical unknown protein (see second citation below). Protein product from Mb2364 detected using SWATH mass spectrometry. Mb2364 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y142" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00976.1" /translation="MDVPHEQPALSSSKSNRFTSQRQTTGVGTTTVERLEPRLSPASR HITEAKAFGTECHVSSFTREQDPDRAVRVEQIHGEAYVAAGHVYESALDELGRLDNSN AEFILDKARGSTRETEVIYLHAVPAEPLSGSQGEGGLRIVGISAVGSIDDLSAFKAAK PSMGLAHQRKLYDAIEDLGHGGVKEIAALSVTADAPPTVSYSLIREVLRLYHRTGEKL IITFAMPAYAKMVMNFGRFAMPQVGEPFYAHRNNDPRTSNDLLLVPSIVEPSNFLENI SRGVVTADDGPTARRRFATLCYMTDGLDDYFMPLTRQVLSEGIQDI" CDS complement(2593876..2594994) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2365C" /product="HYPOTHETICAL PROTEIN" /note="Mb2365c, -, len: 372 aa. Equivalent to Rv2337c, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 372 aa overlap). Hypothetical unknown protein, sharing some similarity with Q9RI33|SCJ12.27c HYPOTHETICAL 37.2 KDA PROTEIN from Streptomyces coelicolor (335 aa), BLAST scores: 134 AND 46, (28% AND 33% identity, 52% AND 44% positive); FASTA scores: opt: 176, E(): 0.00042, (31.95% identity in 355 aa overlap). Mb2365c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1A9" /db_xref="InterPro:IPR000415" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1A9" /protein_id="SIU00977.1" /translation="MRAGRWGPGMTGLDPAEFLSLVEAAALAPSADNRREVQLEHAGR RVRLWGDQTWRSAPEHRRIMSLVAIGAAVENVKLRAGRLGFETKVCWFPDSGNPGLVA EIDVDRLPQTRVDPIEVAIERRRTNRRVRFRGPPLSQGELGALSAEATGIDGIQLHWF DSPETRKQILRLVRLAETERFRSRELHEELFSAVRFDIGWTASSDDGLPPGSLEVEAW MRPMFRGLRHWRVLRLLRTVGMHHALGLRAAYLPCRLAPHVGALTTSLDLASGALTAG AVFERIWLRTTLLGAELQPFAASAVLSLPACEWVAPHVRAALVGGWNLLAPGHWPMMV FRIGHARAPSVRTMRQSVEAYCYAPAERSGSDSESRFA" CDS complement(2595114..2596070) /codon_start=1 /transl_table=11 /gene="moeW" /locus_tag="BQ2027_MB2366C" /product="POSSIBLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEW" /note="Mb2366c, moeW, len: 318 aa. Equivalent to Rv2338c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Possible moeW, molybdoptenum biosynthesis protein, showing some similarity to several molybdopterin biosynthesis proteins e.g. O27613|MTH1571 MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEB HOMOLOG from Methanobacterium thermoautotrophicum (251 aa), FASTA scores: opt: 309, E(): 4.7e-14; (30.7% identity in 254 aa overlap); Q9KPQ5|VC2311 HESA/MOEB/THIF FAMILY PROTEIN from Vibrio cholerae (273 aa), FASTA scores: opt: 255, E(): 4e-09, (36.25% identity in 149 aa overlap); Q9PD34|XF1545 MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Xylella fastidiosa (276 aa), FASTA scores: opt: 233,E(): 1e-07, (33.6% identity in 128 aa overlap); etc. SEEMS TO BELONG TO THE HESA/MOEB/THIF FAMILY. Protein product from Mb2366c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2366c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0U6" /db_xref="InterPro:IPR000594" /db_xref="InterPro:IPR035985" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0U6" /protein_id="SIU00978.1" /translation="MRAGADAPDSGRVKESAPWSYDEAFCRNLGLISPTEQQRLRNSR VAIAGMGGVGGIDMVALARMGIGKFTIADPDVFEIRNSNRQYGAMRSTNGQAKAEVMR NIVHDINPEAEIRAFCEPIGKENAATFLEGADVLVDGIDAFEIDLRRLLYREAQQRGI YALGAGPLGFSTAWVVFDPKGMTFDRYFDLSDAMNTVDKFVAFIAGIAPSATHRRSID LSYVDIENRTGPSVGLACHLASGVVAAEVLKILLGHGRVYAAPYFHQFDAYRSIYVRK RLRCGNRHPLQRVKRRLLARYINRRSAGVIPGLRYHRTEPSY" CDS 2596701..2599181 /codon_start=1 /transl_table=11 /gene="mmpL9a" /locus_tag="BQ2027_MB2367" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL9A [FIRST PART]" /note="Mb2367, mmpL9a, len: 826 aa. Equivalent to 5' end of Rv2339, len: 962 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 826 aa overlap). Probable mmpL9, conserved transmembrane transport protein (see citation below), with strong similarity to other Mycobacterial proteins e.g. P54881|YV34_MYCLE|MML4_MYCLE hypothetical 105.2 kd protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3799, E(): 0, (59.3% identity in 937 aa overlap); G699237|U1740AB from Mycobacterium leprae; and MTCY20G9.34; MTCY48.08c; MTCY19G5.06 from M. tuberculosis. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mmpL9 exists as a single gene. In Mycobacterium bovis, truncation due to a single base transition (g-a) splits mmpL9 into 2 parts, mmpL9a and mmpL9b, resulting in a shorter mmpL9a product. Mb2367 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y0X6" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X6" /protein_id="SIU00979.1" /translation="MVPGEVHMSDTPSGPHPIIPRTIRLAAIPILLCWLGFTVFVSVV VPPLEAIGETRAVAVAPDDAQSMRAMRRAGKVFNEFDSNSIAMVVLESDQPLGEKAHR YYDHLVDTLVLDQSHIQHIQDFWRDPLTAAGAVSADGKAAYVQLYLAGNMGEALANES VEAVRKIVANSTPPEGIRTYVTGPAALFADQIAAGDRSMKLITGLTFAVITVLLLLVY RSIATTLLILPMVFIGLGATRGTIAFLGYHGMVGLSTFVVNILTALAIAAGTDYAIFL VGRYQEARHIGQNREASFYTMYRGTANVILGSGLTIAGATYCLSFARLTLFHTMGPPL AIGMLVSVAAALTLAPAIIAIAGRFGLLDPKRRLKTRGWRRVGTAVVRWPGPILATSV ALALVGLLALPGYRPGYNDRYYLRAGTPVNRGYAAADRHFGPARMNPEMLLVESDQDM RNPAGMLVIDKIAKEVLHVSGVERVQAITRPQGVPLEHASIPFQISMMGATQTMSLPY MRERMADMLTMSDEMLVAINSMEQMLDLVQQLNDVTHEMAATTREIKATTSELRDHLA DIDDFVRPLRSYFYWEHHCFDIPLCSATRSLFDTLDGVDTLTDQLRALTDDMNKMEAL TPQFLALLPPMITTMKTMRTMMLTMRSTISGVQDQMADMQDHATAMGQAFDTAKSGDS FYLPPEAFDNAEFQQGMKLFLSPNGKAVRFVISHESDPASTEGIDRIEAIRAATKDAI KATPLQGAKIYIGGTAATYQDIRDGTKYDILIVGIAAVCLVFIVMLMITQSLIASLVI VGTVLLSLGTAFGLSVLIWQHFVGLQVH" CDS 2599194..2599589 /codon_start=1 /transl_table=11 /gene="mmpL9b" /locus_tag="BQ2027_MB2368" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL9B [SECOND PART]" /note="Mb2368, mmpL9b, len: 131 aa. Equivalent to 3' end of Rv2339, len: 962 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Probable mmpL9, conserved transmembrane transport protein (see citation below), with strong similarity to other Mycobacterial proteins e.g. P54881|YV34_MYCLE|MML4_MYCLE hypothetical 105.2 kd protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3799, E(): 0, (59.3% identity in 937 aa overlap); G699237|U1740AB from Mycobacterium leprae; and MTCY20G9.34; MTCY48.08c; MTCY19G5.06 from M. tuberculosis. BELONGS TO THE MMPL FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mmpL9 exists as a single gene. In Mycobacterium bovis, truncation due to a single base transition (g-a) splits mmpL9 into 2 parts, mmpL9a and mmpL9b. Protein product from Mb2368 detected using SWATH mass spectrometry. Mb2368 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Y1" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Y1" /protein_id="SIU00980.1" /translation="MSVIVLLAVGSDYNLLLVSRFKEEVGAGLKTGIIRAMAGTGAVV TSAGLVFAFTMASMAVSELRVIGQVGTTIGLGLLFDTLVVRSFMTPSIAALLGRWFWW PNMIHSRPTVPEAHTRQGARRIQPHLHRG" CDS complement(2599675..2600916) /codon_start=1 /transl_table=11 /gene="PE_PGRS39" /locus_tag="BQ2027_MB2369C" /product="pe-pgrs family protein pe_pgrs39" /note="Mb2369c, PE_PGRS39, len: 413 aa. Equivalent to Rv2340c, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 413 aa overlap). Member of the Mycobacterium tuberculosis PE_family, PGRS subfamily of gly-rich proteins, similar to others eg YI18_MYCTU|Q50615|Rv1818c|MTCY1A11.25 PE-PGRS FAMILY PROTEIN from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 710, E(): 1.4e-22, (41.0% identity in 368 aa overlap); O53884|Rv0872v|MTV043.65c PGRS-FAMILY PROTEIN from Mycobacterium tuberculosis (606 aa), FASTA scores: opt: 708, E(): 1.9e-22, (42.4% identity in 389 aa overlap); etc. Mb2369c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X9" /protein_id="SIU00981.1" /translation="MSHVTAAPNVLAASAGELAAIGSTMRAANAAAAAPTAGVLAAGG DDVSAGIAALFGARAQAYQAISAQAALFHDRFVQILQEGAAAYAMAEAANALPLQKAQ GVVSELAQDRTGGTGTGQSRGAGGFGGVGQAGGKGWDGGPIGNGQVGEQHGAGQLGST DGNPGVAGAAHGSGVSASHGSGATGAAGVADPGGSGAGVGSAAGNGTGAGSADAVGGA GTGRDIVGSVRGDGGVGMASGDGGLSTGAAGASAEGGLMPGFGGAPWVGGHWGLGGEG HSGAIGGVGEQVAPAVATAPAVSPATTSAVAAESGSTPATKAQAMHATTNPGNAAHQG NPADPGNSARRADGGRDEQLLLLPLTSLRGLRHTLKKLSGLRARNGLLTASGDNASGS GRPWDRDQLLRALGLRPPGHE" tRNA complement(2601415..2601487) /locus_tag="BQ2027_ASNT" /product="tRNA-Asn" /note="asnT, len: 73 nt. Equivalent to asnT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Asn, anticodon gtt." CDS 2601605..2602024 /codon_start=1 /transl_table=11 /gene="lppQ" /locus_tag="BQ2027_MB2370" /product="PROBABLE CONSERVED LIPOPROTEIN LPPQ" /note="Mb2370, lppQ, len: 139 aa. Equivalent to Rv2341, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Probable lppQ, conserved lipoprotein, showing some similarity with Rv1228|O33224|LPQX|MTCI61.11 from Mycobacterium tuberculosis (185 aa), FASTA scores: opt: 155; E(): 0.0073; (31.9% identity in 116 aa overlap). Also shows few similarity with P29228|VLPA_MYCHR variant surface antigen A precursor from Mycoplasma hyorhinis (157 aa), FASTA scores: opt: 96, E(): 7.3, (23.1% identity in 143 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb2370 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Y8" /protein_id="SIU00982.1" /translation="MPVGGRQHVFEKLASILGLVAAPLMLLGLSACGRSAGKTSEPTC PTEPIDAADSSTTPDPSCVVRATEINGNGSRIQTWTGSYDAAATQSGGVCGGTCNFHA TVRFTVDEGQISGSVDQVYQAAMVAIATRPTSPSLAP" CDS 2602280..2602537 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2371" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2371, -, len: 85 aa. Equivalent to Rv2342, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein, highly similar to Q9CCG1|ML0834 HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 392, E(): 2.9e-20, (78.2% identity in 78 aa overlap). N-terminus highly similar to N-terminal part of Q9L085|SCC24.32 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (108 aa), FASTA scores: opt: 122, E(): 0.077, (39.15% identity in 46 aa overlap). Protein product from Mb2371 detected using SWATH mass spectrometry. Mb2371 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W0" /protein_id="SIU00983.1" /translation="MIGYVAVLGLGYVLGAKAGRRRYEQIASTYRALTGSPVARSMIE GGRRKIANRISPDAGFVTLAEIDNQTAVVQRGVERQPKTAR" CDS complement(2602541..2604460) /codon_start=1 /transl_table=11 /gene="dnaG" /locus_tag="BQ2027_MB2372C" /product="PROBABLE DNA PRIMASE DNAG" /note="Mb2372c, dnaG, len: 639 aa. Equivalent to Rv2343c, len: 639 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 639 aa overlap). Probable dnaG, DNA primase (EC 2.7.7.-), equivalent to O52200|PRIM_MYCSM|DNAG DNA PRIMASE from Mycobacterium smegmatis (636 aa), FASTA scores: opt: 3504, E(): 5.5e-202, (81.55% identity in 639 aa overlap); and Q9CCG2|DNAG|ML0833 DNA PRIMASE from Mycobacterium leprae (642 aa), FASTA scores: opt: 3443, E(): 2.5e-198, (80.4% identity in 642 aa overlap). Also highly similar to many DNA primases e.g. Q9S1N4|PRIM_STRCO|DNAG|SC7A8.07c from Streptomyces coelicolor (641 aa), FASTA scores: opt: 1899, E(): 5.1e-106, (47.9% identity in 643 aa overlap); P74893|PRIM_SYNP7|DNAG from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (616 aa), FASTA scores: opt: 860, E(): 6.6e-44, (35.3% identity in 513 aa overlap); P05096|PRIM_BACSU from Bacillus subtilis (603 aa) FASTA scores: opt: 800, E(): 2.5e-40, (33.7% identity in 430 aa overlap); etc. Protein product from Mb2372c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2372c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63963" /db_xref="InterPro:IPR002694" /db_xref="InterPro:IPR006171" /db_xref="InterPro:IPR006295" /db_xref="InterPro:IPR013173" /db_xref="InterPro:IPR013264" /db_xref="InterPro:IPR019475" /db_xref="InterPro:IPR030846" /db_xref="InterPro:IPR034151" /db_xref="InterPro:IPR036977" /db_xref="InterPro:IPR037068" /db_xref="UniProtKB/Swiss-Prot:P63963" /protein_id="SIU00984.1" /translation="MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFH NEKSPSFHVRPNHGHFHCFGCGEGGDVYAFIQKIEHVSFVEAVELLADRIGHTISYTG AATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFDAAAARKFGCG FAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRFHRRLLWPIRTSAGEVVGF GARRLFDDDAMEAKYVNTPETLLYKKSSVMFGIDLAKRDIAKGHQAVVVEGYTDVMAM HLAGVTTAVASCGTAFGGEHLAMLRRLMMDDSFFRGELIYVFDGDEAGRAAALKAFDG EQKLAGQSFVAVAPDGMDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSA EGRVAALRRCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRL GRLGSTTIARAAQRPTAGPPTELAVRPDPRDPTLWPQREALKSALQYPALAGPVFDAL TVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTSTVTSALISELGVEAIQ VDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQRMSPIEQGDEYHALFGDLVAMEAY RRSLLEQASGDDLTA" CDS complement(2604465..2605760) /codon_start=1 /transl_table=11 /gene="dgt" /locus_tag="BQ2027_MB2373C" /product="PROBABLE DEOXYGUANOSINE TRIPHOSPHATE TRIPHOSPHOHYDROLASE DGT (DGTPASE) (DGTP TRIPHOSPHOHYDROLASE)" /note="Mb2373c, dgt, len: 431 aa. Equivalent to Rv2344c, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 431 aa overlap). Probable dgt, deoxyguanosine triphosphate triphosphohydrolase (EC 3.1.5.1), equivalent to Q9CCG3|DGT|ML0831 PUTATIVE DEOXYGUANOSINE TRIPHOSPHATE TRIPHOSPHOHYDROLASE from Mycobacterium leprae (429 aa), FASTA scores: opt: 2316, E(): 1.6e-137, (83.85% identity in 421 aa overlap); and O52199|DGTP_MYCSM|AF027507_2 DEOXYGUANOSINETRIPHOSPHATE TRIPHOSPHOHYDROLASE from Mycobacterium smegmatis (428 aa), FASTA scores: opt: 1991, E(): 3.4e-117, (73.5% identity in 422 aa overlap). Also highly similar or similar to several deoxyguanosine triphosphate hydrolases e.g. Q9L2E9|SC7A8.09c PUTATIVE DEOXYGUANOSINETRIPHOSPHATE TRIPHOSPHOHYDROLASE from Streptomyces coelicolor (424 aa), FASTA scores: opt: 1216, E(): 1e-68, (51.05% identity in 425 aa overlap); BAB48544|MLL1093 DGTP TRIPHOSPHOHYDROLASE from Rhizobium loti (Mesorhizobium loti) (404 aa), FASTA scores: opt: 489, E(): 3.1e-23, (33.85% identity in 387 aa overlap); P15723|DGTP_ECOLI|DGT|B0160 from Escherichia coli strain K12 (504 aa), FASTA scores: opt: 173, E(): 0.0022, (31.65% identity in 259 aa overlap); etc. BELONGS TO THE DGTPASE FAMILY. Protein product from Mb2373c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2373c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A541" /db_xref="InterPro:IPR003607" /db_xref="InterPro:IPR006261" /db_xref="InterPro:IPR006674" /db_xref="InterPro:IPR023023" /db_xref="InterPro:IPR026875" /db_xref="UniProtKB/Swiss-Prot:P0A541" /protein_id="SIU00985.1" /translation="MSASEHDPYDDFDRQRRVAEAPKTAGLPGTEGQYRSDFARDRAR VLHSAALRRLADKTQVVGPREGDTPRTRLTHSLEVAQIGRGMAIGLGCDLDLVELAGL AHDIGHPPYGHNGERALDEVAASHGGFEGNAQNFRILTSLEPKVVDAQGLSAGLNLTR ASLDAVTKYPWMRGDGLGSQRRKFGFYDDDRESAVWVRQGAPPERACLEAQVMDWADD VAYSVHDVEDGVVSERIDLRVLAAEEDAAALARLGEREFSRVSADELMAAARRLSRLP VVAAVGKYDATLSASVALKRLTSELVGRFASAAIATTRAAAGPGPLVRFRADLQVPDL VRAEVAVLKILALQFIMSDPRHLETQARQRERIHRVAHRLYSGAPQTLDPVYAAAFNT AADDAARLRVVVDQIASYTEGRLERIDADQLGVSRNALD" CDS 2605829..2607811 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2374" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2374, -, len: 660 aa. Equivalent to Rv2345, len: 660 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 660 aa overlap). Possible conserved transmembrane protein, with hydrophobic stretch at N-terminal end around position 180. Similar to O52198 HYPOTHETICAL 21.2 KDA PROTEIN (FRAGMENT) from Mycobacterium smegmatis (195 aa), FASTA scores: opt: 589, E(): 1.5e-23; (47.2% identity in 195 aa overlap). Protein product from Mb2374 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2374 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y161" /db_xref="InterPro:IPR007621" /db_xref="UniProtKB/TrEMBL:A0A1R3Y161" /protein_id="SIU00986.1" /translation="MRLVRLLGMVLTILAAGLLLGPPAGAQPPFRLSNYVTDNAGVLT SSGRTAVTAAVDRLYADRRIRLWVVYVENFSGQSALNWAQRTTRTSELGNYDALLAVA TTGREYAFLVPSAMPGVSEGQVDNVRRYQIEPALHDGDYSGAAVAAANGLNRSPSSSS RVVLLVTVGIIVIVVAVLLVVMRHRNRRRRADELAAARRVDPTNVMALAAVPLQALDD LSRSMVVDVDNAVRTSTNELALAIEEFGERRTAPFTQAVNNAKAALSQAFTVRQQLDD NTPETPAQRRELLTRVIVSAAHADRELASQTEAFEKLRDLVINAPARLDLLTQQYVEL TTRIGPTQQRLAELHTEFDAAAMTSIAGNVTTATERLAFADRNISAARDLADQAVSGR QAGLVDAVRAAESALGQARALLDAVDSAATDIRHAVASLPAVVADIQTGIKRANQHLQ QAQQPQTGRTGDLIAARDAAARALDRARGAADPLTAFDQLTKVDADLDRLLATLAEEQ ATADRLNRSLEQALFTAESRVRAVSEYIDTRRGSIGPEARTRLAEAKRQLEAAHDRKS SNPTEAIAYANAASTLAAHAQSLANADVQSAQRAYTRRGGNNAGAILGGIIIGDLLSG GTRGGLGGWIPTSFGGSSNAPGSSPDGGFLGGGGRF" CDS complement(2607896..2608078) /codon_start=1 /transl_table=11 /gene="esxO" /locus_tag="BQ2027_MB2375C" /standard_name="ES6_6; Mtb9.9E" /product="putative esat-6 like protein esxo (esat-6 like protein 6)" /note="Mb2375c, esxO, len: 60 aa. Equivalent to 3' end of Rv2346c, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 60 aa overlap). esxO, putative ESAT-6 like protein 6, conserved hypotheticalprotein, member of proteins family from Mycobacterium tuberculosis, with O53942|Rv1793|MTV049.15, O05300|Rv1198|MTCI364.10, MTCY15C10.33, P96364|MTCY07H7B.03|Rv1037c|MTCY10G2.12, MTCI364.10, etc. BELONGS TO THE ESAT6 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 8963 bp deletion (RD5) leads to the loss of the NH2 part of Mb2375c, and the 9 following CDSs up to Rv2356 including the 3 phospolipases C enzymes plcC, plcB and plcA, compared to the homolog in Mycobacterium tuberculosis H37Rv. Protein product from Mb2375c detected using SWATH mass spectrometry. Mb2375c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C7" /protein_id="SIU00987.1" /translation="MLAAGDFWGGAGSVACQEFITQLGRNFQVIYEQANAHGQKVQAA GNNMAQTDSAVGSSWA" CDS complement(2608062..2609435) /codon_start=1 /transl_table=11 /gene="PPE71" /locus_tag="BQ2027_MB2376C" /product="PPE FAMILY PROTEIN" /note="Mb2376c, PPE71, len: 457 aa. Equivalent to 5' end of MT2423, len: 621 aa, from Mycobacterium tuberculosis strain CDC1551, (98.475% identity in 459 aa overlap). PPE FAMILY PROTEIN. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, there is a large 8963 bp deletion (RD5) in between Mb2375c and Mb2377c compared to Mycobacterium tuberculosis strain H37Rv. In this region of Mycobacterium bovis two substitutions are present that encode a CDS, Mb2376c, equivalent to that found in Mycobacterium tuberculosis strain CDC1551. The first substitution is of 1218 bp to 350 bp and the second is of 123 bp to 431 bp. Mb2376c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0W9" /protein_id="SIU00988.1" /translation="MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAE SFGLVTSGLAGGSGQAWQGAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAVAGAFEA ARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHGG ASAAAALAPWQQAVPGLLGLLDSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLN LGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFGS QNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNT GSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVG FGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAGNYNFGSLNAGNIN SSFVGRG" CDS complement(2610173..2612017) /codon_start=1 /transl_table=11 /gene="PPE40" /locus_tag="BQ2027_MB2377C" /product="ppe family protein ppe40" /note="Mb2377c, PPE40, len: 614 aa. Equivalent to Rv2356c, len: 615 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 615 aa overlap). Member of Mycobacterium tuberculosis PPE_family, highly similar to others e.g. Q10778|MTCY48.17|YF48_MYCTU HYPOTHETICAL PPE-FAMILY PROTEIN (678 aa), FASTA scores: opt: 1888, E(): 1.9e-78, (54.4% identity in 667 aa overlap); Q10540|MTCY31.06c, E241779|MTCY98, P42611|MTV037.06c, Q10813|MTCY274.23c, P71657|MTCY02B10.25c, MTCY03C7.23, P71869|MTCY03C7.24c, etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp deletion (gcg-*) leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (614 aa versus 615 aa). Mb2377c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z4" /protein_id="SIU00989.1" /translation="MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAE SFGLVTSGLAGGSGQAWQGAAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAVAGAFE AARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHG GASAAAALAPWQQAVPGLSGLLGGAANAPAAAAQGAAQGLAELTLNLGVGNIGSLNLG SGNIGGTNVGSGNVGGTNLGSGNYGSLNWGSGNTGTGNAGSGNTGDYNPGSGNFGSGN FGSGNIGSLNVGSGNFGTLNLANGNNGDVNFGGGNTGDFNFGGGNNGTLNFGFGNTGS GNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGTGNIGFGNSGNNNIGFFNSGDGNIGFF NSGDGNTGFGNAGNINTGFWNAGNLNTGFGSAGNGNVGIFDGGNSNSGSFNVGFQNTG FGNSGAGNTGFFNAGDSNTGFANAGNVNTGFFNGGDINTGGFNGGNVNTGFGSALTQA GANSGFGNLGTGNSGWGNSDPSGTGNSGFFNTGNGNSGFSNAGPAMLPGFNSGFANIG SFNAGIANSGNNLAGISNSGDDSSGAVNSGSQNSGAFNAGVGLSGFFR" CDS complement(2612155..2613546) /codon_start=1 /transl_table=11 /gene="glyS" /locus_tag="BQ2027_MB2378C" /product="PROBABLE GLYCYL-tRNA SYNTHETASE GLYS (GLYCINE--tRNA LIGASE) (GLYRS)" /note="Mb2378c, glyS, len: 463 aa. Equivalent to Rv2357c, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 463 aa overlap). Probable glyS, glycyl-tRNA synthetase (EC 6.1.1.14), equivalent to Q9CCG4|GLYS|ML0826 PUTATIVE GLYCYL-TRNA SYNTHASE from Mycobacterium leprae (463 aa), FASTA scores: opt: 2898, E(): 1e-179, (90.2% identity in 459 aa overlap). Also highly similar to others e.g. Q9L2H9|SYG_STRCO|SCC121.07c from Streptomyces coelicolor (460 aa), FASTA scores: opt: 2210, E(): 2.9e-135, (68.3% identity in 457 aa overlap); Q9PPZ7|SYG_UREPA|GLYS|UU493 GLYCYL-TRNA SYNTHETASE from Ureaplasma parvum (Ureaplasma urealyticum biotype 1) (473 aa), FASTA scores: opt: 1254, E(): 1.7e-73, (45.25% identity in 462 aa overlap); P75425|SYG_MYCPN|GLYS|MPN354|MP482 GLYCYL-TRNA SYNTHETASE from Mycoplasma pneumoniae (449 aa), FASTA scores: opt: 1074, E(): 6.9e-62, (39.45% identity in 454 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb2378c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2378c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67033" /db_xref="InterPro:IPR002314" /db_xref="InterPro:IPR002315" /db_xref="InterPro:IPR004154" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR022961" /db_xref="InterPro:IPR027031" /db_xref="InterPro:IPR033731" /db_xref="InterPro:IPR036621" /db_xref="UniProtKB/Swiss-Prot:P67033" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00990.1" /translation="MHHPVAPVIDTVVNLAKRRGFVYPSGEIYGGTKSAWDYGPLGVE LKENIKRQWWRSVVTGRDDVVGIDSSIILPREVWVASGHVDVFHDPLVESLITHKRYR ADHLIEAYEAKHGHPPPNGLADIRDPETGEPGQWTQPREFNMMLKTYLGPIETEEGLH YLRPETAQGIFVNFANVVTTARKKPPFGIGQIGKSFRNEITPGNFIFRTREFEQMEME FFVEPATAKEWHQYWIDNRLQWYIDLGIRRENLRLWEHPKDKLSHYSDRTVDIEYKFG FMGNPWGELEGVANRTDFDLSTHARHSGVDLSFYDQINDVRYTPYVIEPAAGLTRSFM AFLIDAYTEDEAPNTKGGMDKRTVLRLDPRLAPVKAAVLPLSRHADLSPKARDLGAEL RKCWNIDFDDAGAIGRRYRRQDEVGTPFCVTVDFDSLQDNAVTVRERDAMTQDRVAMS SVADYLAVRLKGS" CDS 2613728..2614135 /codon_start=1 /transl_table=11 /gene="smtb" /locus_tag="BQ2027_MB2379" /product="probable transcriptional regulatory protein smtb (probably arsr-family)" /note="Mb2379, -, len: 135 aa. Equivalent to Rv2358, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Probable transcriptional regulator, arsR family, equivalent to Q9CCG5|ML0825 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (140 aa), FASTA scores: opt: 647, E(): 2e-34, (72.9% identity in 140 aa overlap). Also similar to others e.g. BAB48273|MLR0745 Transcriptional regulator from Rhizobium loti (Mesorhizobium loti) (104 aa), FASTA scores: opt: 185, E(): 3.4e-05, (43.25% identity in 74 aa overlap) (has its N-terminus shorter); P15905|ARR1_ECOLI arsenical resistance operon repressor from Escherichia coli (117 aa), FASTA scores: opt: 164, E(): 8.1e-05, (39.1% identity in 69 aa overlap); etc. Also similar to O53838|Rv0827|MTV043.19c PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (130 aa), FASTA scores: opt: 201, E(): 4e-06, (35.7% identity in 98 aa overlap); and O69711|Rv3744|MTV025.092 PUTATIVE REGULATORY PROTEIN from Mycobacterium tuberculosis (120 aa), FASTA scores: opt: 209, E(): 1.2e-06, (35.5 % identity in 93 aa overlap). Contains possible helix-turn-helix motif at aa 72-93 (Score 1103, +2.94 SD). Belongs to the ARSR family of transciptional regulators. Protein product from Mb2379 detected using SWATH mass spectrometry. Mb2379 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y102" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y102" /protein_id="SIU00991.1" /translation="MVTSPSTPTAAHEDVGADEVGGHQHPADRFAECPTFPAPPPREI LDAAGELLRALAAPVRIAIVLQLRESQRCVHELVDALHVPQPLVSQHLKILKAAGVVT GERSGREVLYRLADHHLAHIVLDAVAHAGEDAI" CDS 2614132..2614524 /codon_start=1 /transl_table=11 /gene="zur" /locus_tag="BQ2027_MB2380" /product="probable zinc uptake regulation protein zur" /note="Mb2380, furB, len: 130 aa. Equivalent to Rv2359, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 130 aa overlap). Probable furB, ferric uptake regulation protein, equivalent to FURB|ML0824|Q9CCG6 PUTATIVE FERRIC UPTAKE REGULATORY PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: opt: 765, E(): 1.7e-43, (86.9% identity in 130 aa overlap). Also highly similar to FERRIC UPTAKE REGULATION PROTEINS e.g. Q9L2H5|SCC121.11 PUTATIVE METAL UPTAKE REGULATION PROTEIN from Streptomyces coelicolor (139 aa), FASTA scores: opt: 547, E(): 3.4e-29, (59.4% identity in 133 aa overlap); P06975|FUR_ECOLI from Escherichia coli (148 aa), FASTA scores: opt: 322, E(): 1.9e-14, (37.9% identity in 132 aa overlap); P45599|FUR_KLEPN FERRIC UPTAKE REGULATION PROTEIN from Klebsiella pneumoniae (155 aa), FASTA scores: opt: 314, E(): 6.7e-14, (36.35% identity in 132 aa overlap); etc. BELONGS TO THE FUR FAMILY. Protein product from Mb2380 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2380 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y106" /db_xref="InterPro:IPR002481" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y106" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00992.1" /translation="MSAAGVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGL TTVYRTLQSMASSGLVDTLRTDTGESVYRRCSEHHHHHLVCRSCGSTIEVGDHEVEAW AAEVATKHGFSDVSHTIEIFGTCSDCRS" CDS complement(2614632..2615060) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2381C" /product="unknown protein" /note="Mb2381c, -, len: 142 aa. Equivalent to Rv2360c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 142 aa overlap). Hypothetical unknown protein. Protein product from Mb2381c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2381c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y0X8" /protein_id="SIU00993.1" /translation="MPSLPDRLASILRDVLPAEEEPDGALTVRHDGTFASLRVVSIAE DLELVSLTQILAWDLPLTKRLTEQVAKQARDINFGSVSLREKVSEKAARRSSGRPASN TADVMLRYNFPGTGLTDDALRTLILLVLETGATIRSALVG" CDS complement(2615060..2615950) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2382C" /product="LONG (C50) CHAIN Z-ISOPRENYL DIPHOSPHATE SYNTHASE (Z-DECAPRENYL DIPHOSPHATE SYNTHASE)" /note="Mb2382c, -, len: 296 aa. Equivalent to Rv2361c, len: 296 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 296 aa overlap). Long (C50) chain Z-isoprenyl diphosphate synthase (EC 2.5.1.-) (see citation below), equivalent to UPPS_MYCLE|ML0634|B1937_F2_65|P38119 UNDECAPRENYL PYROPHOSPHATE SYNTHETASE from Mycobacterium leprae (296 aa), FASTA scores: opt: 1789, E(): 1.8e-97, (86.5% identity in 296 aa overlap). Also highly similar to others e.g. UPPS|Q9L2H4 UNDECAPRENYL PYROPHOSPHATE SYNTHETASE from Streptomyces coelicolor (277 aa), FASTA scores: opt: 1098, E(): 8.2e-60, (63.5% identity in 247 aa overlap); Q55482|UPPS_SYNY3|SLL0506 from Synechocystis sp. strain PCC 6803 (249 aa), FASTA scores: opt: 686, E(): 4.2e-33, (46.4% identity in 235 aa overlap); O67291|UPPS_AQUAE|AQ_1248 from Aquifex aeolicus (231 aa), FASTA scores: opt: 684, E(): 5.2e-33, (46.3% identity in 229 aa overlap); etc. Also similar to Rv1086|MTV017.39 from Mycobacterium tuberculosis. Contains PS01066 Hypothetical YBR002c family signature. SEEMS TO BELONG TO THE UPP SYNTHETASE FAMILY. Note that previously known as uppS. Protein product from Mb2382c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2382c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P60478" /db_xref="InterPro:IPR001441" /db_xref="InterPro:IPR018520" /db_xref="InterPro:IPR036424" /db_xref="UniProtKB/Swiss-Prot:P60478" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU00994.1" /translation="MARDARKRTSSNFPQLPPAPDDYPTFPDTSTWPVVFPELPAAPY GGPCRPPQHTSKAAAPRIPADRLPNHVAIVMDGNGRWATQRGLARTEGHKMGEAVVID IACGAIELGIKWLSLYAFSTENWKRSPEEVRFLMGFNRDVVRRRRDTLKKLGVRIRWV GSRPRLWRSVINELAVAEEMTKSNDVITINYCVNYGGRTEITEATREIAREVAAGRLN PERITESTIARHLQRPDIPDVDLFLRTSGEQRSSNFMLWQAAYAEYIFQDKLWPDYDR RDLWAACEEYASRTRRFGSA" CDS complement(2615943..2616740) /codon_start=1 /transl_table=11 /gene="reco" /locus_tag="BQ2027_MB2383C" /product="possible dna repair protein reco" /note="Mb2383c, -, len: 265 aa. Equivalent to Rv2362c, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 265 aa overlap). Conserved hypothetical protein, equivalent to the Mycobacterium leprae proteins Q49754|B1937_F1_25 Hypothetical protein (269 aa), FASTA scores: opt: 1561, E(): 8.5e-93, (86.6% identity in 268 aa overlap); and Q9CCN0|ML0633 Hypothetical protein (268 aa), FASTA scores: opt: 1560, E(): 8.5e-93, (86.6% identity in 268 aa overlap). Also highly similar to Q9L2H3|SCC121.13c HYPOTHETICAL 27.1 KDA PROTEIN from Streptomyces coelicolor (251 aa), FASTA scores: opt: 843, E(): 6.9e-47, (52.2% identity in 249 aa overlap); ans similar to other hypothetical proteins. Weak similarity with P42095|RECO_BACSU DNA REPAIR PROTEIN RECOMBINASE from Bacillus subtilis (255 aa), FASTA scores: opt: 270, E(): 3.6e-10, (26.4% identity in 182 aa overlap). Mb2383c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65984" /db_xref="InterPro:IPR003717" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR022572" /db_xref="InterPro:IPR037278" /db_xref="InterPro:IPR042242" /db_xref="UniProtKB/Swiss-Prot:P65984" /protein_id="SIU00995.1" /translation="MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTR SKFGARLEPFAHIEVQLHPGRNLDIVTQVVSVDAFATDIVADYGRYTCGCAILETAER LAGEERAPAPALHRLTVGALRAVADGQRPRDLLLDAYLLRAMGIAGWAPALTECARCA TPGPHRAFHIATGGSVCAHCRPAGSTTPPLGVVDLMSALYDGDWEAAEAAPQSARSHV SGLVAAHLQWHLERQLKTLPLVERFYQADRSVAERRAALIGQDIAGG" CDS 2616802..2618256 /codon_start=1 /transl_table=11 /gene="amiA2" /locus_tag="BQ2027_MB2384" /product="PROBABLE AMIDASE AMIA2 (AMINOHYDROLASE)" /note="Mb2384, amiA2, len: 484 aa. Equivalent to Rv2363, len: 484 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 484 aa overlap). Probable amiA2, amidase (EC 3.5.1.4), highly similar or similar to others e.g. O28325|YJ54_ARCFU|AF1954 PUTATIVE AMIDASE from Archaeoglobus fulgidus (453 aa), FASTA scores: opt: 777, E(): 1.1e-38, (35.0% identity in 474 aa overlap); Q55424|AMID_SYNY3|SLL0828 PUTATIVE AMIDASE from Synechocystis sp. strain PCC 6803 (506 aa), FASTA scores: opt: 770, E(): 3e-38, (36.4% identity in 456 aa overlap); Q53116|AMDA ENANTIOMERASE-SELECTIVE AMIDASE from Rhodococcus sp. (462 aa), FASTA scores: opt: 701, E(): 3.5e-34, (32.7% identity in 468 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. AMI2_MYCTU|AMIB2|Q11056|Rv1263|MT1301|MTCY50.19c|cy50 .19c AMIDASE (462 aa), FASTA scores: opt: 1141, E(): 2.9e-60, (45.4% identity in 454 aa overlap); etc. Contains PS00571 Amidases signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE AMIDASE FAMILY. Protein product from Mb2384 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2384 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63491" /db_xref="InterPro:IPR000120" /db_xref="InterPro:IPR020556" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/Swiss-Prot:P63491" /protein_id="SIU00996.1" /translation="MVGASGSDAGAISGSGNQRLPTLTDLLYQLATRAVTSEELVRRS LRAIDVSQPTLNAFRVVLTESALADAAAADKRRAAGDTAPLLGIPIAVKDDVDVAGVP TAFGTQGYVAPATDDCEVVRRLKAAGAVIVGKTNTCELGQWPFTSGPGFGHTRNPWSR RHTPGGSSGGSAAAVAAGLVTAAIGSDGAGSIRIPAAWTHLVGIKPQRGRISTWPLPE AFNGVTVNGVLARTVEDAALVLDAASGNVEGDRHQPPPVTVSDFVGIAPGPLKIALST HFPYTGFRAKLHPEILAATQRVGDQLELLGHTVVKGNPDYGLRLSWNFLARSTAGLWE WAERLGDEVTLDRRTVSNLRMGHVLSQAILRSARRHEAADQRRVGSIFDIVDVVLAPT TAQPPPMARAFDRLGSFGTDRAIIAACPSTWPWNLLGWPSINVPAGFTSDGLPIGVQL MGPANSEGMLISLAAELEAVSGWATKQPQVWWTS" CDS complement(2618253..2619155) /codon_start=1 /transl_table=11 /gene="era" /locus_tag="BQ2027_MB2385C" /product="PROBABLE GTP-BINDING PROTEIN ERA" /note="Mb2385c, era, len: 300 aa. Equivalent to Rv2364c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Probable era, GTP-binding protein, equivalent to Q49768|ERA_MYCLE|ML0631|B1937_F3_102 GTP-BINDING PROTEIN ERA HOMOLOG from Mycobacterium leprae (300 aa) FASTA scores: opt: 1589, E(): 3.4e-88, (81.4% identity in 301 aa overlap). Also highly similar to other GTP-binding proteins e.g. Q9RDF2|ERA_STRCO|SCC77.06 from Streptomyces coelicolor (317 aa), FASTA scores: opt: 1264, E(): 1.1e-68, (64.0% identity in 306 aa overlap); Q9KD52|ERA_BACHD|BH1367|BEX from Bacillus halodurans (304 aa), FASTA scores: opt: 869, (44.8% identity in 297 aa overlap); Q9KIH7|ERA_LACLA|ERAL from Lactococcus lactis (subsp. lactis) (Streptococcus lactis), and Lactococcus lactis (subsp. cremoris) (Streptococcus cremoris) (303 aa), FASTA scores: opt: 781, E(): 9.4e-40, (40.25% identity in 298 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ERA/TRME FAMILY OF GTP-BINDING PROTEINS, ERA SUBFAMILY. Note that previously known as bex. Protein product from Mb2385c detected using SWATH mass spectrometry. Mb2385c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A563" /db_xref="InterPro:IPR004044" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR005662" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR009019" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR030388" /db_xref="UniProtKB/Swiss-Prot:P0A563" /protein_id="SIU00997.1" /translation="MTEFHSGFVCLVGRPNTGKSTLTNALVGAKVAITSTRPQTTRHA IRGIVHSDDFQIILVDTPGLHRPRTLLGKRLNDLVRETYAAVDVIGLCIPADEAIGPG DRWIVEQLRSTGPANTTLVVIVTKIDKVPKEKVVAQLVAVSELVTNAAEIVPVSAMTG DRVDLLIDVLAAALPAGPAYYPDGELTDEPEEVLMAELIREAALQGVRDELPHSLAVV IDEVSPREGRDDLIDVHAALYVERDSQKGIVIGKGGARLREVGTAARSQIENLLGTKV YLDLRVKVAKNWQRDPKQLGRLGF" CDS complement(2619229..2619570) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2386C" /product="Protein involved in pyrimidine metabolism" /note="Mb2386c, -, len: 113 aa. Equivalent to Rv2365c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 113 aa overlap). Conserved hypothetical protein, highly similar to Q49767|ML0630|B1937_F3_101|CAC30138 Hypothetical protein from Mycobacterium leprae (108 aa), FASTA scores: opt: 426, E(): 1.4e-18, (67.9% identity in 106 aa overlap). Also highly similar to Q9RDF3|SCC77.05 from Streptomyces coelicolor (132 aa), FASTA scores: opt: 254, E(): 1.9e-18, (53.1% identity in 96 aa overlap). Equivalent to AAK46728 from Mycobacterium tuberculosis strain CDC1551 (93 aa) but longer 20 aa. Protein product from Mb2386c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2386c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Z2" /db_xref="InterPro:IPR016193" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z2" /protein_id="SIU00998.1" /translation="MMRRPITLAEQLDAEDAKLVVLARAAMARAEAGAGAAVRDVDGR TYAAAPVALSALELTGLQAAVAAAVSSGATGLQAAVLVAGSVDDPGIAAVRELAPTAA IIVTDRAGNPL" CDS complement(2619542..2620849) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2387C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2387c, -, len: 435 aa. Equivalent to Rv2366c, len: 435 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 435 aa overlap). Probable conserved transmembrane protein, highly similar to Q9L2L3|SCC117.07 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (358 aa), FASTA scores: opt: 1159, E(): 5.5e-64, (53.0% identity in 353 aa overlap); ans similar to hypothetical proteins and hemolysin-related proteins e.g. Q9HN02|HLP|VNG2308G HEMOLYSIN PROTEIN from Halobacterium sp. strain NRC-1 (457 aa), FASTA scores: opt: 623, E(): 6.2e-31, (28.4% identity in 433 aa overlap); etc. Potential transmembrane protein with 2 CBS domains. BELONGS TO THE UPF0053 FAMILY. Protein product from Mb2387c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2387c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67131" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR002550" /db_xref="InterPro:IPR005170" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/Swiss-Prot:P67131" /protein_id="SIU00999.1" /translation="MTGYYQLLGSIVLIGLGGLFAAIDAAISTVSPARVDELVRDQRP GAGSLRKVMADRPRYVNLVVLLRTSCEITATALLVVFIRYHFSMVWGLYLAAGIMVLA SFVVVGVGPRTLGRQNAYSISLATALPLRLISWLLMPISRLLVLLGNALTPGRGFRNG PFASEIELREVVDLAQQRGVVAADERRMIESVFELGDTPAREVMVPRTEMIWIESDKT AGQAMTLAVRSGHSRIPVIGENVDDIVGVVYLKDLVEQTFCSTNGGRETTVARVMRPA VFVPDSKPLDALLREMQRDRNHMALLVDEYGAIAGLVSIEDVLEEIVGEIADEYDQAE TAPVEDLGDKRFRVSARLPIEDVGELYGVEFDDDLDVDTVGGLLALELGRVPLPGAEV ISHGLRLHAEGGTDHRGRVRIGTVLLSPAEPDGADDEEADHPG" CDS complement(2620846..2621394) /codon_start=1 /transl_table=11 /gene="ybeY" /locus_tag="BQ2027_MB2388C" /product="Metal-dependent hydrolase YbeY, involved in rRNA and/or ribosome maturation and assembly" /note="Mb2388c, -, len: 182 aa. Equivalent to Rv2367c, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Conserved hypothetical protein, equivalent to Q49752|YN67_MYCLE|ML0628|B1937_F1_21 HYPOTHETICAL 19.8 KDA PROTEIN from Mycobacterium leprae (178 aa), FASTA scores: opt: 1051, E(): 2e-59, (89.1% identity in 175 aa overlap). Also highly similar to others e.g. Q9L2L4|SCC117.06 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (165 aa), FASTA scores: opt: 599, E(): 6e-31, (56.5% identity in 154 aa overlap); Q9KD56|BH1363 HYPOTHETICAL PROTEIN from Bacillus halodurans (159 aa), FASTA scores: opt: 311, E(): 8.3e-13, (45.05% identity in 111 aa overlap); etc. Protein product from Mb2388c detected using shotgun mass spectrometry. Mb2388c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67135" /db_xref="InterPro:IPR002036" /db_xref="InterPro:IPR020549" /db_xref="InterPro:IPR023091" /db_xref="UniProtKB/Swiss-Prot:P67135" /protein_id="SIU01000.1" /translation="MREHLMSIEVANESGIDVSEAELVSVARFVIAKMDVNPCAELSM LLLDTAAMADLHMRWMDLPGPTDVMSFPMDELEPGGRPDAPEPGPSMLGDIVLCPEFA AEQAAAAGHSLGHELALLTIHGVLHLLGYDHAEPDEEKEMFALQDRLLEEWVADQVEA YQHDRQDEKDRRLLDKSRYFDL" CDS complement(2621398..2622456) /codon_start=1 /transl_table=11 /gene="phoH1" /locus_tag="BQ2027_MB2389C" /product="PROBABLE PHOH-LIKE PROTEIN PHOH1 (PHOSPHATE STARVATION-INDUCIBLE PROTEIN PSIH)" /note="Mb2389c, phoH1, len: 352 aa. Equivalent to Rv2368c, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Probable phoH1, phoH-like protein (phosphate starvation-induced protein), probably ATP-binding protein, equivalent to Q49751|PHOL_MYCLE| ML0627|B1937_F1_20 PHOH-LIKE PROTEIN from Mycobacterium leprae (349 aa), FASTA scores: opt: 1952, E(): 4.7e-107, (88.9% identity in 352 aa overlap). Also highly similar to Q9L2L5|SCC117.05 PHOH-LIKE PROTEIN from Streptomyces coelicolor (359 aa), FASTA scores: opt: 1407, E(): 3.6e-75, (63.6% identity in 349 aa overlap); Q9RSY1|DR1988 PHOH-RELATED PROTEIN from Deinococcus radiodurans (380 aa), FASTA scores: opt: 1053, E(): 1.9e-54, (53.3% identity in 349 aa overlap); Q9KD58|PHOH|BH1361 PHOSPHATE STARVATION-INDUCED PROTEIN from Bacillus halodurans (320 aa), FASTA scores: opt: 1019, E(): 1.6e-52, (54.35% identity in 300 aa overlap); P46343|PHOL_BACSU PHOH-LIKE PROTEIN from Bacillus subtilis (319 aa), FASTA scores: opt: 1014, E(): 3.2e-52, (50.8% identity in 303 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE PHOH FAMILY. Note that previously known as phoH. Protein product from Mb2389c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2389c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5S1" /db_xref="InterPro:IPR003714" /db_xref="InterPro:IPR004087" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036612" /db_xref="UniProtKB/Swiss-Prot:P0A5S1" /protein_id="SIU01001.1" /translation="MTSRETRAADAAGARQADAQVRSSIDVPPDLVVGLLGSADENLR ALERTLSADLHVRGNAVTLCGEPADVALAERVISELIAIVASGQSLTPEVVRHSVAML VGTGNESPAEVLTLDILSRRGKTIRPKTLNQKRYVDAIDANTIVFGIGPAGTGKTYLA MAKAVHALQTKQVTRIILTRPAVEAGERLGFLPGTLSEKIDPYLRPLYDALYDMMDPE LIPKLMSAGVIEVAPLAYMRGRTLNDAFIVLDEAQNTTAEQMKMFLTRLGFGSKVVVT GDVTQIDLPGGARSGLRAAVDILEDIDDIHIAELTSVDVVRHRLVSEIVDAYARYEEP GSGLNRAARRASGARGRR" CDS complement(2622428..2622730) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2390C" /product="HYPOTHETICAL PROTEIN" /note="Mb2390c, -, len: 100 aa. Equivalent to Rv2369c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). Hypothetical unknown protein. Mb2390c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y116" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01002.1" /translation="MIVGLADRHGHGRDVAAHRQAQLAGPRVAAVRRHRTGGHRQASS RIKVSAHGLGVVRCAPTPSLTGVRMKLQHSSVRQVPVDRPESRHQKPGDVPRDPRC" CDS complement(2622727..2624040) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2391C" /product="Possible regulatory protein Trx" /note="Mb2391c, -, len: 437 aa. Equivalent to Rv2370c, len: 437 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 437 aa overlap). Conserved hypothetical protein, member of family proteins from Mycobacterium tuberculosis with Rv1453|MTCY493_01c|O06807 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (432 aa), FASTA scores: opt: 1943, E(): 9.4e-115, (69.9% identity in 409 aa overlap); Rv1194c|MTCI364.06c; etc. Also similar to AAK45764|MT1500 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (432 aa), FASTA scores: opt: 1934, E(): 9.4e-115, (69.9% identity in 409 aa overlap)." /db_xref="InterPro:IPR025736" /db_xref="InterPro:IPR041522" /db_xref="InterPro:IPR042070" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Y6" /protein_id="SIU01003.1" /translation="MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIA ADPALATVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASALD VYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAGLAAQMQLEYD ELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSHTAAIIWYDDPDDNQNHLD HTARAFGRALGCPQPLIAVASAATRWVWVSDAATLDTDRIHQVLDHAPHARIAVGTTA RGIDGFRRSHRDALATQRMLARLRSQQRLAFFADIHMIAVLTENPDSAADFITSTLGD LESASPQLLTTVLTYINEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQV AVAISALQWRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER" CDS 2624235..2624420 /codon_start=1 /transl_table=11 /gene="PE_PGRS40" /locus_tag="BQ2027_MB2392" /product="pe-pgrs family protein pe_pgrs40" /note="Mb2392, PE_PGRS40, len: 61 aa. Equivalent to Rv2371, len: 61 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 61 aa overlap). Short protein, member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to N-terminal part of others e.g. AAK44356|MT0132 PE_PGRS FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (561 aa), FASTA scores: opt: 217, E(): 4.9e-08, (69.65% identity in 56 aa overlap); etc. Mb2392 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y370" /protein_id="SIU01004.1" /translation="MSLVSVAPELVVTAVPDVARIGSSIGAPDTAAAARPTTSVLAAG ADEVSADVVALFGWVAR" CDS complement(2624519..2625307) /codon_start=1 /transl_table=11 /gene="rsmE" /locus_tag="BQ2027_MB2393C" /product="16S rRNA (uracil(1498)-N(3))-methyltransferase (EC" /EC_number="2.1.1.193" /note="Mb2393c, -, len: 262 aa. Equivalent to Rv2372c, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Conserved hypothetical protein, equivalent to Q9CCN1|ML0626 HYPOTHETICAL PROTEIN from Mycobacterium leprae (257 aa), FASTA scores: opt: 1277, E(): 3e-71, (77.25% identity in 255 aa overlap). Also highly similar to others e.g. Q9RDD9|SDRD HYPOTHETICAL 26.1 KDA PROTEIN from Streptomyces coelicolor (249 aa), FASTA scores: opt: 624, E(): 3.2e-31, (45.05% identity in 253 aa overlap); P54461|YQEU_BACSU hypothetical 28.8 kd protein from Bacillus subtilis (256 aa), FASTA scores: opt: 375, E(): 6e-16, (32.5% identity in 234 aa overlap); etc. C-terminal half highly similar to Q49763|B1937_F2_57 from Mycobacterium leprae (128 aa), FASTA scores: opt: 577, E(): 1.4e-28, (75.8% identity in 124 aa overlap). BELONGS TO THE UPF0088 FAMILY. Protein product from Mb2393c detected using SWATH mass spectrometry. Mb2393c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67203" /db_xref="InterPro:IPR006700" /db_xref="InterPro:IPR015947" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="UniProtKB/Swiss-Prot:P67203" /protein_id="SIU01005.1" /translation="MVAMLFYVDTLPDTGAVAVVDGDEGFHAATVRRIRPGEQLVLGD GVGRLARCVVEQAGRGGLRARVLRRWSVPPVRPPVTVVQALPKSERSELAIELATEAG ADAFLAWQAARCVANWDGARVDKGLRRWRAVVRSAARQSRRARIPPVDGVLSTPMLVQ RVREEVAAGAAVLVLHEEATERIVDIAAAQAGSLMLVVGPEGGIAPDELAALTDAGAV AVRLGPTVLRTSTAAAVALGAVGVLTSRWDASASDCEYCDVTRR" CDS complement(2625321..2626469) /codon_start=1 /transl_table=11 /gene="dnaJ2" /locus_tag="BQ2027_MB2394C" /product="PROBABLE CHAPERONE PROTEIN DNAJ2" /note="Mb2394c, dnaJ2, len: 382 aa. Equivalent to Rv2373c, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 382 aa overlap). Probable dnaJ2, chaperone protein, equivalent to Q49762|DNJ2_MYCLE|ML0625|B1937_F2_56 CHAPERONE PROTEIN from Mycobacterium leprae (378 aa), FASTA scores: opt: 2301, E(): 1.7e-120, (87.5% identity in 382 aa overlap). Also highly similar to OTHER CHAPERONE PROTEINS DNAJ/DNAJ2 e.g. Q9RDD7|DNJ2_STRCO|SCC77.21c from Streptomyces coelicolor (378 aa), FASTA scores: opt: 1456, E(): 1.2e-73, (54.8% identity in 385 aa overlap); O52164|DNJ2_STRAL from Streptomyces albus (379 aa) FASTA scores: opt: 1378, E(): 2.6e-69, (52.2% identity in 385 aa overlap); Q9S5A3|DNAJ_LISMO from Listeria monocytogenes (377 aa), FASTA scores: opt: 1013, E(): 4.6e-49, (41.3% identity in 385 aa overlap); etc. Also similar to Rv0352|MTCY13E10.12 from Mycobacterium tuberculosis. Contains 1 J domain and 1 CR domain. BELONGS TO THE DNAJ FAMILY. Protein product from Mb2394c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2394c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63967" /db_xref="InterPro:IPR001305" /db_xref="InterPro:IPR001623" /db_xref="InterPro:IPR002939" /db_xref="InterPro:IPR008971" /db_xref="InterPro:IPR012724" /db_xref="InterPro:IPR036410" /db_xref="InterPro:IPR036869" /db_xref="UniProtKB/Swiss-Prot:P63967" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01006.1" /translation="MARDYYGLLGVSKNASDADIKRAYRKLARELHPDVNPDEAAQAK FKEISVAYEVLSDPDKRRIVDLGGDPLESAAAGGNGFGGFGGLGDVFEAFFGGGFGGG AASRGPIGRVRPGSDSLLRMRLDLEECATGVTKQVTVDTAVLCDRCQGKGTNGDSVPI PCDTCGGRGEVQTVQRSLLGQMLTSRPCPTCRGVGVVIPDPCQQCMGDGRIRARREIS VKIPAGVGDGMRVRLAAQGEVGPGGGPAGDLYVEVHEQAHDVFVREGDHLHCTVSVPM VDAALGVTVTVDAILDGLSEITIPPGTQPGSVITLRGRGMPHLRSNTRGDLHVHVEVV VPTRLDHQDIELLRELKGRRDREVAEVRSTHAAAGGLFSRLRETFTGR" CDS complement(2626544..2627575) /codon_start=1 /transl_table=11 /gene="hrcA" /locus_tag="BQ2027_MB2395C" /product="probable heat shock protein transcriptional repressor hrca" /note="Mb2395c, hrcA, len: 343 aa. Equivalent to Rv2374c, len: 343 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 343 aa overlap). Probable hrcA, heat-inducible transcriptional repressor, equivalent to Q9CCN2|HRCA|ML0624 PUTATIVE HEAT-INDUCIBLE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (343 aa), FASTA scores: opt: 1926, E(): 3.9e-107, (89.8% identity in 343 aa overlap). Also highly similar to other heat-inducible transcription repressor proteins e.g. Q9RDD6|HRCA|SCC77.22c from Streptomyces coelicolor (338 aa), FASTA scores: opt: 1227, E(): 1.1e-65, (58.8% identity in 335 aa overlap); O52163|HRCA_STRAL from Streptomyces albus (338 aa), FASTA scores: opt: 1196, E(): 7.7e-64, (56.1% identity in 335 aa overlap); P25499|HRCA_BACSU HEAT-INDUCIBLE TRANSCRIPTION REPRESSOR from Bacillus subtilis (343 aa), FASTA scores: opt: 538, E(): 8.4e-25, (28.9% identity in 325 aa overlap); etc. Almost identical, but conflict at C-terminus, to Q49749|YGRP|B1937_F1_18 PUTATIVE HEAT-INDUCIBLE TRANSCRIPTION REPRESSOR from Mycobacterium leprae (197 aa) FASTA scores: opt: 1126, E(): 6.9e-60, (91.8% identity in 195 aa overlap). BELONGS TO THE HRCA FAMILY. Protein product from Mb2395c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2395c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64399" /db_xref="InterPro:IPR002571" /db_xref="InterPro:IPR021153" /db_xref="InterPro:IPR023120" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P64399" /protein_id="SIU01007.1" /translation="MGSADERRFEVLRAIVADFVATQEPIGSKSLVERHNLGVSSATV RNDMAVLEAEGYITQPHTSSGRVPTEKGYREFVDRLEDVKPLSSAERRAIQSFLESGV DLDDVLRRAVRLLAQLTRQVAVVQYPTLSTSTVRHLEVIALTPARLLMVVITDSGRVD QRIVELGDVIDDHQLAQLREILGQALEGKKLSAASVAVADLASQLGGAGGLGDAVGRA ATVLLESLVEHTEERLLLGGTANLTRNAADFGGSLRSILEALEEQVVVLRLLAAQQEA GKVTVRIGHETASEQMVGTSMVSTAYGTAHTVYGGMGVVGPTRMDYPGTIASVAAVAL YIGDVLGAR" CDS 2627747..2628064 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2396" /product="Transcription regulator of the Arc/MetJ class" /note="Mb2396, -, len: 105 aa. Equivalent to Rv2375, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Conserved hypothetical protein, highly similar to only CAC32314|2SCD60.09c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (98 aa), FASTA scores: opt: 425, E(): 5.7e-24, (63.25% identity in 98 aa overlap). Protein product from Mb2396 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2396 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014447" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z8" /protein_id="SIU01008.1" /translation="MIFKGVREGKPYPEHGLSYRDWSQIPPQQIRLDELVTTTTVLAL DRLLSEDSTFYGDLFPHAVKWRGTTYLEDGLHRAVRAALRNRTVLHARVFDMDASPGG RRS" CDS complement(2628091..2628597) /codon_start=1 /transl_table=11 /gene="cfp2" /locus_tag="BQ2027_MB2397C" /standard_name="mtb12" /product="LOW MOLECULAR WEIGHT ANTIGEN CFP2 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 2) (CFP-2)" /note="Mb2397c, cfp2, len: 168 aa. Equivalent to Rv2376c, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). cfp2 (alternate gene name: mtb12), low molecular weight antigen, secreted protein similar to Q49771|MB12_MYCLE|ML0620|B1937_F3_91 LOW MOLECULAR WEIGHT ANTIGEN MTB12 HOMOLOG PRECURSOR from Mycobacterium leprae (167 aa), FASTA scores: opt: 682, E(): 1.7e-32, (65.5% identity in 165 aa overlap). BELONGS TO THE MTB12 FAMILY. Protein product from Mb2397c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2397c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5P9" /db_xref="UniProtKB/Swiss-Prot:P0A5P9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01009.1" /translation="MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGA PLPLDPASAPDVPTAAQLTSLLNSLADPNVSFANKGSLVEGGIGGTEARIADHKLKKA AEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTQNVTFVNQGGWMLSRASAM ELLQAAGN" CDS complement(2628697..2628912) /codon_start=1 /transl_table=11 /gene="mbtH" /locus_tag="BQ2027_MB2398C" /product="MbtH-like NRPS chaperone => MbtH" /note="Mb2398c, mbtH, len: 71 aa. Equivalent to Rv2377c, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 71 aa overlap). Putative mbtH, conserved protein with no function assigned (see first and second citation), similar to hypothetical proteins or proteins found in several gene clusters for biosynthesis or transport of siderophores and other nonribosomally synthesized peptides e.g. Q9Z388|SCE8.11c PUTATIVE SMALL CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (71 aa), FASTA scores: opt: 345, E(): 1.4e-19, (68.2% identity in 66 aa overlap); Q9F8V3|CUMB COUY PROTEIN (probably involved in the biosynthesis of aminocoumarin antibiotic coumermycin A(1)) (see third citation below) from Streptomyces rishiriensis (71 aa), FASTA scores: opt: 329, E(): 2.2e-18, (63.2% identity in 68 aa overlap); Q9F5J2|SIM-CB MBTH-LIKE PROTEIN (probably protein involved in the biosynthesis of aminocoumarin antibiotic coumermycin A(1)) from Streptomyces antibioticus (70 aa), FASTA scores: opt: 308, E(): 8.4e-17, (65.6% identity in 64 aa overlap); Q9FB14 MBTH-LIKE PROTEIN (involved in the biosynthesis of the antitumor drug bleomycin) (see fourth citation below) from Streptomyces verticillus FASTA scores: opt: 220, E(): 8.8e-10, (41.2% identity in 68 aa overlap); etc. Protein product from Mb2398c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2398c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR005153" /db_xref="InterPro:IPR037407" /db_xref="InterPro:IPR038020" /db_xref="UniProtKB/Swiss-Prot:P59965" /protein_id="SIU01010.1" /translation="MSTNPFDDDNGAFFVLVNDEDQHSLWPVFADIPAGWRVVHGEAS RAACLDYVEKNWTDLRPKSLRDAMAED" CDS complement(2628890..2630185) /codon_start=1 /transl_table=11 /gene="mbtG" /locus_tag="BQ2027_MB2399C" /product="LYSINE-N-OXYGENASE MBTG (L-LYSINE 6-MONOOXYGENASE) (LYSINE N6-HYDROXYLASE)" /note="Mb2399c, mbtG, len: 431 aa. Equivalent to Rv2378c, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 431 aa overlap). mbtG, lysine-N -oxygenase (hydroxylase) (EC 1.14.13.59) showing some similarity with various proteins including ornithine and lysine-N-oxygenases, e.g. Q9K6Q1|TRKA|BH3677 POTASSIUM UPTAKE PROTEIN from Bacillus halodurans (350 aa), FASTA scores: opt: 153, E(): 0.016, (25.2% identity in 246 aa overlap); P56584|SID1_USTMA L-ORNITHINE 5-MONOOXYGENASE (EC 1.13.12.-) from Ustilago maydis (Smut fungus) (570 aa), FASTA scores: opt: 136, E(): 0.31, (22.85% identity in 127 aa overlap); Q9HHV0|HXYA|VNG6214G MONOOXYGENASE from Halobacterium sp. strain NRC-1 (477 aa), FASTA scores: opt: 119, E(): 3.4, (40.0% identity in 70 aa overlap); O69828|SC1A6.23 PUTATIVE LYSINE N-HYDROXLASE (FRAGMENT) from Streptomyces coelicolor (134 aa), BLAST score: 76 (similarity in part for this one); etc. COFACTORS: FAD (BY SIMILARITY). Protein product from Mb2399c detected using SWATH mass spectrometry. Mb2399c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TYQ9" /db_xref="InterPro:IPR025700" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:Q7TYQ9" /protein_id="SIU01011.1" /translation="MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGA NWQASGGWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATASF AEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWALCTHETTVQAD ALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAERVAVIGGGETAASMLNEL FRHRVSTITVISPQVTLFTRGEGFFENSLFSDPTDWAALTFDERRDALARTDRGVFSA TVQEALLADDRIHHLRGRVAHAVGRQGQIRLTLSTNRGSENFETVHGFDLVIDGSGAD PLWFTSLFSQHTLDLLELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGP GFPNLSCLGLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR" CDS complement(2630182..2634567) /codon_start=1 /transl_table=11 /gene="mbtF" /locus_tag="BQ2027_MB2400C" /product="PEPTIDE SYNTHETASE MBTF (PEPTIDE SYNTHASE)" /note="Mb2400c, mbtF, len: 1461 aa. Equivalent to Rv2379c, len: 1461 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1461 aa overlap). mbtF, peptide synthetase (see citations below), similar in part to several synthases e.g. O52820|PCZA363.4 PROTEIN from Amycolatopsis orientalis (4077 aa), FASTA scores: opt: 1873, E(): 1.1e-99, (35.55% identity in 1522 aa overlap); O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 1817, E(): 2.1e-96, (33.65% identity in 1463 aa overlap); O52821 PROTEIN SIMILAR TO PEPTIDE SYNTHETASE from Amycolatopsis orientalis (1860 aa) FASTA scores: opt: 1705, E(): 2.9e-90, (34.75% identity in 1344 aa overlap); Q9XCF2|PSTB PUTATIVE PEPTIDE SYNTHETASE (similar to Mycobacterium tuberculosis nrp protein) from Mycobacterium avium (2552 aa), FASTA scores: opt: 1687, E(): 4e-89, (35.45% identity in 1058 aa overlap); Q9ZET7 PEPTIDE SYNTHETASE (FRAGMENT) from Mycobacterium smegmatis (1438 aa), FASTA scores: opt: 1479, E(): 2.5e-77, (30.45% identity in 1507 aa overlap); etc. Contains PS00455 putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb2400c detected using SWATH mass spectrometry. Mb2400c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y126" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010071" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y126" /protein_id="SIU01012.1" /translation="MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAA EADPYVIAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSAEV LWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVIVAHHIVIDGW SLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRDQTASRAMWADHLNGLDGP TLLSPALADTPVQPGIPGRTEVRLDREATAELADAARTRGVTISTLVQMAWATTLSAF TGRGDVTFGVTVSGRPSELSGVETMIGLFINTVPLRVRLDARATVGGQCAVLQRQFAM LRDHSYLGFNEFRAIAGIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLS HFPVTVAAHRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYRELDALADR LATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVPLDPAMPGERVAEILRQ TSAPVVIDEGVFAASVGADILEDDRAITVPVDQAAYVIFTSGTTGTPKGVIGTHRALS AYADDHIERVLRPAAQRLGRPLRIAHAWSFTFDAAWQPLVALLDGHAVHIVDDHRQRD AGALVEAIDRFGLDMIDTTPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQN CARTAMTAFNCYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVA GELYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLEFLGRSD DQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAYVAGGPQPPPVAELR AMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALAAINVTEGPATPPQTPTELVLAE AFADVMETSNVDVTAGFLQMGLDSIVALSVVQAARRRGIALRARLMVECDTIRELAAA IDSDAAWQAPANDAGEPIPVLPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLA AVVDGHEVLRCRFDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGVLASLDPQAG RLLSAVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPARENT SYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVGELAITMSISDAD LTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAPLLALETHGRADVHVDKTADT SDTVGLLSAIYPLRIHCDGATDFARIPGSGIDYGLLRYLRADTAERLRAHREPQLLLN YLGSLHVGVGDLAVDRALLADVGQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTL PDILSADDVATLQSLWQGALAEITA" CDS complement(2634549..2639597) /codon_start=1 /transl_table=11 /gene="mbtE" /locus_tag="BQ2027_MB2401C" /product="PEPTIDE SYNTHETASE MBTE (PEPTIDE SYNTHASE)" /note="Mb2401c, mbtE, len: 1682 aa. Equivalent to Rv2380c, len: 1682 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1682 aa overlap). mbtE, peptide synthetase (see citations below), similar in part to several synthases e.g. O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 2635, E(): 1.9e-146, (36.8% identity in 1657 aa overlap); O05647|SNBDE VIRGINIAMYCIN S SYNTHETASE (FRAGMENT) from Streptomyces virginiae (1997 aa) FASTA scores: opt: 2580, E(): 1.6e-143, (40.65% identity in 1163 aa overlap); Q9R9I2|DHBF PROTEIN INVOLVED IN SIDEROPHORE PRODUCTION from Bacillus subtilis (2378 aa), FASTA scores: opt: 2388, E(): 3.6e-132, (33.9% identity in 1579 aa overlap); O68487|ACMB ACTINOMYCIN SYNTHETASE II from Streptomyces chrysomallus (2611 aa), FASTA scores: opt: 2165, E(): 4.9e-119, (35.0% identity in 1634 aa overlap); etc. Equivalent to AAK46743 from Mycobacterium tuberculosis strain CDC1551 (1787 aa) but shorter 105 aa. Contains PS00455 putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb2401c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2401c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Z1" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010071" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z1" /protein_id="SIU01013.1" /translation="MWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRI LRTTYPVGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFCAPFELSR DAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAYSRADLGADLGPEHRP SAASGPDTTEADLNYWRAIMADPPEPLELPGPAGTCVPTSWRAARATLRLPADTAARV ATMAKNTGCTPYMVLLAAFGALVHRYTHSDDFLVAAPVLNRGAGTEDAIGYFGNTVAM RLRPQSAMSFRELLTATRDIASGAFAHQRINLDRVVRELNPDRRHGAERMTRVSFGFR EPDGGGFNPPGIECERYDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQM LRHFGVLLDNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLVNEQTT RTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVLLDKSPDLIVTALG VVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPVRELAGYRSDDPTDADRIRPLR PDNTAYLIYTSGTTGLPKGVAVPHRPVAEYFVWFKGEYDVDDTDRLLQVASPSFDVSI AEIFGTLACGARMVIPRPGGLTDIGYLTALLRDEGITAMHFVPSLLGLFLSLPGVSQW RTLQRVPIGGEPLPGEVADKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPI GRPKINTTMHLLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFNPGS RMYRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDPTVGQAVVVVSD LPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAALPEYMLPAAYVVLDEIPITA HGKIDRAALPEPQIASDTEFRAPQTATERRLAQLFGELLGRDRVGADDSFFDLGGHSL LATKLVAAVRNAFGVDVGVREIFEFATVTALAGHIDTLDSDSARPRLTRVDHDGPVRL SSSQMRSWFNYRFDGPNAVNNIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREI GGVPHQIIQPPAEVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQT VLSLVVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYSVWQSALLDD GAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDAVEFRLGAAIRDKLAAVS RDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAPVAGRSEANLDQLIGFFINIVVLRND LRGNPTLREVLQRTRQMALAAYAHQDLPFDQVVEAVNPQRSLSRNPLFDIVVHVREQM PQDHVIDTGPDGDTTLRVLEPTFDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQ RFADWLVRVVEAFADRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPV PVGVVGDVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGVWKADGQL ELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEVGRYDDFFNLGGDSIL ATQVAARARDGGIPLTARMVFEHPVLCELAAAVDAKPHVEAEPDDKHHAPMSTSGLSP DELSALTASWDQWP" CDS complement(2639737..2642751) /codon_start=1 /transl_table=11 /gene="mbtD" /locus_tag="BQ2027_MB2402C" /product="POLYKETIDE SYNTHETASE MBTD (POLYKETIDE SYNTHASE)" /note="Mb2402c, mbtD, len: 1004 aa. Equivalent to Rv2381c, len: 1004 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1004 aa overlap). mbtD, polyketide synthase (see citations below), similar in part to several synthases e.g. Q03132|ERY2_SACER|ERYA ERYTHRONOLIDE SYNTHASE, MODULES 3 AND 4 (EC 2.3.1.94) from Saccharopolyspora erythraea (Streptomyces erythraeus) (3567 aa), FASTA scores: opt: 971, E(): 1e-46, (29.35% identity in 1043 aa overlap); Q9F829|MEGAII MEGALOMICIN 6-DEOXYERYTHRONOLIDE B SYNTHASE 2 from Micromonospora megalomicea subsp. nigra (3562 aa), FASTA scores: opt: 787, E(): 2.4e-36, (29.35% identity in 1032 aa overlap); Q9L4W4|NYSB POLYKETIDE SYNTHASE from Streptomyces noursei (3192 aa), FASTA scores: opt: 761, E(): 6.6e-35, (29.55% identity in 1086 aa overlap); O30764|NIDA1 POLYKETIDE SYNTHASE MODULES 1 AND 2 from Streptomyces caelestis (4340 aa), FASTA scores: opt: 726, E(): 7.8e-33, (27.3% identity in 1052 aa overlap); etc. Contains PS00012 Phosphopantetheine attachment site. Protein product from Mb2402c detected using SWATH mass spectrometry. Mb2402c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y380" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/TrEMBL:A0A1R3Y380" /protein_id="SIU01014.1" /translation="MAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTE VARQLRKTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAPRQAFVFP GQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYLIAPPGTDERQAFCEI EIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEVAAAYLAGSITLSDAVAVVAARAN VVGRLPGRYAVAALGIGEQDASALIATTGGWLELSVVNASSTVAVSGERQAVAAIVDT VRSSGHFARGITVGFPVHTSVLESLRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGT TFGDYWYANLRHTVRFDRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGP AVLVGSARRGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNAPMRAV PMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGAHRALAQTLCAAID SHPDTELSAARDAELILVIAPDFEHTDAVRAAGALADLVGAGLLDYPMHIGARCQSVC LVTVGAEQVDAADAVPSAGQAALAAMHRSIGFEHPEQTFSHLDLPSWDLDPVLGVSVI TAVLRGFGETALRGSVNGYTLFERTLADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHY ARYLAEHGARRIVLLSRRAADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGG VGASLIVHAAGSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLCSSV MGVWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAGEPARGIADAVT IARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARLQMLLDSRQFERYEGPTDPN LTIVDAVRTQLAAVLGIPQAGEVNLQESLFDLGVDSMLALDLRNRLKRSIGATVSLAT LMGDITGDGLVAKLEDADERSHTAQKVDISRD" CDS complement(2642751..2644085) /codon_start=1 /transl_table=11 /gene="mbtC" /locus_tag="BQ2027_MB2403C" /product="POLYKETIDE SYNTHETASE MBTC (POLYKETIDE SYNTHASE)" /note="Mb2403c, mbtC, len: 444 aa. Equivalent to Rv2382c, len: 444 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 444 aa overlap). mbtC, polyketide synthase (see citations below), similar in part to several synthases e.g. Q9F7T9 AVERMECTIN POLYKETIDE SYNTHASE (FRAGMENT) from Streptomyces avermitilis (3626 aa), FASTA scores: opt: 1458, E(): 7e-82, (50.65% identity in 446 aa overlap); AAG23264|SPNA POLYKETIDE SYNTHASE LOADING AND EXTENDER MODULE 1 from Saccharopolyspora spinosa (2595 aa) FASTA scores: opt: 1441, E(): 6e-81, (49.1% identity in 446 aa overlap); O33954|TYLG TYLACTONE SYNTHASE STARTER MODULE AND MODULES 1 & 2 from Streptomyces fradiae (4472 aa) FASTA scores: opt: 1439, E(): 1.2e-80, (51.0% identity in 447 aa overlap); O30764|NIDA1 POLYKETIDE SYNTHASE MODULES 1 AND 2 from Streptomyces caelestis (4340 aa) FASTA scores: opt: 1432, E(): 3.3e-80, (50.9% identity in 442 aa overlap); etc. Protein product from Mb2403c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y1U7" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020841" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U7" /protein_id="SIU01015.1" /translation="MSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDR GWALRELFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVGLRVA WRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLITGTSLGVISGRIA YTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLALAGGVCVMGTPGYFVEFSKQ HALSDDGHCRPYSAHASGTAWAEGAAMFLLQRRSRATADRRRVLAEVRASCLNSDGLS DGLTAPSGDAQTRLLRRAIAQAAVVPADVGMVEGHGTATRLGDRTELRSLAASYGTAP AGRGPLLGSVKSNIGHAQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGL RLADKLTPWRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV" CDS complement(2644075..2648319) /codon_start=1 /transl_table=11 /gene="mbtB" /locus_tag="BQ2027_MB2404C" /product="PHENYLOXAZOLINE SYNTHASE MBTB (PHENYLOXAZOLINE SYNTHETASE)" /note="Mb2404c, mbtB, len: 1414 aa. Equivalent to Rv2383c, len: 1414 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1414 aa overlap). mbtB, phenyloxazoline synthase (see citations below), similar to the N-terminal region of several synthetases e.g. Q9EWP5|SC4C2.17 PUTATIVE NON-RIBOSOMAL PEPTIDE SYNTHASE from Streptomyces coelicolor (2229 aa), FASTA scores: opt: 2878, E(): 4.1e-156, (46.85% identity in 1138 aa overlap); Q9Z399|IRP2 YERSINIABACTIN BIOSYNTHETIC from Yersinia pestis (2041 aa), FASTA scores: opt: 2297, E(): 5.3e-123, (38.55% identity in 1069 aa overlap); P48633|HMP2_YEREN|IRP2 HIGH-MOLECULAR-WEIGHT PROTEIN 2 (MAY BE INVOLVED IN THE NONRIBOSOMAL SYNTHESIS OF SMALL PEPTIDES) from Yersinia enterocolitica (2035 aa), FASTA scores: opt: 2275, E(): 9.4e-122, (38.45% identity in 1069 aa overlap); O85739|PCHE|PA4226 DIHYDROAERUGINOIC ACID SYNTHETASE from Pseudomonas aeruginosa (1438 aa) FASTA scores: opt: 2236, E(): 1.2e-119, (38.2% identity in 1330 aa overlap); Q9RFM8|PCHE PYOCHELIN SYNTHETASE from Pseudomonas aeruginosa (1438 aa), FASTA scores: opt: 2229, E(): 3e-119, (38.0% identity in 1329 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb2404c detected using SWATH mass spectrometry." /db_xref="GOA:Q7TYQ4" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR001031" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010071" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TYQ4" /protein_id="SIU01016.1" /translation="MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMS LVGRWRRKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPFPL APMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALRHPMLRVQFLP DGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDAKSHQQLDGAVFELALTLL PGERTRLHVDLDMQAADAMSYRILLADLAALYDGREPPALGYTYQEYRQAIEAEETLP QPVRDADRDWWAQRIPQLPDPPALPTRAGGERDRRRSTRRWHWLDPQTRDALFARARA RGITPAMTLAAAFANVLARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVD LTGARTAAARAQAVQEALRSAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPGVIDAMFTH QVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPSTEALHDGFFRQAQQQP DAPAVFASSGDLSYAQLRDQASAVAAALRAAGLRVGDTVAVLGPKTGEQVAAVLGILA AGGVYLPIGVDQPRDRAERILATGSVNLALVCGPPCQVRVPVPTLLLADVLAAAPAEF VPGPSDPTALAYVLFTSGSTGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATL ECDMSVLDIFAALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEV GGGRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFEVQDAAN LPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGIARGYRGRPELTAER FVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKISGYRVELGEIEAALQRLPGVHA AAATVLPGGSDVLAAAVCVDDAGVTAESIRQQLADLVPAHMIPRHVTLLDRIPFTDSG KIDRAEVGALLAAEVERSGDRSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFF ALGGDSVLATQVVAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAE VYLEIANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAAAAYR WLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDWHLTAPLTLFGHC MGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASGPLPTADRDVLADMVDLGGTD PVLLEDEEFVELLVPAVKADYRALSGYSCPPDVRIRANIHAVGGNRDHRISREMLTSW ETHTSGRFTLSHFDGGHFYLNDHLDAVARMVSADVR" CDS 2648418..2650115 /codon_start=1 /transl_table=11 /gene="mbtA" /locus_tag="BQ2027_MB2405" /product="BIFUNCTIONAL ENZYME MBTA: SALICYL-AMP LIGASE (SAL-AMP LIGASE) + SALICYL-S-ArCP SYNTHETASE" /note="Mb2405, mbtA, len: 565 aa. Equivalent to Rv2384, len: 565 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 565 aa overlap). mbtA, bifunctional enzyme, including salicyl-AMP ligase (Sal-AMP ligase) (EC 6.-.-.-) and salicyl-S-ArCP synthetase (see first and second citations below), highly similar to other ligases e.g. Q9F638|MXCE from Stigmatella aurantiaca 2,3-DHBA-AMP ligase (protein involved in the biosynthesis of 2,3-dihydroxybenzoic acid, contains the AMP binding signature) (543 aa), FASTA scores: opt: 1683, E(): 2.8e-90, (48.25% identity in 545 aa overlap) (see third citation below); P40871|DHBE_BACSU|ENTE 2,3-DIHYDROXYBENZOATE-AMP LIGASE (EC 6.3.2.-) from Bacillus subtilis (539 aa), FASTA scores: opt: 1569, E(): 1.2e-83, (44.9% identity in 532 aa overlap); O07899|VIBE_VIBCHVC0772 VIBRIOBACTIN-SPECIFIC 2,3-DIHYDROXYBENZOATE-AMP LIGASE from Vibrio cholerae (543 aa), FASTA scores: opt: 1457, E(): 3.7e-77, (44.6% identity in 545 aa overlap); etc. Also similar to P95819|SNBA PRISTINAMYCIN I SYNTHETASE I from Streptomyces pristinaespiralis (582 aa), FASTA scores: opt: 1532, E(): 1.7e-81, (46.35% identity in 548 aa overlap); and Q9RFM9|PCHD SALICYL-AMP LIGASE from Pseudomonas aeruginosa (547 aa), FASTA scores: opt: 1415, E(): 1e-74, (45.95% identity in 533 aa overlap). Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /db_xref="GOA:A0A1R3Y1F4" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F4" /protein_id="SIU01017.1" /translation="MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDT VLSDAARRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLLQL PNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVADVASGFDYRPM ARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAPPADPGSPALLLVSGGTTG MPKLIPRTHDDYVFNATASAALCRLSADDVYLVVLAAGHNFPLACPGLLGAMTVGATA VFAPDPSPEAAFAAIERHGVTVTALVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLE PEDARRVRTALTPGLQQVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNAD GEPVGPGEEGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVVFAGAPITL AELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVRQLGIATGPVTTQRCH" CDS 2650211..2651131 /codon_start=1 /transl_table=11 /gene="mbtJ" /locus_tag="BQ2027_MB2406" /product="PUTATIVE ACETYL HYDROLASE MBTJ" /note="Mb2406, mbtJ, len: 306 aa. Equivalent to Rv2385, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 306 aa overlap). Putative mbtJ, acetyl hydrolase (EC 3.1.1.-) (see citations below), showing some similarity with various hydrolases including acetyl hydrolases e.g. Q9ZBM4|MLCB1450.08|ML0314 PUTATIVE HYDROLASE/ESTERASE from Mycobacterium leprae (335 aa), FASTA scores: opt: 449, E(): 6.7e-21, (33.85% identity in 313 aa overlap); AAK47950|MT3591 Esterase from M. tuberculosis strain CDC1551 (327 aa), FASTA scores: opt: 469, E(): 3.6e-22, (35% identity in 283 aa overlap); Q9X8J4|SCE9.22 PUTATIVE ESTERASE from Streptomyces coelicolor (266 aa), FASTA scores: opt: 430,E(): 8.5e-20, (38% identity in 245 aa overlap); Q01109|BAH_STRHY ACETYL-HYDROLASE (EC 3.1.1.-) from Streptomyces hygroscopicus (299 aa), FASTA scores: opt: 420, E(): 4e-19, (35.1% identity in 265 aa overlap). Equivalent to AAK46748 from Mycobacterium tuberculosis strain CDC1551 (327 aa) but shorter 21 aa. Note that previously known as lipK. Protein product from Mb2406 detected using SWATH mass spectrometry. Mb2406 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y107" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y107" /protein_id="SIU01018.1" /translation="MVLRPITGAIPPDGPWGIWASRRIIAGLMGTFGPSLAGTRVEQV NSVLPDGRRVVGEWVYGPHNNAINAGPGGGAIYYVHGSGYTMCSPRTHRRLTSWLSSL TGLPVFSVDYRLAPRYRFPTAATDVRAAWDWLAHVCGLAAEHMVIAADSAGGHLTVDM LLQPEVAARPPAAVVLFSPLIDLTFRLGASRELQRPDPVVRADRAARSVALYYTGVDP AHHRLALDVAGGPPLPPTLIQVGGAEILEADARQLDADIRAAGGICELQVWPDQMHVF QALPRMTPEAAKAMTYVAQFIRSTTARGDL" CDS complement(2651135..2652487) /codon_start=1 /transl_table=11 /gene="mbtI" /locus_tag="BQ2027_MB2407C" /product="isochorismate synthase mbti" /note="Mb2407c, mbtI, len: 450 aa. Equivalent to Rv2386c, len: 450 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 450 aa overlap). Putative mbtI, isochorismate synthase (see citations below), similar to Q9X9I8|IRP9 SALICYLATE SYNTHETASE from Yersinia enterocolitica (434 aa), FASTA scores: opt: 887, E(): 7.5e-48, (37.45% identity in 422 aa overlap); and similar in C-terminal region to many anthranilate synthases component I (EC 4.1.3.27) e.g. Q9Z4W7|TRPE_STRCO|SCE8.07c from Streptomyces coelicolor (511 aa), FASTA scores: opt: 509, E(): 3e-24, (40.4% identity in 255 aa overlap); P33975|TRPE_HALVO from Halobacterium volcanii (Haloferax volcanii) (523 aa) FASTA scores: opt: 488, E(): 6.2e-23, (34.2% identity in 298 aa overlap); and similar to Q08653|TRPE_THEMA|TM0142 ANTHRANILATE SYNTHASE COMPONENT I from Thermotoga maritima (461 aa), FASTA scores: opt: 478, E(): 2.3e-22, (28.4% identity in 440 aa overlap); etc. COULD BE BELONG TO THE ANTHRANILATE SYNTHASE COMPONENT I FAMILY. Note that previously known as trpE2, an anthranilate synthase component I (EC 4.1.3.27). Protein product from Mb2407c detected using SWATH mass spectrometry." /db_xref="GOA:Q7TYQ1" /db_xref="InterPro:IPR005801" /db_xref="InterPro:IPR015890" /db_xref="InterPro:IPR019996" /db_xref="InterPro:IPR019999" /db_xref="UniProtKB/Swiss-Prot:Q7TYQ1" /protein_id="SIU01019.1" /translation="MSELSVATGAVSTASSSIPMPAGVNPADLAAELAAVVTESVDED YLLYECDGQWVLAAGVQAMVELDSDELRVIRDGVTRRQQWSGRPGAALGEAVDRLLLE TDQAFGWVAFEFGVHRYGLQQRLAPHTPLARVFSPRTRIMVSEKEIRLFDAGIRHREA IDRLLATGVREVPQSRSVDVSDDPSGFRRRVAVAVDEIAAGRYHKVILSRCVEVPFAI DFPLTYRLGRRHNTPVRSFLLQLGGIRALGYSPELVTAVRADGVVITEPLAGTRALGR GPAIDRLARDDLESNSKEIVEHAISVRSSLEEITDIAEPGSAAVIDFMTVRERGSVQH LGSTIRARLDPSSDRMAALEALFPAVTASGIPKAAGVEAIFRLDECPRGLYSGAVVML SADGGLDAALTLRAAYQVGGRTWLRAGAGIIEESEPEREFEETCEKLSTLTPYLVARQ " CDS 2652940..2653149 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2407A" /note="unnamed protein product; Mb2407A, len: 69 aa. No equivalent in M. tuberculosis H37Rv. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions,Mb2407A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3Y121" /protein_id="SIU01020.1" /translation="MFVIRLADGEEVHGECDELTINPATGVLTVCRVDGFEETTTHYS PSAWRSVTHRKRGVGVRPSLVSTAQ" CDS 2653247..2654500 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2408" /product="putative sodium-dependent bicarbonate transporter" /note="Mb2408, -, len: 417 aa. Equivalent to Rv2387, len: 417 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 417 aa overlap). Conserved hypothetical protein, showing some similarities with others e.g. Q9K663|BH3869 HYPOTHETICAL PROTEIN from Bacillus halodurans (337 aa), FASTA scores: opt: 343, E(): 4.8e-14, (29.0% identity in 400 aa overlap); AAK25471|CC3509 HYPOTHETICAL PROTEIN from Caulobacter crescentus (365 aa), FASTA scores: opt: 282, E(): 3.2e-10, (32.6% identity in 399 aa overlap); P73953|SLR1512 [D90911_21] CONSERVED HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC6803 (374 aa), FASTA scores: opt: 230, E(): 5.5e-07; (24.75% identity in 408 aa overlap); etc. Contains PS00213 Lipocalin signature. Mb2408 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y132" /db_xref="InterPro:IPR010293" /db_xref="UniProtKB/TrEMBL:A0A1R3Y132" /protein_id="SIU01021.1" /translation="MLHEFWVNFTHNLFKPLLLFFYFGFLIPIFKVRFEFPYVLYQGL TLYLLLAIGWHGGEELAKIKPSNVGAIVGFMVVGFALNFVIGTLAYFLLSKLTAMRRV DRATVAGYYGSDSAGTFATCVAVLTSVGMAFDAYMPVMLAVMEIPGCLVALYLVARLR HRGMNEAGYMADEPGYTTAAMIGAGPGTPARPAHSDSLTAQAERGIEEELELSLEKRE HPNWDEDGVKDSGTNASIFSRELLQEVFLNPGLVLLFGGIVIGLISGLQGQKVLHDDD NFFVAAFQGVLCLFLLEMGMTASRKLKDLASAGSGFVFFGLLAPNLFATLGIIVAHGY AYVTNNDFAPGTYVLFAVLCGAASYIAVPAVQRLAIPEASPTLPLAASLGLTFSYNVT IGIPLYIEIARIVGQWFPATGASIG" CDS complement(2654497..2655624) /codon_start=1 /transl_table=11 /gene="hemN" /locus_tag="BQ2027_MB2409C" /product="PROBABLE OXYGEN-INDEPENDENT COPROPORPHYRINOGEN III OXIDASE HEMN (COPROPORPHYRINOGENASE) (COPROGEN OXIDASE)" /note="Mb2409c, hemN, len: 375 aa. Equivalent to Rv2388c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 375 aa overlap). Probable hemN, oxygen-independent coproporphyrinogen III oxidases (EC 1.3.3.-), highly similar to many PUTATIVE OXYGEN-INDEPENDENT COPROPORPHYRINOGEN III OXIDASES e.g. Q9RDD2|SCC77.26 from Streptomyces coelicolor (435 aa), FASTA scores: opt: 1358, E(): 1.5e-76, (56.55% identity in 382 aa overlap); BAB51237|MLR4627 from Rhizobium loti (Mesorhizobium loti) (392 aa), FASTA scores: opt: 696, E(): 1.1e-35, (36.8% identity in 383 aa overlap); Q9KUR0|VC0455 from Vibrio cholerae (391 aa), FASTA scores: opt: 691, 2.2e-35, (32.65% identity in 386 aa overlap); P54304|HEMN_BACSU from Bacillus subtilis (366 aa), FASTA scores: opt: 668, E(): 5.6e-34; (34.9% identity in 327 aa overlap); etc. Equivalent to AAK46752 from Mycobacterium tuberculosis strain CDC1551 (390 aa) but shorter 375 aa. BELONGS TO THE ANAEROBIC COPROPORPHYRINOGEN III OXIDASE FAMILY. Protein product from Mb2409c detected using SWATH mass spectrometry. Mb2409c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y134" /db_xref="InterPro:IPR004559" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR023404" /db_xref="InterPro:IPR034505" /db_xref="UniProtKB/TrEMBL:A0A1R3Y134" /protein_id="SIU01022.1" /translation="MPGQPFGVYLHVPFCLTRCGYCDFNTYTPAQLGGVSPDRWLLAL RAELELAAAKLDAPTVHTVYVGGGTPSLLGGERLATLLDMVRDHFVLAPDAEVSTEAN PESTWPEFFATIRAAGYTRVSLGMQSVAPRVLATLDRVHSPGRAAAAATEAIAEGFTH VNLDLIYGTPGESDDDLVRSVDATVQAGVDHVSAYALVVEHGTALARRVRRGELAAPD DDVLAHRYELVDARLSAAGFAWYEVSNWCRPGGECRHNLGYWDGGQWWGAGPGAHGYI GVTRWWNVKHPNTYAEILAGATLPVAGFEQLGADALHTEDVLLKVRLRQGLPLARLGA AERERAEAVLADGLLDYHGDRLVLTGRGRLLADAVVRTLLG" CDS complement(2655730..2656194) /codon_start=1 /transl_table=11 /gene="rpfD" /locus_tag="BQ2027_MB2410C" /product="PROBABLE RESUSCITATION-PROMOTING FACTOR RPFD" /note="Mb2410c, rpfD, len: 154 aa. Equivalent to Rv2389c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Probable rpfD, resuscitation-promoting factor. Possible autocrine and/or paracrine bacterial growth factor or cytokine (see citation below). Similar to others from Mycobacterium tuberculosis e.g. O07747|Rv1884c|MTCY180.34|RPFC PROBABLE RESUSCITATION-PROMOTING FACTOR from Mycobacterium tuberculosis (176 aa), FASTA scores: opt: 382, E(): 2.3e-17, (55.45% identity in 101 aa overlap); etc. Also similarity with Q9CBF8|ML2030 HYPOTHETICAL PROTEIN from Mycobacterium leprae (157 aa), FASTA scores: opt: 397, E(): 2.4e-18, (47.95% identity in 121 aa overlap); Q9F2Q2|SCE41.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 341, E(): 1.1e-14, (40.45% identity in 131 aa overlap); and O86308|Z96935|MLRPF_1 RPF PROTEIN PRECURSOR from Micrococcus luteus (220 aa), FASTA scores: opt: 301, E(): 3.6e-12, (39.4% identity in 132 aa overlap). Contains a secretory signal sequence in N-terminus. Supposed acts at very low concentration. Mb2410c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y0Z9" /db_xref="InterPro:IPR010618" /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y0Z9" /protein_id="SIU01023.1" /translation="MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLST ISSKADDIDWDAIAQCESGGNWAANTGNGLYGGLQISQATWDSNGGVGSPAAASPQQQ IEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFLAAETGGCSGSRDD" CDS complement(2656191..2656748) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2411C" /product="FIG00821219: MCE associated membrane protein" /note="Mb2411c, -, len: 185 aa. Equivalent to Rv2390c, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 185 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis proteins Q11032|YD62_MYCTU|MTCY02B10.26c|Rv1362c hypothetical 23.5 kd protein (220 aa), FASTA scores: opt: 223, E(): 2.1e-07, (27.4% identity in 190 aa overlap); and Q11033|YD63_MYCTU|MTCY02B10.27c|Rv1363c hypothetical 28.3 kd protein (261 aa), FASTA scores: opt: 238, E(): 2.7e-08, (27.6% identity in 163 aa overlap)." /db_xref="GOA:A0A1R3Y390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y390" /protein_id="SIU01024.1" /translation="MAIFGRGHGASEPGGTGEPAETPGRGRLTRSVTGWVGAVAVVVS LAGSGWCGWVLFEKHQTDVAAGQALQAARSYVVKLATMDCERIDHNMRDILEGSTGEF KDKYGKSSAHLRQLLADNRVATHGTVVAASVKSATTNKVVVLMFIDQSVSNRNSPTPQ IDRSRIKVIMDKVNGRWLASKVELL" CDS 2657161..2658852 /codon_start=1 /transl_table=11 /gene="sira" /locus_tag="BQ2027_MB2412" /product="ferredoxin-dependent sulfite reductase sira" /note="Mb2412, nirA, len: 563 aa. Equivalent to Rv2391, len: 563 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 563 aa overlap). Probable nirA, ferredoxin-dependant nitrite reductase (EC 1.7.7.1), similar to many nitrate reductases e.g. CAC33947|SCBAC1A6.26c Putative nitrite/sulphite reductase from Streptomyces coelicolor (565 aa), FASTA scores: opt: 2335, E(): 1.2e-137, (60.1% identity in 567 aa overlap); Q9RZD6|DRA0013 FERREDOXIN-NITRITE REDUCTASE from Deinococcus radiodurans (563 aa), FASTA scores: opt: 1141, E(): 2.2e-63, (39.6% identity in 533 aa overlap); Q59656|NIRA (D31732|PEENIRNRT_1) ferredoxin-dependant* NITRITE REDUCTASE (*: see citation below) from Plectonema boryanum (654 aa), FASTA scores: opt: 805, E(): 1.9e-42, (31.7% identity in 517 aa overlap); Q55366|NIRA|SLR0898 FERREDOXIN-NITRITE REDUCTASE from Synechocystis sp. strain PCC 6803 (502 aa), FASTA scores: opt: 799, E(): 3.7e-42, (32.3% identity in 517 aa overlap). Highly similar (only in N-terminal part because shortened protein (fragment) owing to an IS900 insertion) to Q9K541|NIRA NITRATE REDUCTASE (FRAGMENT) from Mycobacterium paratuberculosis (198 aa), FASTA scores: opt: 798, E(): 2.1e-42, (65.4% identity in 182 aa overlap). Protein product from Mb2412 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2412 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYP6" /db_xref="InterPro:IPR005117" /db_xref="InterPro:IPR006066" /db_xref="InterPro:IPR006067" /db_xref="InterPro:IPR036136" /db_xref="UniProtKB/Swiss-Prot:Q7TYP6" /protein_id="SIU01025.1" /translation="MSAKENPQMTTARPAKARNEGQWALGHREPLNANEELKKAGNPL DVRERIENIYAKQGFDSIDKTDLRGRFRWWGLYTQREQGYDGTWTGDDNIDKLEAKYF MMRVRCDGGALSAAALRTLGQISTEFARDTADISDRQNVQYHWIEVENVPEIWRRLDD VGLQTTEACGDCPRVVLGSPLAGESLDEVLDPTWAIEEIVRRYIGKPDFADLPRKYKT AISGLQDVAHEINDVAFIGVNHPEHGPGLDLWVGGGLSTNPMLAQRVGAWVPLGEVPE VWAAVTSVFRDYGYRRLRAKARLKFLIKDWGIAKFREVLETEYLKRPLIDGPAPEPVK HPIDHVGVQRLKNGLNAVGVAPIAGRVSGTILTAVADLMARAGSDRIRFTPYQKLVIL DIPDALLDDLIAGLDALGLQSRPSHWRRNLMACSGIEFCKLSFAETRVRAQHLVPELE RRLEDINSQLDVPITVNINGCPNSCARIQIADIGFKGQMIDDGHGGSVEGFQVHLGGH LGLDAGFGRKLRQHKVTSDELGDYIDRVVRNFVKHRSEGERFAQWVIRAEEDDLR" CDS 2658849..2659613 /codon_start=1 /transl_table=11 /gene="cysH" /locus_tag="BQ2027_MB2413" /product="probable 3'-phosphoadenosine 5'-phosphosulfate reductase cysh (paps reductase, thioredoxin dep.) (padops reductase) (3'-phosphoadenylylsulfate reductase) (paps sulfotransferase)" /note="Mb2413, cysH, len: 254 aa. Equivalent to Rv2392, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). Probable cysH, 3'-phosphoadenosine 5'-phosphosulfate reductase (EC 1.8.4.8), similar to many e.g. P94498|O34620|CYH1_BACSU|CYSH from Bacillus subtilis (233 aa), FASTA scores: opt: 618, E(): 8.1e-32, (46.5% identity in 202 aa overlap); Q9KCT3|CYSH|BH1486 from Bacillus halodurans (231 aa), FASTA scores: opt: 560, E(): 3.6e-28, (41.3% identity in 230 aa overlap); P56860|CYSH_DEIRA from Deinococcus radiodurans (255 aa), FASTA scores: opt: 489, E(): 1.1e-23, (44.7% identity in 190 aa overlap); etc. BELONGS TO THE PAPS REDUCTASE FAMILY and CYSH SUBFAMILY. Note that operon cysA-cysW-cysT-subI, probably involved in sulfate transport, is near this putative ORF. Protein product from Mb2413 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2413 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65669" /db_xref="InterPro:IPR002500" /db_xref="InterPro:IPR004511" /db_xref="InterPro:IPR011798" /db_xref="InterPro:IPR014729" /db_xref="UniProtKB/Swiss-Prot:P65669" /protein_id="SIU01026.1" /translation="MSGETTRLTEPQLRELAARGAAELDGATATDMLRWTDETFGDIG GAGGGVSGHRGWTTCNYVVASNMADAVLVDLAAKVRPGVPVIFLDTGYHFVETIGTRD AIESVYDVRVLNVTPEHTVAEQDELLGKDLFARNPHECCRLRKVVPLGKTLRGYSAWV TGLRRVDAPTRANAPLVSFDETFKLVKVNPLAAWTDQDVQEYIADNDVLVNPLVREGY PSIGCAPCTAKPAEGADPRSGRWQGLAKTECGLHAS" CDS 2659610..2660455 /codon_start=1 /transl_table=11 /gene="che1" /locus_tag="BQ2027_MB2414" /product="ferrochelatase che1" /note="Mb2414, -, len: 281 aa. Equivalent to Rv2393, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 281 aa overlap). Conserved hypothetical protein, with some similarity to Q9L2E8|SC7A8.10c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (274 aa), FASTA scores: opt: 407, E(): 2.8e-18, (37% identity in 246 aa overlap); CAC38793|SCI39.05 Conserved hypothetical protein from Streptomyces coelicolor (305 aa), FASTA scores: opt: 394, E(): 2e-17, (35.0% identity in 251 aa overlap); AAK44492|MT0272 Chalcone/stilbene synthase family protein from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 350, E(): 9.2e-15, (34.0% identity in 235 aa overlap); P95216|Rv0259c|MTCY06A4.03c|Z86089 hypothetical protein from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 345, E(): 1.9e-14,(33.6% identity in 235 aa overlap). Mb2414 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1G4" /db_xref="InterPro:IPR002762" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G4" /protein_id="SIU01027.1" /translation="MTAPATMQSAAMLRSGAIEAPPATMQSAAMRWGHLPLAEESGTI APQLVLTAHGSKDPRSAANARAIAGRLARMRPGLDVRVAFCELNSPNLVDVLNRCRGA AVVTPLLLADAYHARVDIPAQIASCRVGHRVRQASVLGEDIRLVSALHERLTELGVSP FDHTLGVVVLAIGSSHPAANARTSTVASRLAEGTQWAAVTTAFITRPEASLADATDRL RRHGARRMVIAPWLLAPGILSDRVRGYAREAGIAMAQPLGAHPMVAATMWDRYRQAVA GRIAA" CDS 2660492..2662423 /codon_start=1 /transl_table=11 /gene="ggtB" /locus_tag="BQ2027_MB2415" /product="PROBABLE GAMMA-GLUTAMYLTRANSPEPTIDASE PRECURSOR GGTB (GAMMA-GLUTAMYLTRANSFERASE) (GLUTAMYL TRANSPEPTIDASE)" /note="Mb2415, ggtB, len: 643 aa. Equivalent to Rv2394, len: 643 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 643 aa overlap). Probable ggtB, gamma-glutamyltranspeptidase precursor (EC 2.3.2.2), similar to many e.g. Q9KVF2|VC0194 from Vibrio cholerae (588 aa), FASTA scores: opt: 943, E(): 7.5e-47, (40.0% identity in 597 aa overlap); O69935|SC3C8.26 from Streptomyces coelicolor (603 aa), FASTA scores: opt: 822, E(): 7.2e-40, (33.6% identity in 622 aa overlap); P54422|GGT_BACSU from Bacillus subtilis (587 aa) FASTA scores: opt: 491, E(): 8.2e-21, (33.4% identity in 574 aa overlap); etc. Has potential signal peptide and appropriately positioned prokaryotic lipoprotein attachment site (PS00013). Protein product from Mb2415 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2415 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y119" /db_xref="InterPro:IPR029055" /db_xref="UniProtKB/TrEMBL:A0A1R3Y119" /protein_id="SIU01028.1" /translation="MSVWLRAGALVAAVMLSLSGCGGFHAGAPSTAGPCEIVPNGTPA PKTPPATVPSSRNLATNPEIATGYRRDMTVVRTAHYAAATANPLATQVACRVLRDGGT AADAVVAAQAVLGLVEPQSSGIGGGGYLVYFDARTGSVQAYDGREVAPAAATENYLRW VSDVDRSAPRPNARASGRSIGVPGILRMLEMVHNEHGRTPWRDLFGPAVTLADGGFDI SARMGAAISDAAPQLRDDPEARKYFLNPDGSPKPAGTRLTNPAYSKTLSAIASAGANA FYSGDIAHDIVAAASDTSNGRTPGLLTIEDLAGYLAKRRQPLCTTYRGREICGMPSSG GVAVAATLGILEHFPMSDYAPSKVDLNGGRPTVMGVHLIAEAERLAYADRDQYIADVD FVQLPGGSLTTLVDPGYLAARAALISPQHSMGSARPGDFGAPTAVAPPVPEHGTSHLS VVDSYGNAATLTTTVESSFGSYHLVDGFILNNQLSDFSAEPHATDGSPVANRVEPGKR PRSSMAPTLVFDHSSAGRGALYAVLGSPGGSMIIQFVVKTLVAMLDWGLNPQQAVSLV DFGAANSPHTNLGGENPEINTSDDGDHDPLVQGLRALGHRVNLAEQSSGLSAITRSEA GWAGGADPRREGAVMGDDA" CDS 2662554..2662994 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2416" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN [FIRST PART]" /note="Mb2416, -, len: 146 aa. Equivalent to 5' end of Rv2395, len: 667 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 44 aa overlap). Probable conserved integral membrane protein, similar to AAK24613|CC2646 OLIGOPEPTIDE TRANSPORTER/OPT FAMILY PROTEIN from Caulobacter crescentus (666 aa), FASTA scores: opt: 1638, E(): 4.8e-86, (51.0% identity in 658 aa overlap); Q9PIS5|CJ0204 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Campylobacter jejuni (665 aa), FASTA scores: opt: 1484, E(): 2.9e-77, (40.6% identity in 658 aa overlap); and P44016|Y561_HAEIN hypothetical integral membrane protein from Haemophilus influenzae (635 aa), FASTA scores: opt: 1449, E(): 2.8e-75, (42.15% identity in 624 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2395 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits Rv2395 into 2 parts, Mb2416 and Mb2417. Mb2416 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y128" /db_xref="InterPro:IPR004813" /db_xref="UniProtKB/TrEMBL:A0A1R3Y128" /protein_id="SIU01029.1" /translation="MSGATVGAREITIRGVVLGALITLVFTAANVYLGLRVGLTFATS HTGRGDLDGRAAVVRQPLSGGEQYCSDDRVGGRHAVVDHLRVTGTAHDRLVERVSVLD NGGGVCTGRDPWRHVLNSVAPRTRHRIRPAVPRRRCRSRGSQDR" CDS 2663008..2664558 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2417" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN [SECOND PART]" /note="Mb2417, -, len: 516 aa. Equivalent to 3' end of Rv2395, len: 667 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 516 aa overlap). Probable conserved integral membrane protein, similar to AAK24613|CC2646 OLIGOPEPTIDE TRANSPORTER/OPT FAMILY PROTEIN from Caulobacter crescentus (666 aa), FASTA scores: opt: 1638, E(): 4.8e-86, (51.0% identity in 658 aa overlap); Q9PIS5|CJ0204 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Campylobacter jejuni (665 aa), FASTA scores: opt: 1484, E(): 2.9e-77, (40.6% identity in 658 aa overlap); and P44016|Y561_HAEIN hypothetical integral membrane protein from Haemophilus influenzae (635 aa), FASTA scores: opt: 1449, E(): 2.8e-75, (42.15% identity in 624 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2395 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-t) splits Rv2395 into 2 parts, Mb2416 and Mb2417. Mb2417 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y129" /db_xref="InterPro:IPR004813" /db_xref="InterPro:IPR004814" /db_xref="UniProtKB/TrEMBL:A0A1R3Y129" /protein_id="SIU01030.1" /translation="MEHNRRGIGVIALGAAAAAGYALLASLRVINNSLSATFRVGSGA TMIGASLSLALIGVGHLVGVTVGVAMIVGLAIAFGVMLPIRTAGQLPPDGDYAVAVAR IFSTDVRFIGAGAIAVAAAWTFLKILGPILRGIADAAVSARTRRRGQAVGQTERDIPI HIVAMVVLLSLIPIGWLLADFTDGTPLDDRRPGAIAAGVLLVLVIGLMVAAVCGYMAG LIGSSNSPISGVGILVVVLAGLLIKTAYGPATGSQIPALVAYTVFTAALVFGVATISN DNLQDLKTGQLVGATPWKQQVALIIGVLVGSVVMAPILQLMQAGFGFQGAPGATANAL AAPQAALMSALAKGVFGGSLNWSLVGVGALTGVIAVALDETLAKTTTNLRLPPLAVGM GMYLPAALTLMIPIGAFLGRIYDSWARWSGDDDERKKRLGVMLATGLIVGESLYGVLF AVIVATTGKEEPLAMVGDGFRFASQPLGAIVFAGLLAWLYQRTRVTASYRLAAPAGSS KPLPDLPG" CDS 2664707..2664922 /codon_start=1 /transl_table=11 /gene="aprA" /locus_tag="BQ2027_MB2417A" /product="Acid and phagosome regulated protein A AprA" /note="Mb2417A, len: 71 aa. Equivalent to Rv2395A len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). AprA, acid and phagosome regulated protein A, restricted to M. tuberculosis complex. Note completely overlapped by sRNA mcr7. Mb2417A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y138" /protein_id="SIU01031.1" /translation="MTMTASVAKVTAARPEPSAAWAEARRRVRQRREDMLRHPAFLSK QLPAEPADDDGVAAVYDIAIARRRRPA" CDS 2665034..2665198 /codon_start=1 /transl_table=11 /gene="aprB" /locus_tag="BQ2027_MB2417B" /product="Acid and phagosome regulated protein B AprB" /note="Mb2417B, len: 54 aa. Equivalent to Rv2395B len: 54 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 54 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). AprB, acid and phagosome regulated protein B, restricted to M. tuberculosis complex. Mb2417B found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y143" /protein_id="SIU01032.1" /translation="MPGLVPAMPLDALRPARQPTSGLGECATMRRPEAGNEKVAVIWE SLDVVPPESL" CDS 2665282..2666364 /codon_start=1 /transl_table=11 /gene="PE_PGRS41" /locus_tag="BQ2027_MB2418" /product="pe-pgrs family protein pe_pgrs41" /note="Mb2418, PE_PGRS41, len: 360 aa. Equivalent to Rv2396, len: 361 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 361 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. AAK47132|MT2812 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1551 (454 aa), FASTA scores: opt: 1256, E(): 2.4e-44, (56.0% identity in 377 aa overlap); AAK46139|MT1866 PE_PGRS FAMILY PROTEIN from M. tuberculosis strain CDC1551 (491 aa), FASTA scores: opt: 1250, E(): 4.4e-44, (57.8% identity in 372 aa overlap); Y278_MYCTU|Rv0278C|MTV035.06c HYPOTHETICAL PE-PGRS FAMILY PROTEIN (957 aa), FASTA scores: opt: 1253, E(): 5.2e-44, (55.5% identity in 400 aa overlap); P71664|Rv1396c|MTCY21B4.13c HYPOTHETICAL GLYCINE-RICH 47.9 KDA PROTEIN (576 aa), FASTA scores: opt: 1236, E(): 1.8e-43, (55.55% identity in 402 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp deletion (gcc-*) leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (360 aa versus 361 aa). Mb2418 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y109" /protein_id="SIU01033.1" /translation="MSFLIASPEALAATATYLTGIGSAINAANAVAAAPTTEILAAGT DEVSTAISALFGAHAQAYQALSAHVAAFHDQFVHTLTAGAGSYMAAEAAASPLQALQL ELLNAINAPTLALLGRPLIGDGTDAAPGSGGAGGAGGILIGNGGTGGASDLAGTGRGG VGGAGGAGGLFGIGGAGGGCGSAVAIGGDGGAGGAGGVFSGGGAGGAGDAIGGSGGAG GTGGLLGGGGGAGGAGGAGGNGGGASNSASIGGDGGSGGAGGMLYGAGGVGGNGGAAV AIGGDGGAGGRAGAIGNGGDGGNGGTSNTPGGSGGDGGNGGNAGLIGSGGNGGNAEIV ISGGSVAGTGGNGGLLLGFNGTNGLP" CDS complement(2666389..2667444) /codon_start=1 /transl_table=11 /gene="cysA1" /locus_tag="BQ2027_MB2419C" /standard_name="cysA" /product="sulfate-transport atp-binding protein abc transporter cysa1" /note="Mb2419c, cysA1, len: 351 aa. Equivalent to Rv2397c, len: 351 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 351 aa overlap). Probable cysA1, sulfate-transport ATP-binding protein ABC transporter (see citations below), similar to OTHER SULFATE ABC TRANSPORTER ATP-BINDING PROTEINS e.g. P14788|CYSA_SYNP7 from Synechococcus sp. (344 aa), FASTA scores: opt: 1112, E(): 2.6e-56, (54.6% identity in 328 aa overlap); P74548|CYSA_SYNY3 from Synechocystis sp. (355 aa), FASTA scores: opt: 1063, E(): 1.7e-53, (51.9% identity in 343 aa overlap); Q9I6L0|CYSA|PA0280 from Pseudomonas aeruginosa (329 aa), FASTA scores: opt: 987, E(): 3.3e-49, (49.2% identity in 339 aa overlap); etc. Also similar to many ATP-binding proteins from Mycobacterium tuberculosis e.g. Rv2038c, Rv1238, Rv2832c, etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that previously known as cysA. Protein product from Mb2419c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2419c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4W3" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005666" /db_xref="InterPro:IPR008995" /db_xref="InterPro:IPR014769" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR024765" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P0A4W3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01034.1" /translation="MTYAIVVADATKRYGDFVALDHVDFVVPTGSLTALLGPSGSGKS TLLRTIAGLDQPDTGTITINGRDVTRVPPQRRGIGFVFQHYAAFKHLTVRDNVAFGLK IRKRPKAEIKAKVDNLLQVVGLSGFQSRYPNQLSGGQRQRMALARALAVDPEVLLLDE PFGALDAKVREELRAWLRRLHDEVHVTTVLVTHDQAEALDVADRIAVLHKGRIEQVGS PTDVYDAPANAFVMSFLGAVSTLNGSLVRPHDIRVGRTPNMAVAAADGTAGSTGVLRA VVDRVVVLGFEVRVELTSAATGGAFTAQITRGDAEALALREGDTVYVRATRVPPIAGG VSGVDDAGVERVKVTST" CDS complement(2667461..2668279) /codon_start=1 /transl_table=11 /gene="cysW" /locus_tag="BQ2027_MB2420C" /product="PROBABLE SULFATE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER CYSW" /note="Mb2420c, cysW, len: 272 aa. Equivalent to Rv2398c, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 272 aa overlap). Probable cysW, sulfate-transport integral membrane protein ABC transporter (see citations below), similar to others e.g. Q9K877|CYSW|BH3129 SULFATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (287 aa), FASTA scores: opt: 765, E(): 4.1e-40, (43.8% identity in 249 aa overlap); P27370|CYSW_SYNP7 sulfate transport system (permease) protein from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (286 aa), FASTA scores: opt: 757, E(): 1.3e-39, (44.3% identity in 264 aa overlap); Q9I6K9|CYSW|PA0281 SULFATE TRANSPORT PROTEIN from Pseudomonas aeruginosa (289 aa), FASTA scores: opt: 753, E(): 2.3e-39, (44.4% identity in 250 aa overlap); P16702|P76534|CYSW_ECOLI SULFATE TRANSPORT SYSTEM PERMEASE from Escherichia coli (291 aa), FASTA scores: opt: 633, E(): 5.7e-32, (38.2% identity in 267 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane component signature. SIMILARITY WITH INTEGRAL MEMBRANE COMPONENTS OF OTHER BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEMS and BELONGS TO THE CYSTW SUBFAMILY. Mb2420c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1W5" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR005667" /db_xref="InterPro:IPR011866" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01035.1" /translation="MTSLPAARYLVRSVALGYVFVLLIVPVALILWRTFEPGFGQFYA WISTPAAISALNLSLLVVAIVVPLNVIFGVTTALVLARNRFRGKGVLQAIIDLPFAVS PVIVGVSLILLWGSAGALGFVEQDLGFKIIFGLPGIVLASMFVTCPFVVREVEPVLHE LGTDQEQAAATLGSGWWQTFWRITLPSIRWGLTYGIVLTVARTLGEYGAVIIVSSNLP GTSQTLTLLVSDRYHRGAEYGAYALSTLLMAVSVVVLIVQMVLDAHRARAVSEG" CDS complement(2668276..2669127) /codon_start=1 /transl_table=11 /gene="cysT" /locus_tag="BQ2027_MB2421C" /product="PROBABLE SULFATE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER CYST" /note="Mb2421c, cysT, len: 283 aa. Equivalent to Rv2399c, len: 283 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 283 aa overlap). Probable cysT, sulfate-transport integral membrane protein ABC transporter (see citations below), similar to others e.g. BAB48989|MLR1667 PERMEASE PROTEIN OF SULFATE ABC TRANSPORTER from Rhizobium loti (283 aa), FASTA scores: opt: 756, E(): 7.9e-40, (40.95% identity in 271 aa overlap); Q9K878|CYST|BH3128 SULFATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (279 aa), FASTA scores: opt: 750, E(): 1.8e-39, (44.55% identity in 258 aa overlap); P16701|CYST_ECOLI|CYSU|CYST|B2424 from Escherichia coli (277 aa), FASTA scores: opt: 669, E(): 1.9e-34, (40.0% identity in 260 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane component signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CYSTW SUBFAMILY. Protein product from Mb2421c detected using shotgun mass spectrometry. Mb2421c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1B0" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR005667" /db_xref="InterPro:IPR011865" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01036.1" /translation="MTESLVGERRAPQFRARLSGPAGPPSVRVGMAVVWLSVIVLLPL AAIVWQAAGGGWRAFWLAVSSHAAMESFRVTLTISTAVTVINLVFGLLIAWVLVRDDF AGKRIVDAIIDLPFALPTIVASLVMLALYGNNSPVGLHFQHTATGVGVALAFVTLPFV VRAVQPVLLEIDRETEEAAASLGANGAKIFTSVVLPSLTPALLSGAGLAFSRAIGEFG SVVLIGGAVPGKTEVSSQWIRTLIENDDRTGAAAISVVLLSISFIVLLILRVVGARAA KREEMAA" CDS complement(2669124..2670194) /codon_start=1 /transl_table=11 /gene="subI" /locus_tag="BQ2027_MB2422C" /product="PROBABLE SULFATE-BINDING LIPOPROTEIN SUBI" /note="Mb2422c, subI, len: 356 aa. Equivalent to Rv2400c, len: 356 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 356 aa overlap). Probable subI, sulfate-binding lipoprotein component of sulfate transport system (see citations below), equivalent to Q9CCN3|SUBI|ML0615 (alias Q49748|B1937_F1_11, 358 aa) PUTATIVE SULPHATE-BINDING PROTEIN from Mycobacterium leprae (348 aa), FASTA scores: opt: 1775, E(): 2.3e-102, (76.45% identity in 340 aa overlap). Also similar to others and other substrate-binding proteins e.g. P27366|SUBI_SYNP7|SBPA SULFATE-BINDING PROTEIN PRECURSOR from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (350 aa), FASTA scores: opt: 703, E(): 4.6e-36, (35.6% identity in 351 aa overlap); Q9I6K7|SBP|PA0283 SULFATE-BINDING PROTEIN PRECURSOR from Pseudomonas aeruginosa (332 aa), FASTA scores: opt: 591, E(): 3.7e-29, (36.9% identity in 317 aa overlap); CAC49112|SMB21133 PUTATIVE SULFATE UPTAKE ABC TRANSPORTER PERIPLASMIC SOLUTE-BINDING PROTEIN PRECURSOR from Rhizobium meliloti (Sinorhizobium meliloti) (341 aa), FASTA scores: opt: 569, E(): 8.8e-28, (36.15% identity in 321 aa overlap); etc. BELONGS TO THE PROKARYOTIC SULFATE BINDING PROTEIN FAMILY. Protein product from Mb2422c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2422c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1G6" /db_xref="InterPro:IPR005669" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01037.1" /translation="MLSLTLSEASCIASASRWRHIIPAGVVCALIAGIGVGCHGGPSD VVGRAGPDRAHTSITLVAYAVPEPGWSAVIPAFNASEQGRGVQVITSYGASADQSRGV ADGKPADLVNFSVEPDIARLVKAGKVDKDWDADATKGIPFGSVVTFVVRAGNPKNIRD WDDLLRPGIEVITPSPLSSGSAKWNLLAPYAAKSDGGRNNQAGIDFVNTLVNEHVKLR PGSGREATDVFVQGSGDVLISYENEAIATERAGKPVQHVTPPQTFKIENPLAVVATST HLGAATAFRNFQYTVQAQKLWAQAGFRPVDPAVAADFADLFPVPAKLWTIADLGGWGS VDPQLFDKATGSITKIYLRATG" CDS 2670208..2670537 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2423" /product="HYPOTHETICAL PROTEIN" /note="Mb2423, -, len: 109 aa. Equivalent to Rv2401, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Hypothetical unknown protein. Equivalent to AAK46768 from Mycobacterium tuberculosis strain CDC1551 (134 aa) but shorter 25 aa. N-terminus extended since first submission (previously 72 aa)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y130" /protein_id="SIU01038.1" /translation="MRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSA ANERADIAPRKTRCCVHVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPR HPGYLGA" CDS complement(2670522..2670725) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2424C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2424c, -, len: 67 aa. Equivalent to Rv2401A, len: 67 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Possible conserved membrane protein, highly similar, but with 29 aa shorter, to ML0614|AL583919_34|Q49760 from Mycobacterium leprae (95 aa), FASTA scores: opt: 297, E(): 3.6e-15, (67.7% identity in 65 aa overlap). Has hydrophobic stretch. Mb2424c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y141" /db_xref="UniProtKB/TrEMBL:A0A1R3Y141" /protein_id="SIU01039.1" /translation="MGPMNGFLSWWDGVELWLSGLPFALQALAVMPVVLALAYFTAAL LDALLGRVIQLIRRARRPDQAPR" CDS 2670853..2672937 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2425" /product="Trehalase (EC" /EC_number="3.2.1.28" /note="Mb2425, -, len: 642 aa. Equivalent to Rv2402, len: 642 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 642 aa overlap). Conserved hypothetical protein, highly similar to others e.g. 9X8C4|SCE36.11c CONSERVED HYPOTHETICAL PROTEIN (FRAGMENT) from Streptomyces coelicolor (612 aa), FASTA scores: opt: 1283, E(): 6.5e-75, (41.9% identity in 623 aa overlap); Q9RJ38|SCI8.15 HYPOTHETICAL 66.3 KDA PROTEIN from Streptomyces coelicolor (595 aa), FASTA scores: opt: 1152, E(): 1.7e-66, (39.9% identity in 622 aa overlap), Q9S223|CI51.17 HYPOTHETICAL 68.4 KDA PROTEIN from Streptomyces coelicolor (612 aa), FASTA scores: opt: 1146, E(): 4.2e-66, (40.6% identity in 623 aa overlap); YAY3_SCHPO|Q10211|c4h3.03c HYPOTHETICAL 74.5 kd PROTEIN from Schizosaccharomyces pombe (Fission yeast) (649 aa) FASTA scores: opt: 999, E(): 1.3e-56, (35.0% identity in 642 aa overlap); etc. Contains possible helix-turn-helix motif, at aa 224-245 (+4.68 SD). Protein product from Mb2425 detected using SWATH mass spectrometry. Mb2425 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y140" /db_xref="InterPro:IPR008928" /db_xref="InterPro:IPR011613" /db_xref="InterPro:IPR012341" /db_xref="UniProtKB/TrEMBL:A0A1R3Y140" /protein_id="SIU01040.1" /translation="MKWTPSKSTDDDVGMVLHAQPPDQSTETAREAKALAGATDGATA TSADLHAPMALSSSSPLRNPFPPIADYAFLSDWETTCLISPAGSVEWLCVPRPDSPSV FGAILDRSAGHFRLGPYGVSVPSARRYLPGSLIMETTWQTHTGWLIVRDALVMGKWHD IERRSRTHRRTPMDWDAEHILLRTVRCVSGTVELMMSCEPAFDYHRLGATWEYSAEAY GEAIARANTEPDAHPTLRLTTNLRIGLEGREARARTRMKEGDDVFVALSWTKHPPPQT YDEAADKMWQTTECWRQWINIGNFPDHPWRAYLQRSALTLKGLTYSPTGALLAASTTS LPETPRGERNWDYRYAWIRDSTFALWGLYTLGLDREADDFFAFIADVSGANNNERHPL QVMYGVGGERSLVEAELHHLSGYDHARPVRIGNGAYNQRQHDIWGSILDSFYLHAKSR EQVPENLWPVLKRQVEEAIKHWREPDRGIWEVRGEPQHFTSSKVMCWVALDRGAKLAE RQGEKSYAQQWRAIADEIKADILEHGVDSRGVFTQRYGDEALDASLLLVVLTRFLPPD DPRVRNTVLAIADELTEDGLVLRYRVHETDDGLSGEEGTFTICSFWLVSALVEIGEVG RAKRLCERLLSFASPLLLYAEEIEPRSGRHLGNFPQAFTHLALINAVVHVIRAEEEAD SSGMFQPANAPM" CDS complement(2673015..2673770) /codon_start=1 /transl_table=11 /gene="lppR" /locus_tag="BQ2027_MB2426C" /product="PROBABLE CONSERVED LIPOPROTEIN LPPR" /note="Mb2426c, lppR, len: 251 aa. Equivalent to Rv2403c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 251 aa overlap). Probable lppR, conserved lipoprotein, with weak similarity with MYCOBACTERIAL SERINE/THREONINE PROTEIN KINASES (EC 2.7.1.-) e.g. AAK45563|MT1304 from Mycobacterium tuberculosis strain CDC1551 (626 aa), FASTA scores: opt: 186, E(): 0.00023, (24.4% identity in 238 aa overlap), and the C-terminal part of Q11053|Rv1266c|MTCY50.16|PKNH_MYCTU from Mycobacterium tuberculosis (626 aa), FASTA scores: opt: 185, E()= 0.00027, (24.35% identity in 238 aa overlap). Has signal peptide and appropriate positioned prokaryotic lipoprotein attachment site (PS00013). Could belong to the SER/THR FAMILY of protein kinases. Protein product from Mb2426c detected using SWATH mass spectrometry. Mb2426c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/TrEMBL:A0A1R3Y147" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01041.1" /translation="MTNRWRWVVPLFAVFLAAGCTTTTTGKAGLAPNAVPRPLMGSLI QRVPLDGAALSTLLNQPFQALPPFPPVFGGSDSLGDSDVSARPADCVGVGYLTQRNVY RSVEVKSVARVSWRHDGSSVKVDDLDEGVVALPSAAAADDLFARFSAQWKECDGTTLT VPASAFGQRSITDVRVADSVVAATVSLRRGTHSILASVPQARAVGVRGNCVVEVAVTF FGITHPSDQGSADISTSAVDIAHAMMDRISELS" CDS complement(2673767..2675728) /codon_start=1 /transl_table=11 /gene="lepA" /locus_tag="BQ2027_MB2427C" /product="PROBABLE GTP-BINDING PROTEIN LEPA (GTP-BINDING ELONGATION FACTOR)" /note="Mb2427c, lepA, len: 653 aa. Equivalent to Rv2404c, len: 653 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 653 aa overlap). Probable lepA, GTP-binding protein (a protein of unknown function, but apparently with membrane-related functions and very similar to protein synthesis elongation factors; see citations below). Equivalent to P53530|LEPA_MYCLE|ML0611|B1937_F3_81 GTP-BINDING PROTEIN from Mycobacterium leprae (646 aa), FASTA scores: opt: 3610, E(): 1.2e-205, (88.0% identity in 649 aa overlap). Also highly similar to many GTP-BINDING PROTEINS LEPA e.g. Q9RDC9|LEPA_STRCO|SCC77.29c from Streptomyces coelicolor (622 aa), FASTA scores: opt: 3046, E(): 2.3e-172, (74.3% identity in 626 aa overlap); P37949|LEPA_BACSU from B. subtilis (612 aa), FASTA scores: opt: 2430, E(): 5.3e-136, (58.7% identity in 610 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00301 GTP-binding elongation factors signature. BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, LEPA SUBFAMILY. Protein product from Mb2427c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2427c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65270" /db_xref="InterPro:IPR000640" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR006297" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR013842" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031157" /db_xref="InterPro:IPR035647" /db_xref="InterPro:IPR035654" /db_xref="InterPro:IPR038363" /db_xref="UniProtKB/Swiss-Prot:P65270" /protein_id="SIU01042.1" /translation="MRTPCSQHRRDRPSAIGSQLPDADTLDTRQPPLQEIPISSFADK TFTAPAQIRNFCIIAHIDHGKSTLADRMLQLTGVVDERSMRAQYLDRMDIERERGITI KAQNVRLPWRVDKTDYVLHLIDTPGHVDFTYEVSRALEACEGAVLLVDAAQGIEAQTL ANLYLALDRDLHIIPVLNKIDLPAADPDRYAAEMAHIIGCEPAEVLRVSGKTGEGVSD LLDEVVRQVPPPQGDAEAPTRAMIFDSVYDIYRGVVTYVRVVDGKISPRERIMMMSTG ATHELLEVGIVSPEPKPCEGLGVGEVGYLITGVKDVRQSKVGDTVTSLSRARGAAAEA LTGYREPKPMVYSGLYPVDGSDYPNLRDALDKLQLNDAALTYEPETSVALGFGFRCGF LGLLHMEITRERLEREFGLDLISTSPNVVYRVHKDDGTEIRVTNPSDWPEGKIRTVYE PVVKTTIIAPSEFIGTIMELCQSRRGELGGMDYLSPERVELRYTMPLGEIIFDFFDAL KSRTRGYASLDYEEAGEQEAALVKVDILLQGEAVDAFSAIVHKDTAYAYGNKMTTKLK ELIPRQQFEVPVQAAIGSKIIARENIRAIRKDVLSKCYGGDITRKRKLLEKQKEGKKR MKTIGRVEVPQEAFVAALSTDAAGDKGKK" CDS 2675749..2676318 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2428" /product="RNA 3'-terminal phosphate cyclase (EC" /EC_number="6.5.1.4" /note="Mb2428, -, len: 189 aa. Equivalent to Rv2405, len: 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 189 aa overlap). Conserved hypothetical protein, identical (but N-terminus longer 40 residues) to AAK46773|MT2477 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551. Also highly similar, but N-terminus longer 38 residues, to Q9RD03|SCCM1.41 HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 451, E(): 2e-22, (48.7% identity in 154 aa overlap). Shows also similarity with hypothetical proteins from other species. Protein product from Mb2428 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2428 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y120" /db_xref="InterPro:IPR003477" /db_xref="UniProtKB/TrEMBL:A0A1R3Y120" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01043.1" /translation="MQRFAENLVFTEAPKLVRHLQNTQETLRTIRQAVKITANIMTTA VPSPPAEIAAGRPVTSTSCPTAARARRLVYAPDLDGRADPGEIVWTWVAYEQDPTRGK DRPVLVVGRDRSVLLGLLVSSQERHAADRDWVGIGSGAWDYEGRESWVRLDRVLDVPE ESIRREGAILEREVFDVVAARLRADYAWR" CDS complement(2676489..2676917) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2429C" /product="CBS domain protein" /note="Mb2429c, -, len: 142 aa. Equivalent to Rv2406c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Conserved hypothetical protein. C-terminal region is identical with many CBS DOMAIN PROTEIN e.g. AAK46774|MT2478 CBS DOMAIN PROTEIN from Mycobacterium tuberculosis strain CDC1551 (aa 47-142), FASTA scores: opt: 594, E(): 1.9e-30, (98.97% identity in 97 aa overlap); etc. Also similar to other hypothetical proteins e.g. AAK24594|CC2626 CBS DOMAIN PROTEIN from Caulobacter crescentus (157 aa), FASTA scores: opt: 377, E(): 8.3e-17, (42.55% identity in 141 aa overlap); BAB47826|MLR0188 from Rhizobium loti; etc. Protein product from Mb2429c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2429c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000644" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B0" /protein_id="SIU01044.1" /translation="MRIADVLRNKGAAVVTINPDATVGELLAGLAEQNIGAMVVVGAE GVVGIVSERDVVRQLHTYGASVLSRPVAKIMSTTVATCTKSDTVDKISVLMTENRVRH VPVLDGKKLIGIVSIGDVVKSRMGELEAEQQQLQSYITQG" CDS 2677177..2678019 /codon_start=1 /transl_table=11 /gene="rnz" /locus_tag="BQ2027_MB2430" /product="MBL-fold metallo-hydrolase superfamily" /note="Mb2430, -, len: 280 aa. Equivalent to Rv2407, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (97.5% identity in 280 aa overlap). Conserved hypothetical protein, highly similar (but longer at N-terminus) to AAK46775|MT2479 putative arylsulfatase from Mycobacterium tuberculosis strain CDC1551 (224 aa) FASTA scores: opt: 1433, E(): 2.5e-81, (96.43% identity in 224 aa overlap); O33130|MLCL536.01 HYPOTHETICAL PROTEIN from Mycobacterium leprae (220 aa), FASTA scores: opt: 658, E(): 1.5e-33, (56.75% identity in 215 aa overlap). Also similar to AAK23160|CC1176 Metallo-beta-lactamase family protein from Caulobacter crescentus (317 aa), FASTA scores: opt: 286, E(): 1.8e-10, (33% identity in 291 aa overlap). And similar to other hypothetical proteins eg Q49744|B1937_C1_163 HYPOTHETICAL 22.6 KDA PROTEIN (PRECURSOR) from Mycobacterium leprae (211 aa), FASTA scores: opt: 623, E(): 2.1e-31, (56.3% identity in 206 aa overlap); O27859|MTH1831 CONSERVED PROTEIN from Methanothermobacter thermautotrophicus (307 aa), FASTA scores: opt: 268, E(): 2.3e-09, (28.35% identity in 307 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 21 bp in-frame insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (280 aa versus 273 aa). Protein product from Mb2430 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2430 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYN1" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR013471" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/Swiss-Prot:Q7TYN1" /protein_id="SIU01045.1" /translation="MLEITLLGTGSPIPDPDRAGPSTLVRAGAQAFLVDCGRGVLQRA AAVGVGAAGLSAVLLTHLHSDHIAELGDVLITSWVTNFAADPAPLPIIGPPGTAEVVE ATLKAFGHDIGYRIAHHADLTTPPPIEVHEYTAGPAWDRDGVTIRVAPTDHRPVTPTI GFRIESDGASVVLAGDTVPCDSLDQLAAGADALVHTVIRKDIVTQIPQQRVKDICDYH SSVQEAAATANRAGVGTLVMTHYVPAIGPGQEEQWRALAATEFSGRIEVGNDLHRVEV HPRR" CDS 2678612..2679331 /codon_start=1 /transl_table=11 /gene="PE24" /locus_tag="BQ2027_MB2431" /product="possible pe family-related protein pe24" /note="Mb2431, PE24, len: 239 aa. Equivalent to Rv2408, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 239 aa overlap). Possibly a member of PE family, similar to AAK46440|MT2159 from Mycobacterium tuberculosis strain CDC1551 (491 aa) FASTA scores: opt: 269, E(): 5.4e-08, (38.45% identity in 156 aa overlap) and AAK45466|MT1209 from Mycobacterium tuberculosis strain CDC1551 (308 aa), FASTA scores: opt: 265, E(): 6.3e-08, (36.0% identity in 197 aa overlap)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C0" /protein_id="SIU01046.1" /translation="MLIARPDILCSRGPEAMRAKAADLDLAAAAKTVGVQPAADQVAA AIAAILLSHAQIYQDISTQMAAFHDQLVENRTADSTSYASAEANAQQSLLNAMDAPSW QQRRETVGEVGLPADPAGSGTATAAVAAATTARAGSRSAAQATVAPIGGLKLRRESAL SQPGDLHHHVEVGDALPRVDPFQRGNVGVVAAYTHTDVLLGDLIVIGGVVVPPSTGPG LNPGMAAPVYRLSHHGITLRV" CDS complement(2679089..2679928) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2432C" /product="Protein containing transglutaminase-like domain, putative cysteine protease" /note="Mb2432c, -, len: 279 aa. Equivalent to Rv2409c, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 279 aa overlap). Conserved hypothetical protein, equivalent to Q49757|YP69_MYCLE|G466976|B1937_F2_39 HYPOTHETICAL PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 1564, E(): 4.6e-95, (82.1% identity in 279 aa overlap). Also similar to others e.g. Q9RSX6|DR1993 from Deinococcus radiodurans (274 aa), FASTA scores: opt: 494, E(): 4e-25, (35.1% identity in 282 aa overlap); BAB49898|Mll2875 from Rhizobium loti (Mesorhizobium loti) (294 aa), FASTA scores: opt: 382, E(): 8.9e-18, (29.75% identity in 269 aa overlap); Q9I305|PA1732 from Pseudomonas aeruginosa (266 aa), FASTA scores: opt: 326, E(): 3.7e-14, (31.25% identity in 275 aa overlap); etc. Also similar to Rv2569c|MTCY227.32 from Mycobacterium tuberculosis. Protein product from Mb2432c detected using SWATH mass spectrometry. Mb2432c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002931" /db_xref="InterPro:IPR013589" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H6" /protein_id="SIU01047.1" /translation="MWRTRVVHTTGYVYQSPVTASYNEARLTPRSSSRQNLVLNRVET IPATRSYRYIDYWGTAVTAFDLHAPHTELTVTSSSVVETERPEPLAAKATWADLQSTA VIDRFDEVLRPTPHTPASARVDAVGRRIRKCHEPSEAVVAAARWARSELDYIPGTTSV HSSGLDALEQGKGVCQDFVHLSLMVLRSMGIPCRYVSGYLHPKRDAVVGKTVDGRSHA WVQAWTGGWWHYDPTNDNEITEQYISVGVGRDYTDVSPLKGIYSGEGVTDLDVVVEIT RLA" CDS complement(2679928..2680905) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2433C" /product="Protein containing domains DUF403" /note="Mb2433c, -, len: 325 aa. Equivalent to Rv2410c, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 325 aa overlap). Conserved hypothetical protein, equivalent to Q49770|CAC30114|ML0606 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (325 aa), FASTA scores: opt: 1928, E(): 3.5e-117, (90.75% identity in 325 aa overlap). Also some similarity with other hypothetical proteins e.g. Q9RST2|DR2041 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (316 aa), FASTA scores: opt: 329, E(): 5.3e-14, (32.4% identity in 318 aa overlap); C-terminus of Q9HUN7|PA4927 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (830 aa), FASTA scores: opt: 297, E(): 1.5e-11, (27.6% identity in 315 aa overlap); etc. Protein product from Mb2433c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2433c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007296" /db_xref="UniProtKB/TrEMBL:A0A1R3Y139" /protein_id="SIU01048.1" /translation="MLARNAEALYWIGRYVERADDTARILDVAVHQLLEDSSVDPDQA SRLLLRVLGIEPPDHELDVWSLTDLVAFSTNSQGGSSIVDAISAARENAKSAREVTSS ETWECLNTTYNALPERERAAKRLGPHEFLSFIEGRAAMFAGLADSTLLRDDGYRFMLL GRAIERVDMTVRLLLSRVGDSASSPAWVTLLRSAGAHDTYLRTYRGVLDAGRVVEFMM LDRLFPRSVFHSLKLAEHNLAELMHNPHSRIGATTEAQRLLGQARSELEFVQPGVLLE TLESRLAGLQTTCRDVGDALALQYFHAAPWVAWSDAGQRGQLVGSQEES" CDS complement(2680905..2682560) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2434C" /product="Protein containing domains DUF404, DUF407" /note="Mb2434c, -, len: 551 aa. Equivalent to Rv2411c, len: 551 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 551 aa overlap). Hypothetical protein, highly similar to Q49755|YO11_MYCLE|ML0605|MLCL536.05c|U19 37B|B1937_F1_4 HYPOTHETICAL 61.8 KDA PROTEIN from Mycobacterium leprae (561 aa), FASTA scores, opt: 3163, E(): 4.1e-178, (87.35% identity in 554 aa overlap). Also highly similar, except in N-terminus, to others e.g. Q55587|Y335_SYNY3|SLL0335 HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC 6803 (481 aa), FASTA scores: opt: 1620, E(): 1.2e-87, (52.8% identity in 468 aa overlap); Q9I307|PA1730 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (470 aa), FASTA scores: opt: 1574, E(): 5.8e-85, (52.7% identity in 467 aa overlap); Q9RST1|DR2042 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (655 aa), FASTA scores: opt: 1561, E(): 4.4e-84, (53.3% identity in 467 aa overlap); etc. Protein product from Mb2434c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2434c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007302" /db_xref="InterPro:IPR016450" /db_xref="UniProtKB/Swiss-Prot:P65002" /protein_id="SIU01049.1" /translation="MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDA QGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRV ISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIV PPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRV RAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRD LFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVL SSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVL KPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPR YVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAARE LGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQQQQAFH" CDS 2682670..2682930 /codon_start=1 /transl_table=11 /gene="rpsT" /locus_tag="BQ2027_MB2435" /product="30s ribosomal protein s20 rpst" /note="Mb2435, rpsT, len: 86 aa. Equivalent to Rv2412, len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Probable rpsT, 30s ribosomal protein s20, equivalent to O33132|RS20_MYCLE|L0604|MLCL536.06 30S RIBOSOMAL PROTEIN S20 from Mycobacterium leprae (86 aa), FASTA scores: opt: 456, E(): 4.6e-24, (87.20% identity in 86 aa overlap). Also highly similar or similar to others e.g. Q9RDM3|RPST|SCC123.01 30S RIBOSOMAL PROTEIN S20 from Streptomyces coelicolor (88 aa), FASTA scores: opt: 363, E(): 7.1e-18, (70.95% identity in 86 aa overlap); Q9KD79|RPST|BH1339 RIBOSOMAL PROTEIN S20 (BS20) from Bacillus halodurans (91 aa), FASTA scores: opt: 252, E(): 1.8e-10, (49.4% identity in 85 aa overlap); P02378|RS20_ECOLI 30s ribosomal protein s20 from Escherichia coli (86 aa), FASTA scores: opt: 210, E(): 1e-07, (42.4% identity in 85 aa overlap); etc. BELONGS TO THE S20P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2435 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2435 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66506" /db_xref="InterPro:IPR002583" /db_xref="InterPro:IPR036510" /db_xref="UniProtKB/Swiss-Prot:P66506" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01050.1" /translation="MANIKSQQKRNRTNERARLRNKAVKSSLRTAVRAFREAAHAGDK AKAAELLASTNRKLDKAASKGVIHKNQAANKKSALAQALNKL" CDS complement(2682946..2683896) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2436C" /product="DNA polymerase III delta subunit (EC" /EC_number="2.7.7.7" /note="Mb2436c, -, len: 316 aa. Equivalent to Rv2413c, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 316 aa overlap). Conserved hypothetical protein, highly similar to O33133|MLCL536.07c|ML0603|Q49756|G466975|B1937_F2_36 hypothetical 39.1 KDA protein from Mycobacterium leprae (389 aa), FASTA scores: opt: 1683, E(): 1.8e-88, (83.9% identity in 316 aa overlap). ML0603 is a putative lipoprotein with an N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site that is not present in Rv2413c as this seems to be 73 aa shorter. Also some similarity with various proteins from other organisms e.g. Q9RDM2|SCC123.02c PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 792, E(): 6.1e-38, (42.4% identity in 316 aa overlap); Q9HX31|HOLA|PA3989 DNA POLYMERASE III, DELTA SUBUNIT from Pseudomonas aeruginosa (345 aa), FASTA scores: opt: 173, E(): 0.0084, (25.4% identity in 307 aa overlap); etc. Protein product from Mb2436c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2436c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y156" /db_xref="InterPro:IPR008921" /db_xref="InterPro:IPR010372" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y156" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01051.1" /translation="MHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGA YELAELLSPSLFAEERIVVLGAAAEAGKDAAAVIESAAADLPAGTVLVVVHSGGGRAK SLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDDETVTALLDAVGSDVRE LASACSQLVADTGGAVDAAAVRRYHSGKAEVRGFDIADKAVAGDVAGAAEALRWAMMR GEPLVVLADALAEAVHTIGRVGPQSGDPYRLAAQLGMPPWRVQKAQKQARRWSRDTVA TAMRLVAELNANVKGAVADADYALESAVRQVAELVADRGR" CDS complement(2683901..2685472) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2437C" /product="DNA internalization-related competence protein ComEC/Rec2" /note="Mb2437c, -, len: 523 aa. Similar to Rv2414c, len: 514 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 483 aa overlap). Conserved hypothetical protein, showing some similarity with COME OPERON PROTEINS 3 (COMEC OR COME3) e.g. Q9RTB1|DR1854 PUTATIVE COMPETENCE PROTEIN COMEC/REC2 from Deinococcus radiodurans (755 aa), FASTA scores: opt: 311, E(): 8.2e-11, (27.3% identity in 538 aa overlap); P73100|COME|SLL1929 COME PROTEIN from Synechocystis sp. strain PCC 6803 (709 aa), FASTA scores: opt: 302, E(): 2.6e-10, (26.3% identity in 323 aa overlap) (no similarity on N-terminus); P39695|CME3_BACSU COME OPERON PROTEIN 3 from Bacillus subtilis (776 aa), FASTA scores: opt: 273, E(): 1.4e-08, (25.2% identity in 282 aa overlap) (no similarity on N-terminus); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-t) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis H37Rv (523 aa versus 514 aa). Mb2437c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y162" /db_xref="InterPro:IPR004477" /db_xref="UniProtKB/TrEMBL:A0A1R3Y162" /protein_id="SIU01052.1" /translation="MGFGASRLDVRLVPAALVSWIVTAAGIVWPIGNVCALCCVVVAL GGGALWWCVARRSWHAPRLGSISAGLVAVGMVGAGYGLAVALRSEAVDRHPITVAFGT SALVTVTPSESPVSLGRGRLMFRATVQRLRDDETSGRVVVFARALDFGELMVGQPVQF RARISRPARHDLTVAVFNATGRPTVGRAGPVHRAAHIVRHRFAAAVREVLPADQATML PALVLGDTSTVTALTSREFRAAGLTHLTAVSGANVTIVCAAALVSARLIGPRAAVVCA AVALVAFVILVQPTASVLRAAVMGAIALVGMLSARRRQAIPALSGSVLVLLAAAPHLA VDIGFALSVAATGALVVIAPVWSRRLVDRGCPKVLADALAVAAAAQLVTAPLVAAISG RVSLVAVVANLAVAAVIAPITVLGSVAAVLVVPWPAGAQVLIRFTGPEVWWVLRVAHW ASGVPAATVPVAAGLPGVLLVGGATVFTVAQWRLALVSRGHVQNDGGGRHMSACLVAV RAGRPFVTPSWGERG" CDS complement(2685487..2686380) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2438C" /product="Late competence protein ComEA, DNA receptor" /note="Mb2438c, -, len: 297 aa. Equivalent to Rv2415c, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 297 aa overlap). Hypothetical protein, with some similarity in C-terminal part to comE operon proteins 1 e.g. Q9EU10|COME|COME4|COME1|COME2|COME3 COME PROTEIN (a competence protein with DNA-binding activity) from Neisseria gonorrhoeae (99 aa), FASTA scores: opt: 190, E(): 0.0032, (49.2% identity in 61 aa overlap); Q9JYB8|NMB1657 from Neisseria meningitidis (205 aa) FASTA scores: opt: 191, E(): 0.0052, (49.2% identity in 61 aa overlap); CME1_BACSU|P39694 come operon protein 1 from Bacillus subtilis (205 aa), FASTA scores, opt: 181, E(): 0.017 (29.8% identity in 218 aa overlap); etc. Mb2438c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y127" /db_xref="InterPro:IPR003583" /db_xref="InterPro:IPR010994" /db_xref="InterPro:IPR019554" /db_xref="UniProtKB/TrEMBL:A0A1R3Y127" /protein_id="SIU01053.1" /translation="MRTELPAERLQRRLGAVPDIDSHAASAHLDPEPHDPTDDGPDHD EPRDDPNSLLPRWLPDTSRGQGWADRIRADPGRAGAVALAVIAALAVLVTVFTLIRDR TEPVMSAKLPPVEPVSPTNPRSSASPGSPDRSGLPVVVSVVGLVHTPGLVTLAPGARI ADALQAAGGAVDGADTVGLNMARQLGDGEQIVVGLAPPSGQPRVLGSSVGAGTPGPAG TSGTATTGPKTAPKTAEVLDLNTATVEQLDALPGIGPVTAAAIVAWRQRNGRFTSVDQ LADVDGIGPARLDKLRNLVRV" CDS complement(2686720..2687946) /codon_start=1 /transl_table=11 /gene="eis" /locus_tag="BQ2027_MB2439C" /product="enhanced intracellular survival protein eis,gcn5-related n-acetyltransferase" /note="Mb2439c, -, len: 408 aa. Equivalent to Rv2416c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 408 aa overlap). Conserved hypothetical protein, sharing similarity with Q9F309|SCC80.10 HYPOTHETICAL 44.7 KDA PROTEIN from Streptomyces coelicolor (413 aa), FASTA scores: opt: 382, E(): 1e-16, (31.45% identity in 407 aa overlap); Q9K4F4|SCD66.23 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (418 aa), FASTA scores: opt: 238, E(): 1.3e-07, (36.5% identity in 364 aa overlap): and Q54238|G1139577|ORF5 hypothetical protein from Streptomyces griseus (416 aa), FASTA scores: opt: 237, E(): 1.5e-07, (34.0 identity in 423 aa overlap). Protein product from Mb2439c detected using SWATH mass spectrometry. Mb2439c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59772" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR022902" /db_xref="InterPro:IPR025559" /db_xref="InterPro:IPR036527" /db_xref="InterPro:IPR041380" /db_xref="UniProtKB/Swiss-Prot:P59772" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01054.1" /translation="MPQSDSVTVTLCSPTEDDWPGMFLLAAASFTDFIGPESATAWRT LVPTDGAVVVRDGAGPGSEVVGMALYMDLRLAVPGEVVLPTAGLSFVAVAPTHRRRGL LRAMCAELHRRIADSGYPVAALHASEGGIYGRFGYGPATTLHELTVDRRFARFHADAP GGGLGGSSVRLVRPTEHRGEFEAIYERWRQQVPGGLLRPQVLWDELLAECKAAPGGDR ESFALLHPDGYALYRVDRTDLKLARVSELRAVTADAHCALWRALIGLDSMERISIITH PQDPLPHLLTDTRLARTTWRQDGLWLRIMNVPAALEARGYAHEVGEFSTVLEVSDGGR FALKIGDGRARCTPTDAAAEIEMDRDVLGSLYLGAHRASTLAAANRLRTKDSQLLRRL DAAFASDVPVQTAFEF" CDS complement(2688068..2688910) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2440C" /product="DegV family protein" /note="Mb2440c, -, len: 280 aa. Equivalent to Rv2417c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Conserved hypothetical protein, highly similar to Q9RDL7|SCC123.07c HYPOTHETICAL 29.2 KDA PROTEIN from Streptomyces coelicolor (281 aa), FASTA scores: opt: 579, E(): 3.6e-27, (38.3% identity in 274 aa overlap). Also some similarity with DEGV proteins or hypothetical proteins from other organisms, e.g. Q9RSY3|DR1986 from Deinococcus radiodurans (281 aa), FASTA scores: opt: 393, E(): 3.4e-16, (31.0% identity in 280 aa overlap); P32436|DEGV_BACSU from Bacillus subtilis (281 aa), FASTA scores: opt: 365, E(): 1.5e-14, (27.8% identity in 284 aa overlap); BAB41937|BAB46307|SA0704|SAV0749 Conserved hypothetical protein from Staphylococcus aureus strain Mu50 and N315 (288 aa), FASTA scores: opt: 371, E(): 7e-15, (28.85% identity in 281 aa overlap); etc. Protein product from Mb2440c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2440c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67369" /db_xref="InterPro:IPR003797" /db_xref="UniProtKB/Swiss-Prot:P67369" /protein_id="SIU01055.1" /translation="MTVVVVTDTSCRLPADLREQWSIRQVPLHILLDGLDLRDGVDEI PDDIHKRHATTAGATPVELSAAYQRALADSGGDGVVAVHISSALSGTFRAAELTAAEL GPAVRVIDSRSAAMGVGFAALAAGRAAAAGDELDTVARAAAAAVSRIHAFVAVARLDN LRRSGRISGAKAWLGTALALKPLLSVDDGKLVLVQRVRTVSNATAVMIDRVCQLVGDR PAALAVHHVADPAAANDVAAALAERLPACEPAMVTAMGPVLALHVGAGAVGVCVDVGA SPPA" CDS complement(2688991..2689734) /codon_start=1 /transl_table=11 /gene="octT" /locus_tag="BQ2027_MB2441C" /product="Diglucosylglycerate octanoyltransferase (DGG octanoyltransferase) (EC" /EC_number="2.3.1.273" /note="Mb2441c, -, len: 247 aa. Equivalent to Rv2418c, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Hypothetical unknown protein. Protein product from Mb2441c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2441c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D0" /protein_id="SIU01056.1" /translation="MSSRRGRRPALLVFADSLAYYGPTGGLPADDPRIWPNIVASQLD WDLELIGRIGWTCRDVWWAATQDPRAWAALPRAGAVIFATGGMDSLPSVLPTALRELI RYVRPSWLRRWVRDGYAWVQPRLSPVARAALPPHLTAEYLEKTRGAIDFNRPGIPIIA SLPSVHIAETYGKAHHGRAGTVAAITEWAQHHDIPLVDLKAAVAEQILSGYGNRDGIH WNFEAHQAVAELMLKALAEAGVPNEKSRG" CDS complement(2689724..2690395) /codon_start=1 /transl_table=11 /gene="gpgp" /locus_tag="BQ2027_MB2442C" /product="glucosyl-3-phosphoglycerate phosphatase gpgp" /note="Mb2442c, -, len: 223 aa. Equivalent to Rv2419c, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 223 aa overlap). Probable phosphoglycerate mutase (EC 5.4.2.1), equivalent to Q9CC00|ML1452 POSSIBLE PHOSPHOGLYCERATE MUTASE from Mycobacterium leprae (224 aa), FASTA scores: opt: 1206, E(): 8.8e-68, (80.35% identity in 224 aa overlap). Also highly similar to Q9RDL0|SCC123.14c PUTATIVE PHOSPHOGLYCERATE MUTASE from Streptomyces coelicolor (223 aa), FASTA scores: opt: 431, E(): 9.4e-20, (40.85% identity in 213 aa overlap); and similar to others e.g. Q9RVD2|DR1097 from Deinococcus radiodurans (232 aa), FASTA scores: opt: 291, E(): 4.6e-11, (39.3% identity in 173 aa overlap); etc. Some similarity to Q10512|Rv2228c|Y019_MYCTU|MT2287|MTcy427.09c hypothetical 39.2 kd protein from Mycobacterium tuberculosis (364 aa) FASTA scores: opt: 196, E(): 2.8e-06, (45.6% identity in 79 aa overlap). Contains PS00175 Phosphoglycerate mutase family phosphohistidine signature. BELONGS TO THE PHOSPHOGLYCERATE MUTASE FAMILY. Protein product from Mb2442c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2442c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1I3" /db_xref="InterPro:IPR001345" /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I3" /protein_id="SIU01057.1" /translation="MRARRLVMLRHGQTDYNVGSRMQGQLDTELSELGRTQAVAAAEV LGKRQPLLIVSSDLRRAYDTAVKLGERTGLVVRVDTRLRETHLGDWQGLTHAQIDADA PGARLAWREDATWAPHGGESRVDVAARSRPLVAELVASEPEWGGADEPDRPVVLVAHG GLIAVLSAALLKLPVANWPALGGMGNASWTQLSGHWAPGSDFESIRWRLDVWNASAQV SSDVL" CDS complement(2690392..2690772) /codon_start=1 /transl_table=11 /gene="rsfS" /locus_tag="BQ2027_MB2443C" /product="Ribosomal silencing factor RsfA" /note="Mb2443c, -, len: 126 aa. Equivalent to Rv2420c, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Conserved hypothetical protein, equivalent to Q9CBZ9|ML1453 HYPOTHETICAL PROTEIN from Mycobacterium leprae (129 aa), FASTA scores: opt: 681, E(): 1.6e-38, (87.0% identity in 123 aa overlap). Also highly similar to Q9RDK9|SCC123.15c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (148 aa), FASTA scores: opt: 447, E(): 5.8e-23, (52.7% identity in 129 aa overlap); and similar to others e.g. P54457|YQEL_BACSU HYPOTHETICAL PROTEIN from Bacillus subtilis (118 aa), FASTA scores: opt: 318, E(): 1.8e-14, (37.3% identity in 110 aa overlap); Q9KD89|BH1328 HYPOTHETICAL PROTEIN from Bacillus halodurans (117 aa), FASTA scores: opt: 296, E(): 5.1e-13, (37.6% identity in 109 aa overlap); etc. Protein product from Mb2443c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2443c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y150" /db_xref="InterPro:IPR004394" /db_xref="UniProtKB/TrEMBL:A0A1R3Y150" /protein_id="SIU01058.1" /translation="MTANREAIDMARVAAGAAAAKLADDVVVIDVSGQLVITDCFVIA SGSNERQVNAIVDEVEEKMRQAGYRPARREGAREGRWTLLDYRDIVVHIQHQDDRNFY ALDRLWGDCPVVPVDLSANSAGAQ" CDS complement(2690769..2691404) /codon_start=1 /transl_table=11 /gene="nadD" /locus_tag="BQ2027_MB2444C" /product="PROBABLE NICOTINATE-NUCLEOTIDE ADENYLYLTRANSFERASE NADD (DEAMIDO-NAD(+) PYROPHOSPHORYLASE) (DEAMIDO-NAD(+) DIPHOSPHORYLASE) (NICOTINATE MONONUCLEOTIDE ADENYLYLTRANSFERASE) (NAMN ADENYLYLTRANSFERASE)" /note="Mb2444c, nadD, len: 211 aa. Equivalent to Rv2421c, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 211 aa overlap). Probable nadD, nicotinate-nucleotide adenylyltransferase (EC 2.7.7.18), equivalent to Q9CBZ8|NADD_MYCLE|ML1454 PROBABLE NICOTINATE-NUCLEOTIDE ADENYLYLTRANSFERASE from Mycobacterium leprae (214 aa), FASTA scores: opt: 1125, E(): 2.7e-66, (80.2% identity in 212 aa overlap). Also highly similar to Q9RDK7|NADD_STRCO PROBABLE NICOTINATE-NUCLEOTIDE ADENYLYLTRANSFERASE from Streptomyces coelicolor (188 aa), FASTA scores: opt: 855, E(): 9.8e-49, (66.5% identity in 194 aa overlap); and similar to others e.g. P54455|NADD_BACSU from Bacillus subtilis (189 aa), FASTA scores: opt: 351, E(): 7e-16, (36.1% identity in 191 aa overlap); etc. BELONGS TO THE NADD FAMILY. Protein product from Mb2444c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2444c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYM1" /db_xref="InterPro:IPR004821" /db_xref="InterPro:IPR005248" /db_xref="InterPro:IPR014729" /db_xref="UniProtKB/Swiss-Prot:Q7TYM1" /protein_id="SIU01059.1" /translation="MGGTFDPIHYGHLVAASEVADLFDLDEVVFVPSGQPWQKGRQVS AAEHRYLMTVIATASNPRFSVSRVDIDRGGPTYTKDTLADLHALHPDSELYFTTGADA LASIMSWQGWEELFELARFVGVSRPGYELRNEHITSLLGQLAKDALTLVEIPALAISS TDCRQRAEQSRPLWYLMPDSVVQYVSKCRLYCGACDAGARSTTSLAAGNGL" CDS 2691679..2691951 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2445" /product="HYPOTHETICAL PROTEIN" /note="Mb2445, -, len: 90 aa. Equivalent to Rv2422, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3Y160" /protein_id="SIU01060.1" /translation="MPASVSTVLVDTSVAVAPVVADHDHHEDTFQALRGRTLGLAGHA AFERRTLATVAKLLAHTFPATRFLGAGAAMSLLPELAPAEIAGGAV" CDS 2692193..2693239 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2446" /product="SAM-dependent methyltransferase" /note="Mb2446, -, len: 348 aa. Equivalent to Rv2423, len: 348 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 348 aa overlap). Hypothetical unknown protein. Protein product from Mb2446 detected using SWATH mass spectrometry. Mb2446 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y166" /protein_id="SIU01061.1" /translation="MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDE YVTMCAGLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTLHY QVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRVLEIGAGTGRN ALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMRDVFSTMDDLRQDYQLMVL SEVVPDFRTTQQLRNLFELAAQCLAPGARLVFNAFLANGDYAPDQAAREFGQQMYTGM CTRAEMSAAAAGLPLELVADDSVYDYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVES CPIEMRWLVFQRRR" mobile_element complement(2693240..2694450) /mobile_element_type="insertion sequence:IS1558" /locus_tag="BQ2027_IS1558-2" /note="IS1558-2, len: 1211 nt. Equivalent to IS1558, len: 999 nt, from Mycobacterium tuberculosis strain H37Rv,(99.4% identity in 999 nt overlap)." repeat_region 2693240..2693252 /rpt_type=INVERTED /note="13 bp imperfect inverted repeat, IRR,GCAGTCGTAAAAG, flanking IS element IS1558." gene complement(2693240..2694450) /locus_tag="BQ2027_IS1558-2" CDS complement(2693372..2694064) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2447C" /product="PROBABLE TRANSPOSASE [SECOND PART]" /note="Mb2447c, -, len: 230 aa. Equivalent to 3' end of Rv2424c, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 230 aa overlap). Probable transposase for IS1558, similar to IS element proteins e.g. AL021957|Rv2177c|MTV021_10 from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 1491, E(): 6.2e-87, (98.6% identity in 221 aa overlap); P19780|YIS1_STRCO HYPOTHETICAL INSERTION ELEMENT IS110 from Streptomyces coelicolor (45 aa), FASTA scores: opt: 203, E(): 1.7e-05; (27.3% identity in 238 aa overlap); etc. Contains PS01159 WW/rsp5/WWP domain signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2424c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gt-*) splits Rv2424c into 2 parts, Mb2447c and Mb2448c. Mb2447c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y171" /db_xref="InterPro:IPR003346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y171" /protein_id="SIU01062.1" /translation="MLADLARGSMRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLD AMIGALDEQIEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLAS WVRLCPGNHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFGG FRSPAANKKAIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKERRRLVAKLE AQGLGVTLEPAA" CDS complement(2694078..2694371) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2448C" /product="PROBABLE TRANSPOSASE [FIRST PART]" /note="Mb2448c, -, len: 97 aa. Similar to 5' end of Rv2424c, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Probable transposase for IS1558, similar to IS element proteins e.g. AL021957|Rv2177c|MTV021_10 from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 1491, E(): 6.2e-87, (98.6% identity in 221 aa overlap); P19780|YIS1_STRCO HYPOTHETICAL INSERTION ELEMENT IS110 from Streptomyces coelicolor (45 aa), FASTA scores: opt: 203, E(): 1.7e-05; (27.3% identity in 238 aa overlap); etc. Contains PS01159 WW/rsp5/WWP domain signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2424c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gt-*) splits Rv2424c into 2 parts, Mb2447c and Mb2448c. Mb2448c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y137" /protein_id="SIU01063.1" /translation="MQCRAREERPGRKTDLLDAEWLVHLLECGLLRGWLIPPADIKAA RDVIRYRRKLVEHRTSKLQRLGNASRRRDQGRQRGVLGHPQVGAGDGGGAHRR" repeat_region complement(2694438..2694450) /rpt_type=INVERTED /note="13 bp imperfect inverted repeat, IRL,GCAGTCGCAAAAG, flanking IS element IS1558." CDS complement(2694460..2695902) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2449C" /product="Carbon monoxide oxidation accessory protein CoxE" /note="Mb2449c, -, len: 480 aa. Equivalent to Rv2425c, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 480 aa overlap). Hypothetical protein; C-terminal half shares similarity to other unknown conserved proteins e.g. Q53065 HYPOTHETICAL 24.3 KDA PROTEIN from Rhodococcus erythropolis (219 aa), FASTA scores: opt: 398, E(): 9.9e-17, (34.15% identity in 202 aa overlap); C-terminus of O27843|MTH1815 CONSERVED PROTEIN from Methanothermobacter thermautotrophicus (346 aa), FASTA scores: opt: 341, E(): 3.7e-13, (31.35% identity in 233 aa overlap); etc. Protein product from Mb2449c detected using SWATH mass spectrometry. Mb2449c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008912" /db_xref="InterPro:IPR011195" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C9" /protein_id="SIU01064.1" /translation="MAARRIRAARPLAPHGLPGHLVGFVEALRGSGISVGPSETVDAG RVMATLGLGDREVLREGIACAVLRRPDHRDTYDAMFDLWFPAALGARAVITTEDESAG SGGLPPDDVEAMRQLLLDLLANNQDLAGKDERLVEMIARIVEAYGKYSSSRGPSFSSY QALKAMALDELEGKLLAGLLAPYGDEPTATQEQIAKALAAQKIAQLRRMVDAETKRRT AEQLGREHVQMYGIPQLSENVEFLRASGEQLRQMRRVVAPLARTLATRLAARRRRARA GSIDLRKTLRKSMSTGGVPIDLVLHKPRPARPELVVLCDVSGSVAGFSHFTLLLVHAL RQQFSRVRVFAFIDSTDEVTHMFGPESDLAIAIQRITREAGVYARDGHSDYGNAFVSF MQGFPNVLSPRSSLLVLGDGRTNYRNPATDVLADMVTASRHAHWLNPEPKHLWGSGDS AVPRYQEVITMHECRSAKQLATVIDQLLPV" CDS complement(2695902..2696777) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2450C" /product="Carbon monoxide oxidation accessory protein CoxD" /note="Mb2450c, -, len: 291 aa. Equivalent to Rv2426c, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 291 aa overlap). Conserved hypothetical protein, highly similar to others e.g. Q51326|ORF4 from Pseudomonas carboxydovorans (295 aa), FASTA scores: opt: 853, E(): 3.7e-43, (48.75% identity in 277 aa overlap); BAB47746|MLR0088 from Rhizobium loti (309 aa), FASTA scores: opt :809, E(): 1.5e-40, (46.5% identity in 291 aa overlap); Q9Y9R8|APE2220 from Aeropyrum pernix (297 aa), FASTA scores: opt: 763, E(): 7.4e-38, (47.1% identity in 261 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2450c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2450c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1Z5" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011704" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Z5" /protein_id="SIU01065.1" /translation="MTVPARPTPLFADIADVSRRLAETGYLPDTATATAVFLADRLGK PLLVEGPAGVGKTELARAVAQATGSGLVRLQCYEGVDEARALYEWNHAKQILRIQAGS GDWEATKTDVLSEEFLLQRPLLTAIRRTEPTVLLIDETDKADIEIEGLLLEVLSDFAV TVPELGTLTATRAPFVLLTSNATRELSEALKRRCLYLHIDFPTPELERRILLSRVPEL PEHFAEELVRIIGVLRGMQLKKVPSIAETIDWGRTVLALGLDTIDDAVVAATLGVVLK HQSDQQRATGELRLN" CDS complement(2696804..2698051) /codon_start=1 /transl_table=11 /gene="proA" /locus_tag="BQ2027_MB2451C" /product="PROBABLE GAMMA-GLUTAMYL PHOSPHATE REDUCTASE PROTEIN PROA (GPR) (GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE) (GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE)" /note="Mb2451c, proA, len: 415 aa. Equivalent to Rv2427c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 415 aa overlap). Probable proA, gamma-glutamyl phosphate reductase protein (EC 1.2.1.41), equivalent to Q9CBZ7|ML1458|PROA [GAMMA]-GLUTAMYL PHOSPHATE REDUCTASE from Mycobacterium leprae (409 aa), FASTA scores: opt: 2120, E(): 7.4e-118, (81.9% identity in 409 aa overlap). Also highly similar or similar to other GAMMA-GLUTAMYL PHOSPHATE REDUCTASES PROTEINS (GPR) e.g. Q9RDK1|PROA from Streptomyces coelicolor (428 aa), FASTA scores: opt: 1073, E(): 4.6e-56, (60.4% identity in 429 aa overlap); P45638|PROA_CORGL from Corynebacterium glutamicum (432 aa), FASTA scores: opt: 993, E(): 2.4e-51, (58.5% identity in 417 aa overlap); P96489|PROA_STRTR GAMMA-GLUTAMYL PHOSPHATE REDUCTASE from Streptococcus thermophilus (416 aa), FASTA scores: opt: 863, E(): 1.1e-43, (49.15% identity in 413 aa overlap); etc. BELONGS TO THE GAMMA-GLUTAMYL PHOSPHATE REDUCTASE FAMILY. Protein product from Mb2451c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2451c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65789" /db_xref="InterPro:IPR000965" /db_xref="InterPro:IPR012134" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="InterPro:IPR020593" /db_xref="UniProtKB/Swiss-Prot:P65789" /protein_id="SIU01066.1" /translation="MTVPAPSQLDLRQEVHDAARRARVAARRLASLPTTVKDRALHAA ADELLAHRDQILAANAEDLNAAREADTPAAMLDRLSLNPQRVDGIAAGLRQVAGLRDP VGEVLRGYTLPNGLQLRQQRVPLGVVGMIYEGRPNVTVDAFGLTLKSGNAALLRGSSS AAKSNEALVAVLRTALVGLELPADAVQLLSAADRATVTHLIQARGLVDVVIPRGGAGL IEAVVRDAQVPTIETGVGNCHVYVHQAADLDVAERILLNSKTRRPSVCNAAETLLVDA AIAETALPRLLAALQHAGVTVHLDPDEADLRREYLSLDIAVAVVDGVDAAIAHINEYG TGHTEAIVTTNLDAAQRFTEQIDAAAVMVNASTAFTDGEQFGFGAEIGISTQKLHARG PMGLPELTSTKWIAWGAGHTRPA" CDS complement(2698134..2698439) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2452C" /note="unnamed protein product; Mb2452c, -, len: 101 aa. Equivalent to the second part of oxyR' pseudogene (see citation below)" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J3" /protein_id="SIU01067.1" /translation="MHEGHCLRDQTLDAAQHPGGVAGHRRAVRDRRAGGDTDSADRGR RRDHAKPAGTRPIRRPCPARRIGLVFSSFGGREKSYQRLAGIIGKLIRGDRQVRLIA" CDS complement(2698440..2698634) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2453C" /note="unnamed protein product; Mb2453c, -, len: 64 aa. Equivalent to the first part of oxyR' pseudogene. Mb2453c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3Y157" /protein_id="SIU01068.1" /translation="MAGLRAFAAVAAKQWFSSAASILDMSQSTLRRAVVGSRSRCTPR ACLSGKQRVPLTALSELTLL" CDS 2698767..2699354 /codon_start=1 /transl_table=11 /gene="ahpC" /locus_tag="BQ2027_MB2454" /product="ALKYL HYDROPEROXIDE REDUCTASE C PROTEIN AHPC (ALKYL HYDROPEROXIDASE C)" /note="Mb2454, ahpC, len: 195 aa. Equivalent to Rv2428, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 195 aa overlap). ahpC, alkyl hydroperoxide reductase C (EC 1.-.-.-) (see citations below), equivalent to other alkyl hydroperoxide reductases C mycobacterial proteins e.g. Q9CBF5|AHPC|ML2042 ALKYL HYDROPEROXIDE REDUCTASE from Mycobacterium leprae (195 aa) FASTA scores: opt: 1183, E(): 2.6e-72, (88.20% identity in 195 aa overlap); O87323|AHPC from Mycobacterium marinum (195 aa), FASTA scores: opt: 1215, E(): 1.9e-74, (90.8% identity in 195 aa overlap); Q57413|AHPC|AVI-3 from Mycobacterium avium (195 aa), FASTA scores: opt: 1201, E(): 1.6e-73, (90.25% identity in 195 aa overlap). Also highly similar to others from other organisms e.g. Q9FBP5|AHPC ALKYL HYDROPEROXIDE REDUCTASE from Streptomyces coelicolor (184 aa), FASTA scores: opt: 768, E(): 1.7e-44, (62.45% identity in 189 aa overlap); etc. Protein product from Mb2454 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2454 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y168" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR024706" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y168" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01069.1" /translation="MPLLTIGDQFPAYQLTALIGGDLSKVDAKQPGDYFTTITSDEHP GKWRVVFFWPKDFTFVCPTEIAAFSKLNDEFEDRDAQILGVSIDSEFAHFQWRAQHND LKTLPFPMLSDIKRELSQAAGVLNADGVADRVTFIVDPNNEIQFVSATAGSVGRNVDE VLRVLDALQSDELCACNWRKGDPTLDAGELLKASA" CDS 2699380..2699913 /codon_start=1 /transl_table=11 /gene="ahpD" /locus_tag="BQ2027_MB2455" /product="ALKYL HYDROPEROXIDE REDUCTASE D PROTEIN AHPD (ALKYL HYDROPEROXIDASE D)" /note="Mb2455, ahpD, len: 177 aa. Equivalent to Rv2429, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap). ahpD, alkyl hydroperoxide reductase (EC 1.-.-.-), similar to other alkyl hydroperoxide reductases D proteins e.g. Q9RN73|AHPD from Streptomyces coelicolor (178 aa), FASTA scores: opt: 611, E(): 1.4e-33, (57.4% identity in 169 aa overlap); Q50441|AHPD_MYCSM AHPD PROTEIN (FRAGMENT) from Mycobacterium smegmatis (52 aa), FASTA score: opt:196. Protein product from Mb2455 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2455 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5N5" /db_xref="InterPro:IPR003779" /db_xref="InterPro:IPR004674" /db_xref="InterPro:IPR004675" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/Swiss-Prot:P0A5N5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01070.1" /translation="MSIEKLKAALPEYAKDIKLNLSSITRSSVLDQEQLWGTLLASAA ATRNPQVLADIGAEATDHLSAAARHAALGAAAIMGMNNVFYRGRGFLEGRYDDLRPGL RMNIIANPGIPKANFELWSFAVSAINGCSHCLVAHEHTLRTVGVDREAIFEALKAAAI VSGVAQALATIEALSPS" CDS complement(2699910..2700494) /codon_start=1 /transl_table=11 /gene="PPE41" /locus_tag="BQ2027_MB2456C" /product="ppe family protein ppe41" /note="Mb2456c, PPE41, len: 194 aa. Equivalent to Rv2430c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 194 aa overlap). Member of the Mycobacterium tuberculosis PPE family similar to others e.g. AAK46014|Rv1745|MT1745 from Mycobacterium tuberculosis (385 aa) FASTA scores: opt: 389, E(): 1.2e-17, (35.95% identity in 192 aa overlap); etc. Protein product from Mb2456c detected using SWATH mass spectrometry. Mb2456c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y173" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01071.1" /translation="MHFEAYPPEVNSANIYAGPGPDSMLAAARAWRSLDVEMTAVQRS FNRTLLSLMDAWAGPVVMQLMEAAKPFVRWLTDLCVQLSEVERQIHEIVRAYEWAHHD MVPLAQIYNNRAERQILIDNNALGQFTAQIADLDQEYDDFWDEDGEVMRDYRLRVSDA LSKLTPWKAPPPIAHSTVLVAPVSPSTASSRTDT" CDS complement(2700541..2700840) /codon_start=1 /transl_table=11 /gene="PE25" /locus_tag="BQ2027_MB2457C" /product="pe family protein pe25" /note="Mb2457c, PE25, len: 99 aa. Equivalent to Rv2431c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Member of the Mycobacterium tuberculosis PE family (see first citation below), similar to others e.g. AAK47158|MT2839 from Mycobacterium tuberculosis (275 aa) FASTA scores: opt: 194, E(): 2.5e-06, (40.0% identity in 95 aa overlap); etc. Protein product from Mb2457c detected using SWATH mass spectrometry. Mb2457c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y179" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01072.1" /translation="MSFVITNPEALTVAATEVRRIRDRAIQSDAQVAPMTTAVRPPAA DLVSEKAATFLVEYARKYRQTIAAAAVVLEEFAHALTTGADKYATAEADNIKTFS" CDS complement(2701011..2701421) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2458C" /product="HYPOTHETICAL PROTEIN" /note="Mb2458c, -, len: 136 aa. Equivalent to Rv2432c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Hypothetical unknown protein. Mb2458c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y148" /protein_id="SIU01073.1" /translation="MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEP GAMMGFPCRPALLPHLSRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHV RWWLASDGHWGMVSYIPTALNVSMGGIVGWRCVP" CDS complement(2701418..2701708) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2459C" /product="HYPOTHETICAL PROTEIN" /note="Mb2459c, -, len: 96 aa. Equivalent to Rv2433c, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). Hypothetical unknown protein." /db_xref="GOA:A0A1R3Y3D5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D5" /protein_id="SIU01074.1" /translation="MGLRDADERWDTVGQAIGLFLRGHTLRTAAPTALIVGTVLCAVN QGATLAEGAATIGTWVRMVINYLVPFLVASVGYLGARRGVRRASGRSDPSAQ" CDS complement(2701689..2703134) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2460C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2460c, -, len: 481 aa. Equivalent to Rv2434c, len: 481 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 481 aa overlap). Probable conserved transmembrane protein, with some similarity to BAB48444|MLR0973 PROBABLE INTEGRAL MEMBRANE PROTEIN from Rhizobium loti (410 aa), FASTA scores: opt: 298, E(): 4.1e-11, (27.25% identity in 389 aa overlap); and also similarity with other hypothetical proteins and/or putative integral membrane proteins. Mb2460c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y204" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR006685" /db_xref="InterPro:IPR010920" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016846" /db_xref="InterPro:IPR018490" /db_xref="UniProtKB/TrEMBL:A0A1R3Y204" /protein_id="SIU01075.1" /translation="MNLLDSTWFYWAVGIAIGLPAGLIVLTELHNILVRRNSHLARQA SLLRNYLLPLGAVLLLLVKASEVPAEDPTVRVLTTAFGFLVLVLLLSLLNATLFQGAP QQSWRKRLPAIFVDVARFALIGIGLAVILSYIWGVRVGGLFAALGVTSVVIGLMLQNS VGQIVSGLFMLFEQPFRIDDWLETPTARGRVVEVNWRAVHIDTGSGLQIMPNSMLATT AFTNLSRPAGAHECSITTTFSTSDPPDKVCAMLNRAASALPHVKPGVVPATIARGAAE YRTTVRLTSPADEGPTQATFLRWVWYAARREGLHLDEADDEFSTAERVESALRTVVGP ELRLSSSDQQSLARYARLVRYGTDEIVQHAGVVPMGITFVIAGSVRLTVTTDDGSVVA IATLKKGTFLGLTALTRQPDPAGAVALEEVTALQIGREHLEQVVMNKPMLLQELGRVI DERQRKAQQAIRRDLHQSPAAAGEHRGPARR" CDS complement(2703131..2705323) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2461C" /product="probable cyclase (adenylyl-or guanylyl-)(adenylate-or guanylate-)" /note="Mb2461c, -, len: 730 aa. Equivalent to Rv2435c, len: 730 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 730 aa overlap). Probably a cyclase (adenylyl- or guanylyl-cyclase; EC 4.6.1.1 or 4.6.1.2 respectively); C-terminal domain (aa 500-730) similar to domain at C-terminus of a series of adenylate/guanylate cyclases (EC 4.6.-.-) e.g. O30820|CYA AAK45931|MT1661 from Mycobacterium tuberculosis (443 aa) FASTA scores: opt: 446, E(): 1.3e-19, (30.55% identity in 301 aa overlap); BAB50179|MLL3242 CYCLASE (ADENYLYL OR GUANYLYL) from Rhizobium loti (356 aa), FASTA scores: opt: 372, E(): 3.4e-15, (28.75% identity in 219 aa overlap); etc. BELONGS TO ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY. Mb2461c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1E6" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E6" /protein_id="SIU01076.1" /translation="MTSGEALDSVAESESTPAKKRHKNVLRRRPRFRASIQSKLMVLL LLTSIVSVAAIAAIVYQSGRTSLRAAAYERLTQLRESQKRAVETLFSDLTNSLVIYER GLTVVDAVVRFTAGFDQLADATISPAQQQAIVNYYNNEFITPVERTTGDKLDITALLP TSPAQRYLQAYYTAPFTSDQDAMRLDDAGDGSAWSAANAQFNSYFREIVTRFDYDDAV LLDTRGNIVYTLSKDPDLGTNILTGPYRESNLRDAYLKALGANAVDFTWITDFKPYQP QLGVPTAWLVAPVEAGGKTQGVLALPLPIDKINKIMTADRQWQAAGMGSGTETYLAGP DSLMRSDSRLFLQDPEEYRKQVVAAGTSLDVVNRAIQFGGTTLLQPVATEGLRAAQRG QTGTVTSTDYTGSRELEAYAPLNVPDSDLHWSILATRNDSEAFAAVASFSRALVLVTV GIIVVICVASMLIAHAMVRPIRRLEVGTQKISAGDYEVNIPVKSRDEIGDLTAAFNEM SRNLQTKEELLNEQRKENDRLLLSMMPEPVVERYRLGEQTIAQEHQDVTVLFADILGV DEISSGLSGNELVKIVDELVRQFDSAAEHLGVERIRTLHNGYLAGCGVTTPRLDNIPR TVDFALEMRRIVDRFNCQTGNDLHLRVGINTGDVISGLVGRSSVVYDMWGAAVSLAYQ MHSGSPQPGIYVTSQVYEAMRDVWQFTAAGTISVGGLEEPIYRLSERS" CDS 2705804..2706718 /codon_start=1 /transl_table=11 /gene="rbsK" /locus_tag="BQ2027_MB2462" /product="RIBOKINASE RBSK" /note="Mb2462, rbsK, len: 304 aa. Equivalent to Rv2436, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 304 aa overlap). Probable rbsK, ribokinase (EC 2.7.1.15), similar to others e.g. Q9RZ99|DRA0055 from Deinococcus radiodurans (300 aa) FASTA scores: opt: 485, E(): 9.1e-21, (44.55% identity in 301 aa overlap); P36945|P96733|RBSK_BACSU from Bacillus subtilis (293 aa), FASTA scores: opt: 398, E(): 8.5e-16, (36.35% identity in 297 aa overlap); P05054|RBSK_ECOLI|B3752|Z5253|ECS4694 from Escherichia coli strain K12 (309 aa), FASTA scores: opt: 387, E(): 3.8e-15, (34.7% identity in 314 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. BELONGS TO THE PFKB FAMILY OF CARBOHYDRATE KINASES. Protein product from Mb2462 detected using SWATH mass spectrometry. Mb2462 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1K3" /db_xref="InterPro:IPR002139" /db_xref="InterPro:IPR011611" /db_xref="InterPro:IPR011877" /db_xref="InterPro:IPR029056" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1K3" /protein_id="SIU01077.1" /translation="MANASETNVGPMAPRVCVVGSVNMDLTFVVDALPRPGETVLAAS LTRTPGGKGANQAVAAARAGAQVQFSGAFGDDPAAAQLRAHLRANAVGLDRTVTVPGP SGTAIIVVDASAENTVLVAPGANAHLTPVPSAVANCDVLLTQLEIPVATALAAARAAQ SADAVVMVNASPAGQDRSSLQDLAAIADVVIANEHEANDWPSPPTHFVITLGVRGARY VGADGVFEVPAPTVTPVDTAGAGDVFAGVLAANWPRNPGSPAERLRALRRACAAGVLA TLVSGAGDCAPAAAAIDAALRANRHNGS" CDS 2706950..2707369 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2463" /product="conserved transmembrane protein" /note="Mb2463, -, len: 139 aa. Equivalent to Rv2437, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein, with some similarity to CONSERVED HYPOTHETICAL PROTEINS e.g. O06539|RV1139C|MTCI65.06c from Mycobacterium tuberculosis (166 aa); AAK45430|MT1172 from Mycobacterium tuberculosis (124 aa), FASTA scores: opt: 166, E(): 0.00013, (35.7% identity in 112 aa overlap); BAB48937|Mlr1600 from Rhizobium loti (222 aa), FASTA scores: opt: 163, E(): 0.00033, (28.1% identity in 121 aa overlap); etc. Mb2463 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y167" /db_xref="InterPro:IPR007318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y167" /protein_id="SIU01078.1" /translation="MLQRTNVVQPLNTLRMVWIQVAGIIPATAGIAATVYAQLAMGDS WRIGVDEQENTTLVRTGPFKWVRHPIYTAMMAFGLGLLLVTPNLVALAGFILLVATLE VHVRRVEEPYLLRTHSAVYRGYTASVGRFVPGVGLIR" CDS complement(2707366..2709405) /codon_start=1 /transl_table=11 /gene="nadE" /locus_tag="BQ2027_MB2464C" /product="GLUTAMINE-DEPENDENT NAD(+) SYNTHETASE NADE (NAD(+) SYNTHASE [GLUTAMINE-HYDROLYSING])" /note="Mb2464c, nadE, len: 679 aa. Equivalent to Rv2438c, len: 679 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 679 aa overlap). nadE, glutamine-dependent NAD(+) synthetase (EC 6.3.5.1) (see citation below), equivalent to Q9CBZ6|NADE_MYCLE|ML1463 Glutamine-dependent NAD(+) synthetase from Mycobacterium leprae (680 aa), FASTA scores: opt: 3877, E(): 0. Also similar to others e.g. O83759|NADE_TREPA|TP0780 from Treponema pallidum (679 aa), FASTA scores: opt: 543, E(): 1.1e-25; O74940|NADE_SCHPO|SPCC553.02 from Schizosaccharomyces pombe (Fission yeast) (700 aa), FASTA scores: opt: 354, E(): 4.7e-14; P38795|NADE_YEAST|YHR074W from Saccharomyces cerevisiae (Baker's yeast) (714 aa), FASTA scores: opt: 339, E(): 4e-13; etc. Contains PS00591 Glycosyl hydrolases family 10 active site. BELONGS TO THE NAD SYNTHETASE FAMILY IN THE C-TERMINAL SECTION. N-terminus shorter since first submission. Protein product from Mb2464c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2464c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5L7" /db_xref="InterPro:IPR003010" /db_xref="InterPro:IPR003694" /db_xref="InterPro:IPR014445" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR022310" /db_xref="InterPro:IPR036526" /db_xref="InterPro:IPR041856" /db_xref="UniProtKB/Swiss-Prot:P0A5L7" /protein_id="SIU01079.1" /translation="MNFYSAYQHGFVRVAACTHHTTIGDPAANAASVLDMARACHDDG AALAVFPELTLSGYSIEDVLLQDSLLDAVEDALLDLVTESADLLPVLVVGAPLRHRHR IYNTAVVIHRGAVLGVVPKSYLPTYREFYERRQMAPGDGERGTIRIGGADVAFGTDLL FAASDLPGFVLHVEICEDMFVPMPPSAEAALAGATVLANLSGSPITIGRAEDRRLLAR SASARCLAAYVYAAAGEGESTTDLAWDGQTMIWENGALLAESERFPKGVRRSVADVDT ELLRSERLRMGTFDDNRRHHRELTESFRRIDFALDPPAGDIGLLREVERFPFVPADPQ RLQQDCYEAYNIQVSGLEQRLRALDYPKVVIGVSGGLDSTHALIVATHAMDREGRPRS DILAFALPGFATGEHTKNNAIKLARALGVTFSEIDIGDTARLMLHTIGHPYSVGEKVY DVTFENVQAGLRTDYLFRIANQRGGIVLGTGDLSELALGWSTYGVGDQMSHYNVNAGV PKTLIQHLIRWVISAGEFGEKVGEVLQSVLDTEITPELIPTGEEELQSSEAKVGPFAL QDFSLFQVLRYGFRPSKIAFLAWHAWNDAERGNWPPGFPKSERPSYSLAEIRHWLQIF VQRFYSFSQFKRSALPNGPKVSHGGALSPRGDWRAPSDMSARIWLDQIDREVPKG" CDS 2709283..2709561 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2465" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2465, -, len: 92 aa. Equivalent to Rv2438A, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Conserved hypothetical protein, showing few similarity with various enzymes e.g. part of O83441|VAA1_TREPA|ATPA1|TP0426 V-TYPE ATP SYNTHASE ALPHA CHAIN 1 (EC 3.6.1.34) from Treponema pallidum (589 aa), FASTA scores: opt: 110, E(): 1.5, (40.3% identity in 72 aa overlap); N-terminus of O95178|NIGM_HUMAN NADH-UBIQUINONE OXIDOREDUCTASE AGGG SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 1.6.99.3) from Homo sapiens (105 aa), FASTA scores: opt: 109, E(): 1.5, (35.5% identity in 62 aa overlap); N-terminus of Q9HJ76|TA1096 PROBABLE GLYCEROL KINASE from Thermoplasma acidophilum (488 aa); etc." /db_xref="UniProtKB/TrEMBL:A0A1R3Y181" /protein_id="SIU01080.1" /translation="MARTGHVQYRRGVGRRVTDGGVVSAGGNAHEPVLVGGVKVHRPF IVAQRRQNARITRRVSTLDTVESPALLADGGIDRRGDATDWAAADPGP" CDS complement(2709691..2710821) /codon_start=1 /transl_table=11 /gene="proB" /locus_tag="BQ2027_MB2466C" /product="PROBABLE GLUTAMATE 5-KINASE PROTEIN PROB (GAMMA-GLUTAMYL KINASE) (GK)" /note="Mb2466c, proB, len: 376 aa. Equivalent to Rv2439c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 376 aa overlap). Probable proB, glutamate 5-kinase protein (GK) (EC 2.7.2.11), equivalent to Q9CBZ5|PROB|ML1464 from Mycobacterium leprae (367 aa) FASTA scores: opt: 1937, E(): 1.1e-102, (84.4% identity in 366 aa overlap). Also highly similar to other glutamate 5-kinase proteins e.g. P46546|PROB_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (369 aa), FASTA scores: opt: 1241, E(): 3e-63, (54.35% identity in 368 aa overlap); Q9ZG98|PROB_MEIRU GLUTAMATE 5-KINASE from Meiothermus ruber (390 aa), FASTA scores: opt: 825, E(): 1.2e-39, (45.05% identity in 353 aa overlap); Q9RDJ9|PROB|SCC123.25c from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1193, E(): 1.6e-60, (55.85% identity in 367 aa overlap); etc. Contains PS00902 Glutamate 5-kinase signature. BELONGS TO THE GLUTAMATE 5-KINASE FAMILY. Protein product from Mb2466c detected using SWATH mass spectrometry. Mb2466c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59958" /db_xref="InterPro:IPR001048" /db_xref="InterPro:IPR001057" /db_xref="InterPro:IPR002478" /db_xref="InterPro:IPR005715" /db_xref="InterPro:IPR011529" /db_xref="InterPro:IPR015947" /db_xref="InterPro:IPR019797" /db_xref="InterPro:IPR036393" /db_xref="InterPro:IPR036974" /db_xref="InterPro:IPR041739" /db_xref="UniProtKB/Swiss-Prot:P59958" /protein_id="SIU01081.1" /translation="MRSPHRDAIRTARGLVVKVGTTALTTPSGMFDAGRLAGLAEAVE RRMKAGSDVVIVSSGAIAAGIEPLGLSRRPKDLATKQAAASVGQVALVNSWSAAFARY GRTVGQVLLTAHDISMRVQHTNAQRTLDRLRALHAVAIVNENDTVATNEIRFGDNDRL SALVAHLVGADALVLLSDIDGLYDCDPRKTADATFIPEVSGPADLDGVVAGRSSHLGT GGMASKVSAALLAADAGVPVLLAPAADAATALADASVGTVFAARPARLSARRFWVRYA AEATGALTLDAGAVRAVVRQRRSLLAAGITAVSGRFCGGDVVELRAPDAAMVARGVVA YDASELATMVGRSTSELPGELRRPVVHADDLVAVSAKQAKQV" CDS complement(2710821..2712260) /codon_start=1 /transl_table=11 /gene="obg" /locus_tag="BQ2027_MB2467C" /product="PROBABLE GTP1/OBG-FAMILY GTP-BINDING PROTEIN OBG" /note="Mb2467c, obg, len: 479 aa. Equivalent to Rv2440c, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). Probable obg, nucleotide-binding protein, equivalent to Q9CBZ4|ML1465 GTP1/OBG-FAMILY GTP-BINDING PROTEIN from Mycobacterium leprae (478 aa), FASTA scores: opt: 1328, E(): 8.4e-70, (58.9% identity in 479 aa overlap). Also highly similar to others e.g. P95722|OBG GTP-BINDING PROTEIN from Streptomyces coelicolor (478 aa), FASTA scores: opt: 1311, E(): 8.2e-69, (60.7% identity in 476 aa overlap); P20964|OBG_BACSU SPO0B-ASSOCIATED GTP-BINDING PROTEIN from Bacillus subtilis (428 aa), FASTA scores: opt: 1006, E(): 3.9e-51, (42.9% identity in 436 aa overlap); Q9KDK0|OBG|BH1213 GTP-BINDING PROTEIN INVOLVED IN INITIATION OF SPORULATION from Bacillus halodurans (427 aa), FASTA scores: opt: 978, E(): 1.7e-49, (41.95% identity in 436 aa overlap); etc. Highly similar (identical but shorter 5 aa) to AAK46813|MT2516 GTP-BINDING PROTEIN from Mycobacterium tuberculosis strain CDC1551 (484 aa), FASTA scores: opt: 3205, E(): 7.9e-179, (100% identity in 479 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE GTP1/OBG FAMILY. Protein product from Mb2467c detected using SWATH mass spectrometry. Mb2467c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYK5" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR006074" /db_xref="InterPro:IPR006169" /db_xref="InterPro:IPR014100" /db_xref="InterPro:IPR015349" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031167" /db_xref="InterPro:IPR036346" /db_xref="InterPro:IPR036726" /db_xref="UniProtKB/Swiss-Prot:Q7TYK5" /protein_id="SIU01082.1" /translation="MPRFVDRVVIHTRAGSGGNGCASVHREKFKPLGGPDGGNGGRGG SIVFVVDPQVHTLLDFHFRPHLTAASGKHGMGNNRDGAAGADLEVKVPEGTVVLDENG RLLADLVGAGTRFEAAAGGRGGLGNAALASRVRKAPGFALLGEKGQSRDLTLELKTVA DVGLVGFPSAGKSSLVSAISAAKPKIADYPFTTLVPNLGVVSAGEHAFTVADVPGLIP GASRGRGLGLDFLRHIERCAVLVHVVDCATAEPGRDPISDIDALETELACYTPTLQGD AALGDLAARPRAVVLNKIDVPEARELAEFVRDDIAQRGWPVFCVSTATRENLQPLIFG LSQMISDYNAARPVAVPRRPVIRPIPVDDSGFTVEPDGHGGFVVSGARPERWIDQTNF DNDEAVGYLADRLARLGVEEELLRLGARSGCAVTIGEMTFDWEPQTPAGEPVAMSGRG TDPRLDSNKRVGAAERKAARSRRREHGDG" CDS complement(2712346..2712606) /codon_start=1 /transl_table=11 /gene="rpmA" /locus_tag="BQ2027_MB2468C" /product="50s ribosomal protein l27 rpma" /note="Mb2468c, rpmA, len: 86 aa. Equivalent to Rv2441c, len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Probable rpmA, 50S RIBOSOMAL PROTEINS L27, equivalent to Q9CBZ3|RL27_MYCLE from Mycobacterium leprae (88 aa), FASTA scores: opt: 504, E(): 7.6e-28, (93.2% identity in 81 aa overlap). Also highly similar to others e.g. P95757|RL27_STRGR from Streptomyces griseus (85 aa), FASTA scores: opt: 442, E(): 1.2e-23, (81.5% identity in 81 aa overlap); etc. Contains PS00831 Ribosomal protein L27 signature. BELONGS TO THE L27P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2468c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2468c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66128" /db_xref="InterPro:IPR001684" /db_xref="InterPro:IPR018261" /db_xref="UniProtKB/Swiss-Prot:P66128" /protein_id="SIU01083.1" /translation="MAHKKGASSSRNGRDSAAQRLGVKRYGGQVVKAGEILVRQRGTK FHPGVNVGRGGDDTLFAKTAGAVEFGIKRGRKTVSIVGSTTA" CDS complement(2712621..2712935) /codon_start=1 /transl_table=11 /gene="rplU" /locus_tag="BQ2027_MB2469C" /product="50s ribosomal protein l21 rplu" /note="Mb2469c, rplU, len: 104 aa. Equivalent to Rv2442c, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Probable rplU, 50S RIBOSOMAL PROTEIN L21, equivalent to Q9CBZ2|RL21_MYCLE from Mycobacterium leprae (103 aa), FASTA scores: opt: 579, E(): 4.8e-31, (91.1% identity in 102 aa overlap). Also highly similar to others e.g. P95756|RL21_STRGR from Streptomyces griseus (106 aa), FASTA scores: opt: 362, E(): 5.4e-17, (56.0% identity in 100 aa overlap); etc. Protein product from Mb2469c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2469c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66118" /db_xref="InterPro:IPR001787" /db_xref="InterPro:IPR018258" /db_xref="InterPro:IPR028909" /db_xref="InterPro:IPR036164" /db_xref="UniProtKB/Swiss-Prot:P66118" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01084.1" /translation="MMATYAIVKTGGKQYKVAVGDVVKVEKLESEQGEKVSLPVALVV DGATVTTDAKALAKVAVTGEVLGHTKGPKIRIHKFKNKTGYHKRQGHRQQLTVLKVTG IA" CDS 2713283..2714758 /codon_start=1 /transl_table=11 /gene="dctA" /locus_tag="BQ2027_MB2470" /product="PROBABLE C4-DICARBOXYLATE-TRANSPORT TRANSMEMBRANE PROTEIN DCTA" /note="Mb2470, dctA, len: 491 aa. Equivalent to Rv2443, len: 491 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 491 aa overlap). Probable dctA, C4-dicarboxylate-transport transmembrane protein, similar to other C4-DICARBOXYLATE TRANSPORT PROTEINS e.g. AAK46817|MT2519 from Mycobacterium tuberculosis strain CDC1551 (491 aa); Q9L1K8|SC6A11.12 PUTATIVE SODIUM:DICARBOXYLATE SYMPORTER from Streptomyces coelicolor (466 aa), FASTA scores: opt: 1797, E(): 2.9e-98, (61.3% identity in 452 aa overlap); Q9RRG7|DR2525 from Deinococcus radiodurans (463 aa); P50334|DCTA_SALTY from Salmonella typhimurium (428 aa) FASTA scores: opt: 1241, E(): 1.3e-65, (47.2% identity in 415 aa overlap); etc. BELONGS TO THE SODIUM DICARBOXYLATE SYMPORTER FAMILY (SDF) (DAACS FAMILY). Protein product from Mb2470 detected using SWATH mass spectrometry. Mb2470 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y209" /db_xref="InterPro:IPR001991" /db_xref="InterPro:IPR036458" /db_xref="UniProtKB/TrEMBL:A0A1R3Y209" /protein_id="SIU01085.1" /translation="MTAPLDRAPVTDLPANNKGRDRTHWLYLAVIFAVIAGVIVGLTA PSTGKSLTVLGTVFVNLIKMMIAPVIFCTIVLGIGSVRKAAAVGKVGGLALAYFLTMS SVALGIGLIVGNLLSPGRDLHLRPGAVGSGAALAGQAAESHGIAGFIQQIIPRSLPSA LTEGNVLQVLLVALLVGFAVQGLGPAGESILRAVENLQKLVFKVLVMVLWLAPIGAFG AIANIVATTGFNAVTNLLLLMAGFYLTCVVFVFGVLGVLLRIVSGLSIFRLLRYLARE YLLIFATSSSEVVLPRLITKMKHLGVQSSTVGVVVPTGYSFNLDGTAIYLTMASLFIA DAMGHRLTWGEQIALLAFMIIASKGAAGVSGAGLATLAGGLQAHRPELLDGVGLIVGI DRFMSEARSLTNFSGNAVATILVASWTKTIDLSKADEVLRGRDPFDESTMVDPHDEEP PAATPHGGGVPTNPALCDFEQVSLGGLVGRPAGPQRADVDG" CDS complement(2714697..2717558) /codon_start=1 /transl_table=11 /gene="rne" /locus_tag="BQ2027_MB2471C" /product="POSSIBLE RIBONUCLEASE E RNE" /note="Mb2471c, rne, len: 953 aa. Equivalent to Rv2444c, len: 953 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 953 aa overlap). Possible rne, ribonuclease E (EC 3.1.-.-), highly similar to others e.g. Q9CBZ1|ML1468 POSSIBLE RIBONUCLEASE from Mycobacterium leprae (924 aa), FASTA scores: opt: 3713, E(): 2.4e-174, (74.2% identity in 966 aa overlap); Q9SI08|AT2G04270 PUTATIVE RIBONUCLEASE E from Arabidopsis thaliana (502 aa), FASTA scores: opt: 674, E(): 7.5e-26, (31.2% identity in 410 aa overlap); etc. Similar at C-terminal end to P21513|RNE_ECOLI|AMS|HMP1|B1084 ribonuclease E (EC 3.1.4.-) (RNASE E) from Escherichia coli strain K12 (1061 aa), FASTA scores: opt: 554, E(): 9.9e-20, (37.8% identity in 386 aa overlap). Also similar in medium part to several cytoplasmic axial filament proteins e.g. Q9HVU4|CAFA|PA4477 from Pseudomonas aeruginosa (485 aa), FASTA scores: opt: 664, E(): 2.3e-25, (42.8% identity in 418 aa overlap); etc. Equivalent to AAK46818 from Mycobacterium tuberculosis strain CDC1551 (621 aa) but longer 332 aa in N-terminal part. SEEMS TO BELONG TO THE RNE FAMILY. Protein product from Mb2471c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2471c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1F2" /db_xref="InterPro:IPR003029" /db_xref="InterPro:IPR004659" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR019307" /db_xref="InterPro:IPR022967" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01086.1" /translation="MIDGAPPSDPPEPSQHEELPDRLRVHSLARTLGTTSRRVLDALT ALDGRVRSAHSTVDRVDAVRVRDLLATHLETAGVLAASVHAPEASEEPESRLMLETQE TRNADVERPHYMPLFVAPQPIPEPLADDEDVDDGPDYVADDSDADDEGQLDRPANRRR RRGRRGRGRGRGEQGGSDGDPVDQQSEPRAQQFTSADAAETDDGDDRDSEDTEAGDNG EDENGSLEAGNRRRRRRRRRKSASGDDNDAALEGPLPDDPPNTVVHERVPRAGDKAGN SQDGGSGSTEIKGIDGSTRLEAKRQRRRDGRDAGRRRPPVLSEAEFLARREAVERVMV VRDRVRTEPPLPGTRYTQIAVLEDGIVVEHFVTSAASASLVGNIYLGIVQNVLPSMEA AFVDIGRGRNGVLYAGEVNWDAAGLGGADRKIEQALKPGDYVVVQVSKDPVGHKGARL TTQVSLAGRFLVYVPGASSTGISRKLPDTERQRLKEILREVVPSDAGVIIRTASEGVK EDDIRADVARLRERWEQIEAKAQETKEKAAGAAVALYEEPDVLVKVIRDLFNEDFVGL IVSGDEAWNTINEYVNSVAPELVSKLTKYESADGPDGQSAPDVFTVHRIDEQLAKAMD RKVWLPSGGTLVIDRTEAMTVIDVNTGKFTGAGGNLEQTVTKNNLEAAEEIVRQLRLR DIGGIVVIDFIDMVLESNRDLVLRRLTESLARDRTRHQVSEVTSLGLVQLTRKRLGTG LIEAFSTSCPNCSGRGILLHADPVDSAAATGRKSEPGARRGKRSKKSRSEESSDRSMV AKVPVHAPGEHPMFKAMAAGLSSLAGRGDEESGEPAAELAEQAGDQPPTDLDDTAQAD FEDTEDTDEDEDELDADEDLEDLDDEDLDEDLDVEDSDSDDEDSDEDAADADVDEEDA AGLDGSPGEVDVPGVTELAPTRPRRRVAGRPAGPPIRLD" CDS complement(2717888..2718298) /codon_start=1 /transl_table=11 /gene="ndkA" /locus_tag="BQ2027_MB2472C" /standard_name="ndk" /product="PROBABLE NUCLEOSIDE DIPHOSPHATE KINASE NDKA (NDK) (NDP KINASE) (NUCLEOSIDE-2-P KINASE)" /note="Mb2472c, ndkA, len: 136 aa. Equivalent to Rv2445c, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Probable ndkA (alternate gene name: ndk), nucleoside diphosphate kinase (EC 2.7.4.6), equivalent to Q9CBZ0|NDK|ML1469 from Mycobacterium leprae (136 aa), FASTA scores: opt: 762, E(): 1.5e-42, (87.4% identity in 135 aa overlap); and O85501|NDK from Mycobacterium smegmatis (139 aa), FASTA scores: opt: 714, E(): 1.9e-39, (80.7% identity in 135 aa overlap). Also highly similar to others e.g. P50589|NDK_STRCO from Streptomyces coelicolor (137 aa), FASTA scores: opt: 535, 6.8e-28, (60.3% identity in 136 aa overlap); O29491|NDK_ARCFU|AF0767 from Archaeoglobus fulgidus (151 aa), FASTA scores: opt: 521, E(): 5.9e-27, (58.0% identity in 131 aa overlap); P31103|NDK_BACSU from Bacillus subtilis (151 aa), FASTA scores: opt: 515, E(): 1.4e-26, (56.5% identity in 131 aa overlap); etc. BELONGS TO THE NDK FAMILY. Protein product from Mb2472c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2472c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P84283" /db_xref="InterPro:IPR001564" /db_xref="InterPro:IPR034907" /db_xref="InterPro:IPR036850" /db_xref="UniProtKB/Swiss-Prot:P84283" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01087.1" /translation="MTERTLVLIKPDGIERQLIGEIISRIERKGLTIAALQLRTVSAE LASQHYAEHEGKPFFGSLLEFITSGPVVAAIVEGTRAIAAVRQLAGGTDPVQAAAPGT IRGDFALETQFNLVHGSDSAESAQREIALWFPGA" CDS complement(2718341..2718712) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2473C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2473c, -, len: 123 aa. Equivalent to Rv2446c, len: 123 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 123 aa overlap). Probable conserved integral membrane protein, highly similar to Q9CBY9|ML1470 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (123 aa), FASTA scores: opt: 468, E(): 6.7e-23, (66.65% identity in 108 aa overlap). Also similar to Q9L1G5|SCC88.24c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (118 aa), FASTA scores: opt: 130, E(): 0.13, (37.2% identity in 86 aa overlap); and some similarity to O06852|Y13070 hypothetical Streptomyces coelicolor gene also between fpgs and ndk genes (see citation below) (117 aa), FASTA scores: opt: 128, E(): 0.17, (36.0% identity in 86 aa overlap). Mb2473c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y174" /db_xref="InterPro:IPR025327" /db_xref="UniProtKB/TrEMBL:A0A1R3Y174" /protein_id="SIU01088.1" /translation="MTDRSREPADPWKGFSAVMAATLILEAIVVLLAIPVVDAVGGGL RPASLGYLVGLAVLLILLTGLQRRPWAIWVNLGAQPVLVAGFAVYPGVGFIGVLFAAL WVLIAYLRAEVRRRRDYRVSQ" CDS complement(2718709..2720172) /codon_start=1 /transl_table=11 /gene="folC" /locus_tag="BQ2027_MB2474C" /product="PROBABLE FOLYLPOLYGLUTAMATE SYNTHASE PROTEIN FOLC (FOLYLPOLY-GAMMA-GLUTAMATE SYNTHETASE) (FPGS)" /note="Mb2474c, folC, len: 487 aa. Equivalent to Rv2447c, len: 487 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 487 aa overlap). Probable folC, folylpolyglutamate synthase (EC 6.3.2.17), equivalent to Q9CBY8|FOLC|ML1471 from Mycobacterium leprae (485 aa), FASTA scores: opt: 2425, E(): 2.2e-134, (78.7% identity in 483 aa overlap). Also highly similar to others e.g. Q9L1G4|FPGS|O08416|Y13070 from Streptomyces coelicolor (444 aa), FASTA scores: opt: 774, E(): 6.3e-38, (53.9% identity in 462 aa overlap); P15925|FOLC_LACCA|FGS from Lactobacillus casei (428 aa), FASTA scores: opt: 631, E(): 1.4e-29, (34.55% identity in 437 aa overlap); Q05865|FOLC_BACSU from Bacillus subtilis (430 aa), FASTA scores: opt: 421, E(): 2.6e-17, (32.9% identity in 383 aa overlap); etc. Contains PS01012 Folylpolyglutamate synthase signature 2. BELONGS TO THE FOLYLPOLYGLUTAMATE SYNTHASE FAMILY. Protein product from Mb2474c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2474c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y188" /db_xref="InterPro:IPR001645" /db_xref="InterPro:IPR004101" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR018109" /db_xref="InterPro:IPR036565" /db_xref="InterPro:IPR036615" /db_xref="UniProtKB/TrEMBL:A0A1R3Y188" /protein_id="SIU01089.1" /translation="MNSTNSGPPDSGSATGVVPTPDEIASLLQVEHLLDQRWPETRID PSLTRISALMDLLGSPQRSYPSIHIAGTNGKTSVARMVDALVTALHRRTGRTTSPHLQ SPVERISIDGKPISPAQYVATYREIEPLVALIDQQSQASAGKGGPAMSKFEVLTAMAF AAFADAPVDVAVVEVGMGGRWDATNVINAPVAVITPISIDHVDYLGADIAGIAGEKAG IITRAPDGSPDTVAVIGRQVPKVMEVLLAESVRADASVAREDSEFAVLRRQIAVGGQV LQLQGLGGVYSDIYLPLHGEHQAHNAVLALASVEAFFGAGAQRQLDGDAVRAGFAAVT SPGRLERMRSAPTVFIDAAHNPAGASALAQTLAHEFDFRFLVGVLSVLGDKDVDGILA ALEPVFDSVVVTHNGSPRALDVEALALAAGERFGPDRVRTAENLRDAIDVATSLVDDA AADPDVAGDAFSRTGIVITGSVVTAGAARTLFGRDPQ" CDS complement(2720169..2722799) /codon_start=1 /transl_table=11 /gene="valS" /locus_tag="BQ2027_MB2475C" /product="PROBABLE VALYL-tRNA SYNTHASE PROTEIN VALS (VALYL-tRNA SYNTHETASE) (VALINE--tRNA LIGASE) (VALINE TRANSLASE)" /note="Mb2475c, valS, len: 876 aa. Equivalent to Rv2448c, len: 876 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 876 aa overlap). Probable valS, valyl-tRNA synthetases (EC 6.1.1.9), equivalent to Q9CBY7|VALS|ML1472 VALYL-TRNA SYNTHASE from Mycobacterium leprae (886 aa), FASTA scores: opt: 5181,E(): 0, (85.4% identity in 876 aa overlap). Also highly similar to others e.g. O06851|SYV_STRCO from Streptomyces coelicolor (874 aa), FASTA scores: opt: 2470, E(): 1.6e-143, (60.45% identity in 880 aa overlap); Q9X2D7|SYV_THEMA|VALS|TM1817 from Thermotoga maritima (865 aa), FASTA scores: opt: 2418, E(): 2.4e-140, (44.2% identity in 891 aa overlap); Q05873|SYV_BACSU|VALS from Bacillus subtilis (880 aa), FASTA scores: opt: 2063, E(): 1.4e-118, (46.08% identity in 894 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. Contains probable coiled-coil from aa 810 to 846. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb2475c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2475c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67600" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002300" /db_xref="InterPro:IPR002303" /db_xref="InterPro:IPR009008" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR010978" /db_xref="InterPro:IPR013155" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR019499" /db_xref="InterPro:IPR033705" /db_xref="InterPro:IPR037118" /db_xref="UniProtKB/Swiss-Prot:P67600" /protein_id="SIU01090.1" /translation="MLPKSWDPAAMESAIYQKWLDAGYFTADPTSTKPAYSIVLPPPN VTGSLHMGHALEHTMMDALTRRKRMQGYEVLWQPGTDHAGIATQSVVEQQLAVDGKTK EDLGRELFVDKVWDWKRESGGAIGGQMRRLGDGVDWSRDRFTMDEGLSRAVRTIFKRL YDAGLIYRAERLVNWSPVLQTAISDLEVNYRDVEGELVSFRYGSLDDSQPHIVVATTR VETMLGDTAIAVHPDDERYRHLVGTSLAHPFVDRELAIVADEHVDPEFGTGAVKVTPA HDPNDFEIGVRHQLPMPSILDTKGRIVDTGTRFDGMDRFEARVAVRQALAAQGRVVEE KRPYLHSVGHSERSGEPIEPRLSLQWWVRVESLAKAAGDAVRNGDTVIHPASMEPRWF SWVDDMHDWCISRQLWWGHRIPIWYGPDGEQVCVGPDETPPQGWEQDPDVLDTWFSSA LWPFSTLGWPDKTAELEKFYPTSVLVTGYDILFFWVARMMMFGTFVGDDAAITLDGRR GPQVPFTDVFLHGLIRDESGRKMSKSKGNVIDPLDWVEMFGADALRFTLARGASPGGD LAVSEDAVRASRNFGTKLFNATRYALLNGAAPAPLPSPNELTDADRWILGRLEEVRAE VDSAFDGYEFSRACESLYHFAWDEFCDWYLELAKTQLAQGLTHTTAVLAAGLDTLLRL LHPVIPFLTEALWLALTGRESLVSADWPEPSGISVDLVAAQRINDMQKLVTEVRRFRS DQGLADRQKVPARMHGVRDSDLSNQVAAVTSLAWLTEPGPDFEPSVSLEVRLGPEMNR TVVVELDTSGTIDVAAERRRLEKELAGAQKELASTAAKLANADFLAKAPDAVIAKIRD RQRVAQQETERITTRLAALQ" CDS complement(2722887..2724146) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2476C" /product="putative membrane protein" /note="Mb2476c, -, len: 419 aa. Equivalent to Rv2449c, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 419 aa overlap). Conserved hypothetical protein, highly similar to hypothetical proteins e.g. P95139|Rv2953|MTCY349.37c from M. tuberculosis (418 aa), FASTA scores: opt: 1829, E(): 4.7e-103, (67.3% identity in 419 aa overlap); AAK47353|MT3027 from Mycobacterium tuberculosis strain CDC1551 (418 aa), FASTA score: opt: 1829, E(): 4.7e-103, (67.3 identity in 419 aa overlap); Q9CD87|ML0129 from Mycobacterium leprae (418 aa), FASTA scores: opt: 1727, E(): 6.8e-97, (65.45% identity in 414 aa overlap); etc. Protein product from Mb2476c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2476c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y194" /db_xref="InterPro:IPR005097" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y194" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01091.1" /translation="MTATPREFDIVLYGATGFVGKLTAEYLARAGGDARIALAGRSTQ RVLAVREALGESAQTWPILTADASLPSTLQAMAARAQVVVTTVGPYTRYGLPLVAACA AAGTDYADLTGEPMFMRNSIDLYHKQAADTGARIVHACGFDSVPSDLSVYALYHAARE DGAGELTDTNCVVRSFKGGFSGGTIASMLEVLSTASNDPDARRQLSDPYMLSPDRGAE PELGPQPDLPSRRGRRLAPELAGVWTAGFIMAPTNTRIVRRSNALLDWAYGRRFRYSE TMSVGSTVLAPVVSVVGGGVGNAMFGLASRYIRLLPRGLVKRVVPKPGTGPSAAARER GYYRIETYTTTTTGARYLARMAQDGDPGYKATSVLLGECGLALALDRDKLSDMRGVLT PAAAMGDALLERLPAAGVSLQTTRLAS" CDS complement(2724236..2724754) /codon_start=1 /transl_table=11 /gene="rpfE" /locus_tag="BQ2027_MB2477C" /product="PROBABLE RESUSCITATION-PROMOTING FACTOR RPFE" /note="Mb2477c, rpfE, len: 172 aa. Equivalent to Rv2450c, len: 172 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 172 aa overlap). Probable rpfE, resuscitation-promoting factor (see first citation below), similar to O86308|Z96935|MLRPF_1 RPF PROTEIN PRECURSOR from Micrococcus luteus (220 aa), FASTA scores: opt: 291, E(): 3e-7, (48.75% identity in 80 aa overlap). C-terminus is similar to other Mycobacterial rpf proteins e.g. O05594|Rv1009|MTCI237.26|RPFB PROBABLE RESUSCITATION-PROMOTING FACTOR from Mycobacterium tuberculosis (362 aa), FASTA scores: opt: 344, E(): 1.4e-09, (42.85% identity in 147 aa overlap); etc. C-terminal region similar to N-terminal region of Q9F2Q2|SCE41.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 355, E(): 3.1e-10, (56.65% identity in 90 aa overlap). Also similar to Q9F2Q1|SCE41.07c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (near Q9F2Q2|SCE41.06c) (341 aa) FASTA scores: opt: 317, E(): 2.5e-08, (51.7% identity in 87 aa overlap). With Mycobacterium leprae, high similarity between the two corresponding C-terminal regions of two HYPOTHETICAL PROTEINS, Q9CD53|ML0240 (375 aa), FASTA scores: opt: 339, E(): 2.5e-09, (59.15% identity in 93 aa overlap) and O33049|MLCB57.05c|ML2151 (174 aa), FASTA scores: opt: 329, E(): 4e-09, (58.14% identity in 86 aa overlap). Contains a possible secretory signal sequence in N-terminus. Possible autocrine and/or paracrine bacterial growth factor or cytokine (see citation below). Protein product from Mb2477c detected using SWATH mass spectrometry. Mb2477c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010618" /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y199" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01092.1" /translation="MKNARTTLIAAAIAGTLVTTSPAGIANADDAGLDPNAAAGPDAV GFDPNLPPAPDAAPVDTPPAPEDAGFDPNLPPPLAPDFLSPPAEEAPPVPVAYSVNWD AIAQCESGGNWSINTGNGYYGGLQFTAGTWRANGGSGSAANASREEQIRVAENVLRSQ GIRAWPVCGRRG" CDS 2724836..2725234 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2478" /product="HYPOTHETICAL PROLINE AND SERINE RICH PROTEIN" /note="Mb2478, -, len: 132 aa. Equivalent to Rv2451, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 132 aa overlap). Hypothetical unknown pro-, ser-rich protein." /db_xref="UniProtKB/TrEMBL:A0A1R3Y170" /protein_id="SIU01093.1" /translation="MGRAVSVRHGSGALDLPGAAASRRLRVGQPIQPSPAPLARGSVD SIVEISCCPSAGPRGPYDDDLDSSSPANRDISSITSRSRRGGTIVVAGQKCGFGSAVS LRPRRYREPNHANIVTPDTDLSPSWPWSGI" CDS complement(2725422..2725568) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2479C" /product="HYPOTHETICAL PROTEIN" /note="Mb2479c, -, len: 48 aa. Equivalent to Rv2452c, len: 48 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 48 aa overlap). Hypothetical unknown protein. Mb2479c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E7" /protein_id="SIU01094.1" /translation="MAFRDILVLFSMKTLLTLAMAAASSTALTTVGVSGARLITYCVG VEDI" CDS complement(2725592..2726197) /codon_start=1 /transl_table=11 /gene="mobA" /locus_tag="BQ2027_MB2480C" /product="PROBABLE MOLYBDOPTERIN-GUANINE DINUCLEOTIDE BIOSYNTHESIS PROTEIN A MOBA" /note="Mb2480c, mobA, len: 201 aa. Equivalent to Rv2453c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Probable mobA, molybdopterin-guanine dinucleotide biosynthesis protein A, similar to others e.g. Q9F8G7 from Carboxydothermus hydrogenoformans (224 aa), FASTA scores: opt: 249, E(): 3.9e-08, (30.6% identity in 173 aa overlap); P95645|MOBA_RHOSH|MOB|Y09560 from Rhodobacter sphaeroides (199 aa), FASTA scores: opt: 240, E(): 1.2e-07, (33.9% identity in 186 aa overlap); Q9X7K0|MOBA_RHOCA from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (191 aa), FASTA scores: opt: 217, E(): 2.9e-06, (37.4% identity in 123 aa overlap); etc. BELONGS TO THE MOBA FAMILY. Protein product from Mb2480c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2480c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65403" /db_xref="InterPro:IPR013482" /db_xref="InterPro:IPR025877" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P65403" /protein_id="SIU01095.1" /translation="MAELAPDTVPLAGVVLAGGESRRMGRDKATLPLPGGTTTLVEHM VGILGQRCAPVFVMAAPGQPLPTLPVPVLRDELPGLGPLPATGRGLRAAAEAGVRLAF VCAVDMPYLTVELIEDLARRAVQTDAEVVLPWDGRNHYLAAVYRTDLADRVDTLVGAG ERKMSALVDASDALRIVMADSRPLTNVNSAAGLHAPMQPGR" CDS complement(2726199..2727320) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2481C" /product="PROBABLE OXIDOREDUCTASE (BETA SUBUNIT)" /note="Mb2481c, -, len: 373 aa. Equivalent to Rv2454c, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 373 aa overlap). Probable oxidoreductase, beta subunit (EC 1.-.-.-), similar to Q9F2W7|SCD20.12c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (352 aa), FASTA scores: opt: 1461, E(): 6.4e-85, (65.3% identity in 343 aa overlap) alias Q9RKS5|STAH10.34c PUTATIVE OXIDOREDUCTASE BETA-SUBUNIT from Streptomyces coelicolor (350 aa), FASTA scores: opt: 1429, E(): 6.7e-83, (64.0% identity in 342 aa overlap); and similar in part to others e.g. Q9Z5X3 FERREDOXIN OXIDOREDUCTASE B-SUBUNIT from Frankia sp. (346 aa), FASTA scores: opt: 1143, E(): 7.5e-65, (51.2% identity in 336 aa overlap); BAB21495|KORB FERREDOXIN OXIDOREDUCTASE BETA SUBUNIT from Hydrogenobacter thermophilus TK-6 (295 aa), FASTA scores: opt: 682, E(): 8.3e-36, (48.25% identity in 201 aa overlap); etc. Note that the upstream ORF (MTV008.11c|Rv2455c) is possibly an oxidoreductase alpha subunit. Protein product from Mb2481c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2481c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1H5" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H5" /protein_id="SIU01096.1" /translation="MTRSGDEAQLMTGVTGDLAGTELGLTPSLTKNAGVPTTDQPQKG KDFTSDQEVRWCPGCGDYVILNTIRNFLPELGLRRENIVFISGIGCSSRFPYYLETYG FHSIHGRAPAIATGLALAREDLSVWVVTGDGDALSIGGNHLIHALRRNINVTILLFNN RIYGLTKGQYSPTSEVGKVTKSTPMGSLDHPFNPVSLALGAEATFVGRALDSDRNGLT EVLRAAAQHRGAALVEILQDCPIFNDGSFDALRKEGAEERVIKVRHGEPIVFGANGEY CVVKSGFGLEVAKTADVAIDEIIVHDAQVDDPAYAFALSRLSDQNLDHTVLGIFRHIS RPTYDDAARSQVVAARNAAPSGTAALQSLLHGRDTWTVD" CDS complement(2727317..2729278) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2482C" /product="PROBABLE OXIDOREDUCTASE (ALPHA SUBUNIT)" /note="Mb2482c, -, len: 653 aa. Equivalent to Rv2455c, len: 653 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 653 aa overlap). Probable oxidoreductase, alpha subunit (EC 1.-.-.-), similar to others e.g. Q9F2W6|SCD20.13c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (645 aa), FASTA scores: opt: 2017, E(): 1e-111, (66.45% identity in 617 aa overlap) alias Q9RKS4|STAH10.35c PUTATIVE OXIDOREDUCTASE ALPHA-SUBUNIT from Streptomyces coelicolor (630 aa), FASTA scores: opt: 2008, E(): 3.4e-111, (66.45% identity in 614 aa overlap); Q9YA13|APE2126 LONG HYPOTHETICAL 2-OXOACID--FERREDOXIN OXIDOREDUCTASE ALPHA CHAIN from Aeropyrum pernix (644 aa) FASTA scores: opt: 687, E(): 4.6e-33, (33.35% identity in 441 aa overlap); etc. Note that the downstream ORF (MTV008.10c|Rv2454c) is possibly an oxidoreductase beta subunit. Protein product from Mb2482c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2482c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1M4" /db_xref="InterPro:IPR002869" /db_xref="InterPro:IPR002880" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR019752" /db_xref="InterPro:IPR022367" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR033412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1M4" /protein_id="SIU01097.1" /translation="MDPNGSGAGPESHDAAFHAAPDRQRLENVVIRFAGDSGDGMQLT GDRFTSEAALFGNDLATQPNYPAEIRAPAGTLPGVSSFQIQIADYDILTAGDRPDVLV AMNPAALKANIGDLPLGGMVIVNSDEFTKRNLTKVGYVTNPLESGELSDYVVHTVAMT TLTLGAVEAIGASKKDGQRAKNMFALGLLSWMYGRELEHSEAFIREKFARKPEIAEAN VLALKAGWNYGETTEAFGTTYEIPPATLPPGEYRQISGNTALAYGIVVAGQLAGLPVV LGSYPITPASDILHELSKHKNFNVVTFQAEDEIGGICAALGAAYGGALGVTSTSGPGI SLKSEALGLGVMTELPLLVIDVQRGGPSTGLPTKTEQADLLQALYGRNGESPVAVLAP RSPADCFETALEAVRIAVSYHTPVILLSDGAIANGSEPWRIPDVNALPPIKHTFAKPG EPFQPYARDRETLARQFAIPGTPGLEHRIGGLEAANGSGDISYEPTNHDLMVRLRQAK IDGIHVPDLEVDDPTGDAELLLIGWGSSYGPIGEACRRARRRGTKVAHAHLRYLNPFP ANLGEVLRRYPKVVAPELNLGQLAQVLRGKYLVDVQSVTKVKGVSFLADEIGRFIRAA LAGRLAELEQDKTLVARLSAATAGAGANG" CDS complement(2729510..2730766) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2483C" /product="probable conserved integral membrane transport protein" /note="Mb2483c, -, len: 418 aa. Equivalent to Rv2456c, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 418 aa overlap). Probable conserved integral membrane transport protein, involved in a efflux system, weakly similar to many e.g. Q9RUR0|YD22_DEIRA|DR1322 PUTATIVE SUGAR EFFLUX TRANSPORTER from Deinococcus radiodurans (389 aa), FASTA scores: opt: 224, E(): 8.4e-06, (24.45% identity in 409 aa overlap); Q9UYY0|PAB0913 MULTIDRUG RESISTANCE PROTEIN from Pyrococcus abyssi (410 aa), FASTA scores: opt: 210, E(): 5.6e-05, (21.8% identity in 408 aa overlap); etc. Contains PS00216 Sugar transport proteins signature 1. Mb2483c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y183" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y183" /protein_id="SIU01098.1" /translation="MSGTVVAVPPRVARALDLLNFSLADVRDGLGPYLSIYLLLIHDW DQASIGFVMAVGGIAAIVAQTPIGALVDRTTAKRALVVAGAVLVTAAAVAMPLFAGLY SISVLQAVTGIASSVFAPALAAITLGAVGPQFFARRIGRNEAFNHAGNASAAGATGAL AYFFGPVVVFWVLAGMALISVLATLRIPPDAVDHDLARGMDHAPGEPHPQPSRFTVLA HNRELVIFGAAVVAFHFANAAMLPLVGELLALHNRDEGTALMSSCIVAAQVVMVPVAY VVGTRADAWGRKPIFLVGFAVLTARGFLYTLSDNSYWLVGVQLLDGIGAGIFGALFPL VVQDVTHGTGHFNISLGAVTTATGIGAALSNLVAGWIVVVAGYDAAFMSLGALAGAGF LLYLVAMPETVDSDVRVRSRPTLGGK" CDS complement(2730782..2732062) /codon_start=1 /transl_table=11 /gene="clpX" /locus_tag="BQ2027_MB2484C" /product="PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT CLPX" /note="Mb2484c, clpX, len: 426 aa. Equivalent to Rv2457c, len: 426 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 426 aa overlap). Probable clpX, ATP-dependent clp protease ATP-binding subunit clpX (EC 3.4.-.-), equivalent to Q9CBY6|CLPX|ML1477 ATP-DEPENDENT CLP PROTEASE ATP-BINDING PROTEIN from Mycobacterium leprae (426 aa), FASTA scores: opt: 2652, E(): 1.4e-142, (96.0% identity in 426 aa overlap). Also highly similar to others e.g. Q9F316|CLPX from Streptomyces coelicolor (428 aa) FASTA scores: opt: 2178, E(): 8.2e-116, (77.8% identity in 428 aa overlap); P50866|CLPX_BACSU from Bacillus subtilis (420 aa), FASTA scores: opt: 1788, E(): 8.5e-94, (63.6% identity in 426 aa overlap); P33138|CLPX_ECOLI from Escherichia coli (423 aa), FASTA scores: opt: 1694, E(): 1.7e-88, (62.4% identity in 415 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CLPX CHAPERONE FAMILY. Protein product from Mb2484c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2484c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A529" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR004487" /db_xref="InterPro:IPR010603" /db_xref="InterPro:IPR019489" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR038366" /db_xref="UniProtKB/Swiss-Prot:P0A529" /protein_id="SIU01099.1" /translation="MARIGDGGDLLKCSFCGKSQKQVKKLIAGPGVYICDECIDLCNE IIEEELADADDVKLDELPKPAEIREFLEGYVIGQDTAKRTLAVAVYNHYKRIQAGEKG RDSRCEPVELTKSNILMLGPTGCGKTYLAQTLAKMLNVPFAIADATALTEAGYVGEDV ENILLKLIQAADYDVKRAETGIIYIDEVDKIARKSENPSITRDVSGEGVQQALLKILE GTQASVPPQGGRKHPHQEFIQIDTTNVLFIVAGAFAGLEKIIYERVGKRGLGFGAEVR SKAEIDTTDHFADVMPEDLIKFGLIPEFIGRLPVVASVTNLDKESLVKILSEPKNALV KQYIRLFEMDGVELEFTDDALEAIADQAIHRGTGARGLRAIMEEVLLPVMYDIPSRDD VAKVVVTKETVQDNVLPTIVPRKPSRSERRDKSA" CDS 2732353..2733261 /codon_start=1 /transl_table=11 /gene="mmuM" /locus_tag="BQ2027_MB2485" /product="PROBABLE HOMOCYSTEINE S-METHYLTRANSFERASE MMUM (S-METHYLMETHIONINE:HOMOCYSTEINE METHYLTRANSFERASE) (CYSTEINE METHYLTRANSFERASE)" /note="Mb2485, mmuM, len: 302 aa. Equivalent to Rv2458, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Probable mmuM, homocysteine S-methyltransferase (EC 2.1.1.10), equivalent to Q9CBY5|ML1478 POSSIBLE TRANSFERASE from Mycobacterium leprae (293 aa), FASTA scores: opt: 1507, E(): 2.7e-86, (78.85% identity in 293 aa overlap). Also similar to others e.g. Q47690|MMUM_ECOLI|B0261 HOMOCYSTEINE S-METHYLTRANSFERASE from Escherichia coli strain K12 (310 aa), FASTA scores: opt: 863, E(): 2.4e-46, (47.65% identity in 298 aa overlap); Q9FUM7 HOMOCYSTEINE S-METHYLTRANSFERASE-4 from Zea mays (Maize) (342 aa), FASTA scores: opt: 324, E(): 6.8e-13, (44.45% identity in 306 aa overlap); Q9LUI7|HMT3 CYSTEINE METHYLTRANSFERASE from Arabidopsis thaliana (Mouse-ear cress) (347 aa), FASTA scores: opt: 312, E(): 3.8e-12, (41.85% identity in 313 aa overlap); etc. Identical to AAK46833|MT2533 HOMOCYSTEINE S-METHYLTRANSFERASE from Mycobacterium tuberculosis strain CDC1551 (302 aa). Protein product from Mb2485 detected using SWATH mass spectrometry. Mb2485 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1A0" /db_xref="InterPro:IPR003726" /db_xref="InterPro:IPR017226" /db_xref="InterPro:IPR036589" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1A0" /protein_id="SIU01100.1" /translation="MELVSDSVLISDGGLATELEARGHDLSDPLWSARLLVDAPHAIT AVHTAYFRAGAQIATTASYQASFEGFAARGIGHDDATVLLRRSVELAQAARDEVGVGG LSVAASVGPYGAALADGSEYRGCYGLSVAALMKWHLPRLEVLVDAGADMLALETIPDI DEAEALVNLVRRLATPAWLSYTINGTRTRAGQPLTDAFAVAAGVPEIVAVGVNCCAPD DVLPAIAFAVAHTGKPVIVYPNSGEGWDGRRRAWVGPRRFSGSSGQLAREWVAAGARI VGGCCRVRPIDIAEIGRALTTAPPRG" CDS 2733428..2734954 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2486" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb2486, -, len: 508 aa. Equivalent to Rv2459, len: 508 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 508 aa overlap). Probable conserved integral membrane transport protein, member of major facilitator superfamily (MFS) possibly involved in drug transport, highly similar to many efflux proteins e.g. Q9RL22|SC5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 788, E(): 1.3e-38, (34.45% identity in 412 aa overlap); Q9I428|PA1316 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (513 aa), FASTA scores: opt: 782, E(): 3.1e-38, (32.75% identity in 519 aa overlap); P39886|TCMA_STRGA tetracenomycin C resistance and export protein from Streptomyces glaucescens (538 aa), FASTA scores: opt: 752, E(): 1.8e-36, (31.7% identity in 511 aa overlap); etc. Also highly similar to AAK46687|MT2395 DRUG TRANSPORTER from Mycobacterium tuberculosis strain CDC1551 (537 aa), FASTA scores: opt: 1396, E(): 5.6e-74, (44.45% identity in 504 aa overlap); and P71879|Rv2333c|MTCY3G12.01 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (537 aa), FASTA scores: opt: 1385, E(): 2.5e-73, (44.25% identity in 504 aa overlap). Mb2486 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1A4" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1A4" /protein_id="SIU01101.1" /translation="MTPRQRLTVLATGLGIFMVFVDVNIVNVALPSIQKVFHTGEQGL QWAVAGYSLGIAAVLMSCALLGDRYGRRRSFVFGVTLFVVSSIVCVLPVSLAVFTVAR VIQGLGAAFISVLSLALLSHSFPNPRMKARAISNWMAIGMVGAASAPALGGLMVDGLG WRSVFLVNVPLGAIVWLLTLVGVDESQDPEPTQLDWVGQLTLIPAVALIAYTIIEAPR FDRQSAGFVAALLLAAGVLLWLFVRHEHRAAFPLVDLKLFAEPLYRSVLIVYFVVMSC FFGTLMVITQHFQNVRDLSPLHAGLMMLPVPAGFGVASLLAGRAVNKWGPQLPVLTCL AAMFIGLAIFAISMDHAHPVALVGLTIFGAGAGGCATPLLHLGMTKVDDGRAGMAAGM LNLQRSLGGIFGVAFLGTIVAAWLGAALPNTMADEIPDPIARAIVVDVIVDSANPHAH AAFIGPGHRITAAQEDEIVLAADAVFVSGIKLALGGAAVLLTGAFVLGWTRFPRTPAS " CDS complement(2735105..2735749) /codon_start=1 /transl_table=11 /gene="clpP2" /locus_tag="BQ2027_MB2487C" /product="PROBABLE ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT 2 CLPP2 (ENDOPEPTIDASE CLP 2)" /note="Mb2487c, clpP2, len: 214 aa. Equivalent to Rv2460c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Probable clpP2, ATP-dependent clp protease proteolytic subunit 2 (EC 3.4.21.92), equivalent to Q9CBY4|CLP2_MYCLE ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT from Mycobacterium leprae (214 aa). Also highly similar to others e.g. Q9ZH58|CLPP2 from Streptomyces coelicolor (236 aa), FASTA scores: opt: 918, E(): 2.1e-50, (66.35% identity in 214 aa overlap); O67357|CLPP_AQUAE|AQ_1339 from Aquifex aeolicus (201 aa), FASTA scores: opt: 680, E(): 1.4e-35, (52.0% identity in 194 aa overlap); P43867|CLPP_HAEIN from Haemophilus influenzae (193 aa), FASTA scores: opt: 662, E(): 1.8e-34, (53.35% identity in 193 aa overlap); etc. Contains PS00381 Endopeptidase Clp serine active site. Also similar to upstream ORF Rv2461c|MTV008.17c|clpP1 (200 aa), FASTA score: (48.3% identity in 172 aa overlap). BELONGS TO PEPTIDASE FAMILY S14, ALSO KNOWN AS CLPP FAMILY. Protein product from Mb2487c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2487c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63784" /db_xref="InterPro:IPR001907" /db_xref="InterPro:IPR018215" /db_xref="InterPro:IPR023562" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR033135" /db_xref="UniProtKB/Swiss-Prot:P63784" /protein_id="SIU01102.1" /translation="MNSQNSQIQPQARYILPSFIEHSSFGVKESNPYNKLFEERIIFL GVQVDDASANDIMAQLLVLESLDPDRDITMYINSPGGGFTSLMAIYDTMQYVRADIQT VCLGQAASAAAVLLAAGTPGKRMALPNARVLIHQPSLSGVIQGQFSDLEIQAAEIERM RTLMETTLARHTGKDAGVIRKDTDRDKILTAEEAKDYGIIDTVLEYRKLSAQTA" CDS complement(2735746..2736348) /codon_start=1 /transl_table=11 /gene="clpP1" /locus_tag="BQ2027_MB2488C" /product="PROBABLE ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT 1 CLPP1 (ENDOPEPTIDASE CLP)" /note="Mb2488c, clpP1, len: 200 aa. Equivalent to Rv2461c, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 200 aa overlap). Probable clpP1, ATP-dependent clp protease proteolytic subunit 1 (EC 3.4.21.92), equivalent to Q9CBY3|CLP1_MYCLE ATP-DEPENDENT CLP PROTEASE PROTEOLYTIC SUBUNIT from Mycobacterium leprae (224 aa), FASTA scores: opt: 1226, E(): 1.3e-71, (95.0% identity in 200 aa overlap). Also highly similar to others e.g. Q9F315|CLPP1 from Streptomyces coelicolor (219 aa), FASTA scores: opt: 713, E(): 9.3e-39, (61.75% identity in 183 aa overlap); P80244|CLPP_BACSU from Bacillus subtilis (197 aa), FASTA scores: opt: 658, E(): 2.8e-35, (54% identity in 187 aa overlap); Q9WZF9|CLPP_THEMA|TM0695 from Thermotoga maritima (203 aa), FASTA scores: opt: 653, E(): 6.1e-35, (55.25% identity in 172 aa overlap); etc. Also similar to downstream ORF Rv2460c|MTV008.16c|clpP2 (214 aa), FASTA score: (48.3% identity in 172 aa overlap). BELONGS TO PEPTIDASE FAMILY S14, ALSO KNOWN AS CLPP FAMILY. Note that previously known as clp. Protein product from Mb2488c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2488c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A527" /db_xref="InterPro:IPR001907" /db_xref="InterPro:IPR023562" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR033135" /db_xref="UniProtKB/Swiss-Prot:P0A527" /protein_id="SIU01103.1" /translation="MSQVTDMRSNSQGLSLTDSVYERLLSERIIFLGSEVNDEIANRL CAQILLLAAEDASKDISLYINSPGGSISAGMAIYDTMVLAPCDIATYAMGMAASMGEF LLAAGTKGKRYALPHARILMHQPLGGVTGSAADIAIQAEQFAVIKKEMFRLNAEFTGQ PIERIEADSDRDRWFTAAEALEYGFVDHIITRAHVNGEAQ" CDS complement(2736465..2737865) /codon_start=1 /transl_table=11 /gene="tig" /locus_tag="BQ2027_MB2489C" /product="PROBABLE TRIGGER FACTOR (TF) PROTEIN TIG" /note="Mb2489c, tig, len: 466 aa. Equivalent to Rv2462c, len: 466 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 466 aa overlap). Probable tig, trigger factor (TF), a chaperone protein, equivalent to Q9CBY2|ML1481 POSSIBLE MOLECULAR CHAPERONE from Mycobacterium leprae (469 aa), FASTA scores: opt: 2171, E(): 7.2e-113, (70.1% identity in 468 aa overlap). Also similar to oyher trigger factors from several organisms e.g. Q9F314|SCC80.05c from Streptomyces coelicolor (468 aa), FASTA scores: opt: 1224, E(): 1.7e-60, (41.8% identity in 469 aa overlap); Q9K8F3|TIG_BACHD from Bacillus halodurans (431 aa), FASTA scores: opt: 675, E(): 3.6e-30, (28.5% identity in 421 aa overlap); P22257|TIG_ECOLI from Escherichia coli (432 aa), FASTA scores: opt: 493, E(): 4.2e-20, (23.35% identity in 433 aa overlap); etc. BELONGS TO THE FKBP-TYPE PPIASE FAMILY, TIG SUBFAMILY. Protein product from Mb2489c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2489c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYJ1" /db_xref="InterPro:IPR005215" /db_xref="InterPro:IPR008880" /db_xref="InterPro:IPR008881" /db_xref="InterPro:IPR027304" /db_xref="InterPro:IPR036611" /db_xref="InterPro:IPR037041" /db_xref="UniProtKB/Swiss-Prot:Q7TYJ1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01104.1" /translation="MKSTVEQLSPTRVRINVEVPFAELEPDFQRAYKELAKQVRLPGF RPGKAPAKLLEARIGREAMLDQIVNDALPSRYGQAVAESDVQPLGRPNIEVTKKEYGQ DLQFTAEVDIRPKISLPDLSALTVSVDPIEIGEDDVDAELQSLRTRFGTLTAVDRPVA VGDVVSIDLSATVDGEDIPNAAAEGLSHEVGSGRLIAGLDDAVVGLSADESRVFTAKL AAGEHAGQEAQVTVTVRSVKERELPEPDDEFAQLASEFDSIDELRASLSDQVRQAKRA QQAEQIRNATIDALLEQVDVPLPESYVQAQFDSVLHSALSGLNHDEARFNELLVEQGS SRAAFDAEARTASEKDVKRQLLLDALADELQVQVGQDDLTERLVTTSRQYGIEPQQLF GYLQERNQLPTMFADVRRELAIRAAVEAATVTDSDGNTIDTSEFFGKRVSAGEAEEAE PADEGAARAASDEATT" tRNA complement(2737905..2737977) /locus_tag="BQ2027_PROU" /product="tRNA-Pro" /note="proU, len: 74 nt. Equivalent to proU, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Pro; anticodon tgg." tRNA 2738115..2738185 /locus_tag="BQ2027_GLYV" /product="tRNA-Gly" /note="glyV, len: 71 nt. Equivalent to glyV, len: 71 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 nt overlap). tRNA-Gly; anticodon tcc." CDS 2738229..2739413 /codon_start=1 /transl_table=11 /gene="lipP" /locus_tag="BQ2027_MB2490" /product="PROBABLE ESTERASE/LIPASE LIPP" /note="Mb2490, lipP, len: 394 aa. Equivalent to Rv2463, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 394 aa overlap). Probable lipP, esterase (EC 3.1.-.-), lipase similar to others eg O87861|ESTA ESTERASE A from Streptomyces chrysomallus (389 aa), FASTA scores: opt: 964, E(): 1.9e-53, (44.35% identity in 399 aa overlap); Q9I4S7|PA1047 PROBABLE ESTERASE from Pseudomonas aeruginosa (392 aa), FASTA scores: opt: 863, E(): 4.6e-47, (40.05% identity in 377 aa overlap); Q53403|ESTC ESTERASE III from Pseudomonas fluorescens (382 aa), FASTA scores: opt: 753, E(): 3.9e-40, (36.3% identity in 380 aa overlap); etc. Protein product from Mb2490 detected using SWATH mass spectrometry. Mb2490 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y234" /protein_id="SIU01105.1" /translation="MNQPDIKGSCASEFTKVRDAFERNFVLRNEVGAAVAVWVDGDLV VNLWGGSADAGGTRPWQHDTLATVLSGTKALTATCVHQLVDRGELDLHAPVARYWPEF GQAGKQAITLAMVMSHRSGAIGPRGRLGWEQVADWDFVCEQLAAAEPWWQPGAAQGYH MTTFGFILGEVFRRVTGRTVGQYLRTEIAEPLGADVHIGLHPGEQLRCADLVDKPHIR QLLADVQAPGYPTSLNEHPKAALSVSMGFAPDDELGSNDLQLWRQIEFPGTNGQVSAL GLATFYNGLAQEKLLSREHMELVRVSQGGFDTDLVLGPRVADHGWGLGYMLNQRGVNG PNPRIFGHGGLGGSFGFVDLEHRIGYAYVMNRFDATKANADPRSVVLSNEVYAALGVN RS" CDS complement(2739433..2740239) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2491C" /product="POSSIBLE DNA GLYCOSYLASE" /note="Mb2491c, -, len: 268 aa. Equivalent to Rv2464c, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 268 aa overlap). Possible DNA glycosylase (EC 3.2.2.-), showing some similarity to several other DNA glycosylases e.g. Q9F308|SCC80.11c PUTATIVE DNA REPAIR HYDROLASE (FRAGMENT) from Streptomyces coelicolor (306 aa), FASTA scores: opt: 894, E(): 6.1e-51, (51.05% identity in 282 aa overlap); O50606|MUTM|FPG_THETH FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE (EC 3.2.2.23) from Thermus aquaticus (267 aa), FASTA scores: opt: 342, E(): 4.6e-15, (32.4% identity in 250 aa overlap); Q9RCW5|SCM10.34c PUTATIVE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Streptomyces coelicolor (287 aa), FASTA scores: opt: 321, E(): 1.1e-13, (29.35% identity in 259 aa overlap); etc. Identical to AAK46839|MT2539 FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Mycobacterium tuberculosis strain CDC1551. Also similar to other M. tuberculosis DNA glycosylases e.g. MTCY71.37 (32.9% identity in 277 aa overlap). BELONGS TO THE FPG FAMILY. Protein product from Mb2491c detected using SWATH mass spectrometry. Mb2491c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64159" /db_xref="InterPro:IPR000214" /db_xref="InterPro:IPR010663" /db_xref="InterPro:IPR010979" /db_xref="InterPro:IPR012319" /db_xref="InterPro:IPR015886" /db_xref="InterPro:IPR015887" /db_xref="InterPro:IPR035937" /db_xref="UniProtKB/Swiss-Prot:P64159" /protein_id="SIU01106.1" /translation="MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVL RRASAWGKHLFHHYVGGPVVHVHLGLYGTFTEWARPTDGWLPEPAGQVRMRMVGAEFG TDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRPIGALLMDQTV IAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLVSLMKVGLRRGKIIVVRPE HDHGLPSYLPDRPRTYVYRRAGEPCRVCGGVIRTALLEGRNVFWCPVCQT" CDS complement(2740245..2740733) /codon_start=1 /transl_table=11 /gene="rpib" /locus_tag="BQ2027_MB2492C" /product="ribose-5-phosphate isomerase" /note="Mb2492c, -, len: 162 aa. Equivalent to Rv2465c, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Probable isomerase (EC 5.-.-.-), equivalent to AAK46840|MT2540 PUTATIVE CARBOHYDRATE-PHOSPHATE ISOMERASE from Mycobacterium tuberculosis strain CDC1551 (159 aa). Equivalent to Q9CBY1|ML1484 POSSIBLE PHOSPHOPENTOSE ISOMERASE from M. leprae (162 aa), FASTA scores: opt: 992, E(): 7.1e-59, (89.5% identity in 162 aa overlap). Also highly similar or similar to several diverse isomerases e.g. Q9L206|SC8E4.02c PUTATIVE ISOMERASE from Streptomyces coelicolor (159 aa), FASTA scores: opt: 661, E(): 6.1e-37, (61.45% identity in 153 aa overlap); P47636|Y396_MYCGE|MG396 HYPOTHETICAL LACA/RPIB FAMILY PROTEIN from Mycoplasma genitalium (152 aa), FASTA scores: opt: 357, E(): 8.2e-17, (42% identity in 150 aa overlap); P53527|Y396_MYCPN|MPN595|MP247 HYPOTHETICAL LACA/RPIB FAMILY PROTEIN from Mycoplasma pneumoniae (152 aa), FASTA scores: opt: 340, E(): 1.1e-15, (38.6% identity in 145 aa overlap); P26592|LACB_STAAU galactose-6-phosphate isomerase from Staphylococcus aureus (171 aa), FASTA scores: opt: 296, E(): 1e-12, (35.4% identity in 158 aa overlap) and P37351|RPIB_ECOLI ribose 5-phosphate isomerase b from Escherichia coli (149 aa), FASTA scores: opt: 262, E(): 1.6e-10, (32.2% identity in 146 aa overlap); etc. COULD BELONG TO THE LACA/RPIB FAMILY. Protein product from Mb2492c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2492c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYI9" /db_xref="InterPro:IPR003500" /db_xref="InterPro:IPR011860" /db_xref="InterPro:IPR036569" /db_xref="UniProtKB/Swiss-Prot:Q7TYI9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01107.1" /translation="MSGMRVYLGADHAGYELKQRIIEHLKQTGHEPIDCGALRYDADD DYPAFCIAAATRTVADPGSLGIVLGGSGNGEQIAANKVPGARCALAWSVQTAALAREH NNAQLIGIGGRMHTVAEALAIVDAFVTTPWSKAQRHQRRIDILAEYERTHEAPPVPGA PA" CDS complement(2740835..2741458) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2493C" /product="Protein disulfide oxidoreductase" /note="Mb2493c, -, len: 207 aa. Equivalent to Rv2466c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Conserved hypothetical protein, equivalent to Q9CBY0|ML1485 HYPOTHETICAL PROTEIN from Mycobacterium leprae (207 aa), FASTA scores: opt: 1154, E(): 1.1e-67, (80.6% identity in 206 aa overlap). Also highly similar to Q9L201|SC8E4A.04c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (216 aa), FASTA scores: opt: 789, E(): 4.6e-44, (57.9% identity in 213 aa overlap). Also similar to AAK46628|MT2344 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (230 aa), FASTA scores: opt: 324, E(): 6.1e-14, (30.4% identity in 194 aa overlap). Contains PS00195 Glutaredoxin active site. Protein product from Mb2493c detected using shotgun mass spectrometry. Mb2493c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y191" /db_xref="InterPro:IPR001853" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y191" /protein_id="SIU01108.1" /translation="MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHV MSLAILNENRDDLPEQYREGMARAWGPVRVAIAAEQAHGAKVLDPLYTAMGNRIHNQG NHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGEDVGTPTIHVNG VAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTRTEPPQFD" CDS 2741560..2744145 /codon_start=1 /transl_table=11 /gene="pepN" /locus_tag="BQ2027_MB2494" /product="PROBABLE AMINOPEPTIDASE N PEPN (LYSYL AMINOPEPTIDASE) (LYS-AP) (ALANINE AMINOPEPTIDASE)" /note="Mb2494, pepN, len: 861 aa. Equivalent to Rv2467, len: 861 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 861 aa overlap). Probable pepN, aminopeptidase N (EC 3.4.11.2), equivalent to Q9CBX9|ML1486 PROBABLE AMINOPEPTIDASE from Mycobacterium leprae (862 aa), FASTA scores: opt: 4751,E(): 0, (83.3% identity in 862 aa overlap). Also highly similar to others e.g. Q11010|AMPN_STRLI|PEPN from Streptomyces lividans (857 aa), FASTA scores: opt: 2839, E(): 1.8e-170, (53.25% identity in 864 aa overlap); Q9L1Z2|PEPN from Streptomyces coelicolor (857 aa), FASTA scores: opt: 2834, E(): 3.8e-170, (53.1% identity in 864 aa overlap); P37896|AMPN_LACDL|PEPN from Lactobacillus delbrueckii (subsp. lactis) (842 aa), FASTA scores: opt: 719, E(): 2.4e-37, (31.65% identity in 439 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M1 (ZINC METALLOPROTEASE), ALSO KNOWN AS THE PEPN SUBFAMILY. Note that previously known as pepD. Protein product from Mb2494 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2494 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1B2" /db_xref="InterPro:IPR001930" /db_xref="InterPro:IPR012778" /db_xref="InterPro:IPR014782" /db_xref="InterPro:IPR024571" /db_xref="InterPro:IPR042097" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B2" /protein_id="SIU01109.1" /translation="MALPNLTRDQAVERAALITVDSYQIILDVTDGNGAPGERTFRST TTVVFDALPGADTVIDISAHTVRRASLNDQDLDVSGYDEAAGIPLRGLAQRNVVVVDA DCHYSNTGEGLHRFVDPVDGETYLYSQFETADAKRMFACFDQPDLKATFDVRVTAPAH WKVISNGAPLAAANGVHTFATTPRMSTYLVALIAGPYAAWTDTYIDDHGEIPLGIYCR ASLAEYMDAERLFTQTKQGFGFYHKHFGLPYAFGKYDQLFVPEFNAGAMENAGAVTFL EDYVFRSKVTRASYERRAETVLHEMAHMWFGDLVTMTWWDDLWLNESFATFASVLCQS EATEFTEAWTTFATVEKSWAYRQDQLPSTHPIAADIPDLAAVEVNFDGITYAKGASVL KQLVAYVGLERFLAGLRDYFRTHAFGNASFDDLLAALEKASGRDLSNWGEQWLKTTGL NTLRPDFEVDAEGRFTRFAVTQSGAAPGAGETRVHRLAVGIYDDDGSKSSGKLVRVHR EELDVSGPITNVPALVGVSRGKLILVNDDDLTYCSLRLDERSLQTALDRIADIAEPLP RTLVWSAAWEMTREAELRARDFVSLVSGGVHAETEVGVAQRLLLQAQTALGCYAEPGW ARERGWPQFADRLLELAREAEPGSDHQLAYINSLCSSVLSPRHVQTLGALLEGEPAAC GLAGLAVDTDLRWRIVTALATAGAIDADGPETPRIDAEVQRDPTAAGKRHAAQARAAR PQFVVKDEAFTTVVEDDTLANATGRAMIAGIAAPGQGELLKPFARRYFQAIPGVWARR SSEVAQSVVIGLYPHWDISEQGITAAEEFLSDPEVPPALRRLVLEGQAAVQRSLRARN FDADG" CDS complement(2744218..2744721) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2495C" /product="conserved protein" /note="Mb2495c, -, len: 167 aa. Equivalent to Rv2468c, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 167 aa overlap). Conserved hypothetical protein, highly similar to Mycobacterium leprae HYPOTHETICAL PROTEINS Q9CC58|ML1255 (163 aa), FASTA scores: opt: 859, E(): 1.6e-49, (81.2% identity in 165 aa overlap) and Q9X7B5|MLCB1610.16 (169 aa), FASTA scores: opt: 859, E(): 1.6e-49, (81.2% identity in 165 aa overlap). Also weak similarity with Q9X8D7|SCE39.14c PUTATIVE GNTR-FAMILY REGULATOR from Streptomyces coelicolor (243 aa), FASTA scores: opt: 116, E(): 1.3, (30.1% identity in 156 aa overlap). Protein product from Mb2495c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2495c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR033437" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B1" /protein_id="SIU01110.1" /translation="MTHRSSRLEVGPVARGDVATIEHAELPPGWVLTTSGRISGVTEP GELSVHYPFPIADLVALDDALTYSSRACQVRFAIYLGDLGRDTAARAREILGKVPTPD NAVLLAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGELVDGLISAIRVL SAGIAPG" CDS complement(2744672..2744905) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2495A" /product="Conserved protein" /note="MB2495A, len: 77 aa. Equivalent to Rv2468A len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Mb2495A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1B3" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B3" /protein_id="SIU01111.1" /translation="MEIHLFFVGIPLLLVVVLSVLIWSRKGPHPATYKLSEPWTHPPI LWAATDEVVGSAHGGHGHDASEFTVGGGASGTW" CDS complement(2744941..2745609) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2496C" /product="HNH endonuclease family protein" /note="Mb2496c, -, len: 222 aa. Equivalent to Rv2469c, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 222 aa overlap). Conserved hypothetical protein, highly similar to other HYPOTHETICAL PROTEINS e.g. Q9X7B4|MLCB1610.15|ML1254 from Mycobacterium leprae (215 aa), FASTA scores: opt: 1183, E(): 3.3e-70, (77.9% identity in 222 aa overlap); Q9L1Y0|SC8E4A.25c from Streptomyces coelicolor (178 aa), FASTA scores: opt: 589, E(): 1.7e-31, (53.4% identity in 161 aa overlap) (N-terminal region is shorter 50 aa approximatively); Q9RRS6|DR2409 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (186 aa), FASTA scores: opt: 440, E(): 9.6e-22, (42.25% identity in 168 aa overlap) (N-terminal region is shorter 30 aa approximatively); etc. Mb2496c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR029471" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B7" /protein_id="SIU01112.1" /translation="MAHGKKRRGHRSSGVAAGVTGPASCLHGVHSHRLASGVETHPPN RHESASIWNRRRVLLLNSTYEPLTALSMRRAIVMVICGKADVVHEDPSGPVIHSSTRS ILVPSVIQLRSYVRVPYRARVPMTRAALMHRDRFCCAYCGGKADTVDHVVPRSRGGAH SWENCVACCSPCNHRKGDRLLTELGWALRRAPLPPTGPHWRLLSAVKELDPSWARYLG EGAA" CDS 2745752..2746138 /codon_start=1 /transl_table=11 /gene="glbO" /locus_tag="BQ2027_MB2497" /product="globin (oxygen-binding protein) glbo" /note="Mb2497, glbO, len: 128 aa. Equivalent to Rv2470, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Possible glbO, globin-like protein, highly similar to Q9CC59|GLBO|ML1253 HEMOGLOBIN-LIKE (OXYGEN CARRIER) from Mycobacterium leprae (128 aa), FASTA scores: opt: 767, E(): 4e-47, (88.1% identity in 126 aa overlap); Q9X7B3|MLCB1610.14c PUTATIVE GLOBIN from Mycobacterium leprae (131 aa); Q9L250|SC6D10.14 PUTATIVE GLOBIN from Streptomyces coelicolor (137 aa), FASTA scores: opt: 466, E(): 5.7e-26, (53.6% identity in 125 aa overlap). Also similar to O31607 YJBI PROTEIN from Bacillus subtilis (132 aa), FASTA scores: opt: 294, E(): 6.6e-14; (39.85% identity in 128 aa overlap). COULD BELONG TO PROTOZOAN/CYANOBACTERIAL GLOBIN FAMILY PROTEIN. Protein product from Mb2497 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2497 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A596" /db_xref="InterPro:IPR001486" /db_xref="InterPro:IPR009050" /db_xref="InterPro:IPR012292" /db_xref="InterPro:IPR019795" /db_xref="UniProtKB/Swiss-Prot:P0A596" /protein_id="SIU01113.1" /translation="MPKSFYDAVGGAKTFDAIVSRFYAQVAEDEVLRRVYPEDDLAGA EERLRMFLEQYWGGPRTYSEQRGHPRLRMRHAPFRISLIERDAWLRCMHTAVASIDSE TLDDEHRRELLDYLEMAAHSLVNSPF" CDS 2746138..2747778 /codon_start=1 /transl_table=11 /gene="aglA" /locus_tag="BQ2027_MB2498" /product="PROBABLE ALPHA-GLUCOSIDASE AGLA (MALTASE) (GLUCOINVERTASE) (GLUCOSIDOSUCRASE) (MALTASE-GLUCOAMYLASE) (LYSOSOMAL ALPHA-GLUCOSIDASE) (ACID MALTASE)" /note="Mb2498, aglA, len: 546 aa. Equivalent to Rv2471, len: 546 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 546 aa overlap). Probable aglA, maltase (alpha-glucosidase) (EC 3.2.1.20), highly similar or similar to several e.g. Q60027|AGLA from Thermomonospora curvata (544 aa), FASTA scores: opt: 2071, E(): 4e-116, (57.7% identity in 525 aa overlap); Q9KZE3|AGLAE from Streptomyces coelicolor (534 aa), FASTA scores: opt: 1475, E(): 1.5e-80, (50.1% identity in 537 aa overlap); O86874|AGLA from Streptomyces lividans (534 aa), FASTA scores: opt: 1473, E(): 2e-80, (50.1% identity in 537 aa overlap); etc. SEEMS TO BELONG TO FAMILY 13 OF GLYCOSYL HYDROLASES, ALSO KNOWN AS THE ALPHA-AMYLASE FAMILY. Mb2498 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3F9" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F9" /protein_id="SIU01114.1" /translation="MDQHQRPDPMGPGSPRASARRPEPDPMGEPWWSRAVFYQVYPRS FADSNGDGVGDLDGLASRLDHLQQLGVDAIWINPVTVSPMADHGYDVADPRDIDPLFG GMPAFERLVAAAHRQGIKVTMDVVPNHTSSAHPWFQAALADLPGSPARDRYFFRDGRG PDGSLPPNNWESVFGGPAWTRVREPDGNPGQWYLHLFDTEQPDLNSDNPEILDDFEKT LRFWLDRGVDGFRIDVAHGMAKPPGLPDSPDLGIEVLHHRDDDPRFNHPNVHAIHRDI RTVIDEYPGAVTVGEVWVHDNARWAEYLRPDELHLGFNFRLARTEFDAAEIRDAVANS LAAAALQNATPTWTLANHDVGREVSRYGGGEIGLRRAKAMAVVMLALPGVVFLYNGQE LGLPDVDLPDEVLQDPTWERSGRTERGRDGCRVPIPWSGNIPPFGFSTCPDTWLPMPP EWAALTAEKQRADAGSTLSFFRLALRLRRERNEFDGDVDWLAAPDDALIFRRHGGGLV CALNAAERPLALPAGEPILASAPLTDATLPPNAAAWLV" CDS 2747846..2748139 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2499" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2499, -, len: 97 aa. Equivalent to Rv2472, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Conserved hypothetical protein, showing some similarity to O53451|Rv1103c|MTV017.56c from Mycobacterium tuberculosis strain H37Rv (106 aa), FASTA scores: opt: 135, E(): 0.026, (45.85% identity in 72 aa overlap); and AAK45393|MT1135 HYPOTHETICAL 11.4 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (78 aa) FASTA scores: opt: 139, E(): 0.011, (45.35% identity in 75 aa overlap). Mb2499 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y242" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01115.1" /translation="MMMRIAVRLPGEVITFVDSEVSQIRIPSRRAAVVLRASNASDAA ILTATEPNHHLDALAGQAAKLAPTSIDAAHPARPARRDPCLYPRTGQALPRTG" CDS 2748142..2748858 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2500" /product="POSSIBLE ALANINE AND PROLINE RICH MEMBRANE PROTEIN" /note="Mb2500, -, len: 238 aa. Equivalent to Rv2473, len: 238 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 238 aa overlap). Possible pro-,ala-rich membrane protein, with possible transmembrane domain around aa 81-104. Protein product from Mb2500 detected using shotgun mass spectrometry. Mb2500 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1J5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J5" /protein_id="SIU01116.1" /translation="MAPTSSSVASELLMPWPSAAASGVVGWRTTATASQRYHRPMSDT PFAEPYPEQRPPWGVPPPGWDGSSRPAPSTTPRSPGRWSLVAALALAVVSLGVGIVGW FHRQPHDKPSPAPSAPTFTSQQISDAKENVCAAHRIVRQAAVLNTNQANPVPGDPTGD LAVAANARLALYSGGDYLLRRLTAEPATPAELRDAVRSLANALQELAVNYLAGAPDSV VTPLRLALERDTRAVDPLCV" CDS complement(2748890..2749543) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2501C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2501c, -, len: 217 aa. Equivalent to Rv2474c, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 217 aa overlap). Hypothetical protein. Shows weak similarity with Q9L246|SC6D10.18c HYPOTHETICAL 24.9 KDA PROTEIN from Streptomyces coelicolor (238 aa), FASTA scores: opt: 111, E(): 5.6, (30% identity in 233 aa overlap), BLASTP scores: Score= 135, E= 3.5e-07, P= 3.5e-07, Identities= 55/182 (30%). Protein product from Mb2501c detected using SWATH mass spectrometry. Mb2501c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR016601" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P6" /protein_id="SIU01117.1" /translation="MVERGLWLPDPAHRADLATFVDHALRLDDVAVIRIRARSTGLLS AWVATGFDVLASRVVAGKVRPDDLSVAARSLAHGLATTDASGYVDPGYSMDSAWRGGL PPESGFTYLDDVPARVMLDLAHRGARLAKEHGSSAGPPVSLLDQEVIQVSSADVVVGL PMRCVFALTAMGFLPQSAETISADELIRVRISPAWLRLDARFGSVYRHRGHAALVLR" CDS complement(2749549..2749965) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2502C" /product="Predicted thioesterase" /note="Mb2502c, -, len: 138 aa. Equivalent to Rv2475c, len: 138 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 138 aa overlap). Conserved hypothetical protein, showing similarity with Q9L245|SC6D10.19c HYPOTHETICAL 16.2 KDA PROTEIN from Streptomyces coelicolor (136 aa), FASTA scores: opt: 236, E(): 1.9e-09, (34.1% identity in 126 aa overlap). Also some similarity with AAK44393|Z97050|MTCI28_3 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis cosmid I (151 aa), FASTA scores: opt: 147, E(): 0.00025, (29.2% identity in 120 aa overlap). Protein product from Mb2502c detected using shotgun mass spectrometry. Mb2502c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1A2" /protein_id="SIU01118.1" /translation="MSVGFVTPVGVRWSDIDMYQHVNHATMVTILEEARVPFLKDAFG ADITSTGLLIADVRVTYKGQLRLSDSPLQVTIWTKRLRAVDFTLGYEVRSVNAEPDSR PAVIAESQLAAFHIEEQRLVRLSPHHREYLQRWFRG" CDS complement(2749962..2754833) /codon_start=1 /transl_table=11 /gene="gdh" /locus_tag="BQ2027_MB2503C" /product="PROBABLE NAD-DEPENDENT GLUTAMATE DEHYDROGENASE GDH (NAD-GDH) (NAD-DEPENDENT GLUTAMIC DEHYDROGENASE)" /note="Mb2503c, gdh, len: 1623 aa. Equivalent to Rv2476c, len: 1624 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1624 aa overlap). Probable gdh, glutamate dehydrogenase (EC 1.4.1.2). Highly similar to Q9X7B2|MLCB1610.10|ML1249 HYPOTHETICAL 177.9 KDA PROTEIN from Mycobacterium leprae (1622 aa), FASTA scores: opt: 8630,E(): 0, (81.45% identity in 1634 aa overlap). But highly similar to Q9F0J1|GDH NAD-GLUTAMATE DEHYDROGENASE from Streptomyces clavuligerus (1651 aa), FASTA scores: opt: 3833, E(): 0, (45.8% identity in 1600 aa overlap); (see first citation). Also similar with others e.g. AAG53963|PA3068|GDHB HYPOTHETICAL (NAD(+)-DEPENDENT GLUTAMATE DEHYDROGENASE from Pseudomonas aeruginosa (1620 aa), FASTA scores: opt: 2214, E(): 1e-124, (40.1% identity in 1561 aa overlap) (see second citation); and Q9Y8G5|GDHB NAD-SPECIFIC GLUTAMATE DEHYDROGENASE from Agaricus bisporus (1029 aa), FASTA scores: opt: 194, E(): 0.00099, (22.7% identity in 647 aa overlap) (see third citation); etc. Contains possible Helix-turn-helix motif at aa 1568 to 1589 (score 1098, +2.93 SD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp deletion (cgg-*) leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1623 aa versus 1624 aa). Protein product from Mb2503c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2503c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1C3" /db_xref="InterPro:IPR007780" /db_xref="InterPro:IPR028971" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C3" /protein_id="SIU01119.1" /translation="MTIDPGAKQDVEAWTTFTASADIPDWISKAYIDSYRGPRDDSSE ATKAAEASWLPASLLTPAMLGAHYRLGRHRAAGESCVAVYRADDPAGFGPALQVVAEH GGMLMDSVTVLLHRLGIAYAAILTPVFDVHRSPTGELLRIEPKAEGTSPHLGEAWMHV ALSPAVDHKGLAEVERLLPKVLADVQRVATDATALIATLSELAGEVESNAGGRFSAPD RQDVGELLRWLGDGNFLLLGYQRCRVADGMVYGEGSSGMGVLRGRTGSRPRLTDDDKL LVLAQARVGSYLRYGAYPYAIAVREYVDGSVVEHRFVGLFSVAAMNADVLEIPTISRR VREALAMAESDPSHPGQLLLDVIQTVPRPELFTLSAQRLLTMARAVVDLGSQRQALLF LRADRLQYFVSCLVYMPRDRYTTAVRMQFEDILVREFGGTRLEFTARVSESPWALMHF MVRLPEVGVAGEGAAAPPVDVSEANRIRIQGLLTEAARTWADRLIGAAAAGSVGQADA MHYAAAFSEAYKQAVTPADAIGDIAVITELTDDSVKLVFSERDEQGVAQLTWFLGGRT ASLSQLLPMLQSMGVVVLEERPFSVTRPDGLPVWIYQFKISPHPTIPLAPTVAERAAT AHRFAEAVTAIWHGRVEIDRFNELVMRAGLTWQQVVLLRAYAKYLRQAGFPYSQSYIE SVLNEHPATVRSLVDLFEALFVPVPSGSASNRDAQAAAAAVAADIDALVSLDTDRILR AFASLVQATLRTNYFVTRQGSARCRDVLALKLNAQLIDELPLPRPRYEIFVYSPRVEG VHLRFGPVARGGLRWSDRRDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVVKRPPLPT GDPAADRDATRAEGVACYQLFISGLLDVTDNVDHATASVNPPPEVVRRDGDDAYLVVA ADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGYDHKAMGITARGAWEAVKRHFREI GIDTQTQDFTVVGIGDMSGDVFGNGMLLSKHIRLIAAFDHRHIFLDPNPDAAVSWAER RRMFELPRSSWGDYDRSLISEGGGVYSREQKAIPLSAQVRAVLGIDGSVDGGAAEMAP PNLIRAILRAPVDLLFNGGIGTYIKAESESDADVGDRANDPVRVNANQVRAKVIGEGG NLGVTALGRVEFDLSGGRINTDALDNSAGVDCSDHEVNIKILIDSLVSAGTVKADERT QLLESMTDEVAQLVLADNEDQNDLMGTSRANAASLLPVHAMQIKYLVAERGVNRELEA LPSEKEIARRSEAGIGLTSPELATLMAHVKLGLKEEVLATELPDQDVFASRLPRYFPT ALRERFTPEIRSHQLRREIVTTMLINDLVDTAGITYAFRIAEDVGVTPIDAVRTYVAT DAIFGVGHIWRRIRAANLPIALSDRLTLDTRRLIDRAGRWLLNYRPQPLAVGAEINRF AAMVKALTPRMSEWLRGDDKAIVEKTAAEFASQGVPEDLAYRVSTGLYRYSLLDIIDI ADIADIDAAEVADTYFALMDRLGTDGLLTAVSQLPRHDRWHSLARLAIRDDIYGALRS LCFDVLAVGEPGESSEQKIAEWEHLSASRVARARRTLDDIRASGQKDLATLSVAARQI RRMTRTSGRGISG" CDS complement(2754937..2756613) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2504C" /product="PROBABLE MACROLIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb2504c, -, len: 558 aa. Equivalent to Rv2477c, len: 558 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 558 aa overlap). Probable ATP binding protein ABC-transporter (see citation below), probably involved in macrolide transport, equivalent to Q9X7B1|MLCB1610.09|ML1248 PUTATIVE ABC TRANSPORTER ATP-BINDING PROTEIN from Mycobacterium leprae (556 aa) FASTA scores: opt: 3448, E(): 3.8e-176, (92.3% identity in 557 aa overlap). Also highly similar to many ATP binding proteins e.g. Q9L244|SC6D10.20c PUTATIVE ABC TRANSPORTER ATP-BINDING PROTEIN from Streptomyces coelicolor (547 aa), FASTA scores: opt: 2937, E(): 5.6e-149, (79.5% identity in 551 aa overlap); AAK24119|CC2148 ABC transporter ATP-binding protein from Caulobacter crescentus (555 aa), FASTA scores: opt: 2175, E(): 1.9e-108, (59.4% identity in 557 aa overlap); Q9HVJ1 PROBABLE ATP-BINDING COMPONENT OF ABC TRANSPORTER from Pseudomonas aeruginosa (554 aa), FASTA scores: opt: 2054, E(): 5.1e-102, (56.9% identity in 559 aa overlap); etc. Contains 2 x PS00017 ATP/GTP-binding site motif A (P-loop), 2 x PS00211 ABC transporters family signature, and probable coiled-coil from aa 273 to 311. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb2504c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2504c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1B9" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR022374" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR032781" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B9" /protein_id="SIU01120.1" /translation="MAEFIYTMKKVRKAHGDKVILDDVTLSFYPGAKIGVVGPNGAGK SSVLRIMAGLDKPNNGDAFLATGATVGILQQEPPLNEDKTVRGNVEEGMGDIKIKLDR FNEVAELMATDYTDELMEEMGRLQEELDHADAWDLDAQLEQAMDALRCPPADEPVTNL SGGERRRVALCKLLLSKPDLLLLDEPTNHLDAESVQWLEQHLASYPGAILAVTHDRYF LDNVAEWILELDRGRAYPYEGNYSTYLEKKAERLAVQGRKDAKLQKRLTEELAWVRSG AKARQAKSKARLQRYEEMAAEAEKTRKLDFEEIQIPVGPRLGNVVVEVDHLDKGYDGR ALIKDLSFSLPRNGIVGVIGPNGVGKTTLFKTIVGLETPDSGSVKVGETVKLSYVDQA RAGIDPRKTVWEVVSDGLDYIQVGQTEVPSRAYVSAFGFKGPDQQKPAGVLSGGERNR LNLALTLKQGGNLILLDEPTNDLDVETLGSLENALLNFPGCAVVISHDRWFLDRTCTH ILAWEGDDDNEAKWFWFEGNFGAYEENKVERLGVDAARPHRVTHRKLTRG" CDS complement(2756694..2757179) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2505C" /product="Single-stranded DNA-binding protein" /note="Mb2505c, -, len: 161 aa. Equivalent to Rv2478c, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Conserved hypothetical protein, with weak similarity with many single-strand binding proteins e.g. Q9X8U3|SCH24.29 PUTATIVE SINGLE-STRAND BINDING PROTEIN from Streptomyces coelicolor (199 aa), FASTA scores: opt: 246, E(): 4.5e-08, (31.5% identity in 162 aa overlap); P46390|SSB_MYCLE|ML2684|MLCB1913.20c SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABILIZING PROTEIN) from Mycobacterium leprae (168 aa), FASTA scores: opt: 239, E(): 1e-07, (30.8% identity in 146 aa overlap); P18310|SSBF_ECOLI SINGLE-STRAND BINDING PROTEIN from Escherichia coli (178 aa), FASTA scores: opt: 116, E(): 2.9, (25.7% identity in 140 aa overlap); etc. Also similarity with Rv0054|P71711|MTCY21D4.17|SSB_MYCTU PROBABLE SINGLE-STRAND BINDING PROTEIN from M. tuberculosis (164 aa), FASTA scores: opt: 234, E(): 2e-07, (31.75% identity in 148 aa overlap). N-terminus shorter 8 aa from AAK46855|MT2553 SINGLE-STRAND DNA BINDING PROTEIN from Mycobacterium tuberculosis strain CDC1551." /db_xref="GOA:A0A1R3Y1C2" /db_xref="InterPro:IPR000424" /db_xref="InterPro:IPR011344" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C2" /protein_id="SIU01121.1" /translation="MVGHIVNDLQRRKVGDQEVVKFRVASNSRRRTSDGGWEPGNSLF ITVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGIRRSSLEMRATSVGPDLSR VIVRIEKPAYTGPSAGDLPAATGTGAAGAADAPASAADSVSDVVVDDAITGHNPLPIS A" CDS complement(2757788..2758111) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2506C" /product="HYPOTHETICAL PROTEIN" /note="Mb2506c, -, len: 107 aa. Equivalent to Rv2481c, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 107 aa overlap). Hypothetical unknown protein. Mb2506c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C8" /protein_id="SIU01122.1" /translation="MALRRRHEPDGWPFSQRSEKPNAVRHAVRCSAVSAAASTANGTP VNWVSGRVTRAMGVHRQTRGGVASVHADSLRGAVLVHGQLRNSIPISANVPASGANTK SSIAH" CDS complement(2758127..2760496) /codon_start=1 /transl_table=11 /gene="plsB2" /locus_tag="BQ2027_MB2507C" /product="PROBABLE GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE PLSB2 (GPAT)" /note="Mb2507c, plsB2, len: 789 aa. Equivalent to Rv2482c, len: 789 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 789 aa overlap). Probable plsB2, glycerol-3-phosphate acyltransferase (EC 2.3.1.15), highly similar to Q9X7B0|PLSB_MYCLE PROBABLE GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE from Mycobacterium leprae (775 aa), FASTA scores: opt: 4210, E(): 0, (80.7% identity in 783 aa overlap). Also similar to others e.g. P00482|PLSB_ECOLI from Escherichia coli (806 aa), FASTA scores: opt: 521, E(): 3e-24, (24.35 identity in 612 aa overlap); Q9CLN7|PLSB_PASMU from Pasteurella multocida (809 aa), FASTA scores: opt: 529, E(): 9.7e-25, (27.05% identity in 540 aa overlap); Q9KVP8|PLSB_VIBCH from Vibrio cholerae (811 aa), FASTA scores: opt: 510, E(): 1.4e-23, (26.0% identity in 639 aa overlap); etc. Also highly similar to Q10775|PLSB1|Rv1551|MTCY48.14c from Mycobacterium tuberculosis (621 aa), FASTA scores: opt: 1013, E(): 1.5e-54, (34.65% identity in 586 aa overlap). BELONGS TO THE GPAT/DAPAT FAMILY. Protein product from Mb2507c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2507c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYH5" /db_xref="InterPro:IPR002123" /db_xref="InterPro:IPR022284" /db_xref="InterPro:IPR028354" /db_xref="InterPro:IPR041728" /db_xref="UniProtKB/Swiss-Prot:Q7TYH5" /protein_id="SIU01123.1" /translation="MTKPAADASAVLTAEDTLVLASTATPVEMELIMGWLGQQRARHP DSKFDILKLPPRNAPPAALTALVEQLEPGFASSPQSGEDRSIVPVRVIWLPPADRSRA GKVAALLPGRDPYHPSQRQQRRILRTDPRRARVVAGESAKVSELRQQWRDTTVAEHKR DFAQFVSRRALLALARAEYRILGPQYKSPRLVKPEMLASARFRAGLDRIPGATVEDAG KMLDELSTGWSQVSVDLVSVLGRLASRGFDPEFDYDEYQVAAMRAALEAHPAVLLFSH RSYIDGVVVPVAMQDNRLPPVHMFGGINLSFGLMGPLMRRSGMIFIRRNIGNDPLYKY VLKEYVGYVVEKRFNLSWSIEGTRSRTGKMLPPKLGLMSYVADAYLDGRSDDILLQGV SICFDQLHEITEYAAYARGAEKTPEGLRWLYNFIKAQGERNFGKIYVRFPEAVSMRQY LGAPHGELTQDPAAKRLALQKMSFEVAWRILQATPVTATGLVSALLLTTRGTALTLDQ LHHTLQDSLDYLERKQSPVSTSALRLRSREGVRAAADALSNGHPVTRVDSGREPVWYI APDDEHAAAFYRSSVIHAFLETSIVELALAHAKHAEGDRVAAFWAQAMRLRDLLKFDF YFADSTAFRANIAQEMAWHQDWEDHLGVGGNEIDAMLYAKRPLMSDAMLRVFFEAYEI VADVLRDAPPDIGPEELTELALGLGRQFVAQGRVRSSEPVSTLLFATARQVAVDQELI APAADLAERRVAFRRELRNILRDFDYVEQIARNQFVAREFKARQGRDRI" CDS complement(2760493..2762235) /codon_start=1 /transl_table=11 /gene="plsC" /locus_tag="BQ2027_MB2508C" /product="possible transmembrane phospholipid biosynthesis bifunctional enzyme plsc: putative l-3-phosphoserine phosphatase (o-phosphoserine phosphohydrolase) (psp) (pspase) + 1-acyl-sn-glycerol-3-phosphate acyltransferase (1-agp acyltransferase) (1-agpat) (lysophosphatidic acid acyltransferase) (lpaat)" /note="Mb2508c, plsC, len: 580 aa. Equivalent to Rv2483c, len: 580 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 580 aa overlap). Possible plsC, a transmembrane phospholipid biosynthesis bifunctionnal enzyme, including L-3-phosphoserine phosphatase (EC 3.1.3.3) and 1-acyl-Sn-glycerol-3-phosphate acyltransferase (EC 2.3.1.51), equivalent to Q9X7A9|PLSC|ML1245 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (579 aa), FASTA scores: opt: 2835, E(): 9.2e-153, (77.15% identity in 573 aa overlap). C-terminal end is similar to many 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASES (LYSOPHOSPHATIDIC ACIDACYLTRANSFERASES) e.g. Q9SDQ2 from Limnanthes floccosa (281 aa), FASTA scores: opt: 378, E(): 3.1e-14, (30.0% identity in 230 aa overlap) and Q42868|PLSC_LIMAL from Limnanthes alba (White meadowfoam) (281 aa), FASTA scores: opt: 374, E(): 5.2e-14, (30.55% identity in 221 aa overlap); and the N-terminal end is similar to many SERB FAMILY PROTEINS e.g. AAK44749|MT0526 from Mycobacterium tuberculosis strain CDC1551 (308 aa), FASTA scores: opt: 356, E(): 5.8e-13, (32.5% identity in 298 aa overlap) and Q49823|ML2424 from Mycobacterium leprae (300 aa), FASTA scores: opt: 346, E(): 2.1e-12, (32.0% identity in 278 aa overlap). So belongs to the 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE FAMILY and may belong to the SERB FAMILY. Protein product from Mb2508c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2508c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3G3" /db_xref="InterPro:IPR002123" /db_xref="InterPro:IPR006385" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G3" /protein_id="SIU01124.1" /translation="MSAADEQGEERATRKSAPDLRLPGSVAEILASPAGPKVGAFFDL DGTLVAGFTAVILTQERLRRRDMGVGELLGMVQAGLNHTLGRIEFEDLIGKAAAALAG RLLTDLEEIGERLFAQRIESRIYPEMRELVRAHVARGHTVVLSSSALTIQVGPVARFL GINNMLTNKFETNEDGILTGGVLKPILWGPGKATAVQRFAAEHDIDLKDSYFYADGDE DVALMYLVGNPRPTNPEGKMAAVAKRRGWPILKFNSRGGVGIRRQLRTLAGLSTIVPV AAGAVGIGVLTGSRRRGVNFFTSTFSQLLLATSGVHLNVIGKENLTAQRPAVFIFNHR NQVDPVIAGALVRDNWVGVGKKELASDPIMGTLGKLLDGVFIDRDDPVAAVETLHTVE ERARNGLSIVIAPEGTRLDTTEVGSFKKGPFRIAMAAKIPIVPIVIRNAEIVASRNST TINPGTVDVAVFPPIPVDDWTLDALPDRIAEVRQLYLDTLADWPVDGLPAVDLYAEQK AARKARAQVAKATAKRVPAKKAPAKSAANKGAAATKAATKKASPKAKPSESKIAGKDG EASASPSSSAKGRS" CDS complement(2762232..2763707) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2509C" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb2509c, -, len: 491 aa. Equivalent to Rv2484c, len: 491 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 491 aa overlap). Conserved hypothetical protein, highly similar or similar to many Mycobacterial hypothetical proteins e.g. Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 2459, E(): 3e-138, (75.15% identity in 483 aa overlap); O53304|YU87_MYCTU|Rv3087|MTV013.08 from Mycobacterium tuberculosis (472 aa), FASTA scores: opt: 527, E(): 8.1e-24, (29.1% identity in 485 aa overlap); O53305|YU88_MYCTU|Rv3088|MT3173|MTV013.09 from Mycobacterium tuberculosis (474 aa), FASTA scores: opt: 370, E(): 1.6e-14, (26.05% identity in 422 aa overlap); etc. Protein product from Mb2509c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2509c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y252" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3Y252" /protein_id="SIU01125.1" /translation="MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTP DWDRFRTRFENASRRVLRLRQKVVVPTLPTAAPRWVVDPDFNLDFHVRRVRVSGPATL REVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDGVGGVEMFAQI YDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVGGVLDALSGAVSMAGRAVL EPVSTVSGILGYARSGIRVLNRAAEPSPLLRRRSLTTRTEAIDIRLADLHKAAKAGGG SINDAYLAGLCGALRRYHEALGVPISTLPMAVPVNLRAEGDAAGGNQFTGVNLAAPVG TIDPVARMKKIRAQMTQRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASN VPVYPGDTYLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFA QCLQAGFDEILALAGDPAPRVLPASFDTQGAGSVPRSVSGS" CDS complement(2763936..2765201) /codon_start=1 /transl_table=11 /gene="lipQ" /locus_tag="BQ2027_MB2510C" /product="PROBABLE CARBOXYLESTERASE LIPQ" /note="Mb2510c, lipQ, len: 421 aa. Equivalent to Rv2485c, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 421 aa overlap). Probable lipQ, carboxylesterase protein (lipase) (EC 3.1.-.-). Similar (greater at the C-terminal end) to AAK46626|MT2342 PUTATIVE CARBOXYLESTERASE from Mycobacterium tuberculosis strain CDC1551 (431 aa), FASTA scores: opt: 1134, E(): 4.3e-60, (46.25% identity in 428 aa overlap); and Q50681|Rv2284|MTCY339.26c HYPOTHETICAL PROTEIN from M. tuberculosis strain H37Rv (431 aa), FASTA scores: opt: 1134, E(): 4.3e-60, (46.25% identity in 428 aa overlap). Also similar in part to other putative lipases/esterases e.g. AAK44451|MT0230 from Mycobacterium tuberculosis strain CDC1551 (403 aa), FASTA scores: opt: 763, E(): 4.6e-38, (37.95% identity in 390 aa overlap); Q9RY19|DR0133 from Deinococcus radiodurans (296 aa), FASTA scores: opt: 392, E(): 4e-16, (33.7% identity in 276 aa overlap); Q9Z545|SC9B2.14 from Streptomyces coelicolor (502 aa) FASTA scores: opt: 279, E(): 3.2e-09, (31.15% identity in 292 aa overlap); etc. Mb2510c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1K5" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1K5" /protein_id="SIU01126.1" /translation="MHIASVTSRCSRAGAEALRQGAQLAADARDTCRAGALLLRGSPC AIGWVAGWLSAEFPARVVTGHALSRISPRSIGRFGTSWAAQRADQILHAALVDAFGPD FRDLVWHPTGEQSEAARRSGLLNLPHIPGPHRRYAAQTSDIPYGPGGRENLLDIWRRP DLAPGRRAPVLIQVPGGAWTINGKRPQAYPLMSRMVELGWICVSINYSKSPRCTWPAH IVDVKRAIAWVRENIADYGGDPDFITITGGSAGAHLAALAALSANDPALQPGFESADT AVQAAAPYYGVYDLTNAENMHEMMMPFLEHFVMRSRYVDNPGLFKAASPISYVHSEAP PFFVLHGEKDPMVPSAQSRAFSAALRDAGAATVSYAELPNAHHAFDLAATVRSRMVAE AVSDFLGVIYGRRMGARKGSLALSSPPAS" tRNA 2765389..2765462 /locus_tag="BQ2027_ARGW" /product="tRNA-Arg" /note="argW, len: 74 nt. Equivalent to argW, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Arg; anticodon tct." CDS 2765563..2766333 /codon_start=1 /transl_table=11 /gene="echA14" /locus_tag="BQ2027_MB2511" /product="PROBABLE ENOYL-COA HYDRATASE ECHA14 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb2511, echA14, len: 256 aa. Equivalent to Rv2486, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Probable echA14, enoyl-coA hydratase (EC 4.2.1.17), similar to others e.g. P24162|ECHH_RHOCA2|FADB1 from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (257 aa), FASTA scores; opt: 453, E(): 3.8e-23, (39.4% identity in 259 aa overlap); Q9ETY7|PACA|PAAG from Azoarcus evansii (273 aa), FASTA scores: opt: 404, E(): 5.7e-17, (37.5% identity in 224 aa overlap); P77467|PAAG_ECOLI from Escherichia coli (262 aa), FASTA scores: opt: 401, E(): 8.3e-17, (36.3% identity in 259 aa overlap); etc. Contains PS00166 Enoyl-CoA hydratase/isomerase signature. BELONGS TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb2511 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2511 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64019" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/Swiss-Prot:P64019" /protein_id="SIU01127.1" /translation="MAQYDPVLLSVDKHVALITVNDPDRRNAVTDEMSAQLRAAIQRA EGDPDVHAVVVTGAGKAFCAGADLSALGAGVGDPAEPRLLRLYDGFMAVSSCNLPTIA AVNGAAVGAGLNLALAADVRIAGPAALFDARFQKLGLHPGGGATWMLQRAVGPQVARA ALLFGMCFDAESAVRHGLALMVADDPVTAALELAAGPAAAPREVVLASKATMRATASP GSLDLEQHELAKRLELGPQAKSVQSPEFAARLAAAQHR" CDS complement(2766513..2767016) /pseudo /codon_start=1 /transl_table=11 /gene="PE_PGRS42d" /locus_tag="BQ2027_MB2512C" /note="Mb2512c, PE_PGRS42d, len: 167 aa. Similar to 3' end of Rv2487c, len: 694 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of Gly-rich proteins, similar to many e.g. AAK47245|MT2919 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1515 (663 aa), FASTA scores: opt: 2317, E(): 2.3e-84, (58.35% identity in 622 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS42 exists as a single gene. In Mycobacterium bovis, 2 frameshifts, the first due to a single base insertion (*-c) and the second due to a single base deletion (g-*) splits PE_PGRS42 into 3 parts, PE_PGRS42a and PE_PGRS42b and PE_PGRS42d.;PE-PGRS FAMILY PROTEIN [THIRD PART]" CDS complement(2767013..2767960) /codon_start=1 /transl_table=11 /gene="PE_PGRS42b" /locus_tag="BQ2027_MB2513C" /product="PE-PGRS FAMILY PROTEIN [SECOND PART]" /note="Mb2513c, PE_PGRS42b, len: 315 aa. Similar to middle section of Rv2487c, len: 694 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 284 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of Gly-rich proteins, similar to many e.g. AAK47245|MT2919 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1515 (663 aa), FASTA scores: opt: 2317, E(): 2.3e-84, (58.35% identity in 622 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS42 exists as a single gene. In Mycobacterium bovis, 2 frameshifts, the first due to a single base insertion (*-c) and the second due to a single base deletion (g-*) splits PE_PGRS42 into 3 parts, PE_PGRS42a and PE_PGRS42b and PE_PGRS42d." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1A8" /protein_id="SIU01129.1" /translation="MFGHGGAGGTGGAGLAGANGVNPTPGPAASTGDSPADVSGIGDQ TGGDGGTGGHGTAGTPTGGTGGDGATATAGSGKATGGAGGDGGTAAAGGGGGNGGDGG VAQGDIASAFGGDGGNGSDGVAAGSGGGSGGAGGGAFVHIVTATSTGGSGGFGGNGAA SAASGADGGAGGAGGNGGAGGLLFGDGGNGGAGGAGGIGGDGATGGPGEAAATLASRG LTAQTPRQNPMWSAARVVMAARAAAALASAAPAGPAARAATAAPAGCCSATAATAATP GPAGMAAPALPVGLAVTAAVVAPRRFTKTRSLVSGRSVA" CDS complement(2767977..2768597) /codon_start=1 /transl_table=11 /gene="PE_PGRS42a" /locus_tag="BQ2027_MB2514C" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb2514c, PE_PGRS42a, len: 206 aa. Similar to 5' end of Rv2487c, len: 694 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of Gly-rich proteins, similar to many e.g. AAK47245|MT2919 PE_PGRS family protein from M. tuberculosis strain CDC1515 (663 aa), FASTA scores: opt: 2317, E(): 2.3e-84, (58.35% identity in 622 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS42 exists as a single gene. In Mycobacterium bovis, 2 frameshifts, the first due to a single base insertion (*-c) and the second due to a single base deletion (g-*) splits PE_PGRS42 into 3 parts, PE_PGRS42a and PE_PGRS42b and PE_PGRS42d." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D3" /protein_id="SIU01130.1" /translation="MSLVIATPQLLATAALDLASIGSQVSAANAAAAMPTTEVVAAAA DEVSAAIAGLFGAHARQYQALSVQVAAFHEQFVQALTAAAGRYASTEAAVERSLLGAV NAPTEALLGRPLIGNGADGTAPGQPGAAGGLLFGQRWQRRGWRVRSNRRQRRRGRVDR QRRQRRGRWYRRGRRCRWERGVVVGQRRQRRCRRHQRGRRHRGCGR" CDS complement(2768679..2772092) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2515C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (LUXR-FAMILY)" /note="Mb2515c, -, len: 1137 aa. Equivalent to Rv2488c, len: 1137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1137 aa overlap). Probable transcriptional regulatory protein, belonging to luxR family, similar to many in Mycobacterium tuberculosis e.g. AAK44621|MT0399 from strain CDC1551 (1092 aa) FASTA scores: opt: 3767, E(): 1.8e-211, (56.75% identity in 1093 aa overlap); O53720|Rv0386|MTV036.21 from strain H37Rv (1085 aa), FASTA scores: opt: 3756, E(): 7.6e-211, (56.75% identity in 1089 aa overlap); AAK45665|MT1402 from strain CDC1551 (1159 aa), FASTA scores: opt: 3395, E(): 8.2e-190, (52.0% identity in 1093 aa overlap); etc. Also similar to transcriptional regulatory proteins luxR-family from other organisms e.g. Q9CBP3|ML1753 from Mycobacterium leprae (1106 aa), FASTA scores: opt: 2823, E(): 1.5e-156, (50.35% identity in 1116 aa overlap); Q9KYF4|SCD72A.02 from Streptomyces coelicolor (1114 aa), FASTA scores: opt: 915, E(): 1.7e-45, (30.7% identity in 1143 aa overlap); etc. Some similarity with Q9KXP6|SC9C5.28 HYPOTHETICAL 81.8 KDA PROTEIN from Streptomyces coelicolor (750 aa), FASTA scores: opt: 1085, E(): 1.6e-55, (35.45% identity in 722 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature, probable coiled-coil from aa 585 to 616 and probable helix-turn-helix motif at aa 1086 to 1107 (score 1206, +3.29 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb2515c detected using SWATH mass spectrometry. Mb2515c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1C9" /db_xref="InterPro:IPR000792" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR002182" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029787" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C9" /protein_id="SIU01131.1" /translation="MDRRPRDFEQSRRRCRCNALRAGSMLASMSKIHPGVDVVPVDWS ADGVSELVPTGTVTLLLADIEGATHLPGSQLDTTAIAKLDRTLTELVREHRGVCPVEQ GEGDSFLVAFARASDAVACALGLQRAPLAPIRLRIGMHTGEVSSPDEGNCVGPTIDRT ARLRELAHGGQTVLSGTTSDLVADLLPKDAWLNDLGTYRLDDLPRPERVVQLCHPDLH NAFPPLRTRKVVGAHCLPAQLTRLVGRVDEVAQVRGLLDVKRWVTLTGVGGVGKTRLA TQVASAVADGYPDGVWYVNLAPITDPALVPIAAARVLGLPDQPGRSTVDTIVRRIGDR RMLVVLDNCEHLLDGCAALIVALLGACPALRVLATSREPIAVAGEQIWRVPPLGHGEA IELFTDRAREARPELEITADNLALVTEICHRLDGIPLAIELAASRVRALALTEIVDSL HDRFRLLTGGSRIAVRRQQTMRASVDWSHALLTGPEQVLFRRLAVFPSGFDLDGAQAA AAGGDVQRYEVVDLLSLLADKSLVVTDDSDGRTRYRLLETVRQYALEKLRESGDADAV RARHRDHYAAVAAGLDAPSVAGHERRLNQAELEIDNLRAAFAFSRENGDTGHALLLAS CLQPLWRARGRLQEGLAWFAAALADHDAHPAGADPGLYARALADRALIDAVAGITDRL DDAQKALAIARDIEDPALLARALTACGGVAAYNADLARPWLAEAVGLARAVGDKWRLA EVLAWQAYVGFAGEGDPGATRAAGEEARSLADEIGDAFLSRSCRWALAAANLWQGNLE AAVGLSREVIGESDAAHDMVSSCAGQACLAHALAHRGDTEAAAAAQASIDTAVGLSPV LSGSACSALVFATLAAGDVAAAEHARESATRFFGASAAAIINDPTSSAQISCARGDLN AAHRLADGAASITRGVHRARALTTRCRIEIAQGDRHRAERDAHDALGVAASIGAYLWV PDILECLASVMADAGSNREAVRLFGAADAARGRMGAVRFGIYQAGCNSSLATLRKSMG DSEFDDAWAEGTALSIDEAIAYAQRGRGARKRPTSGWGALTPTELEVALLVGEGLSNK EIGVRLFISPRTVHSHLTHVYTKLGLSSRLQLAQQAARRGESERGPSRP" CDS complement(2772058..2772357) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2516C" /product="HYPOTHETICAL ALANINE RICH PROTEIN" /note="Mb2516c, -, len: 99 aa. Equivalent to Rv2489c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Hypothetical unknown ala-rich protein. Mb2516c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D1" /protein_id="SIU01132.1" /translation="MGVTAKAAEAAAPSSSFPSLRKPHRAGDSADRSAGDFDGTAHDA VVSVLAGDAASTGGLTIASGQHGHCRSAAMARRSPNASTKARRTHGPAAKRFRAI" CDS complement(2772466..2775918) /codon_start=1 /transl_table=11 /gene="PE_PGRS43b" /locus_tag="BQ2027_MB2517C" /product="PE-PGRS FAMILY PROTEIN [SECOND PART]" /note="Mb2517c, PE_PGRS43b, len: 1150 aa. Similar to 3' end of Rv2490c, len: 1660 aa, from Mycobacterium tuberculosis strain H37Rv, (98.0% identity in 1150 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS-subfamily of Gly-rich proteins, similar to many e.g. AAK47971|MT3612.1 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1551 (1715 aa), FASTA scores: opt: 5161, E(): 1.5e-187, (51.7% identity in 1752 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS43 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 8 bp insertion (*-gggggggg) splits PE_PGRS43 into 2 parts, PE_PGRS43a and PE_PGRS43b." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D6" /protein_id="SIU01133.1" /translation="MTAIFLGSSGTPGEDGGNGGAGGAGGAGGAHAGDGGAGGAGGNG GAGGAGGNGAHGFNAVLVSDGGNGGDGGAGGRGGDGGAGGAGGDAPAGRAGSQGVGGD GGAGGAGGAPGNGGSGGRGDMAFKDGDGGAGGDGGDPGAGGKGGAGGAGATEGVTGAT GATVHSGGNGGKGGNGADATVAGANGGKGGAGGNGGLVGDGGAGGDGGSGAAGANGAN VGEDGADGTLSGQPGEGSEANGGQGGVGGGGAGGAGGDGGAGSSALGSGGNGGRGDAG QAGGAGGAGGAGGAGGSVSGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGG AGGKGGDGGVGGVGGDGGPGGDGGAGGAAPAGQVGSHGVGGVGGDGGLGGAGGNGGDG GHGSDGGDGGDGGDPGAGGLGGLGGDSGNGTRAASGVDASDHGPGSGGNGGNGGNGAQ ASVAGGAGGNGGDGGNAGRVGDGGAGGNGGDGAAGANGANSGAPGSDALALGQPGGNG GQGDAGQAGGAGGAGGAGGSVSGDGGAGGNGGAGGNGGVGASGGAGARGANGIDSIGG TGGAGGGGGDGGAGGVGGHGGDGGVGGAAPSGTVGSHGTGGVGGDGGLGGAGGVGGAG GNGGIGITVGGAGGAGGNGGDPGAGGRGGLGGDSGNGTSAANGVDASKHGPLTGGDGG VGGNGAKAAAAGGDGGQGGDGGNAGLFGDGGAGGDGADGTAAEALGGDGGAGGAGGKG GDAGDIGDGGDGGKGGDGAHGALGGLTVAGGNGGAGGAGGAGGAGGAFLGDGGNGGAG GQGGAGRGGSPGGGGGVGGHGGAGGDAGMNGGGGTGGQGGNGAAGGAGWSPDSDLKGF DGFDGGSGGAGGDGGAGGAGGTQTGDGGDGGAGGLGGAGGVGGNGVDGFDINETTGRD GDGGDGGYGGWGGAGGNGGAGGSAPAGEVGNRGVGGDGGDGGSGGDAGNGGLGGDGFT YLADFDGEPGGDGGDGGDGGWGRPGGQGGFGSTSGAHGKAGFGAPGGDGGDGGNGGHG GDGNGSFADAGDGGPGGNGGNGGLGGAGRDGGAPGGDGGDGGTGGSGGFGAPPPRSIG GGDGGDGGRGGDGGRGAGGLTSGGVGSSGESGGSGNGRGDPGSGGSGGEGGEGGPSIS VNVT" CDS complement(2775915..2777444) /codon_start=1 /transl_table=11 /gene="PE_PGRS43a" /locus_tag="BQ2027_MB2518C" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb2518c, PE_PGRS43a, len: 509 aa. Similar to 5' end of Rv2490c, len: 1660 aa, from Mycobacterium tuberculosis strain H37Rv, (98.0% identity in 444 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS-subfamily of Gly-rich proteins, similar to many e.g. AAK47971|MT3612.1 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1551 (1715 aa), FASTA scores: opt: 5161, E(): 1.5e-187, (51.7% identity in 1752 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS43 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 8 bp insertion (*-gggggggg) splits PE_PGRS43 into 2 parts, PE_PGRS43a and PE_PGRS43b." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B4" /protein_id="SIU01134.1" /translation="MSYVIATPEMMATAAFDLARIGSQVSAASAVAAMPTTEVVAAGA DEVSAGIAALFSAHAQEYQALSAQAAAFHDQFVHTLTAAARWYTATEIANAAAMRVVL GAVNAPTQTLLGRPLIGDGAHGTAPGQPGGAGGLLFGNGGNGAAGAVGQVGGAGGAAG LFGIGGAGGAGGAGAPGGTGGTGGWLAGGGGVGGMGGAGGGAGGAGGNAGLFGNGGAG GAGGAGGGAGGAGGNAGWFGHGGAGGVGGVGAAGANGATPGQDGAAGVAGSDDGAGGD GLAGSDGGDGGAGGVGGNGGRGGWLLGNGGAGGVGGVGGAGGAGAAGGAGGAGATGIN GPAGISAAGGDGGAGGNGGAGGNGGVGGAGGAGGSAGLLGYVGRAGDGGAGGGGGLGG APGDGGAGGNGGSWLAAGDGGAGGHGGDPGLGGAGGAGGGGGRRAAPVLARGPMVWRP ATTGRSAAATAAKVAMAPTHRSPAVMAVTAVPVATAGWSVTVGPAVMAVTEPRVPAMP I" CDS 2777873..2778496 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2519" /product="Rossmann-fold nucleotide-binding protein" /note="Mb2519, -, len: 207 aa. Equivalent to Rv2491, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Conserved hypothetical protein, similar in part to other hypothetical proteins e.g. O29139|AF1126 from Archaeoglobus fulgidus (151 aa), FASTA scores: opt: 293, E(): 2.8e-11, (42.85% identity in 126 aa overlap); O66531|AQ_134 from Aquifex aeolicus (151 aa), FASTA scores: opt: 261, E(): 2.6e-09, (37.75% identity in 106 aa overlap); Q9HKU3|TA0501 from Thermoplasma acidophilum (161 aa), FASTA scores: opt: 260, E(): 3.2e-09, (35.9% identity in 117 aa overlap); etc." /db_xref="InterPro:IPR005268" /db_xref="InterPro:IPR041164" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G8" /protein_id="SIU01135.1" /translation="MVDTSAPASRLDTDPRRAHVSLSKHPYQIGVFGSGTIGPRVYEL AYQVGAEIAKQGHILISGGMTGTMEASSRGASDADGLVVGVLPGDKFTDGNAYSTIKI LSGMQFARNYITGLSCHGAIVVGGSSGAYEEARRVWEGRGPVVVLANSGSPTGASAQM LSMQEIFGVAFPEDKPKPWRVFSAATPAESVSLVIGLIRKGYAQHEP" CDS 2778486..2779238 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2520" /product="Thymidylate synthase" /note="Mb2520, -, len: 250 aa. Equivalent to Rv2492, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 250 aa overlap). Hypothetical unknown protein." /db_xref="InterPro:IPR036926" /db_xref="UniProtKB/TrEMBL:A0A1R3Y264" /protein_id="SIU01136.1" /translation="MSRRIINEFGVQIYGATIGDTWAGLVRAVLDLGSQCFDEDRERI ALSNVRIKSSVQNYPDLTIEEHCNSDQLKAMLDFMFNTDTMEDIDVVKSFSRGAKSYH RRIKEGRMIEFVIERLSLIPESKKAVVVFPTYEDYAAVMRNHRDDYLPCLVSIQFRLL PDGKDYVFHTTFYSRSMDAWQKGHGNLLSIAKLSDWVRENVSARIGRKIMLGPLDGMI CDVHIYKETYAEACKRLANLDLRRTQFDAVRN" CDS 2779291..2779512 /codon_start=1 /transl_table=11 /gene="vapb38" /locus_tag="BQ2027_MB2521" /product="possible antitoxin vapb38" /note="Mb2521, -, len: 73 aa. Equivalent to Rv2493, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Conserved hypothetical protein, highly similar to AAK46916|MT2606 HYPOTHETICAL 8.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (74 aa), FASTA scores: opt: 234, E(): 4e-09, (56.95% identity in 74 aa overlap); and similar to O53373|Rv3321c|MTV016.21c HYPOTHETICAL 8.8 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (80 aa), FASTA scores: opt: 126, E(): 0.055, (30.75% identity in 78 aa overlap); and with weak similarity with other Mycobacterial hypothetical proteins e.g. Q9CCR7|ML0525 from Mycobacterium leprae (58 aa), FASTA scores: opt: 115, E(): 0.22, (47.75% identity in 44 aa overlap); etc. Protein product from Mb2521 detected using SWATH mass spectrometry. Mb2521 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1L5" /protein_id="SIU01137.1" /translation="MRTTLDLDDDVIAAARELASSQRRSLGSVISELARRGLMPGRVE ADDGLPVIRVPAGTPPITPEMVRRALDED" CDS 2779518..2779943 /codon_start=1 /transl_table=11 /gene="vapc38" /locus_tag="BQ2027_MB2522" /product="possible toxin vapc38. contains pin domain." /note="Mb2522, -, len: 141 aa. Equivalent to Rv2494, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P95023|EMBL:Z83863|MTCY159.26|Rv2530c (139 aa) FASTA scores: opt: 380 E(): 6.6e-19, (48.0% identity in 125 aa overlap); O53372|Rv3320c|MTV016.20c (142 aa), FASTA scores: opt: 287, E(): 1.3e-12, (41.6% identity in 125 aa overlap); AAK46915|MT2605 (strain CDC1551) (139 aa) FASTA scores: opt: 380, E(): 6.6e-19 (48.0% identity in 125 aa overlap); etc. Mb2522 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1R4" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R4" /protein_id="SIU01138.1" /translation="MALLDVNALVALAWDSHIHHARIREWFTANATLGWATCPLTEAG FVRVSTNPKVLPSAIGIADARRVLVALRAVGGHRFLADDVSLVDDDVPLIVGYRQVTD AHLLTLARRRGVRLVTFDAGVFTLAQQRPKTPVELLTIL" CDS complement(2779966..2781147) /codon_start=1 /transl_table=11 /gene="bkdc" /locus_tag="BQ2027_MB2523C" /product="probable branched-chain keto acid dehydrogenase e2 component bkdc" /note="Mb2523c, pdhC, len: 393 aa. Equivalent to Rv2495c, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 393 aa overlap). Probable pdhC, dihydrolipoamide S-acetyltransferase, e2 component (EC 2.3.1.12), similar to others e.g. Q9XA49|SCGD3.30c from Streptomyces coelicolor (491 aa) FASTA scores: opt: 615, E(): 1.2e-28, (36.45% identity in 491 aa overlap; several gaps); P19262|ODO2_YEAST|KGD2|YDR148C|YD8358.05c from Saccharomyces cerevisiae (Baker's yeast) (463 aa) FASTA scores: opt: 533, E(): 7.1e-24, (28.55% identity in 396 aa overlap); Q9HN75|DSA|VNG2219G from Halobacterium sp. strain NRC-1 (478 aa), FASTA scores: opt: 521, E(): E(): 3.7e-23, (30.25% identity in 486 aa overlap; in part); etc. BELONGS TO THE 2-OXOACID DEHYDROGENASE FAMILY. Protein product from Mb2523c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2523c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1B8" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR001078" /db_xref="InterPro:IPR004167" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR036625" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1B8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01139.1" /translation="MSGEDSIRSFPVPDLGEGLQEVTVTCWSVAVGDDVEINQTLCSV ETAKAEVEIPSPYAGRIVELGGAEGDVLKVGAELVRIDTGPTAVAQPNGEGAVPTLVG YGADAAIETSRRTSRPLAAPVVRKLAKELAVDLAALQRGSGAGGVITRADVLAAARGG VGAGPDVRPVHGVHARMAEKMTLSHKEIPTAKASVEVICAELLRLRDWFVSAAPEITP FALTLRLLVIALKHNVILNSTWVDSGEGPQVHVHRGVHLGFGAATERGLLVPVVTDAQ DKNTRELASRVAELITGAREGTLTPAELRGSTFTVSNFGALGVDDGVPVINHPEAAIL GLGAIKPRPVVVGGEVVARPTMTLTCVFDHRVVDGAQVAQFMCELRDLIESPETALLD L" CDS complement(2781144..2782190) /codon_start=1 /transl_table=11 /gene="bkdb" /locus_tag="BQ2027_MB2524C" /product="probable branched-chain keto acid dehydrogenase e1 component, beta subunit bkdb" /note="Mb2524c, pdhB, len: 348 aa. Equivalent to Rv2496c, len: 348 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 348 aa overlap). Probable pdhB, pyruvate dehydrogenase e1 component, beta subunit (EC 1.2.4.1), similar to others e.g. Q9Y8I6||PDHB from Halobacterium volcanii (Haloferax volcanii) (327 aa) FASTA scores: opt: 1050, E(): 6.4e-60, (49.7% identity in 324 aa overlap); Q9KG98|BH0214 from Bacillus halodurans (328 aa), FASTA scores: opt: 987, E(): 6.9e-56, (45.7% identity in 324 aa overlap); Q9HN76|PDHB|VNG2218G from Halobacterium sp. strain NRC-1 (297 aa), FASTA scores: opt: 968, E(): 1.1e-54, (51.2% identity in 297 aa overlap); P21874|ODPB_BACST|PDHB PYRUVATE DEHYDROGENASE E1 COMPONENT from Bacillus stearothermophilus (324 aa), FASTA scores: opt: 951, E(): 1.4e-53, (47.6% identity in 321 aa overlap); etc. Also similar to Q9XA61|SCGD3.17c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1, BETA SUBUNIT (2-oxoisovalerate dehydrogenase) (EC 1.2.4.4) from Streptomyces coelicolor, (326 aa), FASTA scores: opt: 1178, E(): 4.1e-68, (55.0% identity in 322 aa overlap); Q9XA48|SCGD3.31c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1 BETA SUBUNIT from Streptomyces coelicolor (334 aa), FASTA scores: opt: 1173, E(): 8.8e-68, (55.6% identity in 320 aa overlap); Q53593|BKDB E1-BETA BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE from Streptomyces avermitilis (334 aa), FASTA scores: opt: 1132, E(): 3.7e-65, (55.0% identity in 320 aa overlap); etc. Protein product from Mb2524c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2524c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1E4" /db_xref="InterPro:IPR005475" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR033248" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01140.1" /translation="MTQIADRPARPDETLAVAVSDITQSLTMVQAINRALYDAMAADE RVLVFGEDVAVEGGVFRVTEGLADTFGADRCFDTPLAESAIIGIAVGLALRGFVPVPE IQFDGFSYPAFDQVVSHLAKYRTRTRGEVDMPVTVRIPSFGGIGAAEHHSDSTESYWV HTAGLKVVVPSTPGDAYWLLRHAIACPDPVMYLEPKRRYHSRGMVDTSRPEPPIGHAM VRRSGTDVTVVTYGNLVSTALSSADTAEQQHDWSLEVIDLRSLAPLDFDTIAASIQRT GRCVVMHEGPRSLGYGAGLAARIQEEMFYQLEAPVLRACGFDTPYPPARLEKLWLPGP DRLLDCVERVLRQP" CDS complement(2782201..2783304) /codon_start=1 /transl_table=11 /gene="bkda" /locus_tag="BQ2027_MB2525C" /product="probable branched-chain keto acid dehydrogenase e1 component, alpha subunit bkda" /note="Mb2525c, pdhA, len: 367 aa. Equivalent to Rv2497c, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 367 aa overlap). Probable pdhA, pyruvate dehydrogenase e1 component, alpha subunit (EC 1.2.4.1), similar to many e.g. Q9Y8I5|PDHA from Halobacterium volcanii (Haloferax volcanii) (368 aa) FASTA scores: opt: 961, E(): 1.3e-52, (45.6% identity in 351 aa overlap); BAB40585 from Bacillus sp. UTB2301 (356 aa) FASTA scores: opt: 947, E(): 9.1e-52, (43.1% identity in 355 aa overlap); Q9KG99|BH0213 from Bacillus halodurans (367 aa), FASTA scores: opt: 896, E(): 1.4e-48, (42.65% identity in 340 aa overlap); etc. Also similar to several PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASES E1, BETA SUBUNIT (EC 1.2.4.4), alternate name : 2-oxoisovalerate dehydrogenase, e.g. Q53592|BKDA from Streptomyces avermitilis (381 aa), FASTA scores: opt: 980, E(): 8.5e-54, (45.65% identity in 370 aa overlap); etc. Protein product from Mb2525c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2525c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1D9" /db_xref="InterPro:IPR001017" /db_xref="InterPro:IPR017596" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01141.1" /translation="MGEGSRRPSGMLMSVDLEPVQLVGPDGTPTAERRYHRDLPEETL RWLYEMMVVTRELDTEFVNLQRQGELALYTPCRGQEAAQVGAAACLRKTDWLFPQYRE LGVYLVRGIPPGHVGVAWRGTWHGGLQFTTKCCAPMSVPIGTQTLHAVGAAMAAQRLD EDSVTVAFLGDGATSEGDVHEALNFAAVFTTPCVFYVQNNQWAISMPVSRQTAAPSIA HKAIGYGMPGIRVDGNDVLACYAVMAEAAARARAGDGPTLIEAVTYRLGPHTTADDPT RYRSQEEVDRWATLDPIPRYRTYLQDQGLWSQRLEEQVTARAKHVRSELRDAVFDAPD FDVDEVFTTVYAEITPGLQAQREQLRAELARTD" CDS complement(2783563..2784384) /codon_start=1 /transl_table=11 /gene="citE" /locus_tag="BQ2027_MB2526C" /product="PROBABLE CITRATE (PRO-3S)-LYASE (BETA SUBUNIT) CITE (CITRASE) (CITRATASE) (CITRITASE) (CITRIDESMOLASE) (CITRASE ALDOLASE)" /note="Mb2526c, citE, len: 273 aa. Equivalent to Rv2498c, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Probable citE, citrate lyase, beta subunit (EC 4.1.3.6), similar to others e.g. Q9S3L3|CITE from Corynebacterium glutamicum (Brevibacterium flavum) (217 aa), FASTA scores: opt: 565, E(): 1.5e-28, (41.85% identity in 215 aa overlap); Q9HRM8|CITE|VNG0627G from Halobacterium sp. strain NRC-1 (303 aa), FASTA scores: opt: 535, E(): 1.5e-26, (41.65% identity in 276 aa overlap); Q9S2U9|SC4G6.02 from Streptomyces coelicolor (274 aa), FASTA scores: opt: 426, E(): 1e-19, (37.6% identity in 274 aa overlap); P77770|CILB_ECOLI from Escherichia coli (307 aa), FASTA scores: opt: 265, E(): 1.5e-10, (32.8% identity in 265 aa overlap); etc. Also similar to Rv3075c|MTCY22D7.06 from Mycobacterium tuberculosis, FASTA score: (35.2% identity in 264 aa overlap). Protein product from Mb2526c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2526c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1E1" /db_xref="InterPro:IPR005000" /db_xref="InterPro:IPR011206" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E1" /protein_id="SIU01142.1" /translation="MNLRAAGPGWLFCPADRPERFAKAAAAADVVILDLEDGVAEAQK PAARNALRDTPLDPERTVVRINAGGTADQARDLEALAGTAYTTVMLPKAESAAQVIEL APRDVIALVETARGAVCAAEIAAADPTVGMMWGAEDLIATLGGSSSRRADGAYRDVAR HVRSTILLAASAFGRLALDAVHLDILDVEGLQEEARDAAAVGFDVTVCIHPSQIPVVR KAYRPSHEKLAWARRVLAASRSERGAFAFEGQMVDSPVLTHAETMLRRAGEATSE" CDS complement(2784381..2784938) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2527C" /product="POSSIBLE OXIDASE REGULATORY-RELATED PROTEIN" /note="Mb2527c, -, len: 185 aa. Equivalent to Rv2499c, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 185 aa overlap). Possible oxidase regulatory-related protein, similar to many maoC MONOAMINE OXIDASE REGULATORY PROTEIN e.g. Q9RUZ1|DR1239 MAOC-RELATED PROTEIN from Deinococcus radiodurans (160 aa), FASTA scores: opt: 519, E(): 7.6e-28, (58.1% identity in 148 aa overlap); BAB48392|MLR0905 Probable monoamine oxidase regulatory protein from Rhizobium loti (Mesorhizobium loti) (150 aa), FASTA scores: opt: 480, E(): 2.9e-25, (49.0% identity in 149 aa overlap); Q9HN18|MAOC1|VNG2290G MONOAMINE OXIDASE REGULATORY-LIKE from Halobacterium sp. strain NRC-1 (208 aa), FASTA scores: opt: 419, E(): 4.6e-21, (45.6% identity in 158 aa overlap); P77455|MAOC_ECOLI|PAAZ|B1387 MaoC protein (Phenylacetic acid degradation protein paaZ) from Escherichia coli strain K12 (681 aa), FASTA scores: opt: 252, E(): 1.9e-09, (36.0% identity in 172 aa overlap); etc. But also similar to other proteins with different putative functions e.g. Q9HRM9|MAOC2|VNG0626G MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Halobacterium sp strain NRC-1 (157 aa), FASTA scores: opt: 380, E(): 1.5e-18, (45.75% identity in 153 aa overlap); Q9KIF1 FKBR2 from Streptomyces hygroscopicus var. ascomyceticus (175 aa), FASTA scores: opt: 355, E(): 7.6e-17, (42.0% identity in 150 aa overlap); CAC36828|Q99Q03|SAPE Spore associated protein from Streptomyces coelicolor (174 aa), FASTA scores: opt: 318, E(): 2.2e-14, (41.45% identity in 152 aa overlap); etc. Protein product from Mb2527c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2527c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002539" /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E5" /protein_id="SIU01143.1" /translation="MTKHAGDRESDDAVSACRVAGSTVGRRILQRGLWFEEFQIGTTY LHRPGRTVTEADNVLFTTLTMNTQSLHLDAAWAGQQPGFRGERLVNSMFTLSTMVGLS VAQLTLGTIVANLGFSEVSFPKPVFHGDTLYAETVCTGKRESKSRPGEGIVTLEHIAR NQHGEVVARAVRTTLVQKQSIKEAQ" CDS complement(2784935..2786119) /codon_start=1 /transl_table=11 /gene="fadE19" /locus_tag="BQ2027_MB2528C" /product="POSSIBLE ACYL-COA DEHYDROGENASE FADE19 (MMGC)" /note="Mb2528c, fadE19, len: 394 aa. Equivalent to Rv2500c, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 394 aa overlap). Possible fadE19 (alternate gene name: mmgC), acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9XCG6|ACDH from Streptomyces coelicolor (386 aa), FASTA scores: opt: 1714, E(): 1.1e-98, (69.45% identity in 383 aa overlap); Q9XCG5|ACDH from Streptomyces avermitilis (386 aa), FASTA scores: opt: 1713, E(): 1.3e-98, (70.0% identity in 383 aa overlap); Q9L7W5|FENK from Bacillus subtilis (370 aa), FASTA scores: opt: 1094, E(): 2.3e-60, (48.4% identity in 372 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb2528c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2528c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1C1" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C1" /protein_id="SIU01144.1" /translation="MTTTTTTISGGILPKEYQDLRDTVADFARTVVAPVSAKHDAEHS FPYEIVAKMGEMGLFGLPFPEEYGGMGGDYFALSLVLEELGKVDQSVAITLEAAVGLG AMPIYRFGTEEQKQKWLPDLTSGRALAGFGLTEPGAGSDAGSTRTTARLEGDEWIING SKQFITNSGTDITSLVTVTAVTGTTGTAADAKKEISTIIVPSGTPGFTVEPVYNKVGW NASDTHPLTFADARVPRENLLGARGSGYANFLSILDEGRIAIAALATGAAQGCVDESV KYANQRQSFGQPIGAYQAIGFKIARMEARAHVARTAYYDAAAKMLAGKPFKKEAAIAK MISSEAAMDNSRDATQIHGGYGFMNEYPVARHYRDSKVLEIGEGTTEVQLMLIARSLG LQ" CDS complement(2786124..2788088) /codon_start=1 /transl_table=11 /gene="accA1" /locus_tag="BQ2027_MB2529C" /product="PROBABLE ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN (ALPHA SUBUNIT) ACCA1: BIOTIN CARBOXYLASE + BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)" /note="Mb2529c, accA1, len: 654 aa. Equivalent to Rv2501c, len: 654 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 654 aa overlap). Probable accA1 (alternate gene name: bccA), acetyl-/propionyl-coenzyme A carboxylase (alpha subunit) [INCLUDES: BIOTIN CARBOXYLASE (EC 6.3.4.14); BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)], similar to others eg Q9L076|FABG from Streptomyces coelicolor (646 aa), FASTA scores: opt: 2071, E(): 1e-113, (57.8% identity in 659 aa overlap); AAK24139|Q9A6C6|CC2168 from Caulobacter crescentus (654 aa), FASTA scores: opt: 1754, E(): 3.7e-95, (47.2% identity in 661 aa overlap); etc. Contains PS00188 Biotin-requiring enzymes attachment site, PS00866 Carbamoyl-phosphate synthase subdomain signature 1, and PS00867 Carbamoyl-phosphate synthase subdomain signature 2. Protein product from Mb2529c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2529c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A509" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR001882" /db_xref="InterPro:IPR005479" /db_xref="InterPro:IPR005481" /db_xref="InterPro:IPR005482" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR011764" /db_xref="InterPro:IPR016185" /db_xref="UniProtKB/Swiss-Prot:P0A509" /protein_id="SIU01145.1" /translation="MFDTVLVANRGEIAVRVIRTLRRLGIRSVAVYSDPDVDARHVLE ADAAVRLGPAPARESYLDIGKVLDAAARTGAQAIHPGYGFLAENADFAAACERARVVF LGPPARAIEVMGDKIAAKNAVAAFDVPVVPGVARAGLTDDALVTAAAEVGYPVLIKPS AGGGGKGMRLVQDPARLPEALVSARREAMSSFGDDTLFLERFVLRPRHIEVQVLADAH GNVVHLGERECSLQRRHQKVIEEAPSPLLDPQTRERIGVAACNTARCVDYVGAGTVEF IVSAQRPDEFFFMEMNTRLQVEHPVTEAITGLDLVEWQLRVGAGEKLGFAQNDIELRG HAIEARVYAEDPAREFLPTGGRVLAVFEPAGPGVRVDSSLLGGTVVGSDYDPLLTKVI AHGADREEALDRLDQALARTAVLGVQTNVEFLRFLLADERVRVGDLDTAVLDERSADF TARPAPDDVLAAGGLYRQWALARRAQGDLWAAPSGWRGGGHMAPVRTAMRTPLRSETV SVWGPPESAQVQVGDGEIDCASVQVTREQMSVTISGLRRDYRWAEADRHLWIADERGT WHLREAEEHKIHRAVGARPAEVVSPMPGSVIAVQVESGSQISAGDVVVVVEAMKMEHS LEAPVSGRVQVLVSVGDQVKVEQVLARIKD" CDS complement(2788093..2789682) /codon_start=1 /transl_table=11 /gene="accD1" /locus_tag="BQ2027_MB2530C" /product="PROBABLE ACETYL-/PROPIONYL-COA CARBOXYLASE (BETA SUBUNIT) ACCD1" /note="Mb2530c, accD1, len: 529 aa. Equivalent to Rv2502c, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 529 aa overlap). Probable accD1, acetyl-/propionyl-CoA carboxylase (beta subunit) (EC 6.4.1.-), similar, but with N-terminus shorter, to Q9L077|ACCD1 from Streptomyces coelicolor (538 aa), FASTA scores: opt: 2747, E(): 1.9e-159, (77.9% identity in 516 aa overlap). Also similar to others e.g. AAK24141 CC2170 from Caulobacter crescentus (530 aa), FASTA scores: opt: 2413, E(): 3.8e-139, (69.4% identity in 529 aa overlap); BAB54131|MLL7731 from Rhizobium loti (537 aa), FASTA scores: opt: 2399, E(): 2.7e-138, (67.4% identity in 527 aa overlap); etc. COULD BELONG TO THE ACCD/PCCB FAMILY. Protein product from Mb2530c detected using SWATH mass spectrometry. Mb2530c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y277" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/TrEMBL:A0A1R3Y277" /protein_id="SIU01146.1" /translation="MTTPSIAIAPSFADEHRRLVAELNNKLAAAALGGNERARKRHVS RGKLLPRERVDRLLDPGSPFLELAPLAAGGMYSDESPGAGIITGIGRVSGRQCVIVAN DATVKGGTYYPMTVKKHLRAQEVALQNMLPCIYLVDSGGAFLPRQDEVFPDREHFGRI FYNQATMSAKGIPQVAAVLGSCTAGGAYVPAMSDEAVIVREQGTIFLGGPPLVKAATG EIVSAEELGGGDLHSRTSGVTDHLADDDEDALRIVRAIADTFGPCEPAQWDVRRSVEP KYPQAELYDVVPPDPRVPYDVHEVVVRIVDGSEFSEFKAKYGKTLVTAFARVHGHPVG IVANNGVLLSESALKGAHFIELCDKRKIPLLFLQNIAGFMVGRDYEAGGIAKHGAKMV TAVACARVPKLTVVIGGSYGAGNYSMCGRAYSPRFLWMWPNARISVMGGEQAASVLAT VRGEQLSAAGTPWSPDEEEAFKAPIRAQYEDQGNPYYSTARLWDDGIIDPADTRTVVG LALSLCAHAPLDQVGYGVFRM" CDS complement(2789679..2790335) /codon_start=1 /transl_table=11 /gene="scoB" /locus_tag="BQ2027_MB2531C" /product="PROBABLE SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE (BETA SUBUNIT) SCOB (3-OXO-ACID:COA TRANSFERASE) (OXCT B) (SUCCINYL CoA:3-OXOACID CoA-TRANSFERASE)" /note="Mb2531c, scoB, len: 218 aa. Equivalent to Rv2503c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Probable scoB, 3-oxo acid:CoA transferase, beta subunit (succinyl-CoA:3-ketoacid-CoA transferase) (EC 2.8.3.5). Highly similar to others e.g. Q9XAM8|SC4C6.12c from Streptomyces coelicolor (217 aa), FASTA scores: opt: 1048, E(): 2.6e-60, (73.9% identity in 207 aa overlap); Q9XD82|PCAJ from Streptomyces sp. 2065 (214 aa), FASTA scores: opt: 1031, E(): 3.2e-59, (70.8% identity in 209 aa overlap); AAK53493|LPSJ from Xanthomonas campestris (pv. campestris) (212 aa), FASTA scores: opt: 886, E(): 6.6e-50, (62.5% identity in 208 aa overlap); P42316|SCOB_BACSU from Bacillus subtilis (216 aa), FASTA scores: opt: 820, E(): 1.2e-45, (58.2% identity in 201 aa overlap); etc. BELONGS TO THE 3-OXOACID COA-TRANSFERASE SUBUNIT B FAMILY. Protein product from Mb2531c detected using SWATH mass spectrometry. Mb2531c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63651" /db_xref="InterPro:IPR004164" /db_xref="InterPro:IPR004165" /db_xref="InterPro:IPR012791" /db_xref="InterPro:IPR037171" /db_xref="UniProtKB/Swiss-Prot:P63651" /protein_id="SIU01147.1" /translation="MSAPGWSRDEMAARVAAEFEDGQYVNLGIGMPTLIPNHIPDGVH VVLHSENGILGVGPYPRREDVDADLINAGKETVTTLPGAAFFSSSTSFGIIRGGHLDV AVLGAMQVSVTGDLANWMIPGKMVKGMGGAMDLVHGARKVIVMMEHTAKDGSPKILER CTLPLTGVGCVDRIVTELAVIDVCADGLHLVQTAPGVSVDEVVAKTQPPLVLRDLATQ " CDS complement(2790332..2791078) /codon_start=1 /transl_table=11 /gene="scoA" /locus_tag="BQ2027_MB2532C" /product="PROBABLE SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE (ALPHA SUBUNIT) SCOA (3-OXO ACID:CoA TRANSFERASE) (OXCT A) (SUCCINYL-COA:3-OXOACID-COENZYME A TRANSFERASE)" /note="Mb2532c, scoA, len: 248 aa. Equivalent to Rv2504c, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 248 aa overlap). Probable scoA, succinyl-CoA:3-ketoacid-Coenzyme A transferase, alpha subunit (3-oxo acid:CoA transferase) (EC 2.8.3.6). Highly similar to others e.g. Q9XAM7|SC4C6.13c from Streptomyces coelicolor (260 aa), FASTA scores: opt: 1130, E(): 2.2e-64, (69.9% identity in 249 aa overlap); Q9XD83|PCAI from Streptomyces sp. 2065 (251 aa), FASTA scores: opt: 1121, E(): 8.1e-64, (69.5% identity in 249 aa overlap); etc. BELONGS TO THE 3-OXOACID COA-TRANSFERASE SUBUNIT A FAMILY. Protein product from Mb2532c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2532c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63649" /db_xref="InterPro:IPR004163" /db_xref="InterPro:IPR004165" /db_xref="InterPro:IPR012792" /db_xref="InterPro:IPR037171" /db_xref="UniProtKB/Swiss-Prot:P63649" /protein_id="SIU01148.1" /translation="MDKVVATAAEAVADIANGSSLAVGGFGLCGIPEALIAALVDSGV TDLETVSNNCGIDGVGLGLLLQHKRIRRTVSSYVGENKEFARQFLAGELEVELTPQGT LAERLRAGGMGIPAFYTPAGVGTQVADGGLPWRYDASGGVAVVSPAKETREFDGVTYV LERGIRTDFALVHAWQGDRHGNLMYRHAAANFNPECASAGRITIAEVEHLVEPGEIDP ATVHTPGVFVHRVVHVPNPAKKIERETVRQ" CDS complement(2791161..2792804) /codon_start=1 /transl_table=11 /gene="fadD35" /locus_tag="BQ2027_MB2533C" /product="PROBABLE FATTY-ACID-COA LIGASE FADD35 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb2533c, fadD35, len: 547 aa. Equivalent to Rv2505c, len: 547 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 547 aa overlap). Probable fadD35, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to many e.g. Q9Z5A6|SC2G5.17 from Streptomyces coelicolor (541 aa), FASTA scores: opt: 2202, E(): 8e-131, (61.55% identity in 528 aa overlap); Q9F9U4|FADD from Pseudomonas stutzeri (Pseudomonas perfectomarina), FASTA scores: opt: 1551, E(): 7.3e-90, (55.55% identity in 551 aa overlap); Q987S7|MLR6932 from Rhizobium loti (Mesorhizobium loti) (590 aa), FASTA scores: opt: 1453, E(): 1.1e-83, (50.7% identity in 564 aa overlap); etc." /db_xref="GOA:A0A1R3Y1C6" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1C6" /protein_id="SIU01149.1" /translation="MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREA LVDMVARRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAEIG AILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPRCPDLADVILL ESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTTAYPKGVTLSHRNILNNGY LVGELLGYTAQDRICIPVPFYHCFGMVMGNLAATSHGAAMVIPAPGFDPAATLRAVQD ERCTSLYGVPTMFIAELGLPDFTDYELGSLRTGIMAGAACPVEVMRKVISRMHMPGVS ICYGMTETSPVSTQTRADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCT RGYSVMAGYWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLREYCMGRIA RFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ" CDS 2792920..2793567 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2534" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb2534, -, len: 215 aa. Equivalent to Rv2506, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 215 aa overlap). Probable transcriptional regulator, tetR family, similar to many others e.g. Q9L078|SCC105.06c PUTATIVE TETR-FAMILY REGULATORY PROTEIN from Streptomyces coelicolor (208 aa), FASTA scores: opt: 333, E(): 1.5e-14, (48.75% identity in 197 aa overlap); Q9X7X6|SC6A5.30c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (404 aa), FASTA scores: opt: 267, E(): 4.8e-10, (30.45% identity in 207 aa overlap) (similarity only with C-terminus for this one); Q9FBI8|SCP8.33c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (213 aa), FASTA scores: opt: 239, E(): 1.8e-08, (29.9% identity in 184 aa overlap); etc. Also similar to transcriptional regulatory proteins from Mycobacterium tuberculosis e.g. O05858|Rv3208|MTCY07D11.18c (228 aa), FASTA scores: opt: 218, E(): 4.4e-07, (30.35% identity in 191 aa overlap); C-terminus of P95251|Rv1963c|MTV051.01c|MTCY09F9.01 (406 aa), FASTA scores: opt: 238, E(): 3.6e-08, (28.25% identity in 177 aa overlap); P96839|Rv3557c|MTCY06G11.04c (200 aa), FASTA scores: opt: 215, E(): 6.2e-07, (38.25% identity in 148 aa overlap); etc. Equivalent to AAK46885 from Mycobacterium tuberculosis strain CDC1551 (231 aa) but shorter 16 aa. Contains probable helix-turn-helix motif at aa 46-67, (Score 1660, +4.84 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb2534 detected using SWATH mass spectrometry. Mb2534 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1E9" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="InterPro:IPR041490" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E9" /protein_id="SIU01150.1" /translation="MTASAPDGRPGQPEATNRRSQLKSDRRFQLLAAAERLFAERGFL AVRLEDIGAAAGVSGPAIYRHFPNKESLLVELLVGVSARLLAGARDVTTRSANLAAAL DGLIEFHLDFALGEADLIRIQDRDLAHLPAVAERQVRKAQRQYVEVWVGVLRELNPGL AEADARLMAHAVFGLLNSTPHSMKAADSKPARTVRARAVLRAMTVAALSAADRCL" CDS 2793646..2794467 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2535" /product="POSSIBLE CONSERVED PROLINE RICH MEMBRANE PROTEIN" /note="Mb2535, -, len: 273 aa. Equivalent to Rv2507, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Possible conserved pro-rich membrane protein (N-terminal half is Proline-rich), highly similar to Q9CCU3|ML0431 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (259 aa) (alias O07711|MLCL383.38c but longer 2 aa), FASTA scores: opt: 968, E(): 1.4e-31, (60.35% identity in 275 aa overlap). Contains potential membrane spanning region. Mb2535 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1E8" /db_xref="InterPro:IPR008693" /db_xref="InterPro:IPR038468" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E8" /protein_id="SIU01151.1" /translation="MNDPRRPQRFGPPLSGYGPTGPQVPPNPPTADPAYADQSPYAST YGGYVSPPWSPGGPPPRPPQWPPGPHEASPTQQLPQYWQYDQPPPGGFPPDGLTPPPP QGPRTPRWLWFAAGSAVLLVVALVIALVIANGSVKKQTAIEPLPPMPGPSPTRPTTTT PTPPSPSAAPAPTTTTGTPSETVAGAMQTVVYDVTGEGRAISITYMDSGNVIQTEFNV ALPWRKEVSLSKSSLHPASVTIVNIGHNVTCSVTVAGVQVRQRTGAGLTICDAPS" CDS complement(2794464..2795801) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2536C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE LEUCINE AND ALANINE RICH PROTEIN" /note="Mb2536c, -, len: 445 aa. Equivalent to Rv2508c, len: 445 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 445 aa overlap). Probable conserved integral membrane leu-, ala-rich protein, equivalent to Q9CCU4|ML0430 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (454 aa) (alias O07710|MLCL383.37 longer 10 aa), FASTA scores: opt: 2205, E(): 2.5e-124, (75.75% identity in 441 aa overlap). Also similar to hypothetical or membrane proteins e.g. BAB50841|MLL4103 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (458 aa), FASTA scores: opt: 396, E(): 2.4e-16, (27.75% identity in 447 aa overlap); Q9RKX9|SC6D7.19c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (486 aa), FASTA scores: opt: 323, E(): 5.7e-12, (28.95% identity in 428 aa overlap); P42306|YXIO_BACSU PROBABLE INTEGRAL MEMBRANE PROTEIN from Bacillus subtilis (428 aa), FASTA scores: opt: 220, E(): 7.2e-06, (20.35% identity in 413 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Q10564|Y876_MYCTU|Rv0876c|MT0899|MTCY31. 04c (548 aa), FASTA scores: opt: 184, E(): 0.0012, (24.7% identity in 466 aa overlap). Mb2536c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1F8" /db_xref="InterPro:IPR024671" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F8" /protein_id="SIU01152.1" /translation="MNNPGSRAGTLLHFRVVAWAMWDCGSTGLNAIVTTFVFSVYLTS AVGQGLPGGTSPASWLGRAGAVAGLTIGVLAPVVGVWVESPHRRRVALSVLTGTAVAL TCAMFLIRDDPRYLWAGLVLLAATAASSDLSSVPYNAMLRQLSTPSTAGRISGFGWAS GYVGSVALLLVIYLGFMSGSGSQRGLLQLPVANGLNVRMAMLVAAAWLALLGLPLLLV AHRLPDSGAASHPSTGLLGGYRKLWTEISAEWRRDRNLVYFLVASAIFRDGLAAIFAF GAVLGVNAYGLTQADVLIFGAAASVVAAVGAVLGGFVDHRIGSKPVIVGSLAAIIAAA LTLLTLSGPTAFWACGLLLCVFIGPAQSSARALLLHMAQHGKEGVAFGLYTMTGRAVS FLGPWLFSVFVDVFHTVRAGLGGVCLVLTTGLLLMLRVQVSRHGGALTTAQSS" CDS 2795886..2796692 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2537" /product="probable short-chain type dehydrogenase/reductase" /note="Mb2537, -, len: 268 aa. Equivalent to Rv2509, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 268 aa overlap). Probable ala-rich oxidoreductase, short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to O07709|MLCL383.36c|ML0429 DEHYDROGENASE (PUTATIVE OXIDOREDUCTASE) from Mycobacterium leprae (268 aa), FASTA scores: opt: 1509, E(): 2.6e-84, (88.75% identity in 267 aa overlap). Also highly similar to others e.g. O86553|SC1F2.16c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (276 aa), FASTA scores: opt: 492, E(): 9.5e-23, (38.15% identity in 262 aa overlap); Q9I5R3|PA0658 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (266 aa), FASTA scores: opt: 472, E(): 1.5e-21, (37.8% identity in 246 aa overlap); AAK22120|CC0133 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Caulobacter crescentus (266 aa), FASTA scores: opt: 428, E(): 6.9e-19, (35.8% identity in 243 aa overlap); etc. Also highly similar or similar to oxidoreductases from Mycobacterium tuberculosis e.g. Q10782|Rv1544|MTCY48.21 PUTATIVE KETOACYL REDUCTASE (EC 1.3.1.-) (267 aa), FASTA scores: opt: 656, E(): 1.1e-32, (43.05% identity in 267 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb2537 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2537 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1F1" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F1" /protein_id="SIU01153.1" /translation="MPIPAPSPDARAVVTGASQNIGAALATELAARGHHLIVTARRED VLTELAARLADKYRVTVDVRPADLADPQERSKLADELAARPISILCANAGTATFGPIA SLDLAGEKTQVQLNAVAVHDLTLAVLPGMIERKAGGILISGSAAGNSPIPYNATYAAT KAFVNTFSESLRGELRGSGVHVTVLAPGPVRTELPDASEASLVEKLVPDFLWISTEHT ARVSLNALERNKMRVVPGLTSKAMSVASQYAPRAIVAPIVGAFYKRLGGS" CDS complement(2796696..2798297) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2538C" /product="Predicted ATPase" /note="Mb2538c, -, len: 533 aa. Equivalent to Rv2510c, len: 533 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 533 aa overlap). Hypothetical unknown protein, highly similar, but longer approximatively 20 aa, to others e.g. Q9ABY0|CC0090 HYPOTHETICAL PROTEIN from Caulobacter crescentus (516 aa), FASTA scores: opt: 1282, E(): 8.4e-63, (45.1% identity in 490 aa overlap); Q9A130|SPY0500 HYPOTHETICAL PROTEIN from Streptococcus pyogenes (500 aa), FASTA scores: opt: 1281, E(): 9.3e-63, (43.8% identity in 491 aa overlap); Q985L5|MLR7622 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (515 aa), FASTA scores: opt: 1259, E(): 1.5e-61, (44.1% identity in 510 aa overlap); P39342|YJGR_ECOLI|B4263 HYPOTHETICAL 54.3 KDA PROTEIN from Escherichia coli strain K12 (500 aa), FASTA scores: opt: 1257, E(): 1.9e-61, (42.7% identity in 501 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2538c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2538c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR033186" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D2" /protein_id="SIU01154.1" /translation="MGTESAAGGPGGPAQRIAAGYTVEGQALQLGTVVVDGEPDPSAQ IRIPLATVNRHGLVAGATGTGKTKTLQLIAEQLSAAGVAVLMADVKGDLSGLARPGEA ADKTAARAKDTGDDWVPTAFPVEFLSLGASGVGVPVRATISSFGPILLAKVLGLNATQ ESTLGLIFHWADQRGLPLLDLKDLRAVITHLTSDEGKVELKSLGAVSPTTAGVILRAL VNLEAEGADTFFGEPELRPEDLLRVDSQGRGIISLLEFGSQALRPAMFSTFLMWVLAD LFTFLPEVGDLDKPKLVFFFDEAHLLFTDASKAFLEQVEQTVKLIRSKGVGVFFCTQL PTDLPNDVLSQLGARIQHALRAFTPDDHKALRKTVRTYPKTDVYDLESALTSLGTGEA VVTVLSEKGAPTPVAWTRMRAPRSLMAAIGAEAIGAAAQASSLQAVYGQTIDRPSAHE ILSAKLAPAQEAPAQEAPAPRGQYDPLPWPDDFEVPPMPAPVEPQGPAVWEEILKNPT VKSVLNTTAREITRSIFGTGRRRRK" CDS 2798365..2799012 /codon_start=1 /transl_table=11 /gene="orn" /locus_tag="BQ2027_MB2539" /product="oligoribonuclease orn" /note="Mb2539, orn, len: 215 aa. Equivalent to Rv2511, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 215 aa overlap). Probable orn, oligoribonuclease (EC 3.1.-.-), equivalent to O07708|ORN_MYCLE|ORN|ML0427|MLCL383.34c OLIGORIBONUCLEASE from Mycobacterium leprae (215 aa), FASTA scores: opt: 1170, E(): 3.5e-65, (84.5% identity in 213 aa overlap). Also highly similar to many e.g. P57667|ORN_STRGR|ORNA from Streptomyces griseus (201 aa), FASTA scores: opt: 807, E(): 7.7e-43, (59.0% identity in 200 aa overlap); ORN_STRCO|ORNA|2SC13.01 from Streptomyces coelicolor (200 aa), FASTA scores: opt: 799, E(): 2.4e-42, (59.7% identity in 201 aa overlap); P39287|ORN_ECOLI|B4162 from Escherichia coli strain K12 (180 aa), FASTA scores: opt: 519, E(): 3.9e-25, (47.4% identity in 173 aa overlap); etc. BELONGS TO THE OLIGORIBONUCLEASE FAMILY. Protein product from Mb2539 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2539 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65598" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR013520" /db_xref="InterPro:IPR022894" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/Swiss-Prot:P65598" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01155.1" /translation="MQDELVWIDCEMTGLDLGSDKLIEIAALVTDADLNILGDGVDVV MHADDAALSGMIDVVAEMHSRSGLIDEVKASTVDLATAEAMVLDYINEHVKQPKTAPL AGNSIATDRAFIARDMPTLDSFLHYRMIDVSSIKELCRRWYPRIYFGQPPKGLTHRAL ADIHESIRELRFYRRTAFVPQPGPSTSEIAAVVAELSDGAGAQEETDSAEAPQSG" tRNA 2799062..2799134 /locus_tag="BQ2027_HIST" /product="tRNA-His" /note="hisT, len: 73 nt. Equivalent to hisT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-His, anticodon gtg." mobile_element complement(2799702..2801136) /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081-3" /note="IS1081-3, len: 1435 nt. Equivalent to IS1081, len: 1450 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1435 nt overlap)." gene complement(2799702..2801136) /locus_tag="BQ2027_IS1081-3" repeat_region 2799740..2799754 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS complement(2799764..2801011) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2540C" /product="TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1081" /note="Mb2540c, -, len: 415 aa. Equivalent to Rv2512c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 415 aa overlap). Transposase for IS1081, identical to P35882|TRA1_MYCBO transposase for insertion sequence element IS1081 from Mycobacterium bovis (415 aa), FASTA scores: opt: 2680, E(): 1.9e-162, (100.0% identity in 415 aa overlap). Also highly similar to others from Mycobacterium tuberculosis e.g. P96354|Rv1047|MTCY10G2.02c|Rv3115|MTCY164.25|Rv3023c|MTV01 2.38c (415 aa), FASTA scores: opt: 2675, E(): 3.9e-162, (99.75% identity in 415 aa overlap). Contains PS00435 Peroxidases proximal heme-ligand signature, PS01007 Transposases, Mutator family, signature. BELONGS TO THE MUTATOR FAMILY OF TRANSPOSASE." /db_xref="GOA:P60231" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/Swiss-Prot:P60231" /protein_id="SIU01156.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" CDS 2801011..2801148 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2541" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2541, -, len: 45 aa. Equivalent to Rv2512A, len: 45 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 45 aa overlap). Conserved hypothetical protein, equivalent to N-terminus of Rv1046c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (239 aa), FASTA scores: opt: 293, E(): 2.1e-18, (100.0% identity in 42 aa overlap). Mb2541 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1N6" /protein_id="SIU01157.1" /translation="MRHFLRLTFAGRFEGSRDDRPLLGYDTPTGLTCPYTTPLDVTPR R" repeat_region complement(2801049..2801063) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS 2801369..2801791 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2542" /product="HYPOTHETICAL PROTEIN" /note="Mb2542, -, len: 140 aa. Equivalent to Rv2513, len: 140 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 140 aa overlap). Hypothetical unknown protein. Mb2542 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T5" /protein_id="SIU01158.1" /translation="MDDIAAFKLDSLPDITFTVTRAISSGGENPAGFLNFAARREQPE ILGGGGRPGPVGPEAVDTPRIRGGKVPFVFRTLPGYTFYASQIEPRVGDPEGPTLLAG FGNIPETSQRSPGWIRITCKGPDDDEELEFFGFAGPES" CDS complement(2802085..2802546) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2543C" /product="regulated by general enzymatic activity" /note="Mb2543c, -, len: 153 aa. Equivalent to Rv2514c, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 153 aa overlap). Conserved hypothetical protein, showing some similarity to Q9PG05|XF0497 HYPOTHETICAL PROTEIN from Xylella fastidiosa (155 aa), FASTA scores: opt: 215, E(): 1.4e-07, (30.6% identity in 160 aa overlap)." /db_xref="InterPro:IPR016541" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01159.1" /translation="MLYSFDTSAILNGRRDLFRPAVFRSLWGRVEDAISAGQIRSVDE VQRELARRDDDAKRWADGQTGLFCPLDEQIQQAARHILRLHPNMVRQGGRRSAADPFV IALAMVNNATVVTQETASGNIEKPRIPDVCDALGVPWLTLMGYIEAQGWTF" CDS complement(2802552..2803799) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2544C" /product="DNA-binding protein" /note="Mb2544c, -, len: 415 aa. Equivalent to Rv2515c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 415 aa overlap). Conserved hypothetical protein, showing some similarity to Q9PG06|XF0496 HYPOTHETICAL PROTEIN from Xylella fastidiosa (391 aa), FASTA scores: opt: 388, E(): 4.4e-18, (27.8% identity in 399 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb2544c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y1F6" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010359" /db_xref="InterPro:IPR010982" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F6" /protein_id="SIU01160.1" /translation="MGIGHPMWVGWCIIIAMRSIPASVESSVLRWARESCGLTEVAAA RKLGLPDDRVAAWEVGEVVPTIAQLRKAAEVYKRSLAVFFLSEPPEGFDTLRDFRRLD GAASGQWTPGLHEEFRRAHTQRDFALELADAEDREIPGAWRLPLSGDEADADIAARIR KALIEVSPLPIPVASVDPYEHLNAWVSAIETSGVLVLATRGGKVAIDEMRGMCLYFDE LPVIVLNGSDHPRPRLFSLLHEFVHVVLHTEGLCDVIADAHPSTQDRSLEARCNAIAA AVLMPADVVRARPEVIVRSETPSSWDYESLRPVAAHFGVSAEAFLRRLSTLGIVPVEV YRQRRAEFIAAHEDEAERARSAGGGNWYRNTVRDLGKGYVRAVTDAHRRRVIDSNTAA IYLDAKVSQIPKLAESAELRSVV" CDS complement(2803918..2804721) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2545C" /product="HYPOTHETICAL PROTEIN" /note="Mb2545c, -, len: 267 aa. Equivalent to Rv2516c, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (99.625% identity in 267 aa overlap). Hypothetical unknown protein. Contains probable helix-turn-helix motif at aa 98 to 119 (Score 1743, +5.12 SD). C-terminus extended since first submission (+ 18 aa). Protein product from Mb2545c detected using SWATH mass spectrometry. Mb2545c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F5" /protein_id="SIU01161.1" /translation="MTADWVVTFTFDADPSMETMDAWETQLEGFDALVSRVPGHGIDV TVYAPGDWSVFDALAKMAGEVMPVVQAKSPIAVQIISEPEHRLRAEAFTTPELMSAAE IADELGVSRQRVHQLRSTAGFPAPLADLRGGAVWDAAAVRRFAETWERKPGRPHTGTA KFAYSWAVGPAVGRSGKAPNVRWRVENPDKIRFVLRNIGDDIAEDVEIDLSRIDAITR NVPKKTVIRPGEGLNMVLIAAWGHPLPNQLYVRWAGQDEWAAVPLHPAH" CDS complement(2804718..2804969) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2546C" /product="unknown protein" /note="Mb2546c, -, len: 83 aa. Equivalent to Rv2517c, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Hypothetical unknown protein. Equivalent to AAK46899 from Mycobacterium tuberculosis strain CDC1551 (97 aa) but shorter 14 aa. Protein product from Mb2546c detected using SWATH mass spectrometry. Mb2546c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01162.1" /translation="MNSAIIKIAKWAQSQQWTVEDDASGYTRFYNPQGVYIARFPATP SNEYRRMRDLLGALKKAGLTWPPPSKKERRAQHRKEGAQ" CDS complement(2805317..2806543) /codon_start=1 /transl_table=11 /gene="ldtb" /locus_tag="BQ2027_MB2547C" /product="probable l,d-transpeptidase ldtb" /note="Mb2547c, lppS, len: 408 aa. Equivalent to Rv2518c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 408 aa overlap). Probable lppS, conserved lipoprotein, highly similar to O07707|MLCL383.3 HYPOTHETICAL 43.6 KDA PROTEIN from Mycobacterium leprae (407 aa), FASTA scores: opt: 2300, E(): 1.2e-130, (82.5% identity in 406 aa overlap); Q9CCU5|LPPS|ML0426 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (404 aa), FASTA scores: opt: 2279, E(): 2.3e-129, (82.4% identity in 403 aa overlap); and Q9CB49|ML2446 POSSIBLE LIPOPROTEIN from Mycobacterium leprae (441 aa), FASTA scores: opt: 736, E(): 8.4e-37, (35.6% identity in 399 aa overlap). Also similar to other proteins from several organisms e.g. Q9X811|SC6G10.26c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (424 aa), FASTA scores: opt: 867, E(): 1.1e-44, (32.25% identity in 403 aa overlap); Q9L1E8|SC3D11.14 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 737, E(): 7e-37, (32.95% identity in 413 aa overlap); Q9KYV1|SCE22.11 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (407 aa), FASTA scores: opt: 721, E(): 6.2e-36, (33.5% identity in 400 aa overlap). And similar to several hypothetical mycobacterial proteins e.g. Q11149|Y483_MYCTU|Rv0483|MT0501|MTCY20G9.09 (451 aa), FASTA scores: opt: 763, E(): 2.1e-38, (34.85% identity in 402 aa overlap). Has very long signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb2547c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2547c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1G0" /db_xref="InterPro:IPR005490" /db_xref="InterPro:IPR038063" /db_xref="InterPro:IPR041280" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G0" /protein_id="SIU01163.1" /translation="MPKVGIAAQAGRTRVRRAWLTALMMTAVMIGAVACGSGRGPAPI KVIADKGTPFADLLVPKLTASVTDGAVGVTVDAPVSVTAADGVLAAVTMVNDNGRPVA GRLSPDGLRWSTTEQLGYNRRYTLNATALGLGGAATRQLTFQTSSPAHLTMPYVMPGD GEVVGVGEPVAIRFDENIADRGAAEKAIKITTNPPVEGAFYWLNNREVRWRPEHFWKP GTAVDVAVNTYGVDLGEGMFGEDNVQTHFTIGDEVIATADDNTKILTVRVNGEVVKSM PTSMGKDSTPTANGIYIVGSRYKHIIMDSSTYGVPVNSPNGYRTDVDWATQISYSGVF VHSAPWSVGAQGHTNTSHGCLNVSPSNAQWFYDHVKRGDIVEVVNTVGGTLPGIDGLG DWNIPWDQWRAGNAKA" tRNA 2806703..2806774 /locus_tag="BQ2027_LYSU" /product="tRNA-Lys" /note="lysU, len: 73 nt. Equivalent to lysU, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Lys, anticodon ctt." CDS 2806993..2808471 /codon_start=1 /transl_table=11 /gene="PE26" /locus_tag="BQ2027_MB2548" /product="pe family protein pe26" /note="Mb2548, PE26, len: 492 aa. Equivalent to Rv2519, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 492 aa overlap). Member of the M. tuberculosis PE family, highly similar to many e.g. Q50630|YP91_MYCTU|Rv2591|MT2668.1|MTCY227.10c (543 aa), FASTA scores: opt: 848, E(): 3e-30, (39.55% identity in 445 aa overlap). Mb2548 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1E2" /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR001969" /db_xref="InterPro:IPR021109" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1E2" /protein_id="SIU01164.1" /translation="MSRLIVAPDWLASAAAEVQSIGSALSAANAAAAAPTTLLVAAAE DEVSAAAAALFANYGREYQTLSVRFASLDQQFAQALNSAAASYQTAEATGASLVQTAT QGVLGVINAPTEFMFGRSLIGDGADGTAASPIGEPGGILYGDGGNGYSQTTPGAVGGA GGSAGFIGNGGAAGAGGPGAGGGTGGLGGWLWGNNGAAGTGDPVNVAVPLRVENNFPL VNLLVNRGPTVPILLDTGSSSLVIPFWKIGWQNLGLPTGFDVVHYGNGVSIVYADVPT TVDFGGGAATTPTSVHVGILPYPRNLDSLVLIASGGAFGPNGNGILGIGPNVGSYAVS GPGNVVTTDLPGQLNEGTLIDIPGGYMQFGPNTGTPITSVTGAPITVLNVQIGGYDPN GGYWSLPSIFDSGGNHGTLPAVILGTGQTTGYAPPGTVISISIHDNQTLLYQYTTTAS NSPVVTADPRLNTGLTPFLLGPVYISNNPSGVGTVVFNYPPP" CDS complement(2808596..2808823) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2549C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2549c, -, len: 75 aa. Equivalent to Rv2520c, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 75 aa overlap). Possible conserved membrane protein, equivalent to O07706|MLCL383.32 HYPOTHETICAL 10.0 KDA PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 290, E(): 4.1e-14, (58.65% identity in 75 aa overlap); and Q9CCU6|ML0425 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (75 aa), FASTA scores: opt: 286, E(): 6.6e-14, (57.35% identity in 75 aa overlap). Protein product from Mb2549c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2549c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3J5" /db_xref="InterPro:IPR022062" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J5" /protein_id="SIU01165.1" /translation="MVDRDPNTIKQEIDQTRDQLAATIDSLAERANPRRLADDAKTRV IAFLRKPIVTVSLVGIGSVVVVVVIHKIRNR" CDS 2808892..2809365 /codon_start=1 /transl_table=11 /gene="bcp" /locus_tag="BQ2027_MB2550" /product="PROBABLE BACTERIOFERRITIN COMIGRATORY PROTEIN BCP" /note="Mb2550, bcp, len: 157 aa. Equivalent to Rv2521, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 157 aa overlap). Probable bcp, bacterioferritin comigratory protein, equivalent to O07705|BCP|ML0424 from Mycobacterium leprae (161 aa), FASTA scores: opt: 829, E(): 6.8e-46, (79.6% identity in 157 aa overlap). Also highly similar to Q9KZQ2|SCE6.38 HYPOTHETICAL 16.8 KDA PROTEIN Streptomyces coelicolor (155 aa), FASTA scores: opt: 727, E(): 2e-39, (69.5% identity in 154 aa overlap); P23480|AAG57590|BCP_ECOLI|B2480|BAB36765|Z3739|ECS3342 BACTERIOFERRITIN COMIGRATORY PROTEIN from Escherichia coli strain K12 (156 aa), FASTA scores: opt: 513, E(): 8.3e-26, (48.3% identity in 149 aa overlap); Q9RW23|DR0846 BACTERIOFERRITIN COMIGRATORY PROTEIN from Deinococcus radiodurans (175 aa), FASTA scores: opt: 465, E(): 1e-22, (46.5% identity in 157 aa overlap); P44411|BCP_HAEIN|HI0254 BACTERIOFERRITIN COMIGRATORY PROTEIN from Haemophilus influenzae (155 aa), FASTA scores: opt: 453, E(): 5.3e-22, (47.5% identity in 139 aa overlap); etc. Also similar to Mycobacterium tuberculosis Rv1608c|MTV046.06|bcpB and Rv2238c|MTCY427.19c|hpE. Protein product from Mb2550 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2550 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y297" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR024706" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y297" /protein_id="SIU01166.1" /translation="MTKTTRLTPGDKAPAFTLPDADGNNVSLADYRGRRVIVYFYPAA STPGCTKQACDFRDNLGDFTTAGLNVVGISPDKPEKLATFRDAQGLTFPLLSDPDREV LTAWGAYGEKQMYGKTVQGMIRSTFVVDEDGKIVVAQYNVKATGHVAKLRRDLSV" CDS complement(2809337..2810749) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2551C" /product="N-acyl-L-amino acid amidohydrolase (EC" /EC_number="3.5.1.14" /note="Mb2551c, -, len: 470 aa. Equivalent to Rv2522c, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 470 aa overlap). Conserved hypothetical protein, equivalent, but longer 20 aa, to Q9X7E4|ML1193|MLCB458.08 from HYPOTHETICAL 46.6 KDA PROTEIN Mycobacterium leprae (442 aa), FASTA scores: opt: 2521, E(): 4.1e-142, (86.35% identity in 440 aa overlap). Also similar to various proteins e.g. Q9K425|SCG22.20 PUTATIVE PEPTIDASE from Streptomyces coelicolor (451 aa), FASTA scores: opt: 1097, E(): 1.1e-57, (42.5% identity in 451 aa overlap); Q9FCK3|2SC3B6.09 PUTATIVE PEPTIDASE from Streptomyces coelicolor (470 aa), FASTA scores: opt: 669, E(): 2.8e-32, (34.2% identity in 462 aa overlap); Q98AF9|MLL6018 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (486 aa), FASTA scores: opt: 622, E(): 1.7e-29, (33.95% identity in 442 aa overlap); Q9RSU7|DR2025 ARGE/DAPE/ACY1 FAMILY PROTEIN from Deinococcus radiodurans (459 aa), FASTA scores: opt: 616, E(): 3.7e-29, (34.15% identity in 442 aa overlap); etc (include some similarity to hypothetical proteins from C. elegans and yeast). Alternative start possible at 6687 but then no RBS obvious. Protein product from Mb2551c detected using SWATH mass spectrometry. Mb2551c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1P4" /db_xref="InterPro:IPR002933" /db_xref="InterPro:IPR011650" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P4" /protein_id="SIU01167.1" /translation="MSASRRRIASKSGFSCDSASARELVERVREVLPSVRCDLEELVR IESVWADPDRRDEVHRSARAVADLLSQAGFDDVRIVSERGAPAVIARYPAPPGAPTVL LYAHHDVQPEGDRGQWVSPPFEPTERGGRLYGRGTADDKAGIATHVAAFWAHGGRPPV GVTVFVEGEEESGSPSLGRLLAAHRDALAADVIVIADSDNWSTDIPALTVSLRGMADC VVEVATLDHGLHSGLWGGVVPDALTVLVRLLASLHDDDGNVAVAGMHESTAARVDYPA GRVRAESGLLDGVSEIGTGSVPQRLWAKPAITVIGIDTTSVAAASNTLIPRARAKISI RVAPGGDATAHLDAVEAHLRRHAPWGAQVTVTRGEVGQPYAIEASGPVYDAARSAFRQ AWGADPIDMGMGGSIPFIAEFAAAFPQATILVTGVEDPGTQAHSVNESLHLGVLERAA TAEALLLAKLAAIPTGRAEA" CDS complement(2810746..2811138) /codon_start=1 /transl_table=11 /gene="acpS" /locus_tag="BQ2027_MB2552C" /product="HOLO-[ACYL-CARRIER PROTEIN] SYNTHASE ACPS (HOLO-ACP SYNTHASE) (CoA:APO-[ACP]PANTETHEINEPHOSPHOTRANSFERASE) (CoA:APO-[ACYL-CARRIER PROTEIN]PANTETHEINEPHOSPHOTRANSFERASE)" /note="Mb2552c, acpS, len: 130 aa. Equivalent to Rv2523c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). acpS, holo-[Acyl Carrier Protein] synthase (EC 2.7.8.7) (see citation below), equivalent to Q9X7E3|ACPS_MYCLE|ML1192|MLCB458.07 HOLO-[ACYL-CARRIER PROTEIN] SYNTHASE from Mycobacterium leprae (130 aa), FASTA scores: opt: 732, E(): 5.5e-42, (87.5% identity in 128 aa overlap). Also similar to others e.g. O86785|ACPS_STRCO|SC6G4.22c from Streptomyces coelicolor (123 aa), FASTA scores: opt: 204, E(): 6.6e-07, (36.7% identity in 139 aa overlap); Q9KPB6|VC2457 from Vibrio cholerae (126 aa), FASTA scores: opt: 163, E(): 0.00036, (32.55% identity in 129 aa overlap); P24224|ACPS_ECOLI|DPJ|B2563 from Escherichia coli strain K12 (125 aa), FASTA scores: opt: 151, E(): 0.0022, (30.55% identity in 131 aa overlap); etc. BELONGS TO THE ACPS FAMILY. Protein product from Mb2552c detected using SWATH mass spectrometry. Mb2552c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4W9" /db_xref="InterPro:IPR002582" /db_xref="InterPro:IPR004568" /db_xref="InterPro:IPR008278" /db_xref="InterPro:IPR037143" /db_xref="UniProtKB/Swiss-Prot:P0A4W9" /protein_id="SIU01168.1" /translation="MGIVGVGIDLVSIPDFAEQVDQPGTVFAETFTPGERRDASDKSS SAARHLAARWAAKEAVIKAWSGSRFAQRPVLPEDIHRDIEVVTDMWGRPRVRLTGAIA EYLADVTIHVSLTHEGDTAAAVAILEAP" CDS complement(2811331..2820540) /codon_start=1 /transl_table=11 /gene="fas" /locus_tag="BQ2027_MB2553C" /product="PROBABLE FATTY ACID SYNTHASE FAS (FATTY ACID SYNTHETASE)" /note="Mb2553c, fas, len: 3069 aa. Equivalent to Rv2524c, len: 3069 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 3069 aa overlap). Probable fas, Fatty Acid Synthase (EC 2.3.1.-), equivalent to Q9X7E2|FAS|ML1191 PUTATIVE TYPE I FATTY ACID SYNTHASE from Mycobacterium leprae (3076 aa), FASTA scores: opt: 17484, E(): 0, (85.8% identity in 3081 aa overlap). Also similar to others e.g. Q04846|FAS|Q59497 from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (3104 aa), FASTA scores: opt: 3981, E(): 5.5e-203, (49.8% identity in 3099 aa overlap); Q48926|FAS from Mycobacterium bovis (2796 aa), FASTA scores: opt: 2098, E(): 3.9e-103, (59.7% identity in 2862 aa overlap) (see first citation below); P34731|FAS1_CANAL FATTY ACID SYNTHASE SUBUNIT BETA from Candida albicans (Yeast) (2037 aa), FASTA scores: opt: 955, E(): 1.3e-42, (27.4% identity in 1926 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00606 Beta-ketoacyl synthases active site. Protein product from Mb2553c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2553c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1F3" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR002539" /db_xref="InterPro:IPR003965" /db_xref="InterPro:IPR013565" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01169.1" /translation="MTIHEHDRVSADRGGDSPHTTHALVDRLMAGEPYAVAFGGQGSA WLETLEELVSATGIETELATLVGEAELLLDPVTDELIVVRPIGFEPLQWVRALAAEDP VPSDKHLTSAAVSVPGVLLTQIAATRALARQGMDLVATPPVAMAGHSQGVLAVEALKA GGARDVELFALAQLIGAAGTLVARRRGISVLGDRPPMVSVTNADPERIGRLLDEFAQD VRTVLPPVLSIRNGRRAVVITGTPEQLSRFELYCRQISEKEEADRKNKVRGGDVFSPV FEPVQVEVGFHTPRLSDGIDIVAGWAEKAGLDVALARELADAILIRKVDWVDEITRVH AAGARWILDLGPGDILTRLTAPVIRGLGIGIVPAATRGGQRNLFTVGATPEVARAWSS YAPTVVRLPDGRVKLSTKFTRLTGRSPILLAGMTPTTVDAKIVAAAANAGHWAELAGG GQVTEEIFGNRIEQMAGLLEPGRTYQFNALFLDPYLWKLQVGGKRLVQKARQSGAAID GVVISAGIPDLDEAVELIDELGDIGISHVVFKPGTIEQIRSVIRIATEVPTKPVIMHV EGGRAGGHHSWEDLDDLLLATYSELRSRANITVCVGGGIGTPRRAAEYLSGRWAQAYG FPLMPIDGILVGTAAMATKESTTSPSVKRMLVDTQGTDQWISAGKAQGGMASSRSQLG ADIHEIDNSASRCGRLLDEVAGDAEAVAERRDEIIAAMAKTAKPYFGDVADMTYLQWL RRYVELAIGEGNSTADTASVGSPWLADTWRDRFEQMLQRAEARLHPQDFGPIQTLFTD AGLLDNPQQAIAALLARYPDAETVQLHPADVPFFVTLCKTLGKPVNFVPVIDQDVRRW WRSDSLWQAHDARYDADAVCIIPGTASVAGITRMDEPVGELLDRFEQAAIDEVLGAGV EPKDVASRRLGRADVAGPLAVVLDAPDVRWAGRTVTNPVHRIADPAEWQVHDGPENPR ATHSSTGARLQTHGDDVALSVPVSGTWVDIRFTLPANTVDGGTPVIATEDATSAMRTV LAIAAGVDSPEFLPAVANGTATLTVDWHPERVADHTGVTATFGEPLAPSLTNVPDALV GPCWPAVFAAIGSAVTDTGEPVVEGLLSLVHLDHAARVVGQLPTVPAQLTVTATAANA TDTDMGRVVPVSVVVTGADGAVIATLEERFAILGRTGSAELADPARAGGAVSANATDT PRRRRRDVTITAPVDMRPFAVVSGDHNPIHTDRAAALLAGLESPIVHGMWLSAAAQHA VTATDGQARPPARLVGWTARFLGMVRPGDEVDFRVERVGIDQGAEIVDVAARVGSDLV MSASARLAAPKTVYAFPGQGIQHKGMGMEVRARSKAARKVWDTADKFTRDTLGFSVLH VVRDNPTSIIASGVHYHHPDGVLYLTQFTQVAMATVAAAQVAEMREQGAFVEGAIACG HSVGEYTALACVTGIYQLEALLEMVFHRGSKMHDIVPRDELGRSNYRLAAIRPSQIDL DDADVPAFVAGIAESTGEFLEIVNFNLRGSQYAIAGTVRGLEALEAEVERRRELTGGR RSFILVPGIDVPFHSRVLRVGVAEFRRSLDRVMPRDADPDLIIGRYIPNLVPRLFTLD RDFIQEIRDLVPAEPLDEILADYDTWLRERPREMARTVFIELLAWQFASPVRWIETQD LLFIEEAAGGLGVERFVEIGVKSSPTVAGLATNTLKLPEYAHSTVEVLNAERDAAVLF ATDTDPEPEPEEDEPVAESPAPDVVSEAAPVAPAASSAGPRPDDLVFDAADATLALIA LSAKMRIDQIEELDSIESITDGASSRRNQLLVDLGSELNLGAIDGAAESDLAGLRSQV TKLARTYKPYGPVLSDAINDQLRTVLGPSGKRPGAIAERVKKTWELGEGWAKHVTVEV ALGTREGSSVRGGAMGHLHEGALADAASVDKVIDAAVASVAARQGVSVALPSAGSGGG ATIDAAALSEFTDQITGREGVLASAARLVLGQLGLDDPVNALPAAPDSELIDLVTAEL GADWPRLVAPVFDPKKAVVFDDRWASAREDLVKLWLTDEGDIDADWPRLAERFEGAGH VVATQATWWQGKSLAAGRQIHASLYGRIAAGAENPEPGRYGGEVAVVTGASKGSIAAS VVARLLDGGATVIATTSKLDEERLAFYRTLYRDHARYGAALWLVAANMASYSDVDALV EWIGTEQTESLGPQSIHIKDAQTPTLLFPFAAPRVVGDLSEAGSRAEMEMKVLLWAVQ RLIGGLSTIGAERDIASRLHVVLPGSPNRGMFGGDGAYGEAKSALDAVVSRWHAESSW AARVSLAHALIGWTRGTGLMGHNDAIVAAVEEAGVTTYSTDEMAALLLDLCDAESKVA AARSPIKADLTGGLAEANLDMAELAAKAREQMSAAAAVDEDAEAPGAIAALPSPPRGF TPAPPPQWDDLDVDPADLVVIVGGAEIGPYGSSRTRFEMEVENELSAAGVLELAWTTG LIRWEDDPQPGWYDTESGEMVDESELVQRYHDAVVQRVGIREFVDDGAIDPDHASPLL VSVFLEKDFAFVVSSEADARAFVEFDPEHTVIRPVPDSTDWQVIRKAGTEIRVPRKTK LSRVVGGQIPTGFDPTVWGISADMAGSIDRLAVWNMVATVDAFLSSGFSPAEVMRYVH PSLVANTQGTGMGGGTSMQTMYHGNLLGRNKPNDIFQEVLPNIIAAHVVQSYVGSYGA MIHPVAACATAAVSVEEGVDKIRLGKAQLVVAGGLDDLTLEGIIGFGDMAATADTSMM RGRGIHDSKFSRPNDRRRLGFVEAQGGGTILLARGDLALRMGLPVLAVVAFAQSFGDG VHTSIPAPGLGALGAGRGGKDSPLARALAKLGVAADDVAVISKHDTSTLANDPNETEL HERLADALGRSEGAPLFVVSQKSLTGHAKGGAAVFQMMGLCQILRDGVIPPNRSLDCV DDELAGSAHFVWVRDTLRLGGKFPLKAGMLTSLGFGHVSGLVALVHPQAFIASLDPAQ RADYQRRADARLLAGQRRLASAIAGGAPMYQRPGDRRFDHHAPERPQEASMLLNPAAR LGDGEAYIG" CDS complement(2821060..2821782) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2554C" /product="putative secreted protein" /note="Mb2554c, -, len: 240 aa. Equivalent to Rv2525c, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 240 aa overlap). Conserved hypothetical protein, equivalent to Q9X7E1|ML1190|MLCB458.05 HYPOTHETICAL 25.3 KDA PROTEIN from Mycobacterium leprae (239 aa), FASTA scores: opt: 1358, E(): 1e-75, (82.15% identity in 241 aa overlap). Protein product from Mb2554c detected using SWATH mass spectrometry. Mb2554c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR015020" /db_xref="InterPro:IPR017853" /db_xref="InterPro:IPR019546" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G1" /protein_id="SIU01170.1" /translation="MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDY AAGVIPASQIRAAGAVGAIRYVSDRRPGGAWMLGKPIQLSEARDLSGNGLKIVSCYQY GKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYEQYKNQIVPYL RSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGSPKGYTHPAAHLHQVEIDK RKVGGVGVDVNQILKPQFGQWA" CDS 2822299..2822526 /codon_start=1 /transl_table=11 /gene="vapb17" /locus_tag="BQ2027_MB2555" /product="possible antitoxin vapb17" /note="Mb2555, -, len: 75 aa. Equivalent to Rv2526, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in aa overlap). Hypothetical unknown protein. Protein product from Mb2555 detected using SWATH mass spectrometry. Mb2555 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G3" /protein_id="SIU01171.1" /translation="MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQ AAARRRRIVDHLAHAGTHVDADVLLSEQAWR" CDS 2822523..2822924 /codon_start=1 /transl_table=11 /gene="vapc17" /locus_tag="BQ2027_MB2556" /product="possible toxin vapc17" /note="Mb2556, -, len: 133 aa. Equivalent to Rv2527, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Hypothetical protein, showing some similarity to hypothetical proteins from Mycobacterium tuberculosis e.g. P95007|MTCY159.10c|Rv2546 (137 aa), FASTA scores: opt: 206, E(): 1.4e-07, (38.0% identity in 100 aa overlap); O33299|MTV002.22c|Rv2757c (138 aa), FASTA scores: opt: 201, E(): 3.1e-07, (35.7% identity in 126 aa overlap); and P96411|MTCY08D5.24c|Rv0229c (226 aa), FASTA scores: opt: 153, E(): 0.0011, (32.8% identity in 128 aa overlap). Protein product from Mb2556 detected using SWATH mass spectrometry. Mb2556 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1G9" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G9" /protein_id="SIU01172.1" /translation="MTTWILDKSAHVRLVAGATPPAGIDLTDLAICDIGELEWLYSAR SATDYDSQQTSLRAYQILRAPSDIFDRVRHLQRDLAHHRGMWHRTPLPDLFIAETALH HRAGVLHHDRDYKRIAVVRPGFQACELSRGR" CDS complement(2822959..2823879) /codon_start=1 /transl_table=11 /gene="mrr" /locus_tag="BQ2027_MB2557C" /product="PROBABLE RESTRICTION SYSTEM PROTEIN MRR" /note="Mb2557c, mrr, len: 306 aa. Equivalent to Rv2528c, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 306 aa overlap). Probable mrr, restriction system protein, similar to other mrr proteins e.g. Q9RWS8|DR0587|MRR from Deinococcus radiodurans (306 aa), FASTA scores: opt: 776, E(): 4.2e-40, (40.45% identity in 309 aa overlap); P24202|MRR_ECOLI|B4351 from Escherichia coli strain K12 (304 aa), FASTA scores: opt: 647, E(): 2.9e-32, (35.25% identity in 309 aa overlap); Q9RX07|DR0508 from Deinococcus radiodurans (336 aa), FASTA scores: opt: 456, E(): 1.3e-20, (37.3% identity in 319 aa overlap); etc. Protein product from Mb2557c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y1G5" /db_xref="InterPro:IPR007560" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR025745" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1G5" /protein_id="SIU01173.1" /translation="MTIPDAQTLMRPILAYLADGQAKSAKDVIAAMSDEFGLSDDERA QMLPSGRQRTMYDRVHWSLTHMSQAGLLDRPTRGHVQVTDTGRQVLKAHPERVDMAVL REFPSYIAFRERTKAKQPVDATAKRPSGDDVQVSPEDLIDAALAENRAAVEGEILKKA LTLSPTGFEDLVIRLLEAMGYGRAGAVERTSASGDAGIDGIISQDPLGLDRIYVQAKR YAVDQTIGRPKIHEFAGALLGKQGDRGVYITTSSFSRGARQEAERINARIELIDGARL AELLVRYRVGVQAVQTVELLRLDEDFFDGL" CDS 2824083..2825474 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2558" /product="ERCC4-type nuclease" /note="Mb2558, -, len: 463 aa. Equivalent to Rv2529, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 463 aa overlap). Hypothetical unknown protein. Note that C-terminal part is similar to short region of Q53609|MTS1_STRAL|SALIM MODIFICATION METHYLASE SALI from Streptomyces albus G (587 aa), FASTA scores: opt: 170, E(): 0.016, (59.45% identity in 37 aa overlap). Protein product from Mb2558 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y1F9" /db_xref="InterPro:IPR006166" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR024412" /db_xref="InterPro:IPR042254" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1F9" /protein_id="SIU01174.1" /translation="MHLAHRVASSRDTPSSSATPNAVSGSASNAADRPCLVRPPTAPP WAHGPRLRRDPTGGGSTPSIVLSRSTDRSKDGHRIVPAGARKSGVRASTERLPSTRKT TRSPDCRPSASRTAFGTVTCPFDVTMGSSECLLHRCRTPPVPSHSVELLVAANPAEDS RLPYLIRLPVGAGLVFATSDVWPRTKALYCHRLDIADWPADPVVVDRVELRSCSRRGA AIDVVAARARENRSQLVHTMARGRQVVFWQSPKTRKQSRPGVRTPTARAAGIPELHIV VDAHERYPYTFADKPAKTTREALPCGDYGLKVAGQLVAAVERKALADLTSGVLNGNLK YQLTELAALPRAAVVVEDRYSEIFAHSFARPTAIADGLAELQIGFPNVPIVFCQTRKL AQEYTYRYLAAALTWFVDDADATTVFEPAAAEPEPSSAELRAWAKSVGLPVSDRGRLR PQILQAWRAAHPR" CDS complement(2825475..2825894) /codon_start=1 /transl_table=11 /gene="vapc39" /locus_tag="BQ2027_MB2559C" /product="possible toxin vapc39. contains pin domain." /note="Mb2559c, -, len: 139 aa. Equivalent to Rv2530c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein, highly similar to two HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis (strains H37Rv and CDC1551): O53219|Rv2494|MTV008.50 (141 aa), FASTA scores: opt: 380, E(): 3.6e-19, (48.0% identity in 125 aa overlap); and O53372|Rv3320c|MTV016.20c (142 aa), FASTA scores: opt: 286, E(): 9.3e-13, (41.35% identity in 133 aa overlap); and similar to others e.g. O07760|Rv0617|MTCY19H5.04c (133 aa), FASTA scores: opt: 158, E(): 0.00048, (39.55% identity in 129 aa overlap). Also some similarity with CAC48798|SMB20412 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (54 aa), FASTA scores: opt: 184, E(): 3.7e-06, (53.85% identity in 52 aa overlap); and CAC48797|SMB20411 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (82 aa), FASTA scores: opt: 170, E(): 4.8e-05, (44.45% identity in 63 aa overlap). Protein product from Mb2559c detected using SWATH mass spectrometry. Mb2559c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3K9" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K9" /protein_id="SIU01175.1" /translation="MTALLDVNVLIALGWPNHVHHAAAQRWFTQFSSNGWATTPITEA GYVRISSNRSVMQVSTTPAIAIAQLAAMTSLAGHTFWPDDVPLIVGSAGDRDAVSNHR RVTDCHLIALAARYGGRLVTFDAALADSASAGLVEVL" CDS complement(2825891..2826115) /codon_start=1 /transl_table=11 /gene="vapb39" /locus_tag="BQ2027_MB2559AC" /product="possible antitoxin vapb39" /note="Mb2559Ac, -, len: 74 aa. Equivalent to Rv2530A, len: 74 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 aa overlap). Conserved hypothetical protein, similar to Q9CCR7|ML0525 HYPOTHETICAL PROTEIN from Mycobacterium leprae (58 aa), FASTA scores: opt: 179, E(): 1.8e-06, (63.65% identity in 44 aa overlap). Highly similar to O53218|Rv2493 from Mycobacterium tuberculosis (73 aa), FASTA scores: opt: 240, E(): 5.7e-11, (56.75% identity in 74 aa overlap); and Q92WE1|RB0399|SMB20413 HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti)p lasmid pSymB (megaplasmid 2) (75 aa), FASTA scores: opt: 226, E(): 6.5e-10, (56.00% identity in 75 aa overlap). Protein product from Mb2559Ac detected using SWATH mass spectrometry. Mb2559Ac found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A5" /protein_id="SIU01176.1" /translation="MRTTLQIDDDVLEDARSIARSEGKSVGAVISELARRSLRPVGIV EVDGFPVFDVPPDAPTVTSEDVVRALEDDV" CDS complement(2826146..2828989) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2560C" /product="PROBABLE AMINO ACID DECARBOXYLASE" /note="Mb2560c, -, len: 947 aa. Equivalent to Rv2531c, len: 947 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 947 aa overlap). Probable amino acid decarboxylase (EC 4.1.1.-), equivalent to Q9CCR8|ADI|ML0524 PUTATIVE AMINO ACID DECARBOXYLASE from Mycobacterium leprae (950 aa), FASTA scores: opt: 5426, E(): 0, (86.45% identity in 951 aa overlap). Also similar to other amino acid decarboxylases (but longer in N-terminus) e.g. Q9I2S7|PA1818 PROBABLE ORN/ARG/LYS AMINO ACID DECARBOXYLASE from Pseudomonas aeruginosa (751 aa), FASTA scores: opt: 434, E(): 2.5e-19, (29.15% identity in 738 aa overlap); Q9CML3|SPEF|PM0806 ORNITHINE DECARBOXYLASE from Pasteurella multocida (720 aa), FASTA scores: opt: 402, E(): 2.4e-17, (24.85% identity in 752 aa overlap); P21169|DCOR_ECOLI|SPEC|B2965|BAB37264|ECS3841|AA G58096 ORNITHINE DECARBOXYLASE ISOZYME (CONSTITUTIVE ENZYME) from Escherichia coli strain K12 (711 aa), FASTA scores: opt: 396, E(): 5.6e-17, (28.0% identity in 646 aa overlap); P44317|DCOR_HAEIN|SPEF|HI0591 ORNITHINE DECARBOXYLASE from Haemophilus influenzae (720 aa), FASTA scores: opt: 393, E(): 8.8e-17, (25.05% identity in 743 aa overlap); etc. SEEMS TO BELONG TO FAMILY 1 OF ORNITHINE, LYSINE, AND ARGININE DECARBOXYLASES. Note that previously known as adi. Protein product from Mb2560c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2560c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1Q3" /db_xref="InterPro:IPR000310" /db_xref="InterPro:IPR008286" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR036633" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q3" /protein_id="SIU01177.1" /translation="MNPNSVRPRRLHVSALAAVANPSYTRLDTWNLLDDACRHLAEVD LAGLDTTHDVARAKRLMDRIGAYERYWLYPGAQNLATFRAHLDSHSTVRLTEEVSLAV RLLSEYGDRTALFDTSASLAEQELVAQAKQQQFYTVLLADDSPATAPDSLAECLRQLR NPADEVQFELLVVASIEDAITAVALNGEIQAAIIRHDLPLRSRDRVPLMTTLLGTDGD EAVANETHDWVECAEWIRELRPHIDLYLLTDESIAAETQDEPDVYDRTFYRLNDVTDL HSTVLAGLRNRYATPFFDALRAYAAAPVGQFHALPVARGASIFNSKSLHDMGEFYGRN IFMAETSTTSGGLDSLLDPHGNIKTAMDKAAVTWNANQTYFVTNGTSTANKIVVQALT RPGDIVLIDRNCHKSHHYGLVLAGAYPMYLDAYPLPQYAIYGAVPLRTIKQALLDLEA AGQLHRVRMLLLTNCTFDGVVYNPRRVMEEVLAIKPDICFLWDEAWYAFATAVPWARQ RTAMIAAERLEQMLSTAEYAEEYRNWCASMDGVDRSEWVDHRLLPDPNRARVRVYATH STHKSLSALRQASMIHVRDQDFKALTRDAFGEAFLTHTSTSPNQQLLASLDLARRQVD IEGFELVRHVYNMALVFRHRVRKDRLISKWFRILDESDLVPDAFRSSTVSSYRQVRQG ALADWNEAWRSDQFVLDPTRLTLFIGATGMNGYDFREKILMERFGIQINKTSINSVLL IFTIGVTWSSVHYLLDVLRRVAIDLDRSQKAASGADLALHRRHVEEITQDLPHLPDFS EFDLAFRPDDASSFGDMRSAFYAGYEEADREYVQIGLAGRRLAEGKTLVSTTFVVPYP PGFPVLVPGQLVSKEIIYFLAQLDVKEIHGYNPDLGLSVFTQAALARMEAARNAVATV GAALPAFEVPRDASALNGTVNGDSVLQGVAEDA" CDS complement(2829061..2829462) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2561C" /product="Transcription termination factor" /note="Mb2561c, -, len: 133 aa. Equivalent to Rv2532c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Hypothetical unknown protein, equivalent to AAK46918 from Mycobacterium tuberculosis strain CDC1551 but shorter 157 aa. Protein product from Mb2561c detected using SWATH mass spectrometry. Mb2561c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1W4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W4" /protein_id="SIU01178.1" /translation="MTRLELRVVVAAVLAATVVLGAVVCAAYGLTIVASAMSIYALGV GAWLYHAIERLILARRISTVRTAAKPLQPLLPVMAAIMGLTQAVVRSLGDVTDLPARR RELSQLPVLRWVDNSGNRANRRIADSDDLAD" CDS complement(2829462..2829932) /codon_start=1 /transl_table=11 /gene="nusB" /locus_tag="BQ2027_MB2562C" /product="N UTILIZATION SUBSTANCE PROTEIN NUSB (NUSB PROTEIN)" /note="Mb2562c, nusB, len: 156 aa. Equivalent to Rv2533c, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 156 aa overlap). nusB, N utilization substance protein (see citation below), equivalent to Q9CCR9|NUSB_MYCLE|ML0523 N UTILIZATION SUBSTANCE PROTEIN B from Mycobacterium leprae (190 aa), FASTA scores: opt: 749, E(): 2.6e-41, (75.7% identity in 148 aa overlap). Also highly similar to others e.g. Q9KXR0|SC9C5.14 from Streptomyces coelicolor (142 aa), FASTA scores: opt: 358, E(): 2.7e-16, (45.0% identity in 140 aa overlap); P54520|NUSB_BACSU from Bacillus subtilis (131 aa), FASTA scores: opt: 315, E(): 1.5e-13, (39.55% identity in 129 aa overlap); O83979|NUSB_TREPA|TP1015 from Treponema pallidum (141 aa), FASTA scores: opt: 268, E(): 1.6e-10, (36.95% identity in 138 aa overlap); etc. BELONGS TO THE NUSB FAMILY. Protein product from Mb2562c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2562c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYC8" /db_xref="InterPro:IPR006027" /db_xref="InterPro:IPR011605" /db_xref="InterPro:IPR035926" /db_xref="UniProtKB/Swiss-Prot:Q7TYC8" /protein_id="SIU01179.1" /translation="MSDRKPVRGRHQARKRAVDLLFEAEVRGISAAEVVDTRAALAEA KPDIARLHPYTAAVARGVSEHAAHIDDLITAHLRGWTLDRLPAVDRAILRVSVWELLH AADVPEPVVVDEAVQLAKELSTDDSPGFVNGVLGQVMLVTPQLRAAAQAVRGGA" CDS complement(2829935..2830498) /codon_start=1 /transl_table=11 /gene="efp" /locus_tag="BQ2027_MB2563C" /product="PROBABLE ELONGATION FACTOR P EFP" /note="Mb2563c, efp, len: 187 aa. Equivalent to Rv2534c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 187 aa overlap). Probable efp, elongation factor P, equivalent to Q9CCS0|EFP|ML0522 ELONGATION FACTOR P from Mycobacterium leprae (187 aa), FASTA scores: opt: 1158, E(): 2.1e-67, (94.1% identity in 186 aa overlap). Also highly similar to many e.g. Q45288|EFP_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (187 aa), FASTA scores: opt: 843, E(): 3.4e-47, (69.5% identity in 187 aa overlap); Q9KXQ9|EFP from Streptomyces coelicolor (188 aa), FASTA scores: opt: 833, E(): 1.5e-46, (67.0% identity in 188 aa overlap); P49778|EFP_BACSU from Bacillus subtilis (185 aa), FASTA scores: opt: 607, E(): 4.6e-32, (47.8% identity in 182 aa overlap); P33398|EFP_ECOLI|B4147 from Escherichia coli strain K12 (187 aa), FASTA scores: opt: 503, E(): 1.8e-27, (42.3% identity in 182 aa overlap); etc. BELONGS TO THE ELONGATION FACTOR P FAMILY. Protein product from Mb2563c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2563c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P64035" /db_xref="InterPro:IPR001059" /db_xref="InterPro:IPR008991" /db_xref="InterPro:IPR011768" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR013185" /db_xref="InterPro:IPR013852" /db_xref="InterPro:IPR014722" /db_xref="InterPro:IPR015365" /db_xref="InterPro:IPR020599" /db_xref="UniProtKB/Swiss-Prot:P64035" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01180.1" /translation="MATTADFKNGLVLVIDGQLWTITEFQHVKPGKGPAFVRTKLKNV LSGKVVDKTFNAGVKVDTATVDRRDTTYLYRDGSDFVFMDSQDYEQHPLPEALVGDAA RFLLEGMPVQVAFHNGVPLYIELPVTVELEVTHTEPGLQGDRSSAGTKPATLQTGAQI NVPLFINTGDKLKVDSRDGSYLGRVNA" CDS complement(2830508..2831626) /codon_start=1 /transl_table=11 /gene="pepQ" /locus_tag="BQ2027_MB2564C" /product="probable cytoplasmic peptidase pepq" /note="Mb2564c, pepQ, len: 372 aa. Equivalent to Rv2535c, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 372 aa overlap). Probable pepQ, cytoplasmic peptidase (EC 3.4.-.-), equivalent to Q9CCS1|PEPQ|ML0521 PUTATIVE CYTOPLASMIC PEPTIDASE from Mycobacterium leprae (376 aa), FASTA scores: opt: 1954, E(): 1.1e-105, (82.7% identity in 376 aa overlap). Also similar to other peptidases e.g. P54518|YQHT_BACSU PUTATIVE PEPTIDASE (BELONGS TO PEPTIDASE FAMILY M24B) from Bacillus subtilis (353 aa), FASTA scores: opt: 808, E(): 1.6e-39, (39.65% identity in 368 aa overlap); Q9KXQ8|SC9C5.16c PUTATIVE PEPTIDASE from Streptomyces coelicolor (368 aa), FASTA scores: opt: 803, E(): 3.2e-39, (43.15% identity in 380 aa overlap); Q9K950|BH2800 XAA-PRO DIPEPTIDASE from Bacillus halodurans (355 aa), FASTA scores: opt: 801, E(): 4.1e-39, (39.45% identity in 365 aa overlap); etc. Note that second part of protein is similar to second part of MTCY49.29c|Rv2089c|MT2150|MTCY49.29c PROBABLE DIPEPTIDASE (EC 3.4.13.-; BELONGS TO PEPTIDASE FAMILY M24B) from Mycobacterium tuberculosis (375 aa) (33.9% identity in 354 aa overlap) BLAST RESULTS: Score: 142 bits (359), E: 4e-33, Identities: 86/224 (38%), Positives: 119/224 (52%), Gaps: 4/224 (1%). COULD BE BELONG TO PEPTIDASE FAMILY M24B. Protein product from Mb2564c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2564c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1H0" /db_xref="InterPro:IPR000587" /db_xref="InterPro:IPR000994" /db_xref="InterPro:IPR001131" /db_xref="InterPro:IPR001714" /db_xref="InterPro:IPR029149" /db_xref="InterPro:IPR036005" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H0" /protein_id="SIU01181.1" /translation="MTHSQRRDKLKAQIAASGLDAMLISDLINVRYLSGFSGSNGALL VFADERDAVLATDGRYRTQAASQAPDLEVAIERAVGRYLAGRAGEAGVGKLGFESHVV TVDGLDALAGALEGKNTELVRASGTVESLREVKDAGELALLRLACEAADAALTDLVAR GGLRPGRTERQVSRELEALMLDHGADAVSFETIVAAGANSAIPHHRPTDAVLQVGDFV KIDFGALVAGYHSDMTRTFVLGKAADWQLEIYQLVAEAQQAGRQALLPGAELRGVDAA ARQLIADAGYGEHFGHGLGHGVGLQIHEAPGIGVTSAGTLLAGSVVTVEPGVYLPGRG GVRIEDTLVVAGGTPKMPETAGQTPELLTRFPKELAIL" CDS 2831660..2832352 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2565" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2565, -, len: 230 aa. Equivalent to Rv2536, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CCS2|ML0520 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (202 aa), FASTA scores: opt: 812, E(): 2e-41, (63.2% identity in 201 aa overlap). Also similar in part to Q9HMD5|VNG2594c from Halobacterium sp. strain NRC-1 (117 aa), FASTA scores: opt: 33.6, E(): 1.8, (33.6% identity in 116 aa overlap); and perhaps AAK65752|SMA1996 PUTATIVE ABC TRANSPORTER PERMEASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (323 aa), FASTA scores: opt: 117, E(): 6.1, (30.6% identity in 121 aa overlap). Protein product from Mb2565 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2565 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1I2" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I2" /protein_id="SIU01182.1" /translation="MTNWMLRGLAFAAAMVVLRLFQGALINAWQMLSGLISLVLLLLF AIGGVVWGVMDGRADAKASPDPDRRQDLAMTWLLAGLVAGALSGAVAWLISLFYKAIY TGGPINELTTFAAFTALIVFLVGIVGVAVGRWLVDRQLAKAPVRHHGLAAEHERAADT DVFSAVRADDSPTGEMQVAQPEAQTAAVATVEREAPTEVIRTTESDTPTEVIRTDTEA DQTKPGDEPKKD" CDS complement(2832356..2832799) /codon_start=1 /transl_table=11 /gene="aroD" /locus_tag="BQ2027_MB2566C" /product="3-dehydroquinate dehydratase arod (aroq) (3-dehydroquinase) (type ii dhqase)" /note="Mb2566c, aroD, len: 147 aa. Equivalent to Rv2537c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). aroD (alternate gene name: aroQ), 3-dehydroquinate dehydratase (EC 4.2.1.10) (see citation below), equivalent to Q9CCS3|AROD|ML0519 3-DEHYDROQUINATE DEHYDRATASE from Mycobacterium leprae (145 aa), FASTA scores: opt: 803, E(): 3.4e-46, (85.9% identity in 142 aa overlap). Also highly similar to many e.g. P96750|AROQ_CORPS from Corynebacterium pseudotuberculosis (146 aa), FASTA scores: opt: 559, E(): 4.1e-30, (61.05% identity in 136 aa overlap); Q9K949|BH2801 from Bacillus halodurans (145 aa), FASTA scores: opt: 453, E(): 4e-23, (52.15% identity in 138 aa overlap); P54517|AROQ_BACSU|YQHS from Bacillus subtilis (148 aa), FASTA scores: opt: 419, E(): 7.1e-21, (45.3% identity in 139 aa overlap); etc. Contains PS01029 Dehydroquinase class II signature. BELONGS TO THE TYPE-II 3-DEHYDROQUINASE FAMILY. Protein product from Mb2566c detected using SWATH mass spectrometry. Mb2566c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Z7" /db_xref="InterPro:IPR001874" /db_xref="InterPro:IPR018509" /db_xref="InterPro:IPR036441" /db_xref="UniProtKB/Swiss-Prot:P0A4Z7" /protein_id="SIU01183.1" /translation="MSELIVNVINGPNLGRLGRREPAVYGGTTHDELVALIEREAAEL GLKAVVRQSDSEAQLLDWIHQAADAAEPVILNAGGLTHTSVALRDACAELSAPLIEVH ISNVHAREEFRRHSYLSPIATGVIVGLGIQGYLLALRYLAEHVGT" CDS complement(2832796..2833884) /codon_start=1 /transl_table=11 /gene="aroB" /locus_tag="BQ2027_MB2567C" /product="3-DEHYDROQUINATE SYNTHASE AROB" /note="Mb2567c, aroB, len: 362 aa. Equivalent to Rv2538c, len: 362 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 362 aa overlap). aroB, 3-dehydroquinate synthase (EC 4.2.3.4) (see citation below), equivalent to Q9CCS4|AROB_MYCLE|ML0518 3-DEHYDROQUINATE SYNTHASE from Mycobacterium leprae (361 aa), FASTA scores: opt: 2059, E(): 3.3e-117, (87.25% identity in 361 aa overlap). Also highly similar to many e.g. Q9KXQ6|AROB from Streptomyces coelicolor (363 aa), FASTA scores: opt: 1363, E(): 4e-75, (60.05% identity in 358 aa overlap); Q9X5D2|AROB_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (366 aa), FASTA scores: opt: 1154, E(): 1.7e-62, (50.95% identity in 359 aa overlap); P07639|AROB_ECOLI|B3389 from Escherichia coli strain K12 (362 aa), FASTA scores: opt: 771, E(): 2.4e-39, (40.6% identity in 345 aa overlap); etc. BELONGS TO THE DEHYDROQUINATE SYNTHASE FAMILY. Protein product from Mb2567c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2567c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Z5" /db_xref="InterPro:IPR016037" /db_xref="InterPro:IPR030960" /db_xref="InterPro:IPR030963" /db_xref="UniProtKB/Swiss-Prot:P0A4Z5" /protein_id="SIU01184.1" /translation="MTDIGAPVTVQVAVDPPYPVVIGTGLLDELEDLLADRHKVAVVH QPGLAETAEEIRKRLAGKGVDAHRIEIPDAEAGKDLPVVGFIWEVLGRIGIGRKDALV SLGGGAATDVAGFAAATWLRGVSIVHLPTTLLGMVDAAVGGKTGINTDAGKNLVGAFH QPLAVLVDLATLQTLPRDEMICGMAEVVKAGFIADPVILDLIEADPQAALDPAGDVLP ELIRRAITVKAEVVAADEKESELREILNYGHTLGHAIERRERYRWRHGAAVSVGLVFA AELARLAGRLDDATAQRHRTILSSLGLPVSYDPDALPQLLEIMAGDKKTRAGVLRFVV LDGLAKPGRMVGPDPGLLVTAYAGVCAP" CDS complement(2833881..2834411) /codon_start=1 /transl_table=11 /gene="aroK" /locus_tag="BQ2027_MB2568C" /product="shikimate kinase arok (sk)" /note="Mb2568c, aroK, len: 176 aa. Equivalent to Rv2539c, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Probable aroK, shikimate kinase (EC 2.7.1.71) (see citation below), equivalent to Q9CCS5|AROK|ML0517 PUTATIVE SHIKIMATE KINASE from Mycobacterium leprae (199 aa), FASTA scores: opt: 852, E(): 1.3e-42, (79.65% identity in 167 aa overlap). Also highly similar to many e.g. Q9X5D1|AROK_CORG from Corynebacterium glutamicum (Brevibacterium flavum) (169 aa), FASTA scores: opt: 478, E(): 5.4e-21, (47.0% identity in 168 aa overlap); Q9KXQ5|AROK from Streptomyces coelicolor (171 aa), FASTA scores: opt: 465, E(): 3.1e-20, (49.1% identity in 167 aa overlap); P24167|AROK_ECOLI from Escherichia coli strain K12 (172 aa), FASTA scores: opt: 316, E(): 1.3e-11, (38.4% identity in 164 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A, and PS01128 Shikimate kinase signature. BELONGS TO THE SHIKIMATE KINASE FAMILY. Protein product from Mb2568c detected using SWATH mass spectrometry. Mb2568c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4Z3" /db_xref="InterPro:IPR000623" /db_xref="InterPro:IPR023000" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR031322" /db_xref="UniProtKB/Swiss-Prot:P0A4Z3" /protein_id="SIU01185.1" /translation="MAPKAVLVGLPGSGKSTIGRRLAKALGVGLLDTDVAIEQRTGRS IADIFATDGEQEFRRIEEDVVRAALADHDGVLSLGGGAVTSPGVRAALAGHTVVYLEI SAAEGVRRTGGNTVRPLLAGPDRAEKYRALMAKRAPLYRRVATMRVDTNRRNPGAVVR HILSRLQVPSPSEAAT" CDS complement(2834415..2835620) /codon_start=1 /transl_table=11 /gene="aroF" /locus_tag="BQ2027_MB2569C" /standard_name="aroC" /product="PROBABLE CHORISMATE SYNTHASE AROF (5-ENOLPYRUVYLSHIKIMATE-3-PHOSPHATE PHOSPHOLYASE)" /note="Mb2569c, aroF, len: 401 aa. Equivalent to Rv2540c, len: 401 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 401 aa overlap). Probable aroF (alternate gene name: aroC), chorismate synthase (EC 4.2.3.5), equivalent to Q9CCS6|AROF|ML0516 PUTATIVE CHORISMATE SYNTHASE from Mycobacterium leprae (407 aa), FASTA scores: opt: 2278, E(): 6.2e-123, (88.05% identity in 401 aa overlap). Also highly similar to many e.g. Q9X5D0|AROC_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (410 aa), FASTA scores: opt: 1811, E(): 3e-96, (70.3% identity in 397 aa overlap); Q9KXQ4|AROC_STRCO|AROF|SC9C5.20c from Streptomyces coelicolor (394 aa), FASTA scores: opt: 1710, E(): 1.7e-90, (67.0% identity in 385 aa overlap); Q9KCB7|AROC_BACHD|AROF|BH1656 from Bacillus halodurans (390 aa), FASTA scores: opt: 1196, E(): 3.9e-61, (48.7% identity in 386 aa overlap); etc. Contains PS00788 Chorismate synthase signature 2. BELONGS TO THE CHORISMATE SYNTHASE FAMILY. COFACTOR: REDUCED FLAVIN. Protein product from Mb2569c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2569c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63612" /db_xref="InterPro:IPR000453" /db_xref="InterPro:IPR020541" /db_xref="InterPro:IPR035904" /db_xref="UniProtKB/Swiss-Prot:P63612" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01186.1" /translation="MLRWITAGESHGRALVAVVEGMVAGVHVTSADIADQLARRRLGY GRGARMTFERDAVTVLSGIRHGSTLGGPIAIEIGNTEWPKWETVMAADPVDPAELADV ARNAPLTRPRPGHADYAGMLKYGFDDARPVLERASARETAARVAAGTVARAFLRQALG VEVLSHVISIGASAPYEGPPPRAEDLPAIDASPVRAYDKAAEADMIAQIEAAKKDGDT LGGVVEAVALGLPVGLGSFTSGDHRLDSQLAAAVMGIQAIKGVEIGDGFQTARRRGSR AHDEMYPGPDGVVRSTNRAGGLEGGMTNGQPLRVRAAMKPISTVPRALATVDLATGDE AVAIHQRSDVCAVPAAGVVVETMVALVLARAALEKFGGDSLAETQRNIAAYQRSVADR EAPAARVSG" CDS 2835635..2836042 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2570" /product="HYPOTHETICAL ALANINE RICH PROTEIN" /note="Mb2570, -, len: 135 aa. Equivalent to Rv2541, len: 135 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 135 aa overlap). Hypothetical unknown ala-rich protein, equivalent to AAK46926|MT2615.1 HYPOTHETICAL 38.9 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 but AAK46926|MT2615.1 longer at C-terminus. Questionable ORF. Some similarity with Rv2077A from Mycobacterium tuberculosis (99 aa)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R3" /protein_id="SIU01187.1" /translation="MRRRRPPHVNAPTPCDRGDVRPPGCPASIPGVEVAGGTRARLRV TADGLQALAGRCATLAGELSAAVAPSGAVLSWQANAVAVNAAHARAGAAAAAVSARMR ATAAALGQAARRYAGQDTAAAAALGAVRPWGTH" CDS 2836338..2837549 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2571" /product="Putative transmembrane protein" /note="Mb2571, -, len: 403 aa. Equivalent to Rv2542, len: 403 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 265 aa overlap). Conserved hypothetical protein, highly similar to AAK46927|MT2616 HYPOTHETICAL 28.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 1776, E(): 2.3e-94, (99.25% identity in 265 aa overlap). And similar to several hypothetical proteins from Mycobacterium tuberculosis (strain H37Rv and CDC1551) e.g. P71654|Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: opt: 537, E(): 2.6e-23, (40.75% identity in 292 aa overlap); P71547|Y963_MYCTU|Rv0963c|MT0992|MTCY10D7.11 (266 aa), FASTA scores: opt: 357, E(): 2.6e-13, (34.6% identity in 234 aa overlap); Q10685|YK77_MYCTU|Rv2077c|MT2137|MTCY49.1 6c (323 aa), FASTA scores: opt: 261, E(): 9.5e-08, (32.7% identity in 211 aa overlap); etc. Also similar to Q9RDQ9|SC4A7.03 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (406 aa), FASTA scores: opt: 247, E(): 7.3e-07, (30.35% identity in 303 aa overlap). Mb2571 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010427" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X4" /protein_id="SIU01188.1" /translation="MLDAVSDARRDGFAVGEDYTVTDRSTGGSRQQRAARLGQAQGHA DFIRHRVGALLATDRDIATRVSAATQGLDELAFEDVPGVDTPAEDGVQAVDFRQAPPP GAPGGMSSGDIDAIDAANRALLQDMLAEYSRLPDGQVKTDRLADIAAIQEALRVPDSH LIYVARPDDPADMIPAVTAVGDPFTADHVSVTVPGVSGTTRQTIATMTQEARGLREEA RVIAHSVGESENVATIAWVGYQPPPVLASWNTVDDDLAQAGAPKLEAFLRDLQAGSHN PGHTTALFGHSYGSLLSGIALKDGASSLVDNAVLYGSPGFDATSPAKLGMNDHNFFVM TTPDDPIRYPARLAPLHGWGSDGADTIGTVGRQGTPARVGIRPQRDHRRIPGPLPLHP SADRRGIHSAG" CDS 2837676..2838335 /codon_start=1 /transl_table=11 /gene="lppA" /locus_tag="BQ2027_MB2572" /product="PROBABLE CONSERVED LIPOPROTEIN LPPA" /note="Mb2572, lppA, len: 219 aa. Equivalent to Rv2543, len: 219 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 219 aa overlap). Probable lppA, conserved lipoprotein, highly similar to upstream orf P95009|LPPB|Rv2544|MTCY159.12 PUTATIVE LIPOPROTEIN LPPB from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 1240, E(): 1.1e-73, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /db_xref="InterPro:IPR032018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H8" /protein_id="SIU01189.1" /translation="MIAPQPISRTLPRWQRIVALTMIGISTALIGGCTMDHNPDTSRR LTGEQKIQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFDDDPNIQQSD RNGALCDKLTADIARRPIANSVMFGATFSAEDFKIAANIVREEAAKYGATTESSLFNE SAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTP H" CDS 2838332..2838991 /codon_start=1 /transl_table=11 /gene="lprR" /locus_tag="BQ2027_MB2573" /product="PROBABLE CONSERVED LIPOPROTEIN LPRR" /note="Mb2573, lprR, len: 219 aa. Equivalent to MT2619, len: 219 aa, from Mycobacterium tuberculosis strain CDC1551, (99.55% identity in 219 aa overlap). Also similar to Rv2543 and Rv2544, len: 219 aa and 220 aa, from Mycobacterium tuberculosis strain H37Rv, (88.15% identity in 219 aa overlap and 84.40% identity in 218 aa overlap). See citation below. Rv2543: Probable lppA, conserved lipoprotein, highly similar to upstream ORF P95009|LPPB|Rv2544|MTCY159.12 PUTATIVE LIPOPROTEIN LPPB from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 1240, E(): 1.1e-73, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Rv2544: Probable lppB, conserved lipoprotein, highly similar to downstream ORF P95010|MTCY159.13c|LPPA|Rv2543|MTCY159.13 PUTATIVE LIPOPROTEIN LPPA from Mycobacterium tuberculosis (219 aa), FASTA scores: opt: 1242, E(): 4.8e-72, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, there is a 656 bp insertion relative to Mycobacterium tuberculosis strain H37Rv." /db_xref="InterPro:IPR032018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I1" /protein_id="SIU01190.1" /translation="MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMDQSPDTSRR LTDEQKIQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFTEDPAGRKAD REGLSCKELTGDIARRPIADAVIFGTAFSAEDFKVVTNIVREEAAKYGATTESSLFNE SAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTP H" CDS 2838988..2839650 /codon_start=1 /transl_table=11 /gene="lppB" /locus_tag="BQ2027_MB2574" /product="PROBABLE CONSERVED LIPOPROTEIN LPPB" /note="Mb2574, lppB, len: 220 aa. Equivalent to Rv2544, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Probable lppB, conserved lipoprotein, highly similar to downstream ORF P95010|MTCY159.13c|LPPA|Rv2543|MTCY159.13 PUTATIVE LIPOPROTEIN LPPA from Mycobacterium tuberculosis (219 aa), FASTA scores: opt: 1242, E(): 4.8e-72, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /db_xref="InterPro:IPR032018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1H9" /protein_id="SIU01191.1" /translation="MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMGQNPDKSPH LTGEQKIQLIDSMRHKGSYEAARERLTATAQIIADRVSAAIPGQTWKFNDDSYGQDFY RNGSLCKELSADIARRPMAKPVDFGSTFSAEDFKIAANIVREEAAKYGVTTESSLFNE SAKRDYDVQGNGYEFNLGQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTP TP" CDS 2839647..2839925 /codon_start=1 /transl_table=11 /gene="vapb18" /locus_tag="BQ2027_MB2575" /product="possible antitoxin vapb18" /note="Mb2575, -, len: 92 aa. Equivalent to Rv2545, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Conserved hypothetical protein. C-terminus highly similar to O33300|Rv2758c|MTV002.23c PROTEIN from Mycobacterium tuberculosis (88 aa), FASTA scores: opt: 151, E(): 9.8e-05, (66.65% identity in 45 aa overlap); and Q10771|Rv1560|MT1611|MTCY48.05 PROTEIN from Mycobacterium tuberculosis (72 aa), FASTA scores: opt: 84, E(): 8.2, (46.5% identity in 43 aa overlap). Mb2575 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01192.1" /translation="MSTTIVAGVIQGHLPVILPTRRRARDLGHTTALFRAQTLQCIYL SIEYLYVCSMSRRTTIDIDDILLARAQAALGTTGLKDRVDAALRAAVR" CDS 2840018..2840431 /codon_start=1 /transl_table=11 /gene="vapc18" /locus_tag="BQ2027_MB2576" /product="possible toxin vapc18" /note="Mb2576, -, len: 137 aa. Equivalent to Rv2546, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Conserved hypothetical protein. Some similarity to several HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis (strain H37Rv and CDC1551) e.g. P96411|Rv0229c|MTCY08D5.24c (226 aa), FASTA scores: opt: 272, E(): 1.3e-11, (39.7% identity in 136 aa overlap); O33299|Rv2757c|MTV002.22c (138 aa), FASTA scores: opt: 265, E(): 2.5e-11, (38.5% identity in 135 aa overlap); P95026|Rv2527|MTCY159.29c (133 aa), FASTA scores: opt: 206, E(): 2.6e-07, (38.0% identity in 100 aa overlap); etc." /db_xref="GOA:A0A1R3Y1I5" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I5" /protein_id="SIU01193.1" /translation="MVFCVDTSAWHHAARPEVARRWLAALSADQIGICDHVRLEILYS ANSATDYDALADELDGLARIPVGAETFTRACQVQRELAHVAGLHHRSVKIADLVIAAA AELSGTIVWHYDENYDRVAAITGQPTEWIVPRGTL" CDS 2840470..2840727 /codon_start=1 /transl_table=11 /gene="vapb19" /locus_tag="BQ2027_MB2577" /product="possible antitoxin vapb19" /note="Mb2577, -, len: 85 aa. Equivalent to Rv2547, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein. Some similarity to P71666|YD98_MYCTU|Rv1398c|MT1442|MTCY21B4.15c HYPOTHETICAL 9.4 KDA PROTEIN from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 108, E(): 0.33, (37.1% identity in 62 aa overlap); CAC45864|SMC01933 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (71 aa), FASTA scores: opt: 105, E(): 0.46, (28.4% identity in 74 aa overlap); Q97W38|SSO10342 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (58 aa), FASTA scores: opt: 94, E(): 2.3, (46.95% identity in 49 aa overlap). Mb2577 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1I0" /db_xref="InterPro:IPR002145" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I0" /protein_id="SIU01194.1" /translation="MRTQVTLGKEELELLDRAAKASGASRSELIRRAIHRAYGTGSKQ ERLAALDHSRGSWRGRDFTGTEYVDAIRGDLNERLARLGLA" CDS 2840724..2841101 /codon_start=1 /transl_table=11 /gene="vapc19" /locus_tag="BQ2027_MB2578" /product="possible toxin vapc19" /note="Mb2578, -, len: 125 aa. Equivalent to Rv2548, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). Conserved hypothetical protein. Some similarity to various proteins e.g. P71665|Rv1397c|MTCY21B4.14c HYPOTHETICAL 15.0 KDA PROTEIN from Mycobacterium tuberculosis (133 aa), FASTA scores: opt: 265, E(): 7.1e-12, (42.3% identity in 123 aa overlap); Q97WY5|SSO1975 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (125 aa), FASTA scores: opt: 131, E(): 0.018, (30.0% identity in 110 aa overlap); O52285|YLE HYPOTHETICAL 14.9 KDA PROTEIN from Agrobacterium radiobacter (133 aa), FASTA scores: opt: 128, E(): 0.03, (32.8% identity in 125 aa overlap); etc. Protein product from Mb2578 detected using SWATH mass spectrometry. Mb2578 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3N1" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3N1" /protein_id="SIU01195.1" /translation="MKLIDTTIAVDHLRGEPAAAVLLAELINNGEEIAASELVRFELL AGVRESELAALEAFFSAVVWTLVTEDIARIGGRLARRYRSSHRGIDDVDYLIAATAIV VDADLLTTNVRHFPMFPDLQPPY" CDS complement(2841117..2841491) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2578A" /product="Conserved protein" /note="Mb2578A, len: 124 aa. Equivalent to Rv2548A len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 124 aa overlap). Transferred manually from H37Rv. Conserved protein. Protein product from Mb2578A detected using SWATH mass spectrometry. Mb2578A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C5" /protein_id="SIU01196.1" /translation="MLPENLEQRVTALESQVRELADRVRASEQDAAAARVLAGAADRD VTEFVGEFRDFRRATIGSFNALREDFTALREEMTERFSHVEERFSRVDDGFTEMRGKL DGAAAGQQRIVELIEQLIADQG" CDS complement(2841591..2841986) /codon_start=1 /transl_table=11 /gene="vapc20" /locus_tag="BQ2027_MB2579C" /product="possible toxin vapc20" /note="Mb2579c, -, len:131 aa. Equivalent to Rv2549c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, showing some similarity to P73415|SLL1715 from Synechocystis sp. strain PCC 6803 (157 aa), FASTA scores: opt: 167, E(): 4.2e-05, (29.45% identity in 129 aa overlap); Q9HHY6|VNG6166H from Halobacterium sp. plasmid pNRC200 strain NRC-1 (144 aa), FASTA scores: opt: 133, E(): 0.011, (29.6% identity in 125 aa overlap); and Q9HSU3|VNG0072H from Halobacterium sp. strain NRC-1 (144 aa), FASTA scores: opt: 113, E(): 0.29, (25.75% identity in 136 aa overlap). Mb2579c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1S7" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="InterPro:IPR039018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1S7" /protein_id="SIU01197.1" /translation="MIFVDTSFWAALGNAGDARHGTAKRLWASKPPVVMTSNHVLGET WTLLNRRCGHRAAVAAAAIRLSTVVRVEHVTADLEEQAWEWLVRHDEREYSFVDATSF AVMRKKGIQNAYAFDGDFSAAGFVEVRPE" CDS complement(2841983..2842228) /codon_start=1 /transl_table=11 /gene="vapb20" /locus_tag="BQ2027_MB2580C" /product="possible antitoxin vapb20" /note="Mb2580c, -, len: 81 aa. Equivalent to Rv2550c, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Hypothetical unknown protein. Mb2580c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1Y4" /db_xref="InterPro:IPR002145" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y4" /protein_id="SIU01198.1" /translation="MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVA EHLRQPGPDPVDAFVGSFVGEADLSASVDDVVYGKHE" CDS complement(2842639..2843058) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2581C" /product="probable type IV peptidase" /note="Mb2581c, -, len: 139 aa. Equivalent to Rv2551c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein, similar to the second part of Q9XAP1|SC10A7.34c PUTATIVE TYPE IV PEPTIDASE from Streptomyces coelicolor (259 aa), FASTA scores: opt: 243, E(): 7.4e-08, (40.95% identity in 144 aa overlap). Also some similarity with other proteins e.g. AAK58497|GSPO GSPO PROTEIN from Acetobacter diazotrophicus (261 aa), FASTA scores: opt: 152, E(): 0.025, (33.35% identity in 135 aa overlap). Mb2581c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1I8" /db_xref="InterPro:IPR000045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01199.1" /translation="MLAAAVLAWMGVLCVCDVRQRRLPNWLTLPGAGVILLFAGLAGR GVPALAGAAALAGVYLLVHLALPAAMGAGDVKLAIGLGGLTGCFGVEVWFLAALAAPL LTAVCGVMVTPWGVRTLPHGPSMCVASLGAVGLALLG" CDS complement(2843070..2843852) /codon_start=1 /transl_table=11 /gene="aroE" /locus_tag="BQ2027_MB2582C" /product="PROBABLE SHIKIMATE 5-DEHYDROGENASE AROE (5-DEHYDROSHIKIMATE REDUCTASE)" /note="Mb2582c, aroE, len: 260 aa. Equivalent to Rv2552c, len: 269 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 260 aa overlap). Probable aroE, shikimate 5-dehydrogenase (EC 1.1.1.25), equivalent to Q9CCS7|AROE|ML0515 PUTATIVE SHIKIMATE 5-DEHYDROGENASE from Mycobacterium leprae (278 aa), FASTA scores: opt: 1452, E(): 1.8e-77, (81.5% identity in 270 aa overlap). Also highly similar, but longer 101 aa, to Q9KH59|AROE PUTATIVE SHIKIMATE DEHYDROGENASE (FRAGMENT) from Mycobacterium marinum (148 aa), FASTA scores: opt: 729, E(): 1.3e-35, (76.35% identity in 148 overlap); Q9F7W3|AROE from Mycobacterium ulcerans (148 aa), FASTA scores: opt: 718, E(): 5.9e-35, (75.7% identity in 148 aa overlap). And also similar to to others e.g. Q9KXQ2|AROE from Streptomyces coelicolor (255 aa), FASTA scores: opt: 572, E(): 2.8e-26, (43.4% identity in 251 aa overlap); Q98DY3|MLR4492 from Rhizobium loti (Mesorhizobium loti) (280 aa), FASTA scores: opt: 385, E(): 2.2e-15, (34.85% identity in 284 aa overlap); P74591|AROE_SYNY3|SLR1559 from Synechocystis sp. strain PCC 6803 (290 aa), FASTA scores: opt: 347, E(): 3.7e-13, (30.9% identity in 275 aa overlap); P15770|AROE_ECOLI|B3281 from Escherichia coli strain K12 (272 aa), FASTA scores: opt: 230, E(): 7.7e-08, (29.5% identity in 251 aa overlap); etc. BELONGS TO THE SHIKIMATE DEHYDROGENASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (t-*) leads to a shorter product compared to Mycobacterium tuberculosis strain H37Rv (260 aa versus 269 aa). Protein product from Mb2582c detected using SWATH mass spectrometry. Mb2582c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1J1" /db_xref="InterPro:IPR010110" /db_xref="InterPro:IPR013708" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR041121" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J1" /protein_id="SIU01200.1" /translation="MLGSPIAHSRSPQLHLAAYRALGLHDWTYERIECGAAELPVVVG GFGPEWVGVSVTMPGKFAALRFADERTARADLVGSANTLVRTPHGWRADNTDIDGVAG ALGAAAGHALVLGSGGTAPAAVVGLAELGVTDITVVARNSDKAARLVDLGTRVGVATR FCAFDSGGLADAVAAAEVLVSTIPAEVAAGYAGTLAAIPVLLDAIYDPWPTPLAAAVG SAGGRVISGLQMLLHQAFAQVEQFTGLPAPREAMTCALAALD" CDS complement(2843875..2845128) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2583C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb2583c, -, len: 417 aa. Equivalent to Rv2553c, len: 417 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 417 aa overlap). Probable conserved membrane protein, equivalent to Q9CCS8|ML0514 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (421 aa), FASTA scores: opt: 1955, E(): 1.1e-111, (72.7% identity in 414 aa overlap). Also similar in part to various proteins e.g. Q9L9G6|NOVB NOVB PROTEIN (aminodesoxychorismate lyase) from Streptomyces sphaeroides (284 aa), FASTA scores: opt: 451, E(): 2.9e-2, (37.95% identity in 203 aa overlap); Q9EWY3|2SCG38.36 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (253 aa), FASTA scores: opt: 419, E(): 2.3e-18, (39.2% identity in 171 aa overlap); Q9CHT3|YGCC HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (550 aa), FASTA scores: opt: 379, E(): 1.2e-15, (23.0% identity in 417 aa overlap); O25309|HP0587 AMINODEOXYCHORISMATE LYASE (PABC) from Helicobacter pylori (Campylobacter pylori) (329 aa), FASTA scores: opt: 290, E(): 2e-10, (31.65% identity in 180 aa overlap); etc. Protein product from Mb2583c detected using SWATH mass spectrometry. Mb2583c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1I9" /db_xref="InterPro:IPR003770" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1I9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01201.1" /translation="MPDGGHRHRAQPVSVRPNRHRRTRVSRAQRRHAQQIRRRRRVAG GFALSLLVVVVVVAVVVGAKLWQTMLGFGNDYTGPGKRDIVIQIRAGDSTTAVGETLL KHGVVATVRAFVDAAHGNTAISSIQPGFYRMRTEISAASAVARLTDPHNRVGKLVIPE GRQLDDTTDMKTNVVNPGIFALISRATCVDLDGTQRCVSVADLRAAASRSTPTMLSVP RWAVGPVMELGTDHRRIEGLIAPGTFNIDPSASAETILATLISAGAVEYMKSGLVDTA KSLGLSPYDILVVASLVQQEANTQDFPKVARVIYNRLHEHRTLEFDSTVNYPLDRREV ATSDTDRAQRTPWNTYMAQGLPATAICSPGVDALRAAEHPVPGDWLYFVTIDSQGTTL FTRDYQQHLANIELAKHNGVLDSAR" CDS complement(2845121..2845633) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2584C" /product="Putative pre-16S rRNA nuclease YqgF" /note="Mb2584c, -, len: 170 aa. Equivalent to Rv2554c, len: 170 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 170 aa overlap). Conserved hypothetical protein, equivalent to Q9CCS9|ML0513 HYPOTHETICAL PROTEIN from Mycobacterium leprae (184 aa), FASTA scores: opt: 701, E(): 2e-34, (72.05% identity in 161 aa overlap). Also highly similar to Q9KXQ0|SC9C5.24c HYPOTHETICAL 17.7 KDA PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 461, E(): 2.3e-20, (54.65% identity in 150 aa overlap); and similar to other hypothetical proteins e.g. Q9KDE4 from Bacillus halodurans (140 aa), FASTA scores: opt: 291, E(): 1.9e-10, (38.7% identity in 137 aa overlap); P74662|SLL1547 from Synechocystis sp. strain PCC 6803 (152 aa), FASTA scores: opt: 290, (36.55% identity in 145 aa overlap); Q52673|YQGF_RHOCA from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (159 aa), FASTA scores: opt: 246, E(): 8.4e-08, (34.8% identity in 135 aa overlap); etc. Protein product from Mb2584c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2584c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67488" /db_xref="InterPro:IPR005227" /db_xref="InterPro:IPR006641" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR037027" /db_xref="UniProtKB/Swiss-Prot:P67488" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01202.1" /translation="MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAI LATPVETVRRDRSGKHLRRLAALAAELEAVEVIVGLPRTLADRIGRSAQDAIELAEAL ARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAILQSWLDERLAA MAGTQEGSDA" CDS complement(2845634..2848348) /codon_start=1 /transl_table=11 /gene="alaS" /locus_tag="BQ2027_MB2585C" /product="PROBABLE ALANYL-TRNA SYNTHETASE ALAS (ALANINE--TRNA LIGASE) (ALANINE TRANSLASE) (ALARS)" /note="Mb2585c, alaS, len: 904 aa. Equivalent to Rv2555c, len: 904 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 904 aa overlap). Probable alaS, alanyl-tRNA synthetase (EC 6.1.1.7), equivalent to Q9CCT0|ALAS|ML0512 ALANYL-TRNA SYNTHETASE from Mycobacterium leprae (908 aa), FASTA scores: opt: 5013, E(): 0, (84.65% identity in 907 aa overlap). Also highly similar to many e.g. Q9KXP9|ALAS from Streptomyces coelicolor (890 aa), FASTA scores: opt: 2159, E(): 3.8e-118, (53.45% identity in 907 aa overlap); Q9FFC7 Arabidopsis thaliana (Mouse-ear cress) (954 aa), FASTA scores: opt: 1963, E(): 1.1e-106, (41.1% identity in 925 aa overlap); Q9RS27|DR2300 from Deinococcus radiodurans (890 aa), FASTA scores: opt: 1352, E(): 4.1e-71, (38.05% identity in 915 aa overlap); etc. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb2585c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2585c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYB1" /db_xref="InterPro:IPR002318" /db_xref="InterPro:IPR003156" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR012947" /db_xref="InterPro:IPR018162" /db_xref="InterPro:IPR018163" /db_xref="InterPro:IPR018164" /db_xref="InterPro:IPR018165" /db_xref="InterPro:IPR023033" /db_xref="UniProtKB/Swiss-Prot:Q7TYB1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01203.1" /translation="MQTHEIRKRFLDHFVKAGHTEVPSASVILDDPNLLFVNAGMVQF VPFFLGQRTPPYPTATSIQKCIRTPDIDEVGITTRHNTFFQMAGNFSFGDYFKRGAIE LAWALLTNSLAAGGYGLDPERIWTTVYFDDDEAVRLWQEVAGLPAERIQRRGMADNYW SMGIPGPCGPSSEIYYDRGPEFGPAGGPIVSEDRYLEVWNLVFMQNERGEGTTKEDYQ ILGPLPRNNIDTGMGVERIALVLQDVHNVYETDLLRPVIDTVARVAARAYDVGNHEDD VRYRIIADHSRTAAILIGDGVSPGNDGRGYVLRRLLRRVIRSAKLLGIDAAIVGDLMA TVRNAMGPSYPELVADFERISRIAVAEETAFNRTLASGSRLFEEVASSTKKSGATVLS GSDAFTLHDTYGFPIELTLEMAAETGLQVDEIGFRELMAEQRRRAKADAAARKHAHAD LSAYRELVDAGATEFTGFDELRSQARILGIFVDGKRVPVVAHGVAGGAGEGQRVELVL DRTPLYAESGGQIADEGTISGTGSSEAARAAVTDVQKIAKTLWVHRVNVESGEFVEGD TVIAAVDPGWRRGATQGHSGTHMVHAALRQVLGPNAVQAGSLNRPGYLRFDFNWQGPL TDDQRTQVEEVTNEAVQADFEVRTFTEQLDKAKAMGAIALFGESYPDEVRVVEMGGPF SLELCGGTHVSNTAQIGPVTILGESSIGSGVRRVEAYVGLDSFRHLAKERALMAGLAS SLKVPSEEVPARVANLVERLRAAEKELERVRMASARAAATNAAAGAQRIGNVRLVAQR MSGGMTAADLRSLIGDIRGKLGSEPAVVALIAEGESQTVPYAVAANPAAQDLGIRAND LVKQLAVAVEGRGGGKADLAQGSGKNPTGIDAALDAVRSEIAVIARVG" CDS complement(2848439..2848828) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2586C" /product="UPF0047 protein Rv2556c" /note="Mb2586c, -, len: 129 aa. Equivalent to Rv2556c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, highly similar to others e.g. Q9EWY5|2SCG38.34 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (140 aa), FASTA scores: opt: 488, E(): 8.2e-26, (58.8% identity in 131 aa overlap); Q9L9G4|NOVD NOVD PROTEIN from Streptomyces sphaeroides (143 aa), FASTA scores: opt: 474, E(): 7.2e-25, (60.85% identity in 120 aa overlap); Q9X2I5|TM1872 from Thermotoga maritima (132 aa), FASTA scores: opt: 270, E(): 2.7e-11, (39.55% identity in 129 aa overlap); etc. Protein product from Mb2586c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2586c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001602" /db_xref="InterPro:IPR035917" /db_xref="UniProtKB/Swiss-Prot:P67122" /protein_id="SIU01204.1" /translation="MLDVDTARRRIVDLTDAVRAFCTAHDDGLCNVFVPHATAGVAII ETGAGSDEDLVDTLVRLLPRDDRYRHAHGSYGHGADHLLPAFVAPSVTVPVSGGQPLL GTWQSIVLVDLNQDNPRRSVRLSFVEG" CDS 2848935..2849609 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2587" /product="conserved protein" /note="Mb2587, -, len: 224 aa. Equivalent to Rv2557, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Conserved hypothetical protein, highly similar only to upstream ORF Q50740|MTCY9C4.10c|Rv2558|MT2635 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (236 aa), FASTA scores: opt: 1007, E(): 6.9e-60, (69.2% identity in 224 aa overlap). Protein product from Mb2587 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2587 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007138" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/Swiss-Prot:P65004" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01205.1" /translation="MTGGATGALPRTMKEGWIVYARSTTIQAQSECIDTGIAHVRDVV MPALQGMDGCIGVSLLVDRQSGRCIATSAWETAEAMHASREQVTPIRDRCAEMFGGTP AVEEWEIAAMHRDHRSAEGACVRATWVKVPADQVDQGIEYYKSSVLPQIEGLDGFCSA SLLVDRTSGRAVSSATFDSFDAMERNRDQSNALKATSLREAGGEELDECEFELALAHL RVPELV" CDS 2849694..2850404 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2588" /product="conserved protein" /note="Mb2588, -, len: 236 aa. Equivalent to Rv2558, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Conserved hypothetical protein, highly similar only to downstream ORF Q50741|MTCY9C4.11c|Rv2557|MT2645 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (224 aa), FASTA scores: opt: 1007, E(): 4.7e-59, (69.2% identity in 224 aa overlap). Protein product from Mb2588 detected using SWATH mass spectrometry. Mb2588 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/Swiss-Prot:P65006" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01206.1" /translation="MPGSAGWRKVFGGTGGATGALPRHGRGSIVYARSTTIEAQPLSV DIGIAHVRDVVMPALQEIDGCVGVSLLVDRQSGRCIATSAWETLEAMRASVERVAPIR DRAALMFAGSARVEEWDIALLHRDHPSHEGACVRATWLKVVPDQLGRSLEFYRTSVLP ELESLDGFCSASLMVDHPACRRAVSCSTFDSMDAMARNRDRASELRSRRVRELGAEVL DVAEFELAIAHLRVPELV" CDS complement(2850434..2851792) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2589C" /product="Replication-associated recombination protein RarA" /note="Mb2589c, -, len: 452 aa. Equivalent to Rv2559c, len: 452 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 452 aa overlap). Conserved hypothetical ala-, leu-, val-rich protein, equivalent to Q9CCT1|ML0510 HYPOTHETICAL PROTEIN from Mycobacterium leprae (473 aa), FASTA scores: opt: 2411, E(): 3.9e-121, (83.4% identity in 452 aa overlap); O69490|O69490 HYPOTHETICAL 47.1 KDA PROTEIN from Mycobacterium leprae (447 aa), FASTA scores: opt: 2406, E(): 6.9e-121, (83.95% identity in 448 aa overlap). Also highly similar to Q9KXP4|SC9C5.30c CONSERVED ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (451 aa), FASTA scores: opt: 1742, E(): 1.5e-85, (64.4% identity in 430 aa overlap); Q9RT67|DR1898 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (434 aa), FASTA scores: opt: 1147, E(): 6.6e-54, (46.0% identity in 415 aa overlap); P45262|YCAJ_HAEIN|HI1590 HYPOTHETICAL PROTEIN from Haemophilus influenzae (446 aa), FASTA scores: opt: 1140, E(): 1.6e-53, (42.5% identity in 428 aa overlap); etc. Also similar to Q50629|MTCY227.09|RUVB|Rv2592c|MT2669|MTCY 227.09 HOLLIDAY JUNCTION DNA HELICASE from Mycobacterium tuberculosis (344 aa), (30.1% identity in 296 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2589c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2589c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1T6" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR008921" /db_xref="InterPro:IPR021886" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR032423" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01207.1" /translation="MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVV GQDHLLAPGSPLRRLVEGSGVASVILYGPPGSGKTTLAALISQATGRRFEALSALSAG VKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVVLLVAATTENP SFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLGRAVAVAPEAVDLLVQLAA GDARRALTALEVAAEAAQAAGELVSVQTIERSVDKAAVRYDRDGDQHYDVVSAFIKSV RGSDVDAALHYLARMLVAGEDPRFIARRLMILASEDIGMADPSALQVAVAAAQTVALI GMPEAQLTLAHATIHLATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAA LGNAQGYKYSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKK RG" CDS 2851938..2852915 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2590" /product="PROBABLE PROLINE AND GLYCINE RICH TRANSMEMBRANE PROTEIN" /note="Mb2590, -, len: 325 aa. Equivalent to Rv2560, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 325 aa overlap). Probable transmembrane protein, pro-, gly-rich protein. Protein product from Mb2590 detected using SWATH mass spectrometry. Mb2590 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59983" /db_xref="UniProtKB/Swiss-Prot:P59983" /protein_id="SIU01208.1" /translation="MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGT YLPPGYNAPPPPPGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVP VLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYI ALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVALTFIGGLLC VIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGE LLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPPGPQLA" CDS 2853272..2854009 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2591" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2591, -, len: 245 aa. Similar to Rv2561 and Rv2562, len: 97 aa and 129 aa, from Mycobacterium tuberculosis strain H37Rv, (87.3% identity in 79 aa overlap and 99.2% identity in 129 aa overlap). Conserved hypothetical protein, highly similar in part (and longer 33 aa) to upstream ORF AAK46951|RV2562|MT2638|MTCY9C4.06c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (212 aa), FASTA scores: opt: 205, E(): 2e-06, (76.1% identity in 46 aa overlap). Conserved hypothetical protein, highly similar, but shorter 83 aa, to downstream ORF AAK46951|RV2561|MT2638|MTCY9C4.07c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (97 aa), FASTA scores: opt: 866, E(): 2.2e-54, (100.0% identity in 129 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2561 and Rv2562 exist as 2 genes. In Mycobacterium bovis, a single base deletion (g-*) results in a single product which is more similar to Rv2561. Mb2591 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR020503" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1J7" /protein_id="SIU01209.1" /translation="MGIQRAVLLIADIGGYTNYMHWNRKHLAHAQWTVAQLLESVIDA AKGMKLAKLEGDAAFFWAPGGNTSVLVCDRPPQMRQRFRTRREQIKKDHPCDCKSCEQ RDNLSIKFVAHEGEVAEQKVKRNVELAGVDVILVHRMLKNEVPVSEYLFMTDVVAQCL DESVRKLATPLTHDFEGIGETSTHYTDLATSDMPPAVPDHSFFGLLWADVKFEWHALP YLLGFKKACAGFRSLGRGATEEPAEMG" CDS 2854152..2855201 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2592" /product="PROBABLE GLUTAMINE-TRANSPORT TRANSMEMBRANE PROTEIN ABC TRANSPORTER" /note="Mb2592, -, len: 349 aa. Equivalent to Rv2563, len: 349 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 349 aa overlap). Probable glutamine-transport transmembrane protein ABC transporter (see citation below), highly similar to O53617|Rv0072|MTV030.16 PUTATIVE ABC-TRANSPORTER TRANSMEMBRANE SUBUNIT from Mycobacterium tuberculosis (349 aa), FASTA scores: opt: 1772, E(): 1.1e-89, (76.2% identity in 349 aa overlap). Also some similarity with various hypothetical proteins e.g. Q9RYN1|DRA0279 HYPOTHETICAL 37.1 KDA PROTEIN from Deinococcus radiodurans (353 aa), FASTA scores: opt: 347, E(): 6.6e-12, (24.35% identity in 357 aa overlap); BAB58522|SAV2360 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (351 aa), FASTA scores: opt: 262, E(): 2.9e-07, (19.4% identity in 356 aa overlap); Q9AK94|SC10A9.10c PUTATIVE ABC TRANSPORT SYSTEM TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 172, E(): 0.025, (26.85% identity in 387 aa overlap); etc. Protein product from Mb2592 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2592 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65008" /db_xref="UniProtKB/Swiss-Prot:P65008" /protein_id="SIU01210.1" /translation="MLFAALRDVQWRKRRLVIAIVSTGLVFAMTLVLTGLVNGFRVEA ERTVDSMGVDAFVVKAGAAGPFLGSTPFAQIDLPQVARAPGVLAAAPLATAPSTIRQG TSARNVTAFGAPEHGPGMPRVSDGRAPSTPDEVAVSSTLGRNLGDDLQVGARTLRIVG IVPESTALAKIPNIFLTTEGLQQLAYNGQPTISSIGIDGMPRQLPDGYQTVNRADAVS DLMRPLKVAVDAITVVAVLLWIVAALIVGSVVYLSALERLRDFAVFKAIGVPTRSILA GLALQAVVVALLAAVVGGILSLLLAPLFPMTVVVPLSAFVALPAIATVIGLLASVAGL RRVVAIDPALAFGGP" CDS 2855204..2856196 /codon_start=1 /transl_table=11 /gene="glnQ" /locus_tag="BQ2027_MB2593" /product="PROBABLE GLUTAMINE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER GLNQ" /note="Mb2593, glnQ, len: 330 aa. Equivalent to Rv2564, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Probable glnQ, glutamine-transport ATP-binding protein ABC transporter (see citation below), highly similar to many e.g. Q9L0J9|SCD40A.12c PUTATIVE ABC-TRANSPORTER ATP-BINDING PROTEIN from Streptomyces coelicolor (246 aa), FASTA scores: opt: 598, E(): 2.5e-26, (46.35% identity in 218 aa overlap); O54136|SC2E9.11 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 592, E(): 5.1e-26, (46.55% identity in 219 aa overlap); O29244|AF1018 from Archaeoglobus fulgidus (228 aa), FASTA scores: opt: 580, E(): 2.4e-25, (42.4% identity in 210 aa overlap); P75831|YBJZ_ECOLI|B0879 from Escherichia coli strain K12 (648 aa), FASTA scores: opt: 555, E(): 1.3e-23, (39.65% identity in 232 aa overlap); etc. Also highly similar to O53618|Rv0073|MTV030.17 ABC-TRANSPORTER ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (330 aa), FASTA scores: opt: 1782, E(): 4.7e-92, (83.65% identity in 330 aa overlap); etc. Shows some similarity to Q11040|YC81_MYCTU|MTCY50.01|Rv1281c|MT1318 HYPOTHETICAL ABC TRANSPORTER ATP-BINDING PROTEIN from Mycobacterium tuberculosis (612 aa) (32.9 % identity in 234 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature, and PS00889 Cyclic nucleotide-binding domain signature 2. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb2593 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2593 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63402" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR018488" /db_xref="InterPro:IPR018490" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63402" /protein_id="SIU01211.1" /translation="MGGLTISDLVVEYSSGGYAVRPIDGLSLDVAPGSLVILLGPSGC GKTTLLSCLGGILRPKSGSIKFDDVDITTLEGAALAKYRRDKVGIVFQAFNLVSSLTA LENVMVPLRAAGVSRAAARKRAEDLLIRVNLGERMKHRPGDMSGGQQQRVAVARAIAL DPQLILADEPTAHLDFIQVEEVLRLIRSLAQGDRVVVVATHDSRMLPLADRVLELMPA QVSPNQPPETVHVKAGEVLFEQSTMGDLIYVVSEGEFEIVRELADGGEELVKTAAPGD YFGEIGVLFHLPRSATVRARSDATAVGYTAQAFRERLGVTRVADLIEHRELASE" CDS 2856473..2858224 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2594" /product="Predicted esterase of the alpha-beta hydrolase superfamily" /note="Mb2594, -, len: 583 aa. Equivalent to Rv2565, len: 583 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 583 aa overlap). Conserved hypothetical protein, similar in part to Q9A6C3|CC2171 HYPOTHETICAL PROTEIN from Caulobacter crescentus (610 aa), FASTA scores: opt: 765, E(): 2.8e-37, (32.15% identity in 575 aa overlap). C-terminus also highly similar to various bacterial proteins e.g. O34731|YLBK_BACSU HYPOTHETICAL 28.3 KDA PROTEIN from Bacillus subtilis (260 aa), FASTA scores: opt: 386, E(): 2.2e-15, (33.05% identity in 245 aa overlap); CAC45997|SMC01003 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (321 aa), FASTA scores: opt: 352, E(): 2.5e-13, (29.65% identity in 280 aa overlap); Q9K9Q8|BH2587 HYPOTHETICAL PROTEIN from Bacillus halodurans (275 aa), FASTA scores: opt: 334, E(): 2.5e-12, (33.7% identity in 175 aa overlap); etc. And shows similarity to C-terminal half of some eukaryotic proteins e.g. Q9R114|NTE NEUROPATHY TARGET ESTERASE HOMOLOG from Mus musculus (Mouse) (1327 aa), FASTA scores: opt: 411, E(): 2.7e-16, (24.45% identity in 626 aa overlap); O60859 NEUROPATHY TARGET ESTERASE from Homo sapiens (Human) (1327 aa), FASTA scores: opt: 410, E(): 3.1e-16, (24.1% identity in 627 aa overlap); Q9U969|SWS|CG2212 SWISS CHEESE PROTEIN from Drosophila melanogaster (Fruit fly) (1425 aa), FASTA scores: opt: 401, E(): 1.1e-15, (27.75% identity in 544 aa overlap); etc. Also shows strong similarity to C-terminal half of O05884|Z95121|Rv3239c|MTY20B11.14c HYPOTHETICAL 110.2 KDA PROTEIN from Mycobacterium tuberculosis (1048 aa), FASTA scores: opt: 648, E(): 3e-30, (36.55% identity in 572 aa overlap); and O69695|Rv3728|MTV025.076 PUTATIVE TWO-DOMAIN MEMBRANE PROTEIN from Mycobacterium tuberculosis (1065 aa), FASTA scores: opt: 643, E(): 6e-30, (34.3% identity in 595 aa overlap). Protein product from Mb2594 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2594 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A643" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR001423" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR018490" /db_xref="UniProtKB/Swiss-Prot:P0A643" /protein_id="SIU01212.1" /translation="MTTARRRPKRRGTDARTALRNVPILADIDDEQLERLATTVERRH VPANQWLFHAGEPADSIYIVDSGRFVAVAPEGHVFAEMASGDSIGDLGVIAGAARSAG VRALRDGVVWRIAAETFTDMLEATPLLQSAMLRAMARMLRQSRPAKTARRPRVIGVVS NGDTAAAPMVDAIATSLDSHGRTAVIAPPVETTSAVQEYDELVEAFSETLDRAERSND WVLVVADRGAGDLWRHYVSAQSDRLVVLVDQRYPPDAVDSLATQRPVHLITCLAEPDP SWWDRLAPVSHHPANSDGFGALARRIAGRSLGLVMAGGGARGLAHFGVYQELTEAGVV IDRFGGTSSGAIASAAFALGMDAGDAIAAAREFIAGSDPLGDYTIPISALTRGGRVDR LVQGFFGNTLIEHLPRGFFSVSADMITGDQIIHRRGSVSGAVRASISIPGLIPPVHNG EQLLVDGGLLNNLPANVMCADTDGEVICVDLRRTFVPSKGFGLLPPIVTPPGLLRRLL TGTDNALPPLQETLLRAFDLAASTANLRELPRVAAIIEPDVSKIGVLNFKQIDAALEA GRMAARAALQAQPDLVR" CDS 2858235..2859836 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2595" /product="LONG CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb2595, -, len: 533 aa. Equivalent to 5' end of Rv2566, len: 1140 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 531 aa overlap). Long conserved hypothetical protein, equivalent to O53120|ML2678 OR MLCB1913.12 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1000 aa), FASTA scores: opt: 760, E(): 7.1e-38, (50.2% identity in 1128 aa overlap); and middle part equivalent to Q9ZB40 72.2 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (644 aa), FASTA scores: opt: 1017, E(): 1.5e-65, (45.65% identity in 655 aa overlap). Also highly similar to Q98HG6|MLL2877 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (1119 aa), FASTA scores: opt: 1413, E(): 3.7e-77, (52.4% identity in 1148 aa overlap); and N-terminus shows similarity with other proteins e.g. Q9HUN8|PA4926 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (311 aa), FASTA scores: opt: 278, E(): 3e-09, (29.95% identity in 284 aa overlap); and upstream ORF Q50652|YP69_MYCTU|Rv2569c|MT2645|MTCY227.32 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (314 aa), FASTA scores: opt: 252, E(): 1.1e-07, (28.9% identity in 315 aa overlap). Equivalent to AAK46955 from Mycobacterium tuberculosis strain CDC1551 (1156 aa) but shorter 16 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2566 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gc-*) splits Rv2566 into 2 parts, Mb2595 and Mb2596. Mb2595 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002931" /db_xref="InterPro:IPR013589" /db_xref="InterPro:IPR018667" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1K4" /protein_id="SIU01213.1" /translation="MPLRPTQVSSTGRTRCAGRSGVISSAAMSIKVALEHRTSYTFDR LVRVYPHIVRLRPAPHSRTSIEAYSLRIEPADHFINWQQDALGNFLARLVFPNPMRQL RITVGLIADLKVINPFDFFIEDWAEIWPCAGMAYPKALADDLRPYLRPVDEDGDGSGP GELTQAWVRNFTVPDGTRTIDFLVALNRAINADVGYCVRMEPGVQTPDFTLRTGVGSC RDSAWLLVSILRQFGLAARFVSGYLVQLASDIEALDGPSGPAADFTDLHAWSEAYIPG AGWIGLDPTSGLLAGEGHIPLAATPHPASAAPISGGTDVCDTVLEFSNTVTRVHEDPR VTLPYTDESWKTICEVGQRVDERLAAADVRLTVGGEPTFVSVDNQVAEEWRTAADGPH KRERASDLAARLKAVWAPQGLIHRGQGRWYPGEPLPRWQIALYWRTDGRPLWTNDALL ADPWGAPPADPVDDDAAYRVLAGIADGLGLPISQVRPAYEDPLSRLAAAVRMPAGDPV ESGDDLGCDTNPDTPTGRAALLAPR" CDS 2859862..2861655 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2596" /product="LONG CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb2596, -, len: 597 aa. Equivalent to 3' end of Rv2566, len: 1140 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 597 aa overlap). Long conserved hypothetical protein, equivalent to O53120|ML2678 OR MLCB1913.12 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1000 aa), FASTA scores: opt: 760, E(): 7.1e-38, (50.2% identity in 1128 aa overlap); and middle part equivalent to Q9ZB40 72.2 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (644 aa), FASTA scores: opt: 1017, E(): 1.5e-65, (45.65% identity in 655 aa overlap). Also highly similar to Q98HG6|MLL2877 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (1119 aa), FASTA scores: opt: 1413, E(): 3.7e-77, (52.4% identity in 1148 aa overlap); and N-terminus shows similarity with other proteins e.g. Q9HUN8|PA4926 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (311 aa), FASTA scores: opt: 278, E(): 3e-09, (29.95% identity in 284 aa overlap); and upstream ORF Q50652|YP69_MYCTU|Rv2569c|MT2645|MTCY227.32 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (314 aa), FASTA scores: opt: 252, E(): 1.1e-07, (28.9% identity in 315 aa overlap). Equivalent to AAK46955 from Mycobacterium tuberculosis strain CDC1551 (1156 aa) but shorter 16 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2566 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gc-*) splits Rv2566 into 2 parts, Mb2595 and Mb2596. Protein product from Mb2596 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR018667" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1K2" /protein_id="SIU01214.1" /translation="MLPLHRRDDGQGWASANWRLRRGRIVLLEGDSPAGLRLPLDSIS WRPPRASFDADPVAVRSTLPAEPHTDRAVVEDPETAPTTALVAEVRGGLVHIFLPPTD ALEHFIDLVARVEAAATTANCPVVIEGYGPPPDPRLTSTTITPDPGVIEVNIAPTASF AEQRQQLETLYQQARLARLTTEAFDVDGTHGGTGGGNHITLGGVTPADSPLLRRPDLL VSLLTYWQRHPSLSYLFAGRFVGTTSQAPRVDEGRAEALYELEIAFAEILRLSPSSGG GRPQPWVTDRALRHLLTDITGNTHRAEFCIDKLYSPDSARGRLGLLELRGFEMPPHLH MAMVQSLLVRSLVAWFWDQPLRAPLIRHGANLHGRYLLPHFLIHDIADVAADLRAHGI AFETSWLDPFTEFRFPRIGTAVFDGIEIELRGAIEPWHTLGEEATAAGTARYVDSSVE RIQVRIIGADRHRYVVTCNGYPMPLLATDNPDIHVGGVRFKAWQPPSALHPTITVDGP LRFELIDIATATSCGGCTYHVAHPGGRAYDEPPVNAVEAEARRARRFEATGFTPGKLD LSDIREKQARISTDIGAPGILDLRRVRTVQQ" CDS 2861655..2864309 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2597" /product="Protein containing domains DUF404, DUF407, DUF403" /note="Mb2597, -, len: 884 aa. Equivalent to Rv2567, len: 884 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 884 aa overlap). Conserved hypothetical ala-, leu-rich protein, equivalent to O53121|ML2679|MLCB1913.13 HYPOTHETICAL PROTEIN from Mycobacterium leprae (893 aa), FASTA scores: opt: 4326, E(): 0, (75.2% identity in 883 aa overlap); and similar to Q49755|YO11_MYCLE|ML0605|MLCL536.05c|U1937B|B1937_F1_4 HYPOTHETICAL 61.8 KDA PROTEIN from Mycobacterium leprae (561 aa), FASTA scores: opt: 758, E(): 1.2e-38, (32.2% identity in 537 aa overlap). Also similar to others e.g. Q9HUN7|PA4927 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (830 aa), FASTA scores: opt: 1247, E(): 2.2e-68, (38.25% identity in 831 aa overlap); Q98HG7|MLL2876 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (803 aa), FASTA scores: opt: 937, E(): 1.9e-49, (32.15% identity in 828 aa overlap); CAC47419|SMC04057 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (802 aa), FASTA scores: opt: 900, E(): 3.4e-47, (30.85% identity in 852 aa overlap); etc. And similar to P71732|YO11_MYCTU|Rv2411c|MT2484|MTCY253.09 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (551 aa), FASTA scores: opt: 781, E(): 4.6e-40, (33.75% identity in 495 aa overlap). Protein product from Mb2597 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2597 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007296" /db_xref="InterPro:IPR025841" /db_xref="UniProtKB/Swiss-Prot:P59974" /protein_id="SIU01215.1" /translation="MAPSASAATNGYDVDRLLAGYRTARAQETLFDLRDGPGAGYDEF VDDDGNVRPTWTELADAVAERGKAGLDRLRSVVHSLIDHDGITYTAIDAHRDALTGDH DLEPGPWRLDPLPLVISAADWEVLEAGLVQRSRLLDAILADLYGPRSMLTEGVLPPEM LFAHPGYVRAANGIQMPGRHQLFMHACDLSRLPDGTFQVNADWTQAPSGSGYAMADRR VVAHAVPDLYEELAPRPTTPFAQALRLALIDAAPDVAQDPVVVVLSPGIYSETAFDQA YLATLLGFPLVESADLVVRDGKLWMRSLGTLKRVDVVLRRVDAHYADPLDLRADSRLG VVGLVEAQHRGTVTVVNTLGSGILENPGLLRFLPQLSERLLDESPLLHTAPVYWGGIA SERSHLLANVSSLLIKSTVSGETLVGPTLSSAQLADLAVRIEAMPWQWVGQELPQFSS APTNHAGVLSSAGVGMRLFTVAQRSGYAPMIGGLGYVLAPGPAAYTLKTVAAKDIWVR PTERAHAEVITVPVLAPPAKTGAGTWAVSSPRVLSDLFWMGRYGERAENMARLLIVTR ERYHVFRHQQDTDESECVPVLMAALGKITGYDTATGAGSAYDRADMIAVAPSTLWSLT VDPDRPGSLVQSVEGLALAARAVRDQLSNDTWMVLANVERAVEHKSDPPQSLAEADAV LASAQAETLAGMLTLSGVAGESMVHDVGWTMMDIGKRIERGLWLTALLQATLSTVRHP AAEQAIIEATLVACESSVIYRRRTVGKFSVAAVTELMLFDAQNPRSLVYQLERLRADL KDLPGSSGSSRPERMVDEMNTRLRRSHPEELEEVSADGLRAELAELLAGIHASLRDVA DVLTATQLALPGGMQPLWGPDQRRVMPA" CDS complement(2864306..2865331) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2598C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2598c, -, len: 341 aa. Equivalent to Rv2568c, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 341 aa overlap). Conserved hypothetical protein, highly similar (but longer 60 aa) to Q98E75|MLR4376 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (308 aa), FASTA scores: opt: 566, E(): 4.1e-29, (40.2% identity in 291 aa overlap). Protein product from Mb2598c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2598c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011201" /db_xref="InterPro:IPR031321" /db_xref="UniProtKB/Swiss-Prot:P65010" /protein_id="SIU01216.1" /translation="MRDFHCPNCGQRLAFENSACLSCGSALGFSLGRMALLVIADDAD VQLCANLHLAQCNWLVPSDQLGGLCSSCVLTIERPSDTNTAGLAEFARAEGAKRRLIA ELHELKLPIVGRDQDPDHGLAFRLLSSAHENVTTGHQNGVITLDLAEGDDVHREQLRV EMDEPYRTLLGHFRHEIGHYYFYRLIASSSDYLSRFNELFGDPDADYSQALDRHYRGG PPEGWQDSFVSSYATMHASEDWAETFAHYLHIRDALDTAAWCGLAPASATFDRPALGP SAFNTIIDKWLPLSWSLNMVNRSMGHDDLYPFVLPAAVLEKMRFIHTVVDEVAPDFEP AHSRRTV" CDS complement(2865324..2866268) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2599C" /product="Protein containing transglutaminase-like domain, putative cysteine protease" /note="Mb2599c, -, len: 314 aa. Equivalent to Rv2569c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Conserved hypothetical protein, equivalent to Q9CCT2|ML0508 HYPOTHETICAL PROTEIN from Mycobacterium leprae (313 aa), FASTA scores: opt: 1723, E(): 1.9e-95, (84.4% identity in 301 aa overlap); and some similarity with Q49757|YP69_MYCLE|ML0607|MLCL536.03c|B1937_F2_39 HYPOTHETICAL 31.1 KDA PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 305, E(): 4.5e-11, (33.0% identity in 300 aa overlap). Also similar to to other hypothetical proteins e.g. Q9HUN8|PA4926 from Pseudomonas aeruginosa (311 aa), FASTA scores: opt: 704, E(): 8.7e-35, (39.7% identity in 320 aa overlap); Q98HG8|MLL2875 from Rhizobium loti (Mesorhizobium loti) (294 aa), FASTA scores: opt: 521, E(): 6.5e-24, (35.05% identity in 294 aa overlap); Q9A7W9|CC1600 from Caulobacter crescentus (325 aa), FASTA scores: opt: 510, E(): 3.2e-23, (34.4% identity in 2588 aa overlap); etc. Also some similarity with proteins from Mycobacterium tuberculosis e.g. P71734|Rv2409c|MTCY253.11 CONSERVED HYPOTHETICAL PROTEIN (279 aa), FASTA scores: opt: 312, E(): 1.7e-11, (34.45% identity in 296 aa overlap); and Q50732|Rv2566|MTCY9C4.02 LONG CONSERVED HYPOTHETICAL PROTEIN (1140 aa), FASTA scores: opt: 252, E(): 2.2e-07, (28.9% identity in 315 aa overlap) Protein product from Mb2599c detected using SWATH mass spectrometry. Mb2599c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002931" /db_xref="InterPro:IPR013589" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/Swiss-Prot:P0A5G6" /protein_id="SIU01217.1" /translation="MSADSSLSLPLSGTHRYRVTHRTEYRYSDVVTSSYGRGFLTPRN SLRQRCVAHRLTIDPAPADRSTSRDGYGNISSYFHVTEPHRTLTITSDSIVDVSPPPP GLYTSGPALQPWEAARPAGLPGSLATEFTLDLNPPEITDAVREYAAPSFLPKRPLVEV LRDLASRIYTDFTYRSGSTTISTGVNEVLLAREGVCQDFARLAIACLRANGLAACYVS GYLATDPPPGKDRMIGIDATHAWASVWTPQQPGRFEWLGLDPTNDQLVDQRYIVVGRG RDYADVPPLRGIIYTNSENSVIDVSVDVVPFEGDALHA" CDS 2866372..2866761 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2600" /product="YjbR family protein" /note="Mb2600, -, len: 129 aa. Equivalent to Rv2570, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, similar to Q98GQ7|MLR3218 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (133 aa), FASTA scores: opt: 174, E(): 9.6e-05, (32.25% identity in 124 aa overlap); Q9A390|CC3314 HYPOTHETICAL PROTEIN from Caulobacter crescentus (129 aa), FASTA scores: opt: 155, E(): 0.0017, (33.35% identity in 108 aa overlap); and Q9A2Y0|CC3426 HYPOTHETICAL PROTEIN from Caulobacter crescentus (120 aa), FASTA scores: opt: 144, E(): 0.0083, (32.95% identity in 91 aa overlap). Mb2600 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P65014" /protein_id="SIU01218.1" /translation="MATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDRE ALTRAGSEPPSGDIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVR DLEELITEAWLMQAPKQLVQAFLANSG" CDS complement(2866753..2867820) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2601C" /product="PROBABLE TRANSMEMBRANE ALANINE AND VALINE AND LEUCINE RICH PROTEIN" /note="Mb2601c, -, len: 355 aa. Equivalent to Rv2571c, len: 355 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 355 aa overlap). Probable transmembrane ala-, val-, leu-rich protein, showing some similarity with other membrane proteins e.g. Q99340|YFDA_CORGL HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (359 aa), FASTA scores: opt: 338, E(): 2.5e-13, (29.4% identity in 255 aa overlap); Q9RD86|SCF43.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 208, E(): 2.1e-05, (26.05% identity in 303 aa overlap); Q9RD81|SCF43.07 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (419 aa), FASTA scores: opt: 205, E(): 3.5e-05, (25.15% identity in 362 aa overlap); etc. Protein product from Mb2601c detected using SWATH mass spectrometry. Mb2601c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65016" /db_xref="UniProtKB/Swiss-Prot:P65016" /protein_id="SIU01219.1" /translation="MSASLLVRTACGGRAVAQRLRTVLWPITQTSVVAGLAWYLTHDV FNHPQAFFAPISAVVCMSATNVLRARRAQQMIVGVALGIVLGAGVHALLGSGPIAMGV VVFIALSVAVLCARGLVAQGLMFINQAAVSAVLVLVFASNGSVVFERLFDALVGGGLA IVFSILLFPPDPVVMLCSARADVLAAVRDILAELVNTVSDPTSAPPDWPMAAADRLHQ QLNGLIEVRANAAMVARRAPRRWGVRSTVRDLDQQAVYLALLVSSVLHLARTIAGPGG DKLPTPVHAVLTDLAAGTGLADADPTAANEHAAAARATASTLQSAACGSNEVVRADIV QACVTDLQRVIERPGPSGMSA" CDS complement(2867827..2869662) /codon_start=1 /transl_table=11 /gene="aspS" /locus_tag="BQ2027_MB2602C" /product="probable aspartyl-trna synthetase asps (aspartate--trna ligase) (asprs) (aspartic acid translase)" /note="Mb2602c, aspS, len: 611 aa. Equivalent to Rv2572c, len: 596 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 595 aa overlap). Probable aspS, aspartyl-tRNA synthetase (EC 6.1.1.12), equivalent to P36429|SYD_MYCLE|ML0501|MLCB1259.19 ASPARTYL-TRNA SYNTHETASE from Mycobacterium leprae (589 aa), FASTA scores: opt: 3534, E(): 1.8e-215, (87.85% identity in 592 aa overlap). Also highly similar to many e.g. O67589|SYD_AQUAE|AQ_1677 from Aquifex aeolicus (603 aa), FASTA scores: opt: 1829, E(): 8.2e-108, (47.5% identity in 598 aa overlap); O32038|SYD_BACSU from Bacillus subtilis (592 aa), FASTA scores: opt: 1732, E(): 1.1e-101, (46.25% identity in 597 aa overlap); P21889|SYD_ECOLI|TLS|B1866 from Escherichia coli strain K12 (590 aa), FASTA scores: opt: 1588, E(): 1.3e-92, (47.35% identity in 581 aa overlap); etc. Contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (t-*) omitting a stop codon, leads to a longer protein with a different COOH end compared to its homolog in Mycobacterium tuberculosis strain H37Rv (611 aa versus 596 aa). Protein product from Mb2602c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2602c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TYA6" /db_xref="InterPro:IPR002312" /db_xref="InterPro:IPR004115" /db_xref="InterPro:IPR004364" /db_xref="InterPro:IPR004365" /db_xref="InterPro:IPR004524" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR029351" /db_xref="UniProtKB/Swiss-Prot:Q7TYA6" /protein_id="SIU01220.1" /translation="MFVLRSHAAGLLREGDAGQQVTLAGWVARRRDHGGVIFIDLRDA SGIAQVVFRDPQDTEVLAQAHRLRAEFCVSVAGVVEIRPEGNANPEIATGEIEVNATS LTVLGECAPLPFQLDEPAGEELRLKYRYLDLRRDDPAAAIRLRSRVNAAARAVLARHD FVEIETPTITRSTPEGARDFLVPARLHPGSFYALPQSPQLFKQLLMVAGMERYYQIAR CYRDEDFRADRQPEFTQLDMEMSFVDAEDIIAISEEVLTELWALIGYRIPTPIPRIGY AEAMRRFGTDKPDLRFGLELVECTDFFSDTTFRVFQAPYVGAVVMPGGASQPRRTLDG WQDWAKQRGHRGLAYVLVAEDGTLGGPVAKNLTEAERTGLADHVGAKPGDCIFFSAGP VKSSRALLGAARVEIANRLGLIDPDAWAFVWVVDPPLFEPADEATAAGEVAVGSGAWT AVHHAFTAPKPEWEDRIESDTGSVLADAYDIVCNGHEIGGGSVRIHRRDIQERVFAVM GLDKAEVEEKFGFLLEAFMFGAPPHGGIAFGWDRTTALLAGMDSIREVIAFPKTGGGV DPLTDAPAPITAQQRKESGIDAQPKRVQRHDQIFAVTTSHPSVNK" CDS 2869812..2870642 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2603" /product="2-dehydropantoate 2-reductase (EC" /EC_number="1.1.1.169" /note="Mb2603, -, len: 246 aa. Equivalent to Rv2573, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Conserved hypothetical protein, similar to various proteins e.g. Q9ABG6|CC0261 HYPOTHETICAL PROTEIN from Caulobacter crescentus (290 aa), FASTA scores: opt: 516, E(): 5.8e-26, (40.1% identity in 237 aa overlap); Q99R37|SA2393 HYPOTHETICAL PROTEIN (similar to 2-dehydropantoate 2-reductase) from Staphylococcus aureus subsp. aureus N315 (286 aa), FASTA scores: opt: 368, E(): 1.8e-16, (31.75% identity in 230 aa overlap); Q9KPQ9|VC2307 2-DEHYDROPANTOATE 2-REDUCTASE from Vibrio cholerae (296 aa), FASTA scores: opt: 223, E(): 3.9e-07, (27.7% identity in 224 aa overlap); etc. Equivalent to AAK46962 from Mycobacterium tuberculosis strain CDC1551 (275 aa) but shorter 29 aa. Mb2603 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1L0" /db_xref="InterPro:IPR003710" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR013332" /db_xref="InterPro:IPR013752" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1L0" /protein_id="SIU01221.1" /translation="MLHKAGYSPLLCGHTPRAGIELRRDGADPIVVPGPVHTSPREVA GPVDVLILAVKATQNDAARPWLTRLCDERTVVAVLQNGVEQVEQVQPHCPSSAVVPAI VWCSAETQPQGWVRLRGEAALVVPTGPAAEQFAGLLRGAGATVDCDPDFTTAAWRKLL VNALAGFMVLSGRRSAMFRRDDVAALSRRYVAECLAVARAEGARLDDDVVDEVVRLVR SAPQDMGTSMLADRAAHRPLEWDLRNGVIVRKARAHGLATPISDVLVPLLAAASDGPG " CDS 2870665..2871168 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2604" /product="conserved protein" /note="Mb2604, -, len: 167 aa. Equivalent to Rv2574, len: 167 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 167 aa overlap). Conserved hypothetical protein, showing similarity with Q9K3N3|SCG20A.07 HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (157 aa), FASTA scores: opt: 218, E(): 2.8e-08, (30.65% identity in 150 aa overlap). Protein product from Mb2604 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2604 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/Swiss-Prot:P65018" /protein_id="SIU01222.1" /translation="MYPCERVGLSFTETAPYLFRNTVDLAITPEQLFEVLADPQAWPR WATVITKVTWTSPEPFGAGTTRIVEMRGGIVGDEEFISWEPFTRMAFRFNECSTRAVG AFAEDYRVQAIPGGCRLTWTMAQKLAGPARPALFVFRPLLNLALRRFLRNLRRYTDAR FAAAQQS" CDS 2871198..2872079 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2605" /product="POSSIBLE CONSERVED MEMBRANE GLYCINE RICH PROTEIN" /note="Mb2605, -, len: 293 aa. Equivalent to Rv2575, len: 293 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 293 aa overlap). Possible conserved membrane gly-rich protein, highly similar to hypothetical proteins e.g. Q9RR98|DR2596 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (313 aa), FASTA scores: opt: 734, E(): 2.8e-38, (42.95% identity in 291 aa overlap); Q9HV81|PA4717 from Pseudomonas aeruginosa (297 aa), FASTA scores: opt: 641, E(): 1.5e-32, (43.35% identity in 300 aa overlap); Q98IA4|MLL2493 from Rhizobium loti (Mesorhizobium loti) (306 aa), FASTA scores: opt: 628, E(): 1e-31, (38.45% identity in 307 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb2605 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2605 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65020" /db_xref="InterPro:IPR007343" /db_xref="UniProtKB/Swiss-Prot:P65020" /protein_id="SIU01223.1" /translation="MTFNEGVQIDTSTTSTSGSGGGRRLAIGGGLGGLLVVVVAMLLG VDPGGVLSQQPLDTRDHVAPGFDLSQCRTGADANRFVQCRVVATGNSVDAVWKPLLPG YTRPHMRLFSGQVGTGCGPASSEVGPFYCPVDKTAYFDTDFFQVLVTQFGSSGGPFAE EYVVAHEYGHHVQNLLGVLGRAQQGAQGAAGSGVRTELQADCYAGVWAYYASTVKQES TGVPYLEPLSDKDIQDALAAAAAVGDDRIQQQTTGRTNPETWTHGSAAQRQKWFTVGY QTGDPNICDTFSAADLG" CDS complement(2872085..2872549) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2606C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2606c, -, len: 154 aa. Equivalent to Rv2576c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Possible conserved membrane protein, showing similarity with Q9ZFC2 HYPOTHETICAL 15.7 KDA PROTEIN from Mycobacterium sp. FM10 (146 aa), FASTA scores: opt: 235, E(): 4.1e-08, (31.35% identity in 150 aa overlap). Mb2606c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65022" /db_xref="InterPro:IPR016793" /db_xref="UniProtKB/Swiss-Prot:P65022" /protein_id="SIU01224.1" /translation="MPAGVGNASGSVLDMTSVRTVPSAVALVTFAGAALSGVIPAIAR ADPVGHQVTYTVTTTSDLMANIRYMSADPPSMAAFNADSSKYMITLHTPIAGGQPLVY TATLANPSQWAIVTASGGLRVNPEFHCEIVVDGQVVVSQDGGSGVQCSTRPW" CDS 2872777..2873028 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2607" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb2607, -, len: 83 aa. Equivalent to 5' end of Rv2577, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Conserved hypothetical protein, showing similarity with various proteins from eukaryotes, in particular phosphatases, e.g. Q9SE01|PAP PURPLE ACID PHOSPHATASE PRECURSOR (EC 3.1.3.2) from Glycine max (Soybean) (464 aa), FASTA scores: opt: 190, E(): 0.00026, (27.3% identity in 388 aa overlap); Q9SVP2|F18A5.90|AT4G13700 HYPOTHETICAL 53.4 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (474 aa), FASTA scores: opt: 280, E(): 6.6e-10, (27.2% identity in 331 aa overlap); Q9FK32 SIMILARITY TO UNKNOWN PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (529 aa), FASTA scores: opt: 249, E(): 6.2e-08, (25.3% identity in 435 aa overlap); Q12546|APHA ACID PHOSPHATASE PRECURSOR from Aspergillus ficuum (614 aa), FASTA scores: opt: 207, E(): 2.9e-05, (22.95% identity in 458 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2577 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (g-a) splits Rv2577 into 2 parts, Mb2607 and Mb2608. Mb2607 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006311" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P7" /protein_id="SIU01225.1" /translation="MGADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALL SSHPRGPAVWYQRGRSGAPPVGGLHLQFGRNASTEMVVS" CDS 2873062..2874366 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2608" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb2608, -, len: 434 aa. Equivalent to 3' end of Rv2577, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 434 aa overlap). Conserved hypothetical protein, showing similarity with various proteins from eukaryotes, in particular phosphatases, e.g. Q9SE01|PAP PURPLE ACID PHOSPHATASE PRECURSOR (EC 3.1.3.2) from Glycine max (Soybean) (464 aa), FASTA scores: opt: 190, E(): 0.00026, (27.3% identity in 388 aa overlap); Q9SVP2|F18A5.90|AT4G13700 HYPOTHETICAL 53.4 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (474 aa), FASTA scores: opt: 280, E(): 6.6e-10, (27.2% identity in 331 aa overlap); Q9FK32 SIMILARITY TO UNKNOWN PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (529 aa), FASTA scores: opt: 249, E(): 6.2e-08, (25.3% identity in 435 aa overlap); Q12546|APHA ACID PHOSPHATASE PRECURSOR from Aspergillus ficuum (614 aa), FASTA scores: opt: 207, E(): 2.9e-05, (22.95% identity in 458 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2577 exists as a single gene. In Mycobacterium bovis, a single base transition (g-a) splits Rv2577 into 2 parts, Mb2607 and Mb2608." /db_xref="GOA:A0A1R3Y2F9" /db_xref="InterPro:IPR004843" /db_xref="InterPro:IPR008963" /db_xref="InterPro:IPR029052" /db_xref="InterPro:IPR039331" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F9" /protein_id="SIU01226.1" /translation="MLGTPTSGFGSVVVAETRSYRDAKSNTEVRVNHAHLTNLTPDTD YVYAAVHDGTTPELGTARTAPSGRKPLRFTSFGDQSTPALGRLADGRYVSDNIGSPFA GDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTRSARYRPWMPAAGNHE NEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISLHNDDVCYQDGGNS YVRGYSGGEQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNGADLGIRQEWLP LFDQYQVDLVVCGHEHHYERSHPLRGALGTDTRTPIPVDTRSDLIDSTRGTVHLVIGG GGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRDNPYGFVA FDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG" CDS complement(2874368..2875390) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2609C" /product="Radical SAM domain protein" /note="Mb2609c, -, len: 340 aa. Equivalent to Rv2578c, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 340 aa overlap). Conserved hypothetical protein, highly similar to hypothetical proteins (conserved or not) e.g. Q9ZBJ3|SC9C7.17c from Streptomyces coelicolor (348 aa), FASTA scores: opt: 998, E(): 1.6e-55, (47.6% identity in 355 aa overlap); Q9I763|PA0069 from Pseudomonas aeruginosa (352 aa), FASTA scores: opt: 560, E(): 6e-28, (36.6% identity in 284 aa overlap); Q986C9|MLL7417 from Rhizobium loti (Mesorhizobium loti) (356 aa), FASTA scores: opt: 550, E(): 2.6e-27, (39.15% identity in 240 aa overlap); etc. Mb2609c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65024" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR040086" /db_xref="UniProtKB/Swiss-Prot:P65024" /protein_id="SIU01227.1" /translation="MRWARQAVAVNGMPVDDGALPGLQRIGLVRSVRAPQFDGITFHE VLCKSALNKVPNAAALPFRYTVNGYRGCSHACRYCFARPTHEYLDFNPGTDFDTQVVV KTNVAAVLRHELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGALAASGTPLSIL TKGTLLRRDLPLIAEAAQQVPVSVAVSLAVGDPELHRDVESGTPTPQARLALITAIRA AGLDCHVMVAPVLPQLTDSGEHLDQLLGQIAAAGATGVTVFGLHLRGSTRGWFMCWLA RAHPELVSRYRELYRRGPYLPPSYREMLRERVAPLIAKYRLAGDHRPAPPETEAALVP VQATLF" CDS 2875498..2876400 /codon_start=1 /transl_table=11 /gene="dhaA" /locus_tag="BQ2027_MB2610" /product="POSSIBLE HALOALKANE DEHALOGENASE DHAA (1-CHLOROHEXANE HALIDOHYDROLASE)" /note="Mb2610, dhaA, len: 300 aa. Equivalent to Rv2579, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 300 aa overlap). Possible dhaA, haloalkane dehalogenase (EC 3.8.1.5), strictly equivalent to Q9XB14|ISO-RV2579 HALOALKANE DEHALOGENASE (1-chlorohexane halidohydrolase) (EC 3.8.1.5) from Mycobacterium bovis (300 aa), FASTA scores: opt: 2075, E(): 7.1e-125, (99.35% identity in 300 aa overlap); note that only two residues, 120 and 293 are different. Also highly similar to others e.g. Q9ZER0|DHAAF HALOALKANE DEHALOGENASE from Mycobacterium sp strain GP1 (307 aa), FASTA scores: opt: 842, E(): 2.3e-46, (44.95% identity in 298 aa overlap); Q53042|DHAA HALOALKANE DEHALOGENASE from Rhodococcus rhodochrous, and Pseudomonas pavonaceae (293 aa), FASTA scores: opt: 837, E(): 4.5e-46, (44.6% identity in 298 aa overlap); etc. Note that this protein may also be a 1,3,4,6-tetrachloro-1,4-cyclohexadiene hydrolase (EC 3.8.1.-), because also highly similar to P51698|LINB_PSEPA 1,3,4,6-TETRACHLORO-1,4-CYCLOHEXADIENE HYDROLASE from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (see first citation below) (296 aa), FASTA scores: opt: 1494, E(): 6.8e-88, (69.5% identity in 295 aa overlap). Also shows some similarity with proteins from Mycobacterium tuberculosis e.g. Q50670|YM96_MYCTU|Rv2296|MT2353|MTCY339. 14c PUTATIVE HALOALKANE DEHALOGENASE (300 aa), FASTA scores: opt: 302, E(): 5.3e-12, (30.85% identity in 295 aa overlap); and Q50600|YJ33_MYCTU|Rv1833c|MT1881|MTCY1A11.10 HYPOTHETICAL 32.2 KDA PROTEIN (286 aa), FASTA scores: opt: 286, E(): 5.3e-11, (29.85% identity in 288 aa overlap). MAY BE BELONG TO ALPHA/BETA HYDROLASE FOLD FAMILY. Note that previously known as linB. Protein product from Mb2610 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2610 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q9XB14" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR023594" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:Q9XB14" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01228.1" /translation="MTAFGVEPYGQPKYLEIAGKRMAYIDEGKGDAIVFQHGNPTSSY LWRNIMPHLEGLGRLVACDLIGMGASDKLSPSGPDRYSYGEQRDFLFALWDTLDLGDH VVLVLHDWGSALGFDWANQHRDRVQGIAFMEAIVTPMTWADWPPAVRGVFQGFRSPQG EPMALEHNIFVERVLPGAILRQLSDEEMNHYRRPFVNGGEDRRPTLSWPRNLPIDGEP AEVVALVNEYRSWLEETDMPKLFINAEPGAIITGRIRDYVRSWPNQTEITVPGVHFVQ EDSPEEIGAAIAQFVRQLRSAAGV" CDS complement(2876680..2877951) /codon_start=1 /transl_table=11 /gene="hisS" /locus_tag="BQ2027_MB2611C" /product="probable histidyl-trna synthetase hiss (histidine--trna ligase) (hisrs) (histidine--translase)" /note="Mb2611c, hisS, len: 423 aa. Equivalent to Rv2580c, len: 423 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 423 aa overlap). Probable hisS, histidyl-tRNA synthetase (EC 6.1.1.21), equivalent to P46696|SYH_MYCLE|HISS|ML0494|MLCB1259.12|B1177_C3_248 HISTIDYL-TRNA SYNTHETASE from Mycobacterium leprae (427 aa), FASTA scores: opt: 2380, E(): 2.1e-131, (85.85% identity in 417 aa overlap). Also highly similar to many e.g. Q9KXP2|HISS from Streptomyces coelicolor (425 aa), FASTA scores: opt: 1542, E(): 1.4e-82, (56.0% identity in 418 aa overlap); O32422|SYH_STAAU|HISS from Staphylococcus aureus (420 aa), FASTA scores: opt: 1135, E(): 7.4e-59, (44.9% identity in 412 aa overlap); P04804|SYH_ECOLI|HISS|B2514 from Escherichia coli strain K12 (423 aa), FASTA scores: opt: 1099, E(): 9.4e-57, (43.9% identity in 417 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb2611c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2611c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67484" /db_xref="InterPro:IPR004154" /db_xref="InterPro:IPR004516" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR015807" /db_xref="InterPro:IPR033656" /db_xref="InterPro:IPR036621" /db_xref="InterPro:IPR041715" /db_xref="UniProtKB/Swiss-Prot:P67484" /protein_id="SIU01229.1" /translation="MTEFSSFSAPKGVPDYVPPDSAQFVAVRDGLLAAARQAGYSHIE LPIFEDTALFARGVGESTDVVSKEMYTFADRGDRSVTLRPEGTAGVVRAVIEHGLDRG ALPVKLCYAGPFFRYERPQAGRYRQLQQVGVEAIGVDDPALDAEVIAIADAGFRSLGL DGFRLEITSLGDESCRPQYRELLQEFLFGLDLDEDTRRRAGINPLRVLDDKRPELRAM TASAPVLLDHLSDVAKQHFDTVLAHLDALGVPYVINPRMVRGLDYYTKTAFEFVHDGL GAQSGIGGGGRYDGLMHQLGGQDLSGIGFGLGVDRTVLALRAEGKTAGDSARCDVFGV PLGEAAKLRLAVLAGRLRAAGVRVDLAYGDRGLKGAMRAAARSGARVALVAGDRDIEA GTVAVKDLTTGEQVSVSMDSVVAEVISRLAG" CDS complement(2877948..2878622) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2612C" /product="POSSIBLE GLYOXALASE II (HYDROXYACYLGLUTATHIONE HYDROLASE) (GLX II)" /note="Mb2612c, -, len: 224 aa. Equivalent to Rv2581c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Possible glyoxalase II (EC 3.1.2.6), equivalent to Q49649|YP81_MYCLE|ML0493|MLCB1259.11|B1177_C3_247 HYPOTHETICAL 23.9 KDA PROTEIN from Mycobacterium leprae (218 aa), FASTA scores: opt: 1264, E(): 7.8e-73, (82.0% identity in 222 aa overlap). Also highly similar to Q9KXP1|SC9C5.33c POSSIBLE HYDROLASE from Streptomyces coelicolor (235 aa), FASTA scores: opt: 654, E(): 2.9e-34, (46.8% identity in 220 aa overlap); and similar to Q9CI24|YFCI HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (210 aa), FASTA scores: opt: 360, E(): 9.9e-16, (35.0% identity in 217 aa overlap); AAK75726|SP1646 METALLO-BETA-LACTAMASE SUPERFAMILY PROTEIN from Streptococcus pneumoniae (209 aa), FASTA scores: opt: 320, E(): 3.3e-13, (35.85% identity in 198 aa overlap); AAK80229|CAC2272 PREDICTED ZN-DEPENDENT HYDROLASE OF METALLO-BETA-LACTAMASE SUPERFAMILY from Clostridium acetobutylicum (199 aa), FASTA scores: opt: 282, E(): 8e-11, (32.7% identity in 217 aa overlap); etc. Equivalent to AAK46971 from Mycobacterium tuberculosis strain CDC1551 (246 aa) but shorter 22 aa. BELONGS TO THE GLYOXALASE II FAMILY. COFACTOR: BINDS TWO ZINC IONS. Protein product from Mb2612c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2612c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64262" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/Swiss-Prot:P64262" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01230.1" /translation="MLITGFPAGLLACNCYVLAERPGTDAVIVDPGQGAMGTLRRILD KNRLTPAAVLLTHGHIDHIWSAQKVSDTFGCPTYVHPADRFMLTDPIYGLGPRIAQLV AGAFFREPKQVVELDRDGDKIDLGGISVNIDHTPGHTRGSVVFRVLQATNNDKDIVFT GDTLFERAIGRTDLAGGSGRDLLRSIVDKLLVLDDSTVVLPGHGNSTTIGAERRFNPF LEGLSR" CDS 2878673..2879599 /codon_start=1 /transl_table=11 /gene="ppiB" /locus_tag="BQ2027_MB2613" /product="PROBABLE PEPTIDYL-PROLYL CIS-TRANS ISOMERASE B PPIB (CYCLOPHILIN) (PPIASE) (ROTAMASE) (PEPTIDYLPROLYL ISOMERASE)" /note="Mb2613, ppiB, len: 308 aa. Equivalent to Rv2582, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 308 aa overlap). Probable ppiB (alternate gene name: ppi), cyclophilin (peptidyl-prolyl cis-trans isomerase) (EC 5.2.1.8), equivalent to P46697|PPIB_MYCLE|PPI|ML0492|MLCB1259.10c|B1177_F3_97 PROBABLE PEPTIDYL-PROLYL CIS-TRANS ISOMERASE B from Mycobacterium leprae (295 aa), FASTA scores: opt: 1423, E(): 1.3e-66, (72.2% identity in 295 aa overlap). Aldo similar to others e.g. Q9KJG8|PPIB PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Streptomyces lividans (277 aa), FASTA scores: opt: 485, E(): 3.2e-18, (38.35% identity in 292 aa overlap); Q9KXP0|SC9C5.34 PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Streptomyces coelicolor (277 aa), FASTA scores: opt: 483, E(): 4.1e-18, (38.35% identity in 292 aa overlap); Q9RT72|DR1893 PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Deinococcus radiodurans (350 aa), FASTA scores: opt: 296, E(): 2.2e-08, (29.0% identity in 276 aa overlap); etc. BELONGS TO THE CYCLOPHILIN-TYPE PPIASE FAMILY. Protein product from Mb2613 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2613 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1L9" /db_xref="InterPro:IPR002130" /db_xref="InterPro:IPR029000" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1L9" /protein_id="SIU01231.1" /translation="MGHLTPVAAPRLACAFVPTNAQRRATAKRKLERQLERRAKQAKR RRILTIVGGSLAAVAVIVAVVFTVVVNKDDHQSTTSATPTDSASTSPPQAATAPPLPP FKPSANLGANCQYPPSPDKAVKPVKLPRTGKVPTDPAQVSVSMVTNQGNIGLMLANNE SPCTVNSFVSLAQQGFFKGTTCHRLTTSPMLAVLQCGDPKGDGTGGPGYQFANEYPTD QYSANDPKLNEPVIYPRGTLAMANAGPNTNSSQFFMVYRDSKLPPQYTVFGTIQADGL TTLDKIAKAGVAGGGEDGKPATEVTITSVLLD" CDS complement(2879685..2882057) /codon_start=1 /transl_table=11 /gene="relA" /locus_tag="BQ2027_MB2614C" /product="PROBABLE GTP PYROPHOSPHOKINASE RELA (ATP:GTP 3'-PYROPHOSPHOTRANSFERASE) (PPGPP SYNTHETASE I) ((P)PPGPP SYNTHETASE) (GTP DIPHOSPHOKINASE)" /note="Mb2614c, relA, len: 790 aa. Equivalent to Rv2583c, len: 790 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 790 aa overlap). Probable relA, GTP pyrophosphokinase (EC 2.7.6.5), equivalent to Q49640|RELA_MYCLE|ML0491|MLCB1259.09|B1177_C1_168 PROBABLE GTP PYROPHOSPHOKINASE from Mycobacterium leprae (787 aa), FASTA scores: opt: 4834, E(): 0, (93.4% identity in 790 aa overlap). Also highly similar to others e.g. O87331|RELA_CORGL|RELA|REL from Corynebacterium glutamicum (Brevibacterium flavum) (760 aa), FASTA scores: opt: 3375, E(): 1.6e-196, (67.0% identity in 758 aa overlap); O85709|RELA_STRAT from Streptomyces antibioticus (841 aa), FASTA scores: opt: 3209, E(): 1.9e-186, (63.85% identity in 786 aa overlap); Q9KDH1|RELA|BH1242 from Bacillus halodurans (728 aa), FASTA scores: opt: 2195,E(): 3.8e-125, (45.65% identity in 714 aa overlap); etc. BELONGS TO THE RELA / SPOT FAMILY. Protein product from Mb2614c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2614c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66015" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR003607" /db_xref="InterPro:IPR004095" /db_xref="InterPro:IPR004811" /db_xref="InterPro:IPR007685" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR012676" /db_xref="InterPro:IPR033655" /db_xref="UniProtKB/Swiss-Prot:P66015" /protein_id="SIU01232.1" /translation="MAEDQLTAQAVAPPTEASAALEPALETPESPVETLKTSISASRR VRARLARRMTAQRSTTNPVLEPLVAVHREIYPKADLSILQRAYEVADQRHASQLRQSG DPYITHPLAVANILAELGMDTTTLVAALLHDTVEDTGYTLEALTEEFGEEVGHLVDGV TKLDRVVLGSAAEGETIRKMITAMARDPRVLVIKVADRLHNMRTMRFLPPEKQARKAR ETLEVIAPLAHRLGMASVKWELEDLSFAILHPKKYEEIVRLVAGRAPSRDTYLAKVRA EIVNTLTASKIKATVEGRPKHYWSIYQKMIVKGRDFDDIHDLVGVRILCDEIRDCYAA VGVVHSLWQPMAGRFKDYIAQPRYGVYQSLHTTVVGPEGKPLEVQIRTRDMHRTAEYG IAAHWRYKEAKGRNGVLHPHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQ EIFVFTPKGDVITLPTGSTPVDFAYAVHTEVGHRCIGARVNGRLVALERKLENGEVVE VFTSKAPNAGPSRDWQQFVVSPRAKTKIRQWFAKERREEALETGKDAMAREVRRGGLP LQRLVNGESMAAVARELHYADVSALYTAIGEGHVSAKHVVQRLLAELGGIDQAEEELA ERSTPATMPRRPRSTDDVGVSVPGAPGVLTKLAKCCTPVPGDVIMGFVTRGGGVSVHR TDCTNAASLQQQAERIIEVLWAPSPSSVFLVAIQVEALDRHRLLSDVTRALADEKVNI LSASVTTSGDRVAISRFTFEMGDPKHLGHLLNAVRNVEGVYDVYRVTSAA" CDS complement(2882088..2882759) /codon_start=1 /transl_table=11 /gene="apt" /locus_tag="BQ2027_MB2615C" /product="ADENINE PHOSPHORIBOSYLTRANSFERASE APT (APRT) (AMP DIPHOSPHORYLASE) (AMP PYROPHOSPHORYLASE) (TRANSPHOSPHORIBOSIDASE)" /note="Mb2615c, apt, len: 223 aa. Equivalent to Rv2584c, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 223 aa overlap). Probable apt, adenine phosphoribosyltransferase (EC 2.4.2.7), similar, but longer in N-terminus, to others e.g. O87330|APT_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (185 aa), FASTA scores: opt: 524, E(): 1.3e-24, (50.95% identity in 159 aa overlap); P52561|APT_STRCO from Streptomyces coelicolor (182 aa), FASTA scores: opt: 503, E(): 2.3e-23, (51.85% identity in 164 aa overlap); P47956|APT_MUSPA|APRT from Mus pahari (Shrew mouse) (180 aa), FASTA scores: opt: 419, E(): 2.5e-18, (44.7% identity in 170 aa overlap); P07672|P09993|P77121|APT_ECOLI|B0469 from Escherichia coli strain K12 (183 aa), FASTA scores: opt: 393, E(): 1.9e-18, (42.6% identity in 162 aa overlap); etc. Contains PS00103 Purine/ pyrimidine phosphoribosyl transferases signature, and PS00144 Asparaginase / glutaminase active site signature 1. BELONGS TO THE PURINE/PYRIMIDINE PHOSPHORIBOSYLTRANSFERASE FAMILY. Nearest initiation codon indicated by homology is TTG at 17426 or GTG at 17465. Protein product from Mb2615c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2615c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59959" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR005764" /db_xref="InterPro:IPR029057" /db_xref="UniProtKB/Swiss-Prot:P59959" /protein_id="SIU01233.1" /translation="MCHGGTWAGDYVLNVIATGLSLKARGKRRRQRWVDDGRVLALGE SRRSSAISVADVVASLTRDVADFPVPGVEFKDLTPLFADRRGLAAVTEALADRASGAD LVAGVDARGFLVAAAVATRLEVGVLAVRKGGKLPRPVLSEEYYREYGAATLEILAEGI EVAGRRVVIIDDVLATGGTIGATRRLLERGGANVAGAAVVVELAGLSGRAALAPLPVH SLSRL" CDS complement(2882863..2884536) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2616C" /product="POSSIBLE CONSERVED LIPOPROTEIN" /note="Mb2616c, -, len: 557 aa. Equivalent to Rv2585c, len: 557 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 557 aa overlap). Possible conserved lipoprotein precursor, possibly attached to the membrane by a lipid anchor and substrate-binding protein involved in transport, equivalent to Q49646|YP85_MYCLE|ML0489|MLCB1259.07|B1177_C2_197 HYPOTHETICAL LIPOPROTEIN PRECURSOR from Mycobacterium leprae (555 aa), FASTA scores: opt: 2812, E(): 9.8e-158, (78.95% identity in 546 aa overlap); and C-terminus highly similar to C-terminus of Q49638|DCIAE|B1177_C1_166 DCIAE PROTEIN from Mycobacterium leprae (344 aa), FASTA scores: opt: 1177, E(): 7.4e-62, (78.6% identity in 229 aa overlap). Also similar in part to various proteins, principally substrate-binding proteins, e.g. O87329|DCIAE DIPEPTIDE-BINDING PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (502 aa), FASTA scores: opt: 614, E(): 1.2e-28, (30.7% identity in 427 aa overlap); Q9AKR0|OPPA|CAC49261 PUTATIVE OLIGOPEPTIDE UPTAKE ABC TRANSPORTER PERIPLASMIC SOLUTE-BINDING PROTEIN PRECURSOR from Rhizobium meliloti (Sinorhizobium meliloti) (532 aa), FASTA scores: opt: 209, E(): 7.7e-05, (22.85% identity in 460 aa overlap); P76128|YDDS_ECOLI|B1487|P77769|P76874 PUTATIVE ABC TRANSPORTER PERIPLASMIC BINDING PROTEIN from Escherichia coli strain K12 (516 aa), FASTA scores: opt: 182, E(): 0.0029, (20.0% identity in 315 aa overlap); etc. Protein product from Mb2616c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2616c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59984" /db_xref="InterPro:IPR000914" /db_xref="InterPro:IPR039424" /db_xref="UniProtKB/Swiss-Prot:P59984" /protein_id="SIU01234.1" /translation="MAPRRRRHTRIAGLRVVGTATLVAATTLTACSGSAAAQIDYVVD GALVTYNTNTVIGAASAGAQAFARTLTGFGYHGPDGQVVADRDFGTVSVVEGSPLILD YQISDDAVYSDGRPVTCDDLVLAWAAQSGRFPGFDAATQAGYVDIANIECTAGQKKAR VSFIPDRSVVDHSQLFTATSLMPSHVIADQLHIDVTAALLSNNVSAVEQIARLWNSTW DLKPGRSHDEVRSRFPSSGPYKIESVLDDGAVVLVANDRWWGTKAITKRITVWPQGAD IQDRVNNRSVDVVDVAAGSSGSLVTPDSYQRTDYPSAGIEQLIFAPQGSLAQSRTRRA LALCVPRDAIARDAGVPIANSRLSPATDDALTDADGAAEARQFGRVDPAAARDALGGT PLTVRIGYGRPNARLAATIGTIADACAPAGITVSDVTVDTPGPQALRDGKIDVLLAST GGATGSGSSGSSAMDAYDLHSGNGNNLSGYANAQIDGIISALAVSADPAERARLLAEA APVLWDEMPTLPLYRQQRTLLMSTKMYAVSRNPTRWGAGWNMDRWALAR" CDS complement(2884542..2885870) /codon_start=1 /transl_table=11 /gene="secF" /locus_tag="BQ2027_MB2617C" /product="PROBABLE PROTEIN-EXPORT MEMBRANE PROTEIN SECF" /note="Mb2617c, secF, len: 442 aa. Equivalent to Rv2586c, len: 442 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 442 aa overlap). Probable secF, protein-export membrane protein (integral membrane protein), equivalent to P38386|SECF_MYCLE|SECF|ML0488|MLCB1259.06|B1177_C3_239 PROTEIN-EXPORT MEMBRANE PROTEIN from Mycobacterium leprae (471 aa), FASTA scores: opt: 1910, E(): 2.9e-104, (72.15% identity in 456 aa overlap). Also similar to others e.g. Q9AE06|SECF from Corynebacterium glutamicum (Brevibacterium flavum) (403 aa), FASTA scores: opt: 1198, E(): 9.8e-63, (47.1% identity in 399 aa overlap); Q53956|SECF_STRCO|SCL2.05c from Streptomyces coelicolor (373 aa), FASTA scores: opt: 670, E(): 6.4e-32, (39.25% identity in 400 aa overlap); Q55611|SECF_SYNY3|SLR0775 from Synechocystis sp. strain PCC 6803 (315 aa), FASTA scores: opt: 416, E(): 3.8e-17, (33.8% identity in 296 aa overlap); etc. BELONGS TO THE SECD/SECF FAMILY, SECF FAMILY. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF, SECG|Rv1440 AND SECY|Rv0732. Protein product from Mb2617c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2617c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3Q8" /db_xref="InterPro:IPR005665" /db_xref="InterPro:IPR022645" /db_xref="InterPro:IPR022646" /db_xref="InterPro:IPR022813" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q8" /protein_id="SIU01235.1" /translation="MASKAKTGRDDEATSAVELTEATESAVARTDGDSTTDTASKLGH HSFLSRLYTGTGAFEVVGRRRLWFGVSGAIVAVAIASIVFRGFTFGIDFKGGTTVSFP RGSTQVAQVEDVYYRALGSEPQSVVIVGAGASATVQIRSETLTSDQTAKLRDALFEAF GPKGTDGQPSKQAISDSAVSETWGGQITKKAVIALVVFLVLVALYITVRYERYMTISA ITAMLFDLTVTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHT TRRTFAEQANLAINQTFMRSINTSLIGVLPVLTLMVVAVWLLGVGTLKDLALVQLIGI IIGTYSSIFFATPLLVTLRERTELVRNHTRRVLKRRNSGSPAGSEDASTDGGEQPAAA DEQSLVGITQASSQSAPRAAQGSSKPAPGARPVRPVGTRRPPGKRNAGRR" CDS complement(2885874..2887595) /codon_start=1 /transl_table=11 /gene="secD" /locus_tag="BQ2027_MB2618C" /product="probable protein-export membrane protein secd" /note="Mb2618c, secD, len: 573 aa. Equivalent to Rv2587c, len: 573 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 573 aa overlap).Probable secD, protein-export membrane protein (integral membrane protein), equivalent to P38387|SECD_MYCLE|ML0487|MLCB1259.05|B1177_C1_164 PROTEIN-EXPORT MEMBRANE PROTEIN from Mycobacterium leprae (571 aa), FASTA scores: opt: 2948, E(): 2.6e-97, (80.6% identity in 583 aa overlap). Also similar to others e.g. Q9AE07|SECD from Corynebacterium glutamicum (Brevibacterium flavum) (637 aa), FASTA scores: opt: 1023, E(): 1.9e-29, (44.95% identity in 596 aa overlap); Q53955|SECD_STRCO from Streptomyces coelicolor (570 aa), FASTA scores: opt: 864, E(): 7.2e-24, (38.0% identity in 584 aa overlap); O33517|SECD_RHOCA from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (554 aa), FASTA scores: opt: 551, E(): 7.6e-13, (32.25% identity in 304 aa overlap); etc. Equivalent to AAK46977 from Mycobacterium tuberculosis strain CDC1551 (554 aa) but longer 19 aa. BELONGS TO THE SECD/SECF FAMILY, SECD FAMILY. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732. Protein product from Mb2618c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2618c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2G6" /db_xref="InterPro:IPR005791" /db_xref="InterPro:IPR022645" /db_xref="InterPro:IPR022646" /db_xref="InterPro:IPR022813" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2G6" /protein_id="SIU01236.1" /translation="MASSSAPVHPARYLSVFLVMLIGIYLLVFFTGDKHTAPKLGIDL QGGTRVTLTARTPDGSAPSREALAQAQQIISARVNGLGVSGSEVVVDGDNLVITVPGN DGSEARNLGQTARLYIRPVLNSMPAQPAAEEPQPAPSAEPQPPGQPAAPPPAQSGAPA SPQPGAQPRPYPQDPAPSPNPTSPASPPPAPPAEAPATDPRKDLAERIAQEKKLRQST NQYMQMVALQFQATRCESDDILAGNDDPKLPLVTCSTDHKTAYLLAPSIISGDQIQNA TSGMDQRGIGYVVDLQFKGPAANIWADYTAAHIGTQTAFTLDSQVVSAPQIQEAIPGG RTQISGGDPPFTAATARQLANVLKYGSLPLSFEPSEAQTVSATLGLSSLRAGMIAGAI GLLLVLVYSLLYYRVLGLLTALSLVASGSMVFAILVLLGRYINYTLDLAGIAGLIIGI GTTADSFVVFFERIKDEIREGRSFRSAVPRGWARARKTIVSGSAVTFLAAAVLYFLAI GQVKGFAFTLGLTTILDLVVVFLVTWPLVYLASKSSLLAKPAYNGLGAVQQVARERRA MARTGRG" CDS complement(2887705..2888052) /codon_start=1 /transl_table=11 /gene="yajC" /locus_tag="BQ2027_MB2619C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN SECRETION FACTOR YAJC" /note="Mb2619c, -, len: 115 aa. Equivalent to Rv2588c, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Probable yajC, secretion factor, a conserved membrane protein (see first citation below), equivalent to Q49647|YP88_MYCLE|ML0486|MLCB1259.04|B1177_C3_235 HYPOTHETICAL 12.8 KDA PROTEIN from Mycobacterium leprae (114 aa), FASTA scores: opt: 499, E(): 2.7e-26, (77.0% identity in 100 aa overlap). Also similar to other proteins e.g. Q9AE08 HYPOTHETICAL 13.5 KDA PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (121 aa), FASTA scores: opt: 222, E(): 5e-08, (39.8% identity in 103 aa overlap); Q9L292|SCL2.07c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (169 aa), FASTA scores: opt: 203, E(): 1.2e-06, (32.05% identity in 106 aa overlap); Q9CDT0|YWAB UNKNOWN PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (110 aa), FASTA scores: opt: 150, E(): 0.0026, (30.85% identity in 94 aa overlap); etc. Protein product from Mb2619c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2619c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65026" /db_xref="InterPro:IPR003849" /db_xref="UniProtKB/Swiss-Prot:P65026" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01237.1" /translation="MESFVLFLPFLLIMGGFMYFASRRQRRAMQATIDLHDSLQPGER VHTTSGLEATIVAIADDTIDLEIAPGVVTTWMKLAIRDRILPDDDIDEELNEDLDKDV DDVAGERRVTNDS" CDS 2888219..2889568 /codon_start=1 /transl_table=11 /gene="gabT" /locus_tag="BQ2027_MB2620" /product="4-AMINOBUTYRATE AMINOTRANSFERASE GABT (GAMMA-AMINO-N-BUTYRATE TRANSAMINASE) (GABA TRANSAMINASE) (GLUTAMATE:SUCCINIC SEMIALDEHYDE TRANSAMINASE) (GABA AMINOTRANSFERASE) (GABA-AT)" /note="Mb2620, gabT, len: 449 aa. Equivalent to Rv2589, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Probable gabT, 4-aminobutyrate aminotransferase (EC 2.6.1.9), equivalent to P40829|GABT_MYCLE|ML0485|MLCB1259.03c|B1177_F2_67 4-AMINOBUTYRATE AMINOTRANSFERASE (446 aa), FASTA scores: opt: 2468, E(): 4.5e-141, (83.75% identity in 449 aa overlap). Also highly similar to others e.g. O86823|GABT from Streptomyces coelicolor (444 aa), FASTA scores: opt: 1832, E(): 8e-103, (63.9% identity in 443 aa overlap); AAK79395|CAC1427 from Clostridium acetobutylicum (445 aa), FASTA scores: opt: 1283, E(): 8.4e-70, (45.75% identity in 433 aa overlap); Q9KE66|BH0991 from Bacillus halodurans (443 aa), FASTA scores: opt: 1224, E(): 2.9e-66, (44.55% identity in 431 aa overlap); etc. Contains PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb2620 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2620 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63505" /db_xref="InterPro:IPR004632" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:P63505" /protein_id="SIU01238.1" /translation="MASLQQSRRLVTEIPGPASQALTHRRAAAVSSGVGVTLPVFVAR AGGGIVEDVDGNRLIDLGSGIAVTTIGNSSPRVVDAVRTQVAEFTHTCFMVTPYEGYV AVAEQLNRITPGSGPKRSVLFNSGAEAVENAVKIARSYTGKPAVVAFDHAYHGRTNLT MALTAKSMPYKSGFGPFAPEIYRAPLSYPYRDGLLDKQLATNGELAAARAIGVIDKQV GANNLAALVIEPIQGEGGFIVPAEGFLPALLDWCRKNHVVFIADEVQTGFARTGAMFA CEHEGPDGLEPDLICTAKGIADGLPLSAVTGRAEIMNAPHVGGLGGTFGGNPVACAAA LATIATIESDGLIERARQIERLVTDRLTTLQAVDDRIGDVRGRGAMIAVELVKSGTTE PDAGLTERLATAAHAAGVIILTCGMFGNIIRLLPPLTIGDELLSEGLDIVCAILADL" CDS 2889730..2893236 /codon_start=1 /transl_table=11 /gene="fadD9" /locus_tag="BQ2027_MB2621" /product="PROBABLE FATTY-ACID-CoA LIGASE FADD9 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb2621, fadD9, len: 1168 aa. Equivalent to Rv2590, len: 1168 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1168 aa overlap). Probable fadD9, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to O69484|FADD9 (alias Q9CCT4|FADD9|ML0484 but longer 14 aa) PUTATIVE ACYL-COA SYNTHETASE from Mycobacterium leprae (1174 aa), FASTA scores: opt: 5247, E(): 0, (68.0% identity in 1178 aa overlap); Q49651|LCLA|B1177_F1_23 PUTATIVE LONG-CHAIN-FATTY-ACID--COA LIGASE from Mycobacterium leprae (827 aa), FASTA scores: opt: 3170, E(): 7.1e-181, (63.9% identity in 770 aa overlap). N-terminal (700 residues) similar to other long chain fatty acid ligases. And C-terminus highly similar to C-terminus of Q9XCF2|PSTB PSTB PROTEIN from Mycobacterium avium (2552 aa), FASTA scores: opt: 2083, E(): 8.4e-116, (40.8% identity in 1150 aa overlap) (and weak similarity on N-terminus); Q49653|POL1|B1177_F2_70 POL1 PROTEIN from Mycobacterium leprae (400 aa), FASTA scores: opt: 2066, E(): 2e-115, (76.25% identity in 404 aa overlap). C-terminal part highly similar to polyketide synthases and peptides synthases (weak similarity on N-terminus) e.g. Q10896|Rv0101|MTCY251.20|NRP PROBABLE PEPTIDE SYNTHETASE from Mycobacterium tuberculosis (2512 aa), FASTA scores: opt: 1988, E(): 3.7e-110, (40.2% identity in 1181 aa overlap); etc. Contains PS00455 putative AMP-binding domain signature, and PS00061 Short-chain alcohol dehydrogenase family signature. SEEMS TO BELONG TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY, AND TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb2621 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2621 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1N0" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010080" /db_xref="InterPro:IPR013120" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1N0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01239.1" /translation="MSINDQRLTRRVEDLYASDAQFAAASPNEAITQAIDQPGVALPQ LIRMVMEGYADRPALGQRALRFVTDPDSGRTMVELLPRFETITYRELWARAGTLATAL SAEPAIRPGDRVCVLGFNSVDYTTIDIALIRLGAVSVPLQTSAPVTGLRPIVTETEPT MIATSIDNLGDAVEVLAGHAPARLVVFDYHGKVDTHREAVEAARARLAGSVTIDTLAE LIERGRALPATPIADSADDALALLIYTSGSTGAPKGAMYRESQVMSFWRKSSGWFEPS GYPSITLNFMPMSHVGGRQVLYGTLSNGGTAYYVAKSDLSTLFEDLALVRPTELCFVP RIWDMVFAEFHSEVDRRLVDGADRAALEAQVKAELRENVLGGRFVMALTGSAPISAEM TAWVESLLADVHLVEGYGSTEAGMVLNDGMVRRPAVIDYKLVDVPELGYFGTDQPYPR GELLVKTQTMFPGYYQRPDVTAEVFDPDGFYRTGDIMAKVGPDQFVYLDRRNNVLKLS QGEFIAVSKLEAVFGDSPLVRQIFIYGNSARAYPLAVVVPSGDALSRHGIENLKPVIS ESLQEVARAAGLQSYEIPRDFIIETTPFTLENGLLTGIRKLARPQLKKFYGERLERLY TELADSQSNELRELRQSGPDAPVLPTLCRAAAALLGSTAADVRPDAHFADLGGDSLSA LSLANLLHEIFGVDVPVGVIVSPASDLRALADHIEAARTGVRRPSFASIHGRSATEVH ASDLTLDKFIDAATLAAAPNLPAPSAQVRTVLLTGATGFLGRYLALEWLDRMDLVNGK LICLVRARSDEEAQARLDATFDSGDPYLVRHYRELGAGRLEVLAGDKGEADLGLDRVT WQRLADTVDLIVDPAALVNHVLPYSQLFGPNAAGTAELLRLALTGKRKPYIYTSTIAV GEQIPPEAFTEDADIRAISPTRRIDDSYANGYANSKWAGEVLLREAHEQCGLPVTVFR CDMILADTSYTGQLNLPDMFTRLMLSLAATGIAPGSFYELDAHGNRQRAHYDGLPVEF VAEAICTLGTHSPDRFVTYHVMNPYDDGIGLDEFVDWLNSPTSGSGCTIQRIADYGEW LQRFETSLRALPDRQRHTSLLPLLHNYREPAKPICGSIAPTDQFRAAVQEAKIGPDKD IPHLTAAIIAKYISNLRLLGLL" CDS 2893410..2895050 /codon_start=1 /transl_table=11 /gene="PE_PGRS44" /locus_tag="BQ2027_MB2622" /product="pe-pgrs family protein pe_pgrs44" /note="Mb2622, PE_PGRS44, len: 546 aa. Equivalent to Rv2591, len: 543 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 546 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. O53845|Rv0834c|MTV043.26c from Mycobacterium tuberculosis (882 aa), FASTA scores: opt: 1813, E(): 5.8e-66, (55.3% identity in 568 aa overlap). Equivalent to AAK46982 from Mycobacterium tuberculosis strain CDC1551 (505 aa) but longer 38 aa. Contains PS00583 pfkB family of carbohydrate kinases signature 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp in-frame insertion (*-ggcggcacc) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (546 aa versus 543 aa). Mb2622 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1N3" /protein_id="SIU01240.1" /translation="MSFVTAAPEMLATAAQNVANIGTSLSAANATAAASTTSVLAAGA DEVSQAIARLFSDYATHYQSLNAQAAAFHHSFVQTLNAAGGAYSSAEAANASAQALEQ NLLAVINAPAQALFGRPLIGNGANGTAASPNGGDGGILYGNGGNGFSQTTAGVAGGAG GSAGLIGNGGNGGAGGAGAAGGAGGAGGWLLGNGGAGGPGGPTDVPAGTGGAGGAGGD APLIGWGGNGGPGGFAAFGNGGAGGNGGASGSLFGVGGAGGVGGSSEDVGGTGGAGGA GRGLFLGLGGDGGAGGTSNNNGGDGGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDG GAGGDSSALIGYAQGGSGGLGGFGESTGGDGGLGGAGAVLIGTGVGGFGGLGGGSNGT GGAGGAGGTGATLIGLGAGGGGSIGGFAVNVGNGVGGLGGQGGQGAALIGLGAGGAGG AGGATVVGLGGNDGDGGDGGGLFSIGVGGDGGNAGNGAMPANGGNGGNAGVIANGSFA PSFVGFGGNGGNGVNGGTGGTGGSGGILFGANGANGPS" CDS complement(2895067..2896101) /codon_start=1 /transl_table=11 /gene="ruvB" /locus_tag="BQ2027_MB2623C" /product="PROBABLE HOLLIDAY JUNCTION DNA HELICASE RUVB" /note="Mb2623c, ruvB, len: 344 aa. Equivalent to Rv2592c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Probable ruvB, Holliday junction binding protein (EC 3.6.1.-) (see first citation below), equivalent to P40833|RUVB_MYCLE|ML0483|B1177_C3_227 HOLLIDAY JUNCTION DNA HELICASE from Mycobacterium leprae (349 aa), FASTA scores: opt: 2059, E(): 2.1e-106, (94.45% identity in 342 aa overlap). Also highly similar to others e.g. Q9AE09|RUVB from Corynebacterium glutamicum (Brevibacterium flavum) (363 aa), FASTA scores: opt: 1651, E(): 6.5e-84, (75.6% identity in 332 aa overlap); Q9L291|RUVB from Streptomyces coelicolor (357 aa), FASTA scores: opt: 1530, E(): 3e-77, (68.2% identity in 343 aa overlap); P08577|RUVB_ECOLI|B1860|Z2912|ECS2570 from Escherichia coli strains K12 and O157:H7 (336 aa), FASTA scores: opt: 1284, E(): 1e-63, (55.45% identity in 330 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE RUVB FAMILY. Protein product from Mb2623c detected using SWATH mass spectrometry. Mb2623c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66754" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004605" /db_xref="InterPro:IPR008823" /db_xref="InterPro:IPR008824" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR041445" /db_xref="UniProtKB/Swiss-Prot:P66754" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01241.1" /translation="MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQL VIEGAKNRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAGDLAAML SNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIPLEVAPFTLVG ATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGILGIELGADAGAEIARRSR GTPRIANRLLRRVRDFAEVRADGVITRDVAKAALEVYDVDELGLDRLDRAVLSALTRS FGGGPVGVSTLAVAVGEEAATVEEVCEPFLVRAGMVARTPRGRVATALAWTHLGMTPP VGASQPGLFE" CDS complement(2896098..2896688) /codon_start=1 /transl_table=11 /gene="ruvA" /locus_tag="BQ2027_MB2624C" /product="PROBABLE HOLLIDAY JUNCTION DNA HELICASE RUVA" /note="Mb2624c, ruvA, len: 196 aa. Equivalent to Rv2593c, len: 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 196 aa overlap). Probable ruvA, Holliday junction binding protein (see citations below), equivalent to P40832|RUVA_MYCLE|ML0482|B1177_C2_188 HOLLIDAY JUNCTION DNA HELICASE from Mycobacterium leprae (203 aa), FASTA scores: opt: 923, E(): 9.9e-50, (76.85% identity in 203 aa overlap). Also highly similar to others e.g. Q9L290|RUVA from Streptomyces coelicolor (201 aa) (201 aa), FASTA scores: opt: 549, E(): 8.2e-27, (47.55% identity in 204 aa overlap); Q9AE10|RUVA from Corynebacterium glutamicum (Brevibacterium flavum) (206 aa), FASTA scores: opt: 440, E(): 4e-20, (47.1% identity in 206 aa overlap); P08576|RUVA_ECOLI|B1861|Z2913|ECS2571 from Escherichia coli strains K12 and O157:H7 (203 aa), FASTA scores: opt: 312, E(): 2.8e-12, (34.85% identity in 201 aa overlap); etc. BELONGS TO THE RUVA FAMILY. Protein product from Mb2624c detected using SWATH mass spectrometry. Mb2624c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66745" /db_xref="InterPro:IPR000085" /db_xref="InterPro:IPR003583" /db_xref="InterPro:IPR010994" /db_xref="InterPro:IPR011114" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR013849" /db_xref="InterPro:IPR036267" /db_xref="UniProtKB/Swiss-Prot:P66745" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01242.1" /translation="MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEA RLITAMIVREDSMTLYGFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQVLA DGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSPVVEALVGLGF AAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR" CDS complement(2896685..2897251) /codon_start=1 /transl_table=11 /gene="ruvC" /locus_tag="BQ2027_MB2625C" /product="PROBABLE CROSSOVER JUNCTION ENDODEOXYRIBONUCLEASE RUVC (HOLLIDAY JUNCTION NUCLEASE) (HOLLIDAY JUNCTION RESOLVASE)" /note="Mb2625c, ruvC, len: 188 aa. Equivalent to Rv2594c, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 188 aa overlap). Probable ruvC, Holliday junction resolvase (EC 3.1.22.4) (see citations below), equivalent to P40834|RUVC_MYCLE|ML0481|B1177_C3_226 CROSSOVER JUNCTION ENDODEOXYRIBONUCLEASE from Mycobacterium leprae (188 aa), FASTA scores: opt: 984, E(): 2.3e-55, (81.0% identity in 184 aa overlap). Also highly similar to others e.g. Q9AE11|RUVC from Corynebacterium glutamicum (Brevibacterium flavum) (221 aa), FASTA scores: opt: 713, E(): 3.6e-38, (56.9% identity in 188 aa overlap); Q9L289|RUVC_STRCO|SCL2.10c from Streptomyces coelicolor (188 aa), FASTA scores: opt: 704, E(): 1.2e-37, (60.65% identity in 178 aa overlap); P24239|RUVC_ECOLI|B1863 from Escherichia coli strain K12 (172 aa), FASTA scores: opt: 322, E(): 1.6e-13, (38.65% identity in 163 aa overlap); etc. BELONGS TO THE RUVC FAMILY. COFACTOR: MAGNESIUM. Mb2625c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66761" /db_xref="InterPro:IPR002176" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR020563" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/Swiss-Prot:P66761" /protein_id="SIU01243.1" /translation="MRVMGVDPGLTRCGLSLIESGRGRQLTALDVDVVRTPSDAALAQ RLLAISDAVEHWLDTHHPEVVAIERVFSQLNVTTVMGTAQAGGVIALAAAKRGVDVHF HTPSEVKAAVTGNGSADKAQVTAMVTKILALQAKPTPADAADALALAICHCWRAPTIA RMAEATSRAEARAAQQRHAYLAKLKAAR" CDS 2897360..2897605 /codon_start=1 /transl_table=11 /gene="vapb40" /locus_tag="BQ2027_MB2626" /product="possible antitoxin vapb40" /note="Mb2626, -, len: 81 aa. Equivalent to Rv2595, len: 81 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Conserved hypothetical protein, showing similarity with various bacterial proteins e.g. O28268|AF2011 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (86 aa), FASTA scores: opt: 120, E(): 0.13, (34.35% identity in 67 aa overlap); CAC46196|SMC01176 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (79 aa), FASTA scores: opt: 119, E(): 0.14, (33.35% identity in 63 aa overlap); P37554|SP5T_BACSU|SPOVT STAGE V SPORULATION PROTEIN T from Bacillus subtilis (178 aa), FASTA scores: opt: 104, E(): 2.9, (51.45% identity in 35 aa overlap); etc. Also similar to O07779|Rv0599c|MTCY19H5.23 hypothetical protein from Mycobacterium tuberculosis (78 aa), FASTA scores: opt: 160, E(): 0.00026, (35.8% identity in 81 aa overlap). Protein product from Mb2626 detected using SWATH mass spectrometry. Mb2626 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65028" /db_xref="InterPro:IPR007159" /db_xref="InterPro:IPR037914" /db_xref="UniProtKB/Swiss-Prot:P65028" /protein_id="SIU01244.1" /translation="MRTTIDVAGRLVIPKRIRERLGLRGNDQVEITERDGRIEIEPAP TGVELVREGSVLVARPERPLPPLTDEIVRETLDRTRR" CDS 2897602..2898006 /codon_start=1 /transl_table=11 /gene="vapc40" /locus_tag="BQ2027_MB2627" /product="possible toxin vapc40. contains pin domain." /note="Mb2627, -, len: 134 aa. Equivalent to Rv2596, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 134 aa overlap). Conserved hypothetical protein, only similar to O07780|Rv0598c|MTCY19H5.24 HYPOTHETICAL 14.8 KDA PROTEIN from Mycobacterium tuberculosis (137 aa), FASTA scores: opt: 254, E(): 8.8e-11, (41.55% identity in 130 aa overlap). Protein product from Mb2627 detected using SWATH mass spectrometry. Mb2627 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3R7" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R7" /protein_id="SIU01245.1" /translation="MIAPDTSVLVAGFATWHEGHEAAVRALNRGVHLIAHAAVETYSV LTRLPPPHRIAPVAVHAYLADITSSNYLALDARSYRGLTDHLAEHDVTGGATYDALVG FTAKAAGAKLLTRDLRAVETYERLRVEVELVT" CDS 2898223..2898843 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2628" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb2628, -, len: 206 aa. Equivalent to Rv2597, len: 206 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 206 aa overlap). Probable membrane protein. Protein product from Mb2628 detected using SWATH mass spectrometry. Mb2628 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65030" /db_xref="InterPro:IPR025235" /db_xref="UniProtKB/Swiss-Prot:P65030" /protein_id="SIU01246.1" /translation="MGNLLVVIAVALFIAAIVVLVVAIRRPKTPATPGGRRDPLAFDA MPQFGPRQLGPGAIVSHGGIDYVVRGSVTFREGPFVWWEHLLEGGDTPTWLSVQEDDG RLELAMWVKRTDLGLQPGGQHVIDGVTFQETERGHAGYTTEGTTGLPAGGEMDYVDCA SAGQGADESMLLSFERWAPDMGWEIATGKSVLAGELTVYPAPPVSA" CDS 2898854..2899348 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2629" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2629, -, len: 164 aa. Equivalent to Rv2598, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 164 aa overlap). Conserved hypothetical protein, showing similarity with hypothetical proteins from Streptomyces coelicolor e.g. Q9X8S3|SCH10.34c (185 aa), FASTA scores: opt: 197, E(): 3.5e-06, (34.75% identity in 167 aa overlap); and Q9L088|SCC24.29c (172 aa), FASTA scores: opt: 149, E(): 0.0053, (37.65% identity in 146 aa overlap). Equivalent to AAK46988 from Mycobacterium tuberculosis strain CDC1551 (154 aa) but longer 10 aa. Mb2629 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024486" /db_xref="UniProtKB/Swiss-Prot:P65032" /protein_id="SIU01247.1" /translation="MPLHQLAIAPVDVSGALLGLVLNAPAPRPLATHRLAHTDGSALQ LGVLGASHVVTVEGRFCEEVSCVARSRGGDLPESTHAPGYHLQSHTETHDEAAFRRLA RHLRERCTRATGWLGGVFPGDDAALTALAAEPDGTGWRWRTWHLYPSASGGTVVHTTS RWRP" CDS 2899345..2899776 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2630" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb2630, -, len: 143 aa. Equivalent to Rv2599, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 143 aa overlap). Probable conserved membrane protein, equivalent to Q9K536|2599 HYPOTHETICAL 15.0 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (143 aa), FASTA scores: opt: 691, E(): 1.7e-33, (68.55% identity in 143 aa overlap). Shows weak similarity with Q9L089|SCC24.28c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (131 aa), FASTA scores: opt: 130, E(): 0.52, (26.45% identity in 136 aa overlap). Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. Protein product from Mb2630 detected using SWATH mass spectrometry. Mb2630 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025341" /db_xref="UniProtKB/TrEMBL:A0A1R3Y235" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01248.1" /translation="MSRNRLFLVAGILAVAAAVSLISGITLLNRDVGSYIASHYRQES RDVNGTRYLCTGSPKQVATTLVKYQTPAARASHTDTEYLRYRNNIVTVGPDGTYPCII RVENLSAGYNHGAYVFLGPGFTPGSPSGGSGGSPGGPGGSK" CDS 2899858..2900259 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2631" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2631, -, len: 133 aa. Equivalent to Rv2600, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Probable conserved integral membrane protein, equivalent (but shorter 18 aa) to Q9K537|YQ00_MYCPA HYPOTHETICAL PROTEIN RV2600 HOMOLOG from Mycobacterium paratuberculosis (151 aa), FASTA scores: opt: 543, E(): 4.2e-28, (62.9% identity in 132 aa overlap). Also some similarity with other hypothetical or membrane proteins e.g. Q9L090|SCC24.27c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (146 aa), FASTA scores: opt: 241, E(): 8.7e-09, (34.8% identity in 135 aa overlap); O58487|PH0773 HYPOTHETICAL 15.0 KDA PROTEIN from Pyrococcus horikoshii (138 aa), FASTA scores: opt: 116, E(): 0.84, (34.35% identity in 96 aa overlap); etc. Equivalent to AAK46990 from Mycobacterium tuberculosis strain CDC1551 (152 aa) but shorter 19 aa. Mb2631 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P68914" /db_xref="InterPro:IPR007140" /db_xref="UniProtKB/Swiss-Prot:P68914" /protein_id="SIU01249.1" /translation="MVATVLYFLVGAAVLVAGFLMVNLLTPGDLRRLVFIDRRPNAVV LAATMYVALAIVTIAAIYASSNQLAQGLIGVAVYGIVGVALQGVALVILEIAVPGRFR EHIDAPALHPAVFATAVMLLAVAGVIAAALS" CDS 2900256..2901827 /codon_start=1 /transl_table=11 /gene="speE" /locus_tag="BQ2027_MB2632" /product="probable spermidine synthase spee (putrescine aminopropyltransferase) (aminopropyltransferase) (spdsy)" /note="Mb2632, speE, len: 523 aa. Equivalent to Rv2601, len: 523 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 523 aa overlap). Probable speE, spermidine synthase (EC 2.5.1.16), highly similar to many e.g. Q9L091|SCC24.26c from Streptomyces coelicolor (531 aa), FASTA scores: opt: 1493, E(): 1.3e-79, (48.45% identity in 514 aa overlap); Q9X8S2|SCH10.33c from Streptomyces coelicolor (554 aa), FASTA scores: opt: 1045, E(): 1.7e-53, (40.55% identity in 525 aa overlap); P09158|SPEE_ECOLI|B0121 from Escherichia coli strain K12 (287 aa), FASTA scores: opt: 368, E(): 2.9e-14, (30.5% identity in 272 aa overlap); etc. Protein product from Mb2632 detected using SWATH mass spectrometry." /db_xref="GOA:Q7TY95" /db_xref="InterPro:IPR001045" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR030373" /db_xref="InterPro:IPR030374" /db_xref="UniProtKB/Swiss-Prot:Q7TY95" /protein_id="SIU01250.1" /translation="MTSTRQAGEATEASVRWRAVLLAAVAACAACGLVYELALLTLAA SLNGGGIVATSLIVAGYIAALGAGALLIKPLLAHAAIAFIAVEAVLGIIGGLSAAALY AAFAFLDELDGSTLVLAVGTALIGGLVGAEVPLLMTLLQRGRVAGAADAGRTLANLNA ADYLGALVGGLAWPFLLLPQLGMIRGAAVTGIVNLAAAGVVSIFLLRHVVSGRQLVTA LCALAAALGLIATLLVHSHDIETTGRQQLYADPIIAYRHSAYQEIVVTRRGDDLRLYL DGGLQFCTRDEYRYTESLVYPAVSDGARSVLVLGGGDGLAARELLRQPGIEQIVQVEL DPAVIELARTTLRDVNAGSLDNPRVHVVIDDAMSWLRGAAVPPAGFDAVIVDLRDPDT PVLGRLYSTEFYALAARALAPGGLMVVQAGSPYSTPTAFWRIISTIRSAGYAVTPYHV HVPTFGDWGFALARLTDIAPTPAVPSTAPALRFLDQQVLEAATVFSGDIRPRTLDPST LDNPHIVEDMRHGWD" CDS 2901938..2902225 /codon_start=1 /transl_table=11 /gene="vapb41" /locus_tag="BQ2027_MB2633" /product="possible antitoxin vapb41" /note="Mb2633, -, len: 95 aa. Equivalent to Rv2601A, len: 95 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 95 aa overlap). Mb2633 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1N9" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR013321" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1N9" /protein_id="SIU01251.1" /translation="MKTTLDLPDELMRAIKVRAAQQGRKMKDVVTELLRSGLSQTHSG APIPTPRRVQLPLVHCGGAATREQEMTPERVAAALLDQEAQWWSGHDDAAL" CDS 2902212..2902652 /codon_start=1 /transl_table=11 /gene="vapc41" /locus_tag="BQ2027_MB2634" /product="possible toxin vapc41. contains pin domain." /note="Mb2634, -, len: 146 aa. Equivalent to Rv2602, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 146 aa overlap). Conserved hypothetical protein, some weak similarity with proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O50457|Rv1242|MTV006.14 (143 aa), FASTA scores: opt: 147, E(): 0.0021, (26.25% identity in 141 aa overlap); P95023|Rv2530c|MTCY159.26 (139 aa), FASTA scores: opt: 131, E(): 0.027, (33.35% identity in 135 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 125, E(): 0.072, (26.45% identity in 140 aa overlap). Protein product from Mb2634 detected using SWATH mass spectrometry. Mb2634 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1P7" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P7" /protein_id="SIU01252.1" /translation="MLLCDTNIWLALALSGHVHHRASRAWLDTINAPGVIHFCRATQQ SLLRLLTNRTVLGAYGSPPLTNREAWAAYAAFLDDDRIVLAGAEPDGLEAQWRAFAVR QSPAPKVWMDAYLAAFALTGGFELVTTDTAFTQYGGIELRLLAK" CDS complement(2902673..2903428) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2635C" /product="Probable transcriptional regulatory protein YebC" /note="Mb2635c, -, len: 251 aa. Equivalent to Rv2603c, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Highly conserved hypothetical protein, equivalent to Q49645|YQ03_MYCLE|ML0475|U1177B|B1177_C2_181 HYPOTHETICAL 26.6 KDA PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 1514, E(): 2.2e-84, (92.45% identity in 251 aa overlap). Also highly similar to Q9L288|SCL2.11c HYPOTHETICAL 26.8 KDA PROTEIN from Streptomyces coelicolor (250 aa), FASTA scores: opt: 1268, E(): 1.5e-69, (76.7% identity in 249 aa overlap); Q9AE12|YFCA HYPOTHETICAL STRUCTURAL PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (251 aa), FASTA scores: opt: 1231, E(): 2.6e-67, (72.9% identity in 251 aa overlap); O83487|Y474_TREPA|TP0474 HYPOTHETICAL PROTEIN from Treponema pallidum (245 aa), FASTA scores: opt: 780, E(): 4.4e-40, (47.75% identity in 245 aa overlap); P24237|YEBC_ECOLI|B1864 PROTEIN YEBC from Escherichia coli strain K12 (246 aa), FASTA scores: opt: 776, E(): 7.6e-40, (47.8% identity in 249 aa overlap); etc. Protein product from Mb2635c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2635c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67178" /db_xref="InterPro:IPR002876" /db_xref="InterPro:IPR017856" /db_xref="InterPro:IPR026564" /db_xref="InterPro:IPR029072" /db_xref="UniProtKB/Swiss-Prot:P67178" /protein_id="SIU01253.1" /translation="MSGHSKWATTKHKKAVVDARRGKMFARLIKNIEVAARVGGGDPA GNPTLYDAIQKAKKSSVPNENIERARKRGAGEEAGGADWQTIMYEGYAPNGVAVLIEC LTDNRNRAASEVRVAMTRNGGTMADPGSVSYLFSRKGVVTLEKNGLTEDDVLAAVLEA GAEDVNDLGDSFEVISEPAELVAVRSALQDAGIDYESAEASFQPSVSVPVDLDGARKV FKLVDALEDSDDVQNVWTNVDVSDEVLAALDDE" CDS complement(2903561..2904157) /codon_start=1 /transl_table=11 /gene="snop" /locus_tag="BQ2027_MB2636C" /product="probable glutamine amidotransferase snop" /note="Mb2636c, -, len: 198 aa. Equivalent to Rv2604c, len: 198 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 198 aa overlap). Conserved hypothetical protein, equivalent (but shorter 21 aa) to Q49637|HISH|B1177_C1_149 HISH PROTEIN (BELONGS TO THE YFL060C/YAAE/HI1648 FAMILY) (alias Q9CCT5|ML0474 HYPOTHETICAL PROTEIN 223 aa) from Mycobacterium leprae (219 aa), FASTA scores: opt: 1069, E(): 1.7e-60, (83.35% identity in 198 aa overlap). Also highly similar to hypothetical proteins or amidotransferases e.g. Q9L287|SCL2.12c HYPOTHETICAL 21.5 KDA PROTEIN from Streptomyces coelicolor (202 aa), FASTA scores: opt: 702, E(): 2.3e-37, (56.75% identity in 192 aa overlap); P37528|YAAE_BACSU HYPOTHETICAL 21.4 KDA PROTEIN from Bacillus subtilis (196 aa), FASTA scores: opt: 608, E(): 1.9e-31, (48.7% identity in 189 aa overlap); Q9KGN5|BH0023 AMIDOTRANSFERASE from Bacillus halodurans (196 aa), FASTA scores: opt: 583, E(): 7.4e-30, (48.7% identity in 195 aa overlap); etc. Also some similarity with several proteins from Mycobacterium tuberculosis e.g. O06589|HIS5_MYCTU|Rv1602|MT1638|MTCY336.02c AMIDOTRANSFERASE (EC 2.4.2.-) (206 aa), FASTA scores: opt: 154, E(): 0.00036, (30.6% identity in 193 aa overlap). Protein product from Mb2636c detected using SWATH mass spectrometry. Mb2636c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TY92" /db_xref="InterPro:IPR002161" /db_xref="InterPro:IPR021196" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/Swiss-Prot:Q7TY92" /protein_id="SIU01254.1" /translation="MSVPRVGVLALQGDTREHLAALRECGAEPMTVRRRDELDAVDAL VIPGGESTTMSHLLLDLDLLGPLRARLADGLPAYGSCAGMILLASEILDAGAAGRQAL PLRAMNMTVRRNAFGSQVDSFEGDIEFAGLDDPVRAVFIRAPWVERVGDGVQVLARAA GHIVAVRQGAVLATAFHPEMTGDRRIHQLFVDIVTSAA" CDS complement(2904165..2905010) /codon_start=1 /transl_table=11 /gene="tesB2" /locus_tag="BQ2027_MB2637C" /product="PROBABLE ACYL-COA THIOESTERASE II TESB2 (TEII)" /note="Mb2637c, tesB2, len: 281 aa. Equivalent to Rv2605c, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 281 aa overlap). Probable tesB2, acyl-CoA thioesterase II (EC 3.1.2.-), highly similar to others e.g. Q98EG9|MLL4250 from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 563, E(): 3.9e-29, (47.75% identity in 287 aa overlap); CAC47767 from Rhizobium meliloti (Sinorhizobium meliloti) (294 aa), FASTA scores: opt: 553, E(): 1.8e-28, (49.3% identity in 280 aa overlap); P23911|TESB_ECOLI|B0452 from Escherichia coli strain K12 (285 aa), FASTA scores: opt: 487, E(): 3.1e-24, (41.9% identity in 277 aa overlap); etc. Also similar to O06135|TESB1|Rv1618|MTCY01B2.10 ACYL-COA THIOESTERASE II from Mycobacterium tuberculosis (300 aa), FASTA scores: opt: 425, E(): 1.1e-21, (34.9% identity in 278 aa overlap). BELONGS TO THE C/M/P THIOESTER HYDROLASE FAMILY. Protein product from Mb2637c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2637c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3T0" /db_xref="InterPro:IPR003703" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR042171" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01255.1" /translation="MSIEEILDLEQLEVNIYRGSVFSPESGFLQRTFGGHVAGQSLVS AVRTVDPRYMVHSLHGYFLRPGDAKERTVFLVERIRDGGSLCTRRVNAVQHGETIFSM AASFQTEQEGITHQDVMPAAPPPDGLPGLNSIKVFDDAGFRQFDEWDVCIVPRERLRL LPGKASQQQVWLRHRDPLPDDPVLHICALAYMSDLTLLGSAQVNHLDVRDQLQVASLD HAMWFMRPFRADEWLLYDQSSPSASGGRALTRGEIFTRSGEMVAAVMQEGLTRHRRGH RSVGQ" CDS complement(2905039..2905938) /codon_start=1 /transl_table=11 /gene="snzp" /locus_tag="BQ2027_MB2638C" /product="possible pyridoxine biosynthesis protein snzp" /note="Mb2638c, -, len: 299 aa. Equivalent to Rv2606c, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). Conserved hypothetical protein, equivalent to O07145|YQ06_MYCLE|ML0450|MLCL581.12c HYPOTHETICAL 35.6 KDA PROTEIN from Mycobacterium leprae (307 aa), FASTA scores: opt: 1686, E(): 1.5e-95, (89.7% identity in 291 aa overlap). Also highly similar to other hypothetical proteins (or product of pyroA gene) e.g. Q9L286|SCL2.13c HYPOTHETICAL 32.2 KDA PROTEIN from Streptomyces coelicolor (303 aa), FASTA scores: opt: 1461, E(): 7.6e-82, (76.8% identity in 293 aa overlap); O14027|YEM4_SCHPO|SPAC29B12.04 HYPOTHETICAL 31.4 KDA PROTEIN from Schizosaccharomyces pombe (Fission yeast) (296 aa), FASTA scores: opt: 1318, E(): 3.8e-73, (70.35% identity in 290 aa overlap); Q9UW83|PYROA PROTEIN INVOLVED IN PYRIDOXINE BIOSYNTHESIS from Emericella nidulans (Aspergillus nidulans) (see citation below) (304 aa), FASTA scores: opt: 1288, E(): 2.6e-71, (67.9% identity in 302 aa overlap); etc. Protein product from Mb2638c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2638c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P60795" /db_xref="InterPro:IPR001852" /db_xref="InterPro:IPR011060" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR033755" /db_xref="UniProtKB/Swiss-Prot:P60795" /protein_id="SIU01256.1" /translation="MDPAGNPATGTARVKRGMAEMLKGGVIMDVVTPEQARIAEGAGA VAVMALERVPADIRAQGGVSRMSDPDMIEGIIAAVTIPVMAKVRIGHFVEAQILQTLG VDYIDESEVLTPADYAHHIDKWNFTVPFVCGATNLGEALRRISEGAAMIRSKGEAGTG DVSNATTHMRAIGGEIRRLTSMSEDELFVAAKELQAPYELVAEVARAGKLPVTLFTAG GIATPADAAMMMQLGAEGVFVGSGIFKSGAPEHRAAAIVKATTFFDDPDVLAKVSRGL GEAMVGINVDEIAVGHRLAQRGW" CDS 2906066..2906740 /codon_start=1 /transl_table=11 /gene="pdxH" /locus_tag="BQ2027_MB2639" /product="PROBABLE PYRIDOXAMINE 5'-PHOSPHATE OXIDASE PDXH (PNP/PMP OXIDASE) (PYRIDOXINEPHOSPHATE OXIDASE) (PNPOX) (PYRIDOXINE 5'-PHOSPHATE OXIDASE)" /note="Mb2639, pdxH, len: 224 aa. Equivalent to Rv2607, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Probable pdxH, pyridoxinephosphate oxidase (EC 1.4.3.5), equivalent to O33065|PDXH_MYCLE|ML2131|MLCB57.46 PYRIDOXAMINE 5'-PHOSPHATE OXIDASE from Mycobacterium leprae (219 aa), FASTA scores: opt: 1038, E(): 8.3e-61, (67.1% identity in 219 aa overlap). Also similar to others e.g. Q9I4S5|PDXH|PA1049 from Pseudomonas aeruginosa (215 aa), FASTA scores: opt: 608, E(): 1.1e-32, (49.55% identity in 218 aa overlap); Q9K3V7|SCD10.19c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 600, E(): 3.9e-32, (42.3% identity in 234 aa overlap); P28225|PDXH_ECOLI|B1638 from Escherichia coli strain K12 (217 aa), FASTA scores: opt: 533, E(): 8.9e-28, (40.3% identity in 216 aa overlap); etc. BELONGS TO THE PYRIDOXAMINE 5'-PHOSPHATE OXIDASE FAMILY. COFACTOR: FMN. Protein product from Mb2639 detected using SWATH mass spectrometry. Mb2639 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65683" /db_xref="InterPro:IPR000659" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019576" /db_xref="InterPro:IPR019740" /db_xref="UniProtKB/Swiss-Prot:P65683" /protein_id="SIU01257.1" /translation="MDDDAQMVAIDKDQLARMRGEYGPEKDGCGDLDFDWLDDGWLTL LRRWLNDAQRAGVSEPNAMVLATVADGKPVTRSVLCKILDESGVAFFTSYTSAKGEQL AVTPYASATFPWYQLGRQAHVQGPVSKVSTEEIFTYWSMRPRGAQLGAWASQQSRPVG SRAQLDNQLAEVTRRFADQDQIPVPPGWGGYRIAPEIVEFWQGRENRMHNRIRVANGR LERLQP" CDS 2906914..2908656 /codon_start=1 /transl_table=11 /gene="PPE42" /locus_tag="BQ2027_MB2640" /product="ppe family protein ppe42" /note="Mb2640, PPE42, len: 580 aa. Equivalent to Rv2608, len: 580 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 580 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O06828|Rv1430|MTCY493.24c from Mycobacterium tuberculosis (528 aa), FASTA scores: opt: 1004, E(): 5.9e-48, (56.05% identity in 307 aa overlap). Mb2640 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR013228" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y246" /protein_id="SIU01258.1" /translation="MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGS FASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEATLA ATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANI GIGNIGDRNLGIGNTGNWNIGIGITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGT DSLLSLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTN LHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNR PDGGILTRFGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAI AGILFLHSGLIALPPDLASGVVQPVSSPDVLTTYILLPSQDLPLLVPLRAIPLLGNPL ADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVNDALSGL GLPPPWQPALPRLF" CDS complement(2908678..2909733) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2641C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb2641c, -, len: 351 aa. Equivalent to Rv2609c, len: 351 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 351 aa overlap). Probable conserved membrane protein, equivalent to O07146|MLCL581.13c|ML0451 HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium leprae (349 aa), FASTA scores: opt: 1675, E(): 1.4e-95, (77.85% identity in 334 aa overlap). Also similar to hypothetical proteins: O69888|SC2E1.17|MUTT HYPOTHETICAL 19.4 KDA PROTEIN from Streptomyces coelicolor and Streptomyces lividans (172 aa), FASTA scores: opt: 345, E(): 3.5e-14, (44.7% identity in 161 aa overlap); Q9L285|SCL2.14c HYPOTHETICAL 19.8 KDA PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 179, E(): 0.00056, (43.25% identity in 171 aa overlap); and Q9RYE5|DR0004 MUTT/NUDIX FAMILY PROTEIN from Deinococcus radiodurans (350 aa), FASTA scores: opt: 153, E(): 0.037, (33.35% identity in 123 aa overlap). Contains PS00893 mutT domain signature. BELONGS TO THE MUTT/NUDIX FAMILY. Protein product from Mb2641c detected using SWATH mass spectrometry. Mb2641c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1P9" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1P9" /protein_id="SIU01259.1" /translation="MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLD SALARRAVVARAVAIDAYGGAPQGSRLAALADAAEGAPRHARENAENELSAALAMVNP ASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGGTAVLPTYFEI VERPHALVHGDQGASGRRTSARVVLLDDSGAVLLLCGSDPANPAFRDGAAPKWWFTVG GQVRPGERLAQAAARELAEETGLRVAPADMIGPIWRRDEVFEFNGSLIDSEEFYLVHR TRRFEPAVQGRTELERRYIRDARWCDANDIAQLVAAGERVYPLQLGELLPAANRLVDV ALDNGAARDAGVPQPIR" CDS complement(2909733..2910869) /codon_start=1 /transl_table=11 /gene="pimA" /locus_tag="BQ2027_MB2642C" /product="ALPHA-MANNOSYLTRANSFERASE PIMA" /note="Mb2642c, pimA, len: 378 aa. Equivalent to Rv2610c, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 378 aa overlap). pimA, alpha-mannosyltransferase (EC 2.4.1.-) (see citations below), equivalent to O07147|MLCL581.14c|ML0452 PUTATIVE GLYCOSYLTRANSFERASE from Mycobacterium leprae (374 aa), FASTA scores: opt: 2044, E(): 8.8e-118, (82.25% identity in 378 aa overlap). N-terminus (from aa 1 to 27) equivalent to Q9FY7 PUTATIVE ALPHA-MANNOSYL TRANSFERASE (FRAGMENT) from Mycobacterium smegmatis (27 aa), BLASTP scores: 57.4 bits (137), E(): 3e-8, Identities = 25/27 (92%), Positives = 27/27 (99%) (see citation below). Also highly similar to Q9L284|SCL2.15c PUTATIVE SUGAR TRANSFERASE from Streptomyces coelicolor (387 aa), FASTA scores: opt: 1222, E(): 1.8e-67, (52.95% identity in 376 aa overlap); and similar in part to various proteins e.g. Q9YA73|APE2066 LONG HYPOTHETICAL N-ACETYLGLUCOSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN from Aeropyrum pernix (392 aa), FASTA scores: opt: 434, E(): 3e-19, (31.5% identity in 378 aa overlap); Q9UZA1|PAB0827 GALACTOSYLTRANSFERASE OR LPS BIOSYNTHESIS RFBU RELATED PROTEIN from Pyrococcus abyssi (371 aa), FASTA scores: opt: 382, E(): 4.3e-16, (28.2% identity in 383 aa overlap); O26275|MTH173 LPS BIOSYNTHESIS RFBU RELATED PROTEIN from Methanothermobacter thermautotrophicus (382 aa), FASTA scores: opt: 372, E(): 1.8e-15, (28.4% identity in 391 aa overlap); etc. Shows also some similarity with O05313|Rv1212c|MTCI364.24c HYPOTHETICAL 41.5 KDA PROTEIN from Mycobacterium tuberculosis (387 aa), FASTA scores: opt: 232, E(): 1.1e -07, (28.4% identity in 402 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2642c detected using SWATH mass spectrometry. Mb2642c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TY88" /db_xref="InterPro:IPR028098" /db_xref="UniProtKB/Swiss-Prot:Q7TY88" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01260.1" /translation="MRIGMICPYSFDVPGGVQSHVLQLAEVMRTRGHLVSVLAPASPH AALPDYFVSGGRAVPIPYNGSVARLRFGPATHRKVKKWLAHGDFDVLHLHEPNAPSLS MLALNIAEGPIVATFHTSTTKSLTLTVFQGILRPMHEKIVGRIAVSDLARRWQMEALG SDAVEIPNGVDVDSFASAARLDGYPRQGKTVLFLGRYDEPRKGMAVLLDALPKVVQRF PDVQLLIVGHGDADQLRGQAGRLAAHLRFLGQVDDAGKASAMRSADVYCAPNTGGESF GIVLVEAMAAGTAVVASDLDAFRRVLRDGEVGHLVPVDPPDLQAAALADGLIAVLEND VLRERYVAAGNAAVRRYDWSVVASQIMRVYETVAGSGAKVQVAS" CDS complement(2910880..2911830) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2643C" /product="probable acyltransferase" /note="Mb2643c, -, len: 316 aa. Equivalent to Rv2611c, len: 316 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 316 aa overlap). Probable acyltransferase (EC 2.3.1.-), equivalent to O07148|MLCL581.15c|ML0453 HYPOTHETICAL 35.4 KDA PROTEIN from Mycobacterium leprae (320 aa), FASTA scores: opt: 1529, E(): 5e-90, (71.45% identity in 312 aa overlap); and equivalent to Q9F7Y8 PUTATIVE ACYLTRANSFERASE from Mycobacterium smegmatis (303 aa), FASTA scores: opt: 1464, E(): 6.5e-86, (72.15% identity in 291 aa overlap) (see citation below). Also highly similar to Q9L283|SCL2.16c PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (311 aa), FASTA scores: opt: 810, E(): 2.8e-44, (47.7% identity in 302 aa overlap); and similar to other acyltransferases e.g. Q9F0N3 ACYLTRANSFERASE from Campylobacter jejuni (295 aa), FASTA scores: opt: 207, E(): 6.4e-06, (20.45% identity in 220 aa overlap); Q9K379 ACYLTRANSFERASE (LIPID A BIOSYNTHESIS ACYLTRANSFERASE) from Campylobacter jejuni (295 aa), FASTA scores: opt: 203, E(): 1.1e-05, (20.0% identity in 220 aa overlap); etc. Protein product from Mb2643c detected using SWATH mass spectrometry. Mb2643c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1Q0" /db_xref="InterPro:IPR004960" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q0" /protein_id="SIU01261.1" /translation="MIAGLKGLKLPKDPRSSVTRTATDWAYAAGWMAVRALPEFAVRN AFDTGARYFARHGGPEQLRKNLARVLGVPPAAVPDPLMCASLESYGRYWREVFRLPTM NHRKLARQLDRVIGGLDHLDAALAAGLGAVLALPHSGNWDMAGMWLVQRHGTFTTVAE RLKPESLYQRFIDYRESLGFEVLPLSGGERPPFEVLCERLRNNRVVCLMAERDLTRTG VEVDFFGEPTRMPVGPAKLAVETGAALLPTHCWFEGRGWGFQVYPALDCTSGDVAAIT QALADRFAQNIAAHPADWHMLQPQWLADLSESRRAQLRSR" CDS complement(2911827..2912480) /codon_start=1 /transl_table=11 /gene="pgsA1" /locus_tag="BQ2027_MB2644C" /product="pi synthase pgsa1 (phosphatidylinositol synthase) (cdp-diacylglycerol--inositol-3- phosphatidyltransferase)" /note="Mb2644c, pgsA1, len: 217 aa. Equivalent to Rv2612c, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Probable pgsA1 (previously known as pgsA), PI synthase/CDP-diacylglyceride--inositol phosphatidyltransferase (EC 2.7.8.11), transmembrane protein, equivalent to O07149|MLCL581.16c|PGSA|ML0454 PUTATIVE PHOSPHATIDYLTRANSFERASE from Mycobacterium leprae (239 aa), FASTA scores: opt: 1141, E(): 4.1e-70, (79.35% identity in 213 aa overlap); and Q9F7Y9|PGSA PHOSPHATIDYLINOSITOL SYNTHASE from Mycobacterium smegmatis (222 aa), FASTA scores: opt: 981, E(): 2.7e-59, (67.3% identity in 217 aa overlap) (see citation below). Also similar to other proteins e.g. Q9L282|SCL2.17c PUTATIVE MEMBRANE TRANSFERASE from Streptomyces coelicolor (241 aa), FASTA scores: opt: 564, E(): 4.9e-31, (43.4% identity in 212 aa overlap); Q9UYD0|PGSA-LIKE|PAB1041 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Pyrococcus abyssi (186 aa), FASTA scores: opt: 264, E(): 8.4e-11, (33.15% identity in 190 aa overlap); Q9HQS2|PGSA|VNG1030G CDP-DIACYLGLYCEROL-GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Halobacterium sp. strain NRC-1 (199 aa), FASTA scores: opt: 249, E(): 9.1e-10, (32.1% identity in 193 aa overlap); etc. Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY. Note that in M. smegmatis, the psgA homolog is essential to the survival of the bacteria and seems cannot be compensated by any other enzyme of M. smegmatis. Protein product from Mb2644c detected using SWATH mass spectrometry. Mb2644c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1Q7" /db_xref="InterPro:IPR000462" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q7" /protein_id="SIU01262.1" /translation="MSKLPFLSRAAFARITTPIARGLLRVGLTPDVVTILGTTASVAG ALTLFPMGKLFAGACVVWFFVLFDMLDGAMARERGGGTRFGAVLDATCDRISDGAVFC GLLWWIAFHMRDRPLVIATLICLVTSQVISYIKARAEASGLRGDGGFIERPERLIIVL TGAGVSDFPFVPWPPALSVGMWLLAVASVITCVQRLHTVWTSPGAIDRMAIPGKGDR" CDS complement(2912477..2913064) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2645C" /product="FIG049476: HIT family protein" /note="Mb2645c, -, len: 195 aa. Equivalent to Rv2613c, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 195 aa overlap). Conserved hypothetical protein, equivalent to Q9CCU0|ML0455 HYPOTHETICAL PROTEIN from Mycobacterium leprae (206 aa), FASTA scores: opt: 1074, E(): 7.4e-62, (84.7% identity in 196 aa overlap); and highly similar, but longer 18 aa, to O07150|MLCL581.17c HYPOTHETICAL 20.7 KDA PROTEIN from Mycobacterium leprae (186 aa), FASTA scores: opt: 1038, E(): 1.4e-59, (89.7% identity in 175 aa overlap). Also highly similar to other hypothetical proteins (often Hit family member) e.g. Q9F7Z0 from Mycobacterium smegmatis (see citation below) (205 aa), FASTA scores: opt: 975, E(): 1.6e-55, (79.35% identity in 184 aa overlap); Q9L279|SCL2.20 from Streptomyces coelicolor (186 aa), FASTA scores: opt: 638, E(): 5.8e-34, (52.85% identity in 176 aa overlap); Q9YFX8|APE0122 from Aeropyrum pernix (184 aa), FASTA scores: opt: 515, E(): 4.4e-26, (45.9% identity in 159 aa overlap); etc. It seems the Rv2613c and downstream ORF Rv2612c|psgA1 are expressed from the same promoter (see citation below) and that Rv2613c should be involved in lipid metabolism. Protein product from Mb2645c detected using shotgun mass spectrometry. Mb2645c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1Q2" /db_xref="InterPro:IPR001310" /db_xref="InterPro:IPR011146" /db_xref="InterPro:IPR036265" /db_xref="InterPro:IPR039383" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q2" /protein_id="SIU01263.1" /translation="MSDEDRTDRATEDHTIFDRGVGQRDQLQRLWTPYRMNYLAEAPV KRDPNSSASPAQPFTEIPQLSDEEGLVVARGKLVYAVLNLYPYNPGHLMVVPYRRVSE LEDLTDLESAELMAFTQKAIRVIKNVSRPHGFNVGLNLGTSAGGSLAEHLHVHVVPRW GGDANFITIIGGSKVIPQLLRDTRRLLATEWARQP" CDS complement(2913057..2915135) /codon_start=1 /transl_table=11 /gene="thrS" /locus_tag="BQ2027_MB2646C" /product="PROBABLE THREONYL-TRNA SYNTHETASE THRS (THREONINE-TRNA SYNTHETASE)(ThrRS) (THREONINE-TRNA LIGASE)" /note="Mb2646c, thrS, len: 692 aa. Equivalent to Rv2614c, len: 692 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 692 aa overlap). Probable thrS, threonyl-tRNA synthetase (Threonine--tRNA ligase) (EC 6.1.1.3), equivalent to O07151|SYT_MYCLE|THRS|ML0456|MLCL581.18c THREONYL-TRNA SYNTHETASE from Mycobacterium leprae (702 aa), FASTA scores: opt: 3988, E(): 0, (84.05% identity in 702 aa overlap). Also highly similar to others e.g. Q9L278|THRS from Streptomyces coelicolor (658 aa), FASTA scores: opt: 1982, E(): 5.1e-114, (65.1% identity in 659 aa overlap); P56881|SYT_THETH|THRS from Thermus aquaticus (subsp. thermophilus) (659 aa), FASTA scores: opt: 1551, E(): 1.5e-87, (46.5% identity in 650 aa overlap); P00955|SYT_ECOLI from Escherichia coli (642 aa), FASTA scores: opt: 946, E(): 0, (40.7% identity in 612 aa overl ap); etc. Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. COFACTOR: BINDS 1 ZINC ION (BY SIMILARITY). Protein product from Mb2646c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2646c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67583" /db_xref="InterPro:IPR002314" /db_xref="InterPro:IPR002320" /db_xref="InterPro:IPR004154" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR012947" /db_xref="InterPro:IPR018163" /db_xref="InterPro:IPR033728" /db_xref="InterPro:IPR036621" /db_xref="UniProtKB/Swiss-Prot:P67583" /protein_id="SIU01264.1" /translation="MSAPAQPAPGVDGGDPSQARIRVPAGTTAATAVGEAGLPRRGTP DAIVVVRDADGNLRDLSWVPDVDTDITPVAANTDDGRSVIRHSTAHVLAQAVQELFPQ AKLGIGPPITDGFYYDFDVPEPFTPEDLAALEKRMRQIVKEGQLFDRRVYESTEQARA ELANEPYKLELVDDKSGDAEIMEVGGDELTAYDNLNPRTRERVWGDLCRGPHIPTTKH IPAFKLTRSSAAYWRGDQKNASLQRIYGTAWESQEALDRHLEFIEEAQRRDHRKLGVE LDLFSFPDEIGSGLAVFHPKGGIVRRELEDYSRRKHTEAGYQFVNSPHITKAQLFHTS GHLDWYADGMFPPMHIDAEYNADGSLRKPGQDYYLKPMNCPMHCLIFRARGRSYRELP LRLFEFGTVYRYEKSGVVHGLTRVRGLTMDDAHIFCTRDQMRDELRSLLRFVLDLLAD YGLTDFYLELSTKDPEKFVGAEEVWEEATTVLAEVGAESGLELVPDPGGAAFYGPKIS VQVKDALGRTWQMSTIQLDFNFPERFGLEYTAADGTRHRPVMIHRALFGSIERFFGIL TEHYAGAFPAWLAPVQVVGIPVADEHVAYLEEVATQLKSHGVRAEVDASDDRMAKKIV HHTNHKVPFMVLAGDRDVAAGAVSFRFGDRTQINGVARDDAVAAIVAWIADRENAVPT AELVKVAGRE" CDS 2915244..2915471 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2647" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2647, -, len: 75 aa. Equivalent to Rv2614A, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 75 aa overlap). Conserved hypothetical protein. The region from aa 10-35 is similar to part of C-terminal part of several TRIOSEPHOSPHATE ISOMERASES (EC 5.3.1.1) e.g. P46711|TPIS_MYCLE|TPIA|TPI|ML0572|B1496_C1_1 27 from Mycobacterium leprae (261 aa), FASTA scores: opt: 112, E(): 0.95, (60.0% identity in 25 aa overlap); and O08408|TPIS_MYCTU|TPIA|TPI|Rv1438|MT1482|MTCY493.16c from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 104, E(): 3.3, (60.0% identity in 25 aa overlap); P19583|TPIS_CORGL|TPIA|TPI from Corynebacterium glutamicum (Brevibacterium flavum) (259 aa), FASTA scores: opt: 100, E(): 6, (45.45% identity in 33 aa overlap); etc. TRIOSEPHOSPHATE ISOMERASES PLAY AN IMPORTANT ROLE IN SEVERAL METABOLIC PATHWAYS (CATALYTIC ACTIVITY: D-GLYCERALDEHYDE 3-PHOSPHATE = DIHYDROXY-ACETONE PHOSPHATE)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T9" /protein_id="SIU01265.1" /translation="MGDRYRAGDRVLYGGSMSPKDVDDLATQQDVDDGQSIERRWTGS GQRRWRRSPPTGHYRSNSQIQVWISGAGRLR" CDS complement(2915468..2916862) /codon_start=1 /transl_table=11 /gene="PE_PGRS45" /locus_tag="BQ2027_MB2648C" /product="pe-pgrs family protein pe_pgrs45" /note="Mb2648c, PE_PGRS45, len: 464 aa. Equivalent to Rv2615c, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 464 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. P71664|Rv1396c|MTCY21B4.13c from Mycobacterium tuberculosis (576 aa), FASTA scores: opt: 1629, E(): 4.8e-58, (56.65% identity in 482 aa overlap). Equivalent to AAK47006 from Mycobacterium tuberculosis strain CDC1551 (476 aa) but shorter 15 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp in-frame insertion (*-ccgccgttt) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (464 aa versus 461 aa). Mb2648c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2J2" /protein_id="SIU01266.1" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLVAAQD EVSTAIAALFGSHGQHYQAISAQVAAYQERFVLALSQAGSTYAVAEAASATPLQNVLD AINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNSGSGAPGQAGGAGGAAG LIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGGNGGIGGAGTNLAIGGHGG NGGNAGLIGAGGTGGAGGTGGGEPSAGASGGNGGNGGNGGLLIGNSGDGGAAGNGAGI SQNGPASGFGGNGGHAGTTGLIGNGGNGGAGGAGGDVSADFGGVGFGGQGGNGGAGGL LYGNGGAGGNGGAAGSPGSVTAFGGNGGSGGSGGNGGNALIGNAGAGGSAGAGGNGAS AGTAGGSGGDGGKGGNGGSVGLIGNGGNGGNGGNGGAGSLFNGAPGFGGPGGSGGASL LGPPGLAGTNGADG" CDS 2917207..2917707 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2649" /product="conserved protein" /note="Mb2649, -, len: 166 aa. Equivalent to Rv2616, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 166 aa overlap). Conserved hypothetical protein, highly similar to bacterial proteins: Q9L1G0|SC3D11.02c HYPOTHETICAL 20.3 KDA PROTEIN from Streptomyces coelicolor (188 aa), FASTA scores: opt: 407, E(): 2.3e-20, (44.0% identity in 159 aa overlap); Q9X945 A3(2) GLYCOGEN METABOLISM CLUSTER from Streptomyces coelicolor (134 aa), FASTA scores: opt: 330, E(): 2.5e-15, (46.65% identity in 120 aa overlap) (N-terminus shorter); Q9RST8|DR2035 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (198 aa), FASTA scores: opt: 228, E(): 2.4e-08, (35.1% identity in 168 aa overlap). Protein product from Mb2649 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2649 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014457" /db_xref="InterPro:IPR018960" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y9" /protein_id="SIU01267.1" /translation="MDLNALADLPLTYPEVGATATGRLPAGYNHLDVSTQIGTGRQRF EQAADAVMHWGMQRNAGLRVRASSETAIVSAVVLVGIAFLRAPCRVVYVIDEPDVRGF GYGTLPGHPVSGEERFAVRCDPMTSVVFAEVLSFSRPATWASKAAGPLGAVTQRFIAQ RYLRAV" CDS complement(2917724..2918164) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2650C" /product="PROBABLE TRANSMEMBRANE PROTEIN" /note="Mb2650c, -, len: 146 aa. Equivalent to Rv2617c, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 146 aa overlap). Probable transmembrane protein, showing some similarity to hypothetical or membrane proteins e.g. CAC47207|SMC00744 PUTATIVE TRANSPORT PROTEIN TRANSMEMBRANE from Rhizobium meliloti (Sinorhizobium meliloti) (399 aa), FASTA scores: opt: 108, E(): 5.5, (29.15% identity in 144 aa overlap). Protein product from Mb2650c detected using shotgun mass spectrometry. Mb2650c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y256" /db_xref="InterPro:IPR032808" /db_xref="UniProtKB/TrEMBL:A0A1R3Y256" /protein_id="SIU01268.1" /translation="MSIRPTTSPALADQLKDPAYSAYVLLRTLFTVAPILFGLDKFFN LLTHPQHWNMYLAGWINDLVPGTADQCMYLVGAIEIVAGVLVAVAPRIGAWVVAAWLA GIILNLVTGPGFYDIALRDFGLLVGAIALARLAQGVHSGGIGRP" CDS 2918311..2918988 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2651" /product="Putative transcriptional regulatory protein" /note="Mb2651, -, len: 225 aa. Equivalent to Rv2618, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 225 aa overlap). Conserved hypothetical protein, similar in part to Q9EWQ9|SC4C2.03 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (159 aa), FASTA scores: opt: 235, E(): 1.3e-07, (43.7% identity in 103 aa overlap); Q9HLM6|TA0201 HYPOTHETICAL PROTEIN from Thermoplasma acidophilum (215 aa), FASTA scores: opt: 164, E(): 0.0038, (23.4% identity in 201 aa overlap); and to mycobacterial proteins e.g. O06191|Rv2621c|MTCY01A10.11 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium tuberculosis (224 aa), FASTA scores: opt: 149, E(): 0.033, (28.05% identity in 196 aa overlap). Mb2651 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Q9" /protein_id="SIU01269.1" /translation="MDPVRRQLYQFVCSQSMPVSRDQAADAVGIPRHQAKFHLDRLTA EGLLDTEYARLTGRSGPGAGRTAKLYRRAGRDIALSLPQREYELAGRLMAAAIVLSAT TGEPTVEVLNRIAHDYGQAMGAAATTRPPADPAAALELTLDVLRKYGYEPRRPAGPGD DEVELVNCPFHALAREQTELACNMNHALITGVADALAPHSPAVRLAPGPARCCVVLKR CSAHDPE" CDS complement(2918973..2919326) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2652C" /product="Protein containing double-stranded beta-helix domain" /note="Mb2652c, -, len: 117 aa. Equivalent to Rv2619c, len: 117 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 117 aa overlap). Conserved hypothetical protein, highly similar to Q9L0F3|SCD31.14 HYPOTHETICAL 11.6 KDA PROTEIN from Streptomyces coelicolor (110 aa), FASTA scores: opt: 407, E(): 2.3e-21, (55.95% identity in 109 aa overlap). Also similarity with other short bacterial hypothetical proteins e.g. Q9F8B9 HYPOTHETICAL 12.4 KDA PROTEIN from Streptococcus agalactiae (112 aa), FASTA scores: opt: 143, E(): 0.0032, (32.45% identity in 74 aa overlap); etc. Protein product from Mb2652c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2652c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR014710" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01270.1" /translation="MESISLTSLAAEKLAEAQQTHSGRAAHTIHGGHTHELRQTVLAL LAGHDLSEHDSPGEATLQVLQGHVCLTAGEDAWNGRAGDYVAIPPTRHALHAVEDSVI MLTVLKSLPDAHSGS" CDS complement(2919339..2919764) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2653C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2653c, -, len: 141 aa. Equivalent to Rv2620c, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Probable conserved transmembrane protein, highly similar to O54184|SC7H1.25 HYPOTHETICAL 14.6 KDA PROTEIN from Streptomyces coelicolor (144 aa), FASTA scores: opt: 459, E(): 1.4e-22, (56.45% identity in 140 aa overlap). Protein product from Mb2653c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2653c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1R0" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R0" /protein_id="SIU01271.1" /translation="MSAGPAIEVAVAFVWLGMVVAISFLEAPLKFRAAGVTLQIGLGI GRLVFRALNTVEVGFALVILAIVVVGSTPARIAAAFSVALAALAVQLIAVRPRLTRRS NQVLAGLQAPRSRGHHIYVGLEIVKVVALLVAGILLLNG" CDS complement(2919761..2920432) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2654C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb2654c, -, len: 223 aa. Equivalent to Rv2621c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 224 aa overlap). Possible transcriptional regulator, similar in part to Q49688|MLCL536.29c|ML0592 PUTATIVE DNA-BINDING PROTEIN from Mycobacterium leprae (254 aa), FASTA scores: opt: 168, E(): 0.0018, (29.75% identity in 222 aa overlap). Shows similarity with Q9XAD0|SCC22.08c PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (252 aa), FASTA scores: opt: 148, E(): 0.032, (29.4% identity in 204 aa overlap); and Q9RVM8|DR0999 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (225 aa), FASTA scores: opt: 195, E(): 3.3e-05, (29.6% identity in 213 aa overlap). Also some similarity with O06195|Rv2618|MTCY01A10.15c from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 149, E(): 0.025, (28.95% identity in 197 aa overlap). Contains helix-turn-helix motif at aa 31-52 (Score 1662, +4.85 SD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp deletion (ggg-*) leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (223 aa versus 224 aa). Mb2654c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R6" /protein_id="SIU01272.1" /translation="MGVSVIIRSLQEPVGRRRAVLRALCASRVPMSIAAIAGKLGVHP NTVRFHLDNLVADGQVERVEPGRGRPGRPPLMFRAVRRTDSTGTRRYRLLAEILASGL AAERDSRAMALSAGRAWGRQLEAPPAGADTEETIDHLVAVLDDLGFAPERRASNGRQQ VGLRHCPFLELAETQAGVVCPVHLGIMRGALQTWAPVTVDRLDAFVEPDLCLAHFTPL EGAIR" CDS 2920510..2921331 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2655" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb2655, -, len: 273 aa. Equivalent to Rv2622, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Possible methyltransferase (EC 2.1.1.-), similar in part to others e.g. AAK75664|SP1578 PUTATIVE METHYLTRANSFERASE from Streptococcus pneumoniae (252 aa), FASTA scores: opt: 406, E(): 6.6e-18, (32.65% identity in 251 aa overlap); Q9F8B8 METHYLTRANSFERASE from Streptococcus agalactiae (254 aa), FASTA scores: opt: 381, E(): 2.3e-16, (31.75% identity in 252 aa overlap); Q9RJB6|SCF91.08 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (231 aa), FASTA scores: opt: 159, E(): 0.0091, (33.1% identity in 151 aa overlap); etc. Also similar in part to several hypothetical proteins e.g. Q99YR0|SPY1582 HYPOTHETICAL PROTEIN from Streptococcus pyogenes (251 aa), FASTA scores: opt: 397, E(): 2.3e-17, (36.3% identity in 248 aa overlap). Protein product from Mb2655 detected using shotgun mass spectrometry. Mb2655 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1R1" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R1" /protein_id="SIU01273.1" /translation="MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELT RTLLARAEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGRGD VRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGGRYAIHELALV PDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHGLVVEHVVTASMALLQPRR VIADEGLLGALRFAGNLLIHRAARRRVLLMRHTFRRHRERLTAVAIVAHKPHVDS" CDS 2921467..2922360 /codon_start=1 /transl_table=11 /gene="TB31.7" /locus_tag="BQ2027_MB2656" /product="universal stress protein family protein tb31.7" /note="Mb2656, TB31.7, len: 297 aa. Equivalent to Rv2623, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 297 aa overlap). TB31.7, conserved hypothetical protein, highly similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Q10851|YK05_MYCTU|Rv2005c|MT2061|MTCY39.12 (295 aa), FASTA scores: opt: 1076, E(): 1.4e-60, (55.25% identity in 295 aa overlap); O53472|Rv2026c|MTV018.13c (294 aa), FASTA scores: opt: 988, E(): 4.8e-55, (51.5% identity in 295 aa overlap); Q10862|YJ96_MYCTU|Rv1996|MT2052|MTCY39.23c (317 aa), FASTA scores: opt: 688, E(): 4.1e-36, (45.1% identity in 315 aa overlap); etc. Also similar to several Streptomyces proteins e.g. Q9RIZ8|SCJ1.16c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (294 aa), FASTA scores: opt: 407, E(): 2e-18, (32.65% identity in 303 aa overlap); and other bacterial hypothetical proteins e.g. Q9HPP5|VNG1536 from Halobacterium sp (147 aa), FASTA scores: opt: 180, E(): 0.00022, (31.65% identity in 139 aa overlap). Protein product from Mb2656 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2656 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01274.1" /translation="MSSGNSSLGIIVGIDDSPAAQVAVRWAARDAELRKIPLTLVHAV SPEVATWLEVPLPPGVLRWQQDHGRHLIDDALKVVEQASLRAGPPTVHSEIVPAAAVP TLVDMSKDAVLMVVGCLGSGRWPGRLLGSVSSGLLRHAHCPVVIIHDEDSVMPHPQQA PVLVGVDGSSASELATAIAFDEASRRNVDLVALHAWSDVDVSEWPGIDWPATQSMAEQ VLAERLAGWQERYPNVAITRVVVRDQPARQLVQRSEEAQLVVVGSRGRGGYAGMLVGS VGETVAQLARTPVIVARESLT" CDS complement(2922363..2923181) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2657C" /product="universal stress protein family protein" /note="Mb2657c, -, len: 272 aa. Equivalent to Rv2624c, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Conserved hypothetical protein, similar to several Streptomyces proteins e.g. Q9RIY5|SCJ1.29c HYPOTHETICAL 30.1 KDA PROTEIN from Streptomyces coelicolor (283 aa), FASTA scores: opt: 260, E(): 5e-09, (32.05% identity in 290 aa overlap). Also similar to Mycobacterium tuberculosis proteins O53474|Rv2028c|MTV018.15c (279 aa), FASTA scores: opt: 563, E(): 7e-28, (36.85% identity in 266 aa overlap); P95192|Rv3134c|MTCY03A2.240 (268 aa), FASTA scores: opt: 458, E(): 2.3e-21, (36.55% identity in 271 aa overlap); Q10851|YK05_MYCTU|Rv2005c|MT2061|MTCY39.12 (295 aa), FASTA scores: opt: 199, E(): 3.2e-05, (29.35% identity in 286 aa overlap); etc. Protein product from Mb2657c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2657c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01275.1" /translation="MSGRGEPTMKTIIVGIDGSHAAITAALWGVDEAISRAVPLRLVS VIKPTHPSPDDYDRDLAHAERSLREAQSAVEAAGKLVKIETDIPRGPAGPVLVEASRD AEMICVGSVGIGRYASSILGSTATELAEKAHCPVAVMRSKVDQPASDINWIVVRMTDA PDNEAVLEYAAREAKLRQAPILALGGRPEELREIPDGEFERRVQDWHHRHPDVRVYPI TTHTGIARFLADHDERVQLAVIGGGEAGQLARLVGPSGHPVFRHAECSVLVVRR" CDS complement(2923196..2924377) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2658C" /product="PROBABLE CONSERVED TRANSMEMBRANE ALANINE AND LEUCINE RICH PROTEIN" /note="Mb2658c, -, len: 393 aa. Equivalent to Rv2625c, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 393 aa overlap). Probable conserved transmembrane ala-, leu-rich protein, similar to many hypothetical or membrane proteins e.g. Q55518|Y528_SYNY3|SLL0528 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (379 aa), FASTA scores: opt: 552, E(): 5.6e-26, (30.75% identity in 374 aa overlap); Q9RJ56|SCI41.35c HYPOTHETICAL 39.8 KDA PROTEIN from Streptomyces coelicolor (374 aa), FASTA scores: opt: 419, E(): 5.7e-18, (31.6% identity in 383 aa overlap); CAC49448|SMB20925 CONSERVED HYPOTHETICAL MEMBRANE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (372 aa), FASTA scores: opt: 401, E(): 6.9e-17, (29.5% identity in 383 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb2658c detected using SWATH mass spectrometry. Mb2658c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2K2" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR008915" /db_xref="InterPro:IPR016483" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2K2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01276.1" /translation="MRDAIPLGRIAGFVVNVHWSVLVILWLFTWSLATMLPGTVGGYP AVVYWLLGAGGAVMLLASLLAHELAHAVVARRAGVSVESVTLWLFGGVTALGGEAKTP KAAFRIAFAGPATSLALSATFGALAITLAGVRTPAIVISVAWWLATVNLLLGLFNLLP GAPLDGGRLVRAYLWRRHGDSVRAGIGAARAGRVVALVLIALGLAEFVAGGLVGGVWL AFIGWFIFAAAREEETRISTQQLFAGVRVADAMTAQPHTAPGWINVEDFIQRYVLGER HSAYPVADRDGSITGLVALRQLRDVAPSRRSTTSVGDIALPLHSVPTARPQEPLTALL ERMAPLGPRSRALVTEGSAVVGIVTPSDVARLIDVYRLAQPEPTFTTSPQDADRFSDA G" CDS complement(2924436..2924867) /codon_start=1 /transl_table=11 /gene="hrp1" /locus_tag="BQ2027_MB2659C" /product="hypoxic response protein 1 hrp1" /note="Mb2659c, -, len: 143 aa. Equivalent to Rv2626c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein, similar to CAC49670|SMB21441 PUTATIVE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE PROTEIN (EC 1.1.1.205) from Rhizobium meliloti (Sinorhizobium meliloti) (120 aa), FASTA scores: opt: 287, E(): 6.6e-12, (43.75% identity in 112 aa overlap) (has its N-terminus shorter 27 aa); AAK78655|CAC0678 CBS DOMAINS from Clostridium acetobutylicum (142 aa), FASTA scores: opt: 276, E(): 3.9e-11, (35.65% identity in 115 aa overlap); Q9K9P0|BH2605 BH2605 PROTEIN from Bacillus halodurans (142 aa), FASTA scores: opt: 276, E(): 3.9e-11, (35.65% identity in 115 aa overlap); etc. Also some similarity to P71737|Rv2406c|MTCY253.14 HYPOTHETICAL 15.1 KDA PROTEIN from Mycobacterium tuberculosis (142 aa), FASTA scores: opt: 145, E(): 0.00012, (22.3% identity in 112 aa overlap). Protein product from Mb2659c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2659c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000644" /db_xref="UniProtKB/TrEMBL:A0A1R3Y200" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01277.1" /translation="MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDR LHGMLTDRDIVIKGLAAGLDPNTATAGELARDSIYYVDANASIQEMLNVMEEHQVRRV PVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMALAS" CDS complement(2925381..2926622) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2660C" /product="conserved protein" /note="Mb2660c, -, len: 413 aa. Equivalent to Rv2627c, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 413 aa overlap). Conserved hypothetical protein. Some similarity in C-terminal part of O53697|Rv0293c|MTV035.21c HYPOTHETICAL 44.0 KDA PROTEIN from Mycobacterium tuberculosis (400 aa), FASTA scores: opt: 392, E(): 1.9e-17, (31.1% identity in 299 aa overlap). Protein product from Mb2660c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2660c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y265" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01278.1" /translation="MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYL GQQPDKLPIARPTIALAAQAFRDEIVLLGLKARRPVSNHRVFERISQEVAAGLEFYGN RGWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEPGSQRWLSYTA NNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWKLHDELGLNIVMPVLPMHG PRGQGLPKGAVFPGEDVLDDVHGTAQAVWDIRRLLSWIRSQEEESLIGLNGLSLGGYI ASLVASLEEGLACAILGVPVADLIELLGRHCGLRHKDPRRHTVKMAEPIGRMISPLSL TPLVPMPGRFIYAGIADRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFV QAALEQSGLLDAPRTQRDRSA" CDS 2926932..2927294 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2661" /product="HYPOTHETICAL PROTEIN" /note="Mb2661, -, len: 120 aa. Equivalent to Rv2628, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 120 aa overlap). Hypothetical unknown protein. Mb2661 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1R8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01279.1" /translation="MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHP RKVQSATIYQVTDRLHDGRTARVRGDEITSTVSGWLSELGTQSPLADELARAVRIGDW PAAYAIGEHLSVEIAVAV" CDS 2927641..2928765 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2662" /product="conserved protein" /note="Mb2662, -, len: 374 aa. Equivalent to Rv2629, len: 374 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 374 aa overlap). Conserved hypothetical protein, similar to Q9ZC00|SC1E6.22c HYPOTHETICAL 40.7 KDA PROTEIN from Streptomyces coelicolor (373 aa), FASTA scores: opt: 425, E(): 2.5e-18, (30.2% identity in 371 aa overlap). Protein product from Mb2662 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2662 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029064" /db_xref="InterPro:IPR040701" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1S3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01280.1" /translation="MRSERLRWLVAAEGPFASVYFDDSHDTLDAVERREATWRDVRKH LESRDAKQELIDSLEEAVRDSRPAVGQRGRALIATGEQVLVNEHLIGPPPATVIRLSD YPYVVPLIDLEMRRPTYVFAAVDHTGADVKLYQGATISSTKIDGVGYPVHKPVTAGWN GYGDFQHTTEEAIRMNCRAVADHLTRLVDAADPEVVFVSGEVRSRTDLLSTLPQRVAV RVSQLHAGPRKSALDEEEIWDLTSAEFTRRRYAEITNVAQQFEAEIGRGSGLAAQGLA EVCAALRDGDVDTLIVGELGEATVVTGKARTTVARDADMLSELGEPVDRVARADEALP FAAIAVGAALVRDDNRIAPLDGVGALLRYAATNRLGSHRS" CDS 2928767..2929306 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2663" /product="Archease" /note="Mb2663, -, len: 179 aa. Equivalent to Rv2630, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 179 aa overlap). Hypothetical unknown protein. Mb2663 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023572" /db_xref="InterPro:IPR036820" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1S0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01281.1" /translation="MLHRDDHINPPRPRGLDVPCARLRATNPLRALARCVQAGKPGTS SGHRSVPHTADLRIEAWAPTRDGCIRQAVLGTVESFLDLESAHAVHTRLRRLTADRDD DLLVAVLEEVIYLLDTVGETPVDLRLRDVDGGVDVTFATTDASTLVQVGAVPKAVSLN ELRFSQGRHGWRCAVTLDV" CDS 2929445..2930743 /codon_start=1 /transl_table=11 /gene="rtcB" /locus_tag="BQ2027_MB2664" /product="RNA-2',3'-PO4:RNA-5'-OH ligase" /note="Mb2664, -, len: 432 aa. Equivalent to Rv2631, len: 432 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 432 aa overlap). Conserved hypothetical protein, highly similar to several conserved hypothetical proteins from various species e.g. O29399|AF0862 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (482 aa), FASTA scores: opt: 1496, E(): 2.1e-80, (52.3% identity in 438 aa overlap) (has its N-terminus longer 38 aa); O27634|MTH1597 CONSERVED PROTEIN from Methanothermobacter thermautotrophicus (488 aa), FASTA scores: opt: 1428, E(): 2.1e-76, (50.9% identity in 438 aa overlap); Q9YB37|APE1758 HYPOTHETICAL 53.7 KDA PROTEIN APE1758 from Aeropyrum pernix (483 aa), FASTA scores: opt: 1422, E(): 4.6e-76, (49.3% identity in 440 aa overlap) (has its N-terminus longer 38 aa); etc. Equivalent to AAK47022 from Mycobacterium tuberculosis strain CDC1551 (432 aa) but longer 8 aa. 3' part extended since first submission (+175 aa). Protein product from Mb2664 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2664 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59975" /db_xref="InterPro:IPR001233" /db_xref="InterPro:IPR036025" /db_xref="UniProtKB/Swiss-Prot:P59975" /protein_id="SIU01282.1" /translation="MQVVNVATLPGIVRASYAMPDVHWGYGFPIGGVAATDVDNDGVV SPGGVGFDISCGVRLLVGEGLDREELQPRLPAVMDRLDRAIPRGVGTAGVWRLPDRNT LQEVLTGGARFAVEQGHGVALDLERCEDGGVMTGADAAKISDRALQRGLGQIGSLGSG NHFLEVQAVDRVYDPVAAAPMGLAEGTVCVMIHTGSRGLGHQICTDHVRQMEQAMGRY GIAVPDRQLACVPVHSPDGQAYLAAMAAAANYGRANRQLLTEATRRVFADATGTPLDL LYDVSHNLAKIETHPIDGQLRSVCVHRKGATRSLPPHHHELPAELAAVGQPVLIPGTM GTASYVLAGVTGNPAFFSTAHGAGRVLSRHQAARHTSGEAIRASLAKRGIIVRGTSRR GIAEEKPEAYKDVDEVIEASHQSGLARKVARLVPLGCVKG" CDS complement(2930782..2931063) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2665C" /product="conserved protein" /note="Mb2665c, -, len: 93 aa. Equivalent to Rv2632c, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Conserved hypothetical protein, highly similar to conserved hypothetical proteins from Mycobacterium tuberculosis: P71996|YH38_MYCTU|Rv1738|MT1780|MTCY04C12.23 (94 aa), FASTA scores: opt: 319, E(): 4.2e-15, (53.95% identity in 89 aa overlap); and Q9KK61 from Mycobacterium bovis BCG (56 aa), FASTA scores: opt: 178, E(): 9.2e-06, (52.95% identity in 51 aa overlap). Protein product from Mb2665c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2665c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR015057" /db_xref="InterPro:IPR038070" /db_xref="UniProtKB/Swiss-Prot:P65034" /protein_id="SIU01283.1" /translation="MTDSEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQMVGVGLAR LDPADEPVAQIGDELAIARALSDLANQLFALTSSDIEASTHQPVTGLHH" CDS complement(2931208..2931693) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2666C" /product="Hemerythrin domain protein" /note="Mb2666c, -, len: 161 aa. Equivalent to Rv2633c, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Hypothetical unknown protein. Protein product from Mb2666c detected using SWATH mass spectrometry. Mb2666c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012312" /db_xref="UniProtKB/Swiss-Prot:P65036" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01284.1" /translation="MNAYDVLKRHHTVLKGLGRKVGEAPVNSEERHVLFDEMLIELDI HFRIEDDLYYPALSAAGKPITGTHAEHRQVVDQLATLLRTPQRAPGYEEEWNVFRTVL EAHADVEERDMIPAPTPVHITDAELEELGDKMAARIEQLRGSPLYTLRTKGKADLLKA I" CDS complement(2931978..2934314) /codon_start=1 /transl_table=11 /gene="PE_PGRS46" /locus_tag="BQ2027_MB2667C" /product="pe-pgrs family protein pe_pgrs46" /note="Mb2667c, PE_PGRS46, len: 778 aa. Equivalent to Rv2634c, len: 778 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 778 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. O53553|YZ08_MYCTU|Rv3508|MTV023.15 from Mycobacterium tuberculosis (1901 aa), FASTA scores: opt: 2553, E(): 2.2e-93, (53.8% identity in 866 aa overlap). Equivalent to AAK47026 from Mycobacterium tuberculosis strain CDC1551 (788 aa) but shorter 10 aa. Mb2667c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P0A691" /protein_id="SIU01285.1" /translation="MSFVIAVPEALTMAASDLANIGSTINAANAAAALPTTGVVAAAA DEVSAAVAALFGSYAQSYQAFGAQLSAFHAQFVQSLTNGARSYVVAEATSAAPLQDLL GVVNAPAQALLGRPLIGNGANGADGTGAPGGPGGLLLGNGGNGGSGAPGQPGGAGGDA GLIGNGGTGGKGGDGLVGSGAAGGVGGRGGWLLGNGGTGGAGGAAGATLVGGTGGVGG ATGLIGSGGFGGAGGAAAGVGTTGGVGGSGGVGGVFGNGGFGGAGGLGAAGGVGGAAS YFGTGGGGGVGGDGAPGGDGGAGPLLIGNGGVGGLGGAGAAGGNGGAGGMLLGDGGAG GQGGPAVAGVLGGMPGAGGNGGNANWFGSGGAGGQGGTGLAGTNGVNPGSIANPNTGA NGTDNSGNGNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTGGKGGAGGTAGT DGGAGGAGGAGGIGETDGSAGGVATGGEGGDGATGGVDGGVGGAGGKGGQGHNTGVGD AFGGDGGIGGDGNGALGAAGGNGGTGGAGGNGGRGGMLIGNGGAGGAGGTGGTGGGGA AGFAGGVGGAGGEGLTDGAGTAEGGTGGLGGLGGVGGTGGMGGSGGVGGNGGAAGSLI GLGGGGGAGGVGGTGGIGGIGGAGGNGGAGGAGTTTGGGATIGGGGGTGGVGGAGGTG GTGGAGGTTGGSGGAGGLIGWAGAAGGTGAGGTGGQGGLGGQGGNGGNGGTGATGGQG GDFALGGNGGAGGAGGSPGGSSGIQGNMGPPGTQGADG" CDS 2934343..2934585 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2668" /product="HYPOTHETICAL PROTEIN" /note="Mb2668, -, len: 80 aa. Equivalent to Rv2635, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 80 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/Swiss-Prot:P65038" /protein_id="SIU01286.1" /translation="MVAADHRALGSNKSYPASQTAEAIWPPARTLRYDRQSPWLATGF DRRMSQTVTGVGVQNCAVSKRRCSAVDHSSRTPYRR" CDS 2934586..2935263 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2669" /product="Putative O-phosphotransferase (EC" /EC_number="2.7.1.-" /note="Mb2669, -, len: 225 aa. Equivalent to Rv2636, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 225 aa overlap). Conserved hypothetical protein, showing some similarity with various proteins: Q98FG2|MLL3789 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (239 aa), FASTA scores: opt: 304, E(): 3.7e-13, (31.55% identity in 187 aa overlap); CAC46568|SMC04451 PUTATIVE CHLORAMPHENICOL PHOSPHOTRANSFERASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (220 aa), FASTA scores: opt: 175, E(): 0.00014, (28.0% identity in 225 aa overlap); Q56148|CPT_STRVL CHLORAMPHENICOL 3-O PHOSPHOTRANSFERASE (EC 2.7.1.-) from Streptomyces violaceus (Streptomyces venezuelae) (178 aa), FASTA scores: opt: 131, E(): 0.1, (31.75% identity in 170 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Translational start site uncertain, chosen by similarity. Protein product from Mb2669 detected using SWATH mass spectrometry. Mb2669 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65040" /db_xref="InterPro:IPR012853" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P65040" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01287.1" /translation="MINPTRARRMRYRLAAMAGMPEGKLILLNGGSSAGKTSLALAFQ DLAAECWMHIGIDLFWFALPPEQLDLARVRPEYYTWDSAVEADGLEWFTVHPGPILDL AMHSRYRAIRAYLDNGMNVIADDVIWTREWLVDALRVFEGCRVWMVGVHVSDEEGARR ELERGDRHPGWNRGSARAAHADAEYDFELDTTATPVHELARELHESYQACPYPMAFNR LRKRFLS" CDS 2935459..2936115 /codon_start=1 /transl_table=11 /gene="dedA" /locus_tag="BQ2027_MB2670" /product="POSSIBLE TRANSMEMBRANE PROTEIN DEDA" /note="Mb2670, dedA, len: 218 aa. Equivalent to Rv2637, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Possible dedA, transmembrane protein, equivalent to Q49642|YQ37_MYCLE|ML0467|MLCL581.27|B1177_C2_172/B1177_C1_ 140 HYPOTHETICAL 23.1 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN, BELONGS TO THE DEDA FAMILY) from Mycobacterium leprae (214 aa), FASTA scores: opt: 1160, E(): 4.4e-64, (82.75% identity in 209 aa overlap); and O69601|Y364_MYCLE|ML0287|MLCB4.30 HYPOTHETICAL PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) (222 aa), FASTA scores: opt: 292, E(): 6.6e-11, (32.25% identity in 189 aa overlap). Also highly similar to other membrane proteins e.g. CAC42863|SCBAC36F5.27c PUTATIVE INTEGRAL MEMBRANE from Streptomyces coelicolor (211 aa), FASTA scores: opt: 837, E(): 2.6e-44, (59.2% identity in 201 aa overlap); Q55705|Y232_SYNY3|SLR0232 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (218 aa), FASTA scores: opt: 415, E(): 1.9e-18, (37.85% identity in 206 aa overlap); Q9RV63|DR1167 DEDA PROTEIN from Deinococcus radiodurans (200 aa); P09548|DEDA_ECOLI|B2317|Z3579|ECS3201 DEDA PROTEIN (DSG-1 PROTEIN) from Escherichia coli strains K12 and O157:H7 (219 aa), BLAST scores: 178, E(): 1.8e-13, Identities = 53/175 (30%); etc. Also similar to O06314|Y364_MYCTU|Rv0364|MT0380|MTCY13E10.26 HYPOTHETICAL 24.5 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Mycobacterium tuberculosis (227 aa), FASTA scores: opt: 293, E(): 5.8e-11, (35.85% identity in 184 aa overlap). BELONGS TO THE DEDA FAMILY. Protein product from Mb2670 detected using SWATH mass spectrometry. Mb2670 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63912" /db_xref="InterPro:IPR032816" /db_xref="InterPro:IPR032818" /db_xref="UniProtKB/Swiss-Prot:P63912" /protein_id="SIU01288.1" /translation="MDVEALLQSIPPLMVYLVVGAVVGIESLGIPLPGEIVLVSAAVL SSHPELAVNPIGVGGAAVIGAVVGDSIGYSIGRRFGLPLFDRLGRRFPKHFGPGHVAL AERLFNRWGVRAVFLGRFIALLRIFAGPLAGALKMPYPRFLAANVTGGICWAGGTTAL VYFAGMAAQHWLERFSWIALVIAVIAGITAAILLRERTSRAIAELEAEHCRKAGTTAA " CDS 2936278..2936724 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2671" /product="Anti-sigma B factor antagonist RsbV" /note="Mb2671, -, len: 148 aa. Equivalent to Rv2638, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 148 aa overlap). Conserved hypothetical protein, similar in part to Q9WVX8|RSBV_STRCO|BLDG|SCH5.12c ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa), FASTA scores: opt: 162, E(): 0.00066, (31.8% identity in 110 aa overlap); and showing weak similarity with various proteins e.g. O69205 HYPOTHETICAL 13.4 KDA PROTEIN from Actinosynnema pretiosum (subsp. auranticum) (128 aa), FASTA scores: opt: 157, E(): 0.0016, (29.8% identity in 114 aa overlap); Q9RJ93|SCF91.32 PUTATIVE ANTI-SIGMA FACTOR ANTAGONIST from Streptomyces coelicolor (183 aa), FASTA scores: opt: 148, E(): 0.0082, (30.85% identity in 107 aa overlap); etc. Also highly similar to hypothetical proteins from Mycobacterium tuberculosis: O07728|Rv1904|MTCY180.14c (143 aa), FASTA scores: opt: 456, E(): 3.9e-23, (52.8% identity in 125 aa overlap); and Q11035|YD65_MYCTU|Rv1365c|MT1411|MTCY02B10.29c (128 aa), FASTA scores: opt: 435, E(): 8.6e-22, (53.6% identity in 125 aa overlap). Mb2671 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1T2" /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR003658" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T2" /protein_id="SIU01289.1" /translation="MGLITTEPRSSPHPLSPRLVHELGDPHSTLRATTDGSGAALLIH AGGEIDGRNEHLWRQLVTEAAAGVTAPGPLIVDVTGLDFMGCCAFAALADEAQRCRCR GIDLRLVSHQPIVARIAEAGGLSRVLPIYPTVDTALGKGTAGPARC" CDS complement(2936899..2937231) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2672C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2672c, -, len: 110 aa. Equivalent to Rv2639c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Probable conserved integral membrane protein, highly similar to many bacterial hypothetical or membrane proteins e.g. Q9X889|YE14_STRCO|SCE15.14 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 597, E(): 3.1e-31, (73.15% identity in 108 aa overlap); Q55939|Y793_SYNY3|SLL0793 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (108 aa), FASTA scores: opt: 341, E(): 4.9e-15, (51.4% identity in 109 aa overlap); O31553|YFJF_BACSU POTENTIAL INTEGRAL MEMBRANE PROTEIN from Bacillus subtilis (109 aa), FASTA scores: opt: 334, E(): 1.4e-14, (47.5% identity in 109 aa overlap); etc. Mb2672c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67147" /db_xref="InterPro:IPR003844" /db_xref="UniProtKB/Swiss-Prot:P67147" /protein_id="SIU01290.1" /translation="MVVRSILLFVLAAVAEIGGAWLVWQGVREQRGWLWAGLGVIALG VYGFFATLQPDAHFGRVLAAYGGVFVAGSLAWGMALDGFRPDRWDVIGALGCMAGVAV IMYAPRGH" CDS complement(2937351..2937710) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2673C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ARSR-FAMILY)" /note="Mb2673c, -, len: 119 aa. Equivalent to Rv2640c, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Possible transcriptional regulator, arsR family, highly similar to many e.g. Q9L1V5|SC4A9.07 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (117 aa), FASTA scores: opt: 261, E(): 5.6e-10, (47.75% identity in 103 aa overlap); Q9X8X8|SCH35.28c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (122 aa), FASTA scores: opt: 252, E(): 2.2e-09, (37.05% identity in 116 aa overlap); Q9L220|SC1A2.21 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (119 aa), FASTA scores: opt: 252, E(): 2.2e-09, (37.05% identity in 116 aa overlap); P77295|YGAV_ECOLI|B2667 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli strain K12 (99 aa), FASTA scores: opt: 156, E(): 0.0023, (34.1% identity in 88 aa overlap); etc. Also similar to upstream ORF P71941|Rv2642|MTCY441.12 PUTATIVE TRANSCRIPTIONAL REGULATORY PROTEIN from Mycobacterium tuberculosis (126 aa), FASTA scores: opt: 237, E(): 2e-08, (38.55% identity in 109 aa overlap). Contains helix-turn-helix motif at aa 59-80 (Score 1166, +3.16 SD). BELONGS TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb2673c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1T4" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T4" /protein_id="SIU01291.1" /translation="MPKSLPVIDISAPVCCAPVAAGPMSDGDALAVALRLKALADPAR VKIMSYLFSSPAGEQVSGQLAAALSLSDGTVSHHLAQLRKAGLVISDRRGMHVFHRVH PEALQALCTVLNPNCCA" CDS 2937812..2938270 /codon_start=1 /transl_table=11 /gene="cadI" /locus_tag="BQ2027_MB2674" /product="CADMIUM INDUCIBLE PROTEIN CADI" /note="Mb2674, cadI, len: 152 aa. Equivalent to Rv2641, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 152 aa overlap). cadI, conserved hypothetical protein. Gene induced by cadmium (see first citation below), highly similar to hypothetical proteins e.g. Q9L222|SC1A2.19c from Streptomyces coelicolor (152 aa), FASTA scores: opt: 509, E(): 2.3e-27, (55.05% identity in 149 aa overlap); P45945|YQCK_BACSU from Bacillus subtilis (146 aa), FASTA scores: opt: 295, E(): 5.4e-13, (33.55% identity in 146 aa overlap); and Q98CF8|MLL5167 from Rhizobium loti (Mesorhizobium loti) (124 aa), FASTA scores: opt: 110, E(): 1.3, (31.4% identity in 121 aa overlap). Some similarity with Q10548|Y887_MYCTU|Rv0887c|MT0910|MTCY31.15c from Mycobacterium tuberculosis (152 aa), FASTA scores: opt: 108, E(): 2.1, (25.7% identity in 148 aa overlap). Protein product from Mb2674 detected using SWATH mass spectrometry. Mb2674 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004360" /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="UniProtKB/Swiss-Prot:P0A5N7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01292.1" /translation="MSRVQLALNVDDLEAAITFYSRLFNAEPAKRKPGYANFAIADPP LKLVLLENPGTGGTLNHLGVEVGSSNTVHAEIARLTEAGLVTEKEIGTTCCFATQDKV WVTGPGGERWEVYTVLADSETFGSGPRHNDTSDGEASMCCDGQVAVGASG" CDS 2938406..2938786 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2675" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ARSR-FAMILY)" /note="Mb2675, -, len: 126 aa. Equivalent to Rv2642, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Possible transcriptional regulator, arsR family, highly similar to many e.g. Q9X8X8|SCH35.28c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (122 aa), FASTA scores: opt: 390, E(): 3.7e-19, (56.55% identity in 122 aa overlap); Q9L220|SC1A2.21 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (119 aa), FASTA scores: opt: 378, E(): 2.3e-18, (59.8% identity in 97 aa overlap); Q9L1V5|SC4A9.07 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (117 aa), FASTA scores: opt: 359, E(): 4.1e-17, (56.9% identity in 116 aa overlap); P52144|ARR2_ECOLI|ARSR from Escherichia coli (117 aa), FASTA scores: opt: 202, E(): 1e-06, (39.8% identity in 88 aa overlap); etc. Also similar to downstream ORF P71939|Rv2640c|MTCY441.10c PUTATIVE TRANSCRIPTIONAL REGULATORY PROTEIN from Mycobacterium tuberculosis (119 aa), FASTA scores: opt: 237, E(): 5e-09, (38.55% identity in 109 aa overlap); and others from Mycobacterium tuberculosis e.g. O05840|Rv2358|MTCY27.22c. Contains PS00846 Bacterial regulatory proteins, arsR family signature. Contains helix-turn-helix motif at aa 58-79 (Score 1112, +2.97 SD). BELONGS TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb2675 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1T1" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR018334" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T1" /protein_id="SIU01293.1" /translation="MSNLHPLPEVASCVVAPLVREPLNPPAAAEMAARFKALADPVRL QLLSSVASRAGGEACVCDISAGVEVSQPTISHHLKVLRDAGLLTSRRRASWVYYAVVP EALTVLSNLLSVHADAAPALGAPA" CDS 2938783..2940279 /codon_start=1 /transl_table=11 /gene="arsC" /locus_tag="BQ2027_MB2676" /product="PROBABLE ARSENIC-TRANSPORT INTEGRAL MEMBRANE PROTEIN ARSC" /note="Mb2676, arsC, len: 498 aa. Equivalent to Rv2643, len: 498 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 498 aa overlap). Probable arsC, arsenical resistance transport integral membrane protein, highly similar or similar to others e.g. Q9L1X4|SC3D9.05 POSSIBLE ARSENIC RESISTANCE MEMBRANE TRANSPORT PROTEIN from Streptomyces coelicolor (368 aa), FASTA scores: opt: 1729, E(): 2.2e-96, (74.3% identity in 358 aa overlap); Q9X8Y0|SCH35.26 PUTATIVE HEAVY METAL RESISTANCE MEMBRANE PROTEIN from Streptomyces coelicolor (369 aa), FASTA scores: opt: 1729, E(): 2.2e-96, (73.8% identity in 359 aa overlap); Q06598|ACR3_YEAST|ACR3|YPR201W|P9677.2 ARSENICAL-RESISTANCE PROTEIN from Saccharomyces cerevisiae (Baker's yeast) (404 aa), FASTA scores: opt: 591, E(): 4e-28, (36.6% identity in 380 aa overlap); etc. BELONGS TO THE ACR3 FAMILY. Protein product from Mb2676 detected using SWATH mass spectrometry. Mb2676 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1T7" /db_xref="InterPro:IPR002657" /db_xref="InterPro:IPR004706" /db_xref="InterPro:IPR023485" /db_xref="InterPro:IPR036196" /db_xref="InterPro:IPR038770" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T7" /protein_id="SIU01294.1" /translation="MTETVTRTAAPAVVGKLSTLDRFLPVWIGSAMAAGLLLGRWIPG LHTALEGVQLDGISLPIALGLLIMMYPVLAKVRYDRLDTVTGDRKLLLSSLLLNWVLG PALMFALAWLLLADLPEYRTGLIIVGLARCIAMVIIWNDLACGDREAAAVLVALNSIF QVAMFAALGWFYLSVLPGWLGLEQTTIATSPWQIAKSVLIFLGIPLLAGYLSRRIGEK TKGRNWYESRFLPKVGPWALYGLLFTIVILFALQGDQITGRPLDVARIALPLLAYFAI MWVGGYLLGAALRLGYRRTTTLAFTAASNNFELAIAVAIATYGATSGQALAGVVGPLI EVPVLVGLVYVSLALRNRLAGPNATHDADKPSVLFVCVHNAGRSQMAAGLLTHLAGDR IEVRSAGTEPAGQVNPTAVAAMAEMGIDITANAPTLLTGGQVQSSDVVITMGCGDACP YFPGASYRNWKLPDPAGQPLDVVRMIRDDIADRVQALIAELLATAKTR" mobile_element 2939425..2940353 /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081'-4" /note="IS1081'-4, len: 929 nt. Truncated form of IS1081,len: 1450 nt. Equivalent to IS1081', len: 909 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 909 nt overlap). Truncated at 3' end." CDS complement(2940406..2940723) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2677C" /product="HYPOTHETICAL PROTEIN" /note="Mb2677c, -, len: 105 aa. Equivalent to Rv2644c, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/Swiss-Prot:P65042" /protein_id="SIU01295.1" /translation="MSPRRTSGGVVPVDRYRIDEGLIVVLVFAGRDERRRTVCFADKF GCVHIGNPDLYRPQTSLPQPLPISSHAISGSRFVETTNRADQQEPIGPNRAELFDQAL HAG" tRNA complement(2941370..2941441) /locus_tag="BQ2027_VALT" /product="tRNA-Val" /note="valT, len: 72 nt. Equivalent to valT, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Val, anticodon cac." tRNA 2941626..2941698 /locus_tag="BQ2027_GLYT" /product="tRNA-Gly" /note="glyT, len: 73 nt. Equivalent to glyT, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Gly, anticodon gcc." tRNA 2941725..2941795 /locus_tag="BQ2027_CYSU" /product="tRNA-Cys" /note="cysU, len: 71 nt. Equivalent to cysU, len: 71 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 nt overlap). tRNA-Cys, anticodon gca." tRNA 2941815..2941886 /locus_tag="BQ2027_VALU" /product="tRNA-Val" /note="valU, len: 72 nt. Equivalent to valU, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Val, anticodon gac." CDS complement(2941860..2942081) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2678C" /product="HYPOTHETICAL PROTEIN" /note="Mb2678c, -, len: 73 aa. Equivalent to Rv2660c, len: 75 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). (questionable orf). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 10982 bp deletion leads to deletion of genes between Mb2677c and Mb2678c compared to Mycobacterium tuberculosis strain H37Rv." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2M8" /protein_id="SIU01296.1" /translation="MIAGVDQALAATGQASQRAAGASGGVTVGVGVGTEQRNLSVVAP SQFTFSSRSPDFVDETAGQSWCAILGLNQ" CDS complement(2942078..2942467) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2679C" /product="HYPOTHETICAL PROTEIN" /note="Mb2679c, -, len: 129 aa. Equivalent to Rv2661c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Hypothetical unknown protein. Mb2679c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y216" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01297.1" /translation="MRARSDAGGQSVKSRTSNRSRSSRRSRVRSSISALVDNPQARPR ELPVLCGWPVVRVEPVCEFVPEPVCGQAEVLGEPAAAHRVTSARRSPSTTVCSRSQKA SAVVISSVSSVARVRRASVSSVDATTA" CDS 2942373..2942645 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2680" /product="HYPOTHETICAL PROTEIN" /note="Mb2680, -, len: 90 aa. Equivalent to Rv2662, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Hypothetical unknown protein. Mb2680 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y279" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01298.1" /translation="MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTAS PPQPVSPGGPRLRLVRLTTNPSARAAPIGNGGDSSVCAGEKQCRPP" CDS 2942744..2942977 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2681" /product="HYPOTHETICAL PROTEIN" /note="Mb2681, -, len: 77 aa. Equivalent to Rv2663, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Hypothetical unknown protein. Mb2681 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1T9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01299.1" /translation="MEVRASARKHGINDDAMLHAYRNALRYVELEYHGEVQLLVIGPD QTGRLLELVIPADEPPRIIHANVLRPKFYDYLR" CDS 2942988..2943242 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2682" /product="HYPOTHETICAL PROTEIN" /note="Mb2682, -, len: 84 aa. Equivalent to Rv2664, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Hypothetical protein. Some weak similarity to nearby P71964|Rv2667|clpX'|MT2741|MTCY441.36 POSSIBLE ATP-DEPENDENT PROTEASE ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (252 aa), FASTA scores: opt: 134, E(): 0.027, (31.15% identity in 77 aa overlap). Protein product from Mb2682 detected using SWATH mass spectrometry. Mb2682 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U5" /protein_id="SIU01300.1" /translation="MKHKTDIDEWLDTIEPNPADAHDASHLRRIIAAKEAVQTAESEL RAAVNAARAAGDTWAAIGVALGITRQAAFQRFGPHSTASP" CDS complement(2943239..2943412) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2683C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2683c, -, len: 57 aa. Equivalent to Rv2664cA, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein; N-terminus similar to N-terminus of Rv1046c from Mycobacterium tuberculosis (239 aa). Mb2683c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U6" /protein_id="SIU01301.1" /translation="MSLCVGVWGFVRVWGCGGQQTQNPCLANTTPRVSARRCDSASAL SGRRQLHRGGAPV" CDS 2943590..2943871 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2684" /product="HYPOTHETICAL ARGININE RICH PROTEIN" /note="Mb2684, -, len: 93 aa. Equivalent to Rv2665, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Hypothetical arg-rich protein, showing some similarity to N-terminus of P71640|Rv2811|MTCY16B7.32c HYPOTHETICAL 21.1 KDA PROTEIN from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 157, E(): 0.0011, (37.5% identity in 72 aa overlap); and also to part of O35132|CP2B_RAT|CYP27B1|CYP27B 25-HYDROXYVITAMIN D-1 ALPHA HYDROXYLASE, MITOCHONDRIAL PRECURSOR from Rattus norvegicus (Rat) (501 aa), FASTA scores: opt: 106, E(): 5.4, (34.5% identity in 87 aa overlap). Mb2684 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01302.1" /translation="MIVVRTAEAAEQALTEGQLVCPRRGCGDTLRRWRYGRRRHVRSL GSQVIDVRPQRVRCRRCESTHVLLPAALQPRLGRGGGGQLRPGVWCTGR" gene 2943837..2944765 /locus_tag="BQ2027_IS1081'-4" repeat_region 2943910..2943924 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS 2943962..2944765 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2685" /product="probable transposase for insertion sequence element is1081 (fragment)" /note="Mb2685, -, len: 267 aa. Equivalent to Rv2666, len: 267 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 267 aa overlap). Transposase (fragment), identical in region of overlap to P35882|TRA1_MYCBO|TRA1_MYCTU TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1081 from Mycobacterium tuberculosis or bovis (415 aa). Last 4 codons not part of gene. Contains PS01007 Transposases, Mutator family, signature." /db_xref="GOA:A0A1R3Y1U2" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U2" /protein_id="SIU01303.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANHGRHNA" CDS 2944787..2945545 /codon_start=1 /transl_table=11 /gene="clpX'" /locus_tag="BQ2027_MB2686" /product="possible atp-dependent protease atp-binding subunit clpc2" /note="Mb2686, clpX', len: 252 aa. Equivalent to Rv2667, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Possible clpX', ATP-dependent protease atp-binding subunit (EC 3.4.-.-), highly similar to Q9X8L2|SCE9.40 HYPOTHETICAL 27.3 KDA PROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 877, E(): 2.2e-46, (57.25% identity in 255 aa overlap). The second half of the protein is highly similar to N-terminal of several CLP-FAMILY proteins e.g. P24428|CLPC_MYCLE|ML0235 PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium leprae (848 aa), FASTA scores: opt: 307, E(): 3.2e-11, (38.6% identity in 158 aa overlap); O06286|CLPC_MYCTU|Rv3596c|MT3703|MTCY07H7B.26 PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (848 aa), FASTA scores: opt: 307, E(): 3.2e-11, (38.6% identity in 158 aa overlap); Q9S6T8|SCE94.24c PUTATIVE CLP-FAMILY ATP-BINDING PROTEASE from Streptomyces coelicolor (841 aa), FASTA scores: opt: 303, E(): 5.6e-11, (38.8% identity in 152 aa overlap); etc. Some weak similarity to nearby P71961|MTCY441.33|Rv2664 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (83 aa). Protein product from Mb2686 detected using SWATH mass spectrometry." /db_xref="GOA:P0A525" /db_xref="InterPro:IPR004176" /db_xref="InterPro:IPR036628" /db_xref="UniProtKB/Swiss-Prot:P0A525" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01304.1" /translation="MPEPTPTAYPVRLDELINAIKRVHSDVLDQLSDAVLAAEHLGEI ADHLIGHFVDQARRSGASWSDIGKSMGVTKQAAQKRFVPRAEATTLDSNQGFRRFTPR ARNAVVAAQNAAHGAASSEITPDHLLLGVLTDPAALATALLQQQEIDIATLRTAVTLP PAVTEPPQPIPFSGPARKVLELTFREALRLGHNYIGTEHLLLALLELEDGDGPLHRSG VDKSRAEADLITTLASLTGANAAGATDAGATDAG" CDS 2945624..2946145 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2687" /product="POSSIBLE EXPORTED ALANINE AND VALINE RICH PROTEIN" /note="Mb2687, -, len: 173 aa. Equivalent to Rv2668, len: 173 aa, from Mycobacterium tuberculosis strain H37Rv, (98.3% identity in 173 aa overlap). Hypothetical ala-, val-rich protein, possibly exported. Equivalent to AAK47057 from Mycobacterium tuberculosis strain CDC1551 (208 aa) but N-terminal part shorter 35 aa and with few differences. Has potential signal peptide sequence. Protein product from Mb2687 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2687 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01305.1" /translation="MRRWLIVLATLLVAAAGVAAANDVPRAWAGDAPIGHIGDTLRVD TGTYVADVTVSSVVPVDPPPGFAYTRSGVPVKSFPDSSVTRADVTVRAVRVPNSFILA TNFSFTGVTPFADAYKPRPCDASDWLDAALGNAPQGSIVRGGVYWDAYRDPVSVVVLL DKKTGQHLAQWNL" CDS 2946174..2946644 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2688" /product="gcn5-related n-acetyltransferase" /note="Mb2688, -, len: 156 aa. Equivalent to Rv2669, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Conserved hypothetical protein, showing some similarity to various proteins e.g. Q9A6M0|CC2073 ACETYLTRANSFERASE (GNAT FAMILY) from Caulobacter crescentus (178 aa), FASTA scores: opt: 242, E(): 1.2e-09, (30.9% identity in 165 aa overlap); Q99RQ8|SA2159 hypothetical protein similar to transcription repressor of sporulation, septation and degradation paiA from Staphylococcus aureus subsp. aureus N315 (171 aa), FASTA scores: opt: 214, E(): 9.8e-08, (27.5% identity in 160 aa overlap); BAB58531|SAV2369 HYPOTHETICAL 20.1 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (171 aa), FASTA scores: opt: 214, E(): 9.8e-08, (27.5% identity in 160 aa overlap); P21340|PAIA_BACSU|O32112 PROTEASE SYNTHASE AND SPORULATION from Bacillus subtilis (171 aa), FASTA scores: opt: 209, E(): 2.1e-07, (22.85% identity in 162 aa overlap); etc. Mb2688 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63426" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/Swiss-Prot:P63426" /protein_id="SIU01306.1" /translation="MTDADELAAVAARTFPLACPPAVAPEHIASFVDANLSSARFAEY LTDPRRAILTARHDGRIVGYAMLIRGDDRDVELSKLYLLPGYHGTGAAAALMHKVLAT AADWGALRVWLGVNQKNQRAQRFYAKTGFKINGTRTFRLGAHHENDYVMVRELV" CDS complement(2946622..2947731) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2689C" /product="AFG1 family ATPase" /note="Mb2689c, -, len: 369 aa. Equivalent to Rv2670c, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 369 aa overlap). Conserved hypothetical protein, equivalent, but longer 164 aa, to O05683|MLC1351.22c HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 847, E(): 1.2e-45, (82.4% identity in 159 aa overlap). And highly similar to Q9X824|SC9B1.04c PUTATIVE ATP/GTP-BINDING INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (350 aa), FASTA scores: opt: 1169, E(): 2e-65, (56.85% identity in 343 aa overlap); and Q9RWB0|DR0759 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (351 aa), FASTA scores: opt: 859, E(): 4e-46, (45.9% identity in 331 aa overlap). Also some similarity with other proteins e.g. P46442|YHCM_ECOLI|AAG58360|BAB37528 HYPOTHETICAL PROTEIN from Escherichia coli strains K12 and O157:H7 (375 aa), FASTA scores: opt: 237, E(): 2.1e-07, (28.0% identity in 325 aa overlap); Q9JRK2|NMA1520|NMB1306 PUTATIVE NUCLEOTIDE-BINDING PROTEIN from Neisseria meningitidis (serogroup A and B) (383 aa), FASTA scores: opt: 221, E(): 2.1e-06, (27.8% identity in 356 aa overlap); Q9HVX7|PA4438 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (364 aa), FASTA scores: opt: 211, E(): 8.5e-06, (28.9% identity in 353 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2689c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y224" /db_xref="InterPro:IPR004435" /db_xref="InterPro:IPR005654" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y224" /protein_id="SIU01307.1" /translation="MTLIAARRYSATMHGSASEACGSVDHLVDRHPTVSPVRLIAQLR PPPTFAEVSFATYRPDPVEPTQAAAVVACQDFCRQAVERRAGRKKWFGKRDVLPGVGL YLDGGFGVGKTHLLASAYYQLPGTGPDAPTCPKAFATFGELTQLAGVFGFADCIDLLA NYTALCIDEFELDDPGNTTLISRLLSALVERGVSVAATSNTLPEQLGEGRFAAQDFLR EINTLASIFTTVRIEGPDYRHRDLPPAPAPLSDEEVAARAARVEGATLDDFDALCAHL ATMHPSRYLTLIEGVTAVFLTGVHGIDDQNVALRLVALVDRLYDAGIPVVASGAKLDT IFSEEMLAGGYRKKYLRATSRLLALTAGVIQAREP" CDS 2947730..2948506 /codon_start=1 /transl_table=11 /gene="ribD" /locus_tag="BQ2027_MB2690" /product="POSSIBLE BIFUNCTIONAL ENZYME RIBOFLAVIN BIOSYNTHESIS PROTEIN RIBD: DIAMINOHYDROXYPHOSPHORIBOSYLAMINOPYRIMIDINE DEAMINASE (RIBOFLAVIN-SPECIFIC DEAMINASE) + 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL REDUCTASE (HTP REDUCTASE)" /note="Mb2690, ribD, len: 258 aa. Equivalent to Rv2671, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 258 aa overlap). Possible ribD (alternate gene name: ribG), bifunctional riboflavin biosynthesis protein incuding diaminohydroxyphosphoribosylaminopyrimidine deaminase and 5-amino-6-(5-phosphoribosylamino) uracil reductase (EC 3.5.4.26 and 1.1.1.193), highly similar to O05684|MLC1351.23|ML1340 POSSIBLE REDUCTASE from Mycobacterium leprae (268 aa), FASTA scores: opt: 1211, E(): 3e-68, (72.9% identity in 251 aa overlap). Also weakly similar to others e.g. Q9HWX2|RIBD|PA4056 RIBOFLAVIN-SPECIFIC DEAMINASE/REDUCTASE from Pseudomonas aeruginosa (373 aa), FASTA scores: opt: 211, E(): 6.3e-06, (30.1% identity in 216 aa overlap); Q9HQA1|RIBG|VNG1256G RIBOFLAVIN-SPECIFIC DEAMINASE from Halobacterium sp. strain NRC-1 (220 aa), FASTA scores: opt: 202, E(): 1.5e-05, (27.0% identity in 174 aa overlap); O28272|RIB7_ARCFU|AF2007 PUTATIVE 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL REDUCTASE (HTP REDUCTASE) (EC 1.1.1.193) from Archaeoglobus fulgidus (219 aa), FASTA scores: opt: 209, E(): 5.4e-06, (24.15% identity in 211 aa overlap); P25539|RIBD_ECOLI|RIBG|B0414 from Escherichia coli strain K12 (367 aa), FASTA scores: opt: 185, E(): 0.00026, (26.7% identity in 221 aa overlap); etc. But also similar to several hydrolases e.g. Q9X825|SC9B1.05 PUTATIVE HYDROLASE from Streptomyces coelicolor (265 aa), FASTA scores: opt: 536, E(): 2.9e-26, (44.25% identity in 235 aa overlap); Q9RKM1|SCD17.10 PUTATIVE BIFUNCTIONAL ENZYME DEAMINASE/REDUCTASE from Streptomyces coelicolor (376 aa), FASTA scores: opt: 228, E(): 5.6e-07, (33.5% identity in 188 aa overlap); etc. Equivalent to AAK47060 from Mycobacterium tuberculosis strain CDC1551 (239 aa) but longer 19 aa. SUPPOSED BELONG TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY IN THE N-TERMINAL SECTION; and TO THE HTP REDUCTASE FAMILY IN THE C-TERMINAL SECTION. Protein product from Mb2690 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2690 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y294" /db_xref="InterPro:IPR002734" /db_xref="InterPro:IPR024072" /db_xref="UniProtKB/TrEMBL:A0A1R3Y294" /protein_id="SIU01308.1" /translation="MPDSGQLGAADTPLRLLSSVHYLTDGELPQLYDYPDDGTWLRAN FISSLDGGATVDGTSGAMAGPGDRFVFNLLRELADVIVVGVGTVRIEGYSGVRMGVVQ RQHRQARGQSEVPQLAIVTRSGRLDRDMAVFTRTEMAPLVLTTTAVADDTRQRLAGLA EVIACSGDDPGTVDEAVLVSQLAARGLRRILTEGGPTLLGTFVERDVLDELCLTIAPY VVGGLARRIVTGPGQVLTRMRCAHVLTDDSGYLYTRYVKT" CDS 2948573..2950159 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2691" /product="POSSIBLE SECRETED PROTEASE" /note="Mb2691, -, len: 528 aa. Equivalent to Rv2672, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 528 aa overlap). Possible secreted protease (EC 3.4.-.-), equivalent to O05685|MLC1351.24|ML1339 PUTATIVE SECRETED PROTEASE from Mycobacterium leprae (525 aa), FASTA scores: opt: 2722, E(): 9.4e-140, (74.45% identity in 528 aa overlap). Also similar to several exported proteinases from Streptomyces and Mycobacteria e.g. Q54399|SLPE PROTEINASE from Streptomyces lividans (513 aa), FASTA scores: opt: 429, E(): 6.8e-16, (26.2% identity in 538 aa overlap); Q9FCK9|2SC3B6.03c PEPTIDASE from Streptomyces coelicolor (513 aa), FASTA scores: opt: 421, E(): 1.8e-15, (26.45% identity in 541 aa overlap); Q10508|YM23_MYCTU from Mycobacterium tuberculosis (520 aa), FASTA scores: opt: 349, E(): 1.4e-11, (26.6% identity in 523 aa overlap); etc. Equivalent to AAK47061 from Mycobacterium tuberculosis strain CDC1551 (518 aa) but longer 10 aa. Protein product from Mb2691 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2691 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1V3" /db_xref="InterPro:IPR013595" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1V3" /protein_id="SIU01309.1" /translation="MATVVGMSRPMTSTAMLVALTCSATVLAACVPAFGADPRFATYS GAGPQGAATTTPPPAGPPPLAAPKNDLSWHDCTSRVYSNAGIPAAPGVKLECASYDTD LDPLVGGSTAVSIGVVRARSNQTPSDAGPLVFTTGSDLPSSTQLPVWLAHAGIDVLRS HPIVAVDRRGMGMSSPIDCRDHFDRDEMRDQAQFQAGDDPVANLSDISNTATTDCTDA IAPGESAYDNTHAASDIERLRKLWDVPALAFVGIGNGTQVALAYAASRPDNVARLILD SPIALGVSAEAAAEQQVQGQQAALDAFAAQCVAVNCALGSDPKGAVSALLSAARSGDG PGGASVAAVANAVATALGFPDSGRVDSTTKLADALAAARSGDMNLLSALINRADTTRD TDGQFISSCSDAVNRPTPDRVRELVVAWGKLYPQFGAVAALNLVKCVHWPSSSPPQPP KDLKVDVLLLGVQNDPIVGNEGVAATAATAINANAASKRVMWQGIGHGASIYSSCAVP PLVAYLDTGKLPDTDTYCPA" CDS 2950182..2951483 /codon_start=1 /transl_table=11 /gene="aftc" /locus_tag="BQ2027_MB2692" /product="possible arabinofuranosyltransferase aftc" /note="Mb2692, -, len: 433 aa. Equivalent to Rv2673, len: 433 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 433 aa overlap). Possible conserved integral membrane protein, equivalent to MLC1351.25|ML1338 POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (440 aa), FASTA scores: opt: 2410, E(): 5.3e-143, (82.05% identity in 434 aa overlap); and showing some similarity with Q9CBX0|ML1504 PROBABLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (430 aa), FASTA scores: opt: 159, E(): 0.014, (24.4% identity in 340 aa overlap). Also similar to Q53873|SC6G4.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (411 aa), FASTA scores: opt: 383, E(): 1.4e-16, (29.6% identity in 422 aa overlap); and with weak similarity with P71061|YVFB HYPOTHETICAL PROTEIN from Bacillus subtilis (396 aa), FASTA scores: opt: 136, E(): 0.36, (24.35% identity in 279 aa overlap); and BAB60134|TVG1014811 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (695 aa), FASTA scores: opt: 133, E(): 0.85, (26.45% identity in 280 aa overlap). Shows also some similarity with O06557|Rv1159|MTCI65.26 HYPOTHETICAL 47.1 KDA PROTEIN from Mycobacterium tuberculosis (431 aa), FASTA scores: opt: 149, E(): 0.059, (22.45% identity in 410 aa overlap); and O53515|Rv2181|MTV021.14 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis (427 aa), FASTA scores: opt: 129, E(): 1, (24.8% identity in 367 aa overlap). Protein product from Mb2692 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2692 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1U8" /db_xref="InterPro:IPR018584" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U8" /protein_id="SIU01310.1" /translation="MYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPA AVLSVLHRSIVLTTNGNITDDFKPVYRAVLNFRRGWDIYNEHFDYVDPHYLYPPGGTL LMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPALILAMFATETV TNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIGLTLVLKPLLGPLLLLPLL NRQWRALVAAVVVPVVVNVAALPLVSDPMSFFTRTLPYILGTRDYFNSSILGNGVYFG LPTWLILFLRILFTAITFGALWLLYRYYRTGDPLFWFTTSSGVLLLWSWLVMSLAQGY YSMMLFPFLMTVVLPNSVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGW SLLLIVTFTVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR" CDS 2951652..2952062 /codon_start=1 /transl_table=11 /gene="msrb" /locus_tag="BQ2027_MB2693" /product="probable peptide methionine sulfoxide reductase msrb (protein-methionine-r-oxide reductase) (peptide met(o) reductase)" /note="Mb2693, -, len: 136 aa. Equivalent to Rv2674, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Conserved hypothetical protein, highly similar to various proteins e.g. Q9X828|SC9B1.08 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (135 aa), FASTA scores: opt: 653, E(): 1.8e-37, (71.1% identity in 128 aa overlap); O26807|MTH711 TRANSCRIPTIONAL REGULATOR from Methanothermobacter thermautotrophicus (151 aa), FASTA scores: opt: 533, E(): 2.7e-29, (58.15% identity in 129 aa overlap); Q9C5C8|AT4G21860 HYPOTHETICAL 22.0 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (202 aa), FASTA scores: opt: 490, E(): 2.8e-26, (54.05% identity in 124 aa overlap); P39903|YEAA_ECOLI|B1778|Z2817|ECS2487 HYPOTHETICAL PROTEIN from Escherichia coli strains K12 and O157:H7 (137 aa), FASTA scores: opt: 426, E(): 4.4e-22, (46.8% identity in 126 aa overlap). Protein product from Mb2693 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2693 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1V0" /db_xref="InterPro:IPR002579" /db_xref="InterPro:IPR011057" /db_xref="InterPro:IPR028427" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1V0" /protein_id="SIU01311.1" /translation="MTRPKLELSDDEWRQKLTPQEFHVLRRAGTERPFTGEYTDTTTA GIYQCRACGAELFRSTEKFESHCGWPSFFDPKSSDAVTLRPDHSLGMTRTEVLCANCD SHLGHVFAGEGYPTPTDKRYCINSISLRLVPGSV" CDS complement(2952130..2952882) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2694C" /product="Methyltransferase (EC colocalized with Q" /EC_number="2.1.1.-" /note="Mb2694c, -, len: 250 aa. Equivalent to Rv2675c, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Conserved hypothetical protein. C-terminus highly similar to Q50010|U1764Z from Mycobacterium leprae (69 aa), FASTA scores: opt: 284, E(): 4.6e-11, (68.25% identity in 63 aa overlap). Shows some similarity with Q9P3V6|SPAC1348.04 (alias Q9P3E7|Q9P7U5) HYPOTHETICAL 16.6 KDA PROTEIN from Schizosaccharomyces pombe (Fission yeast) (145 aa), FASTA scores: opt: 203, E(): 9.5e-06, (33.05% identity in 118 aa overlap); Q9ZSZ7|BMCT METHYL CHLORIDE TRANSFERASE from Batis maritima (230 aa), FASTA scores: opt: 197, E(): 3.3e-05, (28.85% identity in 156 aa overlap); P72459|STSG METHYLTRANSFERASE from Streptomyces griseus (253 aa), FASTA scores: opt: 194, E(): 5.5e-05, (24.45% identity in 229 aa overlap); etc. Also similar to various proteins from Mycobacterium tuberculosis e.g. P71805|Rv1377c|MTCY02B12.11c HYPOTHETICAL 22.8 KDA PROTEIN (212 aa), FASTA scores: opt: 431, E(): 8.3e-20, (39.1% identity in 197 aa overlap); O06426|Rv0560c|MTCY25D10.39c HYPOTHETICAL 25.9 KDA PROTEIN (241 aa), FASTA scores: opt: 379, E(): 1.6e-16, (35.95% identity in 178 aa overlap); O69667|Rv3699|MTV025.047 PUTATIVE METHYLTRANSFERASE (233 aa), FASTA scores: opt: 297, E(): 2e-11, (30.55% identity in 193 aa overlap); etc. Protein product from Mb2694c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2694c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1V7" /protein_id="SIU01312.1" /translation="MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQL VALGAIRGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVNFQ VGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGARLYMFEFGEH NVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLSVEALELMAARNPDMADQV RCVLERFRAIKPWLVGGRVHAPFWEVHATRVD" CDS complement(2952879..2953574) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2695C" /product="Coproheme decarboxylase HemQ (no EC)" /note="Mb2695c, -, len: 231 aa. Equivalent to Rv2676c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Conserved hypothetical protein, equivalent to Q9CCB2|ML1045 (alias Q50009|U1764Y but longer 66 aa) HYPOTHETICAL PROTEIN from Mycobacterium leprae (231 aa), FASTA scores: opt: 1401, E(): 8.7e-88, (87.45% identity in 231 aa overlap). Also highly similar to O69830|SC1B5.02 HYPOTHETICAL 28.1 KDA PROTEIN from Streptomyces coelicolor (243 aa), FASTA scores: opt: 915, E(): 7.7e-55, (61.25% identity in 222 aa overlap); and similar to others e.g. Q9RUB0|DR1481 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (289 aa), FASTA scores: opt: 327, E(): 6.1e-15, (31.8% identity in 176 aa overlap); Q97WP2|SSO2169 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (223 aa), FASTA scores: opt: 285, E(): 3.4e-12, (31.3% identity in 163 aa overlap); BAB59947|TVG0805714 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (223 aa), FASTA scores: opt: 206, E(): 7.7e-07, (25.0% identity in 176 aa overlap); etc. Protein product from Mb2695c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2695c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010644" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1U9" /protein_id="SIU01313.1" /translation="MARLDYDALNATLRYLMFSVFSVSPGALGDQRDAIIDDASTFFK QQEERGVVVRGLYDVAGLRADADFMVWTHAERVEALQATYADFRRTTTLGRACTPVWS GVGLHRPAEFNKSHIPAFLAGEEPGAYICVYPFVRSYEWYLLPDEERRRMLAEHGMAA RGYKDVRANTVPAFALGDYEWILAFEAPELDRIVDLMRELRATDARRHTRAETPFFTG PRVPVEQLVHSLP" CDS complement(2953580..2954938) /codon_start=1 /transl_table=11 /gene="hemY" /locus_tag="BQ2027_MB2696C" /product="PROBABLE PROTOPORPHYRINOGEN OXIDASE HEMY (PROTOPORPHYRINOGEN-IX OXIDASE) (PROTOPORPHYRINOGENASE) (PPO)" /note="Mb2696c, hemY, len: 452 aa. Equivalent to Rv2677c, len: 452 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 452 aa overlap). Probable hemY, protoporphyrinogen oxidase (EC 1.3.3.4), equivalent to Q50008|PPOX_MYCLE|HEMY|ML1044 PROTOPORPHYRINOGEN OXIDASE from Mycobacterium leprae (451 aa), FASTA scores: opt: 2211, E(): 8.8e-118, (75.4% identity in 455 aa overlap). Also similar to others e.g. Q9RV99|DR1130 from Deinococcus radiodurans (462 aa), FASTA scores: opt: 523, E(): 2.7e-22, (29.8% identity in 453 aa overlap); O32434|PPOX_PROFR|HEMY from Propionibacterium freudenreichii shermanii (527 aa), FASTA scores: opt: 344, E(): 4e-12, (32.1% identity in 495 aa overlap); P32397|PPOX_BACSU|HEMY|HEMG from Bacillus subtilis (470 aa), FASTA scores: opt: 305, E(): 5.9e-10, (26.8% identity in 463 aa overlap); etc. BELONGS TO THE PROTOPORPHYRINOGEN OXIDASE FAMILY. COFACTOR: CONTAINS ONE FAD PER HOMODIMER. Protein product from Mb2696c detected using SWATH mass spectrometry. Mb2696c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5A8" /db_xref="InterPro:IPR002937" /db_xref="InterPro:IPR004572" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P0A5A8" /protein_id="SIU01314.1" /translation="MTPRSYCVVGGGISGLTSAYRLRQAVGDDATITLFEPADRLGGV LRTEHIGGQPMDLGAEAFVLRRPEMPALLAELGLSDRQLASTGARPLIYSQQRLHPLP PQTVVGIPSSAGSMAGLVDDATLARIDAEAARPFTWQVGSDPAVADLVADRFGDQVVA RSVDPLLSGVYAGSAATIGLRAAAPSVAAALDRGATSVTDAVRQALPPGSGGPVFGAL DGGYQVLLDGLVRRSRVHWVRARVVQLERGWVLRDETGGRWQADAVILAVPAPRLARL VDGIAPRTHAAARQIVSASSAVVALAVPGGTAFPHCSGVLVAGDESPHAKAITLSSRK WGQRGDVALLRLSFGRFGDEPALTASDDQLLAWAADDLVTVFGVAVDPVDVRVRRWIE AMPQYGPGHADVVAELRAGLPPTLAVAGSYLDGIGVPACVGAAGRAVTSVIEALDAQV AR" CDS complement(2954935..2956008) /codon_start=1 /transl_table=11 /gene="hemE" /locus_tag="BQ2027_MB2697C" /product="probable uroporphyrinogen decarboxylase heme (uroporphyrinogen iii decarboxylase) (uro-d) (upd)" /note="Mb2697c, hemE, len: 357 aa. Equivalent to Rv2678c, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 357 aa overlap). Probable hemE, uroporphyrinogen decarboxylase (EC 4.1.1.37), equivalent to P46809|DCUP_MYCLE|HEME|ML1043 UROPORPHYRINOGEN DECARBOXYLASE from Mycobacterium leprae (357 aa), FASTA scores: opt: 2017, E(): 8.2e-111, (83.75% identity in 357 aa overlap). Also highly similar to many e.g. O69861|DCUP_STRCO|HEME|SC1C3.19 from Streptomyces coelicolor (355 aa), FASTA scores: opt: 1165, E(): 5.6e-61, (58.15% identity in 349 aa overlap); P32395|DCUP_BACSU|HEME from Bacillus subtilis (353 aa), FASTA scores: opt: 859, E(): 4.5e-43, (44.1% identity in 356 aa overlap); Q9RV96|DCUP_DEIRA|HEME|DR1133 from Deinococcus radiodurans (344 aa), FASTA scores: opt: 850, E(): 1.5e-42, (43.0% identity in 349 aa overlap); etc. Equivalent to AAK47067 from Mycobacterium tuberculosis strain CDC1551 (372 aa) but shorter 15 aa. Contains PS00907 Uroporphyrinogen decarboxylase signature 2. BELONGS TO THE UROPORPHYRINOGEN DECARBOXYLASE FAMILY. Protein product from Mb2697c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2697c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TY47" /db_xref="InterPro:IPR000257" /db_xref="InterPro:IPR006361" /db_xref="InterPro:IPR038071" /db_xref="UniProtKB/Swiss-Prot:Q7TY47" /protein_id="SIU01315.1" /translation="MSTRRDLPQSPYLAAVTGRKPSRVPVWFMRQAGRSLPEYRALRE RYSMLAACFEPDVACEITLQPIRRYDVDAAILFSDIVVPLRAAGVDLDIVADVGPVIA DPVRTAADVAAMKPLDPQAIQPVLVAASLLVAELGDVPLIGFAGAPFTLASYLVEGGP SRHHAHVKAMMLAEPASWHALMAKLTDLTIAFLVGQIDAGVDAIQVFDSWAGALSPID YRQYVLPHSARVFAALGEHGVPMTHFGVGTAELLGAMSEAVTAGERPGRGAVVGVDWR TPLTDAAARVVPGTALQGNLDPAVVLAGWPAVERAARAVVDDGRRAVDAGAAGHIFNL GHGVLPESDPAVLADLVSLVHSL" CDS 2956061..2956891 /codon_start=1 /transl_table=11 /gene="echA15" /locus_tag="BQ2027_MB2698" /product="PROBABLE ENOYL-COA HYDRATASE ECHA15 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb2698, -, len: 276 aa. Equivalent to Rv2679, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 276 aa overlap). Probable echA15, enoyl-CoA hydratase (EC 4.2.1.17), similar to P53526|ECHC_MYCLE|ECHA12|ML1241|MLCB1610.01|B1170_C2_224 PROBABLE ENOYL-COA HYDRATASE from Mycobacterium leprae (294 aa), FASTA scores: opt: 368, E(): 2.5e-16, (32.15% identity in 277 aa overlap). Also highly similar to Q9RXX1|DR0184 from Deinococcus radiodurans (273 aa), FASTA scores: opt: 993, E(): 2.2e-56, (58.15% identity in 263 aa overlap); and similar to many e.g. Q9ETY7|PACA|PAAG from Azoarcus evansii (273 aa), FASTA scores: opt: 396, E(): 3.8e-18, (34.9% identity in 258 aa overlap); O29299|AF0963|FAD-3 from Archaeoglobus fulgidus (259 aa), FASTA scores: opt: 363, E(): 4.7e-16, (30.4% identity in 250 aa overlap); P77467|PAAG_ECOLI|B1394 from Escherichia coli strain W (262 aa), FASTA scores: opt: 357, E(): 1.1e-15, (31.75% identity in 252 aa overlap); etc. Also similar to O53163|ECHC_MYCTU|ECHA12|FADB2|Rv1472|MT1518|MT V007.19 ENOYL-COA HYDRATASE from Mycobacterium tuberculosis (285 aa), FASTA scores: opt: 355, E(): 1.6e-15, (31.3% identity in 265 aa overlap); and O06542|ECHA10|Rv1142c|MTCI65.09c|Z95584 ENOYL-COA HYDRATASE from Mycobacterium tuberculosis (268 aa). Contains PS00166 Enoyl-CoA hydratase/isomerase signature. BELONGS TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb2698 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2698 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2P7" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2P7" /protein_id="SIU01316.1" /translation="MPVTYDDFPSLRCEIHDQPGHEGVLELVLDSPGLNSVGPHMHRD LADIWPVIDRDPAVRVVLVRGEGKAFSSGGSFDLIAETIGDYQGRLRIMREARDLVLN LVNFDKPVVSAIRGPAVGAGLVVALLADISVAGRAAKIIDGHTKLGVAAGDHAAICWP LLVGMAKAKYYLLTCEPLSGEEAERIGLVSICVDDDDVLPTATRLAERLAAGAQNAIR WTKRSLNHWYRMFGPAFETSLGLEFIGFGGPDVREGLAAHREKRPARFGADPDPGAGS " CDS 2957153..2957785 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2699" /product="conserved protein" /note="Mb2699, -, len: 210 aa. Equivalent to Rv2680, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 210 aa overlap). Conserved hypothetical protein, equivalent to Q50005|ML1041|U1764V HYPOTHETICAL PROTEIN from Mycobacterium leprae (196 aa), FASTA scores: opt: 1136, E(): 9.7e-66, (83.95% identity in 193 aa overlap). Also similar to O69860|SC1C3.18c HYPOTHETICAL 24.7 KDA PROTEIN from Streptomyces coelicolor (238 aa), FASTA scores: opt: 516, E(): 5.7e-26, (45.5% identity in 189 aa overlap); and similar in part to Q9I6V4|PA0178 PROBABLE TWO-COMPONENT SENSOR from Pseudomonas aeruginosa (639 aa), FASTA scores: opt: 120, E(): 3.1, (33.05% identity in 115 aa overlap); and a few other proteins. Equivalent to AAK47069 from Mycobacterium tuberculosis strain CDC1551 (178 aa) but longer 32 aa; and N-terminus highly similar to N-terminus of AAK48352|MT3984 HYPOTHETICAL 4.2 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (38 aa), FASTA scores: opt: 102, E(): 3.6, (62.05% identity in 29 aa overlap). Protein product from Mb2699 detected using shotgun mass spectrometry. Mb2699 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021555" /db_xref="UniProtKB/TrEMBL:A0A1R3Y229" /protein_id="SIU01317.1" /translation="MTSAGDDAERSDEEERRLTSAEPALFREAVAAMNAVTVRPEIEL GPIRPPQRLAPYSYALGAEIKHPELDVIPERSEGDAFGRLIMLYDPDGSDAWDGTIRL VAYVQADLDSSEAVDPLLPEVAWSWLVDALTARTDQVRALGGTVTATTSVRYGDISGP PRAHQLELRASWTATTPDLGAHVQAFCDVLEHAAGLPPAGVTDLGSRSRA" CDS 2957787..2959103 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2700" /product="Ribonuclease D (EC" /EC_number="3.1.26.3" /note="Mb2700, -, len: 438 aa. Equivalent to Rv2681, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 438 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q50004|ML1040|U1764U HYPOTHETICAL PROTEIN from Mycobacterium leprae (429 aa), FASTA scores: opt: 2146, E(): 1.1e-119, (77.4% identity in 416 aa overlap). Also highly similar to O69858|SC1C3.16c HYPOTHETICAL 42.5 KDA PROTEIN from Streptomyces coelicolor (394 aa), FASTA scores: opt: 1336, E(): 9e-72, (51.6% identity in 405 aa overlap); and with some similarity to RIBONUCLEASES D e.g. Q983F2|MLL8354 from Rhizobium loti (Mesorhizobium loti) (383 aa), FASTA scores: opt: 379, E(): 3.9e-15, (31.6% identity in 323 aa overlap); Q9A7L8|CC1704 from Caulobacter crescentus (389 aa), FASTA scores: opt: 370, E(): 1.3e-14, (31.45% identity in 318 aa overlap); CAC45770 from Rhizobium meliloti (Sinorhizobium meliloti) (383 aa), FASTA scores: opt: 331, E(): 2.7e-12, (27.75% identity in 357 aa overlap); etc. Protein product from Mb2700 detected using SWATH mass spectrometry. Mb2700 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2A8" /db_xref="InterPro:IPR002121" /db_xref="InterPro:IPR002562" /db_xref="InterPro:IPR010997" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR036397" /db_xref="InterPro:IPR041605" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A8" /protein_id="SIU01318.1" /translation="MCPEPSHAGAAESEGTESEPTPLLRPAGGIPDLCVTVGEIAAAA ELLDRGRGPFAVDAERASGFRYSGRAYLIQIRRAEAGTVLIDPVSHGGDPLTVLAPVA EVLSTNEWILHSADQDLPCLAEVGMRPPALYDTELAGRLAGFDRVNLAAMVERLLGLG LTKGHGAADWSKRPLPSAWLNYAALDVELLIELRAVISRVLAEQGKTDWAAQEFEHLR SFESRPPPAAARQDRWRRTSGIHKVHDRRGLAAVRELWTARDRIAQRRDIAPRRILPD SAIIDAAIADPKSVDDLVALPVFGGRNQRRSAAVWWAALAAARESPDPPEIAEPANGP PPPGRWVRRKPAAAARLDAARAALTEVSQRVRVPTENLVSPDLVRRLCWEWEDISQSS PDPIAAVEAYLRTGQARAWQLELVVPILTAALTGAPDAGAQGDDGS" CDS complement(2959100..2961016) /codon_start=1 /transl_table=11 /gene="dxs1" /locus_tag="BQ2027_MB2701C" /standard_name="dxs" /product="PROBABLE 1-DEOXY-D-XYLULOSE 5-PHOSPHATE SYNTHASE DXS1 (1-DEOXYXYLULOSE-5-PHOSPHATE SYNTHASE) (DXP SYNTHASE) (DXPS)" /note="Mb2701c, dxs1, len: 638 aa. Equivalent to Rv2682c, len: 638 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 638 aa overlap). Probable dxs1, 1-deoxy-D-xylulose 5-phosphate synthase (EC 2.2.-.-), equivalent to Q50000|DXS_MYCLE|TKTB|ML1038 1-DEOXY-D-XYLULOSE 5-PHOSPHATE SYNTHASE from Mycobacterium leprae (643 aa), FASTA scores: opt: 3635, E(): 5.6e-209, (86.4% identity in 632 aa overlap). Also highly similar to other Q9X7W3|DXS_STRCO|DXS|SC6A5.17 from Streptomyces coelicolor (656 aa), FASTA scores: opt: 2501, E(): 2e-141, (61.3% identity in 623 aa overlap); Q9K971|DXS_BACHD|DXS|BH2779 from Bacillus halodurans (629 aa), FASTA scores: opt: 1612, E(): 1.8e-88, (41.35% identity in 619 aa overlap); P77488|DXS_ECOLI|DXS|B0420 from Escherichia coli strain K12 (619 aa), FASTA scores: opt: 1511, E(): 1.8e-82, (39.5% identity in 625 aa overlap); etc. Also similar to O50408|Rv3379c|MTV004.37c from Mycobacterium tuberculosis (536 aa). BELONGS TO THE TRANSKETOLASE FAMILY. DXS SUBFAMILY. COFACTOR: THIAMINE PYROPHOSPHATE. Note that previously known as dxs. Protein product from Mb2701c detected using SWATH mass spectrometry. Mb2701c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A555" /db_xref="InterPro:IPR005474" /db_xref="InterPro:IPR005475" /db_xref="InterPro:IPR005477" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR020826" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR033248" /db_xref="UniProtKB/Swiss-Prot:P0A555" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01319.1" /translation="MLQQIRGPADLQHLSQAQLRELAAEIREFLIHKVAATGGHLGPN LGVVELTLALHRVFDSPHDPIIFDTGHQAYVHKMLTGRSQDFATLRKKGGLSGYPSRA ESEHDWVESSHASAALSYADGLAKAFELTGHRNRHVVAVVGDGALTGGMCWEALNNIA ASRRPVIIVVNDNGRSYAPTIGGVADHLATLRLQPAYEQALETGRDLVRAVPLVGGLW FRFLHSVKAGIKDSLSPQLLFTDLGLKYVGPVDGHDERAVEVALRSARRFGAPVIVHV VTRKGMGYPPAEADQAEQMHSTVPIDPATGQATKVAGPGWTATFSDALIGYAQKRRDI VAITAAMPGPTGLTAFGQRFPDRLFDVGIAEQHAMTSAAGLAMGGLHPVVAIYSTFLN RAFDQIMMDVALHKLPVTMVLDRAGITGSDGASHNGMWDLSMLGIVPGIRVAAPRDAT RLREELGEALDVDDGPTALRFPKGDVGEDISALERRGGVDVLAAPADGLNHDVLLVAI GAFAPMALAVAKRLHNQGIGVTVIDPRWVLPVSDGVRELAVQHKLLVTLEDNGVNGGA GSAVSAALRRAEIDVPCRDVGLPQEFYEHASRSEVLADLGLTDQDVARRITGWVAALG TGVCASDAIPEHLD" CDS 2961160..2961657 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2702" /product="FOG- CBS domain" /note="Mb2702, -, len: 165 aa. Equivalent to Rv2683, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 165 aa overlap). Conserved hypothetical protein, equivalent, but shorter 19 aa, to Q49999|ML1037|U1764Q HYPOTHETICAL PROTEIN from Mycobacterium leprae (184 aa), FASTA scores: opt: 750, E(): 1.2e-41, (73.8% identity in 164 aa overlap). Shows some similarity with other HYPOTHETICAL PROTEINS e.g. Q988S9|MLL6611 from Rhizobium loti (Mesorhizobium loti) (232 aa), FASTA scores: opt: 128, E(): 0.25, (25.5% identity in 149 aa overlap); Q9YFL5|APE0233 from Aeropyrum pernix (340 aa), FASTA scores: opt: 123, E(): 0.73, (29.1% identity in 141 aa overlap); BAB60477|TVG1377730 from Thermoplasma volcanium (174 aa), FASTA scores: opt: 118, E(): 0.86, (28.8% identity in 59 aa overlap); etc. Protein product from Mb2702 detected using shotgun mass spectrometry. Mb2702 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000644" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1V9" /protein_id="SIU01320.1" /translation="MKVNIDPTAPTFATYRRDMRAEQMAEDYPVVSIDSDALDAARML AEHRLPGLLVTAGAGKQYAVLPASQVVRFIVPRYVQDDPSLAGVLNESTADRCAERLS GKKVRDVLPDHLVEVPPANADDTIIEVAAVMARLRSPLLAVVKDGSLLGVVTASRLLA AALKT" CDS 2961662..2962951 /codon_start=1 /transl_table=11 /gene="arsA" /locus_tag="BQ2027_MB2703" /product="PROBABLE ARSENIC-TRANSPORT INTEGRAL MEMBRANE PROTEIN ARSA" /note="Mb2703, arsA, len: 429 aa. Equivalent to Rv2684, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 429 aa overlap). Probable arsA, arsenic-transport integral membrane protein, equivalent to P46838|AG45_MYCLE|ML1036 46 KDA PROBABLE INTEGRAL MEMBRANE PROTEIN (antigen 45, a transmembrane protein related to arsenical pumps) from Mycobacterium leprae (429 aa), FASTA scores: opt: 2067, E(): 9.9e-118, (74.05% identity in 428 aa overlap); and upstream orf O07187|YQ85_MYCTU|ARSB|Rv2685|MT2759|MTCY05A6.06 PROBABLE INTEGRAL MEMBRANE 45.2 KDA PROTEIN ARSB from Mycobacterium tuberculosis (428 aa), FASTA scores: opt: 2148, E(): 1.3e-122, (76.58% identity in 427 aa overlap). Also highly similar to other proteins e.g. Q9UY19|PAB1107 TRANSPORT PROTEIN from Pyrococcus abyssi (425 aa), FASTA scores: opt: 1109, E(): 8.3e-60, (41.45% identity in 427 aa overlap); O59575|PH1912 HYPOTHETICAL 46.0 KDA PROTEIN from Pyrococcus horikoshii (424 aa), FASTA scores: opt: 1101, E(): 2.5e-59, (41.95% identity in 429 aa overlap); Q9KDI2|BH1231 HYPOTHETICAL 46.0 KDA PROTEIN from Bacillus halodurans (428 aa), FASTA scores: opt: 1018, E(): 2.7e-54, (38.9% identity in 427 aa overlap); etc. BELONGS TO THE NADC/P/PHO87 FAMILY OF TRANSPORTERS, P SUBFAMILY (ARS FAMILY). Mb2703 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A607" /db_xref="InterPro:IPR000802" /db_xref="InterPro:IPR004680" /db_xref="UniProtKB/Swiss-Prot:P0A607" /protein_id="SIU01321.1" /translation="MSVVAVTIFVAAYVLIASDRVNKTMVALTGAAAVVVLPVITSHD IFYSHDTGIDWDVIFLLVGMMIIVGVLRQTGVFEYTAIWAAKRARGSPLRIMILLVLV SALASALLDNVTTVLLIAPVTLLVCDRLNINTTSFLMAEVFASNIGGAATLVGDPPNI IVASRAGLTFNDFMLHLTPLVVIVLIALIAVLPRLFGSITVEADRIADVMALDEGEAI RDRGLLVKCGAVLVLVFAAFVAHPVLHIQPSLVALLGAGMLIVVSGLTRSEYLSSVEW DTLLFFAGLFIMVGALVKTGVVNDLARAATQLTGGNIVATAFLILGVSAPISGIIDNI PYVATMTPLVAELVAVMGGQPSTDTPWWALALGADFGGNLTAIGASANVVMLGIARRA GAPISFWEFTRKGAVVTAVSIALAAIYLWLRYFVLLH" CDS 2963031..2964317 /codon_start=1 /transl_table=11 /gene="arsB1" /locus_tag="BQ2027_MB2704" /product="PROBABLE ARSENIC-TRANSPORT INTEGRAL MEMBRANE PROTEIN ARSB1" /note="Mb2704, arsB1, len: 428 aa. Equivalent to Rv2685, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 428 aa overlap). Probable arsB1, arsenic-transport integral membrane protein, equivalent to P46838|AG45_MYCLE|ML1036 46 KDA PROBABLE INTEGRAL MEMBRANE PROTEIN (antigen 45, a transmembrane protein related to arsenical pumps) from Mycobacterium leprae (429 aa), FASTA scores: opt: 2048, E(): 7.3e-120, (74.25% identity in 427 aa overlap); and downstream ORF O07186|YQ84_MYCTU|ARSA|Rv2684|MT2758|MTCY05A6.05 PROBABLE INTEGRAL MEMBRANE PROTEIN ARSA from Mycobacterium tuberculosis (429 aa), FASTA scores: opt: 2154, E(): 1.9e-126, (76.8% identity in 427 aa overlap). Also highly similar to other proteins e.g. O59575|PH1912 HYPOTHETICAL 46.0 KDA PROTEIN from Pyrococcus horikoshii (424 aa), FASTA scores: opt: 1075, E(): 1.9e-59, (43.55% identity in 427 aa overlap); Q9UY19|PAB1107 TRANSPORT PROTEIN from Pyrococcus abyssi (425 aa), FASTA scores: opt: 1062, E(): 1.3e-58, (41.8% identity in 428 aa overlap); Q9KDI2|BH1231 HYPOTHETICAL 46.0 KDA PROTEIN from Bacillus halodurans (428 aa), FASTA scores: opt: 993, E(): 2.4e-54, (39.55% identity in 430 aa overlap); etc. BELONGS TO THE NADC/P/PHO87 FAMILY OF TRANSPORTERS, P SUBFAMILY. Note that previously known as arsB. Protein product from Mb2704 detected using SWATH mass spectrometry. Mb2704 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1W7" /db_xref="InterPro:IPR000802" /db_xref="InterPro:IPR004680" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W7" /protein_id="SIU01322.1" /translation="MSIIAITVFVAGYALIASDRVSKTRVALTCAAIMVGAGIVGSDD VFYSHEAGIDWDVIFLLLGMMIIVSVLRHTGVFEYVAIWAVKRANAAPLRIMILLVLV TALGSALLDNVTTVLLIAPVTLLVCDRLGVNSTPFLVAEVFASNVGGAATLVGDPPNI IIASRAGLTFNDFLIHMAPAVLVVMIALIGLLPWLLGSVTAEPDRVADVLSLNEREAI HDRGLLIKCGVVLVLVFAAFIAHPVLHIQPSLVALLGAGVLVRFSGLERSDYLSSVEW DTLLFFAGLFVMVGALVKTGVVEQLARAATELTGGNELLTVGLILGISAPVSGIIDNI PYVATMTPIVTELVAAMPGHVHPDTFWWALALSADFGGNLTAVGASANVVMLGIARRS CTPISFWKFTRKGAVVTAVSLVLSAVYLWLRYFVFG" CDS complement(2964328..2965086) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2705C" /product="antibiotic-transport integral membrane leucine and alanine and valine rich protein abc transporter" /note="Mb2705c, -, len: 252 aa. Equivalent to Rv2686c, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Probable antibiotic-transport integral membrane leu-, ala-, val-rich protein ABC transporter (see citation below). The region from aa ~115 to 160 is highly similar to N-terminus of Q49998|U1764P HYPOTHETICAL PROTEIN from Mycobacterium leprae (53 aa), FASTA scores: opt: 151, E(): 0.011, (58.15% identity in 43 aa overlap). Shows some similarity with membrane proteins e.g. AAK75541|SP1447 MEMBRANE PROTEIN from Streptococcus pneumoniae (298 aa), FASTA scores: opt: 139, E(): 0.21, (29.65% identity in 135 aa overlap); Q9K4C9|2SC6G5.26c PUTATIVE ABC TRANSPORTER INTEGRAL MEMBRANE SUBUNIT from Streptomyces coelicolor (249 aa), FASTA scores: opt: 138, E(): 0.21, (26.9% identity in 253 aa overlap); Q53627|MTRB MEMBRANE PROTEIN INVOLVED IN MITHRAMYCIN RESISTANCE from Streptomyces argillaceus (233 aa), FASTA scores: opt: 136, E(): 0.27, (26.7% identity in 191 aa overlap); etc. Protein product from Mb2705c detected using SWATH mass spectrometry. Mb2705c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1W1" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W1" /protein_id="SIU01323.1" /translation="MRAISSLAGPRALAAFGRNDIRGTYRDPLLVMLVIAPVIWTTGV ALLTPLFTEMLARRYGFDLVGYYPLILTAFLLLTSIIVAGALAAFLVLDDVDAGTMTA LRVTPVPLSVFFGYRAATVMVVTTIYVVATMSCSGILEPGLVSSLIPIGLVAGLSAVV TLLLILAVANNKIQGLAMVRALGMLIAGLPCLPWFISSNWNLAFGVLPPYWAAKAFWV ASDHGTWWPYLVGGAVYNLAIVWVLFRRFRAKHA" CDS complement(2965083..2965796) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2706C" /product="antibiotic-transport integral membrane leucine and valine rich protein abc transporter" /note="Mb2706c, -, len: 237 aa. Equivalent to Rv2687c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 237 aa overlap). Probable antibiotic-transport integral membrane leu-, val-rich protein ABC transporter (see citation below), showing some similarity with two other hypothetical proteins, BAB59668|TVG0517148 from Thermoplasma volcanium (241 aa), FASTA scores: opt: 136, E(): 0.32, (23.1% identity in 208 aa overlap); and Q97U55|SSO3168 from Sulfolobus solfataricus (249 aa), FASTA scores: opt: 136, E(): 0.33, (25.15% identity in 195 aa overlap). Has some hydrophobic stretches and contains bacterial regulatory proteins, araC family signature (PS00041). Mb2706c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1W6" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1W6" /protein_id="SIU01324.1" /translation="MTRLVPALRLELTLQVRQKFLHAAVFSGLIWLAVLLPMPVSLRP VAEPYVLVGDIAIIGFFFVGGTVFFEKQERTIGAIVSTPLRFWEYLAAKLTVLLAISL FVAVVVATIVHGLGYHLLPLVAGIVLGTLLMLLVGFSSSLPFASVTDWFLAAVIPLAI MLAPPVVHYSGLWPNPVLYLIPTQGPLLLLGAAFDQVSLAPWQVGYAVVYPIVCAAGL CRAAKALFGRYVVQRSGVL" CDS complement(2965793..2966698) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2707C" /product="antibiotic-transport atp-binding protein abc transporter" /note="Mb2707c, -, len: 301 aa. Equivalent to Rv2688c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 301 aa overlap). Probable antibiotic-transport ATP-binding protein ABC transporter (see citation below), highly similar to AAK47077|MT2762 ABC TRANSPORTER ATP-BINDING PROTEIN from Mycobacterium tuberculosis strain CDC1551 (317 aa), FASTA scores: opt: 1714, E(): 5.1e-93, (95.6% identity in 274 aa overlap). Also highly similar to other ATP-BINDING PROTEINS ABC TRANSPORTER e.g. Q9K639|BH3893 from Bacillus halodurans (282 aa), FASTA scores: opt: 644, E(): 1.4e-30, (38.% identity in 285 aa overlap); O58550|PH0820 from Pyrococcus horikoshii (312 aa), FASTA scores: opt: 574, E(): 1.8e-26, (39.1% identity in 307 aa overlap); Q9WYM0|TM0389 from Thermotoga maritima (301 aa), FASTA scores: opt: 536, E(): 2.9e-24, (36.1% identity in 291 aa overlap); etc. Has ATP/GTP-binding site motif A (P-loop) at N-terminus (PS00017). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb2707c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2707c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3X6" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X6" /protein_id="SIU01325.1" /translation="MTALNRAVASARVGTEVIRVRGLTFRYPKAAEPAVRGMEFTVGR GEIFGLLGPSGAGKSTTQKLLIGLLRDHGGQATVWDKEPAEWGPDYYERIGVSFELPN HYQKLTGYENLRFFASLYAGATADPMQLLAAVGLADDAHTLVGKYSKGMQMRLTFARS LINDPELLFLDEPTSGLDPVNARKIKDIIVDLKARGRTIFLTTHDMATADELCDRVAF VVDGRIVALDSPTELKIARSRRRVRVEYRGDGGGLETAEFGMDGLADDPAFHSVLRNH HVETIHSREASLDDVFVEVTGRQLT" CDS complement(2966893..2968110) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2708C" /product="conserved alanine and valine and glycine rich protein" /note="Mb2708c, -, len: 405 aa. Equivalent to Rv2689c, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 405 aa overlap). Conserved hypothetical ala-, val-, gly-rich protein, similar to O54099|SC10A5.06 HYPOTHETICAL 49.5 KDA PROTEIN from Streptomyces coelicolor (458 aa), FASTA scores: opt: 455, E(): 2.7e-20, (38.35% identity in 417 aa overlap); and shows weak similarity in part with several methyltransferases (EC 2.1.1.-) e.g. Q9X0H9|TM1094 PUTATIVE RNA METHYLTRANSFERASE from Thermotoga maritima (439 aa), FASTA scores: opt: 306, E(): 3e-11, (25.9% identity in 436 aa overlap); AK79403|CAC1435 S-ADENOSYLMETHIONINE-DEPENDENT METHYLTRANSFERASES from Clostridium acetobutylicum (456 aa), FASTA scores: opt: 294, E(): 1.6e-10, (23.4% identity in 449 aa overlap); Q9A8M7|CC1326 RNA METHYLTRANSFERASE from Caulobacter crescentus (415 aa), FASTA scores: opt: 247, E(): 1.1e-07, (28.4% identity in 433 aa overlap); etc. Equivalent to AAK47078 from Mycobacterium tuberculosis strain CDC1551 (434 aa) but shorter 29 aa. Other less probable starts possible. Protein product from Mb2708c detected using SWATH mass spectrometry. Mb2708c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Q7" /db_xref="InterPro:IPR002792" /db_xref="InterPro:IPR010280" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Q7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01326.1" /translation="MTRAGDDAVNLTLVTGAPANGGSCVAHHEGRVVFVRYALPGERV RARVTAQRGSYWHAEAFEVIDPSPDRIGSLCSIAGADGAGCCDLAFAAPEAARTLKAQ VVANQLERLGRHSWQGEAQPLSDAGPTGWRIRVRLDVGADRRPGFHRYHSGELVTDLD CGQLPVGMLDGLVAADWPPEAQLYVALDDDGERHVVCSVRQGPRNRTRTVTNVVEGAY HAHQRVHRRSWRVPVTAFWQAHRDAAAVYSDLIADWAQPAPGMTAWDLYGGAGVFAAV LGEAVGESGRVLTVDTSRLASGAARAALVDLPQVEVVTGSVRRVLAVQPAGADLAVLD PPRSGAGREVVDLLAGAGVPRLIHIGCEAASFARDIGLYRGHGYAVEKIKVFDAFPLT HYVECVALLTRKV" CDS complement(2968266..2970257) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2709C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE ALANINE AND VALINE AND LEUCINE RICH PROTEIN" /note="Mb2709c, -, len: 663 aa. Equivalent to Rv2690c, len: 657 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 657 aa overlap). Probable conserved integral membrane ala-, val-, leu-rich protein, highly similar to others e.g. O54098|SC10A5.05 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (691 aa), FASTA scores: opt: 2007, E(): 1.6e-116, (62.35% identity in 669 aa overlap); O69917|SC3C8.04c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (644 aa), FASTA scores: opt: 923, E(): 1.7e-49, (35.3% identity in 669 aa overlap); AAK78253|CAC0272 AMINO ACID TRANSPORTER from Clostridium acetobutylicum (620 aa), FASTA scores: opt: 674, E(): 4.1e-34, (36.55% identity in 640 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (t-c) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (663 aa versus 657 aa). Protein product from Mb2709c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2709c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y240" /db_xref="InterPro:IPR002293" /db_xref="UniProtKB/TrEMBL:A0A1R3Y240" /protein_id="SIU01327.1" /translation="MSKLSTAARRLLIGRPFRSDRLSHTLLPKRIALPVFASDAMSSI AYAPEEIFLVLSVAGLAAYSMAPLIGLAVAAVLLVVVSSYRQNVHAYPSGGGDYEVVT TNLGATGGLVVASALMVDYVLTVAVSISSAASNIGSVSPFVYEHKVLFAVGAIVLIMA MNLRGVRESGLAFAIPTYAFIAGIGTMLVWGLFRIFVLGNPVRAESAAFEMHAEHGQI VGFALVFLVARSFSSGCAALTGVEAISNGVPAFQKPKSRNAATTLLMLGIIAVSMFMG MIVLAVETGVQVVDDPDTQLTGAPPGYQQKTLVAQLAQAVFGGFYLGFLLIAAVTALI LVLAANTAFNGFPVLGSVLAQHSYLPRQLHTRGDRLAFSNGILFLAAAAIGAVVAFRA ELTALIQLYIVGVFISFTMSQVGMVRHWTRLLSAETDPRARRAMLRSRAVNTVGFVST GTVLLIVLVTKFLAGAWIAIVAMGGFFMMMKLIHRHYDAVNRELAEQAEEAEITLPSR NHAVVLVSKLHLPTLRALTYARATRPDVLEAVTVNVDDAETRELVRQWQDSDVSVPLK VIASPYREITRPVLDYVKRVSKESPRTVVTVFIPEYVVGRWWEQLLHNQSALRLKGRL LFMPGVMVTSVPWQLTSSERIKTLQPHAAPGDTRRGIFD" CDS 2970392..2971075 /codon_start=1 /transl_table=11 /gene="ceoB" /locus_tag="BQ2027_MB2710" /product="TRK SYSTEM POTASSIUM UPTAKE PROTEIN CEOB" /note="Mb2710, ceoB, len: 227 aa. Equivalent to Rv2691, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 227 aa overlap). ceoB (alternate gene name: trkA), TRK system potassium uptake protein (see citation below), highly similar to others e.g. Q53949|TRKA_STRCO|SC2E9.17c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 781, E(): 5.8e-42, (53.2% identity in 220 aa overlap); O27333|TRKA_METTH|MTH1265 from Methanobacterium thermoautotrophicum (216 aa), FASTA scores: opt: 287, E(): 5.3e-11, (27.0% identity in 211 aa overlap); O54141|SC2E9.16c from Streptomyces coelicolor (226 aa), FASTA scores: opt: 269, E(): 7.3e-10, (29.9% identity in 214 aa overlap); etc. Also similar to upstream orf O07194|CEOC|TRKA_MYCTU|TRKA|TRKB|Rv2692|MT2766|MTCY05A 6.13 TRK SYSTEM POTASSIUM UPTAKE PROTEIN from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 259, E(): 3e-09, (26.55% identity in 226 aa overlap). Contains a motif common to NAD+ binding pockets (see citation below). BELONGS TO THE TRKA FAMILY. Protein product from Mb2710 detected using shotgun mass spectrometry. Mb2710 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2B8" /db_xref="InterPro:IPR003148" /db_xref="InterPro:IPR006036" /db_xref="InterPro:IPR006037" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036721" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B8" /protein_id="SIU01328.1" /translation="MRVVVMGCGRVGASVADGLSRIGHEVAIIDRDSAAFNRLSPQFA GERVLGQGFDRDVLLRAGIQGADAFAAVSSGDNSNIISARLARETFGVPRVVARIYDA KRAEVYERLGIPTIATVPWTTDRLLNALMQDTETAKWRDPTGTVAVAEVVLHEDWVGH RATDLEQATGARIAFLIRFGTGVLPEPKTVLQAGDKVYIAAISGRAAEAAAIAALPPS EDFESGARR" CDS 2971072..2971734 /codon_start=1 /transl_table=11 /gene="ceoC" /locus_tag="BQ2027_MB2711" /product="TRK SYSTEM POTASSIUM UPTAKE PROTEIN CEOC" /note="Mb2711, ceoC, len: 220 aa. Equivalent to Rv2692, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 220 aa overlap). ceoC (alternate gene names: trkA and trkB), TRK system potassium uptake protein (see citation below), highly similar to others e.g. O54141|SC2E9.16c from Streptomyces coelicolor (226 aa), FASTA scores: opt: 870, E(): 9.4e-48, (58.8% identity in 216 aa overlap); Q58505|TRKA_METJA|MJ1105 from Methanococcus jannaschii (218 aa), FASTA scores: opt: 361, E(): 9.7e-16, (29.8% identity in 218 aa overlap); O27333|TRKA_METTH|MTH1265 from Methanobacterium thermoautotrophicum (216 aa), FASTA scores: opt: 326, E(): 1.5e-13, (30.1% identity in 216 aa overlap); etc. Also similar to downstream orf O07193|CEOB|TRKA|Rv2691|MTCY05A6.12 TRK SYSTEM POTASSIUM UPTAKE PROTEIN from Mycobacterium tuberculosis (227 aa), FASTA scores: opt: 259, E(): 2.6e-09, (26.55% identity in 226 aa overlap). Contains a motif common to NAD+ binding pockets (see citation below). BELONGS TO THE TRKA FAMILY. Protein product from Mb2711 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2711 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1X0" /db_xref="InterPro:IPR003148" /db_xref="InterPro:IPR006036" /db_xref="InterPro:IPR006037" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036721" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X0" /protein_id="SIU01329.1" /translation="MKVAVAGAGAVGRSVTRELVENGHDITLIERNPDHLDAAAIPEA HWRLGDACELSLLESIHLEEFDVVVAATGDDKVNVVLSLLAKTEFAVPRVVARVNDPR NEWLFNDAWGVDVAVSTPRMLASLIEEAVTVGDLVRLMEFRTGQANLVEITLPDNTPW GGKPVRKLQLPRDAALVTILRGPRVIVPEADEPLEGGDELLFVAVTEAEEELSRLLLP SM" CDS complement(2971745..2972416) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2712C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE ALANINE AND LEUCINE RICH PROTEIN" /note="Mb2712c, -, len: 223 aa. Equivalent to Rv2693c, len: 223 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 223 aa overlap). Probable conserved integral membrane ala-, leu-rich protein, showing some similarity to O54140|SC2E9.15 HYPOTHETICAL 29.6 KDA PROTEIN from Streptomyces coelicolor (272 aa), FASTA scores: opt: 212, E(): 4.3e-06, (23.5% identity in 247 aa overlap). Mb2712c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1X1" /db_xref="InterPro:IPR016566" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X1" /protein_id="SIU01330.1" /translation="MNANRTSAQRLLAQAGGVSGLVYSSLPVVTFVVASSAAGLLPAI GFALSMAGLILLWRLLRRESARPVVAGFCGVAVCALIAYLVGQSKGYFLLGIWMSLLW AVVFTLSILIRRPIVGYLWSWLSGRDRAWRDVSRAVFAFDVATLGWTLVFAARFIVQR HLYDADKTGWLGVARIGMGWPLTALAALATYAAIKAAQRAILASHDAAAVGGAAEFDA DAGRE" CDS complement(2972447..2972815) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2713C" /product="OB-fold nucleic acid binding protein" /note="Mb2713c, -, len: 122 aa. Equivalent to Rv2694c, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 122 aa overlap). Conserved hypothetical protein, highly similar in part to SC2E9.14 HYPOTHETICAL 16.9 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 299, E(): 1.9e-13, (41.05% identity in 117 aa overlap. Equivalent to AAK47083 from Mycobacterium tuberculosis strain CDC1551 (157 aa) but shorter 35 aa. Protein product from Mb2713c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2713c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR016499" /db_xref="InterPro:IPR033454" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X3" /protein_id="SIU01331.1" /translation="MGAQGYLRRLTRRLTEDLEQRDVEELSDEVLNAGAQRAIDCQRG QEVTVVGTLRSVETNGKGCSGGVSAELFDGSDTVTLVWLGQRRIPGIDTGRTLRVRGR LGKLENGTKAIYNPHYEIQR" CDS 2972964..2973671 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2714" /product="Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)" /note="Mb2714, -, len: 235 aa. Equivalent to Rv2695, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 235 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q49994|ML1030|U1764L HYPOTHETICAL PROTEIN from Mycobacterium leprae (232 aa), FASTA scores: opt: 1166, E(): 6.3e-63, (76.95% identity in 230 aa overlap). Also shows some similarity with other hypothetical proteins e.g. Q986S2|MLR7232 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (277 aa), FASTA scores: opt: 150, E(): 0.059, (33.55% identity in 173 aa overlap); CAC47772|SMC03810 HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (269 aa), FASTA scores: opt: 143, E(): 0.15, (28.05% identity in 228 aa overlap); Q9A5N6|CC2411 3-OXOADIPATE ENOL-LACTONE HYDROLASE/4-CARBOXYMUCONOLACTONE DECARBOXYLASE from Caulobacter crescentus (393 aa), FASTA scores: opt: 138, E(): 0.41, (26.45% identity in 238 aa overlap); etc. Protein product from Mb2714 detected using SWATH mass spectrometry. Mb2714 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y2" /protein_id="SIU01332.1" /translation="MAVDLDGVTTVLLPGTGSDNDYVRRAFSAPLRRAGAVLVTPVPH PGRLIDGYRAALDDAARDGPVVVGGVSLGAAVAAAWALEHPDRAVAVLAALPAWTGEP ELAPAAQAARYTAARLRCDGLAATTTRMRASSPVWLAEELTRSWRVQWPELPDAMEEA AAYVAPSRAELARLVAPLAVAAAVDDPIHPLQVAADWVSVAPHAALRTVTLDEIGADA AALGSACLAALAEVSGA" CDS complement(2973877..2974656) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2715C" /product="conserved alanine and glycine and valine rich protein" /note="Mb2715c, -, len: 259 aa. Equivalent to Rv2696c, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 259 aa overlap). Conserved hypothetical ala-, gly-, val-rich protein, equivalent (but shorter 18 aa) to Q49993|ML1029|U1764K HYPOTHETICAL PROTEIN from Mycobacterium leprae (273 aa), FASTA scores: opt: 1174, E(): 2.1e-63, (70.6% identity in 262 aa overlap). Also similar to O54135|SC2E9.10 from Streptomyces coelicolor (250 aa), FASTA scores: opt: 213, E(): 9.8e-06, (28.25% identity in 255 aa overlap); and showing weak similarity with other proteins. Protein product from Mb2715c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2715c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR022183" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X2" /protein_id="SIU01333.1" /translation="MAFGRRTGKDGGKRKAGHAPVQPADEHVRPEDTVVASAAAASGV EDQEELQGPFDIDDFDDPSVAVLARLDLGSVLIPMPAAGQVQVELTESGVPSAVWVIT PNGRYSIAAYAAPKTGGLWREVAGELADSLRKDSAKVSIKDGPWGREVIGIAAGVVRF IGVNGYRWMIRCVVNGPQETVDALTEEAREALADTVVRRGDTPLPVRTPLPVHLPEPM AAQLREAAAAQADTQRQAAAGVARRGAQGSAMQQLRSTTGG" CDS complement(2974731..2975195) /codon_start=1 /transl_table=11 /gene="dut" /locus_tag="BQ2027_MB2716C" /product="probable deoxyuridine 5'-triphosphate nucleotidohydrolase dut (dutpase) (dutp pyrophosphatase) (deoxyuridine 5'-triphosphatase) (dutp diphosphatase) (deoxyuridine-triphosphatase)" /note="Mb2716c, dut, len: 154 aa. Equivalent to Rv2697c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Probable dut, deoxyuridine 5'-triphosphate nucleotidohydrolase (EC 3.6.1.23), equivalent to Q49992|DUT_MYCLE|ML1028 DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDOHYDROLASE from Mycobacterium leprae (154 aa), FASTA scores: opt: 928, E(): 2.1e-51, (90.25% identity in 154 aa overlap). Also highly similar to others e.g. O54134|DUT_STRCO|SC2E9.09 from Streptomyces coelicolor (183 aa), FASTA scores: opt: 534, E(): 1.2e-26, (56.1% identity in 148 aa overlap); O66592|DUT_AQUAE|AQ_220 from Aquifex aeolicus (150 aa), FASTA scores: opt: 398, E(): 3.3e-18, (48.05% identity in 152 aa overlap); Q9X3X5|DUT_ZYMMO from Zymomonas mobilis (146 aa), FASTA scores: opt: 396, E(): 4.4e-18, (49.0% identity in 147 aa overlap); etc. BELONGS TO THE DUTPASE FAMILY. Protein product from Mb2716c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2716c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A553" /db_xref="InterPro:IPR008181" /db_xref="InterPro:IPR029054" /db_xref="InterPro:IPR033704" /db_xref="InterPro:IPR036157" /db_xref="UniProtKB/Swiss-Prot:P0A553" /protein_id="SIU01334.1" /translation="MSTTLAIVRLDPGLPLPSRAHDGDAGVDLYSAEDVELAPGRRAL VRTGVAVAVPFGMVGLVHPRSGLATRVGLSIVNSPGTIDAGYRGEIKVALINLDPAAP IVVHRGDRIAQLLVQRVELVELVEVSSFDEAGLASTSRGDGGHGSSGGHASL" CDS 2975221..2975706 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2717" /product="PROBABLE CONSERVED ALANINE RICH TRANSMEMBRANE PROTEIN" /note="Mb2717, -, len: 161 aa. Equivalent to Rv2698, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Probable conserved ala-rich transmembrane protein, equivalent to Q49991|ML1027|U1764I POSSIBLE MEMBRANE PROTEIN from Mycobacterium leprae (157 aa), FASTA scores: opt: 886, E(): 1.1e-49, (78.9% identity in 161 aa overlap). Also similar to O54132|SC2E9.07c HYPOTHETICAL 16.5 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 230, E(): 7.1e-08, (35.7% identity in 154 aa overlap). Protein product from Mb2717 detected using SWATH mass spectrometry. Mb2717 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3X7" /db_xref="InterPro:IPR021443" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X7" /protein_id="SIU01335.1" /translation="MSGTRLAPHSVRYRERLWVPWWWWPLAFALAALIAFEVNLGVAA LPDWVPFATLFTVAAGTLLWLGRVEIRVTAGSADGAGVKLWAGPAHLPVAVIARSAEI PATAKSAALGRQLDPAAYVLHRAWVGPMVLVVLDDPNDPTPYWLVSCRHPERVLSALR S" CDS complement(2975711..2976013) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2718C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2718c, -, len: 100 aa. Equivalent to Rv2699c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). Conserved hypothetical protein, very equivalent to Q49990|ML1026|U1764J HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 632, E(): 7.7e-36, (96.0% identity in 100 aa overlap). Also highly similar to O54130|SC2E9.05 HYPOTHETICAL 11.0 KDA PROTEIN from Streptomyces coelicolor (98 aa), FASTA scores: opt: 465, E(): 1.1e-24, (71.45% identity in 98 aa overlap). Protein product from Mb2718c detected using shotgun mass spectrometry. Mb2718c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025242" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R7" /protein_id="SIU01336.1" /translation="MPTDYDAPRRTETDDVSEDSLEELKARRNEAASAVVDVDESESA ESFELPGADLSGEELSVRVVPKQADEFTCSSCFLVQHRSRLASEKNGVMICTDCAA" CDS 2976251..2976901 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2719" /product="possible conserved secreted alanine rich protein" /note="Mb2719, -, len: 216 aa. Equivalent to Rv2700, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). Possible secreted ala-rich protein, equivalent to Q4998|ML1025|U1764H POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (216 aa), FASTA scores: opt: 1198, E(): 1.2e-65, (82.4% identity in 216 aa overlap). Also showing some similarity with Q9AK75|2SCD60.08c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (204 aa), FASTA scores: opt: 193, E(): 8.9e-05, (31.25% identity in 192 aa overlap). Protein product from Mb2719 detected using shotgun mass spectrometry. Mb2719 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y250" /db_xref="InterPro:IPR027381" /db_xref="UniProtKB/TrEMBL:A0A1R3Y250" /protein_id="SIU01337.1" /translation="MVAQITEGTAFDKHGRPFRRRNPRPAIVVVAFLVVVTCVMWTLA LTRPPDVREAAVCNPPPQPAGSAPTNLGEQVSRTDMTDVAPAKLSDTKVHVLNASGRG GQAADIAGALQDLGFAQPTAANDPIYAGTRLDCQGQIRFGTAGQATAAALWLVAPCTE LYHDSRADDSVDLALGTDFTTLAHNDDIDAVLANLRPGATEPSDPALLAKIHANSC" CDS complement(2976911..2977783) /codon_start=1 /transl_table=11 /gene="suhB" /locus_tag="BQ2027_MB2720C" /product="inositol-1-monophosphatase suhb" /note="Mb2720c, suhB, len: 290 aa. Equivalent to Rv2701c, len: 290 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 290 aa overlap). Possible suhB, extragenic suppressor protein, equivalent to P46813|SUHB_MYCLE|SUHB|SSYA|ML1024 EXTRAGENIC SUPPRESSOR PROTEIN from Mycobacterium leprae (291 aa), FASTA scores: opt: 1424, E(): 4.9e-78, (77.55% identity in 294 aa overlap). Similar (except at N-terminus) to others e.g. O54128|SUHB from Streptomyces coelicolor (209 aa), FASTA scores: opt: 560, E(): 1.7e-26, (46.95% identity in 213 aa overlap); Q9CNV8|SUHB|PM0315 from Pasteurella multocida (267 aa), FASTA scores: opt: 479, E(): 1.5e-21, (39.3% identity in 234 aa overlap); P44333|SUHB_HAEIN|HI0937 from Haemophilus influenzae (267 aa), FASTA scores: opt: 438, E(): 4.1e-19, (34.7% identity in 248 aa overlap); P22783|SUHB_ECOLI|SSYA|B2533 from Escherichia coli strain K12 (267 aa), FASTA scores: opt: 419, E(): 5.7e-18, (34.45% identity in 267 aa overlap); etc. And also similar to putative myo-inositol-1(or 4)-monophosphatases e.g. Q9S1M1|SPCA from Streptoverticillium netropsis (Streptoverticillium flavopersicus) (266 aa), FASTA scores: opt: 556, E(): 3.6e-26, (45.4% identity in 240 aa overlap); Q9S3X5|SPCA from Streptomyces spectabilis (264 aa), FASTA scores: opt: 502, E(): 6.1e-23, (46.05% identity in 265 aa overlap); CAC47357 from Rhizobium meliloti (Sinorhizobium meliloti) (266 aa), FASTA scores: opt: 452, E(): 6e-20, (38.5% identity in 244 aa overlap); etc. Equivalent to AAK47090 from Mycobacterium tuberculosis strain CDC1551 (277 aa) but longer 13 aa. Contains PS00630 Inositol monophosphatase family signatures 1 and 2 (PS00629 and PS00630). BELONGS TO THE INOSITOL MONOPHOSPHATASE FAMILY. Protein product from Mb2720c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2720c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65166" /db_xref="InterPro:IPR000760" /db_xref="InterPro:IPR020550" /db_xref="InterPro:IPR020583" /db_xref="InterPro:IPR033942" /db_xref="UniProtKB/Swiss-Prot:P65166" /protein_id="SIU01338.1" /translation="MTRPDNEPARLRSVAENLAAEAAAFVRGRRAEVFGISRAGDGDG AVRAKSSPTDPVTVVDTDTERLLRDRLAQLRPGDPILGEEGGGPADVTATPSDRVTWV LDPIDGTVNFVYGIPAYAVSIGAQVGGITVAGAVADVAARTVYSAATGLGAHLTDERG RHVLRCTGVDELSMALLGTGFGYSVRCREKQAELLAHVVPLVRDVRRIGSAALDLCMV AAGRLDAYYEHGVQVWDCAAGALIAAEAGARVLLSTPRAGGAGLVVVAAAPGIADELL AALQRFNGLEPIPD" CDS 2977906..2978703 /codon_start=1 /transl_table=11 /gene="ppgK" /locus_tag="BQ2027_MB2721" /product="POLYPHOSPHATE GLUCOKINASE PPGK (POLYPHOSPHATE-GLUCOSE PHOSPHOTRANSFERASE)" /note="Mb2721, ppgK, len: 265 aa. Equivalent to Rv2702, len: 265 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 265 aa overlap). ppgK, polyphosphate glucokinase (EC 2.7.1.2) (see citation below), equivalent, but shorter 60 aa, to Q49988|PPGK_MYCLE|ML1023|U1764FG POLYPHOSPHATE GLUCOKINASE from Mycobacterium leprae (324 aa), FASTA scores: opt: 1411, E(): 5.6e-80, (82.8% identity in 262 aa overlap). Also highly similar (or just similar) to others e.g. Q9ADE8|PPGK from Streptomyces coelicolor (246 aa), FASTA scores: opt: 912, E(): 3e-49, (57.3% identity in 239 aa overlap); Q9AGV8|PPGK from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (277 aa), FASTA scores: opt: 890, E(): 7.5e-48, (57.75% identity in 239 aa overlap); P40184|GLK_STRCO|SC6E10.20c from Streptomyces coelicolor (317 aa), FASTA scores: opt: 233, E(): 3.2e-07, (31.3% identity in 163 aa overlap); etc. Protein product from Mb2721 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2721 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y1Y1" /db_xref="InterPro:IPR000600" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y1" /protein_id="SIU01339.1" /translation="MTSTGPETSETPGATTQRHGFGIDVGGSGIKGGIVDLDTGQLIG DRIKLLTPQPATPLAVAKTIAEVVNGFGWRGPLGVTYPGVVTHGVVRTAANVDKSWIG TNARDTIGAELGGQQVTILNDADAAGLAETRYGAGKNNPGLVVLLTFGTGIGSAVIHN GTLIPNTEFGHLEVGGKEAEERAASSVKEKNDWTYPKWAKQVTRVLIAIENAIWPDLF IAGGGISRKADKWVPLLENRTPVVPAALQNTAGIVGAAMASVADTTH" CDS 2978883..2980469 /codon_start=1 /transl_table=11 /gene="sigA" /locus_tag="BQ2027_MB2722" /standard_name="mysA; rpoV; rpoD" /product="RNA POLYMERASE SIGMA FACTOR SIGA (SIGMA-A)" /note="Mb2722, sigA, len: 528 aa. Equivalent to Rv2703, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 528 aa overlap). sigA (formerly named mysA, and also known as rpoV or rpoD), RNA polymerase sigma factor (see citations below), equivalent (but shorter 55 aa) to Q9S5K3|RPOT (alias Q59532) RNA POLYMERASE SIGMA FACTOR from Mycobacterium leprae (576 aa), FASTA scores: opt: 2638, E(): 8.6e-115, (80.35% identity in 535 aa overlap). Also similar to others e.g. Q59552|MYSA from Mycobacterium smegmatis (466 aa), FASTA scores: opt: 2259, E(): 2.3e-97, (76.5% identity in 528 aa overlap); Q45302|SIGA from Corynebacterium glutamicum (Brevibacterium flavum) (497 aa), FASTA scores: opt: 1972, E(): 4.3e-84, (67.35% identity in 505 aa overlap); Q59813|HRDB from Streptomyces aureofaciens (525 aa), FASTA scores: opt: 1654, E(): 2.1e-69, (67.5% identity in 468 aa overlap); etc. Contains sigma-70 family signatures 1 and 2 (PS00715 and PS00716). BELONGS TO THE SIGMA-70 FACTOR FAMILY. Protein product from Mb2722 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2722 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A603" /db_xref="InterPro:IPR000943" /db_xref="InterPro:IPR007624" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR007630" /db_xref="InterPro:IPR009042" /db_xref="InterPro:IPR012760" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR028630" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/Swiss-Prot:P0A603" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01340.1" /translation="MAATKASTATDEPVKRTATKSPAASASGAKTGAKRTAAKSASGS PPAKRATKPAARSVKPASAPQDTTTSTIPKRKTRAAAKSAAAKAPSARGHATKPRAPK DAQHEAATDPEDALDSVEELDAEPDLDVEPGEDLDLDAADLNLDDLEDDVAPDADDDL DSGDDEDHEDLEAEAAVAPGQTADDDEEIAEPTEKDKASGDFVWDEDESEALRQARKD AELTASADSVRAYLKQIGKVALLNAEEEVELAKRIEAGLYATQLMTELSERGEKLPAA QRRDMMWICRDGDRAKNHLLEANLRLVVSLAKRYTGRGMAFLDLIQEGNLGLIRAVEK FDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLGRIQRELLQDLGR EPTPEELAKEMDITPEKVLEIQQYAREPISLDQTIGDEGDSQLGDFIEDSEAVVAVDA VSFTLLQDQLQSVLDTLSEREAGVVRLRFGLTDGQPRTLDEIGQVYGVTRERIRQIES KTMSKLRHPSRSQVLRDYLD" CDS 2980506..2980934 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2723" /product="RidA/YER057c/UK114 superfamily, group 6" /note="Mb2723, -, len: 142 aa. Equivalent to Rv2704, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Conserved hypothetical protein, highly similar (but shorter 25 aa) to Q9RYB7|DR0033 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (157 aa), FASTA scores: opt: 381, E(): 1.5e-17, (54.85% identity in 124 aa overlap); and highly similar to various proteins e.g. CAC47758|SMC03796 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (126 aa), FASTA scores: opt: 302, E(): 1.4e-12, (46.6% identity in 126 aa overlap); Q98E55|MLL4402 from Rhizobium loti (Mesorhizobium loti) (130 aa), FASTA scores: opt: 252, E(): 2.1e-09, (40.15% identity in 127 aa overlap); Q9K3V5|SCD10.21 PUTATIVE ACETYLTRANSFERASE from Streptomyces coelicolor (291 aa), FASTA scores: opt: 247, E(): 8.7e-09, (41.3% identity in 138 aa overlap) (homology only in N-terminal region); etc. Mb2723 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR006175" /db_xref="InterPro:IPR035959" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y3" /protein_id="SIU01341.1" /translation="MSASRTMVSSGSEFESAVGYSRAVRIGPLVVVAGTTGSGDDIAA QTRDALRRIEIALGQAGATLADVVRTRIYVTDISRWREVGEVHAQAFGKIRPVTSMVE VTALIAPGLLVEIEADAYVGSAVADRNSGAGPKDPSPAGG" CDS complement(2980862..2981251) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2724C" /product="Glutathione S-transferase domain protein" /note="Mb2724c, -, len: 129 aa. Equivalent to Rv2705c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, similar to others e.g. Q9RXR5|DR0242 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (112 aa), FASTA scores: opt: 259, E(): 9.4e-10, (40.5% identity in 116 aa overlap); CAC45122|SMC02246 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (115 aa), FASTA scores: opt: 208, E(): 1.6e-06, (38.3% identity in 107 aa overlap); Q98B88|MLL5682 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (116 aa), FASTA scores: opt: 173, E(): 0.00026, (34.95% identity in 103 aa overlap); etc. Protein product from Mb2724c detected using SWATH mass spectrometry. Mb2724c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009297" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Z2" /protein_id="SIU01342.1" /translation="MRMTPDPAMLVHLCGVQEWSHARERGGIYPESDKTGYIHLSTLE QVHLPANRLYRGRADLVLLYIDPAALDSPVRWEPGVPTDPRSMLFPHLYGPLPVRAVI GAAAYPPAGDGSFGPAPEFRSATADPT" CDS complement(2981248..2981505) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2725C" /product="HYPOTHETICAL PROTEIN" /note="Mb2725c, -, len: 85 aa. Equivalent to Rv2706c, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1X8" /protein_id="SIU01343.1" /translation="MLVGVMLAEKKLGSGGQLGAHPSCSATAVAAVCSSQLRTGQSCV HGSPFSGIFTFSDVRGSRRVPRPLSGVSFLTTFAPANRAGW" CDS 2981621..2982595 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2726" /product="PROBABLE CONSERVED TRANSMEMBRANE ALANINE AND LEUCINE RICH PROTEIN" /note="Mb2726, -, len: 324 aa. Equivalent to Rv2707, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 324 aa overlap). Probable conserved transmembrane ala-, leu-rich protein, equivalent to Q49985|ML1017|U1764D POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (330 aa), FASTA scores: opt: 1617, E(): 2.5e-91, (75.4% identity in 325 aa overlap). Also similar to other membrane proteins e.g. Q9ADF6|SCBAC1A6.31 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (344 aa), FASTA scores: opt: 593, E(): 5.9e-29, (36.2% identity in 268 aa overlap); Q99SZ8|SA1699 HYPOTHETICAL PROTEIN (similar to transporter) from Staphylococcus aureus subsp. aureus N315 (405 aa), FASTA scores: opt: 318, E(): 3.7e-12, (27.9% identity in 265 aa overlap); O34437|YFKH HYPOTHETICAL PROTEIN (similar to transporter) from Bacillus subtilis (275 aa), FASTA scores: opt: 309, E(): 9.7e-12, (29.3% identity in 263 aa overlap); etc. Mb2726 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y1Y8" /db_xref="InterPro:IPR017039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Y8" /protein_id="SIU01344.1" /translation="MSDQVPKPHRHHIWRITRRTLSKSWDDSIFSESAQAAFWSALSL PPLLLGMLGSLAYVAPLFGPDTLPAIEKSALSTAHSFFSPSVVNEIIEPTIGDITNNA RGEVASLGFLISLWAGSSAISAFVDAVVEAHDQTPLRHPVRQRFFALFLYVVMLVFLV ATAPVMVVGPRKVSEHIPESLANLLRYGYYPALILGLTVGVILLYRVALPVPLPTHRL VLGAVLAIAVFLIATLGLRVYLAWITRTGYTYGALATPIAFLLFAFFGGFAIMLGAEL NAAVQEEWPAPATHAHRLGNWLKARIGVGTTTYSSTAQHSAVAAEPPS" CDS complement(2982596..2982844) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2727C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2727c, -, len: 82 aa. Equivalent to Rv2708c, len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Conserved hypothetical protein, equivalent (but shorter 25 aa) to Q49984|ML1016|U1764C HYPOTHETICAL PROTEIN from Mycobacterium leprae (107 aa), FASTA scores: opt: 492, E(): 7.3e-27, (87.8% identity in 82 aa overlap). Also highly similar to Q9L1U7|SCE59.06c HYPOTHETICAL 10.4 KDA PROTEIN from Streptomyces coelicolor (97 aa), FASTA scores: opt: 200, E(): 4.4e-07, (51.6% identity in 62 aa overlap). Protein product from Mb2727c detected using shotgun mass spectrometry. Mb2727c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021400" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y9" /protein_id="SIU01345.1" /translation="MSGMQTQTIERTDADERVDDGTGSDTPKYFHYVKKDKIAESAVM GSHVVALCGEVFPVTRAPKPGSPVCPDCKRIYDTLKKG" CDS 2982887..2983333 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2728" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2728, -, len: 148 aa. Equivalent to Rv2709, len: 148 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 148 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CCB4|ML1015 (alias Q49983|U1764B but extended in N-terminus) POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (139 aa), FASTA scores: opt: 578, E(): 5.5e-31, (70.75% identity in 123 aa overlap). Shows also similarity with Q9RJ48|SCI8.05 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (159 aa), FASTA scores: opt: 119, E(): 0.57, (31.95% identity in 119 aa overlap). Protein product from Mb2728 detected using SWATH mass spectrometry. Mb2728 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2S7" /db_xref="InterPro:IPR021449" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2S7" /protein_id="SIU01346.1" /translation="MWDSRVMKHGLRLGFNGQFDDFDDFDDKGRPVLISAAAPSYEVE HRTRVRKYLTLMAFRVPALILAAIAYGAWHNGLISLLIVAASVPLPWMAVLIANDRPP RRADEPRRFDVARRRIPLFPTAERPALEPRRQPAERSAPRGFADHG" CDS 2983509..2984480 /codon_start=1 /transl_table=11 /gene="sigB" /locus_tag="BQ2027_MB2729" /standard_name="mysB" /product="RNA POLYMERASE SIGMA FACTOR SIGB" /note="Mb2729, sigB, len: 323 aa. Equivalent to Rv2710, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). sigB (formerly known as mysB), RNA polymerase sigma factor (see citations below), equivalent to Q59531|ML1014 RNA POLYMERASE SIGMA FACTOR from Mycobacterium leprae (319 aa), FASTA scores: opt: 1935, E(): 1.9e-109, (96.2% identity in 316 aa overlap). Also highly similar to others e.g. Q59553|MYSB from Mycobacterium smegmatis (319 aa), FASTA scores: opt: 1874, E(): 9.1e-106, (92.4% identity in 316 aa overlap); Q9ANT6|SIGB from Brevibacterium flavum (331 aa), FASTA scores: opt: 1525, E(): 9.9e-85, (78.9% identity in 303 aa overlap); Q60158|RPOV from Mycobacterium bovis (528 aa), FASTA scores: opt: 1246, E(): 9.3e-68, (62.85% identity in 315 aa overlap); etc. Contains sigma-70 factors family signatures 1 and 2 (PS00715 and PS00716). And contains possible helix-turn-helix motif at aa 282-303 (Score 1887, +5.61 SD). BELONGS TO THE SIGMA-70 FACTOR FAMILY. Protein product from Mb2729 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2729 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y258" /db_xref="InterPro:IPR000943" /db_xref="InterPro:IPR007624" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR007630" /db_xref="InterPro:IPR009042" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y258" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01347.1" /translation="MADAPTRATTSRVDSDLDAQSPAADLVRVYLNGIGKTALLNAAG EVELAKRIEAGLYAEHLLETRKRLGENRKRDLAAVVRDGEAARRHLLEANLRLVVSLA KRYTGRGMPLLDLIQEGNLGLIRAMEKFDYTKGFKFSTYATWWIRQAITRGMADQSRT IRLPVHLVEQVNKLARIKREMHQHLGREATDEELAAESGIPIDKINDLLEHSRDPVSL DMPVGSEEEAPLGDFIEDAEAMSAENAVIAELLHTDIRSVLATLDEREHQVIRLRFGL DDGQPRTLDQIGKLFGLSRERVRQIERDVMSKLRHGERADRLRSYAS" CDS 2984613..2985305 /codon_start=1 /transl_table=11 /gene="ideR" /locus_tag="BQ2027_MB2730" /standard_name="dtxR" /product="IRON-DEPENDENT REPRESSOR AND ACTIVATOR IDER" /note="Mb2730, ideR, len: 230 aa. Equivalent to Rv2711, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). ideR (formerly known as dtxR), iron dependent repressor and activator (see citations below), equivalent to Q9CCB5|ML1013 IRON DEPENDENT REPRESSOR from Mycobacterium leprae (230 aa), FASTA scores: opt: 1365, E(): 3.8e-77, (90.0% identity in 230 aa overlap). Also highly similar to others e.g. Q50379|DTXR from Mycobacterium smegmatis (233 aa), FASTA scores: opt: 1291, E(): 1.4e-72, (86.1% identity in 230 aa overlap); Q9F7T3|IDER from Corynebacterium equii (Rhodococcus equi) (230 aa), FASTA scores: opt: 1130, E(): 1.2e-62, (74.8% identity in 230 aa overlap); P33120|DTXR_CORDI from Corynebacterium diphtheriae (226 aa), FASTA scores: opt: 803, E(): 1.6e-42, (57.85% identity in 230 aa overlap); etc. BELONGS TO THE FUR FAMILY. Protein product from Mb2730 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2730 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A673" /db_xref="InterPro:IPR001367" /db_xref="InterPro:IPR007167" /db_xref="InterPro:IPR008988" /db_xref="InterPro:IPR022687" /db_xref="InterPro:IPR022689" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR036421" /db_xref="InterPro:IPR038157" /db_xref="UniProtKB/Swiss-Prot:P0A673" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01348.1" /translation="MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQT VSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEAC RWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLVELGVGPEPGADDANLVRLTELPAG SPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLP HEMAHAVKVEKV" CDS complement(2985318..2986376) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2731C" /product="Heme binding protein" /note="Mb2731c, -, len: 352 aa. Equivalent to Rv2712c, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Hypothetical unknown ala-, leu-rich protein. Mb2731c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025447" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Z1" /protein_id="SIU01349.1" /translation="MTKYRGQFELNRPATLIAALPAILGFVPEKSLVLVSLAAGELGS VMRADLCDELADRVGHLAELVAAANPAAAIAVIVDANGAQCPRCNEEYRQLCAALAAA LSQRDIVLWAAHVVDRVAAGGRWHCVDGCGCSGVIDDPSASPLAMAAVLDGRQLYPRR SDLQAVIAVDDPVRSAELAVALGHQAADREIAHRADSVGCSRQDVENALAAAARVADG QSLSDTELARLGCALGDARVRDMLYALAVGENAGAAESLWALLARVLPEPWRVEALVL LAFSAYARGDGPLAGVSLQAALCCEPGHRMAGMLDTALQSGLRPEHIRDIAVTGYQRA EQLGIRLPPRRAFGQRAG" CDS 2986489..2987895 /codon_start=1 /transl_table=11 /gene="sthA" /locus_tag="BQ2027_MB2732" /product="PROBABLE SOLUBLE PYRIDINE NUCLEOTIDE TRANSHYDROGENASE STHA (STH) (NAD(P)(+) TRANSHYDROGENASE [B-SPECIFIC]) (NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE)" /note="Mb2732, sthA, len: 468 aa. Equivalent to Rv2713, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 468 aa overlap). Probable sthA, soluble pyridine nucleotide transhydrogenase (EC 1.6.1.1), highly similar to others e.g. Q983E2|MLR8366 from Rhizobium loti (Mesorhizobium loti) (481 aa), FASTA scores: opt: 1447, E(): 4.1e-78, (49.55% identity in 460 aa overlap); P27306|STHA_ECOLI|STH|UDHA|B3962 from Escherichia coli strain K12 (465 aa), FASTA scores: opt: 1267, E(): 1.7e-67, (43.05% identity in 462 aa overlap); O05139|STHA_PSEFL|STH from Pseudomonas fluorescens (463 aa), FASTA scores: opt: 1257, E(): 6.6e-67, (43.8% identity in 461 aa overlap); etc. Also highly similar to CAC46308|SMC00300 PUTATIVE OXIDOREDUCTASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (467 aa), FASTA scores: opt: 1466, E(): 3e-79, (49.55% identity in 462 aa overlap). Shows some similarity to MTCY359.04, E(): 3.1e-08; MTCY210.05, E(): 3.4e-08. Contains ATP/GTP-binding site motif A (P-loop; PS00017). BELONGS TO THE PYRIDINE NUCLEOTIDE-DISULFIDE OXIDOREDUCTASES CLASS-I. COFACTOR: FAD (BY SIMILARITY). Protein product from Mb2732 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2732 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66007" /db_xref="InterPro:IPR001100" /db_xref="InterPro:IPR004099" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR022962" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P66007" /protein_id="SIU01350.1" /translation="MREYDIVVIGSGPGGQKAAIASAKLGKSVAIVERGRMLGGVCVN TGTIPSKTLREAVLYLTGMNQRELYGASYRVKDRITPADLLARTQHVIGKEVDVVRNQ LMRNRVDLIVGHGRFIDPHTILVEDQARREKTTVTGDYIIIATGTRPARPSGVEFDEE RVLDSDGILDLKSLPSSMVVVGAGVIGIEYASMFAALGTKVTVVEKRDNMLDFCDPEV VEALKFHLRDLAVTFRFGEEVTAVDVGSAGTVTTLASGKQIPAETVMYSAGRQGQTDH LDLHNAGLEVQGRGRIFVDDRFQTKVDHIYAVGDVIGFPALAATSMEQGRLAAYHAFG EPTDGITELQPIGIYSIPEVSYVGATEVELTKSSIPYEVGVARYRELARGQIAGDSYG MLKLLVSTEDLKLLGVHIFGTSATEMVHIGQAVMGCGGSVEYLVDAVFNYPTFSEAYK NAALDVMNKMRALNQFRR" CDS 2988113..2989087 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2733" /product="conserved alanine and leucine rich protein" /note="Mb2733, -, len: 324 aa. Equivalent to Rv2714, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 324 aa overlap). Conserved hypothetical ala-, leu-rich protein, equivalent to Q49847|ML1009|B2235_F1_6 HYPOTHETICAL PROTEIN from Mycobacterium leprae (326 aa), FASTA scores: opt: 1881, E(): 5.8e-107, (89.7% identity in 320 aa overlap); and similar to Q49797|MLCB2533.03c|B2126_F1_36 HYPOTHETICAL PROTEIN from Mycobacterium leprae (317 aa), FASTA scores: opt: 376, E(): 1.2e-15, (30.1% identity in 279 aa overlap); and Q9CC38|ML1306 HYPOTHETICAL PROTEIN from Mycobacterium leprae (274 aa), FASTA scores: opt: 367, E(): 3.6e-15, (29.8% identity in 275 aa overlap). Also highly similar to Q9S2K6|SC7H2.11c HYPOTHETICAL 34.2 KDA PROTEIN from Streptomyces coelicolor (312 aa), FASTA scores: opt: 770, E(): 1.4e-39, (40.9% identity in 286 aa overlap); and similar to Q9ADA5|SCI52.04 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 386, E(): 3e-16, (29.05% identity in 296 aa overlap). Also similar to O33260|Rv2125|MTCY261.21 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (292 aa), FASTA scores: opt: 387, E(): 2.3e-16, (29.45% identity in 292 aa overlap). Protein product from Mb2733 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2733 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008492" /db_xref="InterPro:IPR019151" /db_xref="InterPro:IPR038389" /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Z3" /protein_id="SIU01351.1" /translation="MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHAL EGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPE LSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHT RPITMTAHSNNRELISDFQPWISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLT QTDYPAAAQALLEQVAKTGSLQLPLAALAEAAAEVQAKIDEQVQASAEVAQVVAALER QYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKSDDDPT" CDS 2989146..2990171 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2734" /product="POSSIBLE HYDROLASE" /note="Mb2734, -, len: 341 aa. Equivalent to Rv2715, len: 341 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 341 aa overlap). Possible hydrolase (EC 3.-.-.-), showing some similarity with other hydrolases e.g. Q9I5B0|PA0829 PROBABLE HYDROLASE from Pseudomonas aeruginosa (313 aa), FASTA scores: opt: 336, E(): 9.9e-14, (28.05% identity in 289 aa overlap); BAB55888 HYDROLASE (FRAGMENT) from Terrabacter sp. DBF63 (319 aa), FASTA scores: opt: 326, E(): 4.2e-13, (27.95% identity in 290 aa overlap); O52866|CEH|EH SOLUBLE EPOXIDE HYDROLASE from Corynebacterium SP (285 aa), FASTA scores: opt: 325, E(): 4.4e-13, (29.95% identity in 284 aa overlap); etc. Also shows some similarity to P96811|EPHF|Rv0134|MTCI5.08 HYPOTHETICAL 33.8 KDA PROTEINfrom Mycobacterium tuberculosis (300 aa), FASTA scores: E(): 1.8e-10, (27.7% identity in 271 aa overlap). Contains lipases, serine active site motif (PS00120). Protein product from Mb2734 detected using SWATH mass spectrometry. Mb2734 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A573" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0A573" /protein_id="SIU01352.1" /translation="MTERKRNLRPVRDVAPPTLQFRTVHGYRRAFRIAGSGPAILLIH GIGDNSTTWNGVHAKLAQRFTVIAPDLLGHGQSDKPRADYSVAAYANGMRDLLSVLDI ERVTIVGHSLGGGVAMQFAYQFPQLVDRLILVSAGGVTKDVNIVFRLASLPMGSEAMA LLRLPLVLPAVQIAGRIVGKAIGTTSLGHDLPNVLRILDDLPEPTASAAFGRTLRAVV DWRGQMVTMLDRCYLTEAIPVQIIWGTKDVVLPVRHAHMAHAAMPGSQLEIFEGSGHF PFHDDPARFIDIVERFMDTTEPAEYDQAALRALLRRGGGEATVTGSADTRVAVLNAIG SNERSAT" CDS 2990220..2990906 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2735" /product="Phenazine biosynthesis protein PhzF like" /note="Mb2735, -, len: 228 aa. Equivalent to Rv2716, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). Conserved hypothetical protein, similar to other proteins e.g. Q9RKR0|SCC75A.14 HYPOTHETICAL 23.3 KDA PROTEIN from Streptomyces coelicolor (214 aa), FASTA scores: opt: 447, E(): 4e-22, (44.1% identity in 220 aa overlap); Q9HHG6|PHZF|VNG6408G PHENAZINE BIOSYNTHETIC PROTEIN from Halobacterium sp. strain NRC-1 (299 aa), FASTA scores: opt: 201, E(): 6.1e-06, (30.4% identity in 148 aa overlap) (similarity only at N-terminus); P73125|SLR1019 HYPOTHETICAL 34.1 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (314 aa), FASTA scores: opt: 196, E(): 1.4e-05, (28.5% identity in 298 aa overlap); etc. Protein product from Mb2735 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2735 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5G8" /db_xref="InterPro:IPR003719" /db_xref="UniProtKB/Swiss-Prot:P0A5G8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01353.1" /translation="MAIEVSVLRVFTDSDGNFGNPLGVINASKVEHRDRQQLAAQSGY SETIFVDLPSPGSTTAHATIHTPRTEIPFAGHPTVGASWWLRERGTPINTLQVPAGIV QVSYHGDLTAISARSEWAPEFAIHDLDSLDALAAADPADFPDDIAHYLWTWTDRSAGS LRARMFAANLGVTEDEATGAAAIRITDYLSRDLTITQGKGSLIHTTWSPEGWVRVAGR VVSDGVAQLD" CDS complement(2990915..2991409) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2736C" /product="DUF1794" /note="Mb2736c, -, len: 164 aa. Equivalent to Rv2717c, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 164 aa overlap). Conserved hypothetical protein, equivalent to Q9CCB8|ML1006 (alias Q49838 but shortened N-terminus) HYPOTHETICAL PROTEIN from Mycobacterium leprae (161 aa), FASTA scores: opt: 797, E(): 2.3e-46, (73.8% identity in 164 aa overlap). Also highly similar to other eukaryotic proteins e.g. O64527|YUP8H12R.14 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (166 aa), FASTA scores: opt: 393, E(): 2.3e-19, (42.4% identity in 158 aa overlap); Q9Y325 CGI-36 PROTEIN from Homo sapiens (Human) (165 aa), FASTA scores: opt: 294, E(): 9.5e-13, (33.95% identity in 159 aa overlap); etc. Protein product from Mb2736c detected using SWATH mass spectrometry. Mb2736c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TY17" /db_xref="InterPro:IPR012674" /db_xref="InterPro:IPR014878" /db_xref="InterPro:IPR022939" /db_xref="UniProtKB/Swiss-Prot:Q7TY17" /protein_id="SIU01354.1" /translation="MTRDLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHV GKPFLTYTQQTRAVADGKPLHSETGYLRVCRPGCVELVLAHPSGITEIEVGTYSVTGD VIELELSTRADGSIGLAPTAKEVTALDRSYRIDGDELSYSLQMRAVGQPLQDHLAAVL HRQR" CDS complement(2991461..2991925) /codon_start=1 /transl_table=11 /gene="nrdr" /locus_tag="BQ2027_MB2737C" /product="probable transcriptional regulatory protein nrdr" /note="Mb2737c, -, len: 154 aa. Equivalent to Rv2718c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Conserved hypothetical protein, equivalent to Q49844|ML1005|U2235A|B2235_C2_209 HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 937, E(): 1.5e-52, (92.7% identity in 151 aa overlap). Highly similar to O86848|NRDR_STRCL PUTATIVE REGULATORY PROTEIN from Streptomyces clavuligerus (172 aa), FASTA scores: opt: 750, E(): 1.1e-40, (73.65% identity in 148 aa overlap); O69980|SC4H2.25 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (182 aa), FASTA scores: opt: 725, E(): 4.6e-39, (73.1% identity in 145 aa overlap); Q9KPU0|VC2272 HYPOTHETICAL PROTEIN from Vibrio cholerae (156 aa), FASTA scores: opt: 462, E(): 1.8e-22, (47.3% identity in 148 aa overlap); etc. Protein product from Mb2737c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2737c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67314" /db_xref="InterPro:IPR003796" /db_xref="InterPro:IPR005144" /db_xref="UniProtKB/Swiss-Prot:P67314" /protein_id="SIU01355.1" /translation="MHCPFCRHPDSRVIDSRETDEGQAIRRRRSCPECGRRFTTVETA VLAVVKRSGVTEPFSREKVISGVRRACQGRQVDDDALNLLAQQVEDSVRAAGSPEIPS HDVGLAILGPLRELDEVAYLRFASVYRSFSSADDFAREIEALRAHRNLSAHS" CDS complement(2992088..2992585) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2738C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb2738c, -, len: 165 aa. Equivalent to Rv2719c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 165 aa overlap). Possible conserved membrane protein, equivalent to Q49846|ML1004|B2235_C3_243 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (164 aa), FASTA scores: opt: 486, E(): 4e-21, (55.2% identity in 163 aa overlap). Mb2738c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2T8" /db_xref="InterPro:IPR018392" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2T8" /protein_id="SIU01356.1" /translation="MTPVRPPHTPDPLNLRGPLDGPRWRRAEPAQSRRPGRSRPGGAP LRYHRTGVGMSRTGHGSRPVPPATTVGLALLAAAITLWLGLVAQFGQMITGGSADGSA DSTGRVPDRLAVVRVETGESLHDVAVRVAPNAPTRQVADRIRELNGLQTPALAVGQTL IAPVG" CDS 2992893..2993546 /codon_start=1 /transl_table=11 /gene="lexA" /locus_tag="BQ2027_MB2739" /product="REPRESSOR LEXA" /note="Mb2739, lexA, len: 217 aa. Equivalent to Rv2720, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 217 aa overlap). LexA repressor (EC 3.4.21.88) (see citations below), equivalent to Q49848|LEXA_MYCLE|ML1003|B2235_F2_55 LEXA REPRESSOR from Mycobacterium leprae (217 aa), FASTA scores: opt: 1255, E(): 7.1e-70, (89.8% identity in 216 aa overlap). Also highly similar to others e.g. O69979|LEXA_STRCO|SC4H2.24c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 1034, E(): 2.6e-56, (70.5% identity in 217 aa overlap); O86847|LEXA_STRCL from Streptomyces clavuligerus (239 aa), FASTA scores: opt: 1021, E(): 1.6e-55, (69.1% identity in 217 aa overlap); Q9KAD3|LEXA_BACHD from Bacillus halodurans (207 aa), FASTA scores: opt: 645, E(): 1.5e-32, (47.9% identity in 213 aa overlap); etc. BELONGS TO PEPTIDASE FAMILY S24; ALSO KNOWN AS THE UMUD/LEXA FAMILY. Protein product from Mb2739 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2739 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TY15" /db_xref="InterPro:IPR006197" /db_xref="InterPro:IPR006199" /db_xref="InterPro:IPR006200" /db_xref="InterPro:IPR015927" /db_xref="InterPro:IPR036286" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR039418" /db_xref="UniProtKB/Swiss-Prot:Q7TY15" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01357.1" /translation="MLSADSALTERQRTILDVIRASVTSRGYPPSIREIGDAVGLTST SSVAHQLRTLERKGYLRRDPNRPRAVNVRGADDAALPPVTEVAGSDALPEPTFAPVLG RIAAGGPILAEEAVEDVFPLPRELVGEGTLFLLKVIGDSMVEAAICDGDWVVVRQQNV ADNGDIVAAMIDGEATVKTFKRAGGQVWLMPHNPAFDPIPGNDATVLGKVVTVIRKV" CDS complement(2993568..2995667) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2740C" /product="POSSIBLE CONSERVED TRANSMEMBRANE ALANINE AND GLYCINE RICH PROTEIN" /note="Mb2740c, -, len: 699 aa. Equivalent to Rv2721c, len: 699 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 699 aa overlap). Possible conserved transmembrane ala-, gly-rich protein, equivalent to Q49837|ML1002|U2235I POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (687 aa), FASTA scores: opt: 2703, E(): 6.6e-135, (60.3% identity in 713 aa overlap). Shows some similaity to Q01377|CSP1 PS1 PROTEIN PRECURSOR (SECRETED PROTEIN) from Corynebacterium glutamicum (Brevibacterium flavum) (657 aa), FASTA scores: opt: 276, E(): 3.8e-07, (29.4% identity in 272 aa overlap); and Q9KIJ0 Rv2721c-LIKE PROTEIN from Mycobacterium paratuberculosis (246 aa), FASTA scores: opt: 178, E(): 0.025, (37.5% identity in 120 aa overlap). Protein product from Mb2740c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2740c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2F1" /db_xref="InterPro:IPR013207" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F1" /protein_id="SIU01358.1" /translation="MNGQRGQLSTLIGRTLLGLAATAVTAVLLAPTVAASPMGDAEDA MMAAWEKAGGDTSTLGVRKGDVYPIGDGFALDFAGGKMFFTPATGAKYLYGPLLDKYE SLGGAADSDLGFPTINEVPGLAGPDSRVSTFSAADNPVIFWTPEHGAFVVRGALNAAW DKLGSSGGVLGAPVGDETYDGEVTAQKFSGGEVSWNRATKEFTTVPAVLAEQLKGLQV AIDPSAAINMAWRAAGGAAGPLGAKKGGQYPIGGDGIAQDFVGGKVFFSPATGANAVE GEILAKYESLGGPVSSDLGFPIANETDGGFGPSSRIVRFSAADKPVIFWTPDHGAFVV RGAMVAAWDKLRGPNGKLGAPVGDQTVDGDVVSQKFTGGMISWNRAKNTFTTDPANLA PLLSGLQVSGQNQPSTSAMPPPGKKFTWHWWWLGAAALGVLLVVMVALVVFGLRRRRR GYDAAAYDDDRAGDVEYGTAADGDWPPDEDFGSEHFGFGDQFPPEPVAPDAGSTPRVS WPRGAGAAVGDAEHLPGEEGYGSDLLSGPSNVGVEEEDTDAVDTTPTPVVSQADLSEV GPDLIVPERVVPETFVPQAFVPEAVAPEAVPPDVHAADLADTGLPAAAVSAAEDRGGR HAAAEPPEPPSAGVRPAIHLPLEDPYQMPNGYPVKASVSFGLYYPPGSALYHDTLAEL WFASEEVAQVNGFIRAD" CDS 2995683..2995931 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2741" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2741, -, len: 82 aa. Equivalent to Rv2722, len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Conserved hypothetical protein, similar to Q9CCB9|ML1001 HYPOTHETICAL PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 154, E(): 0.00053, (37.5% identity in 88 aa overlap). Equivalent to AAK47111 from Mycobacterium tuberculosis strain CDC1551 (94 aa) but shorter by 12 aa." /db_xref="UniProtKB/TrEMBL:A0A1R3Y1Z8" /protein_id="SIU01359.1" /translation="MPCLARQPVDLPPWAGPRCGPYCPRARITLLQRTTIAKSNRKYY ENGYPADVKLMPGHAAVVSNRAAARAGFALPCRKRQPD" CDS 2995957..2997150 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2742" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2742, -, len: 397 aa. Equivalent to Rv2723, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 397 aa overlap). Probable conserved integral membrane protein, highly similar to others e.g. Q9Z503|SCC54.23c PUTATIVE INTEGRAL MEMBRANE EXPORT PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 883, E(): 2.4e-48, (46.4% identity in 332 aa overlap); Q9RD18|SCM1.25c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (316 aa), FASTA scores: opt: 865, E(): 3.1e-47, (47.55% identity in 324 aa overlap); P96554|Y319_MYXXA INTEGRAL MEMBRANE PROTEIN (PROBABLE) from Myxococcus xanthus (319 aa), FASTA scores: opt: 626, E(): 3.4e-32, (34.65% identity in 323 aa overlap); P42601|YGJT_ECOLI|B3088 from Escherichia coli strain K12 INTEGRAL MEMBRANE PROTEIN (PROBABLE) (321 aa), FASTA scores: opt: 541, E(): 7.7e-27, (35.1% identity in 279 aa overlap); etc. Protein product from Mb2742 detected using SWATH mass spectrometry. Mb2742 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A615" /db_xref="InterPro:IPR005496" /db_xref="InterPro:IPR022369" /db_xref="UniProtKB/Swiss-Prot:P0A615" /protein_id="SIU01360.1" /translation="MGASGLVWTLTIVLIAGLMLVDYVLHVRKTHVPTLRQAVIQSAT FVGIAILFGIAVVVFGGSELAVEYFACYLTDEALSVDNLFVFLVIISSFGVPRLAQQK VLLFGIAFALVTRTGFIFVGAALIENFNSAFYLFGLVLLVMAGNLARPTGLESRDAET LKRSVIIRLADRFLRTSQDYNGDRLFTVSNNKRMMTPLLLVMIAVGGTDILFAFDSIP ALFGLTQNVYLVFAATAFSLLGLRQLYFLIDGLLDRLVYLSYGLAVILGFIGVKLMLE ALHDNKIPFINGGKPVPTVEVSTTQSLTVIIIVLLITTAASFWSARGRAQNAMARARR YATAYLDLHYETESAERDKIFTALLAAERQINTLPTKYRMQPGQDDDLMTLLCRAHAA RDAHM" CDS complement(2997179..2998339) /codon_start=1 /transl_table=11 /gene="fadE20" /locus_tag="BQ2027_MB2743C" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE20" /note="Mb2743c, fadE20, len: 386 aa. Equivalent to Rv2724c, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 386 aa overlap). Probable fadE20, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. Q9X7Y2|SC6A5.36 from Streptomyces coelicolor (382 aa), FASTA scores: opt: 1583, E(): 6.9e-94, (62.7% identity in 378 aa overlap); Q9HVY0|PA4435 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 1468, E(): 1.6e-86, (57.65% identity in 380 aa overlap); Q9ABZ1|CC0079 from Caulobacter crescentus (391 aa), FASTA scores: opt: 1298, E(): 1.2e-75, (51.9% identity in 391 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis proteins e.g. O06164|FADE19|Rv2500c|MTCY07A7.06c ACYL-CoA DEHYDROGENASE (394 aa) (34.3% identity in 382 aa overlap). Contains acyl-CoA dehydrogenases signature 2 (PS00073). BELONGS TO THE ACYL-CoA DEHYDROGENASES FAMILY. Protein product from Mb2743c detected using SWATH mass spectrometry. Mb2743c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y203" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y203" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01361.1" /translation="MGSATKYQRTLFEPEHELFRESYRAFLDRHVAAYHDEWEKTKIV DRGVWLEAGKQGFLGMAVPEEYGGGGNADFRYNTVITEETCAGRYSGIGFGLHNDIVA PYLLALATEEQKRRWFPNFCTGELTTAIAMTEPGTGSDLQGITTRAVKHGDHYVLNGS KTFITNGINSDLVIVVAQTDPEKGAQGFSLLVVERGMAGFERGRQLDKIGLDAQDTAE LSFTDVAVPAENLLGQEGMGFIYLMQNLPQERISIAIMAAAGMESVLEQTLQYAKERK AFGRSIGSFQNSRFLLAELATEATVVRIMVDEFIKLHLAGKLTAEQAAMAKWYATEKQ VYLNDRCLQLHGGYGYMREYPVARAYLDSRVQTIYGGTTEIMKEIIGRGLGV" CDS complement(2998475..2999962) /codon_start=1 /transl_table=11 /gene="hflX" /locus_tag="BQ2027_MB2744C" /product="PROBABLE GTP-BINDING PROTEIN HFLX" /note="Mb2744c, hflX, len: 495 aa. Equivalent to Rv2725c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 495 aa overlap). Probable hflX (hfl for high frequency of lysogenization), GTP-binding protein (EC 3.1.5.-),equivalent to Q9CCC0|ML0997 (alias Q49843|HFLX but longer) POSSIBLE ATP/GTP-BINDING PROTEIN from Mycobacterium leprae (488 aa), FASTA scores: opt: 2562, E(): 1.1e-133, (84.55% identity in 485 aa overlap). Also highly similar to many e.g. Q9XCC1 from Streptomyces fradiae (425 aa), FASTA scores: opt: 1280, E(): 3.2e-63, (57.7% identity in 423 aa overlap); P73965|HFLX|SLR1521 from Synechocystis sp. strain PCC 6803 (534 aa), FASTA scores: opt: 1028, E(): 2.8e-49, (44.7% identity in 414 aa overlap); P25519|HFLX_ECOLI|B4173 from Escherichia coli strain K12 (426 aa), FASTA scores: opt: 916, E(): 3.4e-43, (40.1% identity in 414 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb2744c detected using SWATH mass spectrometry. Mb2744c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y218" /db_xref="InterPro:IPR006073" /db_xref="InterPro:IPR016496" /db_xref="InterPro:IPR025121" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR030394" /db_xref="InterPro:IPR032305" /db_xref="InterPro:IPR042108" /db_xref="UniProtKB/TrEMBL:A0A1R3Y218" /protein_id="SIU01362.1" /translation="MPANSDARPAATCHHRVLAMTYPDPPQTGLSDFTPSLGELALED RSALRRVAGLSTELADVSEVEYRQLRLERVVLVGVWTEGSAADNRASLAELAALAETA GSQVLEGLIQRRDKPDPSTYIGSGKAAELREVIVATGADTVICDGELSPAQLTALEKA VQVKVIDRTALILDIFAQHATSREGKAQVSLAQMEYMLPRLRGWGESMSRQAGGRAGG SGGGVGLRGPGETKIETDRRRIRERMAKLRRDIRAMKQVRDTQRSRRRHSDVPSIAIV GYTNAGKSSLLNALTGAGVLVQDALFATLEPTTRRAEFGDGRPVVLTDTVGFVRHLPT QLVEAFRSTLEEVVHADLLVHVVDGSDGHPLAQIDAVRQVISEVIADHDGDPPPELLV VNKVDVASDLMLAKLRHGLPGAVFVSARTGDGIDALRRRMAELVVPADTAVDVVIPYD RGDLVARVHADGRIQQAEHKPEGTRIKARVPEALAATLREFAPRA" CDS complement(2999979..3000848) /codon_start=1 /transl_table=11 /gene="dapF" /locus_tag="BQ2027_MB2745C" /product="PROBABLE DIAMINOPIMELATE EPIMERASE DAPF (DAP EPIMERASE)" /note="Mb2745c, dapF, len: 289 aa. Equivalent to Rv2726c, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Probable dapF, diaminopimelate epimerase (EC 5.1.1.7), equivalent to P46814|DAPF_MYCLE|ML0996|B2235_C3_233 DIAMINOPIMELATE EPIMERASE from Mycobacterium leprae (296 aa), FASTA scores: opt: 1488, E(): 2.1e-83, (76.05% identity in 292 aa overlap). Also highly similar to O69969|DAPF_STRCO|SC4H2.14 from Streptomyces coelicolor (289 aa), FASTA scores: opt: 439, E(): 1.4e-19, (45.6% identity in 296 aa overlap); and similar to many e.g. O29511|DAPF_ARCFU|AF0747 from Archaeoglobus fulgidus (280 aa), FASTA scores: opt: 310, E(): 9.7e-12, (33.8% identity in 296 aa overlap); Q51564|DAPF_PSEAE|PA5278 from Pseudomonas aeruginosa (276 aa), FASTA scores: opt: 272, E(): 2e-09, (30.15% identity in 292 aa overlap); P08885|DAPF_ECOLI|B3809 from Escherichia coli strain K12 (274 aa), FASTA scores: opt: 266, E(): 4.5e-09, (30.4% identity in 296 aa overlap); etc. BELONGS TO THE DIAMINOPIMELATE EPIMERASE FAMILY. Protein product from Mb2745c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2745c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63898" /db_xref="InterPro:IPR001653" /db_xref="InterPro:IPR018510" /db_xref="UniProtKB/Swiss-Prot:P63898" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01363.1" /translation="MIFAKGHGTQNDFVLLPDVDAELVLTAARVAALCDRRKGLGADG VLRVTTAGAAQAVGVLDSLPEGVRVTDWYMDYRNADGSAAQMCGNGVRVFAHYLRASG LEVRDEFVVGSLAGPRPVTCHHVEAAYADVSVDMGKANRLGAGEAVVGGRRFHGLAVD VGNPHLACVDSQLTVDGLAALDVGAPVSFDGAQFPDGVNVEVLTAPVDGAVWMRVHER GVGETRSCGTGTVAAAVAALAAVGSPTGTLTVHVPGGEVVVTVTDATSFLRGPSVLVA RGDLADDWWNAMG" CDS complement(3000873..3001817) /codon_start=1 /transl_table=11 /gene="miaA" /locus_tag="BQ2027_MB2746C" /product="PROBABLE TRNA DELTA(2)-ISOPENTENYLPYROPHOSPHATE TRANSFERASE MIAA (IPP TRANSFERASE) (ISOPENTENYL-DIPHOSPHATE:TRNA ISOPENTENYLTRANSFERASE) (IPTASE) (IPPT)" /note="Mb2746c, miaA, len: 314 aa. Equivalent to Rv2727c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Probable miaA, tRNA delta(2)-isopentenylpyrophosphate transferase (EC 2.5.1.8), equivalent to P46811|MIAA_MYCLE|ML0995|B2235_C3_232 TRNA DELTA(2)-ISOPENTENYLPYROPHOSPHATE TRANSFERASE from Mycobacterium leprae (311 aa), FASTA scores: opt: 1679, E(): 3.2e-89, (81.85% identity in 314 aa overlap). Also highly similar to many e.g. O69967|MIAA_STRCO|SC4H2.12 from Streptomyces coelicolor (312 aa), FASTA scores: opt: 1006, E(): 1.2e-50, (55.5% identity in 301 aa overlap); O31795|MIAA_BACSU from Bacillus subtilis (314 aa), FASTA scores: opt: 671, E(): 1.9e-31, (38.55% identity in 293 aa overlap);P16384|MIAA_ECOLI|TRPX|B4171 from Escherichia coli strain K12 and Shigella flexneri (316 aa), FASTA scores: opt: 565, E(): 2.3e-25, (35.2% identity in 307 aa overlap);etc. Contains PS00017 ATP/GTP-binding site motif A (P -loop). BELONGS TO THE IPP TRANSFERASE FAMILY. Protein product from Mb2746c detected using SWATH mass spectrometry. Mb2746c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65353" /db_xref="InterPro:IPR018022" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR039657" /db_xref="UniProtKB/Swiss-Prot:P65353" /protein_id="SIU01364.1" /translation="MRPLAIIGPTGAGKSQLALDVAARLGARVSVEIVNADAMQLYRG MDIGTAKLPVSERRGIPHHQLDVLDVTETATVARYQRAAAADIEAIAARGAVPVVVGG SMLYVQSLLDDWSFPATDPSVRARWERRLAEVGVDRLHAELARRDPAAAAAILPTDAR RTVRALEVVELTGQPFAASAPRIGAPRWDTVIVGLDCQTTILDERLARRTDLMFDQGL VEEVRTLLRNGLREGVTASRALGYAQVIAALDAGAGADMMRAAREQTYLGTRRYVRRQ RSWFRRDHRVHWLDAGVASSPDRARLVDDAVRLWRHVT" CDS complement(3001814..3002509) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2747C" /product="conserved alanine rich protein" /note="Mb2747c, -, len: 231 aa. Equivalent to Rv2728c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q49835|ML0994|B2235_C1_162 HYPOTHETICAL PROTEIN from Mycobacterium leprae (232 aa), FASTA scores: opt: 1037, E(): 1.2e-54, (68.55% identity in 232 aa overlap). Also similar to O69964|SC4H2.09 from Streptomyces coelicolor (237 aa), FASTA scores: opt: 300, E(): 7.7e-11, (32.8% identity in 241 aa overlap); and some similarity with other proteins e.g. Q14234|ELN ELASTIN from Homo sapiens (Human) (757 aa), FASTA scores: opt: 161, E(): 0.03, (30.6% identity in 242 aa overlap); P55488|Y4IE HYPOTHETICAL 15.4 KDA PROTEIN from Rhizobium sp. strain NGR234 (135 aa), FASTA scores: opt: 147, E(): 0.061, (34.95% identity in 123 aa overlap). Shows also some similarity with P71657|Rv1387|MTCY21B4.04 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (539 aa), FASTA scores: opt: 159, E(): 0.035, (34.8% identity in 135 aa overlap). Protein product from Mb2747c detected using SWATH mass spectrometry. Mb2747c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y408" /protein_id="SIU01365.1" /translation="MLSAIGIVPSAPVLVPELAGAAAAELADLGAAVIAAASLLPKSW IAVGTGRADDVVRPTDVGTFAGFGADVRVGLAPQDGDGVAVPVELPLCALLTAWVRGQ ARPEARAQVHVYASDHGSDAAVARGRQLRADIDREPDPIGVLVVADGLNTLTPRAPGG YDPDGAGMQRALDDALASGDLAVLTRLPAQVLGRVAFQVLAGLAEPGPRSAKEFYRGA PHGVGYFAGVWQP" CDS complement(3002618..3003523) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2748C" /product="probable conserved integral membrane alanine valine and leucine rich protein" /note="Mb2748c, -, len: 301 aa. Equivalent to Rv2729c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 301 aa overlap). Probable conserved integral membrane ala-, val-, leu-rich protein, similar to P42459|YLEU_CORGL HYPOTHETICAL 29.6 KDA PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum)(270 aa), FASTA scores: opt: 365, E(): 4.7e-15, (30.75% identity in 221 aa overlap); and to other integral membrane proteins (principally from Streptomyces sp.) e.g. Q9EWZ8|2SCG38.21 from Streptomyces coelicolor (302 aa), FASTA scores: opt: 365, E(): 5.2e-15, (32.0% identity in 278 aa overlap); Q9S267|SCI30A.06 from Streptomyces coelicolor (297 aa), FASTA scores: opt: 356, E(): 1.8e-14, (31.5% identity in 289 aa overlap); AAK81278|CAC3346 from Clostridium acetobutylicum (472 aa), FASTA scores: opt: 154, E(): 0.038, (24.1% identity in 224 aa overlap); etc. Mb2748c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2U4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2U4" /protein_id="SIU01366.1" /translation="MASVEFATILALGAALLAGIGYVTLQRSARQVTAEEYVGHLTLF HLSLRHALWWLGSLAAVASFTLQAIALTMGSVVLVQSLQATALLFALLIDARLTHHRC TPREWMWAVLLAGAVAVIVMSGNPAAGTTRAPFSTWAVVAVVVVPAVVLCVVGARIAS GSLSAVLLAVASSATLAVFTVLTKGVVTELGEGFATLIRTPELYAWILVLPIGLMLQQ SSLRVGALTASLPTITVARPVIASVLGITVLDEVLHTGRVALVALVAVVVVVVVATVA LARDEVAMMTVSAGELGAAGQLAVR" CDS 3003590..3004066 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2749" /product="HYPOTHETICAL PROTEIN" /note="Mb2749, -, len: 158 aa. Equivalent to Rv2730, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3Y275" /protein_id="SIU01367.1" /translation="MMMNWRQTNITTKRCAQTRASSSASEFCGIFAAPGLMRNCHHGG SAPSAVGGSAVQLTVAYGPQRFHGRCASNSSVRPLTTGGSWTPTSISSTDGGKAQGHD THDRQISRRTVCQAASILASILLETVAGPGEGIGPTTSVPLRAADARHTREGLQGR" CDS 3004074..3005426 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2750" /product="conserved alanine and arginine rich protein" /note="Mb2750, -, len: 450 aa. Equivalent to Rv2731, len: 450 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 450 aa overlap). Conserved hypothetical ala-, arg-rich protein, highly similar in part to Q49849|B2235_F2_77 HYPOTHETICAL PROTEIN from Mycobacterium leprae (266 aa), FASTA scores: opt: 368, E(): 1e-10, (73.5% identity in 83 aa overlap); and Q9KXN9|SC9C5.35 HYPOTHETICAL 6.5 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (58 aa), FASTA scores: opt: 214, E(): 0.00065, (51.7% identity in 58 aa overlap). Also similar to Q9L296|SCL2.01 HYPOTHETICAL 37.4 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (328 aa), FASTA scores: opt: 843, E(): 3.7e-33, (45.95% identity in 296 aa overlap) (but N-terminus shorter); and shows some similarity with other proteins e.g. Q26938 KINETOPLAST-ASSOCIATED PROTEIN (KAP) from Trypanosoma cruzi (1052 aa), FASTA scores: opt: 223, E(): 0.0022, (30.3% identity in 297 aa overlap). Start site chosen by RBS and to avoid overlap, although there are several other possible start sites further upstream. Protein product from Mb2750 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2750 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007139" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2G1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01368.1" /translation="MTADEPRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVG AHPPSDPHRFGRIDDDGTVWLVSASGERIVGSWQAGDPEAAFAHFGRRFDDLSTEIML MDERLASGTGDARKIKAHAIALAETLPTACVLGDVDALADRLTSIRDRAEVIAAADRS RREEHRAAQTARKEALAAEAEELAANATQWKVAGDRLRAILDEWKTISGVDRKVDDAL WKRYSTARDTFNRRRGSHFAELDRERSGVRQSKERLCERAEELSESTDWTATSAEFRK LLADWKAAGRASKDVDDALWRRFKAAQDSFFTARNAATAEKEAELRANADAKEALLAE AERLDTTNHEAARAALRSIAEKWDAIGKVSRERAAELERRLRAVEKKVREAGEADWSD PQARARAEQFRARAEQFEHQAEKAAAAGRTKEADEAKANAEQWRQWAEAAADALTRRP " CDS complement(3005423..3006037) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2751C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2751c, -, len: 204 aa. Equivalent to Rv2732c, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Probable conserved transmembrane protein, similar to Q49834 hypothetical protein B2235_C1_155 from Mycobacterium leprae (209 aa), FASTA scores: opt: 932, E(): 0, (70.6% identity in 201 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. Protein product from Mb2751c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2751c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y212" /db_xref="UniProtKB/TrEMBL:A0A1R3Y212" /protein_id="SIU01369.1" /translation="MMSHEHDAGDLDALRAEIEAAERRVAREIEPGARALVVAILVFV LLGSFILPHTGSVRGWDVLFSSHGAGRAAVALPSRVFAWLALVFGVGFSMLALLTRRW ALAWVALAGSAMASGTGLLAVWSRQTVAAGHPGPGIGLIVAWITAIVLTFHWAQVVWS RTIVQLAAEERRRRVVAQQQCKTLLDHVQTDSEAGTTPDRGTDR" CDS complement(3006034..3007572) /codon_start=1 /transl_table=11 /gene="miaB" /locus_tag="BQ2027_MB2752C" /product="tRNA-i(6)A37 methylthiotransferase (EC" /EC_number="2.8.4.3" /note="Mb2752c, -, len: 512 aa. Equivalent to Rv2733c, len: 512 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 512 aa overlap). Conserved hypothetical ala-, arg-rich protein. Similar to other hypothetical proteins from a range of organisms e.g. Y195_MYCLE|Q49842 hypothetical 56.0 kd protein b2235_c2_195 from Mycobacterium leprae (516 aa), FASTA scores: opt: 2689, E(): 0, (80.4% identity in 509 aa overlap). Protein product from Mb2752c detected using SWATH mass spectrometry. Mb2752c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67086" /db_xref="InterPro:IPR002792" /db_xref="InterPro:IPR005839" /db_xref="InterPro:IPR006463" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013848" /db_xref="InterPro:IPR020612" /db_xref="InterPro:IPR023404" /db_xref="InterPro:IPR038135" /db_xref="UniProtKB/Swiss-Prot:P67086" /protein_id="SIU01370.1" /translation="MVAHDAAAGVTGEGAGPPVRRAPARTYQVRTYGCQMNVHDSERL AGLLEAAGYRRATDGSEADVVVFNTCAVRENADNRLYGNLSHLAPRKRANPDMQIAVG GCLAQKDRDAVLRRAPWVDVVFGTHNIGSLPTLLERARHNKVAQVEIAEALQQFPSSL PSSRESAYAAWVSISVGCNNSCTFCIVPSLRGREVDRSPADILAEVRSLVNDGVLEVT LLGQNVNAYGVSFADPALPRNRGAFAELLRACGDIDGLERVRFTSPHPAEFTDDVIEA MAQTRNVCPALHMPLQSGSDRILRAMRRSYRAERYLGIIERVRAAIPHAAITTDLIVG FPGETEEDFAATLDVVRRARFAAAFTFQYSKRPGTPAAQLDGQLPKAVVQERYERLIA LQEQISLEANRALVGQAVEVLVATGEGRKDTVTARMSGRARDGRLVHFTAGQPRVRPG DVITTKVTEAAPHHLIADAGVLTHRRTRAGDAHTAGQPGRAVGLGMPGVGLPVSAAKP GGCR" CDS 3007869..3008723 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2753" /product="Bacteriophage protein gp37" /note="Mb2753, -, len: 284 aa. Equivalent to Rv2734, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Conserved hypothetical protein, highly similar to various proteins e.g. Q984J2|MLR7981 ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 877, E(): 9e-50, (52.45% identity in 246 aa overlap) (N-terminus longer); Q98DH1|MLL4707 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (249 aa), FASTA scores: opt: 829, E(): 1.1e-46, (50.4% identity in 244 aa overlap); AAK65865|SMA2239 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (259 aa), FASTA scores: opt: 796, E(): 1.5e-44, (50.0% identity in 252 aa overlap); etc. Mb2753 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011101" /db_xref="UniProtKB/TrEMBL:A0A1R3Y213" /protein_id="SIU01371.1" /translation="MSDRSAIEWTGATWNPVTGCDRVSPGCDHCYAMTLAKRLKAMGS DKYQTDGDPRTSGPGFGVTIHPRSLDEPFRWRSPRTVFVNSMADLFHARVALWFIREV FEVMRATPQHTYQILTKRSLRLRRLAHKLEWPSNVWMGVSVENVDAFRRIEDLRQVPA AVRFLSCEPLLGPLDGINLGSIDWVIAGGESGPNFRPIDPQWVRHIRDTCTAADVPFF FKQWGGRTPKAFGRELDGRCWDEMPLIEIRNPDPRTTSRVHADPMLATAPTESAQRSN PGQLVRQR" CDS complement(3008608..3009600) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2754C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2754c, -, len: 330 aa. Equivalent to Rv2735c, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Conserved hypothetical protein, showing some similarity with Q98DH2|MLR4706 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (302 aa), FASTA scores: opt: 140, E(): 0.062, (27.0% identity in 200 aa overlap); and Q9PHA1|XF0043 HYPOTHETICAL PROTEIN from Xylella fastidiosa (293 aa), FASTA scores: opt: 120, E(): 1.2, (30.75% identity in 117 aa overlap). Mb2754c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR031009" /db_xref="UniProtKB/TrEMBL:A0A1R3Y227" /protein_id="SIU01372.1" /translation="MAREWSYWTRNKLEILAGYLPAFNRASQTSRERIYLDLMAGQPE NIDRDMGEKFDGSSLIAMKADPPFTRLRFCELNPLASELDVALRTRFPGDGRYRVVAG DSNVTIDETLAELGPWRWAPTFAFIDQQAAEVHWETINKVAAFRQNPRNLKTELWMLM SPTMIARGVKGTNAELFIEQVTRMYGDADWKRIQAARWRHHLTAPAYRAEMVNLMRVK LEYELGYKYSHRIPMQMHNKVTIFDMVFATDHWAGDAIMCHLYNRAAQKEPEMMRQAK SAKQQKESEDRGEMGLFSVGELAVQDSNAGQILWAPSPTWDPRARGWWSEDPGF" CDS complement(3009610..3010134) /codon_start=1 /transl_table=11 /gene="recX" /locus_tag="BQ2027_MB2755C" /product="REGULATORY PROTEIN RECX" /note="Mb2755c, recX, len: 174 aa. Equivalent to Rv2736c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Probable recX, regulatory protein (see citation below), equivalent to P37859|RECX_MYCLE|ML0988|U2235B REGULATORY PROTEIN RECX from Mycobacterium leprae (171 aa), FASTA scores: opt: 848, E(): 2e-46, (77.0% identity in 174 aa overlap); and CAA67596|RECX|P94965|RECX_MYCSM REGULATORY PROTEIN RECX from Mycobacterium smegmatis (188 aa), FASTA scores: opt: 679, E(): 8.8e-36, (66.45% identity in 164 aa overlap). Also similar (or highly similar to) others e.g. O50488|RECX_STRCO|SC4H8.09 from Streptomyces coelicolor (188 aa), FASTA scores: opt: 371, E(): 1.9e-16, (42.7% identity in 164 aa overlap); Q9LCZ3|RECX from Xanthomonas campestris pv. citri (162 aa), FASTA scores: opt: 189, E(): 4.4e-05, (32.45% identity in 151 aa overlap); P37860|RECX_PSEAE|PA3616 from Pseudomonas aeruginosa (153 aa), FASTA scores: opt: 159, E(): 0.0032, (30.65% identity in 137 aa overlap); etc. BELONGS TO THE RECX FAMILY. Mb2755c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5U9" /db_xref="InterPro:IPR003783" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/Swiss-Prot:P0A5U9" /protein_id="SIU01373.1" /translation="MTVSCPPPSTSEREEQARALCLRLLTARSRTRAELAGQLAKRGY PEDIGNRVLDRLAAVGLVDDTDFAEQWVQSRRANAAKSKRALAAELHAKGVDDDVITT VLGGIDAGAERGRAEKLVRARLRREVLIDDGTDEARVSRRLVAMLARRGYGQTLACEV VIAELAAERERRRV" CDS complement(3010100..3012472) /codon_start=1 /transl_table=11 /gene="recA" /locus_tag="BQ2027_MB2756C" /product="RECA PROTEIN (RECOMBINASE A) [CONTAINS: ENDONUCLEASE PI-MTUI (MTU RECA INTEIN)]." /note="Mb2756c, recA, len: 790 aa. Equivalent to Rv2737c, len: 790 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 790 aa overlap). recA, recombinase A (EC 3.1.-.-), (see citations below), equivalent to Q59560|RECA_MYCSM RECA PROTEIN from Mycobacterium smegmatis (349 aa), FASTA scores: opt: 1495, E(): 1.9e-79, (93.15% identity in 249 aa overlap); and P35901|RECA_MYCLE|ML0987 RECA PROTEIN from Mycobacterium leprae (711 aa), FASTA scores: opt: 1217, E(): 4.5e-63, (46.7% identity in 814 aa overlap). Also highly similar to many e.g. Q9REV6|RECA_AMYMD from Amycolatopsis mediterranei (Nocardia mediterranei) (348 aa), FASTA scores: opt: 1450, E(): 7.6e-77, (89.25% identity in 251 aa overlap); P42442|RECA_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (376 aa), FASTA scores: opt: 1355, E(): 2.6e-71, (76.55% identity in 273 aa overlap); P41054|RECA_STRAM from Streptomyces ambofaciens (372 aa), FASTA scores: opt: 1347, E(): 7.6e-71, (82.1% identity in 246 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00321 recA signature, and PS00881 Protein splicing signature. BELONGS TO THE RECA FAMILY. THIS PROTEIN UNDERGOES A PROTEIN SELF SPLICING THAT INVOLVES A POST-TRANSLATIONAL EXCISION OF THE INTERVENING REGION (INTEIN) FOLLOWED BY PEPTIDE LIGATION. BELONGS TO THE HOMING ENDONUCLEASE FAMILY IN THE INTEIN SECTION. Protein product from Mb2756c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2756c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5U5" /db_xref="InterPro:IPR003586" /db_xref="InterPro:IPR003587" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004042" /db_xref="InterPro:IPR004860" /db_xref="InterPro:IPR006141" /db_xref="InterPro:IPR006142" /db_xref="InterPro:IPR013765" /db_xref="InterPro:IPR020584" /db_xref="InterPro:IPR020587" /db_xref="InterPro:IPR020588" /db_xref="InterPro:IPR023400" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR027434" /db_xref="InterPro:IPR030934" /db_xref="InterPro:IPR036844" /db_xref="UniProtKB/Swiss-Prot:P0A5U5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01374.1" /translation="MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTG SIALDVALGIGGLPRGRVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHALDP DYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAALVPRAELEGEM GDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKIGVMFGSPETTTGGKALKF YASVRMDVRRVETLKDGTNAVGNRTRVKVVKNKCLAEGTRIFDPVTGTTHRIEDVVDG RKPIHVVAAAKDGTLHARPVVSWFDQGTRDVIGLRIAGGAIVWATPDHKVLTEYGWRA AGELRKGDRVAQPRRFDGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQR ALIDDVTRIAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPN WFFEPDIAADIVGNLLFGLFESDGWVSREQTGALRVGYTTTSEQLAHQIHWLLLRFGV GSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAFAESVPMWGPRGAALIQ AIPEATQGRRRGSQATYLAAEMTDAVLNYLDERGVTAQEAAAMIGVASGDPRGGMKQV LGASRLRRDRVQALADALDDKFLHDMLAEELRYSVIREVLPTRRARTFDLEVEELHTL VAEGVVVHNCSPPFKQAEFDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLG QGKENARNFLVENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF" CDS 3012667..3012840 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2757" /product="CONSERVED HYPOTHETICAL CYSTEINE RICH PROTEIN (FRAGMENT)" /note="Mb2757, -, len: 57 aa. Equivalent to Rv2737A, len: 57 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 57 aa overlap). Conserved hypothetical cys-rich protein (possibly gene fragment), similar to central part of AJ243803_1|glgA from Streptomyces coelicolor glgA (181 aa), FASTA scores: opt: 210, E(): 6.1e-09, (59.25% identity in 54 aa overlap). Mb2757 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y418" /db_xref="InterPro:IPR024726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y418" /protein_id="SIU01375.1" /translation="MRPDLRARLVRITDDLLNTASLAGSGVLTGPDLTFRRRSCCLFY RVPAGGKCGDCPL" CDS complement(3012854..3013060) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2758C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2758c, -, len: 68 aa. Equivalent to Rv2738c, len: 68 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Conserved hypothetical protein, equivalent to Q9CCC1|ML0986 HYPOTHETICAL PROTEIN from Mycobacterium leprae (67 aa), FASTA scores: opt: 397, E(): 3.7e-22, (83.6% identity in 67 aa overlap). Also highly similar to O50484|SC4H8.05 HYPOTHETICAL 7.5 KDA PROTEIN from Streptomyces coelicolor (64 aa), FASTA scores: opt: 185, E(): 5.9e-07, (39.7% identity in 63 aa overlap). Second part of the protein is highly similar to C-terminus of upstream ORF O33285|Rv2742c|MTV002.07c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 200, E(): 1.7e-07, (78.4% identity in 37 aa overlap). Protein product from Mb2758c detected using SWATH mass spectrometry. Mb2758c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021408" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2V4" /protein_id="SIU01376.1" /translation="MLAGVRLTEFHERVALHFGAAYGSSVLLDHVLTGFDGRSAAQAI EDGVEPRDVWRALCADFDVPHDRW" CDS complement(3013071..3014237) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2759C" /product="POSSIBLE ALANINE RICH TRANSFERASE" /note="Mb2759c, -, len: 388 aa. Equivalent to Rv2739c, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 388 aa overlap). Possible ala-rich transferase (EC 2.-.-.-), equivalent to Q49841|ML0985|MLCB33.02c|U2235C POSSIBLE GLYCOSYLTRANSFERASE from Mycobacterium leprae (392 aa), FASTA scores: opt: 2112, E(): 5.1e-114, (80.95% identity in 388 aa overlap). Shows some similarity with other transferases e.g. Q9S1V2|SCJ4.21 PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (407 aa), FASTA scores: opt: 290, E(): 2e-09, (27.75% identity in 382 aa overlap); Q9RYI3|DRA0329 PUTATIVE GLYCOSYLTRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 267, E(): 4.3e-08, (29.05% identity in 396 aa overlap); P96560|GTFC GLYCOSYLTRANSFERASE from Amycolatopsis orientalis (409 aa), FASTA scores: opt: 253, E(): 2.7e-07, (27.75% identity in 418 aa overlap); etc. Equivalent to AAK47130 from Mycobacterium tuberculosis strain CDC1551 (420 aa) but shorter 32 aa. Protein product from Mb2759c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2759c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y285" /db_xref="InterPro:IPR007235" /db_xref="UniProtKB/TrEMBL:A0A1R3Y285" /protein_id="SIU01377.1" /translation="MRVAVVAGPDPGHSFPAIALCQRFRAAADTPTLFTGVEWLEAAR AAGIDAVELDGLAATDRDLDAGARIHRRAAQMAVLNVPRLRALEPELVVSDVITACGG MAAELLGIPWVELNPHPLYLPSKGLPPIGSGLAAGTGIRGRLRDATMRALTGRSWRAG LRQRAAVRVEIGLPARDPGPLRRLIATLPALEVPRPDWPAEAVVVGPLHFEPTDRVLA IPAGTGPVVVVAPSTALTGTAGLTEVALQSLTPGETVPSGSRLVVSRLSGADLTVPPW AVAGLGSQAELLTRADLVICGGGHGMVAKTLLAGVPMVVVPGGGDQWEIANRVVRQGS AVLIRPLTADALVAAVNEVLSSPRFREAARRAAASVAGAADPVRVCHDALALAG" CDS 3014281..3014730 /codon_start=1 /transl_table=11 /gene="ephg" /locus_tag="BQ2027_MB2760" /product="epoxide hydrolase" /note="Mb2760, -, len: 149 aa. Equivalent to Rv2740, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 149 aa overlap). Conserved hypothetical protein, equivalent, but shorter 17 aa, to Q9CCC2|ML0984 (alias Q49850 but longer) HYPOTHETICAL PROTEIN from Mycobacterium leprae (164 aa), FASTA scores: opt: 481, E(): 9.7e-26, (52.0% identity in 150 aa overlap). Protein product from Mb2760 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2760 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2H1" /db_xref="InterPro:IPR013100" /db_xref="InterPro:IPR032710" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H1" /protein_id="SIU01378.1" /translation="MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDL VYENVGFSRIRGGRRTATLLRRMQGRVGFEVKIHRIGADGAAVLTERTDALIIGPLRV QFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL" CDS 3014940..3016454 /codon_start=1 /transl_table=11 /gene="PE_PGRS47" /locus_tag="BQ2027_MB2761" /product="pe-pgrs family protein pe_pgrs47" /note="Mb2761, PE_PGRS47, len: 504 aa. Equivalent to Rv2741, len: 525 aa, from Mycobacterium tuberculosis strain H37Rv, (94.37% identity in 515 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. Q10637|YD25_MYCTU|Rv1325c|MT1367|MTCY130.10c HYPOTHETICAL PE-PGRS FAMILY PROTEIN (603 aa), FASTA scores: opt: 1936, E(): 1.1e-71, (56.95% identity in 611 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, deletions of a single base (t-*) and of 84 bp (H37Rv2.3054724-3054807-*) leads to a shorter product with a diferent 5' start compared to its homolog in Mycobacterium tuberculosis strain H37Rv (504 aa versus 525 aa). NO PE motif. Mb2761 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y222" /protein_id="SIU01379.1" /translation="MACLEVSDVICDRGTGVLTAAAMDLASIGSTVSAASAAASAPTV AILAAGADEVSIAVAALFGMHGQAYQALSVQASAFHQQFVQALTAGAYSYASAEAAAV TPLQQLVDVINAPFRSALGRPLIGNGANGKPGTGQDGGAGGLLYGSGGNGGSGLAGSG QKGGNGGAAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAGTGGNGGFTTFLD AAGGAGGAGGAGGLFGAGGAGGVGGAALGGGAQAAGGNGGAGGVGGLFGAGGAGGAGG FGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGSGGAGGAGGVA TVAGHGGHGGNAGLLYGTGGAGGAGGFGGFGGDGGDDGIGGLVGSGGAGGSGGTGTLS GGRGGAGGNAGTFYGSGGAGGAGGESDNGDGGNGGVGGKAGLVGEGGNGGDGGATIAG KGGSGGNGGNAWLTGQGGNGGNAAFGKAGTGSVGVGGAGGLLEGQNGENGLLPS" CDS complement(3016478..3016726) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2762C" /product="CONSERVED HYPOTHETICAL ARGININE RICH PROTEIN [SECOND PART]" /note="Mb2762c, -, len: 82 aa. Equivalent to 3' end of Rv2742c, len: 277 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Conserved hypothetical arg-rich protein. Extreme N-terminus is highly similar to the N-teminus of Q9CCC1ML0986 HYPOTHETICAL PROTEIN from Mycobacterium leprae (67 aa), FASTA scores: opt: 183, E(): 0.00052, (71.05% identity in 38 aa overlap); and to the downstream ORF O33281|Rv2738c|MTV002.03c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (68 aa), FASTA scores: opt: 200, E(): 5.5e-05, (78.4% identity in 37 aa overlap). Questionable ORF. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2742c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 23 bp insertion splits Rv2742c into two parts, Mb2762c and Mb2763c. Mb2762c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021408" /db_xref="UniProtKB/TrEMBL:A0A1R3Y217" /protein_id="SIU01380.1" /translation="MPQGADARGWRHTADGVPRVGQPAIRRGVPGFWCWLDHVLTGFG GRNAICAIEDGVEPRVAWWALCTDFDVPRSMGRRTPGG" CDS complement(3016747..3017334) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2763C" /product="CONSERVED HYPOTHETICAL ARGININE RICH PROTEIN [FIRST PART]" /note="Mb2763c, -, len: 195 aa. Equivalent to 5' end of Rv2742c, len: 277 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 195 aa overlap). Conserved hypothetical arg-rich protein. Extreme N-terminus is highly similar to the N-teminus of Q9CCC1ML0986 HYPOTHETICAL PROTEIN from Mycobacterium leprae (67 aa), FASTA scores: opt: 183, E(): 0.00052, (71.05% identity in 38 aa overlap); and to the downstream ORF O33281|Rv2738c|MTV002.03c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (68 aa), FASTA scores: opt: 200, E(): 5.5e-05, (78.4% identity in 37 aa overlap). Questionable ORF. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2742c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 23 bp insertion splits Rv2742c into two parts, Mb2762c and Mb2763c. Mb2763c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y225" /protein_id="SIU01381.1" /translation="MLVDELGVKIVHAQHVPAPYLVQRMREIHERDENRQRHAQVDVQ RRRDQPERGQHQHRRNRDADHHPDGRTLAGQIVAHPVSHRVRQPRPVAIADVLPRVGP RADCVVAHSLQGSPRRRERRRGQTAHQRLGRRSGNAIACPLYLENAAGPEPDTKRAEG RRFGAFGGGDLRWMADRVPRQGSGRRGLGSRSGAG" CDS complement(3017406..3018218) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2764C" /product="possible conserved transmembrane alanine rich protein" /note="Mb2764c, -, len: 270 aa. Equivalent to Rv2743c, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 270 aa overlap). Possible conserved membrane ala-rich protein, equivalent to Q49833|MLCB33.04c|B2235_C1_148 UNKNOWN PROTEIN from Mycobacterium leprae (123 aa), FASTA scores: opt: 639, E(): 3.3e-31, (74.8% identity in 123 aa overlap). Protein product from Mb2764c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2764c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y231" /db_xref="UniProtKB/TrEMBL:A0A1R3Y231" /protein_id="SIU01382.1" /translation="MAVKAGQRRPWRSLLQRGVDTAGDLADLVAQKISVAIDPRARLL RRRRRALRWGLVFTAGCLLWGLVTALLAAWGWFTSLLVITGTIAVTQAIPATLLLLRY RWLRSEPLPVRRPASVRRLPPPGSAARPAMSALGASERGFFSLLGVMERGAMLPADEI RDLTAAANQTSAAMVATAAEVVSMERAVQCSAASRSYLVPTINAFTAQLSTGVRQYNE MVTAAAQLVSSANGAGGAGPGQQRYREELAGATDRLVAWAQAFDELGGLPRR" CDS complement(3018237..3019049) /codon_start=1 /transl_table=11 /gene="35kd_ag" /locus_tag="BQ2027_MB2765C" /product="CONSERVED 35 KDA ALANINE RICH PROTEIN" /note="Mb2765c, 35kd_ag, len: 270 aa. Equivalent to Rv2744c, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 270 aa overlap). 35kd_ag, conserved ala-rich protein 35-kd antigen (see citation below). N-terminal part is equivalent to Q49840|MLCB33.06c|B2235_C2_187 HYPOTHETICAL PROTEIN from Mycobacterium leprae (167 aa), FASTA scores: opt: 789, E(): 3.4e-35, (85.05% identity in 147 aa overlap); and C-terminal part equivalent to Q49845|MLCB33.05c|B2235_C3_214 HYPOTHETICAL PROTEIN from Mycobacterium leprae (114 aa), FASTA scores: opt: 465, E(): 3.6e-18, (65.8% identity in 114 aa overlap); note that these two proteins from Mycobacterium leprae are adjacent. Shows some similarity with Q55707||Y617_SYNY3|SLL0617 HYPOTHETICAL 28.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (267 aa), FASTA scores: opt: 155, E(): 0.19, (23.4% identity in 252 aa overlap); and C-terminus of Q9L4N1|EMM M PROTEIN from Streptococcus equisimilis (592 aa), FASTA scores: opt: 165, E(): 0.11, (23.45% identity in 260 aa overlap). C-terminus also similar to AAK45945|MT1676 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (85 aa), FASTA scores: opt: 159, E(): 0.047, (50.9% identity in 55 aa overlap). Protein product from Mb2765c detected using shotgun mass spectrometry. Mb2765c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007157" /db_xref="UniProtKB/TrEMBL:A0A1R3Y211" /protein_id="SIU01383.1" /translation="MANPFVKAWKYLMALFSSKIDEHADPKVQIQQAIEEAQRTHQAL TQQAAQVIGNQRQLEMRLNRQLADIEKLQVNVRQALTLADQATAAGDAAKATEYNNAA EAFAAQLVTAEQSVEDLKTLHDQALSAAAQAKKAVERNAMVLQQKIAERTQLLSQLEQ AKMQEQVSASLRSMSELAAPGNTPSLDEVRDKIERRYANAIGSAELAESSVQGRMLEV EQAGIQMAGHSRLEQIRASMRGEALPAGGTTATPRPATETSGGAIAEQPYGQ" CDS complement(3019179..3019517) /codon_start=1 /transl_table=11 /gene="clgr" /locus_tag="BQ2027_MB2766C" /product="transcriptional regulatory protein clgr" /note="Mb2766c, -, len: 112 aa. Equivalent to Rv2745c, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Possible transcriptional regulatory protein, highly similar to O86815|SC7C7.10 HYPOTHETICAL 13.6 KDA PROTEIN from Streptomyces coelicolor (126 aa), FASTA scores: opt: 300, E(): 2.4e-13, (60.45% identity in 86 aa overlap); and highly similar to other transcriptional regulators e.g. Q9X7S1|SC5H1.13c POSSIBLE DNA-BINDING PROTEIN from Streptomyces coelicolor (157 aa), FASTA scores: opt: 254, E(): 3.3e-10, (50.0% identity in 94 aa overlap) (N-terminus longer); Q9F885|POPR TRANSCRIPTIONAL REGULATOR from Streptomyces lividans (148 aa), FASTA scores: opt: 248, E(): 7.8e-10, (53.6% identity in 97 aa overlap) (N-terminus longer); Q9FCH1|2SCD46.12 PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (141 aa), FASTA scores: opt: 162, E(): 0.00038, (33.0% identity in 106 aa overlap); etc. Mb2766c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y247" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010982" /db_xref="UniProtKB/TrEMBL:A0A1R3Y247" /protein_id="SIU01384.1" /translation="MAALVREVVGDVLRGARMSQGRTLREVSDSARVSLGYLSEIERG RKEPSSELLSAICTALQLPLSVVLIDAGERMARQERLARATPAGRATGATIDASTKVV IAPVVSLAVA" CDS complement(3019588..3020217) /codon_start=1 /transl_table=11 /gene="pgsA3" /locus_tag="BQ2027_MB2767C" /product="PROBABLE PGP SYNTHASE PGSA3 (CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE) (PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE)" /note="Mb2767c, pgsA3, len: 209 aa. Equivalent to Rv2746c, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Probable pgsA3, PGP synthase (EC 2.7.8.5) (see citation below), transmembrane protein, equivalent, but longer 19 aa, to Q49839|O08087|PGSA|ML0979 PGSA from Mycobacterium leprae (193 aa), FASTA scores: opt: 925, E(): 3.7e-53, (77.15% identity in 188 aa overlap). Also highly similar to O86813|PGSA PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE from Streptomyces coelicolor (263 aa), FASTA scores: opt: 692, E(): 6.6e-38, (57.85% identity in 185 aa overlap) (has its N-terminus longer); and similar to others (generally with N-terminus shorter) e.g. Q99XI0|PGSA|SPY2196 PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE from Streptococcus pyogenes (180 aa), FASTA scores: opt: 368, E(): 5.4e-17, (39.9% identity in 168 aa overlap); Q9ZE96|PGSA_RICPR|PGSA|RP049 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Rickettsia prowazekii (181 aa), FASTA scores: opt: 343, E(): 2.3e-15, (40.1% identity in 172 aa overlap); P06978|PGSA_ECOLI|PGSA|B1912|Z3000|ECS2650 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Escherichia coli strains K12 and O157:H7 (181 aa), FASTA scores: opt: 322, E(): 5.3e-14, (34.45% identity in 180 aa overlap); etc. Also some similarity to PGSA2|Rv1822|MTCY1A11.21c PROBABLE CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Mycobacterium tuberculosis (209 aa), FASTA score: (27.1% identity in 166 aa overlap). Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY. Mb2767c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y425" /db_xref="InterPro:IPR000462" /db_xref="InterPro:IPR004570" /db_xref="UniProtKB/TrEMBL:A0A1R3Y425" /protein_id="SIU01385.1" /translation="MSRSTRYSVAVSAQPETGQIAGRARIANLANILTLLRLVMVPVF LLALFYGGGHHSAARVVAWAIFATACITDRFDGLLARNYGMATEFGAFVDPIADKTLI GSALIGLSMLGDLPWWVTVLILTRELGVTVLRLAVIRRGVIPASWGGKLKTFVQAVAI GLFVLPLSGPLHVAAVVVMAAAILLTVITGVDYVARALRDIGGIRQTAS" CDS 3020248..3020772 /codon_start=1 /transl_table=11 /gene="arga" /locus_tag="BQ2027_MB2768" /product="probable l-glutamate alpha-n-acetyltranferase arga (alpha-n-acetylglutamate synthase)" /note="Mb2768, -, len: 174 aa. Equivalent to Rv2747, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Possible transferase (EC 2.-.-.-), equivalent to O05559|ML0978|MLCB33.08 PUTATIVE ACETYLTRANSFERASE from Mycobacterium leprae (180 aa), FASTA scores: opt: 997, E(): 1.2e-57, (86.8% identity in 174 aa overlap). Also similar to various transferases e.g. Q9X8N2|SCE94.27c PUTATIVE ACETYLTRANSFERASE from Streptomyces coelicolor (169 aa), FASTA scores: opt: 656, E(): 1.3e-35, (60.35% identity in 164 aa overlap); C-terminus of Q9K3D6|ARGH(A) ARGININOSUCCINASE AND N-ACETYLGLUTAMATE SYNTHASE from Moritella sp. 2693 (629 aa), FASTA scores: opt: 243, E(): 2e-08, (31.95% identity in 144 aa overlap); C-terminus of Q9JW21|ARGA OR NMA0580 PUTATIVE ACETYLGLUTAMATE SYNTHASE from Neisseria meningitidis serogroup A (436 aa), FASTA scores: opt: 201, E(): 7.8e-06, (32.75% identity in 119 aa overlap); etc. Also similar to hypothetical proteins e.g. O67372|AQ_1359 HYPOTHETICAL 21.1 KDA PROTEIN from Aquifex aeolicus (181 aa), FASTA scores: opt: 348, E(): 1.2e-15, (42.35% identity in 137 aa overlap). Protein product from Mb2768 detected using SWATH mass spectrometry. Mb2768 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2W2" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR010167" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2W2" /protein_id="SIU01386.1" /translation="MTERPRDCRPVVRRARTSDVPAIKQLVDTYAGKILLEKNLVTLY EAVQEFWVAEHPDLYGKVVGCGALHVLWSDLGEIRTVAVDPAMTGHGIGHAIVDRLLQ VARDLQLQRVFVLTFETEFFARHGFTEIEGTPVTAEVFDEMCRSYDIGVAEFLDLSYV KPNILGNSRMLLVL" CDS complement(3020841..3023492) /codon_start=1 /transl_table=11 /gene="ftsK" /locus_tag="BQ2027_MB2769C" /product="POSSIBLE CELL DIVISION TRANSMEMBRANE PROTEIN FTSK" /note="Mb2769c, ftsK, len: 883 aa. Equivalent to Rv2748c, len: 883 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 883 aa overlap). Possible ftsK, cell division transmembrane protein, equivalent to O05560|ML0977|FTSK|MLCB33.09c CELL DIVISION PROTEIN from Mycobacterium leprae (886 aa), FASTA scores: opt: 3147, E(): 7.9e-175, (78.1% identity in 885 aa overlap). Also similar to other members of the spoIIIE/ftsK family e.g. O86810|SC7C7.05 FTSK HOMOLOG from Streptomyces coelicolor (929 aa), FASTA scores: opt: 2256, E(): 3.8e-123, (49.05% identity in 924 aa overlap); Q9CF25|FTSK CELL DIVISION PROTEIN FTSK from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (763 aa), FASTA scores: opt: 1438, E(): 9.1e-76, (37.7% identity in 751 aa overlap); AAK75005|Q97RE4|SP0878 SPOE FAMILY PROTEIN from Streptococcus pneumoniae (767 aa), FASTA scores: opt: 1405, E(): 7.5e-74, (48.0% identity in 477 aa overlap); P46889|FTSK_ECOLI|B0890 from Escherichia coli strain K12 (1329 aa), FASTA scores: opt: 759, E(): 0, (44.5% identity in 537 aa overlap) (similarity in C-terminal half); etc. Equivalent to AAK47139 from Mycobacterium tuberculosis strain CDC1551 (968 aa) but shorter 85 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE FTSK/SPOIIIE FAMILY. Protein product from Mb2769c detected using SWATH mass spectrometry. Mb2769c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y287" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR018541" /db_xref="InterPro:IPR025199" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR041027" /db_xref="UniProtKB/TrEMBL:A0A1R3Y287" /protein_id="SIU01387.1" /translation="MLGPPGTPRVGRRDAARSLVTLLRRPWQRGEQIAVTSVADGVDG VIATRLAVMSSKTVARSGTRTSRSKATSRGASRSARSAVPRKRSRPVKGVGRPSRRHH RSLLVSTGLACGRAMRAVWMMAAKGTGGAARSIGRARDIEPGHRRDGIALVLLGLAVV VAASSWFDAARPLGAWVDALLRTFIGSAVVMLPLVAAAVAVVLMRTSPNPDSRPRLIL GASLIGLSFLGLCHLWAGSPEAPESRLRAAGFIGFAIGGPLSDGLTAWIAAPLLFIGA LFGLLLLAGITIREVPDAMRAMFGTRLLPREYADDFEDFADFDGDDADTVEVARQDFS DGYYDEVPLCSDDGPPAWPSAEVPQDDTATIPEASAGRGSGRRGRRKDTQVLDRIVEG PYTLPSLDLLISGDPPKKRSAANTHMAGAIGEVLTQFKVDAAVTGCTRGPTVTRYEVE LGPGVKVEKITALQRNIAYAVATESVRMLAPIPGKSAVGIEVPNTDREMVRLADVLTA RETRRDHHPLVIGLGKDIEGDFISANLAKMPHLLVAGSTGSGKSSFVNSMLVSLLTRA TPEEVRMILIDPKMVELTPYEGIPHLITPIITQPKKAAAALAWLVDEMEQRYQDMQAS RVRHIDDFNDKVRSGAITAPLGSQREYRPYPYVVAIVDELADLMMTAPRDVEDAIVRI TQKARAAGIHLVLATQRPSVDVVTGLIKTNVPSRLAFATSSLTDSRVILDQAGAEKLI GMGDGLFLPMGASKPLRLQGAYVSDEEIHAVVTACKEQAEPEYTEGVTTAKPTAERTD VDPDIGDDMDVFLQAVELVVSSQFGSTSMLQRKLRVGFAKAGRLMDLMETRGIVGPSE GSKAREVLVKPDELAGTLAAIRGDGGE" CDS 3023491..3023805 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2770" /product="Antibiotic biosynthesis monooxygenase" /note="Mb2770, -, len: 104 aa. Equivalent to Rv2749, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Conserved hypothetical protein, showing some similarity with Q9I1R9|PA2198 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (114 aa), FASTA scores: opt: 157, E(): 0.00081, (35.0% identity in 100 aa overlap); and O86332|Rv0793|MTV042.03 HYPOTHETICAL 11.2 KDA PROTEIN from Mycobacterium tuberculosis (101 aa), FASTA scores: opt: 143, E(): 0.0062, (26.9% identity in 93 aa overlap). Protein product from Mb2770 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2770 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007138" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2I1" /protein_id="SIU01388.1" /translation="MPVVVVATLTAKPESVDTVRDILTRAVDDVHREPGCQLYALHET GETFIFVEQWADAEALKAHSGAPAVATMFTAAGEHLVGAPDIKLLQPVPAGDPSKGQL RR" CDS 3023802..3024620 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2771" /product="PROBABLE DEHYDROGENASE" /note="Mb2771, -, len: 272 aa. Equivalent to Rv2750, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Probable dehydrogenase (EC 1.-.-.-), highly similar to other dehydrogenases/reductases e.g. Q9L5X5|COX CHOLESTEROL OXIDASE from Nocardioides simplex (Arthrobacter simplex) (270 aa), FASTA scores: opt: 836, E(): 1.8e-43, (55.7% identity in 264 aa overlap); Q9RA05|LIMC CARVEOL DEHYDROGENASE from Rhodococcus erythropolis (277 aa), FASTA scores: opt: 792, E(): 8.6e-41, (48.55% identity in 274 aa overlap); Q9F5J1|SIM-NJ1|SIMD2 PUTATIVE 3-KETO-ACYL-REDUCTASE from Streptomyces antibioticus (273 aa), FASTA scores: opt: 435, E(): 3.7e-19, (35.75% identity in 263 aa overlap); etc. Also highly similar to AAK44941MT0715 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Mycobacterium tuberculosis strain CDC1551 (275 aa), FASTA scores: opt: 702, E(): 2.4e-35, (44.45% identity in 270 aa overlap); and similar to many other Mycobacterium tuberculosis dehydrogenases. Protein product from Mb2771 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2771 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y233" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR023985" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y233" /protein_id="SIU01389.1" /translation="MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQI ASVPYPLSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDIVV ANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVLISSAAGLVGI GSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHPCGVDTPMINNEFFQQWLT TADMDAPHNLGNALPVELVQPTDIANAVAWLASEEARYVTGVTLPVDAGFVNKR" CDS 3024624..3025514 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2772" /product="O-Methyltransferase involved in polyketide biosynthesis" /note="Mb2772, -, len: 296 aa. Equivalent to Rv2751, len: 296 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 296 aa overlap). Conserved hypothetical protein, similar in part to others e.g. Q98LR1|MLR0915 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (299 aa), FASTA scores: opt: 279, E(): 1.6e-11, (32.85% identity in 210 aa overlap); Q9FBX1|SC8E7.10 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (283 aa), FASTA scores: opt: 232, E(): 2.4e-08, (27.9% identity in 269 aa overlap); Q9FMY9 HYPOTHETICAL PROTEIN (GENOMIC DNA, CHROMOSOME 5, P1 CLONE:MJB21) from Arabidopsis thaliana (Mouse-ear cress) (370 aa), FASTA scores: opt: 205, E(): 2.1e-06, (28.9% identity in 211 aa overlap); etc. Also similar in part to several proteins from Mycobacterium tuberculosis: P72053|Rv3787c|MTCY13D12.21 HYPOTHETICAL 33.4 KDA PROTEIN (308 aa), FASTA scores: opt: 266, E(): 1.3e-10, (29.6% identity in 267 aa overlap); O53795|MBE50c|Rv0731c|MTV041.05c HYPOTHETICAL 34.9 KDA PROTEIN (318 aa), FASTA scores: opt: 266, E(): 1.3e-10, (32.05% identity in 281 aa overlap); O53841|Rv0830|MTV043.22 HYPOTHETICAL 33.4 KDA PROTEIN (301 aa), FASTA scores: opt: 263, E(): 2e-10, (31.3% identity in 262 aa overlap); etc. BELONGS TO THE MTCY13D12.21 / MTCY210.45C / MTCY78.29C FAMILY. Protein product from Mb2772 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2772 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y226" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y226" /protein_id="SIU01390.1" /translation="MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLR WLAGATRSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGAGL DTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALDFEHDDLLTAL AEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPGSRMVFTYVRRDFIDGTNR YGTRTLYHTVRQRRQLWHFGLDPEEVAGFLADYGWRLTEQAGPEELVQRYVEPTGRNL NASQIEWSAYAEKSEPVTPR" CDS complement(3025501..3027177) /codon_start=1 /transl_table=11 /gene="rnj" /locus_tag="BQ2027_MB2773C" /product="Ribonuclease J2 (endoribonuclease in RNA processing)" /note="Mb2773c, -, len: 558 aa. Equivalent to Rv2752c, len: 558 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 558 aa overlap). Conserved hypothetical protein, equivalent to Q9CBW5|ML1512 HYPOTHETICAL PROTEIN from Mycobacterium leprae (558 aa), FASTA scores: opt: 3301, E(): 1.2e-195, (89.05% identity in 558 aa overlap). Also highly similar to other hypothetical proteins from a wide range of prokaryotes e.g. CAC19480|P54122|YOR4_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (718 aa), FASTA scores: opt: 2142, E(): 3.5e-124, (57.2% identity in 554 aa overlap) (N-terminus longer); O86842|SC9A10.09 from Streptomyces coelicolor (561 aa), FASTA scores: opt: 2077, E(): 2.9e-120, (55.95% identity in 556 aa overlap); Q9ZI80 from Streptomyces toyocaensis (528 aa), FASTA scores: opt: 1843, E(): 7.3e-106, (52.45% identity in 528 aa overlap) (N-terminus shorter 30 aa); etc. Protein product from Mb2773c detected using SWATH mass spectrometry. Mb2773c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y230" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR004613" /db_xref="InterPro:IPR011108" /db_xref="InterPro:IPR030854" /db_xref="InterPro:IPR036866" /db_xref="InterPro:IPR041636" /db_xref="InterPro:IPR042173" /db_xref="UniProtKB/TrEMBL:A0A1R3Y230" /protein_id="SIU01391.1" /translation="MDVDLPPPGPLTSGGLRVTALGGINEIGRNMTVFEHLGRLLIID CGVLFPGHDEPGVDLILPDMRHVEDRLDDIEALVLTHGHEDHIGAIPFLLKLRPDIPV VGSKFTLALVAEKCREYRITPVFVEVREGQSTRHGVFECEYFAVNHSTPDALAIAVYT GAGTILHTGDIKFDQLPPDGRPTDLPGMSRLGDTGVDLLLCDSTNAEIPGVGPSESEV GPTLHRLIRGADGRVIVACFASNVDRVQQIIDAAVALGRRVSFVGRSMVRNMRVARQL GFLRVADSDLIDIAAAETMAPDQVVLITTGTQGEPMSALSRMSRGEHRSITLTAGDLI VLSSSLIPGNEEAVFGVIDALSKIGARVVTNAQARVHVSGHAYAGELLFLYNGVRPRN VMPVHGTWRMLRANAKLAASTGVPQESILLAENGVSVDLVAGKASISGAVPVGKMFVD GLIAGDVGDITLGERLILSSGFVAVTVVVRRGTGQPLAAPHLHSRGFSEDPKALEPAV RKVEAELESLVAANVTDPIRIAQGVRRTVGKWVGETYRRQPMIVPTVIEV" CDS complement(3027208..3028110) /codon_start=1 /transl_table=11 /gene="dapA" /locus_tag="BQ2027_MB2774C" /product="PROBABLE DIHYDRODIPICOLINATE SYNTHASE DAPA (DHDPS) (DIHYDRODIPICOLINATE SYNTHETASE)" /note="Mb2774c, dapA, len: 300 aa. Equivalent to Rv2753c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Probable dapA, dihydrodipicolinate synthase (EC 4.2.1.52), equivalent to Q9CBW4|DAPA_MYCLE|ML1513 DIHYDRODIPICOLINATE SYNTHASE from Mycobacterium leprae (300 aa), FASTA scores: opt: 1699, E(): 2.2e-98, (86.65% identity in 300 aa overlap). Also highly similar to many e.g. P19808|DAPA_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (301 aa), FASTA scores: opt: 1089, E(): 2e-60, (58.7% identity in 288 aa overlap); O86841|DAPA_STRCO|SC9A10.08 from Streptomyces coelicolor (299 aa), FASTA scores: opt: 1044, E(): 1.3e-57, (55.75% identity in 287 aa overlap); P05640|DAPA_ECOLI (292 aa), FASTA scores: opt: 515, E(): 0, (33.8% identity in 287 aa overlap); etc. Contains PS00665 and PS00666 Dihydrodipicolinate synthetase signatures 1 and 2. BELONGS TO THE DHDPS FAMILY. Protein product from Mb2774c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2774c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63946" /db_xref="InterPro:IPR002220" /db_xref="InterPro:IPR005263" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR020624" /db_xref="InterPro:IPR020625" /db_xref="UniProtKB/Swiss-Prot:P63946" /protein_id="SIU01392.1" /translation="MTTVGFDVAARLGTLLTAMVTPFSGDGSLDTATAARLANHLVDQ GCDGLVVSGTTGESPTTTDGEKIELLRAVLEAVGDRARVIAGAGTYDTAHSIRLAKAC AAEGAHGLLVVTPYYSKPPQRGLQAHFTAVADATELPMLLYDIPGRSAVPIEPDTIRA LASHPNIVGVKDAKADLHSGAQIMADTGLAYYSGDDALNLPWLAMGATGFISVIAHLA AGQLRELLSAFGSGDIATARKINIAVAPLCNAMSRLGGVTLSKAGLRLQGIDVGDPRL PQVAATPEQIDALAADMRAASVLR" CDS complement(3028179..3028931) /codon_start=1 /transl_table=11 /gene="thyx" /locus_tag="BQ2027_MB2775C" /product="probable thymidylate synthase thyx (ts) (tsase)" /note="Mb2775c, -, len: 250 aa. Equivalent to Rv2754c, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q9CBW3|YF14_MYCLE|ML1514 HYPOTHETICAL 28.0 KDA PROTEIN from Mycobacterium leprae (254 aa), FASTA scores: opt: 1351, E(): 1e-84, (81.5% identity in 254 aa overlap). Also highly similar to 40111|YDAP_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (250 aa), FASTA scores: opt: 1071, E(): 1.2e-65, (62.85% identity in 245 aa overlap); Q05259|VG48_BPML5 GENE 48 PROTEIN (GP48) from Mycobacteriophage L5 (243 aa), FASTA scores: opt: 610, E(): 3.2e-34, (49.55% identity in 220 aa overlap); O64238|VG48_BPMD2 GENE 48 PROTEIN (GP48) from Mycobacteriophage D29 (235 aa), FASTA scores: opt: 593, E(): 4.5e-33, (46.95% identity in 245 aa overlap); etc. BELONGS TO THE THY1 FAMILY. Protein product from Mb2775c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2775c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66931" /db_xref="InterPro:IPR003669" /db_xref="InterPro:IPR036098" /db_xref="UniProtKB/Swiss-Prot:P66931" /protein_id="SIU01393.1" /translation="MAETAPLRVQLIAKTDFLAPPDVPWTTDADGGPALVEFAGRACY QSWSKPNPKTATNAGYLRHIIDVGHFSVLEHASVSFYITGISRSCTHELIRHRHFSYS QLSQRYVPEKDSRVVVPPGMEDDADLRHILTEAADAARATYSELLAKLEAKFADQPNA ILRRKQARQAARAVLPNATETRIVVTGNYRAWRHFIAMRASEHADVEIRRLAIECLRQ LAAVAPAVFADFEVTTLADGTEVATSPLATEA" CDS complement(3029175..3029450) /codon_start=1 /transl_table=11 /gene="hsdS'" /locus_tag="BQ2027_MB2776C" /product="possible type i restriction/modification system specificity determinant (fragment) hsds.1 (s protein)" /note="Mb2776c, hsdS', len: 91 aa. Equivalent to Rv2755c, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 91 aa overlap). Possible hsdS', fragment of type I restriction/modification system specificity determinant (S protein), similar to the N-terminus of other hsdS proteins e.g. O34140|HSDS from Klebsiella pneumoniae (439 aa), FASTA scores: opt: 303, E(): 2.1e-13, (46.65% identity in 90 aa overlap); P72419|STY|SBLI from Salmonella typhimurium (434 aa), FASTA scores: opt: 278, E(): 1.1e-11, (47.65% identity in 86 aa overlap); and Q9P9X9|XF2741 from Xylella fastidiosa (412 aa), FASTA scores: opt: 144, E(): 0.015, (31.7% identity in 82 aa overlap). Also some similarity with O33303|Rv2761c|MTV002.26c|HSDS POSSIBLE TYPE I RESTRICTION/MODIFICATION SYSTEM SPECIFICITY DETERMINANT from Mycobacterium tuberculosis (364 aa), FASTA scores: opt: 145, E(): 0.012, (29.9% identity in 87 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y254" /protein_id="SIU01394.1" /translation="MSDGWKTLRFGEVLELQRGHDLPAASRGSGTVPVIGSFGVTGMH DTAAYDGPGVAIGRSGAAIGTATFVAGPIWPLDTCLFVRDFKGNDPR" CDS complement(3029447..3031069) /codon_start=1 /transl_table=11 /gene="hsdM" /locus_tag="BQ2027_MB2777C" /product="POSSIBLE TYPE I RESTRICTION/MODIFICATION SYSTEM DNA METHYLASE HSDM (M PROTEIN) (DNA METHYLTRANSFERASE)" /note="Mb2777c, hsdM, len: 540 aa. Equivalent to Rv2756c, len: 540 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 540 aa overlap). Possible hsdM, type I restriction/modification system DNA methylase (M protein) (EC 2.1.1.-), highly similar to others e.g. Q9P9X8|XF2742 from Xylella fastidiosa (519 aa), FASTA scores: opt: 1613, E(): 1.9e-96, (52.3% identity in 543 aa overlap); O34139|HSDM from Klebsiella pneumoniae (539 aa), FASTA scores: opt: 1267, E(): 4.4e-74, (45.9% identity in 549 aa overlap); P72418|STY|SBLI|HSDM from Salmonella typhimurium (539 aa), FASTA scores: opt: 1263, E(): 8e-74, (45.7% identity in 549 aa overlap); etc. Possible alternative start site (GTG) overlapping with termination codon of previous ORF 90 bp upstream. Note that the corresponding endonuclease (M protein) does not appear to be present in Mycobacterium tuberculosis. Protein product from Mb2777c detected using SWATH mass spectrometry. Mb2777c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y434" /db_xref="InterPro:IPR003356" /db_xref="InterPro:IPR022749" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR038333" /db_xref="UniProtKB/TrEMBL:A0A1R3Y434" /protein_id="SIU01395.1" /translation="MPPRKKQAPQAPSTMKELKDTLWKAADKLRGSLSASQYKDVILG LVFLKYVSDAYDERREAIRAELAAEGMEESQIEDLIDDPEQYQGYGVFVVPVSARWKF LAENTKGKPAVGGEPAKNIGQLIDEAMDAVMKANPTLGGTLPRLYNKDNIDQRRLGEL IDLFNSARFSRQGEHRARDLMGEVYEYFLGNFARAEGKRGGEFFTPPSVVKVIVEVLE PSSGRVYDPCCGSGGMFVQTEKFIYEHDGDPKDVSIYGQESIEETWRMAKMNLAIHGI DNKGLGARWSDTFARDQHPDVQMDYVMANPPFNIKDWARNEEDPRWRFGVPPANNANY AWIQHILYKLAPGGRAGVVMANGSMSSNSNGEGDIRAQIVEADLVSCMVALPTQLFRS TGIPVCLWFFAKDKAAGKQGSIDRCGQVLFIDARELGDLVDRAERALTNEEIVRIGDT FHAWRGSKSAAVKGIMYEDVPGFCKSATLAEIKATDYALTPGRYVGTPAVEDDGEPID EKMARLSKALLEAFDESARLERVVREQLGRLR" CDS complement(3031156..3031572) /codon_start=1 /transl_table=11 /gene="vapc21" /locus_tag="BQ2027_MB2778C" /product="possible toxin vapc21" /note="Mb2778c, -, len: 138 aa. Equivalent to Rv2757c, len: 138 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 138 aa overlap). Conserved hypothetical protein, similar to several other M. tuberculosis hypothetical proteins e.g. P96411|Rv0229c| MTCY08D5.24c (226 aa), FASTA scores: opt: 354, E(): 4.6e-18, (45.25% identity in 137 aa overlap) (N-terminus longer 89 aa); P95007|RV2546|MTCY159.10c (137 aa), FASTA scores: opt: 265, E(): 7.5e-12, (38.5% identity in 135 aa overlap); O07228|Rv0301|MTCY63.06 (141 aa), FASTA scores: opt: 259, E(): 2.1e-11, (42.4% identity in 132 aa overlap); etc. Mb2778c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2X2" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X2" /protein_id="SIU01396.1" /translation="MTTRYLLDKSAAYRAHLPAVRHRLEPLMERGLLARCGITDLEFG VSARSREDHRTLGTYRRDALEYVNTPDTVWVRAWEIQEALTDKGFHRSVKIPDLIIAA VAEHHGIPVMHYDQDFERIAAITRQPVEWVVAPGTA" CDS complement(3031569..3031835) /codon_start=1 /transl_table=11 /gene="vapb21" /locus_tag="BQ2027_MB2779C" /product="possible antitoxin vapb21" /note="Mb2779c, -, len: 88 aa. Equivalent to Rv2758c, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Conserved hypothetical protein, similar to several other Mycobacterium tuberculosis hypothetical proteins e.g. P95008|Rv2545 (92 aa), FASTA scores: opt: 151, E(): 0.00028, (66.65% identity in 45 aa overlap); Q10771|YF60_MYCTU|RV1560|MT1611|MTCY48.05c (72 aa), FASTA scores: opt: 106, E(): 0.52, (39.15% identity in 46 aa overlap); O06565|Rv1113|MTCY22G8.02 (65 aa), FASTA scores: opt: 97, E(): 2.2, (33.35% identity in 69 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Mb2779c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019239" /db_xref="UniProtKB/TrEMBL:A0A1R3Y296" /protein_id="SIU01397.1" /translation="MHRGYALVVCSPGVTRTMIDIDDDLLARAAKELGTTTKKDTVHA ALRAALRASAARSLMNRMAENATGTQDEALVNAMWRDGHPENTA" CDS complement(3031861..3032256) /codon_start=1 /transl_table=11 /gene="vapc42" /locus_tag="BQ2027_MB2780C" /product="possible toxin vapc42. contains pin domain." /note="Mb2780c, -, len: 131 aa. Equivalent to Rv2759c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Conserved hypothetical protein, highly similar to three M. tuberculosis hypothetical proteins O07769|Y609_MYCTU|Rv0609|MT0638|MTCY19H5.13c (133 aa), FASTA scores: opt: 364, E(): 5.1e-18, (49.6% identity in 131 aa overlap); P96914|Y624_MYCTU|Rv0624|MT0652|MTCY20H10 .05 (131 aa), FASTA scores: opt: 324, E(): 2.9e-15, (42.85% identity in 126 aa overlap); and Q10874|YJ82_MYCTU|Rv1982c|MT2034|MTCY39.37 (139 aa), FASTA scores: opt: 271, E(): 1.4e-11, (38.6% identity in 127 aa overlap). Also similar to other hypothetical proteins from other bacteria e.g. CAC45376|SMC00900 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (128 aa), FASTA scores: opt: 286, E(): 1.2e-12, (39.55% identity in 129 aa overlap); Q981I7|MLL9357 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (131 aa), FASTA scores: opt: 257, E(): 1.2e-10, (36.35% identity in 132 aa overlap); Q9AAG1|CC0639 HYPOTHETICAL PROTEIN from Caulobacter crescentus (131 aa), FASTA scores: opt: 217, E(): 6.9e-08, (33.35% identity in 132 aa overlap); etc. Mb2780c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67243" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P67243" /protein_id="SIU01398.1" /translation="MIVDTSAIVAIVSGESGAQVLKEALERSPNSRMSAPNYVELCAI MQRRDRPEISRLVDRLLDDYGIQVEAVDADQARVAAQAYRDYGRGSGHPARLNLGDTY SYALAQVTGEPLLFRGDDFTHTDIRPACT" CDS complement(3032253..3032522) /codon_start=1 /transl_table=11 /gene="vapb42" /locus_tag="BQ2027_MB2781C" /product="possible antitoxin vapb42" /note="Mb2781c, -, len: 89 aa. Equivalent to Rv2760c, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Conserved hypothetical protein, showing some similarity with two hypothetical proteins from Mycobacterium tuberculosis O07770|Rv0608|MTCY19H5.14c (81 aa), FASTA scores: opt: 128, E(): 0.057, (37.5% identity in 88 aa overlap); and P96913|Rv0623|MTCY20H10.04 (84 aa), FASTA scores: opt: 99, E(): 5.5, (37.1% identity in 89 aa overlap). Also showing some similarity with CAC45377|SMC00899 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (84 aa), FASTA scores: opt: 116, E(): 0.38, (36.25% identity in 91 aa overlap). Protein product from Mb2781c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR011660" /db_xref="UniProtKB/TrEMBL:A0A1R3Y243" /protein_id="SIU01399.1" /translation="MSLNIKSQRTVALVRELAARTGTNQTAAVEDAVARRLSELDRED RARAEARRAAAEQTLRDLDKLLSDDDKRLIRRHEVDLYDDSGLPR" CDS complement(3032532..3033626) /codon_start=1 /transl_table=11 /gene="hsdS" /locus_tag="BQ2027_MB2782C" /product="POSSIBLE TYPE I RESTRICTION/MODIFICATION SYSTEM SPECIFICITY DETERMINANT HSDS (S PROTEIN)" /note="Mb2782c, hsdS, len: 364 aa. Equivalent to Rv2761c, len: 364 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 364 aa overlap). Possible hsdS, type I restriction/modification system specificity determinant (S protein), similar in part to other hsdS protein (S PROTEINS) e.g. Q9P9X9|XF2741 from Xylella fastidiosa (412 aa), FASTA scores: opt: 252, E(): 7.4e-09, (24.95% identity in 401 aa overlap); N-terminus of Q9RC12 TYPE I S-SUBUNIT from Lactobacillus delbrueckii (subsp. lactis) (389 aa), FASTA scores: opt: 232, E(): 1.4e-07, (28.1% identity in 185 aa overlap); N-terminus of P72419|STY|SBLI from Salmonella typhimurium (434 aa), FASTA scores: opt: 221, E(): 8e-07, (28.45% identity in 130 aa overlap); C-terminus of P17222|PRRB_ECOLI from Escherichia coli strain CTR5X (401 aa), FASTA scores: opt: 197, E(): 2.8e-05, (27.05% identity in 148 aa overlap); etc. SEEMS TO BELONG TO TYPE-I RESTRICTION SYSTEM S METHYLASE FAMILY. Protein product from Mb2782c detected using SWATH mass spectrometry. Mb2782c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y232" /db_xref="InterPro:IPR000055" /db_xref="UniProtKB/TrEMBL:A0A1R3Y232" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01400.1" /translation="MSRVEKVEKVRLGDHLDFSNGHTSGHTSPASEPGGRYPVYGANG VIGYSAQHNARGPLIVVGRVGSYCGSLRYCDSDVWVTDNALACRAKKPEETRYWYYAL LGFGLNRYRAGSGQPLLSQGVLRNVSVSAVAAPDRPRIGEILGAFDDKIAANDRVIEA AEALMLAIVGRLSAYVPLSSLASRSTACLDAQHFDSTVAHYSFAAFDGGAQPSRVGGR TIRSAKLVVSQPCVLFPKLNPRIPRIWNITSLPSEMALASTEFVVLRPVGVDTSALWA ALRQPDVLAELRQLVGGMTGSRQRIQPTQLLRVWVRDVRRLTPGHAAAIANLGALCNE RRIESARLASCRDALLPLLMSGIDGLPAGR" CDS complement(3033623..3033895) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2783C" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb2783c, -, len: 90 aa. Equivalent to 3' end of Rv2762c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Conserved hypothetical protein, similar to C-terminus of hypothetical proteins: Q9A380|CC3324 from Caulobacter crescentus (409 aa), FASTA scores: opt: 181, E(): 9.8e-05, (43.55% identity in 101 aa overlap); Q98KQ4|MLR1373 from Rhizobium loti (Mesorhizobium loti) (399 aa), FASTA scores: opt: 174, E(): 0.00028, (46.35% identity in 82 aa overlap); and Q9HZZ9|PA2844 from Pseudomonas aeruginosa (402 aa), FASTA scores: opt: 158, E(): 0.0033, (40.0% identity in 80 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2762c exists as a single gene. In Mycobacterium bovis, a single base transition (c-t) splits Rv2762c into 2 parts, Mb2783c and Mb2784c. Mb2783c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y239" /protein_id="SIU01401.1" /translation="MAAKRRHLYYVRPLDGHPVARVDRKTDRAADSLPVAGVLGELDI PPVTVAEGLAGELASMASWLGLGGIAVSTRGDLAGELCAATKRTNG" CDS complement(3033911..3034042) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2784C" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb2784c, -, len: 43 aa. Equivalent to 5' end of Rv2762c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 43 aa overlap). Conserved hypothetical protein, similar to C-terminus of hypothetical proteins: Q9A380|CC3324 from Caulobacter crescentus (409 aa), FASTA scores: opt: 181, E(): 9.8e-05, (43.55% identity in 101 aa overlap); Q98KQ4|MLR1373 from Rhizobium loti (Mesorhizobium loti) (399 aa), FASTA scores: opt: 174, E(): 0.00028, (46.35% identity in 82 aa overlap); and Q9HZZ9|PA2844 from Pseudomonas aeruginosa (402 aa), FASTA scores: opt: 158, E(): 0.0033, (40.0% identity in 80 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2762c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base transition (c-t) splits Rv2762c into 2 parts, Mb2783c and Mb2784c. Mb2784c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y253" /protein_id="SIU01402.1" /translation="MSAATAAWDRRAAVVVGGVAEPGSAGPIAGADRKRLISRIQVR" CDS complement(3034116..3034595) /codon_start=1 /transl_table=11 /gene="dfrA" /locus_tag="BQ2027_MB2785C" /product="dihydrofolate reductase dfra (dhfr) (tetrahydrofolate dehydrogenase)" /note="Mb2785c, dfrA, len: 159 aa. Equivalent to Rv2763c, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). Probable dfrA (alternate gene name: folA), dihydrofolate reductase (EC 1.5.1.3), equivalent to O30463|FOLA DIHYDROFOLATE REDUCTASE from Mycobacterium avium (see citation below) (181 aa), FASTA scores: opt: 802, E(): 4.5e-48, (70.2% identity in 161 aa overlap); and Q9CBW1|FOLA|ML1518 DIHYDROFOLATE REDUCTASE from Mycobacterium leprae (165 aa), FASTA scores: opt: 782, E(): 1e-46, (70.55% identity in 163 aa overlap). Also highly similar to many e.g. Q9K168|DYR_NEIMB|FOLA|NMB0308 from Neisseria meningitidis (serogroup B) (162 aa), FASTA scores: opt: 469, E(): 3.8e-25, (46.65% identity in 163 aa overlap); P12833|DYR3_SALTY|DHFRIII from Salmonella typhimurium (162 aa), FASTA scores: opt: 367, E(): 4e-18, (45.4% identity in 141 aa overlap); Q59408|DYRC_ECOLI|DHFRXIII from Escherichia coli strain RA33.2 (165 aa), FASTA scores: opt: 313, E(): 2.2e-14, (41.9% identity in 136 aa overlap); etc. Contains PS00075 Dihydrofolate reductase signature. BELONGS TO THE DIHYDROFOLATE REDUCTASE FAMILY. Protein product from Mb2785c detected using SWATH mass spectrometry. Mb2785c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A547" /db_xref="InterPro:IPR001796" /db_xref="InterPro:IPR012259" /db_xref="InterPro:IPR017925" /db_xref="InterPro:IPR024072" /db_xref="UniProtKB/Swiss-Prot:P0A547" /protein_id="SIU01403.1" /translation="MVGLIWAQATSGVIGRGGDIPWRLPEDQAHFREITMGHTIVMGR RTWDSLPAKVRPLPGRRNVVLSRQADFMASGAEVVGSLEEALTSPETWVIGGGQVYAL ALPYATRCEVTEVDIGLPREAGDALAPVLDETWRGETGEWRFSRSGLRYRLYSYHRS" CDS complement(3034666..3035457) /codon_start=1 /transl_table=11 /gene="thyA" /locus_tag="BQ2027_MB2786C" /product="PROBABLE THYMIDYLATE SYNTHASE THYA (TS) (TSASE)" /note="Mb2786c, thyA, len: 263 aa. Equivalent to Rv2764c, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 263 aa overlap). Probable thyA, thymidylate synthase (EC 2.1.1.45), equivalent to Q9CBW0|TYSY_MYCLE|THYA|ML1519 THYMIDYLATE SYNTHASE from Mycobacterium leprae (266 aa), FASTA scores: opt: 1602, E(): 5.9e-102, (85.5% identity in 262 aa overlap). Also highly similar to many e.g. P00470|TYSY_ECOLI|B2827|Z4144|ECS3684|BAB37107|AAG57938 from Escherichia coli strains K12 and O157:H7 (264 aa), FASTA scores: opt: 1309, E(): 5.9e-82, (66.65% identity in 261 aa overlap); P48464|TYSY_SHIFL|THYA from Shigella flexneri (264 aa), FASTA scores: opt: 1303, E(): 1.5e-81, (65.9% identity in 261 aa overlap); P54081|TYSB_BACAM|THYB|THYBA from Bacillus amyloliquefaciens (264 aa), FASTA scores: opt: 1235, E(): 6.7e-77, (66.65% identity in 261 aa overlap); etc. Contains PS00091 Thymidylate synthase active site. BELONGS TO THE THYMIDYLATE SYNTHASE FAMILY. Protein product from Mb2786c detected using SWATH mass spectrometry. Mb2786c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67045" /db_xref="InterPro:IPR000398" /db_xref="InterPro:IPR020940" /db_xref="InterPro:IPR023451" /db_xref="InterPro:IPR036926" /db_xref="UniProtKB/Swiss-Prot:P67045" /protein_id="SIU01404.1" /translation="MTPYEDLLRFVLETGTPKSDRTGTGTRSLFGQQMRYDLSAGFPL LTTKKVHFKSVAYELLWFLRGDSNIGWLHEHGVTIWDEWASDTGELGPIYGVQWRSWP APSGEHIDQISAALDLLRTDPDSRRIIVSAWNVGEIERMALPPCHAFFQFYVADGRLS CQLYQRSADLFLGVPFNIASYALLTHMMAAQAGLSVGEFIWTGGDCHIYDNHVEQVRL QLSREPRPYPKLLLADRDSIFEYTYEDIVVKNYDPHPAIKAPVAV" CDS 3035622..3036359 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2787" /product="PROBABLE ALANINE RICH HYDROLASE" /note="Mb2787, -, len: 245 aa. Equivalent to Rv2765, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 245 aa overlap). Probable ala-rich hydrolase (EC 3.-.-.-), similar to various hydrolases or hypothetical proteins e.g. Q9KYM6|SC9H11.13c PUTATIVE HYDROLASE from Streptomyces coelicolor (251 aa), FASTA scores: opt: 630, E(): 1.4e-33, (43.1% identity in 246 aa overlap); Q9A5T9|CC2358 DIENELACTONE HYDROLASE FAMILY PROTEIN from Caulobacter crescentus (286 aa), FASTA scores: opt: 592, E(): 4.5e-31, (38.45% identity in 242 aa overlap); Q9FCF1|2SCD46.33 PUTATIVE HYDROLASE (DIENELACTONE HYDROLASE FAMILY) from Streptomyces coelicolor (254 aa), FASTA scores: opt: 500, E(): 3.9e-25, (37.7% identity in 252 aa overlap); P73163|DLHH_SYNY3|SLL1298 PUTATIVE CARBOXYMETHYLENEBUTENOLIDASE (DIENELACTONE HYDROLASE) (EC 3.1.1.45) from Synechocystis sp. (strain PCC 6803) (246 aa), FASTA scores: opt: 276, E(): 1.3e-10, (26.95% identity in 230 aa overlap); etc. Protein product from Mb2787 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2787 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y443" /db_xref="InterPro:IPR002925" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y443" /protein_id="SIU01405.1" /translation="MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTF DRMAAKLAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRVTR DADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAAFHPGGLVANS PDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSAAGVPHRIECYPAAHGFAV PDNPSYDAAADERHWAAMTETFGAALN" CDS complement(3036574..3037356) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2788C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb2788c, -, len: 260 aa. Equivalent to Rv2766c, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to others (from bacteria and eukaryota) e.g. Q9K3Y8|2SCG61.27c PUTATIVE SHORT CHAIN OXIDOREDUCTASE from Streptomyces coelicolor (253 aa), FASTA scores: opt: 722, E(): 7.4e-39, (44.75% identity in 248 aa overlap); Q93790|F54F3.4 HYPOTHETICAL SDR PROTEIN from Caenorhabditis elegans (260 aa), FASTA scores: opt: 613, E(): 6.9e-32, (41.7% identity in 247 aa overlap); O95162|O95162|SCAD-SRL PEROXISOMAL SHORT-CHAIN ALCOHOL DEHYDROGENASE from Homo sapiens (Human) (260 aa), FASTA scores: opt: 594, E(): 1.1e-30, (39.6% identity in 250 aa overlap); P51831|FABG_BACSU 3-OXOACYL-[ACYL-CARRIER PROTEIN] from Bacillus subtilis (246 aa), FASTA scores: opt: 504, E(): 4e-28, (37.2% identity in 247 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis acyl-carrier proteins e.g. MTCY03C7.07 (38.5% identity in 244 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Note that previously known as fabG5, a 3-oxoacyl-[acyl-carrier-protein]. Protein product from Mb2788c detected using shotgun mass spectrometry. Mb2788c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2Y3" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y3" /protein_id="SIU01406.1" /translation="MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEA ADEAAAQVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLLEQ DHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSPAMGMYNATKA ALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHEDPLAATIALGRIGEPADI ASAVAFLVSDAASWITGETMIIDGGLLLGNALGFRAAPSTEH" CDS complement(3037353..3037706) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2789C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb2789c, -, len: 117 aa. Equivalent to Rv2767c, len: 117 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 117 aa overlap). Possible membrane protein, showing very weak similarity with Q9L2H7|SCC121.09 PUTATIVE METAL TRANSPORT ABC TRANSPORTER from Streptomyces coelicolor (256 aa), FASTA scores: opt: 110, E(): 1, (33.05% identity in 112 aa overlap). Questionable ORF. Protein product from Mb2789c detected using shotgun mass spectrometry." /db_xref="GOA:A0A1R3Y2A7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A7" /protein_id="SIU01407.1" /translation="MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRS RHLNHARDTPQMVAVAQVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRS PPAESGHHSNRRQAK" CDS complement(3037880..3039064) /codon_start=1 /transl_table=11 /gene="PPE43" /locus_tag="BQ2027_MB2790C" /product="ppe family protein ppe43" /note="Mb2790c, PPE43, len: 394 aa. Equivalent to Rv2768c, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 394 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. upstream ORF O33312|Rv2770c|MTV002.35c (402 aa), FASTA scores: opt: 1135, E(): 6.1e-51, (62.15% identity in 391 aa overlap); and P96362|Rv1039c|MTCY10G2.10 from M. tuberculosis (391 aa), FASTA scores: opt: 1721, E(): 6.8e-81, (70.35% identity in 398 aa overlap). Equivalent to AAK47157 from Mycobacterium tuberculosis strain CDC1551 (462 aa) but shorter 68 aa." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2K0" /protein_id="SIU01408.1" /translation="MDFGALPPEINSTRMYAGAGAAPLMAAGATWNGLAVELSTTASS VESVIMQLTTEQWLGPASMSMVVAAQPYLAWLTYTAESAAHAAAQAMASAAAFEAAFA MTVPPAEVAANRALLAALVATNVLGQNTPAIMATEAHYGEMWAQDALAMYGYAASSAA AGRLNPLITPSQTANMAGLAGQAAAVSHAAAASTVQQVGLGSLISNLPNAVMGFASPL TSAADAAGLGGIIQDIEELLGITFVQNAINGAVNTTAWFVMATIRNAVFLGHAFAALN PATVTAAADAVPAAAAAAGLAHTVTPVGVGGASLTASLGEASSVGGLSVPAGWSTAAP AMTSGTTALEGSGWAVPEEAGPVAAMPGMAGISGAAKGAGAYAGPRYGFKPIVMPKQV VV" CDS complement(3039144..3039971) /codon_start=1 /transl_table=11 /gene="PE27" /locus_tag="BQ2027_MB2791C" /product="pe family protein pe27" /note="Mb2791c, PE27, len: 275 aa. Equivalent to Rv2769c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 275 aa overlap). Member of the Mycobacterium tuberculosis PE family, highly similar to many (notably in N-terminal part) e.g. P96361|Rv1040c|MTCY10G2.09 from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 1111, E(): 5.9e-52, (68.55% identity in 283 aa overlap)." /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR022171" /db_xref="UniProtKB/TrEMBL:A0A1R3Y255" /protein_id="SIU01409.1" /translation="MSFLTTQPEELAAAAGKLETIGSAMVAQNAAAAAPTTTGVIPAA ADEISVLQASLFTAYGTLYQQVSAEAAAVYDLFVKTLGVSAGTYAATEAANSSAAASP LSGIASILGSTPGKVPSWISDIANIFNIGAGNWASAASDLLGLASGGLLPAAEEAALE EGLEGAGLSELGAAEAAVGEAPIAAGLGAAPLAAGLSRASSIGALSVPPSWAGQANLV SSTSTLQGAGWTTAAPHGAAGTVIPGMPGLASATRSSAGFGAPRYGAKPIVMPKPAV" CDS complement(3040295..3041443) /codon_start=1 /transl_table=11 /gene="PPE44" /locus_tag="BQ2027_MB2792C" /product="ppe family protein ppe44" /note="Mb2792c, PPE44, len: 382 aa. Equivalent to Rv2770c, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 382 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. downstream ORF O33310|Rv2768c|MTV002.33c from M. tuberculosis (394 aa), FASTA scores: opt: 1135, E(): 2.2e-53, (62.15% identity in 391 aa overlap); and P96362|Rv1039c|MTCY10G2.10 from Mycobacterium tuberculosis (391 aa), FASTA scores: opt: 1010, E(): 1e-46, (55.95% identity in 395 aa overlap). Equivalent to AAK47159 from M. tuberculosis strain CDC1551 (402 aa) but shorter 20 aa. Start changed since first submission (-20 aa). Mb2792c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y245" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01410.1" /translation="MDFGALPPEVNSARMYGGAGAADLLAAAAAWNGIAVEVSTAASS VGSVITRLSTEHWMGPASLSMAAAVQPYLVWLTCTAESSALAAAQAMASAAAFETAFA LTVPPAEVVANRALLAELTATNILGQNVSAIAATEARYGEMWAQDASAMYGYAAASAV AARLNPLTRPSHITNPAGLAHQAAAVGQAGASASARQVGLSHLISDVADAVLSFASPV MSAADTGLEAVRQFLNLDVPLFVESAFHGLGGVADFATAAIGNMTLLADAMGTVGGAA PGGGAAAAVAHAVAPAGVGGTALTADLGNASVVGRLSVPASWSTAAPATAAGAALDGT GWAVPEEDGPIAVMPPAPGMVVAANSVGADSGPRYGVKPIVMPKHGLF" CDS complement(3041567..3042019) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2793C" /product="Multimeric flavodoxin WrbA" /note="Mb2793c, -, len: 150 aa. Equivalent to Rv2771c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 150 aa overlap). Conserved hypothetical protein, equivalent to Q9CBV8|ML1525 HYPOTHETICAL PROTEIN from Mycobacterium leprae (151 aa), FASTA scores: opt: 489, E(): 1.7e-27, (52.7% identity in 148 aa overlap). Also highly similar to Q9RD46|SCF56.21 HYPOTHETICAL 15.7 KDA PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 671, E(): 2.2e-40, (67.8% identity in 146 aa overlap). Protein product from Mb2793c detected using shotgun mass spectrometry. Mb2793c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y249" /protein_id="SIU01411.1" /translation="MRRLLIVHHTPSPHMQEMFEAVVSGATDPEIEGVEVVRRPALTV SPIEMLEADGYLLGTPANLGYISGALKHAFDVCYYPCLDTTRGRSFGAYIHGNEGTEG AERAVDAITTGLGWVQAAETVVVMGKPSKADIEACWNLGATVAAQLMG" CDS complement(3042105..3042578) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2794C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2794c, -, len: 157 aa. Equivalent to Rv2772c, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 157 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CBV7|ML1526 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 767, E(): 1.5e-43, (76.6% identity in 154 aa overlap); and similar to P46830|YDAB_MYCBO from Mycobacterium bovis (177 aa), FASTA scores: opt: 337, E(): 3.9e-15, (40.75% identity in 135 aa overlap). Also similar to O86837|SC9A10.04 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 338, E(): 3e-15, (43.75% identity in 144 aa overlap). Protein product from Mb2794c detected using SWATH mass spectrometry. Mb2794c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y262" /db_xref="UniProtKB/TrEMBL:A0A1R3Y262" /protein_id="SIU01412.1" /translation="MTRRTLYVQLIIAFMCVAMVAYLVMLGRVAVAMIGSGRAAAAGL GLALLILPVIGLWAMIATLRAGFAYQRLARLIAEDGLDIDASALPRRASGRIQRDAAD ALFAAVRTELEDDADDWRRWYRLARAYDYAGDRRRAREAMKTALQLEGRARPGAR" CDS complement(3042590..3043327) /codon_start=1 /transl_table=11 /gene="dapB" /locus_tag="BQ2027_MB2795C" /product="DIHYDRODIPICOLINATE REDUCTASE DAPB (DHPR)" /note="Mb2795c, dapB, len: 245 aa. Equivalent to Rv2773c, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 245 aa overlap). dapB, dihydrodipicolinate reductase (EC 1.3.1.26) (see first citation below), highly similar to many e.g. P40110|DAPB_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (248 aa), FASTA scores: opt: 1030, E(): 1.8e-58, (65.45% identity in 246 aa overlap); O86836|DAPB_STRCO|SC9A10.03 from Streptomyces coelicolor (250 aa), FASTA scores: opt: 997, E(): 2.3e-56, (61.15% identity in 247 aa overlap); P42976|DAPB_BACSU from Bacillus subtilis (267 aa), FASTA scores: opt: 608, E(): 1.7e-31, (45.95% identity in 209 aa overlap); P46829|DAPB_MYCBO from Mycobacterium bovis (see second citation below) (271 aa), FASTA scores: opt: 505, E(): 6.3e-25, (36.2% identity in 246 aa overlap); etc. BELONGS TO THE DIHYDRODIPICOLINATE REDUCTASE FAMILY. Protein product from Mb2795c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2795c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXX0" /db_xref="InterPro:IPR000846" /db_xref="InterPro:IPR022663" /db_xref="InterPro:IPR022664" /db_xref="InterPro:IPR023940" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7TXX0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01413.1" /translation="MRVGVLGAKGKVGATMVRAVAAADDLTLSAELDAGDPLSLLTDG NTEVVIDFTHPDVVMGNLEFLIDNGIHAVVGTTGFTAERFQQVESWLVAKPNTSVLIA PNFAIGAVLSMHFAKQAARFFDSAEVIELHHPHKAEAPSGTAARTAKLIAEARKGLPP NPDATSTSLPGARGADVDGIPVHAVRLAGLVAHQEVLFGTEGEILTIRHDSLDRTSFV PGVLLAVRRIAERPGLTVGLEPLLDLH" CDS complement(3043338..3043742) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2796C" /product="HYPOTHETICAL PROTEIN" /note="Mb2796c, -, len: 134 aa. Equivalent to Rv2774c, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Hypothetical unknown protein. Mb2796c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y271" /protein_id="SIU01414.1" /translation="MGTAVEVGWRDPCGLAVGELRCAPAVSDQPVVGCAGCPLVDMVD FAPVTGCVAVGSTMGAVPALLRVRFPWPPFEPDVRLSPYLALHGICRWGGSDSCDRTT VQVFHLHSINKRLTAHAGFGAAAVVGLEDGPV" CDS 3043895..3044356 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2797" /product="gcn5-related n-acetyltransferase" /note="Mb2797, -, len: 153 aa. Equivalent to Rv2775, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 153 aa overlap). Hypothetical unknown protein, showing weak similarity with hypothetical proteins e.g. Q9ZBJ7|SC9C7.13c from Streptomyces coelicolor (179 aa), FASTA scores: opt: 167, E(): 0.00024, (29.05% identity in 148 aa overlap). Equivalent to AAK47164 from Mycobacterium tuberculosis strain CDC1551 (185 aa) but shorter 32 aa. Protein product from Mb2797 detected using SWATH mass spectrometry. Mb2797 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y455" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y455" /protein_id="SIU01415.1" /translation="MHYPVWRQSWTGILDPYLLDMIGSPKLWVEESYPQSLKRGGWSM WIAESGGQPIGMTMFGPDIAHPDRIQIDALYVAENSQRHGIGGRLLNRALHSHPSADM ILWCAEKNSKARGFYEKKDFHIDGRTFTWKPLSGVNVPHVGYRLYRSAPPG" CDS complement(3044360..3045289) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2798C" /product="PROBABLE OXIDOREDUCTASE" /note="Mb2798c, -, len: 309 aa. Equivalent to Rv2776c, len: 309 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 309 aa overlap). Probable oxidoreductase (EC 1.-.-.-), similar to other oxidoreductases e.g. Q9KZ15|SC10B7.17 PUTATIVE IRON-SULFUR OXIDOREDUCTASE from Streptomyces coelicolor (364 aa), FASTA scores: opt: 846, E(): 1.2e-45, (46.75% identity in 308 aa overlap); O88034|SC5A7.28c IRON-SULFUR OXIDOREDUCTASE BETA SUBUNIT from Streptomyces coelicolor (313 aa), FASTA scores: opt: 745, E(): 2.3e-39, (41.45% identity in 316 aa overlap); P33164|PDR_BURCE|OPHA1 PHTHALATE DIOXYGENASE REDUCTASE from Burkholderia cepacia (Pseudomonas cepacia) (321 aa), FASTA scores: opt: 616, E(): 2.9e-31, (33.65% identity in 309 aa overlap); etc. Equivalent to AAK47165 from Mycobacterium tuberculosis strain CDC1551 (363 aa) but shorter 54 aa. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature and PS00063 Aldo/keto reductase family putative active site signature. SEEMS TO BELONG TO THE 2FE2S PLANT-TYPE FERREDOXIN FAMILY IN THE C-TERMINAL SECTION. Mb2798c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2Z3" /db_xref="InterPro:IPR000951" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001433" /db_xref="InterPro:IPR006058" /db_xref="InterPro:IPR008333" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR036010" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z3" /protein_id="SIU01416.1" /translation="MRRTNPAVVTKRELVAPDVVALTLADPGGGLLPAWSPGGHIDVQ LPSGRRRQYSLCGVPGRRTDYRIAIRRIADGGGGSIEMHEAFDVGDTCEFEGPRNAFH LGLAERDVLFVIGGIGVTPILPMIRAAEQRGIDWRAIYAGRGREYMPFLDEVVAVAPG RVTVWADDEHGRFASVDELLAGAGPTTAVYVCGPPGMLEAVRVARNQHADAPLHYERF SPPPVVDGVPFELELARSRRVLRVPANRSALDVMLDWDPTTAYSCQQGFCGTCKVRVL AGQVDRRGRIIEGDNEMLVCVSRAVSGRVVIDA" CDS complement(3045471..3046541) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2799C" /product="FF domain protein" /note="Mb2799c, -, len: 356 aa. Equivalent to Rv2777c, len: 356 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 356 aa overlap). Conserved hypothetical protein, highly similar (but longer in N-terminus) to hypothetical proteins Q9KZ16|SC10B7.16 from Streptomyces coelicolor (296 aa), FASTA scores: opt: 980, E(): 6.8e-57, (51.25% identity in 281 aa overlap); and Q9HYS0|PA3325 from Pseudomonas aeruginosa (295 aa), FASTA scores: opt: 816, E(): 4e-46, (43.75% identity in 288 aa overlap); and similar (but longer in N-terminus) to other hypothetical proteins e.g. Q9I3H1|PA1542 from Pseudomonas aeruginosa (278 aa), FASTA scores: opt: 234, E(): 6.3e-08, (31.8% identity in 258 aa overlap). Equivalent to AAK47166 from Mycobacterium tuberculosis strain CDC1551 (393 aa) but shorter 37 aa. Mb2799c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016516" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B7" /protein_id="SIU01417.1" /translation="MNVEVHSAPGWRAGSSPLGYAQLYLPTRDVYWGDMSGIYVNAVA TFSEGAAMVSVDDRATGPHSSESRAADHERLVLEPRDVEFDWTNLPFHYVPNEPMATH VLNVLHMLLPAGEEFFVRVFKKTLPLIKDDQLRLDVQGFIGQEAMHSQAHSGVVDHFD AQGVDVTAFTNQIRWLFEKLLGESPRRSPRRQYSWLLEQVSFIAAIEHYTAVMGEWIL NSPQLDAVGADPVMLDMLRWHGAEEVEHKAVAFDTMKHLRAGYWRQVRAQLTVTPVML LLWIRGVRFMYSVDPYLPPGTKPRWRDYFKAARRGLVPGLPRLLRVVGHYYKPGFHPS QLGGLGAAVDYLAVSPAARASH" CDS complement(3046699..3047169) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2800C" /product="conserved protein" /note="Mb2800c, -, len: 156 aa. Equivalent to Rv2778c, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Conserved hypothetical protein, similar to Q9CBF7|ML2031 HYPOTHETICAL PROTEIN from Mycobacterium leprae (151 aa), FASTA scores: opt: 227, E(): 8.5e-09, (35.95% identity in 153 aa overlap). Also similar to AAK46204|MT1931.1 HYPOTHETICAL 17.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (158 aa), FASTA scores: opt: 238, E(): 1.5e-09, (35.75% identity in 151 aa overlap); or O07748|Rv1883c|MTCY180.35 HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (158 aa), FASTA scores: opt: 212, E(): 9.7e-08, (34.45% identity in 151 aa overlap); note that AAK46204|MT1931.1 and O07748|Rv1883c|MTCY180.35 are essentially the same protein except for a small (5 aa) gap. Protein product from Mb2800c detected using shotgun mass spectrometry. Mb2800c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2L0" /protein_id="SIU01418.1" /translation="MPDPDGPSVTVTVEIDANPDLVYGLITDLPTLASLAEEVVAMQL RKGDDVRKGAVFVGRNENGGRRWTTTCTVTDADPGRVFAFDVRSGIIPISRWQYGIVA TEHGCRVTESTWDRRPSWFRAVARMATGVKDRASVNTEHIRRTLQRLKDRAEAG" CDS complement(3047201..3047716) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2801C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LRP/ASNC-FAMILY)" /note="Mb2801c, -, len: 171 aa. Equivalent to Rv2779c, len: 179 aa, from Mycobacterium tuberculosis strain H37Rv, (94.475% identity in 179 aa overlap). Possible transcriptional regulator, from the Lrp/AsnC family, similar (but longer ~30 aa in N-terminus) to others e.g. CAC42842|SCBAC36F5.06 PUTATIVE ASNC-FAMILY TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (163 aa), FASTA scores: opt: 333, E(): 4.4e-16, (39.7% identity in 141 aa overlap); O07920|AZLB_BACSU TRANSCRIPTIONAL REGULATOR (ASNC FAMILY) from Bacillus subtilis; Q9I233|PA2082 PROBABLE TRANSCRIPTIONAL REGULATOR (ASNC FAMILY) from Pseudomonas aeruginosa (158 aa), FASTA scores: opt: 322, E(): 2.5e-15, (33.1% identity in 148 aa overlap); etc. Also similar to P96896|Rv3291c|MTCY71.31c from Mycobacterium tuberculosis (33.3% identity in 120 aa overlap). Equivalent to AAK47168 from Mycobacterium tuberculosis strain CDC1551 (181 aa). SEEMS TO BELONG TO THE ASNC FAMILY OF TRANSCRIPTIONAL REGULATORS. Start changed since first submission (+8 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 24 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (171 aa versus 179 aa). Mb2801c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y261" /db_xref="InterPro:IPR000485" /db_xref="InterPro:IPR011008" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR019887" /db_xref="InterPro:IPR019888" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y261" /protein_id="SIU01419.1" /translation="MIILFRGHIRDNSTEHKTRRAASSKDVRPAELDEVDRRILSLLH GDARMPNNALADTVGIAPSTCHGRVRRLVDLGVIRGFYTDIDPVAVGLPLQAMISVNL QSSARGKIRSFIQQIRRKRQGADDFILHVAARDTEDLRSFVVENLNADADVAGTQTSL IFEHLRGAAPI" CDS 3047782..3048180 /codon_start=1 /transl_table=11 /gene="alda" /locus_tag="BQ2027_MB2802" /product="SECRETED L-ALANINE DEHYDROGENASE ALDa [FIRST PART] (40 KDA ANTIGEN) (TB43)" /note="Mb2802, alda, len: 132 aa. Similar to 5' end of Rv2780, len: 371 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 89 aa overlap). ald, secreted L-alanine dehydrogenase (EC 1.4.1.1) (40 kd antigen) (see first citations below); equivalent to Q9CBV6|ALD|ML1532 L-ALANINE DEHYDROGENASE from Mycobacterium leprae (371 aa), FASTA scores: opt: 2081, E(): 4e-115, (85.45% identity in 371 aa overlap). Also highly similar to others e.g. Q9S227|SCI51.13c from Streptomyces coelicolor (371 aa), FASTA scores: opt: 1575, E(): 2.3e-85, (66.05% identity in 371 aa overlap); Q9K827|BH3180 from Bacillus halodurans (371 aa), FASTA scores: opt: 1341, E(): 1.4e-71, (56.45% identity in 372 aa overlap); Q9RT70|DR1895 from Deinococcus radiodurans (390 aa), FASTA scores: opt: 1319, E(): 2.8e-70, (54.2% identity in 371 aa overlap); etc. Contains PS00836 and PS00837 Alanine dehydrogenase & pyridine nucleotide transhydrogenase signature 1 and 2. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, ald exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits ald into 2 parts, alda and aldb. Mb2802 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y257" /db_xref="InterPro:IPR007886" /db_xref="InterPro:IPR008141" /db_xref="InterPro:IPR008142" /db_xref="UniProtKB/TrEMBL:A0A1R3Y257" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01420.1" /translation="MRVGIPTETKNNEFRVAITPAGVAELTRRGHEVLIQAGAGEGSA ITDADFKAAGAQLVGTADQVWADADLLLKVKEPIAAEYGRLRHGRSCSRSCIWPRHVL APMRCWIPAPRQLPTRPSRPPTAHYPCLPR" CDS 3048177..3048896 /codon_start=1 /transl_table=11 /gene="aldb" /locus_tag="BQ2027_MB2803" /product="SECRETED L-ALANINE DEHYDROGENASE ALDb [SECOND PART] (40 KDA ANTIGEN) (TB43)" /note="Mb2803, aldb, len: 239 aa. Equivalent to 3' end of Rv2780, len: 371 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 239 aa overlap). ald, secreted L-alanine dehydrogenase (EC 1.4.1.1) (40 kd antigen) (see first citations below); equivalent to Q9CBV6|ALD|ML1532 L-ALANINE DEHYDROGENASE from Mycobacterium leprae (371 aa), FASTA scores: opt: 2081, E(): 4e-115, (85.45% identity in 371 aa overlap). Also highly similar to others e.g. Q9S227|SCI51.13c from Streptomyces coelicolor (371 aa), FASTA scores: opt: 1575, E(): 2.3e-85, (66.05% identity in 371 aa overlap); Q9K827|BH3180 from Bacillus halodurans (371 aa), FASTA scores: opt: 1341, E(): 1.4e-71, (56.45% identity in 372 aa overlap); Q9RT70|DR1895 from Deinococcus radiodurans (390 aa), FASTA scores: opt: 1319, E(): 2.8e-70, (54.2% identity in 371 aa overlap); etc. Contains PS00836 and PS00837 Alanine dehydrogenase & pyridine nucleotide transhydrogenase signature 1 and 2. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, ald exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (a-*) splits ald into 2 parts, alda and aldb. Protein product from Mb2803 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2803 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y259" /db_xref="InterPro:IPR007698" /db_xref="InterPro:IPR008141" /db_xref="InterPro:IPR008143" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y259" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01421.1" /translation="MSEVAGRLAAQVGAYHLMRTQGGRGVLMGGVPGVEPADVVVIGA GTAGYNAARIANGMGATVTVLDINIDKLRQLDAEFCGRIHTRYSSAYELEGAVKRADL VIGAVLVPGAKAPKLVSNSLVAHMKPGAVLVDIAIDQGGCFEGSRPTTYDHPTFAVHD TLFYCVANMPASVPKTSTYALTNATMPYVLELADHGWRAACRSNPALAKGLSTHEGAL LSERVATDLGVPFTEPASVLA" CDS complement(3048911..3049945) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2804C" /product="POSSIBLE ALANINE RICH OXIDOREDUCTASE" /note="Mb2804c, -, len: 344 aa. Equivalent to Rv2781c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Possible ala-rich oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases or hypothetical proteins e.g. Q9RDD8|SCC77.20c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (364 aa), FASTA scores: opt: 912, E(): 5.3e-47, (45.55% identity in 336 aa overlap); Q9FDD4|2-NPDL PUTATIVE 2-NITROPROPANE DIOXYGENASE from Streptomyces ansochromogenes (363 aa), FASTA scores: opt: 869, E(): 1.9e-44, (44.2% identity in 337 aa overlap); O05413|YRPB 2-NITROPROPANE DIOXYGENASE from Bacillus subtilis (347 aa), FASTA scores: opt: 560, E(): 4.9e-26, (33.75% identity in 317 aa overlap); etc. Protein product from Mb2804c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2804c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y272" /db_xref="InterPro:IPR004136" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y272" /protein_id="SIU01422.1" /translation="MVLGFWDIAVPIVGAPMAGGPSTPALAAAVSNAGGLGFVAGGYL SADRLADDIAAARAATTGPIGANLFVPQPSVADWAQLEYYADELEEVAEYYHTEVGQP VYGDDDDWVRKLEVVADVRPEVVSFTFGAPPPDVVQRLSALGLLVSITVTSVYEAGVA IAAGADSLVVQGPAAGGHRGTFAPDMEPGTESLHQLLDRIGSAHDVPLVAAGGLGTAE DVAAVLRRGAIAAQVGTALLLADEAGTNAAHRAALKNPEFDATLVTRAFSGRYARGLA NNFTRLLDHVAPLGYPEVHQMTKPIRAAAVQADDPHGTNLWAGSAHRKTRPGPAADII ASLTPDVCSA" CDS complement(3050006..3051322) /codon_start=1 /transl_table=11 /gene="pepR" /locus_tag="BQ2027_MB2805C" /product="PROBABLE ZINC PROTEASE PEPR" /note="Mb2805c, pepR, len: 438 aa. Equivalent to Rv2782c, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 438 aa overlap). Probable pepR, protease/peptidase (EC 3.4.99.-), equivalent to O32965|YR82_MYCLE|ML0855|MLCB22.26c HYPOTHETICAL ZINC PROTEASE from Mycobacterium leprae (445 aa), FASTA scores: opt: 2346, E(): 4.3e-146, (84.3% identity in 421 aa overlap). Also highly similar to others e.g. O86835|YA12_STRCO|SC9A10.02 from Streptomyces coelicolor (459 aa), FASTA scores: opt: 1394, E(): 1.1e-83, (51.9% identity in 416 aa overlap); Q04805|YMXG_BACSU|YMXG from Bacillus subtilis (409 aa), FASTA scores: opt: 1014, E(): 7.9e-59, (37.55% identity in 410 aa overlap); Q9KA85|BH2405 from Bacillus halodurans (413 aa), FASTA scores: opt: 967, E(): 9.6e-56, (38.6% identity in 417 aa overlap); etc. Contains PS00143 Insulinase family, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M16, ALSO KNOWN AS THE INSULINASE FAMILY. COFACTOR: REQUIRES DIVALENT CATIONS FOR ACTIVITY. BINDS ZINC. Protein product from Mb2805c detected using SWATH mass spectrometry. Mb2805c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5S9" /db_xref="InterPro:IPR001431" /db_xref="InterPro:IPR007863" /db_xref="InterPro:IPR011249" /db_xref="InterPro:IPR011765" /db_xref="UniProtKB/Swiss-Prot:P0A5S9" /protein_id="SIU01423.1" /translation="MPRRSPADPAAALAPRRTTLPGGLRVVTEFLPAVHSASVGVWVG VGSRDEGATVAGAAHFLEHLLFKSTPTRSAVDIAQAMDAVGGELNAFTAKEHTCYYAH VLGSDLPLAVDLVADVVLNGRCAADDVEVERDVVLEEIAMRDDDPEDALADMFLAALF GDHPVGRPVIGSAQSVSVMTRAQLQSFHLRRYTPERMVVAAAGNVDHDGLVALVREHF GSRLVRGRRPVAPRKGTGRVNGSPRLTLVSRDAEQTHVSLGIRTPGRGWEHRWALSVL HTALGGGLSSRLFQEVRETRGLAYSVYSALDLFADSGALSVYAACLPERFADVMRVTA DVLESVARDGITEAECGIAKGSLRGGLVLGLEDSSSRMSRLGRSELNYGKHRSIEHTL RQIEQVTVEEVNAVARHLLSRRYGAAVLGPHGSKRSLPQQLRAMVG" CDS complement(3051300..3053558) /codon_start=1 /transl_table=11 /gene="gpsI" /locus_tag="BQ2027_MB2806C" /product="BIFUNCTIONAL PROTEIN POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE GPSI: GUANOSINE PENTAPHOSPHATE SYNTHETASE + POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE (POLYNUCLEOTIDE PHOSPHORYLASE) (PNPASE)" /note="Mb2806c, gpsI, len: 752 aa. Equivalent to Rv2783c, len: 752 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 752 aa overlap). Probable gpsI, polyribonucleotide nucleotidyltransferase (EC 2.7.7.8; 2.7.6.-), equivalent to Q9CCF8|GPSI|ML0854 (alias O32966) PUTATIVE POLYRIBONUCLEOTIDE PHOSPHORYLASE / GUANOSINE PENTAPHOSPHATE SYNTHETASE from Mycobacterium leprae (773 aa), FASTA scores: opt: 4304, E(): 0, (89.95% identity in 757 aa overlap). Also highly similar to others e.g. O86656|GPSI GUANOSINE PENTAPHOSPHATE SYNTHETASE/ POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE (FRAGMENT) from Streptomyces coelicolor (716 aa), FASTA scores: opt: 3393, E(): 5.8e-192, (72.77% identity in 718 aa overlap); Q53597|GPSI GUANOSINE PENTAPHOSPHATE SYNTHETASE from Streptomyces antibioticus (740 aa), FASTA scores: opt: 3314, E(): 2.6e-187, (70.55% identity in 733 aa overlap); P72659|PNP|SLL1043 POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE from Synechocystis sp. strain PCC 6803 (718 aa), FASTA scores: opt: 1244, E(): 1.7e-65, (45.05% identity in 750 aa overlap); etc. Note that S. antibioticus guanosine pentaphosphate synthetase is a multifunctional enzyme that also acts as a polyribonucleotide nucleotidyltransferase. Start site chosen by homology from several alternatives. Protein product from Mb2806c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2806c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXW0" /db_xref="InterPro:IPR001247" /db_xref="InterPro:IPR003029" /db_xref="InterPro:IPR004087" /db_xref="InterPro:IPR004088" /db_xref="InterPro:IPR012162" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR014069" /db_xref="InterPro:IPR015848" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR022967" /db_xref="InterPro:IPR027408" /db_xref="InterPro:IPR036345" /db_xref="InterPro:IPR036456" /db_xref="InterPro:IPR036612" /db_xref="UniProtKB/Swiss-Prot:Q7TXW0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01424.1" /translation="MSAAEIDEGVFETTATIDNGSFGTRTIRFETGRLALQAAGAVVA YLDDDNMLLSATTASKNPKEHFDFFPLTVDVEERMYAAGRIPGSFFRREGRPSTDAIL TCRLIDRPLRPSFVDGLRNEIQIVVTILSLDPGDLYDVLAINAASASTQLGGLPFSGP IGGVRVALIDGTWVGFPTVDQIERAVFDMVVAGRIVEGDVAIMMVEAEATENVVELVE GGAQAPTESVVAAGLEAAKPFIAALCTAQQELADAAGKSGKPTVDFPVFPDYGEDVYY SVSSVATDELAAALTIGGKAERDQRIDEIKTQVVQRLADTYEGREKEVGAALRALTKK LVRQRILTDHFRIDGRGITDIRALSAEVAVVPRAHGSALFERGETQILGVTTLDMIKM AQQIDSLGPETSKRYMHHYNFPPFSTGETGRVGSPKRREIGHGALAERALVPVLPSVE EFPYAIRQVSEALGSNGSTSMGSVCASTLALLNAGVPLKAPVAGIAMGLVSDDIQVEG AVDGVVERRFVTLTDILGAEDAFGDMDFKVAGTKDFVTALQLDTKLDGIPSQVLAGAL EQAKDARLTILEVMAEAIDRPDEMSPYAPRVTTIKVPVDKIGEVIGPKGKVINAITEE TGAQISIEDDGTVFVGATDGPSAQAAIDKINAIANPQLPTVGERFLGTVVKTTDFGAF VSLLPGRDGLVHISKLGKGKRIAKVEDVVNVGDKLRVEIADIDKRGKISLILVADEDS TAAATDAATVTS" CDS complement(3053912..3054427) /codon_start=1 /transl_table=11 /gene="lppU" /locus_tag="BQ2027_MB2807C" /product="PROBABLE LIPOPROTEIN LPPU" /note="Mb2807c, lppU, len: 171 aa. Equivalent to Rv2784c, len: 171 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 171 aa overlap). Probable lppU, lipoprotein, sharing no homology with other proteins. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb2807c detected using SWATH mass spectrometry. Mb2807c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y465" /protein_id="SIU01425.1" /translation="MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQ ATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGC MSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAV CVEDVTGGPRS" CDS complement(3054440..3054709) /codon_start=1 /transl_table=11 /gene="rpsO" /locus_tag="BQ2027_MB2808C" /product="30s ribosomal protein s15 rpso" /note="Mb2808c, rpsO, len: 89 aa. Equivalent to Rv2785c, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Probable rpsO, 30s ribosomal protein S15, equivalent to O32967|RS15_MYCLE|RPSO|ML0853|MLCB22.28c 30S RIBOSOMAL PROTEIN S15 from Mycobacterium leprae (89 aa), FASTA scores: opt: 522, E(): 7.4e-34, (92.15% identity in 89 aa overlap). Also highly similar to many e.g. O86655|RS15_STRCO|RPSO|SC3C3.22 from Streptomyces coelicolor (95 aa), FASTA scores: opt: 408, E(): 6.7e-25, (62.9% identity in 89 aa overlap); P05766|RS15_BACST|RPSO from Bacillus stearothermophilus (88 aa), FASTA scores: opt: 385, E(): 4e-23, (62.5% identity in 88 aa overlap); P21473|RS15_BACSU|RPSO from Bacillus subtilis (88 aa), FASTA scores: opt: 351, E(): 1.9e-20, (57.95% identity in 88 aa overlap); P02371|RS15_ECOLI|RPSO|SEC|B3165 from Escherichia coli strain K12 (88 aa), FASTA scores: opt: 295, E(): 4.5e-22, (52.3% identity in 88 aa overlap); etc. Contains PS00362 Ribosomal protein S15 signature. BELONGS TO THE S15P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2808c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2808c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66430" /db_xref="InterPro:IPR000589" /db_xref="InterPro:IPR005290" /db_xref="InterPro:IPR009068" /db_xref="UniProtKB/Swiss-Prot:P66430" /protein_id="SIU01426.1" /translation="MALTAEQKKEILRSYGLHETDTGSPEAQIALLTKRIADLTEHLK VHKHDHHSRRGLLLLVGRRRRLIKYISQIDVERYRSLIERLGLRR" CDS complement(3054866..3055861) /codon_start=1 /transl_table=11 /gene="ribF" /locus_tag="BQ2027_MB2809C" /product="probable bifunctional fad synthetase/riboflavin biosynthesis protein ribf: riboflavin kinase (flavokinase) + fmn adenylyltransferase (fad pyrophosphorylase) (fad synthetase)(fad diphosphorylase) (flavin adenine dinucleotide synthetase)" /note="Mb2809c, ribF, len: 331 aa. Equivalent to Rv2786c, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 331 aa overlap). Probable ribF, FAD synthetase/riboflavin biosynthesis protein, bifunctional enzyme (EC 2.7.1.26; 2.7.7.2), equivalent to O32968|RIBF|ML0852 RIBOFLAVIN KINASE from Mycobacterium leprae (331 aa), FASTA scores: opt: 1923, E(): 2.3e-115, (87.45% identity in 327 aa overlap). Also highly similar to many e.g. Q59263|RIBF_CORAM from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (338 aa), FASTA scores: opt: 899, E(): 5.7e-50, (45.8% identity in 321 aa overlap); Q9Z530|SC9F2.05c from Streptomyces coelicolor (318 aa), FASTA scores: opt: 862, E(): 1.3e-47, (52.45% identity in 324 aa overlap); P08391|RIBF_ECOLI|B0025|Z0029\ECS0028 from Escherichia coli strains K12 and O157:H7 (313 aa), FASTA scores: opt: 517, E(): 1.3e-25, (36.05% identity in 305 aa overlap); etc. Protein product from Mb2809c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2809c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2C3" /db_xref="InterPro:IPR002606" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR015864" /db_xref="InterPro:IPR015865" /db_xref="InterPro:IPR023465" /db_xref="InterPro:IPR023468" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C3" /protein_id="SIU01427.1" /translation="MRRRLAIVQRWRGQDEIPTDWGRCVLTIGVFDGVHRGHAELIAH AVKAGRARGVPAVLMTFDPHPMEVVYPGSHPAQLTTLTRRAELVQDLGIEVFLVMPFT TDFMKLTPDRFIHELLVEHLHVVEVVVGENFTFGKKAAGNVDTLRRAGERFGFAVESM SLVSEHHSNETVTFSSTYIRSCVDAGDMVAAMEALGRPHRVEGVVVRGEGRGAELGFP TANVAPPMYSAIPADGVYAAWFTVLGHGPVTGTVVPGERYQAAVSVGTNPTFSGRTRT VEAFVLDTTADLYGQHVALDFVGRIRGQKKFESVRQLVAAMGADTERARDLLSTG" CDS 3056072..3057835 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2810" /product="ATPases involved in chromosome partitioning" /note="Mb2810, -, len: 587 aa. Equivalent to Rv2787, len: 587 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 587 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q9CCI1|ML0798 HYPOTHETICAL PROTEIN from Mycobacterium leprae (592 aa), FASTA scores: opt: 2994, E(): 6.9e-179, (76.5% identity in 587 aa overlap); and similar in part to other proteins from Mycobacterium leprae e.g. O33082|MLCB628.11 HYPOTHETICAL 52.0 KDA PROTEIN (478 aa), FASTA scores: opt: 481, E(): 2.3e-22, (30.95% identity in 294 aa overlap). Also similar in part to O86637|SC3C3.03c HYPOTHETICAL 112.1 KDA PROTEIN from Streptomyces coelicolor (1083 aa), FASTA scores: opt: 488, E(): 1.5e-22, (28.95% identity in 297 aa overlap). And similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 625, E(): 2.2e-31, (34.05% identity in 320 aa overlap); O69740|Rv3876|MTV027.11 (666 aa), FASTA scores: opt: 453, E(): 1.6e-20, (29.2% identity in 370 aa overlap); P96217|Rv3860|MTCY01A6.08c (390 aa), FASTA scores: opt: 443, E(): 4.7e-20, (29.95% identity in 354 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb2810 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002586" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2M0" /protein_id="SIU01428.1" /translation="MSTFRECRSMFDAAVKSYQSGDLANARAAFGRLTVENPDMSDGW LGLLACGDHHLDTLAGAHQHSEALYSETRRVGLTDGELSAVVMAPMYLGLRVWSRATI GLAYASALIIADRHDEAAATLDDPVITEDTGAAQYRQFVMATLFHKTRSWSNLLKVTE ISPPSGATDVRDEVADAVAALASTAAASLGQFQFALELAEQVSTTNPRVTADVTLTRA WCLRELGDDDAARVALSATTTGDAPRTNTTAEQAGSPQPKFRHPYDDGRDLLVARRRP PAGDGWRKAVTKMTFGRVNPEPSAKREQTDELIQRICAPLADVHKLAFVSAKGGVGKT TMTVLVGNAVARLRGDRVMAVDVDADLGDLSARFSERGGPQTNIEHFVSSQHTKRYAD VRVHTVMNKDRLEMLGAQNDPRSTYKFGPEDYGAAMQILETHCNVILLDCGTPVNGPL FSNILNDVTGLVVVASEDVRGVEGALVTLDWLGAHGFGRLLQHTVVVLNAIQKTRSLV DCGAAENQFRKRVPDFFRIPYDPHLATGLAVDFSSLKRRTRNAVLDLAGGLAQHYPAS RVRPRGEDSWKTWIETMRQVG" mobile_element 3056724..3058755 /mobile_element_type="insertion sequence:IS1602" /locus_tag="BQ2027_IS1602" /note="IS1602, len: 2032 nt. Equivalent to IS1602, len: 2032 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 2032 nt overlap)." CDS 3057920..3058606 /codon_start=1 /transl_table=11 /gene="sirR" /locus_tag="BQ2027_MB2811" /product="PROBABLE TRANSCRIPTIONAL REPRESSOR SIRR" /note="Mb2811, sirR, len: 228 aa. Equivalent to Rv2788, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 228 aa overlap). Probable sirR, transcriptional repressor, highly similar to others e.g. Q9RRF3|DR2539 PUTATIVE IRON DEPENDENT REPRESSOR from Deinococcus radiodurans (232 aa), FASTA scores: opt: 518, E(): 4.5e-26, (41.2% identity in 221 aa overlap); Q9HRU8|SIRR|VNG0536G from Halobacterium sp. strain NRC-1 (233 aa), FASTA scores: opt: 516, E(): 6.1e-26, (40.45% identity in 220 aa overlap); Q9KIJ2|SLOR REGULATOR SLOR from Streptococcus mutans (217 aa), FASTA scores: opt: 418, E(): 1.2e-19, (36.15% identity in 213 aa overlap); etc. Also some similarity to Q50495|IDER_MYCTU|MTCY05A6.32|IDER|DTXR|Rv2711|MT2784|MTCY 05A6.32 IRON-DEPENDENT REPRESSOR from Mycobacterium tuberculosis (230 aa), FASTA scores: opt: 266, E(): 7.1e-10, (27.6% identity in 221 aa overlap). Contains helix-turn-helix motif at aa 32-53 (Score 1327, +3.71 SD). COULD BELONG TO THE CRP/FNR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb2811 detected using shotgun mass spectrometry. Mb2811 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y267" /db_xref="InterPro:IPR000485" /db_xref="InterPro:IPR001367" /db_xref="InterPro:IPR007167" /db_xref="InterPro:IPR008988" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR022687" /db_xref="InterPro:IPR022689" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="InterPro:IPR036421" /db_xref="UniProtKB/TrEMBL:A0A1R3Y267" /protein_id="SIU01429.1" /translation="MRADEEPGDLSAVAQDYLKVIWTAQEWSQDKVSTKMLAERIGVS ASTASESIRKLAEQGLVDHEKYGAVTLTDSGRRAALAMVRRHRLLETFLVNELGYRWD EVHDEAEVLEHAVSDRLMARIDAKLGFPQRDPHGAPIPGADGQVPTPPARQLWACRDG DTGTVARISDADPQMLRYFASIGISLDSRLRVLARREFAGMISVAIDSADGATVDLGS PAAQAIWVVS" CDS complement(3058667..3059899) /codon_start=1 /transl_table=11 /gene="fadE21" /locus_tag="BQ2027_MB2812C" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE21" /note="Mb2812c, fadE21, len: 410 aa. Equivalent to Rv2789c, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 410 aa overlap). Probable fadE21, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 689, E(): 9.3e-37, (35.75% identity in 400 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 679, E(): 4.1e-36, (37.3% identity in 405 aa overlap); Q06319|ACDS_MEGEL from Megasphaera elsdenii (383 aa), FASTA scores: opt: 650, E(): 3e-34, (37.7% identity in 334 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 1 (PS00072). BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb2812c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2812c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y270" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y270" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01430.1" /translation="MFEWSDTDLMVRDAVRQFIDKEIRPHQDALETGELSPYPIARKL FSQFGLDVLLAESVNQMLDGERAKREKRDSSGSFGLADQASMVAVLVSELAGVSIGLL STVAVSLGLGAATIMSRGTLAQQERWVPTLVTLEKIAAWAITEPDSGSDAFGGMKTHV TRDGEDYILNGHKTFITNGPYADVLVVYAKLADGEPASDWRNRPVLVFVLDAGMPGLT QGKPFKKMGMMSSPTGELFFDNVRLTPDRLLCAEGDGRDSARANFAVERLGVALMSLG IINECHRLCVDYAKTRTLWGRNIGQFQLIQLKLAKMEVARINVQNMVFQAIERLKAGK QLTLAEASAIKLYSSEAATDVAMEAVQLFGGNGYMAEYRVEQLARDAKSLMIYAGSNE VQVTHIAKGLLGEPASRA" CDS complement(3059925..3061130) /codon_start=1 /transl_table=11 /gene="ltp1" /locus_tag="BQ2027_MB2813C" /product="PROBABLE LIPID-TRANSFER PROTEIN LTP1" /note="Mb2813c, ltp1, len: 401 aa. Equivalent to Rv2790c, len: 401 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 401 aa overlap). Probable ltp1, lipid-transfer protein, highly similar to many eukaryotic sterol-carrier proteins/lipid-transfer protein precursors (see first citation below) e.g. O62742|SCP2 STEROL CARRIER PROTEIN X from Oryctolagus cuniculus (Rabbit) (547 aa), FASTA scores: opt: 1710, E(): 6e-102, (63.7% identity in 394 aa overlap); Q9QW19 3-OXOACYL-CoA THIOLASE HOMOLOG (FRAGMENT) (see citation below) from Rattus sp. (405 aa), FASTA scores: opt: 1696, E(): 3.8e-101, (63.2% identity in 394 aa overlap); P11915|NLTP_RAT|SCP2|SCP-2 NONSPECIFIC LIPID-TRANSFER PROTEIN PRECURSOR from Rattus norvegicus (Rat) (547 aa), FASTA scores: opt: 1696, E(): 4.8e-101, (63.2% identity in 394 aa overlap); P32020|NLTP_MOUSE|SCP2|SCP-2 NONSPECIFIC LIPID-TRANSFER PROTEIN PRECURSOR from Mus musculus (Mouse) (547 aa), FASTA scores: opt: 1681, E(): 4.3e-100, (62.7% identity in 394 aa overlap); etc. Contains PS00098 Thiolases acyl-enzyme intermediate signature and PS00737 Thiolases signature 2. Also similar to other M. tuberculosis proteins e.g. O06144|Rv1627c|MTCY01B2.19c (402 aa) (35.8% identity in 413 aa overlap). Protein product from Mb2813c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2813c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y282" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020615" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3Y282" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01431.1" /translation="MPNQGSSNKVYVIGVGMTKFEKPGRREGWDYPDMARESGTKALR DAGIDYREVEQGYVGYVYGESTSGQRALYELGMTGIPIVNVNNNCSTGSTALYLGAQA IRGGLADCVLALGFEKMQPGALGGGADDRESPLGRHVKALAEIDEFGFPVAPWMFGAA GREHMKKYGTTAEHFAKIGYKNHKHSVNNPYAQFQDEYTLDDILASKMISDPLTKLQC SPTSDGSAAVVLASEDYLANHNLAGRAVEIVGQAMTTDFASTFDGSARNIIGYDMTVQ AAQRVYQQSGLGPKDFGVIELHDCFSANELLLYEALGLCGPGEAPELIDDNQTTYGGR WVVNPSGGLISKGHPLGATGLAQCAELTWQLRGTAEARQVDNVTAALQHNIGLGGAAV VTAYQRAER" gene 3061136..3063167 /locus_tag="BQ2027_IS1602" CDS complement(3061163..3062542) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2814C" /product="PROBABLE TRANSPOSASE" /note="Mb2814c, -, len: 459 aa. Equivalent to Rv2791c, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 459 aa overlap). Probable IS1602 transposase for IS1602 element, similar to many e.g. P95117|Rv2978c|MTCY349.09 from Mycobacterium tuberculosis (459 aa), FASTA scores: opt: 2718, E(): 6.3e-165, (86.05% identity in 459 aa overlap). Mb2814c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001959" /db_xref="InterPro:IPR010095" /db_xref="InterPro:IPR021027" /db_xref="UniProtKB/TrEMBL:A0A1R3Y281" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01432.1" /translation="MAKFEIPEGWMVQAFRFTLDPTAEQARALARHFGARRKAYNWTV ATLKADIDAWQATGIQTAKPSLRVLRKRWNTVKNDVCVNIETGVVWWPECSKEAYADG IDGAVDAYWNWQNSRSGKRDGKRMGFPRFKKKGRDPDRVTFTTGAMRVEPDRRHLTLP VIGTVRTHENTRRVERLIAKGRSRVLAITVRRNGTRIDASVRVLVQRPQQPKVTDPGS RVGVDVGVRRLATVATADGAVLERVPNPRPLDAALNELRHVCRARSRCTKGSRRYRER TTEISRLHRRVNDVRTHHLHCLTTHLAKTHGRIVVEGLDAAGMLRQQGLSGARARRRG LSDAALGTPRRHLSYKTGWYGSQLVVADRWFPSSKTCHVCGHVQEIGWAEHWQCDSCS ASHQRDDCAAINLARYEDTSSVVGPVGAAVKRGADRKTRPGRAGGREARKGSSRKAAE QPRDGVQVA" CDS complement(3062542..3063123) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2815C" /product="POSSIBLE RESOLVASE" /note="Mb2815c, -, len: 193 aa. Equivalent to Rv2792c, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 193 aa overlap). Possible IS1602 resolvase, highly similar to many from Mycobacterium tuberculosis e.g. O07773|Rv0605|MTCY19H5.17c POSSIBLE RESOLVASE (202 aa), FASTA scores: opt: 1040, E(): 1.9e-62, (85.05% identity in 194 aa overlap). Contains PS00397 Site-specific recombinases active site and possible helix-turn-helix motif at aa 1-2 (Score 1687, +4.93 SD). Mb2815c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y260" /db_xref="InterPro:IPR006118" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="InterPro:IPR041718" /db_xref="UniProtKB/TrEMBL:A0A1R3Y260" /protein_id="SIU01433.1" /translation="MNLAVWAERNGVARVTVYRWFHAGLLPVPARKAGRLILVDDQPA DRSRRARTAVYARVSSADQKPDLDRQVARVTAWATAEQIAVDKVVTEVGSALNGHRRK FLALLRDPSVKRIVVEHRDRFCRFGSEYVEAALAAQGRELVVVDSAEVDDDLVRDMTE ILTSMCARLYGKRAAQNRAKRALAAAAEESEAA" CDS complement(3063325..3064221) /codon_start=1 /transl_table=11 /gene="truB" /locus_tag="BQ2027_MB2816C" /product="PROBABLE TRNA PSEUDOURIDINE SYNTHASE B TRUB (TRNA PSEUDOURIDINE 55 SYNTHASE) (PSI55 SYNTHASE) (PSEUDOURIDYLATE SYNTHASE) (URACIL HYDROLYASE)" /note="Mb2816c, truB, len: 298 aa. Equivalent to Rv2793c, len: 298 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 298 aa overlap). Probable truB, tRNA pseudouridine synthase (EC 4.2.1.70), equivalent to Q9Z5I4|TRUB_MYCLE|ML1546 OR MLCB596.24 TRNA PSEUDOURIDINE SYNTHASE B from Mycobacterium leprae (320 aa), FASTA scores: opt: 1403, E(): 2.9e-83, (74.05% identity in 293 aa overlap). Also highly similar to many e.g. Q9Z528|TRUB_STRCO|SC9F2.07c from Streptomyces coelicolor (301 aa), FASTA scores: opt: 870, E(): 7.6e-49, (50.7% identity in 296 aa overlap); P09171|TRUB_ECOLI|P35|B3166|Z4527|ECS4047 from Escherichia coli strains K12 and O157:H7 (314 aa), FASTA scores: opt: 574, E(): 1e-29, (42.5% identity in 214 aa overlap); Q9PGR1|TRUB_XYLFA|XF0237 from Xylella fastidiosa (302 aa), FASTA scores: opt: 569, E(): 2.1e-29, (41.05% identity in 285 aa overlap); etc. BELONGS TO THE TRUB FAMILY OF PSEUDOURIDINE SYNTHASES. Protein product from Mb2816c detected using SWATH mass spectrometry. Mb2816c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P62189" /db_xref="InterPro:IPR002501" /db_xref="InterPro:IPR014780" /db_xref="InterPro:IPR015225" /db_xref="InterPro:IPR015947" /db_xref="InterPro:IPR020103" /db_xref="InterPro:IPR032819" /db_xref="InterPro:IPR036974" /db_xref="UniProtKB/Swiss-Prot:P62189" /protein_id="SIU01434.1" /translation="MSATGPGIVVIDKPAGMTSHDVVGRCRRIFATRRVGHAGTLDPM ATGVLVIGIERATKILGLLTAAPKSYAATIRLGQTTSTEDAEGQVLQSVPAKHLTIEA IDAAMERLRGEIRQVPSSVSAIKVGGRRAYRLARQGRSVQLEARPIRIDRFELLAARR RDQLIDIDVEIDCSSGTYIRALARDLGDALGVGGHVTALRRTRVGRFELDQARSLDDL AERPALSLSLDEACLLMFARRDLTAAEASAAANGRSLPAVGIDGVYAACDADGRVIAL LRDEGSRTRSVAVLRPATMHPG" CDS complement(3064218..3064901) /codon_start=1 /transl_table=11 /gene="pptt" /locus_tag="BQ2027_MB2817C" /product="phosphopantetheinyl transferase pptt (coa:apo-[acp]pantetheinephosphotransferase) (coa:apo-[acyl-carrier protein]pantetheinephosphotransferase)" /note="Mb2817c, -, len: 227 aa. Equivalent to Rv2794c, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 227 aa overlap). Conserved hypothetical protein, equivalent to Q9Z5I5|ML1547|MLCB596.23 PUTATIVE IRON-CHELATING COMPLEX SUBUNIT from Mycobacterium leprae (227 aa), FASTA scores: opt: 1248, E(): 9.1e-77, (79.75% identity in 227 aa overlap). Also highly similar to various proteins e.g. Q9F0Q6|PPTA PHOSPHOPANTETHEINYL TRANSFERASE from Streptomyces verticillus (246 aa), FASTA scores: opt: 692, E(): 2.8e-39, (46.65% identity in 225 aa overlap); O88029|SC5A7.23 HYPOTHETICAL 24.5 KDA PROTEIN from Streptomyces coelicolor (226 aa), FASTA scores: opt: 679, E(): 2e-38, (46.9% identity in 226 aa overlap); O24813 DNA FOR L-PROLINE 3-HYDROXYLASE from Streptomyces sp. (208 aa), FASTA scores: opt: 631, E(): 3.2e-35, (48.1% identity in 208 aa overlap); etc. Protein product from Mb2817c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2817c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y471" /db_xref="InterPro:IPR003542" /db_xref="InterPro:IPR008278" /db_xref="InterPro:IPR037143" /db_xref="InterPro:IPR041354" /db_xref="UniProtKB/TrEMBL:A0A1R3Y471" /protein_id="SIU01435.1" /translation="MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARS VAKRRNEFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGVVGSLTHCAGYRGAVV GRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDRILFCAKEATY KAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGSTLSGPPLTTLRGRWSVER GLVLTAIVL" CDS complement(3064898..3065872) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2818C" /product="Metallophosphoesterase, SimX4 hydrolase" /note="Mb2818c, -, len: 324 aa. Equivalent to Rv2795c, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 324 aa overlap). Conserved hypothetical protein, equivalent to Q9Z5I6|ML1548|MLCB596.22 HYPOTHETICAL 37.5 KDA PROTEIN from Mycobacterium leprae (321 aa), FASTA scores: opt: 2018, E(): 6.3e-128, (87.4% identity in 318 aa overlap). Also highly similar to O88028|SC5A7.22 HYPOTHETICAL 33.5 KDA PROTEIN from Streptomyces coelicolor (295 aa), FASTA scores: opt: 1202, E(): 3.4e-73, (57.2% identity in 285 aa overlap); and Q9AMH7|SIMX4 SIMX4 PROTEIN from Streptomyces antibioticus (293 aa), FASTA scores: opt: 1045, E(): 1.2e-62, (51.4% identity in 286 aa overlap). C-terminus highly similar to Q9F0Q7 HYPOTHETICAL 9.6 KDA PROTEIN (FRAGMENT) from Streptomyces verticillus (81 aa), FASTA scores: opt: 395, E(): 1.8e-19, (68.35% identity in 79 aa overlap). Also similar to other proteins e.g. Q9FWV7 HYPOTHETICAL 45.3 KDA PROTEIN from Oryza sativa (Rice) (402 aa), FASTA scores: opt: 294, E(): 3.6e-12, (26.45% identity in 340 aa overlap). Protein product from Mb2818c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2818c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y311" /db_xref="InterPro:IPR004843" /db_xref="UniProtKB/TrEMBL:A0A1R3Y311" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01436.1" /translation="MTWKGSGQETVGAEPTLWAISDLHTGHLGNKPVAESLYPSSPDD WLIVAGDVAERTDEIRWSLDLLRRRFAKVIWVPGNHELWTTNRDPMQIFGRARYDYLV NMCDEMGVVTPEHPFPVWTERGGPATIVPMFLLYDYSFLPEGANSKAEGVAIAKERNV VATDEFLLSPEPYPTRDAWCHERVAATRARLEQLDWMQPTVLVNHFPLLRQPCDALFY PEFSLWCGTTKTADWHTRYNAVCSVYGHLHIPRTTWYDGVRFEEVSVGYPREWRRRKP YSWLRQVLPDPQYAPGYLNDFGGHFVITPEMRTQAAQFRERLRQRQSR" CDS complement(3066017..3066580) /codon_start=1 /transl_table=11 /gene="lppV" /locus_tag="BQ2027_MB2819C" /product="PROBABLE CONSERVED LIPOPROTEIN LPPV" /note="Mb2819c, lppV, len: 187 aa. Equivalent to Rv2796c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 187 aa overlap). Probable lppV, conserved lipoprotein, similar to others from Mycobacterium tuberculosis e.g. P95009|LPPB|Rv2544|MTCY159.12c PROBABLE CONSERVED LIPOPROTEIN (220 aa), FASTA scores: opt: 168, E(): 0.00066, (22.45% identity in 196 aa overlap); and P95010|LPPA|RV2543|MTCY159.13c PROBABLE CONSERVED LIPOPROTEIN (219 aa), FASTA scores: opt: 165, E(): 0.001, (23.1% identity in 199 aa overlap). Protein product from Mb2819c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2819c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D7" /protein_id="SIU01437.1" /translation="MRWPTAWLLALVCVMATGCGPSGHGTRAGEEGPLSPEKVAELEN PLRAKPPLEDAKDQYRAAVTQLANAITALVPGLTWRTDMDTWTGCGGEYEWTRAKAAY FMIVFSGPIPDDKWLQAVQIVKDGVEQFGATGFGVMKNKPADHDVYFAGHGGVEFKFS TQKAAVLTAQSDCRISRTDTPKPSPTP" CDS complement(3066580..3068268) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2820C" /product="lipid metabolism" /note="Mb2820c, -, len: 562 aa. Equivalent to Rv2797c, len: 562 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 562 aa overlap). Conserved hypothetical ala-rich protein. C-terminus highly similar to several mycobacterial proteins e.g. AAK46927|MT2616 HYPOTHETICAL 28.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 535, E(): 4.6e-22, (42.95% identity in 263 aa overlap); P95011|Rv2542|MTCY159.14c HYPOTHETICAL 42.4 KDA PROTEIN from Mycobacterium tuberculosis (403 aa), FASTA scores: opt: 537, E(): 5e-22, (40.75% identity in 292 aa overlap) (similarity in the second half of protein); P71547|Y963_MYCTU|Rv0963c|MT0992|MTCY10D7.11 HYPOTHETICAL 28.1 KDA PROTEIN (266 aa), FASTA scores: opt: 314, E(): 5.7e-10, (39.0% identity in 254 aa overlap); etc. Contains PS00120 Lipases, serine active site. Mb2820c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010427" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2N0" /protein_id="SIU01438.1" /translation="MPLTVADIDRWNAQAVREVFHAASARAEVTFEASRQLAALSIFA NSGGKTAEAAAHHNAGIRRDLDAHGNEALAVARAADRAADGIVKVQSELAALRHAAAA AELTIDALINRVVPIPGLRSTEAQWARTLAKQTELQAELDAIMAEANAVDEELASAVN MADGDAPIPADSGPPVGPEGLTPTQLASDANEERLREERARLQAHLERLQAEYDQLSV RAARDYHNGILDGDAVGRLAALTDELSAARGRLGELDAVDEALSRAPETYLTQLQIPE DPNQQVLAAVAVGNPDTAANVSVTVPGVGSTTRGALPGMVTEARDLRSEVIRQLNAAG KPASVATIAWMGYHPPPNPLDTGSAGDLWQTMTDGQAHAGAADLSRYLQQVRANNPSG HLTVLGHSYGSLTASLALQDLDAQSAHPVNDVVFYGSPGLELYSPAQLGLDHGHAYVM QAPHDLITNLVAPLAPLHGWGLDPYLTPGFTELSSQAGFDPGGIWRDGVYAHGDYPRS FLDAAGQPQLRMSGYNLAAIAAGLPDNTVGPPLLPPILGGGMPAAPGPALRGGR" CDS complement(3068272..3068598) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2821C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2821c, -, len: 108 aa. Equivalent to Rv2798c, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap). Conserved hypothetical ala-rich protein, similar to P71545|Y965_MYCTU|Rv0965c|MT0993|MTCY10D7.09 HYPOTHETICAL 14.5 KDA PROTEIN from Mycobacterium tuberculosis (139 aa), FASTA scores: opt: 198, E(): 8e-07, (38.9% identity in 90 aa overlap). Mb2821c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y274" /protein_id="SIU01439.1" /translation="MFQISPEQWMHSAAQVTTQGEGLAVGHLSSDYRMQAAQFGWQGA SAMALNAKMDDWLDASRALLTRIGDHAFGLQEAAIQHAAAEAERAQALAQVGVSADVV AGPRGV" CDS 3068729..3069361 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2822" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb2822, -, len: 210 aa. Equivalent to Rv2799, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 210 aa overlap). Probable membrane protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3 bp insertion (*-ggt) leads to a slightly longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (210 aa versus 209 aa). Protein product from Mb2822 detected using SWATH mass spectrometry. Mb2822 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y273" /db_xref="InterPro:IPR024520" /db_xref="UniProtKB/TrEMBL:A0A1R3Y273" /protein_id="SIU01440.1" /translation="MYTPGKGPPRAGGVVFTRVRLIGGLGALTAAVVVVVGTVGWQGI PPAPTGGDAVQLRSTAAPMSTTMKSPIVATTDPSPFDPCRDIPFDVIQRLGLAYTPPE AEEGLRCHFDAGNYQMAVEPIIWRTYAQTLPPDAIETTIAGHRAAQYWVRKPTYHNSF WYSSCMVTFKTSYGVIQQSLFYSTVYSEPDVDCPSTNLQRANDLVPYYRF" CDS 3069380..3071029 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2823" /product="POSSIBLE HYDROLASE" /note="Mb2823, -, len: 549 aa. Equivalent to Rv2800, len: 549 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 549 aa overlap). Possible hydrolase (EC 3.-.-.-), an esterase (EC 3.1.1.-) or an acylase (EC 3.-.-.-). Similar, but longer in N-terminus, to esterases or acylases e.g. Q9L9D7|COCE COCAINE ESTERASE from Rhodococcus sp. MB1 'Bresler 1999' (574 aa), FASTA scores: opt: 510, E(): 3.1e-23, (33.6% identity in 571 aa overlap); Q9L3U2|STTE PUTATIVE ACYLASE from Streptomyces rochei (Streptomyces parvullus) (554 aa), FASTA scores: opt: 492, E(): 3.7e-22, (34.45% identity in 569 aa overlap); CAC49652|SMB21424 PUTATIVE ESTERASE OR ACYLASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (578 aa), FASTA scores: opt: 405, E(): 7.1e-17, (34.45% identity in 569 aa overlap); etc. Protein product from Mb2823 detected using SWATH mass spectrometry. Mb2823 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y288" /db_xref="InterPro:IPR000383" /db_xref="InterPro:IPR005674" /db_xref="InterPro:IPR008979" /db_xref="InterPro:IPR013736" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y288" /protein_id="SIU01441.1" /translation="MSTTSARPERPKLRALTGRVGGQALGGLLGLPRATTRYTVGHVR VPMRDGVQLVADHYAPATSQPVGTLLVRGPYGRRFPFSLVFARIYAARGYHVVLQSVR GTFGSGGVFEPMVNEAADGADTVAWLREQPWFTGRFGTIGLPYLGFTQWALLHDPPPE LAAAVITVGPHDFRASVWGTGSFTVNDFLGWSDLVSHQEDPGRIRAGIRQLTAPRRVA RTAATLPLGESARTLLGTGAPWFESWVEHTDRDDPFWDRLRFPAALDRVQVPVLLVGG WQDIFLRQTLQQYRHLRDRGVHVALTVGPWTHTQMLTKGLATGARESLDWLDAHLGRA PALRPSPVRVFVTGQGWRHLPDWPPATTERAWYLQPGGRLGESAPASGTPPATFRYHP ADPTPTTGGPLLSSNGGYRDDSRLATRADVLCFTGAPLTHDLCVHGNPVVELVHSSDN PYVDVFVRVSEVDAKGRSRNVSDGYRRLGDAPELVRVELDAIAHRFRADSRIRVLIAG SWFPRYARNLGTPEPILTGRQLKPATHAVHFGRSRLLLPVG" CDS complement(3071131..3071487) /codon_start=1 /transl_table=11 /gene="mazf9" /locus_tag="BQ2027_MB2824C" /product="toxin mazf9" /note="Mb2824c, -, len: 118 aa. Equivalent to Rv2801c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Conserved hypothetical protein, highly similar to Q9RWK4|DR0662 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (115 aa), FASTA scores: opt: 306, E(): 2e-15, (43.95% identity in 116 aa overlap); and similar to AAK78474|CAC0494 PEMK FAMILY OF DNA-BINDING PROTEINS from Clostridium acetobutylicum (122 aa), FASTA scores: opt: 217, E(): 7.3e-09, (33.35% identity in 117 aa overlap); P96622|YDCE YDCE PROTEIN from Bacillus subtilis (116 aa), FASTA scores: opt: 194, E(): 3.5e-07, (33.35% identity in 117 aa overlap); Q9PHH8|XFA0027 PLASMID MAINTENANCE PROTEIN from Xylella fastidiosa (108 aa), FASTA scores: opt: 188, E(): 9.1e-07, (40.85% identity in 115 aa overlap); etc. Also similar to Q10867|YJ91_MYCTU|Rv1991c|MT2046|MTCY39.28 HYPOTHETICAL 12.3 KDA PROTEIN from Mycobacterium tuberculosis (114 aa), FASTA scores: opt: 190, E(): 6.8e-07, (36.75% identity in 117 aa overlap). Mb2824c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y286" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3Y286" /protein_id="SIU01442.1" /translation="MMRRGEIWQVDLDPARGSEANNQRPAVVVSNDRANATATRLGRG VITVVPVTSNIAKVYPFQVLLSATTTGLQVDCKAQAEQIRSIATERLLRPIGRVSAAE LAQLDEALKLHLDLWS" CDS complement(3071471..3071701) /codon_start=1 /transl_table=11 /gene="mazE9" /locus_tag="BQ2027_MB2824A" /product="Possible antitoxin MazE9" /note="Mb2824A, len: 76 aa. Equivalent to Rv2801A len: 76 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 76 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible mazE9, antitoxin, part of toxin-antitoxin (TA) operon with Rv2801c (See Pandey and Gerdes, 2005; Zhu et al., 2006). This region is a possible MT-complex-specific genomic island (See Becq et al.,2007). Mb2824A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y280" /protein_id="SIU01443.1" /translation="MKLSVSLSDDDVAILDAYVKRAGLPSRSAGLQHAIRVLRYPTLE DDYANAWQEWSAAGDTDAWEQTVGDGVGDAPR" CDS complement(3071744..3072787) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2825C" /product="HYPOTHETICAL ARGININE AND ALANINE RICH PROTEIN" /note="Mb2825c, -, len: 347 aa. Equivalent to Rv2802c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Hypothetical unknown arg-, ala-rich protein. C-terminus shows some similarity with N-terminal part of hypothetical proteins Q98K84|MLR1592 from Rhizobium loti (Mesorhizobium loti) (104 aa), FASTA scores: opt: 138, E(): 0.12, (37.35% identity in 91 aa overlap); and CAC47718|SMC03294 from Rhizobium meliloti (Sinorhizobium meliloti) (114 aa), FASTA scores: opt: 128, E(): 0.53, (31.4% identity in 86 aa overlap). Equivalent to AAK47191 from Mycobacterium tuberculosis strain CDC1551 (357 aa) but shorter 10 aa. Protein product from Mb2825c detected using SWATH mass spectrometry. Mb2825c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018744" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A2" /protein_id="SIU01444.1" /translation="MARQPLEQRVARAAQAALARQRFVSAIDVLLGLGWLAPSHVDQW RQGRVDSLEQVVQANLSKITAVMAALRRWARDRGLNPSETDYVARTRDRRRLRFSVTG EDAIERAYRTHWVSPELSERAVARQSRRPDLVVIMPVNDWSCASCGGSGDLMFLEDAG PLCLDCADLGHLVFLPSGDAALTRRAKRASRLSAVVVRWSRARKRYERQGILVEAEAL ERAENECLADAEVRARRRERDEARRANEDLRLQAEFGAAIRTLFPNCPAGRAEAIARH AATRGSGRIGRSAAGRALDPEAVRLAVAASVRHIDTSFDELLMSGVDRETARHRVGEH VEEVLRDWRATSR" CDS 3072786..3073253 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2826" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2826, -, len: 155 aa. Equivalent to Rv2803, len: 155 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 155 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from other organisms, and with some similarity to C-terminal part of Rv0918|Z95210_12 hypothetical protein from Mycobacterium tuberculosis (158 aa), FASTA scores: opt: 204, E(): 9e-07, (42.35% identity in 85 aa overlap). Replaces original 2803c on other strand. Mb2826 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y477" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR014795" /db_xref="InterPro:IPR016547" /db_xref="UniProtKB/TrEMBL:A0A1R3Y477" /protein_id="SIU01445.1" /translation="MTCPSLVGLRTEAAELSYSDQPDALGVAMRERREQQNLVRPPRR NASRRINTDQTSTKYVYITYMPETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLS AAVDHAHDLLARANRIELSEAAFRRFVAALDEPDEAAPELVRLARRKSRIPPH" mobile_element 3073369..3074775 /mobile_element_type="insertion sequence:IS1604" /locus_tag="BQ2027_IS1604'" /note="IS1604', len: 1407 nt. Equivalent to IS1604, len: 1409 nt, from Mycobacterium tuberculosis strain H37Rv,(99.8% identity in 1409 nt overlap). Possible defective IS element due to frameshift." CDS complement(3073429..3074058) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2827C" /product="HYPOTHETICAL PROTEIN" /note="Mb2827c, -, len: 209 aa. Equivalent to Rv2804c, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 209 aa overlap). Hypothetical unknown protein, overlaps neighbouring ORF Rv2805|MTCY16B7.38c." /db_xref="UniProtKB/TrEMBL:A0A1R3Y322" /protein_id="SIU01446.1" /translation="MHDHQVLAARHAHQGPHVLQQRPGFVAEAPRPKATPVDLLGRAR QPRAGQHLPRRRATHPRGGHHRIQNLAVVPPHHRRQQQRGHSRRSIGSTSPSDDSASY SQRPRDVADPPVEASTLEGQEAVVTVELGGAVVDGVDDQGAGAVVPGTGHGSDEGIEE KIATETGALLLPVERQASEDEHWDRVGSGWPRPGRDGTRIRSMLPMASA" CDS 3073831..3074235 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2828" /product="Mobile element protein" /note="Mb2828, -, len: 134 aa. Equivalent to Rv2805, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 134 aa overlap). Conserved hypothetical protein, highly similar to N-terminal region of downstream ORF P71644|Rv2807|MTCY16B7.36c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 525, E(): 6.4e-29, (78.2% identity in 101 aa overlap). Also highly similar to N-terminus of other proteins: Q9KK74 HYPOTHETICAL 47.4 KDA PROTEIN from Brevibacterium linens (418 aa), FASTA scores: opt: 480, E(): 8.8e-26, (64.15% identity in 106 aa overlap); AAK40065 Rv3128c-LIKE PROTEIN from Mycobacterium celatum (423 aa), FASTA scores: opt: 218, E(): 1.2e-07, (46.05% identity in 89 aa overlap); Q981U5|MLR9230 from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 131, E(): 0.15, (29.4% identity in 126 aa overlap). Overlaps neighbouring ORF Rv2804c|MTCY16B7.39. Mb2828 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E9" /protein_id="SIU01447.1" /translation="MGRDNGKILDPVVATTGMGRSTARQMLTGPRLPGPAEQVDGRSL RPRGFSDEARALLEHVWALMGMPCGKYLVVMHDLWLPLLTAAGDLDKPLVTEASVAEL KATALPGANRMPHWAAGTLPDGFPARAVRTRT" CDS 3074232..3074423 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2829" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb2829, -, len: 63 aa. Equivalent to Rv2806, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Possible membrane protein, sharing no homology. Mb2829 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2P0" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2P0" /protein_id="SIU01448.1" /translation="MKTNPRYGPAFYSVMTVLFLALFVLNVCTHGSTLGLISTGGLAV LMGYIGYRGWSGKRHINRQ" CDS 3074622..3075776 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2830" /product="Mobile element protein" /note="Mb2830, -, len: 384 aa. Equivalent to Rv2807, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 384 aa overlap). Conserved hypothetical protein, highly similar, but shorter 35 aa, to Q9KK74 HYPOTHETICAL 47.4 KDA PROTEIN from Brevibacterium linens (418 aa), FASTA scores: opt: 1865, E(): 9.4e-116, (69.75% identity in 380 aa overlap); and with similarity with other hypothetical proteins or transposases e.g. Q981U5|MLR9230 PROTEIN from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 636, (36.05% identity in 377 aa overlap); CAC47689 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM18 from Rhizobium meliloti (Sinorhizobium meliloti) (507 aa), FASTA scores: opt: 553, E(): 6.6e-29, (33.5% identity in 370 aa overlap); etc. Also similar to Rv3128c|MTCY164.38c (336 aa) (47.2% identity in 339 aa overlap); and high similarity at N-terminal region with Rv2805|MTCY16B7.38c (79.2% identity in 101 aa overlap)." /db_xref="GOA:A0A1R3Y284" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/TrEMBL:A0A1R3Y284" /protein_id="SIU01449.1" /translation="MVSTTGMGRSTARRMLTGPGLPEPAEQVDGRRLRARGFSDDARA LLEHVWALMGMPCGKYLVVMLELWLPLVAAAGDLDKPFATEAAVAELKAMSAATVDRY LKPARERMRIKGISTTKPSPLLRNSITIHTCSDEAPKVPGVIEADTVAHCGPSLIGEF ARTLTMTDLVTGWTENASIRNNAAKWILEGIKECQQRFPFPMTVFDSDCGGEFINHDV AGWLQARDIAQTRSRPYQKNDQAHVESKNNHVVRKHAFYWRYDTGEELELLNRLWPLV SLRCNFFTPTKKPVGYTSTVNGRRKRIYDKPATPWQRLQASGVLDAQQLSTVAARIEG FNPADLTRQINAIQMQLLDLAKTKTEALATARHIDLQSLQPSINRLAKAK" CDS 3076010..3076267 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2831" /product="HYPOTHETICAL PROTEIN" /note="Mb2831, -, len: 85 aa. Equivalent to Rv2808, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Hypothetical unknown protein. Mb2831 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y283" /protein_id="SIU01450.1" /translation="MSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDA VIDVLSDALMKTFGPDWVPNDYGLKIERAIDAYLETWPIYR" CDS 3076372..3076683 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2832" /product="HYPOTHETICAL PROTEIN" /note="Mb2832, -, len: 103 aa. Equivalent to Rv2809, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 103 aa overlap). Hypothetical unknown protein. Questionable ORF. Mb2832 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A6" /protein_id="SIU01451.1" /translation="MTYAARDDTTLPKLLAQMRWVVLVDKRQLAVLLLENEGPVASAT DPLDTRGDSDYENQPVDAVERLCRRLADQAVRQWGFMQGLKQKLGPGVDVRMKLVEWN R" CDS complement(3076705..3077103) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2833C" /product="PROBABLE TRANSPOSASE" /note="Mb2833c, -, len: 132 aa. Equivalent to Rv2810c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 132 aa overlap). Probable transposase for IS1555, similar to C-terminal domain of transposases for defective IS1555 e.g. Q9LCS0|TNPA TRANSPOSASE from Arthrobacter sp. TM1 (435 aa), FASTA scores: opt: 294, E(): 1.8e-13, (55.1% identity in 98 aa overlap); Q50440|TNPA INSERTION ELEMENT TNPR AND TNPA GENE from Mycobacterium smegmatis (413 aa), FASTA scores: opt: 274, E(): 4.7e-12, (56.25% identity in 96 aa overlap); etc." /db_xref="InterPro:IPR002560" /db_xref="UniProtKB/TrEMBL:A0A1R3Y291" /protein_id="SIU01452.1" /translation="MRLQAHTGGPPVALRQETTGGPSPTNDLITEPPRHYKQQTRVRQ APALLTVSAGTGVPVVLEELAKLGRTLWRCRHDVLAYFDHHASNGPTEAINGRLEALC RNALGFRNLTHYRIRSLLHCGNLAQLIHAL" mobile_element complement(3076708..3077106) /mobile_element_type="insertion sequence:IS1555" /locus_tag="BQ2027_IS1555'" /note="IS1555', len: 399 nt. Equivalent to IS1555, len: 399 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 399 nt overlap). Probable defective IS element." gene complement(3076708..3077106) /locus_tag="BQ2027_IS1555'" CDS 3077103..3077711 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2834" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2834, -, len: 202 aa. Equivalent to Rv2811, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Conserved hypothetical protein. C-terminus equivalent to C-terminus of AAK47198|MT2878 HYPOTHETICAL 17.7 KDA PROTEIN Mycobacterium tuberculosis strain CDC1551 (178 aa), FASTA scores: opt: 609, E(): 1.5e-32, (61.0% identity in 182 aa overlap); and C-terminus highly similar to P72038|Rv3771c|MTCY13D12.05c HYPOTHETICAL 11.3 KDA PROTEIN from Mycobacterium tuberculosis (108 aa), FASTA scores: opt: 465, E(): 2.8e-23, (73.6% identity in 106 aa overlap). Also some similarity with P71962|Rv2665|MTCY441.34 HYPOTHETICAL 10.5 KDA PROTEIN from Mycobacterium tuberculosis (93 aa), FASTA scores: opt: 153, E(): 0.0057, (39.05% identity in 64 aa overlap); and Q9A6W6|CC1966 HYPOTHETICAL PROTEIN CC1966 from Caulobacter crescentus (189 aa), FASTA scores: opt: 115, E(): 2.6, (39.4% identity in 104 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y292" /protein_id="SIU01453.1" /translation="MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPA GPVELCPRRSRCTGCGVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDV ARPAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAA AIGRRFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP" gene 3077781..3079187 /locus_tag="BQ2027_IS1604'" CDS 3077782..3078462 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2835" /product="PUTATIVE TRANSPOSASE [FIRST PART]" /note="Mb2835, -, len: 226 aa. Similar to 5' end of Rv2812, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (92.4% identity in 224 aa overlap). Putative transposase for IS1604, similar to putative transposases and hypothetical proteins e.g. Q9EZM2|PUTATIVE TRANSPOSASE from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 329, E(): 3e-13, (27.05% identity in 362 aa overlap); CAC46499 PUTATIVE TRANSPOSASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (390 aa), FASTA scores: opt: 327, E(): 3.9e-13, (30.5% identity in 367 aa overlap); etc. Contains possible helix-turn-helix motif at aa 50-71 (Score 1140, +3.07 SD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2812 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (tg-*) splits Rv2812 into 2 parts, Mb2835 and Mb2836." /db_xref="GOA:A0A1R3Y2B1" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B1" /protein_id="SIU01454.1" /translation="MAVGDDEEKVRAERARAIGLFRYQLIWEAADAAHSTKQRGKMVR ELASREHTDPFGRRVRISRQTIDRWIRGWRAGGFDALVPNPRQCTPRTPAEVLELAVA LRRENPQRTAAAIRRILRTQLGWAPDERTLQRNFHRLGLTGATTGSAPAVFGRFEAEH PNALWTGDVLHGIRIDLRKTYLFAFLDDHSRLVPGYRGPCRGHGAAGRRTAPGAGLPR RAQRGVCR" CDS 3078476..3079189 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2836" /product="PUTATIVE TRANSPOSASE [SECOND PART]" /note="Mb2836, -, len: 265 aa. Equivalent to 3' end of Rv2812, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (99.623% identity in 265 aa overlap). Putative transposase for IS1604, similar to putative transposases and hypothetical proteins e.g. Q9EZM2|PUTATIVE TRANSPOSASE from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 329, E(): 3e-13, (27.05% identity in 362 aa overlap); CAC46499 PUTATIVE TRANSPOSASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (390 aa), FASTA scores: opt: 327, E(): 3.9e-13, (30.5% identity in 367 aa overlap); etc. Contains possible helix-turn-helix motif at aa 50-71 (Score 1140, +3.07 SD). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2812 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (tg-*) splits Rv2812 into 2 parts, Mb2835 and Mb2836." /db_xref="GOA:A0A1R3Y486" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR015378" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/TrEMBL:A0A1R3Y486" /protein_id="SIU01455.1" /translation="MDAWLLRACAKLGVRLVHSTPGRPQGRGKIERFFRTVREQFLVE ITGEPDVVGRHYVADLAELNRLFTAWVETVYHRSVHSETGQTPLARWSAGGPIPLPAP ETLTEAFLWEEHRRVTKTATVSLHGNRYEIDPALVGRKVELVFDPFDLTRIEVRLAGA PMGRAIPYHIGRHSHPKAKPETPTAPPKPSGIDYAQLIETAHAAELARGVNYTALTGA ADQIPGQLDLLTGQEAQPK" CDS 3079186..3079998 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2837" /product="Type II secretory pathway, component ExeA" /note="Mb2837, -, len: 270 aa. Equivalent to Rv2813, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 270 aa overlap). Conserved hypothetical protein, similar to various proteins (notably secreted proteins) e.g. Q9ZFL2 HYPOTHETICAL 30.4 KDA PROTEIN from Bacillus stearothermophilus (266 aa), FASTA scores: opt: 518, E(): 1.4e-26, (33.85% identity in 266 aa overlap); P45754|GSPA_AERHY|EXEA GENERAL SECRETION PATHWAY PROTEIN from Aeromonas hydrophila (547 aa), FASTA scores: opt: 386, E(): 1.1e-17, (32.05% identity in 265 aa overlap); Q9KPC7|VC2445 GENERAL SECRETION PATHWAY PROTEIN A from Vibrio cholerae (529 aa), FASTA scores: opt: 366, E(): 2.2e-16, (31.1% identity in 270 aa overlap); Q56674|VC0403 MANNOSE-SENSITIVE HEMAGGLUTININ D from Vibrio cholerae (281 aa), FASTA scores: opt: 317, E(): 2.1e-13, (27.85% identity in 262 aa overlap); etc. Also highly similar to AAK40072 Rv2813-LIKE PROTEIN from Mycobacterium celatum (270 aa), FASTA scores: opt: 1628, E(): 2.8e-99, (90.75% identity in 270 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y331" /protein_id="SIU01456.1" /translation="MMHKLISYYGFSRMPFGRDLAPGMLHRHSAHNEAVARIGWCIAD RRIGVITGEVGAGKTVAVRAALASLDRSRHTVIYLPDPTVGVQGIHHRIVASLGGQPL THHATLAPQAADALAAEQAERGRTPVVVVEEAHLLGYDQLEALRLLTNHDLDSSSPFA CLLIGQPTLRRRMKLGVLAALDQRIGLRYAMPPMTDTNTGSYLRHHLKLAGRDDALFS DDAIGLIHQTSRGYPRAVNNLALQALVAAFAADKAIVDESTTRTAIAEVTAD" repeat_region complement(3080147..3084550) /rpt_type=DIRECT /note="4404 bp DR, direct repeat region composed of 42 repeat_units of 36 bases pairs, one of them has been interrupted by the insertion of an IS6110 elements." repeat_region complement(3080147..3080182) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" gene complement(3080147..3084550) repeat_region complement(3080218..3080253) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080291..3080326) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080365..3080400) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080444..3080479) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080521..3080556) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080595..3080630) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080669..3080704) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080742..3080777) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080816..3080851) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080887..3080922) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3080960..3080995) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081033..3081068) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081105..3081140) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081177..3081212) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081251..3081286) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081325..3081360) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3081396..3081415) /note="3' part direct repeat, CCCCGAGAGGGGACGGAAAC, of sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region 3081413..3081415 /rpt_type=DIRECT /note="3 bp direct repeat, GGG, flanking IS element IS6110." mobile_element complement(3081416..3082770) /mobile_element_type="insertion sequence:IS6110" /locus_tag="BQ2027_IS6110" /note="IS6110, len: 1355 nt. Equivalent to IS6110, len: 1355 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1355 nt overlap)." repeat_region 3081416..3081443 /rpt_type=INVERTED /note="28 bp perfect inverted repeat, IRR,TGAACCGCCCCGGCAATGTCCGGAGACTC, flanking IS element IS6110." gene complement(3081416..3082770) /locus_tag="BQ2027_IS6110" CDS complement(3081458..3082294) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2838C" /product="PROBABLE TRANSPOSASE" /note="Mb2838c, -, len: 278 aa. Equivalent to Rv2814c, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 278 aa overlap). Probable transposase, highly similar to others e.g. P97137|Rv0796|MTV042.06 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS986/IS6110 from Mycobacterium tuberculosis (328 aa), FASTA scores: opt: 2103, E(): 6.1e-132, (100.0% identity in 312 aa overlap); etc. Start unlikely." /db_xref="GOA:P59800" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR025948" /db_xref="InterPro:IPR036397" /db_xref="InterPro:IPR038965" /db_xref="UniProtKB/Swiss-Prot:P59800" /protein_id="SIU01457.1" /translation="MPIAPSTYYDHINREPSRRELRDGELKEHISRVHAANYGVYGAR KVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARPADLVQRRFGP PAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWT RQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETINGL YKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPA AG" CDS complement(3082393..3082719) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2839C" /product="PROBABLE TRANSPOSASE" /note="Mb2839c, -, len: 108 aa. Equivalent to Rv2815c, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap). Probable transposase, identical from aa 51 with P19772|YIA2_MYCTU PUTATIVE TRANSPOSASE (INSERTION ELEMENT IS986) from Mycobacterium tuberculosis (59 aa), FASTA scores: opt: 365, E(): 1.1e-19, (96.6% identity in 59 aa overlap); and other transposases." /db_xref="GOA:P59801" /db_xref="InterPro:IPR002514" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/Swiss-Prot:P59801" /protein_id="SIU01458.1" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(3082743..3082770) /rpt_type=INVERTED /note="28 bp perfect inverted repeat, IRL,TGAACCGCCCCGGCAATGTCCGGAGACTC, flanking IS element IS6110." repeat_region 3082771..3082773 /rpt_type=DIRECT /note="3 bp direct repeat, GGG, flanking IS element IS6110." repeat_region complement(3082774..3082789) /note="5' part direct repeat, GTCGTCAGACCCAAAA, of sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3082830..3082865) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3082905..3082940) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3082978..3083013) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083054..3083089) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083131..3083166) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083203..3083238) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083275..3083310) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083348..3083383) /note="direct repeat, 32 out of 36 bp identical to sequence CGGATCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083409..3083444) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083486..3083521) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083558..3083593) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083633..3083668) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083707..3083742) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083781..3083816) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083852..3083887) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3083927..3083962) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084001..3084036) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084076..3084111) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084148..3084183) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084219..3084254) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084292..3084327) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084364..3084399) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084441..3084476) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3084515..3084550) /note="direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" CDS complement(3084599..3084940) /codon_start=1 /transl_table=11 /gene="cas2" /locus_tag="BQ2027_MB2840C" /product="CRISPR-associated protein Cas2" /note="Mb2840c, -, len: 113 aa. Equivalent to Rv2816c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 113 aa overlap). Conserved hypothetical protein, highly similar in part to N-terminus of several proteins e.g. O28403|AF1876 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (94 aa), FASTA scores: opt: 137, E(): 0.0022, (47.55% identity in 61 aa overlap); Q97Y85|SSO8090 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (88 aa), FASTA scores: opt: 124, E(): 0.02, (37.3% identity in 59 aa overlap); etc." /db_xref="GOA:A0A1R3Y289" /db_xref="InterPro:IPR019199" /db_xref="InterPro:IPR021127" /db_xref="UniProtKB/TrEMBL:A0A1R3Y289" /protein_id="SIU01459.1" /translation="MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLA KILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRG RLVSAEEFVFF" CDS complement(3084941..3085957) /codon_start=1 /transl_table=11 /gene="cas1" /locus_tag="BQ2027_MB2841C" /product="CRISPR-associated protein Cas1" /note="Mb2841c, -, len: 338 aa. Equivalent to Rv2817c, len: 338 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 338 aa overlap). Conserved hypothetical protein, showing similarity with O30236|AF2435 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (322 aa), FASTA scores: opt: 397, E(): 2.4e-19, (28.2% identity in 298 aa overlap); Q9KFX9|BH0341 HYPOTHETICAL PROTEIN from Bacillus halodurans (343 aa), FASTA scores: opt: 337, E(): 2.8e-15, (27.35% identity in 300 aa overlap); Q9X2B7|TM1797 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (319 aa), FASTA scores: opt: 321, E(): 3.3e-14, (26.5% identity in 268 aa overlap); etc. Protein product from Mb2841c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y290" /db_xref="InterPro:IPR002729" /db_xref="InterPro:IPR042206" /db_xref="InterPro:IPR042211" /db_xref="UniProtKB/TrEMBL:A0A1R3Y290" /protein_id="SIU01460.1" /translation="MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFG RPTMTTPFIVEMLKRERDIQLFTTDGHYQGRISTPDVSYAPRLRQQVHRTDDPAFCLS LSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAELNGFEGNAAK AYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLYKNIIGAIERHSLNAYIGF LHQDSRGHATLASDLMEVWRAPIIDDTVLRLIADGVVDTRAFSKNSDTGAVFATREAT RSIARAFGNRIARTATYIKGDPHRYTFQYALDLQLQSLVRVIEAGHPSRLVDIDITSE PSGA" CDS complement(3086207..3087118) /codon_start=1 /transl_table=11 /gene="csm6" /locus_tag="BQ2027_MB2842C" /product="CRISPR-associated protein Csm6" /note="Mb2842c, -, len: 303 aa. Equivalent to 5' end of Rv2818c, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 303 aa overlap). Hypothetical unknown protein, equivalent to AAK47210 from Mycobacterium tuberculosis strain CDC1551 (430 aa) but shorter 48 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (c-t) introduces a premature stop codon that leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (303 aa versus 382 aa). Protein product from Mb2842c detected using SWATH mass spectrometry. Mb2842c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013489" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B3" /protein_id="SIU01461.1" /translation="MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRF DLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPA RALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVS YDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYI SALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEI RCALKHPPKSPNAEWYLYTKDWLALLR" CDS complement(3087214..3088341) /codon_start=1 /transl_table=11 /gene="csm5" /locus_tag="BQ2027_MB2843C" /product="CRISPR-associated protein, Csm5 family" /note="Mb2843c, -, len: 375 aa. Equivalent to Rv2819c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 375 aa overlap). Hypothetical unknown protein (see citations below). Mb2843c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005537" /db_xref="InterPro:IPR010173" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01462.1" /translation="MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDME LLYADIPAHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEP RRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQ PVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDL LICQKMDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAE TAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGK VVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIRRAE" CDS complement(3088338..3089246) /codon_start=1 /transl_table=11 /gene="csm4" /locus_tag="BQ2027_MB2844C" /product="CRISPR-associated RAMP protein, Csm4 family" /note="Mb2844c, -, len: 302 aa. Equivalent to Rv2820c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Hypothetical unknown protein." /db_xref="InterPro:IPR005510" /db_xref="InterPro:IPR040932" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A0" /protein_id="SIU01463.1" /translation="MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMG GQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQ LGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLL ATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTS LPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGIL DVSLGGNHPVYSYARPLFLALPESAA" CDS complement(3089227..3089937) /codon_start=1 /transl_table=11 /gene="csm3" /locus_tag="BQ2027_MB2845C" /product="CRISPR-associated RAMP Csm3" /note="Mb2845c, -, len: 236 aa. Equivalent to Rv2821c, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins e.g. Q9X2C9|TM1809 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (247 aa), FASTA scores: opt: 318, E(): 8.2e-15, (39.45% identity in 213 aa overlap); O27152|MTH1080 CONSERVED HYPOTHETICAL PROTEIN from Methanothermobacter thermautotrophicus (245 aa), FASTA scores: opt: 294, E(): 3.9e-13, (34.8% identity in 224 aa overlap); BAB59251|TVG0114661 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (229 aa), FASTA scores: opt: 252, E(): 3.3e-10, (33.8% identity in 225 aa overlap); etc. Protein product from Mb2845c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR005537" /db_xref="InterPro:IPR013412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C4" /protein_id="SIU01464.1" /translation="MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLS RLPMIPGTSLKGKVRTLLSRQYGADTETFYRKPNEDHAHIRRLFGDTEEYMTGRLVFR DTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEFAFSLVYEVSF GTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSGTRGYGQVKFSNLKARAAV GALDGSLLEKLNHELAAV" CDS complement(3089947..3090321) /codon_start=1 /transl_table=11 /gene="csm2" /locus_tag="BQ2027_MB2846C" /product="CRISPR-associated protein, Csm2 family" /note="Mb2846c, -, len: 124 aa. Equivalent to Rv2822c, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Hypothetical unknown protein." /db_xref="InterPro:IPR010149" /db_xref="UniProtKB/TrEMBL:A0A1R3Y492" /protein_id="SIU01465.1" /translation="MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFD EAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGL LRFCRYMEALAAYKKYLDPKDK" CDS complement(3090318..3092756) /codon_start=1 /transl_table=11 /gene="cas10" /locus_tag="BQ2027_MB2847C" /product="CRISPR-associated protein, Csm1 family" /note="Mb2847c, -, len: 812 aa. Equivalent to Rv2823c, len: 809 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 812 aa overlap). Conserved hypothetical protein, similar in part to others e.g. Q9X2D1|TM1811Thermotoga maritima (717 aa), FASTA scores: opt: 401, E(): 3.6e-18, (27.15% identity in 773 aa overlap); O27154|MTH1082 CONSERVED HYPOTHETICAL PROTEIN from Methanothermobacter thermautotrophicus (822 aa), FASTA scores: opt: 306, E(): 6e-12, (25.55% identity in 872 aa overlap); Q59066|MJ1672 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (800 aa), FASTA scores: opt: 302, E(): 1.1e-11, (24.9% identity in 812 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp insertion (*-cggcgatgt) leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv." /db_xref="InterPro:IPR000160" /db_xref="InterPro:IPR003607" /db_xref="InterPro:IPR013408" /db_xref="InterPro:IPR041062" /db_xref="UniProtKB/TrEMBL:A0A1R3Y339" /protein_id="SIU01466.1" /translation="MNPQLIEAIIGCLLHDIGKPVQRAALGYPGRHSAIGRAFMKKVW LRDSRNPSQFTDEVDEADIGVSDRRILDAISYHHSSALRTAAENGRLAADAPAYIAYI ADNIAAGTDRRKADSDDGHGASTWDPDTPLYSMFNRFGSGTANLAFAPEMLDDRKPIN IPSPRRIEFDKDRYAAIVNKLKAILVDLERSDTYLASLLNVLEATLSFVPSSTDASEV VDVSLFDHLKLTGALGACIWHYLQATGQSDFKSALFDKQDTFYNEKAFLLTTFDVSGI QDFIYTIHSSGAAKMLRARSFYLEMLTEHLIDELLARVGLSRANLNYSGGGHAYLLLP NTESARKSVEQFEREANDWLLENFATRLFIATGSVPLAANDLMRRPNESASQASNRAL RYSGLYRELSEQLSAKKLARYSADQLRELNSRDHDGQKGDRECSVCHTVNRTVSADDE PKCSLCQALTAASSQIQSESRRFLLISDGATKGLPLPFGATLTFCSRADADKALQQPQ TRRRYAKNKFFAGECLGTGLWVGDYVAQMEFGDYVKRASGIARLGVLRLDVDNLGQAF THGFMEQGNGKFNTISRTAAFSRMLSLFFRQHINYVLARPKLRPITGDDPARPREATI IYSGGDDVFVVGAWDDVIEFGIELRERFHEFTQGKLTVSAGIGMFPDKYPISVMAREV GDLEDAAKSLPGKNGVALFDREFTFGWDELLSKVIEEKYRHIADYFSGNEERGMAFIY KLLELLAERDDRITKARWVYFLTRMRNPTGDTAPFQQFANRLHQWFQDPTDAKQLKTA LHLYIYRTRKEESE" CDS complement(3092753..3093697) /codon_start=1 /transl_table=11 /gene="cas6" /locus_tag="BQ2027_MB2848C" /product="CRISPR-associated endoribonuclease Cas6" /note="Mb2848c, -, len: 314 aa. Equivalent to Rv2824c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 314 aa overlap). Hypothetical unknown protein. Protein product from Mb2848c detected using SWATH mass spectrometry. Mb2848c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2H0" /db_xref="InterPro:IPR010156" /db_xref="InterPro:IPR019267" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H0" /protein_id="SIU01467.1" /translation="MAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVG FSHRGDRRMTEPLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVP VNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRS LEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAI VDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYI AALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP" CDS complement(3093875..3094522) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2849C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2849c, -, len: 215 aa. Equivalent to Rv2825c, len: 215 aa, from Mycobacterium tuberculosis strain H37Rv, (97.7% identity in 215 aa overlap). Conserved hypothetical protein, similar to Q9RY53|DR0097 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (189 aa), FASTA scores: opt: 261, E(): 8e-11, (33.5% identity in 176 aa overlap); and shows some similarity with N-terminus of O27278|MTH1210 MRR RESTRICTION SYSTEM RELATED PROTEIN from Methanothermobacter thermautotrophicus (340 aa), FASTA scores: opt: 133, E(): 0.091, (28.55% identity in 112 aa overlap). Equivalent to AAK47217 from Mycobacterium tuberculosis strain CDC1551 (246 aa) but shorter 31 aa; and equivalent to upstream ORF P71624|Rv2828c|MTCY16B7.14 from Mycobacterium tuberculosis strain H37Rv (alias AAK47221 from strain CDC1551) (181 aa), FASTA scores: opt: 1169, E(): 8.5e-74, (98.35% identity in 181 aa overlap). Mb2849c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008307" /db_xref="InterPro:IPR014923" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Q8" /protein_id="SIU01468.1" /translation="MELPGAKRLGDDRRPLGTLRCWRHSDIGPARGIVVTPALKEWSA AVHALLDGRQTVLLRKGGIGEKRFEVAAHEFLLFPTVAHSHAERVRPEHRDLLGPAAA DSTDECVLLRAAAKVVAALPVNRPEGLDAIEDLHIWTAESVRADRLDFRPKHKLAVLV VCAIPLAEPVRLARRPEYGGCTSWVQLPLTPQLAEPVHDEAALAEVAARVREAVG" CDS complement(3094692..3095576) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2850C" /product="HYPOTHETICAL PROTEIN" /note="Mb2850c, -, len: 294 aa. Equivalent to Rv2826c, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 294 aa overlap). Hypothetical unknown protein. Protein product from Mb2850c detected using SWATH mass spectrometry. Mb2850c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014942" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A1" /protein_id="SIU01469.1" /translation="MAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGD NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQST RGDGRHWQLRVRHTELGEPRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLP VVAEAEACAEKLARYRRVALARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRG TRPLRVEDVLAARSEHDFQPDSIGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAA CDERHRREVENALAVLRS" CDS complement(3095579..3096466) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2851C" /product="HYPOTHETICAL PROTEIN" /note="Mb2851c, -, len: 295 aa. Equivalent to Rv2827c, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Hypothetical unknown protein, equivalent to AAK47219 from Mycobacterium tuberculosis strain CDC1551 (315 aa) but shorter 20 aa. Protein product from Mb2851c detected using SWATH mass spectrometry. Mb2851c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018547" /db_xref="InterPro:IPR025159" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y299" /protein_id="SIU01470.1" /translation="MVSPAGADRRIPTWASRVVSGLARDRPVVVTKEDLTQRLTEAGC GRDPDSAIRELRRIGWLVQLPVKGTWAFIPPGEAAISDPYLPLRSWLARDQNAGFMLA GASAAWHLGYLDRQPDGRIPIWLPPAKRLPDGLASYVSVVRIPWNAADTALLAPRPAL LVRRRLDLVAWATGLPALGPEALLVQIATRPASFGPWADLVPHLDDLVADCSDERLER LLSGRPTSAWQRASYLLDSGGEPARGQALLAKRHTEVMPVTRFTTAHSRDRGESVWAP EYQLVDELVVPLLRVIGKA" CDS complement(3096771..3097316) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2852C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2852c, -, len: 181 aa. Equivalent to Rv2828c, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 181 aa overlap). Conserved hypothetical protein, similar to Q9RY53|DR0097 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (189 aa), FASTA scores: opt: 267, E(): 1.9e-11, (34.1% identity in 176 aa overlap); and shows some similarity with N-terminus of O27278|MTH1210 MRR RESTRICTION SYSTEM RELATED PROTEIN from Methanothermobacter thermautotrophicus (340 aa), FASTA scores: opt: 133, E(): 0.07, (28.55% identity in 112 aa overlap). Also equivalent to downstream ORF P71627|Rv2825c|MTCY16B7.17 from Mycobacterium tuberculosis strain H37Rv (alias AAK47217 from strain CDC1551, 246 aa) (215 aa), FASTA scores: opt: 1173, E(): 8.3e-75, (98.9% identity in 181 aa overlap). Mb2852c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008307" /db_xref="InterPro:IPR014923" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C7" /protein_id="SIU01471.1" /translation="MTPALKEWSAAVHALLDGRQTVLLRKGGIGEKRFEVAAHEFLLF PTVAHSHAERVRPAHRDLLGPAAADSTDECVLLRAAAKVVAALPVNRPEGLDAIEDLH IWTAESVRADRLDFRPKHRLAVLVVSAIPLAEPVRLARTPEYGGCTSWVQLPVTPTLA APVHDEAALAEVAARVREAVG" CDS complement(3097313..3097582) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2852A" /product="Conserved hypothetical protein" /note="Mb2852A, len: 89 aa. Equivalent to Rv2828A len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved hypothetical protein,present in many mycobacteria. Equivalent to BCG2848c and Mb2852A (100% identity to both in 89 aa overlap),Mb2852A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR018735" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B0" /protein_id="SIU01472.1" /translation="MCRNITELRGLQPPATPVEIAAAARQYVRKVSGITHPSAATAEA FEAAVAEVTATTTRLLDALPPRRQPPKTVPPLRRPDVAARLAGSR" CDS complement(3097603..3097995) /codon_start=1 /transl_table=11 /gene="vapc22" /locus_tag="BQ2027_MB2853C" /product="possible toxin vapc22" /note="Mb2853c, -, len: 130 aa. Equivalent to Rv2829c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Conserved hypothetical protein similar to AAK65872|SMA2253 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (125 aa), FASTA scores: opt: 171, E(): 7.7e-05, (34.9% identity in 129 aa overlap); and shows some similarity with other proteins e.g. Q9AH69 HYPOTHETICAL 14.7 KDA PROTEIN from Neisseria meningitidis (128 aa), FASTA scores: opt: 148, E(): 0.0031, (28.1% identity in 121 aa overlap). Protein product from Mb2853c detected using SWATH mass spectrometry. Mb2853c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2A9" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="InterPro:IPR041705" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2A9" /protein_id="SIU01473.1" /translation="MTTVLLDSHVAYWWSAEPQRLSMAASQAIEHADELAVAAISWFE LAWLAEQERIQLAIPVLSWLQQLAEHVRTVGITPSVAATAVALPSSFPGDPADRLIYA TAIEHGWRLVTKDRRLRSHRHPRPVTVW" CDS complement(3097992..3098207) /codon_start=1 /transl_table=11 /gene="vapb22" /locus_tag="BQ2027_MB2854C" /product="possible antitoxin vapb22" /note="Mb2854c, -, len: 71 aa. Equivalent to Rv2830c, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 71 aa overlap). Hypothetical protein, some similarity to Z97182|MTCY19H5.26|Rv0596c Hypothetical protein from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 88, E(): 1.3, (41.7% identity in 36 aa overlap); and to PHD_BPP1|Q06253 bacteriophage P1 phd gene (73 aa), FASTA scores: opt: 79, E(): 3.8, (35.9% identity in 39 aa overlap). Mb2854c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D2" /protein_id="SIU01474.1" /translation="MTATEVKAKILSLLDEVAQGEEIEITKHGRTVARLVAATGPHAL KGRFSGVAMAAVDDDELFTTGVSWNVS" CDS 3098254..3099003 /codon_start=1 /transl_table=11 /gene="echA16" /locus_tag="BQ2027_MB2855" /product="PROBABLE ENOYL-COA HYDRATASE ECHA16 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb2855, echA16, len: 249 aa. Equivalent to Rv2831, len: 249 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 249 aa overlap). Probable echA16, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. O23468|AT4G16210 from Arabidopsis thaliana (Mouse-ear cress) (244 aa), FASTA scores: opt: 491, E(): 7.3e-25, (42.1% identity in 190 aa overlap); Q98LI4|MLL1009 from Rhizobium loti (Mesorhizobium loti) (258 aa), FASTA scores: opt: 491, E(): 7.6e-25, (40.75% identity in 248 aa overlap); O07137|ECH8_MYCLE|ML2402|MLCB1306.05c from Mycobacterium leprae (257 aa), FASTA scores: opt: 478, E(): 5.3e-24, (38.05% identity in 226 aa overlap); P76082|PAAF_ECOLI|B1393 from scherichia coli strain K12 (255 aa), FASTA scores: opt: 439, E(): 1.9e-21, (37.55% identity in 221 aa overlap); etc. Also similar to O53418|ECH8_MYCTU|ECHA8|Rv1070c|MT1100|MTV017.23c from Mycobacterium tuberculosis (257 aa), FASTA scores: opt: 471, E(): 1.5e-23, (38.05% identity in 226 aa overlap). Protein product from Mb2855 detected using shotgun mass spectrometry. Mb2855 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y498" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y498" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01475.1" /translation="MTDDILLIDTDERVRTLTLNRPQSRNALSAALRDRFFAALADAE ADDDIDVVILTGADPVFCAGLDLKELAGQTALPDISPRWPAMTKPVIGAINGAAVTGG LELALYCDILIASEHARFADTHARVGLLPTWGLSVRLPQKVGIGLARRMSLTGDYLSA TDALRAGLVTEVVAHDQLLPTARRVAASIVGNNQNAVRALLASYHRIDESQTAAGLWL EACAAKQFCTSGDTIAANREAVLQRGRAQVR" CDS complement(3099082..3100164) /codon_start=1 /transl_table=11 /gene="ugpC" /locus_tag="BQ2027_MB2856C" /product="PROBABLE Sn-GLYCEROL-3-PHOSPHATE TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER UGPC" /note="Mb2856c, ugpC, len: 360 aa. Equivalent to Rv2832c, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 360 aa overlap). Probable ugpC, Sn-glycerol-3-phosphate transport ATP-binding protein ABC transporter (see first citation below), similar to others: CAC48805 PROBABLE GLYCEROL-3-PHOSPHATE ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (349 aa), FASTA scores: opt: 1018, E(): 4.1e-53, (48.6% identity in 356 aa overlap); Q98G42|MLL3499|UGPC SN-GLYCEROL-3-PHOSPHATE TRANSPORT ATP-BINDING PROTEIN from Rhizobium loti (Mesorhizobium loti) (366 aa), FASTA scores: opt: 1016, E(): 5.6e-53, (48.5% identity in 367 aa overlap). But also highly similar to many msiK proteins, ABC transporter ATP-binding proteins possibly involved in transport of cellolbiose and maltose (see second citation below) e.g. P96483|MSIK MSIK PROTEIN from Streptomyces reticuli (see citation below) (377 aa), FASTA scores: opt: 1277, E(): 1.9e-68, (58.05% identity in 379 aa overlap); Q9L0Q1|MSIK ABC TRANSPORTER ATP-BINDING PROTEIN from Streptomyces coelicolor (378 aa), FASTA scores: opt: 1276, E(): 2.1e-68, (57.65% identity in 380 aa overlap); Q54333|MSIK from Streptomyces lividans (314 aa), FASTA scores: opt: 1217, E(): 5.9e-65, (63.7% identity in 292 aa overlap); and other ABC-TYPE SUGAR TRANSPORT PROTEINS. Also highly similar to O53482|Rv2038c|MTV018.25c ABC-TYPE SUGAR TRANSPORT PROTEIN from Mycobacterium tuberculosis (357 aa), FASTA scores: opt: 1248, E(): 9.4e-67, (56.8% identity in 354 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONG TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb2856c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y349" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008995" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR040582" /db_xref="UniProtKB/TrEMBL:A0A1R3Y349" /protein_id="SIU01476.1" /translation="MANVQYSAVTQRYPGADAPTVDNLDLDIADGEFLVLVGPSGCGK STTLRVLAGLEPIESGRISIGDVDVTHLPPRARDVAMVFQNYALYPNMTVAANMGFAL RNAGMSRADTRRRVLEVADMLELTDLLDRKPAKLSGGQRQRVAMGRAIVRRPRVFCMD EPLSNLDAKLRVSTRSQISGLQRRLGTTTVYVTHDQVEAMTMGDRVAVLKDGVLQQVD TPRALYDDPVNTFVATFIGAPAMNLIDAAVAHGVVRAPDLAIPVPDPAAERVLVGVRP ESWDVASIGTPGSLTVHVELVEELGFESFVYATPVDQRGWSSRAPRIVFRTDRRTAVR VGESLAIVPHSQEVRLFNSRTETRLR" CDS complement(3100157..3101467) /codon_start=1 /transl_table=11 /gene="ugpB" /locus_tag="BQ2027_MB2857C" /product="PROBABLE Sn-GLYCEROL-3-PHOSPHATE-BINDING LIPOPROTEIN UGPB" /note="Mb2857c, ugpB, len: 436 aa. Equivalent to Rv2833c, len: 436 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 436 aa overlap). Probable ugpB, Sn-glycerol-3-phosphate binding lipoprotein component of Sn-glycerol-3-phosphate transport system (see citation below), similar to various transporters substrate-binding periplasmic proteins e.g. Q9KDY2|BH1079 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER (GLYCEROL-3-PHOSPHATE BINDING PROTEIN) from Bacillus halodurans (459 aa), FASTA scores: opt: 357, E(): 3.1e-14, (23.4% identity in 406 aa overlap); P72397|MALE PUTATIVE MALTOSE-BINDING PROTEIN from Streptomyces coelicolor (423 aa), FASTA scores: opt: 318, E(): 7e-12, (23.7% identity in 430 aa overlap); AAK78409|CAC0429 GLYCEROL-3-PHOSPHATE ABC-TRANSPORTER PERIPLASMIC COMPONENT from Clostridium acetobutylicum (447 aa), FASTA scores: opt: 305, E(): 4.5e-11, (27.15% identity in 438 aa overlap); P10904|UGPB_ECOLI|B3453 GLYCEROL-3-PHOSPHATE-BINDING PERIPLASMIC PROTEIN PRECURSOR from Escherichia coli strain K12 (438 aa); etc. Contains signal sequence and appropriately positioned prokaryotic lipoprotein attachment site (PS00013)." /db_xref="InterPro:IPR006059" /db_xref="InterPro:IPR006311" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H5" /protein_id="SIU01477.1" /translation="MDPLNRRQFLALAAAAAGVTAGCAGMGGGGSVKSGSGPIDFWSS HPGQSSAAERELIGRFQDRFPTLSVKLIDAGKDYDEVAQKFNAALIGTDVPDVVLLDD RWWFHFALIGVLTALDDLFGQVGVDTTDYVDSLLADYEFNGRHYAVPYARSTPLFYYN KAAWQQAGLPDRGLQSWSEFDEWGPELQRVVGAGRSAHGWANADLISWTFQGPNWAFG GAYSDKWTLTLTEPATIAAGNFYRNSIHGKGYAAVANDIANEFATGILASAVASTGSL PGITASARFDFGAAPLPTGPDAAPACPTGGAGLAIPAKLSEERKVNALKFIAFVTNPT NTAYFSQQTGYLPVRKSAVDDASERHYLADNPRARVALDQLPHTRTQDYARVFLPGGD RIISAGLESIGLRGADVTKTFTNIQKRLQVILDRQIMRKLAGHG" CDS complement(3101470..3102297) /codon_start=1 /transl_table=11 /gene="ugpE" /locus_tag="BQ2027_MB2858C" /product="PROBABLE Sn-GLYCEROL-3-PHOSPHATE TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER UGPE" /note="Mb2858c, ugpE, len: 275 aa. Equivalent to Rv2834c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Probable ugpE, Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter (see citation below), similar to various permeases e.g. Q9KDY3|BH1078 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER from Bacillus halodurans (270 aa), FASTA scores: opt: 620, E(): 4.3e-32, (34.7% identity in 268 aa overlap); Q9X0K6|TM1122 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER PERMEASE PROTEIN from Thermotoga maritima (276 aa), FASTA scores: opt: 605, E(): 3.9e-31, (32.5% identity in 274 aa overlap); AAG58557|UGPE SN-GLYCEROL 3-PHOSPHATE TRANSPORT SYSTEM (INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain O157:H7 and EDL933 (281 aa), FASTA scores: opt: 574, E(): 3.7e-29, (32.95% identity in 264 aa overlap); P10906|UGPE_ECOLI|B3451 SN-GLYCEROL-3-PHOSPHATE TRANSPORT SYSTEM PERMEASE PROTEIN from Escherichia coli strain K12 (281 aa), FASTA scores: opt: 569, E(): 7.6e-29, (32.6% identity in 264 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /db_xref="GOA:A0A1R3Y2R8" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R8" /protein_id="SIU01478.1" /translation="MTPDRLRSSVGYAAMLLVVTLIAGPLLFVFFTSFKDQPDIYAQP TSWWPLRWYPQNYRTATEQIPFWTFLRNSLIITSVLAVVKFTLGVLSAFGLVFVRFPG RTAVFLVIIAALMVPNQITVISNYALISHLGLRNTFAGIILPLAGVAFGTFLMRNHFL SLPAEIIEAARMDGARWWQLLLRVVLPMSRPTMVAVGVITVVNEWNEYLWPFLMSDDE SVAPLPIGLTFLQQAEGVTNWGPVMAVTLLAMLPILLVFIALQRQMIKGLTSGAVKG" CDS complement(3102294..3102836) /codon_start=1 /transl_table=11 /gene="ugpAb" /locus_tag="BQ2027_MB2859C" /product="PROBABLE Sn-GLYCEROL-3-PHOSPHATE TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER UGPAB [SECOND PART]" /note="Mb2859c, ugpAb, len: 180 aa. Equivalent to 3' end of Rv2835c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 180 aa overlap). Probable ugpA, Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter (see citation below), similar to various permeases e.g. Q9RK71|SCF11.19 PROBABLE SUGAR TRANSPORTER INNER MEMBRANE PROTEIN from Streptomyces coelicolor (316 aa), FASTA scores: opt: 643, E(): 3.1e-35, (38.85% identity in 291 aa overlap); Q9KDY4|BH1077 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (315 aa), FASTA scores: opt: 548, E(): 6.2e-29, (31.5% identity in 295 aa overlap); AAK78407|CAC0427 GLYCEROL-3-PHOSPHATE ABC-TRANSPORTER, PERMEASE COMPONENT from Clostridium acetobutylicum (304 aa), FASTA scores: opt: 538, E(): 2.8e-28, (29.1% identity in 292 aa overlap); etc. Contains PS00062 Aldo/keto reductase family signature 2, and PS00402 Binding-protein-dependent transport systems inner membrane comp signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, ugpA exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*) splits ugpA into 2 parts, ugpAa and ugpAb." /db_xref="GOA:A0A1R3Y2B5" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B5" /protein_id="SIU01479.1" /translation="MISGAAVGLAAQFVFDPHFGLIQDLLRRIGVGVPDFYQDARWAL FMVTITYVWKNLGYTFVIYLAALQGVRRDLLEAAEIDGASRWAVFRRVLLPQLRPTTF FLSITVLINSLQVFDVINVMTRGGPEGTGTTTMVYQVYVETFRNFRAGYGATVATIMF LVLLAVTYYQVRVMDRGQRQ" CDS complement(3102833..3103204) /codon_start=1 /transl_table=11 /gene="ugpAa" /locus_tag="BQ2027_MB2860C" /product="PROBABLE Sn-GLYCEROL-3-PHOSPHATE TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER UGPAA [FIRST PART]" /note="Mb2860c, ugpAa, len: 123 aa. Similar to 5' end of Rv2835c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Probable ugpA, Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter (see citation below), similar to various permeases e.g. Q9RK71|SCF11.19 PROBABLE SUGAR TRANSPORTER INNER MEMBRANE PROTEIN from Streptomyces coelicolor (316 aa), FASTA scores: opt: 643, E(): 3.1e-35, (38.85% identity in 291 aa overlap); Q9KDY4|BH1077 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (315 aa), FASTA scores: opt: 548, E(): 6.2e-29, (31.5% identity in 295 aa overlap); AAK78407|CAC0427 GLYCEROL-3-PHOSPHATE ABC-TRANSPORTER, PERMEASE COMPONENT from Clostridium acetobutylicum (304 aa), FASTA scores: opt: 538, E(): 2.8e-28, (29.1% identity in 292 aa overlap); etc. Contains PS00062 Aldo/keto reductase family signature 2, and PS00402 Binding-protein-dependent transport systems inner membrane comp signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, ugpA exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (t-*) splits ugpA into 2 parts, ugpAa and ugpAb. Mb2860c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2B6" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2B6" /protein_id="SIU01480.1" /translation="MAAPQRARLRSSKERVRDYALFVVLVGPNVALLLLFVYRPLADN IRLSFFDWNVSDPSARFVGLSNYTEWFTRSDTRQIVFNTAVSPVPRWSARWCWGWRWR CCSIDRCVDETWCAPLFSRRS" CDS complement(3103291..3104610) /codon_start=1 /transl_table=11 /gene="dinF" /locus_tag="BQ2027_MB2861C" /product="POSSIBLE DNA-DAMAGE-INDUCIBLE PROTEIN F DINF" /note="Mb2861c, dinF, len: 439 aa. Equivalent to Rv2836c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 439 aa overlap). Possible dinF, DNA-damage-inducible protein F, integral membrane protein, similar to others e.g. BAB38450|ECS5027|AAG59243 from Escherichia coli strain O157:H7 (459 aa), FASTA scores: opt: 501, E(): 2.7e-21, (29.55% identity in 443 aa overlap); P28303|DINF_ECOLI|B4044 from Escherichia coli strain K12 (459 aa), FASTA scores: opt: 491, E(): 1e-20, (29.35% identity in 443 aa overlap); Q98B90|MLR5680 from Rhizobium loti (Mesorhizobium loti) (471 aa), FASTA scores: opt: 466, E(): 2.7e-19, (30.7% identity in 433 aa overlap); etc. But also similar or highly similar to other hypothetical proteins e.g. Q9X8U6|SCH24.32c HYPOTHETICAL 46.3 KDA PROTEIN from Streptomyces coelicolor (448 aa), FASTA scores: opt: 981, E(): 1.1e-48, (42.35% identity in 437 aa overlap). Contains PS00213 Lipocalin signature. Mb2861c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2D3" /db_xref="InterPro:IPR002528" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D3" /protein_id="SIU01481.1" /translation="MSQVGHRAGGRQIAQLALPALGVLAAEPLYLLFDIAVVGRLGAI SLAGLAIGSLVLGLVGSQATFLSYGTTARAARRYGAGNRVAAVTEGVQATWLALGLGA LVVVVVEATATPLVSAIASGDGITAAALPWLRIAILGTPAILVSLAGNGWLRGVQDTV RPLRYVVAGFGSSALLCPLLVYGWLGLPRWGLTGSAVANLVGQWLAALLFAGALLAER VSLRPDRAVLGAQLMMARDLIVRTLAFQVCYVSAAAVAARFGAAALAAHQVVLQLWGL LALVLDSLAIAAQSLVGAALGAGDAGHAKAVAWRVTAFSLLAAGILAAALGLGSSVLP GLFTDDRSVLAAIGVLWWFMVVQLPFAGIVFAVDGVLLGAGDAAFMRTATVASALVGF LPLVWLSLAYGWGLAGIWSGLGTFIVLRLIFVGWRAYSGRWAVTGAA" CDS complement(3104617..3105627) /codon_start=1 /transl_table=11 /gene="nrnA" /locus_tag="BQ2027_MB2862C" /product="Cyclic-di-AMP phosphodiesterase MSMEG_2630" /note="Mb2862c, -, len: 336 aa. Equivalent to Rv2837c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Conserved hypothetical protein, showing some similarity with other proteins e.g. O67552|AQ_1630 HYPOTHETICAL 36.2 KDA PROTEIN from Aquifex aeolicus (325 aa), FASTA scores: opt: 498, E(): 3.6e-25, (32.8% identity in 314 aa overlap); Q9X1T1|TM1595 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (333 aa), FASTA scores: opt: 482, E(): 4.1e-24, (34.85% identity in 304 aa overlap); Q9RW43|DR0826 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (338 aa), FASTA scores: opt: 444, E(): 1.3e-21, (33.85% identity in 331 aa overlap); etc. Equivalent to AAK47229 from Mycobacterium tuberculosis strain CDC1551 (316 aa) but longer 20 aa. Protein product from Mb2862c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2862c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2C0" /db_xref="InterPro:IPR001667" /db_xref="InterPro:IPR003156" /db_xref="InterPro:IPR038763" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C0" /protein_id="SIU01482.1" /translation="MTTIDPRSELVDGRRRAGARVDAVGAAALLSAAARVGVVCHVHP DADTIGAGLALALVLDGCGKRVEVSFAAPATLPESLRSLPGCHLLVRPEVMRRDVDLV VTVDIPSVDRLGALGDLTDSGRELLVIDHHASNDLFGTANFIDPSADSTTTMVAEILD AWGKPIDPRVAHCIYAGLATDTGSFRWASVRGYRLAARLVEIGVDNATVSRTLMDSHP FTWLPLLSRVLGSAQLVSEAVGGRGLVYVVVDNREWVAARSEEVESIVDIVRTTQQAE VAAVFKEVEPHRWSVSMRAKTVNLAAVASGFGGGGHRLAAGYTTTGSIDDAVASLRAA LG" CDS complement(3105602..3106153) /codon_start=1 /transl_table=11 /gene="rbfA" /locus_tag="BQ2027_MB2863C" /product="PROBABLE RIBOSOME-BINDING FACTOR A RBFA (P15B PROTEIN)" /note="Mb2863c, rbfA, len: 183 aa. Equivalent to Rv2838c, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Probable rbfA, ribosome-binding factor A, equivalent to Q9Z5I8|RBFA_MYCLE|ML1555|MLCB596.15 PROBABLE RIBOSOME-BINDING FACTOR A from Mycobacterium leprae (164 aa), FASTA scores: opt: 739, E(): 1.8e-40, (75.6% identity in 160 aa overlap). Also highly similar or similar to others e.g. Q9Z527|RBFA_STRCO|SC9F2.08c from Streptomyces coelicolor (160 aa), FASTA scores: opt: 425, E(): 2.8e-20, (50.35% identity in 141 aa overlap); P32731|RBFA_BACSU from Bacillus subtilis (117 aa), FASTA scores: opt: 199, E(): 7.8e-06, (32.4% identity in 108 aa overlap); P09170|RBFA_ECOLI|P15B|B3167 from Escherichia coli strain K12 (132 aa), FASTA scores: opt: 166, E(): 0.0011, (29.65% identity in 118 aa overlap); etc. BELONGS TO THE RBFA FAMILY. Note that appears to be longer in C-terminus than other RbfA proteins. Protein product from Mb2863c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2863c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65965" /db_xref="InterPro:IPR000238" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR020053" /db_xref="InterPro:IPR023799" /db_xref="UniProtKB/Swiss-Prot:P65965" /protein_id="SIU01483.1" /translation="MADAARARRLAKRIAAIVASAIEYEIKDPGLAGVTITDAKVTAD LHDATVYYTVMGRTLHDEPNCAGAAAALERAKGVLRTKVGAGTGVRFTPTLTFTLDTI SDSVHRMDELLARARAADADLARVRVGAKPAGEADPYRDNGSVAQSPAPGGLGIRTSD GPEAVEAPLTCGGDTGDDDRPKE" CDS complement(3106153..3108855) /codon_start=1 /transl_table=11 /gene="infB" /locus_tag="BQ2027_MB2864C" /product="PROBABLE TRANSLATION INITIATION FACTOR IF-2 INFB" /note="Mb2864c, infB, len: 900 aa. Equivalent to Rv2839c, len: 900 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 900 aa overlap). Probable infB, translation initiation factor IF-2, highly similar, but in part, to Q9Z5I9|IF2_MYCLE|ML1556|MLCB596.14 TRANSLATION INITIATION FACTOR IF-2 from Mycobacterium leprae (924 aa), FASTA scores: opt: 4548, E(): 2.4e-132, (83.6% identity in 933 aa overlap). Also similar in part to others e.g. Q9K3E2|SC5H4.30 from Streptomyces coelicolor (835 aa), FASTA scores: opt: 2559, E(): 1.3e-71, (59.9% identity in 833 aa overlap); P17889|IF2_BACSU|INFB from Bacillus subtilis (716 aa), FASTA scores: opt: 1782, E(): 6.6e-48, (46.65% identity in 686 aa overlap); P02995|IF2_ECOLI|INFB|SSYG|B3168|Z4529|ECS4049 from Escherichia coli strains O157:H7 and K12 (890 aa), FASTA scores: opt: 1708, E(): 1.3e-45, (46.2% identity in 662 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE IF-2 FAMILY. Protein product from Mb2864c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2864c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65132" /db_xref="InterPro:IPR000178" /db_xref="InterPro:IPR000795" /db_xref="InterPro:IPR005225" /db_xref="InterPro:IPR006847" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR015760" /db_xref="InterPro:IPR023115" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036925" /db_xref="UniProtKB/Swiss-Prot:P65132" /protein_id="SIU01484.1" /translation="MAAGKARVHELAKELGVTSKEVLARLSEQGEFVKSASSTVEAPV ARRLRESFGGSKPAPAKGTAKSPGKGPDKSLDKALDAAIDMAAGNGKATAAPAKAADS GGAAIVSPTTPAAPEPPTAVPPSPQAPHPGMAPGARPGPVPKPGIRTPRVGNNPFSSA QPADRPIPRPPAPRPGTARPGVPRPGASPGSMPPRPGGAVGGARPPRPGAPRPGGRPG APGAGRSDAGGGNYRGGGVGAAPGTGFRGRPGGGGGGRPGQRGGAAGAFGRPGGAPRR GRKSKRQKRQEYDSMQAPVVGGVRLPHGNGETIRLARGASLSDFADKIDANPAALVQA LFNLGEMVTATQSVGDETLELLGSEMNYNVQVVSPEDEDRELLESFDLSYGEDEGGEE DLQVRPPVVTVMGHVDHGKTRLLDTIRKANVREAEAGGITQHIGAYQVAVDLDGSQRL ITFIDTPGHEAFTAMRARGAKATDIAILVVAADDGVMPQTVEAINHAQAADVPIVVAV NKIDKEGADPAKIRGQLTEYGLVPEEFGGDTMFVDISAKQGTNIEALEEAVLLTADAA LDLRANPDMEAQGVAIEAHLDRGRGPVATVLVQRGTLRVGDSVVAGDAYGRVRRMVDE HGEDVEVALPSRPVQVIGFTSVPGAGDNFLVVDEDRIARQIADRRSARKRNALAARSR KRISLEDLDSALKETSQLNLILKGDNAGTVEALEEALMGIQVDDEVVLRVIDRGVGGI TETNVNLASASDAVIIGFNVRAEGKATELASREGVEIRYYSVIYQAIDEIEQALRGLL KPIYEENQLGRAEIRALFRSSKVGLIAGCLVTSGVMRRNAKARLLRDNIVVAENLSIA SLRREKDDVTEVRDGFECGLTLGYADIKEGDVIESYELVQKERA" CDS complement(3108941..3109240) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2865C" /product="COG2740: Predicted nucleic-acid-binding protein implicated in transcription termination" /note="Mb2865c, -, len: 99 aa. Equivalent to Rv2840c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Conserved hypothetical protein, equivalent to Q9Z5J0|ML1557|MLCB596.13 HYPOTHETICAL 11.6 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 501, E(): 2.3e-29, (501% identity in 96 aa overlap). Also highly similar to other hypothetical proteins e.g. Q9KYR0|SC5H4.29 from Streptomyces coelicolor (101 aa), FASTA scores: opt: 256, E(): 1.4e-11, (50.6% identity in 81 aa overlap); Q9APM9 from Myxococcus xanthus (111 aa), FASTA scores: opt: 174, E(): 1.3e-05, (42.25% identity in 97 aa overlap); and similar to to others e.g. N-terminus of CAC41675|SMC02913 from Rhizobium meliloti (Sinorhizobium meliloti) (230 aa), FASTA scores: opt: 172, E(): 3e-05, (42.4% identity in 66 aa overlap). Protein product from Mb2865c detected using SWATH mass spectrometry. Mb2865c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007393" /db_xref="InterPro:IPR035931" /db_xref="InterPro:IPR037465" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4A8" /protein_id="SIU01485.1" /translation="MRTCVGCRKRGLAVELLRVVAVSTGNGNYAVIVDTATSLPGRGA WLHPLRQCAQQAIRRRAFARALRIAGSPDTSAVVEYLESLGELEPPGNRTGSNRT" CDS complement(3109367..3110410) /codon_start=1 /transl_table=11 /gene="nusA" /locus_tag="BQ2027_MB2866C" /product="PROBABLE N UTILIZATION SUBSTANCE PROTEIN A NUSA" /note="Mb2866c, nusA, len: 347 aa. Equivalent to Rv2841c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Probable nusA, N-utilization substance protein A, equivalent to Q9Z5J1|NUSA|ML1558 PROBABLE TRANSCRIPTION TERMINATION/ANTITERMINATION FACTOR from Mycobacterium leprae (347 aa), FASTA scores: opt: 2054, E(): 5.4e-120, (91.95% identity in 347 aa overlap). Also highly similar to others e.g. Q9KYR1|SC5H4.28 PUTATIVE TRANSCRIPTIONAL TERMINATION/ANTITERMINATION FACTOR from Streptomyces coelicolor (340 aa), FASTA scores: opt: 1346, E(): 4.3e-76, (63.35% identity in 341 aa overlap); P32727|NUSA_BACSU N UTILIZATION SUBSTANCE PROTEIN A (371 aa), FASTA scores: opt: 847, E(): 4.1e-45, (43.95% identity in 346 aa overlap); Q9KA74|NUSA|BH2416 TRANSCRIPTIONAL TERMINATOR from Bacillus halodurans (382 aa), FASTA scores: opt: 846, E(): 4.8e-45, (43.15% identity in 373 aa overlap); etc. BELONGS TO THE NUSA FAMILY. Protein product from Mb2866c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2866c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5M3" /db_xref="InterPro:IPR003029" /db_xref="InterPro:IPR009019" /db_xref="InterPro:IPR010213" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR013735" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR022967" /db_xref="InterPro:IPR025249" /db_xref="InterPro:IPR030842" /db_xref="InterPro:IPR036555" /db_xref="UniProtKB/Swiss-Prot:P0A5M3" /protein_id="SIU01486.1" /translation="MNIDMAALHAIEVDRGISVNELLETIKSALLTAYRHTQGHQTDA RIEIDRKTGVVRVIARETDEAGNLISEWDDTPEGFGRIAATTARQVMLQRFRDAENER TYGEFSTREGEIVAGVIQRDSRANARGLVVVRIGTETKASEGVIPAAEQVPGESYEHG NRLRCYVVGVTRGAREPLITLSRTHPNLVRKLFSLEVPEIADGSVEIVAVAREAGHRS KIAVRSNVAGLNAKGACIGPMGQRVRNVMSELSGEKIDIIDYDDDPARFVANALSPAK VVSVSVIDQTARAARVVVPDFQLSLAIGKEGQNARLAARLTGWRIDIRGDAPPPPPGQ PEPGVSRGMAHDR" CDS complement(3110407..3110958) /codon_start=1 /transl_table=11 /gene="rimP" /locus_tag="BQ2027_MB2867C" /product="Bacterial ribosome SSU maturation protein RimP" /note="Mb2867c, -, len: 183 aa. Equivalent to Rv2842c, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Conserved hypothetical protein, similar to Q9Z5J2|MLCB596.11 HYPOTHETICAL 13.7 KDA PROTEIN from Mycobacterium leprae (122 aa), FASTA scores: opt: 192, E(): 2.1e-12, (50.0% identity in 128 aa overlap) (N-terminus shorter). Also similar in part to several hypothetical proteins e.g. Q9KYR2|SC5H4.27 HYPOTHETICAL 19.8 KDA PROTEIN from Streptomyces coelicolor (177 aa), FASTA scores: opt: 288, E(): 2.1e-12, (37.15% identity in 148 aa overlap); O66619|Y260_AQUAE|AQ_260 HYPOTHETICAL PROTEIN from Aquifex aeolicus (158 aa), FASTA scores: opt: 230, E(): 1.7e-08, (31.35% identity in 153 aa overlap); Q9KU82|VC0641 HYPOTHETICAL PROTEIN from Vibrio cholerae (151 aa), FASTA scores: opt: 198, E(): 2.5e-06, (30.9% identity in 152 aa overlap); etc. Protein product from Mb2867c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2867c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67215" /db_xref="InterPro:IPR003728" /db_xref="InterPro:IPR028989" /db_xref="InterPro:IPR028998" /db_xref="InterPro:IPR035956" /db_xref="InterPro:IPR036847" /db_xref="UniProtKB/Swiss-Prot:P67215" /protein_id="SIU01487.1" /translation="MTTGLPSQRQVIELLGADFACAGYEIEDVVIDARARPPRIAVIA DGDAPLDLDTIAALSRRASALLDGLDGANKIRGRYLLEVSSPGVERPLTSEKHFRRAR GRKVELVLSDGSRLTGRVGEMRAGTVALVIREDRGWAVREIPLAEIVKAVVQVEFSPP APAELELAQSSEMGLARGTEAGA" CDS 3111153..3111698 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2868" /product="PROBABLE CONSERVED TRANSMEMBRANE ALANINE RICH PROTEIN" /note="Mb2868, -, len: 181 aa. Equivalent to Rv2843, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Probable conserved transmembrane ala-rich protein, equivalent to Q9Z5J3|ML1560|MLCB596.10c HYPOTHETICAL 17.5 KDA PROTEIN from Mycobacterium leprae (178 aa), FASTA scores: opt: 707, E(): 1.4e-32, (70.25% identity in 168 aa overlap). Protein product from Mb2868 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2868 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2S8" /db_xref="InterPro:IPR006311" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2S8" /protein_id="SIU01488.1" /translation="MLRAAPVINRLTNRPISRRGVLAGGAALAALGVVSACGESAPKA PAVEELRSPLDQARHDGALAAAAATAIGIPPQVAAALTVVATQRTSHARALATEIARA AGKLVSATSETSSSSPSPTDPAAPPPAVSDVIDSLRTSAGEASRLVATTSGYRAGLLA SIAASCTASYTVALVPSGPSI" CDS 3111695..3112183 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2869" /product="conserved alanine rich protein" /note="Mb2869, -, len: 162 aa. Equivalent to Rv2844, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Conserved hypothetical ala-rich protein, equivalent to Q9Z5J4|ML1561|MLCB596.09c HYPOTHETICAL 17.5 KDA PROTEIN from Mycobacterium leprae (165 aa), FASTA scores: opt: 771, E(): 4.9e-46, (71.5% identity in 165 aa overlap). Also similar to Q9KYR4|SC5H4.25c HYPOTHETICAL 16.8 KDA PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 242, E(): 1.6e-09, (38.9% identity in 144 aa overlap). Protein product from Mb2869 detected using shotgun mass spectrometry. Mb2869 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012347" /db_xref="InterPro:IPR029447" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2C8" /protein_id="SIU01489.1" /translation="MTSSEPAHGATPKRSPSEGSADNAALCDALAVEHATIYGYGIVS ALSPPGVNFLVADALKQHRHRRDDVIVMLSARGVTAPIAAAGYQLPMQVSSAADAARL AVRMENDGATAWRAVVEHAETADDRVFASTALTESAVMATRWNRVLGAWPITAAFPGG DE" CDS complement(3112184..3113932) /codon_start=1 /transl_table=11 /gene="proS" /locus_tag="BQ2027_MB2870C" /product="probable prolyl-trna synthetase pros (proline--trna ligase) (prors) (global rna synthesis factor) (proline translase)" /note="Mb2870c, proS, len: 582 aa. Equivalent to Rv2845c, len: 582 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 582 aa overlap). Probable proS, prolyl-tRNA synthetase (EC 6.1.1.15), highly similar to others e.g. Q9KYR6|SYP_STRCO|PROS|SC5H4.23 from Streptomyces coelicolor (567 aa), FASTA scores: opt: 1161, E(): 9e-64, (57.15% identity in 574 aa overlap); P56124|SYP_HELPY|PROS|HP0238 from Helicobacter pylori (Campylobacter pylori) (577 aa), FASTA scores: opt: 1082, E(): 6.6e-59, (37.8% identity in 553 aa overlap); P16659|SYP_ECOLI|PROS|DRPA|B0194 from Escherichia coli strain K12 (572 aa), FASTA scores: opt: 926, E(): 2.6e-49, (39.85% identity in 587 aa overlap); etc. Contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb2870c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2870c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXQ5" /db_xref="InterPro:IPR002314" /db_xref="InterPro:IPR002316" /db_xref="InterPro:IPR004154" /db_xref="InterPro:IPR004500" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR007214" /db_xref="InterPro:IPR023717" /db_xref="InterPro:IPR033730" /db_xref="InterPro:IPR036621" /db_xref="InterPro:IPR036754" /db_xref="UniProtKB/Swiss-Prot:Q7TXQ5" /protein_id="SIU01490.1" /translation="MITRMSELFLRTLRDDPADAEVASHKLLIRAGYIRPVAPGLYSW LPLGLRVLRNIERVIRDEMNAIGGQEILFPALLPRAPYETTNRWTQYGDSVFRLKDRR GNDYLLGPTHEELFTLTVKGEYSSYKDFPLTLYQIQTKYRDEARPRAGILRAREFVMK DSYSFDIDAAGLKAAYRAHREAYQRIFDRLQVRYVIVSAVSGAMGGSASEEFLAESPS GEDAFVRCLESGYTANVEAVVTARPDTLPIDGLPEAVVHDTGDTPTIASLVAWANEAD LGRTVTAADTLKNVLIKVRQPGGDTELLAIGVPGDREVDDKRLGAALEPADYALLDDD DFAKHPFLVKGYIGPKALRENNVRYLVDPRIVDGTSWITGADQPGRHVVGLVAGRDFT ADGTIEAAEVREGDPSPDGAGPLVMARGIEIGHIFQLGSKYTDAFTADVLGEDGKPVR LTMGSYGIGVSRLVAVVAEQHHDELGLRWPSTVAPFDVHLVIANKDAQARAGATALAA DLDRLGVEVLLDDRQASPGVKFKDAELLGMPWIVVVGRGWADGVVELRDRFSGQTREL VAGASLATDIAAAVTG" CDS complement(3114021..3115613) /codon_start=1 /transl_table=11 /gene="efpA" /locus_tag="BQ2027_MB2871C" /product="POSSIBLE INTEGRAL MEMBRANE EFFLUX PROTEIN EFPA" /note="Mb2871c, efpA, len: 530 aa. Equivalent to Rv2846c, len: 530 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 530 aa overlap). Possible efpA, integral membrane efflux protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug (see citations below), equivalent to Q9Z5J5|ML1562|MLCB596.08 PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Mycobacterium leprae (534 aa), FASTA scores: opt: 2881, E(): 4.1e-160, (86.55% identity in 535 aa overlap). Also highly similar to several membrane proteins e.g. O69986|SC4H2.31c TRANSMEMBRANE EFFLUX PROTEIN (515 aa), FASTA scores: opt: 1063, E(): 2.2e-54, (39.65% identity in 406 aa overlap); Q9FBQ5|SCD86A.02c PUTATIVE TRANSPORT INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (503 aa), FASTA scores: opt: 918, E(): 5.8e-46, (33.7% identity in 469 aa overlap); Q9KYU0|SCE22.23c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (514 aa), FASTA scores: opt: 888, E(): 3.3e-44, (32.85% identity in 469 aa overlap); etc. Protein product from Mb2871c detected using SWATH mass spectrometry. Mb2871c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2D5" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01491.1" /translation="MTALNDTERAVRNWRAGRPHRPAPMRPPRSEETASERPSRYYPT WLPSRSFIAAVIAIGGMQLLATMDSTVAIVALPKIQNELSLSDAGRSWVITAYVLTFG GLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEATLVIARLSQGVGSAIASP TGLALVATTFPKGPARNAATAVFAAMTAIGSVMGLVVGGALTEVSWRWAFLVNVPIGL VMIYLARTALRETNKERMKLDATGAILATLACTAAVFAFSIGPEKGWMSGITIGSGLV ALAAAVAFVIVERTAENPVVPFHLFRDRNRLVTFSAILLAGGVMFSLTVCIGLYVQDI LGYSALRAGVGFIPFVIAMGIGLGVSSQLVSRFSPRVLTIGGGYLLFGAMLYGSFFMH RGVPYFPNLVMPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAIALMLQSLGGPLV LAVIQAVITSRTLYLGGTTGPVKFMNDVQLAALDHAYTYGLLWVAGAAIIVGGMALFI GYTPQQVAHAQEVKEAIDAGEL" CDS complement(3115636..3116853) /codon_start=1 /transl_table=11 /gene="cysG" /locus_tag="BQ2027_MB2872C" /product="POSSIBLE MULTIFUNCTIONAL ENZYME SIROHEME SYNTHASE CYSG: UROPORPHYRIN-III C-METHYLTRANSFERASE (UROGEN III METHYLASE) (SUMT) (UROPORPHYRINOGEN III METHYLASE) (UROM) + PRECORRIN-2 OXIDASE + FERROCHELATASE" /note="Mb2872c, cysG, len: 405 aa. Equivalent to Rv2847c, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 405 aa overlap). Possible cysG, multifunctional enzyme, siroheme synthase containing uroporphyrin-iii c-methyltransferase (EC 2.1.1.107), precorrin-2 oxidase (EC 1.-.-.-) and ferrochelatase (EC 4.99.1.-). C-terminus highly similar to many uroporphyrin-iii c-methyltransferases e.g. Q51720|COBA UROPORPHYRINOGEN III METHYLTRANSFERASE from Propionibacterium freudenreichii (257 aa), FASTA scores: opt: 776, E(): 1.5e-39, (48.95% identity in 243 aa overlap); Q9HMY4|UROM|VNG2331G S-ADENOSYL-L-METHIONINE:UROPORPHYRINOGEN III METHYLTRANSFERASE from Halobacterium sp. strain NRC-1 (246 aa), FASTA scores: opt: 704, E(): 3.1e-35, (49.4% identity in 245 aa overlap); P42437|NASF_BACSU|NASBE UROPORPHYRIN-III C-METHYLTRANSFERASE from Bacillus subtilis (483 aa), FASTA scores: opt: 610, E(): 2.4e-29, (42.1% identity in 240 aa overlap); etc. And highly similar over entire length to other proteins e.g. Q9L1C9|SCL11.09c UROPORPHYRINOGEN III METHYLTRANSFERASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1481, E(): 5.6e-82, (58.45% identity in 409 aa overlap); Q9I0M7|CYSG|PA2611 SIROHEME SYNTHASE from Pseudomonas aeruginosa (465 aa), FASTA scores: opt: 609, E(): 2.7e-29, (34.7% identity in 444 aa overlap); P11098|CYSG_ECOLI|B3368|Z4729|ECS4219 SIROHEME SYNTHASE from Escherichia coli stains O157:H7 and K12 (457 aa), FASTA scores: opt: 543, E(): 9.1e-27, (31.3% identity in 450 aa overlap); etc. BELONGS TO A FAMILY THAT GROUPS SUMT, CYSG, CBIF/COBM AND CBIL/COBI. Note that previously known as cysG2. Protein product from Mb2872c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2872c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2D0" /db_xref="InterPro:IPR000878" /db_xref="InterPro:IPR006366" /db_xref="InterPro:IPR012409" /db_xref="InterPro:IPR014776" /db_xref="InterPro:IPR014777" /db_xref="InterPro:IPR035996" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2D0" /protein_id="SIU01492.1" /translation="MTENPYLVGLRLAGKKVVVVGGGTVAQRRLPLLIASGADVHVIA PSVTPAVEAMDQITLSVRDYRDGDLDGAWYAIAATDDARVNVAVVAEAERRRIFCVRA DIAVEGTAVTPASFSYAGLSVGVLAGGEHRRSAAIRSAIREALQQGVITAQSSDVLSG GVALVGGGPGDPELITVRGRRLLAQADVVVADRLAPPELLAELPPHVEVIDAAKIPYG RAMAQDAINAVLIERARSGNFVVRLKGGDPFVFARGYEEVLACAHAGIPVTVVPGVTS AIAVPAMAGVPVTHRAMTHEFVVVSGHLAPGHPESLVNWDALAALTGTIVLLMAVERI ELFVDVLLKGGRTADTPVLVVQHGTTAAQQTLRATLADTPEKVRAAGIRPPAIIVIGA VVGLSGVRGLNNS" CDS complement(3117076..3118449) /codon_start=1 /transl_table=11 /gene="cobB" /locus_tag="BQ2027_MB2873C" /product="PROBABLE COBYRINIC ACID A,C-DIAMIDE SYNTHASE COBB" /note="Mb2873c, cobB, len: 457 aa. Equivalent to Rv2848c, len: 457 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 457 aa overlap). Probable cobB, cobyrinic acid A,C-diamide synthase, highly similar to others e.g. O27509|COBB_METTH|MTH1460 from Methanobacterium thermoautotrophicum (447 aa), FASTA scores: opt: 980, E(): 1.3e-49, (39.65% identity in 454 aa overlap); Q9KBM8|BH1898 from Bacillus halodurans (465 aa), FASTA scores: opt: 928, E(): 1.4e-46, (37.0% identity in 457 aa overlap); O68108|COBB_RHOCA from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (435 aa), FASTA scores: opt: 921, E(): 3.3e-46, (39.35% identity in 437 aa overlap); etc. BELONGS TO THE COBB/COBQ FAMILY, COBB SUBFAMILY. Protein product from Mb2873c detected using SWATH mass spectrometry. Mb2873c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63836" /db_xref="InterPro:IPR002586" /db_xref="InterPro:IPR004484" /db_xref="InterPro:IPR011698" /db_xref="InterPro:IPR017929" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/Swiss-Prot:P63836" /protein_id="SIU01493.1" /translation="MRVSAVAVAAPASGSGKTTIATGLIGALRQAGHTVAPFKVGPDF IDPGYHALAAGRPGRNLDPVLVGERLIGPLYAHGVAGADIAVIEGVLGLFDGRIGPAG GAPAAGSTAHVAALLGAPVILVVDARGQSHSVAALLHGFSTFDTATRIAGVILNRVGS ARHEQVLRQACDQAGVAVLGAIPRTAELELPTRYLGLVTAVEYGRRARLAVQAMTAVV ARHVDLAAVIACAGSQAAHPPWDPVIAVGNTARQPATVAIAAGRAFTFGYAEHAEMLR AAGAEVVEFDPLSETLPEGTDAVVLPGGFPEQFTAELSANDTVRRQINELAAAGAPVH AECAGLLYLVSELDGHPMCGVVAGSARFTQHLKLGYRDAVAVVDSALYSVGERVVGHE FHRTAVTFADSYQPAWVYQGQDVDDVRDGAVHSGVHASYLHTHPAATPGAVARFVAHA ACNTPRA" CDS complement(3118449..3119072) /codon_start=1 /transl_table=11 /gene="cobO" /locus_tag="BQ2027_MB2874C" /product="PROBABLE COB(I)ALAMIN ADENOSYLTRANSFERASE COBO (CORRINOID ADENOSYLTRANSFERASE) (CORRINOID ADOTRANSFERASE ACTIVITY)" /note="Mb2874c, cobO, len: 207 aa. Equivalent to Rv2849c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 207 aa overlap). Probable cobO, cob(I)alamin adenosyltransferase (EC 2.5.1.17), highly similar to Q9RJ17|COBO from Streptomyces coelicolor (199 aa), FASTA scores: opt: 918, E(): 1.1e-55, (64.75% identity in 207 aa overlap); and similar to others e.g. O30785|COBO from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (212 aa), FASTA scores: opt: 329, E(): 2.8e-15, (44.3% identity in 185 aa overlap); P29930|COBO_PSEDE from Pseudomonas denitrificans (213 aa), FASTA scores: opt: 280, E(): 6.5e-12, (38.9% identity in 185 aa overlap); P31570|BTUR_SALTY|COBA from Salmonella typhimurium (196 aa), FASTA scores: opt: 278, E(): 8.4e-12, (39.8% identity in 196 aa overlap); etc. COFACTOR: MANGANESE. Note that previously known as cobA. Protein product from Mb2874c detected using SWATH mass spectrometry. Mb2874c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2E7" /db_xref="InterPro:IPR003724" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E7" /protein_id="SIU01494.1" /translation="MPQGNPLAVPNDGLTTRARRNMPILAVHTGEGKGKSTAAFGMAL RAWNAGLDIAVFQFVKSAKWKVGEEAAFRQLGRLHDQHGIGGAVEWHKMGAGWSWTRT SRKAGTDVDRAAAAADGCAEIALRLATQRHDFYLLDEFTYPLKWGWLDVDEVVDVLRA RPGHQHVVITGRDAPQRLVAAADLVTEMTKVKHPMDAGRKGQKGIEW" CDS complement(3119093..3120982) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2875C" /product="POSSIBLE MAGNESIUM CHELATASE" /note="Mb2875c, -, len: 629 aa. Equivalent to Rv2850c, len: 629 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 629 aa overlap). Possible magnesium-chelatase (EC 4.99.1.-), highly similar (but with gaps) to magnesium-chelatases from notably photosynthetic organisms involved in chlorophyll biosynthesis e.g. Q9RJ18|SCI8.35c PUTATIVE CHELATASE from Streptomyces coelicolor (672 aa), FASTA scores: opt: 1941, E(): 2.1e-85, (54.65% identity in 675 aa overlap); Q9HZQ5|PA2942 PROBABLE MAGNESIUM CHELATASE from Pseudomonas aeruginosa (338 aa), FASTA scores: opt: 991, E(): 2.7e-40, (49.45% identity in 368 aa overlap); O33549|BCHI MG PROTOPORPHYRIN IX CHELATASE SUBUNIT from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (334 aa), FASTA scores: opt: 833, E(): 9.4e-33, (50.65% identity in 318 aa overlap); O30819|BCHI_RHOSH MAGNESIUM-CHELATASE 38 KDA SUBUNIT from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (334 aa), FASTA scores: opt: 828, E(): 1.6e-32, (50.3% identity in 318 aa overlap); etc. Equivalent to AAK47242 from Mycobacterium tuberculosis strain CDC1551 (610 aa) but longer 19 aa. COULB BELONG TO THE MG-CHELATASE SUBUNITS D/I FAMILY. Protein product from Mb2875c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2875c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C6" /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011704" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036465" /db_xref="InterPro:IPR041628" /db_xref="InterPro:IPR041702" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C6" /protein_id="SIU01495.1" /translation="MKPYPFSAIVGHDRLRLALLLCAVRPEIGGALIRGEKGTAKSTA VRGLAALLSVATGSTETGLVELPLGATEDRVVGSLDLQRVMRDGEHAFSPGLLARAHG GVLYVDEVNLLHDHLVDILLDAAAMGRVHVERDGISHSHEARFVLIGTMNPEEGELRP QLLDRFGLTVDVQASRDIDVRVQVIRRRMAYEADPDAFVARYADADAELAHRIAAARA TVDDVVLGDNELRRIAALCAAFDVDGMRADLVVARTAAAHAAWRGVRTVEEQDIQAAA ELALPHRRRRDPFDDHGIDRDQLDEALALASVDPEPEPDPPGGGQSANEPASQPNSRS KSTEPGAPSSMGDDPPRPASPRLRSSPRPSAPPSKIFRTRALRVPGVGTGAPGRRSRA RNASGSVVAAAEVSDPDAHGLHLFATLLAAGERAFGAGPLRPWPDDVRRAIRESREGN LVIFVVDASGSMAARDRMAAVSGATLSLLRDAYQRRDKVAVITFRQHEATLLLSPTSS AHIAGRRLARFSTGGKTPLAEGLLAARALIIREKVRDRARRPLVVVLTDGRATAGPDP LGRSRTAAAGLVAEGAAAVVVDCETSYVRLGLAAQLARQLGAPVVRLEQLHADYLVHA VRGVA" CDS complement(3120979..3121449) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2876C" /product="gcn5-related n-acetyltransferase" /note="Mb2876c, -, len: 156 aa. Equivalent to Rv2851c, len: 156 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 156 aa overlap). Conserved hypothetical protein, similar to various bacterial proteins e.g. Q9KP14|VC2565 ELAA PROTEIN from Vibrio cholerae (149 aa), FASTA scores: opt: 360, E(): 1e-18, (46.05% identity in 139 aa overlap); Q9I717|PA0115 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (150 aa), FASTA scores: opt: 341, E(): 2.4e-17, (43.65% identity in 142 aa overlap); Q9K8M4|BH2982 HYPOTHETICAL PROTEIN from Bacillus halodurans (155 aa), FASTA scores: opt: 320, E(): 8e-16, (40.85% identity in 142 aa overlap); P52077|ELAA_ECOLI|B2267 PROTEIN ELAA from Escherichia coli strain K12 (153 aa), FASTA scores: opt: 269, E(): 3.8e-12, (35.7% identity in 140 aa overlap); etc. Protein product from Mb2876c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2876c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67105" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/Swiss-Prot:P67105" /protein_id="SIU01496.1" /translation="MTEALRRVWAKDLDARALYELLKLRVEVFVVEQACPYPELDGRD LLAETRHFWLETPDGEVTCTLRLMEEHAGGEKVFRIGRLCTKRDARGQGHSNRLLCAA LAEVGDYPCRIDAQAYLTAMYAQHGFVRDGDEFLDDGIPHVPMLRPGSGQVERP" CDS complement(3121508..3122989) /codon_start=1 /transl_table=11 /gene="mqo" /locus_tag="BQ2027_MB2877C" /product="PROBABLE MALATE:QUINONE OXIDOREDUCTASE MQO (MALATE DEHYDROGENASE [ACCEPTOR])" /note="Mb2877c, mqo, len: 493 aa. Equivalent to Rv2852c, len: 493 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 493 aa overlap). Probable mqo, malate:quinone oxidoreductase (EC 1.1.99.16), highly similar to others e.g. O69282|MQO_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (499 aa), FASTA scores: opt: 1701, E(): 1.2e-101, (50.7% identity in 495 aa overlap); Q9Z9Q7|BH3960 from Bacillus halodurans (500 aa), FASTA scores: opt: 1632, E(): 3.3e-97, (48.55% identity in 486 aa overlap); Q9HYF4|MQOA|PA3452 from Pseudomonas aeruginosa (523 aa), FASTA scores: opt: 1604, E(): 2.1e-95, (49.1% identity in 487 aa overlap) (N-terminus longer); P33940|MQO_ECOLI|B2210 from Escherichia coli strain K12 (548 aa), FASTA scores: opt: 1525, E(): 2.7e-90, (48.15% identity in 492 aa overlap); etc. BELONGS TO THE MQO FAMILY. COFACTORS: FAD. Protein product from Mb2877c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2877c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65420" /db_xref="InterPro:IPR006231" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P65420" /protein_id="SIU01497.1" /translation="MSDLARTDVVLIGAGIMSATLGVLLRRLEPNWSITLIERLDAVA AESSGPWNNAGTGHSALCEMNYTPEMPDGSIDITKAVRVNEQFQVTRQFWAYAAENGI LTDVRSFLNPVPHVSFVHGSRGVEYLRRRQKALAGNPLFAGTEFIESPDEFARRLPFM AAKRAFSEPVALNWAADGTDVDFGALAKQLIGYCVQNGTTALFGHEVRNLSRQSDGSW TVTMCNRRTGEKRKLNTKFVFVGAGGDTLPVLQKSGIKEVKGFAGFPIGGRFLRAGNP ALTASHRAKVYGFPAPGAPPLGALHLDLRFVNGKSWLVFGPYAGWSPKFLKHGQISDL PRSIRPDNLLSVLGVGLTERRLLNYLISQLRLSEPERVSALREFAPSAIDSDWELTIA GQRVQVIRRDERNGGVLEFGTTVIGDADGSIAGLLGGSPGASTAVAIMLDVLQKCFAN RYQSWLPTLKEMVPSLGVQLSNEPALFDEVWSWSTKALKLGAA" CDS 3123196..3125043 /codon_start=1 /transl_table=11 /gene="PE_PGRS48" /locus_tag="BQ2027_MB2878" /product="pe-pgrs family protein pe_pgrs48" /note="Mb2878, PE_PGRS48, len: 615 aa. Equivalent to Rv2853, len: 615 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 615 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to many e.g. O53884|Rv0872c|MTV043.65c from Mycobacterium tuberculosis (606 aa), FASTA scores: opt: 1405, E(): 1.4e-97, (64.6% identity in 619 aa overlap). Equivalent to AAK47245 from Mycobacterium tuberculosis strain CDC1551 (663 aa) but shorter 48 aa. Mb2878 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2T7" /protein_id="SIU01498.1" /translation="MLYVVASPDLMTAAATNLAEIGSAISTANGAAALPTVEVVAAAA DEVSTQIAALFGAHARSYQTLSTQAAAFHSRFVQALTTAAASYASVEAANASPLQVAL DVINAPAQTLLGRPLIGNGADGSTPGQAGGPGGLLYGNGGNGAAGGPNQAGGAGGNAG LIGNGGAGGAGGVGAVGGNGGTGGLLFGNGGAGGQGGLGLAGINGGSGGQGGHGGNAI LFGQGGAGGPGGTGAMGVAGTNPTPIGTAAPGSDGVNQIGNGGNTDLTGGAGGDGNAG STTVNGGNGGTGGAARNSSGGTGNSFGGAGGAGGDGANGGDGGAGGEALTEGGATAVS GAGGKGGNAEASGGAGGNGGKGGFAQATTSVTGGNGGNGGNGHDSNAPGGAGGSGGVG GDGGRGGLLAGNGGTGGAGGNGGTGGAGAPGGAGGAGGKADIANSLGDNATVTGGNGG TGGDGGSALGTGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGAGGAGGEGGAG GAGGEAIPGGASTNSAGGDGGAGGTGGNGGDGGAGGAPGLGGAGGAGGWLIGQSGSTG GGGAGGAGGAGGAGGAGGSGGAGGHGDTTSGKNGSSGTAGFDGNPGQPG" CDS 3125080..3126120 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2879" /product="Lysophospholipase (EC" /EC_number="3.1.1.5" /note="Mb2879, -, len: 346 aa. Equivalent to Rv2854, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Hypothetical unknown protein, showing similarity with Q9CD03|ML2603 HYPOTHETICAL PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 154, E(): 0.0083, (33.35% identity in 87 aa overlap). Protein product from Mb2879 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2879 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR022742" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E0" /protein_id="SIU01499.1" /translation="MTGWVPDVLPGYWQCTIPLGPDPDDEGDIVATLVGRGPQTGKAR GDTTGAHHTVLAVHGYTDYFFHTELADHFANRGFAFYALDLRKCGRSRAPGQTPHFIT DLARYDTELEHSLSIINEQNRSAKVLVYGHSAGGLIVSLWLDRLRQRGEITRAGVTGL VLNSPFLDLQGPAILRLPLTSAFFAAMARMRPKWVARPPKEGGYGCTLHRDYDGEFDY NLQWKPVGGFPVTFGWIHASRRGHARLHRGIDVGVPNLILCSDHTVREKADPATLHRG DAVLDVTHITRWAGCIGNRSTVIAVADAKHDVFLSLPQPRQMAYRRLDLWLDDYLGTH NDTDASASSGKG" CDS 3126133..3127512 /codon_start=1 /transl_table=11 /gene="mtr" /locus_tag="BQ2027_MB2880" /product="nadph-dependent mycothiol reductase mtr" /note="Mb2880, mtr, len: 459 aa. Equivalent to Rv2855, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 459 aa overlap). Probable mtr, mycothiol reductase (EC 1.-.-.-), proven enzymatically but previously described as glutathione reductase homolog (gene name: gorA) (see citation below). Similar to others e.g. Q9L7K8|MERA MERCURIC REDUCTASE from Streptomyces sp. CHR28 (474 aa), FASTA scores: opt: 719, E(): 9e-38, (35.2% identity in 460 aa overlap); P30341|MERA_STRLI MERCURIC REDUCTASE (EC 1.16.1.1) from Streptomyces lividans (474 aa), FASTA scores: opt: 712, E(): 2.5e-37, (34.95% identity in 455 aa overlap); Q98ED5|MLL4296 FERRIC LEGHEMOGLOBIN REDUCTASE-2 PRECURSOR, DIHYDROLIPOAMIDE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (468 aa), FASTA scores: opt: 670, E(): 1.1e-34, (30.8% identity in 471 aa overlap); etc. BELONGS TO THE PYRIDINE NUCLEOTIDE-DISULPHIDE OXIDOREDUCTASES CLASS-I. COFACTOR: FAD. Protein product from Mb2880 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2880 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2E2" /db_xref="InterPro:IPR001100" /db_xref="InterPro:IPR004099" /db_xref="InterPro:IPR012999" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR017817" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E2" /protein_id="SIU01500.1" /translation="METYDIAIIGTGSGNSILDERYASKRAAICEQGTFGGTCLNVGC IPTKMFVYAAEVAKTIRGASRYGIDAHIDRVRWDDVVSRVFGRIDPIALSGEDYRRCA PNIDVYRTHTRFGPVQADGRYLLRTDAGEEFTAEQVVIAAGSRPVIPPAILASGVDYH TSDTVMRIAELPEHIVIVGSGFIAAEFAHVFSALGVRVTLVIRGSCLLRHCDDTICER FTRIASTKWELRTHRNVVDGQQRGSGVALRLDDGCTINADLLLVATGRVSNADLLDAE QAGVDVEDGRVIVDEYQRTSARGVFALGDVSSPYLLKHVANHEARVVQHNLLCDWEDT QSMIVTDHRYVPAAVFTDPQIAAVGLTENQAVAKGLDISVKIQDYGDVAYGWAMEDTS GIVKLITERGSGRLLGAHIMGYQASSLIQPLIQAMSFGLTAAEMARGQYWIHPALPEV VENALLGLR" CDS 3127612..3128730 /codon_start=1 /transl_table=11 /gene="nicT" /locus_tag="BQ2027_MB2881" /product="POSSIBLE NICKEL-TRANSPORT INTEGRAL MEMBRANE PROTEIN NICT" /note="Mb2881, nicT, len: 372 aa. Equivalent to Rv2856, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 372 aa overlap). Possible nicT, nickel-transport integral membrane protein, similar to transport proteins and hydrogenase cluster proteins e.g. BAB58860|SAV2698 HYPOTHETICAL 37.9 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (338 aa), FASTA scores: opt: 1082, E(): 7.1e-60, (48.05% identity in 335 aa overlap); Q97ZB2|HOXN HIGH-AFFINITY NICKEL-TRANSPORT PROTEIN from Sulfolobus solfataricus (373 aa), FASTA scores: opt: 922, E(): 6.6e-50, (42.2% identity in 372 aa overlap); P23516|HOXN_ALCEU HIGH-AFFINITY NICKEL TRANSPORT PROTEIN (INTEGRAL MEMBRANE PROTEIN) from Alcaligenes eutrophus (Ralstonia eutropha) (351 aa), FASTA scores: opt: 904, E(): 8.3e-49, (41.9% identity in 339 aa overlap); Q45247|HUPN_BRAJA HYDROGENASE NICKEL INCORPORATION PROTEIN from Bradyrhizobium japonicum (381 aa), FASTA scores: opt: 853, E(): 1.3e-45, (41.65% identity in 329 aa overlap); etc. SEEMS TO BELONG TO THE HOXN/HUPN/NIXA FAMILY OF NICKEL TRANSPORTERS (NiCoT FAMILY). Mb2881 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2E6" /db_xref="InterPro:IPR004688" /db_xref="InterPro:IPR011541" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E6" /protein_id="SIU01501.1" /translation="MASSQLDRQRSRSAKMNRALTAAEWWRLGLMFAVIVALHLVGWL TVTLLVEPARLSLGGKAFGIGVGLTAYTLGLRHAFDADHIAAIDNTTRKLMSDGHRPL AVGFFFSLGHSTVVFGLAVMLVTGLKAIVGPVENDSSTLHHYTGLIGTSISGAFLYLI GILNVIVLVGIVRVFAHLRRGDYDEAELEQQLDNRGLLIRFLGRFTKSLTKSWHMYPV GFLFGLGFDTATEIALLVLAGTSAAAGLPWYAILCLPVLFAAGMCLLDTIDGSFMNFA YGWAFSSPVRKIYYNITVTGLSVAVALLIGSVELLGLIANQLGWQGPFWDWLGGLDLN TVGFVVVAMFALTWAIALLVWHYGRVEERWTPAPDRTT" CDS complement(3129511..3130287) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2882C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb2882c, -, len: 258 aa. Equivalent to Rv2857c, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 258 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases e.g. O88068|SCI35.33c PROBABLE DEHYDROGENASE (SDR FAMILY) from Streptomyces coelicolor (260 aa), FASTA scores: opt: 1208, E(): 2e-68, (72.35% identity in 253 aa overlap); Q9I376|PA1649 from Pseudomonas aeruginosa PROBABLE SHORT-CHAIN DEHYDROGENASE (253 aa), FASTA scores: opt: 569, E(): 2.1e-28, (39.2% identity in 255 aa overlap); Q9EX74|MLHA SDR-LIKE ENZYME from Rhodococcus erythropolis (246 aa), FASTA scores: opt: 567, E(): 2.8e-28, (41.15% identity in 248 aa overlap); etc. Also similar to many Mycobacterium tuberculosis dehydrogenases e.g. FABG3|Rv2002|MT2058|MTCY39.16c PUTATIVE OXIDOREDUCTASE (260 aa), FASTA score: (38.3% identity in 248 aa overlap). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb2882c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2882c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E1" /protein_id="SIU01502.1" /translation="MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDV EAGGAAADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLIEN TELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGSATSQISYTAS KGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFAKNPERAARRMVHVPLGRF AEPDEIAAAVAFLASDDASFITASTFLVDGGISSAYVTPL" CDS complement(3130284..3131651) /codon_start=1 /transl_table=11 /gene="aldC" /locus_tag="BQ2027_MB2883C" /product="PROBABLE ALDEHYDE DEHYDROGENASE ALDC" /note="Mb2883c, aldC, len: 455 aa. Equivalent to Rv2858c, len: 455 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 455 aa overlap). Probable aldC, aldehyde dehydrogenase (EC 1.2.1.3), similar to many e.g. O88069|SCI35.34c PUTATIVE ALDEHYDE DEHYDROGENASE from Streptomyces coelicolor (483 aa), FASTA scores: opt: 1872, E(): 6.4e-109, (64.5% identity in 448 aa overlap); Q9FAB1|ALDH|BT-ALDH ALDEHYDE DEHYDROGENASE from Bacillus thermoleovorans (497 aa), FASTA scores: opt: 1157, E(): 2.1e-64, (44.3% identity in 458 aa overlap); O33455|CYMC P-CUMIC ALDEHYDE DEHYDROGENASE from Pseudomonas putida (494 aa), FASTA scores: opt: 1149, E(): 6.5e-64, (43.15% identity in 452 aa overlap); P40047|DHA5_YEAST|ALD5|ALDH5|ALD3|YER073W ALDEHYDE DEHYDROGENASE from Saccharomyces cerevisiae (Baker's yeast) (519 aa), FASTA scores: opt: 1091, E(): 2.7e-60, (38.55% identity in 459 aa overlap); P80668|FEAB_ECOLI|PADA|MAOB|B1385 PHENYLACETALDEHYDE DEHYDROGENASE (EC 1.2.1.39) from Escherichia coli strain K12 (499 aa), FASTA scores: opt: 1074, E(): 3e-59, (42.2% identity in 462 aa overlap); etc. Also similar to many M. tuberculosis dehydrogenases e.g. P71823|Rv0768|MTCY369.13 (489 aa), FASTA score: (38.1% identity in 467 aa overlap). Contains PS00687 Aldehyde dehydrogenases glutamic acid active site and PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. Protein product from Mb2883c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2883c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2F3" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016160" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="InterPro:IPR029510" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F3" /protein_id="SIU01503.1" /translation="MSTTQLINPATEEVLASVDHTDANAVDDAVQRARAAQRRWARLA PAQRAAGLRAFAAAVQAHLDELAALEVANSGHPIVSAEWEAGHVRDVLAFYAASPERL SGRQIPVAGGVDVTFNEPMGVVGVITPWNFPMVIASWAIAPALAAGNAVLVKPAELTP LTTMRLGELAVEAGLDEDLLQVLPGKGTVVGERFVTHPDIRKIVFTGSTEVGKRVMAG AAAQVKRVTLELGGKSANIVFHDCDLERAATTAPAGVFDNAGQDCCARSRILVQRSVY DRFMELLEPAVHSIVVGDPGSRATEMGPLVSRAHRDKVAGYVPDDAPVAFRGTAPAGR GFWFPPTVLTPKRGDRTVTDEIFGPVVVVLTFDDEADAISLANDTAYGLSGSIWTDDL SRALRVARAVESGNLSVNSHSSVRFNTPFGGFKQSGVGRELGPDAPLQFTETKNVFIA VGEEM" CDS complement(3131648..3132574) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2884C" /product="POSSIBLE AMIDOTRANSFERASE" /note="Mb2884c, -, len: 308 aa. Equivalent to Rv2859c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Possible amidotransferase (EC 6.3.5.- or 2.-.-.-), equivalent (but longer 58 aa) to Q9CBU9|ML1573 POSSIBLE AMIDOTRANSFERASE from Mycobacterium leprae (249 aa), FASTA scores: opt: 1226, E(): 3e-64, (71.55% identity in 239 aa overlap). Also similar to other amidotransferases and hypothetical proteins, but shorter in N-terminus e.g. O88072|SCI35.37 HYPOTHETICAL 25.3 KDA PROTEIN from Streptomyces coelicolor (242 aa), FASTA scores: opt: 683, E(): 1.2e-32, (47.65% identity in 235 aa overlap); AAK79730|Q97I88|CAC1764 PREDICTED GLUTAMINE AMIDOTRANSFERASE from Clostridium acetobutylicum (241 aa), FASTA scores: opt: 458, E(): 1.6e-19, (32.95% identity in 246 aa overlap); AAK75201|Q97QV9|SP1089 GLUTAMINE AMIDOTRANSFERASE CLASS I from Streptococcus pneumoniae (229 aa), FASTA scores: opt: 431, E(): 5.6e-18, (34.75% identity in 236 aa overlap); etc. Contains three 17 aa repeats at the N-terminus very similar to those in other Mycobacterium tuberculosis proteins e.g. Q10699|YY30_MYCTU|Rv2090|MT2151|MTCY49.30 PUTATIVE 5'-3' EXONUCLEASE RV2090 (EC 3.1.11.-). Protein product from Mb2884c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2884c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2F8" /db_xref="InterPro:IPR011697" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F8" /protein_id="SIU01504.1" /translation="MDLSASRSDGGDPLRPASPRLRSPVSDGGDPLRPASPRLRSPVS DGGDPLRPASPRLRSPLGASRPVVGLTAYLEQVRTGVWDIPAGYLPADYFEGITMAGG VAVLLPPQPVDPESVGCVLDSLHALVITGGYDLDPAAYGQEPHPATDHPRPGRDAWEF ALLRGALQRGMPVLGICRGTQVLNVALGGTLHQHLPDILGHSGHRAGNGVFTRLPVHT ASGTRLAELIGESADVPCYHHQAIDQVGEGLVVSAVDVDGVIEALELPGDTFVLAVQW HPEKSLDDLRLFKALVDAASGYAGRQSQAEPR" CDS complement(3132555..3133928) /codon_start=1 /transl_table=11 /gene="glnA4" /locus_tag="BQ2027_MB2885C" /product="PROBABLE GLUTAMINE SYNTHETASE GLNA4 (GLUTAMINE SYNTHASE) (GS-II)" /note="Mb2885c, glnA4, len: 457 aa. Equivalent to Rv2860c, len: 457 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 457 aa overlap). Probable glnA4, glutamine synthetase class II (EC 6.3.1.2), similar to many glutamine synthases e.g. O88070|SCI35.35c from Streptomyces coelicolor (462 aa), FASTA scores: opt: 1947, E(): 8.2e-120, (64.15% identity in 452 aa overlap); Q98H15|MLL3074 from Rhizobium loti (Mesorhizobium loti) (465 aa), FASTA scores: opt: 1321, E(): 7.8e-79, (46.7% identity in 452 aa overlap); Q98EM0|MLL4187 from Rhizobium loti (Mesorhizobium loti) (456 aa), FASTA scores: opt: 698, E(): 4.6e-38, (33.5% identity in 454 aa overlap); Q9CDL9|GLNA from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (446 aa), FASTA scores: opt: 633, E(): 8.2e-34, (32.45% identity in 456 aa overlap); etc. Also similar to three other potential glutamine synthases in Mycobacterium tuberculosis: Q10378|GLN2_MYCTU|GLNA2|Rv2222c|MT2280|MTCY190.33c|MTCY427 .03c PROBABLE GLUTAMINE SYNTHETASE (446 aa), FASTA score: (31.1% identity in 453 aa overlap); Rv1878|glnA3 and Rv2220|glnA1. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY. Protein product from Mb2885c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2885c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4D4" /db_xref="InterPro:IPR008146" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR036651" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D4" /protein_id="SIU01505.1" /translation="MTGPGSPPLAWTELERLVAAGDVDTVIVAFTDMQGRLAGKRISG RHFVDDIATRGVECCSYLLAVDVDLNTVPGYAMASWDTGYGDMVMTPDLSTLRLIPWL PGTALVIADLVWADGSEVAVSPRSILRRQLDRLKARGLVADVATELEFIVFDQPYRQA WASGYRGLTPASDYNIDYAILASSRMEPLLRDIRLGMAGAGLRFEAVKGECNMGQQEI GFRYDEALVTCDNHAIYKNGAKEIADQHGKSLTFMAKYDEREGNSCHIHVSLRGTDGS AVFADSNGPHGMSSMFRSFVAGQLATLREFTLCYAPTINSYKRFADSSFAPTALAWGL DNRTCALRVVGHGQNIRVECRVPGGDVNQYLAVAALIAGGLYGIERGLQLPEPCVGNA YQGADVERLPVTLADAAVLFEDSALVREAFGEDVVAHYLNNARVELAAFNAAVTDWER IRGFERL" CDS complement(3134088..3134945) /codon_start=1 /transl_table=11 /gene="mapB" /locus_tag="BQ2027_MB2886C" /product="methionine aminopeptidase mapb (map) (peptidase m)" /note="Mb2886c, mapB, len: 285 aa. Equivalent to Rv2861c, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 285 aa overlap). Probable mapB (alternate gene name: map), methionine aminopeptidase (EC 3.4.11.18), equivalent to Q9CBU7|MAPB|ML1576 METHIONINE AMINOPEPTIDASE from Mycobacterium leprae (285 aa), FASTA scores: opt: 1729, E(): 1e-99, (89.75% identity in 283 aa overlap). Also highly similar to many e.g. Q9RKR2|MAP3 from Streptomyces coelicolor (285 aa), FASTA scores: opt: 1385, E(): 2e-78, (70.65% identity in 283 aa overlap); Q9SW64|C7A10.320|AT4G37040 from Arabidopsis thaliana (Mouse-ear cress) (305 aa), FASTA scores: opt: 914, E(): 3e-49, (50.35% identity in 286 aa overlap); P07906|AMPM_ECOLI|MAP|B0168|Z0178|ECS0170 from Escherichia coli strains K12 and O157:H7 (264 aa), FASTA scores: opt: 793, E(): 8.5e-42, (51.0% identity in 245 aa overlap); etc. BELONGS TO PEPTIDASE FAMILY M24A; ALSO KNOWN AS THE MAP FAMILY 1. COFACTOR: COBALT; BINDS 2 IONS PER SUBUNIT. Note that this gene has an N-terminal extension present in the human map, but not in the prokaryotic map's. An alternative start, with RBS, will give a protein equivalent to the shorter prokaryotic map's. Protein product from Mb2886c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2886c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5J3" /db_xref="InterPro:IPR000994" /db_xref="InterPro:IPR001714" /db_xref="InterPro:IPR002467" /db_xref="InterPro:IPR036005" /db_xref="UniProtKB/Swiss-Prot:P0A5J3" /protein_id="SIU01506.1" /translation="MPSRTALSPGVLSPTRPVPNWIARPEYVGKPAAQEGSEPWVQTP EVIEKMRVAGRIAAGALAEAGKAVAPGVTTDELDRIAHEYLVDNGAYPSTLGYKGFPK SCCTSLNEVICHGIPDSTVITDGDIVNIDVTAYIGGVHGDTNATFPAGDVADEHRLLV DRTREATMRAINTVKPGRALSVIGRVIESYANRFGYNVVRDFTGHGIGTTFHNGLVVL HYDQPAVETIMQPGMTFTIEPMINLGALDYEIWDDGWTVVTKDRKWTAQFEHTLLVTD TGVEILTCL" CDS complement(3134987..3135571) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2887C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2887c, -, len: 194 aa. Equivalent to Rv2862c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 194 aa overlap). Conserved hypothetical protein, showing some similarity with others e.g. Q9X8X5|SCH35.31c HYPOTHETICAL 19.6 KDA PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 266, E(): 2.2e-11, (34.65% identity in 179 aa overlap); Q9Z5H1|ML0169|MLCB373.19 HYPOTHETICAL 22.1 KDA PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 195, E(): 2.3e-06, (30.15% identity in 189 aa overlap); etc. Also some similarity to P71544|Y966_MYCTU|Rv0966c|MT0994|MTCY10D7.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (230 aa), FASTA scores: opt: 209, E(): 2.6e-07, (31.5% identity in 184 aa overlap). Mb2887c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR012551" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2K8" /protein_id="SIU01507.1" /translation="MTETGGDMVALRVSDADPNGTMRRLHNAVALGLINIDEFEQRSS RVSFARTRSELDGLVGDLPRPGAIVTSAADRVELRGWAGSLKRHGEWIVPTRLALVRR LGSIELDLVKARFAGPVVVIELDMMFGSLEVRLPNGASASIDDVEVYVGSASDRRKDA PAEGTPHVVLTGRMVCGSVVIKGPRRALLRRHRG" CDS 3135729..3135923 /codon_start=1 /transl_table=11 /gene="vapB23" /locus_tag="BQ2027_MB2887A" /product="Possible antitoxin VapB23" /note="Mb2887A, len: 64 aa. Equivalent to Rv2862A len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 64 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB23, antitoxin,part of toxin-antitoxin (TA) operon with Rv2863 (See Pandey and Gerdes, 2005)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2U8" /protein_id="SIU01508.1" /translation="MSLSNWLRQAGLRQLEAQRQRPLRTAQELREFFASRPDETGAEP DWQAHLQVMAESRRRGLPAP" CDS 3135920..3136300 /codon_start=1 /transl_table=11 /gene="vapc23" /locus_tag="BQ2027_MB2888" /product="possible toxin vapc23" /note="Mb2888, -, len: 126 aa. Equivalent to Rv2863, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Q50595|YI38_MYCTU|Rv1838c|MT1886|MTCY1A11.05|MTCY359.35 CONSERVED HYPOTHETICAL PROTEIN (131 aa), FASTA scores: opt: 299, E(): 6.5e-15, (39.0% identity in 123 aa overlap)." /db_xref="GOA:A0A1R3Y2E4" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E4" /protein_id="SIU01509.1" /translation="MIFVDTNVFMYAVGRDHPLRMPAREFLEHSLEHQDRLVTSAEAM QELLNAYVPVGRNSTLDSALTLVRALTEIWPVEAADVAHARTLHHRHPGLGARDLLHL ACCQRRGVTRIKTFDHTLASAFRS" CDS complement(3136382..3138193) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2889C" /product="POSSIBLE PENICILLIN-BINDING LIPOPROTEIN" /note="Mb2889c, -, len: 603 aa. Equivalent to Rv2864c, len: 603 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 603 aa overlap). Possible penicillin-binding lipoprotein, probably located in periplasm, equivalent to Q9CBU6|ML1577 PROBABLE PENICILLIN BINDING PROTEIN from Mycobacterium leprae (608 aa), FASTA scores: opt: 3352, E(): 2.1e-193, (81.5% identity in 606 aa overlap). Also shows some similarity to others e.g. P72405|PCBR from Streptomyces clavuligerus (551 aa), FASTA scores: opt: 543, E(): 6.1e-25, (28.4% identity in 567 aa overlap); Q9F2L0|SCH63.18c from Streptomyces coelicolor (546 aa), FASTA scores: opt: 519, E(): 1.7e-23, (29.3% identity in 577 aa overlap); Q9RKD1|SCE87.07 from Streptomyces coelicolor (541 aa), FASTA scores: opt: 472, E(): 1.1e-20, (34.3% identity in 318 aa overlap); etc. Equivalent to AAK47258 from Mycobacterium tuberculosis strain CDC1551 (618 aa) but shorter 15 aa. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and PS00017 ATP/GTP-binding site motif A (P-loop). Mb2889c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2F2" /db_xref="InterPro:IPR001460" /db_xref="InterPro:IPR007887" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F2" /protein_id="SIU01510.1" /translation="MVTKTTLASATSGLLLLAVVAMSGCTPRPQGPGPAAEKFFAALA IGDTASAAQLSDNPNEAREALNAAWAGLQAAHLDAQVLSAKYAEDTGTVAYRFSWHLP KDRIWTYDGQLKMARDEGRWHVRWTTSGLHPKLGEHQTFALRADPPRRASVNEVGGTD VLVPGYLYHYSLDAGQAGRELFGTAHAVVGALHPFDDTLNDPQLLAEQASSSTQPLDL VTLHADDSNRVAAAIGQLPGVVITPQAELLPTDKHFAPAVLNDVKKAVVDELDGKAGW RVVSVNQNGVDVSVLHEVAPSPASSVSITLDRVVQNAAQHAVNTRGGKAMIVVIKPST GEILAIAQNAGADADGPVATTGLYPPGSTFKMITAGAAVERDLATPETLLGCPGEIDI GHRTIPNYGGFDLGVVPMSRAFASSCNTTFAELSSRLPPRGLTQAARRYGIGLDYQVD GITTVTGSVPPTVDLAERTEDGFGQGKVLASPFGMALVAATVAAGKTPVPQLIAGRPT AVEGDATPISQKMIDALRPMMRLVVTNGTAKEIAGCGEVFGKTGEAEFPGGSHSWFAG YRGDLAFASLIVGGGSSEYAVRMTKVMFESLPPGYLA" CDS 3138465..3138746 /codon_start=1 /transl_table=11 /gene="relf" /locus_tag="BQ2027_MB2890" /product="antitoxin relf" /note="Mb2890, -, len: 93 aa. Equivalent to Rv2865, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Conserved hypothetical protein, showing weak similarity with P58235|YR54_SYNY3|SSR2754 HYPOTHETICAL 9.7 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (87 aa), FASTA scores: opt: 134, E(): 0.007, (30.65% identity in 75 aa overlap); BAB58570|SAV2408 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (83 aa), FASTA scores: opt: 124, E(): 0.037, (27.5% identity in 80 aa overlap). Also similar to Rv1247|MTV006.19c HYPOTHETICAL 9.8 KDA PROTEIN from Mycobacterium tuberculosis (89 aa), FASTA scores: opt: 249, E(): 2.6e-11, (44.2% identity in 86 aa overlap). Protein product from Mb2890 detected using SWATH mass spectrometry. Mb2890 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2G0" /protein_id="SIU01511.1" /translation="MRILPISTIKGKLNEFVDAVSSTQDQITITKNGAPAAVLVGADE WESLQETLYWLAQPGIRESIAEADADIASGRTYGEDEIRAEFGVPRRPH" CDS 3138750..3139013 /codon_start=1 /transl_table=11 /gene="relg" /locus_tag="BQ2027_MB2891" /product="toxin relg" /note="Mb2891, -, len: 87 aa. Equivalent to Rv2866, len: 87 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 aa overlap). Conserved hypothetical protein, similar to O50461|Rv1246c|MTV006.18c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (97 aa), FASTA scores: opt: 290, E(): 3.6e-16, (54.1% identity in 85 aa overlap). Mb2891 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007712" /db_xref="InterPro:IPR035093" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2E5" /protein_id="SIU01512.1" /translation="MPYTVRFTTTARRDLHKLPPRILAAVVEFAFGDLSREPLRVGKP LRRELAGTFSARRGTYRLLYRIDDEHTTVVILRVDHRADIYRR" CDS complement(3139386..3140240) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2892C" /product="gcn5-related n-acetyltransferase" /note="Mb2892c, -, len: 284 aa. Equivalent to Rv2867c, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Conserved hypothetical protein, similar to Q9KYR8|SC5H4.21 HYPOTHETICAL 31.3 KDA PROTEIN from Streptomyces coelicolor (287 aa), FASTA scores: opt: 798, E(): 2.4e-45, (47.95% identity in 269 aa overlap). Protein product from Mb2892c detected using SWATH mass spectrometry. Mb2892c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2G3" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR013653" /db_xref="InterPro:IPR016181" /db_xref="InterPro:IPR016794" /db_xref="InterPro:IPR025289" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2G3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01513.1" /translation="MSAPPISRLVGERQVSVVRDAAAVWRVLDDDPIESCMVAARVAD HGIDPNAIGGELWTRRGAHESLCFAGANLIPLRGGPIDLNAFADVAMSTPRRCSSLVG RADLVLPMWQRLEPVWGPARDVRDNQPLMALATHPSCAIDTGVRQVRPEELDSYLVAA VDMFIGEVGVDPRLGDGGRGYRRRVAGLIAAGRAWARFEHGQVIFKAEVGSQSPAVGQ IQGVWVHPEWRGIGLGTAGTATLAAVIVGSGRIASLYVNSFNTVARAAYARVGFKEIG TFATVLLD" CDS complement(3140296..3141459) /codon_start=1 /transl_table=11 /gene="gcpE" /locus_tag="BQ2027_MB2893C" /product="PROBABLE GCPE PROTEIN" /note="Mb2893c, gcpE, len: 387 aa. Equivalent to Rv2868c, len: 387 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 387 aa overlap). Probable gcpE protein (protein e), equivalent to Q9CBU5|GCPE|ML1581 HYPOTHETICAL PROTEIN GCPE from Mycobacterium leprae (392 aa), FASTA scores: opt: 2247, E(): 6.8e-134, (87.65% identity in 388 aa overlap). Highly similar to essential gene of unknown function from Escherichia coli and other prokaryotes e.g. Q9X7W2|GCPE_STRCO|SC6A5.16 GCPE PROTEIN HOMOLOG from Streptomyces coelicolor (384 aa), FASTA scores: opt: 1965, E(): 3.8e-116, (78.2% identity in 385 aa overlap); P54482|GCPE_BACSU GCPE PROTEIN HOMOLOG from Bacillus subtilis (377 aa), FASTA scores: opt: 1157, E(): 2.6e-65, (49.55% identity in 351 aa overlap); P27433|GCPE_ECOLI|B2515|Z3778|ECS3377 GCPE PROTEIN (PROTEIN E) from Escherichia coli strains K12 and O157:H7 (372 aa), FASTA scores: opt: 984, E(): 2e-54, (44.15% identity in 360 aa overlap); etc. BELONGS TO THE GCPE FAMILY. Protein product from Mb2893c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2893c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXN6" /db_xref="InterPro:IPR004588" /db_xref="InterPro:IPR011005" /db_xref="InterPro:IPR016425" /db_xref="UniProtKB/Swiss-Prot:Q7TXN6" /protein_id="SIU01514.1" /translation="MTVGLGMPQPPAPTLAPRRATRQLMVGNVGVGSDHPVSVQSMCT TKTHDVNSTLQQIAELTAAGCDIVRVACPRQEDADALAEIARHSQIPVVADIHFQPRY IFAAIDAGCAAVRVNPGNIKEFDGRVGEVAKAAGAAGIPIRIGVNAGSLDKRFMEKYG KATPEALVESALWEASLFEEHGFGDIKISVKHNDPVVMVAAYELLAARCDYPLHLGVT EAGPAFQGTIKSAVAFGALLSRGIGDTIRVSLSAPPVEEVKVGNQVLESLNLRPRSLE IVSCPSCGRAQVDVYTLANEVTAGLDGLDVPLRVAVMGCVVNGPGEAREADLGVASGN GKGQIFVRGEVIKTVPEAQIVETLIEEAMRLAAEMGEQAPGATPSGSPIVTVS" CDS complement(3141476..3142690) /codon_start=1 /transl_table=11 /gene="rip" /locus_tag="BQ2027_MB2894C" /product="membrane bound metalloprotease" /note="Mb2894c, -, len: 404 aa. Equivalent to Rv2869c, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 404 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CBU4|ML1582 PROBABLE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (404 aa), FASTA scores: opt: 2250, E(): 1.1e-128, (82.2% identity in 404 aa overlap). Also weakly similar to other membrane proteins or hypothetical proteins e.g. Q9A710|CC1916 PUTATIVE MEMBRANE-ASSOCIATED ZINC METALLOPROTEASE from Caulobacter crescentus (398 aa), FASTA scores: opt: 368, E(): 7.8e-15, (28.1% identity in 427 aa overlap). Protein product from Mb2894c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2894c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F1" /db_xref="InterPro:IPR001478" /db_xref="InterPro:IPR008915" /db_xref="InterPro:IPR036034" /db_xref="InterPro:IPR041489" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F1" /protein_id="SIU01515.1" /translation="MMFVTGIVLFALAILISVALHECGHMWVARRTGMKVRRYFVGFG PTLWSTRRGETEYGVKAVPLGGFCDIAGMTPVEELDPDERDRAMYKQATWKRVAVLFA GPGMNLAICLVLIYAIALVWGLPNLHPPTRAVIGETGCVAQEVSQGKLEQCTGPGPAA LAGIRSGDVVVKVGDTPVSSFDEMAAAVRKSHGSVPIVVERDGTAIVTYVDIESTQRW IPNGQGGELQPATVGAIGVGAARVGPVRYGVFSAMPATFAFTGDLTVEVGKALAALPT KVGALVRAIGGGQRDPQTPISVVGASIIGGDTVDHGLWVAFWFFLAQLNLILAAINLL PLLPFDGGHIAVAVFERIRNMVRSARGKVAAAPVNYLKLLPATYVVLVLVVGYMLLTV TADLVNPIRLFQ" CDS complement(3142698..3143939) /codon_start=1 /transl_table=11 /gene="dxr" /locus_tag="BQ2027_MB2895C" /product="PROBABLE 1-DEOXY-D-XYLULOSE 5-PHOSPHATE REDUCTOISOMERASE DXR (DXP REDUCTOISOMERASE) (1-DEOXYXYLULOSE-5-PHOSPHATE REDUCTOISOMERASE)" /note="Mb2895c, dxr, len: 413 aa. Equivalent to Rv2870c, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 413 aa overlap). Probable dxr, 1-deoxy-D-xylulose 5-phosphate reductoisomerase (EC 1.1.1.-), equivalent to Q9CBU3|DXR|ML1583 1-DEOXY-D-XYLULOSE 5-PHOSPHATE REDUCTOISOMERASE from Mycobacterium leprae (406 aa), FASTA scores: opt: 2145, E(): 1e-124, (84.05% identity in 395 aa overlap). Also highly similar to others e.g. Q9AJD7|DXR from Kitasatospora griseola (Streptomyces griseolosporeus) (386 aa), FASTA scores: opt: 1176, E(): 5.2e-65, (56.45% identity in 388 aa overlap); Q9KYS1|DXR_STRCO|SC5H4.18 from Streptomyces coelicolor (401 aa), FASTA scores: opt: 1079, E(): 5.1e-59, (52.25% identity in 396 aa overlap); P45568|DXR|B0173 from Escherichia coli strain K12 (398 aa), FASTA scores: opt: 120, E(): 0.032, (52.9% identity in 34 aa overlap); etc. Contains PS00133 Zinc carboxypeptidases, zinc-binding region 2 signature. BELONGS TO THE DXR FAMILY. N-terminus shortened since first submission. Protein product from Mb2895c detected using SWATH mass spectrometry. Mb2895c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64013" /db_xref="InterPro:IPR003821" /db_xref="InterPro:IPR013512" /db_xref="InterPro:IPR013644" /db_xref="InterPro:IPR026877" /db_xref="InterPro:IPR036169" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P64013" /protein_id="SIU01516.1" /translation="MTNSTDGRADGRLRVVVLGSTGSIGTQALQVIADNPDRFEVVGL AAGGAHLDTLLRQRAQTGVTNIAVADEHAAQRVGDIPYHGSDAATRLVEQTEADVVLN ALVGALGLRPTLAALKTGARLALANKESLVAGGSLVLRAARPGQIVPVDSEHSALAQC LRGGTPDEVAKLVLTASGGPFRGWSAADLEHVTPEQAGAHPTWSMGPMNTLNSASLVN KGLEVIETHLLFGIPYDRIDVVVHPQSIIHSMVTFIDGSTIAQASPPDMKLPISLALG WPRRVSGAAAACDFHTASSWEFEPLDTDVFPAVELARQAGVAGGCMTAVYNAANEEAA AAFLAGRIGFPAIVGIIADVLHAADQWAVEPATVDDVLDAQRWARERAQRAVSGMASV AIASTAKPGAAGRHASTLERS" CDS 3144066..3144323 /codon_start=1 /transl_table=11 /gene="vapb43" /locus_tag="BQ2027_MB2896" /product="possible antitoxin vapb43" /note="Mb2896, -, len: 85 aa. Equivalent to Rv2871, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein (see citation below), similar to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O50456|Rv1241|MTV006.13 (86 aa), FASTA scores: opt: 172, E(): 2.9e-05, (37.2% identity in 86 aa overlap); O53811|Rv0748|MTV041.22 (85 aa), FASTA scores: opt: 170, E(): 4e-05, (35.3% identity in 85 aa overlap); etc. Mb2896 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5H0" /db_xref="InterPro:IPR002145" /db_xref="InterPro:IPR010985" /db_xref="UniProtKB/Swiss-Prot:P0A5H0" /protein_id="SIU01517.1" /translation="MRTTIRIDDELYREVKAKAARSGRTVAAVLEDAVRRGLNPPKPQ AAGRYRVQPSGKGGLRPGVDLSSNAALAEAMNDGVSVDAVR" CDS 3144310..3144753 /codon_start=1 /transl_table=11 /gene="vapc43" /locus_tag="BQ2027_MB2897" /product="possible toxin vapc43. contains pin domain." /note="Mb2897, -, len: 147 aa. Equivalent to Rv2872, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Conserved hypothetical protein (see citation below), similar to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53683|Rv0277c|MTV035.05c (142 aa), FASTA scores: opt: 357, E(): 1.4e-17, (41.45% identity in 140 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 350, E(): 4.3e-17, (41.55% identity in 142 aa overlap); etc. Mb2897 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65044" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/Swiss-Prot:P65044" /protein_id="SIU01518.1" /translation="MLCVDVNVLVYAHRADLREHADYRGLLERLANDDEPLGLPDSVL AGFIRVVTNRRVFTEPTSPQDAWQAVDALLAAPAAMRLRPGERHWMAFRQLASDVDAN GNDIADAHLAAYALENNATWLSADRGFARFRRLRWRHPLDGQTHL" CDS 3144833..3145495 /codon_start=1 /transl_table=11 /gene="mpt83" /locus_tag="BQ2027_MB2898" /standard_name="mpt83" /product="cell surface lipoprotein mpt83 (lipoprotein p23)" /note="Mb2898, mpb83, len: 220 aa. Equivalent to Rv2873, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). mpt83 (alternate gene name: mpb83), cell surface lipoprotein (see citations below). Also similar to upstream ORF Q50769|MP70_MYCTU|MPT70|MPB70|Rv2875|MT2943|MTCY274.06 which is also known as MAJOR SECRETED IMMUNOGENIC PROTEIN MPT70 PRECURSOR from Mycobacterium tuberculosis (193 aa), FASTA scores: opt: 806, E(): 2.7e-38, (70.25% identity in 185 aa overlap). BELONGS TO THE MPT70 / MPT83 FAMILY. ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR. Protein product from Mb2898 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2898 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0CAX7" /db_xref="InterPro:IPR000782" /db_xref="InterPro:IPR036378" /db_xref="UniProtKB/Swiss-Prot:P0CAX7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01519.1" /translation="MINVQAKPAAAASLAAIAIAFLAGCSSTKPVSQDTSPKPATSPA APVTTAAMADPAADLIGRGCAQYAAQNPTGPGSVAGMAQDPVATAASNNPMLSTLTSA LSGKLNPDVNLVDTLNGGEYTVFAPTNAAFDKLPAATIDQLKTDAKLLSSILTYHVIA GQASPSRIDGTHQTLQGADLTVIGARDDLMVNNAGLVCGGVHTANATVYMIDTVLMPP AQ" CDS 3145775..3147862 /codon_start=1 /transl_table=11 /gene="dipZ" /locus_tag="BQ2027_MB2899" /product="POSSIBLE INTEGRAL MEMBRANE C-TYPE CYTOCHROME BIOGENESIS PROTEIN DIPZ" /note="Mb2899, dipZ, len: 695 aa. Equivalent to Rv2874, len: 695 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 695 aa overlap). Possible dipZ, cytochrome c-type biogenesis protein (see citation below), probable integral membrane protein, similar in part to others or hypothetical proteins e.g. CAC48606|SMB20213 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (627 aa), FASTA scores: opt: 844, E(): 7.3e-43, (32.65% identity in 643 aa overlap); Q9ZMH0|CCDA OR JHP0250 PUTATIVE CYTOCHROME C-TYPE BIOGENESIS PROTEIN from Helicobacter pylori J99 (Campylobacter pylori J99) (239 aa), FASTA scores: opt: 250, E(): 1.4e-07, (27.3% identity in 227 aa overlap); Q9LA04|CCDA C-TYPE CYTOCHROME BIOGENESIS PROTEIN from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (252 aa), FASTA scores: opt: 245, E(): 2.9e-07, (27.85% identity in 244 aa overlap); etc. Also similar to O06393|CCSA|Rv0527|MTCY25D10.06 CYTOCHROME C-TYPE BIOGENESIS PROTEIN from Mycobacterium tuberculosis (259 aa), FASTA scores: opt: 280, E(): 2.4e-09, (29.3% identity in 239 aa overlap). Protein product from Mb2899 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2899 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59960" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR003834" /db_xref="InterPro:IPR008979" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="InterPro:IPR041017" /db_xref="UniProtKB/Swiss-Prot:P59960" /protein_id="SIU01520.1" /translation="MVESRRAAAAASAYASRCGIAPATSQRSLATPPTISVPSGEGRC RCHVARGAGRDPRRRLRRRRWCGRCGYHSHLTGGEFDVNRLCQQRSRERSCQLVAVPA DPRPKRQRITDVLTLALVGFLGGLITGISPCILPVLPVIFFSGAQSVDAAQVAKPEGA VAVRRKRALSATLRPYRVIGGLVLSFGMVTLLGSALLSVLHLPQDAIRWAALVALVAI GAGLIFPRFEQLLEKPFSRIPQKQIVTRSNGFGLGLALGVLYVPCAGPILAAIVVAGA TATIGLGTVVLTATFALGAALPLLFFALAGQRIAERVGAFRRRQREIRIATGSVTILL AVALVFDLPAALQRAIPDYTASLQQQISTGTEIREQLNLGGIVNAQNAQLSNCSDGAA QLESCGTAPDLKGITGWLNTPGNKPIDLKSLRGKVVLIDFWAYSCINCQRAIPHVVGW YQAYKDSGLAVIGVHTPEYAFEKVPGNVAKGAANLGISYPIALDNNYATWTNYRNRYW PAEYLIDATGTVRHIKFGEGDYNVTETLVRQLLNDAKPGVKLPQPSSTTTPDLTPRAA LTPETYFGVGKVVNYGGGGAYDEGSAVFDYPPSLAANSFALRGRWALDYQGATSDGND AAIKLNYHAKDVYIVVGGTGTLTVVRDGKPATLPISGPPTTHQVVAGDRLASETLEVR PSKGLQVFSFTYG" CDS 3147958..3148539 /codon_start=1 /transl_table=11 /gene="mpt70" /locus_tag="BQ2027_MB2900" /standard_name="mpt70" /product="major secreted immunogenic protein mpt70" /note="Mb2900, mpb70, len: 193 aa. Equivalent to Rv2875, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 193 aa overlap). mpt70 (alternate gene name: mpb70), major secreted immunogenic protein MPT70 precursor (see citations below). Also similar to downstream ORF Q10790|MP83_MYCTU|MPT83|MPB83|Rv2873|MT2940 |MTCY274.04 CELL SURFACE LIPOPROTEIN MPT83 PRECURSOR (LIPOPROTEIN P23) (220 aa), FASTA scores: opt: 806, E(): 1.2e-40, (70.25% identity in 185 aa overlap). BELONGS TO THE MPT70 / MPT83 FAMILY. GENERALLY FOUND AS A MONOMER; HOMODIMER IN CULTURE FLUIDS. Protein product from Mb2900 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2900 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A669" /db_xref="InterPro:IPR000782" /db_xref="InterPro:IPR036378" /db_xref="UniProtKB/Swiss-Prot:P0A669" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01521.1" /translation="MKVKNTIAATSFAAAGLAALAVAVSPPAAAGDLVGPGCAEYAAA NPTGPASVQGMSQDPVAVAASNNPELTTLTAALSGQLNPQVNLVDTLNSGQYTVFAPT NAAFSKLPASTIDELKTNSSLLTSILTYHVVAGQTSPANVVGTRQTLQGASVTVTGQG NSLKVGNADVVCGGVSTANATVYMIDSVLMPPA" CDS 3148591..3148905 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2901" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb2901, -, len: 104 aa. Equivalent to Rv2876, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 104 aa overlap). Possible conserved transmembrane protein, equivalent (but longer 16 aa) to Q9CBU2|ML1584 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (84 aa), FASTA scores: opt: 444, E(): 8.3e-26, (73.85% identity in 88 aa overlap). Mb2901 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2F6" /db_xref="InterPro:IPR024341" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2F6" /protein_id="SIU01522.1" /translation="MFGQWEFDVSPTGGIAVASTEVEHFAGSQHEVDTAEVPSAAWGR SRIDHRTWHIVGLCIFGFLLAMLRGNHVGHVEDWFLITFAAVVLFVLARDLWGRRRGW IR" CDS complement(3148936..3149799) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2902C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2902c, -, len: 287 aa. Equivalent to Rv2877c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 287 aa overlap). Probable conserved integral membrane protein, Mer family possibly involved in transport of mercury, similar to others, and to the fourth protein of the mercury resistance operon of Streptomyces sp (or other organisms), and to putative cytochrome-c biogenesis proteins e.g. Q9XBD1|CZA382.20C PUTATIVE INTEGRAL MEMBRANE TRANSPORTER from Amycolatopsis orientalis (298 aa), FASTA scores: opt: 913, E(): 7.6e-46, (51.55% identity in 293 aa overlap); P30344|MER4_STRLI MERCURY RESISTANCE PROBABLE HG TRANSPORT PROTEIN from Streptomyces lividans (319 aa), FASTA scores: opt: 427, E(): 1.2e-17, (32.85% identity in 289 aa overlap); Q9M5P3 PUTATIVE CYTOCHROME C BIOGENESIS PROTEIN PRECURSOR from Arabidopsis thaliana (Mouse-ear cress) (354 aa), FASTA scores: opt: 229, E(): 4e-06, (29.85% identity in 221 aa overlap); etc. Contains PS00044 Bacterial regulatory proteins, lysR family signature. Note that previously known as merT. Mb2902c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2H3" /db_xref="InterPro:IPR003834" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H3" /protein_id="SIU01523.1" /translation="MNEALIGLAFAAGLVAALNPCGFAMLPAYLLLVVHGQDSAGRTG PLSAVGRAAAATVGMALGFLTVFGIFGALTISAATAVQRYLPYATVLIGLALIALGGW LLLGRGLTALTPRSLGVRWAPTVRLGSMYGYGISYAVASLSCTIGPFLAVTGAGLRGG SVVGSVAIYLAYVAGLTLVVGVLAVAAATASSALADRLRRILPFVNRISGALLVVVGL YVGYYGLYELRLIAGVGANPQDAVIAAAGRLQGALAGWVNQHGAWPWAVLLVVLVVGA FAGTWFRRVRR" CDS complement(3149804..3150325) /codon_start=1 /transl_table=11 /gene="mpt53" /locus_tag="BQ2027_MB2903C" /standard_name="mpt53" /product="soluble secreted antigen mpt53 precursor" /note="Mb2903c, mpb53, len: 173 aa. Equivalent to Rv2878c, len: 173 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 173 aa overlap). mpt53, secreted protein (contains N-terminal signal sequence) (see citations below). Shows some similarity with several disulfide bond interchange proteins e.g. P43787|THIX_HAEIN THIOREDOXIN-LIKE PROTEIN HI1115 from Haemophilus influenzae (167 aa), FASTA scores: opt: 200, E(): 1.4e-06, (28.9% identity in 135 aa overlap); P52237|TIPB_PSEFL THIOL:DISULFIDE INTERCHANGE PROTEIN TIPB PRECURSOR (CYTOCHROME C BIOGENESIS PROTEIN TIPB) (178 aa), FASTA scores: opt: 184, E(): 1.8e-05, (26.3% identity in 171 aa overlap); etc. Also highly similar to O53924|DSBF|Rv1677|MTV047.12 PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (182 aa), FASTA scores: opt: 482, E(): 5.7e-26, (52.8% identity in 142 aa overlap). COULD BE BELONG TO THE THIOREDOXIN FAMILY. Note that also previously known as dsbE. Protein product from Mb2903c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2903c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A619" /db_xref="InterPro:IPR000866" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:P0A619" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01524.1" /translation="MSLRLVSPIKAFADGIVAVAIAVVLMFGLANTPRAVAADERLQF TATTLSGAPFDGASLQGKPAVLWFWTPWCPFCNAEAPSLSQVAAANPAVTFVGIATRA DVGAMQSFVSKYNLNFTNLNDADGVIWARYNVPWQPAFVFYRADGTSTFVNNPTAAMS QDELSGRVAALTS" CDS complement(3150511..3151605) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2904C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2904c, -, len: 364 aa. Equivalent to 5' end of Rv2880c and 3' end of Rv2879c, len: 275 aa and 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap and 100.0% identity in 188 aa overlap). Rv2880c: Conserved hypothetical protein, highly similar in N-terminus to others e.g. O86754|SC6A9.22c HYPOTHETICAL 40.4 KDA PROTEIN from Streptomyces coelicolor (368 aa), FASTA scores: opt: 663, E(): 2.6e-33, (52.6% identity in 213 aa overlap); Q55880|Y098_SYNY3|SLL0098 HYPOTHETICAL 38.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (350 aa), FASTA scores: opt: 362, E(): 7.3e-15, (38.9% identity in 162 aa overlap); O66732|AQ_416 HYPOTHETICAL 40.2 KDA PROTEIN from Aquifex aeolicus (348 aa), FASTA scores: opt: 321, E(): 2.4e-12, (39.75% identity in 146 aa overlap); etc. Appears to be a frame shift with respect to preceding ORF but we can detect no error in the cosmid sequence to account for this. Rv2879c: Conserved hypothetical protein, similar to others e.g. C-terminus of Q9RVT6|DR0936 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (346 aa), FASTA scores: opt: 505, E(): 1e-26, (46.5% identity in 185 aa overlap); O34617|YLON_BACSU HYPOTHETICAL 41.6 KDA PROTEIN from Bacillus subtilis (363 aa), FASTA scores: opt: 459, E(): 1.2e-24, (40.5% identity in 185 aa overlap); YFGB_ECOLI|P36979 hypothetical 43.1 kd protein from Escherichia coli (384 aa), FASTA scores, opt: 410, E(): 2.8e-21, (41.7% identity in 187 aa overlap); etc. Appears to be a frame shift with respect to following ORF but we can detect no error in the cosmid sequence to account for this. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2880c and Rv2879c exist as 2 genes. In Mycobacterium bovis, a single base deletion (g-*) leads to a single product. Protein product from Mb2904c detected using SWATH mass spectrometry and 0. Mb2904c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A645" /db_xref="InterPro:IPR004383" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR027492" /db_xref="InterPro:IPR040072" /db_xref="UniProtKB/Swiss-Prot:P0A645" /protein_id="SIU01525.1" /translation="MVPELMFDEPRPGRPPRHLADLDAAGRASAVAELGLPAFRAKQL AHQYYGRLIADPRQMTDLPAAVRDRIAGAMFPNLLTASADITCDAGQTRKTLWRAVDG TMFESVLMRYPRRNTVCISSQAGCGMACPFCATGQGGLTRNLSTAEILEQVRAGAAAL RDDFGDRLSNVVFMGMGEPLANYARVLAAVQRITARPPSGFGISARAVTVSTVGLAPA IRNLADARLGVTLALSLHAPDDGLRDTLVPVNNRWRISEALDAARYYANVTGRRVSIE YALIRDVNDQPWRADLLGKRLHRVLGPLAHVNLIPLNPTPGSDWDASPKPVEREFVKR VRAKGVSCTVRDTRGREISAACGQLAAVGG" CDS complement(3151628..3152548) /codon_start=1 /transl_table=11 /gene="cdsA" /locus_tag="BQ2027_MB2905C" /product="PROBABLE INTEGRAL MEMBRANE PHOSPHATIDATE CYTIDYLYLTRANSFERASE CDSA (CDP-DIGLYCERIDE SYNTHETASE) (CDP-DIGLYCERIDE PYROPHOSPHORYLASE) (CDP-DIACYLGLYCEROL SYNTHASE) (CDS) (CTP:PHOSPHATIDATE CYTIDYLYLTRANSFERASE) (CDP-DAG SYNTHASE) (CDP-DG SYNTHETASE)" /note="Mb2905c, cdsA, len: 306 aa. Equivalent to Rv2881c, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 306 aa overlap). Probable cdsA, phosphatidate cytidylyltransferase (EC 2.7.7.41), integral membrane protein, equivalent to Q9CBU1|CDSA_MYCLE|ML1589 PHOSPHATIDATE CYTIDYLYLTRANSFERASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 1470, E(): 1.1e-84, (70.3% identity in 313 aa overlap). Also similar to others e.g. Q9KPV7|VC2255 from Vibrio cholerae (280 aa), FASTA scores: opt: 383, E(): 1.1e-16, (29.3% identity in 280 aa overlap); Q9CDT2|CDSA from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (267 aa), FASTA scores: opt: 361, E(): 2.6e-15, (29.05% identity in 265 aa overlap); P06466|CDSA_ECOLI|CDS|B0175|Z0186|ECS0177 from Escherichia coli strains K12 and O157:H7 (249 aa), FASTA scores: opt: 352, E(): 9.2e-15, (40.4% identity in 156 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CDS FAMILY. Protein product from Mb2905c detected using shotgun mass spectrometry. Mb2905c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63759" /db_xref="InterPro:IPR000374" /db_xref="UniProtKB/Swiss-Prot:P63759" /protein_id="SIU01526.1" /translation="MTTNDAGTGNPAEQPARGAKQQPATETSRAGRDLRAAIVVGLSI GLVLIAVLVFVPRVWVAIVAVATLVATHEVVRRLREAGYLIPVIPLLIGGQAAVWLTW PFGAVGALAGFGGMVVVCMIWRLFMQDSVTRPTTGGAPSPGNYLSDVSATVFLAVWVP LFCSFGAMLVYPENGSGWVFCMMIAVIASDVGGYAVGVLFGKHPMVPTISPKKSWEGF AGSLVCGITATIITATFLVGKTPWIGALLGVLFVLTTALGDLVESQVKRDLGIKDMGR LLPGHGGLMDRLDGILPSAVAAWIVLTLLP" CDS complement(3152571..3153128) /codon_start=1 /transl_table=11 /gene="frr" /locus_tag="BQ2027_MB2906C" /product="RIBOSOME RECYCLING FACTOR FRR (RIBOSOME RELEASING FACTOR) (RRF)" /note="Mb2906c, frr, len: 185 aa. Equivalent to Rv2882c, len: 185 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 185 aa overlap). Probable frr, ribosome recycling factor, equivalent to O33046|RRF_MYCLE|FRR|ML1590|MLCB250.76 RIBOSOME RECYCLING FACTOR from Mycobacterium leprae (185 aa), FASTA scores: opt: 1063, E(): 2.6e-60, (90.8% identity in 185 aa overlap). Also highly similar to others e.g. O86770|RRF_STRCO|FRR|SC6A9.40c from Streptomyces coelicolor (185 aa), FASTA scores: opt: 783, E(): 1.5e-42, (63.25% identity in 185 aa overlap); P81101|RRF_BACSU|FRR from Bacillus subtilis (184 aa), FASTA scores: opt: 640, E(): 1.7e-33, (51.65% identity in 182 aa overlap); P16174|RRF_ECOLI|FRR|B0172|Z0183|ECS0174 from Escherichia coli strains K12 and O157:H7 (185 aa), FASTA scores: opt: 473, E(): 1.4e-23, (40.2% identity in 184 aa overlap); etc. BELONGS TO THE RRF FAMILY. Protein product from Mb2906c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2906c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66735" /db_xref="InterPro:IPR002661" /db_xref="InterPro:IPR023584" /db_xref="InterPro:IPR036191" /db_xref="UniProtKB/Swiss-Prot:P66735" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01527.1" /translation="MIDEALFDAEEKMEKAVAVARDDLSTIRTGRANPGMFSRITIDY YGAATPITQLASINVPEARLVVIKPYEANQLRAIETAIRNSDLGVNPTNDGALIRVAV PQLTEERRRELVKQAKHKGEEAKVSVRNIRRKAMEELHRIRKEGEAGEDEVGRAEKDL DKTTHQYVTQIDELVKHKEGELLEV" CDS complement(3153300..3154085) /codon_start=1 /transl_table=11 /gene="pyrH" /locus_tag="BQ2027_MB2907C" /product="PROBABLE URIDYLATE KINASE PYRH (UK) (URIDINE MONOPHOSPHATE KINASE) (UMP KINASE)" /note="Mb2907c, pyrH, len: 261 aa. Equivalent to Rv2883c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Probable pyrH, uridylate kinase (EC 2.7.4.-), equivalent to O33045|PYRH_MYCLE|ML1591|MLCB250.75 URIDYLATE KINASE from Mycobacterium leprae (279 aa), FASTA scores: opt: 1437, E(): 3.8e-81, (85.05% identity in 274 aa overlap). Also highly similar to others e.g. O69913|PYRH from Streptomyces coelicolor (253 aa), FASTA scores: opt: 1086, E(): 1.4e-59, (68.9% identity in 251 aa overlap); P74457|PYRH_SYNY3|SLL0144 from Synechocystis sp. strain PCC 6803 (260 aa), FASTA scores: opt: 851, E(): 4.1e-45, (55.85% identity in 231 aa overlap); P29464|PYRH_ECOLI|SMBA|B0171|Z0182|ECS0173 from strains K12 and O157:H7 (240 aa), FASTA scores: opt: 666, E(): 1.1e-35, (45.7% identity in 232 aa overlap); etc. Protein product from Mb2907c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2907c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65930" /db_xref="InterPro:IPR001048" /db_xref="InterPro:IPR011817" /db_xref="InterPro:IPR015963" /db_xref="InterPro:IPR036393" /db_xref="UniProtKB/Swiss-Prot:P65930" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01528.1" /translation="MTEPDVAGAPASKPEPASTGAASAAQLSGYSRVLLKLGGEMFGG GQVGLDPDVVAQVARQIADVVRGGVQIAVVIGGGNFFRGAQLQQLGMERTRSDYMGML GTVMNSLALQDFLEKEGIVTRVQTAITMGQVAEPYLPLRAVRHLEKGRVVIFGAGMGL PYFSTDTTAAQRALEIGADVVLMAKAVDGVFAEDPRVNPEAELLTAVSHREVLDRGLR VADATAFSLCMDNGMPILVFNLLTDGNIARAVRGEKIGTLVTT" CDS 3154320..3155078 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2908" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb2908, -, len: 252 aa. Equivalent to Rv2884, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Probable transcriptional regulatory protein, highly similar to others e.g. Q05943|GLNR_STRCO|SCD84.26c TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 609, E(): 2.7e-34, (46.4% identity in 224 aa overlap); Q55733|SLL0396 REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEM from Synechocystis sp. strain PCC 6803 (224 aa), FASTA scores: opt: 330, E(): 3e-15, (31.8% identity in 217 aa overlap); Q9A4S3|CC2757 DNA-BINDING RESPONSE REGULATOR from Caulobacter crescentus (223 aa), FASTA scores: opt: 311, E(): 6e-14, (30.3% identity in 221 aa overlap); etc. Also highly similar to O53830|Rv0818|MTV043.10 PUTATIVE REGULATORY PROTEIN from Mycobacterium tuberculosis (255 aa), FASTA scores: opt: 665, E(): 3.8e-38, (47.6% identity in 227 aa overlap). THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. Mb2908 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2G7" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2G7" /protein_id="SIU01529.1" /translation="MPTGPTTGKWHPHEVWRYLLEVLLLTDEADLESALPELESFAQS VQRAPLDDPGAAKGADADVAIIDARADLAAARRVCRRLTTSAPALAVVAVVAPANFVA VDGDWIFDDVLLNAAGGAELQARLRLAITRRRSTLAGTLQFGDLVLHPASYTASLGDR DLGLTLTEFKLMNFLVQHAGRAFTRTRLMREVWGYECHGRIRTVDVHVRRLRAKLGAE HESMIDTVRGVGYMAVTPPQPRWIISESILNRCK" mobile_element complement(3155093..3157360) /mobile_element_type="insertion sequence:IS1539" /locus_tag="BQ2027_IS1539" /note="IS1539, len: 2268 nt. Equivalent to IS1539, len: 2267 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 2268 nt overlap)." gene complement(3155093..3157360) /locus_tag="BQ2027_IS1539" CDS complement(3155157..3156476) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2909C" /product="probable transposase" /note="Mb2909c, -, len: 439 aa. Equivalent to Rv2885c, len: 460 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 437 aa overlap). Putative transposase for IS1539. Contains PS00017 ATP/GTP-binding site motif A (P-loop). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-c) leads to a truncation resulting in a shorter product compared to its homolog in Mycobacterium tuberculosis stain H37Rv (439 aa versus 460 aa). Mb2909c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR001959" /db_xref="InterPro:IPR010095" /db_xref="InterPro:IPR021027" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2H2" /protein_id="SIU01530.1" /translation="MMARLKVPEGWCVQAFRFTLNPTQTQAASLARHFGARRKAFNWT VTALKADIKAWRADGTESAKPSLRVLRKRWNTVKDQVCVNAQTGQVWWPECSKEAYAD GIAGAVDAYWNWQSCRAGKRAGKTVGVPRFKKKGRDADRVCFTTGAMRVEPDRRHLTL PVIGTIRTYENTRRVERLIAKGRARVLAITVRRNGTRLDASVRVLVQRPQQRRVALPD SRVGVDVGVRRLATVADAEGTVLEQVPNPRPLDAALRGLRRVSRARSRCTKGSRRYCE RTTELSRLHRRVNDVRTHHLHVLTTRLAKTHGRIVVEGLDAAGMLRQKGLPGARARRR ALSDAALATPRRHLSYKTGWYGSSLVVADRWFPSSKTCHACRHVQDIGWDEIWQCDGC SITHQRDDNAAINLARYEEPPSVVGPVGAAVKRGADRKTGPGPAGWP" CDS complement(3156473..3157360) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2910C" /product="PROBABLE RESOLVASE" /note="Mb2910c, -, len: 295 aa. Equivalent to Rv2886c, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Probable resolvase for IS1539. Contains PS00213 Lipocalin signature." /db_xref="GOA:P65046" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="UniProtKB/Swiss-Prot:P65046" /protein_id="SIU01531.1" /translation="MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIAL PQWACSRQPSTPPLQTPGRINALWSLRPVLPMPGRGCQLLRLGGRWLSVVCCRNGSMN LVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWGRTAVCARLSS ADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRRRTFLTLLGDPTVRRIVMK RRDRLGRFGFECVQAVLAADGRELVVVDSADVDDDVVGDITEILTSICARLYGKRAAG NRAARAVAAAARAGGHEAR" CDS 3157359..3157778 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2911" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb2911, -, len: 139 aa. Equivalent to Rv2887, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Probable transcriptional regulatory protein, highly similar to Q9EX59|SC1A4.04 PUTATIVE MARR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (151 aa), FASTA scores: opt: 354, E(): 6.6e-16, (42.95% identity in 135 aa overlap); and similar to others e.g. AAF97817|SLYA TRANSCRIPTIONAL REGULATOR SLYA from Escherichia coli strain EPEC 2348/69 (146 aa), FASTA scores: opt: 181, E(): 0.0001, (27.25% identity in 132 aa overlap); P55740|SLYA_ECOLI|AAG56631|B1642|Z2657|ECS2351 TRANSCRIPTIONAL REGULATOR SLYA from Escherichia coli strains K12 and O157:H7 (146 aa), FASTA scores: opt: 177, E(): 0.00018, (27.25% identity in 132 aa overlap); etc. Contains probable helix-turn-helix motif at aa 50-71 (Score 1182, +3.21 SD). Protein product from Mb2911 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2911 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67748" /db_xref="InterPro:IPR000835" /db_xref="InterPro:IPR023187" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P67748" /protein_id="SIU01532.1" /translation="MGLADDAPLGYLLYRVGAVLRPEVSAALSPLGLTLPEFVCLRML SQSPGLSSAELARHASVTPQAMNTVLRKLEDAGAVARPASVSSGRSLPATLTARGRAL AKRAEAVVRAADARVLARLTAPQQREFKRMLEKLGSD" CDS complement(3157792..3159213) /codon_start=1 /transl_table=11 /gene="amiC" /locus_tag="BQ2027_MB2912C" /product="PROBABLE AMIDASE AMIC (AMINOHYDROLASE)" /note="Mb2912c, amiC, len: 473 aa. Equivalent to Rv2888c, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 473 aa overlap). Probable amiC, amidase (EC 3.5.1.4), equivalent to O33040|AMI3_MYCLE|AMIC|ML1596|MLCB250.65 PUTATIVE AMIDASE AMIC from Mycobacterium leprae (468 aa), FASTA scores: opt: 2361, E(): 4.2e-139, (76.7% identity in 468 aa overlap). Also similar to others e.g. Q9A8N0|CC1323 PUTATIVE 6-AMINOHEXANOATE-CYCLIC-DIMER HYDROLASE from Caulobacter crescentus (521 aa), FASTA scores: opt: 925, E(): 7.4e-50, (36.55% identity in 465 aa overlap); O28325|YJ54_ARCFU|AF1954 PUTATIVE AMIDASE (EC 3.5.1.4) from Archaeoglobus fulgidus (453 aa), FASTA scores: opt: 659, E(): 2.2e-33, (31.1% identity in 460 aa overlap); Q55424|AMID_SYNY3|SLL0828 PUTATIVE AMIDASE from Synechocystis sp. strain PCC 6803 (506 aa), FASTA scores: opt: 643, E(): 2.4e-32, (30.7% identity in 466 aa overlap); etc. Also similar to O05835|AMI1_MYCTU|AMIA2|Rv2363|MT2432|MTCY27.17c PUTATIVE AMIDASE AMIA2 (484 aa), FASTA scores: opt: 656, E(): 3.6e-33, (35.9% identity in 465 aa overlap); and Q11056|AMI2_MYCTU|AMIB2|Rv1263|MT1301|MTCY50.19c PUTATIVE AMIDASE from Mycobacterium tuberculosis (462 aa), FASTA scores: opt: 650, E(): 8.2e-33, (33.45% identity in 472 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-poop). BELONGS TO THE AMIDASE FAMILY. Protein product from Mb2912c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2912c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63495" /db_xref="InterPro:IPR000120" /db_xref="InterPro:IPR020556" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/Swiss-Prot:P63495" /protein_id="SIU01533.1" /translation="MSRVHAFVDDALGDLDAVALADAIRSGRVGRADVVEAAIARAEA VNPALNALAYAAFDVARDAAAMGTGQEAFFSGVPTFIKDNVDVAGQPSMHGTDAWEPY AAVADSEITRVVLGTGLVSLGKTQLSEFGFSAVAEHPRLGPVRNPWNTDYTAGASSSG SGALVAAGVVPIAHANDGGGSIRIPAACNGLVGLKPSRGRLPLEPEYRRLPVGIVANG VLTRTVRDTAAFYREAERLWRNHQLPPVGDVTSPVKQRLRIAVVTRSVLREASPEVRQ LTLKLAGLLEELGHRVEHVDHPPAPASFVDDFVLYWGFLALAQVRSGRRTFGRTFDPT RLDELTLGLARHTGRNLHRLPLAIMRLRMLRRRSVRFFGTYDVLLTPTVAEATPQVGY LAPTDYQTVLDRLSSWVVFTPVQNVTGVPAISLPLAQSADGMPVGMMLSADTGREALL LELAYELEEARPWARIHAPNIAE" CDS complement(3159220..3160035) /codon_start=1 /transl_table=11 /gene="tsf" /locus_tag="BQ2027_MB2913C" /product="PROBABLE ELONGATION FACTOR TSF (EF-TS)" /note="Mb2913c, tsf, len: 271 aa. Equivalent to Rv2889c, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 271 aa overlap). Probable tsf, elongation factor, equivalent to O33039|EFTS_MYCLE|TSF|ML1597|MLCB250.64 ELONGATION FACTOR from Mycobacterium leprae (276 aa), FASTA scores: opt: 1430, E(): 1.9e-80, (83.7% identity in 276 aa overlap). Also highly similar to others e.g. Q9X5Z9|EFTS_STRRA|TSF from Streptomyces ramocissimus (278 aa), FASTA scores: opt: 928, E(): 1.1e-49, (57.05% identity in 277 aa overlap); O31213|EFTS_STRCO|TSF|SC2E1.42 from Streptomyces coelicolor (278 aa), FASTA scores: opt: 927, E(): 1.3e-49, (56.3% identity in 277 aa overlap); P80700|EFTS_BACSU|TSF from Bacillus subtilis (292 aa), FASTA scores: opt: 650, E(): 1.3e-32, (43.85% identity in 276 aa overlap); etc. Contains PS01127 Elongation factor Ts signature 2. BELONGS TO THE EF-TS FAMILY. Protein product from Mb2913c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2913c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXN0" /db_xref="InterPro:IPR001816" /db_xref="InterPro:IPR009060" /db_xref="InterPro:IPR014039" /db_xref="InterPro:IPR018101" /db_xref="InterPro:IPR036402" /db_xref="UniProtKB/Swiss-Prot:Q7TXN0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01534.1" /translation="MANFTAADVKRLRELTGAGMLACKNALAETDGDFDKAVEALRIK GAKDVGKRAERATAEGLVAAKDGALIELNCETDFVAKNAEFQTLADQVVAAAAAAKPA DVDALKGASIGDKTVEQAIAELSAKIGEKLELRRVAIFDGTVEAYLHRRSADLPPAVG VLVEYRGDDAAAAHAVALQIAALRARYLSRDDVPEDIVASERRIAEETARAEGKPEQA LPKIVEGRLNGFFKDAVLLEQASVSDNKKTVKALLDVAGVMVTRFVRFEVGQA" CDS complement(3160047..3160910) /codon_start=1 /transl_table=11 /gene="rpsB" /locus_tag="BQ2027_MB2914C" /product="30s ribosomal protein s2 rpsb" /note="Mb2914c, rpsB, len: 287 aa. Equivalent to Rv2890c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 287 aa overlap). Probable rpsB, 30s ribosomal protein s2, equivalent to O33038|RS2_MYCLE|RPSB|ML1598|MLCB250.63 30S RIBOSOMAL PROTEIN S2 from Mycobacterium leprae (277 aa), FASTA scores: opt: 1593, E(): 2.3e-93, (91.5% identity in 270 aa overlap). Also highly similar to others e.g. O31212|RS2_STRCO|RPSB|SC2E1.41 from Streptomyces coelicolor (310 aa), FASTA scores: opt: 1302, E(): 6.1e-75, (70.6% identity in 289 aa overlap); Q9KA63|RPSB|BH2427 from Bacillus halodurans (244 aa), FASTA scores: opt: 991, E(): 2.3e-55, (59.6% identity in 255 aa overlap); P21464|RS2_BACSU|RPSB from Bacillus subtilis (245 aa), FASTA scores: opt: 959, E(): 2.4e-53, (58.55% identity in 246 aa overlap); etc. Contains PS00962 Ribosomal protein S2 signature 1. BELONGS TO THE S2P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2914c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2914c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66538" /db_xref="InterPro:IPR001865" /db_xref="InterPro:IPR005706" /db_xref="InterPro:IPR018130" /db_xref="InterPro:IPR023591" /db_xref="UniProtKB/Swiss-Prot:P66538" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01535.1" /translation="MAVVTMKQLLDSGTHFGHQTRRWNPKMKRFIFTDRNGIYIIDLQ QTLTFIDKAYEFVKETVAHGGSVLFVGTKKQAQESVAAEATRVGMPYVNQRWLGGMLT NFSTVHKRLQRLKELEAMEQTGGFEGRTKKEILGLTREKNKLERSLGGIRDMAKVPSA IWVVDTNKEHIAVGEARKLGIPVIAILDTNCDPDEVDYPIPGNDDAIRSAALLTRVIA SAVAEGLQARAGLGRADGKPEAEAAEPLAEWEQELLASATASATPSATASTTALTDAP AGATEPTTDAS" CDS 3161194..3161943 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2915" /product="Membrane proteins related to metalloendopeptidases" /note="Mb2915, -, len: 249 aa. Equivalent to Rv2891, len: 249 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 249 aa overlap). Conserved hypothetical protein, similar in N-terminus to O69910|SC2E1.40c HYPOTHETICAL 22.8 KDA PROTEIN from Streptomyces coelicolor (226 aa), FASTA scores: opt: 315, E(): 3.4e-11, (40.7% identity in 145 aa overlap). C-terminus overlaps neigbouring ORF." /db_xref="InterPro:IPR011055" /db_xref="InterPro:IPR016047" /db_xref="UniProtKB/Swiss-Prot:P65048" /protein_id="SIU01536.1" /translation="MAKSPARRCTAKVRRVLSRSVLILCWSLLGAAPAHADDSRLGWP LRPPPAVVRQFDAASPNWNPGHRGVDLAGRPGQPVYAAGSATVVFAGLLAGRPVVSLA HPGGLRTSYEPVVAQVRVGQPVSAPTVIGALAAGHPGCQAAACLHWGAMWGPASGANY VDPLGLLKSTPIRLKPLSSEGRTLHYRQAEPVFVNEAAAGALAGAGHRKSPKQGVFRG AAQGGDIVARQPPGRWVCPSSAGGPIGWHRQ" CDS complement(3161722..3162948) /codon_start=1 /transl_table=11 /gene="PPE45" /locus_tag="BQ2027_MB2916C" /product="ppe family protein ppe45" /note="Mb2916c, PPE45, len: 408 aa. Equivalent to Rv2892c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 408 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O06386|Rv3621c|MTCY15C10.31|MTCY07H7B.01 from M. tuberculosis (413 aa), FASTA scores: opt: 957, E(): 6.2e-46, (44.7% identity in 423 aa overlap). Mb2916c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A695" /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/Swiss-Prot:P0A695" /protein_id="SIU01537.1" /translation="MDFGVLPPEINSGRMYAGPGSGPMMAAAAAWDSLAAELGLAAGG YRLAISELTGAYWAGPAAASMVAAVTPYVAWLSATAGQAEQAGMQARAAAAAYELAFA MTVPPPVVVANRALLVALVATNFFGQNTPAIAATEAQYAEMWAQDAAAMYAYAGSAAI ATELTPFTAAPVTTSPAALAGQAAATVSSTVPPLATTAAVPQLLQQLSSTSLIPWYSA LQQWLAENLLGLTPDNRMTIVRLLGISYFDEGLLQFEASLAQQAIPGTPGGAGDSGSS VLDSWGPTIFAGPRASPSVAGGGAVGGVQTPQPYWYWALDRESIGGSVSAALGKGSSA GSLSVPPDWAARARWANPAAWRLPGDDVTALRGTAENALLRGFPMASAGQSTGGGFVH KYGFRLAVMQRPPFAG" CDS 3163348..3164325 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2917" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb2917, -, len: 325 aa. Equivalent to Rv2893, len: 325 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 325 aa overlap). Possible oxidoreductase (EC 1.-.-.-), showing similarity with various proteins and/or oxidoreductases e.g. Q9AE05|RIF11 eleventh protein in the rif biosynthetic gene cluster from Amycolatopsis mediterranei (Nocardia mediterranei) (294 aa), FASTA scores: opt: 270, E(): 4.8e-10, (34.5% identity in 313 aa overlap); O52567 REDUCTASE from Amycolatopsis mediterranei (Nocardia mediterranei) (153 aa), FASTA scores: opt: 251, E(): 5e-09, (42.4% identity in 125 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 249, E(): 1.2e-08, (29.7% identity in 283 aa overlap); etc. Also some similarity with others proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. P71844|Rv0791c|MTCY369.35c PUTATIVE OXIDOREDUCTASE (347 aa), FASTA scores: opt: 264, E(): 1.3e-09, (29.05% identity in 272 aa overlap); and P96809|Rv0132|MTCI5.06c PUTATIVE OXIDOREDUCTASE (360 aa), FASTA scores: opt: 260, E(): 2.4e-09, (33.05% identity in 239 aa overlap). Protein product from Mb2917 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y2X5" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019923" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01538.1" /translation="MTVASTAHHTRRLRFGLAAPLPRAGTQMRAFAQAVEAAGFDVLA FPDHLVPSVSPFAGATAAAMATQRLHTGTLVLNNDFRHPVDTAREAAGVATLAEGRFE LGLGAGHRRSEYDAAGITFDSGATRVARLIESAHLIRALLDAEPVDFDGQHYRVHAEA GSLVAPPKVRVPLLVGGNGTEVLRLGGRIADIVGLAGISHNRDATQVRFTHFDADGLA DRIAVVRHAAGDRFEAIELNALIQAVVCTNDRNAAAAELAATLGGITPEQVLESPFLL LGTHEQMAEALAARQRRFGVSYWTVFDEWAGRASAMRDIAEVIALLRYG" CDS complement(3164322..3165218) /codon_start=1 /transl_table=11 /gene="xerC" /locus_tag="BQ2027_MB2918C" /product="PROBABLE INTEGRASE/RECOMBINASE XERC" /note="Mb2918c, xerC, len: 298 aa. Equivalent to Rv2894c, len: 298 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 298 aa overlap). Probable xerC, integrase/recombinase, equivalent to Q9CBU0|XERC|ML1600|MLCB250.62 INTEGRASE/RECOMBINASE from Mycobacterium leprae (297 aa), FASTA scores: opt: 1624, E(): 2e-97, (85.15% identity in 296 aa overlap). Also highly similar to others integrases/recombinases (generally xerC and xerD) e.g. Q9HTS4|SSS|PA5280 SITE-SPECIFIC RECOMBINASE from Pseudomonas aeruginosa (303 aa), FASTA scores: opt: 660, E(): 3.2e-35, (41.8% identity in 299 aa overlap); Q9HXQ6|XERD|PA3738 INTEGRASE/RECOMBINASE from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 656, E(): 5.7e-35, (40.05% identity in 297 aa overlap); Q9KCP0|BH1529 INTEGRASE/RECOMBINASE from Bacillus halodurans (299 aa), FASTA scores: opt: 645, E(): 2.9e-34, (37.35% identity in 300 aa overlap); etc. Also similar to O33200|Rv1701|MTCI125.23 INTEGRASE/RECOMBINASE from Mycobacterium tuberculosis (311 aa), FASTA scores: opt: 646, E(): 2.6e-34, (43.1% identity in 304 aa overlap). BELONGS TO THE 'PHAGE' INTEGRASE FAMILY. Protein product from Mb2918c detected using SWATH mass spectrometry. Mb2918c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67629" /db_xref="InterPro:IPR002104" /db_xref="InterPro:IPR004107" /db_xref="InterPro:IPR010998" /db_xref="InterPro:IPR011010" /db_xref="InterPro:IPR013762" /db_xref="InterPro:IPR023009" /db_xref="UniProtKB/Swiss-Prot:P67629" /protein_id="SIU01539.1" /translation="MQAILDEFDEYLALQCGRSVHTRRAYLGDLRSLFAFLADRGSSL DALTLSVLRSWLAATAGAGAARTTLARRTSAVKAFTAWAVRRGLLAGDPAARLQVPKA RRTLPAVLRQDQALRAMAAAESGAEQGDPLALRDRLIVELLYATGIRVSELCGLDVDD IDTGHRLVRVLGKGNKQRTVPFGQPAADALHAWLVDGRRALVTAESGHALLLGARGRR LDVRQARTAVHQTVAAVDGAPDMGPHGLRHSAATHLLEGGADLRVVQELLGHSSLATT QLYTHVAVARLRAVHERAHPRA" CDS complement(3165309..3166160) /codon_start=1 /transl_table=11 /gene="viuB" /locus_tag="BQ2027_MB2919C" /product="POSSIBLE MYCOBACTIN UTILIZATION PROTEIN VIUB" /note="Mb2919c, viuB, len: 283 aa. Equivalent to Rv2895c, len: 283 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 283 aa overlap). Possible viuB, mycobactin utilization protein, highly similar to Q9RJ78|SCI41.06 HYPOTHETICAL 31.5 KDA PROTEIN from Streptomyces coelicolor (280 aa), FASTA scores: opt: 639, E(): 5.1e-32, (46.3% identity in 285 aa overlap); and similar to other proteins e.g. Q9F641|MXCB protein of the biosynthetic gene cluster of the myxochelin-type iron chelator from Stigmatella aurantiaca (270 aa), FASTA scores: opt: 417, E(): 2.2e-18, (34.2% identity in 263 aa overlap); Q56646|VIUB_VIBCH|VC2210 VIBRIOBACTIN UTILIZATION PROTEIN from Vibrio cholerae (271 aa), FASTA scores: opt: 395, E(): 5.1e-17, (31.0% identity in 274 aa overlap); Q56743|VIUB_VIBVU VULNIBACTIN UTILIZATION PROTEIN V from Vibrio vulnificus (271 aa), FASTA scores: opt: 390, E(): 1e-16, (33.95% identity in 274 aa overlap); etc. Equivalent to AAK47289 from Mycobacterium tuberculosis strain CDC1551 (321 aa) but shorter 38 aa. Protein product from Mb2919c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2919c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65050" /db_xref="InterPro:IPR007037" /db_xref="InterPro:IPR013113" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR039261" /db_xref="InterPro:IPR039374" /db_xref="UniProtKB/Swiss-Prot:P65050" /protein_id="SIU01540.1" /translation="MAGRPLHAFEVVATRHLAPHMVRVVLGGSGFDTFVPSDFTDSYI KLVFVDDDVDVGRLPRPLTLDSFADLPTAKRPPVRTMTVRHVDAAAREIAVDIVLHGE HGVAGPWAAGAQRGQPIYLMGPGGAYAPDPAADWHLLAGDESAIPAIAAALEALPPDA IGRAFIEVAGPDDEIGLTAPDAVEVNWVYRGGRADLVPEDRAGDHAPLIEAVTTTAWL PGQVHVFIHGEAQAVMHNLRPYVRNERGVDAKWASSISGYWRRGRTEEMFRKWKKELA EAEAGTH" CDS complement(3166193..3167362) /codon_start=1 /transl_table=11 /gene="dprA" /locus_tag="BQ2027_MB2920C" /product="Rossmann fold nucleotide-binding protein Smf possibly involved in DNA uptake" /note="Mb2920c, -, len: 389 aa. Equivalent to Rv2896c, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 389 aa overlap). Conserved hypothetical protein, similar to others proteins e.g. Q9ZJ08|FIR2 from Rhodococcus fascians (293 aa), FASTA scores: opt: 663, E(): 3.3e-32, (43.7% identity in 286 aa overlap); O69892|SC2E1.21 HYPOTHETICAL 37.9 KDA PROTEIN from Streptomyces coelicolor (382 aa), FASTA scores: opt: 600, E(): 2.2e-28, (46.45% identity in 267 aa overlap); Q9JWZ4|DPRA|NMA0158 DPRA HOMOLOG from Neisseria meningitidis (serogroup A) (395 aa), FASTA scores: opt: 495, E(): 4.1e-22, (34.6% identity in 347 aa overlap); etc. Mb2920c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2I9" /db_xref="InterPro:IPR003488" /db_xref="InterPro:IPR041614" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2I9" /protein_id="SIU01541.1" /translation="MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQV GNELAQHTGARRGIDRAADDLELLMRRGGRLITPDDDEWPVLAFAAFSGAGARARPCG HSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAERDVAVVSGGAY GIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRIAQHGVLFTEYPPGVRPAR HRFLTRNRLVAAVARAAVVVEAGLRSGAANTAAWARALGRVVAAVPGPVTSSASAGCH TLLRHGAELVTRADDIVEFVGHIGELAGDEPRPGAALDVLSEAERQVYEALPGRGAAT IDEIAVGSGLLPAQVLGPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV" CDS complement(3167359..3168870) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2921C" /product="AAA+ ATPase superfamily protein YifB/ComM, associated with DNA recombination" /note="Mb2921c, -, len: 503 aa. Equivalent to Rv2897c, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 503 aa overlap). Conserved hypothetical protein, possibly Mg-chelatase, highly similar to hypothetical proteins and chelatases e.g. Q9RTV0|DR1656 MG(2+) CHELATASE FAMILY PROTEIN from Deinococcus radiodurans (519 aa), FASTA scores: opt: 1333, E(): 3.6e-68, (46.55% identity in 505 aa overlap);Q55372|SLR0904 HYPOTHETICAL 55.1 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (509 aa), FASTA scores: opt: 1271, E(): 1.2e-64, (42.65% identity in 504 aa overlap); Q9HTR4|PA5290 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (497 aa), FASTA scores: opt: 1248, E(): 2.3e-63, (45.9% identity in 503 aa overlap); Q9K0Z6|COMM|NMB0405 COMPETENCE PROTEIN (MG-CHELATASE) from Neisseria meningitidis (serogroup B), FASTA scores: opt: 1229, E(): 2.8e-62, (43.2% identity in 509 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="InterPro:IPR000523" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004482" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR025158" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P68908" /protein_id="SIU01542.1" /translation="MALGRAFSVAVRGLDGEIVEIEADITSGLPGVHLVGLPDAALQE SRDRVRAAVTNCGNSWPMARLTLALSPATLPKMGSVYDIALAAAVLSAQQKKPWERLE NTLLLGELSLDGRVRPVRGVLPAVLAAKRDGWPAVVVPADNLPEASLVDGIDVRGVRT LGQLQSWLRGSTGLAGRITTADTTPESAADLADVVGQSQARFAVEVAAAGAHHLMLTG PPGVGKTMLAQRLPGLLPSLSGSESLEVTAIHSVAGLLSGDTPLITRPPFVAPHHSSS VAALVGGGSGMARPGAVSRAHRGVLFLDECAEISLSALEALRTPLEDGEIRLARRDGV ACYPARFQLVLAANPCPCAPADPQDCICAAATKRRYLGKLSGPLLDRVDLRVQMHRLR AGAFSAADGESTSQVRQRVALAREAAAQRWRPHGFRTNAEVSGPLLRRKFRPSSAAML PLRTALDRGLLSIRGVDRTLRVAWSLADLAGRTSPGIDEVAAALSFRQTGARR" CDS complement(3168870..3169256) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2922C" /product="UPF0102 protein YraN" /note="Mb2922c, -, len: 128 aa. Equivalent to Rv2898c, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Conserved hypothetical protein, highly similar to O33024|YS98_MYCLE|ML1607|MLCB250.49 HYPOTHETICAL 11.0 KDA PROTEIN from Mycobacterium leprae (96 aa), FASTA scores: opt: 318, E(): 2.3e-16, (58.35% identity in 96 aa overlap). Also similar to other hypothetical proteins e.g. O69890|YE19_STRCO|SC2E1.19 from Streptomyces coelicolor (130 aa), FASTA scores: opt: 253, E(): 1.7e-11, (39.65% identity in 121 aa overlap); Q9HVZ1|PA4424 from Pseudomonas aeruginosa (125 aa), FASTA scores: opt: 234, E(): 4.2e-10, (40.85% identity in 115 aa overlap); O86871 from Streptomyces lividans (85 aa), FASTA scores: opt: 224, E(): 1.8e-09, (46.45% identity in 84 aa overlap); etc. Equivalent to AAK47292 from Mycobacterium tuberculosis strain CDC1551 (141 aa) but shorter 13 aa. Mb2922c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67231" /db_xref="InterPro:IPR003509" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR011856" /db_xref="UniProtKB/Swiss-Prot:P67231" /protein_id="SIU01543.1" /translation="MTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELD VIACDAATRTVVFVEVKTRTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWAAVR IDVIGVRVGPKNSGRTPELTHLQGIG" CDS complement(3169504..3170334) /codon_start=1 /transl_table=11 /gene="fdhD" /locus_tag="BQ2027_MB2923C" /product="possible fdhd protein homolog" /note="Mb2923c, fdhD, len: 276 aa. Equivalent to Rv2899c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 276 aa overlap). Possible fdhD protein, highly similar to other bacterial fdhd proteins e.g. Q9ZBW0|FDHD_STRCO|SC4B5.08c from Streptomyces coelicolor (282 aa), FASTA scores: opt: 1032, E(): 3.6e-59, (59.0% identity in 278 aa overlap); BAB59387|TVG0258796 from Thermoplasma volcanium (279 aa), FASTA scores: opt: 536, E(): 3.4e-27, (38.65% identity in 282 aa overlap); Q9HL17|FDHD_THEAC|TA0423 from Thermoplasma acidophilum (282 aa), FASTA scores: opt: 529, E(): 9.6e-27, (38.8% identity in 281 aa overlap); P32177|FDHD_ECOLI FDHD PROTEIN from Escherichia coli strain K12 (277 aa), FASTA scores: opt: 297, E(): 8.6e-12, (33.35% identity in 261 aa overlap); etc. BELONGS TO THE FDHD FAMILY. Mb2923c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64119" /db_xref="InterPro:IPR003786" /db_xref="InterPro:IPR016193" /db_xref="UniProtKB/Swiss-Prot:P64119" /protein_id="SIU01544.1" /translation="MGYATAHRRVRHLSADQVITRPETLAVEEPLEIRVNGTPVTVTM RTPGSDFELVQGFLLAEGVVAHREDVLTVSYCGRRVEGNATGASTYNVLDVALAPGVK PPDVDVTRTFYTTSSCGVCGKASLQAVSQVSRFAPGGDPATVAADTLKAMPDQLRRAQ KVFARTGGLHAAALFGVDGAMLAVREDIGRHNAVDKVIGWAFERDRIPLGASVLLVSG RASFELTQKALMAGIPVLAAVSAPSSLAVSLADASGITLVAFLRGDSMNVYTRADRIT " CDS complement(3170334..3172673) /codon_start=1 /transl_table=11 /gene="fdhF" /locus_tag="BQ2027_MB2924C" /product="POSSIBLE FORMATE DEHYDROGENASE H FDHF (FORMATE-HYDROGEN-LYASE-LINKED, SELENOCYSTEINE-CONTAINING POLYPEPTIDE) (FORMATE DEHYDROGENASE-H ALPHA SUBUNIT) (FDH-H)" /note="Mb2924c, fdhF, len: 779 aa. Equivalent to Rv2900c, len: 779 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 779 aa overlap). Possible fdhF, formate dehydrogenase (EC 1.2.1.2), highly similar to others formate dehydrogenases and prokaryotic molybdopterin-containing oxidoreductases e.g. Q9S2J9|SC7H2.18 PUTATIVE FORMATE DEHYDROGENASE from Streptomyces coelicolor (759 aa), FASTA scores: opt: 3038, E(): 2.7e-180, (59.7% identity in 767 aa overlap); Q9HU08|PA5181 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (773 aa), FASTA scores: opt: 2560, E(): 1.1e-150, (53.2% identity in 761 aa overlap); P78160 FORMATE DEHYDROGENASE A CHAIN (EC 1.2.1.2) (FRAGMENT) from Escherichia coli strain K12 (740 aa), FASTA scores: opt: 2002, E(): 3.7e-116, (43.1% identity in 733 aa overlap); P07658|FDHF_ECOLI|P78137|B4079 FORMATE DEHYDROGENASE from Escherichia coli strain K12 (715 aa), FASTA scores: opt: 305, E(): 5.6e-13, (25.5% identity in 748 aa overlap); etc. BELONGS TO THE PROKARYOTIC MOLYBDOPTERIN-CONTAINING OXIDOREDUCTASE FAMILY. Protein product from Mb2924c detected using SWATH mass spectrometry. Mb2924c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65409" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006657" /db_xref="InterPro:IPR009010" /db_xref="InterPro:IPR010046" /db_xref="InterPro:IPR037951" /db_xref="InterPro:IPR041953" /db_xref="UniProtKB/Swiss-Prot:P65409" /protein_id="SIU01545.1" /translation="MYVEAVRWQRSAASRDVLADYDEQAVTVAPRKREAAGVRAVMVS LQRGMQQMGALRTAAALARLNQRNGFDCPGCAWPEEPGGRKLAEFCENGAKAVAEEAT KRTVTAEFFARHSVAELSAKPEYWLSQQGRLAHPMVLRPGDDHYRPISWDAAYQLIAE QLNGLDSPDRAVFYTSGRTSNEAAFCYQLLVRSFGTNNLPDCSNMCHESSGAALTDSI GIGKGSVTIGDVEHADLIVIAGQNPGTNHPRMLSVLGKAKANGAKIIAVNPLPEAGLI RFKDPQKVNGVVGHGIPIADEFVQIRLGGDMALFAGLGRLLLEAEERVPGSVVDRSFV DNHCAGFDGYRRRTLQVGLDTVMDATGIELAQLQRVAAMLMASQRTVICWAMGLTQHA HAVATIGEVTNVLLLRGMIGKPGAGVCPVRGHSNVQGDRTMGIWEKMPEQFLAALDRE FGITSPRAHGFDTVAAIRAMRDGRVSVFMGMGGNFASATPDTAVTEAALRRCALTVQV STKLNRSHLVHGATALILPTLGRTDRDTRNGRKQLVSVEDSMSMVHLSRGSLHPPSDQ VRSEVQIICQLARALFGPGHPVPWERFADDYDTIRDAIAAVVPGCDDYNHKVRVPDGF QLPHPPRDAREFRTSTGKANFAVNPLQWVPVPPGRLVLQTLRSHDQYNTTIYGLDDRY RGVKGGRRVVFINPADIETFGLTAGDRVDLVSEWTDGQGGLQERRAKDFLVVAYSTPV GNAAAYYPETNPLVPLDHTAAQSNTPVSKAIIVRLEPTA" CDS complement(3172731..3173036) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2925C" /product="Protein often found in Actinomycetes clustered with signal peptidase and/or RNaseHII" /note="Mb2925c, -, len: 101 aa. Equivalent to Rv2901c, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Conserved hypothetical protein, very equivalent to O33023|ML1610|MLCB250.41 HYPOTHETICAL 12.3 KDA PROTEIN from Mycobacterium leprae (101 aa), FASTA scores: opt: 658, E(): 2.6e-43, (99.0% identity in 101 aa overlap). Also highly similar to O69889|SC2E1.18 HYPOTHETICAL PROTEIN from Streptomyces coelicolor and Streptomyces lividans (102 aa), FASTA scores: opt: 515, E(): 2.2e-32, (75.0% identity in 100 aa overlap). Protein product from Mb2925c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2925c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019592" /db_xref="UniProtKB/Swiss-Prot:P65052" /protein_id="SIU01546.1" /translation="MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSV EMVPRNTDGEVYFELRLADAWVWDMYRPARFVKQVRVVTFKDVNIEEVEKPELRLPE" CDS complement(3173090..3173884) /codon_start=1 /transl_table=11 /gene="rnhB" /locus_tag="BQ2027_MB2926C" /product="PROBABLE RIBONUCLEASE HII PROTEIN RNHB (RNASE HII)" /note="Mb2926c, rnhB, len: 264 aa. Equivalent to Rv2902c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 264 aa overlap). Probable rnhB, ribonuclease HII (EC 3.1.26.4), equivalent to O33022|RNH2_MYCLE|RNHB|ML1611|MLCB250.40 RIBONUCLEASE HII from Mycobacterium leprae (240 aa), FASTA scores: opt: 1242, E(): 6.9e-72, (76.75% identity in 245 aa overlap). Also similar (but longer ~20 aa) to others e.g. Q9HXY9|RNHB|PA3642 RIBONUCLEASE HII from Pseudomonas aeruginosa (201 aa), FASTA scores: opt: 572, E(): 3.1e-29, (52.7% identity in 184 aa overlap); Q9PEI7|RNH2_XYLFA|RNHB|XF1041 RIBONUCLEASE HII from Xylella fastidiosa (234 aa), FASTA scores: opt: 556, E(): 3.6e-28, (50.25% identity in 185 aa overlap); P10442|RNH2_ECOLI|RNHB|B0183 RIBONUCLEASE HII from Escherichia coli strain K-12 (213 aa), FASTA scores: opt: 519, E(): 7.4e-26, (48.65% identity in 183 aa overlap); etc. BELONGS TO THE RNASE HII FAMILY. COFACTOR: MANGANESE (BY SIMILARITY). Protein product from Mb2926c detected using SWATH mass spectrometry. Mb2926c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXM7" /db_xref="InterPro:IPR001352" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR022898" /db_xref="InterPro:IPR024567" /db_xref="InterPro:IPR036397" /db_xref="UniProtKB/Swiss-Prot:Q7TXM7" /protein_id="SIU01547.1" /translation="MTKTWPPRTVIRKSGGLRGMRTLESALHRGGLGPVAGVDEVGRG ACAGPLVVAACVLGPGRIASLAALDDSKKLSEQAREKLFPLICRYAVAYHVVFIPSAE VDRHGVHVANIEGMRRAVAGLAVRPGYVLSDGFRVPGLPMPSLPVIGGDAAAACIAAA SVLAKVSRDRVMVALDADHPGYGFAEHKGYSTPAHSRALARLGPCPQHRYSFINVRRV ASGSNTAEVADGQPDPRDGTAQTGEGRWSKSSHPATMRATGRAQGT" CDS complement(3173898..3174782) /codon_start=1 /transl_table=11 /gene="lepB" /locus_tag="BQ2027_MB2927C" /product="PROBABLE SIGNAL PEPTIDASE I LEPB (SPASE I) (LEADER PEPTIDASE I)." /note="Mb2927c, lepB, len: 294 aa. Equivalent to Rv2903c, len: 294 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 294 aa overlap). Probable lepB, signal peptidase I (EC 3.4.21.89) (TYPE II MEMBRANE PROTEIN) (see first citation below), equivalent to O33021|LEP_MYCLE|ML1612|MLCB250.39 PROBABLE SIGNAL PEPTIDASE I from Mycobacterium leprae (289 aa), FASTA scores: opt: 1335, E(): 1.8e-77, (69.75% identity in 301 aa overlap). Also similar to many e.g. O86869|SIPX SIGNAL PEPTIDASE I from Streptomyces lividans (320 aa), FASTA scores: opt: 474, E(): 1e-22, (43.55% identity in 248 aa overlap); O69884|SIP1|SIPW PUTATIVE SIGNAL PEPTIDASE I from Streptomyces coelicolor and Streptomyces lividans (259 aa), FASTA scores: opt: 226, E(): 5e-07, (36.0% identity in 214 aa overlap); P42668|LEP_BACLI|SIP SIGNAL PEPTIDASE I from Bacillus licheniformis (186 aa), FASTA scores: opt: 218, E(): 1.3e-06, (34.5% identity in 194 aa overlap); etc. Contains PS00501 Signal peptidases I serine active site,and PS00761 Signal peptidases I signature 3. BELONGS TO PEPTIDASE FAMILY S26; ALSO KNOWN AS TYPE I LEADER PEPTIDASE FAMILY. Protein product from Mb2927c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2927c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y2" /db_xref="InterPro:IPR000223" /db_xref="InterPro:IPR015927" /db_xref="InterPro:IPR019533" /db_xref="InterPro:IPR019756" /db_xref="InterPro:IPR019758" /db_xref="InterPro:IPR036286" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01548.1" /translation="MTETTDSPSERQPGPAEPELSSRDPDIAGQVFDAAPFDAAPDAD SEGDSKAAKTDEPRPAKRSTLREFAVLAVIAVVLYYVMLTFVARPYLIPSESMEPTLH GCSTCVGDRIMVDKLSYRFGSPQPGDVIVFRGPPSWNVGYKSIRSHNVAVRWVQNALS FIGFVPPDENDLVKRVIAVGGQTVQCRSDTGLTVNGRPLKEPYLDPATMMADPSIYPC LGSEFGPVTVPPGRVWVMGDNRTHSADSRAHCPLLCTNDPLPGTVPVANVIGKARLIV WPPSRWGVVRSVNPQQGR" CDS complement(3174840..3175181) /codon_start=1 /transl_table=11 /gene="rplS" /locus_tag="BQ2027_MB2928C" /product="50s ribosomal protein l19 rpls" /note="Mb2928c, rplS, len: 113 aa. Equivalent to Rv2904c, len: 113 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 113 aa overlap). Probable rplS, 50S ribosomal protein L19, equivalent to O33020|RL19_MYCLE 50S RIBOSOMAL PROTEIN L19 from Mycobacterium leprae (113 aa), FASTA scores: opt: 702, E(): 1.4e-45, (93.8% identity in 113 aa overlap). Also highly similar to others e.g. O69883|RL19_STRCO from Streptomyces coelicolor (116 aa), FASTA scores: opt: 571, E(): 9.5e-36, (77.25% identity in 110 aa overlap); O31742|RL19_BACSU from Bacillus subtilis (115 aa), FASTA scores: opt: 523, E(): 3.8e-32, (72.9% identity in 107 aa overlap); RL19_BACST|P30529 from Bacillus stearothermophilus (116 aa), FASTA scores: opt: 518, E(): 9.1e-32, (71.7% identity in 106 aa overlap); etc. BELONGS TO THE L19P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2928c detected using shotgun mass spectrometry. Mb2928c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66081" /db_xref="InterPro:IPR001857" /db_xref="InterPro:IPR008991" /db_xref="InterPro:IPR018257" /db_xref="InterPro:IPR038657" /db_xref="UniProtKB/Swiss-Prot:P66081" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01549.1" /translation="MNRLDFVDKPSLRDDIPAFNPGDTINVHVKVIEGAKERLQVFKG VVIRRQGGGIRETFTVRKESYGVGVERTFPVHSPNIDHIEVVTRGDVRRAKLYYLREL RGKKAKIKEKR" CDS 3175556..3176500 /codon_start=1 /transl_table=11 /gene="lppW" /locus_tag="BQ2027_MB2929" /product="PROBABLE CONSERVED ALANINE RICH LIPOPROTEIN LPPW" /note="Mb2929, lppW, len: 314 aa. Equivalent to Rv2905, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Probable lppW, conserved ala-rich lipoprotein, with slight similarity to beta-lactamases and hypothetical proteins e.g. Q9S1P7|SCJ9A.23 HYPOTHETICAL 36.3 KDA PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 222, E(): 2.8e-06, (25.5% identity in 298 aa overlap); O69914|SC3C8.01 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (302 aa), FASTA scores: opt: 201, E(): 5.1e-05, (24.9% identity in 257 aa overlap); P14559|BLAC_STRAL BETA-LACTAMASE PRECURSOR from Streptomyces albus G (314 aa), FASTA scores: opt: 113, E(): 3.3, (25.2% identity in 278 aa overlap); etc. Has signal peptide and appropriately positioned prokaryotic lipoprotein lipid attachment site: ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR (POTENTIAL). Protein product from Mb2929 detected using SWATH mass spectrometry. Mb2929 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65305" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/Swiss-Prot:P65305" /protein_id="SIU01550.1" /translation="MRARPLTLLTALAAVTLVVVAGCEARVEAEAYSAADRISSRPQA RPQPQPVELLLRAITPPRAPAASPNVGFGELPTRVRQATDEAAAMGATLSVAVLDRAT GQLVSNGNTQIIATASVAKLFIADDLLLAEAEGKVTLSPEDHHALDVMLQSSDDGAAE RFWSQDGGNAVVTQVARRYGLRSTAPPSDGRWWNTISSAPDLIRYYDMLLDGSGGLPL DRAAVIIADLAQSTPTGIDGYPQRFGIPDGLYAEPVAVKQGWMCCIGSSWMHLSTGVI GPERRYIMVIESLQPADDATARATITQAVRTMFPNGRI" CDS complement(3176593..3177285) /codon_start=1 /transl_table=11 /gene="trmD" /locus_tag="BQ2027_MB2930C" /product="PROBABLE TRNA (GUANINE-N1)-METHYLTRANSFERASE TRMD (M1G-METHYLTRANSFERASE) (TRNA [GM37] METHYLTRANSFERASE)" /note="Mb2930c, trmD, len: 230 aa. Equivalent to Rv2906c, len: 230 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 230 aa overlap). Probable trmD, tRNA m1G methyltransferase (EC 2.1.1.31), equivalent to O33017|TRMD_MYCLE from Mycobacterium leprae (238 aa), FASTA scores: opt: 1363, E(): 8.1e-86, (87.2% identity in 227 aa overlap). Also highly similar to others e.g. O69882|TRMD_STRCO from Streptomyces coelicolor and S. lividans (277 aa), FASTA scores: opt: 841, E(): 4.5e-50, (55.55% identity in 234 aa overlap); Q9A0B6 from Streptococcus pyogenes (243 aa), FASTA scores: opt: 698, E(): 2.5e-40, (47.6% identity in 227 aa overlap); P07020|TRMD_ECOLI|TRMD|B2607|Z3901|ECS3470 from Escherichia coli strain O157:H7 (255 aa), FASTA scores: opt: 573, E(): 3.8e-33, (42.1% identity in 228 aa overlap); etc. BELONGS TO THE RNA METHYLTRANSFERASE TRMD FAMILY. Protein product from Mb2930c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2930c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66969" /db_xref="InterPro:IPR002649" /db_xref="InterPro:IPR016009" /db_xref="InterPro:IPR023148" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="UniProtKB/Swiss-Prot:P66969" /protein_id="SIU01551.1" /translation="MRIDIVTIFPACLDPLRQSLPGKAIESGLVDLNVHDLRRWTHDV HHSVDDAPYGGGPGMVMKAPVWGEALDEICSSETLLIVPTPAGVLFTQATAQRWTTES HLVFACGRYEGIDQRVVQDAARRMRVEEVSIGDYVLPGGESAAVVMVEAVLRLLAGVL GNPASHQDDSHSTGLDGLLEGPSYTRPASWRGLDVPEVLLSGDHARIAAWRREVSLQR TRERRPDLSHPD" CDS complement(3177289..3177819) /codon_start=1 /transl_table=11 /gene="rimM" /locus_tag="BQ2027_MB2931C" /product="PROBABLE 16S RRNA PROCESSING PROTEIN RIMM" /note="Mb2931c, rimM, len: 176 aa. Equivalent to Rv2907c, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Probable rimM, 16S rRNA processing protein, equivalent to O33016|RIMM_MYCLE PROBABLE 16S RRNA PROCESSING protein from Mycobacterium leprae (179 aa), FASTA scores: opt: 797, E(): 2.4e-46, (73.15% identity in 175 aa overlap). Also highly similar to others e.g. O69881|RIMM_STRCO from Streptomyces coelicolor (188 aa), FASTA scores: opt: 485, E(): 2.3e-25, (48.85% identity in 176 aa overlap); Q9KA14|RIMM_BACHD from Bacillus halodurans (173 aa), FASTA scores: opt: 289, E(): 3.2e-12, (30.65% identity in 173 aa overlap); P21504|RIMM_ECOLI|RIMM|B2608 from Escherichia coli strain K12 (182 aa), FASTA scores: opt: 237, E(): 1e-08, (29.4% identity in 177 aa overlap). BELONGS TO THE RIMM FAMILY. Mb2931c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66654" /db_xref="InterPro:IPR002676" /db_xref="InterPro:IPR009000" /db_xref="InterPro:IPR011033" /db_xref="InterPro:IPR011961" /db_xref="InterPro:IPR027275" /db_xref="InterPro:IPR036976" /db_xref="UniProtKB/Swiss-Prot:P66654" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01552.1" /translation="MELVVGRVVKSHGVTGEVVVEIRTDDPADRFAPGTRLRAKGPFD GGAEGSAVSYVIESVRQHGGRLLVRLAGVADRDAADALRGSLFVIDADDLPPIDEPDT YYDHQLVGLMVQTATGEGVGVVTEVVHTAAGELLAVKRDSDEVLVPFVRAIVTSVSLD DGIVEIDPPHGLLNLE" CDS complement(3177833..3178075) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2932C" /product="KH domain RNA binding protein YlqC" /note="Mb2932c, -, len: 80 aa. Equivalent to Rv2908c, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 80 aa overlap). Conserved hypothetical protein, equivalent to O33015|YT08_MYCLE from Mycobacterium leprae (80 aa), FASTA scores: opt: 492, E(): 3.1e-29, (93.75% identity in 80 aa overlap). Also highly similar to others e.g. O69880|YE09_STRCO from Streptomyces coelicolor (79 aa), FASTA scores: opt: 356, E(): 3e-19, (71.6% identity in 74 aa overlap); Q9KA12|BH2482 PROTEIN from Bacillus halodurans (76 aa), FASTA scores: opt: 220, E(): 2.9e-09, (48.6% identity in 72 aa overlap); O31738|YLQC_BACSU HYPOTHETICAL 9.1 KDA PROTEIN from Bacillus subtilis (81 aa), FASTA scores: opt: 172, E(): 1e-05, (39.2% identity in 74 aa overlap); etc. BELONGS TO THE UPF0109 FAMILY. Protein product from Mb2932c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2932c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67237" /db_xref="InterPro:IPR009019" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR020627" /db_xref="UniProtKB/Swiss-Prot:P67237" /protein_id="SIU01553.1" /translation="MSAVVVDAVEHLVRGIVDNPDDVRVDLITSRRGRTVEVHVHPDD LGKVIGRGGRTATALRTLVAGIGGRGIRVDVVDTDQ" CDS complement(3178083..3178571) /codon_start=1 /transl_table=11 /gene="rpsP" /locus_tag="BQ2027_MB2933C" /product="30s ribosomal protein s16 rpsp" /note="Mb2933c, rpsP, len: 162 aa. Equivalent to Rv2909c, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Probable rpsP, 30S ribosomal protein S16, equivalent to O33014|RS16_MYCLE 30S RIBOSOMAL PROTEIN S16 from Mycobacterium leprae (160 aa), FASTA scores: opt: 828, E(): 1.6e-39, (82.5% identity in 160 aa overlap). Also highly similar to others e.g. O69879|RS16_STRCO 30S RIBOSOMAL PROTEIN S16 from Streptomyces coelicolor (139 aa), FASTA scores: opt: 486, E(): 1.9e-20, (56.95% identity in 144 aa overlap); P80379|RS16_THETH 30S RIBOSOMAL PROTEIN S16 from Thermus Thermophilus (88 aa), FASTA scores: opt: 280, E(): 4.8e-09, (53.25% identity in 77 aa overlap) (C-terminus shorter); P21474|RS16_BACSU|RPSP 30S RIBOSOMAL PROTEIN S16 (BS17) from Bacillus subtilis (89 aa,), FASTA scores: opt: 258, E(): 8.2e-08, (42.85% identity in 91 aa overlap) (C-terminus shorter); etc. BELONGS TO THE S16P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb2933c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2933c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66436" /db_xref="InterPro:IPR000307" /db_xref="InterPro:IPR020592" /db_xref="InterPro:IPR023803" /db_xref="UniProtKB/Swiss-Prot:P66436" /protein_id="SIU01554.1" /translation="MAVKIKLTRLGKIRNPQYRVAVADARTRRDGRAIEVIGRYHPKE EPSLIEINSERAQYWLSVGAQPTEPVLKLLKITGDWQKFKGLPGAQGRLKVAAPKPSK LEVFNAALAAADGGPTTEATKPKKKSPAKKAAKAAEPAPQPEQPDTPALGGEQAELTA ES" CDS complement(3178755..3179198) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2934C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2934c, -, len: 147 aa. Equivalent to Rv2910c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Conserved hypothetical protein, showing some similarity with hypothetical proteins from other organisms e.g. Q9JN76|MMYY HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 164, E(): 0.00026, (35.05% identity in 129 aa overlap); etc. Also some similarity with protein from Mycobacterium tuberculosis e.g. O07237|Rv0310c|MTCY63.15c (163 aa), FASTA scores: opt: 165, E(): 0.00023, (26.3% identity in 137 aa overlap); P96815|Rv0138|MTCI5.12 (167 aa), FASTA scores: opt: 132, E(): 0.048, (30.25% identity in 109 aa overlap); etc. Mb2934c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/Swiss-Prot:P65054" /protein_id="SIU01555.1" /translation="MCAVLDRSMLSVAEISDRLEIQQLLVDYSSAIDQRRFDDLDRVF TPDAYIDYRALGGIDGRYPKIKQWLSQVLGNFPVYAHMLGNFSVRVDGDTASSRVICF NPMVFAGDRQQVLFCGLWYDDDFVRTPDGWRIIRRVETKCFQKMM" CDS 3179267..3180142 /codon_start=1 /transl_table=11 /gene="dacB2" /locus_tag="BQ2027_MB2935" /product="probable penicillin-binding protein dacb2 (d-alanyl-d-alanine carboxypeptidase) (dd-peptidase) (dd-carboxypeptidase) (pbp) (dd-transpeptidase) (serine-type d-ala-d-ala carboxypeptidase) (d-amino acid hydrolase)" /note="Mb2935, dacB2, len: 291 aa. Equivalent to Rv2911, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Probable dacB2, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein) (EC 3.4.16.4), an ala-rich protein. Highly similar (except in N-terminus) to Q9CCM2|ML0691 PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 749, E(): 9.3e-39, (46.75% identity in 276 aa overlap). Also similar to penicillin binding proteins / D-alanyl-D-alanine carboxypeptidases e.g. Q9KCJ8|SC4G1.16c D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Streptomyces coelicolor (382 aa), FASTA scores: opt: 386, E(): 2.1e-16, (31.25% identity in 285 aa overlap); P35150|DACB_BACSU PENICILLIN-BINDING PROTEIN 5* PRECURSOR from Bacillus subtilis (382 aa), FASTA scores: opt: 384, E(): 3.6e-17, (30.7% identity in 244 aa overlap); Q9K8X5|DACB|BH2877 D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (PENICILLIN-BINDING PROTEIN 5) from Bacillus halodurans (395 aa), FASTA scores: opt: 359, E(): 9.7e-15, (30.3% identity in 241 aa overlap); P33364|PBP7_ECOLI|PBPG|B2134 penicillin-binding protein 7 precursor from Escherichia coli strain K12 (313 aa), FASTA scores: opt: 273, E(): 7.5e-10, (27.8% identity in 263 aa overlap); etc. Also similar to O53380|Rv3330|MTV016.30 PENICILLIN-BINDING PROTEIN from Mycobacterium tuberculosis (405 aa), FASTA scores: opt: 746, E(): 1.4e-38, (47.0% identity in 266 aa overlap). Seems to contain PF00768 Peptidase_S11 domain PFAM. BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY. Thought to be a membrane-bound protein. Note that previously known as dacB. Protein product from Mb2935 detected using SWATH mass spectrometry. Mb2935 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3C8" /db_xref="InterPro:IPR001967" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR018044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C8" /protein_id="SIU01556.1" /translation="MRKLMTATAALCACAVTVSAGAAWADADVQPAGSVPIPDGPAQT WIVADLDSGQVLAGRDQNVAHPPASTIKVLLALVALDELDLNSTVVADVADTQAECNC VGVKPGRSYTARQLLDGLLLVSGNDAANTLAHMLGGQDVTVAKMNAKAATLGATSTHA TTPSGLDGPGGSGASTAHDLVVIFRAAMANPVFAQITAEPSAMFPSDNGEQLIVNQDE LLQRYPGAIGGKTGYTNAARKTFVGAAARGGRRLVIAMMYGLVKEGGPTYWDQAATLF DWGFALNPQASVGSL" CDS complement(3180202..3180789) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2936C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb2936c, -, len: 195 aa. Equivalent to Rv2912c, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 195 aa overlap). Probable transcription regulatory protein, tetR family, showing similarity with others e.g. Q9K3V9|SCD10.17 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (202 aa), FASTA scores: opt: 185, E(): 4.4e-05, (31.15% identity in 167 aa overlap); Q9KFQ0 TETR-FAMILY from Bacillus halodurans (185 aa), FASTA scores: opt: 164, E(): 0.001, (35.6% identity in 73 aa overlap); P17446|BETI_ECOLI|BETI|B0313 regulatory protein from Escherichia coli strain K12 (195 aa), FASTA scores: opt: 126, E(): 0.024, (24.5% identity in 196 aa overlap); etc. Contains possible helix-turn-helix motif at aa 33-54 (+2.71 SD). POSSIBLY BELONGS TO THE TETR/ACRR FAMILY. Mb2936c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P67441" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="UniProtKB/Swiss-Prot:P67441" /protein_id="SIU01557.1" /translation="MARTQQQRREETVARLLQASIDTIIEVGYARASAAVITKRAGVS VGALFRHFETMGDFMAATAYEVLRRQLETFTKQVAEIPADRPALPAALTILRDITAGS TNAVLYELMVAARTDEKLKETLQNVLGQYSAKIHDAARALPGAESFPEETFPVIVALM TNVFDGAAIVRGVLPQPELEEQRIPMLTALLTAGL" CDS complement(3180791..3182626) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2937C" /product="possible d-amino acid aminohydrolase (d-amino acid hydrolase)" /note="Mb2937c, -, len: 611 aa. Equivalent to Rv2913c, len: 611 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 611 aa overlap). Possible D-amino acid aminohydrolase (EC 3.5.1.-), similar (principally in N-terminus) to D-amino acid aminohydrolases e.g. Q9V2D3|NDAD|PAB0090 D-AMINOACYLASE (ASPARTATE, GLUTAMATE ETC) from Pyrococcus abyssi (526 aa), FASTA scores: opt: 336, E(): 2.2e-13, (27.55% identity in 581 aa overlap); P94212|NDDD_ALCXX N-ACYL-D-ASPARTATE DEACYLASE (EC 3.5.1.83) (N-ACYL-D-ASPARTATE AMIDOHYDROLASE) from Alcaligenes xylosoxydans xylosoxydans (Achromobacter xylosoxidans) (498 aa), FASTA scores: opt: 221, E(): 3.4e-06, (25.95% identity in 532 aa overlap); Q9AGH8 D-AMINOACYLASE (EC 3.5.1.81) from Alcaligenes faecalis (484 aa), FASTA scores: opt: 218, E(): 5.1e-06, (28.35% identity in 434 aa overlap); etc. Protein product from Mb2937c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2937c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65531" /db_xref="InterPro:IPR011059" /db_xref="InterPro:IPR013108" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:P65531" /protein_id="SIU01558.1" /translation="MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGV VATVAAGALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTTVL LGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYIEAIDALPLGP NVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLLDEALEAGMLGMSGMDAAI DKLDGDRFRSRALPSTFATWRERRKLISVLRHRGRILQSAPDVDNPVSALLFFLASSR IFNRRKGVRMSMLVSADAKSMPLAVHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDG IDLPVFEEFGAGTAALHLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDA VIVECPDKSLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERAVYRLTGEL AEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYYGGLRRMVNRNDATVVA TGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRAGELGAALSRSA" CDS complement(3182695..3184452) /codon_start=1 /transl_table=11 /gene="pknI" /locus_tag="BQ2027_MB2938C" /product="PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE I PKNI (PROTEIN KINASE I) (STPK I) (PHOSPHORYLASE B KINASE KINASE) (HYDROXYALKYL-PROTEIN KINASE)" /note="Mb2938c, pknI, len: 585 aa. Equivalent to Rv2914c, len: 585 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 585 aa overlap). Probable pknI, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), ala-rich protein, highly similar to many in Mycobacterium tuberculosis and other bacteria e.g. Q9RLQ7|MBK PUTATIVE SERINE/THREONINE PROTEIN KINASE from Mycobacterium bovis BCG (291 aa), FASTA scores: opt: 376, E(): 1.1e-10, (36.95% identity in 287 aa overlap); P33973|PKN1_MYXXA serine/threonine-protein kinase from Myxococcus xanthus (693 aa), FASTA scores: opt: 286, E(): 5.4e-10, (29.9% identity in 374 aa overlap); P72003|PKNF_MYCTU|Rv1746|MT1788|MTCY28.09 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis (476 aa), FASTA scores: opt: 675, E(): 1.7e-24, (39.75% identity in 468 aa overlap); Q10697|PKNJ_MYCTU|Rv2088|MT2149|MTCY49.28 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis (589 aa), FASTA scores: opt: 574, E(): 1e-19, (34.85% identity in 479 aa overlap); etc. Equivalent to AAK47308 from Mycobacterium tuberculosis strain CDC1551 (603 aa) but shorter 18 aa. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Protein product from Mb2938c detected using SWATH mass spectrometry. Mb2938c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65731" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/Swiss-Prot:P65731" /protein_id="SIU01559.1" /translation="MALASGVTFAGYTVVRMLGCSAMGEVYLVQHPGFPGWQALKVLS PAMAADDEFRRRFQRETEVAARLFHPHILEVHDRGEFDGQLWIAMDYVDGIDATQHMA DRFPAVLPVGEVLAIVTAVAGALDYAHQRGLLHRDVNPANVVLTSQSAGDQRILLADF GIASQPSYPAPELSAGADVDGRADQYALALTAIHLFAGAPPVDRSHTGPLQPPKLSAF RPDLARLDGVLSRALATAPADRFGSCREFADAMNEQAGVAIADQSSGGVDASEVTAAA GEEAYVVDYPAYGWPEAVDCKEPSARAPAPAAPTPQRRGSMLQSAAGVLARRLDNFST ATKAPASPTRRRPRRILVGAVAVLLLAGLFAVGIVIGRKTNTTATEVARPPTSGSAVP SAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLA AATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLA LRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPD TTSTATLTPPTTTAPGPGR" CDS complement(3184496..3185608) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2939C" /product="Xaa-Pro dipeptidase (EC" /EC_number="3.4.13.9" /note="Mb2939c, -, len: 370 aa. Equivalent to Rv2915c, len: 370 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 370 aa overlap). Conserved hypothetical protein, posssibly XAA-PRO dipeptidase (prolidase) (EC 3.4.13.9), highly similar to CAC38796|SCI39.08c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (363 aa), FASTA scores: opt: 1341, E(): 5.5e-76, (56.65% identity in 362 aa overlap); and similar to prolidases (XAA-PRO dipeptidase) e.g. Q9ABC9|CC0300 PUTATIVE XAA-PRO DIPEPTIDASE from Caulobacter crescentus (428 aa), FASTA scores: opt: 327, E(): 7.4e-13, (30.2% identity in 374 aa overlap); Q97XD4 PROLIDASE from Sulfolobus solfataricus (396 aa), FASTA scores: opt: 271, E(): 2.1e-09, (30.5% identity in 354 aa overlap); Q9WX55 PROLIDASE from Microbacterium esteraromaticum (393 aa), FASTA scores: opt: 256, E(): 1.8e-08, (27.95% identity in 365 aa overlap); etc. Also similar to O53619|Rv0074|MTV030.18 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (411 aa), FASTA scores: opt: 243, E(): 1.2e-07, (27.5% identity in 389 aa overlap). Protein product from Mb2939c detected using SWATH mass spectrometry. Mb2939c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P68916" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:P68916" /protein_id="SIU01560.1" /translation="MKRVDTIRPRSRAVRLHVRGLGLPDETAIQLWIVDGRISTEPVA GADTVFDGGWILPGLVDAHCHVGLGKHGNVELDEAIAQAETERDVGALLLRDCGSPTD TRGLDDHEDLPRIIRAGRHLARPKRYIAGFAVELEDESQLPAAVAEQARRGDGWVKLV GDWIDRQIGDLAPLWSDDVLKAAIDTAHAQGARVTAHVFSEDALPGLINAGIDCIEHG TGLTDDTIALMLEHGTALVPTLINLENFPGIADAAGRYPTYAAHMRDLYARGYGRVAA AREAGVPVYAGTDAGSTIEHGRIADEVAALQRIGMTAHEALGAACWDARRWLGRPGLD DRASADLLCYAQDPRQGPGVLQHPDLVILRGRTFGP" CDS complement(3185636..3187213) /codon_start=1 /transl_table=11 /gene="ffh" /locus_tag="BQ2027_MB2940C" /product="probable signal recognition particle protein ffh (fifty-four homolog) (srp protein)" /note="Mb2940c, ffh, len: 525 aa. Equivalent to Rv2916c, len: 525 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 525 aa overlap). Probable ffh, signal recognition particle protein (ala-, gly-, leu-rich protein), equivalent to O33013|SR54_MYCLE SIGNAL RECOGNITION PARTICLE from Mycobacterium leprae (521 aa), FASTA scores: opt: 2968, E(): 1.6e-145, (87.85% identity in 526 aa overlap). Also highly similar to others e.g. O69874|FFH from Streptomyces coelicolor (550 aa), FASTA scores: opt: 2025, E(): 6e-97, (63.8% identity in 519 aa overlap) (N-terminus longer 34 aa); P37105|SR54_BACSU from Bacillus subtilis (446 aa), FASTA scores: opt: 1451, E(): 1.9e-67, (51.5% identity in 435 aa overlap); BAB57399|FFH from Staphylococcus aureus subsp. aureus Mu50 (455 aa), FASTA scores: opt: 1418, E(): 9.4e-66, (48.65% identity in 448 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SRP FAMILY OF GTP-BINDING PROTEINS. NOTE THAT SIGNAL RECOGNITION PARTICLE CONSISTS OF A SMALL CYTOPLASMIC RNA (SC-RNA) MOLECULE AND PROTEIN FFH. THE PROTEIN HAS A TWO DOMAIN STRUCTURE: THE G-DOMAIN BINDS GTP; THE M-DOMAIN BINDS THE RNA AND ALSO BINDS THE SIGNAL SEQUENCE. Protein product from Mb2940c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2940c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66845" /db_xref="InterPro:IPR000897" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004125" /db_xref="InterPro:IPR004780" /db_xref="InterPro:IPR013822" /db_xref="InterPro:IPR022941" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036891" /db_xref="InterPro:IPR042101" /db_xref="UniProtKB/Swiss-Prot:P66845" /protein_id="SIU01561.1" /translation="MFESLSDRLTAALQGLRGKGRLTDADIDATTREIRLALLEADVS LPVVRAFIHRIKERARGAEVSSALNPAQQVVKIVNEELISILGGETRELAFAKTPPTV VMLAGLQGSGKTTLAGKLAARLRGQGHTPLLVACDLQRPAAVNQLQVVGERAGVPVFA PHPGASPESGPGDPVAVAAAGLAEARAKHFDVVIVDTAGRLGIDEELMAQAAAIRDAI NPDEVLFVLDAMIGQDAVTTAAAFGEGVGFTGVALTKLDGDARGGAALSVREVTGVPI LFASTGEKLEDFDVFHPDRMASRILGMGDVLSLIEQAEQVFDAQQAEEAAAKIGAGEL TLEDFLEQMLAVRKMGPIGNLLGMLPGAAQMKDALAEVDDKQLDRVQAIIRGMTPQER ADPKIINASRRLRIANGSGVTVSEVNQLVERFFEARKMMSSMLGGMGIPGIGRKSATR KSKGAKGKSGKKSKKGTRGPTPPKVKSPFGVPGMPGLAGLPGGLPDLSQMPKGLDELP PGLADFDLSKLKFPGKK" CDS 3187291..3189171 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2941" /product="DNA or RNA helicase of superfamily protein II" /note="Mb2941, -, len: 626 aa. Equivalent to Rv2917, len: 626 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 626 aa overlap). Conserved hypothetical ala-, arg-rich protein, highly similar (but longer 34 aa) to O33011|ML1624|MLCB250.18C HYPOTHETICAL 65.2 KDA PROTEIN from Mycobacterium leprae (596 aa), FASTA scores: opt: 3117, E(): 9e-183, (79.8% identity in 584 aa overlap). Also highly similar to Q9S2E8|SCE19A.36C HYPOTHETICAL 66.2 KDA PROTEIN from Streptomyces coelicolor (598 aa), FASTA scores: opt: 1921, E(): 1.1e-109, (56.08% identity in 567 aa overlap); and Q9S3Y6|SDRA SDRA PROTEIN from Streptomyces coelicolor (597 aa), FASTA scores: opt: 1896, E(): 3.6e-108, (55.75% identity in 567 aa overlap). And shows some similarity with others proteins from other organisms. Equivalent to AAK47311 putative RNA helicase from Mycobacterium tuberculosis strain CDC1551 (602 aa) but longer 24 aa. Contains PS00017 ATP/GTP-binding site motif (P-loop)." /db_xref="GOA:A0A1R3Y2J6" /db_xref="InterPro:IPR006935" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2J6" /protein_id="SIU01562.1" /translation="MRVTRLVDAESTRCDVGPAPKSVAMLHFTAATSRFRLGRERANS VRSDGGWGVLQPVSATFNPPLRGWQRRALVQYLGTQPRDFLAVATPGSGKTSFALRIA AELLRYHTVEQVTVVVPTEHLKVQWAHAAAAHGLSLDPKFANSNPQTSPEYHGVMVTY AQVASHPTLHRVRTEARKTLVVFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFR SDDSPIPFVSYQPDADGVLRSQADHTYGYAEALADGVVRPVVFLAYSGQARWRDSAGE EYEARLGEPLSAEQTARAWRTALDPEGEWMPAVITAADRRLRQLRAHVPDAGGMIIAS DRTTARAYARLLTTMTAEEPTVVLSDDPGSSARITEFAQGTSRWLVAVRMVSEGVDVP RLSVGVYATNASTPLFFAQAIGRFVRSRRPGETASIFVPSVPNLLQLASALEVQRNHV LGRPHRESAHDPLDGDPATRTQTERGGAERGFTALGADAELDQVIFDGSSFGTATPTG SDEEADYLGIPGLLDAEQMRALLHRRQDEQLRKRAQLQKGATQPATSGASASVHGQLR DLRRELHTLVSIAHHRTGKPHGWIHDELRRRCGGPPIAAATRAQIKARIDALRQLNSE RS" CDS complement(3189182..3191608) /codon_start=1 /transl_table=11 /gene="glnD" /locus_tag="BQ2027_MB2942C" /product="PROBABLE [PROTEIN-PII] URIDYLYLTRANSFERASE GLND (PII URIDYLYL-TRANSFERASE) (URIDYLYL REMOVING ENZYME) (UTASE)" /note="Mb2942c, glnD, len: 808 aa. Equivalent to Rv2918c, len: 808 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 808 aa overlap). Probable glnD, uridylyltransferase (ala-rich protein) (EC 2.7.7.59), similar to other uridylyltransferases e.g. O69873||SC2E1.02 from Streptomyces coelicolor (835 aa), FASTA scores: opt: 1473, E(): 2.8e-81, (41.03% identity in 858 aa overlap); P43919|GLND_HAEIN from Haemophilus influenzae (863 aa), FASTA scores: opt: 333, E(): 2.5e-12, (25.4% identity in 819 aa overlap); P27249|GLND_ECOLI|GLND|B0167 from Escherichia coli strain K12 (890 aa), FASTA scores: opt: 306, E(): 1.1e-10, (27.75% identity in 858 aa overlap); etc. BELONGS TO THE GLND FAMILY. Mb2942c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2L8" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR003607" /db_xref="InterPro:IPR006674" /db_xref="InterPro:IPR010043" /db_xref="InterPro:IPR013546" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2L8" /protein_id="SIU01563.1" /translation="MEAESPCAASDLAVARRELLSGNHRELDPVGLRQTWLDLHESWL IDKADEIGIADASGFAIVGVGGLGRRELLPYSDLDVLLLHDGKPADILRPVADRLWYP LWDANIRLDHSVRTVSEALTIANSDLMAALGMLEARHIAGDQQLSFALIDGVRRQWRN GIRSRMGELVEMTYARWRRCGRIAQRAEPDLKLGRGGLRDVQLLDALALAQLIDRHGI GHTDRPAGSLDGAYRTLLDVRTELHRVSGRGRDHLLAQFADEISAALGFGDRFDLART LSSAGRTIGYHAEAGLRTAANALPRRGISALVRRPKRRPLDEGVVEYAGEIVLARDAE PEHDPGLVLRVAAASADTGLPIGAATLSRLAASVPDLPTPWPQEALDDLLVVLSAGPT TVATIEALDRTGLWGRLLPEWEPIRDLPPRDVAHKWTVDRHVVETAVHAAPLATRVAR PDLLALGALLHDIGKGRGTDHSVLGAELVIPVCTRLGLSPPDVWTLSKLVRHHLLLPI TATRRDLNDPKTIEAVSEALGGDPQLLEVLHALSEADSKATGPGVWSDWKASLVDDLV RRCRMVMAGESLPQAEPTAPHYLSLAADHGVHVEISPRDGERIDAVIVAPDERGLVSK AAAVLALNSLRVHSASVNVHQGVAITEFVVSPLFGSPPAAELVRQQFVGALNGDVDVL GMLQKRDSDAASLVSARAGDVQAGVPVTRTAAPPRILWLDTAAPAKLILEVRAMDRAG LLALLAGALEGAGAGIVWAKVNTFGSTAADVFCVTVPAELDARAAVEQHLLEVLGASV DVVVDEPVGD" CDS complement(3191666..3192004) /codon_start=1 /transl_table=11 /gene="glnB" /locus_tag="BQ2027_MB2943C" /product="PROBABLE NITROGEN REGULATORY PROTEIN P-II GLNB" /note="Mb2943c, glnB, len: 112 aa. Equivalent to Rv2919c, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Probable glnB, nitrogen regulatory protein, highly similar to others e.g. Q9X705|GLNB PII PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (112 aa), FASTA scores: opt: 531, E(): 4.5e-30, (68.75% identity in 112 aa overlap); P21193|GLNB_AZOBR NITROGEN REGULATORY PROTEIN P-II from Azospirillum brasilense (112 aa), FASTA scores: opt: 496, E(): 1.2e-27, (60.7% identity in 112 aa overlap); P05826|GLNB_ECOLI|B2553|Z3829|ECS3419|STY2808 NITROGEN REGULATORY PROTEIN P-II from Escherichia coli strains K12 and O157:H7 (112 aa), FASTA scores: opt: 487, E(): 5.3e-27, (61.6% identity in 112 aa overlap); etc. Contains PS00496 P-II protein urydylation site. BELONGS TO THE P(II) PROTEIN FAMILY. Protein product from Mb2943c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2943c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64250" /db_xref="InterPro:IPR002187" /db_xref="InterPro:IPR002332" /db_xref="InterPro:IPR011322" /db_xref="InterPro:IPR015867" /db_xref="InterPro:IPR017918" /db_xref="UniProtKB/Swiss-Prot:P64250" /protein_id="SIU01564.1" /translation="MKLITAIVKPFTLDDVKTSLEDAGVLGMTVSEIQGYGRQKGHTE VYRGAEYSVDFVPKVRIEVVVDDSIVDKVVDSIVRAARTGKIGDGKVWVSPVDTIVRV RTGERGHDAL" CDS complement(3192001..3193434) /codon_start=1 /transl_table=11 /gene="amt" /locus_tag="BQ2027_MB2944C" /product="PROBABLE AMMONIUM-TRANSPORT INTEGRAL MEMBRANE PROTEIN AMT" /note="Mb2944c, amt, len: 477 aa. Equivalent to Rv2920c, len: 477 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 477 aa overlap). Probable amt, ammonium-transport integral membrane protein (ala-, gly-, leu-, val-rich protein), highly similar to others e.g. Q9ZBP6|SC7A1.27 AMMONIUM TRANSPORTER from Streptomyces coelicolor (448 aa), FASTA scores: opt: 1246, E(): 7.3e-67, (54.1% identity in 462 aa overlap); P54146|AMT_CORGL AMMONIUM TRANSPORT SYSTEM from Corynebacterium glutamicum (452 aa), FASTA scores: opt: 953, E(): 2.1e-49, (41.45% identity in 475 aa overlap); Q07429|NRGA_BACSU PROBABLE AMMONIUM TRANSPORTER (MEMBRANE PROTEIN NRGA) from Bacillus subtilis (404 aa), FASTA scores: opt: 721, E(): 0, (44.4% identity in 430 aa overlap); etc. BELONGS TO THE AMT1/MEP/NRGA FAMILY OF AMMONIUM TRANSPORTERS (TC 2.49). Mb2944c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63520" /db_xref="InterPro:IPR001905" /db_xref="InterPro:IPR018047" /db_xref="InterPro:IPR024041" /db_xref="InterPro:IPR029020" /db_xref="UniProtKB/Swiss-Prot:P63520" /protein_id="SIU01565.1" /translation="MDQFPIMGVPDGGDTAWMLVSSALVLLMTPGLAFFYGGMVRSKS VLNMIMMSISAMGVVTVLWALYGYSIAFGDDVGNIAGNPSQYWGLKGLIGVNAVAADP STQTAAVNIPLAGTLPATVFVAFQLMFAIITVALISGAVADRLKFGAWLLFAGLWATF VYFPVAHWVFAFDGFAAEHGGWIANKLHAIDFAGGTAVHINAGVAALMLAIVLGKRRG WPATLFRPHNLPFVMLGAALLWFGWYGFNAGSATTANGVAGATFVTTTIATAAAMLGW LLTERVRDGKATTLGAASGIVAGLVAITPSCSSVNVLGALAVGVSAGVLCALAVGLKF KLGFDDSLDVVGVHLVGGLVGTLLVGLLAAPEAPAINGVAGVSKGLFYGGGFAQLERQ ALGACSVLVYSGIITLILALILKFTIGLRLDAEQESTGIDEAEHAESGYDFAVASGSV LPPRVTVEDSRNGIQERIGQKVEAEPK" CDS complement(3193911..3195179) /codon_start=1 /transl_table=11 /gene="ftsY" /locus_tag="BQ2027_MB2945C" /product="probable cell division protein ftsy (srp receptor) (signal recognition particle receptor)" /note="Mb2945c, ftsY, len: 422 aa. Equivalent to Rv2921c, len: 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 422 aa overlap). Probable ftsY, membrane-associated cell division protein, equivalent to O33010|FTSY_MYCLE CELL DIVISION PROTEIN FTSY HOMOLOG from Mycobacterium leprae (430 aa), FASTA scores: opt: 1760, E(): 1.1e-108, (81.35% identity in 429 aa overlap). Also similar to others e.g. Q9I6C1|FTSY|PA0373 SIGNAL RECOGNITION PARTICLE RECEPTOR FTSY from Pseudomonas aeruginosa (455 aa), FASTA scores: opt: 882, E(): 5.1e-40, (42.08% identity in 385 aa overlap); Q9KVJ6|FTSY CELL DIVISION PROTEIN from Vibrio cholerae (391 aa), FASTA scores: opt: 837, E(): 1.2e-37, (36.3% identity in 394 aa overlap); P10121|FTSY_ECOLI|FTSY|B3464 CELL DIVISION PROTEIN from Escherichia coli strain K12 (497 aa), FASTA scores: opt: 800, E(): 1.3e-35, (39.75% identity in 327 aa overlap); etc. Also similar to Q9ZBP9|SC7A1.24 PUTATIVE PROKARYOTIC DOCKING PROTEIN from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1461, E(): 4.3e-71, (60.3% identity in 423 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00300 SRP54-type proteins GTP-binding domain signature. BELONGS TO THE SRP FAMILY OF GTP-BINDING PROTEINS. Protein product from Mb2945c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2945c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66843" /db_xref="InterPro:IPR000897" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004390" /db_xref="InterPro:IPR013822" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036225" /db_xref="InterPro:IPR042101" /db_xref="UniProtKB/Swiss-Prot:P66843" /protein_id="SIU01566.1" /translation="MWEGLWIATAVIAALVVIAALTLGLVLYRRRRISLSPRPERGVV DRSGGYTASSGITFSQTPTTQPAERIDTSGLPAVGDDATVPRDAPKRTIADVHLPEFE PEPQAPEVPEADAIAPPEGRLERLRGRLARSQNALGRGLLGLIGGGDLDEDSWQDVED TLLVADLGPAATASVVSQLRSRLASGNVRTEADARAVLRDVLINELQPGMDRSIRALP HAGHPSVLLVVGVNGTGKTTTVGKLARVLVADGRRVVLGAADTFRAAAADQLQTWAAR VGAAVVRGPEGADPASVAFDAVDKGIAAGADVVLIDTAGRLHTKVGLMDELDKVKRVV TRRASVDEVLLVLDATIGQNGLAQARVFAEVVDISGAVLTKLDGTAKGGIVFRVQQEL GVPVKLVGLGEGPDDLAPFEPAAFVDALLG" CDS complement(3195229..3198846) /codon_start=1 /transl_table=11 /gene="smc" /locus_tag="BQ2027_MB2946C" /product="PROBABLE CHROMOSOME PARTITION PROTEIN SMC" /note="Mb2946c, smc, len: 1205 aa. Equivalent to Rv2922c, len: 1205 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1205 aa overlap). Probable smc, chromosome partition protein (ala-, arg-, leu-, glu-rich protein, possibly coiled-coil protein) (see * below), equivalent (but longer 84 aa) to Q9CBT5|SMC|ML1629|MLCB250.01 POSSIBLE CELL DIVISION PROTEIN from Mycobacterium leprae (1203 aa), FASTA scores: opt: 5957, E(): 0, (79.15% identity in 1205 aa overlap). Also highly similar to other chromosome segregation proteins e.g. Q9ZBQ2|SC7A1.21 PUTATIVE CHROMOSOME ASSOCIATED PROTEIN from Streptomyces coelicolor (1186 aa), FASTA scores: opt: 2633, E(): 4.1e-120, (53.03% identity in 1205 aa overlap); P51834|SMC_BACSU CHROMOSOME PARTITION PROTEIN from Bacillus subtilis (1186 aa), FASTA scores: opt: 1009, E(): 2.1e-41, (30.75% identity in 1205 aa overlap); Q9CHC9|SMC CHROMOSOME SEGREGATION PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (924 aa), FASTA scores: opt: 996, E(): 7.5e-41, (29.75% identity in 874 aa overlap); etc. Equivalent to AAK47317 from Mycobacterium tuberculosis strain CDC1551 (1205 aa) but longer 84 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SMC FAMILY. N-terminus shortened since first submission. [*Note: Cobbe N., Heck M.M.S.- Phylogenetic analysis of SMC proteins (OCT-2001)]. Protein product from Mb2946c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2946c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2R1" /db_xref="InterPro:IPR003395" /db_xref="InterPro:IPR010935" /db_xref="InterPro:IPR011890" /db_xref="InterPro:IPR024704" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036277" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R1" /protein_id="SIU01567.1" /translation="MYLKSLTLKGFKSFAAPTTLRFEPGITAVVGPNGSGKSNVVDAL AWVMGEQGAKTLRGGKMEDVIFAGTSSRAPLGRAEVTVSIDNSDNALPIEYTEVSITR RMFRDGASEYEINGSSCRLMDVQELLSDSGIGREMHVIVGQGKLEEILQSRPEDRRAF IEEAAGVLKHRKRKEKALRKLDTMAANLARLTDLTTELRRQLKPLGRQAEAAQRAAAI QADLRDARLRLAADDLVSRRAEREAVFQAEAAMRREHDEAAARLAVASEELAAHESAV AELSTRAESIQHTWFGLSALAERVDATVRIASERAHHLDIEPVAVSDTDPRKPEELEA EAQQVAVAEQQLLAELDAARARLDAARAERADRERRAAEADRAHLAAVREEADRREGL ARLAGQVETMRARVESIDESVARLSERIEDAAMRAQQTRAEFETVQGRIGELDQGEVG LDEHHERTVAALRLADERVAELQSAERAAERQVASLRARIDALAVGLQRKDGAAWLAH NRSGAGLFGSIAQLVKVRSGYEAALAAALGPAADALAVDGLTAAGSAVSALKQADGGR AVLVLSDWPAPQAPQSASGEMLPSGAQWALDLVESPPQLVGAMIAMLSGVAVVNDLTE AMGLVEIRPELRAVTVDGDLVGAGWVSGGSDRKLSTLEVTSEIDKARSELAAAEALAA QLNAALAGALTEQSAGQDAAEQALAALNESDTAISAMYEQLGRLGQEARAAEEEWNRL LQQRTEQEAVRTQTLDDVIQLETQLRKAQETQRVQVAQPIDRQAISAAADRARGVEVE ARLAVRTAEERANAVRGRADSLRRAAAAEREARVRAQQARAARLHAAAVAAAVADCGR LLAGRLHRAVDGASQLRDASAAQRQQRLAAMAAVRDEVNTLSARVGELTDSLHRDELA NAQAALRIEQLEQMVLEQFGMAPADLITEYGPHVALPPTELEMAEFEQARERGEQVIA PAPMPFDRVTQERRAKRAERALAELGRVNPLALEEFAALEERYNFLSTQLEDVKAARK DLLGVVADVDARILQVFNDAFVDVEREFRGVFTALFPGGEGRLRLTEPDDMLTTGIEV EARPPGKKITRLSLLSGGEKALTAVAMLVAIFRARPSPFYIMDEVEAALDDVNLRRLL SLFEQLREQSQIIIITHQKPTMEVADALYGVTMQNDGITAVISQRMRGQQVDQLVTNS S" CDS complement(3198858..3199139) /codon_start=1 /transl_table=11 /gene="acyP" /locus_tag="BQ2027_MB2947C" /product="PROBABLE ACYLPHOSPHATASE ACYP (ACYLPHOSPHATE PHOSPHOHYDROLASE)" /note="Mb2947c, acyP, len: 93 aa. Equivalent to Rv2922A, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Probable acyP, acylphosphatase (acylphosphate phosphohydrolase) (EC 3.6.1.7), highly similar to others e.g. Q9ZBQ3|SC7A1.20 PUTATIVE ACYLPHOSPHATASE from Streptomyces coelicolor (93 aa), FASTA scores: opt: 345, E(): 9.5e-19, (58.9% identity in 90 aa overlap); P75877|ACYP_ECOLI|YCCX|B0968|Z1320|ECS1 052 PUTATIVE ACYLPHOSPHATASE from Escherichia coli strains K12 and O157:H7 (92 aa), FASTA scores: opt: 220, E(): 2e-09, (44.95% identity in 89 aa overlap); Q9RVU3|DR0929 PUTATIVE ACYLPHOSPHATASE from Deinococcus radiodurans (87 aa), FASTA scores: opt: 193, E(): 2.1e-07, (44.3% identity in 79 aa overlap); etc. BELONGS TO THE ACYLPHOSPHATASE FAMILY. Protein product from Mb2947c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2947c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P69418" /db_xref="InterPro:IPR001792" /db_xref="InterPro:IPR017968" /db_xref="InterPro:IPR020456" /db_xref="InterPro:IPR036046" /db_xref="UniProtKB/Swiss-Prot:P69418" /protein_id="SIU01568.1" /translation="MSAPDVRLTAWVHGWVQGVGFRWWTRCRALELGLTGYAANHADG RVLVVAQGPRAACQKLLQLLQGDTTPGRVAKVVADWSQSTEQITGFSER" CDS complement(3199126..3199539) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2948C" /product="Predicted redox protein, regulator of disulfide bond formation" /note="Mb2948c, -, len: 137 aa. Equivalent to Rv2923c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Conserved hypothetical protein, showing similarity with other hypothetical proteins e.g. P24246|YHFA_ECOLI|B3356|Z4717|ECS4207 from Escherichia coli strains K12 and O157:H7 (134 aa), FASTA scores: opt: 110, E(): 1.9, (25.9% identity in 135 aa overlap); etc. Protein product from Mb2948c detected using SWATH mass spectrometry. Mb2948c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003718" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR036102" /db_xref="UniProtKB/Swiss-Prot:P65056" /protein_id="SIU01569.1" /translation="MTQLWVERTGTRRYIGRSTRGAQVLVGSEDVDGVFTPGELLKIA LAACSGMASDQPLARRLGDDYQAVVKVSGAADRDQERYPLIEETMELDLSGLTEDEKE RLLVVINRAVELACTVGRTLKSGTTVNLEVVDVGA" CDS complement(3199641..3200510) /codon_start=1 /transl_table=11 /gene="fpg" /locus_tag="BQ2027_MB2949C" /product="PROBABLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE FPG (FAPY-DNA GLYCOSYLASE)" /note="Mb2949c, fpg, len: 289 aa. Equivalent to Rv2924c, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Probable fpg (alternate gene name: mutM), formamidopyrimidine-DNA glycosylase (EC 3.2.2.23), equivalent to O69470|FPG_MYCLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Mycobacterium leprae (282 aa), FASTA scores: opt: 1563, E(): 1.3e-96, (80.6% identity in 289 aa overlap). Also highly similar to other formamidopyrimidine-DNA glycosylases e.g. Q9ZBQ6|FPG_STRCO from Streptomyces coelicolor (286 aa), FASTA scores: opt: 1047, E(): 2.9e-62, (57.55% identity in 292 aa overlap); P95744|FPG_SYNEN from Synechococcus elongatus naegeli (284 aa), FASTA scores: opt: 569, E(): 1.9e-30, (37.95% identity in 290 aa overlap); P05523|FPG_ECOLI|MUTM|FPG|B3635 from Escherichia coli strain K12 (269 aa), FASTA scores: opt: 424, E(): 8.2e-21, (33.9% identity in 289 aa overlap); etc. BELONGS TO THE FPG FAMILY. COFACTOR: BINDS 1 ZINC ION. Mb2949c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64151" /db_xref="InterPro:IPR000214" /db_xref="InterPro:IPR010663" /db_xref="InterPro:IPR010979" /db_xref="InterPro:IPR012319" /db_xref="InterPro:IPR015886" /db_xref="InterPro:IPR015887" /db_xref="InterPro:IPR020629" /db_xref="InterPro:IPR035937" /db_xref="UniProtKB/Swiss-Prot:P64151" /protein_id="SIU01570.1" /translation="MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADL TARLRGARINGTDRRGKYLWLTLNTAGVHRPTDTALVVHLGMSGQMLLGAVPCAAHVR ISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLDPRFDCDAVVK VLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAHVAATLRCRRLGAVLHAAA DVMREALAKGGTSFDSLYVNVNGESGYFERSLDAYGREGENCRRCGAVIRRERFMNRS SFYCPRCQPRPRK" CDS complement(3200814..3201536) /codon_start=1 /transl_table=11 /gene="rnc" /locus_tag="BQ2027_MB2950C" /product="PROBABLE RIBONUCLEASE III RNC (RNASE III)" /note="Mb2950c, rnc, len: 240 aa. Equivalent to Rv2925c, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 240 aa overlap). Probable rnc, ribonuclease III (RNase III) (EC 3.1.26.3), equivalent to O69469|RNC_MYCLE RIBONUCLEASE III from Mycobacterium leprae (238 aa). Also highly similar to other ribonucleases III e.g. Q9ZBQ7|RNC_STRCO from Streptomyces coelicolor (272 aa), FASTA scores: opt: 889, E(): 5.4e-51, (62.2% identity in 225 aa overlap) (N-terminus longer 21 aa); P51833|RNC_BACSU from Bacillus subtilis (249 aa), FASTA scores: opt: 493, E(): 5e-25, (43.25% identity in 215 aa overlap); P05797|RNC_ECOLI|RNC|B2567|Z3848|ECS3433 from Escherichia coli strain O157:H7 and K12 (226 aa), FASTA scores: opt: 459, E(): 7.9e-23, (41.8% identity in 213 aa overlap); etc. Contains PS00517 Ribonuclease III family signature. Protein product from Mb2950c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2950c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66667" /db_xref="InterPro:IPR000999" /db_xref="InterPro:IPR011907" /db_xref="InterPro:IPR014720" /db_xref="InterPro:IPR036389" /db_xref="UniProtKB/Swiss-Prot:P66667" /protein_id="SIU01571.1" /translation="MIRSRQPLLDALGVDLPDELLSLALTHRSYAYENGGLPTNERLE FLGDAVLGLTITDALFHRHPDRSEGDLAKLRASVVNTQALADVARRLCAEGLGVHVLL GRGEANTGGADKSSILADGMESLLGAIYLQHGMEKAREVILRLFGPLLDAAPTLGAGL DWKTSLQELTAARGLGAPSYLVTSTGPDHDKEFTAVVVVMDSEYGSGVGRSKKEAEQK AAAAAWKALEVLDNAMPGKTSA" CDS complement(3201533..3202156) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2951C" /product="FIG01269488: protein, clustered with ribosomal protein L32p" /note="Mb2951c, -, len: 207 aa. Equivalent to Rv2926c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Conserved hypothetical protein, equivalent to O69468|ML1660|MLCB1243.14 HYPOTHETICAL 23.5 KDA PROTEIN from Mycobacterium leprae (217 aa), FASTA scores: opt: 866, E(): 1.4e-48, (67.2% identity in 192 aa overlap). Also similar in part to other hypothetical proteins e.g. Q9WXZ8 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (182 aa), FASTA scores: opt: 254, E(): 3.4e-09, (31.45% identity in 143 aa overlap); Q9ZBQ9|SC7A1.14 HYPOTHETICAL 23.5 KDA PROTEIN from Streptomyces coelicolor (217 aa), FASTA scores: opt: 244, E(): 1.7e-08, (45.5% identity in 189 aa overlap); O65982 HYPOTHETICAL 26.2 KDA PROTEIN from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (228 aa), FASTA scores: opt: 220, E(): 6.1e-07, (32.45% identity in 148 aa overlap); etc. Equivalent to AAK47323 from Mycobacterium tuberculosis strain CDC1551 (195 aa) but longer 12 aa. Protein product from Mb2951c detected using SWATH mass spectrometry. Mb2951c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003772" /db_xref="UniProtKB/Swiss-Prot:P65058" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01572.1" /translation="MDLGGVRRRISLMARQHGPTAQRHVASPMTVDIARLGRRPGAMF ELHDTVHSPARIGLELIAIDQGALLDLDLRVESVSEGVLVTGTVAAPTVGECARCLSP VRGRVQVALTELFAYPDSATDETTEEDEVGRVVDETIDLEQPIIDAVGLELPFSPVCR PDCPGLCPQCGVPLASEPGHRHEQIDPRWAKLVEMLGPESDTLRGER" CDS complement(3202207..3202944) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2952C" /product="FIG00814129: Possible chaperone" /note="Mb2952c, -, len: 245 aa. Equivalent to Rv2927c, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 245 aa overlap). Conserved hypothetical protein, equivalent to Q9CBS6|ML1661|MLCB1243.13 (alias O69467) HYPOTHETICAL PROTEIN from Mycobacterium leprae (247 aa), FASTA scores: opt: 1440, E(): 4.9e-76, (90.6% identity in 245 aa overlap). Also similar to many hypothetical proteins from other organisms e.g. Q9ZBR0|SC7A1.13 HYPOTHETICAL 41.0 KDA PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 266, E(): 3.4e-08, (29.9% identity in 234 aa overlap); etc. Also some similarity with P46815|AG84_MYCLE|ML0922 ANTIGEN 84 from Mycobacterium leprae (266 aa), FASTA scores: opt: 193, E(): 0.00043, (28.7% identity in 136 aa overlap) (see citation below); and P46816|AG84_MYCTU|WAG31|Rv2145c|MT2204|MTCY270.23 ANTIGEN 84 from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 178, E(): 0.0031, (34.35% identity in 131 aa overlap) (see citation below). Contains potential coiled-coil region. Protein product from Mb2952c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2952c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P65060" /protein_id="SIU01573.1" /translation="MYRVFEALDELSAIVEEARGVPMTAGCVVPRGDVLELIDDIKDA IPGELDDAQDVLDARDSMLQDAKTHADSMVSSATTEAESILNHARTEADRILSDAKAQ ADRMVSEARQHSERMVADAREEAIRIATAAKREYEASVSRAQAECDRLIENGNISYEK AVQEGIKEQQRLVSQNEVVAAANAESTRLVDTAHAEADRLRGECDIYVDNKLAEFEEF LNGTLRSVGRGRHQLRTAAGTHDYAVR" CDS 3203183..3203968 /codon_start=1 /transl_table=11 /gene="tesA" /locus_tag="BQ2027_MB2953" /product="PROBABLE THIOESTERASE TESA" /note="Mb2953, tesA, len: 261 aa. Equivalent to Rv2928, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Probable tesA, thioesterase (EC 3.1.2.-), equivalent to Q9Z5K4|ML2359|MLCB12.04c PUTATIVE THIOESTERASE from Mycobacterium leprae (261 aa), FASTA scores: opt: 1326, E(): 3.7e-80, (73.2% identity in 261 aa overlap). Also similar to others e.g. Q9ZGI1 THIOESTERASE II PIKAV from Streptomyces venezuelae (281 aa), FASTA scores: opt: 535, E(): 6.6e-28, (38.05% identity in 234 aa overlap); Q9L4W2|NYSE thioesterase involved in synthesis of the polyene antifungal antibiotic nystatin from Streptomyces noursei (see citation below) (251 aa), FASTA scores: opt: 523, E(): 3.8e-27, (34.53% identity in 223 aa overlap); Q54145 THIOESTERASE from Streptomyces fradiae (253 aa), FASTA scores: opt: 495, E(): 2.7e-25, (37.85% identity in 230 aa overlap); etc. Protein product from Mb2953 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2953 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63461" /db_xref="InterPro:IPR001031" /db_xref="InterPro:IPR012223" /db_xref="InterPro:IPR020802" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P63461" /protein_id="SIU01574.1" /translation="MLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDY VAFSREFSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPVAFFG HSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSDREMLDLFTRMTG MNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSCPIYAFIGDKDWIATQDDMDP WRDRTTEEFSIRVFPGDHFYLNDNLPELVSDIEDKTLQWHDRA" CDS 3203955..3204266 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2954" /product="HYPOTHETICAL PROTEIN" /note="Mb2954, -, len: 103 aa. Equivalent to Rv2929, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Hypothetical unknown protein; unlikely ORF but some weak similarity to C-terminal half of P18319|UREG_KLEAE urease accessory protein from klebsiella aerogenes (205 aa), FASTA scores: opt: 99, E(): 1.1, (38.6% identity in 57 aa overlap). Mb2954 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P65062" /protein_id="SIU01575.1" /translation="MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGL ERMASDTHGGGGGRPVTPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVL T" CDS 3204682..3206433 /codon_start=1 /transl_table=11 /gene="fadD26" /locus_tag="BQ2027_MB2955" /product="fatty-acid-amp ligase fadd26 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb2955, fadD26, len: 583 aa. Equivalent to Rv2930, len: 583 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 583 aa overlap). fadD26, fatty-acid-CoA synthetase (EC 6.2.1.-) (see first and third citations below), equivalent to Q9Z5K5|FADD26|ML2358|MLCB12.03c PROBABLE ACYL-CoA SYNTHASE from Mycobacterium leprae (583 aa), FASTA scores: opt: 3026, E(): 9.2e-180, (76.85% identity in 583 aa overlap). Also highly similar to many e.g. Q9CD84|ML0132 PUTATIVE ACYL-CoA SYNTHETASE from Mycobacterium leprae (680 aa), FASTA scores: opt: 2324, E(): 3.2e-136, (61.35% identity in 572 aa overlap); P71495 ACYL-CoA SYNTHASE from Mycobacterium bovis (582 aa), FASTA scores: opt: 2304, E(): 5e-135, (59.85% identity in 583 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Q50586|FD25_MYCTU|RV1521|MTCY19G5.07 PUTATIVE FATTY-ACID--CoA LIGASE (583 aa), FASTA scores: opt: 2188, E(): 7.6e-128, (57.55% identity in 584 aa overlap); etc. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. N-terminus shortened since first submission. Note that Rv2930|fadD26 belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2955 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2955 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXM1" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TXM1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01576.1" /translation="MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWS QVYSRACIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYGIH DDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLDLDSPRQMPAF SRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYGYFGDPAKIPTGTVVSWLP LYHDMGLILGICAPLVARRRAVLMSPMSFLRRPARWMQLLATSGRCFSAAPNFAFELA VRRTSDQDMAGLDLRDVVGIVSGSERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEA TLYVAAPEAGAAPKTVRFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPE TMVENPPGVVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDITEQLVAII EFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVSPGSIPITTSGKIRRSA CVERYRSDGFKRLDVAV" CDS 3206430..3212060 /codon_start=1 /transl_table=11 /gene="ppsA" /locus_tag="BQ2027_MB2956" /product="PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSA" /note="Mb2956, ppsA, len: 1876 aa. Equivalent to Rv2931, len: 1876 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 1876 aa overlap). ppsA, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9Z5K6|ML2357|MLCB12.02c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1871 aa), FASTA scores: opt: 7566, E(): 0, (76.1% identity in 1888 aa overlap); Q9S384|ML2356|MLCB12.01c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1540 aa), FASTA scores: opt: 4026, E(): 9.8e-212, (45.7% identity in 1811 aa overlap); Q49932|PKSC|L518_F1_2 PUTATIVE POLYKETIDE SYNTHASE (1446 aa), FASTA scores: opt: 4026, E(): 9.4e-212, (70.6% identity in 885 aa overlap). Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9L8C7|EPOC POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 2592, E(): 5.2e-133, (32.55% identity in 2245 aa overlap); P22367|MSAS_PENPA 6-methylsalicylic acid synthase from Penicillium patulum (Penicillium griseofulvum) (1774 aa), FASTA scores: opt: 2391, E(): 0, (34.2% identity in 1815 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10978|PPSB_MYCTU|RV2932 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1538 aa), FASTA scores: opt: 4227, E(): 0, (46.8% identity in 1810 aa overlap) (gap in middle); etc. Contains PS00606 Beta-ketoacyl synthases active site, and PS00012 Phosphopantetheine attachment site. Note that Rv2931|ppsA belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2956 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2956 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXM0" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/Swiss-Prot:Q7TXM0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01577.1" /translation="MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSR DAVVLSGELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSLDE PIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPPEVAAALARTT RWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWEALEHAGIPPGTLRRSATG VFAGACLSEYGAMASADLSQVDGWSNSGGAMSIIANRLSYFLDLRGPSVAVDTACSSS LVAIHLACQSLRTQDCHLAIAAGVNLLLSPAVFRGFDQVGALSPTGQCRAFDATADGF VRGEGAGVVVLKRLTDAQRDGDRVLAVICGSAVTQDGRSNGLMAPNPAAQMAVLRAAY TNAGMQPSEVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWPATGHPRRA GVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGKTMQRVSATAGMLADWM EGPGADVALADVAHTLNHHRSRQPKFGTVVARDRTQAIAGLRALAAGQHAPGVVNPAE GSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLHDVLANG EELVGIEQIQLGLIGMQLALTELWCSYGVQPDLVIGHSMGEVAAAVVAGALTPAEGLR VTATRSRLMAPLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQI DELITRVRARDRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYAD LHTQPVFDAEHWATNMRNPVHFQQAIASAGSGADGAYHTFIEISAHPLLTQAIIDTLH SAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHPPHTPHPPEPHPPIPTTPWQHTR HWITTKYPAGSVGSAPRAGTLLGQHTTVATVSASPPSHLWQARLAPDAKPYQGGHRFH QVEVVPASVVLHTILSAATELGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSP AAGTPSDRWTRHVTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRG IDGLPFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADSRLYV PASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPSLSMRSLRYRALD FGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVPDATHPGAEQVTHPGPVAIIG DDGAALCETLEGAGYQPAVMSDGVSQARYVVYVADSDPAGADETDVDFAVRICTEITG LVRTLAERDADKPAALWILTRGVHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDL AINDDLGEFGPALAELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYL ITGGLGALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRALE MRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQLVTSMTGDAVR QVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQGSYAAANSYLDALARARR QQGCHTMSLDWVAWRGLGLAADAQLVSEELARMGSRDITPSEAFTAWEFVDGYDVAQA VVVPMPAPAGADGSGANAYLLPARNWSVMAATEVRSELEQGLRRIIAAELRVPEKELD TDRPFAELGLNSLMAMAIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVS QDNQISALSSSAGSVLDSLFDRIESAPPEAERSV" CDS 3212057..3216673 /codon_start=1 /transl_table=11 /gene="ppsB" /locus_tag="BQ2027_MB2957" /product="PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSB" /note="Mb2957, ppsB, len: 1538 aa. Equivalent to Rv2932, len: 1538 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1538 aa overlap). ppsB, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9S384|ML2356|MLCB12.01c PUTATIVE POLYKETIDE SYNTHASE (1540 aa), FASTA scores: opt: 7284, E(): 0, (76.3% identity in 1561 aa overlap); Q49932|PKSC|L518_F1_2 PUTATIVE POLYKETIDE SYNTHASE (1446 aa), FASTA scores: opt: 6811, E(): 0, (76.2% identity in 1462 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. Q9KIZ6|EPOE EPOE PROTEIN from Polyangium cellulosum (3798 aa), FASTA scores: opt: 3052, E(): 3.3e-165, (38.35% identity in 1538 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10977|PPSA_MYCTU|RV2931 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1876 aa), FASTA scores: opt: 4227, E(): 0, (46.9% identity in 1810 aa overlap); P96203|PPSD|Rv2934|MTCY19H9.02 PKSE PROTEIN (1827 aa), FASTA scores: opt: 3756, E(): 1.8e-205, (42.9% identity in 1808 aa overlap); etc. Overlaps and extends CDS from neighbouring cosmid MTCY338.21. Contains PS00606 Beta-ketoacyl synthases active site. Note that Rv2932|ppsB belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2957 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2957 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXL9" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/Swiss-Prot:Q7TXL9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01578.1" /translation="MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCR FPGDVDGPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPDVA GFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTAVMMGVYFNEY QSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVAVDTACSSSLVAVHLACQS LRLRETDLALAGGVSITLRPETQIAISAWGLLSPQGRCAAFDAAADGFVRGEGAGVVV LKRLTDAVRDGDQVLAVVRGSAVNQDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSV NYVEAHGTGTVLGDPIEFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATL AVQRATIPPNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGAEVAVADVA HTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVSHQDGSPGPGTVFVYSG RGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLRDVIATGKELVGIEQIQLGL IGMQLTLTELWRSYGVQPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRARLMAPLS GQGGMALLGLDAAATEALIADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRF ASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWA TNMRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTKSAAKYL SIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTPWQHTHHWIATTHPS TAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPIRPAVSADPPSTAAWLVVADNEL CHELARAADSRVDSLSPPALAAGSDPAALLDALRGVDNVLYAPPVPGELLDIESAYQV FHATRRLAAAMVASSATAISPPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEH PEIWGGIIDLDDSMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPV TLNADASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATGTDLI AVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTDDDVTTMFRPKLD ALALLHRLSLKSPVRHFVLFSSVSGLLGSRWLAHYTATSAFLDSFAGARRTMGLPATV VDWGLWKSLADVQKDATQISAESGLQPMADEVAIGALPLVMNPDAAVATVVVAADWPL LAAAYRTRGALRIVDDLLPAPEDVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATV MGMPPTEPLDPSAGFFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYL ATVLPELLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ" CDS 3216670..3223236 /codon_start=1 /transl_table=11 /gene="ppsC" /locus_tag="BQ2027_MB2958" /product="PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSC" /note="Mb2958, ppsC, len: 2188 aa. Equivalent to Rv2933, len: 2188 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 2188 aa overlap). ppsC, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q49933|PKSD|ML2355|L518_F1_3 PUTATIVE POLYKETIDE SYNTHASE (2201 aa), FASTA scores: opt: 6973, E(): 0, (82.32% identity in 2217 aa overlap); Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE (2118 aa), FASTA scores: opt: 4015, E(): 2.9e-208, (36.6% identity in 2184 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9L8C7 POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 3909, E(): 3.6e-202, (40.15% identity in 2220 aa overlap); Q9KIZ7|EPOD EPOD PROTEIN from Polyangium cellulosum (7257 aa), FASTA scores: opt: 3886, E(): 6.2e-201, (40.05% identity in 2220 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. P96291|Rv2940c (2111 aa), FASTA scores: opt: 4204, E(): 0, (39.1% identity in 2176 aa overlap); Q10977|PPSA_MYCTU|RV2931 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1876 aa), FASTA scores: opt: 3793, E(): 2.4e-196, (46.65% identity in 1612 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, and PS00012 Phosphopantetheine attachment site. Note that Rv2933|ppsC belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2958 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2958 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXL8" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="PDB:5NJI" /db_xref="UniProtKB/Swiss-Prot:Q7TXL8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01579.1" /translation="MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGC RFPGGVNNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLTSW QPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGTQTSVFVGVTA YDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGARGPAVVIDTACSSSLVAVH LACQSLRGRESDMALVGGTNLLLSPGPSIACSRWGMLSPEGRCKTFDASADGYVRGEG AAVVVLKRLDDAVRDGNRILAVVRGSAVNQDGASSGVTVPNGPAQQALLAKALTSSKL TAADIDYVEAHGTGTPLGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVA GLMKAVLAVHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATASVLADWLDG PGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRALAAGQSAPGVVAPREGS IGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAIAELEPEFVAQGGFSLRDVIAGGKE LVGIEQIQLGLIGMQLALTALWRSYGVTPDAVIGHSMGEVAAAVVAGALTPAQGLRVT AVRSRLMAPLSGQGTMALLELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDE LIDKVRQQNGFATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLG ISLGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSISDTLRAS YDVDNYLSIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPHPVLPTTPWQHTQHW ITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELDPDLLWLADHVIDDLVVLPGAAY AEIALAAATDTFAVEQDQPWMISELDLRQMLHVTPGTVLVTTLTGDEQRCQVEIRTRS GSSGWTTHATATVARAEPLAPLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGP AFQGIVGLAVTQAGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAG GQDARQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTDANGQ PLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGALLLIGDPAAGDPL LPALQSSLRDRITDLELASAADEATLRAAISRTSWDGIVVVCPPRANDESMPDEAQLE LARTRTLLVASVVETVTRMGARKSPRLWIVTRGAAQFDAGESVTLAQTGLRGIARVLT FEHSELNTTLVDIEPDGTGSLAALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSG DLAAEARHQVVNLDSSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAA GLNFSDVLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTFGT HLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVLIHSATGGVGM AAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSVDFADEILELTDGYGVDVV LNSLAGEAIQRGVQILAPGGRFIELGKKDVYADASLGLAALAKSASFSVVDLDLNLKL QPARYRQLLQHILQHVADGKLEVLPVTAFSLHDAADAFRLMASGKHTGKIVISIPQHG SIEAIAAPPPLPLVSRDGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVA AAIAELNASGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYAAANSWVDG LVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQGLAAMQAVLTADRGRTG VFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSGQRRGGGAIRAQLDALDAAERPGHL ASAIADEIRAVLRSGDPIDHHRPLETLGLDSLMGLELRNRLEASLGITLPVALVWAYP TISDLATALCERMDYATPAAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES" CDS 3223233..3228716 /codon_start=1 /transl_table=11 /gene="ppsD" /locus_tag="BQ2027_MB2959" /product="PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSD" /note="Mb2959, ppsD, len: 1827 aa. Equivalent to Rv2934, len: 1827 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1827 aa overlap). ppsD, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9CB70|ML2354 POLYKETIDE SYNTHASE (1822 aa), FASTA scores: opt: 9779, E(): 0, (80.35% identity in 1836 aa overlap); Q49940|L518_F3_67|PFSE (1815 aa), FASTA scores: opt: 9658, E(): 0, (79.85% identity in 1831 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9RNB2|MCYD|Q9FDU1 POLYKETIDE SYNTHASE (MCYD PROTEIN) from Microcystis aeruginosa (3906 aa), FASTA scores: opt: 2961, E(): 6e-159, (32.15% identity in 1827 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10978|PPSB_MYCTU|RV2932 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1538 aa), FASTA scores: opt: 3756, E(): 3.8e-204, (42.85% identity in 1808 aa overlap) (gaps in middle); P96202|PPSC|RV2933 POLYKETIDE SYNTHASE (2188 aa), FASTA scores: opt: 3463, E(): 1.7e-187, (39.2% identity in 2165 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, PS00017 ATP/GTP-binding site motif A, PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and PS00012 Phosphopantetheine attachment site. Note that Rv2934|ppsD belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2959 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2959 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXL7" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/Swiss-Prot:Q7TXL7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01580.1" /translation="MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIG CRFPGNVTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFVSD VDAFDADFFGITPREAVAMDPQHRILLEVAWEALEHAGIPPDSLSGTRTGVMMGLSSW DYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPAVAVDTACSSSLVAIHLAC QSLRLRETDVALAGGVQLTLSPFTAIALSKWSALSPTGRCNSFDANADGFVRGEGCGV VVLKRLADAVRDQDRVLAVVRGSATNSDGRSNGMTAPNALAQRDVITSALKLADVTPD SVNYVETHGTGTVLGDPIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAG FIKAVLAVQRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWMSGPGAAAP LADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPRVGVVDCDQHAGGPGRV FVYSGQGSQWASMGQQLLANEPAFAKAVAELDPIFVDQVGFSLQQTLIDGDEVVGIDR IQPVLVGMQLALTELWRSYGVIPDAVIGHSMGEVSAAVVAGALTPEQGLRVITTRSRL MARLSGQGAMALLELDADAAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVA TQNRLARRVEVDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADAD YWSANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVMSTMNRE LDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDRSAMSELAATHPLLG AHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIMPAAGFAEIALAAASEALGTAAD AVAPNIVINQFEVEQMLPLDGHTPLTTQLIRGGDSQIRVEIYSRTRGGEFCRHATAKV EQSPRECAHAHPEAQGPATGTTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAE TEISIPDEAPRHPGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYR DIGRHVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLPLEQK IFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKSMADDFVQQWRSP MRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVGGASSRLDDELAAARDTVWSI TTVVRAVVGTWHGRSPRLWLVTGGGLSVADDEPGTPAAASLKGLVRVLAFEHPDMRTT LVDLDITQDPLTALSAELRNAGSGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVR QGASYVVTGGLGGLGLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVV RGDVASPGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKATGA LRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRRASGLPAAVIN WGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIRTGVARLRADRALVAFPEI RSISYFTQVVEELDSAGDLGDWGGPDALADLDPGEARRAVTERMCARIAAVMGYTDQS TVEPAVPLDKPLTELGLDSLMAVRIRNGARADFGVEPPVALILQGASLHDLTADLMRQ LGLNDPDPALNNADTIRDRARQRAAARHGAAMRRRPKPAVQGG" CDS 3228722..3233188 /codon_start=1 /transl_table=11 /gene="ppsE" /locus_tag="BQ2027_MB2960" /product="PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSE" /note="Mb2960, ppsE, len: 1488 aa. Equivalent to Rv2935, len: 1488 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1488 aa overlap). ppsE, type-I polyketide synthase (see citations below), equivalent to Q49934|PKSF|ML2353|L518_F1_8 PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1489 aa), FASTA scores: opt: 8156, E(): 0, (82.05% identity in 1493 aa overlap). Also similar to polyketide synthases from other bacteria e.g. Q9RAH3|NOSB NOSB PROTEIN from Nostoc sp. GSV224 (1244 aa), FASTA scores: opt: 2438, E(): 8.8e-137, (43.75% identity in 969 aa overlap); Q9KIZ8|EPOC EPOC PROTEIN from Polyangium cellulosum (1832 aa), FASTA scores: opt: 2272, E(): 8.6e-127, (39.95% identity in 1061 aa overlap); O54155|SC3F7.12 POLYKETIDE SYNTHASE from Streptomyces coelicolor (2297 aa), FASTA scores: opt: 1522, E(): 3.6e-82, (36.35% identity in 1057 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site. Note that Rv2935|ppsE belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2960 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2960 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXL6" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/Swiss-Prot:Q7TXL6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01581.1" /translation="MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQE LRDAGVSDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAWHA LEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFDQFSLFLQNDK DFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLSGECDMALAGGSSLCIPHR VGYFTSPGSMVSAVGHCRPFDVRADGTVFGSGVGLVVLKPLAAAIDAGDRIHAVIRGS AINNDGSAKMGYAAPNPAAQADVIAEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQG LRAAFEVSQTSRSAPCVLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSP NPELRLDQSPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKHNVTMAAVV HDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRVVFLFPGQGAQHVGMAK GLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFDGTATDLERIDRSQPALFTVEYALA KLVDTFGVRAGAYIGYSTGEYIAATLAGVFDLQTAIKTVSLRARLMHESPPGAMVAVA LGPDDVTQYLPPEVELSAVNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFH TSAMDPMLGQFQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFA DELDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDDRDTFLR ALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHTVWAQAPGANNGSPA GTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVSSVDRNANFFDLGGDSLMAISIA MAAANEGLTITPQDLYEYPTLASLTAAVDASFASSGLAKPPEAQANPAVPPNVTYFLD RGLRDTGRCRVPLILRLDPKIGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAP AEFTGLSNRSVPDGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHY LCLAIHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAALATHP AALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLSVEQTSELDDGRR RFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSVLRPDVDLRRTVGWFTTYYPV PLACATGLGALAQLDAVHNTLKSVPHYGIGYGLLRYVYAPTGRVLGAQRTPDIHFRYA GVIPELPSGDAPVQFDSDMTLPVREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPA ATAEALERTFPLALSALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG" CDS 3233199..3234194 /codon_start=1 /transl_table=11 /gene="drrA" /locus_tag="BQ2027_MB2961" /product="daunorubicin-dim-transport atp-binding protein abc transporter drra" /note="Mb2961, drrA, len: 331 aa. Equivalent to Rv2936, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 331 aa overlap). Probable drrA, daunorubicin-DIM-transport resistance ATP-binding protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q49938|DRRA|ML2352|L518_F2_43|DRRA PROBABLE DAUNORUBICIN RESISTANCE ATP-BINDING PROTEIN from Mycobacterium leprae (331 aa), FASTA scores: opt: 1842, E(): 4.2e-103, (85.2% identity in 331 aa overlap). Also highly similar to others e.g. Q9XCF7 DRRA from Mycobacterium avium (315 aa), FASTA scores: opt: 1040, E(): 4.7e-55, (54.35% identity in 309 aa overlap); Q9X5J8 DAUNORUBICIN RESISTANCE PROTEIN A from Mycobacterium avium (315 aa), FASTA scores: opt: 1030, E(): 1.9e-54, (53.7% identity in 309 aa overlap); P32010|DRRA_STRPE DAUNORUBICIN RESISTANCE ATP-BINDING PROTEIN from Streptomyces peucetius (330 aa), FASTA scores: opt: 852, E(): 9e-44, (47.15% identity in 318 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that Rv2936|drrA belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2961 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2961 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2L1" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005894" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2L1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01582.1" /translation="MRNDDMAVVVNGVRKTYGKGKIVALDDVSFKVRRGEVIGLLGPN GAGKTTMVDILSTLTRPDAGSAIIAGYDVVSEPAGVRRSIMVTGQQVAVDDALSGEQN LVLFGRLWGLSKSAARKRAAELLEQFSLVHAGKRRVGTYSGGMRRRIDIACGLVVQPQ VAFLDEPTTGLDPRSRQAIWDLVASFKKLGIATLLTTQYLEEADALSDRIILIDHGII IAEGTANELKHRAGDTFCEIVPRDLKDLDAIVAALGSLLPEHHRAMLTPDSDRITMPA PDGIRMLVEAARRIDEARIELADIALRRPSLDDVFLAMTTDPTESLTHLVSGSAR" CDS 3234191..3235060 /codon_start=1 /transl_table=11 /gene="drrB" /locus_tag="BQ2027_MB2962" /product="daunorubicin-dim-transport integral membrane protein abc transporter drrb" /note="Mb2962, drrB, len: 289 aa. Equivalent to Rv2937, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Probable drrB, daunorubicin-DIM-transport integral membrane protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q49935|DRRB|ML2351|L518_F1_9 DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (288 aa), FASTA scores: opt: 1252, E(): 5.3e-72, (64.0% identity in 289 aa overlap). Also similar to others e.g. Q9XCF8 DRRB PROTEIN from Mycobacterium avium (246 aa), FASTA scores: opt: 423, E(): 1.5e-19, (30.85% identity in 243 aa overlap); Q9S6H4 DAUNORUBICIN RESISTANCE PROTEIN B from Mycobacterium avium (246 aa), FASTA scores: opt: 420, E(): 2.3e-19, (30.85% identity in 243 aa overlap); P32011|DRRB_STRPE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Streptomyces peucetius (283 aa), FASTA scores: opt: 242, E(): 4.7e-08, (27.85% identity in 219 aa overlap); etc. Note that Rv293|drrB belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2962 detected using SWATH mass spectrometry. Mb2962 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2N5" /db_xref="InterPro:IPR000412" /db_xref="InterPro:IPR004377" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2N5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01583.1" /translation="MSGPAIDASPALTFNQSSASIQQRRLSTGRQMWVLYRRFAAPSL LNGEVLTTVGAPIIFMVGFYIPFAIPWNQFVGGASSGVASNLGQYITPLVTLQAVSFA AIGSGFRAATDSLLGVNRRFQSMPMAPLTPLLARVWVAVDRCFTGLVISLVCGYVIGF RFHRGALYIVGFCLLVIAIGAVLSFAADLVGTVTRNPDAMLPLLSLPILIFGLLSIGL MPLKLFPHWIHPFVRNQPISQFVAALRALAGDTTKTASQVSWPVMAPTLTWLFAFVVI LALSSTIVLARRP" CDS 3235057..3235887 /codon_start=1 /transl_table=11 /gene="drrC" /locus_tag="BQ2027_MB2963" /product="PROBABLE DAUNORUBICIN-DIM-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER DRRC" /note="Mb2963, drrC, len: 276 aa. Equivalent to Rv2938, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 276 aa overlap). Probable drrC, daunorubicin-DIM-transport integral membrane protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q9CB71|ML2350 PROBABLE ANTIBIOTIC RESISTANCE MEMBRANE PROTEIN from Mycobacterium leprae (276 aa), FASTA scores: opt: 1434, E(): 1.2e-81, (79.0% identity in 276 aa overlap); and Q49941|DRRC|L518_F3_76 PUTATIVE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (244 aa), FASTA scores: opt: 1194, E(): 8.3e-67, (76.85% identity in 242 aa overlap). Also similar to others e.g. Q9XCF9 DRRC PROTEIN from Mycobacterium avium (263 aa), FASTA scores: opt: 538, E(): 3.7e-26, (32.65% identity in 251 aa overlap); Q9S6H3 DAUNORUBICIN RESISTANCE PROTEIN C from Mycobacterium avium (263 aa), FASTA scores: opt: 533, E(): 7.6e-26, (32.25% identity in 251 aa overlap); P32011|DRRB_STRPE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Streptomyces peucetius (283 aa), FASTA scores: opt: 276, E(): 6.6e-10, (21.07% identity in 261 aa overlap); etc. Note that Rv2938|drrC belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2963 detected using SWATH mass spectrometry. Mb2963 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2N4" /db_xref="InterPro:IPR000412" /db_xref="InterPro:IPR004377" /db_xref="InterPro:IPR005943" /db_xref="InterPro:IPR013525" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2N4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01584.1" /translation="MITTTSQEIELAPTRLPGSQNAARLFVAQTLLQTNRLLTRWARD YITVIGAIVLPILFMVVLNIVLGNLAYVVTHDSGLYSIVPLIALGAAITGSTFVAIDL MRERSFGLLARLWVLPVHRASGLISRILANAIRTLVTTLVMLGTGVVLGFRFRQGLIP SLMWISVPVILGIAIAAMVTTVALYTAQTVVVEGVELVQAIAIFFSTGLVPLNSYPGW IQPFVAHQPVSYAIAAMRGFAMGGPVLSPMIGMLVWTAGICVVCAVPLAIGYRRASTH " CDS 3235934..3237202 /codon_start=1 /transl_table=11 /gene="papA5" /locus_tag="BQ2027_MB2964" /product="POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA5" /note="Mb2964, papA5, len: 422 aa. Equivalent to Rv2939, len: 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 422 aa overlap). Possible papA5, conserved polyketide synthase (PKS) associated protein (see first citation below), equivalent to Q49939 HYPOTHETICAL 45.6 KDA PROTEIN from Mycobacterium leprae (423 aa), FASTA scores: opt: 2398, E(): 4.5e-144, (84.05% identity in 426 aa overlap); and Q02279|YMA3_MYCBO HYPOTHETICAL 38.1 KDA PROTEIN from Mycobacterium bovis (354 aa), FASTA scores: opt: 2193, E(): 3.6e-131, (97.4% identity in 343 aa overlap). And C-terminus highly similar to to Q9S381 HYPOTHETICAL 5.0 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (44 aa), FASTA scores: opt: 275, E(): 1.4e-10, (88.65% identity in 44 aa overlap). Also similar in part to various synthetases e.g. Q9AE01|RIF20 RIF20 PROTEIN from Amycolatopsis mediterranei (Nocardia mediterranei) (403 aa), FASTA scores: opt: 282, E(): 2.7e-10, (30.3% identity in 393 aa overlap); middle part of Q00869|ESYN1 ENNIATIN SYNTHETASE (FRAGMENT) (N-methyl peptide synthetase) from Fusarium equiseti (3131 aa), FASTA scores: opt: 180, E(): 0.0036, (26.85% identity in 242 aa overlap); N-terminus of Q9FB18 PEPTIDE SYNTHETASE NRPS2-1 from Streptomyces verticillus (2626 aa), FASTA scores: opt: 159, E(): 0.068, (23.65% identity in 351 aa overlap); etc. Note that Rv2939|papA5 belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly). Protein product from Mb2964 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2964 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q02279" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR031641" /db_xref="UniProtKB/Swiss-Prot:Q02279" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01585.1" /translation="MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFD ALLETHPVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQSV SLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTGDPGPITPQPT PLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPAVLAHPGLPQAVPVTRLWL SKQQTSDLMAFGREHRLSLNAVVAAAILLTEWQLRNTPHVPIPYVYPVDLRFVLAPPV APTEATNLLGAASYLAEIGPNTDIVDLASDIVATLRADLANGVIQQSGLHFGTAFEGT PPGLPPLVFCTDATSFPTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEH HGHIAEPGKSLEAIRSLLCTVPSEYGWIME" CDS complement(3237365..3243700) /codon_start=1 /transl_table=11 /gene="mas" /locus_tag="BQ2027_MB2965C" /product="PROBABLE MULTIFUNCTIONAL MYCOCEROSIC ACID SYNTHASE MEMBRANE-ASSOCIATED MAS" /note="Mb2965c, mas, len: 2111 aa. Equivalent to Rv2940c, len: 2111 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 2111 aa overlap). Probable mas, mycocerosic acid synthase membrane associated, multifunctional enzyme (see citations below), almost identical to Q02251|MCAS_MYCBO|MAS MYCOCEROSIC ACID SYNTHASE from Mycobacterium bovis (2110 aa), FASTA scores: opt: 13226, E(): 0, (95.8% identity in 2115 aa overlap) (see first citation below); and equivalent to Q9CD78|MAS|ML0139 PUTATIVE MYCOCEROSIC SYNTHASE from Mycobacterium leprae (2116 aa), FASTA scores: opt: 12142, E(): 0, (87.95% identity in 2119 aa overlap); and Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE from Mycobacterium leprae (2118 aa), FASTA scores: opt: 8421, E(): 0, (60.8% identity in 2127 aa overlap). Also similar to other synthases e.g. C-terminus of Q9L8C7|EPOC POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 4332, E(): 0, (40.85% identity in 2149 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. O53901|PKS5|Rv1527c|MTV045.01c|MTCY19G5.01 POLYKETIDE SYNTHASE (2108 aa), FASTA scores: opt: 5059, E(): 0, (65.9% identity in 2121 aa overlap); etc. Contains several domains, organized in the following order: beta-ketoacyl synthase (PS00606), acyl transferase, dehydratase-enoyl reductase, beta-ketoreductase, acyl carrier protein. Contains PS00012 Phosphopantetheine attachment site. Protein product from Mb2965c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2965c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q02251" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/Swiss-Prot:Q02251" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01586.1" /translation="MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPD RWDADDYYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLETS WEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTGLNNSVASGRI AHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLALAGGCAVLLEPHACVAASA QGMLSSTGRCHSFDADADGFVRSEGCAMVLLKRLPDALRDGNRIFAVVRGTATNQDGR TETLTMPSEDAQVAVYRAALAAAGVQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAG TPCALGSAKSNMGHSTASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFV PQAVTPWPNGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAVVAANLPEL VEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQLLASEPVFAATIAKLE PVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAVQVALAATMEQTYGVRPGAVVGHSM GESAAAVVAGALSLEDAARVICRRSKLMTRIAGAGAMGSVELPAKQVNSELMARGIDD VVVSVVASPQSTVIGGTSDTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAA ALADIAPMTPKVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVF AELSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGAALDYSA LYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLGSHVRLTEEPERHVW QGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAAAEVFGEAAEVRDITFEQMLLLD EQTPIDAVASIDAPGVVNFTVETNRDGETTRHATAALRAAEDDCPPPGYDITALLQAH PHAVNGTAMRESFAERGVTLGAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYR IHPALLDACFQSVGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDG TRGGEADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRALPEVG DGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWSVQDTPPNDQAGL EKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRHLVRITRELAEFEGELPRLFV VTRQAQIVKPHDSGERANLEQAGLRGLLRVISSEHPMLRTTLIDVDEHTDVERVAQQL LSGSEEDETAWRNGDWYVARLTPSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEF VASDRVPPGPGQIEVAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEG VTGHQVGDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGLND LAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLRDMGVEHVYDS RSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFGGRFVEIGKADVYGNTRLG LFPFRRGLTFYYLDLALMSVTQPDRVRELLATVFKLTADGVLTAPQCTHYPLAEAADA IRAMSNAEHTGKLVLDVPRSGRRSVAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKL AAAGCGRIVLTARSQPNPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATG LPLRGVLHSAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFLAEGGEIMI TPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGEMFASTGQRSRGPSKFR MELLSLPQDEWAGRLRRLLVEQASVILRRTIDADRSFIEYGLDSLGMLEMRTHVETET GIRLTPKVIATNNTARALAQYLADTLAEEQAAAPAAS" CDS 3244320..3246062 /codon_start=1 /transl_table=11 /gene="fadD28" /locus_tag="BQ2027_MB2966" /standard_name="acoas" /product="fatty-acid-amp ligase fadd28 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb2966, fadD28, len: 580 aa. Equivalent to Rv2941, len: 580 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 580 aa overlap). fadD28 (alternate gene name: acoas), fatty-acid-CoA synthetase (EC 6.2.1.-) (see citations below), almost identical to P71495 ACYL-COA SYNTHASE from Mycobacterium bovis (582 aa), FASTA scores: opt: 3828, E(): 0, (99.15% identity in 580 aa overlap); and equivalent to Q9CD79|FADD28|ML0138 ACYL-COA SYNTHETASE from Mycobacterium leprae (579 aa), FASTA scores: opt: 3183, E(): 8.8e-186, (81.9% identity in 580 aa overlap). And also highly similar to others Mycobacteria proteins e.g. O07797|FADD23|Rv3826|MTCY409.04c PUTATIVE FATTY-ACID-COA SYNTHETASE from Mycobacterium tuberculosis (584 aa); etc. Contains PS00018 EF-hand calcium-binding domain. Note that Rv2941|fadD28 and Rv2942|mmpL7 are transcriptionally coupled (proven experimentaly). Protein product from Mb2966 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2966 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q02278" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q02278" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01587.1" /translation="MIVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQL YRRTLNVARELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVTDE RSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLDAPNGYTFKED EYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYFADTDGIPPPNSALVSWLP FYHDMGLVIGICAPILGGYPAVLTSPVSFLQRPARWMHLMASDFHAFSAAPNFAFELA ARRTTDDDMAGRDLGNILTILSGSERVQAATIKRFADRFARFNLQERVIRPSYGLAEA TVYVATSKPGQPPETVDFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTC IECPDGTVGEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRSTEKLVAIIE LKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAPGSIPITTSGKVRRGAC VEQYRQDQFARLDA" mobile_element 3244939..3247151 /mobile_element_type="insertion sequence:IS1533" /locus_tag="BQ2027_IS1533" /note="IS1533, len: 2213 nt. Equivalent to IS1533, len: 2042 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 2042 nt overlap). Minimum region corresponding to IS1533" CDS 3246055..3248817 /codon_start=1 /transl_table=11 /gene="mmpL7" /locus_tag="BQ2027_MB2967" /product="CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN MMPL7" /note="Mb2967, mmpL7, len: 920 aa. Equivalent to Rv2942, len: 920 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 920 aa overlap). mmpL7, conserved transmembrane transport protein (see citations below), member of RND superfamily, highly similar to Q9XB10 HYPOTHETICAL 99.5 KDA PROTEIN from Mycobacterium bovis BCG (945 aa), FASTA scores: opt: 488, E(): 4.9e-20, (29.5% identity in 918 aa overlap); and to others from Mycobacteria e.g. O53735|MML4_MYCTU from Mycobacterium tuberculosis (945 aa), FASTA scores: opt: 481, E(): 1.2e-19, (25.9% identity in 922 aa overlap); etc. Also similar to other membrane proteins e.g. O54101|MMLB_STRCO|SC10A5.10c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (847 aa), FASTA scores: opt: 256, E(): 7.2e-07, (25.15% identity in 545 aa overlap); etc. Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, PS00079 Multicopper oxidases signature 1, and PS00044 Bacterial regulatory proteins, lysR family signature. BELONGS TO THE MMPL FAMILY. Note that Rv2941|fadD28 and Rv2942|mmpL7 are transcriptionally coupled (proven experimentaly). Protein product from Mb2967 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2967 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65371" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/Swiss-Prot:P65371" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01588.1" /translation="MPSPAGRLHRIRYIRLKKSSPDCRATITSGSADGQRRSPRLTNL LVVAAWVAAAVIANLLLTFTQAEPHDTSPALLPQDAKTAAATSRIAQAFPGTGSNAIA YLVVEGGSTLEPQDQPYYDAAVGALRADTRHVGSVLDWWSDPVTAPLGTSPDGRSATA MVWLRGEAGTTQAAESLDAVRSVLRQLPPSEGLRASIVVPAITNDMPMQITAWQSATI VTVAAVIAVLLLLRARLSVRAAAIVLLTADLSLAVAWPLAAVVRGHDWGTDSVFSWTL AAVLTIGTITAATMLAARLGSDAGHSAAPTYRDSLPAFALPGACVAIFTGPLLLARTP ALHGVGTAGLGVFVALAASLTVLPALIALAGASRQLPAPTTGAGWTGRLSLPVSSASA LGTAAVLAICMLPIIGMRWGVAENPTRQGGAQVLPGNALPDVVVIKSARDLRDPAALI AINQVSHRLVEVPGVRKVESAAWPAGVPWTDASLSSAAGRLADQLGQQAGSFVPAVTA IKSMKSIIEQMSGAVDQLDSTVNVTLAGARQAQQYLDPMLAAARNLKNKTTELSEYLE TIHTWIVGFTNCPDDVLCTAMRKVIEPYDIVVTGMNELSTGADRISAISTQTMSALSS APRMVAQMRSALAQVRSFVPKLETTIQDAMPQIAQASAMLKNLSADFADTGEGGFHLS RKDLADPSYRHVRESMFSSDGTATRLFLYSDGQLDLAAAARAQQLEIAAGKAMKYGSL VDSQVTVGGAAQIAAAVRDALIHDAVLLAVILLTVVALASMWRGAVHGAAVGVGVLAS YLAALGVSIALWQHLLDRELNALVPLVSFAVLASCGVPYLVAGIKAGRIADEATGARS KGAVSGRGAVAPLAALGGVFGAGLVLVSGGSFSVLSQIGTVVVLGLGVLITVQRAWLP TTPGRR" repeat_region 3249346..3249350 /rpt_type=DIRECT /note="5 bp direct repeat, CCGTT, flanking IS element IS1533." repeat_region 3249351..3249404 /rpt_type=INVERTED /note="54 bp imperfect inverted repeat, IRL, TGTCGACGGCACGTGAAAACTGACCCCGGCGCGGCACCCGAATTTTGACCCCCT, flanking IS element IS1533." gene 3249351..3251563 /locus_tag="BQ2027_IS1533" CDS 3249449..3250690 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2968" /product="PROBABLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1533" /note="Mb2968, -, len: 413 aa. Equivalent to Rv2943, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 413 aa overlap). Probable transposase for insertion sequence IS1533, similar to other transposases e.g. P15025|ISTA_ECOLI ista protein (insertion sequence IS21) from Escherichia coli (390 aa), FASTA scores: opt: 268, E(): 5.1e-11, (24.1% identity in 378 aa overlap). Contains potential helix-turn-helix motif at aa 19-40 (Score 1611, +4.67 SD). Mb2968 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2M3" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2M3" /protein_id="SIU01589.1" /translation="MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQ PKYERAPQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRPVY LPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVCAYSRWLLAML LPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRGGRSELTTECQAFRGTLAA KVLICRPADPEAKGLIERAHDYLERSFLPGRVFASPADFNAQLGAWLALVNTRTRRAL GCAPTDRIGADRAAMLSLPPVAPATGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVL VRADLERVHVFCDGELVADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQV QVRSLSDYDDALGVDIDGGVA" CDS 3250690..3251220 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2969" /product="POSSIBLE TRANSPOSASE" /note="Mb2969, -, len: 176 aa. Equivalent to Rv2943A, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Possible transposase, similar to many e.g. AJ238712|MBO238712_2 PUTATIVE TRANSPOSASE (IS21-l) from Mycobacterium bovis BCG (266 aa), FASTA scores: opt: 762, E(): 0, (100.0% identity in 118 aa overlap). Possible frameshift after codon 118 i.e. near position 3290056, to fuse with Rv2944. Mb2969 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2N8" /db_xref="InterPro:IPR002611" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2N8" /protein_id="SIU01590.1" /translation="MPTTKATQRRDVSTEIAYLTRALKAPTLRESVSRLADRARAENW SHEEYLAACLQREVSARESHGGEGRIRAARFPARKSLEEFDFEHARGLKRDTIAHLGT LDFITARDNVVFLGPAWHREDSSCGRPGDTRVSGRSSGAVRHRRRMGSTARRGSPRRA HLRRTHPALPLSAPGG" repeat_region complement(3251510..3251563) /rpt_type=INVERTED /note="54 bp imperfect inverted repeat, IRR, TGTCAACGGCACCCGAAAACTGACCCCCTGACGGCATCTGAAAATTGACCCCCT, flanking IS element IS1533." repeat_region 3251564..3251568 /rpt_type=DIRECT /note="5 bp direct repeat, CCGTT, flanking IS element IS1533." CDS complement(3251609..3252310) /codon_start=1 /transl_table=11 /gene="lppX" /locus_tag="BQ2027_MB2970C" /product="PROBABLE CONSERVED LIPOPROTEIN LPPX" /note="Mb2970c, lppX, len: 233 aa. Equivalent to Rv2945c, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 233 aa overlap). Probable lppX, conserved lipoprotein, equivalent to Q9CD80 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (233 aa), FASTA scores: opt: 1165, E(): 2.1e-65, (76.4% identity in 233 aa overlap); and similar to Q9CCP6|ML0557 from Mycobacterium leprae (238 aa), FASTA scores: opt: 338, E(): 7.4e-14, (30.75% identity in 231 aa overlap). Also similar to others from Mycobacterium tuberculosis e.g. P71679|LPRG_MYCTU LIPOPROTEIN (236 aa), FASTA scores: opt: 342, E(): 4.1e-14, (32.05% identity in 231 aa overlap); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and has in its N-terminal a signal peptide. BELONGS TO THE LPPX/LPRAFG FAMILY OF LIPOPROTEINS. Protein product from Mb2970c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2970c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65307" /db_xref="InterPro:IPR009830" /db_xref="InterPro:IPR029046" /db_xref="UniProtKB/Swiss-Prot:P65307" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01591.1" /translation="MNDGKRAVTSAVLVVLGACLALWLSGCSSPKPDAEEQGVPVSPT ASDPALLAEIRQSLDATKGLTSVHVAVRTTGKVDSLLGITSADVDVRANPLAAKGVCT YNDEQGVPFRVQGDNISVKLFDDWSNLGSISELSTSRVLDPAAGVTQLLSGVTNLQAQ GTEVIDGISTTKITGTIPASSVKMLDPGAKSARPATVWIAQDGSHHLVRASIDLGSGS IQLTQSKWNEPVNVD" CDS complement(3252332..3252553) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2970CA" /note="unnamed protein product; Mb2970cA, len: 73 aa. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions. Mb2970cA transcript and transcriptional start site identified in Mycobacterium bovis strain AF2122/97 grown under exponential conditions, also found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2M2" /protein_id="SIU01592.1" /translation="MAGPGDGERRNGASEEAGNLAGPGDGERRNGASEEAGNLAGPGD GERRNGASEEAGSHAQRDPINLHSACGPI" CDS complement(3252602..3258940) /codon_start=1 /transl_table=11 /gene="pks1" /locus_tag="BQ2027_MB2971C" /product="probable polyketide synthase pks15" /note="Mb2971c, pks1, len: 2112 aa. Equivalent to Rv2947c and Rv2946c, len: 496 aa and 1616 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 488 aa overlap and 99.9% identity in 1616 aa overlap). Rv2947c: Probable pks15, polyketide synthase. Almost identical to G560508|Q50469 PKS002B protein from Mycobacterium tuberculosis (495 aa), FASTA scores: opt: 3270, E(): 0, (99.6% identity in 496 a a overlap). Similar to Mycobacterium tuberculosis proteins MTCY338.20|RV2931|PPSA_MYCTU ppsA phenolpthiocerol synthesis (1876 aa) (49.9% identity in 465 aa overlap); MTCY24G1.09|RV2940C|P96291 Putative mas, mycocerosic acid synthase (2111 aa) (50.2% identity in 454 aa overlap); and MTCY22H8.03|RV2382C|P71718 hypothetical protein (444 aa) (47.6% identity in 437 aa overlap). Contains PS00606 Beta-ketoacyl synthases active site. Rv2946c: Probable pks1, polyketide synthase, similar to many e.g. ML035|AL583917|Q9CD81 putative polyketide synthase from Mycobacterium leprae (2103 aa), Fasta scores: opt: 8761, E(): 0, (82.6% identity in 1620 aa overlap); etc. Almost identical in part to G560507|Q50470 PKS002C protein from Mycobacterium tuberculosis (fragment) (950 aa), Fasta scores: opt: 5685, E(): 0, (95.3% identity in 927 aa overlap). Also similar to Mycobacterium tuberculosis polyketide synthases pks7|Rv1661|P94996 (2126 aa) (54.6% identity in 1632 aa); pks12|Rv2048c|O53490 (4151 aa) (58.0% identity in 1606 aa); pks8|rv1662|O65933 (1602 aa) (59.7% identity in 1144 aa). Contains a PS00012 Phosphopantetheine attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, pks1 and pks15 exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-g) results in a single product that is more similar to pks1. Protein product from Mb2971c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2971c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXK8" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR015083" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036299" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/Swiss-Prot:Q7TXK8" /protein_id="SIU01593.1" /translation="MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQR ATEPVAVVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDAEG KTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEHAGIDPLSLRG SATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVSYVLGLQGPAVSVDTACSS SLVAIHWAMSSLRSGECDLALAGGVTVMGLPSIFVGFSRQRGLAADGRCKAFAAAADG TGWGEGAGVVVLERLSDARRLGHSVLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQAA LANAGLSAADVDVVEAHGTATTLGDPIEAQALLSTYGQGRPAEQPLWVGSIKSNMGHT QAAAGVAGVIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGPRPSMVPWVISARSAEALTA QAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRAVVVGASREQLIAGLAGLAAGEPGA GVAVGQPGSVGKTVVVFPGQGAQRIGMGRELYGELPVFAQAFDAVADELDRHLRLPLR DVIWGADADLLDSTEFAQPALFAVEVASFAVLRDWGVLPDFVMGHSVGELAAAHAAGV LTLADAAMLVVARGRLMQALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPESVVIS GAQAAANAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQAREPQLGLV SNVTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFIEAGPGSGLTGSIEQ SLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVPVQWSAVFAGSGGRRVQLPTYAF QRRRFWETPGADGPADAAGLGLGATEHALLGAVVERPDSDEVVLTGRLSLADQPWLAD HVVNGVVLFPGAGFVELVIRAGDEVGCALIEELVLAAPLVMHPGVGVQVQVVVGAADE SGHRAVSVYSRGDQSQGWLLNAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLA ERGYAYGPAFQGLVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLDAVLHALGLAV EKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCDATGLPVLTVRSL VTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSGGANGSAPPAPVSWADFCAGS DGDASVVVWELESAGGQASSVVGSVYAATHTALEVLQSWLGADRAATLVVLTHGGVGL AGEDISDLAAAAVWGMARSAQAENPGRIVLIDTDAAVDASVLAGVGEPQLLVRGGTVH APRLSPAPALLALPAAESAWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVG VNFRDVVAALGMYPGQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAVVD QQLVTRVPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESVLIHAGTGGVGMAAVQL ARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEEKFLAVTEGRGVDVVLDSL AGEFVDASLRLLVRGGRFLEMGKTDIRDAQEIAANYPGVQYRAFDLSEAGPARMQEML AEVRELFDTRELHRLPVTTWDVRCAPAAFRFMSQARHIGKVVLTMPSALADRLADGTV VITGATGAVGGVLARHLVGAYGVRHLVLASRRGDRAEGAAELAADLTEAGAKGQVVAC DVADRAAVAGLFAQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWN LHQATSDLDLSMFVLCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGLAGISLAWG LWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVELFDAALAIDHPLAVATLLDRAAL DARAQAGALPALFSGLARRPRRRQIDDTGDATSSKSALAQRLHGLAADEQLELLVGLV CLQAAAVLGRPSAEDVDPDTEFGDLGFDSLTAVELRNRLKTATGLTLPPTVIFDHPTP TAVAEYVAQQMSGSRPTESGDPTSQVVEPAAAEVSVHA" CDS complement(3258937..3261054) /codon_start=1 /transl_table=11 /gene="fadD22" /locus_tag="BQ2027_MB2972C" /product="p-hydroxybenzoyl-amp ligase fadd22" /note="Mb2972c, fadD22, len: 705 aa. Equivalent to Rv2948c, len: 705 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 705 aa overlap). Probable fadD22, fatty-acid-CoA synthetase (EC 6.2.1.-). Highly similar to many e.g. Q9CD82|ML0134 putative acyl-CoA synthetase from Mycobacterium leprae (707 aa), fasta scores: opt: 3554, E(): 6.4e-209, (75.9% identity in 705 aa overlap). Almost identical to G560509|Q50468 PKS002A protein from Mycobacterium tuberculosis (705 aa), fasta scores: opt: 4647, E(): 0, (99.7% identity in 705 aa overlap). Protein product from Mb2972c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2972c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXK7" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TXK7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01594.1" /translation="MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLG EVLRNRGLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARNTE PALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYATYTSGTTGPP KAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAYGLGNSVWFPLATGGSAVI NSAPVTPEAAAILSARFGPSVLYGVPNFFARVIDSCSPDSFRSLRCVVSAGEALELGL AERLMEFFGGIPILDGIGSTEVGQTFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTA GPGVEGDLWVRGPAIAKGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEV IGGVNVDPREVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSLTEPGSGVR AQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAVCAEAAKMLGEPDPWSV DQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPETVGWDYGSISGLAQYLEAELAGGHG RLKSAGPVNSGATGLWAIEEQLNKVEELVAVIADGEKQRVADRLRALLGTIAGSEAGL GKLIQAASTPDEIFQLIDSELGK" CDS complement(3261071..3261670) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2973C" /product="chorismate pyruvate lyase" /note="Mb2973c, -, len: 199 aa. Equivalent to Rv2949c, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Conserved hypothetical protein, equivalent to Q9CD83|ML0133 HYPOTHETICAL PROTEIN from Mycobacterium leprae (210 aa), FASTA scores: opt: 797, E(): 7.4e-47, (62.55% identity in 195 aa overlap). Equivalent to AAK47348 from Mycobacterium tuberculosis strain CDC1551 (212 aa) but shorter 13 aa. Protein product from Mb2973c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2973c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXK6" /db_xref="InterPro:IPR002800" /db_xref="InterPro:IPR028978" /db_xref="UniProtKB/Swiss-Prot:Q7TXK6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01595.1" /translation="MTECFLSDQEIRKLNRDLRILIAANGTLTRVLNIVADDEVIVQI VKQRIHDVSPKLSEFEQLGQVGVGRVLQRYIILKGRNSEHLFVAAESLIAIDRLPAAI ITRLTQTNDPLGEVMAASHIETFKEEAKVWVGDLPGWLALHGYQNSRKRAVARRYRVI SGGQPIMVVTEHFLRSVFRDAPHEEPDRWQFSNAITLAR" CDS complement(3261696..3263555) /codon_start=1 /transl_table=11 /gene="fadD29" /locus_tag="BQ2027_MB2974C" /product="fatty-acid-amp ligase fadd29 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb2974c, fadD29, len: 619 aa. Equivalent to Rv2950c, len: 619 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 619 aa overlap). Probable fadD29, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to various mycobacterial enzymes believed to be involved in polyketide or fatty acid synthesis. Equivalent (but shorter 61 aa) to Q9CD84 from Mycobacterium leprae (680 aa), FASTA scores: opt: 3280, E(): 2.2e-192, (80.15% identity in 620 aa overlap); and highly similar to others from Mycobacterium leprae e.g. Q9Z5K5 PROBABLE ACYL-COA SYNTHASE (583 aa), FASTA scores: opt: 2358, E(): 3.4e-136, (62.35% identity in 579 aa overlap). Also similar to others from Mycobacterium tuberculosis e.g. Q10976|FD26_MYCTU PUTATIVE FATTY-ACID--CoA LIGASE (583 aa), FASTA scores: opt: 2416, E(): 1e-139, (63.15% identity in 581 aa overlap) (N-terminus shorter); etc. Equivalent to AAK47349 from Mycobacterium tuberculosis strain CDC1551 (582 aa) but longer 37 aa. Protein product from Mb2974c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2974c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXK5" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TXK5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01596.1" /translation="MKTNSSFHAAGEVATQPAWGTGEQAAQPLNGSTSRFAMSESSLA DLLQKAASQYPNRAAYKFIDYDTDPAGFTETVTWWQVHRRAMIVAEELWIYASSGDRV AILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIHDERISSALRDSAPSIILTTSSVI DEVTTYAPHACAAQGQSAPIVVAVDALDLSSSRALDPTRFERPSTAYLQYTSGSTRAP AGVVLSHKNVITNCVQLMSDYIGDSEKVPSTPVSWLPFYHDMGLMLGIILPMINQDTA VLMSPMAFLQRPARWMQLLAKHRAQISSAPNFGFELAVRRTSDDDMAGLDLGHVRTIV TGAERVNVATLRRFTERFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSVCFDYQ QLSVGQAKRTENGSEGANLVSYGAPRASTVRIVDPETRMENPAGTVGEIWVQGDNVGL GYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGVIFEGELFITGRIKELLVVDGA NHYPEDIEATIQEITGGRVVAIAVPDDRTEKLVTIIELMKRGRTDEEEKNRLRTVKRE VASAISRSHRLRVADVVMVAPGSIPVTTSGKVRRSASVERYLHHEFSRLDAMA" CDS complement(3264203..3265348) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2975C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb2975c, -, len: 381 aa. Equivalent to Rv2951c, len: 381 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 381 aa overlap). Possible oxidoreductase (EC 1.-.-.-), equivalent to Q9CD85 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (382 aa), FASTA scores: opt: 2225, E(): 7.6e-134, (84.8% identity in 382 aa overlap); and similar to O30260 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (363 aa), FASTA scores: opt: 652, E(): 6.1e-34, (32.55% identity in 344 aa overlap). Also similar to various oxidoreductases e.g. O29071|AF1196 N5,N10-METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Archaeoglobus fulgidus (348 aa), FASTA scores: opt: 381, E(): 9.7e-17, (27.7% identity in 354 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 372, E(): 3.5e-16, (30.85% identity in 295 aa overlap); Q9UXP0 PUTATIVE F420-DEPENDENT N5,N10-METHYLENE-TETRAHYDROMETHANOPTERIN REDUCTASE from Methanolobus tindarius (326 aa), FASTA scores: opt: 343, E(): 2.4e-14, (27.4% identity in 314 aa overlap); etc. Protein product from Mb2975c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2975c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXK4" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/Swiss-Prot:Q7TXK4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01597.1" /translation="MGGLRFGFVDALVHSRLPPTLPARSSMAAATVMGADSYWVGDHL NALVPRSIATSEYLGIAAKFVPKIDANYEPWTMLGNLAFGLPSRLRLGVCVTDAGRRN PAVTAQAAATLHLLTRGRAILGIGVGEREGNEPYGVEWTKPVARFEEALATIRALWNS NGELISRESPYFPLHNALFDLPPYRGKWPEIWVAAHGPRMLRATGRYADAWIPIVVVR PSDYSRALEAVRSAASDAGRDPMSITPAAVRGIITGRNRDDVEEALESVVVKMTALGV PGEAWARHGVEHPMGADFSGVQDIIPQTMDKQTVLSYAAKVPAALMKEVVFSGTPDEV IDQVAEWRDHGLRYVVLINGSLVNPSLRKTVTAVLPHAKVLRGLKKL" CDS 3265541..3266353 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2976" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb2976, -, len: 270 aa. Equivalent to Rv2952, len: 270 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 270 aa overlap). Probable methyltransferase (EC 2.1.1.-), equivalent to Q9CD86|ML0130 HYPOTHETICAL PROTEIN from Mycobacterium leprae (270 aa), FASTA scores: opt: 1584, E(): 6.1e-99, (83.7% identity in 270 aa overlap). Also highly similar to Q9RMN9|MTF2 PUTATIVE METHYLTRANSFERASE from Mycobacterium smegmatis (274 aa), FASTA scores: opt: 902, E(): 3.8e-53, (56.35% identity in 252 aa overlap). Also similar to other methyltransferases e.g. Q9ADL4|SORM O-METHYLTRANSFERASE from Polyangium cellulosum (346 aa), FASTA scores: opt: 390, E(): 1.1e-18, (36.25% identity in 251 aa overlap); Q54303|RAPM METHYLTRANSFERASE from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 315, E(): 1.1e-13, (40.75% identity in 135 aa overlap); etc. Very similar to C-terminal part of Q50584|Rv1523|MTCY19G5.05c HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium tuberculosis (358 aa), FASTA score: opt: 965, E(): 2.7e-57, (60.3% identity in 247 aa overlap). Protein product from Mb2976 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2976 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXK3" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TXK3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01598.1" /translation="MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINW AYEEDPPMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTRTL HPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVLNVEASHCYPH FRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATPLRQLSQRQINAEVLRGIG NNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQLSRYLEGGELSYRMYCFTKD" CDS 3266379..3267635 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2977" /product="enoyl reductase" /note="Mb2977, -, len: 418 aa. Equivalent to Rv2953, len: 418 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 418 aa overlap). Conserved hypothetical protein, equivalent to Q9CD87|ML0129 HYPOTHETICAL PROTEIN from Mycobacterium leprae (418 aa), FASTA scores: opt: 2357, E(): 2.7e-143, (86.6% identity in 418 aa overlap). Also highly similar to Q9X7N5|SC5F2A.12c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (396 aa), FASTA scores: opt: 491, E(): 7e-24, (38.35% identity in 417 aa overlap); and similar to other hypothetical proteins e.g. Q9VG81 CG5167 PROTEIN from Drosophila melanogaster (Fruit fly) (431 aa), FASTA scores: opt: 393, E(): 1.4e-17, (26.55% identity in 433 aa overlap); Q9GZE9|F22F7.1 HYPOTHETICAL PROTEIN from Caenorhabditis elegans (426 aa), FASTA scores: opt: 338, E(): 4.6e-14, (27.05% identity in 425 aa overlap); P73855|SLL1601 HYPOTHETICAL 44.8 KDA PROTEIN from Synechocystis sp. (strain PCC 6803) (414 aa), FASTA scores: opt: 565, E(): 1.3e-28, (35.7% identity in 409 aa overlap); etc. Also highly similar to other proteins from Mycobacterium tuberculosis e.g. RV2449C|O53176|MTV008.05C HYPOTHETICAL 44.4 KDA PROTEIN (419 aa), FASTA scores: opt: 1835, E(): 7e-110, (67.55% identity in 419 aa overlap); etc. Protein product from Mb2977 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2977 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXK2" /db_xref="InterPro:IPR005097" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7TXK2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01599.1" /translation="MSPAEREFDIVLYGATGFSGKLTAEHLAHSGSTARIALAGRSSE RLRGVRMMLGPNAADWPLILADASQPLTLEAMAARAQVVLTTVGPYTRYGLPLVAACA KAGTDYADLTGELMFCRNSIDLYHKQAADTGARIILACGFDSIPSDLNVYQLYRRSVE DGTGELCDTDLVLRSFSQRWVSGGSVATYSEAMRTASSDPEARRLVTDPYTLTTDRGA EPELGAQPDFLRRPGRDLAPELAGFWTGGFVQAPFNTRIVRRSNALQEWAYGRRFRYS ETMSLGKSMAAPILAAAVTGTVAGTIGLGNKYFDRLPRRLVERVTPKPGTGPSRKTQE RGHYTFETYTTTTTGARYRATFAHNVDAYKSTAVLLAQSGLALALDRDRLAELRGVLT PAAAMGDALLARLPGAGVVMGTTRLS" CDS complement(3267766..3268491) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2978C" /product="SAM-dependent methyltransferases" /note="Mb2978c, -, len: 241 aa. Equivalent to Rv2954c, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Hypothetical unknown protein. Equivalent to AAK47354 from Mycobacterium tuberculosis strain CDC1551 (199 aa) but longer 42 aa. Protein product from Mb2978c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2978c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Q0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01600.1" /translation="MRLPGMLRPTAERHFHSIFYLRHNARRQEHLATLGLDLGNKSVL EVGAGIGDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHLDLDGDLPAEA HQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVSYSGEDEPFLVSERASSPS QAITGTGCRPSRVWVMNRLREKMPHVYVTATQPRHRQFPLDWRANGPIASTGLARAVF VASRAPLNLPTLVEELPMVQRRC" CDS complement(3268680..3269645) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2979C" /product="methyltransferase, FkbM family domain protein" /note="Mb2979c, -, len: 321 aa. Equivalent to Rv2955c, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 321 aa overlap). Conserved hypothetical protein, similar to others e.g. Q98NV5|MLL9724 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (284 aa), FASTA scores: opt: 231, E(): 6.5e-08, (34.6% identity in 182 aa overlap); Q9AGG2|NLPE1 NLPE1 from Rhizobium etli (249 aa), FASTA scores: opt: 212, E(): 1.1e-06, (27.85% identity in 255 aa overlap); Q9KXY2 HYPOTHETICAL 31.3 KDA PROTEIN from Streptomyces coelicolor(291 aa), FASTA scores: opt: 211, E(): 1.4e-06, (30.9% identity in 249 aa overlap); etc. Protein product from Mb2979c detected using shotgun mass spectrometry. Mb2979c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006342" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2P3" /protein_id="SIU01601.1" /translation="MQFQDVRLMRVVVCRRLGPAKGQRRWHPLDLGTTGCFENLGAQR PTYRMRAIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPGSAIAW IVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWARLAPVVALEPAPGT HSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAADSAFSSLNDTGRIRIRERTRV PCTTLDALAAELPLPVGLLKIDVEGLERAVIAGAAELLRRDRPVLLVEIYGGAASNPD PERTIADIRAYGYEPFVYADDAGLQPYQRHRDDRYCYFFIPSRKG" CDS 3269768..3270499 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2980" /product="SAM-dependent methyltransferases" /note="Mb2980, -, len: 243 aa. Equivalent to Rv2956, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 243 aa overlap). Conserved hypothetical protein, highly similar to O86299|GSC GSC PROTEIN from Mycobacterium avium subsp. silvaticum Mycobacterium avium (240 aa), FASTA scores: opt: 1070, E(): 3.5e-63, (67.5% identity in 240 aa overlap); and O86294|GSC GSC PROTEIN from Mycobacterium paratuberculosis (240 aa), FASTA scores: opt: 1070, E(): 3.5e-63, (67.5% identity in 240 aa overlap). Also some similarity with other proteins from other organisms e.g. Q9L727 NODULATION PROTEIN NOEI from Rhizobium fredii (Sinorhizobium fredii) (241 aa), FASTA scores: opt: 205, E(): 3.5e-06, (27.25% identity in 198 aa overlap); Q9AGG1|LPEA LPEA PROTEIN from Rhizobium etli (286 aa), FASTA scores: opt: 201, E(): 7.2e-06, (28.85% identity in 208 aa overlap); P74191|SLL1173 HYPOTHETICAL 28.0 KDA PROTEIN Synechocystis sp. (strain PCC 6803) (244 aa), FASTA scores: opt: 274, E(): 1e-10, (30.65% identity in 225 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. P71792|RV1513|MTCY277.35 HYPOTHETICAL 26.7 KDA PROTEIN (243 aa), FASTA scores: opt: 1105, E(): 1.7e-65, (70.05% identity in 237 aa overlap); etc. Protein product from Mb2980 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2980 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006342" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2N6" /protein_id="SIU01602.1" /translation="MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVF DVGANSGQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSDGT VTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPEFLGMNGVAFL KVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGMLIPEALDLVYSLGFTLTG LLPCFIDANNGRMLQADGTFFREDD" CDS 3270570..3271397 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2981" /product="POSSIBLE GLYCOSYL TRANSFERASE" /note="Mb2981, -, len: 275 aa. Equivalent to Rv2957, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Possible glycosyl transferase (EC 2.4.1.-); possibly secreted protein. Highly similar to O88109|GSD|GTFD GSD PROTEIN from Mycobacterium avium subsp. silvaticum, Mycobacterium paratuberculosis, and Mycobacterium avium (266 aa), FASTA scores: opt: 1010, E(): 2.5e-62, (68.8% identity in 221 aa overlap). Also some similarity with other proteins and especially glycosyl transferases e.g. Q9AEE4 HYPOTHETICAL 31.4 KDA PROTEIN from Leptospira interrogans (265 aa), FASTA scores: opt: 371, E(): 3.3e-18, (34.43% identity in 212 aa overlap); Q9EXY4 PUTATIVE GLYCOSYL TRANSFERASE from Escherichia coli (248 aa), FASTA scores: opt: 339, E(): 5e-16, (32.4% identity in 210 aa overlap); Q9RCC4 GLYCOSYLTRANSFERASE-LIKE PROTEIN from Yersinia pestis (247 aa), FASTA scores: opt: 333, E(): 1.3e-15, (31.8% identity in 217 aa overlap); Q9EXY1 PUTATIVE GLYCOSYL TRANSFERASE from Escherichia coli (248 aa), FASTA scores: opt: 328, E(): 2.9e-15, (31.9% identity in 210 aa overlap); etc. Equivalent to AAK47357 from Mycobacterium tuberculosis strain CDC1551 (256 aa) but longer 19 aa. Protein product from Mb2981 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2981 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5A0" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P0A5A0" /protein_id="SIU01603.1" /translation="MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDS IARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLA TGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRL LFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASY NEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR" CDS complement(3271814..3271948) /codon_start=1 /transl_table=11 /gene="Mb2982cA" /product="possible glycosyl transferase" /note="Mb2982cA, -, len: 44 aa. Similar to 3' end of Rv2958c, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv. Possible glycosyl transferase (EC 2.4.1.-) REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift in due to a single base insertion (*-g) results in Rv2958c being split into two genes Mb2982c and Mb2982cA" /db_xref="GOA:A0A1R3Y2Q5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Q5" /protein_id="SIU01604.1" /translation="MAAAVKQVLSGAEFRQAARRLAEAFGPDFAGFPQHIESALRLVC " CDS complement(3272001..3273101) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2982C" /product="possible glycosyl transferase" /note="Mb2982c, -, len: 366 aa. Similar to 5' end of Rv2958c, len: 428 aa, from Mycobacterium tuberculosis strain H37Rv, (83.3% identity in 371 aa overlap). Possible glycosyl transferase (EC 2.4.1.-), highly similar to Q9CD88|ML0128 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (435 aa), FASTA scores: opt: 2116, E(): 5.8e-126, (75.05% identity in 417 aa overlap); and Q9CD91|ML0125 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2104, E(): 3.3e-125, (74.65% identity in 418 aa overlap). Also shows some similarity to variety of glycosyl transferases e.g. Q9RYI3 PUTATIVE GLYCOSYLTRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 317, E(): 1.9e-12, (31.0% identity in 297 aa overlap); Q9S1V2 PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (407 aa), FASTA scores: opt: 264, E(): 4.1e-09, (27.2% identity in 342 aa overlap); P72650|CRTX|SLR1125 ZEAXANTHIN GLUCOSYL TRANSFERASE from Synechocystis sp. strain PCC 6803 (419 aa), FASTA scores: opt: 251, E(): 2.8e-08, (26.8% identity in 295 aa overlap); etc. Very similar to P95130|MTCY349.25 from Mycobacterium tuberculosis (449 aa), FASTA score: opt: 2215, E(): 3.3e-132, (77.25% identity in 422 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base insertion (*-g) leads to a shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb2982c detected using SWATH mass spectrometry. Mb2982c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:Q7TXJ8" /protein_id="SIU01605.1" /translation="MEETSVAGDPGPDAGTSTAPNAAPEPVARRQRILFVGEAATLAH VVRPFVLARSLDPSRYEVHFACDPRFNKLLGPLPFPHHPIHTVPSEEVLLKIAQGRLF YNTRTLRKYIAADRKILNEIAPDVVVGDNRLSLSVSARLAGIPYIAIANAYWSPQARR RFPLPDVPWTRFFGVRPVSILYRLYRPLIFALYCLPLNWLRRKHGLSSLGWDLCRIFT DGDYTLYADVPELVPTYNLPANHRYLGPVLWSPDVKPPTWWHSLPTDRPIIYATLGSS GGKNLLQVVLNALGRFTRDGDRGHRWPEPPEERAGQRLRRGLPAGRSGCSALRRGALQ RRQPDDAAGVGGRGAGDRAPQQHGPALEHGGP" CDS complement(3273202..3273939) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2983C" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb2983c, -, len: 245 aa. Equivalent to Rv2959c, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 245 aa overlap). Possible methyltransferase (EC 2.1.1.-), highly similar to Q9CD89|ML0127 from Mycobacterium leprae (229 aa), FASTA scores: opt: 1183, E(): 3.9e-69, (76.1% identity in 226 aa overlap). Also some similarity with other methyltransferases and other proteins e.g. Q51079 PUTATIVE METHYL TRANSFERASE from Nocardia lactamdurans (236 aa), FASTA scores: opt: 156, E(): 0.0086, (23.25% identity in 159 aa overlap); Q98ID5 CEPHALOSPORIN HYDROXYLASE from Rhizobium loti (Mesorhizobium loti) (217 aa), FASTA scores: opt: 275, E(): 1.7e-10, (29.65% identity in 199 aa overlap); etc. And also similar to P72897 HYPOTHETICAL 27.8 KDA PROTEIN from Mycobacterium tuberculosis (249 aa), FASTA scores: opt: 292, E(): 1.5e-11, (31.25% identity in 208 aa overlap). Protein product from Mb2983c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2983c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXJ7" /db_xref="InterPro:IPR007072" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TXJ7" /protein_id="SIU01606.1" /translation="MGLVWRSRTSLVGQLIGLVRLVASFAAQLFYRPSDAVAEEYHKW YYGNLVWTKTTYMGINCWKSVSDMWNYQEILSELQPSLVIEFGTRYGGSAVYFANIMR QIGQPFKVLTVDNSHKALDPRARREPDVLFVESSSTDPAIAEQIQRLKNEYPGKIFAI LDSDHSMNHVLAEMKLLRPLLSAGDYLVVEDSNINGHPVLPGFGPGPYEAIEAYEDEF PNDYKHDAERENKFGWTSAPNGFLIRN" CDS complement(3274054..3274302) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2984C" /product="HYPOTHETICAL PROTEIN" /note="Mb2984c, -, len: 82 aa. Equivalent to Rv2960c, len: 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 82 aa overlap). Hypothetical unknown protein, equivalent to AAK47362 from Mycobacterium tuberculosis strain CDC1551 (116 aa) but shorter 34 aa. Shortened version of MTCY349.28 avoiding overlap. Mb2984c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2V7" /protein_id="SIU01607.1" /translation="MGRNATAVVSLPVVALSPRAGQAGYLWQSITRGLRVTPICCYHP PCGGGVQKMLSRKLGRVCPAPSPKDAARGAHNVGANAV" CDS 3274384..3274773 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2985" /product="PROBABLE TRANSPOSASE" /note="Mb2985, -, len: 129 aa. Equivalent to Rv2961, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Probable transposase, highly similar to C-terminus of O50414|Rv3387|MTV004.45 PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 605, E(): 7.2e-34, (66.65% identity in 129 aa overlap); and similar to others e.g. CAC47401 PUTATIVE PARTIAL TRANSPOSASE FOR ISRM17 PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (174 aa), FASTA scores: opt: 183, E(): 2.6e-05, (30.25% identity in 129 aa overlap); etc. Mb2985 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y340" /db_xref="InterPro:IPR002559" /db_xref="UniProtKB/TrEMBL:A0A1R3Y340" /protein_id="SIU01608.1" /translation="MEHGNPHDAPQLAPAVERITTRAGRPPGTVTADRGYGEKRVEDD LHDLGVRTVAIPRKGRPSQARRAEEQRPSFRRTVKWRTGSEGRISTLKRNYGWNRSCI DGTEGTRIWTRHGILTHNLIKISSLAA" CDS complement(3274874..3276223) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2986C" /product="POSSIBLE GLYCOSYL TRANSFERASE" /note="Mb2986c, -, len: 449 aa. Equivalent to Rv2962c, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Possible glycosyl transferase (EC 2.4.1.-), highly similar or identical to Mycobacterium tuberculosis proteins G560522 U0002JA, G560521 U0002H, G560522 U0002JA, G560519 U0002KA. Equivalent (but longer 21 aa) to Q9CD91 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2229, E(): 1.3e-133, (77.45% identity in 426 aa overlap); and highly similar to Q9CD88 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (435 aa), FASTA scores: opt: 2129, E(): 2.7e-127, (74.35% identity in 425 aa overlap); and others from Mycobacterium leprae. Also shows some similarity to variety of glycosyl transferases e.g. Q9RYI3|DRA0329 PUTATIVE GLYCOSYL TRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 340, E(): 5.5e-14, (31.2% identity in 330 aa overlap); P72650 ZEAXANTHIN GLUCOSYL TRANSFERASE from Synechocystis sp. (strain PCC 6803) (419 aa), FASTA scores: opt: 244, E(): 6.6e-08, (26.2% identity in 294 aa overlap); etc. Also highly similar to P95134 HYPOTHETICAL 46.8 KDA PROTEIN from Mycobacterium tuberculosis (428 aa), FASTA scores: opt: 2215, E(): 9.6e-133, (77.25% identity in 422 aa overlap). Protein product from Mb2986c detected using SWATH mass spectrometry. Mb2986c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXJ4" /db_xref="InterPro:IPR002213" /db_xref="UniProtKB/Swiss-Prot:Q7TXJ4" /protein_id="SIU01609.1" /translation="MRVSCVYATASRWGGPPVASEVRGDAAISTTPDAAPGLAARRRR ILFVAEAVTLAHVVRPFALAQSLDPSRYEVHFACDPRYNQLLGPLPFRHHAIHTIPSE RFFGNLTQGRFYAMRTLRKYVEADLRVLDEIAPDLVVGDLRISLSVSARLAGIPYIAI ANAYWSPYAQRRFPLPDVIWTRLFGVRLVKLLYRLERPLLFALQCMPLNWVRRRHGLS SLGWNLCRIFTDGDHTLYADVPELMPTYDLPANHEYLGPVLWSPAGKPPTWWDSLPTD RPIVYATLGTSGGRNLLQLVLNALAELPVTVIAATAGRSDLKTVPANAFVADYLPGEA AAARSAVVVCNGGSLTTQQALVAGVPVIGVAGNLDQHLNMEAVERAGAGVLLRTERLK SQRVAGAVMQVISRSEYRQAAARLADAFGRDRVGFPQHVENALRLMPENRPRTWLAS" CDS 3276337..3277557 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2987" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb2987, -, len: 406 aa. Equivalent to Rv2963, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Probable integral membrane protein. Protein product from Mb2987 detected using SWATH mass spectrometry. Mb2987 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2R0" /db_xref="InterPro:IPR005524" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R0" /protein_id="SIU01610.1" /translation="MTSTKVEDRVTAAVLGAIGHALALTASMTWEILWALILGFALSA VVQAVVRRSTIVTLLGDDRPRTLVIATGLGAASSSCSYAAVALARSLFRKGANFTAAM AFEIGSTNLVVELGIILALLMGWQFTAAEFVGGPIMILVLAVLFRLFVGARLIDAARE QAERGLAGSMEGHAAMDMSIKREGSFWRRLLSPPGFTSIAHVFVMEWLAILRDLILGL LIAGAIAAWVPESFWQSFFLANHPAWSAVWGPIIGPIVAIVSFVCSIGNVPLAAVLWN GGISFGGVIAFIFADLLILPILNIYRKYYGARMMLVLLGTFYASMVVAGYLIELLFGT TNLIPSQRSATVMTAEISWNYTTWLNVIFLVIAAALVVRFITSGGLPMLRMMGGSPDA PHDHHDRHDDHLGH" CDS 3277630..3278562 /codon_start=1 /transl_table=11 /gene="purU" /locus_tag="BQ2027_MB2988" /product="PROBABLE FORMYLTETRAHYDROFOLATE DEFORMYLASE PURU (FORMYL-FH(4) HYDROLASE)" /note="Mb2988, purU, len: 310 aa. Equivalent to Rv2964, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Probable purU, formyltetrahydrofolate deformylase (EC 3.5.1.10), highly similar to others e.g. Q9RWT1|DR0584 FORMYLTETRAHYDROFOLATE DEFORMYLASE from Deinococcus radiodurans (298 aa), FASTA scores: opt: 1005, E(): 4.9e-52, (52.25% identity in 297 aa overlap); Q9K7U4 FORMYLTETRAHYDROFOLATE DEFORMYLASE from Bacillus halodurans (289 aa), FASTA scores: opt: 982, E(): 1.1e-50, (51.8% identity in 280 aa overlap); Q55135|PURU_SYNY3|SLL0070 FORMYLTETRAHYDROFOLATE DEFORMYLASE from Synechocystis sp. strain PCC 6803 (284 aa), FASTA scores: opt: 839, E(): 2.9e-42, (48.2% identity in 280 aa overlap); etc. Protein product from Mb2988 detected using SWATH mass spectrometry. Mb2988 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5T7" /db_xref="InterPro:IPR002376" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR004810" /db_xref="InterPro:IPR036477" /db_xref="InterPro:IPR041729" /db_xref="UniProtKB/Swiss-Prot:P0A5T7" /protein_id="SIU01611.1" /translation="MGKGSMTAHATPNEPDYPPPPGGPPPPADIGRLLLRCHDRPGII AAVSTFLARAGANIISLDQHSTAPEGGTFLQRAIFHLPGLTAAVDELQRDFGSTVADK FGIDYRFAEAAKPKRVAIMASTEDHCLLDLLWRNRRGELEMSVVMVIANHPDLAAHVR PFGVPFIHIPATRDTRTEAEQRQLQLLSGNVDLVVLARYMQILSPGFLEAIGCPLINI HHSFLPAFTGAAPYQRARERGVKLIGATAHYVTEVLDEGPIIEQDVVRVDHTHTVDDL VRVGADVERAVLSRAVLWHCQDRVIVHHNQTIVF" CDS complement(3279431..3279916) /codon_start=1 /transl_table=11 /gene="kdtB" /locus_tag="BQ2027_MB2989C" /standard_name="coaD" /product="PROBABLE PHOSPHOPANTETHEINE ADENYLYLTRANSFERASE KDTB (PANTETHEINE-PHOSPHATE ADENYLYLTRANSFERASE) (PPAT) (DEPHOSPHO-COA PYROPHOSPHORYLASE)" /note="Mb2989c, kdtB, len: 161 aa. Equivalent to Rv2965c, len: 161 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Probable kdtB (alternate gene name: coaD), phosphopantetheine adenylyltransferase (EC 2.7.7.3), equivalent to O69466|COAD_MYCLE PHOSPHOPANTETHEINE ADENYLYLTRANSFERASE from Mycobacterium leprae (160 aa), FASTA scores: opt: 881, E(): 2.5e-54, (84.1% identity in 157 aa overlap). Also highly similar to others e.g. Q9ZBR1|COAD_STRCO from Streptomyces coelicolor (159 aa), FASTA scores: opt: 575, E(): 5.8e-33, (54.1% identity in 159 aa overlap); Q9WZK0|COAD_THEMA from Thermotoga maritima (161 aa), FASTA scores: opt: 509, E(): 2.4e-28, (50.0% identity in 154 aa overlap); P23875|COAD_ECOLICOAD|KDTB|B3634|Z5058|ECS4509 from Escherichia coli strain O157:H7 and K12 (159 aa), FASTA scores: opt: 459, E(): 7.3e-25, (45.15% identity in 155 aa overlap); etc. BELONGS TO THE COAD FAMILY. Protein product from Mb2989c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2989c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A531" /db_xref="InterPro:IPR001980" /db_xref="InterPro:IPR004821" /db_xref="InterPro:IPR014729" /db_xref="UniProtKB/Swiss-Prot:P0A531" /protein_id="SIU01612.1" /translation="MTGAVCPGSFDPVTLGHVDIFERAAAQFDEVVVAILVNPAKTGM FDLDERIAMVKESTTHLPNLRVQVGHGLVVDFVRSCGMTAIVKGLRTGTDFEYELQMA QMNKHIAGVDTFFVATAPRYSFVSSSLAKEVAMLGGDVSELLPEPVNRRLRDRLNTER T" CDS complement(3280002..3280568) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2990C" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb2990c, -, len: 188 aa. Equivalent to Rv2966c, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 188 aa overlap). Possible methyltransferase (EC 2.1.1.-), equivalent (but shorter 36 aa) to O69465|MLCB1243.09 HYPOTHETICAL 23.0 KDA PROTEIN from Mycobacterium leprae (220 aa), FASTA scores: opt: 872, E(): 9.1e-50, (74.2% identity in 182 aa overlap). Also similar to others e.g. Q9ZBR2|SC7A1.11 PUTATIVE METHYLASE from Streptomyces coelicolor (195 aa), FASTA scores: opt: 510, E(): 3.7e-26, (47.5% identity in 179 aa overlap); Q9F842 HYPOTHETICAL METHYLTRANSFERASE (FRAGMENT) from Mycobacterium smegmatis (80 aa), FASTA scores: opt: 386, E(): 2.5e-18, (75.0% identity in 80 aa overlap); P10120|YHHF_ECOLI|YHHFZ|B3465 PUTATIVE METHYLASE from Escherichia colistrain K12 (198 aa), FASTA scores: opt: 319, E(): 1.1e-13, (35.5% identity in 183 aa overlap); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature. Protein product from Mb2990c detected using SWATH mass spectrometry. Mb2990c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2R5" /db_xref="InterPro:IPR002052" /db_xref="InterPro:IPR004398" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R5" /protein_id="SIU01613.1" /translation="MTRIIGGVAGGRRIAVPPRGTRPTTDRVRESLFNIVTARRDLTG LAVLDLYAGSGALGLEALSRGAASVLFVESDQRSAAVIARNIEALGLSGATLRRGAVA AVVAAGTTSPVDLVLADPPYNVDSADVDAILAALGTNGWTREGTVAVVERATTCAPLT WPEGWRRWPQRVYGDTRLELAERLFANV" CDS complement(3280764..3284147) /codon_start=1 /transl_table=11 /gene="pca" /locus_tag="BQ2027_MB2991C" /product="PROBABLE PYRUVATE CARBOXYLASE PCA (PYRUVIC CARBOXYLASE)" /note="Mb2991c, pca, len: 1127 aa. Equivalent to Rv2967c, len: 1127 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1127 aa overlap). Probable pca, pyruvate carboxylase (ala-rich protein) (EC 6.4.1.1), equivalent to Q9F843|PYC PYRUVATE CARBOXYLASE from Mycobacterium smegmatis (1127 aa), FASTA scores: opt: 6232, E(): 0, (83.3% identity in 1127 aa overlap). Also highly similar to others e.g. Q9RK64|SCF11.26c PYRUVATE CARBOXYLASE from Streptomyces coelicolor (1124 aa), FASTA scores: opt: 5526, E(): 0, (74.65% identity in 1125 aa overlap); O54587|PYC PYRUVATE CARBOXYLASE from Corynebacterium glutamicum (Brevibacterium flavum) (1140 aa), FASTA scores: opt: 4811, E(): 0, (64.5% identity in 1132 aa overlap); Q9DDT1 PYRUVATE CARBOXYLASE from Brachydanio rerio (Zebrafish) (1180 aa), FASTA scores: opt: 3133, E(): 1.1e-171, (47.8% identity in 1142 aa overlap); etc. Contains PS00867 Carbamoyl-phosphate synthase subdomain signature 2, PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site, and PS00188 Biotin-requiring enzymes attachment site. Protein product from Mb2991c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2991c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2R6" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR000891" /db_xref="InterPro:IPR001882" /db_xref="InterPro:IPR003379" /db_xref="InterPro:IPR005479" /db_xref="InterPro:IPR005481" /db_xref="InterPro:IPR005482" /db_xref="InterPro:IPR005930" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR011764" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR016185" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R6" /protein_id="SIU01614.1" /translation="MFSKVLVANRGEIAIRAFRAAYELGVGTVAVYPYEDRNSQHRLK ADESYQIGDIGHPVHAYLSVDEIVATARRAGADAIYPGYGFLSENPDLAAACAAAGIS FVGPSAEVLELAGNKSRAIAAAREAGLPVLMSSAPSASVDELLSVAAGMPFPLFVKAV AGGGGRGMRRVGDIAALPEAIEAASREAESAFGDPTVYLEQAVINPRHIEVQILADNL GDVIHLYERDCSVQRRHQKVIELAPAPHLDAELRYKMCVDAVAFARHIGYSCAGTVEF LLDERGEYVFIEMNPRVQVEHTVTEEITDVDLVASQLRIAAGETLEQLGLRQEDIAPH GAALQCRITTEDPANGFRPDTGRISALRTAGGAGVRLDGSTNLGAEISPYFDSMLVKL TCRGRDLPTAVSRARRAIAEFRIRGVSTNIPFLQAVLDDPDFRAGRVTTSFIDERPQL LTARASADRGTKILNFLADVTVNNPYGSRPSTIYPDDKLPDLDLRAAPPAGSKQRLVK LGPEGFARWLRESAAVGVTDTTFRDAHQSLLATRVRTSGLSRVAPYLARTMPQLLSVE CWGGATYDVALRFLKEDPWERLATLRAAMPNICLQMLLRGRNTVGYTPYPEIVTSAFV QEATATGIDIFRIFDALNNIESMRPAIDAVRETGSAIAEVAMCYTGDLTDPGEQLYTL DYYLKLAEQIVDAGAHVLAIKDMAGLLRPPAAQRLVSALRSRFDLPVHLHTHDTPGGQ LASYVAAWHAGADAVDGAAAPLAGTTSQPALSSIVAAAAHTEYDTGLSLSAVCALEPY WEALRKVYAPFESGLPGPTGRVYHHEIPGGQLSNLRQQAIALGLGDRFEEIEEAYAGA DRVLGRLVKVTPTSKVVGDLALALVGAGVSADEFASDPARFGIPESVLGFLRGELGDP PGGWPEPLRTAALAGRGAARPTAQLAADDEIALSSVGAKRQATLNRLLFPSPTKEFNE HREAYGDTSQLSANQFFYGLRQGEEHRVKLERGVELLIGLEAISEPDERGMRTVMCIL NGQLRPVLVRDRSIASAVPAAEKADRGNPGHIAAPFAGVVTVGVCVGERVGAGQTIAT IEAMKMEAPITAPVAGTVERVAVSDTAQVEGGDLLVVVS" CDS complement(3284172..3284804) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2992C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb2992c, -, len: 210 aa. Equivalent to Rv2968c, len: 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 210 aa overlap). Probable conserved integral membrane protein, equivalent to O69464 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (214 aa), FASTA scores: opt: 1060, E(): 1.4e-58, (71.95% identity in 214 aa overlap). Also highly similar to others e.g. Q9F844 HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN from Mycobacterium smegmatis (187 aa), FASTA scores: opt: 883, E(): 1.2e-47, (62.8% identity in 190 aa overlap); Q9KXP3 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (240 aa), FASTA scores: opt: 503, E(): 4.6e-24, (38.0% identity in 192 aa overlap); etc. Mb2992c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4P3" /db_xref="InterPro:IPR012932" /db_xref="InterPro:IPR038354" /db_xref="InterPro:IPR041714" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P3" /protein_id="SIU01615.1" /translation="MVAARPAERSGDPAAVRVPVPSAWWVLIGGVIGLFASMTLTVEK VRILLDPIYVPSCNVNPIVSCGSVMTTPQASLLGFPNPLLGIAGFTVVVVTGVLAVAK VPLPRWYWIGLAVGILVGVAFVHWLIFQSLYRIGALCPYCMVVWAVIATLLVVVASIV FGPMRENRGSQERVGARLLYQWRWSLATLWFTTVFLLIMVRFWDYWSTLI" CDS complement(3284810..3285577) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2993C" /product="POSSIBLE CONSERVED MEMBRANE OR SECRETED PROTEIN" /note="Mb2993c, -, len: 255 aa. Equivalent to Rv2969c, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Possible conserved membrane or exported protein, equivalent to Q9CBS4|ML1667 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (264 aa), FASTA scores: opt: 1101, E(): 9.9e-68, (65.9% identity in 258 aa overlap); and highly similar to O69463 PUTATIVE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (258 aa), FASTA scores: opt: 1097, E(): 1.8e-67, (65.5% identity in 258 aa overlap). C-terminus also highly similar to Q9KK65|996A160 EXPORTED PROTEIN (FRAGMENT) from Mycobacterium avium (85 aa), FASTA scores: opt: 418, E(): 2e-21, (72.95% identity in 85 aa overlap). Also weakly similar to membrane or exported proteins e.g. Q9S2U7|SC4G6.04c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (275 aa), FASTA scores: opt: 312, E(): 7.6e-14, (28.25% identity in 230 aa overlap); Q9XAB6|SCC22.22C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (255 aa), FASTA scores: opt: 181, E(): 6.4e-05, (27.0% identity in 226 aa overlap); etc. Also some similarity with P72001|PKNE_MYCTU from Mycobacterium tuberculosis (566 aa), FASTA scores: opt: 264, E(): 2.3e-10, (30.5% identity in 177 aa overlap). Protein product from Mb2993c detected using shotgun mass spectrometry. Mb2993c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3I9" /db_xref="InterPro:IPR012336" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I9" /protein_id="SIU01616.1" /translation="MADKSKRPPRFDLKSADGSFGRLVQIGGTTIVVVFAVVLVFYIV TSRDDKKDGVAGPGDAVRVTSSKLVTQPGTSNPKAVVSFYEDFLCPACGIFERGFGPT VSKLVDIGAVAADYTMVAILDSASNQHYSSRAAAAAYCVADESIEAFRRFHAALFSKD IQPAELGKDFPDNARLIELAREAGVVGKVPDCINSGKYIEKVDGLAAAVNVHATPTVR VNGTEYEWSTPAALVAKIKEIVGDVPGIDSAAATATS" CDS complement(3285674..3286804) /codon_start=1 /transl_table=11 /gene="lipN" /locus_tag="BQ2027_MB2994C" /product="PROBABLE LIPASE/ESTERASE LIPN" /note="Mb2994c, lipN, len: 376 aa. Equivalent to Rv2970c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Probable lipN, lipase/esterase (EC 3.1.1.-), similar to others e.g. Q9AA37|CC0771 PUTATIVE ESTERASE from Caulobacter crescentus (380 aa), FASTA scores: opt: 822, E(): 8e-46, (42.15% identity in 318 aa overlap); Q9XDR4 ESTERASE HDE from petroleum-degrading bacterium HD-1 (317 aa), FASTA scores: opt: 738, E(): 2e-40, (48.85% identity in 262 aa overlap); O52270 LIPASE from Pseudomonas sp. (strain B11-1) (308 aa), FASTA scores: opt: 683, E(): 7.3e-37, (41.3% identity in 288 aa overlap); etc. Also similar to P71668 HYPOTHETICAL 34.1 KDA PROTEIN from Mycobacterium tuberculosis (320 aa), FASTA scores: opt: 715, E(): 6.3e-39, (42.3% identity in 298 aa overlap). Equivalent to AAK47374 from Mycobacterium tuberculosis strain CDC1551 (309 aa) but longer 67 aa. Protein product from Mb2994c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2994c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2W7" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2W7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01617.1" /translation="MTKSLPGVADLRLGANHPRMWTRRVQGTVVNVGVKVLPWIPTPA KRILSAGRSVIIDGNTLDPTLQLMLSTSRIFGVDGLAVDDDIVASRAHMRAICEAMPG PQIHVDVTDLSIPGPAGEIPARHYRPSGGGATPLLVFYHGGGWTLGDLDTHDALCRLT CRDADIQVLSIDYRLAPEHPAPAAVEDAYAAFVWAHEHASDEFGALPGRVAVGGDSAG GNLSAVVCQLARDKARYEGGPTPVLQWLLYPRTDFTAQTRSMGLFGNGFLLTKRDIDW FHTQYLRDSDVDPADPRLSPLLAESLSGLAPALIAVAGFDPLRDEGESYAKALRAAGT AVDLRYLGSLTHGFLNLFQLGGGSAAGTNELISALRAHLSRV" CDS 3287035..3287205 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2995" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb2995, -, len: 56 aa. Equivalent to Rv2970A, len: 56 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 56 aa overlap). Conserved hypothetical protein, similar to C-terminal part of several oxidoreductases e.g. Rv2971|Z83018|MTCY349_22 from Mycobacterium tuberculosis (282 aa), FASTA scores: opt: 158, E(): 3.6e-06, (45.0% identity in 60 aa overlap). May represent a gene fragment. Mb2995 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y350" /db_xref="InterPro:IPR018170" /db_xref="InterPro:IPR036812" /db_xref="UniProtKB/TrEMBL:A0A1R3Y350" /protein_id="SIU01618.1" /translation="MLIRWHIQLGNIVIPKSVNPMRIASNFDAFDFPRSMTEPGLVRI RKPSISQAGEMT" CDS 3287202..3288050 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2996" /product="PROBABLE OXIDOREDUCTASE" /note="Mb2996, -, len: 282 aa. Equivalent to Rv2971, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 282 aa overlap). Probable oxidoreductase (EC 1.-.-.-), possibly aldo/keto reductase, equivalent to O69462 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (282 aa), FASTA scores: opt: 1495, E(): 4.9e-93, (82.35% identity in 272 aa overlap). Also similar to others e.g. Q9KYM9|SC9H11.10C OXIDOREDUCTASE from Streptomyces coelicolor (276 aa), FASTA scores: opt: 849, E(): 1.2e-49, (51.7% identity in 267 aa overlap); Q9ZBW7|SC4B5.01C PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (277 aa), FASTA scores: opt: 847, E(): 1.7e-49, (49.1% identity in 271 aa overlap); Q46857|YQHE_ECOLI|YQHE|B3012 HYPOTHETICAL OXIDOREDUCTASE from Escherichia coli strain K12 (275 aa), FASTA scores: opt: 827, E(): 3.7e-48, (47.45% identity in 276 aa overlap); etc. Contains PS00063 Aldo /keto reductase family putative active site signature; and PS00062 Aldo/keto reductase family signature 2. Protein product from Mb2996 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2996 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXI6" /db_xref="InterPro:IPR018170" /db_xref="InterPro:IPR020471" /db_xref="InterPro:IPR023210" /db_xref="InterPro:IPR036812" /db_xref="UniProtKB/Swiss-Prot:Q7TXI6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01619.1" /translation="MTGESGAAAAPSITLNDEHTMPVLGLGVAELSDDETERAVSAAL EIGCRLIDTAYAYGNEAAVGRAIAASGVAREELFVTTKLATPDQGFTRSQEACRASLD RLGLDYVDLYLIHWPAPPVGKYVDAWGGMIQSRGEGHARSIGVSNFTAEHIENLIDLT FVTPAVNQIELHPLLNQDELRKANAQHTVVTQSYCPLALGRLLDNPTVTSIASEYVKT PAQVLLRWNLQLGNAVVVRSARPERIASNFDVFDFELAAEHMDALGGLNDGTRVREDP LTYAGT" CDS complement(3288124..3288837) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2997C" /product="POSSIBLE CONSERVED MEMBRANE OR EXPORTED PROTEIN" /note="Mb2997c, -, len: 237 aa. Equivalent to Rv2972c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 237 aa overlap). Possible conserved membrane or exported protein, equivalent (but longer 52 aa) to O69461|MLCB1243.02 HYPOTHETICAL 20.5 KDA PROTEIN from Mycobacterium leprae (180 aa), FASTA scores: opt: 581, E(): 8.2e-32, (55.75% identity in 174 aa overlap). Also similar to membrane or exported proteins e.g. Q9F2P3|SCE41.16C PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 498, E(): 4.1e-26, (44.08% identity in 186 aa overlap); Q99QB5|SCP1.323C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (219 aa), FASTA scores: opt: 329, E(): 8.5e-15, (36.35% identity in 176 aa overlap); Q9ACQ1|SCP1.267 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (219 aa), FASTA scores: opt: 286, E(): 6.6e-12, (32.03% identity in 231 aa overlap); etc. Mb2997c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011089" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2R9" /protein_id="SIU01620.1" /translation="MNRRTLLWLSAIAALALVVAYQTLGSSAGRHADEFAARAGVPTV QPGADVLAGIAVLPKRIHRYDYRRSAFGHPWDDRNDAPGGHNGCDTRDDILDRDLVDK TYVSIKRCPNAVATGTLRDPYTNTTVAFQRGASVGQSVQIDHIVPLSYAWDMGAYRWP NSERMRFANDPANLLAVQGQANQDKGDSPPAQWMPPNKAFACQYAMQFIAVLRGYSLP VDQPSSDVLRQAAATCPTG" CDS complement(3288834..3291047) /codon_start=1 /transl_table=11 /gene="recG" /locus_tag="BQ2027_MB2998C" /product="PROBABLE ATP-DEPENDENT DNA HELICASE RECG" /note="Mb2998c, recG, len: 737 aa. Equivalent to Rv2973c, len: 737 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 737 aa overlap). Probable recG, ATP-dependent DNA helicase (EC 3.6.1.-), equivalent to O69460|RECG_MYCLE ATP-DEPENDENT DNA HELICASE from Mycobacterium leprae (743 aa), FASTA scores: opt: 3846, E(): 0, (79.3% identity in 744 aa overlap). Also highly similar to others e.g. Q9ZBR3|SC7A1.10 PUTATIVE ATP-DEPENDENT DNA HELICASE from Streptomyces coelicolor (742 aa), FASTA scores: opt: 1249, E(): 1.1e-67, (46.2% identity in 758 aa overlap); Q9PGE8 ATP-DEPENDENT DNA HELICASE from Xylella fastidiosa (718 aa), FASTA scores: opt: 1174, E(): 3.5e-63, (42.1% identity in 539 aa overlap); P24230|RECG_ECOLI|RECG|B3652 from Escherichia coli strain K12 (693 aa), FASTA scores: opt: 457, E(): 7.3e-22, (35.2% identity in 733 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE HELICASE FAMILY, RECG SUBFAMILY. Protein product from Mb2998c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2998c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64323" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR004609" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR033454" /db_xref="UniProtKB/Swiss-Prot:P64323" /protein_id="SIU01621.1" /translation="MASLSDRLDRVLGATAADALDEQFGMRTVDDLLRHYPRSYVEGA ARVGIGDARPEAGEHITIVDVITDTYSFPMKKKPNRKCLRITVGGGRNKVTATFFNAD YIMRDLTKHTKVMLSGEVGYYKGAMQLTHPAFLILDSPDGKNHGTRSLKSIADASKAI SGELVVEEFERRFFPIYPASTKVQSWDIFKCVRQVLDVLDRVDDPLPAELRAKHGLIP EDEALRAIHLAESQSLRERARERLTFDEAVGLQWALVARRHGELSESGPSAAWKSNGL AAELLRRLPFELTAGQREVLDVLSDGLAANRPLNRLLQGEVGSGKTIVAVLAMLQMVD AGYQCALLAPTEVLAAQHLRSIRDVLGPLAMGGQLGGAENATRVALLTGSMTAGQKKQ VRAEIASGQVGIVIGTHALLQEAVDFHNLGMVVVDEQHRFGVEQRDQLRAKAPAGITP HLLVMTATPIPRTVALTVYGDLETSTLRELPLGRQPIATNVIFVKDKPAWLDRAWRRI IEEAAAGRQAYVVAPRIDESDDTDVQGGVRPSATAEGLFSRLRSAELAELRLALMHGR LSADDKDAAMAAFRAGEVDVLVCTTVIEVGVDVPNATVMLVMDADRFGISQLHQLRGR IGRGEHPSVCLLASWVPPDTPAGQRLRAVAGTMDGFALADLDLKERKEGDVLGRNQSG KAITLRLLSLAEHEEYIVAARDFCIEAYKNPTDPALALMAARFTSTDRIEYLDKS" CDS complement(3291050..3292711) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB2999C" /product="Dihydroxyacetone kinase-like protein, phosphatase domain / Dihydroxyacetone kinase-like protein, kinase domain" /note="Mb2999c, -, len: 553 aa. Equivalent to Rv2975c and Rv2974c, len: 84 aa and 470 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap and 100.0% identity in 470 aa overlap). Rv2975c: Conserved hypothetical protein, similar to N-terminus of others e.g. Q9ZBR4|SC7A1.09 HYPOTHETICAL 59.5 KDA PROTEIN from Streptomyces coelicolor (589 aa), FASTA scores: opt: 141, E(): 0.0019, (41.25% identity in 80 aa overlap); Q98R49|MYPU_1610 HYPOTHETICAL PROTEIN from Mycoplasma pulmonis (545 aa), FASTA scores: opt: 127, E(): 0.023, (48.0% identity in 50 aa overlap); Q9K9Z6|BH2498 HYPOTHETICAL PROTEIN from Bacillus halodurans (557 aa), FASTA scores: opt: 126, E(): 0.028, (34.55% identity in 81 aa overlap); etc. Also some similarity with N-terminus of P47609|Y369_MYCGE|MG369 HYPOTHETICAL PROTEIN from Mycoplasma genitalium (557 aa), FASTA scores: opt: 108, E(): 0.7, (36.75% identity in 49 aa overlap); this, and preceding ORF, are similar to Y369_MYCGE and YLOV PROTEIN but no cosmid sequence error was identified. Rv2974c: Conserved hypothetical ala-rich protein, highly similar to others e.g. C-terminus of Q9ZBR4|SC7A1.09 HYPOTHETICAL 59.5 KDA PROTEIN from Streptomyces coelicolor (589 aa), FASTA scores: opt: 774, E(): 1.3e-36, (41.0% identity in 495 aa overlap); Q9K9Z6|BH2498 HYPOTHETICAL PROTEIN from Bacillus halodurans (557 aa), FASTA scores: opt: 268, E(): 8e-08, (27.7% identity in 502 aa overlap) (N-terminus longer 76 aa); Q9X293 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (497 aa), FASTA scores: opt: 265, E(): 1.1e-07, (24.9% identity in 470 aa overlap) (N-terminus longer 43 aa); etc. Also some similarity with P47609|Y369_MYCGE|MG369 HYPOTHETICAL PROTEIN from Mycoplasma genitalium (557 aa), FASTA scores: opt: 154, E(): 0.25, (20.25% identity in 489 aa overlap); this, and following ORF, are similar to Y369_MYCGE but no cosmid sequence error was identified. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, Rv2975c and Rv2974c exist as 2 genes. In Mycobacterium bovis, a 2 bp deletion (cg-*) results in a single product that is more similar to Rv2974c. Protein product from Mb2999c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb2999c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2Q2" /db_xref="InterPro:IPR004007" /db_xref="InterPro:IPR019986" /db_xref="InterPro:IPR033470" /db_xref="InterPro:IPR036117" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Q2" /protein_id="SIU01622.1" /translation="MGTADRPLDASALRDWAHAVVSDLILHIDEINRLNVFPVADSDT GVNMLFTMRAAVVEADLHANSQADAEDVARVAAALAAGALNGARGNSGVILSQILRGI AEVTATAAAASGAVLRAVDANALGAALWRGVELVVASMGGVEVPGTIVSVLRAAAGAV DQCAHEGLAGAVTAAGDAAVIALEKTPEQLDVLADAGAVDAGGRGLLVLLDALRSTIC GQAPARAVYEPSPRALPTDTATQRPAPQFEVMYLLAVCDAAAADQLRDRLKELGESVA IAAAPPDSYSVHVHTDDAGAAVEAGLAVGRVSRIVISALGSGTSGLPAGGWTRGRAVL AVVDGDGAAELFAGEGACVLRPGPDAVTPAADISAHQLVRAVVDTGAAHVMVLPNGYV AAEELVAGCTAAIGWGVDVVPVPTGSMVQGLAALAVHDAARQAVDDGYSMARAAGASR HGSVRIATQKALTWAGTCKPGDGLGIAGDEVLIVADDVAAAAIGLVDLLLASGGDLVT VLIGAGVTEDVAVVLERHVHDHHPGTELVSYRTGHRGDALLIGVE" CDS complement(3292711..3292917) /codon_start=1 /transl_table=11 /gene="Mb2999cA" /locus_tag="BQ2027_MB2999CA" /note="Mb2999cA, len: 68 aa. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions. Mb2999cA transcript and transcriptional start site identified in Mycobacterium bovis strain AF2122/97 grown under exponential conditions,Mb2999cA found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2S4" /protein_id="SIU01623.1" /translation="MAGTPTCSPRLAQDSHCCAVAPKRSALRGHVSTRLGRESGVVTV LSPRLVRLGARAGAEEVCCGGGVL" CDS complement(3293170..3293853) /codon_start=1 /transl_table=11 /gene="ung" /locus_tag="BQ2027_MB3000C" /product="PROBABLE URACIL-DNA GLYCOSYLASE UNG (UDG)" /note="Mb3000c, ung, len: 227 aa. Equivalent to Rv2976c, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 227 aa overlap). Probable ung, uracil-DNA glycosylase (EC 3.2.2.-), equivalent to Q9CBS3 URACIL-DNA GLYCOSYLASE from Mycobacterium leprae (227 aa), FASTA scores: opt: 1394, E(): 8.8e-85, (88.1% identity in 227 aa overlap). Also highly similar to others e.g. Q9EX12 from Streptomyces coelicolor (225 aa), FASTA scores: opt: 1134, E(): 1.3e-67, (72.75% identity in 224 aa overlap); Q9K682|UNG_BACHD from Bacillus halodurans (224 aa), FASTA scores: opt: 652, E(): 8.9e-36, (45.5% identity in 222 aa overlap); P39615|UNG_BACSU from Bacillus subtilis (225 aa), FASTA scores: opt: 625, E(): 5.4e-34, (45.5% identity in 222 aa overlap); etc. BELONGS TO THE URACIL-DNA GLYCOSYLASE FAMILY. Protein product from Mb3000c detected using shotgun mass spectrometry. Mb3000c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67072" /db_xref="InterPro:IPR002043" /db_xref="InterPro:IPR005122" /db_xref="InterPro:IPR018085" /db_xref="InterPro:IPR036895" /db_xref="UniProtKB/Swiss-Prot:P67072" /protein_id="SIU01624.1" /translation="MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLP AGSNVLRAFTFPFDNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDEYT ADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTECAIRALAARA APLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRGFFGSRPFSRANELLVGMG AEPIDWRLP" CDS complement(3293886..3294887) /codon_start=1 /transl_table=11 /gene="thiL" /locus_tag="BQ2027_MB3001C" /product="PROBABLE THIAMINE-MONOPHOSPHATE KINASE THIL (THIAMINE-PHOSPHATE KINASE)" /note="Mb3001c, thiL, len: 333 aa. Equivalent to Rv2977c, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 333 aa overlap). Possible thiL, thiamin-monophosphate kinase (EC), equivalent to Q9CBS2 PROBABLE THIAMINE-MONOPHOSPHATE KINASE from Mycobacterium leprae (325 aa), FASTA scores: opt: 1738, E(): 4.5e-98, (80.9% identity in 314 aa overlap). Also highly similar to others e.g. Q9ZBR7|SC7A1.06 PUTATIVE THIAMINE MONPHOSPHATE KINASE from Streptomyces coelicolor (322 aa), FASTA scores: opt: 959, E(): 7.8e-51, (51.1% identity in 319 aa overlap); O05514|THIL_BACSU THIAMINE-MONOPHOSPHATE KINASE from Bacillus subtilis (325 aa), FASTA scores: opt: 476, E(): 1.5e-21, (35.15% identity in 273 aa overlap); P77785|THIL_ECOLI|THIL|B0417 THIAMINE-MONOPHOSPHATE KINASE from Escherichia coli strain K12 (325 aa), FASTA scores: opt: 418, E(): 5e-18, (36.9% identity in 282 aa overlap); etc. BELONGS TO THE THIAMINE-MONOPHOSPHATE KINASE FAMILY. Note that the start, as given, is in IS1538. Protein product from Mb3001c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3001c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4P8" /db_xref="InterPro:IPR006283" /db_xref="InterPro:IPR010918" /db_xref="InterPro:IPR016188" /db_xref="InterPro:IPR036676" /db_xref="InterPro:IPR036921" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P8" /protein_id="SIU01625.1" /translation="MTTKDHSLATESPTLQQLGEFAVIDRLVRGRRQPATVLLGPGDD AALVSAGDGRTVVSTDMLVQDSHFRLDWSTPQDVGRKAIAQNAADIEAMGARATAFVV GFGAPAETPAAQASALVDGMWEEAGRIGAGIVGGDLVSCRQWVVSVTAIGDLDGRAPV LRSGAKAGSVLAVVGELGRSAAGYALWCNGIEDFAELRRRHLVPQPPYGHGAAAAAVG AQAMIDVSDGLLADLRHIAEASGVRIDLSAAALAADRDALTAAATALGTDPWPWVLSG GEDHALVACFVGPVPAGWRTIGRVLDGPARVLVDGEEWTGYAGWQSFGEPDNQGSLG" mobile_element complement(3294867..3296891) /mobile_element_type="insertion sequence:IS1538" /locus_tag="BQ2027_IS1538" /note="IS1538, len: 2025 nt. Equivalent to IS1538, len: 2025 nt, from Mycobacetrium tuberculosis strain H37Rv,(99.9% identity in 2025 nt overlap). Similar to other IS elements in Mycobacterium tuberculosis e.g. IS1535,IS1536, IS1537, & IS1539 (EM_NEW:MTCY274 Z74024 Mycobacterium tuberculosis cosmid Y274)" repeat_region 3294867..3294872 /rpt_type=INVERTED /note="6 bp perfect inverted repeat, IRR, TGAGTG, flanking IS element IS1538." gene complement(3294867..3296891) /locus_tag="BQ2027_IS1538" CDS complement(3294884..3296263) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3002C" /product="probable transposase" /note="Mb3002c, -, len: 459 aa. Equivalent to Rv2978c, len: 459 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 459 aa overlap). Probable resolvase for IS1538, with low level matches to transposon resolvases; highly similar from aa 101 to YX1C_MYCTU|Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 809, E(): 0, (69.1% identity in 194 aa overlap). Contains PS00397 Site-specific recombinases active site, and possible helix-turn-helix motiv at aa 2-23. Mb3002c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001959" /db_xref="InterPro:IPR010095" /db_xref="InterPro:IPR021027" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K4" /protein_id="SIU01626.1" /translation="MPKFEVPDGWTVQAFRFTLDPTEDQAKALARHFGARRKAYNWTV ATLKADIQAWHASGTVTAKPSLRVLRKRWNTVKDDVCVNTETGVAWWPECSKEAYADG IAGAVEAYWNWQTSRAGKRAGKRVGFPRFKRKGRDQDRVSFTTGAMRVEPDRRHLTLP VIGTVRTHENTRRIERLIKAGRARVLAISVRRNGTRLDASVRVLVQRPQQPKVVHPGS RVGVDVGVRRLATVATADGTAIEQVENPRPLGAALRELRHVCRARSRCTKGSRRYRER TTQISRLHRRVNDVRTHHLHVLTTRLAQTHGRIVVEGLDATEMLRQKGLPGARARRRG LSDAALGTPRRHLSYKTVWYGSALVVADRWFPSSKTCHACRHVQDIGWDEQWQCDRCS VVHQRDDCAAINLARYEETSSIVGPVGAAVKRGADRKTGPRPAGGCEARKGSSPKAAE QPRDGVQVA" CDS complement(3296263..3296847) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3003C" /product="PROBABLE RESOLVASE" /note="Mb3003c, -, len: 194 aa. Equivalent to Rv2979c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (98.5% identity in 194 aa overlap). Probable resolvase for IS1538, with low level matches to transposon resolvases; highly similar from aa 101 to YX1C_MYCTU|Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 809, E(): 0, (69.1% identity in 194 aa overlap). Contains PS00397 Site-specific recombinases active site, and possible helix-turn-helix motiv at aa 2-23. Mb3003c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y2X7" /db_xref="InterPro:IPR006118" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="InterPro:IPR041718" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X7" /protein_id="SIU01627.1" /translation="MNLATWAERNGVARGTAYRWFRAGLLSVMARRVGRLILVDEPAG DAGMRSPTAVYARVSSADQKADLDRQVARVTAWAAAQQMPVDKVVTEVGSAFNEHRRK FLSLLRDPSVHRIVVEHRDRFCRLGSKYVQAAFAAQGRELVVVDSAEVGDDLVRDMTE ILTSMCARLYGKRAAENRTKRALAAAAGEDHEAA" repeat_region complement(3296886..3296891) /rpt_type=INVERTED /note="6 bp perfect inverted repeat, IRL, TGAGTG, flanking IS element IS1538." CDS 3297059..3297604 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3004" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb3004, -, len: 181 aa. Equivalent to Rv2980, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Possible conserved secreted protein, equivalent to Q9CBS1 POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (191 aa), FASTA scores: opt: 794, E(): 2.3e-40, (67.25% identity in 177 aa overlap). Also some weak similarity with other hypothetical proteins or secreted proteins e.g. C-terminus of Q98F98|MLL3872 MLL3872 PROTEIN from Rhizobium loti (Mesorhizobium loti) (575 aa), FASTA scores: opt: 148, E(): 0.16, (28.35% identity in 194 aa overlap); Q9L0W9|SCH22A.13C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 114, E(): 7.5, (40.0% identity in 80 aa overlap); etc. Equivalent to AAK47385 from Mycobacterium tuberculosis strain CDC1551 (214 aa) but shorter 33 aa. Has hydrophobic stretch near N-terminus. Protein product from Mb3004 detected using SWATH mass spectrometry. Mb3004 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y358" /db_xref="InterPro:IPR021903" /db_xref="UniProtKB/TrEMBL:A0A1R3Y358" /protein_id="SIU01628.1" /translation="MTGESDGPPRAVLIAAAALAAAVIGVILVVAANRQPPERPVVIP AVPAPQATGPGCKALLAALPQRLGEYRRAPVAEPTTAGATAWRTGPNSTPVILRCGLD RPAEFVVGSAIQVVDRVQWFQVAAQNPDEPGRSTWYTVDRPVYVALTLPSGSGPTAIQ ELSDVIDHTIPAVPIDPAPAR" CDS complement(3297777..3298889) /codon_start=1 /transl_table=11 /gene="ddlA" /locus_tag="BQ2027_MB3005C" /standard_name="ddl" /product="PROBABLE D-ALANINE--D-ALANINE LIGASE DDLA (D-ALANYLALANINE SYNTHETASE) (D-ALA-D-ALA LIGASE)" /note="Mb3005c, ddlA, len: 370 aa. Equivalent to Rv2981c, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 373 aa overlap). Probable ddlA (alternate gene name: ddl), D-alanine--D-alanine ligase A (EC 6.3.2.4), equivalent to Q9CBS0|Q9CBS0 D-ALANINE-D-ALANINE LIGASE A from Mycobacterium leprae (384 aa), FASTA scores: opt: 2001, E(): 2.4e-115, (81.75% identity in 367 aa overlap); and Q9ZGN0|DDL_MYCSM D-ALANINE--D-ALANINE LIGASE from Mycobacterium smegmatis (373 aa), FASTA scores: opt: 1934, E(): 3.1e-111, (77.95% identity in 372 aa overlap). Also highly similar to others e.g. Q9ZBR9|DDL_STRCO from Streptomyces coelicolor (389 aa), FASTA scores: opt: 1187, E(): 2.2e-65, (52.0% identity in 379 aa overlap); P15051|DDLA_SALTY from Salmonella typhimurium and Salmonella typhi (363 aa), FASTA scores: opt: 946, E(): 1.3e-50, (44.5% identity in 364 aa overlap); P23844|DDLA_ECOLI|DDLA|B0381|Z0477|ECS043 1 from Escherichia coli strain O157:H7 and K12 (364 aa), FASTA scores: opt: 938, E(): 3.9e-50, (43.55% identity in 363 aa overlap); etc. Contains PS00843 D-alanine--D-alanine ligase signature 1. BELONGS TO THE D-ALANINE--D-ALANINE LIGASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 9 bp deletion (acgccggtc-*) leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (370 aa versus 373 aa). Protein product from Mb3005c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3005c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXH9" /db_xref="InterPro:IPR000291" /db_xref="InterPro:IPR005905" /db_xref="InterPro:IPR011095" /db_xref="InterPro:IPR011127" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR013815" /db_xref="InterPro:IPR016185" /db_xref="UniProtKB/Swiss-Prot:Q7TXH9" /protein_id="SIU01629.1" /translation="MSANDRRVRVAVVFGGRSNEHAISCVSAGSILRNLDSRRFDVIA VGITPAGSWVLTDANPDALTITNRELPQVKSGSGTELALPADPRRGGQLVSLPPGAGE VLESVDVVFPVLHGPYGEDGTIQGLLELAGVPYVGAGVLASAVGMDKEFTKKLLAADG LPVGAYAVLRPPRSTLHRQECERLGLPVFVKPARGGSSIGVSRVSSWDQLPAAVARAR RHDPKVIVEAAISGRELECGVLEMPDGTLEASTLGEIRVAGVRGREDSFYDFATKYLD DAAELDVPAKVDDQVAEAIRQLAIRAFAAIDCRGLARVDFFLTDDGPVINEINTMPGF TTISMYPRMWAASGVDYPTLLATMIETALARGVGLH" CDS complement(3298967..3299971) /codon_start=1 /transl_table=11 /gene="gpdA2" /locus_tag="BQ2027_MB3006C" /standard_name="gpsA" /product="PROBABLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE [NAD(P)+] GPDA2 (NAD(P)H-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE)" /note="Mb3006c, gpdA2, len: 334 aa. Equivalent to Rv2982c, len: 334 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 334 aa overlap). Probable gpdA2 (alternate gene name: gpsA), glycerol-3-phosphate dehydrogenase [NAD(P)+] (EC 1.1.1.94), equivalent to Q9CBR9|GPDA_MYCLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE [NAD(P)+] from Mycobacterium leprae (349 aa), FASTA scores: opt: 1686, E(): 1.7e-95, (77.95% identity in 349 aa overlap). Also highly similar to others e.g. Q9ZBS0|GPDA_STRCO from Streptomyces coelicolor (336 aa), FASTA scores: opt: 1165, E(): 9.8e-64, (56.25% identity in 327 aa overlap); P46919|GPDA_BACSU from Bacillus subtilis (345 aa), FASTA scores: opt: 872, E(): 7.5e-46, (44.9% identity in 325 aa overlap); P37606|GPDA_ECOLI|GPSA|B3608|Z5035|ECS4486. from Escherichia coli strain O157:H7 and K12 (339 aa), FASTA scores: opt: 799, E(): 2.1e-41, (42.9% identity in 331 aa overlap); etc. Also highly similar to O53761|GPD2_MYCTU PROBABLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE from Mycobacterium tuberculosis (341 aa), FASTA scores: opt: 740, E(): 8.4e-38, (40.35% identity in 322 aa overlap). BELONGS TO THE NAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb3006c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3006c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59961" /db_xref="InterPro:IPR006109" /db_xref="InterPro:IPR006168" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR011128" /db_xref="InterPro:IPR013328" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P59961" /protein_id="SIU01630.1" /translation="MAGIASTVAVMGAGAWGTALAKVLADAGGEVTLWARRAEVADQI NTTRYNPDYLPGALLPPSIHATADAEEALGGASTVLLGVPAQTMRANLERWAPLLPEG ATLVSLAKGIELGTLMRMSQVIISVTGAEPAQVAVISGPNLASEIAECQPAATVVACS DSGRAVALQRALNSGYFRPYTNADVVGTEIGGACKNIIALACGMAVGIGLGENTAAAI ITRGLAEIIRLGTALGANGATLAGLAGVGDLVATCTSPRSRNRSFGERLGRGETLQSA GKACHVVEGVTSCESVLALASSYDVEMPLTDAVHRVCHKGLSVDEAITLLLGRRTKPE " CDS 3300090..3300734 /codon_start=1 /transl_table=11 /gene="cofC" /locus_tag="BQ2027_MB3007" /product="2-phospho-L-lactate guanylyltransferase (EC" /EC_number="2.7.7.68" /note="Mb3007, -, len: 214 aa. Equivalent to Rv2983, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Conserved hypothetical ala-rich protein, equivalent to O33128|ML1680|MLCB637.37c HYPOTHETICAL 22.0 KDA PROTEIN from Mycobacterium leprae (216 aa), FASTA scores: opt: 1080, E(): 9e-61, (79.05% identity in 215 aa overlap). Also similar to other hypothetical proteins e.g. Q9ZBS2|SC7A1.01C from Streptomyces coelicolor (212 aa), FASTA scores: opt: 420, E(): 2.9e-19, (43.5% identity in 207 aa overlap); O26710|MTH613 from Methanothermobacter thermautotrophicus (223 aa), FASTA scores: opt: 193, E(): 5.8e-05, (30.0% identity in 190 aa overlap); Q9RKG8|SCE46.21 from Streptomyces coelicolor (210 aa), FASTA scores: opt: 139, E(): 0.14, (27.65% identity in 206 aa overlap); etc. Protein product from Mb3007 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3007 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXH8" /db_xref="InterPro:IPR002835" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:Q7TXH8" /protein_id="SIU01631.1" /translation="MSGTPDDGDIGLIIAVKRLAAAKTRLAPVFSAQTRENVVLAMLV DTLTAAAGVGSLRSITVITPDEAAAAAAAGLGADVLADPTPEDDPDPLNTAITAAERV VAEGASNIVVLQGDLPALQTQELAEAISAARHHRRSFVADRLGTGTAVLCAFGTALHP RFGPDSSARHRRSGAVELTGAWPGLRCDVDTPADLTAARQLGVGPATARAVAHR" CDS 3300826..3303054 /codon_start=1 /transl_table=11 /gene="ppk" /locus_tag="BQ2027_MB3008" /product="polyphosphate kinase ppk (polyphosphoric acid kinase) (atp-polyphosphate phosphotransferase)" /note="Mb3008, ppk, len: 742 aa. Equivalent to Rv2984, len: 742 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 742 aa overlap). Probable ppk, polyphosphate kinase (EC 2.7.4.1), equivalent to O33127|PPK_MYCLE POLYPHOSPHATE KINASE from Mycobacterium leprae (739 aa), FASTA scores: opt: 4264, E(): 0, (87.85% identity in 742 aa overlap). Also highly similar to others e.g. Q9KZV6|PPK_STRCO from Streptomyces coelicolor (746 aa), FASTA scores: opt: 1979, E(): 2.6e-117, (59.9% identity in 701 aa overlap); Q9KD27|PPK_BACHD from Bacillus halodurans (705 aa), FASTA scores: opt: 1319, E(): 1.4e-75, (45.55% identity in 674 aa overlap); Q9PAC7|PPK_XYLFA from Xylella fastidiosa (698 aa), FASTA scores: opt: 1300, E(): 2.2e-74, (43.3% identity in 693 aa overlap); etc. BELONGS TO THE POLYPHOSPHATE KINASE FAMILY. Protein product from Mb3008 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3008 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65769" /db_xref="InterPro:IPR003414" /db_xref="InterPro:IPR024953" /db_xref="InterPro:IPR025198" /db_xref="InterPro:IPR025200" /db_xref="InterPro:IPR036830" /db_xref="InterPro:IPR036832" /db_xref="InterPro:IPR041108" /db_xref="UniProtKB/Swiss-Prot:P65769" /protein_id="SIU01632.1" /translation="MMSNDRKVTEIENSPVTEVRPEEHAWYPDDSALAAPPAATPAAI SDQLPSDRYLNRELSWLDFNARVLALAADKSMPLLERAKFLAIFASNLDEFYMVRVAG LKRRDEMGLSVRSADGLTPREQLGRIGEQTQQLASRHARVFLDSVLPALGEEGIYIVT WADLDQAERDRLSTYFNEQVFPVLTPLAVDPAHPFPFVSGLSLNLAVTVRQPEDGTQH FARVKVPDNVDRFVELAAREASEEAAGTEGRTALRFLPMEELIAAFLPVLFPGMEIVE HHAFRITRNADFEVEEDRDEDLLQALERELARRRFGSPVRLEIADDMTESMLELLLRE LDVHPGDVIEVPGLLDLSSLWQIYAVDRPTLKDRTFVPATHPAFAERETPKSIFATLR EGDVLVHHPYDSFSTSVQRFIEQAAADPNVLAIKQTLYRTSGDSPIVRALIDAAEAGK QVVALVEIKARFDEQANIAWARALEQAGVHVAYGLVGLKTHCKTALVVRREGPTIRRY CHVGTGNYNSKTARLYEDVGLLTAAPDIGADLTDLFNSLTGYSRKLSYRNLLVAPHGI RAGIIDRVEREVAAHRAEGAHNGKGRIRLKMNALVDEQVIDALYRASRAGVRIEVVVR GICALRPGAQGISENIIVRSILGRFLEHSRILHFRAIDEFWIGSADMMHRNLDRRVEV MAQVKNPRLTAQLDELFESALDPCTRCWELGPDGQWTASPQEGHSVRDHQESLMERHR SP" CDS 3303137..3304090 /codon_start=1 /transl_table=11 /gene="mutT1" /locus_tag="BQ2027_MB3009" /product="POSSIBLE HYDROLASE MUTT1" /note="Mb3009, mutT1, len: 317 aa. Equivalent to Rv2985, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 317 aa overlap). Possible mutT1, long mutt protein (hydrolase) (EC 3.-.-.-), highly similar to O33126|MLCB637.35 HYPOTHETICAL 34.5 KDA PROTEIN from Mycobacterium leprae (312 aa), FASTA scores: opt: 1514, E(): 5.1e-91, (71.85% identity in 316 aa overlap); and Q9CBR8|ML1682 HYPOTHETICAL PROTEIN from Mycobacterium leprae (311 aa), FASTA scores: opt: 1510, E(): 9.2e-91, (71.5% identity in 316 aa overlap). Also similar to Q50195|L222-ORF6|ML2698 HYPOTHETICAL PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 231, E(): 1.1e-07, (36.7% identity in 128 aa overlap). Also similar to shorter mutt proteins and related hypothetical protein e.g. Q9EUS6 HYPOTHETICAL 16.6 KDA PROTEIN from Streptomyces griseus subsp. griseus (152 aa), FASTA scores: opt: 380, E(): 1.7e-17, (50.75% identity in 130 aa overlap); Q9KZV8|SCD84.10C PUTATIVE MUTT-LIKE PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 376, E(): 2.9e-17, (46.1% identity in 128 aa overlap); P96590|MUTT MUTT PROTEIN from Bacillus subtilis (149 aa), FASTA scores: opt: 180, E(): 0.00017, (35.25% identity in 122 aa overlap); etc. Also similar to O05437 HYPOTHETICAL 27.1 KDA PROTEIN from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 224, E(): 3.2e-07, (34.03% identity in 144 aa overlap). Contains PS00893 mutT domain signature. SEEMS TO BELONG TO THE MUTT/NUDIX FAMILY PROTEIN. Protein product from Mb3009 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3009 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2T2" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="InterPro:IPR020476" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2T2" /protein_id="SIU01633.1" /translation="MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRP RYDDWSLPKGKVDPGETAPVGAVREILEETGHRANLGRRLLTVTYPTDSPFRGVKKVH YWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHPADTQTVLVVR HGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGATDVYAADRVRCHQTMEPLA AELNVTIHNEPTLTEESYANNPKRGRHRVLQIVEQVGTPVICTQGKVIPDLITWWCER DGVHPDKSRNRKGSTWVLSLSAGRLVTADHIGGALAANVRA" CDS complement(3304148..3304792) /codon_start=1 /transl_table=11 /gene="hupB" /locus_tag="BQ2027_MB3010C" /standard_name="hup; hlp; lbp21" /product="dna-binding protein hu homolog hupb (histone-like protein) (hlp) (21-kda laminin-2-binding protein)" /note="Mb3010c, hupB, len: 214 aa. Equivalent to Rv2986c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Probable hupB (alternate gene names: hup, hlp, lbp21), DNA-binding protein HU homolog (resembles fusion between HU and histone) (see first citation below), equivalent to others from Mycobacteria e.g. Q9XB18|DBH_MYCBO from Mycobacterium bovis (205 aa), FASTA scores: opt: 1050, E(): 5.6e-45, (95.35% identity in 214 aa overlap); Q9ZHC5|DBH_MYCSM from Mycobacterium smegmatis (208 aa), FASTA scores: opt: 1035, E(): 3.1e-44, (80.2% identity in 217 aa overlap); and O33125|DBH_MYCLE from Mycobacterium leprae (200 aa), FASTA scores: opt: 914, E(): 2.7e-38, (80.1% identity in 216 aa overlap). Also highly similar to others from other organisms e.g. O86537|DBH2_STRCO from Streptomyces coelicolor (218 aa), FASTA scores: opt: 569, E(): 2.6e-21, (51.35% identity in 220 aa overlap); P08821|DBH1_BACSU from Bacillus subtilis (92 aa), FASTA scores: opt: 280, E(): 2.5e-07, (45.05% identity in 91 aa overlap) (C-terminus shorter); etc. Contains PS00045 Bacterial histone-like DNA-binding proteins signature. BELONGS TO THE BACTERIAL HISTONE-LIKE PROTEIN FAMILY. Note that its C-terminal domain is very rich in lysine and alanine. Protein product from Mb3010c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3010c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q9XB18" /db_xref="InterPro:IPR000119" /db_xref="InterPro:IPR010992" /db_xref="InterPro:IPR020816" /db_xref="UniProtKB/Swiss-Prot:Q9XB18" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01634.1" /translation="MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTI TGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPAEGPAV KRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAATKAPAKKAVKA TKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPATKAPAKKATARRGRK" CDS complement(3305005..3305601) /codon_start=1 /transl_table=11 /gene="leuD" /locus_tag="BQ2027_MB3011C" /product="PROBABLE 3-ISOPROPYLMALATE DEHYDRATASE (SMALL SUBUNIT) LEUD (ISOPROPYLMALATE ISOMERASE) (ALPHA-IPM ISOMERASE) (IPMI)" /note="Mb3011c, leuD, len: 198 aa. Equivalent to Rv2987c, len: 198 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 198 aa overlap). Probable leuD, 3-isopropylmalate dehydratase, small subunit (EC 4.2.1.33), equivalent to O33124|LEUD_MYCLE 3-ISOPROPYLMALATE DEHYDRATASE SMALL SUBUNIT from Mycobacterium leprae (198 aa), FASTA scores: opt: 1155, E(): 4.2e-72, (87.75% identity in 196 aa overlap). Also highly similar to many e.g. O86535|LEUD_STRCO from Streptomyces coelicolor (197 aa), FASTA scores: opt: 765, E(): 2.6e-45, (59.0% identity in 195 aa overlap); P04787|LEUD_SALTY from Salmonella typhimurium (201 aa), FASTA scores: opt: 528, E(): 5.2e-29, (45.05% identity in 191 aa overlap); P30126|LEUD_ECOLI|LEUD|B0071 from Escherichia coli strain K12 (201 aa), FASTA scores: opt: 498, E(): 6e-27, (43.45% identity in 191 aa overlap); etc. TBparse score is 0.939. Protein product from Mb3011c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3011c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65278" /db_xref="InterPro:IPR000573" /db_xref="InterPro:IPR004431" /db_xref="InterPro:IPR015928" /db_xref="InterPro:IPR033940" /db_xref="UniProtKB/Swiss-Prot:P65278" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01635.1" /translation="MEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFA GWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHAVWALMDYGFRVVISSRFGDIFR GNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA WRLLEGLDDIALTLRKLDEIEAFEGACAYWKPRTLPAP" CDS complement(3305626..3307047) /codon_start=1 /transl_table=11 /gene="leuC" /locus_tag="BQ2027_MB3012C" /product="PROBABLE 3-ISOPROPYLMALATE DEHYDRATASE (LARGE SUBUNIT) LEUC (ISOPROPYLMALATE ISOMERASE) (ALPHA-IPM ISOMERASE) (IPMI)" /note="Mb3012c, leuC, len: 473 aa. Equivalent to Rv2988c, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 473 aa overlap). Probable leuC, 3-isopropylmalate dehydratase, large subunit (EC 4.2.1.33), equivalent to O33123|LEU2_MYCLE 3-ISOPROPYLMALATE DEHYDRATASE SMALL SUBUNIT from Mycobacterium leprae (476 aa), FASTA scores: opt: 2818, E(): 1.3e-171, (88.75% identity in 471 aa overlap). Also highly similar to many e.g. Q44427|LEU2_ACTTI from Actinoplanes teichomyceticus (485 aa), FASTA scores: opt: 1958, E(): 6.5e-117, (71.0% identity in 479 aa overlap); P55251|LEU2_RHIPU from Rhizomucor pusillus (755 aa), FASTA scores: opt: 1937, E(): 1.9e-115, (61.25% identity in 467 aa overlap) (C-terminus longer); P30127|LEU2_ECOLI|LEUC|B0072 from Escherichia coli strain K12 (465 aa), FASTA scores: opt: 1896, E(): 5.5e-113, (61.6% identity in 456 aa overlap); etc. Contains PS00450 Aconitase family signature. BELONGS TO THE ACONITASE/IPM ISOMERASE FAMILY. TBparse score is 0.895. Protein product from Mb3012c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3012c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXH6" /db_xref="InterPro:IPR001030" /db_xref="InterPro:IPR004430" /db_xref="InterPro:IPR015931" /db_xref="InterPro:IPR018136" /db_xref="InterPro:IPR033941" /db_xref="InterPro:IPR036008" /db_xref="UniProtKB/Swiss-Prot:Q7TXH6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01636.1" /translation="MALQTGEPRTLAEKIWDDHIVVSGGGCAPDLIYIDLHLVHEVTS PQAFDGLRLAGRRVRRPELTLATEDHNVPTVDIDQPIADPVSRTQVETLRRNCAEFGI RLHSMGDIEQGIVHVVGPQLGLTQPGMTIVCGDSHTSTHGAFGALAMGIGTSEVEHVL ATQTLPLRPFKTMAVNVDGRLPDGVSAKDIILALIAKIGTGGGQGHVIEYRGSAIESL SMEGRMTICNMSIEAGARAGMVAPDETTYAFLRGRPHAPTGAQWDTALVYWQRLRTDV GAVFDTEVYLDAASLSPFVTWGTNPGQGVPLAAAVPDPQLMTDDAERQAAEKALAYMD LRPGTAMREIAVDAVFVGSCTNGRIEDLRVVAEVLRGRKVADGVRMLIVPGSMRVRAQ AEAEGLGEIFTDAGAQWRQAGCSMCLGMNPDQLASGERCAATSNRNFEGRQGAGGRTH LVSPAVAAATAVRGTLSSPADLN" CDS 3307119..3307820 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3013" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb3013, -, len: 233 aa. Equivalent to Rv2989, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 233 aa overlap). Probable transcriptional regulator (ala-rich protein), highly similar to O86533|SC1C2.33c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (238 aa), FASTA scores: opt: 711, E(): 2.3e-38, (53.05% identity in 230 aa overlap); and similar to others e.g. Q9KND6 PUTATIVE TRANSCRIPTIONAL REGULATOR from Vibrio cholerae (244 aa), FASTA scores: opt: 232, E(): 1.2e-07, (29.75% identity in 232 aa overlap); Q9R9U0|SRPS EFFLUX PUMP REGULATOR from Pseudomonas putida (259 aa), FASTA scores: opt: 224, E(): 4.1e-07, (28.35% identity in 247 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. O06806|Rv1773c|MTCY28.39 HYPOTHETICAL 26.6 KDA PROTEIN (248 aa), FASTA scores: opt: 239, E(): 4.4e-08, (29.85% identity in 231 aa overlap); P71977|RV1719|MTCY04C12.04 HYPOTHETICAL 27.9 KDA PROTEIN (259 aa), FASTA scores: opt: 215, E(): 1.6e-06, (31.85% identity in 223 aa overlap); etc. Equivalent to AAK47396 from Mycobacterium tuberculosis strain CDC1551 (267 aa) but shorter 34 aa. Contains possible helix-turn-helix motif at aa 25-46 (Score 1005, +2.61 SD). Protein product from Mb3013 detected using shotgun mass spectrometry. Mb3013 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y6" /db_xref="InterPro:IPR005471" /db_xref="InterPro:IPR014757" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01637.1" /translation="MRQHSGIGVLDKAVGVLHAVAESPCGLAELCDRTDLPRATAYRL AAALEVHRLLGRGQDGHWRLGPAITELATHVDDPLLVACAAVLPQLRDATGESVQVYR REGTSRVCVAALEPAAGLRDTVPVGARLPMTAGSGAKVLLAHTDAATQAAVLPKAVFS ARALAEVCRRGWAQSVAEREPGVASVSAPVRDGRGVVIAAISVSGPIDRMGRRPGVRW AADLLSAADALTRRL" CDS complement(3307831..3308691) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3014C" /product="HYPOTHETICAL PROTEIN" /note="Mb3014c, -, len: 286 aa. Equivalent to Rv2990c, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 286 aa overlap). Hypothetical unknown protein. Protein product from Mb3014c detected using SWATH mass spectrometry. Mb3014c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y369" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01638.1" /translation="MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEG VHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRL LVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGL EPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEER RFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHG NDYVIAVEPM" CDS 3308954..3309445 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3015" /product="Pyridoxamine 5-phosphate oxidase" /note="Mb3015, -, len: 163 aa. Equivalent to Rv2991, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 163 aa overlap). Conserved hypothetical protein, similar to others e.g. Q9K3X7|2SCG61.39. HYPOTHETICAL 17.6 KDA PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 266, E(): 2.1e-11, (34.85% identity in 155 aa overlap); Q9CNX3|PM0299 HYPOTHETICAL PROTEIN from Pasteurella multocida (171 aa), FASTA scores: opt: 175, E(): 5.1e-05, (31.3% identity in 131 aa overlap); Q9KZI9|SCG8A.10 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 163, E(): 0.00031, (32.4% identity in 108 aa overlap); etc. Also some similarity to O06553|MTCI65.22|Rv1155 hypothetical protein from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 127, E(): 0.1, (32.9% identity in 73 aa overlap); and to several proteins of similar size that confer resistance to 5-Nitroimidazole antibiotics in Bacteroides. Protein product from Mb3015 detected using SWATH mass spectrometry. Mb3015 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2S5" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019920" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2S5" /protein_id="SIU01639.1" /translation="MGTKQRADIVMSEAEIADFVNSSRTGTLATIGPDGQPHLTAMWY AVIDGEIWLETKAKSQKAVNLRRDPRVSFLLEDGDTYDTLRGVSFEGVTEIVEEPEAL HRVGVSVWERYTGPYTDECKPMVDQMMNKRVGVRIVARRTRSWDHRKLGLPHMSVGGS TAP" tRNA complement(3309518..3309590) /locus_tag="BQ2027_GLUU" /product="tRNA-Glu" /note="gluU, len: 73 nt. Equivalent to gluU, len: 73 nt, from Mycobacterrium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Glu; anticodon ctc." tRNA complement(3309630..3309701) /locus_tag="BQ2027_GLNU" /product="tRNA-Gln" /note="glnU, len: 72 nt. Equivalent to glnU, len: 72 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 nt overlap). tRNA-Gln; anticodon ctg." CDS complement(3309777..3311249) /codon_start=1 /transl_table=11 /gene="gltS" /locus_tag="BQ2027_MB3016C" /standard_name="gltX" /product="glutamyl-trna synthetase glts (glutamate--trna ligase) (glutamyl-trna synthase) (glurs)" /note="Mb3016c, gltS, len: 490 aa. Equivalent to Rv2992c, len: 490 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 490 aa overlap). Probable gltS (alternate gene name: gltX), glutamyl-tRNA synthase (EC 6.1.1.17), equivalent to O33120|SYE_MYCLE GLUTAMYL-TRNA SYNTHETASE from Mycobacterium leprae (502 aa), FASTA scores: opt: 2660, E(): 2.3e-163, (81.35% identity in 488 aa overlap). Also highly similar to others e.g. O86528|SYE_STRCO from Streptomyces coelicolor (494 aa), FASTA scores: opt: 1777, E(): 1.4e-106, (57.45% identity in 484 aa overlap); P22250|SYE_BACSU from Bacillus subtilis (483 aa), FASTA scores: opt: 1099, E(): 5.4e-63, (38.45% identity in 489 aa overlap); O51345|SYE_BORBU|GLTX|BB0372 from Borrelia burgdorferi (Lyme disease spirochete) (490 aa), FASTA scores: opt: 1009, E(): 3.3e-57, (34.85% identity in 491 aa overlap); etc. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. TBparse score is 0.891. Protein product from Mb3016c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3016c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A637" /db_xref="InterPro:IPR000924" /db_xref="InterPro:IPR004527" /db_xref="InterPro:IPR008925" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR020058" /db_xref="InterPro:IPR020061" /db_xref="InterPro:IPR020751" /db_xref="InterPro:IPR020752" /db_xref="InterPro:IPR033910" /db_xref="UniProtKB/Swiss-Prot:P0A637" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01640.1" /translation="MTATETVRVRFCPSPTGTPHVGLVRTALFNWAYARHTGGTFVFR IEDTDAQRDSEESYLALLDALRWLGLDWDEGPEVGGPYGPYRQSQRAEIYRDVLARLL AAGEAYHAFSTPEEVEARHVAAGRNPKLGYDNFDRHLTDAQRAAYLAEGRQPVVRLRM PDDDLAWNDLVRGPVTFAAGSVPDFALTRASGDPLYTLVNPCDDALMKITHVLRGEDL LPSTPRQLALHQALIRIGVAERIPKFAHLPTVLGEGTKKLSKRDPQSNLFAHRDRGFI PEGLLNYLALLGWSIADDHDLFGLDEMVAAFDVADVNSSPARFDQKKADALNAEHIRM LDVGDFTVRLRDHLDTHGHHIALDEAAFAAAAELVQTRIVVLGDAWELLKFFNDDQYV IDPKAAAKELGPDGAAVLDAALAALTSVTDWTAPLIEAALKDALIEGLALKPRKAFSP IRVAATGTTVSPPLFESLELLGRDRSMQRLRAARQLVGHA" CDS complement(3311246..3311965) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3017C" /product="POSSIBLE 2-HYDROXYHEPTA-2,4-DIENE-1,7-DIOATE ISOMERASE (HHDD ISOMERASE)" /note="Mb3017c, -, len: 239 aa. Equivalent to Rv2993c, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 239 aa overlap). Possible 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (EC 5.3.3.-), equivalent to O33119|ML1689|MLCB637.28 POSSIBLE 2-HYDROXYHEPTA-2,4-DIENE- 1,7-DIOATE ISOMERASE from Mycobacterium leprae (242 aa), FASTA scores: opt: 1427, E(): 4.4e-86, (85.9% identity in 241 aa overlap). Also similar to others e.g. Q9LBE3|DR1609 from Deinococcus radiodurans (250 aa), FASTA scores: opt: 723, E(): 5.5e-40, (49.05% identity in 216 aa overlap); O27551|MTH1507 from Methanothermobacter thermautotrophicus (260 aa), FASTA scores: opt: 708, E(): 5.4e-39, (52.1% identity in 213 aa overlap); Q9HQR6|VNG1037G|HPCE from Halobacterium sp. (strain NRC-1) (244 aa), FASTA scores: opt: 590, E(): 2.7e-31, (43.65% identity in 220 aa overlap); etc. Start chosen by homology, but ORF could continue upstream. TBparse score is 0.896. Protein product from Mb3017c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3017c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2T1" /db_xref="InterPro:IPR011234" /db_xref="InterPro:IPR018833" /db_xref="InterPro:IPR036663" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2T1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01641.1" /translation="MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNY ADHIAEMGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACKDV PAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVTDLAPFDPADL ELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGDLILTGTPAGVGPIEDGDT VSITIEGIGTLTNPVVRKGKP" CDS 3312297..3313634 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3018" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3018, -, len: 445 aa. Equivalent to Rv2994, len: 445 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 445 aa overlap). Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug. C-terminal part highly similar to O33118|MLCB637.27c HYPOTHETICAL 14.7 KDA PROTEIN (probable pseudogene product) from Mycobacterium leprae (134 aa), FASTA scores: opt: 483, E(): 2.7e-21, (60.9% identity in 138 aa overlap). Also similar to various transporters e.g. Q9I5C8|PA0811 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (415 aa), FASTA scores: opt: 289, E(): 1.3e-09, (26.05% identity in 399 aa overlap); O30210|AF0025 CYANATE TRANSPORT PROTEIN from Archaeoglobus fulgidus (393 aa), FASTA scores: opt: 281, E(): 3.7e-09, (24.05% identity in 399 aa overlap); Q9RI35|SCJ12.25C PUTATIVE NITRATE/NITRITE TRANSPORTER from Streptomyces coelicolor (412 aa), FASTA scores: opt: 264, E(): 3.8e-08, (24.95% identity in 409 aa overlap); Q9A5N5|CC2412 MAJOR FACILITATOR FAMILY TRANSPORTER from Caulobacter crescentus (405 aa), FASTA scores: opt: 263, E(): 4.3e-08, (27.55% identity in 399 aa overlap); etc. First start taken; similarity to P21191|NORA_STAAU QUINOLONE RESISTANCE PROTEIN from Staphylococcus aureus (388 aa) suggests alternative start at 7319 but then no positively charged aa before first transmembrane segment. Protein product from Mb3018 detected using SWATH mass spectrometry. Mb3018 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2S6" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2S6" /protein_id="SIU01642.1" /translation="MSRDPTGVGARWAIMIVSLGVTASSFLFINGVAFLIPRLENARG TPLSHAGLLASMPSWGLVVTMFAWGYLLDHVGERMVMAVGSALTAAAAYAAASVHSLL WIGVFLFLGGMAAGGCNSAGGRLVSGWFPPQQRGLAMGIRQTAQPLGIASGALVIPEL AERGVHAGLMFPAVVCTLAAVASVLGIVDPPRKSRTKASEQELASPYRGSSILWRIHA ASALLMMPQTVTVTFMLVWLINHHGWSVAQAGVLVTISQLLGALGRVAVGRWSDHVGS RMRPVRLIAAAAAATLFLLAAVDNEGSRYDVLLMIAISVIAVLDNGLEATAITEYAGP YWSGRALGIQNTTQRLMAAAGPPLFGSLITTAAYPTAWALCGVFPLAAVPLVPVRLLP PGLETRARRQSVRRHRWWQAVRCHAWPNGPRRPGPPGQPRRVRQGGTAITPPT" CDS complement(3313486..3314496) /codon_start=1 /transl_table=11 /gene="leuB" /locus_tag="BQ2027_MB3019C" /product="probable 3-isopropylmalate dehydrogenase leub (beta-ipm dehydrogenase) (imdh) (3-ipm-dh)" /note="Mb3019c, leuB, len: 336 aa. Equivalent to Rv2995c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 336 aa overlap). leuB, 3-isopropylmalate dehydrogenase (EC 1.1.1.85) (see citation below), identical except one bp to P94929|LEU3_MYCBO 3-ISOPROPYLMALATE DEHYDROGENASE from Mycobacterium bovis (336 aa), FASTA scores: opt: 2168, E(): 5.1e-132, (99.7% identity in 336 aa overlap); and equivalent to O33117|LEU3_MYCLE 3-ISOPROPYLMALATE DEHYDROGENASE from Mycobacterium leprae (336 aa), FASTA scores: opt: 1864, E(): 1.8e-112, (83.95% identity in 336 aa overlap). Also highly similar to others e.g. P94631|LEU3_CORGL from Corynebacterium glutamicum (340 aa), FASTA scores: opt: 1526, E(): 1e-90, (69.9% identity in 339 aa overlap); O86504 from Streptomyces coelicolor (347 aa), FASTA scores: opt: 1470, E(): 4.2e-87, (67.85% identity in 339 aa overlap); Q9UZ05|PAB2424 from Pyrococcus abyssi (354 aa), FASTA scores: opt: 998, E(): 1e-56, (50.0% identity in 322 aa overlap); etc. Note that also shows high similarity with many tartrate dehydrogenases (EC 1.1.1.93). BELONGS TO THE ISOCITRATE AND ISOPROPYLMALATE DEHYDROGENASES FAMILY. Protein product from Mb3019c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3019c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P94929" /db_xref="InterPro:IPR019818" /db_xref="InterPro:IPR023698" /db_xref="InterPro:IPR024084" /db_xref="UniProtKB/Swiss-Prot:P94929" /protein_id="SIU01643.1" /translation="MKLAIIAGDGIGPEVTAEAVKVLDAVVPGVQKTSYDLGARRFHA TGEVLPDSVVAELRNHDAILLGAIGDPSVPSGVLERGLLLRLRFELDHHINLRPARLY PGVASPLSGNPGIDFVVVREGTEGPYTGNGGAIRVGTPNEVATEVSVNTAFGVRRVVA DAFERARRRRKHLTLVHKTNVLTLAGGLWLRTVDEVGECYPDVEVAYQHVDAATIHMI TDPGRFDVIVTDNLFGDIITDLAAAVCGGIGLAASGNIDATRANPSMFEPVHGSAPDI AGQGIADPTAAIMSVALLLSHLGEHDAAARVDRAVEAHLATRGSERLATSDVGERIAA AL" CDS complement(3314511..3316097) /codon_start=1 /transl_table=11 /gene="serA1" /locus_tag="BQ2027_MB3020C" /standard_name="serA" /product="PROBABLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE SERA1 (PGDH)" /note="Mb3020c, serA1, len: 528 aa. Equivalent to Rv2996c, len: 528 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 528 aa overlap). Probable serA1, D-3-phosphoglycerate dehydrogenase (EC 1.1.1.95), equivalent to SERA_MYCLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE from Mycobacterium leprae (528 aa), FASTA scores: opt: 2974, E(): 1.9e-166, (89.6% identity in 528 aa overlap). Also highly similar to many e.g. Q9Z564 from Streptomyces coelicolor (529 aa), FASTA scores: opt: 1879, E(): 2.1e-102, (57.6% identity in 526 aa overlap); O29445|SERA_ARCFU from Archaeoglobus fulgidus (527 aa), FASTA scores: opt: 1252, E(): 9.6e-66, (41.3% identity in 530 aa overlap); P35136|SERA_BACSU from Bacillus subtilis (525 aa), FASTA scores: opt: 1172, E(): 4.5e-61, (37.9% identity in 528 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00065 D-isomer specific 2-hydroxyacid dehydrogenases NAD-binding signature, and PS00670 D-isomer specific 2-hydroxyacid dehydrogenases signature 2. BELONGS TO THE D-ISOMER SPECIFIC 2-HYDROXYACID DEHYDROGENASES FAMILY. Note that previously known as serA. Protein product from Mb3020c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3020c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A545" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR006139" /db_xref="InterPro:IPR006140" /db_xref="InterPro:IPR006236" /db_xref="InterPro:IPR029009" /db_xref="InterPro:IPR029752" /db_xref="InterPro:IPR029753" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A545" /protein_id="SIU01644.1" /translation="MSLPVVLIADKLAPSTVAALGDQVEVRWVDGPDRDKLLAAVPEA DALLVRSATTVDAEVLAAAPKLKIVARAGVGLDNVDVDAATARGVLVVNAPTSNIHSA AEHALALLLAASRQIPAADASLREHTWKRSSFSGTEIFGKTVGVVGLGRIGQLVAQRI AAFGAYVVAYDPYVSPARAAQLGIELLSLDDLLARADFISVHLPKTPETAGLIDKEAL AKTKPGVIIVNAARGGLVDEAALADAITGGHVRAAGLDVFATEPCTDSPLFELAQVVV TPHLGASTAEAQDRAGTDVAESVRLALAGEFVPDAVNVGGGVVNEEVAPWLDLVRKLG VLAGVLSDELPVSLSVQVRGELAAEEVEVLRLSALRGLFSAVIEDAVTFVNAPALAAE RGVTAEICKASESPNHRSVVDVRAVGADGSVVTVSGTLYGPQLSQKIVQINGRHFDLR AQGINLIIHYVDRPGALGKIGTLLGTAGVNIQAAQLSEDAEGPGATILLRLDQDVPDD VRTAIAAAVDAYKLEVVDLS" CDS 3316127..3317569 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3021" /product="POSSIBLE ALANINE RICH DEHYDROGENASE" /note="Mb3021, -, len: 480 aa. Equivalent to Rv2997, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 480 aa overlap). Possible ala-rich dehydrogenase (EC 1.-.-.-), similar to others dehydrogenases and hypothetical proteins e.g. Q9EYI5 PUTATIVE DEHYDROGENASE from Streptomyces nogalater (472 aa), FASTA scores: opt: 1131, E(): 1.7e-61, (41.0% identity in 471 aa overlap); Q9ZBG4|SC9B5.16 PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (472 aa), FASTA scores: opt: 1064, E(): 2e-57, (39.05% identity in 471 aa overlap); Q98BS8 PROBABLE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (524 aa), FASTA scores: opt: 196, E(): 0.00021, (25.1% identity in 526 aa overlap); etc. Shows strong simlarity throughout its length to O06826|MTCY493.22c|Rv1432 HYPOTHETICAL 50.5 KDA PROTEIN from Mycobacterium tuberculosis (473 aa), FASTA scores: opt: 1220, E(): 6.1e-67, (42.35% identity in 465 aa overlap)." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R1" /protein_id="SIU01645.1" /translation="MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAAD FEFPEVLHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAYHD LAHTSAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLRLGLRMLAQGT PAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLATLAHSVGWPIPVGGTQAIA DALIADLRAHGGRLAAGVEITEPQRSVVVFDTAPTALLRVYRDKLPHRYAKALRRYRF RAGIAKVDFVLSDEIPWSDPRLRRAATLHLGGTRDQMARAEADVAAGRHADWPMVLAA CPHVADPGRIDETGRRPFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVP AARMADHNANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH GMCGWYAARTLLRTEFGITRMPPLGHELRP" CDS 3317842..3318303 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3022" /product="HYPOTHETICAL PROTEIN" /note="Mb3022, -, len: 153 aa. Equivalent to Rv2998, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 153 aa overlap). Hypothetical unknown protein. Note that equivalent to AAK47405 Hypothetical 19.4 kDa protein from Mycobacterium tuberculosis strain CDC1551 (186 aa) but sequence differs in N-terminus. Mb3022 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L8" /protein_id="SIU01646.1" /translation="MDVIWSATIATTVATGMRKPRMHGMPPITSGSMVTRVTRMSIRL AGDSTLGRFSTSRLGLSSAKSKPEGDFGTACGAVSGGDAGVVALAEGVDDGQSKPGAA GGARGVGGFRESRADCGEQFGVASWTPQGEFEFGGQEAKGVRSSWPASLTN" CDS complement(3318252..3318455) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3023C" /product="Signal transduction histidine kinase" /note="Mb3023c, -, len: 67 aa. Equivalent to Rv2998A, len: 67 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Probable conserved hypothetical protein, (possibly gene fragment), highly similar to central part of two-component sensor proteins e.g. O07777|Rv0601c|MTCY19H5.21 TWO COMPONENT SENSOR (FRAGMENT) from Mycobacterium tuberculosis (156 aa), FASTA scores: opt: 212, E(): 3.7e-09, (58.2% identity in 67 aa overlap); Q9L2B6|SC8F4.08 PROBABLE TWO-COMPONENT SENSOR KINASE from Streptomyces coelicolor (478 aa), FASTA scores: opt: 193, E(): 2.6e-07, (47.05% identity in 68 aa overlap); etc." /db_xref="GOA:A0A1R3Y2Z6" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR036097" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z6" /protein_id="SIU01647.1" /translation="MERMRIRAAGISATDPHARLPLPLARDEIRYLGTTFNDLLQRLQ DALERERQFVSDAGHELRTPLAS" CDS 3318629..3319594 /codon_start=1 /transl_table=11 /gene="lppY" /locus_tag="BQ2027_MB3024" /product="PROBABLE CONSERVED LIPOPROTEIN LPPY" /note="Mb3024, lppY, len: 321 aa. Equivalent to Rv2999, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 321 aa overlap). Probable lppY, conserved lipoprotein, highly similar to O07774|LPQO|Rv0604|MTCY19H5.18c PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (316 aa), FASTA scores: opt: 1153, E(): 5e-62, (53.2% identity in 312 aa overlap); and showing similarity with AAK80743|CAC2799 UNCHARACTERIZED CONSERVED PROTEIN SIMILAR TO LPPY/LPQO OF Mycobacterium tuberculosis from Clostridium acetobutylicum (152 aa), FASTA scores: opt: 165, E(): 0.0077, (26.08% identity in 138 aa overlap); and Q9F2T1|SCD65.01c PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (146 aa), FASTA scores: opt: 126, E(): 1.6, (% identity in aa overlap). Equivalent to AAK47407 from Mycobacterium tuberculosis strain CDC1551 (329 aa) but shorter 8 aa. Contains probable N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3024 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3024 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR011094" /db_xref="UniProtKB/TrEMBL:A0A1R3Y379" /protein_id="SIU01648.1" /translation="MAGAKHAGRIVAITTAAAVILAACSSGSKGGAGSGHAGKARSAV TTTDADWKPVADALGRSGKLGDNNTAYRINLPRNDLHITSYGVDIKPGLSLGGYAAFA RYDNNETLLMGDLVITEEELPKVTDALQAHGIAQTALHKHLLQQDPPVWWTHIHGMGD AARLAQGLKAALDATTIGPPTPPPARQPPVDIDVAGVDQALGRKGTQDGGLLKYSIPR KDTIIEDGHVLPAVSLNLTTVINFQPVGRGRAAINGDFILIAPEVQEVIRAMRAGNIT IVELHNHGLTEEPRLFYMHYWAVDDAVTLARALRPAMDATNLQSS" CDS 3319639..3320298 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3025" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3025, -, len: 219 aa. Equivalent to Rv3000, len: 219 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 219 aa overlap). Possible conserved transmembrane protein, similar to various membrane proteins e.g. P77307|YBBM_ECOLI|B0491 HYPOTHETICAL 28.2 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain K12 (259 aa), FASTA scores: opt: 292, E(): 3.1e-11, (30.25% identity in 218 aa overlap); N-terminus of Q9BJF3 PUTATIVE ABC TRANSPORTER (FRAGMENT) from Sterkiella histriomuscorum (1319 aa), FASTA scores: opt: 274, E(): 1.3e-09, (39.6% identity in 101 aa overlap); Q9C9W0|T23K23.21 PUTATIVE ABC TRANSPORTER from Arabidopsis thaliana (Mouse-ear cress) (263 aa), FASTA scores: opt: 258, E(): 4.4e-09, (30.1% identity in 196 aa overlap); P74369|YG47_SYNY3|SLR1647 HYPOTHETICAL 28.1 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Synechocystis sp. strain PCC 6803 (259 aa), FASTA scores: opt: 257, E(): 5.1e-09, (37.75% identity in 98 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb3025 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2T4" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR005226" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2T4" /protein_id="SIU01649.1" /translation="MAVHGFLLERVSVVRDEATVLRQVSAHFPAGRCSAVRGASGSGK TTLLRLLNRLIDPTSGKVWLDGVPLTDLDVLVLRRRVGLVAQAPVVLTDAVLNEVRVG RPDLPEGRVTELLARLCLGQSAREAFLPHQRSALRTALIPAIDSTKVVGLISLPGAMS GLILAGVDPLTAIRYQIVVMYLLLAATAVAALTCARLAERALFDRAHRLVSLPAATRR A" CDS complement(3320612..3321613) /codon_start=1 /transl_table=11 /gene="ilvC" /locus_tag="BQ2027_MB3026C" /product="PROBABLE KETOL-ACID REDUCTOISOMERASE ILVC (Acetohydroxy-acid isomeroreductase) (Alpha-keto-beta-hydroxylacil reductoisomerase)" /note="Mb3026c, ilvC, len: 333 aa. Equivalent to Rv3001c, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 333 aa overlap). Probable ilvC, ketol-acid reductoisomerase (EC 1.1.1.86), equivalent or highly similar to others e.g. Q59500|ILVC_MYCAV from Mycobacterium avium (333 aa), FASTA scores: opt: 1977, E(): 3.2e-113, (87.7% identity in 333 aa overlap); O33114|ILVC_MYCLE from Mycobacterium leprae (333 aa), FASTA scores: opt: 1924, E(): 5.3e-110, (86.5% identity in 333 aa overlap); Q9Z565|ILVC_STRCO|SC8D9.26 from Streptomyces coelicolor (332 aa), FASTA scores: opt: 1494, E(): 8.3e-84, (67.5% identity in 326 aa overlap); Q59818|ILVC_STRAW from Streptomyces avermitilis (333 aa) FASTA scores: opt: 1487, E(): 2.2e-83, (66.8% identity in 326 aa overlap); etc. BELONGS TO THE KETOL-ACID REDUCTOISOMERASES FAMILY. Protein product from Mb3026c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3026c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65150" /db_xref="InterPro:IPR000506" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR013023" /db_xref="InterPro:IPR013116" /db_xref="InterPro:IPR014359" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P65150" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01650.1" /translation="MFYDDDADLSIIQGRKVGVIGYGSQGHAHSLSLRDSGVQVRVGL KQGSRSRPKVEEQGLDVDTPAEVAKWADVVMVLAPDTAQAEIFAGDIEPNLKPGDALF FGHGLNVHFGLIKPPADVAVAMVAPKGPGHLVRRQFVDGKGVPCLVAVEQDPRGDGLA LALSYAKAIGGTRAGVIKTTFKDETETDLFGEQTVLCGGTEELVKAGFEVMVEAGYPA ELAYFEVLHELKLIVDLMYEGGLARMYYSVSDTAEFGGYLSGPRVIDAGTKERMRDIL REIQDGSFVHKLVADVEGGNKQLEELRRQNAEHPIEVVGKKLRDLMSWVDRPITETA" CDS complement(3321651..3322157) /codon_start=1 /transl_table=11 /gene="ilvN" /locus_tag="BQ2027_MB3027C" /standard_name="ilvH" /product="PROBABLE ACETOLACTATE SYNTHASE (SMALL SUBUNIT) ILVN (ACETOHYDROXY-ACID SYNTHASE) (AHAS) (ALS)" /note="Mb3027c, ilvN, len: 168 aa. Equivalent to Rv3002c, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Probable ilvN (alternate gene name: ilvH), acetolactate synthase, small subunit (EC 4.1.3.18), equivalent or highly similar to others e.g. O33113|ILVH_MYCLE|MLCB637.21 from Mycobacterium leprae (169 aa), FASTA scores: opt: 843, E(): 5.1e-47, (83.5% identity in 164 aa overlap); Q59499|ILVH_MYCAV|ILVN from Mycobacterium avium (167 aa), FASTA scores: opt: 798, E(): 3.7e-44, (81.05% identity in 169 aa overlap); Q9Z566|ILVN from Streptomyces coelicolor (174 aa), FASTA scores: opt: 678, E(): 1.7e-36, (64.8% identity in 159 aa overlap); etc. BELONGS TO THE ACETOLACTATE SYNTHASE SMALL SUBUNIT FAMILY. Protein product from Mb3027c detected using shotgun mass spectrometry. Mb3027c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65162" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR004789" /db_xref="InterPro:IPR019455" /db_xref="InterPro:IPR027271" /db_xref="InterPro:IPR039557" /db_xref="UniProtKB/Swiss-Prot:P65162" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01651.1" /translation="MSPKTHTLSVLVEDKPGVLARVAALFSRRGFNIESLAVGATECK DRSRMTIVVSAEDTPLEQITKQLNKLINVIKIVEQDDEHSVSRELALIKVQADAGSRS QVIEAVNLFRANVIDVSPESLTVEATGNRGKLEALLRVLEPFGIREIAQSGMVSLSRG PRGIGTAK" CDS complement(3322157..3324013) /codon_start=1 /transl_table=11 /gene="ilvB1" /locus_tag="BQ2027_MB3028C" /standard_name="ilvB" /product="acetolactate synthase (large subunit) ilvb1 (acetohydroxy-acid synthase)" /note="Mb3028c, ilvB1, len: 618 aa. Equivalent to Rv3003c, len: 618 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 618 aa overlap). Probable ilvB1, acetolactate synthase, large subunit (EC 4.1.3.18), equivalent or highly similar to others e.g. O33112|ILVB_MYCLE|MLCB637.20|ML1696 from Mycobacterium leprae (625 aa), FASTA scores: opt: 3653, E(): 5.4e-208, (87.1% identity in 627 aa overlap); Q59498|ILVB_MYCAV from Mycobacterium avium (621 aa), FASTA scores: opt: 3473, E(): 2.3e-197, (84.7% identity in 614 aa overlap); P42463|ILVB_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (626 aa), FASTA scores: opt: 2754, E(): 5.9e-155, (65.8% identity in 589 aa overlap); etc. Contains PS00187 Thiamine pyrophosphate enzymes signature. COFACTOR: THIAMINE PYROPHOSPHATE, AND MAGNESIUM (BY SIMILARITY). Note that previously known as ilvB. Protein product from Mb3028c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3028c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A623" /db_xref="InterPro:IPR000399" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR012000" /db_xref="InterPro:IPR012001" /db_xref="InterPro:IPR012846" /db_xref="InterPro:IPR029035" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR039368" /db_xref="UniProtKB/Swiss-Prot:P0A623" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01652.1" /translation="MSAPTKPHSPTFKPEPHSAANEPKHPAARPKHVALQQLTGAQAV IRSLEELGVDVIFGIPGGAVLPVYDPLFDSKKLRHVLVRHEQGAGHAASGYAHVTGRV GVCMATSGPGATNLVTPLADAQMDSIPVVAITGQVGRGLIGTDAFQEADISGITMPIT KHNFLVRSGDDIPRVLAEAFHIAASGRPGAVLVDIPKDVLQGQCTFSWPPRMELPGYK PNTKPHSRQVREAAKLIAAARKPVLYVGGGVIRGEATEQLRELAELTGIPVVTTLMAR GAFPDSHRQNLGMPGMHGTVAAVAALQRSDLLIALGTRFDDRVTGKLDSFAPEAKVIH ADIDPAEIGKNRHADVPIVGDVKAVITELIAMLRHHHIPGTIEMADWWAYLNGVRKTY PLSYGPQSDGSLSPEYVIEKLGEIAGPDAVFVAGVGQHQMWAAQFIRYEKPRSWLNSG GLGTMGFAIPAAMGAKIALPGTEVWAIDGDGCFQMTNQELATCAVEGIPVKVALINNG NLGMVRQWQSLFYAERYSQTDLATHSHRIPDFVKLAEALGCVGLRCEREEDVVDVINQ ARAINDCPVVIDFIVGADAQVWPMVAAGTSNDEIQAARGIRPLFDDITEGHA" CDS 3324375..3324713 /codon_start=1 /transl_table=11 /gene="cfp6" /locus_tag="BQ2027_MB3029" /product="LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 6 (CFP-6)" /note="Mb3029, cfp6, len: 112 aa. Equivalent to Rv3004, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). cfp6, low molecular weight protein antigen 6 (CFP-6) (cf note * below). Weak homology with Q9RKZ5|SC6D7.02 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (156 aa), FASTA scores: opt: 109, E(): 0.78, (39.4% identity in 122 aa overlap). CAUTION: THE INITIATOR METHIONINE MAY BE FURTHER UPSTREAM MAKING THE SEQUENCE A PRECURSOR. [* Note: Bhaskar S., Mukherjee R.: Isolation, purification and immunological characterization of low molecular weight protein antigens from culture filtrate of Mycobacterium tuberculosis H37Rv. Unpublished. Submitted (NOV-1998) to the SWISS-PROT data bank]. Protein product from Mb3029 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3029 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5P3" /db_xref="InterPro:IPR019692" /db_xref="UniProtKB/Swiss-Prot:P0A5P3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01653.1" /translation="MAHFAVGFLTLGLLVPVLTWPVSAPLLVIPVALSASIIRLRTLA DERGVTVRTLVGSRAVRWDDIDGLRFHRGSWARATLKDGTELRLPAVTFATLPHLTEA SSGRVPNPYR" CDS complement(3324720..3325559) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3030C" /product="Membrane protein 2, distant similarity to thiosulphate:quinone oxidoreductase DoxD" /note="Mb3030c, -, len: 279 aa. Equivalent to Rv3005c, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 279 aa overlap). Conserved hypothetical protein, equivalent to O33110|MLCB637.18|ML1698 HYPOTHETICAL 29.5 KDA PROTEIN from Mycobacterium leprae (277 aa), FASTA scores: opt: 1245, E(): 1.2e-65, (70.5% identity in 278 aa overlap). Also similar, but longer 100 aa in N-terminus, to other hypothetical proteins, few membrane proteins, e.g. Q9RKN9|SCC75A.35 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 326, E(): 3.9e-12, (44.2% identity in 138 aa overlap); P96694|YDFP|AB001488 HYPOTHETICAL PROTEIN from Bacillus subtilis (129 aa), FASTA scores: opt:273, E(): 3.7e-09, (33.1% identity in 130 aa overlap); Q9KKT1|VCA1019 HYPOTHETICAL PROTEIN from Vibrio cholerae (148 aa), FASTA scores: opt: 258, E(): 3.1e-08, (34.9% identity in 126 aa overlap); etc. Mb3030c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2V3" /db_xref="InterPro:IPR032808" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2V3" /protein_id="SIU01654.1" /translation="MTSSNDSHWQRPDDSPGPMPGRPVSASLVDPEDDLTPARYAGDF GSGTTTVIPPYDAASSGVGNSGYSLIEAAEPLPYVQPQPGRQVPAGSAGIDMDDDERV RAAGRRGTQNLGLLILRVGLGAVLIAHGLQKLFGWWDGQGLAGFQNSLSDIGYQHAEI LAYVSAGGEIVAGVLLVLGLFTPLAAAGALAFLINGLLAGISAQHSRPVAYFLQDGHE YQITLVVMAVAVILSGPGRYGLDAARGWAHRPFIGSFVALLGGIAAGIAVWVLLNGAN PLA" CDS 3325736..3326857 /codon_start=1 /transl_table=11 /gene="lppZ" /locus_tag="BQ2027_MB3031" /product="PROBABLE CONSERVED LIPOPROTEIN LPPZ" /note="Mb3031, lppZ, len: 373 aa. Equivalent to Rv3006, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 373 aa overlap). Probable lppZ, conserved lipoprotein, equivalent to O33109|MLCB637.17C|ML1699 putative lipoprotein from M. leprae (372 aa), FASTA scores: opt: 2211, E(): 4.3e-100, (87.1% identity in 373 aa overlap). Shows also similarity (in part) with Q9Z571|SC8D9.20c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (447 aa), FASTA scores: opt: 185, E(): 0.051, (31.6% identity in 300 aa overlap); Q9Z9R3|BH2090 GLUCOSE DEHYDROGENASE-B from Bacillus halodurans (371 aa), FASTA scores: opt: 206, E(): 0.0043, (28.3% identity in 205 aa overlap); and other GLUCOSE DEHYDROGENASES B. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site, followed by a proline-rich domain. Protein product from Mb3031 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3031 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4S0" /db_xref="InterPro:IPR011041" /db_xref="InterPro:IPR011042" /db_xref="InterPro:IPR012938" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S0" /protein_id="SIU01655.1" /translation="MWTTRLVRSGLAALCAAVLVSSGCARFNDAQSQPFTTEPELRPQ PSSTPPPPPPLPPVPFPKECPAPGVMQGCLESTSGLIMGIDSKTALVAERITGAVEEI SISAEPKVKTVIPVDPAGDGGLMDIVLSPTYSQDRLMYAYISTPTDNRVVRVADGDIP KDILTGIPKGAAGNTGALIFTSPTTLVVMTGDAGDPALAADPQSLAGKVLRIEQPTTI DQTPPTTALSGIGSGGGLCIDPVDGSLYVADRTPTADRLQRITKNSEVSTVWTWPDKP GVAGCAAMDGTVLVNLINTKLTVAVRLAPSTGAVTGEPDVVRKDTHAHAWALRMSPDG NVWGATVNKTAGDAEKLDDVVFPLFPQGGGFPRNNDDKT" CDS complement(3326863..3327477) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3032C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3032c, -, len: 204 aa. Equivalent to Rv3007c, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to Q9EWU5|3SC5B7.04c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (162 aa), FASTA scores: opt: 376, E(): 1.5e-18, (41.35% identity in 150 aa overlap); Q9K416|SCG22.29c PUTATIVE FLAVIN-DEPENDENT REDUCTASE PROTEIN from Streptomyces coelicolor (169 aa), FASTA scores: opt: 246, E(): 1e-09, (34.1% identity in 135 aa overlap); and some similarity to coupling proteins of 4-hydroxyphenylacetic hydroxylase/monooxygenase e.g. Q9HWT6|HPAC|PA4092 Pseudomonas aeruginosa (170 aa), FASTA score: opt: 214; O68232|HPAC Photorhabdus luminescens (Xenorhabdus luminescens) (172 aa), FASTA score: opt: 198; Q9RPU2|HPAC Salmonella dublin (170 aa), FASTA score: opt: 197; etc. Equivalent to AAK47416 from Mycobacterium tuberculosis strain CDC1551 (236 aa) but shorter 32 aa. Start chosen by similarity. Mb3032c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M9" /db_xref="InterPro:IPR002563" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M9" /protein_id="SIU01656.1" /translation="MSEDVARIHDGDVIDESFDELMGMLDHPVFVVTTQADGHPAGCL VSFATQTSVQPPSFMVGLPRSTGTSEVASRSEHLAVHVLSQRQHVLAELFGSQTEEEV NKFARCSWRAGPCGMPILDDAAAWFIGRTASRSDVGDYVAYLLEPVSVWAPECSEDLL YLSDLDFDVDDIDPGKEASPRFYERERGDETRRYGVVRFTLDVP" CDS 3327671..3328294 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3033" /product="HYPOTHETICAL PROTEIN" /note="Mb3033, -, len: 207 aa. Equivalent to Rv3008, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Hypothetical unknown protein. Start uncertain. Protein product from Mb3033 detected using SWATH mass spectrometry. Mb3033 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y305" /protein_id="SIU01657.1" /translation="MLTVVAVIGILECGLVLHMPDNDLWYCGPWTLWVMAGRGVASGA GVWRGDRVATPLAVAITAAGLVSGARIGPGAAAKRDPQLAQWNEIRSHYQEIAEWIDH DTATAHPAVAATQISAAGSFGRANMVDYLGLLDSRADETVRRDEFSRWLSAKPDYLVT TEQSVDAATIALPEFRHAYDRAATIGTLNVYRRNSPDGDEPLPADGN" CDS complement(3328291..3329820) /codon_start=1 /transl_table=11 /gene="gatB" /locus_tag="BQ2027_MB3034C" /product="PROBABLE GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE (SUBUNIT B) GATB (Glu-ADT SUBUNIT B)" /note="Mb3034c, gatB, len: 509 aa. Equivalent to Rv3009c, len: 509 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 509 aa overlap). Probable gatB, Glu-tRNA-Gln amidotransferase, subunit B (EC 6.3.5.-), equivalent to O33107|GATB_MYCLE|MLCB637_15 GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE from Mycobacterium leprae (509 aa), FASTA scores: opt: 2973, E(): 2.9e-173, (88.4% identity in 509 aa overlap). Also highly similar to other Glu- tRNA-Gln amidotransferases e.g. Q9Z578|GATB|SC8D9.13 from Streptomyces coelicolor (504 aa), FASTA scores: opt: 2264, E(): 3.6e-130, (66.0% identity in 495 aa overlap); P74215|GATB_SYNY3|SLL1435 from Synechocystis sp. strain PCC 6803 (519 aa), FASTA scores: opt: 1289, E(): 6.7e-71, (42.0% identity in 485 aa overlap); Q9X100|GATB_THEMA|TM1273 GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE from Thermotoga maritima (482 aa), FASTA scores: opt: 1165, E(): 2.2e-63, (40.05% identity in 487 aa overlap); etc. For more information about function, see citation below. Similar to many members of the pet112 family. BELONGS TO THE GATB FAMILY. Protein product from Mb3034c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3034c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64200" /db_xref="InterPro:IPR003789" /db_xref="InterPro:IPR004413" /db_xref="InterPro:IPR006075" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR017958" /db_xref="InterPro:IPR017959" /db_xref="InterPro:IPR018027" /db_xref="InterPro:IPR023168" /db_xref="InterPro:IPR042114" /db_xref="UniProtKB/Swiss-Prot:P64200" /protein_id="SIU01658.1" /translation="MTVAAGAAKAAGAELLDYDEVVARFQPVLGLEVHVELSTATKMF CGCTTTFGGEPNTQVCPVCLGLPGSLPVLNRAAVESAIRIGLALNCEIVPWCRFARKN YFYPDMPKNYQISQYDEPIAINGYLDAPLEDGTTWRVEIERAHMEEDTGKLTHIGSET GRIHGATGSLIDYNRAGVPLIEIVTKPIVGAGARAPQIARSYVTALRDLLRALDVSDV RMDQGSMRCDANVSLKPAGTTEFGTRTETKNVNSLKSVEVAVRYEMQRQGAILASGGR ITQETRHFHEAGYTSAGRTKETAEDYRYFPEPDLEPVAPSRELVERLRQTIPELPWLS RRRIQQEWGVSDEVMRDLVNAGAVELVAATVEHGASSEAARAWWGNFLAQKANEAGIG LDELAITPAQVAAVVALVDEGKLSNSLARQVVEGVLAGEGEPEQVMTARGLALVRDDS LTQAAVDEALAANPDVADKIRGGKVAAAGAIVGAVMKATRGQADAARVRELVLEACGQ G" CDS complement(3329850..3330881) /codon_start=1 /transl_table=11 /gene="pfkA" /locus_tag="BQ2027_MB3035C" /product="PROBABLE 6-PHOSPHOFRUCTOKINASE PFKA (PHOSPHOHEXOKINASE) (PHOSPHOFRUCTOKINASE)" /note="Mb3035c, pfkA, len: 343 aa. Equivalent to Rv3010c, len: 343 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 343 aa overlap). Probable pfkA, phosphofructokinase (EC 2.7.1.11), equivalent to O33106|K6PF_MYCLE|MLCB637.14 6-PHOSPHOFRUCTOKINASE from Mycobacterium leprae (343 aa), FASTA scores: opt: 2099, E(): 4.1e-122, (90.4% identity in 343 aa overlap). Also highly similar to others e.g. Q9FC99|K6P3_STRCO from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1329, E(): 1.1e-74, (58.9% identity in 338 aa overlap); Q9L1L8|K6P2_STRCO|PFKA2|PFK2|SC6A11.02 6-PHOSPHOFRUCTOKINASE 2 from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1303, E(): 4.5e-73, (56.7% identity in 342 aa overlap); Q9KH71|PFP PPI-DEPENDENT PHOSPHOFRUCTOKINASE from Dictyoglomus thermophilum (346 aa), FASTA scores: opt: 893, E(): 8.4e-48, (41.85% identity in 344 aa overlap); etc. Contains PS00433 Phosphofructokinase signature. BELONGS TO THE PHOSPHOFRUCTOKINASE FAMILY. Protein product from Mb3035c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3035c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65691" /db_xref="InterPro:IPR000023" /db_xref="InterPro:IPR012003" /db_xref="InterPro:IPR012829" /db_xref="InterPro:IPR015912" /db_xref="InterPro:IPR022953" /db_xref="InterPro:IPR035966" /db_xref="UniProtKB/Swiss-Prot:P65691" /protein_id="SIU01659.1" /translation="MRIGVLTGGGDCPGLNAVIRAVVRTCHARYGSSVVGFQNGFRGL LENRRVQLHNDDRNDRLLAKGGTMLGTARVHPDKLRAGLPQIMQTLDDNGIDVLIPIG GEGTLTAASWLSEENVPVVGVPKTIDNDIDCTDVTFGHDTALTVATEAIDRLHSTAES HERVMLVEVMGRHAGWIALNAGLASGAHMTLIPEQPFDIEEVCRLVKGRFQRGDSHFI CVVAEGAKPAPGTIMLREGGLDEFGHERFTGVAAQLAVEVEKRINKDVRVTVLGHIQR GGTPTAYDRVLATRFGVNAADAAHAGEYGQMVTLRGQDIGRVPLADAVRKLKLVPQSR YDDAAAFFG" CDS complement(3330977..3332461) /codon_start=1 /transl_table=11 /gene="gatA" /locus_tag="BQ2027_MB3036C" /product="PROBABLE GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE (SUBUNIT A) GATA (Glu-ADT SUBUNIT A)" /note="Mb3036c, gatA, len: 494 aa. Equivalent to Rv3011c, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 494 aa overlap). Probable gatA, Glu-tRNA-Gln amidotransferase, subunit A (EC 6.3.5.-), equivalent to O33105|GATA|ML1702|MLCB637.13 GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE from Mycobacterium leprae (497 aa), FASTA scores: opt: 2839, E(): 3.5e-161, (88.8% identity in 492 aa overlap). Also highly similar to other Glu-tRNA-Gln amidotransferases e.g. Q9Z580|GATA_STRCO from Streptomyces coelicolor (497 aa), FASTA scores: opt: 2231, E(): 4.5e-125, (70.3% identity in 486 aa overlap); P73558|GATA_SYNY3|SLR0877 from Synechocystis sp. strain PCC 6803 (483 aa), FASTA scores: opt: 1593, E(): 3.3e-87, (55.85% identity in 487 aa overlap); O06491|GATA_BACSU GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE from Bacillus subtilis (485 aa), FASTA scores: opt: 1389, E(): 4.3e-75, (51.7% identity in 468 aa overlap); etc. For more information about function, see citation below. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE AMIDASE FAMILY. Protein product from Mb3036c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3036c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TXG2" /db_xref="InterPro:IPR000120" /db_xref="InterPro:IPR004412" /db_xref="InterPro:IPR020556" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/Swiss-Prot:Q7TXG2" /protein_id="SIU01660.1" /translation="MTDIIRSDAATLAAKIAIKEVSSTEITRACLDQIEATDETYHAF LHVAADEALAAAAAVDKQVAAGEPLPSALAGVPLALKDVFTTSDMPTTCGSKILEGWR SPYDATLTARLRAAGIPILGKTNMDEFAMGSSTENSAYGPTRNPWNLDRVPGGSGGGS AAALAAFQAPLAIGSDTGGSIRQPAALTATVGVKPTYGTVSRYGLVACASSLDQGGPC ARTVLDTALLHQVIAGHDPRDSTSVDAEVPDVVGAARAGAVGDLRGVRVGVVRQLHGG EGYQPGVLASFEAAVEQLTALGAEVSEVDCPHFDHALAAYYLILPSEVSSNLARFDAM RYGLRVGDDGTRSAEEVMAMTRAAGFGPEVKRRIMIGTYALSAGYYDAYYNQAQKVRT LIARDLDAAYRSVDVLVSPTTPTTAFRLGEKVDDPLAMYLFDLCTLPLNLAGHCGMSV PSGLSPDDGLPVGLQIMAPALADDRLYRVGAAYEAARGPLLSAI" CDS complement(3332458..3332757) /codon_start=1 /transl_table=11 /gene="gatC" /locus_tag="BQ2027_MB3037C" /product="PROBABLE GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE (SUBUNIT C) GATC (Glu-ADT SUBUNIT C)" /note="Mb3037c, gatC, len: 99 aa. Equivalent to Rv3012c, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Probable gatC, Glu-tRNA-Gln amidotransferase, subunit C (EC 6.3.5.-), equivalent to O33104|GATC_MYCLE|MLCB637.12 GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE from Mycobacterium leprae (99 aa), FASTA scores: opt: 483, E(): 3.1e-25, (74.75% identity in 99 aa overlap). Also highly similar to other Glu-tRNA-Gln amidotransferases e.g. Q9Z581|GATC_STRCO|SC8D9.10 from Streptomyces coelicolor (98 aa), FASTA scores: opt: 298, E(): 4e-13, (53.7% identity in 95 aa overlap); O06492|GATC_BACSU from B. subtilis (96 aa), FASTA scores: opt: 222, E(): 3.7e-08, (43.15% identity in 95 aa overlap); Q9KF29|BH0665 from Bacillus halodurans (96 aa), FASTA scores: opt: 211, E(): 1.9e-07, (41.05% identity in 95 aa overlap); etc. For more information about function, see citation below. BELONGS TO THE GATC FAMILY. Protein product from Mb3037c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3037c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64206" /db_xref="InterPro:IPR003837" /db_xref="InterPro:IPR036113" /db_xref="UniProtKB/Swiss-Prot:P64206" /protein_id="SIU01661.1" /translation="MSQISRDEVAHLARLARLALTETELDSFAGQLDAILTHVSQIQA VDVTGVQATDNPLKDVNVTRPDETVPCLTQRQVLDQAPDAVDGRFAVPQILGDEQ" CDS 3332842..3333498 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3038" /product="Amino acid-binding ACT protein" /note="Mb3038, -, len: 218 aa. Equivalent to Rv3013, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Conserved hypothetical protein, equivalent to O33103|MLCB637_11c HYPOTHETICAL 24.4 KDA PROTEIN from Mycobacterium leprae (230 aa), FASTA scores: opt: 1188, E(): 2.6e-67, (83.95% identity in 218 aa overlap). Equivalent to AAK47422 from Mycobacterium tuberculosis strain CDC1551 (240 aa) but shorter 22 aa. Protein product from Mb3038 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="InterPro:IPR002912" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2U5" /protein_id="SIU01662.1" /translation="MRSYLLRIELADRPGSLGSLAVALGSVGADILSLDVVERGNGYA IDDLVVELPPGAMPDTLITAAEALNGVRVDSVRPHTGLLEAHRELELLDHVAAAEGAT ARLQVLVNEAPRVLRVSWCTVLRSSGGELHRLAGSPGAPETRANSAPWLPIERAAALD GGADWVPQAWRDMDTTMVAAPLGDTHTAVVLGRPGPEFRPSEVARLGYLAGIVATMLR " CDS complement(3333572..3335647) /codon_start=1 /transl_table=11 /gene="ligA" /locus_tag="BQ2027_MB3039C" /standard_name="lig" /product="dna ligase [nad dependent] liga (polydeoxyribonucleotide synthase [nad+])" /note="Mb3039c, ligA, len: 691 aa. Equivalent to Rv3014c, len: 691 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 691 aa overlap). Probable ligA (alternate gene name: lig), DNA ligase NAD-dependent (EC 6.5.1.2), equivalent to O33102|DNLJ_MYCLE|LIGA|LIG|ML1705|MLCB637.10 DNA LIGASE from Mycobacterium leprae (694 aa), FASTA scores: opt: 3844, E(): 0, (84.7% identity in 687 aa overlap). Also highly similar to many prokaryotic and eukaryotic ligases e.g. Q9Z585|LIGA|SC8D9.06 from Streptomyces coelicolor (735 aa), FASTA scores: opt: 2002, E(): 4e-113, (59.4% identity in 714 aa overlap); P49421|DNLJ_RHOMR|LIGA|LIG from Rhodothermus marinus (712 aa), FASTA scores: opt: 1835, E(): 4.6e-103, (45.55% identity in 685 aa overlap); P15042|DNLJ_ECOLI|LIGA|LIG|DNAL|PDEC|LOP|B2411 from Escherichia coli strain K12 (671 aa), FASTA scores: opt: 1696, E(): 1.1e-94, (43.8% identity in 680 aa overlap); etc. BELONGS TO THE NAD-DEPENDENT DNA LIGASE FAMILY. Protein product from Mb3039c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3039c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63974" /db_xref="InterPro:IPR001357" /db_xref="InterPro:IPR001679" /db_xref="InterPro:IPR004149" /db_xref="InterPro:IPR004150" /db_xref="InterPro:IPR010994" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR013839" /db_xref="InterPro:IPR013840" /db_xref="InterPro:IPR018239" /db_xref="InterPro:IPR033136" /db_xref="InterPro:IPR036420" /db_xref="InterPro:IPR041663" /db_xref="UniProtKB/Swiss-Prot:P63974" /protein_id="SIU01663.1" /translation="MSSPDADQTAPEVLRQWQALAEEVREHQFRYYVRDAPIISDAEF DELLRRLEALEEQHPELRTPDSPTQLVGGAGFATDFEPVDHLERMLSLDNAFTADELA AWAGRIHAEVGDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTGEDVTLNARTI ADVPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLVEEGKAPFANPRNSAAGSL RQKDPAVTARRRLRMICHGLGHVEGFRPATLHQAYLALRAWGLPVSEHTTLATDLAGV RERIDYWGEHRHEVDHEIDGVVVKVDEVALQRRLGSTSRAPRWAIAYKYPPEEAQTKL LDIRVNVGRTGRITPFAFMTPVKVAGSTVGQATLHNASEIKRKGVLIGDTVVIRKAGD VIPEVLGPVVELRDGSEREFIMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRER VFHVASRNGLDIEVLGYEAGVALLQAKVIADEGELFALTERDLLRTDLFRTKAGELSA NGKRLLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFGSLDAIAAASTDQLAAV EGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRMVDERDESVPRTLAGLTIVVTGSLT GFSRDDAKEAIVARGGKAAGSVSKKTNYVVAGDSPGSKYDKAVELGVPILDEDGFRRL LADGPASRT" CDS complement(3335678..3336691) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3040C" /product="Methionine synthase, vitamin-B12 independent, putative" /note="Mb3040c, -, len: 337 aa. Equivalent to Rv3015c, len: 337 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 337 aa overlap). Conserved hypothetical protein, equivalent to Q9CBR6|ML1706 HYPOTHETICAL PROTEIN from Mycobacterium leprae (337 aa), FASTA scores: opt: 1703, E(): 3.1e-92, (78.05% identity in 337 aa overlap); and (but longer 47 aa) O33101|MLCB637.09 HYPOTHETICAL 30.0 KDA PROTEIN from Mycobacterium leprae (290 aa), FASTA scores: opt: 1564, E(): 2.4e-78, (78.6% identity in 290 aa overlap). Also similar to Q9Z586|SC8D9.05 HYPOTHETICAL 35.0 KDA PROTEIN from Streptomyces coelicolor (331 aa), FASTA scores: opt: 774, E(): 4.7e-38, (43.4% identity in 334 aa overlap); and showing similarity with other proteins e.g. Q39586|METE_CHLRE 5-METHYLTETRAHYDROPTEROYLTRIGLUTAMATE-- HOMOCYSTEINE METHYLTRANSFERASE from Chlamydomonas reinhardtii (814 aa), FASTA scores: opt: 162, E(): 0.048, (27.05% identity in 355 aa overlap). Protein product from Mb3040c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3040c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2W5" /db_xref="InterPro:IPR002629" /db_xref="InterPro:IPR038071" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2W5" /protein_id="SIU01664.1" /translation="MSVFATATGIGSWPGTAAREAAQVVVGELAGALAYLTELPARGV GADMLGRAGGLLVDVAIDTVPRGYRIAARPGAVTRRAASLLDEDMDALEEAWETAGLR GCGRAVKVQAPGPVTLVAGLELANGHRAITDPGAVRDLAASLAEGVAAHRAALARRLD TPVVVQFDEPSLPAALGGRLTGVTALSPVAPLDETVAEALLDTCIAAVDADVALHSCS PDLPWDLLQRSRISAVSVDASTLQAADLDAVAAFVESGRTVVLGLVPVTAPERAPSME EVAAAAVAVTDRLGVPRSALRDRLGVSPACGLANATGQWARTAVGLARDVAEAFARDP EAI" CDS 3336785..3337414 /codon_start=1 /transl_table=11 /gene="lpqA" /locus_tag="BQ2027_MB3041" /product="PROBABLE LIPOPROTEIN LPQA" /note="Mb3041, lpqA, len: 209 aa. Equivalent to Rv3016, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Probable lpqA, lipoprotein. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb3041 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S9" /protein_id="SIU01665.1" /translation="MVGLTRPLLLCGATLLIAACTRVVGGTASATFGGDRQGMLDVAT ILLDQSRMQAITGSGDDLTIIPTMDTTYPVDVDDFAQPIPRECRFIYAETAVFGSEIE AFHKTTFQDRPDGSLISEAAAAYRDAGTARRAFDTLAVTVHDCAASPAGWLFVSRWTA GGNSLHIRAGDCGRDYRVLSAALLEVTFCGFPESVSDIVMTNIAANVPG" CDS complement(3337517..3337879) /codon_start=1 /transl_table=11 /gene="esxQ" /locus_tag="BQ2027_MB3042C" /standard_name="ES6_8; TB12.9" /product="esat-6 like protein esxq (tb12.9) (esat-6 like protein 8)" /note="Mb3042c, esxQ, len: 120 aa. Equivalent to Rv3017c, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 120 aa overlap). esxQ, putative ESAT-6 like protein 8, possibly secreted protein, very similar to AAK47433|MT3104 PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis strain CDC1551 (96 aa), FASTA scores: opt: 315, E(): 1.2e-14, (65.7% identity in 70 aa overlap); Rv3019c|O53266|MTV012.33c PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 315, E(): 1.2e-14, (65.7% identity in 70 aa overlap) and Rv0288|O53693|CFP7|MT0301|MTV035.16 10 KDA ANTIGEN CFP7 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 7) (CFP-7) from Mycobacterium tuberculosis (95 aa), FASTA scores: opt: 303, E(): 7.4e-14, (66.2% identity in 68 aa overlap). BELONGS TO THE ESAT6 FAMILY." /db_xref="GOA:P64092" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P64092" /protein_id="SIU01666.1" /translation="MSQSMYSYPAMTANVGDMAGYTGTTQSLGADIASERTAPSRACQ GDLGMSHQDWQAQWNQAMEALARAYRRCRRALRQIGVLERPVGDSSDCGTIRVGSFRG RWLDPRHAGPATAADAGD" CDS complement(3337966..3339279) /codon_start=1 /transl_table=11 /gene="PPE46" /locus_tag="BQ2027_MB3043C" /product="ppe family protein ppe46" /note="Mb3043c, PPE46, len: 437 aa. Equivalent to Rv3018c, len: 434 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 431 aa overlap). Member of PPE family but lacks Gly, Ala rich repeats at C-terminal domain, closest to MTCY261.19. Also very similar to following ORF MTV012.35c. Nearly identical in parts to Mycobacterium tuberculosis protein erroneously described as DIHYDROFOLATE REDUCTASE (X59271|MTFOLA_1) P31500|DYR_MYCTU (214 aa), FASTA scores: opt: 972, E(): 4.4e-42, (80.0% identity in 195 aa overlap); and Z97559|MTCY261_19 from M. tuberculosis cosmid (473 aa), FASTA scores: opt: 806, E(): 0; (38.8% identity in 479 aa overlap); and O53268|MTV012.35c from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 1714, E(): 3.3e-79, (78.3% identity in 355 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 56 bp deletion leads to a product slightly different at the Nh2-terminus part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (434 aa versus 437 aa)." /db_xref="GOA:A0A1R3Y315" /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y315" /protein_id="SIU01667.1" /translation="MSSHRTPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEY AAVAQELSAVVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAG YVCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQAATVMSAY EAVVGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPWHEIVQFLEETFAAYDQ YLSALLSELPAVAWVWFQLFVDILGFNIIGFIITLASNAQLLTEFAINASYVAVGLLY AIAGVIDIVVEWVIGNLFGVVPLLGGPLLGALAAAVVPGVAGLAGVAGLAAVPAVGAA AGAPAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGK ESVGQPAGLTVLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV" CDS complement(3339300..3339386) /codon_start=1 /transl_table=11 /gene="PE27A" /locus_tag="BQ2027_MB3044C" /product="pe family protein pe27a" /note="Mb3044c, PE27A, len: 28 aa. Equivalent to Rv3018A, len: 28 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 28 aa overlap). Member of M. tuberculosis PE family, most similar to Rv0285 (102 aa), FASTA scores: opt: 147, E(): 3.5e-05, (92.85% identity in 28 aa overlap); etc." /db_xref="UniProtKB/TrEMBL:A0A1R3Y399" /protein_id="SIU01668.1" /translation="MTLSVVPEGLAAASAAVEALTARLAAAH" CDS complement(3339682..3339972) /codon_start=1 /transl_table=11 /gene="esxR" /locus_tag="BQ2027_MB3045C" /standard_name="ES6_9; TB10.3" /product="secreted esat-6 like protein esxr (tb10.3) (esat-6 like protein 9)" /note="Mb3045c, esxR, len: 96 aa. Equivalent to Rv3019c, len: 96 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 96 aa overlap). esxR, putative secreted ESAT-6 like protein 9 (see citations below), most similar to O53693|AAK44525|Rv0288|CFP7|MT0301|MTV035.16 10 KDA ANTIGEN CFP7 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 7) (CFP-7) from Mycobacterium tuberculosis (95 aa), FASTA scores: opt: 566, E(): 5.1e-31, (84.3% identity in 95 aa overlap). Also similar to Q9CD33|ML2531 POSSIBLE CELL SURFACE PROTEIN from Mycobacterium leprae (96 aa), FASTA scores: opt: 472, E(): 8.3e-25, (66.6% identity in 96 aa overlap); O53264|Rv3017c|MTV012.31c PUTATIVE SECRETED ANTIGEN from Mycobacterium tuberculosis (120 aa), FASTA scores: opt: 321, E(): 9.6e-15, (67.15% identity in 70 aa overlap); Q57165|AAK48357|O84901|X79562|ESAT6|Rv3875|MT398 9|MTV027.1 0esat6 gene from Mycobacterium tuberculosis strain Erdman (94 aa), FASTA scores: opt: 131, E(): 0.028, (26.1% identity in 88 aa overlap). BELONGS TO THE ESAT6 FAMILY. TBparse score is 0.906." /db_xref="GOA:P64094" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P64094" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01669.1" /translation="MSQIMYNYPAMMAHAGDMAGYAGTLQSLGADIASEQAVLSSAWQ GDTGITYQGWQTQWNQALEDLVRAYQSMSGTHESNTMAMLARDGAEAAKWGG" CDS complement(3340007..3340300) /codon_start=1 /transl_table=11 /gene="esxS" /locus_tag="BQ2027_MB3046C" /standard_name="PE28" /product="esat-6 like protein esxs" /note="Mb3046c, esxS, len: 97 aa. Equivalent to Rv3020c, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Member of Mycobacterium tuberculosis PE family (see first citation below), similar to others e.g. AAK44524|MT0300 PE FAMILY PROTEIN from M. tuberculosis strain CDC1551 (97 aa), FASTA scores: opt: 564, E(): 5.9e-30, (91.75% identity in 97 aa overlap). Has potential helix-turn-helix motif at positions 14-35. TBparse score is 0.912. SEEMS TO BELONG TO THE ESAT6 FAMILY (see second citation below)." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01670.1" /translation="MSLLDAHIPQLIASHTAFAAKAGLMRHTIGQAEQQAMSAQAFHQ GESAAAFQGAHARFVAAAAKVNTLLDIAQANLGEAAGTYVAADAAAASSYTGF" CDS complement(3340347..3341654) /codon_start=1 /transl_table=11 /gene="PPE47" /locus_tag="BQ2027_MB3047C" /product="ppe family protein ppe47" /note="Mb3047c, PPE47, len: 435 aa. Equivalent to Rv3022c (PPE48) and Rv3021c (PPE47), len: 81 aa and 358 aa, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 81 aa overlap and 98.6% identity in 354 aa overlap). Member of Mycobacterium tuberculosis PPE family. Should be continuation of upstream ORF MTV012.36c but is frameshifted due to missing base at 36448 in v012. Sequence has been checked but no error apparent. Very similar to neighbouring ORF O53265|MTV012.32c|Rv3018c from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 1714, E(): 6.6e-770, (78.3% identity in 355 aa overlap) and AAK47430|MT3101 (strongly in the N-terminal part) (310 aa), FASTA scores: opt: 897, E(): 4.5e-37, (66.95% identity in 227 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, PPE47 and PPE48 exist as 2 separate genes. In Mycobacterium bovis, a single base insertion (*-g) leads to a single product." /db_xref="GOA:A0A1R3Y2W1" /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2W1" /protein_id="SIU01671.1" /translation="MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAV AQELSAVVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAGYVC ALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQAATVMSAYEAV VGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPFGELAKFLEMAAQAFTEVGE LIMKSAEAWAVGFVELITGLVNFEPWLVLTGMIDMFFATVGFALGVFVLVPLLEFAVV LELAILSIGWIISNIFGAIPVLAGPLLGALAAAVVPGVAGVTGLAGLAAVPAVGAAAG APAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKES VGQPAGLTVLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV" CDS complement(3341651..3341965) /codon_start=1 /transl_table=11 /gene="PE29" /locus_tag="BQ2027_MB3048C" /product="pe family protein pe29" /note="Mb3048c, PE29, len: 104 aa. Equivalent to Rv3022A, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Member of the Mycobacterium tuberculosis PE family, similar to many others e.g. Rv0285|AL021930_12 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 497, E(): 3e-21, (80.39% identity in 102 aa overlap); etc." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2V9" /protein_id="SIU01672.1" /translation="MTLRVVPEGLAAASAAVEALTARLAAAHAGAAPAITAVVAPAAD PVSLQSAVGFSALGSEHAAIAGEGVEELGRSGVAVGESGIGYAAGDAVAAATYLVSGG SL" repeat_region 3342277..3342284 /rpt_type=DIRECT /note="8 bp direct repeat, CCAGTCGC, flanking IS element IS1081." mobile_element complement(3342285..3343719) /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081-5" /note="IS1081-5, len: 1435 nt. Equivalent to IS1081, len: 1450 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1435 nt overlap)." gene complement(3342285..3343719) /locus_tag="BQ2027_IS1081-5" repeat_region 3342323..3342337 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS complement(3342347..3343594) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3049C" /product="PROBABLE TRANSPOSASE" /note="Mb3049c, -, len: 415 aa. Equivalent to Rv3023c, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 415 aa overlap). Probable IS1081 transposase. Contains PS01007 Transposases, Mutator family, signature. Similars to P35882|TRA1_MYCTU|Rv1199c|MTCI364.11c and Rv2512c|MTCY07A7.18c TRANSPOSASES FOR INSERTION SEQUENCE ELEMENT IS1081 (415 aa), FASTA scores: opt: 2675, E(): 1.8e-162, (100.0% identity in 415 aa overlap). TBparse score is 0.894. BELONGS TO THE MUTATOR FAMILY OF TRANSPOSASE." /db_xref="GOA:P60231" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/Swiss-Prot:P60231" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01673.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" repeat_region complement(3343632..3343646) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." repeat_region 3343720..3343727 /rpt_type=DIRECT /note="8 bp direct repeat, CCAGTCGC, flanking IS element IS1081." CDS complement(3343757..3344860) /codon_start=1 /transl_table=11 /gene="trmU" /locus_tag="BQ2027_MB3050C" /product="PROBABLE tRNA (5-METHYLAMINOMETHYL-2-THIOURIDYLATE)-METHYLTRANSFERASE TRMU" /note="Mb3050c, trmU, len: 367 aa. Equivalent to Rv3024c, len: 367 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 367 aa overlap). Probable trmU, tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (EC 2.1.1.61), equivalent to O33099|TRMU_MYCLE|ML1707|MLCB637.07 PROBABLE tRNA (5-METHYLAMINOMETHYL-2-THIOURIDYLATE)-METHYLTRANSFERASE from Mycobacterium leprae (358 aa), FASTA scores: opt: 2033, E(): 5.5e-116, (85.45% identity in 357 aa overlap). Also highly similar to others e.g. O86583|TRMU_STRCO|SC2A11.22 from Streptomyces coelicolor (376 aa), FASTA scores: opt: 1336, E(): 1e-73, (56.9% identity in 369 aa overlap); BAB49856|MLR2824 from Rhizobium loti (378 aa), FASTA scores: opt: 826, E(): 8.3e-43, (42.35% identity in 359 aa overlap); Q9ZDM1|TRMU_RICPR|RP306 from Rickettsia prowazekii (358 aa), FASTA scores: opt: 800, E(): 3e-41, (40.1% identity in 359 aa overlap); etc. BELONGS TO THE TRMU FAMILY. Protein product from Mb3050c detected using SWATH mass spectrometry. Mb3050c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66977" /db_xref="InterPro:IPR004506" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR023382" /db_xref="UniProtKB/Swiss-Prot:P66977" /protein_id="SIU01674.1" /translation="MKVLAAMSGGVDSSVAAARMVDAGHEVVGVHMALSTAPGTLRTG SRGCCSKEDAADARRVADVLGIPFYVWDFAEKFKEDVINDFVSSYARGETPNPCVRCN QQIKFAALSARAVALGFDTVATGHYARLSGGRLRRAVDRDKDQSYVLAVLTAQQLRHA AFPIGDTPKRQIRAEAARRGLAVANKPDSHDICFIPSGNTKAFLGERIGVRRGVVVDA DGVVLASHDGVHGFTIGQRRGLGIAGPGPNGRPRYVTAIDADTATVHVGDVTDLDVQT LTGRAPVFTAGAAPSGPVDCVVQVRAHGETVSAVAELIGDALFVQLHAPLRGVARGQT LVLYRPDPAGDEVLGSATIAGASGLSTGGNPGA" CDS complement(3344857..3346038) /codon_start=1 /transl_table=11 /gene="iscS" /locus_tag="BQ2027_MB3051C" /standard_name="nifS" /product="cysteine desulfurase iscs (nifs protein homolog) (nitrogenase metalloclusters biosynthesis protein nifs)" /note="Mb3051c, iscS, len: 393 aa. Equivalent to Rv3025c, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 393 aa overlap). Probable iscS (alternate gene name: nifS), cysteine desulfurase (NifS-like protein) (EC 4.4.1.-), equivalent to MLCB637.06|O33098 NIFS-LIKE PROTEIN from Mycobacterium leprae (396 aa), FASTA scores: opt: 2186, E(): 2.7e-122, (84.9% identity in 391 aa overlap). Also highly similar to many e.g. O86581|SC2A11.20 PUTATIVE PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASE from Streptomyces coelicolor (389 aa), FASTA scores: opt: 1568, E(): 1.1e-85, (61.7% identity in 389 aa overlap); P57795|ISCS|NIFS CYSTEINE DESULFURASE (NIFS PROTEIN HOMOLOG) from Methanosarcina thermophila (404 aa), FASTA scores: opt: 1059, E(): 1.6e-55, (46.2% identity in 381 aa overlap); O54055|ISCS_RUMFL|ISCS|NIFS CYSTEINE DESULFURASE from Ruminococcus flavefaciens (396 aa), FASTA scores: opt: 973, E(): 2e-50, (43.3% identity in 381 aa overlap); P57794|NIFS_ACEDI CYSTEINE DESULFURASE from Acetobacter diazotrophicus (400 aa), FASTA scores: opt: 958, E(): 1.6e-49, (41.1% identity in 392 aa overlap); etc. Also similar to Rv1464|MTV007.11 from Mycobacterium tuberculosis. Contains PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site. BELONGS TO CLASS-V OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES, NIFS/ISCS SUBFAMILY. COFACTOR: PYRIDOXAL PHOSPHATE (BY SIMILARITY). Protein product from Mb3051c detected using SWATH mass spectrometry. Mb3051c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4T3" /db_xref="InterPro:IPR000192" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR016454" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T3" /protein_id="SIU01675.1" /translation="MAYLDHAATTPMHPAAIEAMAAVQRTIGNASSLHTSGRSARRRI EEARELIADKLGARPSEVIFTAGGTESDNLAVKGIYWARRDAEPHRRRIVTTEVEHHA VLDSVNWLVEHEGAHVTWLPTAADGSVSATALREALQSHDDVALVSVMWANNEVGTIL PIAEMSVVAMEFGVPMHSDAIQAVGQLPLDFGASGLSAMSVAGHKFGGPPGVGALLLR RDVTCVPLMHGGGQERDIRSGTPDVASAVGMATAAQIAVDGLEENSARLRLLRDRLVE GVLAEIDDVCLNGADDPMRLAGNAHFTFRGCEGDALLMLLDANGIECSTGSACTAGVA QPSHVLIAMGVDAASARGSLRLSLGHTSVEADVDAALEVLPGAVARARRAALAAAGAS R" CDS complement(3346135..3347049) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3052C" /product="Acyl-CoA:1-acyl-sn-glycerol-3-phosphate acyltransferase (EC" /EC_number="2.3.1.51" /note="Mb3052c, -, len: 304 aa. Equivalent to Rv3026c, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 304 aa overlap). Conserved hypothetical protein, similar to Q9RCZ0|SCM10.08C PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (275 aa), FASTA scores: opt: 393, E(): 2.2e-17, (41.4% identity in 299 aa overlap). Similar in part to other hypothetical proteins and acyltransferases e.g. BAB51968|MLR5533 from Rhizobium loti (266 aa), FASTA scores: opt: 280, E(): 2.4e-10, (29.45% identity in 258 aa overlap); Q9KIH9 PUTATIVE ACYLTRANSFERASE (PUTATIVE ACYLTRANSFERASE TRANSMEMBRANE PROTEIN) (EC 2.3.1.) from Rhizobium meliloti (Sinorhizobium meliloti) (292 aa), FASTA scores: opt: 252, E(): 1.4e-08, (30.5% identity in 210 aa overlap); O69114|PLSC PUTATIVE 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE from Burkholderia pseudomallei (Pseudomonas pseudomallei) (289 aa), FASTA scores: opt: 216, E(): 2.4e-06, (30.85% identity in 269 aa overlap); etc. So may be a member of acyltransferase family protein. Protein product from Mb3052c detected using shotgun mass spectrometry." /db_xref="GOA:A0A1R3Y3N4" /db_xref="InterPro:IPR002123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3N4" /protein_id="SIU01676.1" /translation="MSAPAVTEHSWLPRATCGVSCVSVGDAAQVRRPLVVLRVALRVM LALLLVPGVPLVVMPLPGRTRVQRIYCRLVLRLFGVRITVSGSPVRNLRGVLVVSGHV SWLDVFCIGSVLPGSFVARADMFTGRTIGIVARILKIIPIERASLRRLPGVVDTIARR LRAGQTVVAFPEGTTWCGRPGDDAGRPAARAGAGCSHRGCGAFYPAMFQAAIDAGRPV QPLRLTYHHVDGTVSTAPAFVGDDTLVRSVCRLLTVRRTLAWVRVESLQLPGTDRRNL ARRCQSAVLAGALGQSGQRPGRRHVPAT" CDS complement(3347046..3347786) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3053C" /product="gcn5-related n-acetyltransferase" /note="Mb3053c, -, len: 246 aa. Equivalent to Rv3027c, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Conserved hypothetical protein, similar, but shorter 30 aa in N-terminus, to others e.g. Q9RCY9|SCM10.09c from Streptomyces coelicolor (256 aa), FASTA scores: opt: 498, E(): 7.8e-24, (47.7% identity in 237 aa overlap); BAB50158|MLR3216 from Rhizobium loti (291 aa), FASTA scores: opt: 359, E(): 3.7e-15, (33.35% identity in 246 aa overlap); etc. Equivalent to AAK47441 from Mycobacterium tuberculosis strain CDC1551 (281 aa) but shorter 35 aa. Mb3053c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y325" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y325" /protein_id="SIU01677.1" /translation="MVEAAQRLRYDVFSTTPGFALPAAADTRRDGDRFDEYCDHLLVR DDDTGELVGCYRMLAPAGAIAAGGLYTATEFDVCAFDPLRPSLVEMGRAVVREGHRNG GVVLLMWAGILAYLDRYGYDYVTGCVSVPIGGDGETPGSRLRGVRDFILNRHAAPPQC QVYPYRPVRVDGRSLDDILPPPRPAVPPLMRGYLRLGARACGEPAHDPDFGVGDFCLL LDKDHADTRYLRRLRSVAAASEMVNDAR" CDS complement(3348047..3349003) /codon_start=1 /transl_table=11 /gene="fixB" /locus_tag="BQ2027_MB3054C" /standard_name="etfA" /product="PROBABLE ELECTRON TRANSFER FLAVOPROTEIN (ALPHA-SUBUNIT) FIXB (ALPHA-ETF) (ELECTRON TRANSFER FLAVOPROTEIN LARGE SUBUNIT) (ETFLS)" /note="Mb3054c, fixB, len: 318 aa. Equivalent to Rv3028c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 318 aa overlap). Probable fixB (alternate gene name: etfA), electron transfer flavoprotein (alpha subunit) for various dehydrogenases. Equivalent to O33096|ETFA_MYCLE|FIXB|ML1711|MLCB637.04 ELECTRON TRANSFER FLAVOPROTEIN from Mycobacterium leprae (318 aa), FASTA scores: opt: 1788, E(): 1.1e-87, (89.3% identity in 318 aa overlap). Also highly similar to many e.g. Q9K418|SCG22.27c from Streptomyces coelicolor (320 aa), FASTA scores: opt: 1161, E(): 1.6e-54, (59.45% identity in 323 aa overlap); AAK08137|etfa from Rhodobacter sphaeroides (308 aa), FASTA scores: opt: 792, E(): 5.1e-35, (45.95% identity in 309 aa overlap); P38974|ETFA_PARDE ELECTRON TRANSFER FLAVOPROTEIN from Paracoccus denitrificans (307 aa), FASTA scores: opt: 789, E(): 7.4e-35, (45.95% identity in 309 aa overlap); etc. BELONGS TO THE ETF ALPHA-SUBUNIT / FIXB FAMILY. Protein product from Mb3054c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3054c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3A9" /db_xref="InterPro:IPR001308" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR014730" /db_xref="InterPro:IPR014731" /db_xref="InterPro:IPR018206" /db_xref="InterPro:IPR029035" /db_xref="InterPro:IPR033947" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3A9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01678.1" /translation="MAEVLVLVEHAEGALKKVSAELITAARALGEPAAVVVGVPGTAA PLVDGLKAAGAAKIYVAESDLVDKYLITPAVDVLAGLAESSAPAGVLIAATADGKEIA GRLAARIGSGLLVDVVDVREGGVGVHSIFGGAFTVEAQANGDTPVITVRAGAVEAEPA AGAGEQVSVEVPAAAENAARITAREPAVAGDRPELTEATIVVAGGRGVGSAENFSVVE ALADSLGAAVGASRAAVDSGYYPGQFQVGQTGKTVSPQLYIALGISGAIQHRAGMQTS KTIVAVNKDEEAPIFEIADYGVVGDLFKVAPQLTEVIKARKG" CDS complement(3349042..3349842) /codon_start=1 /transl_table=11 /gene="fixA" /locus_tag="BQ2027_MB3055C" /standard_name="etfB" /product="PROBABLE ELECTRON TRANSFER FLAVOPROTEIN (BETA-SUBUNIT) FIXA (BETA-ETF) (ELECTRON TRANSFER FLAVOPROTEIN SMALL SUBUNIT) (ETFSS)" /note="Mb3055c, fixA, len: 266 aa. Equivalent to Rv3029c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 266 aa overlap). Probable fixA (alternate gene name: etfB), electron transfer flavoprotein (beta-subunit). Equivalent of O33095|ETFB_MYCLE|FixA|MLCB637.03 ELECTRON TRANSFER FLAVOPROTEIN from Mycobacterium leprae (266 aa), FASTA scores: opt: 1603, E(): 7.6e-87, (95.1% identity in 266 aa overlap). Also highly similar to others e.g. Q9K417|SCG22.28c from Streptomyces coelicolor (262 aa), FASTA scores: opt: 860, E(): 2.3e-43, (52.4% identity in 263 aa overlap); O85691|ETFB_MEGEL from Megasphaera elsdenii (270 aa), FASTA scores: opt: 548, E(): 4.2e-25, (35.15% identity in 273 aa overlap); etc. Also highly similar in particular to Q9KHD0|NONH FLAVOPROTEIN REDUCTASE from Streptomyces griseus subsp. griseus (this one is required for macrotetrolide biosynthesis in Streptomyces griseus) (261 aa), FASTA scores: opt: 867, E(): 8.8e-44, (54.0% identity in 263 aa overlap). BELONGS TO THE ETF BETA-SUBUNIT / FIXA FAMILY. Protein product from Mb3055c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3055c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64098" /db_xref="InterPro:IPR000049" /db_xref="InterPro:IPR012255" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR014730" /db_xref="InterPro:IPR033948" /db_xref="UniProtKB/Swiss-Prot:P64098" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01679.1" /translation="MTNIVVLIKQVPDTWSERKLTDGDFTLDREAADAVLDEINERAV EEALQIREKEAADGIEGSVTVLTAGPERATEAIRKALSMGADKAVHLKDDGMHGSDVI QTGWALARALGTIEGTELVIAGNESTDGVGGAVPAIIAEYLGLPQLTHLRKVSIEGGK ITGERETDEGVFTLEATLPAVISVNEKINEPRFPSFKGIMAAKKKEVTVLTLAEIGVE SDEVGLANAGSTVLASTPKPAKTAGEKVTDEGEGGNQIVQYLVAQKII" CDS 3350073..3350897 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3056" /product="SAM-dependent methyltransferase" /note="Mb3056, -, len: 274 aa. Equivalent to Rv3030, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Conserved hypothetical protein, equivalent to O33094|MLCB637.02c|ML1713 hypothetical 30.8 KDa protein from Mycobacterium leprae (280 aa), FASTA scores: opt: 1388, E(): 5.5e-83, (78.2% identity in 280 aa overlap). N-terminus has similarity to hypothetical proteins from a number of organisms and to Q54303|EMBL:X86780|RAPM methyltransferase from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 191, E(): 3.6e-05, (35.65% identity in 101 aa overlap). Protein product from Mb3056 detected using SWATH mass spectrometry. Mb3056 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y0" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y0" /protein_id="SIU01680.1" /translation="MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENY WFRRHQVVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVRSR YPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLRGSGLLMVSTP NRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAMCGLFHGPRLRDMDARHGG SIIDAQIMRAVAGAPWPPELAADVAAVTTADFEMVAAGHDRDIDDSLDLIAIAVRP" CDS 3350894..3352474 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3057" /product="Glycogen branching enzyme, GH-57-type, archaeal (EC" /EC_number="2.4.1.18" /note="Mb3057, -, len: 526 aa. Equivalent to Rv3031, len: 526 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 526 aa overlap). Conserved hypothetical protein, equivalent to Q9CBR4|ML1714 HYPOTHETICAL PROTEIN from Mycobacterium leprae (522 aa), FASTA scores: opt: 3167, E(): 4.4e-190, (86.15% identity in 526 aa overlap); and highly similar to truncated O33093|MLCB637.01c HYPOTHETICAL 37.2 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (338 aa), FASTA scores: opt: 2041, E(): 5.7e-120, (84.8% identity in 342 aa overlap). Also some similarity to hypothetical proteins Q9V0M7|PAB1857 from Pyrococcus abyssi (602 aa), FASTA scores: opt: 477, E(): 3.5e-22, (31.2% identity in 556 aa overlap); and Synechocystis P74630|D90916|SLL0735 from Synechocystis sp. strain PCC 6803 (529 aa), FASTA scores: opt: 282, E(): 4.7e-10, (28.6% identity in 560 aa overlap). Protein product from Mb3057 detected using SWATH mass spectrometry. Mb3057 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2X1" /db_xref="InterPro:IPR004300" /db_xref="InterPro:IPR011330" /db_xref="InterPro:IPR015293" /db_xref="InterPro:IPR027291" /db_xref="InterPro:IPR028995" /db_xref="InterPro:IPR037090" /db_xref="InterPro:IPR040042" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X1" /protein_id="SIU01681.1" /translation="MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAY LPLLQVLAALADENRHRLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVR YARQSKSADYPSCTPEALRAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVE LLGGPLAHPFQPLLAPRLREFALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYAT AGVSHFMVDGPSLHGDTALGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFH TYDHLTGLKPARVTGRNVPSEQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIG RPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELP PSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTIDKALAQTASLDGPLPRDHVADQ ILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATREIAGALAAGRRDTARRLAEG WNRADGLFGALDARRLPK" CDS 3352506..3353750 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3058" /product="alpha (1->4) glucosyltransferase" /note="Mb3058, -, len: 414 aa. Equivalent to Rv3032, len: 414 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 414 aa overlap). Possible transferase (EC 2.-.-.-), equivalent to Q9CBR3|ML1715 PUTATIVE TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2456, E(): 7.3e-145, (87.9% identity in 414 aa overlap). Also similar to hypothetical proteins and various transferases e.g. P73369|SLL1971 HYPOTHETICAL 46.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (404 aa), FASTA scores: opt: 584, E(): 7.3e-29, (34.5% identity in 400 aa overlap); Q9Z5B7|SC2G5.06 PUTATIVE TRANSFERASE from Streptomyces coelicolor (406 aa), FASTA scores: opt: 509, E(): 3.3e-24, (35.9% identity in 413 aa overlap); Q9UZA1|PAB0827 GALACTOSYLTRANSFERASE (LPS BIOSYNTHESIS RFBU RELATED PROTEIN) from Pyrococcus abyssi (371 aa), FASTA scores: opt: 381, E(): 2.6e-16, (26.75% identity in 404 aa overlap); etc. Protein product from Mb3058 detected using SWATH mass spectrometry. Mb3058 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2W8" /db_xref="InterPro:IPR001296" /db_xref="InterPro:IPR028098" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2W8" /protein_id="SIU01682.1" /translation="MRILMVSWEYPPVVIGGLGRHVHHLSTALAAAGHDVVVLSRCPS GTDPSTHPSSDEVTEGVRVIAAAQDPHEFTFGNDMMAWTLAMGHAMIRAGLRLKKLGT DRSWRPDVVHAHDWLVAHPAIALAQFYDVPMVSTIHATEAGRHSGWVSGALSRQVHAV ESWLVRESDSLITCSASMNDEITELFGPGLAEITVIRNGIDAARWPFAARRPRTGPAE LLYVGRLEYEKGVHDAIAALPRLRRTHPGTTLTIAGEGTQQDWLIDQARKHRVLRATR FVGHLDHTELLALLHRADAAVLPSHYEPFGLVALEAAAAGTPLVTSNIGGLGEAVING QTGVSCAPRDVAGLAAAVRSVLDDPAAAQRRARAARQRLTSDFDWQTVATATAQVYLA AKRGERQPQPRLPIVEHALPDR" CDS 3353784..3354173 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3058A" /product="Conserved protein" /note="Mb3058A, len: 129 aa. Equivalent to Rv3032A len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Protein product from Mb3058A detected using SWATH mass spectrometry. Mb3058A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X8" /protein_id="SIU01683.1" /translation="MKPQDQGLHFPYRYDLRLAPMWLPFRWPGSQGVTVTEDGRFVAR YGPFRVEAPLSSVRDAHITGPYRWWTAVGPRLSMVDDGLTFGTNAAAGVCIHFEPRIH RVIGLRDHSALTVTVADPEGLVAALSS" CDS 3354352..3354900 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3059" /product="unknown protein" /note="Mb3059, -, len: 182 aa. Equivalent to Rv3033, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Hypothetical unknown protein. Protein product from Mb3059 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3059 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025637" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y5" /protein_id="SIU01684.1" /translation="MAHSIVRTLLASGAATALIAIPTACSFSIGTSHSHSVSKAEVAR QITAKMTDAAGNKPESVTCPSDLPAEVGAELNCEMKIKDRTFNVNVTVTSVDGSDVKF DMVETVDKNQVANIISDKLFQRVGARPDSVTCPDNLKGVEGAKLRCRLTDGSKTYGIS VIVTSVDAGDVNFDFKVDDHPE" CDS complement(3354991..3355893) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3060C" /product="POSSIBLE TRANSFERASE" /note="Mb3060c, -, len: 300 aa. Equivalent to Rv3034c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 300 aa overlap). Possible transferase (2.-.-.-), equivalent to AAK47449|MT3119 Hexapeptide transferase family protein from Mycobacterium tuberculosis strain CDC1551 but N-terminus shorter 39 residues (262 aa), FASTA scores: opt: 1773, E(): 4.7e-105, (100.0% identity in 262 aa overlap). Similar to Q9CBR1|ML1719 from Mycobacterium leprae but also shorter in N-terminus (245 aa), FASTA scores: opt: 1549, E(): 6.6e-91, (90.6% identity in 244 aa overlap). Some weakly similarity with other transferases (C-terminal part shows some similarity to acetyltransferase from Methanococcus jannaschii (214 aa)). Alternative start possible at 3395077 but codon usage not as good. Protein product from Mb3060c detected using SWATH mass spectrometry. Mb3060c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4U7" /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR011004" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U7" /protein_id="SIU01685.1" /translation="MNVLSLGSSSGVVWGRVPITAPAGAATGVTSRADAHSQMRRYAQ TGPTAKLSSAPMTTMWGAPLHRRWRGSRLRDPRQAKFLTLASLKWVLANRAYTPWYLV RYWRLLRFKLANPHIITRGMVFLGKGVEIHATPELAQLEIGRWVHIGDKNTIRAHEGS LRFGDKVVLGRDNVINTYLDIEIGDSVLMADWCYICDFDHRMDDITLPIKDQGIIKSP VRIGPDTWIGVKVSVLRGTTIGRGCVLGSHAVVRGAIPDYSIAVGAPAKVVKNRQLSW EASAAQRAELAAALADIERKKAAR" CDS 3356351..3357433 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3061" /product="FOG- WD40-like repeat" /note="Mb3061, -, len: 360 aa. Equivalent to Rv3035, len: 360 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 360 aa overlap). Conserved hypothetical protein, equivalent to Q9CBR0|ML1720 HYPOTHETICAL PROTEIN from Mycobacterium leprae (364 aa), FASTA scores: opt: 1963, E(): 1.4e-108, (75.8% identity in 363 aa overlap). Protein product from Mb3061 detected using SWATH mass spectrometry. Mb3061 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002372" /db_xref="InterPro:IPR011047" /db_xref="InterPro:IPR015943" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P0" /protein_id="SIU01686.1" /translation="MAAGPALSARGYLALNGQTPAGCSLMEWQNDNNGRQRWCVRLVQ GGGFAGPLFDGFDNLYVGQPGAIISFPPTQWTRWRQPVIGMPSTPRFLGHGRLLVSTH LGQLLVFDTRRGMVVGSPVDLVDGIDPTDATRGLADCAPARPGCPVAAAPAFSSVNGT VVVSVWQPGEPAAKLVGLKYHAEQLVREWTSDAVSAGVLASPVLSADGSTVYVNGRDH RLWALNAADGKAKWSAPLGFLAQTPPALTPHGLIVSGGGPDTALAAFRDAGDHAEGAW RRDDVTALSTASLAGTGVGYTVISGPNHDGTPGLSLLVFDPANGHTVNSYPLPGATGY PVGVSVGNDRRVVTATSDGQVYSFAP" CDS complement(3357430..3358113) /codon_start=1 /transl_table=11 /gene="TB22.2" /locus_tag="BQ2027_MB3062C" /product="PROBABLE CONSERVED SECRETED PROTEIN TB22.2" /note="Mb3062c, TB22.2, len: 227 aa. Equivalent to Rv3036c, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 227 aa overlap). Probable TB22.2, conserved secreted protein, with putative N-terminal signal peptide, highly similar to secreted immunogenic protein MPT64/MPB64 P19996|Rv1980c|MTCY39.39 from Mycobacterium tuberculosis and Mycobacterium bovis (228 aa), FASTA scores: opt: 681, E(): 2.5e-35, (45.8% identity in 227 aa overlap). Protein product from Mb3062c detected using shotgun mass spectrometry. Mb3062c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR021729" /db_xref="InterPro:IPR037126" /db_xref="UniProtKB/TrEMBL:A0A1R3Y336" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01687.1" /translation="MRYLIATAVLVAVVLVGWPAAGAPPSCAGLGGTVQAGQICHVHA SGPKYMLDMTFPVDYPDQRALTDYITQNRDGFVNVAQGSPLRDQPYQMDATSEQHSSG QPPQATRSVVLKFFQDLGGAHPSTWYKAFNYNLATSQPITFDTLFVPGTTPLDSIYPI VQRELARQTGFGAAILPSTGLDPAHYQNFAITDDSLIFYFAQGELLPSFVGACQAQVP RSAIPPLAI" CDS complement(3358186..3359262) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3063C" /product="SAM-dependent methyltransferase" /note="Mb3063c, -, len: 358 aa. Equivalent to Rv3037c, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 358 aa overlap). Conserved hypothetical protein, similar in part to others e.g. O86799|SC6G4.36c from Streptomyces coelicolor (426 aa), FASTA scores: opt: 545, E(): 5.5e-27, (36.15% identity in 354 aa overlap); Q9UZW6|PAB0687 from Pyrococcus abyssi (386 aa), FASTA scores: opt: 262, E(): 3.5e-09, (31.0% identity in 200 aa overlap). Mb3063c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041497" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B9" /protein_id="SIU01688.1" /translation="MRARFGARAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEA LQQATAAPVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMARH NLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPGLGPLLDRYRG RDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLWSAGLAGSGIRRRASILDS GEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAGLVRNYGARHGLWQLDPQIAYLSGDRL PPALRGFEVLEQLAFDERRLRQVLSALDCGAAEILVRGVAIDPDALRRRLRLRGSRPL AVVITRIGAGSLSHVTAYVCRPSR" CDS complement(3359397..3360380) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3064C" /product="Methyltransferase Rv3038c, type 11" /note="Mb3064c, -, len: 327 aa. Equivalent to Rv3038c, len: 327 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 327 aa overlap). Conserved hypothetical protein, equivalent to Q9CBQ9|ML1723 HYPOTHETICAL PROTEIN from Mycobacterium leprae (327 aa), FASTA scores: opt: 1843, E(): 6.1e-108, (80.75% identity in 327 aa overlap). Weak similarity with e.g. Q9KZI3|SCG8A.16 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (199 aa), FASTA scores: opt: 227, E(): 3.9e-07, (31.95% identity in 191 aa overlap) and O52570 METHYLTRANSFERASE from Amycolatopsis mediterranei (272 aa), FASTA scores: opt: 228, E(): 4.3e-07, (31.7% identity in 164 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature but shows no similarity to known LysR family members. Protein product from Mb3064c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3064c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2X3" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X3" /protein_id="SIU01689.1" /translation="MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENY DEKWSISYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGVAR RGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVGHAVLHHIPDV ELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWRVVTNATKLPGLRGWRRPQ GELDESSRAAALEALVDLHTFTPQDLQRIAHNAGAVEVQTATEEFTAAMLGWPLRTFE CTVPPGRLGWGWARFAFTSWKTLGWVDANVWRHVVPKGWFYNVMITGVKPS" CDS complement(3360391..3361155) /codon_start=1 /transl_table=11 /gene="echA17" /locus_tag="BQ2027_MB3065C" /product="PROBABLE ENOYL-COA HYDRATASE ECHA17 (CROTONASE) (UNSATURED ACYL-CoA HYDRATASE) (ENOYL HYDRASE)" /note="Mb3065c, echA17, len: 254 aa. Equivalent to Rv3039c, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). Probable echA17, Enoyl-CoA Hydratase/Isomerase Superfamily member (crotonase) (EC 4.2.1.17). Similar to many e.g. Q9L1E6|SC3D11.16 PUTATIVE ENOYL-COA HYDRATASE from Streptomyces coelicolor (255 aa), FASTA scores: opt: 625, E(): 1.5e-30, (45.55% identity in 224 aa overlap); O07137||ECH8_MYCLE|ML2402|MLCB1306.05c PROBABLE ENOYL-COA HYDRATASE ECHA8 from Mycobacterium leprae (257 aa), FASTA scores: opt: 448, E(): 6.4e-20, (35.3% identity in 235 aa overlap), P97087|CRT CROTONASE / ENOYL-COA HYDRATASE from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (259 aa), FASTA scores: opt: 420, E(): 3.1e-18, (31.2% identity in 234 aa overlap). Also similar to Mycobacterium tuberculosis AAK45356|O53418|Rv1070c|ECHA8|MT1100|MTV017.23c PROBABLE ENOYL-COA HYDRATASE ECHA8 (257 aa), FASTA scores: opt: 450, E(): 4.9e-20, (36.4% identity in 226 aa overlap). BELONGS TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb3065c detected using SWATH mass spectrometry. Mb3065c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXE1" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/Swiss-Prot:Q7TXE1" /protein_id="SIU01690.1" /translation="MPEFVNVVVSDGSQDAGLAMLLLSRPPTNAMTRQVYREVVAAAN ELGRRDDVAAVILYGGHEIFSAGDDMPELRTLSAQEADTAARIRQQAVDAVAAIPKPT VAAITGYALGAGLTLALAADWRVSGDNVKFGATEILAGLIPSGDGMARLTRAAGPSRA KELVFSGRFFDAEEALALGLIDDMVAPDDVYDAAAAWARRFLDGPPHALAAAKAGISD VYELAPAERIAAERRRYVEVFAAGQGGGSKGDRGGR" CDS complement(3361164..3362030) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3066C" /product="NUDIX hydrolase-like protein SCO3573" /note="Mb3066c, -, len: 288 aa. Equivalent to Rv3040c, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 288 aa overlap). Conserved hypothetical protein, highly similar to Q9XA40|SCH17.07c hypothetical protein from Streptomyces coelicolor (312 aa), FASTA scores: opt: 648, E(): 5.2e-34, (50.0% identity in 260 aa overlap). Also similar to Q9F7R7 PREDICTED MUTT SUPERFAMILY HYDROLASE from uncultured proteobacterium EBAC31A08 (264 aa), FASTA scores: opt: 295, E(): 1.3e-11, (27.2% identity in 257 aa overlap); AAK24293|CC2322 hypothetical protein from Caulobacter crescentus (254 aa), BLAST scores: 185 (32% identity) AND 131 (37% identity), etc. Protein product from Mb3066c detected using shotgun mass spectrometry. Mb3066c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y1" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR039121" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y1" /protein_id="SIU01691.1" /translation="MNSPREPLVPPPTPRPAATVMLVRDPDAGSASGLAVFLMRRHAA MDFAAGVMVFPGGGVDDRDRDADLGRLGAWAGPPPQWWAQRFGIEPDLAEALVCAAAR ETFEESGVLFAGPVDQDHSAPNSIVSDASVYGDARRALADRTLSFADFLQREKLVLRS DLLRPWANWVTPEAELTRRYDTYFFVGALPEGQRADGENTESDRAGWVLPADAIADFA AGRNFLLPPTWTQLDSLAGHTVADVLAVERQIVPVQPQLARNGDNWEIEFFDSDRYNQ ARRSGGSTGWPL" CDS complement(3362027..3362890) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3067C" /product="PROBABLE CONSERVED ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb3067c, -, len: 287 aa. Equivalent to Rv3041c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 287 aa overlap). Probable conserved ATP-binding protein ABC transporter (see citation below), equivalent to Q9CBQ7|ML1726 PUTATIVE ABC TRANSPORTER PROTEIN ATP-BINDING PROTEIN from Mycobacterium leprae (305 aa), FASTA scores: opt: 1576, E(): 8.6e-85, (83.4% identity in 289 aa overlap). Also similar to other putative ATP-binding proteins ABC transporters e.g. Q9X9Z4|SCI5.06C from Streptomyces coelicolor (265 aa), FASTA scores: opt: 893, E(): 4.8e-45, (53.3% identity in 257 aa overlap); Q9L156|SC5C11.16c from Streptomyces coelicolor (279 aa), FASTA scores: opt: 680, E(): 1.3e-32, (45.4% identity in 271 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3067c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3067c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2X9" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2X9" /protein_id="SIU01692.1" /translation="MRHDSRVLDNGGPDAADPDLLIDFRNVSLRRNGRTLVGPLDWAV ELDERWVIVGPNGAGKTSLLRIAAAAEHPSSGVAFVLGERLGRVDVSELRARVGLSSS ALAERVPGDERVRDLVVSAGYAVLGRWRERYEAVDYHRAIDMLESLGAEHLANRTYGT LSEGERKRVLIARALMTDPELLLLDEPAAGLDLGGREELVARLADLAADPDAPALVLV THHVEEIPPGFSHCLLLSEARVVAAGLLPDALTAENLSTAFGQEITLEVADGRYFARR RRSRAAHRRQS" CDS complement(3362905..3364134) /codon_start=1 /transl_table=11 /gene="serB2" /locus_tag="BQ2027_MB3068C" /product="PROBABLE PHOSPHOSERINE PHOSPHATASE SERB2 (PSP) (O-PHOSPHOSERINE PHOSPHOHYDROLASE) (PSPASE)" /note="Mb3068c, serB2, len: 409 aa. Equivalent to Rv3042c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 409 aa overlap). Probable serB2, Phosphoserine phosphatase (EC 3.1.3.3), equivalent to Q9CBQ6|ML1727 PUTATIVE PHOSPHOSERINE PHOSPHATASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 2173, E(): 1.3e-117, (86.3% identity in 408 aa overlap). Also similar to other e.g. Q9S281|SCI28.02 from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1209, E(): 3e-62, (51.75% identity in 400 aa overlap); Q9HUK|PA4960 from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 704, E(): 3.1e-33, (40.95% identity in 393 aa overlap); O28142|SERB_ARCTU|AF2138 from Archaeoglobus fulgidus (344 aa), FASTA scores: opt: 671, E(): 2e-31, (37.25% identity in 325 aa overlap); and P06862|SERB_ECOLI (322 aa), FASTA scores: opt: 628, E(): 5.7e-29, (46.8% identity in 235 aa overlap). BELONGS TO THE SERB FAMILY. Protein product from Mb3068c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3068c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y9" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR023190" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y9" /protein_id="SIU01693.1" /translation="MPAKVSVLITVTGMDQPGVTSALFEVLAQHGVELLNVEQVVIRG RLTLGVLVSCPLDVADGTALRDDVASAIHGVGLDVAIERSDDLPIIRQPSTHTIFVLG RPITAGAFSAVAREVAALGVNIDFIRGISDYPVTGLELRVSVPPGCVGPLQIALTKVA AEEHVDVAVEDYGLAWRTKRLIVFDVDSTLVQGEVIEMLAARAGAQGQVAAITEAAMR GELDFAESLQRRVATLAGLPATVIDDVAEQLELMPGARTTIRTLRRLGFRCGVVSGGF RRIIEPLARELMLDFVASNELEIVDGILTGRVVGPIVDRPGKAKALRDFASQYGVPME QTVAVGDGANDIDMLGAAGLGIAFNAKPALREVADASLSHPYLDTVLFLLGVTRGEIE AADAGDCGVRRVEIPAD" CDS complement(3364172..3365893) /codon_start=1 /transl_table=11 /gene="ctaD" /locus_tag="BQ2027_MB3069C" /product="PROBABLE CYTOCHROME C OXIDASE POLYPEPTIDE I CTAD (CYTOCHROME AA3 SUBUNIT 1)" /note="Mb3069c, ctaD, len: 573 aa. Equivalent to Rv3043c, len: 573 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 573 aa overlap). Probable ctaD, integral membrane cytochrome C oxidase polypeptide I (EC 1.9.3.1), equivalent to Q9CBQ5|ML1728 from Mycobacterium leprae (574 aa), FASTA scores: opt: 3738, E(): 3.8e-216, (95.4% identity in 566 aa overlap). Also similar to other CYTOCHROME C OXIDASES POLYPEPTIDE I e.g. Q9AEL9|CTAD from Corynebacterium glutamicum (Brevibacterium flavum) (584 aa), FASTA scores: opt: 3065, E(): 6.8e-176, (72.65% identity in 567 aa overlap); Q9X813|SC6G10.28c from Streptomyces coelicolor (578 aa), FASTA scores: opt: 2888, E(): 2.6e-165, (71.7% identity in 544 aa overlap); Q9K451|CTAD from Streptomyces coelicolor (573 aa), FASTA scores: opt: 2757, E(): 1.8e-157, (70.2% identity in 537 aa overlap). Contains PS00077 Cytochrome c oxidase subunit I, copper B binding region signature. BELONGS TO THE HEME-COPPER RESPIRATORY OXIDASE FAMILY. Protein product from Mb3069c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3069c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63853" /db_xref="InterPro:IPR000883" /db_xref="InterPro:IPR014241" /db_xref="InterPro:IPR023615" /db_xref="InterPro:IPR023616" /db_xref="InterPro:IPR036927" /db_xref="UniProtKB/Swiss-Prot:P63853" /protein_id="SIU01694.1" /translation="MTAEAPPLGELEAIRPYPARTGPKGSLVYKLITTTDHKMIGIMY CVACISFFFIGGLLALLMRTELAAPGLQFLSNEQFNQLFTMHGTIMLLFYATPIVFGF ANLVLPLQIGAPDVAFPRLNAFSFWLFVFGATIGAAGFITPGGAADFGWTAYTPLTDA IHSPGAGGDLWIMGLIVAGLGTILGAVNMITTVVCMRAPGMTMFRMPIFTWNIMVTSI LILIAFPLLTAALFGLAADRHLGAHIYDAANGGVLLWQHLFWFFGHPEVYIIALPFFG IVSEIFPVFSRKPIFGYTTLVYATLSIAALSVAVWAHHMFATGAVLLPFFSFMTYLIA VPTGIKFFNWIGTMWKGQLTFETPMLFSVGFMVTFLLGGLTGVLLASPPLDFHVTDSY FVVAHFHYVLFGTIVFATFAGIYFWFPKMTGRLLDERLGKLHFWLTFIGFHTTFLVQH WLGDEGMPRRYADYLPTDGFQGLNVVSTIGAFILGASMFPFVWNVFKSWRYGEVVTVD DPWGYGNSLEWATSCPPPRHNFTELPRIRSERPAFELHYPHMVERLRAEAHVGRHHDE PAMVTSS" CDS 3366108..3367187 /codon_start=1 /transl_table=11 /gene="fecB" /locus_tag="BQ2027_MB3070" /product="PROBABLE FEIII-DICITRATE-BINDING PERIPLASMIC LIPOPROTEIN FECB" /note="Mb3070, fecB, len: 359 aa. Equivalent to Rv3044, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable fecB, FeIII dicitrate-binding periplasmic lipoprotein (see citation below), equivalent to Q9CBQ4|FECB|ML1729 PUTATIVE FEIII-DICITRATE TRANSPORTER LIPOPROTEIN from Mycobacterium leprae (364 aa), FASTA scores: opt: 1816, E(): 1.1e-96, (75.65% identity in 357 aa overlap); and Q9LA57|FECB from M. avium (364 aa), FASTA scores: opt: 1769, E(): 5.1e-94. Similar to many periplasmic FeIII-dicitrate transporters e.g. P72593|FECB|SLR1319 from Synechocystis sp. strain PCC 6803 (315 aa), FASTA scores: opt: 459, E(): 3.6e-19, (31.35% identity in 303 aa overlap); and P72611|FECB|SLR1492 from Synechocystis sp. strain PCC 6803. N-terminus longer (approximatively 30 aa) to AAK47459 from Mycobacterium tuberculosis strain CDC1551 (327 aa). Has signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3070 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3070 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002491" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V9" /protein_id="SIU01695.1" /translation="MRSTVAVAVAAAVIAASSGCGSDQPAHKASQSMITPTTQIAGAG VLGNDRKPDESCARAAAAADPGPPTRPAHNAAGVSPEMVQVPAEAQRIVVLSGDQLDA LCALGLQSRIVAAALPNSSSSQPSYLGTTVHDLPGVGTRSAPDLRAIAAAHPDLILGS QGLTPQLYPQLAAIAPTVFTAAPGADWENNLRGVGAATARIAAVDALITGFAEHATQV GTKHDATHFQASIVQLTANTMRVYGANNFPASVLSAVGVDRPPSQRFTDKAYIEIGTT AADLAKSPDFSAADADIVYLSCASEAAAERAAVILDSDPWRKLSANRDNRVFVVNDQV WQTGEGMVAARGIVDDLRWVDAPIN" CDS 3367257..3368297 /codon_start=1 /transl_table=11 /gene="adhC" /locus_tag="BQ2027_MB3071" /product="PROBABLE NADP-DEPENDENT ALCOHOL DEHYDROGENASE ADHC" /note="Mb3071, adhC, len: 346 aa. Equivalent to Rv3045, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Probable adhC, NADP-dependent alcohol dehydrogenase (EC 1.1.1.2), equivalent to Q9CBQ3|ADHA|ML1730 ALCOHOL DEHYDROGENASES from Mycobacterium leprae (362 aa), FASTA scores: opt: 1982, E(): 1.3e-111, (85.85% identity in 346 aa overlap); Q9AE96|ADHC from Mycobacterium smegmatis (348 aa), FASTA scores: opt: 1808, E(): 3.4e-101, (78.95% identity in 347 aa overlap); Q9EWF1|SCK13.33c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (346 aa), FASTA scores: opt: 1508, E(): 3.3e-83, (64.45% identity in 346 aa overlap); O06007|ADHA from Bacillus subtilis (349 aa), FASTA scores: opt: 1412, E(): 1.9e-77, (61.8% identity in 335 aa overlap); etc. Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY. HIGH SIMILARITY WITH OTHER BACTERIAL ADH'S. Protein product from Mb3071 detected using shotgun mass spectrometry. Mb3071 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4X1" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A4X1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01696.1" /translation="MSTVAAYAAMSATEPLTKTTITRRDPGPHDVAIDIKFAGICHSD IHTVKAEWGQPNYPVVPGHEIAGVVTAVGSEVTKYRQGDRVGVGCFVDSCRECNSCTR GIEQYCKPGANFTYNSIGKDGQPTQGGYSEAIVVDENYVLRIPDVLPLDVAAPLLCAG ITLYSPLRHWNAGANTRVAIIGLGGLGHMGVKLGAAMGADVTVLSQSLKKMEDGLRLG AKSYYATADPDTFRKLRGGFDLILNTVSANLDLGQYLNLLDVDGTLVELGIPEHPMAV PAFALALMRRSLAGSNIGGIAETQEMLNFCAEHGVTPEIELIEPDYINDAYERVLASD VRYRFVIDISAL" CDS complement(3368286..3368660) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3072C" /product="conserved protein" /note="Mb3072c, -, len: 124 aa. Equivalent to Rv3046c, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Conserved hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. Q50171|ML2258 U296W HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 194, E(): 7.6e-06, (35.9% identity in 103 aa overlap); and O06409|Rv0543c|MTCY25D10.22c from Mycobacterium tuberculosis (100 aa), FASTA scores: opt: 192, E(): 1e-05, (34.7% identity in 98 aa overlap). Protein product from Mb3072c detected using shotgun mass spectrometry. Mb3072c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021784" /db_xref="UniProtKB/TrEMBL:A0A1R3Y345" /protein_id="SIU01697.1" /translation="MTKTFSHPHFFRSVLRWLQVGYPEGVPGPDRVALLSLLRSTPLT EEQIGEVVRHFTENGSPAVADRVIDRDEIAEFISEVTHHDAGPENIQRVAGILAAAGW PLAGVDVGESESGSDRAPASQG" CDS complement(3368994..3369278) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3073C" /product="HYPOTHETICAL PROTEIN" /note="Mb3073c, -, len: 94 aa. Equivalent to Rv3047c, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). Hypothetical unknown protein. Mb3073c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C6" /protein_id="SIU01698.1" /translation="MGGPFDADAEAHFDEVAEAFAKLTNVDRDVGVDLEKELCMTVEA DDRSDALVTRRLLPRVPRCIPLAARLAPGTIGCPSFWNPIATGGASRQAL" CDS complement(3369376..3370350) /codon_start=1 /transl_table=11 /gene="nrdF2" /locus_tag="BQ2027_MB3074C" /standard_name="nrdG" /product="RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE (BETA CHAIN) NRDF2 (RIBONUCLEOTIDE REDUCTASE SMALL SUBUNIT) (R2F PROTEIN)" /note="Mb3074c, nrdF2, len: 324 aa. Equivalent to Rv3048c, len: 324 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 324 aa overlap). nrdF2, ribonucleoside-diphosphate reductase, beta chain (EC 1.17.4.1) (see citation below), equivalent to Q9CBQ2|RIR2_MYCL|NRDF|ML1731 RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE BETA CHAIN from Mycobacterium leprae (325 aa), FASTA scores: opt: 2009, E(): 1.3e-123, (93.5% identity in 324 aa overlap). Also similar to other ribonucleoside-diphosphate reductases e.g. Q9XD62|NRDF from Corynebacterium glutamicum (Brevibacterium flavum) (334 aa), FASTA scores: opt: 1648, E(): 4.2e-100, (78.35% identity in 314 aa overlap); O69274|NRDF from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (329 aa), FASTA scores: opt: 1626, E(): 1.1e-98, (75.3% identity in 320 aa overlap); P37146|NRDF|B2676 from Escherichia coli (319 aa), FASTA scores: opt: 1569, E(): 5.7e-95, (71.3% identity in 317 aa overlap). Contains PS00368 Ribonucleotide reductase small subunit signature. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE SMALL CHAIN FAMILY. COFACTOR: BINDS 2 IRON IONS (BY SIMILARITY). Note that previously known as nrdG. Protein product from Mb3074c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3074c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Y4" /db_xref="InterPro:IPR000358" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012348" /db_xref="InterPro:IPR026494" /db_xref="InterPro:IPR030475" /db_xref="InterPro:IPR033909" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Y4" /protein_id="SIU01699.1" /translation="MTGNAKLIDRVSAINWNRLQDEKDAEVWDRLTGNFWLPEKVPVS NDIPSWGTLTAGEKQLTMRVFTGLTMLDTIQGTVGAVSLIPDALTPHEEAVLTNIAFM ESVHAKSYSQIFSTLCSTAEIDDAFRWSEENRNLQRKAEIVLQYYRGDEPLKRKVAST LLESFLFYSGFYLPMYWSSRAKLTNTADMIRLIIRDEAVHGYYIGYKFQRGLALVDDV TRAELKDYTYELLFELYDNEVEYTQDLYDEVGLTEDVKKFLRYNANKALMNLGYEALF PRDETDVNPAILSALSPNADENHDFFSGSGSSYVIGKAVVTEDDDWDF" CDS complement(3370481..3372055) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3075C" /product="PROBABLE MONOOXYGENASE" /note="Mb3075c, -, len: 524 aa. Equivalent to Rv3049c, len: 524 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 524 aa overlap). Probable monooxygenase (EC 1.-.-.-), similar to several monooxygenases e.g. Q9I3H5|PA1538 PROBABLE FLAVIN-CONTAINING MONOOXYGENASE from Pseudomonas aeruginosa (527 aa), FASTA scores: opt: 1577, E(): 3.9e-90, (47.3% identity in 501 aa overlap); Q9RKB5|SCE87.23c MONOOXYGENASE from Streptomyces coelicolor (519 aa), FASTA scores: opt: 1522, E(): 9.8e-87, (47.4% identity in 485 aa overlap); Q9I218|PA2097 PROBABLE FLAVIN-BINDING MONOOXYGENASE from Pseudomonas aeruginosa (491 aa), FASTA scores: opt: 1366, E(): 4.2e-77, (43.75% identity in 489 aa overlap); etc. Also similar to Q10532|Rv0892|Y892_MYCTU|MT0916|MTCY31.20 PROBABLE MONOOXYGENASE (EC 1.14.13.-) from Mycobacterium tuberculosis strain H37Rv (495 aa), FASTA scores: opt: 1147, E(): 1.5e-63, (38.0% identity in 479 aa overlap). Protein product from Mb3075c detected using SWATH mass spectrometry. Mb3075c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y300" /db_xref="InterPro:IPR020946" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y300" /protein_id="SIU01700.1" /translation="MSIADTAAKPSTPSPANQPPVRTRAVIIGTGFSGLGMAIALQKQ GVDFVILEKADDVGGTWRDNTYPGCACDIPSHLYSFSFEPKADWKHLFSYWDEILGYL KGVTDKYGLRRYIEFNSLVDRGYWDDDECRWHVFTADGREYVAQFLISGAGALHIPSF PEIAGRDEFAGPAFHSAQWDHSIDLTGKRVAIVGTGASAIQIVPEIVGQVAELQLYQR TPPWVVPRTNEELPVSLRRALRTVPGLRALLRLGIYWAQEALAYGMTKRPNTLKIIEA YAKYNIRRSVKDRELRRKLTPRYRIGCKRILNSSTYYPAVADPKTELITDRIDRITHD GIVTADGTGREVFREADVIVYATGFHVTDSYTYVQIKGRHGEDLVDRWNREGIGAHRG ITVANMPNLFFLLGPNTGLGHNSVVFMIESQIHYVADAIAKCDRMGVQALAPTREAQD RFNQELQRRLAGSVWNSGGCRSWYLDEHGKNTVLWCGYTWQYWLTTRSVNPAEYRFFG IGNGLSSDRATVAAAN" CDS complement(3372189..3372929) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3076C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ASNC-FAMILY)" /note="Mb3076c, -, len: 246 aa. Equivalent to Rv3050c, len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Probable transcriptional regulatory protein tetR-family, equivalent but shorter to Q9CBQ1|ML1733 from Mycobacterium leprae (275 aa), FASTA scores: opt: 1381,(E): 2.7e-79, (86.25% identity in 240 aa overlap); AAK44712|MT0489 from Mycobacterium tuberculosis strain CDC1551 (256 aa), FASTA scores: opt: 328,(E): 1.8e-13, (30.75% identity in 234 aa overlap); etc. Also some similarity to O53757|Rv0472c|MTV038.16c. Alternative starts possible at 68052 or 67923. Has potential helix-turn-helix motif at positons 51-72. Protein product from Mb3076c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3076c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Z4" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z4" /protein_id="SIU01701.1" /translation="MVRIPRPHPSAKPGVKVDARSERWREHRKKVRNEIVDAAFRAID RLGPELSVRQIAEEAGTAKPKIYRHFTDKSDLLEAIGMRLRDMLWAAIFPSLDLATDS AREVIRRSVEEYVNLVDQHPNVLRVFIQGRSAKQSEATVRTLNEGREITLAMAEMFNN ELREMELNRAALELAAFAAFGSAASATEWWLGPEPDSPRRMPREQFVAHLTTIMMGVI VGTAEALGIAVDPDQPIHDAVPNNPAVR" CDS complement(3373057..3375138) /codon_start=1 /transl_table=11 /gene="nrdE" /locus_tag="BQ2027_MB3077C" /product="ribonucleoside-diphosphate reductase (alpha chain) nrde (ribonucleotide reductase small subunit) (r1f protein)" /note="Mb3077c, nrdE, len: 693 aa. Equivalent to Rv3051c, len: 693 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 693 aa overlap). nrdE, ribonucleotide-diphosphate reductase, alpha chain (EC 1.17.4.1) (see citations below), equivalent to Q9CBQ0|NRDE|ML1734 from Mycobacterium leprae (693 aa), FASTA scores: opt: 4259,E(): 0, (93.2% identity in 693 aa overlap). Similar to other Ribonucleoside-diphosphate reductases e.g. Q9XD63|NRDE from Corynebacterium glutamicum (Brevibacterium flavum) (707 aa), FASTA scores: opt: 3683,E(): 0, (79.35% identity in 693 aa overlap); O69273|NRDE from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (720 aa), FASTA scores: opt: 3555, E(): 1.7e-214, (76.1% identity in 694 aa overlap); P39452|NRDE|B2675 from Escherichia coli (713 aa), FASTA scores: opt: 3430, E(): 1.1e-206, (73.6% identity in 693 aa overlap); etc. Equivalent to AAK47468|MT3137 from Mycobacterium tuberculosis strain CDC1551 (725 aa) but shorter in N-terminus. Contains PS00089 Ribonucleotide reductase large subunit signature. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE LARGE CHAIN FAMILY. Protein product from Mb3077c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3077c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5W9" /db_xref="InterPro:IPR000788" /db_xref="InterPro:IPR008926" /db_xref="InterPro:IPR013346" /db_xref="InterPro:IPR013509" /db_xref="InterPro:IPR013554" /db_xref="InterPro:IPR026459" /db_xref="InterPro:IPR039718" /db_xref="UniProtKB/Swiss-Prot:P0A5W9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01702.1" /translation="MLNLYDADGKIQFDKDREAAHQYFLQHVNQNTVFFHNQDEKLDY LIRENYYEREVLDQYSRNFVKTLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYL ERFEDRVVMVALTLAAGDTALAELLVDEIIDGRFQPATPTFLNSGKKQRGEPVSCFLL RVEDNMESIGRSINSALQLSKRGGGVALLLTNIREHGAPIKNIENQSSGVIPIMKLLE DAFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITF ELAKRNDDMYLFSPYDVERVYGVPFADISVTEKYYEMVDDARIRKTKIKAREFFQTLA ELQFESGYPYIMFEDTVNRANPIDGKITHSNLCSEILQVSTPSLFNEDLSYAKVGKDI SCNLGSLNIAKTMDSPDFAQTIEVAIRALTAVSDQTHIKSVPSIEQGNNDSHAIGLGQ MNLHGYLARERIFYGSDEGIDFTNIYFYTVLYHALRASNRIAIERGTHFKGFERSKYA SGEFFDKYTDQIWEPKTQKVRQLFADAGIRIPTQDDWRRLKESVQAHGIYNQNLQAVP PTGSISYINHSTSSIHPIVSKVEIRKEGKIGRVYYPAPYMTNDNLEYYEDAYEIGYEK IIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEG TEVEGCVSCML" CDS complement(3375204..3375656) /codon_start=1 /transl_table=11 /gene="nrdI" /locus_tag="BQ2027_MB3078C" /product="PROBABLE NRDI PROTEIN" /note="Mb3078c, nrdI, len: 150 aa. Equivalent to Rv3052c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 150 aa overlap). Probable nrdI, equivalent to Q9CBP9|NRDI|ML1735 from Mycobacterium leprae (138 aa), FASTA scores: opt: 765, E(): 3.8e-44, (79.7% identity in 138 aa overlap), and similar to many NRDI PROTEINS e.g. Q47415|NRDI_ECOLI|B2674 from Escherichia coli (136 aa), FASTA scores: opt: 574, E(): 1.9e-31, (62.2% identity in 135 aa overlap). BELONGS TO THE NRDI FAMILY. Protein product from Mb3078c detected using SWATH mass spectrometry. Mb3078c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65549" /db_xref="InterPro:IPR004465" /db_xref="InterPro:IPR020852" /db_xref="InterPro:IPR029039" /db_xref="UniProtKB/Swiss-Prot:P65549" /protein_id="SIU01703.1" /translation="MDIAGRSLVYFSSVSENTHRFVQKLGIPATRIPLHGRIEVDEPY VLILPTYGGGRANPGLDAGGYVPKQVIAFLNNDHNRAQLRGVIAAGNTNFGAEFCYAG DVVSRKCSVPYLYRFELMGTEDDVAAVRTGLAEFWKEQTCHQPSLQSL" CDS complement(3375691..3375930) /codon_start=1 /transl_table=11 /gene="nrdH" /locus_tag="BQ2027_MB3079C" /product="PROBABLE GLUTAREDOXIN ELECTRON TRANSPORT COMPONENT OF NRDEF (GLUTAREDOXIN-LIKE PROTEIN) NRDH" /note="Mb3079c, nrdH, len: 79 aa. Equivalent to Rv3053c, len: 79 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 79 aa overlap). Probable nrdH, glutaredoxin-like protein, equivalent to Q9CBP8|NRDH|ML1736 from Mycobacterium leprae (80 aa), FASTA scores: opt: 478, E(): 2.7e-27, (91.15% identity in 79 aa overlap), and similar to many glutaredoxin-like proteins e.g. Q9XD65|NRDH from Corynebacterium glutamicum (Brevibacterium flavum) (77 aa), FASTA scores: opt: 382, E(): 1.5e-20, (72.35% identity in 76 aa overlap); and Q56108|NRDH_SALTY from Salmonella typhimurium (81 aa), FASTA scores: opt: 243, E(): 9.9e-11, (45.85% identity in 72 aa overlap). BELONGS TO THE GLUTAREDOXIN FAMILY. Protein product from Mb3079c detected using SWATH mass spectrometry. Mb3079c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y306" /db_xref="InterPro:IPR002109" /db_xref="InterPro:IPR011909" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y306" /protein_id="SIU01704.1" /translation="MTVTVYTKPACVQCSATSKALDKQGIAYQKVDISLDSEARDYVM ALGYLQAPVVVAGNDHWSGFRPDRIKALAGAALTA" CDS complement(3376393..3376947) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3080C" /product="NADPH:quinone oxidoreductase" /note="Mb3080c, -, len: 184 aa. Equivalent to Rv3054c, len: 184 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 184 aa overlap). Conserved hypothetical protein, similar to Q9RD22|SCM1.21 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (187 aa), FASTA scores: opt: 651, E(): 1.5e-33, (56.8% identity in 175 aa overlap). Also shares similarity with other hypothetical proteins and Chromate reductases e.g. AAK56853|CHRR from Pseudomonas putida (186 aa), FASTA scores: opt: 339, E(): 3.3e-14, (38.75% identity in 160 aa overlap). Contains aminotransferases class-II pyridoxal-phosphate attachment site (PS00599) near C-terminus. Protein product from Mb3080c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4W7" /db_xref="InterPro:IPR005025" /db_xref="InterPro:IPR029039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W7" /protein_id="SIU01705.1" /translation="MSDTKSDIKILALVGSLRAASFNRQIAELAAKVAPDGVTVTMFE GLGDLPFYNEDIDTATEVPAPVSALREAASDAHAALVVTPEYNGSIPAVIKNAIDWLS RPFGDGALKDKPLAVIGGSMGRYGGVWAHDETRKSFSIAGTRVVDAIKLSVPFQTLGK SVADDAGLAANVRDAVGNLAAEVG" CDS 3377039..3377653 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3081" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3081, -, len: 204 aa. Equivalent to Rv3055, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Possible transcriptional regulatory protein, similar to Q9RD23|SCM1.20c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (234 aa), FASTA scores: opt: 471, E(): 4.6e-23, (44.9% identity in 187 aa overlap); and with low similarity to other e.g. Q9ADK8|2SCK31.12 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (198 aa), FASTA scores: opt: 208, 2.5e-06, (32.9% identity in 155 aa overlap); Q9ADD9|SCBAC20F6.11c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (199 aa), FASTA scores: opt: 182, E(): 0.00012, (31.0% identity in 184 aa overlap). Contains potential helix-turn-helix motif from aa 48 to 69 (+3.42 SD). SO MAY BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3081 detected using SWATH mass spectrometry. Mb3081 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Q9" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q9" /protein_id="SIU01706.1" /translation="MSGAERLGDLPVFARQEPVPERGDAARNRALLLEAARRLIARSG ADAITMDDVAAAAGVGKGTLFRRFGSRAGLMMVLLDEDERASQQAFLFGPPPLGPDAP PLDRLIAFGRERMRFVHAHHQLLSEANRDPQTRHSAALSVLRTHLRVLLASAPTTGDL DAQTDALLALLDVDYVEHQLNAGGHTLQTLGDAWESLARKLCGR" CDS 3377663..3378703 /codon_start=1 /transl_table=11 /gene="dinP" /locus_tag="BQ2027_MB3082" /product="possible dna-damage-inducible protein p dinp (dna polymerase v) (pol iv 2) (dna nucleotidyltransferase (dna-directed))" /note="Mb3082, dinP, len: 346 aa. Equivalent to Rv3056, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Possible dinP, DNA-damage-inducible protein, similar to other e.g. AAK45855|MT1589 from Mycobacterium tuberculosis strain CDC1551 (485 aa), FASTA scores: opt: 620, E(): 6.1e-32, (37.2% identity in 344 aa overlap); BAB49140|MLR1877 from Rhizobium loti (Mesorhizobium loti) (415 aa), FASTA scores: opt: 533, E(): 1.8e-26, (34.35% identity in 358 aa overlap); and BAB54888|MLL9709 from Rhizobium loti (Mesorhizobium loti) (361 aa), FASTA scores: opt: 532, E(): 1.8e-26, (35.35% identity in 348 aa overlap). Extensive similarity to proteins induced by DNA damage such as dinP, mucB, umuC. Mb3082 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63988" /db_xref="InterPro:IPR001126" /db_xref="InterPro:IPR017961" /db_xref="InterPro:IPR022880" /db_xref="InterPro:IPR036775" /db_xref="UniProtKB/Swiss-Prot:P63988" /protein_id="SIU01707.1" /translation="MPTAAPRWILHVDLDQFLASVELLRHPELAGLPVIVGGNGDPTE PRKVVTCASYEARAYGVRAGMPLRTAARRCPEATFLPSNPAAYNAASEEVVALLRDLG YPVEVWGWDEAYLAVAPGTPDDPIEVAEEIRKVILSQTGLSCSIGISDNKQRAKIATG LAKPAGIYQLTDANWMAIMGDRTVEALWGVGPKTTKRLAKLGINTVYQLAHTDSGLLM STFGPRTALWLLLAKGGGDTEVSAQAWVPRSRSHAVTFPRDLTCRSEMESAVTELAQR TLNEVVASSRTVTRVAVTVRTATFYTRTKIRKLQAPSTDPDVITAAARHVLDLFELDR PVRLLGVRLELA" CDS complement(3378757..3379620) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3083C" /product="PROBABLE SHORT CHAIN ALCOHOL DEHYDROGENASE/REDUCTASE" /note="Mb3083c, -, len: 287 aa. Equivalent to Rv3057c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 287 aa overlap). Probable oxidoreductase, probably short-chain alcohol dehydrogenase/reductase (EC 1.1.-.-). Equivalent to Q9CBP7|ML1740 POSSIBLE SHORT CHAIN DEHYDROGENASES/REDUCTASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 1563, E(): 6e-89, (81.8% identity in 280 aa overlap). Also similar to many oxidoreductases e.g. Q9ZBX8|SCD78.21c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (585 aa), FASTA scores: opt: 541, E(): 6.7e-26, (37.25% identity in 263 aa overlap); AAK47506|MT3170 OXIDOREDUCTASE, SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY from Mycobacterium tuberculosis strain CDC1551 (276 aa), FASTA scores: opt: 521, E(): 6.1e-25, (36.25% identity in 276 aa overlap); AAK45541|MT1283 OXIDOREDUCTASE, SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY from Mycobacterium tuberculosis strain CDC1551 (276 aa), FASTA scores: opt: 471, E(): 7.2e-22, (32.4% identity in 281 aa overlap). Also similar to O50460|Rv1245c|MTV006.17C DEHYDROGENASE (276 aa). Contains short-chain alcohol dehydrogenase family signature (PS00061). MAY BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3083c detected using SWATH mass spectrometry. Mb3083c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3D4" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D4" /protein_id="SIU01708.1" /translation="MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDR DRDGLAQTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGVSA WGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSSAAGLVGLPWH AAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPLVNTVEIAGVDRDDPRVNR WVERFSGHAVTPEKAADKILAGVTRNRYLVYTSADIRALYAFKRYAWWPYTLVMRRVN VFFTRALRPGP" CDS complement(3379684..3380334) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3084C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3084c, -, len: 216 aa. Equivalent to Rv3058c, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). Possible transcriptional regulatory protein, tetR-family, showing reasonable similarity to others e.g. AAK48337|MT3970 from Mycobacterium tuberculosis strain CDC1551 (216 aa), FASTA scores: opt: 261, E(): 2.8e-10, (31.7% identity in 221 aa overlap); Q49962|ML1070|U1756B from Mycobacterium leprae (217 aa), FASTA scores: opt: 234, E(): 1.8e-08, (27.2% identity in 195 aa overlap); Q9CDD3|ML0064 from Mycobacterium leprae (214 aa), FASTA scores: opt: 199, E(): 3.6e-06, (25.65% identity in 195 aa overlap); O66121|CPRS from Streptomyces coelicolor (215 aa), FASTA scores: opt: 183, E(): 4.2e-05, (26.0% identity in 196 aa overlap). Equivalent to AAK47476|MT3144 from Mycobacterium tuberculosis strain CDC1551 (237 aa) but N-terminus shorter 21 residues. Start was predicted by TBparse but alternatives (ATG) are possible. COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3084c detected using shotgun mass spectrometry. Mb3084c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Z1" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z1" /protein_id="SIU01709.1" /translation="MTSHAADEKQAAPPMRRRGDRHRQAILRAARELLEETPFAELSV RAISLRAGVARSGFYFYFDSKYSVLAQILAEATEELEEASQHFSARQPGESPEQFVNR MIGSVAAVYANNDPVLRACNAARQSDMEIRDILERQFQVLLRETIGVFEAEVKAGTAH PISEDLPTLVRTLAATTALMLTGDALLVGPDSDAARRVRVLEQMWLNALWGGGKAP" CDS 3380450..3381928 /codon_start=1 /transl_table=11 /gene="cyp136" /locus_tag="BQ2027_MB3085" /product="PROBABLE CYTOCHROME P450 136 CYP136" /note="Mb3085, cyp136, len: 492 aa. Equivalent to Rv3059, len: 492 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 492 aa overlap). Probable cyp136, cytochrome P450 136 (EC 1.14.-.-), similar to other cytochrome P450-dependent oxidases e.g. Q59990|CYP120|CYP|SLR0574 PUTATIVE CYTOCHROME P450 120 from Synechocystis sp. strain PCC 6803 (444 aa), FASTA scores: opt: 579, E(): 1.5e-29, (27.3% identity in 443 aa overlap); Q64654|CYP51|CP51_RAT CYTOCHROME P450 51 (EC 1.14.14.-) (LANOSTEROL 14-ALPHA DEMETHYLASE) from Rattus norvegicus (Rat) (503 aa), FASTA scores: opt: 549, E(): 1.4e-27, (26.2% identity in 458 aa overlap); Q9JIY3|CYP51 LANOSTEROL 14-ALPHA-DEMETHYLASE from Mus musculus (Mouse) (486 aa), FASTA scores: opt: 546, E(): 2.1e-27, (25.75% identity in 458 aa overlap). Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb3085 detected using SWATH mass spectrometry. Mb3085 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y314" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002403" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/TrEMBL:A0A1R3Y314" /protein_id="SIU01710.1" /translation="MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKK LAEPPPGSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPGVA ALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRRIMQEAFVRSR LAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDIASMVFMGHEPGTDHELVT KVNKAFTITTRAGNAVIRTSVPPFTWWRGLRARELLENYFTARVKERREASGNDLLTV LCQTEDDDGNRFSDADIVNHMIFLMMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESD RHGDGPLDIESLEQLESLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIA YPGMNHRLPEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFGQLEIK TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR" CDS complement(3382699..3384171) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3086C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY GNTR-FAMILY)" /note="Mb3086c, -, len: 490 aa. Equivalent to Rv3060c, len: 490 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 490 aa overlap). Probable transcriptional regulatory protein, showing reasonable similarity to several members of the GntR family e.g. BAB54431|MLL8575 from Rhizobium loti (Mesorhizobium loti) (247 aa), FASTA scores: opt: 274, E(): 3.5e-10, (30.35% identity in 224 aa overlap); P96570|ESMR from Burkholderia cepacia (Pseudomonas cepacia) (277 aa), FASTA scores: opt: 229, E(): 2.8e-07, (25.85% identity in 240 aa overlap); Q9S276|SCI28.07 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 211, E(): 3.4e-06, (27.25% identity in 220 aa overlap); etc. Seems to have two domains: residues 1-260 resemble UxuR, and 260-490 resemble PdhR, ExuR, etc. Contains bacterial regulatory proteins, GntR family signature (PS00043). Helix-turn-helix motif (+3.13 SD) at aa 38-59. SEEMS TO BELONG TO THE GNTR FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb3086c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y304" /db_xref="InterPro:IPR000524" /db_xref="InterPro:IPR008920" /db_xref="InterPro:IPR011711" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y304" /protein_id="SIU01711.1" /translation="MSTEPDAVWTDKRASKIARRIEADIVRRGWPIGASLGSESALQQ RFCVSRSVLREAVRLVEHHQVARMRRGPNGGLFICEPNAGPATRAVVIYLEYLGTTIG DLLGARLVLEPLAASLAAEHIDEPGIERLRAVLRAEERWRPGLPPPPEQFYRVLAEQS KNPVLQLFIDILMRLTKRYVQKSGTQSAGEAVEAAGQVHNEHSDIVAAVTAGDSAWAK TLSERHVEAVAGWLQQHQRGNDAAVRNGGRAREPRRAQQLILGAPRGKLAEVLAATIG DDIAASGWQVGSVFGTETALLERYQVSRAVLREAVRLLEYHAIAHMRRGPGGGLVVTT PQPQASIDTIALYLQYRKPSREDLRCVRDAIEIDNVAKVVKRRSEPEVASFLDTLGRP RLDNPTDDVRAAAVEEFRFHVGLARAAGNTMLDLFLLILVELFRRHLSSTEQALPTWS DVVAVGHAHVRILEAIGSGDDSLARCRTRRHLDAAASWWL" CDS complement(3384220..3386022) /codon_start=1 /transl_table=11 /gene="fadE22b" /locus_tag="BQ2027_MB3087C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE22b [SECOND PART]" /note="Mb3087c, fadE22b, len: 600 aa. Equivalent to 3' end of Rv3061c, len: 721 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 600 aa overlap). Probable fadE22, Acyl-CoA Dehydrogenase (EC 1.3.99.-), similar to many e.g. AAK44503|MT0284 from Mycobacterium tuberculosis strain CDC1551 (731 aa), FASTA scores: opt: 1804, E(): 1.1e-101, (43.45% identity in 743 aa overlap); AAK48037|MT3678 from Mycobacterium tuberculosis strain CDC1551 (711 aa), FASTA scores: opt: 1630, E(): 3.9e-91, (42.55% identity in 733 aa overlap); and extensive similarity in C-terminal part to many acyl-CoA dehydrogenases e.g. Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 767, E(): 4.8e-39, (36.7% identity in 376 aa overlap). Also similar to many hypothetical proteins. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, fadE22 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits fadE22 into 2 parts, fadE22a and fadE22b. Protein product from Mb3087c detected using SWATH mass spectrometry. Mb3087c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y2Z8" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y2Z8" /protein_id="SIU01712.1" /translation="MGLDSQVQVTDGVADGEAGIVLGAGLAELLLVAAGDDVLVLERG RKGVSVDVPENFDPTRRSGRVRLDNVRVTTDDILLGAYESALARARTLLAAEAVGGAA DCVDSAVAYAKVRQQFGRTIATFQAVKHHCANMLVAAESAIAAVWDAARAAAEDEEQF RLAAAVAAALAFPAYARNAELNIQVHGGIGFTWEHDAHLHLRRALVTVGLFGGDAPVR DVFERTAAGVTRAISLDLPAQAEELRARIRSDAAEIAALEKDAQRDKLIETGYVMPHW PRPWGRAAGAVEQLVIEEEFSAAGIERPDYSITGWVILTLIQHGTPWQIERFVEKALR QQEIWCQLFSEPDAGSDAASVKTRATRVEGGWKINGQKVWTSGAQYCARGLATVRTDP DAPKHAGITTVIIDMLAPGVEVRPLRQITGDSEFNEVFFNDVFVPDEDVVGAPNSGWT VARATLGNERVSIGGSGSYYEAMAAKLVQLVQRRSDAFAGAPIRVGAFLAEDHALRLL NLRRAARSVEGAGPGPEGNITKLKVAEHMIEGAAIAAALWGPEIALLDGPGRVIGRTV MGARGMAIAGGTSEVTRNQIAERILGMPRDPLIS" CDS complement(3386042..3386386) /codon_start=1 /transl_table=11 /gene="fadE22a" /locus_tag="BQ2027_MB3088C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE22a [FIRST PART]" /note="Mb3088c, fadE22a, len: 114 aa. Equivalent to 5' end of Rv3061c, len: 721 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 111 aa overlap). Probable fadE22, Acyl-CoA Dehydrogenase (EC 1.3.99.-), similar to many e.g. AAK44503|MT0284 from Mycobacterium tuberculosis strain CDC1551 (731 aa), FASTA scores: opt: 1804, E(): 1.1e-101, (43.45% identity in 743 aa overlap); AAK48037|MT3678 from Mycobacterium tuberculosis strain CDC1551 (711 aa), FASTA scores: opt: 1630, E(): 3.9e-91, (42.55% identity in 733 aa overlap); and extensive similarity in C-terminal part to many acyl-CoA dehydrogenases e.g. Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 767, E(): 4.8e-39, (36.7% identity in 376 aa overlap). Also similar to many hypothetical proteins. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, fadE22 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c) splits fadE22 into 2 parts, fadE22a and fadE22b. Mb3088c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y307" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y307" /protein_id="SIU01713.1" /translation="MGIALTDDHRELSGVARAFLTSQKVRWAARASLDAAGDARPPFW QNLAELGWLGLHIDERHGGSGYGLSELVVVIEELGRAVAPGLFVPTVIASAVVAKEGT DDQRARLLPGAD" CDS 3386543..3388066 /codon_start=1 /transl_table=11 /gene="ligB" /locus_tag="BQ2027_MB3089" /product="probable atp-dependent dna ligase ligb (polydeoxyribonucleotide synthase [atp]) (polynucleotide ligase [atp]) (sealase) (dna repair protein) (dna joinase)" /note="Mb3089, ligB, len: 507 aa. Equivalent to Rv3062, len: 507 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 507 aa overlap). Probable ligB, DNA ligase ATP-dependent (EC 6.5.1.1), highly similar to numerous archaebacterial and eukaryotic polynucleotide DNA ligases, e.g. Q9FCB1|DNLI_STRCO|LIG|2SCG58.02 from Streptomyces coelicolor (512 aa), FASTA scores: opt: 1677, E(): 2.5e-90, (55.65% identity in 512 aa overlap); Q9HR35|DNLI_HALN1|LIG|VNG0881G from Halobacterium sp. strain NRC-1 (561 aa), FASTA scores: opt: 985, E(): 5.6e-50, (42.25% identity in 440 aa overlap); Q9V185|DNLI_PYRAB|LIG|PAB2002 from Pyrococcus abyssi (559 aa), FASTA scores: opt: 978, E(): 1.4e-49, (39.05% identity in 443 aa overlap); etc. Also similar to Rv3731|MTV025.079|LIGC POSSIBLE DNA LIGASE from M. tuberculosis (358 aa). Similarity at N-terminus is poor so first start codon was taken. Contains (PS00697) ATP-dependent DNA ligase AMP-binding site signature, and (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-DEPENDENT DNA LIGASE FAMILY. Protein product from Mb3089 detected using SWATH mass spectrometry. Mb3089 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TTR7" /db_xref="InterPro:IPR000977" /db_xref="InterPro:IPR012308" /db_xref="InterPro:IPR012309" /db_xref="InterPro:IPR012310" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR016059" /db_xref="InterPro:IPR022865" /db_xref="InterPro:IPR036599" /db_xref="UniProtKB/Swiss-Prot:Q7TTR7" /protein_id="SIU01714.1" /translation="MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIV SWLSGELPQRHIGVGWAALRSLPPPAPQPALTVTGVDATLSKIGTLSGKGSQAQRAAL VAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATVQRAAMLGGDL AAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALERHGGTTIFEAKLDGARVQ IHRANDQVRIYTRSLDDVTARLPEVVEATLALPVRDLVADGEAIALCPDNRPQRFQVT ASRFGRSVDVAAARATQPLSVFFFDILHRDGTDLLEAPTTERLAALDALVPARHRVDR LITSDPTDAANFLDATLAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAV EWGSGRRRGKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDG YVVQLRPEQVVEVALDGVQRSSRYPGGLALRFARVVRYRADKDPAEADTIDAVRALY" CDS 3388202..3390478 /codon_start=1 /transl_table=11 /gene="cstA" /locus_tag="BQ2027_MB3090" /product="PROBABLE CARBON STARVATION PROTEIN A HOMOLOG CSTA" /note="Mb3090, cstA, len: 758 aa. Equivalent to Rv3063, len: 758 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 758 aa overlap). Probable cstA, integral membrane starvation-induced stress response protein, similar to other e.g. P15078|CSTA_ECOLI|B0598 from Escherichia coli strain K12 (701 aa), FASTA scores: opt: 2357, E(): 9.5e-137, (51.25% identity in 712 aa overlap); AAG54933|CSTA from Escherichia coli strain O157:H7 EDL933 (701 aa), FASTA scores: opt: 2356, E(): 1.1e-136, (51.1% identity in 712 aa overlap); etc. Predicted to be membrane associated. Similarity suggests start at GTG at 16801 in Y22D7 but no RBS obvious so TBparse-predicted start at 16881 taken. BELONGS TO THE CSTA FAMILY. Mb3090 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4W9" /db_xref="InterPro:IPR003706" /db_xref="InterPro:IPR025299" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W9" /protein_id="SIU01715.1" /translation="MAAPTPSNRIEERSGHASCVRADADLPPVAILGRSPITLRHKIF FVAVAVIGALAWTVVAFFRNEPVNAVWIVVAAGCTYIIGFRFYARLIEMKVVRPRDDH ATPAEILDDGTDYVPTDRRVVFGHHFAAIAGAGPLVGPVLATQMGYLPSSIWIVVGAV LAGCVQDYLVLWISVRRRGRSLGQMVRDELGATAGVAALVGIPVIITIVIAVLALVVV RALAKSPWGVFSIAMTIPIAIFMGCYLRFLRPGRVSEVSLIGIGLLLLAVVSGDWVAH TSWGAAWFSLSPVTLCWLLISYGFAASVLPVWLLLAPRDYLSTFMKVGTIALLAIGVC AAHPIIEAPAVSKFAGSGNGPVFAGSLFPFLFITIACGALSGFHALICSGTTPKMLEK EGQMRVIGYCGMMTESFVAVIALLTAAILDQHLYFTLNAPSLHTHDSAATAAKYVNGL GLTGSPVTPDHISQAAASVGEQTIVSRTGGAPTLAFGMAEMLHRVVGGVGLKAFWYHF AIMFEALFILTTVDAGTRAARFMISDALGNFGGVLRKLQNPSWRPGAWACSLVVVAAW GSILLLGVTDPLGGINTLFPLFGIANQLLAGIAPTVITVVVIKKGRLKWAWIPGIPLL WDLAVTLTASWQKIFSADPSVGYWTQHAHYAAAQHAGETAFGSATNADEINDVVRNTF VQGTLSIVFVVVVVLVVVAGVIVALKTIRGRGIPLAEDDPAPSTLFAPAGLIPTAAER KLQRRLGAPASASVAAPD" CDS complement(3390784..3391209) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3091C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3091c, -, len: 141 aa. Equivalent to Rv3064c, len: 141 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 141 aa overlap). Probable conserved integral membrane protein, similar to many e.g. Q9KY40|SCC8A.08 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 391, E(): 2.4e-18, (48.45% identity in 130 aa overlap); Q9K461|SC2H12.23c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 339, E(): 5.1e-15, (46.7% identity in 124 aa overlap); BAB48975|MLR1652 hypothetical protein from Rhizobium loti (Mesorhizobium loti) (130 aa), FASTA scores: opt: 319, E(): 8.7e-14, (41.45% identity in 123 aa overlap); Q9JR31|NMA2196|NMB0291 CONSERVED HYPOTHETICAL INNER MEMBRANE PROTEIN from Neisseria meningitidis serogroup A and B (132 aa), FASTA scores: opt: 303, E(): 9.4e-13, (43.65% identity in 126 aa overlap); etc. Mb3091c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3R9" /db_xref="InterPro:IPR032808" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R9" /protein_id="SIU01716.1" /translation="MVKDLDRRLAGCLPAVLSLFRLVYGLLFAGYGSMILFGWPVTSA QPVEFGSWPGWYAGVIELVAGLLIATGLFTRAVAFVASGEMAVAYFWMHQPYALWPIG GPPDGNGGTPAILFCFGFFLLVFTGGGIYSIDARRTVTA" CDS 3391346..3391669 /codon_start=1 /transl_table=11 /gene="mmr" /locus_tag="BQ2027_MB3092" /standard_name="emrE" /product="MULTIDRUGS-TRANSPORT INTEGRAL MEMBRANE PROTEIN MMR" /note="Mb3092, mmr, len: 107 aa. Equivalent to Rv3065, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 107 aa overlap). mmr, integral membrane multidrugs resistance transporter (see citation below), equivalent to Q9CBP1|ML1756 PROBABLE MULTIDRUG RESISTANCE PROTEIN from Mycobacterium leprae (107 aa), FASTA scores: opt: 534, E(): 3.3e-28, (77.55% identity in 107 aa overlap). Also highly similar to bacterial proteins involved in resistance to ethidium bromide or methyl viologen e.g. O87866|QACG_STASP QUATERNARY AMMONIUM COMPOUND-RESISTANCE PROTEIN QACG (QUARTERNARY AMMONIUM DETERMINANT G) from Staphylococcus sp. strain ST94 (107 aa), FASTA scores: opt: 307, E(): 1.8e-13, (39.8% identity in 103 aa overlap); P96460|QAC QUATERNARY AMMONIUM COMPOUNDS RESISTANCE PROTEIN QAC from Staphylococcus aureus (107 aa), FASTA scores: opt: 304, E(): 2.8e-13, (40.4% identity in 104 aa overlap); Q57225|QACE_ECOLI QUATERNARY AMMONIUM COMPOUND-RESISTANCE PROTEIN QACE (QUARTERNARY AMMONIUM DETERMINANT E) from Escherichia coli (110 aa), FASTA scores: opt: 300, E(): 5.2e-13, (48.15% identity in 108 aa overlap); AAG55967|Z1870 METHYLVIOLOGEN RESISTANCE PROTEIN ENCODED WITHIN PROPHAGE CP-933X from Escherichia coli strain O157:H7 EDL933 (110 aa); P23895|EMRE|MVRC|EB|B0543 EMRE PROTEIN from Escherichia coli (110 aa), FASTA scores: opt: 290, E(): 2.3e-12, (43.55% identity in 101 aa overlap); etc. Also similar to the SugE protein of enteric bacteria. BELONGS TO THE SMALL MULTIDRUG RESISTANCE (SMR) PROTEIN FAMILY. Note that previously known as emrE. Mb3092 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P69927" /db_xref="InterPro:IPR000390" /db_xref="UniProtKB/Swiss-Prot:P69927" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01717.1" /translation="MIYLYLLCAIFAEVVATSLLKSTEGFTRLWPTVGCLVGYGIAFA LLALSISHGMQTDVAYALWSAIGTAAIVLVAVLFLGSPISVMKVVGVGLIVVGVVTLN LAGAH" CDS 3391666..3392274 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3093" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY DEOR-FAMILY)" /note="Mb3093, -, len: 202 aa. Equivalent to Rv3066, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Probable transcriptional regulatory protein deoR-family, with some similarity to transcriptional regulators and hypothetical proteins, e.g. Q9X9V5|SCI7.35c HYPOTHETICAL 21.1 KDA PROTEIN from Streptomyces coelicolor (197 aa), FASTA scores: opt: 398, E(): 5.7e-19, (40.3% identity in 191 aa overlap); AAG55222|Z1073 PUTATIVE DEOR-TYPE TRANSCRIPTIONAL REGULATOR from Escherichia coli strain O157:H7 EDL933 (178 aa), FASTA scores: opt: 257, E(): 7.9e-10, (28.4% identity in 176 aa overlap); Q9HXU1|PA3699 PROBABLE TRANSCRIPTIONAL REGULATOR (TETR/ACRR FAMILY) from Pseudomonas aeruginosa (237 aa), FASTA scores: opt: 229, E(): 6.7e-08, (32.1% identity in 187 aa overlap); etc. Protein product from Mb3093 detected using SWATH mass spectrometry. Mb3093 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3E4" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E4" /protein_id="SIU01718.1" /translation="MTAGSDRRPRDPAGRRQAIVEAAERVIARQGLGGLSHRRVAAEA NVPVGSTTYYFNDLDALREAALAHAANASADLLAQWRSDLDKDRDLAATLARLTTVYL ADQDRYRTLNELYMAAAHRPELQRLARLWPDGLLALLEPRIGRRAANAVTVFFDGATL HALITGTPLSTDELTDAIARLVADGPEQREVGQSAHAGRTPD" CDS 3392387..3392797 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3094" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3094, -, len: 136 aa. Equivalent to Rv3067, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Conserved hypothetical protein, weakly similar to other mycobacterium proteins e.g. O53953|Rv1804c|MTV049.26c (108 aa), FASTA scores: opt: 183, E(): 0.00053, (36.6% identity in 82 aa overlap); O07222|Rv1810|MTCY16F9.04c (118 aa), FASTA scores: opt: 149, E(): 0.05, (30.95% identity in 84 aa overlap). Has hydrophobic stretch at N-terminus. Start chosen on basis of codon usage but upstream ATG also possible. Mb3094 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3Y301" /protein_id="SIU01719.1" /translation="MLTVGVGIGAAILLGWFTLAHRHPDQPGAAATPPPAGLTTRSAP TAAPPSTLQSPDLDSVFLGNLHDRGISFTNPDAAVYNGKMVCTNLGGGMTVQQVVEAL QSSSPALGDRTTAYVAVSIRTYCPKYDAVLPPGS" tRNA complement(3392798..3392870) /locus_tag="BQ2027_ALAU" /product="tRNA-Ala" /note="alaU, len: 73 nt. Equivalent to alaU, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Ala, anticodon ggc." CDS complement(3392938..3394581) /codon_start=1 /transl_table=11 /gene="pgmA" /locus_tag="BQ2027_MB3095C" /product="PROBABLE PHOSPHOGLUCOMUTASE PGMA (GLUCOSE PHOSPHOMUTASE) (PGM)" /note="Mb3095c, pgmA, len: 547 aa. Equivalent to Rv3068c, len: 547 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 547 aa overlap). Probable pgmA, phosphoglucomutase (EC 5.4.2.2), highly similar to other phosphoglucomutases e.g. Q9L117|PGM from Streptomyces coelicolor (546 aa), FASTA scores: opt: 2569, E(): 2.8e-149, (71.4% identity in 545 aa overlap); Q9ABY5|CC0085 from Caulobacter crescentus (545 aa), FASTA scores: opt: 2465, E(): 6.2e-143, (70.4% identity in 541 aa overlap); P38569|PGMU_ACEXY|CELB from Acetobacter xylinum (555 aa), FASTA scores: opt: 2206, E(): 4e-127, (62.25% identity in 543 aa overlap); P74643|PGM|SLL0726 from Synechocystis sp. strain PCC 6803 (567 aa), FASTA scores: opt: 2168, E(): 8.5e-125, (60.0% identity in 550 aa overlap); P36938|PGMU_ECOLI|PGM|B0688 from Escherichia coli (546 aa), FASTA scores: opt: 2111, E(): 2.5e-121, (58.2% identity in 550 aa overlap). Also similar to other phosphomannomutases. Has phosphoglucomutase and phosphomannomutase signature (PS00710) and ATP/GTP-binding site motif A (P-loop) (PS00017). BELONGS TO THE PHOSPHOHEXOSE MUTASES FAMILY. Protein product from Mb3095c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3095c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y324" /db_xref="InterPro:IPR005843" /db_xref="InterPro:IPR005844" /db_xref="InterPro:IPR005845" /db_xref="InterPro:IPR005846" /db_xref="InterPro:IPR005852" /db_xref="InterPro:IPR016055" /db_xref="InterPro:IPR016066" /db_xref="InterPro:IPR036900" /db_xref="UniProtKB/TrEMBL:A0A1R3Y324" /protein_id="SIU01720.1" /translation="MVANPRAGQPAQPEDLVDLPHLVTAYYSIEPDPDDLAQQVAFGT SGHRGSALTGTFNELHILAITQAIVEYRAAQGTTGPLFIGRDTHGLSEPAWVSALEVL AANQVVAVVDSRDRYTPTPAISHAILTYNRGRTEALADGIVVTPSHNPPSDGGIKYNP PNGGPADTAATTAIAKRANEILLARSMVKRLPLARALRTAQRHDYLGHYVDDLPNVVD IAAIREAGVRIGADPLGGASVDYWGEIAHRHGLDLTVVNPLVDATWRFMTLDTDGKIR MDCSSPDAMAGLIRTMFGNRERYQIATGNDADADRHGIVTPDEGLLNPNHYLAVAIEY LYTHRPSWPAGIAVGKTVVSSSIIDRVVAGIGRQLVEVPVGFKWFVDGLIGATLGFGG EESAGASFLRRDGSVWTTDKDGIIMALLAAEILAVTGATPSQRYHALAGEYGGPCYAR IDAPADREQKARLARLSADQVSATELAGEPITAKLTTAPGNGAALGGLKVTTANAWFA ARPSGTEDVYKIYAESFRGPQHLVEVQQTAREVVDRVIG" CDS 3394651..3395049 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3096" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3096, -, len: 132 aa. Equivalent to Rv3069, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 132 aa overlap). Probable conserved transmembrane protein, similar to several hypothetical and CRCB bacterial proteins e.g. Q9A6V2|CC1981 CRCB PROTEIN (see citation below; seems to be involved in camphor resistance and chromosome condensation, promoting or protecting chromosome folding) from Caulobacter crescentus (127 aa), FASTA scores: opt: 275, E(): 1.6e-11, (41.1% identity in 124 aa overlap); Q9FC39|SC4G1.10 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 258, E(): 2.5e-10, (42.15% identity in 121 aa overlap); Q9V0X2|PAB1925 CRCB PROTEIN (see citation below) from Pyrococcus abyssi (123 aa), FASTA scores: opt: 256, E(): 2.8e-10, (39.8% identity in 113 aa overlap); O59171|PH1502 HYPOTHETICAL 13.6 KDA PROTEIN from Pyrococcus horikoshii (123 aa), FASTA scores: opt: 249, E(): 8.2e-10, (38.65% identity in 119 aa overlap); etc. Mb3096 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63863" /db_xref="InterPro:IPR003691" /db_xref="UniProtKB/Swiss-Prot:P63863" /protein_id="SIU01721.1" /translation="MPNHDYRELAAVFAGGALGALARAALSALAIPDPARWPWPTFTV NVVGAFLVGYFTTRLLERLPLSSYRRPLLGTGLCGGLTTFSTMQVETISMIEHGHWGL AAAYSVVSITLGLLAVHLATVLVRRVRIRR" CDS 3395046..3395426 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3097" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3097, -, len: 126 aa. Equivalent to Rv3070, len: 126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 126 aa overlap). Probable conserved integral membrane protein, similar to several hypothetical and CRCB bacterial proteins e.g. Q9FC37|SC4G1.12 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (124 aa), FASTA scores: opt: 280, E(): 3.1e-11, (45.3% identity in 117 aa overlap); O25823|HP1225 CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN from Helicobacter pylori (Campylobacter pylori) (130 aa), FASTA scores: opt: 225, E(): 1e-07, (33.35% identity in 123 aa overlap); O07590|YHDU HYPOTHETICAL 12.4 KDA PROTEIN from Bacillus subtilis (118 aa), FASTA scores: opt: 224, E(): 1.1e-07, (37.85% identity in 111 aa overlap); Q9KVS9|VC0060 CRCB PROTEIN (see citation below; seems involved in camphor resistance and chromosome condensation, promoting or protecting chromosome folding) from Vibrio cholera (126 aa), FASTA scores: opt: 221, E(): 1.8e-07, (33.35% identity in 126 aa overlap); etc. Mb3097 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63865" /db_xref="InterPro:IPR003691" /db_xref="UniProtKB/Swiss-Prot:P63865" /protein_id="SIU01722.1" /translation="MTASTALTVAIWIGVMLIGGIGSVLRFLVDRSVARRLARTFPYG TLTVNITGAALLGFLAGLALPKDAALLAGTGFVGAYTTFSTWMLETQRLGEDRQMVSA LANIVVSVVLGLAAALLGQWIAQI" CDS 3395423..3396532 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3098" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3098, -, len: 369 aa. Equivalent to Rv3071, len: 369 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 369 aa overlap). Conserved hypothetical protein, weakly similar in N-terminus of Q9A4V0|CC2725 HYPOTHETICAL PROTEIN CC2725 from Caulobacter crescentus (113 aa), FASTA scores: opt: 141, E(): 0.031, (27.6% identity in 105 aa overlap). C-terminal region also weakly similar to other hypothetical proteins e.g. Q9FC38|YG11_STRCO from Streptomyces coelicolor (114 aa), FASTA scores: opt: 151, E(): 0.007, (31.65% identity in 98 aa overlap). Protein product from Mb3098 detected using SWATH mass spectrometry. Mb3098 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003793" /db_xref="InterPro:IPR011322" /db_xref="InterPro:IPR015867" /db_xref="UniProtKB/TrEMBL:A0A1R3Y317" /protein_id="SIU01723.1" /translation="MNEQCLKLTAYFGERQRAVGGAGRFLADAMLDLFGSHNVATSVM LRGTTSFGPKHEFRCDQSLSLSEDPPVTVAAVDIESKIRSLVDDVTAMTDRGLVTLER ARLVTRHSGAEEFGDIDSRNGDAAKLTIYAGRQVRVAGAPAYYTICELLHRHGFAGAT VLLGVDGTAHGRRRRARFFGRNVNVPLMIIAVGTPAQVAVAAMELTAALPNPLLTIER VRLCKRDGELFARPQQLPQTDDQGRTLWQKLMVHTAEATHHEGLPIHRALVHRLMQSE TARGATALRGIWGFYGDHKPHGDKLFQLVRRVPVTTIIVDTPQAIARSFDIVDELTNW HGLVTSEMVPAAVSLTGSRDGTQKTGETPLARYDY" CDS complement(3396757..3397281) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3099C" /product="Probable F420-dependent oxidoreductase family protein" /note="Mb3099c, -, len: 174 aa. Equivalent to Rv3072c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Hypothetical protein, similar in part to O87779 HYPOTHETICAL 18.1 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (166 aa), FASTA scores: opt: 238, E(): 2.5e-08, (42.6% identity in 108 aa overlap); Q9AH10 PUTATIVE F420-DEPENDENT DEHYDROGENASE from Rhodococcus erythropolis (295 aa), FASTA scores: opt: 228, E(): 1.7e-07, (34.25% identity in 111 aa overlap); P71557|Y953_MYCTU|Rv0953c|MTCY10D7.21 POSSIBLE OXIDOREDUCTASE from Mycobacterium tuberculosis strain H37Rv (304 aa), FASTA scores: opt: 208, E(): 3.2e-06, (38.9% identity in 108 aa overlap); etc. N-terminal region similar to several proteins from M. tuberculosis (see MAST results on the web site http: //www.genolist.pasteur.fr/TubercuList/mast/P18.1.html). Mb3099c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y328" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y328" /protein_id="SIU01724.1" /translation="MACVRRSCDVTGTARAGIGAGADPAVVDAVAVAADDCGFATLWV GEHVVMVDRPASRYPYSRDGVIAVPAQADWLDPMIALSFAAAASSRVDVATGVLLLPE HNPVIVAKEAASLDRLSGRRLTLGVASDGPRRSSTRSECHSSGAQSAPPNTSLQCAHY GATTSHRSTATVGS" CDS complement(3397288..3397644) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3100C" /product="DUF488 family protein SAV0238" /note="Mb3100c, -, len: 118 aa. Equivalent to Rv3073c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Conserved hypothetical protein, highly similar to other e.g. Q9F3D7|SC2H2.18 from Streptomyces coelicolor (119 aa), FASTA scores: opt: 399, E(): 2.5e-20, (53.05% identity in 115 aa overlap); Q9K4K9|SC5F8.15c from Streptomyces coelicolor (117 aa), FASTA scores: opt: 334, E(): 6e-16, (49.1% identity in 112 aa overlap); Q9HKD5|TA0666 from Thermoplasma acidophilum (134 aa), FASTA scores: opt: 334, E(): 6.7e-16, (42.35% identity in 111 aa overlap); BAB53507|MLL7394 from Rhizobium loti (Mesorhizobium loti) (120 aa), FASTA scores: opt: 309, E(): 3e-14, (43.65% identity in 110 aa overlap); etc. Mb3100c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007438" /db_xref="UniProtKB/Swiss-Prot:P65064" /protein_id="SIU01725.1" /translation="MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIW CKDVAPSKELREWYHHQPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRH VARSHAAVLAQLLNGR" CDS 3397738..3399012 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3101" /product="CONSERVED 13E12 REPEAT FAMILY PROTEIN" /note="Mb3101, -, len: 424 aa. Equivalent to Rv3074, len: 424 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 424 aa overlap). Conserved hypothetical protein, highly similar but shorter (46 aa) to P71806|Rv1378c|MTCY02B12.12c HYPOTHETICAL 51.3 KDA PROTEIN from Mycobacterium tuberculosis (475 aa), FASTA scores: opt: 2009, E(): 5.8e-113, (72.95% identity in 429 aa overlap); and also similar to other hypothetical mycobacterium proteins e.g. O33266|Rv0336|MTCY279.03 (503 aa), FASTA scores: opt: 337, E(): 7.5e-13, (28.6% identity in 381 aa overlap); O33360|Rv0515|MTCY20G10.05 (503 aa), FASTA scores: opt: 337, E(): 7.5e-13, (28.6% identity in 381 aa overlap); etc. Mb3101 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T4" /protein_id="SIU01726.1" /translation="MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDA ARRAAEGAAGVPAARRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDC GALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDP QAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRG QVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMV ASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAP IRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTG SRHRSGAPPHLPAVTVSELEVRIGIALARYAA" CDS complement(3399009..3399932) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3102C" /product="Similar to citrate lyase beta chain, 3" /note="Mb3102c, -, len: 307 aa. Equivalent to Rv3075c, len: 307 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 307 aa overlap). Conserved hypopthetical protein, with some similarity to Q9I562|PA0883 PROBABLE ACYL-COA LYASE BETA CHAIN from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 408, E(): 9.2e-19, (35.15% identity in 273 aa overlap); Q9S2U9|SC4G6.02 PUTATIVE CITRATE LYASE BETA CHAIN from Streptomyces coelicolor (274 aa), FASTA scores: opt: 384, E(): 3.1e-17, (34.7% identity in 265 aa overlap); O06162|CITE|Rv2498c|MTCY07A7.04c from Mycobacterium tuberculosis (273 aa), FASTA scores: opt: 349, E(): 5.1e-15, (35.2% identity in 264 aa overlap); etc. Several initiation codons possible, first one chosen. Protein product from Mb3102c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3102c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y375" /db_xref="InterPro:IPR005000" /db_xref="InterPro:IPR011206" /db_xref="InterPro:IPR015813" /db_xref="InterPro:IPR040442" /db_xref="UniProtKB/TrEMBL:A0A1R3Y375" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01727.1" /translation="MTSMYEQVDTNTADPVAGSRIDPVLARSWLLVNGAHGDRFESAA HSRADIVVLDIEDAVAPKDKHAARDNAVRWFGDGNADWVRINGFGTPWWADDLAMLAD SPVGGVMLAMVESVDHVTETAKRLPNVPIVALVETARGLERINEIAAAKGTFRLAFGI GDFRRDTGFGEDPATLAYARSRFTIAARAAGLPSAIDGPTIGSNALKLIEATAVSAEF GMTGKICLSPDQCPVVNEGLSPSQDEIVWAKEFFAEFARDGGEIRNGSDLPRIARATK ILDLARAYGIEVSDFEDEPVHMPAPTDTYHY" CDS 3400031..3400507 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3103" /product="Cyclase/Dehydrase" /note="Mb3103, -, len: 158 aa. Equivalent to Rv3076, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Conserved hypothetical protein, weakly similar to Q9AK12|SC8D11.07 HYPOTHETICAL 17.0 KDA PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 110, E(): 1.5, (25.5% identity in 145 aa overlap). Protein product from Mb3103 detected using SWATH mass spectrometry. Mb3103 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F1" /protein_id="SIU01728.1" /translation="MVLDGVVSDTRRSRTIAARQQTIWDVLADFGSLSSWVEGVDHSC VLNHGPDGGALGSTRRVQVGRNTLVERVIEFDPPTTLAYRIEGLPARLRKVTNRWTLR PADPVGAVTVVTLTSTIEIGGNPLARLAELVVGRAMAKRSNTMLAGLAQRLEDKHG" CDS 3400500..3402311 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3104" /product="POSSIBLE HYDROLASE" /note="Mb3104, -, len: 603 aa. Equivalent to Rv3077, len: 603 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 603 aa overlap). Possible hydrolase (EC 3.1.-.-), with some similarity to variety of hydrolases (aryl- and steryl sulfatases principaly) e.g. Q45087|PEHA PHOSPHONATE MONOESTER HYDROLASE from Burkholderia caryophylli (514 aa), FASTA scores: opt: 239, E(): 7.2e-07, (23.95% identity in 413 aa overlap); Q9I1E5|PA2333 PROBABLE SULFATASE from Pseudomonas aeruginosa (538 aa), FASTA scores: opt: 231, E(): 2.3e-06, (28.1% identity in 516 aa overlap); P31447|YIDJ_ECOLI|B3678 PUTATIVE SULFATASE (EC 3.1.6.-) from Escherichia coli (497 aa), FASTA scores: opt: 222, E(): 7.4e-06, (27.7% identity in 390 aa overlap); etc. Note that previously known as atsF. Protein product from Mb3104 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3104 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y312" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR017850" /db_xref="UniProtKB/TrEMBL:A0A1R3Y312" /protein_id="SIU01729.1" /translation="MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHG ISFTRHYTGSLACVPSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLG NWFRAAGYDTHYDGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPY GFSGWVGPEPHGAGLANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASF VNPHDIVLFPAWVWRSPLKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGL TRMVSRNYARNAQRYRDLYYRLHAEVDGPIDRVRRAVTEGGSEDAMLVRTSDHGDLLG AHGGLHQKWFNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVD VVAAALAESFSEVHPLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQL GRIVNPPAPLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPG VRHLATNGMGGDAYRTDPLDDQWELYDLTADPIEAYNRWTDPQLHELRQHLRMLLKQQ RAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGRFVR" CDS 3402312..3402713 /codon_start=1 /transl_table=11 /gene="hab" /locus_tag="BQ2027_MB3105" /product="PROBABLE HYDROXYLAMINOBENZENE MUTASE HAB" /note="Mb3105, hab, len: 133 aa. Equivalent to Rv3078, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Probable hab, hydroxylaminobenzene mutase (5.-.-.-) (see experiments in first citation), highly similar to two hydroxylaminobenzene mutases from Pseudomonas pseudoalcaligenes O52214|HABA (135 aa), FASTA scores: opt: 495, E(): 6.8e-25, (51.1% identity in 133 aa overlap); and O52216|HABB (164 aa), FASTA scores: opt: 479, E(): 8.2e-24, (51.9% identity in 133 aa overlap) (see first citation); and to Q9AH35|NBZB HYDROXYLAMINOBENZENE MUTASE from Pseudomonas putida (164 aa), FASTA scores: opt: 476, E(): 1.3e-23, (51.8% identity in 133 aa overlap) (see second citation). Gene name according to Pseudomonas pseudoalcaligenes nomenclature. Also similarity with putative different membrane proteins involved in transport (protein predicted to be a transmembrane protein). Mb3105 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y334" /db_xref="UniProtKB/TrEMBL:A0A1R3Y334" /protein_id="SIU01730.1" /translation="MQKLLFTIGLALFLIGLLTGLVIPALKNPRMALSSHLEGVLNGM FLVVLGLLWPHIDLPEAWQVIAVALIVYSAYANWLATLLAAAWGAGRKFAPIATGDHK APAAKEGFVSFLLLSLSVAIVIGVVIVIIGL" CDS complement(3402729..3403556) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3106C" /product="Probable F420-dependent oxidoreductase family protein" /note="Mb3106c, -, len: 275 aa. Equivalent to Rv3079c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Conserved hypothetical protein, similar to other hypothetical mycobacterium proteins e.g. P71557|Y953_MYCTU|Rv0953c|MTCY10D7.21 POSSIBLE OXIDOREDUCTASE from Mycobacterium tuberculosis strain H37Rv (282 aa), FASTA scores: opt: 668, E(): 2.4e-34, (40.55% identity in 281 aa overlap); O06216|Rv2161c|MTCY270.07 from M. tuberculosis strain H37Rv (288 aa), FASTA scores: opt: 595, E(): 8.5e-30, (40.9% identity in 274 aa overlap); O87779 from Mycobacterium paratuberculosis (166 aa), FASTA scores: opt: 464, E(): 7.2e-22, (41.55% identity in 166 aa overlap); etc. Also some similarity to other proteins e.g. Q9AH10 PUTATIVE F420-DEPENDENT DEHYDROGENASE from Rhodococcus erythropolis (295 aa), FASTA scores: opt: 401, E(): 9.6e-18, (30.2% identity in 288 aa overlap); Q9AE04|RIF17 RIF17 PROTEIN from Amycolatopsis mediterranei (356 aa), FASTA scores: opt: 298, E(): 2.8e-11, (35.0% identity in 203 aa overlap); AAK48081|MT3720 LUCIFERASE-RELATED PROTEIN from Mycobacterium tuberculosis strain CDC1551 (395 aa), FASTA scores: opt: 223, E(): 1.4e-06, (29.4% identity in 211 aa overlap). Protein product from Mb3106c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y323" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019921" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y323" /protein_id="SIU01731.1" /translation="MQFGVLTFVTDEGIGPAELGAALEHRGFESLFLAEHTHIPVNTQ SPYPGGGPIPEKYYRTLDPFVALAAAAATTQSLVLGTGIALIPERDPIVTAKEVASLD LVSQGRFRFGVGVGWLREEVANHGVDPAVRGRVIDERLRAIIEIWTQEQAEFHGTYVD FDPIYCWPKPVTKPYPPLYVGGGPANFPRIARLNAGWIAISPSPQRLSGPLQRLRAMA GGDVPVTVCQWGEAAAKDLEGYRHLGVERVLLELPTEPRDPTLRYLDKLQAELARLA" CDS complement(3403615..3406947) /codon_start=1 /transl_table=11 /gene="pknK" /locus_tag="BQ2027_MB3107C" /product="serine/threonine-protein kinase transcriptional regulatory protein pknk (protein kinase k) (stpk k)" /note="Mb3107c, pknK, len: 1110 aa. Equivalent to Rv3080c, len: 1110 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1110 aa overlap). Probable pknK, serine/threonine protein kinase involved in transcriptional regulatory function (EC 2.7.1.-) (see citation below). Similar but shorter in N-terminus (approximatively 300 residues) to others e.g. Q48411|ACOK TRANSCRIPTIONAL REGULATORY PROTEIN OF aco ABCD operon from Klebsiella pneumoniae (921 aa), FASTA scores: opt: 886, E(): 7.6e-37, (27.75% identity in 829 aa overlap); Q9HX92|PA3921 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (906 aa), FASTA scores: opt: 760, E(): 1.5e-30, (29.55% identity in 822 aa overlap); Q9I2X9|PA1760 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (907 aa), FASTA scores: opt: 696, E(): 2.3e-27, (25.85% identity in 685 aa overlap); P06993|MALT (alias BAB37683|ECS4260 and AAG58520|MALT) POSITIVE REGULATOR OF MAL REGULON from Escherichia coli strain O157:H7 (901 aa), FASTA scores: opt: 660, E(): 1.4e-25, (29.25% identity in 530 aa overlap); Q9KNF3|VCA0011 MALT REGULATORY PROTEIN from Vibrio cholerae (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (921 aa), FASTA scores: opt: 626, E(): 7.2e-24, (25.8% identity in 659 aa overlap); etc. N-terminal region similar to N-terminus of serine/threonine kinases e.g. Q9KK90|PKMA SERINE/THREONINE KINASE (SIMILAR TO THE SER/THR FAMILY OF PROTEIN KINASES) from Amycolatopsis mediterranei (589 aa), FASTA scores: opt: 545, E(): 5.7e-20, (34.45% identity in 334 aa overlap); Q9RPT5|AMK SERINE/THREONINE PROTEIN KINASE HOMOLOG (SIMILAR TO THE SER/THR FAMILY OF PROTEIN KINASES) from Amycolatopsis mediterranei (606 aa), FASTA scores: opt: 537, E(): 1.5e-19, (35.55% identity in 346 aa overlap); Q9L0I0|PKAD PROTEIN SERINE/THREONINE KINASE from Streptomyces coelicolor (599 aa), FASTA scores: opt: 520, E(): 1e-18, (36.1% identity in 324 aa overlap); etc. N-terminal part also similar to O53510|PKNL_MYCTU|Rv2176|MT2232|MTV021.09 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis strain H37Rv (399 aa), FASTA scores: opt: 511, E(): 2.1e-18, (35.15% identity in 313 aa overlap). Contains PS00107 Protein kinases ATP-binding region signature and PS00017 ATP/GTP-binding site motif A (P-loop). Contains Hank's kinase subdomain. FIRST PART OF THE PROTEIN SEEMS BELONG TO THE SER/THR FAMILY OF PROTEIN KINASES, AND SECOND PARTS SEEMS BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3107c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3107c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TXA9" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016236" /db_xref="InterPro:IPR017441" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041664" /db_xref="UniProtKB/Swiss-Prot:Q7TXA9" /protein_id="SIU01732.1" /translation="MTDVDPHATRRDLVPNIPAELLEAGFDNVEEIGRGGFGVVYRCV QPSLDRAVAVKVLSTDLDRDNLERFLREQRAMGRLSGHPHIVTVLQVGVLAGGRPFIV MPYHAKNSLETLIRRHGPLDWRETLSIGVKLAGALEAAHRVGTLHRDVKPGNILLTDY GEPQLTDFGIARIAGGFETATGVIAGSPAFTAPEVLEGASPTPASDVYSLGATLFCVL TGHAAYERRSGERVIAQFLRITSQPIPDLRKQGLPADVAAAIERAMARHPADRPATAA DVGEELRDVQRRNGVSVDEMPLPVELGVERRRSPEAHAAHRHTGGGTPTVPTPPTPAT KYRPSVPTGSLVTRSRLTDILRAGGRRRLILIHAPSGFGKSTLAAQWREELSRDGAAV AWLTIDNDDNNEVWFLSHLLESIRRVRPTLAESLGHVLEEHGDDAGRYVLTSLIDEIH ENDDRIAVVIDDWHRVSDSRTQAALGFLLDNGCHHLQLIVTSWSRAGLPVGRLRIGDE LAEIDSAALRFDTDEAAALLNDAGGLRLPRADVQALTTSTDGWAAALRLAALSLRGGG DATQLLRGLSGASDVIHEFLSENVLDTLEPELREFLLVASVTERTCGGLASALAGITN GRAMLEEAEHRGLFLQRTEDDPNWFRFHQMFADFLHRRLERGGSHRVAELHRRASAWF AENGYLHEAVDHALAAGDPARAVDLVEQDETNLPEQSKMTTLLAIVQKLPTSMVVSRA RLQLAIAWANILLQRPAPATGALNRFETALGRAELPEATQADLRAEADVLRAVAEVFA DRVERVDDLLAEAMSRPDTLPPRVPGTAGNTAALAAICRFEFAEVYPLLDWAAPYQEM MGPFGTVYAQCLRGMAARNRLDIVAALQNFRTAFEVGTAVGAHSHAARLAGSLLAELL YETGDLAGAGRLMDESYLLGSEGGAVDYLAARYVIGARVKAAQGDHEGAADRLSTGGD TAVQLGLPRLAARINNERIRLGIALPAAVAADLLAPRTIPRDNGIATMTAELDEDSAV RLLSAGDSADRDQACQRAGALAAAIDGTRRPLAALQAQILHIETLAATGRESDARNEL APVATKCAELGLSRLLVDAGLA" CDS 3406999..3408237 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3108" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3108, -, len: 412 aa. Equivalent to Rv3081, len: 412 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 412 aa overlap). Conserved hypothetical protein. Second part of the protein (approximatively residues 250-412) shares weak similarity with other hypothetical proteins e.g. Q9YEU3|APE0488 from Aeropyrum pernix (188 aa), FASTA scores: opt: 149, E(): 0.019, E(): 0.019, (29.5% identity in 173 aa overlap); and first part shares weak similarity with C-terminal part of Q9RVT9|DR0933 ALPHA-AMLYASE from Deinococcus radiodurans (644 aa), FASTA scores: opt: 127, E(): 1.4, (27.25% identity in 198 aa overlap). Equivalent to AAK47502|MT3166 HYPOTHETICAL 48.3 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (436 aa) but shorter 24 aa in N-terminus. Contains PS00850 Glycine radical signature and possible helix-turn-helix motif at aa 53-74. Protein product from Mb3108 detected using SWATH mass spectrometry. Mb3108 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018700" /db_xref="UniProtKB/TrEMBL:A0A1R3Y327" /protein_id="SIU01733.1" /translation="MTPHYRQAAASRLDTHRTQKLRSQTNGGKDRHQLTYEQFARMLT LMGPSDLWTVERAARHWGVSASRARAILSSRHIHRVSGYPAQAIKAVTLRQGARTDLK TANHLVPAAQAFTMAETGAAIGETEDERARLRIFFEFLRGADETGTSALDLIVDEPAL IGEHRFDALLAAAAEYISARWGRPGPLWSVSIERFLDTAWWVSDLPSARAFAAVWTPA PFRRRGIYLDRHDLTSDGVCVMPEPVFNRTELQRAFTALAAKLERRGVVGQVHVVGGA AMLLAYNSRVTTRDIDALFSTDGPMLEAIREVADEMGWPRTWLNNQASGYVSRTPGEG APVFDHPFLHVVATPAQHLLAMKVVAARGVRDGEDIRLLLDRLRITSAAGVWEIVARY FPAETITDRSRLLVEDLLNQ" CDS complement(3408363..3409385) /codon_start=1 /transl_table=11 /gene="virS" /locus_tag="BQ2027_MB3109C" /product="VIRULENCE-REGULATING TRANSCRIPTIONAL REGULATOR VIRS (ARAC/XYLS FAMILY)" /note="Mb3109c, virS, len: 340 aa. Equivalent to Rv3082c, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 340 aa overlap). virS, transcriptional regulatory protein araC/xylS family, probably involved in virulence (see citation below). Similar to many transcriptional regulators araC/xylS family e.g. Q9HZ25|PA3215 PROBABLE TRANSCRIPTIONAL REGULATOR (ARAC/XYLS FAMILY) from Pseudomonas aeruginosa (337 aa), FASTA scores: opt: 379, E(): 3e-17, (30.4% identity in 306 aa overlap); Q9Z3Y6|PHBR POLYHYDROXYBUTYRATE TRANSCRIPTIONAL ACTIVATOR from Pseudomonas sp. 61-3 (379 aa), FASTA scores: opt: 336, E(): 2e-14, (26.35% identity in 334 aa overlap); P72171|ORUR|PA0831 ORNITHINE UTILIZATION TRANSCRIPTIONAL REGULATOR oruR from Pseudomonas aeruginosa (339 aa), FASTA scores: opt: 274, E(): 1.9e-10, (23.7% identity in 321 aa overlap); Q9ZFW7 VIRULENCE REGULATING HOMOLOG from Pseudomonas alcaligenes (346 aa), FASTA scores: opt: 262, E(): 1.2e-09, (24.5% identity in 339 aa overlap); etc. Also similar to O69703|Rv3736|MTV025.084 PUTATIVE REGULATORY PROTEIN (ARAC/XYLS FAMILY) from Mycobacterium tuberculosis strain H37Rv (353 aa), FASTA scores: opt: 656, E(): 3.5e-35, (36.95% identity in 333 aa overlap). Has potential helix-turn-helix motif at positions 252-273. BELONGS TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3109c detected using SWATH mass spectrometry. Mb3109c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y337" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR032687" /db_xref="UniProtKB/TrEMBL:A0A1R3Y337" /protein_id="SIU01734.1" /translation="MELGSLIRATNLWGYTDLMRELGADPLPFLRRFDIPPGIEHQED AFMSLAGFVRMLEASAAELDCPDFGLRLARWQGLGILGPVAVIARNAATLFGGLEAIG RYLYVHSPALTLTVSSTTARSNVRFGYEVTEPGIPYPLQGYELSMANAARMIRLLGGP QARARVFSFRHAQLGTDAAYREALGCTVRFGRTWCGFEVDHRLAGRPIDHADPETKRI ATKYLESQYLPSDATLSERVVGLARRLLPTGQCSAEAIADQLDMHPRTLQRRLAAEGL RCHDLIERERRAQAARYLAQPGLYLSQIAVLLGYSEQSALNRSCRCWFGMTPRQYRAY GGVSGR" CDS 3409463..3410950 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3110" /product="PROBABLE MONOOXYGENASE (HYDROXYLASE)" /note="Mb3110, -, len: 495 aa. Equivalent to Rv3083, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 495 aa overlap). Probable monooxygenase (EC 1.-.-.-), highly similar to other putative monooxygenases flavin-binding family e.g. AAK48336|MT3969 from Mycobacterium tuberculosis strain CDC1551 (489 aa), FASTA scores: opt: 1692, E(): 4.9e-98, (49.7% identity in 489 aa overlap); Q9A588|CC2569 from Caulobacter crescentus (498 aa), FASTA scores: opt: 1684, E(): 1.6e-97, (52.25% identity in 484 aa overlap); Q9APW3 from Pseudomonas aeruginosa (508 aa), FASTA scores: opt: 1603, E(): 1.8e-92, (49.8% identity in 484 aa overlap); etc." /db_xref="GOA:A0A1R3Y4Y2" /db_xref="InterPro:IPR020946" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01735.1" /translation="MNQHFDVLIIGAGLSGIGTACHVTAEFPDKTIALLERRERLGGT WDLFRYPGVRSDSDMFTFGYKFRPWRDVKVLADGASIRQYIADTATEFGIDEKIHYGL KVNTAEWSSRQCRWTVAGVHEATGETRTYTCDYLISCTGYYNYDAGYLPDFPGVHRFG GRCVHPQHWPEDLDYSGKKVVVIGSGATAVTLVPAMAGSNPGSAAHVTMLQRSPSYIF SLPAVDKISEVLGRFLPDRWVYEFGRRRNIAIQRKLYQACRRWPKLMRRLLLWEVRRR LGRSVDMSNFTPNYLPWDERLCAVPNGDLFKTLASGAASVVTDQIETFTEKGILCKSG REIEADIIVTATGLNIQMLGGMRLIVDGAEYQLPEKMTYKGVLLENAPNLAWIIGYTN ASWTLKSDIAGAYLCRLLRHMADNGYTVATPRDAQDCALDVGMFDQLNSGYVKRGQDI MPRQGSKHPWRVLMHYEKDAKILLEDPIDDGVLHFAAAAQDHAAA" CDS 3410956..3411882 /codon_start=1 /transl_table=11 /gene="lipR" /locus_tag="BQ2027_MB3111" /product="PROBABLE ACETYL-HYDROLASE/ESTERASE LIPR" /note="Mb3111, lipR, len: 308 aa. Equivalent to Rv3084, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Probable lipR, N-Acetyl-hydrolase/esterase (EC 3.1.1.-), similar to other e.g. Q01109|BAH_STRH from Streptomyces hygroscopicus (299 aa), FASTA scores: opt: 558, E(): 4.1e-26, (40.25% identity in 246 aa overlap); Q9X8J4|SCE9.22 from Streptomyces coelicolor (266 aa), FASTA scores: opt: 544, E(): 2.5e-25, (36.95% identity in 257 aa overlap); Q56171|DEA from Streptomyces viridochromogenes (299 aa), FASTA scores: opt: 532, E(): 1.4e-24, (38.6% identity in 254 aa overlap); etc. Also similar to O06350|LIPF|Rv3487c|MTCY13E12.41c (277 aa), FASTA score: opt: 291, E(): 8.5e-10, (28.5% identity in 239 aa overlap). MAY BE BELONG TO THE 'GDXG' FAMILY OF LIPOLYTIC ENZYMES. Protein product from Mb3111 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3T8" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01736.1" /translation="MNLRKNVIRSVLRGARPLFASRRLGIAGRRVLLATLTAGARAPK GTRFQRVSIAGVPVQRVQPPHAATSGTLIYLHGGAYALGSARGYRGLAAQLAAAAGMT ALVPDYTRAPHAHYPVALEEMAAVYTRLLDDGLDPKTTVIAGDSAGGGLTLALAMALR DRGIQAPAALGLICPWADLAVDIEATRPALRDPLILPSMCTEWAPRYVGSSDPRLPGI SPVYGDMSGLPPIVMQTAGDDPICVDADKIETACAASKTSIEHRRFAGMWHDFHLQVS LLPEARDAIADLGARLRGHLHQSQGQPRGVVK" CDS 3411879..3412709 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3112" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb3112, -, len: 276 aa. Equivalent to Rv3085, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 276 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various oxidoreductases in the short chain dehydrogenases/reductases family e.g. Q9CC98|ML1094 SHORT CHAIN ALCOHOL DEHYDROGENASE from Mycobacterium leprae (277 aa), FASTA scores: opt: 1059, E(): 4.8e-56, (61.65% identity in 266 aa overlap); Q9I3H6|PA1537 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginos (295 aa), FASTA scores: opt: 858, E(): 4.7e-44, (48.4% identity in 285 aa overlap); Q9CBP7|ML1740 POSSIBLE SHORT CHAIN REDUCTASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 500, E(): 1e-22, (36.6% identity in 257 aa overlap); etc. Also similar to mycobacterium proteins O50460|Rv1245c|MTV006.17c DEHYDROGENASE SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES FAMILY (276 aa), FASTA scores: opt: 1200, E(): 1.9e-64, (65.2% identity in 273 aa overlap); and P95101|Rv3057c|MTCY22D7.24 HYPOTHETICAL DEHYDROGENASE (287 aa). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3112 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y383" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y383" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01737.1" /translation="MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLA KTVRLAQALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVDKS EFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVPGQSAYNAAKF AVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATVADGEDQQTFAEFFDRRLA LHSPEMAAKTIVNGVAKGQARVVVGLEAKAVDVLARIMGSSYQRLVAAGVAKFFPWAK " CDS 3412740..3413846 /codon_start=1 /transl_table=11 /gene="adhD" /locus_tag="BQ2027_MB3113" /product="PROBABLE ZINC-TYPE ALCOHOL DEHYDROGENASE ADHD (ALDEHYDE REDUCTASE)" /note="Mb3113, adhD, len: 368 aa. Equivalent to Rv3086, len: 368 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 368 aa overlap). Probable adhD, zinc-type alcohol dehydrogenase (EC 1.1.1.-), highly similar to many e.g. O69045 HYPOTHETICAL ALCOHOL DEHYDROGENASE from Rhodococcus rhodochrous (370 aa), FASTA scores: opt: 1255, E(): 8.7e-68, (50.4% identity in 367 aa overlap); P25406|ADHB_UROHA ALCOHOL DEHYDROGENASE I-B from Uromastyx hardwickii (Indian spiny-tailed lizard) (375 aa), FASTA scores: opt: 787, E(): 8.2e-40, (35.9% identity in 373 aa overlap); P72324||ADHI_RHOSH ALCOHOL DEHYDROGENASE CLASS III from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (376 aa), FASTA scores: opt: 787, E(): 8.3e-40, (35.1% identity in 379 aa overlap). Also highly similar to P71818|Rv0761c|MTCY369.06c HYPOTHETICAL ZINC-TYPE ALCOHOL DEHYDROGENASE-LIKE PROTEIN from Mycobacterium tuberculosis strain H37Rv (375 aa), FASTA scores: opt: 1186, E(): 1.2e-63, (47.3% identity in 368 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE. POSSIBLY REQUIRES ZINC FOR ITS ACTIVITY. Protein product from Mb3113 detected using SWATH mass spectrometry. Mb3113 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3F2" /db_xref="InterPro:IPR002328" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR023921" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01738.1" /translation="MKTTAAVLFEAGKPFELMELDLDGPGPGEVLVKYTAAGLCHSDL HLTDGDLPPRFPIVGGHEGSGVIEEVGAGVTRVKPGDHVVCSFIPNCGTCRYCCTGRQ NLCDMGATILEGCMPDGSFRFHSQGTDFGAMCMLGTFAERATVSQHSVVKVDDWLPLE TAVLVGCGVPSGWGTAVNAGNLRAGDTAVIYGVGGLGINAVQGATAAGCKYVVVVDPV AFKRETALKFGATHAFADAASAAAKVDELTWGQGADAALILVGTVDDEVVSAATAVIG KGGTVVITGLADPAKLTVHVSGTDLTLHEKTIKGSLFGSCNPQYDIVRLLRLYDAGQL MLDELVTTTYNLEQVNQGYQDLRDGKNIRGVIVH" CDS 3413884..3415302 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3114" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb3114, -, len: 472 aa. Equivalent to Rv3087, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 472 aa overlap). Hypothetical protein, similar to several Mycobacterium tuberculosis proteins e.g. MTCY08D5.16, MTCY28.26, MTCY493.29c. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa). Protein product from Mb3114 detected using SWATH mass spectrometry. Mb3114 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y321" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3Y321" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01739.1" /translation="MRRLNGVDALMLYLDGGSAYNHTLKISVLDPSTDPDGWSWPKAR QMFEERAHLLPVFRLRYLPTPLGLHHPIWVEDPEFDLDAHVRRVVCPAPGGMAEFCAL VEQIYAHPLDRDRPLWQTWVVEGLDGGRVALVTLLHHAYSDGVGVLDMLAAFYNDAPD EAPVVAPPWEPPPLPSTRQRLGWALRDLPSRLGKIAPTVRAVRDRVRIEREFAKDGDR RVPPTFDRSAPPGPFQRGLSRSRRFSCESFPLAEVREVSKTLGVTINDVFLACVAGAV RRYLERCGSPPTDAMVATMPLAVTPAAERAHPGNYSSVDYVWLRADIADPLERLHATH LAAEATKQHFAQTKDADVGAVVELLPERLISGLARANARTKGRFDTFKNVVVSNVPGP REPRYLGRWRVDQWFSTGQISHGATLNMTVWSYCDQFNLCVMADAVAVRNTWELVGGF RASHEELLAAARAQATPKEMAT" CDS 3415299..3416723 /codon_start=1 /transl_table=11 /gene="tgs4" /locus_tag="BQ2027_MB3115" /product="putative triacylglycerol synthase (diacylglycerol acyltransferase) tgs4" /note="Mb3115, -, len: 474 aa. Equivalent to Rv3088, len: 474 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 474 aa overlap). Hypothetical protein, similar to several Mycobacterium tuberculosis proteins e.g. MTCY31.23 (505 aa), MTCY13E12.34c (497 aa) and MTCY493.29c (459 aa). Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa). Protein product from Mb3115 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3115 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67209" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/Swiss-Prot:P67209" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01740.1" /translation="MTRINPIDLSFLLLERANRPNHMAAYTIFEKPKGQKSSFGPRLF DAYRHSQAAKPFNHKLKWLGTDVAAWETVEPDMGYHIRHLALPAPGSMQQFHETVSFL NTGLLDRGHPMWECYIIDGIERGRIAILLKVHHALIDGEGGLRAMRNFLSDSPDDTTL AGPWMSAQGADRPRRTPATVSRRAQLQGQLQGMIKGLTKLPSGLFGVSADAADLGAQA LSLKARKASLPFTARRTLFNNTAKSAARAYGNVELPLADVKALAKATGTSVNDVVMTV IDDALHHYLAEHQASTDRPLVAFMPMSLREKSGEGGGNRVSAELVPMGAPKASPVERL KEINAATTRAKDKGRGMQTTSRQAYALLLLGSLTVADALPLLGKLPSANVVISNMKGP TEQLYLAGAPLVAFSGLPIVPPGAGLNVTFASINTALCIAIGAAPEAVHEPSRLAELM QRAFTELQTEAGTTSPTTSKSRTP" CDS 3416720..3418231 /codon_start=1 /transl_table=11 /gene="fadD13" /locus_tag="BQ2027_MB3116" /product="PROBABLE CHAIN-FATTY-ACID-CoA LIGASE FADD13 (FATTY-ACYL-CoA SYNTHETASE)" /note="Mb3116, fadD13, len: 503 aa. Equivalent to Rv3089, len: 503 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 503 aa overlap). Probable fadD13, Acyl-CoA Synthetase (EC 6.2.1.-), similar to many e.g. MTCI28.06, MTCY08D5.09, MTCY06G11.08 from Mycobacterium tuberculosis strain H37Rv; and to Q9F7P5 PREDICTED ACID--CoA LIGASE FADD13 from uncultured proteobacterium EBAC31A08 (504 aa), FASTA scores: opt: 1126, E(): 2.4e-62, (38.85% identity in 502 aa overlap); Q9EY88|FCS FERULOYL-CoA SYNTHETASE from Amycolatopsis sp. strain HR167 (491 aa), FASTA scores: opt: 1073, E(): 4.5e-59, (38.5% identity in 504 aa overlap); BAB49118|MLR1843 PROBABLE ACID-CoA LIGASE from Rhizobium loti (Mesorhizobium loti) (495 aa), FASTA scores: opt: 937, E(): 1.2e-50, (36.2% identity in 503 aa overlap); Q9KZC1|SC6F7.21 PROBABLE LONG-CHAIN-FATTY-ACID-CoA LIGASE from Streptomyces coelicolor (511 aa), FASTA scores: opt: 899, E(): 2.8e-48, (36.1% identity in 510 aa overlap); Q9A5P7|CC2400 PUTATIVE ACID-CoA LIGASE from Caulobacter crescentus (496 aa), FASTA scores: opt: 874, E(): 9.8e-47, (35.1% identity in 507 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature and PS00061 Short-chain alcohol dehydrogenase family signature. TBparse score is 0.877. Protein product from Mb3116 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3116 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y332" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y332" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01741.1" /translation="MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCAD VLTALGIAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDSGS KVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPAVECGGDDNLF IMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRLLLPLPMFHVAALTTVIFS AMRGVTLISMPQFDATKVWSLIVEERVCIGGAVPAILNFMRQVPEFAELDAPDFRYFI TGGAPMPEALIKIYAAKNIEVVQGYALTESCGGGTLLLSEDALRKAGSAGRATMFTDV AVRGDDGVIREHGEGEVVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYL YIKDRLKDMIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSATVPK" CDS 3419170..3420057 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3117" /product="unknown alanine and valine rich protein" /note="Mb3117, -, len: 295 aa. Equivalent to Rv3090, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Hypothetical unknown Ala-, Val-rich protein. Hydrophobic stretch at N-terminus. Protein product from Mb3117 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y326" /db_xref="InterPro:IPR001107" /db_xref="UniProtKB/TrEMBL:A0A1R3Y326" /protein_id="SIU01742.1" /translation="MTWQIVFVVICVIVAGVAALFWRLPSDDTTRSRAKTVTIAAVAA AAVFFFLGCFTIVGTRQFAIMTTFGRPTGVSLNNGFHGKWPWQMTHPMDGAVQIDKYV KEGNTDQRITVRLGNQSTALADVSIRWQLKQAAAPELFQQYKTFDNVRVNLIERNLSV ALNEVFAGFNPLDPRNLDVSPLPSLAKRAADILRQDVGGQVDIFDVNVPTIQYDQSTE DKINQLNQQRAQTSIALEAQRTAEAQAKANEILSRSISDDPNVVVQNCITAAINKGIS PLGCWPGSSALPTIAVPGR" CDS 3420075..3421766 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3118" /product="conserved protein" /note="Mb3118, -, len: 563 aa. Equivalent to Rv3091, len: 563 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 563 aa overlap). Hypothetical protein, similar in part to O60859 NEUROPATHY TARGET ESTERASE from Homo sapiens (Human) (1327 aa), FASTA scores: opt: 177, E(): 0.0062, (30.65% identity in 173 aa overlap); and Q9I385|PA1640 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (345 aa), FASTA scores: opt: 152, E(): 0.069, (27.8% identity in 180 aa overlap). Protein product from Mb3118 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3118 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y338" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR016035" /db_xref="UniProtKB/TrEMBL:A0A1R3Y338" /protein_id="SIU01743.1" /translation="MPIPFADGMLSRLGRRGAALDLIEEFEDESGEPPASLSPADLLA AEPALLLQKMENRLVRHHLANPDVLSGEQLRKLRYILNFARLADFEPGAAGPGGSRGR GDISVGGQVAPWRSRVVDALYAPLREEPDPVTALEGAKDVLATLVDDQDDQRRVLIER HGSDFSATELDAEVGYKKLVTVLGGGGGAGFVYIGGMQRLLAAGQVPDYMIGSSFGSI IGSLVARELPVPIDEYAEWAKTVSYRAILGPERRRSRHGLAGMFTLRFDQFAHTLLSR ADGERMRMSDLAIPFDVVVAGVRRQPYAALPSRFRHRERSTLTLRSLPFLPIGIGPWV AARMWQVAAFIDLRVVKPIVISADGATRDVNVVDAASFSSAIPGVLHHETSDPRMLPI LDELCADQDVAAMVDGGAASNVPVELAWERVRDGRLGTRNACYLAFDCFHPHWDPRHL WLVPITQAVQLQMVRNLPYADHLVRFEPTLSPVNLAPSAAAIDRACRWGRDSVEPAIA VTSALLEPTWWEGDRPPAAEPKERTKSAASSMSAVMAAIQAPTGRFRRWRSRHLT" CDS complement(3421773..3422693) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3119C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3119c, -, len: 306 aa. Equivalent to Rv3092c, len: 306 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 306 aa overlap). Probable conserved integral membrane protein, highly similar to Q9RUT5|DR1297 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (311 aa), FASTA scores: opt: 941, E(): 9.8e-51, (55.65% identity in 309 aa overlap); Q9A8B8|CC1436 HYPOTHETICAL PROTEIN from Caulobacter crescentus (314 aa), FASTA scores: opt: 791, E(): 1.6e-41, (46.9% identity in 305 aa overlap); and also highly similar to Q9I2N8|PA1857 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (307 aa), FASTA scores: opt: 373, E(): 8.1e-16, (40.8% identity in 321 aa overlap); BAB36119|ECS2696 PUTATIVE METHYL-INDEPENDENT MISMATCH REPAIR PROTEIN from Escherichia coli strain O157:H7 (305 aa), FASTA scores: opt: 335, E(): 1.7e-13, (39.75% identity in 307 aa overlap). Protein product from Mb3119c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3119c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y347" /db_xref="InterPro:IPR008526" /db_xref="UniProtKB/TrEMBL:A0A1R3Y347" /protein_id="SIU01744.1" /translation="MSGGLFGLLDHVAVLARLAAASIDDIGAAAGRATAKAAGVVIDD TAVTPQYVHRITAERELPIIKRIAIGSVRNKLLLILPGALLLSQLVPWLLTPLLMLGA TYLCYEGAEKVCGVIGGRGHDAAPQVAERELVAGAIRTDFILSAEIMVIALNEVADQP FVPRLIVLVIVALVITAAVYGVVAVIVQMDDVGLRLTQTASRFGQRIGGGLVAGMPKL LSALSAVGMGAMLWVGGHMVLVGSDHLGWHAPYRLVHHLDDHLVGSAGGALTWLVSTA ACAATGLVIGIVVVALVHLVCFRPPRSRSL" CDS complement(3422719..3423723) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3120C" /product="HYPOTHETICAL OXIDOREDUCTASE" /note="Mb3120c, -, len: 334 aa. Equivalent to Rv3093c, len: 334 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 334 aa overlap). Hypothetical oxidoreductase (EC 1.-.-.-), with some similarity with various oxidoreductases e.g. Q58929|MER|MJ1534 N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 300, E(): 1.1e-10, (24.1% identity in 324 aa overlap); and Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 264, E(): 1.5e-08, (30.45% identity in 335 aa overlap); Q9CCV8|ML0348 POSSIBLE COENZYME F420-DEPENDENT OXIDOREDUCTASE from Mycobacterium leprae (350 aa), FASTA scores: opt: 220, E(): 6.4e-06, (26.5% identity in 328 aa overlap); etc. Protein product from Mb3120c detected using SWATH mass spectrometry. Mb3120c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y502" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR022526" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y502" /protein_id="SIU01745.1" /translation="MTDIEVALPFWLDRPDHEATDVALAAADTGFAALWIGEMATYDA FALATSIGLRTPNMTLKVGPLAVGVRGPVGLALGVSSVASLTGCRVDLALGASSPAIV AGWHGRPWAHHVPVMRETIECLRSIFTGARVEYSGRHVNSRGFRLRGAAPDTRIALGA FGPGMIRLAAQHADEVVLNLASPFRVGRVRAAIDSAAAAAGRAAPRLTVWVPVAVNPG AAAHSQLAAQLAVYLAPPGYGEMFSALGFDGLVRSARSRATRRELAVAVPSELLDRVC ALGSPDRVAARLRAYADAGADCVAVVPATAEDPGGRVALRALRPGGLYGTAGDNDGRR " CDS complement(3423720..3424850) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3121C" /product="Acyl-CoA dehydrogenase/oxidase domain protein" /note="Mb3121c, -, len: 376 aa. Equivalent to Rv3094c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Conserved hypothetical protein, some similarity with various proteins e.g. Q9RMR9|NRGC NRGC PROTEIN (corresponding gene seems regulated by NifA) from Bradyrhizobium japonicum (388 aa), FASTA scores: opt: 677, E(): 5.8e-35, (34.55% identity in 353 aa overlap); P26698|PIGM_RHOSO PIGMENT PROTEIN from Rhodococcus sp. strain ATCC 21145 (387 aa), FASTA scores: opt: 480, E(): 1.2e-22, (28.7% identity in 376 aa overlap); Q9F0J3|NCNH HYDROXYLASE from Streptomyces arenae (405 aa), FASTA scores: opt: 441, E(): 3.3e-20, (29.25% identity in 352 aa overlap); etc. Equivalent to AAK47516 from Mycobacterium tuberculosis strain CDC1551 (395 aa) but N-terminus shorter 19 aa. Protein product from Mb3121c detected using SWATH mass spectrometry. Mb3121c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3U3" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013107" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U3" /protein_id="SIU01746.1" /translation="MNQSETEIEILAEKIARWARARSAEIERDRRLPDELVTRLREAG LLRATMPREVAAPELAPGRALRCAEAVARGDASAGWCVSIAITSALLVAYLPARSREE MFGGGRGVAAGVWAPRGTARSVDGGVVVSGRWPFCSGINHADIMFAGCFVDDRQVPSV VALNKDELQVLDTWHTLGLRGTGSHDCVADDVFVPADRVFSVFDGPIVDRPLYRFPVF GFFALSIGAAALGNARAAIDDLVELAGGKKGLGSTRTLAERSATQAAAATAESALGAA RALFYEVIEAAWQVSHDAEAVPVTMRNRLRLAATHAVRTSADVVRSMYDLAGGTAIYD NAPLQRRFRDAFTATAHFQVNEASRELPGRVLLDQPADVSML" CDS 3424932..3425408 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3122" /product="HYPOTHETICAL TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb3122, -, len: 158 aa. Equivalent to Rv3095, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Possible regulatory protein, because contains possible helix-turn-helix motif at aa 39-61 (+4.83 SD). Similar to hypothetical proteins e.g. Q9I0C9|PA2713 from Pseudomonas aeruginosa (159 aa), FASTA scores: opt: 486, E(): 1.6e-25, (45.95% identity in 148 aa overlap); Q9AAF6|CC0645 from Caulobacter crescentus (188 aa), FASTA scores: opt: 479, E(): 5.3e-25, (45.75% identity in 153 aa overlap); Q9K408|2SCG61.07 from Streptomyces coelicolor (157 aa), FASTA scores: opt: 407, E(): 2.8e-20, (43.9% identity in 139 aa overlap); etc. Protein product from Mb3122 detected using SWATH mass spectrometry. Mb3122 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A649" /db_xref="InterPro:IPR002577" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/Swiss-Prot:P0A649" /protein_id="SIU01747.1" /translation="MAVSDLSHRFEGESVGRALELVGERWTLLILREAFFGVRRFGQL ARNLGIPRPTLSSRLRMLVEVGLFDRVPYSSDPERHEYRLTEAGRDLFAAIVVLMQWG DEYLPRPEGPPIKLRHHTCGEHADPRLICTHCGEEITARNVTPEPGPGFKAKLASS" CDS 3425506..3426645 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3123" /product="putative cellulase family" /note="Mb3123, -, len: 379 aa. Equivalent to Rv3096, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 379 aa overlap). Hypothetical protein, with slight similarity to several proteins e.g. Q09671|OYEB_SCHPO|SPAC5H10.10 PUTATIVE NADPH DEHYDROGENASE C5H10.10 (EC 1.6.99.1) (OLD YELLOW ENZYME HOMOLOG) from Schizosaccharomyces pombe (Fission yeast) (392 aa), FASTA scores: opt: 125, E(): 1.1, (25.45% identity in 165 aa overlap); and Q12603|XYNA_DICTH BETA-1,4-XYLANASE (EC 3.2.1.8) (ENDO-1,4-BETA-XYLANASE) from Dictyoglomus thermophilum (352 aa), FASTA scores: opt: 124, E(): 1.2, (25.65% identity in 195 aa overlap); etc. Contains glycosyl hydrolases family 5 signature (PS00659). TBparse score is 0.932. Protein product from Mb3123 detected using SWATH mass spectrometry. Mb3123 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01748.1" /translation="MHRRTALKLPLLLAAGTVLGQAPRAAAGEPGRWSADRAHRWYQA HGWLVGANYITSNAINQLEMFQPGTYDPRRIDNELGLARFHGFNTVRVFLHDLLWAQD APGFQTRLAQFVAIAARYHIKPLFVLFDSCWDPLPRPGRQRAPRAGVHNSGWVQSPGA ERLDDRRYASTLYNYVTGVLGQFRNDDRVLGWDLWNEPDNPARVYRKVERKDKLERVA ELLPQVFRWARTVDPVQPLTSGVWQGNWGDPGRRSTISAIQLDNADVITFHSYAAPAE FEGRIAELAPLQRPILCTEYLARSQGSTVEGILPIAKRHNVGAFNWGLVAGKTQTYLP WDSWDHPYRAPPKVWFHDLLHPNGRPYRDGEVQTIRKLNGMPSQD" CDS complement(3426737..3428050) /codon_start=1 /transl_table=11 /gene="lipy" /locus_tag="BQ2027_MB3124C" /product="pe-pgrs family protein, triacylglycerol lipase lipy (esterase/lipase) (triglyceride lipase) (tributyrase)" /note="Mb3124c, PE_PGRS63, len: 437 aa. Equivalent to Rv3097c, len: 437 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 437 aa overlap). Probable Triacylglycerol lipase (EC 3.1.1.3), and member of the M. tuberculosis PE-family PGRS subfamily of gly-rich proteins; N-terminal part similar to N-terminus of M. tuberculosis PE-PGRS family members e.g. Q10637|Y03A_MYCTU hypothetical glycine-rich 49.6 kd protein (603 aa). Other relatives include MTCY1A11.25c; MTCY21B4.13c; MTCY270.06; MTCY359.33; MTC1A11.04. Mb3124c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y333" /db_xref="InterPro:IPR000084" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y333" /protein_id="SIU01749.1" /translation="MVSYVVALPEVMSAAATDVASIGSVVATASQGVAGATTTVLAAA EDEVSAAIAALFSAHGQDYQALSAQLAVFHERFVQALTGAAKGYAAAELANASLLQSE FASGIGNGFATIHQEIQRAPTALAAGFTQVPPFAAAQAGIFTGTPSGAAGFDIASLWP VKPLLSLSALETHFAIPNNPLLALIASDIPPLSWFLGNSPPPLLNSLLGQTVQYTTYD GMSVVQITPAHPTGEYVVAIHGGAFILPPSIFHWLNYSVTAYQTGATVQVPIYPLVQE GGTAGTVVPAMAGLISTQIAQHGVSNVSVVGDSAGGNLALAAAQYMVSQGNPVPSSMV LLSPWLDVGTWQISQAWAGNLAVNDPLVSPLYGSLNGLPPTYVYSGSLDPLAQQAVVL EHTAVVQGAPFSFVLAPWQIHDWILLTPWGLLSWPQINQQLGIAA" CDS complement(3428169..3428621) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3125C" /product="HYPOTHETICAL PROTEIN" /note="Mb3125c, -, len: 150 aa. Equivalent to Rv3098c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 150 aa overlap). Hypothetical unknown protein (shorter version of MTCY164.09c). Mb3125c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y353" /protein_id="SIU01750.1" /translation="MASLRIAEVDPVDRSPNHHASGSVETSSSRSRSASVRACLIHTS RSSSCSARRMTSLLRSPLRIAALMICSSFSVGRKPMVAVMSTTIADVAQSYSNCSTHS GTPTPAFAASFLLEAINAPRVIAGRFASESVRFPAAAPHGSVPSRLPV" CDS 3428565..3428885 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3125A" /product="PemK-like protein" /note="Mb3125A, len: 106 aa. Equivalent to Rv3098A len: 106 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 106 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). PemK-like protein. Protein product from Mb3125A detected using SWATH mass spectrometry. Mb3125A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y342" /db_xref="InterPro:IPR003477" /db_xref="InterPro:IPR011067" /db_xref="UniProtKB/TrEMBL:A0A1R3Y342" /protein_id="SIU01751.1" /translation="MVIRGAVYRVDFGDAKRGHEQRGRRYAVVISPGSMPWSVVTVVP TSTSAQPAVFRPELEVMGTKTRFLVDQIRTIGIVYVHGDPVDYLDRDQMAKVEHAVAR YLGL" gene complement(3428772..3429378) /gene="ssr" misc_RNA complement(3428772..3429378) /gene="ssr" /product="10Sa RNA" /note="ssr, len: 607 nt. Equivalent to ssr, len: 607 nt,from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 607 nt overlap). Match to EM_BA:MT10SARNA X60301 M.tuberculosis gene for 10Sa RNA." CDS complement(3429372..3430223) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3126C" /product="conserved protein" /note="Mb3126c, -, len: 283 aa. Equivalent to Rv3099c, len: 283 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 283 aa overlap). Conserved hypothetical protein, some similarity with hypothetical proteins e.g. Q9XA69|SCGD3.09 from Streptomyces coelicolor (274 aa), FASTA scores: opt: 384, E(): 1.8e-17, (32.7% identity in 269 aa overlap); and P71606|Y036_MYCTU|Rv0036c from Mycobacterium tuberculosis strain H37Rv (257 aa), FASTA scores: opt: 179, E(): 0.00024, (25.85% identity in 205 aa overlap). Protein product from Mb3126c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3126c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y335" /db_xref="InterPro:IPR010872" /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR024344" /db_xref="InterPro:IPR034660" /db_xref="UniProtKB/TrEMBL:A0A1R3Y335" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01752.1" /translation="MTTPGRPLTTLDKSDVLAGLFAVWHSLDALLDGLLETDWQATSP LPGWDVKAVVSHIIGTESFLLGIAAPEPDTDVSALAHVRNPIGVMNECWVRHLGTESG VGLLERFRAVTSQRRKVLASLSDDEWNAPTTTPSGPDSYGRFMRIRIFDCWMHEQDIR AAVQRPSSDDELGGPASPLVLDEIAATMGFVVGKLAKAPDGSRVLLELTGPLSRSIRV SVDGRARVVDDFGGPAPTATIRLDGLQFTRLAGGRPMSPARSQDVELGGDKELAGHIL ERLNFVI" CDS complement(3430260..3430742) /codon_start=1 /transl_table=11 /gene="smpB" /locus_tag="BQ2027_MB3127C" /product="PROBABLE SSRA-BINDING PROTEIN SMPB" /note="Mb3127c, smpB, len: 160 aa. Equivalent to Rv3100c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Probable smpB, small protein b related to several bacterial small protein b homologs e.g. O32881|SSRP_MYCLE|ML0671|MLCB1779.19c from Mycobacterium leprae (160 aa), FASTA scores: opt: 914, E(): 1.1e-52, (84.9% identity in 159 aa overlap); Q9L1S9|SMPB from Streptomyces coelicolor (159 aa), FASTA scores: opt: 568, E(): 3.3e-30, (55.15% identity in 145 aa overlap); O32230|SSRP_BACSU from Bacillus subtilis (156 aa), FASTA scores: opt: 511, E(): 1.7e-26, (47.05% identity in 153 aa overlap); etc. BELONGS TO THE SSRP FAMILY. Mb3127c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A613" /db_xref="InterPro:IPR000037" /db_xref="InterPro:IPR020081" /db_xref="InterPro:IPR023620" /db_xref="UniProtKB/Swiss-Prot:P0A613" /protein_id="SIU01753.1" /translation="MSKSSRGGRQIVASNRKARHNYSIIEVFEAGVALQGTEVKSLRE GQASLADSFATIDDGEVWLRNAHIPEYRHGSWTNHEPRRNRKLLLHRRQIDTLVGKIR EGNFALVPLSLYFAEGKVKVELALARGKQARDKRQDMARRDAQREVLRELGRRAKGMT " CDS complement(3430745..3431638) /codon_start=1 /transl_table=11 /gene="ftsX" /locus_tag="BQ2027_MB3128C" /product="PUTATIVE CELL DIVISION PROTEIN FTSX (SEPTATION COMPONENT-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER)" /note="Mb3128c, ftsX, len: 297 aa. Equivalent to Rv3101c, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 297 aa overlap). Putative ftsX, cell division protein, septation component transport integral membrane protein ABC transporter (see citations below), equivalent to O32882|FTSX_MYCLE|ML0670|MLCB1779.20c CELL DIVISION PROTEIN from Mycobacterium leprae (297 aa), FASTA scores: opt: 1597, E(): 9.2e-93, (80.8% identity in 297 aa overlap); and similar to others e.g. Q9L1S7|SCE59.27c from Streptomyces coelicolor (305 aa), FASTA scores: opt: 585, E(): 1.9e-29, (34.55% identity in 304 aa overlap); O34876|FTSX_BACSU from Bacillus subtilis (296 aa), FASTA scores: opt: 318, E(): 9.1e-13, (24.65% identity in 300 aa overlap); Q9K6X3|FTSX|BH3601 from Bacillus halodurans (298 aa), FASTA scores: opt: 290, E(): 5.2e-11, (22.75% identity in 299 aa overlap); etc. BELONGS TO THE FTSX FAMILY. Protein product from Mb3128c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3128c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TX91" /db_xref="InterPro:IPR003838" /db_xref="InterPro:IPR004513" /db_xref="InterPro:IPR040690" /db_xref="UniProtKB/Swiss-Prot:Q7TX91" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01754.1" /translation="MRFGFLLNEVLTGFRRNVTMTIAMILTTAISVGLFGGGMLVVRL ADSSRAIYLDRVESQVFLTEDVSANDSSCDTTACKALREKIETRSDVKAVRFLNRQQA YDDAIRKFPQFKDVAGKDSFPASFIVKLENPEQHKDFDTAMKGQPGVLDVLNQKELID RLFAVLDGLSSAAFAVALVQAIGAILLIANMVQVAAYTRRTEIGIMRLVGASRWYTQL PFLVEAMLAATMGVGIAVAGLMVVRALFLENALNQFYQANLIAKVDYADILFITPWLL LLGVAMSGLTAYLTLRLYVRR" CDS complement(3431639..3432328) /codon_start=1 /transl_table=11 /gene="ftsE" /locus_tag="BQ2027_MB3129C" /product="PUTATIVE CELL DIVISION ATP-BINDING PROTEIN FTSE (SEPTATION COMPONENT-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER)" /note="Mb3129c, ftsE, len: 229 aa. Equivalent to Rv3102c, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Putative ftsE, cell division protein, septation component transport ATP-binding protein ABC transporter (see citations below), equivalent to O32883|FTSE|ML0669 CELL DIVISION ATP-BINDING PROTEIN from Mycobacterium leprae (229 aa), FASTA scores: opt: 1384, E(): 2.4e-74, (91.7% identity in 229 aa overlap); and similar to Q9L1S6|FTSE from Streptomyces coelicolor (229 aa), FASTA scores: opt: 914, E(): 8.7e-47, (62.85% identity in 226 aa overlap); Q9A0S4|FTSE|SPY0644 from Streptococcus pyogenes (230 aa), FASTA scores: opt: 866, E(): 5.7e-44, (57.9% identity in 228 aa overlap); Q9CGX0|FTSE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (230 aa), FASTA scores: opt: 792, E(): 1.3e-39, (52.2% identity in 228 aa overlap); etc. Other relatives from Mycobacterium tuberculosis include: MTCY253.24; MTCY16B7.10; MTCY9C4.04c; MTCY50.01; MTCY05A6.09c; MTCY04C12.31. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and ABC transporters family signature (PS00211). BELONG TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3129c detected using shotgun mass spectrometry. Mb3129c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y512" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR005286" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y512" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01755.1" /translation="MITLDHVTKQYKSSARPALDDINVKIDKGEFVFLIGPSGSGKST FMRLLLAAETPTSGDVRVSKFHVNKLRGRHVPKLRQVIGCVFQDFRLLQQKTVYDNVA FALEVIGKRTDAINRVVPEVLETVGLSGKANRLPDELSGGEQQRVAIARAFVNRPLVL LADEPTGNLDPETSRDIMDLLERINRTGTTVLMATHDHHIVDSMRQRVVELSLGRLVR DEQRGVYGMDR" CDS complement(3432372..3432809) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3130C" /product="HYPOTHETICAL PROLINE-RICH PROTEIN" /note="Mb3130c, -, len: 145 aa. Equivalent to Rv3103c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 145 aa overlap). Hypothetical unknown pro-rich protein, with some similarity to Proline-rich proteins e.g. Q39789 PROLINE-RICH CELL WALL PROTEIN from Gossypium hirsutum (Upland cotton) (214 aa), FASTA scores: opt: 267, E(): 0.00014, (40% identity in 110 aa overlap). Equivalent to AAK47525 from M. mycobacterium strain CDC1551 (158 aa) but shorter 13 aa. Mb3130c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3U7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U7" /protein_id="SIU01756.1" /translation="MKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAP GPGDSPPTQVVPPGFVPDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAV PPPFELPPPFGPGTTTPTPPAPLPQPGPGPTAGTYPKSEPPTR" CDS complement(3432811..3433737) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3131C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3131c, -, len: 308 aa. Equivalent to Rv3104c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Possible conserved transmembrane protein, with some similarity to hypthetical proteins e.g. Q9L1X9|SC8E4A.26 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (408 aa), FASTA scores: opt: 514, E(): 4.3e-25, (35.2% identity in 287 aa overlap); Q9XA89|CF43A.26c HYPOTHETICAL 36.1 KDA PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 482, E(): 3.7e-23, (34.9% identity in 301 aa overlap); Q55987|SLR0765 HYPOTHETICAL 68.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (617 aa), FASTA scores: opt: 429, E(): 1.3e-19, (30.6% identity in 278 aa overlap); etc. Protein product from Mb3131c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3131c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3A1" /db_xref="InterPro:IPR006685" /db_xref="InterPro:IPR010920" /db_xref="InterPro:IPR011014" /db_xref="InterPro:IPR011066" /db_xref="InterPro:IPR023408" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3A1" /protein_id="SIU01757.1" /translation="MTTSGTVLATSIAQHWHNFWRGEIGDWILNRGLRIVMLLIAAVL AARFVTWLANRVTRRLDLGFTESDALVRSEATKHRQAVASVISWVSIVLIYVVVVYEV IDVLPVPVGALVGPAAVLGAALGFGAQRLVQDLLAGFFIIVEKQYGFGDLVELSMVGS PENAAGTVEDVTLRVTKLRSSEGEVFTVPNGNIVKSVNLSKDWARAVVDIPVPTSADL GRVNEVLHQECEHARHDSLLGELLLDEPTVMGVERIEVDTVTLRLVARTLPGKQFEAG RQLRVLVIRALTRAGIVTAADARAAVAESPEQ" CDS complement(3433727..3434863) /codon_start=1 /transl_table=11 /gene="prfB" /locus_tag="BQ2027_MB3132C" /product="PROBABLE PEPTIDE CHAIN RELEASE FACTOR 2 PRFB (RF-2)" /note="Mb3132c, prfB, len: 378 aa. Equivalent to Rv3105c, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 378 aa overlap). Probable prfB, peptide chain release factor 2, equivalent to O32885|RF2_MYCLE|ML0667|MLCB1779.24c from Mycobacterium leprae, FASTA scores: opt: 2197, E(): 1.8e-126, (90.05% identity in 372 aa overlap); and also similar to other peptide chain release factors e.g. Q9L1S3|PRFB from Streptomyces coelicolor (368 aa), FASTA scores: opt: 1674, E(): 1.2e-94, (69.3% identity in 365 aa overlap); O67695|RF2_AQUAE|PRFB|AQ_1840 from Aquifex aeolicus (373 aa), FASTA scores: opt: 1082, E(): 1.3e-58, (44.45% identity in 369 aa overlap); P28367|RF2_BACSU from B. subtilis (366 aa), FASTA scores: opt: 1030, E(): 1.9e-55, (44.0% identity in 359 aa overlap); etc. Also related to Q10605|MTCY373.19|RF1_MYCTU|Rv1299|MT1338 peptide chain release factor 1 (rf-1) (357 aa), FASTA scores: opt: 646, E(): 1.1e-34, (38.6% identity in 350 aa overlap). Contains prokaryotic-type class I peptide chain release factors signature (PS00745). BELONGS TO THE PROKARYOTIC AND MITOCHONDRIAL RELEASE FACTORS FAMILY. Protein product from Mb3132c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3132c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66027" /db_xref="InterPro:IPR000352" /db_xref="InterPro:IPR004374" /db_xref="InterPro:IPR005139" /db_xref="UniProtKB/Swiss-Prot:P66027" /protein_id="SIU01758.1" /translation="MPVTLAAVDPDRQADIAALDCTLTTVERVLDVEGLRSRIEKLEH EASDPHLWDDQTRAQRVTSELSHTQGELRRVEELRRRLDDLPVLYELAAEEAGAAAAD AVAEADAELKSLRADIEATEVRTLLSGEYDEREALVTIRSGAGGVDAADWAEMLMRMY IRWAEQHKYPVEVFDTSYAEEAGIKSATFAVHAPFAYGTLSVEQGTHRLVRISPFDNQ SRRQTSFAEVEVLPVVETTDHIDIPEGDVRVDVYRSSGPGGQSVNTTDSAVRLTHIPS GIVVTCQNEKSQLQNKIAAMRVLQAKLLERKRLEERAELDALKADGGSSWGNQMRSYV LHPYQMVKDLRTEYEVGNPAAVLDGDLDGFLEAGIRWRNRRNDD" CDS 3434967..3436337 /codon_start=1 /transl_table=11 /gene="fprA" /locus_tag="BQ2027_MB3133" /product="nadph:adrenodoxin oxidoreductase fpra (nadph-ferredoxin reductase)" /note="Mb3133, fprA, len: 456 aa. Equivalent to Rv3106, len: 456 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 456 aa overlap). Probable fprA, NADPH:adrenodoxin oxidoreductase (EC 1.18.1.2), equivalent to O32886|MLCB1779.25|FPRA|ML0666 from Mycobacterium leprae (456 aa), FASTA scores: opt: 2505, E(): 1.2e-142, (81,05% identity in 459 aa overlap); also similar to other NADPH:adrenodoxin oxidoreductases e.g. Q9RX19|DR0496 from Deinococcus radiodurans (479 aa), FASTA scores: opt: 1331, E(): 2.6e-72, (48.9% identity in 454 aa overlap); Q9RK35|SCF15.02 from Streptomyces coelicolor (454 aa), FASTA scores: opt: 1102, E(): 1.3e-58, (41.35% identity in 462 aa overlap); P82861 from Salvelinus fontinalis (Brook trout) (498 aa), FASTA scores: opt: 827, E(): 4e-42, (41.3% identity in 460 aa overlap); Q9V3T9|ADRO_DROME from Drosophila melanogaster (Fruit fly) (466 aa), FASTA scores: opt: 790, E(): 6.3e-40, (39.45% identity in 459 aa overlap); etc. Also similar to Q10547|FPRB|Rv0886|MT0909|MTCY31.14 from Mycobacterium tuberculosis strain H37Rv (575 aa), FASTA scores: opt: 894, E(): 4.4e-46, (42.05% identity in 459 aa overlap). Protein product from Mb3133 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3133 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y343" /db_xref="InterPro:IPR021163" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y343" /protein_id="SIU01759.1" /translation="MRPYYIAIVGSGPSAFFAAASLLKAADTTEDLDMAVDMLEMLPT PWGLVRSGVAPDHPKIKSISKQFEKTAEDPRFRFFGNVVVGEHVQPGELSERYDAVIY AVGAQSDRMLNIPGEDLPGSIAAVDFVGWYNAHPHFEQISPDLSGARAVVIGNGNVAL DVARILLTDPDVLARTDIADHALESLRPRGIQEVVIVGRRGPLQAAFTTLELRELADL DGVDVVIDPAELDGITDEDAAAVGKVCKQNIKVLRGYADREPRPGHRRMVFRFLTSPI EIKGKRKVERIVLGRNELVSDGSGRVAAKDTGEREELPAQLVVRSVGYRGVPTPGLPF DDQSGTIPNVGGRINGSPNEYVVGWIKRGPTGVIGTNKKDAQDTVDTLIKDLGNAKEG AECKSFPEDHADQVADWLAARQPKLVTSAHWQVIDAFERAAGEPHGRPRVKLASLAEL LRIGLG" CDS complement(3436338..3437921) /codon_start=1 /transl_table=11 /gene="agpS" /locus_tag="BQ2027_MB3134C" /product="POSSIBLE ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASE AGPS (ALKYL-DHAP SYNTHASE) (ALKYLGLYCERONE-PHOSPHATE SYNTHASE)" /note="Mb3134c, agpS, len: 527 aa. Equivalent to Rv3107c, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 527 aa overlap). Possible agpS, alkyl-dihydroxyacetonephosphate synthase (EC 2.5.1.26), similar to others and some various enzymes e.g. AAK46595|MT2311 PUTATIVE ALKYL-DIHYDROXYACETONEPHOSPHATE SYNTHASE from Mycobacterium tuberculosis strain CDC1551 (529 aa), FASTA scores: opt: 1052, E(): 2.1e-58, (37.1% identity in 542 aa overlap); Q9RJ97|SCF91.28c PUTATIVE FLAVOPROTEIN from Streptomyces coelicolor (530 aa), FASTA scores: opt: 972, E(): 2.2e-53, (36.2% identity in 544 aa overlap); O96759|ADAS_DICDI ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASE (EC 2.5.1.26) from Dictyostelium discoideum (Slime mold) (611 aa), FASTA scores: opt: 617, E(): 4.5e-31, (33.95% identity in 480 aa overlap); O97157|ADAS_TRYBB ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASE from Trypanosoma brucei (613 aa), FASTA scores: opt: 567, E(): 6.2e-28, (29.15% identity in 521 aa overlap); etc. Also similar to O53525|Rv2251|MTV022.01 HYPOTHETICAL 49.8 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (475 aa), FASTA scores: opt: 1019, E(): 2.3e-56, (38.6% identity in 487 aa overlap). BELONGS TO THE FAD-BINDING OXIDOREDUCTASE/TRANSFERASE FAMILY 4. COFACTOR: FAD (BY SIMILARITY). Protein product from Mb3134c detected using SWATH mass spectrometry. Mb3134c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y363" /db_xref="InterPro:IPR004113" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016164" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y363" /protein_id="SIU01760.1" /translation="MRSWWGWGTVEDALSDQETQALQSRVAALVSGHDLSDHPPPDLT ALGLAAPRVSPPASLAALCSSDLVDRAGHARGKAYRDIARNLQGQLDHLPDLIARPRS EQDVIDVLDWCAREGIAVIPYGGGSSVVGGVEPRFDEPVVTVDVTAMSAVLEIDRVSR AARIQAGAFGPSIEHQLRPHDLTLRHFPQSFGFSTLGGWLATRSGGHFATLYTHIDDL TESLRIVTPVGISESRRLPGSGAGPSPDRLFLGSEGTLGIITEAWMRLQHRPRWQVTV SVVFDDWAAAVAATRTIAQAGLYPANCRLLDPAEALLNAGTSVGGGLLVLAFESADHP IDPWLHRAVAITAEHGGTVTAQRSRGTTSDATEHNAAANWRSAFLRMPYQRDALVRRG VIAETFETACTWDGFDTLHAAVTDAARTAIWKVCGTGVVTCRFTHVYPDGPAPYYGIY AGGRWGSLDAQWDEIKAAVSEAISASGGTITHHHAVGRDHRAWYDRQRPDPFAAALRA AKSALDPAGILNPGVLLGR" mobile_element 3437873..3439307 /mobile_element_type="insertion sequence:IS1081" /locus_tag="BQ2027_IS1081-6" /note="IS1081-6, len: 1435 nt. Equivalent to IS1081, len: 1450 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1435 nt overlap)." CDS 3438020..3438460 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3135" /product="HYPOTHETICAL PROTEIN" /note="Mb3135, -, len: 146 aa. Equivalent to Rv3108, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 146 aa overlap). Hypothetical unknown protein." /db_xref="UniProtKB/TrEMBL:A0A1R3Y351" /protein_id="SIU01761.1" /translation="MTPNAASTGDSAKNTITGCCLITARALVARTRSISLPGMPFRMP ADYHNASSDEPTNRHPWPAPARCCRHEWRTMRRTNACDRRRFGLSLTIHEDACRIISV VPVVLEVRRAEPAHPATPYPEPLARCSRSPGLNESSHMSGRVPP" CDS 3438609..3439688 /codon_start=1 /transl_table=11 /gene="moaA1" /locus_tag="BQ2027_MB3136" /standard_name="moaA" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A MOAA1" /note="Mb3136, moaA1, len: 359 aa. Equivalent to Rv3109, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 359 aa overlap). Probable moaA1, molybdenum cofactor biosynthesis protein, highly similar to others e.g. P39757|MOAA_BACSU|NARA|NARAB from Bacillus subtilis (341 aa), FASTA scores: opt: 810, E(): 6.2e-44, (39.75% identity in 327 aa overlap); O67929|MOAA_AQUAE|AQ_2183 from Aquifex aeolicus (320 aa), FASTA scores: opt: 794, E(): 6e-43, (40.55% identity in 323 aa overlap); Q9ZIM6|MOAA_STACA from Staphylococcus carnosus (340 aa), FASTA scores: opt: 783, E(): 3.2e-42, (38.65% identity in 326 aa overlap); etc. Also highly similar to O53143|MOAA3|MOA3_MYCTU|MT3427 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 3 from Mycobacterium tuberculosis strain F4 (378 aa), FASTA scores: opt: 1762, E(): 4.7e-104, (74.3% identity in 350 aa overlap); and similar to O53881|MOA2_MYCTU|MOAA2|Rv0869c|MT0892|MTV043.6 2 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 2 from Mycobacterium tuberculosis strain H37Rv (360 aa), FASTA scores: opt: 657, E(): 3e-34, (36.55% identity in 309 aa overlap). BELONGS TO THE MOAA / NIFB / PQQE FAMILY. Note that previously known as moaA." /db_xref="GOA:Q7TX84" /db_xref="InterPro:IPR000385" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR010505" /db_xref="InterPro:IPR013483" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:Q7TX84" /protein_id="SIU01762.1" /translation="MSTPTLPDMVAPSPRVRVKDRCRRMMGDLRLSVIDQCNLRCRYC MPEEHYTWLPRQDLLSVKEISAIVDVFLSVGVSKVRITGGEPLIRPDLPEIVRTLSAK VGEDSGLRDLAITTNGVLLADRVDGLKAAGMKRITVSLDTLQPERFKAISQRNSHDKV IAGIKAVAAAGFTDTKIDTTVMRGANHDELADLIEFARTVNAEVRFIEYMDVGGATHW AWEKVFTKANMLESLEKRYGRIEPLPKHDTAPANRYALPDGTTFGIIASTTEPFCATC DRSRLTADGLWLHCLYAISGINLREPLRAGATHDDLVETVTTGWRRRADRGAEQRLAQ RERGVFLPLSTLKADPHLEMHTRGG" CDS 3439739..3440134 /codon_start=1 /transl_table=11 /gene="moaB1" /locus_tag="BQ2027_MB3137" /standard_name="moaB" /product="PROBABLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE MOAB1 (PHS) (4-ALPHA-HYDROXY-TETRAHYDROPTERIN DEHYDRATASE) (PTERIN-4-A-CARBINOLAMINE DEHYDRATASE) (PHENYLALANINE HYDROXYLASE-STIMULATING PROTEIN) (PHS) (PTERIN CARBINOLAMINE DEHYDRATASE) (PCD)" /note="Mb3137, moaB1, len: 131 aa. Equivalent to Rv3110, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Probable moaB1, pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), similar to others e.g. P73790|SSL2296 from Synechocystis sp. strain PCC 6803 (96 aa), FASTA scores: opt: 195, E(): 6.2e-07, (35.4% identity in 96 aa overlap); Q9PAB4|PHS_XYLFA|XF2604 from Xylella fastidiosa (116 aa), FASTA scores: opt: 187, E(): 2.6e-06, (36.25% identity in 102 aa overlap); AAK42360|Q97WM6|PHS_SULSO|SSO2187 from Sulfolobus solfataricus (114 aa), FASTA scores: opt: 177, E(): 1.3e-05, (34.6% identity in 78 aa overlap); etc. Also highly similar to AAK47768|MT3426 PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis CDC1551 (124 aa), FASTA scores: opt: 383, E(): 7.7e-20, (50.0% identity in 110 aa overlap). BELONGS TO THE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE FAMILY. Note that previously known as moaB." /db_xref="GOA:A0A1R3Y357" /db_xref="InterPro:IPR001533" /db_xref="InterPro:IPR036428" /db_xref="UniProtKB/TrEMBL:A0A1R3Y357" /protein_id="SIU01763.1" /translation="MTVSTPEQHEQRASHDASEGKHNVCQGRLAALADAAVSEKLGAL PGWQLLDMRLSRAFQCTNFDQSIDFMNRVASIANDINHHPDIAVLDKRSVRVTAWTRK LGYLTDIDFDLAASVEAMYATEFADRPAR" CDS 3440131..3440643 /codon_start=1 /transl_table=11 /gene="moaC1" /locus_tag="BQ2027_MB3138" /standard_name="moaC" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C MOAC1" /note="Mb3138, moaC1, len: 170 aa. Equivalent to Rv3111, len: 170 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 170 aa overlap). Probable moaC1, molybdopterin cofactor biosynthesis protein, highly similar to others e.g. Q9HX95|MOAC|PA3918 from Pseudomonas aeruginosa (160 aa), FASTA scores: opt: 576, E(): 2.2e-29, (62.1% identity in 153 aa overlap); Q9ZFA6|MOAC from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (159 aa), FASTA scores: opt: 541, E(): 3.4e-27, (59.85% identity in 157 aa overlap); BAB48171|MLR0616 from Rhizobium loti (Mesorhizobium loti) (160 aa), FASTA scores: opt: 531, E(): 1.5e-26, (58.75% identity in 160 aa overlap); P30747|MOAC_ECOLI|CHLA3|B0783 from Escherichia coli strain K12 (160 aa), FASTA scores: opt: 527, E(): 2.6e-26, (58.5% identity in 159 aa overlap); etc. Also highly similar to O53376|MOAC3|Rv3324c|MTV016.24c PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C 3 from Mycobacterium tuberculosis (177 aa), FASTA scores: opt: 738, E(): 1.7e-39, (71.5% identity in 165 aa overlap); AAK47767|MT3425 MOLYBDOPTERIN COFACTOR BIOSYNTHESIS PROTEIN C from Mycobacterium tuberculosis strain CDC1551 (184 aa), FASTA scores: opt: 734, E(): 3.1e-39, (71.8% identity in 163 aa overlap); and Rv0864|MOAC2|MTV043.57 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C 2 (167 aa). Note that previously known as moaC." /db_xref="GOA:P0A5K5" /db_xref="InterPro:IPR002820" /db_xref="InterPro:IPR023045" /db_xref="InterPro:IPR036522" /db_xref="UniProtKB/Swiss-Prot:P0A5K5" /protein_id="SIU01764.1" /translation="MIDHALALTHIDERGAARMVDVSEKPVTLRVAKASGLVIMKPST LRMISDGAAAKGDVMAAARIAGIAAAKRTGDLIPLCHPLGLDAVSVTITPCEPDRVKI LATTTTLGRTGVEMEALTAVSVAALTIYDMCKAVDRAMEISQIVLQEKSGGRSGVYRR SASDLACQSR" CDS 3440660..3440911 /codon_start=1 /transl_table=11 /gene="moaD1" /locus_tag="BQ2027_MB3139" /standard_name="moaD" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D MOAD1 (MOLYBDOPTERIN CONVERTING FACTOR SMALL SUBUNIT) (MOLYBDOPTERIN [MPT] CONVERTING FACTOR, SUBUNIT 1)" /note="Mb3139, moaD1, len: 83 aa. Equivalent to Rv3112, len: 83 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 83 aa overlap). Probable moaD1, molybdenum cofactor biosynthesis protein (molybdopterin converting factor (subunit 1)), similar to others e.g. Q9HJF0|TA1019 from Thermoplasma acidophilum (85 aa), FASTA scores: opt: 144, E(): 0.0012, (31.7% identity in 82 aa overlap); BAB59710|TVG0556526 from Thermoplasma volcanium (90 aa), FASTA scores: opt: 144, E(): 0.0012, (31.7% identity in 82 aa overlap); P30748|MOAD_ECOLI|CHLA4|CHLM|B0784 from Escherichia coli strain K12 (81 aa), FASTA scores: opt: 116, E(): 0.11, (36.9% identity in 84 aa overlap); etc. N-terminus also highly similar to to O53375|GPHA|Rv3323c|MTV016.23c MOAD-MOAE FUSION PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 333, E(): 2e-16, (65.05% identity in 83 aa overlap); and some similarity with Rv0868c|MTV043.61c|MOAD2 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D 2 (92 aa). Note that previously known as moaD." /db_xref="InterPro:IPR003749" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR016155" /db_xref="UniProtKB/TrEMBL:A0A1R3Y520" /protein_id="SIU01765.1" /translation="MIKVNVLYFGAVREACDETPREEVEVQNGTDVGNLVDQLQQKYP RLRDHCQRVQMAVNQFIAPLSTVLGDGDEVAFIPQVAGG" CDS 3441033..3441701 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3140" /product="POSSIBLE PHOSPHATASE" /note="Mb3140, -, len: 222 aa. Equivalent to Rv3113, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 222 aa overlap). Possible phosphatase (EC 3.1.3.-), with weak similarity to other phosphatases e.g. Q9KYY0|SCE33.02c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 368, E(): 1.2e-16, (32.9% identity in 222 aa overlap); and Q55039|GPH_SYNP7|CBBZ PHOSPHOGLYCOLATE PHOSPHATASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (212 aa), FASTA scores: opt: 176, E(): 0.00025, (24.7% identity in 182 aa overlap)." /db_xref="GOA:A0A1R3Y3V3" /db_xref="InterPro:IPR006439" /db_xref="InterPro:IPR023198" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="InterPro:IPR041492" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V3" /protein_id="SIU01766.1" /translation="MTSRDGFTIVCDWNGTLCDDRTILLDAVGQTLVNEGFEPLSQQQ LIQRFARPLRTFFENACGRDLLTSEWERVQSTFRRIYRSREAEVTLVEDAYDVLAQGN RSAAGQFLLSLAPHDELMHFVQKYGIAKWFNEIRGRTRPDQEKPMMLAELIMQRSLNP TRVVHIGDSLEDAAAASAVGAISVLVTGASRQPPDRVMLKQLQPFVASSLKQALQYAG GDGD" CDS 3441718..3442248 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3141" /product="Cytosine/adenosine deaminases" /note="Mb3141, -, len: 176 aa. Equivalent to Rv3114, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 176 aa overlap). Conserved hypothetical protein, with some similarity to Q9F9W7 CYTOSINE DEAMINASE from Bifidobacterium longum (143 aa), FASTA scores: opt: 207, E(): 2.2e-07, (37.05% identity in 108 aa overlap); and Q9RV23|DR1207 CELL CYCLE PROTEIN MESJ, PUTATIVE/CYTOSINE DEAMINASE-RELATED PROTEIN from Deinococcus radiodurans (600 aa), FASTA scores: opt: 212, E(): 3.5e-07, (33.35% identity in 177 aa overlap). Equivalent to AAK47536|MT3196 CYTIDINE AND DEOXYCYTIDYLATE DEAMINASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (187 aa) but shorter 11 aa." /db_xref="GOA:A0A1R3Y3B1" /db_xref="InterPro:IPR002125" /db_xref="InterPro:IPR016193" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B1" /protein_id="SIU01767.1" /translation="MVAARLPFGWPADSGVTADIIEAAMELAIDTARHATAPFGAALL DVTTLRAFSGGNTYFESGDRFAHAETNVLRAAMSTLPELSNHVLISTAEPCPMCAAAS VLSGVRAIIFGTSIETLIQCGWFQIRISASDVVAASTRPTRPSVYSGFLSHKTDLLYR NSENRRAMNPWTDPSH" repeat_region 3442277..3442284 /rpt_type=DIRECT /note="8 bp direct repeat, AGGAGGAG, flanking IS element IS1081." gene 3442285..3443719 /locus_tag="BQ2027_IS1081-6" repeat_region 3442358..3442372 /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRL,TCGCGTGATCCTTCG, flanking IS element IS1081." CDS 3442410..3443657 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3142" /product="PROBABLE TRANSPOSASE" /note="Mb3142, -, len: 415 aa. Equivalent to Rv3115, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 415 aa overlap). Probable IS1081 transposase, similar to others. Has transposases, mutator family, signature (PS01007). Other copies are MTCY10G2.02c, MTCY441.35, MTCY77.03c." /db_xref="GOA:P60231" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/Swiss-Prot:P60231" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01768.1" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" repeat_region complement(3443667..3443681) /rpt_type=INVERTED /note="15 bp perfect inverted repeat, IRR,TCGCGTGATCCTTCG, flanking IS element IS1081." repeat_region 3443720..3443727 /rpt_type=DIRECT /note="8 bp direct repeat, AGGAGGAG, flanking IS element IS1081." CDS 3443735..3444904 /codon_start=1 /transl_table=11 /gene="moeB2" /locus_tag="BQ2027_MB3143" /standard_name="moeB" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN MOEB2 (MPT-SYNTHASE SULFURYLASE) (MOLYBDOPTERIN SYNTHASE SULPHURYLASE)" /note="Mb3143, moeB2, len: 389 aa. Equivalent to Rv3116, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 389 aa overlap). Probable moeB2, molybdopterin cofactor biosynthesis protein, equivalent to Q9CCG8|MOEZ|ML0817 PROTEIN PROBABLY INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS from Mycobacterium leprae (395 aa), FASTA scores: opt: 1433, E(): 8e-80, (57.8% identity in 384 aa overlap). Very similar to members of the HESA/MOEB/THIF family e.g. Q9FCL0|2SC3B6.02 PUTATIVE SULFURYLASE from Streptomyces coelicolor (392 aa), FASTA scores: opt: 1562, E(): 1.1e-87, (58.15% identity in 380 aa overlap); Q9XC37|PDTORFF MOEB-LIKE PROTEIN (PUTATIVE SULFURYLASE) from Pseudomonas stutzeri (Pseudomonas perfectomarina) (391 aa), FASTA scores: opt: 1311, E(): 2.1e-72, (52.4% identity in 395 aa overlap); O54307|MPT|MOEB MPT-SYNTHASE SULFURYLASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (391 aa), FASTA scores: opt: 1238, E(): 5.7e-68, (51.4% identity in 393 aa overlap); P74344|MOEB|SLL1536 MOLYBDOPTERIN BIOSYNTHESIS MOEB PROTEIN from Synechocystis sp. strain PCC 6803 (392 aa), FASTA scores: opt: 1212, E(): 2.2e-66, (46.5% identity in 398 aa overlap); etc. Also highly similar to O05860|MTCY07D11.20|MOEB1|Rv3206c PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis strain H37Rv (392 aa), FASTA scores: opt: 1445, E(): 1.5e-80, (56.25% identity in 400 aa overlap). BELONGS TO THE HesA /MoeB/ThiF FAMILY. Note that previously known as moeB. Protein product from Mb3143 detected using SWATH mass spectrometry. Mb3143 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y352" /db_xref="InterPro:IPR000594" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR035985" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/TrEMBL:A0A1R3Y352" /protein_id="SIU01769.1" /translation="MTEALIPAPSQISLTRDEVRRYSRHLIIPDIGVNGQQRLKDARV LCIGAGGLGSPALLYLAAAGVGTIGIIDGDHVDESNLQRQIIHGTSDVGRPKVESAAE AVAEINPHVRVTQYREMLTHDNALEIFGDHDLIVDGTDNFTTRYLINDAAVLAGKPYV WGSIYRFNGQTSVFWPGRGPCYRCLHPAPPPPGLVPSCAEGGVLGAICATIASIQVTE VLKLLTGVGTPLVGRLLMYEALDATYHQIRIAKNPDCAICGDAPTITELVDDSVSCAS TQSVDPELVISCDELRTKQQSDQNFLLVDVREPAEFDIAHIPGSILIPKGEIGSAAGL AQLPLDKEIVLYCKSGIRSAQALTTLKAAGLHNVKHLDGGIAEWTRTIDSSLLVY" CDS 3444933..3445895 /codon_start=1 /transl_table=11 /gene="cyp141" /locus_tag="BQ2027_MB3144" /product="probable cytochrome p450 141 cyp141" /note="Mb3144, cysA3, len: 320 aa. Equivalent to 5' end of Rv3117 and 3' end of Rv3121, len: 277 aa and 400 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap and 56.7% identity in 150 aa overlap). Probable cysA3, thiosulfate sulfurtransferase (EC 2.8.1.1), equivalent to Q50036|CYSA|CYSA3|ML2198|THTR_MYCLE PUTATIVE SULFURTRANSFERASE THIOSULFATE from Mycobacterium leprae (277 aa). Also highly similar to other putative thiosulfate sulfurtransferases e.g. P16385|THTR_SACER|CYSA from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa), FASTA scores: opt: 1442, E(): 1.7e-84, (75.55% identity in 274 aa overlap); Q9RXT9DR0217|DR0217 from Deinococcus radiodurans (286 aa), FASTA scores: opt: 1046, E(): 2.6e-59, (53.8% identity in 275 aa overlap); Q9HMT7|TSSA|VNG2393G from Halobacterium sp. strain NRC-1 (293 aa), FASTA scores: opt: 1030, E(): 2.7e-58, (56.1% identity in 278 aa overlap); Q9Y8N8|APE2595 from Aeropyrum pernix (218 aa), FASTA scores: opt: 808, E(): 2.7e-44, (53.5% identity in 215 aa overlap); etc. Identical second copy present as Rv0815c|AL022004|MTV043.07c|MT0837|O05793| cysA2 (277 aa) (100.0% identity in 277 aa overlap). Also shows some similarity to P96888|THT2_MYCTU|SSEA|Rv3283|MT3382|MTCY71.23 PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium tuberculosis (297 aa), FASTA scores: opt: 955, E(): 1.6e-53, (50.2% identity in 271 aa overlap); and Q59570|THT3_MYCTU|SSEB|Rv2291|MT2348|MTCY339.19c PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium tuberculosis (284 aa), FASTA scores: E(): 1.4e-14, (26.7% identity in 292 aa overlap). Contains rhodanese active site and C-terminal signatures (PS00380, PS00683). BELONGS TO THE RHODANESE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large deletion of 2775 bp (RD12) leads to the loss of the COOH part of cysA3, the following CDSs, sseC1, moaE1, Rv3120 and a large part of cyp141 except the COOH end, compared to the homolog in Mycobacterium tuberculosis strain H37Rv. Protein product from Mb3144 detected using SWATH mass spectrometry. Mb3144 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TX80" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/Swiss-Prot:Q7TX80" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01770.1" /translation="MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIK LDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTVILYGGNNNWFAAYAYWYFKLYG HEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNL IDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLY ADAGLDNSKETIAYCRIGERSSHTWFVLRELLGHQNVNIAFGYGPHACPASAYSRMCL TTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT" CDS 3446273..3446743 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3145" /product="HYPOTHETICAL PROTEIN" /note="Mb3145, -, len: 155 aa. Equivalent to Rv3122, len: 155 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 155 aa overlap). Hypothetical unknown protein. Mb3145 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y361" /protein_id="SIU01771.1" /translation="MYSGCWINNQNGETRVGEDSLEDLEQRRARLYDQLAATGDFRRG SISENYRRCGKPNCVCAQEGHPGHGPRYLWTRTVAGRGTKGRQLSVEEVDKVRAELAN YHRFAQVSEQIVAVNEAICEARPPNPAATAPPAGTTGHKKGGSATRSRRSSPPS" CDS 3446753..3447247 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3146" /product="HYPOTHETICAL PROTEIN" /note="Mb3146, -, len: 164 aa. Equivalent to Rv3123, len: 164 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 164 aa overlap). Hypothetical unknown protein, but N-terminus shares weak similarity with N-terminal part of O93439|CMESO-1 BHLH TRANSCRIPTION FACTOR from Gallus gallus (Chicken) (287 aa), FASTA scores: opt: 129, E(): 0.81, (38.75% identity in 80 aa overlap). Mb3146 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y355" /protein_id="SIU01772.1" /translation="MRSRSVRWDPRCRPGRSGVGDPHCDDPAGLLAAGAAAGRQHRAP GPAHRLRARALRVVRRLPRQEPRYRAGPGPVAPRLLPLPHLRAWDGAPWIWNLATAIL PEATPIVDLYHARQHVHDLAGQLAPALGEHHSDWLTARLVDLDSGDIETLVQQPIGQH TGHT" CDS 3447690..3448559 /codon_start=1 /transl_table=11 /gene="moar1" /locus_tag="BQ2027_MB3147" /product="transcriptional regulatory protein moar1" /note="Mb3147, -, len: 289 aa. Equivalent to Rv3124, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Probable transcriptional regulatory protein, similar to many Streptomyces and Mycobacterium tuberculosis regulatory proteins e.g. Q11052|YC67_MYCTU|Rv1267c|MT1305|MTCY50.15 from Mycobacterium tuberculosis strain H37Rv (388 aa), FASTA scores: opt: 963, E(): 2e-56, (55.15% identity in 252 aa overlap); O53145 from Mycobacterium tuberculosis (381 aa); P71484|EMBR from Mycobacterium avium (384 aa), FASTA scores: opt: 859, E(): 1.5e-49, (52.2% identity in 249 aa overlap); Q9XCC3|TYLT from Streptomyces fradiae (404 aa), FASTA scores: opt: 462, E(): 3.1e-23, (35.05% identity in 254 aa overlap); Q9XCC4|TYLS from Streptomyces fradiae (277 aa), FASTA scores: opt: 456, E(): 5.6e-23, (33.45% identity in 269 aa overlap); etc. Start chosen by similarity, alternative possible (see AAK47548 from Mycobacterium tuberculosis strain CDC1551, longer N-terminus (311 aa))." /db_xref="GOA:A0A1R3Y367" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR005158" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y367" /protein_id="SIU01773.1" /translation="MQFNVLGPLELNLRGTKLPLGTPKQRAVLAMLLLSRNQVVAADA LVQAIWEKSPPARARRTVHTYICNLRRTLSDAGVDSRNILVSEPPGYRLLIGDRQQCD LDRFVAAKESGLRASAKGYFSEAIRYLDSALQNWRGPVLGDLRSFMFVQMFSRALTED ELLVHTKLAEAAIACGRADVVIPKLERLVAMHPYRESLWKQLMLGYYVNEYQSAAIDA YHRLKSTLAEELGVEPAPTIRALYHKILRQLPMDDLVGRVTRGRVDLRGGNGAKVEEL TESDKDLLPIGLA" CDS complement(3448660..3449835) /codon_start=1 /transl_table=11 /gene="PPE49" /locus_tag="BQ2027_MB3148C" /product="ppe family protein ppe49" /note="Mb3148c, PPE49, len: 391 aa. Equivalent to Rv3125c, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 391 aa overlap). Member of the M. tuberculosis PPE family, similar to other e.g. P95247|Rv2352c|MTCY98.21c (391 aa), FASTA scores: opt: 1576, E(): 3.8e-72, (62.55% identity in 398 aa overlap), MTCY98.0029c, MTCY03A2.22c, MTCY10G2.10, MTCY02B10.25c, MTCI364.08, M TCY21C12.09c, MTCY48.17." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y376" /protein_id="SIU01774.1" /translation="MVLGFSWLPPEINSARMFAGAGSGPLFAAASAWEGLAVDLWASA SSFESVLAALTTGPWTGPASMSMAAAASPYVGWLSTVASQAQLAAIQARAAATAFEAA LAATVHPTAVTANRVSLASLIAANVLGQNTPAIAATEFDYLEMWAQDVAAMVGYHAGA KSVAATLAPFSLPPVSLAGLAAQVGTQVAGMATTASAAVTPVVEGAMASVPTVMSGMQ SLVSQLPLQHASMLFLPVRILTSPITTLASMARESATRLGPPAGGLAAANTPNPSGAA IPAFKPLGGRELGAGMSAGLGQAQLVGSMSVPPTWQGSIPISMASSAMSGLGVPPNPV ALTQAAGAAGGGMPMMLMPMSISGAGAGMPGGLMDRDGAGWHVTQARLTVIPRTGVG" CDS complement(3449992..3450306) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3149C" /product="HYPOTHETICAL PROTEIN" /note="Mb3149c, -, len: 104 aa. Equivalent to Rv3126c, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Hypothetical unknown protein. Shortened version of MTCY164.36c, avoiding overlap." /db_xref="UniProtKB/TrEMBL:A0A1R3Y521" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01775.1" /translation="MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAV DELSALSFDALTTPERDRARRDRDHHPWSRSRSQLSPRMAHGAVHQCQWPKAVWAVID NP" CDS 3450331..3451365 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3150" /product="Nitroreductase" /note="Mb3150, -, len: 344 aa. Equivalent to Rv3127, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 344 aa overlap). Hypothetical protein, highly similar to Mycobacterium tuberculosis protein O53476|Rv2032|MTV018.19 (331 aa), FASTA scores: opt: 1212, E(): 6e-69, (56.7% identity in 321 aa overlap), and also similar to P95195|MTCY03A2.27c (332 aa), FASTA scores: opt: 521, E(): 1.6e-25; (35.0% identity in 326 aa overlap). Some similarity to C-terminal half of hypothetical Mycobacterium tuberculosis proteins. Protein product from Mb3150 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3150 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3W2" /db_xref="InterPro:IPR000415" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01776.1" /translation="MLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTV PATDHSGRQAIISCGAVLDHLRIAMTAAHWQANITRFPQPNQPDQLATVECSPIDHVT AGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAMLDVVSDDQRTRLVVA SQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTLASDAERLRVDLGRDFPVRSYQN RRAELADDRSKVLVLSTPSDTRADALRCGEVLSTILLECTMAGMATCTLTHLIESSDS RDIVRGLTRQRGEPQALIRVGIAPPLAAVPAPTPRRPLDSVLQIRQTPEKGRNASDRN ARETGWFSPP" CDS complement(3451352..3451702) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3151C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3151c, -, len: 116 aa. Equivalent to 3' end of Rv3128c, len: 337 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 116 aa overlap). Conserved hypothetical protein, similar to other conserved hypothetical proteins. This ORF corresponds to a fusion of MTCY164.38 and MTCY164.39c. Has in-frame amber stop codon but is similar throughout its length to Rv2807|MTCY16B7.36c|Z81331 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 954, E(): 0, (47.2% identity in 339 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3128c exists as a single gene with an in-frame amber stop codon. In Mycobacterium bovis, Rv3128c is split into 2, Mb3151c and Mb3152c." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01777.1" /translation="MLNRMWKLVNDRLNYLTPTIKPIGYASSADGRRRRLYDAPQTPL DRPLAARVLSAAQQADLITYRDSLNPAQIGRKIADLQNRLLILAKEKTEQLYLANIPT ALPDIHKGILIKAG" CDS complement(3451784..3452365) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3152C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3152c, -, len: 193 aa. Equivalent to 5' end of Rv3128c, len: 337 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 193 aa overlap). Conserved hypothetical protein, similar to other conserved hypothetical proteins. This ORF corresponds to a fusion of MTCY164.38 and MTCY164.39c. Has in-frame amber stop codon but is similar throughout its length to Rv2807|MTCY16B7.36c|Z81331 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 954, E(): 0, (47.2% identity in 339 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3128c exists as a single gene with an in-frame amber stop codon. In Mycobacterium bovis, Rv3128c is split into 2, Mb3151c and Mb3152c." /db_xref="GOA:A0A1R3Y3I6" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01778.1" /translation="MWSASGGQCGKYLAASMVLQLDGLERHGVLEFGRDRYGPEVREE LLAMSAASIDRYLKTAKAKDQISGVSTTKPSPLLRNSIKVRRAGDEVEAEPGFFEGDT VAHCGPTLKGEFAHTLNLTDVHIGWVFTRTVRNNARTHILAGLKASVTEIPHGITGLD FDNGTGFLNKPVISWAGDNGIYFTRFRPYKKNH" CDS 3452844..3453176 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3153" /product="Putative DNA-binding protein" /note="Mb3153, -, len: 110 aa. Equivalent to Rv3129, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Conserved hypothetical protein, with some similarity to various hypothetical proteins from Streptomyces coelicolor e.g. Q9RI34|SCJ12.26 HYPOTHETICAL 14.5 KDA PROTEIN (137 aa), FASTA scores: opt: 141, E(): 0.0016, (39.3% identity in 84 aa overlap); Q9RI49|SCJ12.09c HYPOTHETICAL 15.8 KDA PROTEIN (146 aa), FASTA scores: opt: 141, E(): 0.0017, (38.05% identity in 92 aa overlap); Q9RJ05|SCJ1.09C POSSIBLE DNA-BINDING PROTEIN (233 aa), FASTA scores: opt: 140, E(): 0.0029, (34.85% identity in 89 aa overlap); Q9XA48|SCGD3.31c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1 BETA SUBUNIT (334 aa); etc. Mb3153 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y362" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR024747" /db_xref="UniProtKB/TrEMBL:A0A1R3Y362" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01779.1" /translation="MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVK VRAQVLTTDAGVREAERAQLLPWTATLKRHCVRVIPWEITGRHFRFGPEPDRSQTFAC EASSHNQR" CDS complement(3453159..3454550) /codon_start=1 /transl_table=11 /gene="tgs1" /locus_tag="BQ2027_MB3154C" /product="triacylglycerol synthase (diacylglycerol acyltransferase) tgs1" /note="Mb3154c, -, len: 463 aa. Equivalent to Rv3130c, len: 463 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 463 aa overlap). Conserved hypothetical protein, similar to several other hypothetical Mycobacterium tuberculosis strain H37Rv proteins e.g. O06795|YH60_MYCTU|Rv1760|MTCY28.26 HYPOTHETICAL 54.1 KDA PROTEIN (502 aa), FASTA scores: opt: 586, E(): 9.8e-29, (28.95% identity in 463 aa overlap). Protein product from Mb3154c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3154c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A651" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/Swiss-Prot:P0A651" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01780.1" /translation="MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSS LAQRLRPCTRFGQRLRLRPFDLGAPKWVDDPDFDLGRHVWRIALPRPGNEDQLFELIA DLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLAGLSDESMSDS FASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIVRAAKGASEIAAGVLSPAA SSLNGPISDLRRYSAAKVPLADVEQVCRKFDVTINDVALAAITESYRNVLIQRGERPR FDSLRTLVPVSTRSNSALSKTDNRVSLMLPNLPVDQENPLQRLRIVHSRLTRAKAGGQ RQFGNTLMAIANRLPFPMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDL YPVSPIAMQLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRR KVTRRRGALSLVV" CDS 3454735..3455733 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3155" /product="Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2" /note="Mb3155, -, len: 332 aa. Equivalent to Rv3131, len: 332 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 332 aa overlap). Hypothetical protein, similar to other hypothetical bacterial proteins e.g. O53476|Rv2032|MTV018.19 (331 aa), FASTA scores: opt: 568, E(): 2.5e-27, (36.7% identity in 321 aa overlap); O05800|Rv3127|MTCY164.37 (344 aa), FASTA scores: opt: 521, E(): 1.9e-24, (34.95% identity in 326 aa overlap); Q9RI33|SCJ12.27c from Streptomyces coelicolor (335 aa), FASTA scores: opt: 441, E(): 1.3e-19, (35.75% identity in 319 aa overlap); Q9RI44|SCJ12.14 from Streptomyces coelicolor (309 aa), FASTA scores: opt: 328, E(): 9.3e-13, (27.9% identity in 308 aa overlap); Q9CBP5|ML1751 from Mycobacterium leprae (721 aa), FASTA scores: opt: 137, E(): 0.78, (26.15% identity in 298 aa overlap); etc. Equivalent to AAK47555 from Mycobacterium tuberculosis strain CDC1551 but shorter 12 aa. Protein product from Mb3155 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3155 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y371" /db_xref="InterPro:IPR000415" /db_xref="UniProtKB/TrEMBL:A0A1R3Y371" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01781.1" /translation="MNTHFPDAETVRTVLTLAVRAPSIHNTQPWRWRVCPTSLELFSR PDMQLRSTDPDGRELILSCGVALHHCVVALASLGWQAKVNRFPDPKDRCHLATIGVQP LVPDQADVALAAAIPRRRTDRRAYSCWPVPGGDIALMAARAARGGVMLRQVSALDRMK AIVAQAVLDHVTDEEYLRELTIWSGRYGSVAGVPARNEPPSDPSAPIPGRLFAGPGLS QPSDVLPADDGAAILALGTETDDRLARLRAGEAASIVLLTATAMGLACCPITEPLEIA KTRDAVRAEVFGAGGYPQMLLRVGWAPINADPLPPTPRRELSQVVEWPEELLRQRC" CDS complement(3455713..3457449) /codon_start=1 /transl_table=11 /gene="devS" /locus_tag="BQ2027_MB3156C" /product="TWO COMPONENT SENSOR HISTIDINE KINASE DEVS" /note="Mb3156c, devS, len: 578 aa. Equivalent to Rv3132c, len: 578 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 578 aa overlap). devS, membrane-bound two component sensor histidine kinase (EC 2.7.3.-) (see citations below; dev for Differentially Expressed in Virulent strain), similar to others two component sensors e.g. Q9RI43|SCJ12.15c PUTATIVE TWO-COMPONENT SENSOR from Streptomyces coelicolor (585 aa), FASTA scores: opt: 1305, E(): 2.5e-69, (41.35% identity in 573 aa overlap); Q9ZBY4|SCD78.15 PUTATIVE TWO COMPONENT SENSOR from Streptomyces coelicolor (560 aa), FASTA scores: opt: 1194, E(): 8.1e-63, (41.05% identity in 558 aa overlap); O85371|CPRS TWO COMPONENT REGULATOR from Rhodococcus sp (563 aa), FASTA scores: opt: 803, E(): 8.3e-40, (38.4% identity in 552 aa overlap); Q9L094|SCC24.23 PUTATIVE TWO-COMPONENT SENSOR HISTIDINE KINASE from Streptomyces coelicolor (similarity only in C-terminus for this one); etc. Also highly similar to mycobacterium O53473|Rv2027c|MTV018.14c PUTATIVE MEMBRANE PROTEIN (573 aa), FASTA scores: opt: 2333, E(): 7.6e-130, (61.45% identity in 576 aa overlap). Protein product from Mb3156c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3156c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y364" /db_xref="InterPro:IPR003018" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR011712" /db_xref="InterPro:IPR029016" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3Y364" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01782.1" /translation="MTTGGLVDENDGAAMRPLRHTLSQLRLHELLVEVQDRVEQIVEG RDRLDGLVEAMLVVTAGLDLEATLRAIVHSATSLVDARYGAMEVHDRQHRVLHFVYEG IDEETVRRIGHLPKGLGVIGLLIEDPKPLRLDDVSAHPASIGFPPYHPPMRTFLGVPV RVRDESFGTLYLTDKTNGQPFSDDDEVLVQALAAAAGIAVANARLYQQAKARQSWIEA TRDIATELLSGTEPATVFRLVAAEALKLTAADAALVAVPVDEDMPAADVGELLVIETV GSAVASTVGRTIPVAGAVLREVFVNGIPRRVDRVDLEGLDELADAGPALLLPLRARGT VAGVVVVLSQGGPGAFTDEQLEMMAAFADQAALAWQLATSQRRMRELDVLTDRDRIAR DLHDHVIQRLFAIGLALQGAVPHERNPEVQQRLSDVVDDLQDVIQEIRTTIYDLHGAS QGITRLRQRIDAAVAQFADSGLRTSVQFVGPLSVVDSALADQAEAVVREAVSNAVRHA KASTLTVRVKVDDDLCIEVTDNGRGLPDEFTGSGLTNLRQRAEQAGGEFTLASVPGAS GTVLRWSAPLSQ" CDS complement(3457446..3458099) /codon_start=1 /transl_table=11 /gene="devR" /locus_tag="BQ2027_MB3157C" /product="TWO COMPONENT TRANSCRIPTIONAL REGULATORY PROTEIN DEVR (PROBABLY LUXR/UHPA-FAMILY)" /note="Mb3157c, devR, len: 217 aa. Equivalent to Rv3133c, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). devR, two component transcriptional regulator (see first citation below; dev for Differentially Expressed in Virulent strain), highly similar to several e.g. O85372|CPRR TWO COMPONENT REGULATOR from Rhodococcus sp. (212 aa), FASTA scores: opt: 868, E(): 6.2e-46, (65.05% identity in 206 aa overlap); Q9RI42|SCJ12.16c PUTATIVE LUXR FAMILY TWO-COMPONENT RESPONSE REGULATOR from Streptomyces coelicolor (233 aa), FASTA scores: opt: 849, E(): 9.7e-45, (60.55% identity in 218 aa overlap); Q9XA59|SCGD3.19 PUTATIVE TWO-COMPONENT SYSTEM RESPONSE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (218 aa), FASTA scores: opt: 835, E(): 6.5e-44, (61.55% identity in 208 aa overlap); and similar to others. Contains bacterial regulatory proteins, LuxR family signature (PS00622) near C-terminus as seen in bvgA, comA, dctR, degU, evgA, fimZ, fixJ, gacA, glpR, narL, narP, nodW, rcsB and uhpA. Helix-turn-helix motif at 166-187 (+3.15 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. Protein product from Mb3157c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3157c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y377" /db_xref="InterPro:IPR000792" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR016032" /db_xref="UniProtKB/TrEMBL:A0A1R3Y377" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01783.1" /translation="MVKVFLVDDHEVVRRGLVDLLGADPELDVVGEAGSVAEAMARVP AARPDVAVLDVRLPDGNGIELCRDLLSRMPDLRCLILTSYTSDEAMLDAILAGASGYV VKDIKGMELARAVKDVGAGRSLLDNRAAAALMAKLRGAAEKQDPLSGLTDQERTLLGL LSEGLTNKQIADRMFLAEKTVKNYVSRLLAKLGMERRTQAAVFATELKRSRPPGDGP" CDS complement(3458127..3458933) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3158C" /product="universal stress protein family protein" /note="Mb3158c, -, len: 268 aa. Equivalent to Rv3134c, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 268 aa overlap). Conserved Ala-, Val-rich protein (see citations below), related to other hypothetical Mycobacterium tuberculosis proteins e.g. O53474|Rv2028c|MTV018.15c (279 aa), FASTA scores: opt: 562, E(): 3.2e-28, (40.65% identity in 273 aa overlap); O06188|Rv2624c|MTCY01A10.08 (272 aa), FASTA scores: opt: 458, E(): 1.1e-21, (36.55% identity in 271 aa overlap); O53472|R2026c|MTV018.13c (294 aa), FASTA scores: opt: 232, E(): 1.9e-07, (30.45% identity in 276 aa overlap); etc. Shares some similarity with other hypothetical proteins from Streptomyces coelicolor e.g. Q9RIZ8|SCJ1.16c (294 aa), FASTA scores: opt: 207, E(): 6.9e-06, (28.9% identity in 263 aa overlap); Q9K4L5|SC5F8.09 PUTATIVE STRESS-INDUCIBLE PROTEIN (312 aa), FASTA scores: opt: 204, E(): 1.1e-05, (28.4% identity in 271 aa overlap); etc. Equivalent to AAK47558|MT3220 Universal stress protein family from Mycobacterium tuberculosis strain CDC1551 (268 aa). Rv3134c seems cotranscribed with devR-devS (see second citation below). Protein product from Mb3158c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3158c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006015" /db_xref="InterPro:IPR006016" /db_xref="UniProtKB/TrEMBL:A0A1R3Y386" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01784.1" /translation="MSDPRPARAVVVGIDGSRAATHAALWAVDEAVNRDIPLRLVYVI DPSQLSAAGEGGGQSAARAALHDASRKVEATGQPVKIETEVLCGRPLTKLMQESRSAA MLCVGSVGLDHVRGRRGSVAATLAGSALCPVAVIHPSPAEPATTSQVSAVVAEVDNGV VLRHAFEEARLRGVPLRAVAVHAAETPDDVEQGSRLAHVHLSRRLAHWTRLYPEVRVD RAIAGGSACRHLAANAKPGQLFVADSHSAHELCGAYQPGCAVLTVRSANL" CDS 3459518..3460663 /codon_start=1 /transl_table=11 /gene="PPE50" /locus_tag="BQ2027_MB3159" /product="ppe family protein ppe50" /note="Mb3159, PPE50, len: 381 aa. Similar to 5' end of Rv3135, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (88.5% identity in 131 aa overlap). Member of the Mycobacterium tuberculosis Ala-, Gly-rich PPE family, similar to P95190|Rv3136|MTCY03A2.22c (380 aa), FASTA scores: opt: 494, E(): 6.7e-25, (57.25% identity in 131 aa overlap) (next ORF downstream), MTY21C12_9, MTCY3C7_24, MTCI125_27, MTV049_12, MTV049_9, MTV049_11, MTCY274_24 etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 1337 bp insertion leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (381 aa versus 132 aa). Protein product from Mb3159 detected using SWATH mass spectrometry. Mb3159 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y534" /protein_id="SIU01785.1" /translation="MDYAFLPPEINSARMYSGPGPNSMLVAAASWDALAAELASAAEN YGSVIARLTGMHWWGPASTSMLAMSAPYVEWLERTAAQTKQTATQARAAAAAFEQAHA MTVPPALVTANRAELKALIASNLLGQNTAAIAAIEAQYAEMWAQDAAAMYGYATTSAA ARQLTPFSSPQQTTNPAGLAAQNAAVTQAATNSAGNTPTALSQLSSFLSQAVEAPTGW PNILPDDFTILDGILAAYATVGVTQDIESICAGIIGAENNLGLLGAASENPAELAPGA FGIDAALSSAEKGAAASMHDAVLASAGRAGSIGPMSVPPSWATPSSTPVSALSGAGLT TLDGTDVAEHGTPGLPGVPAGTDKRASGVIPRYGVRLTVMSRPPAAG" CDS 3461315..3462457 /codon_start=1 /transl_table=11 /gene="PPE51" /locus_tag="BQ2027_MB3160" /product="ppe family protein ppe51" /note="Mb3160, PPE51, len: 380 aa. Equivalent to Rv3136, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 380 aa overlap). Member of the Mycobacterium tuberculosis Ala-, Gly-rich PPE family, similar to Q9AGF0|Ov2770c Rv2770c-LIKE PROTEIN from M. microti (397 aa), FASTA scores: opt: 917, E(): 9e-41, (46.15% identity in 388 aa overlap); O33312|Rv2770c|MTV002.35c, MTV002_36, MTCI125_26, MTCY10G2_10, MTCI364_8, MTV049_28, MTV049_29, etc. TBparse score is 0.923. Protein product from Mb3160 detected using SWATH mass spectrometry. Mb3160 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01786.1" /translation="MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEA YGSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYA MTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAA AALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIETVTQALQALTIPSFIPEDFT FLDAIFAGYATVGVTQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSA TSPGGGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLT TLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG" CDS complement(3462466..3462798) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3160A" /product="Conserved protein" /note="Mb3160A, len: 110 aa. Equivalent to Rv3136A len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Mb3160A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3E3" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E3" /protein_id="SIU01787.1" /translation="MGWEFGVLLILIAVLAVFLAPRLIPRGPRGDLASGTLLVTGVSP RPDAGGQQYVTIAGIITGPTVNEYAVYQRMAVDVDQWPTVGQILPVVYSPKNPDNWTF TPNGPPVG" CDS 3462914..3463696 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3161" /product="PROBABLE MONOPHOSPHATASE" /note="Mb3161, -, len: 260 aa. Equivalent to Rv3137, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 260 aa overlap). Probable monophosphatase (EC 3.1.3.-), equivalent to O32889|MLCB1779_19|ML0662 PUTATIVE MONOPHOSPHATASE from Mycobacterium leprae (255 aa), FASTA scores: opt: 1403, E(): 1.2e-81, (81.8% identity in 253 aa overlap). Also similar to Q9K4B1|SC7E4.05c from Streptomyces coelicolor (266 aa), FASTA scores: opt: 969, E(): 3.5e-54, (57.9% identity in 259 aa overlap); Q53743|PUR3 MONO-PHOSPHATASE from Streptomyces lipmanii (Streptomyces alboniger) (273 aa), FASTA scores: opt: 862, E(): 2.1e-47, (55.25% identity in 257 aa overlap); BAB50023|MLL3039 MONO-PHOSPHATASE from Rhizobium loti (Mesorhizobium loti) (262 aa), FASTA scores: opt: 448, E(): 3.2e-21, (31.37% identity in 255 aa overlap); etc. Contains inositol monophosphatase family signature 1 (PS00629). Protein product from Mb3161 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3K2" /db_xref="InterPro:IPR000760" /db_xref="InterPro:IPR011809" /db_xref="InterPro:IPR020583" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K2" /protein_id="SIU01788.1" /translation="MSHDDLMLALALADRADELTRVRFGALDLRIDTKPDLTPVTDAD RAVESDVRQTLGRDRPGDGVLGEEFGGSTTFTGRQWIVDPIDGTKNFVRGVPVWASLI ALLEDGVPSVGVVSAPALQRRWWAARGRGAFASVDGARPHRLSVSSVAELHSASLSFS SLSGWARLGLRERFIGLTDTVWRVRAYGDFLSYCLVAEGAVDIAAEPQVSVWDLAALD IVVREAGGRLTSLDGVAGPHGGSAVATNGLLHDEVLTRLNAG" CDS 3463716..3464804 /codon_start=1 /transl_table=11 /gene="pflA" /locus_tag="BQ2027_MB3162" /product="PROBABLE PYRUVATE FORMATE LYASE ACTIVATING PROTEIN PFLA (FORMATE ACETYLTRANSFERASE ACTIVATING ENZYME) ([PYRUVATE FORMATE-LYASE] ACTIVATING ENZYME)" /note="Mb3162, pflA, len: 362 aa. Equivalent to Rv3138, len: 362 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 362 aa overlap). Probable pflA, pyruvate formate lyase activating protein (EC 1.97.1.4), similar to other e.g. Q9V0N1|PAB1859 from Pyrococcus abyssi (348 aa), FASTA scores: opt: 926, E(): 1.1e-52, (39.95% identity in 343 aa overlap); O27446|MTH1395 from Methanobacterium thermoautotrophicum (335 aa), FASTA scores: opt: 909, E(): 1.3e-51, (42.2% identity in 327 aa overlap); O28939|AF1330 from Archaeoglobus fulgidus (336 aa), FASTA scores: opt: 884, E(): 5.6e-50, (42.0% identity in 319 aa overlap); etc. Also similar to O50099|PH1391 HYPOTHETICAL 40.2 KDA PROTEIN from Pyrococcus horikoshii (348 aa), FASTA scores: opt: 934, E(): 3.3e-53, (40.5% identity in 343 aa overlap); and other hypothetical proteins. TBparse score is 0.881. Mb3162 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y372" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR016431" /db_xref="InterPro:IPR027596" /db_xref="InterPro:IPR034457" /db_xref="UniProtKB/TrEMBL:A0A1R3Y372" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01789.1" /translation="MSDPFTIATKHWHRLHDSRIQCDVCPRACKLHEGQRGLCFVRGR FDDQVKLTSYGRSSGFCVDPIEKKPLNHFLPGSATLSFGTAGCNLACKFCQNWDISKS REIDVLANRAAPADIARTAHELGCRSVAFTYNDPTIFWEYAADVADACHDQGIKAVAV TAGYMCPEPRAEFYRRVDAANVDLKAFTEDFYRKVCVSHLRNVLDTLAYLRHQTNVWL EITTLLIPGRNDSDAEVAAECRWIRENLGVDVPVHFTASHPDYKMMDTPATLPATLTR AREIGIGEGLRFVYTGNVHDAVGGSTSCPGCRATVIVRDWYSIRHYALTEDGRCQACG YQMPGVYDGPAGHWGQRRLPLLTSLSRM" CDS 3464884..3466290 /codon_start=1 /transl_table=11 /gene="fadE24" /locus_tag="BQ2027_MB3163" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE24" /note="Mb3163, fadE24, len: 468 aa. Equivalent to Rv3139, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 468 aa overlap). Probable fadE24, acyl-CoA dehydrogenase (1.3.99.-), equivalent to O32890|MLCB1779.30|FADE24|ML0661 PUTATIVE ACYL-CoA DEHYDROGENASE from Mycobacterium leprae (465 aa), FASTA scores: opt: 2587, E(): 4e-153, (83.6% identity in 464 aa overlap). Similar to other e.g. Q9HUH0|PA4995 from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 1139, E(): 2.8e-63, (45.3% identity in 426 aa overlap); Q9K6D0|MMGC|BH3799 from Bacillus halodurans (379 aa), FASTA scores: opt: 603, E(): 4.7e-30, (30.3% identity in 366 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 601, E(): 6.3e-30, (32.25% identity in 363 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 2 (PS00073) near C-terminus. BELONGS TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.881. Protein product from Mb3163 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3163 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y393" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y393" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01790.1" /translation="MTNTTSAANAAKPSGARTDRRGRTTGVGLAPHKRTGIDVALALL TPIVGQEFLDKYRLRDPLNRSLRYGVKTMFATAGAATRQFQRVQGLRGGPTRLKSSGR DYFDLTPDDDQKLIIETVDEFAEEVLRPAAHDADDAATYPSDLTAKAAELGITAINIP EDFDGIAEHRSSVTNVLVAEALAYGDMGLALPILAPGGVASALTHWGSADQQATYLKE FAGENVPQACVAITEPQPLFDPTRLKTTAVRTPSGYRLDGVKSLIPAAADAELFIVGA QLGGKPALFIVESAASGLTVKADPSMGIRGAALGQVELCGVSVPLNARLGEDEASDND YSEALALARLGWAALAVGTSHAVLDYVVPYVKQRQAFGEPIAHRQAVAFMCANIAIEL DGLRLITWRGASRAEQGLPFAREAALAKRLGSDKGMQIGLDGVQLLGGHGYTKEHPVE RWYRDLRAIGVAEGVVVI" CDS 3466311..3467516 /codon_start=1 /transl_table=11 /gene="fadE23" /locus_tag="BQ2027_MB3164" /product="PROBABLE ACYL-CoA DEHYDROGENASE FADE23" /note="Mb3164, fadE23, len: 401 aa. Equivalent to Rv3140, len: 401 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 401 aa overlap). Probable fadE23, acyl-CoA dehydrogenase (1.3.99.-) (see citation below), equivalent to O32891|MLCB1779.31|FADE23|ML0660 PUTATIVE ACYL-CoA DEHYDROGENASE from Mycobacterium leprae (400 aa), FASTA scores: opt: 2307, E(): 3e-136, (89.5% identity in 401 aa overlap). Also similar to others e.g. Q9HUH1|PA4994 from Pseudomonas aeruginosa (402 aa), FASTA scores: opt: 1558, E(): 1.2e-89, (61.0% identity in 400 aa overlap); O31251 from Acinetobacter sp. ADP1 (401 aa), FASTA scores: opt: 1509, E(): 1.3e-86, (58.2% identity in 402 aa overlap); Q9K6D1|ACDA OR BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 612, E(): 8.4e-31, (38.2% identity in 293 aa overlap); Q9AHX9|FADFX from Pseudomonas putida (375 aa), FASTA scores: opt: 584, E(): 4.6e-29, (32.7% identity in 379 aa overlap); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.890. Protein product from Mb3164 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3164 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y382" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR036250" /db_xref="UniProtKB/TrEMBL:A0A1R3Y382" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01791.1" /translation="MAINLELPRKLQAIIVKTHQGAAEMMRPIARKYDLKEHAYPVEL DTLINLFEGAAESFNFAGAHSLRDEDEGKDENHNGANMAAVVQTMEASWGDVAMMLSL PYQGLGNAAISAVATDEQLERLGKVWAAMAITEPEFGSDSAAVSTTATLDGDEYVING EKIFVTAGSRATHIVVWATLDKSLGRPAIKSFIVPREHPGVTVERLEHKLGIKGSDTA VIRFDNARIPKGNLLGNPEIEVGKGFAGVMETFDNTRPIVAAMAVGIGRAALEEIRSV LTGAGVEISYDKPSHTQSAAAAEFLRMEADWEASYLLSLRAAWQADNNIPNSKEASMS KAKAGRMASDVTCKTVELAGTTGYSEQSLLEKWARDSKILDIFEGTQQIQQLVVARRL LGLSSSELK" CDS 3467616..3468587 /codon_start=1 /transl_table=11 /gene="fadB4" /locus_tag="BQ2027_MB3165" /product="PROBABLE NADPH QUINONE OXIDOREDUCTASE FADB4 (NADPH:QUINONE REDUCTASE) (ZETA-CRYSTALLIN)" /note="Mb3165, fadB4, len: 323 aa. Equivalent to Rv3141, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). Probable fadB4, quinone oxidoreductase (EC 1.6.5.5), showing strong similarity to variety of quinone oxidoreductases and domains in polyketide and fatty acid synthases e.g. Q9HTV6|PA5234 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (325 aa), FASTA scores: opt: 737, E(): 1.4e-35, (39.65% identity in 328 aa overlap); Q9RYQ7|DRA0251 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Deinococcus radiodurans (336 aa), FASTA scores: opt: 688, E(): 1e-32, (40.6% identity in 325 aa overlap); Q9RVG8|DR1061 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Deinococcus radiodurans (388 aa), FASTA scores: opt: 559, E(): 3.3e-25, (36.3% identity in 325 aa overlap); BAB49685|MLL2594 PROBABLE QUINONE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (326 aa), FASTA scores: opt: 519, E(): 5.9e-23, (34.25% identity in 330 aa overlap); Q9LXZ4|T5P19_110 QUINONE REDUCTASE-LIKE PROTEIN from Arabidopsis thaliana (348 aa), FASTA scores: opt: 517, E(): 8.1e-23, (33.55% identity in 322 aa overlap); etc. Also similar to Q9AA38|CC0770 ZINC-CONTAINING ALCOHOL DEHYDROGENASE from Caulobacter crescentus (325 aa), FASTA scores: opt: 673, E(): 7.2e-32, (40.2% identity in 326 aa overlap); and Q9ABX4|CC0096 ZINC-CONTAINING ALCOHOL DEHYDROGENASE from Caulobacter crescentus (332 aa), FASTA scores: opt: 623, E(): 5.7e-29, (40.7% identity in 334 aa overlap). Also resembles Mycobacterium tuberculosis proteins P96826|Rv0149|MTCI5_23, MTCY13D12.11, MTCY24G1.03, MTCY19H9.01. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, QUINONE OXIDOREDUCTASE SUBFAMILY. TBparse score is 0.904. Thought to be differentially expressed within host cells (see first citation below). Protein product from Mb3165 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3165 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y374" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y374" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01792.1" /translation="MRAVRVTRLEGPDAVEVAEVEEPTSAGVVIEVHAAGVAFPDALL TRGRYQYRPEPPFVLGAEIAGVVRSAPDNSQVRSGDRVVGLTMLTGGMAEVAVLSPER VFKLPDNMTFEAGAGVLFNDLTVYFALAVRGRLQAGETVLVHGAAGGIGTSTLRLAPA LGASRTVAVVSTQEKAELATVAGATDVVLAEGFKDAVQELTNGRGVDIVVDPVGGDRF TDSLRSLAAGGRLLVIGFTGGEIPTVKVNRLLLNNIDVVGVGWGAWSLTHPDALAQQW SQLERLLRSGKLPPPEPVVYPLDQAAAAIASLENRTAKGKVVLRVRD" CDS complement(3468639..3469067) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3166C" /product="HYPOTHETICAL PROTEIN" /note="Mb3166c, -, len: 142 aa. Equivalent to Rv3142c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 142 aa overlap). Hypothetical unknown protein. Equivalent to AAK47569 from Mycobacterium tuberculosis strain CDC1551 but shorter 33 aa. Protein product from Mb3166c detected using SWATH mass spectrometry. Mb3166c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y388" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01793.1" /translation="MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLT LPAIETSPAEVVAIDPNDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVH PDDRVTAWELYGKYHGYAACLAPGKLRVVRQDVADANGDQ" CDS 3469175..3469576 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3167" /product="PROBABLE RESPONSE REGULATOR" /note="Mb3167, -, len: 133 aa. Equivalent to Rv3143, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Probable response regulator, similar to other sensory transduction regulatory proteins e.g. Q9X810|SC6G10.25 from Streptomyces coelicolor (133 aa), FASTA scores: opt: 474, E(): 2.8e-24, (54.15% identity in 120 aa overlap); Q9KZ82|SCE25.04c from Streptomyces coelicolor (225 aa), FASTA scores: opt: 144, E(): 0.016, (32.3% identity in 127 aa overlap); Q9RZT4|DRB0029 from Deinococcus radiodurans (416 aa), FASTA scores: opt: 145, E(): 0.024, (30.65% identity in 124 aa overlap). SIMILAR TO OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. Protein product from Mb3167 detected using shotgun mass spectrometry. Mb3167 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y395" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR011006" /db_xref="UniProtKB/TrEMBL:A0A1R3Y395" /protein_id="SIU01794.1" /translation="MPDSSTALRILVYSDNVQTRERVMRALGKRLHPDLPDLTYVEVA TGPMVIRQMDRGGIDLAILDGEATPTGGMGIAKQLKDELASCPPILVLTGRPDDTWLA SWSRAEAAVPHPVDPIVLGRTVLSLLRAPAH" CDS complement(3469609..3470838) /codon_start=1 /transl_table=11 /gene="PPE52" /locus_tag="BQ2027_MB3168C" /product="ppe family protein ppe52" /note="Mb3168c, PPE52, len: 409 aa. Equivalent to Rv3144c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 409 aa overlap). Member of the M. tuberculosis PPE family, Gly-, Ala-rich, similar to others e.g. P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 1007, E(): 5.2e-35, (56.2% identity in 306 aa overlap); and MTV014_3, MTCY6G11_5, MTCY98.0034c, MTCY31.06c, MTCY48.17, MTCY98.0029c, MTCY03C7.17c, etc. Protein product from Mb3168c detected using SWATH mass spectrometry. Mb3168c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y535" /protein_id="SIU01795.1" /translation="MSFVVLPPEINSLRMFIGAGTAPMLAAAAAWDGLAEELGTAAQS FASVTAGLAGQAWQGPAALAMAAAAAPYAGWLTAAAAQSAGAAGQARAVASIFEAAQA ATVLPAAVAANRDAFVQLVMTNLFGQNAPLIAAAEGVYEEMWAADVAAMSGYYSGASA IAAQVVPWASLLQRFPGLGAGATGATGGESVGTGATGGESVGTGGGESVGTGGATASG GGVGYVGGGVASAGLAAGDPAHGSVGQGNFGGGDVGAGDVVASSATSAHAGVVSPGFI GAPLALAALGQMARGGTNSAPGTATESARAPEPAASAPPEAVVEVPELEVPAMGVLPT VDPKVAAKAAPLSTTRVGQSAGSGIPESTLRTAQGQQASETSAAEETAPSLRPEAAAG QLRPRVRQDPKIQMRGG" CDS 3471203..3471589 /codon_start=1 /transl_table=11 /gene="nuoA" /locus_tag="BQ2027_MB3169" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN A) NUOA (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN A)" /note="Mb3169, nuoA, len: 128 aa. Equivalent to Rv3145, len: 128 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 128 aa overlap). Probable nuoA, integral membrane NADH dehydrogenase, chain A (EC 1.6.5.3), similar to others e.g. Q9XAQ4|NUOA from Streptomyces coelicolor (119 aa), FASTA scores: opt: 405, E(): 5.4e-20, (68.75% identity in 128 aa overlap); Q9RU86|DR1506 from Deinococcus radiodurans (160 aa), FASTA scores: opt: 327, E(): 9e-15, (40.3% identity in 124 aa overlap); BAB47039|NDHC from Triticum aestivum (Wheat), FASTA scores: opt: 273, E(): 2.6e-11, (38.1% identity in 126 aa overlap); etc. Also similar to a NADH-PLASTOQUINONE OXIDOREDUCTASES e.g. P26303|NU3C_WHEAT|NDHC from Triticum aestivum (Wheat) (120 aa), FASTA scores: opt: 273, E(): 2.6e-1, (38.1% identity in 126 aa overlap). BELONGS TO THE COMPLEX I SUBUNIT 3 FAMILY. TBparse score is 0.895. Protein product from Mb3169 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3169 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65564" /db_xref="InterPro:IPR000440" /db_xref="InterPro:IPR023043" /db_xref="InterPro:IPR038430" /db_xref="UniProtKB/Swiss-Prot:P65564" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01796.1" /translation="MNVYIPILVLAALAAAFAVVSVVIASLVGPSRFNRSKQAAYECG IEPASTGARTSIGPGAASGQRFPIKYYLTAMLFIVFDIEIVFLYPWAVSYDSLGTFAL VEMAIFMLTVFVAYAYVWRRGGLTWD" CDS 3471598..3472152 /codon_start=1 /transl_table=11 /gene="nuoB" /locus_tag="BQ2027_MB3170" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN B) NUOB (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN B)" /note="Mb3170, nuoB, len: 184 aa. Equivalent to Rv3146, len: 184 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 184 aa overlap). Probable nuoB, NADH dehydrogenase, chain B (EC 1.6.5.3), similar to others e.g. Q9XAQ5|NUOB from Streptomyces coelicolor (184 aa), FASTA scores: opt: 989, E(): 1.4e-56, (78.25% identity in 184 aa overlap); Q56218|NQO6_THETH|NQO6 from Thermus aquaticus (subsp. thermophilus) (181 aa), FASTA scores: opt: 720, E(): 2.6e-39, (64.45% identity in 152 aa overlap); Q9RU87|DR1505 from Deinococcus radiodurans (181 aa), FASTA scores: opt: 719, E(): 3e-39, (62.6% identity in 155 aa overlap); etc. BELONGS TO THE COMPLEX I 20 KDA SUBUNIT FAMILY. MAY CONTAIN AN IRON-SULFUR 4FE-4S CLUSTER. TBparse score is 0.912. Protein product from Mb3170 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3170 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65576" /db_xref="InterPro:IPR006137" /db_xref="InterPro:IPR006138" /db_xref="UniProtKB/Swiss-Prot:P65576" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01797.1" /translation="MGLEEQLPGGILLSTVEKVAGYVRKNSLWPATFGLACCAIEMMA TAGPRFDIARFGMERFSATPRQADLMIVAGRVSQKMAPVLRQIYDQMAEPKWVLAMGV CASSGGMFNNYAIVQGVDHVVPVDIYLPGCPPRPEMLLHAILKLHEKIQQMPLGINRE RAIAEAEEAALLARPTIEMRGLLR" CDS 3472149..3472859 /codon_start=1 /transl_table=11 /gene="nuoC" /locus_tag="BQ2027_MB3171" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN C) NUOC (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN C)" /note="Mb3171, nuoC, len: 236 aa. Equivalent to Rv3147, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Probable nuoC, NADH dehydrogenase, chain C (EC 1.6.5.3), similar to others e.g. Q9XAQ6|NUOC from Streptomyces coelicolor (251 aa), FASTA scores: opt: 1113, E(): 2.6e-64, (67.35% identity in 236 aa overlap); Q9A6X2|CC1954 from Caulobacter crescentus (197 aa), FASTA scores: opt: 351, E(): 1.6e-15, (41.65% identity in 132 aa overlap); BAB48757|MLL1369 from Rhizobium loti (Mesorhizobium loti) (201 aa), FASTA scores: opt: 347, E(): 3e-15, (42.4% identity in 132 aa overlap); etc. Also similar to Q9UUU0|NUGM NUGM PROTEIN PRECURSOR (EC 1.6.99.3) from Yarrowia lipolytica (Candida lipolytica) (281 aa), FASTA scores: opt: 356, E(): 1.1e-15, (34.55% identity in 162 aa overlap). Also similar to MTCY251.05, FASTA score: E():4.9e-05. Equivalent to AAK47574 from Mycobacterium tuberculosis strain CDC1551 but longer 26 aa. BELONGS TO THE COMPLEX I 30 KDA SUBUNIT FAMILY. TBparse score is 0.893. Protein product from Mb3171 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3171 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65572" /db_xref="InterPro:IPR001268" /db_xref="InterPro:IPR010218" /db_xref="InterPro:IPR037232" /db_xref="UniProtKB/Swiss-Prot:P65572" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01798.1" /translation="MSPPNQDAQEGRPDSPTAEVVDVRRGMFGVSGTGDTSGYGRLVR QVVLPGSSPRPYGGYFDDIVDRLAEALRHERVEFEDAVEKVVVYRDELTLHVRRDLLP RVAQRLRDEPELRFELCLGVSGVHYPHETGRELHAVYPLQSITHNRRLRLEVSAPDSD PHIPSLFAIYPTNDWHERETYDFFGIIFDGHPALTRIEMPDDWQGHPQRKDYPLGGIP VEYKGAQIPPPDERRGYN" CDS 3472859..3474181 /codon_start=1 /transl_table=11 /gene="nuoD" /locus_tag="BQ2027_MB3172" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN D) NUOD (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN D)" /note="Mb3172, nuoD, len: 440 aa. Equivalent to Rv3148, len: 440 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 440 aa overlap). Probable nuoD, NADH dehydrogenase, chain B (EC 1.6.5.3), similar to others e.g. Q9XAQ7|NUOD from Streptomyces coelicolor (440 aa), FASTA scores: opt: 2198, E(): 1e-131, (73.9% identity in 429 aa overlap); P15689|NUCM_PARTE from Paramecium tetraurelia (400 aa), FASTA scores: opt: 922, E(): 5.8e-51, (38.5% identity in 408 aa overlap); Q9RU89|NUOD_DEIRA|DR1503 from Deinococcus radiodurans (401 aa), FASTA scores: opt: 922, E(): 5.8e-51, (47.75% identity in 404 aa overlap); etc. Equivalent to AAK47575 from Mycobacterium tuberculosis strain CDC1551 but longer 42 aa. Contains helix-turn-helix motif at aa 340-361. BELONGS TO THE COMPLEX I 49 KDA SUBUNIT FAMILY. Protein product from Mb3172 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3172 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65570" /db_xref="InterPro:IPR001135" /db_xref="InterPro:IPR014029" /db_xref="InterPro:IPR022885" /db_xref="InterPro:IPR029014" /db_xref="InterPro:IPR038290" /db_xref="UniProtKB/Swiss-Prot:P65570" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01799.1" /translation="MTAIADSAGGAGETVLVAGGQDWQQVVDAARSADPGERIVVNMG PQHPSTHGVLRLILEIEGETVVEARCGIGYLHTGIEKNLEYRYWTQGVTFVTRMDYLS PFFNETAYCLGVEKLLGITDEIPERVNVIRVLMMELNRISSHLVALATGGMELGAMTP MFVGFRAREIVLTLFEKITGLRMNSAYIRPGGVAQDLPPNAATEIAEALKQLRQPLRE MGELLNENAIWKARTQGVGYLDLTGCMALGITGPILRSTGLPHDLRKSEPYCGYQHYE FDVITDDSCDAYGRYMIRVKEMWESMKIVEQCLDKLRPGPTMISDRKLAWPADLQVGP DGLGNSPKHIAKIMGSSMEALIHHFKLVTEGIRVPAGQVYVAVESPRGELGVHMVSDG GTRPYRVHYRDPSFTNLQSVAAMCEGGMVADLIAAVASIDPVMGGVDR" CDS 3474178..3474936 /codon_start=1 /transl_table=11 /gene="nuoE" /locus_tag="BQ2027_MB3173" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN E) NUOE (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN E)" /note="Mb3173, nuoE, len: 252 aa. Equivalent to Rv3149, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Probable nuoE, NADH dehydrogenase, chain E (EC 1.6.5.3), similar to others e.g. Q9XAQ8|NUOE from Streptomyces coelicolor (290 aa), FASTA scores: opt: 1002, E(): 5.7e-55, (69.5% identity in 213 aa overlap); P40915|NUHM_NEUCR|NUO-24 from Neurospora crassa (263 aa), FASTA scores: opt: 412, E(): 1.9e-18, (38055% identity in 192 aa overlap); P19234|NUHM_RAT from Rattus norvegicus (Rat) (241 aa), FASTA scores: opt: 410, E(): 2.4e-18, (23.9% identity in 237 aa overlap); etc. BELONGS TO THE COMPLEX I 24 KDA SUBUNIT FAMILY. BINDS A 2FE-2S CLUSTER (POTENTIAL). Protein product from Mb3173 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3173 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65574" /db_xref="InterPro:IPR002023" /db_xref="InterPro:IPR036249" /db_xref="InterPro:IPR041921" /db_xref="InterPro:IPR042128" /db_xref="UniProtKB/Swiss-Prot:P65574" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01800.1" /translation="MTQPPGQPVFIRLGPPPDEPNQFVVEGAPRSYPPDVLARLEVDA KEIIGRYPDRRSALLPLLHLVQGEDSYLTPAGLRFCADQLGLTGAEVSAVASFYTMYR RRPTGEYLVGVCTNTLCAVMGGDAIFDRLKEHLGVGHDETTSDGVVTLQHIECNAACD YAPVVMVNWEFFDNQTPESARELVDSLRSDTPKAPTRGAPLCGFRQTSRILAGLPDQR PDEGQGGPGAPTLAGLQVARKNDMQAPPTPGADE" CDS 3474933..3476270 /codon_start=1 /transl_table=11 /gene="nuoF" /locus_tag="BQ2027_MB3174" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN F) NUOF (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN F)" /note="Mb3174, nuoF, len: 445 aa. Equivalent to Rv3150, len: 445 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 445 aa overlap). Probable nuoF, NADH dehydrogenase, chain F (EC 1.6.5.3), similar to others e.g. Q9XAQ9|NUOF_STRCO from Streptomyces coelicolor (449 aa), FASTA scores: opt: 2314, E(): 3.5e-139, (76.25% identity in 434 aa overlap); NUF2_RHIME from Rhizobium meliloti (421 aa), FASTA scores: opt: 1545, E(): 1.8e-90, (53.1% identity in 424 aa overlap); Q9RU92|DR1500 from Deinococcus radiodurans (444 aa), FASTA scores: opt: 1445, E(): 4.1e-84, (52.9% identity in 427 aa overlap); etc. Contains respiratory-chain NADH dehydrogenase 51 Kd subunit signature 2 (PS00645). BELONGS TO THE COMPLEX I 51 KDA SUBUNIT FAMILY. COFACTOR: FMN AND ONE 4FE-4S CLUSTER (PROBABLE). Protein product from Mb3174 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3174 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65568" /db_xref="InterPro:IPR001949" /db_xref="InterPro:IPR011537" /db_xref="InterPro:IPR011538" /db_xref="InterPro:IPR019554" /db_xref="InterPro:IPR019575" /db_xref="InterPro:IPR037207" /db_xref="InterPro:IPR037225" /db_xref="UniProtKB/Swiss-Prot:P65568" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01801.1" /translation="MTTQATPLTPVISRHWDDPESWTLATYQRHDRYRGYQALQKALT MPPDDVISIVKDSGLRGRGGAGFATGTKWSFIPQGDTGAAAKPHYLVVNADESEPGTC KDIPLMLATPHVLIEGVIIAAYAIRAHHAFVYVRGEVVPVLRRLHNAVAEAYAAGFLG RNIGGSGFDLELVVHAGAGAYICGEETALLDSLEGRRGQPRLRPPFPAVAGLYGCPTV INNVETIASVPSIILGGIDWFRSMGSEKSPGFTLYSLSGHVTRPGQYEAPLGITLREL LDYAGGVRAGHRLKFWTPGGSSTPLLTDEHLDVPLDYEGVGAAGSMLGTKALEIFDET TCVVRAVRRWTEFYKHESCGKCTPCREGTFWLDKIYERLETGRGSHEDIDKLLDISDS ILGKSFCALGDGAASPVMSSIKHFRDEYLAHVEGGGCPFDPRDSMLVANGVDA" CDS 3476267..3478687 /codon_start=1 /transl_table=11 /gene="nuoG" /locus_tag="BQ2027_MB3175" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN G) NUOG (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN G)" /note="Mb3175, nuoG, len: 806 aa. Equivalent to Rv3151, len: 806 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 806 aa overlap). Probable nuoG, NADH dehydrogenase I, chain G (EC 1.6.5.3), similar to others e.g. Q9XAR0|NUOG_STRCO from Streptomyces coelicolor (843 aa), FASTA scores: opt: 1968, E(): 5.2e-107, (62.45% identity in 818 aa overlap); P56914|NUG2_RHIME from Rhizobium meliloti (853 aa), FASTA scores: opt: 964, E(): 1.6e-48, (30.6% identity in 840 aa overlap); etc. But also similarity with other proteins e.g. P77908|FDHA FORMATE DEHYDROGENASE, ALPHA SUBUNIT (EC 1.2.1.43) (FORMATE DEHYDROGENASE [NADP+]) from Moorella thermoacetica (Clostridium thermoaceticum) (893 aa), FASTA scores: opt: 928, E(): 2e-46, (28.65% identity in 865 aa overlap); and Q9UUU3|NUAM NUAM PROTEIN PRECURSOR (EC 1.6.99.3) from Yarrowia lipolytica (Candida lipolytica) (728 aa), FASTA scores: opt: 894, E(): 1.7e-44, (31.95% identity in 676 aa overlap). Equivalent to AAK47578 from Mycobacterium tuberculosis strain CDC1551 but longer 15 aa. Contains respiratory-chain NADH dehydrogenase 75 kDa subunit signature 2 (PS00642). BELONGS TO THE COMPLEX I 75 KDA SUBUNIT FAMILY. COFACTOR: MAY BIND TWO 4FE-4S CLUSTER AND ONE 2FE-2S CLUSTER. TBparse score is 0.887. Protein product from Mb3175 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3175 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59962" /db_xref="InterPro:IPR000283" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006657" /db_xref="InterPro:IPR006963" /db_xref="InterPro:IPR009010" /db_xref="InterPro:IPR010228" /db_xref="InterPro:IPR019574" /db_xref="InterPro:IPR036010" /db_xref="UniProtKB/Swiss-Prot:P59962" /protein_id="SIU01802.1" /translation="MTQAADTDIRVGQPEMVTLTIDGVEISVPKGTLVIRAAELMGIQ IPRFCDHPLLEPVGACRQCLVEVEGQRKPLASCTTVATDDMVVRTQLTSEIADKAQHG VMELLLINHPLDCPMCDKGGECPLQNQAMSNGRTDSRFTEAKRTFAKPINISAQVLLD RERCILCARCTRFSDQIAGDPFIDMQERGALQQVGIYADEPFESYFSGNTVQICPVGA LTGTAYRFRARPFDLVSSPSVCEHCASGCAQRTDHRRGKVLRRLAGDDPEVNEEWNCD KGRWAFTYATQPDVITTPLIRDGGDPKGALVPTSWSHAMAVAAQGLAAARGRTGVLVG GRVTWEDAYAYAKFARITLGTNDIDFRARPHSAEEADFLAARIAGRHMAVSYADLESA PVVLLVGFEPEDESPIVFLRLRKAARRHRVPVYTIAPFATGGLHKMSGRLIKTVPGGE PAALDDLATGAVGDLLATPGAVIMVGERLATVPGGLSAAARLADTTGARLAWVPRRAG ERGALEAGALPTLLPGGRPLADEVARAQVCAAWHIAELPAAAGRDADGILAAAADETL AALLVGGIEPADFADPDAVLAALDATGFVVSLELRHSAVTERADVVFPVAPTTQKAGA FVNWEGRYRTFEPALRGSTLQAGQSDHRVLDALADDMGVHLGVPTVEAAREELAALGI WDGKHAAGPHIAATGPTQPEAGEAILTGWRMLLDEGRLQDGEPYLAGTARTPVVRLSP DTAAEIGAADGEAVTVSTSRGSITLPCSVTDMPDRVVWLPLNSAGSTVHRQLRVTIGS IVKIGAGS" CDS 3478803..3480035 /codon_start=1 /transl_table=11 /gene="nuoH" /locus_tag="BQ2027_MB3176" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN H) NUOH (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN H)" /note="Mb3176, nuoH, len: 410 aa. Equivalent to Rv3152, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 410 aa overlap). Probable nuoH, integral membrane NADH dehydrogenase I, chain H (EC 1.6.5.3), similar to others e.g. Q9XAR1 Q9XAR1|NUOH from Streptomyces coelicolor (467 aa), FASTA scores: opt: 1630, E(): 3.4e-90, (58.35% identity in 413 aa overlap); Q9RU94|DR1498 from Deinococcus radiodurans (397 aa), FASTA scores: opt: 1081, E(): 2e-57, (45.5% identity in 391 aa overlap); Q9ZCF7|NUOH_RICPR|RP796 from Rickettsia prowazekii (339 aa), FASTA scores: opt: 976, E(): 3.4e-51, (46.2% identity in 329 aa overlap); etc. Contains respiratory-chain NADH dehydrogenase subunit 1 signature 2 (PS00668). Some similarity to MTCY251.02 (FASTA score: E(): 1.2e-07). BELONGS TO THE COMPLEX I SUBUNIT 1 FAMILY. Protein product from Mb3176 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3176 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65562" /db_xref="InterPro:IPR001694" /db_xref="InterPro:IPR018086" /db_xref="UniProtKB/Swiss-Prot:P65562" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01803.1" /translation="MTTFGHDTWWLVAAKAIAVFVFLMLTVLVAILAERKLLGRMQLR PGPNRVGPKGALQSLADGIKLALKESITPGGIDRFVYFVAPIISVIPAFTAFAFIPFG PEVSVFGHRTPLQITDLPVAVLFILGLSAIGVYGIVLGGWASGSTYPLLGGVRSTAQV ISYEVAMGLSFATVFLMAGTMSTSQIVAAQDGVWYAFLLLPSFVIYLISMVGETNRAP FDLPEAEGELVAGFHTEYSSLKFAMFMLAEYVNMTTVSALAATLFFGGWHAPWPLNMW ASANTGWWPLIWFTAKVWGFLFIYFWLRATLPRLRYDQFMALGWKLLIPVSLVWVMVA AIIRSLRNQGYQYWTPTLVFSSIVVAAAMVLLLRKPLSAPGARASARQRGDEGTSPEP AFPTPPLLAGATKENAGG" CDS 3480028..3480663 /codon_start=1 /transl_table=11 /gene="nuoI" /locus_tag="BQ2027_MB3177" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN I) NUOI (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN I)" /note="Mb3177, nuoI, len: 211 aa. Equivalent to Rv3153, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 211 aa overlap). Probable nuoI, NADH dehydrogenase I, chain I (EC 1.6.5.3), similar to others e.g. Q9XAR2|NUOI from Streptomyces coelicolor (211 aa), FASTA scores: opt: 825, E(): 9.3e-44, (70.1% identity in 164 aa overlap); Q56224|NQO9_THETH from Thermus aquaticus (subsp. thermophilus) (182 aa), FASTA scores: opt: 543, E(): 1.8e-26, (50.9% identity in 163 aa overlap); Q9RU95|DR1497 from Deinococcus radiodurans (178 aa), FASTA scores: opt: 527, E(): 1.7e-25, (48.75% identity in 162 aa overlap); etc. Contains two 4Fe-4S ferredoxins, iron-sulfur binding region signatures (PS00198). BELONGS TO THE COMPLEX I 23 KDA SUBUNIT FAMILY. THE IRON-SULFUR CENTERS ARE SIMILAR TO THOSE OF 'BACTERIAL-TYPE' 4FE-4S FERREDOXINS. COFACTOR: BINDS TWO 4FE-4S CLUSTERS. Protein product from Mb3177 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3177 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TX57" /db_xref="InterPro:IPR010226" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR017900" /db_xref="UniProtKB/Swiss-Prot:Q7TX57" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01804.1" /translation="MANTDRPALPHKRAVPPSRADSGPRRRRTKLLDAVAGFGVTLGS MFKKTVTEEYPERPGPVAARYHGRHQLNRYPDGLEKCIGCELCAWACPADAIYVEGAD NTEEERFSPGERYGRVYQINYLRCIGCGLCIEACPTRALTMTYDYELADDNRADLIYE KDRLLAPLLPEMAAPPHPRAPGATDKDYYLGNVTAEGLRGVRESQTTGDSR" CDS 3480660..3481448 /codon_start=1 /transl_table=11 /gene="nuoJ" /locus_tag="BQ2027_MB3178" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN J) NUOJ (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN J)" /note="Mb3178, nuoJ, len: 262 aa. Equivalent to Rv3154, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Probable nuoJ, transmembrane NADH dehydrogenase I, chain J (EC 1.6.5.3), similar to others e.g. Q9XAR3|NUOJ from Streptomyces coelicolor (285 aa), FASTA scores: opt: 991, E(): 3.2e-52, (63.7% identity in 243 aa overlap); Q9JX90|NUOJ|NMA0006 from Neisseria meningitidis (serogroup A) (223 aa), FASTA scores: opt: 329, E(): 9.6e-13, (34.85% identity in 175 aa overlap); Q9K1B2|NMB0253 from Neisseria meningitidis (serogroup B) (223 aa), FASTA scores: opt: 326, E(): 1.5e-12, (34.85% identity in 175 aa overlap); etc. But also similarity with Q00243|NU6C_PLEBO|NDH6 NADH-PLASTOQUINONE OXIDOREDUCTASE CHAIN 6 HOMOLOG (EC 1.6.5.3) (CATALYTIC ACTIVITY: NADH + PLASTOQUINONE = NAD(+) + PLASTOQUINOL) from Plectonema boryanum (199 aa), FASTA scores: opt: 287, E(): 2.8e-10, (34.35% identity in 195 aa overlap). SIMILAR TO POLYPEPTIDE 6 OF THE NADH-UBIQUINOL OXIDOREDUCTASE OF CHLOROPLASTS OR MITOCHONDRIA. Protein product from Mb3178 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3178 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y544" /db_xref="InterPro:IPR001457" /db_xref="InterPro:IPR042106" /db_xref="UniProtKB/TrEMBL:A0A1R3Y544" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01805.1" /translation="MTAVLASDVIVRTSTGEAVMFWVLSALALLGAVGVVLAVNAVYS AMFLAMTMIILAVFYMAQDALFLGVVQVVVYTGAVMMLFLFVLMLIGVDSAESLKETL RGQRVAAVLTGVGFGVLLISTIGQVATRGFAGLTVANANGNVEGLAALIFSRYLWAFE LTSALLITAAVGAMVLAHRERFERRKTQRELSQERFRPGGHPTPLPNPGVYARHNAVD VAALLPDGSYSELSVPRMLRTRGADGLQTPSPGAVSGSLEGGAS" CDS 3481445..3481744 /codon_start=1 /transl_table=11 /gene="nuoK" /locus_tag="BQ2027_MB3179" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN K) NUOK (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN K)" /note="Mb3179, nuoK, len: 99 aa. Equivalent to Rv3155, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Probable nuoK, integral membrane NADH dehydrogenase I, chain K (EC 1.6.5.3), similar to others e.g. Q9XAR4|NUOK from Streptomyces coelicolor (99 aa), FASTA scores: opt: 509, E(): 2.7e-31, (78.55% identity in 98 aa overlap); Q56226|NQOB_THETH|NQO11 from Thermus aquaticus (subsp. thermophilus) (95 aa), BLAST scores: initn: 298, init1: 180, bits: 85.7, FASTA scores: opt: 313, E(): 9.4e-17, (53.7% identity in 95 aa overlap); Q9RU97|DR1495 from Deinococcus radiodurans (103 aa), FASTA scores: opt: 309, E(): 2e-16, (52.0% identity in 100 aa overlap); etc. But also similarity with NADH-PLASTOQUINONE OXIDOREDUCTASES CHAIN 4L e.g. Q9MUL4|NULC_MESVI|NDHE from Mesostigma viride (EC 1.6.5.3) (CATALYTIC ACTIVITY: NADH + PLASTOQUINONE = NAD(+) + PLASTOQUINOL) (101 aa), FASTA scores: opt: 280, E(): 2.8e-14, (40.6% identity in 101 aa overlap); and P06261|NULC_TOBAC|NDHE|NDH4L from Nicotiana tabacum (Common tobacco) (101 aa), FASTA scores: opt: 259, E(): 1e-12, (43.0% identity in 93 aa overlap). SIMILAR TO POLYPEPTIDE 4L OF THE NADH-UBIQUINOL OXIDOREDUCTASE OF CHLOROPLASTS OR MITOCHONDRIA. Protein product from Mb3179 detected using shotgun mass spectrometry. Mb3179 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65566" /db_xref="InterPro:IPR001133" /db_xref="InterPro:IPR039428" /db_xref="UniProtKB/Swiss-Prot:P65566" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01806.1" /translation="MNPANYLYLSVLLFTIGASGVLLRRNAIVMFMCVELMLNAVNLA FVTFARMHGHLDAQMIAFFTMVVAACEVVVGLAIIMTIFRTRKSASVDDANLLKG" CDS 3481755..3483656 /codon_start=1 /transl_table=11 /gene="nuoL" /locus_tag="BQ2027_MB3180" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN L) NUOL (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN L)" /note="Mb3180, nuoL, len: 633 aa. Equivalent to Rv3156, len: 633 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 633 aa overlap). Probable nuoL, integral membrane NADH dehydrogenase I, chain L (EC 1.6.5.3), similar to others e.g. Q9XAR5|NUOL_STRCO from Streptomyces coelicolor (654 aa), FASTA scores: opt: 2074, E(): 1.1e-111, (61.1% identity in 648 aa overlap); Q56227|NQOC_THETH|NQO12 from Thermus aquaticus (subsp. thermophilus) (606 aa), FASTA scores: opt: 1420, E(): 3.8e-74, (43.35% identity in 630 aa overlap); Q9ZJV6|NUOL|JHP1192 from Helicobacter pylori J99 (Campylobacter pylori J99) (612 aa), FASTA scores: opt: 1279, E(): 4.7e-66, (41.65% identity in 516 aa overlap); etc. Also similar to MTCY251.04 (FASTA score: E(): 1.3e-11) and MTCY03A2.01c (FASTA score: E(): 2.3e-10). SIMILAR TO POLYPEPTIDE 5 OF THE NADH-UBIQUINOL OXIDOREDUCTASE OF CHLOROPLASTS OR MITOCHONDRIAL. Protein product from Mb3180 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3180 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3I0" /db_xref="InterPro:IPR001516" /db_xref="InterPro:IPR001750" /db_xref="InterPro:IPR003945" /db_xref="InterPro:IPR018393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01807.1" /translation="MTTSLGTHYTWLLVALPLAGAAILLFGGRRTDAWGHLLGCAAAL AAFGVGAMLLADMLGRDGLERAIHQQVFTWIPAGGLQVDFGLQIDQLSMCFVLLISGV GSLIHIYSVGYMAEDPDRRRFFGYLNLFLASMLLLVVADNYVLLYVGWEGVGLASYLL IGFWYHKPSAATAAKKAFVMNRVGDAGLAVGMFLTFSTFGTLSYAGVFAGVPAASRAV LTAIGLLMLLGACAKSAQVPLQAWLGDAMEGPTPVSALIHAATMVTAGVYLIVRSGPL YNLAPTAQLAVVIVGAVTLLYGAIIGCAKDDIKRALAASTISQIGYMVLAAGLGPAGY AFAIMHLLTHGFFKAGLFLGSGAVIHAMHEEQDMRRYGGLRAALPVTFATFGLAYLAI IGVPPFAGFFSKDAIIEAALGAGGIRGSLLGGAALLGAGVTAFYMTRVMLMTFFGEKR WTPGAHPHEAPAVMTWPMILLAVGSVFSGGLLAVGGTLRHWLQPVVGSHEEATHALPT WVATTLALGVVAVGIAVAYRMYGTAPIPRVAPVRVSALTAAARADLYGDAFNEEVFMR PGAQLTNAVVAVDDAGVDGSVNALATLVSQTSNRLRQMQTGFARNYALSMLVGAVLVA AALLVVQLW" CDS 3483653..3485314 /codon_start=1 /transl_table=11 /gene="nuoM" /locus_tag="BQ2027_MB3181" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN M) NUOK (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN M)" /note="Mb3181, nuoM, len: 553 aa. Equivalent to Rv3157, len: 553 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 553 aa overlap). Probable nuoM, integral membrane NADH dehydrogenase I, chain M (EC 1.6.5.3), similar to others e.g. Q9XAR6|NUOM from Streptomyces coelicolor (523 aa), FASTA scores: opt: 1621, E(): 4.2e-89, (56.55% identity in 541 aa overlap); P50974|NUOM_RHOCA|NUOM from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (512 aa), FASTA scores: opt: 996, E(): 6.5e-52, (38.2% identity in 521 aa overlap); P29925|NQOD_PARDE|NQO13 from Paracoccus denitrificans (513 aa), FASTA scores: opt: 987, E(): 2.2e-51, (37.05% identity in 540 aa overlap); etc. Also similar to MTCY251.04 (FASTA score: E(): 3.3e-16) and MTCY03A2.02c (FASTA score: E(): 9.6e-13). SIMILAR TO POLYPEPTIDE 4 OF THE NADH-UBIQUINOL OXIDOREDUCTASE OF CHLOROPLASTS OR MITOCHONDRIAL. Protein product from Mb3181 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3181 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M4" /db_xref="InterPro:IPR001750" /db_xref="InterPro:IPR003918" /db_xref="InterPro:IPR010227" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01808.1" /translation="MNNVPWLSVLWLVPLAGAVLIILLPPGRRRLAKWAGMVVSVLTL AVSIVVAAEFKPSAEPYQFVEKHSWIPAFGAGYTLGVDGIAVVLVLLTTVLIPLLLVA GWNDATDADDLSPASGRYPQRPAPPRLRSSGGERTRGVHAYVALTLAIESMVLMSVIA LDVLLFYVFFEAMLIPMYFLIGGFGQGAGRSRAAVKFLLYNLFGGLIMLAAVIGLYVV TAQYDSGTFDFREIVAGVAAGRYGADPAVFKALFLGFMFAFAIKAPLWPFHRWLPDAA VESTPATAVLMMAVMDKVGTFGMLRYCLQLFPDPSTYFRPLIVTLAIIGVIYGAIVAI GQTDMMRLIAYTSISHFGFIIAGIFVMTTQGQSGSTLYMLNHGLSTAAVFLIAGFLIA RRDSRSIADYGGVQKVAPILAGTFMVSAMATVSLPGLAPFISEFLVLLGTFSRYWLAA AFGVTALVLSAVYMLWLYQRVMTGPIAEGNERIGDLVGREMIVVAPLIALLLVLGVYP KPVLDIINPAVENTMTTIGQHDPAPSVAHPVPAVGASRTAEGPHP" CDS 3485311..3486906 /codon_start=1 /transl_table=11 /gene="nuoN" /locus_tag="BQ2027_MB3182" /product="PROBABLE NADH DEHYDROGENASE I (CHAIN N) NUON (NADH-UBIQUINONE OXIDOREDUCTASE CHAIN N)" /note="Mb3182, nuoN, len: 531 aa. Equivalent to Rv3158, len: 531 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 531 aa overlap). Probable nuoN, integral membrane NADH dehydrogenase I, chain N (EC 1.6.5.3), similar to others e.g. Q9XAR7|SC10A7.08c from Streptomyces coelicolor (552 aa), FASTA scores: opt: 1493, E(): 1.1e-81, (56.7% identity in 543 aa overlap); Q9PGI2|XF0318 from Xylella fastidiosa (485 aa), FASTA scores: opt: 942, E(): 7.4e-49, (39.6% identity in 379 aa overlap); CAB51628|NUON2 from Rhizobium meliloti (Sinorhizobium meliloti) (479 aa), FASTA scores: opt: 934, E(): 2.2e-48, (35.5% identity in 479 aa overlap); etc. But also similarity with NADH-PLASTOQUINONE OXIDOREDUCTASES CHAIN 4L (EC 1.6.5.3) (CATALYTIC ACTIVITY: NADH + PLASTOQUINONE = NAD(+) + PLASTOQUINOL) e.g. P29801|NU2C_SYNP7|NDHB from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (521 aa), FASTA scores: opt: 921, E(): 1.4e-47, (40.25% identity in 395 aa overlap). BELONGS TO THE COMPLEX I SUBUNIT 2 FAMILY. Protein product from Mb3182 detected using SWATH mass spectrometry. Mb3182 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5M1" /db_xref="InterPro:IPR001750" /db_xref="InterPro:IPR010096" /db_xref="UniProtKB/Swiss-Prot:P0A5M1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01809.1" /translation="MILPAPHVEYFLLAPMLIVFSVAVAGVLAEAFLPRRWRYGAQVT LALGGSAVALIAVIVVARSIHGSGHAAVLGAIAVDRATLFLQGTVLLVTIMAVVFMAE RSARVSPQRQNTLAVARLPGLDSFTPQASAVPGSDAERQAERAGATQTELFPLAMLSV GGMMVFPASNDLLTMFVALEVLSLPLYLMCGLARNRRLLSQEAAMKYFLLGAFSSAFF LYGVALLYGATGTLTLPGIRDALAARTDDSMALAGVALLAVGLLFKVGAVPFHSWIPD VYQGAPTPITGFMAAATKVAAFGALLRVVYVALPPLHDQWRPVLWAIAILTMTVGTVT AVNQTNVKRMLAYSSVAHVGFILTGVIADNPAGLSATLFYLVAYSFSTMGAFAIVGLV RGADGSAGSEDADLSHWAGLGQRSPIVGVMLSMFLLAFAGIPLTSGFVSKFAVFRAAA SAGAVPLVIVGVISSGVAAYFYVRVIVSMFFTEESGDTPHVAAPGVLSKAAIAVCTVV TVVLGIAPQPVLDLADQAAQLLR" CDS complement(3486912..3488681) /codon_start=1 /transl_table=11 /gene="PPE53" /locus_tag="BQ2027_MB3183C" /product="ppe family protein ppe53" /note="Mb3183c, PPE53, len: 589 aa. Equivalent to Rv3159c, len: 590 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 590 aa overlap). Member of the Mycobacterium tuberculosis PPE_family of Gly-, Asn-rich proteins. Highly similar to P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 2289, E(): 3.2e-98, (63.5% identity in 600 aa overlap); and also similar to MTCY48_17, MTV041_29, MTCY6G11_5, MTCY98_24, etc. TBparse score is 0.921. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, albeit a 2143 bp insertion occurs overlapping the NH2-terminal part, this leads to an equivalent product, compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb3183c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01810.1" /translation="MNYSVLPPEINSLRMFTGAGSAPMLAASVAWDRLAAELAVAASS FGSVTSGLAGQSWQGAAAAMAAAAAPYAGWLAAAAARAAGASAQAKAVASAFEAARAA TVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAEAMYEQMWAADVAAMVGYHGGASAA AAQLSSWSIGLQQALPAAPSALAAAIGLGNIGVGNLGGGNTGEYNLGSGNSGNANVGS GNSGNANVGSGNDGATNLGSGNIGNTNLGSGNVGNVNLGSGNRGFGNLGNGNFGSGNL GSGNTGSTNFGGGNLGSFNLGSGNIGSSNIGFGNNGDNNLGLGNNGNNNIGFGLTGDN LVGIGALNSGIGNLGFGNSGNNNIGFFNSGNNNVGFFNSGNNNFGFGNAGDINTGFGN AGDTNTGFGNAGFFNMGIGNAGNEDMGVGNGGSFNVGVGNAGNQSVGFGNAGTLNVGF ANAGSINTGFANSGSINTGGFDSGDRNTGFGSSVDQSVSSSGFGNTGMNSSGFFNTGN VSAGYGNNGDVQSGINNTNSGGFNVGFYNSGAGTVGIANSGLQTTGIANSGTLNTGVA NTGDHSSGGFNQGSDQSGFFGQP" CDS complement(3488929..3490827) /codon_start=1 /transl_table=11 /gene="PPE70" /locus_tag="BQ2027_MB3184C" /product="PPE FAMILY PROTEIN" /note="Mb3184c, PPE70, len: 685 aa. Equivalent to MT3248, len: 686 aa, from Mycobacterium tuberculosis strain CDC1551, (99.708% identity in 686 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an insertion of 2143 bp exists between PPE53 and Rv3160c compared to Mycobacterium tuberculosis strain H37Rv. This leads to a additional gene, PPE70 equivalent to MT3248 from Mycobacterium tuberculosis strain CDC1551. Mb3184c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3A8" /protein_id="SIU01811.1" /translation="MNYSVLPPEINSLRMFTGAGSAPMLAASVAWDGLAAELAVAASS FGSVTSGLAGQSWQGAAAAMAAAAAPYAGWLAAAAARAAGASAQAKAVASAFEAARAA TVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAEAMYEQMWAADVAAMVGYHGGASAA AAALPSWQQALRGLPGLGQVASAISGGAASMFAAPAAATAAVTPPALNTGLGNIGSWN LGGGNVGLLNLGSGNFGSLNLGGGNTGNANLGGGNWGFANLGSGNIGNTNFGNGNQGN LNFGSGNLLGNGNFGFGNAFGDGNLGSGNVGSTNLGSGNFGSFNVGSGNMGMSNIGFG NLGNNNLGFGNNGNNNIGFGLTGDNLVGIGALNSGIGNMGFGNSGNNNIGFFNSGNGN VGFFNSGDGNTGFGNAGDVNTGFWNGGPFNTGFGNGGNTNFGFGNAGFQNMGHGNAGG VNVGSGNAGLANTGDFNSGGVVSGIGGNTGSFNSGNLNTGFGNAGDLNTGLFNSGDVN TGIGSTVDQPGSVSGFGNTGTSVSGFNNSGNLTSGFGNMNSNVFDSTSGFQNIGDANV GFFNSGNSNEGFFNTGMFNNGIYNSGVASTGIANSGNASSGVANSGDNSSGAFNQGDN QAGFFGQP" CDS complement(3491002..3491643) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3185C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3185c, -, len: 213 aa. Equivalent to Rv3160c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Possible transcriptional regulator, with some similarity to others e.g. Q9S3L4|AMTR AMTR PROTEIN (global repressor in the nitrogen regulation system; see first citation below for more information) (222 aa), FASTA scores: opt: 182, E(): 7.3e-05, (27.9% identity in 208 aa overlap); Q9X7X9|SC6A5.33c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (223 aa), FASTA scores: opt: 176, E(): 0.00018, (26.5% identity in 185 aa overlap); Q9XA31|SCH69.03c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (209 aa), FASTA scores: opt: 173, E(): 0.00027, (27.25% identity in 176 aa overlap); BAB54133|MLL7734 TRANSCRIPTIONAL REGULATOR from Rhizobium loti (Mesorhizobium loti) (213 aa), FASTA scores: opt: 172, E(): 0.00031, (23.55% identity in 204 aa overlap); etc. Also similar to hypothetical proteins from Mycobacterium tuberculosis strain H37Rv e.g. P96839|Rv3557v|MTCY06G11.04c (200 aa), FASTA scores: opt: 169, E(): 0.00046, (26.75% identity in 157 aa overlap). Contains probable helix-turn-helix motif from aa 31 to 52 (Score 1857, +5.51 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.901. Protein product from Mb3185c detected using SWATH mass spectrometry. Mb3185c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y394" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y394" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01812.1" /translation="MPRQAGRWSPTALRILGAAAELIALRGYSSTSTRDIAAAVGVEQ PAIYKHFSAKRDILAALVRLAVEWPLELFGHITAMPVPAVVKLHRWLTESLDHLHASP YVLVSILITPDLHQESFVAERELVAEMERALVGLIETGQGEGDVRAMHPLSAARLVQA LFDALALPEFAVSPDEIVEFAMTALLSDPDRLAEIRAAADALEIQTAPPDRGL" CDS complement(3491654..3492715) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3186C" /product="POSSIBLE DIOXYGENASE" /note="Mb3186c, -, len: 382 aa. Equivalent to Rv3161c, len: 382 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 382 aa overlap). Possible dioxygenase (EC 1.-.-.-), similar to subunit of several dioxygenases and related proteins e.g. BAB50510|MLR3662 DIOXYGENASE, ALPHA SUBUNIT from Rhizobium loti (Mesorhizobium loti) (400 aa), FASTA scores: opt: 413, E(): 6.2e-20, (28.4% identity in 331 aa overlap); Q9A3T0|CC3122 RIESKE 2FE-2S FAMILY PROTEIN from Caulobacter crescentus (404 aa), FASTA scores: opt: 405, E(): 2.1e-19, (27.95% identity in 372 aa overlap); Q9HTF4|PA5410 PROBABLE RING HYDROXYLATING DIOXYGENASE, ALPHA-SUBUNIT from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 392, E(): 1.6e-18, (25.8% identity in 399 aa overlap); Q9AGK6|PHTAA PHTHALATE DIOXYGENASE LARGE SUBUNIT from Arthrobacter keyseri (473 aa), FASTA scores: opt: 385, E(): 5.2e-18, (34.0% identity in 206 aa overlap); P76253|YEAW_ECOLI PUTATIVE DIOXYGENASE, ALPHA SUBUNIT from Escherichia coli (374 aa), FASTA scores: opt: 376, E(): 1.7e-17, (27.05% identity in 344 aa overlap); etc. TBparse score is 0.932. Protein product from Mb3186c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3186c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3A6" /db_xref="InterPro:IPR001663" /db_xref="InterPro:IPR015879" /db_xref="InterPro:IPR017941" /db_xref="InterPro:IPR036922" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3A6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01813.1" /translation="MPPAAYTSSELWQLERERIFNRSWMLVAHVDQLAKTGDYVTVSV AGEPVMVVRDVDGQLHALSPICRHRLMLMVEPGAGRIDTLTCQYHLWRYGLDGRLRGA PHMAANLDFNRRECRLPQFAVATWNGLVWINLDADAEPIAAHLDLTDDEFAGYRLGEM VQVESWSHEWRANWKVAAENGHENYHVLGLHRQTLEPFVPGGGDLDVRQYSRWALRLR VPFTVPVEAKSLQLNEVQKSNLVVLWTFPNSALAIAGERVVWFGFIPQSIDRVQVLGG VLTTPELAADAAATAQTSQFVMAMINDEDRLGLEAVQVGAGSRFAERGHLSSKEWPGM LAFYRNLAMALVGDHPGAS" CDS complement(3492718..3493308) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3187C" /product="POSSIBLE INTEGRAL MEMBRANE PROTEIN" /note="Mb3187c, -, len: 196 aa. Similar to 5' end of Rv3162c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (86.7% identity in 98 aa overlap). Possible integral membrane protein, with some similarity to C-terminal part of Q10803|Rv2877c|MTCY274.08c hypothetical protein from Mycobacterium tuberculosis (287 aa), FASTA scores: opt: 112, E(): 6.9, (29.65% identity in 135 aa overlap); and other hypothetical proteins from other organisms. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (t-*), leads to a longer product with different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb3187c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3B5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B5" /protein_id="SIU01814.1" /translation="MTSFAHPGTRGLSTVFGLMMVGSAAVGSHGLAVVVGLAAVIAVG VAAVFRLAATLAVVLSVVMIVVSGPTHVLAALSGFAPPSTWCADTGPVLSPGAGRRPL PPLVSRSLGWLRRRSRCKCHGCRWRHRWPCWLPTCWPPVRSRGEPPAGRSGRCALLTI GYQSITWRVCYQLITEPSSETSLPTSGITSTTIHRR" CDS complement(3493305..3494576) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3188C" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb3188c, -, len: 423 aa. Equivalent to Rv3163c, len: 423 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 423 aa overlap). Possible conserved secreted protein, with some similarity to other hypothetical bacterial proteins e.g. Q9Z539|SC9B2.20c from Streptomyces coelicolor (460 aa), FASTA scores: opt: 666, E(): 1.5e-33, (33.55% identity in 417 aa overlap); O58486|PH0774 from Pyrococcus horikoshii (410 aa), FASTA scores: opt: 329, E(): 6.9e-13, (23.8% identity in 424 aa overlap); Q9UZ66|PAB0849 from Pyrococcus abyssi (410 aa), FASTA scores: opt: 322, E(): 1.9e-12, (24.15% identity in 389 aa overlap); etc. Also some similarity with P71761|Rv1480|MTV007.27|MTCY277.01 from Mycobacterium tuberculosis (317 aa), FASTA scores: opt: 198, E(): 6.3e-05, (26.75% identity in 269 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Mb3188c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002881" /db_xref="UniProtKB/TrEMBL:A0A1R3Y546" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01815.1" /translation="MIQTCEVELRWRASQLTLAIATCAGVALAAAVVAGRWQLIAFAA PLLGVLCSISWQRPVPVIQVHGDPDSQRCFENEHVRVTVWVTTESVDAAVELTVSALA GMQFEALESVSRRTTTVSAVAQRWGRYPIRARVAVVARGGLLMGAGTVDAAEIVVFPL TPPQSTPLPQTELLDRLGAHLTRHVGPGVEYADIRPYVPGDQLRAVNWVVSARRGRLH VTRRLTDRAADVVVLIDMYRQPAGPATEATERVVRGAAQVVQTALRNGDRAGIVALGG NRPRWLGADIGQRQFYRVLDTVLGAGEGFENTTGTLAPRAAVPAGAVVIAFSTLLDTE FALALIELRKRGHVVVAVDVLDSCPLQDQLDPLVVRMWALQRSAMYRDMATIGVDVLS WPADHSLQQSMGALPNRRRRGRGRASRARLP" CDS complement(3494606..3495568) /codon_start=1 /transl_table=11 /gene="moxR3" /locus_tag="BQ2027_MB3189C" /product="PROBABLE METHANOL DEHYDROGENASE TRANSCRIPTIONAL REGULATORY PROTEIN MOXR3" /note="Mb3189c, moxR3, len: 320 aa. Equivalent to Rv3164c, len: 320 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 320 aa overlap). Probable moxR3, methanol dehydrogenase regulatory protein, highly similar to Q9Z538|SC9B2.21c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (332 aa), FASTA scores: opt: 1227, E(): 1.7e-67, (60.25% identity in 302 aa overlap); Q9UZ67|MOXR-3|PAB0848 METHANOL DEHYDROGENASE REGULATORY PROTEIN from Pyrococcus abyssi (314 aa), FASTA scores: opt: 1126, E(): 2.3e-61, (54.1% identity in 305 aa overlap); Q9HSH7|MOXR|VNG0223G METHANOL DEHYDROGENASE REGULATORY PROTEIN from Halobacterium sp. strain NRC-1 (318 aa), FASTA scores: opt: 1072, E(): 4.5e-58, (51.45% identity in 315 aa overlap); Q9RVV4|DR0918 MOXR-RELATED PROTEIN from Deinococcus radiodurans (354 aa), FASTA scores: opt: 1000, E(): 1.2e-53, (50.95% identity in 318 aa overlap); etc. Also high similarity with several hypothetical bacterial proteins. Protein product from Mb3189c detected using SWATH mass spectrometry. Mb3189c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3Z2" /db_xref="InterPro:IPR011703" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041628" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z2" /protein_id="SIU01816.1" /translation="MIMPAATTTAHCEAVLDEIGRVVVGKRSALTLILTAVLARGHVL IEDLPGLGKTLIARSFAAALGLDFTRVQFTPDLLPADLLGSTIYDMQSGRFEFRAGPI FTNLLLADEINRTPPKTQAALLEAMAEGQVSIDGQTHKLAMPFIVLATDNPIEYEGTY PLPEAQLDRFAIRLELRYLSERDETSMLRRRLERGSADPTVNQVVDCHDLLAMRESVE QVTVHEDVLHYVVSLANATRHHPQVAVGASPRAELDLVQLSRARALLLGRDYVIPEDV KELATAAVAHRITLRPEMWVRKIAGADVVSELLRRLPVPRISGT" CDS complement(3495576..3496058) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3190C" /product="unknown protein" /note="Mb3190c, -, len: 160 aa. Equivalent to Rv3165c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Hypothetical unknown protein. Mb3190c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3I7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I7" /protein_id="SIU01817.1" /translation="MKRLIALGIFLIVGIELLALILHDRRLVLAGSGLALALVLLNVR RMLGNRDELTAAPDSDDLGEGLRRWLSNTETTIRWSESTRADWDRHLRPMLARRFEIA TGHRQAKDPVAFAATGRMLFGDELWEWVNPNNVTHTGDRQPGPGRAALEEILQKLEQV " CDS complement(3496055..3497014) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3191C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3191c, -, len: 319 aa. Equivalent to Rv3166c, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 319 aa overlap). Probable transmembrane protein, similar but longer (52 aa) to O32895|MLCB1779.35c hypothetical protein from Mycobacterium leprae (119 aa), FASTA scores: opt: 289, E(): 3.7e-10, (44.25% identity in 122 aa overlap). Also some similarity to Q9Z536|SC9B2.23c PUTATIVE TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (339 aa), FASTA scores: opt: 247, E(): 2.5e-07, (28.2% identity in 326 aa overlap); and in N-terminus to Q9RS20|DR2307 PUTATIVE MULTIDRUG-EFFLUX TRANSPORTER from Deinococcus radiodurans (410 aa), FASTA scores: opt: 135,E(): 1, (32.35% identity in 136 aa overlap). Mb3191c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3N5" /db_xref="InterPro:IPR025403" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3N5" /protein_id="SIU01818.1" /translation="MPGTKPGSDKPTGRVVVVIVLLMLAGAALRGHLPADDGAPLAAA GGSRAALMFIVAALAATLALIALAIITRLRHPLPVAPSAGELSAMLGGAAGRPNWRVL LLGLGTILAWLLIAILLARLFVPDDVGPAAPIPDSTATPDASSTTPSRPQPPQDNNDD VLGILFASTIGLFLMVVAGSLITSRRQRKSAPARISGDRIESPAPSARSESLARAAEI GLAEMADLRREPREAIIACYVAMERELSHVPGVAPQDFDTPTEVLARAVEHRALHGAS AAALVSLFAEARFSPHVMNEEHREVAMRLLRLVLDELSTRTAI" CDS complement(3497094..3497720) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3192C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3192c, -, len: 208 aa. Equivalent to Rv3167c, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 208 aa overlap). Probable transcriptional regulator, tetR family, similar to several transcriptional regulators e.g. Q9L2A4|SC8F4.22c (TETR/ACRR FAMILY) from Streptomyces coelicolor (234 aa), FASTA scores: opt: 317, E(): 7.5e-13, (33.35% identity in 210 aa overlap); Q9RK47|SCF12.11 (TETR/ACRR FAMILY) from Streptomyces coelicolor (206 aa), FASTA scores: opt: 293, E(): 2.1e-11, (32.65% identity in 199 aa overlap); Q54288 REGULATOR OF ANTIBIOTIC TRANSPORT COMPLEXES (TETR/ACRR FAMILY) (204 aa), FASTA scores: opt: 260, E(): 2.4e-09, (30.75% identity in 205 aa overlap); etc. Equivalent to AAK47595 from Mycobacterium tuberculosis strain CDC1551 but shorter 21 aa. Contains probable helix-turn-helix motif from aa 42 to 63 (Score 1727, +5.07 SD). MAY BE BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /db_xref="GOA:A0A1R3Y3A3" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR011075" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3A3" /protein_id="SIU01819.1" /translation="MKADLPSLDKAPGAGRPRDPRIDSAILSATAELLVQIGYSNLSL AAVAERAGTTKSALYRRWSSKAELVHEAAFPAAPTALQAAAGDIAADIRMMIAATRDV FTTPVVRAALPGLVADMTADAELNARVLARFADLFAAVRMRLREAVDRGEAHPDVDPD RLIELIGGATMLRMLLYPDDMLDDAWVDQTTAIVVRGVHRAAPGGSVV" CDS 3497765..3498901 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3193" /product="putative aminoglycoside phosphotransferase" /note="Mb3193, -, len: 378 aa. Equivalent to Rv3168, len: 378 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 378 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. Q9M7Y6|F3E22.6 from Arabidopsis thaliana (Mouse-ear cress) (314 aa), FASTA scores: opt: 236, E(): 1.1e-07, (27.35% identity in 234 aa overlap); Q9RYW2|DRA0194 from Deinococcus radiodurans (386 aa), FASTA scores: opt: 207, E(): 9.1e-06, (23.45% identity in 320 aa overlap); etc. Also some similarity with O69727|Rc3761c|MTV025.109c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (351 aa), FASTA scores: opt: 193, E(): 6.4e-05, (29.4% identity in 242 aa overlap). Protein product from Mb3193 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3C4" /db_xref="InterPro:IPR002575" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR041726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C4" /protein_id="SIU01820.1" /translation="MANEPAIGAIDRLQRSSRDVTTLPAVISRWLSSVLPGGAAPEVT VESGVDSTGMSSETIILTARRQQDGRSIQQKLVARVAPAAEDVPVFPTYRLDHQFEVI RLVGELTDVPVPRVRWIETTGDVLGTPFFLMDYVEGVVPPDVMPYTFGDNWFADAPAE RQRQLQDATVAALATLHSIPNAQNTFSFLTQGRTSDTTLHRHFNWVRSWYDFAVEGIG RSPLLERTFEWLQSHWPDDAAAREPVLLWGDARVGNVLYRDFQPVAVLDWEMVALGPR ELDVAWMIFAHRVFQELAGLATLPGLPEVMREDDVRATYQALTGVELGDLHWFYVYSG VMWACVFMRTGARRVHFGEIEKPDDVESLFYHAGLMKHLLGEEH" CDS 3498901..3500025 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3194" /product="conserved protein" /note="Mb3194, -, len: 374 aa. Equivalent to Rv3169, len: 374 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 374 aa overlap). Conserved hypothetical protein, with similarity to other hypothetical proteins: Q9A8W6|CC1232 from Caulobacter crescentus (368 aa), FASTA scores: opt: 669, E(): 3.3e-34, (34.05% identity in 376 aa overlap); and O32901|MLCB1779.41 from Mycobacterium leprae (127 aa), FASTA scores: opt: 179, E(): 0.00034, (29.0% identity in 131 aa overlap). Also weak similarity with P95149|Rv1866|MTCY359.07c (804 aa), FASTA scores: opt: 121, E(): 6.4, (37.0% identity in 119 aa overlap). Equivalent to AAK47597 from Mycobacterium tuberculosis strain CDC1551 but shorter 43 aa. Protein product from Mb3194 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3194 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B6" /protein_id="SIU01821.1" /translation="MPQMLGPLDEYPLHQLPQPIAWPGSSDRNFYDRSYFNAHDRTGN IFLITGIGYYPNLGVKDAFVLIRRADIQTAVHLSDAIDSDRLHQHVNGYRVEVVEPLR KLRIVLDETEGVAADLTWEGLFDVVQEQPHVLRSGNRVTLDAQRFAQLGTWSGRIVVD GERIAVDPATWLGSRDRSWGIRPVGEPEPAGRPADPPFEGMWWLYVPLAFDDFAVVLI IQEEPDGFRSLNDCTRIWRDGHVEQLGWPRVRIHYRSGTRIPTGATIEASTPDGAPVH FDVESKLAVPTHVGGGYGGDSDWSHGMWKGEKFVERRTYDMTDPTIIARAGFGVIDHV GRALCRDGDGNPVQGWGLFEHGALGRHDPSGFADWSTLAP" CDS 3500168..3501514 /codon_start=1 /transl_table=11 /gene="aofH" /locus_tag="BQ2027_MB3195" /product="PROBABLE FLAVIN-CONTAINING MONOAMINE OXIDASE AOFH (AMINE OXIDASE) (MAO)" /note="Mb3195, aofH, len: 448 aa. Equivalent to Rv3170, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 448 aa overlap). Probable aofH, flavin-containing (mono)amine oxidase (EC 1.4.3.4), similar to many eukaryotic monoamine oxidases e.g. P49253|AOF_ONCMY from Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) (522 aa), FASTA scores: opt: 869, E(): 5.3e-44, (37.7% identity in 448 aa overlap); P21396|AOFA_RAT|MAOA from Rattus norvegicus (Rat) (526 aa), FASTA scores: opt: 839, E(): 3.2e-42, (37.45% identity in 446 aa overlap); Q99NA8|MAO-A from Cavia porcellus (Guinea pig) (506 aa), FASTA scores: opt: 836, E(): 4.6e-42, (37.0% identity in 446 aa overlap); P21398|AOFA_BOVIN from Bos taurus (Bovine) (527 aa), FASTA scores: opt: 806, E(): 2.8e-40, (37.0% identity in 446 aa overlap); P21397|AOFA_HUMAN (527 aa), FASTA scores: opt: 801, E(): 5.6e-40, (37.2% identity in 446 aa overlap); etc. Alternative start possible at position 3538487. BELONGS TO THE FLAVIN MONOAMINE OXIDASE FAMILY. COFACTOR: FAD (POTENTIAL). Protein product from Mb3195 detected using SWATH mass spectrometry. Mb3195 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63534" /db_xref="InterPro:IPR001613" /db_xref="InterPro:IPR002937" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:P63534" /protein_id="SIU01822.1" /translation="MTNPPWTVDVVVVGAGFAGLAAARELTRQGHEVLVFEGRDRVGG RSLTGRVAGVPADMGGSFIGPTQDAVLALATELGIPTTPTHRDGRNVIQWRGSARSYR GTIPKLSLTGLIDIGRLRWQFERIARGVPVAAPWDARRARELDDVSLGEWLRLVRATS SSRNLMAIMTRVTWGCEPDDVSMLHAARYVRAAGGLDRLLDVKNGAQQDRVPGGTQQI AQAAAAQLGARVLLNAAVRRIDRHGAGVTVTSDQGQAEAGFVIVAIPPAHRVAIEFDP PLPPEYQQLAHHWPQGRLSKAYAAYSTPFWRASGYSGQALSDEAPVFITFDVSPHADG PGILMGFVDARGFDSLPIEERRRDALRCFASLFGDEALDPLDYVDYRWGTEEFAPGGP TAAVPPGSWTKYGHWLREPVGPIHWASTETADEWTGYFDGAVRSGQRAAAEVAALL" CDS complement(3501509..3502408) /codon_start=1 /transl_table=11 /gene="hpx" /locus_tag="BQ2027_MB3196C" /product="POSSIBLE NON-HEME HALOPEROXIDASE HPX" /note="Mb3196c, hpx, len: 299 aa. Equivalent to Rv3171c, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 299 aa overlap). Possible hpx, non-heme haloperoxidase (EC 1.11.1.-), similar to other hydrolases (principaly epoxide hydrolases) and non-heme chloroperoxidases e.g. Q9RKB6|SCE87.22c PUTATIVE HYDROLASE from Streptomyces coelicolor (314 aa), FASTA scores: opt: 431, E(): 6e-20, (38.05% identity in 297 aa overlap); Q9HZ14|PA3226 PROBABLE HYDROLASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 236, E(): 1e-07, (29.6% identity in 277 aa overlap); Q9DBL9|1300003 D03RIK PROTEIN SIMILAR TO ALPHA/BETA HYDROLASE FOLD from Mus musculus (Mouse) (351 aa), FASTA scores: opt: 223, E(): 8.3e-07, (24.35% identity in 304 aa overlap); AAK46260|MT1988 EPOXIDE HYDROLASE from Mycobacterium tuberculosis strain CDC1551 (356 aa), FASTA scores: opt: 223, E(): 8.4e-07, (40.7% identity in 113 aa overlap); P49323|PRXC_STRLI|CPO|CPOL NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) (CHLORIDE PEROXIDASE) from Streptomyces lividans (275 aa), FASTA scores: opt: 220, E(): 1e-06, (29.5% identity in 305 aa overlap); etc. Equivalent to AAK47599 Hydrolase, alpha/beta hydrolase family from Mycobacterium tuberculosis strain CDC1551 but shorter 24 aa. Start chosen by similarity, alternative with good RBS possible. Protein product from Mb3196c detected using SWATH mass spectrometry. Mb3196c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3B7" /db_xref="InterPro:IPR022742" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B7" /protein_id="SIU01823.1" /translation="MTVRAADGTPLHTQVFGPPHGYPIVLTHGFVCAIRAWAYQIADL AGDYRVIAFDHRGHGRSGVPRRGAYSLNHLAADLDSVLDATLAPRERAVVAGHSMGGI TIAAWSDRYRHKVRRRTDAVALINTTTGDLVRKVKLLSVPRELSPVRVLAGRSLVNTF GGFPLPGAARALSRHVISTLAVAADADPSATRLVYELFTQMSAAGRGGCAKMLVEEVG SAHLNLDGLTVPTLVIGGVRDRLTPISQSRRIARTAPNVVGLVELPGGHCSMLERHQE VNSHLRALAESVTRHVRDRRISS" CDS complement(3502545..3503027) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3197C" /product="HYPOTHETICAL PROTEIN" /note="Mb3197c, -, len: 160 aa. Equivalent to Rv3172c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Hypothetical unknown protein. Mb3197c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C1" /protein_id="SIU01824.1" /translation="MSVALLREMFDRMVVAKNAELIEHYYDPDFLMYSDGLSQSFAKF RDSHRKLYATAISYAVEYDEHAWVEAQTRLPGGCGSPRRDLARSRPASRWYSLPPTAT AEFTGSGRRRGRVGATWPPSTITETTTDRLAMRNQLRAGAATLLFCDPMLQRFPATRK " CDS complement(3503106..3503708) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3198C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR/ACRR-FAMILY)" /note="Mb3198c, -, len: 200 aa. Equivalent to Rv3173c, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 200 aa overlap). Probable transcriptional regulatory protein tetR family, similar to several bacterial putative regulatory proteins e.g. Q9EWI2|SC7H9.14 from Streptomyces coelicolor (195 aa), FASTA scores: opt: 319, E(): 1.7e-13, (34.55% identity in 195 aa overlap); O85695|3SCF60.04 from Streptomyces lividans and Streptomyces coelicolor (192 aa), FASTA scores: opt: 297, E(): 4.3e-12, (37.45% identity in 187 aa overlap); BAB50853|MLR4117 from Rhizobium loti (Mesorhizobium loti) (205 aa), FASTA scores: opt: 280, E(): 5.5e-11, (31.45% identity in 194 aa overlap); BAB53760|MLL8133 from Rhizobium loti (Mesorhizobium loti) (194 aa), FASTA scores: opt: 270, E(): 2.3e-10, (34.05% identity in 185 aa overlap); etc. Also similar to other regulators from Mycobacterium tuberculosis e.g. P96839|Rv3557c|MTCY06G11.04c (200 aa), FASTA scores: opt: 154, E(): 0.0013, (38.8% identity in 80 aa overlap). Contains probable helix-turn-helix motif from aa 39 to 60 (Score 1251, +3.45 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3198c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3198c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y553" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y553" /protein_id="SIU01825.1" /translation="MPPVTRTTEPPRRGGRGARQRILKAAAELFYCEGINATGVELIA NKASVSKRTLYQHFPSKSALVEEYLRGLRQAAGEADKMPKASNATPRERLLALFDRPN RGDGRMRGCPFHNAAVEAAGEMPGVERIVHSHKRDYIKGLARLAREAGAAHPRSLGNQ LAVLFEGAAALSTSLDDAGPWAHARAAAEVLIDQATARPV" CDS 3503801..3504508 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3199" /product="PROBABLE SHORT-CHAIN DEHYDROGENASE/REDUCTASE" /note="Mb3199, -, len: 235 aa. Equivalent to Rv3174, len: 235 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 235 aa overlap). Probable oxidoreductase short-chain dehyrogenase/reductase (EC 1.-.-.-), similar to others e.g. Q9RPT7|SITS from Streptomyces albus (223 aa), FASTA scores: opt: 654, E(): 6.1e-32, (49.3% identity in 215 aa overlap); Q9RI61|SCJ11.46 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 626, E(): 2.9e-30, (50.9% identity in 224 aa overlap); Q9A5Z1|CC2306 from Caulobacter crescentus (252 aa), FASTA scores: opt: 430, E(): 1.3e-18, (39.45% identity in 228 aa overlap); Q51641 INSECT-TYPE DEHYDROGENASE (249 aa), FASTA scores: opt: 301, E(): 5.7e-11, (38.3% identity in 188 aa overlap); Q9HXC9|PA3883 from Pseudomonas aeruginosa (276 aa), FASTA scores: opt: 296, E(): 1.2e-10, (29.55% identity in 247 aa overlap); etc. MAY BE BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Mb3199 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z4" /protein_id="SIU01826.1" /translation="MTSLAERTALVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAI DVSDPRVIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGELE TNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMWSATESMRIEL APRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDGIEAGKEDVLADEMSRQVR ASLNVPARERIARLMGN" CDS 3504523..3506010 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3200" /product="POSSIBLE AMIDASE (AMINOHYDROLASE)" /note="Mb3200, -, len: 495 aa. Equivalent to Rv3175, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 495 aa overlap). Possible amidase (EC 3.5.1.-), similar to others e.g. Q9F6D0|ZHUL ENANTIOMER SELECTIVE AMIDASE from Streptomyces sp. R1128 (507 aa), FASTA scores: opt: 1328, E(): 7.5e-69, (44.5% identity in 492 aa overlap); BAB51815|MLR5350 PROBABLE AMIDASE from Rhizobium loti (Mesorhizobium loti) (457 aa), FASTA scores: opt: 7487, E(): 1.3e-35, (35.9% identity in 482 aa overlap); O28325|YJ54_ARCFU|AF1954 PUTATIVE AMIDASE (EC 3.5.1.4) from Archaeoglobus fulgidus (453 aa), FASTA scores: opt: 532, E(): 3.2e-23, (32.05% identity in 471 aa overlap); etc. But also similar to glutamyl-tRNA amidotransferases who belong to amidase family e.g. Q9RTA9|DR1856 GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE, SUBUNIT A from Deinococcus radiodurans (482 aa), FASTA scores: opt: 560, E(): 8.2e-25, (30.6% identity in 513 aa overlap); Q9LCX3|GATA GLU/ASP-TRNA AMIDOTRANSFERASE SUBUNIT A from Thermus aquaticus (subsp. thermophilus) (471 aa), FASTA scores: opt: 558, E(): 1.1e-24, (30.85% identity in 486 aa overlap); Q49091|GATA_MORCA GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE SUBUNIT A (EC 6.3.5.-) from Moraxella catarrhalis (492 aa), FASTA scores: opt: 526, E(): 7.5e-23, (30.45% identity in 473 aa overlap); etc. SEEMS TO BELONG TO THE AMIDASE FAMILY. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="GOA:A0A1R3Y3J8" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J8" /protein_id="SIU01827.1" /translation="MAMSAKASDDIAWLPATAQLAVLAAKKVSSAELVELYLSRIDTY NASLNAIVTVDPDAARRVAKRSDAARARGDELGPLHGLPITVKDSYETAGMRTTCGRR DLADYVPTQDAEAVARLRRAGAIIMGKTNMPTGNQDVQASNPVFGRTNNPWDAARTSG GSAGGGAAATAAGLTSFDYGSEIGGSTRIPAHYCGLYGHKSTWRSVPLVGHIPSAPGN PGRWGQADMACAGVQVRGARDIIPALEATVGPMRADGGFSYALAPPRAGALKDFRVAV WAEDPHCPIDADVRRAMDDAVAALRAAGAHVVEQPATIPVDMAVSHNIFQSLVFGAFA VDRSTLSPASAAALGLRAVRHPRGEAANALGATLQSHRAWLFADAARHEMRDRWAGFF NEFDVLLLPVTPTPAPLHHNKDHDRLGRTIDVDGVSRSYWDQLKWNALANIAGTPATT MPITTTATGLPIGIQAMGPAGGDRTTVEFAALLTEVLGGFRVPPL" CDS complement(3506007..3506324) /codon_start=1 /transl_table=11 /gene="mesTb" /locus_tag="BQ2027_MB3201C" /standard_name="lipS" /product="PROBABLE EPOXIDE HYDROLASE MESTB [SECOND PART] (EPOXIDE HYDRATASE) (ARENE-OXIDE HYDRATASE)" /note="Mb3201c, mesTb, len: 105 aa. Equivalent to 3' end of Rv3176c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 105 aa overlap). Probable mesT,epoxide hydrolase (EC3.3.2.3), similar to others e.g. O15007|PEG1|MEST|Q92571|O14973 MEST PROTEIN (MESODERM SPECIFIC TRANSCRIPT (MOUSE) HOMOLOG) (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Homo sapiens (Human) (335 aa), FASTA scores: opt: 348, E(): 6e-15, (32.15% identity in 280 aa overlap); AAH06639|Q07646 MEST PROTEIN from Mus musculus (Mouse) (335 aa), FASTA scores: opt: 342, E(): 1.4e-14, (31.45% identity in 280 aa overlap); Q9I8E7|MEST EPOXIDE HYDROLASE (EC 3.3.2.3) from Fugu rubripes (Japanese pufferfish) (Takifugu rubripes) (326 aa), FASTA scores: opt: 322, E(): 2.7e-13, (29.55% identity in 301 aa overlap); Q9PUC9|PEG1|MEST EPOXIDE HYDROLASE from Brachydanio rerio (Zebrafish) (Zebra danio) (344 aa), FASTA scores: opt: 322, E(): 2.8e-13, (32.35% identity in 207 aa overlap); Q9HYH6|PA3429 PROBABLE EPOXIDE HYDROLASE from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 258, E(): 3e-09, (29.85% identity in 288 aa overlap); O31243|ECHA EPOXIDE HYDROLASE from Agrobacterium radiobacter (294 aa), FASTA scores: opt: 202, E(): 1.1e-05, (27.0% identity in 278 aa overlap); etc. Also similar to Q50599|Rv1834|MT1882|MTCY1A11.09c HYPOTHETICAL 31.7 KDA PROTEIN from Mycobacterium tuberculosis (288 aa), FASTA scores: opt: 294, E(): 1.5e-11, (29.95% identity in 287 aa overlap). Equivalent to AAK47604 from Mycobacterium tuberculosis strain CDC1551 (339 aa) but shorter 21 aa. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. MAY BE BELONG TO PEPTIDASE FAMILY S33. Note that previously known as lipS. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mesT exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits mesT into 2 parts, mesTa and mesTb." /db_xref="GOA:A0A1R3Y3P5" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P5" /protein_id="SIU01828.1" /translation="MKELHDAISRRDGVRVLPATAGFVDEHREHAARWDLARIISALG DEVAFGVVGSAEDPFEGEQLRLARERLADSVEITELAGGHLTTAEQPDRLAEVIAALP ERS" CDS complement(3506336..3506962) /codon_start=1 /transl_table=11 /gene="mesTa" /locus_tag="BQ2027_MB3202C" /standard_name="lipS" /product="PROBABLE EPOXIDE HYDROLASE MESTA [FIRST PART] (EPOXIDE HYDRATASE) (ARENE-OXIDE HYDRATASE)" /note="Mb3202c, mesTa, len: 208 aa. Equivalent to 5' end of Rv3176c, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Probable mesT, epoxide hydrolase (EC 3.3.2.3), similar to others e.g. O15007|PEG1|MEST|Q92571|O14973 MEST PROTEIN (MESODERM SPECIFIC TRANSCRIPT (MOUSE) HOMOLOG) (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Homo sapiens (Human) (335 aa), FASTA scores: opt: 348, E(): 6e-15, (32.15% identity in 280 aa overlap); AAH06639|Q07646 MEST PROTEIN from Mus musculus (Mouse) (335 aa), FASTA scores: opt: 342, E(): 1.4e-14, (31.45% identity in 280 aa overlap); Q9I8E7|MEST EPOXIDE HYDROLASE (EC 3.3.2.3) from Fugu rubripes (Japanese pufferfish) (Takifugu rubripes) (326 aa), FASTA scores: opt: 322, E(): 2.7e-13, (29.55% identity in 301 aa overlap); Q9PUC9|PEG1|MEST EPOXIDE HYDROLASE from Brachydanio rerio (Zebrafish) (Zebra danio) (344 aa), FASTA scores: opt: 322, E(): 2.8e-13, (32.35% identity in 207 aa overlap); Q9HYH6|PA3429 PROBABLE EPOXIDE HYDROLASE from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 258, E(): 3e-09, (29.85% identity in 288 aa overlap); O31243|ECHA EPOXIDE HYDROLASE from Agrobacterium radiobacter (294 aa), FASTA scores: opt: 202, E(): 1.1e-05, (27.0% identity in 278 aa overlap); etc. Also similar to Q50599|Rv1834|MT1882|MTCY1A11.09c HYPOTHETICAL 31.7 KDA PROTEIN from Mycobacterium tuberculosis (288 aa), FASTA scores: opt: 294, E(): 1.5e-11, (29.95% identity in 287 aa overlap). Equivalent to AAK47604 from Mycobacterium tuberculosis strain CDC1551 (339 aa) but shorter 21 aa. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. MAY BE BELONG TO PEPTIDASE FAMILY S33. Note that previously known as lipS. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, mesT exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits mesT into 2 parts, mesTa and mesTb. Mb3202c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3B3" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B3" /protein_id="SIU01829.1" /translation="MTHRASALISAQEWFSAGERVGYDAERPGINPRSPLRAFIRRAA GTGVTRTFLPGWPDGSYGWAKVEAFLSSRFHFPRIYLDYIGHGDSDKPRDYPYSTFER ADLVEALWHAEGIAQTVVVAFDYSCIVSLELLARRIDRERAGNDQRTRITACLLANGG IFADGHTHAWYTTPLLTSPLGAAITPIGQRSWRMFAPFLRPSSRADTH" CDS 3507109..3507969 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3203" /product="POSSIBLE PEROXIDASE (NON-HAEM PEROXIDASE)" /note="Mb3203, -, len: 286 aa. Equivalent to Rv3177, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Possible peroxidase (non-haem peroxidase) (EC 1.11.1.-), highly similar to Q9KJF9|W78 CULTIVAR SPECIFICITY PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) W78 from Rhizobium leguminosarum (287 aa), FASTA scores: opt: 1059, E(): 2.3e-59, (61.4% identity in 272 aa overlap); BAB48728|MLL1328 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 746, E(): 1.1e-39, (43.25% identity in 282 aa overlap). Similar to nonheme chloroperoxidases and related esterases e.g. O73957|SAL LIPOLYTIC ENZYME from Sulfolobus acidocaldarius (314 aa), FASTA scores: opt: 408, E(): 1.9e-18, (32.4% identity in 287 aa overlap); Q9AJM9|BIOH PROTEIN INVOLVED IN BIOTIN SYNTHESIS from Kurthia sp. 538-KA26 (267 aa), FASTA scores: opt: 324, E(): 3.2e-13, (30.0% identity in 250 aa overlap); Q9CBB1|ML2269 PUTATIVE HYDROLASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Mycobacterium leprae (265 aa); O05691|THCF_RHOER NON-HEME HALOPEROXIDASE (EC 1.11.1.-) from Rhodococcus erythropolis (SIMILAR TO OTHER BACTERIAL NON-HEME BROMO-AND CHLORO-PEROXIDASES) (274 aa), FASTA scores: opt: 279, E(): 2.2e-10, (29.0% identity in 276 aa overlap); Q53540|EST ESTERASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Pseudomonas putida (276 aa), FASTA scores: opt: 271, E(): 7.1e-10, (29.65% identity in 280 aa overlap); etc. Also similar to O06420|BPOC|Rv0554|MTCY25D10.33 HYPOTHETICAL 28.3 KDA PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from M. tuberculosis (262 aa), FASTA scores: opt: 280, E(): 1.8e-10, (28.0% identity in 257 aa overlap). Equivalent to AAK47605 from Mycobacterium tuberculosis strain CDC1551 (300 aa) but shorter 14 aa. SIMILAR TO ALPHA/BETA HYDROLASE FOLD." /db_xref="GOA:A0A1R3Y3D2" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D2" /protein_id="SIU01830.1" /translation="MPQRQAGDIGATYQDAPTKSINVGGTRFVYRRLGADAGVPVIFL HHLGAVLDNWDPRVVDGIAAKHPVVTFDNRGVGASEGQTPDTVTTMADDAIAFVRALG FDQVDLLGFSLGGFVAQVIAQQEPQLVRKIILAGTGPAGGVGIGKVTFGTIRESIKAT LTFRDPKELRFFTRTDSGKSAARQFVKRLKERKDNRDKSITVRAFRSQLKAIHAWGTQ KPSDLTSIGHPVLIANGDDDTMVPTSNSLDLADRLPDATLRIYPDAGHGGIFQHHAQF VDDALQFLES" CDS 3508100..3508459 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3204" /product="Putative F420H(2)-dependent quinone reductase Rv3178 (Fqr) (EC" /EC_number="1.1.98.-" /note="Mb3204, -, len: 119 aa. Equivalent to Rv3178, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Hypothetical protein, with some similarity to other hypothetical bacterial proteins (principaly mycobacterium and streptomyces proteins) e.g. P71854|Rv3547|MTCY03C7.09c from M. tuberculosis strain H37Rv (151 aa), FASTA scores: opt: 310, E(): 2e-14, (40.5% identity in 116 aa overlap); Q9ZH81 from M. paratuberculosis (144 aa), FASTA scores: opt: 274, E(): 5.6e-12, (38.9% identity in 108 aa overlap); O85698|3SCF60.07 from Streptomyces lividans and Streptomyces coelicolor (149 aa), FASTA scores: opt: 235, E(): 2.7e-09, (35.2% identity in 108 aa overlap); Q10772|YF58_MYCTU|Rv1558|MT1609|MTCY48.07c (148 aa); Q9WX21|SCE68.11 from Streptomyces coelicolor (305 aa); etc. Equivalent to AAK47606 from Mycobacterium tuberculosis strain CDC1551 (171 aa) but shorter 52 aa. Protein product from Mb3204 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3C2" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C2" /protein_id="SIU01831.1" /translation="MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVA SALGQAENPQWYRNLPPNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDS CQSWTERGIPVIILRPR" CDS 3509280..3510569 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3205" /product="ATPase" /note="Mb3205, -, len: 429 aa. Equivalent to Rv3179, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 429 aa overlap). Conserved hypothetical protein, highly similar to Q9KH61 PUTATIVE ATP/GTP BINDING PROTEIN from Mycobacterium smegmatis (428 aa), FASTA scores: opt: 2466, E(): 1.5e-148, (89.7% identity in 428 aa overlap) (no article found on the NCBI web site (July 2001)); and to other hypothetical bacterial proteins e.g. O07781|Rv0597c|MTCY19H5.25 from M. tuberculosis (411 aa), FASTA scores: opt: 1031, E(): 8e-58, (41.5% identity in 417 aa overlap); BAB54715|MLR9349 from Rhizobium loti (Mesorhizobium loti) (435 aa), FASTA scores: opt: 365, E(): 1.1e-15, (31.75% identity in 416 aa overlap); etc. Equivalent to AAK47609 from Mycobacterium tuberculosis strain CDC1551 (454 aa) but shorter 25 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3205 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR025420" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041682" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3B8" /protein_id="SIU01832.1" /translation="MVHDEAGHELIERHMLEQLREVAEYTRVVLINGPRQAGKTTLLQ QLHAELGGWLRSLDVDVERASARADPEGYIMSAPRPTFLDEVQCAGDPLILAIKTATD RDRRPRQFFLSGSTRFLTVPTLSESLAGRVAILDLWPLSVAERSGVRPEIIAQLFTEP QVVLGTEPAPVTRHEYLQLACAGGFPEVVQRPAGRARSRWFSDYLRTVTQRDVRELKR IEQTDRLPRFMRYLAAITAQELNVAEAARVIGVDAGTIRSDLALFETVYLVHRLPAWS RNLTAKIKKRSKIHVVDSGFAAWLRGQSADSLARPTAEGAGPIMETFVINELMKLRAA TELEVDLYHFRDRDGREIDCILQTPDSRVVGVEVKASATVNVHDFRHLSFARDRLGDE FITGVLFYTGARALPFGDRLMALPINLLWNGQSVSSL" CDS complement(3510916..3511350) /codon_start=1 /transl_table=11 /gene="vapC49" /locus_tag="BQ2027_MB3206C" /product="Predicted nucleic acid-binding protein, contains PIN domain" /note="Mb3206c, -, len: 144 aa. Equivalent to Rv3180c, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Hypothetical unknown ala-rich protein. Contains probable coiled-coil domain from aa 40 to 70. Protein product from Mb3206c detected using shotgun mass spectrometry. Mb3206c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3C5" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C5" /protein_id="SIU01833.1" /translation="MPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEV RAALAAAARNHDLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGAD AVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQLDP" CDS complement(3511353..3511805) /codon_start=1 /transl_table=11 /gene="vapB49" /locus_tag="BQ2027_MB3207C" /product="Antitoxin of toxin-antitoxin stability system" /note="Mb3207c, -, len: 150 aa. Equivalent to Rv3181c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 150 aa overlap). Hypothetical protein, with some similarity to other mycobacterium proteins e.g. Q50718|YY07_MYCTU|Rv3407|MT3515|MTCY78.21c (99 aa), FASTA scores: opt: 123, E(): 0.25, (33.7% identity in 89 aa overlap); and O50412|Rv3385c|MTV004.43c (102 aa), FASTA scores: opt: 123, E(): 0.26, (39.7% identity in 68 aa overlap). Protein product from Mb3207c detected using SWATH mass spectrometry. Mb3207c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D1" /protein_id="SIU01834.1" /translation="MQLGRKVTSHHDIDRFGVASTADESVYRPLPPRLRLAQVNLSRR RCRTQSDMYKSRFSECTVQSVDVSVTELRAHLSDWLDRARAGGEVVITERGIPIARLA ALDSTDTLERLTAEGVIGKATAQRPVAAGRPRPRPQRPVSDRVSDQRR" CDS 3512036..3512380 /codon_start=1 /transl_table=11 /gene="higB3" /locus_tag="BQ2027_MB3208" /product="Toxin HigB" /note="Mb3208, -, len: 114 aa. Equivalent to Rv3182, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Hypothetical protein, with some similarity to other hypothetical bacterial proteins e.g. O53468|Rv2022c|MTV018.09c from M. tuberculosis (201 aa), FASTA scores: opt: 335, E(): 3.6e-16, (51.9% identity in 104 aa overlap); and Q9L3R6|ORF119 from Anabaena sp. strain PCC 7120 (119 aa), FASTA scores: opt: 250, E(): 1.6e-10, (42.1% identity in 95 aa overlap). Equivalent to AAK47614 from Mycobacterium tuberculosis strain CDC1551 (94 aa) but longer 20 aa. Mb3208 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009241" /db_xref="UniProtKB/TrEMBL:A0A1R3Y560" /protein_id="SIU01835.1" /translation="MAVILLPQVERWFFALNRDAMASVTGAIDLLEMEGPTLGRPVVD KVNDSTFHNMKELRPAGTSIRILFAFDPARQAILLLGGDKAGNWKRWYDNNIPIADQR SENWLASEHGGG" CDS 3512377..3512706 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3209" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb3209, -, len: 109 aa. Equivalent to Rv3183, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Possible transcriptional regulator, similar to others e.g. Q9S1D9|YPPCP1.08c from Yersinia pestis (99 aa), FASTA scores: opt: 119, E(): 0.47, (40.55% identity in 74 aa overlap); Q9X153|TM1330 from Thermotoga maritima (111 aa), FASTA scores: opt: 115, E(): 0.91, (40.35% identity in 57 aa overlap); P95258|Rv1956|MTCY09F9.08c (alias AAK46277 putative DNA-binding protein from strain CDC1551) (149 aa), FASTA scores: opt: 116, E(): 1, (42.25% identity in 71 aa overlap). Also similar to O53467|Rv2021c|MTV018.08c from Mycobacterium tuberculosis (101 aa), FASTA scores: opt: 214, E(): 5.8e-07, (43.0% identity in 107 aa overlap). Contains probable helix-turn-helix motif from aa 51 to 72 (Score 1803, +5.33 SD). Mb3209 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y409" /db_xref="InterPro:IPR001387" /db_xref="InterPro:IPR010982" /db_xref="InterPro:IPR039554" /db_xref="UniProtKB/TrEMBL:A0A1R3Y409" /protein_id="SIU01836.1" /translation="MTMARNWRDIRADAVAQGRVDLQRAAVAREEMRDAVLAHRLAEI RKALGHARQADVAALMGVSQARVSKLESGDLSHTELGTLQAYVAALGGHLRIVAEFGE NTVELTA" CDS 3513244..3513591 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3210" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3210, -, len: 115 aa. Equivalent to Rv3188, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Conserved hypothetical protein, with similarity to other proteins from Mycobacterium tuberculosis: Q10868|YJ90_MYCTU|Rv1990c|MT2044|MTCY39.29 HYPOTHETICAL PROTEIN (113 aa), FASTA scores: opt: 184, E(): 8.1e-06, (28.45% identity in 109 aa overlap); and O06299|Rv0348|MTCY13E10.08 HYPOTHETICAL PROTEIN (217 aa), FASTA scores: opt: 129, E(): 0.074, (30.0% identity in 100 aa overlap). Also some similarity with C-terminus of Q9XA59|SCGD3.19 PUTATIVE TWO-COMPONENT SYSTEM RESPONSE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (218 aa), FASTA scores: opt: 114, E(): 0.76, (30.0% identity in 110 aa overlap) (for this one, no similarity exists in the N-terminal region with the N-terminus of other regulatory components of sensory transduction systems). Protein product from Mb3210 detected using SWATH mass spectrometry. Mb3210 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024467" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K0" /protein_id="SIU01837.1" /translation="MAVTLDRAVEASEIVDALKPFGVTQVDVAAVIQVSDRAVRGWRT GDIRPERYDRLAQLRDLVLLLSDSLTPRGVGQWLHAKNRLLDGQRPVDLLAKDRYEDV RSAAESFIDGAYV" CDS 3513588..3514208 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3211" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3211, -, len: 206 aa. Equivalent to Rv3189, len: 206 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 206 aa overlap). Conserved hypothetical protein, weakly similar to other proteins from Mycobacterium tuberculosis e.g. O86329|MBTE|Rv2380c|MTCY22H8.05 (1682 aa), FASTA scores: opt: 135, E(): 0.79, (27.8% identity in 187 aa overlap); and Q10869|YJ89_MYCTU|Rv1989c|MT2043MTCY39.30 (186 aa), FASTA scores: opt: 122, E(): 0.85, (32.25% identity in 93 aa overlap). Protein product from Mb3211 detected using SWATH mass spectrometry. Mb3211 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014914" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q0" /protein_id="SIU01838.1" /translation="MKLADAIATAPRRTLKGTYWHQGPTRHPVTSCADPARGPGRYHR TGEPGVWYASNKEQGAWAELFRHFVDDGVDPFEVRRRVGRVAVTLQVLDLTDERTRSH LGVDETDLLSDDYTTTQAIAAARDANFDAVLAPAAALPGCQTLAVFVHALPNIEPERS EVRQPPPRLANLLPLIRPHEHMPDSVRRLLATLTRAGAEAIRRRRR" CDS complement(3514368..3515633) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3212C" /product="HYPOTHETICAL PROTEIN" /note="Mb3212c, -, len: 421 aa. Equivalent to Rv3190c, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 421 aa overlap). Hypothetical unknown protein. Protein product from Mb3212c detected using SWATH mass spectrometry. Mb3212c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3C3" /protein_id="SIU01839.1" /translation="MEYVQLFSKGRLNDLAGSLAGFLGKASQATAQRLQSWDADDLLN TPVDDVVEQLVELGSVECPDLRVDDAFMLPATEVDQQYRDWGEQRTRRVTRLVLVVPF EGHKDIFNLRPDQFTTMPPQVLRLQGHEIHLAIDNPSNDAAAINAAFHKQIANIEKYL GWSRRQIDLHNQGLRNELPGMVARRREQLLATRNLQAEIGFPVRRRKDADTYAAPISR KSVRPRPHRPAGARAAFKPEPAMQDEDYQSALRVLRNQRNALERTPSVAAKLDGEEIR DMLLVGLNAQFEGDAGGELFNGAGKTDILIRVDDRNIFIGECKVWSGPRTMDDALKQL FGYLVWRDTKAAILLFIRNKDVTAVIDNAIAKIKEHPNHKRCPAHRAGADQYEFTMHA DGDPEREIHLTLIPFALRPTAEVPTTTIP" CDS 3515801..3516010 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3212A" /product="Conserved protein" /note="Mb3212A, len: 69 aa. Equivalent to Rv3190A len: 69 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 69 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Conserved protein. Mb3212A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E0" /protein_id="SIU01840.1" /translation="MITVLDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYE ALKELEAQVIALQRSEGKGLLSRLS" mobile_element complement(3516208..3517597) /mobile_element_type="insertion sequence:IS1603" /locus_tag="BQ2027_IS1603" /note="IS1603, len: 1390 nt. Equivalent to IS1603, len: 1032 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1032 nt overlap)." repeat_region 3516208..3516270 /rpt_type=INVERTED /note="63 bp imperfect inverted repeat, IRR, TGTCAGCGGCAACCGAAAACTGATCAGGTGTCGGCAAGGTGGTTTCTAGGCGGTGTCG C AACA, flanking IS element IS1603." gene complement(3516208..3517597) /locus_tag="BQ2027_IS1603" CDS complement(3516257..3517291) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3213C" /product="PROBABLE TRANSPOSASE" /note="Mb3213c, -, len: 344 aa. Equivalent to Rv3191c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Probable transposase, similar to many especially Q9K2N8 PUTATIVE TRANSPOSASE from Pseudomonas aeruginosa (338 aa), FASTA scores: opt: 837, E(): 1.3e-43, (42.55% identity in 336 aa overlap); Q9RBF4 INSERTION SEQUENCE IS1088 from Alcaligenes eutrophus (Ralstonia eutropha) (342 aa), FASTA scores: opt: 823, E(): 9.2e-43, (43.05% identity in 337 aa overlap); and Q51379 PUTATIVE TRANSPOSASE from Pseudomonas alcaligenes (338 aa), FASTA scores: opt: 818, E(): 1.8e-42, (42.35% identity in 333 aa overlap). Contains probable helix-turn-helix motif from aa 25 to 46 (Score 1968, +5.89 SD)." /db_xref="GOA:A0A1R3Y3D3" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR025246" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D3" /protein_id="SIU01841.1" /translation="MRQISSRYLSEEERINIADLRRSGLSIRKIADQLGRAPSTVSRE LRRNSRRDGQYRPFEAHRWAVQRRVRRHRRRIDKNPDLCELIAELLAQRWSPQQIARH LRRKYPDDRSMWLCHESIYQAVYQPQSRLIRPPQVKSPHRGPLRTGRTHRRAHLRPGR RRPRFAQPMLSIHQRPFDPADRSEPGHWEGDLIVGKNQGSAIGTLVERQTRLIRLLHL PTHDAYCLRIAITETMSDLPVTLVRSITWDQGIEMARHIDITADLGAPVYFCDSRSPW QRASNENSNGLLRQYFPKGTSLSTYTPDHLRAVEYEINNRPRQVLGHRSPAELFTALL TSPDHQLLRR" repeat_region complement(3517535..3517597) /rpt_type=INVERTED /note="63 bp imperfect inverted repeat, IRL, TGTCGGCGGCAACTGAATACTGACCAGAGCGCGGCAAGGTGGGTTCTAGTCAACGTCG C AACA, flanking IS element IS1603." tRNA complement(3518316..3518389) /locus_tag="BQ2027_METU" /product="tRNA-Met" /note="metU, len: 74 nt. Equivalent to metU, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-f-Met, anticodon cat. Described in EM_BA: MTMETA Y08623 M.tuberculosis as metA gene. Name changed to metU as metA encodes homoserine transsuccinylase.; fMet" CDS 3518509..3518970 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3214" /product="Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases" /note="Mb3214, -, len: 153 aa. Equivalent to Rv3192, len: 153 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 153 aa overlap). Conserved hypothetical ala- and pro-rich protein, with weak similarity to N-terminal half of several proteins e.g. Q11030|YD60_MYCTU|Rv1360|MT1405|MTCY02B10.24 HYPOTHETICAL 37.3 KDA PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 245, E(): 3.7e-08, (33.1% identity in 157 aa overlap); O30260|AF2411 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (363 aa), FASTA scores: opt: 144, E(): 0.072, (32.6% identity in 92 aa overlap); Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 133, E(): 0.33, (25.15% identity in 159 aa overlap). Protein product from Mb3214 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3D0" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D0" /protein_id="SIU01842.1" /translation="MIPQPLSQLGDLARRPGRRVLCSPKTAAPSISNATVASPAAPGL ELSTGIALAFPRGPFVPAAAAWELQEATSGKFQLGLGTQVRKNVVHRYGMAFHRPGPR LRYLLAVKACFAVFQTGTPDHHGEFDNPDFITAQWSPARIDPPGPSPAGPR" CDS complement(3519140..3522118) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3215C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3215c, -, len: 992 aa. Equivalent to Rv3193c, len: 992 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 992 aa overlap). Probable conserved transmembrane protein, with hydrophobic N-terminal domain (~1-340 aa), highly similar to Q9CCM6|ML0644 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (983 aa), FASTA scores: opt: 5421, E(): 0, (86.15% identity in 989 aa overlap); and O53609|Rv0064|MTV030.07 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis strain H37Rv (979 aa), FASTA scores: opt: 3204, E(): 2.1e-142, (50.25% identity in 985 aa overlap). C-terminal part (709-990 aa) highly similar to O32904|MLCB1779.46 HYPOTHETICAL 29.1 KDA PROTEIN from Mycobacterium leprae (277 aa), FASTA scores: opt: 1521, E(): 3.4e-64, (82.6% identity in 282 aa overlap). Also some similarity to hypothetical proteins generally transmembrane e.g. Q9FCI4|2SC3B6.28 from Streptomyces coelicolor (815 aa), FASTA scores: opt: 951, E(): 3.4e-37, (39.2% identity in 826 aa overlap); P72637|SLL1060 from Synechocystis sp. strain PCC 6803 (1032 aa), FASTA scores: opt: 938, E(): 1.6e-36, (29.95% identity in 855 aa overlap); O28851|AF1421 from Archaeoglobus fulgidus (880 aa), FASTA scores: opt: 526, E(): 2.6e-17, (28.05% identity in 970 aa overlap); etc. Protein product from Mb3215c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3215c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TX22" /db_xref="InterPro:IPR005372" /db_xref="UniProtKB/Swiss-Prot:Q7TX22" /protein_id="SIU01843.1" /translation="MGMRSAARMPKLTRRSRILIMIALGVIVLLLAGPRLIDAYVDWL WFGELGYRSVFTTMLATRIVVCLVAGVVVGGIVFGGLALAYRTRPVFVPDADNDPVAR YRAVVLARLRLVGIGIPAAIGLLAGIVAQSYWARIQLFLHGGDFGVRDPQFGRDLGFY AFELPFYRLMLSYMLVSVFLAFVANLVAHYIFGGIRLSGRTGALSRSARVQLVSLVGV LVLLKAVAYWLDRYELLSHTRGGKPFTGAGYTDINAVLPAKLILMAIALICAAAVFSA IALRDLRIPAIGLVLLLLSSLIVGAGWPLIVEQISVKPNAAQKESEYISRSITATRQA YGLTSDVVTYRNYSGDSPATAQQVAADRATTSNIRLLDPTIVSPAFTQFQQGKNFYYF PDQLSIDRYLDRNGNLRDYVVAARELNPDRLIDNQRDWINRHTVYTHGNGFIASPANT VRGIANDPNQNGGYPEFLVNVVGANGTVVSDGPAPLDQPRIYFGPVISNTSADYAIVG RNGDDREYDYETNIDTKRYTYTGSGGVPLGGWLARSVFAAKFAERNFLFSNVIGSNSK ILFNRDPAQRVEAVAPWLTTDSAVYPAIVNKRLVWIVDGYTTLDNYPYSELTSLSSAT ADSNEVAFNRLVPDKKVSYIRNSVKATVDAYDGTVTLYQQDEKDPVLKAWMQVFPGTV KPKSDIAPELAEHLRYPEDLFKVQRMLLAKYHVNDPVTFFSTSDFWDVPLDPNPTASS YQPPYYIVAKNIAKDDNSASYQLISAMNRFKRDYLAAYISASSDPATYGNLTVLTIPG QVNGPKLANNAITTDPAVSQDLGVIGRDNQNRIRWGNLLTLPVAQGGLLYVEPVYASP GASDAASSYPRLIRVAMMYNDKVGYGPTVRDALTGLFGPGAGATATGIAPTEAAVPPS PAANPPPPASGPQPPPVTAAPPVPVGAVTLSPAKVAALQEIQAAIGAARDAQKKGDFA AYGSALQRLDEAITKFNDAG" CDS complement(3522210..3523232) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3216C" /product="POSSIBLE CONSERVED SECRETED PROTEIN" /note="Mb3216c, -, len: 340 aa. Equivalent to Rv3194c, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 340 aa overlap). Possible conserved secreted protein (N-terminal stretch hydrophobic), equivalent to Q9CCM7|ML0643 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (340 aa), FASTA scores: opt: 1822, E(): 1.6e-102, (80.3% identity in 340 aa overlap). Also similar to other proteins e.g. Q9FCI6|2SC3B6.26 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (364 aa), FASTA scores: opt: 430, E(): 1.1e-18, (40.95% identity in 359 aa overlap); Q9S3Y5|SDRC SDRC PROTEIN from Streptomyces coelicolor (241 aa), FASTA scores: opt: 396, E(): 8.9e-17, (35.2% identity in 318 aa overlap) (similarity in part for this one); O34470|YLBL YLBL PROTEIN from Bacillus subtilis (350 aa), FASTA scores: opt: 385, E(): 5.6e-16, (27.7% identity in 350 aa overlap); etc. Protein product from Mb3216c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3216c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3D6" /db_xref="InterPro:IPR001478" /db_xref="InterPro:IPR008269" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR027065" /db_xref="InterPro:IPR036034" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D6" /protein_id="SIU01844.1" /translation="MNRRILTLMVALVPIVVFGVLLAVVTVPFVALGPGPTFDTLGEI DGKQVVQIVGTQTYPTSGHLNMTTVSQRDGLTLGEALALWLSGQEQLMPRDLVYPPGK SREEIENDNAADFKRSEAAAEYAALGYLKYPKAVTVASVMDPGPSVDKLQAGDAIDAV DGTPVGNLDQFTALLKNTKPGQEVTIDFRRKNEPPGIAQITLGKNKDRDQGVLGIEVV DAPWAPFAVDFHLANVGGPSAGLMFSLAVVDKLTSGHLVGSTFVAGTGTIAVDGKVGQ IGGITHKMAAARAAGATVFLVPAKNCYEASSDSPPGLKLVKVETLSQAVDALHAMTSG SPTPSC" CDS 3523310..3524728 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3217" /product="secretion" /note="Mb3217, -, len: 472 aa. Equivalent to Rv3195, len: 472 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 472 aa overlap). Hypothetical protein, equivalent to Q49746|ML0642|B1937_C3_231 HYPOTHETICAL 50.3 KDA PROTEIN from Mycobacterium leprae (479 aa), FASTA scores: opt: 2503, E(): 1e-138, (79.35% identity in 475 aa overlap). Similar in part to Q9FCI9|2SC3B6.23c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (487 aa), FASTA scores: opt: 1382, E(): 2.7e-73, (46.4% identity in 489 aa overlap); Q9X8I7|SCE9.14 HYPOTHETICAL 41.2 KDA PROTEIN from Streptomyces coelicolor (375 aa), FASTA scores: opt: 319, E(): 2.4e-11, (25.6% identity in 383 aa overlap); etc. Mb3217 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018766" /db_xref="UniProtKB/TrEMBL:A0A1R3Y569" /protein_id="SIU01845.1" /translation="MSTGEVMGDLPFGFSSGDDPPEDPSGRDKRGKDGADSGSGANPL GAFGIGGEFNMADLGQIFTRLGEMFGGVGTAMAAGKTSGPVNYDLARQVASSSIGFIA PIPAATNSAIADAVHLADTWLDGATSLPAGATKAVGWSPTDWVDNTLATWKRLCDPMA QQISTVWASSLPEEAKSMAGPLLSIMSQMGGIAFGSQLGQALGRLSREVLTSTDIGLP LGPKGVAAILPGAVESFAAGLEQPRSEILTFLATREAAHHRLFSHVPWLASQLLGAVE AYAMGMKIDMTGIEELARDINPTSLADPAAMEQLLSQGVFEPKATPAQTQALERLETL LALIEGWVQTVVTAALGERIPGEAALSETLRRRRASGGPAEQTFATLVGLELRPRKLR EAGALWERLTRAVGMDARDAVWQHPDLLPATDDLDDPAAFIDRVIGGDTSGIDEAIAE LERDQQARGADDSGHDGGPVDN" CDS 3524734..3525633 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3218" /product="secretion" /note="Mb3218, -, len: 299 aa. Equivalent to Rv3196, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). Hypothetical protein, with some similarity to other hypothetical proteins e.g. Q9FCJ5|2SC3B6.17c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (442 aa), FASTA scores: opt: 233, E(): 3.5e-07, (29.9% identity in 261 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y416" /protein_id="SIU01846.1" /translation="MSARSVAPSQVMRRAASALYSLNPAMPVLLRPDGAVQVGWDPRR AVLVRPPRGLTATGLAALLRSMRSPIPITELQRQAAERGLVDGDAMANLVAQLVGAGV ATPLANPGNLDSRRRAASIRVHGRGPLSDLLVQALRCSGARIRHSSQPHAAVTPAGVD LVVLSDYLVADPHMVRDLHTERVPHLPVRVRDGTGMVGPLVVPGVTSCLGCADLHRSD RDAAWPAIAAQLRDTVGVADRATLLATAALALSQVNRVIAAVRGQEATPEPPSALNTT LEFDLNAGSIVARQWTRHPRCFC" CDS complement(3525642..3525842) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3219C" /product="unknown protein" /note="Mb3219c, -, len: 66 aa. Equivalent to Rv3196A, len: 66 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 66 aa overlap). Hypothetical unknown protein. Protein product from Mb3219c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3219c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01847.1" /translation="MQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDV LDTLARAYASISTNVPEQGRLG" CDS 3525980..3527323 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3220" /product="PROBABLE CONSERVED ATP-BINDING PROTEIN ABC TRANSPORTER" /note="Mb3220, -, len: 447 aa. Equivalent to Rv3197, len: 447 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 447 aa overlap). Probable conserved ATP-binding protein ABC transporter, highly similar to Mycobacterium leprae proteins: Q9CCM8|ML0640 HYPOTHETICAL PROTEIN (473 aa), FASTA scores: opt: 2512, E(): 2.1e-140, (83.0% identity in 447 aa overlap). Interestingly, the N-terminal half (1-219 aa) corresponds to Q49747|ABC1|B1937_C3_233 ABC1 PROTEIN from Mycobacterium leprae (267 aa), FASTA scores: opt: 1276, E(): 6.3e-68, (88.6% identity in 219 aa overlap); and the C-terminal half (239-447 aa) corresponds to Q49745|B1937_C2_179 HYPOTHETICAL 23.1 KDA PROTEIN (206 aa), FASTA scores: opt: 1138, E(): 6.5e-60, (77.05% identity in 209 aa overlap); two adjacent orfs from Mycobacterium leprae. Also highly similar to other proteins (generally ABC transporters) e.g. Q9FCJ6|2SC3B6.16c HYPOTHETICAL 51.3 KDA PROTEIN from Streptomyces coelicolor (469 aa), FASTA scores: opt: 1340, E(): 1.8e-71, (45.9% identity in 449 aa overlap); O65576|ABC1AT ABC1 PROTEIN (alias Q9SBB2|T15B16.14|AT4G01660 PUTATIVE ABC TRANSPORTER) from Arabidopsis thaliana (Mouse-ear cress) (623 aa), FASTA scores: opt: 543, E(): 1.7e-24, (28.4% identity in 405 aa overlap); O27682|MTH1645 ABC TRANSPORTER from Methanobacterium thermoautotrophicum (623 aa), FASTA scores: opt: 497, E(): 7.8e-22, (33.0% identity in 309 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3220 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3220 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3R3" /db_xref="InterPro:IPR002575" /db_xref="InterPro:IPR004147" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR034646" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R3" /protein_id="SIU01848.1" /translation="MDDGSVSDIKRGRAARNAKLASIPVGFAGRAALGLGKRLTGKSK DEVTAELMEKAANQLFTVLGELKGGAMKVGQALSVMEAAIPDEFGEPYREALTKLQKD APPLPASKVHRVLDGQLGTKWRERFSSFNDTPVASASIGQVHKAIWSDGREVAVKIQY PGADEALRADLKTMQRMVGVLKQLSPGADVQGVVDELVERTEMELDYRLEAANQRAFA KAYHDHPRFQVPHVVASAPKVVIQEWIEGVPMAEIIRHGTTEQRDLIGTLLAELTFDA PRRLGLMHGDAHPGNFMLLPDGRMGIIDFGAVAPMPGGFPIELGMTIRLAREKNYDLL LPTMEKAGLIQRGRQVSVREIDEMLRQYVEPIQVEVFHYTRKWLQKMTVSQIDRSVAQ IRTARQMDLPAKLAIPMRVIASVGAILCQLDAHVPIKALSEELIPGFAEPDAIVV" CDS complement(3527357..3527635) /codon_start=1 /transl_table=11 /gene="whiB7" /locus_tag="BQ2027_MB3221C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB7" /note="Mb3221c, whiB7, len: 92 aa. Equivalent to Rv3197A, len: 92 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 92 aa overlap). Probable whiB7, WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q49765|WHIB7|ML0639|B1937_F2_68 PUTATIVE TRANSCRIPTIONAL REGULATOR WHIB7 from Mycobacterium leprae (89 aa), FASTA scores: opt: 441, E(): 6.3e-24, (69.3% identity in 88 aa overlap). Similar to Q9FCJ8|2SC3B6.14 PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (122 aa), FASTA scores: opt: 348, E(): 2.2e-17, (57.7% identity in 78 aa overlap); Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa), FASTA scores: opt: 166, E(): 7.1e-05, (39.4% identity in 76 aa overlap); etc. Mb3221c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3E1" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR017956" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E1" /protein_id="SIU01849.1" /translation="MSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCV SCPIRRQCLAAALQRAEPWGVWGGEIFDQGSIVSHKRPRGRPRKDAVA" CDS complement(3528065..3530167) /codon_start=1 /transl_table=11 /gene="uvrD2" /locus_tag="BQ2027_MB3222C" /product="probable atp-dependent dna helicase ii uvrd2" /note="Mb3222c, UvrD2, len: 700 aa. Equivalent to Rv3198c, len: 700 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 700 aa overlap). Probable UvrD2, DNA helicase II homolog (EC 3.6.1.-), equivalent to P53528|UVRD_MYCLE|VRD|UVRD2|ML0637|B1937_F1_27 PROBABLE DNA HELICASE II HOMOLOG from Mycobacterium leprae (714 aa), FASTA scores: opt: 3749, E(): 0, (82.85% identity in 706 aa overlap); and C-terminal half (466-700 aa) corresponds to Q49764|RECQ|B1937_F2_66 PUTATIVE DNA HELICASE RECQ (EC 3.6.1.-) (242 aa), FASTA scores: opt: 1267, E(): 1.4e-69, (82.5% identity in 234 aa overlap); products of two adjacent ORFS in Mycobacterium leprae. Also similar to other DNA helicases e.g. Q9FCK0|2SC3B6.12 from Streptomyces coelicolor (785 aa), FASTA scores: opt: 1687, E(): 1.2e-94, (52.05% identity in 728 aa overlap); P71561|CRA|IVRD|Rv0949|MT0976|MTCY10D7.25c ATP-DEPENDENT DNA HELICASE PCRA from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 715, E(): 1e-35, (34.1% identity in 710 aa overlap); Q9CD72|PCRA_MYCLE|UVRD|ML0153 ATP-DEPENDENT DNA HELICASE PCRA from Mycobacterium leprae (778 aa), FASTA scores: opt: 687, E(): 5.1e-34, (32.0% identity in 719 aa overlap); O83991|TP1028 DNA HELICASE II (UVRD) from Treponema pallidum (670 aa), FASTA scores: opt: 652, E(): 6e-32, (30.25% identity in 671 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UVRD SUBFAMILY OF HELICASES. Protein product from Mb3222c detected using SWATH mass spectrometry. Mb3222c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64321" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR002121" /db_xref="InterPro:IPR010997" /db_xref="InterPro:IPR013986" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="UniProtKB/Swiss-Prot:P64321" /protein_id="SIU01850.1" /translation="MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHR IASLVASGHVAAGQVLAVTFTQRAAGEMRSRLRALDAAARTGSGVGAVQALTFHAAAY RQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGEIEWAKASLIG PEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVTLLDFDDLLLHTAAAIEND AAVAEEFQDRYRCFVVDEYQDVTPLQQRVLSAWLGDRDDLTVVGDANQTIYSFTGASP RFLLDFSRRFPDAAVVRLERDYRSTPQVVSLANRVIAAARGRVAGSKLRLSGQREPGP VPSFHEHSDEPAEAATVAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAY QVRGGEGFFNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARE RWEALTALAELVDDELAQRPALQLPGLLAELRRRAEARHPPVVQGVTLASLHAAKGLE WDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITRARVHLALSWALSRSPG GRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAAARCRICNNELNTSAAVMLRRCETC AADVDEELLLQLKSWRLSTAKEQNVPAYVVFTDNTLIAIAELLPTDDAALIAIPGIGA RKLEQYGSDVLQLVRGRT" CDS 3530291..3530545 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3223" /product="POSSIBLE GLUTAREDOXIN PROTEIN" /note="Mb3223, -, len: 84 aa. Equivalent to Rv3198A, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Possible glutaredoxin protein (EC 1.-.-.-), highly similar to Q9FCK1|2SC3B6.11c PUTATIVE GLUTAREDOXIN-LIKE PROTEIN from Streptomyces coelicolor (80 aa), FASTA scores: opt: 293, E(): 2.2e-14, (55.15% identity in 78 aa overlap); and Q9RSN9|DR2085 PUTATIVE GLUTAREDOXIN from Deinococcus radiodurans (81 aa), FASTA scores: opt: 198, E(): 1.2e-07, (53.55% identity in 56 aa overlap). Also similar to several hypothetical bacterial proteins e.g. Q9X8C2|SCE36.09 HYPOTHETICAL 13.0 KDA PROTEIN from Streptomyces coelicolor (114 aa), FASTA scores: opt: 181, E(): 2.6e-06, (44.45% identity in 72 aa overlap). Protein product from Mb3223 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3223 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3D8" /db_xref="InterPro:IPR002109" /db_xref="InterPro:IPR011915" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3D8" /protein_id="SIU01851.1" /translation="MITAALTIYTTSWCGYCLRLKTALTANRIAYDEVDIEHNRAAAE FVGSVNGGNRTVPTVKFADGSTLTNPSADEVKAKLVKIAG" CDS complement(3530558..3531499) /codon_start=1 /transl_table=11 /gene="nudC" /locus_tag="BQ2027_MB3224C" /product="PROBABLE NADH PYROPHOSPHATASE NUDC (NAD+ DIPHOSPHATASE) (NAD+ PYROPHOSPHATASE) (NADP PYROPHOSPHATASE)" /note="Mb3224c, nudC, len: 313 aa. Equivalent to Rv3199c, len: 313 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 313 aa overlap). Probable nudC, NADH pyrophosphatase (EC 3.6.1.22), similar in particular to Q9CXN4|4933433B15RIK from Mus musculus (Mouse) (356 aa), FASTA scores: opt: 493, E(): 7.4e-24, (39.65% identity in 232 aa overlap); Q9ABG1|CC0266 MUTT/NUDIX FAMILY PROTEIN from Caulobacter crescentus (313 aa), FASTA scores: opt: 479, E(): 5.1e-23, (38.3% identity in 222 aa overlap); O86062|NUDC_PSEAE|NUDC|PA1823 NADH PYROPHOSPHATASE from Pseudomonas aeruginosa (278 aa), FASTA scores: opt: 371,2 E(): 3e-16, (43.15% identity in 153 aa overlap); Q9RV62|NUDC_DEIRA|NUDC|DR1168 NADH PYROPHOSPHATASE from Deinococcus radiodurans (280 aa), FASTA scores: opt: 363, E(): 9.6e-16, (34.45% identity in 270 aa overlap); etc. Caution: equivalent to AAK47636 from Mycobacterium tuberculosis strain CDC1551 (386 aa) but shorter 72 aa. Contains PS00893 mutT domain signature. BELONGS TO THE NUDIX HYDROLASE FAMILY, NUDC SUBFAMILY. COFACTOR: REQUIRES DIVALENT IONS: MANGANESE OR MAGNESIUM. Protein product from Mb3224c detected using SWATH mass spectrometry. Mb3224c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TX14" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015375" /db_xref="InterPro:IPR015376" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="InterPro:IPR022925" /db_xref="UniProtKB/Swiss-Prot:Q7TX14" /protein_id="SIU01852.1" /translation="MTNVSGVDFQLRSVPLLSRVGADRADRLRTDMEAAAAGWPGAAL LRVDSRNRVLVANGRVLLGAAIELADKPPPEAVFLGRVEGGRHVWAVRAALQPIADPD IPAEAVDLRGLGRIMDDTSSQLVSSASALLNWHDNARFSALDGAPTKPARAGWSRVNP ITGHEEFPRIDPAVICLVHDGADRAVLARQAAWPERMFSLLAGFVEAGESFEVCVARE IREEIGLTVRDVRYLGSQPWPFPRSLMVGFHALGDPDEEFSFSDGEIAEAAWFTRDEV RAALAAGDWSSASESKLLLPGSISIARVIIESWAACE" CDS complement(3531558..3532625) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3225C" /product="POSSIBLE TRANSMEMBRANE CATION TRANSPORTER" /note="Mb3225c, -, len: 355 aa. Equivalent to Rv3200c, len: 355 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 355 aa overlap). Possible transmembrane cation transporter, similar to many transmembrane proteins and putative potassium channels e.g. Q9XA52|SCGD3.27C PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (365 aa), FASTA scores: opt: 1022, E(): 2.6e-53, (49.85% identity in 325 aa overlap); Q9RRZ3|DR2336 PUTATIVE POTASSIUM CHANNEL from Deinococcus radiodurans (320 aa), FASTA scores: opt: 436, E(): 1e-18, (30.9% identity in 304 aa overlap); O28600|AF1673 PUTATIVE POTASSIUM CHANNEL from Archaeoglobus fulgidus (314 aa), FASTA scores: opt: 363, E(): 2.1e-14, (27.2% identity in 309 aa overlap); Q57604|Y13B_METJAMJ0138.1|MJ0138.1 PUTATIVE POTASSIUM CHANNEL from Methanococcus jannaschii (333 aa), FASTA scores: opt: 356, E(): 5.7e-14, (26.0% identity in 281 aa overlap); P73132|SLL0993 POTASSIUM CHANNEL from Synechocystis sp. strain PCC 6803 (365 aa), FASTA scores: opt: 330, E(): 2.1e-12, (27.8% identity in 324 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3225c detected using shotgun mass spectrometry. Mb3225c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3F6" /db_xref="InterPro:IPR003148" /db_xref="InterPro:IPR013099" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F6" /protein_id="SIU01853.1" /translation="MAGSWRRLRGLDEKLTAQPGYALVGVLRIPQRRASPARVISRRV VVAVVALLLTAGIVYVDRDGYLDAQGDRLTFLDCLYYAAVTLSTTGYGDITPISEFAR AINIFVITPLRIAFLILLVGTTLEVLTETSRQAYKIQRWRSRVRNHTVVIGYGTKGKT AVAAMVSDELVPGEIVVVDTDSGVLERAAAAGLVTVHGDATKSDVLRLAGTQHASSII VATSRDDTAVLVTLTAREIAPKAKIVASIREAENQHLLRQSGADTVVVSSETAGRLLG IATTTPSVVEMIEDLLTPEAGLAVAEREVEQAEVGGSPRHLRDIVLGVVRDGQLLRIG APEVDAIEASDRLLYIRQVGR" CDS complement(3532687..3535992) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3226C" /product="PROBABLE ATP-DEPENDENT DNA HELICASE" /note="Mb3226c, -, len: 1101 aa. Equivalent to Rv3201c, len: 1101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1101 aa overlap). Probable ATP-dependent DNA helicase (EC 3.6.1.-), similar to others e.g. Q9FCK4|2SC3B6.08 from Streptomyces coelicolor (1222 aa), FASTA scores: opt: 1209, E(): 5.4e-63, (38.45% identity in 1199 aa overlap); P71561|PCRA_MYCTU|CRA|IVRD|Rv0949|MT0976|MTCY10D7.25c from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 403, E(): 6.5e-16, (28.15% identity in 717 aa overlap); Q9FCK5|2SC3B6.07 from Streptomyces coelicolor (1159 aa), FASTA scores: opt: 349, E(): 1.3e-12, (29.2% identity in 1144 aa overlap); Q9L3M1|UVRD from Prochlorococcus sp. (512 aa; fragment), FASTA scores: opt: 290, E(): 2e-09, (27.95% identity in 479 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="GOA:A0A1R3Y3E6" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR011335" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="InterPro:IPR038726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E6" /protein_id="SIU01854.1" /translation="MTQTAAPARYSPAELACALGLFPPTAEQAAVIAAPPGPLVVIAG AGAGKTETMAARVVWLVANGYAEPGQVLGLTFTRKAAGQLLRRVRSRLARLAGIGLGC GDPAACAPVVSTYHAFAGSLLRDYGLLLPLEPDTRLLSETELWQLAFDVVSGYDGVLC TDKSPAAVTSIVVRLWGQLGEHLVDTRALRDTHVELERLVHALPAGRYQRDRGPSQWL LRMLATQTQRAELVPLLDALGERMHAGKVMDFAMQMASAARLAATSPQVGQDLRRRYR VVLLDEYQDTGHAQRVVLSSLFGGGVDDGLALTAVGDPIQSIYGWRGASATNLPRFTT DFPLSDGTPAPVLELLTSWRNPPQALRVANGISAEARRRSVAVRALRPRPDAPPGAVR CALLPDVQAEREWIADHLRMRYQRAEADGVKPPTAAVLVRRNADAAAIADTLRARGIP AEVVGLAGLLSIPEVAEVVAMLRLVADPTAGAAAMRVLTGPRWRLGARDLAALWRRAL TLSGESPSTASPESIAMAASADADNPCLADAISDPGSAEGYSVAGYGRIGALAGELSA LRGRLGHSLPDLVAEVRRVLGVDCEVRASAPVSGGWAGPEHLDAFADVVAGYAERASA RSSEASVAGLLAYLDVAEVVENGLPPAELTVACDRVQVLTVHAAKGLEWQVVAVAHLS RGVFPSTVSRSSWLTDPAELPPLLRGDRASAGAHGIPVLDTSAVADRKQLSDKISEHR RLLDRRRVDEERRLLYVAVTRAEDTLLVSGHHWGPTGTKPRGPSEFLCELKDIIDRSA AAGDPCGVVEQWASAPAGDERNPLCDNAIEAVWPADPLAARRGDVERGAALVAAAMSA DLPGSTTDIDHPPRPGDAPWSTDVDALLAERAHAARGAPARGLPNHLSVSSLVELVGD PVGARQRLMCRLPKRPDPHAWLGDAFHAWVQQFYGAELLFDLGDLPGAADREVGDPEE LAALQRAFTASSWAARTPAAVEVPFEMPIGDTVVRGRIDAVFVDPDGGATVVDWKTGK PPHGPAAMRQAAVQLAVYRLAWAALRGCPTSSVRTAFYYVRSGITVVPDELPAPGELA MLLTDCAGRRSDT" CDS complement(3535989..3539156) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3227C" /product="POSSIBLE ATP-DEPENDENT DNA HELICASE" /note="Mb3227c, -, len: 1055 aa. Equivalent to Rv3202c, len: 1055 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1055 aa overlap). Possible ATP-dependent DNA helicase (EC 3.6.1.-), showing some similarity to UvrD proteins e.g. Q9FCK5|2SC3B6.07 PUTATIVE ATP-DEPENDENT DNA HELICASE from Streptomyces coelicolor (1159 aa), FASTA scores: opt: 666, E(): 1e-29, (34.5% identity in 1154 aa overlap); Q9L7T3|UVRD|PA5443 MISMATCH REPAIR PROTEIN MUTU (DNA HELICASE II) from Pseudomonas aeruginosa (728 aa), FASTA scores: opt: 239, E(): 7.3e-06, (23.8% identity in 677 aa overlap) (no similarity in C-terminal part for this one); etc. C-terminal region similar to Q9FDU2|ORF3 ORF3 PROTEIN (FRAGMENT) from Streptomyces griseus (551 aa), FASTA scores: opt: 800, E(): 1.7e-37, (36.2% identity in 525 aa overlap); and Q9ZG15 HYPOTHETICAL 35.5 KDA PROTEIN from Rhodococcus erythropolis (323 aa), FASTA scores: opt: 232, E(): 9.7e-06, (28.55% identity in 266 aa overlap). Mb3227c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y579" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR013986" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="InterPro:IPR038726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y579" /protein_id="SIU01855.1" /translation="MSHIWGVEAGAALAPGLRGPVLVLGGPGTGKSTLLVEAAVAHIG AGTDPESVLLLTGSGRMGMRARSALTTALLRSRTNGPCRAAIREPVVRTVHSYAYAVL RKAAQRAGDALPRLLTSAEQDAIIRELLAGDAEDGPAATTTWPAHLRPALTTAGFATE LRNLLARCAERGLDPLELQQLGRRRGRPEWIAAGQFAQRYEQVMLLRGAVGLAAPQAT APALSAAELVGAALEAFAVDPELLAAERARVRTLLVDDAQQLDPQAARLVRMLAAGTE LALIAGDPNQAVFGFRGGEPTGLLADDPPPAGGAPIPSVTLTVSHRCAPAVARAVTGI ARRLPGRSVGRRIEGTGTEVGSVTVRLAGSAHAEAAMIADALRRAHLIDGVPWSQMAV IVRSVPRAVRLPRALAAAGVPVAPPAVGGPLSAEPAVRALLTVLEATADGLDGDQALL LLTGPIGGVDPVSLRQLRRTLQRARPGQTSRKFGDLLVEVLGGDAPPSGPGSRALRRV RAVLTAAARCHRSGSLGGQDPRHTLWAAWQRSGLQRRWLAASEHGGAAAVQATRDLET VTALFDITDHYVSRTSGASLRGLVEHVTALQLPVVRPEPAAPTEQVMVLSAHAALGHE WDLVVIAGLQDGLWPNTVPRGGVLGTQRLLDELDGVTKDASMRAPLLAEERRLLVTAM GRARRRLLVTAVDSDAGGGGHEAVLPSAFFFEIAQWADGDGEPVAMQPVSAPRVLSAA AVVGRLRAVVCAPACAVDDADRDCAATQLARLAKAGVPGADPSEWHGLAPVSTSDPLC DSDDLVTLTPSTLQALNDCPLRWLAERHGGTNTRELPSAVGSVLHALFAEPGRSESQL LAELDRVWGHLPFGAQWYSANELARHRAMIQAFVQWRAQSRSELTEVGVEVDIDGALE DGSGQARKIRLRGRADRLERDPAGRLVIVDIKTGKTPVSKDDAQQHAQLAMYQLAVAE GLVRAGDEPGGARLVYVGKSGAAGVAERKQDPLTPAARDEWRNLVRQLAAATAGPQFI ARRNDGCTHCPLRPGCPAHVRGSAP" CDS 3539233..3539451 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3227A" /note="unnamed protein product; Mb3227A, len: 72 aa. Identified by de novo proteomics of Mycobacterium bovis AF2122/97 under exponential conditions. Mb3227A transcript and transcriptional start site identified in Mycobacterium bovis strain AF2122/97 grown under exponential conditions,Mb3227A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing" /db_xref="UniProtKB/TrEMBL:A0A1R3Y427" /protein_id="SIU01856.1" /translation="MATMAAVVGGGPQDEIPEADAVEQGRAVDFDDEAGLDTAYLSGG AGDRDASEADVVDQAFVVPVADDEEIDR" CDS 3539482..3540267 /codon_start=1 /transl_table=11 /gene="lipV" /locus_tag="BQ2027_MB3228" /product="POSSIBLE LIPASE LIPV" /note="Mb3228, lipV, len: 224 aa. Equivalent to Rv3203, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Possible lipV, hydrolase lipase (EC 3.1.-.-), showing some similarity to other lipases e.g. Q9JSN0|NMA2216 PUTATIVE HYDROLASE from Neisseria meningitidis (serogroup A) (312 aa), FASTA scores: opt: 192, E(): 0.00016, (45.2% identity in 73 aa overlap); Q9RK95|SCF1.09 PUTATIVE HYDROLASE from Streptomyces coelicolor (258 aa), FASTA scores: opt: 188, E(): 0.00024, (30.1% identity in 226 aa overlap); Q9KZC3|SC6F7.19c PUTATIVE LIPASE from Streptomyces coelicolor (269 aa), FASTA scores: opt: 179, E(): 0.00086, (36.35% identity in 121 aa overlap); etc. Equivalent to AAK47641 Hydrolase, alpha/beta hydrolase family from Mycobacterium tuberculosis strain CDC1551 (261 aa) but shorter 37 aa. Contains serine active site signature of lipases (PS00120). Protein product from Mb3228 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3228 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3L7" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L7" /protein_id="SIU01857.1" /translation="MIIDLHVQRYGPSGPARVLTIHGVTEHGRIWHRLAHHLPEIPIA APDLLGHGRSPWAAPWTIDANVSALAALLDNQGDGPVVVVGHSFGGAVAMHLAAARPD QVAALVLLDPAVALDGSRVREVVDAMLASPDYLDPAEARAEKATGAWADVDPPVLDAE LDEHLVALPNGRYGWRISLPAMVCYWSELARDIVLPPVGTATTLVRAVRASPAYVSDQ LLAALDKRLGADFELLDFDCGHMVPQAKPTEVAAVIRSRLGPR" CDS 3540270..3540575 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3229" /product="POSSIBLE DNA-METHYLTRANSFERASE (MODIFICATION METHYLASE)" /note="Mb3229, -, len: 101 aa. Equivalent to Rv3204, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 101 aa overlap). Possible DNA methyltransferase (EC 2.1.1.-), similar to many hypothetical bacteriel proteins and methyltransferases e.g. Q9KT40|VC1065 METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE-RELATED PROTEIN from Vibrio cholerae (100 aa), FASTA scores: opt: 170, E(): 2.8e-05, (34.35% identity in 99 aa overlap); Q9UTN9|SPAC1250.04c PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (108 aa), FASTA scores: opt: 161, E(): 0.00013, (36.65% identity in 101 aa overlap); Q9YDF4|APE0959 175 AA LONG HYPOTHETICAL METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE from Aeropyrum pernix (175 aa), FASTA scores: opt: 144, E(): 0.003, (37.95% identity in 87 aa overlap); Q50855 PUTATIVE METHYLGUANINE-DNA METHYLTRANSFERASE from Myxococcus xanthus (147 aa), FASTA scores: opt: 141, E(): 0.0041, (37.65% identity in 93 aa overlap); etc. Protein product from Mb3229 detected using SWATH mass spectrometry. Mb3229 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S1" /db_xref="InterPro:IPR014048" /db_xref="InterPro:IPR036217" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S1" /protein_id="SIU01858.1" /translation="MAPVTDEQVELVRSLVAAIPLGRVSTYGDIAALAGLSSPRIVGW IMRTDSSDLPWHRVIRASGRPAQHLATRQLELLRAEGVLSVDGRVALSEIRYEFPPG" CDS complement(3540582..3541460) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3230C" /product="conserved protein" /note="Mb3230c, -, len: 292 aa. Equivalent to Rv3205c, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 292 aa overlap). Hypothetical protein, highly similar to Q9CCG7|ML0818 HYPOTHETICAL PROTEIN from Mycobacterium leprae (297 aa), FASTA scores: opt: 1745, E(): 9.1e-98, (87.3% identity in 291 aa overlap). Protein product from Mb3230c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3230c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR013402" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3E8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01859.1" /translation="MGSTRLTGVNVEPPPEHVLVAFGLAGAQPILLGAGWEGGWRCGE VVLSMVADNARAAWSARVRETLFVDGVRLARPVRSTDGRYVVSGWRADTFVAGAPEPR HDEVVSAAVRLHEATGKLERPRFLTQGPAAPWAEIDVFVAADRAGWEERPLQSVPPGV PTAPPAADPQRSIDLINQLAGLRKPTKSPNQLVHGDLYGTVLFAGTAPPGITDITPYW RPASWAAGVAVVDALSWGAADDGLIERWNALPEWPQMLLRALMFRLAVYALHPRSTAE AFPGLAHTAALVRLVL" CDS complement(3541487..3542662) /codon_start=1 /transl_table=11 /gene="moeB1" /locus_tag="BQ2027_MB3231C" /standard_name="moeZ" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN MOEB1 (MPT-SYNTHASE SULFURYLASE) (MOLYBDOPTERIN SYNTHASE SULPHURYLASE)" /note="Mb3231c, moeB1, len: 391 aa. Equivalent to Rv3206c, len: 392 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 392 aa overlap). Probable moeB1, molybdopterin cofactor biosynthesis protein, equivalent to Q9CCG8|MOEZ|ML0817 PROTEIN PROBABLY INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS from Mycobacterium leprae (395 aa), FASTA scores: opt: 2285, E(): 3.3e-130, (86.45% identity in 391 aa overlap.) Very similar to members of the HESA/MOEB/THIF family e.g. Q9FCL0|2SC3B6.02 PUTATIVE SULFURYLASE from Streptomyces coelicolor (392 aa), FASTA scores: opt: 1776, E(): 1.4e-99, (65.3% identity in 395 aa overlap); Q9XC37|PDTORFF MOEB-LIKE PROTEIN (PUTATIVE SULFURYLASE) from Pseudomonas stutzeri (Pseudomonas perfectomarina) (391 aa), FASTA scores: opt: 1526, E(): 1.5e-84, (59.1% identity in 391 aa overlap); O54307|MPT|MOEB MPT-SYNTHASE SULFURYLASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (391 aa), FASTA scores: opt: 1309, E(): 1.8e-71, (52.95% identity in 387 aa overlap); P74344|MOEB|SLL1536 MOLYBDOPTERIN BIOSYNTHESIS MOEB PROTEIN from Synechocystis sp. strain PCC 6803 (392 aa), FASTA scores: opt: 1308, E(): 2e-71, (50.65% identity in 397 aa overlap); etc. Also highly similar to O05792|MOEB2|Rv3116|MTCY164.26 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis (389 aa), FASTA scores: opt: 1440, E(): 2.3e-79, (57.25% identity in 386 aa overlap). Has hydrophobic segment from ~45-71. BELONGS TO THE HesA /MoeB/ThiF FAMILY. Note that previously known as moeZ. Protein product from Mb3231c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3231c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3H4" /db_xref="InterPro:IPR000594" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR035985" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H4" /protein_id="SIU01860.1" /translation="MSTSLPPLVEPASALSREEVARYSRHLIIPDLGVDGQKRLKNAR VLVIGAGGLGAPTLLYLAAAGVGTIGIVDFDVVDESNLQRQVIHGVADVGRSKAQSAR DSIVAINPLIRVRLHELRLAPSNAVDLFKQYDLILDGTDNFATRYLVNDAAVLAGKPY VWGSIYRFEGQASVFWEDAPDGLGVNYRDLYPEPPPGMVPSCAEGGVLGIICASVASV MGTEAIKLITGIGETLLGRLLVYDALEMSYRTITIRKDPSTPKITELVDYEQFCGVVA DDAAQAAKGSTITPRELRDWLDSGRKLALIDVRDPVEWDIVHIDGAQLIPKSLINSGE GLAKLPQDRTAVLYCKTGVRSAEALAAVKKAGFSDAVHLQGGIVAWAKQMQPDMVMY" CDS complement(3542753..3543610) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3232C" /product="Metallopeptidase" /note="Mb3232c, -, len: 285 aa. Equivalent to Rv3207c, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 285 aa overlap). Hypothetical protein, highly similar but shorter (57 aa) to Q9CCG9|ML0816 HYPOTHETICAL PROTEIN from Mycobacterium leprae (341 aa), FASTA scores: opt: 1676, E(): 9.7e-96, (81.0% identity in 284 aa overlap). Also similar to C-terminus of Q9FBI6|SCP8.36 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (559 aa), FASTA scores: opt: 426, E(): 8.4e-19, (37.35% identity in 281 aa overlap); and similar to other hypothetical proteins (generally membrane proteins) e.g. Q9K456|SC2H12.28C PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (314 aa), FASTA scores: opt: 341, E(): 8.8e-14, (29.75% identity in 296 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142). Protein product from Mb3232c detected using SWATH mass spectrometry. Mb3232c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3F7" /db_xref="InterPro:IPR006026" /db_xref="InterPro:IPR022603" /db_xref="InterPro:IPR024079" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F7" /protein_id="SIU01861.1" /translation="MSTYGWRAYALPVLMVLTTVVVYQTVTGTSTPRPAAAQTVRDSP AIGVVGTAILDAPPRGLAVFDANLPAGTLPDGGPFTEAGDKTWRVVPGTTPQVGQGTV KVFRYTVEIENGLDPTMYGGDNAFAQMVDQTLTNPKGWTHNPQFAFVRIDSGKPNFRI SLVSPTTVRGGCGYEFRLETSCYNPSFGGMDRQSRVFINEARWVRGAVPFEGDVGSYR QYVINHEVGHAIGYLRHEPCDQQGGLAPVMMQQTFSTSNDDAAKFDPDFVKADGKTCR FNPWPYPIP" CDS 3543956..3544642 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3233" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3233, -, len: 228 aa. Equivalent to Rv3208, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). Probable transcriptional regulator tetR family, equivalent to Q9CCH0|ML0815 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (228 aa), FASTA scores: opt: 1248, E(): 1.4e-74, (82.4% identity in 227 aa overlap). Also highly similar to Q9FBI8|SCP8.33c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (213 aa), FASTA scores: opt: 629, E(): 4e-34, (45.8% identity in 203 aa overlap); Q9KIL9|F58R F58R (FRAGMENT) from Streptomyces coelicolor A3(2) (149 aa), FASTA scores: opt: 497, E(): 1.3e-25, (50.35% identity in 147 aa overlap); Q9K3T5|SCE66.08 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (225 aa), FASTA scores: opt: 344, E(): 1.8e-15, (31.15% identity in 212 aa overlap); Q9RYK4|DRA0308 TRANSCRIPTIONAL REGULATOR, TETR FAMILY from Deinococcus radiodurans (239 aa), FASTA scores: opt: 290, E(): 6.5e-12, (30.5% identity in 223 aa overlap); etc. And also similar to Mycobacterium tuberculosis proteins P96381|Rv1019|MTCY10G2.30c HYPOTHETICAL 21.7 KDA PROTEIN (197 aa), FASTA scores: opt: 356, E(): 2.7e-16, (34.4% identity in 189 aa overlap); MTV034_4; MTY07A7A_3; MTV032_1; MTCY07A7_12; etc. Contains probable helix-turn-helix motif at aa 60-81 (Score 1517, +4.35 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3233 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3233 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3F4" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01862.1" /translation="MSDLAKTAQRRALRSSGSARPDEDVPAPNRRGNRLPRDERRGQL LVVASDVFVDRGYHAAGMDEIADRAGVSKPVLYQHFSSKLELYLAVLHRHVENLVSGV HQALSTTTDNRQRLHVAVQAFFDFIEHDSQGYRLIFENDFVTEPEVAAQVRVATESCI DAVFALISADSGLDPHRARMIAVGLVGMSVDCARYWLDADKPISKSDAVEGTVQFAWG GLSHVPLTRS" CDS complement(3544629..3544901) /codon_start=1 /transl_table=11 /gene="TB9.4" /locus_tag="BQ2027_MB3234C" /product="Putative ATP-binding protein" /note="Mb3234c, TB9.4, len: 90 aa. Equivalent to Rv3208A, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). TB9.4, conserved hypothetical protein (see citations below), equivalent to Q9CCH1|ML0814 HYPOTHETICAL PROTEIN from Mycobacterium leprae (82 aa), FASTA scores: opt: 411, E(): 1.8e-22, (81.0% identity in 79 aa overlap). Also similar, but shorter in N-terminus, to Q9FBI9|SCP8.32c PUTATIVE ATP-BINDING PROTEIN from Streptomyces coelicolor (94 aa), FASTA scores: opt: 246, E(): 8.1e-11, (53.4% identity in 73 aa overlap); Q9DGP6 (alias Q9DGP4) GLUTAMATE DECARBOXYLASE 67 KDA ISOFORM (FRAGMENT) from Alepocephalus bairdii (182 aa), FASTA scores: opt: 100, E(): 2.6, (35.3% identity in 85 aa overlap). Corresponds to Statens Serum Institute antigen, CYP10 TB9.4. Has N-terminal sequence, VEVKIGITDSPRELV. Protein product from Mb3234c detected using shotgun mass spectrometry. Mb3234c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021456" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01863.1" /translation="MEVKIGITDSPRELVFSSAQTPSEVEELVSNALRDDSGLLTLTD ERGRRFLIHTARIAYVEIGVADARRVGFGVGVDAAAGSAGKVATSG" CDS 3545226..3545786 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3235" /product="Proline rich protein similar to MmpS3" /note="Mb3235, -, len: 186 aa. Equivalent to Rv3209, len: 186 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 186 aa overlap). Conserved hypothetical thr-, pro-rich protein, equivalent (but shorter 36 aa in N-terminus) to Q9CCH2|ML0813 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (195 aa), FASTA scores: opt: 508, E(): 1.4e-15, (58.4% identity in 185 aa overlap). Also some similarity with Q10390|MMS3_MYCTU|MMPS3|Rv2198c|MT2254|MTCY190.09c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from M. tuberculosis (299 aa), FASTA scores: opt: 339, E(): 3.7e-08, (35.0% identity in 180 aa overlap); and Q9CCE9|MMPS3|ML0877 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (293 aa), FASTA scores: opt: 272, E(): 2.8e-05, (36.4% identity in 173 aa overlap). Mb3235 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR008693" /db_xref="InterPro:IPR038468" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G4" /protein_id="SIU01864.1" /translation="MALGAVATAVIINSGDSTSTKAIVGAPAPRTVISTSPRPTAPTS TSPHPSPSTLRPQLPPETVTTVAPPGTGPTTVPTRTPTAAPPQTAVPPPAPLNPRTVV YRVTGTKQLFDLVNVVYTDARGFPVTDFNVSLPWTKMVVLNPGVQTESVVATSLYSRL NCSIVNTGAQTVVASTNNAIIATCTR" CDS complement(3545796..3546491) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3236C" /product="conserved protein" /note="Mb3236c, -, len: 231 aa. Equivalent to Rv3210c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Conserved hypothetical protein, similar (but N-terminus shorter) to Q9FBJ1|SCP8.30 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (260 aa), FASTA scores: opt: 599, E(): 1.1e-30, (42.5% identity in 233 aa overlap); and some similarity to Q9RRV1|DR2384 PHENYLACETIC ACID DEGRADATION PROTEIN PAAC from Deinococcus radiodurans (263 aa), FASTA scores: opt: 129, E(): 0.43, (27.9% identity in 172 aa overlap); and Q9F621 FLGK PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (472 aa). Protein product from Mb3236c detected using shotgun mass spectrometry. Mb3236c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012347" /db_xref="UniProtKB/TrEMBL:A0A1R3Y586" /protein_id="SIU01865.1" /translation="MPSPSSADQVADSPRPRLPADHPGVNELFALLAYGEVAAFYRLT DEARMAPDLRGRISMASMAAAEMGHYELLRNALERRGVDVVSAMSKYTSALENYHRLT TPSTWLEALVKTYVADALAADLYLEIADGLPDEVADVVRAALSETGHSQFVVAEVRAA VTASGKQRSRLALWSRRLLGEAITQAQLVLADHDELVDLVVSGSGGLSQLGAFFDRLQ QTHDQRMRELGLS" CDS 3546750..3548333 /codon_start=1 /transl_table=11 /gene="rhlE" /locus_tag="BQ2027_MB3237" /product="PROBABLE ATP-DEPENDENT RNA HELICASE RHLE" /note="Mb3237, rhlE, len: 527 aa. Equivalent to Rv3211, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 527 aa overlap). Probable rhlE, ATP-dependent RNA helicase, equivalent (but shorter 22 aa) to Q9CCH3|RHLE|ML0811 PUTATIVE ATP-DEPENDENT RNA HELICASE from Mycobacterium leprae (544 aa), FASTA scores: opt: 2497, E(): 8.7e-131, (74.75% identity in 531 aa overlap). Also highly similar to other RNA helicases e.g. Q9FBJ2|SCP8.29c from Streptomyces coelicolor (879 aa), FASTA scores: opt: 1458, E(): 3.6e-73, (52.5% identity in 522 aa overlap); Q9DF36 from Xenopus laevis (African clawed frog) (800 aa), FASTA scores: opt: 792, E(): 2.3e-36, (37.15% identity in 385 aa overlap); Q99Z38|DEAD|SPY1415 from Streptococcus pyogenes (759 aa), FASTA scores: opt: 779, E(): 1.1e-35, (37.1% identity in 380 aa overlap); P33906|DEAD|CSDA from Klebsiella pneumoniae (642 aa), FASTA scores: opt: 768, E(): 4e-35, (43.4% identity in 387 aa overlap); etc. Contains ATP/GTP-binding site motif A (PS00017) and DEAD-box subfamily ATP-dependent helicases signature (PS00039). SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY AND SIMILAR TO HELICASE C-TERMINAL DOMAIN. Protein product from Mb3237 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3237 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y436" /db_xref="InterPro:IPR000629" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR014014" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y436" /protein_id="SIU01866.1" /translation="MTAVKHTTESTFAKLGARDEIVRALGEEGIKRPFAIQELTLPLA LDGEDVIGQARTGMGKTFAFGVPLLQRITSGDGTRPLTGAPRALVVVPTRELCLQVTD DLATAGKYLTAGPDTDDAAAVRRRLSVVSIYGGRPYEPQIEALRAGADVVVGTPGRLL DLCQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQIPADRQSMLFSATMPDPII TLARTFMVRPTHIRAEAPHSSAVHDATEQFVYRAHALDKVELVSRVLQARDRGATMIF TRTKRTAQKVADELTERGFAVGAVHGDLGQLAREKALKAFRTGGIDVLVATDVAARGI DIDDVTHVINYQCPEDEKMYVHRIGRTGRAGRTGVAVTLVDWDELPRWSMIDQALGLG SPDPAETYSNSPHLYAELAIPATAGGTVGPARKSQGRRRDTDCDGQKTAQHARNTPRR RRTRGGKPVTGHPGTNPISSPIVGGDATSEPGSGTASDSGSDVVSGSRSGNGEAARRR RRRRRRPTHAQDGFAARAN" CDS 3548346..3549569 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3238" /product="conserved alanine valine rich protein" /note="Mb3238, -, len: 407 aa. Equivalent to Rv3212, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 407 aa overlap). Hypothetical ala-, val-rich protein, equivalent to Q9CCH4|ML0810 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (407 aa), FASTA scores: opt: 2158, E(): 5.3e-119, (79.85% identity in 407 aa overlap). Weak similarity to several eukaryotic transcription factors e.g. P08393|ICP0_HSV11|ICP0|IE110 TRANS-ACTING TRANSCRIPTIONAL PROTEIN from Herpes simplex virus (type 1 / strain 17) (775 aa), FASTA scores: opt: 115, E(): 2, (26.9% identity in 334 aa overlap). Protein product from Mb3238 detected using SWATH mass spectrometry. Mb3238 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M0" /db_xref="InterPro:IPR011047" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M0" /protein_id="SIU01867.1" /translation="MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAA VAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWS YARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDG TTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLE ACTNQADLRLVLLRPGKEDDEPIQRIVPEPGARPGSGARVLVVSQNNTAVYLPARSGA QPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTI AAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSG SRVIEQRGDTLVALG" CDS complement(3549645..3550445) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3239C" /product="POSSIBLE SOJ/PARA-RELATED PROTEIN" /note="Mb3239c, -, len: 266 aa. Equivalent to Rv3213c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (99.624% identity in 266 aa overlap). Possible soj/parA-related protein, very similar in particular to Soj/ParA proteins (and relatives) from Bacillus subtilis that inhibit the initiation of sporulation by preventing phosphorylation of Spo0A (see first citation below for more information) e.g. Q9S228|SCI51.12c from Streptomyces coelicolor (340 aa), FASTA scores: opt: 746, E(): 1.6e-40, (48.2% identity in 249 aa overlap); Q9HT11|SOJ|PA5563 from Pseudomonas aeruginosa (262 aa), FASTA scores: opt: 649, E(): 2.1e-34, (42.2% identity in 256 aa overlap); Q9PB62|XF2282 from Xylella fastidiosa (264 aa), FASTA scores: opt: 624, E(): 8.3e-33, (42.25% identity in 251 aa overlap); Q9K5N0|SOJ_BACHD|SOJ|BH4058 from Bacillus halodurans (253 aa), FASTA scores: opt: 621, E(): 1.2e-32, (41.55% identity in 248 aa overlap); P37522|SOJ_BACSU (253 aa), FASTA scores: opt: 620, E(): 1.4e-32, (41.65% identity in 245; etc. Also similar to various mycobacterial proteins: U00021_10 from Mycobacterium leprae, MTCI125_29 from Mycobacterium tuberculosis, MLCB1351_6 from Mycobacterium leprae, MTV028_9c|Rv3918c|PARA PROBABLE CHROMOSOME PARTITIONING PROTEIN from Mycobacterium tuberculosis, MSGDNAB_18 from Mycobacterium leprae. SEEMS TO BELONG TO THE PARA FAMILY. Protein product from Mb3239c detected using shotgun mass spectrometry. Mb3239c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025669" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01868.1" /translation="MTDTRVLAVANQKGGVAKTTTVASLGAAMVEKGRRVLLVDLDPQ GCLTFSLGQDPDKLPVSVHEVLLGEVEPNAVLVTTMEGMTLLPANIDLAGAEAMLLMR AGREYALKRALAKFSDRFDVVIIDCPPSLGVLTLNGLTAADEAIVPLQCEMLAHRGVG QFLRTVADVQQITNPNLRLLGALPTLYDSRTTHTRDVLLDVADRYDLQVLAPPIPRTV RFAEASASGSSVMAGRKNKGAVAYRELAQALLKHWKTGRPLPTFTVDL" CDS 3550599..3551210 /codon_start=1 /transl_table=11 /gene="gpm2" /locus_tag="BQ2027_MB3240" /product="POSSIBLE PHOSPHOGLYCERATE MUTASE GPM2 (PHOSPHOGLYCEROMUTASE) (PGAM) (BPG-DEPENDENT PGAM)" /note="Mb3240, gpm2, len: 203 aa. Equivalent to Rv3214, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Possible gpm2, phosphoglycerate mutase (EC 5.4.2.1), similar to many mutases especially phosphoglycerate mutases e.g. Q9F3H5|2SCC13.14c PUTATIVE MUTASE from Streptomyces coelicolor (198 aa), FASTA scores: opt: 487, E(): 4.4e-25, (42.25% identity in 194 aa overlap); BAB49378|MLL2186 PROBABLE PHOSPHOGLYCERATE MUTASE from Rhizobium loti (Mesorhizobium loti) (193 aa), FASTA scores: opt: 423, E(): 7e-21, (41.2% identity in 182 aa overlap); Q9RKV8|SC9G1.08c PUTATIVE PHOSPHATASE from Streptomyces coelicolor (199 aa), FASTA scores: opt: 419, E(): 1.3e-20, (41.1% identity in 185 aa overlap); Q9RDL0|SCC123.14c PUTATIVE PHOSPHOGLYCERATE MUTASE from Streptomyces coelicolor (223 aa), FASTA scores: opt: 240, E(): 8.8e-09, (36.9% identity in 168 aa overlap); Q9X194|TM1374 PHOSPHOGLYCERATE MUTASE from Thermotoga maritima (201 aa), FASTA scores: opt: 218, E(): 2.3e-07, (33.15% identity in 202 aa overlap); etc. But N-terminus also similar to Q9CCH5|ENTC|ML0808 PUTATIVE ISOCHORISMATE SYNTHASE from Mycobacterium leprae (577 aa), FASTA scores: opt: 346, E(): 2.1e-15, (55.05% identity in 109 aa overlap). N-terminus shows also some similarity with other M. tuberculosis proteins e.g. MTCY427.09c; MTCY20G9.15; MTCY428.28. Equivalent to AAK47652 from Mycobacterium tuberculosis strain CDC1551 (228 aa) but shorter 25 aa. Note that previously known as entD. Protein product from Mb3240 detected using shotgun mass spectrometry. Mb3240 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3F8" /protein_id="SIU01869.1" /translation="MGVRNHRLLLLRHGETAWSTLGRHTGGTEVELTDTGRTQAELAG QLLGELELDDPIVICSPRRRTLDTAKLAGLTVNEVTGLLAEWDYGSYEGLTTPQIRES EPDWLVWTHGCPAGESVAQVNDRADSAVALALEHMSSRDVLFVSHGHFSRAVITRWVQ LPLAEGSRFAMPTASIGICGFEHGVRQLAVLGLTGHPQPIAAG" CDS 3551207..3552325 /codon_start=1 /transl_table=11 /gene="entC" /locus_tag="BQ2027_MB3241" /product="PROBABLE ISOCHORISMATE SYNTHASE ENTC (ISOCHORISMATE HYDROXYMUTASE) (ENTEROCHELIN BIOSYNTHESIS)" /note="Mb3241, entC, len: 372 aa. Equivalent to Rv3215, len: 372 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 372 aa overlap). Probable entC, isochorismate synthase (EC 5.4.99.6), equivalent to Q9CCH5|ENTC|ML0808 PUTATIVE ISOCHORISMATE SYNTHASE from Mycobacterium leprae (577 aa), FASTA scores: opt: 1817, E(): 5.5e-105, (73.5% identity in 366 aa overlap). Also similar to others e.g. Q9F639|MXCD PROTEIN INVOLVED IN MYXOCHELIN-TYPE IRON CHELATOR BIOSYNTHESIS (see citation below) from Stigmatella aurantiaca (408 aa), FASTA scores: opt: 893, E(): 6.2e-48, (41.6% identity in 382 aa overlap); P45744|DHBC_BACSU ISOCHORISMATE SYNTHASE from Bacillus subtilis (398 aa), FASTA scores: opt: 883, E(): 2.5e-47, (40.45% identity in 393 aa overlap); Q9KI93|CSBC ISOCHORISMATE SYNTHASE (FRAGMENT) from Azotobacter vinelandii (361 aa), FASTA scores: opt: 794, E(): 7.6e-42, (45.65% identity in 298 aa overlap); and the two Escherichia coli proteins AAG54928|ENTC (alias BAB34055|ECS0632) ISOCHORISMATE HYDROXYMUTASE 2 from Escherichia coli strain O157:H7 (391 aa), FASTA scores: opt: 744, E(): 1e-38, (38.8% identity in 340 aa overlap); P10377|ENTC|B0593 ISOCHORISMATE SYNTHASE from Escherichia coli strain K12 (391 aa), FASTA scores: opt: 744, E(): 1e-38, (38.8% identity in 340 aa overlap); etc. Stronger similarity to Escherichia coli entC. Also similar to MTCY253.35. Protein product from Mb3241 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3241 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3I2" /db_xref="InterPro:IPR004561" /db_xref="InterPro:IPR005801" /db_xref="InterPro:IPR015890" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I2" /protein_id="SIU01870.1" /translation="MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRS GTAPILLGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYLTR IGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTAYGYLVDLTSA GNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADPKLDAANAAALASSAKNRH EHQLVVDTMRVALEPLCEDLTIPAQPQLNRTAAVWHLCTAITGRLRNISTTAIDLALA LHPTPAVGGVPTKAATELIAELEGDRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRR AALAHAGGGIVAESDPDDELEETTTKFATILTALGVEQ" CDS 3552473..3552805 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3242" /product="gcn5-related n-acetyltransferase, pseudogene" /note="Mb3242, -, len: 110 aa. Equivalent to Rv3216, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Possible acetyltransferase (2.3.1.-), similar but shorter to many e.g. Q9AB32|CC0402 ACETYLTRANSFERASE (GNAT FAMILY) from Caulobacter crescentus (159 aa), FASTA scores: opt: 325, E(): 3.8e-17, (45.65% identity in 103 aa overlap); P79081|ATS1 PUTATIVE ACETYLTRANSFERASE ATS1 from Schizosaccharomyces pombe (Fission yeast) (168 aa), FASTA scores: opt: 313, E(): 3.1e-16, (47.6% identity in 105 aa overlap); Q9I640|PA0478 PROBABLE N-ACETYLTRANSFERASE from Pseudomonas aeruginosa (158 aa), FASTA scores: opt: 308, E(): 6.9e-16, (50.0% identity in 98 aa overlap); Q9KHE3 PUTATIVE ACETYLTRANSFERASE from Anabaena sp. strain PCC 7120 (164 aa), FASTA scores: opt: 269, E(): 5.4e-13, (41.75% identity in 103 aa overlap); etc. Also some similarity to diamine acetyltransferases (EC 2.3.1.57) e.g. Q28999|ATDA_PIG|SAT from Sus scrofa (Pig) (171 aa), FASTA scores: opt: 152, E(): 0.00025, (23.15% identity in 108 aa overlap). Mb3242 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /protein_id="SIU01871.1" /translation="MRGHVAEVNGGVAAMALWFLNFSTWDGVAGIYVEDLFVWPRFRR RGLARGLLSTLARECVDNRYTRLAWSVLNWNSDAIALYDRIGGQPQHEWTIYRLSGPR LAALAAPR" CDS complement(3552757..3553188) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3243C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3243c, -, len: 143 aa. Equivalent to Rv3217c, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Probable conserved integral membrane protein, equivalent (highly similar but shorter 30 aa) to Q9CCH6|ML0806 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (173 aa). Also similar to others e.g. Q9F3L9|2SC7G11.04 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (152 aa), FASTA scores: opt: 177, E(): 0.00024, (33.8% identity in 136 aa overlap). And shows similarity to O34238|MVIN|VC0680 VIRULENCE FACTOR MVIN HOMOLOG from Vibrio (525 aa), FASTA scores: opt: 126, E(): 0.97, (30.9% identity in 68 aa overlap). First GTG taken. Mb3243c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3G1" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G1" /protein_id="SIU01872.1" /translation="MPVRAPAAVRGAGLIVAVQGGAALVVAAALLVRGLAGADQHIVN GLGTAGWFVLVGGAVLAAGCRLAVGKLWGRGLAVFAQLLLLPVAWYLIVGSHQPAIGI PVGIIALGVLVLLFSPPSIRWAAGRDQRGAASAANRGPDSR" CDS 3553477..3554442 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3244" /product="Transcription regulator [contains diacylglycerol kinase catalytic domain]" /note="Mb3244, -, len: 321 aa. Equivalent to Rv3218, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 321 aa overlap). Conserved hypothetical protein, similar to several hypothetical bacterial proteins e.g. Q9F3M0|2SC7G11.03c from Streptomyces coelicolor (322 aa), FASTA scores: opt: 694, E(): 4.2e-35, (39.95% identity in 328 aa overlap); Q9A0J4|SPY0752 from Streptomyces pyogenes (340 aa), FASTA scores: opt: 187, E(): 0.00033, (30.5% identity in 141 aa overlap); O31502|YERQ from Bacillus subtilis (303 aa), FASTA scores: opt: 184, E(): 0.00045, (34.15% identity in 126 aa overlap); etc. Protein product from Mb3244 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3244 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3H1" /db_xref="InterPro:IPR001206" /db_xref="InterPro:IPR016064" /db_xref="InterPro:IPR017438" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H1" /protein_id="SIU01873.1" /translation="MRAVLIVNPTATATTPAGRDLLAHALESRLQLTVEHTNHRGHGT ELGQAAVADGVDLVVVHGGDGTVSAVVNGMLGRPGTTPVRPVPAVAVVPGGSANVLAR ALGISADPIAATNQLIQLLDDYGRHQQWRRIGLIDCGERWAVFNAGMGVDAEVVAAVE AERDKGGKVTAWRYIRAAVRAVLACTRREPALTLQLPNRDPITGVHFVFVSNSSPWTY ANNRPVWTNPDCRFESGLGVFATTSMKVVPTLRVVRQMFAKQPKFEFNHVINNDDVAC LRVTSMGPPIASQFDGDYLGVRETMTFRAVPDALAVVAPPARKRI" CDS 3554722..3554976 /codon_start=1 /transl_table=11 /gene="whiB1" /locus_tag="BQ2027_MB3245" /product="transcriptional regulatory protein whib-like whib1. contains [4fe-4s]2+ cluster." /note="Mb3245, whiB1, len: 84 aa. Equivalent to Rv3219, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Probable whiB1, WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor. Equivalent to Q9CCH7|WHIB1|ML0804 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (84 aa), FASTA scores: opt: 580, E(): 3.5e-35, (95.25% identity in 84 aa overlap). Highly similar to several e.g. Q9X952|WBLE DEVELOPMENTAL REGULATORY PROTEIN WHIB-PARALOG from Streptomyces coelicolor (85 aa), FASTA scores: opt: 477, E(): 9.2e-28, (75.3% identity in 81 aa overlap); Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa), FASTA scores: opt: 383, E(): 6.1e-21, (60.75% identity in 79 aa overlap); Q9K4K8|SC5F8.16c from Streptomyces coelicolor (83 aa), FASTA scores: opt: 346, E(): 2.5e-18, (54.75% identity in 84 aa overlap); etc. Protein product from Mb3245 detected using shotgun mass spectrometry. Mb3245 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3G7" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3G7" /protein_id="SIU01874.1" /translation="MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTT ECLSWALNTGQDSGVWGGMSEDERRALKRRNARTKARTGV" CDS complement(3555038..3556543) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3246C" /product="PROBABLE TWO COMPONENT SENSOR KINASE" /note="Mb3246c, -, len: 501 aa. Equivalent to Rv3220c, len: 501 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 501 aa overlap). Probable sensor (probably histidine kinase), equivalent to Q9CCH8|ML0803 PUTATIVE TWO-COMPONENT SYSTEM SENSOR KINASE from Mycobacterium leprae (500 aa). Similar to others e.g. Q9F3M1|2SC7G11.01 PUTATIVE HISTIDINE KINASE (FRAGMENT) from Streptomyces coelicolor (372 aa), FASTA scores: opt: 1038, E(): 7.4e-56, (48.95% identity in 380 aa overlap); Q9A3K5|CC3198 SENSOR HISTIDINE KINASE from Caulobacter crescentus (327 aa), FASTA scores: opt: 311, E(): 1.2e-11, (33.35% identity in 201 aa overlap) (similarity only in C-terminal part for this one); Q9A2T2|CC3474 PUTATIVE SENSOR HISTIDINE KINASE from Caulobacter crescentus (547 aa); etc. C-terminal half shows similarity to many sensor proteins, that respond to various stimuli from Methanobacterium thermoautotrophicum e.g. O26568|MTH468 SENSORY TRANSDUCTION HISTIDINE KINASE (554 aa), FASTA scores: opt: 425, E(): 2.1e-18, (34.0% identity in 244 aa overlap); O26546|MTH446 SENSORY TRANSDUCTION REGULATORY PROTEIN (583 aa), FASTA scores: opt: 380, E(): 1.2e-15, (37.15% identity in 202 aa overlap); O26913|MTH823 SENSORY TRANSDUCTION REGULATORY PROTEIN (677 aa), FASTA scores: opt: 375, E(): 2.7e-15, (35.4% identity in 195 aa overlap); etc. SEEMS SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES. Protein product from Mb3246c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3246c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y591" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR011495" /db_xref="InterPro:IPR022066" /db_xref="InterPro:IPR035965" /db_xref="InterPro:IPR036890" /db_xref="InterPro:IPR038424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y591" /protein_id="SIU01875.1" /translation="MSTLGDLLAEHTVLPGSAVDHLHAVVGEWQLLADLSFADYLMWV RRNDGVLVCVAQCRPNTGPTVVHTDAVGTVVAANSMPLVAATFSGGVPGREGAVGQQN SCQHDGHSVEVSPVRFGDQVVAVLTRHQPELAARRRSGHLETAYRLCATDLLRMLAEG TFPDAGDVAMSRSSPRAGDGFIRLDVDGVVSYASPNALSAYHRMGLTTELEGVNLIDA TRPLISDPFEAHEVDEHVQDLLAGDGKGMRMEVDAGGATVLLRTLPLVVAGRNVGAAI LIRDVTEVKRRDRALISKDATIREIHHRVKNNLQTVAALLRLQARRTSNAEGREALIE SVRRVSSIALVHDALSMSVDEQVNLDEVIDRILPIMNDVASVDRPIRINRVGDLGVLD SDRATALIMVITELVQNAIEHAFDPAAAEGSVTIRAERSARWLDVVVHDDGLGLPQGF SLEKSDSLGLQIVRTLVSAELDGSLGMRDARERGTDVVLRVPVGRRGRLML" CDS complement(3556560..3556775) /codon_start=1 /transl_table=11 /gene="TB7.3" /locus_tag="BQ2027_MB3247C" /product="BIOTINYLATED PROTEIN TB7.3" /note="Mb3247c, TB7.3, len: 71 aa. Equivalent to Rv3221c, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Biotinylated protein TB7.3 (see citation below), equivalent (appears to have one additional residue) to Q9CCH9|ML0802|BTB7_MYCLE BIOTINYLATED PROTEIN TB7.3 HOMOLOG from Mycobacterium leprae (70 aa), FASTA scores: opt: 367, E(): 4e-18, (90.0% identity in 70 aa overlap); Q9XCD6|BTB7_MYCSM BIOTINYLATED PROTEIN TB7.3 HOMOLOG from Mycobacterium smegmatis (70 aa), FASTA scores: opt: 341, E(): 2.1e-16, (84.05% identity in 69 aa overlap). Similar to C-terminal part of various proteins e.g. Q9HPP8|ACC|VNG1532G BIOTIN CARBOXYLASE from Halobacterium sp. strain NRC-1 (610 aa), FASTA scores: opt: 212, E(): 4e-07, (50.0% identity in 68 aa overlap); Q58628|PYCB_METJA|MJ1231 PYRUVATE CARBOXYLASE SUBUNIT B from Methanococcus jannaschii (567 aa), FASTA scores: opt: 192, E(): 7.8e-06, (44.8% identity in 58 aa overlap); Q9ZAA7|GCDC GLUTACONYL-COA DECARBOXYLASE GAMMA SUBUNIT from Acidaminococcus fermentans (145 aa), FASTA scores: opt: 184, E(): 8.9e-06, (39.4% identity in 66 aa overlap); etc. Mb3247c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR011053" /db_xref="UniProtKB/Swiss-Prot:P0A511" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01876.1" /translation="MAEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESMKMEIPVLA EAAGTVSKVAVSVGDVIQAGDLIAVIS" CDS complement(3557060..3557365) /codon_start=1 /transl_table=11 /gene="rsha" /locus_tag="BQ2027_MB3248C" /product="anti-sigma factor rsha" /note="Mb3248c, -, len: 101 aa. Equivalent to Rv3221A, len: 101 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 101 aa overlap). Possible anti-sigma factor, similar to Q9XCD7|AAD41811.1 unknown protein from Mycobacterium smegmatis, linked to sigma factor sigH (see first citation below) (101 aa), FASTA scores: opt: 422, E(): 3.4e-22, (64.9% identity in 94 aa overlap); and to Q9RL96|RsrA anti-sigma factor from Streptomyces coelicolor (see second citation) (105 aa), FASTA scores: opt: 163, E(): 0.00016, (32.05% identity in 78 aa overlap). Protein product from Mb3248c detected using shotgun mass spectrometry. Mb3248c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014295" /db_xref="InterPro:IPR024020" /db_xref="InterPro:IPR027383" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M6" /protein_id="SIU01877.1" /translation="MSENCGPTDAHADHDDSHGGMGCAEVIAEVWTLLDGECTPETRE RLRRHLEACPGCLRHYGLEERIKALIGTKCRGDRAPEGLRERLRLEIRRTTIIRGGP" CDS complement(3557362..3557913) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3249C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3249c, -, len: 183 aa. Equivalent to Rv3222c, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Hypothetical protein, with some similarity to Q9SZD2|F19B15.50|AT4G29020 GLYCINE-RICH PROTEIN LIKE from Arabidopsis thaliana (Mouse-ear cress) (158 aa), FASTA scores: opt: 131, E(): 0.77, (33.35% identity in 126 aa overlap); Q9S222|SCI51.18 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (548 aa), FASTA scores: opt: 133, E(): 1.6, (36.25% identity in 149 aa overlap); etc. Also some similarity to other hypothetical Mycobacterium tuberculosis proteins e.g. O06292|Rv0341|MTCY13E10.01 (479 aa), FASTA scores: opt: 141, E(): 0.5, (31.2% identity in 170 aa overlap); AAK45760|MT1497.1 PE_PGRS FAMILY PROTEIN from strain CDC1551 (1408 aa), FASTA scores: opt: 137, E(): 2, (31.75% identity in 148 aa overlap); etc. Mb3249c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U1" /protein_id="SIU01878.1" /translation="MSSPVSSRRLANLVKESLQGSVLGGVVSDAVLPAVSDDVKPGAG EDAYRVPVVVAAGSGAVVQVGGLEVGSAAVAGEVADTVAELFVCRPTEPDVGDFVGLA GGAGDAGQAGQQFGLGVGVRGESFGARRRLALSTVGASGATAGLRKTHDGHHGCQARG ALTQRRLYIGNPSEITDTRMVHQ" CDS complement(3557910..3558560) /codon_start=1 /transl_table=11 /gene="sigH" /locus_tag="BQ2027_MB3250C" /standard_name="rpoE" /product="ALTERNATIVE RNA POLYMERASE SIGMA-E FACTOR (SIGMA-24) SIGH (RPOE)" /note="Mb3250c, sigH, len: 216 aa. Equivalent to Rv3223c, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). sigH (alternate gene name: rpoE), alternative RNA polymerase sigma factor (see citations below), similar to many e.g. Q9XCD8|SIGH from Mycobacterium smegmatis (215 aa), FASTA scores: opt: 1187, E(): 8.1e-69, (87.75% identity in 212 aa overlap); O87834|SIGR from Streptomyces coelicolor (227 aa), FASTA scores: opt: 913, E(): 2.6e-51, (68.8% identity in 202 aa overlap); O68520|RPOE1 from Myxococcus xanthus (213 aa), FASTA scores: opt: 452, E(): 6.7e-22, (42.8% identity in 187 aa overlap); Q06198|RPSH_PSEAE|ALGU|ALGT|PA0762 from Pseudomonas aeruginosa (193 aa), FASTA scores: opt: 301, E(): 2.7e-12, (29.9% identity in 194 aa overlap); etc. Equivalent to AAK47662 RNA polymerase sigma-70 factor from Mycobacterium tuberculosis strain CDC1551 (284 aa), but shorter 68 aa. Has sigma-70 factors ECF subfamily signature (PS01063). So BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. Start chosen on basis of similarity, other potential starts upstream. Protein product from Mb3250c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3250c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P66808" /db_xref="InterPro:IPR000838" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR014293" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/Swiss-Prot:P66808" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01879.1" /translation="MADIDGVTGSAGLQPGPSEETDEELTARFERDAIPLLDQLYGGA LRMTRNPADAEDLLQETMVKAYAGFRSFRHGTNLKAWLYRILTNTYINSYRKKQRQPA EYPTEQITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKEALQALPEEFRMAVYYAD VEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADVARDRGFARGEQAHEGVSS" CDS 3558860..3559708 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3251" /product="possible iron-regulated short-chain dehydrogenase/reductase" /note="Mb3251, -, len: 282 aa. Equivalent to Rv3224, len: 282 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 282 aa overlap). Probable oxidoreductase, possible short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to BAB49551|MLL2413 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (288 aa), FASTA scores: opt: 1053, E(): 6.4e-59, (57.95% identity in 276 aa overlap); Q9AB34|CC0400 SHORT CHAIN DEHYDROGENASE FAMILY PROTEIN from Caulobacter crescentus (285 aa), FASTA scores: opt: 1051, E(): 8.5e-59, (55.9% identity in 281 aa overlap); and Q9VB10|CG5590 HYPOTHETICAL PROTEIN (SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY) from Drosophila melanogaster (Fruit fly) (412 aa), FASTA scores: opt: 966, E(): 2.5e-53, (52.15% identity in 278 aa overlap). Similar to various proteins (principaly oxidoreductases) e.g. Q18639|C45B11.3 HYPOTHETICAL PROTEIN (SIMILAR TO THE SDR FAMILY) from Caenorhabditis elegans (293 aa), FASTA scores: opt: 921, E(): 1.2e-50, (51.3% identity in 271 aa overlap); Q9HZV5|PA2892 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (274 aa), FASTA scores: opt: 847, E(): 5.1e-46, (49.25% identity in 274 aa overlap); Q9I6V0|PA0182 PROBABLE SHORT-CHAIN DEHYDROGENASE (SIMILAR TO THE SDR FAMILY) from Pseudomonas aeruginosa (250 aa), FASTA scores: opt: 333, E(): 8.3e-14, (29.8% identity in 245 aa overlap); Q9HY98|PA3511 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (253 aa), FASTA scores: opt: 330, E(): 1.3e-13, (31.2% identity in 250 aa overlap); etc. Related proteins in Mycobacterium tuberculosis include MTCY02B10.14, MTCY369.14, and MTCY09F9.36. Has ATP/GTP-binding site motif A, (PS00017) near C-terminus. MAY BE BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3251 detected using shotgun mass spectrometry. Mb3251 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01880.1" /translation="MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPK LPGTVFTAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAINL GSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPILLEKKWLRPT AYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAAVQNLLGGDEAMARSRKPE VYADAAYVIVNKPATEYTGKTLLCEDVLVESGVTDLSVYDCVPGATLGVDLWVEDANP PGYLPA" CDS 3559722..3559832 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3252" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3252, -, len: 62 aa. Equivalent to Rv3224A, len: 62 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 62 aa overlap). Conserved hypothetical protein (possibly gene fragment), overlaps Rv3224. Similar to N-terminus of ML0799|AL583919_131 conserved hypothetical protein from Mycobacterium leprae (135 aa), FASTA scores: opt: 104, E(): 0.78, (59.37% identity in 32 aa overlap). Note that upstream ORF Rv3224B is similar to C-terminus of ML0799. There appears to be no frameshift as sequence is identical in strain CDC1551 and in Mycobacterium bovis. Mb3252 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H9" /protein_id="SIU01881.1" /translation="MILELPDERAVAIVPVPSKLSLKAAGGPRGAQSGHG" CDS 3559810..3560028 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3253" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3253, -, len: 72 aa. Equivalent to Rv3224B, len: 72 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 72 aa overlap). Conserved hypothetical protein (possibly gene fragment), similar to C-terminal part of ML0799|AL583919_131 conserved hypothetical protein from Mycobacterium leprae (135 aa), FASTA scores: opt: 229, E(): 2e-09, (60.00% identity in 70 aa overlap). Note that downstream ORF Rv3224A is similar to N-terminus of ML0799. There appears to be no frameshift as sequence is identical in strain CDC1551 and in Mycobacterium bovis. Mb3253 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3H7" /db_xref="InterPro:IPR007214" /db_xref="InterPro:IPR036754" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H7" /protein_id="SIU01882.1" /translation="MPKAAMAKPAAAEQATGYVVGGISPFGQRKRLRTVVDVSALSWD RVLRCRQTALGRHGGPAGPDHLDQRDHR" CDS complement(3560025..3561449) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3254C" /product="gcn5-related n-acetyltransferase, phosphorylase" /note="Mb3254c, -, len: 474 aa. Equivalent to Rv3225c, len: 474 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 474 aa overlap). Possible transferase (EC 2.-.-.-). C-terminal part shows some similarity to various bacterial proteins e.g. BAB49093|MLL1809 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (298 aa), FASTA scores: opt: 557, E(): 2.8e-26, (34.55% identity in 295 aa overlap); P14509|KKA8_ECOLI|APHA AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Escherichia coli (271 aa), FASTA scores: opt: 194, E(): 0.00018, (27.75% identity in 227 aa overlap); Q53826|CPH CAPREOMYCIN PHOSPHOTRANSFERASE from Streptomyces capreolus (281 aa), FASTA scores: opt: 178, E(): 0.0017, (30.5% identity in 269 aa overlap); Q9CDM4|YWIA UNKNOWN PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (213 aa), FASTA scores: opt: 167, E(): 0.0061, (2705% identity in 149 aa overlap); Q9X843|SC9B1.24 PUTATIVE TRANSFERASE (FRAGMENT) from Streptomyces coelicolor (317 aa), FASTA scores: opt: 165, E(): 0.011, (26.05% identity in 280 aa overlap); etc. Start uncertain. Protein product from Mb3254c detected using SWATH mass spectrometry. Mb3254c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3H8" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR002575" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H8" /protein_id="SIU01883.1" /translation="MRFAKLSDGLSDGIVTLSPLCLDDVDAHLAGGDERLVRWLSGMP STRASVEAYIRHCREQWVTGGPLRSFGIRTVAETIVGTIDLRFDGEGLASGQVNVAYG LYPSWRGRGLATRAVDLVCQYAAEHGATEAVIKVEPENSASARVALRAGFAFVRRICE QDGTVFDRYERVLRAKMHADEVDIDEDLVRRLLRAQFPQWADLPIAPVRSAGTDNAMY RLGEDLAVRIPRIGWAIESLRTEQQWLPRIAAHLGVASPVPVGLGSPAEGFGWPWSVC RWVAGENPSAAEFVEPNRAVEDLADFITTLRATDPMGGPPAKRGAPLGEQDAEVRAAL AALDGIIDVHAATAAWESALRVPPYAGPPMWFHGDLSRFNILTAQGRLTGVIDFGLMG VGDPSVDLIIAWNLLSAPARAQFRVAVGAADDDWMRGRGRALAIALIALPYYQDTNPP LAASARYAIGEVLADFRYGARPGC" CDS complement(3561573..3562331) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3255C" /product="Phage protein" /note="Mb3255c, -, len: 252 aa. Equivalent to Rv3226c, len: 252 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 252 aa overlap). Conserved hypothetical protein, similar to various hypothetical bacterial proteins e.g. Q9CCI2|ML0793 PUTATIVE BACTERIOPHAGE PROTEIN from Mycobacterium leprae (252 aa), FASTA scores: opt: 1183, E(): 3.8e-68, (70.65% identity in 252 aa overlap); BAB54183|MLR7795 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (369 aa), FASTA scores: opt: 417, E(): 2.9e-19, (33.75% identity in 252 aa overlap); O64131 YOQW PROTEIN from Bacteriophage SPBc2 (224 aa), FASTA scores: opt: 413, E(): 3.4e-19, (38.5% identity in 244 aa overlap); O31916 YOQW PROTEIN from Bacillus subtilis (224 aa), FASTA scores: opt: 413, E(): 3.4e-19, (38.5% identity in 244 aa overlap); O34906 YOAM PROTEIN from Bacillus subtilis (227 aa), FASTA scores: opt: 401, E(): 2e-18, (37.7% identity in 244 aa overlap); Q9K4A5|SC7E4.11 HYPOTHETICAL 30.8 KDA PROTEIN from Streptomyces coelicolor (271 aa), FASTA scores: opt: 383, E(): 3.3e-17, (39.6% identity in 283 aa overlap); etc. Protein product from Mb3255c detected using SWATH mass spectrometry. Mb3255c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3H5" /db_xref="InterPro:IPR003738" /db_xref="InterPro:IPR036590" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3H5" /protein_id="SIU01884.1" /translation="MCGRFAVTTDPAQLAEKITAIDEATGCGGGKTSYNVAPTDTIAT VVSRHSEPDDEPTRRVRLMRWGLIPSWIKAGPGGAPDAKGPPLINARADKVATSPAFR SAVRSKRCLVPMDGWYEWRVDPDATPGRPNAKTPFFLHRHDGALLFTAGLWSVWKSYR SAPPLLSCTVITTDAVGELAEIHDRMPLLLAEEDWDDWLNPDAPPDPELLARPPDVRD IALRQVSTLVNNVRNNGPELLEPARSQPEQIQLL" CDS 3562386..3563738 /codon_start=1 /transl_table=11 /gene="aroA" /locus_tag="BQ2027_MB3256" /product="3-PHOSPHOSHIKIMATE 1-CARBOXYVINYLTRANSFERASE AROA (5-ENOLPYRUVYLSHIKIMATE-3-PHOSPHATE SYNTHASE) (EPSP SYNTHASE) (EPSPS)" /note="Mb3256, aroA, len: 450 aa. Equivalent to Rv3227, len: 450 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 450 aa overlap). aroA, 3-phosphoshikimate 1-carboxyvinyl transferase (EC 2.5.1.19) (see citation below), equivalent (but C-terminus longer) to Q9CCI3|AROA|ML0792 PUTATIVE 3-PHOSPHOSHIKIMATE 1-CARBOXYVINYL TRANSFERASE from Mycobacterium leprae (430 aa), FASTA scores: opt: 1466, E(): 1.4e-78, (55.05% identity in 427 aa overlap). Contains PS00885 EPSP synthase signature 2. BELONGS TO THE EPSP SYNTHASE FAMILY. Protein product from Mb3256 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3256 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TWY4" /db_xref="InterPro:IPR001986" /db_xref="InterPro:IPR006264" /db_xref="InterPro:IPR013792" /db_xref="InterPro:IPR023193" /db_xref="InterPro:IPR036968" /db_xref="UniProtKB/Swiss-Prot:Q7TWY4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01885.1" /translation="MKTWPAPTAPTPVRATVTVPGSKSQTNRALVLAALAAAQGRGAS TISGALRSRDTELMLDALQTLGLRVDGVGSELTVSGRIEPGPGARVDCGLAGTVLRFV PPLAALGSVPVTFDGDQQARGRPIAPLLDALRELGVAVDGTGLPFRVHGNGSLAGGTV AIDASASSQFVSGLLLSAASFTDGLTVQHTGSSLPSAPHIAMTAAMLRQAGVDIDDST PNRWQVRPGPVAARRWDIEPDLTNAVAFLSAAVVSGGTVRITGWPRVSVQPADHILAI LRQLNAVVIHADSSLEVRGPTGYDGFDVDLRAVGELTPSVAALAALASPGSVSRLSGI AHLRGHETDRLAALSTEINRLGGTCRETPDGLVITATPLRPGIWRAYADHRMAMAGAI IGLRVAGVEVDDIAATTKTLPEFPRLWAEMVGPGQGWGYPQPRSGQRARRATGQGSGG " CDS 3563735..3564727 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3257" /product="Ribosome small subunit biogenesis RbfA-release protein RsgA" /note="Mb3257, -, len: 330 aa. Equivalent to Rv3228, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Conserved hypothetical protein, equivalent to Q9CCI4|ML0791 HYPOTHETICAL PROTEIN from Mycobacterium leprae (327 aa), FASTA scores: opt: 1828, E(): 1e-98, (84.0% identity in 331 aa overlap). Also similar to several hypothetical bacterial proteins e.g. Q9K4A8|SC7E4.08c from Streptomyces coelicolor (337 aa), FASTA scores: opt: 1051, E(): 1e-53, (52.65% identity in 338 aa overlap); Q9HUL3|PA4952 from Pseudomonas aeruginosa (339 aa), FASTA scores: opt: 392, E(): 1.4e-15, (34.85% identity in 281 aa overlap); Q9PFV1|XF0556 from Xylella fastidiosa (341 aa), FASTA scores: opt: 367, E(): 4e-14, (36.85% identity in 247 aa overlap); P45339|YJEQ_HAEIN|HI1714 from Haemophilus influenzae (346 aa), FASTA scores: opt: 355, E(): 2e-13, (31.65% identity in 281 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb3257 detected using SWATH mass spectrometry. Mb3257 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y447" /db_xref="InterPro:IPR004881" /db_xref="InterPro:IPR010914" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR030378" /db_xref="UniProtKB/TrEMBL:A0A1R3Y447" /protein_id="SIU01886.1" /translation="MRPGDYDESDVKVRSGRSSRPRTKTRPEHADAEAAMVVSVDRGR WGCVLGGRPDRRITAMRARELGRTPIVVGDDVDVVGDLSGRPDTLARIVRRAPRRTVL RRTADDTDPTERVVVANADQLLIVVALADPPPRTGLVDRALIAAYAGGLTPILCLTKT DLAPAEPFGKQFADLELTVTAAGVDDPLLAVADLLAGKITVLLGHSGVGKSTLVNRLV PEADRAVGEVTEIGRGRHTSTRSVALPLGDTLSGSGWVIDTPGIRSFGLAHIQPDNVL LAFSDLAEATRECPRGCGHMGPPADPECALDTLSGPAARRAAAARRLLAVLSQT" CDS complement(3564760..3566043) /codon_start=1 /transl_table=11 /gene="desa3" /locus_tag="BQ2027_MB3258C" /product="POSSIBLE LINOLEOYL-COA DESATURASE (DELTA(6)-DESATURASE)" /note="Mb3258c, -, len: 427 aa. Equivalent to Rv3229c, len: 427 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 427 aa overlap). Possible linoleoyl-CoA desaturase (EC 1.14.19.3), showing similarity with desaturases and other proteins e.g. Q08871|DES6|SLL0262 LINOLEOYL-COA DESATURASE from Synechocystis sp. strain PCC 6803 (359 aa), FASTA scores: opt: 319, E(): 4e-13, (25.1% identity in 295 aa overlap); Q54795|DESD DELTA 6 DESATURASE from Spirulina platensis (368 aa), FASTA scores: opt: 268, E(): 7.7e-10, (25.0% identity in 300 aa overlap); Q9ZTU8|S276 PROTEIN WITH SIMILARITY TO CYTOCHROME B5 DOMAIN from Triticum aestivum (Wheat) (469 aa), FASTA scores: opt: 240, E(): 5.9e-08, (27.05% identity in 266 aa overlap); etc. Note that previously known as desA3. Protein product from Mb3258c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3258c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3P3" /db_xref="InterPro:IPR005804" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P3" /protein_id="SIU01887.1" /translation="MAITDVDVFAHLTDADIENLAAELDAIRRDVEESRGERDARYIR RTIAAQRALEVSGRLLLAGSSRRLAWWTGALTLGVAKIIENMEIGHNVMHGQWDWMND PEIHSSTWEWDMSGSSKHWRYTHNFVHHKYTNILGMDDDVGYGMLRVTRDQRWKRYNI FNVVWNTILAIGFEWGVALQHLEIGKIFKGRADREAAKTRLREFSAKAGRQVFKDYVA FPALTSLSPGATYRSTLTANVVANVIRNVWSNAVIFCGHFPDGAEKFTKTDMIGEPKG QWYLRQMLGSANFNAGPALRFMSGNLCHQIEHHLYPDLPSNRLHEISVRVREVCDRYD LPYTTGSFLVQYGKTWRTLAKLSLPDKYLRDNADDAPETRSERMFAGLGPGFAGADPV TGRRRGLKTAIAAVRGRRRSKRMAKSVTEPDDLAA" CDS complement(3566121..3567263) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3259C" /product="HYPOTHETICAL OXIDOREDUCTASE" /note="Mb3259c, -, len: 380 aa. Equivalent to Rv3230c, len: 380 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 380 aa overlap). Putative oxidoreductase (EC 1.-.-.-), with some similarity to various proteins, especially reductases e.g. Q9HUS4|PA4889 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (366 aa), FASTA scores: opt: 516, E(): 1.8e-24, (33.8% identity in 367 aa overlap); P95533|TDNB ELECTRON TRANSFER PROTEIN from Pseudomonas putida (337 aa), FASTA scores: opt: 380, E(): 4e-16, (30.7% identity in 277 aa overlap); BAB34381|ECS0958 NADH OXIDOREDUCTASE FOR THE HCP from Escherichia coli strain O157:H7 (322 aa), FASTA scores: opt: 369, E(): 1.8e-15, (28.65% identity in 328 aa overlap); Q44253|ATDA5 ANILINE DIOXYGENASE REDUCTASE COMPONENT from Acinetobacter sp. (336 aa), FASTA scores: opt: 305, E(): 1.6e-11, (27.4% identity in 303 aa overlap); etc. Protein product from Mb3259c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3259c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3U8" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001433" /db_xref="InterPro:IPR001709" /db_xref="InterPro:IPR008333" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR036010" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U8" /protein_id="SIU01888.1" /translation="MSKKHTTLNASIIDTRRPTVAGADRHPGWHALRKIAARITTPLL PDDYLHLANPLWSARELRGRILGVRRETEDSATLFIKPGWGFSFDYQPGQYIGIGLLV DGCWRWRSYSLTSSPAASGSARMVTVTVKAMPEGFLSTHLVAGVKPGTIVRLAAPQGN FVLPDPAPPLILFLTAGSGITPVMSMLRTLVRRNQITDVVHLHSAPTAADVMFGAELA ALAADHPGYRLSVRETRAQGRLDLTRIGQQVPDWRERQTWACGPEGVLNQADKVWSSA GASDRLHLERFAVSKTAPAGAGGTVTFARSGKSVAADAATSLMDAGEGAGVQLPFGCR MGICQSCVVDLVEGHVRDLRTGQRHEPGTRVQTCVSAASGDCVLDI" CDS complement(3567373..3567882) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3260C" /product="link to acyltransferase activity" /note="Mb3260c, -, len: 169 aa. Equivalent to Rv3231c, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 169 aa overlap). Hypothetical protein, only similar to Q9KYX9|SCE33.03c HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 415, E(): 6.6e-19, (49.1% identity in 171 aa overlap). Protein product from Mb3260c detected using SWATH mass spectrometry. Mb3260c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J0" /protein_id="SIU01889.1" /translation="MTQVYIPATLAMLQRLVADGALWPVNGTAFAVTPTLRESYAEGD DEELAEVALREAALASLRLLAADIGATADALPPRRAVLAAEVDDATYRPDLDDAVVRL AGPITIDQVVAAYVDNAGAEPAVMAAIAVIDAADLGDEDAELVVGDAQDHDLAWYANQ ELPFLLDLL" CDS complement(3567879..3568766) /codon_start=1 /transl_table=11 /gene="ppk2" /locus_tag="BQ2027_MB3261C" /product="polyphosphate kinase ppk2 (polyphosphoric acid kinase)" /note="Mb3261c, pvdS, len: 295 aa. Equivalent to Rv3232c, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Possible pvdS, an alternative RNA polymerase sigma factor, highly similar (but N-terminus longer 25-50 residues approximatively) to Q9RIZ9|SCJ1.15 PUTATIVE REGULATOR from Streptomyces coelicolor (267 aa), FASTA scores: opt: 1189, E(): 1.4e-70, (65.65% identity in 262 aa overlap); Q9KU02|VC0728 HYPOTHETICAL PROTEIN from Vibrio cholerae (258 aa), FASTA scores: opt: 1074, E(): 4.5e-63, (62.6% identity in 254 aa overlap); P72119|PVDS PAO SUBSTRAIN OT684 PYOVERDINE GENE TRANSCRIPTIONAL REGULATOR PVDS (FRAGMENT) from Pseudomonas aeruginosa (see citation below) (237 aa), FASTA scores: opt: 988, E(): 1.8e-57, (60.8% identity in 227 aa overlap). Also highly similar to Q9I154|PA2428 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (304 aa), FASTA scores: opt: 1057, E(): 6.8e-62, (60.7% identity in 252 aa overlap); Q9I6Z1|PA0141 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 990, E(): 1.6e-57, (54.6% identity in 249 aa overlap); and other hypothetical bacterial proteins. Could be a member of a subfamily of RNA polymerase sigma factors which direct the synthesis of extracellular products by bacteria. Start uncertain. Protein product from Mb3261c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3261c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3K6" /db_xref="InterPro:IPR016898" /db_xref="InterPro:IPR022486" /db_xref="InterPro:IPR022488" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K6" /protein_id="SIU01890.1" /translation="MDIPSVDVSTATNDGASSRAKGHRSAAPGRRKISDAVYQAELFR LQTEFVKLQEWARHSGARLVVIFEGRDGAGKGGAIKRITEYLNPRVARIAALPAPTDR ERGQWYYQRYIAHLPAKGEIVLFDRSWYNRAGVEKVMGFCTPQEYVLFLRQTPIFEQM LIDDGILLRKYWFSVSDAEQLRRFKARRNDPVRQWKLSPMDLESVYRWEDYSRAKDEM MVHTDTPVSPWYVVESDIKKHARLNMMAHLLSTIDYADVEKPKVKLPPRPLVSGNYRR PPRELSTYVDDYVATLIAR" CDS complement(3568790..3570199) /codon_start=1 /transl_table=11 /gene="tgs3" /locus_tag="BQ2027_MB3262C" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb3262c, -, len: 469 aa. Equivalent to Rv3234c and Rv3233c, len: 271 aa and 196 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 266 aa overlap and 100.0% identity in 196 aa overlap). Rv3234c: Hypothetical protein, similar to C-terminus of Mycobacterium tuberculosis hypothetical proteins e.g. P71694|Rv1425|MTCY21B4.43|MTCY493.29c (459 aa), FASTA scores: opt: 498, E(): 5.2e-24, (36.8% identity in 261 aa overlap); MTCY03A2.28; MTCY31.23; MTCY493_29; MTCY28_26; MTV013_8; MTY13E12_33; etc. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 309, E(): 4.3e-12, (33.35% identity in 189 aa overlap). Rv3233c: Hypothetical protein, similar to C-terminus of Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 308, E(): 1.2e-12, (32.0% identity in 200 aa overlap); and several hypothetical Mycobacterium tuberculosis proteins e.g. O06343|YY80_MYCTU|Rv3480c|MTCY13E12.33c (497 aa), FASTA scores: opt: 248, E(): 9.8e-09, (27.5% identity in 200 aa overlap); MTCY28_26; MTCY493_29; MTCY31_25; MTCY31_25. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, Rv3234c and Rv3233c exist as 2 genes. In Mycobacterium bovis, a single base insertion (*-g) leads to a single product that is more similar to Rv3234c. Protein product from Mb3262c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3262c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3I4" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I4" /protein_id="SIU01891.1" /translation="MVTRLSASDASFYQLENTATPMYVGLLLILRRPRAGLSYEALLE TVEQRLPQIPRYRQKVQEVKLGLARPVWIDDRDFDITYHVRRSALPSPGSDEQLHELI ARLAARPLDKSRPLWEMYLVEGLEKNRIALYTKSHQALINGVTALAIGHVIADRTRRP PAFPEDIWVPERDPGTTRLLLRAVGDWLVRPGAQLQAVGSAVAGLVTNSGQLVETGRK VLDIARTVARGTAPSSPLNATVSRNRRFTVARASLDDYRTVRARYDCDVHDVVLTVIA GALGNWLMSRGEAVAPTATVRAMAPLSVYADDQLDSTGPGQAISQVTPFLVDLPVGEG NAVVRLSQIAHATESNPTAASLVDARTIVTLSGLAPATLHAMGVRVATSFSARLFNLL ITNAPGTQSQMYIAGTKLLETYSVPPLLHNQALAISVTSYNGMLYFGINADRDAMSDV DLLPGLLSQALDELLEASR" CDS 3570310..3570951 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3263" /product="link to acyltransferase activity" /note="Mb3263, -, len: 213 aa. Equivalent to Rv3235, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Hypothetical unknown ala-, arg-, pro-rich protein. Mb3263 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I3" /protein_id="SIU01892.1" /translation="MMASNQTAAQHSSATLQQAPRSIDDAGGCPLTISPIANSPGDTF AVTPVVEYEPPPRNIPPCGQSSHAARRPHTPQLARRQPIRPSGRAPAAVTSTAKSPRL RQAGTFADAALRRVLEVIDRRRPVGQLRPLLAPGLVDSVLAVSRTAAGHQQGAAMLRR IRLTPAGPDTADTAAEVFGTYSRGDRIHAIACRVEQRPAGNETRWLMVALHIG" CDS complement(3570969..3572126) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3264C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN" /note="Mb3264c, -, len: 385 aa. Equivalent to Rv3236c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 385 aa overlap). Probable conserved integral membrane transport protein, possibly cation (Na/H) transporter, equivalent to Q9CCI5|ML0782 putative transmembrane transport protein from Mycobacterium leprae (385 aa), FASTA scores: opt: 1975, E(): 2.4e-108, (81.55% identity in 385 aa overlap). Highly similar to others e.g. O69958|SC4H2.03c putative transmembrane transport protein from Streptomyces coelicolor (411 aa), FASTA scores: opt: 1226, E(): 1.6e-64, (53.5% identity in 372 aa overlap); Q9XAKO|SC66T3.13c putative transmembrane transport protein from Streptomyces coelicolor (403 aa), FASTA scores: opt: 1198, E(): 6.8e-63, (53.25% identity in 370 aa overlap); Q9RV80|DR1149 putative Na+/H+ antiporter from Deinococcus radiodurans (383 aa), FASTA scores: opt: 1069, E(): 2.3e-55, (47.35% identity in 376 aa overlap); Q9L191|SC10G8.11 putative transmembrane transport protein from Streptomyces coelicolor (446 aa), FASTA scores: opt: 695, E(): 1.9e-33, (38.05% identity in 384 aa overlap); Q9RRW8|DR2367 putative glutathione-regulated potassium-efflux system protein KEFB from Deinococcus radiodurans (575 aa), FASTA scores: opt: 414, E(): 6.2e-17, (30.25% identity in 380 aa overlap); etc. SEEMS TO BELONG TO THE CPA2 FAMILY. Note that previously known as kefB. Mb3264c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3I8" /db_xref="InterPro:IPR006153" /db_xref="InterPro:IPR038770" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3I8" /protein_id="SIU01893.1" /translation="MEVSRALLFELGVLLAVLAVLGAVARRFALSPIPVYLLAGLSLG NGGILGVAAAGEFIATGAPIGVVLLLLALGLEFSATEFASSLRHHLPSAGVDIVLNAT PGAVAGWLLGLDGVAILGLAGVTYISSSGVIARLLEDLRRLGNRETPAVLSVLVLEDF AMAAYLPLFAVLATDGSWLEAVVGMTVAIAALLGAFAASYRWGHHVGRLVTHPDSEQL LLRVLGITLIVAAVAESLHASAAVGAFLVGLTLTGETADRARMVLTPLRDLFATIFFL GIGLSVDPGKLVSMLPVALALAAVTAATKVATGMFAARREGVARRGQLRAGTALVARG EFSLIIIGLAGASIPGVAALATAYVFVMAIVGPILARYTGGGLPAAAVASN" CDS complement(3572131..3572613) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3265C" /product="Potassium channel TrkA, possible KefG analog required for KefB activity" /note="Mb3265c, -, len: 160 aa. Equivalent to Rv3237c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Conserved hypothetical protein, equivalent to Q9CCI6|ML0781 HYPOTHETICAL PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 828, E(): 1.5e-45, (80.6% identity in 160 aa overlap); and similar to other hypothetical bacterial proteins and more weakly to putative potassium channels e.g. Q9RV81|DR1148 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (175 aa), FASTA scores: opt: 420, E(): 9.5e-20, (37.95% identity in 158 aa overlap); O69959|SC4H2.04c HYPOTHETICAL 17.1 KDA PROTEIN from Streptomyces coelicolor (161 aa), FASTA scores: opt: 315, E(): 3.8e-13, (40.0% identity in 150 aa overlap); Q9HNH3|PCHB|VNG2104G POTASSIUM CHANNEL HOMOLOG from Halobacterium sp. strain NRC-1 (418 aa), FASTA scores: opt: 158, E(): 0.007, (31.45% identity in 124 aa overlap); Q58752|YD57_METJA|MJ1357 PUTATIVE POTASSIUM CHANNEL PROTEIN from Methanococcus jannaschii (343 aa), FASTA scores: opt: 143, E(): 0.053, (33.8% identity in 68 aa overlap). Protein product from Mb3265c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3265c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3J1" /db_xref="InterPro:IPR006037" /db_xref="InterPro:IPR026278" /db_xref="InterPro:IPR036721" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J1" /protein_id="SIU01894.1" /translation="MDVKEVLLPGVGLRYEFTSYRGDRIGIVARRSGGFDVVLYGRDD PDEARPVLRLTDEEAEAVAQILGAPRIAERFTELTREVPGLKAGQIHIRAGSLFVDRP LGDTRARTRTGASIVAIVRDEDVLASPGPTDVLRAGDVLIVIGTEDGIAGVEQIVEKG " CDS complement(3572674..3573408) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3266C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3266c, -, len: 244 aa. Equivalent to Rv3238c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Probable conserved integral membrane protein, similar to several hypothetical proteins and transmembrane proteins e.g. Q9UN92|NRM29 MULTISPANNING NUCLEAR ENVELOPE MEMBRANE PROTEIN NURIM (FRAGMENT) from Homo sapiens (Human) (261 aa), FASTA scores: opt: 281, E(): 3.3e-11, (30.7% identity in 189 aa overlap); Q9VEG9|CG7655 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (253 aa), FASTA scores: opt: 242, E(): 1.1e-08, (27.7% identity in 242 aa overlap); BAB48937|MLR1600 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (222 aa), FASTA scores: opt: 137, E(): 0.066, (28.1% identity in 185 aa overlap); BAB57936|SAV1774 AESENICAL PUMP MEMBRANE PROTEIN HOMOLOG from Staphylococcus aureus subsp. aureus Mu50 (430 aa), FASTA scores: opt: 125, E(): 0.68, (25.7% identity in 144 aa overlap); etc. Protein product from Mb3266c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3266c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B6" /db_xref="InterPro:IPR009915" /db_xref="InterPro:IPR033580" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B6" /protein_id="SIU01895.1" /translation="MKRYLTIIYGAASYLVFLVAFGYAIGFVGDVVVPRTVDHAIAAP IGQAVVVNLVLLGVFAVQHSVMARQGFKRWWTRFVPPSIERSTYVLLASVALLLLYWQ WRTMPAVIWDVRQPAGRVALWALFWLGWATVLTSTFMINHFELFGLRQVYLAWRGKPY TEIGFQAHLLYRWVRHPIMLGFVVAFWATPMMTAGHLLFAIGATGYILVALQFEERDL LAALGDQYRDYRREVSMLLPWPHRHT" CDS complement(3573467..3576613) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3267C" /product="PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN" /note="Mb3267c, -, len: 1048 aa. Equivalent to Rv3239c, len: 1048 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1048 aa overlap). Probable conserved transmembrane protein, organised in two domains. Domain comprising first ~500 aa residues is similar to various antibiotic resistance and efflux proteins and contains sugar transport proteins signature 1 (PS00216); e.g. Q9RL22|SC5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 905, E(): 3.1e-41, (36.95% identity in 482 aa overlap); and O68912|FRNF PUTATIVE ANTIBIOTIC ANTIPORTER from Streptomyces roseofulvus (517 aa), FASTA scores: opt: 866, E(): 4.1e-39, (37.1% identity in 512 aa overlap). Second part, corresponding to last 550 aa residues, is very similar to Q50733|Rv2565|MTCY9C4.03c hypothetical 62.1 kd protein from Mycobacterium tuberculosis (583 aa), FASTA scores: E(): 2.1e-28, (36.5% identity in 572 aa overlap). Also equivalent to Rv3728|MTV025.076 PUTATIVE TWO-DOMAIN MEMBRANE PROTEIN (SIMILAR TO SUGAR TRANSPORTER FAMILY) from Mycobacterium tuberculosis (1065 aa), FASTA scores: opt: 4328, E(): 0, (64.15% identity in 1046 aa overlap); and similar to other Mycobacterium tuberculosis proteins: MTCY3G12.01, E(): 6.3e-32; MTCY98.02c, E(): 6.3e-32; MTCY9C4.03c, E(): 1.5e-26; MTCY369.27c, E(): 2.5e-26. Equivalent to AAK47679 Drug transporter from Mycobacterium tuberculosis strain CDC1551 (1065 aa) but shorter 20 aa. Contains cyclic nucleotide-binding domain signature 2 (PS00889). Probably member of major facilitator superfamily (MFS). Mb3267c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y472" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR001423" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR004638" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR018488" /db_xref="InterPro:IPR018490" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y472" /protein_id="SIU01896.1" /translation="MHISLHGGKGFANLTRRRRPSSASVLLVAGFGAFLAFLDSTIVN IAFPDIQRSFPSYDIGSLSWILNGYNIVFAAFMVAAGRLADLLGRRRTFLSGVLVFTI ASGLCAVAGSVEQLVAFRVLQGIGAAILVPASLALVVEGFDAARRAHAIGLWGAAAAI AAGLGPPIGGLLVEWAGWRWVLLVNVPLGIVAAIATKRMLVESRASGRRRMPDLRGAL LLAVTLGLVTLGLVKGPDWGWLSVATVGSFLASVLTSVGFVHSSRSHPAPLVEPALLR SRSFVAGNLLTLVAAAGFYCYGLTHVLYLNYVWHYSLLKAGFAIAPAAVVAAVVAAAL GRVAGRHGHRVIVLVGALVWAGSLVWYLQRVGSEPDFLRVWLPGQLLQGIGVGATLPV LSSAALAEVAKGGSYATSSAVVSTTRQLGAVLGVAVMVILIGKPEHGTAEEALRRGWA MAAICFIAVAVAAAVLGRTNRNPVQMPAPEPAIAPRLEPPIPQPAAAPIEHWAAGDAD PLGNLPLFAGLDAATLAQLGEHVEDVELEAGCYLFHEGDPSDSLYVIRTGRVQVLQDS IVLKELGRGEVLGELGLLIDAPRSATVRALRDTKLVRLTKAQFDEIADHGALAALVKV LATRLREAPPPATDSTSPEVVVSVIGVSGDAPVPAVAAGLLTALSARLRAVDPGRVDR DGLDRAERVADKVVLHAAVEDAGWRDFCLRVADRIVLVAGDPNPQAARLPARARGADL VLAGPAASREHRRQWEELITPRSVHVVHYRRILENVRPLAARIAGRSIGLVLGGGGAR GFAHLGVLDELERVGVTIDRFAGTSMGAVIAVFGACGMDAATADAYAYEYFIRHNPLS DYAFPVRGLVHGRRTLTLLEAAFGDRLVEELPKEFRCVSVDLLARRPVVHRRGRLVDV IGCSLRLPGIYPPQVYNGRLHVDGGVLDNLPVSTRASPDGPLIAVSIGLGGGGPGSAR QDGSPKVPGIGDTLMRTMTIGSQRGADAALSLAQVVIRPDTGAVGLLEFHQIDAAREA GRVAAREAMPHIMALLNR" CDS complement(3576692..3579541) /codon_start=1 /transl_table=11 /gene="secA1" /locus_tag="BQ2027_MB3268C" /standard_name="secA" /product="PROBABLE PREPROTEIN TRANSLOCASE SECA1 1 SUBUNIT" /note="Mb3268c, secA1, len: 949 aa. Equivalent to Rv3240c, len: 949 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 949 aa overlap). Probable secA1, preprotein translocase subunit, component of secretion apparatus, highly similar to many e.g. P57996|SEA1_MYCLE from Mycobacterium leprae (940 aa), FASTA scores: opt: 5044, E(): 0, (87.5% identity in 849 aa overlap); P95759|SECA_STRGR from Streptomyces griseus (940 aa), FASTA scores: opt: 2612, E(): 1.9e-134, (61.35% identity in 960 aa overlap); P28366|SECA_BACSU|DIV+ from Bacillus subtilis (841 aa), FASTA scores: opt: 1776, E(): 4.9e-89, (48.05% identity in 837 aa overlap); etc. BELONGS TO THE SECA FAMILY. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732. Note that previously known as secA. Protein product from Mb3268c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3268c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Y9" /db_xref="InterPro:IPR000185" /db_xref="InterPro:IPR011115" /db_xref="InterPro:IPR011116" /db_xref="InterPro:IPR011130" /db_xref="InterPro:IPR014018" /db_xref="InterPro:IPR020937" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036266" /db_xref="InterPro:IPR036670" /db_xref="UniProtKB/Swiss-Prot:P0A5Y9" /protein_id="SIU01897.1" /translation="MLSKLLRLGEGRMVKRLKKVADYVGTLSDDVEKLTDAELRAKTD EFKRRLADQKNPETLDDLLPEAFAVAREAAWRVLDQRPFDVQVMGAAALHLGNVAEMK TGEGKTLTCVLPAYLNALAGNGVHIVTVNDYLAKRDSEWMGRVHRFLGLQVGVILATM TPDERRVAYNADITYGTNNEFGFDYLRDNMAHSLDDLVQRGHHYAIVDEVDSILIDEA RTPLIISGPADGASNWYTEFARLAPLMEKDVHYEVDLRKRTVGVHEKGVEFVEDQLGI DNLYEAANSPLVSYLNNALKAKELFSRDKDYIVRDGEVLIVDEFTGRVLIGRRYNEGM HQAIEAKEHVEIKAENQTLATITLQNYFRLYDKLAGMTGTAQTEAAELHEIYKLGVVS IPTNMPMIREDQSDLIYKTEEAKYIAVVDDVAERYAKGQPVLIGTTSVERSEYLSRQF TKRRIPHNVLNAKYHEQEATIIAVAGRRGGVTVATNMAGRGTDIVLGGNVDFLTDQRL RERGLDPVETPEEYEAAWHSELPIVKEEASKEAKEVIEAGGLYVLGTERHESRRIDNQ LRGRSGRQGDPGESRFYLSLGDELMRRFNGAALETLLTRLNLPDDVPIEAKMVTRAIK SAQTQVEQQNFEVRKNVLKYDEVMNQQRKVIYAERRRILEGENLKDQALDMVRDVITA YVDGATGEGYAEDWDLDALWTALKTLYPVGITADSLTRKDHEFERDDLTREELLEALL KDAERAYAAREAELEEIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAM AQRDPLVEYQREGYDMFMAMLDGMKEESVGFLFNVTVEAVPAPPVAPAAEPAELAEFA AAAAAAAQQRSAVDGGARERAPSALRAKGVASESPALTYSGPAEDGSAQVQRNGGGAH KTPAGVPAGASRRERREAARRQGRGAKPPKSVKKR" CDS complement(3579620..3580264) /codon_start=1 /transl_table=11 /gene="hpf" /locus_tag="BQ2027_MB3269C" /product="Ribosome hibernation promoting factor Hpf" /note="Mb3269c, -, len: 214 aa. Equivalent to Rv3241c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Conserved hypothetical protein, similar to many hypothetical proteins and to some putative ribosomal proteins e.g. Q9CCI7|ML0778 HYPOTHETICAL PROTEIN from Mycobacterium leprae (229 aa), FASTA scores: opt: 1234, E(): 1.3e-72, (89.3% identity in 206 aa overlap); Q9KYX2|SCE33.11c HYPOTHETICAL 27.9 KDA PROTEIN from Streptomyces coelicolor (254 aa), FASTA scores: opt: 487, E(): 2.2e-24, (47.6% identity in 210 aa overlap); Q9FLV3 PROTEIN SIMILAR TO RIBOSOMAL PROTEIN 30S SUBUNIT from Arabidopsis thaliana (Mouse-ear cress) (365 aa), FASTA scores: opt: 264, E(): 7e-10, (26.4% identity in 212 aa overlap); P19954|RR30_SPIOL|RPS22 PLASTID-SPECIFIC 30S RIBOSOMAL PROTEIN 1, chloroplast, from Spinacia oleracea (Spinach) (302 aa), FASTA scores: opt: 261, E(): 9.3e-10, (26.15% identity in 214 aa overlap); P47995|YSEA_STACA HYPOTHETICAL PROTEIN IN SECA 5'REGION (ORF1) (FRAGMENT) (BELONGS TO THE S30AE FAMILY OF RIBOSOMAL PROTEINS) from Staphylococcus carnosus (165 aa), FASTA scores: opt: 201, E(): 4.2e-06, (33.35% identity in 147 aa overlap); etc. Protein product from Mb3269c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3269c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3V1" /db_xref="InterPro:IPR003489" /db_xref="InterPro:IPR032528" /db_xref="InterPro:IPR034694" /db_xref="InterPro:IPR036567" /db_xref="InterPro:IPR038416" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V1" /protein_id="SIU01898.1" /translation="MDSGQVLAEPKSNAEIVFKGRNVEIPDHFRIYVSQKLARLERFD RTIYLFDVELDHERNRRQRKSCQRVEITARGRGPVVRGEACADSFYAALESAVVKLES RLRRGKDRRKVHYGDKTPVSLAEATAVVPAPENGFNTRPAEAHDHDGAVVEREPGRIV RTKEHPAKPMSVDDALYQMELVGHDFFLFYDKDTERPSVVYRRHAYDYGLIRLA" CDS complement(3580580..3581221) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3270C" /product="Competence protein F homolog, phosphoribosyltransferase domain; protein YhgH required for utilization of DNA as sole source of carbon and energy" /note="Mb3270c, -, len: 213 aa. Equivalent to Rv3242c, len: 213 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap). Conserved hypothetical protein, highly similar in N-terminus to Q9CCI9|ML0776 HYPOTHETICAL PROTEIN from Mycobacterium leprae (85 aa), FASTA scores: opt: 324, E(): 1.7e-13, (78.1% identity in 64 aa overlap). Also similar to Q9RUJ7|DR1389 PUTATIVE COMPETENCE PROTEIN COMF from Deinococcus radiodurans (219 aa), FASTA scores: opt: 223, E(): 6.3e-07, (35.8% identity in 215 aa overlap); BAB50338|MLL3453 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (240 aa), FASTA scores: opt: 218, E(): 1.4e-06, (28.5% identity in 224 aa overlap); Q9A9Y1|CC0830 COMPETENCE PROTEIN F from Caulobacter crescentus (265 aa), FASTA scores: opt: 182, E(): 0.00026, (30.15% identity in 219 aa overlap); etc. Equivalent to AAK47682 from Mycobacterium tuberculosis strain CDC1551 (241 aa) but shorter 29 aa. Contains purine/pyrimidine phosphoribosyl transferases signature (PS00103). SEEMS TO BELONG TO PURINE/PYRIMIDINE PHOSPHORIBOSYL TRANSFERASE FAMILY. Mb3270c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3J7" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR029057" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J7" /protein_id="SIU01899.1" /translation="MLDLVLPLECGGCGAPATRWCAACAAELSVAAGEPHVVSPRVDP QVPVFALGRYAGVRRQAILAMKEHGRRDLVAPLACALIVGVDHLLSWGMLENPLTMVP APTRRWAARRRGGDPVSRMARIAGATLGRHHDVTVVPALRMRALARDSVGLGASARER NITGRVLLRGQRPRNEVVLVDDIITTGATARESVRVLQAAGVRVGAVLAVAAA" CDS complement(3581259..3582101) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3271C" /product="unknown protein" /note="Mb3271c, -, len: 280 aa. Equivalent to Rv3243c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Hypothetical unknown protein. Protein product from Mb3271c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3271c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L6" /protein_id="SIU01900.1" /translation="MSPRVPRLRWDDPFRALDMLASLWSSTGMSLVSAGAAQAVAAPY RTLFTTLQQLLIGKEVTVRIGDHDVVLTVTELDSALEPQGLAVGQLGEVRVAARGISW DQHHLHSAVAVLRNVHIRPGVPPLVIAAPVELSSALPTEIFDDVLRQATPQLRGELSE SGAARLRWARRPDWGGLEVDVDVAGTTSQTTLWLRPRTVITGQRRWTLPARTPAYRVP LPELPHGLRITDVSLAADCLQLSALLPEWRTELPLRYLESVITQLSQGALSFVWPPLR SGAD" CDS complement(3582169..3583920) /codon_start=1 /transl_table=11 /gene="lpqB" /locus_tag="BQ2027_MB3272C" /product="PROBABLE CONSERVED LIPOPROTEIN LPQB" /note="Mb3272c, lpqB, len: 583 aa. Equivalent to Rv3244c, len: 583 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 583 aa overlap). Probable lpqB, conserved lipoprotein; contains appropriately placed lipoprotein signature (PS00013). Equivalent to Q9CCJ0|LPQB|ML0775 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (589 aa), FASTA scores: opt: 3375, E(): 1.4e-186, (87.9% identity in 579 aa overlap). Also similar to various proteins (in particular transferases) e.g. Q9KYX0|SCE33.13c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (615 aa), FASTA scores: opt: 228, E(): 1.3e-05, (25.5% identity in 624 aa overlap); O87992|BBLPS1.19c PUTATIVE GLUTAMINE AMIDOTRANSFERASE from Bordetella bronchiseptica (Alcaligenes bronchisepticus) (628 aa), FASTA scores: opt: 162, E(): 0.079, (28.05% identity in 171 aa overlap); Q9L2F4|SC7A8.01 PUTATIVE SUGAR KINASE (FRAGMENT) from Streptomyces coelicolor (434 aa), FASTA scores: opt: 143, E(): 0.72, (27.65% identity in 293 aa overlap); etc. Protein product from Mb3272c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3272c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWW9" /db_xref="InterPro:IPR018910" /db_xref="InterPro:IPR019606" /db_xref="InterPro:IPR023959" /db_xref="UniProtKB/Swiss-Prot:Q7TWW9" /protein_id="SIU01901.1" /translation="MRLTILLFLGAVLAGCASVPSTSAPQAIGTVERPVPSNLPKPSP GMDPDVLLREFLKATADPANRHLAARQFLTESASNAWDDAGSALLIDHVVFVETRSAE KVSVTMRADILGSLSDVGVFETAEGQLPDPGPIELVKTSGGWRIDRLPNGVFLDWQQF QETYKRNTLYFADPTGKTVVPDPRYVAVSDRDQLATELVSKLLAGPRPEMARTVRNLL APPLRLRGPVTRADGGKSGIGRGYGGARVDMEKLSTTDPHSRQLLAAQIIWTLARADI RGPYVINADGAPLEDRFAEGWTTSDVAATDPGVADGAAAGLHALVNGSLVAMDAQRVT PVPGAFGRMPEQTAAAVSRSGRQVASVVTLGRGAPDEAASLWVGDLGGEAVQSADGHS LLRPSWSLDDAVWVVVDTNVVLRAIQDPASGQPARIPVDSTAVASRFPGAINDLQLSR DGTRAAMVIGGQVILAGVEQTQAGQFALTYPRRLGFGLGSSVVSLSWRTGDDIVVTRT DAAHPVSYVNLDGVNSDAPSRGLQTPLTAIAANPSTVYVAGPQGVLMYSASVESRPGW ADVPGLMVPGAAPVLPG" CDS complement(3583920..3585623) /codon_start=1 /transl_table=11 /gene="mtrB" /locus_tag="BQ2027_MB3273C" /product="TWO COMPONENT SENSORY TRANSDUCTION HISTIDINE KINASE MTRB" /note="Mb3273c, mtrB, len: 567 aa. Equivalent to Rv3245c, len: 567 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 567 aa overlap). mtrB, sensor-like histidine kinase (EC 2.7.3.-) (see citations below), equivalent to Q9CCJ1|MTRB OR ML0774 PUTATIVE TWO-COMPONENT SYSTEM SENSOR KINASE from Mycobacterium leprae (562 aa), FASTA scores: opt: 3208, E(): 7.4e-173, (88.7% identity in 566 aa overlap). Also similar to others e.g. Q9KYW9|SCE33.14c PUTATIVE TWO-COMPONENT SYSTEM HISTIDINE KINASE from Streptomyces coelicolor (688 aa), FASTA scores: opt: 1355, E(): 1.1e-68, (48.95% identity in 515 aa overlap); etc. Relatives in Mycobacterium tuberculosis are: MTCY369.03, E(): 1.5e-22; MTCY20G9.16, E(): 1.9e-17. SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES. Protein product from Mb3273c detected using SWATH mass spectrometry. Mb3273c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59963" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/Swiss-Prot:P59963" /protein_id="SIU01902.1" /translation="MIFGSRRRIRGRRGRSGPMTRGLSALSRAVAVAWRRSLQLRVVA LTLGLSLAVILALGFVLTSQVTNRVLDIKVRAAIDQIERARTTVSGIVNGEETRSLDS SLQLARNTLTSKTDPASGAGLAGAFDAVLMVPGDGPRAASTAGPVDQVPNALRGFVKA GQAAYQYATVQTEGFSGPALIIGTPTLSRVANLELYLIFPLASEQATITLVRGTMATG GLVLLVLLAGIALLVSRQVVVPVRSASRIAERFAEGHLSERMPVRGEDDMARLAVSFN DMAESLSRQIAQLEEFGNLQRRFTSDVSHELRTPLTTVRMAADLIYDHSADLDPTLRR STELMVSELDRFETLLNDLLEISRHDAGVAELSVEAVDLRTTVNNALGNVGHLAEEAG IELLVDLPAEQVIAEVDARRVERILRNLIANAIDHAEHKPVRIRMAADEDTVAVTVRD YGVGLRPGEEKLVFSRFWRSDPSRVRRSGGTGLGLAISVEDARLHQGRLEAWGEPGEG ACFRLTLPLVRGHKVTTSPLPMKPIPQPVLQPVAQPNPQPMPPEYKERQRPREHAEWS G" CDS complement(3585673..3586359) /codon_start=1 /transl_table=11 /gene="mtrA" /locus_tag="BQ2027_MB3274C" /product="TWO COMPONENT SENSORY TRANSDUCTION TRANSCRIPTIONAL REGULATORY PROTEIN MTRA" /note="Mb3274c, mtrA, len: 228 aa. Equivalent to Rv3246c, len: 228 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 228 aa overlap). mtrA, transcriptional activator, response regulator (see citations below), equivalent to Q9CCJ2|MTRA|ML0773 PUTATIVE TWO-COMPONENT RESPONSE REGULATOR from Mycobacterium leprae (228 aa), FASTA scores: opt: 1458, E(): 1.4e-85, (98.7% identity in 228 aa overlap). Also highly similar to others e.g. Q9F9J5|SCRA PUTATIVE RESPONSE REGULATOR from Streptomyces coelicolor (228 aa), FASTA scores: opt: 1141, E(): 1.9e-65, (74.9% identity in 227 aa overlap); Q9KYW8|SCE33.15c PUTATIVE TWO-COMPONENT SYSTEM RESPONSE REGULATOR from Streptomyces coelicolor (229 aa), FASTA scores: opt: 1141, E(): 1.9e-65, (74.9% identity in 227 aa overlap); Q9F868|REGX3 RESPONSE REGULATOR REGX3 from Mycobacterium smegmatis (228 aa), FASTA scores: opt: 730, E(): 2.3e-39, (50.90% identity in 222 aa overlap); etc. Relatives in Mycobacterium tuberculosis are: U01971|MTU01971_1; Q11156|RGX3_MYCTU; MTCY20G9.17, E(): 0; MTCY31.31c, E(): 3.4e-29; MTCY369.02, E(): 5.7e-28. SIMILAR TO BACTERIAL REGULATORY PROTEINS INVOLVED IN SIGNAL TRANSDUCTION. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. Experiments showed mtrA is differentially expressed in virulent and avirulent strains during growth in macrophages. Protein product from Mb3274c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3274c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5Z5" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/Swiss-Prot:P0A5Z5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01903.1" /translation="MDTMRQRILVVDDDASLAEMLTIVLRGEGFDTAVIGDGTQALTA VRELRPDLVLLDLMLPGMNGIDVCRVLRADSGVPIVMLTAKTDTVDVVLGLESGADDY IMKPFKPKELVARVRARLRRNDDEPAEMLSIADVEIDVPAHKVTRNGEQISLTPLEFD LLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQRLRAKVEKDPENPTVVLTV RGVGYKAGPP" CDS complement(3586429..3587073) /codon_start=1 /transl_table=11 /gene="tmk" /locus_tag="BQ2027_MB3275C" /product="thymidylate kinase tmk (dtmp kinase) (thymidylic acid kinase) (tmpk)" /note="Mb3275c, tmk, len: 214 aa. Equivalent to Rv3247c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Probable tmk, thymidylate kinase (EC 2.7.4.9), equivalent to Q9CCJ3|TMK|ML0772 PUTATIVE THYMIDYLATE KINASE from Mycobacterium leprae (210 aa), FASTA scores: opt: 1023, E(): 4.8e-57, (77.3% identity in 207 aa overlap). Also similar to other thymidylate kinases e.g. Q9RQJ9|KTHY_CAUCR|TMK|CC1824 from Caulobacter crescentus (208 aa), FASTA scores: opt: 179, E(): 0.0003, (31.3% identity in 214 aa overlap); Q9V1E9|KTHY_PYRAB|TMK|PAB0319 from Pyrococcus abyssi (205 aa), FASTA scores: opt: 176, E(): 0.00045, (29.1% identity in 189 aa overlap); etc. BELONGS TO THE THYMIDYLATE KINASE FAMILY. Protein product from Mb3275c detected using SWATH mass spectrometry. Mb3275c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3J9" /db_xref="InterPro:IPR018094" /db_xref="InterPro:IPR018095" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR039430" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3J9" /protein_id="SIU01904.1" /translation="MLIAIEGVDGAGKRTLVEKLSGAFRAAGRSVATLAFPRYGQSVA ADIAAEALHGEHGDLASSVYAMATLFALDRAGAVHTIQGLCRGYDVVILDRYVASNAA YSAARLHENAAGKAAAWVQRIEFARLGLPKPDWQVLLAVSAELAGERSRGRAQRDPGR ARDNYERDAELQQRTGAVYAELAAQGWGGRWLVVGADVDPGRLAATLAPPDVPS" CDS complement(3587170..3588657) /codon_start=1 /transl_table=11 /gene="sahH" /locus_tag="BQ2027_MB3276C" /product="PROBABLE ADENOSYLHOMOCYSTEINASE SAHH (S-ADENOSYL-L-HOMOCYSTEINE HYDROLASE) (ADOHCYASE)" /note="Mb3276c, sahH, len: 495 aa. Equivalent to Rv3248c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 495 aa overlap). Probable sahH, adenosylhomocysteinase (EC 3.3.1.1), equivalent to Q9CCJ4|SAHH|ML0771 PUTATIVE S-ADENOSYL-L-HOMOCYSTEINE HYDROLASE from Mycobacterium leprae (492 aa), FASTA scores: opt: 3019, E(): 1.3e-177, (91.4% identity in 489 aa overlap). Also highly similar to other adenosylhomocysteinases e.g. Q9KZM1|SAHH from Streptomyces coelicolor (485 aa), FASTA scores: opt: 2258, E(): 5.7e-131, (70.0% identity in 483 aa overlap); P51540|SAHH_TRIVA from Trichomonas vaginalis (486 aa), FASTA scores: opt: 2005, E(): 1.8e-115, (62.05% identity in 477 aa overlap); P35007|SAHH_CATRO from Catharanthus roseus (Rosy periwinkle) (Madagascar periwinkle) (485 aa), FASTA scores: opt: 1941, E(): 1.5e-111, (60.15% identity in 492 aa overlap); etc. Has S-adenosyl-L-homocysteine hydrolase signature (PS00739). BELONGS TO THE ADENOSYLHOMOCYSTEINASE FAMILY. Protein product from Mb3276c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3276c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWW7" /db_xref="InterPro:IPR000043" /db_xref="InterPro:IPR015878" /db_xref="InterPro:IPR020082" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR042172" /db_xref="UniProtKB/Swiss-Prot:Q7TWW7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01905.1" /translation="MTGNLVTKNSLTPDVRNGIDFKIADLSLADFGRKELRIAEHEMP GLMSLRREYAEVQPLKGARISGSLHMTVQTAVLIETLTALGAEVRWASCNIFSTQDHA AAAVVVGPHGTPDEPKGVPVFAWKGETLEEYWWAAEQMLTWPDPDKPANMILDDGGDA TMLVLRGMQYEKAGVVPPAEEDDPAEWKIFLNLLRTRFETDKDKWTKIAESVKGVTEE TTTGVLRLYQFAAAGDLAFPAINVNDSVTKSKFDNKYGTRHSLIDGINRGTDALIGGK KVLICGYGDVGKGCAEAMKGQGARVSVTEIDPINALQAMMEGFDVVTVEEAIGDADIV VTATGNKDIIMLEHIKAMKDHAILGNIGHFDNEIDMAGLERSGATRVNVKPQVDLWTF GDTGRSIIVLSEGRLLNLGNATGHPSFVMSNSFANQTIAQIELWTKNDEYDNEVYRLP KHLDEKVARIHVEALGGHLTKLTKEQAEYLGVDVEGPYKPDHYRY" CDS complement(3588762..3589397) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3277C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3277c, -, len: 211 aa. Equivalent to Rv3249c, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Possible transcriptional regulatory protein, tetR family, with similarity to several e.g. Q9AE61|ALKB1 PUTATIVE TETR-REGULATORY from Rhodococcus erythropolis (208 aa), FASTA scores: opt: 503, E(): 7.7e-26, (40.6% identity in 192 aa overlap); CAC37620 PUTATIVE TETR-REGULATORY PROTEIN from Prauserella rugosa (212 aa), FASTA scores: opt: 246, E(): 4.4e-09, (27.95% identity in 186 aa overlap); Q9K4B0|SC7E4.06 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (203 aa), FASTA scores: opt: 224, E(): 1.1e-07, (34.5% identity in 197 aa overlap); Q11063|YC55_MYCTU|Rv1255c|MT1294|MTCY50.27 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 191, E(): 1.6e-05, (28.35% identity in 180 aa overlap); etc. Equivalent to AAK47689 from Mycobacterium tuberculosis strain CDC1551 (230 aa) but shorter 19 aa. COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Possible helix-turn helix motif at aa 44-65 (+6.66 SD). Protein product from Mb3277c detected using SWATH mass spectrometry. Mb3277c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y488" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR040611" /db_xref="UniProtKB/TrEMBL:A0A1R3Y488" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01906.1" /translation="MSTPSATVAPVKRIPYAEASRALLRDSVLDAMRDLLLTRDWSAI TLSDVARAAGISRQTIYNEFGSRQGLAQGYALRLADRLVDNVHASLDANVGNFYEAFL QGFRSFFAESAADPLVISLLTGVAKPDLLQLITTDSAPIITRASARLAPAFTDTWVAT TDNDANVLSRAIVRLCLSYVSMPPEADHDVAADLARLITPFAERHGVINVP" CDS complement(3589394..3589576) /codon_start=1 /transl_table=11 /gene="rubB" /locus_tag="BQ2027_MB3278C" /product="PROBABLE RUBREDOXIN RUBB" /note="Mb3278c, -, len: 60 aa. Equivalent to Rv3250c, len: 60 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 60 aa overlap). Probable rubB, rubredoxin, highly similar to other rubredoxins e.g. Q9AE66|RUBA4 from Rhodococcus erythropolis (60 aa), FASTA scores: opt: 391, E(): 2.2e-21, (83.05% identity in 59 aa overlap); Q9AE63|RUBA2 from Rhodococcus erythropolis (63 aa), FASTA scores: opt: 380, E(): 1.4e-20, (83.9% identity in 56 aa overlap); P42453|RUBR_ACICA|RUBA from Acinetobacter calcoaceticus (54 aa), FASTA scores: opt: 315, E(): 4.9e-16, (69.8% identity in 53 aa overlap); Q9HTK7|PA5351 from Pseudomonas aeruginosa (55 aa), FASTA scores: opt: 298, E(): 8e-15, (64.15% identity in 53 aa overlap); Q9PGC3|XF0379 from Xylella fastidiosa (57 aa), FASTA scores: opt: 263, E(): 2.5e-12, (59.25% identity in 54 aa overlap); etc. Also similar to neighbouring ORF M. tuberculosis RubA (MTCY20B11.26c). Contains rubredoxin signature (PS00202). BELONGS TO THE RUBREDOXIN FAMILY. Mb3278c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3R1" /db_xref="InterPro:IPR018527" /db_xref="InterPro:IPR024934" /db_xref="InterPro:IPR024935" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01907.1" /translation="MNDYKLFRCIQCGFEYDEALGWPEDGIAAGTRWDDIPDDWSCPD CGAAKSDFEMVEVARS" CDS complement(3589581..3589748) /codon_start=1 /transl_table=11 /gene="rubA" /locus_tag="BQ2027_MB3279C" /product="PROBABLE RUBREDOXIN RUBA" /note="Mb3279c, -, len: 55 aa. Equivalent to Rv3251c, len: 55 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 55 aa overlap). Probable rubA, rubredoxin, highly similar to other rubredoxins (but sometimes shorter) e.g. Q9AE67|RUBA3 from Rhodococcus erythropolis (61 aa), FASTA scores: opt: 335, E(): 1e-17, (73.6% identity in 53 aa overlap); P00272|RUB2_PSEOL|ALKG from Pseudomonas oleovorans (172 aa), FASTA scores: opt: 278, E(): 2.7e-13, (65.3% identity in 49 aa overlap); CAC38028|ALKG from Alcanivorax borkumensis (174 aa), FASTA scores: opt: 271, E(): 8.6e-13, (62.0% identity in 50 aa overlap); Q9WWW4|ALKG from Pseudomonas putida (175 aa), FASTA scores: opt: 270, E(): 1e-12, (61.8% identity in 55 aa overlap); etc. Also highly similar to C-terminus of Q9XBM1|ALKB ALKANE 1-MONOOXYGENASE (EC 1.14.15.3) from Prauserella rugosa (490 aa), FASTA scores: opt: 296, E(): 2.9e-14, (75.5% identity in 49 aa overlap). Also similar to neighbouring ORF Mycobacterium tuberculosis rubB (MTCY20B11.25c). Contains rubredoxin signature (PS00202). BELONGS TO THE RUBREDOXIN FAMILY. Mb3279c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3V9" /db_xref="InterPro:IPR018527" /db_xref="InterPro:IPR024934" /db_xref="InterPro:IPR024935" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01908.1" /translation="MAAYRCPVCDYVYDEANGDAREGFPAGTGWDQIPDDWCCPDCAV REKVDFEKIGG" CDS complement(3589748..3590998) /codon_start=1 /transl_table=11 /gene="alkB" /locus_tag="BQ2027_MB3280C" /product="PROBABLE TRANSMEMBRANE ALKANE 1-MONOOXYGENASE ALKB (ALKANE 1-HYDROXYLASE) (LAURIC ACID OMEGA-HYDROXYLASE) (OMEGA-HYDROXYLASE) (FATTY ACID OMEGA-HYDROXYLASE) (ALKANE HYDROXYLASE-RUBREDOXIN)" /note="Mb3280c, alkB, len: 416 aa. Equivalent to Rv3252c, len: 416 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 416 aa overlap). Probable alkB, transmembrane alkane-1-monooxygenase (EC 1.14.15.3), highly similar to many e.g. Q9AE68|ALKB2 from Rhodococcus erythropolis (408 aa), FASTA scores: opt: 2018, E(): 9.6e-122, (68.6% identity in 415 aa overlap); Q9AFD5|ALKB from Nocardioides sp. CF8 (483 aa), FASTA scores: opt: 1485, E(): 1.4e-87, (56.55% identity in 405 aa overlap); Q9XAU0|ALKB1 from Rhodococcus erythropolis (391 aa), FASTA scores: opt: 1400, E(): 3.3e-82, (62.6% identity in 396 aa overlap); Q9XBM1|ALKB from Prauserella rugosa (490 aa), FASTA scores: opt: 1266, E(): 1.5e-73, (57.55% identity in 410 aa overlap); CAC40954|ALKB4 from Rhodococcus erythropolis (386 aa), FASTA scores: opt: 1190, E(): 9.1e-69, (54.3% identity in 383 aa overlap); etc. Mb3280c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3K1" /db_xref="InterPro:IPR005804" /db_xref="InterPro:IPR033885" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K1" /protein_id="SIU01909.1" /translation="MTTQIGSGGPEAPRPPEVEEWRDKKRYLWLMGLIAPTALVVMLP LIWGMNQLGWHAAAQVPLWIGPILLYVLLPLLDLRFGPDGQNPPDEVTDRLENDKYYR YCTYIYIPFQYLSVVLGAYLFTAANLSWLGFDGALSWAGKLGVALSVGVLGGVGINTA HEMGHKKDSLERWLSKITLAQTCYGHFYIEHNRGHHVRVSTPEDPASARFGETLWEFL PRSVIGGLRSAVHLEAQRLRRLGVSPWNPMTYLRNDVLNAWLMSVVLWGGLIAVFGPA LIPFVIIQAVFGFSLLEAVNYLEHYGLLRQKSANGRYERCAPVHSWNSDHIVTNLFLY HLQRHSDHHANPTRRYQTLRSMAGAPNLPSGYASMISLTYFPPLWRKVMDHRVLEHYG GDITRVNLHPRVREKALARYGASA" CDS complement(3591107..3592594) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3281C" /product="POSSIBLE CATIONIC AMINO ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN" /note="Mb3281c, -, len: 495 aa. Equivalent to Rv3253c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 495 aa overlap). Possible cationic amino acid transporter, integral membrane protein, similar to many e.g. O69844|SC1C3.02 PUTATIVE CATIONIC AMINO ACID TRANSPORTER from Streptomyces coelicolor (503 aa), FASTA scores: opt: 1649, E(): 5.8e-92, (52.6% identity in 485 aa overlap); Q9AE69 PUTATIVE TRANSPORTER (FRAGMENT) from Rhodococcus erythropolis (385 aa), FASTA scores: opt: 1594, E(): 9.7e-89, (62.0% identity in 387 aa overlap); Q9PBD7|XF2207 CATIONIC AMINO ACID TRANSPORTER from Xylella fastidiosa (483 aa), FASTA scores: opt: 1079, E(): 1.2e-57, (40.55% identity in 493 aa overlap); Q9SRU9|F20H23.25 PUTATIVE CATIONIC AMINO ACID TRANSPORTER from Arabidopsis thaliana (Mouse-ear cress) (614 aa), FASTA scores: opt: 802, E(): 6.7e-41, (36.4% identity in 445 aa overlap); P30823|CTR1_RAT|SLC7A1|ATRC1 HIGH-AFFINITY CATIONIC AMINO ACID TRANSPORTER-1 from Rattus norvegicus (Rat) (624 aa), FASTA scores: opt: 782, E(): 1.1e-39, (36.1% identity in 432 aa overlap); etc. Relatives in Mycobacterium tuberculosis include: MTCY3G12.14, E(): 5.6e-31; MTCY39.19, E(): 1.6e-14. SEEMS TO BELONG TO THE APC FAMILY. Mb3281c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3L9" /db_xref="InterPro:IPR002293" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L9" /protein_id="SIU01910.1" /translation="MAGRRRMKSVEQSIADTDEPTTRLRKDLTWWDLVVFGVSVVIGA GIFTVTASTAGDITGPAIWISFLIAAATCALAALCYAEFASTLPVAGSAYTFSYATFG EFLAWVIGWNLVLELAMGAAVVAKGWSSYLGTVFGFGNGTGHLGSLQLDWGALVIVTL VATLIALGTKLSSRFSAVVTAIKVSVVVLVVVVGAFYIRAANYSPFIPEPEVQHHGGG LDQSVFSLLTGAQGSHYGWYGVLAGASIVFFAFIGFDIVATMAEETKRPQRDVPRGIL ASLGVVTLLYVAVSVVLSGMVPYTQLRTVPGRGPANLATAFQANGVYWASGIISVGAL AGLTTVVMVLMLGQCRVLFAMARDGLVPRQLAKTGSRGTPVRVTVLVAVLVATTASVF PITKLEEMVNVGTLFAFILVSAGVVVLRRTRPDLQRGFTAPWVPLLPIAAVCACLWLM LNLTALTWIRFGIWLVAGTAIYVGYGRRHSAQGLRQARESATRRC" CDS 3592685..3594073 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3282" /product="Monooxygenase" /note="Mb3282, -, len: 462 aa. Equivalent to Rv3254, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 462 aa overlap). Conserved hypothetical protein, similar to CAC37877|SC1G7.02 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (440 aa), FASTA scores: opt: 606, E(): 6.2e-31, (31.7% identity in 445 aa overlap); O86550|SC1F2.13c HYPOTHETICAL 50.7 KDA PROTEIN from Streptomyces coelicolor (476 aa), FASTA scores: opt: 577, E(): 4.5e-29, (32.5% identity in 400 aa overlap); Q9L0A8|SCC24.09 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (468 aa), FASTA scores: opt: 380, E(): 1.3e-16, (30.7% identity in 391 aa overlap); BAB48792|MLL1411 PROBABLE FAD-DEPENDENT MONOOXYGENASE from Rhizobium loti (Mesorhizobium loti) (421 aa), FASTA scores: opt: 128, E(): 1.1, (25.2% identity in 397 aa overlap); Q9L7X9|BENF BENZOATE-SPECIFIC PORIN-LIKE PROTEIN from Pseudomonas putida (397 aa), FASTA scores: opt: 119, E(): 4, (24.85% identity in 157 aa overlap); etc. Also similar to N-terminus of AAK46259|MT1987 PUTATIVE FERREDOXIN REDUCTASE, ELECTRON TRANSFER COMPONENT from Mycobacterium tuberculosis strain CDC1551 (839 aa), FASTA scores: opt: 493, E(): 1.5e-23, (30.65% identity in 382 aa overlap). Protein product from Mb3282 detected using SWATH mass spectrometry. Mb3282 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K5" /protein_id="SIU01911.1" /translation="MVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANRATVPQD RHLHMLMARGAQEFDSLFPGLLHDMVAAGVPMLENRPDCIYLGAAGHVLGTGHTLRKE FTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQFERRQQRVVGVLLDSPGSGQDR EREEFIAADLVVDAAGRGTRLPVWLTQWGYRRPAEDTVDIGISYASHQFRIPDGLIAE KVVVAGASHDQSLGLGMLCYEDGTWVLTTFGVADAKPPPTFDEMRALADKLLPARFTA ALAQAQPIGCPAFHAFPASRWRRYDKLERFPRGIVPFGDAVASFNPTFGQGMTMTSLQ AGHLRRALKARNSAMKGDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLPRWWR PAGSLFDQFLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAIAHNLRLWLKEQR ERRQPVTTRRSP" CDS complement(3594051..3595277) /codon_start=1 /transl_table=11 /gene="manA" /locus_tag="BQ2027_MB3283C" /product="PROBABLE MANNOSE-6-PHOSPHATE ISOMERASE MANA (PHOSPHOMANNOSE ISOMERASE) (PHOSPHOMANNOISOMERASE) (PMI) (PHOSPHOHEXOISOMERASE) (PHOSPHOHEXOMUTASE)" /note="Mb3283c, manA, len: 408 aa. Equivalent to Rv3255c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 408 aa overlap). Probable manA, mannose-6-phosphate isomerase (EC 5.3.1.8), equivalent to Q9CCJ5|MANA|ML0765 from Mycobacterium leprae (410 aa), FASTA scores: opt: 2271, E(): 1.6e-133, (84.45% identity in 411 aa overlap). Also similar to many e.g. Q9KZL9|MANA from Streptomyces coelicolor (383 aa), FASTA scores: opt: 946, E(): 2.4e-51, (44.4% identity in 403 aa overlap); Q9KV87|VC0269 from Vibrio cholerae (399 aa), FASTA scores: opt: 726, E(): 1.1e-37, (34.15% identity in 404 aa overlap); Q9CMJ5|PMI|PM0829 from Pasteurella multocida (400 aa), FASTA scores: opt: 640, E(): 2.4e-32, (32.5% identity in 391 aa overlap); etc. SIMILAR TO FAMILY 1 OF MANNOSE-6-PHOSPHATE ISOMERASES. Protein product from Mb3283c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3283c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3K3" /db_xref="InterPro:IPR001250" /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016305" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K3" /protein_id="SIU01912.1" /translation="MELLRGALRTYAWGSRTAIAEFTGRPVPAAHPEAELWFGAHPGD PAWLQTPHGQTSLLEALVADPEGQLGSASRARFGDVLPFLVKVLAADEPLSLQAHPSA EQAVEGYLREERMGIPVSSPVRNYRDTSHKPELLVALQPFEALAGFREAARTTELLRA LAVSDLDPFIDLLSEGSDADGLRALFTTWITAPQPDIDVLVPAVLDGAIQYVSSGATE FGAEAKTVLELGERYPGDAGVLAALLLNRISLAPGEAIFLPAGNLHAYVRGFGVEVMA NSDNVLRGGLTPKHVDVPELLRVLDFAPTPKARLRPPIRREGLGLVFETPTDEFAATL LVLDGDHLGHEVDASSGHDGPQILLCTEGSATVHGKCGSLTLQRGTAAWVAADDGPIR LTAGQPAKLFRATVGL" CDS complement(3595285..3596325) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3284C" /product="Sugar binding protein" /note="Mb3284c, -, len: 346 aa. Equivalent to Rv3256c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Conserved hypothetical protein, equivalent to Q9CCJ6|ML0764 HYPOTHETICAL PROTEIN from Mycobacterium leprae (365 aa), FASTA scores: opt: 1574, E(): 1.4e-82, (75.35% identity in 365 aa overlap). Also similar to other hypothetical bacterial proteins e.g. Q9KZL8|SCE34.07c from Streptomyces coelicolor (375 aa), FASTA scores: opt: 171, E(): 0.012, (31.1% identity in 376 aa overlap); P55709|Y4YA_RHISN from Rhizobium sp. strain NGR234 (457 aa), FASTA scores: opt: 140, E(): 0.84, (28.75% identity in 233 aa overlap). Protein product from Mb3284c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3284c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K7" /protein_id="SIU01913.1" /translation="MNVARAIDLEDTEGLIAADRGALLRAASMAGAQVRAIAAAADEG ELDLLRGSDRPRSVIWVTGRGTAETAGTILASTLGAGAAEPIVLASAAPPWVGPLDVL IVAGDDPGDPALVGAAAIGVRRGARVVVVAPYEGPLRDSTAGRVAVLEPRLRVPDEFG LSRYLAAGLAALQTVDPKLRIDLASLADELDAEALRNSAGREVFTNPAKALAARVSGC QLALAGDNAATLALARHGSSVMLRIANQVVAATRLSDAVVALRAGTPPDALFHDEEID GPAPQRLRVLALALAGERTVVAARVAGLDDAYLVAAEDVPELLDAPVGSGGAVLAVRL EMAAVYLRLVRG" CDS complement(3596322..3597719) /codon_start=1 /transl_table=11 /gene="pmma" /locus_tag="BQ2027_MB3285C" /product="probable phosphomannomutase pmma (pmm) (phosphomannose mutase)" /note="Mb3285c, manB, len: 465 aa. Equivalent to Rv3257c, len: 465 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 465 aa overlap). Probable manB, phosphomannomutase (EC 5.4.2.8) (see citation below), equivalent to Q9CCJ7|PMMA|ML0763 from Mycobacterium leprae (468 aa), FASTA scores: opt: 2533, E(): 2e-145, (83.1% identity in 468 aa overlap). Also similar to many e.g. Q9KZL6|MANB from Streptomyces coelicolor (454 aa), FASTA scores: opt: 1820, E(): 2e-102, (63.2% identity in 459 aa overlap); Q9PGN8|XF0260 from Xylella fastidiosa (500 aa), FASTA scores: opt: 1085, E(): 4.7e-58, (40.7% identity in 462 aa overlap); Q9EY19|MANB from Salmonella enterica subsp. arizonae (456 aa), FASTA scores: opt: 988, E(): 3.1e-52, (38.65% identity in 445 aa overlap); etc. BELONGS TO THE PHOSPHOHEXOSE MUTASES FAMILY. Note that previously known as pmmA. Protein product from Mb3285c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3285c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3K8" /db_xref="InterPro:IPR005841" /db_xref="InterPro:IPR005843" /db_xref="InterPro:IPR005844" /db_xref="InterPro:IPR005845" /db_xref="InterPro:IPR005846" /db_xref="InterPro:IPR016055" /db_xref="InterPro:IPR036900" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3K8" /protein_id="SIU01914.1" /translation="MSWPAAAVDRVIKAYDVRGLVGEEIDESLVTDLGAAFARLMRTE DARPVVIGHDMRDSSPSLADAFAAGVTGQGLDVVRVGLASTDQLYFASGLLDCPGAMF TASHNPAAYNGIKMCRAAAKPVGADTGLTAIRDDLIAGVARYDGTPGTIADQDVLVDY GAFLRSLVDTSGLRPLRVAVDAGNGMAGHTAPAVLGVIDSITLLPLYFELDGSFPNHE ANPLDPANLVDLQAYVRDTGADIGLAFDGDADRCFVVDERGQPVSPSTVTALVAAREL NREIGATIIHNVITSRAVPELVAERGGTPLRSRVGHSYIKALMAETGAIFGGEHSAHY YFRDFWGADSGMLAALHVLAALGEQSRPLSELTADYQRYESSGEINFTVVDSSACVEA VLKSFGNRIVSIDHLDGVTVDLGDDSWFNLRSSNTEPLLRLNVEGRSVGDVDAVVRQV SAEIAAQSAHAKAGP" CDS complement(3597821..3598312) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3286C" /product="sugar metabolism" /note="Mb3286c, -, len: 163 aa. Equivalent to Rv3258c, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Conserved hypothetical protein, equivalent to Q9CCJ8|ML0762 HYPOTHETICAL PROTEIN from Mycobacterium leprae (165 aa), FASTA scores: opt: 840, E(): 9.9e-42, (76.9% identity in 169 aa overlap). Also similar to Q9KZL4|SCE34.11c HYPOTHETICAL 15.0 KDA PROTEIN from Streptomyces coelicolor (140 aa), FASTA scores: opt: 353, E(): 1.1e-13, (48.3% identity in 147 aa overlap); and shows really weak similarity to other bacterial proteins. Mb3286c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR021888" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E7" /protein_id="SIU01915.1" /translation="MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDS TAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVR EGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPD PAD" CDS 3598435..3598854 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3287" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3287, -, len: 139 aa. Equivalent to Rv3259, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Conserved hypothetical protein, equivalent, but shorter 29 aa, to Q9CCJ9|ML0761 HYPOTHETICAL PROTEIN from Mycobacterium leprae (167 aa), FASTA scores: opt: 846, E(): 2.2e-47, (89.2% identity in 139 aa overlap). C-terminus highly similar to Q9S425 HYPOTHETICAL 6.0 KDA PROTEIN (FRAGMENT) from Mycobacterium smegmatis (54 aa), FASTA scores: opt: 275, E(): 2.7e-11, (81.15% identity in 53 aa overlap). Also similar to Q9KZL3|SCE34.12 from Streptomyces coelicolor (117 aa), FASTA scores: opt: 152, E(): 0.004, (34.15% identity in 126 aa overlap). Equivalent to AAK47699 from Mycobacterium tuberculosis strain CDC1551 (175 aa) but shorter 36 aa. Protein product from Mb3287 detected using SWATH mass spectrometry. Mb3287 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010428" /db_xref="InterPro:IPR038555" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B0" /protein_id="SIU01916.1" /translation="MRGPLLPPTVPGWRSRAERFDMAVLEAYEPIERRWQERVSQLDI AVDEIPRIAAKDPESVQWPPEVIADGPIALARLIPAGVDVRGNATRARIVLFRKPIER RAKDTEELGELLHEILVAQVAIYLDVDPSVIDPTIDD" CDS complement(3598882..3599151) /codon_start=1 /transl_table=11 /gene="whiB2" /locus_tag="BQ2027_MB3288C" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB2" /note="Mb3288c, whiB2, len: 89 aa. Equivalent to Rv3260c, len: 89 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 aa overlap). Probable whiB2, WhiB-like regulatory protein (see first citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q9CCK0|WHIB2|ML0760 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (89 aa), FASTA scores: opt: 550, E(): 6.1e-31, (85.4% identity in 89 aa overlap). Also similar to others e.g. Q9S426 WHMD REGULATORY PROTEIN (see second citation below) from Mycobacterium smegmatis (129 aa), FASTA scores: opt: 488, E(): 1.4e-26, (83.55% identity in 85 aa overlap); Q06387|WHIB-STV WHIB-STV PROTEIN from Streptomyces griseocarneus (87 aa), FASTA scores: opt: 443, E(): 1.2e-23, (74.7% identity in 83 aa overlap); Q05429|WHIB|WHIB1 TRANSCRIPTION-LIKE FACTOR WHIB from Streptomyces aureofaciens (87 aa), FASTA scores: opt: 442, E(): 1.3e-23, (74.7% identity in 83 aa overlap); etc. Equivalent to AAK47700 WhiB-related protein from Mycobacterium tuberculosis strain CDC1551 (123 aa) but shorter 34 aa. Also similar to other Mycobacterium tuberculosis proteins: MTCY07D11.07c (45.1% identity in 71 aa overlap) and MTCY78.13c (37.4% identity in 91 aa overlap). Start chosen by homology but ORF continues to ATG upstream at 3754. Mb3288c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S0" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S0" /protein_id="SIU01917.1" /translation="MVPEAPAPFEEPLPPEATDQWQDRALCAQTDPEAFFPEKGGSTR EAKKICMGCEVRHECLEYALAHDERFGIWGGLSERERRRLKRGII" CDS 3599553..3600548 /codon_start=1 /transl_table=11 /gene="fbiA" /locus_tag="BQ2027_MB3289" /product="PROBABLE F420 BIOSYNTHESIS PROTEIN FBIA" /note="Mb3289, fbiA, len: 331 aa. Equivalent to Rv3261, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 331 aa overlap). Probable fbiA, F420 biosynthesis protein, equivalent to FBIA F420 biosynthesis protein fbiA from Mycobacterium bovis BCG (see citations below). Also equivalent, but shorter 46 aa, to Q9CCK1|ML0759 HYPOTHETICAL PROTEIN from Mycobacterium leprae (379 aa), FASTA scores: opt: 1855, E(): 3.9e-110, (79.3% identity in 333 aa overlap). Also similar to others e.g. Q9KZK9|SCE34.17 HYPOTHETICAL 33.6 KDA PROTEIN from Streptomyces coelicolor (319 aa), FASTA scores: opt: 1151, E(): 1.2e-65, (55.1% identity in 332 aa overlap); O29345|AF0917 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (296 aa), FASTA scores: opt: 469, E(): 1.7e-22, (31.15% identity in 302 aa overlap); Q58653|MJ1256 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (311 aa), FASTA scores: opt: 436, E(): 2.2e-20, (27.35% identity in 274 aa overlap); etc. Protein product from Mb3289 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3289 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWV4" /db_xref="InterPro:IPR002882" /db_xref="InterPro:IPR010115" /db_xref="UniProtKB/Swiss-Prot:Q7TWV4" /protein_id="SIU01918.1" /translation="MKVTVLAGGVGGARFLLGVQQLLGLGQFAANSAHSDADHQLSAV VNVGDDAWIHGLRVCPDLDTCMYTLGGGVDPQRGWGQRDETWHAMQELVRYGVQPDWF ELGDRDLATHLVRTQMLQAGYPLSQITEALCDRWQPGARLLPATDDRCETHVVITDPV DESRKAIHFQEWWVRYRAQVPTHSFAFVGAEKSSAATEAIAALADADIIMLAPSNPVV SIGAILAVPGIRAALREATAPIVGYSPIIGEKPLRGMADTCLSVIGVDSTAAAVGRHY GARCATGILDCWLVHDGDHAEIDGVTVRSVPLLMTDPNATAEMVRAGCDLAGVVA" CDS 3600545..3601891 /codon_start=1 /transl_table=11 /gene="fbiB" /locus_tag="BQ2027_MB3290" /product="PROBABLE F420 BIOSYNTHESIS PROTEIN FBIB" /note="Mb3290, fbiB, len: 448 aa. Equivalent to Rv3262, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 448 aa overlap). Probable fbiB, F420 biosynthesis protein, equivalent to FBIB F420 biosynthesis protein fbiB from Mycobacterium bovis BCG (see citations below). Also equivalent to Q9CCK2|ML0758 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (457 aa), FASTA scores: opt: 2411, E(): 3.5e-137, (82.25% identity in 445 aa overlap). Also similar to Q9KZK8|SCE34.18 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (443 aa), FASTA scores: opt: 1180, E(): 2.2e-63, (51.75% identity in 433 aa overlap); other oxidoreductases in C-terminus; and several hypothetical bacterial proteins. Protein product from Mb3290 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3290 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWV3" /db_xref="InterPro:IPR000415" /db_xref="InterPro:IPR002847" /db_xref="InterPro:IPR008225" /db_xref="InterPro:IPR019943" /db_xref="InterPro:IPR023661" /db_xref="InterPro:IPR029479" /db_xref="UniProtKB/Swiss-Prot:Q7TWV3" /protein_id="SIU01919.1" /translation="MTGPEHGSASTIEILPVIGLPEFRPGDDLSAAVAAAAPWLRDGD VVVVTSKVVSKCEGRLVPAPEDPEQRDRLRRKLIEDEAVRVLARKDRTLITENRLGLV QAAAGVDGSNVGRSELALLPVDPDASAATLRAGLRERLGVTVAVVITDTMGRAWRNGQ TDAAVGAAGLAVLRNYAGVRDPYGNELVVTEVAVADEIAAAADLVKGKLTATPVAVVR GFGVSDDGSTARQLLRPGANDLFWLGTAEALELGRQQAQLLRRSVRRFSTDPVPGDLV EAAVAEALTAPAPHHTRPTRFVWLQTPAIRARLLDRMKDKWRSDLTSDGLPADAIERR VARGQILYDAPEVVIPMLVPDGAHSYPDAARTDAEHTMFTVAVGAAVQALLVALAVRG LGSCWIGSTIFAADLVRDELDLPVDWEPLGAIAIGYADEPSGLRDPVPAADLLILK" CDS 3602187..3603848 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3291" /product="PROBABLE DNA METHYLASE (MODIFICATION METHYLASE) (METHYLTRANSFERASE)" /note="Mb3291, -, len: 553 aa. Equivalent to Rv3263, len: 553 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 553 aa overlap). Probable DNA methylase (EC 2.1.1.-), equivalent to Q9CCK4|ML0756 PROBABLE DNA METHYLASE from Mycobacterium leprae (555 aa), FASTA scores: opt: 2980, E(): 2.1e-184, (81.9% identity in 541 aa overlap). Also similar to others e.g. P25240|MT57_ECOLI|ECO57IM MODIFICATION METHYLASE from Escherichia coli (544 aa), FASTA scores: opt: 595, E(): 1e-30, (30.35% identity in 507 aa overlap); P25201|MTA1_ACICA|ACCIM MODIFICATION METHYLASE ACCI from Acinetobacter calcoaceticus (540 aa), FASTA scores: opt: 366, E(): 5.7e-16, (23.35% identity in 467 aa overlap); Q56752|M-ACCI ACCI METHYLASE from Bergeyella zoohelcum (541 aa), FASTA scores: opt: 365, E(): 6.6e-16, (22.95% identity in 466 aa overlap); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature. Alternative start site at aa 25. Mb3291 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M1" /db_xref="InterPro:IPR002052" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M1" /protein_id="SIU01920.1" /translation="MQPSHPTRPGAVIRYVGSSLDTCPMTTFAGKTAASADKVRGGYY TPPAVARFLAHWVHQAGPKILEPSCGDGRILRELSAITDHAHGVELVAREAKKSRDFA SVDTENLFTWLHKTQLGSWDGVAGNPPYIRFGNWASEQRDPALELMRRVGLRPTKLTN AWVPFVVASTTLARDGGRVGLVVPAELLQVTYAAQLREFLLSRYREITLVTFERLVFD GILQEVVLFCGVVGPGPAHIRTVRLGDANDLNALGDKDFTNESAPALLHEKEKWTKYF LDPAQIRLLRGLKQSATMIRLGELADVDVGIVTGRNSFFTFTDAKAQALGLRAHCVPL VSRSAQLSGLIYDEDCRACDVAGNHRTWLLDAADYPTDPALVAHITAGEAAGVHLGYK CSIRKPWWSTPSLWMPDLFMLRQIHFAPRLTVNAAAATSTDTVHRVRLDPNVDPATLA AVFHNSATFAFAEIMGRSYGGGILELEPREAEQLPMPPPAYGSAELAQDVDLLLKANE IDKALDVVDRHVLIDGLGLSPRLVAGCRAAWLTLRDRRTKRGSRR" CDS complement(3603908..3604987) /codon_start=1 /transl_table=11 /gene="manC" /locus_tag="BQ2027_MB3292C" /product="d-alpha-d-mannose-1-phosphate guanylyltransferase manb (d-alpha-d-heptose-1-phosphate guanylyltransferase)" /note="Mb3292c, manC, len: 359 aa. Equivalent to Rv3264c, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable manC, mannose-1-phosphate guanyltransferase (EC 2.7.7.22), equivalent to Q9CCK6|RMLA2|ML0753 PUTATIVE SUGAR-PHOSPHATE NUCLEOTIDYL TRANSFERASE from Mycobacterium leprae (358 aa), FASTA scores: opt: 2075, E(): 2.7e-115, (86.9% identity in 359 aa overlap). Also similar to others e.g. Q9KZK6|SCE34.20c PUTATIVE NUCLEOTIDE PHOSPHORYLASE from Streptomyces coelicolor (360 aa), FASTA scores: opt: 1314, E(): 2.2e-70, (57.0% identity in 358 aa overlap); Q9KZP4|SC1A8A.08 PUTATIVE MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Streptomyces coelicolor (831 aa), FASTA scores: opt: 699, E(): 8.6e-34, (34.45% identity in 354 aa overlap) (only similarity in N-terminus for this one); P74589|SLL1496 MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Synechocystis sp. strain PCC 6803 (843 aa), FASTA scores: opt: 692, E(): 2.3e-33, (35.1% identity in 342 aa overlap) (only similarity in N-terminus for this one too); BAB59222|TVG0079558 MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Thermoplasma volcanium (359 aa), FASTA scores: opt: 664, E(): 5.2e-32, (34.6% identity in 338 aa overlap); Q9ZTW5|GMP GDP-MANNOSE PYROPHOSPHORYLASE from Solanum tuberosum (Potato) (361 aa), FASTA scores: opt: 636, E(): 2.3e-30, (34.65% identity in 361 aa overlap); etc. BELONGS TO FAMILY 2 OF MANNOSE-6-PHOSPHATE ISOMERASES. Note that previously known as rmlA2. Protein product from Mb3292c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3292c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3L4" /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR005835" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L4" /protein_id="SIU01921.1" /translation="MATHQVDAVVLVGGKGTRLRPLTLSAPKPMLPTAGLPFLTHLLS RIAAAGIEHVILGTSYKPAVFEAEFGDGSALGLQIEYVTEEHPLGTGGGIANVAGKLR NDTAMVFNGDVLSGADLAQLLDFHRSNRADVTLQLVRVGDPRAFGCVPTDEEDRVVAF LEKTEDPPTDQINAGCYVFERNVIDRIPQGREVSVEREVFPALLADGDCKIYGYVDAS YWRDMGTPEDFVRGSADLVRGIAPSPALRGHRGEQLVHDGAAVSPGALLIGGTVVGRG AEIGPGTRLDGAVIFDGVRVEAGCVIERSIIGFGARIGPRALIRDGVIGDGADIGARC ELLSGARVWPGVFLPDGGIRYSSDV" CDS complement(3604989..3605882) /codon_start=1 /transl_table=11 /gene="wbbL1" /locus_tag="BQ2027_MB3293C" /standard_name="wbbL" /product="dtdp-rha:a-d-glcnac-diphosphoryl polyprenol,a-3-l-rhamnosyl transferase wbbl1 (alpha-l-rhamnose-(1->3)-alpha-d-glcnac(1->p)-p- decaprenyl)" /note="Mb3293c, wbbL1, len: 297 aa. Equivalent to Rv3265c, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 301 aa overlap). Probable wbbL1, dTDP-RHA:A-D-GLCNAC-DIPHOSPHORYL POLYPRENOL A-3-L-RHAMNOSYL TRANSFERASE (EC 2.-.-.-) (see citation below), equivalent to Q9CCK7|WBBL|ML0752 PUTATIVE DTDP-RHAMNOSYL TRANSFERASE from Mycobacterium leprae (308 aa), FASTA scores: opt: 1788, E(): 3e-104, (85.05% identity in 301 aa overlap); and Q9RN50|WBBL|Q9RN49 (see note * below) DTDP-RHA:A-D-GLCNAC-DIPHOSPHORYL POLYPRENOL, A-3-L-RHAMNOSYL TRANSFERASE from Mycobacterium smegmatis (296 aa), FASTA scores: opt: 1494, E(): 6.1e-86, (72.35% identity in 293 aa overlap). Note that previously known as wbbL. [* Note: UNPUBLISHED (experimental study on Mycobacterium smegmatis). Submitted (SEP-1999) to the EMBL/GenBank/DDBJ databases - The cell wall arabinogalactan linker formation enzyme, dTDP-Rha:a-D-GlcNAc-diphosphoryl polyprenol, a-3-L-rhamnosyl transferase is essential for mycobacterial viability - Mills J.A., Motichka K., Jucker M., Wu H.P., Uhlic B.C., Stern R.J., Scherman M.S., Vissa V.D., Yan W., Pan F., Kimbrel S., Kundu M., McNeil M.]. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame deletion of 12 bp leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (297 aa versus 301 aa). Protein product from Mb3293c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3293c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3L3" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3L3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01922.1" /translation="MVAVTYSPGPHLERFLASLSLATERPVSVLLADNGSTDGTPQAA VQRYPNVRLLPTGANLGYGTAVNRTIAQLGEMAGDAGEPWVDDWVIVANPDVQWGPGS IDALLDAASRWPRAGALGPLIRDPDGSVYPSARQMPSLIRGGMHAVLGPFWPRNPWTT AYRQERLEPSERPVGWLSGSCLLVRRSAFGQVGGFDERYFMYMEDVDLGDRLGKAGWL SVYVPSAEVLHHKAHSTGRDPASHLAAHHKSTYIFLADRHSGWWRAPLRWTLRGSLAL RSHLMVRRSRRRKLKLVEGRH" CDS complement(3605893..3606807) /codon_start=1 /transl_table=11 /gene="rmlD" /locus_tag="BQ2027_MB3294C" /product="dtdp-6-deoxy-l-lyxo-4-hexulose reductase rmld (dtdp-rhamnose modification protein) (dtdp-rhamnose biosynthesis protein) (dtdp-rhamnose synthase)" /note="Mb3294c, rmld, len: 304 aa. Equivalent to Rv3266c, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 304 aa overlap). Possible rmld, dTDP-rhamnose modification protein, highly similar to Q9CCK8 putative dTDP-rhamnose modification protein from Mycobacterium leprae (311 aa), FASTA scores, opt: 1440, E(): 1.1e-78, (74.7% identity in 312 aa overlap); and similar to several dtdp-4-dehydrorhamnose reductase (EC 1.1.1.133) e.g. STRL_STRGR|P29781 from Streptomyces griseus (304 aa), FASTA scores, opt: 788, E(): 0, (47.4% identity in 304 aa overlap). Protein product from Mb3294c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3294c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M5" /db_xref="InterPro:IPR005913" /db_xref="InterPro:IPR029903" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M5" /protein_id="SIU01923.1" /translation="MAGRSERLVITGAGGQLGSHLTAQAAREGRDMLALTSSQWDITD PAAAERIIRHGDVVINCAAYTDVDGAESNEAVAYAVNATGPQHLARACARVGARLIHV STDYVFDGDFGGAEPRPYEPTDETAPQGVYARSKLAGEQAVLAAFPEAAVVRTAWVYT GGTGKDFVAVMRRLAAGHGRVDVVDDQTGSPTYVADLAEALLALADAGVRGRVLHAAN EGVVSRFGQARAVFEECGADPQRVRPVSSAQFPRPAPRPSYSALSSRQWALAGLTPLR HWRSALATALAAPANSTSIDRRLPSTRD" CDS 3606883..3608379 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3295" /product="Cell envelope-associated transcriptional attenuator LytR-CpsA-Psr, subfamily A1" /note="Mb3295, -, len: 498 aa. Equivalent to Rv3267, len: 498 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 498 aa overlap). Conserved hypothetical protein, CPSA-related protein, equivalent to Q9CCK9|ML0750 HYPOTHETICAL PROTEIN from Mycobacterium leprae (489 aa), FASTA scores: opt: 2523, E(): 5e-138, (78.9% identity in 498 aa overlap); and Q50160|CPSA (HYPOTHETICAL PROTEIN CPSA) from Mycobacterium leprae (516 aa), FASTA scores: opt: 868, E(): 1.2e-42, (34.7% identity in 507 aa overlap). Also similar to O06347|CPSA|Rv3484|MTCY13E12.37 CPSA from Mycobacterium tuberculosis (512 aa), FASTA scores: opt: 928, E(): 4.2e-46, (37.35% identity in 498 aa overlap); and O53834|Rv0822c|MTV043.14c HYPOTHETICAL 72.9 KDA PROTEIN from Mycobacterium tuberculosis (684 aa), FASTA scores: opt: 434, E(): 1.5e-17, (30.9% identity in 541 aa overlap). Also similar to Q9KZK0|SCE34.26 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (507 aa), FASTA scores: opt: 437, E(): 8.1e-18, (28.55% identity in 469 aa overlap); O68907 FRNA PROTEIN from Streptomyces roseofulvus (770 aa), FASTA scores: opt: 388, E(): 7.6e-15, (32.6% identity in 267 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb3295 detected using SWATH mass spectrometry. Mb3295 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004474" /db_xref="InterPro:IPR027381" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M3" /protein_id="SIU01924.1" /translation="MMSAQRVVRTVRTARAISTALAVAIVLGTGVAWSSVRSFEDGIF HMSAPSLGHGGDDGAIDILLVGLDSRTDAHGNPLSAEELATLHAGDEEATNTDTIILI RVPNNGKSATAISIPRDSYVAAPGLGKTKINGVYGQTRETKRAGLVQAGASPTEAAAA GTEAGREALIKTVADLTGVTVDHYAEIGLLGFALIADALGGVDVCLKEPVYEPLSGAD FPAGRQKLNGPQALSFVRQRHDLPRGDLDRVVRQQAVMAALAHRVISGQTLSSPATLK RLEQAVQRSVVLSSGWDIMDFVRQLQKLAGGNVAFATIPVLDGAGWSDDGMQSVVRVD PRQVQDWVVGLLHEQDQGKTDELAYTPAKTTANVVNDTDINGLAAAVSKVLSSKGFTT GSVGNNDGDHVPGSQVRAAKADDLGAQQVAKELGGLPVVADASIAPGSVRVVLANDYS GPGSGLGGSDPNGVVSPARAFNLGSADDTTPPPSPILTAGSDAPECIN" CDS 3608418..3609107 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3296" /product="regulated by transcription" /note="Mb3296, -, len: 229 aa. Equivalent to Rv3268, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Conserved hypothetical protein, similar to Q9KZK4|SCE34.22 HYPOTHETICAL 27.1 KDA PROTEIN from Streptomyces coelicolor (263 aa), FASTA scores: opt: 442, E(): 5.9e-20, (40.1% identity in 242 aa overlap). Also weak similarity to N-terminal part (approximatively 1530 to 1740 residues) of O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 159, E(): 0.11, (30.35% identity in 224 aa overlap). Protein product from Mb3296 detected using SWATH mass spectrometry. Mb3296 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017523" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G3" /protein_id="SIU01925.1" /translation="MLRADPVGPRITYYDDATGERIELSAVTLANWAAKTGNLLRDEL AAGPASRVAILLPAHWQTAAVLFGVWWIGAQAILDDSPADVALCTADRLAEADAVVNS AAVAGEVAVLSLDPFGRPATGLPVGVTDYATAVRVHGDQIVPEHNPGPVLAGRSVEQI LRDCAASAAARGLTAADRVLSTASWAGPDELVDGLLAILAAGASLVQVANPDPAMLQR RIATEKVTRVL" CDS 3609232..3609513 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3297" /product="conserved protein" /note="Mb3297, -, len: 93 aa. Equivalent to Rv3269, len: 93 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 93 aa overlap). Conserved hypothetical protein, similar to many Mycobacterium proteins and chaperonins/heat shock proteins e.g. Q9CCL0|ML0748 HYPOTHETICAL PROTEIN from Mycobacterium leprae (92 aa), FASTA scores: opt: 427, E(): 6.8e-21, (73.65% identity in 91 aa overlap); Q10865|Rv1993c|MT2049|MTCY39.26c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (90 aa), FASTA scores: opt: 313, E(): 1.2e-13, (60.7% identity in 84 aa overlap); P71542|Y968_MYCTU|Rv0968|MTCY10D7.06c (98 aa), FASTA scores: opt: 294, E(): 2.2e-12, (55.1% identity in 98 aa overlap); Q50827|MOPA|GROEL|CH60_MYCVA CHAPERONIN (PROTEIN CPN60) from Mycobacterium vaccae (120 aa), FASTA scores: opt: 107, E(): 2.1, (39.5% identity in 81 aa overlap); Q9AEB3|HSP65 HEAT SHOCK PROTEIN (FRAGMENT) from Mycobacterium gadium (122 aa), FASTA scores: opt: 102, E(): 4.4, (38.25% identity in 81 aa overlap); Q49374|CH60_MYCGN|MOPA|GROEL CHAPERONIN (PROTEIN CPN60) from Mycobacterium genavense (120 aa), FASTA scores: opt: 99, E(): 6.8, (40.25% identity in 82 aa overlap); etc. Protein product from Mb3297 detected using shotgun mass spectrometry. Mb3297 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009963" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01926.1" /translation="MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAA ALGLRGTRKAEEAAESARLKVADVMAEARERIGEESPTPAISDLHDHDH" CDS 3609524..3611680 /codon_start=1 /transl_table=11 /gene="ctpC" /locus_tag="BQ2027_MB3298" /product="PROBABLE METAL CATION-TRANSPORTING P-TYPE ATPASE C CTPC" /note="Mb3298, ctpC, len: 718 aa. Equivalent to Rv3270, len: 718 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 718 aa overlap). Probable ctpC, metal cation-transport ATPase P-type (EC 3.6.3.-), integral membrane protein, equivalent to Q9CCL1|CTPC|ML0747 PUTATIVE CATION TRANSPORT ATPASE from Mycobacterium leprae (725 aa), FASTA scores: opt: 3908, E(): 0, (85.95% identity in 713 aa overlap). Also similar to O66027|MTAA METAL TRANSPORTING ATPASE MTA72 from Mycobacterium tuberculosis (680 aa), FASTA scores: opt: 3756, E(): 5.5e-213, (91.45% identity in 679 aa overlap); and to other ATPases e.g. Q9ZHC7|SILP_SALTY PUTATIVE CATION TRANSPORTING P-TYPE ATPASE from Salmonella typhimurium (824 aa), FASTA scores: opt: 1145, E(): 1.3e-59, (36.55% identity in 643 aa overlap); Q9HX93|PA3920 PROBABLE METAL TRANSPORTING P-TYPE ATPASE from Pseudomonas aeruginosa (792 aa), FASTA scores: opt: 1140, E(): 2.4e-59, (35.95% identity in 745 aa overlap); etc. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Protein product from Mb3298 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3298 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A503" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR027256" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:P0A503" /protein_id="SIU01927.1" /translation="MTLEVVSDAAGRMRVKVDWVRCDSRRAVAVEEAVAKQNGVRVVH AYPRTGSVVVWYSPRRADRAAVLAAIKGAAHVAAELIPARAPHSAEIRNTDVLRMVIG GVALALLGVRRYVFARPPLLGTTGRTVATGVTIFTGYPFLRGALRSLRSGKAGTDALV SAATVASLILRENVVALTVLWLLNIGEYLQDLTLRRTRRAISELLRGNQDTAWVRLTD PSAGSDAATEIQVPIDTVQIGDEVVVHEHVAIPVDGEVVDGEAIVNQSAITGENLPVS VVVGTRVHAGSVVVRGRVVVRAHAVGNQTTIGRIISRVEEAQLDRAPIQTVGENFSRR FVPTSFIVSAIALLITGDVRRAMTMLLIACPCAVGLSTPTAISAAIGNGARRGILIKG GSHLEQAGRVDAIVFDKTGTLTVGRPVVTNIVAMHKDWEPEQVLAYAASSEIHSRHPL AEAVIRSTEERRISIPPHEECEVLVGLGMRTWADGRTLLLGSPSLLRAEKVRVSKKAS EWVDKLRRQAETPLLLAVDGTLVGLISLRDEVRPEAAQVLTKLRANGIRRIVMLTGDH PEIAQVVADELGIDEWRAEVMPEDKLAAVRELQDDGYVVGMVGDGINDAPALAAADIG IAMGLAGTDVAVETADVALANDDLHRLLDVGDLGERAVDVIRQNYGMSIAVNAAGLLI GAGGALSPVLAAILHNASSVAVVANSSRLIRYRLDR" CDS complement(3611677..3612345) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3299C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3299c, -, len: 222 aa. Equivalent to Rv3271c, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 222 aa overlap). Probable conserved integral membrane protein, similar to others e.g. Q9RD35|SCM1.07c from Streptomyces coelicolor (230 aa), FASTA scores: opt: 360, E(): 4.7e-16, (33.85% identity in 195 aa overlap); Q9X897|SCE2.02c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 357, E(): 7.3e-16, (33.85% identity in 195 aa overlap); Q9D0E0 2610024A01RIK PROTEIN from Mus musculus (Mouse) (288 aa), FASTA scores: opt: 191, E(): 3.7e-05, (23.65% identity in 207 aa overlap). Mb3299c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3X5" /db_xref="InterPro:IPR002524" /db_xref="InterPro:IPR026765" /db_xref="InterPro:IPR027469" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X5" /protein_id="SIU01928.1" /translation="METTTEHRDESTLDSPVSVAREAEWQRNVRWARWLAWVSLAVLL TEGAVGLWQGIAVGSVALTGWALGGGSEGLASAMVLWRFTGDRTWSATAEHRAQRGVA VSFWLTAPYLVAESIRHLAGEHRAETSVIGIGLTAIALLLMPVLGWANHRVGERLGSG ATAGEGTQNYLCAAQAAAVLLGLAITAVWSNGWWIDPAIGLAIAGIAVWQGIRTWRGH GCGC" CDS 3612446..3613630 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3300" /product="L-carnitine dehydratase/bile acid-inducible protein F" /note="Mb3300, -, len: 394 aa. Equivalent to Rv3272, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 394 aa overlap). Conserved hypothetical protein, similar to various proteins e.g. Q9I672|PA0446 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (407 aa), FASTA scores: opt: 643, E(): 6.8e-32, (33.15% identity in 389 aa overlap); Q9RJU8|SCF41.21 PUTATIVE RACEMASE from Streptomyces coelicolor (403 aa), FASTA scores: opt: 541, E(): 1.1e-25, (31.95% identity in 385 aa overlap); O87838|SC8A6.04c PUTATIVE TRANSFERASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 539, E(): 1.5e-25, (29.95% identity in 395 aa overlap); Q9I563|PA0882 from Pseudomonas aeruginosa (400 aa), FASTA scores: opt: 530, E(): 5.2e-25, (28.8% identity in 396 aa overlap); BAB60328|TVG1215416 L-CARNITINE DEHYDRATASE from Thermoplasma volcanium (399 aa), FASTA scores: opt: 529, E(): 6e-25, (32.9% identity in 383 aa overlap); etc. C-terminus is similar to Q49678|U00012_27|B1308_C3_195 from Mycobacterium leprae (130 aa) (60.0% identity in 115 aa overlap). Also partially similar to MTCY359_7 from M. tuberculosis (778 aa) (29.9% identity in 388 aa overlap). Protein product from Mb3300 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3300 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3M7" /db_xref="InterPro:IPR003673" /db_xref="InterPro:IPR023606" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M7" /protein_id="SIU01929.1" /translation="MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAP GGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDLTTEQAKQQMLRLADTADVVLEA FRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQL MMHLNRAASDQPKPEPAPKAKRRKGVGFATQPSDAFRTADGYIVISAYVPKHWQKLCY LIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQLLQANGLMACLAHT WKQVVDTPLFAESDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLA RP" CDS 3613635..3615929 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3301" /product="PROBABLE TRANSMEMBRANE CARBONIC ANHYDRASE (CARBONATE DEHYDRATASE) (CARBONIC DEHYDRATASE)" /note="Mb3301, -, len: 764 aa. Equivalent to Rv3273, len: 764 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 764 aa overlap). Probable transmembrane protein (N-terminal part is hydrophobic) with probable carbonic anhydrase activity (in C-terminal part) (EC 4.2.1.1). Possibly involved in transport of sulfate. Equivalent to Q9CBA3|ML2279 PUTATIVE TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium leprae (496 aa), FASTA scores: opt: 1637, E(): 1.8e-89, (59.15% identity in 487 aa overlap). Similar to various proteins (principally sulfate transporters) e.g. Q9X927|SCH5.25 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (830 aa), FASTA scores: opt: 1325, E(): 8e-71, (40.85% identity in 788 aa overlap); Q9I729|PA0103 PROBABLE SULFATE TRANSPORTER from Pseudomonas aeruginosa (523 aa), FASTA scores: opt: 1015, E(): 1.3e-52, (39.95% identity in 488 aa overlap); Q9KN88|VCA0077 SULFATE PERMEASE FAMILY PROTEIN from Vibrio cholerae (553 aa), FASTA scores: opt: 629, E(): 9.6e-30, (30.95% identity in 423 aa overlap); etc. C-terminal part (aa 550-764) shows similarity to carbonic anhydrase e.g. P27134|CYNT_SYNP7 CARBONIC ANHYDRASE (EC 4.2.1.1) (272 aa), FASTA scores: opt: 350, E(): 8.1e-15, (33.8% identity in 201 aa overlap). Contains PS00704 Prokaryotic-type carbonic anhydrases signature 1. SEEMS TO BELONG TO THE SULP FAMILY. Protein product from Mb3301 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3301 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3N0" /db_xref="InterPro:IPR001765" /db_xref="InterPro:IPR001902" /db_xref="InterPro:IPR011547" /db_xref="InterPro:IPR015892" /db_xref="InterPro:IPR036874" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3N0" /protein_id="SIU01930.1" /translation="MTIPRSQHMSTAVNSCTEAPASRSQWMLANLRHDVPASLVVFLV ALPLSLGIAIASGAPIIAGVIAAVVGGIVAGAVGGSPVQVSGPAAGLTVVVAELIDEL GWPMLCLMTIAAGALQIVFGLSRMARAALAIAPVVVHAMLAGIGITIALQQIHVLLGG TSHSSAWRNIVALPDGILHHELHEVIVGGTVIAILLMWSKLPAKVRIIPGPLVAIAGA TVLALLPVLQTERIDLQGNFFDAIGLPKLAEMSPGGQPWSHEISAIALGVLTIALIAS VESLLSAVGVDKLHHGPRTDFNREMVGQGSANVVSGLLGGLPITGVIVRSSANVAAGA RTRMSTILHGVWILLFASLFTNLVELIPKAALAGLLIVIGAQLVKLAHIKLAWRTGNF VIYAITIVCVVFLNLLEGVAIGLVVAIVFLLVRVVRAPVEVKPVGGEQSKRWRVDIDG TLSFLLLPRLTTVLSKLPEGSEVTLNLNADYIDDSVSEAISDWRRAHETRGGVVAIVE TSPAKLHHAHARPPKRHFASDPIGLVPWRSARGKDRGSASVLDRIDEYHRNGAAVLHP HIAGLTDSQDPYELFLTCADSRILPNVITASGPGDLYTVRNLGNLVPTDPDDRSVDAA LDFAVNQLGVSSVVVCGHSSCAAMTALLEDDPANTTTPMMRWLENAHDSLVVFRNHHP ARRSAESAGYPEADQLSIVNVAVQVERLTRHPILATAVAAADLQVIGIFFDISTARVY EVGPNGIICPDEPADRPVDHESAQ" CDS complement(3615918..3617087) /codon_start=1 /transl_table=11 /gene="fadE25" /locus_tag="BQ2027_MB3302C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE25" /note="Mb3302c, fadE25, len: 389 aa. Equivalent to Rv3274c, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 389 aa overlap). Probable fadE25, Acyl-CoA Dehydrogenase (EC 1.3.99.-), equivalent to P46703|ACDP_MYCLE|FADE25|ACD|ML0737|B1308_F1 _34 PROBABLE ACYL-COA DEHYDROGENASE FADE25 from Mycobacterium leprae (389 aa), FASTA scores: opt: 2394, E(): 3.8e-143, (92.05% identity in 389 aa overlap). Also similar to many e.g. Q9RIQ5|FADE FATTY ACID ACYL-COA DEHYDROGENASE from Streptomyces lividans (385 aa), FASTA scores: opt: 1692, E(): 4.9e-99, (67.35% identity in 383 aa overlap); P45867|ACDA_BACSU|ACD from Bacillus subtilis (379 aa), FASTA scores: opt: 1212, E(): 7.2e-69, (51.85% identity in 376 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 1209, E(): 1.1e-68, (51.7% identity in 377 aa overlap); P52042|ACDS_CLOAB|BCD from Clostridium acetobutylicum (379 aa), FASTA scores: opt: 1056, E(): 4.6e-59, (44.6% identity in 379 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3302c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3302c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63428" /db_xref="InterPro:IPR006089" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/Swiss-Prot:P63428" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01931.1" /translation="MVGWAGNPSFDLFKLPEEHDEMRSAIRALAEKEIAPHAAEVDEK ARFPEEALVALNSSGFNAVHIPEEYGGQGADSVATCIVIEEVARVDASASLIPAVNKL GTMGLILRGSEELKKQVLPALAAEGAMASYALSEREAGSDAASMRTRAKADGDHWILN GAKCWITNGGKSTWYTVMAVTDPDRGANGISAFMVHKDDEGFTVGPKERKLGIKGSPT TELYFENCRIPGDRIIGEPGTGFKTALATLDHTRPTIGAQAVGIAQGALDAAIAYTKD RKQFGESISTFQAVQFMLADMAMKVEAARLMVYSAAARAERGEPDLGFISAASKCFAS DVAMEVTTDAVQLFGGAGYTTDFPVERFMRDAKITQIYEGTNQIQRVVMSRALLR" CDS complement(3617112..3617636) /codon_start=1 /transl_table=11 /gene="purE" /locus_tag="BQ2027_MB3303C" /product="PROBABLE PHOSPHORIBOSYLAMINOIMIDAZOLE CARBOXYLASE CATALYTIC SUBUNIT PURE (AIR CARBOXYLASE) (AIRC)" /note="Mb3303c, purE, len: 174 aa. Equivalent to Rv3275c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 174 aa overlap). Probable purE, phosphoribosylaminoimidazole carboxylase catalytic subunit (EC 4.1.1.21), equivalent to P46702|PUR6_MYCLE|PURE|ML0736|B1308_F3_98 from Mycobacterium leprae (171 aa), FASTA scores: opt: 878, E(): 1.5e-43, (81.55% identity in 168 aa overlap). Also similar to others e.g. Q9AXD0|AIRC from Nicotiana tabacum (Common tobacco) (623 aa), FASTA scores: opt: 712, E(): 1.4e-33, (69.35% identity in 160 aa overlap) (similarity in C-terminal part for this one); Q44679|PUR6_CORAM from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (177 aa), FASTA scores: opt: 651, E(): 1.5e-30, (68.25% identity in 148 aa overlap); Q55498|PUR6_SYNY3|PURE|SLL0901 from Synechocystis sp. strain PCC 6803 (176 aa), FASTA scores: opt: 639, E(): 7.1e-30, (60.5% identity in 167 aa overlap); etc. Protein product from Mb3303c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3303c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3M8" /db_xref="InterPro:IPR000031" /db_xref="InterPro:IPR024694" /db_xref="InterPro:IPR033747" /db_xref="InterPro:IPR035893" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3M8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01932.1" /translation="MTPAGERPRVGVIMGSDSDWPVMADAAAALAEFDIPAEVRVVSA HRTPEAMFSYARGAAERGLEVIIAGAGGAAHLPGMVAAATPLPVIGVPVPLGRLDGLD SLLSIVQMPAGVPVATVSIGGARNAGLLAVRMLGAANPQLRARIVAFQDRLADVVAAK DAELQRLAGKLTRD" CDS complement(3617633..3618922) /codon_start=1 /transl_table=11 /gene="purK" /locus_tag="BQ2027_MB3304C" /product="PROBABLE PHOSPHORIBOSYLAMINOIMIDAZOLE CARBOXYLASE ATPASE SUBUNIT PURK (AIR CARBOXYLASE) (AIRC)" /note="Mb3304c, purK, len: 429 aa. Equivalent to Rv3276c, len: 429 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 429 aa overlap). Probable purK, phosphoribosylaminoimidazole carboxylase ATPase subunit (EC 4.1.1.21), equivalent to P46701|PURK_MYCLE|ML0735|B1308_F1_32 PHOSPHORIBOSYLAMINOIMIDAZOLE CARBOXYLASE ATPASE SUBUNIT from Mycobacterium leprae (439 aa), FASTA scores: opt: 2168, E(): 2.3e-123, (76.15% identity in 444 aa overlap). Also similar to others e.g. Q44678|PURK_CORAM from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (413 aa), FASTA scores: opt: 1179, E(): 9.1e-64, (48.35% identity in 389 aa overlap); Q9KZ85|PURK from Streptomyces coelicolor (368 aa), FASTA scores: opt: 1150, E(): 4.7e-62, (55.35% identity in 345 aa overlap); Q54975|PURK_SYNP7 from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (395 aa), FASTA scores: opt: 772, E(): 3e-39, (38.1% identity in 383 aa overlap); etc. BELONGS TO THE PURK / PURT FAMILY. Protein product from Mb3304c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3304c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65899" /db_xref="InterPro:IPR003135" /db_xref="InterPro:IPR005875" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR013815" /db_xref="InterPro:IPR016185" /db_xref="InterPro:IPR040686" /db_xref="UniProtKB/Swiss-Prot:P65899" /protein_id="SIU01933.1" /translation="MMAVASSRTPAVTSFIAPLVAMVGGGQLARMTHQAAIALGQNLR VLVTSADDPAAQVTPNVVIGSHTDLAALRRVAAGADVLTFDHEHVPNELLEKLVADGV NVAPSPQALVHAQDKLVMRQRLAAAGVAVPRYAGIKDPDEIDVFAARVDAPIVVKAVR GGYDGRGVRMARDVADARDFARECLADGVAVLVEERVDLRRELSALVARSPFGQGAAW PVVQTVQRDGTCVLVIAPAPALPDDLATAAQRLALQLADELGVVGVLAVELFETTDGA LLVNELAMRPHNSGHWTIDGARTSQFEQHLRAVLDYPLGDSDAVVPVTVMANVLGAAQ PPAMSVDERLHHLFARMPDARVHLYGKAERPGRKVGHINFLGSDVAQLCERAELAAHW LSHGRWTDGWDPHRASDDAVGVPPACGGRSDEEERRL" CDS 3618876..3619694 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3305" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3305, -, len: 271 aa. Equivalent to Rv3277, len: 271 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 271 aa overlap). Probable conserved transmembrane protein, equivalent, but longer 49 aa, to Q49673|B1308_C1_121|ML0734 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (228 aa), FASTA scores: opt: 1266, E(): 6.1e-78, (84.2% identity in 228 aa overlap). Also similar to various proteins (principally unknowns) e.g. Q9KZ84|SCE25.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (190 aa), FASTA scores: opt: 197, E(): 3.6e-06, (32.0% identity in 150 aa overlap); BAB50058|MLL3086 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (136 aa), FASTA scores: opt: 176, E(): 6.9e-05, (34.7% identity in 147 aa overlap); O29640|AF0615 HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (129 aa), FASTA scores: opt: 120, E(): 0.38, (23.35% identity in 120 aa overlap); Q9KJU8|GTCA TEICHOIC ACID GLYCOSYLATION PROTEIN from Listeria innocua (145 aa), FASTA scores: opt: 117, E(): 0.67, (23.85% identity in 151 aa overlap); etc. Equivalent to AAK47718 from Mycobacterium tuberculosis strain CDC1551 (256 aa) but longer 16 aa. Contains PS00044 Bacterial regulatory proteins, lysR family signature. Protein product from Mb3305 detected using SWATH mass spectrometry. Mb3305 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3N7" /db_xref="InterPro:IPR007267" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3N7" /protein_id="SIU01934.1" /translation="MNEVTAGVRELATAIMVSRHLTGVLAGHGSQTVTYHFASILCSS VHSLVVSFADATIARLPGVVQPYAQRHHELIKFAIVGGTTFIIDTAIFYTLKLTVLEP KPVTAKVIAGIVAVIASYVLNREWSFRDRGGRERHHEALLFFAFSGVGVLLSMAPLWF SSYILQLRVPTVSLTMENIADFISAYIIGNLLQMAFRFWAFRRWVFPDEFARNPDKAL ESALTAGGIAEVFEDVLEGGFEDGNVTLLRAWRNRANRFAQLGDSSEPRVSKTL" CDS complement(3619649..3620167) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3306C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3306c, -, len: 172 aa. Equivalent to Rv3278c, len: 172 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 172 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CCL2|ML0733 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (172 aa), FASTA scores: opt: 1024, E(): 6e-61, (83.15% identity in 172 aa overlap); and Q49672|B1308_F2_67 HYPOTHETICAL PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 1024, E(): 6.3e-61, (83.15% identity in 172 aa overlap) (this is certainly the same putative protein but with N-terminus longer). Also some similarity to other hypothetical proteins (generally membrane proteins) e.g. O26822|MTH726 HYPOTHETICAL PROTEIN from Methanobacterium thermoautotrophicum (204 aa), FASTA scores: opt: 147, E(): 0.0079, (24.6% identity in 187 aa overlap); Q9X8H4|SCE9.01 HYPOTHETICAL 47.7 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (436 aa), FASTA scores: opt: 151, E(): 0.0079, (28.1% identity in 153 aa overlap). Protein product from Mb3306c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3306c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H1" /db_xref="InterPro:IPR005182" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H1" /protein_id="SIU01935.1" /translation="MSYPENVLAAGEQVVLHRHPHWNRLIWPVVVLVLLTGLAAFGSG FVNSTPWQQIAKNVIHAVIWGIWLVIVGWLTLWPFLSWLTTHFVVTNRRVMFRHGVLT RSGIDIPLARINSVEFRDRIFERIFRTGTLIIESASQDPLEFYNIPRLREVHALLYHK VFDTLGSDESPS" CDS complement(3620210..3621010) /codon_start=1 /transl_table=11 /gene="birA" /locus_tag="BQ2027_MB3307C" /product="POSSIBLE BIFUNCTIONAL PROTEIN BIRA: BIOTIN OPERON REPRESSOR + BIOTIN--[ACETYL-COA-CARBOXYLASE] SYNTHETASE (BIOTIN--PROTEIN LIGASE)" /note="Mb3307c, birA, len: 266 aa. Equivalent to Rv3279c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 266 aa overlap). Possible birA, bifunctional protein: biotin operon repressor and biotin--[acetyl-CoA-carboxylase] synthetase (EC 6.3.4.15), equivalent to Q9CCL3|BIRA|ML0732 BIOTIN APO-PROTEIN LIGASE from Mycobacterium leprae (274 aa), FASTA scores: opt: 1189, E(): 2.3e-66, (71.2% identity in 271 aa overlap). But as it lacks a BirA h-t-h domain at N-terminus, may simply be biotin apo-protein ligase. Also similar to others e.g. Q9CNX6|BIRA|PM0296 from Pasteurella multocida (312 aa), FASTA scores: opt: 347, E(): 2.7e-14, (32.95% identity in 270 aa overlap); Q9HWC0|BIRA|PA4280 from Pseudomonas aeruginosa (312 aa), FASTA scores: opt: 335, E(): 1.5e-13, (34.2% identity in 272 aa overlap); Q9A6Z0|CC1936 from Caulobacter crescentus (250 aa), FASTA scores: opt: 332, E(): 1.9e-13, (33.6% identity in 238 aa overlap); P06709|BIRA_ECOLI (321 aa), FASTA scores: opt: 314, E(): 3.1e-12, (34.15% identity in 249 aa overlap); etc. SIMILAR WITH OTHER BACTERIAL BIRA AND WITH EUKARYOTIC BIOTIN APO-PROTEIN LIGASE. Protein product from Mb3307c detected using SWATH mass spectrometry. Mb3307c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4E2" /db_xref="InterPro:IPR003142" /db_xref="InterPro:IPR004143" /db_xref="InterPro:IPR004408" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01936.1" /translation="MTDRDRLRPPLDERSLRDQLIGAGSGWRQLDVVAQTGSTNADLL ARAASGADIDGVVLIAEHQTAGRGRHGRGWAATARAQIILSVGVRVVDVPVQAWGWLS LAAGLAVLDSVAPLIAVPPAETGLKWPNDVLARGGKLAGILAEVAQPFVVLGVGLNVT QAPEEVDPDATSLLDLGVAAPDRNRIASRLLRELEARIIQWRNANPQLAANYRARSLT IGSRVRVELPGGQDVVGIARDIDDQGRLCLDVGGRTVVVSAGDVVHLR" CDS 3621060..3622706 /codon_start=1 /transl_table=11 /gene="accD5" /locus_tag="BQ2027_MB3308" /product="PROBABLE PROPIONYL-COA CARBOXYLASE BETA CHAIN 5 ACCD5 (PCCASE) (PROPANOYL-COA:CARBON DIOXIDE LIGASE)" /note="Mb3308, accD5, len: 548 aa. Equivalent to Rv3280, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 548 aa overlap). Probable accD5, propyonyl-CoA carboxylase beta chain 5 (EC 6.4.1.3), equivalent to P53002|PCCB_MYCLE|ACCD5|ML0731|B1308_C1_125 PROBABLE PROPIONYL-COA CARBOXYLASE BETA CHAIN 5 from Mycobacterium leprae (549 aa), FASTA scores: opt: 3241, E(): 4e-192, (88.7% identity in 549 aa overlap). Also similar to many e.g. O87201|DTSR2 DTSR2 PROTEIN INVOLVED IN GLUTAMATE PRODUCTION from orynebacterium glutamicum (Brevibacterium flavum) (537 aa), FASTA scores: opt: 2604, E(): 6.9e-153, (74.1% identity in 529 aa overlap) (see first citation below); P53003|PCCB_SACER from Saccharopolyspora erythraea (Streptomyces erythraeus) (546 aa), FASTA scores: opt: 2466, E(): 2.2e-144, (70.2% identity in 530 aa overlap); O88155|DTSR1 DTSR1 PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (543 aa), FASTA scores: opt: 2375, E(): 8.8e-139, (67.1% identity in 529 aa overlap (see citation below); Q9X4K7|PCCB from Streptomyces coelicolor (530 aa), FASTA scores: opt: 2360, E(): 7.3e-138, (67.9% identity in 533 aa overlap); O24789|MXPCCB from Myxococcus xanthus (524 aa), FASTA scores: opt: 1868, E(): 1.5e-107, (56.85% identity in 524 aa overlap); etc. Also similar with METHYLMALONYL-COA DECARBOXYLASES e.g. O59018|PH1287 from Pyrococcus horikoshii (522 aa), FASTA scores: opt: 1841, E(): 6.7e-106, (54.15% identity in 528 aa overlap). Also similarity with MTCY427.28 (43.8% identity in 434 aa overlap). BELONGS TO THE ACCD/PCCB FAMILY. Protein product from Mb3308 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3308 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3U0" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01937.1" /translation="MTSVTDRSAHSAERSTEHTIDIHTTAGKLAELHKRREESLHPVG EDAVEKVHAKGKLTARERIYALLDEDSFVELDALAKHRSTNFNLGEKRPLGDGVVTGY GTIDGRDVCIFSQDATVFGGSLGEVYGEKIVKVQELAIKTGRPLIGINDGAGARIQEG VVSLGLYSRIFRNNILASGVIPQISLIMGAAAGGHVYSPALTDFVIMVDQTSQMFITG PDVIKTVTGEEVTMEELGGAHTHMAKSGTAHYAASGEQDAFDYVRELLSYLPPNNSTD APRYQAAAPTGPIEENLTDEDLELDTLIPDSPNQPYDMHEVITRLLDDEFLEIQAGYA QNIVVGFGRIDGRPVGIVANQPTHFAGCLDINASEKAARFVRTCDCFNIPIVMLVDVP GFLPGTDQEYNGIIRRGAKLLYAYGEATVPKITVITRKAYGGAYCVMGSKDMGCDANL AWPTAQIAVMGASGAVGFVYRQQLAEAAANGEDIDKLRLRLQQEYEDTLVNPYVAAER GYVDAVIPPSHTRGYIGTALRLLERKIAQLPPKKHGNVPL" CDS 3622687..3623157 /codon_start=1 /transl_table=11 /gene="acce5" /locus_tag="BQ2027_MB3309" /product="probable bifunctional protein acetyl-/propionyl-coenzyme a carboxylase (epsilon chain) acce5" /note="Mb3309, -, len: 156 aa. Similar to Rv3281, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (88.1% identity in 177 aa overlap). Conserved hypothetical protein, equivalent (but longer 14 aa and with a gap between aa 82-102) to AAK47723|MT3380 from Mycobacterium tuberculosis strain CDC1551 (142 aa), FASTA scores: opt: 830, E(): 3.1e-40, (86.5% identity in 163 aa overlap). C-terminus highly similar to Q49671|B1308_C3_211|ML0730 from Mycobacterium leprae (84 aa), FASTA scores: opt: 393, E(): 7.6e-16, (68.95% identity in 87 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame deletion of 63 bp leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (156 aa versus 177 aa). Protein product from Mb3309 detected using shotgun mass spectrometry. Mb3309 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR032716" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y2" /protein_id="SIU01938.1" /translation="MGTCPCESSERNEPVSRVSGTNEVSDGNETNNPAEVSDGNETNN PAEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPAPVTEKPLHPHEPHIEILRGQPTD QELAALIAVLGSISGSTPPAQPEPTRWGLPVDQLRYPVFSWQRITLQEMTHMRR" CDS 3623154..3623822 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3310" /product="Septum formation protein Maf" /note="Mb3310, -, len: 222 aa. Equivalent to Rv3282, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 222 aa overlap). Conserved hypothetical protein, equivalent to Q49670|ML0729 1308R (HYPOTHETICAL PROTEIN ML0729) from Mycobacterium leprae (213 aa), FASTA scores: opt: 945, E(): 5.5e-54, (68.55% identity in 213 aa overlap). Also similar to Q9EWV6|2SCK31.18 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (206 aa), FASTA scores: opt: 459, E(): 1.3e-22, (47.35% identity in 209 aa overlap); P74331|MAF OR SLL0905 MAF PROTEIN from Synechocystis sp. strain PCC 6803 (195 aa), FASTA scores: opt: 401, E(): 6.9e-19, (43.0% identity in 207 aa overlap); and shows weak similarity with various proteins e.g. Q9BUL6 ACETYLSEROTONIN O-METHYLTRANSFERASE-LIKE from Homo sapiens (Human) (621 aa), FASTA scores: opt: 282, E(): 8.9e-11, (31.6% identity in 193 aa overlap); O95671|ASMTL ASMTL PROTEIN from Homo sapiens (Human) (629 aa), FASTA scores: opt: 282, E(): 9e-11, (31.6% identity in 193 aa overlap); BAB51136|MLR4491 MAF PROTEIN from Rhizobium loti (Mesorhizobium loti) (199 aa), FASTA scores: opt: 269, E(): 2.3e-10, (29.3% identity in 198 aa overlap); etc. Protein product from Mb3310 detected using SWATH mass spectrometry. Mb3310 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWT7" /db_xref="InterPro:IPR003697" /db_xref="InterPro:IPR029001" /db_xref="UniProtKB/Swiss-Prot:Q7TWT7" /protein_id="SIU01939.1" /translation="MTRLVLGSASPGRLKVLRDAGIEPLVIASHVDEDVVIAALGPDA VPSDVVCVLAAAKAAQVATTLTGTQRIVAADCVVVACDSMLYIEGRLLGKPASIDEAR EQWRSMAGRAGQLYTGHGVIRLQDNKTVYRSAETAITTVYFGTPSASDLEAYLASGES LRVAGGFTLDGLGGWFIDGVQGNPSNVIGLSLPLLRSLVQRCGLSVAALWAGNAGGPA HKQQ" CDS 3623863..3624756 /codon_start=1 /transl_table=11 /gene="sseA" /locus_tag="BQ2027_MB3311" /product="PROBABLE THIOSULFATE SULFURTRANSFERASE SSEA (RHODANESE) (THIOSULFATE CYANIDE TRANSSULFURASE) (THIOSULFATE THIOTRANSFERASE)" /note="Mb3311, sseA, len: 297 aa. Equivalent to Rv3283, len: 297 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 297 aa overlap). Probable sseA, thiosulfate sulfurtransferase (EC 2.8.1.1), equivalent P46700|THT2_MYCLE|SSEA|ML0728|B1308_C1_127 PUTATIVE THIOSULFATE SULFURTRANSFERASE SSEA from Mycobacterium leprae (296 aa), FASTA scores: opt: 1742, E(): 5.5e-108, (83.45% identity in 296 aa overlap). Also highly similar to others e.g. Q9RXT9|DR0217 from Deinococcus radiodurans (286 aa), FASTA scores: opt: 1057, E(): 1.2e-62, (53.86% identity in 273 aa overlap); P16385|THTR_SACER|CYSA from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa), FASTA scores: opt: 1006, E(): 2.7e-59, (51.25% identity in 277 aa overlap); P71121|THTR_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (225 aa), FASTA scores: opt: 897, E(): 3.6e-52, (59.05% identity in 215 aa overlap); etc. Also highly similar to O05793|CYSA1|CYSA|Rv3117|MT3199|MTCY164.27|CYSA2|RV0815c|M T0837|MTV043.07c|THTR_MYCTU PUTATIVE THIOSULFATE SULFURTRANSFERASE (EC 2.8.1.1) from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 955, E(): 6.3e-56, (50.2% identity in 271 aa overlap); and Q50036|THTR_MYCLE|CYSA|CYSA3|ML2198 PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium leprae (277 aa), FASTA scores: opt: 931, E(): 2.5e-54, (48.9% identity in 276 aa overlap). Shows some similarity to MTCY339.19c (30.3% identity in 254 aa overlap). Contains PS00683 Rhodanese C-terminal signature. BELONGS TO THE RHODANESE FAMILY. Protein product from Mb3311 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3311 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TWT6" /db_xref="InterPro:IPR001307" /db_xref="InterPro:IPR001763" /db_xref="InterPro:IPR036873" /db_xref="UniProtKB/Swiss-Prot:Q7TWT6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01940.1" /translation="MPLPADPSPTLSAYAHPERLVTADWLSAHMGAPGLAIVESDEDV LLYDVGHIPGAVKIDWHTDLNDPRVRDYINGEQFAELMDRKGIARDDTVVIYGDKSNW WAAYALWVFTLFGHADVRLLNGGRDLWLAERRETTLDVPTKTCTGYPVVQRNDAPIRA FRDDVLAILDAQPLIDVRSPEEYTGKRTHMPDYPEEGALRAGHIPTAVHIPWGKAADE SGRFRSREELERLYDFINPDDQTVVYCRIGERSSHTWFVLTHLLGKADVRNYDGSWTE WGNAVRVPIVAGEEPGVVPVV" CDS 3624753..3625184 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3312" /product="Sulfur acceptor protein => iron-sulfur cluster assembly SufE" /note="Mb3312, -, len: 143 aa. Equivalent to Rv3284, len: 143 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 143 aa overlap). Conserved hypothetical protein, with similarity to other bacterial hypothetical proteins e.g. Q9RXU0|DR0216 from Deinococcus radiodurans (147 aa), FASTA scores: opt: 425, E(): 9.1e-21, (46.55% identity in 146 aa overlap); BAB37094|ECS3671 from Escherichia coli strain O157:H7 (147 aa), FASTA scores: opt: 187, E(): 2.2e-05, (29.5% identity in 139 aa overlap); AAG57925|YGDK from Escherichia coli strain O157:H7 EDL933 (147 aa), FASTA scores: opt: 187, E(): 2.2e-05, (32.05% identity in 139 aa overlap); etc. Protein product from Mb3312 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3312 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003808" /db_xref="UniProtKB/Swiss-Prot:P67124" /protein_id="SIU01941.1" /translation="MTAPASLPAPLAEVVSDFAEVQGQDKLRLLLEFANELPALPSHL AESAMEPVPECQSPLFLHVDASDPNRVRLHFSAPAEAPTTRGFASILAAGLDEQPAAD ILAVPEDFYTELGLAALISPLRLRGMSAMLARIKRRLREAD" CDS 3625292..3627094 /codon_start=1 /transl_table=11 /gene="accA3" /locus_tag="BQ2027_MB3313" /product="PROBABLE BIFUNCTIONAL PROTEIN ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE (ALPHA CHAIN) ACCA3: BIOTIN CARBOXYLASE + BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)" /note="Mb3313, accA3, len: 600 aa. Equivalent to Rv3285, len: 600 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 600 aa overlap). Probable accA3, bifunctional protein acetyl-/propionyl-coenzyme A carboxylase, alpha chain (EC 6.3.4.14) (see citations below) equivalent to P46392|BCCA_MYCLE|BCCA|ML0726|B1308_C1_129 ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN from Mycobacterium leprae (598 aa), FASTA scores: opt: 3510, E(): 1.1e-196, (89.3% identity in 601 aa overlap). Also highly similar to other proteins e.g. P71122|ACCBC ACYL COENZYME A CARBOXYLASE from Corynebacterium glutamicum (Brevibacterium flavum) (591 aa), FASTA scores: opt: 2776, E(): 5.6e-154, (71.95% identity in 592 aa overlap); Q54119|BCPA2 BIOTIN CARBOXYLASE AND BIOTIN CARBOXYL CARRIER PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (591 aa), FASTA scores: opt: 2723, E(): 6.7e-151, (70.5% identity in 590 aa overlap); Q54105|BCPA BIOTIN CARBOXYLASE AND BIOTIN CARBOXYL CARRIER PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (597 aa), FASTA scores: opt: 2721, E(): 8.9e-151, (70.05% identity in 594 aa overlap); Q9EWV4|2SCK31.20 PUTATIVE ACYL-COA CARBOXYLASE COMPLEX A SUBUNIT from Streptomyces coelicolor (590 aa), FASTA scores: opt: 2626, E(): 2.9e-145, (68.25% identity in 595 aa overlap); etc. Contains PS00867 Carbamoyl-phosphate synthase subdomain signature 2, PS00188 Biotin-requiring enzymes attachment site. SIMILAR TO OTHER BIOTIN-DEPENDENT ENZYMES AND CARBAMOYL-PHOSPHATE SYNTHETASES. Protein product from Mb3313 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3313 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3P4" /db_xref="InterPro:IPR000089" /db_xref="InterPro:IPR001882" /db_xref="InterPro:IPR005479" /db_xref="InterPro:IPR005481" /db_xref="InterPro:IPR005482" /db_xref="InterPro:IPR011053" /db_xref="InterPro:IPR011054" /db_xref="InterPro:IPR011761" /db_xref="InterPro:IPR011764" /db_xref="InterPro:IPR016185" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01942.1" /translation="MASHAGSRIARISKVLVANRGEIAVRVIRAARDAGLPSVAVYAE PDAESPHVRLADEAFALGGQTSAESYLDFAKILDAAAKSGANAIHPGYGFLAENADFA QAVIDAGLIWIGPSPQSIRDLGDKVTARHIAARAQAPLVPGTPDPVKGADEVVAFAEE YGLPIAIKAAHGGGGKGMKVARTIDEIPELYESAVREATAAFGRGECYVERYLDKPRH VEAQVIADQHGNVVVAGTRDCSLQRRYQKLVEEAPAPFLTDFQRKEIHDSAKRICKEA HYHGAGTVEYLVGQDGLISFLEVNTRLQVEHPVTEETAGIDLVLQQFRIANGEKLDIT EDPTPRGHAIEFRINGEDAGRNFLPAPGPVTKFHPPSGPGVRVDSGVETGSVIGGQFD SMLAKLIVHGADRAEALARARRALNEFGVEGLATVIPFHRAVVSDPAFIGDANGFSVH TRWIETEWNNTIEPFTDGEPLDEDARPRQKVVVEIDGRRVEVSLPADLALSNGGGCDP VGVIRRKPKPRKRGAHTGAAASGDAVTAPMQGTVVKFAVEEGQEVVAGDLVVVLEAMK MENPVTAHKDGTITGLAVEAGAAITQGTVLAEIK" CDS complement(3627104..3627889) /codon_start=1 /transl_table=11 /gene="sigF" /locus_tag="BQ2027_MB3314C" /product="alternative rna polymerase sigma factor sigf" /note="Mb3314c, sigF, len: 261 aa. Equivalent to Rv3286c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Probable sigF, stress response/stationary phase RNA polymerase sigma factor (see citations below), similar to several Streptomyces RNA polymerase sigma factors e.g. Q9RPC8|SIGH from Streptomyces coelicolor A3(2) (354 aa), FASTA scores: opt: 869, E(): 1.1e-45, (51.15% identity in 258 aa overlap); Q9RIT0|SIG1 from Streptomyces coelicolor (361 aa), FASTA scores: opt: 869, E(): 1.1e-45, (51.15% identity in 258 aa overlap); Q9ADM4|2SC10A7.38c from Streptomyces coelicolor (318 aa), FASTA scores: opt: 776, E(): 4.6e-40, (48.75% identity in 240 aa overlap); P37971|RPOF_STRCO|SIGF|RPOX|2SCD60.01c from Streptomyces coelicolor (287 aa), FASTA scores: opt: 717, E(): 1.6e-36, (44.5% identity in 245 aa overlap); P37970|RPOF_STRAU|SIGF|RPOX from Streptomyces aureofaciens (297 aa); etc. Contains possible helix-turn-helix motif at aa 229-250 (+7.38 SD). SIMILAR TO THE SIGMA-70 FACTOR FAMILY. Seems expressed in stationary phase and under stress conditions in vitro (see citations below). Protein product from Mb3314c detected using SWATH mass spectrometry. Mb3314c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Q3" /db_xref="InterPro:IPR000943" /db_xref="InterPro:IPR007624" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR007630" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR014322" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01943.1" /translation="MTARAAGGSASRANEYADVPEMFRELVGLPAGSPEFQRHRDKIV QRCLPLADHIARRFEGRGEPRDDLIQVARVGLVNAAVRFDVKTGSDFVSFAVPTIMGE VRRHFRDNSWSVKVPRRLKELHLRLGTATADLSQRLGRAPSASELAAELGMDRAEVIE GLLAGSSYHTLSIDSGGGSDDDARAITDTLGDVDAGLDQIENREVLRPLLEALPERER TVLVLRFFDSMTQTQIAERVGISQMHVSRLLAKSLARLRDQLE" CDS complement(3627886..3628323) /codon_start=1 /transl_table=11 /gene="rsbW" /locus_tag="BQ2027_MB3315C" /standard_name="usfX" /product="ANTI-SIGMA FACTOR RSBW (SIGMA NEGATIVE EFFECTOR)" /note="Mb3315c, rsbW, len: 145 aa. Equivalent to Rv3287c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 145 aa overlap). rsbW (alternate gene name: usfX), anti-sigma factor (see citations below), similar to Q49667|B1308_F3_89 from Mycobacterium leprae (75 aa), FASTA scores: opt: 308, E(): 2.5e-15, (72.2% identity in 72 aa overlap); Q9R3X8|PRS1|USHX|PRS PRS1 PROTEIN (ANTI-SIGMA FACTOR) from Streptomyces coelicolor (137 aa), FASTA scores: opt: 184, E(): 3.7e-06, (36.8% identity in 106 aa overlap); O50231 PUTATIVE SIGMA-B REGULATOR from Bacillus licheniformis (160 aa), FASTA scores: opt: 122, E(): 0.13, (23.9% identity in 92 aa overlap); and P17904|RSBW_BACSU ANTI-SIGMA B FACTOR (SIGMA-B NEGATIVE EFFECTOR RSBW) from Bacillus subtilis (160 aa), FASTA scores: opt: 108, E(): 1.3, (21.25% identity in 127 aa overlap). Equivalent to AAK47729 from Mycobacterium tuberculosis strain CDC1551 (145 aa) but longer 99 aa. INDUCTION BY HEAT SHOCK, SALT STRESS, OXIDATIVE STRESS, GLUCOSE LIMITATION AND OXYGEN LIMITATION. N-terminus shortened since first submission (previously 242 aa). Protein product from Mb3315c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3315c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P6" /protein_id="SIU01944.1" /translation="MADSDLPTKGRQRGVRAVELNVAARLENLALLRTLVGAIGTFED LDFDAVADLRLAVDEVCTRLIRSALPDATLRLVVDPRKDEVVVEASAACDTHDVVAPG SFSWHVLTALADDVQTFHDGRQPDVAGSVFGITLTARRAASSR" CDS complement(3628521..3628934) /codon_start=1 /transl_table=11 /gene="usfY" /locus_tag="BQ2027_MB3316C" /product="PUTATIVE PROTEIN USFY" /note="Mb3316c, usfY, len: 137 aa. Equivalent to Rv3288c, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). usfY, putative protein (see citation below). Has no significant homologues. May not be contranscribed with the usfX and sigF proteins. Protein product from Mb3316c detected using SWATH mass spectrometry. Mb3316c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H7" /protein_id="SIU01945.1" /translation="MGQIPPQPVRRVLPLMVVPGNGQKWRNRTETEEAMGDTYRDPVD HLRTTRPLAGESLIDVVHWPGYLLIVAGVVGGVGALAAFGTGHHAEGMTFGVVAIVVT VVGLAWLAFEHRRIRKIADRWYTEHPEVRRQRLAG" CDS complement(3628969..3629346) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3317C" /product="POSSIBLE TRANSMEMBRANE PROTEIN" /note="Mb3317c, -, len: 125 aa. Equivalent to Rv3289c, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). Possible transmembrane protein, showing slight similarity to other membrane proteins or glycoproteins. Protein product from Mb3317c detected using shotgun mass spectrometry. Mb3317c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4E6" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E6" /protein_id="SIU01946.1" /translation="MHEVGGPSRGDRLGRDDSEVHSAIRFAVVAAVVGVGFLIMGALL VSTCSGVDTAACGPPQRILLALGGPLILCAAGLWAFLRTYRVWRAEGTWWGWHGAGWF LLTLMVLTLCIGVPPIAGPVMAP" CDS complement(3629380..3630729) /codon_start=1 /transl_table=11 /gene="lat" /locus_tag="BQ2027_MB3318C" /product="PROBABLE L-LYSINE-EPSILON AMINOTRANSFERASE LAT (L-LYSINE AMINOTRANSFERASE) (LYSINE 6-AMINOTRANSFERASE)" /note="Mb3318c, lat, len: 449 aa. Equivalent to Rv3290c, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Probable lat, lysine-epsilon aminotransferase (EC 2.6.1.36), similar to Q05174|LAT_NOCLA from Nocardia lactamdurans (450 aa), FASTA scores: opt: 1702, E(): 1.1e-99, (60.35% identity in 439 aa overlap); and Q01767|Q53823|LAT_STRCL from Streptomyces clavuligerus (457 aa), FASTA scores: opt: 1676, E(): 4.9e-98, (60.15% identity in 434 aa overlap). Also some similarity to 4-AMINOBUTYRATE AMINOTRANSFERASE PROTEINS (GAMMA-AMINO-N-BUTYRATE TRANSAMINASES). BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb3318c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3318c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63510" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR017657" /db_xref="UniProtKB/Swiss-Prot:P63510" /protein_id="SIU01947.1" /translation="MAAVVKSVALAGRPTTPDRVHEVLGRSMLVDGLDIVLDLTRSGG SYLVDAITGRRYLDMFTFVASSALGMNPPALVDDREFHAELMQAALNKPSNSDVYSVA MARFVETFARVLGDPALPHLFFVEGGALAVENALKAAFDWKSRHNQAHGIDPALGTQV LHLRGAFHGRSGYTLSLTNTKPTITARFPKFDWPRIDAPYMRPGLDEPAMAALEAEAL RQARAAFETRPHDIACFVAEPIQGEGGDRHFRPEFFAAMRELCDEFDALLIFDEVQTG CGLTGTAWAYQQLDVAPDIVAFGKKTQVCGVMAGRRVDEVADNVFAVPSRLNSTWGGN LTDMVRARRILEVIEAEGLFERAVQHGKYLRARLDELAADFPAVVLDPRGRGLMCAFS LPTTADRDELIRQLWQRAVIVLPAGADTVRFRPPLTVSTAEIDAAIAAVRSALPVVT" CDS complement(3630780..3631232) /codon_start=1 /transl_table=11 /gene="lrpa" /locus_tag="BQ2027_MB3319C" /product="probable transcriptional regulatory protein lrpa (lrp/asnc-family)" /note="Mb3319c, -, len: 150 aa. Equivalent to Rv3291c, len: 150 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 150 aa overlap). Probable transcriptional regulator asnC-family, similar to other regulatory proteins e.g. Q9RKY4|SC6D7.14 from Streptomyces coelicolor (165 aa), FASTA scores: opt: 503, E(): 9.1e-26, (50.35% identity in 143 aa overlap); Q9KYP0|SCD69.13 from Streptomyces coelicolor (167 aa), FASTA scores: opt: 310, E(): 2.7e-13, (37.2% identity in 129 aa overlap); BAB50701|MLL3910 from Rhizobium loti (Mesorhizobium loti) (152 aa), FASTA scores: opt: 282, E(): 1.6e-11, (39.55% identity in 129 aa overlap); O87635|LRP_KLEAE from Klebsiella aerogenes (163 aa), FASTA scores: opt: 279, E(): 2.5e-11, (38.1% identity in 147 aa overlap); etc. Contains helix-turn-helix motif at aa 22-43 (+3.94 SD). COULD BELONG TO THE ASNC FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3319c detected using shotgun mass spectrometry. Mb3319c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Z0" /db_xref="InterPro:IPR000485" /db_xref="InterPro:IPR011008" /db_xref="InterPro:IPR019887" /db_xref="InterPro:IPR019888" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z0" /protein_id="SIU01948.1" /translation="MNEALDDIDRILVRELAADGRVTLSELATRAGLSVSAVQSRVRR LESRGVVQGYSARINPEAVGHLLSAFVAITPLDPSQPDDAPARLEHIEEVESCYSVAG EESYVLLVRVASARALEDLLQRIRTTANVRTRSTIILNTFYSDRQHIP" CDS 3631263..3632510 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3320" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3320, -, len: 415 aa. Equivalent to Rv3292, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 415 aa overlap). Conserved hypothetical protein, similar to P76097|YDCJ_ECOLI|B1423 HYPOTHETICAL 51.0 KDA PROTEIN from Escherichia coli strain K12 (447 aa), FASTA scores: opt: 747, E(): 5.6e-39, (38.55% identity in 449 aa overlap); BAB35451|ECS2028 HYPOTHETICAL 51.0 KDA PROTEIN from Escherichia coli strain O157:H7 (447 aa), FASTA scores: opt: 744, E(): 8.6e-39, (38.3% identity in 449 aa overlap); AAG56352|Z2297 PROTEIN from Escherichia coli O157:H7 EDL933 (212 aa), FASTA scores: opt: 454, E(): 4.6e-21, (41.75% identity in 206 aa overlap); and similar in part with Q49664|B1308_C1_136 from Mycobacterium leprae (71 aa), FASTA scores: opt: 305, E(): 3.2e-12, (70.0% identity in 70 aa overlap). Protein product from Mb3320 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3320 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR009770" /db_xref="UniProtKB/Swiss-Prot:P65066" /protein_id="SIU01949.1" /translation="MSRSKRLQTGQLRARFAAGLSAMYAAEVPAYGTLVEVCAQVNSD YLTRHRRAERLGSLQRVTAERHGAIRVGNPAELAAVADLFAAFGMLPVGYYDLRTAES PIPVVSTAFRPIDANELAHNPFRVFTSMLAIEDRRYFDADLRTRVQTFLARRQLFDPA LLAQARAIAADGGCDADDAPAFVAAAVAAFALSREPVEKSWYDELSRVSAVAADIAGV GSTHINHLTPRVLDIDDLYRRMTERGITMIDTIQGPPRTDGPDVLLRQTSFRALAEPR MFRDEDGTVTPGILRVRFGEVEARGVALTPRGRERYEAAMAAADPAAVWATHFPSTDA EMAAQGLAYYRGGDPSAPIVYEDFLPASAAGIFRSNLDRDSQTGDGPDDAGYNVDWLA GAIGRHIHDPYALYDALAQEERR" CDS 3632537..3634021 /codon_start=1 /transl_table=11 /gene="pcd" /locus_tag="BQ2027_MB3321" /product="PROBABLE PIPERIDEINE-6-CARBOXILIC ACID DEHYDROGENASE PCD (PIPERIDEINE-6-CARBOXYLATE DEHYDROGENASE)" /note="Mb3321, pcd, len: 494 aa. Equivalent to Rv3293, len: 494 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 494 aa overlap). Probable pcd, piperideine-6-carboxylic acid dehydrogenase (EC 1.5.-.-), highly similar to others e.g. O85725|PCD SEMIALDEHYDE DEHYDROGENASE from Streptomyces clavuligerus (512 aa), FASTA scores: opt: 2214, E(): 6.7e-121, (68.75% identity in 496 aa overlap) (see first citation below); Q9I4U7|PA1027 PROBABLE ALDEHYDE DEHYDROGENASE from Pseudomonas aeruginosa (529 aa), FASTA scores: opt: 1984, E(): 1.4e-107, (64.5% identity in 493 aa overlap); BAB49892|MLL2867 ALDEHYDE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 1964, E(): 2e-106, (62.8% identity in 476 aa overlap); Q9A8Y1|CC1216 ALDEHYDE DEHYDROGENASE from Caulobacter crescentus (507 aa), FASTA scores: opt: 1909, E(): 3.1e-103, (59.95% identity in 497 aa overlap); O54199|PCD PIPERIDEINE-6-CARBOXILIC ACID DEHYDROGENASE from Streptomyces clavuligerus (496 aa), FASTA scores: opt: 1748, E(): 6.4e-94, (60.6% identity in 467 aa overlap); and Q9F1U8|PCD PIPERIDEINE-6-CARBOXYLATE DEHYDROGENASE from 'Flavobacterium' lutescens (510 aa), FASTA scores: opt: 1656, E(): 1.4e-88, (54.05% identity in 481 aa overlap) (see second citation below); etc. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site. Note that ORF Rv3290c seems to encoded the putative lat enzyme. Previously known as aldB. Protein product from Mb3321 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3321 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Q7" /db_xref="InterPro:IPR015590" /db_xref="InterPro:IPR016161" /db_xref="InterPro:IPR016162" /db_xref="InterPro:IPR016163" /db_xref="InterPro:IPR029510" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q7" /protein_id="SIU01950.1" /translation="MLEACQAIGVTAALGEPGEHSLPASTPITGDVLFSIAPTTPEQA DHAIAAAAATFTAWRSTPAPVRGALVARLGELLTAHQQDLATLVTVEVGKITAEARGE VQEMIDVCQFSVGLSRQLYGRTIASERAGHRLLETWHPLGVVGVITAFNFPVAVWAWN TAVALVCGDTVVWKPSELTPLTALACQALLSRAAADVGAPAAVGGLLLGGAERGAQLV DDPRVALLSATGSVRMGQQVGPRVARRFGRVLLELGGNNAAIVAPSADLELAVRCIVF AAAGTAGQRCTSLRRLIVHRSVADDVVARVVGAYRQLAIGDPSAPDTLVGPLIHEAAY RDMVAALERARTDGGEVIGGDRREVGSPGAYYVAPAVVRMPSQTAIVATETFAPILYV LTYDDLDEAIALNNAVPQGLSSSIFTTDLREAEHFLDQSDCGIANVNIGTSGAEIGGA FGGEKQTGGGRESGSDAWKAYMRRATNTVNYSSELPLAQGVKFG" CDS complement(3634121..3634930) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3322C" /product="Predicted ATPase (AAA+ superfamily)" /note="Mb3322c, -, len: 269 aa. Equivalent to Rv3294c, len: 269 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 269 aa overlap). Conserved hypothetical protein, similar to several conserved hypothetical proteins from Mycobacterium tuberculosis: O07781|Rv0597c (411 aa), FASTA scores: opt: 682, E(): 3.6e-37, (44.85% identity in 243 aa overlap); O53329|Rv3179 (454 aa), FASTA scores: opt: 561, E(): 3.3e-29, (42.20% identity in 218 aa overlap); Q10849|YK08_MYCTU|Rv2008c (441 aa), FASTA scores: opt: 194, E(): 3.9e-05, (30.10% identity in 239 aa overlap). Also some similarity with proteins from other organisms. Replace previous Rv3294 on opposite strand." /db_xref="InterPro:IPR025420" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3P9" /protein_id="SIU01951.1" /translation="MGLPRRPCCDTTGSARYRESVRRYPRIGEDSAAYRRRLCRESAK ARNVDRVVKRDAADVSNLQRIADLPRLIRLLAARSASELNLSSLATDAEIPVRTLPPY LDLLETLYLIDRIPAWSTNLSKRVVDRPKVLLLDSGLAARLVNVSPTGAGPHANPNAA GAIIETFVIAELRRQLGWSQQAPRLFHYRDRDGAEVDLILETADGLIAAIEIKSAATL RGRDTRSISRLRDKVGARFAGGVILHTGPQAQPFGDRLAAVPIDILWSPSG" CDS 3635001..3635666 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3323" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3323, -, len: 221 aa. Equivalent to Rv3295, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 221 aa overlap). Probable transcriptional regulator tetR-family, equivalent to Q9CCL4|ML0717 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (223 aa), FASTA scores: opt: 1260, E(): 7.2e-75, (85.45% identity in 220 aa overlap). Also highly similar to other streptomyces regulators e.g. Q9RD77|SCF43.11 from Streptomyces coelicolor (205 aa), FASTA scores: opt: 442, E(): 9.8e-22, (38.6% identity in 202 aa overlap); Q9RKY8|SC6D7.09 from Streptomyces coelicolor (220 aa), FASTA scores: opt: 215, E(): 5.9e-07, (31.85% identity in 135 aa overlap); Q9L0U5|SCD35.06 from Streptomyces coelicolor (240 aa), FASTA scores: opt: 214, E(): 7.4e-07, (28.2% identity in 156 aa overlap); etc. SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Contains potential helix-turn-helix motif at aa 33-54 (+4.42 SD). Protein product from Mb3323 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3323 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Q4" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Q4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU01952.1" /translation="MATARRRLSPQDRRAELLALGAEVFGKRPYDEVRIDEIAERAGV SRALMYHYFPDKRAFFAAVVKDEADRLYAATNKAPAPGMTMFEEIRTGVLAYMAYHQQ NPEAAWAAYVGLGRSDPVLLGIDDEAKNRQMEHIMSRIAEVVSGIDRDNTLDPEVERD LRVIIHGWLAFTFELCRQRIMDPSTDAERLADACAHALLDAISRLPQIPAELADAMAT ARM" CDS 3635710..3640251 /codon_start=1 /transl_table=11 /gene="lhr" /locus_tag="BQ2027_MB3324" /product="PROBABLE ATP-DEPENDENT HELICASE LHR (LARGE HELICASE-RELATED PROTEIN)" /note="Mb3324, lhr, len: 1513 aa. Equivalent to Rv3296, len: 1512 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1512 aa overlap). Probable lhr, ATP-dependent helicase (EC 3.6.1.-), similar to others e.g. P30015|LHR_ECOLI|RHLF|B1653 from Escherichia coli stain K12 (1538 aa), FASTA scores: opt: 2930, E(): 1.5e-159, (47.55% identity in 1569 aa overlap); AAG56642|LHR from Escherichia coli stain O157:H7 EDL933 (1538 aa), FASTA scores: opt: 2930, E(): 1.5e-159, (47.6% identity in 1561 aa overlap); O86821|SC7C7.16c from Streptomyces coelicolor (1690 aa), FASTA scores: opt: 2919, E(): 7e-159, (53.55% identity in 1703 aa overlap); Q9HYW9|PA3272 from Pseudomonas aeruginosa (1448 aa), FASTA scores: opt: 907, E(): 6.2e-44, (35.85% identity in 1512 aa overlap); etc. SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY AND TO HELICASE C-TERMINAL DOMAIN. Contains PS00017 ATP/GTP-binding site motif A and possible helix-turn-helix motif. Protein product from Mb3324 detected using SWATH mass spectrometry. Mb3324 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3R0" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR013701" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R0" /protein_id="SIU01953.1" /translation="MRFAQPSALSRFSALTRDWFTSTFAAPTAAQASAWAAIADGDNT LVIAPTGSGKTLAAFLWALDSLAGSEPMSERPAATRVLYVSPLKALAVDVERNLRTPL AGLTRLAERQGLPAPQIRVGVRSGDTPPALRRQLVSQPPDVLITTPESLFLMLTSAAR QTLTGVQTVIIDEIHAIAATKRGAHLALSLERLDDLSSRRRAQRIGLSATVRPPEELA RFLSGQSPTTIVAPPAAKTVELSVQVPVPDMANLTDNTIWPDVEARLVDLIESHNSTI VFANSRRLAERLTARLNEIHAARCGIELAPDTNQQVAGGAPAHIMGSGQTFGAPPVLA RAHHGSISKEQRAVVEEDLKRGQLKAVVATSSLELGIDMGAVDLVIQVQAPPSVASGL QRIGRAGHQVGEISRGVLFPKHRTDLLGCAVSVQRMLAGEIETMRVPANPLDILAQHT VAAAALEPLDADAWFDTVRRAAPFATLPRSLFEATLDLLSGKYPSTEFAELRPRLVYD RDTGTLTARPGAQRLAVTSGGAIPDRGLFAVYLATERPSRVGELDEEMVYESRPGDVI SLGATSWRITEITHDRVLVIPAPGQPARLPFWRGDDAGRPAELGAALGALTGELAALD RTAFGTRCAGLGFDDYATDNLWRLLDDQRTATAVVPTDSTLLVERFRDELGDWRVILH SPYGLRVHGPLALAVGRRLRDRYGIDEKPTASDNGIMVRLPDTVSAGEDSPPGAELFV FDADEIDPIVTTEVAGSALFASRFRESAARALLLPRRHPGRRSPLWQQRQRAARLLEV ARKYPDFPIVLETVRECLQNVYDVPILVELMARIAQRRVRVAEAETAKPSPFAASLLF GYVGAFMYEGDTPLAERRAAALALDGTLLAELLGRVELRELLDPDVIAATSRQLQHLA ADRVARDAEGVADLLRLLGPLTEDEIAARAGAPEVSGWLDGLRAAKRALVVSFAGRSW WVAVEDMGRLRDGVGAAVPVGLPASFTEAVADPLGELLGRYARTHTPFTTAAAAARFG LGLRVTADVLGRLASDGRLVRGEFVAAAEGSAGGEQWCDAEVLRILRRRSLAALRAQA EPVSTAAYGRFLPAWQHVSAGNSGIDGLAAVIDQLAGVRIPASAIEPLVLAPRIRDYS PAMLDELLASGDVTWSGAGSISGSDGWIALHPADSAPMTLAEPAEIDFTDAHRAILAS LGTGGAYFFRQLTHDGLTEAELKAALWELIWAGRVTGDTFAPVRAVLGGAGTRKRAAP AHGGHRPPRLSRYRLTHAQARNADPTVAGRWSALPLPEPDSTLRAHYQAELLLNRHGV LTKDAVAAEGVAGGFATLYKVLSAFEDAGRCQRGYFIESLGGAQFAVASTVDRLRSYL DGVDPEQPDYHAVVLAAADPANPYGAALPWPASSADGTARPGRKAGALVVLVDGELAW FLERGGRSLLTFTDDPEANHAAAIGLADLVTAGRVASILVERADGMPVLQPGGRASAA LTALLAAGFVRTPRGLRRR" CDS 3640255..3641022 /codon_start=1 /transl_table=11 /gene="nei" /locus_tag="BQ2027_MB3325" /product="PROBABLE ENDONUCLEASE VIII NEI" /note="Mb3325, nei, len: 255 aa. Equivalent to Rv3297, len: 255 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 255 aa overlap). Probable nei, endonuclease VIII (EC 3.2.-.-), similar to others e.g. O86820|END8_STRCO|NEI|SC7C7.15c from Streptomyces coelicolor (276 aa), FASTA scores: opt: 770, E(): 1.2e-42, (50.35% identity in 268 aa overlap); P50465|END8_ECOLI|NEI|B0714 from Escherichia coli strain K12 (262 aa), FASTA scores: opt: 310, E(): 6.3e-13, (28.1% identity in 267 aa overlap); AAG55037|NEI from Escherichia coli strain O157:H7 EDL933 (263 aa), FASTA scores: opt: 301, E(): 2.4e-12, (27.7% identity in 267 aa overlap); etc. BELONGS TO THE FPG FAMILY. Mb3325 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64157" /db_xref="InterPro:IPR000214" /db_xref="InterPro:IPR010979" /db_xref="InterPro:IPR012319" /db_xref="InterPro:IPR015886" /db_xref="InterPro:IPR015887" /db_xref="InterPro:IPR035937" /db_xref="UniProtKB/Swiss-Prot:P64157" /protein_id="SIU01954.1" /translation="MPEGDTVWHTAATLRRHLAGRTLTRCDIRVPRFAAVDLTGEVVD EVISRGKHLFIRTGTASIHSHLQMDGSWRVGNRPVRVDHRARIILEANQQEQAIRVVG VDLGLLEVIDRHNDGAVVAHLGPDLLADDWDPQRAAANLIVAPDRPIAEALLDQRVLA GIGNVYCNELCFVSGVLPTAPVSAVADPRRLVTRARDMLWVNRFRWNRCTTGDTRAGR RLWVYGRAGQGCRRCGTLIAYDTTDERVRYWCPACQR" CDS complement(3641045..3641959) /codon_start=1 /transl_table=11 /gene="lpqC" /locus_tag="BQ2027_MB3326C" /product="POSSIBLE ESTERASE LIPOPROTEIN LPQC" /note="Mb3326c, lpqC, len: 304 aa. Equivalent to Rv3298c, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 304 aa overlap). Possible lpqC, esterase lipoprotein (EC 3.1.-.-), equivalent to Q9CCL5|LPQC|ML0715 PUTATIVE SECRETED HYDROLASE from Mycobacterium leprae (304 aa), FASTA scores: opt: 1543, E(): 1.3e-87, (71.6% identity in 303 aa overlap); and Q49658|B1308_F2_43 TUBULIN FAMILY PROTEIN from Mycobacterium leprae (302 aa), FASTA scores: opt: 1541, E(): 1.7e-87, (72.0% identity in 300 aa overlap). Also similar to Q9I5Z3|PA0543 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (322 aa), FASTA scores: opt: 439, E(): 8.9e-20, (32.3% identity in 319 aa overlap); Q9F2K9|SCH63.19c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (348 aa), FASTA scores: opt: 394, E(): 5.5e-17, (30.25% identity in 334 aa overlap); etc. And similar to O86367|LPQP|Rv0671|MTCI376.03c from Mycobacterium tuberculosis strain H37Rv (280 aa), FASTA scores: opt: 519, E(): 9.8e-25, (39.25% identity in 275 aa overlap). Probably lipoprotein, esterase membrane-bound, with 18 aa signal sequence as it contains appropriately positioned (PS00013) Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3326c detected using SWATH mass spectrometry. Mb3326c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5I1" /db_xref="InterPro:IPR010126" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I1" /protein_id="SIU01955.1" /translation="MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSY RLHVPPAEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADGRG ASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRLACDRADIFAA VAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAVRGRGGLSHSISVASLVDR WRAVDGCQGDPSAAELPDVGDGTMVHLFDSSSCAAGTEVISYQIDNGGHTWPGGRQYL PKAVIGATTRAFDGSQVIAQFFATHGRD" CDS complement(3641986..3644898) /codon_start=1 /transl_table=11 /gene="atsB" /locus_tag="BQ2027_MB3327C" /product="PROBABLE ARYLSULFATASE ATSB (ARYL-SULFATE SULPHOHYDROLASE) (SULFATASE)" /note="Mb3327c, atsB, len: 970 aa. Equivalent to Rv3299c, len: 970 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 970 aa overlap). Probable atsB, arylsulfatase (EC 3.1.6.1), similar to P51691|ARS_PSEAE|ATSA|PA0183 (alias CAA88421|ATSA) from Pseudomonas aeruginosa (535 aa), FASTA scores: opt: 645, E(): 5.8e-31, (32.0% identity in 550 aa overlap); Q9L4Y2|ATSA from Klebsiella pneumoniae (577 aa), FASTA scores: opt: 504, E(): 1.7e-22, (26.3% identity in 566 aa overlap); and P20713|ATSA|ARS_KLEAE (precursor) from Klebsiella pneumoniae (464 aa), FASTA scores: opt: 502, E(): 1.8e-22, (26.85% identity in 451 aa overlap). Also similar to Mycobacterium tuberculosis proteins O06776|MTI376.13c|ATSD|Rv0663 (787 aa) (43.6% identity in 796 aa overlap) and P95059|MTCY210.30|ATSA|R0711 (787 aa) (38.4% identity in 797 aa overlap). Equivalent to AAK47741 from Mycobacterium tuberculosis strain CDC1551 (992 aa) but shorter 22 aa. Contains PS00523 Sulfatases signature 1 and PS01095 Chitinases family 18 active site signature. BELONGS TO THE SULFATASE FAMILY. Protein product from Mb3327c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3327c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4G2" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR009200" /db_xref="InterPro:IPR017850" /db_xref="InterPro:IPR024607" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G2" /protein_id="SIU01956.1" /translation="MMSEDNALVLVAGYQDLDSARHDFQTLVDAAKDKSIPLQGAVLI GKDAEGSPVLVDTGNRLGRRGAAWGAGVGLAIGLFSPALLASAALGAATGALAGTFAH HRIKTGLADKIGQALAAGRAVVIAVTEAQGRLEAGQALASSPMKSVAELSRSTLRSLG AALREAMGKFNPDRTRLPLPQRRFGGVVGRTMAESVGDWSIVPSPFPPDDAPNVLIVL IDDAGFGGPDTFGGAIRTPTLSRLAQNGLIYNRFHVTAVCSPTRAALLTGRNHHRVGF GSVCEFPGPYPGYSAVRPRSCAALPRILRDNGYVTGAFGKWHLTPDNVQGAAGPFDNW PLGWGFDHFWGFPSGAAGQYDPIISQDNSVIGIPEGSGEDGRPYYFPDDLTDKAIEWL HTVRAQNATKPWMLYYATGATHAPHHVFKEWADKYRGEFDDGWDVYRQKTFERQKRLG IIPPDAELTERPDLFPAWDSMSEAQKRLFARQMEVFAGFSENADWNVGRLLDAIEDLG ESDNTLVFYIWGDNGASMEGTNTGSFNEMTFLNGLDLDAERQLELIEQYGGIAALGDE FTAPHFASAWAHASNTPLQWGKQMASHLGGTRDPLVVAWPARIRPDGRVRSQFTHCID IAPTVLAAIGLPEPTHVDGFEQEPMDGTSFVRTFDDAEAEDRHTVQYFENFGSRAIYK DGWWACARLDKAPWDLSPETMRRFAPGTYDPDQDVWELYYLPDDFSQAKNLAAEHPDK VAELTQLWWQEAERNRVLPLLGGLAVMFGDLPPLPTTARFSFKGDVQNIQRGMVPRIC GRSYAIEARLHIPDGGAQGVIVANADFMGGFALWVDEQRHLHHTYSFLGVETYRQVSS EPLPTGDVTVRMLFDSHQPVAASGGRVTLWADDRLIGEGELPQTVPLAFTSYAGMDIG RDNGLVVDRGYEDKAPYAFTGTVTEVIFDLKPVHPEAARALHEHASVQAVGQGAAG" CDS complement(3644918..3645835) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3328C" /product="Pseudouridine synthase (EC , PA2043 type" /EC_number="4.2.1.70" /note="Mb3328c, -, len: 305 aa. Equivalent to Rv3300c, len: 305 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 305 aa overlap). Conserved hypothetical protein, similar to various proteins (notably pseudoridine synthase family proteins) e.g. Q9RJ76|SCI41.08 PUTATIVE RIBOSOMAL PSEUDOURIDINE SYNTHASE from Streptomyces coelicolor (324 aa), FASTA scores: opt: 876, E(): 4.5e-48, (52.1% identity in 313 aa overlap); Q9I272|PA2043 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (300 aa), FASTA scores: opt: 676, E(): 1.8e-35, (42.55% identity in 268 aa overlap); Q9JZW8|NMB0867 YABO/YCEC/SFHB FAMILY PROTEIN from Neisseria meningitidis (serogroup B) (307 aa), FASTA scores: opt: 597, E(): 1.8e-30, (42.9% identity in 282 aa overlap); Q9JUY2|NMA1085 HYPOTHETICAL PROTEIN from Neisseria meningitidis (serogroup A) (307 aa), FASTA scores: opt: 597, E(): 1.8e-30, (42.9% identity in 282 aa overlap); Q12362|RIB2_YEAST|RIB2|YOL066C DRAP DEAMINASE (PSEUDOURIDINE SYNTHASE FAMILY PROTEIN) from Saccharomyces cerevisiae (Baker's yeast) (591 aa), FASTA scores: opt: 338, E(): 6.9e-14, (32.95% identity in 246 aa overlap); Q9RTS2|DR1684 PUTATIVE PSEUDOURIDINE SYNTHASE from Deinococcus radiodurans (321 aa), FASTA scores: opt: 319, E(): 6.5e-13, (32.75% identity in 235 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10786|Y04P_MYCTU|MTCY48.25c|Rv1540|MT1592 (308 aa) (28.8% identity in 299 aa overlap). Mb3328c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3U6" /db_xref="InterPro:IPR006145" /db_xref="InterPro:IPR006224" /db_xref="InterPro:IPR020103" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U6" /protein_id="SIU01957.1" /translation="MALRPEDRLLSVHDVLGPVRVRLLGGSVLAELTARFGVAARAKV LAGEVVDNDGAVVDSGTVLPPGSVVHLYRDLPDEVPVPFDVPVLHQDADIVVVDKPHF LATMPRGRHVAQTALVRLRRELGLPELSPAHRLDRLTAGVLLFTTRREVRGSYQTMFA RGLVRKTYLARAPVAPGLALPRLVRSRIVKRRGHLQAVCEPGVPNAETLVERIARDGL YRLTPTTGRTHQLRVHMAALGIPIMGDPLYPNVISVAAHDFSTPLQLLAQRIEFDDPL TGSHREFASTRTLTGATLPTWSAAADCRP" CDS complement(3645847..3646512) /codon_start=1 /transl_table=11 /gene="phoY1" /locus_tag="BQ2027_MB3329C" /product="PROBABLE PHOSPHATE-TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATORY PROTEIN PHOU HOMOLOG 1 PHOY1" /note="Mb3329c, phoY1, len: 221 aa. Equivalent to Rv3301c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 221 aa overlap). Probable phoY1, phosphate-transport system regulatory protein, highly similar to Q50047|phoY|PHOU1|PHOY1|ML2188 PHOSPHATE TRANSPORT SYSTEM PROTEIN PHOU HOMOLOG 1 from Mycobacterium leprae (222 aa), FASTA scores: opt: 929, E(): 7.8e-51, (61.45% identity in 218 aa overlap). Also highly similar to Q9FCE2|2SCD46.42c PUTATIVE REGULATORY PROTEIN (FRAGMENT) from Streptomyces coelicolor (123 aa), FASTA scores: opt: 324, E(): 1.8e-13, (43.65% identity in 103 aa overlap); Q9L0R3|SCD8A.01c PUTATIVE PHOSPHATE TRANSPORT SYSTEM REGULATORY PROTEIN (FRAGMENT) from Streptomyces coelicolor (139 aa), FASTA scores: opt: 309, E(): 1.7e-12, (36.7% identity in 139 aa overlap); Q52989|PHOU_RHIME PHOSPHATE TRANSPORT SYSTEM PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (237 aa), FASTA scores: opt: 292, E(): 3.1e-11, (26.3% identity in 213 aa overlap); etc. And highly similar to Mycobacterium tuberculosis O53833|PHU2_MYCTU|MTV043_13c|PHOU2|PHOY2|Rv0821c|MT0843 PHOSPHATE TRANSPORT SYSTEM PROTEIN PHOU HOMOLOG 2 (213 aa) (63.4% identity in 213 aa overlap). BELONGS TO THE PHOU FAMILY. Protein product from Mb3329c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3329c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65719" /db_xref="InterPro:IPR026022" /db_xref="InterPro:IPR028366" /db_xref="InterPro:IPR038078" /db_xref="UniProtKB/Swiss-Prot:P65719" /protein_id="SIU01958.1" /translation="MRTVYHQRLTELAGRLGEMCSLAGIAMKRATQALLEADIGAAEQ VIRDHERIVAMRAQVEKEAFALLALQHPVAGELREIFSAVQIIADTERMGALAVHIAK ITRREYPNQVLPEEVRNCFADMAKVAIALGDSARQVLVNRDPQEAAQLHDRDDAMDDL HRHLLSVLIDREWRHGVRVGVETALLGRFFERFADHAVEVGRRVIFMVTGVLPTEDEI STY" CDS complement(3646620..3648377) /codon_start=1 /transl_table=11 /gene="glpD2" /locus_tag="BQ2027_MB3330C" /product="PROBABLE GLYCEROL-3-PHOSPHATE DEHYDROGENASE GLPD2" /note="Mb3330c, glpD2, len: 585 aa. Equivalent to Rv3302c, len: 585 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 585 aa overlap). Probable glpd2, glycerol-3-phosphate dehydrogenase (EC 1.1.99.5), equivalent to P53435|GLPD_MYCLE|ML0713|L308_C1_179 GLYCEROL-3-PHOSPHATE DEHYDROGENASE (EC 1.1.99.5) from Mycobacterium leprae (585 aa), FASTA scores: opt: 3489, E(): 2.2e-198, (90.75% identity in 584 aa overlap). Also highly similar to many e.g. Q9L0I3|SCD63.06 from Streptomyces coelicolor (568 aa), FASTA scores: opt: 2203, E(): 1.6e-122, (59.95% identity in 564 aa overlap); Q9RVK8|DR1019 from Deinococcus radiodurans (522 aa), FASTA scores: opt: 949, E(): 1.4e-48, (37.0% identity in 538 aa overlap); BAB53412|MLR7270 from Rhizobium loti (Mesorhizobium loti) (505 aa), FASTA scores: opt: 861, E(): 2.2e-43, (37.3% identity in 488 aa overlap); P18158|GLPD_BACSU from B. subtilis (555 aa), FASTA scores: opt: 768, E(): 7.2e-38, (32.85% identity in 484 aa overlap); etc. Also similar to Mycobacterium tuberculosis protein Q10502|GLPD_MYCTU|MTCY427_31c|Rv2249c GLYCEROL-3-PHOSPHATE DEHYDROGENASE (516 aa), FASTA scores: opt: 843, E(): 2.6e-42, (36.5% identity in 515 aa overlap). Contains PS00978 FAD-dependent glycerol-3-phosphate dehydrogenase signature 2. COFACTOR: FAD (BY SIMILARITY). BELONGS TO THE FAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY. Protein product from Mb3330c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3330c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64185" /db_xref="InterPro:IPR000447" /db_xref="InterPro:IPR006076" /db_xref="InterPro:IPR031656" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR038299" /db_xref="UniProtKB/Swiss-Prot:P64185" /protein_id="SIU01959.1" /translation="MSNPIQAPDGGQGWPAAALGPAQRAVAWKRLGTEQFDVVVIGGG VVGSGCALDAATRGLKVALVEARDLASGTSSRSSKMFHGGLRYLEQLEFGLVREALYE RELSLTTLAPHLVKPLPFLFPLTKRWWERPYIAAGIFLYDRLGGAKSVPAQRHFTRAG ALRLSPGLKRSSLIGGIRYYDTVVDDARHTMTVARTAAHYGAVVRCSTQVVALLREGD RVIGVGVRDSENGAVAEVRGHVVVNATGVWTDEIQALSKQRGRFQVRASKGVHVVVPR DRIVSDVAMILRTEKSVMFVIPWGSHWIIGTTDTDWNLDLAHPAATKADIDYILGTVN AVLATPLTHADIDGVYAGLRPLLAGESDDTSKLSREHAVAVPAAGLVAIAGGKYTTYR VMAADAIDAAVQFIPARVAPSITEKVSLLGADGYFALVNQAEHVGALQGLHPYRVRHL LDRYGSLISDVLAMAASDPSLLSPITEAPGYLKVEAAYAAAAEGALHLEDILARRMRI SIEYPHRGVDCAREVAEVVAPVLGWTAADIDREVANYMARVEAEVLSQAQPDDVSADM LRASAPEARAEILEPVPLD" CDS complement(3648458..3649873) /codon_start=1 /transl_table=11 /gene="lpdA" /locus_tag="BQ2027_MB3331C" /product="nad(p)h quinone reductase lpda" /note="Mb3331c, lpdA, len: 471 aa. Equivalent to Rv3303c, len: 493 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 471 aa overlap). Probable lpdA, dihydrolipoamide dehydrogenase (EC 1.8.1.4), similar to other e.g. Q9EWV3|2SCK31.22c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (475 aa), FASTA scores: opt: 1420, E(): 2.4e-77, (54.9% identity in 471 aa overlap); Q9A7J2|CC1731 LIPOAMIDE DEHYDROGENASE (E3 COMPONENT,PYRUVATE DEHYDROGENASE COMPLEX) from Caulobacter crescentus (466 aa), FASTA scores: opt: 696, E(): 3.6e-34, (29.6% identity in 463 aa overlap); Q04829|LPD|DLDH_HALVO DIHYDROLIPOAMIDE DEHYDROGENASE from Halobacterium volcanii (Haloferax volcanii) (474 aa), FASTA scores: opt: 675, E(): 6.5e-33, (29.3% identity in 471 aa overlap); P50970|DLDH_ZYMMO|LPD DIHYDROLIPOAMIDE DEHYDROGENASE from Zymomonas mobilis, FASTA scores: opt: 658, E(): 6.6e-32, (30.4% identity in 464 aa overlap); etc. BELONGS TO THE PYRIDINE NUCLEOTIDE-DISULFIDE OXIDOREDUCTASES CLASS-I. COFACTOR: FAD (BY SIMILARITY). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, truncation due to a single base transversion (c-a) results in a shorter product compared to its homolog in Mycobacterium tuberculosis stain H37Rv (471 aa versus 493 aa). Protein product from Mb3331c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3331c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3R8" /db_xref="InterPro:IPR001100" /db_xref="InterPro:IPR004099" /db_xref="InterPro:IPR016156" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R8" /protein_id="SIU01960.1" /translation="MVTRIVILGGGPAGYEAALVAATSHPETAQVTVIDCDGIGGAAV LDDCVPSKTFIASTGLRTELRRAPHLGFHIDFDDAKISLPQIHARVKTLAAAQSADIT AQLLSMGVQVIAGRGELIDSTPGLARHRIKATAADGSTSEHEADVVLVATGASPRILP SAQPDGERILTWRQLYDLDALPDHLIVVGSGVTGAEFVDAYTELGVPVTVVASQDHVL PYEDADAALVLEESFAERGVRLFKNARAASVTRTGAGVLVTMTDGRTVEGSHALMTIG SVPNTSGLGLERVGIQLGRGNYLTVDRVSRTSATGIYAAGDCTGLLPLASVAAMQGRI AMYHALGEGVSPIRLRTVAATVFTRPEIAAVGVPQSVIDAGSVAARTIMLPLRTNARA KMSEMRHGFVKIFCRRSTGVVIGGVVVAPIASELILPIAVAVQNRITVNELAQTLAVY PSLSGSITEAARRLMAHDDLD" CDS 3650076..3650555 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3332" /product="AIG2-like domain protein" /note="Mb3332, -, len: 159 aa. Equivalent to Rv3304, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). Hypothetical conserved protein, very similar to Q9CCL6|ML0711 HYPOTHETICAL PROTEIN from Mycobacterium leprae (159 aa), FASTA scores: opt: 1041, E(): 6.1e-62, (91.8% identity in 159 aa overlap); and Q49927|L308_F3_97 from M. leprae (174 aa), FASTA scores: opt: 974, E(): 1.8e-57, (91.2% identity in 149 aa overlap) . Also highly similar to Q9AD81|SCK13.10c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (145 aa), FASTA scores: opt: 615, E(): 7.8e-34, (60.55% identity in 147 aa overlap); and shows some similarity to other various hypotheticals proteins. ORF continues upstream with possible start at 2198 (equivalent to AAK47746 from Mycobacterium tuberculosis strain CDC1551 (212 aa) but shorter 53 aa). Protein product from Mb3332 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3332 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3R2" /db_xref="InterPro:IPR013024" /db_xref="InterPro:IPR017939" /db_xref="InterPro:IPR036568" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R2" /protein_id="SIU01961.1" /translation="MPLYAAYGSNMHPEQMLERAPHSPMAGTGWLPGWRLTFGGEDIG WEGALATVVEDPDSKVFVVLYDMTPADEKNLDRWEGSEFGIHQKIRCRVERISSDTTT DPVLAWLYVLDAWEGGLPSARYLGVMADAAEIAGAPSDYVHDLRTRPARNIGPGTIA" CDS complement(3650574..3651743) /codon_start=1 /transl_table=11 /gene="amiA1" /locus_tag="BQ2027_MB3333C" /standard_name="amiA" /product="possible n-acyl-l-amino acid amidohydrolase amia1 (n-acyl-l-amino acid aminohydrolase)" /note="Mb3333c, amiA1, len: 389 aa. Equivalent to Rv3305c, len: 389 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 389 aa overlap). Possible amiA1, N-acyl-L-amino acid amidohydrolase (or peptidase) (EC 3.5.1.-), similar to many proteins e.g. Q9AK43|2SCK8.09 PUTATIVE PEPTIDASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1015, E(): 3.9e-54, (50.8% identity in 374 aa overlap); Q9UZ30|PAB0873 AMINO ACID AMIDOHYDROLASE from Pyrococcus abyssi (383 aa), FASTA scores: opt: 823, E(): 1.6e-42, (38.2% identity in 369 aa overlap); O58453|PH0722 LONG HYPOTHETICAL AMINO ACID AMIDOHYDROLASE from Pyrococcus horikoshii (388 aa), FASTA scores: opt: 815, E(): 4.8e-42, (38.75% identity in 369 aa overlap); O34980|YTNL_BACSU HYPOTHETICAL 45.2 KDA PROTEIN from B. subtilis (416 aa), FASTA scores: opt: 805, E(): 2.1e-41, (37.85% identity in 367 aa overlap); Q9KCF8|BH1613 N-ACYL-L-AMINO ACID AMIDOHYDROLASE from Bacillus halodurans (404 aa), FASTA scores: opt: 795, E(): 8.1e-41, (37.7% identity in 382 aa overlap); BAB50445|MLR3583 HYPOTHETICAL HIPPURATE HYDROLASE from Rhizobium loti (Mesorhizobium loti) (387 aa), FASTA scores: opt: 761, E(): 8.9e-39, (37.65% identity in 385 aa overlap); Q9RXH4|DR0339 PUTATIVE N-ACYL-L-AMINO ACID AMIDOHYDROLASE from Deinococcus radiodurans (392 aa), FASTA scores: opt: 745, E(): 8.4e-38, (36.15% identity in 379 aa overlap); etc. Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. Note that previously known as amiA. Protein product from Mb3333c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3333c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3R4" /db_xref="InterPro:IPR002933" /db_xref="InterPro:IPR017439" /db_xref="InterPro:IPR036264" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3R4" /protein_id="SIU01962.1" /translation="MSLADAAESWLAAHHDDLVGWRRHIHRYPELGRQEYATTQFVAE RLADAGLNPKVLPGGTGLTCDFGPQHQPRIALRADMDALPMAERTGAPYASTMPNVAH ACGHDAHTAILLGAALALASVPELPVGVRLIFQAAEELMPGGAIDAIAAGALAGVSRI FALHCDPRLEVGKVAVRQGPITSAADSIEITLYSPGGHTSRPHLTADLVYGLGTLVTG LPGVLSRRIDPRNSTVLVWGAVNAGMAANAIPQTGVLSGTVRTASRQTWVDLEELVRQ AISALLLPLAIEHTLQYRRGVPPVVNEEISTRILAHAIEAIGPGVLADTRQSGGGEDF SWYLEEVPGAMARLGVWSGDGLQLDLHQPTFDIDERALAIGLRVMVNIIEQAAAH" CDS complement(3651740..3652924) /codon_start=1 /transl_table=11 /gene="amiB1" /locus_tag="BQ2027_MB3334C" /standard_name="amiB" /product="probable amidohydrolase amib1 (aminohydrolase)" /note="Mb3334c, amiB1, len: 394 aa. Equivalent to Rv3306c, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 394 aa overlap). Probable amiB1, aminohydrolase (EC 3.5.1.-), similar to several belonging to peptidase family M40 (and to hypothetical proteins) e.g. P54983|AMHX_BACSU AMIDOHYDROLASE AMHX from Bacillus subtilis (EC 3.5.1.-) (389 aa), FASTA scores: opt: 286, E(): 9.9e-10, (26.6% identity in 351 aa overlap); P76052|ABGB_ECOLI Aminobenzoyl-glutamate utilizatio from Escherichia coli (481 aa), FASTA scores: opt: 383, E(): 2.1e-15, (30.5% identity in 328 aa overlap); P44765|YDAJ_HAEIN HYPOTHETICAL PROTEIN HI0584 from Haemophilus influenzae (423 aa), FASTA scores: opt: 297, E(): 2.4e-10, (29.6% identity in 274 aa overlap). Note that previously known as amiB. Protein product from Mb3334c detected using SWATH mass spectrometry. Mb3334c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3S2" /db_xref="InterPro:IPR002933" /db_xref="InterPro:IPR011650" /db_xref="InterPro:IPR017144" /db_xref="InterPro:IPR017439" /db_xref="InterPro:IPR036264" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S2" /protein_id="SIU01963.1" /translation="MPAASASDRVEELVRRRGGELVELSHAIHAEPELAFAEHRSCAK AQALVAERGFEITTAAGGLDTAFRADYGSGPLVVGVCAEYDALPGIGHACGHNIIAAS AVGTALALAEVADDLGLTVALLGTPAEESGGGKALMLQAGTFDDVAVAVMVHPGPTDI AGARSLALSEVTVRYRGKESHAAVAPHLGVNAADAVTVAQVAIGVLRQQLAPGQMVHG IVTDGGQAVNVIPGQARLQYAMRAVESDSLRELQTRMFACFAAGALAAGCEYEIDEAA PAYAELKPDPWLADVCREEMQRLGREPLLPALEAELPLGSTDMGNVTQVLPGIHPVIG LDAGAATVHQRAFTVASAGASADRAVVDGAIMLARTVVRLAQTPDERDRVLAAQQRRA AR" CDS 3652989..3653795 /codon_start=1 /transl_table=11 /gene="deoD" /locus_tag="BQ2027_MB3335" /standard_name="punA" /product="PROBABLE PURINE NUCLEOSIDE PHOSPHORYLASE DEOD (INOSINE PHOSPHORYLASE) (PNP)" /note="Mb3335, deoD, len: 268 aa. Equivalent to Rv3307, len: 268 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 268 aa overlap). Probable deoD (alternate gene name: punA), purine nucleoside phosphorylase (EC 2.4.2.1), similar to others especially P46862|PUNA_MYCLE|DEOD_MYCLE|ML0707|L308_F2_56 from M. leprae (268 aa), FASTA scores: opt: 1373, E(): 1.5e-74, (82.05% identity in 262 aa overlap); Q9EWV2|2SCK31.24 from Streptomyces coelicolor (274 aa), FASTA scores: opt: 1026, E(): 6.4e-54, (60.5% identity in 266 aa overlap); P81989|PUNA_CELSP from Cellulomonas sp (282 aa), FASTA scores: opt: 963, E(): 3.6e-50, (58.9% identity in 270 aa overlap); Q9X1T2|TM1596 from Thermotoga maritima (265 aa), FASTA scores: opt: 584, E(): 1.1e-27, (39.55% identity in 263 aa overlap); etc. BELONGS TO THE PNP/MTAP FAMILY 2 OF PHOSPHORYLASES. Protein product from Mb3335 detected using SWATH mass spectrometry. Mb3335 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A539" /db_xref="InterPro:IPR000845" /db_xref="InterPro:IPR011268" /db_xref="InterPro:IPR011269" /db_xref="InterPro:IPR018099" /db_xref="InterPro:IPR035994" /db_xref="UniProtKB/Swiss-Prot:P0A539" /protein_id="SIU01964.1" /translation="MADPRPDPDELARRAAQVIADRTGIGEHDVAVVLGSGWLPAVAA LGSPTTVLPQAELPGFVPPTAAGHAGELLSVPIGAHRVLVLAGRIHAYEGHDLRYVVH PVRAARAAGAQIMVLTNAAGGLRADLQVGQPVLISDHLNLTARSPLVGGEFVDLTDAY SPRLRELARQSDPQLAEGVYAGLPGPHYETPAEIRMLQTLGADLVGMSTVHETIAARA AGAEVLGVSLVTNLAAGITGEPLSHAEVLAAGAASATRMGALLADVIARF" CDS 3653799..3655403 /codon_start=1 /transl_table=11 /gene="pmmB" /locus_tag="BQ2027_MB3336" /product="PROBABLE PHOSPHOMANNOMUTASE PMMB (PHOSPHOMANNOSE MUTASE)" /note="Mb3336, pmmB, len: 534 aa. Equivalent to Rv3308, len: 534 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 534 aa overlap). Probable pmmB, phosphomannomutase (EC 5.4.2.8), equivalent to Q9CCL7|PMMB|ML0706 PUTATIVE PHOSPHO-SUGAR MUTASE from Mycobacterium leprae (538 aa), FASTA scores: opt: 2681, E(): 1.4e-150, (76.95% identity in 538 aa overlap). Also similar to others e.g. Q9AD82|SCK13.08c from Streptomyces coelicolor (549 aa), FASTA scores: opt: 1378, E(): 8.9e-74, (46.7% identity in 529 aa overlap); Q9ZHL4|PMM (FRAGMENT so no homology at N-terminus for this one) from Haemophilus ducreyi (443 aa), FASTA scores: opt: 935, E(): 9.6e-48, (39.4% identity in 449 aa overlap); P18159|YHXB_BACSU from Bacillus subtilis (565 aa), FASTA scores: opt: 776, E(): 2.7e-38, (31.7% identity in 574 aa overlap); etc. Contains PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature. BELONGS TO THE PHOSPHOHEXOSE MUTASES FAMILY. Protein product from Mb3336 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3336 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J3" /db_xref="InterPro:IPR005841" /db_xref="InterPro:IPR005843" /db_xref="InterPro:IPR005844" /db_xref="InterPro:IPR005845" /db_xref="InterPro:IPR005846" /db_xref="InterPro:IPR016055" /db_xref="InterPro:IPR016066" /db_xref="InterPro:IPR036900" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J3" /protein_id="SIU01965.1" /translation="MTPENWIAHDPDPQTAAELAACGPDELKARFSRPLAFGTAGLRG HLRGGPDAMNLAVVLRATWAVARVLTDRGLAGSPVIVGRDARHGSPAFAAAAAEVLAA AGFSVLLLPDPAPTPVVAFAVRHTGAAAGIQITASHNPATDNGYKVYVDGGLQLLAPT DRQIEAAMATAPPADQIARKTVNPSENRASDLIDRYIQRAAGVRRCAGSVRVALTPLH GVGGAMAVETLRRAGFTEVHTVATQFAPNPDFPTVTLPNPEEPGATDALLTLATDVDA DVAIALDPDADRCAVGIPTVSGWRMLSGDETGWLLGDYILSQTDDRASPPETRVVAST VVSSRMLAAIAAHHAAVHVETLTGFKWLARADANLPGTLVYAYEEAIGHCVDPTAVRD KDGISAAVLVCDLVAALKGQGRSVTDALDELARCYGVHEVAALSRPVGGAVETTDLMR RLREDPPRRLAGFPATVTDIGDTLILTGGDDNMLVRVAVRPSGTEPKLKCYLEIRCAV TGDLPAARQLVRARIDELSASVRRWW" CDS complement(3655405..3656121) /codon_start=1 /transl_table=11 /gene="upp" /locus_tag="BQ2027_MB3337C" /product="PROBABLE URACIL PHOSPHORIBOSYLTRANSFERASE UPP (UMP PYROPHOSPHORYLASE) (UPRTASE) (UMP DIPHOSPHORYLASE)" /note="Mb3337c, upp, len: 207 aa. Equivalent to Rv3309c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 207 aa overlap). Probable upp, uracil phosphoribosyltransferase (EC 2.4.2.9), identical to P94928|UPP uracil phosphoribosyltransferase from Mycobacterium bovis (207 aa). Also similar to others e.g. P36399|UPP_STRSL from Streptococcus salivarius (209 aa), FASTA scores: opt: 658, E(): 4.7e-35, (48.3% identity in 207 aa overlap); Q9A194|UPP|SPY0392 from Streptococcus pyogenes (209 aa), FASTA scores: opt:650, E(): 1.5e-34, (47.35% identity in 207 aa overlap); Q9RE01|UPP from Lactobacillus plantarum (209 aa), FASTA scores: opt: 644, E(): 3.7e-34, (46.4% identity in 207 aa overlap); etc. BELONGS TO THE UPRTASE FAMILY. Protein product from Mb3337c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3337c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A659" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR005765" /db_xref="InterPro:IPR029057" /db_xref="InterPro:IPR034332" /db_xref="UniProtKB/Swiss-Prot:P0A659" /protein_id="SIU01966.1" /translation="MDGVDRSRGWTHPYQPPFRGPSHDCYIGFNAVQVHVVDHPLAAA RLTTLRDERTDNAGFRAALRELTLLLIYEATRDAPCEPVPIRTPLAETVGSRLTKPPL LVPVLRAGLGMVDEAHAALPEAHVGFVGVARDEQTHQPVPYLDSLPDDLTDVPVMVLD PMVATGGSMTHTLGLLISRGAADITVLCVVAAPEGIAALQKAAPNVRLFTAAIDEGLN EVAYIVPGLGDAGDRQFGPR" CDS 3656133..3657032 /codon_start=1 /transl_table=11 /gene="sapm" /locus_tag="BQ2027_MB3338" /product="acid phosphatase (acid phosphomonoesterase) (phosphomonoesterase) (glycerophosphatase)" /note="Mb3338, -, len: 299 aa. Equivalent to Rv3310, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 299 aa overlap). Possible acid phosphatase (EC 3.1.3.2), similar to several fungal or bacterial acid phosphatases e.g. BAB50846|MLR4110 from Rhizobium loti (Mesorhizobium loti) (292 aa), FASTA scores: opt: 460, E(): 4.8e-22, (38.65% identity in 295 aa overlap); P34724|PHOA_ASPNG from Aspergillus niger (417 aa), FASTA scores: opt: 172, E(): 0.0013, (29.1% identity in 306 aa overlap); P08540|PHOX_KLULA from Kluyveromyces lactis (Yeast) (421 aa), FASTA scores: opt: 170, E(): 0.0018, (27.8% identity in 266 aa overlap); P37274|PHOA_PENCH from Penicillium chrysogenum (412 aa), FASTA scores: opt: 163, E(): 0.0049, (29.05% identity in 303 aa overlap); etc. Protein product from Mb3338 detected using SWATH mass spectrometry. Mb3338 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3V0" /db_xref="InterPro:IPR007312" /db_xref="InterPro:IPR017850" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V0" /protein_id="SIU01967.1" /translation="MLRGIQALSRPLTRVYRALAVIGVLAASLLASWVGAVPQVGLAA SALPTFAHVVIVVEENRSQAAIIGNKSAPFINSLAANGAMMAQAFAETHPSEPNYLAL FAGNTFGLTKNTCPVNGGALPNLGSELLSAGYTFMGFAEDLPAVGSTVCSAGKYARKH VPWVNFSNVPATLSVPFSAFPKPQNYPGLPTVSFVIPNADNDMHDGSIAQGDAWLNRH LSAYANWAKTNNSLLVVTWDEDDGSSRNQIPTVFYGAHVRPGTYNETISHYNVLSTLE QIYGLPKTGYATNAPPITDIWGD" CDS 3657056..3658318 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3339" /product="conserved protein" /note="Mb3339, -, len: 420 aa. Equivalent to Rv3311, len: 420 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 420 aa overlap). Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical proteins Q9CCL8|ML0703 (423 aa), FASTA scores: opt: 2185, E(): 5.5e-120, (77.55% identity in 423 aa overlap); Q49918|L308_F2_61 (167 aa), FASTA scores: opt: 929, E(): 3.5e-47, (84.4% identity in 167 aa overlap) (similarity at C-terminus for this one); and Q49914|L308_F1_17 (166 aa), FASTA scores: opt: 900, E(): 1.7e-45, (79.0% identity in 162 aa overlap) (similarity at N-terminus for this one); Q49923|U0308N (86 aa) FASTA scores: opt: 149, E(): 0.052, (48.35% identity in 60 aa overlap); etc. Note that the Rv3311 corresponding protein in Mycobacterium leprae is similar to products of two adjacent ORFs. Also some similarity to Q9XI61|F9L1.1 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (523 aa), FASTA scores: opt: 134, E(): 1.8, (25.1% identity in 203 aa overlap). Equivalent to AAK47753 from Mycobacterium tuberculosis strain CDC1551 (431 aa) but shorter 12 aa. Protein product from Mb3339 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3339 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y410" /protein_id="SIU01968.1" /translation="MVADLVPIRLSLSAGDRYTLWAPRWRDAGDEWEAFLGKDDDLYG FESVSDLVAFVRTDTENDLVDHPAWQDLTGAHAHNLNPAEDNQFDLVVVEELLAEKPT AESVAALAASLAIVSAIGSVCELAAVSKFFNGNPILGTVSGGLEHFTGKAGNKRWNSI AEVIGRSWDDVLAAIDEIISTPEVDAELSEKVAEELAEEPEGAEEVAAEVEATQDTQE AAESDDEEADAPGDSVVLGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGR NGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGL VDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSV GKPTAPYAAAVREWEKLERFVESRLRRE" CDS complement(3658339..3659265) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3340C" /product="Hydrolase, alpha/beta fold family" /note="Mb3340c, -, len: 308 aa. Equivalent to Rv3312c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Hypothetical protein, similar to various proteins (principally hypothetical unknowns or hydrolases) e.g. Q9M9P2|T17B22.7 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (326 aa), FASTA scores: opt: 261, E(): 2.6e-09, (27.55% identity in 323 aa overlap); Q9FWB6 PUTATIVE ALPHA/BETA HYDROLASE from Oryza sativa (Rice) (354 aa), FASTA scores: opt: 241, E(): 4.9e-08, (28.9% identity in 301 aa overlap) (note that Q9FWB6 correspond to Q9FWB5 PUTATIVE ALPHA/BETA HYDROLASE (353 aa) but longer 1 aa; and to Q9AUW9 HYPOTHETICAL PROTEIN (332 aa) but longer 22 aa); Q9M382|F24B22.200 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (342 aa), FASTA scores: opt: 222, E(): 8e-07, (27.6% identity in 319 aa overlap); Q9HWM9|PA4152 PROBABLE HYDROLASE from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 176, E(): 0.00071, (29.2% identity in 209 aa overlap); Q9L3R2 HYDROLASE from Rhizobium leguminosarum (261 aa), FASTA scores: opt: 174, E(): 0.00071, (28.9% identity in 173 aa overlap); P49323|PRXC_STRLI|CPO|CPOL NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) from Streptomyces lividans (275 aa), FASTA scores: opt: 172, E(): 0.001, (30.9% identity in 194 aa overlap) (similarity only at N-terminus for this one); etc. Some similarity in N-terminal part to non-heme chloroperoxidases. Also similar to O05293|Rv1191|MTCI364.03 HYPOTHETICAL PROTEIN from M. tuberculosis (304 aa), FASTA scores: opt: 417, E(): 3.1e-19, (32.6% identity in 279 aa overlap) (note that Rv1191 is equivalent to AAK45485 from Mycobacterium tuberculosis strain CDC1551 but shorter 14 aa, and that AAK45485 is annoted Hydrolase, alpha/beta hydrolase family). Mb3340c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T5" /protein_id="SIU01969.1" /translation="MTGPPPSLPERIRTDEADVLMLPDGRALAYLEWGDSTGYPAFYF HGTPSSRLEGAFADGAARRTGFRLIAIDRPGYGRSTFQAGRNFRDWPADVCALADAFE LEEFGVVGHSGAGPHLFACGAVIPRTRLAFVGALGPWGPLATPDIMRSLNAADRCYAR LARSGPRLFGALFAPLGWCAKYTPGLFSTLLAAAVPAADKHLLSDERFGRHLRAIQLE AFRQGSRGAAYESFLQFRPWGFDLAEVAVPTHIWLGDRDSFVPRAMGEYLQRAIPHVD LHWAHGKGHFNIEDWDAILAACALDIGKRRGG" CDS complement(3659640..3659951) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3341C" /product="SECRETED PROTEIN ANTIGEN" /note="Mb3341c, -, len: 103 aa. Equivalent to Rv3312A, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Secreted protein antigen, described in Corixa patent as having N-terminal sequence YYWCPGQPFDPAWGP. Equivalent to AAK47756 from Mycobacterium tuberculosis strain CDC1551 (114 aa) but shorter 11 aa. Mb3341c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWR6" /db_xref="UniProtKB/Swiss-Prot:Q7TWR6" /protein_id="SIU01970.1" /translation="MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPG QPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGG A" CDS complement(3660022..3661119) /codon_start=1 /transl_table=11 /gene="add" /locus_tag="BQ2027_MB3342C" /product="PROBABLE ADENOSINE DEAMINASE ADD (ADENOSINE AMINOHYDROLASE)" /note="Mb3342c, add, len: 365 aa. Equivalent to Rv3313c, len: 365 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 365 aa overlap). Probable add, adenosine deaminase (EC 3.5.4.4), equivalent to Q9CCL9|ADD|ML0700 PUTATIVE ADENOSINE DEAMINASE from Mycobacterium leprae (362 aa), FASTA scores: opt: 2097, E(): 1.4e-127, (88.2% identity in 356 aa overlap) . Also similar to many e.g. Q9AK25|2SCK8.27 from Streptomyces coelicolor (396 aa), FASTA scores: opt: 1578, E(): 3.7e-94, (66.65% identity in 360 aa overlap); Q17747|C06G3.5 from Caenorhabditis elegans (349 aa), FASTA scores: opt: 435, E(): 1.1e-20, (29.6% identity in 348 aa overlap); P22333|ADD_ECOLI|B1623 from Escherichia coli strain K12 (333 aa), FASTA scores: opt: 380, E(): 3.7e-17, (29.4% identity in 340 aa overlap); etc. BELONGS TO THE ADENOSINE AND AMP DEAMINASES FAMILY. Protein product from Mb3342c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3342c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63908" /db_xref="InterPro:IPR001365" /db_xref="InterPro:IPR006330" /db_xref="InterPro:IPR028893" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:P63908" /protein_id="SIU01971.1" /translation="MTAAPTLQTIRLAPKALLHDHLDGGLRPATVLDIAGQVGYDDLP ATDVDALASWFRTQSHSGSLERYLEPFSHTVAVMQTPEALYRVAFECAQDLAADSVVY AEVRFAPELHISCGLSFDDVVDTVLTGFAAGEKACAADGQPITVRCLVTAMRHAAMSR EIAELAIRFRDKGVVGFDIAGAEAGHPPTRHLDAFEYMRDHNARFTIHAGEAFGLPSI HEAIAFCGADRLGHGVRIVDDIDVDADGGFQLGRLAAILRDKRIPLELCPSSNVQTGA VASIAEHPFDLLARARFRVTVNTDNRLMSDTSMSLEMHRLVEAFGYGWSDLARFTVNA MKSAFIPFDQRLAIIDEVIKPRFAALMGHSE" CDS complement(3661119..3662402) /codon_start=1 /transl_table=11 /gene="deoA" /locus_tag="BQ2027_MB3343C" /product="PROBABLE THYMIDINE PHOSPHORYLASE DEOA (TDRPASE) (PYRIMIDINE PHOSPHORYLASE)" /note="Mb3343c, deoA, len: 427 aa. Equivalent to Rv3314c, len: 427 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 427 aa overlap). Probable deoA, thymidine phosporylase (EC 2.4.2.4), highly similar to many e.g. Q9AK36|DEOA from Streptomyces coelicolor (427 aa), FASTA scores: opt: 1668, E(): 3.2e-90, (62.35% identity in 425 aa overlap); Q9CFM5|PDP from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (430 aa), FASTA scores: opt: 1031, E(): 5.5e-53, (46.45% identity in 392 aa overlap); P19971|TYPH_HUMAN|ECGF1 from Homo sapiens (Human) (482 aa), FASTA scores: opt: 957, E(): 1.3e-48, (44.45% identity in 441 aa overlap); P07650|TYPH_ECOLI|DEOA|TPP|TTG|B4382 from Escherichia coli strain K12 (440 aa), FASTA scores: opt: 847, E(): 3.2e-42, (41.55% identity in 438 aa overlap); etc. Contains PS00647 Thymidine and pyrimidine-nucleoside phosphorylases signature. BELONGS TO THE THYMIDINE/PYRIMIDINE-NUCLEOSIDE PHOSPHORYLASES FAMILY. Protein product from Mb3343c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3343c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S3" /db_xref="InterPro:IPR000053" /db_xref="InterPro:IPR000312" /db_xref="InterPro:IPR013102" /db_xref="InterPro:IPR017459" /db_xref="InterPro:IPR017872" /db_xref="InterPro:IPR018090" /db_xref="InterPro:IPR035902" /db_xref="InterPro:IPR036320" /db_xref="InterPro:IPR036566" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S3" /protein_id="SIU01972.1" /translation="MTDFAFDAPTVIRTKRDGGRLSDAAIDWVVKAYTDGRVADEQMS ALLMAIVWRGMDRGEIARWTAAMLASGARLDFTDLPLATVDKHSTGGVGDKITLPLVP VVAACGGAVPQASGRGLGHTGGTLDKLESITGFTANLSNQRVREQLCDVGAAIFAAGQ LAPADAKLYALRDITGTVESLPLIASSIMSKKLAEGAGALVLDVKVGSGAFMRSPVQA RELAHTMVELGAAHGVPTRALLTEMNCPLGRTVGNALEVAEALEVLAGGGPPDVVELT LRLAGEMLELAGIHGRDPAQTLRDGTAMDRFRWLVAAQGGDLSKPLPIGSHSETVTAG ASGTMGDIDAMAVGLAAWRLGAGRSRPGARVQHGAGVRIHRRPGEPVVVGEPLFTLYT NAPERFGAARAELAGGWSIRDSPPQVRPLIVDRIV" CDS complement(3662399..3662800) /codon_start=1 /transl_table=11 /gene="cdd" /locus_tag="BQ2027_MB3344C" /product="PROBABLE CYTIDINE DEAMINASE CDD (CYTIDINE AMINOHYDROLASE) (CYTIDINE NUCLEOSIDE DEAMINASE)" /note="Mb3344c, cdd, len: 133 aa. Equivalent to Rv3315c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Probable cdd, cytidine deaminase (EC 3.5.4.5), equivalent to Q9CBD3|CDD|ML2174 CYTIDINE DEAMINASE from Mycobacterium leprae (134 aa), FASTA scores: opt: 516, E(): 5.8e-28, (56.8% identity in 132 aa overlap). Also highly similar to many e.g. Q9AK37|2SCK8.15 from Streptomyces coelicolor (130 aa), FASTA scores: opt: 523, E(): 1.9e-28, (60.0% identity in 130 aa overlap); Q9KD53|CDD|BH1366 from Bacillus halodurans (132 aa), FASTA scores: opt: 305, E(): 9.2e-14, (41.55% identity in 130 aa overlap); P56389|CDD_MOUSE|CDA|CDD from Mus musculus (Mouse) (146 aa), FASTA scores: opt: 287, E(): 1.6e-12, (40.3% identity in 124 aa overlap); P19079|CDD_BACSU (136 aa), FASTA scores: opt: 270, E(): 2.1e-11, (28.6% identity in 127 aa overlap); etc. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. COFACTOR: ZINC (BY SIMILARITY). Protein product from Mb3344c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3344c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S6" /db_xref="InterPro:IPR002125" /db_xref="InterPro:IPR016193" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S6" /protein_id="SIU01973.1" /translation="MPDVDWNMLRGNATQAAAGAYVPYSRFAVGAAALVDDGRVVTGC NVENVSYGLTLCAECAVVCALHSTGGGRLLALACVDGHGSVLMPCGRCRQVLLEHGGS ELLIDHPVRPRRLGDLLPDAFGLDDLPRERR" CDS 3663037..3663375 /codon_start=1 /transl_table=11 /gene="sdhC" /locus_tag="BQ2027_MB3345" /product="PROBABLE SUCCINATE DEHYDROGENASE (CYTOCHROME B-556 SUBUNIT) SDHC (SUCCINIC DEHYDROGENASE) (FUMARATE REDUCTASE) (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb3345, sdhC, len: 112 aa. Equivalent to Rv3316, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Probable sdhC, cytochrome B-556 of succinate dehydrogenase SdhC subunit (EC 1.3.99.1), transmembrane protein, equivalent (but shorter 35 aa) to Q9CCM0|SDHC|ML0699 PUTATIVE SUCCINATE DEHYDROGENASE CYTOCHROME B-556 SUBUNIT from Mycobacterium leprae (153 aa), FASTA scores: opt: 692, E(): 1.2e-39, (88.4% identity in 112 aa overlap). Also similar to others e.g. Q9KZ88|SC5G8.26c from Streptomyces coelicolor (126 aa), FASTA scores: opt: 484, E(): 8.3e-26, (65.65% identity in 99 aa overlap); Q9RVR8|DR0954 from Deinococcus radiodurans (118 aa), FASTA scores: opt: 195, E(): 1.7e-06, (36.8% identity in 87 aa overlap); Q9HQ63|DHSD_HALN1|SDHD|SDHC|VNG1310G from Halobacterium sp. strain NRC-1 (130 aa), FASTA scores: opt: 192, E(): 2.9e-06, (37.85% identity in 74 aa overlap); P72109|DHSD_NATPH|SDHD|SDHC from Natronomonas pharaonis (Natronobacterium pharaonis) (130 aa), FASTA scores: opt: 183, E(): 1.1e-05, (35.15% identity in 74 aa overlap); etc. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. BELONGS TO THE CYTOCHROME B560 FAMILY. Protein product from Mb3345 detected using SWATH mass spectrometry. Mb3345 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S5" /db_xref="InterPro:IPR000701" /db_xref="InterPro:IPR014314" /db_xref="InterPro:IPR034804" /db_xref="InterPro:IPR039023" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S5" /protein_id="SIU01974.1" /translation="MWSWVCHRISGATIFFFLFVHVLDAAMLRVSPQTYNAVLATYKT PIVGLMEYGLVAAVLFHALNGIRVILIDFWSEGPRYQRLMLWIIGSVFLLLMVPAGVV VGIHMWEHFR" CDS 3663372..3663806 /codon_start=1 /transl_table=11 /gene="sdhD" /locus_tag="BQ2027_MB3346" /product="PROBABLE SUCCINATE DEHYDROGENASE (HYDROPHOBIC MEMBRANE ANCHOR SUBUNIT) SDHD (SUCCINIC DEHYDROGENASE) (FUMARATE REDUCTASE) (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb3346, sdhD, len: 144 aa. Equivalent to Rv3317, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 144 aa overlap). Probable sdhD, membrane anchor of succinate dehydrogenase SdhD subunit (EC 1.3.99.1), equivalent (but shorter 19 aa) to Q49915|SDHD|ML0698|L308_F1_25 PUTATIVE SUCCINATE DEHYDROGENASE HYDROPHOBIC MEMBRANE ANCHOR PROTEIN from Mycobacterium leprae (163 aa), FASTA scores: opt: 878, E(): 1.9e-51, (85.2% identity in 142 aa overlap). Also similar to others e.g. Q9KZ89|SC5G8.25c from Streptomyces coelicolor (160 aa), FASTA scores: opt: 553, E(): 6.6e-30, (58.85% identity in 141 aa overlap); Q9RVR9|DR0953 from Deinococcus radiodurans (125 aa), FASTA scores: opt: 251, E(): 5.5e-10, (37.15% identity in 113 aa overlap); O29573|DHSD_ARCFU|SDHD|AF0684 from Archaeoglobus fulgidus (117 aa), FASTA scores: opt: 160, E(): 0.00056, (25.95% identity in 108 aa overlap); etc. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. Protein product from Mb3346 detected using shotgun mass spectrometry. Mb3346 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5K3" /db_xref="InterPro:IPR000701" /db_xref="InterPro:IPR034804" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K3" /protein_id="SIU01975.1" /translation="MSAPVRQRSHDRPASLDNPRSPRRRAGMPNFEKFAWLFMRFSGV VLVFLAIGHLFIMLMWDNGVYRLDFNFVAQRWASPFWQTWDLLLLWLAQLHGGNGLRT IIDDYSRKDTTRFWLNSLLVLSMLFTLMLGTYVIVTFDPNIS" CDS 3663935..3665707 /codon_start=1 /transl_table=11 /gene="sdhA" /locus_tag="BQ2027_MB3347" /product="PROBABLE SUCCINATE DEHYDROGENASE (FLAVOPROTEIN SUBUNIT) SDHA (SUCCINIC DEHYDROGENASE) (FUMARATE REDUCTASE) (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb3347, sdhA, len: 590 aa. Equivalent to Rv3318, len: 590 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 590 aa overlap). Probable sdhA, flavoprotein of succinate dehydrogenase SdhA subunit (EC 1.3.99.1), equivalent to Q9CCM1|SDHA|ML0697 SUCCINATE DEHYDROGENASE FLAVOPROTEIN SUBUNIT from Mycobacterium leprae (584 aa), FASTA scores: opt: 3657, E(): 1.2e-217, (92.55% identity in 590 aa overlap). Also highly similar to others e.g. Q9KZ90|DHSA from Streptomyces coelicolor (584 aa), FASTA scores: opt: 2813, E(): 1.1e-165, (70.5% identity in 586 aa overlap); Q9RVS0|DR0952 from Deinococcus radiodurans (583 aa), FASTA scores: opt: 2203, E(): 4.1e-128, (57.35% identity in 593 aa overlap); P31038|DHSA_RICPR|SDHA|RP128 from Rickettsia prowazekii (596 aa), FASTA scores: opt: 1892, E(): 5.8e-109, (50.0% identity in 588 aa overlap); P10444|DHSA_ECOLI|SDHA|B0723|Z0877|ECS0748 from Escherichia coli strains K12 and O157:H7 (588 aa), FASTA scores: opt: 1844, E(): 5.2e-106, (48.75% identity in 591 aa overlap); etc. Contains PS00504 Fumarate reductase / succinate dehydrogenase FAD-binding site. COFACTOR: FAD. SIMILAR TO THE FLAVOPROTEIN SUBUNITS OF OTHER SPECIES SUCCINATE DEHYDROGENASE AND OF FUMARATE REDUCTASE. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. Protein product from Mb3347 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3347 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4I8" /db_xref="InterPro:IPR003952" /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR011281" /db_xref="InterPro:IPR014006" /db_xref="InterPro:IPR015939" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR037099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I8" /protein_id="SIU01976.1" /translation="MICQHRYDVVIVGAGGAGMRAAVEAGPRVRTAVLTKLYPTRSHT GAAQGGMCAALANVEDDNWEWHTFDTVKGGDYLADQDAVEIMCKEAIDAVLDLEKMGM PFNRTPEGRIDQRRFGGHTRDHGKAPVRRACYAADRTGHMILQTLYQNCVKHDVEFFN EFYALDLALTQTPSGPVATGVIAYELATGDIHVFHAKAVVIATGGSGRMYKTTSNAHT LTGDGIGIVFRKGLPLEDMEFHQFHPTGLAGLGILISEAVRGEGGRLLNGEGERFMER YAPTIVDLAPRDIVARSMVLEVLEGRGAGPLKDYVYIDVRHLGEEVLEAKLPDITEFA RTYLGVDPVTELVPVYPTCHYLMGGIPTTVTGQVLRDNTSVVPGLYAAGECACVSVHG ANRLGTNSLLDINVFGRRAGIAAASYAQGHDFVDMPPNPEAMVVGWVSDILSEHGNER VADIRGALQQSMDNNAAVFRTEETLKQALTDIHALKERYSRITVHDKGKRFNTDLLEA IELGFLLELAEVTVVGALNRKESRGGHAREDYPNRDDVNYMRHTMAYKEIGADKEGPE LRSDVRLDFKPVVQTRYEPKERKY" CDS 3665707..3666498 /codon_start=1 /transl_table=11 /gene="sdhB" /locus_tag="BQ2027_MB3348" /product="PROBABLE SUCCINATE DEHYDROGENASE (IRON-SULPHUR PROTEIN SUBUNIT) SDHB (SUCCINIC DEHYDROGENASE) (FUMARATE REDUCTASE) (FUMARATE DEHYDROGENASE) (FUMARIC HYDROGENASE)" /note="Mb3348, sdhB, len: 263 aa. Equivalent to Rv3319, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 263 aa overlap). Probable sdhB, iron-sulphur protein succinate dehydrogenase SdhB subunit (EC 1.3.99.1), equivalent to Q49916|SDHB|ML0696|L308_F1_28 SUCCINATE DEHYDROGENASE IRON-SULFUR PROTEIN from Mycobacterium leprae (264 aa), FASTA scores: opt: 1678, E(): 4.7e-99, (89.8% identity in 264 aa overlap). Also highly similar to other e.g. Q9KZ91|DHSB from Streptomyces coelicolor (257 aa), FASTA scores: opt: 1125, E(): 4.6e-64, (64.1% identity in 262 aa overlap); Q9RVS1|DR0951 from Deinococcus radiodurans (264 aa), FASTA scores: opt: 1014, E(): 5e-57, (57.25% identity in 255 aa overlap); Q9PEF5|XF1073 from Xylella fastidiosa (261 aa), FASTA scores: opt: 681, E(): 5.8e-36, (45.1% identity in 244 aa overlap); P07014|DHSB_ECOLI|SDHB|B0724 from Escherichia coli strain K12 (238 aa), FASTA scores: opt: 657, E(): 1.8e-34, (43.75% identity in 240 aa overlap); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. COFACTOR: BINDS THREE DIFFERENT IRON-SULFUR CLUSTERS: A 2FE-2S, A 3FE-4S AND A 4FE-4S. THE IRON-SULFUR CENTERS ARE SIMILAR TO THOSE OF 'PLANT-TYPE' 2FE-2S AND 'BACTERIAL-TYPE' 4FE-4S FERREDOXINS. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. Protein product from Mb3348 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3348 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3V7" /db_xref="InterPro:IPR004489" /db_xref="InterPro:IPR009051" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017896" /db_xref="InterPro:IPR017900" /db_xref="InterPro:IPR025192" /db_xref="InterPro:IPR036010" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V7" /protein_id="SIU01977.1" /translation="MSVEPDVETLDPPLPPVPDGAVMVTVKIARFNPDEPDAFAATGG WQSFRVPCLPSDRLLNLLIYIKGYLDGTLTFRRSCAHGVCGSDAMRINGVNRLACKVL MRDLLPKKKGKSLTVTVEPIRGLPVEKDLVVDMEPFFDAYRAIKPYLITSGNPPTRER IQSPTDRARYDDTTKCILCACCTTSCPVFWHEGSYFGPAAIVNAHRFIFDSRDEAAAE RLDILNEVDGVWRCRTTFNCTESCPRGIEVTKAIQEVKRALMFTR" CDS complement(3666577..3667005) /codon_start=1 /transl_table=11 /gene="vapc44" /locus_tag="BQ2027_MB3349C" /product="possible toxin vapc44. contains pin domain." /note="Mb3349c, -, len: 142 aa. Equivalent to Rv3320c, len: 142 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 142 aa overlap). Conserved hypothetical protein, similar to several hypothetical proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. P95023|Rv2530c|MTCY159.26 (139 aa), FASTA scores: opt: 292, E(): 4.8e-14, (41.5% identity in 135 aa overlap); O53219|Rv2494|MTV008.50 (141 aa), FASTA scores: opt: 287, E(): 1.1e-13, (41.6% identity in 125 aa overlap); O07760|Rv0617|MTCY19H5.04c (133 aa), FASTA scores: opt: 252, E(): 3.3e-11, (37.8% identity in 127 aa overlap); etc. Mb3349c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y419" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y419" /protein_id="SIU01978.1" /translation="MRALLDVNVLLALLDRDHVDHERARAWITGQIERGWASCAITQN GFVRVISQPRYPSPISVAHAIDLLARATHTRYHEFWSCTVSILDSKVIDRSRLHSPKQ VTDAYLLALAVAHDGRFVTFDQSIALTAVPGATKQHLATL" CDS complement(3667009..3667251) /codon_start=1 /transl_table=11 /gene="vapb44" /locus_tag="BQ2027_MB3350C" /product="possible antitoxin vapb44" /note="Mb3350c, -, len: 80 aa. Equivalent to Rv3321c, len: 80 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 80 aa overlap). Conserved hypothetical protein, similar at N-terminal region to several proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. AAK48167|MT3800 DNA-BINDING PROTEIN (COPG FAMILY) from strain CDC1551 (74 aa), FASTA scores: opt: 142, E(): 0.0016, (48.85% identity in 43 aa overlap); AAK46916|MT2606 HYPOTHETICAL 8.0 KDA PROTEIN from strain CDC1551 (74 aa), FASTA scores: opt: 139, E(): 0.0026, (37.2% identity in 78 aa overlap); O50456|Rv1241|MTV006.13 HYPOTHETICAL 9.9 KDA PROTEIN from strain H37Rv (86 aa), FASTA scores: opt: 134, E(): 0.0066, (39.0% identity in 82 aa overlap); etc. Protein product from Mb3350c detected using SWATH mass spectrometry. Mb3350c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3U2" /protein_id="SIU01979.1" /translation="MRTTLSIDDDVLLAVKERARREKRTAGEILSDLARQALTNQNPQ PAASQEDAFHGFEPLPHRGGAVSNALIDRLRDEEAV" CDS complement(3667373..3667987) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3351C" /product="POSSIBLE METHYLTRANSFERASE" /note="Mb3351c, -, len: 204 aa. Equivalent to Rv3322c, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 204 aa overlap). Conserved hypothetical protein, showing weak similarity to proteins including several methyltransferases (EC 2.1.1.-) e.g. Q9X9V1|ORF8 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (208 aa), FASTA scores: opt: 193, E(): 1e-05, (36.35% identity in 132 aa overlap); and Q9XA90|SCF43A.25c PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (215 aa), FASTA scores: opt: 161, E(): 0.0014, (32.05% identity in 131 aa overlap); P74712|SLR1183 HYPOTHETICAL 21.3 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (194 aa), FASTA scores: opt: 155, E(): 0.0032, (27.35% identity in 150 aa overlap); Q9ABW8|CC0102 RRNA METHYLTRANSFERASE RSMB from Caulobacter crescentus (429 aa), FASTA scores: opt: 148, E(): 0.018, (31.5% identity in 162 aa overlap); etc. Also highly similar to O05796|Rv3120|MTCY164.30 HYPOTHETICAL 21.8 KDA PROTEIN from Mycobacterium tuberculosis (200 aa), FASTA scores: opt: 691, E(): 1.2e-38, (56.5% identity in 200 aa overlap); and shows weak similarity to O69667|Rv3699|MTV025.047 PUTATIVE METHYLTRANSFERASE from Mycobacterium tuberculosis (233 aa), FASTA scores: opt: 155, E(): 0.0037, (29.15% identity in 168 aa overlap). Protein product from Mb3351c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3351c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3S8" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S8" /protein_id="SIU01980.1" /translation="MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRA GVPDGPVLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNLVQ ADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALSGAEAGTASAK RRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSPLPGA" CDS complement(3667984..3668649) /codon_start=1 /transl_table=11 /gene="moaX" /locus_tag="BQ2027_MB3352C" /product="PROBABLE MOAD-MOAE FUSION PROTEIN MOAX" /note="Mb3352c, moaX, len: 221 aa. Equivalent to Rv3323c, len: 221 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 221 aa overlap). Probable moaX, MoaD-MoaE fusion protein, similar (whole or partial) to several MoaD and MoaE proteins e.g. Q9RR88|DR2607 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D/E from Deinococcus radiodurans (229 aa), FASTA scores: opt: 407, E(): 1.8e-18, (32.75% identity in 223 aa overlap); Q9K8I7|MOAE|BH3019 MOLYBDOPTERIN CONVERTING FACTOR (SUBUNIT 2) from Bacillus halodurans (156 aa), FASTA scores: opt: 375, E(): 1.3e-16, (41.65% identity in 132 aa overlap); O31705|MOAE MOLYBDOPTERIN CONVERTING FACTOR (SUBUNIT 2) from Bacillus subtilis (157 aa), FASTA scores: opt: 368, E(): 3.6e-16, (41.65% identity in 132 aa overlap); etc. C-terminus highly similar to O05795|MOAE_MYCTU|Rv3119|MT3201|MTCY164.29|MOAE1 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN E from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 733, E(): 5.4e-39, (76.2% identity in 143 aa overlap); and N-terminus highly similar to O05789|MOAD1|Rv3112|MTCY164.22 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D from Mycobacterium tuberculosis (83 aa), FASTA scores: opt: 333, E(): 3.2e-14, (65.05% identity in 83 aa overlap). Protein product from Mb3352c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3352c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3S9" /db_xref="InterPro:IPR003448" /db_xref="InterPro:IPR003749" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR016155" /db_xref="InterPro:IPR036563" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3S9" /protein_id="SIU01981.1" /translation="MITVNVLYFGAVREACKVAHEKISLESGTTVDGLVDQLQIDYPP LADFRKRVRMAVNESIAPASTILDDGDTVAFIPQVAGGSDVYCRLTDEPLSVDEVLNA ISGPSQGGAVIFVGTVRNNNNGHEVTKLYYEAYPAMVHRTLMDIIEECERQADGVRVA VAHRTGELRIGDAAVVIGASAPHRAAAFDAARMCIERLKQDVPIWKKEFALDGVEWVA NRP" CDS complement(3668650..3669183) /codon_start=1 /transl_table=11 /gene="moaC3" /locus_tag="BQ2027_MB3353C" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C 3 MOAC3" /note="Mb3353c, moaC3, len: 177 aa. Equivalent to Rv3324c, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap). Probable moaC3, molybdopterin cofactor biosynthesis protein, highly similar to others e.g. Q9HX95|MOAC|PA3918 from Pseudomonas aeruginosa (160 aa), FASTA scores: opt: 567, E(): 7.5e-30, (58.35% identity in 156 aa overlap); Q9RKA8|MOAC from Streptomyces coelicolor (170 aa), FASTA scores: opt: 553, E(): 6.3e-29, (58.25% identity in 158 aa overlap); P30747|MOAC_ECOLI|CHLA3|B0783 from Escherichia coli strain K12 (160 aa), FASTA scores: opt: 516, E(): 1.5e-26, (55.95% identity in 159 aa overlap); etc. Also highly similar to O05788|MOAC1|Rv3111|MTCY164.21 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN C from Mycobacterium tuberculosis (170 aa), FASTA scores: opt: 734, E(): 1.3e-40, (71.8% identity in 163 aa overlap); and Rv0864|MOAC2|MTV043.57 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN (167 aa). Protein product from Mb3353c detected using SWATH mass spectrometry. Mb3353c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65393" /db_xref="InterPro:IPR002820" /db_xref="InterPro:IPR023045" /db_xref="InterPro:IPR036522" /db_xref="UniProtKB/Swiss-Prot:P65393" /protein_id="SIU01982.1" /translation="MNDHDGVLTHLDEQGAARMVDVSAKAVTLRRARASGAVLMKPST LDMICHGTAAKGDVIATARIAGIMAAKRTGELIPLCHPLGIEAVTVTLEPQGADRLSI AATVTTVARTGVEMEALTAVTVTALTVYDMCKAVDRAMTITDIRLDEKSGGRSGHYRR HDADVKPSDGGSTEDGC" mobile_element 3668986..3670071 /mobile_element_type="insertion sequence:IS1547" /locus_tag="BQ2027_IS1547-2" /note="IS1547-2, len: 1086 nt. Equivalent to IS1547, len: 1086 nt, from Mycobacterium tuberculosis strain H37Rv,(99.8% identity in 1086 nt overlap). Region corresponding to IS1547, positions 1982 3067 in EM_NEW:MTY13470." CDS complement(3669180..3669554) /codon_start=1 /transl_table=11 /gene="moaB3" /locus_tag="BQ2027_MB3354C" /product="PROBABLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE MOAB3 (PHS) (4-ALPHA-HYDROXY-TETRAHYDROPTERIN DEHYDRATASE) (PTERIN-4-A-CARBINOLAMINE DEHYDRATASE) (PHENYLALANINE HYDROXYLASE-STIMULATING PROTEIN) (PHS) (PTERIN CARBINOLAMINE DEHYDRATASE) (PCD)" /note="Mb3354c, moaB3, len: 124 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but equivalent to MT3426, len: 124 aa, from Mycobacterium tuberculosis strain CDC1551, (100.000% identity in 124 aa overlap). Probable moaB3, pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), similar to others e.g. MOAB1|Rv3110|MT3193|MTCY164.20 PROBABLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis strain H37Rv (142 aa), FASTA scores: opt: 383, E(): 1.2e-21, (50.000% identity in 110 aa overlap); Q96YL0|STS230 PUTATIVE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Sulfolobus tokodaii (93 aa), FASTA scores: opt: 208, E(): 1.5e-08, (37.500% identity in 88 aa overlap); Q8YNL6|ASR4549 PTERIN-4A-CARBINOLAMINE DEHYDRATASE from Anabaena sp. strain PCC 7120 (93 aa), FASTA scores: opt: 179, E(): 2.2e-06, (37.363% identity in 91 aa overlap); P73790|PHS_SYNY3|SSL2296 PUTATIVE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Synechocystis sp. strain PCC 6803 (96 aa), FASTA scores: opt: 175, E(): 4.6e-06, (33.7% identity in 92 aa overlap); etc. BELONGS TO THE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE FAMILY. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb3354c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3T7" /db_xref="InterPro:IPR001533" /db_xref="InterPro:IPR036428" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3T7" /protein_id="SIU01983.1" /translation="MAHGNVSRCEESSLHDVCCGRLSALTDRELSERLTALPGWELVD GKLRHTFGFGSFDQSMKFVAKIAAIADKFNHHPDICVHNKRSVRLTCWTRQMHCLTRV DFDLAEAFSAVHDEQCSQQVAR" CDS complement(3669651..3670715) /codon_start=1 /transl_table=11 /gene="moaA3" /locus_tag="BQ2027_MB3355C" /product="PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A MOAA1" /note="Mb3355c, moaA3, len: 354 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but equivalent to MT3427|MOAA3, len: 378 aa, from Mycobacterium tuberculosis strain CDC1551, (100.000% identity in 354 aa overlap). Probable moaA3, molybdenum cofactor biosynthesis protein, similar to others e.g. O05786|MOA1_MYCTU|MOAA1|MOAA|Rv3109|MT3192|MTCY164.19 PROBABLE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A from Mycobacterium tuberculosis strain H37Rv (359 aa), FASTA scores: opt: 1762, E(): 3.7e-108, (74.3% identity in 350 aa overlap); Q99S04|MOAA|SA2063 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A from Staphylococcus aureus strain N315 (340 aa), FASTA scores: opt: 819, E(): 3.2e-46, (38.25% identity in 324 aa overlap); O67929|MOAA_AQUAE|AQ_2183 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A from Aquifex aeolicus (320 aa), FASTA scores: opt: 815, E(): 5.6e-46, (41.8% identity in 323 aa overlap); etc. BELONGS TO THE MOAA/NIFB/PQQE FAMILY. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb3355c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P62588" /db_xref="InterPro:IPR000385" /db_xref="InterPro:IPR006638" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR010505" /db_xref="InterPro:IPR013483" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:P62588" /protein_id="SIU01984.1" /translation="MSRSGLCINESPIRDRCGRTMGDLRLSVIDQCNLRCRYCMPEAE YAWLPRADLLSVDEISLIVDAFIAVGVDKIRLTGGEPLIRSDLAAIIEVISAKVGDGS GLQDLAITTNGVLLADQARKLKSAGMRRITISLDTLRPDRFKAISQRGTHYKVIEGIE AVAAAGFTDTKLDSVVIRGFNDDELSDLIEFARNVNAEVRFIEYMDVGGATQWSMDKV FTKAQMLSTLGKKYGPIAALPKYDSAPANRYRLPDGTTFGIIASTTEPFCATCDRSRL TADGIWLHCLYALSGINLRESVRAGASANDVVQILQRGWRDRANRGAEQRLAQRTRQV FLPVSRLKRDPHLEMHTRGG" CDS 3670770..3671120 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3356" /product="HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb3356, -, len: 116 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but similar to 5' end of a hypothetical CDS, len: 237 aa, from Mycobacterium tuberculosis strain F4, (100.0% identity in 109 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L8" /protein_id="SIU01985.1" /translation="MKNPSHWIAPPTRPPTAMGDARHPRRDLGHRALSGAGRGDGARE DHRNTFRVHTGTTCLLLIRALSGVHVHYCSTPASMESQCARRTTGFGGACGSGKATAT RWPFIHDGDPNSRQ" CDS 3671098..3671484 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3357" /product="HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb3357, -, len: 128 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but similar to 3' end of a hypothetical CDS, len: 237 aa, from Mycobacterium tuberculosis strain F4, (100.0% identity in 109 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb3357 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J5" /protein_id="SIU01986.1" /translation="MTRIPDSDVEKPGSSGSVSSTGPEVAARRHGRRCGCDRNRSPPR SKSVTLDTRHVGGAGILRIRAFPIRPWRPVNRSSNGPATDEDCDGPKRTPVRYSRTAR DKRRFPQSTVGHTEATCGLGDADHSS" CDS 3671507..3672505 /codon_start=1 /transl_table=11 /gene="embR2" /locus_tag="BQ2027_MB3358" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN EMBR2" /note="Mb3358, embR2, len: 332 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but equivalent to MT3428, len: 381 aa, from Mycobacterium tuberculosis strain CDC1551, (100.000% identity in 381 aa overlap). Possible embR2, transcriptional regulatory protein, highly similar to other mycobacteria EmbR proteins e.g. Q11052|YC67_MYCTU|Rv1267c|MT1305|MTCY50.15|EMBR PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN EMBR from Mycobacterium tuberculosis (388 aa), FASTA scores: opt: 1420, E(): 7.3e-84, (57.3% identity in 370 aa overlap); P71484|EMBR EMBR PROTEIN from Mycobacterium avium (384 aa) (see citation below), FASTA scores: opt: 1338, E(): 1.4e-78, (53.65% identity in 384 aa overlap); and also O05797|Rv3124|MT3208|MTCY164.34 PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN from Mycobacterium tuberculosis (311 aa), FASTA scores: opt: 928, E(): 3e-52, (53.75% identity in 255 aa overlap). Also similar to other transcriptional regulatory proteins from other organisms e.g. Q9XCC3|TYLT HYPOTHETICAL PATHWAY SPECIFIC REGULATORY PROTEIN from Streptomyces fradiae (404 aa), FASTA scores: opt: 506, E(): 5.7e-25, (38.153% identity in 249 aa overlap); etc. BELONGS TO THE AFSR/DNRI/REDD FAMILY OF REGULATORS. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb3358 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3W7" /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR005158" /db_xref="InterPro:IPR008984" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR016032" /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W7" /protein_id="SIU01987.1" /translation="MIDAAWEQDRPEGSRATVYTYVSNLRRLVSTTGADSHSILASAP PGYRLAVADNQYDVARFISQRSAGLRAAAAGSFEQASDHLSAALAEWRGPVLDDLREF SFVTRLANSLVEDKIIAHTALAEAEIACGRADSVISELEELILEHPYHEALWRQLIAA YYVSERQSDALDAYRRLKTSLAEDLGVDPGPKVRTLYEQVLRQQALDTRVVVQAAAGD IIRALEHSPGMTDRSPRAAIRDAAGHRSPLGRLPLRIGRSKSNDMVLPDGKVSPYHAV IVNTGESFMITDLRSVNGVYVRGRRIATTATLNDGDHIRIGDHELTFEVIPHESGR" CDS complement(3672600..3673244) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3359C" /product="HYPOTHETICAL PROTEIN" /note="Mb3359c, -, len: 214 aa. No equivalent in Mycobacterium tuberculosis strain H37Rv, but similar to 3' end of MT3429, len: 172 aa, from Mycobacterium tuberculosis strain CDC1551, (100% identity in 172 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: Belongs to the RvD5 region. Absent in Mycobacterium tuberculosis strain H37Rv. Mb3359c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3Y435" /protein_id="SIU01988.1" /translation="MSQTPGDPEQTTATRRLSHRHTHLAAHTTPTLRRKGPPFRAEMG CFCVCSAQVQEVAKNSLRGVPESVVMSYSYFVELPRLEDIEPGAHTDVLIANSRVDQG RIRAAVEAVFDAHPALGTVFEPRVDTLTSRPGGGGWGWGVEPPGAAVAEVIARHSASF DMYTGRLFAVSLLPGSPDRLVLTASRLCVDDASWQTVVEDLVRQYDESVLVPAR" gene 3673399..3674484 /locus_tag="BQ2027_IS1547-2" CDS 3673411..3675123 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3360" /product="PROBABLE TRANSPOSASE FUSION PROTEIN" /note="Mb3360, -, len: 570 aa. Equivalent to Rv3327, len: 570 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 570 aa overlap). Probable fusion protein. Indeed, N-terminal part corresponds to entire O07269 transposase of IS1547 (383 aa), and C-terminal part identical to MTCI249B.03c (210 aa). N-terminal part is identical to MTV042_7 (188 aa); C-terminal part (aa 378-570) is similar to hypothetical 20.5 KD protein from Escherichia coli P76222|YNJA_ECOLI (182 aa), FASTA scores: opt: 292, E(): 5.3e-11, (32.6% identity in 181 aa overlap). Protein product from Mb3360 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y3W4" /db_xref="InterPro:IPR002525" /db_xref="InterPro:IPR003346" /db_xref="InterPro:IPR029032" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W4" /protein_id="SIU01989.1" /translation="MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWA REQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDPID ALAVARAVLRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPER APAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQ VAPALLEIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLS RSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQA LRTVHQPSSEHTQPAAACHRSYCSSHLGEPPRLTDMTQKTRIQPLPPKRAGLLIRALY RIAKRRFGEVPEPFTVTAHHRRLLIANVVHEALLQRASRKLPPSVRELAVFWTARSIG CSWCVDFGAMLQRLDGLDVDRLTDIDNYATSSKFSDDERAAIAYAEAMTADPHSVTDE QVADLRARFGEAGVIELTYQIGVENMRARMNSALGITEQGFNSGDACRVPWAAPDVPS AESR" CDS complement(3675056..3675994) /codon_start=1 /transl_table=11 /gene="sigJ" /locus_tag="BQ2027_MB3361C" /product="PROBABLE ALTERNATIVE RNA POLYMERASE SIGMA FACTOR (FRAGMENT) SIGJ" /note="Mb3361c, sigJ, len: 312 aa. Equivalent to Rv3328c, len: 312 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 312 aa overlap). Probable sigJ, alternative RNA polymerase sigma factor (see citation below), highly similar to many e.g. Q9K3H7|2SCG18.10c from Streptomyces coelicolor (295 aa), FASTA scores: opt: 642, E(): 7.3e-31, (42.8% identity in 292 aa overlap); Q9A3D8|CC3266 from Caulobacter crescentus (291 aa), FASTA scores: opt: 607, E(): 8.4e-29, (39.8% identity in 294 aa overlap); Q9RD74|SCF43.14c from Streptomyces coelicolor (324 aa), FASTA scores: opt: 555, E(): 1.1e-25, (41.1% identity in 297 aa overlap); etc. Similar also to U00022_20 from Mycobacterium leprae; and MTCI28_22 and MSU87307_1. Also similar to O50445|SIGI|Rv1189|MTV005.25|MTCI364.01 PUTATIVE RNA POLYMERASE SIGMA FACTOR from Mycobacterium tuberculosis (290 aa), FASTA scores: opt: 426, E(): 4.2e-18, (32.65% identity in 294 aa overlap). Equivalent to AAK47774 from Mycobacterium tuberculosis strain CDC1551 (282 aa) but longer 30 aa. Contains probable helix-turn-helix motif at aa 129-150 (Score 1126, +3.02 SD). BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. Protein product from Mb3361c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3361c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3W3" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W3" /protein_id="SIU01990.1" /translation="MEVSEFEALRQHLMSVAYRLTGTVADAEDIVQEAWLRWDSQDTV IADPRAWLTTVVSRLGLDKLRSAAHRRETYTGTWLPEPVVTGLDATDPLAAVVAAEDA RFAAMVVLERLRPDQRVAFVLHDGFAVPFAEVAEVLGTSEAAARQLASRARKAVTAQP ALISGDPDPAHNEVVGRLMAAMAAGDLDTVVSLLHPDVTFTGDSNGKAPTAVRAVRGS DKVVRFILGLVQRYGPGLFGANQLALVNGELGAYTAGLPGVDGYRAMAPRITAITVRD GKVCALWDIANPDKFTGSPLKERRAQPTGRGRHHRN" CDS 3676054..3677370 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3362" /product="PROBABLE AMINOTRANSFERASE" /note="Mb3362, -, len: 438 aa. Equivalent to Rv3329, len: 438 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 438 aa overlap). Probable aminotransferase (EC 2.6.1.-), similar to many e.g. O86744|SC6A9.12 from Streptomyces coelicolor (457 aa), FASTA scores: opt: 2120, E(): 5.1e-125, (70.1% identity in 438 aa overlap); Q9I6J2|PA0299 from Pseudomonas aeruginosa (456 aa), FASTA scores: opt: 983, E(): 5.7e-54, (38.1% identity in 425 aa overlap); Q53196|Y4UB_RHISN from Rhizobium sp. strain NGR234 plasmid sym pNGR234a (467 aa), FASTA scores: opt: 971, E(): 3.3e-53, (39.25% identity in 438 aa overlap); P33189|YHXA_BACSU from Bacillus subtilis (450 aa), FASTA scores: opt: 933, E(): 7.5e-51, (40.25% identity in 435 aa overlap); etc. Equivalent to AAK47775 from Mycobacterium tuberculosis strain CDC1551 (466 aa) but shorter 28 aa. COFACTOR: PYRIDOXAL PHOSPHATE. COULD BELONG TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Start uncertain. Protein product from Mb3362 detected using SWATH mass spectrometry. Mb3362 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3V6" /db_xref="InterPro:IPR005814" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V6" /protein_id="SIU01991.1" /translation="MHFARHGAGIQHPVIVRGDGVTIFDDRGKSYLDALSGLFVVQVG YGRAELAEAAARQAGTLGYFPLWGYATPPAIELAERLARYAPGDLNRVFFTSGGTEAV ETAWKVAKQYFKLTGKPGKHKVISRSIAYHGTTQGALAITGLPLFKAPFEPLTPGGFR VPNTNFYRAPLHTDLKEFGRWAADRIAEAIEFEGPDTVAAVFLEPVQNAGGCIPAPPG YFERVREICDRYDVLLVSDEVICAFGRIGSMFACEDLGYVPDMITCAKGLTSGYSPLG AMIASDRLFEPFNDGETMFAHGYTFGGHPVSAAVGLANLDIFEREGLSDHVKRNSPAL RATLEKLYDLPIVGDIRGEGYFFGIELVKDQATKQTFTDDERARLLGQVSAALFEAGL YCRTDDRGDPVVQVAPPLISGQPEFDTIETILRSVLTDTGRKYLHL" CDS 3677439..3678656 /codon_start=1 /transl_table=11 /gene="dacB1" /locus_tag="BQ2027_MB3363" /product="probable penicillin-binding protein dacb1 (d-alanyl-d-alanine carboxypeptidase) (dd-peptidase) (dd-carboxypeptidase) (pbp) (dd-transpeptidase) (serine-type d-ala-d-ala carboxypeptidase) (d-amino acid hydrolase)" /note="Mb3363, dacB1, len: 405 aa. Equivalent to Rv3330, len: 405 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 405 aa overlap). Probable dacB1, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein) (EC 3.4.16.4), equivalent to Mycobacterium leprae proteins Q9CCM2|ML0691 PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (411 aa), FASTA scores: opt: 2066, E(): 2.5e-102, (77.15% identity in 416 aa overlap); Q49917|L308_F1_36 (228 aa), FASTA scores: opt: 1241, E(): 7.9e-59, (78.9% identity in 232 aa overlap) (note that this protein corresponds to C-terminal part of the putative protein encoded by Rv3330, aa 174-405); and Q49921|PBPC (182 aa), FASTA scores: opt: 736, E(): 3.7e-32, (73.95% identity in 169 aa overlap) (note that this protein corresponds to N-terminal part of the putative protein encoded by Rv3330, aa 1-158); note L308_F1_36 (228 aa) and PBPC (182 aa) are two consecutive Mycobacterium leprae ORFs . Also similar to others e.g. Q9FC34|SC4G1.16c PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Streptomyces coelicolor (413 aa), FASTA scores: opt: 572, E(): 3.4e-23, (33.75% identity in 382 aa overlap); P35150|DACB_BACSU PENICILLIN-BINDING PROTEIN 5* PRECURSOR (D-ALANYL-D-ALANINE CARBOXYPEPTIDASE) from Bacillus subtilis (382 aa), FASTA scores: opt: 422, E(): 2.8e-15, (31.3% identity in 249 aa overlap); Q9K8X5|DACB|BH2877 D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (PENICILLIN-BINDING PROTEIN) from Bacillus halodurans (395 aa), FASTA scores: opt: 421, E(): 3.2e-15, (31.95% identity in 241 aa overlap); etc. Also similar to M. tuberculosis Q10828|Rv2911|MTCY274.43 PROBABLE PENICILLIN-BINDING PROTEIN (BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY) (291 aa), FASTA scores: opt: 746, E(): 1.6e-32, (47.0% identity in 266 aa overlap). Has hydrophobic stretches at both N- and C-termini. Certainly membrane-bound protein. BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY. Protein product from Mb3363 detected using SWATH mass spectrometry. Mb3363 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3V5" /db_xref="InterPro:IPR001967" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR018044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V5" /protein_id="SIU01992.1" /translation="MAFLRSVSCLAAAVFAVGTGIGLPTAAGEPNAAPAACPYKVSTP PAVDSSEVPAAGEPPLPLVVPPTPVGGNALGGCGIITAPGSAPAPGDVSAEAWLVADL DSGAVIAARDPHGRHRPASVIKVLVAMASINTLTLNKSVAGTADDAAVEGTKVGVNTG GTYTVNQLLHGLLMHSGNDAAYALARQLGGMPAALEKINLLAAKLGGRDTRVATPSGL DGPGMSTSAYDIGLFYRYAWQNPVFADIVATRTFDFPGHGDHPGYELENDNQLLYNYP GALGGKTGYTDDAGQTFVGAANRDGRRLMTVLLHGTRQPIPPWEQAAHLLDYGFNTPA GTQIGTLIEPDPSLMSTDRNPADRQRVDPQAAARISAADALPVRVGVAVIGALIVFGL IMVARAMNRRPQH" CDS 3678752..3680260 /codon_start=1 /transl_table=11 /gene="sugI" /locus_tag="BQ2027_MB3364" /product="PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN SUGI" /note="Mb3364, sugI, len: 502 aa. Equivalent to Rv3331, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 502 aa overlap). Probable sugI, sugar-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), similar to several transporters e.g. P37021|GALP_ECOLI|B2943 GALACTOSE-PROTON SYMPORTER (GALACTOSE TRANSPORTER) from Escherichia coli strain K12 (464 aa), FASTA scores: opt: 818, E(): 1.8e-39, (31.85% identity in 446 aa overlap); P96742|YWTG METABOLITE-TRANSPORT-RELATED PROTEIN from Bacillus subtilis (457 aa), FASTA scores: opt: 810, E(): 5e-39, (33.2% identity in 428 aa overlap); AAG58074|GALP (alias BAB37242|ECS3819) GALACTOSE-PROTON SYMPORT OF TRANSPORT SYSTEM from Escherichia coli strain O157:H7 EDL933 (464 aa), FASTA scores: opt: 810, E(): 5.1e-39, (32.2% identity in 432 aa overlap); P46333|CSBC_BACSU|SS92BR PROBABLE METABOLITE TRANSPORT PROTEIN from Bacillus subtilis (461 aa), FASTA scores: opt: 792, E(): 5.4e-38, (33.7% identity in 442 aa overlap); etc. Equivalent to AAK47777|MT343 from Mycobacterium tuberculosis strain CDC1551 (500 aa) but with some divergence between residues 229 and 254. Contains PS00216 Sugar transport proteins signature 1 and PS00217 Sugar transport proteins signature 2. BELONGS TO THE SUGAR TRANSPORTER FAMILY. Mb3364 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3V4" /db_xref="InterPro:IPR003663" /db_xref="InterPro:IPR005828" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3V4" /protein_id="SIU01993.1" /translation="MTTLWQPHRNDYSPIPGRGVHARRGARRPRPRGGRAERPGTGQL TRSGRRALLVGLTAASVGVLYGYDLSAIAGALLSLSEEFELTTREQELLTTTAVLGQI AGALGGGILANAIGRKKSVVLIVAGYAVFALLGATSVSVPMLVVARLLLGVTIGLSVV VVPVYVAESAPAAVRGSLVTAYQLATLSGIVVGYLVGYLLAGSHGWRAMFGLAAAPAT LLLPLLWRMPDTARWYLLKGRIADARSALRRIQPEADIDAELADMAAAVDERGGGIGE MVRRPYLRATLFVIALGFLVQITGINAIIYYSPRLFAAMGFAGYFAMLALPAMVQVAG LAAVCASLFLVDRLGRRPILLSGIATMITADAVLITVFANDSDGGTGLVLGFAGVLLF IIGFNFGFGSLVWVYAAESFPSRLRSMGSSLMLTSTLTANAIVAAFSLTMLRVLGGAG VFAVFGTFAVVAFVVVYRFAPETKGRKLEEIRHFWENGGRWPAERSPAADEP" CDS 3680257..3681408 /codon_start=1 /transl_table=11 /gene="nagA" /locus_tag="BQ2027_MB3365" /product="PROBABLE N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE NAGA (GLCNAC 6-P DEACETYLASE)" /note="Mb3365, nagA, len: 383 aa. Equivalent to Rv3332, len: 383 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 383 aa overlap). Probable nagA, N-acetylglucosamine-6-phosphate deacetylase (EC 3.5.1.25), similar to many e.g. Q9KXV7|SCD95A.17c PUTATIVE DEACETYLASE from Streptomyces coelicolor (381 aa), FASTA scores: opt: 1090, E(): 1.6e-55, (47.8% identity in 385 aa overlap); Q9PDB4|XF1465 N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Xylella fastidiosa (386 aa), FASTA scores: opt: 667, E(): 3.5e-31, (38.3% identity in 394 aa overlap); Q9AAZ9|CC0443 N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Caulobacter crescentus (378 aa), FASTA scores: opt: 661, E(): 7.5e-31, (38.9% identity in 383 aa overlap); O34450||NAGA_BACSU N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Bacillus subtilis (396 aa), FASTA scores: opt: 571, E(): 1.2e-25, (32.45% identity in 376 aa overlap); etc. Equivalent to AAK47778 from Mycobacterium tuberculosis strain CDC1551 (346 aa) but longer 37 aa. BELONGS TO THE NAGA FAMILY. Protein product from Mb3365 detected using SWATH mass spectrometry. Mb3365 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3W5" /db_xref="InterPro:IPR003764" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR011059" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W5" /protein_id="SIU01994.1" /translation="MTVLGADAVVIDGRICRPGWVHTADGRILSGGAGAPPMPADAEF PDAIVVPGFVDMHVHGGGGASFADGNAADIARAAEFHLRHGTTTTLASLVTAGPAELL SAVGALAEATRDGVVAGIHLEGPWLSPARCGAHDHTRMRAPDPAEIESVLAAADGAVR MVTLAPELPGSDAAIRRFRDAEVVVAVGHTDATYTQTRHAIDLGATVGTHLFNAMPPL DHRAPGPVLALLCDPRVTVEIIADGVHVHPAVVHAVIEAVGPDRVAVVTDAIAAAGCG DGAFRLGTMPIEVESSVARVAGASTLAGSTTTMDQLFRTVAGLGSKSDSAGDVALAAA VQVTSATPARALGLTGVGRLAAGYAANLVVLDRDLRVTAVMVNDDWRVG" CDS complement(3681599..3682444) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3366C" /product="HYPOTHETICAL PROLINE RICH PROTEIN" /note="Mb3366c, -, len: 281 aa. Equivalent to Rv3333c, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 281 aa overlap). Hypothetical unknown pro-rich protein. Equivalent to AAK47780 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa) but longer 16 aa." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M9" /protein_id="SIU01995.1" /translation="MFTGIASHAGALGAALVVLIGAAILHDGPAAADPNQDDRFLALL EKKEIPAVANVPRVIDAAHKVCRKLDGGMPVNDIVDGLRNDAYNIDPVMRLYPVRLTT TMTRFISAAVEIYCPNHHSKMAFAMANFEPGSNEPTHRVAASTRSAVNSGSDLRASVS DMTIMSPGWREPTGAMLASVLGAVRAGDPLIPNPPPIPVPPPAAQTLIPPPPIVAPPP PRPAPPQQPPPPPPEVEPPAGVPQSGGAAGSGGAGSGGGGGGDGPVEPSPARPMPPGF IRLAP" CDS 3682919..3683359 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3367" /product="PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY MERR-FAMILY)" /note="Mb3367, -, len: 146 aa. Equivalent to Rv3334, len: 146 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 146 aa overlap). Probable transcriptional regulator, similar to many regulatory proteins (notably mercury resistance operon regulators) e.g. Q9HXV1|PA3689 PROBABLE TRANSCRIPTIONAL REGULATOR MERR FAMILY from Pseudomonas aeruginosa (156 aa), FASTA scores: opt: 275, E(): 1.6e-11, (35.95% identity in 139 aa overlap); Q9AKR6|PBRR LEAD RESISTANCE OPERON REGULATOR from Ralstonia metallidurans strain CH34 (plasmid pMOL30) (145 aa), FASTA scores: opt: 267, E(): 5.2e-11, (35.8% identity in 134 aa overlap); P95838|MERR MERCURIC RESISTANCE OPERON REGULATOR from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (144 aa), FASTA scores: opt: 266, E(): 6e-11, (31.35% identity in 118 aa overlap); P22853|MERR_BACSR MERCURIC RESISTANCE OPERON REGULATOR from Bacillus sp. strain RC607 (132 aa), FASTA scores: opt: 262, E(): 1e-10, (34.6% identity in 130 aa overlap); etc. Contains probable helix-turn-helix motif at aa 1-22 (Score 1478, +4.22 SD). SEEMS TO BELONG TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3367 detected using SWATH mass spectrometry. Mb3367 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4L2" /db_xref="InterPro:IPR000551" /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR015358" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L2" /protein_id="SIU01996.1" /translation="MKISEVAALTNTSTKTLRFYENSGLLPPPARTASGYRNYGPEIV DRLRFIHRGQAAGLALQEVRQILAIHDRGEAPCAHVRQLLSTRIDEVRAQIAELIALE GHLQTLLDHASYGPPTEHDHSTVCWILESDLDEPTAIEVSDIHA" CDS complement(3683393..3684262) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3368C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3368c, -, len: 289 aa. Equivalent to Rv3335c, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 289 aa overlap). Probable conserved integral membrane protein, equivalent to Q49909|ML0687 PUTATIVE MEMBRANE PROTEIN U0308AA from Mycobacterium leprae (313 aa), FASTA scores: opt: 1299, E(): 8.9e-75, (68.75% identity in 288 aa overlap). Also similar to other hypothetical bacterial proteins e.g. BAB37825|ECS4402 from Escherichia coli strain O157:H7 (alias P37642|YHJD_ECOLI|B3522 strain K12) (337 aa), FASTA scores: opt: 591, E(): 4.2e-30, (35.15% identity in 273 aa overlap); P45417|YHJD_ERWCH from Erwinia chrysanthemi (328 aa), FASTA scores: opt: 500, E(): 2.2e-24, (34.9% identity in 275 aa overlap); Q9KZA0|SC5G8.14 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (321 aa), FASTA scores: opt: 321, E(): 4.3e-13, (27.3% identity in 271 aa overlap); etc. Mb3368c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3X1" /db_xref="InterPro:IPR005274" /db_xref="InterPro:IPR017039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X1" /protein_id="SIU01997.1" /translation="MGELAEPGVLDRLRARFGWLDHVVRAFTRFNDRNGSLFAAGLTY YTIFAIFPLLMVGFGVGGFALSRRPELLTTLEERIRTSVSGAVGQQLVDLMNSAIDAR ASVGVIGLATAAWVGLGWMWHLREALSQMWAHPVAPAGYLRTKLSDLAAMVGTFVVIV ATIALTVLGHARPMAAVLRWLEIPQFSVFDEIFRGISVLVSVLVSWVLFTWMIGRLPR EPVGLVTAARAGLMAAVGFELFKQVGAIYLQIVLRSPAGAVFGPVLGLMVFAFVTAWL ILFATAWAATASA" CDS complement(3684283..3685293) /codon_start=1 /transl_table=11 /gene="trpS" /locus_tag="BQ2027_MB3369C" /product="PROBABLE TRYPTOPHANYL-TRNA SYNTHETASE TRPS (TRYPTOPHAN--TRNA LIGASE) (TRPRS) (TRYPTOPHAN TRANSLASE)" /note="Mb3369c, trpS, len: 336 aa. Equivalent to Rv3336c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Probable trpS, tryptophanyl-tRNA synthetase (EC 6.1.1.2), equivalent to Q49901|SYW_MYCLE|TRPS|ML0686|L308_C1_147 TRYPTOPHANYL-TRNA SYNTHETASE from Mycobacterium leprae (343 aa), FASTA scores: opt: 1859, E(): 4.8e-107, (83.75% identity in 339 aa overlap). Also similar to many e.g. Q9KZA7|TRPS2 from Streptomyces coelicolor (339 aa), FASTA scores: opt: 1359, E(): 2.6e-76, (60.3% identity in 335 aa overlap); Q9EYY6|TRPS from Klebsiella aerogenes (334 aa), FASTA scores: opt: 1077, E(): 5.5e-59, (52.15% identity in 328 aa overlap); P00954|SYW_ECOLI|TRPS|B3384 from Escherichia coli strain K12 (334 aa), FASTA scores: opt: 1074, E(): 8.3e-59, (51.85% identity in 328 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb3369c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3369c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67591" /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002305" /db_xref="InterPro:IPR002306" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR024109" /db_xref="UniProtKB/Swiss-Prot:P67591" /protein_id="SIU01998.1" /translation="MSTPTGSRRIFSGVQPTSDSLHLGNALGAVAQWVGLQDDHDAFF CVVDLHAITIPQDPEALRRRTLITAAQYLALGIDPGRATIFVQSQVPAHTQLAWVLGC FTGFGQASRMTQFKDKSARQGSEATTVGLFTYPVLQAADVLAYDTELVPVGEDQRQHL ELARDVAQRFNSRFPGTLVVPDVLIPKMTAKIYDLQDPTSKMSKSAGTDAGLINLLDD PALSAKKIRSAVTDSERDIRYDPDVKPGVSNLLNIQSAVTGTDIDVLVDGYAGHGYGD LKKDTAEAVVEFVNPIQARVDELTADPAELEAVLAAGAQRAHDVASKTVQRVYDRLGF LL" CDS 3685318..3686211 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3370" /product="Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)" /note="Mb3370, -, len: 297 aa. Equivalent to Rv3337 and Rv3338, len: 128 aa and 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap and 100.0% identity in 214 aa overlap). Rv3337: Conserved hypothetical protein, equivalent to N-terminus of Q49926|ML0685 TPEA (PUTATIVE HYDROLASE) from Mycobacterium leprae (303 aa), FASTA scores: *****opt: 362, E(): 3e-17, (74.3% identity in 70 aa overlap). Also weak similarity in N-terminus to BAB49078|MLR1789 PROBABLE EPOXIDE HYDROLASE from Rhizobium loti (Mesorhizobium loti) (300 aa), FASTA scores: opt: 116, E(): 0.74, (38.9% identity in 54 aa overlap). Homology suggests this ORF should be in frame with the following ORF MTV016.38 but no sequence error could be found. Short distance to start of trpS suggests region may not be protein-coding. C-terminus extended since first submission (+47 aa). Rv3338: Hypothetical protein, equivalent to C-termini of Q49926|ML0685 TPEA (PUTATIVE HYDROLASE) from Mycobacterium leprae (303 aa), FASTA scores: opt: 984, E(): 2.6e-56, (65.4% identity in 214 aa overlap); and O32873|MLCB1779.02 HYPOTHETICAL 31.8 KDA PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Mycobacterium leprae (292 aa), FASTA scores: opt: 984, E(): 2.5e-56, (65.4% identity in 214 aa overlap). Also similar to C-termini of several hypothetical proteins (generally hydrolases) e.g. Q9K3H6|2SCG18.11 PUTATIVE HYDROLASE from Streptomyces coelicolor (316 aa), FASTA scores: opt: 213, E(): 1.4e-06, (29.75% identity in 185 aa overlap). Homology suggests that this ORF should be in frame with the previous ORF MTV016.37 but no sequence error could be found. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3337 and Rv3338 exist as 2 genes with an overlap region between them. In Mycobacterium bovis, a single base insertion (*-t) leads to a single product. Protein product from Mb3370 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3370 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Y4" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y4" /protein_id="SIU01999.1" /translation="MPSPSTTGHHAACGTGGTGFSVGSMRSPIRVGSGEPVLLLHPFL MSQTVWEKVAQQLADTGRFEVFAPTMAGHNGGPASGTRFLSSAVLADHVERQLDELGW ETSHIVGNSLGGWVAFELERRGRARSVTGIAPAGGWTRWSPVKFEVIAKFIAGAPILA VAHILGQRALRLPFSRLLATLPISATPDGVSERELSGIIDDAAHCPAYFQLLVKALVL PGLQELEHTAVPSHVVLCEQDRVVPPSRFSRHFTDSLPAGHRLTVLDGVGHVPMFEAP GRITELITSFIEECCPHVRAS" CDS complement(3686278..3687507) /codon_start=1 /transl_table=11 /gene="icd1" /locus_tag="BQ2027_MB3371C" /product="PROBABLE ISOCITRATE DEHYDROGENASE [NADP] ICD1 (OXALOSUCCINATE DECARBOXYLASE) (IDH) (NADP+-SPECIFIC ICDH) (IDP)" /note="Mb3371c, icd1, len: 409 aa. Equivalent to Rv3339c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 409 aa overlap). Probable icd1, isocitrate dehydrogenase NADP-dependant (EC 1.1.1.42), highly similar to many e.g. Q9A5C8|CC2522 from Caulobacter crescentus (403 aa), FASTA scores: opt: 1972, E(): 4.6e-115, (72.45% identity in 403 aa overlap); AAF73472|ICD from Rhizobium meliloti (404 aa), FASTA scores: opt: 1968, E(): 8.1e-115, (73.2% identity in 403 aa overlap); P50215|IDH_SPHYA from Sphingomonas yanoikuyae (406 aa), FASTA scores: opt: 1964, E(): 1.4e-114, (71.45% identity in 403 aa overlap); etc. Contains PS00470 Isocitrate and isopropylmalate dehydrogenases signature. BELONGS TO THE ISOCITRATE AND ISOPROPYLMALATE DEHYDROGENASES FAMILY. Protein product from Mb3371c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3371c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65098" /db_xref="InterPro:IPR004790" /db_xref="InterPro:IPR019818" /db_xref="InterPro:IPR024084" /db_xref="UniProtKB/Swiss-Prot:P65098" /protein_id="SIU02000.1" /translation="MSNAPKIKVSGPVVELDGDEMTRVIWKLIKDMLILPYLDIRLDY YDLGIEHRDATDDQVTIDAAYAIKKHGVGVKCATITPDEARVEEFNLKKMWLSPNGTI RNILGGTIFREPIVISNVPRLVPGWTKPIVIGRHAFGDQYRATNFKVDQPGTVTLTFT PADGSAPIVHEMVSIPEDGGVVLGMYNFKESIRDFARASFSYGLNAKWPVYLSTKNTI LKAYDGMFKDEFERVYEEEFKAQFEAAGLTYEHRLIDDMVAACLKWEGGYVWACKNYD GDVQSDTVAQGYGSLGLMTSVLMTADGKTVEAEAAHGTVTRHYRQYQAGKPTSTNPIA SIFAWTRGLQHRGKLDGTPEVIDFAHKLESVVIATVESGKMTKDLAILIGPEQDWLNS EEFLDAIADNLEKELAN" CDS 3687790..3689139 /codon_start=1 /transl_table=11 /gene="metC" /locus_tag="BQ2027_MB3372" /product="PROBABLE O-ACETYLHOMOSERINE SULFHYDRYLASE METC (HOMOCYSTEINE SYNTHASE) (O-ACETYLHOMOSERINE (THIOL)-LYASE) (OAH SULFHYDRYLASE) (O-ACETYL-L-HOMOSERINE SULFHYDRYLASE)" /note="Mb3372, metC, len: 449 aa. Equivalent to Rv3340, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Probable metC, O-acetyl-L-homoserine sulfhydrylase (EC 4.2.99.10), highly similar to many e.g. Q9K9P2|BH2603 O-ACETYLHOMOSERINE SULFHYDRYLASE from Bacillus halodurans (430 aa), FASTA scores: opt: 1716, E(): 3.3e-97, (60.45% identity in 425 aa overlap); Q9HUE4|METY|PA5025 HOMOCYSTEINE SYNTHASE from Pseudomonas aeruginosa (425 aa), FASTA scores: opt: 1517, E(): 4.4e-85, (56.95% identity in 425 aa overlap); Q9WZY4|TM0882 O-ACETYLHOMOSERINE SULFHYDRYLASE from Thermotoga maritima (430 aa), FASTA scores: opt: 1488, E(): 2.6e-83, (55.75% identity in 418 aa overlap); BAB54344|MLR8465 O-ACETYLHOMOSERINE SULFHYDRYLASE from Rhizobium loti (Mesorhizobium loti) (426 aa), FASTA scores: opt: 1445, E(): 1.1e-80, (53.2% identity in 419 aa overlap); P50125|CYSD_EMENI O-ACETYLHOMOSERINE (THIOL)-LYASE from Emericella nidulans (Aspergillus nidulans) (437 aa), FASTA scores: opt: 1442, E(): 1.7e-80, (53.7% identity in 430 aa overlap); etc. Contains PS00868 Cys/Met metabolism enzymes pyridoxal-phosphate attachment site. COFACTOR: PYRIDOXAL PHOSPHATE. BELONGS TO THE TRANS-SULFURATION ENZYMES FAMILY. Protein product from Mb3372 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3372 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3W1" /db_xref="InterPro:IPR000277" /db_xref="InterPro:IPR006235" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3W1" /protein_id="SIU02001.1" /translation="MSADSNSTDADPTAHWSFETKQIHAGQHPDPTTNARALPIYATT SYTFDDTAHAAALFGLEIPGNIYTRIGNPTTDVVEQRIAALEGGVAALFLSSGQAAET FAILNLAGAGDHIVSSPRLYGGTYNLFHYSLAKLGIEVSFVDDPDDLDTWQAAVRPNT KAFFAETISNPQIDLLDTPAVSEVAHRNGVPLIVDNTIATPYLIQPLAQGADIVVHSA TKYLGGHGAAIAGVIVDGGNFDWTQGRFPGFTTPDPSYHGVVFAELGPPAFALKARVQ LLRDYGSAASPFNAFLVAQGLETLSLRIERHVANAQRVAEFLAARDDVLSVNYAGLPS SPWHERAKRLAPKGTGAVLSFELAGGIEAGKAFVNALKLHSHVANIGDVRSLVIHPAS TTHAQLSPAEQLATGVSPGLVRLAVGIEGIDDILADLELGFAAARRFSADPQSVAAF" CDS 3689151..3690290 /codon_start=1 /transl_table=11 /gene="metA" /locus_tag="BQ2027_MB3373" /product="PROBABLE HOMOSERINE O-ACETYLTRANSFERASE META (HOMOSERINE O-TRANS-ACETYLASE) (HOMOSERINE TRANSACETYLASE) (HTA)" /note="Mb3373, metA, len: 379 aa. Equivalent to Rv3341, len: 379 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 379 aa overlap). Probable metA, homoserine o-acetyltransferase (EC 2.3.1.31), equivalent to O32874|METX_MYCLE|META|ML0682|MLCB1779.11 HOMOSERINE O-ACETYLTRANSFERASE from Mycobacterium leprae (382 aa), FASTA scores: opt: 2263, E(): 9.2e-129, (85.0% identity in 380 aa overlap). Also highly similar to many e.g. O68640|METX_CORGL|META from Corynebacterium glutamicum (Brevibacterium flavum) (379 aa), FASTA scores: opt: 1135, E(): 5.9e-61, (48.5% identity in 371 aa overlap); Q9AAS1|CC0525 from Caulobacter crescentus (382 aa), FASTA scores: opt: 860, E(): 2e-44, (40.5% identity in 363 aa overlap); P94891|METX_LEPME from Leptospira meyeri (379 aa), FASTA scores: opt: 787, E(): 4.9e-40, (38.2% identity in 385 aa overlap); etc. BELONGS TO THE AB HYDROLASE FAMILY, HTA SUBFAMILY. Protein product from Mb3373 detected using SWATH mass spectrometry. Mb3373 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5J9" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR008220" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0A5J9" /protein_id="SIU02002.1" /translation="MTISDVPTQTLPAEGEIGLIDVGSLQLESGAVIDDVCIAVQRWG KLSPARDNVVVVLHALTGDSHITGPAGPGHPTPGWWDGVAGPGAPIDTTRWCAVATNV LGGCRGSTGPSSLARDGKPWGSRFPLISIRDQVQADVAALAALGITEVAAVVGGSMGG ARALEWVVGYPDRVRAGLLLAVGARATADQIGTQTTQIAAIKADPDWQSGDYHETGRA PDAGLRLARRFAHLTYRGEIELDTRFANHNQGNEDPTAGGRYAVQSYLEHQGDKLLSR FDAGSYVILTEALNSHDVGRGRGGVSAALRACPVPVVVGGITSDRLYPLRLQQELADL LPGCAGLRVVESVYGHDGFLVETEAVGELIRQTLGLADREGACRR" CDS 3690287..3691018 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3374" /product="POSSIBLE METHYLTRANSFERASE (METHYLASE)" /note="Mb3374, -, len: 243 aa. Equivalent to Rv3342, len: 243 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 243 aa overlap). Possible methyltransferase (EC 2.1.1.-), similar to various proteins e.g. Q9I5X8|PA0558 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (255 aa), FASTA scores: opt: 496, E(): 4.4e-24, (39.85% identity in 236 aa overlap); Q9XBC9|CZA382.22c PUTATIVE RRNA METHYLASE from Amycolatopsis orientalis (259 aa), FASTA scores: opt: 473, E(): 1.2e-22, (42.45% identity in 245 aa overlap); Q9UTA8|SPAC25B8.10 PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (256 aa), FASTA scores: opt: 470, E(): 1.9e-22, (35.7% identity in 238 aa overlap); and Q9UTA9|SPAC25B8.09 PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (251 aa), FASTA scores: opt: 418, E(): 3.4e-19, (31.2% identity in 237 aa overlap); etc. Start uncertain. BELONGS TO THE METHYLTRANSFERASE SUPERFAMILY. Protein product from Mb3374 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3374 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65349" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P65349" /protein_id="SIU02003.1" /translation="MTCSRRDMSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLD LGAGTGKLTTRLVERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSVDA VLVAQAWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGEIIGRDGDPVR DRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCITSPAQVRTKTLDRVRQLLA THPALANSNGLALPYVTVCVRATLA" CDS complement(3691027..3695043) /codon_start=1 /transl_table=11 /gene="PPE54" /locus_tag="BQ2027_MB3375C" /product="ppe family protein ppe54" /note="Mb3375c, PPE54, len: 1338 aa. Similar to 3' end of Rv3343c, len: 2523 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 1151 aa overlap). Member of the Mycobacterium tuberculosis PPE family, MPTR subgroup of Gly-, Asn-rich proteins. Most similar to O50379|Rv3350c|MTV004.07c|MTV004_5 from Mycobacterium tuberculosis strain H37Rv (3716 aa), FASTA scores: opt: 4672, E(): 4e-211, (44.2% identity in 3174 aa overlap); and also similar to MTV004_3, MTCY63_9, MTY13E10_17, MTY13E10_16, MTCY180_1, MTV050_1, MTCY3C7_23, MTV014_3, MTCY63_10; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 3555 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X3" /protein_id="SIU02004.1" /translation="MSFVVMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAA FGSVTSGLVGGIWQGPSSVAMAAAAAPYAGWLSAAAASAESAAGQARAVVGVFEAALA ATVDPFVIAANRSRLVSLALSNLFGQNTPAIAAAEFDYELMWAQDVAAMLGYHTGASA AAEALAPFGSPLASLAAAAEPAKSLAVNLGLANVGLFNAGSGNVGSYNVGAGNVGSYN VGGGNIGGNNVGLGNVGFGNVGLANSGLTPGLMGLGNIGFGNAGSYNFGLANMGVGNI GFANTGSGNFGIGLTGDNLTGFGGFNTGSGNVGLFNSGTGNVGFFNSGTGNWGVFNSG SYNTGIGNSGIASTGLFNAGGFNTGVVNAGSYNTGSFNAGQANTGGFNPGSVNTGWLN TGDTNTGVANSGDVNTGAFISGNYSNGVLWRGDYQGLLGFSYRPAVLPQTPFLDLTLT GGLGSVVIPAIDIPAIRPEFSANVAIDSFTVPSIPIPQIDLAATTVSVGLGPITVPHL DIPRVPVTLNYLFGSQPGGPLKIGPITGLFNTPIGLTPLALSQIVIGASSSQGTITAF LANLPFSTPVVTIDEIPLLASITGHSEPVDIFPGGLTIPAMNPLSINLSGGTGAVTIP AITIGEIPFDLVAHSTLGPVHILIDLPAVPGFGNTTGAPSSGFFNSGAGGVSGFGNVG AMVSGGWNQAPSALLGGGSGVFNAGTLHSGVLNFGSGMSGLFNTSVLGLGAPALVSGL GSVGQQLSGLLASGTALHQGLVLNFGLADVGLGNVGLGNVGDFNLGAGNVGGFNVGGG NIGGNNVGLGNVGWGNFGLGNSGLTPGLMGLGNIGFGNAGSYNFGLANMGVGNIGFAN TGSGNFGIGLTGDNLTGFGGFNTGSGNVGLFNSGTGNVGFFNSGTGNWGVFNSGSYNT GIGNSGIASTGLFNAGGFNTGVVNAGSYNTGSFNAGQANTGGFNPGSVNTGWLNTGDT NTGVANSGDVNTGAFISGNYSNGAFWRGDYQGLLGFSYTSTIIPEFTVANIHASGGAG PIIVPSIQFPAIPLDLSATGHIGGFTIPPVSISPITVRIDPVFDLGPITVQDITIPAL GLDPATGVTVGPIFSSGSIIDPFSLTLLGFINVNVPAIQTAPSEILPFTVLLSSLGVT HLTPEITIPGFHIPVDPIHVELPLSVTIGPFVSPEITIPQLPLGLALSGATPAFAFPL EITIDRIPVVLDVNALLGPINAGLVIPPVPGFGNTTAVPSSGFFNIGGGGGLSGFHNL GAGMSGVLNAISDPLLGSASGFANFGTQLSGILNRGADISGVYNTGALGLITSALVSG FGNVGQQLAGLIYTGTGP" CDS complement(3695092..3699105) /codon_start=1 /transl_table=11 /gene="PE_PGRS50b" /locus_tag="BQ2027_MB3376C" /product="pe-pgrs family protein pe_pgrs49" /note="Mb3376c, PE_PGRS50b, len: 1337 aa. Equivalent to middle part of Rv3345c (PE_PGRS50) and Rv3344c (PE_PGRS49), len: 1538 aa and 484 aa, from Mycobacterium tuberculosis strain H37Rv, (81.45% identity in 992 aa overlap and 100.000% identity in 477 aa overlap). Rv3345c: Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to AAK47791 from strain CDC1551 but with some big gaps (after residues 501 and 1419; and for AAK47791 after residue 991). Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 4508, E(): 7e-161, (52.1% identity in 1529 aa overlap); MTV004_1, MTV023_21, MTV023_15, MTCY493_4, MTV039_16, MTV008_46, MTV023_14, MTV023_19, MTV043_26, MTCY493_2, MTCY441_4; etc. Rv3344c: Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-, ala-rich proteins. Appears to be a gene fragment, should be in-frame with following ORF, MTV016.45c, frameshift required around 49595 but could not be found on checking BAC and cosmid clones. Similar to many from M. tuberculosis strains H37Rv and CDC1551 e.g. O53557|Rv3512|MTV023.19 (1079 aa), FASTA scores: opt: 1595, E(): 1.8e-54, (52.0% identity in 544 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS50 exists as a single gene. In Mycobacterium bovis, a single base deletion (g-*) splits PE_PGRS50 into 2 parts, PE_PGRS50a and PE_PGRS50b. Also in Mycobacterium tuberculosis strain H37Rv, PE_PGRS49 and PE_PGRS50 exist as 2 genes. In Mycobacterium bovis, a single base deletion (c-*) leads to PE_PGRS49 and PE_PGRS50b existing as a single product." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N3" /protein_id="SIU02005.1" /translation="MTLAVNQGAGGDGGNGGEVGVGGKGGAGGVSANPALNGSAGANG TAPTSGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNGGAGGAGGNGVAGTGLALN GGNGGIGGNGGSAAGTGGDGGKGGNGGAGANGQDFSASANGANGGQGGNGGNGGIGGK GGDAFATFAKAGNGGAGGNGGAAGNGGGGAAGDVTLAINQGAGGAGGNGGNVGVAGQG GAGGKGAIPAMKGATGADGTAPTSGGDGGNGGNGASPTVAGGNGGDGGKGGSGGNVGN GGNGGAGGNGAAGQAGTPGPTSGDSGTSGTDGGAGGNGGAGGAGGTLAGHGGNGGKGV NGGQGGIGGAGERGADGAGPNANGANGENGGSGGNGGDGGAGGNGGAGGKAQAAGYTD GATGTGGDGGNGGDGGKAGDGGAGANGLNSGAMLPGGGTVGNPGTGGNGGNGGNAGVG GTGGKAGTGSLTGLDGTDGITPNGGNGGNGGNGGKGGTAGNGSGAAGGNGGNGGSGLN GGDAGNGGNGGGALNQAGFFGTGGKGGNGGNGGAGMINGGLGGFGGAGGGGAVDVAAT TGGAGGNGGAGGFASTGLGGPGGAGGPGGAGDFASGVGGVGGAGGDGGAGGVGGFGGQ GGIGGEGRTGGNGGSGGDGGGGISLGGQGGNGGFGGAGGNGGIGTDAGGAGGAGGAGG NGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGAGGAGGNAGVAGVSFGNAVGGDGGNGG NGGHGGDGTTGGAGGKGGNGSSGAASGSGVVNVTAGHGGNGGNGGNSGNSTGVAGLAG GAAGAGGNGGGTSSAAGHGGSGGNGGSGGSGGSGTTGGAGAAGGNGGAGAGGGSLSTG QSGGHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGDG VGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFG GTGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGG SGGNGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNP NGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDG GHGGDGVGGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVGTVA GGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGF GGNGGKGGQGGIGGGTQSATGLGGDGGDGGDGGNGGNSGAKAGGAGGKGQAGQPNSGT EPGFGGDGGLGGAGATP" CDS complement(3699102..3700721) /codon_start=1 /transl_table=11 /gene="PE_PGRS50a" /locus_tag="BQ2027_MB3377C" /product="PE-PGRS FAMILY PROTEIN [FIRST PART]" /note="Mb3377c, PE_PGRS50a, len: 539 aa. Equivalent to 5' end of Rv3345c, len: 1538 aa, from Mycobacterium tuberculosis strain H37Rv, (88.25% identity in 315 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to AAK47791 from strain CDC1551 but with some big gaps (after residues 501 and 1419; and for AAK47791 after residue 991). Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 4508, E(): 7e-161, (52.1% identity in 1529 aa overlap); MTV004_1, MTV023_21, MTV023_15, MTCY493_4, MTV039_16, MTV008_46, MTV023_14, MTV023_19, MTV043_26, MTCY493_2, MTCY441_4; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PE_PGRS50 exists as a single gene. In Mycobacterium bovis, a single base deletion (g-*) splits PE_PGRS50 into 2 parts, PE_PGRS50a and PE_PGRS50b." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L8" /protein_id="SIU02006.1" /translation="MVMSLMVAPELVAAAAADLTGIGQAISAANAAAAGPTTQVLAAA GDEVSAAIAALFGTHAQEYQALSARVATFHEQFVRSLTAAGSAYATAEAANASPLQAL EQQVLGAINAPTQLWLGRPLIGDGVHGAPGTGQPGGAGGLLWGNGGNGGSGAAGQVGG PGGAAGLFGNGGSGGSGGAGAAGGVGGSGGWLNGNGGAGGAGGTGANGGAGGNAWLFG AGGSGGAGTNGGVGGSGGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGPGPRAC RVPPASTAATAATAATAEPAATAGAAGYWLATAGPAGPAASAATVVRAALAIRVSPST TVPAVTAVTAATPAWAGPVGPAACWRVRTVPPAPPPPAAATAAMAASAPPPTHPYKPA GPAVMAVMAGWSATAAPAAPAVPVMRVPPALPVPPYNRRAVTAPMAAPGATAVMAEMA APSTATAASAARAVPAVAAAPAETDSTPPPWVRPVPMAVWAATAARAVTAVVAAPAAP RSPAPPARPGTAAPAVTAARPVMAEPVPPVM" CDS complement(3701145..3701402) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3378C" /product="conserved transmembrane protein" /note="Mb3378c, -, len: 85 aa. Equivalent to Rv3346c, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypothetical protein, highly similar to mycobacterium hypothetical proteins O50384|Rv3355c|MTV004.12c from strain H37Rv (97 aa), FASTA scores: opt: 413, E(): 4.6e-23, (85.55% identity in 97 aa overlap); O32878|MLCB1779.16c|ML0675 from Mycobacterium leprae (91 aa), FASTA scores: opt: 349, E(): 1.7e-18, (67.35% identity in 95 aa overlap). Mb3378c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3X4" /db_xref="InterPro:IPR021385" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X4" /protein_id="SIU02007.1" /translation="MTVRAVLRRTVGAQWPILAGVNFWRRGALLIGIGVGVAAVLRLV LSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG" CDS complement(3701658..3704843) /codon_start=1 /transl_table=11 /gene="PPE55b" /locus_tag="BQ2027_MB3379C" /product="PPE FAMILY PROTEIN [SECOND PART]" /note="Mb3379c, PPE55b, len: 1061 aa. Equivalent to 3' end of Rv3347c, len: 3157 aa, from Mycobacterium tuberculosis strain H37Rv, (99.812% identity in 1061 aa overlap). Member of the Mycobacterium tuberculosis PPE family, Gly-, Ala-, Asn-rich protein. Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551, e.g. O50379|Rv3350c|MTV004.07c (3716 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); and other upstream ORFs MTV004_5, MTY13E10_15, MTCY28_16, MTCY63_9, MTY13E10_17, MTCY180_1; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE55 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits PPE55 into 2 parts, PPE55a and PPE55b. Mb3379c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002989" /db_xref="UniProtKB/TrEMBL:A0A1R3Y453" /protein_id="SIU02008.1" /translation="MNSGAGNIGLFNSGTGNIGFFNSGTGNWGLFNSGSFNTGIGNSG TGSTGLFNAGGFTTGLANAGSYNTGSFNVGDTNTGGFNPGSINTGWFNTGNANTGIAN SGNVDTGALMSGNFSNGILWRGNYEGLFSYSYSLDVPRITILDAHFTGAFGPVVVPPI PVPAINAHLTGNAAMGAFTIPQIDIPALNPNVTGSVGFGPIAVPSVTIPALTAARAVL DMAASVGATSEIEPFIVWTSSGAIGPTWYSVGRIYNAGDLFVGGNIISGIPTLSTTGP VHAVFNAASQAFNTPALNIHQIPLGFQVPGSIDAITLFPGGLTFPANSLLNLDVFVGT PGATIPAITFPEIPANADGELYVIAGNIPLINIPPTPGIGNTTTVPSSGFFNTGAGGG SGFGNFGANMSGWWNQAHTALAGAGSGIANVGTLHSGVLNLGSGLSGIYNTSTLPLGT PALVSGLGNVGDHLSGLLASNVGQNPITIVNIGLANVGNGNVGLGNIGNLNLGAANIG DVNLGFGNIGDVNLGFGNIGGGNVGFGNIGDANFGFGNSGLAAGLAGMGNIGLGNAGS GNVGWANMGLGNIGFGNTGTNNLGIGLTGDNQSGIGGLNSGTGNIGLFNSGTGNIGFF NSGTANFGLFNSGSYNTGIGNSGVASTGLVNAGGFNTGVANAGSYNTGSFNAGDTNTG GFNPGSTNTGWFNTGNANTGVANAGNVNTGALITGNFSNGILWRGNYEGLAGFSFGYP IPLFPAVGADVTGDIGPATIIPPIHIPSIPLGFAAIGHIGPISIPNIAIPSIHLGIDP TFDVGPITVDPITLTIPGLSLDAAVSEIRMTSGSSSGFKVRPSFSFFAVGPDGMPGGE VSILQPFTVAPINLNPTTLHFPGFTIPTGPIHIGLPLSLTIPGFTIPGGTLIPQLPLG LGLSGGTPPFDLPTVVIDRIPVELHASTTIGPVSLPIFGFGGAPGFGNDTTAPSSGFF NTGGGGGSGFSNSGSGMSGVLNAISDPLLGSASGFANFGTQLSGILNRGAGISGVYNT GTLGLVTSAFVSGFMNVGQQLSGLLFAGTGP" CDS complement(3704840..3711130) /codon_start=1 /transl_table=11 /gene="PPE55a" /locus_tag="BQ2027_MB3380C" /product="PPE FAMILY PROTEIN [FIRST PART]" /note="Mb3380c, PPE55a, len: 2096 aa. Similar to 5' end of Rv3347c, len: 3157 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 2044 aa overlap). Member of the Mycobacterium tuberculosis PPE family, Gly-, Ala-, Asn-rich protein. Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551, e.g. O50379|Rv3350c|MTV004.07c (3716 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); and other upstream ORFs MTV004_5, MTY13E10_15, MTCY28_16, MTCY63_9, MTY13E10_17, MTCY180_1; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE55 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (g-*) splits PPE55 into 2 parts, PPE55a and PPE55b. Mb3380c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y7" /protein_id="SIU02009.1" /translation="MNFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVS FGQVTSGLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAQAVAVAGQARAAVAAFEAALA ATVDPAAVAVNRMAMRALAMSNLLGQNAAAIAAVEAEYELMWAADVAAMAGYHSGASA AAAALPAFSPPAQALGGGVGAFLNAIFAGPAKMLRLNAGLGNVGNYNVGLGNVGIFNL GAANVGAQNLGAANAGSGNFGFGNIGNANFGFGNSGLGLPPGMGNIGLGNAGSSNYGL ANLGVGNIGFANTGSNNIGIGLTGDNLTGIGGLNSGTGNLGLFNSGTGNIGFFNSGTG NFGVFNSGSYNTGVGNAGTASTGLFNVGGFNTGVANVGSYNTGSFNAGNTNTGGFNPG NVNTGWLNTGNTNTGIANSGNVNTGAFISGNFSNGVLWRGDYEGLWGLSGGSTIPAIP IGLELNGGVGPITVLPIQILPTIPLNIHQTFSLGPLVVPDIVIPAFGGGTAIPISVGP ITISPITLFPAQNFNTTFPVGPFFGLGVVNISGIEIKDLAGNVTLQLGNLNIDTRINQ SFPVTVNWSTPAVTIFPNGISIPNNPLALLASASIGTLGFTIPGFTIPAAPLPLTIDI DGQIDGFSTPPITIDRIPLNLGASVTVGPILINGVNIPATPGFGNTTTAPSSGFFNSG DGGVSGFGNFGAGSSGWWNQAQTEVAGAGSGFANFGSLGSGVLNFGSGVSGLYNTGGL PPGTPAVVSGIGNVGEQLSGLSSAGTALNQSLIINLGLADVGSVNVGFGNVGDFNLGA ANIGDLNVGLGNVGGGNVGFGNIGDANFGLGNAGLAAGLAGVGNIGLGNAGSGNVGFG NMGVGNIGFGNTGTNNLGIGLTGDNQTGIGGLNSGAGNIGLFNSGTGNVGLFNSGTGN FGLFNSGGYNTGIGNGGTGSTGLFNAGNFNTGVANPGSYNTGSFNVGDTNTGGFNPGS INTGWFNTGNANTGVANSGTVNTGAFITGNDSNGILWRGNFEGLFGLNVGITIPEFPI HWTSTGGIGPIIIPDTTILPPIHLGLTGQANYGFAVPDIPIPAIHIDFDGAADAGFTA SATTLLSALGITGQFRFGPITVSNVQLNPFNVNLKLQFLHDAFPNEFPDPTISVQIQV AIPLTSATLGGLALPLQQTIDAIELPAISFSQSIPIDIPPIDIPASTINGISMSEVVP IDVSVDIPAVTITGTRIDPIPLNFDVLSSAGPINISIIDIPALPGFGNSTELPSSGFF NTGGGGGSGIANFGAGVSGLLNQASSPMVGTLSGLGNAGSLASGVLNSGVDISGMFNV STLGSAPAVISGFGNLGNHVSGVSIDGLLAMLTSGGSGGSGQPSIIDAAIAELRHLNP LNIVNLGNVGSYNLGFANVGDVNLGAGNLGNLNLGGGNLGGQNLGLGNLGDGNVGFGN LGHGNVGFGNSGLGALPGIGNIGLGNAGSNNVGFGNMGLGNIGFGNTGTNNLGIGLTG DNQTGFGGLNSGAGNLGLFNSGTGNIGFFNTGTGNWGLFNSGSYNTGIGNSGTGSTGL FNAGSFNTGLANAGSYNTGSLNAGNTNTGGFNPGNVNTGWFNAGHTNTGGFNTGNVNT AAFNSGSFNNGALWTGDHHGLVGFSYSIEITGSTLVDINETLNLGPVHIDQIDIPGMS LFDIHELVNIGPFRIEPIDVPAVVLDIHETMVIPPIVFLPSMTIGGQTYTIPLDTPPA PAPPPFRLPLLFVNALGDNWIVGASNSTGMSGGFVTAPTQGILIHTGPSSATTGSLAL TLPTVTIPTITTSPIPLKIDVSGGLPAFTLFPGGLNIPQNAIPLTIDASGVLDPITIF PGGFTIDPLPLSLALNISVPDSSVPIIIVPPTPGFGNATATPSSGFFNSGAGGVSGFG NFGAGSSGWWNQAHAALAGAGSGVLNVGTLNSGVLNVGSGISGLYNTAIVGLGTPALV SGAGNVGQQLSGVLAAGTALTQSPIINLGLADVGNYNLGLGNVGDFNLGAANLGDLNL GLGNIGNANVGFGNIGHGNVGFGNSGLGRRSASAISGWAMRAAPTLAWPTWVWATSGS PTPAPTTSGLGWPATTRPASAA" mobile_element 3706864..3707893 /mobile_element_type="insertion sequence:IS1608" /locus_tag="BQ2027_IS1608'-1" /note="IS1608'-1, len: 1030 nt. Equivalent to IS1608',len: 489 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 489 nt overlap)." gene 3711277..3712306 /locus_tag="BQ2027_IS1608'-1" CDS 3711710..3712201 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3381" /product="PROBABLE TRANSPOSASE" /note="Mb3381, -, len: 163 aa. Equivalent to Rv3348, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Probable transposase, partially similar to several insertion elements e.g. P19834|YI11_STRCL INSERTION ELEMENT IS116 HYPOTHETICAL 44.8 KDA PROTEIN (SIMILAR TO IS900 OF MYCOBACTERIUM PARATUBERCULOSIS) from Streptomyces clavuligerus (399 aa), FASTA scores: opt: 146, E(): 0.016, (29.1% identity in 158 aa overlap). Mb3381 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A0G2QBZ4" /db_xref="InterPro:IPR002525" /db_xref="UniProtKB/TrEMBL:A0A0G2QBZ4" /protein_id="SIU02010.1" /translation="MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPT LAGLRTLTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGAIV GKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVIDANRSWRRLMS LAR" mobile_element complement(3712238..3712978) /mobile_element_type="insertion sequence:IS1561" /locus_tag="BQ2027_IS1561'" /note="IS1561', len: 738 nt. Equivalent to IS1561', len: 738 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 738 nt overlap)." CDS complement(3712238..3712978) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3382C" /product="PROBABLE TRANSPOSASE" /note="Mb3382c, -, len: 246 aa. Equivalent to Rv3349c,IS1561', len: 246 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 246 aa overlap). Probable transposase pseudogene fragment, similar to part of Q50911|U10634 IS204 PUTATIVE TRANSPOSASE from NOCARDIA ASTEROIDES (377 aa), FASTA scores: opt: 288, E(): 8.3e-11, (48.5% identity in 97 aa overlap); and others." /db_xref="InterPro:IPR002560" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X9" /protein_id="SIU02011.1" /translation="MAIDPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRR RVTWAFHDRRGRKIDPQWANRRRLLTARERLSDKSFAKMRNRINAVDPRAQILSAWIA KEELRTLLSTVRTGGDPHLARHHLHRFLPGASTRRSPNCSPWPPPLTSHPRSTPSWSP ASPTRASVVGEVAEMLGDIDGQCVQVEVPVPERGPAGCGGLDGLGRAGVSATPRVCAA MTAVNVAGRCAGQQADVGPTPQHRCRGR" CDS complement(3713892..3717551) /codon_start=1 /transl_table=11 /gene="PPE56d" /locus_tag="BQ2027_MB3383C" /product="PPE FAMILY PROTEIN [THIRD PART]" /note="Mb3383c, PPE56d, len: 1219 aa. Equivalent to 3' end of Rv3350c, len: 3716 aa, from Mycobacterium tuberculosis strain H37Rv, (100.000% identity in 1219 aa overlap). Member of the Mycobacterium tuberculosis PPE family of Gly-, Ala-, Asn-rich proteins, similar to many Mycobacterium tuberculosis proteins from strains H37Rv and CDC1551, e.g. O50378|Rv3347c|MTV004.03c (3157 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); MTCY28_16, MTV050_2, MTY13E10_17, MTCY63_10, MTCY180_1, MTCY63_9, MTV050_1, MTV014_3, MTY13E10_15; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE56 exists as a single gene. In Mycobacterium bovis, 2 frameshifts due to single base transversion (c-a) and a single base deletion (g-*) splits PPE56 into 3 parts, PPE56a, PPE56b and PPE56d. Mb3383c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002989" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3X8" /protein_id="SIU02012.1" /translation="MLNVGTLGSGVLNVGSGVSGIYNTSVLPLGTPAVLSGLGNVGHQ LSGVSAAGTALNQIPILNIGLADVGNFNVGFGNVGDVNLGAANLGAQNLGLGNVGTGN LGFANVGHGNIGFGNSGLTAGAAGLGNTGFGNAGSANYGFANQGVRNIGLANTGTGNI GIGLVGDNLTGIGGLNSGAGNIGLFNSGTGNIGFFNSGTGNFGIGNSGSFNTGIGNSG TGSTGLFNAGSFNTGVANAGSYNTGSFNAGDTNTGGFNPGTINTGWFNTGHTNTGIAN SGNVGTGAFMSGNFSNGLLWRGDHEGLFSLFYSLDVPRITIVDAHLDGGFGPVVLPPI PVPAVNAHLTGNVAMGAFTIPQIDIPALTPNITGSAAFRIVVGSVRIPPVSVIVEQII NASVGAEMRIDPFEMWTQGTNGLGITFYSFGSADGSPYATGPLVFGAGTSDGSHLTIS ASSGAFTTPQLETGPITLGFQVPGSVNAITLFPGGLTFPATSLLNLDVTAGAGGVDIP AITWPEIAASADGSVYVLASSIPLINIPPTPGIGNSTITPSSGFFNAGAGGGSGFGNF GAGTSGWWNQAHTALAGAGSGFANVGTLHSGVLNLGSGVSGIYNTSTLGVGTPALVSG LGNVGHQLSGLLSGGSAVNPVTVLNIGLANVGSHNAGFGNVGEVNLGAANLGAHNLGF GNIGAGNLGFGNIGHGNVGVGNSGLTAGVPGLGNVGLGNAGGNNWGLANVGVGNIGLA NTGTGNIGIGLTGDYQTGIGGLNSGAGNLGLFNSGAGNVGFFNTGTGNFGLFNSGSFN TGVGNSGTGSTGLFNAGSFNTGVANAGSYNTGSFNVGDTNTGGFNPGSINTGWLNAGN ANTGVANAGNVNTGAFVTGNFSNGILWRGDYQGLAGFAVGYTLPLFPAVGADVSGGIG PITVLPPIHIPPIPVGFAAVGGIGPIAIPDISVPSIHLGLDPAVHVGSITVNPITVRT PPVLVSYSQGAVTSTSGPTSEIWVKPSFFPGIRIAPSSGGGATSTQGAYFVGPISIPS GTVTFPGFTIPLDPIDIGLPVSLTIPGFTIPGGTLIPTLPLGLALSNGIPPVDIPAIV LDRILLDLHADTTIGPINVPIAGFGGAPGFGNSTTLPSSGFFNTGAGGGSGFSNTGAG MSGLLNAMSDPLLGSASGFANFGTQLSGILNRGAGISGVYNTGALGVVTAAVVSGFGN VGQQLSGLLFTGVGP" CDS complement(3717545..3723679) /codon_start=1 /transl_table=11 /gene="PPE56b" /locus_tag="BQ2027_MB3384C" /product="PPE FAMILY PROTEIN [SECOND PART]" /note="Mb3384c, PPE56b, len: 2044 aa. Equivalent to middle part of Rv3350c, len: 3716 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 2018 aa overlap). Member of the Mycobacterium tuberculosis PPE family of Gly-, Ala-, Asn-rich proteins, similar to many M. tuberculosis proteins from strains H37Rv and CDC1551, e.g. O50378|Rv3347c|MTV004.03c (3157 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); MTCY28_16, MTV050_2, MTY13E10_17, MTCY63_10, MTCY180_1, MTCY63_9, MTV050_1, MTV014_3, MTY13E10_15; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE56 exists as a single gene. In Mycobacterium bovis, 2 frameshifts due to single base transversion (c-a) and a single base deletion (g-*) splits PPE56 into 3 parts, PPE56a, PPE56b and PPE56d." /db_xref="InterPro:IPR002989" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y3" /protein_id="SIU02013.1" /translation="MLNGDIGPITIQPIPILPTIPLSIHQTVNLGPLVVPDIVIPAFG GGIGIPINIGPLTITPITLFAQQTFVNQLPFPTFSLGKITIPQIQTFDSNGQLVSFIG PIVIDTTIPGPTNPQIDLTIRWDTPPITLFPNGISAPDNPLGLLVSVSISNPGFTIPG FSVPAQPLPLSIDIEGQIDGFSTPPITIDRIPLTVGGGVTIGPITIQGLHIPAAPGVG NTTTAPSSGFFNSGAGGVSGFGNVGAGSSGWWNQAPSALLGAGSGVGNVGTLGSGVLN LGSGISGFYNTSVLPFGTPAAVSGIGNLGQQLSGVSAAGTTLRSMLAGNLGLANVGNF NTGFGNVGDVNLGAANIGGHNLGLGNVGDGNLGLGNIGHGNLGFANLGLTAGAAGVGN VGFGNAGINNYGLANMGVGNIGFANTGTGNIGIGLVGDHRTGIGGLNSGIGNIGLFNS GTGDVGFFNSGTGNFGIGNSGRFNTGIGNSGTASTGLFNAGSFSTGIANTGDYNTGSF NAGDTNTGGFNPGGINTGWFNTGHANTGLANAGTFGTGAFMTGDYSNGLLWRGGYEGL VGVRVGPTISQFPVTVHAIGGVGPLHVAPVPVPAVHVEITDATVGLGPFTVPPISIPS LPIASITGSVDLAANTISPIRALDPLAGSIGLFLEPFRLSDPFITIDAFQVVAGVLFL ENIIVPGLTVSGQILVTPTPIPLTLNLDTTPWTLFPNGFTIPAQTPVTVGMEVANDGF TFFPGGLTFPRASAGVTGLSVGLDAFTLLPDGFTLDTVPATFDGTILIGDIPIPIIDV PAVPGFGNTTTAPSSGFFNTGGGGGSGFANVGAGTSGWWNQGHDVLAGAGSGVANAGT LSSGVLNVGSGISGWYNTSTLGAGTPAVVSGIGNLGQQLSGFLANGTVLNRSPIVNIG WADVGAFNTGLGNVGDLNWGAANIGAQNLGLGNLGSGNVGFGNIGAGNVGFANSGPAV GLAGLGNVGLSNAGSNNWGLANLGVGNIGLANTGTGNIGIGLVGDYQTGIGGLNSGSG NIGLFNSGTGNVGFFNTGTGNFGLFNSGSFNTGIGNSGTGSTGLFNAGNFNTGIANPG SYNTGSFNVGDTNTGGFNPGDINTGWFNTGIMNTGTRNTGALMSGTDSNGMLWRGDHE GLFGLSYGITIPQFPIRITTTGGIGPIVIPDTTILPPLHLQITGDADYSFTVPDIPIP AIHIGINGVVTVGFTAPEATLLSALKNNGSFISFGPITLSNIDIPPMDFTLGLPVLGP ITGQLGPIHLEPIVVAGIGVPLEIEPIPLDVISLSESIPIRIPVDIPASVIDGISMSE VVPIDASVDIPAVTITGTTISAIPLGFDIRTSAGPLNIPIIDIPAAPGFGNSTQMPSS GFFNTGAGGGSGIGNLGAGVSGLLNQAGAGSLVGTLSGLGNAGTLASGVLNSGTAISG LFNVSTLDATTPAVISGFSNLGDHMSGVSIDGLIAILTFPPAESVFDQIIDAAIAELQ HLDIGNALALGNVGGVNLGLANVGEFNLGAGNVGNINVGAGNLGGSNLGLGNVGTGNL GFGNIGAGNFGFGNAGLTAGAGGLGNVGLGNAGSGSWGLANVGVGNIGLANTGTGNIG IGLTGDYRTGIGGLNSGTGNLGLFNSGTGNIGFFNTGTGNFGLFNSGSYSTGVGNAGT ASTGLFNAGNFNTGLANAGSYNTGSLNVGSFNTGGVNPGTVNTGWFNTGHTNTGLFNT GNVNTGAFNSGSFNNGALWTGDYHGLVGFSFSIDIAGSTLLDLNETLNLGPIHIEQID IPGMSLFDVHEIVEIGPFTIPQVDVPAIPLEIHESIHMDPIVLVPATTIPAQTRTIPL DIPASPGSTMTLPLISMRFEGEDWILGSTAAIPNFGDPFPAPTQGITIHTGPGPGTTG ELKISIPGFEIPQIATTRFLLDVNISGGLPAFTLFAGGLTIPTNAIPLTIDASGALDP ITIFPGGYTIDPLPLHLALNLTVPDSSIPIIDVPPTPGFGNTTATPSSGFFNSGAVGC RGSETSGRTCRAGGTRRRARWRGRDRGC" CDS complement(3723737..3725041) /codon_start=1 /transl_table=11 /gene="PPE56a" /locus_tag="BQ2027_MB3385C" /product="PPE FAMILY PROTEIN [FIRST PART]" /note="Mb3385c, PPE56a, len: 434 aa. Equivalent to 5' end of Rv3350c, len: 3716 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 434 aa overlap). Member of the Mycobacterium tuberculosis PPE family of Gly-, Ala-, Asn-rich proteins, similar to many Mycobacterium tuberculosis proteins from strains H37Rv and CDC1551, e.g. O50378|Rv3347c|MTV004.03c (3157 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); MTCY28_16, MTV050_2, MTY13E10_17, MTCY63_10, MTCY180_1, MTCY63_9, MTV050_1, MTV014_3, MTY13E10_15; etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE56 exists as a single gene. In Mycobacterium bovis, 2 frameshifts due to single base transversion (c-a) and a single base deletion (g-*) splits PPE56 into 3 parts, PPE56a, PPE56b and PPE56d." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y6" /protein_id="SIU02014.1" /translation="MEFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVS FGQVTSGLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAAAEAVAGQARVVVGVFEAALA ATVDPALVAANRARLVALAVSNLLGQNTPAIAAAEAEYELMWAADVAAMAGYHSGASA AAAALPAFSPPAQALGGGVGAFLTALFASPAKALSLNAGLGNVGNYNVGLGNVGVFNL GAGNVGGQNLGFGNAGGTNVGFGNLGNGNVGFGNSGLGAGLAGLGNIGLGNAGSSNYG FANLGVGNIGFGNTGTNNVGVGLTGNHLTGIGGLNSGTGNIGLFNSGTGNVGFFNSGT GNFGVFNSGNYNTGVGNAGTASTGLFNAGNFNTGVVNVGSYNTGSFNAGDTNTGGFNP GGVNTGWLNTGNTNTGIANSGNVNTGAFISGNFNNGVLWVGD" CDS complement(3725285..3726079) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3386C" /product="Putative oxidoreductase" /note="Mb3386c, -, len: 264 aa. Equivalent to Rv3351c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Hypothetical protein, highly similar to C-terminal region (aa 292-479) of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE from M. tuberculosis (479 aa), FASTA scores: opt: 699, E(): 1.7e-36, (54.75% identity in 190 aa overlap). Shows some similarity to Q9KYD6|SCD72A.20 PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (403 aa), FASTA scores: opt: 192, E(): 9.1e-05, (27.9% identity in 154 aa overlap); and P71091|YGAK HYPOTHETICAL 54.4 KDA PROTEIN from Bacillus subtilis (480 aa), FASTA scores: opt: 174, E(): 0.0014, (26.5% identity in 166 aa overlap). Note that the two upstream ORFs Rv3352c and Rv3353c also show similarity to Rv0063 (MTV030_7). Sequence was checked but no errors found." /db_xref="GOA:A0A1R3Y5N9" /db_xref="InterPro:IPR012951" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N9" /protein_id="SIU02015.1" /translation="MLASCPARSGAAVADAIKSAVGVQPSGVEHKTLRRMDLVRYLAG GHTTYPPEGFVAGSDVIGTTNPAAAQAIVAAIGTWPPAAGRASALIDSLGGAVGDMDP EGSAFPWCRQSAVVQWYVNTPSDGQVATANKWLSDAHHAVQHFSVGGYVNYLEANAAA SQYFGANLSRLTTVRRKYDPDRIMYSGLDFSTRQVAERLLPALGFRVRFGVLVIRCAL CTDTVKRLGTLPNLTWSRLKVNVAVTQEQAGVMDLPALPVRRTPRR" CDS complement(3726161..3726532) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3387C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3387c, -, len: 123 aa. Equivalent to Rv3352c, len: 123 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 123 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to part of several oxidoreductases (and hypothetical proteins) from diverse organisms e.g. Q9KYD6|SCD72A.20 PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (403 aa), FASTA scores: opt: 348, E(): 7.9e-15, (51.0% identity in 102 aa overlap); BAB53081|MLR6875 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (479 aa), FASTA scores: opt: 262, E(): 2.3e-09, (53.85% identity in 78 aa overlap); O94206|OX1 OXIDOREDUCTASE from Claviceps purpurea (Ergot fungus) (483 aa), FASTA scores: opt: 245, E(): 2.7e-08, (42.6% identity in 115 aa overlap); Q9KHK2|ENCM PUTATIVE FAD-DEPENDENT OXYGENASE ENCM from Streptomyces maritimus (464 aa), FASTA scores: opt: 238, E(): 7.2e-08, (43.95% identity in 91 aa overlap); etc. Also highly similar to part of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE (479 aa), FASTA scores: opt: 599, E(): 1.6e-30, (71.55% identity in 123 aa overlap); and to other Mycobacterium tuberculosis proteins e.g. Rv3353c and Rv3351c. All show similarity to a family of oxidoreductases in M. tuberculosis, suggesting that frameshift mutations may have occurred. Sequence has been checked but no errors were found." /db_xref="GOA:A0A1R3Y4P0" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P0" /protein_id="SIU02016.1" /translation="MSAATDLYAVHQALAGESRAIPTGSCPTVGVAGLTLGGGLGADS RHAGLTCDALKSATVVLPGGDAVSASADDHAELFWALRGGGGGNFGVTTSMTFARFPT ADCDVVRVDFAPSAAAQVLVG" CDS complement(3726675..3726935) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3388C" /product="Putative oxidoreductase" /note="Mb3388c, -, len: 86 aa. Equivalent to Rv3353c, len: 86 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 86 aa overlap). Hypothetical protein, showing some similarity to Q9X5Q4|MITR MITR PROTEIN from Streptomyces lavendulae (514 aa), FASTA scores: opt: 134, E(): 0.09, (29.5% identity in 78 aa overlap); and weak to Q49720|B1549_C3_218 from Mycobacterium leprae (222 aa), FASTA scores: opt: 99, E(): 8.8, (32.9% identity in 76 aa overlap). But highly similar to N-terminal part of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE from M. tuberculosis (479 aa), FASTA scores: opt: 305, E(): 4.9e-13, (52.9% identity in 87 aa overlap); and some similarity can be found with Rv3352c and Rv3351c. All show similarity to a family of oxidoreductases in M. tuberculosis, suggesting that frameshift mutations may have occurred. Sequence has been checked but no errors were found. Start changed since original submission. Mb3388c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Y5" /protein_id="SIU02017.1" /translation="MSRQTFLRGAVGAPATSAVFPTILARATPGDGWASLASSIGGQV LLPANGRAFTSGKQIFNSNYSGLNPAAVVTVASQADVRKAVS" CDS 3727050..3727439 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3389" /product="Possible lipoprotein LprJ" /note="Mb3389, -, len: 129 aa. Equivalent to Rv3354, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 129 aa overlap). Conserved hypothetical protein, equivalent (but shorter 29 aa) to Q9CCM4|ML0676 HYPOTHETICAL PROTEIN from Mycobacterium leprae (158 aa), FASTA scores: opt: 467, E(): 3.3e-21, (55.9% identity in 127 aa overlaps). Highly similar to O33192|LPRJ|Rv1690|MTCI125.12 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (127 aa), FASTA scores: opt: 329, E(): 4.7e-13, (46.95% identity in 115 aa overlap); and also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O07222|Rv1810|MTCY16F9.04c (118 aa), FASTA scores: opt: 195, E(): 4.2e-05, (37.15% identity in 113 aa overlap); MTCI125_11, MTCY16F9_4, MTV049_25. Protein product from Mb3389 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3389 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007969" /db_xref="UniProtKB/TrEMBL:A0A1R3Y459" /protein_id="SIU02018.1" /translation="MNLRRHQTLTLRLLAASAGILSAAAFAAPAQANPVDDAFIAALN NAGVNYGDPVDAKALGQSVCPILAEPGGSFNTAVASVVARAQGMSQDMAQTFTSIAIS MYCPSVMADAASGNLPALPDMPGLPGS" CDS complement(3727453..3727746) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3390C" /product="probable integral membrane protein" /note="Mb3390c, -, len: 97 aa. Equivalent to Rv3355c, len: 97 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 97 aa overlap). Hypothetical protein, equivalent to O32878|MLCB1779.16c|ML0675 HYPOTHETICAL 9.6 KDA PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 439, E(): 3.9e-23, (78.9% identity in 90 aa overlap). Identical, but with a gap, to O50377|Rv3346c|MTV004.02c HYPOTHETICAL 8.9 KDA PROTEIN from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 413, E(): 2.1e-21, (85.55% identity in 97 aa overlap). Also some similarity to other proteins e.g. Q9K3J5|SC2A6.10 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (178 aa), FASTA scores: opt: 147, E(): 0.003, (31.25% identity in 80 aa overlap). Mb3390c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y3Z7" /db_xref="InterPro:IPR021385" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z7" /protein_id="SIU02019.1" /translation="MTVRAVFRRTVGAQWPILLVGSIFAVGFVLAGANFWRRGALLIG IGVGVAAVLRLVLSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG" CDS complement(3727743..3728588) /codon_start=1 /transl_table=11 /gene="folD" /locus_tag="BQ2027_MB3391C" /product="PROBABLE BIFUNCTIONAL PROTEIN FOLD: METHYLENETETRAHYDROFOLATE DEHYDROGENASE + METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE" /note="Mb3391c, folD, len: 281 aa. Equivalent to Rv3356c, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 281 aa overlap). Probable folD, bifunctional enzyme include methylenetetrahydrofolate dehydrogenase (EC 1.5.1.5) and methenyltetrahydrofolate cyclohydrolase (EC 3.5.4.9), equivalent to O32879|FOLD|ML0674 METHYLENETETRAHYDROFOLATE DEHYDROGENASE (PUTATIVE METHYLENETETRAHYDROFOLATE DEHYDROGENASE/METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE) from Mycobacterium leprae (282 aa), FASTA scores: opt: 1624, E(): 1.2e-93, (86.45% identity in 281 aa overlap). Also similar to many others e.g. Q9K3J6|FOLD from Streptomyces coelicolor (284 aa), FASTA scores: opt: 1223, E(): 9.5e-69, (66.65% identity in 279 aa overlap); Q9K966|FOLD from Bacillus halodurans (279 aa), FASTA scores: opt: 886, E(): 7.7e-48, (47.15% identity in 280 aa overlap); P54382|FOLD_BACSU from Bacillus subtilis (283 aa), FASTA scores: opt: 820, E(): 9.7e-44, (45.7% identity in 280 aa overlap); P51696|FOLD_PHOPO from Photobacterium phosphoreum (285 aa), FASTA scores: opt: 778, E(): 4e-41, (44.9% identity in 283 aa overlap); P24186|FOLD_ECOLI|ADS|B0529 from Escherichia coli (287 aa), FASTA scores: opt: 741, E(): 0,44.4, (44.4% identity in 277 aa overlap); etc. Also highly similar to MLCB1779_9 from Mycobacterium leprae cosmid B1779 (282 aa) (86.5% identity in 281 aa overlap). SIMILAR TO OTHER DEHYDROGENASE/CYCLOHYDROLASE ENZYMES OR DOMAINS. Protein product from Mb3391c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3391c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TWN0" /db_xref="InterPro:IPR000672" /db_xref="InterPro:IPR020630" /db_xref="InterPro:IPR020631" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7TWN0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02020.1" /translation="MGAIMLDGKATRDEIFGDLKPRVAALDAAGRTPGLGTILVGDDP GSQAYVRGKHADCAKVGITSIRRDLPADISTATLNETIDELNANPDCTGYIVQLPLPK HLDENAALERVDPAKDADGLHPTNLGRLVLGTPAPLPCTPRGIVHLLRRYDISIAGAH VVVIGRGVTVGRPLGLLLTRRSENATVTLCHTGTRDLPALTRQADIVVAAVGVAHLLT ADMVRPGAAVIDVGVSRTDDGLVGDVHPDVWELAGHVSPNPGGVGPLTRAFLLTNVVE LAERR" CDS 3728712..3728987 /codon_start=1 /transl_table=11 /gene="relj" /locus_tag="BQ2027_MB3392" /product="antitoxin relj" /note="Mb3392, -, len: 91 aa. Equivalent to Rv3357, len: 91 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 91 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. Q9Z4V7|YU1E_STRCO (alias CAC37261|SCBAC17D6.02) ORFU1E (BELONGS TO THE PHD/YEFM FAMILY) from Streptomyces coelicolor (87 aa), FASTA scores: opt: 344, E(): 1.9e-17, (62.05% identity in 87 aa overlap); P46147|YEFM_ECOLI|B2017 from Escherichia coli strain K12 (83 aa), FASTA scores: opt: 215, E(): 1.6e-08, (50.0% identity in 72 aa overlap); BAB58570|SAV2408 from Staphylococcus aureus subsp. aureus Mu50 (83 aa), FASTA scores: opt: 161, E(): 8.8e-05, (39.95% identity in 77 aa overlap); Q9Z5W8 PUTATIVE PHD PROTEIN from Francisella novicid (85 aa), FASTA scores: opt: 143, E(): 0.0016, (28.9% identity in 83 aa overlap); etc. Also similar to Rv1247c|MTV006.19c (89 aa) (36.9% identity in 84 aa overlap). SEEMS TO BELONG TO THE PHD/YEFM FAMILY. Protein product from Mb3392 detected using SWATH mass spectrometry." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/Swiss-Prot:P65068" /protein_id="SIU02021.1" /translation="MSISASEARQRLFPLIEQVNTDHQPVRITSRAGDAVLMSADDYD AWQETVYLLRSPENARRLMEAVARDKAGHSAFTKSVDELREMAGGEE" CDS 3728984..3729241 /codon_start=1 /transl_table=11 /gene="relk" /locus_tag="BQ2027_MB3393" /product="toxin relk" /note="Mb3393, -, len: 85 aa. Equivalent to Rv3358, len: 85 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 85 aa overlap). Conserved hypohetical protein, highly similar to other hypohetical proteins e.g. Q9Z4V8|SCBAC17D6.03 from Streptomyces coelicolor (84 aa), FASTA scores: opt: 393, E(): 1.1e-21, (59.75% identity in 82 aa overlap); P56605|YOEB_ECOLI from Escherichia coli (84 aa), FASTA scores: opt: 305, E(): 2.2e-15, (49.35% identity in 77 aa overlap); Q9Z5W7 PUTATIVE DOC PROTEIN from Francisella novicida (68 aa), FASTA scores: opt: 253, E(): 9.6e-12, (51.6% identity in 62 aa overlap); BAB58569|SAV2407 from Staphylococcus aureus subsp. aureus Mu50 (88 aa), FASTA scores: opt: 250, E(): 2e-11, (40.5% identity in 84 aa overlap); etc. Mb3393 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64529" /db_xref="InterPro:IPR009614" /db_xref="InterPro:IPR035093" /db_xref="UniProtKB/Swiss-Prot:P64529" /protein_id="SIU02022.1" /translation="MRSVNFDPDAWEDFLFWLAADRKTARRITRLIGEIQRDPFSGIG KPEPLQGELSGYWSRRIDDEHRLVYRAGDDEVTMLKARYHY" CDS 3729283..3730473 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3394" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3394, -, len: 396 aa. Equivalent to Rv3359, len: 396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 396 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to N-terminal part of various proteins (hypothetical unknowns or oxidoreductases) e.g. Q9ZB94 HYPOTHETICAL 69.3 KDA PROTEIN from Rhodococcus erythropolis (649 aa), FASTA scores: opt: 509, E(): 3e-24, (30.0% identity in 380 aa overlap); O29991|AF0248 NADH-DEPENDENT FLAVIN OXIDOREDUCTASE from Archaeoglobus fulgidus (378 aa), FASTA scores: opt: 478, E(): 1.6e-22, (32.45% identity in 379 aa overlap); Q9HUH9|PA4986 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (648 aa), FASTA scores: opt: 412, E(): 3.3e-18, (30.45% identity in 384 aa overlap); Q9KCT8|BH1481 NADH OXIDASE from Bacillus halodurans (338 aa), FASTA scores: opt: 404, E(): 6.1e-18, (30.2% identity in 275 aa overlap); etc. Some weak similarity to Mycobacterium leprae MLCB1779_10. Protein product from Mb3394 detected using SWATH mass spectrometry. Mb3394 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y3Z9" /db_xref="InterPro:IPR001155" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z9" /protein_id="SIU02023.1" /translation="MAPGSCEAPDVFNPAKLGPLTLRNRVIKAATFEARTPDALVTDD LIEYHRLPAAGGVAMTTVAYCAVSPGGRTGGNQIWMRPHAVPGLRRLTEAIHAEGAAI SAQIGHAGPVADARSNQATALAPVRFFNPIAMRFAQKATREDIDDVLAAHAHAARLAV DAGFDAVEIHLGHNYLASAFLSPLLNRRDDEFGGSLQNRAKVARGLVMAVRRAVRQQV AVTAKLNMTDGIRGGITVDEALTTARWLQDDGGLDAIELTAGSSLVNPMYLFRGDAPV KEFAAAFKPPLRWGIRMTGHRFFREYPYRDAYLLREARLFRAELTIPLILLGGITNRT TMDLAMAEGFEFVAMARALLAEPDLVNRIAAEGSQVRSACTHCNQCMATIYRRTHCVV TGAP" CDS 3730590..3730958 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3395" /product="ABC transporter, ATP-binding protein" /note="Mb3395, -, len: 122 aa. Equivalent to Rv3360, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Hypothetical protein, highly similar to the N-terminus of O65934|Rv1747|MTCY28.10|MTCY04C12.31 probable ABC-transporter ATP-binding protein from Mycobacterium tuberculosis (865 aa), FASTA scores: opt: 480, E(): 4.7e-25, (61.0% identity in 118 aa overlap); and some similarity with the N-terminus of P96214|Rv3863|MTCY01A6.05c HYPOTHETICAL 41.1 KDA PROTEIN from Mycobacterium tuberculosis (392 aa), FASTA scores: opt: 138, E(): 0.033, (31.95% identity in 97 aa overlap). Some weak similarity with the N-terminus of other hypothetical proteins e.g. P73823|CYAA|SLR1991 ADENYLATE CYCLASE from Synechocystis sp. strain PCC 6803 (337 aa), FASTA scores: opt: 127, E(): 0.16, (28.55% identity in 112 aa overlap). Protein product from Mb3395 detected using SWATH mass spectrometry. Mb3395 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000253" /db_xref="InterPro:IPR008984" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z5" /protein_id="SIU02024.1" /translation="MSRPHPPVLTVRSDRSQQCFAAGRDVVVGSDLRADMRVAHPLIA RAHLLLRFDRGNWIAIDNDSQSGMFVDGQRVSEVDIYDGLTINIGKPTGPWITFEVGH HQGIIGRLSRTPSSRPGSPI" CDS complement(3730955..3731506) /codon_start=1 /transl_table=11 /gene="mfpA" /locus_tag="BQ2027_MB3396C" /product="Pentapeptide repeat family protein, MfpA => Quinolone resistance protein MfpA(Mt)" /note="Mb3396c, -, len: 183 aa. Equivalent to Rv3361c, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Conserved hypothetical protein, with some similarity to various proteins e.g. P74221|YB52_SYNY3|SLR1152 HYPOTHETICAL 36.2 KDA PROTEIN SLR (CONTAINS 5 PENTAPEPTIDE REPEAT DOMAINS) from Synechocystis sp. strain PCC 6803 (331 aa), FASTA scores: opt: 252, E(): 3.9e-10, (30.55% identity in 167 aa overlap); Q9SE95 FH PROTEIN INTERACTING PROTEIN FIP2 from Arabidopsis thaliana (Mouse-ear cress) (298 aa), FASTA scores: opt: 207, E(): 4.4e-07, (30.35% identity in 168 aa overlap); Q9A735|CC1891 PENTAPEPTIDE REPEAT FAMILY PROTEIN from Caulobacter crescentus (250 aa), FASTA scores: opt: 181, E(): 2.3e-05, (24.05% identity in 187 aa overlap); etc. Mb3396c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR001646" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5P1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02025.1" /translation="MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQH RGSAFRNCTFERTTLWHSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGL NLTGCRLRETSLVDTDLRKCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLV GARVDVDQAVAFAAAHGLCLAGG" CDS complement(3731513..3732094) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3397C" /product="PROBABLE ATP/GTP-BINDING PROTEIN" /note="Mb3397c, -, len: 193 aa. Equivalent to Rv3362c, len: 193 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 193 aa overlap). Probable ATP/GTP-binding protein, similar to others from Streptomyces coelicolor e.g. O86519|SC1C2.18c (174 aa), FASTA scores: opt: 731, E(): 9.8e-41, (66.85% identity in 169 aa overlap); Q9XAE1|SC6G9.41c (191 aa), FASTA scores: opt: 730, E(): 1.2e-40, (63.55% identity in 173 aa overlap); Q9L235|SC1A2.06 (184 aa), FASTA scores: opt: 650, E(): 1.9e-35, (55.95% identity in 177 aa overlap); Q9RJ74|SCI41.10c (176 aa), FASTA scores: opt: 618, E(): 2.3e-33, (55.9% identity in 161 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Mb3397c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR004130" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P7" /protein_id="SIU02026.1" /translation="MALKHSEASGTASTKIVIAGGFGSGKTTFVGAVSEIMPLRTEAM VTDASAGVDMLEATPDKRSTTVAMDFGRITLGEDLVLYLFGTPGQRRFWFMWDDLVRG AIGAIVLVDCRRLQDSFAAVDFFEHRNLPFLIAINEFDSAPRYPVSAVRDALTLPAHI PVINVDARNRRSATDALIAVSEYALATLSPAGG" CDS complement(3732075..3732443) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3398C" /product="DUF742 protein, component of G-protein-coupled receptor (GPCR) system" /note="Mb3398c, -, len: 122 aa. Equivalent to Rv3363c, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 122 aa overlap). Conserved hypothetical protein, similar to others from Streptomyces coelicolor e.g. O86523|SC1C2.23c (132 aa), FASTA scores: opt: 236, E(): 9e-09, (38.5% identity in 122 aa overlap); O86520|SC1C2.19c (190 aa), FASTA scores: opt: 231, E(): 2.7e-08, (41.0% identity in 122 aa overlap); Q9X834|SC9B1.14c (119 aa), FASTA scores: opt: 188, E(): 1.1e-05, (37.5% identity in 120 aa overlap); Q9ADJ4|SCBAC14E8.05 (113 aa), FASTA scores: opt: 167, E(): 0.00025, (33.05% identity in 109 aa overlap); etc. Mb3398c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007995" /db_xref="UniProtKB/TrEMBL:A0A1R3Y400" /protein_id="SIU02027.1" /translation="MFNPAGDRPKAGLVRPYTLTAGRTGTDVDLPLQAPVQTLPAGPA GRWPAYDMRRRILQLCIGSPSVAEISARLDLPVGVARVLVGDLVTSGYLRVHATLTDR STRDERHELIGRTLRGLKAL" CDS complement(3732421..3732813) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3399C" /product="Roadblock/LC7 protein, putative GTPase-activating (GAP) component of G-protein-coupled receptor (GPCR) system" /note="Mb3399c, -, len: 130 aa. Equivalent to Rv3364c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Conserved hypothetical protein, highly similar to others from Streptomyces coelicolor e.g. O86524|SC1C2.24c (137 aa), FASTA scores: opt: 466, E(): 1.3e-22, (58.6% identity in 116 aa overlap); O86521|SC1C2.20c (140 aa), FASTA scores: opt: 445, E(): 2.7e-21, (56.9% identity in 116 aa overlap); Q9KZI6|SCG8A.13c (145 aa), FASTA scores: opt: 341, E(): 9.5e-15, (51.3% identity in 113 aa overlap); etc. Protein product from Mb3399c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3399c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR004942" /db_xref="UniProtKB/TrEMBL:A0A1R3Y461" /protein_id="SIU02028.1" /translation="MKARLPDSPLDWLVSKFAREVPGVAHALLVSVDGLPVAASEHLP RERADQLAAVTSGLASLAGGAAQLFDGGQVLQSVVEMQNGYLLLMQVGDGSALAALAA TGCDIGQIGYEMAILVERVGGVVQSCRR" CDS complement(3732810..3735440) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3400C" /product="Putative sensor and ATPase, component of G-protein-coupled receptor (GPCR) system" /note="Mb3400c, -, len: 876 aa. Equivalent to Rv3365c, len: 876 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 876 aa overlap). Conserved hypothetical protein, similar to various proteins from Streptomyces coelicolor e.g. O86525|SC1C2.25c HYPOTHETICAL 139.7 KDA PROTEIN (SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES) (1329 aa), FASTA scores: opt: 879, E(): 5.4e-32, (29.9% identity in 924 aa overlap) (similarity in N-terminal part for this one); O86522|SC1C2.21c HYPOTHETICAL 119.9 KDA PROTEIN (SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES) (1111 aa), FASTA scores: opt: 855, E(): 5.6e-31, (28.9% identity in 892 aa overlap) (similarity in N-terminal part for this one); Q9KZI5|SCG8A.14c PUTATIVE MEMBRANE PROTEIN (862 aa), FASTA scores: opt: 791, E(): 3.3e-28, (30.8% identity in 828 aa overlap); Q9KZN0|SC1A8A.22c (943 aa), FASTA scores: opt: 660, E(): 2.5e-22, (27.65% identity in 893 aa overlap); etc. Similar in part to two consecutive Mycobacterium leprae hypothetical ORFs, probably representing a pseudogene: O07701|MLCL383.27 (118 aa), FASTA scores: opt: 430, E(): 1e-12, (58.25% identity in 115 aa overlap); and O07700|MLCL383.26 (111 aa), FASTA scores: opt: 271, E(): 1.3e-05, (50.4% identity in 121 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. Protein product from Mb3400c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3400c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y413" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3Y413" /protein_id="SIU02029.1" /translation="MTMFARPTIPVAAAASDISAPAQPARGKPQQRPPSWSPRNWPVR WKVFTIALLPLVVAMVLAGLRVEAAMASTSGLRLVAARAEMIPAITKYMSALDVAVLA SSTGHDVEGAQKNFTARKYELQTRLADTDVIADVRSGVNTLLNGGQALLDKVLADSIG LRDRVTAYAPLLLTAQNVIDASVRVDSEQIRTQVQGLSRAVGARGQMTMQEILVTRGA DLAEPQLRSAMVTLAGTEPSTLFGMSAALGAGSPDTKNLQQQMVTRMAIMSDPAVALV NNPELLHSIQITRDIAEQVITDTTEAVTKSVQSQATDRRDAAIRDAVLVLAAIATAIV VVLVVARTLVGPMRVLRDGALKVAHTDLDGEIAAVRAGDEPIPEPLAVYTTEEIGQVA HAVDELHTRALLLAGEETRLRLLVNEMFETMSRRSRSLVDQQLSVIDQLERNEEDPAR LDSLFRLDHLAARLRRNSANLLVLAGAQITRDHREPVPLSTVISAAVSEVEDYRRVDI ARVPDCAVVGAAAGGVIHLLAELIDNALRYSSPTTPVRVAAAIGSEGSVLLRISDSGL GMTDADRRMANMRLRAGGEVTPDSARHMGLFVVGRLAGRHGIRVGLRGPVTGEQGTGT TAEVYLPLAVLEGTAPAQPPKPRVFAIKPPCPEPAAADPTDVPAAIGPLPPVTLLPRR TPGSSGIADVPAQPMQQRRRELKTPWWEDRFQQEPKQPPAPEPRPAPPPAKPAPPAGP VDDDVIYRRMLSEMVGDPHELAHSPDLDWKSVWDHGWSAAAEAADKPVQSRTDYGLPV REPGARLVPGAAVPEGPDREHPGAALASNGGLHPGRAPRHAAAVRDPDAVRASISSHF GGVRTGRSHARESSQGPNQQ" CDS 3735676..3736140 /codon_start=1 /transl_table=11 /gene="spoU" /locus_tag="BQ2027_MB3401" /product="PROBABLE tRNA/rRNA METHYLASE SPOU (tRNA/rRNA METHYLTRANSFERASE)" /note="Mb3401, spoU, len: 154 aa. Equivalent to Rv3366, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Probable spoU, tRNA/rRNA methylase (EC 2.1.1.-), equivalent to Q9CCU7|ML0419 PUTATIVE tRNA/rRNA METHYLTRANSFERASE from Mycobacterium leprae (158 aa), FASTA scores: opt: 861, E(): 1.2e-50, (83.75% identity in 154 aa overlap); and O07698|MLCL383.24c rRNA METHYLASE from Mycobacterium leprae (169 aa), FASTA scores: opt: 861, E(): 1.3e-50, (83.75% identity in 154 aa overlap). Also highly similar to many members of the spoU family of rRNA methylases e.g. Q9K199|NMB0268 RNA METHYLTRANSFERASE (TRMH FAMILY) from Neisseria meningitidis (serogroup B) (154 aa), FASTA scores: opt: 534, E(): 7.6e-29, (50.0% identity in 154 aa overlap); and Q9JSM8|NMA2218 from Neisseria meningitidis (serogroup A) (154 aa), FASTA scores: opt: 526, E(): 2.6e-28, (49.35% identity in 154 aa overlap); Q9HU57|PA5127 from Pseudomonas aeruginosa (153 aa), FASTA scores: opt: 531, E(): 1.2e-28, (52.95% identity in 151 aa overlap); P33899|YIBK_ECOLI|B3606 from Escherichia coli strain K12 (157 aa), FASTA scores: opt: 511, E(): 2.6e-27, (49.35% identity in 154 aa overlap); etc. BELONGS TO THE RNA METHYLTRANSFERASE TRMH FAMILY. Protein product from Mb3401 detected using SWATH mass spectrometry. Mb3401 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y407" /db_xref="InterPro:IPR001537" /db_xref="InterPro:IPR016914" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="UniProtKB/TrEMBL:A0A1R3Y407" /protein_id="SIU02030.1" /translation="MFRLLFVSPRIAPNTGNAIRTCAATGCELHLVEPLGFDLSEPKL RRAGLDYHDLASVTVHASLAHAWEALSPARVFAFTAQATTLFTNVGYRAGDVLMFGPE PTGLDEATLADTHITGQVRIPMLAGRRSLNLSNAAAVAVYEAWRQHGFAGAV" CDS 3736507..3738387 /codon_start=1 /transl_table=11 /gene="PE_PGRS51" /locus_tag="BQ2027_MB3402" /product="pe-pgrs family protein pe_pgrs51" /note="Mb3402, PE_PGRS51, len: 626 aa. Similar to Rv3367, len: 588 aa, from Mycobacterium tuberculosis strain H37Rv, (93.75% identity in 626 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins. Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O50415|Rv3388|MTV004.46 (731 aa), FASTA scores: opt: 1999, E(): 7.2e-72, (55.0% identity in 620 aa overlap); and MTV004_44, MTV043_65, MTV006_15, MTCY63_2, MTCY21B4_13, MTV023_21, MTV008_43, MTCY24A1_4, MTV023_15; etc. Equivalent to AAK47814 from M. tuberculosis strain CDC1551 (628 aa) but shorter 37 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 105 bp and of 9 bp (*-ggcagcggt) lead to longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (626 aa versus 588 aa). Mb3402 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y3Z3" /protein_id="SIU02031.1" /translation="MSFVVAVPEALAAAASDVANIGSALSAANAAAAAGTTGLLAAGA DEVSAALASLFSGHAVSYQQVAAQATALHDQFVQALTGAGGSYALTEAANVQQNLLNA INAPTQALLGRPLIGDGAVGTASSPDGQDGGLLFGNGGAGYNSAATPGMAGGNGGNAG LIGNGGTGGSGGAGAAGGAGGSGGWLYGNGGNGGIGGNAIVAGGAGGNGGAGGAAGLW GSGGSGGQGGNGLTGNDGVNPAPVTNPALNGAAGDSNIEPQTSVLIGTQGGDGTPGGA GVNGGNGGAGGDANGNPANTSIANAGAGGNGAAGGDGGANGGAGGAGGQAASAGSSVG GDGGNGGAGGTGTNGHAGGAGGAGGQAASAGSSVGGDGGNGGAGGTGTNGHAGGAGGA GGAGGRGGWLVGSGGNGGNGGNGAAGGNGAIGGTGGAGGVPANQGGNSALGTQPVSGD GGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGNGGNGGTGGSGGVGGNGGIGGD GAGGGNATSTSSIPFDAHGGNGGAGGDAGHGGTGGDGGDGGHAGTGGRGGLLAGQHAN SGNGGGGGTGGAGGTHGTPGSGNAGGTGTGNADSTNGGPGSDGLGGDAFNGSRGTDGN PG" CDS complement(3738388..3739032) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3403C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3403c, -, len: 214 aa. Equivalent to Rv3368c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Possible oxidoreductase (EC 1.-.-.-), equivalent to O07697|MLCL383.23|ML0418 HYPOTHETICAL 23.6 KDA PROTEIN (PUTATIVE OXIDOREDUCTASE) from Mycobacterium leprae (210 aa), FASTA scores: opt: 1215, E(): 1.5e-74, (81.4% identity in 210 aa overlap). Also similar to O30106|AF0131 PUTATIVE NAD(P)H-FLAVIN OXIDOREDUCTASE from Archaeoglobus fulgidus (194 aa), FASTA scores: opt: 139, E(): 0.028, (29.0% identity in 207 aa overlap); Q60049|NOX_THETH NADH DEHYDROGENASE from Thermus aquaticus (subsp. thermophilus) (205 aa), FASTA scores: opt: 169, E(): 0.00028, (28.3% identity in 212 aa overlap); and shows some similarity to other hypothetical proteins (unknowns or oxidoreductases). Protein product from Mb3403c detected using shotgun mass spectrometry. Mb3403c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y406" /db_xref="InterPro:IPR000415" /db_xref="InterPro:IPR029479" /db_xref="UniProtKB/TrEMBL:A0A1R3Y406" /protein_id="SIU02032.1" /translation="MTLNLSVDEVLTTTRSVRKRLDFDKPVPRDVLMECLELALQAPT GSNSQGWQWVFVEDAAKKKAIADVYLANARGYLSGPAPEYPDGDTRGERMGRVRDSAT YLAEHMHRAPVLLIPCLKGREDESAVGGVSFWASLFPAVWSFCLALRSRGLGSCWTTL HLLDNGEHKVADVLGIPYDEYSQGGLLPIAYTQGIDFRPAKRLPAESVTHWNGW" CDS 3739031..3739465 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3404" /product="FMN binding protein" /note="Mb3404, -, len: 144 aa. Equivalent to Rv3369, len: 144 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 144 aa overlap). Conserved hypothetical protein. C-terminus is similar to N-terminus of O07696|MLCL383.22c HYPOTHETICAL 14.7 KDA PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: opt: 174, E(): 6e-05, (67.55% identity in 37 aa overlap). Also some slight similarity to Q9EWU1|3SC5B7.08c from Streptomyces coelicolor (153 aa), FASTA scores: opt: 125, E(): 0.13, (31.05% identity in 116 aa overlap). Protein product from Mb3404 detected using shotgun mass spectrometry. Mb3404 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y404" /db_xref="InterPro:IPR011576" /db_xref="InterPro:IPR012349" /db_xref="InterPro:IPR019966" /db_xref="UniProtKB/TrEMBL:A0A1R3Y404" /protein_id="SIU02033.1" /translation="MWAGYRWAMSVELTQEVSARLTSDLYGWLTTVARSGQPVPRLVW FYFDGTDLTVYSMPQAAKVAHITAHPQVSLNLDSDGNGAGIIVVGGTAAVVATDVDCR DDAPYWAKYREDAAKFGLTEAIAAYSTRLKITPTRVWTTPTG" CDS complement(3739554..3742793) /codon_start=1 /transl_table=11 /gene="dnaE2" /locus_tag="BQ2027_MB3405C" /product="PROBABLE DNA POLYMERASE III (ALPHA CHAIN) DNAE2 (DNA NUCLEOTIDYLTRANSFERASE)" /note="Mb3405c, dnaE2, len: 1079 aa. Equivalent to Rv3370c, len: 1079 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 1079 aa overlap). Probable dnaE2, DNA polymerase III, alpha chain (EC 2.7.7.7), similar to many e.g. BAB51086|MLR4428 from Rhizobium loti (Mesorhizobium loti) (1118 aa), FASTA scores: opt: 1103, E(): 8.9e-59, (37.65% identity in 1075 aa overlap); Q9S291|SCI11.28c from Streptomyces coelicolor (1185 aa), FASTA scores: opt: 937, E(): 1e-48, (33.4% identity in 1090 aa overlap); O67125|DP3A_AQUAE|DNAE|AQ_1008 from Aquifex aeolicus (1161 aa), FASTA scores: opt: 895, E(): 3.4e-46, (29.9% identity in 1071 aa overlap); O51526|DP3A_BORBU from Borrelia burgdorferi (Lyme disease spirochete) (1147 aa), FASTA scores: opt: 835, E(): 1.4e-42, (30.05% identity in 888 aa overlap); etc. Equivalent to AAK47817 from Mycobacterium tuberculosis strain CDC1551 (1098 aa) but shorter 19 aa. Also similar to Mycobacterium tuberculosis DP3A_MYCTU|MTCY48.18c (29.6% identity in 1110 aa overlap). BELONGS TO DNA POLYMERASE TYPE-C FAMILY, DNAE SUBFAMILY. Mb3405c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWL9" /db_xref="InterPro:IPR003141" /db_xref="InterPro:IPR004013" /db_xref="InterPro:IPR004805" /db_xref="InterPro:IPR011708" /db_xref="InterPro:IPR016195" /db_xref="InterPro:IPR023073" /db_xref="InterPro:IPR029460" /db_xref="InterPro:IPR040982" /db_xref="UniProtKB/Swiss-Prot:Q7TWL9" /protein_id="SIU02034.1" /translation="MERVLNGKPRHAGVPAFDADGDVPRSRKRGAYQPPGRERVGSSV AYAELHAHSAYSFLDGASTPEELVEEAARLGLCALALTDHDGLYGAVRFAEAAAELDV RTVFGAELSLGATARTERPDPPGPHLLVLARGPEGYRRLSRQLAAAHLAGGEKGKPRY DFDALTEAAGGHWHILTGCRKGHVRQALSQGGPAAAQRALADLVDRFTPSRVSIELTH HGHPLDDERNAALAGLAPRFGVGIVATTGAHFADPSRGRLAMAMAAIRARRSLDSAAG WLAPLGGAHLRSGEEMARLFAWCPEAVTAAAELGERCAFGLQLIAPRLPPFDVPDGHT EDSWLRSLVMAGARERYGPPKSAPRAYSQIEHELKVIAQLRFPGYFLVVHDITRFCRD NDILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLSPARDGPPDIDIDIESDQRE KVIQYVYHKYGRDYAAQVANVITYRGRSAVRDMARALGFSPGQQDAWSKQVSHWTGQA DDVDGIPEQVIDLATQIRNLPRHLGIHSGGMVICDRPIADVCPVEWARMANRSVLQWD KDDCAAIGLVKFDLLGLGMLSALHYAKDLVAEHKGIEVDLARLDLSEPAVYEMLARAD SVGVFQVESRAQMATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHPYIRRRNGVDPVI YEHPSMAPALRKTLGVPLFQEQLMQLAVDCAGFSAAEADQLRRAMGSKRSTERMRRLR GRFYDGMRALHGAPDEVIDRIYEKLEAFANFGFPESHALSFASLVFYSAWFKLHHPAA FCAALLRAQPMGFYSPQSLVADARRHGVAVHGPCVNASLAHATCENAGTEVRLGLGAV RYLGAELAEKLVAERTANGPFTSLPDLTSRVQLSVPQVEALATAGALGCFGMSRREAL WAAGAAATGRPDRLPGVGSSSHIPALPGMSELELAAADVWATGVSPDSYPTQFLRADL DAMGVLPAERLGSVSDGDRVLIAGAVTHRQRPATAQGVTFINLEDETGMVNVLCTPGV WARHRKLAHTAPALLIRGQVQNASGAITVVAERMGRLTLAVGARSRDFR" CDS 3742985..3744325 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3406" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb3406, -, len: 446 aa. Equivalent to Rv3371, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 446 aa overlap). Hypothetical protein, similar to many Mycobacterium tuberculosis (strains H37Rv and CDC1551) hypothetical proteins e.g. O07035|YV30_MYCTU|Rv3130c|MTCY03A2.28|MTCY164.41c (463 aa), FASTA scores: opt: 556, E(): 7.7e-28, (44.95% identity in 447 aa overlap); MTY20B11_9, MTCY28_26, MTV013_8, MTCY21B4_43, MTCY493_29; etc. Also similar to O07692|MLCL383_9|MLCL383.18c HYPOTHETICAL 14.1 KDA PROTEIN from Mycobacterium leprae (129 aa), FASTA scores: opt: 293, E(): 1.3e-11, (47.85% identity in 117 aa overlap). Mb3406 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5P4" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5P4" /protein_id="SIU02035.1" /translation="MAQLTALDAGFLKSRDPERHPGLAIGAVAVVNGAAPSYDQLKTV LTERIKSIPRCTQVLATEWIDYPGFDLTQHVRRVALPRPGDEAELFRAIALALERPLD PDRPLWECWIIEGLNGNRWAILIKIHHCMAGAMSAAHLLARLCDDADGSAFANNVDIK QIPPYGDARSWAETLWRMSVSIAGAVCTAAARAVSWPAVTSPAGPVTTRRRYQAVRVP RDAVDAVCHKFGVTANDVALAAITEGFRTVLLHRGQQPRADSLRTLEKTDGSSAMLPY LPVEYDDPVRRLRTVHNRSQQSGRRQPDSLSDYTPLMLCAKMIHALARLPQQGIVTLA TSAPGPRHQLRLMGQKMDQVLPIPPTALQLSTGVAVLSYGDELVFGITADYDAASEMQ QLVNGIELGVARLVALSDDSVLLFTKDRRKRSSRALPSAARRGRPSVPTARARH" CDS 3744367..3745542 /codon_start=1 /transl_table=11 /gene="otsB2" /locus_tag="BQ2027_MB3407" /product="trehalose 6-phosphate phosphatase otsb2 (trehalose-phosphatase) (tpp)" /note="Mb3407, otsB2, len: 391 aa. Equivalent to Rv3372, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 391 aa overlap). Possible otsB2, trehalose-6-phosphate phosphatase (EC 3.1.3.12), equivalent to Q49734|OTSB2|OTSP|B1620_F1_1|MLCL383.17c PUTATIVE TREHALOSE-PHOSPHATASE from Mycobacterium leprae (429 aa), FASTA scores: opt: 1675, E(): 2.4e-91, (67.05% identity in 425 aa overlap). Also weakly similar to several trehalose phosphatases e.g. Q9C8B3|F10O5.8 from Arabidopsis thaliana (Mouse-ear cress) (366 aa), FASTA scores: opt: 432, E(): 3.1e-18, (36.65% identity in 281 aa overlap); O27788|MTH1760 from Methanobacterium thermoautotrophicum (264 aa), FASTA scores: opt: 347, E(): 2.5e-13, (30.75% identity in 221 aa overlap); Q9FWQ2 from Oryza sativa (Rice) (382 aa), FASTA scores: opt: 338, E(): 1.1e-12, (32.5% identity in 320 aa overlap); etc. Also similar to part of Mycobacterium tuberculosis Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), FASTA scores: opt: 1192, E(): 1.6e-62, (56.65% identity in 339 aa overlap). Protein product from Mb3407 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3407 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWL7" /db_xref="InterPro:IPR003337" /db_xref="InterPro:IPR006379" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/Swiss-Prot:Q7TWL7" /protein_id="SIU02036.1" /translation="MRKLGPVTIDPRRHDAVLFDTTLDATQELVRQLQEVGVGTGVFG SGLDVPIVAAGRLAVRPGRCVVVSAHSAGVTAARESGFALIIGVDRTGCRDALRRDGA DTVVTDLSEVSVRTGDRRMSQLPDALQALGLADGLVARQPAVFFDFDGTLSDIVEDPD AAWLAPGALEALQKLAARCPIAVLSGRDLADVTQRVGLPGIWYAGSHGFELTAPDGTH HQNDAAAAAIPVLKQAAAELRQQLGPFPGVVVEHKRFGVAVHYRNAARDRVGEVAAAV RTAEQRHALRVTTGREVIELRPDVDWDKGKTLLWVLDHLPHSGSAPLVPIYLGDDITD EDAFDVVGPHGVPIVVRHTDDGDRATAALFALDSPARVAEFTDRLARQLREAPLRAT" CDS 3745779..3746669 /codon_start=1 /transl_table=11 /gene="echA18" /locus_tag="BQ2027_MB3408" /product="probable enoyl-coa hydratase (fragment) echa18.1 (enoyl hydrase) (unsaturated acyl-coa hydratase) (crotonase)" /note="Mb3408, echA18, len: 296 aa. Equivalent to Rv3373 and Rv3374, len: 213 aa and 82 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 213 aa overlap and 100.0% identity in 82 aa overlap). Rv3373: Probable echA18, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. P97087|CRT from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (259 aa), FASTA scores: opt: 423, E(): 3.4e-20, (37.95% identity in 174 aa overlap); Q9X7Q4|SC5F2A.31c from Streptomyces coelicolor (257 aa), FASTA scores: opt: 399, E(): 1.2e-18, (45.05% identity in 171 aa overlap); BAB52005|MLL5584 from Rhizobium loti (Mesorhizobium loti) (257 aa), FASTA scores: opt: 385, E(): 9.6e-18, (41.95% identity in 174 aa overlap); etc. Also some similarity to 3-HYDROXYBUTYRYL-COA DEHYDRATASES (EC 4.2.1.55) e.g. P52046|CRT_CLOAB from Clostridium acetobutylicum (261 aa), FASTA scores: opt: 414, E(): 1.3e-19, (38.3% identity in 175 aa overlap). And similar to other hydratases from Mycobacterium tuberculosis e.g. O53418|ECH8_MYCTU|Rv1070c|MT1100|MTV017.23c PROBABLE ENOYL-COA HYDRATASE (257 aa), FASTA scores: opt: 365, E(): 1.9e-16, (39.1% identity in 174 aa overlap). BELONGS TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Note that this homology extends across the stop codon and directly into the next ORF MTV004.29, suggesting a possible readthrough of the TGA stop codon. Rv3374: Probable echA18', enoyl-CoA hydratase C-terminus (EC 4.2.1.17), similar to the C-terminus of several enoyl-CoA hydratases e.g. Q9I5I4|PA0745 from Pseudomonas aeruginosa (272 aa), FASTA scores: opt: 123, E(): 0.13, (34.55% identity in 81 aa overlap); P97087|CRT from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (259 aa), FASTA scores: opt: 115, E(): 0.45, (32.95% identity in 82 aa overlap); Q9I002|PA2841 from Pseudomonas aeruginosa (263 aa), FASTA scores: opt: 108, E(): 1.4, (30.95% identity in 84 aa overlap); etc. Also some similarity to C-terminus of O29956|AF0285 3-HYDROXYACYL-COA DEHYDROGENASE from Archaeoglobus fulgidus (658 aa), FASTA scores: opt: 116, E(): 0.81, (34.15% identity in 82 aa overlap); and other enzymes. And similar to other hydratases from M. tuberculosis e.g. O53418|ECH8_MYCTU|Rv1070c|MT1100|MTV017. 23c PROBABLE ENOYL-COA HYDRATASE (257 aa), FASTA scores: opt: 111, E(): 0.83, (36.05% identity in 86 aa overlap). This homology extends across the upstream TGA stop codon into the upstream ORF MTV004.28, suggesting possible readthrough of the previous stop codon. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, echA18 and echA18' exist as 2 genes. In Mycobacterium bovis, a single base transversion (t-g) leads to a single product. Protein product from Mb3408 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3408 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y405" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR018376" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y405" /protein_id="SIU02037.1" /translation="MRRRAMTKMDEASNPCGGDIEAEMCQLMREQPPAEGVVDRVALQ RHRNVALITLSHPQAQNALNLASWRRLKRLLDDLAGESGLRAVVLRGAGDKAFAAGAD IKEFPNTRMSAADAAEYNESLAVCLRALTTMPIPVIAAVRGLAVGGGCELATACDVCI ATDDARFGIPLGKLGVTTGFTEADTVARLIGPAALKYLLFSGELIGIEEAARWGLVQK VVAPQDLAAATAKLVGQVCRQSAVTMRAAKVVANMHGRALTGADTDALIRFGVEAYEG ADLREGVAAFSQGRPPKFDD" CDS 3746674..3748101 /codon_start=1 /transl_table=11 /gene="amiD" /locus_tag="BQ2027_MB3409" /product="PROBABLE AMIDASE AMID (ACYLAMIDASE) (ACYLASE)" /note="Mb3409, amiD, len: 475 aa. Equivalent to Rv3375, len: 475 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 475 aa overlap). Probable amiD, amidase (EC 3.5.1.4), similar to various amidases e.g. Q53116|AMDA ENANTIOMERASE-SELECTIVE AMIDASE from Rhodococcus sp. (462 aa), FASTA scores: opt: 1036, E(): 1.6e-54, (38.6% identity in 464 aa overlap); Q9ZHK8|PZAA NICOTINAMIDASE/PYRAZINAMIDASE from Mycobacterium smegmatis (468 aa), FASTA scores: opt: 930, E(): 3.4e-48, (36.3% identity in 463 aa overlap); Q9A551|CC2613 PYRAZINAMIDASE/NICOTINAMIDASE from Caulobacter crescentus (464 aa), FASTA scores: opt: 841, E(): 7.1e-43, (39.45% identity in 469 aa overlap); O69768|AMID_PSEPU AMIDASE from Pseudomonas putida (466 aa), FASTA scores: opt: 800, E(): 2e-40, (33.6% identity in 467 aa overlap); O28325|YJ54_ARCFU|AF1954 PUTATIVE AMIDASE from Archaeoglobus fulgidu (453 aa), FASTA scores: opt: 669, E(): 1.3e-32, (30.4% identity in 467 aa overlap); etc. Also some similarity to AMIB2|Rv1263|MT1301|MTCY50.19c putative amidase from Mycobacterium tuberculosis (462 aa), (31.5% identity in 466 aa overlap). SEEMS BELONG TO THE AMIDASE FAMILY. Protein product from Mb3409 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3409 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63497" /db_xref="InterPro:IPR000120" /db_xref="InterPro:IPR020556" /db_xref="InterPro:IPR023631" /db_xref="InterPro:IPR036928" /db_xref="UniProtKB/Swiss-Prot:P63497" /protein_id="SIU02038.1" /translation="MTDADSAVPPRLDEDAISKLELTEVADLIRTRQLTSAEVTESTL RRIERLDPQLKSYAFVMPETALAAARAADADIARGHYEGVLHGVPIGVKDLCYTVDAP TAAGTTIFRDFRPAYDATVVARLRAAGAVIIGKLAMTEGAYLGYHPSLPTPVNPWDPT AWAGVSSSGCGVATAAGLCFGSIGSDTGGSIRFPTSMCGVTGIKPTWGRVSRHGVVEL AASYDHVGPITRSAHDAAVLLSVIAGSDIHDPSCSAEPVPDYAADLALTRIPRVGVDW SQTTSFDEDTTAMLADVVKTLDDIGWPVIDVKLPALAPMVAAFGKMRAVETAIAHADT YPARADEYGPIMRAMIDAGHRLAAVEYQTLTERRLEFTRSLRRVFHDVDILLMPSAGI ASPTLETMRGLGQDPELTARLAMPTAPFNVSGNPAICLPAGTTARGTPLGVQFIGREF DEHLLVRAGHAFQQVTGYHRRRPPV" CDS 3748209..3748862 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3410" /product="Hydrolase, haloacid dehalogenase-like family" /note="Mb3410, -, len: 217 aa. Equivalent to Rv3376, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Hypothetical protein, similar to various bacterial proteins (notably hydrolases) e.g. Q9RUP0|DR1344 HYDROLASE from Deinococcus radiodurans (222 aa), FASTA scores: opt: 348, E(): 1.8e-15, (36.75% identity in 215 aa overlap); Q9RXA1|DR0414 HYDROLASE (CBBY/CBBZ/GPH/YIEH FAMILY) from Deinococcus radiodurans (155 aa), FASTA scores: opt: 233, E(): 3.5e-08, (36.4% identity in 151 aa overlap); Q9X0Q9|TM1177 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (225 aa), FASTA scores: opt: 231, E(): 6.6e-08, (27.6% identity in 221 aa overlap); Q9ABI3|CC0244 HYDROLASE, HALOACID DEHALOGENASE-LIKE from Caulobacter crescentus (213 aa), FASTA scores: opt: 213, E(): 9.1e-07, (28.95% identity in 221 aa overlap); BAB38231|ECS4808 PUTATIVE PHOSPHATASE from Escherichia coli strain O157:H7 (206 aa), FASTA scores: opt: 210, E(): 1.4e-06, (26.95% identity in 193 aa overlap); etc. Protein product from Mb3410 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3410 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y426" /db_xref="InterPro:IPR006439" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y426" /protein_id="SIU02039.1" /translation="MSISAVVFDRDGVLTSFDWTRAEEDVRRITGLPLEEIERRWGGW LNGLTIDDAFVETQPISEFLSSLARELELGSKARDELVRLDYMAFAQGYPDARPALEE ARRRGLKVGVLTNNSLLVSARSLLQCAALHDLVDVVLSSQMIGAAKPDPRAYQAIAEA LGVSTTSCLFFDDIADWVEGARCAGMRAYLVDRSGQTRDGVVRDLSSLGAILDGAGP" CDS complement(3748954..3750405) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3411C" /product="halimadienyl diphosphate synthase" /note="Mb3411c, -, len: 483 aa. Similar to the 5' end of Rv3377c, len: 501 aa, from Mycobacterium tuberculosis strain H37Rv, (89.9% identity in 464 aa overlap). Possible cyclase; similarity with various proteins, notably cyclases involved in steroid biosynthesis in plants and bacteria e.g. BAB52679|MLR6369 from Rhizobium loti (Mesorhizobium loti) (516 aa), FASTA scores: opt: 533, E(): 5.6e-27, (30.45% identity in 522 aa overlap); Q9ZTN8 COPALYL DIPHOSPHATE SYNTHASE 1 from Cucurbita maxima (Pumpkin) (Winter squash) (823 aa), FASTA scores: opt: 484, E(): 1.2e-23, (28.35% identity in 388 aa overlap); Q38710|AC22 ABIETADIENE CYCLASE from Abies grandis (868 aa), FASTA scores: opt: 382, E(): 5.2e-17, (25.55% identity in 462 aa overlap); Q41771|AN1 KAURENE SYNTHASE A from Zea mays (Maize) (823 aa), FASTA scores: opt: 377, E(): 1.1e-16, (29.75% identity in 390 aa overlap); Q9AJE4 DITERPENE CYCLASE-1 from Kitasatospora griseola (Streptomyces griseolosporeus) (499 aa), FASTA scores: opt: 336, E(): 3.2e-14, (27.5% identity in 513 aa overlap); Q9SAU6 E-ALPHA-BISABOLENE SYNTHASE (FRAGMENT) from Abies grandis (782 aa), FASTA scores: opt: 317, E(): 7.8e-13, (25.25% identity in 479 aa overlap); etc. Note that this and the upstream ORF MTV004.36c have a significantly lower GC bias than the rest of the genome. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a 4 bp to 3 bp substitution (caat-aac) leads to a shorter product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb3411c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008930" /db_xref="UniProtKB/TrEMBL:A0A1R3Y420" /protein_id="SIU02040.1" /translation="METFRTLLAKAALGNGISSTAYDTAWVAKLGQLDDELSDLALNW LCERQLPDGSWGAEFPFCYEDRLLSTLAAMISLTSNKHRRRRAAQVEKGLLALKNLTS GAFEGPQLDIKDATVGFELIAPTLMAEAARLGLAICHEESILGELVGVREQKLRKLGG SKINKHITAAFSVELAGQDGVGMLDVDNLQETNGSVKYSPSASAYFALHVKPGDKRAL AYISSIIQAGDGGAPAFYQAEIFEIVWSLWNLSRTDIDLSDPEIVRTYLPYLDHVEQH WVRGRGVGWTGNSTLEDCDTTSVAYDVLSKFGRSPDIGAVLQFEDADWFRTYFHEVGP SISTNVHVLGALKQAGYDKCHPRVRKVLEFIRSSKEPGRFCWRDKWHRSAYYTTAHLI CAASNYDDALCSDAVGGFLIRRGPMARGDFSTAKRLRKRQHIAFKLWRIGRGTAAHPC RRRSVARVGGYRNTANRHTRRCGLPRHFTARRR" CDS complement(3750410..3751300) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3412C" /product="diterpene synthase" /note="Mb3412c, -, len: 296 aa. Equivalent to Rv3378c, len: 296 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 296 aa overlap). Hypothetical unknown protein. Note that this ORF and the downstream ORF MTV004.35c have a significantly lower GC bias than the rest of the genome. Mb3412c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y403" /db_xref="InterPro:IPR036424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y403" /protein_id="SIU02041.1" /translation="MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLEC NPQYDDYQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLA NDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGV FGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLL SSGKTSLYFTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRA QPDRVFGVGCVHDGIWFAEG" CDS complement(3751309..3752919) /codon_start=1 /transl_table=11 /gene="dxs2" /locus_tag="BQ2027_MB3413C" /product="PROBABLE 1-DEOXY-D-XYLULOSE 5-PHOSPHATE SYNTHASE DXS2 (1-DEOXYXYLULOSE-5-PHOSPHATE SYNTHASE) (DXP SYNTHASE) (DXPS)" /note="Mb3413c, dxs2, len: 536 aa. Equivalent to Rv3379c, len: 536 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 536 aa overlap). Probable dxs2, 1-deoxy-D-xylulose 5-phosphate synthase (EC 2.2.-.-), similar to many e.g. Q9F1V2|DXS from Kitasatospora griseola (Streptomyces griseolosporeus) (649 aa), FASTA scores: opt: 1274, E(): 5.4e-71, (50.9% identity in 570 aa overlap); Q9X7W3|DXS_STRCO|SC6A5.17 from Streptomyces coelicolor (656 aa), FASTA scores: opt: 1248, E(): 2.2e-69, (50.55% identity in 568 aa overlap); Q9RBN6|DXS_STRC1 from Streptomyces sp. strain CL190 (631 aa), FASTA scores: opt: 1237, E(): 1e-68, (49.1% identity in 570 aa overlap); Q50000|DXS_MYCLE|TKTB|ML1038 from M. leprae (643 aa), FASTA scores: opt: 1215, E(): 2.4e-67, (46.75% identity in 571 aa overlap); Q9R6S7|DXS_SYNLE from Synechococcus leopoliensis (636 aa), FASTA scores: opt: 849, E(): 8.9e-45, (38.55% identity in 550 aa overlap); etc. Also similar to O07184|DXS_MYCTU|Rv2682c|MT2756|MTCY05A6.03c from M. tuberculosis (638 aa), FASTA scores: opt: 1226, E(): 4.9e-68, (48.9% identity in 558 aa overlap). BELONGS TO THE TRANSKETOLASE FAMILY, DXS SUBFAMILY. COFACTOR: THIAMINE PYROPHOSPHATE (BY SIMILARITY). Note that the N-terminus of this putative protein appears to have been interrupted by the adjacent IS6110 element. Mb3413c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y414" /db_xref="InterPro:IPR005475" /db_xref="InterPro:IPR005477" /db_xref="InterPro:IPR009014" /db_xref="InterPro:IPR020826" /db_xref="InterPro:IPR029061" /db_xref="InterPro:IPR033248" /db_xref="UniProtKB/TrEMBL:A0A1R3Y414" /protein_id="SIU02042.1" /translation="MFDTGHQTYPHKLLTGRGKDFATLRQADGLSGYPNRHESPHDWV ENSHASVSLAWVDGIAKALALQGQCDRRVIAVIGDGALTGGVAWEGLNNLGAATRPVI VVLNDNGRSYDPTAGALAAHLEELRVGTPRGPNLFENMGFTYIGPVDGHNIPDTCAVL RKAAAAARPVVVHAVTSKGRGYPPAEADERDHMHAYGVVDIATGLASTPSQRSWTDVF EDEIARIADDRSDVVGLTAAMRLPTGLGALSRRYPHRVFDSGIAEQHLLASAAGLAAA GTHPVVAVYSTFLHRAFDQLLFDIGLHRLPVTLVLDRAGVTGPDGPSHHGLWDLALLA CVPGFQIACPRDAPRLRQQLRTAIATAAPTAVRFPKGAPGEPITAEHTIGGLDVLHTP PPHWRPDVLLVAVGAMSRPCMDAARCLSEEQIGVTVVDPQWVWPISPALTELAGRHRI TVCVEDAIADVGIGAHLSHHIGRTHPRTRTYTLGLPPAYIPHASRDHILSSHGLTGPA IRIRCKSLLNALHEVPGPEDHPDSGDSY" mobile_element 3752270..3753837 /mobile_element_type="insertion sequence:IS1560" /locus_tag="BQ2027_IS1560-2" /note="IS1560-2, len: 1568 nt. Equivalent to IS1560, len: 1568 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 1568 nt overlap). Possible IS element 1560. Second copy in MTCY10G2 fr om: 11273 to: 12919." CDS complement(3753144..3754133) /codon_start=1 /transl_table=11 /gene="lytB1" /locus_tag="BQ2027_MB3414C" /product="PROBABLE LYTB-RELATED PROTEIN LYTB1" /note="Mb3414c, lytB1, len: 329 aa. Equivalent to Rv3382c, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 329 aa overlap). Probable lytB1, lytB-related protein, highly similar to many e.g. Q9HVM7|LYTB_PSEAE|PA4557 from Pseudomonas aeruginosa (314 aa), FASTA scores: opt: 1048, E(): 2e-55, (53.2% identity in 314 aa overlap); Q9JR39|LYTB|NMA0624|NMB1831 from Neisseria meningitidis (serogroup A and B) (322 aa), FASTA scores: opt: 1041, E(): 5.4e-55, (52.25% identity in 312 aa overlap); P22565|LYTB_ECOLI|B0029 from Escherichia coli strain K12 (316 aa), FASTA scores: opt: 1013, E(): 2.5e-53, (51.45% identity in 311 aa overlap) (for more information about lytB protein, see citation below); Q9X781|LYTB_MYCLE|LYTB2|ML1938|MLCB1222.06c from Mycobacterium leprae (332 aa), FASTA scores: opt: 979, E(): 2.8e-51, (51.3% identity in 312 aa overlap); etc. Also similar to Q9PAS9|XF2416 DRUG TOLERANCE PROTEIN from Xylella fastidiosa (316 aa), FASTA scores: opt: 1043, E(): 4.1e-55, (53.65% identity in 315 aa overlap). And similar to O53458|Rv1110|LYTB2|MTV017.63 from Mycobacterium tuberculosis (335 aa), FASTA scores: opt: 975, E(): 4.9e-51, (51.3% identity in 312 aa overlap). BELONGS TO THE LYTB FAMILY. Mb3414c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5I3" /db_xref="InterPro:IPR003451" /db_xref="UniProtKB/Swiss-Prot:P0A5I3" /protein_id="SIU02043.1" /translation="MAEVFVGPVAQGYASGEVTVLLASPRSFCAGVERAIETVKRVLD VAEGPVYVRKQIVHNTVVVAELRDRGAVFVEDLDEIPDPPPPGAVVVFSAHGVSPAVR AGADERGLQVVDATCPLVAKVHAEAARFAARGDTVVFIGHAGHEETEGTLGVAPRSTL LVQTPADVAALNLPEGTQLSYLTQTTLALDETADVIDALRARFPTLGQPPSEDICYAT TNRQRALQSMVGECDVVLVIGSCNSSNSRRLVELAQRSGTPAYLIDGPDDIEPEWLSS VSTIGVTAGASAPPRLVGQVIDALRGYASITVVERSIATETVRFGLPKQVRAQ" CDS complement(3754133..3755185) /codon_start=1 /transl_table=11 /gene="idsB" /locus_tag="BQ2027_MB3415C" /product="POSSIBLE POLYPRENYL SYNTHETASE IDSB (POLYPRENYL TRANSFERASE) (POLYPRENYL DIPHOSPHATE SYNTHASE)" /note="Mb3415c, idsB, len: 350 aa. Equivalent to Rv3383c, len: 350 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 350 aa overlap). Possible idsB, polyprenyl transferase (polyprenyl diphosphate synthase) (EC 2.5.1.-), similar to many prenyltransferases involved in lipid biosynthesis e.g. Q9RGW1|GTR GERANYL TRANSFERASE from Streptomyces coelicolor (386 aa), FASTA scores: opt: 908, E(): 3.7e-50, (48.8/% identity in 334 aa overlap); Q9KWG0|GGDPS GERANYL GERANYL DIPHOSPHATE SYNTHASE from Kitasatospora griseola (Streptomyces griseolosporeus) (348 aa), FASTA scores: opt: 801, E(): 2e-43, (41.5% identity in 347 aa overlap); Q9X7V8|SC6A5.12 PUTATIVE POLYPRENYL SYNTHETASE from Streptomyces coelicolor (378 aa), FASTA scores: opt: 779, E(): 5.3e-42, (44.45% identity in 324 aa overlap); Q9S5E9 FARNESYL, GERANYLGERANYL, GERANYLFARNESYL, HEXAPRENYL, HEPTAPRENYL DIPHOSPHATE SYNTHASE (SELF-HEPPS) from Synechococcus elongatus (324 aa), FASTA scores: opt: 563, E(): 2.3e-28, (39.85% identity in 241 aa overlap) (see citation below); O26156|IDSA_METTH|MTH50 BIFUNCTIONAL SHORT CHAIN ISOPRENYL DIPHOSPHATE SYNTHASE [INCLUDES: FARNESYL PYROPHOSPHATE SYNTHETASE (EC 2.5.1.1) (FPP SYNTHETASE) (DIMETHYLALLYLTRANSFERASE) AND GERANYLTRANSTRANSFERASE (EC 2.5.1.10)] from Methanobacterium thermoautotrophicum (325 aa), FASTA scores: opt: 540, E(): 6.5e-27, (35.75% identity in 319 aa overlap); P95999|GGPP_SULSO|GDS|GDS-1|SSO0061|C05010|C05_049 GERANYLGERANYL PYROPHOSPHATE SYNTHETASE (GGPP SYNTHETASE) (GGPS) [INCLUDES: DIMETHYLALLYLTRANSFERASE (EC 2.5.1.1)AND GERANYLTRANSTRANSFERASE (EC 2.5.1.10) AND FARNESYLTRANSTRANSFERASE (EC 2.5.1.29)] from Sulfolobus solfataricus (332 aa), FASTA scores: opt: 511, E(): 4.5e-25 (36.9% identity in 244 aa overlap); etc. Also similar to Q50727|GGPP_MYCTU|Rv3398c|MT3506|MTCY78.30 PROBABLE MULTIFUNCTIONAL GERANYLGERANYL PYROPHOSPHATE SYNTHETASE [INCLUDES: DIMETHYLALLYLTRANSFERASE (EC 2.5.1.1); GERANYLTRANSTRANSFERASE (EC 2.5.1.10); FARNESYLTRANSTRANSFERASE (EC 2.5.1.29)] from Mycobacterium tuberculosis (359 aa), FASTA scores: opt: 687, E(): 3.4e-36, (39.1% identity in 325 aa overlap). Contains PS00723 Polyprenyl synthetases signature 1. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY. Mb3415c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y421" /db_xref="InterPro:IPR000092" /db_xref="InterPro:IPR008949" /db_xref="InterPro:IPR033749" /db_xref="UniProtKB/TrEMBL:A0A1R3Y421" /protein_id="SIU02044.1" /translation="MGGVLTLDAAFLGSVPADLGKALLERARADCGPVLHRAIESMRE PLATMAGYHLGWWNADRSTAAGSSGKYFRAALVYAAAAACGGDVGDATPVSAAVELVH NFTLLHDDVMDGDATRRGRPTVWSVWGVGGAILLGDALHATAVRILTGLTDECVAVRA IRRLQMSCLDLCIGQFEDCLLEGQPEVTVDDYLRMAAGKTAALTGCCCALGALVANAD DATIAALERFGHELGLAFQCVDDLIGIWGDPGVTGKPVGNDLARRKATLPVVAALNSR SEAATELAALYQAPAAMTASDVERATALVKVAGGGHVAQRCADERIQAAIAALPDAVR SPDLIALSQLICRREC" CDS complement(3755939..3756331) /codon_start=1 /transl_table=11 /gene="vapc46" /locus_tag="BQ2027_MB3416C" /product="possible toxin vapc46. contains pin domain." /note="Mb3416c, -, len: 130 aa. Equivalent to Rv3384c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins P95252|Rv1962c|MTCY09F9.02 (135 aa), FASTA scores: opt: 266, E(): 1.6e-10, (43.1% identity in 130 aa overlap); and Q50717|YY08_MYCTU|Rv3408|MTCY78.20c (136 aa), FASTA scores: opt: 243, E(): 4.8e-09, (35.1% identity in 131 aa overlap). Protein product from Mb3416c detected using SWATH mass spectrometry. Mb3416c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5Q5" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5Q5" /protein_id="SIU02045.1" /translation="MAAIYLDSSAIVKLAVREPESDALRRYLRTRHPRVSSALARAEV MRALLDKGESARKAGRRALAHLDLLRVDKRVLDLAGGLLPFELRTLDAIHLATAQRLG VDLGRLCTYDDRMRDAAKTLGMAVIAPS" CDS complement(3756331..3756639) /codon_start=1 /transl_table=11 /gene="vapb46" /locus_tag="BQ2027_MB3417C" /product="possible antitoxin vapb46" /note="Mb3417c, -, len: 102 aa. Equivalent to Rv3385c, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). Hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Q50718|Y09M_MYCTU|MTCY78.21c|Rv3407|MT3515 (99 aa), FASTA scores: opt: 155, E(): 0.001, (41.05% identity in 78 aa overlap); O07782|Rv0596c|MTCY19H5.26 (85 aa), FASTA scores: opt: 136, E(): 0.016, (39.45% identity in 71 aa overlap); P96916|Rv0626|MTCY20H10.07 (86 aa), FASTA scores: opt: 130, E(): 0.04, (51.2% identity in 41 aa overlap); etc. Also similar to PREVENT HOST DEATH (PHD) PROTEINS e.g. CAA66834|PHD from Escherichia coli (73 aa), FASTA scores: opt: 113, E(): 0.45, (39.4% identity in 66 aa overlap); and Q06253|PHD_BPP1 from Bacteriophage P1 (73 aa), FASTA scores: opt: 113, E(): 0.45, (39.4% identity in 66 aa overlap). Protein product from Mb3417c detected using SWATH mass spectrometry. Mb3417c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Q5" /protein_id="SIU02046.1" /translation="MTPTACATVSTMTSVGVRALRQRASELLRRVEAGETIEITDRGR PVALLSPLPQGGPYEQLLASGEIERATLDVVDLPEPLDLDAGVELPSVTLARLREHER " repeat_region 3756681..3756682 /rpt_type=DIRECT /note="2 bp direct repeat, GT, flanking IS element IS1560." repeat_region 3756683..3756707 /rpt_type=INVERTED /note="25 bp inverted repeat, IRL,TAATTACTAGGACCTGAAAAAGTCG, flanking IS element IS1560." gene 3756683..3758250 /locus_tag="BQ2027_IS1560-2" CDS 3756788..3757492 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3418" /product="POSSIBLE TRANSPOSASE" /note="Mb3418, -, len: 234 aa. Equivalent to Rv3386, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Possible transposase, showing very weak similarity to several IS element transposases. Highly similar (but shorter) to P963659|MTCY10G2_13|Rv1036c from Mycobacterium tuberculosis (112 aa), FASTA scores: opt: 507, E(): 8.3e-25, (83.9% identity in 87 aa overlap)." /db_xref="InterPro:IPR008490" /db_xref="UniProtKB/TrEMBL:A0A1R3Y417" /protein_id="SIU02047.1" /translation="MFRTVGDQASLWESVLPEELRRLPEELARVDALLDDSAFFCPFV PFFDPRMGRPSIPMETYLRLMFLKFRYRLGYESLCREVTDSITWRRFCRIPLEGSVPH PTTLMKLTTRCGEDAVAGLNEALLAKAASEKLLRTNKVRADTTVVEGDVGYPTDTGLL AKAVGSMARTVARIKAADAGSAPLGGSSGPRDRLQAAVTRRAATRSGAGLRAPDHRGA SRDRRAGADRGCRGGT" CDS 3757482..3758159 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3419" /product="POSSIBLE TRANSPOSASE" /note="Mb3419, -, len: 225 aa. Equivalent to Rv3387, len: 225 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 225 aa overlap). Possible transposase, showing very weak similarity to other IS element proteins, and similar to various hypothetical proteins." /db_xref="GOA:A0A1R3Y474" /db_xref="InterPro:IPR002559" /db_xref="UniProtKB/TrEMBL:A0A1R3Y474" /protein_id="SIU02048.1" /translation="MVRNAQRAVRRASGRRKAWLRQAINHLEKLIGRTERVVDQARSR LAGVMPDSSSRLVSLHDADARPIRKGRLGKPVEFGYKAQVVDNADGVILDHSVELGNP ADAPQLAPAIERISRRTGRPPRAVTADRGCGDASVEDDLHQLGVRNVAIPRKSKPSAT RRAFEHRRAFRDKIKWRTGSEGRINHLKRSYGWNRTELTGITGARTWCGHGVFAHNLV KISTLAA" repeat_region complement(3758226..3758250) /rpt_type=INVERTED /note="25 bp inverted repeat, IRR,TAATTACTAAGACCTGAAAAAGTCG, flanking IS element IS1560." repeat_region 3758251..3758252 /rpt_type=DIRECT /note="2 bp direct repeat, GT, flanking IS element IS1560." CDS 3758349..3760598 /codon_start=1 /transl_table=11 /gene="PE_PGRS52" /locus_tag="BQ2027_MB3420" /product="pe-pgrs family protein pe_pgrs52" /note="Mb3420, PE_PGRS52, len: 749 aa. Similar to Rv3388, len: 731 aa, from Mycobacterium tuberculosis strain H37Rv, (94.8% identity in 734 aa overlap). Member of the M. tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many PE-family proteins from M. tuberculosis strains H37Rv and CDC1551 e.g. O53553|YZ08_MYCTU|RV3508|MTV023.15 (1901 aa), FASTA scores: opt: 2380, E(): 3.6e-87, (53.8% identity in 773 aa overlap); and MTV023_21, MTV023_18, MTV023_14, MTV039_16, MTCY441_4. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 60 bp and 78 bp, and deletions of 75 bp and 9 bp (cggcggcgc-*), lead to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (749 aa versus 731 aa)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y432" /protein_id="SIU02049.1" /translation="MSFVIANPEMLAAAATDLAGIRSAISAATAAAAAPTIQVAAAGA DEVSLAISALFGQHAQAYQALSAQATIFHDQFVQALTSGGNLYAAAESHTVEQMVLNA INAPTQTLFGRPLIGDGANGTAENPDGQNGGLLFGNGGNGFTQTTAGVAGGNGGSAGL IGNGGAGGIGGAGTGTGGHGGAGGAGGRAWLWGTGGAGGAGAAAIGNAVTPGGAGGAG GAGGDGGWLFGDGGAGGTGGNGGSGFNSLTSSVGGAGGAGGHAGLFGAGGTGGTGGIG GQNTETGPAASNGGAGGAGGGGGYLVGDGGAGGTGGAGGKNSSGGATLTGGTGGTGGA GGAAGWLYGSGGAGGAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAAGAGGNGGNN TSAGTGGVGASGGTGGNAGLIGAGGHGGAGGAGGNQTGGVGNGGAGGNGGAGGAGGQL YGNGGDGGNGGAGGANIAGGNGSDGGAAGHGGAGGSARLIGAGGHGGDGGAGGNTAGR RADAIAGTGGDGGNGGNGGLLSGNAGAGGHGGAGGSSTATTTTGTPPTGATGGNGGNG GAGGTAGFTGSGGIGGNGGAGGTGGNAGVALSVGSTGGLGGNGGSGGLGGGGGSLFGN GGAGGVGATGGNAGSGIGPASVGGNGGKGGVGAAGGLAGQIGNGGSGGSGGAGGNGGT GDTAGNGGNGGAGAVGGNAQLIGNGGNGGGGGNGGTGATPGTGGAGAAGGTGGTLFGA PGTTGADGT" CDS complement(3760669..3761541) /codon_start=1 /transl_table=11 /gene="htdy" /locus_tag="BQ2027_MB3421C" /product="probable 3-hydroxyacyl-thioester dehydratase htdy" /note="Mb3421c, -, len: 290 aa. Equivalent to Rv3389c, len: 290 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 290 aa overlap). Possible dehydrogenase (EC 1.-.-.-), similar to parts of several bacterial dehydrogenases and eukaryotic short-chain dehydrogenases involved in steroid biosynthesis e.g. Q9UVH9|FOX2 FOX2 PROTEIN (a multifunctional protein of the peroxisomal beta-oxidation) (SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY) from Glomus mosseae (1015 aa), FASTA scores: opt: 649, E(): 7.5e-33, (40.9% identity in 269 aa overlap); Q9L009|SCC30.12c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (333 aa), FASTA scores: opt: 602, E(): 2.7e-30, (40.35% identity in 305 aa overlap); AAH03098 HYDROXYSTEROID (17-BETA) DEHYDROGENASE 4 from Homo sapiens (Human) (736 aa), FASTA scores: opt: 592, E(): 2.1e-29, (41.55% identity in 272 aa overlap); P51659|DHB4_HUMAN ESTRADIOL 17 BETA-DEHYDROGENASE 4 from Homo sapiens (Human) (736 aa), FASTA scores: opt: 592, E(): 2.1e-29, (41.55% identity in 272 aa overlap); Q19058|E04F6.3 HYDRATASE-DEHYDROGENASE-EPIMERASE from Caenorhabditis elegans (298 aa), FASTA scores: opt: 573, E(): 1.6e-28, (41.0% identity in 266 aa overlap); O42484 17-BETA-HYDROXYSTEROID DEHYDROGENASE TYPE IV from Gallus gallus (Chicken) (735 aa), FASTA scores: opt: 573, E(): 3.2e-28, (39.8% identity in 279 aa overlap); etc. And also similar in part to Q9LBK1|PHAJ2|PA1018 (R)-SPECIFIC ENOYL-COA HYDRATASE from Pseudomonas aeruginosa (288 aa), FASTA scores: opt: 601, E(): 2.7e-30, (40.5% identity in 294 aa overlap). And similar to P71863|UFAA2|Rv3538|MTCY03C7.18c HYPOTHETICAL 30.2 KDA PROTEIN from Mycobacterium tuberculosis (286 aa), FASTA scores: opt: 609, E(): 8.7e-31, (39.65% identity in 285 aa overlap). HAS SOME SIMILARITY TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3421c detected using shotgun mass spectrometry. Mb3421c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002539" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR039569" /db_xref="UniProtKB/TrEMBL:A0A1R3Y429" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02050.1" /translation="MAIDPNSIGAVTEPMLFEWTDRDTLLYAIGVGAGTGDLAFTTEN SHGIDQQVLPTYAVICCPAFGAAAKVGTFNPAALLHGSQGIRLHAPLPAAGKLSVVTE VADIQDKGEGKNAIVVLRGRGCDPESGSLVAETLTTLVLRGQGGFGGARGERPAAPEF PDRHPDARIDMLTREDQALIYRLSGDRNPLHSDPWFATQLAGFPKPILHGLCTYGVAG RALVAELGGGVAANITSIAARFTKPVFPGETLSTVIWRTEPGRAVFRTEVAGSDGAEA RVVLDDGAVEYVAG" CDS 3761615..3762325 /codon_start=1 /transl_table=11 /gene="lpqD" /locus_tag="BQ2027_MB3422" /product="PROBABLE CONSERVED LIPOPROTEIN LPQD" /note="Mb3422, lpqD, len: 236 aa. Equivalent to Rv3390, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Probable lpqD, a conserved lipoprotein with some similarity to various bacterial proteins e.g. Q9F3Q7|SC10F4.03 PUTATIVE ISOMERASE from Streptomyces coelicolor (224 aa), FASTA scores: opt: 416, E(): 2.5e-18, (33.0% identity in 197 aa overlap); Q9ZAX0|PGM 2,3-PDG DEPENDENT PHOSPHOGLYCERATE MUTASE from Amycolatopsis methanolica (205 aa), FASTA scores: opt: 314, E(): 3.7e-12, (28.55% identity in 203 aa overlap); P73454|SLR1748 HYPOTHETICAL 24.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (214 aa), FASTA scores: opt: 201, E(): 2.8e-05, (23.8% identity in 189 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. O53817|Rv0754|MTV041.28 PGRS-FAMILY PROTEIN (584 aa), FASTA scores: opt: 219, E(): 5.1e-06, (39.8% identity in 226 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3422 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3422 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y412" /protein_id="SIU02051.1" /translation="MAKRTPVRKACTVLAVLAATLLLGACGGPTQPRSITLTFIRNAQ SQANADGIIDTDMPGSGLSADGKAEAQQVAHQVSRRDVDSIYSSPMAADQQTAGPLAG ELGKQVEILPGLQAINAGWFNGKPESMANSTYMLAPADWLAGDVHNTIPGSISGTEFN SQFSAAVRKIYDSGHNTPVVFSQGVAIMIWTLMNARNSRDSLLTTHPLPNIGRVVITG NPVTGWRLVEWDGIRNFT" CDS 3762371..3764323 /codon_start=1 /transl_table=11 /gene="acrA1" /locus_tag="BQ2027_MB3423" /product="POSSIBLE MULTI-FUNCTIONAL ENZYME WITH ACYL-CoA-REDUCTASE ACTIVITY ACRA1" /note="Mb3423, acrA1, len: 650 aa. Equivalent to Rv3391, len: 650 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 650 aa overlap). Possible acrA1, multi functional protein with fatty acyl-CoA reductase activity in C-terminal part (EC 1.2.1.-). Indeed C-terminal part highly similar to P94129|ACR1 FATTY ACYL-COA REDUCTASE from Acinetobacter calcoaceticus (295 aa), FASTA scores: opt: 767, E(): 1.4e-36, (45.4% identity in 260 aa overlap); and similar to other oxidoreductases dehydrogenases/reductases e.g. Q9Y3A1 CGI-93 PROTEIN (SIMILARITY WITH SDR FAMILY) from Homo sapiens (Human) (291 aa), FASTA scores: opt: 363, E(): 1.5e-13, (38.65% identity in 194 aa overlap); Q9L146|SC6D11.09 PUTATIVE OXIDOREDUCTASE (SIMILARITY WITH SDR FAMILY) from Streptomyces coelicolor (343 aa), FASTA scores: opt: 346, E(): 1.6e-12, (30.4% identity in 283 aa overlap); Q9HSR4|YUSZ1|VNG0115G OXIDOREDUCTASE from Halobacterium sp. strain NRC-1 (260 aa), FASTA scores: opt: 338, E(): 3.7e-12, (33.85% identity in 248 aa overlap); etc. C-terminus also similar to Mycobacterium tuberculosis proteins Q10783|YF43_MYCTU|Rv1543|MTCY48.22c PUTATIVE OXIDOREDUCTASE (341 aa), FASTA scores: opt: 787, E(): 1.2e-37, (39.8% identity in 319 aa overlap); O06413|Rv0547c|MTCY25D10.26c HYPOTHETICAL 31.8 KDA PROTEIN (294 aa), FASTA scores: opt: 565, E(): 4.7e-25, (36.8% identity in 242 aa overlap); O53398|Rv1050|MTV017.03 OXIDOREDUCTASE (SDR FAMILY) (301 aa), FASTA scores: opt: 436, E(): 1.1e-17, (32.2% identity in 292 aa overlap). N-terminus (aa 1-320) is similar to P37693|HETM_ANASP polyketide synthase hetM from Anabaena sp. (506 aa), FASTA scores: opt: 188, E(): 1.3e-07, (27.7% identity in 361 aa overlap); so certainly a multi-domain enzyme. SEEMS TO BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Note that this ORF corresponds to the gene ORF2|Q11197 (in citation mentioned below), but longer 266 aa, due to use of a more upstream start site. Mb3423 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR013120" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y424" /protein_id="SIU02052.1" /translation="MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERL AGQWGDRVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELAAR LDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVRSTPGLRYRIY RPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPMLLPDIGRTNIVPVDYVADA LVALMHADGRDGQTFHLTAPTAIGLRGIYRGIAGAAGLPPLLGTLPGFVAAPVLNARG RAKVLRNMAATQLGIPAEIFDVVGCAPTFTSDTTREALRGTGIHVPEFATYAPGLWRY WAEHLDPDRARRNDPLLGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDEL VTEIRAHGGQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKYSSYLPTKA ALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPVRAISAERAAAMVIRGL VEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLYLGYPDSAAAQGISRPDADRPPAPR RPRRSARAGVPRPLRRLGRLVPGVHW" CDS complement(3764324..3765187) /codon_start=1 /transl_table=11 /gene="cmaA1" /locus_tag="BQ2027_MB3424C" /product="CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE 1 CMAA1 (CYCLOPROPANE FATTY ACID SYNTHASE) (CFA SYNTHASE) (CYCLOPROPANE MYCOLIC ACID SYNTHASE 1)" /note="Mb3424c, cmaA1, len: 287 aa. Equivalent to Rv3392c, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 287 aa overlap). cmaA1, cyclopropane mycolic acid synthase 1 (EC 2.1.1.79), characterized in 1995 as CFA1_MYCTU|Q11195|CMAA1|CMA1 cyclopropane-fatty-acyl-phospholipid synthase 1 (see citations below). Highly similar to Mycobacterium tuberculosis proteins MTCY20H10.23c (58.7% identity in 286 aa overlap); MTCY20H10.24c (68.6% identity); MTCY20H10.25c (73.5% identity); MTCY20H10.26c (57.0% identity); and MTCY20G9.30c (55.7% identity). Also highly similar to Q9CBK3|MMAA4|ML1903 METHYL MYCOLIC ACID SYNTHASES from Mycobacterium leprae (298 aa), FASTA scores: opt: 1098, E(): 1e-63, (57.0% identity in 286 aa overlap). Equivalent to AAK44898|MT0672 from Mycobacterium tuberculosis strain CDC1551 (317 aa) but shorter 30 aa and with some differences in residues between the proteins. Protein product from Mb3424c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3424c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y428" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y428" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02053.1" /translation="MPDELKPHFANVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMT LQEAQIAKIDLALGKLGLQPGMTLLDVGCGWGATMMRAVEKYDVNVVGLTLSKNQANH VQQLVANSESLRSKRVLLAGWEQFDEPVDRIVSIGAFEHFGHERYDAFFSLAHRLLPA DGVMLLHTITGLHPKEIHERGLPMSFTFARFLKFIVTEIFPGGRLPSIPMVQECASAN GFTVTRVQSLQPHYAKTLDLWSAALQANKGQAIALQSEEVYERYMKYLTGCAEMFRIG YIDVNQFTCQK" CDS 3765211..3766137 /codon_start=1 /transl_table=11 /gene="iunH" /locus_tag="BQ2027_MB3425" /product="PROBABLE NUCLEOSIDE HYDROLASE IUNH (PURINE NUCLEOSIDASE)" /note="Mb3425, iunH, len: 308 aa. Equivalent to Rv3393, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Probable iunH, nucleoside hydrolase (EC 3.2.2.-), similar to others e.g. Q9RXB2|DR0403 from Deinococcus radiodurans (314 aa), FASTA scores: opt: 497, E(): 6e-24, (34.3% identity in 312 aa overlap); Q27546|IUNH_CRIFA from Crithidia fasciculata (314 aa), FASTA scores: opt: 475, E(): 1.4e-22, (31.45% identity in 318 aa overlap); Q9CK67|IUNH from Pasteurella multocida (310 aa), FASTA scores: opt: 464, E(): 6.9e-22, (30.9% identity in 314 aa overlap); Q9A549|CC2615 from Caulobacter crescentus (323 aa), FASTA scores: opt: 464, E(): 7.2e-22, (37.85% identity in 280 aa overlap); etc. Note that also similar to BAB34113|ECS0690 (alias AAG54985|YBEK) PUTATIVE TRNA SYNTHETASE from Escherichia coli strain O157:H7 (311 aa), FASTA scores: opt: 483, E(): 4.5e-23, (33.0% identity in 315 aa overlap). The active site histidine is conserved. Mb3425 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y430" /db_xref="InterPro:IPR001910" /db_xref="InterPro:IPR023186" /db_xref="InterPro:IPR036452" /db_xref="UniProtKB/TrEMBL:A0A1R3Y430" /protein_id="SIU02054.1" /translation="MSVVFADVDTGIDDALAVIYLLASPDADLVGIASTGGNIAVGQV CANNLSLLELCGAADIPVSKGADEPLGGRWPDHPKFHGPKGIGYAELPASNRRLTDYD ATTAWIAAAHSHAGDLIGLVTGPLTNLALALRAEPALPRLLRRLVIMGGMFDGQPITE WNIRVDPEAASEVFTAWAGQRQLPIVCGLDLTRRVAMTPDILARLASVCGSSPVMRVI EDALRFYFESHEARGHGYLAYMHDPLAAAVAMDPELLTTRTATVDVDPTGATVTDWSG KRNPNARIGMSVDPAVFFDRFVERIGRFARRT" CDS complement(3766192..3767775) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3426C" /product="DNA polymerase IV-like protein ImuB" /note="Mb3426c, -, len: 527 aa. Equivalent to Rv3394c, len: 527 aa, from Mycobacterium tuberculosis strain H37Rv, (99.810% identity in 527 aa overlap). Hypothetical protein, with some similarity to various bacterial proteins e.g. BAB51085|MLR4427 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (545 aa), FASTA scores: opt: 267, E(): 2.8e-08, (26.5% identity in 509 aa overlap); BAB48362|MLR0866 DNA DAMAGE INDUCIBLE PROTEIN P from Rhizobium loti (Mesorhizobium loti) (438 aa), FASTA scores: opt: 245, E(): 4.6e-07, (25.5% identity in 290 aa overlap); Q9S292|SCI11.27c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (322 aa), FASTA scores: opt: 202, E(): 0.00012, (28.5% identity in 323 aa overlap); etc. Also similarity with P95102|DINP|RV3056|MTCY22D7.25c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (346 aa), FASTA scores: opt: 211, E(): 3.9e-05, (26.45% identity in 306 aa overlap). Equivalent to AAK47838 from Mycobacterium tuberculosis strain CDC1551 (492 aa) but longer 35 aa." /db_xref="GOA:A0A1R3Y5R5" /db_xref="InterPro:IPR001126" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5R5" /protein_id="SIU02055.1" /translation="MMASARVLAIWCMDWPAVAAAAAAGLSATAPVAVTLANRVIACS ATARAAGVRRGLRRREAAARCPQLFIATADADRDARLFEGVIAAVDDLVPRAELLRPG LLVLPVRGPARFFGSEQMAAERLIDAVAAAGAECQVGIADRLSTAVFAARAGRIVEPG GDARFLSLLSIRQLATEPSLSGPGRDDLTDLLWRMGIRTIGQFAALSRTDVASRFGAD AVAAHRFARGEPERAPCGREPPPDLAAELACDPPIDRVDAAAFAGRSLAAELHRALMA AGVGCTRLAIHAVTANGEERSRVWRCAEPLTEDATADRVRWQLDGWLNNRNARDRPTA AVTLLRLQAVETVSASEGLQLPLWGGLGEQDRLRARRALVRVQGLLGPEAVRVPVLSG GHGPAERITLTVLGLVAPEPVPQADPGQPWPGRLPDPSPAVLFDDPVDLLDAQGNPIR VTSRGMFSADPARLRVRGRDDRLRWWAGPWPDDERWWDPDRASGRTARAQVLLDGDPG TALLLCYRQRRWYLEGSYE" CDS complement(3767772..3768386) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3427C" /product="damage tolerance" /note="Mb3427c, -, len: 204 aa. Equivalent to Rv3395c, len: 204 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 204 aa overlap). Conserved hypothetical protein, with some similarity with RECA PROTEINS (RECOMBINASES A) e.g. P16238|RECA_THIFE from Thiobacillus ferrooxidans (346 aa), FASTA scores: opt: 131, E(): 1.1, (31.45% identity in 140 aa overlap); Q59560|RECA_MYCSM from M. smegmatis (349 aa), FASTA scores: opt: 121, E(): 4.4, (30.25% identity in 129 aa overlap); etc. Note that shortened since first submission to avoid overlap with Rv3395A. Equivalent to AAK47839 from Mycobacterium tuberculosis strain CDC1551 (227 aa) but shorter 23 aa. Mb3427c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R5" /protein_id="SIU02056.1" /translation="MTVAFASDQRLENGAEQLESLRRQMALLSEKVSGGPSRSGDLVP AGPVSLPPGTVGVLSGARSLLLSMVASVTAAGGNAAIVGQPDIGLLAAVEMGADLSRL AMIPDPGTDPVEVAAVLIDGMDLVVLGLGGRRVTRARARAVVARARQKGCTLLVTDGD WQGVSTRLAARVCGYEITPALRGVPTPGLGRISGVRLQINGRGR" CDS 3768469..3769095 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3428" /product="PROBABLE MEMBRANE PROTEIN" /note="Mb3428, -, len: 208 aa. Equivalent to Rv3395A, len: 208 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 208 aa overlap). Probable membrane protein, with potential transmembrane stretches from aa 7..25 and 55..77. Weak similarity to Q9F2P3|SCE41.16C PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 107, E(): 7.4, (34.05% identity in 94 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y423" /protein_id="SIU02057.1" /translation="MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATD NTTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLH NAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLR GGSVTTADHTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFVN" CDS complement(3769251..3770828) /codon_start=1 /transl_table=11 /gene="guaA" /locus_tag="BQ2027_MB3429C" /product="PROBABLE GMP SYNTHASE [GLUTAMINE-HYDROLYZING] GUAA (GLUTAMINE AMIDOTRANSFERASE) (GMP SYNTHETASE)" /note="Mb3429c, guaA, len: 525 aa. Equivalent to Rv3396c, len: 525 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 525 aa overlap). Probable guaA, gmp synthase (EC 6.3.5.2) (see citation below), equivalent to P46810|GUAA_MYCLE|ML0395|B1620_C2_205 GMP SYNTHASE [GLUTAMINE-HYDROLYZING] from Mycobacterium leprae (529 aa), FASTA scores: opt: 2992, E(): 8.5e-168, (86.85% identity in 525 aa overlap). Also highly similar to others e.g. O52831|GUAA_CORAM from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (524 aa), FASTA scores: opt: 2636, E(): 5.9e-147, (76.2% identity in 521 aa overlap); Q9L0H2|GUAA_STRCO from Streptomyces coelicolor (526 aa), FASTA scores: opt: 2451, E(): 4.1e-136, (71.55% identity in 513 aa overlap); Q9KF78|GUAA_BACHD from Bacillus Halodurans (513 aa), FASTA scores: opt: 1819, E(): 4.1e-99, (52.55% identity in 510 aa overlap); etc. Contains PS00442 Glutamine amidotransferases class-I active site. BELONGS TO THE TYPE-1 GLUTAMINE AMIDOTRANSFERASE FAMILY IN THE N-TERMINAL SECTION. AND BELONGS TO THE GMP SYNTHASE FAMILY IN THE C-TERMINAL SECTION. Protein product from Mb3429c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3429c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5A2" /db_xref="InterPro:IPR001674" /db_xref="InterPro:IPR004739" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR017926" /db_xref="InterPro:IPR022310" /db_xref="InterPro:IPR022955" /db_xref="InterPro:IPR025777" /db_xref="InterPro:IPR029062" /db_xref="UniProtKB/Swiss-Prot:P0A5A2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02058.1" /translation="MVQPADIDVPETPARPVLVVDFGAQYAQLIARRVREARVFSEVI PHTASIEEIRARQPVALVLSGGPASVYADGAPKLDPALLDLGVPVLGICYGFQAMAQA LGGIVAHTGTREYGRTELKVLGGKLHSDLPEVQPVWMSHGDAVTAAPDGFDVVASSAG APVAAFEAFDRRLAGVQYHPEVMHTPHGQQVLSRFLHDFAGLGAQWTPANIANALIEQ VRTQIGDGHAICGLSGGVDSAVAAALVQRAIGDRLTCVFVDHGLLRAGERAQVQRDFV AATGANLVTVDAAETFLEALSGVSAPEGKRKIIGRQFIRAFEGAVRDVLDGKTAEFLV QGTLYPDVVESGGGSGTANIKSHHNVGGLPDDLKFTLVEPLRLLFKDEVRAVGRELGL PEEIVARQPFPGPGLGIRIVGEVTAKRLDTLRHADSIVREELTAAGLDNQIWQCPVVL LADVRSVGVQGDGRTYGHPIVLRPVSSEDAMTADWTRVPYEVLERISTRITNEVAEVN RVVLDITSKPPATIEWE" CDS complement(3770840..3771748) /codon_start=1 /transl_table=11 /gene="phyA" /locus_tag="BQ2027_MB3430C" /standard_name="crtB" /product="PROBABLE PHYTOENE SYNTHASE PHYA" /note="Mb3430c, phyA, len: 302 aa. Equivalent to Rv3397c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 302 aa overlap). Probable phyA (alternate gene name: crtB), phytoene synthase (EC 2.5.1.-), similar to many e.g. Q9X7V5|SC6A5.09 from Streptomyces coelicolor (312 aa), FASTA scores: opt: 791, E(): 2.8e-43, (48.25% identity in 286 aa overlap); Q9RW07|DR0862 from Deinococcus radiodurans (325 aa), FASTA scores: opt: 482, E(): 1.5e-23, (35.25% identity in 292 aa overlap); Q9JRU9|NMB1168|NMB1130 from Neisseria meningitidis (serogroup B) (290 aa), FASTA scores: opt: 446, E(): 2.8e-21, (34.25% identity in 260 aa overlap); P37272|PSY_CAPAN from Capsicum annuum (Bell pepper) (419 aa), FASTA scores: opt: 431, E(): 3.4e-20, (33.0% identity in 288 aa overlap); etc. Also similar to Q9JUF5|NMA1339 PUTATIVE POLY-ISOPRENYL TRANSFERASE (EC 2.5.1.) from Neisseria meningitidis (serogroup A) (290 aa), FASTA scores: opt: 450, E(): 1.6e-21, (34.6% identity in 260 aa overlap). Contains PS01045 Squalene and phytoene synthases signature 2. BELONGS TO THE PHYTOENE/SQUALENE SYNTHETASE FAMILY. Mb3430c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65861" /db_xref="InterPro:IPR008949" /db_xref="InterPro:IPR017828" /db_xref="InterPro:IPR019845" /db_xref="InterPro:IPR033904" /db_xref="UniProtKB/Swiss-Prot:P65861" /protein_id="SIU02059.1" /translation="MTEIEQAYRITESITRTAARNFYYGIRLLPREKRAALSAVYALG RRIDDVADGELAPETKITELDAIRKSLDNIDDSSDPVLVALADAARRFPVPIAMFAEL IDGARMEIDWTGCRDFDELIVYCRRGAGTIGKLCLSIFGPVSTATSRYAEQLGIALQQ TNILRDVREDFLNGRIYLPRDELDRLGVRLRLDDTGALDDPDGRLAALLRFSADRAAD WYSLGLRLIPHLDRRSAACCAAMSGIYRRQLALIRASPAVVYDRRISLSGLKKAQVAA AALASSVTCGPAHGPLPADLGSHPSH" CDS complement(3771777..3772856) /codon_start=1 /transl_table=11 /gene="idsA1" /locus_tag="BQ2027_MB3431C" /standard_name="idsA" /product="PROBABLE MULTIFUNCTIONAL GERANYLGERANYL PYROPHOSPHATE SYNTHETASE IDSA1 (GGPP SYNTHETASE) (GGPPSASE) (GERANYLGERANYL DIPHOSPHATE SYNTHASE): DIMETHYLALLYLTRANSFERASE (PRENYLTRANSFERASE) (GERANYL-DIPHOSPHATE SYNTHASE) + GERANYLTRANSTRANSFERASE (FARNESYL-DIPHOSPHATE SYNTHASE) (FARNESYL-PYROPHOSPHATE SYNTHETASE) (FARNESYL DIPHOSPHATE SYNTHETASE) (FPP SYNTHETASE) + FARNESYLTRANSTRANSFERASE (GERANYLGERANYL-DIPHOSPHATE SYNTHASE)" /note="Mb3431c, idsA1, len: 359 aa. Equivalent to Rv3398c, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable idsA1, geranylgeranyl pyrophosphate synthetase (GGPP synthetase) including: dimethylallyltransferase (EC 2.5.1.1), geranyltranstransferase (EC 2.5.1.10), and farnesyltranstransferase (EC 2.5.1.29). Most similar to AE000797_3|O26156|Q53479 bifunctional short chain isoprenyl diphosphate synthase from Methanobacterium thermoautotrop (325 aa), FASTA scores: opt: 605, E(): 0, (37.1% identity in 329 aa overlap); homology suggests ATG at 30121 or TTG at 30145 to be the initiation codon. Contains PS00444 Polyprenyl synthetases signature 2. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY; BELONGS TO A FAMILY THAT GROUPS TOGETHER FPP SYNTHETASE, GGPP SYNTHETASE AND HEXAPRENYL PYROPHOSPHATE SYNTHETASE. Note that previously known as idsA." /db_xref="GOA:P0A5H9" /db_xref="InterPro:IPR000092" /db_xref="InterPro:IPR008949" /db_xref="InterPro:IPR033749" /db_xref="UniProtKB/Swiss-Prot:P0A5H9" /protein_id="SIU02060.1" /translation="MRGTDEKYGLPPQPDSDRMTRRTLPVLGLAHELITPTLRQMADR LDPHMRPVVSYHLGWSDERGRPVNNNCGKAIRPALVFVAAEAAGADPHSAIPGAVSVE LVHNFSLVHDDLMDRDEHRRHRPTVWALWGDAMALLAGDAMLSLAHEVLLDCDSPHVG AALRAISEATRELIRGQAADTAFESRTDVALDECLKMAEGKTAALMAASAEVGALLAG APRSVREALVAYGRHIGLAFQLVDDLLGIWGRPEITGKPVYSDLRSRKKTLPVTWTVA HGGSAGRRLAAWLVDETGSQTASDDELAAVAELIECGGGRRWASAEARRHVTQGIDMV ARIGIPDRPAAELQDLAHYIVDRQA" CDS 3772879..3773925 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3432" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb3432, -, len: 348 aa. Equivalent to Rv3399, len: 348 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 348 aa overlap). Hypothetical protein, similar to other Mycobacterium tuberculosis (strains H37Rv and CDC1551) hypothetical proteins e.g. P95074|Rv0726c|MTCY210.45c (367 aa), FASTA scores: opt: 1188, E(): 7.7e-69, (60.05% identity in 308 aa overlap); MTCY31.21c (38.0% identity in 308 aa overlap), MTV041_5, MTCY4C12_14, MTY13D12_21, MTV043_22, MTCY210_44, MTCI5_19, MTCI5_20, MTV035_9, MTCY180_22, MTCY31_23, MTY13D12_1, MTCY180_29; etc. Protein product from Mb3432 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3432 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59986" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P59986" /protein_id="SIU02061.1" /translation="MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMT RTDNDTWDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLASG ELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAAGLDTRAYRLP WPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVDLRNDWPTALKNAGFDPAR PTAFSAEGLLSYLPPQGQDRLLDAITALSAPDSRLATQSPLVLDLAEEDEKKMRMKSA AEAWRERGFDLDLTELIYFDQRNDVADYLAGSGWQVTTSTGKELFAAQGLPPFEDDHI TRFADRRYISAVLK" CDS 3773989..3774777 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3433" /product="PROBABLE HYDROLASE" /note="Mb3433, -, len: 262 aa. Equivalent to Rv3400, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Probable hydrolase (EC 3.-.-.-), strongly equivalent to Q49741|YY00_MYCLE|ML0393|B1620_F3_119 HYPOTHETICAL 28.6 KDA PROTEIN from Mycobacterium leprae (261 aa), FASTA scores: opt: 1293, E(): 2.2e-71, (74.45% identity in 262 aa overlap). Similar to several various proteins (notably hydrolases) e.g. Q9L2I7|SCF42.32 PUTATIVE HYDROLASE from Streptomyces coelicolor (246 aa), FASTA scores: opt: 888, E(): 7.7e-47, (56.35% identity in 245 aa overlap); Q9EX06|2SCG38.13 PUTATIVE HYDROLASE from Streptomyces coelicolor (238 aa), FASTA scores: opt: 195, E(): 8.1e-05, (29.5% identity in 234 aa overlap); Q9I5X4|PA0562 PROBABLE HYDROLASE from Pseudomonas aeruginosa (224 aa), FASTA scores: opt: 190, E(): 0.00015, (27.8% identity in 248 aa overlap); O06995|PGMB_BACSU|YVDM PUTATIVE BETA-PHOSPHOGLUCOMUTASE from Bacillus subtilis (226 aa), FASTA scores: opt: 190, E(): 0.00016, (33.9% identity in 245 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), FASTA scores: opt: 413, E(): 2e-17, (34.9% identity in 238 aa overlap). Interestingly, note that Rv3400 and Rv3401 are similar to beginning and end of Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c with approx. 270 aa missing from the middle. Protein product from Mb3433 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3433 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65070" /db_xref="InterPro:IPR006439" /db_xref="InterPro:IPR010976" /db_xref="InterPro:IPR023198" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="InterPro:IPR041492" /db_xref="UniProtKB/Swiss-Prot:P65070" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02062.1" /translation="MANWYRPNYPEVRSRVLGLPEKVRACLFDLDGVLTDTASLHTKA WKAMFDAYLAERAERTGEKFVPFDPAADYHTYVDGKKREDGVRSFLSSRAIEIPDGSP DDPGAAETVYGLGNRKNDMLHKLLRDDGAQVFDGSRRYLEAVTAAGLGVAVVSSSANT RDVLATTGLDRFVQQRVDGVTLREEHIAGKPAPDSFLRAAELLGVTPDAAAVFEDALS GVAAGRAGNFAVVVGINRTGRAAQAAQLRRHGADVVVTDLAELL" CDS 3774792..3777152 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3434" /product="conserved protein" /note="Mb3434, -, len: 786 aa. Equivalent to Rv3401, len: 786 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 786 aa overlap). Hypothetical conserved protein, may be an hydrolase or a transferase, equivalent to Q49736|ML0392|B1620_F1_30 HYPOTHETICAL 88.1 KDA PROTEIN from Mycobacterium leprae (792 aa), FASTA scores: opt: 4820, E(): 0, (91.45% identity in 782 aa overlap). Also highly similar to Q9L2I8|SCF42.31c PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (792 aa), FASTA scores: opt: 3060, E(): 2.9e-179, (59.25% identity in 785 aa overlap); and similar to others e.g. Q9K109|NMB0390 MALTOSE PHOSPHORYLASE from Neisseria meningitidis (serogroup B) (752 aa), FASTA scores: opt: 980, E(): 3.5e-52, (29.2% identity in 774 aa overlap); Q9JSW8|MAPA|NMA2098 PUTATIVE MALTOSE PHOSPHORYLASE (EC 2.4.1.8) from Neisseria meningitidis (serogroup A) (752 aa), FASTA scores: opt: 956, E(): 1e-50, (28.4% identity in 764 aa overlap); O06993|YVDK_BACSU HYPOTHETICAL 88.3 KDA PROTEIN (BELONGS TO FAMILY 65 OF GLYCOSYL HYDROLASES) from Bacillus subtilis (757 aa), FASTA scores: opt: 926, E(): 6.9e-49, (28.5% identity in 754 aa overlap); Q9CF04|MAPA MALTOSEPHOSPHORYLASE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (751 aa), FASTA scores: opt: 907, E(): 1e-47, (26.95% identity in 753 aa overlap); P77154|YCJT_ECOLI|B1316 HYPOTHETICAL 84.9 KDA PROTEIN (BELONGS TO FAMILY 65 OF GLYCOSYL HYDROLASES) from Escherichia coli strain K12 (755 aa), FASTA scores: opt: 392, E(): 2.9e-16, (27.5% identity in 774 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), (27.2% identity in 802 aa overlap); note that Rv3400 and Rv3401 are similar to beginning and end of Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c with approx. 270 aa missing from the middle. Protein product from Mb3434 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3434 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y442" /db_xref="InterPro:IPR005194" /db_xref="InterPro:IPR005195" /db_xref="InterPro:IPR005196" /db_xref="InterPro:IPR008928" /db_xref="InterPro:IPR011013" /db_xref="InterPro:IPR012341" /db_xref="InterPro:IPR017045" /db_xref="InterPro:IPR037018" /db_xref="UniProtKB/TrEMBL:A0A1R3Y442" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02063.1" /translation="MITEDAFPVEPWQVRETKLNLNLLAQSESLFALSNGHIGLRGNL DEGEPFGLPGTYLNSFYEIRPLPYAEAGYGYPEAGQTVVDVTNGKIFRLLVGDEPFDV RYGELISHERILDLRAGTLTRRAHWRSPAGKQVKVTSTRLVSLAHRSVAAIEYVVEAI EEFVRVTVQSELVTNEDVPETSADPRVSAILDRPLQAVEHERTERGALLMHRTRASAL MMAAGMEHEVEVPGRVEITTDARPDLARTTVICGLRPGQKLRIVKYLAYGWSSLRSRP ALRDQAAGALHGARYSGWQGLLDAQRAYLDDFWDSADVEVEGDPECQQAVRFGLFHLL QASARAERRAIPSKGLTGTGYDGHAFWDTEGFVLPVLTYTAPHAVADALRWRASTLDL AKERAAELGLEGAAFPWRTIRGQESSAYWPAGTAAWHINADIAMAFERYRIVTGDGSL EEECGLAVLIETARLWLSLGHHDRHGVWHLDGVTGPDEYTAVVRDNVFTNLMAAHNLH TAADACLRHPEAAEAMGVTTEEMAAWRDAADAANIPYDEELGVHQQCEGFTTLAEWDF EANTTYPLLLHEAYVRLYPAQVIKQADLVLAMQWQSHAFTPEQKARNVDYYERRMVRD SSLSACTQAVMCAEVGHLELAHDYAYEAALIDLHDLHRNTRDGLHMASLAGAWTALVV GFGGLRDDEGILSIDPQLPDGISRLRFRLRWRGFRLIVDANHTDVTFILGDGPGTQLT MRHAGQDLTLHTDTPSTIAVRTRKPLLPPPPQPPGREPVHRRALAR" CDS complement(3777880..3778932) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3436C" /product="conserved hypothetical protein" /note="Mb3436c, -, len: 350 aa. Equivalent to 5' end of Rv3402c, len: 412 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 350 aa overlap). Conserved hypothetical protein, probably involved in cell process, similar to various proteins generally involved in extracellular compounds (lipopolysaccharide O-antigen) biosynthesis e.g. O68392|RFBE PEROSAMINE SYNTHETASE from Brucella melitensis (367 aa), FASTA scores: opt: 420, E(): 1.2e-19, (26.15% identity in 375 aa overlap); Q9L6C1 3,4-DEHYDRATASE-LIKE PROTEIN from Streptomyces antibioticus (393 aa), FASTA scores: opt: 419, E(): 1.5e-19, (30.65% identity in 385 aa overlap); Q9RR26|OLENI DEHYDRATASE from Streptomyces antibioticus (393 aa), FASTA scores: opt: 416, E(): 2.3e-19, (30.65% identity in 385 aa overlap); O33942 ERYCIV PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (401 aa), FASTA scores: opt: 410, E(): 5.6e-19, (31.75% identity in 362 aa overlap); Q9UZI4|ASPB-LIKE1|PAB0774 ASPARTATE AMINOTRANSFERASE (ASPB-LIKE1) from Pyrococcus abyssi (366 aa), FASTA scores: opt: 402, E(): 1.7e-18, (27.05% identity in 377 aa overlap); O88001|WLBC PUTATIVE AMINO-SUGAR BIOSYNTHESIS PROTEIN from Bordetella bronchiseptica (Alcaligenes bronchisepticus) (366 aa), FASTA scores: opt: 394, E(): 5.6e-18, (26.8% identity in 347 aa overlap); Q45378|BPLC DNA FOR LIPOPOLYSACCHARIDE BIOSYNTHESIS from Bordetella pertussis (366 aa), FASTA scores: opt: 393, E(): 6.5e-18, (26.8% identity in 347 aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (c-g) leads to a shorter product. Mb3436c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWJ6" /db_xref="InterPro:IPR000653" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/Swiss-Prot:Q7TWJ6" /protein_id="SIU02064.1" /translation="MKIRTLSGSVLEPPSAVRATPGTSMLKLEPGGSTIPKIPFIRPS FPGPAELAEDFVQIAQANWYTNFGPNERRFARALRDYLGPHLHVATLANGTLALLAAL HVSFGAGTRDRYLLMPSFTFVGVAQAALWTGYRPWFIDIDANTWQPCVHSACAVIERF RDRIAGILLANVFGVGNPQISVWEELAAEWELPIVLDSAAGFGSTYADGERLGGRGAC EIFSFHATKPFAVGEGGALVSRDPRLVEHAYKFQNFGLVQTRESIQLGMNGKLSEISA AIGLRQLVGLDRRLASRRKVLECYRTGMADAGVRFQDNANVASLCFASACCTSADHKA AVLGSLRRHAIEARDY" CDS complement(3779303..3780904) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3437C" /product="HYPOTHETICAL PROTEIN" /note="Mb3437c, -, len: 533 aa. Equivalent to Rv3403c, len: 533 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 533 aa overlap). Hypothetical unknown protein, but some weak similarity to Q9KJP2 HYPOTHETICAL 54.9 KDA PROTEIN from Myxococcus xanthus (504 aa), FASTA scores: opt: 157, E(): 0.011, (24.1% identity in 548 aa overlap). Protein product from Mb3437c detected using SWATH mass spectrometry. Mb3437c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65072" /db_xref="InterPro:IPR036188" /db_xref="InterPro:IPR038732" /db_xref="UniProtKB/Swiss-Prot:P65072" /protein_id="SIU02065.1" /translation="MLAFPYLMTMITPPTFDVAFIGSGAACSMTLLEMADALLSSPSA SPKLRIAVVERDEQFWCGIPYGQRSSIGSLAIQKLDDFADEPEKAAYRIWLEQNKQRW LAFFQAEGGAAAARWICDNRDALDGNQWGELYLPRFLFGVFLSEQMIAAIAALGERDL AEIVTIRAEAMSAHSADGHYRIGLRPSGNGPTAIAAGKVVVAIGSPPTKAILASDSEP AFTYINDFYSPGGESNVARLRDSLDRVESWEKRNVLVVGSNATSLEALYLMRHDARIR ARVRSITVISRSGVLPYMICNQPPEFDFPRLRTLLCTEAIAAADLMSAIRDDLATAEE RSLNLADLYDAVAALFGQALHKMDLVQQEEFFCVHGMNFTKLVRRAGRDCRQASEELA ADGTLSLLAGEVLRVDACASGQPFATMTYRAAGAEHTHPVPFAAVVNCGGFEELDTCS SPFLVSAMQNGLCRPNRTNRGLLVNDDFEASPGFCVIGPLVGGNFTPKIRFWHVESAP RVRSLAKSLAASLLASLQPVALAPC" CDS complement(3780921..3781625) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3438C" /product="formyltransferase" /note="Mb3438c, -, len: 234 aa. Equivalent to Rv3404c, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Conserved hypothetical protein, some similarity to several METHIONYL-TRNA FORMYLTRANSFERASES e.g. BAB51418|MLL4854 from Rhizobium loti (Mesorhizobium loti) (317 aa), FASTA scores: opt: 210, E(): 1.7e-06, (27.55% identity in 178 aa overlap); P94463|FMT_BACSU from Bacillus subtilis (317 aa), FASTA scores: opt: 199, E(): 8.8e-06, (28.25% identity in 177 aa overlap); O51091||FMT_BORBU|BB0064 from Borrelia burgdorferi (Lyme disease spirochete) (312 aa), FASTA scores: opt: 187, E(): 5.2e-05, (30.2% identity in 192 aa overlap); etc. Protein product from Mb3438c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3438c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65074" /db_xref="InterPro:IPR002376" /db_xref="InterPro:IPR036477" /db_xref="InterPro:IPR040660" /db_xref="UniProtKB/Swiss-Prot:P65074" /protein_id="SIU02066.1" /translation="MTILILTDNVHAHALAVDLQARHGDMDVYQSPIGQLPGVPRCDV AERVAEIVERYDLVLSFHCKQRFPAALIDGVRCVNVHPGFNPYNRGWFPQVFSIIDGQ KVGVTIHEIDDQLDHGPIIAQRECAIESWDSSGSVYARLMDIERELVLEHFDAIRDGS YTAKSPATEGNLNLKKDFEQLRRLDLNERGTFGHFLNRLRALTHDDFRNAWFVDASGR KVFVRVVLEPEKPAEA" CDS complement(3781743..3782309) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3439C" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb3439c, -, len: 188 aa. Equivalent to Rv3405c, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 188 aa overlap). Possible transcriptional regulator, showing weak similarity to other bacterial regulatory proteins e.g. Q9KE70|BH0987 from Bacillus halodurans (203 aa), FASTA scores: opt: 168, E(): 0.0016, (34.8% identity in 92 aa overlap); Q9A5F7|CC2493 Caulobacter crescentus (204 aa), FASTA scores: opt: 160, E(): 0.0051, (32.6% identity in 89 aa overlap); Q9RDR0|SC4A7.02 from Streptomyces coelicolor (227 aa), FASTA scores: opt: 159, E(): 0.0064, (37.0% identity in 189 aa overlap); etc. Also some similarity to hypothetical Mycobacterium tuberculosis regulatory proteins e.g. O05858|Rv3208|MTCY07D11.18c, MTCI125_6, MTCY7D11_18, MTCY10G2_30; etc. Contains potential helix-turn-helix motif from aa 39-60 (+2.97 SD). Protein product from Mb3439c detected using SWATH mass spectrometry. Mb3439c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67443" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="UniProtKB/Swiss-Prot:P67443" /protein_id="SIU02067.1" /translation="MTTRPATDRRKMPTGREEVAAAILQAATDLFAERGPAATSIRDI AARSKVNHGLVFRHFGTKDQLVGAVLDHLGTKLTRLLHSEAPADIIERALDRHGRVLA RALLDGYPVGQLQQRFPNVAELLDAVRPRYDSDLGARLAVAHALALQFGWRLFAPMLR SATGIDELTGDELRLSVNDAVARILEPH" CDS 3782371..3783258 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3440" /product="PROBABLE DIOXYGENASE" /note="Mb3440, -, len: 295 aa. Equivalent to Rv3406, len: 295 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 295 aa overlap). Probable dioxygenase (EC 1.-.-.-), highly similar to Q9WWU|ATSK PUTATIVE ALPHA-KETOGLUTARATE DEPENDENT DIOXYGENASE from Pseudomonas putida (301 aa), FASTA scores: opt: 994, E(): 3.9e-57, (53.7% identity in 283 aa overlap); Q9I6U1|PA0193 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (300 aa), FASTA scores: opt: 1024, E(): 4.4e-59, (53.65% identity in 287 aa overlap); Q9HX81|TAUD|PA3935 TAURINE DIOXYGENASE from Pseudomonas aeruginosa (277 aa), FASTA scores: opt: 599, E(): 1.4e-31, (39.35% identity in 277 aa overlap); and similar to other dioxygenases e.g. AAG54718|TAUD (alias BAB33845|ECS0422) TAURINE DIOXYGENASE 2-OXOGLUTARATE-DEPENDENT from Escherichia coli strain O157:H7 (283 aa), FASTA scores: opt: 595, E(): 2.5e-31, (38.1% identity in 281 aa overlap); etc. BELONGS TO THE TFDA FAMILY OF DIOXYGENASES." /db_xref="GOA:P65076" /db_xref="InterPro:IPR003819" /db_xref="InterPro:IPR042098" /db_xref="UniProtKB/Swiss-Prot:P65076" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02068.1" /translation="MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKV VFFRGQHQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTDVT FAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWALHTNRYDYVT TKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLAGDFVRSFVGLDSHESRVL FEVLQRRITMPENTIRWNWAPGDVAIWDNRATQHRAIDDYDDQHRLMHRVTLMGDVPV DVYGQASRVISGAPMEIAG" CDS 3783293..3783592 /codon_start=1 /transl_table=11 /gene="vapb47" /locus_tag="BQ2027_MB3441" /product="possible antitoxin vapb47" /note="Mb3441, -, len: 99 aa. Equivalent to Rv3407, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Hypothetical protein, similar to other hypothetical proteins from M. tuberculosis strains H37Rv and CDC1551 e.g. AAK46285|MT2013 (90 aa), FASTA scores: opt: 160, E(): 0.00021, (37.1% identity in 89 aa overlap); O50412|Rv3385c|MTV004.43c (102 aa), FASTA scores: opt: 155, E(): 0.00051, (41.05% identity in 78 aa overlap), MTCY19H5.26, MTCY20H10.07, MTI376.09c, MTCY427.21, etc. Protein product from Mb3441 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3441 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR006442" /db_xref="InterPro:IPR036165" /db_xref="UniProtKB/Swiss-Prot:P65078" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02069.1" /translation="MRATVGLVEAIGIRELRQHASRYLARVEAGEELGVTNKGRLVAR LIPVQAAERSREALIESGVLIPARRPQNLLDVTAEPARGRKRTLSDVLNEMRDEQ" CDS 3783589..3783999 /codon_start=1 /transl_table=11 /gene="vapc47" /locus_tag="BQ2027_MB3442" /product="possible toxin vapc47. contains pin domain." /note="Mb3442, -, len: 136 aa. Equivalent to Rv3408, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 136 aa overlap). Hypothetical protein, similar to other hypothetical proteins from M. tuberculosis strains H37Rv and CDC1551 e.g. O50411|Rv3384c|MTV004.42c (130 aa), FASTA scores: opt: 243, E(): 1.7e-09, (35.1% identity in 131 aa overlap); P95252|Rv1962c|MTCY09F9.02 (135 aa), FASTA scores: opt: 191, E(): 5e-06, (35.5% identity in 138 aa overlap), etc. Protein product from Mb3442 detected using SWATH mass spectrometry. Mb3442 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y440" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y440" /protein_id="SIU02070.1" /translation="MIYMDTSALTKLLISEPETTELRTWLTAQSGQGEDAATSTLGRV ELMRVVARYGQPGQTERARYLLDGLDILPLTEPVIGLAETIGPATLRSLDAIHLAAAA QIKRELTAFVTYDHRLLSGCREVGFVTASPGAVR" CDS complement(3784032..3785768) /codon_start=1 /transl_table=11 /gene="choD" /locus_tag="BQ2027_MB3443C" /product="cholesterol oxidase chod (cholesterol-o2 oxidoreductase)" /note="Mb3443c, choD, len: 578 aa. Equivalent to Rv3409c, len: 578 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 578 aa overlap). Probable choD, cholesterol oxidase precursor (EC 1.1.3.6), equivalent to Q9CCV1|CHOD|ML0389 (alias Q59530|CHOD|B1620_C3_240) PUTATIVE CHOLESTEROL OXIDASE from Mycobacterium leprae (569 aa), FASTA scores: opt: 3510, E(): 3.8e-198, (88.6% identity in 569 aa overlap). Also highly similar to Q9L0H6|SCD63.13 PUTATIVE CHOLESTEROL OXIDASE from Streptomyces coelicolor (602 aa), FASTA scores: opt: 1101, E(): 5.2e-57, (60.05% identity in 586 aa overlap); and similar to other oxidoreductases e.g. Q9A7T6|CC1634 OXIDOREDUCTASE (GMC FAMILY) from Caulobacter crescentus (579 aa), FASTA scores: opt: 221, E(): 1.8e-05, (25.2% identity in 583 aa overlap). BELONGS TO THE GMC OXIDOREDUCTASES FAMILY. COFACTOR: FAD FLAVOPROTEIN. Contains PS00017 ATP/GTP-binding site motif A. Protein product from Mb3443c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3443c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y431" /db_xref="InterPro:IPR007867" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y431" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02071.1" /translation="MKPDYDVLIIGSGFGGSVTALRLTEKGYRVGVLEAGRRFSDEEF AKTSWDLRKFLWAPRLGCYGIQRIHPLRNVMILAGAGVGGGSLNYANTLYVPPEPFFA DQQWSHITDWRGELMPHYQQAQRMLGVVQNPTFTDADRIVKEVADEMGFGDTWVPTPV GVFFGPDGTKTPGKTVPDPYFGGAGPARTGCLECGCCMTGCRHGAKNTLVKNYLGLAE SAGAQVIPMTTVKGFERRSDGLWEVRTVRTGSWLRRDRRTFTATQLVLAAGTWGTQHL LFKMRDRGRLPGLSKRLGVLTRTNSESIVGAATLKVNPDLDLTHGVAITSSIHPTADT HIEPVRYGKGSNAMGLLQTLMTDGSGPQGTDVPRWRQLLQTASQDPRGTIRMLNPRQW SERTVIALVMQHLDNSITTFTKRGKLGIRWYSSKQGHGEPNPTWIPIGNQVTRRIAAK IDGVAGGTWGELFNIPLTAHFLGGAVIGDDPEHGVIDPYHRVYGYPTLYVVDGAAISA NLGVNPSLSIAAQAERAASLWPNKGETDRRPPQGEPYRRLAPIQPAHPVVPADAPGAL RWLPIDPVSNAG" CDS complement(3785824..3786951) /codon_start=1 /transl_table=11 /gene="guaB3" /locus_tag="BQ2027_MB3444C" /product="PROBABLE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE GUAB3 (IMP DEHYDROGENASE) (INOSINIC ACID DEHYDROGENASE) (INOSINATE DEHYDROGENASE) (IMP OXIDOREDUCTASE) (INOSINE-5'-MONOPHOSPHATE OXIDOREDUCTASE) (IMPDH) (IMPD)" /note="Mb3444c, guaB3, len: 375 aa. Equivalent to Rv3410c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 375 aa overlap). Probable guaB3, inosine-5'-monophosphate (IMP) dehydrogenase (EC 1.1.1.205), equivalent to Q49721|YY10_MYCLE|ML0388|B1620_C2_193 HYPOTHETICAL 38.9 KDA PROTEIN from Mycobacterium leprae (375 aa), FASTA scores: opt: 2182, E(): 9.5e-122, (90.6% identity in 373 aa overlap). Highly similar to Q9RHY9 GUAB ORF GENES FOR IMP DEHYDROGENASE, HYPOTHETICAL PROTEIN from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (376 aa), FASTA scores: opt: 1490, E(): 7.6e-81, (61.0% identity in 382 aa overlap); Q9L0I6|SCD63.03 PUTATIVE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1275, E(): 3.8e-68, (52.95% identity in 372 aa overlap); P73853|GUAB|SLR1722 IMP DEHYDROGENASE SUBUNIT from Synechocystis sp. strain PCC 6803 (387 aa), FASTA scores: opt: 882, E(): 6.7e-45, (41.3% identity in 373 aa overlap); and similar to other inosine-5'-monophosphate dehydrogenases e.g. P44334|IMDH_HAEIN|GUAB|HI0221 from Haemophilus influenzae (488 aa), FASTA scores: opt: 267, E(): 1.8e-08, (34.25% identity in 216 aa overlap); etc. Also highly similar to the C-terminus of Q50753|GUAA/B HOMOLOGY TO Mycobacterium leprae GUAA (FRAGMENT) from Mycobacterium tuberculosis (130 aa), FASTA scores: opt: 506, E(): 4.6e-23, (85.05% identity in 87 aa overlap). SIMILAR TO OTHER EUKARYOTIC AND PROKARYOTIC IMPDH AND TO GMP REDUCTASE. Protein product from Mb3444c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3444c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65171" /db_xref="InterPro:IPR001093" /db_xref="InterPro:IPR005992" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/Swiss-Prot:P65171" /protein_id="SIU02072.1" /translation="MVEIGMGRTARRTYELSEISIVPSRRTRSSKDVSTAWQLDAYRF EIPVVAHPTDALVSPEFAIELGRLGGLGVLNGEGLIGRHLDVEAKIAQLLEAAAADPE PSTAIRLLQELHAAPLNPDLLGAAVARIREAGVTTAVRVSPQNAQWLTPVLVAAGIDL LVIQGTIVSAERVASDGEPLNLKTFISELDIPVVAGGVLDHRTALHLMRTGAAGVIVG YGSTQGVTTTDEVLGISVPMATAIADAAAARRDYLDETGGRYVHVLADGDIHTSGELA KAIACGADAVVLGTPLAESAEALGEGWFWPAAAAHPSLPRGALLQIAVGERPPLARVL GGPSDDPFGGLNLVGGLRRSMAKAGYCDLKEFQKVGLTVGG" CDS complement(3786971..3788560) /codon_start=1 /transl_table=11 /gene="guaB2" /locus_tag="BQ2027_MB3445C" /product="PROBABLE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE GUAB2 (IMP DEHYDROGENASE) (INOSINIC ACID DEHYDROGENASE) (INOSINATE DEHYDROGENASE) (IMP OXIDOREDUCTASE) (INOSINE-5'-MONOPHOSPHATE OXIDOREDUCTASE) (IMPDH) (IMPD)" /note="Mb3445c, guaB2, len: 529 aa. Equivalent to Rv3411c, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 529 aa overlap). Probable guaB2, inosine-5'-monophosphate (IMP) dehydrogenase (EC 1.1.1.205), equivalent to Q49729|IMDH_MYCLE|GUAB|ML0387|B1620_C3_238 INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE from Mycobacterium leprae (529 aa), FASTA scores: opt: 3154, E(): 4.4e-165, (92.45% identity in 529 aa overlap). Highly similar to other inosine-5'-monophosphate dehydrogenases e.g. Q9RHZ0|GUAB from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (506 aa), FASTA scores: opt: 2284, E(): 1.5e-117, (67.9% identity in 505 aa overlap); Q9L0I7|SCD63.02 from Streptomyces coelicolor (501 aa), FASTA scores: opt: 2178, E(): 9e-112, (67.2% identity in 491 aa overlap); O67820|IMDH_AQUAE|GUAB|AQ_2023 from Aquifex aeolicus (490 aa), FASTA scores: opt: 1820, E(): 3.2e-92, (58.1% identity in 487 aa overlap); etc. Also similar to Q50716|YY10_MYCTU|Rv3410c|MT3518|MTCY78.18 HYPOTHETICAL 38.9 KDA PROTEIN from Mycobacterium tuberculosis (38.6% identity in 158 aa overlap). Contains PS00487 IMP dehydrogenase / GMP reductase signature. SIMILAR TO OTHER EUKARYOTIC AND PROKARYOTIC IMPDH AND TO GMP REDUCTASE. Protein product from Mb3445c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3445c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65168" /db_xref="InterPro:IPR000644" /db_xref="InterPro:IPR001093" /db_xref="InterPro:IPR005990" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR015875" /db_xref="UniProtKB/Swiss-Prot:P65168" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02073.1" /translation="MSRGMSGLEDSSDLVVSPYVRMGGLTTDPVPTGGDDPHKVAMLG LTFDDVLLLPAASDVVPATADTSSQLTKKIRLKVPLVSSAMDTVTESRMAIAMARAGG MGVLHRNLPVAEQAGQVEMVKRSEAGMVTDPVTCRPDNTLAQVDALCARFRISGLPVV DDDGALVGIITNRDMRFEVDQSKQVAEVMTKAPLITAQEGVSASAALGLLRRNKIEKL PVVDGRGRLTGLITVKDFVKTEQHPLATKDSDGRLLVGAAVGVGGDAWVRAMMLVDAG VDVLVVDTAHAHNRLVLDMVGKLKSEVGDRVEVVGGNVATRSAAAALVDAGADAVKVG VGPGSICTTRVVAGVGAPQITAILEAVAACRPAGVPVIADGGLQYSGDIAKALAAGAS TAMLGSLLAGTAEAPGELIFVNGKQYKSYRGMGSLGAMRGRGGATSYSKDRYFADDAL SEDKLVPEGIEGRVPFRGPLSSVIHQLTGGLRAAMGYTGSPTIEVLQQAQFVRITPAG LKESHPHDVAMTVEAPNYYAR" CDS 3788767..3789177 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3446" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3446, -, len: 136 aa. Equivalent to Rv3412, len: 136 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 136 aa overlap). Hypothetical protein, strongly similar only to Q49742|YY12_MYCLE|ML0386|B1620_F3_131 HYPOTHETICAL 15.3 KDA PROTEIN from Mycobacterium leprae (137 aa), FASTA scores: opt: 933, E(): 6.3e-52, (93.4% identity in 136 aa overlap). Protein product from Mb3446 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3446 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR035165" /db_xref="UniProtKB/Swiss-Prot:P65080" /protein_id="SIU02074.1" /translation="MRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEA DLADLAVYEALLAHKGIRGLVVCCDECQQDHYHDWDMLRSNLLQLLIDGTVRPHEPAY DPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR" CDS complement(3789187..3790086) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3447C" /product="unknown alanine and proline rich protein" /note="Mb3447c, -, len: 299 aa. Equivalent to Rv3413c, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). Hypothetical unknown ala-, pro-rich protein. Mb3447c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65082" /db_xref="InterPro:IPR031928" /db_xref="UniProtKB/Swiss-Prot:P65082" /protein_id="SIU02075.1" /translation="MREFGNPLGDRPPLDELARTDLLLDALAEREEVDFADPRDDALA ALLGQWRDDLRWPPASALVSQDEAVAALRAGVAQRRRARRSLAAVGSVAAALLVLSGF GAVVADARPGDLLYGLHAMMFNRSRVSDDQIVLSAKANLAKVEQMIAQGQWAEAQDEL AEVSSTVQAVTDGSRRQDLINEVNLLNTKVETRDPNATLRPGSPSNPAAPGSVGNSWT PLAPVVEPPTPPTPASAAEPSMSAGVSESPMPNSTSTVAASPSTPSSKPEPGSIDPSL EPADEATNPAGQPAPETPVSPTH" CDS complement(3790079..3790717) /codon_start=1 /transl_table=11 /gene="sigD" /locus_tag="BQ2027_MB3448C" /product="PROBABLE ALTERNATIVE RNA POLYMERASE SIGMA-D FACTOR SIGD" /note="Mb3448c, sigD, len: 212 aa. Equivalent to Rv3414c, len: 212 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 212 aa overlap). Probable sigD, alternative RNA polymerase sigma-D factor (see citation below), similar to others (notably from Streptomyces coelicolor) e.g. Q9L0I8|SCD63.01 from Streptomyces coelicolor (195 aa), FASTA scores: opt: 533, E(): 9.6e-28, (47.25% identity in 182 aa overlap); Q9FDS3|ADSA from Streptomyces griseus (258 aa), FASTA scores: opt: 223, E(): 1.8e-07, (28.95% identity in 183 aa overlap); BAB48649|MLL1224 from Rhizobium loti (Mesorhizobium loti) (187 aa), FASTA scores: opt: 202, E(): 3.2e-06, (30.4% identity in 194 aa overlap); P38133|RPOE_STRCO|SIGE|SCE94.07 from Streptomyces coelicolor (176 aa), FASTA scores: opt: 200, E(): 4.1e-06, (35.25% identity in 156 aa overlap); P37978|CNRH_ALCEU from Alcaligenes eutrophus (Ralstonia eutropha), FASTA scores: opt: 197, E(): 6.9e-06, (30.35% identity in 191 aa overlap); etc. C-terminus strongly similar to N-terminal part of Q49727|S1620B|B1620_C3_233 HYPOTHETICAL 6.2 KDA PROTEIN from Mycobacterium leprae (59 aa), FASTA scores: opt: 217, E(): 1.3e-07, (90.25% identity in 41 aa overlap). BELONGS TO THE SIGMA-70 FACTOR FAMILY. Protein product from Mb3448c detected using SWATH mass spectrometry. Mb3448c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66812" /db_xref="InterPro:IPR000838" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/Swiss-Prot:P66812" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02076.1" /translation="MVDPGVSPGCVRFVTLEISPSMTMQGERLDAVVAEAVAGDRNAL REVLETIRPIVVRYCRARVGTVERSGLSADDVAQEVCLATITALPRYRDRGRPFLAFL YGIAAHKVADAHRAAGRDRAYPAETLPERWSADAGPEQMAIEADSVTRMNELLEILPA KQREILILRVVVGLSAEETAAAVGSTTGAVRVAQHRALQRLKDEIVAAGDYA" CDS complement(3790735..3791562) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3449C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3449c, -, len: 275 aa. Equivalent to Rv3415c, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Conserved hypothetical protein, equivalent to Q9CCV3|ML0383 HYPOTHETICAL PROTEIN from Mycobacterium leprae (281 aa), FASTA scores: opt: 1278, E(): 4.2e-71, (73.5% identity in 279 aa overlap); and Q49858|B229_C1_175 HYPOTHETICAL 27.4 KDA PROTEIN from Mycobacterium leprae (264 aa), FASTA scores: opt: 1186, E(): 1.7e-65, (74.05% identity in 258 aa overlap). And C-terminus highly similar to N-terminal part of Q49726|B1620_C3_232 HYPOTHETICAL 12.9 KDA PROTEIN from Mycobacterium leprae (122 aa), FASTA scores: opt: 580, E(): 1.1e-28, (74.6% identity in 126 aa overlap). Also some similarity with P71677|RIBD_MYCTU|RIBG|Rv1409|MT1453|MTCY21B4.26 RIBOFLAVIN BIOSYNTHESIS PROTEIN R (339 aa), FASTA scores: opt: 143, E(): 0.13, (28.25% identity in 184 aa overlap). Protein product from Mb3449c detected using SWATH mass spectrometry. Mb3449c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011990" /db_xref="UniProtKB/TrEMBL:A0A1R3Y441" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02077.1" /translation="MNETPHAPVVEQVLVAAAFGNQPGSWPLPTAITPHHLWLRAVAA GGQGRYAHAYGDLSVLRRLVPAGPLASLAHSTQGSLLRQLGWHTLARGWDGRALALAG ADREAGADALIGLAADALGVGRFAAAGALLDRADPLVVSPLVADRLAVRRRWVAAELA MATGDGATAVRHAEEAVELTQAMAVASARHRVKSDVVLAAALCSAGAVARARAVGEEA LDATARFGLLPLRWALACLLIDIGTVTFSAQQLRELTKIRNICAGQVRRAGGCWRTA" CDS 3791933..3792241 /codon_start=1 /transl_table=11 /gene="whiB3" /locus_tag="BQ2027_MB3450" /standard_name="whmB" /product="transcriptional regulatory protein whib-like whib3. contains [4fe-4s] cluster." /note="Mb3450, whiB3, len: 102 aa. Equivalent to Rv3416, len: 102 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 102 aa overlap). whiB3 (alternate gene name: whmB), WhiB-like regulatory protein (see citations below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q49871|WHIB3|WHIB|ML0382|B229_F1_2|B1620_F3_137 PROBABLE TRANSCRIPTION FACTOR WHIB3 from Mycobacterium leprae (102 aa), FASTA scores: opt: 657, E(): 7.9e-39, (86.25% identity in 102 aa overlap). Also highly similar to Q9Z6E9|WHIB3 from Mycobacterium smegmatis (96 aa), FASTA scores: opt: 604, E(): 3.5e-35, (80.4% identity in 102 aa overlap); and O88103|WHID|SC6G4.45c|WBLB from Streptomyces coelicolor (112 aa), FASTA scores: opt: 437, E(): 1.4e-23, (62.5% identity in 96 aa overlap). Also similar to O05847|WHIB1|Rv3219|MTCY07D11.07c from Mycobacterium tuberculosis (84 aa), FASTA scores: opt: 215, E(): 2.5e-08, (44.45% identity in 81 aa overlap). Note that primer extension analysis revealed three transcriptional start sites and that expression from the three potential promoters is growth phase-dependent (see third citation). Mb3450 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWJ2" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/Swiss-Prot:Q7TWJ2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02078.1" /translation="MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQ REQRAKEMCRRCPVIEACRSHALEVGEPYGVWGGLSESERDLLLKGTMGRTRGIRRTA " CDS complement(3792313..3793932) /codon_start=1 /transl_table=11 /gene="groEL1" /locus_tag="BQ2027_MB3451C" /standard_name="cpn60_1" /product="60 KDA CHAPERONIN 1 GROEL1 (PROTEIN CPN60-1) (GROEL PROTEIN 1)" /note="Mb3451c, groEL1, len: 539 aa. Equivalent to Rv3417c, len: 539 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 539 aa overlap). groEL1 (alternate genbe name: cpn60_1), 60 kd chaperonin 1 (protein cpn60 1) (see citations below), equivalent to P37578|CH61_MYCLE|B1620_C3_228|GROL1|GROEL1|GROEL- 1|GROE1|ML0381|B229_ 60 KDA CHAPERONIN 1 from Mycobacterium leprae (537 aa), FASTA scores: opt: 2846, E(): 1.5e-154, (82.95% identity in 539 aa overlap). Also highly similar to others e.g. Q00767|CH61_STRAL|GROL1|GROEL1 from Streptomyces albus G (539 aa), FASTA scores: opt: 2130, E(): 8.1e-114, (61.9% identity in 541 aa overlap); P40171|CH61_STRCO|GROL1|GROEL1|SC6G4.40 from Streptomyces coelicolor (540 aa), FASTA scores: opt: 2119, E(): 3.4e-113, (61.8% identity in 542 aa overlap); etc. Also similar to P06806|CH62_MYCTU|Q48931|Rv0440|MTV037.04|GROL2 |GROEL2|GRO EL-2|HSP65 (62.2% identity in 527 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, PS00296 Chaperonins cpn60 signature. BELONGS TO THE CHAPERONIN (HSP60) FAMILY. Protein product from Mb3451c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3451c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A519" /db_xref="InterPro:IPR001844" /db_xref="InterPro:IPR002423" /db_xref="InterPro:IPR018370" /db_xref="InterPro:IPR027409" /db_xref="InterPro:IPR027410" /db_xref="InterPro:IPR027413" /db_xref="UniProtKB/Swiss-Prot:P0A519" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02079.1" /translation="MSKLIEYDETARRAMEVGMDKLADTVRVTLGPRGRHVVLAKAFG GPTVTNDGVTVAREIELEDPFEDLGAQLVKSVATKTNDVAGDGTTTATILAQALIKGG LRLVAAGVNPIALGVGIGKAADAVSEALLASATPVSGKTGIAQVATVSSRDEQIGDLV GEAMSKVGHDGVVSVEESSTLGTELEFTEGIGFDKGFLSAYFVTDFDNQQAVLEDALI LLHQDKISSLPDLLPLLEKVAGTGKPLLIVAEDVEGEALATLVVNAIRKTLKAVAVKG PYFGDRRKAFLEDLAVVTGGQVVNPDAGMVLREVGLEVLGSARRVVVSKDDTVIVDGG GTAEAVANRAKHLRAEIDKSDSDWDREKLGERLAKLAGGVAVIKVGAATETALKERKE SVEDAVAAAKAAVEEGIVPGGGASLIHQARKALTELRASLTGDEVLGVDVFSEALAAP LFWIAANAGLDGSVVVNKVSELPAGHGLNVNTLSYGDLAADGVIDPVKVTRSAVLNAS SVARMVLTTETVVVDKPAKAEDHDHHHGHAH" CDS complement(3794027..3794329) /codon_start=1 /transl_table=11 /gene="groES" /locus_tag="BQ2027_MB3452C" /standard_name="cpn10; mpt57" /product="10 KDA CHAPERONIN GROES (PROTEIN CPN10) (PROTEIN GROES) (BCG-A HEAT SHOCK PROTEIN) (10 KDA ANTIGEN)" /note="Mb3452c, groES, len: 100 aa. Equivalent to Rv3418c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). groES (alternate gene names: cpn10, mpt57), 10 kDa chaperonin (protein cpn10) (see citations below), equivalent to P24301|CH10_MYCLE|MOPB|GROES|CHPA|ML0380|B1620_C3_227|B229 _C3_247 from Mycobacterium leprae (99 aa), FASTA scores: opt: 568, E(): 2.1e-31, (89.9% identity in 99 aa overlap). And also strongly identical to others e.g. O86017|CH10_MYCAV|MOPB|GROES from Mycobacterium avium and Mycobacterium paratuberculosis (99 aa), FASTA scores: opt: 611, E(): 2.9e-34, (96.95% identity in 99 aa overlap); P15020|CH10_MYCBO|MOPB|GROES from Mycobacterium bovis (99 aa), FASTA scores: opt: 596, E(): 2.9e-33, (98.95% identity in 94 aa overlap); P40172|CH10_STRCO|GROES|SC6G4.39 from Streptomyces coelicolor and Streptomyces lividans (102 aa), FASTA scores: opt: 480, E(): 1.6e-25, (76.75% identity in 99 aa overlap); etc. Also identical to MSG10KAG_1, MT10KAG_1, MTBCGA_1. Contains PS00681 Chaperonins cpn10 signature. BELONGS TO THE GROES CHAPERONIN FAMILY. Protein product from Mb3452c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3452c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P15020" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR018369" /db_xref="InterPro:IPR020818" /db_xref="InterPro:IPR037124" /db_xref="UniProtKB/Swiss-Prot:P15020" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02080.1" /translation="MAKVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVV AVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGGTEIKYNGEEYLILSARDVLAVVSK" CDS complement(3794596..3795630) /codon_start=1 /transl_table=11 /gene="gcp" /locus_tag="BQ2027_MB3453C" /product="PROBABLE O-SIALOGLYCOPROTEIN ENDOPEPTIDASE GCP (GLYCOPROTEASE)" /note="Mb3453c, -, len: 344 aa. Equivalent to Rv3419c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Probable gcp, glycoprotease (EC 3.4.24.57), equivalent to P37969|GCP_MYCLE|GCP|ML0379|U229E|U1620c|B229_C3_246|B1620 _C3_226 PROBABLE GLYCOPROTEASE from Mycobacterium leprae (351 aa), FASTA scores: opt: 1898, E(): 2.4e-101, (86.1% identity in 345 aa overlap). Highly similar to others e.g. O86793|GCP_STRCO|GCP|SC6G4.30 from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1282, E(): 4.1e-66, (60.45% identity in 344 aa overlap); Q9WXZ2|TM0145 from Thermotoga maritima (327 aa), FASTA scores: opt: 867, E(): 1.9e-42, (45.4% identity in 337 aa overlap); P05852|GCP_ECOLI|B3064 from Escherichia coli strain K12 (337 aa), FASTA scores: opt: 838, E(): 9e-41, (46.55% identity in 346 aa overlap); etc. Shows some similarity to Q50707|YY21_MYCTU|Rv3421c|MTCY78.08 (33.9% identity in 127 aa overlap). Contains PS01016 Glycoprotease family signature. BELONGS TO PEPTIDASE FAMILY M22; ALSO KNOWN AS THE GLYCOPROTEASE FAMILY. Protein product from Mb3453c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3453c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65802" /db_xref="InterPro:IPR000905" /db_xref="InterPro:IPR017860" /db_xref="InterPro:IPR017861" /db_xref="InterPro:IPR022450" /db_xref="UniProtKB/Swiss-Prot:P65802" /protein_id="SIU02081.1" /translation="MTTVLGIETSCDETGVGIARLDPDGTVTLLADEVASSVDEHVRF GGVVPEIASRAHLEALGPAMRRALAAAGLKQPDIVAATIGPGLAGALLVGVAAAKAYS AAWGVPFYAVNHLGGHLAADVYEHGPLPECVALLVSGGHTHLLHVRSLGEPIIELGST VDDAAGEAYDKVARLLGLGYPGGKALDDLARTGDRDAIVFPRGMSGPADDRYAFSFSG LKTAVARYVESHAADPGFRTADIAAGFQEAVADVLTMKAVRAATALGVSTLLIAGGVA ANSRLRELATQRCGEAGRTLRIPSPRLCTDNGAMIAAFAAQLVAAGAPPSPLDVPSDP GLPVMQGQVR" CDS complement(3795627..3796103) /codon_start=1 /transl_table=11 /gene="rimI" /locus_tag="BQ2027_MB3454C" /product="ribosomal-protein-alanine acetyltransferase rimi (acetylating enzyme for n-terminal of ribosomal protein s18)" /note="Mb3454c, rimI, len: 158 aa. Equivalent to Rv3420c, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Probable rimI, ribosomal-protein-alanine acetyltransferase (EC 2.3.1.128), equivalent to C-terminal part of Q49857|YY21_MYCLE|ML0378|B229_C1_170 HYPOTHETICAL 38.0 KDA PROTEIN from Mycobacterium leprae (359 aa), FASTA scores: opt: 772, E(): 2.7e-44, (72.1% identity in 154 aa overlap). Similar notably to ribosomal-protein-alanine acetyltransferases e.g. Q9AC11|CC0058 from Caulobacter crescentus (150 aa), FASTA scores: opt: 223, E(): 4.9e-08, (37.5% identity in 136 aa overlap); Q9KFD4|BH0547 from Bacillus halodurans (151 aa), FASTA scores: opt: 222, E(): 5.8e-08, (35.2% identity in 142 aa overlap); Q9PG61|XF0441 from Xylella fastidiosa (156 aa), FASTA scores: opt: 207, E(): 5.9e-07, (32.2% identity in 149 aa overlap); Q9HVB7|RIMI|PA4678 from Pseudomonas aeruginosa (150 aa), FASTA scores: opt: 203, E(): 1.1e-06, (32.45% identity in 151 aa overlap); P09453|RIMI_ECOLI|B4373 from Escherichia coli strain K12 (148 aa), FASTA scores: opt: 196, E(): 3.1e-06, (33.55% identity in 149 aa overlap); etc. BELONGS TO THE ACETYLTRANSFERASE FAMILY, RIMI SUBFAMILY. Protein product from Mb3454c detected using SWATH mass spectrometry. Mb3454c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y448" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR006464" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:A0A1R3Y448" /protein_id="SIU02082.1" /translation="MTADTEPVTIGALTRADAQRCAELEAQLFVGDDPWPPAAFNREL ASPHNHYVGARSGGTLVGYAGISRLGRTPPFEYEVHTIGVDPAYQGRGIGRRLLRELL DFARGGVVYLEVRTDNDAALALYRSVGFQRVGLRRRYYRVSGADAYTMRRDSGDPS" CDS complement(3796100..3796735) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3455C" /product="tRNA threonylcarbamoyladenosine biosynthesis protein TsaB" /note="Mb3455c, -, len: 211 aa. Equivalent to Rv3421c, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Conserved hypothetical protein, equivalent to Q49857|YY21_MYCLE|ML0378|B229_C1_170 HYPOTHETICAL 38.0 KDA PROTEIN from Mycobacterium leprae (359 aa), FASTA scores: opt: 1000, E(): 1.8e-50, (75.6% identity in 205 aa overlap). Also similar to other hypothetical bacterial proteins e.g. O86791|SC6G4.28 from Streptomyces coelicolor (217 aa), FASTA scores: opt: 453, E(): 3.3e-19, (48.1% identity in 212 aa overlap); Q9AC10|CC0059 (GLYCOPROTEASE FAMILY PROTEIN) from Caulobacter crescentus (211 aa), FASTA scores: opt: 248, E(): 2e-07, (34.3% identity in 210 aa overlap); Q9KQK9|VC1989 from Vibrio cholerae (237 aa), FASTA scores: opt: 238, E(): 8.2e-07, (28.85% identity in 208 aa overlap); BAB51966|Mlr5530 from Rhizobium loti (Mesorhizobium loti) (225 aa), FASTA scores: opt: 237, E(): 9e-07, (35.0% identity in 220 aa overlap); etc. Some similarity to upstream Q50709|GCP_MYCTU|Rv3419c|MT3528|MTCY78.10 from M. tuberculosis (344 aa), (33.9% identity in 127 aa overlap). Mb3455c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65084" /db_xref="InterPro:IPR000905" /db_xref="InterPro:IPR022496" /db_xref="UniProtKB/Swiss-Prot:P65084" /protein_id="SIU02083.1" /translation="MSRVQISTVLAIDTATPAVTAGIVRRHDLVVLGERVTVDARAHA ERLTPNVLAALADAALTMADLDAVVVGCGPGPFTGLRAGMASAAAYGHALGIPVYGVC SLDAIGGQTIGDTLVVTDARRREVYWARYCDGIRTVGPAVNAAADVDPGPALAVAGAP EHAALFALPCVEPSRPSPAGLVAAVNWADKPAPLVPLYLRRPDAKPLAVCT" CDS complement(3796732..3797238) /codon_start=1 /transl_table=11 /gene="tsaE" /locus_tag="BQ2027_MB3456C" /product="tRNA threonylcarbamoyladenosine biosynthesis protein TsaE" /note="Mb3456c, -, len: 168 aa. Equivalent to Rv3422c, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Conserved hypothetical protein, equivalent to Q49864|YY22_MYCLE|ML0377|U229F|B229_C2_205 HYPOTHETICAL 17.6 KDA PROTEIN from Mycobacterium leprae (161 aa), FASTA scores: opt: 752, E(): 8.3e-38, (77.4% identity in 146 aa overlap). Also similar to other hypothetical bacterial proteins e.g. O86788|YJEE_STRCO|SC6G4.25 from Streptomyces coelicolor (148 aa), FASTA scores: opt: 377, E(): 1.2e-15, (50.85% identity in 120 aa overlap); Q9X1W7|TM1632 from Thermotoga maritima (161 aa), FASTA scores: opt: 247, E(): 6.2e-08, (39.4% identity in 137 aa overlap); Q9RRY1|DR2351 from Deinococcus radiodurans (148 aa), FASTA scores: opt: 236, E(): 2.6e-07, (38.6% identity in 127 aa overlap); etc. Contains PS00017 ATP /GTP-binding site motif A. Protein product from Mb3456c detected using SWATH mass spectrometry. Mb3456c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67172" /db_xref="InterPro:IPR003442" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P67172" /protein_id="SIU02084.1" /translation="MSREGIRRRPKARAGLTGGGTATLPRVEDTLTLGSRLGEQLCAG DVVVLSGPLGAGKTVLAKGIAMAMDVEGPITSPTFVLARMHRPRRPGTPAMVHVDVYR LLDHNSADLLSELDSLDLDTDLEDAVVVVEWGEGLAERLSQRHLDVRLERVSHSDTRI ATWSWGRS" CDS complement(3797235..3798461) /codon_start=1 /transl_table=11 /gene="alr" /locus_tag="BQ2027_MB3457C" /product="alanine racemase alr" /note="Mb3457c, alr, len: 408 aa. Equivalent to Rv3423c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 408 aa overlap). Probable alr, alanine racemase (EC 5.1.1.1), equivalent to P38056|ALR_MYCLE|ML0375|B229_C3_243 ALANINE RACEMASE from Mycobacterium leprae (388 aa), FASTA scores: opt: 2160, E(): 2.3e-124, (84.35% identity in 384 aa overlap). Also highly similar to other alanine racemases e.g. Q9L888|ALR_MYCAV from Mycobacterium avium (391 aa), FASTA scores: opt: 2103, E(): 6.8e-121, (83.6% identity in 384 aa overlap); P94967|ALR_MYCSM from M. smegmatis (389 aa), FASTA scores: opt: 1721, E(): 1.3e-97, (67.25% identity in 385 aa overlap); O86786|ALR_STRCO|SC6G4.23 from Streptomyces coelicolor (391 aa), FASTA scores: opt: 1041, E(): 3.7e-56, (47.65% identity in 380 aa overlap); etc. BELONGS TO THE ALANINE RACEMASE FAMILY. Protein product from Mb3457c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3457c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4X3" /db_xref="InterPro:IPR000821" /db_xref="InterPro:IPR001608" /db_xref="InterPro:IPR009006" /db_xref="InterPro:IPR011079" /db_xref="InterPro:IPR020622" /db_xref="InterPro:IPR029066" /db_xref="UniProtKB/Swiss-Prot:P0A4X3" /protein_id="SIU02085.1" /translation="MKRFWENVGKPNDTTDGRGTTSLAMTPISQTPGLLAEAMVDLGA IEHNVRVLREHAGHAQLMAVVKADGYGHGATRVAQTALGAGAAELGVATVDEALALRA DGITAPVLAWLHPPGIDFGPALLADVQVAVSSLRQLDELLHAVRRTGRTATVTVKVDT GLNRNGVGPAQFPAMLTALRQAMAEDAVRLRGLMSHMVYADKPDDSINDVQAQRFTAF LAQAREQGVRFEVAHLSNSSATMARPDLTFDLVRPGIAVYGLSPVPALGDMGLVPAMT VKCAVALVKSIRAGEGVSYGHTWIAPRDTNLALLPIGYADGVFRSLGGRLEVLINGRR CPGVGRICMDQFMVDLGPGPLDVAEGDEAILFGPGIRGEPTAQDWADLVGTIHYEVVT SPRGRITRTYREAENR" CDS complement(3798755..3799117) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3458C" /product="HYPOTHETICAL PROTEIN" /note="Mb3458c, -, len: 120 aa. Equivalent to Rv3424c, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 120 aa overlap). Hypothetical unknown protein. Mb3458c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/Swiss-Prot:P65086" /protein_id="SIU02086.1" /translation="MPNPVTMLYGRKADLVILPHVLAEERPHPYSTPGRKRGAQIALT TGIDALASFAPQIVNPRHGLSRVVQCLGGCENKRHAYFRSISKTPHIRARGVPSVCAV RTVGVDGAKRPPKPIPVQ" CDS 3799280..3799816 /codon_start=1 /transl_table=11 /gene="PPE57" /locus_tag="BQ2027_MB3459" /product="ppe family protein ppe57" /note="Mb3459, PPE57, len: 178 aa. Equivalent to 5' end of Rv3425, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (90.9% identity in 176 aa overlap). Member of the Mycobacterium tuberculosis PPE family, similar to many e.g. O06246|Rv3429|MTCY77.01 (178 aa), FASTA scores: opt: 781, E(): 7e-44, (69.9% identity in 176 aa overlap); and downstream Q50702|YY26_MYCTU|Rv3426|MTCY78.03c (232 aa), FASTA scores: opt: 517, E(): 1.2e-26, (68.0% identity in 125 aa overlap); MTV049_11, MTCY428_16, MTV049_22, MTV049_30, MTCY261_4; etc. Rv3429: Member of the M. tuberculosis PPE family, similar to many e.g. the upstream Q50703|YY25_MYCTU|Rv3425|MTCY78.04c (176 aa), FASTA scores: opt: 781, E(): 1.9e-44, (69.9% identity in 176 aa overlap); and Q50702|YY26_MYCTU|Rv3426|MTCY78.03c (232 aa), FASTA scores: opt: 555, E(): 1.7e-29, (72.0% identity in 125 aa overlap) (but diverges at 3' end)); etc. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large deletion of 4926 bp (RD6) leads to the loss of the COOH part of PPE57, and the following CDSs, PPE58, Rv3427c, Rv3428c and PPE59 compared to Mycobacterium tuberculosis strain H37Rv. Mb3459 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y451" /protein_id="SIU02087.1" /translation="MHPMIPAEYISNIIYEGPGADSLFFASGQLRELAYSVETTAESL EDELDELDENWKGSSSDLLADAVERYLQWLSKHSSQLKHAAWVINGLANAYNDTRRKV VPPEEIAANREEVHRLIASNVAGVNTPAIAGLDAQYQQYRAQNIAVMNDYQSTARFIL AYLPRWQEPPQIYGGGGG" CDS complement(3799757..3800920) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3460C" /product="POSSIBLE TRANSPOSASE" /note="Mb3460c, -, len: 387 aa. Equivalent to Rv3430c,IS1540-2, len: 387 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 387 aa overlap). Possible IS1540 transposase, similar to several e.g. Q49592 transposase from Mycobacterium intracellulare (340 aa), FASTA scores: opt: 1377, E(): 1.6e-81, (64.2% identity in 338 aa overlap); similarity is lost at C-terminus due to possible frameshift after aa 297." /db_xref="GOA:A0A1R3Y4B4" /db_xref="InterPro:IPR001584" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR036397" /db_xref="InterPro:IPR038965" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B4" /protein_id="SIU02088.1" /translation="MIDTAIEEMIPLIGVRAACAATGRAPASYYRAHSKRLSAQSDTF TSTAVTDPSGPRESAQPRALSAAEREHVLAVLNSQRFADMAPAVVYATLLDEGIYLCS ESTMYRLLRERGQTGDRRRQATHPAAVKPELVAHQPNSVWSWDITKLRGPAKWSYYYL YVILDIFSRYVVGWMVASRESKVLAERLIAQTLAAQHISADQLTLHADRGSSMSSKPV ALLLADLGVTKSHSHPHTSNDNPLSEAQFKTLKYRPDFPKRFESIEAARVHCDRFFGW YNHEHKHSGIGLHTPADVHYGRADQIRRHRATVLDTAYRDHLERIRSQTTRATRATGL QRDQPTTEGGPADSINPRKSCLRNVDRFRPGLLDLPAPAPVDLRRLLPSGQIR" mobile_element complement(3799757..3800920) /mobile_element_type="insertion sequence:IS1540" /locus_tag="BQ2027_IS1540-2" /note="IS1540-2, len: 1163 nt. Equivalent to IS1540, len: 1163 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 1163 nt overlap)." CDS complement(3801409..3802254) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3461C" /product="possible transposase (fragment)" /note="Mb3461c, -, len: 281 aa. Equivalent to Rv3431c, len: 281 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 281 aa overlap). Possible truncated transposase for IS1552, similar to, but shorter than other transposases e.g. P72303 from Rhodococcus opacus (418 aa), FASTA scores: opt: 1509, E(): 1.2e-91, (80.95% identity in 278 aa overlap); Q9AKV5 from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 1115, E(): 7.8e-66, (63.45% identity in 268 aa overlap); etc. Mb3461c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y489" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/TrEMBL:A0A1R3Y489" /protein_id="SIU02089.1" /translation="MFAELIRAGLQALIEAEATEAIGAGRYERSDGRIVHRNGHRPKT VSTTAGDIEVQIPKLRAGSFFPSLLERRRRIDKALHAVIMEAYVHGVSTRSVDDLVAA MGVQAGVSKSEVSRICAGLDTEIEAFRTRSLTHTEFPYVFCDATFCKVRVGAHVVSQA LVVATGVSIDGTREVLGTAVGDSESYEFWREFLASLKARGLTGVHLVISDAHAGLKAA VAQQFSGASWQRCRVHFMRNLYTAVAAKHAPAVTVAVKTIFAHTDPEEVGAQWDRVAD PLCQP" mobile_element complement(3801411..3802255) /mobile_element_type="insertion sequence:IS1552" /locus_tag="BQ2027_IS1552" /note="IS1552, len: 845 nt. Equivalent to IS1552, len: 845 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 845 nt overlap)." gene complement(3801411..3802255) /locus_tag="BQ2027_IS1552" CDS complement(3802487..3803869) /codon_start=1 /transl_table=11 /gene="gadB" /locus_tag="BQ2027_MB3462C" /product="PROBABLE GLUTAMATE DECARBOXYLASE GADB" /note="Mb3462c, gadB, len: 460 aa. Equivalent to Rv3432c, len: 460 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 460 aa overlap). Probable gadB, glutamate decarboxylase (EC 4.1.1.15), similar to many e.g. P73043|GAD|SLL1641 from Synechocystis sp. strain PCC 6803 (467 aa), FASTA scores: opt: 1684, E(): 6.2e-99, (55.35% identity in 457 aa overlap); Q9X8J5|SCE9.23 from Streptomyces coelicolor (475 aa), FASTA scores: opt: 1650, E(): 8.9e-97, (57.4% identity in 446 aa overlap); Q9AQU4|GAD from Oryza sativa (Rice) (501 aa), FASTA scores: opt: 1498, E(): 3.7e-87, (51.6% identity in 432 aa overlap); Q07346|DCE_PETHY from Petunia hybrida (Petunia) (500 aa), FASTA scores: opt: 1485, E(): 2.5e-86, (51.15% identity in 437 aa overlap); etc. BELONGS TO GROUP II DECARBOXYLASES (DDC, GAD, HDC AND TYRDC). Protein product from Mb3462c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3462c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y460" /db_xref="InterPro:IPR002129" /db_xref="InterPro:IPR010107" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y460" /protein_id="SIU02090.1" /translation="MSRSHPSVPAHSIAPAYTGRMFTAPVPALRMPDESMDPEAAYRF IHDELMLDGSSRLNLATFVTTWMDPEAEKLMAETFDKNMIDKDEYPATAAIEARCVSM VADLFHAEGLRDHDPTSATGVSTIGSSEAVMLGGLALKWRWRQRVGSWKGRMPNLVMG SNVQVVWEKFCRYFDVEPRYLPMERGRYVITPEQVLAAVDENTIGVVAILGTTYTGEL EPIAEICAALDKLAAGGGVDVPVHVDAASGGFVVPFLHPDLVWDFRLPRVVSINVSGH KYGLTYPGVGFVVWRGPEHLPEDLVFRVNYLGGDMPTFTLNFSRPGNQVVGQYYNFLR LGRDGYTKVMQALSHTARWLGDQLREVDHCEVISDGSAIPVVSFRLAGDRGYTEFDVS HELRTFGWQVPAYTMPDNATDVAVLRIVVREGLSADLARALHDDAVTALAALDKVKPG GHFDAQHFAH" CDS complement(3803907..3805328) /codon_start=1 /transl_table=11 /gene="nnr" /locus_tag="BQ2027_MB3463C" /product="NAD(P)H-hydrate epimerase (EC / ADP-dependent (S)-NAD(P)H-hydrate dehydratase (EC" /EC_number="5.1.99.6" /EC_number="4.2.1.136" /note="Mb3463c, -, len: 473 aa. Equivalent to Rv3433c, len: 473 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 473 aa overlap). Hypothetical protein, member of YKL151c/yjeF family, equivalent to P37391|YY33_MYCLE|ML0373|U229G|B229_C2_201 HYPOTHETICAL 47.2 KDA PROTEIN from Mycobacterium leprae (473 aa), FASTA scores: opt: 2650, E(): 5e-136, (84.55% identity in 473 aa overlap). Also similar to other hypothetical bacterial proteins e.g. Q9X3W3 from Zymomonas mobilis (484 aa), FASTA scores: opt: 700, E(): 1.2e-30, (33.7% identity in 484 aa overlap); O86783|SC6G4.20c from Streptomyces coelicolor (485 aa), FASTA scores: opt: 563, E(): 3.2e-23, (48.45% identity in 489 aa overlap); Q9LC81 from Arthrobacter sp. Q36 (313 aa), FASTA scores: opt: 553, E(): 7.9e-23, (44.2% identity in 303 aa overlap); etc. Contains PS01049 Hypothetical YKL151c/yjeF family signature 1, PS01050 Hypothetical YKL151c/yjeF family signature 2. Protein product from Mb3463c detected using SWATH mass spectrometry. Mb3463c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y479" /db_xref="InterPro:IPR000631" /db_xref="InterPro:IPR004443" /db_xref="InterPro:IPR017953" /db_xref="InterPro:IPR029056" /db_xref="InterPro:IPR030677" /db_xref="InterPro:IPR036652" /db_xref="UniProtKB/TrEMBL:A0A1R3Y479" /protein_id="SIU02091.1" /translation="MRHYYSVDTIRAAEAPLLASLPDGALMRRAAFGLATEIGRELTA RTGGVVGRRVCAVVGSGDNGGDALWAATFLRRRGAAADAVLLNPDRTHRKALAAFTKS GGRLVESVSAATDLVIDGVVGISGSGPLRPAAAQVFAAVQAAAIPVVAVDIPSGIDVA TGAITGPAVHAALTVTFGGLKPVHALADCGRVVLVDIGLDLAHTDVLGFEATDVAARW PVPGPRDDKYTQGVTGVLAGSSTYPGAAVLCTGAAVAATSGMVRYAGTAHAEVLAHWP EVIASPTPAAAGRVQAWVVGPGLGTDEAGAAALWFALDTDLPVLVDADGLTMLADHPD LVAGRNAPTVLTPHAGEFARLAGAPPGDDRVGACRQLADALGATVLLKGNVTVIADPG GPVYLNPAGQSWAATAGSGDVLSGMIGALLASGLPSGEAAAAAAFVHARAAAAAAADP GPGDAPTSASRISGHIRAALAAL" CDS complement(3805330..3806043) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3464C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3464c, -, len: 237 aa. Equivalent to Rv3434c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 237 aa overlap). Possible conserved transmembrane protein, showing some similarity with Q9CGH7|YLDB HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (258 aa), FASTA scores: opt: 248, E(): 1.6e-09, (28.8% identity in 198 aa overlap); and P94983|Rv1648|MTCY06H11.13 from Mycobacterium tuberculosis (268 aa), FASTA scores: opt: 205, E(): 1.2e-06, (31.45% identity in 194 aa overlap). Mb3464c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y482" /db_xref="UniProtKB/TrEMBL:A0A1R3Y482" /protein_id="SIU02092.1" /translation="MADASVVARLRSWALAVWHFVSNAPLTYAWLVVLVITTIIQNNL TGSQLHFVLLHRSTNIAELGRDPLEVLFSSLLWIDGRNLEPYLLLFTLFLAPAEHWLG HLRWLTVGLTAHIGATYLSEGLLYLAIQHRDASERMVHARDIGVSYFLVGVMAVLTYH IAKPWRWGYLGVLLVIFGFPLIAMDKAELDFTTVGHFASILIGLLFYPMARERDGRLW NPARIKSLLHRRGTRGRRA" CDS complement(3806054..3806908) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3465C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3465c, -, len: 284 aa. Equivalent to Rv3435c, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). Probable conserved transmembrane protein, showing some similarity with P95061|Rv0713|MTCY210.32 HYPOTHETICAL 33.9 KDA PROTEIN from Mycobacterium tuberculosis (313 aa), FASTA scores: opt: 557, E(): 1.3e-26, (35.8% identity in 282 aa overlap); and O32991|MLCB2492.12 from Mycobacterium leprae (95 aa), FASTA scores: opt: 150, E(): 0.022, (35.3% identity in 85 aa overlap). Equivalent to AAK47881 from Mycobacterium tuberculosis strain CDC1551 (312 aa) but shorter 28 aa. Mb3465c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y463" /db_xref="InterPro:IPR027948" /db_xref="UniProtKB/TrEMBL:A0A1R3Y463" /protein_id="SIU02093.1" /translation="MGRILRVVVGLVLVIAAYVTVIALYHSTGLGRPHEVAHGRPTAD GTTVTLHVEQLQTIKGVLVANLAVSPGTELLDSQTQGLKDDLTVTVTSVVTPTKRTWS SGSLPGVFPVPLTISGDPANWPFDHYRSGPITVQLYRGAAHAPERVSVTFVDRLPGWN VDISGVGDANVPAPYRVGLHRSPSSVAFGTVIVGVLIALAGVGLFVAVQTARGRRQFQ PPMTTWYAAMLFAVIPLRNALPDAPPIGFWIDVTVVLWVVVALVTSMVLYILCWWWHL KPDVDETM" CDS complement(3807130..3809004) /codon_start=1 /transl_table=11 /gene="glmS" /locus_tag="BQ2027_MB3466C" /product="PROBABLE GLUCOSAMINE--FRUCTOSE-6-PHOSPHATE AMINOTRANSFERASE [ISOMERIZING] GLMS (HEXOSEPHOSPHATE AMINOTRANSFERASE) (D-FRUCTOSE-6-PHOSPHATE AMIDOTRANSFERASE) (GFAT) (L-GLUTAMINE-D-FRUCTOSE-6-PHOSPHATE AMIDOTRANSFERASE) (GLUCOSAMINE-6-PHOSPHATE SYNTHASE)" /note="Mb3466c, glmS, len: 624 aa. Equivalent to Rv3436c, len: 624 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 624 aa overlap). Probable glmS, glucosamine--fructose-6-phosphate aminotransferase (EC 2.6.1.16), equivalent to P40831|GLMS_MYCLE|ML0371|B229_C3_238 GLUCOSAMINE--FRUCTOSE-6-PHOSPHATE AMINOTRANSFERASE [ISOMERIZING] from Mycobacterium leprae (623 aa), FASTA scores: opt: 3584, E(): 4.7e-214, (89.3% identity in 627 aa overlap). Also highly similar to others e.g. O68956|GLMS_MYCSM from Mycobacterium smegmatis (627 aa), FASTA scores: opt: 3517, E(): 6.5e-210, (87.25% identity in 627 aa overlap); O86781|GLMS_STRCO|SC6G4.18 from Streptomyces coelicolor (614 aa), FASTA scores: opt: 2364, E(): 1.3e-138, (64.95% identity in 625 aa overlap); Q9K1P9|NMB0031 from Neisseria meningitidis (serogroup B) and Q9JWN9|GLMS|NMA0276 from Neisseria meningitidis (serogroup A) (612 aa), FASTA scores: opt: 1445, E(): 8.4e-82, (43.55% identity in 627 aa overlap); etc. BELONGS TO THE TYPE-2 GATASE DOMAIN IN THE N-TERMINAL SECTION. BELONGS TO THE SIS FAMILY, GLMS SUBFAMILY, IN THE C-TERMINAL SECTION. Protein product from Mb3466c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3466c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A589" /db_xref="InterPro:IPR001347" /db_xref="InterPro:IPR005855" /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR029055" /db_xref="InterPro:IPR035466" /db_xref="InterPro:IPR035490" /db_xref="UniProtKB/Swiss-Prot:P0A589" /protein_id="SIU02094.1" /translation="MCGIVGYVGRRPAYVVVMDALRRMEYRGYDSSGIALVDGGTLTV RRRAGRLANLEEAVAEMPSTALSGTTGLGHTRWATHGRPTDRNAHPHRDAAGKIAVVH NGIIENFAVLRRELETAGVEFASDTDTEVAAHLVARAYRHGETADDFVGSVLAVLRRL EGHFTLVFANADDPGTLVAARRSTPLVLGIGDNEMFVGSDVAAFIEHTREAVELGQDQ AVVITADGYRISDFDGNDGLQAGRDFRPFHIDWDLAAAEKGGYEYFMLKEIAEQPAAV ADTLLGHFVGGRIVLDEQRLSDQELREIDKVFVVACGTAYHSGLLAKYAIEHWTRLPV EVELASEFRYRDPVLDRSTLVVAISQSGETADTLEAVRHAKEQKAKVLAICNTNGSQI PRECDAVLYTRAGPEIGVASTKTFLAQIAANYLLGLALAQARGTKYPDEVEREYHELE AMPDLVARVIAATGPVAELAHRFAQSSTVLFLGRHVGYPVALEGALKLKELAYMHAEG FAAGELKHGPIALIEDGLPVIVVMPSPKGSATLHAKLLSNIREIQTRGAVTIVIAEEG DETVRPYADHLIEIPAVSTLLQPLLSTIPLQVFAASVARARGYDVDKPRNLAKSVTVE " CDS 3809026..3809502 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3467" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3467, -, len: 158 aa. Equivalent to Rv3437, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 158 aa overlap). Possible conserved transmenbrane protein, C-terminus similar to N-terminal part of O06345|Rv3482c|MTCY13E12.35c HYPOTHETICAL 28.5 KDA PROTEIN from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 140, E(): 0.1, (58.8% identity in 34 aa overlap); and Q9XAN5|SC4C6.05c PUTATIVE MEMBRANE PROTEIN from Streptomyces (347 aa), coelicolor FASTA scores: opt: 112, E(): 6.8, (50.0% identity in 32 aa overlap). Questionable ORF. Protein product from Mb3467 detected using SWATH mass spectrometry. Mb3467 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5V3" /db_xref="InterPro:IPR018929" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5V3" /protein_id="SIU02095.1" /translation="MVGRAVPSPNRRYRRVWPPRTKGQHLSNPYAQHQLKLIRHTGAL ILWQQRTYVVSGTREQCEAAYKSAQTYNLLVGWWSLVSLPAMNWIALISNFNAIRRVR AAADGASVPHGPHAIAHPAVPRGPIPAGWYPDPSGAGLRYWDGATWTHWTHPPRHR" CDS 3809512..3810354 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3468" /product="conserved protein" /note="Mb3468, -, len: 280 aa. Equivalent to Rv3438, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Conserved hypothetical protein, equivalent to Q9CCV6|ML0370 HYPOTHETICAL PROTEIN from Mycobacterium leprae (289 aa), FASTA scores: opt: 1491, E(): 9.2e-81, (79.85% identity in 283 aa overlap); and highly similar (but shorter 41 aa) to Q49872|B229_F1_20 HYPOTHETICAL 34.0 KDA PROTEIN from Mycobacterium leprae (324 aa), FASTA scores: opt: 1491, E(): 1e-80, (79.85% identity in 283 aa overlap). Shows some similarity to Q9KIU3|LIPA LIPASE from plasmid pAH114 uncultured bacterium (281 aa), FASTA scores: opt: 168, E(): 0.0081, (29.3% identity in 140 aa overlap). Protein product from Mb3468 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3468 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V5" /protein_id="SIU02096.1" /translation="MPRIRKLVAALHRRGPHRVLRGDLAFAGLPGVVYTPEAGLHLPG VAFGHDWLTGTSRYSGLLEHLASWGIVAAAPDSERGLAPSVLNLAFDLGVALDIVAGV RLGPGKISVHPAKLGLVGHGFGGSAAVFAAAGLTGTHVKSVAAIFPTVTNPAAEQPAA TLDVPGLILTAPGDPKTLTSNALGLSRAWDKATLRIVSKARAGGLVEGRRLTKVLGLP GPHRRTQRSVRALLTGYLLYTLGGDKTYRRFADPDLQLPKTDPIDPEAPPITPGEKIV TLLK" CDS complement(3810374..3811777) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3469C" /product="Conserved alanine and proline rich protein" /note="Mb3469c, -, len: 467 aa. Equivalent to Rv3439c, len: 467 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 467 aa overlap). Conserved hypothetical ala-, pro-rich protein, similar in part to N-terminal part of Q49853|B229_C1_154 HYPOTHETICAL 11.2 KDA PROTEIN from Mycobacterium leprae (103 aa), FASTA scores: opt: 265, E(): 0.0013, (51.1% identity in 90 aa overlap). Mb3469c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y457" /protein_id="SIU02097.1" /translation="MADRLNVAERLAEGRPAAEHTQSYVRACHLVGYQHPDLTAYPAQ IHDWYGSEDGLDLHALDADCAQLRAAASVLMEALRMERSQVAVLAAAWTGSGADAAVH FVQRHCETGNSVVTEVRAAAQRCESLRDNLWQLVDSKVATAIAIDERALAQRPAWLAA AEALTTEGADRPTAVEVVRQQIQPYVDDDVRNDWLTTMRSTTAGVAASYDAVTDQLAS APRAHFEIPDDLGPGRQPSPASVPAQPSATAAITPAAALPPPDPVPAVTSRPVTPSDF GSAPGDGSATPAGVGSAGGFGDAGGTGGLGGFAGLAGLANRIVDAVDSLLGSVAEQLG DPLAADNPPGAVDPFAEDAADNADDGDDAHPEEADEAAEPKEATEPDEADEVDDADES VPAERAQDVAEEATLPPVAEPPPPAAPPVAEPPPPVAAPAPPGAPEPANGPSPEALSE GATPCEIAADELPQAGP" CDS complement(3811780..3812091) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3470C" /product="HYPOTHETICAL PROTEIN" /note="Mb3470c, -, len: 103 aa. Equivalent to Rv3440c, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Hypothetical unknown protein. Mb3470c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B8" /protein_id="SIU02098.1" /translation="MRPDSVNSAGIDIAAVYAVADRFSAAAELIDDAIGNHLTRLAFG GACAGRGHASRGDALRCRLDRLAGELSVWSRAAVQIAFALRAGANRYAEADLCAAARI G" CDS complement(3812139..3813485) /codon_start=1 /transl_table=11 /gene="mrsA" /locus_tag="BQ2027_MB3471C" /product="PROBABLE PHOSPHO-SUGAR MUTASE / MRSA PROTEIN HOMOLOG" /note="Mb3471c, mrsA, len: 448 aa. Equivalent to Rv3441c, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 448 aa overlap). Probable mrsA, phosphoglucomutase or phosphomannomutase (EC 5.4.2.-), equivalent to Q49869|URED|B229_C3_234 MRSA PROTEIN HOMOLOG from Mycobacterium leprae (463 aa), FASTA scores: opt: 2449, E(): 6.3e-135, (87.65% identity in 445 aa overlap); and highly similar (but longer 178 aa) to Q49862|UREC|B229_C2_192 PUTATIVE UREASE OPERON UREC PROTEIN from Mycobacterium leprae (288 aa), FASTA scores: opt: 1442, E(): 1.3e-76, (86.5% identity in 267 aa overlap). Highly similar to phospho-sugar mutases e.g. Q53876|SC6G4.14 PUTATIVE PHOSPHO-SUGAR MUTASE (SIMILAR TO PHOSPHOMANNOMUTASES) from Streptomyces coelicolor (452 aa), FASTA scores: opt: 1710, E(): 5e-92, (60.45% identity in 450 aa overlap); Q9KG46|BH0267 PHOSPHOGLUCOSAMINE MUTASE from Bacillus halodurans (447 aa), FASTA scores: opt: 1351, E(): 3.5e-71, (48.4% identity in 444 aa overlap); BAB58323|GLMM PHOSPHOGLUCOSAMINE-MUTASE from Staphylococcus aureus subsp. aureus Mu50 (451 aa) and Q99QR5|GLMM(FEMD)|SA1965 PHOSPHOGLUCOSAMINE-MUTASE from Staphylococcus aureus subsp. aureus N315. (451 aa), FASTA scores: opt: 1315, E(): 4.3e-69, (48.45% identity in 446 aa overlap); P95685|FEMD|GLMM PHOSPHOGLUCOSAMINE-MUTASE (451 aa), FASTA scores: opt: 1310, E(): 8.5e-69, (48.2% identity in 446 aa overlap); P95575|MRSA_PSESY MRSA PROTEIN HOMOLOG from Pseudomonas syringae (pv. syringae) (447 aa), FASTA scores: opt: 1143, E(): 4.2e-59, (42.75% identity in 447 aa overlap); etc. Contains PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature. BELONGS TO THE PHOSPHOHEXOSE MUTASES FAMILY. Protein product from Mb3471c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3471c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWH9" /db_xref="InterPro:IPR005841" /db_xref="InterPro:IPR005843" /db_xref="InterPro:IPR005844" /db_xref="InterPro:IPR005845" /db_xref="InterPro:IPR005846" /db_xref="InterPro:IPR006352" /db_xref="InterPro:IPR016055" /db_xref="InterPro:IPR016066" /db_xref="InterPro:IPR036900" /db_xref="UniProtKB/Swiss-Prot:Q7TWH9" /protein_id="SIU02099.1" /translation="MGRLFGTDGVRGVANRELTAELALALGAAAARRLSRSGAPGRRV AVLGRDPRASGEMLEAAVIAGLTSEGVDALRVGVLPTPAVAYLTGAYDADFGVMISAS HNPMPDNGIKIFGPGGHKLDDDTEDQIEDLVLGVSRGPGLRPAGAGIGRVIDAEDATE RYLRHVAKAATARLDDLAVVVDCAHGAASSAAPRAYRAAGARVIAINAEPNGRNINDG CGSTHLDPLRAAVLAHRADLGLAHDGDADRCLAVDANGDLVDGDAIMVVLALAMKEAG ELACNTLVATVMSNLGLHLAMRSAGVTVRTTAVGDRYVLEELRAGDYSLGGEQSGHIV MPALGSTGDGIVTGLRLMTRMVQTGSSLSDLASAMRTLPQVLINVEVVDKATAAAAPS VRTAVEQAAAELGDTGRILLRPSGTEPMIRVMVEAADEGVAQRLAATVADAVSTAR" CDS complement(3813610..3814065) /codon_start=1 /transl_table=11 /gene="rpsI" /locus_tag="BQ2027_MB3472C" /product="30s ribosomal protein s9 rpsi" /note="Mb3472c, rpsI, len: 151 aa. Equivalent to Rv3442c, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Probable rpsI, ribosomal protein S9, equivalent to P40828|RS9_MYCLE|ML0365|B229_C2_191 30S RIBOSOMAL PROTEIN S9 (153 aa), FASTA scores: opt: 800, E(): 2.1e-42, (83.85% identity in 155 aa overlap). Also highly similar to others e.g. Q53875|RS9_STRCO|SC6G4.13 from Streptomyces coelicolor (170 aa), FASTA scores: opt: 533, E(): 5.7e-26, (60.75% identity in 135 aa overlap); Q9KGD4|RPSI|BH0169 (BS10) from Bacillus halodurans (130 aa), FASTA scores: opt: 469, E(): 3.8e-22, (58.65% identity in 121 aa overlap); Q9CDG7|RPSI from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (130 aa), FASTA scores: opt: 451, E(): 4.9e-21, (58.65% identity in 121 aa overlap); P07842|RS9_BACST|RPSI from Bacillus stearothermophilus (129 aa), FASTA scores: opt: 448, E(): 7.4e-21, (54.55% identity in 121 aa overlap); etc. Contains PS00360 Ribosomal protein S9 signature. BELONGS TO THE S9P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3472c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3472c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66640" /db_xref="InterPro:IPR000754" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR020574" /db_xref="InterPro:IPR023035" /db_xref="UniProtKB/Swiss-Prot:P66640" /protein_id="SIU02100.1" /translation="MTETTPAPQTPAAPAGPAQSFVLERPIQTVGRRKEAVVRVRLVP GTGKFDLNGRSLEDYFPNKVHQQLIKAPLVTVDRVESFDIFAHLGGGGPSGQAGALRL GIARALILVSPEDRPALKKAGFLTRDPRATERKKYGLKKARKAPQYSKR" CDS complement(3814062..3814505) /codon_start=1 /transl_table=11 /gene="rplM" /locus_tag="BQ2027_MB3473C" /product="50s ribosomal protein l13 rplm" /note="Mb3473c, rplM, len: 147 aa. Equivalent to Rv3443c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Probable rplM, 50S ribosomal protein L13, equivalent to P38014|RL13_MYCLE|RPLM|ML0364|B229_C3_232 from Mycobacterium leprae (147 aa), FASTA scores: opt: 917, E(): 7.5e-53, (91.15% identity in 147 aa overlap). Also highly similar to others e.g. Q53874|RL13_STRCO|RPLM|SC6G4.12 from Streptomyces coelicolor (147 aa), FASTA scores: opt: 668, E(): 1.1e-36, (65.5% identity in 145 aa overlap); Q9X1G5|RL13_THEMA|RPLM|TM1454 from Thermotoga maritima (149 aa), FASTA scores: opt: 536, E(): 4.4e-28, (53.65% identity in 136 aa overlap); O67722|RL13_AQUAE|RPLM|AQ_1877 from Aquifex aeolicus (144 aa), FASTA scores: opt: 529, E(): 1.2e-27, (53.2% identity in 141 aa overlap); etc. BELONGS TO THE L13P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3473c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3473c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66066" /db_xref="InterPro:IPR005822" /db_xref="InterPro:IPR005823" /db_xref="InterPro:IPR023563" /db_xref="InterPro:IPR036899" /db_xref="UniProtKB/Swiss-Prot:P66066" /protein_id="SIU02101.1" /translation="MPTYAPKAGDTTRSWYVIDATDVVLGRLAVAAANLLRGKHKPTF APNVDGGDFVIVINADKVAISGDKLQHKMVYRHSGYPGGLHKRTIGELMQRHPDRVVE KAILGMLPKNRLSRQIQRKLRVYAGPEHPHSAQQPVPYELKQVAQ" CDS complement(3814738..3815040) /codon_start=1 /transl_table=11 /gene="esxT" /locus_tag="BQ2027_MB3474C" /product="putative esat-6 like protein esxt" /note="Mb3474c, esxT, len: 100 aa. Equivalent to Rv3444c, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). esxT, conserved hypothetical protein, equivalent to Q9CCV7|ML0363 POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (104 aa), FASTA scores: opt: 362, E(): 1.1e-18, (71.25% identity in 73 aa overlap). C-terminal part highly similar to Q49852|B229_C1_150 HYPOTHETICAL 5.3 KDA PROTEIN from Mycobacterium leprae (49 aa), FASTA scores: opt: 227, E(): 1.4e-09, (68.9% identity in 45 aa overlap). Mb3474c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y497" /protein_id="SIU02102.1" /translation="MNADPVLSYNFDAIEYSVRQEIHTTAARFNAALQELRSQIAPLQ QLWTREAAAAYHAEQLKWHQAASALNEILIDLGNAVRHGADDVAHADRRAAGAWAR" CDS complement(3815061..3815378) /codon_start=1 /transl_table=11 /gene="esxU" /locus_tag="BQ2027_MB3475C" /product="ESAT-6 like protein EsxU" /note="Mb3475c, len: 105 aa. Equivalent to Rv3445c len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). EsxU, ESAT-6 like protein (see citations below), showing weak similarity to O30373|VCD|PA2257 pyoverdine biosynthesis protein from Pseudomonas aeruginosa (215 aa), FASTA scores: opt: 103,E(): 5.6, (32.35% identity in 133 aa overlap). Seems to belong to the ESAT6 family. Start changed since first submission (-20 aa). Mb3475c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y467" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02103.1" /translation="MSTPNTLNADFDLMRSVAGITDARNEEIRAMLQAFIGRMSGVPP SVWGGLAAARFQDVVDRWNAESTRLYHVLHAIADTIRHNEAALREAGQIHARHIAAAG GDL" CDS complement(3815431..3816645) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3476C" /product="Membrane protein Rv3446c, component of Type VII secretion system ESX-4" /note="Mb3476c, -, len: 404 aa. Equivalent to Rv3446c, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 404 aa overlap). Hypothetical unknown ala-, val-rich protein. Protein product from Mb3476c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR023840" /db_xref="UniProtKB/TrEMBL:A0A1R3Y481" /protein_id="SIU02104.1" /translation="MSPHRAVIEAGPGAIRRLCCGADVVADTAVSAAALAAIDDQVAL LDERPVAVDSLWFDALRSVAVDHRDGPVVVHPSWWSAARVEVVTAAARTLTRDVVVHP RSWLLRQASSGVSAATVVVEIAERLVLVAGAEVAAVARRTDAESVAGQVGSVIARMTR GITAVVLIDVPSTVAGAAALAAAIAGAVRGTGSSVVEIDGVRLARLARAALPPSDEPA DPAARPATRSRVPTLARVAAAGVALALLAPAAVVRHGATTLQRPPTTLLVEGRVALTI PADWSTQRVVSGPGSARVQVTSPADPEVALHVTQSPVPGETLPGTAQRLKRAIDASPA GVFVDFNPSDIRAGRPAVTYREVRAGHQVRWTILLDGAVRISVGCQSGPGHEDLLREV CAQAVRSVHAVG" CDS complement(3816642..3820352) /codon_start=1 /transl_table=11 /gene="eccc4" /locus_tag="BQ2027_MB3477C" /product="esx conserved component eccc4. esx-4 type vii secretion system protein. probable membrane protein." /note="Mb3477c, -, len: 1236 aa. Equivalent to Rv3447c, len: 1236 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1236 aa overlap). Probable conserved membrane protein, similar to various bacterial proteins e.g. O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 1186, E(): 1.9e-60, (42.9% identity in 1312 aa overlap); Q9L0T6|SCD35.15c from Streptomyces coelicolor (1525 aa), FASTA scores: opt: 932, E(): 9.2e-46, (27.2% identity in 1374 aa overlap); Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 910, E(): 1.5e-44, (34.4% identity in 1319 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 805, E(): 1.9e-38, (25.85% identity in 1292 aa overlap); etc. The C-terminal region is similar to Q9CDD7|ML0052 (alias O33086|MLCB628.15c) HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 850, E(): 2.3e-41, (35.2% identity in 588 aa overlap); and O6973|Rv3871|MTV027.06 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (591 aa), FASTA scores: opt: 845, E(): 4.3e-41, (35.3% identity in 586 aa overlap). N-terminal part shows similarity with HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. O69735|Rv3870|MTV027.05 (747 aa), FASTA scores: opt: 761, E(): 3.6e-36, (38.2% identity in 746 aa overlap). Equivalent to AAK47893 from Mycobacterium tuberculosis strain CDC1551 (1200 aa) but longer 36 aa. Contains three of PS00017 ATP/GTP-binding site motif A (P-loop)." /db_xref="GOA:A0A1R3Y5W3" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR023836" /db_xref="InterPro:IPR023837" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5W3" /protein_id="SIU02105.1" /translation="MNSGPACATADILVAPPPELRRSEPSSLLIRLLPVVMSVATVGV MVTVFLPGSPATRHPTFLAFPMMMLVSLVVTAVTGRGRRHVSGIHNDRVDYLGYLSVL RTSVTQTAAAQHVSLNWTHPDPATLWTLIGGPRMWERRPGAADFCRIRVGVGSAPLAT RLVVGQLPPAQRADPVTRAALRCFLAAHATIADAPIAIPLRVGGPIAIDGDPTKVRGL LRAMICQLAVWHSPEELLIAGVVSDRNRAHWDWLKWLPHNQHPNACDALGPAPMVYST LAEMQNALAATVLAHVVAIVDTAERGNGAITGVITIEVGARRDGAPPVVRCAGEVTAL ACPDQLEPQDALVCARRLAAHRVGHSGRTFIRGSGWAELVGIGDVAAFDPSTLWRNVN QHDRLRVPIGVTPDGTAVQLDIKEAAEQGMGPHGLCVGATGSGKSELLRTIALGMMAR NSPEVLNLLLVDFKGGATFLDLAGAPHVAAVITNLAEEAPLVARMQDALAGEMSRRQQ LLRMAGHLVSVTAYQRARQTGAQLPCLPILFIVVDEFSELLSQHPEFVDVFLAIGRVG RSLGMHLLLASQRLDEGRLRGLETHLSYRMCLKTWSASESRNVLGTQDAYQLPNTPGA GLLQTGTGELIRFQTAFVSGPLRRASPSAVHPVAPPSVRPFTTHAAAPVTAGPVGGTA EVPTPTVLHAVLDRLVGHGPAAHQVWLPPLDEPPMLGALLRDAEPAQAELAVPIGIVD RPFEQSRVPLTIDLSGAAGNVAVVGAPQTGKSTALRTLIMALAATHDAGRVQFYCLDF GGGALAQVDELPHVGAVAGRAQPQLASRMLAELESAVRFREAFFRDHGIDSVARYRQL RAKSAAESFADIFLVIDGWASLRQEFAALEESIVALAAQGLSFGVHVALSAARWAEIR PSLRDQIGSRIELRLADPADSELDRRQAQRVPVDRPGRGLSRDGMHMVIALPDLDGVA LRRRSGDPVAPPIPLLPARVDYDSVVARAGDELGAHILLGLEERRGQPVAVDFGRHPH LLVLGDNECGKTAALRTLCREIVRTHTAARAQLLIVDFRHTLLDVIESEHMGGYVSSP AALGAKLSSLVDLLQARMPAPDVSQAQLRARSWWSGPDIYVVVDDYDLVAVSSGNPLM VLLEYLPHARDLSLHLVVARRSGGAARALFEPVLASLRDLGCRALLMSGRPDEGALFG SSRPMPLPPGRGILVTGAGDEQLVQVAWSPPP" CDS 3820466..3821869 /codon_start=1 /transl_table=11 /gene="eccd4" /locus_tag="BQ2027_MB3478" /product="esx conserved component eccd4. esx-4 type vii secretion system protein. probable integral membrane protein." /note="Mb3478, -, len: 467 aa. Equivalent to Rv3448, len: 467 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 467 aa overlap). Probable conserved integral membrane protein, showing some similarity with Q9CD35|ML2529 from Mycobacterium leprae (485 aa), FASTA scores: opt: 371, E(): 3.6e-14, (27.25% identity in 481 aa overlap); and two proteins from Mycobacterium tuberculosis O86362|Rv0290|MTV035.18 (472 aa), FASTA scores: opt: 429, E(): 1.6e-17, (28.6% identity in 479 aa overlap); and O05457|Rv3887c|MTCY15F10.25 (509 aa), FASTA scores: opt: 203, E(): 0.00019, (25.6% identity in 492 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /db_xref="GOA:A0A1R3Y4X5" /db_xref="InterPro:IPR006707" /db_xref="InterPro:IPR024962" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X5" /protein_id="SIU02106.1" /translation="MPTSDPGLRRVTVHAGAQAVDLTLPAAVPVATLIPSIVDILGDR GASPATAARYQLSALGAPALPNATTLAQCGIRDGAVLVLHKSSAQPPTPRCDDVAEAV AAALDTTARPQCQRTTRLSGALAASCITAGGGLMLVRNALGTNVTRYSDATAGVVAAA GLAALLFAVIACRTYRDPIAGLTLSVIATIFGAVAGLLAVPGVPGVHSVLVAAMAAAA TSVLAMRITGCGGITLTAVACCAVVVAAATLVGAITAAPVPAIGSLDTLASFGLLEVS ARMAVLLAGLSPRLPPALNPDDADALPTTDRLTTRANRADAWLTSLLAAFAASATIGA IGTAVATHGIHRSSMGGIALAAVTGALLLLRARSADTRRSLVFAICGITTVATAFTVA ADRALEHGPWIAALTAMLAAVAMFLGFVAPALSLSPVTYRTIELLECLALIAMVPLTA WLCGAYSAVRHLDLTWT" CDS 3821866..3823233 /codon_start=1 /transl_table=11 /gene="mycp4" /locus_tag="BQ2027_MB3479" /product="probable membrane-anchored mycosin mycp4 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-4)" /note="Mb3479, -, len: 455 aa. Equivalent to Rv3449, len: 455 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 455 aa overlap). Probable secreted serine protease (EC 3.4.21.-). Similar to hypothetical unknowns or proteases from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK48366|MT3998 SUBTILASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (411 aa), FASTA scores: opt: 747, E(): 3.5e-33, (45.65% identity in 416 aa overlap); O05461|Rv3883c|MTCY15F10.29 HYPOTHETICAL PROTEIN (446 aa), FASTA scores: opt: 747, E(): 3.8e-33, (45.45% identity in 451 aa overlap); O53695|Rv0291|MTV035.19 HYPOTHETICAL PROTEIN (461 aa), FASTA scores: opt: 660, E(): 1.9e-28, (44.0% identity in 457 aa overlap); etc. And similar to hypothetical proteases from Mycobacterium leprae e.g. O33076|MLCB628.04|ML0041 HYPOTHETICAL 45.7 KDA PROTEIN (PROBABLE SECRETED PROTEASE) (446 aa), FASTA scores: opt: 683, E(): 1.1e-29, (43.8% identity in 450 aa overlap); Q9CD36|ML2528 PUTATIVE PROTEASE (475 aa), FASTA scores: opt: 608, E(): 1.3e-25, (43.0% identity in 451 aa overlap); Q9CBV3|ML1538 POSSIBLE PROTEASE (567 aa), FASTA scores: opt: 389, E(): 9.7e-14, (33.8% identity in 562 aa overlap); etc. Also some similarity to other proteases from several organisms e.g. O31788|APRX ALKALINE SERINE PROTEASE from Bacillus subtilis (442 aa), FASTA scores: opt: 296, E(): 8.3e-09, (29.4% identity in 313 aa overlap); O86650|SC3C3.17c PUTATIVE SECRETED SERINE PROTEASE from Streptomyces coelicolor (450 aa), FASTA scores: opt: 279, E(): 7e-08, (33.55% identity in 343 aa overlap); Q9KBJ7|APRX|BH193 INTRACELLULAR ALKALINE SERINE PROTEASE from Bacillus halodurans (444 aa), FASTA scores: opt: 257, E(): 1.1e-06, (28.65% identity in 335 aa overlap); O86642|SC3C3.08 SERINE PROTEASE from Streptomyces coelicolor (413 aa), FASTA scores: opt: 243, E(): 5.7e-06, (38.25% identity in 387 aa overlap); etc. Has putative signal peptide at N-terminus and hydrophobic stretch at C-terminus. Contains three signatures typical of subtilase family: aspartic acid active site (PS00136), histidine active site (PS00137), serine active site (PS00138)." /db_xref="GOA:A0A1R3Y468" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR022398" /db_xref="InterPro:IPR023827" /db_xref="InterPro:IPR023828" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/TrEMBL:A0A1R3Y468" /protein_id="SIU02107.1" /translation="MTTSRTLRLLVVSALATLSGLGTPVAHAVSPPPIDERWLPESAL PAPPRPTVQREVCTEVTAESGRAFGRAERSAQLADLDQVWRLTRGAGQRVAVIDTGVA RHRRLPKVVAGGDYVFTGDGTADCDAHGTLVAGIIAAAPDAQSDNFSGVAPDVTLISI RQSSSKFAPVGDPSSTGVGDVDTMAKAVRTAADLGASVINISSIACVPAAAAPDDRAL GAALAYAVDVKNAVIVAAAGNTGGAAQCPPQAPGVTRDSVTVAVSPAWYDDYVLTVGS VNAQGEPSAFTLAGPWVDVAATGEAVTSLSPFGDGTVNRLGGQHGSIPISGTSYAAPV VSGLAALIRARFPTLTARQVMQRIESTAHHPPAGWDPLVGNGTVDALAAVSSDSIPQA GTATSDPAPVAVPVPRRSTPGPSDRRALHTAFAGAAICLLALMATLATASRRLRPGRN GIAGD" CDS complement(3823198..3824610) /codon_start=1 /transl_table=11 /gene="eccb4" /locus_tag="BQ2027_MB3480C" /product="esx conserved component eccb4. esx-4 type vii secretion system protein. probable membrane protein." /note="Mb3480c, -, len: 470 aa. Equivalent to Rv3450c, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 470 aa overlap). Probable conserved membrane protein (possible membrane spanning region near N-terminus). Similar to hypothetical unknowns proteins from Mycobacterium leprae e.g. O33088|MLCB628.17C|ML0054 HYPOTHETICAL 51.9 KDA PROTEIN (PUTATIVE MEMBRANE PROTEIN)(481 aa), FASTA scores: opt: 708, E(): 6.4e-32, (32.9% identity in 480 aa overlap); Q9CD29|ML2536 (552 aa), FASTA scores: opt: 394, E(): 1.7e-14, (33.6% identity in 503 aa overlap); etc. Also similar to other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O69734|Rv3869|MTV027.04 (480 aa), FASTA scores: opt: 717, E(): 2e-32, (32.55% identity in 479 aa overlap); O05449|Rv3895c|MTCY15F10.17 (495 aa), FASTA scores: opt: 670, E(): 8.3e-30, (36.4% identity in 475 aa overlap); O5368|Rv0283|MTV035.11 (538 aa), FASTA scores: opt: 467, E(): 1.5e-18, (36.3% identity in 493 aa overlap); etc. Mb3480c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C4" /db_xref="InterPro:IPR007795" /db_xref="InterPro:IPR042485" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C4" /protein_id="SIU02108.1" /translation="MPSPATTWLHVSGYRFLLRRIECALLFGDVCAATGALRARTTSL ALGCVLAIVAAMGCAFVALLRPQSALGQAPIVMGRESGALYVRVDDVWHPVLNLASAR LIAATNANPQPVSESELGHTKRGPLLGIPGAPQLLDQPLAGAESAWAICDSDNGGSTT VVVGPAEDSSAQVLTAEQMILVATESGSPTYLLYGGRRAVVDLADPAVVWALRLQGRV PHVVAQSLLNAVPEAPRITAPRIRGGGRASVGLPGFLVGGVVRITRASGDEYYVVLED GVQRIGQVAADLLRFGDSQGSVNVPTVAPDVIRVAPIVNTLPVSAFPDRPPTPVDGSP GRAVTTLCVTWTPAQPGAARVAFLAGSGPPVPLGGVPVTLAQADGRGPALDAVYLPPG RSAYVAARSLSGGGTGTRYLVTDTGVRFAIHDDDVAHDLGLPTAAIPAPWPVLATLPS GPELSRANASVARDTVAPGP" CDS 3824731..3825519 /codon_start=1 /transl_table=11 /gene="cut3" /locus_tag="BQ2027_MB3481" /product="PROBABLE CUTINASE PRECURSOR CUT3" /note="Mb3481, cut3, len: 262 aa. Equivalent to Rv3451, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 262 aa overlap). Probable cut3, cutinase precursor (EC 3.1.1.-), similar to others e.g. Q9KK87 from Mycobacterium avium (220 aa), FASTA scores: opt: 540, E(): 3.5e-24, (43.4% identity in 219 aa overlap); Q00298|CUTI_BOTCI|CUTA from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 214, E(): 2e-05, (31.45% identity in 210 aa overlap); Q9Y7G8 from Pyrenopeziza brassicae (203 aa), FASTA scores: opt: 203, E(): 8.5e-05, (31.05% identity in 190 aa overlap); P29292|CUTI_ASCRA from Ascochyta rabiei (223 aa), FASTA scores: opt: 155, E(): 0.054, (31.65% identity in 120 aa overlap). Similar to other proteins from Mycobacterium tuberculosis e.g. the downstream ORF O06319|Rv3452|MTCY13E12.05 HYPOTHETICAL 23.1 KDA PROTEIN (226 aa), FASTA scores: opt: 775, E(): 1e-37, (58.65% identity in 220 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 565, E(): 1.3e-25, (44.85% identity in 223 aa overlap); Q10837|CUT1_MYCTU|Rv1984c|MT2037|MTCY39.35 PROBABLE CUTINASE PRECURSOR (217 aa), FASTA scores: opt: 489, E(): 3e-21, (47.05% identity in 221 aa overlap); etc. Equivalent to AAK47897 from Mycobacterium tuberculosis strain CDC1551 (247 aa) but longer 15 aa. Contains cutinase, serine active site motif (PS00155). BELONGS TO THE CUTINASE FAMILY. Alternative start possible at 3733. Start changed since first submission (+15 aa). Mb3481 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A537" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR011150" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0A537" /protein_id="SIU02109.1" /translation="MNNRPIRLLTSGRAGLGAGALITAVVLLIALGAVWTPVAFADGC PDAEVTFARGTGEPPGIGRVGQAFVDSLRQQTGMEIGVYPVNYAASRLQLHGGDGAND AISHIKSMASSCPNTKLVLGGYSQGATVIDIVAGVPLGSISFGSPLPAAYADNVAAVA VFGNPSNRAGGSLSSLSPLFGSKAIDLCNPTDPICHVGPGNEFSGHIDGYIPTYTTQA ASFVVQRLRAGSVPHLPGSVPQLPGSVLQMPGTAAPAPESLHGR" CDS 3825566..3826246 /codon_start=1 /transl_table=11 /gene="cut4" /locus_tag="BQ2027_MB3482" /product="PROBABLE CUTINASE PRECURSOR CUT4" /note="Mb3482, cut4, len: 226 aa. Equivalent to Rv3452, len: 226 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 226 aa overlap). Probable cut4, cutinase precursor (EC 3.1.1.-), similar to other e.g. Q9KK87 from Mycobacterium avium (220 aa), FASTA scores: opt: 522, E(): 7.3e-24, (46.6% identity in 221 aa overlap); P30272|CUTI_MAGGR|CUT1 from Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) (228 aa), FASTA scores: opt: 205, E(): 3.8e-05, (29.25% identity in 164 aa overlap); Q00298|CUTI_BOTCI|CUTA from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 204, E(): 3.9e-05, (33.5% identity in 209 aa overlap); etc. Similar to other proteins from Mycobacterium tuberculosis e.g. upstream ORF O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E1 2.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 773, E(): 1.3e-38, (59.35% identity in 209 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 704, E(): 1.3e-34, (53.4% identity in 219 aa overlap); etc. Contains PS00155 Cutinase, serine active site. BELONGS TO THE CUTINASE FAMILY. Alternative start possible at 4553 in cSCY13E12 but no RBS. Mb3482 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y473" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y473" /protein_id="SIU02110.1" /translation="MIPRPQPHSGRWRAGAARRLTSLVAAAFAAATLLLTPALAPPAS AGCPDAEVVFARGTGEPPGLGRVGQAFVSSLRQQTNKSIGTYGVNYPANGDFLAAADG ANDASDHIQQMASACRATRLVLGGYSQGAAVIDIVTAAPLPGLGFTQPLPPAADDHIA AIALFGNPSGRAGGLMSALTPQFGSKTINLCNNGDPICSDGNRWRAHLGYVPGMTNQA ARFVASRI" CDS 3826518..3828203 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3483" /product="probable conserved integral membrane protein" /note="Mb3483, -, len: 561 aa. Equivalent to Rv3453 and Rv3454, len: 110 aa and 422 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 107 aa overlap and 99.8% identity in 422 aa overlap). Rv3453: Possible conserved transmembrane protein, showing weak similarity with other proteins e.g. Q9F6C3 PUTATIVE ABC TRANSPORTER from Propionibacterium thoenii (424 aa), FASTA scores: opt: 104, E(): 6.8, (40.6% identity in 69 aa overlap). Rv3454: Probable conserved integral membrane protein, showing some similarity to various proteins (generally transporters) e.g. Q9I5C8|PA0811 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (415 aa), FASTA scores: opt: 145, E(): 0.13, (28.2% identity in 188 aa overlap); Q01266|YHYC_PSESN HYPOTHETICAL PROTEIN IN HYUC 3'REGION (ORF 5) (FRAGMENT) from Pseudomonas sp. strain NS671 (245 aa), FASTA scores: opt: 130, E(): 0.75, (24.65% identity in 134 aa overlap); Q9I242|PA2073 PROBABLE TRANSPORTER (MEMBRANE SUBUNIT) from Pseudomonas aeruginosa (476 aa), FASTA scores: opt: 125, E(): 2.5, (24.6% identity in 252 aa overlap); etc. Equivalent to AAK47900 from Mycobacterium tuberculosis strain CDC1551 (562 aa) but shorter 140 aa. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, Rv3453 and Rv3454 exist as 2 genes. In Mycobacterium bovis, a single base deletion (t-*) results in a single product that is more similar to Rv3454. Protein product from Mb3483 detected using SWATH mass spectrometry. Mb3483 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y496" /db_xref="InterPro:IPR030191" /db_xref="UniProtKB/TrEMBL:A0A1R3Y496" /protein_id="SIU02111.1" /translation="MPGVITNSESPTAADHDRITATRETLEDYTLRLAPRSYRRWPPA VVGISALGGIAYLADFAIGANVGITWGTANALCGIAIFALVVFVTGLPLAYYAARYNI DLDLITRGSGFGYYGSVVTNVIFATFTFIFFALEGSIMAQGLKLGLHIPLWAGYACST LIIFPLVVYGMKVLSQLQLWTTPLWLILMAAPFGYLVVSHPDSIGQFFSYAGKDGHGG LSFGSVLLAAGVCLSLIAQIAEQIDYLRFMPPRTPENANRWWTWTLLAGPGWVAFGAT KQIIGLFLAVYLMANIPGSSTIANQPVHQFMQIYRTFVPGWLALTLAVILVILSQIKI NVTNAYSGSLAWTNSFTRLTKHYPGRVVFLGVNLAIALILMEANMFDFLNTILGCYAN CGMAWVVAVASDIGFNKYLLGLSPKTPEFRRGMLYAINPVGFGSLLLAAGLSIVTFFG GLGAALQPYSPLVAIVTALVMPPILAAATKGKYYLRRTHDGIDLPMYDEHGNPSAAVL TCHVCHQDFERPDMLACQTHGAHVCSLCLSTDKQAEHVLPGLARAHIPGDQVP" CDS complement(3828165..3828935) /codon_start=1 /transl_table=11 /gene="truA" /locus_tag="BQ2027_MB3484C" /product="PROBABLE TRNA PSEUDOURIDINE SYNTHASE A TRUA (PSEUDOURIDYLATE SYNTHASE I) (PSEUDOURIDINE SYNTHASE I) (URACIL HYDROLYASE)" /note="Mb3484c, truA, len: 256 aa. Equivalent to Rv3455c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 256 aa overlap). Probable truA, pseudouridine synthase A (EC 4.2.1.70), equivalent to Q9X796|TRUA_MYCLE|ML1955|MLCB1222.25c TRNA PSEUDOURIDINE SYNTHASE A from Mycobacterium leprae (249 aa), FASTA scores: opt: 1345, E(): 3.2e-80, (77.25% identity in 246 aa overlap). Also highly similar to others e.g. O86776|TRUA_STRCO|SC6G4.09 from Streptomyces coelicolor (284 aa), FASTA scores: opt: 595, E(): 1.7e-31, (49.8% identity in 259 aa overlap); Q9RS37|DR2290 from Deinococcus radiodurans (280 aa), FASTA scores: opt: 383, E(): 1e-17, (41.2% identity in 216 aa overlap); Q9PJT0|TRUA_CHLMU|TC0748 from Chlamydia muridarum (267 aa), FASTA scores: opt: 334, E(): 1.5e-14, (37.65% identity in 231 aa overlap); P07649|TRUA_ECOLI|HIST|ASUC|LEUK|B2318 from Escherichia coli strain K12 (270 aa), FASTA scores: opt: 315, E(): 2.5e-13, (33.35% identity in 240 aa overlap); etc. BELONGS TO THE TRUA FAMILY OF PSEUDOURIDINE SYNTHASES. Protein product from Mb3484c detected using SWATH mass spectrometry. Mb3484c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65847" /db_xref="InterPro:IPR001406" /db_xref="InterPro:IPR020095" /db_xref="InterPro:IPR020097" /db_xref="InterPro:IPR020103" /db_xref="UniProtKB/Swiss-Prot:P65847" /protein_id="SIU02112.1" /translation="MGQRTVAGDLDAALTTIFRTPVRLRAAGRTDAGVHASGQVAHVD VPADALPNAYPRAGHVGDPEFLPLLRRLGRFLPADVRILDITRAPAGFDARFSALRRH YVYRLSTAPYGVEPQQARYITAWPRELDLDAMTAASRDLMGLHDFAAFCRHREGATTI RDLQRLDWSRAGTLVTAHVTADAFCWSMVRSLVGALLAVGEHRRATTWCRELLTATGR SSDFAVAPAHGLTLIQVDYPPDDQLASRNLVTRDVRSG" CDS complement(3829003..3829545) /codon_start=1 /transl_table=11 /gene="rplQ" /locus_tag="BQ2027_MB3485C" /product="50s ribosomal protein l17 rplq" /note="Mb3485c, rplQ, len: 180 aa. Equivalent to Rv3456c, len: 180 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 180 aa overlap). Probable rplQ, 50S ribosomal protein L17, equivalent to Q9X797|RL17_MYCLE|ML1956|MLCB1222.26c 50S RIBOSOMAL PROTEIN L17 from Mycobacterium leprae (170 aa), FASTA scores: opt: 874, E(): 9.5e-45, (81.85% identity in 171 aa overlap). Also highly similar to other e.g. O86775|RL17_STRCO|SC6G4.08 from Streptomyces coelicolor (168 aa), FASTA scores: opt: 609, E(): 3.7e-29, (60.0% identity in 170 aa overlap); BAB47931|MLR0326 from Rhizobium loti (Mesorhizobium loti) (143 aa), FASTA scores: opt: 404, E(): 3.7e-17, (49.65% identity in 139 aa overlap); Q9Z9H5|RL17_THETH|RPLQ from Thermus aquaticus (subsp. thermophilus) (118 aa), FASTA scores: opt: 366, E(): 5.5e-15, (53.15% identity in 111 aa overlap); P02416|RL17_ECOLI|RPLQ|B3294 from Escherichia coli strain K12 (127 aa), FASTA scores: opt: 347, E(): 7.6e-14, (50.4% identity in 119 aa overlap); etc. BELONGS TO THE L17P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3485c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3485c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5V5" /db_xref="InterPro:IPR000456" /db_xref="InterPro:IPR036373" /db_xref="UniProtKB/Swiss-Prot:P0A5V5" /protein_id="SIU02113.1" /translation="MPKPTKGPRLGGSSSHQKAILANLATSLFEHGRITTTEPKARAL RPYAEKLITHAKKGALHNRREVLKKLRDKDVVHTLFAEIGPFFADRDGGYTRIIKIEA RKGDNAPMAVIELVREKTVTSEANRARRVAAAQAKAKKAAAMPTEESEAKPAEEGDVV GASEPDAKAPEEPPAEAPEN" CDS complement(3829577..3830620) /codon_start=1 /transl_table=11 /gene="rpoA" /locus_tag="BQ2027_MB3486C" /product="PROBABLE DNA-DIRECTED RNA POLYMERASE (ALPHA CHAIN) RPOA (TRANSCRIPTASE ALPHA CHAIN) (RNA POLYMERASE ALPHA SUBUNIT) (DNA-DIRECTED RNA NUCLEOTIDYLTRANSFERASE)" /note="Mb3486c, rpoA, len: 347 aa. Equivalent to Rv3457c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Probable rpoA, alpha chain of RNA polymerase (EC 2.7.7.6), equivalent to Q9X798|RPOA_MYCLE|ML1957|MLCB1222.27c DNA-DIRECTED RNA POLYMERASE ALPHA from Mycobacterium leprae (347 aa), FASTA scores: opt: 2139, E(): 1.3e-123, (95.65% identity in 347 aa overlap). Also highly similar to others e.g. P72404|RPOA_STRCO|C6G4.07 from Streptomyces coelicolor (340 aa), FASTA scores: opt: 1672, E(): 4.7e-95, (75.55% identity in 348 aa overlap); Q9X4V6|RPOA_STRGT from Streptomyces granaticolor (340 aa), FASTA scores: opt: 1671, E(): 5.4e-95, (75.55% identity in 348 aa overlap); P20429|RPOA_BACSU from Bacillus subtilis (314 aa), FASTA scores: opt: 939, E(): 3e-50, (48.9% identity in 311 aa overlap); etc. Contains (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE RNA POLYMERASE ALPHA CHAIN FAMILY. Protein product from Mb3486c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3486c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66702" /db_xref="InterPro:IPR011260" /db_xref="InterPro:IPR011262" /db_xref="InterPro:IPR011263" /db_xref="InterPro:IPR011773" /db_xref="InterPro:IPR036603" /db_xref="InterPro:IPR036643" /db_xref="UniProtKB/Swiss-Prot:P66702" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02114.1" /translation="MLISQRPTLSEDVLTDNRSQFVIEPLEPGFGYTLGNSLRRTLLS SIPGAAVTSIRIDGVLHEFTTVPGVKEDVTEIILNLKSLVVSSEEDEPVTMYLRKQGP GEVTAGDIVPPAGVTVHNPGMHIATLNDKGKLEVELVVERGRGYVPAVQNRASGAEIG RIPVDSIYSPVLKVTYKVDATRVEQRTDFDKLILDVETKNSISPRDALASAGKTLVEL FGLARELNVEAEGIEIGPSPAEADHIASFALPIDDLDLTVRSYNCLKREGVHTVGELV ARTESDLLDIRNFGQKSIDEVKIKLHQLGLSLKDSPPSFDPSEVAGYDVATGTWSTEG AYDEQDYAETEQL" CDS complement(3830768..3831373) /codon_start=1 /transl_table=11 /gene="rpsD" /locus_tag="BQ2027_MB3487C" /product="30s ribosomal protein s4 rpsd" /note="Mb3487c, rpsD, len: 201 aa. Equivalent to Rv3458c, len: 201 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 201 aa overlap). Probable rpsD, 30S ribosomal protein S4, equivalent to Q9X799|RS4_MYCLE|RPSD|ML1958|MLCB1222.28c 30S RIBOSOMAL PROTEIN S4 from Mycobacterium leprae (201 aa), FASTA scores: opt: 1271, E(): 2.2e-73, (93.5% identity in 201 aa overlap); and P45811|RS4_MYCBO|RPSD from Mycobacterium bovis (131 aa), FASTA scores: opt: 867, E(): 4.9e-48, (100.0% identity in 130 aa overlap). Also highly similar to others e.g. P81288|RS4_BACST|RPSD from Bacillus stearothermophilus (198 aa), FASTA scores: opt: 665, E(): 4e-35, (52.25% identity in 201 aa overlap); Q9K7Z8|RPSD|BH3209 from Bacillus halodurans (200 aa), FASTA scores: opt: 626, E(): 1.2e-32, (48.75% identity in 203 aa overlap); Q9X1I3|RS4_THEMA|RPSD|TM1473 from Thermotoga maritima (209 aa), FASTA scores: opt: 591, E(): 2e-30, (45.0% identity in 209 aa overlap); etc. Contains ribosomal protein S4 signature (PS00632) and ATP/GTP binding site motif (PS00017). BELONGS TO THE S4P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3487c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3487c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P45811" /db_xref="InterPro:IPR001912" /db_xref="InterPro:IPR002942" /db_xref="InterPro:IPR005709" /db_xref="InterPro:IPR018079" /db_xref="InterPro:IPR022801" /db_xref="InterPro:IPR036986" /db_xref="UniProtKB/Swiss-Prot:P45811" /protein_id="SIU02115.1" /translation="MARYTGPVTRKSRRLRTDLVGGDQAFEKRPYPPGQHGRARIKES EYLLQLQEKQKARFTYGVMEKQFRRYYEEAVRQPGKTGEELLKILESRLDNVIYRAGL ARTRRMARQLVSHGHFNVNGVHVNVPSYRVSQYDIVDVRDKSLNTVPFQIARETAGER PIPSWLQVVGERQRVLIHQLPERAQIDVPLTEQLIVEYYSK" CDS complement(3831382..3831801) /codon_start=1 /transl_table=11 /gene="rpsK" /locus_tag="BQ2027_MB3488C" /product="30s ribosomal protein s11 rpsk" /note="Mb3488c, rpsK, len: 139 aa. Equivalent to Rv3459c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 139 aa overlap). Probable rpsK, 30S ribosomal protein S11, equivalent to Q9X7A0|RS11_MYCLE|RPSK|ML1959|MLCB1222.29c 30S RIBOSOMAL PROTEIN S11 from Mycobacterium leprae (138 aa), FASTA scores: opt: 819, E(): 7.6e-44, (89.95% identity in 139 aa overlap); and P45812|RS11_MYCBO 30S RIBOSOMAL PROTEIN S11 from Mycobacterium bovis (139 aa), FASTA scores: opt: 867, E(): 8.4e-47, (94.25% identity in 139 aa overlap). Also highly similar to others e.g. P72403|RS11_STRCO|SC6G4.06 from Streptomyces coelicolor (134 aa), FASTA scores: opt: 729, E(): 2.6e-38, (79.85% identity in 139 aa overlap); O50633|RS11_BACHD|RPSK|BH0161 from Bacillus halodurans (129 aa), FASTA scores: opt: 618, E(): 1.7e-31, (70.3% identity in 128 aa overlap); P04969|RS11_BACSU|RPSK from Bacillus subtilis (131 aa), FASTA scores: opt: 601, E(): 2e-30, (69.0% identity in 129 aa overlap); etc. Contains ribosomal protein S11 signature (PS00054). BELONGS TO THE S11P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3488c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3488c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P45812" /db_xref="InterPro:IPR001971" /db_xref="InterPro:IPR018102" /db_xref="InterPro:IPR019981" /db_xref="InterPro:IPR036967" /db_xref="UniProtKB/Swiss-Prot:P45812" /protein_id="SIU02116.1" /translation="MPPAKKGPATSARKGQKTRRREKKNVPHGAAHIKSTFNNTIVTI TDPQGNVIAWASSGHVGFKGSRKSTPFAAQLAAENAARKAQDHGVRKVDVFVKGPGSG RETAIRSLQAAGLEVGAISDVTPQPHNGVRPPNRRRV" CDS complement(3831805..3832179) /codon_start=1 /transl_table=11 /gene="rpsM" /locus_tag="BQ2027_MB3489C" /product="30s ribosomal protein s13 rpsm" /note="Mb3489c, rpsM, len: 124 aa. Equivalent to Rv3460c, len: 124 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 124 aa overlap). Probable rpsM, 30S ribosomal protein S13, equivalent to Q9X7A1|RS13_MYCLE|RPSM|ML1960|MLCB1222.30c 30S RIBOSOMAL PROTEIN S13 from Mycobacterium leprae (124 aa), FASTA scores: opt: 762, E(): 1.5e-43, (92.75% identity in 124 aa overlap); and P45813|RS13_MYCBO|RPSM from Mycobacterium bovis (123 aa), FASTA scores: opt: 727, E(): 3e-41, (98.25% identity in 114 aa overlap). Also highly similar to others e.g. O86773|RS13_STRCO|SC6G4.05 from Streptomyces coelicolor (126 aa), FASTA scores: opt: 631, E(): 6.2e-35, (73.75% identity in 122 aa overlap); Q9RA65|RPS13 from Thermus aquaticus (subsp. thermophilus) (126 aa), FASTA scores: opt: 552, E(): 9.8e-30, (62.6% identity in 123 aa overlap); P20282|RS13_BACSU|RPSM from Bacillus subtilis (120 aa), FASTA scores: opt: 533, E(): 1.7e-28, (64245% identity in 121 aa overlap); etc. Contains ribosomal protein S13 signature (PS00646). BELONGS TO THE S13P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3489c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3489c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P45813" /db_xref="InterPro:IPR001892" /db_xref="InterPro:IPR010979" /db_xref="InterPro:IPR018269" /db_xref="InterPro:IPR019980" /db_xref="InterPro:IPR027437" /db_xref="UniProtKB/Swiss-Prot:P45813" /protein_id="SIU02117.1" /translation="MARLVGVDLPRDKRMEVALTYIFGIGRTRSNEILAATGIDRDLR TRDLTEEQLIHLRDYIEANLKVEGDLRREVQADIRRKIEIGCYQGLRHRRGMPVRGQR TKTNARTRKGPKRTIAGKKKAR" CDS complement(3832395..3832508) /codon_start=1 /transl_table=11 /gene="rpmJ" /locus_tag="BQ2027_MB3490C" /product="50s ribosomal protein l36 rpmj" /note="Mb3490c, rpmJ, len: 37 aa. Equivalent to Rv3461c, len: 37 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 37 aa overlap). Probable rpmJ, 50S ribosomal protein L36, equivalent to P45810|RL36_MYCBO|RPMJ from Mycobacterium bovis (37 aa); and Q9X7A2|RL36_MYCLE|RPMJ|ML1961|MLCB1222.31c 50S RIBOSOMAL PROTEIN L36 from Mycobacterium leprae (37 aa), FASTA scores: opt: 241, E(): 9.7e-14, (86.5% identity in 37 aa overlap). Also highly similar to others e.g. O86772|RL36_STRCO|SC6G4.04 from Streptomyces coelicolor (37 aa), FASTA scores: opt: 233, E(): 4.5e-13, (83.8% identity in 37 aa overlap); P07841|RL36_BACST|RPMJ from Bacillus stearothermophilus (37 aa), FASTA scores: opt: 214, E(): 1.6e-11, (72.95% identity in 37 aa overlap); P12230|RK36_SPIOL|RPL36 from Spinacia oleracea (Spinach) (37 aa), FASTA scores: opt: 211, E(): 2.9e-11, (70.25% identity in 37 aa overlap); etc. Contains PS00828 Ribosomal protein L36 signature. BELONGS TO THE L36P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3490c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3490c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5W7" /db_xref="InterPro:IPR000473" /db_xref="InterPro:IPR035977" /db_xref="UniProtKB/Swiss-Prot:P0A5W7" /protein_id="SIU02118.1" /translation="MKVNPSVKPICDKCRLIRRHGRVMVICSDPRHKQRQG" CDS complement(3832541..3832762) /codon_start=1 /transl_table=11 /gene="infA" /locus_tag="BQ2027_MB3491C" /product="PROBABLE TRANSLATION INITIATION FACTOR IF-1 INFA" /note="Mb3491c, infA, len: 73 aa. Equivalent to Rv3462c, len: 73 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 aa overlap). Probable infA, initiation factor IF-1, equivalent to P45957|ML1962|INFA TRANSLATION INITIATION FACTOR IF-1 from Mycobacterium bovis (72 aa) and Mycobacterium leprae (72 aa), FASTA scores: opt: 472, E(): 6.6e-28, (100.0% identity in 72 aa overlap). Also highly similar to others e.g. O54209|IF1_STRCO|INFA|SC6G4.03 from Streptomyces coelicolor (73 aa), FASTA scores: opt: 424, E(): 2e-24, (84.95% identity in 73 aa overlap); O50630|IF1_BACHD|INFA|BH0158 from Bacillus halodurans (71 aa), FASTA scores: opt: 388, E(): 8.1e-22, (77.8% identity in 72 aa overlap); Q9XD14|IF1_LEPIN|INFA from Leptospira interrogans (71 aa), FASTA scores: opt: 376, E(): 6e-21, (80.0% identity in 70 aa overlap); etc. CONTAINS 1 'S1 MOTIF' DOMAIN. BELONGS TO THE IF-1 FAMILY. Protein product from Mb3491c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3491c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5H6" /db_xref="InterPro:IPR004368" /db_xref="InterPro:IPR006196" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/Swiss-Prot:P0A5H6" /protein_id="SIU02119.1" /translation="MAKKDGAIEVEGRVVEPLPNAMFRIELENGHKVLAHISGKMRQH YIRILPEDRVVVELSPYDLSRGRIVYRYK" CDS 3833016..3833873 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3492" /product="Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases" /note="Mb3492, -, len: 285 aa. Equivalent to Rv3463, len: 285 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 285 aa overlap). Conserved hypothetical protein, similar to Q9RDA2|SCE20.23 HYPOTHETICAL 31.4 KDA PROTEIN from Streptomyces coelicolor (290 aa), FASTA scores: opt: 770, E(): 2.2e-41, (48.6% identity in 247 aa overlap); and Q9X7Y1|SC6A5.35 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (341 aa), (see BLASTP results), FASTA scores: opt: 119, E(): 2.9, (24.1% identity in 274 aa overlap). Protein product from Mb3492 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3492 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y487" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019922" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y487" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02120.1" /translation="MTNCAAGKPSSGPNLGRFGSFGRGVTPQQATEIEALGYGAVWVG GSPPAALSWVEPILQATTTLCVATGIVNIWSAPAQRVAESFHRIEAAYPGRFLLGIGV GHAEMISEYRKPYNALVEYLDRLDDYGVPANRRVVAALGPRVLGLSARRSAGAHPYLT TPEHTARARELIGPSAFLAPEHKVVLTTDSARARTVGRQALDMYFNLANYRNNWKRLG FTDDEVSRPGSDRLVDAVVAYGTPDAIAARLNEHLLAGADHVPIQVLTEDDNLVSALT ELAKPLRLT" CDS 3833946..3834941 /codon_start=1 /transl_table=11 /gene="rmlB1" /locus_tag="BQ2027_MB3493" /standard_name="rfbB" /product="dtdp-glucose 4,6-dehydratase rmlb" /note="Mb3493, rmlB1, len: 331 aa. Equivalent to Rv3464, len: 331 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 331 aa overlap). Probable rmlB1 (alternate gene name: rfbB), DTDP-glucose-4,6-dehydratase (EC 4.2.1.46), nearly identical to Q50556|RMLB rhamnose biosynthesis protein (EC 4.2.1.46) from Mycobacterium tuberculosis (329 aa) (previously rfbB, now known as rmlB). Equivalent to Q9CBH7|RMLB|ML1964 DTDP-GLUCOSE 4,6-DEHYDRATASE (alias Q9X7A3|RMLB PUTATIVE DTDP-(GLUCOSE OR RHAMNOSE)-4,6-DEHYDRATASE (331 aa)) from Mycobacterium leprae (333 aa), FASTA scores: opt: 1925, E(): 1.9e-112, (84.0% identity in 331 aa overlap). Also highly similar to others e.g. Q9UZH2|RFBB|PAB0785 from Pyrococcus abyssi (333 aa), FASTA scores: opt: 1115, E(): 4.2e-62, (51.55% identity in 322 aa overlap); O27817|MTH1789 from Methanobacterium thermoautotrophicum (336 aa), FASTA scores: opt: 1104, E(): 2.1e-61, (51.65% identity in 331 aa overlap); BAB60064|TVG0950610 from Thermoplasma volcanium (318 aa), FASTA scores: opt: 1102, E(): 2.6e-61, (49.65% identity in 310 aa overlap); etc. Also related to P72050|MTCY13D12.18|RV3784 HYPOTHETICAL 36.3 KDA PROTEIN (SIMILAR TO GALACTOWALDENASES FROM EUKARYOTIC AND PROKARYOTIC ORIGIN) from Mycobacterium tuberculosis (326 aa), FASTA scores: E(): 1.4e-26, (33.8% identity in 320 aa overlap). Protein product from Mb3493 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3493 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4A5" /db_xref="InterPro:IPR005888" /db_xref="InterPro:IPR016040" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4A5" /protein_id="SIU02121.1" /translation="MRLLVTGGAGFIGTNFVHSAVREHPDDAVTVLDALTYAGRRESL ADVEDAIRLVQGDITDAELVSQLVAESDAVVHFAAESHVDNALDNPEPFLHTNVIGTF TILEAVRRHGVRLHHISTDEVYGDLELDDRARFTESTPYNPSSPYSATKAGADMLVRA WVRSYGVRATISNCSNNYGPYQHVEKFIPRQITNVLTGRRPKLYGAGANVRDWIHVDD HNSAVRRILDRGRIGRTYLISSEGERDNLTVLRTLLRLMDRDPDDFDHVTDRVGHDLR YAIDPSTLYDELCWAPKHTDFEEGLRTTIDWYRDNESWWRPLKDATEARYQERGQ" CDS 3834943..3835551 /codon_start=1 /transl_table=11 /gene="rmlC" /locus_tag="BQ2027_MB3494" /standard_name="rfbC" /product="dtdp-4-dehydrorhamnose 3,5-epimerase rmlc (dtdp-4-keto-6-deoxyglucose 3,5-epimerase) (dtdp-l-rhamnose synthetase) (thymidine diphospho-4-keto-rhamnose 3,5-epimerase)" /note="Mb3494, rmlC, len: 202 aa. Equivalent to Rv3465, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Probable rmlC (alternate gene name: rfbC), dtdp-4-dehydrorhamnose 3,5-epimerase (EC 5.1.3.13), nearly identical to O33170|RMLC RMLC PROTEIN from Mycobacterium tuberculosis (203 aa), FASTA scores: opt: 1171, E(): 2.6e-71, (89.5% identity in 200 aa overlap) (previously known as rfbC). Equivalent to Q9X7A4|RMLC|ML1965 PUTATIVE DTDP-4-DEHYDRORHAMNOSE 3,5-EPIMERASE from Mycobacterium leprae (202 aa), FASTA scores: opt: 1072, E(): 1.1e-64, (75.4% identity in 199 aa overlap). Also highly similar to others e.g. Q9F8S7|CUMY from Streptomyces rishiriensis (198 aa), FASTA scores: opt: 671, E(): 7e-38, (51.3% identity in 193 aa overlap); Q9L6C5 from Streptomyces antibioticus (202 aa), FASTA scores: opt: 665, E(): 1.8e-37, (49.25% identity in 197 aa overlap); P29783|STRM_STRGR from Streptomyces griseus (200 aa), FASTA scores: opt: 608, E(): 1.2e-33, (49.25% identity in 201 aa overlap); Q54265|STRM from Streptomyces glaucescens (200 aa), FASTA scores: opt: 603, E(): 2.5e-33, (46.7% identity in 197 aa overlap); etc. Also highly similar to Q9S4D4|TYLJ PUTATIVE NDP-HEXOSE 3-EPIMERASE from Streptomyces fradiae (205 aa), FASTA scores: opt: 625, E(): 8.6e-35, (45.9% identity in 194 aa overlap). Protein product from Mb3494 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3494 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4B1" /db_xref="InterPro:IPR000888" /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR014710" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B1" /protein_id="SIU02122.1" /translation="MKARELDVPGAWEITPTIHVDSRGLFFEWLTDHGFRAFAGHSLD VRQVNCSVSSAGVLRGLHFAQLPPSQAKYVTCVSGSVFDVVVDIREGSPTFGRWDSVL LDDQDRRTIYVSEGLAHGFLALQDNSTVMYLCSAEYNPQREHTICATDPTLAVDWPLV DGAAPSLSDRDAAAPSFEDVRASGLLPRWEQTQRFIGEMRGT" repeat_region 3835634..3837026 /rpt_family="REP" /note="REP-8, len: 1393 nt. Equivalent to REP, len: 1372 nt, from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 1368 nt overlap). REP13E12, 1371 bp repeat,copies in Mycobacterium tuberculosis cosmids; cY336 from: 14471 to: 15821 (approx. 100% identity); cY251 from: 11693 to: 13109 (approx. 100% identity); cI65 from: 14515 to: 15905 (approx 75% identity); cI125 from: 27240 to: 28597 (approx. 65% Identity); cY22G8 from: 13352 to 14689 (approx. 65% identity); and cY9F9 from: 9019 to: 10451 (approx. 65% identity); also nearly identical to EM_BA :MB35021 U35021 Mycobacterium bovis BCG DNA flanking deletion region 3 from: 56 to: 1466." CDS 3835634..3836302 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3495" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3495, -, len: 222 aa. Equivalent to Rv3466, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (98.2% identity in 222 aa overlap). Conserved hypothetical ORF in REP13E12 repeat, but extending 5' of repeat. Has segment of identity to other REP13E12 ORF's e.g. MTCY336.16, MTCI65.15c, MTCY09F9.19, cMTCY251.14c. Mb3495 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3Y485" /protein_id="SIU02123.1" /translation="MGSGSRERIVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLE CLVRRLPAVGHTLINQLDTQASEEELGGTLCCALANRLRITKPDAALRIADAADLGPR RALTGEPLAPQLTATATAQRQGLIGEAHIKVIRALFRPPARRGGCVHPPGRRSRPGRQ SRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPEQPAIRRHVTAKWLPDPPS AGHL" CDS 3836073..3837026 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3496" /product="13E12 repeat family protein" /note="Mb3496, -, len: 317 aa. Equivalent to Rv3467, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (98.1% identity in 317 aa overlap). Conserved hypothetical ORF in REP13E12 repeat, identical to ORF's from other REP13E12 copies e.g. MTCY251.13c, MTCI65.15c, MTCY09F9.19, cMTCY336.17. Also identical to Mycobacterium bovis Q50655 HYPOTHETICAL 34.6 KD PROTEIN (317 aa) in identical repeat." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4A6" /protein_id="SIU02124.1" /translation="MSTRQAAEADLAGKAAQYRPDELARYAQRVMDWLHPDGDLTDTE RARKRGITLSNQQYDGMSRLSGYLTPQARATFEAVLAKLAAPGATNPDDHTPVIDTTP DAAAIDRDTRSQAQRNHDGLLAGLRALIASGELGQHNGLPVSIVVTTTLTDLQTGAGK GFTGGGTLLPMADVIRMTSHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIM LFANDRGCTKPGCDAPAYHSQAHHVTGWTSTGRTDITELTLACDPDNRLAEKGWTTRK NTHGHTEWLPPPHLDHGQPRTNTFHHPERFLHNQDDDDEPD" CDS complement(3837084..3838178) /codon_start=1 /transl_table=11 /gene="rmlB2" /locus_tag="BQ2027_MB3497C" /standard_name="rfbB" /product="possible dtdp-glucose 4,6-dehydratase" /note="Mb3497c, rmlB2, len: 364 aa. Equivalent to Rv3468c, len: 364 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 364 aa overlap). Possible rmlB2 (alternate gene name: rfbB), DTDP-glucose-4,6-dehydratase (EC 4.2.1.46), similar to others e.g. O08246|MTME from Streptomyces argillaceus (331 aa), FASTA scores: opt: 238, E(): 1.2e-07, (29.65% identity in 344 aa overlap); Q9LFG7|F4P12_220 from Arabidopsis thaliana (Mouse-ear cress) (433 aa), FASTA scores: opt: 237, E(): 1.8e-07, (27.25% identity in 308 aa overlap); Q9LZI2|F26K9_260 from Arabidopsis thaliana (Mouse-ear cress) (445 aa), FASTA scores: opt: 225, E(): 1e-06, (25.95% identity in 335 aa overlap); etc. Also similar to various enzymes and hypothetical unknowns proteins e.g. BAB48655|MLL1234 UDP-GLUCOSE 4-EPIMERASE from Rhizobium loti (Mesorhizobium loti) (307 aa), FASTA scores: opt: 757, E(): 4.6e-40, (43.4% identity in 302 aa overlap). First start taken, alternative at 17080 in cSCYY13E12 suggested by similarity. Note that previously known as rmlB3. Mb3497c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5Y1" /db_xref="InterPro:IPR001509" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5Y1" /protein_id="SIU02125.1" /translation="MGTHAATMRVRAGVRSSPLLLHAGTPPTAAAAESGMRTLVTGSS GHLGEALVRTLRARGADVVSLDSRPSRYTNIVGCVSDRALLRDVMAGVEVVFHAAAHH KPQLAFLPRQAFLDTNIIGTQTVLDAAVAANVRAFVMTSSTTVFGDALTPPADQPAAW IDESVTPIPKNIYGVTKASSEDLCQLAHRNDGLACVVLRVARFFVEGDDMPDLYDGRS QDNIKANEYACRRVALEDAVDAHLNAAQRAPQLGFGRYLVSATTPFTRDDLTQLRTDA ASVFARRVPLAAAVWTQRGWRFPDRLDRVYVNSRARRDLNWRPRFDLNAVAARLARGQ SVHTPLSQLVGSKAYAHSSYHRGVFAPARP" CDS complement(3838182..3839192) /codon_start=1 /transl_table=11 /gene="mhpE" /locus_tag="BQ2027_MB3498C" /product="PROBABLE 4-HYDROXY-2-OXOVALERATE ALDOLASE MHPE (HOA)" /note="Mb3498c, mhpE, len: 336 aa. Equivalent to Rv3469c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Probable mhpE, 4-hydroxy-2-oxovalerate aldolase (EC 4.1.3.-), similar to others (principally from Pseudomonas species) e.g. Q99PZ1|SCP1.301|SCP1.53c from Streptomyces coelicolor (338 aa), FASTA scores: opt: 615, E(): 7.9e-31, (37.65% identity in 332 aa overlap); Q9X9Q0|NIKB NIKB PROTEIN (see first citation below) from Streptomyces tendae (357 aa), FASTA scores: opt: 571, E(): 4.4e-28, (34.5% identity in 339 aa overlap); P51014|BPHF_PSES1 from Pseudomonas sp. strain KKS102 (352 aa), FASTA scores: opt: 549, E(): 9.9e-27, (31.2% identity in 314 aa overlap); Q51983|CMTG_PSEPU from Pseudomonas putida (350 aa), FASTA scores: opt: 543, E(): 2.3e-26, (30.7% identity in 319 aa overlap); P51020|MHPE_ECOLI|MHPF|B0352 from Escherichia coli strain K12 (337 aa), FASTA scores: opt: 517, E(): 9.1e-25, (31.75% identity in 312 aa overlap); etc. Also similar to P71867|MTCY03C7.22|Rv3534c HYPOTHETICAL 36.4 KDA PROTEIN from Mycobacterium tuberculosis (346 aa), FASTA scores: E(): 7.5e-24, (31.9% identity in 310 aa overlap). Mb3498c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Y7" /db_xref="InterPro:IPR000891" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02126.1" /translation="MLMTATHREPIVLDTTVRDGSYAVNFQYTDDDVRRIVGDLDAAG IPYIEIGHGVTIGAAAAQGPAAHTDEEYFRAARSVVRNARLGAVIVPALARIETVDLA GDYLDFLRICVIATEFELVMPFVERAQSKGLEVSIQLVKSHLFEPDVLAAAGKRARDV GVRIVYVVDTTGTFLPEDARRYVEALRGASDVSVGFHGHNNLAMAVANTLEAFDAGAD FLDGTLMGFGRGAGNCQIECLVAALQRRGHLAAVDLDRIFDAARSDMLGRSPQSYGID PWEISFGFHGLDSLQVEHLRAAAQQAGLSVSHVIRQTAKSHAGQWLSPQDIDRVVVGM RA" CDS complement(3839253..3840911) /codon_start=1 /transl_table=11 /gene="ilvB2" /locus_tag="BQ2027_MB3499C" /product="PROBABLE ACETOLACTATE SYNTHASE (LARGE SUBUNIT) ILVB2 (AHAS) (ACETOHYDROXY-ACID SYNTHASE LARGE SUBUNIT) (ALS)" /note="Mb3499c, ilvB2, len: 552 aa. Equivalent to Rv3470c, len: 552 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 552 aa overlap). Probable ilvB2, acetolactate synthase large subunit (EC 4.1.3.18), similar to others e.g. P73913|ILVG|SLR2088 from Synechocystis sp. strain PCC 6803 (621 aa), FASTA scores: opt: 779, E(): 4.5e-39, (30.7% identity in 567 aa overlap); O78518|ILVB_GUITH from Guillardia theta (Cryptomonas phi) (575 aa), FASTA scores: opt: 742, E(): 6.9e-37, (28.8% identity in 566 aa overlap); Q59950|ILVX from Spirulina platensis (612 aa), FASTA scores: opt: 715, E(): 3e-35, (28.45% identity in 569 aa overlap); etc. Contains thiamine pyrophosphate enzymes signature (PS00187). Mb3499c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y484" /db_xref="InterPro:IPR000399" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR012000" /db_xref="InterPro:IPR012001" /db_xref="InterPro:IPR029035" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/TrEMBL:A0A1R3Y484" /protein_id="SIU02127.1" /translation="MTVGDHLVARMRAAGISVVCGLPTSRLDSLLVRLSRDAGFQIVL ARHEGGAGYLADGFARASGKSAAVFVAGPGATNVISAVANASVNQVPMLILTGEVAVG EFGLHSQQDTSDDGLGLGATFRRFCRCSVSIESIANARSKIDSAFRALASIPRGPVHI ALPRDLVDERLPAHQLGTAAAGLGGLRTLAPCGPDVADEVIGRLDRSRAPMLVLGNGC RLDGIGEQIVAFCEKAGLPFATTPNGRGIVAETHPLSLGVLGIFGDGRADEYLFDTPC DLLIAVGVSFGGLVTRSFSPRWRGLKADVVHVDPDPSAVGRFVATSLGITTSGRAFVN ALNCGRPPRFCRRVGVRPPAPAALPGTPQARGESIHPLELMHELDRELAPNATICADV GTCISWTFRGIPVRRPGRFFATVDFSPMECGIAGAIGVALARPEEHVICIAGDGAFLM HGTEISTAVAHGIRVTWAVLNDGQMSASAGPVSGRMDPSPVARIGANDLAAMARALGA EGIRVDTRCELRAGVQKALAATGPCVLDIAIDPEINKPDIGLGR" CDS complement(3840917..3841450) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3500C" /product="Mannose-6-phosphate isomerase" /note="Mb3500c, -, len: 177 aa. Equivalent to Rv3471c, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 177 aa overlap). Conserved hypothetical protein, similar to Q59013|MJ1618 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (125 aa), FASTA scores: opt: 262, E(): 1.2e-09, (39.05% identity in 105 aa overlap); and O26452|MTH352 CONSERVED PROTEIN from Methanobacterium thermoautotrophicum (131 aa), FASTA scores: opt: 222, E(): 3.8e-07, (35.05% identity in 117 aa overlap). Equivalent to AAK47934 from Mycobacterium tuberculosis strain CDC1551 (184 aa) but shorter 7 aa." /db_xref="GOA:A0A1R3Y4E0" /db_xref="InterPro:IPR006045" /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR013096" /db_xref="InterPro:IPR014710" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E0" /protein_id="SIU02128.1" /translation="MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARA HAAAMFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQA TDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCAWGPAYL PERDQRMGEAAVIGAWP" CDS 3841471..3841977 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3501" /product="conserved protein" /note="Mb3501, -, len: 168 aa. Equivalent to Rv3472, len: 168 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 168 aa overlap). Conserved hypothetical protein, showing some similarity to other proteins e.g. Q9ZAT9|DPSH DAUNORUBICIN BIOSYNTHESIS ENZYME from Streptomyces peucetius (194 aa), FASTA scores: opt: 181, E(): 6.8e-05, (30.7% identity in 127 aa overlap); Q53879 DAUH/E from Streptomyces sp. C5 (151 aa), FASTA scores: opt: 168, E(): 0.00038, (29.25% identity in 127 aa overlap); and Q9L4U3|AKNV from Streptomyces galilaeus (144 aa), FASTA scores: opt: 122, E(): 0.36, (31.25% identity in 129 aa overlap). Protein product from Mb3501 detected using SWATH mass spectrometry. Mb3501 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D1" /protein_id="SIU02129.1" /translation="MRPVDEQWIEILRIQALCARYCLTIDTQDGEGWAGCFTEDGAFE FDGWVIRGRPALREYADAHARVVRGRHLTTDLLYEVDGDVATGRSASVVTLATAAGYK ILGSGEYQDRLIKQDGQWRIAYRRLRNDRLVSDPSVAVNVADADVAAVVGHLLAAARR LGTQMSDT" CDS complement(3842057..3842842) /codon_start=1 /transl_table=11 /gene="bpoA" /locus_tag="BQ2027_MB3502C" /product="POSSIBLE PEROXIDASE BPOA (NON-HAEM PEROXIDASE)" /note="Mb3502c, bpoA, len: 261 aa. Equivalent to Rv3473c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Possible bpoA, peroxidase (non-haem peroxidase) (EC 1.11.1.-), similar to various enzymes or hypothetical unknown proteins e.g. O85849 HYPOTHETICAL 26.2 KDA PROTEIN from Sphingomonas aromaticivorans (247 aa), FASTA scores: opt: 684, E(): 4.9e-34, (43.8% identity in 242 aa overlap); AAK45412|MT1155 HYDROLASE, ALPHA/BETA HYDROLASE FOLD FAMILY from Mycobacterium tuberculosis strain CDC1551 (311 aa), FASTA scores: opt: 675, E(): 2e-33, (39.45% identity in 256 aa overlap); Q9K3V0|SCD10.27 PUTATIVE HYDROLASE from Streptomyces coelicolor (352 aa), FASTA scores: opt: 248, E(): 9.7e-08, (26.05% identity in 261 aa overlap); P29715|BPA2_STRAU|BPOA2 NON-HAEM BROMOPEROXIDASE (EC 1.11.1.-) (BROMIDE PEROXIDASE) (277 aa), FASTA scores: opt: 237, E(): 3.6e-07, (29.45% identity in 265 aa overlap); O31168|PRXC_STRAU|CPO|CPOT NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) (278 aa), FASTA scores: opt: 236, E(): 4.2e-07, (29.45% identity in 265 aa overlap); AAK62388|T5L19.180 LIPASE-LIKE PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (350 aa), FASTA scores: opt: 236, E(): 5.1e-07, (26.65% identity in 274 aa overlap); etc. Also similar to O06575|BPOB|Rv1123c|MTCY22G8.12c HYPOTHETICAL 32.5 KDA PROTEIN from Mycobacterium tuberculosis (302 aa), FASTA scores: opt: 675, E(): 2e-33, (39.45% identity in 256 aa overlap). Equivalent to AAK47936 from Mycobacterium tuberculosis strain CDC1551 (294 aa) but shorter 33 aa. May have been inactivated or truncated by neighbouring IS6110. Mb3502c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y491" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y491" /protein_id="SIU02130.1" /translation="MVFLHGGGQTRRSWGRAAAAVAERGWQAVTIDLRGHGESDWSSE GDYRLVSFAGDIQEVLRNLPGQPALVGASLGGFAAMLLAGELSPGIASAVVLVDIVPN MDLAGASRIHAFMAERVESGFGSLDEVADVIANYNPHRPRPSDPDGLVANLRRRGDRW YWHWDPQFIGGIAAFPPVEVTDVDRMNAAVATILRDEVPVLLVRGQVSDIVRQESADQ FLSRFPQVEFTDVRGAGHMVAGDRNDAFAGAVLDFLARHVGVR" CDS complement(3843122..3844471) /codon_start=1 /transl_table=11 /gene="kgtP" /locus_tag="BQ2027_MB3503C" /product="PROBABLE DICARBOXYLIC ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN KGTP (DICARBOXYLATE TRANSPORTER)" /note="Mb3503c, kgtP, len: 449 aa. Equivalent to Rv3476c, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 449 aa overlap). Probable kgtP, dicarboxylate-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), highly similar to others e.g. Q9HT43|PA5530 from Pseudomonas aeruginosa (435 aa), FASTA scores: opt: 1209, E(): 2.3e-68, (47.05% identity in 425 aa overlap); Q9I6Q9|PCAT|PA0229 from Pseudomonas aeruginosa (432 aa), FASTA scores: opt: 1131, E(): 1.8e-63, (40.4% identity in 438 aa overlap); Q9WWZ2 from Pseudomonas putida (429 aa), FASTA scores: opt: 1090, E(): 6.5e-61, (41.2% identity in 425 aa overlap); P17448|KGTP_ECOLI|WITA|B2587 from Escherichia coli strain K12 (432 aa), FASTA scores: opt: 1083, E(): 1.8e-60, (40.05% identity in 422 aa overlap); etc. Also similar to O05301|MTCI364.12|Rv1200 HYPOTHETICAL 44.6 KDA PROTEIN from Mycobacterium tuberculosis (425 aa), FASTA scores: E(): 5.2e-25, (28.5% identity in 382 aa overlap). Contains sugar transport protein signatures 1 and 2 (PS00216, PS00217). BELONG TO THE SUGAR TRANSPORTER FAMILY. Mb3503c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4B6" /db_xref="InterPro:IPR005828" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B6" /protein_id="SIU02131.1" /translation="MTVSIAPPSRPSQAETRRAIWNTIRGSSGNLVEWYDVYVYTVFA TYFEDQFFDRADRNSTVYVYAIFAVTFVTRPVGSWFFGRFADRRGRRAALTFSVSLMA ACSLIVALVPSRSSIGVAAPILLILCRLVQGFATGGEYGTSVTYMSEAATRERRGYFS SFQYVTLVGGHVLAQFTLLVILAVFTREQVHEFGWRIGFAVGGGAAIVVFWLRRTMDE SLSQERLTAIKAGRDHDSGSLRELATHYWKPLLLCFLVTLGGTVAFYTYSVNAPAIVK SVYGSQAMTATWINLVGLILLMMLQPIGGMISDKIGRKPLLLWFGVGGLIYTYVLVTY LPETRSPTMSFLLVAVGYVILTGYCSINALVKSELFPAHVRALGVGVGYALANSVFGG TAPLIYQALKERDQVPMFIAYVTACIAVSLIVYVFFIKNKADTYLDREQGFAFYGHA" CDS 3844844..3845140 /codon_start=1 /transl_table=11 /gene="PE31" /locus_tag="BQ2027_MB3504" /product="pe family protein pe31" /note="Mb3504, PE31, len: 98 aa. Equivalent to Rv3477, len: 98 aa, from Mycobacterium tuberculosis strain H37Rv, (99.0% identity in 98 aa overlap). Member of the M. tuberculosis PE family, similar to O53941|Rv1791|MTV049.13 (99 aa), FASTA scores: opt: 373, E(): 4.3e-18, (64.65% identity in 99 aa overlap); MTCI364.07; MTCY21C12.10c; MTCY1A11.25c; MTC1A11.04; MTCY359.33; etc. Protein product from Mb3504 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3504 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D3" /protein_id="SIU02132.1" /translation="MSFTAQPEMLAAAAGELRSLGATLKVSNAAAAVPTTGVVPPAAD EVSLLLATQFRTHAATYQTASAKAAVIHEQFVTTLATSASSYADTEAANAVVTG" CDS 3845177..3846358 /codon_start=1 /transl_table=11 /gene="PPE60" /locus_tag="BQ2027_MB3505" /standard_name="mtb39c" /product="pe family protein ppe60" /note="Mb3505, PPE60, len: 393 aa. Equivalent to Rv3478, len: 393 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 393 aa overlap). PPE60 (alternate gene name: mtb39c). Member of the M. tuberculosis PPE family, highly similar to others e.g. Q11031|YD61_MYCTU|Rv1361c|MT1406|MTCY02B10.25c (396 aa), FASTA scores: opt: 2165, E(): 1.1e-109, (85.35% identity in 396 aa overlap); MTCI364.08; MTCY10G2.10; MTCY03A2.22c; MTCY274.23c; MTCY164.34c; MTCY98.0029c; etc. Note that expression of Rv3478 was demonstrated in lysates by immunodetection (see citation below). Protein product from Mb3505 detected using SWATH mass spectrometry. Mb3505 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y494" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02133.1" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAAS AFQSVVWGLTVGSWIGSSAGLMAAAASPYVAWMSVTAGQAQLTAAQVRVAAAAYETAY RLTVPPPVIAENRTELMTLTATNLLGQNTPAIEANQAAYSQMWGQDAEAMYGYAATAA TATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQLMNNVPQALQQLAQPAQGV VPSSKLGGLWTAVSPHLSPLSNVSSIANNHMSMMGTGVSMTNTLHSMLKGLAPAAAQA VETAAENGVWAMSSLGSQLGSSLGSSGLGAGVAANLGRAASVGSLSVPPAWAAANQAV TPAARALPLTSLTSAAQTAPGHMLGGLPLGHSVNAGSGINNALRVPARAYAIPRTPAA G" CDS 3846571..3847872 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3507" /product="HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb3507, -, len: 433 aa. Equivalent to 5' end of Rv3479, len: 1075 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 418 aa overlap). Possible transmembrane protein, with hydrophobic stretches at C-terminus. Start changed since first submission (-54 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3479 exists as a single gene. In Mycobacterium bovis, a 713 bp deletion splits Rv3479 into 2 parts, Mb3507 and Mb3508. Mb3507 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4B2" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR019894" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B2" /protein_id="SIU02134.1" /translation="MAGVTREINLLAQASQWRRLGGTFPTNSQLTNESAASLRLYAQL IDLLDMVVDVDILSGTSAGGINAALLASSRVTGSDLGGIRDLWLDLGALTELLRDPRD KKTPSLLYGDERIFAALAKRLPKLATGPFPPTTFPEAARTPSTTLYITTTLLAGETSR FTDSFGTLVQDVDRRGLFTFTETDLARPDTAPALALAARSSASFPLAFEPSFLPFTKG TAKKGEVPARPAMAPFTSLTRPHWVSDGGLLDNRPIGVLFKRIFDRPARRPVRRVLLF VVPSSGPAPDPMHEPPPDNVDEPLGLIDGLLKGLAAVTTQSIAADLRAIRAHQDCMEA RTDAKLRLAELAATLRNGTRLLTPSLLTDYRTREATKQAQTLTSALLRRLSTCPPESG PATESLPKSWSAELTVGGDADKVCRQLASFRCVLQEVMASQ" CDS 3847877..3848923 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3508" /product="HYPOTHETICAL TRANSMEMBRANE PROTEIN [SECOND PART]" /note="Mb3508, -, len: 348 aa. Equivalent to 3' end of Rv3479, len: 1075 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 348 aa overlap). Possible transmembrane protein, with hydrophobic stretches at C-terminus. Start changed since first submission (-54 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3479 exists as a single gene. In Mycobacterium bovis, a 713 bp deletion splits Rv3479 into 2 parts, Mb3507 and Mb3508. Protein product from Mb3508 detected using SWATH mass spectrometry. Mb3508 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5Y5" /db_xref="InterPro:IPR024282" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5Y5" /protein_id="SIU02135.1" /translation="MWGRLDGAGWLVHVLLDPRRVRWIVGERADTNGPQSGAQWFLGK LKELGAPDFPSPGYPLPAVGGGPAQHLTEDMLLDELGFLDDPAKPLPASIPWTALWLS QAWQQRVLEEELDGLANTVLDPQPGKLPDWSPTSSRTWATKVLAAHPGDAKYALLNEN PIAGETFASDKGSPLMAHTVAKAAATAAGAAGSVRQLPSVLKPPLITLRTLTLSGYRV VSLTKGIARSTIIAGALLLVLGVAAAIQSVTVFGVTGLIAAGTGGLLVVLGTWQVSGR LLFALLSFSVVGAVLALATPVVREWLFGTQQQPGWVGTHAYWLGAQWWHPLVVVGLIA LVAIMIAAANPGRR" CDS complement(3848947..3849771) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3509C" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb3509c, -, len: 274 aa. Equivalent to 3' end of Rv3480c, len: 497 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Conserved hypothetical protein, similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa), FASTA scores: opt: 520, E(): 2e-23, (39.95% identity in 488 aa overlap); Q10554|Y895_MYCTU|Rv0895|MTCY31.23 (505 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); AAK45165|MT0919 (520 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); etc. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 272, E(): 1e-08, (28.85% identity in 485 aa overlap); and Q9RIU8|CM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 254, E(): 1.1e-07, (30.4% identity in 497 aa overlap). SEEMS TO BELONG TO THE UPF0089 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3480c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gt-*) splits Rv3480c into 2 parts, Mb3509c and Mb3510c. Protein product from Mb3509c detected using SWATH mass spectrometry. Mb3509c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z3" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z3" /protein_id="SIU02136.1" /translation="MAGAGRSTFELTKALVNAQLRSDHEYRNLVGSVQAPHCILNTRI SRNRRFATQQYPLDRLKAIGAQYDATINDVALAIIGGGLRRFLDELGELPNKSLIVVL PVNVRPKDDEGGGNAVATILATLGTDVADPVQRLAAVTASTRAAKAQLRSMDKDAILA YSAALMAPYGVQLASTLSGVKPPWPYTFNLCVSNVPGPEDVLYLRGSRMEASYPVSLV AHSQALNVTLQSYAGTLNFGFIGCRDTLPHLQRLAVYTGEALDQLAAADGAAGLGS" CDS complement(3849785..3850438) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3510C" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb3510c, -, len: 217 aa. Equivalent to 5' end of Rv3480c, len: 497 aa, from Mycobacterium tuberculosis strain H37Rv, (98.6% identity in 216 aa overlap). Conserved hypothetical protein, similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa), FASTA scores: opt: 520, E(): 2e-23, (39.95% identity in 488 aa overlap); Q10554|Y895_MYCTU|Rv0895|MTCY31.23 (505 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); AAK45165|MT0919 (520 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); etc. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 272, E(): 1e-08, (28.85% identity in 485 aa overlap); and Q9RIU8|CM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 254, E(): 1.1e-07, (30.4% identity in 497 aa overlap). SEEMS TO BELONG TO THE UPF0089 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3480c exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2 bp deletion (gt-*) splits Rv3480c into 2 parts, Mb3509c and Mb3510c. Mb3510c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y493" /db_xref="InterPro:IPR004255" /db_xref="UniProtKB/TrEMBL:A0A1R3Y493" /protein_id="SIU02137.1" /translation="MSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPDLL RQLVDESKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRSALASPGDERE LGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMHHSLIDGYTGQKMLARSLS TDPHDTTHPLFFNIPTPGRSPADTQDSVGGGLIAGAGNVLDGLGDVVRGLGGRQRGR" CDS complement(3850529..3851218) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3511C" /product="PROBABLE INTEGRAL MEMBRANE PROTEIN" /note="Mb3511c, -, len: 229 aa. Equivalent to Rv3481c, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Probable integral membrane protein. No real similarity with others. Mb3511c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4E8" /db_xref="InterPro:IPR021315" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02138.1" /translation="MRGLLPVAGHWVSVLTGLVPLALVIALSPLSVIPAVLVVHSPQP RPSSLAFLGGWLLGLAVVTAVFVAASGALGGLSTTSPAWASWLRVVLGSALIVFGVLR WLTRHRHTEMPGWMRAFASFTPARAGLVGAVLVVVRPEVLIICAAAGLAIGSGGHGAA GSWIYTAFFAMLAASTVAIPILAYVAAGDRLDDSLERLKDWMEKNHAGMVAAILVVIG LLLLYNGVHAM" CDS complement(3851360..3852142) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3512C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb3512c, -, len: 260 aa. Equivalent to Rv3482c, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). Probable conserved membrane protein. N-terminal region shares some similarity with N-terminus of O88067|SCI35.32c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (319 aa), FASTA scores: opt: 155, E(): 0.023, (54.55% identity in 33 aa overlap); and with C-terminus of O06254|Rv3437|MTCY77.09 HYPOTHETICAL 17.9 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (alias AAK47883|MT3542.1 from strain CDC1551) (158 aa), FASTA scores: opt: 140, E(): 0.11, (58.8% identity in 34 aa overlap). Some similarity to others e.g. Q9XAN5|SC4C6.05c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (347 aa), FASTA scores: opt: 131, E(): 0.75, (29.4% identity in 221 aa overlap). First start taken. Protein product from Mb3512c detected using SWATH mass spectrometry. Mb3512c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4E1" /db_xref="InterPro:IPR018929" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E1" /protein_id="SIU02139.1" /translation="MEHDVATSPPAGWYTDPDGSAGQRYWDGDRWTRHRRPNPSAPRS PLALRVDGLRSRWLGMPAGLRLTVPVAAVLTMVGVAVYAWIRPLPDDWSQLPKRLSCQ LRPGPTPPATITVASVDVSHPRGAVLRLVVRFAEPLPPSPSGSFASGFAGYLLTYTIA NNGKEFAELGPQQDTDELAIRKPGESRGTEPNMRPDRNTNARRTAPDTVEINLETKRL GLDQAPVDPQLTFAAQFRTPSTVTVDFGSQFCQGERLAGQRR" CDS complement(3852186..3852848) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3513C" /product="possible exported protein" /note="Mb3513c, -, len: 220 aa. Equivalent to Rv3483c, len: 220 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 220 aa overlap). Conserved hypothetical protein, similar to Q9CC94|ML1099 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (202 aa), FASTA scores: opt: 276, E(): 1.4e-08, (33.1% identity in 148 aa overlap). Also showing similarity with Mycobacterium tuberculosis proteins Q11065|LPRE_MYCTU|LPRE|Rv1252c|MT1291|MTCY50.30. PUTATIVE LIPOPROTEIN PRECURSOR (202 aa), FASTA scores: opt: 276, E(): 1.4e-08, (29.5% identity in 200 aa overlap); O53445|Rv1097c|MTV017.50c HYPOTHETICAL 29.9 KDA PROTEIN (293 aa), FASTA scores: opt: 161, E(): 0.047, (25.4% identity in 118 aa overlap); P71882|LPPP_MYCTU|Rv2330c|MT2392|MTCY3G12.04 PUTATIVE LIPOPROTEIN PRECURSOR (175 aa), FASTA scores: opt: 146, E(): 0.21, (28.25% identity in 184 aa overlap); and O06170|Rv2507|MTCY07A7.13 HYPOTHETICAL 28.5 KDA PROTEIN (273 aa), FASTA scores: opt: 148, E(): 0.23, (25.15% identity in 191 aa overlap). Protein product from Mb3513c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3513c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4A7" /db_xref="InterPro:IPR025971" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4A7" /protein_id="SIU02140.1" /translation="MSDEIDPDWPAPAYQPSDDVDTTPPAPGGSWPTAWLVALVVLAC VAAAVVAYAGMHRVRPGANQAAPATTSAPARPTSPASQVGPCGPDEATAVRAALAQLA PDSKTGRPWNSTPEDSNYDPCADLSAVLVTVQDATNSSPDQALMFHRGTFVGTATPRA YPFTNLIGPASTNDIVVLSYRTRQSCDGCQDGILTIVGFAWRGDHVQILDSLPELFDA PP" CDS 3853114..3854652 /codon_start=1 /transl_table=11 /gene="cpsA" /locus_tag="BQ2027_MB3514" /product="Cell envelope-associated transcriptional attenuator LytR-CpsA-Psr, subfamily A1" /note="Mb3514, cpsA, len: 512 aa. Equivalent to Rv3484, len: 512 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 512 aa overlap). Possible cpsA, hypothetical protein, equivalent to Q50160|CPSA|ML2247 HYPOTHETICAL PROTEIN CPSA from Mycobacterium leprae (516 aa), FASTA scores: opt: 2557, E(): 1.6e-143, (74.9% identity in 518 aa overlap); and with good similarity to Q9CCK9|ML0750 HYPOTHETICAL PROTEIN from Mycobacterium leprae (489 aa), FASTA scores: opt: 855, E(): 4.6e-43, (34.45% identity in 502 aa overlap). Also similar (or with similarity) to hypothetical proteins from Mycobacterium tuberculosis: P96872|Rv3267|MTCY71.07 (498 aa), FASTA scores: opt: 928, E(): 2.3e-47, (37.35% identity in 498 aa overlap); and O53834|Rv0822c|MTV043.14c (684 aa), FASTA scores: opt: 425, E(): 1.5e-17, (26.15% identity in 524 aa overlap). Shows also similarity with various bacterial proteins e.g. Q9KZK0|SCE34.26 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (507 aa), FASTA scores: opt: 329, E(): 5.3e-12, (28.85% identity in 478 aa overlap); Q9K4E6|2SC6G5.02 CONSERVED HYPOTHETICAL PROTEIN, POSSIBLE MEMBRANE PROTEIN, from Streptomyces coelicolor (382 aa), FASTA scores: opt: 305, E(): 1.1e-10, (29.8% identity in 386 aa overlap); O69850|SC1C3.08c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (366 aa), FASTA scores: opt: 304, E(): 1.2e-10, (29.6% identity in 395 aa overlap); Q9KZK3|SCE34.23 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (396 aa), FASTA scores: opt: 296, E(): 3.8e-10, (31.25% identity in 349 aa overlap); AAK43602|CPSA CPSA PROTEIN from Streptococcus agalactiae (485 aa), FASTA scores: opt: 250, E(): 2.4e-07, (30.25% identity in 162 aa overlap); etc. Protein product from Mb3514 detected using SWATH mass spectrometry. Mb3514 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C8" /db_xref="InterPro:IPR004474" /db_xref="InterPro:IPR027381" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C8" /protein_id="SIU02141.1" /translation="MARSEGNRPRHRAVPQPSRIRKRLSRGVMTLVSVVALLMTGAGY WVAHGALGGITISQALTPEDPRSSGNNMNILLIGLDSRKDQEGNDLPWSVLKQLHAGD SDDGGYNTNTLILVHVGADGKVVAFSIPRDDWVPFTGVPGYNHIKIKEAYGLTKQYVA EQLANQGVSDRKELETRGREAARAATLRAVRSLTGVPIDYFAEINLAGFYDLAQTLGG VDVCLNHAVYDSYSGADFPAGRQRLNAAQALAFVRQRHGLDNGDLDRTHRQQAFLSSV MRELQDSGTFTNLDRLDNLMAVARKDVVLSAGWDEDLFRRMGDLAGGNVEFRTLPVVR YDNIDGQDVNIIDPTAIRAEVAAAFGSAPPTSQTAAAAKPNPSTVVDVVNAGSISGLA SQVSGALLKRGYTAGQVRDRESGDPFTTAIEYGAGAETDAQNVADLLGIDAPNHPDPA VAPGHIRVTVDTNFSLPAPDEATAAATSTETSTYPLYGGGTTTDPTPDQGAPIDGGGV PCVN" CDS complement(3854658..3855602) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3515C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb3515c, -, len: 314 aa. Equivalent to Rv3485c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar, but longer 41 aa, to P71824|Rv0769|MTCY369.14 PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE CY369.14 from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 462, E(): 1.8e-19, (34.0% identity in 253 aa overlap). Also similar to various dehydrogenases e.g. P25529|HDHA_ECOLI|HSDH|B1619 NAD-DEPENDENT 7 ALPHA-HYDROXYSTEROID DEHYDROGENASE (SDR FAMILY) (EC 1.1.1.159) from Escherichia coli strain K12 (alias BAB35750|ECS2327 or AAG56608|HDHA for strain O157:H7) (255 aa), FASTA scores: opt: 462, E(): 1.8e-19, (34.7% identity in 248 aa overlap); Q9FD15|RUBG PUTATIVE REDUCTASE (SDR FAMILY) from Streptomyces collinus (249 aa), FASTA scores: opt: 446, E(): 1.5e-18, (36.1% identity in 255 aa overlap); BAB51974|MLL5540 PUTATIVE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (253 aa), FASTA scores: opt: 442, E(): 2.5e-18, (36.25% identity in 251 aa overlap); Q08632|SDR1_PICAB SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE (SDR FAMILY) from Picea abies (Norway spruce) (Picea excelsa) (271 aa), FASTA scores: opt: 441, E(): 3.1e-18, (32.3% identity in 260 aa overlap); Q9A326|CC3380 2-DEOXY-D-GLUCONATE 3-DEHYDROGENASE from Caulobacter crescentus (260 aa), FASTA scores: opt: 436, E(): 5.7e-18, (32.8% identity in 253 aa overlap); Q16698|DECR_HUMAN 2,4-DIENOYL-COA REDUCTASE, MITOCHONDRIAL PRECURSOR (EC 1.3.1.34) from Homo sapiens (Human) (335 aa), FASTA scores: opt: 430, E(): 1.5e-17, (30.4% identity in 306 aa overlap); etc. Contains short-chain alcohol dehydrogenase family signature (PS00061). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES FAMILY (SDR). Protein product from Mb3515c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3515c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4D9" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02142.1" /translation="MNSRAPRNLAVSSPSAQVTGRMVQNGENLFQFRREGPQVQLSFQ DRTYLVTGGGSGIGKGVAAGLVAAGAAVMIVGRNPDKLAAAVKDIEALKTGAIGYEPA DITDEEQTLRVVDAATAWHGRLHGVVHCAGGSQTIGPITQIDSQAWRRTVDLNVNGTM YVLKHAARELVRGGGGSFVGISSIAASNTHRWFGAYGVTKSAVDHMMKLAADELGPSW VRVNSIRPGLIRTDLVVPVTESPELSADYRVCTPLPRVGEVEDVANLAMFLLSDAASW ITGQVINVDGGHMLRRGPDFSPMLEPVFGADGLRGVVG" CDS 3855808..3856257 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3516" /product="DoxX family protein" /note="Mb3516, -, len: 149 aa. Equivalent to Rv3486, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 149 aa overlap). Conserved hypothetical protein, similar to Q9RC47|YFID|BH3304 HYPOTHETICAL PROTEIN from Bacillus halodurans (129 aa), FASTA scores: opt: 186, E(): 2.1e-05, (40.0% identity in 95 aa overlap); and Q9KKT1|VCA1019 HYPOTHETICAL PROTEIN from Vibrio cholerae (148 aa), FASTA scores: opt: 128, E(): 0.15, (35.25% identity in 139 aa overlap). Some similarity to other proteins e.g. P54720|YFID_BACSU HYPOTHETICAL PROTEIN from Bacillus subtilis (134 aa), FASTA scores: opt: 165, E(): 0.00052, (31.75% identity in 126 aa overlap). Equivalent to AAK47949 from Mycobacterium tuberculosis strain CDC1551 (163 aa) but shorter 14 aa. Mb3516 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR032808" /db_xref="UniProtKB/TrEMBL:A0A1R3Y499" /protein_id="SIU02143.1" /translation="MHAEGPPSVICIRLLVGLVFLSEGIQKFMYPDQLGPGRFERIGI PAATFFADLDGVVEIVCGTLVLLGLLTRVAAVPLLIDMVGAIVLTKLRALQPGGFLGV EGFWGMAHAARTDLSMLLGLIFLLWSGPGRWSLDRRLSKRATACGAR" CDS complement(3856210..3857340) /codon_start=1 /transl_table=11 /gene="lipF" /locus_tag="BQ2027_MB3517C" /product="PROBABLE ESTERASE/LIPASE LIPF" /note="Mb3517c, lipF, len: 277 aa. Equivalent to Rv3487c, len: 277 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 277 aa overlap). Probable lipF, esterase/lipase (EC 3.-.-.-), highly similar, but shorter 50 aa, to O53424|LIPU|Rv1076|MTV017.29 PUTATIVE ESTERASE/LIPASE from Mycobacterium tuberculosis (297 aa), FASTA scores: opt: 1229, E(): 3.3e-71, (76.4% identity in 246 aa overlap); and similar to other putative lipases from Mycobacterium tuberculosis e.g. P71759|LIPK|RV2385|MTCY253.36c (306 aa), FASTA scores: opt: 468, E(): 1.2e-22, (36.2% identity in 254 aa overlap). Equivalent, but shorter 79 aa, to Q9ZBM4|MLCB1450.08|ML0314 PUTATIVE HYDROLASE (PUTATIVE ESTERASE) from Mycobacterium leprae (335 aa), FASTA scores: opt: 1225, E(): 6.6e-71, (73.6% identity in 250 aa overlap). Also similar to esterases and lipases of around 300 aa e.g. Q44087|EST ESTERASE PRECURSOR from Acinetobacter lwoffii (303 aa), FASTA scores: opt: 428, E(): 4.3e-20, (31.85% identity in 251 aa overlap); P18773|EST_ACICA ESTERASE (EC 3.1.1.-) from Acinetobacter calcoaceticus (303 aa), FASTA scores: opt: 420, E(): 1.4e-19, (31.5% identity in 251 aa overlap); Q9KIU1 ESTERASE from uncultured bacterium Plasmid pAH116 (308 aa), FASTA scores: opt: 405, E(): 1.3e-18, (35.1% identity in 242 aa overlap); Q9X8J4|SCE9.22 PUTATIVE ESTERASE from Streptomyces coelicolor (266 aa), FASTA scores: opt: 390, E(): 1e-17, (35.85% identity in 237 aa overlap); etc. Equivalent to AAK47950 from Mycobacterium tuberculosis strain CDC1551 (327 aa) but shorter 50 aa. Protein product from Mb3517c detected using SWATH mass spectrometry. Mb3517c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4B9" /db_xref="InterPro:IPR013094" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR033140" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B9" /protein_id="SIU02144.1" /translation="MLKMSSYYARRPLQSSGCSNSDSCWDGAPIEITESGPSVAGRLA ALASRMTIKPLMTVGSYLSPLPLPLGFVDFACRVWRPGQGTVRTTINLPNATAQLVRA PGVRAADGAGRVVLYLHGGAFVMCGPNSHSRIVNALSGFAESPVLIVDYRLIPKHSLG MALDDCHDAYQWLRARGYRPEQIVLAGDSAGGYLALALAQRLQCDDEKPAAIVAISPL LQLAKGPKQDHPNIGTDAMFPARAFDALAAWVRAAAAKNMVDGRPEDLYEPLDHIESS LPPTLIHVSGSEVLLHDAQLGAGKLAAAGVCAEVRVWPGQAHLFQLATPLVPEATRSL RQIGQFIRDATADSSLSPVHRSRYVAGSPRAASRGAFGQSPI" CDS 3857703..3858026 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3518" /product="Transcriptional regulator, PadR family" /note="Mb3518, -, len: 107 aa. Equivalent to Rv3488, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 107 aa overlap). Hypothetical protein, similar to various bacterial proteins e.g. O28730|AF1542 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (101 aa), FASTA scores: opt: 321, E(): 6.4e-15, (50.55% identity in 87 aa overlap); O50207 SQ1_IV (FRAGMENT) from Rhodococcus erythropolis (59 aa), FASTA scores: opt: 298, E(): 1.4e-13, (71.2% identity in 59 aa overlap); Q9KFB0|BH0575 BH0575 PROTEIN from Bacillus halodurans (102 aa), FASTA scores: opt: 294, E(): 4.1e-13, (43.15% identity in 95 aa overlap); etc. Also similar to M. tuberculosis P71704|Rv0047c|MTCY21D4.10c (180 aa) (37.8% identity in 82 aa overlap). Protein product from Mb3518 detected using SWATH mass spectrometry. Mb3518 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR005149" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5Z2" /protein_id="SIU02145.1" /translation="MREFQRAAVRLHILHHAADNEVHGAWLTQELSRHGYRVSPGTLY PTLHRLEADGLLVSEQRVVDGRARRVYRATPAGRAALTEDRRALEELAREVLGRQSHT AGNGT" CDS 3858108..3858272 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3519" /product="unknown protein" /note="Mb3519, -, len: 54 aa. Equivalent to Rv3489, len: 54 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 54 aa overlap). Hypothetical unknown protein. No similarity with other proteins. Protein product from Mb3519 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3519 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z9" /protein_id="SIU02146.1" /translation="MSTKSDHGEIGDVEPLADSTASQARRVVAAYANDADECRIFLSM LGIGPAKLES" CDS 3858272..3859774 /codon_start=1 /transl_table=11 /gene="otsA" /locus_tag="BQ2027_MB3520" /product="alpha, alpha-trehalose-phosphate synthase [udp-forming] otsa (trehalose-6-phosphate synthase) (udp-glucose-glucosephosphate glucosyltransferase) (trehalosephosphate-udp glucosyltransferase) (trehalose-6-phosphate synthetase) (trehalose-phosphate synthase) (trehalose-phosphate synthetase) (transglucosylase) (trehalosephosphate-udp glucosyl transferase)" /note="Mb3520, otsA, len: 500 aa. Equivalent to Rv3490, len: 500 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 500 aa overlap). Probable otsA, alpha, alpha-trehalose-phosphate synthase (EC 2.4.1.15) (see citations below), equivalent to Q50167|OTSA|ML2254 PROBABLE TREHALOSE-PHOSPHATE SYNTHASE from Mycobacterium leprae (498 aa), FASTA scores: opt: 2706, E(): 1.6e-166, (80.3% identity in 497 aa overlap). Also similar to others e.g. Q92410|TPS1_CANAL from Candida albicans (Yeast) (478 aa), FASTA scores: opt: 895, E(): 4.9e-50, (37.15% identity in 479 aa overlap); Q00764|TPS1_YEASTTPS1|CIF1|BYP1|FDP1|GGS1|GLC6|YBR126c|YBR 0922 from Saccharomyces cerevisiae (Baker's yeast) (495 aa), FASTA scores: opt: 847, E(): 6.2e-47, (36.1% identity in 490 aa overlap); BAB48232|MLL0691 from Rhizobium loti (Mesorhizobium loti) (520 aa), FASTA scores: opt: 884, E(): 2.7e-49, (36.2% identity in 478 aa overlap); etc. Equivalent to AAK47953 from Mycobacterium tuberculosis strain CDC1551 (478 aa) but longer 22 aa. Protein product from Mb3520 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3520 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWE1" /db_xref="InterPro:IPR001830" /db_xref="UniProtKB/Swiss-Prot:Q7TWE1" /protein_id="SIU02147.1" /translation="MAPSGGQEAQICDSETFGDSDFVVVANRLPVDLERLPDGSTTWK RSPGGLVTALEPVLRRRRGAWVGWPGVNDDGAEPDLHVLDGPIIQDELELHPVRLSTT DIAQYYEGFSNATLWPLYHDVIVKPLYHREWWDRYVDVNQRFAEAASRAAAHGATVWV QDYQLQLVPKMLRMLRPDLTIGFFLHIPFPPVELFMQMPWRTEIIQGLLGADLVGFHL PGGAQNFLILSRRLVGTDTSRGTVGVRSRFGAAVLGSRTIRVGAFPISVDSGALDHAA RDRNIRRRAREIRTELGNPRKILLGVDRLDYTKGIDVRLKAFSELLAEGRVKRDDTVL VQLATPSRERVESYQTLRNDIERQVGHINGEYGEVGHPVVHYLHRPAPRDELIAFFVA SDVMLVTPLRDGMNLVAKEYVACRSDLGGALVLSEFTGAAAELRHAYLVNPHDLEGVK DGIEEALNQTEEAGRRRMRSLRRQVLAHDVDRWAQSFLDALAGAHPRGQG" CDS 3859926..3860504 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3521" /product="unknown protein" /note="Mb3521, -, len: 192 aa. Equivalent to Rv3491, len: 192 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 192 aa overlap). Hypothetical unknown protein. No significant homology with other proteins. Protein product from Mb3521 detected using SWATH mass spectrometry. Mb3521 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F8" /protein_id="SIU02148.1" /translation="MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQ AGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFS GRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPK LSKQPFSLQLIGPPPSPVQRYPLYCNNIAMCY" CDS complement(3860501..3860983) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3522C" /product="FIG00820195: MCE associated membrane protein" /note="Mb3522c, -, len: 160 aa. Equivalent to Rv3492c, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Conserved hypothetical Mce-associated protein, showing some similarity to hypothetical Mycobacterium tuberculosis proteins e.g. O53974|Rv1973|MTV051.11 (near Mce operon 3) (160 aa), FASTA scores: opt: 214, E(): 2.6e-07, (25.3% identity in 154 aa overlap); and Q11032|YD62_MYCTU|Rv1362c|MT1407|MTCY02B10.26c (220 aa), FASTA scores: opt: 187, E(): 2e-05, (23.4% identity in 154 aa overlap). Contains lipocalin signature at C-terminus (PS00213). Protein product from Mb3522c detected using SWATH mass spectrometry. Mb3522c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4E5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E5" /protein_id="SIU02149.1" /translation="MRRLISVAYALMVATIVGLSAAGGWFYWDRVQTGGEASARALLP KLAMQEIPQVFGYDYQTVERSLTAVYPLLTPDYRQEFQKSANAQIIPEAKKREVVVQA NVVGVGVMDAKRDCASVMVYLNRTVTDKTRQPLYDGSRLRVDFQRIDGKWLIAYITPI " CDS complement(3860983..3861711) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3523C" /product="FIG00821219: MCE associated membrane protein" /note="Mb3523c, -, len: 242 aa. Equivalent to Rv3493c, len: 242 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 242 aa overlap). Conserved hypothetical Mce-associated ala-, val-rich protein, showing weak similarity to O07422|Z97050|Rv0178|MTCI28.18 HYPOTHETICAL 25.9 KDA PROTEIN (near Mce operon1) from Mycobacterium tuberculosis (244 aa), FASTA scores: opt: 163, E(): 0.046, (24.65% identity in 211 aa overlap). Protein product from Mb3523c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3523c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4B3" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B3" /protein_id="SIU02150.1" /translation="MAADTGVAGGQQSTTRRARRKASRPAGPAEGESSRPAQGAATVR AAARTESKPAKAAKPALRPVKPPPRRPAHRVLVGWLSLAAGLLAIAALAWGVTALVMQ NRDADARQARNQRFVDAATQTVVNMFSYTPDTIDESVNRFVNGTSGPLRGMLNANNNV DNLKGLFRATNATSEAVVNGAALEGIDEISDNASVLVSVRVTVADIDGVNKPSMPYRL RVIVHEDENGRMTGYDLKYPDGGN" CDS complement(3861711..3863405) /codon_start=1 /transl_table=11 /gene="mce4F" /locus_tag="BQ2027_MB3524C" /product="MCE-FAMILY PROTEIN MCE4F" /note="Mb3524c, mce4F, len: 564 aa. Equivalent to Rv3494c, len: 564 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 564 aa overlap). mce4F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), similar to Mycobacterium tuberculosis proteins O07418|Rv0174|MTCI28.14|mce1F (515 aa); O07784|Rv0594|MTCY19H5.28c|mce2F (516 aa); and O53972|Rv1971|MTV051.09|mce3F (437 aa). Also similar to others e.g. Q9CD09|MCE1F|ML2594 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (516 aa), FASTA scores: opt: 1040, E(): 3.6e-31, (35.9% identity in 529 aa overlap); Q9F361|SC8A2.02c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (433 aa), FASTA scores: opt: 570, E(): 3.7e-14, (30.8% identity in 458 aa overlap); etc. Has hydrophobic stretch, possibly a signal peptide at the N-terminus. Protein product from Mb3524c detected using SWATH mass spectrometry. Mb3524c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4D6" /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D6" /protein_id="SIU02151.1" /translation="MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSA DFVAGGGLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRSVS AIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLLGSLGDTRLRE LLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQLIDQAGPFLQAQIRAGGDI KSLADGLARFTWQLRAADPRLRDTLAGAPDAIDEANTAFSGIRPSFPALAASLANLGR VGVIYHKSIEQLLVVFPALFAAIITSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPP LVRSPADESVREIPRDMYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYV PVGTNPWRGPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPEGTGPPPGP APGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASSAENWVDLMRDPRQL" CDS complement(3863416..3864570) /codon_start=1 /transl_table=11 /gene="lprN" /locus_tag="BQ2027_MB3525C" /standard_name="mce4E" /product="POSSIBLE MCE-FAMILY LIPOPROTEIN LPRN (MCE-FAMILY LIPOPROTEIN MCE4E)" /note="Mb3525c, lprN, len: 384 aa. Equivalent to Rv3495c, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 384 aa overlap). Possible lprN (alternate gene name: mce4E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07417|LPRK|Rv0173|MTCI28.13|mce1E (390 aa); O07785|LPRL|Rv0593|MTCY19H5.29|mce2E (402 aa); and O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa). Also similar to others e.g. Q9F360|SC8A2.03c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (413 aa), FASTA scores: opt: 656, E(): 2.2e-32, (37.55% identity in 317 aa overlap); Q9CD10|LPRK|ML2593 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (392 aa), FASTA scores: opt: 616, E(): 5.5e-30, (28.95% identity in 373 aa overlap); etc. Contains possible signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3525c detected using SWATH mass spectrometry. Mb3525c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E7" /protein_id="SIU02152.1" /translation="MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSV TVEMADVATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPANAV AKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEVFSALGVVVNK GNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNRQVHDIIDALDGLNRVSAI LARDKDNLGRALDTLPDAVRVLNQNRDHIVDAFAALKRLTMVTSHVLAETKVDFGEDL KDLYSIVKALNDDRKDFVTSLQLLLTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGE TFFTTAYFDPNMAHMDEILNPPDFLIGELANLSGQAADPFKIPPGTASGQ" CDS complement(3864567..3865922) /codon_start=1 /transl_table=11 /gene="mce4D" /locus_tag="BQ2027_MB3526C" /product="MCE-FAMILY PROTEIN MCE4D" /note="Mb3526c, mce4D, len: 451 aa. Equivalent to Rv3496c, len: 451 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 451 aa overlap). mce4D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O07786|Rv0592|MTCY19H5.30c|mce2D (508 aa); and O53970|Rv1969|MTV051.07|mce3D (423 aa). Also similar to others e.g. Q9CD11|MCE1D|ML2592 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (531 aa), FASTA scores: opt: 837, E(): 2.6e-34, (34.55% identity in 446 aa overlap); Q9F359|SC8A2.04c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (337 aa), FASTA scores: opt: 606, E(): 4.9e-23, (32.35% identity in 300 aa overlap); etc. Hydrophobic region at N-terminus. Protein product from Mb3526c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3526c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="InterPro:IPR024516" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4A4" /protein_id="SIU02153.1" /translation="MMGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVG YFTSAVGLYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMSPN LVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAADLSPAAGELQ GPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRGDIFGTVKNLQVLVDALSE SDEQIVQFAGHVASVSQVLADSSANLDQTLGTLNQALSDIRGFLRENNSTLIETVNQL NDFAQTLSDQSENIEQVLHVAGPGITNFYNIYDPAQGTLNGLLSIPNFANPVQFICGG SFDTAAGPSAPDYYRRAEICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTP ATEAKSETPVPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGG G" CDS complement(3865919..3866992) /codon_start=1 /transl_table=11 /gene="mce4C" /locus_tag="BQ2027_MB3527C" /product="MCE-FAMILY PROTEIN MCE4C" /note="Mb3527c, mce4C, len: 357 aa. Equivalent to Rv3497c, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 357 aa overlap). mce4C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07415|R0171|MTCI28.11|mce1C (515 aa); O07787|Rv0591|MTCY19H5.31|mce2C (481 aa); and O53969|Rv1968|MTV051.06|mce3C (410 aa). Also similar to others e.g. Q9F358|SC8A2.05c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (351 aa), FASTA scores: opt: 658, E(): 1.1e-30, (33.95% identity in 318 aa overlap); Q9CD12|MCE1C|ML2591 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (519 aa), FASTA scores: opt: 555, E(): 1.2e-24, (28.35% identity in 328 aa overlap); etc. Hydrophobic region at N-terminus. Protein product from Mb3527c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3527c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C5" /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C5" /protein_id="SIU02154.1" /translation="MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQG KTYDAYFTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSLAA IRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNANDLNRPQFEQA LNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLLAHAKSVTSVLSERAEQVN KLVEDGNQLFAALDARRAALSALISGIDDVAAQISGFVADNRKEFGPALSKLNLVLAN LNERRDYITEALKRLPTYATTLGEVVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKL PDSLADYLRGFIQERWIIRPKSP" CDS complement(3866982..3868034) /codon_start=1 /transl_table=11 /gene="mce4B" /locus_tag="BQ2027_MB3528C" /product="MCE-FAMILY PROTEIN MCE4B" /note="Mb3528c, mce4B, len: 350 aa. Equivalent to Rv3498c, len: 350 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 350 aa overlap). mce4B; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07414|Rv0170|MTCI28.10|mce1B (346 aa); O07788|Rv0590|MTCY19H5.32c|mce2B (275 aa); and O53968|Rv1967|MTV051.05|mce3B (342 aa). Also similar to others e.g. Q9CD13|MCE1B|ML2590 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (346 aa), FASTA scores: opt: 803, E(): 6.1e-41, (41.05% identity in 346 aa overlap); Q9F357|SC8A2.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (354 aa), FASTA scores: opt: 624, E(): 3.4e-30, (32.55% identity in 338 aa overlap); etc. Hydrophobic region at N-terminus. Protein product from Mb3528c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3528c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y606" /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="UniProtKB/TrEMBL:A0A1R3Y606" /protein_id="SIU02155.1" /translation="MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTV YHATFTDASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRAVI RYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLRPVLKGFDADK INTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLIGEVITNLNAVLATVDAKS AQFSASVDQLQQLVSGLAKNRDPIAGAISPLASTTTDLTELLRNSRRPLQGILENARP LATELDNRKAEVNNDIEQLGEDYLRLSALGSYGAFFNIYFCSVTIKINGPAGSDILLP IGGQPDPSKGRCAFAK" CDS complement(3868034..3869236) /codon_start=1 /transl_table=11 /gene="mce4A" /locus_tag="BQ2027_MB3529C" /standard_name="mce4" /product="MCE-FAMILY PROTEIN MCE4A" /note="Mb3529c, mce4A, len: 400 aa. Equivalent to Rv3499c, len: 400 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 400 aa overlap). mce4A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); and O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa). Also similar to others e.g. Q9F356|SC8A2.07c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (418 aa), FASTA scores: opt: 619, E(): 7.8e-30, (32.4% identity in 352 aa overlap); Q9S4U5|MCE1 MYCOBACTERIAL CELL ENTRY PROTEIN from Mycobacterium bovis BCG (454 aa), FASTA scores: opt: 529, E(): 2.1e-24, (30.35% identity in 448 aa overlap); Q9CD14|MCE1A|ML2589 from Mycobacterium leprae (441 aa), FASTA scores: opt: 515, E(): 1.4e-23, (28.35% identity in 430 aa overlap); etc. Contains a possible N-terminal signal sequence. Note that previously known as mce4. Protein product from Mb3529c detected using SWATH mass spectrometry. Mb3529c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y500" /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="InterPro:IPR024516" /db_xref="UniProtKB/TrEMBL:A0A1R3Y500" /protein_id="SIU02156.1" /translation="MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVT VSSPRAGLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATVRI AGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLIDLLHKIDPLET NATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQEDFRKAAVVANVYADAAG DLNTVFDNLPTINKTIVDQKDNLNDTLLATIGLSNNAYETLAPAEQNFIDAINRLRAP LKVTSDYSPVFGCLFKGIARGVKEFAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIV NASGGPNCRGLPDIPTKQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNG AFAERDDF" CDS complement(3869256..3870098) /codon_start=1 /transl_table=11 /gene="yrbE4B" /locus_tag="BQ2027_MB3530C" /product="conserved integral membrane protein yrbe4b. possible abc transporter." /note="Mb3530c, yrbE4B, len: 280 aa. Equivalent to Rv3500c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). yrbE4B, hypothetical unknown integral membrane protein, part of mce4 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07413|Rv0168|MTCI28.08|yrbE1B (289 aa); O07790|Rv0588|MTCY19H5.34|yrbE2B (295 aa); and O53966|Rv1965|MTV051.03|yrbE3B (271 aa). Also highly similar to conserved hypothetical integral membrane proteins of the P45030|YRBE_HAEIN (261 aa) type, e.g. Q9CD15|YRBE1B|ML2588 from Mycobacterium leprae (289 aa), FASTA scores: opt: 973, E(): 1.5e-50, (50.2% identity in 269 aa overlap); P45030|YRBE_HAEIN|HI1086 from Haemophilus influenzae (261 aa), FASTA scores: opt: 270, E(): 6e-11, (25.4% identity in 264 aa overlap); etc. Protein product from Mb3530c detected using SWATH mass spectrometry. Mb3530c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4B5" /db_xref="InterPro:IPR030802" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4B5" /protein_id="SIU02157.1" /translation="MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRY RKETVRLVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALTGF LSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAVHSVSYLVSTR LIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDHYFNTFLIPSDLLWSFMQA IAMSIAVMLVHTYYGYNASGGSVGVGVAVGQAVRTSLIVVVVITLFISLAVYGASGNF NLSG" CDS complement(3870133..3870897) /codon_start=1 /transl_table=11 /gene="yrbE4A" /locus_tag="BQ2027_MB3531C" /product="conserved integral membrane protein yrbe4a. possible abc transporter." /note="Mb3531c, yrbE4A, len: 254 aa. Equivalent to Rv3501c, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). yrbE4A, hypothetical unknown integral membrane protein, part of mce4 operon and member of YrbE family (see citations below for more information), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa); O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa); and O53965|Rv1964|MTV051.02|yrbE3A (265 aa). Also highly similar to conserved hypothetical integral membrane proteins of the P45030|YRBE_HAEIN (261 aa) type, e.g. Q9CD16|YRBE1A|ML2587 from Mycobacterium leprae (267 aa), FASTA scores: opt: 1059, E(): 1e-57, (64.75% identity in 247 aa overlap); P45030|YRBE_HAEIN|HI1086 from Haemophilus influenzae (261 aa), FASTA scores: opt: 313, E(): 3e-14, (25.7% identity in 241 aa overlap); etc. Protein product from Mb3531c detected using SWATH mass spectrometry. Mb3531c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4H4" /db_xref="InterPro:IPR030802" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H4" /protein_id="SIU02158.1" /translation="MIQQLAVPARAVGGFFEMSMDTARAAFRRPFQFREFLDQTWMVA RVSLVPTLLVSIPFTVLVAFTLNILLREIGAADLSGAGTAFGTITQLGPVVTVLVVAG AGATAICADLGARTIREEIDAMRVLGIDPIQRLVVPRVLASTLVALLLNGLVCAIGLS GGYAFSVFLQGVNPGAFINGLTVLTGLRELILAEIKALLFGVMAGLVGCYRGLTVKGG PKGVGNAVNETVVYAFICLFVINVVMTAIGVRISAQ" CDS complement(3871123..3872076) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3532C" /product="probable short-chain type dehydrogenase/reductase. possible 17-beta-hydroxysteroid dehydrogenase." /note="Mb3532c, -, len: 317 aa. Equivalent to Rv3502c, len: 317 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 317 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to Mycobacterium tuberculosis proteins P71853|Rv3548c|MTCY03C7.08 HYPOTHETICAL 31.1 KDA PROTEIN (304 aa), FASTA scores: opt: 739, E(): 6.2e-35, (45.15% identity in 310 aa overlap); and Q11020|YD50_MYCTU|FABG2|Rv1350|MT1393|MTCY02B10.14 PUTATIVE OXIDOREDUCTASE (247 aa), FASTA scores: opt: 475, E(): 5.1e-20, (40.15% identity in 254 aa overlap). Also similar to various dehydrogenases e.g. Q9I4V1|PA1023 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (305 aa), FASTA scores: opt: 535, E(): 2.3e-23, (37.1% identity in 302 aa overlap); Q9UVH9|FOX2 FOX2 PROTEIN (SDR FAMILY) (1015 aa), FASTA scores: opt: 487, E(): 3.2e-20, (38.4% identity in 276 aa overlap); P22414|FOX2_CANTR PEROXISOMAL HYDRATASE-DEHYDROGENASE, D-3-HYDROXYACYL CoA DEHYDROGENASE (EC 1.1.1.-) (SDR FAMILY) from Candida tropicalis (Yeast) (906 aa) FASTA scores: opt: 481, E(): 6.4e-20, (38.0% identity in 250 aa overlap); P50171|DHB8_MOUSE|HSD17B8|HKE6|H2-KE6 ESTRADIOL 17 BETA-DEHYDROGENASE 8 from Mus musculus (Mouse) (260 aa) FASTA scores: opt: 459, E(): 4.3e-19, (39.75% identity in 259 aa overlap); CAC41362|BKR1 3-OXYACYL-[ACYL-CARRIER PROTEIN] REDUCTASE (EC 1.1.1.100) (FRAGMENT) from Brassica napus (Rape) (317 aa), FASTA scores: opt: 447, E(): 2.4e-18, (39.2% identity in 255 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Start uncertain. Protein product from Mb3532c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3532c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F5" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F5" /protein_id="SIU02159.1" /translation="MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLG ATVVVNDVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLGGL DIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDKAKDAEGGSVF GRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALGRYGVCANVICPRARTAMT ADVFGAAPDVEAGQIDPLSPQHVVSLVQFLASPAAAEVNGQVFIVYGPQVTLVSPPHM ERRFSADGTSWDPTELTATLRDYFAGRDPEQSFSATDLMRQ" CDS complement(3872101..3872292) /codon_start=1 /transl_table=11 /gene="fdxD" /locus_tag="BQ2027_MB3533C" /product="PROBABLE FERREDOXIN FDXD" /note="Mb3533c, fdxD, len: 63 aa. Equivalent to Rv3503c, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Probable fdxD, ferredoxin, equivalent to Q9R6Z5|B229_C3_226 HYPOTHETICAL 9.3 KDA PROTEIN from Mycobacterium leprae (83 aa) FASTA scores: opt: 276, E(): 1.8e-13, (75.9% identity in 54 aa overlap). Also similar to several e.g. Q9R6Z5|PHDC from Nocardioides sp. strain KP7 (69 aa), FASTA scores: opt: 177, E(): 2.1e-06, (43.35% identity in 60 aa overlap); Q9X4X8|DITA3 DIOXYGENASE DITA FERREDOXIN COMPONENT from Pseudomonas abietaniphila (78 aa), FASTA scores: opt: 166, E(): 1.4e-05, (36.2% identity in 58 aa overlap); P00203|FER_MOOTH from Moorella thermoacetica (Clostridium thermoaceticum) (63 aa), FASTA scores: opt: 157, E(): 5.4e-05, (36.65% identity in 60 aa overlap); P18325|FER2_STRGO|SUBB from Streptomyces griseolus (64 aa) FASTA scores: opt: 157, E(): 5.5e-05, (39.35% identity in 61 aa overlap); etc. BELONGS TO THE BACTERIAL TYPE FERREDOXIN FAMILY. Mb3533c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C2" /db_xref="InterPro:IPR001080" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C2" /protein_id="SIU02160.1" /translation="MRVIVDRDRCEGNAVCLGIAPDIFDLDDEDYAVVKTDPIPVDQE DLAEQAIAECPRAALSRGE" CDS 3872507..3873709 /codon_start=1 /transl_table=11 /gene="fadE26" /locus_tag="BQ2027_MB3534" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE26" /note="Mb3534, fadE26, len: 400 aa. Equivalent to Rv3504, len: 400 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 400 aa overlap). Probable fadE26, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to other ACYL-COA DEHYDROGENASES from Mycobacterium tuberculosis e.g. P71858|FADE29|Rv3543c|MTCY03C7.13 (387 aa) FASTA scores: opt: 1031, E(): 7.5e-59, (46.25% identity in 402 aa overlap); and P95280|FADE17|Rv1934c|MTCY09F9.30 (409 aa), FASTA scores: opt: 617, E(): 3.1e-32, (32.6% identity in 423 aa overlap); etc. Also similar to others e.g. Q9A6G3|CC2131 from Caulobacter crescentus (403 aa) FASTA scores: opt: 710, E(): 3.2e-38, (33.4% identity in 413 aa overlap); Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 522, E(): 3.7e-26, (34.1% identity in 358 aa overlap); Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 509, E(): 2.6e-25, (34.45% identity in 363 aa overlap); etc. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3534 detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4E9" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E9" /protein_id="SIU02161.1" /translation="MRISYTPQQEELRRELRSYFATLMTPERREALSSVQGEYGVGNV YRETIAQMGRDGWLALGWPKEYGGQGRSAMDQLIFTDEAAIAGAPVPFLTINSVAPTI MAYGTDEQKRFFLPRIAAGDLHFSIGYSEPGAGTDLANLRTTAVRDGDDYVVNGQKMW TSLIQYADYVWLAVRTNPESSGAKKHRGISVLIVPTTAEGFSWTPVHTMAGPDTSATY YSDVRVPVANRVGEENAGWKLVTNQLNHERVALVSPAPIFGCLREVREWAQNTKDAGG TRLIDSEWVQLNLARVHAKAEVLKLINWELASSQSGPKDAGPSPADASAAKVFGTELA TEAYRLLMEVLGTAATLRQNSPGALLRGRVERMHRACLILTFGGGTNEVQRDIIGMVA LGLPRANR" CDS 3873734..3874855 /codon_start=1 /transl_table=11 /gene="fadE27" /locus_tag="BQ2027_MB3535" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE27" /note="Mb3535, fadE27, len: 373 aa. Equivalent to Rv3505, len: 373 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 373 aa overlap). Probable fadE27, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to other ACYL-COA DEHYDROGENASES from Mycobacterium tuberculosis e.g. P71857|FADE28|Rv3544c|MTCY03C7.12 (339 aa) FASTA scores: opt: 497, E(): 1.8e-22, (30.3% identity in 343 aa overlap); and P95281|FADE18|Rv1933c|MTCY09F9.31 (363 aa) FASTA scores: opt: 421, E(): 6.4e-18, (32.35% identity in 334 aa overlap). Also similar to other e.g. Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 425, E(): 3.5e-18, (30.75% identity in 351 aa overlap); Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa) FASTA scores: opt: 317, E(): 1e-11, (32.8% identity in 372 aa overlap); Q9L8Q3|PDTORFO from Pseudomonas stutzeri (Pseudomonas perfectomarina) (513 aa), FASTA scores: opt: 301, E(): 1.2e-10, (25.9% identity in 394 aa overlap); etc. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3535 detected using SWATH mass spectrometry. Mb3535 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F7" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F7" /protein_id="SIU02162.1" /translation="MDFTTTEAAQDLGGLVDTIVDAVCTPEHQRELDKLEQRFDRELW RKLIDAGILSSAAPESLGGDGFGVLEQVAVLVALGHQLAAVPYLESVVLAAGALARFG SPELQQGWGVSAVSGDRILTVALDGEMGEGPVQAAGTGHGYRLTGTRTQVGYGPVADA FLVPAETDSGAAVFLVAAGDPGVAVTALATTGLGSVGHLELNGAKVDAARRVGGTDVV VWLGTLSTLSRTAFQLGVLERGLQMTAEYARTREQFDRPIGSFQAVGQRLADGYIDVK GLRLTLTQAAWRVAEDSLASRECPQPADIDVATAGFWAAEAGHRVAHTIVHVHGGVGV DTDHPVHRYFLAAKQTEFALGGATGQLRRIGRELAETPA" CDS 3874926..3876434 /codon_start=1 /transl_table=11 /gene="fadD17" /locus_tag="BQ2027_MB3536" /product="fatty-acid-coa synthetase fadd17 (fatty-acid-coa synthase) (fatty-acid-coa ligase)" /note="Mb3536, fadD17, len: 502 aa. Equivalent to Rv3506, len: 502 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 502 aa overlap). Possible fadD17, fatty-acid-CoA synthetase (ligase) (EC 6.2.1.-), similar to P72007|FADD1|RV1750c|MTCY28.13c|MTCY04C12.34 from Mycobacterium tuberculosis (532 aa), FASTA scores: opt: 666, E(): 9.8e-32, (52.05% identity in 488 aa overlap). Also similar to various ligases/synthetases e.g. Q9EY88|FCS FERULOYL-COA SYNTHETASE from Amycolatopsis sp. HR167 (491 aa), FASTA scores: opt: 490, E(): 2.1e-21, (30.3% identity in 462 aa overlap); BAB33463|ECS0040 (alias AAG54340|CAIC) PROBABLE CROTONOBETAINE/CARNITINE-COA LIGASE from Escherichia coli strain O157:H7 (522 aa), FASTA scores: opt: 478, E(): 1.1e-20, (28.5% identity in 347 aa overlap); Q9KHL1|ENCH PUTATIVE ACYL-COA LIGASE from Streptomyces maritimus (535 aa), FASTA scores: opt: 477, E(): 1.3e-20, (28.7% identity in 453 aa overlap); Q50017|XCLC|ML1051 ACYL-COA SYNTHASE from Mycobacterium leprae (476 aa), FASTA scores: opt: 472, E(): 2.3e-20, (31.35% identity in 469 aa overlap); P31552|CAIC_ECOLI|B0037 from Escherichia coli strain K12 (522 aa), FASTA scores: opt: 467, E(): 4.8e-20, (28.75% identity in 348 aa overlap); Q9KBC2|BH2006 from Bacillus halodurans LONG-CHAIN ACYL-COA SYNTHETASE (LIGASE) (513 aa), FASTA scores: opt: 462, E(): 9.4e-20, (27.65% identity in 463 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. Mb3536 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TWC5" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR030310" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TWC5" /protein_id="SIU02163.1" /translation="MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAA ALRERLDPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAKAD CQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLADLFMLIFTSGT SGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLFHSNAVLVGWAVAAACQGS MALRRKFSASQFLADVRRYGATYANYVGKPLSYVLATPELPDDADNPLRAVYGNEGVP GDIDRFGRRFGCVVMDGFGSTEGGVAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPT GVVGELVNTAGPGGFEGYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWM RVDGENLGTAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPIRR" CDS 3876605..3880687 /codon_start=1 /transl_table=11 /gene="PE_PGRS53" /locus_tag="BQ2027_MB3537" /product="pe-pgrs family protein pe_pgrs53" /note="Mb3537, PE_PGRS53, len: 1360 aa. Similar to Rv3507, len: 1381 aa, from Mycobacterium tuberculosis strain H37Rv, (93.635% identity in 1414 aa overlap). Member of the Mycobacterium tuberculosis PE protein family, PGRS subfamily of gly-rich proteins, similar to others from M. tuberculosis strains H37Rv and CDC1551 e.g. O06810|Rv1450c|MTCY493.04 (1329 aa), FASTA scores: opt: 2173, E(): 1.4e-135, (51.15% identity in 1412 aa overlap). Equivalent to AAK47970 from Mycobacterium tuberculosis strain CDC1551 (1384 aa) but with some minor differences between the proteins. Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 18 bp, 72 bp and 9 bp (*-cggcggcac), and deletions of 90 bp, 54 bp and 18 bp, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1360 aa versus 1381 aa). Protein product from Mb3537 detected using shotgun mass spectrometry. Mb3537 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D0" /protein_id="SIU02164.1" /translation="MSFVLVSPETVAAVATDLKRIGASLAHENASAAASTTAVVSAAA DEVSTAVAALFSQHAQGYQAAAAQVAAFHSRFVQALTAGAGAYAFAEAANASPLQSAM GAVSASAQTLLSRPLIGNGANATTPGGNGGDGGWLFGSGGNGAPGAAGQSGGNGGSAG LWGNGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTGAPGAMGGTGGNGGNGALL IGGGGLGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQGTGAGGAAGAG GTGGNGGAGGLFMNGGDGGAGGQGGDGAAGDAAASAGGTGGKGGQGGDGGTGGAGGAG PVLFGHGGAGGMGGQGGTGGMGGAGGDGTTVIAAGTGGEGGTGGTGGTGGNGADAAAV VGFGANGDPGFAGGKGGNGGIGGAAVTGGVAGDGGTGGKGGTGGAGGAGNDAGSTGNP GGKGGDGGIGGAGGAGGAAGTGNGGHAGNTGDGGDGGTGGNGGNGTGGVNGADNTLNP DTPGGAGEPGGAGGAGGAGGAAGGPGGTGGTGGNGGNAGNNSTNAPVGGEGGAGGDGG AGGAGGAANGGTAGSQGTGGVGGDGGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGG NGGVGGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGDGGDG GAGGAGGNAGGAGGNGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGGAGGSGGD GGKGGQGGSGGTGGSGAPIGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAG DGGNGGGGGAGGTGGDGATGTPAGNGGNGGNAGDGGNGGSGDFGGNTTSGASGSGGNG GNAGTAGSGGAGGTGGTGLSGGNGGNGGDGGNGAHGTVGAQFVPATSLPTPNGGAGGN GGTGSNGGAPGPAGAPGPTTGGNAGSQGIGGDGGNGGDGGKGGDGADAVNVVFMPTEP QAATGTAGSAGDPTGGNGGPGTPGSPMVAPPPPTPITQVQQGGDGGAGGTGSTNANDG TATGGKGGEGGVGSILGGPGGNGGTGGNASATGTNGVANAGNGGKGGDGGQFGAGGNG GAGGSVTDGSAGSTAGNGGNGGNATNGTIAGQPAGGNGSAGGKGGDGGNIAAGATGTA GNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGIDGGF GGDGGNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGNGGSG GTGGAGGDGGTGGGGGNGGTGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGG TGGASEDGDNGNAGSGATGGTGGNGGTGGDGGAAGLGGVA" CDS 3880978..3885360 /codon_start=1 /transl_table=11 /gene="PE_PGRS54" /locus_tag="BQ2027_MB3538" /product="pe-pgrs family protein pe_pgrs54" /note="Mb3538, PE_PGRS54, len: 1460 aa. Similar to Rv3508, len: 1901 aa, from Mycobacterium tuberculosis strain H37Rv, (71.06% identity in 1901 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. downstream O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 6598, E(): 0, (71.05% identity in 1533 aa overlap). Equivalent to AAK47971 from Mycobacterium tuberculosis strain CDC1551 (1384 aa) but shorter 13 aa and with some minor differences between the proteins. Contains five PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 9 bp (*-cggcgcggg), 27 bp and 18 bp, deletions of 168 bp, 603 bp and 48 bp, and several substitutions lead to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1460 aa versus 1901 aa). Mb3538 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y615" /protein_id="SIU02165.1" /translation="MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGA DEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVLGV INAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLW GNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAG GVGGAGGGTGGAGGRAELLFGAGGAGGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAP GGAGGAGGQGGAGGAGSDGGALGGTGGTGGTGGAGGAGGRGALLLGAGGQGGLGGAGG QGGTGGAGGDGVLGGVGGTGGKGGVGGVAGLGGAGGAAGQLFSAGGAAGAVGVGGTGG QGGAGGMGGSGADNASGIGADGGAGGTGGNAGAGGAGGAAGTGGTGGVVGAAGKAGIG GTGGQGGAGGAGSAGTDATATGATGGTGFSGGAGGAGGAGGNTGVGGTNGSGGQGGTG GAGGAGGAGGVGADNPTGIGGAGGTGGAGGTGGTGGAAGAGGAGGAVGTGGTGGVVGD VGNAGIGGTGGKGGAGGTGFAGGAGGAGGQGGSSGAGGTNGSGGAGGTGGQGGAGGAG GAGADNPTGIGGAGGTGGTGGAAGAGGAGGAIGTGGTGGAVGSVGNAGIGGTGGTGGV GGAGGAGAAAAAGSSATGGAGFAGGAGGEGGAGGNSGVGGTNGSGGAGGAGGKGGTGG AGGSGADNPTGAGFAGGAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRG GDGGDGASGLGLGLSGFDGGQGGQGGDGGSAGAGGINGAGGAGGDGGDGGDGATGAAG LGDNGGVGGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAGGAGGAGDNNFNG GQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDG GDAGAGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGLGLDLSGFDGGQGGQGGAGGNA GAGGINGAGGTGGTGGAGGDGAPATLIGGPDGGDGGQGGIGGDGGNAGFGAGVPGDGG IGGTGGAGGAGGAGGAGDAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGD GGNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGAGGLANTGGTAGNAGIGGAGGRGG DGGAGDSGALSQDGNGFAGGQGGQGGAGGNAGAGGINGAGGTGGTGGAGGDGAPATLI GGPDGGDGGQGGGAGFGSGVAGAAGAGGNGGKGGDGGTGGTGGTNFAGGQGGAGGRGG AGGNGANGVGDNAAGGDGGNGGAGGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPT GSGTEGTGGDGGDAGAGGNGGSATGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGL GGSGAGGGTDGDDGNGGSPGTDGS" CDS complement(3885527..3887074) /codon_start=1 /transl_table=11 /gene="ilvX" /locus_tag="BQ2027_MB3539C" /product="PROBABLE ACETOHYDROXYACID SYNTHASE ILVX (ACETOLACTATE SYNTHASE)" /note="Mb3539c, ilvX, len: 515 aa. Equivalent to Rv3509c, len: 515 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 515 aa overlap). Probable ilvX, acetohydroxyacid synthase (EC 4.1.3.18), equivalent to Mycobacterium leprae protein described as Acetolactate synthase I, valine sensitive, large subunit (EC 4.1.3.18) Q49865|ILVX|ILVI1|B229_C3_222 (515 aa), FASTA scores: opt: 2762, E(): 8.8e-145, (82.9% identity in 515 aa overlap). Also similar to various enzymes (principally acetohydroxyacid/acetolactate synthases) e.g. Q9AB41|CC0393 THIAMINE-PYROPHOSPHATE-REQUIRING ENZYME from Caulobacter crescentus (512 aa), FASTA scores: opt: 1572, E(): 2.8e-79, (50.95% identity in 514 aa overlap); BAB50432|MLL3567 ACETOLACTATE SYNTHASE I from Rhizobium loti (Mesorhizobium loti) (517 aa), FASTA scores: opt: 1440, E(): 5.2e-72, (47.9% identity in 548 aa overlap); P20906|MDLC_PSEPU BENZOYLFORMATE DECARBOXYLASE (EC 4.1.1.7) from Pseudomonas putida (528 aa), FASTA scores: opt: 356, E(): 2.5e-12, (28.1% identity in 530 aa overlap); Q9L123|SC6D11.33c PUTATIVE DECARBOXYLASE from Streptomyces coelicolor (526 aa), FASTA scores: opt: 325, E(): 1.3e-10, (33.2% identity in 530 aa overlap); Q9RDF9|SCC57A.40c PUTATIVE ACETOLACTATE SYNTHASE from Streptomyces coelicolor (564 aa), FASTA scores: opt: 304, E(): 1.9e-09, (28.55% identity in 550 aa overlap); P94783 VALINE-SENSITIVE ACETOHYDROXY ACID SYNTHASE from Citrobacter freundii (561 aa), FASTA scores: opt: 278, E(): 5.1e-08, (25.8% identity in 550 aa overlap); Q42767|AHAS ACETOHYDROXYACID SYNTHASE (EC 4.1.3.18)from Gossypium hirsutum (Upland cotton) (659 aa), FASTA scores: opt: 278, E(): 5.8e-08, (26.15% identity in 558 aa overlap); etc. Note that other Mycobacterium tuberculosis proteins, e.g. O53250|MTV012.17c|ILVB_MYCTU|Rv3003c|MT3083 |MTV012.17c, show better similarity to Acetolactate synthase I. SIMILAR TO OTHER ENZYMES WHICH REQUIRE TPP. COFACTOR: THIAMIN PYROPHOSPHATE (BY SIMILARITY). Protein product from Mb3539c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3539c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y507" /db_xref="InterPro:IPR011766" /db_xref="InterPro:IPR012001" /db_xref="InterPro:IPR029061" /db_xref="UniProtKB/TrEMBL:A0A1R3Y507" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02166.1" /translation="MNGAQALINTLVDGGVDVCFANPGTSEMHFVAALDAVPRMRGML TLFEGVATGAADGYARIAGRPAAVLLHLGPGLGNGLANLHNARRARVPMVVVVGDHAT YHKKYDAPLESDIDAVAGTVSGWVRRTEAAADVGADAEAAIAASRSGSQIATLILPAD VCWSDGAHAAAGVPAQAAAAPVDVGPVAGVLRSGEPAMMLIGGDATRGPGLTAAARIV QATGARWLCETFPTCLERGAGIPAVERLAYFAEGAAAQLDGVKHLVLAGARSPVSFFA YPGMPSDLVPAGCEVHVLAEPGGAADALAALADEVAPGTVAPVAGASRPQLPTGDLTS VSAADVVGALLPERAIVVDESNTCGVLLPQATAGAPAHDWLTLTGGAIGYGIPAAVGA AVAAPDRPVLCLESDGSAMYTISGLWSQARENLDVTTVIYNNGAYDILRIELQRVGAG SDPGPKALDLLDISRPTMDFVKIAEGMGVPARRVTTCEEFADALRAAFAEPGPHLIDV VVPSLVG" CDS complement(3887071..3887907) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3540C" /product="Amidohydrolase family protein" /note="Mb3540c, -, len: 278 aa. Equivalent to Rv3510c, len: 278 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 278 aa overlap). Conserved hypothetical protein, similar to Q50662|Rv2303c|MTCY339.06 HYPOTHETICAL 34.6 KDA PROTEIN from Mycobacterium tuberculosis (307 aa), FASTA scores: opt: 416, E(): 1.2e-19, (35.7% identity in 255 aa overlap). Middle of the putative protein highly similar to N-terminal end of Q49860|B229_C2_182 HYPOTHETICAL 11.0 KDA PROTEIN from Mycobacterium leprae (95 aa), FASTA scores: opt: 304, E(): 7.9e-13, (83.65% identity in 55 aa overlap). Also some similarity with other bacterial proteins e.g. P95886 ORF C02006 from Sulfolobus solfataricus (269 aa), FASTA scores: opt: 293, E(): 9.6e-12, (31.3% identity in 198 aa overlap); Q9XDF3|NONC NONC PROTEIN from Streptomyces griseus subsp. griseus (317 aa), FASTA scores: opt: 270, E(): 3.4e-10, (29.95% identity in 227 aa overlap); Q54229|NONR MACROTETROLIDE ANTIBIOTIC-RESISTANCE PROTEIN from Streptomyces griseus (347 aa), FASTA scores: opt: 270, E(): 3.6e-10, (29.95% identity in 227 aa overlap); etc. Protein product from Mb3540c detected using SWATH mass spectrometry. Mb3540c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4C3" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR032465" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C3" /protein_id="SIU02167.1" /translation="MTIDVWMQHPTQRFLHGDMFASLRRWTGGSIPETDIPIEATVSS MDAGGVTLGLLSAWRGPNGQDLISNDAVAEWVRLYPNRFAGLAAVDLDRPMAAVRELR RRVGEGFVGLRVVPWLWGAPPTDRRYYPLFAECVQSAVPFCTQVGHTGPLRPSETGRP IPYIDQVALDFPELVIVCGHVGYPWTEEMVAVARKHENVYIDTSAYTIKRLPGKLVRF MKTDTGQRKVLFGTNYPMIAHTHALTGLDELGLSDEARRDFLHGNAVRVFKLDPRGKV QT" CDS 3888267..3894083 /codon_start=1 /transl_table=11 /gene="PE_PGRS55" /locus_tag="BQ2027_MB3541" /product="pe-pgrs family protein pe_pgrs56" /note="Mb3541, PE_PGRS55, len: 1938 aa. Equivalent to Rv3511 and Rv3512, len: 714 aa and 1079 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 706 aa overlap and 96.0% identity in 1117 aa overlap). Rv3511: Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47974|MT3615.3 (1217 aa) FASTA scores: opt: 2563, E(): 1.5e-94, (59.65% identity in 773 aa overlap); and upstream O53553|Rv3508|MTV023.15 (1901 aa), FASTA scores: opt: 2455, E(): 3.9e-90, (60.4% identity in 737 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. Rv3512: Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47974|MT3615.3 (1217 aa) FASTA scores: opt: 3688, E(): 4.5e-130, (53.95% identity in 1136 aa overlap); and downstream O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 3611, E(): 3.6e-127, (53.15% identity in 1195 aa overlap); etc. Frameshifted PGRS protein, could be continuation of upstream MTV023.18, but no error could be found. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis H37Rv, PE_PGRS55 and PE_PGRS56 exist as 2 genes. In Mycobacterium bovis, a 344 bp insertion results in a single product which is more similar to PE_PGRS55. There are also 3 additional in-frame insertions of 9 bp each (*-accggcgga, *-cggcaacgg and *-cggcggtac) and 2 substitutions compared to the homolog in Mycobacterium tuberculosis strain H37Rv. Mb3541 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H8" /protein_id="SIU02168.1" /translation="MSFVLISPEVVSAAAGDLANVGSTISAANKAAAAATTQVLAAGA DEVSARIAALFGMYGLEYQAISAQVAAYHQQFVQTLRTGAASYMLAEATNVEQNLLNL INAPTQTLLGRPLIGDGANATTPGGAGGDGGLLFGSGGNGAPGAPGQAGGAGGSAGLL GNGGSGGAGGTGAPGGNGGNAGWLYGRGGVGGAGGIGGGTGGAGGHAWLFGHGGTGGI GGGPGGNGGWLLGNGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGG NAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGGHGGNGGN PGWLLGTAGGGGNGGAGSTGTAGGGSGGTGGDGGTGGRGGLLMGAGAGGHGGTGGAGG AGVDGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMAGLG GTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTGGNGGAGSAGADAP AGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGNGGTGGKGGTGGAGMNSLDPLL AAQDGGQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGT TGGAGGAGGAGGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLA AQDGGQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTT GGAGGAGGAGGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAA QDGGQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTG GAGGAGGAGGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGDGALAGSSGGAGG KGGNGGDAGKAGTGSAPGTAGTGGDGGKGGNGGIGAAGTTGPVGTGASGGTGGSGGAG GTGGDGGAANGGTAGAGGAGGNGGKGGDGGAGVTSSTAGNSGGAGGSGGKGGDAGAGG AGATPGANGIAGNGGDGGDGAAGAVGISGATGAGDGGHGGTGGAGGNGGTGGAGGSGI DGVGGGTGGTGGNGGNGAIGGAGGDAGGSGNSGGNGGTGGKGGNAGAGGAAGSNGGTV GANGTGGDGGNGGAAGAATAGSNGGAGTGSAGGNGGTGGRGGSGGAGGDGIGGVGGGK GGNGADGEVGGAGGAGGSGPNTSPGGNGGQGGQGGSGGAGGAAGAGGAGGGANGTAGN GGQGGAGGTGGAGAASSATNGGSGGAGGTGGAGGTGGAGGDGVGGAGGGNGGHGGDAG DGGNGANGNNRSSGSFLAAGGTGGAAGDGGQGGQGGAGGGAGGQGGAGGAGGTGGNGG NITGGTAGTAGAAGNGGAAGKGGAGGQGGTGGGTGGQGGAGGDGGAGGTGGDRTVGGG TVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGG NGGGGANGGAGGAGGSGGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMGG NGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNGGAAGTGGTGGDGGLTG TGGTGGSGGTGGDGGNGGNGGNGADNTANMTAQAGGDGGNGGDGGFGGGAGAGGGGLT AGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTGGTGGAGIGSLGGGT GGDGGNGGTGGNGGTGGEGGEVGGAGGTGGAAGNGGDGGTGGTGGGDGGAGGTGGTGG TGGLGDPRVGGSGGDGGTGGSGGAAGNGGNGGNAGAGGNGNGGTGGAGGIGGTGGNGG DAEPGVPPGAGGAGGAGTTGGKGGTGGNGSGTGSGGTGGDGGTGGGGGNGGTGWNGGK GDTGSGGGAGDGGKAPAGGTGGAGGDGGAGGKGGSGGV" CDS complement(3894212..3894868) /codon_start=1 /transl_table=11 /gene="fadD18" /locus_tag="BQ2027_MB3542C" /product="PROBABLE FATTY-ACID-COA LIGASE FADD18 (FRAGMENT) (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb3542c, fadD18, len: 218 aa. Equivalent to Rv3513c, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Probable fadD18, fatty-acid-CoA synthetase (C-terminal fragment) (EC 6.2.1.-), almost identical to C-terminal end of downstream O53560|FADD19|Rv3515c|MTV023.22c, probably result of partial gene duplication. Also similar at the C-terminus to other fatty-acid-CoA synthetases e.g. Q9EXL2|FADD from Streptomyces griseus (540 aa), FASTA scores: opt: 586, E(): 1.2e-28, (52.45% identity in 185 aa overlap); AAB87139|MIG MEDIUM CHAIN ACYL-COA SYNTHETASE PRECURSOR from Mycobacterium avium (550 aa), FASTA scores: opt: 506, E(): 9.5e-24, (50.0% identity in 150 aa overlap); Q9A7C3|CC1801 PUTATIVE 4-COUMARATE--COA LIGASE from Caulobacter crescentus (561 aa), FASTA scores: opt: 430, E(): 4.4e-19, (45.75% identity in 153 aa overlap); Q9KDT0|BH1131 ACID-COA LIGASE from Bacillus halodurans (546 aa), FASTA scores: opt: 338, E(): 1.9e-13, (38.05% identity in 142 aa overlap); Q9RTR4|DR1692 LONG-CHAIN FATTY ACID--COA LIGASE from Deinococcus radiodurans (584 aa), FASTA scores: opt: 331, E(): 5.3e-13, (35.15% identity in 145 aa overlap); etc. Start uncertain." /db_xref="GOA:A0A1R3Y4G0" /db_xref="InterPro:IPR025110" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G0" /protein_id="SIU02169.1" /translation="MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQA VRPARTLAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGRGS VSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAE LDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGS " CDS 3894914..3897892 /codon_start=1 /transl_table=11 /gene="PE_PGRS57" /locus_tag="BQ2027_MB3543" /product="pe-pgrs family protein pe_pgrs57" /note="Mb3543, PE_PGRS57, len: 992 aa. Similar to Rv3514, len: 1489 aa, from Mycobacterium tuberculosis strain H37Rv, (80.6% identity in 1133 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47971 (1715 aa) FASTA scores: opt: 6940, E(): 0, (67.0% identity in 1713 aa overlap); and upstream O53553|YZ08_MYCTU|Rv3508|MTV023.15 (1901 aa), FASTA scores: opt: 6598,E(): 0, (71.05% identity in 1533 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 9 bp (*-gcgggcggc) and 285 bp, and deletions of 240 bp, 951 bp and 609 bp, lead to shorter product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (992 aa versus 1489 aa). Mb3543 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C9" /protein_id="SIU02170.1" /translation="MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGA DEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVLGV INAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLW GNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAG GVGGAGGGTGGAGGRAELLFGAGGAGGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAP GGAGGAGGQGGAGGAGSDGGALGGTGGTGGTGGAGGAGGRGALLLGAGGQGGLGGAGG QGGMGGAGGAGADNPTGIGGTGGDGGTGGSAGEGGAGGAAGQLFSASGAAGNAGVGGA GGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQ GGMGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGG AGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGG QGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVG GAGGQGGDGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGT GGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQP GATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGGAGISFSNGSNGGTG GTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTG GKGGMGGIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGG KGGAGGDGGDGADGGAATGVGDGGDGGNGGNGGNGGTGVGSPGGLGGAGGTGGLGGAG AGGGADGDDGDDGQPGNNGS" CDS complement(3898453..3900099) /codon_start=1 /transl_table=11 /gene="fadD19" /locus_tag="BQ2027_MB3544C" /product="fatty-acid-coa ligase fadd19 (fatty-acid-coa synthetase) (fatty-acid-coa synthase)" /note="Mb3544c, fadD19, len: 548 aa. Equivalent to Rv3515c, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 548 aa overlap). Probable fadD19, fatty-acid-CoA synthetase (EC 6.2.1.-), similar (or with similarity) to many e.g. Q9EXL2|FADD FADD PROTEIN from Streptomyces griseus (540 aa), FASTA scores: opt: 1449, E(): 1.5e-81, (46.0% identity in 535 aa overlap); AAB87139|MIG MEDIUM CHAIN ACYL-COA SYNTHETASE PRECURSOR from Mycobacterium avium (550 aa), FASTA scores: opt: 1226, E(): 7.6e-68, (40.7% identity in 543 aa overlap); Q9A7C3|CC1801 PUTATIVE 4-COUMARATE--COA LIGASE from Caulobacter crescentus (561 aa), FASTA scores: opt: 979, E(): 1.2e-52, (34.05% identity in 531 aa overlap); O28502|AF1772 LONG-CHAIN-FATTY-ACID--COA LIGASE (FADD-7) from Archaeoglobus fulgidus (569 aa), FASTA scores: opt: 560, E(): 6.9e-27, (29.3% identity in 543 aa overlap); Q9A8N2|CC1321 LONG-CHAIN-FATTY-ACID--COA LIGASE from Caulobacter crescentus (583 aa), FASTA scores: opt: 544, E(): 6.7e-26, (27.2% identity in 518 aa overlap); P29212|LCFA_ECOLI|FADD|OLDD|B1805 LONG-CHAIN-FATTY-ACID--COA LIGASE from Escherichia coli strain K12 (561 aa), FASTA scores: opt: 460, E(): 4e-22, (26.3% identity in 567 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. Note that upstream MTV023.20c|Rv3513c|fadD18 is identical to C-terminal part of FADD19|Rv3515c|MTV023.22c (probably result of partial gene duplication). Protein product from Mb3544c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:Q7TWB7" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TWB7" /protein_id="SIU02171.1" /translation="MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLA HHLIDQGVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDNSD MVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYSAIAAGSPERD FGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTDFATGEFVKDEYDLAKAAA ANPPMIRYPIPPMIHGATQSATWMALFSGQTTVLAPEFNADEVWRTIHKHKVNLLFFT GDAMARPLVDALVKGNDYDLSSLFLLASTAALFSPSIKEKLLELLPNRVITDSIGSSE TGFGGTSVVAAGQAHGGGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYY KDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIAGYKVPRSL WFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG" CDS 3900173..3900964 /codon_start=1 /transl_table=11 /gene="echA19" /locus_tag="BQ2027_MB3545" /product="POSSIBLE ENOYL-COA HYDRATASE ECHA19 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb3545, echA19, len: 263 aa. Equivalent to Rv3516, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 263 aa overlap). Possible echA19, enoyl-CoA hydratase (EC 4.2.1.17), similar to other e.g. Q9ZHG2|ECHA1 from Rhodococcus fascians (275 aa) FASTA scores: opt: 613, E(): 6.4e-32, (45.15% identity in 259 aa overlap); P76082|PAAF_ECOLI|B1393 from Escherichia coli strain K12 (255 aa), FASTA scores: opt: 523, E(): 3.3e-26, (33.6% identity in 256 aa overlap); Q9I393|PA1629 from Pseudomonas aeruginosa (261 aa), FASTA scores: opt: 475, E(): 3.8e-23, (36.85% identity in 247 aa overlap); etc. Also similar to many carnitine racemases eg BAB52369|MLL6015 from Rhizobium loti (Mesorhizobium loti) (257 aa), FASTA scores: opt: 546, E(): 1.1e-27, (36.65% identity in 251 aa overlap). Similar to several putative enoyl-CoA hydratases from Mycobacterium tuberculosis, e.g. P96404|ECHA1|Rv0222|MTCY08D5.17 (262 aa), FASTA scores: opt: 630, E(): 5.1e-33, (44.5% identity in 254 aa overlap); and O53783|ECHA5|Rv0675|MTV040.03 (263 aa) FASTA scores: opt: 499, E(): 1.1e-24, (40.5% identity in 252 aa overlap). COULD BELONG TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb3545 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3545 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4G3" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G3" /protein_id="SIU02172.1" /translation="MESGPDALVERRGHTLIVTMNRPAARNALSTEMMRIMVQAWDRV DNDPDIRCCILTGAGGYFCAGMDLKAATQKPPGDSFKDGSYDPSRIDALLKGRRLTKP LIAAVEGPAIAGGTEILQGTDIRVAGESAKFGISEAKWSLYPMGGSAVRLVRQIPYTL ACDLLLTGRHITAAEAKEMGLIGHVVPDGQALTKALELADAISANGPLAVQAILRSIR ETECMPENEAFKIDTQIGIKVFLSDDAKEGPRAFAEKRAPNFQNR" CDS 3901060..3901899 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3546" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3546, -, len: 279 aa. Equivalent to Rv3517, len: 279 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 279 aa overlap). Hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. P71763|Rv1482c|MTCY277.03c from Mycobacterium tuberculosis strain H37Rv (339 aa) (alias AAK45794|MT1529 from M. tuberculosis strain CDC1551 (292 aa) but longer) FASTA scores: opt: 1040, E(): 3.7e-60, (59.0% identity in 273 aa overlap); O07396|MAV346 from M. avium (346 aa) FASTA scores: opt: 1018, E(): 1e-58, (57.2% identity in 278 aa overlap); O53421|Rv1073|MTV017.26 from Mycobacterium tuberculosis strain H37Rv (283 aa), FASTA scores: opt: 903, E(): 2.4e-51, (48.0% identity in 277 aa overlap); Q50134|U650AG|MLCB57.67c from Mycobacterium leprae (75 aa) FASTA scores: opt: 158, E(): 0.0015, (41.8% identity in 55 aa overlap); etc." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4C7" /protein_id="SIU02173.1" /translation="MIEPFLGSEAIASGALTRHRLRSAYATIHPDVYVSPGADLTAWS RAQAAWLWSRRRGVIAGQSAAAMHGAKWVDARQAAELLYDHRRPPAGIHTWSDRVADD EIQPISGMNTTTPARTALDLARRYPVGKAVAAIDALARATDLKLADVEMLAERYRGSR GIRNARIALDLVDPGAESPRETWLRLLLIRAGFPRPQTQIPVYDEYGQLVAVIDMGWA GIKVGVDYEGDHHRTDRRTFNKDIKRAEALTELGWTDVRVTVEDTEGGIIWRVSAAWQ RRT" CDS complement(3901954..3902535) /codon_start=1 /transl_table=11 /gene="cyp142b" /locus_tag="BQ2027_MB3547C" /product="PROBABLE CYTOCHROME P450 MONOOXYGENASE 142 CYP142B [SECOND PART]" /note="Mb3547c, cyp142b, len: 193 aa. Equivalent to 3' end of Rv3518c, len: 398 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 193 aa overlap). Probable cyp142, cytochrome P450 monoxygenase (EC 1.14.-.-), member of Cytochrome P450 family and similar to many e.g. Q9L465|CYP162A1|NIKQ from Streptomyces tendae (396 aa) FASTA scores: opt: 798, E(): 2e-43, (36.7% identity in 403 aa overlap); P33271|CPXK_SACER|CYP107B1 from Saccharopolyspora erythraea (Streptomyces erythraeus) (405 aa), FASTA scores: opt: 725, E(): 9.1e-39, (37.1% identity in 407 aa overlap); Q9X8Q3|CYP107P1|SCH10.14c from Streptomyces coelicolor (411 aa), FASTA scores: opt: 691, E(): 1.3e-36, (37.2% identity in 317 aa overlap); etc. Also similar to Q50696|C124_MYCTU|CYP124|Rv2266|MT2328|MTCY339.44c from Mycobacterium tuberculosis strain H37Rv (428 aa) FASTA scores: opt: 692, E(): 1.2e-36, (36.8% identity in 402 aa overlap). Equivalent to AAK47979 from Mycobacterium tuberculosis strain CDC1551 (372 aa) but longer 26 aa. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, cyp142 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits cyp142 into 2 parts, cyp142a and cyp142b. Protein product from Mb3547c detected using SWATH mass spectrometry. Mb3547c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F0" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F0" /protein_id="SIU02174.1" /translation="MSSEVDGERLSDDELVMETLLILIGGDETTRHTLSGGTEQLLRN RDQWDLLQRDPSLLPGAIEEMLRWTAPVKNMCRVLTADTEFHGTALCAGEKMMLLFES ANFDEAVFCEPEKFDVQRNPNSHLAFGFGTHFCLGNQLARLELSLMTERVLRRLPDLR LVADDSVLPLRPANFVSGLESMPVVFTPSPPLG" CDS complement(3902532..3903149) /codon_start=1 /transl_table=11 /gene="cyp142a" /locus_tag="BQ2027_MB3548C" /product="PROBABLE CYTOCHROME P450 MONOOXYGENASE 142 CYP142A [FIRST PART]" /note="Mb3548c, cyp142a, len: 205 aa. Equivalent to 5' end of Rv3518c, len: 398 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 197 aa overlap). Probable cyp142, cytochrome P450 monoxygenase (EC 1.14.-.-), member of Cytochrome P450 family and similar to many e.g. Q9L465|CYP162A1|NIKQ from Streptomyces tendae (396 aa) FASTA scores: opt: 798, E(): 2e-43, (36.7% identity in 403 aa overlap); P33271|CPXK_SACER|CYP107B1 from Saccharopolyspora erythraea (Streptomyces erythraeus) (405 aa), FASTA scores: opt: 725, E(): 9.1e-39, (37.1% identity in 407 aa overlap); Q9X8Q3|CYP107P1|SCH10.14c from Streptomyces coelicolor (411 aa), FASTA scores: opt: 691, E(): 1.3e-36, (37.2% identity in 317 aa overlap); etc. Also similar to Q50696|C124_MYCTU|CYP124|Rv2266|MT2328|MTCY339.44c from Mycobacterium tuberculosis strain H37Rv (428 aa) FASTA scores: opt: 692, E(): 1.2e-36, (36.8% identity in 402 aa overlap). Equivalent to AAK47979 from Mycobacterium tuberculosis strain CDC1551 (372 aa) but longer 26 aa. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, cyp142 exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits cyp142 into 2 parts, cyp142a and cyp142b. Mb3548c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y628" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/TrEMBL:A0A1R3Y628" /protein_id="SIU02175.1" /translation="MTEAPDVDLADGNFYASREARAAYRWMRANQPVFRDRNGLAAAS TYQAVIDAERQPELFSNAGGIRPDQPALPMMIDMDDPAHLLRRKLVNAGFTRKRVKDK EASIAALCDTLIDAVCERGECDFVRDLAAPLPMAVIGDMLGVRPEQRDMFLRWSDDLV TFLSSHVSQEDFQITMDAFAAYNDFTRATIAARRADPPTTWSACW" CDS 3903178..3903888 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3549" /product="Carboxy-Lyase" /note="Mb3549, -, len: 236 aa. Equivalent to Rv3519, len: 236 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 236 aa overlap). Hypothetical unknown protein. The C-terminal end is highly similar to N-terminal end of AAK47980|MT3620 HYPOTHETICAL 7.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (73 aa), FASTA scores: opt: 279, E(): 9.4e-12, (95.65% identity in 46 aa overlap). Start uncertain. Protein product from Mb3549 detected using shotgun mass spectrometry. Mb3549 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y513" /db_xref="InterPro:IPR010451" /db_xref="InterPro:IPR023375" /db_xref="UniProtKB/TrEMBL:A0A1R3Y513" /protein_id="SIU02176.1" /translation="MPVSQHTIAGTVLTMPVRIRTANLHSAMFSVPADPAQRLIDYSG LRVCEYLPGKAIVMQMLVRYVDGDLGRYHEYGTAIMVNPPGTQRRGPRALTRAAAFIH HLPVDQVFTLEAGRTIWGFPKIMADFNVTDGRRFGFDVSADGRLIAGIEFSTGLPVPT LGWQMLKTYSHHDGVTREIPWEMKVSGLRARLGGARLRLGDHPYAKELASLGLPKRAL LSQSAANVEMTFGDGHPI" CDS complement(3903953..3904996) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3550C" /product="POSSIBLE COENZYME F420-DEPENDENT OXIDOREDUCTASE" /note="Mb3550c, -, len: 347 aa. Equivalent to Rv3520c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Possible coenzyme F420-dependent oxidoreductase (EC 1.-.-.-), equivalent to Q9CCV8|ML0348 POSSIBLE COENZYME F420-DEPENDENT OXIDOREDUCTASE from Mycobacterium leprae (350 aa), FASTA scores: opt: 2029, E(): 9.1e-120, (86.85% identity in 342 aa overlap). Similar to many coenzyme F420-dependent enzymes (and other proteins) e.g. Q9AD98|SCI52.11c PUTATIVE ATP/GTP-BINDING PROTEIN from Streptomyces coelicolor (351 aa), FASTA scores: opt: 859, E(): 1.6e-46, (41.9% identity in 346 aa overlap); Q9X7Y1|SC6A5.35 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (341 aa), FASTA scores: opt: 800, E(): 7.9e-43, (38.95% identity in 339 aa overlap); Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 354, E(): 6.7e-15, (34.2% identity in 336 aa overlap); Q49598|MER COENZYME F420-DEPENDENT N5,N10-METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Methanopyrus kandleri (349 aa), FASTA scores: opt: 283, E(): 1.9e-10, (26.75% identity in 329 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Methanococcus jannaschii (331 aa), FASTA scores: opt: 227, E(): 5.8e-07, (26.35% identity in 334 aa overlap); O27784|MTH1752 COENZYME F420-DEPENDENT N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE from Methanobacterium thermoautotrophicum (321 aa), FASTA scores: opt: 207, E(): 1e-05, (27.4% identity in 336 aa overlap); etc. Also similar to Q11030|YD60_MYCTU|Rv1360|MT1405|MTCY02B10.24 HYPOTHETICAL 37.3 KDA PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 313, E(): 2.5e-12, (28.0% identity in 311 aa overlap). Protein product from Mb3550c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3550c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4D5" /db_xref="InterPro:IPR011251" /db_xref="InterPro:IPR019951" /db_xref="InterPro:IPR036661" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D5" /protein_id="SIU02177.1" /translation="MEAGMKLGLQLGYWGAQPPQNHAELVAAAEDAGFDTVFTAEAWG SDAYTPLAWWGSSTQRVRLGTSVIQLSARTPTACAMAALTLDHLSGGRHILGLGVSGP QVVEGWYGQRFPKPLARTREYIDIVRQVWARESPVTSAGPHYRLPLTGEGTTGLGKAL KPITHPLRADIPIMLGAEGPKNVALAAEICDGWLPIFYSPRMAGMYNEWLDEGFARPG ARRSREDFEICATAQVVITDDRAAAFAGIKPFLALYMGGMGAEETNFHADVYRRMGYT QVVDEVTKLFRSGRKDEAAEIIPDELVDDAVIVGDIDHVRKQMAVWEAAGVTMMVVTA GSAEQVRDLAALV" CDS 3905149..3906060 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3551" /product="Predicted nucleic-acid-binding protein containing a Zn-ribbon" /note="Mb3551, -, len: 303 aa. Equivalent to Rv3521, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 303 aa overlap). Conserved hypothetical protein, similar to (although longer than) other conserved hypothetical proteins e.g. O29296|AF0966 from Archaeoglobus fulgidus (176 aa), FASTA scores: opt: 286, E(): 5.4e-11, (31.15% identity in 170 aa overlap); O30036|AF0203 from Archaeoglobus fulgidus (149 aa) FASTA scores: opt: 259, E(): 2.3e-09, (33.8% identity in 142 aa overlap); O29297|AF0965 from Archaeoglobus fulgidus (154 aa), FASTA scores: opt: 241, E(): 3.2e-08, (31.4% identity in 137 aa overlap); Q9Y995|APE2390 from Aeropyrum pernix (157 aa), FASTA scores: opt: 204, E(): 6.8e-06, (27.45% identity in 153 aa overlap); BAB60424|TVG1322512 from Thermoplasma volcanium (164 aa), FASTA scores: opt: 183, E(): 0.00015, (29.75% identity in 148 aa overlap); etc. Equivalent to AAK47982 from Mycobacterium tuberculosis strain CDC1551 (334 aa) but shorter 31 aa. Protein product from Mb3551 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3551 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002878" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J1" /protein_id="SIU02178.1" /translation="MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSE MVPVSSVGTVASWTWQPEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIH TGARVHAHWADQPVGAITDIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHT ASHEESAYLRAIAQGKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTF AIVNIPFLGQRIKPPYVAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERW GLGIDNIEYFRPTGEPDADYDTYKHHL" CDS 3906076..3907140 /codon_start=1 /transl_table=11 /gene="ltp4" /locus_tag="BQ2027_MB3552" /product="possible lipid transfer protein or keto acyl-coa thiolase ltp4" /note="Mb3552, -, len: 354 aa. Equivalent to Rv3522, len: 354 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 354 aa overlap). Possible lipid carrier protein or keto acyl-CoA thiolase (EC 2.3.1.16), similar to several e.g. O30103|AF0134 3-KETOACYL-COA THIOLASE (ACAB-4) from Archaeoglobus fulgidus (398 aa) FASTA scores: opt: 352, E(): 5.3e-15, (30.45% identity in 381 aa overlap); O29295|AF0967 3-KETOACYL-COA THIOLASE (ACAB-9) from Archaeoglobus fulgidus (400 aa) FASTA scores: opt: 312, E(): 1.8e-12, (28.05% identity in 367 aa overlap); O29294|AF0968 3-KETOACYL-COA THIOLASE (ACAB-10) from Archaeoglobus fulgidus (388 aa), FASTA scores: opt: 293, E(): 2.9e-11, (25.9% identity in 309 aa overlap); O58409|PH0676 LONG HYPOTHETICAL NONSPECIFIC LIPID-TRANSFER PROTEIN (ACETHYL CoA SYNTHETASE) (EC 6.2.1.-) from Pyrococcus horikoshii (389 aa), FASTA scores: opt: 292, E(): 3.3e-11, (25.8% identity in 368 aa overlap); Q9Y9A3|APE2382 LONG HYPOTHETICAL NON SPECIFIC LIPID-TRANSFER PROTEIN from Aeropyrum pernix (360 aa) FASTA scores: opt: 270, E(): 7.8e-10, (27.25% identity in 363 aa overlap); Q9YDI4|APE0929 LONG HYPOTHETICAL NONSPECIFIC LIPID-TRANSFER PROTEIN from Aeropyrum pernix (400 aa), FASTA scores: opt: 258, E(): 4.9e-09, (26.45% identity in 306 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3552 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3552 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4H2" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H2" /protein_id="SIU02179.1" /translation="MSVRDIAVVGFAHAPHVRRTDGTTNGVEMLMPCFAQLYDELGIT KADIGFWCSGSSDYLAGRAFSFISAIDSIGAVPPINESHVEMDAAWALYEAYIKLLTG EVDTALVYGFGKSSAGTLRRVLSRQTDPYTVAPLWPDSVSMAGLQARLGLDSGKWTHE QMARVAFDSFTNARRVDSVEPPITVGELLARPFFADPLRRHDIAPITDGAAAVVLAAD NRARELRENPAWITGIEHRIESPALGARDITESPSTKLAAKIATGGHTGDIDVAEIHG PFTHQHLIVAEAIRIPGKTKVNPSGGPLAANPMFAAGLERIGFAAQHIWDGSARRVLA HATSGPALQQNLVAVMEGRG" CDS 3907157..3908341 /codon_start=1 /transl_table=11 /gene="ltp3" /locus_tag="BQ2027_MB3553" /product="probable lipid carrier protein or keto acyl-coa thiolase ltp3" /note="Mb3553, -, len: 394 aa. Equivalent to Rv3523, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 394 aa overlap). Probable lipid carrier protein or keto acyl-CoA thiolase (EC 2.3.1.16), similar to several e.g. O30037|AF0202 3-KETOACYL-COA THIOLASE (ACAB-6) from Archaeoglobus fulgidus (380 aa) FASTA scores: opt: 782, E(): 1.7e-40, (38.35% identity in 386 aa overlap); Q9Y9A1|APE2384 LONG HYPOTHETICAL NON SPECIFIC LIPID-TRANSFER PROTEIN (ACETHYL CoA SYNTHETASE) (EC 6.2.1.-) from Aeropyrum pernix (394 aa), FASTA scores: opt: 626, E(): 5.9e-31, (35.75% identity in 386 aa overlap); BAB59210|TVG0067506 LIPID TRANSFER PROTEIN from Thermoplasma volcanium (390 aa), FASTA scores: opt: 591, E(): 8.1e-29, (34.35% identity in 384 aa overlap); Q9YDI4|APE0929 LONG HYPOTHETICAL NONSPECIFIC LIPID-TRANSFER PROTEIN from Aeropyrum pernix (400 aa) FASTA scores: opt: 588, E(): 1.3e-28, (31.6% identity in 408 aa overlap); O30104|AF0133 3-KETOACYL-COA THIOLASE (ACAB-3) from Archaeoglobus fulgidus (411 aa) FASTA scores: opt: 583, E(): 2.6e-28, (39.8% identity in 412 aa overlap); O29811|AF0438 3-KETOACYL-COA THIOLASE (ACAB-8) from Archaeoglobus fulgidus (387 aa), FASTA scores: opt: 574, E(): 8.8e-28, (30.95% identity in 388 aa overlap); etc. Protein product from Mb3553 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3553 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4D8" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D8" /protein_id="SIU02180.1" /translation="MAGKLAAVLGTGQTKYVAKRQDVSMNGLVREAIDRALADSGSTF DDIDAVVVGKAPDFFEGVMMPELFMADAMGATGKPLIRVHTAGSVGGSTGVVAASLVQ SGKYRRVLALAWEKQSESNAMWALSIPVPFTKPVGAGAGGYFAPHVRAYIRRSGAPAH IGAMVAVKDRLNGSRNPLAHLQQPDITLEKVMASQMLWDPIRFDETCPSSDGACAVVV GDEEIADARLAQGHPVAWIHGTALRTEPLAFAGRDQVNPQAGRDAAAALWKAAGITSP IDEIDAAEIYVPFSWFEPMWLENLGFAREGEGWKLTEAGETAIGGRLPVNPSGGVLSA NPIGASGLIRFAEAAIQVMGKAEARQVPGARKALGHAYGGGSQYFSMWVVGCEKPKQA AA" CDS 3908383..3909414 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3554" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb3554, -, len: 343 aa. Equivalent to Rv3524, len: 343 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 343 aa overlap). Probable conserved membrane protein, showing some similarity to C-terminal part of putative Mycobacterium tuberculosis proteins O05871|P95308|PKND_MYCTU|Rv0931c|MT0958|MTCY08C9.08 serine-threonine protein kinase PknD (EC 2.7.1.-) (664 aa) FASTA scores: opt: 727, E(): 8.3e-36, (45.3% identity in 298 aa overlap); O53893|Rv0980c|MTV044.08c PGRS-FAMILY PROTEIN (457 aa), FASTA scores: opt: 208, E(): 4.4e-05, (33.75% identity in 166 aa overlap); and O53891|Rv0978c|MTV044.06c PGRS-FAMILY PROTEIN (331 aa) FASTA scores: opt: 153, E(): 0.062, (30.75% identity in 117 aa overlap). Contains PS00237 G-protein coupled receptors signature. Protein product from Mb3554 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3554 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4G1" /db_xref="InterPro:IPR001258" /db_xref="InterPro:IPR013017" /db_xref="InterPro:IPR035016" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G1" /protein_id="SIU02181.1" /translation="MVKFTPDSQTSVLRAGKCSGTLSPSRSRLQRGSWPVDSERRRYG WPRNRRTLAITGAAVVVVVTLAAIGYLIFEPKISGSSTSRQAASPTTPSPPSQVVVPI DLWNPDGVTVDLADAVYVADSGHKRLLKLPAGSNTPTTLPFTDTIGPGGVAVNSNRDV YVIDEDSHHVLKLAAGIEPPVELPFGSLGDAHGLAVDRSDSVYVVDYDNAKVLKLPPG ADTPTELPFVGLDHPYDVAVDGAGTVYVTDSGHNRVVALTAGSATPVHLPFADLSFPA GVTVDRDDSVYVADLNNNRVLKLAAGSNAQSQLPFTGLFSPTDVAVDNDGAVYVIDFY NRMLKLPTA" CDS complement(3909428..3909952) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3555C" /product="POSSIBLE SIDEROPHORE-BINDING PROTEIN" /note="Mb3555c, -, len: 174 aa. Equivalent to Rv3525c, len: 174 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 174 aa overlap). Possible siderophore-binding protein, similar to ferripyochelin binding proteins (and related) e.g. Q9RSN5|DR2089 FERRIPYOCHELIN-BINDING PROTEIN from Deinococcus radiodurans (240 aa), FASTA scores: opt: 472, E(): 3.3e-21, (46.9% identity in 162 aa overlap); O59257|PH1591 LONG HYPOTHETICAL FERRIPYOCHELIN BINDING PROTEIN from Pyrococcus horikoshii (173 aa), FASTA scores: opt: 431, E(): 6.7e-19, (40.0% identity in 170 aa overlap); Q9V158|FBP|PAB0393 FERRIPYOCHELIN BINDING PROTEIN from Pyrococcus abyssi (173 aa), FASTA scores: opt: 429, E(): 8.9e-19, (39.4% identity in 170 aa overlap); BAB47820|MLR0180 FERRIPYOCHELIN BINDING PROTEIN-LIKE from Rhizobium loti (Mesorhizobium loti) (175 aa), FASTA scores: opt: 415, E(): 6.1e-18, (42.55% identity in 141 aa overlap); etc. Protein product from Mb3555c detected using SWATH mass spectrometry. Mb3555c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001451" /db_xref="InterPro:IPR011004" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H9" /protein_id="SIU02182.1" /translation="MPLFSFEGRSPRIDPTAFVAPTATLIGDVTIEAGASVWFNAVLR GDYAPVVVREGANVQDGAVLHAPPGIPVDIGPGATVAHLCVIHGVHVGSEALIANHAT VLDGAVIGARCMIAAGALVVAGTQIPAGMLVTGAPAKVKGPIEGTGAEMWVNVNPQAY RDLAARHLAGLEPM" CDS 3910067..3911227 /codon_start=1 /transl_table=11 /gene="ksha" /locus_tag="BQ2027_MB3556" /product="oxygenase component of 3-ketosteroid-9-alpha-hydroxylase ksha" /note="Mb3556, -, len: 386 aa. Equivalent to Rv3526, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 386 aa overlap). Hypothetical oxidoreductase (EC 1.-.-.-), highly similar, except in C-terminus (also longer 69 aa), to O69348|ORF12 PROTEIN (function unknown) from Rhodococcus erythropolis (316 aa) FASTA scores: opt: 1137, E(): 6.9e-65, (59.6% identity in 250 aa overlap). Also some similarity with several aminopyrrolnitrin oxidases (PRND proteins, involved in the pathway for pyrrolnitrin biosynthesis, a secondary metabolite derived from tryptophan which has strong anti-fungal activity) e.g. Q9RPG0|PRND from Myxococcus fulvus (379 aa), FASTA scores: opt: 322, E(): 4.4e-13, (25.85% identity in 352 aa overlap); Q9RPG4|PRND from Burkholderia cepacia (Pseudomonas cepacia) (373 aa) FASTA scores: opt: 306, E(): 4.5e-12, (25.2% identity in 373 aa overlap); P95483|PRND from Pseudomonas fluorescens (363 aa), FASTA scores: opt: 305, E(): 5.1e-12, (25.0% identity in 372 aa overlap); etc. And also some similarity to other putative enzymes like dioxygenases, oxidases, vanillate O-demethyl oxygenase, etc. Protein product from Mb3556 detected using SWATH mass spectrometry. Mb3556 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4D7" /db_xref="InterPro:IPR017941" /db_xref="InterPro:IPR036922" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4D7" /protein_id="SIU02183.1" /translation="MSTDTSGVGVREIDAGALPTRYARGWHCLGVAKDYLEGKPHGVE AFGTKLVVFADSHGDLKVLDGYCRHMGGDLSEGTVKGDEVACPFHDWRWGGDGRCKLV PYARRTPRMARTRSWTTDVRSGLLFVWHDHEGNPPDPAVRIPEIPEAASDEWTDWRWN RILIEGSNCRDIIDNVTDMAHFFYIHFGLPTYFKNVFEGHIASQYLHNVGRPDVDDLG TSYGEAHLDSEASYFGPSFMINWLHNRYGNYKSESILINCHYPVTQNSFVLQWGVIVE KPKGMSEEMTDKLSRVFTEGVSKGFLQDVEIWKHKTRIDNPLLVEEDGAVYQLRRWYE QFYVDVADIKPEMVERFEIEVDTKRANEFWNAEVEKNLKSREVSDDVPAEQH" CDS 3911233..3911682 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3557" /product="HYPOTHETICAL PROTEIN" /note="Mb3557, -, len: 149 aa. Equivalent to Rv3527, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 149 aa overlap). Hypothetical unknown protein. Mb3557 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G4" /protein_id="SIU02184.1" /translation="MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKS RDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTE ASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPNR" CDS complement(3912107..3912820) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3558C" /product="unknown protein" /note="Mb3558c, -, len: 237 aa. Equivalent to Rv3528c, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 237 aa overlap). Hypothetical unknown protein. Protein product from Mb3558c detected using shotgun mass spectrometry. Mb3558c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y636" /protein_id="SIU02185.1" /translation="MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEG AYTFRALDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDAL FLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPH SKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDC RGFGWLPNIQNRAFLFARQ" CDS complement(3913512..3914666) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3559C" /product="Sulfotransferase" /note="Mb3559c, -, len: 384 aa. Equivalent to Rv3529c, len: 384 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 384 aa overlap). Conserved hypothetical protein, showing some similarity to Q50695|YM67_MYCTU|Rv2267c|MT2329|MTCY339.43 HYPOTHETICAL 46.1 KDA PROTEIN from Mycobacterium tuberculosis (388 aa) FASTA scores: opt: 261, E(): 1.6e-09, (27.25% identity in 253 aa overlap). Protein product from Mb3559c detected using SWATH mass spectrometry. Mb3559c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y526" /protein_id="SIU02186.1" /translation="MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLL DAYQGEAGLTVLGSKMNRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRT GTTALHRLLGADPAHQGLHMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGY TGLHFMAAYELEECWQLLRQSLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQL IGLNDAEKRWVLKNPSHLFALDALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGW STKFVGAQIGADAMDTWSRGLERFNAARAKYDSAQFYDVDYHDLIADPLGTVADIYRH FGLTLSDEARQAMTTVHAESQSGARAPKHSYSLADYGLTVEMVKERFAGL" CDS complement(3914666..3915448) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3560C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3560c, -, len: 260 aa. Equivalent to Rv3530c, len: 260 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 260 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. BAB53258|Q987E5|MLL7083 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (258 aa), FASTA scores: opt: 405, E(): 5.3e-18, (33.45% identity in 263 aa overlap); Q9VNF3|CG12171 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (257 aa), FASTA scores: opt: 404, E(): 6.1e-18, (32.8% identity in 256 aa overlap); Q9A3X5|CC3076 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Caulobacter crescentus (254 aa), FASTA scores: opt: 400, E(): 1.1e-17, (31.0% identity in 255 aa overlap); BAB50080|MLR3115 DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (259 aa), FASTA scores: opt: 393, E(): 3e-17, (31.9% identity in 254 aa overlap); Q9F5J1|SIM-NJ1|SIMD2 PUTATIVE 3-KETO-ACYL-REDUCTASE from Streptomyces antibioticus (273 aa), FASTA scores: opt: 388, E(): 6.3e-17, (31.6% identity in 250 aa overlap); etc. Protein product from Mb3560c detected using SWATH mass spectrometry. Mb3560c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E4" /protein_id="SIU02187.1" /translation="MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERL DDVAKQIIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKPLA GTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQPKYGTYKMAK SVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDHQAGKYGTTVDQIYQATAA NSDLKRLPTEDEVASAILFLASDLASGITGQTLDVNCGEYHT" CDS complement(3915445..3916572) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3561C" /product="link to sulfotransferase activity" /note="Mb3561c, -, len: 375 aa. Equivalent to Rv3531c, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 375 aa overlap). Hypothetical unknown protein. Protein product from Mb3561c detected using SWATH mass spectrometry. Mb3561c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J9" /protein_id="SIU02188.1" /translation="MYSDPLREAIAEAEQLVAAAPHIETEADLLEGLQYLAGCIAGCM HLAFDYERDHPFLQSGTGPFTKMGLDNPDTLYFGTRLQANRDYVVSGRRGTTTDLSFQ LLGGEYTDYNVPASQAAFDDRELDIAADGSFEWRLRPSAPGQLVIREVYGDWSQQRGT LAIARLDTVGTAPPPLTRELMEKRYATAGSQLVNRVKTWLQFPQWFYLNIPVNTMVAP RLTPGGLATQYSSAGHFELRPGQALVITVPVSDAPYLGFQLGSMWYISLDYINHQTSL NASQAQADPDGKVRIVVAEQNPGVTNWVETLGHRRGFLQFRWQRVSRELTEADGPTVE LVDFDAIPAALPHYQHNKISEDDWRARIALRQRQIATRMLG" CDS 3916971..3918191 /codon_start=1 /transl_table=11 /gene="PPE61" /locus_tag="BQ2027_MB3562" /product="ppe family protein ppe61" /note="Mb3562, PPE61, len: 406 aa. Equivalent to Rv3532, len: 406 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Member of the Mycobacterium tuberculosis PPE protein family, similar to many, e.g. O53956|Rv1807|MTV049.29 (403 aa), FASTA scores: opt: 954, E(): 1.1e-43, (44.1% identity in 417 aa overlap),Mb3562 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR022171" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I7" /protein_id="SIU02189.1" /translation="MFMDFAMLPPEVNSTRMYSGPGAGSLWAAAAAWDQVSAELQSAA ETYRSVIASLTGWQWLGPSSVRMGAAVTPYVEWLTTTAAQARQTATQITAAATGFEQA FAMTVPPPAIMANRAQVLSLIATNFFGQNTAAIAALETQYAEMWEQDATAMYDYAATS AAARTLTPFTSPQQDTNSAGLPAQSAEVSRATANAGAADGNWLGNLLEEIGILLLPIA PELTPFFLEAGEIVNAIPFPSIVGDEFCLLDGLLAWYATIGSINNINSMGTGIIGAEK NLGILPELGSAAAAAAPPPADIAPAFLAPLTSMAKSLSDGALRGPGEVSAAMRGAGTI GQMSVPPAWKAPAVTTVRAFDATPMTTLPGGDAPAAGVPGLPGMPASGAGRAGVVPRY GVRLTVMTRPLSGG" CDS complement(3918333..3920081) /codon_start=1 /transl_table=11 /gene="PPE62" /locus_tag="BQ2027_MB3563C" /product="ppe family protein ppe62" /note="Mb3563c, PPE62, len: 582 aa. Equivalent to Rv3533c, len: 582 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 582 aa overlap). Member of the Mycobacterium tuberculosis PPE protein family, similar to many, e.g. O53309|Rv3159c|MTV014.03c (590 aa) FASTA scores: opt: 2289, E(): 2.3e-95, (63.5% identity in 600 aa overlap)." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F2" /protein_id="SIU02190.1" /translation="MNYAVLPPELNSLRMFTGAGSAPMLAAAVAWDGLAAELGSAASS FGSVTSDLASQAWQGPAAAAMAAAAAPYAGWLSAAAARAAGAAAQAKAVASAFEAARA ATVHPLLVAANRNAFAQLVMSNWFGLNAPLIAAVEGAYEQMWAADVAAMVGYHSGASA AAEQLVPFQQALQQLPNLGIGNIGNANLGGGNTGDLNTGNGNIGNTNLGSGNRGDANL GSGNIGNSNVGGGNVGNGNFGSGNGRAGLPGSGNVGNGNLGNSNLGSGNTGNSNVGFG NTGNNNVGTGNAGSGNIGAGNTGSSNWGFGNNGIGNIGFGNTGNGNIGFGLTGNNQVG IGGLNSGSGNIGLFNSGTNNVGFFNSGNGNLGIGNSSDANVGIGNSGATVGPFVAGHN TGFGNSGSLNTGMGNAGGVNTGFGNGGAINLGFGNSGQLNAGSFNAGSINTGNFNSGQ GNTGDFNAGVRNTGWSNSGLTNTGAFNAGSLNTGFGAVGTGSGPNSGFGNAGTNNSGF FNTGVGSSGFQNGGSNNSGLQNAVGTVIAAGFGNTGAQTVGIANSGVLNSGFFNSGVH NSGGFNSENQRSGFGN" CDS complement(3920180..3921220) /codon_start=1 /transl_table=11 /gene="hsaf" /locus_tag="BQ2027_MB3564C" /product="PROBABLE 4-HYDROXY-2-OXOVALERATE ALDOLASE (HOA)" /note="Mb3564c, -, len: 346 aa. Equivalent to Rv3534c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Probable 4-hydroxy-2-oxovalerate aldolase (EC 4.1.3.-), highly similar to others e.g. P51015|BPHI_PSESP from Pseudomonas sp. strain LB400 (346 aa), FASTA scores: opt: 1150, E(): 2.3e-61, (51.35% identity in 331 aa overlap); Q52040|BPHX3 from Pseudomonas pseudoalcaligenes (346 aa), FASTA scores: opt: 1147, E(): 3.5e-61, (51.35% identity in 331 aa overlap); P51017|NAHM_PSEPU from Pseudomonas putida (346 aa), FASTA scores: opt: 1145, E(): 4.7e-61, (50.9% identity in 330 aa overlap) (see citation below); P51020|MHPE_ECOLI|MHPF|B0352 from Escherichia coli strain K12 (337 aa), FASTA scores: opt: 1133, E(): 2.4e-60, (52.0% identity in 327 aa overlap); O24833|ATDG from Acinetobacter sp (340 aa), FASTA scores: opt: 1132, E(): 2.7e-60, (50.45% identity in 331 aa overlap); etc. Note that also highly similar to Q9ZI56|NAHM 2-OXO-4-HYDROXYPENTANOATE ALDOLASE from Pseudomonas stutzeri (Pseudomonas perfectomarina) (346 aa) FASTA scores: opt: 1168, E(): 2e-62, (51.05% identity in 331 aa overlap) (see citation below). Protein product from Mb3564c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3564c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TW97" /db_xref="InterPro:IPR000891" /db_xref="InterPro:IPR012425" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR017629" /db_xref="InterPro:IPR035685" /db_xref="UniProtKB/Swiss-Prot:Q7TW97" /protein_id="SIU02191.1" /translation="MTDMWDVRITDTSLRDGSHHKRHQFTKDEVGAIVAALDAAGVPV IEVTHGDGLGGSSFNYGFSKTPEQELIKLAAATAKEARIAFLMLPGVGTKDDIKEARD NGGSICRIATHCTEADVSIQHFGLARELGLETVGFLMMAHTIAPEKLAAQARIMADAG CQCVYVVDSAGALVLDGVADRVSALVAELGEDAQVGFHGHENLGLGVANSVAAVRAGA KQIDGSCRRFGAGAGNAPVEALIGVFDKIGVKTGIDFFDIADAAEDVVRPAMPAECLL DRNALIMGYSGVYSSFLKHAVRQAERYGVPASALLHRAGQRKLIGGQEDQLIDIALEI KRELDSGAAVTH" CDS complement(3921217..3922128) /codon_start=1 /transl_table=11 /gene="hsag" /locus_tag="BQ2027_MB3565C" /product="PROBABLE ACETALDEHYDE DEHYDROGENASE (ACETALDEHYDE DEHYDROGENASE [ACETYLATING])" /note="Mb3565c, -, len: 303 aa. Equivalent to Rv3535c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 303 aa overlap). Probable acetaldehyde dehydrogenase (EC 1.2.1.10), highly similar to many e.g. BAB62056|TDNI from Pseudomonas putida (302 aa), FASTA scores: opt: 1159, E(): 1.5e-62, (60.45% identity in 301 aa overlap); Q9ZI57|NAHO from Pseudomonas stutzeri (Pseudomonas perfectomarina) (307 aa) FASTA scores: opt: 1151, E(): 4.6e-62, (59.55% identity in 299 aa overlap); Q9F9I4|CDOI from Comamonas sp. JS765 (302 aa) FASTA scores: opt: 1136, E(): 3.6e-61, (60.15% identity in 301 aa overlap); Q51962|NAHO from Pseudomonas putida (307 aa), FASTA scores: opt: 1133, E(): 5.6e-61, (58.55% identity in 299 aa overlap) (see citation below); P77580|MHPF_ECOLI|MHPF|MHPE|B0351 from Escherichia coli strain K12 (316 aa), FASTA scores: opt: 1040, E(): 2.2e-55, (56.85% identity in 306 aa overlap); etc. Protein product from Mb3565c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3565c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TTR4" /db_xref="InterPro:IPR000534" /db_xref="InterPro:IPR003361" /db_xref="InterPro:IPR015426" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:Q7TTR4" /protein_id="SIU02192.1" /translation="MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGL ARAAKLGLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTPAA VGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEIVASVASVSAG PGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPMIMRDTIFCAIPTDADREA IAASIHDVVKEVQTYVPGYRLLNEPQFDEPSINSGGQALVTTFVEVEGAGDYLPPYAG NLDIMTAAATKVGEEIAKETLVVGGAR" CDS complement(3922139..3922924) /codon_start=1 /transl_table=11 /gene="hsae" /locus_tag="BQ2027_MB3566C" /product="PROBABLE HYDRATASE" /note="Mb3566c, -, len: 261 aa. Equivalent to Rv3536c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Probable hydratase, 2-oxo-hepta-3-ene-1,7-dioate hydratase (EC 4.2.1.-) or 2-keto-4-pentenoate hydratase (EC 4.2.1.-). Indeed, highly similar to many 2-oxo-hepta-3-ene-1,7-dioate hydratases e.g. Q9CKS2|HPAH|PM1534 from Pasteurella multocida (267 aa) FASTA scores: opt: 743, E(): 1.5e-39, (45.5% identity in 266 aa overlap) Q9RZ31|DRA0122 from Deinococcus radiodurans (268 aa), FASTA scores: opt: 709, E(): 2e-37, (45.5% identity in 266 aa overlap); Q9HWQ4|HPCG|PA4127 from Pseudomonas aeruginosa (267 aa), FASTA scores: opt: 703, E(): 4.8e-37, (45.1% identity in 266 aa overlap); Q46982|HPAH|HPCG from Escherichia colis strain ATCC 11105 (267 aa), FASTA scores: opt: 679, E(): 1.6e-35, (41.35% identity in 266 aa overlap); etc. But also highly similar to many 2-keto-4-pentenoate hydratases (2-hydroxypentadienoic acidhydratases) e.g. Q9LAF7|PHED from Bacillus thermoglucosidasius (258 aa), FASTA scores: opt: 698, E(): 9.7e-37, (42.45% identity in 252 aa overlap); Q52442|BPHH from Pseudomonas sp (260 aa) FASTA scores: opt: 675, E(): 2.7e-35, (41.4% identity in 251 aa overlap); P77608|MHPD_ECOLI|B0350 from Escherichia coli strain K12 (269 aa), FASTA scores: opt: 674, E(): 3.2e-35, (42.75% identity in 255 aa overlap); Q52038|BPHX1 from Pseudomonas pseudoalcaligenes (260 aa), FASTA scores: opt: 663, E(): 1.5e-34, (40.6% identity in 251 aa overlap); etc. Protein product from Mb3566c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3566c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4E3" /db_xref="InterPro:IPR011234" /db_xref="InterPro:IPR036663" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4E3" /protein_id="SIU02193.1" /translation="MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQL INIRQRVAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRYLS PRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQIKICDTIADNA SAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDAVLGNPATAVAWLAGKVES FGVRLRKGDIVLPGSCTFAVEARAGDEFVADFTGLGLVRLSFE" CDS 3922997..3924688 /codon_start=1 /transl_table=11 /gene="kstd" /locus_tag="BQ2027_MB3567" /product="PROBABLE DEHYDROGENASE" /note="Mb3567, -, len: 563 aa. Equivalent to Rv3537, len: 563 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 563 aa overlap). Probable dehydrogenase (EC 1.-.-.-), similar to many dehydrogenases or hypothetical proteins e.g. Q9I1M6|PA2243 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (577 aa), FASTA scores: opt: 984, E(): 1.2e-48, (34.75% identity in 573 aa overlap); Q06401|3O1D_COMTE 3-OXOSTEROID 1-DEHYDROGENASE from Comamonas testosteroni (Pseudomonas testosteroni) (573 aa), FASTA scores: opt: 955, E(): 5.5e-47, (33.05% identity in 590 aa overlap); Q9RA02|KSTD1 3-KETOSTEROID DEHYDROGENASE from Rhodococcus erythropolis (510 aa), FASTA scores: opt: 631, E(): 1.4e-28, (39.15% identity in 557 aa overlap); P77815|KSDD 3-KETOSTEROID-1-DEHYDROGENASE from Nocardioides simplex (Arthrobacter simplex) (515 aa), FASTA scores: opt: 469, E(): 2.4e-19, (35.45% identity in 564 aa overlap); etc. Protein product from Mb3567 detected using SWATH mass spectrometry. Mb3567 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003953" /db_xref="InterPro:IPR027477" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G9" /protein_id="SIU02194.1" /translation="MTVQEFDVVVVGSGAAGMVAALVAAHRGLSTVVVEKAPHYGGST ARSGGGVWIPNNEVLKRRGVRDTPEAARTYLHGIVGEIVEPERIDAYLDRGPEMLSFV LKHTPLKMCWVPGYSDYYPEAPGGRPGGRSIEPKPFNARKLGADMAGLEPAYGKVPLN VVVMQQDYVRLNQLKRHPRGVLRSMKVGARTMWAKATGKNLVGMGRALIGPLRIGLQR AGVPVELNTAFTDLFVENGVVSGVYVRDSHEAESAEPQLIRARRGVILACGGFEHNEQ MRIKYQRAPITTEWTVGASANTGDGILAAEKLGAALDLMDDAWWGPTVPLVGKPWFAL SERNSPGSIIVNMSGKRFMNESMPYVEACHHMYGGEHGQGPGPGENIPAWLVFDQRYR DRYIFAGLQPGQRIPSRWLDSGVIVQADTLAELAGKAGLPADELTATVQRFNAFARSG VDEDYHRGESAYDRYYGDPSNKPNPNLGEVGHPPYYGAKMVPGDLGTKGGIRTDVNGR ALRDDGSIIDGLYAAGNVSAPVMGHTYPGPGGTIGPAMTFGYLAALHIADQAGKR" CDS 3924690..3925550 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3568" /product="probable dehydrogenase. possible 2-enoyl acyl-coa hydratase." /note="Mb3568, -, len: 286 aa. Equivalent to Rv3538, len: 286 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 286 aa overlap). Probable dehydrogenase (EC 1.-.-.-), similar to Q9L009|SCC30.12c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (333 aa), FASTA scores: opt: 842, E(): 3.6e-44, (48.4% identity in 285 aa overlap); and similar to C-terminal part of other (principally ESTRADIOL 17 BETA-DEHYDROGENASES/17-BETA-HYDROXYSTEROID DEHYDROGENASES) e.g. P70540 PEROXISOMAL MULTIFUNCTIONAL ENZYME TYPE II (SDR FAMILY) from Rattus norvegicus (Rat) (735 aa) FASTA scores: opt: 622, E(): 1.9e-30, (37.45% identity in 283 aa overlap); or P70523|MPF-2 MULTIFUNCTIONAL PROTEIN 2 (SDR FAMILY) (beta-oxidation protein displaying 2-enoyl-CoA hydratase and D-3-hydroxyacyl-CoA dehydrogenase activity) from Rattus norvegicus (Rat) (734 aa), FASTA scores: opt: 616, E(): 4.3e-30, (37.1% identity in 283 aa overlap); P51659|DHB4_HUMAN|HSD17B4|EDH17B4 ESTRADIOL 17 BETA-DEHYDROGENASE (EC 1.1.1.62) from Homo sapiens (Human) (736 aa), FASTA scores: opt: 614, E(): 5.7e-30, (35.9% identity in 284 aa overlap); P97852|DHB4_RAT|HSD17B4|EDH17B4 ESTRADIOL 17 BETA-DEHYDROGENASE from Rattus norvegicus (Rat) (735 aa) FASTA scores: opt: 613, E(): 6.6e-30, (37.1% identity in 283 aa overlap); Q9DBM3|HSD17B4 ESTRADIOL 17 BETA-DEHYDROGENASE from Mus musculus (Mouse) (735 aa) FASTA scores: opt: 611, E(): 8.7e-30, (36.5% identity in 285 aa overlap); etc. Also similar to Q11198|Rv3389c|MTV004.47c HYPOTHETICAL 30.3 KDA PROTEIN from Mycobacterium tuberculosis (290 aa), FASTA scores: opt: 609, E(): 5.3e-30, (39.65% identity in 285 aa overlap). Note that previously known as ufaA2. Protein product from Mb3568 detected using SWATH mass spectrometry. Mb3568 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002539" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR039569" /db_xref="UniProtKB/TrEMBL:A0A1R3Y637" /protein_id="SIU02195.1" /translation="MPIDLDVALGAQLPPVEFSWTSTDVQLYQLGLGAGSDPMNPREL SYLADDTPQVLPTFGNVAATFHLTTPPTVQFPGIDIELSKVLHASERVEVPAPLPPSG SARAVTRFTDIWDKGKAAVICSETTATTPDGLLLWTQKRSIYARGEGGFGGKRGPSGS DVAPERAPDLQVAMPILPQQALLYRLCGDRNPLHSDPEFAAAAGFPRPILHGLCTYGM TCKAIVDALLDSDATAVAGYGARFAGVAYPGETLTVNVWKDGRRLVASVVAPTRDNAV VLSGVELVPA" CDS 3925687..3927126 /codon_start=1 /transl_table=11 /gene="PPE63" /locus_tag="BQ2027_MB3569" /product="ppe family protein ppe63" /note="Mb3569, PPE63, len: 479 aa. Equivalent to Rv3539, len: 479 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 479 aa overlap). Member of the Mycobacterium tuberculosis PPE protein family, similar to many e.g. O53949|Rv1800|MTV049.22 (655 aa), FASTA scores: opt: 914, E(): 7.3e-47, (37.55% identity in 490 aa overlap); etc. Protein product from Mb3569 detected using SWATH mass spectrometry. Mb3569 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR013228" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y532" /protein_id="SIU02196.1" /translation="MADFLTLSPEVNSARMYAGGGPGSLSAAAAAWDELAAELWLAAA SFESVCSGLADRWWQGPSSRMMAAQAARHTGWLAAAATQAEGAASQAQTMALAYEAAF AATVHPALVAANRALVAWLAGSNVFGQNTPAIAAAEAIYEQMWAQDVVAMLNYHAVAS AVGARLRPWQQLLHELPRRLGGEHSDSTNTELANPSSTTTRITVPGASPVHAATLLPF IGRLLAARYAELNTAIGTNWFPGTTPEVVSYPATIGVLSGSLGAVDANQSIAIGQQML HNEILAATASGQPVTVAGLSMGSMVIDRELAYLAIDPNAPPSSALTFVELAGPERGLA QTYLPVGTTIPIAGYTVGNAPESQYNTSVVYSQYDIWADPPDRPWNLLAGANALMGAA YFHDLTAYAAPQQGIEIAAVTSSLGGTTTTYMIPSPTLPLLLPLKQIGVPDWIVGGLN NVLKPLVDAGYSQYAPTAGPYFSHGNLVW" CDS complement(3927127..3928287) /codon_start=1 /transl_table=11 /gene="ltp2" /locus_tag="BQ2027_MB3570C" /product="probable lipid transfer protein or keto acyl-coa thiolase ltp2" /note="Mb3570c, ltp2, len: 386 aa. Equivalent to Rv3540c, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 386 aa overlap). Probable ltp2, lipid-transfer protein or keto acyl-CoA thiolase (EC 2.3.1.16), similar to several e.g. Q9X4X2|DITF DITF PROTEIN (hypothetical protein, similar to non-specific lipid-transfer protein and 3-ketoacyl-CoA thiolase) from Pseudomonas abietaniphila (397 aa), FASTA scores: opt: 665, E(): 5.3e-34, (33.4% identity in 392 aa overlap); O30255|AF2416 3-KETOACYL-COA THIOLASE (ACAB-12) from Archaeoglobus fulgidus (384 aa), FASTA scores: opt: 496, E(): 1.6e-23, (30.35% identity in 389 aa overlap); O28978|AF1291 3-KETOACYL-COA THIOLASE (ACAB-11) from Archaeoglobus fulgidus (392 aa), FASTA scores: opt: 494, E(): 2.2e-23, (30.6% identity in 379 aa overlap); O26884|MTH793 LIPID-TRANSFER PROTEIN (STEROL OR NONSPECIFIC) from Methanobacterium thermoautotrophicum (383 aa), FASTA scores: opt: 487, E(): 5.9e-23, (30.4% identity in 388 aa overlap); etc. Mb3570c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F4" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F4" /protein_id="SIU02197.1" /translation="MLSGQAAIVGIGATDFSKNSGRSELRLAAEAVLDALADAGLSPT DVDGLTTFTMDTNTEIAVARAAGIGELTFFSKIHYGGGAACATVQHAAMAVATGVADV VVAYRAFNERSGMRFGQVQTRLTENADSTGVDNSFSYPHGLSTPAAQVAMIARRYMHL SGATSRDFGAVSVADRKHAANNPKAYFYGKPITIEDHQNSRWIAEPLRLLDCCQETDG AVAIVVTSAARARDLKQRPVVIEAAAQGCSPDQYTMVSYYRPELDGLPEMGLVGRQLW AQSGLTPADVQTAVLYDHFTPFTLIQLEELGFCGKGEAKDFIADGAIEVGGRLPINTH GGQLGEAYIHGMNGIAEGVRQLRGTSVNPVAGVEHVLVTAGTGVPTSGLILG" CDS complement(3928287..3928676) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3571C" /product="Enoyl coenzyme A hydratase IgrE" /note="Mb3571c, -, len: 129 aa. Equivalent to Rv3541c, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Hypothetical protein, showing some similarity to Q9CBJ7|ML1909 HYPOTHETICAL PROTEIN from Mycobacterium leprae (142 aa) FASTA scores: opt: 110, E(): 1.2, (27.95% identity in 118 aa overlap); and other (see also BLASTP results) e.g. Q9L0M3|SCD82.08 HYPOTHETICAL 15.2 KDA PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 127, E(): 0.086, (27.65% identity in 123 aa overlap). Contains PS00075 Dihydrofolate reductase signature. Protein product from Mb3571c detected using SWATH mass spectrometry. Mb3571c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR029069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02198.1" /translation="MTVVGAVLPELKLYGDPTFIVSTALATRDFQDVHHDRDKAVAQG SKDIFVNILTDTGLVQRYVTDWAGPSALIKSIGLRLGVPWYAYDTVTFSGEVTAVNDG LITVKVVGRNTLGDHVTATVELSMRDS" CDS complement(3928673..3929608) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3572C" /product="Acyl-CoA transferase domain of IgrD / Nucleic-acid-binding domain of IgrD" /note="Mb3572c, -, len: 311 aa. Equivalent to Rv3542c, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 311 aa overlap). Hypothetical protein, showing some similarity to other e.g. Q58947|MJ1552 from Methanococcus jannaschii (141 aa) FASTA scores: opt: 177, E(): 0.00065, (46.65% identity in 60 aa overlap); BAB59276|TVG0142586 from Thermoplasma volcanium (135 aa), FASTA scores: opt: 175, E(): 0.00083, (35.65% identity in 87 aa overlap); Q9HI85|TA1457 from Thermoplasma acidophilum (135 aa), FASTA scores: opt: 162, E(): 0.0052, (31.8% identity in 107 aa overlap); etc. Protein product from Mb3572c detected using SWATH mass spectrometry. Mb3572c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002878" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR029069" /db_xref="InterPro:IPR039375" /db_xref="InterPro:IPR039569" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J6" /protein_id="SIU02199.1" /translation="MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDR NPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGV VATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEM NWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRH PPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLG ELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE" CDS complement(3929605..3930768) /codon_start=1 /transl_table=11 /gene="fadE29" /locus_tag="BQ2027_MB3573C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE29" /note="Mb3573c, fadE29, len: 387 aa. Equivalent to Rv3543c, len: 387 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 387 aa overlap). Probable fadE29, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9A8P3|CC1310 from Caulobacter crescentus (404 aa), FASTA scores: opt: 624, E(): 9.4e-32, (32.75% identity in 400 aa overlap); Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 550, E(): 3.9e-27, (33.7% identity in 350 aa overlap); O28976|AF1293 from Archaeoglobus fulgidus (384 aa), FASTA scores: opt: 529, E(): 8.1e-26, (30.0% identity in 393 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. O53549|FADE26|Rv3504|MTV023.11 (400 aa), FASTA scores: opt: 1031, E(): 2.8e-57, (46.0% identity in 402 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3573c detected using SWATH mass spectrometry. Mb3573c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F9" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F9" /protein_id="SIU02200.1" /translation="MFIDLTPEQRQLQAEIRQYFSNLISPDERTEMEKDRHGPAYRAV IRRMGRDGRLGVGWPKEFGGLGFGPIEQQIFVNEAHRADVPLPAVTLQTVGPTLQAHG SELQKKKFLPAILAGEAHFAIGYTEPEAGTDLASLRTTAVRDGDHYIVNGQKVFTTGA HDADYIWLACRTDPNAAKHKGISILIVDTKDPGYSWTPIILADGAHHTNATYYNDVRV PVDMLVGKENDGWRLITTQLNNERVMLGPAGRFASIYDRVHAWASVPGGNGVTPIDHD DVKRALGEIRAIWRINELLNWQVASAGEDINMADAAATKVFGTERVQRAGRLAEEIVG KYGNPAEPDTAELLRWLDAQTKRNLVITFGGGVNEVMREMIAASGLKVPRVPR" CDS complement(3930753..3931772) /codon_start=1 /transl_table=11 /gene="fadE28" /locus_tag="BQ2027_MB3574C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE28" /note="Mb3574c, fadE28, len: 339 aa. Equivalent to Rv3544c, len: 339 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 339 aa overlap). Probable fadE28, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa), FASTA scores: opt: 334, E(): 5.1e-13, (27.65% identity in 329 aa overlap); Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 278, E(): 1.2e-09, (26.95% identity in 319 aa overlap); O29813|AF0436 from Archaeoglobus fulgidus (382 aa) FASTA scores: opt: 205, E(): 3.5e-05, (24.75% identity in 384 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. O53550|FADE27|Rv3505|MTV023.12 (373 aa) FASTA scores: opt: 497, E(): 7e-23, (30.3% identity in 343 aa overlap); and to P46703|ACDP_MYCLE|FADE25|ACD|ML0737|B1308_F1_34 PROBABLE ACYL-COA DEHYDROGENASE from Mycobacterium leprae (389 aa) FASTA scores: opt: 165, E(): 0.0012, (25.2% identity in 345 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Mb3574c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4H3" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H3" /protein_id="SIU02201.1" /translation="MDFDPTAEQQAVADVVTSVLERDISWEALVCGGVTALPVPERLG GDGVGLFEVGALLTEVGRHGAVTPALATLGLGVVPLLELASAEQQDRFLAGVAKGGVL TAALNEPGAALPDRPATSFVGGRLSGTKVGVGYAEQADWMLVTADNAVVVVSPTADGV RMVRTPTSNGSDEYVMTMDGVAVADCDILADVAAHRVNQLALAVMGAYADGLVAGALR LTADYVANRKQFGKPLSTFQTVAAQLAEVYIASRTIDLVAKSVIWRLAEDLDAGDDLG VLGYWVTSQAPPAMQICHHLHGGMGMDVTYPMHRYYSTIKDLTRLLGGPSHRLELLGA RCSLT" CDS complement(3931772..3933073) /codon_start=1 /transl_table=11 /gene="cyp125" /locus_tag="BQ2027_MB3575C" /product="PROBABLE CYTOCHROME P450 125 CYP125" /note="Mb3575c, cyp125, len: 433 aa. Equivalent to Rv3545c, len: 433 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 433 aa overlap). Probable cyp125, cytochrome P-450 (EC 1.14.-.-), similar to others e.g. Q59723|LINC|CYP111 from Pseudomonas incognita (406 aa), FASTA scores: opt: 831, E(): 8e-45, (34.75% identity in 406 aa overlap); Q9X8Q3|CYP107P1|SCH10.14c from Streptomyces coelicolor (411 aa), FASTA scores: opt: 694, E(): 3.3e-36, (32.35% identity in 417 aa overlap); Q9L465|CYP162A1|NIKQ from Streptomyces tendae (396 aa) FASTA scores: opt: 664, E(): 2.5e-34, (34.15% identity in 413 aa overlap); O08469|CPXY_BACSU|CYPA|CYP107J1 from Bacillus subtilis (410 aa), FASTA scores: opt: 579, E(): 5.6e-29, (30.05% identity in 366 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. Q50696|CYP124|Rv2266|MT2328|MTCY339.44c (428 aa) FASTA scores: opt: 1040, E(): 6.1e-58, (40.75% identity in 432 aa overlap). BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb3575c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3575c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63710" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002397" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/Swiss-Prot:P63710" /protein_id="SIU02202.1" /translation="MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFA ELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKN DIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAA AGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSA ELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNS ITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKG QRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFN AVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH" CDS 3933185..3934360 /codon_start=1 /transl_table=11 /gene="fadA5" /locus_tag="BQ2027_MB3576" /product="PROBABLE ACETYL-COA ACETYLTRANSFERASE FADA5 (ACETOACETYL-COA THIOLASE)" /note="Mb3576, fadA5, len: 391 aa. Equivalent to Rv3546, len: 391 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 391 aa overlap). Probable fadA5, acetyl-CoA acetyltransferase (EC 2.3.1.9), similar to many e.g. Q9AA29|CC0779 from Caulobacter crescentus (390 aa), FASTA scores: opt: 999, E(): 7.1e-54, (43.5% identity in 400 aa overlap); Q9K783|BH3487 from Bacillus halodurans (393 aa), FASTA scores: opt: 843, E(): 2.6e-44, (37.45% identity in 398 aa overlap); Q9RRK9|DR2480 from Deinococcus radiodurans (399 aa), FASTA scores: opt: 826, E(): 2.8e-43, (38.15% identity in 396 aa overlap); P45369|THIL_CHRVI|PHBA from Chromatium vinosum (394 aa) FASTA scores: opt: 790, E(): 4.5e-41, (39.4% identity in 401 aa overlap); etc. Contains PS00737 Thiolases signature 2. BELONGS TO THE THIOLASE FAMILY. Protein product from Mb3576 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3576 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4F3" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4F3" /protein_id="SIU02203.1" /translation="MGYPVIVEATRSPIGKRNGWLSGLHATELLGAVQKAVVDKAGIQ SGLHAGDVEQVIGGCVTQFGEQSNNISRVAWLTAGLPEHVGATTVDCQCGSGQQANHL IAGLIAAGAIDVGIACGIEAMSRVGLGANAGPDRSLIRAQSWDIDLPNQFEAAERIAK RRGITREDVDVFGLESQRRAQRAWAEGRFDREISPIQAPVLDEQNQPTGERRLVFRDQ GLRETTMAGLGELKPVLEGGIHTAGTSSQISDGAAAVLWMDEAVARAHGLTPRARIVA QALVGAEPYYHLDGPVQSTAKVLEKAGMKIGDIDIVEINEAFASVVLSWARVHEPDMD RVNVNGGAIALGHPVGCTGSRLITTALHELERTDQSLALITMCAGGALSTGTIIERI" CDS 3934472..3934927 /codon_start=1 /transl_table=11 /gene="ddn" /locus_tag="BQ2027_MB3577" /product="deazaflavin-dependent nitroreductase ddn" /note="Mb3577, -, len: 151 aa. Equivalent to Rv3547, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. O85698|3SCF60.07 from Streptomyces lividans and Streptomyces coelicolor (149 aa), FASTA scores: opt: 353, E(): 6.3e-17, (42.55% identity in 134 aa overlap); Q9WX21|SCE68.11 from Streptomyces coelicolor (305 aa) FASTA scores: opt: 290, E(): 2.1e-12, (38.5% identity in 122 aa overlap) (similarity in N-terminus for this protein); BAB52932|Q988L5|MLL6688 from Rhizobium loti (Mesorhizobium loti) (148 aa), FASTA scores: opt: 105, E(): 3, (26.75% identity in 86 aa overlap). Also similar to mycobacterial hypothetical proteins e.g. Q9ZH81 from M. paratuberculosis (144 aa), FASTA scores: opt: 366, E(): 8.2e-18, (43.9% identity in 123 aa overlap); and Q10772|YF58_MYCTU|Rv1558|MT1609|MTCY48.07c from M. tuberculosis (148 aa), FASTA scores: opt: 330, E(): 2.2e-15, (39.75% identity in 151 aa overlap); etc. Protein product from Mb3577 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3577 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4I0" /db_xref="InterPro:IPR004378" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I0" /protein_id="SIU02204.1" /translation="MPKSPPRFLNSPLSDFFIKWMSRINTWMYRRNDGEGLGGTFQKI PVALLTTTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKANPKVQVQI KKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCEP" CDS complement(3935010..3935924) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3578C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb3578c, -, len: 304 aa. Equivalent to Rv3548c, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 304 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases/reductases (generally belonging to the SDR FAMILY) e.g. Q9I4V1|PA1023 from Pseudomonas aeruginosa (305 aa), FASTA scores: opt: 446, E(): 1.7e-17, (43.75% identity in 256 aa overlap); Q9A6K0|CC2093 from Caulobacter crescentus (301 aa) FASTA scores: opt: 437, E(): 5.3e-17, (42.8% identity in 257 aa overlap); Q9HYH8|PA3427 from Pseudomonas aeruginosa (303 aa), FASTA scores: opt: 399, E(): 6.5e-15, (45.5% identity in 257 aa overlap); Q9VXJ0|CG3415 from Drosophila melanogaster (Fruit fly) (598 aa), FASTA scores: opt: 402, E(): 7.5e-15, (40.7% identity in 285 aa overlap); etc. Also highly similar to O53547|Rv3502c|MTV023.09c PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from (317 aa) FASTA scores: opt: 739, E(): 1.6e-33, (45.15% identity in 310 aa overlap); and other proteins from Mycobacterium tuberculosis. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3578c detected using SWATH mass spectrometry. Mb3578c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y654" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y654" /protein_id="SIU02205.1" /translation="MGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDG SPASGGSAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAVETYGGVDVLVNNAG IVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASHWRGLSKAGKAPKDIDARIINTS SGAGLQGSVGQGNYSAAKAGIAALTLVGAAEMRRYGVTVNAIAPAARTRMTETVFAEV MAKPQEGFDAMAPENVSPLVVWLGSAESRDVTGKVFEVEGGIIRVAEGWAHGPQVDKG VKWDPAELGPVVSDLLAKSRPPVPVYGA" CDS complement(3935947..3936726) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3579C" /product="PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE" /note="Mb3579c, -, len: 259 aa. Equivalent to Rv3549c, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases (generally belong to the SDR FAMILY) e.g. Q9UKU3 from Homo sapiens (Human) (270 aa), FASTA scores: opt: 451, E(): 4.8e-21, (38.05% identity in 247 aa overlap); Q9S274|SCI28.09c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 439, E(): 2.4e-20, (36.8% identity in 231 aa overlap); Q9PFI6|XF0671 from Xylella fastidiosa (247 aa), FASTA scores: opt: 437, E(): 3.4e-20, (37.7% identity in 252 aa overlap); etc. Also highly similar to O33308|FABG5|Rv2766c|MTV002.31c ALCOHOL DEHYDROGENASE (SDR FAMILY) from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 504, E(): 2.3e-24, (38.5% identity in 244 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Mb3579c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y543" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y543" /protein_id="SIU02206.1" /translation="MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVI TCARRAVDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAEAT HNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTPGTAAYGAAKA GLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAESIARVAATVPLGRLARPA DIGWAAAFLASDAASYISGATLEVHGGGEPPPYLGASSANK" CDS 3936781..3937524 /codon_start=1 /transl_table=11 /gene="echA20" /locus_tag="BQ2027_MB3580" /product="PROBABLE ENOYL-COA HYDRATASE ECHA20 (ENOYL HYDRASE) (UNSATURATED ACYL-COA HYDRATASE) (CROTONASE)" /note="Mb3580, echA20, len: 247 aa. Equivalent to Rv3550, len: 247 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 247 aa overlap). Probable echA20, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. Q9A7B0|CC1814 from Caulobacter crescentus (275 aa), FASTA scores: opt: 488, E(): 3.5e-24, (36.4% identity in 239 aa overlap); O84978|PHAA from Pseudomonas putida (293 aa), FASTA scores: opt: 383, E(): 2e-17, (33.85% identity in 254 aa overlap); BAB48479|Q98LI4|MLL1009 from Rhizobium loti (Mesorhizobium loti) (258 aa), FASTA scores: opt: 378, E(): 3.8e-17, (21.45% identity in 231 aa overlap); etc. COULD BELONG TO THE ENOYL-COA HYDRATASE/ISOMERASE FAMILY. Protein product from Mb3580 detected using SWATH mass spectrometry. Mb3580 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4H1" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H1" /protein_id="SIU02207.1" /translation="MPITSTTPEPGIVAVTVDYPPVNAIPSKAWFDLADAVTAAGANS DTRAVILRAEGRGFNAGVDIKEMQRTEGFTALIDANRGCFAAFRAVYECAVPVIAAVN GFCVGGGIGLVGNSDVIVASEDATFGLPEVERGALGAATHLSRLVPQHLMRRLFFTAA TVDAATLQHFGSVHEVVSRDQLDEAALRVARDIAAKDTRVIRAAKEALNFIDVQRVNA SYRMEQGFTFELNLAGVADEHRDAFVKKS" CDS 3937524..3938402 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3581" /product="POSSIBLE COA-TRANSFERASE (ALPHA SUBUNIT)" /note="Mb3581, -, len: 292 aa. Equivalent to Rv3551, len: 292 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 292 aa overlap). Possible CoA-transferase, alpha subunit (EC 2.8.3.-), similar in part to other CoA-transferases e.g. Q59111|GCTA_ACIFE|GCTA GLUTACONATE COA-TRANSFERASE SUBUNIT A (EC 2.8.3.12) (GCT LARGE SUBUNIT) from Acidaminococcus fermentans (319 aa) FASTA scores: opt: 247, E(): 6.3e-09, (27.35% identity in 307 aa overlap); Q9XD83|PCAI from Streptomyces sp. 2065 (251 aa), FASTA scores: opt: 222, E(): 2.3e-07, (27.55% identity in 243 aa overlap); BAB50895|MLL4183 from Rhizobium loti (Mesorhizobium loti) (285 aa), FASTA scores: opt: 206, E(): 2.8e-06, (27.4% identity in 281 aa overlap); etc. Also some similarity with O06167|SCOA_MYCTU|RVv504c|MT2579|MTCY07A7.10c PROBABLE SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE SUBUNIT A from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 210, E(): 1.4e-06, (25.5% identity in 247 aa overlap). BELONGS TO THE GLUTACONATE COA-TRANSFERASE SUBUNIT A FAMILY. Note that this putative protein may combine with the putative protein encoded by the downstream ORF Rv3552 to form a CoA-transferase that comprises two subunits. Protein product from Mb3581 detected using SWATH mass spectrometry. Mb3581 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4L3" /db_xref="InterPro:IPR004165" /db_xref="InterPro:IPR037171" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L3" /protein_id="SIU02208.1" /translation="MPDKRTSLDDAVAQLRSGMTIGIAGWGSRRKPMAFVRAILRSDV TDLTVVTYGGPDLGLLCSAGKVKRVYYGFVSLDSPPFYDPWFAHARTSGAIEAREMDE GMLRCGLQAAAQRLPFLPIRAGLGSSVPQFWAGELQTVTSPYPAPGGGYETLIAMPAL RLDAAFAHLNLGDSHGNAAYTGIDPYFDDLFLMAAERRFLSVERIVATEELVKSVPPQ ALLVNRMMVDAIVEAPGGAHFTTAAPDYGRDEQFQRHYAEAASTQVGWQQFVHTYLSG TEADYQAAVHNFGASR" CDS 3938399..3939151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3582" /product="POSSIBLE COA-TRANSFERASE (BETA SUBUNIT)" /note="Mb3582, -, len: 250 aa. Equivalent to Rv3552, len: 250 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Possible CoA-transferase, beta subunit (EC 2.8.3.-), similar in part to other CoA-transferases e.g. Q9I6R1|PA0227 from Pseudomonas aeruginosa (260 aa), FASTA scores: opt: 233, E(): 8.6e-08, (24.8% identity in 238 aa overlap); BAB50894|MLL4181 from Rhizobium loti (Mesorhizobium loti) (264 aa), FASTA scores: opt: 210, E(): 2.6e-06, (24.15% identity in 203 aa overlap); and AAK41345|Q97Z51|GCTB from Sulfolobus solfataricus (245 aa), FASTA scores: opt: 122, E(): 1.1, (25.5% identity in 243 aa overlap). POSSIBLY BELONGS TO THE GLUTACONATE COA-TRANSFERASE SUBUNIT B FAMILY. Note that this putative protein may combine with the putative protein encoded by the upstream ORF Rv3551 to form a CoA-transferase that comprises two subunits. Protein product from Mb3582 detected using SWATH mass spectrometry. Mb3582 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63653" /db_xref="InterPro:IPR004165" /db_xref="InterPro:IPR037171" /db_xref="UniProtKB/Swiss-Prot:P63653" /protein_id="SIU02209.1" /translation="MSTRAEVCAVACAELFRDAGEIMISPMTNMASVGARLARLTFAP DILLTDGEAQLLADTPALGKTGAPNRIEGWMPFGRVFETLAWGRRHVVMGANQVDRYG NQNISAFGPLQRPTRQMFGVRGSPGNTINHATSYWVGNHCKRVFVEAVDVVSGIGYDK VDPDNPAFRFVNVYRVVSNLGVFDFGGPDHSMRAVSLHPGVTPGDVRDATSFEVHDLD AAEQTRLPTDDELHLIRAVIDPKSLRDREIRS" CDS 3939249..3940316 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3583" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3583, -, len: 355 aa. Equivalent to Rv3553, len: 355 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 355 aa overlap). Possible oxidoreductase (EC 1.-.-.-), highly similar (except in C-terminus) to Q9A327|CC3379 HYPOTHETICAL PROTEIN from Caulobacter crescentus (321 aa), FASTA scores: opt: 639, E(): 4.6e-29, (46.35% identity in 248 aa overlap); and Q9WZQ7|TM0800 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (314 aa), FASTA scores: opt: 622, E(): 4.1e-28, (37.95% identity in 340 aa overlap). Also similar to two TRANS-2-ENOYL-ACP REDUCTASES; Q99YD4|FABK|SPY1751 from Streptococcus pyogenes (323 aa), FASTA scores: opt: 604, E(): 4.4e-27, (33.25% identity in 346 aa overlap); and Q9FBC5|FABK from Streptococcus pneumoniae (324 aa), FASTA scores: opt: 553, E(): 3.3e-24, (32.1% identity in 346 aa overlap); and similar with several 2-NITROPROPANE DIOXYGENASES, e.g. Q9F7P8 from uncultured proteobacterium EBAC31A08 (322 aa), FASTA scores: opt: 505, E(): 1.7e-21, (33.6% identity in 348 aa overlap); Q9FMG0 (alias AAK44141) from Arabidopsis thaliana (Mouse-ear cress) (333 aa), FASTA scores: opt: 489, E(): 1.4e-20, (33.15% identity in 341 aa overlap); O28109|AF2173 (NCD2) from Archaeoglobus fulgidus (274 aa), FASTA scores: opt: 456, E(): 8.9e-19, (36.3% identity in 237 aa overlap); etc." /db_xref="GOA:A0A1R3Y4H7" /db_xref="InterPro:IPR004136" /db_xref="InterPro:IPR013785" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H7" /protein_id="SIU02210.1" /translation="MRLRTPLTELIGIEHPVVQTGMGWVAGARLVSATANAGGLGILA SATMTLDELAAAITKVKAVTDKPFGVNIRADAADAGDRVELMIREGVRVASFALAPKQ QLIARLKEAGAVVIPSIGAAKHARKVAAWGADAMIVQGGEGGGHTGPVATTLLLPSVL DAVAGTGIPVIAAGGFFDGRGLAAALCYGAAGVAMGTRFLLTSDSTVPDAVKRRYLQA GLDGTVVTTRVDGMPHRVLRTELVEKLESGSRARGFAAALRNAGKFRRMSQMTWRSMI RDGLTMRHGKELTWSQVLMAANTPMLLKAGLVDGNTEAGVLASGQVAGILDDLPSCKE LIESIVLDAITHLQTASALVE" CDS 3940313..3942370 /codon_start=1 /transl_table=11 /gene="fdxB" /locus_tag="BQ2027_MB3584" /product="POSSIBLE ELECTRON TRANSFER PROTEIN FDXB" /note="Mb3584, fdxB, len: 685 aa. Equivalent to Rv3554, len: 685 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 685 aa overlap). Possible fdxB, two-domain protein, with ferredoxin reductase electron transfer component in C-terminal part (EC 1.-.-.-) and unknown function in N-terminal part. Indeed, N-terminal end is similar to O85832 HYPOTHETICAL 36.1 KDA PROTEIN from Sphingomonas aromaticivorans strain F199 (catabolic plasmid pNL1) (309 aa), FASTA scores: opt: 615, E(): 2.5e-30, (33.1% identity in 311 aa overlap); and P73428|SLL1468 HYPOTHETICAL 36.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (312 aa), FASTA scores: opt: 317, E(): 4.5e-12, (30.2% identity in 268 aa overlap). And C-terminal end is similar to Q9F9U6|PAAE protein involved in aerobic phenylacetate metabolism from Azoarcus evansii (360 aa), FASTA scores: opt: 935, E(): 7e-50, (43.85% identity in 351 aa overlap); CAC44653|PAAE|SCBAC17A6.08 PUTATIVE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE from Streptomyces coelicolor (368 aa), FASTA scores: opt: 93, E(): 9.5e-50, (41.95% identity in 372 aa overlap); Q9FA57|PACI FERREDOXIN from Azoarcus evansii (360 aa), FASTA scores: opt: 925, E(): 2.9e-49, (43.3% identity in 351 aa overlap); P76081|PAAE_ECOLI|B1392 PROBABLE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE from Escherichia coli strains K12 and W (356 aa), FASTA scores: opt: 910, E(): 2.4e-48, (43.05% identity in 353 aa overlap); Q9APJ6|PAAE ELECTRON TRANSFER PROTEIN (FRAGMENT) from Hyphomicrobium chloromethanicum (241 aa), FASTA scores: opt: 404, E(): 1.7e-17, (35.45% identity in 234 aa overlap); BAB51608|MLL5100 FERREDOXIN from Rhizobium loti (Mesorhizobium loti) (365 aa), FASTA scores: opt: 316, E(): 5.8e-12, (28.95% identity in 349 aa overlap); etc. C-terminus also similar to P96853|Rv3571|MTCY06G11.18 PUTATIVE ELECTRON TRANSFER PROTEIN from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 450, E(): 3.6e-20, (32.95% identity in 358 aa overlap). Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature. BELONGS TO THE 2FE2S PLANT-TYPE FERREDOXIN FAMILY. COFACTOR: BINDS A 2FE-2S CLUSTER (BY SIMILARITY). Protein product from Mb3584 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3584 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4H5" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001433" /db_xref="InterPro:IPR005804" /db_xref="InterPro:IPR006058" /db_xref="InterPro:IPR008333" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR036010" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H5" /protein_id="SIU02211.1" /translation="MTDACQAEYAIAAMSTVEMDQAAPESAAHHPLPDPGESVPRLAL PTIGIFLATLTAFVGSTTAYISGWIPFWVTIPVNAAVTFVMFTVVHDASHYAISSIRW VNGLFGRLAWLFVGPVVAFPAFGYIHIQHHRHSNDDEQDPDTFASHGSLWVLPLRWSM VEYFYIKYYLPRGRSRPVIEVAETLVMMTLFLTGLIVAIVTGNFWTLAIVFLIPQRIG LTVLAWWFDWLPHHGLEDTQRSNRYRATRNRVGAEWLFTPVLLSQNYHLVHHLHPSVP FYRYLRTWRRNEEAYLERNAAISTVFGQQLNPDEYRQWKELNGRLARLLPVRMPARSS SPHAVLHRIPVASVDPITADATLVTFAVPEALRDAFRFEPGQHVTVRTDLGGQGIRRN YSICAPATRAQLRIAVKHIPGGAFSTFVANELKAGDVLELMTPTGRFGTPLDPLHRKH YVGLVAGSGITPVLSILATTLEIETESRFTLIYGNRTKESTMFRAELDRLESRYADRL EILHVLSSEPLHTPELRGRIDRDKLTRWLTSTLRPAGVDEWFICGPLAMATAVRETLI EHGVDSERIHLELFYGFDTPPATRPSYAGATVTFTLSGQRAIFDLVPGDSILEGALGL RSDAPYACMGGACGTCRAKLIEGNVEMDHNFALRKAELDAGYILTCQSHPTTPFVAVD YDA" CDS complement(3942458..3943327) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3585C" /product="conserved protein" /note="Mb3585c, -, len: 289 aa. Equivalent to Rv3555c, len: 289 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 289 aa overlap). Hypothetical protein, highly similar to others from Mycobacterium tuberculosis e.g. O53562|AL022022|Rv3517|MTV023.24 (279 aa), FASTA scores: opt: 874, E(): 8.3e-48, (49.45% identity in 275 aa overlap); P71763|Rv1482c|MTCY277.03c (339 aa), FASTA scores: opt: 755, E(): 3e-40, (45.75% identity in 260 aa overlap); O69681|Rv3714c|MTV025.062c (296 aa), FASTA scores: opt: 733, E(): 6.4e-39, (44.1% identity in 281 aa overlap); etc. Also highly similar to other mycobacterial hypothetical proteins e.g. O07396|MAV346 from M. avium (346 aa), FASTA scores: opt: 714, E(): 1.1e-37, (44.6% identity in 260 aa overlap); and Q50134|U650AG|MLCB57.67c from Mycobacterium leprae (75 aa), FASTA scores: opt: 130, E(): 0.17, (35.1% identity in 57 aa overlap) (only partial homology with this protein). Shows some similarity to P52392|NHSR_STRAS PUTATIVE NOSIHEPTIDE RESISTANCE REGULATORY PROTEIN (ORF699) from Streptomyces actuosus (233 aa), FASTA scores: opt: 120, E(): 1.9, (25.25% identity in 194 aa overlap). Mb3585c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007569" /db_xref="InterPro:IPR011335" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02212.1" /translation="MDELPWPVLGSEVLAAKAIPERAMRQLYEPVYPGVYAPAGVELT ARQRAHAAWLWSRRRAVVAGNSAAALLGAKWVNPALDAELVHANRKPPPRIVVHTDRL APHETVAVDGVAVTTPARTAFDIGRRTPSRLQAVQRLDALANSTDVKVADVQAVIAEH TGARGLVRLRAVLPLIDGGAESPQETWTRLVLIDAGLPKPQTQIRVFDDYGDFVARID LGYEQLRVGVEYDGPQHWTDPAQRARDIERSTALLDLGWTIIRVTSELLRYRRGTFVG RVDAAMRAAGWRP" CDS complement(3943432..3944592) /codon_start=1 /transl_table=11 /gene="fadA6" /locus_tag="BQ2027_MB3586C" /product="PROBABLE ACETYL-COA ACETYLTRANSFERASE FADA6 (ACETOACETYL-COA THIOLASE)" /note="Mb3586c, fadA6, len: 386 aa. Equivalent to Rv3556c, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 386 aa overlap). Probable fadA6, acetyl-CoA acetyltransferase (EC 2.3.1.9), similar to many e.g. Q9K409|2SCG61.06c from Streptomyces coelicolor (389 aa), FASTA scores: opt: 1091, E(): 2.9e-58, (48.1% identity in 399 aa overlap); Q9AAT4|CC0510 from Caulobacter crescentus (391 aa), FASTA scores: opt: 902, E(): 6.6e-47, (40.25% identity in 395 aa overlap); P45359|THL_CLOAB from Clostridium acetobutylicum (392 aa), FASTA scores: opt: 872, E(): 4.2e-45, (37.9% identity in 396 aa overlap); Q9I2A8|ATOB|PA2001 from Pseudomonas aeruginosa (393 aa), FASTA scores: opt: 872, E(): 4.2e-45, (41.3% identity in 397 aa overlap); etc. Contains PS00737 Thiolases signature 2. BELONGS TO THE THIOLASE FAMILY. Protein product from Mb3586c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3586c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4G6" /db_xref="InterPro:IPR002155" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR020613" /db_xref="InterPro:IPR020616" /db_xref="InterPro:IPR020617" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4G6" /protein_id="SIU02213.1" /translation="MTEAYVIDAVRTAVGKRGGALAGIHPVDLGALAWRGLLDRTDID PAAVDDVIAGCVDAIGGQAGNIARLSWLAAGYPEEVPGVTVDRQCGSSQQAISFGAQA IMSGTADVIVAGGVQNMSQIPISSAMTVGEQFGFTSPTNESKQWLHRYGDQEISQFRG SELIAEKWNLSREEMERYSLTSHERAFAAIRAGHFENEIITVETESGPFRVDEGPRES SLEKMAGLQPLVEGGRLTAAMASQISDGASAVLLASERAVKDHGLRPRARIHHISARA ADPVFMLTGPIPATRYALDKTGLAIDDIDTVEINEAFAPVVMAWLKEIKADPAKVNPN GGAIALGHPLGATGAKLFTTMLGELERIGGRYGLQTMCEGGGTANVTIIERL" CDS complement(3944657..3945259) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3587C" /product="transcriptional regulatory protein (probably tetr-family)" /note="Mb3587c, -, len: 200 aa. Equivalent to Rv3557c, len: 200 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 200 aa overlap). Probable transcriptional regulator, tetR family, similar to other e.g. Q9RRV9|DR2376 from Deinococcus radiodurans (197 aa) FASTA scores: opt: 326, E(): 2.3e-14, (31.2% identity in 189 aa overlap); Q9HZW2|PA2885 from Pseudomonas aeruginosa (198 aa), FASTA scores: opt: 308, E(): 3.5e-13, (31.55% identity in 187 aa overlap); Q9RFR4 from Pseudomonas fluorescens (207 aa), FASTA scores: opt: 291, E(): 4.7e-12, (29.75% identity in 195 aa overlap); Q9K8P5|BH2958 from Bacillus halodurans (215 aa), FASTA scores: opt: 271, E(): 9.9e-11, (23.95% identity in 192 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. O53641|Rv0158|MTV032.01 (214 aa), FASTA scores: opt: 232, E(): 3.5e-08, (25.5% identity in 192 aa overlap); and O06169|Rv2506|MTCY07A7.12 (215 aa), FASTA scores: opt: 215, E(): 4.5e-07, (35.15% identity in 148 aa overlap); etc. SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3587c detected using SWATH mass spectrometry. Mb3587c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4J0" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="InterPro:IPR041490" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J0" /protein_id="SIU02214.1" /translation="MDRVAGQVNSRRGELLELAAAMFAERGLRATTVRDIADGAGILS GSLYHHFASKEEMVDELLRGFLDWLFARYRDIVDSTANPLERLQGLFMASFEAIEHHH AQVVIYQDEAQRLASQPRFSYIEDRNKQQRKMWVDVLNQGIEEGYFRPDLDVDLVYRF IRDTTWVSVRWYRPGGPLTAQQVGQQYLAIVLGGITKEGV" CDS 3945608..3947266 /codon_start=1 /transl_table=11 /gene="PPE64" /locus_tag="BQ2027_MB3588" /product="ppe family protein ppe64" /note="Mb3588, PPE64, len: 552 aa. Equivalent to Rv3558, len: 552 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 552 aa overlap). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 1908, E(): 1.7e-83, (58.5% identity in 583 aa overlap). Protein product from Mb3588 detected using SWATH mass spectrometry. Mb3588 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR002989" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y660" /protein_id="SIU02215.1" /translation="MAHFSVLPPEINSLRMYLGAGSAPMLQAAAAWDGLAAELGTAAS SFSSVTTGLTGQAWQGPASAAMAAAAAPYAGFLTTASAQAQLAAGQAKAVASVFEAAK AAIVPPAAVAANREAFLALIRSNWLGLNAPWIAAVESLYEEYWAADVAAMTGYHAGAS QAAAQLPLPAGLQQFLNTLPNLGIGNQGNANLGGGNTGSGNIGNGNKGSSNLGGGNIG NNNIGSGNRGSDNFGAGNVGTGNIGFGNQGPIDVNLLATPGQNNVGLGNIGNNNMGFG NTGDANTGGGNTGNGNIGGGNTGNNNFGFGNTGNNNIGIGLTGNNQMGINLAGLLNSG SGNIGIGNSGTNNIGLFNSGSGNIGVFNTGANTLVPGDLNNLGVGNSGNANIGFGNAG VLNTGFGNASILNTGLGNAGELNTGFGNAGFVNTGFDNSGNVNTGNGNSGNINTGSWN AGNVNTGFGIITDSGLTNSGFGNTGTDVSGFFNTPTGPLAVDVSGFFNTASGGTVING QTSGIGNIGVPGTLFGSVRSGLNTGLFNMGTAISGLFNLRQLLG" CDS complement(3947275..3948063) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3589C" /product="probable oxidoreductase" /note="Mb3589c, -, len: 262 aa. Equivalent to Rv3559c, len: 262 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 262 aa overlap). Putative oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases e.g. Q9F5J1|SIM-NJ1|SIMD2 PUTATIVE 3-KETO-ACYL-REDUCTASE (SDR FAMILY) from Streptomyces antibioticus (273 aa), FASTA scores: opt: 510, E(): 2.8e-24, (40.15% identity in 249 aa overlap);Q9L2C9|SC7A8.29 PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (255 aa), FASTA scores: opt: 500, E(): 1.1e-23, (41.4% identity in 239 aa overlap); Q9HQ41|FABG|VNG1341G 3-OXOACYL-[ACYL-CARRIER-PROTEIN] REDUCTASE from Halobacterium sp. strain NRC-1 (255 aa) FASTA scores: opt: 500, E(): 1.1e-23, (40.0% identity in 250 aa overlap); etc. Also similar to oxidoreductases from Mycobacterium tuberculosis eg Q11020|YD50_MYCTU|FABG2|Rv1350|MT1393|MTCY02B10.14 PUTATIVE OXIDOREDUCTASE (247 aa), FASTA scores: opt: 497, E(): 1.6e-23, (39.2% identity in 245 aa overlap). Protein product from Mb3589c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y547" /protein_id="SIU02216.1" /translation="MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGA DVVISDHHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDVLV NNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGGVIVNNASVLG WRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSPSIARHKFLDKTASAELLD RLVAGEAFGRAAEPWEVAATIAFLASDYSSYLTGEVISVSCQHP" CDS complement(3948060..3949217) /codon_start=1 /transl_table=11 /gene="fadE30" /locus_tag="BQ2027_MB3590C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE30" /note="Mb3590c, fadE30, len: 385 aa. Equivalent to Rv3560c, len: 385 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 385 aa overlap). Probable fadE30, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 845, E(): 1.6e-47, (39.2% identity in 388 aa overlap); Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 734, E(): 2.8e-40, (35.5% identity in 386 aa overlap); Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 656, E(): 3.2e-35, (37.9% identity in 351 aa overlap); etc. Also similar to acyl-CoA dehydrogenases from Mycobacterium tuberculosis e.g. P95280|FADE17|Rv1934c|MTCY09F9.30 (409 aa), FASTA scores: opt: 939, E(): 1.4e-53, (43.8% identity in 404 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3590c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4H6" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4H6" /protein_id="SIU02217.1" /translation="MQDVEEFRAQVRGWLADNLAGEFAALKGLGGPGREHEAFEERRA WNQRLAAAGLTCLGWPEEHGGRGLSTAHRVAFYEEYARADAPDKVNHFGEELLGPTLI AFGTPQQQRRFLPRIRDVTELWCQGYSEPGAGSDLASVATTAELDGDQWVINGQKVWT SLAHLSQWCFVLARTEKGSQRHAGLSYLLVPLDQPGVQIRPIVQITGTAEFNEVFFDD ARTDADLVVGAPGDGWRVAMATLTFERGVSTLGQQIVYARELSNLVELARRTAAADDP LIRERLTRAWTGLRAMRSYALATMEGPAVEQPGQDNVSKLLWANWHRNLGELAMDVIG KPGMTMPDGEFDEWQRLYLFTRADTIYGGSNEIQRNIIAERVLGLPREAKG" CDS 3949265..3950788 /codon_start=1 /transl_table=11 /gene="fadD3" /locus_tag="BQ2027_MB3591" /product="PROBABLE FATTY-ACID-COA LIGASE FADD3 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)" /note="Mb3591, fadD3, len: 507 aa. Equivalent to Rv3561, len: 507 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 507 aa overlap). Probable fadD3, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many substrate-CoA symthetases/ligases e.g. Q9KBC2|BH2006 LONG-CHAIN ACYL-COA SYNTHETASE from Bacillus halodurans (513 aa), FASTA scores: opt: 821, E(): 1.6e-43, (32.9% identity in 517 aa overlap); Q9EY88|FCS FERULOYL-COA SYNTHETASE from Amycolatopsis sp. HR167 (491 aa) FASTA scores: opt: 767, E(): 3.5e-40, (37.65% identity in 502 aa overlap); Q9ZIP5|MATB MALONYL CoA SYNTHETASE from Rhizobium leguminosarum (504 aa), FASTA scores: opt: 758, E(): 1.3e-39, (33.7% identity in 472 aa overlap); Q9CD27|FADD2|ML2546 ACYL-COA SYNTHASE from Mycobacterium leprae (548 aa), FASTA scores: opt: 700, E(): 5.6e-36, (31.85% identity in 515 aa overlap); P29212|LCFA_ECOLI|FADD|OLDD|B1805 LONG-CHAIN-FATTY-ACID--COA LIGASE from Escherichia coli strain K12 (561 aa), FASTA scores: opt: 532, E(): 6.3e-28, (30.0% identity in 533 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis eg O53306|FADD13|Rv3089|MTV013.10 (503 aa), FASTA scores: opt: 819, E(): 2.1e-43, (35.1% identity in 490 aa overlap). Contains PS00455 Putative AMP-binding domain signature. Protein product from Mb3591 detected using SWATH mass spectrometry. Mb3591 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4L9" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L9" /protein_id="SIU02218.1" /translation="MINDLRTVPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGA AAALIALGVEPADRVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILDRA GAPVLFAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIATGAGALDAVA ARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASASWAANGKITSDDRYLCINP FFHNFGYKAGILACLQTGATLIPHVTFDPLHALRAIERHRITVLPGPPTIYQSLLDHP ARKDFDLSSLRFAVTGAATVPVVLVERMQSELDIDIVLTAYGLTEANGMGTMCRPEDD AVTVATTCGRPFADFELRIADDGEVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDI GAVDQAGNLRINDRLKDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVG RAFVVARPGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSKPQLRELG" CDS 3950789..3951922 /codon_start=1 /transl_table=11 /gene="fadE31" /locus_tag="BQ2027_MB3592" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE31" /note="Mb3592, fadE31, len: 377 aa. Equivalent to Rv3562, len: 377 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 377 aa overlap). Probable fadE31, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 657, E(): 1.7e-34, (36.45% identity in 351 aa overlap); Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 653, E(): 3.2e-34, (33.95% identity in 392 aa overlap); Q9EX72|MLHC from Rhodococcus erythropolis (324 aa) FASTA scores: opt: 631, E(): 6.5e-33, (36.95% identity in 330 aa overlap); P45867|ACDA_BACSU|ACD from Bacillus subtilis (379 aa), FASTA scores: opt: 347, E(): 1e-15, (28.6% identity in 385 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. P96842|FADE30|Rv3560c|MTCY06G11.07c (385 aa), FASTA scores: opt: 843, E(): 2.3e-46, (38.95% identity in 380 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3592 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4M5" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4M5" /protein_id="SIU02219.1" /translation="MDLNFDDETLAFQAEVREFLAANAASIPTKSYDNAEGFAQHRYW DRVLFDAGLSVITWPAKYGGRDAPLLHWIVFEEEYFRAGAPGRASANGTSMLAPTLFA HGTAEQLDRILPKMASGEQIWAQAWSEPESGSDLASLRSTASKVDGGWLLNGQKIWSS RAPFADMGFGLFRSDPAVERHRGLTYFMFDLKAKGVTVRPIAQLGGDTGFGEIFLDDV FVPDRDVIGAPNDGWRAAMSTSSNERGMSLRSPARFLASAERLVQLWKDRGSPPEFAD RVADAWIKAQAYRLQTFGTVTRLAAGGELGAESSVTKVFWSELDVHLHQTALDLRGAD GELAGPWTEGLLFALGGPIYAGTNEIQRNIIAERLLGLPREKT" CDS 3951919..3952878 /codon_start=1 /transl_table=11 /gene="fadE32" /locus_tag="BQ2027_MB3593" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE32" /note="Mb3593, fadE32, len: 319 aa. Equivalent to Rv3563, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 319 aa overlap). Probable fadE32, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9I4V4|PA1020 from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 347, E(): 7.6e-14, (35.15% identity in 333 aa overlap); Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa), FASTA scores: opt: 300, E(): 5.3e-11, (32.4% identity in 349 aa overlap); Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 285, E(): 4.1e-10, (30.4% identity in 329 aa overlap); P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 230, E(): 1.1e-07, (25.5% identity in 357 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis eg P96846|FADE33|Rv3564|MTCY06G11.11 (318 aa), FASTA scores: opt: 478, E(): 7.6e-22, (32.9% identity in 292 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3593 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4I6" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I6" /protein_id="SIU02220.1" /translation="MTMEFALNEQQRDFAASIDAALGAADLPGVVRAWAAGDVAPGRK VWQQLANLGVTALGVAEKFDGLGASPVDLVVALERLGRWCVPGPVTESIAVAPILLAH DDRAERSHGLASGELIATVAMPPRVPRAVDADTAGLVLLAGDGSVTEGTPGDCHRSVD PSRRLYEVAASGQAWRAPKDVVARAYEFGALATAAQLVGAGQALLEAAVNYAKQRTQF GRAIGSYQAIKHKLADVHIAIELACPLVYGAAVSLEPRDVSAAKAAASEAALLAARSA LQTHGAIGFTCEHDLSLWLLRVQALHSAWGTPQEHRRRVLEAL" CDS 3952875..3953831 /codon_start=1 /transl_table=11 /gene="fadE33" /locus_tag="BQ2027_MB3594" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE33" /note="Mb3594, fadE33, len: 318 aa. Equivalent to Rv3564, len: 318 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 318 aa overlap). Probable fadE33, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to others e.g. Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 373, E(): 1.9e-15, (34.3% identity in 338 aa overlap); Q9I4V4|PA1020 from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 277, E(): 1.4e-09, (31.95% identity in 335 aa overlap); Q9X7Y6|SC6A5.40c from Streptomyces coelicolor (395 aa), FASTA scores: opt: 273, E(): 2.5e-09, (30.1% identity in 352 aa overlap); P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 478, E(): 7.9e-22, (32.9% identity in 292 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. P96845|FADE32|Rv3563|MTCY06G11.10 (319 aa), FASTA scores: opt: 478, E(): 7.9e-22, (32.9% identity in 292 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3594 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4J2" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J2" /protein_id="SIU02221.1" /translation="MTPPEERQMLRETVASLVAKHAGPAAVRAAMASDRGYDESLWRL LCEQVGAAALVIPEELGGAGGELADAAIVVQELGRALVPSPLLGTTLAELALLAAAKP DAQALTELAQGSAIGALVLDPDYVVNGDIADIVVAATSGQLTRWTRFSAQPVATMDPT RRLARLQSEETEPLCPDPGIADTAAILLAAEQIGAAERCLQLTVEYAKSRVQFGRPIG SFQALKHRMADLYVTIAAARAVVADACHAPTPTNAATARLAASEALSTAAAEGIQLHG GIAITWEHDMHLYFKRAHGSAQLLESPREVLRRLESEVWESP" CDS 3953828..3954994 /codon_start=1 /transl_table=11 /gene="aspB" /locus_tag="BQ2027_MB3595" /product="POSSIBLE ASPARTATE AMINOTRANSFERASE ASPB (TRANSAMINASE A) (ASPAT) (GLUTAMIC--OXALOACETIC TRANSAMINASE) (GLUTAMIC--ASPARTIC TRANSAMINASE)" /note="Mb3595, aspB, len: 388 aa. Equivalent to Rv3565, len: 388 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 388 aa overlap). Possible aspB, aspartate aminotransferase (EC 2.6.1.1), similar to many e.g. Q9A5J2|CC2455 AMINOTRANSFERASE CLASS I from Caulobacter crescentus (381 aa), FASTA scores: opt: 1112, E(): 1e-61, (45.85% identity in 384 aa overlap); Q9HV76|PA4722 PROBABLE AMINOTRANSFERASE from Pseudomonas aeruginosa (390 aa), FASTA scores: opt: 863, E(): 3.1e-46, (37.2% identity in 390 aa overlap); Q9RWP3|DR0623 ASPARTATE AMINOTRANSFERASE from Deinococcus radiodurans (388 aa), FASTA scores: opt: 713, E(): 6.3e-37, (35.5% identity in 383 aa overlap); Q9HQK2|ASPC2|VNG1121G ASPARTATE AMINOTRANSFERASE from Halobacterium sp. strain NRC-1 (391 aa), FASTA scores: opt: 710, E(): 9.8e-37, (34.45% identity in 380 aa overlap); O33822|AAT_THEAQ|ASPC ASPARTATE AMINOTRANSFERASE from Thermus aquaticus (383 aa), FASTA scores: opt: 695, E(): 8.2e-36, (35.1% identity in 376 aa overlap); etc. Contains PS00105 Aminotransferases class-I pyridoxal-phosphate attachment site. BELONGS TO CLASS-I OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE (BY SIMILARITY). Protein product from Mb3595 detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4K8" /db_xref="InterPro:IPR004838" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02222.1" /translation="MTDRVALRAGVPPFYVMDVWLAAAERQRTHGDLVNLSAGQPSAG APEPVRAAAAAALHLNQLGYSVALGIPELRDAIAADYQRRHGITVEPDAVVITTGSSG GFLLAFLACFDAGDRVAMASPGYPCYRNILSALGCEVVEIPCGPQTRFQPTAQMLAEI DPPLRGVVVASPANPTGTVIPPEELAAIASWCDASDVRLISDEVYHGLVYQGAPQTSC AWQTSRNAVVVNSFSKYYAMTGWRLGWLLVPTVLRRAVDCLTGNFTICPPVLSQIAAV SAFTPEATAEADGNLASYAINRSLLLDGLRRIGIDRLAPTDGAFYVYADVSDFTSDSL AFCSKLLADTGVAIAPGIDFDTARGGSFVRISFAGPSGDIEEALRRIGSWLPSQ" CDS complement(3954959..3955810) /codon_start=1 /transl_table=11 /gene="nat" /locus_tag="BQ2027_MB3596C" /product="ARYLAMINE N-ACETYLTRANSFERASE NAT (ARYLAMINE ACETYLASE)" /note="Mb3596c, nat, len: 283 aa. Equivalent to Rv3566c, len: 283 aa, from Mycobacterium tuberculosis H37Rv, (100.0% identity in 283 aa overlap). nat (alternate gene name: nhoA), arylamine N-acetyltransferase (EC 2.3.1.5) (see citation below), highly similar to O86309|NAT_MYCSM ARYLAMINE N-ACETYLTRANSFERASE from Mycobacterium smegmatis (see citation below) (275 aa), FASTA scores: opt: 1114, E(): 3e-66, (60.95% identity in 274 aa overlap). Also highly similar to others e.g. Q98D42|BAB51429|MLR4870 from Rhizobium loti (Mesorhizobium loti) (278 aa), FASTA scores: opt: 697, E(): 1.1e-38, (44.1% identity in 272 aa overlap); P77567|NHOA_ECOLI|B1463 from Escherichia coli strain K12 (281 aa), FASTA scores: opt: 537, E(): 4.4e-28, (38.85% identity in 273 aa overlap); Q00267|NHOA_SALTY from Salmonella typhimurium (281 aa), FASTA scores: opt: 507, E(): 4.3e-26, (34.8% identity in 273 aa overlap); etc. BELONGS TO THE ARYLAMINE N-ACETYLTRANSFERASE FAMILY. Note that previously known as nhoA (332 aa) and that nucleotide 4007874 has been changed since first submission (G deleted). Protein product from Mb3596c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3596c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5L9" /db_xref="InterPro:IPR001447" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/Swiss-Prot:P0A5L9" /protein_id="SIU02223.1" /translation="MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPL LGVPVDDLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLAPD APLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTTHEPYRLEDRV DGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHPASKFVTGLTAAVITDDAR WNLSGRDLAVHRAGGTEKIRLADAAAVVDTLSERFGINVADIGERGALETRIDELLAR QPGADAP" CDS complement(3955795..3956061) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3597C" /product="HYPOTHETICAL PROTEIN" /note="Mb3597c, -, len: 88 aa. Equivalent to Rv3566A, len: 88 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 88 aa overlap). Hypothetical unknown protein. Mb3597c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K0" /protein_id="SIU02224.1" /translation="MSGADPPTRRAFGQMARAATGWVSVSGQFAVAADTCRCEGTLFA VDPETHVANHNRCDIVGRLRDERPNTLRSVRRGDEVRMATWHWI" CDS complement(3956347..3956910) /codon_start=1 /transl_table=11 /gene="hsab" /locus_tag="BQ2027_MB3598C" /product="possible oxidoreductase. possible 3-hydroxy-9,10-seconandrost-1,3,5(10)-triene-9,17-dione hydroxylase." /note="Mb3598c, -, len: 187 aa. Equivalent to Rv3567c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 187 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. O69360 ORF61 PROTEIN from Rhodococcus erythropolis (194 aa) FASTA scores: opt: 974, E(): 3e-59, (77.05% identity in 183 aa overlap); Q9JN75|MMYF PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (174 aa), FASTA scores: opt: 451, E(): 1e-23, (43.65% identity in 158 aa overlap); P54990|NTAB_CHEHE|NMOB NITRILOTRIACETATE MONOOXYGENASE COMPONENT B (EC 1.14.13.-) from Chelatobacter heintzii (322 aa), FASTA scores: opt: 409, E(): 1.3e-20, (38.3% identity in 167 aa overlap)Chelatobacter heintzii; AAK62356 PUTATIVE NADH:FMN OXIDOREDUCTASE from Burkholderia sp. DBT1 (177 aa), FASTA scores: opt: 360, E(): 1.6e-17, (36.15% identity in 155 aa overlap). Mb3598c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y661" /db_xref="InterPro:IPR002563" /db_xref="InterPro:IPR012349" /db_xref="UniProtKB/TrEMBL:A0A1R3Y661" /protein_id="SIU02225.1" /translation="MSAQIDPRTFRSVLGQFCTGITVITTVHDDVPVGFACQSFAALS LEPPLVLFCPTKVSRSWQAIEASGRFCVNVLTEKQKDVSARFGSKEPDKFAGIDWRPS ELGSPIIEGSLAYIDCTVASVHDGGDHFVVFGAVESLSEVPAVKPRPLLFYRGDYTGI EPEKTTPAHWRDDLEAFLTTTTQDTWL" CDS complement(3956925..3957827) /codon_start=1 /transl_table=11 /gene="hsac" /locus_tag="BQ2027_MB3599C" /product="3,4-dhsa dioxygenase" /note="Mb3599c, bphC, len: 300 aa. Equivalent to Rv3568c, len: 300 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 300 aa overlap). Probable bphC, 2,3-dihydroxybiphenyl 1,2-dioxygenase (EC 1.13.11.39), highly similar to other e.g. Q9KWQ5|BPHC5 from Rhodococcus sp. RHA1 (300 aa), FASTA scores: opt: 1715, E(): 3.8e-103, (82.15% identity in 297 aa overlap); O50479|EDOB from Rhodococcus rhodochrous (300 aa) FASTA scores: opt: 1714, E(): 4.4e-103, (82.5% identity in 297 aa overlap); O69359|BPHC6 from Rhodococcus erythropolis (300 aa), FASTA scores: opt: 1647, E(): 9.1e-99, (78.25% identity in 299 aa overlap); Q9RBT2|BPHC1 from Pseudomonas sp. SY5 (301 aa) Pseudomonas sp. SY5 (298 aa) FASTA scores: opt: 767, E(): 3.9e-42, (42.8% identity in 299 aa overlap); P47228|BPHC_BURCE from Burkholderia cepacia (Pseudomonas cepacia) (297 aa), FASTA scores: opt: 670, E(): 6.8e-36, (40.55% identity in 296 aa overlap); etc. Contains PS00082 Extradiol ring-cleavage dioxygenases signature. BELONGS TO THE EXTRADIOL RING-CLEAVAGE DIOXYGENASE FAMILY. Mb3599c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y552" /db_xref="InterPro:IPR000486" /db_xref="InterPro:IPR004360" /db_xref="InterPro:IPR029068" /db_xref="InterPro:IPR037523" /db_xref="UniProtKB/TrEMBL:A0A1R3Y552" /protein_id="SIU02226.1" /translation="MSIRSLGYLRIEATDMAAWREYGLKVLGMVEGKGAPEGALYLRM DDFPARLVVVPGEHDRLLEAGWECANAEGLQEIRNRLDLEGTPYKEATAAELADRRVD EMIRFADPSGNCLEVFHGTALEHRRVVSPYGHRFVTGEQGMGHVVLSTRDDAEALHFY RDVLGFRLRDSMRLPPRMVGRPADGPPAWLRFFGCNPRHHSLAFLPMPTSSGIVHLMV EVEQADDVGLCLDRALRRKVPMSATLGRHVNDLMLSFYMKTPGGFDIEFGCEGRQVDD RDWIARESTAVSLWGHDFTVGARG" CDS complement(3957824..3958699) /codon_start=1 /transl_table=11 /gene="hsad" /locus_tag="BQ2027_MB3600C" /product="4,9-dhsa hydrolase" /note="Mb3600c, bphD, len: 291 aa. Equivalent to Rv3569c, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Probable bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase (EC 3.7.1.-), highly similar to others e.g. Q9KWQ6|BPHD2 from Rhodococcus sp. RHA1 (292 aa), FASTA scores: opt: 1468, E(): 1.3e-85, (75.5% identity in 294 aa overlap); Q52036 from Pseudomonas putida (286 aa), FASTA scores: opt: 785, E(): 1.9e-42, (45.1% identity in 295 aa overlap); Q52011|BPHD from Pseudomonas pseudoalcaligenes (286 aa), FASTA scores: opt: 774, E(): 9.3e-42, (44.05% identity in 295 aa overlap); P47229|BPHD_BURCE from Burkholderia cepacia (Pseudomonas cepacia) (286 aa) FASTA scores: opt: 772, E(): 1.2e-41, (44.5% identity in 295 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. Protein product from Mb3600c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3600c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4I4" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I4" /protein_id="SIU02227.1" /translation="MTATEELTFESTSRFAEVDVDGPLKLHYHEAGVGNDQTVVLLHG GGPGAASWTNFSRNIAVLARHFHVLAVDQPGYGHSDKRAEHGQFNRYAAMALKGLFDQ LGLGRVPLVGNSLGGGTAVRFALDYPARAGRLVLMGPGGLSINLFAPDPTEGVKRLSK FSVAPTRENLEAFLRVMVYDKNLITPELVDQRFALASTPESLTATRAMGKSFAGADFE AGMMWREVYRLRQPVLLIWGREDRVNPLDGALVALKTIPRAQLHVFGQCGHWVQVEKF DEFNKLTIEFLGGGR" CDS complement(3958714..3959898) /codon_start=1 /transl_table=11 /gene="hsaa" /locus_tag="BQ2027_MB3601C" /product="possible oxidoreductase. possible 3-hydroxy-9,10-seconandrost-1,3,5(10)-triene-9,17-dione hydroxylase." /note="Mb3601c, -, len: 394 aa. Equivalent to Rv3570c, len: 394 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 394 aa overlap). Possible oxidoreductase (EC 1.-.-.-), most similar to hydroxylases and oxygenases (and also some similarity to acyl-coa dehydrogenases) e.g. O69349 HYDROXYLASE from Rhodococcus erythropolis (393 aa), FASTA scores: opt: 958, E(): 1.1e-53, (39.95% identity in 383 aa overlap); P26698|PIGM_RHOSO PIGMENT PROTEIN from Rhodococcus sp. strain ATCC 21145 (387 aa), FASTA scores: opt: 665, E(): 5.4e-35, (32.2% identity in 382 aa overlap); Q9ZGA9|LANZ5 OXYGENASE HOMOLOG from Streptomyces cyanogenus (397 aa) FASTA scores: opt: 588, E(): 4.5e-30, (30.55% identity in 386 aa overlap); Q9F0J3|NCNH HYDROXYLASE from Streptomyces arenae (405 aa), FASTA scores: opt: 580, E(): 1.5e-29, (31.25% identity in 336 aa overlap); O69789|BPFA INDOLE DIOXYGENASE from Rhodococcus opacus (399 aa), FASTA scores: opt: 558, E(): 3.7e-28, (31.8% identity in 387 aa overlap); etc. Protein product from Mb3601c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3601c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4N0" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013107" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4N0" /protein_id="SIU02228.1" /translation="MTSIQQRDAQSVLAAIDDLLPEIRDRAQATEDLRRLPDETVKAL DDVGFFTLLQPQQWGGLQCDPALFFEATRRLASVCGSTGWVSSIVGVHNWHLALFDQR AQEEVWGEDPSTRISSSYAPMGAGVVVDGGYLVNGSWNWSSGCDHASWTFVGGPVIKD GRPVDFGSFLIPRSEYEIKDVWYVVGLRGTGSNTLVVKDVFVPRHRFLSYKAMNDHTA GGLATNSAPVYKMPWGTMHPTTISAPIVGMAYGAYAAHVEHQGKRVRAAFAGEKAKDD PFAKVRIAEAASDIDAAWRQLIGNVSDEYALLAAGKEIPFELRARARRDQVRATGRSI ASIDRLFEASGATALSNEAPIQRFWRDAHAGRVHAANDPERAYVIFGNHEFGLPPGDT MV" CDS 3960045..3961121 /codon_start=1 /transl_table=11 /gene="kshb" /locus_tag="BQ2027_MB3602" /product="reductase component of 3-ketosteroid-9-alpha-hydroxylase kshb" /note="Mb3602, hmp, len: 358 aa. Equivalent to Rv3571, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 358 aa overlap). Possible hmp, oxidoreductase, hemoglobine-related protein (see citation below) (EC 1.-.-.-), similar to several e.g. Q44253|ATDA5 ANILINE DIOXYGENASE REDUCTASE COMPONENT from Acinetobacter sp (336 aa) FASTA scores: opt: 748, E(): 1.5e-38, (34.95% identity in 346 aa overlap); P95533|TDNB ELECTRON TRANSFER PROTEIN from Pseudomonas putida (337 aa), FASTA scores: opt: 723, E(): 5.2e-37, (36.35% identity in 341 aa overlap); AAK65059|SMA0752 POSSIBLE DIOXYGENASE REDUCTASE SUBUNIT from Rhizobium meliloti (Sinorhizobium meliloti) (353 aa) FASTA scores: opt: 495, E(): 4.9e-23, (31.9% identity in 345 aa overlap); P76081|PAAE_ECOLI|B1392 PROBABLE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE (356 aa), FASTA scores: opt: 364, E(): 5.1e-15, (34.45% identity in 357 aa overlap); Q9L131|HMPA FLAVOHEMOPROTEIN from Streptomyces coelicolor (398 aa), FASTA scores: opt: 352, E(): 3e-14, (32.8% identity in 247 aa overlap); etc. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature. Note that it has been shown hmp transcription increased at early stationary phase and is lower at late stationary phase and during exponential growth. Protein product from Mb3602 detected using SWATH mass spectrometry. Mb3602 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4P1" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001433" /db_xref="InterPro:IPR001709" /db_xref="InterPro:IPR006058" /db_xref="InterPro:IPR008333" /db_xref="InterPro:IPR012675" /db_xref="InterPro:IPR017927" /db_xref="InterPro:IPR017938" /db_xref="InterPro:IPR036010" /db_xref="InterPro:IPR039261" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02229.1" /translation="MTEAIGDEPLGDHVLELQIAEVVDETDEARSLVFAVPDGSDDPE IPPRRLRYAPGQFLTLRVPSERTGSVARCYSLCSSPYTDDALAVTVKRTADGYASNWL CDHAQVGMRIHVLAPSGNFVPTTLDADFLLLAAGSGITPIMSICKSALAEGGGQVTLL YANRDDRSVIFGDALRELAAKYPDRLTVLHWLESLQGLPSASALAKLVAPYTDRPVFI CGPGPFMQAARDALAALKVPAQQVHIEVFKSLESDPFAAVKVDDSGDEAPATAVVELD GQTHTVSWPRTAKLLDVLLAAGLDAPFSCREGHCGACACTLRAGKVNMGVNDVLEQQD LDEGLILACQSRPESDSVEVTYDE" CDS 3961139..3961669 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3603" /product="unknown protein" /note="Mb3603, -, len: 176 aa. Equivalent to Rv3572, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Hypothetical unknown protein. Protein product from Mb3603 detected using SWATH mass spectrometry. Mb3603 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J7" /protein_id="SIU02230.1" /translation="MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLET HTFLDCHPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQF ERVMKAGGSDVITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYP GVNLTSPSTCAQANFS" CDS complement(3961705..3963840) /codon_start=1 /transl_table=11 /gene="fadE34" /locus_tag="BQ2027_MB3604C" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE34" /note="Mb3604c, fadE34, len: 711 aa. Equivalent to Rv3573c, len: 711 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 711 aa overlap). Probable fadE34, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to others, especially in C-terminal half, e.g. Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa) FASTA scores: opt: 780, E(): 2.8e-39, (44.1% identity in 347 aa overlap); Q9A6N8|CC2049 from Caulobacter crescentus (401 aa), FASTA scores: opt: 705, E(): 8.7e-35, (41.5% identity in 342 aa overlap); Q9EX72|MLHC from Rhodococcus erythropolis (324 aa), FASTA scores: opt: 673, E(): 6.1e-33, (42.05% identity in 283 aa overlap); P41367|ACDM_PIG|ACADM from Sus scrofa (Pig)(421 aa) FASTA scores: opt: 325, E(): 4.9e- 13, (28.5% identity in 368 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. P95097|FADE22|Rv3061c|MTCY22D7.20 (721 aa), FASTA scores: opt: 1635, E(): 2.7e-90, (42.65% identity in 729 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3604c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y4J8" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR013786" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR037069" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4J8" /protein_id="SIU02231.1" /translation="MVATVTDEQSAARELVRGWARTAASGAAATAAVRDMEYGFEEGN ADAWRPVFAGLAGLGLFGVAVPEDCGGAGGSIEDLCAMVDEAARALVPGPVATTAVAT LVVSDPKLRSALASGERFAGVAIDGGVQVDPKTSTASGTVGRVLGGAPGGVVLLPADG NWLLVDTACDEVVVEPLRATDFSLPLARMVLTSAPVTVLEVSGERVEDLAATVLAAEA AGVARWTLDTAVAYAKVREQFGKPIGSFQAVKHLCAQMLCRAEQADVAAADAARAAAD SDGTQLSIAAAVAASIGIDAAKANAKDCIQVLGGIGCTWEHDAHLYLRRAHGIGGFLG GSGRWLRRVTALTQAGVRRRLGVDLAEVAGLRPEIAAAVAEVAALPEEKRQVALADTG LLAPHWPAPYGRGASPAEQLLIDQELAAAKVERPDLVIGWWAAPTILEHGTPEQIERF VPATMRGEFLWCQLFSEPGAGSDLASLRTKAVRADGGWLLTGQKVWTSAAHKARWGVC LARTDPDAPKHKGITYFLVDMTTPGIEIRPLREITGDSLFNEVFLDNVFVPDEMVVGA VNDGWRLARTTLANERVAMATGTALGNPMEELLKVLGDMELDVAQQDRLGRLILLAQA GALLDRRIAELAVGGQDPGAQSSVRKLIGVRYRQALAEYLMEVSDGGGLVENRAVYDF LNTRCLTIAGGTEQILLTVAAERLLGLPR" CDS 3964112..3964711 /codon_start=1 /transl_table=11 /gene="kstr" /locus_tag="BQ2027_MB3605" /product="transcriptional regulatory protein kstr (probably tetr-family)" /note="Mb3605, -, len: 199 aa. Equivalent to Rv3574, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Probable transcriptional regulator tetR family, similar to others e.g. Q9KXK1|SCC53.10 from Streptomyces coelicolor (250 aa) FASTA scores: opt: 492, E(): 4.8e-25, (44.8% identity in 183 aa overlap); Q9RA03|KSTR from Rhodococcus erythropolis (208 aa), FASTA scores: opt: 294, E(): 3.1e-12, (28.9% identity in 187 aa overlap); BAB54261|MLR7895 from Rhizobium loti (Mesorhizobium loti) (193 aa), FASTA scores: opt: 166, E(): 0.00062, (32.05% identity in 78 aa overlap); P17446|BETI_ECOLI|B0313 from Escherichia coli strain K12 (195 aa), FASTA scores: opt: 142, E(): 0.0034, (25. 6% identity in 168 aa overlap); etc. Equivalent to AAK48038 from Mycobacterium tuberculosis strain CDC1551 (243 aa) but shorter 44 aa. Contains possible helix-turn-helix motif from aa 37-58 (+3.70 SD). POSSIBLY BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3605 detected using SWATH mass spectrometry. Mb3605 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4L5" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR041642" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L5" /protein_id="SIU02232.1" /translation="MAVLAESELGSEAQRERRKRILDATMAIASKGGYEAVQMRAVAD RADVAVGTLYRYFPSKVHLLVSALGREFSRIDAKTDRSAVAGATPFQRLNFMVGKLNR AMQRNPLLTEAMTRAYVFADASAASEVDQVEKLIDSMFARAMANGEPTEDQYHIARVI SDVWLSNLLAWLTRRASATDVSKRLDLAVRLLIGDQDSA" CDS complement(3964717..3965796) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3606C" /product="TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY LACI-FAMILY)" /note="Mb3606c, -, len: 359 aa. Equivalent to Rv3575c, len: 359 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 359 aa overlap). Probable transcriptional regulator belonging to lacI family, similar to others e.g. BAB53947|MLL8376 from Rhizobium loti (Mesorhizobium loti) (358 aa), FASTA scores: opt: 707, E(): 2.6e-35, (35.5% identity in 355 aa overlap); Q9RRI9|DR2501 from Deinococcus radiodurans (359 aa) FASTA scores: opt: 544, E(): 1.6e-25, (40.35% identity in 347 aa overlap); Q9RL31|SCF51A.34 from Streptomyces coelicolor (347 aa), FASTA scores: opt: 307, E(): 2.9e-11, (30.0% identity in 330 aa overlap); O87590|CELR_THEFU from Thermomonospora fusca (340 aa), FASTA scores: opt: 280, E(): 1.2e-09, (32.3% identity in 353 aa overlap); P21867|RAFR_ECOLI from Escherichia coli (335 aa) FASTA scores: opt: 241, E(): 2.6e-07, (27.15% identity in 269 aa overlap); etc. Equivalent to AAK48039 from Mycobacterium tuberculosis strain CDC1551 (404 aa) but shorter 45 aa. Contains possible helix-turn-helix motif, at aa 9-30 (+5.86 SD). COULD BELONG TO THE LACI FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3606c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3606c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4I2" /db_xref="InterPro:IPR000843" /db_xref="InterPro:IPR010982" /db_xref="InterPro:IPR028082" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4I2" /protein_id="SIU02233.1" /translation="MSPTPRRRATLASLAAELKVSRTTVSNAFNRPDQLSADLRERVL ATAKRLGYAGPDPVARSLRTRKAGAVGLVMAEPLTYFFSDPAARDFVAGVAQSCEELG QGLQLVSVGSSRSLADGTAAVLGAGVDGFVVYSVGDDDPYLQVVLQRRLPVVVVDQPK DLSGVSRVGIDDRAAMRELAGYVLGLGHRELGLLTMRLGRDRRQDLVDAERLRSPTFD VQRERIVGVWEAMTAAGVDPDSLTVVESYEHLPTSGGTAAKVALQANPRLTALMCTAD ILALSAMDYLRAHGIYVPGQMTVTGFDGVPEALSRGLTTVAQPSLHKGHRAGELLLKP PRSGLPVIEVLDTELVRGRTAGPPA" CDS 3965986..3966699 /codon_start=1 /transl_table=11 /gene="lppH" /locus_tag="BQ2027_MB3607" /product="POSSIBLE CONSERVED LIPOPROTEIN LPPH" /note="Mb3607, lppH, len: 237 aa. Equivalent to Rv3576, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 237 aa overlap). Possible lppH, conserved lipoprotein, similar in part with proteins from Mycobacterium tuberculosis; C-terminus of Q11053|PKNH_MYCTU|PKNH|Rv1266c|MT1304|MTCY50.16 PROBABLE SERINE/THREONINE-PROTEIN KINASE (EC 2.7.1.-) (626 aa) FASTA scores: opt: 396, E(): 6.5e-19, (36.0% identity in 200 aa overlap); and with P71740|LPPR|Rv2403c|MTCY253.17 PROBABLE LIPOPROTEIN PROTEIN (251 aa), FASTA scores: opt: 134, E(): 0.087, (22.7% identity in 207 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Note that previously known as pknM. Protein product from Mb3607 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3607 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L0" /protein_id="SIU02234.1" /translation="MGKQLAALAALVGACMLAAGCTNVVDGTAVAADKSGPLHQDPIP VSALEGLLLDLSQINAALGATSMKVWFNAKAMWDWSKSVADKNCLAIDGPAQEKVYAG TGWTAMRGQRLDDSINDSKKRDHYAIQAVVGFPTAHDAEEFYSSSVQSWSSCSNRRFV EVTPGQDDAAWTVADVVNDNGMLSSSQVQEGGDGWTCQRALTARNNVTIDIVTCAYSQ PDLVAIGIANQIAAKVAKQ" CDS 3966890..3967756 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3608" /product="Zn-dependent hydrolases of the beta-lactamase fold" /note="Mb3608, -, len: 288 aa. Equivalent to Rv3577, len: 288 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 288 aa overlap), (other start sites possible upstream; equivalent to AAK48041 from Mycobacterium tuberculosis strain CDC1551 (379 aa) but shorter 91 aa). Hypothetical protein, showing some similarity to Q9RI88|SCJ11.16c HYPOTHETICAL 37.9 KDA PROTEIN from Streptomyces coelicolor (349 aa) FASTA scores: opt: 285, E(): 1.5e-10, (27.45% identity in 266 aa overlap). Mb3608 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3Y662" /protein_id="SIU02235.1" /translation="MPTARSDAPLSVTWMGVATLLVDDGSSALMTDGYFSRPGLARVA AGKVSPSAERVDGCLARANVSRLTAVIPVHTHIDHAMDSALVADRTGAQLVGGESAAN VGRGYGLPEESLVVAVPGEPIQLGAFDVTLVESHHCPPDRFPGVISAPLTPPVKASAY RCGEAWSTLVHHRPSGRRLLIQDSAGFVSGALAGYRADAAYLSVGQLGLQPPSYLLEY WTETVRTVGVRRVILIHWDDFFRPLSKPLRALPYAADDLDLSIRILDELAAQDGVALQ MPTVWRREDPWM" CDS 3967770..3969011 /codon_start=1 /transl_table=11 /gene="arsB2" /locus_tag="BQ2027_MB3609" /product="POSSIBLE ARSENICAL PUMP INTEGRAL MEMBRANE PROTEIN ARSB2" /note="Mb3609, arsB2, len: 413 aa. Equivalent to Rv3578, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 413 aa overlap). Possible arsB2, arsenical pump integral membrane protein, similar to many e.g. Q9I1J6|ARSB|PA2278 from Pseudomonas aeruginosa (427 aa), FASTA scores: opt: 375, E(): 3.1e-15, (32.15% identity in 429 aa overlap); Q9K8K7|ARSB|BH2999 from Bacillus halodurans (436 aa), FASTA scores: opt: 360, E(): 2.5e-14, (28.7% identity in 432 aa overlap); P52146|ARB2_ECOLI from Escherichia coli (plasmid R46) (429 aa), FASTA scores: opt: 345, E(): 2e-13, (29.8% identity in 426 aa overlap); etc. Also highly similar to Q9KYM0|SC9H11.21c PROBABLE MEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 730, E(): 1.7e-36, (53.95% identity in 443 aa overlap). SEEMS TO BELONG TO THE ARS FAMILY. Mb3609 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y561" /db_xref="InterPro:IPR000802" /db_xref="UniProtKB/TrEMBL:A0A1R3Y561" /protein_id="SIU02236.1" /translation="MTLAVALILLAVVLGFAVARPRGWPEAAAAVPAAVILLAIGAIS PQQAMAQVSGLARVVAFLGAVLVLAKLCDDEGLFEAAGAAMARASAESHRLLRQVFAV SAAITAALCLDATVVLLTPVVLATVRRLRTPVRPYAYATAHLANAASLLLPVSNLTNL LAYHGAGISFTKFTLLMALPWLSAVAAVYVVFRWFFARDLRVVPDRQQLKPAPRLPMF VLVVVALTLGGFAVAESVGLAPTWAALAGAAVLALRSLRRGHTSVLRIARAVNVSFLV FVLALGVVVHAVMLNGMAARMSAVLPTGSGLPALLGIAALASVLANVVNNLPATLVLV PLVAAGGPAAVLAVLLGVNIGPNLTYAGSLSNLLWRGVLRRHNVDASVGEYTRLGLCT VPAALAMAVLALWASAQVLGI" CDS complement(3969053..3970021) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3610C" /product="possible trna/rrna methyltransferase" /note="Mb3610c, -, len: 322 aa. Equivalent to Rv3579c, len: 322 aa, from Mycobacterium tuberculosis strain H37Rv, (99% identity in 322 aa overlap). Putative tRNA/rRNA methyltransferase (EC 2.1.1.-), equivalent, but longer, to Q9CCW4|ML0324 PUTATIVE METHYLTRANSFERASE from Mycobacterium leprae (278 aa), REMARK-M.bovis-M.tuberculosis: In the original M.bovis genome, a single base insertion (*-g) led to a longer product with a different COOH part compared to its homolog in Mycobacterium strain H37Rv (402 aa versus 322 aa). This was corrected in the new M.bovis genome update to a 322aa product Protein product from Mb3610c detected using SWATH mass spectrometry. Mb3610c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TW55" /db_xref="InterPro:IPR001537" /db_xref="InterPro:IPR004441" /db_xref="InterPro:IPR013123" /db_xref="InterPro:IPR029026" /db_xref="InterPro:IPR029028" /db_xref="InterPro:IPR029064" /db_xref="UniProtKB/Swiss-Prot:Q7TW55" /protein_id="SIU02237.1" /translation="MPGNSRRRGAVRKSGTKKGAGVGSGGQRRRGLEGRGPTPPAHLR PHHPAAKRARAQPRRPVKRADETETVLGRNPVLECLRAGVPATALYVALGTEADERLT ECVARAADSGIAIVELLRADLDRMTANHLHQGIALQVPPYNYAHPDDLLAAALDQPPA LLVALDNLSDPRNLGAIMRSVAAFGGHGVLIPQRRSASVTAVAWRTSAGAAARIPVAR ATNLTRTLKGWADRGVRVIGLDAGGGTALDDVDGTDSLVVVVGSEGKGLSRLVRQNCD EVVSIPMAAQAESLNASVAAGVVLAAIARQRRRPREPREQTQNRMI" CDS complement(3970022..3971431) /codon_start=1 /transl_table=11 /gene="cysS1" /locus_tag="BQ2027_MB3611C" /product="CYSTEINYL-TRNA SYNTHETASE 1 CYSS1 (CYSTEINE--TRNA LIGASE 1) (CYSRS 1) (CYSTEINE TRANSLASE)" /note="Mb3611c, cysS1, len: 469 aa. Equivalent to Rv3580c, len: 469 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 469 aa overlap). Probable cysS1, cysteinyl-tRNA synthetase (EC 6.1.1.16), equivalent to P57990|SYC1_MYCLE|CYSS1|CYSS|ML0323 CYSTEINYL-TRNA SYNTHETASE 1 from Mycobacterium leprae (473 aa) FASTA scores: opt: 2825, E(): 3.4e-172, (86.5% identity in 467 aa overlap). Also similar to many e.g. Q9L0Q6|SCD8A.08 from Streptomyces coelicolor (613 aa), FASTA scores: opt: 1834, E(): 4.7e-109, (57.5% identity in 461 aa overlap); Q9I2U7|CYSS|PA1795 from Pseudomonas aeruginosa (460 aa) FASTA scores: opt: 1197, E(): 1.2e-68, (41.65% identity in 468 aa overlap); P21888|SYC_ECOLI P21888|CYSS|B0526 from Escherichia coli strain K12 (461 aa), FASTA scores: opt: 1189, E(): 4e-68, (43.0% identity in 463 aa overlap); etc. BELONGS TO CLASS-I AMINOACYL-TRNA SYNTHETASE FAMILY. STRONGLY SIMILAR TO METHIONYL-TRNA SYNTHETASE. Protein product from Mb3611c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3611c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A635" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR015273" /db_xref="InterPro:IPR015803" /db_xref="InterPro:IPR024909" /db_xref="InterPro:IPR032678" /db_xref="UniProtKB/Swiss-Prot:P0A635" /protein_id="SIU02238.1" /translation="MTDRARLRLHDTAAGVVRDFVPLRPGHVSIYLCGATVQGLPHIG HVRSGVAFDILRRWLLARGYDVAFIRNVTDIEDKILAKAAAAGRPWWEWAATHERAFT AAYDALDVLPPSAEPRATGHITQMIEMIERLIQAGHAYTGGGDVYFDVLSYPEYGQLS GHKIDDVHQGEGVAAGKRDQRDFTLWKGEKPGEPSWPTPWGRGRPGWHLECSAMARSY LGPEFDIHCGGMDLVFPHHENEIAQSRAAGDGFARYWLHNGWVTMGGEKMSKSLGNVL SMPAMLQRVRPAELRYYLGSAHYRSMLEFSETAMQDAVKAYVGLEDFLHRVRTRVGAV CPGDPTPRFAEALDDDLSVPIALAEIHHVRAEGNRALDAGDHDGALRSASAIRAMMGI LGCDPLDQRWESRDETSAALAAVDVLVQAELQNREKAREQRNWALADEIRGRLKRAGI EVTDTADGPQWSLLGGDTK" CDS complement(3971496..3971975) /codon_start=1 /transl_table=11 /gene="ispF" /locus_tag="BQ2027_MB3612C" /product="PROBABLE 2C-METHYL-D-ERYTHRITOL 2,4-CYCLODIPHOSPHATE SYNTHASE ISPF (MECPS)" /note="Mb3612c, ispF, len: 159 aa. Equivalent to Rv3581c, len: 159 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). Probable ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase (EC not defined), equivalent to Q9CCW5|ML0322 PUTATIVE 2-C-METHYL-D-ERYTHRITOL 2,4-CYCLODIPHOSPHATE SYNTHASE from Mycobacterium leprae (158 aa), FASTA scores: opt: 830, E(): 2.9e-47, (79.1% identity in 158 aa overlap). Also highly similar to others e.g. Q9L0Q7|ISPF_STRCO|SCD8A.07 from Streptomyces coelicolor (170 aa), FASTA scores: opt: 585, E(): 2.9e-31, (56.5% identity in 154 aa overlap); Q9PDT5|ISPF_XYLFA|XF1294 from Xylella fastidiosa (176 aa), FASTA scores: opt: 398, E(): 4.6e-19, (44.9% identity in 156 aa overlap); Q08113|ISDF_RHOCA|ISPDF from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (379 aa), FASTA scores: opt: 387, E(): 4.5e-18, (42.85% identity in 154 aa overlap) (only similar with C-terminal end of this bifunctional protein ISPD and ISPF); Q06756|ISPF_BACSU from Bacillus subtilis (158 aa), FASTA scores: opt: 367, E(): 4.5e-17, (41.2% identity in 153 aa overlap); etc. BELONGS TO THE ISPF FAMILY. Protein product from Mb3612c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3612c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65184" /db_xref="InterPro:IPR003526" /db_xref="InterPro:IPR020555" /db_xref="InterPro:IPR036571" /db_xref="UniProtKB/Swiss-Prot:P65184" /protein_id="SIU02239.1" /translation="MNQLPRVGLGTDVHPIEPGRPCWLVGLLFPSADGCAGHSDGDVA VHALCDAVLSAAGLGDIGEVFGVDDPRWQGVSGADMLRHVVVLITQHGYRVGNAVVQV IGNRPKIGWRRLEAQAVLSRLLNAPVSVSATTTDGLGLTGRGEGLAAIATALVVSLR" CDS complement(3971972..3972667) /codon_start=1 /transl_table=11 /gene="ispD" /locus_tag="BQ2027_MB3613C" /product="4-diphosphocytidyl-2c-methyl-d-erythritol synthase ispd (mep cytidylyltransferase) (mct)" /note="Mb3613c, ispD, len: 231 aa. Equivalent to Rv3582c, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 231 aa overlap). Probable ispD, 4-diphosphocytidyl-2C-methyl-D-erythritol synthase (EC 2.7.7.-), equivalent to Q9CCW6|ML0321 PUTATIVE 4-DIPHOSPHOCYTIDYL-2C-METHYL-D-ERYTHRITOL SYNTHASE from Mycobacterium leprae (241 aa), FASTA scores: opt: 694, E(): 1.7e-35, (66.95% identity in 236 aa overlap). Also highly similar to others e.g. Q9L0Q8|ISPD_STRCO|SCD8A.06 from Streptomyces coelicolor (270 aa), FASTA scores: opt: 537, E(): 7.5e-26, (43.4% identity in 242 aa overlap); P74323|ISPD_SYNY3|SLR0951 from Synechocystis sp. strain PCC 6803 (230 aa), FASTA scores: opt: 410, E(): 3.8e-18, (36.15% identity in 224 aa overlap); Q9KGF8|ISPD_BACHD|BH0107 from Bacillus halodurans (228 aa) FASTA scores: opt: 367, E(): 1.6e-15, (34.65% identity in 228 aa overlap); Q08113|ISDF_RHOCA|ISPDF from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (379 aa)FASTA scores: opt: 359, E(): 7.8e-15, (34.1% identity in 223 aa overlap) (only similar with N-terminus of this bifunctional protein ISPD and ISPF); Q46893|ISPD_ECOLI|B2747 from Escherichia coli strain K12 (235 aa), FASTA scores: opt: 336, E(): 1.3e-13, (33.65% identity in 223 aa overlap); etc. BELONGS TO THE ISPD FAMILY. Protein product from Mb3613c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3613c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TW54" /db_xref="InterPro:IPR001228" /db_xref="InterPro:IPR018294" /db_xref="InterPro:IPR029044" /db_xref="InterPro:IPR034683" /db_xref="UniProtKB/Swiss-Prot:Q7TW54" /protein_id="SIU02240.1" /translation="MVREAGEVVAIVPAAGSGERLAVGVPKAFYQLDGQTLIERAVDG LLDSGVVDTVVVAVPADRTDEARQILGHRAMIVAGGSNRTDTVNLALAVLSGTAEPEF VLVHDAARALTPPALVARVVEALRDGYAAVVPVLPLSDTIKAVDANGVVLGTPERAGL RAVQTPQGFTTDLLLRSYQRGSLDLPAAEYTDDASLVEHIGGQVQVVDGDPLAFKITT KLDLLLAQAIVRG" CDS complement(3972684..3973172) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3614C" /product="possible transcription factor" /note="Mb3614c, -, len: 162 aa. Equivalent to Rv3583c, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Possible transcriptional factor, identical to Q9CCW7|ML0320 PUTATIVE TRANSCRIPTION FACTOR from Mycobacterium leprae (165 aa), FASTA scores: opt: 1004, E(): 6.1e-56, (97.55% identity in 162 aa overlap); and Q9ZBM8|MLCB1450.01c PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (94 aa), FASTA scores: opt: 600, E(): 6e-31, (97.85% identity in 94 aa overlap). Also highly similar to others e.g. Q9L0Q9|SCD8A.05 from Streptomyces coelicolor (160 aa), FASTA scores: opt: 878, E(): 4.3e-48, (85.0% identity in 160 aa overlap); Q9K600|BH3935 from Bacillus halodurans (153 aa) FASTA scores: opt: 383, E(): 3.1e-17, (36.4% identity in 151 aa overlap); Q9KD36|BH1383 from Bacillus halodurans (164 aa) FASTA scores: opt: 305, E(): 2.4e-12, (33.55% identity in 164 aa overlap); etc. Protein product from Mb3614c detected using shotgun mass spectrometry. Mb3614c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003711" /db_xref="InterPro:IPR036101" /db_xref="InterPro:IPR042215" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L1" /protein_id="SIU02241.1" /translation="MIFKVGDTVVYPHHGAALVEAIETRTIKGEQKEYLVLKVAQGDL TVRVPAENAEYVGVRDVVGQEGLDKVFQVLRAPHTEEPTNWSRRYKANLEKLASGDVN KVAEVVRDLWRRDQERGLSAGEKRMLAKARQILVGELALAESTDDAKAETILDEVLAA AS" CDS 3973457..3974005 /codon_start=1 /transl_table=11 /gene="lpqE" /locus_tag="BQ2027_MB3615" /product="POSSIBLE CONSERVED LIPOPROTEIN LPQE" /note="Mb3615, lpqE, len: 182 aa. Equivalent to Rv3584, len: 182 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 182 aa overlap). Possible lpqE, conserved lipoprotein, equivalent to Q9ZBM7|MLCB1450.02|LPQE|ML0319 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (183 aa), FASTA scores: opt: 722, E(): 6.2e-37, (63.45% identity in 175 aa overlap). Also similar in part to Q9KK69 EXPORTED PROTEIN 996A010 (FRAGMENT) from Mycobacterium avium (41 aa), FASTA scores: opt: 180, E(): 0.00012, (69.25% identity in 39 aa overlap); and Q9L0R0|SCD8A.04c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (241 aa), FASTA scores: opt: 127, E(): 0.86, (27.15% identity in 173 aa overlap). Equivalent to AAK48048 from Mycobacterium tuberculosis strain CDC1551 (238 aa) but shorter 56 aa. Contains probable N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3615 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3615 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65309" /db_xref="UniProtKB/Swiss-Prot:P65309" /protein_id="SIU02242.1" /translation="MNRCNIRLRLAGMTTWVASIALLAAALSGCGAGQISQTANQKPA VNGNRLTINNVLLRDIRIQAVQTSDFIQPGKAVDLVLVAVNQSPDVSDRLVGITSDIG SVTVAGDARLPASGMLFVGTPDGQIVAPGPLPSNQAAKATVNLTKPIANGLTYNFTFK FEKAGQGSVMVPISAGLATPHE" CDS 3974071..3975513 /codon_start=1 /transl_table=11 /gene="radA" /locus_tag="BQ2027_MB3616" /product="DNA REPAIR PROTEIN RADA (DNA REPAIR PROTEIN SMS)" /note="Mb3616, radA, len: 480 aa. Equivalent to Rv3585, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 480 aa overlap). Probable radA, DNA repair protein, similar to many e.g. Q9X8L5|SCE94.02 from Streptomyces coelicolor (469 aa), FASTA scores: opt: 1607, E(): 3.1e-84, (56.15% identity in 454 aa overlap); Q9JV51|RADA|NMA0992 from Neisseria meningitidis (serogroup A) (459 aa), FASTA scores: opt: 1275, E(): 2.5e-65, (45.0% identity in 458 aa overlap); and Q9K040|RADA|NMB0782 from Neisseria meningitidis (serogroup B) (459 aa), FASTA scores: opt: 1269, E(): 5.4e-65, (44.5% identity in 456 aa overlap); P37572|RADA_BACSU|SMS from Bacillus subtilis (458 aa), FASTA scores: opt: 1204, E(): 2.7e-61, (39.55% identity in 455 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE RADA FAMILY. Protein product from Mb3616 detected using SWATH mass spectrometry. Mb3616 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65954" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004504" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020568" /db_xref="InterPro:IPR020588" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041166" /db_xref="UniProtKB/Swiss-Prot:P65954" /protein_id="SIU02243.1" /translation="MANARSQYRCSECRHVSAKWVGRCLECGRWGTVDEVAVLSAVGG TRRRSVAPASGAVPISAVDAHRTRPCPTGIDELDRVLGGGIVPGSVTLLAGDPGVGKS TLLLEVAHRWAQSGRRALYVSGEESAGQIRLRADRIGCGTEVEEIYLAAQSDVHTVLD QIETVQPALVIVDSVQTMSTSEADGVTGGVTQVRAVTAALTAAAKANEVALILVGHVT KDGAIAGPRSLEHLVDVVLHFEGDRNGALRMVRGVKNRFGAADEVGCFLLHDNGIDGI VDPSNLFLDQRPTPVAGTAITVTLDGKRPLVGEVQALLATPCGGSPRRAVSGIHQARA AMIAAVLEKHARLAIAVNDIYLSTVGGMRLTEPSADLAVAIALASAYANLPLPTTAVM IGEVGLAGDIRRVNGMARRLSEAARQGFTIALVPPSDDPVPPGMHALRASTIVAALQY MVDIADHRGTTLATPPSHSGTGHVPLGRGT" CDS 3975518..3976594 /codon_start=1 /transl_table=11 /gene="disA" /locus_tag="BQ2027_MB3617" /product="DNA integrity scanning protein DisA" /note="Mb3617, -, len: 358 aa. Equivalent to Rv3586, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 358 aa overlap). Conserved hypothetical protein, highly similar to Q9X8L6|SCE94.03 PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1338, E(): 5e-75, (59.95% identity in 347 aa overlap); P37573|YACK_BACSU HYPOTHETICAL 40.7 KDA PROTEIN from Bacillus subtilis (360 aa), FASTA scores: opt: 875, E(): 1.4e-46, (42.15% identity in 344 aa overlap); Q9KGG0|BH0105 HYPOTHETICAL PROTEIN from Bacillus halodurans (357 aa), FASTA scores: opt: 844, E(): 1.1e-44, (40.3% identity in 350 aa overlap); Q9WY43|TM0200 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (357 aa), FASTA scores: opt: 735, E(): 5.7e-38, (39.4% identity in 353 aa overlap). Also some similarity with other proteins. Contains probable coiled-coil from 144 to 179. Protein product from Mb3617 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3617 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TW52" /db_xref="InterPro:IPR003390" /db_xref="InterPro:IPR010994" /db_xref="InterPro:IPR018906" /db_xref="InterPro:IPR023763" /db_xref="InterPro:IPR036888" /db_xref="InterPro:IPR038331" /db_xref="InterPro:IPR041663" /db_xref="UniProtKB/Swiss-Prot:Q7TW52" /protein_id="SIU02244.1" /translation="MHAVTRPTLREAVARLAPGTGLRDGLERILRGRTGALIVLGHDE NVEAICDGGFSLDVRYAATRLRELCKMDGAVVLSTDGSRIVRANVQLVPDPSIPTDES GTRHRSAERAAIQTGYPVISVSHSMNIVTVYVRGERHVLTDSATILSRANQAIATLER YKTRLDEVSRQLSRAEIEDFVTLRDVMTVVQRLELVRRIGLVIDYDVVELGTDGRQLR LQLDELLGGNDTARELIVRDYHANPEPPSTGQINATLDELDALSDGDLLDFTALAKVF GYPTTTEAQDSALSPRGYRAMAGIPRLQFAHADLLVRAFGTLQGLLAASAGDLQSVDG IGAMWARHVRDGLSQLAESTISDQ" CDS complement(3976595..3977389) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3618C" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb3618c, -, len: 264 aa. Equivalent to Rv3587c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Probable conserved membrane protein, equivalent to Q9CBJ2|ML1918 HYPOTHETICAL MEMBRANE PROTEIN from Mycobacterium leprae (263 aa), FASTA scores: opt: 1438, E(): 2.4e-57, (77.55% identity in 267 aa overlap). Contains hydrophobic stretch in N-terminus; possible signal sequence. Protein product from Mb3618c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3618c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y682" /db_xref="UniProtKB/TrEMBL:A0A1R3Y682" /protein_id="SIU02245.1" /translation="MLDLEPRGPLPTEIYWRRRGLALGIAVVVVGIAVAIVIAFVDSS AGAKPVSADKPASAQSHPGSPAPQAPQPAGQTEGNAAAAPPQGQNPETPTPTAAVQPP PVLKEGDDCPDSTLAVKGLTNAPQYYVGDQPKFTMVVTNIGLVSCKRDVGAAVLAAYV YSLDNKRLWSNLDCAPSNETLVKTFSPGEQVTTAVTWTGMGSAPRCPLPRPAIGPGTY NLVVQLGNLRSLPVPFILNQPPPPPGPVPAPGPAQAPPPESPAQGG" CDS complement(3977498..3978121) /codon_start=1 /transl_table=11 /gene="canb" /locus_tag="BQ2027_MB3619C" /product="beta-carbonic anhydrase canb" /note="Mb3619c, -, len: 207 aa. Equivalent to Rv3588c, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 207 aa overlap). Probable carbonic anhydrase (EC 4.2.1.1), equivalent to Q9CBJ1|ML1919 PUTATIVE CARBONIC ANHYDRASE from Mycobacterium leprae (213 aa), FASTA scores: opt: 1160, E(): 3.1e-66, (84.55% identity in 207 aa overlap). Also similar to many e.g. Q9X903|SCH35.03 from Streptomyces coelicolor (207 aa), FASTA scores: opt: 689, E(): 1.6e-36, (53.85% identity in 195 aa overlap); Q9RS89|DR2238 from Deinococcus radiodurans (264 aa), FASTA scores: opt: 451, E(): 2e-21, (39.7% identity in 189 aa overlap); Q39589|BETA-CA1 from Chlamydomonas reinhardtii (267 aa) FASTA scores: opt: 419, E(): 2.1e-19, (36.55% identity in 197 aa overlap); etc. Contains PS00704 and PS00705 Prokaryotic-type carbonic anhydrases signature 1 and 2. BELONGS TO THE PLANT AND PROKARYOTIC CARBONIC ANHYDRASE FAMILY. Protein product from Mb3619c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3619c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y564" /db_xref="InterPro:IPR001765" /db_xref="InterPro:IPR015892" /db_xref="InterPro:IPR036874" /db_xref="UniProtKB/TrEMBL:A0A1R3Y564" /protein_id="SIU02246.1" /translation="MPNTNPVAAWKALKEGNERFVAGRPQHPSQSVDHRAGLAAGQKP TAVIFGCADSRVAAEIIFDQGLGDMFVVRTAGHVTDSAVLGSIEYAVTVLNVPLIVVL GHDSCGAVNAALAAINDGTLPGGYVRDVVERVAPSVLLGRRDGLSRVDEFEQRHVHET VAILMARSSAISERIAGGSLAIVGVTYQLDDGRAVLRDHIGNIGEEV" CDS 3978120..3979034 /codon_start=1 /transl_table=11 /gene="mutY" /locus_tag="BQ2027_MB3620" /product="probable adenine glycosylase muty" /note="Mb3620, mutY, len: 304 aa. Equivalent to Rv3589, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 304 aa overlap). Probable mutY, adenine glycosylase (EC 3.2.2.-), equivalent to Q9CBJ0|MUTY|ML1920 PROBABLE DNA GLYCOSYLASE from Mycobacterium leprae (297 aa), FASTA scores: opt: 1592, E(): 2.6e-94, (74.9% identity in 303 aa overlap). Also similar to many DNA glycosylases (generally adenine glycosylases) e.g. Q9S6T7|SCE94.06 from Streptomyces coelicolor (308 aa), FASTA scores: opt: 965, E(): 2.6e-54, (50.5% identity in 297 aa overlap); Q9S6G1|MUTY from Streptomyces antibioticus (307 aa), FASTA scores: opt: 901, E(): 3.1e-50, (48.5% identity in 303 aa overlap); Q9HPQ6|MUTY|VNG1520G from Halobacterium sp. strain NRC-1 (312 aa), FASTA scores: opt: 566, E(): 7.2e-29, (39.85% identity in 296 aa overlap); BAB53965|MLL7523 from Rhizobium loti (Mesorhizobium loti) (396 aa), FASTA scores: opt: 511, E(): 2.8e-25, (39.65% identity in 237 aa overlap); Q05869|MUTY_SALTY|MUTB from Salmonella typhimurium (350 aa), FASTA scores: opt: 421, E(): 3.8e-20, (35.2% identity in 227 aa overlap); etc. COULD BELONG TO THE NTH/MUTY FAMILY. Mb3620 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4K5" /db_xref="InterPro:IPR000445" /db_xref="InterPro:IPR003265" /db_xref="InterPro:IPR003651" /db_xref="InterPro:IPR011257" /db_xref="InterPro:IPR023170" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K5" /protein_id="SIU02247.1" /translation="MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQI LVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHE CATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAV HGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWC AWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQR DRALESLLADALVTRTVDGRFALPGEGF" CDS complement(3979031..3980785) /codon_start=1 /transl_table=11 /gene="PE_PGRS58" /locus_tag="BQ2027_MB3621C" /product="pe-pgrs family protein pe_pgrs58" /note="Mb3621c, PE_PGRS58, len: 584 aa. Equivalent to Rv3590c, len: 584 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 584 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to e.g. O53439|Rv1091|MTV017.44 (853 aa), FASTA scores: opt: 2005, E(): 1.4e-70, (54.95% identity in 646 aa overlap). Mb3621c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P4" /protein_id="SIU02248.1" /translation="MSFVIVAPEALMSVASEVAGIGSALNAANAAAAAPTTGVLAAAA DEVSAAMAALFGAHAQEYQRLSAQAAGFHAQFVQALNAGVNSYASAEAANASPLQAVE QQVLGLINGPAQTLLGRPLIGNGADGAPGTGQPGGPGGLLWGNGGNGGSGVAGVGGPG GSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGNGG AGGAAGAFGNGGVGGAGGAAVIGGLPGNGGAGGNAGLIGAGGDGGVGGVGAPGTNGMN PPPNQTSQAANGSPGANNGAGSGGAGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTG GNGGNGGDGGPGAPGGNGAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNG LALNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAG TGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLVGGDGGNGG AGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGTGGQGGAGGAGGAGGAAGAGTVVAG NPGDPGGFGAAGADGLPG" CDS complement(3980896..3981669) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3622C" /product="possible hydrolase" /note="Mb3622c, -, len: 257 aa. Equivalent to Rv3591c, len: 257 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 257 aa overlap). Possible hydrolase (EC 3.-.-.-), equivalent to Q9CBI9|ML1921 HYPOTHETICAL PROTEIN from Mycobacterium leprae (256 aa) FASTA scores: opt: 1421, E(): 5.6e-83, (78.5% identity in 251 aa overlap). Also similar to others e.g. Q9K3V0|SCD10.27 PUTATIVE HYDROLASE from Streptomyces coelicolor (352 aa), FASTA scores: opt: 193, E(): 5.2e-05, (33.35% identity in 270 aa overlap); O33745|STTC THIOESTERASE (EC 3.1.2.-) from Streptomyces sp (308 aa) FASTA scores: opt: 242, E(): 3.6e-08, (30.35% identity in 270 aa overlap); Q9RK95|SCF1.09 PUTATIVE HYDROLASE from Streptomyces coelicolor (258 aa), FASTA scores: opt: 239, E(): 4.9e-08, (30.75% identity in 247 aa overlap); Q9HZ14|PA3226 PROBABLE HYDROLASE from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 226, E(): 3.4e-07, (26.6% identity in 252 aa overlap); Q9HPT9|EST|VNG1474G CARBOXYLESTERASE from Halobacterium sp. strain NRC-1 (274 aa), FASTA scores: opt: 215, E(): 1.7e-06, (26.95% identity in 256 aa overlap); etc. Protein product from Mb3622c detected using SWATH mass spectrometry. Mb3622c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4R8" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R8" /protein_id="SIU02249.1" /translation="MPRMPANLLTHRGGRGEPLVLVHGLMGRGSTWARQLPWLTLLGA VYTYDAPWHRGRDVADPHPISTERFVADLGDAVSALGAPTRMVGHSMGALHSWCLAAE RPELVSALVVEDMAPDFRGRTTGPWEPWLRALPVEFDSAEQVFAEFGPVAGRYFLDAF DRTATGWRLHGRTARWIEIAAEWGTRDYWAQWRAVRSPALLIEAGDGVTPPGQMRAMA ERDYPTAYLRVPDAGHLVHDEAPQVYRRAVESFLAGLTP" CDS 3981684..3982001 /codon_start=1 /transl_table=11 /gene="mhud" /locus_tag="BQ2027_MB3623" /product="possible heme degrading protein mhud" /note="Mb3623, TB11.2, len: 105 aa. Equivalent to Rv3592, len: 105 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 105 aa overlap). TB11.2, conserved hypothetical protein (see citations from 2000 below), equivalent to Q9CBI8|ML1922 HYPOTHETICAL PROTEIN from Mycobacterium leprae (105 aa) FASTA scores: opt: 591, E(): 2.5e-34, (84.6% identity in 104 aa overlap). Shows some similarity with other bacterial hypothetical proteins e.g. Q9RXN8|DR0272 from Deinococcus radiodurans (109 aa), FASTA scores: opt: 178, E(): 1e-05, (34.3% identity in 102 aa overlap); P38049|YHGC_BACSU from Bacillus subtilis (166 aa) FASTA scores: opt: 175, E(): 2.4e-05, (40.85% identity in 71 aa overlap); Q9K649|BH3883 from Bacillus halodurans (102 aa) FASTA scores: opt: 162, E(): 0.00012, (33.75% identity in 80 aa overlap); etc. Protein product from Mb3623 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3623 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007138" /db_xref="InterPro:IPR011008" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4L7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02250.1" /translation="MPVVKINAIEVPAGAGPELEKRFAHRAHAVENSPGFLGFQLLRP VKGEERYFVVTHWESDEAFQAWANGPAIAAHAGHRANPVATGASLLEFEVVLDVGGTG KTA" CDS 3981979..3983337 /codon_start=1 /transl_table=11 /gene="lpqF" /locus_tag="BQ2027_MB3624" /product="PROBABLE CONSERVED LIPOPROTEIN LPQF" /note="Mb3624, lpqF, len: 452 aa. Equivalent to Rv3593, len: 452 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 452 aa overlap). Probable lpqF, conserved lipoprotein, equivalent to Q9CBI7|MPQF|ML1923 PROBALE SECRETED PROTEIN from Mycobacterium leprae (454 aa), FASTA scores: opt: 2465, E(): 5.7e-144, (79.15% identity in 451 aa overlap). Also similar to Q9KJ91 HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces clavuligerus (430 aa), FASTA scores: opt: 609, E(): 5.2e-30, (30.3% identity in 350 aa overlap); and some similarity with putative beta-lactamases e.g. Q9RYR7|DRA0241 BETA LACTAMASE-RELATED PROTEIN from Deinococcus radiodurans (499 aa), FASTA scores: opt: 322, E(): 2.5e-12, (28.25% identity in 322 aa overlap). Equivalent to AAK48057 from Mycobacterium tuberculosis strain CDC1551 (438 aa) but longer 14 aa. Contains N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Mb3624 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR040846" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4M3" /protein_id="SIU02251.1" /translation="MGPARLHNRRAGRRMLALSAAAALIVALASGCSSAPTPSANAAN HGHRIDTRTPPGLRAQQTMDMLNSDWPIGEIGVGTLAAPGQVDTVKTTMEALWWDRPF ALAGVDIGASVAALHLISSYGAQQDIRIHTDDDGWVDRFDVETQAPSIASWRDVDAVL SKTGARYSFQVAKVDNGRCDPVAGTSTGESLPLASIFKLYVLHALAGAVQHNTVSWDD LLTVTAKSKAVGSSGLELPVGARVSVRTAAEKMIATSDNMATDLLIERLGTRAIEEAL ASAGHHDPASMTPFPTMYELFSVGWGKPDLRDQWKHATQQVRAQILRQTNSTPYQPDP TRAHTPASNYGAEWYGSAEDICRVHAALRADAVGPASPVRQIMSAVPGIQLDRSVWPY IGAKAGGLPGDLTFSWYAVDKTGQPWVVSFQLNWPRDHGPTVTGWMLQVARQVFALIA PQ" CDS 3983484..3984311 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3625" /product="Phage endolysin" /note="Mb3625, -, len: 275 aa. Equivalent to Rv3594, len: 275 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 275 aa overlap). Hypothetical protein, highly similar in part with Q9ZX49|GP29 from Mycobacteriophage TM4 (547 aa), FASTA scores: opt: 526, E(): 1.3e-25, (46.25% identity in 186 aa overlap); and Q9FZS0|LYSA|GP2 from Mycobacterium phage Ms6 (384 aa) FASTA scores: opt: 147, E(): 0.064, (33.35% identity in 84 aa overlap). Mb3625 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4N9" /db_xref="InterPro:IPR002502" /db_xref="InterPro:IPR036505" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4N9" /protein_id="SIU02252.1" /translation="MGWIGDPIWLEEVLRPALGERLRVLDGWRERGHGDFRDIRGVMW HHTGNSRETAKSIARGRPDLPGPLANLHIAHSGVVTIVAVGVCWHAGRGSYPWLPTDN ANWHMIGVECAWPTIRRDGSYDAGERWPDAQIVSMRDVAAALTLKLGYGPERNIGHKE YAGAAQGKWDPGNLSMDWFRAEVAKDTRGEFDHPLTPPPAVIARPPILPKPRNPRDDR ILLEEVWDQLRGIEGRGWPVLGDKTIVDYLAELGNKVDALAAKLDAREGLDRPSDTR" CDS complement(3984225..3985703) /codon_start=1 /transl_table=11 /gene="PE_PGRS59" /locus_tag="BQ2027_MB3626C" /product="pe-pgrs family protein pe_pgrs59" /note="Mb3626c, PE_PGRS59, len: 492 aa. Equivalent to 5' end of Rv3595c, len: 439 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 257 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. O53439|Rv1091|MTV017.44 (853 aa), FASTA scores: opt: 1644, E(): 1.2e-57, (58.75% identity in 492 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (a-*) and a 27 bp insertion (*-gcgcgccggggccaccgtttccgccgg), lead to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis (492 aa versus 439 aa). Mb3626c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4K4" /protein_id="SIU02253.1" /translation="MSFVIAVPEFLSAAATDLANLGSTISAANAAASIPTTGVLAAGA DDVSAAIAALFGAHAQAYQTISAQAATFHAQFVQTLSAGAGAYANAEAANVQQSLLNA INAPTQALLGRPLIGDGADGTAPGQNGGAGGLLYGNGGNGAAGVNAGIAGGSGGAAGL IGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAGGAGGNAWLFG NGGAGGLGAAGAAGAAGVNPLTVPAGQGSMGNNGEPGGPASQAPSSGKPVAPVALAVL ACPLAGPAGPAETVAPARPAAPGGPAGPAALAVGVGSWSATAAQVASGAQGAKAASAP GAAPAVRAEWAAPGNQALGVTLVTGVTAGSAVTAAPAETAARAARAARAACSASPVAP GWAVPPVAGVMAGAVVSPVWPAPRALGLRAVAATATSASSALKVHPASPASRASPVDI VAAGPSAPRRLLEQLPSVTGPVEALARVEFCGQGVDLIAELG" CDS complement(3985811..3988357) /codon_start=1 /transl_table=11 /gene="clpC" /locus_tag="BQ2027_MB3627C" /product="probable atp-dependent protease atp-binding subunit clpc1" /note="Mb3627c, clpC, len: 848 aa. Equivalent to Rv3596c, len: 848 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 848 aa overlap). Probable clpC, ATP-dependent clp protease ATP-binding subunit (EC 3.4.-.-), equivalent to P24428|CLPC_MYCLE PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium leprae (848 aa), FASTA scores: opt: 5286, E(): 0, (97.15% identity in 845 aa overlap). Also highly similar to members of the clpA/clpB family e.g. Q9S6T8|SCE94.24c from Streptomyces coelicolor (841 aa) FASTA scores: opt: 4399, E(): 0, (81.0% identity in 848 aa overlap); Q9KGG2|CLPC|BH0103 from Bacillus halodurans (813 aa), FASTA scores: opt: 3279, E(): 3.8e-173, (61.9% identity in 808 aa overlap); Q55662|CLPC|SLL0020 from Synechocystis sp. strain PCC 6803 (821 aa), FASTA scores: opt: 3201, E(): 7.6e-169, (60.5% identity in 820 aa overlap); P51332|CLPC_PORPU from Porphyra purpurea (821 aa), FASTA scores: opt: 3045, E(): 3e-160, (57.65% identity in 817 aa overlap); P37571|CLPC_BACSU|MECB from Bacillus subtilis (810 aa), FASTA scores: opt: 2969, E(): 4.6e-156, (61.15% identity in 811 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CLPA/CLPB FAMILY, CLPC SUBFAMILY. Protein product from Mb3627c detected using shotgun mass spectrometry. Mb3627c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A523" /db_xref="InterPro:IPR001270" /db_xref="InterPro:IPR001943" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR004176" /db_xref="InterPro:IPR018368" /db_xref="InterPro:IPR019489" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036628" /db_xref="InterPro:IPR041546" /db_xref="UniProtKB/Swiss-Prot:P0A523" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02254.1" /translation="MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGV AAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPFTPRAKKVLELSLREALQLGHNY IGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEP GVGKTAVVEGLAQAIVHGEVPETLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEI NTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTIGATTLDEYRKYIEK DAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDAAMVAAATLADRYIND RFLPDKAIDLIDEAGARMRIRRMTAPPDLREFDEKIAEARREKESAIDAQDFEKAASL RDREKTLVAQRAEREKQWRSGDLDVVAEVDDEQIAEVLGNWTGIPVFKLTEAETTRLL RMEEELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKTELSKAL ANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVVLF DEIEKAHQEIYNSLLQVLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFS KGGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFHQLTREEIIRMVDLMISRVAG QLKSKDMALVLTDAAKALLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQ VVTVDVDNWDGEGPGEDAVFTFTGTRKPPAEPDLAKAGAHSAGGPEPAAR" CDS complement(3988634..3988972) /codon_start=1 /transl_table=11 /gene="lsr2" /locus_tag="BQ2027_MB3628C" /product="iron-regulated h-ns-like protein lsr2" /note="Mb3628c, lsr2, len: 112 aa. Equivalent to Rv3597c, len: 112 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 112 aa overlap). Probable lsr2, identical to P24094|LSR2_MYCLE|ML0234 LSR2 PROTEIN PRECURSOR (15 KDA ANTIGEN) (A15) from Mycobacterium leprae (112 aa), FASTA scores: opt: 698, E(): 6.7e-37, (92.85% identity in 112 aa overlap). Also highly similar to others e.g. Q9X8N1|SCE94.26c from Streptomyces coelicolor (111 aa), FASTA scores: opt: 379, E(): 4.4e-17, (58.05% identity in 112 aa overlap); Q9ETI2|LSR2 from Corynebacterium equii (Rhodococcus equi) (119 aa), FASTA scores: opt: 328, E(): 6.9e-14, (47.5% identity in 120 aa overlap); and Q9RKK8|SCD25.12c from Streptomyces coelicolor (105 aa), FASTA scores: opt: 293, E(): 9.4e-12, (47.75% identity in 111 aa overlap). Protein product from Mb3628c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3628c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65649" /db_xref="InterPro:IPR024412" /db_xref="InterPro:IPR042254" /db_xref="InterPro:IPR042261" /db_xref="UniProtKB/Swiss-Prot:P65649" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02255.1" /translation="MAKKVTVTLVDDFDGSGAADETVEFGLDGVTYEIDLSTKNATKL RGDLKQWVAAGRRVGGRRRGRSGSGRGRGAIDREQSAAIREWARRNGHNVSTRGRIPA DVIDAYHAAT" CDS complement(3989076..3990593) /codon_start=1 /transl_table=11 /gene="lysS" /locus_tag="BQ2027_MB3629C" /product="LYSYL-TRNA SYNTHETASE 1 LYSS (LYSINE--TRNA LIGASE 1) (LYSRS 1) (LYSINE TRANSLASE)" /note="Mb3629c, lysS, len: 505 aa. Equivalent to Rv3598c, len: 505 aa, from Mycobacterium tuberculosis H37Rv, (100.0% identity in 505 aa overlap). Probable lysS, lysyl-tRNA synthetase 1 (EC 6.1.1.6), equivalent to P46861|SYK_MYCLE|LYSS|ML0233 LYSYL-TRNA SYNTHETASE from Mycobacterium leprae (507 aa), FASTA scores: opt: 2835, E(): 4.5e-172, (85.45% identity in 501 aa overlap); and similar with C-terminal part of Q9CC23|LYSX|ML1393 C-TERM LYSYL-TRNA SYNTHASE from Mycobacterium leprae (1039 aa) FASTA scores: opt: 1257, E(): 7.6e-72, (44.55% identity in 505 aa overlap). Also similar to others e.g. P37477|SYK_BACSU|LYSS from Bacillus subtilis (499 aa) FASTA scores: opt: 1294, E(): 1.9e-74, (42.35% identity in 498 aa overlap); Q9RHV9|SYK_BACST|LYSS from Bacillus stearothermophilus (494 aa), FASTA scores: opt: 1258, E(): 3.5e-72, (41.15% identity in 498 aa overlap); Q9PEB6|SYK_XYLFA|LYSS|XF1112 from Xylella fastidiosa (506 aa), FASTA scores: opt: 1228, E(): 2.9e-70, (43.05% identity in 495 aa overlap); etc. Also similar to P94974|SYK2_MYCTU|LYSS2|LYSX|Rv1640c|MTCY06H11.04c LYSYL-TRNA SYNTHETASE 2 from Mycobacterium tuberculosis (1172 aa), FASTA scores: opt: 1295, E(): 3.3e-74, (45.65% identity in 506 aa overlap). Contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb3629c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3629c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67608" /db_xref="InterPro:IPR002313" /db_xref="InterPro:IPR004364" /db_xref="InterPro:IPR004365" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR018149" /db_xref="UniProtKB/Swiss-Prot:P67608" /protein_id="SIU02256.1" /translation="MSAADTAEDLPEQFRIRRDKRARLLAQGRDPYPVAVPRTHTLAE VRAAHPDLPIDTATEDIVGVAGRVIFARNSGKLCFATLQDGDGTQLQVMISLDKVGQA ALDAWKADVDLGDIVYVHGAVISSRRGELSVLADCWRIAAKSLRPLPVAHKEMSEESR VRQRYVDLIVRPEARAVARLRIAVVRAIRTALQRRGFLEVETPVLQTLAGGAAARPFA THSNALDIDLYLRIAPELFLKRCIVGGFDKVFELNRVFRNEGADSTHSPEFSMLETYQ TYGTYDDSAVVTRELIQEVADEAIGTRQLPLPDGSVYDIDGEWATIQMYPSLSVALGE EITPQTTVDRLRGIADSLGLEKDPAIHDNRGFGHGKLIEELWERTVGKSLSAPTFVKD FPVQTTPLTRQHRSIPGVTEKWDLYLRGIELATGYSELSDPVVQRERFADQARAAAAG DDEAMVLDEDFLAALEYGMPPCTGTGMGIDRLLMSLTGLSIRETVLFPIVRPHSN" CDS complement(3990605..3990688) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3629A" /product="Hypothetical short protein" /note="Mb3629A, len: 27 aa. Equivalent to Rv3599c len: 27 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 27 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Hypothetical unknown protein. Mb3629A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4M2" /protein_id="SIU02257.1" /translation="MPASSLGTGSPAADRLDATHERRREVI" CDS complement(3990694..3991512) /codon_start=1 /transl_table=11 /gene="coaX" /locus_tag="BQ2027_MB3630C" /product="Pantothenate kinase type III, CoaX-like (EC" /EC_number="2.7.1.33" /note="Mb3630c, -, len: 272 aa. Equivalent to Rv3600c, len: 272 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 272 aa overlap). Conserved hypothetical protein, identical to Q9CD56|ML0232 HYPOTHETICAL PROTEIN from Mycobacterium leprae (274 aa), FASTA scores: opt: 1585, E(): 1.3e-92, (90.5% identity in 274 aa overlap). Also highly similar to others e.g. Q9X8N6|SCE94.31c from Streptomyces coelicolor (265 aa) FASTA scores: opt: 878, E(): 3.9e-48, (51.5% identity in 268 aa overlap); and Q9KGH5|BH0086 from Bacillus halodurans (254 aa), FASTA scores: opt: 611, E(): 2.4e-31, (37.5% identity in 264 aa overlap). And similar to various bacterial proteins e.g. Q9F985 PUTATIVE 32 KDA REPLICATION PROTEIN from Bacillus stearothermophilus (258 aa), FASTA scores: opt: 594, E(): 2.8e-30, (37.45% identity in 267 aa overlap); P37564|YACB_BACSU from Bacillus subtilis (233 aa), FASTA scores: opt: 522, E(): 8.8e-26, (38.95% identity in 213 aa overlap); Q9RX54|DR0461 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (262 aa), FASTA scores: opt: 503, E(): 1.5e-24, (38.45% identity in 268 aa overlap); etc. Protein product from Mb3630c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3630c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TW42" /db_xref="InterPro:IPR004619" /db_xref="UniProtKB/Swiss-Prot:Q7TW42" /protein_id="SIU02258.1" /translation="MLLAIDVRNTHTVVGLLSGMKEHAKVVQQWRIRTESEVTADELA LTIDGLIGEDSERLTGTAALSTVPSVLHEVRIMLDQYWPSVPHVLIEPGVRTGIPLLV DNPKEVGADRIVNCLAAYDRFRKAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSS DAAAARSAALRRVELARPRSVVGKNTVECMQAGAVFGFAGLVDGLVGRIREDVSGFSV DHDVAIVATGHTAPLLLPELHTVDHYDQHLTLQGLRLVFERNLEVQRGRLKTAR" CDS complement(3991515..3991934) /codon_start=1 /transl_table=11 /gene="panD" /locus_tag="BQ2027_MB3631C" /product="PROBABLE ASPARTATE 1-DECARBOXYLASE PRECURSOR PAND (ASPARTATE ALPHA-DECARBOXYLASE)" /note="Mb3631c, panD, len: 139 aa. Equivalent to Rv3601c, len: 139 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 139 aa overlap). Probable panD, aspartate 1-decarboxylase (EC 4.1.1.11), identical to Q9CD57|PAND|ML0231 PUTATIVE ASPARTATE-1-DECARBOXYLASE from Mycobacterium leprae (142 aa), FASTA scores: opt: 733, E(): 5.5e-41, (82.85% identity in 140 aa overlap). Also highly similar to many e.g. CAC44328|PAND from Streptomyces coelicolor (139 aa), FASTA scores: opt: 578, E(): 6.4e-31, (75.0% identity in 120 aa overlap); Q9X4N0|PAND from Corynebacterium glutamicum (Brevibacterium flavum) (136 aa), FASTA scores: opt: 506, E(): 3e-26, (62.2% identity in 135 aa overlap); P52999|PAND_BACSU from Bacillus subtilis (127 aa) FASTA scores: opt: 421, E(): 9.6e-21, (54.75% identity in 123 aa overlap); P31664|PAND_ECOLI|B0131 from Escherichia coli strain K12 (126 aa), FASTA scores: opt: 388, E(): 1.3e-18, (50.45% identity in 113 aa overlap); etc. Mb3631c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P65661" /db_xref="InterPro:IPR003190" /db_xref="InterPro:IPR009010" /db_xref="UniProtKB/Swiss-Prot:P65661" /protein_id="SIU02259.1" /translation="MLRTMLKSKIHRATVTCADLHYVGSVTIDADLMDAADLLEGEQV TIVDIDNGARLVTYAITGERGSGVIGINGAAAHLVHPGDLVILIAYATMDDARARTYQ PRIVFVDAYNKPIDMGHDPAFVPENAGELLDPRLGVG" CDS complement(3991934..3992863) /codon_start=1 /transl_table=11 /gene="panC" /locus_tag="BQ2027_MB3632C" /product="pantoate--beta-alanine ligase panc (pantothenate synthetase) (pantoate activating enzyme)" /note="Mb3632c, panC, len: 309 aa. Equivalent to Rv3602c, len: 309 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 309 aa overlap). Probable panC, pantoate--beta-alanine ligase (EC 6.3.2.1), equivalent to O69524|PANC_MYCLE|ML0230|MLCB2548.01c PANTOATE--BETA-ALANINE LIGASE from Mycobacterium leprae (313 aa), FASTA scores: opt: 1541, E(): 3.4e-84, (82.15% identity in 297 aa overlap). Also similar to others e.g. O67891|PANC_AQUAE|AQ_2132 from Aquifex aeolicus (282 aa) FASTA scores: opt: 774, E(): 8.6e-39, (46.9% identity in 273 aa overlap); Q9HV69|PANC_PSEAE|PA4730 from Pseudomonas aeruginosa (283 aa), FASTA scores: opt: 770, E(): 1.5e-38, (51.45% identity in 276 aa overlap); Q9A6C8|CC2166 from Caulobacter crescentus (285 aa), FASTA scores: opt: 744, E(): 5.2e-37, (47.75% identity in 268 aa overlap); P31663|PANC_ECOLI|B0133 from Escherichia coli strain K12 (283 aa), FASTA scores: opt: 695, E(): 4.1e-34, (46.1% identity in 271 aa overlap); etc. BELONGS TO THE PANTOTHENATE SYNTHETASE FAMILY. Protein product from Mb3632c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3632c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5R1" /db_xref="InterPro:IPR003721" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR042176" /db_xref="UniProtKB/Swiss-Prot:P0A5R1" /protein_id="SIU02260.1" /translation="MTIPAFHPGELNVYSAPGDVADVSRALRLTGRRVMLVPTMGALH EGHLALVRAAKRVPGSVVVVSIFVNPMQFGAGEDLDAYPRTPDDDLAQLRAEGVEIAF TPTTAAMYPDGLRTTVQPGPLAAELEGGPRPTHFAGVLTVVLKLLQIVRPDRVFFGEK DYQQLVLIRQLVADFNLDVAVVGVPTVREADGLAMSSRNRYLDPAQRAAAVALSAALT AAAHAATAGAQAALDAARAVLDAAPGVAVDYLELRDIGLGPMPLNGSGRLLVAARLGT TRLLDNIAIEIGTFAGTDRPDGYRAILESHWRN" CDS complement(3992860..3993771) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3633C" /product="Ketopantoate reductase PanG (EC" /EC_number="1.1.1.169" /note="Mb3633c, -, len: 303 aa. Equivalent to Rv3603c, len: 303 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 303 aa overlap). Conserved hypothetical ala-, leu-rich protein, identical except at N-terminus (really different) to AAK48066|MT3708 CHALCONE/STILBENE SYNTHASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (361 aa) FASTA scores: opt: 1742, E(): 8.3e-95, (100.0% identity in 275 aa overlap). Equivalent to O69525|MLCB2548.02c|ML0229 HYPOTHETICAL 32.7 KDA PROTEIN from Mycobacterium leprae (309 aa), FASTA scores: opt: 947, E(): 2.4e-48, (67.85% identity in 311 aa overlap). Also highly similar to Q9X845|SCE126.02c HYPOTHETICAL 42.2 KDA PROTEIN from Streptomyces coelicolor (420 aa), FASTA scores: opt: 683, E(): 8.5e-33, (49.3% identity in 284 aa overlap). Protein product from Mb3633c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3633c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4M9" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR018931" /db_xref="InterPro:IPR019665" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR037108" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4M9" /protein_id="SIU02261.1" /translation="MERFDGLRPARLKVGIISAGRVGTALGVALQRADHVVVACSAIS HASRRRAQRRLPDTPVLPPLDVAASAELLLLAVTDSELAGLVSGLAATSAVRPQTIVA HTSGANGIGILAPLAQQGCIPLAIHPAMTFTGSDEDISRLPDTCFGITAADDVGYAIG QSLVLEMGGEPFCVREDARILYHAALAHASNHIVTVLADALEALRAALSGGELLGQQT VDDQPGGIVERIVGPLARAALENTLQRGQAALTGPVARGDAAAVADHLAALADVDAAL AQAYRINALRTAQRAHAPADVVEVLTA" CDS complement(3993956..3995149) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3634C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN RICH IN ALANINE AND ARGININE AND PROLINE" /note="Mb3634c, -, len: 397 aa. Equivalent to Rv3604c, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 397 aa overlap). Probable conserved ala-, arg-, pro-rich transmembrane protein, equivalent to O69526|MLCB2548.03c|ML0228 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (432 aa), FASTA scores: opt: 869, E(): 2.9e-31, (59.7% identity in 432 aa overlap). Contains two possible membrane-spanning domains. N-terminus shortened since first submission (previously 462 aa). Protein product from Mb3634c detected using SWATH mass spectrometry. Mb3634c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4P9" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P9" /protein_id="SIU02262.1" /translation="MTVLSRGARVRRGGRRPGWVLLTALLVLAIGASSALVFTDRVEL LKLAVLLALWAAVAGAFVSVLYRRQSDVDQARVRDLKLVYDLQLDREISARREYELTL ESQLRRELASELRAPAADEVAALRAELAALRTSLEILFDADLEHRPALGTGEKEARAA RALDGESPPADWVSSDRVMAVRGGDGASRTDEASIIDVPEVGVPPVSGGPRHYEAPPP PQPEPLFEPRHRPPPLPPQQERPVWQPVTSHGQWLPAETPGSQWASVEPETTPAAPPP GRRRRARHASPADQAYNPPAYVELAAQYGESGRRSRHSAEHRDHDIGGSGAGTGERPP SPPMAPPPPAEPTRRHRTADTPPDDSGGLHARDPLTGGQSVADLMARLQVESTGGGRR RRRGE" CDS complement(3995358..3995834) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3635C" /product="PROBABLE CONSERVED SECRETED PROTEIN" /note="Mb3635c, -, len: 158 aa. Equivalent to Rv3605c, len: 158 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 158 aa overlap). Probable conserved secreted or membrane protein, identical to O69527|MLCB2548.04c|ML0227 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (158 aa), FASTA scores: opt: 944, E(): 2.6e-56, (85.45% identity in 158 aa overlap). Also similar to other proteins e.g. Q9X8I2|SCE9.09 POSSIBLE SECRETED PROTEIN from Streptomyces coelicolor (162 aa), FASTA scores: opt: 174, E(): 9.2e-05, (31.25% identity in 128 aa overlap); etc. Contains possible N-terminal signal sequence. Protein product from Mb3635c detected using SWATH mass spectrometry. Mb3635c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4M6" /db_xref="InterPro:IPR021517" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4M6" /protein_id="SIU02263.1" /translation="MGPTRKRDLTAAVVGAAAVGYLLVAVLYRWFPPITVWTGLSLLA VAVAEALWARYVRVKISDGEIGDGPGWLHPLVVARSLMVAKASAWVGALVTGWWIGVL AYFLPRRSWLRAAAEDTTGTVVAAGSALALVVAALWLQHCCKSPQDPTEHADGAES" CDS complement(3995834..3996400) /codon_start=1 /transl_table=11 /gene="folK" /locus_tag="BQ2027_MB3636C" /product="2-AMINO-4-HYDROXY-6- HYDROXYMETHYLDIHYDROPTERIDINEPYROPHOSPHOKINASE FOLK (7,8-DIHYDRO-6-HYDROXYMETHYLPTERIN-PYROPHOSPHOKINASE) (HPPK) (6-HYDROXYMETHYL-7,8-DIHYDROPTERIN PYROPHOSPHOKINASE) (PPPK) (2-AMINO-4-HYDROXY-6-HYDROXYMETHYLDIHYDROPTERIDINE DIPHOSPHOKINASE) (7,8-DIHYDRO-6-HYDROXYMETHYLPTERIN- DIPHOSPHOKINASE) (6-HYDROXYMETHYL-7,8-DIHYDROPTERIN DIPHOSPHOKINASE)" /note="Mb3636c, folK, len: 188 aa. Equivalent to Rv3606c, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 188 aa overlap). Probable folK, 2-amino-4-hydroxy-6-hydroxymethyldihydropterine pyrophosphokinase (EC 2.7.6.3), equivalent to O69528|HPPK_MYCLE|FOLK|ML0226\MLCB2548.05c 2-AMINO-4-HYDROXY-6-HYDROXYMETHYLDIHYDROPTERIDINE PYROPHOSPHOKINASE from Mycobacterium leprae (191 aa) FASTA scores: opt: 772, E(): 1.2e-44, (63.15% identity in 190 aa overlap). Also similar to many e.g. P71512|HPPK_METEX|FOLK|FOLA from Methylobacterium extorquens (158 aa), FASTA scores: opt: 292, E(): 1.4e-12, (36.85% identity in 171 aa overlap); O33726|HPPK_STRPY|FOLK|SPY1100 from Streptococcus pyogenes (166 aa), FASTA scores: opt: 234, E(): 1.1e-08, (34.3% identity in 175 aa overlap); Q9X8I1|SCE9.08 from Streptomyces coelicolor (203 aa), FASTA scores: opt: 232, E(): 1.7e-08, (43.25% identity in 185 aa overlap); P26281|HPPK_ECOLI|FOLK|B0142 from Escherichia coli strain K12 (158 aa), FASTA scores: opt: 198, E(): 2.6e-06, (32.85% identity in 143 aa overlap); etc. BELONGS TO THE HPPK FAMILY. Protein product from Mb3636c detected using SWATH mass spectrometry. Mb3636c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64144" /db_xref="InterPro:IPR000550" /db_xref="InterPro:IPR035907" /db_xref="UniProtKB/Swiss-Prot:P64144" /protein_id="SIU02264.1" /translation="MTRVVLSVGSNLGDRLARLRSVADGLGDALIAASPIYEADPWGG VEQGQFLNAVLIADDPTCEPREWLRRAQEFERAAGRVRGQRWGPRNLDVDLIACYQTS ATEALVEVTARENHLTLPHPLAHLRAFVLIPWIAVDPTAQLTVAGCPRPVTRLLAELE PADRDSVRLFRPSFDLNSRHPVSRAPES" CDS complement(3996397..3996798) /codon_start=1 /transl_table=11 /gene="folB" /locus_tag="BQ2027_MB3637C" /product="PROBABLE DIHYDRONEOPTERIN ALDOLASE FOLB (DHNA)" /note="Mb3637c, folB, len: 133 aa. Equivalent to Rv3607c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Probable folB, dihydroneopterin aldolase (EC 4.1.2.25), equivalent to O69529|FOLB_MYCLE|ML0225|MLCB2548.06c PROBABLE DIHYDRONEOPTERIN ALDOLASE from Mycobacterium leprae (132 aa), FASTA scores: opt: 673, E(): 5.1e-37, (74.8% identity in 131 aa overlap). Also similar to many e.g. Q9X8I0|FOLB_STRCO|SCE9.07 from Streptomyces coelicolor (119 aa), FASTA scores: opt: 334, E(): 4.5e-15, (46.15% identity in 117 aa overlap); P74342|FOLB_SYNY3|SLR1626 from Synechocystis sp. strain PCC 6803 (118 aa) FASTA scores: opt: 287, E(): 5e-12, (38.45% identity in 117 aa overlap); P28823|FOLB_BACSU|FOLA from Bacillus subtilis (120 aa), FASTA scores: opt: 283, E(): 9.2e-12, (39.0% identity in 118 aa overlap); etc. BELONGS TO THE DHNA FAMILY. Note that previously known as folX. Protein product from Mb3637c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3637c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A581" /db_xref="InterPro:IPR006156" /db_xref="InterPro:IPR006157" /db_xref="UniProtKB/Swiss-Prot:P0A581" /protein_id="SIU02265.1" /translation="MADRIELRGLTVHGRHGVYDHERVAGQRFVIDVTVWIDLAEAAN SDDLADTYDYVRLASRAAEIVAGPPRKLIETVGAEIADHVMDDQRVHAVEVAVHKPQA PIPQTFDDVAVVIRRSRRGGRGWVVPAGGAV" CDS complement(3996791..3997633) /codon_start=1 /transl_table=11 /gene="folP1" /locus_tag="BQ2027_MB3638C" /product="DIHYDROPTEROATE SYNTHASE 1 FOLP (DHPS 1) (DIHYDROPTEROATE PYROPHOSPHORYLASE 1) (DIHYDROPTEROATE DIPHOSPHORYLASE 1)" /note="Mb3638c, folP1, len: 280 aa. Equivalent to Rv3608c, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Probable folP1, dihydropteroate synthase 1 (EC 2.5.1.15), equivalent to O69530|FOLP (alias Q9S0T0|FOLP and Q9R2U9|FOLP) DIHYDRONEOPTERIN ALDOLASE from Mycobacterium leprae (284 aa), FASTA scores: opt: 1418, E(): 7.2e-77, (76.75% identity in 284 aa overlap). Also highly similar to many e.g. Q9X8H8|SCE9.05 from Streptomyces coelicolor (288 aa), FASTA scores: opt: 953, E(): 2.4e-49, (56.0% identity in 266 aa overlap); Q9A3I0|CC3224 from Caulobacter crescentus (274 aa), FASTA scores: opt: 682, E(): 2.6e-33, (45.5% identity in 268 aa overlap); P73248|DHPS_SYNY3|FOLP|SLR2026 from Synechocystis sp. strain PCC 6803 (289 aa), FASTA scores: opt: 665, E(): 2.7e-32, (44.55% identity in 265 aa overlap); P26282|DHPS_ECOLI|FOLP|B3177 from Escherichia coli strain K12 (282 aa), FASTA scores: opt: 642, E(): 6.1e-31, (41.95% identity in 274 aa overlap); etc. Contains PS00792 Dihydropteroate synthase signature 1, PS00793 Dihydropteroate synthase signature 2. SIMILAR TO OTHER SPECIES DHPS. Protein product from Mb3638c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3638c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A579" /db_xref="InterPro:IPR000489" /db_xref="InterPro:IPR006390" /db_xref="InterPro:IPR011005" /db_xref="UniProtKB/Swiss-Prot:P0A579" /protein_id="SIU02266.1" /translation="MSPAPVQVMGVLNVTDDSFSDGGCYLDLDDAVKHGLAMAAAGAG IVDVGGESSRPGATRVDPAVETSRVIPVVKELAAQGITVSIDTMRADVARAALQNGAQ MVNDVSGGRADPAMGPLLAEADVPWVLMHWRAVSADTPHVPVRYGNVVAEVRADLLAS VADAVAAGVDPARLVLDPGLGFAKTAQHNWAILHALPELVATGIPVLVGASRKRFLGA LLAGPDGVMRPTDGRDTATAVISALAALHGAWGVRVHDVRASVDAIKVVEAWMGAERI ERDG" CDS complement(3997630..3998238) /codon_start=1 /transl_table=11 /gene="folE" /locus_tag="BQ2027_MB3639C" /product="GTP CYCLOHYDROLASE I FOLE (GTP-CH-I)" /note="Mb3639c, folE, len: 202 aa. Equivalent to Rv3609c, len: 202 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 202 aa overlap). Probable folE (alternate gene name: gchA), GTP cyclohydrolase I (EC 3.5.4.16), equivalent to O69531|GCH1_MYCLE|FOLE|ML0223|MLCB2548.08c GTP CYCLOHYDROLASE I from Mycobacterium leprae (205 aa) FASTA scores: opt: 1112, E(): 3.8e-63, (81.95% identity in 205 aa overlap). Also highly similar to many e.g. Q9X8I3|GCH1_STRCO|FOLE|SCE9.10c from Streptomyces coelicolor (201 aa), FASTA scores: opt: 873, E(): 4.2e-48, (67.4% identity in 187 aa overlap); Q9KCC7|MTRA|BH1646 from Bacillus halodurans (188 aa), FASTA scores: opt: 757, E(): 8.1e-41, (62.3% identity in 183 aa overlap); P19465|GCH1_BACSU|FOLE|MTRA from Bacillus subtilis (190 aa), FASTA scores: opt: 750, E(): 2.3e-40, (58.95% identity in 190 aa overlap); etc. Contains PS00860 GTP cyclohydrolase I signature 2. BELONGS TO THE GTP CYCLOHYDROLASE I FAMILY. Protein product from Mb3639c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3639c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64208" /db_xref="InterPro:IPR001474" /db_xref="InterPro:IPR018234" /db_xref="InterPro:IPR020602" /db_xref="UniProtKB/Swiss-Prot:P64208" /protein_id="SIU02267.1" /translation="MSQLDSRSASARIRVFDQQRAEAAVRELLYAIGEDPDRDGLVAT PSRVARSYREMFAGLYTDPDSVLNTMFDEDHDELVLVKEIPMYSTCEHHLVAFHGVAH VGYIPGDDGRVTGLSKIARLVDLYAKRPQVQERLTSQIADALMKKLDPRGVIVVIEAE HLCMAMRGVRKPGSVTTTSAVRGLFKTNAASRAEALDLILRK" CDS complement(3998254..4000536) /codon_start=1 /transl_table=11 /gene="ftsH" /locus_tag="BQ2027_MB3640C" /product="MEMBRANE-BOUND PROTEASE FTSH (CELL DIVISION PROTEIN)" /note="Mb3640c, ftsH, len: 760 aa. Equivalent to Rv3610c, len: 760 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 760 aa overlap). ftsH, membrane-bound protease (cell division protein) (EC 3.4.24.-) (see citation below), equivalent to Q9CD58|FTSH_MYCLE|ML0222 (alias O69532|FTSH) CELL DIVISION PROTEIN FTSH HOMOLOG from Mycobacterium leprae (787 aa), FASTA scores: opt: 4388, E(): 9.6e-205, (87.2% identity in 790 aa overlap). Also highly similar to many FTSH proteins e.g. O52395|FTSH from Mycobacterium smegmatis (769 aa), FASTA scores: opt: 3976, E(): 7.6e-185, (82.4% identity in 761 aa overlap); Q9X8I4|SCE9.11c from Streptomyces coelicolor (668 aa), FASTA scores: opt: 2417, E(): 1.4e-109, (57.2% identity in 668 aa overlap); P72991|FTH4_SYNY3|SLR1604 from Synechocystis sp. strain PCC 6803 (616 aa), FASTA scores: opt: 1926, E(): 7.2e-86, (49.35% identity in 612 aa overlap); P28691|FTSH_ECOLI|HFLB|MRSC|TOLZ|B3178 from Escherichia coli strain K12 (644 aa), FASTA scores: opt: 1859, E(): 1.3e-82, (48.95% identity in 605 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00674 AAA-protein family signature. BELONGS TO THE AAA FAMILY OF ATPASES AND PEPTIDASE FAMILY M41 (ZINC METALLOPROTEASE). COFACTOR: BINDS ONE ZINC ION (POTENTIAL). Protein product from Mb3640c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3640c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A4V9" /db_xref="InterPro:IPR000642" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR003960" /db_xref="InterPro:IPR005936" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR037219" /db_xref="InterPro:IPR041569" /db_xref="UniProtKB/Swiss-Prot:P0A4V9" /protein_id="SIU02268.1" /translation="MNRKNVTRTITAIAVVVLLGWSFFYFSDDTRGYKPVDTSVAITQ INGDNVKSAQIDDREQQLRLILKKGNNETDGSEKVITKYPTGYAVDLFNALSAKNAKV STVVNQGSILGELLVYVLPLLLLVGLFVMFSRMQGGARMGFGFGKSRAKQLSKDMPKT TFADVAGVDEAVEELYEIKDFLQNPSRYQALGAKIPKGVLLYGPPGTGKTLLARAVAG EAGVPFFTISGSDFVEMFVGVGASRVRDLFEQAKQNSPCIIFVDEIDAVGRQRGAGLG GGHDEREQTLNQLLVEMDGFGDRAGVILIAATNRPDILDPALLRPGRFDRQIPVSNPD LAGRRAVLRVHSKGKPMAADADLDGLAKRTVGMTGADLANVINEAALLTARENGTVIT GPALEEAVDRVIGGPRRKGRIISEQEKKITAYHEGGHTLAAWAMPDIEPIYKVTILAR GRTGGHAVAVPEEDKGLRTRSEMIAQLVFAMGGRAAEELVFREPTTGAVSDIEQATKI ARSMVTEFGMSSKLGAVKYGSEHGDPFLGRTMGTQPDYSHEVAREIDEEVRKLIEAAH TEAWEILTEYRDVLDTLAGELLEKETLHRPELESIFADVEKRPRLTMFDDFGGRIPSD KPPIKTPGELAIERGEPWPQPVPEPAFKAAIAQATQAAEAARSDAGQTGHGANGSPAG THRSGDRQYGSTQPDYGAPAGWHAPGWPPRSSHRPSYSGEPAPTYPGQPYPTGQADPG SDESSAEQDDEVSRTKPAHG" CDS 4000603..4001145 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3641" /product="HYPOTHETICAL ARGININE AND PROLINE RICH PROTEIN" /note="Mb3641, -, len: 180 aa. Equivalent to Rv3611, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 173 aa overlap). Hypothetical unknown arg-, pro-rich protein. Possible ORF containing several direct repeats. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 111 bp deletion leads to a shorter product compared to its homolog in Mycobacterium tuberculosis (180 aa versus 217 aa). Mb3641 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T1" /protein_id="SIU02269.1" /translation="MAIANPAEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITP EPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRAWR QCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAA GRHWLDQRPVVPDGVGKSDS" CDS complement(4001060..4001389) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3642C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3642c, -, len: 109 aa. Equivalent to Rv3612c, len: 109 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 109 aa overlap). Conserved hypothetical protein. Residues 58 to 81 highly similar to N-terminal part of AAK46718|MT2424 HYPOTHETICAL 3.9 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (36 aa), FASTA scores: opt: 108, E(): 0.38, (69.25% identity in 26 aa overlap). Mb3642c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4N3" /protein_id="SIU02270.1" /translation="MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWAD RVSPGAVTHATGAMCPTLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLASGD PLHPALG" CDS complement(4001423..4001584) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3643C" /product="HYPOTHETICAL PROTEIN" /note="Mb3643c, -, len: 53 aa. Equivalent to Rv3613c, len: 53 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 53 aa overlap). Hypothetical unknown protein. Mb3643c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4N2" /protein_id="SIU02271.1" /translation="MCTMPKLWRAFMAGRPLGSTFTPRQPTGAAPNHVRALDDSIDPS SAPAARAAL" CDS complement(4001684..4002238) /codon_start=1 /transl_table=11 /gene="espd" /locus_tag="BQ2027_MB3644C" /product="esx-1 secretion-associated protein espd" /note="Mb3644c, -, len: 184 aa. Equivalent to Rv3614c, len: 184 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 184 aa overlap). Conserved hypothetical protein, equivalent to Q49730|ML0407|B1620_C3_264|MLCL383.03 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium leprae (216 aa) FASTA scores: opt: 899, E(): 1.7e-51, (71.3% identity in 188 aa overlap); and similar to two hypothetical proteins from Mycobacterium leprae: Q9CDD6|ML0056 (169 aa), FASTA scores: opt: 285, E(): 1.2e-11, (38.35% identity in 172 aa overlap); and O33090|MLCB628.19c (338 aa), FASTA scores: opt: 289, E(): 1.2e-11, (38.95% identity in 172 aa overlap). Also highly similar to O69732|Rv3867|MTV027.02 HYPOTHETICAL 19.9 KDA PROTEIN from Mycobacterium tuberculosis (183 aa), FASTA scores: opt: 563, E(): 1e-29, (54.9% identity in 173 aa overlap). Protein product from Mb3644c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3644c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Q6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02272.1" /translation="MDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTG PDLDNAHGQAETDTEQEIALFTVTNPPRTVSVSTLMDGRIDHVELSARVAWMSESQLA SEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRVALLRKTVGETWGLPSPEEAAA AEAEVFATRYSDDCPAPDDESDPW" CDS complement(4002354..4002665) /codon_start=1 /transl_table=11 /gene="espc" /locus_tag="BQ2027_MB3645C" /product="esx-1 secretion-associated protein espc" /note="Mb3645c, -, len: 103 aa. Equivalent to Rv3615c, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Conserved hypothetical protein, equivalent to Q49723|ML0406|B1620_C2_214|MLCL383 HYPOTHETICAL 11.1 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 364, E(): 4.1e-18, (60.85% identity in 92 aa overlap). Also shows similarity to P96212|Rv3865|MTCY01A6.03 HYPOTHETICAL 10.6 KDA PROTEIN from Mycobacterium tuberculosis (103 aa), FASTA scores: opt: 198, E(): 6.8e-07, (36.25% identity in 102 aa overlap). Protein product from Mb3645c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3645c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65088" /db_xref="InterPro:IPR022536" /db_xref="UniProtKB/Swiss-Prot:P65088" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02273.1" /translation="MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITH GPYCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF T" CDS complement(4002739..4003917) /codon_start=1 /transl_table=11 /gene="espa" /locus_tag="BQ2027_MB3646C" /product="esx-1 secretion-associated protein a, espa" /note="Mb3646c, -, len: 392 aa. Equivalent to Rv3616c, len: 392 aa, from Mycobacterium tuberculosis strain H37RV, (99.5% identity in 392 aa overlap). Conserved hypothetical ala-, gly-rich protein, equivalent to Q49722|ML0405|B1620_C2_213|MLCL383.01 HYPOTHETICAL 40.8 KDA PROTEIN from Mycobacterium leprae (394 aa) FASTA scores: opt: 1620, E(): 5.3e-75, (62.7% identity in 394 aa overlap). Also similar to P96213|Rv3864|MTCY01A6.04c HYPOTHETICAL 42.1 KDA PROTEIN from Mycobacterium tuberculosis (402 aa), FASTA scores: opt: 389, E(): 1.1e-12, (31.75% identity in 400 aa overlap). Protein product from Mb3646c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3646c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Q8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02274.1" /translation="MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALE ELAAAFPGDGWLGSAADKYAGKNRNHVNFFQELADLDRQLISLIHDQANAVQTTRDIL EGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALAYLAVKTLINA TQLLKLLAKLAELVAAAIADIISDVADIIKGILGEVWEFITNALNGLKELWDKLTGWV TGLFSRGWSNLESFFAGVPGLTGATSGLSQVTGLFGAAGLSASSGLAHADSLASSASL PALAGIGGGSGFGGLPSLAQVHAASTRQALRPRADGPVGAAAEQVGGQSQLVSAQGSQ GMGGPVGMGGMHPSSGASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV " CDS 4004383..4004898 /codon_start=1 /transl_table=11 /gene="lpqG" /locus_tag="BQ2027_MB3647" /product="PROBABLE CONSERVED LIPOPROTEIN LPQG" /note="Mb3647, lpqG, len: 171 aa. Equivalent to 3' end of Rv3623, len: 240 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 171 aa overlap). Probable lpqG, conserved lipoprotein, showing some similarity with hypothetical proteins e.g. Q57432 from Methanosarcina barkeri (251 aa), FASTA scores: opt: 319, E(): 6.8e-12, (31.2% identity in 218 aa overlap); Q9PEA5|XF1123 OUTER MEMBRANE PROTEIN from Xylella fastidiosa (242 aa) FASTA scores: opt: 312, E(): 1.7e-11, (28.25% identity in 237 aa overlap); BAB49547|MLR2408 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (236 aa), FASTA scores: opt: 304, E(): 5e-11, (27.05% identity in 244 aa overlap); etc. Has suitable signal peptide and prokaryotic membrane lipoprotein lipid attachment site (PS00013). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a large 5894 bp deletion leads to a shorter product with a different NH2 part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (171 aa versus 240 aa). Protein product from Mb3647 detected using SWATH mass spectrometry. Mb3647 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR007497" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6A5" /protein_id="SIU02275.1" /translation="MNQTNDRQQAVIDALVGAGLDRKDIRTTRVTVAPQYSNPEPAGT ATITGYRADNDIEVKIHPTDAASRLLALVVSTGGDATRISSVSYSIGDDSQLVKDARA RAFQDAKNRADQYAQLSGLRLGKVISISEASGAAPTHEAPAPPRGLSAVPLEPGQQTV GFSVTVVWELT" CDS complement(4004903..4005553) /codon_start=1 /transl_table=11 /gene="hpt" /locus_tag="BQ2027_MB3648C" /product="HYPOXANTHINE-GUANINE PHOSPHORIBOSYLTRANSFERASE HPT (HGPRT) (HGPRTase) (HYPOXANTHINE PHOSPHORIBOSYLTRANSFERASE) (IMP PYROPHOSPHORYLASE) (IMP DIPHOSPHORYLASE) (TRANSPHOSPHORIBOSYLTRANSFERASE) (GUANINE PHOSPHORIBOSYLTRANSFERASE)" /note="Mb3648c, hpt, len: 216 aa. Equivalent to Rv3624c, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). Probable hpt (alternate gene name: hprT), hypoxanthine-guanine phosphoribosyltransferase (EC 2.4.2.8) (but seems to have a 35 aa extension at N-terminus), equivalent to other mycobacterial hypoxanthine-guanine phosphoribosyltransferases e.g. P96794 from Mycobacterium avium (203 aa), FASTA scores: opt: 1136, E(): 1.2e-65, (88.5% identity in 200 aa overlap); and O69537|HPT|ML0214 from Mycobacterium leprae (213 aa), FASTA scores: opt: 1115, E(): 2.8e-64, (81.6% identity in 212 aa overlap). Also similar to others e.g. Q9X8I5|SCE9.12c from Streptomyces coelicolor (187 aa), FASTA scores: opt: 724, E(): 2.4e-39, (60.55% identity in 180 aa overlap); P37472|HPRT_BACSU|HPT from Bacillus subtilis (180 aa) FASTA scores: opt: 574, E(): 9.1e-30, (48.6% identity in 181 aa overlap); etc. Equivalent to AAK48087 from Mycobacterium tuberculosis strain CDC1551 (202 aa) but longer 14 aa. Contains PS00103 Purine/pyrimidine phosphoribosyltransferases signature. BELONGS TO THE PURINE/PYRIMIDINE PHOSPHORIBOSYLTRANSFERASE FAMILY. Protein product from Mb3648c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3648c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5T1" /db_xref="InterPro:IPR000836" /db_xref="InterPro:IPR005904" /db_xref="InterPro:IPR029057" /db_xref="UniProtKB/Swiss-Prot:P0A5T1" /protein_id="SIU02276.1" /translation="MTPALVVGPAAWHAVHVTQSSSAITPGQTAELYPGDIKSVLLTA EQIQARIAELGEQIGNDYRELSATTGQDLLLITVLKGAVLFVTDLARAIPVPTQFEFM AVSSYGSSTSSSGVVRILKDLDRDIHGRDVLIVEDVVDSGLTLSWLSRNLTSRNPRSL RVCTLLRKPDAVHANVEIAYVGFDIPNDFVVGYGLDYDERYRDLSYIGTLDPRVYQ" CDS complement(4005550..4006521) /codon_start=1 /transl_table=11 /gene="mesJ" /locus_tag="BQ2027_MB3649C" /product="POSSIBLE CELL CYCLE PROTEIN MESJ" /note="Mb3649c, mesJ, len: 323 aa. Equivalent to Rv3625c, len: 323 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 323 aa overlap). Possible mesJ, cell cycle protein, equivalent to O69538|Y0C5_MYCLE|ML0213|MLCB2548.18c HYPOTHETICAL 34.1 KDA PROTEIN from Mycobacterium leprae (323 aa) FASTA scores: opt: 1592, E(): 9e-92, (78.0% identity in 327 aa overlap). Similar to bacterial hypothetical proteins Q9X8I6|SCE9.13c from Streptomyces coelicolor (352 aa) FASTA scores: opt: 705, E(): 1.4e-36, (47.85% identity in 305 aa overlap); and Q9HXZ3|PA3638 from Pseudomonas aeruginosa (442 aa), FASTA scores: opt: 382, E(): 2e-16, (40.6% identity in 271 aa overlap). But also similar (or with similarity) to bacterial cell cycle proteins (MESJ) e.g. Q9KPX0|VC2242 MESJ PROTEIN from Vibrio cholerae (440 aa), FASTA scores: opt: 363, E(): 3e-15, (34.8% identity in 253 aa overlap); Q9RV23|DR1207 (600 aa) CELL CYCLE PROTEIN MESJ (PUTATIVE/CYTOSINE DEAMINASE-RELATED PROTEIN) from Deinococcus radiodurans (600 aa), FASTA scores: opt: 310, E(): 7.6e-12, (36.6% identity in 265 aa overlap) (similar only at the N-terminal end); Q9PFJ8|XF0659 CELL CYCLE PROTEIN from Xylella fastidiosa (437 aa), FASTA scores: opt: 301, E(): 2.1e-11, (35.05% identity in 271 aa overlap); P52097|MESJ_ECOLI|B0188 PUTATIVE CELL CYCLE PROTEIN MESJ from Escherichia coli strain K12(432 aa) FASTA scores: opt: 299, E(): 2.8e-11, (34.65% identity in 277 aa overlap); etc. BELONGS TO THE UPF0072 (MESJ/YCF62) FAMILY. Protein product from Mb3649c detected using SWATH mass spectrometry. Mb3649c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67152" /db_xref="InterPro:IPR011063" /db_xref="InterPro:IPR012094" /db_xref="InterPro:IPR012795" /db_xref="InterPro:IPR014729" /db_xref="InterPro:IPR015262" /db_xref="UniProtKB/Swiss-Prot:P67152" /protein_id="SIU02277.1" /translation="MDRQSAVAQLRAAAEQFARVHLDACDRWSVGLSGGPDSLALTAV AARLWPTTALIVDHGLQPGSATVAETARIQAISLGCVDARVLCVQVGAAGGREAAARS ARYSALEEHRDGPVLLAHTLDDQAETVLLGLGRGSGARSIAGMRPYDPPWCRPLLGVR RSVTHAACRELGLTAWQDPHNTDRRFTRTRLRTEVLPLLEDVLGGGVAEALARTATAL REDTDLIDTIAAQALPGAAVAGSRGQELSTSALTALPDAVRRRVIRGWLLAGGATGLT DRQIRGVDRLVTAWRGQGGVAVGSTLRGQRLVAGRRDGVLVLRREPV" CDS complement(4006500..4007552) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3650C" /product="proteolysis/nucleotide biosynthesis" /note="Mb3650c, -, len: 350 aa. Equivalent to Rv3626c, len: 350 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 350 aa overlap). Conserved hypothetical protein, similar to Q9X8I7|SCE9.14c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (375 aa) FASTA scores: opt: 720, E(): 2.2e-38, (41.55% identity in 361 aa overlap); and shows some similarity to Q9HPS0|VNG1497C HYPOTHETICAL PROTEIN (317 aa) FASTA scores: opt: 226, E(): 4.5e-07, (29.7% identity in 347 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142). Protein product from Mb3650c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3650c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR018766" /db_xref="InterPro:IPR022454" /db_xref="InterPro:IPR042271" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S6" /protein_id="SIU02278.1" /translation="MTGASELTLGNTVDWEFAASVGERLARPAPPSTEYTRRQVIDEL TVAAEKAEPPVRDVTGLIADGVVPPARVVDRPAWIRSAAESMRAMTHGSAKPRGFLTG RITGAQTGAVLAFVASGILGQYDPFGAAGEGCLLLVYPNVIAVERQLRVEPSDFRLWV CLHEVTHRVQFTANPWLSGYMSQALNLLTFEPVDDIGRVVSRLADFIRSRGHGTDDSE VNPSGILGLVRAVQSEPQRKALDQLLVLGTLLEGHAEHVMDAVGPMVVPSVATIRRRF DDRRHHKQPPLQRLVRALLGFDAKLSQYTRGKAFVDHVVDRAGMKLFNTIWSGPETLP LPAEIENPQRWIDRVL" CDS complement(4007549..4008934) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3651C" /product="D-alanyl-D-alanine carboxypeptidase (EC" /EC_number="3.4.16.4" /note="Mb3651c, -, len: 461 aa. Equivalent to Rv3627c, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 461 aa overlap). Hypothetical ala-rich protein which may have cleavable signal peptide at N-terminal end. Equivalent to O69539|MLCB2548.20c|ML0211 HYPOTHETICAL 47.2 KDA PROTEIN from Mycobacterium leprae (461 aa), FASTA scores: opt: 2295, E(): 3.5e-116, (76.2% identity in 462 aa overlap); and C-terminal end shows similarity with O05758|MLCB5.28c HYPOTHETICAL 24.1 KDA PROTEIN from Mycobacterium leprae (225 aa), FASTA scores: opt: 268, E(): 1.8e-07, (32.25% identity in 220 aa overlap). Also similar (or with similarity) to various proteins (notably penicillin binding proteins) e.g. Q9X8I8|SCE9.15c HYPOTHETICAL 45.9 KDA PROTEIN from Streptomyces coelicolor (459 aa) FASTA scores: opt: 707, E(): 8.3e-31, (35.75% identity in 439 aa overlap); Q9Z541|SC9B2.18c PUTATIVE CARBOXYPEPTIDASE from Streptomyces coelicolor (451 aa), FASTA scores: opt: 450, E(): 5.3e-17, (31.75% identity in 469 aa overlap); Q9JVV4|NMA0665 PUTATIVE PEPTIDASE from Neisseria meningitidis (serogroup A) (or Q9JY10|NMB1797 from serogroup B) (469 aa), FASTA scores: opt: 269, E(): 3e-07, (26.15% identity in 463 aa overlap); O85665|PBP3 PENICILLIN BINDING PROTEIN 3 from Neisseria gonorrhoeae (469 aa), FASTA scores: opt: 265, E(): 4.9e-07, (31.85% identity in 201 aa overlap); P45161|PBP4_HAEIN|DACB|HI1330 PENICILLIN-BINDING PROTEIN 4 PRECURSOR/PEPTIDASE (479 aa) FASTA scores: opt: 230, E(): 3.8e-05, (27.9% identity in 394 aa overlap); P24228|PBP4_ECOLI|DACB|B3182 PENICILLIN-BINDING PROTEIN 4 PRECURSOR from Escherichia coli strain K12 (477 aa), FASTA scores: opt: 166, E(): 0.1, (28.2% identity in 408 aa overlap); etc. Protein product from Mb3651c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3651c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4U0" /db_xref="InterPro:IPR000667" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U0" /protein_id="SIU02279.1" /translation="MGPTRWRKSTHVVVGAAVLAFVAVVVAAAALVTTGGHRAGVRAP VPPPRPPTVKAGVVPVADTAATPSAAGVTAALAVVAVDPDLGKLAGRITDALTGQELW QRLDDVPLVPASTNKILTAAAALLTLDRQARISTRVVAGGQNPQGPVVLVGAGDPTLS AAPPGQDTWYHGAARIGDLVEQIRRSGVTPTAVQVDASAFSGPTMAPGWDPADIDNGD IAPIEAAMIDAGRIQPTTVNSRRSRTPALDAGRELAKALGLDPAAVTIASAPAGARQL AVVQSAPLIQRLSQMMNASDNVMAECIGREVAVAINRPQSFSGAVDAVTSRLNTAHID TAGAALVDSSGLSLDNRLTARTLDATMQAAAGPDQPALRPLLDLLPIAGGSGTLGERF LDAATDQGPAGWLRAKTGSLTAINSLVGVLTDRSGRVLTFAFISNEAGPNGRNAMDAL ATKLWFCGCTT" CDS 4009072..4009560 /codon_start=1 /transl_table=11 /gene="ppa" /locus_tag="BQ2027_MB3652" /product="INORGANIC PYROPHOSPHATASE PPA (PYROPHOSPHATE PHOSPHO-HYDROLASE) (PPASE) (INORGANIC DIPHOSPHATASE) (DIPHOSPHATE PHOSPHO-HYDROLASE)" /note="Mb3652, ppa, len: 162 aa. Equivalent to Rv3628, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). ppa, inorganic pyrophosphatase (EC 3.6.1.1) (see first citation), identical to O69540|IPYR_MYCLEPPA|ML0210|MLCB2548.21 INORGANIC PYROPHOSPHATASE from Mycobacterium leprae (162 aa) FASTA scores: opt: 1018, E(): 1.3e-59, (89.5% identity in 162 aa overlap). Also highly similar to many bacterial pyrophosphatases e.g. Q9X8I9|IPYR_STRCO|PPA|SCE9.16 from Streptomyces coelicolor (163 aa), FASTA scores: opt: 773, E(): 1.3e-43, (67.5% identity in 163 aa overlap); O05545|IPYR_GLUOX|PPA from Gluconobacter oxydans (Gluconobacter suboxydans) (176 aa), FASTA scores: opt: 553, E(): 3.2e-29, (53.8% identity in 145 aa overlap); P77992|IPYR_THELI|PPA from Thermococcus litoralis (176 aa) FASTA scores: opt: 537, E(): 3.5e-28, (49.35% identity in 152 aa overlap); P50308|IPYR_SULAC|PPA from Sulfolobus acidocaldarius (173 aa), FASTA scores: opt: 518, E(): 6e-27, (45.3% identity in 159 aa overlap); etc. BELONGS TO THE PPASE FAMILY. COFACTOR: REQUIRES THE PRESENCE OF DIVALENT METAL CATION. MAGNESIUM CONFERS THE HIGHEST ACTIVITY. BINDS 4 DIVALENT CATIONS PER SUBUNIT (BY SIMILARITY). Protein product from Mb3652 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3652 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65747" /db_xref="InterPro:IPR008162" /db_xref="InterPro:IPR036649" /db_xref="UniProtKB/Swiss-Prot:P65747" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02280.1" /translation="MQFDVTIEIPKGQRNKYEVDHETGRVRLDRYLYTPMAYPTDYGF IEDTLGDDGDPLDALVLLPQPVFPGVLVAARPVGMFRMVDEHGGDDKVLCVPAGDPRW DHVQDIGDVPAFELDAIKHFFVHYKDLEPGKFVKAADWVDRAEAEAEVQRSVERFKAG TH" CDS complement(4009606..4010703) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3653C" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3653c, -, len: 365 aa. Equivalent to Rv3629c, len: 365 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 365 aa overlap). Probable conserved integral membrane protein, equivalent to O69543|MLCB2548.26|ML0205 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (356 aa), FASTA scores: opt: 1547, E(): 3e-89, (66.2% identity in 361 aa overlap). Also similar to other membrane and hypothetical proteins e.g. CAC37534|SCIF3.15c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (363 aa), FASTA scores: opt: 819, E(): 7.7e-44, (51.55% identity in 351 aa overlap); Q9CGK3|YKJK HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (339 aa) FASTA scores: opt: 683, E(): 2.2e-35, (48.3% identity in 350 aa overlap); Q9KY24|SCC8A.24c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (380 aa) FASTA scores: opt: 528, E(): 1.1e-25, (50.25% identity in 372 aa overlap); Q9RJH8|SCF73.09 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (370 aa) FASTA scores: opt: 439, E(): 3.9e-20, (50.2% identity in 384 aa overlap); Q9PE36|XF1192 INTEGRAL MEMBRANE PROTEIN from Xylella fastidiosa (341 aa), FASTA scores: opt: 337, E(): 8.3e-14, (47.65% identity in 361 aa overlap); etc. Protein product from Mb3653c detected using SWATH mass spectrometry. Mb3653c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4N4" /db_xref="InterPro:IPR007427" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4N4" /protein_id="SIU02281.1" /translation="MSTFRIFGFSLLMTVVALVTGYLHGGPTALFLLAVLALLEVSLS FDNAIINAAILQRMSPFWQRMFLTIGILIAVFGMRLVFPLAIIWTTAGLDPVRAMELA LRPPAHGALEFADGSPSYEKLITAAHPQIAAFGGMFLLMLFLDFVVHDRDIKWLKWIE VPFARIGRLGQVPVIVASVGLVLAGALLTHSSDQRGTVLIAGLLGMVTYLVVNGISRA FRPAGLGEATPGVQARQAAGKAGCALFLYLEVLDAAFSFDGVTGAFAITTDPIIIALG LGVVGAMFVRSITIYLVRQDTLDRYVYLEHGAHWAIGALAIILLLSIDHRFAVPEWVT ASVGVVFIGAAFTESVRRNRLTVRSPTKFGS" CDS 4010824..4012119 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3654" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3654, -, len: 431 aa. Equivalent to Rv3630, len: 431 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 431 aa overlap). Probable conserved integral membrane, highly similar to P71789|YF10_MYCTU|Rv1510|MTCY277.32 HYPOTHETICAL 44.3 KDA PROTEIN from Mycobacterium tuberculosis (432 aa) FASTA scores: opt: 1940, E(): 2.3e-103, (70.75% identity in 424 aa overlap). Note that N-terminal end is highly similar to AAK45825|MT1558 HYPOTHETICAL 18.1 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (172 aa) FASTA scores: opt: 649, E(): 4.2e-30, (61.65% identity in 167 aa overlap); and C-terminal end is highly similar to AAK45826|MT1560 HYPOTHETICAL 25.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (256 aa), FASTA scores: opt: 1269, E(): 2.6e-65, (76.7% identity in 253 aa overlap). Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, so could be a protease. Mb3654 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59985" /db_xref="UniProtKB/Swiss-Prot:P59985" /protein_id="SIU02282.1" /translation="MAVGAAAVTEVGDTASPVGSSGASGGAIASGSVARVGTATAVTA LCGYAVIYLAARNLAPNGFSVFGVFWGAFGLVTGAANGLLQETTREVRSLGYLDVSAD GRRTHPLRVSGMVGLGSLVVIAGSSPLWSGRVFAEARWLSVALLSIGLAGFCLHATLL GMLAGTNRWTQYGALMVADAVIRVVVAAATFVIGWQLVGFIWATVAGSVAWLIMLMTS PPTRAAARLMTPGATATFLRGAAHSIIAAGASAILVMGFPVLLKLTSNELGAQGGVVI LAVTLTRAPLLVPLTAMQGNLIAHFVDERTERIRALIAPAALIGGVGAVGMLAAGVVG PWIMRVAFGSEYQSSSALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYSLGWVGATVG SGLLLLLPLSLETRTVVALLCGPLVGIGVHLVALARTDE" CDS 4012163..4012888 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3655" /product="possible transferase (possibly glycosyltransferase)" /note="Mb3655, -, len: 241 aa. Equivalent to Rv3631, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Possible transferase (EC 2.-.-.-), more specifically a glycosyltransferase (EC 2.4.-.-), equivalent to O69542|MLCB2548.24c|ML0207 PUTATIVE TRANSFERASE (PUTATIVE GLYCOSYLTRANSFERASE) from Mycobacterium leprae (239 aa) FASTA scores: opt: 1303, E(): 2.8e-72, (81.2% identity in 239 aa overlap). Also similar to many dolichyl-phosphate mannose synthases and hypothetical proteins e.g. O59263|PH1585 HYPOTHETICAL 34.6 KDA PROTEIN from Pyrococcus horikoshii (313 aa), FASTA scores: opt: 472, E(): 1.2e-21, (36.65% identity in 232 aa overlap); Q9V152|PAB1971 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE from Pyrococcus abyssi (287 aa), FASTA scores: opt: 467, E(): 2.3e-21, (35.85% identity in 223 aa overlap); Q58619|YC22_METJA|MJ1222 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (243 aa), FASTA scores: opt: 400, E(): 2.4e-17, (33.35% identity in 228 aa overlap); O26474|MTH374 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE RELATED PROTEIN from Methanobacterium thermoautotrophicum (291 aa) FASTA scores: opt: 354, E(): 1.7e-14, (33.5% identity in 218 aa overlap); O26239|MTH136 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE from Methanobacterium thermoautotrophicum (220 aa), FASTA scores: opt: 345, E(): 4.8e-14, (33.5% identity in 221 aa overlap); etc. Protein product from Mb3655 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3655 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Q7" /db_xref="InterPro:IPR001173" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Q7" /protein_id="SIU02283.1" /translation="MASKMDTETHYSDVWVVIPAFNEAAVIGKVVTDVRSVFDHVVCV DDGSTDGTGDIARRSGAHLVRHPINLGQGAAIQTGIEYARKQPGAQVFATFDGDGQHR VKDVAAMVDRLGAGDVDVVIGTRFGRPVGKASASRPPLMKRIVLQTGARLSRRGRRLG LTDTNNGLRVFNKTVADGLNITMSGMSHATEFIMLIAENHWRVAEEPVEVLYTEYSKS KGQPLLNGVNIIFDGFLRGRMPR" CDS 4012885..4013229 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3656" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb3656, -, len: 114 aa. Equivalent to Rv3632, len: 114 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Possible conserved membrane protein, equivalent to O69541|MLCB2548.23c|ML0208 HYPOTHETICAL 12.9 KDA PROTEIN (PUTATIVE MEMBRANE PROTEIN) from Mycobacterium leprae (113 aa), FASTA scores: opt: 594, E(): 7.1e-35, (82.0% identity in 111 aa overlap). Protein product from Mb3656 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3656 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4R7" /db_xref="InterPro:IPR019277" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R7" /protein_id="SIU02284.1" /translation="MNWIQVLLIASIIGLLFYLLRSRRSARSRAWVKVGYVLFVLAGI YAVLRPDDTTVVANWFGVRRGTDLMLYALVMAFSFTTLSTYMRFKDLELRYARIARAL ALEGAQAPEQCR" mobile_element 4012903..4015020 /mobile_element_type="insertion sequence:IS1534" /locus_tag="BQ2027_IS1534" /note="IS1534, len: 2118 nt. Equivalent to IS1534, len: 2136 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 2085 nt overlap). Putative IS element,IS1534, that resembles IS21; possibly defective." CDS 4013440..4014315 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3657" /product="Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin" /note="Mb3657, -, len: 291 aa. Equivalent to Rv3633, len: 291 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 291 aa overlap). Conserved hypothetical protein, similar to Q9X5S6|MMCH from Streptomyces lavendulae (254 aa), FASTA scores: opt: 368, E(): 3.2e-16, (35.05% identity in 194 aa overlap); Q9APW1 HYPOTHETICAL 32.7 KDA PROTEIN from Pseudomonas aeruginosa (295 aa), FASTA scores: opt: 359, E(): 1.3e-15, (37.65% identity in 170 aa overlap); Q9APV4 HYPOTHETICAL 34.1 KDA PROTEIN from Pseudomonas aeruginosa (309 aa), FASTA scores: opt: 316, E(): 7.6e-13, (28.65% identity in 262 aa overlap). And some similarity to Q9HGD7|FUM9 FUM9P from Gibberella moniliformis (300 aa), FASTA scores: opt: 254, E(): 6.5e-09, (29.95% identity in 157 aa overlap); and P47181|YJ9S_YEAST|YJR154W|J2240 HYPOTHETICAL 39.0 KDA PROTEIN from Saccharomyces cerevisiae (Baker's yeast) (346 aa), FASTA scores: opt: 190, E(): 8.5e-05, (26.75% identity in 127 aa overlap). Also similar to P71782|YF01_MYCTU|Rv1501|MT1550|MTCY277.23 from Mycobacterium tuberculosis (273 aa), FASTA scores: opt: 286, E(): 5.5e-11, (27.5% identity in 280 aa overlap). Protein product from Mb3657 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3657 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008775" /db_xref="UniProtKB/Swiss-Prot:P67773" /protein_id="SIU02285.1" /translation="MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERE LPTVIANSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVIEG VLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIALCDFTADNGA TQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWHTAAANRTDAPRPALTINF CVGFVRQQVNQQLSIPRELVRCFEPRLQELIGYGLYAGKMGRIDWRPPADYLDADRHP FLDAVADRLQTSVRL" CDS complement(4014316..4015260) /codon_start=1 /transl_table=11 /gene="galE1" /locus_tag="BQ2027_MB3658C" /product="UDP-GLUCOSE 4-EPIMERASE GALE1 (GALACTOWALDENASE) (UDP-GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHATE GALACTOSE 4-EPIMERASE) (URIDINE DIPHOSPHO-GALACTOSE 4-EPIMERASE)" /note="Mb3658c, galE1, len: 314 aa. Equivalent to Rv3634c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). galE1, UDP-glucose 4-epimerase (EC 5.1.3.2) (see citations below), equivalent to O69544|ML0204|RMLB2|MLCB2548.27c PUTATIVE SUGAR DEHYDRATASE (PUTATIVE SUGAR-NUCLEOTIDE DEHYDRATASE) from Mycobacterium leprae (319 aa), FASTA scores: opt: 1798, E(): 8.2e-100, (86.4% identity in 309 aa overlap). Also similar to other UDP-GLUCOSE 4-EPIMERASES e.g. Q9WYX9|TM0509 from Thermotoga maritima (309 aa) FASTA scores: opt: 877, E(): 4.8e-45, (45.8% identity in 308 aa overlap); Q57664|GALE_METJA|MJ0211 from Methanococcus jannaschii (305 aa), FASTA scores: opt: 792, E(): 5.4e-40, (42.05% identity in 309 aa overlap); Q9K6S7|BH3649 from Bacillus halodurans (311 aa), FASTA scores: opt: 723, E(): 7e-36, (40.5% identity in 316 aa overlap); Q9HSV1|GALE2|VNG0063G from Halobacterium sp. strain NRC-1 (328 aa), FASTA scores: opt: 597, E(): 2.3e-28, (36.35% identity in 322 aa overlap); etc. Contains short-chain alcohol dehydrogenase family signature (PS00061) but this maynot be significant. BELONGS TO THE SUGAR EPIMERASE FAMILY. Note that previously known as rmlB2, a DTDP-glucose 4,6-dehydratase (EC 4.2.1.46) (see third citation). Protein product from Mb3658c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3658c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016040" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y594" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02286.1" /translation="MRALVTGAAGFIGSTLVDRLLADGHSVVGLDNFATGRATNLEHL ADNSAHVFVEADIVTADLHAILEQHRPEVVFHLAAQIDVRRSVADPQFDAAVNVIGTV RLAEAARQTGVRKIVHTSSGGSIYGTPPEYPTPETAPTDPASPYAAGKVAGEIYLNTF RHLYGLDCSHIAPANVYGPRQDPHGEAGVVAIFAQALLSGKPTRVFGDGTNTRDYVFV DDVVDAFVRVSADVGGGLRFNIGTGKETSDRQLHSAVAAAVGGPDDPEFHPPRLGDLK RSCLDIGLAERVLGWRPQIELADGVRRTVEYFRHKHTD" CDS 4015283..4017058 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3659" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3659, -, len: 591 aa. Equivalent to Rv3635, len: 591 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 591 aa overlap). Probable conserved transmembrane protein, equivalent, but longer 25 aa, to O69545|ML0203|MLCB2548.28 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (569 aa), FASTA scores: opt: 2933, E(): 4.6e-173, (77.0% identity in 569 aa overlap). Mb3659 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4S4" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S4" /protein_id="SIU02287.1" /translation="MPAPRMPRVALVAVLLITVQLVVRVVLAFGGYFYWDDLILVGRA GTGGLLSPSYLFDDHDGHVMPGAFLVAGAIIRVAPLVWTGPAISLVVLQLLESLALLR ALYVISSWRPVLLIPLTFALFTPLAVPGFAWWAAALNSLPMLAALAWVCADAILLVRT GNHRYAVTGVLVYLGGLLFFEKAAVIPFVSFAVAALQCHVRGDRSALATVWRAGVRLW TPSLALTVGWVALYLAVVDQRRWSSDLSMTWDLLCRSVTHGIVPALAGGPWDWARWAP ASPWATPPAVVMVLGWLVLIAVLALSLVRKRRIGPVWLTAAGYAVACQVPIFLMRSSP FTALELAQTLRYFPDLVVVLALLAAVALQAPNRAGTRWLDASPARAVATVASAVLFLT SSLYSTATFLASWRDNPTEGYLKNAQASLAAAASGAPLLDQEVDPLVLQRVAWPENLA SHMFALLRVRPEFATTTTQLRMFTSTGRLVDAKVTWVRTIIAGPVPQCGYFVQPDRPE RLILDGPLLPGDWTVELNYLANSDGSMALALSDGPERKVPVHPGLNRVYARLPGAGDA ITVRANTTALSLCIGAAPVGFLAPA" repeat_region 4017315..4017363 /rpt_type=INVERTED /note="49 bp imperfect inverted repeat, IRL,CGGCAACTGAATACTGACCAGAGCGCGGCAACTGAAAATTGACCAGCTT, flanking IS element IS1534." gene 4017315..4019432 /locus_tag="BQ2027_IS1534" CDS 4017401..4017748 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3660" /product="possible transposase" /note="Mb3660, -, len: 115 aa. Equivalent to Rv3636, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Possible transposase, weakly similar to others e.g. O69924|SC3C8.12 PUTATIVE TRANSPOSASE from Streptomyces coelicolor (487 aa) FASTA scores: opt: 132, E(): 0.12, (33.05% identity in 112 aa overlap); O96916 TC1-LIKE TRANSPOSASE from Anopheles gambiae (African malaria mosquito) (332 aa), FASTA scores: opt: 117, E(): 0.84, (30.75% identity in 91 aa overlap); Q9R2U5|IS466A|IS466A-ORF|TNPA|IS469|SCP1.276 TRANSPOSASE (INSERTION ELEMENT IS466S TRANSPOSASE) from Streptomyces coelicolor (513 aa), FASTA scores: opt: 114, E(): 2, (30.5% identity in 82 aa overlap); etc. Similar in part to P96288|Rv2943|MTCY24G1.06c HYPOTHETICAL 45.8 KDA PROTEIN from Mycobacterium tuberculosis (413 aa), FASTA scores: opt: 533, E(): 1.4e-28, (74.55% identity in 110 aa overlap). Contains possible helix-turn-helix motif from aa 19-40 (+4.98 SD). Mb3660 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036388" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T7" /protein_id="SIU02288.1" /translation="MLSVEDWAEIRRLRRSERLPISEIARVLKISRNTVKSALASDGP PKYQRAAKGSVADEAEPRIRELLAAYPRMPATVIAERIGWWYSIRTLSGRVRELRPLY LPPDPASRDICGR" CDS 4018133..4018633 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3661" /product="possible transposase" /note="Mb3661, -, len: 166 aa. Equivalent to Rv3637, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 166 aa overlap). Possible transposase. C-terminal end highly similar to Q9RLQ9|ISTA PUTATIVE TRANSPOSASE A (FRAGMENT) from Mycobacterium bovis (102 aa), FASTA scores: opt: 397, E(): 1.4e-19, (58.8% identity in 102 aa overlap). Weakly similar to others e.g. Q9KJ02 PUTATIVE TRANSPOSASE (FRAGMENT) from Polyangium cellulosum (329 aa), FASTA scores: opt: 191, E(): 1.6e-05, (32.1% identity in 134 aa overlap); Q9LCU2|ISTA COINTEGRASE from Pseudomonas aeruginosa (382 aa) FASTA scores: opt: 144, E(): 0.024, (26.8% identity in 123 aa overlap); P15025|ISTA_PSEAE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS21 from Pseudomonas aeruginosa (390 aa), FASTA scores: opt: 144, E(): 0.025, (26.85% identity in 123 aa overlap); etc. Also highly similar to C-terminal end of P96288|Rv2943|MTCY24G1.06c HYPOTHETICAL 45.8 KDA PROTEIN from Mycobacterium tuberculosis (413 aa) FASTA scores: opt: 722, E(): 1.5e-40, (63.7% identity in 168 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U8" /protein_id="SIU02289.1" /translation="MPGRVFASPADFNTQLQAWLVRANHRQHRVLGCRPADRIEADTA AMLTLPPVGPSIGWRTSTRLPRDHYVRLDGNDYSVHPVAIGRRIEITADLSRVRVWCG GTLVADHDRIWAKHQTISDPEHVVAAKLLRRKRFDIVGPPHHVEVEQRLLTTYDTVLG LDGPVA" CDS 4018633..4019379 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3662" /product="possible transposase" /note="Mb3662, -, len: 248 aa. Equivalent to Rv3638, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 248 aa overlap). Possible transposase, highly similar to Q9RLQ8|ISTB ISTB PROTEIN from Mycobacterium bovis (266 aa), FASTA scores: opt: 784, E(): 4e-46, (78.0% identity in 259 aa overlap); and similar to others e.g. P15026|ISTB_PSEAE INSERTION SEQUENCE IS21 PUTATIVE ATP-BINDING PROTEIN from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 420, E(): 2.2e-21, (38.8% identity in 255 aa overlap); Q45619|ISTB_BACST INSERTION SEQUENCE IS5376 PUTATIVE ATP-BINDING PROTEIN from Bacillus stearothermophilus (251 aa), FASTA scores: opt: 402, E(): 3.6e-20, (34.5% identity in 232 aa overlap); P15026|ISTB_ECOLI ISTB PROTEIN from Escherichia coli (265 aa), FASTA scores: opt: 419, E(): 8e-23, (38.8% identity in 255 aa overlap); etc. C-terminus highly similar to C-terminus of P96287|Rv2944|MTCY24G1.05 HYPOTHETICAL 25.5 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (alias AAK47343|MT3016 IS1533, ORFB from Mycobacterium tuberculosis strain CDC1551) (238 aa), FASTA scores: opt: 784, E(): 3.6e-46, (87.4% identity in 135 aa overlap)." /db_xref="GOA:A0A1R3Y4P5" /db_xref="InterPro:IPR002611" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR028350" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P5" /protein_id="SIU02290.1" /translation="MAAKTATNSRDVAAELAYLTRALKAPTLRGAIEQLADRARTKTW SYEEFLAACLQREVSARESHGGEGRIRAARFPSRKSLEEFDFDHARGLKRDTIAHLGT LDFVTLAIGIAIRACQAGHRVLFATASQWVDRLAAAHHSGTLQSELIRLARYPLLVVD EVGYIPFEPEAANLFFQLVSSRYERASLIVTSNKPFGRWGEVFGDDVVAAAMIDRLVH HAEVIALKGDSYRIKDRDLGRVPTVTADDQ" repeat_region complement(4019384..4019432) /rpt_type=INVERTED /note="49 bp imperfect inverted repeat, IRR,CGGCAACCGAAAACTGATCAGGTGTCGGCAATCGAAAATTGACCAGCTT, flanking IS element IS1534." CDS complement(4019533..4020099) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3663C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3663c, -, len: 188 aa. Equivalent to Rv3639c, len: 188 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 188 aa overlap). Hypothetical protein, with C-terminus highly similar to N-terminus of P95044|Rv0698|MTCY210.15 HYPOTHETICAL 22.3 KDA PROTEIN from Mycobacterium tuberculosis (203 aa), FASTA scores: opt: 224, E(): 4.5e-07, (54.8% identity in 73 aa overlap)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4P6" /protein_id="SIU02291.1" /translation="MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSAT TCNYPPAANDSAQDGFRHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGP TPAPRGLATRQCPPRTVHVDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACT KTGAYVPHLPYSPIAVDPQPSAGQQGPS" mobile_element complement(4020155..4021447) /mobile_element_type="insertion sequence:IS1553" /locus_tag="BQ2027_IS1553" /note="IS1553, len: 1293 nt. Equivalent to IS1553, len: 1293 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 1293 nt overlap). Putative IS element,IS1553." repeat_region 4020155..4020167 /rpt_type=INVERTED /note="13 bp imperfect inverted repeat, IRR,GAGTTCGTCGGTG, flanking IS element IS1553." gene complement(4020155..4021447) /locus_tag="BQ2027_IS1553" CDS complement(4020169..4021398) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3664C" /product="probable transposase" /note="Mb3664c, -, len: 409 aa. Equivalent to Rv3640c, len: 409 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 409 aa overlap). Probable transposase, highly similar to others e.g. Q48882 TRANSPOSASE from Mycobacterium avium (411 aa) FASTA scores: opt: 1574, E(): 6.2e-93, (59.75% identity in 400 aa overlap); Q9AKV5 PUTATIVE TRANSPOSASE (FRAGMENT) from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 1566, E(): 1.9e-92, (60.0% identity in 395 aa overlap); Q48368 TRANSPOSASE from Mycobacterium avium (410 aa), FASTA scores: opt: 1561, E(): 4.1e-92, (59.4% identity in 404 aa overlap); etc." /db_xref="GOA:A0A1R3Y4R9" /db_xref="InterPro:IPR001207" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R9" /protein_id="SIU02292.1" /translation="MALPQSALSELLDAFRTGDGVDLIRDAVRLVLQELSELEATERI GAARYERSDTRVTDRNGARSRVLSTQAGDVELRIPKLRKGSFFPAILEPRRRIDQALY AVVMEAYVHGISTRAVDDLVEAMGVETGISKSEVSRICAGLDEIVGAFRTRTLGHIEF PYVYLDATYLNVRNGTGQVVSMAVIVASGIAADGSREILGLDVGDSEDETFWRGFLTS LKGRGLGGVRLVISDQHAGLVKALKRCFQGAGHQRCRVHFARNLLAHVPKDKADMVAS MFRMIFSAPDAEAVHATWEEVRDRLAASFPKIGPLMDDARAEVLAFTAFPKAHWQKIW STNPLERINKEIKRRSRVVGIFPNPAAVIRLVGAVLADMHDEWQASERRYLSEASMAL LYPDSDNAVVAAISGGQ" repeat_region complement(4021435..4021447) /rpt_type=INVERTED /note="13 bp imperfect inverted repeat, IRL,GAGATCGTCGGTG, flanking IS element IS1553." CDS complement(4021574..4022209) /codon_start=1 /transl_table=11 /gene="fic" /locus_tag="BQ2027_MB3665C" /product="POSSIBLE CELL FILAMENTATION PROTEIN FIC" /note="Mb3665c, fic, len: 211 aa. Equivalent to Rv3641c, len: 211 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 211 aa overlap). Possible fic, cell filamentation protein, similar to others e.g. Q9PCU8|XF1657 CELL FILAMENTATION PROTEIN from Xylella fastidiosa (203 aa), FASTA scores: opt: 324, E(): 2.2e-14, (32.8% identity in 189 aa overlap); P20605|FIC_ECOLI|B3361 from Escherichia coli strain K12 (200 aa), FASTA scores: opt: 323, E(): 2.5e-14, (31.0% identity in 187 aa overlap); P20751|FIC_SALTY from Salmonella typhimurium (200 aa), FASTA scores: opt: 322, E(): 2.9e-14, (32.65% identity in 193 aa overlap); etc. Mb3665c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003812" /db_xref="InterPro:IPR036597" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S8" /protein_id="SIU02293.1" /translation="MPHPWDTGDHERNWQGYFIPAMSVLRNRVGARTHAELRDAENDL VEARVIELREDPNLLGDRTDLAYLRAIHRQLFQDIYVWAGDLRTVGIEKEDESFCAPG GISRPMEHVAAEIYQLDRLRAVGEGDLAGQVAYRYDYVNYAHPFREGNGRSTREFFDL LLSERGSGLDWGKTDLEELHGACHVARANSDLTGLVAMFKGILDAEPTYDF" CDS complement(4022220..4022414) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3666C" /product="HYPOTHETICAL PROTEIN" /note="Mb3666c, -, len: 64 aa. Equivalent to Rv3642c, len: 64 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 64 aa overlap). Hypothetical unknown protein. Mb3666c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR041535" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S5" /protein_id="SIU02294.1" /translation="MFVQATELQKVKRRFRNVRATRRNTELEGTRSTAATRADQNDYA RGKITAAELGERVRRRYNIQ" CDS 4022809..4023000 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3667" /product="HYPOTHETICAL PROTEIN" /note="Mb3667, -, len: 63 aa. Equivalent to Rv3643, len: 63 aa (questionable ORF), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Identical to AAK48106 from Mycobacterium tuberculosis strain CDC1551 (33 aa) but longer 30 aa." /db_xref="UniProtKB/TrEMBL:A0A1R3Y6C2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02295.1" /translation="MERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYS RPIRAFRPLKSNRPGDIPT" tRNA complement(4023014..4023087) /locus_tag="BQ2027_THRU" /product="tRNA-Thr" /note="thrU, len: 73 nt. Equivalent to thrU, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Thr, anticodon cgt." CDS complement(4023165..4024370) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3668C" /product="possible dna polymerase" /note="Mb3668c, -, len: 401 aa. Equivalent to Rv3644c, len: 401 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 401 aa overlap). Possible DNA polymerase (EC 2.7.7.-), equivalent to O69546|MLCB2548.29c|ML0202 HYPOTHETICAL 42.7 KDA PROTEIN from Mycobacterium leprae (405 aa), FASTA scores: opt: 2180, E(): 6.1e-116, (84.4% identity in 404 aa overlap). Similar (in totality or in first 200 aa) to DNA polymerases III, delta' or gamma subunit, e.g. Q9X906|SCH5.03c PUTATIVE DNA POLYMERASE from Streptomyces coelicolor (401 aa), FASTA scores: opt: 1022, E(): 1.5e-50, (47.05% identity in 404 aa overlap); Q9RRS5|DR2410 DNA POLYMERASE III, TAU/GAMMA SUBUNIT from Deinococcus radiodurans (615 aa), FASTA scores: opt: 370, E(): 1.3e-13, (29.95% identity in 394 aa overlap); P28631|HOLB_ECOLI|B1099 DNA POLYMERASE III, DELTA' SUBUNIT from Escherichia coli strain K12 (334 aa), FASTA scores: opt: 345, E(): 2.2e-12, (33.45% identity in 239 aa overlap); Q9JTS1|DNAZX|NMA1656 DNA POLYMERASE III TAU AND GAMMA CHAINS from Neisseria meningitidis (serogroup A) (709 aa), FASTA scores: opt: 346, E(): 3.3e-12, (28.55% identity in 364 aa overlap); etc. Protein product from Mb3668c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3668c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y599" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR004622" /db_xref="InterPro:IPR008921" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y599" /protein_id="SIU02296.1" /translation="MSGVFTRLVGQQAVEAELLATAKAARRDSAHSAGGGGTMTHAWL LTGPPGSGRSVAALCFAAALQCTSGGEPGCGRCRACTTTLAGTHADVRRVIPEGLSIG VDEMRAIVQIAARRPTTGHWQIVVIEDADRLTEGAANALLKVVEEPPPSTVFLLCAPS VDPEDIAVTLRSRCRHVALVTPSTHAIAQVLSDGDGLDPDTANWAASVSGGHVGRARR LATDPQARQRRERALGLARDAATPSRAYAAAEELVAGAEAEALALTAQRIEAETEELR TALGAGGTGKGTGAALRGATGAMKDLERRQKSRQTRASRDALDRALIDLATYFRDALL VAAHAGGVRANHPDMADRVAALAAHAPPERLLRCIEAVLACREALAVNVKPKFAVDAM VATIGQELR" CDS 4024456..4026105 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3669" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3669, -, len: 549 aa. Equivalent to Rv3645, len: 549 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 549 aa overlap). Probable conserved transmembrane protein, equivalent, but longer 20 aa, to O69547|ML0201|MLCB2548.30 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (530 aa), FASTA scores: opt: 2958, E(): 1.5e-168, (85.5% identity in 530 aa overlap). Also closely related to several other hypothetical M. tuberculosis proteins, e.g. Q10631|YD18_MYCTU|Rv1318c|MT1359|MTCY130.03c (541 aa) FASTA scores: opt: 1105, E(): 2.7e-58, (39.35% identity in 506 aa overlap); Q10633|YD20_MYCTU|Rv1320c|MT1362|MTCY130. 05c (567 aa) FASTA scores: opt: 1031, E(): 7.1e-54, (38.1% identity in 509 aa overlap); Q10632|YD19_MYCTU|Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 1016, E(): 5.3e-53, (37.1% identity in 531 aa overlap); etc. Also similar at C-terminal end to many adenylate cyclases (EC 4.6.1.1) e.g. O83498|TP0485 from Treponema pallidum (614 aa) FASTA scores: opt: 365, E(): 3.2e-14, (31.55% identity in 317 aa overlap); P94180|CYAA from Anabaena sp. strain PCC 7120 (735 aa), FASTA scores: opt: 364, E(): 4.2e-14, (32.75% identity in 229 aa overlap); etc. Protein product from Mb3669 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3669 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4T6" /db_xref="InterPro:IPR001054" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR029787" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T6" /protein_id="SIU02297.1" /translation="MDAEAFVGFRQVPAARYGGLMATTAALPRRIHAFVRWVVRTPWP LFSLSMLQSDIIGALFVLGFLRYGLPPQDNIQLQDLPPVNLLIFVSTVIILFLAGAVV NLKLLMPVFRWQRRDNLLTEPDPAATELARSRALRMPLYRTLISLAVWATGGGVFILA SWSVAKHAAPVVAVATALGATATAIIGYLQSERVLRPVAVAALRSGVPENVNAPGVIL RLMLAWIPSTGVPLLAIVLAVAADKIALLHATPEALFNPILMMALAALGIGSVSTLLV AMSIADPLRQLRWALSEVQRGNYNAHMQIYDASELGLLQAGFNDMVRELSERQRLRDL FGRYVGEDVARRALERGTELGGQERDVAVLFVDLVGSTQLAATRPPAEVVQLLNEFFR VVVETVARHGGFVNKFQGDAALAIFGAPIEHPDGAGAALSAARELHDELIPVLGSAEF GIGVSAGRAIAGHIGAQARFEYTVIGDPVNEAARLTELAKLEDGHVLASAIAVSGALD AEALCWDVGEVVELRGRAAPTQLARPMNLAAPEEVSSEVRG" CDS complement(4026102..4028906) /codon_start=1 /transl_table=11 /gene="topA" /locus_tag="BQ2027_MB3670C" /product="dna topoisomerase i topa (omega-protein) (relaxing enzyme) (untwisting enzyme) (swivelase) (type i dna topoisomerase) (nicking-closing enzyme) (topo i)" /note="Mb3670c, topA, len: 934 aa. Equivalent to Rv3646c, len: 934 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 934 aa overlap). topA, DNA topoisomerase I (EC 5.99.1.2) (see citation below), equivalent to O69548|TOP1_MYCLE|TOPA|ML0200|MLCB2548.31c DNA TOPOISOMERASE I from Mycobacterium leprae (947 aa) FASTA scores: opt: 5150, E(): 0, (84.6% identity in 936 aa overlap). Also highly similar to many e.g. Q9X909|TOP1_STRCO|TOPA|SCH5.06c from Streptomyces coelicolor (952 aa), FASTA scores: opt: 2754, E(): 1.3e-153, (61.3% identity in 928 aa overlap); P73810|TOP1_SYNY3|TOPA|SLR2058 from Synechocystis sp. strain PCC 6803 (898 aa), FASTA scores: opt: 1442, E(): 9.1e-77, (47.15% identity in 927 aa overlap); P47368|TOP1_MYCGE|TOPA|MG122 from Mycoplasma genitalium (709 aa), FASTA scores: opt: 865, E(): 4.8e-43, (30.3% identity in 736 aa overlap); P06612|TOP1_ECOLI|TOPA|SUPX|B1274 from Escherichia coli strain K12 (865 aa), FASTA scores: opt: 397, E(): 0, (39.6% identity in 704 aa overlap); etc. BELONGS TO PROKARYOTIC TYPE I/III TOPOISOMERASE FAMILY. Protein product from Mb3670c detected using shotgun mass spectrometry. Mb3670c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A621" /db_xref="InterPro:IPR000380" /db_xref="InterPro:IPR003601" /db_xref="InterPro:IPR003602" /db_xref="InterPro:IPR005733" /db_xref="InterPro:IPR006171" /db_xref="InterPro:IPR013497" /db_xref="InterPro:IPR013824" /db_xref="InterPro:IPR013825" /db_xref="InterPro:IPR013826" /db_xref="InterPro:IPR023405" /db_xref="InterPro:IPR023406" /db_xref="InterPro:IPR025589" /db_xref="InterPro:IPR028612" /db_xref="InterPro:IPR034149" /db_xref="UniProtKB/Swiss-Prot:P0A621" /protein_id="SIU02298.1" /translation="MADPKTKGRGSGGNGSGRRLVIVESPTKARKLASYLGSGYIVES SRGHIRDLPRAASDVPAKYKSQPWARLGVNVDADFEPLYIISPEKRSTVSELRGLLKD VDELYLATDGDREGEAIAWHLLETLKPRIPVKRMVFHEITEPAIRAAAEHPRDLDIDL VDAQETRRILDRLYGYEVSPVLWKKVAPKLSAGRVQSVATRIIVARERDRMAFRSAAY WDILAKLDASVSDPDAAPPTFSARLTAVAGRRVATGRDFDSLGTLRKGDEVIVLDEGS ATALAAGLDGTQLTVASAEEKPYARRPYPPFMTSTLQQEASRKLRFSAERTMSIAQRL YENGYITYMRTDSTTLSESAINAARTQARQLYGDEYVAPAPRQYTRKVKNAQEAHEAI RPAGETFATPDAVRRELDGPNIDDFRLYELIWQRTVASQMADARGMTLSLRITGMSGH QEVVFSATGRTLTFPGFLKAYVETVDELVGGEADDAERRLPHLTPGQRLDIVELTPDG HATNPPARYTEASLVKALEELGIGRPSTYSSIIKTIQDRGYVHKKGSALVPSWVAFAV TGLLEQHFGRLVDYDFTAAMEDELDEIAAGNERRTNWLNNFYFGGDHGVPDSVARSGG LKKLVGINLEGIDAREVNSIKLFDDTHGRPIYVRVGKNGPYLERLVAGDTGEPTPQRA NLSDSITPDELTLQVAEELFATPQQGRTLGLDPETGHEIVAREGRFGPYVTEILPEPA ADAAAAAQGVKKRQKAAGPKPRTGSLLRSMDLQTVTLEDALRLLSLPRVVGVDPASGE EITAQNGRYGPYLKRGNDSRSLVTEDQIFTITLDEALKIYAEPKRRGRQSASAPPLRE LGTDPASGKPMVIKDGRFGPYVTDGETNASLRKGDDVASITDERAAELLADRRARGPA KRPARKAARKVPAKKAAKRD" CDS complement(4029259..4029837) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3671C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3671c, -, len: 192 aa. Equivalent to Rv3647c, len: 192 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 192 aa overlap). Hypothetical protein, equivalent to O69549|MLCB2548.32c|ML0199 HYPOTHETICAL 21.2 KDA PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 1029, E(): 9e-58, (80.4% identity in 199 aa overlap). Protein product from Mb3671c detected using SWATH mass spectrometry. Mb3671c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V4" /protein_id="SIU02299.1" /translation="MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAE SWRASALAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWL PGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPA LRISGRRRLSRLVENVGEPPDGAEAWVQWPRT" CDS complement(4029977..4030180) /codon_start=1 /transl_table=11 /gene="cspA" /locus_tag="BQ2027_MB3672C" /product="PROBABLE COLD SHOCK PROTEIN A CSPA" /note="Mb3672c, cspA, len: 67 aa. Equivalent to Rv3648c, len: 67 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap). Probable cspA, cold shock protein A, identical to O69550|CSPB|CSPA|ML0198 SMALL COLD-SHOCK PROTEIN from Mycobacterium leprae (67 aa) FASTA scores: opt: 451, E(): 3.7e-27, (97.0% identity in 67 aa overlap). Also highly similar to many e.g. Q9KGW0|CSPA from Mycobacterium smegmatis (67 aa) FASTA scores: opt: 439, E(): 2.9e-26, (92.55% identity in 67 aa overlap); P54584|CSP_ARTGO from Arthrobacter globiformis (67 aa), FASTA scores: opt: 335, E(): 1.5e-18, (73.45% identity in 64 aa overlap); O30875|CSPA_MICLU from Micrococcus luteus (Micrococcus lysodeikticus); Q9Z5R4|CSPA_BORPE from Bordetella pertussis (67 aa) FASTA scores: opt: 294, E(): 1.7e-15, (59.7% identity in 67 aa overlap); etc. Contains 'cold-shock' DNA-binding domain signature (PS00352) at N-terminal end. BELONGS TO THE COLD-SHOCK DOMAIN (CSD) FAMILY. Protein product from Mb3672c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3672c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63849" /db_xref="InterPro:IPR002059" /db_xref="InterPro:IPR011129" /db_xref="InterPro:IPR012156" /db_xref="InterPro:IPR012340" /db_xref="InterPro:IPR019844" /db_xref="UniProtKB/Swiss-Prot:P63849" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02300.1" /translation="MPQGTVKWFNAEKGFGFIAPEDGSADVFVHYTEIQGTGFRTLEE NQKVEFEIGHSPKGPQATGVRSL" CDS 4030430..4032745 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3673" /product="probable helicase" /note="Mb3673, -, len: 771 aa. Equivalent to Rv3649, len: 771 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 771 aa overlap). Probable helicase (EC 3.6.-.-), similar to many (known or hypothetical) ATP-dependent helicases e.g. Q9X915|SCH5.13 PUTATIVE HELICASE from Streptomyces coelicolor (815 aa) FASTA scores: opt: 2550, E(): 9.6e-139, (52.45% identity in 774 aa overlap); Q05549|YDR291W|D9819.1 PROTEIN SIMILAR TO SEVERAL DNA HELICASES from Saccharomyces cerevisiae (Baker's yeast) (1077 aa), FASTA scores: opt: 1161, E(): 5.9e-59, (31.05% identity in 780 aa overlap); P50830|YPRA_BACSU HYPOTHETICAL HELICASE from Bacillus subtilis (749 aa), FASTA scores: opt: 1154, E(): 1.1e-58, (34.05% identity in 734 aa overlap); Q9KC10|BH1764 ATP-DEPENDENT RNA HELICASE from Bacillus halodurans (764 aa), FASTA scores: opt: 1122, E(): 8e-57, (32.3% identity in 759 aa overlap); etc. SEEMS SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY, AND TO HELICASE C-TERMINAL DOMAIN." /db_xref="GOA:A0A1R3Y4Q3" /db_xref="InterPro:IPR001650" /db_xref="InterPro:IPR011545" /db_xref="InterPro:IPR014001" /db_xref="InterPro:IPR018973" /db_xref="InterPro:IPR022307" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Q3" /protein_id="SIU02301.1" /translation="MASFGSHLLAAAVAGTPPGERPLRHVAELPPQAGRPRGWPEWAE PDVVDAFADRGISSPWSHQAEAAELAYAGRHVVIGTGPASGKSLAYQLPVLNALATDS RARALYLSPTKALGHDQLRAAHALAAAVPRLADVAPTAYDGDSPDEVRRFARERSRWL FSNPEMTHLSVLRNHARWAVLLRNLRFVIVDECHYYRGVFGSNVAMVLRRLLRLCARY SAHPTVIFASATTASPGATAADLIGQPVVEVTEDGSPRGARTVALWEPALRSDVIGEH GAPVRRSAGAEAARVMADLIVEGAQTLTFVRSRRAAELTALGARARLVDIAPELSDTV ASYRAGYLAEDRSALHQALAEGQLRGLATTNALELGVDIAGLDAVVLAGFPGTVASFW QQAGRSGRRGQGALVVLIARDDPLDTYLVHHPAALLDKPVERVVIDPVNPHLLGPQLL CAATELPLDDAEVRSWGAVEVAESLVDDGLLRRRNGRYFPAPGVKPHAAVDVRGAIGG QIVIVEAGTGRLLGSVGVGQAPAAAHPGAVYLHQGETYVVDSLDFQDGIAFVHAEDPG YATFAREVTDIAVTGTGERLVFGPVALGLVPVTVTNHVVGYLRRQLSGEVLDFVELDM PEHTLPTTAVMYTITSDALVRSGIEATRIPGSLHAAEHAAIGLLPLVASCDRGDIGGM STATGPEGLPSVFVYDGYPGGAGFAERGFRRARTWLGATAEAIEACECPSGCPSCVQS PKCGNGNDPLDKAGAVRVLRLVLAELSEESP" CDS 4032882..4033166 /codon_start=1 /transl_table=11 /gene="PE33" /locus_tag="BQ2027_MB3674" /product="pe family protein pe33" /note="Mb3674, PE33, len: 94 aa. Equivalent to Rv3650, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). Short protein, member of the Mycobacterium tuberculosis PE family, but without the repetitive gly-rich region, similar to the N-terminal part of many e.g. O53809|Rv0746|MTV041.20 PGRS-FAMILY PROTEIN (783 aa), FASTA scores: opt: 363, E(): 2.1e-15, (76.55% identity in 81 aa overlap). Mb3674 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S3" /protein_id="SIU02302.1" /translation="MSFVIAAPEALDSAATDLVVLGSTLGAATAAAAAQTTGIVAAAH DEVSAAIAALFSAHGQAYQAASAQAAAFHTRFIRARSRHPQQETTCRRVR" CDS 4033490..4034527 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3675" /product="Triple sensor-domain protein" /note="Mb3675, -, len: 345 aa. Equivalent to Rv3651, len: 345 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 345 aa overlap). Hypothetical protein, with some similarity to Q9ZHK1 HYPOTHETICAL 36.5 KDA PROTEIN from Rhodococcus sp. X309 (329 aa) FASTA scores: opt: 332, E(): 3.4e-13, (27.4% identity in 321 aa overlap). Protein product from Mb3675 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3675 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041439" /db_xref="InterPro:IPR041458" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T0" /protein_id="SIU02303.1" /translation="MTHDWLLVETLGDEPAVVARGRELKKLVPITTFLRRSPYLAAVR TAIAETLQTGQSLTSITPKHDRVIRTEPVIMTDGRMHGVQVWSGPTDAEPPDRPIPGP LKWDLTRGVATDTPESLTNSGKNPEVEITYGRAFAEDLPARELNPNETQVLAMAVKAK PGKTLCSIWDLTDWQGTPIRIGFVARSALEPGPNGRDHLVARAMNWRAETKAPAVPVD DLAQRILIGLAQAGVHRALVDLKTWTLLKWLDQPCSFYDWRRSAADGPRLHPDDQHVI DAMTRDLANGSASHVLRLPGHDVDWVPVHVTVNRIELEPDTFAGLVALRLPTDEELAD AGLPKATDVTT" CDS 4035281..4035595 /codon_start=1 /transl_table=11 /gene="PE_PGRS60" /locus_tag="BQ2027_MB3676" /product="pe-pgrs family-related protein pe_pgrs60" /note="Mb3676, PE_PGRS60, len: 104 aa. Equivalent to Rv3652, len: 104 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 104 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar at N-terminal end with many e.g. P56877|Y278_MYCTU|Rv0278c|MTV035.06c (957 aa) FASTA scores: opt: 242, E(): 3e-09, (77.35% identity in 53 aa overlap). Originally annotated as the first part of a PE-PGRS family protein (Rv3653/PE_PGRS61 being the second part) but more similar to a PE family protein. Length extended since first submission (+50 aa). Mb3676 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T4" /protein_id="SIU02304.1" /translation="MSYVIAAPEALVAAATDLATLGSTIGAANAAAAGSTTALLTAGA DEVSAAIAAYSECTARPIRHSVRGRRRSMSGSCRPWPQVGAPMRPPRPPASRRCRARS IC" CDS 4035589..4036302 /codon_start=1 /transl_table=11 /gene="PE_PGRS61" /locus_tag="BQ2027_MB3677" /product="pe-pgrs family-related protein pe_pgrs61" /note="Mb3677, PE_PGRS61, len: 237 aa. Similar to Rv3653, len: 195 aa, from Mycobacterium tuberculosis strain H37Rv, (82.2% identity in 237 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to the C-termini of members of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, e.g. MTCY1A11_25, MTCY28_25, MTCY130_10, MTCY1A10_19, MTCY21B4_13, MTCI418B_6,MTCY28_34, MTV004_1, MTCY441_4; etc. Originally annotated as the second part of a PE-PGRS family protein (Rv3652/PE_PGRS60 being the first part). Start shortened since first submission (-50 aa). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to an in-frame insertion of 126 bp leads to a longer product than in Mycobacterium tuberculosis strain H37Rv, (237 aa versus 195 aa)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y6C9" /protein_id="SIU02305.1" /translation="MLNAPTQALLGRPLVGNGANGAPGTGANGGDGGILFGSGGAGGS GAAGMAGGNGGAAGLFGNGGAGGAGGSATAGAAGAGGNGGAGGLLFGTAGAGGNGGLS LGLGVAGGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGGAGGAGGLGGFRGAGGTGGAG GDGGNAGLFGDGGAGGAGGAGEDGTTPGGNGGAGGVAGLFGDGGNGGNAGVGTPAGNV GAGGTGGLLLGQDGMTGLT" CDS complement(4036419..4036673) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3678C" /product="Apoptosis inhibitor Rv3654c" /note="Mb3678c, -, len: 84 aa. Equivalent to Rv3654c, len: 84 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 84 aa overlap). Hypothetical protein, similar to C-terminus of Q9X916|SCH5.14c MEMBRANE SPANNING PROTEIN from Streptomyces coelicolor (230 aa) FASTA scores: opt: 176, E(): 2.4e-05, (47.0% identity in 83 aa overlap). Equivalent to AAK48118 from Mycobacterium tuberculosis strain CDC1551 but shorter 18 aa. Mb3678c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR021202" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5A3" /protein_id="SIU02306.1" /translation="MVARHRAQAAADLASLAAAARLPSGLAAACARATLVARAMRVEH AQCRVVDLDVVVTVEVAVAFAGVATATARAGPAKVPTTPG" CDS complement(4036759..4037058) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3679C" /product="Apoptosis inhibitor Rv3655c" /note="Mb3679c, -, len: 99 aa. Equivalent to Rv3655c, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 99 aa overlap). Hypothetical protein, with similarity to Q9X917|SCH5.15c HYPOTHETICAL 15.2 KDA PROTEIN from Streptomyces coelicolor (150 aa) FASTA scores: opt: 211, E(): 7.7e-07, (39.65% identity in 111 aa overlap). Equivalent to AAK48119 from Mycobacterium tuberculosis strain CDC1551 (99 aa) but longer 26 aa at the C-terminus. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base deletion (c-*) introducing a premature stop codon, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (99 aa versus 125 aa). Mb3679c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02307.1" /translation="MEAALAIATLVLVLVLCLAGVTAVSMQVRCIDAAREAARLAARG DVRSATDVARSIAPRAALVQVHRDGEFVVATVTAHSNLLPTLDIAARAISVAEPG" CDS complement(4037082..4037288) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3680C" /product="secretion" /note="Mb3680c, -, len: 68 aa. Equivalent to Rv3656c, len: 68 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Conserved hypothetical protein, similar to Q9X918|SCH5.16c SMALL HYPOTHETICAL PROTEIN from Streptomyces coelicolor (75 aa), FASTA scores: opt: 129, E(): 0.0039, (40.0% identity in 60 aa overlap). Equivalent to AAK48120 from Mycobacterium tuberculosis strain CDC1551 (42 aa) but longer 26 aa. Mb3680c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4V6" /db_xref="InterPro:IPR025338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V6" /protein_id="SIU02308.1" /translation="MLVITMFRVLVARMTALAVDESGMSTVEYAIGTIAAAAFGAILY TVVTGDSIVSALNRIIGRALSTKV" CDS complement(4037298..4037873) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3681C" /product="POSSIBLE CONSERVED ALANINE RICH MEMBRANE PROTEIN" /note="Mb3681c, -, len: 191 aa. Equivalent to Rv3657c, len: 191 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 191 aa overlap). Possible conserved membrane protein, rich in ala residues, similar to Q9X919|SCH5.17c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 324, E(): 4.7e-12, (40.9% identity in 154 aa overlap)." /db_xref="GOA:A0A1R3Y4W1" /db_xref="InterPro:IPR018076" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W1" /protein_id="SIU02309.1" /translation="MALWLGAGPSVVRARAGRPPRAHRPHQGLLLGRTDVADPLAVAA SLDVLAVCLAAGMAVSTAAAATAAVAPPRLARVLRRAADLLALGADPNIAWSRPPDLP PGTHDAQTDAVLRLARRSAASGAALADGIVELAVQVRHDAAQAAAAAAERAGVLIAGP LGLCFLPAFLCVGIVPLVVGLAGDVLQFGLV" CDS complement(4037897..4038697) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3682C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3682c, -, len: 266 aa. Equivalent to Rv3658c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 266 aa overlap). Probable conserved transmembrane protein, similar to Q9X920|SCH5.18c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (321 aa), FASTA scores: opt: 335, E(): 4.1e-13, (38.05% identity in 247 aa overlap). Mb3682c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4R2" /db_xref="InterPro:IPR018076" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R2" /protein_id="SIU02310.1" /translation="MSGIASAALILSLALVVLPGSPRCRLTPDDTGRRVLLVGARRVA WGVGCVAVGVAALLPLPTVVAVAVLGATLGLRYRRRRRYLRRSREGQALEAALELVVG ELRAGAHPVRAFSIAADETGGPVAVALRAVAARARLGADVTAGLLAAARSSALPAYWE RLAVCWQLGSDHGLAIASLMRAAQRDVAERQRFSARVSAGMAGARASAAILAILPLLG VLLGQLIGARPLSFLLTGRVGGWLLVVGLTLACAGLLWSDRITDRPVL" CDS complement(4038694..4039752) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3683C" /product="Flp pilus assembly protein, ATPase CpaF" /note="Mb3683c, -, len: 352 aa. Equivalent to Rv3659c, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Conserved hypothetical protein, highly similar, but always shorter (various lengths) at N-terminus, to Q9X921|SCH5.19c PUTATIVE SECRETORY PROTEIN from Streptomyces coelicolor (523 aa), FASTA scores: opt: 1287, E(): 5.3e-66, (59.85% identity in 351 aa overlap); Q9HW98|PA4302 PROBABLE TYPE II SECRETION SYSTEM PROTEIN from Pseudomonas aeruginosa (421 aa), FASTA scores: opt: 776, E(): 5.4e-37, (42.8% identity in 320 aa overlap); AAK65510|CPAF2 PROBABLE CPAF2 PILUS ASSEMBLY PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (497 aa) FASTA scores: opt: 769, E(): 1.5e-36, (40.45% identity in 309 aa overlap); Q9KY93|SCK15.11 PUTATIVE SECRETORY PROTEIN from Streptomyces coelicolor (445 aa), FASTA scores: opt: 751, E(): 1.5e-35, (38.15% identity in 333 aa overlap); etc. Contains PS00017 ATP/GTP binding site motif A (P-loop). Note that previously known as trbB." /db_xref="InterPro:IPR001482" /db_xref="InterPro:IPR022399" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4R4" /protein_id="SIU02311.1" /translation="MLGDTEVLANLRVLQTELTGAGILEPLLSADGTTDVLVTAPDSV WVDDGNGLRRSQIRFADESAVRRLAQRLALAAGRRLDDAQPWVDGQLTGIGVGGFAVR LHAVLPPVATQGTCLSLRVLRPATQDLAALAAAGAIDPAAAALVADIVTARLAFLVCG GTGAGKTTLLAAMLGAVSPDERIVCVEDAAELAPRHPHLVKLVARRANVEGIGEVTVR QLVRQALRMRPDRIVVGEVRGAEVVDLLAALNTGHEGGAGTVHANNPGEVPARMEALG ALGGLDRAALHSQLAAAVQVLLHVARDRAGRRRLAEIAVLRQAEGRVQAVTVWHADRG MSDDAAALHDLLRSRASA" CDS complement(4039854..4040906) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3684C" /product="Septum site-determining protein MinD @ possible CpaE" /note="Mb3684c, -, len: 350 aa. Equivalent to Rv3660c, len: 350 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 350 aa overlap). Conserved hypothetical protein, similar to O33612 PROTEIN CONCERNED IN INHIBITION OF MORPHOLOGICAL DIFFERENTIATION IN Streptomyces azureus from Streptomyces cyaneus (Streptomyces curacoi) (370 aa), FASTA scores: opt: 655, E(): 5.9e-31, (42.2% identity in 315 aa overlap); Q9X922|SCH5.20c PUTATIVE SEPTUM SITE DETERMINING PROTEIN from Streptomyces coelicolor (396 aa), FASTA scores: opt: 592, E(): 2.9e-27, (43.25% identity in 275 aa overlap). And shows some similarity to AAK65513|CPAE2 PROBABLE CPAE2 PILUS ASSEMBLY PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (586 aa) FASTA scores: opt: 212, E(): 5.1e-05, (25.75% identity in 295 aa overlap); and several cell division inhibitors or septum site-determining proteins. Equivalent to AAK48124 from Mycobacterium tuberculosis strain CDC1551 (261 aa) but longer 89 aa. Mb3684c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR022521" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4T2" /protein_id="SIU02312.1" /translation="MLTDPGLRDELDRVAAAVGVRVVHLGGRHLVSRKTWSAAAAVVL DHAAADRCGRLALPRRTHVSVLTGTEAATATWAAAITVGAQHVLRMPEQEGELVRELA EAAESARDDGICGAVVAVIGGRGGAGASLFAVALAQAAADALLVDLDPWAGGIDLLVG GETAPGLRWPDLALQGGRLNWSAVRAALPRPRGISVLSGTRRGYELDAGPVDAVIDAG RRGGVTVVCDLPRRLTDATQAALDAADLVVLVSPCDVRACAAAATMAPVLTAINPNLG LVVRGPSPGGLRAAEVADVAGVPLLASMRAQPRLAEQLEHGGLRLRRRSVLASAARRV LGVLPRAGSGRHGRAA" CDS 4041406..4042269 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3685" /product="Phosphoserine phosphatase (EC" /EC_number="3.1.3.3" /note="Mb3685, -, len: 287 aa. Equivalent to Rv3661, len: 287 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 287 aa overlap). Conserved hypothetical protein, highly similar to O33611|IMD_STRCN from Streptomyces cyaneus (Streptomyces curacoi) protein involved in inhibition of morphological differentiation in Streptomyces azureus (BELONGS TO THE SERB FAMILY) (277 aa) FASTA scores: opt: 1073, E(): 3.5e-61, (61.45% identity in 262 aa overlap); and Q9X923|SCH5.21 PUTATIVE MORPHOLOGICAL DIFFERENTIATION-ASSOCIATED PROTEIN from Streptomyces coelicolor (268 aa), FASTA scores: opt: 1057, E(): 3.6e-60, (61.45% identity in 262 aa overlap). Also similar to various bacterial proteins (principally serB-related proteins) e.g. Q49823|ML2424 HYPOTHETICAL SERB PROTEIN from Mycobacterium leprae (300 aa), FASTA scores: opt: 452, E(): 1.4e-21, (35.8% identity in 257 aa overlap); Q9WX12|SCE68.20 HYPOTHETICAL 32.0 KDA PROTEIN from Streptomyces coelicolor (298 aa), FASTA scores: opt: 415, E(): 3.1e-19, (33.55% identity in 280 aa overlap); Q9RIT2|SERB PHOSPHOSERINE PHOSPHATASE (FRAGMENT) from Streptomyces coelicolor (266 aa), FASTA scores: opt: 405, E(): 1.2e-18, (34.1% identity in 261 aa overlap); etc. Also similar to Q11169|Y505_MYCTU|Rv0505c|MTCY20G9.32c HYPOTHETICAL 39.5 KDA PROTEIN from Mycobacterium tuberculosis (373 aa), FASTA scores: opt: 454, E(): 1.2e-21, (35.15% identity in 276 aa overlap). BELONGS TO THE SERB FAMILY. Protein product from Mb3685 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3685 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4U1" /db_xref="InterPro:IPR006385" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U1" /protein_id="SIU02313.1" /translation="MTVSDSPAQRQTPPQTPGGTAPRARTAAFFDLDKTIIAKPSTLA FSKPFFAQGLLNRRAVLKSSYAQFIFLLSGADHDQMDRMRTHLTNMCAGWDVAQVRSI VNETLHDIVTPLVFAEAADLIAAHKLCGRDVVVVSASGEEIVGPIARALGATHAMATR MIVEDGKYTGEVAFYCYGEGKAQAIRELAASEGYPLEHCYAYSDSITDLPMLEAVGHA SVVNPDRGLRKEASVRGWPVLSFSRPVSLRDRIPAPSAAAIATTAAVGISALAAGAVT YALLRRFAFQP" CDS complement(4043024..4043794) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3686C" /product="Putative oxidoreductase" /note="Mb3686c, -, len: 256 aa. Equivalent to Rv3662c, len: 256 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 256 aa overlap). Conserved hypothetical protein, equivalent to Q9CB99|ML2289 HYPOTHETICAL PROTEIN from Mycobacterium leprae (256 aa) FASTA scores: opt: 1255, E(): 3.3e-69, (78.05% identity in 255 aa overlap). Also similar to Q9X924|SCH5.22c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (274 aa), FASTA scores: opt: 289, E(): 1.8e-10, (39.25% identity in 270 aa overlap). Protein product from Mb3686c detected using SWATH mass spectrometry." /db_xref="InterPro:IPR003812" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U6" /protein_id="SIU02314.1" /translation="MTVDPLAPLMELPGVAAASDRVRDALSRVHRHRTNLRGWPVAAA EASLRAARASSVLDGGPARLHDAGAPTSGKPALSDPVFAGALRVGQALEGGAGPVVGV WRRAPLQALARLHMLAAADQVDDDRLGRPRSDADVGPRLELLADVVTHPTLASAPVVA AVAHGELLTLRPFGCADGVVARAVSRLVTIATGLDPHGLGVPEVIWMRQPAEYHDAAR RFAGGTPDGVAGWLLLCCGAMLDGAREALSIAESLSPG" CDS complement(4043791..4045437) /codon_start=1 /transl_table=11 /gene="dppD" /locus_tag="BQ2027_MB3687C" /product="PROBABLE DIPEPTIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER DPPD" /note="Mb3687c, dppD, len: 548 aa. Equivalent to Rv3663c, len: 548 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 548 aa overlap). Probable dppD, dipeptide-transport ATP-binding protein ABC-transporter (see citation below), similar to many ATP-binding proteins e.g. AAK65441|SMA1434 PROBABLE ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (550 aa), FASTA scores: opt: 1528, E(): 1e-78, (46.25% identity in 545 aa overlap); O50270|MOAD MOAD PROTEIN from Agrobacterium radiobacter (588 aa), FASTA scores: opt: 1354, E(): 6.7e-69, (42.9% identity in 541 aa overlap); Q9KM01|VCA0588 PUTATIVE PEPTIDE ABC TRANSPORTER ATP-BINDING PROTEIN from Vibrio cholerae (530 aa), FASTA scores: opt: 951, E(): 3.1e-46, (44.0% identity in 534 aa overlap); BAB49448|MLR2279 ATP-BINDING PROTEIN OF PEPTIDE ABC TRANSPORTER from Rhizobium loti (Mesorhizobium loti) (604 aa), FASTA scores: opt: 949, E(): 4.4e-46, (41.55% identity in 544 aa overlap); etc. Contains 2 PS00211 ABC transporters family signature, and 2 PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3687c detected using SWATH mass spectrometry. Mb3687c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6D0" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR013563" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6D0" /protein_id="SIU02315.1" /translation="MSVPAAPLLSVEGLEVTFGTDAPAVCGVDLAVRSGQTVAVVGES GSGKSTTAAAILGLLPAGGRITAGRVVFDGRDITGADAKRLRSIRGREIGYVPQDPMT NLNPVWKVGFQVTEALRANTDGRAARRRAVELLAEAGLPDPAKQAGRYPHQLSGGMCQ RALIAIGLAGRPRLLIADEPTSALDVTVQRQVLDHLQGLTDELGTALLLITHDLALAA QRAEAVVVVRRGVVVESGAAQSILQSPQHEYTRRLVAAAPSLTARSRRPPESRSRATT QAGDILVVSELTKIYRESRGAPWRRVESRAVDGVSFRLPRASTLAIVGESGSGKSTLA RMVLGLLQPTSGTVVFDGTYDVGALARDQVLAFRRRVQPVFQNPYSSLDPMYSVFRAI EEPLRVHHVGDRRQRQRAVRELVDQVALPSSILGRRPRELSGGQRQRVAIARALALRP EVLVCDEAVSALDVLVQAQILDLLADLQADLGLTYLFISHDLAVIRQIADDVLVMRAG RVVEHASTEEVFSRPRHEYTRQLLQAIPGAPSAPRKVGNL" CDS complement(4045434..4046234) /codon_start=1 /transl_table=11 /gene="dppC" /locus_tag="BQ2027_MB3688C" /product="PROBABLE DIPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER DPPC" /note="Mb3688c, dppC, len: 266 aa. Equivalent to Rv3664c, len: 266 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 266 aa overlap). Probable dppC, dipeptide-transport integral membrane protein ABC-transporter (see citation below), similar to many peptide permeases e.g. Q9F351|SC9E12.04 PUTATIVE PEPTIDE TRANSPORT SYSTEM INTEGRAL MEMBRANE from Streptomyces coelicolor (305 aa), FASTA scores: opt: 901, E(): 1.1e-47, (51.15% identity in 262 aa overlap); Q9KFX1|APPC|BH0349 OLIGOPEPTIDE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (305 aa), FASTA scores: opt: 652, E(): 1.5e-32, (35.55% identity in 270 aa overlap); P94312|DPPC_BACFI DIPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus firmus (304 aa), FASTA scores: opt: 642, E(): 5.9e-32, (35.75% identity in 263 aa overlap); P24139|OPPC_BACSU|SPO0KC OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (305 aa), FASTA scores: opt: 637, E(): 1.2e-31, (37.4% identity in 262 aa overlap); P26904|DPPC_BACSU|DCIAC DIPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (320 aa), FASTA scores: opt: 621, E(): 1.2e-30, (39.9% identity in 263 aa overlap); etc. HAS SIMILARITY WITH INTEGRAL MEMBRANE COMPONENTS OF OTHER BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEMS. BELONGS TO THE OPPBC SUBFAMILY." /db_xref="GOA:A0A1R3Y5A6" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5A6" /protein_id="SIU02316.1" /translation="MIAAALILLILVVAAFPSLFTAADPTYADPSQSMLAPSAAHWFG TDLQGHDIYSRTVYGARASVTVGLGATLAVFVVGGALGALAGFYGSWIDAVVSRVTDV FLGLPLLLAAIVLMQVMHHRTVWTVIAILALFGWPQVARIARGAVLEVRASDYVLAAK ALGLNRFQILLRHALPNAVGPVIAVATVALGIFIVTEATLSYLGVGLPTSVVSWGGDI NVAQTRLRSGSPILFYPAGALAITVLAFMMMGDALRDALDPASRAWRA" CDS complement(4046290..4047216) /codon_start=1 /transl_table=11 /gene="dppB" /locus_tag="BQ2027_MB3689C" /product="PROBABLE DIPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER DPPB" /note="Mb3689c, dppB, len: 308 aa. Equivalent to Rv3665c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Probable dppB, dipeptide-transport integral membrane protein ABC-transporter (see citation below), similar to many peptide permeases e.g. Q9F352|SC9E12.03 PUTATIVE PEPTIDE TRANSPORT SYSTEM INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (307 aa), FASTA scores: opt: 1145, E(): 1.8e-61, (57.65% identity in 307 aa overlap); Q53191|Y4TP_RHISN PROBABLE PEPTIDE ABC TRANSPORTER PERMEASE PROTEIN Rhizobium sp. strain NGR234 (313 aa), FASTA scores: opt: 653, E(): 5.2e-32, (31.2% identity in 314 aa overlap); P24138|OPPB_BACSU OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE from Bacillus subtilis (311 aa), FASTA scores: opt: 643, E(): 2.1e-31, (33.45% identity in 305 aa overlap); etc. BELONGS TO THE OPPBC SUBFAMILY. Mb3689c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4U9" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U9" /protein_id="SIU02317.1" /translation="MGWYVARRVAVMVPVFLGATLLIYGMVFLLPGDPVAALAGDRPL TPAVAAQLRSHYHLDDPFLVQYLRYLGGILHGDLGRAYSGLPVSAVLAHAFPVTIRLA LIALAVEAVLGIGFGVIAGLRQGGIFDSAVLVTGLVIIAIPIFVLGFLAQFLFGVQLE IAPVTVGERASVGRLLLPGIVLGAMSFAYVVRLTRSAVAANAHADYVRTATAKGLSRP RVVTVHILRNSLIPVVTFLGADLGALMGGAIVTEGIFNIHGVGGVLYQAVTRQETPTV VSIVTVLVLIYLITNLLVDLLYAALDPRIRYG" CDS complement(4047218..4048843) /codon_start=1 /transl_table=11 /gene="dppA" /locus_tag="BQ2027_MB3690C" /product="PROBABLE PERIPLASMIC DIPEPTIDE-BINDING LIPOPROTEIN DPPA" /note="Mb3690c, dppA, len: 541 aa. Equivalent to Rv3666c, len: 541 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 541 aa overlap). Probable dppA, dipeptide-binding lipoprotein component of dipeptide transport system (see citation below), similar to many substrate-binding proteins e.g. Q9F353|SC9E12.02 PUTATIVE PEPTIDE TRANSPORT SYSTEM SECRETED PEPTIDE-BINDING PROTEIN from Streptomyces coelicolor (544 aa), FASTA scores: opt: 1200, E(): 9e-67, (39.2% identity in 538 aa overlap); P24141|OPPA_BACSU OLIGOPEPTIDE-BINDING PROTEIN from Bacillus subtilis (545 aa), FASTA scores: opt: 523, E(): 7.9e-25, (26.15% identity in 516 aa overlap); P23843|OPPA_ECOLI PERIPLASMIC OLIGOPEPTIDE-BINDING PROTEIN from Escherichia coli (543 aa), FASTA scores: opt: 452, E(): 2e-20, (25.9% identity in 529 aa overlap); etc. Contains probable N-terminal signal sequence. Protein product from Mb3690c detected using SWATH mass spectrometry. Mb3690c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X4" /db_xref="InterPro:IPR000914" /db_xref="InterPro:IPR030678" /db_xref="InterPro:IPR039424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X4" /protein_id="SIU02318.1" /translation="MVRRMRAALAALATGLLVLAPVAGCGGGVLSPDVVLVNGGEPPN PLIPTGTNDSNGGRIIDRLFAGLMSYDAVGKPSLEVAQSIESADNVNYRITVKPGWKF TDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGFDDVAGAPGDKSRTTMSGLRVV NDLEFTVRLKAPTIDFTLRLGHSSFYPLPDSAFRDMAAFGRNPIGNGPYKLADGPAGP AWEHNVRIDLVPNPDYHGNRKPRNKGLRFEFYANLDTAYADLLSGNLDVLDTIPPSAL TVYQRDLGDHATSGPAAINQTLDTPLRLPHFGGEEGRLRRLALSAAINRPQICQQIFA GTRSPARDFTARSLPGFDPNLPGNEVLDYDPQRARRLWAQADAISPWSGRYAIAYNAD AGHRDWVDAVANSIKNVLGIDAVAAPQPTFAGFRTQITNRAIDSAFRAGWQGDYPSMI GFLAPLFTAGAGSNDVGYINPEFDAALAAAEAAPTLTESHELVNDAQRILFHDMPVVP LWDYISVVGWSSQVSNVTVTWNGLPDYENIVKA" CDS 4049551..4051506 /codon_start=1 /transl_table=11 /gene="acs" /locus_tag="BQ2027_MB3691" /product="ACETYL-COENZYME A SYNTHETASE ACS (ACETATE--CoA LIGASE) (ACETYL-CoA SYNTHETASE) (ACETYL-CoA SYNTHASE) (ACYL-ACTIVATING ENZYME) (ACETATE THIOKINASE) (ACETYL-ACTIVATING ENZYME) (ACETATE--COENZYME A LIGASE) (ACETYL-COENZYME A SYNTHASE)" /note="Mb3691, acs, len: 651 aa. Equivalent to Rv3667, len: 651 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 651 aa overlap). Probable acs, acetyl-coenzyme-A synthetase (EC 6.2.1.1), similar to many e.g. Q9X928|SCH5.26 from Streptomyces coelicolor (651 aa) FASTA scores: opt: 2850, E(): 1.9e-164, (66.05% identity in 639 aa overlap); Q55404|ACSA_SYNY3|ACS|SLL0542 from Synechocystis sp. strain PCC 6803 (653 aa), FASTA scores: opt: 2342, E(): 8.8e-134, (55.15% identity in 649 aa overlap); P31638|ACSA_ALCEU|ACOE from Alcaligenes eutrophus (Ralstonia eutropha) (660 aa), FASTA scores: opt: 2181, E(): 4.6e-124, (52.05% identity in 665 aa overlap); P27550|ACSA_ECOLI|ACS|B4069 from Escherichia coli strain K12 (652 aa), FASTA scores: opt: 1625, E(): 0, (48.3% identity in 646 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Protein product from Mb3691 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3691 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59871" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR011904" /db_xref="InterPro:IPR020845" /db_xref="InterPro:IPR025110" /db_xref="InterPro:IPR032387" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:P59871" /protein_id="SIU02319.1" /translation="MSESTPEVSSSYPPPAHFAEHANARAELYREAEEDRLAFWAKQA NRLSWTTPFTEVLDWSEAPFAKWFVGGELNVAYNCVDRHVEAGHGDRVAIHWEGEPVG DRRTLTYSDLLAEVSKAANALTDLGLVAGDRVAIYLPLIPEAVIAMLACARLGIMHSV VFGGFTAAALQARIVDAQAKLLITADGQFRRGKPSPLKAAADEALAAIPDCSVEHVLV VRRTGIEMAWSEGRDLWWHHVVGSASPAHTPEPFDSEHPLFLLYTSGTTGKPKGIMHT SGGYLTQCCYTMRTIFDVKPDSDVFWCTADIGWVTGHTYGVYGPLCNGVTEVLYEGTP DTPDRHRHFQIIEKYGVTIYYTAPTLIRMFMKWGREIPDSHDLSSLRLLGSVGEPINP EAWRWYRDVIGGGRTPLVDTWWQTETGSAMISPLPGIAAAKPGSAMTPLPGISAKIVD DHGDPLPPHTEGAQHVTGYLVLDQPWPSMLRGIWGDPARYWHSYWSKFSDKGYYFAGD GARIDPDGAIWVLGRIDDVMNVSGHRISTAEVESALVAHSGVAEAAVVGVTDETTTQA ICAFVVLRANYAPHDRTAEELRTEVARVISPIARPRDVHVVPELPKTRSGKIMRRLLR DVAENRELGDTSTLLDPTVFDAIRAAK" CDS complement(4051542..4052240) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3692C" /product="possible protease" /note="Mb3692c, -, len: 232 aa. Equivalent to Rv3668c, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 232 aa overlap). Possible protease (EC 3.4.-.-) (and more specifically a putative alkaline serine protease (EC 3.4.21.-), equivalent to Q9CB98|ML2295 HYPOTHETICAL PROTEIN from Mycobacterium leprae (234 aa), FASTA scores: opt: 1249, E(): 7.4e-66, (77.5% identity in 231 aa overlap). Also similar at C-terminal end with many proteases e.g. O86984 ALKALINE SERINE PROTEASE PRECURSOR from Thermomonospora fusca (368 aa), FASTA scores: opt: 190, E(): 0.00056, (28.9% identity in 173 aa overlap); Q55353|SAPII ALKALINE SERINE PROTEASE II from Streptomyces sp (382 aa), FASTA scores: opt: 160, E(): 0.032, (27.15% identity in 199 aa overlap); O54109|SC10A5.18 PUTATIVE SECRETED PROTEASE from Streptomyces coelicolor (411 aa), FASTA scores: opt: 155, E(): 0.066, (26.4% identity in 163 aa overlap); Q54392|SAL|SCI11.35C SERINE PROTEASE SAL PRECURSOR (300 aa), FASTA scores: opt: 153, E(): 0.068, (28.1% identity in 185 aa overlap); P00778|PRLA_LYSEN|ALPHA-LP ALPHA-LYTIC PROTEASE PRECURSOR (397 aa), FASTA scores: opt: 154, E(): 0.074, (26.75% identity in 172 aa overlap); etc. Also similar with Q50618|YI15_MYCTU|Rv1815|MT1863|MTCY1A11.28c HYPOTHETICAL 22.8 KDA PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 134, E(): 0.69, (30.95% identity in 181 aa overlap). Protein product from Mb3692c detected using SWATH mass spectrometry. Mb3692c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4S1" /db_xref="InterPro:IPR009003" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S1" /protein_id="SIU02320.1" /translation="MQTAHRRFAAAFAAVLLAVVCLPANTAAADDKLPLGGGAGIVVN GDTMCTLTTIGHDKNGDLIGFTSAHCGGPGAQIAAEGAENAGPVGIMVAGNDGLDYAV IKFDPAKVTPVAVFNGFAINGIGPDPSFGQIACKQGRTTGNSCGVTWGPGESPGTLVM QVCGGPGDSGAPVTVDNLLVGMIHGAFSDNLPSCITKYIPLHTPAVVMSINADLADIN AKNRPGAGFVPVPA" CDS 4052586..4053104 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3693" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3693, -, len: 172 aa. Equivalent to Rv3669, len: 172 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 172 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CB97|ML2296 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 863, E(): 1.4e-47, (77.35% identity in 181 aa overlap). Also similar to two PUTATIVE INTEGRAL MEMBRANE TRANSPORT PROTEINS from Streptomyces coelicolor; Q9X930|SCH5.28 (162 aa) FASTA scores: opt: 265, E(): 6.3e-10, (37.4% identity in 155 aa overlap); and Q9X9W1|SCI7.29c (165 aa), FASTA scores: opt: 194, E(): 1.9e-05, (30.6% identity in 134 aa overlap). Contains two hydrophobic stretches in centre. Protein product from Mb3693 detected using shotgun mass spectrometry. Mb3693 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4S2" /db_xref="InterPro:IPR009937" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4S2" /protein_id="SIU02321.1" /translation="MSKIDRKNGVPSTLTTIPLADPHAGPAEPSIGDLIKDATTQMST LVRAEVELARAEITRDVKKGLTGSVFFISSLVVGFYSTFFFFFFVAELLDTWIWRWVA FLLVFAIMVVVTAVLALLGFLKVRRIRGPRQTIASVKETRTALTPGHDKTPVTPKPVT SDRATPVDPSGW" CDS 4053105..4054088 /codon_start=1 /transl_table=11 /gene="ephE" /locus_tag="BQ2027_MB3694" /product="POSSIBLE EPOXIDE HYDROLASE EPHE (EPOXIDE HYDRATASE) (ARENE-OXIDE HYDRATASE)" /note="Mb3694, ephE, len: 327 aa. Equivalent to Rv3670, len: 327 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 327 aa overlap). Possible ephE, epoxide hydrolase (EC 3.3.2.3) (see citation below), equivalent to Q9CB96|ML2297 PUTATIVE HYDROLASE from Mycobacterium leprae (324 aa), FASTA scores: opt: 1799, E(): 7.2e-105, (80.55% identity in 324 aa overlap). Also similar to many hydrolases (epoxide hydrolases) and hypothetical proteins e.g. Q9X931|SCH5.29 PUTATIVE HYDROLASE from Streptomyces coelicolor (324 aa), FASTA scores: opt: 687, E(): 1.4e-35, (40.65% identity in 327 aa overlap); Q9RRE3|DR2549 EPOXIDE HYDROLASE-RELATED PROTEIN from Deinococcus radiodurans (278 aa), FASTA scores: opt: 321, E(): 8.2e-13, (32.15% identity in 311 aa overlap); Q9K3Q1|2SCG4.13 PUTATIVE HYDROLASE from Streptomyces coelicolor (292 aa), FASTA scores: opt: 295, E(): 3.5e-11, (30.18% identity in 275 aa overlap); Q9S7P1 EPOXIDE HYDROLASE from Oryza sativa (Rice) (322 aa), FASTA scores: opt: 289, E(): 9.1e-11, (28.7% identity in 338 aa overlap); O23227|C7A10.830|AT4G36530 EPOXIDE HYDROLASE from Arabidopsis thaliana (Mouse-ear cress) (378 aa) FASTA scores: opt: 287, E(): 1.4e-10, (26.1% identity in 272 aa overlap); Q21147|K02F3.6 EPOXIDE HYDROLASE from Caenorhabditis elegans (386 aa), FASTA scores: opt: 283, E(): 2.5e-10, (33.35% identity in 156 aa overlap); etc. Also similar to P95276|EPHB|Rv1938|MTCY09F9.26c from Mycobacterium tuberculosis (356 aa), FASTA scores: opt: 296, E(): 3.6e-11, (29.7% identity in 340 aa overlap). Contains PS00213 Lipocalin signature. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. Protein product from Mb3694 detected using SWATH mass spectrometry. Mb3694 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4U4" /db_xref="InterPro:IPR000073" /db_xref="InterPro:IPR000639" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U4" /protein_id="SIU02322.1" /translation="MAAPDPSMTRIAGPWRHLDVHANGIRFHVVEAVPSGQPEGPDAA TPPMQPALARPLVILLHGFGSFWWSWRHQLCGLTGARVVAVDLRGYGGSDKPPRGYDG WTLAGDTAGLIRALGHPSATLVGHADGGLACWTTALLHSRLVRAIALISSPHPAALRR STLTRRDQRHALLPTLLRYQLPIWPERLLTRNNAAEIERLVRARGCAKWLASEDFSQA IDHLRQAIQIPAAAHCALEYQRWAVRSQLRSEGRRFIRAMTQQLGMPLLHLRGDADPY VLADPVERTQRYAPHGRYISIAGAGHFSHEEAPEEVNRHLMRFLEQVHQLS" CDS complement(4054081..4055274) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3695C" /product="membrane-associated serine protease" /note="Mb3695c, -, len: 397 aa. Equivalent to Rv3671c, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 397 aa overlap). Possible serine protease membrane protein (EC 3.4.21.-), equivalent to Q9CB95|ML2298 PUTATIVE MEMBRANE-ASSOCIATED SERINE PROTEASE from Mycobacterium leprae (401 aa), FASTA scores: opt: 2061, E(): 2.3e-108, (80.9% identity in 398 aa overlap). Also similar to many serine proteases, but generally with extended N-terminus, e.g. Q9X932|SCH5.30c PUTATIVE SERINE PROTEASE (FRAGMENT) from Streptomyces coelicolor (385 aa), FASTA scores: opt: 835, E(): 1.2e-39, (39.9% identity in 386 aa overlap); Q9Z6T0|DEGP_CHLPN|HTRA|CPN0979|CP0877 PROBABLE SERINE PROTEASE DO-LIKE PRECURSOR from Chlamydia pneumoniae (Chlamydophila pneumoniae) (488 aa), FASTA scores: opt: 285, E(): 1e-08, (29.05% identity in 296 aa overlap); P73354|HTRA|SLR1204 SERINE PROTEASE from Synechocystis sp. strain PCC 6803 (452 aa), FASTA scores: opt: 284, E(): 1.1e-08, (29.55% identity in 308 aa overlap); Q9RWC4|DR0745 PERIPLASMIC SERINE PROTEASE, HTRA/DEGQ/DEGS FAMILY from Deinococcus radiodurans (366 aa), FASTA scores: opt: 271, E(): 4.9e-08, (35.45% identity in 206 aa overlap); etc. Also similar, but longer 114 aa at the N-terminus, to Q9S2P8|SC5F7.13 PUTATIVE PEPTIDASE from Streptomyces coelicolor (282 aa), FASTA scores: opt: 594, E(): 3.1e-26, (38.95% identity in 285 aa overlap). And similar, but longer 146 aa at the N-terminus, to O07175|PEPA|Rv0125|MTCI418B.07 from Mycobacterium tuberculosis (355 aa), FASTA scores: opt: 295, E(): 2.2e-09, (29.55% identity in 254 aa overlap); and Q9CCY9|ML2659 PROBABLE SECRETED SERINE PROTEASE from Mycobacterium leprae FASTA scores: opt: 286, E(): 6.9e-09, (30.6% identity in 255 aa overlap). Contains PS00135 Serine proteases, trypsin family, serine active site. Protein product from Mb3695c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3695c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4V3" /db_xref="InterPro:IPR001940" /db_xref="InterPro:IPR003825" /db_xref="InterPro:IPR009003" /db_xref="InterPro:IPR033116" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V3" /protein_id="SIU02323.1" /translation="MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAG VLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAGVVLGRAVRGAIRNRPIRLIDSV IGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVL EGTGFVISPDRVMTNAHVVAGSNNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPP PLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDIYGDPEPVTRDVYTI RADVEQGDSGGPLIDLNGQVLGVVFGAAVDDAETGFVLTAGEVAGQLAKIGATQPVGT GACVS" CDS complement(4055280..4056101) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3696C" /product="NTP pyrophosphohydrolases including oxidative damage repair enzymes" /note="Mb3696c, -, len: 273 aa. Equivalent to Rv3672c, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Conserved hypothetical protein, equivalent to Q9CB94|ML2299 HYPOTHETICAL PROTEIN from Mycobacterium leprae (266 aa) FASTA scores: opt: 1358, E(): 5.2e-75, (76.4% identity in 267 aa overlap). Also similar to others (generally in C-terminal end) e.g. Q9XA45|SCH17.02c HYPOTHETICAL 26.5 KDA PROTEIN from Streptomyces coelicolor (247 aa) FASTA scores: opt: 524, E(): 1.3e-24, (42.65% identity in 251 aa overlap); Q9AB27|CC0407 MUTT/NUDIX FAMILY PROTEIN from Caulobacter crescentus (216 aa), FASTA scores: opt: 285, E(): 3.2e-10, (36.2% identity in 174 aa overlap); BAB49788|MLL2727|Q98HS8 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (204 aa), FASTA scores: opt: 278, E(): 8.1e-10, (31.45% identity in 151 aa overlap); P43337|YEAB_ECOLI|B1813 HYPOTHETICAL 21.4 KDA PROTEIN from Escherichia coli strain K12 (192 aa) FASTA scores: opt: 252, E(): 2.9e-08, (35.9% identity in 170 aa overlap); etc. Contains PS01293 Uncharacterized protein family UPF0036 signature, LLT. Protein product from Mb3696c detected using SWATH mass spectrometry. Mb3696c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4V2" /db_xref="InterPro:IPR000059" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V2" /protein_id="SIU02324.1" /translation="MSAGGTPLQAGATPTGSRGTVALRPDAGPSWLRPLVDNVGQIPD AYRRRLPADVLAMVTAAGAVSAMTSSRRDHREAAVLVLFSGPEAGPGDGGVPDDADLL LTVRASTLRHHAGQAAFPGGVVDPADDGPVATALREANEETGIDPSRLHPLATMERTF IAPSRFHVVPVLAYSPDPGPVAVVNEAETAIVARVPVRAFINPANRLMVYRRPHTRRW AGPAFLLNQMLVWGFTGQVISAVLDVAGWAQPWDTGDIRELDAAMVLIDDESDPR" CDS complement(4056233..4056916) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3697C" /product="POSSIBLE MEMBRANE-ANCHORED THIOREDOXIN-LIKE PROTEIN (THIOL-DISULFIDE INTERCHANGE RELATED PROTEIN)" /note="Mb3697c, -, len: 227 aa. Equivalent to Rv3673c, len: 227 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 227 aa overlap). Possible membrane protein, thioredoxin-like protein (thiol-disulfide interchange protein) (EC 1.-.-.-), equivalent to Q9CB93|ML2300 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (215 aa), FASTA scores: opt: 978, E(): 2.5e-52, (71.15% identity in 215 aa overlap). Some similarity with thioredoxin-related proteins e.g. P35160|RESA_BACSU RESA PROTEIN from Bacillus subtilis (181 aa), FASTA scores: opt: 212, E(): 5.7e-06, (30.55% identity in 108 aa overlap); Q9RXW6|DR0189 THIOL:DISULFIDE INTERCHANGE PROTEIN from Deinococcus radiodurans (185 aa) FASTA scores: opt: 206, E(): 1.3e-05, (33.8% identity in 139 aa overlap); Q9I505|PA0953 PROBABLE THIOREDOXIN from Pseudomonas aeruginosa (154 aa), FASTA scores: opt: 180, E(): 0.00044, (34.85% identity in 109 aa overlap); Q9KCP7|BH1522 THIOREDOXIN (THIOL:DISULFIDE INTERCHANGE PROTEIN) from Bacillus halodurans (177 aa), FASTA scores: opt: 178, E(): 0.00064, (31.75% identity in 107 aa overlap); P43221|TLPA_BRAJA THIOL:DISULFIDE INTERCHANGE PROTEIN (CYTOCHROME C BIOGENESIS PROTEIN) from Bradyrhizobium japonicum (221 aa), FASTA scores: opt: 189, E(): 0.00017, (26.85% identity in 227 aa overlap); etc. Also similar to O06392|Rv0526|MTCY25D10.05 HYPOTHETICAL 23.2 KDA PROTEIN from Mycobacterium tuberculosis (216 aa) FASTA scores: opt: 160, E(): 0.0093, (27.45% identity in 142 aa overlap). Contains PS00194 Thioredoxin family active site. POSSIBLY BELONGS TO THE THIOREDOXIN FAMILY. Protein product from Mb3697c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3697c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6F2" /db_xref="InterPro:IPR013740" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR017937" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6F2" /protein_id="SIU02325.1" /translation="MPSLPTTPAETAMTTLTGKTRWTIAILAVVAALMAALVAQLHDY SASSTISQRPAPREHRDGDTPEALAWSRQRANLPPCPAAGNGPGAAALRGVVVVCAGD GSAVDVARALAGRRVVINLWAHWCAPCMTELPVMAEYQRRVGPAVLVVTVHQGQNEAA ALSRLADLGVRLPTLQDDRRRVAAALRVANVMPATVVLRPDGSVAQTLPRAFGSADEI VAAVGNDAG" CDS complement(4056916..4057653) /codon_start=1 /transl_table=11 /gene="nth" /locus_tag="BQ2027_MB3698C" /product="PROBABLE ENDONUCLEASE III NTH (DNA-(APURINIC OR APYRIMIDINIC SITE)LYASE) (AP LYASE) (AP ENDONUCLEASE CLASS I) (ENDODEOXYRIBONUCLEASE (APURINIC OR APYRIMIDINIC)) (DEOXYRIBONUCLEASE (APURINIC OR APYRIMIDINIC))" /note="Mb3698c, nth, len: 245 aa. Equivalent to Rv3674c, len: 245 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 245 aa overlap). Probable nth, endonuclease III (EC 4.2.99.18), equivalent to Q9CB92|NTH|ML2301 PUTATIVE ENDONUCLEASE III from Mycobacterium leprae (272 aa), FASTA scores: opt: 1363, E(): 3.6e-81, (89.4% identity in 226 aa overlap). Also similar to many e.g. Q9XA44|SCH17.03c from Streptomyces coelicolor (250 aa), FASTA scores: opt: 937, E(): 2.2e-55, (61.65% identity in 219 aa overlap); P46303|UVEN_MICLU from Micrococcus luteus (Micrococcus lysodeikticus) (279 aa), FASTA scores: opt: 899, E(): 8.1e-53, (58.45% identity in 248 aa overlap); P73715|END3_SYNY3|NTH|SLR1822 from Synechocystis sp. strain PCC 6803 (219 aa), FASTA scores: opt: 684, E(): 1.7e-38, (52.2% identity in 203 aa overlap); P39788|END3_BACSU|NTH|JOOB from Bacillus subtilis (219 aa), FASTA scores: opt: 552, E(): 1.2e-29, (43.3% identity in 194 aa overlap); etc. Equivalent to AAK48142 from Mycobacterium tuberculosis strain CDC1551 (262 aa) but shorter 17 aa. Contains PS00764 Endonuclease III iron-sulfur binding region signature, and PS01155 Endonuclease III family signature. BELONGS TO THE NTH/MUTY FAMILY. COFACTOR: BINDS A 4FE-4S CLUSTER WHICH IS NOT IMPORTANT FOR THE CATALYTIC ACTIVITY, BUT WHICH IS PROBABLY INVOLVED IN THE PROPER POSITIONING OF THE ENZYME ALONG THE DNA STRAND (BY SIMILARITY). N-terminus extended since first submission (previously 226 aa). Protein product from Mb3698c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3698c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P63541" /db_xref="InterPro:IPR000445" /db_xref="InterPro:IPR003265" /db_xref="InterPro:IPR003651" /db_xref="InterPro:IPR004035" /db_xref="InterPro:IPR004036" /db_xref="InterPro:IPR005759" /db_xref="InterPro:IPR011257" /db_xref="InterPro:IPR023170" /db_xref="UniProtKB/Swiss-Prot:P63541" /protein_id="SIU02326.1" /translation="MPGRWSAETRLALVRRARRMNRALAQAFPHVYCELDFTTPLELA VATILSAQSTDKRVNLTTPALFARYRTARDYAQADRTELESLIRPTGFYRNKAASLIG LGQALVERFGGEVPATMDKLVTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWR WTTAEDPVKVEQAVGELIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFG LGPTEPLLAAPLVQGPETDHLLALAGL" CDS 4057761..4058138 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3699" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb3699, -, len: 125 aa. Equivalent to Rv3675, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). Possible membrane protein, with some similarity to Q9YCZ2|APE1120 HYPOTHETICAL 11.7 KDA PROTEIN from Aeropyrum pernix (103 aa), FASTA scores: opt: 100, E(): 9, (40.0% identity in 55 aa overlap). Mb3699 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V8" /protein_id="SIU02327.1" /translation="MFTLLVSWLLVACVPGLLMLATLGLGRLERFLARDTVTATDVAE FLEQAEAVDVHTLARNGMPEALDYLHRRQARRITDSPPLGSGAGPRYAGPLFVTDLDS PVEPPRHGQPNPQFRTARHANHV" CDS 4058237..4058911 /codon_start=1 /transl_table=11 /gene="crp" /locus_tag="BQ2027_MB3700" /product="transcriptional regulatory protein crp (crp/fnr-family)" /note="Mb3700, -, len: 224 aa. Equivalent to Rv3676, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Probable transcriptional regulator belonging to crp/fnr family, identical to Q9CB91|ML2302 PUTATIVE CRP/FNR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (224 aa), FASTA scores: opt: 1408, E(): 8.8e-81, (95.95% identity in 224 aa overlap). Also highly similar to transcriptional regulators AAK58838 from Corynebacterium glutamicum (Brevibacterium flavum) (227 aa), FASTA scores: opt: 1178, E(): 1.9e-66, (79.9% identity in 224 aa overlap); and Q9XA42|SCH17.05 from Streptomyces coelicolor (224 aa), FASTA scores: opt: 869, E(): 3.4e-47, (54.45% identity in 224 aa overlap); and similar to others e.g. Q9RRX0|DR2362 from Deinococcus radiodurans (231 aa) FASTA scores: opt: 344, E(): 1.8e-14, (30.8% identity in 211 aa overlap); P29281|CRP_HAEIN from Haemophilus influenzae (224 aa), FASTA scores: opt: 330, E(): 1.3e-13, (32.25% identity in 189 aa overlap); P03020|CRP_ECOLI|CAP|CSM|B3357 from Escherichia coli strain K12 and Shigella flexneri (210 aa), FASTA scores: opt: 323, E(): 3.5e-13, (32.25% identity in 189 aa overlap); etc. Contains helix-turn-helix motif at aa 175-196 (Score 1990, +5.96 SD). BELONGS TO THE CRP/FNR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3700 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3700 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Y5" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR012318" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR018490" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02328.1" /translation="MDEILARAGIFQGVEPSAIAALTKQLQPVDFPRGHTVFAEGEPG DRLYIIISGKVKIGRRAPDGRENLLTIMGPSDMFGELSIFDPGPRTSSATTITEVRAV SMDRDALRSWIADRPEISEQLLRVLARRLRRTNNNLADLIFTDVPGRVAKQLLQLAQR FGTQEGGALRVTHDLTQEEIAQLVGASRETVNKALADFAHRGWIRLEGKSVLISDSER LARRAR" CDS complement(4059017..4059811) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3701C" /product="possible hydrolase" /note="Mb3701c, -, len: 264 aa. Equivalent to Rv3677c, len: 264 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 264 aa overlap). Possible hydrolase (EC 3.-.-.-), equivalent to Q9CB90|ML2303 PUTATIVE HYDROLASE from Mycobacterium leprae (262 aa) FASTA scores: opt: 1400, E(): 8.5e-81, (82.05% identity in 262 aa overlap). Also similar to other hydrolases and hypothetical proteins e.g. Q9XA41|SCH17.06c PUTATIVE HYDROLASE from Streptomyces coelicolor (256 aa) FASTA scores: opt: 609, E(): 3.9e-31, (54.65% identity in 247 aa overlap); Q9A9Q1|CC0923 METALLO-BETA-LACTAMASE FAMILY PROTEIN from Caulobacter crescentus (297 aa), FASTA scores: opt: 306, E(): 4.7e-12, (35.45% identity in 268 aa overlap); Q9Y392 CGI-83 PROTEIN from Homo sapiens (Human) (288 aa), FASTA scores: opt: 281, E(): 1.7e-10, (33.2% identity in 259 aa overlap); Q9F7R6 PREDICTED METALLOBETA LACTAMASE FOLD PROTEIN from uncultured proteobacterium EBAC31A08 (265 aa), FASTA scores: opt: 257, E(): 5.1e-09, (32.55% identity in 252 aa overlap); Q9PBI4|XF2160 HYDROXYACYLGLUTATHIONE HYDROLASE from Xylella fastidiosa (258 aa), FASTA scores: opt: 232, E(): 1.9e-07, (30.3% identity in 165 aa overlap); etc. Protein product from Mb3701c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3701c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X3" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036866" /db_xref="InterPro:IPR041516" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X3" /protein_id="SIU02329.1" /translation="MSKTAESLTHPAYGQLRAVTDTASVLLADNPGLLTLDGTNTWVL RGPLSDELVVVDPGPDDDEHLARVAALGRIALVLISHRHGDHTSGIDKLVALTGAPVR AADPQFLRRDGETLTDGEVIDVAGLTITVLATPGHTADSLSFVLDDAVLTADTVLGCG TTVIDKEDGSLADYLESLHRLRGLGRRTVLPGHGPDLLDLEAIASGYLLHRHERLEQI RAALRDLGDDATVREVVEHVYLDVDEKLWNAAEWSVQAQLDYLRTR" CDS complement(4059818..4060273) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3702C" /product="RidA/YER057c/UK114 superfamily, group 1" /note="Mb3702c, -, len: 151 aa. Equivalent to Rv3678c, len: 151 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 151 aa overlap). Conserved hypothetical protein, equivalent, but shorter 23 aa, to Q9CB89|ML2304 HYPOTHETICAL PROTEIN from Mycobacterium leprae (174 aa), FASTA scores: opt: 746, E(): 2.1e-40, (78.15% identity in 151 aa overlap). Also highly similar to many hypothetical proteins or transcription regulators e.g. Q9XA38|SCH17.09c from Streptomyces coelicolor (155 aa), FASTA scores: opt: 637, E(): 1.5e-33, (69.1% identity in 152 aa overlap); BAB48205|MLR0658 from Rhizobium loti (Mesorhizobium loti) (154 aa), FASTA scores: opt: 500, E(): 6.8e-25, (55.35% identity in 150 aa overlap); BAB50615|MLR3802 TRANSCRIPTION REGULATOR from Rhizobium loti (Mesorhizobium loti) (153 aa), FASTA scores: opt: 425,E(): 3.8e-20, (44.35% identity in 151 aa overlap); Q9U0W7|L7276.02 from Leishmania major (163 aa) FASTA scores: opt: 404, E(): 8.5e-19, (47.7% identity in 151 aa overlap); Q9UZA3|PAB0825 PUTATIVE TRANSLATION INITIATION INHIBITOR from Pyrococcus abyssi (127 aa), FASTA scores: opt: 108, E(): 3.7, (30.75% identity in 130 aa overlap); etc. Contains PS00044 Bacterial regulatory proteins, lysR family signature. Protein product from Mb3702c detected using shotgun mass spectrometry. Mb3702c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013813" /db_xref="InterPro:IPR035959" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4U3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02330.1" /translation="MSAKARLGQLGVTLPQVAAPLAAYVPAVRTGNLVYTAGQLPLEA GKLVRTGKLGADVNPEEGKTLARICALNALAAVDSLVDLDAVTRVVKVVGFVASAPGF HGQPSVINGASDLLAEVFGDSGAHARSAVGVSELPLDAPVEVELIVEVG" CDS complement(4060289..4060450) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3703C" /product="translation initiation" /note="Mb3703c, -, len: 53 aa. Equivalent to Rv3678A, len: 53 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 53 aa overlap). Conserved hypothetical protein, similar to SCH17.10|AL079353_10 conserved hypothetical protein from Streptomyces coelicolor (53 aa), FASTA scores: opt: 259, E(): 1.5e-13, (78.0% identity in 50 aa overlap). Protein product from Mb3703c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3703c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025234" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V1" /protein_id="SIU02331.1" /translation="MTQPTAWEYATVPLLTHATKQILDQWGADGWELVAVLPGPTGEQ HVAYLKRPK" CDS 4060535..4061557 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3704" /product="PROBABLE ANION TRANSPORTER ATPASE" /note="Mb3704, -, len: 340 aa. Equivalent to Rv3679, len: 340 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 340 aa overlap). Probable anion transporting ATPase (EC 3.6.1.-), equivalent to Q9CB88|ML2305 PROBABLE ANION TRANSPORTER PROTEIN from Mycobacterium leprae (341 aa), FASTA scores: opt: 1810, E(): 2.1e-98, (84.15% identity in 341 aa overlap). Also highly similar to Q9XA36|SCH17.11 PUTATIVE ION-TRANSPORTING ATPASE from Streptomyces coelicolor (325 aa), FASTA scores: opt: 989, E(): 1.4e-50, (52.15% identity in 328 aa overlap); and similar to many anion transporting ATPases (principally arsenite transporters) e.g. O50593|ARSA_ACIMU ARSENICAL PUMP-DRIVING ATPASE (ARSENITE-TRANSLOCATING ATPASE) from Acidiphilium multivorum (583 aa), FASTA scores: opt: 225, E(): 8.1e-06, (25.1% identity in 319 aa overlap); AAG43231|ARSA ARSENITE ACITVATED ATPASE from Salmonella typhimurium plasmid R46 FASTA scores: opt: 211, E(): 5.3e-05, (26.95% identity in 267 aa overlap); P52145|ARA2_ECOLI|ARSA ARSENICAL PUMP-DRIVING ATPASE from Escherichia coli plasmid IncN R46 (583 aa), FASTA scores: opt: 211, E(): 5.3e-05, (26.95% identity in 267 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). SOME SIMILARITY TO THE ARSA ATPASE FAMILY. Protein product from Mb3704 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3704 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65090" /db_xref="InterPro:IPR016300" /db_xref="InterPro:IPR025723" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P65090" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02332.1" /translation="MVATTSSGGSSVGWPSRLSGVRLHLVTGKGGTGKSTIAAALALT LAAGGRKVLLVEVEGRQGIAQLFDVPPLPYQELKIATAERGGQVNALAIDIEAAFLEY LDMFYNLGIAGRAMRRIGAVEFATTIAPGLRDVLLTGKIKETVVRLDKNKLPVYDAIV VDAPPTGRIARFLDVTKAVSDLAKGGPVHAQSEGVVKLLHSNQTAIHLVTLLEALPVQ ETLEAIEELAQMELPIGSVIVNRNIPAHLEPQDLAKAAEGEVDADSVRAGLLTAGVKL PDADFAGLLTETIQHATRITARAEIAQQLDALQVPRLELPTVSDGVDLGSLYELSESL AQQGVR" CDS 4061554..4062714 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3705" /product="PROBABLE ANION TRANSPORTER ATPASE" /note="Mb3705, -, len: 386 aa. Equivalent to Rv3680, len: 386 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 386 aa overlap). Probable anion transporting ATPase (EC 3.6.1.-), equivalent to Q9CB87|ML2306 PROBABLE ANION TRANSPORTER PROTEIN from Mycobacterium leprae (381 aa), FASTA scores: opt: 2131, E(): 6.5e-120, (88.1% identity in 370 aa overlap). Also highly similar, but shorter 29 aa, to Q9XA35|SCH17.12 PUTATIVE ION-TRANSPORTING ATPASE from Streptomyces coelicolor (481 aa), FASTA scores: opt: 1190, E(): 1.1e-63, (51.25% identity in 441 aa overlap); and similar to many anion transporting ATPases e.g. Q9UZA6|PAB1555 ANION TRANSPORTING ATPASE from Pyrococcus abyssi (330 aa) FASTA scores: opt: 242, E(): 3e-07, (24.6% identity in 297 aa overlap); Q9P7F8|SPAC1142.06 PUTATIVE ARSENITE-TRANSLOCATING from Schizosaccharomyces pombe (Fission yeast) (329 aa), FASTA scores: opt: 239, E(): 4.5e-07, (27.9% identity in 197 aa overlap); Q9HS79|ARSA1|VNG0365G ARSENICAL PUMP-DRIVING ATPASE from Halobacterium sp. strain NRC-1 (347 aa), FASTA scores: opt: 238, E(): 5.4e-07, (29.35% identity in 358 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3705 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3705 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4W6" /db_xref="InterPro:IPR016300" /db_xref="InterPro:IPR025723" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W6" /protein_id="SIU02333.1" /translation="MSVTPKTLDMGAILADTSNRVVVCCGAGGVGKTTTAAALALRAA EYGRTVVVLTIDPAKRLAQALGINDLGNTPQRVPLAPEVPGELHAMMLDMRRTFDEMV MQYSGPERAQSILDNQFYQTVATSLAGTQEYMAMEKLGQLLSQDRWDLIVVDTPPSRN ALDFLDAPKRLGSFMDSRLWRLLLAPGRGIGRLITGVMGLAMKALSTVLGSQMLADAA AFVQSLDATFGGFREKADRTYALLKRRGTQFVVVSAAEPDALREASFFVDRLSQESMP LAGLVFNRTHPMLCALPIERAIDAAETLDAETTDSDATSLAAAVLRIHAERGQTAKRE IRLLSRFTGANPTVPVVGVPSLPFDVSDLEALRALADQLTTVGNDAGRAAGR" CDS complement(4062957..4063313) /codon_start=1 /transl_table=11 /gene="whiB4" /locus_tag="BQ2027_MB3706C" /product="probable transcriptional regulatory protein whib-like whib4" /note="Mb3706c, whiB4, len: 118 aa. Equivalent to Rv3681c, len: 118 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 118 aa overlap). Probable whiB4, WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to ML2307 HYPOTHETICAL PROTEIN from Mycobacterium leprae (116 aa). Also highly similar to Q9S2B9|SCH17.13c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 392, E(): 1e-20, (67.95% identity in 78 aa overlap); Q9X951|WBLA HYPOTHETICAL 14.3 KDA PROTEIN from Streptomyces coelicolor (129 aa), FASTA scores: opt: 392, E(): 1.1e-20, (67.95% identity in 78 aa overlap); Q9ACZ0|SCP1.161c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (268 aa), FASTA scores: opt: 273, E(): 4.4e-12, (50.0% identity in 78 aa overlap); Q06387|WHIB-STV from Streptomyces griseocarneus (87 aa) FASTA scores: opt: 231, E(): 1.5e-09, (43.85% identity in 73 aa overlap); etc. Also similar to several putative regulator proteins from Mycobacterium tuberculosis e.g. MTCY7D11_7; MTCY78_13; MTCY10H4_23; MTCY1A6_6; and U00016_29 from M. leprae. N-terminus shortened since first submission. Mb3706c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4W3" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W3" /protein_id="SIU02334.1" /translation="MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDE LFVRGAAQRKAAVICRHCPVMQECAADALDNKVEFGVWGGMTERQRRALLKQHPEVVS WSDYLEKRKRRTGTAG" CDS 4063675..4066107 /codon_start=1 /transl_table=11 /gene="ponA2" /locus_tag="BQ2027_MB3707" /product="PROBABLE BIFUNCTIONAL MEMBRANE-ASSOCIATED PENICILLIN-BINDING PROTEIN 1A/1B PONA2 (MUREIN POLYMERASE) [INCLUDES: PENICILLIN-INSENSITIVE TRANSGLYCOSYLASE (PEPTIDOGLYCAN TGASE) + PENICILLIN-SENSITIVE TRANSPEPTIDASE (DD-TRANSPEPTIDASE)]" /note="Mb3707, ponA2, len: 810 aa. Equivalent to Rv3682, len: 810 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 810 aa overlap). Probable ponA2, penicillin-binding protein (class A), bienzymatic membrane-associated protein with transglycosylase (EC 2.4.2.-) and transpeptidase (EC 3.4.-.-) activities. Almost identical to Q9CB85|PON1|ML2308 PENICILLIN BINDING PROTEIN (CLASS A) from Mycobacterium leprae (803 aa) FASTA scores: opt: 4743, E(): 3.3e-217, (87.7% identity in 806 aa overlap); or P72351|PON1|PBP1 HIGH-MOLECULAR-MASS CLASS A PENICILLIN BINDING PROTEIN from Mycobacterium leprae Cosmid B577 (821 aa), FASTA scores: opt: 4547, E(): 6.3e-208, (88.05% identity in 769 aa overlap) (see first citation below). Also similar to others e.g. Q9XA34|SCH17.14 from Streptomyces coelicolor (428 aa; fragment), FASTA scores: opt: 727, E(): 2.3e-27, (36.55% identity in 413 aa overlap); Q9F9V7|PONA from Mycobacterium smegmatis (715 aa), FASTA scores: opt: 446, E(): 6.6e-14, (27.65% identity in 771 aa overlap) (see second citation below); Q9CCY4|PONA|ML2688 from Mycobacterium leprae (708 aa), FASTA scores: opt: 413, E(): 2.4e-12, (26.8% identity in 660 aa overlap); Q9X6W0|PONB|MRCB|PA4700 from Pseudomonas aeruginosa (774 aa), FASTA scores: opt: 398, E(): 1.3e-11, (27.2% identity in 666 aa overlap); P45345|PBPB_HAEIN|MRCB|PONB|HI1725 (781 aa), FASTA scores: opt: 380, E(): 9.4e-11, (28.6% identity in 601 aa overlap); etc. Also similar to P71707|PONA1|Rv0050|MTCY21.13 PROBABLE BIFUNCTIONAL PENICILLIN-BINDING PROTEIN 1A/1B (PBP1) from M. tuberculosis (678 aa) FASTA scores: opt: 372, E(): 2e-10, (28.35% identity in 769 aa overlap). SEEMS TO BELONG TO THE TRANSGLYCOSYLASE FAMILY IN THE N-TERMINAL SECTION, AND TO THE TRANSPEPTIDASE FAMILY IN THE C-TERMINAL SECTION. Protein product from Mb3707 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3707 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6G6" /db_xref="InterPro:IPR001264" /db_xref="InterPro:IPR001460" /db_xref="InterPro:IPR005543" /db_xref="InterPro:IPR012338" /db_xref="InterPro:IPR023346" /db_xref="InterPro:IPR036950" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6G6" /protein_id="SIU02335.1" /translation="MPERLPAAITVLKLAGCCLLASVVATALTFPFAGGLGLMSNRAS EVVANGSAQLLEGQVPAVSTMVDAKGNTIAWLYSQRRFEVPSDKIANTMKLAIVSIED KRFADHSGVDWKGTLTGLAGYASGDLDTRGGSTLEQQYVKNYQLLVTAQTDAEKRAAV ETTPARKLREIRMALTLDKTFTKSEILTRYLNLVSFGNNSFGVQDAAQTYFGINASDL NWQQAALLAGMVQSTSTLNPYTNPDGALARRNVVLDTMIENLPGEAEALRAAKAEPLG VLPQPNELPRGCIAAGDRAFFCDYVQEYLSRAGISKEQVATGGYLIRTTLDPEVQAPV KAAIDKYASPNLAGISSVMSVIKPGKDAHKVLAMASNRKYGLDLEAGETMRPQPFSLV GDGAGSIFKIFTTAAALDMGMGINAQLDVPPRFQAKGLGSGGAKGCPKETWCVVNAGN YRGSMNVTDALATSPNTAFAKLISQVGVGRAVDMAIKLGLRSYANPGTARDYNPDSNE SLADFVKRQNLGSFTLGPIELNALELSNVAATLASGGVWCPPNPIDQLIDRNGNEVAV TTETCDQVVPAGLANTLANAMSKDAVGSGTAAGSAGAAGWDLPMSGKTGTTEAHRSAG FVGFTNRYAAANYIYDDSSSPTDLCSGPLRHCGSGDLYGGNEPSRTWFAAMKPIANNF GEVQLPPTDPRYVDGAPGSRVPSVAGLDVDAARQRLKDAGFQVADQTNSVNSSAKYGE VVGTSPSGQTIPGSIVTIQISNGIPPAPPPPPLPEDGGPPPPVGSQVVEIPGLPPITI PLLAPPPPAPPP" CDS 4066176..4067135 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3708" /product="Predicted phosphohydrolases" /note="Mb3708, -, len: 319 aa. Equivalent to Rv3683, len: 319 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 319 aa overlap). Conserved hypothetical protein, equivalent to Q9CB84|ML2309 HYPOTHETICAL PROTEIN from Mycobacterium leprae (330 aa) FASTA scores: opt: 1791, E(): 9e-107, (85.45% identity in 296 aa overlap). Also similar to Q9X935|SCH66.03 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (309 aa) FASTA scores: opt: 610, E(): 1.4e-31, (51.45% identity in 307 aa overlap); and Q9RRY7|YN45_DEIRA|DR2345 HYPOTHETICAL PROTEIN from Deinococcus radiodurans (305 aa) FASTA scores: opt: 243, E(): 3.2e-08, (31.1% identity in 315 aa overlap) and some similarity to other hypothetical bacterial proteins e.g. Q9CF81|YQED from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (278 aa) FASTA scores: opt: 200, E(): 1.6e-05, (26.85% identity in 287 aa overlap). Protein product from Mb3708 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3708 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR024654" /db_xref="InterPro:IPR029052" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02336.1" /translation="MAAVLPTLIRTGAVALGSAIAGIGYAALVERNAFVLREVTMPVL TPGSTPLRVLHISDLHMLPNQHRKQAWLRELASWEPDLVVNTGDNLAHPKAVPAVVQT LSDLLSRPGVFVFGSNDYFGPRLKNPMNYLTSPDHRVRGAALPWQDLRAAFTERGWLD LTHTRREFEVAGLHIAAAGVDDPHIDRDRYDTIAGPASPAANLRLGLTHSPEPRVLDR FAADGYQLVLAGHTHGGQLCLPLYGALVTNCGLDRSRAKGASHWGANMRLHVSAGIGT SPFAPVRFCCRPEATLLTLIATPMGGRDSSSNLGRSQPTVSVR" CDS 4067198..4068238 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3709" /product="PROBABLE LYASE" /note="Mb3709, -, len: 346 aa. Equivalent to Rv3684, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Probable lyase (EC 4.-.-.-), and more specifically a cysteine synthase (EC 4.2.99.8), highly similar to many lyases e.g. Q9K3N2|SCG20A.08c PUTATIVE LYASE from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1469, E(): 3.7e-85, (63.35% identity in 341 aa overlap) (shorter 31 aa at N-terminus); Q9KT44|VC1061 CYSTEINE SYNTHASE (EC 4.2.99.8)/CYSTATHIONINE BETA-SYNTHASE FAMILY PROTEIN from Vibrio cholerae (355 aa), FASTA scores: opt: 1366, E(): 1.1e-78, (63.25% identity in 321 aa overlap); Q9I4R3|PA1061 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (365 aa), FASTA scores: opt: 1311, E(): 3.2e-75, (59.8% identity in 341 aa overlap); Q9PH18|XF0128 CYSTEINE SYNTHASE from Xylella fastidiosa (390 aa), FASTA scores: opt: 1288, E(): 9.5e-74, (58.55% identity in 333 aa overlap) (shorter 34 aa at N-terminus); P55708|Y4XP_RHISN PUTATIVE CYSTEINE SYNTHASE from Rhizobium sp. strain NGR234 plasmid sym pNGR234a (336 aa), FASTA scores: opt: 376, E(): 2.1e-16, (29.2% identity in 315 aa overlap); etc. Equivalent to AAK48153 from Mycobacterium tuberculosis strain CDC1551 (368 aa) but shorter 22 aa. Protein product from Mb3709 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3709 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4W2" /db_xref="InterPro:IPR001926" /db_xref="InterPro:IPR036052" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02337.1" /translation="MIEADARRSADTHLLRYPLPAAWCTDVDVELYLKDETTHITGSL KHRLARSLFLYALCNGWINENTTVVEASSGSTAVSEAYFAALLGLPFIAVMPAATSAS KIALIESQGGRCHFVQNSSQVYAEAERVAKETGGHYLDQFTNAERATDWRGNNNIAES IYVQMREEKHPTPEWIVVGAGTGGTSATIGRYIRYRRHATRLCVVDPENSAFFPAYSE GRYDIVMPTSSRIEGIGRPRVEPSFLPGVVDRMVAVPDAASIAAARHVSAVLGRRVGP STGTNLWGAFGLLAEMVKQGRSGSVVTLLADSGDRYADTYFSDEWVSAQGLDPAGPAA ALVEFERSCRWT" tRNA 4068300..4068373 /locus_tag="BQ2027_PROY" /product="tRNA-Pro" /note="proY, len: 74 nt. Equivalent to proY, len: 74 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 nt overlap). tRNA-Pro, anticodon cgg." CDS complement(4069054..4070484) /codon_start=1 /transl_table=11 /gene="cyp137" /locus_tag="BQ2027_MB3710C" /product="PROBABLE CYTOCHROME P450 137 CYP137" /note="Mb3710c, cyp137, len: 476 aa. Equivalent to Rv3685c, len: 476 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 476 aa overlap). Probable cyp137, cytochrome P-450 (EC 1.14.-.-), similar to many e.g. Q9VXY0|C4S3_DROME|CYP4S3|CG9081 from Drosophila melanogaster (Fruit fly) (495 aa), FASTA scores: opt: 376, E(): 1.2e-15, (28.35% identity in 413 aa overlap); Q59163|CYP110A2 from Anabaena variabilis (459 aa) FASTA scores: opt: 320, E(): 3.1e-12, (31.4% identity in 411 aa overlap); O23051|C883_ARATH from Arabidopsis thaliana (Mouse-ear cress) (490 aa), FASTA scores: opt: 313, E(): 8.8e-12, (28.25% identity in 425 aa overlap); etc. Also similar to many from Mycobacterium tuberculosis e.g. O53765|C13B_MYCTU|CYP135B1|Rv0568|MT0594|MTV039.06 (472 aa), FASTA scores: opt: 920, E(): 4.6e-49, (36.25% identity in 447 aa overlap); P96813|C138_MYCTU|CYP138|Rv0136|MT0144|MTCI5.10 (441 aa) FASTA scores: opt: 886, E(): 5.3e-47, (35.5% identity in 445 aa overlap); etc. BELONGS TO THE CYTOCHROME P450 FAMILY. Protein product from Mb3710c detected using SWATH mass spectrometry. Mb3710c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z7" /db_xref="InterPro:IPR001128" /db_xref="InterPro:IPR002401" /db_xref="InterPro:IPR017972" /db_xref="InterPro:IPR036396" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z7" /protein_id="SIU02338.1" /translation="MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGL PAPRGFRAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALAKE VFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPQHLRRRKLLTPPLHGAALDRYVPII ENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDDPEEVRRLGRPFERLLNLG VSEQLTVRYALRRLGALRVWPARARANTEIDDVVMALIAQRRADPRLGERHDVLSLLV SARGESGEQLSDSEIRDDLITLVLAGHETTATTLAWAFDLLLHHPDALRRVRAEAVGG GEAFTTAVINETLRVRPPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYE HPHEFRPERFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD EPERIVRRSIMLVPRRGTRVRFRPAR" CDS complement(4070510..4070842) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3711C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3711c, -, len: 110 aa. Equivalent to Rv3686c, len: 110 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap). Hypothetical protein, similar to P96893|Rv3288c|MTCY71.28c HYPOTHETICAL 15.2 KDA PROTEIN from Mycobacterium tuberculosis (and Mycobacterium bovis) (137 aa) FASTA scores: opt: 106, E(): 5.6, (29.1% identity in 79 aa overlap); and a few hypothetical proteins e.g. Q9GUV6|L2259.2 from Leishmania major (360 aa) FASTA scores: opt: 118, E(): 2.1, (28.7% identity in 101 aa overlap). Equivalent to AAK48155 from Mycobacterium tuberculosis strain CDC1551 (166 aa) but shorter 56 aa. Mb3711c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4Y3" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y3" /protein_id="SIU02339.1" /translation="MVYTGSDAGDHASAPQPSGSGSVPASVNVPGLVVAAVWAVGLVA GLVALTIGHLAVAAAALVVAVMAPWCRVAYIAHGQHRVCGETLRGTPAGETASFPTGW RGLRFSTR" CDS complement(4071082..4071327) /codon_start=1 /transl_table=11 /gene="rsfb" /locus_tag="BQ2027_MB3712C" /product="anti-anti-sigma factor rsfb (anti-sigma factor antagonist) (regulator of sigma f b)" /note="Mb3712c, -, len: 81 aa. Equivalent to Rv3687c, len: 122 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 81 aa overlap). Hypothetical protein, showing some similarity to sporulation proteins and sigma-factor genes e.g. Q9WVX8|RSBV_STRCO|BLDG|SCH5.12c ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa) FASTA scores: opt: 163, E(): 0.0007, (31.15% identity in 106 aa overlap); Q9F3A2|SC5F1.27c PUTATIVE ANTI-SIGMA FACTOR ANTAGONIST from Streptomyces coelicolor (114 aa) FASTA scores: opt: 159, E(): 0.0013, (29.8% identity in 104 aa overlap); P73609|SLR1859 HYPOTHETICAL 12.0 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (108 aa) FASTA scores: opt: 152, E(): 0.0034, (32.2% identity in 90 aa overlap); L47358|BACSPOI_1 spoIIA A from Paenibacillus polymyxa (117 aa), FASTA scores: opt: 107, E(): 0.23, (24.8% identity in 113 aa overlap); SQSIGB_4 rsbU, rsbV, rsbW & sigB genes from Steptomyces aureus (108 aa) (28.3% identity in 60 aa overlap); etc. Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. MTCY180_14 and MTCY441 _8. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, truncation at the 5' start due to a single base transversion (g-t) leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (81 aa versus 122 aa). Protein product from Mb3712c detected using SWATH mass spectrometry. Mb3712c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002645" /db_xref="InterPro:IPR036513" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V0" /protein_id="SIU02340.1" /translation="MADNPTALVIDLSAVEFLGSVGLKILAATSEKIGQSVKFGVVAR GSVTRRPIHLMGLDKTFRLFSTLHDALTGVRGGRIDR" CDS complement(4071652..4072116) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3713C" /product="GatB/Yqey domain-containing protein" /note="Mb3713c, -, len: 154 aa. Equivalent to Rv3688c, len: 154 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 154 aa overlap). Hypothetical protein, similar to other bacterial hypothetical proteins e.g. Q9X934|SCH66.02c from Streptomyces coelicolor (154 aa), FASTA scores: opt: 425, E(): 3.4e-19, (46.1% identity in 154 aa overlap); Q9WZF4|TM0690 from Thermotoga maritima (149 aa), FASTA scores: opt: 326, E(): 3.4e-13, (40.4% identity in 151 aa overlap); Q9PHU3|CJ0573 from Campylobacter jejuni (147 aa), FASTA scores: opt:290, E(): 5.1e-11, (36.4% identity in 151 aa overlap); etc. Also some similarity to upstream O69654|Rv3686c|MTV025.034c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis. Protein product from Mb3713c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3713c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4W4" /db_xref="InterPro:IPR003789" /db_xref="InterPro:IPR019004" /db_xref="InterPro:IPR023168" /db_xref="InterPro:IPR042184" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W4" /protein_id="SIU02341.1" /translation="MAELKSQLRSDLTQAMKTQDKLRTATIRMLLAAIQTEEVSGKQA RELSDDEVIKVLARESRKRGEAAEIYTQNGRGELAATEHAEARIIDEYLPTPLTEGEL ADVADTAIAEVAEELGHRPSMKQMGLVMKAATVIAAGKADGARLSAAVKERL" CDS 4072116..4073471 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3714" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3714, -, len: 451 aa. Equivalent to Rv3689, len: 451 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 451 aa overlap). Probable conserved transmembrane protein, with Proline rich N-terminus, similar to Q9KYW6|SCE33.17 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (462 aa) FASTA scores: opt: 730, E(): 2.7e-21, (38.1% identity in 412 aa overlap). Mb3714 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4X6" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X6" /protein_id="SIU02342.1" /translation="MHKRYAPQRPKPDTETYIEKCTDRRQDGGHDERRQLLRPVSMLP PGYPVEPPPVAPGYAPAGYPPYPATPPGYGPPGYGAPPSYGPPPGYGPPLGYPAAPPG CGPPPGYGPPLGYGPPLAPGAVKPGIIPLRPLTLSDIFNGAVGYIRANPKATLGLTAM VVVTLQIISLVALFGPMTAFGDIVTGEPDELTGAVVGGWSASFGASLLVSWLAGVLLS GMLTVIVGRAVFGSPITVGEAWAKVRGRLLALFGLALLEAAGVVAVLGLAVVILSGVA AAANEAAAALLGFPLLLVVGVSLAYLYVVLLFAPVLIVLERLPIVEAITRSFALVRHG FWRVLGIRLLTVLVVGVVGNAIAAPFMIVGEIVTAVTASDGSVTMRLVGATLSAIGVT IGQIVTAPFSAGVVVLLYTDRRIRAEAFDLVLQTGLEAGPAGGPAPVESTDNLWLTRP F" CDS 4073498..4074151 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3715" /product="PROBABLE CONSERVED MEMBRANE PROTEIN" /note="Mb3715, -, len: 217 aa. Equivalent to Rv3690, len: 217 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 217 aa overlap). Probable conserved membrane protein, similar to Q9KYW5|SCE33.18 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (231 aa), FASTA scores: opt: 419, E(): 1.5e-19, (36.0% identity in 211 aa overlap). Equivalent to AAK48159 from Mycobacterium tuberculosis strain CDC1551 (233 aa) but shorter 16 aa. Protein product from Mb3715 detected using SWATH mass spectrometry. Mb3715 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X0" /db_xref="InterPro:IPR025403" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X0" /protein_id="SIU02343.1" /translation="MPSIDIDREAAHQAAQRELDKPIYPKDSLTKELTDWIDEQLYRI LEKGSSIPGGWFTITVLLILLMIAVTAAVQIARRTMRTNRGGDYQLFDAGQLTAAQHR STAESYAAEGNWAAAIRHRLQAVARELEETGMLNPAAGRTANELASDAGEVLPHLAGE LTQAATAFNDVTYGERPGTQGAYQMIADLDDHLRSRSPAVVSAVQHPAVFDSWAQVR" CDS 4074277..4075278 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3716" /product="putative secreted protein" /note="Mb3716, -, len: 333 aa. Equivalent to Rv3691, len: 333 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 333 aa overlap). Conserved hypothetical protein, similar to Q9KYW4|SCE33.19 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (387 aa) FASTA scores: opt: 481, E(): 6e-23, (36.6% identity in 358 aa overlap). Equivalent to AAK48160 from Mycobacterium tuberculosis strain CDC1551 (381 aa) but shorter 48 aa. Protein product from Mb3716 detected using SWATH mass spectrometry. Mb3716 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025646" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X1" /protein_id="SIU02344.1" /translation="MAPASTSSTGGHALATLLGNHGVEVVVADSIADVEAAARPDSLL LVAQTQYLVDNALLDRLAKAPGDLLLVAPTSRTRTALTPQLRIAAASPFNSQPNCTLR EANRAGSVQWGPSDTYQATGDLVLTSCYGGALVRFRAEGRTITVVGSSNFMTNGGLLP AGNAALAMNLAGNRPRLVWYAPDHIEGEMSSPSSLSDLIPENVHWTIWQLWLVVLLVA LWKGRRIGPLVAEELPVVIRASETVEGRGRLYRSRRARDRAADALRTATLQRLRPRLG VGAGAPAPAVVTTIAQRSKADPPFVAYHLFGPAPATDNDLLQLARALDDIERQVTHS" CDS 4075275..4076351 /codon_start=1 /transl_table=11 /gene="moxR2" /locus_tag="BQ2027_MB3717" /product="probable methanol dehydrogenase transcriptional regulatory protein moxr2" /note="Mb3717, moxR2, len: 358 aa. Equivalent to Rv3692, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 358 aa overlap). Probable moxR2, methanol dehydrogenase regulatory protein, highly similar (generally longer at N-terminus) to Q9KYW3|SCE33.20 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (329 aa), FASTA scores: opt: 1523, E(): 4.2e-74, (70.9% identity in 330 aa overlap); Q9Z538|SC9B2.21c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (332 aa) FASTA scores: opt: 1008, E(): 1.1e-46, (50.8% identity in 313 aa overlap); Q9UZ67|MOXR-3|PAB0848 METHANOL DEHYDROGENASE REGULATORY PROTEIN from Pyrococcus abyssi (314 aa), FASTA scores: opt: 989, E(): 1.1e-45, (50.65% identity in 302 aa overlap); Q9AAN1|CC0566 MOXR PROTEIN from Caulobacter crescentus (323 aa), FASTA scores: opt: 988, E(): 1.3e-45, (52.3% identity in 306 aa overlap); etc. Also similar to O53170|MTV007.26|MOXR|Rv1479 from Mycobacterium tuberculosis (377 aa); and O07392|AF002133_6|MOXR from Mycobacterium avium (309 aa). Also high similarity with several hypothetical bacterial proteins. Protein product from Mb3717 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3717 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6H5" /db_xref="InterPro:IPR011703" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041628" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6H5" /protein_id="SIU02345.1" /translation="MTQSASNPQAPPTQTPGAELPGYPPQAGGAPTAAPSGPHPHRAE AESARDALLALRAEVAKAVVGQDGVISGLVIALLCRGHVLLEGVPGVAKTLIVRAMSA ALQLEFKRVQFTPDLMPGDVTGSLVYDARTAEFVFRPGPVFTNLLLADEINRTPPKTQ AALLEAMEERQVSVEGEPKPLPNPFIVAATQNPIEYEGTYQLPEAQLDRFLLKLNVTL PARDSEIAILDRHAHGFDPRDLSAINPVAGPAELAAGREAVRHVLVANEVLGYIVDIV GATRSSPALQLGVSPRGATALLGTARSWAWLSGRDYVTPDDVKAMARPTLRHRVMLRP EAELEGATPDGVLDGILASVPVPR" CDS 4076485..4077807 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3718" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb3718, -, len: 440 aa. Equivalent to Rv3693, len: 440 aa (alternative start at 41910), from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 440 aa overlap). Possible conserved membrane protein, similar to Q9KYW2|SCE33.21 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (436 aa), FASTA scores: opt: 875, E(): 3.3e-46, (56.25% identity in 448 aa overlap); Q9AAN0|CC0567 HYPOTHETICAL PROTEIN from Caulobacter crescentus (437 aa), FASTA scores: opt: 355, E(): 2.3e-14, (30.9% identity in 450 aa overlap); P73233|SLR2013 HYPOTHETICAL 48.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (435 aa), FASTA scores: opt: 340, E(): 1.9e-13, (29.7% identity in 438 aa overlap); etc. Equivalent to AAK48162 from Mycobacterium tuberculosis strain CDC1551 (475 aa) but shorter 35 aa. Also similar to other hypothetical proteins from Mycobacterium tuberculosis; MTV014_7; MTV007_27; and MTCY71_36 M. Protein product from Mb3718 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3718 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002881" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C2" /protein_id="SIU02346.1" /translation="MILTGRTGLLALICVLPIALSPWPARAFVMLLVALAVAVTVDTL LAASTRKLRFTRSPYTSARLGQPVDASLLLCNGGRRRFRGQVRDAWPPSARAQPHTHD VDVAAGQRQQVHTALRPVRRGDQRAAMVTARSIGPLGLAGRQSSQSVPGLVRVLPPFL SRKHLPSRLAKLREIDGLLPTLIRGQGTEFDSLREYVVGDDVRSIDWRASARRADVMV RTWRPERDRRVVIVLDTGRMAAGRVGVDPTAADPAGWPRLDWSMDAALLLAALASRAG DHVDFLAHDRISRAGVFGASRSELLAQLVDAMAPLRPALIESDWHAMIATILRRTRRR SLVVLLTDLNATALDEGLLPVLPQLSARHHVLVAAVADPRVDQLAAGRSDAAAVYDAA AAERARNDRRAIASQLRRGGVDVIDAPPAEIAPGLADRYLAMKATGRL" CDS complement(4077881..4078873) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3719C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3719c, -, len: 330 aa. Equivalent to Rv3694c, len: 330 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 330 aa overlap). Possible conserved transmembrane protein, highly similar to Q9KZM4|SCE34.01c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (335 aa), FASTA scores: opt: 1113, E(): 2.5e-60, (51.5% identity in 334 aa overlap); and similar to Q9KEW6|BH0733 HYPOTHETICAL PROTEIN from Bacillus halodurans (355 aa), FASTA scores: opt: 381, E(): 6.1e-16, (24.15% identity in 331 aa overlap); Q9AAM9|CC0568 HYPOTHETICAL PROTEIN from Caulobacter crescentus (332 aa), FASTA scores: opt: 352, E(): 3.3e-14, (30.3% identity in 310 aa overlap); P74166|SLR1478 HYPOTHETICAL 35.4 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (317 aa), FASTA scores: opt: 330, E(): 6.8e-13, (25.65% identity in 308 aa overlap); etc. C-terminal end shows similarity to O29631|AF0624|AE001061_10 CONSERVED HYPOTHETICAL PROTEIN (putative nifU protein) from Archaeoglobus fulgidus (185 aa), FASTA scores: opt: 154, E(): 0.021, (29.0% identity in 131 aa overlap). Equivalent to AAK48163 from Mycobacterium tuberculosis strain CDC1551 (395 aa) but shorter 65 aa. Also some similarity to MTCY428_20 HYPOTHETICAL 43.7 KDA PROTEIN from Mycobacterium tuberculosis. Protein product from Mb3719c detected using SWATH mass spectrometry. Mb3719c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X7" /db_xref="InterPro:IPR002798" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X7" /protein_id="SIU02347.1" /translation="MDVDAFLLTNRGTWDRLDHLIKKRHSLSGAEIDELVELYQRVST HLSMLRSASSDQLMTGRLSSLVARARSAVTGAHAPLTRTFIRFWTVSFPVVAYRTWRW WLATAVAFFAVVVLIGFWVAGSHEVQSAIGTPTEIDELVSHDVQSYYSEHPAASFALQ VWVNNSWVATTCIAMSVVLGLPIPLVLFDNAANVGLIAGLMFQAGKGDFLLGLLLPHG LLELTAVFLAAAIGMRLGWSVISAGNRPRGQVLAEQGRGVVSVAVGLVGVFLVAGLIE AVVTPSPLPTFVRIAVGIIAEAVFLSYIGYFGRRAAQAGETGDMEDAPDVVPTG" CDS 4078965..4079897 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3720" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb3720, -, len: 310 aa. Equivalent to Rv3695, len: 310 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 310 aa overlap). Possible conserved membrane protein, equivalent, but longer 88 aa, to Q9CB83|ML2312 POSSIBLE MEMBRANE PROTEIN from Mycobacterium leprae (196 aa), FASTA scores: opt: 898, E(): 5.2e-36, (71.05% identity in 190 aa overlap). Also highly similar to Q9KZM3|SCE34.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (318 aa), FASTA scores: opt: 740,E(): 2.4e-28, (43.25% identity in 319 aa overlap); and similar to P72718|SLR0254 HYPOTHETICAL 30.4 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (266 aa), FASTA scores: opt: 287, E(): 6.1e-07, (29.6% identity in 260 aa overlap); Q9HW83|PA4318 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 250, E(): 3.5e-05, (32.0% identity in 203 aa overlap); Q9KEW5|BH0734 HYPOTHETICAL PROTEIN from Bacillus halodurans (266 aa), FASTA scores: opt: 168, E(): 0.0047, (25.95% identity in 231 aa overlap); etc. C-terminal end shows some similarity to proline-rich proteins e.g. Q62106 PROLINE-RICH SALIVARY PROTEIN (FRAGMENT) from Mus musculus (Mouse) (188 aa) (36.1% identity in 97 aa overlap). Equivalent to AAK48164 from Mycobacterium tuberculosis strain CDC1551 (269 aa) but longer 41 aa. Protein product from Mb3720 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3720 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y501" /db_xref="InterPro:IPR010432" /db_xref="UniProtKB/TrEMBL:A0A1R3Y501" /protein_id="SIU02348.1" /translation="MSEVVTGDAVVLDVQIAQLPVRAVSAVIDITIIFIGYILGLMLW ATALTQFDEALTTAFLIIFTVLALVGYPLVWETATRGRSVGKIVMGLRVVSDDGGPER FRQALFRALASVVEIWMLLGSPAVICSMLSPKAKRVGDVFAGTVVVSERGPRLGPPPV MPPSLAWWASSLQLSGLTAGQAEVARQFLVRAPQLDPALREQMAYRIAGDVVARIAPP PPPGVPPQLVLAAVLAERHRRELLRLRPTLPPAGQAPWAQMAPHRGWPPGLSGATPWS PQQPVIPWPEPDPPPQAAPWPQQAPDGPGFSPPG" CDS complement(4079961..4080758) /codon_start=1 /transl_table=11 /gene="glpKb" /locus_tag="BQ2027_MB3721C" /product="PROBABLE GLYCEROL KINASE GLPKB [SECOND PART] (ATP:GLYCEROL 3-PHOSPHOTRANSFERASE)(GLYCEROKINASE) (GK)" /note="Mb3721c, glpKb, len: 265 aa. Equivalent to 3' end of Rv3696c, len: 517 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 326 aa overlap). Probable glpK, glycerol kinase (EC 2.7.1.30), equivalent to Q9CB81|GLPK_MYCLE|ML2314 GLYCEROL KINASE from Mycobacterium leprae (508 aa), FASTA scores: opt: 3120, E(): 4.7e-189, (91.35% identity in 508 aa overlap). Also highly similar to others e.g. Q9RJM2|GLPK from Streptomyces coelicolor (507 aa), FASTA scores: opt: 2606, E(): 1.1e-156, (75.35% identity in 503 aa overlap); Q9ADA7|GLPK from Streptomyces coelicolor (512 aa) FASTA scores: opt: 2002, E(): 1.3e-118, (59.05% identity in 503 aa overlap); Q9X1E4|GLK2_THEMA|TM1430 from Thermotoga maritima (496 aa), FASTA scores: opt: 1838, E(): 2.7e-108, (54.8% identity in 498 aa overlap); P08859|GLPK_ECOLI|B3926 from Escherichia coli strain K12 (501 aa), FASTA scores: opt: 1740, E(): 4.1e-102, (52.3% identity in 499 aa overlap); etc. Contains PS00933 FGGY family of carbohydrate kinases signature 1, PS00070 Aldehyde dehydrogenases cysteine active site, PS00445 FGGY family of carbohydrate kinases signature 2. BELONGS TO THE FUCOKINASE / GLUCONOKINASE / GLYCEROKINASE / XYLULOKINASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, glpK exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-g), splits glpK into 2 parts, glpKa and glpKb and removes activity. Protein product from Mb3721c detected using SWATH mass spectrometry. Mb3721c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z5" /db_xref="InterPro:IPR018483" /db_xref="InterPro:IPR018485" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02349.1" /translation="MPITGVLADQHAAMVGQVCLAPGEAKNTYGTGNFLLLNTGETIV RSNNGLLTTVCYQFGNAKPVYALEGSIAVTGSAVQWLRDQLGIISGAAQSEALARQVP DNGGMYFVPAFSGLFAPYWRSDARGAIVGLSRFNTNAHLARATLEAICYQSRDVVDAM EADSGVRLQVLKVDGGITGNDLCMQIQADVLGVDVVRPVVAETTALGAAYAAGLAVGF WAAPSDLRANWREDKRWTPTWDDDERAAGYAGWRKAVQRTLDWVDVS" CDS complement(4080760..4081515) /codon_start=1 /transl_table=11 /gene="glpKa" /locus_tag="BQ2027_MB3722C" /product="PROBABLE GLYCEROL KINASE GLPKA [FIRST PART] (ATP:GLYCEROL 3-PHOSPHOTRANSFERASE)(GLYCEROKINASE) (GK)" /note="Mb3722c, glpKa, len: 251 aa. Equivalent to 5' end of Rv3696c, len: 517 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 191 aa overlap). Probable glpK, glycerol kinase (EC 2.7.1.30), equivalent to Q9CB81|GLPK_MYCLE|ML2314 GLYCEROL KINASE from Mycobacterium leprae (508 aa), FASTA scores: opt: 3120, E(): 4.7e-189, (91.35% identity in 508 aa overlap). Also highly similar to others e.g. Q9RJM2|GLPK from Streptomyces coelicolor (507 aa), FASTA scores: opt: 2606, E(): 1.1e-156, (75.35% identity in 503 aa overlap); Q9ADA7|GLPK from Streptomyces coelicolor (512 aa) FASTA scores: opt: 2002, E(): 1.3e-118, (59.05% identity in 503 aa overlap); Q9X1E4|GLK2_THEMA|TM1430 from Thermotoga maritima (496 aa), FASTA scores: opt: 1838, E(): 2.7e-108, (54.8% identity in 498 aa overlap); P08859|GLPK_ECOLI|B3926 from Escherichia coli strain K12 (501 aa), FASTA scores: opt: 1740, E(): 4.1e-102, (52.3% identity in 499 aa overlap); etc. Contains PS00933 FGGY family of carbohydrate kinases signature 1, PS00070 Aldehyde dehydrogenases cysteine active site, PS00445 FGGY family of carbohydrate kinases signature 2. BELONGS TO THE FUCOKINASE / GLUCONOKINASE / GLYCEROKINASE / XYLULOKINASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, glpK exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-g), splits glpK into 2 parts, glpKa and glpKb and removes activity. Mb3722c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4V7" /db_xref="InterPro:IPR018483" /db_xref="InterPro:IPR018484" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4V7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02350.1" /translation="MSDAILGEQLAESSDFIAAIDQGTTSTRCMIFDHHGAEVARHQL EHEQILPRAGWVEHNPVEIWERTASVLISVLNATNLSPKDIAALGITNQRETTLVWNR HTGRPYYNAIVWQDTRTDRIASALDRDGRGNLIRRKAGLPPATYFSGGKLQWILENVD GVRAAAENGDALFGTPDTWVLWNLTGGPRGGCACHRCNQRQPDHVDGSRDAGLGRRAV VVVFDTSGHAARDRIVGAVGALRCHAGDRACRR" CDS complement(4081565..4082002) /codon_start=1 /transl_table=11 /gene="vapc48" /locus_tag="BQ2027_MB3723C" /product="possible toxin vapc48. contains pin domain." /note="Mb3723c, -, len: 145 aa. Equivalent to Rv3697c, len: 145 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 145 aa overlap). Possible conserved membrane protein, similar to many proteins from Mycobacterium tuberculosis e.g. Q10800|YS72_MYCTU|Rv2872|MT2939|MTCY274.03 (147 aa) FASTA scores: opt: 223, E(): 7.3e-08, (32.6% identity in 141 aa overlap); O53501|Rv2103c|MTV020.03 (144 aa), FASTA scores: opt: 215, E(): 2.4e-07, (31.4% identity in 137 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 192, E(): 7.6e-06, (31.25% identity in 144 aa overlap); etc. Mb3723c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X2" /db_xref="InterPro:IPR002716" /db_xref="InterPro:IPR006226" /db_xref="InterPro:IPR022907" /db_xref="InterPro:IPR029060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X2" /protein_id="SIU02351.1" /translation="MSETFDVDVLVHATHRASPFHDKAKTLVERFLARPGLVYLLWPV ALGYLRVVTHPTLLGAPLAPEVAVENIEQFTSRPHVRQVGEANGFWPVYRRVADPVKP RGNLVPDAHLVALMRHHGIATIWSHDRDFRKFEGIRIRDPFSG" CDS complement(4081999..4082223) /codon_start=1 /transl_table=11 /gene="vapB48" /locus_tag="BQ2027_MB3723A" /product="Possible antitoxin VapB48" /note="Mb3723A, len: 74 aa. Equivalent to Rv3697A len: 74 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 74 aa overlap). Transferred from H37Rv annotation using Rapid Annotation Transfer Tool (Nucleic Acids Res. 2011 May; 39(9): e57). Possible vapB48, antitoxin,part of toxin-antitoxin (TA) operon with Rv3697c, see Arcus et al. 2005. Similar to others in M. tuberculosis e.g. Rv3321c, Rv0748,Mb3723A found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y9" /protein_id="SIU02352.1" /translation="MRTTIDLDDDILRALKRRQREERKTLGQLASELLAQALAAEPPP NVDIRWSTADLRPRVDLDDKDAVWAILDRG" CDS 4082253..4083782 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3724" /product="Glutamate-Cysteine ligase" /note="Mb3724, -, len: 509 aa. Equivalent to Rv3698, len: 509 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 509 aa overlap). Conserved hypothetical protein, highly similar to Q9AK89|SC10A9.15c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (505 aa), FASTA scores: opt: 1720, E(): 9e-103, (53.65% identity in 494 aa overlap). N-terminal end highly similar to CAC42136|SCBAC25F8.01 CONSERVED HYPOTHETICAL PROTEIN (FRAGMENT) from Streptomyces coelicolor (291 aa), FASTA scores: opt: 1078, E(): 8.7e-62, (52.6% identity in 291 aa overlap); and C-terminus highly similar to CAC44687|SCBAC17A6.42c (235 aa), FASTA scores: opt: 911, E(): 3.8e-51, (57.25% identity in 234 aa overlap)." /db_xref="GOA:A0A1R3Y4Y8" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR016602" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y8" /protein_id="SIU02353.1" /translation="MRTISPFLRCRHETCCISNVGEEVTRTTYSREHQREYRRKVRLC LDVFETMLAQTRFEADRPLTGIEIECNLVDADYQPAMSNRYVLDAIADPAYQTELGAY NIEFNVPPRPLPGRTCLELEDEVRASLNDAETKASCSGAHIVMIGILPTLMPEHLTDG WMSASARYAALNESIFKARGEDIPINIAGPEPLSCHAGSIAPESACTSVQLHLQLAPA DFPANWNAAQVLAGRQLALGANSPYFFGHQLWSETRIELFTQSTDARPEELKSRGVRP RVWFGERWITSVLDLFQENIRYFPTLLPEVSDEDPLAELSAGRIPHLSELRLHNGTVY RWNRPVYDVVDGRPHLRLENRVLPAGPTVVDMLANHAFYYGALRGLSEADPPLWTQMN FAAAQANFLAAARYGMDAQLDWPGLGEVTTRELVLGTLLPMAHEGLRRWGVDAEVRDR FLGVIGGRAQTGRNGARWQVATVAALQDGGLTRPAALAEMLRRYCEHMHSNEPVHTWD T" CDS 4083804..4084505 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3725" /product="SAM-dependent methyltransferase" /note="Mb3725, -, len: 233 aa. Equivalent to Rv3699, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 233 aa overlap). Conserved hypothetical protein, showing similarity with hypothetical proteins e.g. Q9P3V6|SPAC1348.04 (alias Q9P3E7|SPAC750.03c or Q9P7U5|SPAC977.03) from Schizosaccharomyces pombe (Fission yeast) (145 aa), FASTA scores: opt: 188, E(): 7.5e-05, (31.65% identity in 120 aa overlap); and Q9KB70|BH2058 from Bacillus halodurans (241 aa) FASTA scores: opt: 185, E(): 0.00018, (27.8% identity in 162 aa overlap); Q9XA90|SCF43A.25c PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (215 aa), FASTA scores: opt: 166, E(): 0.0025, (29.95% identity in 147 aa overlap); etc. Also highly similar to O06426|Rv0560c|MTCY25D10.39c HYPOTHETICAL 25.9 KDA PROTEIN from Mycobacterium tuberculosis (241 aa), FASTA scores: opt: 690, E(): 6.5e-36, (53.4% identity in 234 aa overlap); and similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. P71805|Rv1377c|MTCY02B12.11c (212 aa) FASTA scores: opt: 378, E(): 1.5e-16, (35.4% identity in 192 aa overlap); P71972|Rv2675c|MTCY441.44c (250 aa) FASTA scores: opt: 297, E(): 2e-11, (31.1% identity in 193 aa overlap); etc. Protein product from Mb3725 detected using shotgun mass spectrometry. Mb3725 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR041698" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02354.1" /translation="MTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRS DVLDAGCGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQADITEF AAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASYYVLVFAKGAFPAE LEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVTIPPQLAGAPVEFPPYDHDEKG RVKFPAYLLTAHKAG" CDS complement(4084508..4085680) /codon_start=1 /transl_table=11 /gene="egtE" /locus_tag="BQ2027_MB3726C" /product="Pyridoxal-phosphate-dependent protein EgtE (ergothioneine synthase)" /note="Mb3726c, -, len: 390 aa. Equivalent to Rv3700c, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 390 aa overlap). Conserved hypothetical protein; could be a transferase (EC 2.-.-.-) or a lyase (EC 4.-.-.-). Indeed, similar to various enzymes e.g. Q53824|CAC CAPREOMYCIN ACETYLTRANSFERASE from Streptomyces capreolus (359 aa), FASTA scores: opt: 338, E(): 1.1e-12, (33.35% identity in 363 aa overlap); Q9HXX3|CSD_PSEAE|PA3667 PROBABLE CYSTEINE DESULFURASE (EC 4.4.1.-) from Pseudomonas aeruginosa (401 aa) FASTA scores: opt: 260, E(): 4.8e-08, (30.2% identity in 404 aa overlap); Q9X815|SC6G10.30 PUTATIVE AMINOTRANSFERASE from Streptomyces coelicolor (460 aa), FASTA scores: opt: 243, E(): 5.4e-07, (29.15% identity in 374 aa overlap); Q9A761|CC1865 AMINOTRANSFERASE CLASS V from Caulobacter crescentus (379 aa), FASTA scores: opt: 234, E(): 1.6e-06, (27.95% identity in 383 aa overlap); O74351|NFS1_SCHPO|SPBC21D10.11c PROBABLE CYSTEINE DESULFURASE from Schizosaccharomyces pombe (Fission yeast) (498 aa), FASTA scores: opt: 232, E(): 2.5e-06, (29.1% identity in 285 aa overlap); Q9RME8|NIFS NIFS PROTEIN (CYSTEINE DESULFURASE, TRNA SPLICING PROTEIN) from Zymomonas mobilis (370 aa), FASTA scores: opt: 230, E(): 2.6e-06, (32.85% identity in 201 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. Protein product from Mb3726c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3726c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6I0" /db_xref="InterPro:IPR000192" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR027563" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6I0" /protein_id="SIU02355.1" /translation="MRRSGANSPAGDSLADRWRAARPPVAGLHLDSAACSRQSFAALD AAAQHARHEAEVGGYVAAEAAAAVLDAGRAAVAALSGLPDAEVVFTTGSLHALDLLLG SWPGENRTLACLPGEYGPNLAVMAAHGFDVRPLPTLQDGRVALDDAAFMLADDPPDLV HLTVVASHRGVAQPLAMVAQLCTELKLPLVVDAAQGLGHVDCAVGADVTYASSRKWIA GPRGVGVLAVRPELMERLRARLPAPDWMPPLTVAQQLGFGEANVAARVGFSVALGEHL ACGPQAIRARLAELGDIARTVLADVSGWRVVEAVDEPSAITTLAPIDGADPAAVRAWL LSQRRIVTTYAGVERAPLELPAPVLRISPHVDNTADDLDAFAEALVAATAATSGER" CDS complement(4085711..4086676) /codon_start=1 /transl_table=11 /gene="egtD" /locus_tag="BQ2027_MB3727C" /product="L-histidine N(alpha)-methyltransferase (EC" /EC_number="2.1.1.44" /note="Mb3727c, -, len: 321 aa. Equivalent to Rv3701c, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 321 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. Q9RCZ8|SCM1.46 from Streptomyces coelicolor (251 aa), FASTA scores: opt: 897, E(): 1.1e-50, (59.9% identity in 242 aa overlap); P73759|SLR0865 from Synechocystis sp. strain PCC 6803 (337 aa), FASTA scores: opt: 779, E(): 5.7e-43, (40.35% identity in 327 aa overlap); Q9GWA1|LM12.997 from Leishmania major (383 aa) FASTA scores: opt: 616, E(): 2.1e-32, (39.05% identity in 297 aa overlap); etc. Protein product from Mb3727c detected using SWATH mass spectrometry. Mb3727c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5C8" /db_xref="InterPro:IPR017804" /db_xref="InterPro:IPR019257" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR032888" /db_xref="InterPro:IPR035094" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C8" /protein_id="SIU02356.1" /translation="MRVSVANHLGEDAGHLALRRDVYSGLQKTPKSLPPKWFYDTVGS ELFDQITRLPEYYPTRAEAEILRARSAEVASACRADTLVELGSGTSEKTRMLLDALRH RGSLRRFVPFDVDASVLSATATAIQREYSGVEINAVCGDFEEHLTEIPRGGRRLFVFL GSTIGNLTPGPRAQFLTALAGVMRPGDSLLLGTDLVKDAARLVRAYDDPGGVTAQFNR NVLAVINRELEADFDVDAFQHVARWNSAEERIEMWLRADGRQRVRVGALDLTVDFDAG EEMLTEVSCKFRPQAVGAELAAAGLHRIRWWTDEAGDFGLSLAAK" CDS complement(4086673..4087374) /codon_start=1 /transl_table=11 /gene="egtC" /locus_tag="BQ2027_MB3728C" /product="Amidohydrolase EgtC (hercynylcysteine sulfoxide synthase)" /note="Mb3728c, -, len: 233 aa. Equivalent to Rv3702c, len: 233 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 233 aa overlap). Conserved hypothetical protein, highly similar to other hypothetical proteins Q9RCZ9|SCM1.45 from Streptomyces coelicolor (271 aa), FASTA scores: opt: 383, E(): 2.3e-17, (44.85% identity in 252 aa overlap); and P54004|Y199_SYNY3|SLR0199 from Synechocystis sp. strain PCC 6803 (304 aa), FASTA scores: opt: 292, E(): 1.7e-11, (30.05% identity in 263 aa overlap); and similar to others e.g. Q9KMU4|VCA0225 from Vibrio cholerae (254 aa), FASTA scores: opt: 260, E(): 1.6e-09, (29.8% identity in 245 aa overlap). Equivalent to AAK48172 from Mycobacterium tuberculosis strain CDC1551 (194 aa) but longer 39 aa. Protein product from Mb3728c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3728c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4X8" /db_xref="InterPro:IPR017808" /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR029055" /db_xref="InterPro:IPR032889" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4X8" /protein_id="SIU02357.1" /translation="MCRHLGWLGAQVAVSSLVLDPPQGLRVQSYAPRRQKHGLMNADG WGVGFFDGAIPRRWRSPAPLWGDTSFHSVAPALRSHCILAAVRSATVGMPIEVSATPP FTDGHWLLAHNGVVDRAVLPAGPAAESVCDSAILAATIFAHGLDALGDTIVKVGAADP NARLNILAANGSRLIATTWGDTLSILRRADGVVLASEPYDDDSGWGDVPDRHLVEVTQ KGVTLTALDRAKGPR" CDS complement(4087374..4088651) /codon_start=1 /transl_table=11 /gene="egtB" /locus_tag="BQ2027_MB3729C" /product="Iron(II)-dependent oxidoreductase EgtB (hercynine sythase)" /note="Mb3729c, -, len: 425 aa. Equivalent to Rv3703c, len: 425 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 425 aa overlap). Conserved hypothetical protein, similar to other hypothetical proteins e.g. Q9RD00|SCM1.44 from Streptomyces coelicolor (446 aa), FASTA scores: opt: 1480, E(): 1.4e-85, (53.9% identity in 421 aa overlap); P72841|SLR1303 from Synechocystis sp. strain PCC 6803 (410 aa), FASTA scores: opt: 533, E(): 4.5e-26, (36.6% identity in 429 aa overlap); Q9KYH7|SCC61A.16 from Streptomyces coelicolor (256 aa), FASTA scores: opt: 266, E(): 1.9e-09, (32.25% identity in 248 aa overlap); etc. Also similar to P95060|Rv0712|MTCY210.31 HYPOTHETICAL 32.7 KDA PROTEIN from Mycobacterium tuberculosis (299 aa), FASTA scores: opt: 243, E(): 5.9e-08, (30.6% identity in 304 aa overlap). Protein product from Mb3729c detected using SWATH mass spectrometry. Mb3729c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y510" /db_xref="InterPro:IPR005532" /db_xref="InterPro:IPR016187" /db_xref="InterPro:IPR017806" /db_xref="InterPro:IPR024775" /db_xref="InterPro:IPR032890" /db_xref="InterPro:IPR034660" /db_xref="InterPro:IPR042095" /db_xref="UniProtKB/TrEMBL:A0A1R3Y510" /protein_id="SIU02358.1" /translation="MTSPEQLACHLARARARTLRLVDFDDAELCCQYDPLMSPLVWDL AHIGQQEELWLLRGGDPGQPGLLPPAVEGLYDAFEHSRASRVELPLLSPARARSYCAT VRSAALDALAALPEDGDSFVFAMVISHENQHDETMLQALNLRTGSPLLAATSALPAGR PRMAGTSVLVAGGPFVLGVDAADEPCSLDNERPAHVVDVPAFRIGRVPVTNGEWQDFI DDGGYTQSRWWSERGWQHRQRAGLTAPQFWRSGGRTRTRFGHVEDIPADEPVQHVSYF EAEAYAAWAGARLPTEVEWEKACAWDPATGSRRRYPWGTEEPTDTYANLGGQTLRPAP VGAYPAGASACGAEQMLGDVWEWTTSPLRPWPGFVPMVYERYSQPFFGGDYRVLRGGS WAVEPAILRPSFRNWDHPYRRQIFAGVRLAWDI" CDS complement(4088648..4089946) /codon_start=1 /transl_table=11 /gene="gshA" /locus_tag="BQ2027_MB3730C" /product="GLUTAMATE--CYSTEINE LIGASE GSHA (GAMMA-GLUTAMYLCYSTEINE SYNTHETASE) (GAMMA-ECS) (GCS) (GAMMA-GLUTAMYL-L-CYSTEINE SYNTHETASE)" /note="Mb3730c, gshA, len: 432 aa. Equivalent to Rv3704c, len: 432 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 432 aa overlap). Possible gshA, glutamate--cysteine ligase (EC 6.3.2.2), similar to many e.g. Q9A2Z2|CC3414 GLUTAMATE--CYSTEINE LIGASE from Caulobacter crescentus (453 aa), FASTA scores: opt: 404, E(): 5.9e-17, (30.45% identity in 312 aa overlap); Q9SEH0|GSH1 GAMMA-GLUTAMYLCYSTEINYL SYNTHETASE PRECURSOR from Pisum sativum (Garden pea) (499 aa), FASTA scores: opt: 400, E(): 1.1e-16, (26.4% identity in 439 aa overlap); Q9RH09|GSH GAMMA-GLUTAMYLCYSTEINE SYNTHETASE from Zymomonas mobilis (462 aa), FASTA scores: opt: 397, E(): 1.6e-16, (28.95% identity in 304 aa overlap); P46309|GSH1_ARATH|GSH1|AT4G23100|F7H19.290 GLUTAMATE--CYSTEINE LIGASE from Arabidopsis thaliana (Mouse-ear cress) (522 aa), FASTA scores: opt: 395, E(): 2.3e-16, (27.25% identity in 385 aa overlap); etc. But note that this putative protein is also similar to Q9JMV4|GSHA PUTATIVE GLUTATHIONE SYNTHETASE (FRAGMENT) from Bradyrhizobium japonicum (460 aa), FASTA scores: opt: 498, E(): 1.3e-22, (33.35% identity in 333 aa overlap) (no significant publications found (August 2001). Mb3730c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y503" /db_xref="InterPro:IPR006336" /db_xref="InterPro:IPR014746" /db_xref="InterPro:IPR017809" /db_xref="InterPro:IPR035434" /db_xref="UniProtKB/TrEMBL:A0A1R3Y503" /protein_id="SIU02359.1" /translation="MTLAAMTAAASQLDNAAPDDVEITDSSAAAEYIADGCLVDGPLG RVGLEMEAHCFDPADPFRRPSWEEITEVLEWLSPLPGGSVVSVEPGGAVELSGPPADG VLAAIGAMTRDQAVLRSALANAGLGLVFLGADPLRSPVRVNPGARYRAMEQFFAASHS GVPGAAMMTSTAAIQVNLDAGPQEGWAERVRLAHALGPTMIAIAANSPMLGGRFSGWQ STRQRVWGQMDSARCGPILGASGDHPGIDWAKYALKAPVMMVRSPDTQDTRAVTDYVP FTDWVDGRVLLDGRRATVADLVYHLTTLFPPVRPRQWLEIRYLDSVPDEVWPAVVFTL VTLLDDPVAADLAVDAVEPVATAWDTAARIGLADRRLYLAANRCLAIAARRVPTELIG AMQRLVDHVDRGVCPADDFSDRVIAGGIASAVTGMMHGAS" CDS complement(4090078..4090722) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3731C" /product="conserved protein" /note="Mb3731c, -, len: 214 aa. Equivalent to Rv3705c, len: 214 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 214 aa overlap). Conserved hypothetical protein, equivalent to Q9CB80|ML2320 HYPOTHETICAL PROTEIN from Mycobacterium leprae (215 aa) FASTA scores: opt: 1145, E(): 5.9e-68, (79.45% identity in 214 aa overlap). Some similarity to the C-terminal end of Q11053|PKNH_MYCTU|Rv1266c|MT1304|MTCY50.16 PROBABLE SERINE/THREONINE-PROTEIN from Mycobacterium tuberculosis (626 aa), FASTA scores: opt: 175, E(): 0.0005, (24.9% identity in 201 aa overlap); and to the N-terminal end of P23903|E13B_BACCI|GLCA GLUCAN ENDO-1,3-BETA-GLUCOSIDASE A1 PRECURSOR from Bacillus circulans (682 aa), FASTA scores: opt: 122, E(): 1.6, (25.6% identity in 164 aa overlap). Protein product from Mb3731c detected using SWATH mass spectrometry. Mb3731c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR026954" /db_xref="InterPro:IPR038232" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4W8" /protein_id="SIU02360.1" /translation="MRIAAAVVSIGLAVIAGFAVPVADAHPSEPGVVSYAVLGKGSVG NIVGAPMGWEAVFTRPFQAFWVELPACNNWVDIGLPEVYDDPDLASFNGATTQTSATD QTHLVKQAVGVFASNDAADRAFHRVVDRTVGCSGQTTAIHLDDGTTQVWSFAGGPSTG TDEAWTKQEAGTDRRCFVQTRLRENVLLQAKVCQSGNAGPAVNVLAGAMQNTLG" CDS complement(4090851..4091240) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3732C" /product="CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN" /note="Mb3732c, -, len: 129 aa. Equivalent to Rv3705A, len: 129 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 129 aa overlap). Conserved hypothetical protein, similar to downstream ORF O69674|Rv3706c|MTV025.054c CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN from Mycobacterium tuberculosis (106 aa), FASTA scores: opt: 245, E(): 0.00013, (40.7% identity in 113 aa overlap). Mb3732c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z0" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z0" /protein_id="SIU02361.1" /translation="MTETPQPAAPPPSAATTSPPPSPQQEKPPRLYRAAAWVVIVAGI VFTVAVIFFSGALVLGQGKCPYHRYYHHGMFRPVGPVAPGPGMGWVFGFPGGPPPPGM GPGFPGGPGGPAVGPTGPGPTTAPARP" CDS complement(4091351..4091671) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3733C" /product="CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN" /note="Mb3733c, -, len: 106 aa. Equivalent to Rv3706c, len: 106 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 106 aa overlap). Conserved ypothetical pro-rich protein, only similar to upstream ORF Rv3705A (129 aa), and AAK48176|MT3808.1 HYPOTHETICAL 13.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (129 aa), FASTA scores: opt: 245, E(): 4.4e-06, (40.7% identity in 113 aa overlap). Protein product from Mb3733c detected using shotgun mass spectrometry. Mb3733c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y4Z6" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z6" /protein_id="SIU02362.1" /translation="MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFT GYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPA TPAP" CDS complement(4091790..4092800) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3734C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3734c, -, len: 336 aa. Equivalent to Rv3707c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Equivalent to Q9CB79|ML2321 HYPOTHETICAL PROTEIN from Mycobacterium leprae (336 aa), FASTA scores: opt: 1948, E(): 6.7e-110, (81.95% identity in 332 aa overlap); and P41402|YASD_MYCSM HYPOTHETICAL 35.9 KDA PROTEIN IN THE ASPARTOKINASE GENE CLUSTER from Mycobacterium smegmatis (333 aa), FASTA scores: opt: 1731, E(): 7.4e-97, (70.85% identity in 333 aa overlap). Protein product from Mb3734c detected using SWATH mass spectrometry. Mb3734c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025442" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z1" /protein_id="SIU02363.1" /translation="MLRIGPTAGTGTPTGDYGIGATDLCEFVEFPSQLLQVCGDSFAG QGVGFGGWYAPVALHVDTESIDDPAGVRYTGVTGVGTPLLADPTPPGDSQLPAGVVQI NRRNYLMVTTTKDLQPQNSRLVRAEAARGGWQTVSGSRRNAAYQDGRQTQISGYYDPV PTPDSPTGWVYIVADSFTRGEPAVLYRATPESFTDRSRWQGWAGGPDGGWNKPPTPLW PDQLGEMSIRQIDGQTVLSYFNASTGNMEVRVAHHPTSLGAAPVTTVVRHDEWPEPAE SLPPPYDNRLAQPYGGYISPGSTIDELRIFVSQWDTRARQNGPYRVIQFAVNPFKPWS DP" CDS complement(4092940..4093977) /codon_start=1 /transl_table=11 /gene="asd" /locus_tag="BQ2027_MB3735C" /product="ASPARTATE-SEMIALDEHYDE DEHYDROGENASE ASD (ASA DEHYDROGENASE) (ASADH) (ASPARTIC SEMIALDEHYDE DEHYDROGENASE) (L-ASPARTATE-BETA-SEMIALDEHYDE DEHYDROGENASE)" /note="Mb3735c, asd, len: 345 aa. Equivalent to Rv3708c, len: 345 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 345 aa overlap). asd, aspartate-semialdehyde dehydrogenase (EC 1.2.1.11) (see citation below), equivalent to many e.g. P47730|DHAS_MYCBO|ASD from Mycobacterium bovis (345 aa) FASTA scores: opt: 2150, E(): 1.6e-124, (97.7% identity in 345 aa overlap); or Q9JN40|ASD from Mycobacterium bovis (323 aa), FASTA scores: opt: 2021, E(): 1.2e-116, (97.5% identity in 323 aa overlap); Q9CB78|ASD|ML2322 from Mycobacterium leprae (351 aa), FASTA scores: opt: 1889, E(): 1.6e-108, (84.45% identity in 347 aa overlap); P41404|DHAS_MYCSM|ASD from Mycobacterium smegmatis (346 aa), FASTA scores: opt: 1801, E(): 3.9e-103, (80.3% identity in 345 aa overlap); etc. Contains PS01103 Aspartate-semialdehyde dehydrogenase signature. BELONGS TO THE ASPARTATE-SEMIALDEHYDE DEHYDROGENASE FAMILY. Protein product from Mb3735c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3735c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A543" /db_xref="InterPro:IPR000319" /db_xref="InterPro:IPR000534" /db_xref="InterPro:IPR005986" /db_xref="InterPro:IPR012080" /db_xref="InterPro:IPR012280" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P0A543" /protein_id="SIU02364.1" /translation="MGLSIGIVGATGQVGQVMRTLLDERDFPASAVRFFASARSQGRK LAFRGQEIEVEDAETADPSGLDIALFSAGSAMSKVQAPRFAAAGVTVIDNSSAWRKDP DVPLVVSEVNFERDAHRRPKGIIANPNCTTMAAMPVLKVLHDEARLVRLVVSSYQAVS GSGLAGVAELAEQARAVIGGAEQLVYDGGALEFPPPNTYVAPIAFNVVPLAGSLVDDG SGETDEDQKLRFESRKILGIPDLLVSGTCVRVPVFTGHSLSINAEFAQPLSPERAREL LDGATGVQLVDVPTPLAAAGVDESLVGRIRRDPGVPDGRGLALFVSGDNLRKGAALNT IQIAELLTADL" CDS complement(4093978..4095243) /codon_start=1 /transl_table=11 /gene="ask" /locus_tag="BQ2027_MB3736C" /product="ASPARTOKINASE ASK (ASPARTATE KINASE) [CONTAINS: ASPARTOKINASE ALPHA SUBUNIT (ASK-ALPHA); AND ASPARTOKINASE BETA SUBUNIT (ASK-BETA)]" /note="Mb3736c, ask, len: 421 aa. Equivalent to Rv3709c, len: 421 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 421 aa overlap). ask, aspartokinase (EC 2.7.2.4) (see citation below), equivalent to Q9CB77|ASK|ML2323 from Mycobacterium leprae (421 aa), FASTA scores: opt: 2531, E(): 2e-140, (92.65% identity in 421 aa overlap); and P41403|AK_MYCSM|ASK from Mycobacterium smegmatis (421 aa), FASTA scores: opt: 2423, E(): 4e-134, (88.1% identity in 421 aa overlap); and to several other organisms e.g. Q9RQ25|ASKA from Amycolatopsis mediterranei (421 aa), FASTA scores: opt: 2026, E(): 5.8e-111, (72.2% identity in 421 aa overlap). Contains PS00324 Aspartokinase signature. BELONGS TO THE ASPARTOKINASE FAMILY. ALTERNATIVE PRODUCTS: THE ALPHA AND BETA SUBUNITS OF ASPARTOKINASE ARE PRODUCED BY THE USE OF ALTERNATIVE INITIATION SITES (BY SIMILARITY). Protein product from Mb3736c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3736c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A4Z9" /db_xref="InterPro:IPR001048" /db_xref="InterPro:IPR001341" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR005260" /db_xref="InterPro:IPR018042" /db_xref="InterPro:IPR027795" /db_xref="InterPro:IPR036393" /db_xref="InterPro:IPR041740" /db_xref="UniProtKB/Swiss-Prot:P0A4Z9" /protein_id="SIU02365.1" /translation="MALVVQKYGGSSVADAERIRRVAERIVATKKQGNDVVVVVSAMG DTTDDLLDLAQQVCPAPPPRELDMLLTAGERISNALVAMAIESLGAHARSFTGSQAGV ITTGTHGNAKIIDVTPGRLQTALEEGRVVLVAGFQGVSQDTKDVTTLGRGGSDTTAVA MAAALGADVCEIYTDVDGIFSADPRIVRNARKLDTVTFEEMLEMAACGAKVLMLRCVE YARRHNIPVHVRSSYSDRPGTVVVGSIKDVPMEDPILTGVAHDRSEAKVTIVGLPDIP GYAAKVFRAVADADVNIDMVLQNVSKVEDGKTDITFTCSRDVGPAAVEKLDSLRNEIG FSQLLYDDHIGKVSLIGAGMRSHPGVTATFCEALAAVGVNIELISTSEIRISVLCRDT ELDKAVVALHEAFGLGGDEEATVYAGTGR" CDS 4095500..4097605 /codon_start=1 /transl_table=11 /gene="leuA" /locus_tag="BQ2027_MB3737" /product="2-isopropylmalate synthase leua (alpha-isopropylmalate synthase) (alpha-ipm synthetase) (ipms)" /note="Mb3737, leuA, len: 701 aa. Similar to Rv3710, len: 644 aa, from Mycobacterium tuberculosis strain H37Rv, (91.7% identity in 701 aa overlap). leuA, alpha-isopropylmalate synthase (EC 4.1.3.12) (see citations below), equivalent to Q9CB76|LEUA|ML2324 2-ISOPROPYLMALATE SYNTHASE from Mycobacterium leprae (607 aa), FASTA scores: opt: 3291, E(): 3.7e-192, (80.7% identity in 642 aa overlap). Also highly similar to many e.g. P42455|LEU1_CORGL|LEUA from Corynebacterium glutamicum (Brevibacterium flavum) (616 aa), FASTA scores: opt: 2547, E(): 5.3e-147, (63.25% identity in 645 aa overlap); O31046|LEU1_STRCO|LEUA from Streptomyces coelicolor (573 aa), FASTA scores: opt: 2226, E(): 1.5e-127, (57.8% identity in 616 aa overlap); BAB49833|Q98HN3|MLR2792 from Rhizobium loti (Mesorhizobium loti) (588 aa), FASTA scores: opt: 1849, E(): 1.1e-104, (58.0% identity in 536 aa overlap); etc. Equivalent to AAK48181 from Mycobacterium tuberculosis strain CDC1551 (659 aa) but shorter 15 aa. Contains PS00815 and PS00816 Alpha-isopropylmalate and homocitrate synthases signatures 1 and 2. BELONGS TO THE ALPHA-IPM SYNTHETASE / HOMOCITRATE SYNTHASE FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, an in-frame insertion of 171 bp leads to a longer product compared to its homolog in Mycobacterium tuberculosis (701 aa versus 644 aa). Protein product from Mb3737 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3737 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVV6" /db_xref="InterPro:IPR000891" /db_xref="InterPro:IPR002034" /db_xref="InterPro:IPR005668" /db_xref="InterPro:IPR013709" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR036230" /db_xref="InterPro:IPR039371" /db_xref="UniProtKB/Swiss-Prot:Q7TVV6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02366.1" /translation="MTTSESPDAYTESFGAHTIVKPAGPPRVGQPSWNPQRASSMPVN RYRPFAEEVEPIRLRNRTWPDRVIDRAPLWCAVDLRDGNQALIDPMSPARKRRMFDLL VRMGYKEIEVGFPSASQTDFDFVREIIEQGAIPDDVTIQVLTQCRPELIERTFQACSG AHRAIVHFYNSTSILQRRVVFRANRAEVQAIATDGARKCVEQAAKYPGTQWRFEYSPE SYTGTELEYAKQVCDAVGEVIAPTPERPIIFNLPATVEMTTPNVYADSIEWMSRNLAN RESVILSLHPHNDRGTAVAAAELGFAAGADRIEGCLFGNGERTGNVCLVTLGLNLFSR GVDPQIDFSNIDEIRRTVEYCNQLPVHERHPYGGDLVYTAFSGSHQDAINKGLDAMKL DADAADCDVDDMLWQVPYLPIDPRDVGRTYEAVIRVNSQSGKGGVAYIMKTDHGLSLP RRLQIEFSQVIQKIAEGTAGEGGEVSPKEMWDAFAEEYLAPVRPLERIRQHVDAADDD GGTTSITATVKINGVETEISGSGNGPLAAFVHALADVGFDVAVLDYYEHAMSAGDDAQ AAAYVEASVTIASPAQPGEAGRHASDPVTIASPAQPGEAGRHASDPVTIASPAQPGEA GRHASDPVTIASPAQPGEAGRHASDPVTIASPAQPGEAGRHASDPVTSKTVWGVGIAP SITTASLRAVVSAVNRAAR" CDS complement(4097671..4098660) /codon_start=1 /transl_table=11 /gene="dnaQ" /locus_tag="BQ2027_MB3738C" /product="probable dna polymerase iii (epsilon subunit) dnaq" /note="Mb3738c, dnaQ, len: 329 aa. Equivalent to Rv3711c, len: 329 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 329 aa overlap). Probable dnaQ, DNA polymerase III, epsilon subunit (EC 2.7.7.7), similar to many e.g. Q9RJ41|SCI8.12 from Streptomyces coelicolor (328 aa), FASTA scores: opt: 509, E(): 4.2e-25, (41.6% identity in 315 aa overlap); Q9JYS6|NMB1451 from Neisseria meningitidis (serogroup B) (and Q9JTR5|MA1665 from serogroup A) (470 aa), FASTA scores: opt: 247, E(): 2.6e-08, (33.15% identity in 172 aa overlap); O83649|DP3E_TREPA|DNAQ|TP0643 from Treponema pallidum (215 aa), FASTA scores: opt: 240, E(): 3.7e-08, (29.65% identity in 162 aa overlap); P03007|DP3E_ECOLI|MUTD|B0215 from Escherichia coli strain K12 (243 aa), FASTA scores: opt: 208, E(): 4.5e-06, (28.4% identity in 169 aa overlap); etc. Also similar to Q10384|YL91_MYCTU|Rv2191|MTCY190.02 from Mycobacterium tuberculosis (645 aa), FASTA scores: opt: 260, E(): 5e-09, (28.55% identity in 301 aa overlap). Mb3738c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Y4" /db_xref="InterPro:IPR006054" /db_xref="InterPro:IPR012337" /db_xref="InterPro:IPR013520" /db_xref="InterPro:IPR036397" /db_xref="InterPro:IPR036420" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y4" /protein_id="SIU02367.1" /translation="MSHTWGRPASHQDRGWAVIDVETSGFRPGQARIISLAVLGLDAA GRLEQSVVSLLNPKVDPGPTHVHGLTAAMLDDQPQFADIAGEVVDVLRGRTLVAHNVA FDYAFLAAEAEIAEAELPVDFVMCTVELARRLQLGVDNLRLETLAAHWGVPQQRPHDA FDDARVLTGILAAALESARELDVWLPVHPVTRRRWPNGRVTHDELRPLKALAARMACP YLNPGRYVQGRPLVQGMRVGLAAEVKRTHEELVERILHAGLAYSDVVDRDTSLVVCNA TAPEHGKGYHALQLGVPVMPEARFMECIGAVVGGASVEDFTDVAPVEKQLALF" CDS 4098853..4100094 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3739" /product="POSSIBLE LIGASE" /note="Mb3739, -, len: 413 aa. Equivalent to Rv3712, len: 413 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 413 aa overlap). Possible ligase (EC 6.-.-.-), equivalent to O69522|ML2326|MLCB2407.24c HYPOTHETICAL 43.8 KDA PROTEIN (POSSIBLE LIGASE) from Mycobacterium leprae (411 aa), FASTA scores: opt: 2265, E(): 8e-129, (84.25% identity in 413 aa overlap). Also similar to ligases or hypothetical proteins e.g. Q9FCA1|2SCG58.12 PUTATIVE LIGASE from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1168, E(): 6.7e-63, (45.8% identity in 406 aa overlap); P74303|SLR0938 HYPOTHETICAL 50.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (459 aa), FASTA scores: opt: 392, E(): 3.1e-16, (28.45% identity in 397 aa overlap); Q99ZX1|SPY1035 PUTATIVE UDP-N-ACETYLMURAMYL TRIPEPTIDE SYNTHETASE (EC 6.3.2.13) from Streptococcus pyogenes (445 aa), FASTA scores: opt: 335, E(): 8.1e-13, (29.2% identity in 438 aa overlap); Q9CGJ0|YLBD HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (449 aa), FASTA scores: opt: 324, E(): 3.8e-12, (28.75% identity in 445 aa overlap); Q9ZGG7|MURC UDP-N-ACETYLMURAMYL TRIPEPTIDE SYNTHETASE from Heliobacillus mobilis (455 aa), FASTA scores: opt: 292, E(): 3.2e-10, (30.75% identity in 449 aa overlap); etc. Protein product from Mb3739 detected using SWATH mass spectrometry. Mb3739 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y514" /db_xref="InterPro:IPR013221" /db_xref="InterPro:IPR013564" /db_xref="InterPro:IPR036565" /db_xref="UniProtKB/TrEMBL:A0A1R3Y514" /protein_id="SIU02368.1" /translation="MVTTRARLALAAGAGARWASRVTGRGAGAMIGGLVAMTLDRSIL RQLGMGRRTVVVTGTNGKSTTTRMTAAALGTLGAVATNAEGANMDAGLVAALAAHRDA ELAVLEVDEMHVPHISDAVDPAVVVLLNLSRDQLDRVGEINVIERTLRAGLARHPDAV VVANCDDVLMTSAAYDSPNVVWVAAGGAWSNDSVSCPRSSEVIVRKAPSQEDHWYSTG ADFKRPAPHWWFDDATLYGPDGLALPMRLALPGSVNRGNAAQAVAAAVALGADPAVAV AAVCQVDEVAGRYRTVRIGAHQARILLAKNPAGWQEALAMVDKHADGVVIAVNGRVPD GEDLSWLWDVRFEHFETTRVVAAGERGTDLAVRLGYAGVEHTLVHDTVAAIASCPPGR VEVVANYTAFLQLQRALARRG" CDS 4100099..4100794 /codon_start=1 /transl_table=11 /gene="cobQ2" /locus_tag="BQ2027_MB3740" /product="POSSIBLE COBYRIC ACID SYNTHASE COBQ2" /note="Mb3740, cobQ2, len: 231 aa. Equivalent to Rv3713, len: 231 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 231 aa overlap). Possible cobQ2, cobyric acid synthase (EC undetermined), equivalent to O69521|ML2327|MLCB2407.23c HYPOTHETICAL 24.5 KDA PROTEIN from Mycobacterium leprae (230 aa), FASTA scores: opt: 1313, E(): 4.7e-73, (86.1% identity in 230 aa overlap). Also partially similar to several cobyric acid synthases and hypothetical proteins e.g. Q9FCA0|2SCG58.13 HYPOTHETICAL 26.2 KDA PROTEIN from Streptomyces coelicolor (242 aa), FASTA scores: opt: 639, E(): 6.2e-32, (46.6% identity in 234 aa overlap); Q9ZGG8|COBQ COBYRIC ACID SYNTHASE from Heliobacillus mobilis (252 aa), FASTA scores: opt: 501, E(): 1.7e-23, (40.75% identity in 206 aa overlap); BAB58053|SAV1891 HYPOTHETICAL 27.4 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (243 aa), FASTA scores: opt: 400, E(): 2.3e-17, (35.95% identity in 217 aa overlap); Q9CGJ1|COBQ COBYRIC ACID SYNTHASE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (261 aa), FASTA scores: opt: 353, E(): 1.8e-14, (35.3% identity in 201 aa overlap); O26880|COBQ_METTH|MTH787 PROBABLE COBYRIC ACID SYNTHASE from Methanobacterium thermoautotrophicum (504 aa), FASTA scores: opt: 201, E(): 5.6e-05, (33.35% identity in 171 aa overlap); etc. Also similar to hypothetical mycobacterial proteins O05811|COBB_MYCTU|Rv2848c|MT2914|MTCY24A1.09 (457 aa) and P71842|Rv0789c|MTCY369.33c (199 aa). SEEMS TO BELONG TO THE COBB/COBQ FAMILY, COBQ SUBFAMILY. Protein product from Mb3740 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3740 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y524" /db_xref="InterPro:IPR011698" /db_xref="InterPro:IPR017929" /db_xref="InterPro:IPR029062" /db_xref="InterPro:IPR033949" /db_xref="UniProtKB/TrEMBL:A0A1R3Y524" /protein_id="SIU02369.1" /translation="MVRIGLVLPDVMGTYGDGGNAVVLRQRLLLRGIAAEIVEITLAD PVPDSLDLYTLGGAEDYAQRLATRHLRRYPGLQRAAGRGAPVLAICAAIQVLGHWYET SSGDRVDGVGLLDVTTSPQDARTIGELVSKPLLAGLTQPLTGFENHRGGTVLGPGTSP LGAVVKGAGNRAGDGFDGAVAGSVVATYMHGPCLARNPELADLLLSKVVGELAPLDLP EVDLLRRERLSAR" CDS complement(4100803..4101693) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3741C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3741c, -, len: 296 aa. Equivalent to Rv3714c, len: 296 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 296 aa overlap). Conserved hypothetical protein, highly similar to O07396|MAV346 MAV346 PROTEIN from Mycobacterium avium (346 aa) FASTA scores: opt: 834, E(): 2.2e-46, (50.0% identity in 286 aa overlap); and also highly similar to several proteins from Mycobacterium tuberculosis e.g. O53421|Rv1073|MTV017.26 (283 aa), FASTA scores: opt: 869, E(): 1e-48, (51.1% identity in 270 aa overlap); P71763|Rv1482c|MTCY277.03c (339 aa), FASTA scores: opt: 775, E(): 1.3e-42, (46.35% identity in 289 aa overlap); P96837|Rv3555c|MTCY06G11.02c (289 aa), FASTA scores: opt: 733, E(): 5.9e-40, (44.15% identity in 281 aa overlap); etc. Partially similar to Q9Z512|UVRC_STRCO|SCC54.13c EXCINUCLEASE ABC SUBUNIT C from Streptomyces coelicolor (728 aa), FASTA scores: opt: 122, E(): 2.5, (27.0% identity in 174 aa overlap). Equivalent to AAK48186 from Mycobacterium tuberculosis strain CDC1551 (341 aa) but shorter 45 aa. Protein product from Mb3741c detected using SWATH mass spectrometry. Mb3741c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011335" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Y1" /protein_id="SIU02370.1" /translation="MLISRMSVRSASMSVMGDVFIGSEAITAGRLTRHELQRWYQPMF RGVYVSRRSVPTLWDRTVGAWLATRRHGVIAGNAASALHGAQWVDVDVAIELISPTTR PQHGLVIRRETLCDDEITRVVGLPVTTLARTAYDLGRHLSRGEAVARLDALMRATPFS RDDVLLLAKRHAGARGVRRLRDVLPLVDGGAASPKETWLRLLLIDAGLPVPTTQIPVV HRWRNVGVLDMGWEKYMVAAEYDGDQHRSDRGRYVKDQRRLRKLAELGWIVIRVIAED NPDDVVNRVRAALLARGWRP" CDS complement(4101761..4102372) /codon_start=1 /transl_table=11 /gene="recR" /locus_tag="BQ2027_MB3742C" /product="probable recombination protein recr" /note="Mb3742c, recR, len: 203 aa. Equivalent to Rv3715c, len: 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Probable recR, recombination protein, equivalent to O69520|RECR_MYCLE|ML2329|MLCB2407.21 RECOMBINATION PROTEIN from Mycobacterium leprae (203 aa), FASTA scores: opt: 1246, E(): 9.2e-71, (91.6% identity in 202 aa overlap). Also highly similar to many e.g. Q9XAI4|RECR_STRCO|SC66T3.29c from Streptomyces coelicolor (199 aa), FASTA scores: opt: 952, E(): 1.9e-52, (68.3% identity in 202 aa overlap); P24277|RECR_BACSU|RECM|RECD from Bacillus subtilis (198 aa), FASTA scores: opt: 696, E(): 1.8e-36, (50.5% identity in 198 aa overlap); Q9ZNA2|RECR_DEIRA|DR0198 from Deinococcus radiodurans (220 aa), FASTA scores: opt: 673, E(): 5.2e-35, (49.75% identity in 195 aa overlap); etc. BELONGS TO THE RECR FAMILY. Protein product from Mb3742c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3742c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65991" /db_xref="InterPro:IPR000093" /db_xref="InterPro:IPR003583" /db_xref="InterPro:IPR006171" /db_xref="InterPro:IPR015967" /db_xref="InterPro:IPR023627" /db_xref="InterPro:IPR023628" /db_xref="InterPro:IPR034137" /db_xref="UniProtKB/Swiss-Prot:P65991" /protein_id="SIU02371.1" /translation="MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTG VLAKVRDGVRFCAVCGNVSDNERCRICSDIRRDASVVCIVEEPKDIQAVERTREFRGR YHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNTEGEATATYLV RMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRRVLA" CDS complement(4102384..4102785) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3743C" /product="Nucleoid-associated protein YaaK" /note="Mb3743c, -, len: 133 aa. Equivalent to Rv3716c, len: 133 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 133 aa overlap). Conserved hypothetical protein, equivalent to O69519|Y1B6_MYCLE|ML2330|MLCB2407.20 HYPOTHETICAL 11.9 KDA PROTEIN from Mycobacterium leprae (116 aa), FASTA scores: opt: 616, E(): 2.6e-21, (84.55% identity in 110 aa overlap). Also highly similar to hypothetical ~12 kDa proteins in the vicinity of recR from other bacteria e.g. Q9XAI3|YT3D_STRCO|SC66T3.30c HYPOTHETICAL 11.7 KDA PROTEIN from Streptomyces coelicolor (115 aa), FASTA scores: opt: 379, E(): 9.5e-11, (50.8% identity in 122 aa overlap); BAB56641|SAV0479 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (105 aa) FASTA scores: opt: 295, E(): 4.9e-07, (41.75% identity in 103 aa overlap); Q99WC4P24281|YAAK_BACSU HYPOTHETICAL 11.8 KDA PROTEIN IN DNAZ-RECR INTERGENIC REGION from Bacillus subtilis (107 aa), FASTA scores: opt: 272, E(): 5.3e-06, (39.4% identity in 104 aa overlap); P17577|YBAB_ECOLI|B0471|Z0588|ECS0524 from Escherichia coli strain K and O157:H7 (109 aa), FASTA scores: opt: 256, E(): 2.8e-05, (38.0% identity in 100 aa overlap); etc. Contains probable coiled-coil domain from aa 1-40. SEEMS TO BELONG TO THE UPF0133 FAMILY. Protein product from Mb3743c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3743c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A655" /db_xref="InterPro:IPR004401" /db_xref="InterPro:IPR036894" /db_xref="UniProtKB/Swiss-Prot:P0A655" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02372.1" /translation="MQPGGDMSALLAQAQQMQQKLLEAQQQLANSEVHGQAGGGLVKV VVKGSGEVIGVTIDPKVVDPDDIETLQDLIVGAMRDASQQVTKMAQERLGALAGAMRP PAPPAAPPGAPGMPGMPGMPGAPGAPPVPGI" CDS 4102920..4103645 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3744" /product="N-acetylmuramoyl-L-alanine amidase (EC" /EC_number="3.5.1.28" /note="Mb3744, -, len: 241 aa. Equivalent to Rv3717, len: 241 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 241 aa overlap). Conserved hypothetical protein, equivalent to O69518|MLCB2407.19c (alias Q9CB75|ML2331 256 aa) HYPOTHETICAL 25.1 KDA PROTEIN from Mycobacterium leprae (244 aa), FASTA scores: opt: 1325, E(): 5.7e-74, (81.95% identity in 244 aa overlap). Also similar to Q9KXK7|SCC53.04 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 536, E(): 1.2e-25, (41.2% identity in 233 aa overlap); and shows similarity with C-terminal end of other proteins e.g. Q9RMZ0|PXO2-42 PXO2-42 PROTEIN from Bacillus anthracis (531 aa), FASTA scores: opt: 191, E(): 0.00022, (26.6% identity in 222 aa overlap); Q9RTX0 PUTATIVE N-ACETYLMURAMOYL-L-ALANINE AMIDASE (603 aa); Q9LCR4|CWLU CWLU PROTEIN from Paenibacillus polymyxa (Bacillus polymyxa) (524 aa), FASTA scores: opt: 141, E(): 0.24, (29.2% identity in 219 aa overlap); etc. Shows similarity with C-terminal end of O53593|CWLM|Rv3915|MTV028.06 PUTATIVE HYDROLASE from Mycobacterium tuberculosis (406 aa), FASTA scores: opt: 176, E(): 0.0014, (25.7% identity in 218 aa overlap). Protein product from Mb3744 detected using SWATH mass spectrometry. Mb3744 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z8" /db_xref="InterPro:IPR002508" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z8" /protein_id="SIU02373.1" /translation="MIVGVLVAAATPIISSASATPANIAGMVVFIDPGHNGANDASIG RQVPTGRGGTKNCQASGTSTNSGYPEHTFTWETGLRLRAALNALGVRTALSRGNDNAL GPCVDERANMANALRPNAIVSLHADGGPASGRGFHVNYSAPPLNAIQAGPSVQFARIM RDQLQASGIPKANYIGQDGLYGRSDLAGLNLAQYPSILVELGNMKNPADSALMESAEG RQKYANALVRGVAGFLATQGQAR" CDS complement(4103687..4104130) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3745C" /product="conserved protein" /note="Mb3745c, -, len: 147 aa. Equivalent to Rv3718c, len: 147 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 147 aa overlap). Conserved hypothetical protein, equivalent to O69517|ML2332|MLCB2407.18 HYPOTHETICAL 15.5 KDA PROTEIN from Mycobacterium leprae (145 aa), FASTA scores: opt: 780, E(): 1.4e-44, (81.95% identity in 144 aa overlap). Also highly similar to Q9ZBJ2|SC9C7.18 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (147 aa) FASTA scores: opt: 475, E(): 1.7e-24, (52.05% identity in 146 aa overlap); and showing some similarity to various proteins e.g. P27538|PR2_PETCR PATHOGENESIS-RELATED PROTEIN 2 from Petroselinum crispum (Parsley) (Petroselinum hortense) (158 aa); P92918|ALL2_APIGR MAJOR ALLERGEN API G 2 from Apium graveolens (Celery) (159 aa); etc. Protein product from Mb3745c detected using shotgun mass spectrometry. Mb3745c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014488" /db_xref="InterPro:IPR019587" /db_xref="InterPro:IPR023393" /db_xref="UniProtKB/TrEMBL:A0A1R3Y509" /protein_id="SIU02374.1" /translation="MGQVSAASTILINAEPTATLDALADYETVRPKILSPHYSEYQVL EGGKGRGTVAKWRLQATQSRVRDVQVNVDVAGHTVIEKDMNSSMVTNWTVAPAGPGSS VTVKTTWTGAGGVKGFFEKTFAPLGLKKIQAEVLSNLKTELEGDA" CDS 4104178..4105590 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3746" /product="FAD/FMN-containing dehydrogenase Mvan_5531" /note="Mb3746, -, len: 470 aa. Equivalent to Rv3719, len: 470 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 470 aa overlap). Conserved hypothetical protein, equivalent to O69516|ML2333|MLCB2407.17c HYPOTHETICAL 51.8 KDA PROTEIN from Mycobacterium leprae (459 aa), FASTA scores: opt: 2593, E(): 7.8e-161, (82.75% identity in 458 aa overlap). Also some similarity to Q9CU63|5830417J06RIK HYPOTHETICAL PROTEIN (FRAGMENT) from Mus musculus (Mouse) (479 aa) FASTA scores: opt: 454, E(): 6.1e-22, (27.1% identity in 413 aa overlap); Q9HBA8 SELADIN-1 (UNKNOWN) from Homo sapiens (Human) (516 aa), FASTA scores: opt: 444, E(): 2.9e-21, (26.7% identity in 412 aa overlap); O17397|DIMH_CAEEL|F52H2.6 DIMINUTO-LIKE PROTEIN from Caenorhabditis elegans (525 aa), FASTA scores: opt: 419, E(): 1.2e-19, (24.4% identity in 434 aa overlap); Q39085|DIM_ARATH|DWF1 CELL ELONGATION PROTEIN DIMINUTO from Arabidopsis thaliana (Mouse-ear cress) (561 aa) FASTA scores: opt: 318, E(): 4.8e-13, (24.6% identity in 455 aa overlap); etc. Also some similarity to M. tuberculosis hypothetical proteins P72056|Rv3790|MTCY13D12.24 (461 aa) FASTA scores: opt: 174, E(): 0.00016; (25.1% identity in 426 aa overlap); and Q50685|Rv2280|MTCY339_30c (459 aa). Protein product from Mb3746 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3746 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6K4" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR016164" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="InterPro:IPR040165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6K4" /protein_id="SIU02375.1" /translation="MQGQLSRTRVYAVPVPGSAQSAYACGVERLLASYRSIPATASIR LAKPTSNLFRARVKHDARGLDASGLTGVIGIDPEARTADVAGMCTYEDLIAATLHYGL SPLVVPQLRTITLGGAVTGLGIESASFRNGLPHESVLEMDILTGAGELLTVSPGQHSD LYRAFPNSYGTLGYSTRLRIQLEPVRPFVALRHIRFSSLTAMVAAMERIIDTGGLDGE SVDYLDGVVFSADESYLCIGMQTSVPGPVSDYTGQDIYYRSIQHEAGIKEDRLTIHDY FWRWDTDWFWCSRSFGAQNPRLRRWWPRRYRRSSVYWRLMALDQRFGIADRFENSRGR PARERVVQDIEVPIERTCEFLEWFGENVPISPIWLCPLRLRDHAGWPLYPIRPDRSYV NIGFWSSVPVGATEGATNRKIENKVSALDGHKSLYSDSFYTREEFDELYGGETYNTVK KAYDPDSRLLDLYAKAVQRR" CDS 4105608..4106870 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3747" /product="POSSIBLE FATTY ACID SYNTHASE" /note="Mb3747, -, len: 420 aa. Equivalent to Rv3720, len: 420 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 420 aa overlap). Possible fatty-acyl-phospholipid synthase (EC 2.1.1.-), equivalent to Q9CB74|ML2334 (alias O69515|MLCB2407.16c, 439 aa) HYPOTHETICAL PROTEIN from Mycobacterium leprae (420 aa) FASTA scores: opt: 2508, E(): 4.7e-153, (86.45% identity in 420 aa overlap). Also similar (especially at the C-terminus) to various fatty-acid synthases (principally cyclopropane-fatty-acyl-phospholipid synthases (EC 2.1.1.79)) and hypothetical proteins e.g. Q9KZ58|SCE25.32c PUTATIVE FATTY ACID SYNTHASE from Streptomyces coelicolor (438 aa), FASTA scores: opt: 1101, E(): 5.5e-63, (46.1% identity in 425 aa overlap); P31049|YLP3_PSEPU HYPOTHETICAL 44.7 KDA PROTEIN from Pseudomonas putida (394 aa), FASTA scores: opt: 810, E(): 2.1e-44, (46.4% identity in 293 aa overlap); Q9HT28|PA5546 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (394 aa), FASTA scores: opt: 804, E(): 5.2e-44, (40.7% identity in 371 aa overlap); Q9RSD7|DR2187 PUTATIVE CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Deinococcus radiodurans (462 aa), FASTA scores: opt: 747, E(): 2.6e-40, (35.95% identity in 409 aa overlap); BAB50831|Q98ET6|MLL4091 CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Rhizobium loti (Mesorhizobium loti) (422 aa), FASTA scores: opt: 674, E(): 1.1e-35, (39.1% identity in 284 aa overlap); P30010|CFA_ECOLI|CDFA|B1661 CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIP SYNTHASE from Escherichia coli strain K12 (381 aa), FASTA scores: opt: 530, E(): 1.7e-26, (33.65% identity in 312 aa overlap); etc. Also similar to other proteins from Mycobacterium tuberculosis e.g. CMA2|Rv0503c|MTCY20G9.30c (302 aa); P96911|Rv0621|MTCY20H10 (354 aa); O50416|LPQD|Rv3390|MTV004.48 (236 aa); etc. Protein product from Mb3747 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3747 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5D8" /db_xref="InterPro:IPR003333" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D8" /protein_id="SIU02376.1" /translation="MAEILEIFTATGQHPLKFTAYDGSTAGQDDATLGLDLRTPRGAT YLATAPGELGLARAYVSGDLQAHGVHPGDPYELLKTLTERVDFKRPSARVLANVVRSI GVEHILPIAPPPQEARPRWRRMANGLLHSKTRDAEAIHHHYDVSNNFYEWVLGPSMTY TCAVFPNAEASLEQAQENKYRLIFEKLRLEPGDRLLDVGCGWGGMVRYAARRGVRVIG ATLSAEQAKWGQKAVEDEGLSDLAQVRHSDYRDVAETGFDAVSSIGLTEHIGVKNYPF YFGFLKSKLRTGGLLLNHCITRHDNRSTSFAGGFTDRYVFPDGELTGSGRITTEIQQV GLEVLHEENFRHHYAMTLRDWCGNLVEHWDDAVAEVGLPTAKVWGLYMAASRVAFERN NLQLHHVLATKVDPRGDDSLPLRPWWQP" CDS complement(4106867..4108603) /codon_start=1 /transl_table=11 /gene="dnaZX" /locus_tag="BQ2027_MB3748C" /product="DNA POLYMERASE III (SUBUNIT GAMMA/TAU) DNAZ/X" /note="Mb3748c, dnaZX, len: 578 aa. Equivalent to Rv3721c, len: 578 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 578 aa overlap). Probable dnaZX, DNA polymerase III gamma (dnaZ) and tau (dnaX) (EC 2.7.7.7), equivalent to O69514|DNAZX|ML2335 DNA POLYMERASE III SUBUNIT GAMMA/TAU from Mycobacterium leprae (611 aa) FASTA scores: opt: 2344, E(): 4.7e-118, (78.75% identity in 602 aa overlap). Also highly similar to many e.g. Q9RKL5|DNAZ from Streptomyces coelicolor (784 aa) FASTA scores: opt: 1755, E(): 1.8e-86, (59.55% identity in 435 aa overlap); Q9KGM4|DNAX|BH0034 from Bacillus halodurans (564 aa), FASTA scores: opt: 946, E(): 2.5e-43, (37.4% identity in 460 aa overlap); P09122|DP3X_BACSU|DNAX|DNAH from Bacillus subtilis (563 aa), FASTA scores: opt: 841, E(): 1e-37, (30.8% identity in 510 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3748c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3748c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P63976" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR008921" /db_xref="InterPro:IPR012763" /db_xref="InterPro:IPR022754" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P63976" /protein_id="SIU02377.1" /translation="MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPR GCGKTSSARILARSLNCAQGPTANPCGVCESCVSLAPNAPGSIDVVELDAASHGGVDD TRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLIFIFATTEPEK VLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDAVYPLVIRAGGGSPRDTLS VLDQLLAGAADTHVTYTRALGLLGVTDVALIDDAVDALAACDAAALFGAIESVIDGGH DPRRFATDLLERFRDLIVLQSVPDAASRGVVDAPEDALDRMREQAARIGRATLTRYAE VVQAGLGEMRGATAPRLLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQA VPRPSAAAAEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVM LAGATVRALEDNTLVLTHESAPLARRLSEQRNADVLAEALKDALGVNWRVRCETGEPA AAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRGDPSPRRDPEEVALELL QNELGARRIDNA" CDS complement(4108693..4110000) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3749C" /product="Aspartate transaminase (EC" /EC_number="2.6.1.1" /note="Mb3749c, -, len: 435 aa. Equivalent to Rv3722c, len: 435 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 435 aa overlap). Conserved hypothetical protein, equivalent to O69513|MLCB2407.14 (alias Q9CB73|ML2336, 463 aa) HYPOTHETICAL 46.8 KDA PROTEIN from Mycobacterium leprae (426 aa), FASTA scores: opt: 2505, E(): 8.3e-154, (87.25% identity in 424 aa overlap). Also highly similar to Q9RU17|DR1579 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (452 aa), FASTA scores: opt: 1162, E(): 3.1e-67, (44.8% identity in 422 aa overlap); and partially similar to Q9I371|PA1654 PROBABLE AMINOTRANSFERASE from Pseudomonas aeruginosa (388 aa) FASTA scores: opt: 162, E(): 0.0078, (25.85% identity in 348 aa overlap) and other aminotransferases. N-terminus extended since first submission (previously 408 aa). Protein product from Mb3749c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3749c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y516" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR024551" /db_xref="UniProtKB/TrEMBL:A0A1R3Y516" /protein_id="SIU02378.1" /translation="MSFDSLSPQELAALHARHQQDYAALQGMKLALDLTRGKPSAEQL DLSNQLLSLPGDDYRDPEGTDTRNYGGQHGLPGLRAIFAELLGIAVPNLIAGNNSSLE LMHDIVAFSMLYGGVDSPRPWIQEQDGIKFLCPVPGYDRHFAITETMGIEMIPIPMLQ DGPDVDLIEELVAVDPAIKGMWTVPVFGNPSGVTYSWETVRRLVQMRTAAPDFRLFWD NAYAVHTLTLDFPRQVDVLGLAAKAGNPNRPYVFASTSKITFAGGGVSFFGGSLGNIA WYLQYAGKKSIGPDKVNQLRHLRFFGDADGVRLHMLRHQQILAPKFALVAEVLDQRLS ESKIASWTEPKGGYFISLDVLPGTARRTVALAKDVGIAVTEAGASFPYRKDPDDKNIR IAPSFPSVPDLRNAVDGLATCALLAATETLLNQGLASSAPNVR" tRNA 4110217..4110302 /locus_tag="BQ2027_SERV" /product="tRNA-Ser" /note="serV, len: 86 nt. Equivalent to serV, len: 86 nt, from Mycobacterium tuberculosis strain H37RV, (100.0% identity in 86 nt overlap). tRNA-Ser, anticodon gga." CDS 4110408..4111172 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3750" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3750, -, len: 254 aa. Equivalent to Rv3723, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). Probable conserved transmembrane protein, with hydrophobic stretches at the N-terminus, and equivalent to O69512|ML2337|MLCB2407.13c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (250 aa), FASTA scores: opt: 1029, E(): 1.2e-44, (64.45% identity in 253 aa overlap). Protein product from Mb3750 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3750 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y537" /db_xref="UniProtKB/TrEMBL:A0A1R3Y537" /protein_id="SIU02379.1" /translation="MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGL RIATGALVGLAALPVVFTLLRTRKPELGTPQLALSMRIWSIMAHVLAGALIVGTAISE VWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKPKKPKQRRLRR KKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIESPGGEPESATREAPAAETA TAEEPRGGLRNRRPTGKTSHRRRRTRSGVQVAKVDE" CDS 4111339..4112040 /codon_start=1 /transl_table=11 /gene="cut5" /locus_tag="BQ2027_MB3751" /product="probable cutinase [second part] cut5b" /note="Mb3751, cut5, len: 233 aa. Equivalent to Rv3724A (cut5a) and Rv3724B (cut5b), len: 80 aa and 187 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 67 aa overlap and 100.0% identity in 166 aa overlap). Probable cut5a, truncated cutinase precursor (EC 3.1.1.-), similar to N-terminal end of others e.g. Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 202, E(): 1.5e-06, (56.45% identity in 62 aa overlap); Q9XB09|RVD2-RV1758 PROTEIN (FRAGMENT) from Mycobacterium bovis BCG (143 aa), FASTA scores: opt: 200, E(): 1.5e-06, (61.4% identity in 57 aa overlap); and Q00298|CUTI_BOTCI|CUTA CUTINASE PRECURSOR from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 108, E(): 2.2, (40.4% identity in 52 aa overlap). Also highly similar to others from Mycobacterium tuberculosis e.g. O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E1 2.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 189, E(): 1.2e-05, (58.0% identity in 50 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 172, E(): 0.00015, (59.2% identity in 49 aa overlap); O06793|Rv1758|MTCY28.24|Z95890 HYPOTHETICAL 17.9 KDA PROTEIN (174 aa), FASTA scores: opt: 641, E(): 2.7e-29, (57.2% identity in 166 aa overlap); O06319|Rv3452|MTY13E12.05; and U00015_11 from Mycobaterium leprae. BELONGS TO THE CUTINASE FAMILY. Rest of cutinase ORF continues as Rv3724B|CUT5B, frameshifting could occur near position 4169668. Sequence has been checked but no errors found. Probable cut5b, truncated cutinase (EC 3.1.1.-), similar to C-terminal end of others e.g. Q9XB09|RVD2-RV1758 PROTEIN (FRAGMENT) from Mycobacterium bovis BCG (143 aa) FASTA scores: opt: 335, E(): 3.4e-12, (53.25% identity in 92 aa overlap); Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 251, E(): 2.5e-07, (44.05% identity in 168 aa overlap). Also similar to proteins from Mycobacterium tuberculosis e.g. O06793|Rv1758|MTCY28.24 HYPOTHETICAL 17.9 KDA PROTEIN (174 aa), FASTA scores: opt: 641, E(): 2.5e-29, (57.25% identity in 166 aa overlap); O06319|Rv3452|MTCY13E12.05 HYPOTHETICAL 23.1 KDA PROTEIN (226 aa), FASTA scores: opt: 385, E(): 7.5e-15, (46.65% identity in 165 aa overlap); O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E12.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 307, E(): 1.9e-10, (40.7% identity in 167 aa overlap); Q10837|CUT1_MYCTU|Rv1984c|MT2037|MTCY39.35 PROBABLE CUTINASE PRECURSOR (217 aa), FASTA scores: opt: 261, E(): 6.7e-08, (50.9% identity in 169 aa overlap); etc; and U00015_11 from Mycobacterium lepra. 5'-end of gene is Rv3724A|CUT5A; frameshifting may occur near position 4169668. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3724A and Rv3724B exist as 2 genes with an overlap region between them. In Mycobacterium bovis, a single base deletion (t-*) leads to a single product. Protein product from Mb3751 detected using SWATH mass spectrometry and 0. Mb3751 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y4Z4" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y4Z4" /protein_id="SIU02380.1" /translation="MDVIRWARRLAVVAGTAAAVTTPGLLSAHVPMVSAEPCPDVEVV FARGTGEPPGIGSVGGLFVDALRSQVGAKSLGVYAVNYPASNDFASSDFPKTVIDGIR DAGSHIQSMAMSCPQTRQVLGGYSQGAAVAGYVTSAVVPPAVPVQAVPAPMAPEVANH VAAVTLFGAPSAQFLGQYGAPPIAIGPLYQPKTLQLCADGDSICGDGNSPVAHGLYAV NGMVGQGANFAASRL" CDS 4112085..4113086 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3752" /product="possible oxidoreductase" /note="Mb3752, -, len: 333 aa. Equivalent to Rv3725, len: 309 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 250 aa overlap). Possible reductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. O34285|HPNA HPNA PROTEIN from Zymomonas mobilis (337 aa), FASTA scores: opt: 317, E(): 6.1e-11, (30.5% identity in 272 aa overlap); Q9SZB3|F17M5.120|AT4G33360|AAK49584 HYPOTHETICAL 37.9 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (344 aa), FASTA scores: opt: 314, E(): 9.1e-11, (30.35% identity in 267 aa overlap); AAK59445|AT4G33360 PUTATIVE DIHYDROKAEMPFEROL 4-REDUCTASE from Arabidopsis thaliana (Mouse-ear cress) (332 aa), FASTA scores: opt: 313, E(): 1e-10, (30.8% identity in 263 aa overlap); Q9FSC6|CCR CINNAMOYL-COA REDUCTASE (EC 1.2.1.44) from Populus trichocarpa (Western balsam poplar) (338 aa), FASTA scores: opt: 305, E(): 2.9e-10, (30.3% identity in 274 aa overlap); Q9M631 CINNAMOYL CoA REDUCTASE from Populus tremuloides (Quaking aspen) (337 aa), FASTA scores: opt: 291, E(): 1.8e-09, (30.15% identity in 272 aa overlap); P73212|DFRA_SYNY3|LR1706 PUTATIVE DIHYDROFLAVONOL-4-REDUCTASE (EC 1.1.1.219) (DIHYDROKAEMPFEROL 4-REDUCTASE) from Synechocystis sp. strain PCC 6803 (343 aa), FASTA scores: opt: 278, E(): 1e-08, (29.35% identity in 259 aa overlap); etc. Also some similarity to proteins from Mycobacterium tuberculosis e.g. P96816|Rv0139|MTCI5.13 HYPOTHETICAL PROTEIN (340 aa) FASTA scores: opt: 234, E(): 3.2e-06, (28.25% identity in 269 aa overlap); and O06373|galE1|Rv3634c|MTCY15C10.18 PROBABLE UDP-GLUCOSE 4-EPIMERASE (314 aa) (27.3% identity in 194 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-a) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis (333 aa versus 309 aa). Protein product from Mb3752 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3752 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y527" /db_xref="InterPro:IPR001509" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y527" /protein_id="SIU02381.1" /translation="MQNATMRVLVTGGTGFVGGWTAKAIADAGHSVRFLVRNPARLKT SVAKLGVDVSDFAVADISDRDSVREALNGCDAVVHSAALVATDPRETSRMLSTNMAGA QNVLGQAVELGMDPIVHVSSFTALFRPNLATLSADLPVAGGTDGYGQSKAQIEIYARG LQDAGAPVNITYPGMVLGPPVGDQFGEAGEGVRSALWMHVIPGRGAAWLIVDVRDVAA LHAALLESGRGPRRYTAGGHRIPVPELAKILGEVAGTTMLAVPVPDSALRVAGSVLDQ AGPYLPFNTPFTAAGMQYYTQMPEPDDSPSEKELGITYRDPRDTVADTVTALRGLGS" CDS 4113293..4114486 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3753" /product="possible dehydrogenase" /note="Mb3753, -, len: 397 aa. Equivalent to Rv3726, len: 397 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 397 aa overlap). Possible dehydrogenase (EC 1.-.-.-), similar to many e.g. O34788|YDJL DEHYDROGENASE from Bacillus subtilis (346 aa) FASTA scores: opt: 401, E(): 3.4e-17, (29.6% identity in 395 aa overlap); Q59696|ADH 2,3-BUTANEDIOL DEHYDROGENASE (EC 1.1.1.4) from seudomonas putida (362 aa), FASTA scores: opt: 326, E(): 1.3e-12, (29.45% identity in 387 aa overlap); AAG59541|YJJN PUTATIVE OXIDOREDUCTASE from Escherichia coli strain EDL933 (345 aa), FASTA scores: opt: 325, E(): 1.5e-12, (30.85% identity in 256 aa overlap); Q9HWM8|PA4153 2,3-BUTANEDIOL DEHYDROGENASE from Pseudomonas aeruginosa (363 aa), FASTA scores: opt: 324, E(): 1.8e-12, (30.5% identity in 387 aa overlap); etc. Protein product from Mb3753 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3753 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y528" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y528" /protein_id="SIU02382.1" /translation="MKAVTCTNAKLEVVDRPSPAPAKGQLLLDVLRCGICGSDLHARL HCDELADVMAESGYHAFMRSNQQVVFGHEFCGEVVDYGPGTRRTPRRGTPVVAMPLLR RGNKEVHGIGLSTMAPGAYAERLVVEQSLTFPVPNGLAPEIAALTEPMAVGWHAVRRG EVGKGDVAIVIGCGPIGLAVICMLKSRGVHTVIASDFSPGRRALATACGADSVVDPVQ DSPYAVAAGLGQGNRHLQSILDAFDLAVGTVESLQRLRLPWWHLWRAAEAAGAATPKR PVIFECVGVPGIIDGIIASAPLFSRVVVVGVCMGSDHIRPAMAINKEINLRFVLGYTP LEFRDTLHMLADGKVNAAPLITGTVGLPGVAAAFDALGDPEAHAKIMIDPKSNAASPQ PFRVE" CDS 4114827..4116635 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3754" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3754, -, len: 602 aa. Equivalent to Rv3727, len: 602 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 602 aa overlap). Possible oxidoreductase (EC 1.-.-.-), similar to several plants phytoene dehydrogenases/desaturases (EC 1.3.-.-) e.g. Q9HSE1|CRTI3|VNG0277G PHYTOENE DEHYDROGENASE from Halobacterium sp. strain NRC-1 (541 aa), FASTA scores: opt: 299, E(): 1.1e-10, (29.85% identity in 576 aa overlap); Q9FZL6|CITPDS1 PHYTOENE DESATURASE from Citrus unshiu (Satsuma orange) (553 aa), FASTA scores: opt: 164, E(): 0.018, (24.2% identity in 434 aa overlap); Q07356|CRTI_ARATH|PDS|AT4G14210|DL3145c PHYTOENE DEHYDROGENASE PRECURSOR from Arabidopsis thaliana (Mouse-ear cress) (566 aa), FASTA scores: opt: 163, E(): 0.021, (23.95% identity in 434 aa overlap); etc. N-terminal end similar to O69871|SC1C3.29 PUTATIVE PROTOPORPHYRINOGEN OXIDASE (FRAGMENT) from Streptomyces coelicolor (61 aa), FASTA scores: opt: 154, E(): 0.012, (60.45% identity in 43 aa overlap). The region between aa 155-310 is highly similar to Q49778|B2126_C1_169 from Mycobacterium leprae (159 aa), FASTA scores: opt: 437, E(): 1.5e-19, (46.6% identity in 161 aa overlap). And the region between aa 462-546 is highly similar to the N-terminal end of Q50003|U1764T from Mycobacterium leprae (155 aa), FASTA scores: opt: 277, E(): 8.3e-10, (57.65% identity in 85 aa overlap). Mb3754 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y506" /db_xref="InterPro:IPR002937" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y506" /protein_id="SIU02383.1" /translation="MKPSPADTHVVIAGAGIAGLAAAMILAEAGVRVTLCEAASEAGG KAKSLRLADGHPTEHSLRVYTDTYQTLLTLFSRIPTEHDRTMLDNLVGVSMVSATAQG VIGRIAAPVALQRRRPTFARIIGKVVEPPRQLVRILLRGPMVIVGLAQRGVPATDVLH YLYAHLRLLWMCRERLLAELGDISYADYLQLGCKSAQAQEFFSAVPRIYVAARTSAEA AAIAPIVLKGLFRLKSNCPSALNDAKLPAIMMMDGPTSERMVDPWIRHLTRLGVDIHF NTRVGDLEFDDGRVTALISSDGRRFACDYALLAVPYLTLRELAKSAHVKRYLPQLTQQ HALALEASNGIQCFLRDLPATWPPFIRPGVVTTHLQSQWSLVCVLQGEGFWKNVRLPE GTRYVLSITWSDVETPGPVFDRPLSECTPDEILTECLTQCGLDKSNVLGWRIDHELKH LDEAEYEKVASELPPHLVSAPARGQRMVNFSPLTVLMPGARHRSPGICTSVPNLLLAG EVIYSPDLTLFVPTMEKAACSGYLAARQIMNMVASHAAPLRIDFRDPAPFAVLRRVDR WFWSRRRRPPDRSTFATPPTAMPAPSHLTDVDRSAS" CDS 4116745..4119942 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3755" /product="PROBABLE CONSERVED TWO-DOMAIN MEMBRANE PROTEIN" /note="Mb3755, -, len: 1065 aa. Equivalent to Rv3728, len: 1065 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1065 aa overlap). Probable conserved transmembrane protein organised into two domains. Domain comprising the first ~510 aa residues is similar to various multidrug resistance and efflux proteins and contains sugar transport protein signature 1 (PS00216). Domain corresponding to the last 550 aa residues contains cyclic nucleotide-binding domain signature 2 (PS00889) and is very similar to Q50733|YP65_MYCTU|Rv2565|MT2641|MTCY9C4 .03c hypothetical 62.1 kDa protein from Mycobacterium tuberculosis (31.0% identity in 546 aa overlap). Highly similar to O05884|Rv3239c|MTCY20B11.14c PROBABLE TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis (1048 aa) FASTA scores: opt: 4328, E(): 5e-201, (64.15% identity in 1046 aa overlap). N-terminal end similar to P71879|Rv2333c|MTCY3G12.01|MTCY98.02c (537 aa); P71836|Rv0783c|MTCY369.27c (540 aa); and O07753|Rv1877|MTCY180.41c (687 aa). SEEMS BELONG TO THE SUGAR TRANSPORTER FAMILY. Possibly member of major facilitator superfamily (MFS). Mb3755 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y515" /db_xref="InterPro:IPR000595" /db_xref="InterPro:IPR002641" /db_xref="InterPro:IPR004638" /db_xref="InterPro:IPR005829" /db_xref="InterPro:IPR011701" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR018488" /db_xref="InterPro:IPR018490" /db_xref="InterPro:IPR020846" /db_xref="InterPro:IPR036259" /db_xref="UniProtKB/TrEMBL:A0A1R3Y515" /protein_id="SIU02384.1" /translation="MHTVATNNAAPVIAAGPVGPSRRRRRVHAPLTRRRQPSSSAVLL VAAFGAFLAFLDSTIVNVAFPDIQRHFHSDISDLSWMLNAYNIVFAAFLVAAGRLADL MGRKRVFILGVALFTVASGLCAIAESVGELVAFRVLQGIGAAVLVPASLGLVVEAFPA ERRAHGVNLWGAAGAIAAGLGPPIGGALIEADGWRWVFLVNLPLGVFAVLAARRALVE NRAAGRRRVPDVRGAVLLAFALGLLTLGLIKGPDWGWASLPTSGSLLAAAVAMVGFVM SSRHHPAPMVEPTLLRIQSFVAGTGLTAVASAGFYAYLLTHVLFLNYVWGYTLLEAGM AVAPAALVAAVVAAVLGRVADRHGYRFIVGIGALIWAASLLWYLKVVGSQPDFLGEWL PGQILQGIGVGATFPLLGSAALARLAKGGSYATASAVTGTIRQVGAVIGVAVLVILVG TPAPGAAEEALRHRWALAAICFVAVGIGALSLGRIRPVPAAVEPPPGPPVAPLGARRP PRPAPVASPAAAVAPTPKTSREVNLLEALRFARPDTQQIELQAGSYLFHAGDVSDALY VVRSGRLQVLAGDGAKDEVVAELGRGQVVGELGVLLDAPRSASVRAVRDSSLMRVTKA EFAKIADAGVLGALAGVLAKRQHQTRVASQRTTPEVVVAVVGVDANAPVAMVATELCR ALSTRLRAVAPGRVDCDGLERAEQTADRVVLHAAVGDARWREFCLRVADRVVLVASNP AVPVAPLPTRATGADLVLAGRPAGREHRRAWEQLITPRSMHVVRREFVADDLRVLATR IAGRSVGLVLSGGAARACAHLGVLEELEAAGVTVDRFAGTSMGAIIAALAASGLDAAG VDAQIYEHFVRKSHGDYTLPSKGLIRGKRTQSTLRTIFGDHLVEELPKHFRCVSVDLL ARRPVVHRQGPLADVVGCSMRLPFLYAPLPYGGTLHVDGGVLDNVPVTTLVGKDGPLI AVNVASGGNPSPASGGHRRGKPRVPGLTDTLLRTMTISSAMASEKVLAQADLVIKPNP IGVGLMEYHQIDRAREAGRIAAREALPQIMELVHG" CDS 4120157..4122487 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3756" /product="POSSIBLE TRANSFERASE" /note="Mb3756, -, len: 776 aa. Equivalent to Rv3729, len: 776 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 776 aa overlap). Conserved hypothetical protein, possible transferase (EC 2.-.-.-), similar to several hypothetical proteins and various transferases e.g. O26919|MTH831 MOLYBDENUM COFACTOR BIOSYNTHESIS MOAA HOMOLOG from Methanobacterium thermoautotrophicum (497 aa), FASTA scores: opt: 697, E(): 4.8e-34, (30.7% identity in 492 aa overlap); Q58036|Y619_METJA|MJ0619 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (506 aa), FASTA scores: opt: 670, E(): 2e-32, (30.6% identity in 497 aa overlap); O27968|AF2316 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (518 aa), FASTA scores: opt: 477, E(): 6.4e-21, (29.4% identity in 500 aa overlap); BAB60102|TVG0985801 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Thermoplasma volcanium (606 aa), FASTA scores: opt: 402, E(): 2.1e-16, (28.1% identity in 509 aa overlap); etc. C-terminus similar to methyltransferases e.g. Q9S0N6|AVED C5-O-METHYLTRANSFERASE from Streptomyces avermitilis (283 aa), FASTA scores: opt: 298, E(): 1.9e-10, (31.5% identity in 292 aa overlap). Also similar to the Mycobacterium tuberculosis proteins P71673|YE05_MYCTU|Rv1405c|MT1449|MTCY21B4.22c (274 aa); and Q50584|Rv1523|MTCY19G5.05c. Protein product from Mb3756 detected using SWATH mass spectrometry. Mb3756 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6L4" /db_xref="InterPro:IPR007197" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR029063" /db_xref="InterPro:IPR034474" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6L4" /protein_id="SIU02385.1" /translation="MFVEYTKSICPVCKVVVDAQVNIRHDKVYLRKRCREHGSFEALV YGDAQMYLESARFNKPGTFPLRFQTEVRDGCPSDCGLCPDHKQHACLGLIEVNTHCNL DCPICFADSGHQPDGYAITAAQCERMLDTLVAAEGEPEVVMFSGGEPTIHKQLLEFVD AAQARPVKTIIINTNGIRLASDRRFVDQLATRNRPGHPVHIYLQFDGLDEATHRRIRG HDLRDVKQRALDNCAAAGLTVSLVAAVERGLNEHELGAVIRHGMAQPGVQSVVFQPVT HAGRHVQFDPLTRLTNSDIIACITAQLPEWFRPGDFFPVPCCFPSCRSITYLLTDGEH VVPIPRLLNVEDYLDYVSNRVIPDLAIREALENLWSASAVPGTDTMTAQLQRATAALN CAEGCGINLPEALTHLTDRVFAIVIQDFQDPYTLNVKQLMKCCVQQITPDGRLIPFCA YNSVGYREQVREQLTGVPVPDIVPNAIPLAGLLADAPHGSKQANTGGSIARLAGPTRG APMALPPQQIKACCADAYSRDIVALLLGDSFHPGGATLTRRLADQLGLRSTGDPRRVA DIAAGPGASARLLASDYGVAVDGVDISEINVKRAQAAVAQTGLTERVRFHLGDAESVP LPDDTFDALVCECAFCTFPDKNAAAQQFARILRPGGLAGITDVTVGDGGLPAELTPLA AWVACIADARTVTDYTDILEGAGLRTRHIESHDESLLDMIDRIDARITALHVAAPEIL ADNGIRHDSVRDFTALARAAVQTGRIGYTLMIAEKP" CDS complement(4122552..4123592) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3757C" /product="ATP-dependent DNA ligase (EC" /EC_number="6.5.1.1" /note="Mb3757c, -, len: 346 aa. Equivalent to Rv3730c, len: 346 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 346 aa overlap). Conserved hypothetical protein, highly similar to Q9XAM1|SC4C6.19 HYPOTHETICAL 38.5 KDA PROTEIN from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1313, E(): 2.2e-75, (59.25% identity in 336 aa overlap); and similar to C-terminal end of PUTATIVE ATP-DEPENDENT DNA LIGASES e.g. BAB49297|MLL2077 from Rhizobium loti (Mesorhizobium loti) (833 aa), FASTA scores: opt: 550, E(): 5.3e-27, (31.3% identity in 294 aa overlap); and BAB54816|MLL9625 from Rhizobium loti (Mesorhizobium loti) plasmid pMLb (883 aa) FASTA scores: opt: 492, E(): 2.5e-23, (33.7% identity in 291 aa overlap); etc. Also similar to the hypothetical proteins e.g. Q9ZC15|SC1E6.07 HYPOTHETICAL 34.9 KDA PROTEIN from Streptomyces coelicolor (319 aa) FASTA scores: opt: 537, E(): 1.5e-26, (34.95% identity in 292 aa overlap); Q9XAF7|SC6G9.25 HYPOTHETICAL 32.1 KDA PROTEIN from Streptomyces coelicolor (293 aa), FASTA scores: opt: 474, E(): 1.3e-22, (33.75% identity in 302 aa overlap); etc. Also highly similar to P95226|Rv0269c|MTCY06A4.13c HYPOTHETICAL 44.0 KDA PROTEIN from Mycobacterium tuberculosis (397 aa), FASTA scores: opt: 940, E(): 7.7e-52, (50.3% identity in 312 aa overlap). Protein product from Mb3757c detected using SWATH mass spectrometry. Mb3757c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR014145" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E6" /protein_id="SIU02386.1" /translation="MAAAAEELDVDGIAVRLTSPDRMYFPKLGSHGTKRRLVEYYFAV AGGPMLTALRDRPTHLQRFPDGVDGEQIYQKRIPRHRPDYLQTCRVTFPSGRMADALK VTHPAAIVWAAQMGTITLHPWQVRCPDTEHPDELRIDLDPQPGTGFVEARTVAVDVLR SVLDDLGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGIALAREVERRAPDAVTTSW WKEERGARIFIDFNQNARDRTMASAYSVRPTPIATVSMPLTWEELAGADPDDYTMTTV PELVKIRDDPWAGMDDVAQSIAPLLDLAAADEERGLGDMPYPPNYPKMPGEPKRVQPS RDTDLKGGNTSK" CDS 4123630..4124706 /codon_start=1 /transl_table=11 /gene="ligC" /locus_tag="BQ2027_MB3758" /product="possible atp-dependent dna ligase ligc (polydeoxyribonucleotide synthase [atp]) (polynucleotide ligase [atp]) (sealase) (dna repair protein) (dna joinase)" /note="Mb3758, ligC, len: 358 aa. Equivalent to Rv3731, len: 358 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 358 aa overlap). Possible ligC, DNA ligase ATP-dependent (EC 6.5.1.1), similar to numerous archaebacterial and eukaryotic polynucleotide DNA ligases e.g. Q9XAM3|SC4C6.17c from Streptomyces coelicolor (355 aa), FASTA scores: opt: 1429, E(): 1.7e-82, (60.4% identity in 361 aa overlap); BAB54870|MLL9685 from Rhizobium loti (Mesorhizobium loti) plasmid pMLb (337 aa), FASTA scores: opt: 667, E(): 1.2e-34, (40.35% identity in 347 aa overlap); Q9HH07|DNLI_THEFM|LIG from Thermococcus fumicolans (559 aa), FASTA scores: opt: 335, E(): 1.4e-13, (27.25% identity in 330 aa overlap); O59288|DNLI_PYRHO from Pyrococcus horikoshii (559 aa), FASTA scores: opt: 307, E(): 8e-12, (26.85% identity in 272 aa overlap); etc. Also similar to Rv3062|MTCY22D7_19c|LIGB PROBABLE DNA LIGASE from Mycobacterium tuberculosis (507 aa), FASTA score: (30.3% identity in 356 aa overlap). SEEMS TO BELONG TO THE ATP-DEPENDENT DNA LIGASE FAMILY. Protein product from Mb3758 detected using SWATH mass spectrometry. Mb3758 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y525" /db_xref="InterPro:IPR012309" /db_xref="InterPro:IPR012310" /db_xref="InterPro:IPR012340" /db_xref="UniProtKB/TrEMBL:A0A1R3Y525" /protein_id="SIU02387.1" /translation="MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQV ELGSRNERPMTRYFPELVAAIRAELPHRCVIDGEIIIATDHGLDFEALQQRIHPAESR VRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADADLSIHVTPAT TDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKIKHLRTADCVVAGYRVHKS GSDAIGSLLLGLYQEDGQLASVGVIGAFPMAERRRLLTELQPLVTSFDDHPWNWAAHV AGQRTPRKNEFSRWNVGKDLSFVPLRPERVVEVRYDHMEGARFRHTAQFNRWRPDRDP RSCSYAQLERPLTVSLSDIVPGLR" CDS 4124806..4125864 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3759" /product="conserved protein" /note="Mb3759, -, len: 352 aa. Equivalent to Rv3732, len: 352 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 352 aa overlap). Conserved hypothetical protein. The region between aa 175-352 is highly similar to the region between aa 72-257 of Q9KH39 HYPOTHETICAL 55.5 KDA PROTEIN from Mycobacterium smegmatis (511 aa), FASTA scores: opt: 1122, E(): 7.3e-63, (98.85% identity in 176 aa overlap). Also shows some similarity with Q55304 HYPOTHETICALK PROTEIN from Synechocystis sp. strain PCC 6803 (387 aa), FASTA scores: opt: 201, E(): 2.7e-05, (27.1% identity in 251 aa overlap); and P74254|SLR1173 HYPOTHETICAL 52.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (463 aa), FASTA scores: opt: 201, E(): 3.1e-05, (27.1% identity in 251 aa overlap). Also slightly similar to MTCY01B2_21 and DPO1_MYCTU DNA POLYMERASE I. Protein product from Mb3759 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3759 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y522" /db_xref="InterPro:IPR019283" /db_xref="UniProtKB/TrEMBL:A0A1R3Y522" /protein_id="SIU02388.1" /translation="MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGS QATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDT LSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTW LSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMR LSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHG SYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGA AGGAVVVVLRRRRRAHTG" CDS complement(4125884..4126384) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3760C" /product="MutT/Nudix family protein" /note="Mb3760c, -, len: 166 aa. Equivalent to Rv3733c, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 166 aa overlap). Conserved hypothetical protein, highly similar to Q9FCB0|2SCG58.03 PUTATIVE MUTT-LIKE PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 541, E(): 7.2e-29, (52.7% identity in 148 aa overlap); and BAB49143|MLR1881 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (156 aa), FASTA scores: opt: 526, E(): 7.2e-28, (52.65% identity in 150 aa overlap). Mb3760c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y548" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y548" /protein_id="SIU02389.1" /translation="MPKLSAGVLLYRARAGVVDVLLAHPGGPFWAGKDDGAWSIPKGE YTGGEDPWLAARREFSEEIGLCVPDGPRIDFGSLKQSGGKVVTVFGVRADLDITDARS STFELDWPKGSGKMRKFPEVDRVSWFPVARARTKLLKGQRGFLDRLMAHPAVAGLSEG PESLPR" CDS complement(4126398..4127762) /codon_start=1 /transl_table=11 /gene="tgs2" /locus_tag="BQ2027_MB3761C" /product="putative triacylglycerol synthase (diacylglycerol acyltransferase) tgs2" /note="Mb3761c, -, len: 454 aa. Equivalent to Rv3734c, len: 454 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 454 aa overlap). Hypothetical protein, highly similar to O69707|Y1E0_MYCTU|Rv3740c|MT3848|MTV025. 088c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (448 aa), FASTA scores: opt: 1917, E(): 1.3e-111, (61.4% identity in 451 aa overlap); and similar to many other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. P71694|YE43_MYCTU|Rv1425|MT1468|MTCY21B4 .43|MTCY493.29c (459 aa), FASTA scores: opt: 824, E(): 1.1e-43, (36.5% identity in 460 aa overlap); Q50680|YM85_MYCTU|Rv2285|MT2343|MTCY339.25c (445 aa) FASTA scores: opt: 766, E(): 4.1e-40, (36.4% identity in 453 aa overlap); etc. Also similar to Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 331, E(): 4.3e-13, (32.9% identity in 468 aa overlap); and Q9X7A8|ML1244|MLCB1610.05 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 296, E(): 7e-11, (28.35% identity in 413 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. Start site chosen by homology, but may extend further upstream. Protein product from Mb3761c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3761c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67211" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/Swiss-Prot:P67211" /protein_id="SIU02390.1" /translation="MDLMMPNDSMFLFIESREHPMHVGGLSLFEPPQGAGPEFVREFT ERLVANDEFQPMFRKHPATIGGGIARVAWAYDDDIDIDYHVRRSALPSPGRVRDLLEL TSRLHTSLLDRHRPLWELHVVEGLNDGRFAMYTKMHHALIDGVSAMKLAQRTLSADPD DAEVRAIWNLPPRPRTRPPSDGSSLLDALFKMAGSVVGLAPSTLKLARAALLEQQLTL PFAAPHSMFNVKVGGARRCAAQSWSLDRIKSVKQAAGVTVNDAVLAMCAGALRYYLIE RNALPDRPLIAMVPVSLRSKEDADAGGNLVGSVLCNLATHVDDPAQRIQTISASMDGN KKVLSELPQLQVLALSALNMAPLTLAGVPGFLSAVPPPFNIVISNVPGPVDPLYYGTA RLDGSYPLSNIPDGQALNITLVNNAGNLDFGLVGCRRSVPHLQRLLAHLESSLKDLEQ AVGI" CDS 4127961..4128449 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3762" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3762, -, len: 162 aa. Equivalent to Rv3735, len: 162 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 162 aa overlap). Conserved hypothetical protein, highly similar to several bacterial hypothetical proteins e.g. Q9UX41|ORF-C09_016|SSO0651|AAK40956 from Sulfolobus solfataricus (163 aa), FASTA scores: opt: 627, E(): 1.2e-34, (55.9% identity in 161 aa overlap); O26795|MTH699 from Methanobacterium thermoautotrophicum (168 aa), FASTA scores: opt: 616, E(): 6.7e-34, (56.1% identity in 155 aa overlap); |Q9Y9J9|APE2289 from Aeropyrum pernix (191 aa), FASTA scores: opt: 591, E(): 3.4e-32, (54.65% identity in 161 aa overlap); etc. Contains PS00435 Peroxidases proximal heme-ligand signature. Protein product from Mb3762 detected using SWATH mass spectrometry. Mb3762 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR007153" /db_xref="InterPro:IPR036902" /db_xref="UniProtKB/TrEMBL:A0A1R3Y531" /protein_id="SIU02391.1" /translation="MSLAWDVVSVDKPDDVNVVIGQAHFIKAVEDLHEAMVGVSPSLR FGLAFCEASGPRLVRHTGNDGDLVELATRTALAIAAGHSFVIFLREGFPINILNPVQA VPEVCTIYCATANPVDVVVAVTPHGRGIVGVVDGQTPLGVETDRDIAQRRDLLRAIGY KL" CDS 4128506..4129567 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3763" /product="TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ARAC/XYLS-FAMILY)" /note="Mb3763, -, len: 353 aa. Equivalent to Rv3736, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 353 aa overlap). Probable transcriptional regulator, araC/xylS family, similar to many transcriptional regulators and hypothetical proteins e.g. CAC38740 HYPOTHETICAL 35.4 KDA PROTEIN from Bradyrhizobium japonicum (318 aa), FASTA scores: opt: 438, E(): 2e-20, (29.4% identity in 306 aa overlap); Q9HZ25|PA3215 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (337 aa), FASTA scores: opt: 395, E(): 1.1e-17, (30.3% identity in 320 aa overlap); Q9HTN1|PA5324 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (356 aa), FASTA scores: opt: 313, E(): 1.8e-12, (25.85% identity in 329 aa overlap); Q9Z3Y6|PHBR TRANSCRIPTIONAL REGULATOR PHBR from Pseudomonas sp. 61-3 (379 aa), FASTA scores: opt: 271, E(): 8.3e-10, (22.95% identity in 357 aa overlap); etc. Also highly similar to Q06861|VIRS_MYCTU|Rv3082c|MTV013.03c POSSIBLE VIRULENCE-REGULATING PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 656, E(): 3.7e-34, (36.95% identity in 333 aa overlap); and similar to other hypothetical mycobacterial proteins e.g. P71663|YD95_MYCTU|Rv1395|MT1440|MTCY21B4.12 (344 aa). Contains helix-turn-helix motif at aa 245-266 (Score 1140, +3.07 SD). SEEMS BELONG TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb3763 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y533" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR032687" /db_xref="UniProtKB/TrEMBL:A0A1R3Y533" /protein_id="SIU02392.1" /translation="MSVVRGTALANYPSLVAGLGGDPATLLRAAGVRDQDVGNYDAFI SIRAAIRAIESAAAVTATMDFGRRLAQRQGIEILGPVGVAARTAATVGDALAIFNTFM AAYSPVIAIRITPLAGQRSFIALEFLLDEPASYPQTMELALGVALGVIRLLLGADYAP LAVHLPHDPLTPEAFYLQYFGCRPYFAERVGGFTMRTADLSRPLNRDDVAHRVVVDYL SSITPLGEGIVESVRTIVRQLLPTGAATLNVVAEQFHLHPKTLQRRLAEENTTFVILV DRVRKDVADRYLRTTGIGLTHLARELGYAEQSVLTRSCKRWFGTGPAAYRNQARLQTT VSAPGSGRGPNPGNVSVSC" CDS 4129571..4131160 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3764" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3764, -, len: 529 aa. Equivalent to Rv3737, len: 529 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 529 aa overlap). Probable conserved transmembrane protein, similar to others and also some hypothetical proteins e.g. AAK61331|THRE THREONINE EXPORT CARRIER from Corynebacterium glutamicum (Brevibacterium flavum) (489 aa), FASTA scores: opt: 773, E(): 1.8e-36, (37.25% identity in 424 aa overlap); Q9X8J0|SCE9.17 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (578 aa), FASTA scores: opt: 642, E(): 5.4e-29, (31.6% identity in 481 aa overlap) (shorter 119 aa at N-terminus); Q9CJU6|PM1895 HYPOTHETICAL PROTEIN from Pasteurella multocida (262 aa), FASTA scores: opt: 233, E(): 4.1e-06, (25.0% identity in 256 aa overlap); Q9S267|SCI30A.06 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (297 aa), FASTA scores: opt: 163, E(): 0.042, (29.65% identity in 263 aa overlap); etc. Also partially similar to O05435|Rv3910|MTCY15F10.01c|MTV028.01 HYPOTHETICAL 123.6 KDA PROTEIN from Mycobacterium tuberculosis (1184 aa) (34.4% identity in 125 aa overlap). Mb3764 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y517" /db_xref="InterPro:IPR010619" /db_xref="InterPro:IPR024528" /db_xref="UniProtKB/TrEMBL:A0A1R3Y517" /protein_id="SIU02393.1" /translation="MDQDRSDNTALRRGLRIALRGRRDPLPVAGRRSRTSGGIGDLHT RKVLDLTIRLAEVMLSSGSGTADVVATAQDVAQAYQLTDCVVDITVTTIIVSALATTD TPPVTIMRSVRTRSTDYSRLAELDRLVQRITSGGVAVDQAHEAMDELTERPHPYPRWL ATAGAAGFALGVAMLLGGTWLTCVLAAVTSGVIDRLGRLLNRIGTPLFFQRVFGAGIA TLVAVAAYLIAGQDPTALVATGIVVLLSGMTLVGSMQDAVTGYMLTALARLGDALFLT AGIVVGILISLRGVTNAGIQIELHVDATTTLATPGMPLPILVAVSGAALSGVCLTIAS YAPLRSVATAGLSAGLAELVLIGLGAAGFGRVVATWTAAIGVGFLATLISIRRQAPAL VTATAGIMPMLPGLAVFRAVFAFAVNDTPDGGLTQLLEAAATALALGSGVVLGEFLAS PLRYGAGRIGDLFRIEGPPGLRRAVGRVVRLQPAKSQQPTGTGGQRWRSVALEPTTAD DVDAGYRGDWPATCTSATEVR" CDS complement(4131157..4131513) /codon_start=1 /transl_table=11 /gene="PPE66" /locus_tag="BQ2027_MB3765C" /product="ppe family protein ppe66" /note="Mb3765c, PPE66, len: 118 aa. Equivalent to 3' end of Rv3738c, len: 315 aa, from Mycobacterium tuberculosis strain H37Rv, (98.1% identity in 105 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O53265|Rv3018c|MTV012.32c (434 aa), FASTA scores: opt: 464, E(): 2.2e-17, (47.05% identity in 338 aa overlap). Probably a continuation of the upstream ORF MTV025.87c. At position 97470-72 a stop codon is present which interrupts a possibly longer ORF, observed in related ORFs MTV012_32 or MTCY21B4_4. The sequence has been checked and no errors were detected. A similar situation, but with a frameshift separating the ORFs is found in MTV012_36/MTV012_35. Sequence similarity is also seen with MTCY251_15; MTCY261_19; MLCB2492_30 from Mycobacterium leprae; MTCY10G2_10; MTY21C12_9; MTCI125_26; MTCY164_36; MTCY6A4_1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a deletion of 1166 bp leads to the loss of the NH2 part of PPE66 and of Rv3739c compared to its homolog in Mycobacterium tuberculosis strain H37Rv. Mb3765c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y518" /protein_id="SIU02394.1" /translation="MTKASPVYLPGVKGRITADVAPVAVRPAAAPPLRESAAVRPEAR LVSAVAPAPAGTSASVLASDRGAGVLGFAGTAGKESVGRPAGLTTLAGGEFGGSPSVP MVPASWEQLVGAGEAG" CDS complement(4131539..4132885) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3766C" /product="possible triacylglycerol synthase (diacylglycerol acyltransferase)" /note="Mb3766c, -, len: 448 aa. Equivalent to Rv3740c, len: 448 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 448 aa overlap). Conserved hypothetical protein, highly similar to several other Mycobacterium tuberculosis hypothetical proteins e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa) FASTA scores: opt: 1917, E(): 2.3e-112, (61.4% identity in 451 aa overlap); Q50680|YM85_MYCTU|Rv2285|MT2343|MTCY339.2 5c (445 aa) FASTA scores: opt: 858, E(): 3.4e-46, (37.4% identity in 460 aa overlap); Q10554|Y895_MYCTU|Rv0895|MT0919|MTCY31.23 (505 aa), FASTA scores: opt: 767, E(): 1.9e-40, (44.3% identity in 467 aa overlap); MTCY31_25; MTCY28_26; MTCY493_29; MTCY21B4_43; MTCY8D5_16; MTCY3A2_28; MTV013_8; MTY13E12_33; MTV013_9; MTY20B11_9; etc. Also similar to Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 319, E(): 1.7e-12, (30.9% identity in 453 aa overlap). Mb3766c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6M1" /db_xref="InterPro:IPR004255" /db_xref="InterPro:IPR009721" /db_xref="InterPro:IPR014292" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6M1" /protein_id="SIU02395.1" /translation="MSPIDALFLSAESREHPLHVGALQLFEPPAGAGRGFVRETYQAM LQCREIAPLFRKRPTSLHGALINLGWSTDADVDLGYHARRSALPAPGRVRELLELTSR LHSNLLDRHRPLWETHVIEGLRDGRFAIYSKMHHALVDGVSGLTLMRQPMTTDPIEGK LRTAWSPATQHTAIKRRRGRLQQLGGMLGSVAGLAPSTLRLARSALIEQQLTLPFGAP HTMLNVAVGGARRCAAQSWPLDRVKAVKDAAGVSLNDVVLAMCAGALREYLDDNDALP DTPLVAMVPVSLRTDRDSVGGNMVGAVLCNLATHLDDPADRLNAIHASMRGNKNVLSQ LPRAQALAVSLLLLSPAALNTLPGLAKATPPPFNVCISNVPGAREPLYFNGARMVGNY PMSLVLDGQALNITLTSTADSLDFGVVGCCRSVPHVQRVLSHLETSLKELERAVGL" CDS complement(4132885..4133559) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3767C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3767c, -, len: 224 aa. Equivalent to Rv3741c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 224 aa overlap). Possible oxidoreductase, probably combines with product of upstream ORF MTV025.090c to form a functional monooxygenase (EC 1.-.-.-), highly similar to C-terminal end of various oxidoreductases e.g. Q9APW3 AROMATIC-RING HYROXYLASE from Pseudomonas aeruginosa (508 aa), FASTA scores: opt: 549, E(): 5.9e-28, (56.1% identity in 155 aa overlap); Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 487, E(): 5.6e-24, (39.55% identity in 225 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 460, E(): 4.7e-22, (38.5% identity in 226 aa overlap); etc. Also similar to C-terminal end of Mycobacterium tuberculosis proteins (generally monooxygenases) e.g. P96223|Rv3854c|MTCY01A6.14 HYPOTHETICAL 55.3 KDA PROTEIN (489 aa), FASTA scores: opt: 542, E(): 1.6e-27, (50.0% identity in 162 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 462, E(): 2.2e-22, (37.15% identity in 226 aa overlap); O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa), FASTA scores: opt: 462, E(): 2.2e-22, (45.65% identity in 173 aa overlap); etc. Note similarity to MTCY01A6.14 and MTV013.04 continue in upstream ORF (MTV025.090c) after a gap of ~100 aa. Mb3767c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F4" /protein_id="SIU02396.1" /translation="MIGRDRAYAVTRRKDIAKQRLVWRLCQRYPRAARRLIRHLNAKQ LAAGYPADEHFKPVYNPWDQRLCAVPDADMFKAIRDGRASVVTEAIDTFTENGIRLQS GRELAADISITATGLNLLAFGGINLSVDGVAVDVAEKVAFKGFLLSDVSNFAGPHGRT RAHHLLSAAARSHADPAAAGRRSPLADLKVLREGPVDDDHLRFTTSASASRLTVKRIT RSTPWN" CDS complement(4133556..4133951) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3768C" /product="POSSIBLE OXIDOREDUCTASE" /note="Mb3768c, -, len: 131 aa. Equivalent to Rv3742c, len: 131 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 131 aa overlap). Possible oxidoreductase, probably combines with product of downstream ORF MTV025.090c to form a functional monooxygenase (EC 1.-.-.-), highly similar to N-terminal end of various oxidoreductases e.g. Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 170, E(): 0.00048, (47.55% identity in 103 aa overlap); Q9APW3 AROMATIC-RING HYROXYLASE from Pseudomonas aeruginosa (508 aa) FASTA scores: opt: 160, E(): 0.0022, (50.55% identity in 87 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 153, E(): 0.0097, (45.45% identity in 88 aa overlap); etc. Also similar to C-terminal end of Mycobacterium tuberculosis proteins (generally monooxygenases) e.g. P96223|Rv3854c|MTCY01A6.14 HYPOTHETICAL 55.3 KDA PROTEIN (489 aa), FASTA scores: opt: 140, E(): 0.044, (37.1% identity in 132 aa overlap); O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa) FASTA scores: opt: 133, E(): 0.13, (43.05% identity in 79 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 110, E(): 4.1, (42.85% identity in 77 aa overlap); etc. Note similarity to MTCY01A6.14 and MTV013.04 continue in downstream ORF (MTV025.089c) after a gap of ~100 aa. Mb3768c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y538" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y538" /protein_id="SIU02397.1" /translation="MHSEQSASIEHVDVLIVGAGISGTGAAYYLKTMQPAKTFAIVEA RYPAIRSDSDLHTFSYEFKPWQHEKATASADAIMVHRGRSLAGGDRTLRHRRTRHHEL RMVIIGSGATAVTLVPAMAQTAGAVTMPK" CDS complement(4134097..4136079) /codon_start=1 /transl_table=11 /gene="ctpJ" /locus_tag="BQ2027_MB3769C" /product="probable cation transporter p-type atpase ctpj" /note="Mb3769c, ctpJ, len: 660 aa. Equivalent to Rv3743c, len: 660 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 660 aa overlap). Probable ctpJ, cation-transporting P-type ATPase (EC 3.6.1.-), transmembrane protein highly similar to others e.g. Q9ZBF3|SC9B5.27 PUTATIVE CATION-TRANSPORTING ATPASE from Streptomyces coelicolor (638 aa), FASTA scores: opt: 1635, E(): 2.5e-86, (62.25% identity in 63.95 aa overlap); Q59997|CADA|SLR0797 CADMIUM-TRANSPORTING ATPASE from Synechocystis sp. strain PCC 6803 (642 aa), FASTA scores: opt: 1474, E(): 4.3e-77, (42.4% identity in 604 aa overlap); P30336|CADA_BACFI PROBABLE CADMIUM-TRANSPORTING ATPASE from Bacillus firmus (723 aa), FASTA scores: opt: 1327, E(): 1.3e-68, (36.6% identity in 626 aa overlap); etc. Also highly similar to O53160|CTPD_MYCTU|Rv1469|MT1515|MTV007.16 PROBABLE CATION-TRANSPORTING P-TYPE ATPASE D from Mycobacterium tuberculosis (657 aa), FASTA scores: opt: 1845, E(): 2.3e-98, (55.85% identity in 650 aa overlap). Contains PS00154 E1-E2 ATPases phosphorylation site and PS01229 Hypothetical family signature 2. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES)." /db_xref="GOA:A0A1R3Y539" /db_xref="InterPro:IPR001757" /db_xref="InterPro:IPR008250" /db_xref="InterPro:IPR018303" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR023298" /db_xref="InterPro:IPR023299" /db_xref="InterPro:IPR027256" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y539" /protein_id="SIU02398.1" /translation="MAVRELSPARCTSASPLVLARRTKLFALSEMRWAALALGLFSAG LLTQLCGAPQWVRWALFLACYATGGWEPGLAGLQALQRRTLDVDLLMVVAAIGAAAIG QIAEGALLIVIFATSGALEALVTARTADSVRGLMGLAPGTATRVGAGGGEETVNAADL RIGDIVLVRPGERISADATVLAGGSEVDQATVTGEPLPVDKSIGDQVFAGTVNGTGAL RIRVDRLARDSVVARIATLVEQASQTKARTQLFIEKVEQRYSIGMVAVTLAVFAVPPL WGETLQRALLRAMTFMIVASPCAVVLATMPPLLAAIANAGRHGVLAKSAIVMEQLGTT TRIAFDKTGTLTRGTPELAGIWVYERRFTDDELLRLAAAAEYPSEHPLGAAIVKAAQS RRIRLPTVGEFTAHPGCRVTARVDGHVIAVGSATALLGTAGAAALEASMITAVDFLQG EGYTVVVVVCDSHPVGLLAITDQLRPEAAAAISAATKLTGAKPVLLTGDNRATADRLG VQVGIDDVRAGLLPDDKVAAVRQLQAGGARLTVVGDGINDAPALAAAHVGIAMGSARS ELTLQTADAVVVRDDLTTIPTVIAMSRRARRIVVANLIVAVTFIAGLVVWDLAFTLPL PLGVARHEGSTIIVGLNGLRLLRHTAWRRAAGTAHR" CDS 4136146..4136508 /codon_start=1 /transl_table=11 /gene="nmtr" /locus_tag="BQ2027_MB3770" /product="metal sensor transcriptional regulator (arsr-smtb family)" /note="Mb3770, -, len: 120 aa. Equivalent to Rv3744, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 120 aa overlap). Probable transcriptional regulator, possible arsR family, highly similar to many e.g. Q9ZBF4|SC9B5.26c from Streptomyces coelicolor (120 aa), FASTA scores: opt: 480, E(): 2.4e-24, (63.25% identity in 117 aa overlap); O31844|YOZA YOZA REGULATOR from Bacillus subtilis (107 aa), FASTA scores: opt: 249, E(): 1.6e-09, (44.8% identity in 96 aa overlap); P30340|SMTB_SYNP7|SMTB from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (122 aa), FASTA scores: opt: 230, E(): 2.9e-08, (46.0% identity in 87 aa overlap); etc. Equivalent to AAK48216 from Mycobacterium tuberculosis strain CDC1551 (135 aa) but shorter 15 aa. Also similar to MTCY27_22; MTCY39_25; and MTCY441_12. Contains helix-turn-helix motif at aa 47-68 (Score 1815, +5.37 SD). SEEMS TO BELONG TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3770 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3770 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y562" /db_xref="InterPro:IPR001845" /db_xref="InterPro:IPR011991" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR036390" /db_xref="UniProtKB/TrEMBL:A0A1R3Y562" /protein_id="SIU02399.1" /translation="MGHGVEGRNRPSAPLDSQAAAQVASTLQALATPSRLMILTQLRN GPLPVTDLAEAIGMEQSAVSHQLRVLRNLGLVVGDRAGRSIVYSLYDTHVAQLLDEAI YHSEHLHLGLSDRHPSAG" CDS complement(4136592..4136804) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3771C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3771c, -, len: 70 aa. Equivalent to Rv3745c, len: 70 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 70 aa overlap). Conserved hypothetical protein, highly similar to others e.g. N-terminus of Q9X4E6 HYPOTHETICAL 13.4 KDA PROTEIN from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (124 aa), FASTA scores: opt: 279, E(): 4.4e-14, (59.4% identity in 69 aa overlap); N-terminus of Q9A2A6|CC3660 HYPOTHETICAL PROTEIN from Caulobacter crescentus (172 aa) FASTA scores: opt: 272, E(): 1.9e-13, (63.35% identity in 60 aa overlap); N-terminus of P74345|SLR1628 HYPOTHETICAL 14.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (134 aa), FASTA scores: opt: 233, E(): 1.3e-10, (54.85% identity in 62 aa overlap); etc." /db_xref="InterPro:IPR018714" /db_xref="UniProtKB/TrEMBL:A0A1R3Y523" /protein_id="SIU02400.1" /translation="MSDCNVLGGALEQGGTDPLTGFYRDGCCATGPEDLGWHTICAVM TTEFLAHQRSVGNDLSIARPPRWLRP" CDS complement(4136877..4137212) /codon_start=1 /transl_table=11 /gene="PE34" /locus_tag="BQ2027_MB3772C" /product="probable pe family protein pe34 (pe family-related protein)" /note="Mb3772c, PE34, len: 111 aa. Equivalent to Rv3746c, len: 111 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 111 aa overlap). Probable member of the Mycobacterium tuberculosis PE family, but without the glycine-rich C-terminal part, similar to N-termini of many e.g. O69737|Rv3872|MTV027.07 (99 aa) FASTA scores: opt: 306, E(): 1e-13, (50.5% identity in 99 aa overlap); O53215|Rv2490c|MTV008.46 (1660 aa) FASTA scores: opt: 125, E(): 0.99, (34.25% identity in 111 aa overlap). Also weakly similar to MTV008_46; MTCI418B_6; MTCY130_1; MTY25D10_11; MTCY1A11_25; MTCY21B4_13; MTCY21B4_27; MTCY493_2; MTCY28_25; etc." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y541" /protein_id="SIU02401.1" /translation="MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAE EVSAWAVTAFTTAATGLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIP RPGQTLARE" CDS 4137430..4137813 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3773" /product="link to PE family" /note="Mb3773, -, len: 127 aa. Equivalent to Rv3747, len: 127 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 127 aa overlap). Hypothetical protein, highly similar to downstream ORF O69715|Rv3748|MTV025.096 CONSERVED HYPOTHETICAL PROTEIN (119 aa), FASTA scores: opt: 494, E(): 6e-27, (64.4% identity in 118 aa overlap). Protein product from Mb3773 detected using shotgun mass spectrometry. Mb3773 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y536" /protein_id="SIU02402.1" /translation="MILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVL TQAEPDSSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWV LVVTGGTGAISLPVLVSDMPATIGF" CDS 4137943..4138302 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3774" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3774, -, len: 119 aa. Equivalent to Rv3748, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 119 aa overlap). Hypothetical protein, highly similar to upstream ORF O69714|Rv3747|MTV025.095 CONSERVED HYPOTHETICAL PROTEIN (127 aa), FASTA scores: opt: 496, E(): 2.5e-28, (64.4% identity in 118 aa overlap). Protein product from Mb3774 detected using SWATH mass spectrometry. Mb3774 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y519" /protein_id="SIU02403.1" /translation="MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLT QAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVL VVTGDAGTISLPLIVTG" CDS complement(4138335..4138844) /codon_start=1 /transl_table=11 /gene="vapC50" /locus_tag="BQ2027_MB3775C" /product="Putative ribonuclease VapC50 (RNase VapC50) (EC (Toxin VapC50)" /EC_number="3.1.-.-" /note="Mb3775c, -, len: 169 aa. Equivalent to Rv3749c, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 169 aa overlap). Hypothetical protein, showing some similarity with O85864 HYPOTHETICAL 21.4 KDA PROTEIN from Sphingomonas aromaticivorans plasmid pNL1 (196 aa), FASTA scores: opt: 148, E(): 0.011, (32.7% identity in 104 aa overlap); Q9LCU6 HYPOTHETICAL 21.2 KDA PROTEIN from Arthrobacter sp. TM1 (192 aa), FASTA scores: opt: 125, E(): 0.35, (31.5% identity in 92 aa overlap); Q9L631|SPCB MYO-INOSITOL-2-DEHYDROGENASE from Streptomyces spectabilis (374 aa); Q9WJP8|PRE-S1 PRE-S1 PROTEIN (FRAGMENT) from Hepatitis B virus (88 aa); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature. Mb3775c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y529" /protein_id="SIU02404.1" /translation="MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFA FGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFL LDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFG LGVLLPFDQ" CDS complement(4138912..4139304) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3776C" /product="possible excisionase" /note="Mb3776c, -, len: 130 aa. Equivalent to Rv3750c, len: 130 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 130 aa overlap). Possible excisionase, similar to others e.g. Q9LCU5 PUTATIVE EXCISIONASE from Arthrobacter sp. TM1 (174 aa) FASTA scores: opt: 297, E(): 1.2e-12, (40.35% identity in 114 aa overlap); O85865 PUTATIVE EXCISIONASE from Sphingomonas aromaticivorans plasmid pNL1 (152 aa), FASTA scores: opt: 223, E(): 7.3e-08, (39.15% identity in 97 aa overlap); Q9XBH1|XIS EXCISIONASE from Bacteroides fragilis (124 aa) FASTA scores: opt: 128, E(): 0.1, (30.7% identity in 88 aa overlap); etc. Also some similarity to transcriptional regulators. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. P71902|YN10_MYCTU|Rv2310|MT2372|MTCY3G12.24c (114 aa) FASTA scores: opt: 224, E(): 4.9e-08, (42.7% identity in 82 aa overlap). Contains helix-turn-helix motif at aa 55-76 (Score 1925,+5.74 SD). Mb3776c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6N1" /db_xref="InterPro:IPR009061" /db_xref="InterPro:IPR010093" /db_xref="InterPro:IPR041657" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6N1" /protein_id="SIU02405.1" /translation="MTSLLEVLGAPEVSVCGNAGQPMTLPEPVRDALYNVVLALSQGK GISLVPRHLKLTTQEAADLLNISRPTLVRLLEDGRIPFEKPGRHRRVSLDALLEYQQE TRSNRRAALGELSRDALGELQAALAEKK" CDS 4139580..4139795 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3777" /product="probable integrase (fragment)" /note="Mb3777, -, len: 71 aa. Equivalent to Rv3751, len: 71 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 71 aa overlap). Probable integrase (fragment), similar to part of many e.g. Q48908 INTEGRASE (FRAGMENT) from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 206, E(): 5.5e-08, (57.65% identity in 59 aa overlap); Q9ZWV7|INT INTEGRASE from Corynephage 304L (395 aa), FASTA scores: opt: 156, E(): 0.00036, (45.75% identity in 59 aa overlap); Q9K722|BH3551 INTEGRASE (PHAGE-RELATED PROTEIN) from Bacillus halodurans (378 aa), FASTA scores: opt: 151, E(): 0.00079, (46.15% identity in 52 aa overlap); etc. Also similarity with various conjugative transposons. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. P71903|Rv2309c|MTCY3G12.25 (151 aa), FASTA scores: opt: 193, E(): 3.8e-07, (50.85% identity in 59 aa overlap); O53403|Rv1055|MTV017.08 (78 aa), FASTA scores: opt: 171, E(): 7.8e-06, (54.15% identity in 48 aa overlap); etc." /db_xref="GOA:A0A1R3Y5G1" /db_xref="InterPro:IPR002104" /db_xref="InterPro:IPR011010" /db_xref="InterPro:IPR013762" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G1" /protein_id="SIU02406.1" /translation="MKRAKVQQITPHDLRHTAASLAVSAGVNVLALQRILGHKSAKVT LDTYADLFDADLDAVAVTLGKDADQQT" tRNA complement(4139837..4139923) /locus_tag="BQ2027_SERX" /product="tRNA-Ser" /note="serX, len: 87 nt. Equivalent to serX, len: 87 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 nt overlap). tRNA-Ser, anticodon cga." CDS complement(4139953..4140411) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3778C" /product="POSSIBLE CYTIDINE/DEOXYCYTIDYLATE DEAMINASE" /note="Mb3778c, -, len: 152 aa. Equivalent to Rv3752c, len: 152 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 152 aa overlap). Probable cytidine/deoxycytidylate deaminase (EC 3.5.4.-), equivalent to Q9CB32|ML2474 POSSIBLE CYTIDINE/DEOXYCYTIDYLATE DEAMINASE from Mycobacterium leprae (171 aa), FASTA scores: opt: 890, E(): 1.6e-50, (88.1% identity in 151 aa overlap). Also highly similar to other deaminases and hypothetical proteins e.g. Q9AK79|2SCD60.04c PUTATIVE DEAMINASE from Streptomyces coelicolor (143 aa), FASTA scores: opt: 559, E(): 2.9e-29, (66.45% identity in 146 aa overlap); Q9F9W7 CYTOSINE DEAMINASE from Bifidobacterium longum (143 aa) FASTA scores: opt: 512, E(): 3.1e-26, (54.85% identity in 144 aa overlap); P21335|YAAJ_BACSU HYPOTHETICAL 17.8 KDA PROTEIN from Bacillus subtilis (161 aa), FASTA scores: opt: 425, E(): 1.4e-20, (47.7% identity in 151 aa overlap); AAK74212|SP0020 CYTIDINE/DEOXYCYTIDYLATE DEAMINASE FAMILY PROTEIN from Streptococcus pneumoniae (155 aa), FASTA scores: opt: 401, E(): 4.7e-19, (46.25% identity in 147 aa overlap); P30134|YFHC_ECOLI|B2559 HYPOTHETICAL 20.0 KDA PROTEIN from Escherichia coli strain K12 (178 aa), FASTA scores: opt: 397, E(): 9.5e-19, (47.0% identity in 149 aa overlap); etc. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. Mb3778c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y550" /db_xref="InterPro:IPR002125" /db_xref="InterPro:IPR016192" /db_xref="InterPro:IPR016193" /db_xref="InterPro:IPR028883" /db_xref="UniProtKB/TrEMBL:A0A1R3Y550" /protein_id="SIU02407.1" /translation="MTTDEDLIRAALAVAATAGPRDVPVGAVVVGADGTELARAVNAR EALGDPTAHAEILAMRLAAGVLGDGWRLEGTTLAVTVEPCTMCAGALVLARVARLVFG AWEPKTGAVGSLWDVVRDRRLNHRPEVRGGVLARECAAPLEAFFARQRLG" CDS complement(4140427..4140948) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3779C" /product="conserved protein" /note="Mb3779c, -, len: 173 aa. Equivalent to Rv3753c, len: 166 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Conserved hypothetical protein, only equivalent to Q9CB33|ML2473 HYPOTHETICAL PROTEIN from Mycobacterium leprae (159 aa) FASTA scores: opt: 920 E(): 1.4e-52, (88.6% identity in 158 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (t-c) leads to a longer product with a different 5' start compared to its homolog in Mycobacterium tuberculosis strain H37Rv (173 aa versus 166 aa). Protein product from Mb3779c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3779c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR023869" /db_xref="UniProtKB/TrEMBL:A0A1R3Y545" /protein_id="SIU02408.1" /translation="MGAQRASTQRPAADTPDGFGVAVVREEGRWRCSPMGPKALTSLR AAETELRELRSAGAVFGLLDVDDEFFVIVRPAPSGTRLLLSDATAALDYDIAAEVLDN LDAEIDPEDLEDADPFEEGDLGLLSDIGLPEAVLGVILDETDLYADEQLGRIAREMGF ADQLSAVIDRLGR" CDS 4141127..4142032 /codon_start=1 /transl_table=11 /gene="tyrA" /locus_tag="BQ2027_MB3780" /product="PREPHENATE DEHYDROGENASE TYRA (PDH) (HYDROXYPHENYLPYRUVATE SYNTHASE)" /note="Mb3780, tyrA, len: 301 aa. Equivalent to Rv3754, len: 301 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 301 aa overlap). Probable tyrA, prephenate dehydrogenase (EC 1.3.1.12), equivalent, but shorter 27 aa, to Q9CB34|ML2472 POSSIBLE PREPHENATE DEHYDROGENASE from Mycobacterium leprae (327 aa) FASTA scores: opt: 1600, E(): 1.6e-89, (80.0% identity in 300 aa overlap). Also similar to many pephenate dehydrogenases e.g. Q9RND8|TYRA from Bordetella bronchiseptica (Alcaligenes bronchisepticus) (299 aa), FASTA scores: opt: 345, E(): 9.7e-14, (32.85% identity in 271 aa overlap); Q9RVA7|DR1122 from Deinococcus radiodurans (372 aa) FASTA scores: opt: 341, E(): 2e-13, (35.65% identity in 216 aa overlap); P20692|TYRA_BACSU from Bacillus subtilis (372 aa), FASTA scores: opt: 314, E(): 8.6e-12, (27.75% identity in 263 aa overlap); etc. Also similar to Q04983|TYRC_ZYMMO TYRC PROTEIN [INCLUDES: CYCLOHEXADIENYL DEHYDROGENASE AND PREPHENATE DEHYDROGENASE ACTIVITIES] from Zymomonas mobilis (293 aa), FASTA scores: opt: 290, E(): 2e-10, (30.15% identity in 239 aa overlap). Equivalent to AAK48225 from Mycobacterium tuberculosis strain CDC1551 (323 aa) but shorter 22 aa. Protein product from Mb3780 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3780 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y570" /db_xref="InterPro:IPR003099" /db_xref="InterPro:IPR008927" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y570" /protein_id="SIU02409.1" /translation="MRAAAAAGREVFGYNRSVEGAHGARSDGFDAITDLNQTLTRAAA TEALIVLAVPMPALPGMLAHIRKSAPGCPLTDVTSVKCAVLDEVTAAGLQARYVGGHP MTGTAHSGWTAGHGGLFNRAPWVVSVDDHVDPTVWSMVMTLALDCGAMVVPAKSDEHD AAAAAVSHLPHLLAEALAVTAAEVPLAFALAAGSFRDATRVAATAPDLVRAMCEANTG QLAPAADRIIDLLSRARDSLQSHGSIADLADAGHAARTRYDSFPRSDIVTVVIGADKW REQLAAAGRAGGVITSALPSLDSPQ" CDS complement(4141995..4142594) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3781C" /product="conserved protein" /note="Mb3781c, -, len: 199 aa. Equivalent to Rv3755c, len: 199 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 199 aa overlap). Conserved hypothetical protein showing similarity to CAC47343|SMC03980 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (196 aa) FASTA scores: opt: 244, E(): 4.1e-09, (30.9% identity in 191 aa overlap); Q9I2B5|PA1994 from Pseudomonas aeruginosa (187 aa), FASTA scores: opt: 226, E(): 6e-08, (29.9% identity in 194 aa overlap); and Q98N73|MLR0268 HYPOTHETICAL PROTEIN (183 aa), FASTA scores: opt: 234, E(): 1.8e-08, (27.05% identity in 185 aa overlap). Protein product from Mb3781c detected using shotgun mass spectrometry. Mb3781c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR009467" /db_xref="UniProtKB/TrEMBL:A0A1R3Y540" /protein_id="SIU02410.1" /translation="MNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGR IVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQ GERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVS YTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM" CDS complement(4142600..4143319) /codon_start=1 /transl_table=11 /gene="proZ" /locus_tag="BQ2027_MB3782C" /product="POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER PROZ" /note="Mb3782c, proZ, len: 239 aa. Equivalent to Rv3756c, len: 239 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 239 aa overlap). Possible proZ, osmoprotectant transport integral membrane protein ABC transporter (see citation below), similar to osmoprotection proteins (proW, proZ) involved in glycine betaine/L-proline/choline transport, e.g. BAB58609|Q99RI4|OPUCB|SA2236|SAV2447 OPUCB PROTEIN (PROBABLE GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER) from Staphylococcus aureus (211 aa) FASTA scores: opt: 434, E(): 2.5e-18, (36.6% identity in 194 aa overlap); Q45461|OPBB_BACSU|OPUBB|PROW CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN (mediate the uptake of choline for synthesis of the osmoprotectant glycine betaine) from Bacillus subtilis (217 aa), FASTA scores: opt: 402, E(): 1.9e-16, (32.0% identity in 203 aa overlap); O34878|OPCB_BACSU|OPUCB GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (217 aa), FASTA scores: opt: 385, E(): 1.8e-15, (30.2% identity in 222 aa overlap); P39775|O34657|OPUBD|PROZ|OPBD_BACSU CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (226 aa) FASTA scores: opt: 350, E(): 2e-13, (31.75% identity in 208 aa overlap); etc. COULD BELONG TO THE CYSTW SUBFAMILY. Mb3782c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y551" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y551" /protein_id="SIU02411.1" /translation="MNFLQQALSYLLTASNWTGPVGLAVRTCEHLEYTAVAVAASALI AVPVGLLIGHTGRGTLLVVGAVNGLRALPTLGVLLLGVLLFGLGLGPPLVALMLLGIP SLLASTYAGIASVDPLVVDAARAMGMTESQVLLRVEVPNALPLMLGGLRSATLQVVAT ATVAAYASLGGLGGYLIDGIKERRFHIALVGAMMVAALALTLDGLLALAGWVSVPGTG RMRKLAAVVDKPAAGGGHALR" CDS complement(4143316..4144005) /codon_start=1 /transl_table=11 /gene="proW" /locus_tag="BQ2027_MB3783C" /product="POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER PROW" /note="Mb3783c, proW, len: 229 aa. Equivalent to Rv3757c, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Possible proW, osmoprotectant transport integral membrane protein ABC transporter (see citation below), similar to osmoprotection proteins (proW, proZ) involved in glycine betaine/L-proline/choline transport, e.g. BAB58607|Q99RI6|OPUCD|SA2234|SAV2445 OPUCD PROTEIN (PROBABLE GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER) from Staphylococcus aureus (231 aa) FASTA scores: opt: 364, E(): 7.1e-15, (30.0% identity in 220 aa overlap); Q45461|OPBB_BACSU|OPUBB|PROW CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN (mediate the uptake of choline for synthesis of the osmoprotectant glycine betaine) from Bacillus subtilis (217 aa), FASTA scores: opt: 348, E(): 6.2e-14, (31.05% identity in 206 aa overlap); O34878|OPCB_BACSU|OPUCB GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (217 aa), FASTA scores: opt: 343, E(): 1.2e-13, (30.1% identity in 206 aa overlap); O34742|OPCD_BACSU|OPUCD GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (229 aa) FASTA scores: opt: 337, E(): 2.9e-13, (31.1% identity in 193 aa overlap); etc. COULD BELONG TO THE CYSTW SUBFAMILY. Mb3783c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y549" /db_xref="InterPro:IPR000515" /db_xref="InterPro:IPR035906" /db_xref="UniProtKB/TrEMBL:A0A1R3Y549" /protein_id="SIU02412.1" /translation="MHYLMTHPGAAWALTVVHLRLSLLPVLIGLMSAVPLGLLVQRAP LLRRLTTATASVIFTIPSLALFVVLPLIIGTRILDEANVIVALAAYTTALLVRAVLEA LDAVPAQVHDAATAIGYSRIAQMLKVELPLSIPVLVAGLRVVAVTNIAMVSVGSVIGI GGLGTWFTAGYQTNKSDQIVAGVVAMFLLAIVVDVVINLAGRLATPWERAPRAARRRR QVAAPITGGAR" CDS complement(4143993..4145123) /codon_start=1 /transl_table=11 /gene="proV" /locus_tag="BQ2027_MB3784C" /product="POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER PROV" /note="Mb3784c, proV, len: 376 aa. Equivalent to Rv3758c, len: 376 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 376 aa overlap). Possible proV, osmoprotectant transport ATP-binding protein ABC transporter (see citation below), highly similar to osmoprotection proteins (proV) involved in glycine betaine/L-proline/choline transport, e.g. BAB58610|Q99RI3|OPUCA|SA2237|SAV2448 GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER (ATP-BINDING) from Staphylococcus aureus (410 aa), FASTA scores: opt: 816, E(): 8.4e-39, (39.5% identity in 362 aa overlap); O34992|OPCA_BACSU|OPUCA GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (380 aa), FASTA scores: opt: 807, E(): 2.5e-38, (40.55% identity in 333 aa overlap); Q45460|OPBA_BACSU|OPUBA|PROV CHOLINE TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (381 aa), FASTA scores: opt: 801, E(): 5.6e-38, (40.65% identity in 337 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporter family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3784c detected using SWATH mass spectrometry. Mb3784c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y530" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR017871" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y530" /protein_id="SIU02413.1" /translation="MICFDDVSKVYAHGATAVDRLTLEVPNGMLTVFVGPSGCGKTTA LRMINRMVDPTSGTITVDGTDVSTVNAVKLRLGIGYVIQNAGLMPHQRVIDNVATVPV LKGQPRRAARKAGYEVLERVGLDPKVATRYPAQLSGGEQQRVGVARALAADPPILLMD EPFSAVDPVVRHELQNEILRLQAELHKTIVFVTHDIDEALKLADLVAVFAPGGALAQY DETARLLSSPANDFVSKFIGLGRGYRWLQLFDAAGLPVRDIEQVSVNGLSDARDRQVR DGWVLVVDGAGAPLGWIDADGRRRHRGGAALSDAMTVGGSVFRPNGNLSQALDAALSS PSGVGVAVDGGGKVIGGILAADVLAEFQKGKKAGGGAKPCTT" CDS complement(4145132..4146079) /codon_start=1 /transl_table=11 /gene="proX" /locus_tag="BQ2027_MB3785C" /product="POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) BINDING LIPOPROTEIN PROX" /note="Mb3785c, proX, len: 315 aa. Equivalent to Rv3759c, len: 315 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 315 aa overlap). Possible proX, osmoprotectant-binding lipoprotein component of osmoprotectant transport system (see citation below), similar to osmoprotection proteins (proX) involved in glycine betaine/L-proline/choline transport, e.g. AAK79442|CAC1474 PROLINE/GLYCINE BETAINE ABC TRANSPORT SYSTEM PERIPLASMIC COMPONENT from Clostridium acetobutylicum (303 aa), FASTA scores: opt: 308, E(): 1.2e-11, (27.4% identity in 314 aa overlap); Q9X4J2|PROXL|SCE19A.33 PROXL PROTEIN from Streptomyces coelicolor (322 aa), FASTA scores: opt: 302, E(): 3e-11, (27.2% identity in 327 aa overlap); O29280|AF0982 OSMOPROTECTION PROTEIN (PROX) from Archaeoglobus fulgidus (292 aa), FASTA scores: opt: 235, E(): 3.4e-07, (23.15% identity in 285 aa overlap); etc. Also similar to MTV006_16 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis, and MLU15180_43 HYPOTHETICAL PROTEIN from Mycobacterium leprae. Equivalent to AAK48230 from Mycobacterium tuberculosis strain CDC1551 (343 aa) but shorter 28 aa. Contains probable N-terminal signal sequence. Protein product from Mb3785c detected using SWATH mass spectrometry. Mb3785c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y542" /db_xref="InterPro:IPR007210" /db_xref="UniProtKB/TrEMBL:A0A1R3Y542" /protein_id="SIU02414.1" /translation="MRMLRRLRRATVAAAVWLATVCLVASCANADPLGSATGSVKSIV VGSGDFPESQVIAEIYAQVLQANGFDVGRRLGIGSRETYIPALKDHSIDLVPEYIGNL LLYFQPDATVTMLDAVELELYKRLPGDLSILTPSPASDTDTVTVTAATAARWNLKTIA DLAPHSADVKFAAPSVFQTRPSGLPGLRHKYSLDIAPGNFVTINDGGGAVTVRALVEG TATAANLFSTSAAIPQNHLVVLEDPEHNFLAGNIVPLVNSRKKSDHLKDVLDAVSAKL TTAGLAELNAAVSGNSGVDPDQAARKWVRDNGFDHPVRQ" CDS 4146244..4146546 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3786" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN" /note="Mb3786, -, len: 100 aa. Equivalent to Rv3760, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). Possible conserved membrane protein, equivalent to Q50094|ML2366|MLCB12.11c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 423, E(): 1.2e-20, (67.7% identity in 99 aa overlap). Also similar with Q9JST1|NMA2149 PUTATIVE INNER MEMBRANE HYPOTHETICAL PROTEIN from Neisseria meningitidis (serogroup A) (104 aa), FASTA scores: opt: 113, E(): 0.95, (33.85% identity in 62 aa overlap); and showing similarity with Q9ZAX7 ABC TRANSPORTER MEMBRANE PROTEIN SUBUNIT from Streptococcus mutans (498 aa), FASTA scores: opt: 108, E(): 6.7, (42.35% identity in 85 aa overlap) (similarity at C-terminus); and P33108|SECY_MICLU PREPROTEIN TRANSLOCASE SECY SUBUNIT from Micrococcus luteus (Micrococcus lysodeikticus) (436 aa), FASTA scores: opt: 106, E(): 8.2, (29.05% identity in 86 aa overlap). Equivalent to AAK48231 from Mycobacterium tuberculosis strain CDC1551 (117 aa) but shorter 17 aa. Protein product from Mb3786 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3786 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6N8" /db_xref="InterPro:IPR010445" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6N8" /protein_id="SIU02415.1" /translation="MPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLILILLLIFIAQN TASAQFAFFGWRWSLPLGVAILLAAVGGGLITVFAGTARILQLRRAAKKTHAAALR" CDS complement(4146568..4147623) /codon_start=1 /transl_table=11 /gene="fadE36" /locus_tag="BQ2027_MB3787C" /product="POSSIBLE ACYL-COA DEHYDROGENASE FADE36" /note="Mb3787c, fadE36, len: 351 aa. Equivalent to Rv3761c, len: 351 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 351 aa overlap). Possible fadE36, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many conserved hypothetical proteins and showing some similarity with few acyl-CoA dehydrogenases, e.g. Q9APX7|FADE36 FADE36 PROTEIN from Pseudomonas aeruginosa (360 aa), FASTA scores: opt: 147, E(): 0.046, (26.15% identity in 214 aa overlap); part of AAB52261.2|U97002 protein similar to acyl-CoA dehydrogenases and epoxide hydrolases from Caenorhabditis elegans (985 aa), FASTA score: (31.2% identity in 324 aa overlap). C-terminal part is highly similar to Q50095|U1740AK|MLU15183_45 hypothetical protein from Mycobacterium leprae cosmid B174 (122 aa), FASTA scores: opt: 341, E(): 7.3e-15, (57.6% identity in 99 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. Protein product from Mb3787c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3787c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002575" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR041726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G9" /protein_id="SIU02416.1" /translation="MTSVDRLDGLDLGALDRYLRSLGIGRDGELRGELISGGRSNLTF RVYDDASSWLVRRPPLHGLTPSAHDMAREYRVVAALGDTPVPVARTISLCQDDSVLGA PFQVVEFVAGQVVRRRAELEALGSRSVIEGCVDALIRVLVDLHSIDPKAVGLSDFGKP DGYLERQVRRWGSQWELVRLPDDHRDADISRLHLALQQAIPQQSRTSIVHGDYRIDNT ILDTDDPCHVRAVVDWELSTLGDPLSDAALMCVYRDPALDLIVHAQAAWTSPLLPAAD ELADRYSLVSGQPLGHWEFYMALAYFKLAIIAAGIDYRRRMSEQAEGKDTAAESVPDV VAPLIARGLAEIAKKSG" CDS complement(4147702..4149582) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3788C" /product="possible hydrolase" /note="Mb3788c, -, len: 626 aa. Equivalent to Rv3762c, len: 626 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 626 aa overlap). Possible hydrolase (EC 3.-.-.-), highly similar to hypothetical proteins and beta-lactamases (EC 3.5.2.6) e.g. Q9RL04|SC5G9.23 HYPOTHETICAL 70.3 KDA PROTEIN from Streptomyces coelicolor (648 aa), FASTA scores: opt: 2088, E(): 3.7e-124, (52.9% identity in 624 aa overlap); P32717|YJCS_ECOLI|B4083 HYPOTHETICAL 73.2 KDA PROTEIN from Escherichia coli strain K12 (661 aa), FASTA scores: opt: 1911, E(): 5.7e-113, (46.9% identity in 631 aa overlap); Q9A824|CC1540 METALLO-BETA-LACTAMASE FAMILY PROTEIN from Caulobacter crescentus (647 aa), FASTA scores: opt: 1891, E(): 1e-111, (48.55% identity in 628 aa overlap); Q08347|YOL164W CHROMOSOME XV READING FRAME ORF from Saccharomyces cerevisiae (Baker's yeast) (646 aa) FASTA scores: opt: 1829, E(): 8.4e-108, (45.7% identity in 615 aa overlap); Q9I5I9|PA0740 PROBABLE BETA-LACTAMASE from Pseudomonas aeruginosa (658 aa), FASTA scores: opt: 1699, E(): 1.4e-99, (43.15% identity in 630 aa overlap); Q52556|SDSA ALKYL SULFATASE (protein involved in the degradation of sulfate esters of long-chain primaryal cohols e.g. SDS sodium dodecyl sulfate) from Pseudomonas sp (528 aa), FASTA scores: opt: 841, E(): 1.7e-45, (33.7% identity in 534 aa overlap); etc. N-terminual end also highly similar to Q48790|SEPA SEPA PROTEIN (protein implicated in cell separation) from Listeria monocytogenes (391 aa), FASTA scores: opt: 1256, E(): 8.3e-72, (49.6% identity in 363 aa overlap). Also slight similarity to P96253|Rv0407|MTCY22G10.03 HYPOTHETICAL 37.0 KDA PROTEIN from Mycobacterium tuberculosis (336 aa). Protein product from Mb3788c detected using SWATH mass spectrometry. Mb3788c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y554" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR029228" /db_xref="InterPro:IPR029229" /db_xref="InterPro:IPR036527" /db_xref="InterPro:IPR036866" /db_xref="InterPro:IPR038536" /db_xref="UniProtKB/TrEMBL:A0A1R3Y554" /protein_id="SIU02417.1" /translation="MPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSPCV IKAADGRVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPGIYQVRGFDIS NISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPVVAVIYTHSHVDHFGGVLG GTTQADVDAGKVAVLAPEGFTAHAVQENIYAGSAMMRRAGYMYGTVLARGLRGHVGCG LGQTLSTGEVSLVVPTVDITETGETHTIDGVEIEFQMAPGTEAPAEMHFYFPRFRALC MAENATHNLHNLLTLRGALVRDPRAWSGYLTEAIDTFADRTDVVFASHHWPTWGREKI VEFLSQQRDMHSYLHDQTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNV KAIYQRYMGWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDAGDFRWAAT LLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTGAAELRDGNPGSSGQVP APTFFAQLTPDQIFDVLAISINGPRAWDLDLAIDFTFTEPDVNYRLTLRNGVLIHRKL PADPATANATVTVGDKVRLVAAALGDISSPGFEVFGDRTVLQTFLSVLDRPDSAFNIV TP" CDS 4149753..4150232 /codon_start=1 /transl_table=11 /gene="lpqH" /locus_tag="BQ2027_MB3789" /product="19 KDA LIPOPROTEIN ANTIGEN PRECURSOR LPQH" /note="Mb3789, lpqH, len: 159 aa. Equivalent to Rv3763, len: 159 aa, from Mycobacetrium tuberculosis strain H37Rv, (100.0% identity in 159 aa overlap). lpqH, conserved 19 KDa lipoprotein antigen precursor (see citations below), equivalent to P31502|19KD_MYCIT|MI22 19 KDA LIPOPROTEIN ANTIGEN PRECURSOR (MI22 ANTIGEN) from Mycobacterium intracellulare (162 aa), FASTA scores: opt: 773, E(): 6.2e-35, 75.95(% identity in 162 aa overlap); P46733|19KD_MYCAV 19 KDA LIPOPROTEIN ANTIGEN PRECURSOR from M. avium (161 aa), FASTA scores: opt: 743, E(): 2.5e-33, (72.5% identity in 160 aa overlap); and Q9X7A5|LPQH|ML1966 POSSIBLE LIPOPROTEIN from Mycobacterium leprae FASTA scores: opt: 371, E(): 2.2e-13, (42.6% identity in 162 aa overlap). POSSIBLY ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR. SIMILAR TO OTHER MYCOBACTERIUM 19 KDA ANTIGEN. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Protein product from Mb3789 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3789 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5J1" /db_xref="InterPro:IPR008691" /db_xref="UniProtKB/Swiss-Prot:P0A5J1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02418.1" /translation="MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASP GAASGPKVVIDGKDQNVTGSVVCTTAAGNVNIAIGGAATGIAAVLTDGNPPEVKSVGL GNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNKSFEIEVTCS" CDS complement(4150288..4151715) /codon_start=1 /transl_table=11 /gene="tcry" /locus_tag="BQ2027_MB3790C" /product="possible two component sensor kinase tcry" /note="Mb3790c, -, len: 475 aa. Equivalent to Rv3764c, len: 475 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 475 aa overlap). Possible histidine protein kinase (EC 2.7.3.-), part of a two-component regulatory system, similar to others e.g. Q9ADN6|2SC10A7.25 PUTATIVE TWO COMPONENT SYSTEM HISTIDINE KINASE from Streptomyces coelicolor (524 aa), FASTA scores: opt: 1332, E(): 5.4e-70, (49.9% identity in 477 aa overlap); Q9L3C1|KB|CAC42479 PUTATIVE HISTIDINE KINASE from Amycolatopsis mediterranei (469 aa), FASTA scores: opt: 515, E(): 1.4e-22, (36.1% identity in 313 aa overlap); P72560 HISTIDINE PROTEIN KINASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (438 aa), FASTA scores: opt: 480, E(): 1.4e-20, (40.1% identity in 232 aa overlap); P30847|P76401|BAES_ECOLI|B2078 SENSOR PROTEIN from Escherichia coli strain K12 (467 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. P96368|Rv1032c|MTCY10G2.17 (509 aa), FASTA scores: opt: 1007, E(): 4e-51, (43.5% identity in 416 aa overlap); and P71815|Rv0758|MTCY369.03 (485 aa), FASTA scores: opt: 738, E(): 1.6e-35, (28.6% identity in 438 aa overlap). Equivalent to AAK48235 from Mycobacterium tuberculosis strain CDC1551 (506 aa) but shorter 31 aa. Protein product from Mb3790c detected using SWATH mass spectrometry. Mb3790c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y574" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR004358" /db_xref="InterPro:IPR005467" /db_xref="InterPro:IPR036097" /db_xref="InterPro:IPR036890" /db_xref="UniProtKB/TrEMBL:A0A1R3Y574" /protein_id="SIU02419.1" /translation="MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWR RETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTVAAGYLTGSGSRAALTSTGRSQL ERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVT VIALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTE VGQLGSALNRMLDHIAAALSARQASETRVRQFVADASHELRTPLAAIRGYTELTQRIG DDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPVDMSRLAVDAVSDAH VAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHV VLQVIDNGPGIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVS SSPGYTEFAVRLPLDGWQPLESSPR" CDS complement(4151786..4152490) /codon_start=1 /transl_table=11 /gene="tcrx" /locus_tag="BQ2027_MB3791C" /product="probable two component transcriptional regulatory protein tcrx" /note="Mb3791c, -, len: 234 aa. Equivalent to Rv3765c, len: 234 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 234 aa overlap). Probable response regulator of a two-component regulatory system, highly similar to others e.g. Q9ADN7|2SC10A7.24 PUTATIVE TWO COMPONENT SYSTEM RESPONSE REGULATOR from Streptomyces coelicolor (271 aa), FASTA scores: opt: 1111, E(): 4.8e-63, (72.3% identity in 231 aa overlap); Q9F161 RESPONSE REGULATOR from Corynebacterium glutamicum (Brevibacterium flavum) (232 aa), FASTA scores: opt: 692, E(): 1.2e-36, (46.0% identity in 226 aa overlap); Q9KZU5|SCD84.23c PUTATIVE TWO-COMPONENT SYSTEN RESPONSE REGULATOR from Streptomyces coelicolor (248 aa), FASTA scores: opt: 674, E(): 1.7e-35, (44.05% identity in 236 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Q50806|Rv1033c|MTCY10G2.16 RESPONSE REGULATOR HOMOLOG (257 aa), FASTA scores: opt: 947, E(): 1e-52, (59.5% identity in 232 aa overlap); P71814|Rv0757|MTCY369.02 PHOP-LIKE PROTEIN (247 aa) FASTA scores: opt: 829, E(): 2.8e-45, (54.65% identity in 225 aa overlap); O53894|Rv0981|MTV044.09 (230 aa), FASTA scores: opt: 662, E(): 9e-35, (44.65% identity in 224 aa overlap); and also similar to MTCY31_34; MTCY19H5_20; MTY13628_5; MTCY20G9_17; and to MLCB57_27 from Mycobacterium leprae; and MBY13627_3 from Mycobacterium bovis BCG. Equivalent to AAK48236 from Mycobacterium tuberculosis strain CDC1551 (286 aa) but shorter 52 aa. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. SIMILAR TO BACTERIAL REGULATORY PROTEINS INVOLVED IN SIGNAL TRANSDUCTION. Protein product from Mb3791c detected using SWATH mass spectrometry. Mb3791c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y557" /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR001867" /db_xref="InterPro:IPR011006" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039420" /db_xref="UniProtKB/TrEMBL:A0A1R3Y557" /protein_id="SIU02420.1" /translation="MRRADGQPVTVLVVDDEPVLAEMVSMALRYEGWNITTAGDGSSA IAAARRQRPDVVVLDVMLPDMSGLDVLHKLRSENPGLPVLLLTAKDAVEDRIAGLTAG GDDYVTKPFSIEEVVLRLRALLRRTGVTTVDSGAQLVVGDLVLDEDSHEVMRAGEPVS LTSTEFELLRFMMHNSKRVLSKAQILDRVWSYDFGGRSNIVELYISYLRKKIDNGREP MIHTLRGAGYVLKPAR" CDS 4152999..4153688 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3792" /product="HYPOTHETICAL PROTEIN" /note="Mb3792, -, len: 229 aa. Equivalent to Rv3766, len: 229 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 229 aa overlap). Hypothetical unknown protein. Segment 183 to 229 highly similar to C-terminal part of O06288|Rv3594|MTCY07H7B.28c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 128, E(): 0.92, (46.8% identity in 47 aa overlap). Protein product from Mb3792 detected using SWATH mass spectrometry. Mb3792 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:A0A1R3Y559" /protein_id="SIU02421.1" /translation="MRSAFDSGRLTFGIVYTYARPNWWANANTVRSMIDAAGGLHPRV ALMLDVESGGNPPGDGSSWINRLYWNLADYAGSPVRIIGYANAYDFFNMWRVRPAGLR VIGAGYGSNPNLPGQVAHQYTDGSGYSPNLPQGAPPFGRCDMNSANGLTPQQFAAACG VTTTGGPLMALTDEEQTELLTKVREIWDQLRGPNGAGWPQLGQNEQGQDLTPVDAIAV IKNDVAAMLAE" CDS complement(4153702..4154646) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3793C" /product="possible s-adenosylmethionine-dependent methyltransferase" /note="Mb3793c, -, len: 314 aa. Equivalent to Rv3767c, len: 314 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 314 aa overlap). Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P96823|Rv0146|MTCI5.20 HYPOTHETICAL 34.0 KDA PROTEIN (310 aa), FASTA scores: opt: 909, E(): 5.3e-50, (48.1% identity in 316 aa overlap); O53686|Rv0281|MTV035.09 (302 aa), FASTA scores: opt: 802, E(): 2.8e-43, (45.2% identity in 314 aa overlap); Q50726|YX99_MYCTU|Rv3399|MT3507|MTCY78.29 c (348 aa), FASTA scores: opt: 796, E(): 7.6e-43, (45.35% identity in 302 aa overlap); MTCY78_30; MTCY31_23; MTCY210_45; MTCY4C12_14; MTY13D12_21, MTCI5_19; MTCY180_22; etc. Contains probable N-terminal signal sequence Protein product from Mb3793c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3793c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVQ7" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TVQ7" /protein_id="SIU02422.1" /translation="MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARI FVDAAGDGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATADAG VRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQPASQLVNVPI DLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLLFERIDALSRPGSWLASNV PGAGFLDPERMRRQRADMRRMRAAAAKLVETEISDVDDLWYAEQRTAVAEWLRERGWD VSTATLPELLARYGRSIPHSGEDSIPPNLFVSAQRATS" CDS 4154776..4155135 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3794" /product="unknown protein" /note="Mb3794, -, len: 119 aa. Equivalent to Rv3768, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Hypothetical unknown protein. Protein product from Mb3794 detected using SWATH mass spectrometry. Mb3794 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR032710" /db_xref="InterPro:IPR037401" /db_xref="UniProtKB/TrEMBL:A0A1R3Y555" /protein_id="SIU02423.1" /translation="MGSTPPRTPQEVFAHHGQALAAGDLDEIVADYADDSFVITPAGI ARGKEGIRQLFVKLLDDIPNALWDLKTQIFEGDILFLEWTANSAVSRVDDGVDTFVFR DGTIWAHTVRYTPHPKT" CDS 4155321..4155593 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3795" /product="HYPOTHETICAL PROTEIN" /note="Mb3795, -, len: 90 aa. Equivalent to Rv3769, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 90 aa overlap). Hypothetical unknown protein, possible coiled-coil protein. Protein product from Mb3795 detected using SWATH mass spectrometry. Mb3795 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y558" /protein_id="SIU02424.1" /translation="MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVRE HTGRLDRVTTKVGQLAAKSDDTNARVRSLEEGQAEIKDLLLRALDK" CDS complement(4155906..4156481) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3796C" /product="HYPOTHETICAL LEUCINE RICH PROTEIN" /note="Mb3796c, -, len: 191 aa. Equivalent to Rv3770c, len: 191 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 191 aa overlap). Hypothetical unknown leu-rich protein. Protein product from Mb3796c detected using shotgun mass spectrometry. Mb3796c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6P7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6P7" /protein_id="SIU02425.1" /translation="MLSGIQQNTLMDNDPLAHGYYVADLLVALAVVVLMLRARRTRPE LARMLLLGTLIGLVWELPVFGLSAWTNTPIIEWATPLPLPTVVFLLAHSVWDGALLTM GWLLARALTGEPAGALGLTVQVLWGQLTALAVELSAILAGTWSYVDDLWFNPVMFWFR GHPVTAAMQLTWLLAPLCFAALVRRLALTAR" CDS complement(4156587..4156769) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3797C" /product="PROBABLE REMNANT OF A TRANSPOSASE" /note="Mb3797c, -, len: 60 aa. Equivalent to Rv3770A, len: 60 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 60 aa overlap). Probable remnant of a transposase, similar to many e.g. Rv2812|MTCY16B7.31c|Z81331_17 IS1604 putative transposase from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 204, E(): 1e-07, (80.5% identity in 41 aa overlap). Continuation of Rv3770B." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I5" /protein_id="SIU02426.1" /translation="MGSTPWCPNPCQCTLRTPVEVLELAVALRPENPDRTAGAIQRIL RAQLAGDRIALRGRGS" CDS complement(4156784..4156975) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3798C" /product="PROBABLE REMNANT OF A TRANSPOSASE" /note="Mb3798c, -, len: 63 aa. Equivalent to Rv3770B, len: 63 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 63 aa overlap). Probable remnant of a transposase, similar to many e.g. Rv2812|MTCY16B7.31c|Z81331_17 IS1604 putative transposase from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 379, E(): 1.6e-21, (93.55% identity in 62 aa overlap). Continues as Rv3770A. Mb3798c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y578" /protein_id="SIU02427.1" /translation="MRAERARAIGLFRYQLIREAADAAHSTKERGKMVRELASREHTD PFGRKVRISRHTIDRWIRN" CDS complement(4157110..4157436) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3799C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3799c, -, len: 108 aa. Equivalent to Rv3771c, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap). Hypothetical protein, highly similar, but shorter 81 aa, to P71640|Rv2811|MTCY16B7.32c HYPOTHETICAL 21.1 KDA PROTEIN from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 469, E(): 2.7e-25, (73.15% identity in 108 aa overlap),Mb3799c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y575" /db_xref="UniProtKB/TrEMBL:A0A1R3Y575" /protein_id="SIU02428.1" /translation="MPAPAEKALSQVGFRRIAADLARPAETVRGWLRRFAERAEAVRS VFTVMLRAVDPDPVMPDAAVGVFAYAVTVIAAVVTVIECQFALSTVSLAETAVAVSGG RLVAPG" tRNA complement(4157571..4157643) /locus_tag="BQ2027_ARGU" /product="tRNA-Arg" /note="argU, len: 73 nt. Equivalent to argU, len: 73 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 73 nt overlap). tRNA-Arg, anticodon acg." tRNA complement(4157674..4157762) /locus_tag="BQ2027_SERT" /product="tRNA-Ser" /note="serT, len: 89 nt. Equivalent to serT, len: 89 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 89 nt overlap). tRNA-Ser, anticodon gct." CDS 4157840..4158901 /codon_start=1 /transl_table=11 /gene="hisC2" /locus_tag="BQ2027_MB3800" /product="PROBABLE HISTIDINOL-PHOSPHATE AMINOTRANSFERASE HISC2 (IMIDAZOLE ACETOL-PHOSPHATE TRANSAMINASE) (IMIDAZOLYLACETOLPHOSPHATE AMINOTRANSFERASE)" /note="Mb3800, hisC2, len: 353 aa. Equivalent to Rv3772, len: 353 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 353 aa overlap). Probable hisC2, histidinol-phosphate aminotransferase (EC 2.6.1.9), highly similar to Q9ZBY8|SCD78.11 PUTATIVE HISTIDINOL-PHOPHATE AMINOTRANSFERASE from Streptomyces coelicolor (359 aa), FASTA scores: opt: 1165, E(): 7.1e-64, (52.55% identity in 356 aa overlap); and similar to many e.g. Q9EYX2 from Gardnerella vaginalis (317 aa) FASTA scores: opt: 814, E(): 1.7e-42, (45.15% identity in 308 aa overlap); Q9CMI7|HISH_1PM0838|HISH from Pasteurella multocida (365 aa), FASTA scores: opt: 701, E(): 1.5e-35, (35.05% identity in 351 aa overlap); O07131|HIS8_METFL|HISC|HISH from Methylobacillus flagellatum (368 aa), FASTA scores: opt: 645, E(): 4e-32, (34.5% identity in 345 aa overlap); etc. Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site. BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE. Protein product from Mb3800 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3800 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVQ0" /db_xref="InterPro:IPR001917" /db_xref="InterPro:IPR004839" /db_xref="InterPro:IPR005861" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="InterPro:IPR024892" /db_xref="UniProtKB/Swiss-Prot:Q7TVQ0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02429.1" /translation="MTARLRPELAGLPVYVPGKTVPGAIKLASNETVFGPLPSVRAAI DRATDTVNRYPDNGCVQLKAALARHLGPDFAPEHVAVGCGSVSLCQQLVQVTASVGDE VVFGWRSFELYPPQVRVAGAIPIQVPLTDHTFDLYAMLAAVTDRTRLIFVCNPNNPTS TVVGPDALARFVEAVPAHILIAIDEAYVEYIRDGMRPDSLGLVRAHNNVVVLRTFSKA YGLAGLRIGYAIGHPDVITALDKVYVPFTVSSIGQAAAIASLDAADELLARTDTVVAE RARVSAELRAAGFTLPPSQANFVWLPLGSRTQDFVEQAADARIVVRPYGTDGVRVTVA APEENDAFLRFARRWRSDQ" CDS complement(4158947..4159291) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3801C" /product="CONSERVED HYPOTHETICAL PROTEIN [SECOND PART]" /note="Mb3801c, -, len: 114 aa. Equivalent to 3' end of Rv3773c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 114 aa overlap). Hypothetical protein, highly similar to C-terminal end of O53773|Rv0576|MTV039.14 POSSIBLE TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 575, E(): 8.3e-30, (47.4% identity in 192 aa overlap); and some similarity with other proteins from Mycobacterium tuberculosis e.g. P71985|Rv1727|MTCY04C12.12 (189 aa) FASTA scores: opt: 176, E(): 0.00022, (31.1% identity in 180 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis Rv3773c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-a) splits Rv3773c in two parts, Mb3802c and Mb3801c. Protein product from Mb3801c detected using SWATH mass spectrometry. Mb3801c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR017517" /db_xref="InterPro:IPR017520" /db_xref="UniProtKB/TrEMBL:A0A1R3Y567" /protein_id="SIU02430.1" /translation="MERLVSGAARSALDAWHRHGLEGDVSLGPGSMSAKVAVSVFSVE FLVHAWDYAVAVGSELKAADSLAEYVLELARKLIKPEERSVAGFNEPVDVPEDGGALE RLIAFTGRNPAR" CDS complement(4159296..4159532) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3802C" /product="CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]" /note="Mb3802c, -, len: 78 aa. Equivalent to 5' end of Rv3773c, len: 194 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 52 aa overlap). Hypothetical protein, highly similar to C-terminal end of O53773|Rv0576|MTV039.14 POSSIBLE TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 575, E(): 8.3e-30, (47.4% identity in 192 aa overlap); and some similarity with other proteins from M. tuberculosis e.g. P71985|Rv1727|MTCY04C12.12 (189 aa) FASTA scores: opt: 176, E(): 0.00022, (31.1% identity in 180 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3773c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-a) splits Rv3773c into 2 parts, Mb3801c and Mb3802c. Mb3802c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y571" /protein_id="SIU02431.1" /translation="MPPESRPGPDSPPTDELACAEAALQVLQQVLHTIGRQDKAKQTP CPGYDVKKTNRAFAQLNHGPRRHGRRGILTACGH" CDS 4159556..4160380 /codon_start=1 /transl_table=11 /gene="echA21" /locus_tag="BQ2027_MB3803" /product="possible enoyl-coa hydratase echa21 (enoyl hydrase) (unsaturated acyl-coa hydratase) (crotonase)" /note="Mb3803, echA21, len: 274 aa. Equivalent to Rv3774, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Possible echA21, enoyl-CoA hydratase (EC 4.2.1.17), equivalent to Q9CD94|ECHA1|ML0120 PUTATIVE ENOYL-COA HYDRATASE from Mycobacterium leprae (278 aa), FASTA scores: opt: 1593, E(): 2.2e-92, (88.3% identity in 274 aa overlap). Also similar to others e.g. Q9I2S4|PA1821 from Pseudomonas aeruginosa (270 aa), FASTA scores: opt: 761, E(): 2e-40, (42.3% identity in 267 aa overlap); Q9FHR8 from Arabidopsis thaliana (Mouse-ear cress) (278 aa) FASTA scores: opt: 638, E(): 9.9e-33, (39.4% identity in 269 aa overlap); Q9AB78|CC0353 from Caulobacter crescentus (286 aa), FASTA scores: opt: 601, E(): 2.1e-31, (39.25% identity in 266 aa overlap); etc. Protein product from Mb3803 detected using shotgun mass spectrometry. Mb3803 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y566" /db_xref="InterPro:IPR001753" /db_xref="InterPro:IPR014748" /db_xref="InterPro:IPR029045" /db_xref="UniProtKB/TrEMBL:A0A1R3Y566" /protein_id="SIU02432.1" /translation="MGETYESVTVETKDQVAQVTLIGPGKGNAMGPAFWSEMPEVFHA LDADREVRAIVITGSGKNFSYGLDVPAMGGMFAPLIADGALARPRTDFHTEILRMQKA INAVADCRTPTIAAVQGWCIGGAVDLISAVDIRYASADAKFSVREVKLAIVADMGSLA RLPLILSDGHLRELALTGKNIDAARAEKIGLVNDVYDDADQTLAAAHATAAEIAANPP LAVYGIKDVLDQQRTSAVSENLRYVAAWNAAFLPSKDLTEGISATFAKRPPQFTGE" CDS 4160392..4161639 /codon_start=1 /transl_table=11 /gene="lipE" /locus_tag="BQ2027_MB3804" /product="probable lipase lipe" /note="Mb3804, lipE, len: 415 aa. Equivalent to Rv3775, len: 415 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 415 aa overlap). Probable lipE, hydrolase lipase (EC 3.1.-.-), equivalent to Q9CD95|LIPE|ML0119 PROBABLE HYDROLASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 2418, E(): 6.4e-144, (84.75% identity in 406 aa overlap). Also similar to other esterases e.g. Q9ABH2|CC0255 ESTERASE A from Caulobacter crescentus (374 aa), FASTA scores: opt: 427, E(): 2.4e-19, (28.9% identity in 391 aa overlap); O87861|ESTA ESTERASE A from Streptomyces chrysomallus (389 aa), FASTA scores: opt: 417, E(): 1e-18, (31.0% identity in 361 aa overlap); Q9RK50|SCF12.08 PUTATIVE ESTERASE from Streptomyces coelicolor (376 aa), FASTA scores: opt: 385, E(): 1e-16, (31.35% identity in 373 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. P71778|Rv1497|MTCY277.19 HYPOTHETICAL 45.8 KDA PROTEIN (429 aa), FASTA scores: opt: 457, E(): 3.5e-21, (30.4% identity in 395 aa overlap). Protein product from Mb3804 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3804 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001466" /db_xref="InterPro:IPR012338" /db_xref="UniProtKB/TrEMBL:A0A1R3Y568" /protein_id="SIU02433.1" /translation="MRAGDGKIRVPADLDAVTATGEEDHSEIDGAAVDRIWRAARHWY RAGMHPAIQLCIRHHGRVVLNRAIGHGWGNAPTDEADAEKIPVTTDTPFCVYSAAKAI TATVVHMLVERGHFALDDRVCEYLPSYTSHGKHRTTIRHVLTHSAGVPFPTGPRPDVR RADDHEYAVERLGELRPLYRPGLVHIYHALTWGPLMREIVYAATGKEIREILATEILD PLGFRWTNFGVAERDVPLVAPSHATGRQLPPVIAAVFRKAIGGTVHEIIPYTNTPFFL STILPSSNTVSTANELSRFMEILRRGGELDGVRVLSPETLRGAVTECRRLRPDFATGL MPLRWGTGFMLGSAKYGPFGRNAPAAFGHLGLVNIAVWADPERALSGGLISSGKPGRD PEAGRYGALLNAITAEIPRASSG" CDS 4161796..4163355 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3805" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3805, -, len: 519 aa. Equivalent to Rv3776, len: 519 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 519 aa overlap). Hypothetical protein, highly similar to Q10709|YL00_MYCTU|Rv2100|MTCY49.40 HYPOTHETICAL 58.9 KDA PROTEIN from Mycobacterium tuberculosis (550 aa) FASTA scores: opt: 1646, E(): 1.2e-83, (77.85% identity in 510 aa overlap) (homology from potential start at 7744); and similar to other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O33266|Rv0336|MTCY279.03 (503 aa) FASTA scores: opt: 682, E(): 2.2e-30, (41.65% identity in 497 aa overlap). Protein product from Mb3805 detected using shotgun mass spectrometry. Mb3805 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR003615" /db_xref="InterPro:IPR003870" /db_xref="UniProtKB/TrEMBL:A0A1R3Y565" /protein_id="SIU02434.1" /translation="MFEISLSDPVELRDADDAALLAAIEDCARAEVAAGARRLSAIAE LTSRRTGNDQRADWACDGWDCAAAEVAAALTVSHRKASGQMHLSLTLNRLPQVAALFL AGQLSARLVSIIAWRTYLVRDPEALSLLDAALAKHATAWGPLSAPKLEKAIDSWIDRY DPAALRRTRISARSRDLCIGDPDEDAGTAALWGRLFATDAAMLDKRLTQLAHGVCDDD PRTIAQRRADALGALAAGADRLTCGCGNSDCPSSAGNHRQATGVVIHVVADAAALGAA PDPRLSGPEPALAPEAPATPAVKPPAALISGGGVVPAPLLAELIRGGAALSRVRHPGD LRSEPHYRPSAKLAEFVRIRDMTCRFPGCDQPTEFCDIDHTLPYPLGPTHPSNLKCLC RKHHLLKTFWTGWRDVQLPDGTIIWTAPNGHTYTTHPDSRIFLPSWHTTTAALPPAPS PPAIGPTHTLLMPRRRRTRAAELAHRIKRERAHVTQRNKPPPSGGDTAVAEGFEPPDG VSRLSLSRRVH" tRNA complement(4163288..4163374) /locus_tag="BQ2027_SERU" /product="tRNA-Ser" /note="serU, len: 87 nt. Equivalent to serU, len: 87 nt, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 87 nt overlap). tRNA-Ser, anticodon tga." CDS 4163401..4164387 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3806" /product="probable oxidoreductase" /note="Mb3806, -, len: 328 aa. Equivalent to Rv3777, len: 328 aa, from Mycobacterium tuberculosis strain H7Rv, (99.7% identity in 328 aa overlap). Probable oxidoreductase (EC 1.-.-.-), equivalent to Q9CD96|ML0118 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (336 aa) FASTA scores: opt: 1661, E(): 1.1e-87, (76.0% identity in 325 aa overlap). Also highly similar to many e.g. Q9XA55|SCGD3.24c PUTATIVE QUINONE OXIDOREDUCTASE (EC 1.6.5.5) from Streptomyces coelicolor (326 aa) FASTA scores: opt: 1118, E(): 1.3e-64, (59.6% identity in 312 aa overlap); O65423|F18E5.200|F17L22.40|AT4G21580 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Arabidopsis thaliana (Mouse-ear cress) (325 aa), FASTA scores: opt: 1110, E(): 3e-56, (52.15% identity in 326 aa overlap); Q98FI0|MLL3767 NADPH QUINONE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (326 aa), FASTA scores: opt: 980, E(): 7.9e-49, (47.85% identity in 324 aa overlap); etc. Protein product from Mb3806 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3806 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6Q7" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR014189" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Q7" /protein_id="SIU02435.1" /translation="MTIMRAVVAESSDRLVWQEVPDVSAGPGEVLIKVAASGVNRADV LQAAGKYPPPPGVSDIIGLEVSGIVAAVGPGVTEWSAGQEVCALLAGGGYAEYVAVPA DQVLPIPPSVNLVDSAALPEVACTVWSNLVMTAHLRPGQLVLIHGGASGIGSHAIQVA RALAARVAITAGSPEKLELCRDLGAQITINYRDEDFVARLKQETDGSGADIILDIMGA SYLDRNIDALATDGQLIVIGMQGGVKAELNLGKLLTKRARVIGTTLRARPVSGPHGKA AIAQAVAASVWPMIAANRVRPVIGTRLPIQQAAQAHELMLSGKTFGKILLTV" CDS complement(4164406..4165602) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3807C" /product="POSSIBLE AMINOTRANSFERASE" /note="Mb3807c, -, len: 398 aa. Equivalent to Rv3778c, len: 398 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 398 aa overlap). Possible aminotransferase (EC 2.6.1.-), equivalent to Q9CD97|ML0117 HYPOTHETICAL PROTEIN from Mycobacterium leprae (398 aa) FASTA scores: opt: 2141, E(): 1.2e-123, (83.4% identity in 398 aa overlap). Also similar to others e.g. Q9K3K6|SCG20A.34 PUTATIVE AMINOTRANSFERASE from Streptomyces coelicolor (400 aa), FASTA scores: opt: 723, E(): 6.5e-37, (36.3% identity in 402 aa overlap); Q9KSS2|VC1184 NIFS-RELATED PROTEIN (AMINOTRANSFERASE-RELATED) from Vibrio cholerae (416 aa) FASTA scores: opt: 595, E(): 4.5e-29, (31.35% identity in 405 aa overlap); Q98NK4|MLR0102 AMINOTRANSFERASE from Rhizobium loti (Mesorhizobium loti) (425 aa), FASTA scores: opt: 563, E(): 4.2e-27, (29.4% identity in 408 aa overlap); Q9RY03|DR0151 NIFS-RELATED PROTEIN from Deinococcus radiodurans (401 aa), FASTA scores: opt: 484, E(): 2.7e-22, (32.35% identity in 399 aa overlap); Q9A766|CC1860 AMINOTRANSFERASE CLASS V from Caulobacter crescentus (408 aa), FASTA scores: opt: 390, E(): 1.5e-16, (27.85% identity in 413 aa overlap); etc. Protein product from Mb3807c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3807c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J0" /db_xref="InterPro:IPR000192" /db_xref="InterPro:IPR011340" /db_xref="InterPro:IPR015421" /db_xref="InterPro:IPR015422" /db_xref="InterPro:IPR015424" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02436.1" /translation="MAYDVARVRGLHPSLGDGWVHFDAPAGMLIPDSVATTVSTAFRR SGASTVGAHPSARRSAAVLDAAREAVADLVNADPGGVVLGADRAVLLSLLAEASSSRA GLGYEVIVSRLDDEANIAPWLRAAHRYGAKVKWAEVDIETGELPTWQWESLISKSTRL VAVNSASGTLGGVTDLRAMTKLVHDVGALVVVDHSAAAPYRLLDIRETDADVVTVNAH AWGGPPIGAMVFRDPSVMNSFGSVSTNPYATGPARLEIGVHQFGLLAGVVASIEYLAA LDESARGSRRERLAVSMQSADAYLNRVFDYLMVSLRSLPLVMLIGRPEAQIPVVSFAV HKVPADRVVQRLADNGILAIANTGSRVLDVLGVNDVGGAVTVGLAHYSTMAEVDQLVR ALASLG" CDS 4165692..4167692 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3808" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN ALANINE AND LEUCINE RICH" /note="Mb3808, -, len: 666 aa. Equivalent to Rv3779, len: 666 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 666 aa overlap). Probable conserved transmembrane ala-, leu-rich protein, equivalent to Q9CD98|ML0116 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (654 aa), FASTA scores: opt: 1991, E(): 2e-112, (66.5% identity in 666 aa overlap). Shows some similarity with Q9RRU0|DR2395 PUTATIVE NA+/H+ ANTIPORTER from Deinococcus radiodurans (458 aa), FASTA scores: opt: 138, E(): 0.69, (31.9% identity in 138 aa overlap). Protein product from Mb3808 detected using SWATH mass spectrometry. Mb3808 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y597" /db_xref="UniProtKB/TrEMBL:A0A1R3Y597" /protein_id="SIU02437.1" /translation="MGLWFGTLIALILLIAPGAMVARIAQLRWPVAIAVGPALTYGVV ALAIIPYGALGIPWNGWTALAALAVTCAVATGLQLLLARFRDLDAEALAVSRWPAVTV AAGVLLGALLIGWAAYRGIPHWQSIPSTWDAVWHANTVRFILDTGQASSTHMGELRNV ETHAPLYYPSVFHGLVAVFCQLTGAAPTTGYTLSSLAASVWLFPVSAAVLTWRAVRSH PGALWSASCASAEWRAAGAAGTAAALSASFTAVPYVEFDTAAMPNLAAYGIAVPTMVL ITSTLRHRDRIPVAVLALVGVFSLHITGGIVVALLVSAWWLFEALRHPVRSRLADLLT LAGVAAMAGLVMLPQFLSVRQQEDIIAGHAFPTYLSKKRGLFDAVFQHSRHLNDFPVQ YALIVLAAIGGLILLVKKIWWPLAVWLLLIVMNVDAGTPLGGPIGGVAGALGEFFYHD PRRIAAATTLLLMLMAGVALFATVMLLVAAAKRLTDRFRPQPVSVWASATATLLIGAT LVSAWHYFPRHRFLFGDKYDSVMIDQKDLDAMAYLASLPGARDTLIGNANTDGTAWMY AVAGLHPLWTHYDYPLQQGPGYHRFIFWAYGRNGESDPRVLEAIQVLRIRYILTSTPT VRGFAVPDGLVSLETSRSWAKIYDNGEARIYEWRGTAAATHS" CDS 4167696..4168232 /codon_start=1 /transl_table=11 /gene="bpa" /locus_tag="BQ2027_MB3809" /product="Bacterial proteasome activator" /note="Mb3809, -, len: 178 aa. Equivalent to Rv3780, len: 178 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 178 aa overlap). Conserved hypothetical protein, equivalent to Q9CD99|ML0115 HYPOTHETICAL 19.1 KDA PROTEIN from Mycobacterium leprae (174 aa), FASTA scores: opt: 903, E(): 2.3e-48, (82.95% identity in 170 aa overlap). Also highly similar to Q9XA56|SCGD3.23c HYPOTHETICAL 19.5 KDA PROTEIN from Streptomyces coelicolor (179 aa), FASTA scores: opt: 692, E(): 1.8e-35, (65.9% identity in 170 aa overlap). Note that this putative protein is 4 aa longer at the N-terminus compared to previous annotation (in Nature 393: 537-544 (1998)). Protein product from Mb3809 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3809 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65092" /db_xref="InterPro:IPR019695" /db_xref="UniProtKB/Swiss-Prot:P65092" /protein_id="SIU02438.1" /translation="MRKRMVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDL VEQPAKVMRIGTMIKQLLEEVRAAPLDEASRNRLRDIHATSIRELEDGLAPELREELD RLTLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLQQMRQGAL PPGVGKSGQHGHGTGQYL" CDS 4168236..4169057 /codon_start=1 /transl_table=11 /gene="rfbE" /locus_tag="BQ2027_MB3810" /product="PROBABLE O-ANTIGEN/LIPOPOLYSACCHARIDE TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER RFBE" /note="Mb3810, rbfE, len: 273 aa. Equivalent to Rv3781, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 273 aa overlap). Probable rfbE, polysaccharide-transport ATP-binding protein ABC transporter (see citation below), involved in O-antigen/lipopolysaccharides (LPS) transport, equivalent to Q9CDA0|ML0114 PUTATIVE ABC TRANSPORTER ATP-BINDING COMPONENT from Mycobacterium leprae (272 aa), FASTA scores: opt: 1581, E(): 3e-83, (91.4% identity in 267 aa overlap). Also highly similar to AAK71283 LPS/O-ANTIGEN EXPORT PERMEASE from Coxiella burnetii (258 aa), FASTA scores: opt: 793, E(): 2.5e-38, (45.45% identity in 253 aa overlap); Q9PAF0|XF2568 ABC TRANSPORTER ATP-BINDING PROTEIN from Xylella fastidiosa (246 aa), FASTA scores: opt: 758, E(): 2.4e-36, (47.75% identity in 243 aa overlap); Q56903|RFBE_YEREN O-ANTIGEN EXPORT SYSTEM ATP-BINDING PROTEIN from Yersinia enterocolitica (239 aa), FASTA scores: opt: 697, E(): 7e-33, (48.65% identity in 224 aa overlap); Q50863|RFBB_MYXXA O-ANTIGEN EXPORT SYSTEM ATP-BINDING from Myxococcus xanthus (437 aa), FASTA scores: opt: 605, E(): 2e-27, (42.05% identity in 207 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Protein product from Mb3810 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3810 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y583" /db_xref="InterPro:IPR003439" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y583" /protein_id="SIU02439.1" /translation="MSDPHHPHIQTHNAWVEFPIFDAKSRSLKKAVLGKAGGTIGRNN SNVVVIEALRDITMELNLGDRVGLVGHNGAGKSTLLRLLSGIYEPTRGWAKVTGRVAP VFDLGIGMDPEISGYENIIIRGLFLGQTRKQMQAKVDEIAEFTELGEYLSMPLRTYST GMRVRLAMGVVTSIDPEILLLDEGIGAVDADFLRKAQSRLQNLVERSGILVFASHSNE FLARLCKTAIWIDHGVIRLAGGIEEVVRAYEGEDAARHVREVLAETQADRQNVQG" CDS 4169054..4169968 /codon_start=1 /transl_table=11 /gene="glft1" /locus_tag="BQ2027_MB3811" /product="udp-galactofuranosyl transferase glft1" /note="Mb3811, -, len: 304 aa. Equivalent to Rv3782, len: 304 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 304 aa overlap). Possible L-rhamnosyltransferase (EC 2.4.1.-), equivalent to Q9CDA1|RFBE|ML0113 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (283 aa), FASTA scores: opt: 1583, E(): 9.3e-96, (81.6% identity in 277 aa overlap). Also some similarity with AAK68916|WCFN PUTATIVE GLYCOSYLTRANSFERASE from Bacteroides fragilis (291 aa) FASTA scores: opt: 241, E(): 2.1e-08, (30.75% identity in 195 aa overlap); O58161|PH0424 HYPOTHETICAL 40.5 KDA PROTEIN from Pyrococcus horikoshii (348 aa), FASTA scores: opt: 194, E(): 2.8e-05, (23.85% identity in 302 aa overlap); O26448|MTH348 RHAMNOSYL TRANSFERASE from Methanothermobacter thermautotrophicus (313 aa) FASTA scores: opt: 177, E(): 0.00033, (28.2% identity in 333 aa overlap); O07868|CPS19BQ PUTATIVE RHAMNOSYL TRANSFERASE FASTA (300 aa) scores: opt: 156, E(): 0.0074, (25.45% identity in 232 aa overlap); and other putative transferases. Note that C-terminal end shows some similarity with part of Q05161|RFB O-ANTIGEN BIOSYNTHESIS PROTEIN B from Escherichia coli strain 0101. Note that previously known as rfbE. Protein product from Mb3811 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3811 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y572" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/TrEMBL:A0A1R3Y572" /protein_id="SIU02440.1" /translation="MTESVFAVVVTHRRPDELAKSLDVLTAQTRLPDHLIVVDNDGCG DSPVRELVAGQPIATTYLGSRRNLGGAGGFALGMLHALAQGADWVWLADDDGHAQDAR VLATLLACAEKYSLAEVSPMVCNIDDPTRLAFPLRRGLVWRRRASELRTEAGQELLPG IASLFNGALFRASTLAAIGVPDLRLFIRGDEVEMHRRLIRSGLPFGTCLDAAYLHPCG SDEFKPILCGRMHAQYPDDPGKRFFTYRNRGYVLSQPGLRKLLAQEWLRFGWFFLVTR RDPKGLWEWIRLRRLGRREKFGKPGGSA" CDS 4169965..4170807 /codon_start=1 /transl_table=11 /gene="rfbD" /locus_tag="BQ2027_MB3812" /product="PROBABLE O-ANTIGEN/LIPOPOLYSACCHARIDE TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER RFBD" /note="Mb3812, rfbD, len: 280 aa. Equivalent to Rv3783, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 280 aa overlap). Possible rfbD, polysaccharide-transport integral membrane protein ABC transporter (see citation below), involved in O-antigen/lipopolysaccharides (LPS) transport, equivalent to Q9CDA2|ML0112 PUTATIVE ABC TRANSPORTER COMPONENT from Mycobacterium leprae (276 aa), FASTA scores: opt: 1646, E(): 4e-102, (84.3% identity in 280 aa overlap). Also highly similar to Q9PAF1|XF2567 ABC TRANSPORTER PERMEASE PROTEIN from Xylella fastidiosa (267 aa), FASTA scores: opt: 723, E(): 7.6e-41, (41.3% identity in 259 aa overlap); and similar to others e.g. Q56902|RFBD_YEREN O-ANTIGEN EXPORT SYSTEM PERMEASE PROTEIN from Yersinia enterocolitica (259 aa), FASTA scores: opt: 566, E(): 2e-30, (28.05% identity in 264 aa overlap); Q06955|RFBH RFBH PROTEIN (involved in the export of lipopolysaccharide) (alias Q9KVA3|VC0246) LIPOPOLYSACCHARIDE/O-ANTIGEN TRANSPORT PROTEIN from Vibrio cholerae (257 aa), FASTA scores: opt: 358, E(): 1.3e-16, (24.4% identity in 258 aa overlap); Q9HTB8|WZM|PA5451 MEMBRANE SUBUNIT OF A-BAND LPS EFFLUX TRANSPORTER from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 263, E(): 2.7e-10, (25.45% identity in 263 aa overlap); etc. BELONGS TO THE ABC-2 SUBFAMILY OF INTEGRAL MEMBRANE PROTEINS. Protein product from Mb3812 detected using shotgun mass spectrometry. Mb3812 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y577" /db_xref="InterPro:IPR013525" /db_xref="UniProtKB/TrEMBL:A0A1R3Y577" /protein_id="SIU02441.1" /translation="MTFMDAQASFQTQSRTLARVRGDLVDGFRRHELWLHLGWQDIKQ RYRRSVLGPFWITIATGTTAVAMGGLYSKLFRLELSEHLPYVTLGLIVWNLINAAILD GAEVFVANEGLIKQLPAPLSVHVYRLVWRQMIFFAHNIVIYFVIAIIFPKPWSWADLS FLPALALIFLNCVWVSLCFGILATRYRDIGPLLFSVVQLLFFMTPIIWNDETLRRQGA GRWSSIVELNPLLHYLDIVRAPLLGAHQELRHWLVVLVLTVVGWMLAAFAMRQYRARV PYWV" CDS 4170963..4171943 /codon_start=1 /transl_table=11 /gene="rfbB" /locus_tag="BQ2027_MB3813" /product="possible dtdp-glucose 4,6-dehydratase" /note="Mb3813, rfbB, len: 326 aa. Equivalent to Rv3784, len: 326 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 326 aa overlap). Possible rfbB, dTDP-glucose 4,6-dehydratase (EC 4.2.1.46), similar to others e.g. Q9YCT1|APE1180 LONG HYPOTHETICAL DTDP-GLUCOSE 4,6-DEHYDRATASE from Aeropyrum pernix (330 aa) FASTA scores: opt: 598, E(): 3.7e-30, (34.9% identity in 315 aa overlap); O27817|MTH1789 DTDP-GLUCOSE 4,6-DEHYDRATASE from Methanothermobacter thermautotrophicus (336 aa) FASTA scores: opt: 587, E(): 1.8e-29, (34.9% identity in 315 aa overlap); Q9X5W0|GRSE TDP-GLUCOSE-4,6-DEHYDRATASE HOMOLOG from Streptomyces griseus (324 aa), FASTA scores: opt: 583, E(): 3.2e-29, (35.7% identity in 325 aa overlap); Q9K7J7|SPSJ|BH3364 SPORE COAT POLYSACCHARIDE SYNTHESIS (DTDP GLUCOSE 4, 6-DEHYDRATASE) from Bacillus halodurans (321 aa), FASTA scores: opt: 562, E(): 6.5e-28, (33.0% identity in 318 aa overlap); Q9UZH2|RFBB|PAB0785 DTDP-GLUCOSE 4,6-DEHYDRATASE from Pyrococcus abyssi (333 aa), FASTA scores: opt: 552, E(): 2.8e-27, (33.95% identity in 318 aa overlap); P27830|RFFG_ECOLI|B3788 DTDP-GLUCOSE 4,6-DEHYDRATASE from Escherichia coli strain K12 (355 aa), FASTA scores: opt: 401, E(): 7.5e-28, (31.3% identity in 348 aa overlap); etc. But also similar to several UDP-glucose 4-epimerases (EC 5.1.3.2) and other proteins e.g. O59375|PH1742 LONG HYPOTHETICAL UDP-GLUCOSE 4-EPIMERASE from Pyrococcus horikoshii (306 aa) FASTA scores: opt: 600, E(): 2.6e-30, (34.5% identity in 313 aa overlap); Q9ZGC7|LANH14 NDP-HEXOSE 4,6-DEHYDRATASE HOMOLOGfrom Streptomyces cyanogenus (326 aa), FASTA scores: opt: 593, E(): 7.6e-30, (36.45% identity in 321 aa overlap); Q57664|GALE_METJA|MJ0211 PUTATIVE UDP-GLUCOSE 4-EPIMERASE from Methanococcus jannaschii (305 aa) FASTA scores: opt: 575, E(): 9.6e-29, (32.6% identity in 313 aa overlap); etc. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY, DTDP-GLUCOSE DEHYDRATASE SUBFAMILY. Note that previously known as epiB. Protein product from Mb3813 detected using SWATH mass spectrometry. Mb3813 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR016040" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/TrEMBL:A0A1R3Y576" /protein_id="SIU02442.1" /translation="MEILVTGGAGFQGSHLTESLLANGHWVTVLDKSSRNAVRNMQGF RSHDRAAFISGSVTDGQTIDRAVRDHHVVFHLAAHVNVDQSLGDPESFLETNVMGTYR VLEAVRRYRNRLIYVSTCEVYGDGHNLKEGERLDEHAELKPNSPYGASKAAADRLCYS YFRSYGLDVTIVRPFNIFGVRQKAGRFGALIPRLVRQGINGEGLTIFGAGSATRDYLY VSDIVGAYNLVLRTPTLRGQAINFASGKDTRVRDIVEYVADKFGARIEHRDARPGEVQ RFPADISLAKSIGFQPQVEIWDGIDRYINWAKDQPQYPYEQDGFSGSSVL" CDS 4172027..4173100 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3814" /product="Nucleoside-diphosphate-sugar epimerases" /note="Mb3814, -, len: 357 aa. Equivalent to Rv3785, len: 357 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 357 aa overlap). Hypothetical unknown protein. Note that this putative protein is equivalent to AAK48258|MT3893 NAD-DEPENDENT EPIMERASE/DEHYDRATASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (712 aa), but shorter 355 aa. Mb3814 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/Swiss-Prot:P65094" /protein_id="SIU02443.1" /translation="MVTVARRPVCPVTLTPGDPALASVRDLVDAWSAHDALAELVTMF GGAFPQTDHLEARLASLDKFSTAWDYRARARAARALHGEPVRCQDSGGGARWLIPRLD LPAKKRDAIVGLAQQLGLTLESTPQGTTFDHVLVIGTGRHSNLIRARWARELAKGRQV GHIVLAAASRRLLPSEDDAVAVCAPGARTEFELLAAAARDAFGLDVHPAVRYVRQRDD NPHRDSMVWRFAADTNDLGVPITLLEAPSPEPDSSRATSADTFTFTAHTLGMQDSTCL LVTGQPFVPYQNFDALRTLALPFGIQVETVGFGIDRYDGLGELDQQHPAKLLQEVRST IRAARALLERIEAGERMATDPRR" CDS complement(4173081..4174304) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3815C" /product="Membrane proteins related to metalloendopeptidases" /note="Mb3815c, -, len: 407 aa. Equivalent to Rv3786c, len: 407 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 407 aa overlap). Hypothetical unknown protein. Segment between aa 265-300 (approximatively) is highly similar to part of O03937|RORF1608 MINOR CAPSID PROTEIN from Bacteriophage phig1e (1608 aa), FASTA scores: opt: 242, E(): 8.4e-07, (26.85% identity in 272 aa overlap); Q9ETT9|ORF36 PUTATIVE PEPTIDASE from Corynebacterium equii (Rhodococcus equi) plasmid pREAT701 (p33701) and Plasmid virulence (546 aa), FASTA scores: opt: 231, E(): 1.6e-06, (34.15% identity in 167 aa overlap); O69910|SC2E1.40c HYPOTHETICAL 22.8 KDA PROTEIN. from Streptomyces coelicolor (226 aa) FASTA scores: opt: 218, E(): 4.6e-06, (34.15% identity in 164 aa overlap); and others. Protein product from Mb3815c detected using SWATH mass spectrometry. Mb3815c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR011055" /db_xref="InterPro:IPR016047" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P0A5H2" /protein_id="SIU02444.1" /translation="MRILAMTRAHNAGRTLAATLDSLAVFSDDIYVIDDRSTDDTAEI LANHPAVTNVVRARPDLPPTPWLIPESAGLELLYRMADFCRPDWVMMVDADWLVETDI DLRAVLARTPDDIVALMCPMVSRWDDPEYPDLIPVMGTAEALRGPLWRWYPGLRAGGK LMHNPHWPANITDHGRIGQLPGVRLVHSGWSTLAERILRVEHYLRLDPDYRFNFGVAY DRSLLFGYALDEVDLLKADYRRRIRGDFDPLEPGGRLPIDREPRAIGRGYGPHAGGFH PGVDFATDPGTPVYAVASGAVSAIDEVDGLVSLTIARCELDVVYVFRPGDEGRLVLGD RIAAGAQLGTIGAQGESADGYLHFEVRTQDGHVNPVRYLANMGLRPWPPPGRLRAVSG SYPPATPCTITAEDR" CDS complement(4174317..4175243) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3816C" /product="O-methyltransferase" /note="Mb3816c, -, len: 308 aa. Equivalent to Rv3787c, len: 308 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 308 aa overlap). Conserved hypothetical protein, highly similar to several mycobacterial hypothetical proteins e.g. P95074|Rv0726c|MTCY210.45c from Mycobacterium tuberculosis (367 aa), FASTA scores: opt: 1038, E(): 1.6e-58, (55.85% identity in 283 aa overlap); O53795|MBE50c|Rv0731c|MTV041.05c from Mycobacterium tuberculosis (318 aa), FASTA scores: opt: 1030, E()|Rv0731c|MTV041.05c from Mycobacterium tuberculosis (318 aa), FASTA scores: opt: 1030, E(): 4.5e-58, (56.15% identity in 292 aa overlap); Q9CCZ4|ML2640 from Mycobacterium leprae (310 aa) FASTA scores: opt: 709, E(): 9.9e-38, (43.75% identity in 279 aa overlap). Mb3816c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TVN7" /db_xref="InterPro:IPR007213" /db_xref="InterPro:IPR011610" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:Q7TVN7" /protein_id="SIU02445.1" /translation="MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEP LVRAVGVEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAAGV RQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQPTADLRMVPAD LRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLLDHVTDLSAPGSRLALEAF LGSADRDSARVEEMIRTATRGWREHGFHLDIWALNYAGPRHEVSGYLDNHGWRSVGTT TAQLLAAHDLPAAPALPAGLADRPNYWTCVLG" CDS 4175487..4175972 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3817" /product="Nucleoside diphosphate kinase regulator" /note="Mb3817, -, len: 161 aa. Equivalent to Rv3788, len: 161 aa, from Mycobacetrium tuberculosis strain H37Rv, (100.0% identity in 161 aa overlap). Hypothetical unknown protein. Protein product from Mb3817 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3817 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65096" /db_xref="InterPro:IPR001437" /db_xref="InterPro:IPR023459" /db_xref="InterPro:IPR036953" /db_xref="UniProtKB/Swiss-Prot:P65096" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02446.1" /translation="MSEKVESKGLADAARDHLAAELARLRQRRDRLEVEVKNDRGMIG DHGDAAEAIQRADELAILGDRINELDRRLRTGPTPWSGSETLPGGTEVTLRFPDGEVV TMHVISVVEETPVGREAETLTARSPLGQALAGHQPGDTVTYSTPQGPNQVQLLAVKLP S" CDS 4176081..4176446 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3818" /product="gtra family protein" /note="Mb3818, -, len: 121 aa. Equivalent to Rv3789, len: 121 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 121 aa overlap). Conserved hypothetical protein, equivalent to Q9CDA3|ML0110 HYPOTHETICAL 13.9 KDA PROTEIN from Mycobacterium leprae (123 aa) FASTA scores: opt: 587, E(): 7.3e-34, (72.95% identity in 122 aa overlap). Equivalent to AAK48262 from Mycobacterium tuberculosis strain CDC1551 (142 aa) but shorter 21 aa. Mb3818 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P64293" /db_xref="InterPro:IPR007267" /db_xref="UniProtKB/Swiss-Prot:P64293" /protein_id="SIU02447.1" /translation="MRFVVTGGLAGIVDFGLYVVLYKVAGLQVDLSKAISFIVGTITA YLINRRWTFQAEPSTARFVAVMLLYGITFAVQVGLNHLCLALLHYRAWAIPVAFVIAQ GTATVINFIVQRAVIFRIR" CDS 4176486..4177871 /codon_start=1 /transl_table=11 /gene="dpre1" /locus_tag="BQ2027_MB3819" /product="decaprenylphosphoryl-beta-d-ribose 2'-oxidase" /note="Mb3819, -, len: 461 aa. Equivalent to Rv3790, len: 461 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 461 aa overlap). Probable oxidoreductase (EC 1.-.-.-), equivalent to Q9CDA4|ML0109 PUTATIVE FAD-LINKED OXIDOREDUCTASE from Mycobacterium leprae (460 aa), FASTA scores: opt: 2722, E(): 1.4e-161, (86.55% identity in 461 aa overlap). Also highly similar to others e.g. Q9KZA4|SC5G8.10c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (457 aa), FASTA scores: opt: 1336, E(): 1.7e-75, (47.1% identity in 452 aa overlap); Q98KY4|MLL1265 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (449 aa), FASTA scores: opt: 636, E(): 4.9e-32, (36.0% identity in 439 aa overlap); Q9HDX8|SPAPB1A10.12c PUTATIVE D-ARABINONO-1,4-LACTONE OXIDASE from Schizosaccharomyces pombe (Fission yeast) (461 aa), FASTA scores: opt: 297, E(): 5.6e-11, (23.55% identity in 467 aa overlap); etc. C-terminal end has a high similarity to Q9AQD0 PUTATIVE OXIDOREDUCTASE (FRAGMENT) from Mycobacterium smegmatis (149 aa) FASTA scores: opt: 901, E(): 6.5e-49, (86.6% identity in 149 aa overlap). Protein product from Mb3819 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3819 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y598" /db_xref="InterPro:IPR006094" /db_xref="InterPro:IPR007173" /db_xref="InterPro:IPR016166" /db_xref="InterPro:IPR016169" /db_xref="InterPro:IPR036318" /db_xref="UniProtKB/TrEMBL:A0A1R3Y598" /protein_id="SIU02448.1" /translation="MLSVGATTTATRLTGWGRTAPSVANVLRTPDAEMIVKAVARVAE SGGGRGAIARGLGRSYGDNAQNGGGLVIDMTPLNTIHSIDADTKLVDIDAGVNLDQLM KAALPFGLWVPVLPGTRQVTVGGAIACDIHGKNHHSAGSFGNHVRSMDLLTADGEIRH LTPTGEDAELFWATVGGNGLTGIIMRATIEMTPTSTAYFIADGDVTASLDETIALHSD GSEARYTYSSAWFDAISAPPKLGRAAVSRGRLATVEQLPAKLRSEPLKFDAPQLLTLP DVFPNGLANKYTFGPIGELWYRKSGTYRGKVQNLTQFYHPLDMFGEWNRAYGPAGFLQ YQFVIPTEAVDEFKKIIGVIQASGHYSFLNVFKLFGPRNQAPLSFPIPGWNICVDFPI KDGLGKFVSELDRRVLEFGGRLYTAKDSRTTAETFHAMYPRVDEWISVRRKVDPLRVF ASDMARRLELL" CDS 4177872..4178636 /codon_start=1 /transl_table=11 /gene="dpre2" /locus_tag="BQ2027_MB3820" /product="decaprenylphosphoryl-d-2-keto erythro pentose reductase" /note="Mb3820, -, len: 254 aa. Equivalent to Rv3791, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to Q9CDA5|ML0108 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (254 aa), FASTA scores: opt: 1458, E(): 1.6e-83, (89.0% identity in 254 aa overlap); and O05764 PUTATIVE PROTEIN BELONGING TO THE SHORT-CHAIN ALCOHOL DEHYDROGENASE from Mycobacterium smegmatis (254 aa), FASTA scores: opt: 1412, E(): 1.2e-80, (85.05% identity in 254 aa overlap). Also highly similar to Q9KZA5|SC5G8.09c PUTATIVE SHORT-CHAIN DEHYDROGENASE from Streptomyces coelicolor (256 aa), FASTA scores: opt: 733, E(): 1.8e-38, (45.3% identity in 254 aa overlap); and P43168|YMP3_STRCO HYPOTHETICAL OXIDOREDUCTASE from Streptomyces coelicolor (251 aa), FASTA scores: opt: 623, E(): 1.2e-31, (42.15% identity in 254 aa overlap); and similar to various oxidoreductases (principally acetoacetyl-CoA reductases) e.g. P14697|PHBB_ALCEU ACETOACETYL-COA REDUCTASE (EC 1.1.1.36) (246 aa) from Alcaligenes eutrophus (Ralstonia eutropha) (246 aa) FASTA scores: opt: 264, E(): 2.3e-09, (29.9% identity in 204 aa overlap); P45375|PHBB_CHRVI ACETOACETYL-COA REDUCTASE from Chromatium vinosum (246 aa), FASTA scores: opt: 261, E(): 3.5e-09, (27.45% identity in 226 aa overlap); Q9RT30|DR1938 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Deinococcus radiodurans (283 aa), FASTA scores: opt: 251, E(): 1.7e-08, (27.55% identity in 236 aa overlap); etc. Also similar to Q10681|YK73_MYCTU|Rv2073c|MT2133|MTCY49.12 PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mycobacterium tuberculosis (249 aa), FASTA scores: opt: 589, E(): 1.5e-29, (41.25% identity in 252 aa overlap). Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Protein product from Mb3820 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3820 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P66784" /db_xref="InterPro:IPR002347" /db_xref="InterPro:IPR020904" /db_xref="InterPro:IPR036291" /db_xref="UniProtKB/Swiss-Prot:P66784" /protein_id="SIU02449.1" /translation="MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPD DPRREDAAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLGDA EELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAGERVRRANFVY GSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAHLKEAPLTVDKEYVANLAV TASAKGKELVWAPAAFRYVMMVLRHIPRSIFRKLPI" CDS 4178639..4180570 /codon_start=1 /transl_table=11 /gene="afta" /locus_tag="BQ2027_MB3821" /product="arabinofuranosyltransferase afta" /note="Mb3821, -, len: 643 aa. Equivalent to Rv3792, len: 643 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 643 aa overlap). Probable conserved transmembrane protein, equivalent, but longer 21 aa, to Q9CDA6|ML0107 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (632 aa), FASTA scores: opt: 1981, E(): 2.1e-110, (77.5% identity in 631 aa overlap). C-terminal end highly similar to C-terminus of O05765 PUTATIVE PRODUCT ORF 3 from Mycobacterium smegmatis (603 aa), FASTA scores: opt: 1261, E(): 1.4e-67, (70.7% identity in 266 aa overlap). Protein product from Mb3821 detected using SWATH mass spectrometry. Mb3821 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVN5" /db_xref="InterPro:IPR020959" /db_xref="InterPro:IPR020963" /db_xref="UniProtKB/Swiss-Prot:Q7TVN5" /protein_id="SIU02450.1" /translation="MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAV VSLLAIARVEWPAFPSSNQLHALTTVGQVGCLAGLVGIGWLWRHGRFRRLARLGGLVL VSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTYIGLPPFYPPG WFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWWRMIRFEYALLVTVATAAV MLAYSSPEPYAAMITVLLPPMLVLTWSGLGARDRQGWAAVVGAGVFLGFAATWYTLLV AYGAFTVVLMALLLAGSRLQSGIKAAVDPLCRLAVVGAIAAAIGSTTWLPYLLRAARD PVSDTGSAQHYLPADGAALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVL AVYLWSLLSMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMA AAIGLAGAIAFSQDIPDVLRPDLTIAYTDTDGYGQRGDRRPPGSEKYYPAIDAAIRRV TGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDKRATQIDSWSGLSTADE FIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDVYPNQPNVRRYTVDLRTALFADPRF VVEDIGPFVLAIRKPQESA" CDS 4180570..4183854 /codon_start=1 /transl_table=11 /gene="embC" /locus_tag="BQ2027_MB3822" /product="INTEGRAL MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBC (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE)" /note="Mb3822, embC, len: 1094 aa. Equivalent to Rv3793, len: 1094 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1094 aa overlap). embC, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa) FASTA scores: opt: 6078,E(): 0, (82.95% identity in 1072 aa overlap); Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 5523, E(): 0, (75.35% identity in 1072 aa overlap). Also similar to Q9CDA9|EMBB| ML0104 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1083 aa), FASTA scores: opt: 2789, E(): 1.9e-156, (44.0% identity in 1095 aa overlap); O30406|EMBB PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1082 aa), FASTA scores: opt: 2746, E(): 6.4e-154, (44.6% identity in 1096 aa overlap); etc. Also similar to to P72030|EMBB|Rv3795|MTCY13D12.29 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1098 aa), FASTA scores: opt: 2276, E(): 3.1e-126, (44.45% identity in 1118 aa overlap); and P72060|EMBA|Rv3794|MTCY13D12.28 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1974, E(): 1.9e-108, (41.0% identity in 1110 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature; and PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3822 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3822 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVN4" /db_xref="InterPro:IPR007680" /db_xref="InterPro:IPR027451" /db_xref="InterPro:IPR032731" /db_xref="InterPro:IPR040920" /db_xref="InterPro:IPR042486" /db_xref="UniProtKB/Swiss-Prot:Q7TVN4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02451.1" /translation="MATEAAPPRIAVRLPSTSVRDAGANYRIARYVAVVAGLLGAVLA IATPLLPVNQTTAQLNWPQNGTFASVEAPLIGYVATDLNITVPCQAAAGLAGSQNTGK TVLLSTVPKQAPKAVDRGLLLQRANDDLVLVVRNVPLVTAPLSQVLGPTCQRLTFTAH ADRVAAEFVGLVQGPNAEHPGAPLRGERSGYDFRPQIVGVFTDLAGPAPPGLSFSASV DTRYSSSPTPLKMAAMILGVALTGAALVALHILDTADGMRHRRFLPARWWSIGGLDTL VIAVLVWWHFVGANTSDDGYILTMARVSEHAGYMANYYRWFGTPEAPFGWYYDLLALW AHVSTASIWMRLPTLAMALTCWWVISREVIPRLGHAVKTSRAAAWTAAGMFLAVWLPL DNGLRPEPIIALGILLTWCSVERAVATSRLLPVAIACIIGALTLFSGPTGIASIGALL VAIGPLRTILHRRSRRFGVLPLVAPILAAATVTAIPIFRDQTFAGEIQANLLKRAVGP SLKWFDEHIRYERLFMASPDGSIARRFAVLALVLALAVSVAMSLRKGRIPGTAAGPSR RIIGITIISFLAMMFTPTKWTHHFGVFAGLAGSLGALAAVAVTGAAMRSRRNRTVFAA VVVFVLALSFASVNGWWYVSNFGVPWSNSFPKWRWSLTTALLELTVLVLLLAAWFHFV ANGDGRRTARPTRFRARLAGIVQSPLAIATWLLVLFEVVSLTQAMISQYPAWSVGRSN LQALAGKTCGLAEDVLVELDPNAGMLAPVTAPLADALGAGLSEAFTPNGIPADVTADP VMERPGDRSFLNDDGLITGSEPGTEGGTTAAPGINGSRARLPYNLDPARTPVLGSWRA GVQVPAMLRSGWYRLPTNEQRDRAPLLVVTAAGRFDSREVRLQWATDEQAAAGHHGGS MEFADVGAAPAWRNLRAPLSAIPSTATQVRLVADDQDLAPQHWIALTPPRIPRVRTLQ NVVGAADPVFLDWLVGLAFPCQRPFGHQYGVDETPKWRILPDRFGAEANSPVMDHNGG GPLGITELLMRATTVASYLKDDWFRDWGALQRLTPYYPDAQPADLNLGTVTRSGLWSP APLRRG" CDS 4183940..4187224 /codon_start=1 /transl_table=11 /gene="embA" /locus_tag="BQ2027_MB3823" /product="INTEGRAL MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBA (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE)" /note="Mb3823, embA, len: 1094 aa. Equivalent to Rv3794, len: 1094 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1094 aa overlap). embA, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to P71485|EMBA ARABINOSYL TRANSFERASE from Mycobacterium avium (1108 aa), FASTA scores: opt: 5024, E(): 0, (81.9% identity in 1109 aa overlap); Q9CDA8|EMBA|ML0105 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1111 aa), FASTA scores: opt: 4782, E(): 0, (78.6% identity in 1111 aa overlap); Q50394|EMBA PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1092 aa), FASTA scores: opt: 4100, E(): 0, (67.4% identity in 1092 aa overlap). Also similar to Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa), FASTA scores: opt: 1933, E(): 1.5e-100, (40.6% identity in 1108 aa overlap); Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 1870, E(): 5.1e-97, (41.4% identity in 1113 aa overlap); etc. Also similar to P72059|EMBC|Rv3793|MTCY13D12.27 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1974, E(): 7.7e-103, (40.9% identity in 1110 aa overlap); and P72030|EMBB|Rv3795|MTCY13D12.29 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1098 aa), FASTA scores: opt: 1288, E(): 2.1e-64, (42.5% identity in 1114 aa overlap). Supposed regulated by embR|Rv1267c. Protein product from Mb3823 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3823 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A561" /db_xref="InterPro:IPR007680" /db_xref="InterPro:IPR027451" /db_xref="InterPro:IPR032731" /db_xref="InterPro:IPR040920" /db_xref="InterPro:IPR042486" /db_xref="UniProtKB/Swiss-Prot:P0A561" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02452.1" /translation="MPHDGNERSHRIARLAAVVSGIAGLLLCGIVPLLPVNQTTATIF WPQGSTADGNITQITAPLVSGAPRALDISIPCSAIATLPANGGLVLSTLPAGGVDTGK AGLFVRANQDTVVVAFRDSVAAVAARSTIAAGGCSALHIWADTGGAGADFMGIPGGAG TLPPEKKPQVGGIFTDLKVGAQPGLSARVDIDTRFITTPGALKKAVMLLGVLAVLVAM VGLAALDRLSRGRTLRDWLTRYRPRVRVGFASRLADAAVIATLLLWHVIGATSSDDGY LLTVARVAPKAGYVANYYRYFGTTEAPFDWYTSVLAQLAAVSTAGVWMRLPATLAGIA CWLIVSRFVLRRLGPGPGGLASNRVAVFTAGAVFLSAWLPFNNGLRPEPLIALGVLVT WVLVERSIALGRLAPAAVAIIVATLTATLAPQGLIALAPLLTGARAIAQRIRRRRATD GLLAPLAVLAAALSLITVVVFRDQTLATVAESARIKYKVGPTIAWYQDFLRYYFLTVE SNVEGSMSRRFAVLVLLFCLFGVLFVLLRRGRVAGLASGPAWRLIGTTAVGLLLLTFT PTKWAVQFGAFAGLAGVLGAVTAFTFARIGLHSRRNLTLYVTALLFVLAWATSGINGW FYVGNYGVPWYDIQPVIASHPVTSMFLTLSILTGLLAAWYHFRMDYAGHTEVKDNRRN RILASTPLLVVAVIMVAGEVGSMAKAAVFRYPLYTTAKANLTALSTGLSSCAMADDVL AEPDPNAGMLQPVPGQAFGPDGPLGGISPVGFKPEGVGEDLKSDPVVSKPGLVNSDAS PNKPNAAITDSAGTAGGKGPVGINGSHAALPFGLDPARTPVMGSYGENNLAATATSAW YQLPPRSPDRPLVVVSAAGAIWSYKEDGDFIYGQSLKLQWGVTGPDGRIQPLGQVFPI DIGPQPAWRNLRFPLAWAPPEADVARIVAYDPNLSPEQWFAFTPPRVPVLESLQRLIG SATPVLMDIATAANFPCQRPFSEHLGIAELPQYRILPDHKQTAASSNLWQSSSTGGPF LFTQALLRTSTIATYLRGDWYRDWGSVEQYHRLVPADQAPDAVVEEGVITVPGWGRPG PIRALP" CDS 4187221..4190517 /codon_start=1 /transl_table=11 /gene="embB" /locus_tag="BQ2027_MB3824" /product="INTEGRAL MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBB (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE)" /note="Mb3824, embB, len: 1098 aa. Equivalent to Rv3795, len: 1098 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1098 aa overlap). embB, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to P71486|EMBB ARABINOSYL TRANSFERASE from Mycobacterium avium (1065 aa), FASTA scores: opt: 4998, E(): 0, (83.25% identity in 1076 aa overlap); Q9CDA9|EMBB|ML0104 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1083 aa), FASTA scores: opt: 4706, E(): 0, (78.0% identity in 1101 aa overlap); O30406|EMBB (alias Q50395) PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1082 aa), FASTA scores: opt: 4163, E(): 0, (68.4% identity in 1091 aa overlap); etc. Also similar to Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 2482, E(): 5e-135, (44.7% identity in 1101 aa overlap); Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa), FASTA scores: opt: 2259, E(): 3.4e-122, (43.4% identity in 1104 aa overlap); etc. Also similar to P72059|EMBC|Rv3793|MTCY13D12.27 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 2276, E(): 3.6e-123, (44.45% identity in 1118 aa overlap); and P72060|EMBA|Rv3794|MTCY13D12.28 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1288, E(): 2.5e-66, (42.35% identity in 1114 aa overlap). Supposed regulated by embR|Rv1267c. Protein product from Mb3824 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3824 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVN3" /db_xref="InterPro:IPR007680" /db_xref="InterPro:IPR027451" /db_xref="InterPro:IPR032731" /db_xref="InterPro:IPR040920" /db_xref="InterPro:IPR042486" /db_xref="UniProtKB/Swiss-Prot:Q7TVN3" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02453.1" /translation="MTQCASRRKSTPSRAILGAFASARGTRWVATIAGLIGFVLSVAT PLLPVVQTTAMLDWPQRGQLGSVTAPLISLTPVDFTATVPCDVVRAMPPAGGVVLGTA PKQGKDANLQALFVVVSAQRVDVTDRNVVILSVPREQVTSPQCQRIEVTSTHAGTFAN FVGLKDPSGAPLRSGFPDPNLRPQIVGVFTDLTGPAPPGLAVSATIDTRFSTRPTTLK LLAIIGAIVATVVALIALWRLDQLDGRGSIAQLLLRPFRPASSPGGMRRLIPASWRTF TLTDAVVIFGFLLWHVIGANSSDDGYILGMARVADHAGYMSNYFRWFGSPEDPFGWYY NLLALMTHVSDASLWMRLPDLAAGLVCWLLLSREVLPRLGPAVAASKPAYWAAAMVLL TAWMPFNNGLRPEGIIALGSLVTYVLIERSMRYSRLTPAALAVVTAAFTLGVQPTGLI AVAALVAGGRPMLRILVRRHRLVGTLPLVSPMLAAGTVILTVVFADQTLSTVLEATRV RAKIGPSQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCLFTAVFIMLRRKRIPSV ARGPAWRLMGVIFGTMFFLMFTPTKWVHHFGLFAAVGAAMAALTTVLVSPSVLRWSRN RMAFLAALFFLLALCWATTNGWWYVSSYGVPFNSAMPKIDGITVSTIFFALFAIAAGY AAWLHFAPRGAGEGRLIRALTTAPVPIVAGFMAAVFVASMVAGIVRQYPTYSNGWSNV RAFVGGCGLADDVLVEPDTNAGFMKPLDGDSGSWGPLGPLGGVNPVGFTPNGVPEHTV AEAIVMKPNQPGTDYDWDAPTKLTSPGINGSTVPLPYGLDPARVPLAGTYTTGAQQQS TLVSAWYLLPKPDDGHPLVVVTAAGKIAGNSVLHGYTPGQTVVLEYAMPGPGALVPAG RMVPDDLYGEQPKAWRNLRFARAKMPADAVAVRVVAEDLSLTPEDWIAVTPPRVPDLR SLQEYVGSTQPVLLDWAVGLAFPCQQPMLHANGIAEIPKFRITPDYSAKKLDTDTWED GTNGGLLGITDLLLRAHVMATYLSRDWARDWGSLRKFDTLVDAPPAQLELGTATRSGL WSPGKIRIGP" mobile_element 4189149..4190642 /mobile_element_type="insertion sequence:IS1557" /locus_tag="BQ2027_IS1557'-3" /note="IS1557'-3, len: 1494 nt. Equivalent to IS1557, len: 1509 nt, from Mycobacterium tuberculosis strain H37RV,(99.2% identity in 785 nt overlap). IS1557-3rd copy." CDS 4190585..4191712 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3825" /product="Metal-dependent hydrolases of the beta-lactamase superfamily III" /note="Mb3825, -, len: 375 aa. Equivalent to Rv3796, len: 375 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 375 aa overlap). Conserved hypothetical protein. C-terminal end similar in part to Q983J3|MLR8305 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (227 aa), FASTA scores: opt: 288, E(): 4e-09, (38.95% identity in 154 aa overlap). Similar to P54548|YQJK_BACSU HYPOTHETICAL PROTEIN (BELONGS TO THE ATSA/ELAC FAMILY) from Bacillus subtilis (307 aa) FASTA scores: opt: 263, E(): 1.3e-07, (26.1% identity in 295 aa overlap); and some similarity to other proteins e.g. AAK46775|MT2479 PUTATIVE ARYLSULFATASE from Mycobacterium tuberculosis strain CDC1551 (224 aa), FASTA scores: opt: 194, E(): 0.00072, (25.85% identity in 259 aa overlap). Equivalent to AAK48269 from Mycobacterium tuberculosis strain CDC1551 (338 aa) but longer 37 aa. SOME SIMILARITY TO THE A. CARRAGEENOVORA ATSA / E. COLI ELAC FAMILY. Note that previously known as atsH. Protein product from Mb3825 detected using SWATH mass spectrometry. Mb3825 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR036866" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5A9" /protein_id="SIU02454.1" /translation="MLLGMHQAGHVGTHERRAAATRRSALTAAGLAVVGAGVLGASAC SPQKSPQPSSPRLPDNALITLGVAAGPPPTPSRVGISSVLKIGRDLYVIDCGLGSLNA FTNAGLQFDDLKAMFITHLHTDHIVDYYNFFLSGGFLAPPGRAPVLVYGPGPAGGLPP SEVGNPNPATVNPANPTPGLAAATEALHRAFAYTSNIFIRDYGIDNVADLVKVTEIGL PPGSDYRNRAPKMSPFSVASDDNVSVTATLVSHYDVYPAFGFRFDLKKSGVSVTFSGD TTKSDNLITLAQGTDILVHEAVFSLDTAYFGNAFPPNYLVNSHISAEQVGEVAAAAKP KQLILSHYAPDDLPDSQWLDKIKKNYSGMTTIARDGQVFAL" CDS 4191792..4193573 /codon_start=1 /transl_table=11 /gene="fadE35" /locus_tag="BQ2027_MB3826" /product="PROBABLE ACYL-COA DEHYDROGENASE FADE35" /note="Mb3826, fadE35, len: 593 aa. Equivalent to Rv3797, len: 593 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 593 aa overlap). Probable fadE35, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9HY33|PA3593 from Pseudomonas aeruginosa (575 aa) FASTA scores: opt: 838, E(): 2.1e-46, (35.3% identity in 569 aa overlap); Q9ANZ8|AIDB from Burkholderia pseudomallei (Pseudomonas pseudomallei) (554 aa), FASTA scores: opt: 633, E(): 3.4e-33, (33.1% identity in 480 aa overlap); Q9HX44|PA3972 from Pseudomonas aeruginosa (549 aa) FASTA scores: opt: 560, E(): 1.7e-28, (29.9% identity in 569 aa overlap); P33224|AIDB_ECOLI|B4187 from Escherichia coli strain K12 (541 aa), FASTA scores: opt: 455, E(): 1e-21, (31.15% identity in 514 aa overlap); etc. Also similar to O86368|FADE8|Rv0672|MTCI376.02c ACYL-COA DEHYDROGENASE from Mycobacterium tuberculosis (542 aa), FASTA scores: opt: 479, E(): 2.9e-23, (32.2% identity in 460 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. Protein product from Mb3826 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3826 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6S9" /db_xref="InterPro:IPR006091" /db_xref="InterPro:IPR009075" /db_xref="InterPro:IPR009100" /db_xref="InterPro:IPR034184" /db_xref="InterPro:IPR036250" /db_xref="InterPro:IPR041504" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6S9" /protein_id="SIU02455.1" /translation="MPEYDLEAVDKLPFSTPEKAQRYQTENYRGAMGLNWYLTDPTLQ FIMAYYLRPDELAFAEPHLTRIGELTGGPVTRWAEETDRNPPRLERYDRWGHDISRVV LPESFIQSKRAVIEARQAVRDDAARAGVKPSLALFAADYLLNQADIGMACALATGGNM VRSLVTAYAPPDVREFVLGKLNSGEWDGEAAQLLTERAGGSDLGALETTATRSGDVWL LNGFKWFASNCAGEAFVVLAKPEGAPDSTRGVATFLVLRTRRDGSRNGVRIRRLKDKL GTRSVASGEIEFVDAEAFLLSGEPSADAGPSDGKGLTRMMELTNRLRLGTASFALGNA RRALVESLCYAGQRRAFGGALIDKPLMRRKLAEMVVDVEAALAMVFDGFGAANHRQPR CLPQRIAVPVTKLKTCRLGITVASDAIEIHGGNGYIETWPVARLLRDAQVNTIWEGPD NILCLDVRRGIEQTRAHETLLARLRDAVSVSDDDDTTRLVSRRIEDLDAAITAWTKLD RQLAEARLFPLAQFMGDVYAGALLTEQAAWERATRGTDRKALVARLYARRYLADQGPL RGIDADCDEALQRFDELVAGAFTAEQT" repeat_region 4193561..4193587 /rpt_type=INVERTED /note="27 bp imperfect inverted repeat, IRL,CGAGCAGACGTAAAAGCCCCCAATTCG, flanking IS element IS1557'." gene 4193561..4195054 /locus_tag="BQ2027_IS1557'-3" CDS 4193700..4194317 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3827" /product="PUTATIVE TRANSPOSASE [FIRST PART]" /note="Mb3827, -, len: 205 aa. Equivalent to 5' end of Rv3798, len: 444 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 195 aa overlap). Putative transposase for insertion sequence element IS1557, highly similar to Q60255 SIMILAR TO TRANSPOSASE OF ISAE1 FROM ALCALIGENES EUTROPHUS H1-4 (FRAGMENT) from dibenzofuran-degrading bacterium DPO360 (163 aa) FASTA scores: opt: 767, E(): 3.2e-42, (67.25% identity in 168 aa overlap); and similar to P74920 TRANSPOSASE from Thiobacillus ferrooxidans (404 aa), FASTA scores: opt: 375, E(): 1.1e-16, (27.55% identity in 439 aa overlap); Q48349 TRANSPOSASE from Alcaligenes eutrophus (Ralstonia eutropha) (408 aa), FASTA scores: opt: 324, E(): 2e-13, (3.9% identity in 369 aa overlap); Q9FDC1|TNP TRANSPOSASE from Burkholderia mallei (Pseudomonas mallei) (386 aa) FASTA scores: opt: 282, E(): 9.8e-11, (25.85% identity in 391 aa overlap); etc. C-terminal end identical to O53804|Rv0741|MTV041.15 TRANSPOSASE from Mycobacterium tuberculosis (104 aa), FASTA scores: opt: 582, E(): 1.8e-30, (85.6% identity in 104 aa overlap). BELONGS TO THE TRANSPOSASE FAMILY 12. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3798 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 11 bp deletion splits Rv3798 into 2 parts, Mb3827 and Mb3828. Mb3827 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR002560" /db_xref="InterPro:IPR029261" /db_xref="InterPro:IPR032877" /db_xref="UniProtKB/TrEMBL:A0A0G2QBZ2" /protein_id="SIU02456.1" /translation="MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSA VLRRCGRCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVPWA RHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADTEKRIDRFANL RRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPSHPGLVLRCPGR" CDS 4194292..4195023 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3828" /product="PUTATIVE TRANSPOSASE [SECOND PART]" /note="Mb3828, -, len: 243 aa. Equivalent to 3' end of Rv3798, len: 444 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 243 aa overlap). Putative transposase for insertion sequence element IS1557, highly similar to Q60255 SIMILAR TO TRANSPOSASE OF ISAE1 FROM ALCALIGENES EUTROPHUS H1-4 (FRAGMENT) from dibenzofuran-degrading bacterium DPO360 (163 aa) FASTA scores: opt: 767, E(): 3.2e-42, (67.25% identity in 168 aa overlap); and similar to P74920 TRANSPOSASE from Thiobacillus ferrooxidans (404 aa), FASTA scores: opt: 375, E(): 1.1e-16, (27.55% identity in 439 aa overlap); Q48349 TRANSPOSASE from Alcaligenes eutrophus (Ralstonia eutropha) (408 aa), FASTA scores: opt: 324, E(): 2e-13, (3.9% identity in 369 aa overlap); Q9FDC1|TNP TRANSPOSASE from Burkholderia mallei (Pseudomonas mallei) (386 aa) FASTA scores: opt: 282, E(): 9.8e-11, (25.85% identity in 391 aa overlap); etc. C-terminal end identical to O53804|Rv0741|MTV041.15 TRANSPOSASE from Mycobacterium tuberculosis (104 aa), FASTA scores: opt: 582, E(): 1.8e-30, (85.6% identity in 104 aa overlap). BELONGS TO THE TRANSPOSASE FAMILY 12. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv3798 exists as a single gene. In Mycobacterium bovis, a frameshift due to a 11 bp deletion splits Rv3798 into 2 parts, Mb3827 and Mb3828." /db_xref="InterPro:IPR002560" /db_xref="UniProtKB/TrEMBL:A0A0G2QBZ3" /protein_id="SIU02457.1" /translation="MFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFH VVAWATEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGARYAL WKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGKQALDRWISWAQ RCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNTKIRLLTRIAFGFRSPQALI ALAMLTLAGHRPTLPGRHNHPQISQ" repeat_region complement(4195028..4195054) /rpt_type=INVERTED /note="27 bp imperfect inverted repeat, IRR,CGAGCAGACGTAAAAGCCCCCATTTCG, flanking IS element IS1557'." CDS complement(4195065..4196633) /codon_start=1 /transl_table=11 /gene="accD4" /locus_tag="BQ2027_MB3829C" /product="PROBABLE PROPIONYL-COA CARBOXYLASE BETA CHAIN 4 ACCD4 (PCCASE) (PROPANOYL-COA:CARBON DIOXIDE LIGASE)" /note="Mb3829c, accD4, len: 522 aa. Equivalent to Rv3799c, len: 522 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 522 aa overlap). Probable accD4, propyonyl-CoA carboxylase beta chain 4 (EC 6.4.1.3), equivalent to Q9CDB0|ACCD4|ML0102 PUTATIVE ACYL COA CARBOXYLASE from Mycobacterium leprae (517 aa) FASTA scores: opt: 3154, E(): 8e-187, (91.2% identity in 511 aa overlap) . Also similar to many e.g. Q9X4K7|PCCB from Streptomyces coelicolor (530 aa), FASTA scores: opt: 1714, E(): 4.4e-98, (50.0% identity in 510 aa overlap); P53003|PCCB_SACER from Saccharopolyspora erythraea (Streptomyces erythraeus) (546 aa), FASTA scores: opt: 1549, E(): 6.6e-88, (50.65% identity in 519 aa overlap); Q9WZH5|TM0716 from Thermotoga maritima (515 aa) FASTA scores: opt: 1529, E(): 1.1e-86, (46.7% identity in 512 aa overlap); etc. Also similar to P53002|PCCB_MYCLE|ACCD5|PCCB|ML0731|B1308_C1_125 PROBABLE PROPIONYL-COA CARBOXYLASE BETA CHAIN 5 from Mycobacterium leprae (549 aa), FASTA scores: opt: 1493, E(): 1.9e-84, (49.8% identity in 514 aa overlap); and P96885|PCC5_MYCTU|ACCD5|PCCB|Rv3280|MT3379.1|MTCY71.20 PROBABLE PROPIONYL-COA CARBOXYLASE BETA CHAIN 5 from Mycobacterium tuberculosis (548 aa), FASTA scores: opt: 1471, E(): 4.2e-83, (49.15% identity in 515 aa overlap). BELONGS TO THE ACCD/PCCB FAMILY. Protein product from Mb3829c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3829c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B0" /db_xref="InterPro:IPR011762" /db_xref="InterPro:IPR011763" /db_xref="InterPro:IPR029045" /db_xref="InterPro:IPR034733" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B0" /protein_id="SIU02458.1" /translation="MTVTEPVLHTTAEKLAELRERLELAKEPGGEKAAAKRDKKGIPS ARARIYELVDPGSFMEIGALCRTPGDPNALYGDGVVTGHGLINGRPVGVFSHDQTVFG GTVGEMFGRKVARLMEWCAMVGCPIVGINDSGGARIQDAVTSLAWYAELGRRHELLSG LVPQISIILGKCAGGAVYSPIQTDLVVAVRDQGYMFVTGPDVIKDVTGEDVSLDELGG ADHQASYGNIHQVVESEAAAYQYVRDFLSFLPSNCFDKPPVVNPGLEPEITGHDLELD SIVPDSDNMAYDMHEVLLRIFDDGDFLDVAAQAGQAIITGYARVDGRTVGVVANQPMH MSGAIDNEASDKAARFIRFSDAFDIPLVFVVDTPGFLPGVEQEKNGIIKRGGRFLYAV VEADVPKVTITIRKSYGGAYAVMGSKQLTADLNFAWPTARIAVIGADGAAQLLMKRFP DPNAPEAQAIRKSFVENYNLNMAIPWIAAERGFIDAVIDPHETRLLLRKSMHLLRDKQ LWWRVGRKHGLIPV" CDS complement(4196630..4201831) /codon_start=1 /transl_table=11 /gene="pks13" /locus_tag="BQ2027_MB3830C" /product="POLYKETIDE SYNTHASE PKS13" /note="Mb3830c, pks13, len: 1733 aa. Equivalent to Rv3800c, len: 1733 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1733 aa overlap). Probable pks13, polyketide synthase (EC undetermined), equivalent to Q9CDB1|PKS13|ML0101 POLYKETIDE SYNTHASE from Mycobacterium leprae (1784 aa), FASTA scores: opt: 7454, E(): 0, (83.6% identity in 1748 aa overlap); and similar to Q9Z5K6|ML2357|MLCB12.02c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1871 aa), FASTA scores: opt: 1682, E(): 1.2e-85, (38.3% identity in 1096 aa overlap). Also similar in part to many e.g. Q9ADL6|SORA SORAPHEN POLYKETIDE SYNTHASE A from Polyangium cellulosum (6315 aa) FASTA scores: opt: 1422, E(): 1e-70, (31.45% identity in 1616 aa overlap); AAK73501|AMPHI AMPHI PROTEIN (involved in amphotericin biosynthesis) from Streptomyces nodosus (9510 aa), FASTA scores: opt: 1441, E(): 1.2e-71, (30.45% identity in 1662 aa overlap); Q9RFL0|MTAB MTAB PROTEIN (involved in myxothiazol biosynthesis) from Stigmatella aurantiaca (4003 aa), FASTA scores: opt: 1429, E(): 2.8e-71, (33.8% identity in 1089 aa overlap); Q9L4X2|NYSJ from Streptomyces noursei (5435 aa), FASTA scores: opt: 1407, E(): 6.1e-70, (30.5% identity in 1764 aa overlap); CAC37876|SC1G7.01c from Streptomyces coelicolor (3489 aa) FASTA scores: opt: 1382, E(): 1e-68, (31.05% identity in 1489 aa overlap); etc. Also highly similar to Q10977|PPSA_MYCTU|Rv2931|MT3000|MTCY338.20 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE from Mycobacterium tuberculosis (1876 aa), FASTA scores: opt: 1728, E(): 3.4e-88, (36.95% identity in 1269 aa overlap); and P96203|PPSD|Rv2934|MTCY19H9.02. Contains PS00606 Beta-ketoacyl synthases active site. Protein product from Mb3830c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3830c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y596" /db_xref="InterPro:IPR001031" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR029058" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036736" /db_xref="UniProtKB/TrEMBL:A0A1R3Y596" /protein_id="SIU02459.1" /translation="MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDE SVPMVELGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDLAG DDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGITDLPDGRWSEF LEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADNIDPQQRMALELTWEALEH ARIPASSLRGQAVGVYIGSSTNDYSFLAVSDPTVAHPYAITGTSSSIIANRVSYFYDF HGPSVTIDTACSSSLVAIHQGVQALRNGEADVVVAGGVNALITPMVTLGFDEIGAVLA PDGRIKSFSADADGYTRSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLI APNQDAQADVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDFDAMRLKMI TTPTDWPRYGGYALGGVSSFGFGGANAHVVVREVLPRDVVEKEPEPEPEPKAAAEPAE APTLAGHALRFDEFGNIITDSAVAEEPEPELPGVTEEALRLKEAALEELAAQEVTAPL VPLAVSAFLTSRKKAAAAELADWMQSPEGQASSLESIGRSLSRRNHGRSRAVVLAHDH DEAIKGLRAVAAGKQAPNVFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAW IEKVDALVQDELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVI GQSLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEYSADEIR EVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFARKFATKGASHTSQMD PLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGGEPIHDVEYWKKGLRHSVYFTHG IRNAVDSGHTTFLELAPNPVALMQVALTTADAGLHDAQLIPTLARKQDEVSSMVSTMA QLYVYGHDLDIRTLFSRASGPQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVAL PDGRHVWEYAPRDGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHP GGASVQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPETPAED ADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEPEDLPWEVPLIEL GLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEKLIEYAVEHRDEVQQLHEHQK TQTAEEIARAQAELLHGKVGKTEPVDSEAGVALPSPQNGEQPNPTGPALNVDVPPRDA AERVTFATWAIVTGKSPGGIFNELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNI EALADKVRTYLEAGQIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPAD TPMYGFERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLGKD VRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYEQLEELDDEGQ VRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQPYDGHVTLYMADRYHDDAI MFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHIQAIDEPIIAKVGEHMSRALGQIEADR TSEVGKQ" CDS complement(4201838..4203751) /codon_start=1 /transl_table=11 /gene="fadD32" /locus_tag="BQ2027_MB3831C" /product="fatty-acid-amp ligase fadd32 (fatty-acid-amp synthetase) (fatty-acid-amp synthase). also shown to have acyl-acp ligase activity." /note="Mb3831c, fadD32, len: 637 aa. Equivalent to Rv3801c, len: 637 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 637 aa overlap). Probable fadD32, fatty-acid-CoA synthetase (EC 6.2.1.-), equivalent to Q9CDB2|FADD32|ML0100 PUTATIVE ACYL-COA SYNTHETASE from Mycobacterium leprae (635 aa), FASTA scores: opt: 3892, E(): 0, (93.05% identity in 632 aa overlap); and highly similar to others from Mycobacterium leprae. Also similar to others from Mycobacterium tuberculosis e.g. P95288|FADD31|Rv1925|MTCY09F9.39c (620 aa), FASTA scores: opt: 1567, E(): 1.7e-88, (47.05% identity in 612 aa overlap); MTCY338_18, MTCY349_40, MTV005_21, MTCY24G1_8, MTCY19G5_7, MTCY4D9_17; and MBU75685_1 ACYL-COA LIGASE from Mycobacterium bovis. Protein product from Mb3831c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3831c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TTR2" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TTR2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02460.1" /translation="MFVTGESGMAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKL AYRFLDFSTERDGVARDILWSDFSARNRAVGARLQQVTQPGDRVAILCPQNLDYLISF FGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVRKFIRARSAKE RPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRIPSGVQITHLNLPTNVVQV LNALEGQEGDRGVSWLPFFHDMGLITVLLASVLGHSFTFMTPAAFVRRPGRWIRELAR KPGETGGTFSAAPNFAFEHAAVRGVPRDDEPPLDLSNVKGILNGSEPVSPASMRKFFE AFAPYGLKQTAVKPSYGLAEATLFVSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPN AVAQVSAGKVGVSEWAVIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFK NILKSRISESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGRNHYPQDLE CTAQESTKALRVGYVAAFSVPANQLPQTVFDDSHAGLKFDPEDTSEQLVIVGERAAGT HKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGKIGRRACRAAYLDGSLR SGVGSPTVFATSD" CDS complement(4204040..4205050) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3832C" /product="probable conserved membrane protein" /note="Mb3832c, -, len: 336 aa. Equivalent to Rv3802c, len: 336 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 336 aa overlap). Conserved hypothetical protein, equivalent to Q9CDB3|ML0099 HYPOTHETICAL PROTEIN from Mycobacterium leprae (336 aa) FASTA scores: opt: 1759, E(): 1.1e-85, (75.5% identity in 335 aa overlap). Contains probable N-terminal signal sequence followed by Pro-rich region. Protein product from Mb3832c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3832c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B4" /db_xref="InterPro:IPR000675" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B4" /protein_id="SIU02461.1" /translation="MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPS AVPPGVLPPGPTPAHPHKPRPAFQDASCPDVQMISVPGTWESSPQQNPLNPVQFPKAL LLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGTRAMVAAMTDM NNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLVLGVTLIADGRRQQGVGNQ VPPSPRGEGAEITLHEVPVLSGLGLTMTGPRPGGFGALDGRTNEICAQGDLICAAPAQ AFSPANLPTTLNTLAGGAGQPVHAMYATPEFWNSDGEPATEWTLNWAHQLIENAPHPK HR" CDS complement(4205248..4206147) /codon_start=1 /transl_table=11 /gene="fbpD" /locus_tag="BQ2027_MB3833C" /standard_name="mpt51; mpb51; fbpC1" /product="SECRETED MPT51/MPB51 ANTIGEN PROTEIN FBPD (MPT51/MPB51 ANTIGEN 85 COMPLEX C) (AG58C) (MYCOLYL TRANSFERASE 85C) (FIBRONECTIN-BINDING PROTEIN C) (85C)" /note="Mb3833c, fbpD, len: 299 aa. Equivalent to Rv3803c, len: 299 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 299 aa overlap). fbpD (alternate gene names: mpt51, mpb51, fbpC1), secreted MPB51/MPT51 antigen protein (fibronectin-binding protein C) (mycolyl transferase 85C) (EC 2.3.1.-) (see citations below), identical to Q48923|MPT51|MPB51 ANTIGEN PRECURSOR from Mycobacterium bovis (299 aa), FASTA scores: opt: 2093, E(): 1.5e-112, (100.0% identity in 299 aa overlap) (see second citation below); and highly similar to other Mycobacterial antigen precursors e.g. Q05868|MPT5_MYCLE|MPT51|ML0098 MPT51 ANTIGEN PRECURSOR from Mycobacterium leprae (301 aa), FASTA scores: opt: 1624, E(): 9.8e-86, (77.8% identity in 302 aa overlap); O52972|A85C_MYCAV|FBPC ANTIGEN 85-C PRECURSOR (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium avium (352 aa), FASTA scores: opt: 753, E(): 6.6e-36, (41.5% identity in 315 aa overlap); P21160|A85B_MYCKA ANTIGEN 85-B PRECURSOR (FIBRONECTIN-BINDING PROTEIN B) from Mycobacterium kansasii (325 aa), FASTA scores: opt: 574, E(): 1.1e-25, (37.55% identity in 309 aa overlap); P12942|A85B_MYCBO ANTIGEN 85-B PRECURSOR from Mycobacterium bovis (323 aa), FASTA scores: opt: 572, E(): 1.4e-25, (39.85% identity in 291 aa overlap); etc. Also similar to P31953|A85C_MYCTU|FBPC|MPT45|Rv0129c|MTCI5.03c| FBPC2 SECRETED ANTIGEN 85-C (MYCOLYL TRANSFERASE 85C) (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 751, E(): 8.4e-36, (40.65% identity in 310 aa overlap); P17944|A85A_MYCTU|FBPA|MPT44|Rv3804c|MT3911|MTV026.09c SECRETED ANTIGEN 85-A (MYCOLYL TRANSFERASE 85A) (FIBRONECTIN-BINDING PROTEIN A) from Mycobacterium tuberculosis (338 aa), FASTA scores: opt: 592, E(): 1e-26, (39.05% identity in 302 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. Note that the secreted protein MPB51 is one of the major proteins in the culture filtrate of Mycobacterium bovis BCG. Protein product from Mb3833c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3833c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A4V7" /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0A4V7" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02462.1" /translation="MKGRSALLRALWIAALSFGLGGVAVAAEPTAKAAPYENLMVPSP SMGRDIPVAFLAGGPHAVYLLDAFNAGPDVSNWVTAGNAMNTLAGKGISVVAPAGGAY SMYTNWEQDGSKQWDTFLSAELPDWLAANRGLAPGGHAAVGAAQGGYGAMALAAFHPD RFGFAGSMSGFLYPSNTTTNGAIAAGMQQFGGVDTNGMWGAPQLGRWKWHDPWVHASL LAQNNTRVWVWSPTNPGASDPAAMIGQAAEAMGNSRMFYNQYRSVGGHNGHFDFPASG DNGWGSWAPQLGAMSGDIVGAIR" CDS complement(4206327..4207343) /codon_start=1 /transl_table=11 /gene="fbpA" /locus_tag="BQ2027_MB3834C" /standard_name="mpt44; 85A" /product="SECRETED ANTIGEN 85-A FBPA (MYCOLYL TRANSFERASE 85A) (FIBRONECTIN-BINDING PROTEIN A) (ANTIGEN 85 COMPLEX A)" /note="Mb3834c, fbpA, len: 338 aa. Equivalent to Rv3804c, len: 338 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 338 aa overlap). fbpA (alternate gene names: mpt44, 85A), precursor of the 85-A antigen (fibronectin-binding protein A) (mycolyl transferase 85A) (EC 2.3.1.-) (see citations below), identical to P17944|P17996|FBPA|MPT44 ANTIGEN 85-A PRECURSOR from Mycobacterium bovis (338 aa), FASTA scores: opt: 2341, E(): 1.2e-132, (100.0% identity in 338 aa overlap); and highly similar to other Mycobacterial antigen precursors e.g. O52956|A85A_MYCAV|FBPA ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium avium (347 aa), FASTA scores: opt: 1987, E(): 1.7e-111, (82.55% identity in 338 aa overlap); Q05861|A85A_MYCLE|FBPA|ML0097 ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium leprae (330 aa), FASTA scores: opt: 1936, E(): 1.9e-108, (83.0% identity in 329 aa overlap); O06052|A85A_MYCGO|FBPA ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium gordonae (339 aa), FASTA scores: opt: 1932, E(): 3.3e-108, (80.45% identity in 338 aa overlap); etc. Also highly similar to P31952|A85B_MYCTU|FBPB|Rv1886c|MT1934|MTCY180.32 SECRETED ANTIGEN 85-B from Mycobacterium tuberculosis (325 aa), FASTA scores: opt: 1830, E(): 3.9e-102, (78.85% identity in 317 aa overlap); P31953|A85C_MYCTU|FBPC|MPT45|Rv0129c|MTCI5.03c|FBPC2 SECRETED ANTIGEN 85-C from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 1597, E(): 3.4e-88, (67.25% identity in 336 aa overlap). Protein product from Mb3834c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3834c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0C2T1" /db_xref="InterPro:IPR000801" /db_xref="InterPro:IPR006311" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/Swiss-Prot:P0C2T1" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02463.1" /translation="MQLVDRVRGAVTGMSRRLVVGAVGAALVSGLVGAVGGTATAGAF SRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALYLLDGLRAQDDFSGWDINTPAFE WYDQSGLSVVMPVGGQSSFYSDWYQPACGKAGCQTYKWETFLTSELPGWLQANRHVKP TGSAVVGLSMAASSALTLAIYHPQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYK ASDMWGPKEDPAWQRNDPLLNVGKLIANNTRVWVYCGNGKPSDLGGNNLPAKFLEGFV RTSNIKFQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLNAMKPDLQRALGATPNTGPA PQGA" CDS complement(4207638..4209521) /codon_start=1 /transl_table=11 /gene="aftb" /locus_tag="BQ2027_MB3835C" /product="possible arabinofuranosyltransferase aftb" /note="Mb3835c, -, len: 627 aa. Equivalent to Rv3805c, len: 627 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 627 aa overlap). Probable conserved transmembrane protein, equivalent, but shorter 19 aa, to Q9CDB4|ML0096 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (649 aa), FASTA scores: opt: 3511, E(): 1.1e-204, (80.9% identity in 629 aa overlap). Equivalent to AAK48278 from Mycobacterium tuberculosis strain CDC1551 (641 aa) but shorter 14 aa. Protein product from Mb3835c detected using SWATH mass spectrometry. Mb3835c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B7" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B7" /protein_id="SIU02464.1" /translation="MVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLRTVRNLLAG NGPVFNQGERVEANTSTAWTYLLYVGGWVGGPMRLEYVALALAMVLSLLGMVLLMLGT GRLYAPSLRGRRAIMLPAGALVYIAVPPARDFATSGLESGLVLAYLGLLWWMMVCWSQ PLRARPDSQMFLGALAFVAGCSVLVRPEFALIGGLALIMMLIAARTWRRRVLIVLAGG FLPVAYQIFRMGYYGLLVPSTALAKDAAGDKWSQGMIYVSNFNRPYALWVPLVLSVPL GLLLMTARRRPSFLRPVLAPDYGRVARAVQSPPAVVAFIVGSGVLQALYWVRQGGDFM HGRVLLAPLFCLLAPVGVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWSLWAANS PGMGDDATRVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMAAVLTALNNTPEGAL LLPSGNYNQWDLVPMIRPSSGTAPGGKPAPKPQHAVFFTNMGMLGMNVGLDVRVIDQI GLVNPLAAHTERLKHARIGHDKNLFPDWVIADGPWVKWYPGIPGYIDQQWVTQAEAAL QCPATRAVLNSVRAPITLHRFLSNVLHSYEFTRYRIDRVPRYELVRCGLDVPDGPGPP PRE" CDS complement(4209610..4210518) /codon_start=1 /transl_table=11 /gene="ubia" /locus_tag="BQ2027_MB3836C" /product="decaprenylphosphoryl-5-phosphoribose (dppr) synthase (decaprenyl-phosphate 5-phosphoribosyltransferase)" /note="Mb3836c, -, len: 302 aa. Equivalent to Rv3806c, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Possible conserved integral membrane protein, equivalent to Q9CDB5|ML0095 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (302 aa), FASTA scores: opt: 1677, E(): 3.9e-103, (83.75% identity in 302 aa overlap). Also highly similar to others e.g. Q9KZA2|SC5G8.12 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (322 aa), FASTA scores: opt: 937, E(): 2e-54, (51.4% identity in 292 aa overlap); AAK79783|CAC1818 CONSERVED MEMBRANE PROTEIN, POSSIBLE 4-HYDROXYBENZOATE from Clostridium acetobutylicum (290 aa), FASTA scores: opt: 467, E(): 1.5e-23, (26.9% identity in 290 aa overlap); Q98KY3|MLL1266 NODULATION PROTEIN NOEC (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Rhizobium loti (Mesorhizobium loti) (297 aa), FASTA scores: opt: 331, E(): 1.4e-14, (27.4% identity in 299 aa overlap); etc. And highly similar to C-terminal part of Q981F8|MLR9393 NODULATION PROTEIN NOEC (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Rhizobium loti (Mesorhizobium loti) plasmid pMLa (541 aa), FASTA scores: opt: 388, E(): 4e-18, (30.9% identity in 301 aa overlap); and P55585|Y4NM_RHISN INTEGRAL MEMBRANE PROTEIN (POSSIBLE PERMEASE/TRANSPORTER) from Rhizobium sp. strain NGR234 plasmid sym pNGR234a (516 aa), FASTA scores: opt: 380, E(): 1.3e-17, (31.85% identity in 295 aa overlap). Contains PS00225 Crystallins beta and gamma 'Greek key' motif signature. Protein product from Mb3836c detected using SWATH mass spectrometry. Mb3836c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6T2" /db_xref="InterPro:IPR000537" /db_xref="InterPro:IPR039653" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6T2" /protein_id="SIU02465.1" /translation="MSEDVVTQPPANLVAGVVKAIRPRQWVKNVLVLAAPLAALGGGV RYDYVEVLSKVSMAFVVFSLAASAVYLVNDVRDVEADREHPTKRFRPIAAGVVPEWLA YTVAVVLGVTSLAGAWMLTPNLALVMVVYLAMQLAYCFGLKHQAVVDICVVSSAYLIR AIAGGVATKIPLSKWFLLIMAFGSLFMVAGKRYAELHLAERTGAAIRKSLESYTSTYL RFVWTLSATAVVLCYGLWAFERDGYSGSWFAVSMIPFTIAILRYAVDVDGGLAGEPED IALRDRVLQLLALAWIATVGAAVAFG" CDS complement(4210525..4211022) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3837C" /product="POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3837c, -, len: 165 aa. Equivalent to Rv3807c, len: 165 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 165 aa overlap). Possible conserved transmembrane protein, equivalent to Q9CDB6|ML0094 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (192 aa), FASTA scores: opt: 714, E(): 2.4e-38, (72.85% identity in 151 aa overlap). Also highly similar to Q9KZA3|SC5G8.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (169 aa), FASTA scores: opt: 324, E(): 1.1e-13, (41.5% identity in 159 aa overlap); and similar in part to others e.g. Q9K3L3|SCG20A.27 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (230 aa), FASTA scores: opt: 277, E(): 1.3e-10, (41.65% identity in 168 aa overlap); P72269|ORF8 HYPOTHETICAL PROTEIN from Rhodococcus erythropolis (487 aa) FASTA scores: opt: 229, E(): 2.7e-07, (36.25% identity in 149 aa overlap); O86625|SC3A7.24c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (201 aa) FASTA scores: opt: 200, E(): 9.1e-06, (34.95% identity in 146 aa overlap); Q9KYD7|SCD72A.19 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (238 aa) FASTA scores: opt: 178, E(): 0.00026, (35.7% identity in 112 aa overlap); etc. Protein product from Mb3837c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3837c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5L1" /db_xref="InterPro:IPR000326" /db_xref="InterPro:IPR036938" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L1" /protein_id="SIU02466.1" /translation="MVAVQSALVDRPGMLATARGLSHFGEHCIGWLILALLGAIALPR RRREWLVAGAGAFVAHAIAVLIKRLVRRQRPDHPAIAVNVDTPSQLSFPSAHATSTTA AALLMGRATGLPLPVVLVPPMALSRILLGVHYPSDVAVGVALGATVGAIVDSVGGGRQ RARKR" CDS complement(4211051..4212964) /codon_start=1 /transl_table=11 /gene="glft2" /locus_tag="BQ2027_MB3838C" /product="bifunctional udp-galactofuranosyl transferase glft2" /note="Mb3838c, -, len: 637 aa. Equivalent to Rv3808c, len: 637 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 637 aa overlap). Galactofuranosyl transferase (EC 2.-.-.-) (see citations below), equivalent to Q9CDB7|ML0093 HYPOTHETICAL PROTEIN from Mycobacterium leprae (643 aa) FASTA scores: opt: 3751, E(): 0, (85.4% identity in 643 aa overlap). Contains a beta-glycosyltransferase domain A. Protein product from Mb3838c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3838c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B2" /db_xref="InterPro:IPR029044" /db_xref="InterPro:IPR040492" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02467.1" /translation="MSELAASLLSRVILPRPGEPLDVRKLYLEESTTNARRAHAPTRT SLQIGAESEVSFATYFNAFPASYWRRWTTCKSVVLRVQVTGAGRVDVYRTKATGARIF VEGHDFTGTEDQPAAVETEVVLQPFEDGGWVWFDITTDTAVTLHSGGWYATSPAPGTA NIAVGIPTFNRPADCVNALRELTADPLVDQVIGAVIVPDQGERKVRDHPDFPAAAARL GSRLSIHDQPNLGGSGGYSRVMYEALKNTDCQQILFMDDDIRLEPDSILRVLAMHRFA KAPMLVGGQMLNLQEPSHLHIMGEVVDRSIFMWTAAPHAEYDHDFAEYPLNDNNSRSK LLHRRIDVDYNGWWTCMIPRQVAEELGQPLPLFIKWDDADYGLRAAEHGYPTVTLPGA AIWHMAWSDKDDAIDWQAYFHLRNRLVVAAMHWDGPKAQVIGLVRSHLKATLKHLACL EYSTVAIQNKAIDDFLAGPEHIFSILESALPQVHRIRKSYPDAVVLPAASELPPPLHK NKAMKPPVNPLVIGYRLARGIMHNLTAANPQHHRRPEFNVPTQDARWFLLCTVDGATV TTADGCGVVYRQRDRAKMFALLWQSLRRQRQLLKRFEEMRRIYRDALPTLSSKQKWET ALLPAANQEPEHG" CDS complement(4212961..4214160) /codon_start=1 /transl_table=11 /gene="glf" /locus_tag="BQ2027_MB3839C" /standard_name="ceoA" /product="UDP-GALACTOPYRANOSE MUTASE GLF (UDP-GALP MUTASE) (NAD+-FLAVIN ADENINE DINUCLEOTIDE-REQUIRING ENZYME)" /note="Mb3839c, glf, len: 399 aa. Equivalent to Rv3809c, len: 399 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 399 aa overlap). glf (alternate gene name: ceoA), UDP-galactopyranose mutase (EC 5.4.99.9) (see citations below), identical to previously sequenced gene, and equivalent to Q9CDB8|GLF|ML0092 PUTATIVE UDP-GALACTOPYRANOSE MUTASE from Mycobacterium leprae (413 aa), FASTA scores: opt: 2347, E(): 1.3e-140, (86.6% identity in 396 aa overlap). Also highly similar to others e.g. AAK61905|EPSJ UDP-GALACTOPYRANOSE MUTASE (PROTEIN INVOLVED IN EXOPOLYSACCHARIDES BIOSYNTHESIS) from Streptococcus thermophilus (365 aa), FASTA scores: opt: 972, E(): 5.9e-54, (45.85% identity in 375 aa overlap); P37747|GLF_ECOLI|B2036 UDP-GALACTOPYRANOSE MUTASE from Escherichia coli strain K12 (367 aa), FASTA scores: opt: 958, E(): 4.5e-53, (43.55% identity in 379 aa overlap); O86897|CAP33FN from Streptococcus pneumoniae (369 aa) FASTA scores: opt: 954, E(): 8.1e-53, (44.8% identity in 375 aa overlap); etc. COFACTOR: FAD (BY SIMILARITY). N-TERMINAL SHOWS SIMILARITY TO FAD OR NAD CONTAINING PROTEINS. Protein product from Mb3839c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3839c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B9" /db_xref="InterPro:IPR004379" /db_xref="InterPro:IPR015899" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02468.1" /translation="MQPMTARFDLFVVGSGFFGLTIAERVATQLDKRVLVLERRPHIG GNAYSEAEPQTGIEVHKYGAHLFHTSNKRVWDYVRQFTDFTDYRHRVFAMHNGQAYQF PMGLGLVSQFFGKYFTPEQARQLIAEQAAEIDTADAQNLEEKAISLIGRPLYEAFVKG YTAKQWQTDPKELPAANITRLPVRYTFDNRYFSDTYEGLPTDGYTAWLQNMAADHRIE VRLNTDWFDVRGQLRPGSPAAPVVYTGPLDRYFDYAEGRLGWRTLDFEVEVLPIGDFQ GTAVMNYNDLDVPYTRIHEFRHFHPERDYPTDKTVIMREYSRFAEDDDEPYYPINTEA DRALLATYRARAKSETASSKVLFGGRLGTYQYLDMHMAIASALNMYDNVLAPHLRDGV PLLQDGA" CDS 4214424..4215278 /codon_start=1 /transl_table=11 /gene="pirG" /locus_tag="BQ2027_MB3840" /standard_name="erp; P36" /product="EXPORTED REPETITIVE PROTEIN PRECURSOR PIRG (CELL SURFACE PROTEIN) (EXP53)" /note="Mb3840, pirG, len: 284 aa. Equivalent to Rv3810, len: 284 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 284 aa overlap). pirG (alternate gene names: P36 or erp for Exported Repeated Protein), cell surface protein precursor (see citations below), equivalent to P19361|28KD_MYCLE|ML0091 28 KDA ANTIGEN PRECURSOR from Mycobacterium leprae (236 aa), FASTA scores: opt: 555, E(): 9.8e-18, (52.65% identity in 281 aa overlap). Mb3840 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5P5" /db_xref="InterPro:IPR008164" /db_xref="UniProtKB/Swiss-Prot:P0A5P5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02469.1" /translation="MPNRRRRKLSTAMSAVAALAVASPCAYFLVYESTETTERPEHHE FKQAAVLTDLPGELMSALSQGLSQFGINIPPVPSLTGSGDASTGLTGPGLTSPGLTSP GLTSPGLTDPALTSPGLTPTLPGSLAAPGTTLAPTPGVGANPALTNPALTSPTGATPG LTSPTGLDPALGGANEIPITTPVGLDPGADGTYPILGDPTLGTIPSSPATTSTGGGGL VNDVMQVANELGASQAIDLLKGVLMPSIMQAVQNGGAAAPAASPPVPPIPAAAAVPPT DPITVPVA" CDS 4215483..4217102 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3841" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3841, -, len: 539 aa. Equivalent to Rv3811, len: 539 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 539 aa overlap). Conserved hypothetical protein, showing some similarity to Q9KZK5|SCE34.21c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 603, E(): 8.1e-26, (34.4% identity in 404 aa overlap); Q9S2P9|SC5F7.14c HYPOTHETICAL 31.9 KDA PROTEIN from Streptomyces coelicolor (308 aa), FASTA scores: opt: 472, E(): 9.5e-19, (37.5% identity in 208 aa overlap). Middle section (approximatively aa 185-350/390) shows some similarity with Q9GK12 PEPTIDOGLYCAN RECOGNITION PROTEIN PRECURSOR from Camelus dromedarius (Dromedary) (Arabian camel) (193 aa) FASTA scores: opt: 274, E(): 4.6e-08, (32.2% identity in 177 aa overlap); O75594|PGLYRP|PGRP from Homo sapiens (Human) (196 aa), FASTA scores: opt: 272, E(): 6e-08, (30.9% identity in 220 aa overlap); Q9JLN4|PGRP PEPTIDOGLYCAN RECOGNITION PROTEIN from Rattus norvegicus (Rat) (182 aa), FASTA scores: opt: 253, E(): 6.2e-07, (32.15% identity in 171 aa overlap); etc. C-terminal end shows similarity with Q01377|CSP1_CORGL PS1 PROTEIN PRECURSOR (ONE OF THE TWO MAJOR SECRETED PROTEINS) from Corynebacterium glutamicum (Brevibacterium flavum) (657 aa), FASTA scores: opt: 250, E(): 2.7e-06, (39.45% identity in 109 aa overlap). Contains PS00687 Aldehydedehydrogenases glutamic acid active site. Note that previously known as csp. Mb3841 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y590" /db_xref="InterPro:IPR002502" /db_xref="InterPro:IPR006619" /db_xref="InterPro:IPR013207" /db_xref="InterPro:IPR015510" /db_xref="InterPro:IPR036505" /db_xref="UniProtKB/TrEMBL:A0A1R3Y590" /protein_id="SIU02470.1" /translation="MAATVVIVAWIANRPPASSHEPSPTPNTQLAEQPLIGLGGGVTV RELTQDTPFSLVALTGDLAGTSARVRAKRPDGDWGPWYQTEYETEPRDPAGTDGSVEL GGLNPGPRSTDPVFVGTTTTVQVAVTRPIDAPITQPPAGRPPNDLLDSGLGYRPATKE QPFGQNISAILISPPQAPPGTQWTPPTAVTMAGQPPAIISRAEWGADESLRCETPEYD RGVRAAVVHHTAGSNDYSPLESAGIVKAIYTYHSKTLGWCDIAYNALVDKYGQVFEGS AGGLTKPVEGFHTGGFNRNTWGVAMIGNFDDVAPTPIQIRTVGRLLGWRLGMDDVDPR SMVDLQSAGSSYTTFPGGAIARLPAIFTHRDVGNTDCPGNAAYAVMDEIRDIAAHFND PPEELIKALEGGAIYQRWQALGGMNSALGAPTSPEADAADGARYATFAKGAMYWSPVT DAQPITGAIYEAWASQSYERGPLGLPTSAEIQEPLQITQNFQHGTLNFERLTGNVTEV VDGITTPLATRPPSGPTVPPEHFTLPTHPIT" CDS 4217256..4218770 /codon_start=1 /transl_table=11 /gene="PE_PGRS62" /locus_tag="BQ2027_MB3842" /product="pe-pgrs family protein pe_pgrs62" /note="Mb3842, PE_PGRS62, len: 504 aa. Equivalent to Rv3812, len: 504 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 504 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many e.g. P96828|Rv0151c|MTCI5.25c (588 aa), FASTA scores: opt: 389, E(): 6.2e-14, (29.2% identity in 473 aa overlap); MTCY7H7B_27; MTCY493_24; MTCY441_4; MTCY39_36; MTCY1A11_4; MTCY359_33; MTCY130_10; MTCY98_9; etc. Mb3842 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C4" /protein_id="SIU02471.1" /translation="MSFVVTVPEAVAAAAGDLAAIGSTLREATAAAAGPTTGLAAAAA DDVSIAVSQLFGRYGQEFQTVSNQLAAFHTEFVRTLNRGAAAYLNTESANGGQLFGQI EAGQRAVSAAAAAAPGGAYGQLVANTATNLESLYGAWSANPFPFLRQIIANQQVYWQQ IAAALANAVQNFPALVANLPAAIDAAVQQFLAFNAAYYIQQIISSQIGFAQLFATTVG QGVTSVIAGWPNLAAELQLAFQQLLVGDYNAAVANLGKAMTNLLVTGFDTSDVTIGTM GTTISVTAKPKLLGPLGDLFTIMTIPAQEAQYFTNLMPPSILRDMSQNFTNVLTTLSN PNIQAVASFDIATTAGTLSTFFGVPLVLTYATLGAPFASLNAIATSAETIEQALLAGN YLGAVGALIDAPAHALDGFLNSATVLDTPILVPTGLPSPLPPTVGITLHLPFDGILVP PHPVTATISFPGAPVPIPGFPTTVTVFGTPFMGMAPLLINYIPQQLALAIKPAA" CDS complement(4219079..4219900) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3843C" /product="Cof family hydrolase" /note="Mb3843c, -, len: 273 aa. Equivalent to Rv3813c, len: 273 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 273 aa overlap). Conserved hypothetical protein, equivalent to Q9CDB9|ML0089 HYPOTHETICAL PROTEIN from Mycobacterium leprae (281 aa) FASTA scores: opt: 1479, E(): 9.6e-81, (80.45% identity in 271 aa overlap); and similar to Q98LI0|MLL1014 from (280 aa) . Also similar to many hypothetical proteins from several organisms e.g. Q9ZBX2|SCD78.27c from Streptomyces coelicolor (280 aa), FASTA scores: opt: 597, E(): 2.2e-28, (43.25% identity in 266 aa overlap); Q9RXR7|DR0240 from Deinococcus radiodurans (284 aa), FASTA scores: opt: 543, E(): 3.5e-25, (38.65% identity in 264 aa overlap); Q99YH5|SPY1700 from Streptococcus pyogenes (274 aa) FASTA scores: opt: 373, E(): 4.3e-15, (30.75% identity in 270 aa overlap); P70947|YITU from Bacillus subtilis (270 aa) FASTA scores: opt: 353, E(): 6.5e-14, (30.0% identity in 280 aa overlap); etc. Protein product from Mb3843c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3843c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5B8" /db_xref="InterPro:IPR000150" /db_xref="InterPro:IPR006379" /db_xref="InterPro:IPR023214" /db_xref="InterPro:IPR036412" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5B8" /protein_id="SIU02472.1" /translation="MKPTVPALVACDVDGTLLDDGETVTKRTRDAVHAAVDAGTHFIL ATGRPPRWVRPIVDALGFAPMAVCANGAVIYDPGTDRVTSVRTLPVDALATLAEVATR VIPGAGLAVERIGERAHDTATPQFVSSPGYEHAWLNPDNTEVSIDHLLSAPAIKLLIR KAGAASADMAAELAKHVGFEGDITYSTNNGLVEIVPLGISKATGVDEIARPLGISDAE VVAFGDMPNDVPMLLRAGLGVAMGNAHPDALAVADEVTAPNSEDGVARVLERWWS" CDS complement(4219915..4220700) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3844C" /product="possible acyltransferase" /note="Mb3844c, -, len: 261 aa. Equivalent to Rv3814c, len: 261 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 261 aa overlap). Possible acyltransferase (EC 2.3.1.-), highly similar to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa), FASTA scores: opt: 753, E(): 7.7e-42, (46.75% identity in 246 aa overlap). Also highly similar to many acyltransferases and hypothetical proteins e.g. Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 587, E(): 4.6e-31, (41.95% identity in 243 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 293, E(): 6.6e-12, (29.2% identity in 267 aa overlap); Q9PNZ5|AAS|CJ0938 PUTATIVE 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE / ACYL-ACYL CARRIER PROTEIN SYNTHETASE from Campylobacter jejuni (1170 aa), FASTA scores: opt: 274, E(): 3.9e-10, (29.1% identity in 219 aa overlap) (similarity only with middle section); Q9EY25 PUTATIVE ACETYL TRANSFERASE from Xanthomonas oryzae pv. oryzae (249 aa), FASTA scores: opt: 238, E(): 2.4e-08, (29.2% identity in 209 aa overlap); etc. Also highly similar to downstream ORFs O07808|Rv3815c|MTCY409.15 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (251 aa), FASTA scores: opt: 1069, E(): 2.1e-62, (60.4% identity in 245 aa overlap); and O07807|Rv3816c|MTCY409.14 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (259 aa), FASTA scores: opt: 776, E(): 2.5e-43, (50.9% identity in 228 aa overlap). And similar to O53516|Rv2182c|MTV021.15c HYPOTHETICAL 27.0 KDA PROTEIN from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 239, E(): 2e-08, (30.6% identity in 232 aa overlap). Protein product from Mb3844c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3844c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5D0" /db_xref="InterPro:IPR002123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D0" /protein_id="SIU02473.1" /translation="MAEPFFRMMEILVPSIVAANGNKITFEGLENIPERGGALIALNH TSYVDWVPASIAAHHRRRRLRFMIKAEMQDVRAVNYVIKHAQLIPVDRSVGADAYAVA VQRLRAGELVGLHPEATISRSLELREFKTGAARMALEAQVPIIPMIVWGAHRIWPKDH PKNLFRNKIPIVAAIGSPVRPEGNAEQLNAVLRQAMNAILYRVQEEYPHPKGEHWVPR RLGGGAPTVEESRQLRIAELAKRRQKRGYDGVTSSRRSQVGPH" CDS complement(4220718..4221473) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3845C" /product="possible acyltransferase" /note="Mb3845c, -, len: 251 aa. Equivalent to Rv3815c, len: 251 a, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Possible acyltransferase (EC 2.3.1.-), highly similar to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa), FASTA scores: opt: 845, E(): 2.7e-47, (53.25% identity in 246 aa overlap). Also highly similar to Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 656, E(): 3.7e-35, (47.85% identity in 234 aa overlap); and similar to many putative acyltransferases and hypothetical proteins e.g. P74498|SLL1848 HYPOTHETICAL 24.3 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (225 aa) FASTA scores: opt: 275, E(): 1.2e-10, (34.8% identity in 181 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 266, E(): 5.2e-10, (29.7% identity in 229 aa overlap); Q9PNZ5|AAS|CJ0938 PUTATIVE 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE/ ACYL-ACYL CARRIER PROTEIN SYNTHETASE from Campylobacter jejuni (1170 aa), FASTA scores: opt: 264, E(): 2.3e-09, (23.55% identity in 221 aa overlap) (similarity only with middle section); etc. Also highly similar to upstream ORF O07809|Rv3814c|MTCY409.16 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 1069, E(): 1e-61, (60.4% identity in 245 aa overlap); and downstream ORF O07807|Rv3816c|MTCY409.14 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (259 aa) FASTA scores: opt: 847, E(): 2e-47, (55.7% identity in 246 aa overlap). And similar to O53516|Rv2182c|MTV021.15c HYPOTHETICAL 27.0 KDA PROTEIN from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 237, E(): 3.6e-08, (30.9% identity in 233 aa overlap). Protein product from Mb3845c detected using SWATH mass spectrometry. Mb3845c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5C3" /db_xref="InterPro:IPR002123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C3" /protein_id="SIU02474.1" /translation="MAEPTYRVLEILAQLLVLATGTRITYVGEENVPDQGGAVVAINH TSYVDWLPAALAMHRRRRRMRFMIKAEMQRVRLVNFLIRHTRTIPVDRGAGGSAYAVA VQRLREGELVGVYPEATISRSFELKGFKTGAARMAAEADVPIVPVVVWGAQRIWTKDH PRQIGRAKVPVTVQVGRPLRAAAGIEQTNAALRESMTALLWQAQERYPHPAGAYWVPR RLGGGAPTLAEAARMEADEAAARAASRTPHESR" CDS complement(4221477..4222256) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3846C" /product="possible acyltransferase" /note="Mb3846c, -, len: 259 aa. Equivalent to Rv3816c, len: 259 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 259 aa overlap). Possible acyltransferase (EC 2.3.1.-), equivalent to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa) FASTA scores: opt: 1401, E(): 1.5e-80, (81.9% identity in 254 aa overlap). Also highly similar to many putative acyltransferases and hypothetical proteins e.g. Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 758, E(): 2.4e-40, (51.7% identity in 234 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 312, E(): 2e-12, (29.55% identity in 237 aa overlap); O67841|AAS|AQ_2058 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE from Aquifex aeolicus (211 aa), FASTA scores: opt: 281, E(): 1.5e-10, (32.7% identity in 162 aa overlap); etc. Also highly similar to upstream ORFs O07808|Rv3815c|MTCY409.15 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (251 aa), FASTA scores: opt: 847, E(): 6.7e-46, (55.7% identity in 246 aa overlap); and O07809|Rv3814c|MTCY409.16 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 776, E(): 1.9e-41, (50.9% identity in 228 aa overlap). Protein product from Mb3846c detected using shotgun mass spectrometry. Mb3846c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y6U4" /db_xref="InterPro:IPR002123" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6U4" /protein_id="SIU02475.1" /translation="MEPVYGTVIRLARLSWRIQGLKITVTGVDNLPTSGGAVVAINHT SYLDFTFAGLPAYQQGLGRKVRFMAKQEVFDHKITGPIMRSLRHIPVDRQDGSASYDA AVRMLKAGELVGVYPEATISRSFEIKEFKTGAARMAIEAGVPIVPHIVWGAQRIWTKD RPKKLFRPKVPVTIVVGERIEPTLPTAELNGLLHSRMQHLLERAQELYGPHPAGEFWV PHRLGGGAPSLAEAARLDAQEAAVRAARRAQRAHPAGAPEQ" CDS 4222332..4223087 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3847" /product="possible phosphotransferase" /note="Mb3847, -, len: 251 aa. Equivalent to Rv3817, len: 251 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 251 aa overlap). Possible phosphotransferase (EC 2.7.-.-), similar to many phosphotransferases e.g. O53023 KANAMYCIN MARKER from Escherichia coli (264 aa), FASTA scores: opt: 232, E(): 7.5e-08, (32.4% identity in 247 aa overlap); BAA78209|NEO NEOMYCINE PHOSPHOTRANSFERASE from Drosophila melanogaster (Fruit fly) (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); AAG09774 AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Vibrio cholerae (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); P00552|KKA2_KLEPN|NEO|KAN AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Klebsiella pneumoniae (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); etc. Protein product from Mb3847 detected using SWATH mass spectrometry. Mb3847 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5L2" /db_xref="InterPro:IPR002575" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR024165" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L2" /protein_id="SIU02476.1" /translation="MSFPSSPPALPAIVARFAVGRPVRAVWVNELGGVTFRVDSGMGA GCEFIKVARRGTADFANEARRLRWAAPYLAVPRVLGVGVDGDWAWLHTDALPGLSAVH PRWRASPQVAVPALGAGLRTLHDSLPVHSCPFDWSTASRLAKLAPARRAELGDSPPVD RLVVCHGDACSPNTILDDTGRCCGHVDFGNLGVADRWADLAVATLSLQWNFPDYPGQV RDDEFFAAYGVAPDPARIDYYRRLWQAEDDSSR" CDS 4223134..4224684 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3848" /product="Rieske (2Fe-2S) domain protein" /note="Mb3848, -, len: 516 aa. Equivalent to Rv3818, len: 516 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 516 aa overlap). Hypothetical unknown protein. Protein product from Mb3848 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3848 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5C1" /db_xref="InterPro:IPR017941" /db_xref="InterPro:IPR036866" /db_xref="InterPro:IPR036922" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C1" /protein_id="SIU02477.1" /translation="MQVTSVGHAGFLIQTQAGSILCDPWVNPAYFASWFPFPDNSGLD WGALGECDYLYVSHLHKDHFDAENLRAHVNKDAVVLLPDFPVPDLRNELQKLGFHRFF ETTDSVKHRLRGPNGDLDVMIIALRAPADGPIGDSALVVADGETTAFNMNDARPVDLD VLASEFGHIDVHMLQYSGAIWYPMVYDMPARAKDAFGAQKRQRQMDRARQYIAQVGAT WVVPSAGPPCFLAPELRHLNDDGSDPANIFPDQMVFLDQMRAHGQDGGLLMIPGSTAD FTGTTLNSLRHPLPAEQVEAIFTTDKAAYIADYADRMAPVLAAQKAGWAAAAGEPLLQ PLRTLFEPIMLQSNEICDGIGYPVELAIGPETIVLDFPKRAVREPIPDERFRYGFAIA PELVRTVLRDNEPDWVNTIFLSTRFRAWRVGGYNEYLYTFFKCLTDERIAYADGWFAE AHDDSSSITLNGWEIQRRCPHLKADLSKFGVVEGNTLTCNLHGWQWRLDDGRCLTARG HQLRSSRP" CDS 4224681..4225016 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3849" /product="unknown protein" /note="Mb3849, -, len: 111 aa. Equivalent to Rv3819, len: 111 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 111 aa overlap). Hypothetical unknown protein. Contains PS00012 Phosphopantetheine attachment site. Protein product from Mb3849 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3849 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D1" /protein_id="SIU02478.1" /translation="MMQFYDDGVVQLDRAALTLRRYHFPSGTAKVIPLDQIRGYQAES LGFLMARFNIWGRPDLRRWLPLDVYRPLKSTLVTLDVPGMRPKPACTPTRPKEFIALL DELLALHRT" CDS complement(4225104..4226510) /codon_start=1 /transl_table=11 /gene="papA2" /locus_tag="BQ2027_MB3850C" /product="POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA2" /note="Mb3850c, papA2, len: 468 aa. Equivalent to Rv3820c, len: 468 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 468 aa overlap). Possible papA2, conserved polyketide synthase (PKS) associated protein, highly similar to Q49618|PAPA3|ML1230|B1170_C1_180 PKS-ASSOCIATED PROTEIN A3 from Mycobacterium leprae (471 aa), FASTA scores: opt: 1660, E(): 2.7e-102, (53.95% identity in 456 aa overlap). Also similar to Q9F2R3|SCD65.19c HYPOTHETICAL 52.8 KDA PROTEIN from Streptomyces coelicolor (473 aa), FASTA scores: opt: 575, E(): 1.8e-30, (27.8% identity in 464 aa overlap); and weakly similar to part of other proteins. Also high similarity with other PKS-ASSOCIATED PROTEINS from Mycobacterium tuberculosis; O50438|PAPA3|Rv1182|MTV005.18 (472 aa), FASTA scores: opt: 1694, E(): 1.5e-104, (53.8% identity in 461 aa overlap); and O07799|PAPA1|Rv3824c|MTCY409.06 (511 aa), FASTA scores: opt: 1664, E(): 1.6e-102, (53.9% identity in 462 aa overlap); and similar to C-terminal end of O53902|PAPA4|Rv1528c|MTV045.02 (165 aa), FASTA scores: opt: 186, E(): 4.1e-05, (37.9% identity in 66 aa overlap). Protein product from Mb3850c detected using shotgun mass spectrometry and SWATH mass spectrometry." /db_xref="GOA:Q7TVL3" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/Swiss-Prot:Q7TVL3" /protein_id="SIU02479.1" /translation="MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQ AQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEFDN AEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFGIIQSDDHFTF YASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPAGRYDDHCVRQYADTAALT LDSARVRRWVEFAANNDGTLPHFPLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVA AGARFSGGVFACAALAERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVA SGLFDSAARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAP LSTVANSDLNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK SIYIRTADGTLAILKPGT" CDS 4226658..4227371 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3851" /product="PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN" /note="Mb3851, -, len: 237 aa. Equivalent to Rv3821, len: 237 aa, from Mycobacterium tuberculosis strain H37Rv, (99.2% identity in 237 aa overlap). Probable conserved integral membrane protein, equivalent to Q49630|ML1233|B1170_F2_64 HYPOTHETICAL 24.4 KDA PROTEIN /INTEGRAL MEMBRANE PROTEIN (POTENTIAL) from Mycobacterium leprae (230 aa), FASTA scores: opt: 619, E(): 2.4e-32, (46.65% identity in 240 aa overlap). Shows some similarity to P29466|I1BC_HUMAN|CASP1|IL1BC|IL1BCE (404 aa), FASTA scores: opt: 126, E(): 0.88, (29.05% identity in 155 aa overlap). Also highly similar to P71796|Rv1517|MTCY277.39 HYPOTHETICAL 26.9 KDA PROTEIN from Mycobacterium tuberculosis (254 aa), FASTA scores: opt: 284, E(): 5.4e-11, (36.35% identity in 256 aa overlap). Start site chosen on basis of similarity to LEPB1170_F2_64 and MTCY277.39, but may extend further upstream. Mb3851 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5A0" /db_xref="InterPro:IPR021315" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5A0" /protein_id="SIU02480.1" /translation="MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYT MAGGVAMVTLVVLGATPLAGHFSVAEVQIGTGLIALLIAFALTTNVIGKHVRRATHAR VGDNGGRVLRESVPPSGTHKLAVRARCFLQGDSLYVAGVSGLGAALPSANYMGAMAAI LASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTRAFMAALQSWLRSRSRRDA ALLVAAGGCLMLTLGLSNL" CDS 4227406..4228620 /codon_start=1 /transl_table=11 /gene="chp1" /locus_tag="BQ2027_MB3852" /product="FIG01386146: Possible exported protein, Rv1184c" /note="Mb3852, -, len: 404 aa. Equivalent to Rv3822, len: 404 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 404 aa overlap). Conserved hypothetical protein, similar in part to hypothetical proteins from Mycobacterium leprae: Q9CC62|ML1232 (358 aa) FASTA scores: opt: 601, E(): 1.1e-25, (36.7% identity in 335 aa overlap); and Q49633|B1170_F3_112 (391 aa) FASTA scores: opt: 601, E(): 1.2e-25, (36.25% identity in 347 aa overlap). Also similar to P71862|Rv3539|MTCY03C7.17c PPE FAMILY PROTEIN from Mycobacterium tuberculosis (479 aa), FASTA scores: opt: 547, E(): 1.3e-22, (38.1% identity in 281 aa overlap); O50440|Rv1184c|MTV005.20c (359 aa); O06828|Rv1430|MTCY493.24c (528 aa); O53642|Rv0159c|MTV032.02c (468 aa); etc. Mb3852 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013228" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D5" /protein_id="SIU02481.1" /translation="MKCPGVSDCVATVRHDNVFAIAAGLRWSAAVPPLHKGDAVTKLL VGAIAGGMLAYAAILGDGIASADTALIVPGTAPSPYGPLRSLYHFNPAMQPQIGANYY TPTATRHVVSYPGSFWPVTGLNSPTVGSSVSAGTNNLDAAIRSTDGPIFVAGLSQGTL VLDREQARLANDPTAPPPGQLTFIKAGDPNNLLWRAFRPGTHVPIIDYTVPAPVESQY DTINIVGQYDIFSDPPNRPGNLLADLNAIAAGGYYGHSATAFSDPARVAPRDITTTTN SLGATTTTYFIRTDQLPLVRALVDMAGLPPQAAGTVDAALRPIIDRAYQPGPAPAVNP RDLVQGIRGIPAIAPAIAIPIGSTTGASAATSTAAATAAATNALRGANVGPGANKALS MVRGLLPKGKKH" CDS complement(4228945..4232214) /codon_start=1 /transl_table=11 /gene="mmpL8" /locus_tag="BQ2027_MB3853C" /product="conserved integral membrane transport protein mmpl8" /note="Mb3853c, mmpL8, len: 1089 aa. Equivalent to Rv3823c, len: 1089 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 1089 aa overlap). Probable mmpL8, conserved integral membrane transport protein (see citation below), member of RND superfamily, equivalent to Q49619|MMLA_MYCLE|MMPL10|TP1|ML1231|B1170_C1 _181 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (1008 aa), FASTA scores: opt: 2718, E(): 7.3e-149, (56.25% identity in 1028 aa overlap). Also similar to others e.g. Q9XCF6|TMTPC from Mycobacterium avium (974 aa), FASTA scores: opt: 660, E(): 2.7e-30, (28.2% identity in 1050 aa overlap); Q9XCF5|TMTPB from Mycobacterium avium (963 aa), FASTA scores: opt: 653, E(): 6.7e-30, (27.0% identity in 1014 aa overlap); Q9KH53|TMTPC from Mycobacterium smegmatis (994 aa), FASTA scores: opt: 648, E(): 1.3e-29, (28.45% identity in 1013 aa overlap); etc. Also highly similar to other mmpL proteins from Mycobacterium tuberculosis; O50439|MMLA_MYCTU|MMPL10|RV1183|MT1220|MTV00 5.19 (1002 aa), FASTA scores: opt: 2777, E(): 2.9e-152, (58.25% identity in 996 aa overlap); Q50585|MMLC_MYCTU|MMPL12|Rv1522c|MT1573|MTCY19G5.06 (1146 aa), FASTA scores: opt: 2433, E(): 2.1e-132, (49.9% identity in 1050 aa overlap); and similar to others e.g. P95235|MML9_MYCTU|MMPL9|Rv2339|MT2402|MTCY98.08 (962 aa), FASTA scores: opt: 651, E(): 8.8e-30, (28.6% identity in 1038 aa overlap); etc. BELONGS TO THE MMPL FAMILY. Mb3853c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TVL0" /db_xref="InterPro:IPR000731" /db_xref="InterPro:IPR004707" /db_xref="InterPro:IPR004869" /db_xref="UniProtKB/Swiss-Prot:Q7TVL0" /protein_id="SIU02482.1" /translation="MCDVLMQPVRTPRPSTNLRSKPLRPTGDGGVFPRLGRLIVRRPW VVIAFWVALAGLLAPTVPSLDAISQRHPVAILPSDAPVLVSTRQMTAAFREAGLQSVA VVVLSDAKGLGAADERSYKELVDALRRDTRDVVMLQDFVTTPPLRELMTSKDNQAWIL PVGLPGDLGSTQSKQAYARVADIVEHQVAGSTLTANLTGPAATVADLNLTGQRDRSRI EFAITILLLVILLIIYRNPITMVLPLITIGMSVVVAQRLVAIAGLAGLGIANQSIIFM SGMMVGAGTDYAVFLISRYHDYLRQGADSDQAVKKALTSIGKVIAASAATVAITFLGM VFTQLGILKTVGPMLGISVAVVFFAAVTLLPALMVLTGRRGWIAPRRDLTRRFWRSSG VHIVRRPKTHLLASALVLVILAGCAGLARYNYDDRKTLPASVESSIGYAALDKHFPSN LIIPEYLFIQSSTDLRTPKALADLEQMVQRVSQVPGVAMVRGITRPAGRSLEQARTSW QAGEVGSKLDEGSKQIAAHTGDIDKLAGGANLMASKLGDVRAQVNRAISTVGGLIDAL AYLQDLLGGNRVLGELEGAEKLIGSMRALGDTIDADASFVANNTEWASPVLGALDSSP MCTADPACASARTELQRLVTARDDGTLAKISELARQLQATRAVQTLAATVSGLRGALA TVIRAMGSLGMSSPGGVRSKINLVNKGVNDLADGSRQLAEGVQLLVDQVKKMGFGLGE ASAFLLAMKDTATTPAMAGFYIPPELLSYATGESVKAETMPSEYRDLLGGLNVDQLKK VAAAFISPDGHSIRYLIQTDLNPFSTAAMDQIDAITAAARGAQPNTALADAKVSVVGL PVVLKDTRDYSDHDLRLIIAMTVCIVLLILIVLLRAIVAPLYLIGSVIVSYLAALGIG VIVFQFLLGQEMHWSIPGLTFVILVAVGADYNMLLISRLREEAVLGVRSGVIRTVAST GGVITAAGLIMAASMYGLVFASLGSVVQGAFVLGTGLLLDTFLVRTVTVPAIAVLVGQ ANWWLPSSWRPATWWPLGRRRGRAQRTKRKPLLPKEEEEQSPPDDDDLIGLWLHDGLR L" CDS complement(4232324..4233859) /codon_start=1 /transl_table=11 /gene="papA1" /locus_tag="BQ2027_MB3854C" /product="conserved polyketide synthase associated protein papa1" /note="Mb3854c, papA1, len: 511 aa. Equivalent to Rv3824c, len: 511 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 511 aa overlap). Possible papA1, conserved polyketide synthase (PKS) associated protein, highly similar to Q49618|PAPA3|ML1230|B1170_C1_180 PKS-ASSOCIATED PROTEIN A3 from Mycobacterium leprae (471 aa), FASTA scores: opt: 1879, E(): 7.1e-111, (55.5% identity in 465 aa overlap). Also similar to Q9F2R3|SCD65.19c HYPOTHETICAL 52.8 KDA PROTEIN from Streptomyces coelicolor (473 aa), FASTA scores: opt: 476, E(): 1.7e-22, (26.7% identity in 464 aa overlap); and similar in part to Q09164|SIMA|CYSYN CYCLOSPORIN SYNTHETASE from Tolypocladium inflatum (15281 aa) FASTA scores: opt: 238, E(): 2.8e-06, (22.35% identity in 371 aa overlap). Also highly similar to other PKS-ASSOCIATED PROTEINS from Mycobacterium tuberculosis; O50438|PAPA3|Rv1182|MTV005.18 (472 aa), FASTA scores: opt: 1862, E(): 8.4e-110, (55.95% identity in 470 aa overlap); and upstream ORF O07803|PAPA2|Rv3820c|MTCY409.10 (468 aa) FASTA scores: opt: 1664, E(): 2.5e-97, (53.9% identity in 462 aa overlap). Contains PS00453 FKBP-type peptidyl-prolyl cis-trans isomerase signature 1. Protein product from Mb3854c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3854c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TVK9" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR023213" /db_xref="UniProtKB/Swiss-Prot:Q7TVK9" /protein_id="SIU02483.1" /translation="MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPP SYVQARQIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRSWF ELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWDCFSFGVIQRA DSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPIGLSEAGSYVDFCVRQHEY TSALTVDSPEVRAWIDFAEINNGTFPEFPLPLGDPSVRCGGDLLSMMLMDEQQTQRFE SACMAANARFIGGMLACIAIAIHELTGADTYFGITPKDIRTPADLMTQGWFTGQIPVT VPVAGLSFNEIARIAQTSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVG PLSAVTKLFEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNLKVANVTVD REA" CDS complement(4233910..4240290) /codon_start=1 /transl_table=11 /gene="pks2" /locus_tag="BQ2027_MB3855C" /product="POLYKETIDE SYNTHASE PKS2" /note="Mb3855c, pks2, len: 2126 aa. Equivalent to Rv3825c, len: 2126 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 2126 aa overlap). Probable pks2, polyketide synthase (EC undetermined), equivalent to Q9CD78|MAS|ML0139 PUTATIVE MYCOCEROSIC SYNTHASE from Mycobacterium leprae (2116 aa), FASTA scores: opt: 6828, E(): 0, (63.3% identity in 2128 aa overlap); and Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE from Mycobacterium leprae (2118 aa) FASTA scores: opt: 5220, E(): 0, (62.4% identity in 2130 aa overlap); or similar in part to others from Mycobacterium leprae e.g. Q9CB70|ML2354 POLYKETIDE SYNTHASE (1822 aa) FASTA scores: opt: 2787, E(): 2.1e-145, (34.7% identity in 2135 aa overlap). Also highly similar to Q02251|MCAS_MYCBO|MAS MYCOCEROSIC ACID SYNTHASE from Mycobacterium bovis (2110 aa), FASTA scores: opt: 3495, E(): 2.6e-184, (61.65% identity in 2130 aa overlap). Also highly similar to other polyketide synthases from Mycobacterium tuberculosis e.g. O53901|PKS5|Rv1527c|MTV045.01c|MTCY19G5.01 (2108 aa) FASTA scores: opt: 9576, E(): 0, (69.8% identity in 2124 aa overlap); P96291|MAS|Rv2940c|MTCY24G1.09|MTCY19H9.08c (2111 aa), FASTA scores: opt: 3518, E(): 1.4e-185, (64.05% identity in 2126 aa overlap); O50437|PKS4|Rv1181|MTV005.17 (1582 aa), FASTA scores: opt: 3461, E(): 1.6e-182, (64.55% identity in 1609 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site and PS00012 Phosphopantetheine attachment site. Protein product from Mb3855c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3855c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVK8" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR013968" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020806" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR036736" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/Swiss-Prot:Q7TVK8" /protein_id="SIU02484.1" /translation="MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPE LLWKALLRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFFGI GEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGDYTMVAADAKQ LEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGLTAVHMACRSLHEGESDVA LAGGVALMLEPRKAAAGSALGMLSPTGRCRAFDVAADGFVSGEGCAVVVLKRLPDALA DGDRILAVIRGTSANQDGHTVNIATPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPG TPIGDPIEYASVSEVYGVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRN LHFTRLPDEIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLSDLAYTLAR RRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDRGPVWLFSGQGSQWAAM GADLLTNESVFAATVAELEPLIAAESGFSVTEAMTAPETVTGIDRVQPTIFAMQVALA ATMAAYGVRPGAVIGHSMGESAAAVVAGVLSAEDGVRVICRRSKLMATIAGSAAMASV ELPALAVQSELTALGIDDVVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAV DVASHSPQVDPILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHT VRFSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQPLPLGL RRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREGVDNRSPGGSTVAVH PLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIHNVAALPGAAYCEMALSAARAVL GEQSEVRDMRFEAMLLLDDQTPVSTVATVTSPGVVDFAVEALQEGVGHHLRRASAVLQ QVSGECEPPAYDMASLLEAHPCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATA TMLAEVALPGSIRSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAY APVRTARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHNRVLN ERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFREHSAACTTMRWPL HDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGSADRGAEYVRRLVGIARELSD LPGAVPRMYVVTRGAQRVLADDCVNLEQGGLRGLLRTIGAEHPHLRATQIDVDEQTGV EQLARQLLATSEEDETAWRDNEWYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPG DMQTIELAAFHRVPPGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGV VTAVGPGVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAHAT AWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTPQRRELLRNMG IEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAGLKLLAFRGRFVEIGKRDI YGDTKLGLFPFRRNLSFYAVDLGLLSATHPEELRDLLGTVYRLTAAGELPMPQSTHYP LVEAATAIRVMGNAEHTGKLVLHIPQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLG LFLAEKMAAAGCGRIVLNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLV ATAVATGLPVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSDIGQLGWWS ASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPVIGAPWLVAFAERSRFF EVFSSSNGSGTSKFRVELNELPRDEWPARLRQLVAEQVSLILRRTVDPDRPLPEYGLD SLGALELRTRIETETGIRLAPKNVSATVRGLADHLYEQLAPDDAPAAALSSQ" CDS 4240497..4242251 /codon_start=1 /transl_table=11 /gene="fadD23" /locus_tag="BQ2027_MB3856" /product="probable fatty-acid-amp ligase fadd23 (fatty-acid-amp synthetase) (fatty-acid-amp synthase)" /note="Mb3856, fadD23, len: 584 aa. Equivalent to Rv3826, len: 584 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 584 aa overlap). Probable fadD23, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to P71495 ACYL-COA SYNTHASE from Mycobacterium bovis (582 aa), FASTA scores: opt: 2571, E(): 4.4e-146, (66.15% identity in 576 aa overlap); Q9CD79|FADD28|ML0138 ACYL-COA SYNTHETASE from Mycobacterium leprae (579 aa) FASTA scores: opt: 2520, E(): 4.9e-143, (65.2% identity in 575 aa overlap); P54200|FD21_MYCLE PUTATIVE FATTY-ACID--COA LIGASE (ACYL-COA SYNTHETASE) from Mycobacterium leprae (579 aa), FASTA scores: opt: 2330, E(): 1.1e-131, (60.2% identity in 578 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. P96290|FADD28|Rv2941|MTCY24G1.08c (580 aa), FASTA scores: opt: 2587, E(): 4.9e-147, (66.5% identity in 576 aa overlap); O53903|FADD24|Rv1529|MTV045.03 (584 aa), FASTA scores: opt: 2457, E(): 2.9e-139, (63.35% identity in 584 aa overlap); Q50586|FADD25|Rv1521|MT1572|MTCY19G5.07 (583 aa) FASTA scores: opt: 2389, E(): 3.3e-135, (61.45% identity in 581 aa overlap); etc. Protein product from Mb3856 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3856 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVK7" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR040097" /db_xref="InterPro:IPR042099" /db_xref="UniProtKB/Swiss-Prot:Q7TVK7" /protein_id="SIU02485.1" /translation="MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQV YRRTLNVAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGASDE RVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLDSPIRSNIVDD SLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYFADTGAVPPLDLFIMSWLP FYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQRPARWLQLMAREGQAFSAAPNFAFELT AAKAIDDDLAGLDLGRIKTILCGSERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEA TVYVATSQAGQPPEIRYFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTN TECPPGTIGEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVEKLVAIVEL NNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIPITTSGKVRRAECVKLY RHNEFTRLDAKPLQASDL" mobile_element complement(4242228..4244100) /mobile_element_type="insertion sequence:IS1537" /locus_tag="BQ2027_IS1537" /note="IS1537, len: 1873 nt. Equivalent to IS1537, len: 1873 nt, from Mycobacterium tuberculosis strain H37Rv,(99.9% identity in 1873 nt overlap)." gene complement(4242228..4244100) /locus_tag="BQ2027_IS1537" CDS complement(4242248..4243474) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3857C" /product="possible transposase" /note="Mb3857c, -, len: 408 aa. Equivalent to Rv3827c, len: 408 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 408 aa overlap). Possible transposase within IS1537 element, similar to several transposases e.g. O83029|TNPC|DR2324|DR0666|DR0978|DR1381|DR1651|DR1933 TRANSPOSASE from Deinococcus radiodurans(408 aa) FASTA scores: opt: 302, E(): 3.9e-12, (30.75% identity in 358 aa overlap); Q9RXX7|DR0178 PUTATIVE TRANSPOSASE from Deinococcus radiodurans (409 aa), FASTA scores: opt: 297, E(): 8.2e-12, (31.1% identity in 360 aa overlap); P73816|SLR2062 TRANSPOSASE from Synechocystis sp. strain PCC 6803 (400 aa), FASTA scores: opt: 296, E(): 9.3e-12, (30.05% identity in 353 aa overlap); etc. Highly similar to proteins from Mycobacterium tuberculosis e.g. O33333|Rv2791c|MTV002.56c TRANSPOSASE (459 aa) FASTA scores: opt: 2211, E(): 9.4e-136, (87.75% identity in 367 aa overlap); P95117|Rv2978c|MTCY349.09 HYPOTHETICAL 51.4 KDA PROTEIN (459 aa), FASTA scores: opt: 2165, E(): 9e-133, (85.85% identity in 367 aa overlap); Q10809|YS85_MYCTU|Rv2885c|MT2953|MTCY274.16c HYPOTHETICAL 51.3 KDA PROTEIN (460 aa), FASTA scores: opt: 2127, E(): 2.6e-130, (83.95% identity in 368 aa overlap); O0777|Rv0606|MTCY19H5.16c PROBABLE TRANSPOSASE (FRAGMENT) (247 aa), FASTA scores: opt: 1405, E(): 9.3e-84, (85.3% identity in 238 aa overlap); etc. Mb3857c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR001959" /db_xref="InterPro:IPR021027" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M6" /protein_id="SIU02486.1" /translation="MMARFEVPEGWCVQAFRFTLDPTEDQARALARHFGARRKAYNWA VATLKADIEAWRVTGIGTVKPSLRVLRKRWNTVKDEVCVNAETGAVWWPECSKEAYAD GIGGAVDAYWNWQNSRSGKREGKTMGFPRFKKKGRDQDRVTFTTGAMRVEPDRRHLTL PVVGTVRTHENTRRIERLIATGRARVLAISVRRNGTRLDASVRVLVQRPQQPNVAQPG SRVGVDVGVRRLATVANEAGAVLEEVPNPRPLDAALKELRYASRARSRCTKGSRRYRE RTTEISRLHRRVNDVRTHHLHVLTTRLAQTHGHIVVEGLDAAGMLRQKGLPGARARRR GLSDSALGTPRRHLSYKTGWYGSALVVADRWFPSLSVEPTVRPGLARLVAVKRGREAA AWLPNNPETGCKSRDH" CDS complement(4243471..4244082) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3858C" /product="POSSIBLE RESOLVASE" /note="Mb3858c, -, len: 203 aa. Equivalent to Rv3828c, len 203 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 203 aa overlap). Possible resolvase within IS1537 element, similar to others e.g. Q97X40|SSO1915 FIRST ORF IN TRANSPOSON ISC1913 from Sulfolobus solfataricus (213 aa), FASTA scores: opt: 275, E(): 1.6e-11, (30.6% identity in 196 aa overlap); Q9V1M0|PAB2076 RESOLVASE RELATED PROTEIN from Pyrococcus abyssi (212 aa), FASTA scores: opt: 254, E(): 4.2e-10, (29.95% identity in 197 aa overlap); Q9RMU7|ORFA PUTATIVE TRANSPOSASE (BELONGS TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS) from elicobacter pylori (Campylobacter pylori) (217 aa), FASTA scores: opt: 243, E(): 2.3e-09, (31.8% identity in 154 aa overlap); etc. Also highly similar to proteins from Mycobacterium tuberculosis e.g. O33334|Rv2792c|MTV002.57c RESOLVASE (193 aa), FASTA scores: opt: 970, E(): 1.5e-58, (79.25% identity in 193 aa overlap); O07773|Rv0605|MTCY19H5.17c PUTATIVE RESOLVASE (202 aa), FASTA scores: opt: 964, E(): 4e-58, (76.25% identity in 202 aa overlap); P95116|Rv2979c|MTCY349.08 HYPOTHETICAL 21.4 KDA PROTEIN (194 aa), FASTA scores: opt: 895, E(): 1.8e-53, (74.75% identity in 194 aa overlap); Q10831|YS86_MYCTU|Rv2886c|MT2954|MTCY274.17c HYPOTHETICAL 31.9 KDA PROTEIN (295 aa), FASTA scores: opt: 826, E(): 1.1e-48, (66.2% identity in 204 aa overlap) (similarity only at C-terminus); etc. Contains PS00397 Site-specific recombinases active site. Possible helix-turn-helix motif from aa 11-32, Score 1305 (+3.63 SD)." /db_xref="GOA:A0A1R3Y5C9" /db_xref="InterPro:IPR006118" /db_xref="InterPro:IPR006119" /db_xref="InterPro:IPR036162" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C9" /protein_id="SIU02487.1" /translation="MSVVCCRNRWMNLAVWAERNGVAWVIAYRWFRAGLLPVPAQRVG RLILVNDPAVEESGRGRTLVYARVSSADQRSDLDRRVARVTAWATSQHLSVDKVVAEG GWALNGHRRKFFALLGDPVVTRIVVEHRDRFCWFGSEYVEAALVAQGRELVVVDLAEV DDDLVGDMTEILTSMCARLYGERAAQNGAKRALAAAVGDAEAA" CDS complement(4244083..4245693) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3859C" /product="probable dehydrogenase" /note="Mb3859c, -, len: 536 aa. Equivalent to Rv3829c, len 536 aa, from Mycobacetrium tuberculosis strain H37Rv, (99.8% identity in 536 aa overlap). Probable oxidoreductase dehydrogenase (EC 1.-.-.-), similar to others e.g. Q9A3T1|CC3121 PHYTOENE DEHYDROGENASE-RELATED PROTEIN from Caulobacter crescentus (543 aa), FASTA scores: opt: 607, E(): 9.2e-28, (28.25% identity in 552 aa overlap); Q98FP6|MLR3676 PHYTOENE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (521 aa), FASTA scores: opt: 605, E(): 1.2e-27, (28.2% identity in 546 aa overlap); Q97W24|SSO2422 PHYTOENE DEHYDROGENASE RELATED PROTEIN from Sulfolobus solfataricus (518 aa), FASTA scores: opt: 388, E(): 4.4e-15, (27.35% identity in 530 aa overlap); Q98BS8|MLL5443 PROBABLE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (524 aa), FASTA scores: opt: 374, E(): 2.9e-14, (24.35% identity in aa overlap); etc. Also similar to MTCY493.22c|Rv1432|MTCY493.22c HYPOTHETICAL 50.5 KDA PROTEIN (probable dehydrogenase) from Mycobacterium tuberculosis (25.1% identity in 295 aa overlap). Mb3859c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D4" /protein_id="SIU02488.1" /translation="MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMAST VELFDGYRFDIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFTDP TKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEMYACATNEFER SAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTLYRGPATPGSAAALAFGLG VPEGDFVRWKKLRGGIGALTTHLSQLLERTGGEVRLRSKVTEIVVDNSRSSARVRGVR TAAGDTLTSPIVVSAIAPDVTINELIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQP PAFAAPYQALNDPSMQASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSL APAGKQAASAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGSAGCHGGPG ITFIPGYNAARQALADRRAANCCVLSGR" CDS complement(4245741..4246370) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3860C" /product="TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY TETR-FAMILY)" /note="Mb3860c, -, len: 209 aa. Equivalent to Rv3830c, len: 209 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 209 aa overlap). Probable transcriptional regulator tetR family, similar to others e.g. P39885|TCMR_STRGA TETRACENOMYCIN C TRANSCRIPTIONAL REPRESSOR from Streptomyces glaucescens (226 aa) FASTA scores: opt: 255, E(): 6.1e-10, (33.65% identity in 202 aa overlap); Q9RDR0|SC4A7.02 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (227 aa) FASTA scores: opt: 230, E(): 2.8e-08, (30.05% identity in 213 aa overlap); Q9EWU3|3SC5B7.06 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 221, E(): 1.2e-07, (32.05% identity in 181 aa overlap); Q9AJ68|BUTR PUTATIVE TRANSCRIPTIONAL REPRESSOR from Streptomyces cinnamonensis (268 aa), FASTA scores: opt: 216, E(): 2.7e-07, (37.8% identity in 119 aa overlap); etc. Contains possible helix-turn-helix motif from aa 33-54, Score 1699 (+4.97 SD). SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3860c detected using SWATH mass spectrometry. Mb3860c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5C7" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR023772" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5C7" /protein_id="SIU02489.1" /translation="MVRPPQTARSERTREALRQAALVRFLAQGVEATSAEQIAEDAGV SLRTFYRHFRSKHDLLFADYDAGLHWFRAALDARPADESIIDSVQAAIFSFPYDVDAV TKIASLRRGELEPSRIVRHMREVEADFADAIQAQLRRRNCDIAGAPDARLHIAVTARC VAAAVFGAMEAWMLGSDRSLGELARVCHVALESLRVGISDTWTTLTVSS" CDS 4246442..4246924 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3861" /product="HYPOTHETICAL PROTEIN" /note="Mb3861, -, len: 160 aa. Equivalent to Rv3831, len: 160 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 160 aa overlap). Hypothetical unknown protein. Mb3861 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5D2" /db_xref="InterPro:IPR021362" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D2" /protein_id="SIU02490.1" /translation="MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYV VGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIAN VILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA " CDS complement(4246921..4247496) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3862C" /product="SAM-dependent methyltransferases" /note="Mb3862c, -, len: 191 aa. Equivalent to Rv3832c, len: 191 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 191 aa overlap). Conserved hypothetical protein, similar in part to various proteins e.g. Q9XBC9|CZA382.22c PUTATIVE RRNA METHYLASE from Amycolatopsis orientalis (259 aa), FASTA scores: opt: 196, E(): 1.3e-05, (38.2% identity in 110 aa overlap); CAC48459|SMB20059 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (259 aa), FASTA scores: opt: 188, E(): 4.3e-05, (33.8% identity in 136 aa overlap); Q98FP8|MLL3672 METHYL TRANSFERASE-LIKE PROTEIN from Rhizobium loti (Mesorhizobium loti) (264 aa), FASTA scores: opt: 180, E(): 0.00014, (32.05% identity in 156 aa overlap); etc. Protein product from Mb3862c detected using SWATH mass spectrometry." /db_xref="GOA:A0A1R3Y5D9" /db_xref="InterPro:IPR013216" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D9" /protein_id="SIU02491.1" /translation="MAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPG YGATLQALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDHFTSVVC FTMLHHVASAQLQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLIHIADTYTPIAPADL PGRLRAVGFTDIHVDVAGARLRWRATKPVAA" CDS 4247552..4248343 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3863" /product="TRANSCRIPTIONAL REGULATORY PROTEIN (PROBABLY ARAC-FAMILY)" /note="Mb3863, -, len: 263 aa. Equivalent to Rv3833, len: 263 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 263 aa overlap). Probable transcriptional regulator belonging to araC family, similar to others e.g. Q9KYN4|SC9H11.05 PUTATIVE ARAC-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (289 aa), FASTA scores: opt: 754, E(): 1.2e-42, (50.45% identity in 232 aa overlap); Q9HXH2|PA3830 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (270 aa), FASTA scores: opt: 501, E(): 6.2e-26, (34.85% identity in 238 aa overlap); Q9HX87|PA3927 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (262 aa), FASTA scores: opt: 496, E(): 1.3e-25, (36.45% identity in 266 aa overlap); P76241|YEAM_ECOLI|B1790 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli strain K12 (273 aa) FASTA scores: opt: 388, E(): 1.9e-18, (30.5% identity in 223 aa overlap); etc. Contains probable helix-turn-helix motif from aa 164-185, Score 2014 (+6.05 SD). SEEMS TO BELONG TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. Mb3863 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5E1" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR011051" /db_xref="InterPro:IPR013096" /db_xref="InterPro:IPR014710" /db_xref="InterPro:IPR018060" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E1" /protein_id="SIU02492.1" /translation="MSENSHHRLATTSLTLPPGARIERHRHPSHQIVYPSAGAVSVTT HAGTWITPVNRAIWIPAGCWHQHKFHGHTQFHGVALDPQRYRGGPATPTVLAVNPLMR ELIIACSQADRTDTDEHHRMLAVLQDQLPTTSIREPLWVPSPTDRRLRHACALIADNL TQPLTLQQIGGRIGVSQRTLSRLFSDELGMTFPQWRTQLRLQHALVLLAERHDVTSVA SECGWATPSAFIDTYRQAFGHTPGQAAKPMAATRLTRLRRARDRR" CDS complement(4248340..4249599) /codon_start=1 /transl_table=11 /gene="serS" /locus_tag="BQ2027_MB3864C" /product="SERYL-TRNA SYNTHETASE SERS (SERINE--TRNA LIGASE) (SERRS) (SERINE TRANSLASE)" /note="Mb3864c, serS, len: 419 aa. Equivalent to Rv3834c, len: 419 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 419 aa overlap). Probable serS, seryl-tRNA synthetase (EC 6.1.1.11), equivalent to Q9CDC1|SERS|ML0082 PUTATIVE SERYL-TRNA SYNTHASE from Mycobacterium leprae (417 aa), FASTA scores: opt: 2361, E(): 8.5e-138, (85.8% identity in 416 aa overlap). Also highly similar many e.g. Q9ZBX1|SYS_STRCO|SERS|SCD78.28c from Streptomyces coelicolor (425 aa), FASTA scores: opt: 1594, E(): 1.2e-90, (59.75% identity in 425 aa overlap); Q9X199|SYS_THEMA|SERS|TM1379 from Thermotoga maritima (425 aa), FASTA scores: opt: 1083, E(): 3.3e-59, (43.3% identity in 425 aa overlap); P37464|SYS_BACSU|SERS from Bacillus subtilis (425 aa), FASTA scores: opt: 1015, E(): 5e-55, (39.3% identity in 425 aa overlap); etc. Contains PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1. BELONGS TO CLASS-II AMINOACYL-TRNA SYNTHETASE FAMILY. Protein product from Mb3864c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3864c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67562" /db_xref="InterPro:IPR002314" /db_xref="InterPro:IPR002317" /db_xref="InterPro:IPR006195" /db_xref="InterPro:IPR010978" /db_xref="InterPro:IPR015866" /db_xref="InterPro:IPR033729" /db_xref="InterPro:IPR042103" /db_xref="UniProtKB/Swiss-Prot:P67562" /protein_id="SIU02493.1" /translation="MIDLKLLRENPDAVRRSQLSRGEDPALVDALLTADAARRAVIST ADSLRAEQKAASKSVGGASPEERPPLLRRAKELAEQVKAAEADEVEAEAAFTAAHLAI SNVIVDGVPAGGEDDYAVLDVVGEPSYLENPKDHLELGESLGLIDMQRGAKVSGSRFY FLTGRGALLQLGLLQLALKLAVDNGFVPTIPPVLVRPEVMVGTGFLGAHAEEVYRVEG DGLYLVGTSEVPLAGYHSGEILDLSRGPLRYAGWSSCFRREAGSHGKDTRGIIRVHQF DKVEGFVYCTPADAEHEHERLLGWQRQMLARIEVPYRVIDVAAGDLGSSAARKFDCEA WIPTQGAYRELTSTSNCTTFQARRLATRYRDASGKPQIAATLNGTLATTRWLVAILEN HQRPDGSVRVPDALVPFVGVEVLEPVA" CDS 4249732..4251081 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3865" /product="conserved membrane protein" /note="Mb3865, -, len: 449 aa. Equivalent to Rv3835, len: 449 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 449 aa overlap). Probable conserved membrane protein, equivalent to Q9CDC2|ML0081 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (450 aa), FASTA scores: opt: 2079, E(): 1.8e-74, (69.35% identity in 457 aa overlap). Protein product from Mb3865 detected using SWATH mass spectrometry. Mb3865 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5E8" /db_xref="InterPro:IPR026004" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E8" /protein_id="SIU02494.1" /translation="MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRA LLLTALGGLLIAGLVTAIPAVGRAPERLAGYIASNPVPSTGAKINASFNRVASGDCLM WPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARIQQISEEQCEA AVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGLQSPGPNNQQLAFKGKVAD IDQSKVWPAGTCLGIDATTNQPIDVPVDCAAPHAMEVSGTVNLAERFPDALPSEPEQD GFIKDACTRMTDAYLAPLKLRTTTLTLIYPTLTLPSWSAGSRVVACSIGATLGNGGWA TLVNSAKGALLINGQPPVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQH LPAQQPVVTPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG" CDS 4251086..4251436 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3866" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3866, -, len: 116 aa. Similar to Rv3836, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Conserved hypothetical protein, highly similar to Q9RKJ2|SCD25.30 HYPOTHETICAL 13.1 KDA PROTEIN from Streptomyces coelicolor (116 aa), FASTA scores: opt: 395, E(): 3.3e-19, (54.4% identity in 114 aa overlap); and similar to CAC47753|SMC0379 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (144 aa) FASTA scores: opt: 194, E(): 6e-06, (33.05% identity in 109 aa overlap); and Q98E37|MLL4425 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (201 aa), FASTA scores: opt: 184, E(): 3.7e-05, (29.75% identity in 121 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base insertion (*-g) introducing a premature stop codon, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (116 aa versus 137 aa). Protein product from Mb3866 detected using SWATH mass spectrometry. Mb3866 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010428" /db_xref="InterPro:IPR038555" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6V4" /protein_id="SIU02495.1" /translation="MTVRMDPQRFDELVSDALDLIPPELADAMDNVVVLVANRHPQHE NLLGQYEGVALTERGSDYAGSLPDAITIYREALLDACDSEDEVVDQVAITVIHEVAHH FGIDDERLDQLGWA" CDS complement(4251695..4252393) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3867C" /product="PROBABLE PHOSPHOGLYCERATE MUTASE (PHOSPHOGLYCEROMUTASE) (PHOSPHOGLYCERATE PHOSPHOMUTASE)" /note="Mb3867c, -, len: 232 aa. Equivalent to Rv3837c, len: 232 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 232 aa overlap). Probable phosphoglycerate mutase (EC 5.4.2.-), equivalent to Q9CDC3|ML0079 PUTATIVE PHOSPHOGLYCERATE MUTASE from Mycobacterium leprae (231 aa), FASTA scores: opt: 1116, E(): 7.3e-66, (71.55% identity in 232 aa overlap). Also similar to others e.g. Q9ZAX0|PGM 2,3-PDG DEPENDENT PHOSPHOGLYCERATE MUTASE from Amycolatopsis methanolica (205 aa), FASTA scores: opt: 474, E(): 6.4e-24, (41.85% identity in 203 aa overlap); Q9F3Q7|SC10F4.03 PUTATIVE ISOMERASE from Streptomyces coelicolor (224 aa) FASTA scores: opt: 349, E(): 1e-15, (33.2% identity in 223 aa overlap); Q9RDL0|SCC123.14c PUTATIVE PHOSPHOGLYCERATE MUTASE from Streptomyces coelicolor (223 aa), FASTA scores: opt: 256, E(): 1.2e-09, (34.0% identity in 203 aa overlap); Q9RVD2|DR1097 PUTATIVE PHOSPHOGLYCERATE MUTASE from Deinococcus radiodurans (232 aa), FASTA scores: opt: 201, E(): 5.1e-06, (31.45% identity in 175 aa overlap); etc. Also similar to P71724|Rv2419c|MTCY428.28|MTCY253.01 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium tuberculosis (223 aa), FASTA scores: opt: 210, E(): 1.3e-06, (32.0% identity in 172 aa overlap). Contains PS00175 Phosphoglycerate mutase family phosphohistidine signature. Protein product from Mb3867c detected using SWATH mass spectrometry. Mb3867c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR013078" /db_xref="InterPro:IPR029033" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N0" /protein_id="SIU02496.1" /translation="MSGRLVLLRHGQSYGNVERRLDTLPPGTALTPLGRDQARAFARS GCRRPALLAHSVAIRAYQTAAVVAAELDMVAHEVAGIHEVQVGELENRNDDEAVAEFN ATYSRWHRGELDVPLPGGETANDVLDRYLPVLADLRMRYLDDGDWDGDIVVVSHSAAI RLAAAVLAGVDGNFVLDNHLENVESVVLAPITDGRWSCVQWGLRKPPFCPDPAEAAAS PVTHAVTSSTDPMG" CDS complement(4252390..4253355) /codon_start=1 /transl_table=11 /gene="pheA" /locus_tag="BQ2027_MB3868C" /product="prephenate dehydratase phea" /note="Mb3868c, pheA, len: 321 aa. Equivalent to Rv3838c, len: 321 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 321 aa overlap). Possible pheA, prephenate dehydratase (EC 4.2.1.51) (see citation below), equivalent to Q9CDC4|PHEA|ML0078 PUTATIVE PREPHENATE DEHYDRATASE from Mycobacterium leprae (322 aa), FASTA scores: opt: 1690, E(): 1.3e-93, (84.25% identity in 311 aa overlap). Also highly similar to others e.g. P10341|PHEA_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (315 aa), FASTA scores: opt: 843, E(): 4e-43, (45.8% identity in 308 aa overlap); Q9ZBX0|SCD78.29c from Streptomyces coelicolor (310 aa), FASTA scores: opt: 820, E(): 9.2e-42, (46.45% identity in 312 aa overlap); Q44104|PHEA_AMYME|PDT from Amycolatopsis methanolica (304 aa), FASTA scores: opt: 707, E(): 4.9e-35, (45.7% identity in 313 aa overlap); etc. Contains PS00858 Prephenate dehydratase signature 2. Protein product from Mb3868c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3868c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TVJ6" /db_xref="InterPro:IPR001086" /db_xref="InterPro:IPR002912" /db_xref="InterPro:IPR008242" /db_xref="InterPro:IPR018528" /db_xref="UniProtKB/Swiss-Prot:Q7TVJ6" /protein_id="SIU02497.1" /translation="MVRIAYLGPEGTFTEAALVRMVAAGLVPETGPDALQRMPVESAP AALAAVRDGGADYACVPIENSIDGSVLPTLDSLAIGVRLQVFAETTLDVTFSIVVKPG RNAADVRTLAAFPVAAAQVRQWLAAHLPAADLRPAYSNADAARQVADGLVDAAVTSPL AAARWGLAALADGVVDESNARTRFVLVGRPGPPPARTGADRTSAVLRIDNQPGALVAA LAEFGIRGIDLTRIESRPTRTELGTYLFFVDCVGHIDDEAVAEALKAVHRRCADVRYL GSWPTGPAAGAQPPLVDEASRWLARLRAGKPEQTLVRPDDQGAQA" CDS 4253451..4254227 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3869" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3869, -, len: 258 aa. Equivalent to Rv3839, len: 258 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 258 aa overlap). Conserved hypothetical protein, similar in part to Q9RD78|SCF43.10cfrom HYPOTHETICAL 25.8 KDA PROTEIN Streptomyces coelicolor (241 aa), FASTA scores: opt: 270, E(): 3.2e-10, (33.45% identity in 272 aa overlap); and O00320|F25451_2 HYPOTHETICAL PROTEIN from Homo sapiens (Human) (339 aa), FASTA scores: opt: 126, E(): 0.77, (28.75% identity in 240 aa overlap)." /db_xref="InterPro:IPR037119" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F0" /protein_id="SIU02498.1" /translation="MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLY DGSFAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETL DLIATDNPNPALLQVETPRSGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLA ARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEAR DGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRNGLRARR" CDS 4254253..4254666 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3870" /product="POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN" /note="Mb3870, -, len: 137 aa. Equivalent to Rv3840, len: 137 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 137 aa overlap). Possible transcriptional regulator, highly similar in part to PSR PROTEINS (PENICILLIN BINDING PROTEIN REPRESSORS) e.g. Q47828|PSR PSR PROTEIN from Enterococcus hirae (293 aa) FASTA scores: opt: 221, E(): 2.2e-07, (41.65% identity in 108 aa overlap); O86213|PSRFM PSRFM PROTEIN (FRAGMENT) from Enterococcus hirae (171 aa), FASTA scores: opt: 202, E(): 2.4e-06, (40.75% identity in 108 aa overlap); Q47865|PSR PENICILLIN BINDING PROTEIN REPRESSOR from Enterococcus hirae (148 aa), FASTA scores: opt: 201, E(): 2.5e-06, (51.65% identity in 60 aa overlap); etc. Also highly similar in part to other transcriptional regulators e.g. BAB57524|MSRR PEPTIDE METHIONINE SULFOXIDE REDUCTASE REGULATOR from Staphylococcus aureus subsp. aureus Mu50 (327 aa), FASTA scores: opt: 195, E(): 1.2e-05, (36.7% identity in 109 aa overlap); Q99Q02|MSRR|SA1195 PEPTIDE METHIONINE SULFOXIDE REDUCTASE REGULATOR from Staphylococcus aureus subsp. aureus N315, and Staphylococcus aureus (327 aa), FASTA scores: opt: 192, E(): 1.9e-05, (36.7% identity in 109 aa overlap); Q9K6Q8|LYTR|BH3670 ATTENUATOR FOR LYTABC AND LYTR EXPRESSION from Bacillus halodurans (304 aa), FASTA scores: opt: 171, E(): 0.00041, (34.5% identity in 113 aa overlap); etc. Mb3870 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR004474" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5D3" /protein_id="SIU02499.1" /translation="MAGCIQRFSHVRCLGPGLASDNPTTLISIPRDSYVPIPGHGRDK INAAFALGGGRLLTQTVELATGLHLDHYAEVGFSEFADLVDAFDPLAGVDLPAGCQTL DGRAALGYVRTRATPRADLEGSDVPVPAAAFETQP" mobile_element 4254615..4255644 /mobile_element_type="insertion sequence:IS1608" /locus_tag="BQ2027_IS1608'-2" /note="IS1608'-2, len: 1030 nt. Equivalent to IS1608',len: 489 nt, from Mycobacterium tuberculosis strain H37Rv,(100.0% identity in 489 nt overlap). At a different location in from Mycobacterium tuberculosis strain H37Rv" CDS 4254864..4255409 /codon_start=1 /transl_table=11 /gene="bfrB" /locus_tag="BQ2027_MB3871" /product="bacterioferritin bfrb" /note="Mb3871, bfrB, len: 181 aa. Equivalent to Rv3841, len: 181 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 181 aa overlap). Possible bfrB, bacterioferritin, similar to other ferritin or hypothetical proteins e.g. O26261|MTH158|RSGA FERRITIN LIKE PROTEIN from Methanothermobacter thermautotrophicus (171 aa), FASTA scores: opt: 277, E(): 6.6e-11, (30.1% identity in 166 aa overlap); Q99SZ3|SA1709 HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus N315 (166 aa), FASTA scores: opt: 275, E(): 8.7e-11, (33.35% identity in 156 aa overlap); Q9X0L2|TM1128 FERRITIN from Thermotoga maritima (164 aa), FASTA scores: opt: 247, E(): 5.3e-09, (25.65% identity in 156 aa overlap); Q9KDT7|BH1124 FERRITIN from Bacillus halodurans (169 aa), FASTA scores: opt: 246, E(): 6.3e-09, (28.95% identity in 152 aa overlap); O29424|AF0834 PUTATIVE FERRITIN from Archaeoglobus fulgidu (169 aa), FASTA scores: opt: 246, E(): 6.3e-09, (28.95% identity in 152 aa overlap); etc. Also shows similarity with Rv1876|MTCY180.42|BFRA PROBABLE BACTERIOFERRITIN from Mycobacterium tuberculosis (159 aa). SEEMS BELONG TO THE BACTERIOFERRITIN FAMILY. Protein product from Mb3871 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3871 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5E2" /db_xref="InterPro:IPR001519" /db_xref="InterPro:IPR008331" /db_xref="InterPro:IPR009040" /db_xref="InterPro:IPR009078" /db_xref="InterPro:IPR012347" /db_xref="InterPro:IPR041719" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E2" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02500.1" /translation="MTEYEGPKTKFHALMQEQIHNEFTAAQQYVAIAVYFDSEDLPQL AKHFYSQAVEERNHAMMLVQHLLDRDLRVEIPGVDTVRNQFDRPREALALALDQERTV TDQVGRLTAVARDEGDFLGEQFMQWFLQEQIEEVALMATLVRVADRAGANLFELENFV AREVDVAPAASGAPHAAGGRL" CDS complement(4255424..4256248) /codon_start=1 /transl_table=11 /gene="glpQ1" /locus_tag="BQ2027_MB3872C" /product="PROBABLE GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE GLPQ1 (GLYCEROPHOSPHODIESTER PHOSPHODIESTERASE)" /note="Mb3872c, glpQ1, len: 274 aa. Equivalent to Rv3842c, len: 274 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 274 aa overlap). Probable glpQ1, glycerophosphoryl diester phosphodiesterase (EC 3.1.4.46), equivalent to Q9CDC5|GLPQ|ML0074 PUTATIVE GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Mycobacterium leprae (271 aa), FASTA scores: opt: 1635, E(): 1.9e-100, (88.85% identity in 269 aa overlap). Also highly similar to others e.g. CAC44700|SCBAC25E3.13c PUTATIVE PHOSPHODIESTERASE from Streptomyces coelicolor (275 aa), FASTA scores: opt: 413, E(): 5.7e-20, (48.05% identity in 258 aa overlap); P37965|GLPQ_BACSU GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Bacillus subtilis (293 aa), FASTA scores: opt: 405, E(): 2e-19, (31.3% identity in 249 aa overlap); Q99VC9|GLPQ|SA0820 GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Staphylococcus aureus subsp. aureus N315 (309 aa) FASTA scores: opt: 341, E(): 3.5e-15, (29.3% identity in 273 aa overlap); etc. Protein product from Mb3872c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3872c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5F5" /db_xref="InterPro:IPR017946" /db_xref="InterPro:IPR030395" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F5" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02501.1" /translation="MTWADEVLAGHPFVVAHRGASAARPEHTLAAYDLALKEGADGVE CDVRLTRDGHLVCVHDRRLDRTSTGAGLVSTMTLAQLRELEYGAWHDSWRPDGSHGDT SLLTLDALVSLVLDWHRPVKIFVETKHPVRYGSLVENKLLALLHRFGIAAPASADRSR AVVMSFSAAAVWRIRRAAPLLPTVLLGKTPRYLTSSAATAVGATAVGPSLPALKEYPQ LVDRSAAQGRAVYCWNVDEYEDIDFCREVGVAWIGTHHPGRTKAWLEDGRANGTTR" CDS complement(4256254..4257282) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3873C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3873c, -, len: 342 aa. Equivalent to Rv3843c, len: 342 aa, from Mycobacterium tuberculosis strain H37Rv, (99.4% identity in 342 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CDC6|ML0073 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (344 aa), FASTA scores: opt: 1420, E(): 2.6e-68, (63.05% identity in 349 aa overlap). Protein product from Mb3873c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3873c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5E4" /db_xref="InterPro:IPR025565" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E4" /protein_id="SIU02502.1" /translation="MIQVCSQCGTRWNVRERQRVWCPRCRGMLLAPLADMPAEARWRT PARPQVPTASDTRRTPPRLPPGFRWIAVRPGAAPPPRHGPRLRGPTPHYAGIPRWGLT DHVDQAPVPASAKAGPSPAAVRTTLLVSLLVFSIAVVVFVVRYVLLVINRNTLLNSVV ASASVWLGVLVSLAAIAAAGTTIVLLVRWLVARRAAAFMHQGLPERRSARELWAGCLL PMVNLLWAPLYVIELALVEDRYTRLRRPIVVWWIVWIVSNAISMFAFATSWVTDAQGI ANNTTMMVLAYLCAAAAVAAAARVFEGFEQKPVERPAHRWVVVNTDGRSAPASSVAVE LDGQEPAA" gene 4259027..4260056 /locus_tag="BQ2027_IS1608'-2" CDS 4259460..4259951 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3874" /product="possible transposase" /note="Mb3874, -, len: 163 aa. Equivalent to Rv3844, len: 163 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 163 aa overlap). Possible transposase, identical to P96234|Rv3348|MTV004.04 PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis. Also some similarity with others e.g. N-terminal part of P19834|YI11_STRCL INSERTION ELEMENT IS116 HYPOTHETICAL 44.8 KDA PROTEIN from Streptomyces clavuligerus (399 aa) FASTA scores: opt: 146, E(): 0.017, (29.1% identity in 158 aa overlap). Mb3874 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A0G2QBZ4" /db_xref="InterPro:IPR002525" /db_xref="UniProtKB/TrEMBL:A0A0G2QBZ4" /protein_id="SIU02503.1" /translation="MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPT LAGLRTLTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGAIV GKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVIDANRSWRRLMS LAR" CDS 4259966..4260325 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3875" /product="HYPOTHETICAL PROTEIN" /note="Mb3875, -, len: 119 aa. Equivalent to Rv3845, len: 119 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 119 aa overlap). Hypothetical unknown protein. Contains PS01137 Hypothetical YBL055c/yjjV family signature 1. Mb3875 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F9" /protein_id="SIU02504.1" /translation="MDRVRRVVTDRDSGAGALARHPLAGRRTDPQLAAFYHRLMTTQR HCHTQATIAVARKLAERTRVTITTGRPYQLRDTNGDPVTARGAKELIDAHYHVDTRTH PHNRAHTDTMQNSKPAR" CDS 4260510..4260725 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3875A" /note="unnamed protein product; Mb3875A, 71 aa long, hypothetical conserved protein. Equivalent to hypothetical protein MT395B from M. tuberculosis CDC1551 (100% identity in 71 aa overlap)" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6W1" /protein_id="SIU02505.1" /translation="MVFDKPTVSCLSVSHFQRLFRVAQHNPMPVEIRRDYTHTQHLDH RDSGRRRLTSSFAPPAPAATTQRHGSS" CDS complement(4260850..4260942) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3875CB" /note="unnamed protein product; Mb3875cB, 30 aa long, newly annotated transposase. Equivalent to transposases across multiple M. tuberculosis and M. bovis species (pfam01610: DDE_Tnp_ISL3)(100% identity in 29 aa overlap) including M. tuberculosis H37Rv; TMC 102(Direct submission, 22-09-2014. Isolated from a human lung in New York in 1905. Annotated using automated computational analysis unsing gene predction methods. Hazbon,M.H., Riojas,M.A., Damon,A.M., Alalade,R.O., Cantwell,B.J.,Monaco,A., King,S. and Sohrabi,A.) However this transposase CDS is currently unannotated in H37Rv (May 2015). Mb3875cB found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N1" /protein_id="SIU02506.1" /translation="MTTVNPPTEAISGRVEHLRGSTLGFRNLTN" CDS complement(4261048..4261356) /codon_start=1 /transl_table=11 /gene="Mb3875cC" /locus_tag="BQ2027_MB3875CC" /note="Uncharacterized protein Mb3875cC, 102aa. Similar to ERS007672_00638 from M. tuberculosis COU91265, submitted to NCBI 10-03-2015 (100% identity in 79aa overlap). M. tuberculosis COU91265 annotated by ab initio prediction:Prodigal:2.60. Mb3875cC found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F8" /protein_id="SIU02507.1" /translation="MFLFARVPPTRWAGSSLTSQPCSMTARVATTTLGSMPQGGRHRL PGRPQADTTSMGAFLVAEVPVGGPSSDRCWVDGRRCVAKVYGVGGQEEVDGWAVGGDG " CDS 4261389..4262012 /codon_start=1 /transl_table=11 /gene="sodA" /locus_tag="BQ2027_MB3876" /standard_name="sodB; sod" /product="SUPEROXIDE DISMUTASE [FE] SODA" /note="Mb3876, sodA, len: 207 aa. Equivalent to Rv3846, len: 207 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 207 aa overlap). sodA (alternate gene names: sodB, sod), superoxyde dismutase (EC 1.15.1.1) (see citations below), equivalent to many e.g. P47201|SODM_MYCAV|SODA|SOD from Mycobacterium avium (206 aa), FASTA scores: opt: 1210, E(): 1.8e-73, (82.5% identity in 206 aa overlap); Q9F9R1|SOD from Mycobacterium paratuberculosis (207 aa), FASTA scores: opt: 1207, E(): 2.9e-73, (81.65% identity in 207 aa overlap); O86165|SODM_MYCLP|SODA|SOD from Mycobacterium lepraemurium (206 aa), FASTA scores: opt: 1204, E(): 4.5e-73, (82.05% identity in 206 aa overlap); P13367|SODM_MYCLE|SODA|ML0072 from Mycobacterium leprae (206 aa), FASTA scores: opt: 1169, E(): 9.6e-71, (80.5% identity in 205 aa overlap); etc. Contains PS00088 Manganese and iron superoxide dismutases signature. BELONGS TO THE IRON/MANGANESE SUPEROXIDE DISMUTASE FAMILY. ALTHOUGH FOUND EXTRACELLULARLY, NO SIGNAL SEQUENCE IS PRESENT. AN ALTERNATIVE SECRETORY PATHWAY MAY BE USED. Protein product from Mb3876 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3876 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVI9" /db_xref="InterPro:IPR001189" /db_xref="InterPro:IPR019831" /db_xref="InterPro:IPR019832" /db_xref="InterPro:IPR019833" /db_xref="InterPro:IPR036314" /db_xref="InterPro:IPR036324" /db_xref="UniProtKB/Swiss-Prot:Q7TVI9" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02508.1" /translation="MAEYTLPDLDWDYGALEPHISGQINELHHSKHHATYVKGANDAV AKLEEARAKEDHSAILLNEKNLAFNLAGHVNHTIWWKNLSPNGGDKPTGELAAAIADA FGSFDKFRAQFHAAATTVQGSGWAALGWDTLGNKLLIFQVYDHQTNFPLGIVPLLLLD MWEHAFYLQYKNVKVDFAKAFWNVVNWADVQSRYAAATSQTKGLTFG" CDS 4262223..4262756 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3877" /product="HYPOTHETICAL PROTEIN" /note="Mb3877, -, len: 177 aa. Equivalent to Rv3847, len: 177 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap). Conserved hypothetical protein, equivalent to Q9CDC7|ML0071 HYPOTHETICAL PROTEIN from Mycobacterium leprae (177 aa) FASTA scores: opt: 1149, E(): 1.6e-64, (96.6% identity in 177 aa overlap); and Q9F9R0 HYPOTHETICAL 18.5 KDA PROTEIN from Mycobacterium paratuberculosis (177 aa), FASTA scores: opt: 1139, E(): 6.8e-64, (96.6% identity in 177 aa overlap). Mb3877 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5E9" /protein_id="SIU02509.1" /translation="MGTGSGGPIGVSPFHSRGALKGFVISGRWPDSTKEWAQLLMVAV RVASLPGLLSTTTVFGAREELPDEPEPGTVGLVLAEGTVFGESAIQPGYFADHQPPAL LMLHPPSETTPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGTITSMVSRVGV DPISHPDTAILAMLLAA" CDS 4263011..4263919 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3878" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3878, -, len: 302 aa. Equivalent to Rv3848, len: 302 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 302 aa overlap). Probable conserved transmembrane protein, similar to hypothetical (transmembrane) proteins e.g. Q9HVG2|PA4629 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (192 aa), FASTA scores: opt: 304, E(): 5.3e-11, (35.05% identity in 174 aa overlap); Q9A5S7|CC2370 HYPOTHETICAL PROTEIN from Caulobacter crescentus (207 aa), FASTA scores: opt: 285, E(): 7.4e-10, (29.9% identity in 184 aa overlap); Q9KY43|SCC8A.05c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (193 aa), FASTA scores: opt: 245, E(): 1.6e-07, (32.8% identity in 195 aa overlap); etc. Protein product from Mb3878 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3878 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5F1" /db_xref="InterPro:IPR001727" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F1" /protein_id="SIU02510.1" /translation="MLAATLLSLGAVFLAELGDRSQLITMTYTLRYRWWVVLTGVAIA AFTVHGVAVAIGHFLGSTVPARPAACVSAIAFLIFAVWVWREDTASDSETSPTAAEPR LALFTVVSSFALAELGDKTTLATVTLASDHHWAGVWIGTTLGMILADGLAIGAGLLLH RRLPERLLQVLTGLLFLLFGLWLLFDDALGFRSIAIAVTAAVVLAAATTAVSVRVAQT RRRRPTAAATPEDDSTRPERSSVAPGHPGSILLPLPEVSLRGRRPPSGSPDERCADPG SKGGSRRISVGCWLPGVGRIRPTRSS" CDS 4264184..4264582 /codon_start=1 /transl_table=11 /gene="espr" /locus_tag="BQ2027_MB3879" /product="esx-1 transcriptional regulatory protein espr" /note="Mb3879, -, len: 132 aa. Equivalent to Rv3849, len: 132 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 132 aa overlap). Conserved hypothetical protein, equivalent to Q9CDC9|ML0069 HYPOTHETICAL PROTEIN from Mycobacterium leprae (132 aa) FASTA scores: opt: 724, E(): 8.7e-41, (83.95% identity in 131 aa overlap). Protein product from Mb3879 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3879 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02511.1" /translation="MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPY LSQLRSGNRTNPSGATMAALANFFRIKAAYFTDDEYYEKLDKELQWLCTMRDDGVRRI AQRAHGLPSAAQQKVLDRIDELRRAEGIDA" CDS 4264700..4265356 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3880" /product="conserved protein" /note="Mb3880, -, len: 218 aa. Equivalent to Rv3850, len: 218 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 218 aa overlap). Conserved hypothetical protein, equivalent to Q9CDD0|ML0068 HYPOTHETICAL PROTEIN from Mycobacterium leprae (238 aa) FASTA scores: opt: 1071, E(): 7.2e-55, (78.35% identity in 217 aa overlap). Protein product from Mb3880 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3880 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F3" /protein_id="SIU02512.1" /translation="MGLFGKRKSRATRRAEARAIKARAKLEAKLSAKNEARRIKAAQR AESKALKAQLKARRDSDRAALKVAEAELKVAREGKLLSPTRIRRLLTVSRLLAPILTP VIYRAAMAARGLIDQRRADQLGVPLAQIGRFSGHGARLSARVGGAERSLRMVQEKKPK DVETKQFVSAVTNRLTDLSAAVAAAEHMPAKRRRTAHSAISSQLDGIEADLMARLGLT " CDS 4265368..4265652 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3881" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb3881, -, len: 94 aa. Equivalent to Rv3851, len: 94 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 94 aa overlap). Possible membrane protein. Mb3881 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5G2" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G2" /protein_id="SIU02513.1" /translation="MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPA AFAHTAQLSTATPAPAVAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR" CDS 4265759..4266163 /codon_start=1 /transl_table=11 /gene="hns" /locus_tag="BQ2027_MB3882" /product="POSSIBLE HISTONE-LIKE PROTEIN HNS" /note="Mb3882, hns, len: 134 aa. Equivalent to Rv3852, len: 134 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 134 aa overlap). Possible hns, histone-like protein, equivalent to Q9CDD1|HNS|ML0067 HISTONE-LIKE PROTEIN from Mycobacterium leprae (121 aa), FASTA scores: opt: 341, E(): 4.3e-09, (51.5% identity in 134 aa overlap). Shows some similarity with other histone-like proteins e.g. O65795|HIS1 HISTONE H1 from Triticum aestivum (Wheat) (288 aa), FASTA scores: opt: 183, E(): 0.091, (34.85% identity in 109 aa overlap); etc. Protein product from Mb3882 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3882 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5G5" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G5" /protein_id="SIU02514.1" /translation="MPDPQDRPDSEPSDASTPPAKKLPAKKAAKKAPARKTPAKKAPA KKTPAKGAKSAPPKPAEAPVSLQQRIETNGQLAAAAKDAAAQAKSTVEGANDALARNA SVPAPSHSPVPLIVAVTLSLLALLLIRQLRRR" CDS 4266180..4266653 /codon_start=1 /transl_table=11 /gene="rraa" /locus_tag="BQ2027_MB3883" /product="regulator of rnase e activity a rraa" /note="Mb3883, menG, len: 157 aa. Equivalent to Rv3853, len: 157 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 157 aa overlap). Probable menG, S-adenosylmethionine:2-demethylmenaquinone methyltransferase (EC 2.1.-.-), equivalent to Q9CDD2|MENG|ML0066 PUTATIVE S-ADENOSYLMETHIONINE:2-DEMETHYLMENAQUINONE METHYLTRANSFERASE from Mycobacterium leprae (157 aa) FASTA scores: opt: 896, E(): 1.3e-49, (87.1% identity in 155 aa overlap). Also highly similar to others e.g. Q9S4U0|MENG from Pseudomonas fluorescens (163 aa), FASTA scores: opt: 481, E(): 1.7e-23, (47.0% identity in 149 aa overlap); Q9RW10|DR0859 from Deinococcus radiodurans (160 aa) FASTA scores: opt: 456, E(): 6.3e-22, (45.75% identity in 153 aa overlap); P32165|MENG_ECOLI|B3929|Z5476|ECS4856 from Escherichia coli strain K12 (161 aa), FASTA scores: opt: 428, E(): 3.7e-20, (45.65% identity in 149 aa overlap); etc. Protein product from Mb3883 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3883 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A667" /db_xref="InterPro:IPR005493" /db_xref="InterPro:IPR010203" /db_xref="InterPro:IPR036704" /db_xref="UniProtKB/Swiss-Prot:P0A667" /protein_id="SIU02515.1" /translation="MAISFRPTADLVDDIGPDVRSCDLQFRQFGGRSQFAGPISTVRC FQDNALLKSVLSQPSAGGVLVIDGAGSLHTALVGDVIAELARSTGWTGLIVHGAVRDA AALRGIDIGIKALGTNPRKSTKTGAGERDVEITLGGVTFVPGDIAYSDDDGIIVV" CDS complement(4266689..4268158) /codon_start=1 /transl_table=11 /gene="ethA" /locus_tag="BQ2027_MB3884C" /product="MONOOXYGENASE ETHA" /note="Mb3884c, ethA, len: 489 aa. Equivalent to Rv3854c, len: 489 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 489 aa overlap). ethA (alternate gene names: aka or etaA), monooxygenase required for activation of the pro-drug ethionamide (EC 1.-.-.-) (see citations below), highly similar to other monooxygenases e.g. Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 1959, E(): 2.9e-114, (57.6% identity in 481 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 1771, E(): 2.2e-102, (53.75% identity in 480 aa overlap); Q9A8K5|CC1348 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (499 aa), FASTA scores: opt: 1385, E(): 1.4e-78, (43.2% identity in 486 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa) FASTA scores: opt: 1692, E(): 1.1e-97, (49.7% identity in 489 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 1571, E(): 3.7e-90, (49.05% identity in 471 aa overlap); O69708|Rv3741c|MTV025.089c POSSIBLE OXIDOREDUCTASE (probably second part of a two component monooxygenase) (224 aa), FASTA scores: opt: 542, E(): 1.7e-26, (50.0% identity in 162 aa overlap); etc. Protein product from Mb3884c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3884c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:Q7TVI2" /db_xref="InterPro:IPR020946" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/Swiss-Prot:Q7TVI2" /protein_id="SIU02516.1" /translation="MTEHLDVVIVGAGISGVSAAWHLQDRCPTKSYAILEKRESMGGT WDLFRYPGIRSDSDMYTLGFRFRPWTGRQAIADGKPILEYVKSTAAMYGIDRHIRFHH KVISADWSTAENRWTVHIQSHGTLSALTCEFLFLCSGYYNYDEGYSPRFAGSEDFVGP IIHPQHWPEDLDYDAKNIVVIGSGATAVTLVPALADSGAKHVTMLQRSPTYIVSQPDR DGIAEKLNRWLPETMAYTAVRWKNVLRQAAVYSACQKWPRRMRKMFLSLIQRQLPEGY DVRKHFGPHYNPWDQRLCLVPNGDLFRAIRHGKVEVVTDTIERFTATGIRLNSGRELP ADIIITATGLNLQLFGGATATIDGQQVDITTTMAYKGMMLSGIPNMAYTVGYTNASWT LKADLVSEFVCRLLNYMDDNGFDTVVVERPGSDVEERPFMEFTPGYVLRSLDELPKQG SRTPWRLNQNYLRDIRLIRRGKIDDEGLRFAKRPAPVGV" CDS 4268234..4268884 /codon_start=1 /transl_table=11 /gene="ethR" /locus_tag="BQ2027_MB3885" /product="TRANSCRIPTIONAL REGULATORY REPRESSOR PROTEIN (TETR-FAMILY) ETHR" /note="Mb3885, ethR, len: 216 aa. Equivalent to Rv3855, len: 216 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 216 aa overlap). ethR (alternate gene names: aka or etaR), regulatory protein tetR family, involved in ethionamide sensitivity/resistance, negatively controls neighbouring ethA (Rv3854c, MTCY01A6.14; alternate gene names: aka etaA) (see citations below). Equivalent to Q9CDD3|ML0064 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (214 aa), FASTA scores: opt: 1017, E(): 7e-62, (77.0% identity in 213 aa overlap). Also similar to other transcriptional regulator e.g. Q9S1R1|SCJ9A.09 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (204 aa), FASTA scores: opt: 305, E(): 1.2e-13, (34.5% identity in 200 aa overlap); Q9KYT9|SCE22.24 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR (FRAGMENT) from Streptomyces coelicolor (244 aa), FASTA scores: opt: 179, E(): 4.9e-05, (35.5% identity in 93 aa overlap); Q9RUK2|DR1384 TRANSCRIPTIONAL REGULATOR (TETR FAMILY) from Deinococcus radiodurans (196 aa), FASTA scores: opt: 167, E(): 0.00026, (41.75% identity in 79 aa overlap); etc. Also similar to P95100|Rv3058c|MTCY22D7.23 HYPOTHETICAL 23.8 KDA PROTEIN from Mycobacterium tuberculosis (216 aa) FASTA scores: opt: 261, E(): 1.2e-10, (31.65% identity in 221 aa overlap); and O08377|Rv1534|MTCY07A7A.03 HYPOTHETICAL 24.5 KDA PROTEIN from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 164, E(): 0.00047, (25.5% identity in 248 aa overlap). Contains helix-turn-helix motif at aa 45-66, Score 1320 (+3.68 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Protein product from Mb3885 detected using SWATH mass spectrometry. Mb3885 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:Q7TVI1" /db_xref="InterPro:IPR001647" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR036271" /db_xref="UniProtKB/Swiss-Prot:Q7TVI1" /protein_id="SIU02517.1" /translation="MTTSAASQASLPRGRRTARPSGDDRELAILATAENLLEDRPLAD ISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVVNQADMALQTLAENPADTDRENM WRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTFMQKWIAYTAAVIDAERDRG AAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLDTLVHIWVTSIYGENR" CDS complement(4269086..4270093) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3886C" /product="DNA-dependent DNA polymerase beta chain" /note="Mb3886c, -, len: 335 aa. Equivalent to Rv3856c, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 335 aa overlap). Conserved hypothetical protein, highly similar to various proteins from diverse organisms e.g. Q9EWR3|3SCF60.21 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (372 aa) FASTA scores: opt: 1286, E(): 2.4e-73, (64.0% identity in 336 aa overlap); P72464|ORF1 from Streptomyces lividans (343 aa), FASTA scores: opt: 1275, E(): 1.1e-72, (60.1% identity in 336 aa overlap); Q9K899|BH3107 DNA-DEPENDENT DNA POLYMERASE BETA CHAIN from Bacillus halodurans (571 aa), FASTA scores: opt: 592, E(): 1.2e-29, (39.15% identity in 240 aa overlap); etc. Protein product from Mb3886c detected using SWATH mass spectrometry. Mb3886c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5G8" /db_xref="InterPro:IPR003141" /db_xref="InterPro:IPR004013" /db_xref="InterPro:IPR010996" /db_xref="InterPro:IPR016195" /db_xref="InterPro:IPR017078" /db_xref="InterPro:IPR027421" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G8" /protein_id="SIU02518.1" /translation="MDPVTALRQIAYYKDRNRHDPRRVMAYRNAADIIEGLDDAARQR HGQANSWQSLAGIGPKTAKVIAQAWSGREPDLLAELRADAEDLGGGAIRAALRGDLHL HSNWSDGSAPIEEMMATAAALGHQYCALTDHSPRLTIANGLSPDRLRKQLDVIDELRE KFAPLRILTGIEVDILEDGSLDQEPEMLDRLDIVVASVHSKLSMDSAAMTRRMVRAVA NGHTDVLGHCTGRLIAGNRGIRPESKFDAEAVFTACREHGTAVEINSRPERRDPPTRL LHLARDIGCVFSIDTDAHAPGQLDFLGYGAQRALDAEVPADRIVNTWPADTLLAWTGS H" CDS complement(4270102..4270299) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3887C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb3887c, -, len: 65 aa. Equivalent to Rv3857c, len: 65 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 65 aa overlap). Possible membrane protein. Mb3887c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5F2" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5F2" /protein_id="SIU02519.1" /translation="MNCALGFDTKPILLASYVTHGARRATANQFERPAKGAGVLMALL ILGEMAGFAVVVTGVVFGQLV" CDS complement(4270724..4272190) /codon_start=1 /transl_table=11 /gene="gltD" /locus_tag="BQ2027_MB3888C" /product="probable nadh-dependent glutamate synthase (small subunit) gltd (l-glutamate synthase) (l-glutamate synthetase) (nadh-glutamate synthase) (glutamate synthase (nadh)) (glts beta chain) (nadph-gogat)" /note="Mb3888c, gltD, len: 488 aa. Equivalent to Rv3858c, len: 488 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 488 aa overlap). Probable gltD, small subunit of NADH-dependent glutamate synthase (EC 1.4.1.13), equivalent to Q9CDD4|GLTD|ML0062 NADH-DEPENDENT GLUTAMATE SYNTHASE SMALL SUBUNIT from Mycobacterium leprae (488 aa), FASTA scores: opt: 2997, E(): 1e-166, (87.7% identity in 488 aa overlap). Also highly similar to many e.g. Q9S2Z0|SC3A3.03s from Streptomyces coelicolor (487 aa), FASTA scores: opt: 2152, E(): 1.2e-117, (63.85% identity in 487 aa overlap); Q9KPJ3|VC2374 from Vibrio cholerae (489 aa), FASTA scores: opt: 1699, E(): 2.5e-91, (51.75% identity in 487 aa overlap); Q03460|GLSN_MEDSA from Medicago sativa (Alfalfa) (2194 aa), FASTA scores: opt: 1322, E(): 6.2e-69, (54.45% identity in 485 aa overlap); P09832|GLTD_ECOLI from strain (471 aa) FASTA scores: opt: 889, E() : 0, (37.4% identity in 473 aa overlap); etc. SIMILAR TO OTHER GLUTAMATE SYNTHASES. COFACTOR: FAD (BY SIMILARITY). Protein product from Mb3888c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3888c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5G4" /db_xref="InterPro:IPR006005" /db_xref="InterPro:IPR009051" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR028261" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G4" /protein_id="SIU02520.1" /translation="MADPGGFLKYTHRKLPKRRPVPLRLRDWREVYEEFDNESLRQQA TRCMDCGIPFCHNGCPLGNLIPEWNDLVRRGRWRDAIERLHATNNFPDFTGRLCPAPC EPACVLGINQDPVTIKQIELEIIDKAFDEGWVQPRPPRKLTGQTVAVVGSGPAGLAAA QQLTRAGHTVTVFEREDRIGGLLRYGIPEFKMEKRHLDRRLDQMRSEGTEFRPGVNVG VDISAEKLRADFDAVVLAGGATAWRELPIPGRELEGVHQAMEFLPWANRVQEGDDVLD EDGQPPITAKGKKVVIIGGGDTGADCLGTVHRQGAIAVHQFEIMPRPPDARAESTPWP TYPLMYRVSAAHEEGGERVFSVNTEAFVGTDGRVSALRAHEVTMLDGKFVKVEGSDFE LEADLVLLAMGFVGPERAGLLTDLGVKFTEHGNVARGDDFDTSVPGVFVAGDMGRGQS LIVWAIAEGRAAAAAVDRYLMGSSALPAPVKPTAAPLQ" CDS complement(4272183..4276766) /codon_start=1 /transl_table=11 /gene="gltB" /locus_tag="BQ2027_MB3889C" /product="probable ferredoxin-dependent glutamate synthase [nadph] (large subunit) gltb (l-glutamate synthase) (l-glutamate synthetase) (nadh-glutamate synthase) (glutamate synthase (nadh))(nadph-gogat)" /note="Mb3889c, gltB, len: 1527 aa. Equivalent to Rv3859c, len: 1527 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1527 aa overlap). Probable gltB, ferredoxin-dependent glutamate synthase large subunit (EC 1.4.1.13), equivalent to Q9CDD5|GLTB|ML0061 PUTATIVE FERREDOXIN-DEPENDENT GLUTAMATE SYNTHASE from Mycobacterium leprae (1527 aa), FASTA scores: opt: 9277, E(): 0, (90.25% identity in 1527 aa overlap). Also highly similar to many e.g. Q9S2Y9|SC3A3.04c from Streptomyces coelicolor (1514 aa), FASTA scores: opt: 5939, E(): 0, (64.3% identity in 1544 aa overlap); Q9Z465|GLTB from Corynebacterium glutamicum (Brevibacterium flavum) (1510 aa), FASTA scores: opt: 5790, E(): 0, (63.25% identity in 1534 aa overlap); P39812|GLTB_BACSU|GLTA from Bacillus subtilis (1520 aa), FASTA scores: opt: 3445, E(): 2.8e-196, (52.25% identity in 1531 aa overlap); etc. SIMILAR TO OTHER GLUTAMATE SYNTHASES. Protein product from Mb3889c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3889c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J1" /db_xref="InterPro:IPR002489" /db_xref="InterPro:IPR002932" /db_xref="InterPro:IPR006982" /db_xref="InterPro:IPR013785" /db_xref="InterPro:IPR017932" /db_xref="InterPro:IPR029055" /db_xref="InterPro:IPR036485" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J1" /protein_id="SIU02521.1" /translation="MTPKRVGLYNPAFEHDSCGVAMVVDMHGRRSRDIVDKAITALLN LEHRGAQGAEPRSGDGAGILIQVPDEFLREAVDFELPAPGSYATGIAFLPQSSKDAAA ACAAVQKIAEAEGLQVLGWRSVPTDDSSLGALSRDAMPTFRQVFLAGASGMALERRCY VVRKRAEHELGTKGPGQDGPGRETVYFPSLSGQTLVYKGMLTTPQLKAFYLDLQDERL TSALGIVHSRFSTNTFPSWPLAHPFRRIAHNGEINTVTGNENWMRAREALIKTDIFGS AADVEKLFPICTPGASDTARFDEVLELLHLGGRSLAHAVLMMIPEAWERHESMDPARR AFYQYHASLMEPWDGPASMTFTDGTVVGAVLDRNGLRPSRIWVTDDGLVVMASEAGVL DLHPSTVVRRMRLQPGRMFLVDTAQGRIVSDEEIKADLAAEHPYQEWLDNGLVPLDEL PEGKDVRMPHHRIVMRQLAFGYTYEELNLLVAPMARLGAEPIGSMGTDTPVAVLSQRP RMLYDYFHQLFAQVTNPPLDAIREEVVTSLQGTTGGERDLLNPDENSCHQIVLPQPIL RNHELAKLVSLDPNDKVNGRPHGLRSKVIRCLYRVSEGGAGLAAALEEVRGAAAAAIA DGARIIILSDRESDEEMAPIPSLLAVAGVHHHLVRERTRTQVGLVVESGDAREVHHMA ALVGFGAAAINPYLVFESIEDMLDRGVIEGIDRTAALNNYIKAAGKGVLKVMSKMGIS TLASYTGAQLFQAVGISEQVLDEYFTGLTCPTGGITLDDIAADVAARHRLAYLDRPDE RAHRELEVGGEYQWRREGEYHLFNPETVFKLQHSTRTGQYKIFKEYTRLVDDQSERMA SLRGLLKFRTGVRPPVPLDEVEPASEIVKRFSTGAMSYGSISAEAHETLAIAMNRLGA RSNCGEGGEDVKRFDRDPNGDWRRSAIKQVASARFGVTSHYLTNCTDLQIKMAQGAKP GEGGQLPGHKVYPWVAEVRHSTPGVGLISPPPHHDIYSIEDLAQLIHDLKNANPSARV HVKLVSENGVGTVAAGVSKAHADVVLISGHDGGTGATPLTSMKHAGAPWELGLAETQQ TLLLNGLRDRIVVQVDGQLKTGRDVMIATLLGAEEFGFATAPLVVAGCIMMRVCHLDT CPVGVATQNPLLRERFTGKPEFVENFFMFIAEEVREYLAQLGFRTVNEAVGQAGALDT TLARAHWKAHKLDLAPVLHEPESAFMNQDLYCSSRQDHGLDKALDQQLIVMSREALDS GKPVRFSTTIGNVNRTVGTMLGHELTKAYGGQGLPDGTIDITFDGSAGNSFGAFVPKG ITLRVYGDANDYVGKGLSGGRIVVRPSDDAPQDYVAEDNIIGGNVILFGATSGEVYLR GVVGERFAVRNSGAHAVVEGVGDHGCEYMTGGRVVILGRTGRNFAAGMSGGVAYVYDP DGELPANLNSEMVELETLDEDDADWLHGTIQVHVDATDSAVGQRILSDWSGQQRHFVK VMPRDYKRVLQAIALAERDGVDVDKAIMAAAHG" CDS 4277462..4278634 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3890" /product="Antiactivator of flagellar biosynthesis FleN, an ATPase" /note="Mb3890, -, len: 390 aa. Equivalent to Rv3860, len: 390 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 390 aa overlap). Conserved hypothetical protein, showing similarity with hypothetical proteins from Mycobacterium leprae e.g. Q9CDD8|ML0048 (586 aa), FASTA scores: opt: 484, E(): 5.5e-14, (29.95% identity in 407 aa overlap); O33082|MLCB628.11c (478 aa) FASTA scores: opt: 484, E(): 4.8e-14, (29.95% identity in 407 aa overlap); etc. Also some similarity with O86637|SC3C3.03c HYPOTHETICAL 112.1 KDA PROTEIN from Streptomyces coelicolor(1083 aa), FASTA scores: opt: 483, E(): 9.6e-14, (30.45% identity in 404 aa overlap). And some similarity with other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O05456|Rv3888c|MTCY15F10.24 HYPOTHETICAL 37.7 KDA PROTEIN (341 aa), FASTA scores: opt: 603, E(): 2.8e-19, (35.2% identity in 284 aa overlap); O06396|Rv0530|MTCY25D10.09 HYPOTHETICAL 43.0 KDA PROTEIN (405 aa), FASTA scores: opt: 538, E(): 2e-16, (31.0% identity in 371 aa overlap); O69740|Rv3876|MTV027.11 (666 aa), FASTA scores: opt: 475, E(): 1.5e-13, (30.2% identity in 391 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3890 detected using SWATH mass spectrometry. Mb3890 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002586" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G0" /protein_id="SIU02522.1" /translation="MYERDEFLRDRIRPHQPGTPRGYSPRPPSGDRCPAPPPGRHAAA ATPPGPPRLPSAPLRPLPDPAWPRQPEAPPPSTWADPALAPIRSRTRPGERGWRRMVR LVTFGLVGLGRSGMQRQEAQFEATIRTVLHGNHKVAVLGKGGVGKTSVAACVGSILAE LRQQDRIVGIDADTAFGRLSSRIDPRAAGSFWELTTDTNLRSFTDITARLGRNSAGLY VLAGQPASGPRRALDPAIYREAALRLDHHFAISVIDCGSSMEAAVTQEVLRDVDALIV VSSPWADGASAAANTIEWLSDYGLTGLLRRSIVVLNDSDGHADKRTKSLLAQEFIDHG QPVVEVPFDPHLRPGGVIDMSHEMAPTTRLKILQVAATVTAYFASRPADAHGSPPR" CDS 4278631..4278957 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3891" /product="HYPOTHETICAL PROTEIN" /note="Mb3891, -, len: 108 aa. Equivalent to Rv3861, len: 108 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 108 aa overlap). Hypothetical unknown protein. Mb3891 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H2" /protein_id="SIU02523.1" /translation="MTWLADPVGNSRIARAQACKTSISAPIVESWRAQRGAQCGQREK SCRCSRAVHIQGISPPLFRRPLEPAVQAAVASCRLGRHPVVAHRVTVALGQGSQLAQR ECPRPA" CDS complement(4278856..4279206) /codon_start=1 /transl_table=11 /gene="whiB6" /locus_tag="BQ2027_MB3892C" /product="possible transcriptional regulatory protein whib-like whib6" /note="Mb3892c, whiB6, len: 116 aa. Equivalent to Rv3862c, len: 116 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 116 aa overlap). Possible whiB6, WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Shows similarity with Q49765|WHIB7|ML0639|B1937_F2_68 PUTATIVE TRANSCRIPTIONAL REGULATOR WHIB7 from Mycobacterium leprae (89 aa) FASTA scores: opt: 112, E(): 0.49, (41.2% identity in 51 aa overlap). Some similarity to Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa) FASTA scores: opt: 129, E(): 0.038, (32.95% identity in 85 aa overlap); AAK47632|MT3290.1 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (96 aa), FASTA scores: opt: 126, E(): 0.058, (33.35% identity in 84 aa overlap); Q9FC80|SC4B10.07 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (88 aa), FASTA scores: opt: 119, E(): 0.16, (44.65% identity in 70 aa overlap); Q9K4K8|SC5F8.16c REGULATORY PROTEIN from Streptomyces coelicolor (83 aa), FASTA scores: opt: 114, E(): 0.34, (37.05% identity in 54 aa overlap); etc. Mb3892c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5I2" /db_xref="InterPro:IPR003482" /db_xref="InterPro:IPR034768" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I2" /protein_id="SIU02524.1" /translation="MRYAFAAEATTCNAFWRNVDMTVTALYEVPLGVCTQDPDRWTTT PDDEAKTLCRACPRRWLCARDAVESAGAEGLWAGVVIPESGRARAFALGQLRSLAERN GYPVRDHRVSAQSA" CDS 4279533..4280711 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3893" /product="unknown alanine rich protein" /note="Mb3893, -, len: 392 aa. Equivalent to Rv3863, len: 392 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 392 aa overlap). Hypothetical unknown ala-rich protein. Protein product from Mb3893 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3893 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR008984" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6X7" /protein_id="SIU02525.1" /translation="MAGERKVCPPSRLVPANKGSTQMSKAGSTVGPAPLVACSGGTSD VIEPRRGVAIIGHSCRVGTQIDDSRISQTHLRAVSDDGRWRIVGNIPRGMFVGGRRGS SVTVSDKTLIRFGDPPGGKALTFEVVRPSDSAAQHGRVQPSADLSDDPAHNAAPVAPD PGVVRAGAAAAARRRELDISQRSLAADGIINAGALIAFEKGRSWPRERTRAKLEEVLQ WPAGTIARIRRGEPTEPATNPDASPGLRPADGPASLIAQAVTAAVDGCSLAIAALPAT EDPEFTERAAPILADLRQLEAIAVQATRISRITPELIKALGAVRRHHDELMRLGATAP GATLAQRLYAARRRANLSTLETAQAAGVAEEMIVGAEAEEELPAEATEAIEALIRQIN " CDS 4280954..4282210 /codon_start=1 /transl_table=11 /gene="espe" /locus_tag="BQ2027_MB3894" /product="esx-1 secretion-associated protein espe" /note="Mb3894, -, len: 418 aa. Equivalent to Rv3864, len: 402 aa, from Mycobacterium tuberculosis strain H37Rv, (96.2% identity in 418 aa overlap). Conserved hypothetical protein, similar to Q49722|ML0405|B1620_C2_213|MLCL383.01 HYPOTHETICAL 40.8 KDA PROTEIN from Mycobacterium leprae (394 aa) FASTA scores: opt: 397, E(): 1.2e-12, (31.0% identity in 410 aa overlap). Also similar to various proteins from several organisms e.g. Q9VYF9|CG12723 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (450 aa), FASTA scores: opt: 291, E(): 2.3e-07, (34.6% identity in 130 aa overlap); Q98UE3 PROCOLLAGEN ALPHA1(III) (FRAGMENT) from Xenopus laevis (African clawed frog) (117 aa) FASTA scores: opt: 257, E(): 3.6e-06, (41.75% identity in 103 aa overlap); P27393|CA24_ASCSU COLLAGEN ALPHA 2(IV) CHAIN PRECURSOR from Ascaris suum (Pig roundworm) (Ascaris lumbricoides) (1763 aa), FASTA scores: opt: 273, E(): 5.7e-06, (32.1% identity in 240 aa overlap); etc. Also similar to O06267|Rv3616c|MTCY07H7B.06 (392 aa) FASTA scores: opt: 389, E(): 3e-12, (31.6% identity in 399 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a 36 bp insertion leads to a longer product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (418 aa versus 402 aa). Protein product from Mb3894 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3894 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5Q3" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5Q3" /protein_id="SIU02526.1" /translation="MASGSGLCKTTSNFIWGQLLLLGEGIPDPGDIFNTGSSLFKQIS DKMGLAIPGTNWIGQAAEAYLNQNIAQQLRAQVMGDLDKLTGNMISNQAKYVSDTRDV LRAMKKMIDGVYKVCKGLEKIPLLGHLWSWELAIPMSGIAMAVVGGALLYLTIMTLMN ATNLRGILGRLIEMLTTLPKFPGLPGLPSLPDIIDGLWPPKLPDIPIPGLPDIPGLPD FKWPPTPGSPLFPDLPSFPGFPGFPSLPGFPGLPGFPEFPAIPGFPALPGLPSIPNLF PGLPGLGDLLPGVGDLGKLPTWTELAALPDFLGGFAGLPSLGFGNLLSFASLPTVGQV TATMGQLQQLVAAGGGPSQLASMGSQQAQLISSQAQQGGQQHATLVSDKKEDEEGAAA GVAEAERAPIDAGTAASQRGQEGTVL" CDS 4282298..4282609 /codon_start=1 /transl_table=11 /gene="espf" /locus_tag="BQ2027_MB3895" /product="esx-1 secretion-associated protein espf" /note="Mb3895, -, len: 103 aa. Equivalent to Rv3865, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 103 aa overlap). Conserved hypothetical protein, showing some similarity to O06268|Rv3615c|MTCY07H7B.07 HYPOTHETICAL 10.8 KDA PROTEIN from Mycobacterium tuberculosis (103 aa), FASTA scores: opt: 198, E(): 7.5e-07, (36.25% identity in 102 aa overlap); Q49723|ML0406|B1620_C2_214|MLCL383.02 HYPOTHETICAL 11.1 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 154, E(): 0.00071, (31.05% identity in 103 aa overlap). Protein product from Mb3895 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3895 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H6" /db_xref="InterPro:IPR022536" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H6" /protein_id="SIU02527.1" /translation="MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTH GSFTSKFNDTLQEFETTRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF G" CDS 4282612..4283463 /codon_start=1 /transl_table=11 /gene="espg1" /locus_tag="BQ2027_MB3896" /product="esx-1 secretion-associated protein espg1" /note="Mb3896, -, len: 283 aa. Equivalent to Rv3866, len: 283 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 283 aa overlap). Conserved hypothetical protein. N-terminal end highly similar to O33091|MLCB628.20c HYPOTHETICAL 13.1 KDA PROTEIN from Mycobacterium leprae (122 aa), FASTA scores: opt: 260, E(): 2.1e-09, (43.6% identity in 117 aa overlap); and C-terminal end highly similar to O33090|MLCB628.19c HYPOTHETICAL 36.7 KDA PROTEIN from Mycobacterium leprae (338 aa), FASTA scores: opt: 540, E(): 1.4e-26, (54.5% identity in 156 aa overlap). Also similar to Q9CD34|ML2530 POSSIBLE DNA-BINDING PROTEIN from Mycobacterium leprae (289 aa), FASTA scores: opt: 146, E(): 0.058, (28.25% identity in 269 aa overlap) and O53694|Rv0289|MTV035.17 HYPOTHETICAL 31.6 KDA PROTEIN from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 133, E(): 0.39, (28.15% identity in 277 aa overlap). Protein product from Mb3896 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3896 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025734" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H0" /protein_id="SIU02528.1" /translation="MTGPSAAGRAGTADNVVGVEVTIDGMLVIADRLHLVDFPVTLGI RPNIPQEDLRDIVWEQVQRDLTAQGVLDLHGEPQPTVAEMVETLGRPDRTLEGRWWRR DIGGVMVRFVVCRRGDRHVIAARDGDMLVLQLVAPQVGLAGMVTAVLGPAEPANVEPL TGVATELAECTTASQLTQYGIAPASARVYAEIVGNPTGWVEIVASQRHPGGTTTQTDA AAGVLDSKLGRLVSLPRRVGGDLYGSFLPGTQQNLERALDGLLELLPAGAWLDHTSDH AQASSRG" CDS 4283502..4284053 /codon_start=1 /transl_table=11 /gene="esph" /locus_tag="BQ2027_MB3897" /product="esx-1 secretion-associated protein esph" /note="Mb3897, -, len: 183 aa. Equivalent to Rv3867, len: 183 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 183 aa overlap). Conserved hypothetical protein, highly similar to the hypothetical proteins from Mycobacterium leprae Q9CDD6|ML0056 (169 aa) FASTA scores: opt: 403, E(): 1.8e-18, (48.2% identity in 166 aa overlap); Q49730|ML0407|B1620_C3_264|MLCL383.03 (216 aa), FASTA scores: opt: 517, E(): 1.7e-25, (51.45% identity in 175 aa overlap); and O33090|MLCB628.19c (338 aa), FASTA scores: opt: 403, E(): 3.4e-18, (48.2% identity in 166 aa overlap). Also highly similar to O06269|Rv3614c|MTCY07H7B.08 HYPOTHETICAL 19.8 KDA PROTEIN from Mycobacterium tuberculosis (184 aa), FASTA scores: opt: 559, E(): 3.4e-28, (54.35% identity in 173 aa overlap). Protein product from Mb3897 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3897 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5G7" /protein_id="SIU02529.1" /translation="MVDPPGNDDDHGDLDALDFSAAHTNEASPLDALDDYAPVQTDDA EGDLDALHALTERDEEPELELFTVTNPQGSVSVSTLMDGRIQHVELTDKATSMSEAQL ADEIFVIADLARQKARASQYTFMVENIGELTDEDAEGSALLREFVGMTLNLPTPEEAA AAEAEVFATRYDVDYTSRYKADD" CDS 4284046..4285767 /codon_start=1 /transl_table=11 /gene="ecca1" /locus_tag="BQ2027_MB3898" /product="esx conserved component ecca1. esx-1 type vii secretion system protein." /note="Mb3898, -, len: 573 aa. Equivalent to Rv3868, len: 573 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 573 aa overlap). Member of the CBXX/CFQX family of hypothetical proteins; C-terminal end is highly similar to many e.g. P40118|CBXC_ALCEU|CBXXC|CFXXC CBXX PROTEIN (317 aa) FASTA scores: opt: 572, E(): 3e-24, (42.7% identity in 294 aa overlap); CAC48589 PROBABLE CBBX PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (311 aa) FASTA scores: opt: 569, E(): 4.3e-24, (40.05% identity in 292 aa overlap); P95648|CBBX_RHOSH CBBX PROTEIN from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (309 aa), FASTA scores: opt: 559, E(): 1.5e-23, (41.4% identity in 290 aa overlap); etc. Equivalent to O33089|Y2G8_MYCLE|ML0055|MLCB628.18c HYPOTHETICAL 62.3 KDA PROTEIN from Mycobacterium leprae (573 aa), FASTA scores: opt: 3330, E(): 3.9e-175, (89.2% identity in 573 aa overlap); and similar to Q9CD28|Y282_MYCLE|ML2537 HYPOTHETICAL 69.1 KDA PROTEIN from Mycobacterium leprae (640 aa), FASTA scores: opt: 943, E(): 2.4e-44, (37.5% identity in 571 aa overlap). Also similar to many proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O53687|Y282_MYCTU|Rv0282|MT0295|MTV035.10 HYPOTHETICAL 68.1 KDA PROTEIN (631 aa), FASTA scores: opt: 936, E(): 5.8e-44, (39.05% identity in 568 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Protein product from Mb3898 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3898 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H4" /db_xref="InterPro:IPR000641" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR023835" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041627" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H4" /protein_id="SIU02530.1" /translation="MTDRLASLFESAVSMLPMSEARSLDLFTEITNYDESACDAWIGR IRCGDTDRVTLFRAWYSRRNFGQLSGSVQISMSTLNARIAIGGLYGDITYPVTSPLAI TMGFAACEAAQGNYADAMEALEAAPVAGSEHLVAWMKAVVYGAAERWTDVIDQVKSAG KWPDKFLAGAAGVAHGVAAANLALFTEAERRLTEANDSPAGEACARAIAWYLAMARRS QGNESAAVALLEWLQTTHPEPKVAVALKDPSYRLKTTTAEQIASRADPWDPGSVVTDN SGRERLLAEAQAELDRQIGLTRVKNQIERYRAATLMARVRAAKGMKVAQPSKHMIFTG PPGTGKTTIARVVANILAGLGVIAEPKLVETSRKDFVAEYEGQSAVKTAKTIDQALGG VLFIDEAYALVQERDGRTDPFGQEALDTLLARMENDRDRLVVIIAGYSSDIDRLLETN EGLRSRFATRIEFDTYSPEELLEIANVIAAADDSALTAEAAENFLQAAKQLEQRMLRG RRALDVAGNGRYARQLVEASEQCRDMRLAQVLDIDTLDEDRLREINGSDMAEAIAAVH AHLNMRE" CDS 4285771..4287213 /codon_start=1 /transl_table=11 /gene="eccb1" /locus_tag="BQ2027_MB3899" /product="esx conserved component eccb1. esx-1 type vii secretion system protein. possible membrane protein." /note="Mb3899, -, len: 480 aa. Equivalent to Rv3869, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 480 aa overlap). Possible conserved membrane protein (has hydrophobic stretch near N-terminus), equivalent to O33088|ML0054|MLCB628.17c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (481 aa), FASTA scores: opt: 2489, E(): 8.3e-136, (75.75% identity in 478 aa overlap); and similar to others e.g. Q9Z5I3|ML1544|MLCB596.27 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (506 aa), FASTA scores: opt: 739, E(): 3.9e-35, (33.65% identity in 490 aa overlap). Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. O05449|Rv3895c|MTCY15F10.17 (495 aa), FASTA scores: opt: 795, E(): 2.3e-38, (35.8% identity in 486 aa overlap); O53933|Rv1782|MTV049.04 (506 aa), FASTA scores: opt: 763, E(): 1.6e-36, (34.7% identity in 490 aa overlap); O06317|Rv3450c|MTCY13E12.03c (470 aa) FASTA scores: opt: 717, E(): 6.7e-34, (32.55% identity in 479 aa overlap); etc. Protein product from Mb3899 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3899 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5K0" /db_xref="InterPro:IPR007795" /db_xref="InterPro:IPR042485" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K0" /protein_id="SIU02531.1" /translation="MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYSRSI ALGIVVAVLILAGAALLAYFKPQGKLGGTSLFTDRATNQLYVLLSGQLHPVYNLTSAR LVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLCDTVARADSTS PVVQTAVIAMPLEIDASIDPLQSHEAVLVSYQGETWIVTTKGRHAIDLTDRALTSSMG IPVTARPTPISEGMFNALPDMGPWQLPPIPAAGAPNSLGLPDDLVIGSVFQIHTDKGP QYYVVLPDGIAQVNATTAAALRATQAHGLVAPPAMVPSLVVRIAERVYPSPLPDEPLK IVSRPQDPALCWSWQRSAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGG KFVALQSPDPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPV LSKDAALLEHDTLPADPSPRKVPAGASGAP" CDS 4287213..4289456 /codon_start=1 /transl_table=11 /gene="eccca1" /locus_tag="BQ2027_MB3900" /product="esx conserved component eccca1. esx-1 type vii secretion system protein. possible transmembrane protein." /note="Mb3900, -, len: 747 aa. Equivalent to Rv3870, len: 747 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 747 aa overlap). Possible conserved membrane protein, equivalent to O33087|ML0053|MLCB628.16c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (744 aa), FASTA scores: opt: 4333, E(): 0, (85.4% identity in 746 aa overlap); and similar to N-terminal end of others e.g. Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 1003, E(): 1e-52, (33.65% identity in 725 aa overlap); O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 1078, E(): 3e-57, (35.4% identity in 774 aa overlap); P71068|YUKA YUKA PROTEIN from Bacillus subtilis (1207 aa) FASTA scores: opt: 529, E(): 4.3e-24, (26.1% identity in 636 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 455, E(): 1.5e-19, (27.1% identity in 734 aa overlap); etc. Also similar to N-terminal end of hypothetical proteins from Mycobacterium tuberculosis e.g. O53689|Rv0284|MTV035.12 (1330 aa), FASTA scores: opt: 982, E(): 1.9e-51, (33.8% identity in 719 aa overlap); O06264|Rv3447c|MTCY77.19c (1236 aa), FASTA scores: opt: 761, E(): 4.1e-38, (38.2% identity in 746 aa overlap); O53935|Rv1784|MTV049.06 (932 aa), FASTA scores: opt: 547, E(): 2.8e-25, (36.25% identity in 276 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Note some similarity (with hypothetical proteins from Mycobacterium tuberculosis and P71068|YUKA) continues in downstream ORF MTV027.06. Protein product from Mb3900 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3900 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H3" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR023836" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H3" /protein_id="SIU02532.1" /translation="MTTKKFTPTITRGPRLTPGEISLTPPDDLGIDIPPSGVQKILPY VMGGAMLGMIAIMVAGGTRQLSPYMLMMPLMMIVMMVGGLAGSTGGGGKKVPEINADR KEYLRYLAGLRTRVTSSATSQVAFFSYHAPHPEDLLSIVGTQRQWSRPANADFYAATR IGIGDQPAVDRLLKPAVGGELAAASAAPQPFLEPVSHMWVVKFLRTHGLIHDCPKLLQ LRTFPTIAIGGDLAGAAGLMTAMICHLAVFHPPDLLQIRVLTEEPDDPDWSWLKWLPH VQHQTETDAAGSTRLIFTRQEGLSDLAARGPHAPDSLPGGPYVVVVDLTGGKAGFPPD GRAGVTVITLGNHRGSAYRIRVHEDGTADDRLPNQSFRQVTSVTDRMSPQQASRIARK LAGWSITGTILDKTSRVQKKVATDWHQLVGAQSVEEITPSRWRMYTDTDRDRLKIPFG HELKTGNVMYLDIKEGAEFGAGPHGMLIGTTGSGKSEFLRTLILSLVAMTHPDQVNLL LTDFKGGSTFLGMEKLPHTAAVVTNMAEEAELVSRMGEVLTGELDRRQSILRQAGMKV GAAGALSGVAEYEKYRERGADLPPLPTLFVVVDEFAELLQSHPDFIGLFDRICRVGRS LRVHLLLATQSLQTGGVRIDKLEPNLTYRIALRTTSSHESKAVIGTPEAQYITNKESG VGFLRVGMEDPVKFSTFYISGPYMPPAAGVETNGEAGGPGQQTTRQAARIHRFTAAPV LEEAPTP" CDS 4289559..4291334 /codon_start=1 /transl_table=11 /gene="ecccb1" /locus_tag="BQ2027_MB3901" /product="esx conserved component ecccb1. esx-1 type vii secretion system protein." /note="Mb3901, -, len: 591 aa. Equivalent to Rv3871, len: 591 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 591 aa overlap). Conserved hypothetical protein, equivalent to Q9CDD7|ML0052 HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa) FASTA scores: opt: 3341, E(): 9.8e-192, (80.85% identity in 596 aa overlap); and O33086|MLCB628.15c HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 3329, E(): 5.1e-191, (80.55% identity in 596 aa overlap). And similar to C-terminal end of others e.g. Q9Z5I2|ML1543|MLCB596.28 POSSIBLE SPOIIIE-FAMILY MEMBRANE PROTEIN from Mycobacterium leprae (1345 aa), FASTA scores: opt: 601, E(): 5.6e-28, (32.3% identity in 613 aa overlap); O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 977, E(): 2.1e-50, (35.15% identity in 583 aa overlap); Q9L0T6|SCD35.15c PUTATIVE CELL DIVISION-RELATED PROTEIN from Streptomyces coelicolor (1525 aa), FASTA scores: opt: 414, E(): 9e-17, (27.6% identity in 424 aa overlap);P71068|YUKA YUKA PROTEIN from Bacillus subtilis (1207 aa), FASTA scores: opt: 343, E(): 1.3e-12, (25.8% identity in 395 aa overlap); etc. And similar to to C-terminal end of hypothetical proteins from Mycobacterium tuberculosis e.g. O06264|Rv3447c|MTCY77.19c (1236 aa) FASTA scores: opt: 845, E(): 1.5e-42, (35.3% identity in 586 aa overlap); O53689|Rv0284|MTV035.12 (1330 aa) FASTA scores: opt: 646, E(): 1.2e-30, (33.35% identity in 606 aa overlap); O53935|Rv1784|MTV049.06 (932 aa) FASTA scores: opt: 589, E(): 2.1e-27, (33.1% identity in 619 aa overlap); etc. Contains 2 X PS00017 ATP/GTP-binding site motif A (P-loop). Note some similarity (with hypothetical proteins from Mycobacterium tuberculosis and P71068|YUKA) continues in upstream ORF MTV027.05. Protein product from Mb3901 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3901 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5I0" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR023837" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I0" /protein_id="SIU02533.1" /translation="MTAEPEVRTLREVVLDQLGTAESRAYKMWLPPLTNPVPLNELIA RDRRQPLRFALGIMDEPRRHLQDVWGVDVSGAGGNIGIGGAPQTGKSTLLQTMVMSAA ATHSPRNVQFYCIDLGGGGLIYLENLPHVGGVANRSEPDKVNRVVAEMQAVMRQRETT FKEHRVGSIGMYRQLRDDPSQPVASDPYGDVFLIIDGWPGFVGEFPDLEGQVQDLAAQ GLAFGVHVIISTPRWTELKSRVRDYLGTKIEFRLGDVNETQIDRITREIPANRPGRAV SMEKHHLMIGVPRFDGVHSADNLVEAITAGVTQIASQHTEQAPPVRVLPERIHLHELD PNPPGPESDYRTRWEIPIGLRETDLTPAHCHMHTNPHLLIFGAAKSGKTTIAHAIARA ICARNSPQQVRFMLADYRSGLLDAVPDTHLLGAGAINRNSASLDEAVQALAVNLKKRL PPTDLTTAQLRSRSWWSGFDVVLLVDDWHMIVGAAGGMPPMAPLAPLLPAAADIGLHI IVTCQMSQAYKATMDKFVGAAFGSGAPTMFLSGEKQEFPSSEFKVKRRPPGQAFLVSP DGKEVIQAPYIEPPEEVFAAPPSAG" CDS 4291477..4291773 /codon_start=1 /transl_table=11 /gene="PE35" /locus_tag="BQ2027_MB3902" /product="pe family-related protein pe35" /note="Mb3902, PE35, len: 98 aa. Equivalent to Rv3872, len: 99 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 98 aa overlap). Some similarity to Mycobacterium tuberculosis conserved PE family proteins, e.g. O69713|Rv3746c|MTV025.094c (111 aa), FASTA scores: opt: 306, E(): 5.5e-13, (50.5% identity in 99 aa overlap). Equivalent to AAK48354 from Mycobacterium tuberculosis strain CDC1551 (112 aa) but shorter 14 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transversion (g-t) leads to a slightly shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (98 aa versus 99 aa). Protein product from Mb3902 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3902 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I3" /protein_id="SIU02534.1" /translation="MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGAD EVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA" CDS 4291807..4292913 /codon_start=1 /transl_table=11 /gene="PPE68" /locus_tag="BQ2027_MB3903" /product="ppe family protein ppe68" /note="Mb3903, PPE68, len: 368 aa. Equivalent to Rv3873, len: 368 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 368 aa overlap). Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O33085|ML0051|MLCB628.14c from Mycobacterium leprae (302 aa), FASTA scores: opt: 656, E(): 2.8e-24, (46.2% identity in 288 aa overlap); and O53691|Rv0286|MTV035.14 (513 aa), FASTA scores: opt: 566, E(): 7.8e-20, (35.25% identity in 363 aa overlap). Protein product from Mb3903 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3903 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Y1" /protein_id="SIU02535.1" /translation="MLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVE LTARLNSLGEAWTGGGSDKALAAATPMVVWLQTASTQAKTRAMQATAQAAAYTQAMAT TPSLPEIAANHITQAVLTATNFFGINTIPIALTEMDYFIRMWNQAALAMEVYQAETAV NTLFEKLEPMASILDPGASQSTTNPIFGMPSPGSSTPVGQLPPAATQTLGQLGEMSGP MQQLTQPLQQVTSLFSQVGGTGGGNPADEEAAQMGLLGTSPLSNHPLAGGSGPSAGAG LLRAESLPGAGGSLTRTPLMSQLIEKPVAPSVMPAAAAGSSATGGAAPVGAGAMGQGA QSGGSTRPGLVAPAPLAQEREEDDEDDWDEEDDW" CDS 4293006..4293308 /codon_start=1 /transl_table=11 /gene="esxB" /locus_tag="BQ2027_MB3904" /standard_name="lhp; cfp10" /product="10 kda culture filtrate antigen esxb (lhp) (cfp10)" /note="Mb3904, esxB, len: 100 aa. Equivalent to Rv3874, len: 100 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 100 aa overlap). esxB (alternate gene name: cfp10), 10 KDA culture filtrate antigen (see citations below, especially first), highly similar to O33084|CF10_MYCLE|ML0050|MLCB628.13c 10 KDA CULTURE FILTRATE ANTIGEN CFP10 HOMOLOG from Mycobacterium leprae (99 aa), FASTA scores: opt: 237, E(): 2.4e-08, (39.4% identity in 99 aa overlap). Also similar to O05440|ES6D_MYCTU|Rv3905c|MT4024|MTCY15F10.06 PUTATIVE ESAT-6 LIKE PROTEIN 13 from Mycobacterium tuberculosis (103 aa) FASTA scores: opt: 126, E(): 0.18, (23.1% identity in 91 aa overlap); and shows some similarity with other proteins from Mycobacterium tuberculosis. Contains probable coiled-coil from aa 49-93. BELONGS TO THE ESAT6 FAMILY. Protein product from Mb3904 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3904 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A567" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P0A567" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02536.1" /translation="MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWR GAAGTAAQAAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF" CDS 4293341..4293628 /codon_start=1 /transl_table=11 /gene="esxA" /locus_tag="BQ2027_MB3905" /standard_name="esat-6" /product="6 kda early secretory antigenic target esxa (esat-6)" /note="Mb3905, esxA, len: 95 aa. Equivalent to Rv3875, len: 95 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 95 aa overlap). esxA, Early Secretory Antigenic Target (see citations below), identical to Q57165|O84901|ESAT6 EARLY SECRETORY ANTIGENIC TARGET from Mycobacterium bovis (94 aa), FASTA scores: opt: 596, E(): 4.6e-33, (100.0% identity in 94 aa overlap). Also similar to Q50206|ESA6_MYCLE|ESAT6|ESX|L45|ML0049|MLCB628.12c 6 KDA EARLY SECRETORY ANTIGENIC TARGET HOMOLOG (ESAT-6-LIKE PROTEIN) (L-ESAT) from Mycobacterium leprae (95 aa), FASTA scores: opt: 236, E(): 3.3e-09, (36.25% identity in 91 aa overlap); and weak similarity with others proteins ESAT-like from Mycobacterium leprae. Also some similarity with O53266|ES69_MYCTU|Rv3019c|MT3104|MTV012.33c PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 131, E(): 0.03, (26.15% identity in 88 aa overlap); and other ESAT-like protein. Contains probable coiled-coil from 56 to 92 aa. BELONGS TO THE ESAT6 FAMILY. Protein product from Mb3905 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3905 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A565" /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/Swiss-Prot:P0A565" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02537.1" /translation="MTEQQWNFAGIEAAASAIQGNVTSIHSLLDEGKQSLTKLAAAWG GSGSEAYQGVQQKWDATATELNNALQNLARTISEAGQAMASTEGNVTGMFA" CDS 4293742..4295742 /codon_start=1 /transl_table=11 /gene="espi" /locus_tag="BQ2027_MB3906" /product="esx-1 secretion-associated protein espi. conserved proline and alanine rich protein." /note="Mb3906, -, len: 666 aa. Equivalent to Rv3876, len: 666 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 666 aa overlap). Conserved hypothetical pro-, ala-rich protein, similar to several proteins from Mycobacterium leprae e.g. Q9CDD8|ML0048 HYPOTHETICAL PROTEIN (586 aa), FASTA scores: opt: 1682, E(): 2.1e-45, (50.75% identity in 672 aa overlap); O33082|MLCB628.11c HYPOTHETICAL 52.0 KDA PROTEIN (478 aa), FASTA scores: opt: 1588, E(): 1.5e-42, (53.5% identity in 542 aa overlap) (also has a proline rich N-terminus); etc. Also similar to other proteins from Mycobacterium tuberculosis, especially in C-terminus, e.g. O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 670, E(): 2.5e-14, (34.85% identity in 396 aa overlap) (also has Pro-rich N-terminus); etc. Note that N-terminus is repetitive and highly Proline rich. Protein product from Mb3906 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3906 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR002586" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I8" /protein_id="SIU02538.1" /translation="MAADYDKLFRPHEGMEAPDDMAAQPFFDPSASFPPAPASANLPK PNGQTPPPTSDDLSERFVSAPPPPPPPPPPPPPTPMPIAAGEPPSPEPAASKPPTPPM PIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQT PTGAPQQPESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRP APQHSRRARRGHRYRTDTERNVGKVATGPSIQARLRAEEASGAQLAPGTEPSPAPLGQ PRSYLAPPTRPAPTEPPPSPSPQRNSGRRAERRVHPDLAAQHAAAQPDSITAATTGGR RRKRAAPDLDATQKSLRPAAKGPKVKKVKPQKPKATKPPKVVSQRGWRHWVHALTRIN LGLSPDEKYELDLHARVRRNPRGSYQIAVVGLKGGAGKTTLTAALGSTLAQVRADRIL ALDADPGAGNLADRVGRQSGATIADVLAEKELSHYNDIRAHTSVNAVNLEVLPAPEYS SAQRALSDADWHFIADPASRFYNLVLADCGAGFFDPLTRGVLSTVSGVVVVASVSIDG AQQASVALDWLRNNGYQDLASRACVVINHIMPGEPNVAVKDLVRHFEQQVQPGRVVVM PWDRHIAAGTEISLDLLDPIYKRKVLELAAALSDDFERAGRR" CDS 4295739..4297274 /codon_start=1 /transl_table=11 /gene="eccd1" /locus_tag="BQ2027_MB3907" /product="esx conserved component eccd1. esx-1 type vii secretion system protein. probable transmembrane protein." /note="Mb3907, -, len: 511 aa. Equivalent to Rv3877, len: 511 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 511 aa overlap). Probable conserved transmembrane protein, equivalent to Q9CDD9|ML0047 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (512 aa), FASTA scores: opt: 2496, E(): 2.8e-140, (74.0% identity in 512 aa overlap); and highly similar, but longer 32 aa, to O33081|MLCB628.10c HYPOTHETICAL 51.4 KDA PROTEIN from Mycobacterium leprae (480 aa), FASTA scores: opt: 2346, E(): 2e-131, (74.15% identity in 480 aa overlap). Shows also similarity with other membrane proteins from Mycobacterium leprae e.g. Q9CBV2|ML1539 PROBABLE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 318, E(): 2e-11, (22.7% identity in 520 aa overlap). Also similar to various proteins from Mycobacterium tuberculosis e.g. O53944|Rv1795|MTV049.17 PUTATIVE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 391, E(): 9.4e-16, (24.45% identity in 523 aa overlap); O86362|Rv0290|MTV035.18 HYPOTHETICAL 47.9 KDA PROTEIN (472 aa), FASTA scores: opt: 332, E(): 2.8e-12, (28.1% identity in 509 aa overlap); O05457|Rv3887c|MTCY15F10.25 HYPOTHETICAL 53.2 KDA PROTEIN (509 aa), FASTA scores: opt: 167, E(): 0.017, (24.0% identity in 517 aa overlap); etc. Equivalent to AAK48359 from Mycobacterium tuberculosis strain CDC1551 (479 aa) but longer 32 aa. Protein product from Mb3907 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3907 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5H5" /db_xref="InterPro:IPR006707" /db_xref="InterPro:IPR024962" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H5" /protein_id="SIU02539.1" /translation="MSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPM ETYIDDTVAVLSEVLEDTPADVLGGFDFTAQGVWAFARPGSPPLKLDQSLDDAGVVDG SLLTLVSVSRTERYRPLVEDVIDAIAVLDESPEFDRTALNRFVGAAIPLLTAPVIGMA MRAWWETGRSLWWPLAIGILGIAVLVGSFVANRFYQSGHLAECLLVTTYLLIATAAAL AVPLPRGVNSLGAPQVAGAATAVLFLTLMTRGGPRKRHELASFAVITAIAVIAAAAAF GYGYQDWVPAGGIAFGLFIVTNAAKLTVAVARIALPPIPVPGETVDNEELLDPVATPE ATSEETPTWQAIIASVPASAVRLTERSKLAKQLLIGYVTSGTLILAAGAIAVVVRGHF FVHSLVVAGLITTVCGFRSRLYAERWCAWALLAATVAIPTGLTAKLIIWYPHYAWLLL SVYLTVALVALVVVGSMAHVRRVSPVVKRTLELIDGAMIAAIIPMLLWITGVYDTVRN IRF" CDS 4297425..4298267 /codon_start=1 /transl_table=11 /gene="espj" /locus_tag="BQ2027_MB3908" /product="esx-1 secretion-associated protein espj. conserved alanine rich protein." /note="Mb3908, -, len: 280 aa. Equivalent to Rv3878, len: 280 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 280 aa overlap). Hypothetical unknown ala-rich protein. Protein product from Mb3908 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3908 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5H9" /protein_id="SIU02540.1" /translation="MAEPLAVDPTGLSAAAAKLAGLVFPQPPAPIAVSGTDSVVAAIN KTMPSIESLVSDGLPGVKAALTRTASNMNAAADVYAKTDQSLGTSLSQYAFGSSGEGL AGVASVGGQPSQATQLLSTPVSQVTTQLGETAAELAPRVVATVPQLVQLAPHAVQMSQ NASPIAQTISQTAQQAAQSAQGGSGPMPAQLASAEKPATEQAEPVHEVTNDDQGDQGD VQPAEVVAAARDEGAGASPGQQPGGGVPAQAMDTGAGARPAASPLAAPVDPSTPAPST TTTL" CDS complement(4298325..4300559) /codon_start=1 /transl_table=11 /gene="espk" /locus_tag="BQ2027_MB3909C" /product="esx-1 secretion-associated protein espk. alanine and proline rich protein." /note="Mb3909c, -, len: 744 aa. Equivalent to Rv3879c, len: 729 aa, from Mycobacterium tuberculosis strain H37Rv, (98.0% identity in 743 aa overlap). Hypothetical unknown ala-, pro-rich protein (N-terminal end is repetitive and highly Proline-rich). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 27 bp and 18 bp lead to a longer product compared to the homolog in Mycobacterium tuberculosis strain H37Rv (744 aa versus 729 aa). Protein product from Mb3909c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3909c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K9" /protein_id="SIU02541.1" /translation="MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTD VLDTCRQQKGHVFEGGLWSGGAANAANGALGANINQLMTLQDYLATVITWHRHIAGLI EQAKSDIGNNVDGAQREIDILENDPSLDADERHTAINSLVTATHGANVSLVAETAERV LESKNWKPPKNALEDLLQQKSPPPPDVPTLVVPSPGTPGTPGTPGTPITPGTPITPGT PITPGTPITPGTPITPIPGAPVTPITPTPGTPVTPVTPGKPVTPVTPVKPGTPGEPTP ITPVTPPVAPATPATPATPVTPAPAPHPQPAPAPAPSPGPQPVTPATPGPSGPATPGT PGGEPAPHVKPAALAEQPGVPGQHAGGGTQSGPAHADESAASVTPAAASGVPGARAAA AAPSGTAVGAGARSSVGTAAASGAGSHAATGRAPVATSDKAAAPSTRAASARTAPPAR PPSTDHIDKPDRSESADDGTPVSMIPVSAARAARDAATAAASARQRGRGDALRLARRI AAALNASDNNAGDYGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAI PVDEIARCATYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIPESG KMTGRSRLEVVDPSAAAQLADTTDQRLLDLLPPAPVDVNPPGDERHMLWFELMKPMTS TATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGLLDRAL AAAS" CDS complement(4300976..4301323) /codon_start=1 /transl_table=11 /gene="espl" /locus_tag="BQ2027_MB3910C" /product="esx-1 secretion-associated protein espl" /note="Mb3910c, -, len: 115 aa. Equivalent to Rv3880c, len: 115 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 115 aa overlap). Conserved hypothetical protein, equivalent to O33080|ML0044|MLCB628.09 HYPOTHETICAL 12.2 KDA PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 397, E(): 2e-19, (56.35% identity in 110 aa overlap). Protein product from Mb3910c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3910c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5I7" /db_xref="InterPro:IPR004401" /db_xref="InterPro:IPR036894" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I7" /protein_id="SIU02542.1" /translation="MSMDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAET VEVTINGHQWLTGLRIEDGLLKKLGAEAVAQRVNEALHNAQAAASAYNDAAGEQLTAA LSAMSRAMNEGMA" CDS complement(4301320..4302702) /codon_start=1 /transl_table=11 /gene="espb" /locus_tag="BQ2027_MB3911C" /product="secreted esx-1 substrate protein b, espb. conserved alanine and glycine rich protein" /note="Mb3911c, -, len: 460 aa. Equivalent to Rv3881c, len: 460 aa, from Mycobacterium tuberculosis strain H37Rv, (98.7% identity in 460 aa overlap). Conserved hypothetical ala-, gly-rich protein. C-terminal end highly similar to O06126 HYPOTHETICAL 9.5 KDA PROTEIN (FRAGMENT) from Mycobacterium tuberculosis strain NTI 64719 (90 aa) FASTA scores: opt: 333, E(): 6.3e-07, (69.75% identity in 86 aa overlap) but sequence difference causes frameshift in NTI 64719. Also similar to part of small Mycobacterium leprae ORF O33078|MLCB628.06 (EMBL:Y14967) (101 aa), FASTA scores: opt: 194, E(): 0.04, (59.3% identity in 54 aa overlap), suggesting this is represented by a pseudogene in M. leprae. Protein product from Mb3911c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3911c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR041275" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I4" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02543.1" /translation="MTQSQTVTVDQQEILNRANEVEAPMADPPTDVPITPCELTAAKN AAQQLVLSADNMREYLAAGAKERQRLATSLRNAAKAYGEVDEEAATALDNDGEGTVQA ESAGAVGGDSSAELTDTPRVATAGEPNFMDLKEAARKLETGDQGASLAHFADGWNTFN LTLQGDVKRFRGFDNWEGDAATACEASLDQQRQWILHMAKLSAAMAKQAQYVAQLHVW ARREHPTYEDIVGLERLYAENPSARDQILPVYAEYQQRSEKVLTEYNNKAALEPVNPP KPPPRHQDRPAPPPQEQGLIPGFLMPPSDGSGVTPGTGMPAAPMVPPTGSPGGGLPAD TAAQLTSAGREAAALSGDVAVKAASLGGGGGGGVPSAPLGSAIGGAESVRPAGAGDIA GLGQGRAGGGAALGGGGMGMPMGAAHQGQGGAKSKGSQQEDEALYTEDRAWTEAVIGN RRRQDSKESK" CDS complement(4302809..4304197) /codon_start=1 /transl_table=11 /gene="ecce1" /locus_tag="BQ2027_MB3912C" /product="esx conserved component ecce1. esx-1 type vii secretion system protein. possible membrane protein." /note="Mb3912c, -, len: 462 aa. Equivalent to Rv3882c, len: 462 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 462 aa overlap). Possible conserved membrane protein, equivalent to O33077|ML0042|MLCB628.05 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (467 aa), FASTA scores: opt: 2346, E(): 1.1e-140, (72.1% identity in 462 aa overlap). Also similar to O05459|Rv3885c|MTCY15F10.27 POSSIBLE MEMBRANE PROTEIN from Mycobacterium tuberculosis (537 aa) FASTA scores: opt: 283, E(): 2.5e-10, (26.8% identity in 414 aa overlap); and C-terminal end shows similarity with AAK48368|MT4000 HYPOTHETICAL 45.6 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (422 aa) FASTA scores: opt: 215, E(): 4.1e-06, (26.85% identity in 320 aa overlap). Protein product from Mb3912c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3912c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J6" /db_xref="InterPro:IPR021368" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J6" /protein_id="SIU02544.1" /translation="MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLG VIVATVTFYGRRITGWVAAVYAWLRRRRRPPDSSSEPVVGATVKPGDHVAVRWQGEFL VAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIVSAGYRVGNTA APDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQRRDEGVAGLARYLVASATR IADRLASHGVDAVCGRSFDDYDHATDIGFVREKWSMIKGRDAYTAAYAAPGGPDVWWS ARADHTITRVRVAPGMAPQSTVLLTTADKPKTPRGFARLFGGQRPALQGQHLVANRHC QLPIGSAGVLVGETVNRCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQ FEEFARLIGAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRVSP PEESRYQMALPK" CDS complement(4304194..4305534) /codon_start=1 /transl_table=11 /gene="mycp1" /locus_tag="BQ2027_MB3913C" /product="membrane-anchored mycosin mycp1 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-1)" /note="Mb3913c, -, len: 446 aa. Equivalent to Rv3883c, len: 446 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 446 aa overlap). Possible secreted protease (EC 3.4.-.-), equivalent to O33076|ML0041|MLCB628.04 PROBABLE SECRETED PROTEASE from Mycobacterium leprae (446 aa), FASTA scores: opt: 2448, E(): 1.5e-124, (79.15% identity in 446 aa overlap); and highly similar, but in part, to several putative proteases from Mycobacterium leprae; Q9CBV3|ML1538 (567 aa) FASTA scores: opt: 902, E(): 3e-41, (37.25% identity in 556 aa overlap); and Q9CD36|ML2528 (475 aa), FASTA scores: opt: 873, E(): 9.4e-40, (42.7% identity in 459 aa overlap). Shows also similarity with several proteases from other organisms e.g. Q9PCD0|XF1851 SERINE PROTEASE from Xylella fastidiosa (1000 aa), FASTA scores: opt: 281, E(): 1.3e-07, (27.95% identity in 422 aa overlap); P42780|BPRX_BACNO EXTRACELLULAR SUBTILISIN-LIKE PROTEASE PRECURSOR (EC 3.4.21.-) from Bacteroides nodosus (Dichelobacter nodosus) (595 aa), FASTA scores: opt: 270, E(): 3.2e-07, (28.9% identity in 384 aa overlap); Q46541|APRV5 ACIDIC PROTEASE V5 from Bacteroides nodosus (Dichelobacter nodosus) (595 aa), FASTA scores: opt: 264, E(): 6.8e-07, (28.65% identity in 384 aa overlap); etc. Also highly similar to various proteins from Mycobacterium tuberculosis e.g. O53695|Rv0291|MTV035.19 PUTATIVE PROTEASE (461 aa), FASTA scores: opt: 1168, E(): 1.2e-55, (44.6% identity in 453 aa overlap); O53945|Rv1796|MTV049.18 HYPOTHETICAL 60.0 KDA PROTEIN (585 aa), FASTA scores: opt: 928, E(): 1.2e-42, (37.85% identity in 555 aa overlap) (note gap from aa 155-264); and downstream ORF O05458|Rv3886c|MTCY15F10.26 HYPOTHETICAL 55.6 KDA PROTEIN (550 aa), FASTA scores: opt: 910, E(): 1.1e-41, (40.15% identity in 533 aa overlap) (note partial gap from aa 146-234); etc. Equivalent to AAK48366 from Mycobacterium tuberculosis strain CDC1551 (411 aa) but longer 35 aa. Has signal sequence with possible signal peptidase I cleavage site in residues 19-21 (ASA) and hydrophobic stretch at C-terminus, followed by short positively charged segment, that could act as membrane anchor. Protein product from Mb3913c detected using SWATH mass spectrometry. Mb3913c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6Y3" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Y3" /protein_id="SIU02545.1" /translation="MHRIFLITVALALLTASPASAITPPPIDPGALPPDVAGPDQPTE QRVLCASPTTLPGSGFHDPPWSNTYLGVADAHKFATGAGVTVAVIDTGVDASPRVPAE PGGDFVDQAGNGLSDCDAHGTLTASIIAGRPAPTDGFVGVAPDARLLSLRQTSEAFEP VGSQANPNDPNATPAAGSIRSLARAVVHAANLGVGVINISEAACYKVSRPIDETSLGA SIDYAVNVKGVVVVVAAGNTGGDCVQNPAPDPSTPGDPRGWNNVQTVVTPAWYAPLVL SVGGIGQTGMPSSFSMHGPWVDVAAPAENIVALGDTGEPVNALQGREGPVPIAGTSFA AAYVSGLAALLRQRFPDLTPAQIIHRITATARHPGGGVDDLVGAGVIDAVAALTWDIP PGPASAPYNVRRLPPPVVEPGPDRRPITAVALVAVGLTLALGLGALARRALSRR" CDS complement(4305756..4307615) /codon_start=1 /transl_table=11 /gene="ecca2" /locus_tag="BQ2027_MB3914C" /product="esx conserved component ecca2. esx-2 type vii secretion system protein. probable cbxx/cfqx family protein." /note="Mb3914c, -, len: 619 aa. Equivalent to Rv3884c, len: 619 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 619 aa overlap). Putative CBXX/CFQX protein family, similar to hypothetical proteins from Mycobacterium leprae e.g. Q9CD28|Y282_MYCLE|ML2537 (640 aa), FASTA scores: opt: 725, E(): 2.9e-34, (28.95% identity in 587 aa overlap); O33089|Y2G8_MYCLE|ML0055|MLCB628.18c (BELONGS TO THE CBXX/CFQX FAMILY) (573 aa); Q9CBV5|ML1536 (610 aa) FASTA scores: opt: 648, E(): 7.4e-30, (31.5% identity in 549 aa overlap). Also similar to proteins belonging to the CBXX/CFQX FAMILY e.g. Q9RKZ2|SC6D7.05c PUTATIVE CBXX/CFQX FAMILY PROTEIN from Streptomyces coelicolor (618 aa) FASTA scores: opt: 557, E(): 1.3e-24, (28.6% identity in 601 aa overlap); P27643|SP5K_BACSU|SPOVK|SPOVJ STAGE V SPORULATION PROTEIN K from Bacillus subtilis (322 aa) FASTA scores: opt: 485, E(): 1.1e-20, (35.0% identity in 280 aa overlap) (similarity only at C-terminus); Q9KAC6|BH2363 STAGE V SPORULATION PROTEIN K from Bacillus halodurans (315 aa), FASTA scores: opt: 462, E(): 2.2e-19, (36.05% identity in 244 aa overlap) (similarity only at C-terminus); etc. And similar to hypothetical proteins from Mycobacterium tuberculosis belonging to the CBXX/CFQX FAMILY e.g. O53687|Y282_MYCTU|Rv0282|MT0295|MTV035.10 HYPOTHETICAL 68.1 KDA PROTEIN (631 aa), FASTA scores: opt: 743, E(): 2.6e-35, (29.9% identity in 612 aa overlap); O69733|Y2G8_MYCTU|Rv3868|MT3981|MTV027.03 HYPOTHETICAL 62.4 KDA PROTEIN (573 aa), FASTA scores: opt: 678, E(): 1.3e-31, (31.25% identity in 589 aa overlap); O53947|YH98_MYCTU|Rv1798|MT1847|MTV049.20 (610 aa) FASTA scores: opt: 669, E(): 4.6e-31, (30.95% identity in 549 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). SEEMS TO BELONG TO THE CBXX/CFQX FAMILY. Mb3914c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P59976" /db_xref="InterPro:IPR000641" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR023835" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041627" /db_xref="UniProtKB/Swiss-Prot:P59976" /protein_id="SIU02546.1" /translation="MSRMVDTMGDLLTARRHFDRAMTIKNGQGCVAALPEFVAATEAD PSMADAWLGRIACGDRDLASLKQLNAHSEWLHRETTRIGRTLAAEVQLGPSIGITVTD ASQVGLALSSALTIAGEYAKADALLANRELLDSWRNYQWHQLARAFLMYVTQRWPDVL STAAEDLPPQAIVMPAVTASICALAAHAAAHLGQGRVALDWLDRVDVIGHSRSSGRFG ADVLTAAIGPADIPLLVADLAYVRGMVYRQLHEEDKAQIWLSKATINGVLTDAAKEAL ADPNLRLIVTDERTIASRSDRWDASTAKSRDQLDDDNAAQRRGELLAEGRELLAKQVG LAAVKQAVSALEDQLEVRMMRLEHGLPVEGQTNHMLLVGPPGTGKTTTAEALGKIYAG MGIVRHPEIREVRRSDFCGHYIGESGPKTNELIEKSLGRIIFMDEFYSLIERHQDGTP DMIGMEAVNQLLVQLETHRFDFCFIGAGYEDQVDEFLTVNPGLAGRFNRKLRFESYSP VEIVEIGHRYATPRASQLDDAAREVFLDAVTTIRNYTTPSGQHGIDAMQNGRFARNVI ERAEGFRDTRVVAQKRAGQPVSVQDLQIITATDIDAAIRSVCSDNRDMAAIVW" CDS complement(4307685..4309298) /codon_start=1 /transl_table=11 /gene="ecce2" /locus_tag="BQ2027_MB3915C" /product="esx conserved component ecce2. esx-2 type vii secretion system protein. possible membrane protein." /note="Mb3915c, -, len: 537 aa. Equivalent to Rv3885c, len: 537 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 537 aa overlap). Possible conserved membrane protein (has hydrophobic stretch near N-terminus), showing some similarity with O05462|Rv3882c|MTV027.17c|MTCY15F10.30 POSSIBLE MEMBRANE PROTEIN from Mycobacterium tuberculosis (462 aa) FASTA scores: opt: 283, E(): 8.3e-10, (26.55% identity in 414 aa overlap); and O33077|ML0042|MLCB628.05 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (467 aa), FASTA scores: opt: 260, E(): 2.1e-08, (28.0% identity in 382 aa overlap). Equivalent to AAK48368 from Mycobacterium tuberculosis strain CDC1551 (422 aa) but longer 115 aa. Protein product from Mb3915c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3915c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5J7" /db_xref="InterPro:IPR021368" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J7" /protein_id="SIU02547.1" /translation="MTSKLTGFSPRSARRVAGVWTVFVLASAGWALGGQLGAVMAVVV GVALVFVQWWGQPAWSWAVLGLRGRRPVKWNDPITLANNRSGGGVRVQDGVAVVAVQL LGRAHRATTVTGSVTVESDNVIDVVELAPLPRHPLDLELDSISVVTFGSRTGTVGDYP RVYDAEIGTPPYAGRRETWLIMRLPVIGNTQALRWRTSVGAAAISVAQRVASSLRCQG LRAKLATATDLAELDRRLGSDAVAGSAQRWKAIRGEAGWMTTYAYPAEAISSRVLSQA WTLRADEVIQNVTVYPDATCTATITVRTPTPAPTPPSVILRRLNGEQAAAAAANMCGP RPHLRGQRRCPLPAQLVTEIGPSGVLIGKLSNGDRLMIPVTDAGELSRVFVAADDTIA KRIVIRVVGAGERVCVHTRDQERWASVRMPQLSIVGTPRPAPRTTVGVVEYVRRRKNG DDGKSEGSGVDVAISPTPRPASVITIARPGTSLSESDRHGFEVTIEQIDRATVKVGAA GQNWLVEMEMFRAENRYVSLEPVTMSIGR" CDS complement(4309295..4310947) /codon_start=1 /transl_table=11 /gene="mycp2" /locus_tag="BQ2027_MB3916C" /product="probable alanine and proline rich membrane-anchored mycosin mycp2 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-2)" /note="Mb3916c, -, len: 550 aa. Equivalent to Rv3886c, len: 550 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 550 aa overlap). Probable ala-, pro-rich protease (EC 3.4.-.-), highly similar to Q9CBV3|ML1538 POSSIBLE PROTEASE from Mycobacterium leprae (567 aa), FASTA scores: opt: 1034, E(): 3.9e-32, (43.5% identity in 575 aa overlap); and highly similar, but with gaps, to several putative proteases from Mycobacterium leprae; O33076|ML0041|MLCB628.04 (446 aa), FASTA scores: opt: 860, E(): 1.1e-25, (38.65% identity in 538 aa overlap); Q9CD36|ML2528 (475 aa) (475 aa), FASTA scores: opt: 413, E(): 7.1e-09, (37.7% identity in 562 aa overlap). Also similarity with Q99405|PRTM_BACSP M-PROTEASE (EC 3.4.21.-) from Bacillus sp. strain KSM-K16 (269 aa), FASTA scores: E(): 7.6e-06, (27.1% identity in 277 aa overlap). And highly similar, but also with gaps, to hypothetical proteins from Mycobacterium tuberculosis e.g. O53945|Rv1796|MTV049.18 (585 aa), FASTA scores: opt: 1173, E(): 2.4e-37, (47.9% identity in 578 aa overlap); the upstream ORF O05461|Rv3883c|MTCY15F10.29 (446 aa) FASTA scores: opt: 910, E(): 1.5e-27, (40.15% identity in 533 aa overlap); O06316|Rv3449|MTCY13E12.02 (455 aa) FASTA scores: opt: 477, E(): 2.7e-11, (38.75% identity in 550 aa overlap); etc. Pro rich protein with two serine protease, subtilase family active site motifs: aspartic acid active site motif (PS00136); and histidine active site motif (PS00137). BELONGS TO PEPTIDASE FAMILY S8; ALSO KNOWN AS THE SUBTILASE FAMILY. Protein product from Mb3916c detected using SWATH mass spectrometry. Mb3916c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J2" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR023827" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J2" /protein_id="SIU02548.1" /translation="MASPLNRPGLRAAAASAALTLVALSANVPAAQAIPPPSVDPAMV PADARPGPDQPMRRSNSCSTPITVRNPDVAQLAPGFNLVNISKAWQYSTGNGVPVAVI DTGVSPNPRLPVVPGGDYIMGEDGLSDCDAHGTVVSSIIAAAPLGILPMPRAMPATAA FPPPAGPPPVTAAPAPPVEVPPPMPPPPPVTITQTVAPPPPPPEDAGAMAPSNGPPDP QTEDEPAVPPPPPGAPDGVVGVAPHATIISIRQSSRAFEPVNPSSAGPNSDEKVKAGT LDSVARAVVHAANMGAKVINISVTACLPAAAPGDQRVLGAALWYAATVKDAVIVAAAG NDGEAGCGNNPMYDPLDPSDPRDWHQVTVVSSPSWFSDYVLSVGAVDAYGAALDKSMS GPWVGVAAPGTHIMGLSPQGGGPVNAYPPSRPGEKNMPFWGTSFSAAYVSGVAALVRA KFPELTAYQVINRIVQSAHNPPAGVDNKLGYGLVDPVAALTFNIPSGDRMAPGAQSRV ITPAAPPPPPDHRARNIAIGFVGAVATGVLAMAIGARLRRAR" CDS complement(4310932..4311801) /codon_start=1 /transl_table=11 /gene="eccd2" /locus_tag="BQ2027_MB3917C" /product="esx conserved component eccd2. esx-2 type vii secretion system protein. probable transmembrane protein." /note="Mb3917c, -, len: 289 aa. Equivalent to 3' end of Rv3887c, len: 509 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 288 aa overlap). Probable conserved transmembrane protein (has hydrophilic stretch from ~1-130 then very hydrophobic domain), similar to other membrane proteins and with weak similarity to known transporters, e.g. Q9CBV2|ML1539 PROBABLE MEMBRANE PROTEIN from Mycobacterium leprae (503 aa), FASTA scores: opt: 395, E(): 2.3e-16, (28.0% identity in 496 aa overlap); Q9CD35|ML2529 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (485 aa), FASTA scores: opt: 221, E(): 6.6e-06, (24.6% identity in 423 aa overlap); Q9ADP8|2SC10A7.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (430 aa), FASTA scores: opt: 171, E(): 0.0062, (26.55% identity in 358 aa overlap); CAC44275|SCBAC17F8.03 PUTATIVE DRUG EFFLUX PROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 160, E(): 0.028, (27.85% identity in 323 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. O53944|Rv1795|MTV049.17 PUTATIVE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 360, E(): 2.9e-14, (26.65% identity in 514 aa overlap); etc. Equivalent to AAK48369 from Mycobacterium tuberculosis strain CDC1551 (469 aa) but longer 40 aa. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a deletion of 2406 bp leads to the loss of the NH2 part of Rv3887c, the entire Rv3888c and the COOH part of Rv3889c compared to Mycobacterium tuberculosis strain H37Rv. Protein product from Mb3917c detected using SWATH mass spectrometry. Mb3917c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5J4" /db_xref="InterPro:IPR006707" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J4" /protein_id="SIU02549.1" /translation="MAAHALIGLVVVVLGAITIGVATRKRWQTAVVTAVVTVCGILAA VAAVRMFRPVSMQVLAICVLVGLLVLIRMTPTVALWVARVRPPHFGSITGRDLFARRA GMPVDTVAPVSEADADDEDNELTGITARGTAIAASARLVNAVQVGMCVGVSLVLPAAV WGVLTPRQPWAWLALLVAGLTVGLFITQGRGFAAKYQAVALVCGASAAVCAGVLKYAL DTPKGVQTGLLWPAIFVAAFAALGLAVALVVPATRFRPIIRLTVEWLEVLAMIALLPA AAALGGLFAWLRH" CDS complement(4311783..4312001) /codon_start=1 /transl_table=11 /gene="espg2" /locus_tag="BQ2027_MB3918C" /product="esx-2 secretion-associated protein espg2" /note="Mb3918c, -, len: 72 aa. Equivalent to 5' end of Rv3889c, len: 276 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 68 aa overlap). Hypothetical unknown protein. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a deletion of 2406 bases leads to the loss of the NH2 part of Rv3887c, the entire Rv3888c and the COOH part of Rv3889c compared to Mycobacterium tuberculosis strain H37Rv. Mb3918c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR025734" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5I9" /protein_id="SIU02550.1" /translation="MLTTTVDGLWVLQAVTGVEQTCPELGLRPLLPRLDTAERALRHP VAAELMAVGALDQAGNADPMVREWRLMR" CDS complement(4312009..4312383) /codon_start=1 /transl_table=11 /gene="esxC" /locus_tag="BQ2027_MB3919C" /product="esat-6 like protein esxc (esat-6 like protein 11)" /note="Mb3919c, esxC, len: 124 aa. Similar to Rv3890c, len: 95 aa, from Mycobacterium tuberculosis strain H37Rv, (100% identity in 55 aa overlap). esxC, putative ESAT-6 like protein 11, equivalent to Q9K548|ES6B_MYCPA PUTATIVE ESAT-6 LIKE PROTEIN 11 (ORF3890C) from Mycobacterium paratuberculo sis (95 aa), FASTA scores: opt: 490, E(): 3.3e-26, (76.85% identity in 95 aa overlap). BELONGS TO THE ESAT6 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) leads to a longer product with a different COOH part compared to its homolog in Mycobacterium tuberculosis strain H37Rv (125 aa versus 95 aa). Protein product from Mb3919c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3919c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L3" /protein_id="SIU02551.1" /translation="MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFF AGHGAQGFFDARRRCCRGCRGSLRRWVSMGLPPATCWTTRSEPTRPSRACSKAGRGPS EGGVHARTARLPHLLLAASWLR" CDS complement(4312419..4312742) /codon_start=1 /transl_table=11 /gene="esxD" /locus_tag="BQ2027_MB3920C" /product="possible esat-6 like protein esxd" /note="Mb3920c, esxD, len: 107 aa. Equivalent to Rv3891c, len: 107 aa, from Mycobacterium tuberculosis strain H37Rv, (99.1% identity in 107 aa overlap). esxD, conserved hypothetical protein, equivalent to Q9K547 HYPOTHETICAL 10.3 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (100 aa), FASTA scores: opt: 498, E(): 1.7e-26, (77.25% identity in 101 aa overlap). Protein product from Mb3920c detected using SWATH mass spectrometry. Mb3920c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J9" /protein_id="SIU02552.1" /translation="MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPA TWSGAGVVASHMTATEITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG ASHGS" CDS complement(4312853..4314052) /codon_start=1 /transl_table=11 /gene="PPE69" /locus_tag="BQ2027_MB3921C" /product="ppe family protein ppe69" /note="Mb3921c, PPE69, len: 399 aa. Equivalent to Rv3892c, len: 399 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 399 aa overlap). Member of the Mycobacterium tuberculosis PPE family of conserved proteins, similar to many e.g. O05298|Rv1196|MTCI364.08 from Mycobacterium leprae (391 aa), FASTA scores: opt: 348, E(): 2.2e-08, (26.6% identity in 380 aa overlap). Mb3921c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000030" /db_xref="InterPro:IPR038332" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5J8" /protein_id="SIU02553.1" /translation="MPDPGWAARTPEANDLLLKAGTGVGTHLANQTAWTTLGASHHAS GVASAINTAATAASWLGVGSAASALNVTMLNATLHGLAGWVDVKPAVVSTAIAAFETA NAAMRPAPECMENRDEWGVDNAINPSVLWTLTPRIVSLDVEYFGVMWPNNAAVGATYG GVLAALAESLAIPPPVATMGASPAAPAQAAAAVGQAAAEAAAGDGMRSAYQGVQAGST GAGQSTSAGENFGNQLSTFMQPMQAVMQAAPQALQAPSGLMQAPMSAMQPLQSMVGMF ANPGALGMGGAAPGASAASAAGGISAAATEVGAGGGGAALGGGGMPATSFTRPVSAFE SGTSGRPVGLRPSGALGADVVRAPTTTVGGTPIGGMPVGHAAGGHRGSHGKSEQAATV RVVDDRR" CDS complement(4314131..4314364) /codon_start=1 /transl_table=11 /gene="PE36" /locus_tag="BQ2027_MB3922C" /product="pe family protein pe36" /note="Mb3922c, PE36, len: 77 aa. Equivalent to Rv3893c, len: 77 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 77 aa overlap). Member of the Mycobacterium tuberculosis PE family of conserved proteins, similar to other e.g. O53690|Rv0285|MTV035.13 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 136, E(): 0.042, (35.6% identity in 73 aa overlap). Mb3922c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M8" /protein_id="SIU02554.1" /translation="MVWSVQPEAVLASAAAESAISAETEAAAAGAAPALLSTTPMGGD PDSAMFSAALNACGASYLGVVAEHASQRGLFAG" CDS complement(4314631..4316316) /codon_start=1 /transl_table=11 /gene="eccc2" /locus_tag="BQ2027_MB3923C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN [SECOND PART]" /note="Mb3923c, -, len: 561 aa. Equivalent to 3' end of Rv3894c, len: 1396 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 561 aa overlap). Possible conserved membrane protein (possible transmembrane segments from aa ~37-85), similar to Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 652, E(): 2.2e-30, (27.85% identity in 1425 aa overlap); Q9CDD7|ML0052 HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 537, E(): 6.6e-24, (27.5% identity in 585 aa overlap) (similarity only with C-terminal end); Q9Z5I2|ML1543|MLCB596.28 POSSIBLE SPOIIIE-FAMILY MEMBRANE PROTEIN from Mycobacterium leprae (1345 aa), FASTA scores: opt: 523, E(): 8.6e-23, (31.65% identity in 1412 aa overlap). Also similar to various proteins e.g. O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 973, E(): 2.8e-49, (28.1% identity in 1409 aa); Q9L0T6|SCD35.15c PUTATIVE CELL DIVISION-RELATED PROTEIN from Streptomyces coelicolor(1525 aa), FASTA scores: opt: 524, E(): 8.3e-23, (24.95% identity in 1450 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 444, E(): 4.1e-18, (22.5% identity in 1346 aa overlap); etc. Also similar to AAK46103|MT1833 FTSK/SPOIIIE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (1391 aa), FASTA scores: opt: 769, E(): 2.9e-37, (30.6% identity in 1434 aa overlap); and other hypothetical proteins from Mycobacterium tuberculosis e.g. O53689|Rv0284|MTV035.12 (1330 aa), FASTA scores: opt: 634, E(): 2.5e-29, (28.2% identity in 1443 aa overlap); O06264|Rv3447c|MTCY77.19c (1236 aa), FASTA scores: opt: 632, E(): 3.1e-29, (28.75% identity in 1391 aa overlap); O69736|R3871|MTV027.06 (591 aa), FASTA scores: opt: 588, E(): 6.6e-27, (27.75% identity in 605 aa overlap) (similarity only with C-terminal end); etc. Contains two possible (PS00017) ATP/GTP-binding sites (P-loop) in central portion. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3894c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c), splits Rv3894c into 2 parts, Mb3923c and Mb3924c. Protein product from Mb3923c detected using SWATH mass spectrometry. Mb3923c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6Y5" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR023837" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Y5" /protein_id="SIU02555.1" /translation="MAAYRGKPWHVDYGQNPGLMFPVGVMDIPEESQQVVHAVDALRS NIIVVGAKQRGKTTTLMALMCSAATMYTPERVTFFCIGGATMAQIGSLPHVTDIVSPK DAEGIERILSTMDALIDAREEAFRRAKIDMDGFRERRFGIGGDGVGGTDPTDAFGDVF VVLDDYDDLYAKDTLLGDRIISLSSRGPEYGVHLMCSAGGWIHGQRQSLLQNVTARIQ LRLADPGESQMGHLSIESREAARRTLNRPGFGLTESLHELRIGVPALADPGTGELVGI TDVGARIADVAGVTKHASLQRLPQRVELSAIVEHEAVHQGGDDLSIAFAIGERHELGP VPIKLRESPGLMILGRQGCGKTTALVAIGEAVMNRFSPQQAQLTLIDPKTAPHGLRDL HAPGYVRAYAYDQDEIDEVITELAQQILLPRLPPKGLSQEELRALKPWEGPRHFVLID DVQDLRPAQSYPQKPPVGAALWKLMERARQVGLHVFSTRNSANWATMPMDPWVKSQTS AKVAQLYMDNDPQNRINRSVRAQTLPPGRGLLVGADGDVEGILVGYPSVPGEQ" CDS complement(4316321..4318822) /codon_start=1 /transl_table=11 /gene="eccc2" /locus_tag="BQ2027_MB3924C" /product="POSSIBLE CONSERVED MEMBRANE PROTEIN [FIRST PART]" /note="Mb3924c, -, len: 833 aa. Equivalent to 5' end of Rv3894c, len: 1396 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 771 aa overlap). Possible conserved membrane protein (possible transmembrane segments from aa ~37-85), similar to Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 652, E(): 2.2e-30, (27.85% identity in 1425 aa overlap); Q9CDD7|ML0052 HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 537, E(): 6.6e-24, (27.5% identity in 585 aa overlap) (similarity only with C-terminal end); Q9Z5I2|ML1543|MLCB596.28 POSSIBLE SPOIIIE-FAMILY MEMBRANE PROTEIN from Mycobacterium leprae (1345 aa), FASTA scores: opt: 523, E(): 8.6e-23, (31.65% identity in 1412 aa overlap). Also similar to various proteins e.g. O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 973, E(): 2.8e-49, (28.1% identity in 1409 aa); Q9L0T6|SCD35.15c PUTATIVE CELL DIVISION-RELATED PROTEIN from Streptomyces coelicolor(1525 aa), FASTA scores: opt: 524, E(): 8.3e-23, (24.95% identity in 1450 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 444, E(): 4.1e-18, (22.5% identity in 1346 aa overlap); etc. Also similar to AAK46103|MT1833 FTSK/SPOIIIE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (1391 aa), FASTA scores: opt: 769, E(): 2.9e-37, (30.6% identity in 1434 aa overlap); and other hypothetical proteins from Mycobacterium tuberculosis e.g. O53689|Rv0284|MTV035.12 (1330 aa), FASTA scores: opt: 634, E(): 2.5e-29, (28.2% identity in 1443 aa overlap); O06264|Rv3447c|MTCY77.19c (1236 aa), FASTA scores: opt: 632, E(): 3.1e-29, (28.75% identity in 1391 aa overlap); O69736|R3871|MTV027.06 (591 aa), FASTA scores: opt: 588, E(): 6.6e-27, (27.75% identity in 605 aa overlap) (similarity only with C-terminal end); etc. Contains two possible (PS00017) ATP/GTP-binding sites (P-loop) in central portion. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3894c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base insertion (*-c), splits Rv3894c into 2 parts, Mb3923c and Mb3924c." /db_xref="GOA:A0A1R3Y5T2" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR023836" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5T2" /protein_id="SIU02556.1" /translation="MSKKAFPINRVNIDPPKPVRVAPNPPIALPEREPRNIWVMIGVP ALIVALIGTIVMLYVSGVRSLATGFFPLMGIGAFSMLAFSGRFGRARKITWGELEKGR RRYLRDLDTNRDEIQTAVCAQREWQNAVHSDPPGLGAIIGGPRMWERGRGDVDFLEVR VGTGVQHAPDSVLSVTWPDISSDEELEPVTGQALRDFILEQRKIRDIAKVVNLRSAPG FSFVSEDLDRVRSLMRSVLCSLAVFHNPRDVKLMVVTRNPEVWAWMVWLPHNLHDELF DACGWRRLIFATPEELEAALGAELHMKGKRGAWTPPTVASPTAMGSALETGQVGVDLG PHLVIVDDNTGSPDAWESVVGQVGKAGLTVLRIASRVGTGVGFAEDQVFEMAQRHGAA TAVKAGRDGADADDDQRPAPLLRARGTFFAHADQLSIHRAYRYARAMARWSPTSRSEV TDSTSGAAELLRSLGISDPRELDVDRLWAERRGRGDDRWCEIPVGAKPNGELQNIILR AKDFGGFGFHSVVIGTSGSGKSELFLSLVYGIALTHSPETFNVIFVDMKFESAAQDIL GIPHVVAALSNLGKDERHLAERMRRVIDGEIKQRYELFKSVGARDANDYEEIRLAGRD LPPVPVLLVIVDEYLELFANHKKWIDLIIHIGQEGRGANVFFMLGGQRLDLSSLQKVK SNIAFRIALRAESGDDSREVIGSDAAYHLPSKENGFALLKVGPRDLEPFRCFYLSAPF VVPKKKEVARTIDMTLTQPRLYDWQYQPLDARRRRGIGDRRGRRCGTRRIPLLRRRFQ EEEDRRRAAGVAIQRAAPIAAPAVVGAAGRPRAGR" CDS complement(4318823..4320310) /codon_start=1 /transl_table=11 /gene="eccb2" /locus_tag="BQ2027_MB3925C" /product="esx conserved component eccb2. esx-2 type vii secretion system protein. probable membrane protein." /note="Mb3925c, -, len: 495 aa. Equivalent to Rv3895c, len: 495 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 495 aa overlap). Possible conserved membrane protein, highly similar to two CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae: Q9Z5I3|ML1544|MLCB596.27 (506 aa), FASTA scores: opt: 1070, E(): 1.4e-53, (39.8% identity in 485 aa overlap); and Q9CD29|ML2536 (552 aa), FASTA scores: opt: 483, E(): 4e-20, (36.85% identity in 499 aa overlap). Also highly similar to various proteins from Mycobacterium tuberculosis e.g. O53933|Rv1782|MTV049.04 HYPOTHETICAL PROTEIN (506 aa), FASTA scores: opt: 1106, E(): 1.2e-55, (41.25% identity in 485 aa overlap); O69734|Rv3869|MTV027.04 HYPOTHETICAL PROTEIN (480 aa), FASTA scores: opt: 795, E(): 6.1e-38, (36.0% identity in 486 aa overlap); O33088|ML0054|MLCB628.17c PUTATIVE MEMBRANE PROTEIN) (481 aa), FASTA scores: opt: 740, E(): 8.3e-35, (35.65% identity in 485 aa overlap); etc. Protein product from Mb3925c detected using SWATH mass spectrometry. Mb3925c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5K7" /db_xref="InterPro:IPR007795" /db_xref="InterPro:IPR042485" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K7" /protein_id="SIU02557.1" /translation="MPLSLSNRDQNSGHLFYNRRLRAATTRFSVRMKHDDRKQTAALA LSMVLVAIAAGWMMLLNVLKPTGIVGDSAIIGDRDSGALYARIDGRLYPALNLTSARL ATGTAGQPTWVKPAEIAKYPTGPLVGIPGAPAAMPVNRGAVSAWAVCDTAGRPRSADK PVVTSIAGPITGGGRATHLRDDAGLLVTFDGSTYVIWGGKRSQIDPTNRAVTLSLGLD PGVTSPIQISRALFDGLPATEPLRVPAVPEAGTPSTWVPGARVGSVLQAQTAGGGSQF YVLLPDGVQKISSFVADLLRSANSYGAAAPRVVTPDVLVHTPQVTSLPVEYYPAGRLN FVDTAADPTTCVSWEKASTGPQARVAVYNGRGLPVPPSMDSRIVRLVRDDRAPASVVA TQVLVLPGAANFVTSTSGVITAESRESLFWVSGNGVRFGIANDEATLRALGLDPGAAV QAPWPLLRTFAAGPALSRDAALLARDTVPTLGQVAIVTTTAKAGA" CDS complement(4320354..4321220) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3926C" /product="putative secreted protein" /note="Mb3926c, -, len: 288 aa. Similar to Rv3896c, len: 302 aa (first GTG taken, although TBparse suggests TTG at 16079), from Mycobacterium tuberculosis strain H37Rv, (98.8% identity in 249 aa overlap). Putative conserved ala-rich protein. C-terminus highly similar to C-terminal end of other proteins e.g. Q9XAS4|SC10A7.01 HYPOTHETICAL 17.2 KDA PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 255, E(): 1.4e-08, (32.0% identity in 222 aa overlap); CAC44611|STBAC16H6.32 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (172 aa), FASTA scores: opt: 214, E(): 3.4e-06, (42.55% identity in 94 aa overlap); Q38352|ORF360 from Lactococcus delbrueckii bacteriophage LL-H (360 aa), FASTA scores: opt: 211, E(): 9.5e-06, (40.0% identity in 115 aa overlap); P54334|XKDO_BACSU|XKDO PHAGE-LIKE ELEMENT PBSX PROTEIN from Bacillus subtilis (1332 aa), FASTA scores: opt: 209, E(): 3.6e-05, (38.35% identity in 86 aa overlap); etc. Also similar to P71594|P71594|Rv0024|MTCY10H4.24 HYPOTHETICAL 30.3 KDA PROTEIN from Mycobacterium tuberculosis (281 aa), FASTA scores: opt: 265, E(): 3.9e-09, (29.25% identity in 287 aa overlap). REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a frameshift due to a single base deletion (t-*) leads to a product with a different COOH part than in Mycobacterium tuberculosis (288 aa versus 302 aa). Mb3926c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR023346" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K2" /protein_id="SIU02558.1" /translation="MSTWHRIGTEGEPLTDPLTTQAIAALSRGHGLFAGGVSGADIDA PQIQQYANAISWVANAVPTAAAYRWRGAARALRRLANTGEALAQIMAAAQIDHAHART ATRALLEAAKTDAMALTDTPLGRREAMARMAARLRAQHRHIARCRSRARLLGLRLRRL RYLRTAAARRPQVTTPGGRAQVLAAVQKALDIKGVHDPAARARWTRGMDLVARRESNY NANAINHWDSNAARGTPSRGVWQFIAPTFAAITSRARRPTSTIWSPRRARSSTTREAT TGWPPTHRIWPI" CDS complement(4321370..4322359) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3927C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3927c, -, len: 329 aa. Equivalent to Rv3898c and Rv3897c, len: 110 aa and 210 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 110 aa overlap and 98.8% identity in 168 aa overlap). Conserved hypothetical proteins. Highly similar, but in part, to Q10691|YK83_MYCTU|Rv2083|MT2145|MTCY49.22 HYPOTHETICAL 30.8 KDA PROTEIN from Mycobacterium tuberculosis (314 aa) FASTA scores: opt: 204, E(): 0.00042, (50.6% identity in 81 aa overlap) and FASTA scores: opt: 815, E(): 4.7e-26, (73.05% identity in 167 aa overlap). Similarity suggests it should be in frame with next ORF and that the stop codon could be read through, the sequence appears to be correct. Homology lost upstream at 15138 gatc sequence may suggest discontinuity due to chimerism in cY15F10 or cY49. Similarity to MTCY49.22 suggests that this is a continuation of MTCY15F10.14. There is a frameshift mutation near 3'-end with respect to this sequence as well, similarity to MTCY49.22 continues in an overlapping ORF. Sequence appears to be correct. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3898c and Rv3897c exist as 2 genes. In Mycobacterium bovis, a single base transition (t-c) leads to a single product. Protein product from Mb3927c detected using SWATH mass spectrometry and 0. Mb3927c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K4" /protein_id="SIU02559.1" /translation="MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPV DLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQ GVGAQAEAQGATQMMQQAVSGITGALGGAVGGVMGPLTQLPQQAMQAGQGAMQPLMSA LQQTYGAEGLDVADGARLVDSIEGEPGLGGEPGAGDVGAGGGGGGTTPTGYLGPPPVP TSSPPTTPAGAPAKSVTPDPVSGTPRASGPAGMTGMPMVPPGALGAGAEGANKDKPVE KRVTAPAVPNGQPVKGRLTVPPSVPVKSADDKPVVTKSTRRILVVPNDDKVKE" CDS complement(4322521..4323006) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3928C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3928c, -, len: 161 aa. Equivalent to 3' end of Rv3899c, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 293 aa overlap). Conserved hypothetical protein, similar in part to proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551. Region between aa 29-80 is strictly identical to P96909 HYPOTHETICAL 15.1 KDA PROTEIN (FRAGMENT) (143 aa) FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap); and the N-terminal end is highly similar, but longer 65 aa, to O07266 HYPOTHETICAL 13.7 KDA PROTEIN (FRAGMENT) (143 aa), FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap). Highly similar to C-terminal end of Q10690|YK82_MYCTU|Rv2082|MTCY49.21 HYPOTHETICAL 73.6 KDA PROTEIN (721 aa), FASTA scores: opt: 1388, E(): 1.5e-48, (55.25% identity in 409 aa overlap). And similar to P71599|Rv0029|MTCY10H4.29 HYPOTHETICAL 39.6 KDA PROTEIN (365 aa), FASTA scores: opt: 403, E(): 1.7e-09, (33.75% identity in 252 aa overlap). Note that MTCY15F10.12 and MTCY15F10.13 appear frameshifted with respect to MTCY49.21 although the sequence appears to be correct. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3899c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv3899c into 2 parts, Mb3928c and Mb3929c with the former being the more likely product. Mb3928c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR040833" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K1" /protein_id="SIU02560.1" /translation="MLEPTARRRDADVIDLLGAVVAVAAHESNTYVAEPGPDAPALTG DRSARSAIPKVDEFGPTLVEAVRRRDSLPRIAQAIALPAVRKTGVLENEAELLHGCIT AVKESVLKAYPSHELTAVGDWMLLAAIEALIDEQDYLANYHLAWYAVTTRRGGSRGFA A" CDS complement(4323009..4323752) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3929C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3929c, -, len: 247 aa. Equivalent to 5' end of Rv3899c, len: 410 aa, from Mycobacterium tuberculosis strain H37Rv, (90.8% identity in 130 aa overlap). Conserved hypothetical protein, similar in part to proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551. Region between aa 29-80 is strictly identical to P96909 HYPOTHETICAL 15.1 KDA PROTEIN (FRAGMENT) (143 aa) FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap); and the N-terminal end is highly similar, but longer 65 aa, to O07266 HYPOTHETICAL 13.7 KDA PROTEIN (FRAGMENT) (143 aa), FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap). Highly similar to C-terminal end of Q10690|YK82_MYCTU|Rv2082|MTCY49.21 HYPOTHETICAL 73.6 KDA PROTEIN (721 aa), FASTA scores: opt: 1388, E(): 1.5e-48, (55.25% identity in 409 aa overlap). And similar to P71599|Rv0029|MTCY10H4.29 HYPOTHETICAL 39.6 KDA PROTEIN (365 aa), FASTA scores: opt: 403, E(): 1.7e-09, (33.75% identity in 252 aa overlap). Note that MTCY15F10.12 and MTCY15F10.13 appear frameshifted with respect to MTCY49.21 although the sequence appears to be correct. In Mycobacterium tuberculosis strain H37Rv, Rv3899c (len: 410 aa) exists as a single gene on a single reading frame. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, Rv3899c exists as a single gene. In Mycobacterium bovis, a frameshift due to a single base deletion (c-*) splits Rv3899c into 2 parts, Mb3928c and Mb3929c with the former being the more likely product. Mb3929c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M0" /protein_id="SIU02561.1" /translation="MVTGQPAAAGAHSLSEGAMTAMQSGSVPPPQATPPITTPPVVSA PTMAAGIEATHGPVDTPANTPGAPPASTGTTGPVAPTVVTAGPVAAPAAPVVGGSAVP AGPLPAYGSDLRPPSWQPPPCPRFLRRPYPARRWRPRRHRPHRRVGRWFLRWSAQPRK LWLDRLVRARRQWPAPRHCRPPPARRRARYRLGRLSSNAYSESWMPWRARSRESHGRP GCATTAPPPCWSPIWPAGGFRPTSGCPRT" CDS complement(4323746..4324681) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3930C" /product="CONSERVED HYPOTHETICAL ALANINE RICH PROTEIN" /note="Mb3930c, -, len: 311 aa. Equivalent to Rv3900c, len: 311 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 311 aa overlap). Conserved hypothetical ala-rich protein, highly similar to N-terminal end of Q10690|YK82_MYCTU|Rv2082|MTCY49.21 HYPOTHETICAL 73.6 KDA PROTEIN from Mycobacterium tuberculosis (721 aa), FASTA scores: opt: 592, E(): 2.7e-22, (37.15% identity in 280 aa overlap). Note that MTCY15F10.12 and MTCY15F10.13 appear frameshifted with respect to MTCY49.21 although the sequence appears to be correct. Mb3930c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L5" /protein_id="SIU02562.1" /translation="MVAADLPPGRWSAVLVGAWWPAPSAALRAAAQHWATWAMQKQEL ARNLISQHDLLLRNQGRTAEDLIGRYLRGAKSEVTKAEKYEIKKGAFNTAADAIDYLR SRLTGIAGEGNKEIDDVLASKKPLPEQLAEIQAIQTRCNADAANASRDAVDKVMTAMQ EILEAEDIGDDPRTWARANGFNVDDAPPPRLIRENDLAALTGPGARGGSFGSVEGAGD LASPQSVGAGGFSGSGVQAACSQPAPRAIGASSRHASAGPVPPAPVVTTPAAATPPVI ATGPRWRCPAGRCRRRPSDRAYRLRRLGNRLRPGW" CDS complement(4324738..4325187) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3931C" /product="POSSIBLE MEMBRANE PROTEIN" /note="Mb3931c, -, len: 149 aa. Equivalent to Rv3901c, len: 149 aa, from Mycobacterium tuberculosis strain H37Rv, (99.3% identity in 149 aa overlap). Possible membrane protein (hydrophobic stretch from ~30-52), showing some similarity with O53200|Rv2473|MTV008.29 HYPOTHETICAL 25.1 KDA PROTEIN from Mycobacterium tuberculosis (238 aa), FASTA scores: opt: 147, E(): 0.036, (31.35% identity in 134 aa overlap). Mb3931c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5K8" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K8" /protein_id="SIU02563.1" /translation="MQAANRRSADTICGVTAPAPWPIPRTRSWPAIVVAAIAAVVAVA ALIVALTNARPAATPATTSVPTYTAAQTAAAQRQLCDTYKLVAHAVPVDTNGSDKALA RITLTNAAAILDNAAADPALDAKHRDAARASDRLPHNDRNGEWWHSS" CDS complement(4325738..4326268) /codon_start=1 /transl_table=11 /gene="LH57_21250" /locus_tag="BQ2027_MB3932C" /product="necrotizing toxin (TNT)" /note="Mb3932c, -, len: 176 aa. Equivalent to Rv3902c, len: 176 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 176 aa overlap). Hypothetical unknown protein. Protein product from Mb3932c detected using SWATH mass spectrometry. Mb3932c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR028953" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N5" /protein_id="SIU02564.1" /translation="MTIGVDLSTDLQDWIRLSGMNMIQGSETNDGRTILWNKGGEVRY FIDRLAGWYVITSSDRMSREGYEFAAASMSVIEKYLYGYFGGSVRSERELPAIRAPFQ PEELMPEYSIGTMTFAGRQRDTLIDSSGTVVAITAADRLVELSHYLDVSVNVIKDSFL DSEGKPLFTLWKDYKG" CDS complement(4326265..4328805) /codon_start=1 /transl_table=11 /gene="cpnT" /locus_tag="BQ2027_MB3933C" /product="Outer membrane channel protein CpnT (Channel protein with necrosis-inducing toxin) [Cleaved into: N-terminal channel domain; Tuberculosis necrotizing toxin (TNT) (NAD(+) glycohydrolase) (EC ]" /EC_number="3.2.2.5" /note="Mb3933c, -, len: 846 aa. Equivalent to Rv3903c, len: 846 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 846 aa overlap). Hypothetical unknown ala-, pro-rich protein." /db_xref="InterPro:IPR025331" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Y7" /protein_id="SIU02565.1" /translation="MAPLAVDPAALDSAGGAVVAAGAGLGAVISSLTAALAGCAGMAG DDPAGAVFGRSYDGSAAALVQAMSVARNGLCNLGDGVRMSAHNYSLAEAMSDVAGRAA PLPAPPPSGCVGVGAPPSAVGGGGGAPKGWGWVAPYIGMIWPNGDSTKLRAAAVAWRS AGTQFALTEIQSTAGPMGVIRAQQLPEAGLIESAFADAYASTTAVVGQCHQLAAQLDA YAARIDAVHAAVLDLLARICDPLTGIKEVWEFLTDQDEDEIQRIAHDIAVVVDQFSGE VDALAAEITAVVSHAEAVITAMADHAGKQWDRFLHSNPVGVVIDGTGQQLKGFGEEAF GMAKDSWDLGPLRASIDPFGWYRSWEEMLTGMAPLAGLGGENAPGVVESWKQFGKSLI HWDEWTTNPNEALGKTVFDAATLALPGGPLSKLGSKGRDILAGVRGLKERLEPTTPHL EPPATPPRPGPQPPRIEPPESGHPAPAPAAKPAPVPANGPLPHSPTESKPPPVDRPAE PVAPSSASAGQPRVSAATTPGTHVPHGLPQPGEHVPAQAPPATTLLGGPPVESAPATA HQPQWATTPAAPAAAPHSTPGGVHSTESGPHGRSLSAHGSEPTHDGASHGSGHGSGSE PPGLHAPHREQQLAMHSNEPAGEGWHRLSDEAVDPQYGEPLSRHWDFTDNPADRSRIN PVVAQLMEDPNAPFGRDPQGQPYTQERYQERFNSVGPWGQQYSNFPPNNGAVPGTRIA YTNLEKFLSDYGPQLDRIGGDQGKYLAIMEHGRPASWEQRALHVTSLRDPYHAYTIDW LPEGWFIEVSEVAPGCGQPGGSIQVRIFDHQNEMRKVEELIRRGVLRQ" CDS complement(4328810..4329082) /codon_start=1 /transl_table=11 /gene="esxE" /locus_tag="BQ2027_MB3934C" /standard_name="ES6_12" /product="ESAT-6-like protein EsxE" /note="Mb3934c, esxE, len: 90 aa. Equivalent to Rv3904c, len: 90 aa, from Mycobacterium tuberculosis strain H37Rv, (98.9% identity in 90 aa overlap). esxE, putative ESAT-6 like protein 12, hypothetical unknown ala-rich protein. BELONGS TO THE ESAT6 FAMILY." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5U3" /protein_id="SIU02566.1" /translation="MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAA AHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTGAVATNLGMWS" CDS complement(4329231..4329404) /codon_start=1 /transl_table=11 /gene="esxF" /locus_tag="BQ2027_MB3935C" /standard_name="ES6_13" /product="ESAT-6-like protein EsxF" /note="Mb3935c, esxF, len: 57 aa. Equivalent to Rv3905c, len: 103 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 57 aa overlap). esxF, putative ESAT-6 like protein 13, hypothetical unknown ala-, gly-rich protein, ESAT-6 like protein. BELONGS TO THE ESAT6 FAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, a single base transition (g-a) introducing a premature stop codon, leads to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (57 aa versus 103 aa). Mb3935c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="InterPro:IPR010310" /db_xref="InterPro:IPR036689" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L4" /protein_id="SIU02567.1" /translation="MGADDTLRVEPAVMQGFAASLDGAAEHLAVQLAELDAQVGQMLG GWRGASGSAYGSA" CDS complement(4329470..4329979) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3936C" /product="CONSERVED HYPOTHETICAL PROTEIN" /note="Mb3936c, -, len: 169 aa. Equivalent to Rv3906c, len: 169 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 169 aa overlap). Conserved hypothetical protein, strongly related to Q50578|AT9S (SOD related in Escherichia coli) from Mycobacterium tuberculosis strain AOYAMA B (155 aa), but apparently different as flanking sequences differ and shorter 43 aa, FASTA scores: opt: 548, E(): 1.3e-26, (79.4% identity in 102 aa overlap). Selfmarch results suggest that Rv3906c is not related to any other hypothetical protein from M. tuberculosis strain H37Rv except itself. Shows also similarity with Q9VFR2|CG9297 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (930 aa), FASTA scores: opt: 221, E(): 4.9e-06, (36.95% identity in 157 aa overlap); Q9HQ55|CBP|VNG1320G CALCIUM-BINDING PROTEIN HOMOLOGY from Halobacterium sp. strain NRC-1 (385 aa) FASTA scores: opt: 143, E(): 0.13, (35.65% identity in 160 aa overlap); Q24795 CALCIUM-BINDING PROTEIN (FRAGMENT) from Echinococcus granulosus (338 aa), FASTA scores: opt: 140, E(): 0.17, (33.95% identity in 156 aa overlap). Mb3936c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5L6" /db_xref="InterPro:IPR028974" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L6" /protein_id="SIU02568.1" /translation="MEYCIAGDDGSAGIWNRPFDVDLDGDGRLDAIGLDLDGDGLRDD ALADFDGDDVADHAVFDVDNDGTPESYFIDDGSGTWAVAVDRGGQLRWYGLDGVEHTG GPLVDFDGFGGLDDRLLDTDGDGLADRVLCAGEQRVTGYVDTDGDGRWDVRLTDTDGD GTADGASSL" CDS complement(4330004..4331446) /codon_start=1 /transl_table=11 /gene="pcnA" /locus_tag="BQ2027_MB3937C" /product="probable poly(a) polymerase pcna (polynucleotide adenylyltransferase) (ntp polymerase) (rna adenylating enzyme) (poly(a) polymerase)" /note="Mb3937c, pcnA, len: 480 aa. Equivalent to Rv3907c, len: 480 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 480 aa overlap). Probable pcnA, polynucleotide polymerase (EC 2.7.7.19), equivalent to Q9CCY1|PCNA|ML2697 PCNA PROTEIN from Mycobacterium leprae (486 aa), FASTA scores: opt: 2713, E(): 4.3e-160, (84.1% identity in 478 aa overlap); and Q59534|PCNB POLYA POLYMERASE from Mycobacterium leprae (411 aa) FASTA scores: opt: 2077, E(): 7.1e-121, (82.55% identity in 373 aa overlap). Also highly similar to many e.g. Q9X8T2|SCH24.18 PUTATIVE RNA NUCLEOTIDYLTRANSFERASE from Streptomyces coelicolor (483 aa), FASTA scores: opt: 1856, E(): 3.7e-107, (61.55% identity in 455 aa overlap); Q9ZN65 POLYA POLYMERASE from Prevotella ruminicola (Bacteroides ruminicola) (479 aa), FASTA scores: opt: 830, E(): 8.5e-44, (34.85% identity in 445 aa overlap); P42977|PAPS_BACSU POLY(A) POLYMERASE from Bacillus subtilis (397 aa), FASTA scores: opt: 479, E(): 3.5e-22, (29.35% identity in 450 aa overlap); etc. Contains: PS00017 ATP/GTP-binding site motif A (P-loop), PS00018 EF-hand calcium-binding domain, and probably less significant a PS00237 G-protein coupled receptor signature, and PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. BELONGS TO THE TRNA NUCLEOTIDYLTRANSFERASE / POLY(A) POLYMERASE FAMILY. Protein product from Mb3937c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3937c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5L0" /db_xref="InterPro:IPR002646" /db_xref="InterPro:IPR003607" /db_xref="InterPro:IPR006674" /db_xref="InterPro:IPR006675" /db_xref="InterPro:IPR014065" /db_xref="InterPro:IPR032828" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L0" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02569.1" /translation="MPEAVQEADLLTAAAVALNRHAALLRELGSVFAAAGHELYLVGG SVRDALLGRLSPDLDFTTDARPERVQEIVRPWADAVWDTGIEFGTVGVGKSDHRMEIT TFRADSYDRVSRHPEVRFGDCLEGDLVRRDFTTNAMAVRVTATGPGEFLDPLGGLAAL RAKVLDTPAAPSGSFGDDPLRMLRAARFVSQLGFAVAPRVRAAIEEMAPQLARISAER VAAELDKLLVGEDPAAGIDLMVQSGMGAVVLPEIGGMRMAIDEHHQHKDVYQHSLTVL RQAIALEDDGPDLVLRWAALLHDIGKPATRRHEPDGGVSFHHHEVVGAKMVRKRMRAL KYSKQMIDDISQLVYLHLRFHGYGDGKWTDSAVRRYVTDAGALLPRLHKLVRADCTTR NKRRAARLQASYDRLEERIAELAAQEDLDRVRPDLDGNQIMAVLDIPAGPQVGEAWRY LKELRLERGPLSTEEATTELLSWWKSRGNR" CDS 4331822..4332568 /codon_start=1 /transl_table=11 /gene="mutt4" /locus_tag="BQ2027_MB3938" /product="possible mutator protein mutt4" /note="Mb3938, -, len: 248 aa. Equivalent to Rv3908, len: 248 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 248 aa overlap). Conserved hypothetical protein, equivalent to Q50195|ML2698|L222-ORF6 HYPOTHETICAL PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 1270, E(): 3.4e-62, (79.05% identity in 248 aa overlap). Also similar to O66548|APFA|AQ_158 HYDROLASE from Aquifex aeolicus (134 aa), FASTA scores: opt: 300, E(): 1.1e-09, (37.3% identity in 142 aa overlap); and similarity with other various proteins e.g. O93721 DIADENOSINE 5'5'''-P1,P4-TETRAPHOSPHATE PYROPHOSPHOHYDROLASE from Pyrobaculum aerophilum (143 aa), FASTA scores: opt: 205, E(): 0.00017, (34.85% identity in 109 aa overlap); Q9HS29|APA|VNG0431G DIADENOSINE TETRAPHOSPHATE PYROPHOSPHOHYDROLASE from Halobacterium sp. strain NRC-1 (142 aa), FASTA scores: opt: 199, E(): 0.00036, (34.0% identity in 147 aa overlap); Q9YA58|APE2080 HYPOTHETICAL 19.2 KDA PROTEIN from Aeropyrum pernix (175 aa) FASTA scores: opt: 191, E(): 0.0012, (36.9% identity in 141 aa overlap); etc. Also similar to P95110|MUTT1|Rv2985|MTCY349.02 HYPOTHETICAL 34.7 KDA PROTEIN from Mycobacterium tuberculosis (317 aa) FASTA scores: opt: 224, E(): 3e-05, (34.05% identity in 144 aa overlap). Protein product from Mb3938 detected using SWATH mass spectrometry. Mb3938 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5K6" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR015797" /db_xref="InterPro:IPR020084" /db_xref="InterPro:IPR020476" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5K6" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02570.1" /translation="MSDGEQAKSRRRRGRRRGRRAAATAENHMDAQPAGDATPTPATA KRSRSRSPRRGSTRMRTVHETSAGGLVIDGIDGPRDAQVAALIGRVDRRGRLLWSLPK GHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKTVHHYLMRFLG GELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADELIDKLQSDGPAALPPLPPS SPRRRPQTHSRARHADDSAPGQHNGPGPGP" CDS 4332565..4334973 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3939" /product="FIG01967133: Putative secreted protein" /note="Mb3939, -, len: 802 aa. Equivalent to Rv3909, len: 802 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 802 aa overlap). Conserved hypothetical protein, equivalent to Q9CCY0|ML2699 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (797 aa) FASTA scores: opt: 3777, E(): 8.8e-206, (72.35% identity in 803 aa overlap). Note that the N-terminal end is highly similar to Q50196|L222-ORF7 (286 aa), FASTA scores: opt: 1213, E(): 2.7e-61, (71.75% identity in 255 aa overlap); and the C-terminal end is highly similar to Q50197|L222-ORF8 also from Mycobacterium leprae (512 aa) FASTA scores: opt: 2375, E(): 9.9e-127, (71.8% identity in 518 aa overlap). Shows some similarity with N-terminal end of Q9I2M3|PA1874 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (2468 aa), FASTA scores: opt: 171, E(): 0.13, (22.9% identity in 672 aa overlap). Protein product from Mb3939 detected using SWATH mass spectrometry. Mb3939 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N7" /protein_id="SIU02571.1" /translation="MTALQLRWAALARVTSAIGVVAGLAMALTVPSAAPHALAGEPSP TPFVQVRIDQVTPDVVTTSSEPHVTVSGTVTNTGDRPVRDVMVRLEHAAAVTSSTALR TSLDGGTDQYQPAADFLTVAPELDRGQEAGFTLSAPLRSLTRPSLAVNQPGIYPVLVN VNGTPDYGAPARLDNARFLLPVVGVPPDQATDFGSAVAPETTAPVWITMLWPLADRPR LAPGAPGGTVPVRLVDDDLANSLANGGRLDILLSAAEFATNREVDPDGAVGRALCLAI DPDLLITVNAMTGGYVVSDSPDGAAQLPGTPTHPGTGQAAASSWLDRLRTLVHRTCVT PLPFAQADLDALQRVNDPRLSAIATISPADIVDRILDVSSTRGATVLPDGPLTGRAIN LLSTHGSTVAVAAADFSPEEQQGSSQIGSALLPATAPRRLSPRVVAAPFDPAVGAALA AAGTNPTVPTYLDPSLFVRIAHESITARRQDALGAMLWRSLEPNAAPRTQILVPPASW SLASDDAQVILTALATAIRSGLAVPRPLPVVIADAAARTEPPEPPGAYSAARGRFNDD ITTQIGGQVARLWKLTSALTIDDRTGLTGVQYTAPLREDMLRALSQSLPPDTRNGLAQ QRLAVVGKTIDDLFGAVTIVNPGGSYTLATEHSPLPLALHNGLAVPIRVRLQVDAPPG MTVADVGQIELPPGYLPLRVPIEVNFTQRVAVDVSLRTPDGVALGEPVRLSVHSNAYG KVLFAITLSAAAVLVTLAGRRLWHRFRGQPDRADLDRPDLPTGKHAPQRRAVASRDDE KHRV" CDS 4334970..4338524 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3940" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3940, -, len: 1184 aa. Equivalent to Rv3910, len: 1184 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1184 aa overlap). Probable conserved transmembrane protein (hydrophobic domain ~50-550), equivalent to Q9CCX9|ML2700 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (1206 aa), FASTA scores: opt: 5554, E(): 0, (75.15% identity in 1182 aa overlap); and highly similar, but shorter 380 aa, to Q50199|L222-ORF10 from Mycobacterium leprae (784 aa) FASTA scores: opt: 3297, E(): 5.5e-170, (68.8% identity in 769 aa overlap); and at the N-terminal end with Q50198|L222-ORF also from Mycobacterium leprae (379 aa) FASTA scores: opt: 1955, E(): 5.7e-98, (88.4% identity in 353 aa overlap) (ORFs 9 and 10 are adjacent on L222). Also similar in part (principally at the N-terminal end) to other membrane proteins e.g. Q9X8T0|SCH24.16c PUTATIVE TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (811 aa), FASTA scores: opt: 573, E(): 2.8e-23, (31.05% identity in 573 aa overlap); O05467|MVIN_RHITR INTEGRAL MEMBRANE PROTEIN VIRULENCE FACTOR MVIN HOMOLOG from Rhizobium tropici (533 aa), FASTA scores: opt: 468, E(): 9e-18, (27.1% identity in 524 aa overlap); P56882|MVIN_RHIME INTEGRAL MEMBRANE PROTEIN VIRULENCE FACTOR MVIN HOMOLOG from Rhizobium meliloti (Sinorhizobium meliloti) (535 aa), FASTA scores: opt: 453, E(): 5.8e-17, (26.2% identity in 557 aa overlap); etc. Protein product from Mb3940 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3940 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5M2" /db_xref="InterPro:IPR004268" /db_xref="InterPro:IPR011009" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M2" /protein_id="SIU02572.1" /translation="MRPSPGEVPTASQRQPELSDAALVSHSWAMAFATLISRITGFAR IVLLAAILGAALASSFSVANQLPNLVAALVLEATFTAIFVPVLARAEQDDPDGGAAFV RRLVTLATTLLLGATTLSVLAAPLLVRLMLGTNPQVNEPLTTAFAYLLLPQVLVYGLS SVFMAILNTRNVFGPPAWAPVVNNVVAIATLAVYLAVPGELSVDPVRMGNAKLLVLGI GTTAGVFAQTAVLLVAIRREHISLRPLWGIDQRLKRFGAMAAAMVLYVLISQLGLVVG NRIASTAAASGPAIYNYTWLVLMLPFGMIGVTVLTVVMPRLSRNAAADDTPAVLADLS LATRLTMITLIPTVAFMTVGGPAIGSALFAYGNFGDVDAGYLGAAIALSAFTLIPYAL VLLQLRVFYAREQPWTPITIIVVITGVKILGSLLAPHITGDPQLVAAYLGLANGLGFL AGTIVGYYILRRALRPDGGQLIGVGEARTALVTVAASLLAGLLAHVADRLLGLSELTA HAGSVGSLLRLSVLALIMLPILAAVTLCARVPEARAALDAVRARIRSRRLKTGPQTQN VLDQSSRPGPVTYPERRRLAPPRGKSVVHEPIRRRPPEQVARAGRAKGPEVIDRPSEN ASFGAASGAELPRPVADELQLDAPAGRDPGPVSRPHPSDLQNGDLPADAARGPIAFDA LREPDRESSAPPDDVQLVPGARIANGRYRLLIFHGGVPPLQFWQALDTALDRQVALTF VDPQGVLPDDVLQETLSRTLRLSRIDKPGVARVLDVVHTRAGGLVVAEWIRGGSLQEV ADTSPSPVGAIRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIDGDVVLAYPATMPDA NPQDDIRGIGASLYALLVNRWPLPEAGVRSGLAPAERDTAGQPIEPADIDRDIPFQIS AVAARSVQGDGGIRSASTLLNLMQQATAVADRTEVLGPIDEAPVSAAPRTSAPNSETY TRRRRNLLIGIGAGAAVLMVALLVLASVLSRIFGDVSGGLNKDELGLNAPTASTSAAS SAPPGSVVKPTKVTVFSPDGGADNPGEADLAIDGNPATSWKTDIYTDPVPFPSFKNGV GLMLQLPQATVVGTVAIDVASTGTKVEIRSASTPTPATLEDTAVLTSATALRPGHNTI SVEAAAPTSNLLVWISTLGTTDGKSQADISEITIYAAS" CDS 4338559..4339149 /codon_start=1 /transl_table=11 /gene="sigMa" /locus_tag="BQ2027_MB3941" /product="POSSIBLE ALTERNATIVE RNA POLYMERASE SIGMA FACTOR SIGMa [FIRST PART]" /note="Mb3941, sigMa, len: 196 aa. Similar to Rv3911, len: 222 aa, from Mycobacterium tuberculosis strain H37Rv, (95.8% identity in 168 aa overlap). Possible sigM, alternative RNA polymerase sigma factor (see citation below), highly similar to others e.g. Q9S6U3|SCH24.14c (alias O86856|SIGT) PUTATIVE RNA POLYMERASE SIGMA FACTOR from Streptomyces coelicolor (236 aa), FASTA scores: opt: 336, E(): 2.8e-13, (41.5% identity in 212 aa overlap); Q98KG8|MLR1481 PROBABLE RNA POLYMERASE SIGMA SUBUNIT from Rhizobium loti (Mesorhizobium loti) (307 aa), FASTA scores: opt: 221, E(): 2.9e-06, (32.95% identity in 179 aa overlap); Q9A4S9|CC2751 PUTATIVE RNA POLYMERASE SIGMA FACTOR from Caulobacter crescentus (186 aa), FASTA scores: opt: 217, E(): 3.3e-06, (36.95% identity in 138 aa overlap); etc. Also similarity with other mycobacterial factors e.g. O06289|SIGE|Rv1221|MTCI61.04 PUTATIVE RNA POLYMERASE SIGMA FACTOR from Mycobacterium tuberculosis (257 aa), FASTA scores: opt: 193, E(): 0.00012, (33.15% identity in 163 aa overlap); and O05735|SIGE PUTATIVE RNA POLYMERASE SIGMA FACTOR from Mycobacterium avium (251 aa), FASTA scores: opt: 192, E(): 0.00014, (33.15% identity in 163 aa overlap). Equivalent to AAK48395|MT4030 RNA POLYMERASE SIGMA-70 FACTOR from Mycobacterium tuberculosis strain CDC1551 (196 aa) but without similarity at the C-terminal end. BELONGS TO THE SIGMA-70 FACTOR FAMILY, ECF SUBFAMILY. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis, sigM exists as a single gene. In Mycobacterium bovis, a frameshift due to a 2bp to 1bp substitution (cg-t) splits sigM into 2 parts, sigMa and sigMb. Mb3941 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5L7" /db_xref="InterPro:IPR007627" /db_xref="InterPro:IPR013249" /db_xref="InterPro:IPR013324" /db_xref="InterPro:IPR013325" /db_xref="InterPro:IPR014284" /db_xref="InterPro:IPR036388" /db_xref="InterPro:IPR039425" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L7" /protein_id="SIU02573.1" /translation="MPPPIGYCPAVGFGGRHERSDAELLAAHVAGDRYAFDQLFRRHH RQLHRLARLTSRTSEDADDALQDAMLSAHRGAGSFRYDAAVSSWLHRIVVNACLDRLR RAKAHPTAPLEDVYPVADRTAQVETAIAVQRALMRLPVEQRAAVVAVDMQGYSIADTS RMLGVAEGTVKSRCARARARLARLLGYLNTGVNIRR" CDS 4339242..4340006 /codon_start=1 /transl_table=11 /gene="rsmA" /locus_tag="BQ2027_MB3943" /product="Anti-sigma-M factor RsmA (Regulator of SigM) (Sigma-M anti-sigma factor RsmA)" /note="Mb3943, -, len: 254 aa. Equivalent to Rv3912, len: 254 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 254 aa overlap). Hypothetical unknown ala-rich protein. Mb3943 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5P5" /protein_id="SIU02574.1" /translation="MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRV RSDPQAQQILRALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAH AARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPL SRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVL LVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPRA" CDS 4340100..4341107 /codon_start=1 /transl_table=11 /gene="trxB2" /locus_tag="BQ2027_MB3944" /product="PROBABLE THIOREDOXIN REDUCTASE TRXB2 (TRXR) (TR)" /note="Mb3944, trxB2, len: 335 aa. Equivalent to Rv3913, len: 335 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 335 aa overlap). Probable trxB2, thioredoxin reductase (EC 1.6.4.5) (see citation below), equivalent to O30973|TRXB_MYCSM THIOREDOXIN REDUCTASE from Mycobacterium smegmatis (311 aa), FASTA scores: opt: 1575, E(): 1.8e-87, (78.35% identity in 305 aa overlap); and highly similar, but shorter at C-terminus, to P46843|TRXB_MYCLE|TRXB/A|TRX|ML2703 BIFUNCTIONAL THIOREDOXIN REDUCTASE/THIOREDOXIN from Mycobacterium leprae (458 aa), FASTA scores: opt: 1766, E(): 8.7e-99, (83.25% identity in 328 aa overlap). Also highly similar to many e.g. P52215|TRXB_STRCO|SCH24.12 from Streptomyces coelicolor (321 aa), FASTA scores: opt: 1249, E(): 7.2e-68, (60.4% identity in 313 aa overlap); Q9Z8M4|TRXB_CHLPN from Chlamydia pneumoniae (Chlamydophila pneumoniae) (311 aa), FASTA scores: opt: 978, E(): 1.3e-51, (49.85% identity in 307 aa overlap); P09625|TRXB_ECOLI|B0888 from Escherichia coli strain K12 (320 aa), FASTA scores: opt: 948, E(): 8.6e-50, (49.2% identity in 309 aa overlap); etc. Contains PS00573 Pyridine nucleotide-disulphide oxidoreductases class-II active site. BELONGS TO THE PYRIDINE NUCLEOTIDE-DISULFIDE OXIDOREDUCTASES CLASS-II. COFACTOR: FAD (BY SIMILARITY). Protein product from Mb3944 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3944 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y6Y8" /db_xref="InterPro:IPR005982" /db_xref="InterPro:IPR008255" /db_xref="InterPro:IPR023753" /db_xref="InterPro:IPR036188" /db_xref="UniProtKB/TrEMBL:A0A1R3Y6Y8" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02575.1" /translation="MTAPPVHDRAHHPVRDVIVIGSGPAGYTAALYAARAQLAPLVFE GTSFGGALMTTTGVENYPGFRNGITGPELMDEMREQALRFGADLRMEDVESVSLHGPL KSVVTADGQTHRARAVILAMGAAARYLQVPGEQELLGRGVSSCATCDGFFFRDQDIAV IGGGDSAMEEATFLTRFARSVTLVHRRDEFRASKIMLDRARNNDKIRFLTNHTVVAVD GDTTVTGLRVRDTNTGAETTLPVTGVFVAIGHEPRSGLVREAIDVDPDGYVLVQGRTT STSLPGVFAAGDLVDRTYRQAVTAAGSGCAAAIDAERWLAEHAATGEADSTDALIGAQ R" CDS 4341104..4341454 /codon_start=1 /transl_table=11 /gene="trxC" /locus_tag="BQ2027_MB3945" /standard_name="trx; trxA" /product="THIOREDOXIN TRXC (TRX) (MPT46)" /note="Mb3945, trxC, len: 116 aa. Equivalent to Rv3914, len: 116 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 116 aa overlap). trxC (alternate gene names: trx, trxA *), thioredoxin (EC 1.-.-.-) (see citations below), equivalent to O30974|THIO_MYCSM|TRXA THIOREDOXIN from Mycobacterium smegmatis (112 aa), FASTA scores: opt: 576, E(): 2.1e-32, (80.2% identity in 111 aa overlap); and also equivalent to C-terminal end of P46843|TRXB_MYCLE|TRXB/A|TRX|ML2703 BIFUNCTIONAL THIOREDOXIN REDUCTASE/THIOREDOXIN from Mycobacterium leprae (458 aa), FASTA scores: opt: 628, E(): E(): 2e-35, (82.9% identity in 117 aa overlap). Also highly similar to many e.g. P80579|THIO_ALIAC from Alicyclobacillus acidocaldarius (Bacillus acidocaldarius) (105 aa), FASTA scores: opt: 411, E(): 3e-21, (57.15% identity in 105 aa overlap); P00275|THI1_CORNE from Corynebacterium nephridii (105 aa), FASTA scores: opt: 394, E(): 4.3e-20, (56.7% identity in 97 aa overlap); P00274|THIO_ECOLI|TRXA|TSNC|FIPA|B3781 from Escherichia coli and Salmonella typhimurium strain K12 and LT2 respectively (108 aa), FASTA scores: opt: 364, E(): 4.7e-18, (54.45% identity in 101 aa overlap); etc. Also similar to O53162|TRXB|Rv1471|MTV007.18 THIOREDOXIN from Mycobacterium tuberculosis (123 aa), FASTA scores: E(): 2.3e-15, (41.9% identity in 93 aa overlap). Contains PS00194 Thioredoxin family active site. BELONGS TO THE THIOREDOXIN FAMILY. The product of this CDS is supposed secreted. In this cas, this protein could exert its free radical scavenging activity inside macrophages. (*) Warning: note that Rv1470|MTV007.17 correspond also to trxA. Protein product from Mb3945 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3945 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A617" /db_xref="InterPro:IPR005746" /db_xref="InterPro:IPR013766" /db_xref="InterPro:IPR017937" /db_xref="InterPro:IPR036249" /db_xref="UniProtKB/Swiss-Prot:P0A617" /experiment="experimental evidence, no additional details recorded" /protein_id="SIU02576.1" /translation="MTDSEKSATIKVTDASFATDVLSSNKPVLVDFWATWCGPCKMVA PVLEEIATERATDLTVAKLDVDTNPETARNFQVVSIPTLILFKDGQPVKRIVGAKGKA ALLRELSDVVPNLN" CDS 4341564..4342784 /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3946" /product="probable peptidoglycan hydrolase" /note="Mb3946, -, len: 406 aa. Equivalent to Rv3915, len: 406 aa, from Myobacterium tuberculosis strain H37Rv, (100.0% identity in 406 aa overlap). Probable hydrolase (EC 3.-.-.-), equivalent to Q9CCX8|ML2704 PUTATIVE HYDROLASE from Mycobacterium leprae (406 aa) FASTA scores: opt: 2341, E(): 2.7e-138, (86.95% identity in 406 aa overlap); the N-terminal end is highly similar to Q59535 N-ACETYMURAMYL-L-ALANINE AMIDASE (EC 3.5.1.28) from Mycobacterium leprae (205 aa), FASTA scores: opt: 1046, E(): 5.7e-58, (84.85% identity in 185 aa overlap). Also similar to other hydrolases (especially amidases (EC 3.5.-.-)) e.g. C-terminal end of Q9K6R3|LYTC|BH3665 N-ACETYLMURAMOYL-L-ALANINE AMIDASE (MAJOR AUTOLYSIN) from Bacillus halodurans (588 aa), FASTA scores: opt: 363, E(): 4.3e-15, (33.15% identity in 356 aa overlap); Q9PKC7|TC0539 PUTATIVE N-ACETYLMURAMOYL-L-ALANINE AMIDASE from Chlamydia muridarum (268 aa), FASTA scores: opt: 285, E(): 1.6e-10, (26.05% identity in 242 aa overlap) (RV3915 product appears longer 127 aa); Q9S596|PDCA PENICILLIN-RESISTANT DD-CARBOXYPEPTIDASE (EC 3.4.-.-) from Myxococcus xanthus (302 aa), FASTA scores: opt: 270, E(): 1.5e-09, (39.85% identity in 158 aa overlap); etc. Note that previously known as cwlM. Protein product from Mb3946 detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3946 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y5M1" /db_xref="InterPro:IPR002477" /db_xref="InterPro:IPR002508" /db_xref="InterPro:IPR036365" /db_xref="InterPro:IPR036366" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M1" /protein_id="SIU02577.1" /translation="MPSPRREDGDALRCGDRSAAVTEIRAALTALGMLDHQEEDLTTG RNVALELFDAQLDQAVRAFQQHRGLLVDGIVGEATYRALKEASYRLGARTLYHQFGAP LYGDDVATLQARLQDLGFYTGLVDGHFGLQTHNALMSYQREYGLAADGICGPETLRSL YFLSSRVSGGSPHAIREEELVRSSGPKLSGKRIIIDPGRGGVDHGLIAQGPAGPISEA DLLWDLASRLEGRMAAIGMETHLSRPTNRSPSDAERAATANAVGADLMISLRCETQTS LAANGVASFHFGNSHGSVSTIGRNLADFIQREVVARTGLRDCRVHGRTWDLLRLTRMP TVQVDIGYITNPHDRGMLVSTQTRDAIAEGILAAVKRLYLLGKNDRPTGTFTFAELLA HELSVERAGRLGGS" CDS complement(4342805..4343539) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3947C" /product="FIG007808: N-acetyltransferase" /note="Mb3947c, -, len: 244 aa. Equivalent to Rv3916c, len: 244 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 244 aa overlap). Conserved hypothetical protein, equivalent to Q50200|ML2705|L222-ORF1 HYPOTHETICAL PROTEIN from Mycobacterium leprae (259 aa), FASTA scores: opt: 1266, E(): 2e-74, (76.4% identity in 250 aa overlap). Also highly similar (but with gaps) to Q9R3S2|STH24.10 HYPOTHETICAL 22.6 KDA PROTEIN from Streptomyces coelicolor (205 aa), FASTA scores: opt: 387, E(): 7.5e-18, (40.25% identity in 231 aa overlap). Protein product from Mb3947c detected using SWATH mass spectrometry. Mb3947c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="UniProtKB/TrEMBL:A0A1R3Y5M7" /protein_id="SIU02578.1" /translation="MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEF EKEAWLSMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVS ADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAV TPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVE AALERLLENARLQEPIAAGSTAGNTS" CDS complement(4343829..4344863) /codon_start=1 /transl_table=11 /gene="parB" /locus_tag="BQ2027_MB3948C" /product="PROBABLE CHROMOSOME PARTITIONING PROTEIN PARB" /note="Mb3948c, parB, len: 344 aa. Equivalent to Rv3917c, len: 344 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 344 aa overlap). Probable parB, chromosome partitioning protein, equivalent to Q50201|PARB_MYCLE|ML2706 PROBABLE CHROMOSOME PARTITIONING PROTEIN from Mycobacterium leprae (333 aa), FASTA scores: opt: 1654, E(): 1.6e-88, (78.6% identity in 332 aa overlap). Also highly similar to to others e.g. Q9S6U1|STH24.09 PUTATIVE PARTITIONING OR SPORULATION PROTEIN from Streptomyces coelicolor (328 aa), FASTA scores: opt: 966, E(): 9.7e-49, (58.55% identity in 287 aa overlap) (no similarity on N-terminus); Q9PB63|PARB_XYLFA|XF2281 PROBABLE CHROMOSOME PARTITIONING PROTEIN from Xylella fastidiosa (310 aa), FASTA scores: opt: 598, E(): 1.8e-27, (38.65% identity in 326 aa overlap); P31857|PARB_PSEPU PROBABLE CHROMOSOME PARTITIONING PROTEIN from Pseudomonas putida (290 aa), FASTA scores: opt: 573, E(): 4.6e-26, (40.35% identity in 322 aa overlap); etc. Contains probable helix-turn-helix motif at aa 179 to 200 (Score 1150, +3.1 0 SD). BELONGS TO THE PARB FAMILY. Note that previously known as parA. Protein product from Mb3948c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3948c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:P0A5R3" /db_xref="InterPro:IPR003115" /db_xref="InterPro:IPR004437" /db_xref="InterPro:IPR036086" /db_xref="InterPro:IPR041468" /db_xref="UniProtKB/Swiss-Prot:P0A5R3" /protein_id="SIU02579.1" /translation="MTQPSRRKGGLGRGLAALIPTGPADGESGPPTLGPRMGSATADV VIGGPVPDTSVMGAIYREIPPSAIEANPRQPRQVFDEEALAELVHSIREFGLLQPIVV RSLAGSQTGVRYQIVMGERRWRAAQEAGLATIPAIVRETGDDNLLRDALLENIHRVQL NPLEEAAAYQQLLDEFGVTHDELAARIGRSRPLITNMIRLLKLPIPVQRRVAAGVLSA GHARALLSLEAGPEAQEELASRIVAEGLSVRATEETVTLANHEANRQAHHSDATTPAP PRRKPIQMPGLQDVAERLSTTFDTRVTVSLGKRKGKIVVEFGSVDDLARIVGLMTTDG RDKGLHRDAL" CDS complement(4344860..4345903) /codon_start=1 /transl_table=11 /gene="parA" /locus_tag="BQ2027_MB3949C" /product="PROBABLE CHROMOSOME PARTITIONING PROTEIN PARA" /note="Mb3949c, parA, len: 347 aa. Equivalent to Rv3918c, len: 347 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 347 aa overlap). Probable parA, chromosome partitioning protein, highly similar to Q9CCX7|PARA|ML2707 PUTATIVE CELL DIVISION PROTEIN from Mycobacterium leprae (351 aa), FASTA scores: opt: 1679, E(): 2.9e-93, (78.1% identity in 347 aa overlap). Also highly similar to others e.g. Q9RFM1|PARA PARA PROTEIN from Streptomyces coelicolor (357 aa), FASTA scores: opt: 1197, E(): 2e-64, (60.45% identity in 306 aa overlap); Q98DZ3|MLL4479|PARA CHROMOSOME PARTITIONING PROTEIN from Rhizobium loti (Mesorhizobium loti) (266 aa), FASTA scores: opt: 835, E(): 7.2e-43, (50.95% identity in 257 aa overlap); O05189|PARA_CAUCR CHROMOSOME PARTITIONING PROTEIN from Caulobacter crescentus (267 aa), FASTA scores: opt: 813, E(): 1.5e-41, (51.35% identity in 261 aa overlap) (has its N-terminus shorter); etc. Equivalent to AAK48403 from Mycobacterium tuberculosis strain CDC1551 (381 aa) but shorter 34 aa. Also similar to other M. tuberculosis proteins: MTCI125.30, FASTA scores: E(): 4.3e-32, (35.2% identity in 327 aa overlap); and MTCY07D11.13, FASTA scores: E(): 3e-30, (39.9% identity in 263 aa overlap). BELONGS TO THE PARA FAMILY. Possible alternative start site at aa 107. Note that previously known as parB. Protein product from Mb3949c detected using SWATH mass spectrometry. Mb3949c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR025669" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5L9" /protein_id="SIU02580.1" /translation="MSAPWGPVAAGPSALVRSGQASTIEPFQREMTPPTPTPEAAHNP TMNVSRETSTEFDTPIGAAAERAMRVLHTTHEPLQRPGRRRVLTIANQKGGVGKTTTA VNIAAALAVQGLKTLVIDLDPQGNASTALGITDRQSGTPSSYEMLIGEVSLHTALRRS PHSERLFCIPATIDLAGAEIELVSMVARENRLRTALAALDNFDFDYVFVDCPPSLGLL TINALVAAPEVMIPIQCEYYALEGVSQLMRNIEMVKAHLNPQLEVTTVILTMYDGRTK LADQVADEVRQYFGSKVLRTVIPRSVKVSEAPGYSMTIIDYDPGSRGAMSYLDASREL AERDRPPSAKGRP" CDS complement(4345900..4346574) /codon_start=1 /transl_table=11 /gene="gid" /locus_tag="BQ2027_MB3950C" /product="PROBABLE GLUCOSE-INHIBITED DIVISION PROTEIN B GID" /note="Mb3950c, gid, len: 224 aa. Equivalent to Rv3919c, len: 224 aa, from Mycobacterium tuberculosis strain H37Rv, (99.6% identity in 224 aa overlap). Probable gid (alternate gene name: gidB), glucose-inhibited division protein B, equivalent, but shorter 20 aa, to Q9L7M3 PUTATIVE GIDB (FRAGMENT) from Mycobacterium paratuberculosis (245 aa), FASTA scores: opt: 1018, E(): 4.8e-57, (73.95% identity in 211 aa overlap); and Q50203|GIDB_MYCLE|ML2708 GLUCOSE INHIBITED DIVISION PROTEIN B from Mycobacterium leprae (245 aa), FASTA scores: opt: 966, E(): 9.1e-54, (68.4% identity in 212 aa overlap). Also highly similar to many e.g. O54571|GIDB_STRCO|STH24.07 from Streptomyces coelicolor (239 aa), FASTA scores: opt: 654, E(): 3.9e-34, (47.95% identity in 221 aa overlap); Q9KNG5|VC2774 from Vibrio cholerae (210 aa), FASTA scores: opt: 300, E(): 6.9e-12, (38.15% identity in 139 aa overlap); P17113|GIDB_ECOLI|B3740|Z5240|ECS4682 from Escherichia coli (several strains) (207 aa), FASTA scores: opt: 287, E(): 4.5e-11, (34.8% identity in 138 aa overlap); etc. Contains PS00539 Pyrokinins signature. BELONGS TO THE GIDB FAMILY. Protein product from Mb3950c detected using SWATH mass spectrometry. Mb3950c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P59964" /db_xref="InterPro:IPR003682" /db_xref="InterPro:IPR029063" /db_xref="UniProtKB/Swiss-Prot:P59964" /protein_id="SIU02581.1" /translation="MSPIEPAASAIFGPRLGLARRYAEALAGPGVERGLVGPREVGRL WDRHLLNCAVIGELLERGDRVVDIGSGAGLPGVPLAIARPDLQVVLLEPLLRRTEFLR EMVTDLGVAVEIVRGRAEESWVQDQLGGSDAAVSRAVAALDKLTKWSMPLIRPNGRML AIKGERAHDEVREHRRVMIASGAVDVRVVTCGANYLRPPATVVFARRGKQIARGSARM ASGGTA" CDS complement(4346706..4347269) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3951C" /product="RNA-binding protein Jag" /note="Mb3951c, -, len: 187 aa. Equivalent to Rv3920c, len: 187 aa, from Mycobacterium tuberculosis strain H37Rv, (99.5% identity in 187 aa overlap). Hypothetical protein, similar to JAG protein, equivalent to Q9L7M2 HYPOTHETICAL 20.1 KDA PROTEIN from Mycobacterium paratuberculosis (183 aa), FASTA scores: opt: 1004, E(): 7.3e-52, (85.05% identity in 187 aa overlap); and Q50204|ML2709 HYPOTHETICAL PROTEIN SIMILAR TO JAG PROTEIN SPOIIIJ ASSOCIATED PROTEIN IN BACILLUS SUBTILIS from Mycobacterium leprae (193 aa), FASTA scores: opt: 871, E(): 4.4e-44, (73.05% identity in 193 aa overlap). Also similar to other bacterial proteins e.g. O54595|STH24.06|JAG JAG-LIKE PROTEIN from Streptomyces coelicolor (170 aa), FASTA scores: opt: 593, E(): 6.7e-28, (62.85% identity in 167 aa overlap); Q9RCA6|JAG|BH4063 JAG PROTEIN HOMOLOG from Bacillus halodurans (207 aa), FASTA scores: opt: 282, E(): 1.1e-09, (35.0% identity in 140 aa overlap); Q9X1H1|TM1460 PUTATIVE JAG PROTEIN, PUTATIVE from Thermotoga maritima (221 aa), FASTA scores: opt: 258, E(): 3e-08, (31.9% identity in 138 aa overlap);Q01620|JAG_BACSU JAG PROTEIN (SPOIIIJ ASSOCIATED PROTEIN) from Bacillus subtilis (208 aa), FASTA scores: opt: 196, E(): 0.00012, (28.05% identity in 139 aa overlap); etc. Protein product from Mb3951c detected using shotgun mass spectrometry. Mb3951c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y5N8" /db_xref="InterPro:IPR001374" /db_xref="InterPro:IPR015946" /db_xref="InterPro:IPR034079" /db_xref="InterPro:IPR036867" /db_xref="InterPro:IPR038008" /db_xref="InterPro:IPR039247" /db_xref="UniProtKB/TrEMBL:A0A1R3Y5N8" /protein_id="SIU02582.1" /translation="MADADTTDFDVDAEAPGGGVREDTATDADEADDQEERLVAEGEI AGDYLEELLDVLDFDGDIDLDVEGNRAVVSIDGSDDLNKLVGRGGEVLDALQELTRLA VHQKTGVWSRLMLDIARWRRRRREELAALADEVARRVAETGDREELVPMTPFERKIVH DAVAAVPGVHSESEGVEPERRVVVLRD" CDS complement(4347341..4348441) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3952C" /product="PROBABLE CONSERVED TRANSMEMBRANE PROTEIN" /note="Mb3952c, -, len: 366 aa. Equivalent to Rv3921c, len: 366 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 366 aa overlap). Probable conserved transmembrane protein, equivalent to Q9L7M1 HYPOTHETICAL 39.2 KDA PROTEIN from Mycobacterium paratuberculosis (353 aa), FASTA scores: opt: 2001, E(): 8.4e-100, (83.05% identity in 366 aa overlap); Q9CCX6|ML2710 PUTATIVE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (380 aa), FASTA scores: opt: 1929, E(): 6.2e-96, (77.1% identity in 380 aa overlap); Q50205 CDS 27 on L222 from Mycobacterium leprae (312 aa) FASTA scores: opt: 1770, E(): 1.6e-87, (88.2% identity in 288 aa overlap). Also similar to other e.g. O54569|STH24.05 INNER MEMBRANE PROTEIN. from Streptomyces coelicolor (431 aa), FASTA scores: opt: 412, E(): 6.5e-15, (33.45% identity in 266 aa overlap); O84253|CT251 60 KDA INNER MEMBRANE PROTEIN from Chlamydia trachomatis (787 aa), FASTA scores: opt: 304, E(): 6e-09, (27.9% identity in 269 aa overlap); P29431|60IM_BUCAP 60 KDA INNER-MEMBRANE PROTEIN HOMOLOG from Buchnera aphidicola (subsp. Schizaphis graminum) (536 aa), FASTA scores: opt: 282, E(): 6.7e-08, (36.1% identity in 108 aa overlap); etc. Protein product from Mb3952c detected using shotgun mass spectrometry and SWATH mass spectrometry. Mb3952c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P65627" /db_xref="InterPro:IPR001708" /db_xref="InterPro:IPR028055" /db_xref="UniProtKB/Swiss-Prot:P65627" /protein_id="SIU02583.1" /translation="MSLLFDFFSLDFIYYPVSWIMWVWYRLFAFVLGPSNFFAWALSV MFLVFTLRALLYKPFVRQIRTTRQMQELQPQIKALQKKYGKDRQRMALEMQKLQREHG FNPILGCLPMLAQIPVFLGLYHVLRSFNRTTGGFGQPHLSVIENRLTGNYVFSPVDVG HFLDANLFGAPIGAYMTQRSGLDAFVDFSRPALIAVGVPVMILAGIATYFNSRASIAR QSAEAAANPQTAMMNKLALYVFPLGVVVGGPFLPLAIILYWFSNNIWTFGQQHYVFGM IEKEEEAKKQEAVRRRAANAPAPGAKPKRSPKTAPATNAAAPTEAGDTDDGAESDAST ERPADTSNPARRNSGPSARTPRPGVRPKKRKR" CDS complement(4348425..4348787) /codon_start=1 /transl_table=11 /locus_tag="BQ2027_MB3953C" /product="POSSIBLE HEMOLYSIN" /note="Mb3953c, -, len: 120 aa. Equivalent to Rv3922c, len: 120 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 120 aa overlap). Possible hemolysin, highly similar to Q9L7M0|YIDD_MYCPA HYPOTHETICAL 12.4 KDA PROTEIN from Mycobacterium paratuberculosis (115 aa), FASTA scores: opt: 521, E(): 1.9e-29, (65.2% identity in 112 aa overlap). Also highly similar to Q44066|HLYA_AERHY PUTATIVE ALPHA-HEMOLYSIN from Aeromonas hydrophila (85 aa), FASTA scores: opt: 276, E(): 1.5e-12, (51.45% identity in 70 aa overlap); and to many bacterial hypothetical proteins from bacterium e.g. P22847|YIDD_ECOLI|B3704.1 HYPOTHETICAL PROTEIN from Escherichia coli strain K12 (85 aa), FASTA scores: opt: 276, E(): 1.5e-12, (51.45% identity in 70 aa overlap). Protein product from Mb3953c detected using SWATH mass spectrometry. Mb3953c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P67301" /db_xref="InterPro:IPR002696" /db_xref="UniProtKB/Swiss-Prot:P67301" /protein_id="SIU02584.1" /translation="MSLSRQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRF VPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDG ANDDWNPASKRGERESFV" CDS complement(4348784..4349161) /codon_start=1 /transl_table=11 /gene="rnpA" /locus_tag="BQ2027_MB3954C" /product="RIBONUCLEASE P PROTEIN COMPONENT RNPA (RNaseP PROTEIN) (RNase P PROTEIN) (PROTEIN C5)" /note="Mb3954c, rnpA, len: 125 aa. Equivalent to Rv3923c, len: 125 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 125 aa overlap). rnpA, ribonuclease P protein component (EC 3.1.26.5) (see citations below), equivalent, but longer ~10 aa, to P46610|RNPA_MYCLE|ML2712 RIBONUCLEASE P PROTEIN COMPONENT from Mycobacterium leprae (120 aa), FASTA scores: opt: 456, E(): 3.3e-24, (63.0% identity in 119 aa overlap); and Q9L7L9|RNPA from Mycobacterium paratuberculosis (119 aa), FASTA scores: opt: 426, E(): 3.5e-22, (60.65% identity in 122 aa overlap). Also similar to many e.g. P25817|RNPA_STRBI from Streptomyces bikiniensis (123 aa), FASTA scores: opt: 174, E(): 4.2e-05, (36.8% identity in 125 aa overlap); P25814|RNPA_BACSU from Bacillus subtilis (116 aa) FASTA scores: opt: 168, E(): 0.0001, (26.85% identity in 108 aa overlap); P48206|RNPA_STRCO|STH24.03 from Streptomyces coelicolor (123 aa), FASTA scores: opt: 166, E(): 0.00015, (37.6% identity in 125 aa overlap); etc. Contains PS00648 Bacterial Ribonuclease P protein component signature. BELONGS TO THE RNPA FAMILY. Protein product from Mb3954c detected using SWATH mass spectrometry. Mb3954c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5X9" /db_xref="InterPro:IPR000100" /db_xref="InterPro:IPR014721" /db_xref="InterPro:IPR020539" /db_xref="InterPro:IPR020568" /db_xref="UniProtKB/Swiss-Prot:P0A5X9" /protein_id="SIU02585.1" /translation="MIATPGLFAVLRARNRMRRSADFETTVKHGMRTVRSDMVVYWWR GSGGGPRVGLIIAKSVGSAVERHRVARRLRHVAGSIVKELHPSDHVVIRALPSSRHVS SARLEQQLRCGLRRAVELAGSDR" CDS complement(4349158..4349301) /codon_start=1 /transl_table=11 /gene="rpmH" /locus_tag="BQ2027_MB3955C" /product="50S RIBOSOMAL PROTEIN L34 RPMH" /note="Mb3955c, rpmH, len: 47 aa. Equivalent to Rv3924c, len: 47 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 47 aa overlap). rpmH, 50s ribosomal protein l34 (see citations below), equivalent to many mycobacterial 50S RIBOSOMAL PROTEIN L34 e.g. P46386|RL34_MYCLE|RPMH|ML2713 from Mycobacterium leprae (47 aa), FASTA scores: opt: 287, E(): 8.5e-17, (91.5% identity in 47 aa overlap); and Q9L7L8|RL34_MYCPA|RPMH from M. paratuberculosis (47 aa), FASTA scores: opt: 281, E(): 2.6e-16, (89.35% identity in 47 aa overlap). Also highly similar to other ribosomal proteins e.g. P27901|RL34_STRCO|RPMH|STH24.02 from Streptomyces coelicolor (45 aa), FASTA scores: opt: 234, E(): 1.4e-12, (79.05% identity in 43 aa overlap); and P05647|RL34_BACSU|RPMH from Bacillus subtilis (44 aa) FASTA scores: opt: 229, E(): 3.7e-12, (72.35% identity in 47 aa overlap); etc. Contains PS00784 Ribosomal protein L34 signature. BELONGS TO THE L34P FAMILY OF RIBOSOMAL PROTEINS. Protein product from Mb3955c detected using SWATH mass spectrometry. Mb3955c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:P0A5W5" /db_xref="InterPro:IPR000271" /db_xref="InterPro:IPR020939" /db_xref="UniProtKB/Swiss-Prot:P0A5W5" /protein_id="SIU02586.1" /translation="MTKGKRTFQPNNRRRARVHGFRLRMRTRAGRSIVSSRRRKGRRT LSA" BASE COUNT 747136 a 1430392 c 1424563 g 747813 t ORIGIN 1 ttgaccgatg accccggttc aggcttcacc acagtgtgga acgcggtcgt ctccgaactt 61 aacggcgacc ctaaggttga cgacggaccc agcagtgatg ctaatctcag cgctccgctg 121 acccctcagc aaagggcttg gctcaatctc gtccagccat tgaccatcgt cgaggggttt 181 gctctgttat ccgtgccgag cagctttgtc caaaacgaaa tcgagcgcca tctgcgggcc 241 ccgattaccg acgctctcag ccgccgactc ggacatcaga tccaactcgg ggtccgcatc 301 gctccgccgg cgaccgacga agccgacgac actaccgtgc cgccttccga aaatcctgct 361 accacatcgc cagacaccac aaccgacaac gacgagattg atgacagcgc tgcggcacgg 421 ggcgataacc agcacagttg gccaagttac ttcaccgagc gcccgcgcaa taccgattcc 481 gctaccgctg gcgtaaccag ccttaaccgt cgctacacct ttgatacgtt cgttatcggc 541 gcctccaacc ggttcgcgca cgccgccgcc ttggcgatcg cagaagcacc cgcccgcgct 601 tacaaccccc tgttcatctg gggcgagtcc ggtctcggca agacacacct gctacacgcg 661 gcaggcaact atgcccaacg gttgttcccg ggaatgcggg tcaaatatgt ctccaccgag 721 gaattcacca acgacttcat taactcgctc cgcgatgacc gcaaggtcgc attcaaacgc 781 agctaccgcg acgtagacgt gctgttggtc gacgacatcc aattcattga aggcaaagag 841 ggtattcaag aggagttctt ccacaccttc aacaccttgc acaatgccaa caagcaaatc 901 gtcatctcat ctgaccgccc acccaagcag ctcgccaccc tcgaggaccg gctgagaacc 961 cgctttgagt gggggctgat cactgacgta caaccacccg agctggagac ccgcatcgcc 1021 atcttgcgca agaaagcaca gatggaacgg ctcgcgatcc ccgacgatgt cctcgaactc 1081 atcgccagca gtatcgaacg caatatccgt gaactcgagg gcgcgctgat ccgggtcacc 1141 gcgttcgcct cattgaacaa aacaccaatc gacaaagcgc tggccgagat tgtgcttcgc 1201 gatctgatcg ccgacgccaa caccatgcaa atcagcgcgg cgacgatcat ggctgccacc 1261 gccgaatact tcgacactac cgtcgaagag cttcgcgggc ccggcaagac ccgagcactg 1321 gcccagtcac gacagattgc gatgtacctg tgtcgtgagc tcaccgatct ttcgttgccc 1381 aaaatcggcc aagcgttcgg ccgtgatcac acaaccgtca tgtacgccca acgcaagatc 1441 ctgtccgaga tggccgagcg ccgtgaggtc tttgatcacg tcaaagaact caccactcgc 1501 atccgtcagc gctccaagcg ctagcacggc gtgttcttcc gacaacgttc ttaaaaaaac 1561 ttctctctcc caggtcacac cagtcacaga gattggctgt gagtgtcgct gtgcacaaac 1621 cgcgcacaga ctcatacagt cccggcggtt ccgttcacaa cccacgcctc atccccaccg 1681 acccaacaca caccccacag tcatcgccac cgtcatccac aactccgacc gacgtcgacc 1741 tgcaccaaga ccagactgtc cccaaactgc acaccctcta atactgttac cgagatttct 1801 tcgtcgtttg ttcttggaaa gacagcgctg gggatcgttc gctggatacc acccgcataa 1861 ctggctcgtc gcggtgggtc agaggtcaat gatgaacttt caagttgacg tgagaagctc 1921 tacggttgtt gttcgactgc tgttgcggcc gtcgtggcgg gtcacgcgtc atgggcgttc 1981 gtcgttggca gtccccacgc tagcggggcg ctagccacgg gatcgaactc atcgtgaggt 2041 gaaagggcgc aatggacgcg gctacgacaa gagttggcct caccgacttg acgtttcgtt 2101 tgctacgaga gtctttcgcc gatgcggtgt cgtgggtggc taaaaatctg ccagccaggc 2161 ccgcggtgcc ggtgctctcc ggcgtgttgt tgaccggctc ggacaacggt ctgacgattt 2221 ccggattcga ctacgaggtt tccgccgagg cccaggttgg cgctgaaatt gtttctcctg 2281 gaagcgtttt agtttctggc cgattgttgt ccgatattac ccgggcgttg cctaacaagc 2341 ccgtaggcgt tcatgtcgaa ggtaaccggg tcgcattgac ctgcggtaac gccaggtttt 2401 cgctaccgac gatgccagtc gaggattatc cgacgctgcc gacgctgccg gaagagaccg 2461 gattgttgcc tgcggaatta ttcgccgagg caatcagtca ggtcgctatc gccgccggcc 2521 gggacgacac gctgcctatg ttgaccggca tccgggtcga aatcctcggt gagacggtgg 2581 ttttggccgc taccgacagg tttcgcctgg ctgttcgaga actgaagtgg tcggcgtcgt 2641 cgccagatat cgaagcggct gtgctggtcc cggccaagac gctggccgag gccgccaaag 2701 cgggcatcgg cggctctgac gttcgtttgt cgttgggtac tgggccgggg gtgggcaagg 2761 atggcctgct cggtatcagt gggaacggca agcgcagcac cacgcgactt cttgatgccg 2821 agttcccgaa gtttcggcag ttgctaccaa ccgaacacac cgcggtggcc accatggacg 2881 tggccgagtt gatcgaagcg atcaagctgg ttgcgttggt agctgatcgg ggcgcgcagg 2941 tgcgcatgga gttcgctgat ggcagcgtgc ggctttctgc gggtgccgat gatgttggac 3001 gagccgagga agatcttgtt gttgactatg ccggtgaacc attgacgatt gcgtttaacc 3061 caacctatct aacggacggt ttgagttcgt tgcgctcgga gcgagtgtct ttcgggttta 3121 cgactgcggg taagcctgcc ttgctacgtc cggtgtccgg ggacgatcgc cctgtggcgg 3181 gtctgaatgg caacggtccg ttcccggcgg tgtcgacgga ctatgtctat ctgttgatgc 3241 cggttcggtt gccgggctga gcacttggcg cccgggtagg tgtacgtccg tcatttgggg 3301 ctgcgtgact tccggtcctg ggcatgtgta gatctggaat tgcatccagg gcggacggtt 3361 tttgttgggc ctaacggtta tggtaagacg aatcttattg aggcactgtg gtattcgacg 3421 acgttaggtt cgcaccgcgt tagcgccgat ttgccgttga tccgggtagg taccgatcgt 3481 gcggtgatct ccacgatcgt ggtgaacgac ggtagagaat gtgccgtcga cctcgagatc 3541 gccacggggc gagtcaacaa agcgcgattg aatcgatcat cggtccgaag tacacgtgat 3601 gtggtcggag tgcttcgagc tgtgttgttt gcccctgagg atctggggtt ggttcgtggg 3661 gatcccgctg accggcggcg ctatctggat gatctggcga tcgtgcgtag gcctgcgatc 3721 gctgcggtac gagccgaata tgagagggtg gtgcgccagc ggacggcgtt attgaagtcc 3781 gtacctggag cacggtatcg gggtgaccgg ggtgtgtttg acactcttga ggtatgggac 3841 agtcgtttgg cggagcacgg ggctgaactg gtggccgccc gcatcgattt ggtcaaccag 3901 ttggcaccgg aagtgaagaa ggcataccag ctgttggcgc cggaatcgcg atcggcgtct 3961 atcggttatc gggccagcat ggatgtaacc ggtcccagcg agcagtcaga taccgatcgg 4021 caattgttag cagctcggct gttggcggcg ctggcggccc gtcgggatgc cgaactcgag 4081 cgtggggttt gtctagttgg tccgcaccgt gacgacctaa tactgcgact aggcgatcaa 4141 cccgcgaaag gatttgctag ccatggggag gcgtggtcgt tggcggtggc actgcggttg 4201 gcggcctatc aactgttacg cgttgatggt ggtgagccgg tgttgttgct cgacgacgtg 4261 ttcgccgaac tggatgtcat gcgccgtcga gcgttggcga cggcggccga gtccgccgaa 4321 caggtgttgg tgactgccgc ggtgctcgag gatattcccg ccggctggga cgccaggcgg 4381 gtgcacatcg atgtgcgtgc cgatgacacc ggatcgatgt cggtggttct gccatgacgg 4441 gttctgttga ccggcccgac cagaatcgcg gtgagcgatt aatgaagtca ccagggttgg 4501 atttggtcag gcgcaccctg gacgaagctc gtgctgctgc ccgcgcgcgc ggacaagacg 4561 ccggtcgagg gcgggtcgct tccgttgcgt cgggtcgggt ggccggacgg cgacgaagct 4621 ggtcgggtcc ggggcccgac attcgtgatc cacaaccgct gggtaaggcc gctcgtgagc 4681 tggcaaagaa acgcggctgg tcggtgcggg tcgccgaggg tatggtgctc ggccagtggt 4741 ctgcggtggt cggccaccag atcgccgaac atgcacgccc gactgcgcta aacgacgggg 4801 tgttgagcgt gattgcggag tcgacggcgt gggcgacgca gttgaggatc atgcaggccc 4861 agcttctggc caagatcgcc gcagcggttg gcaacgatgt ggtgcgatcg ctaaagatca 4921 ccgggccggc ggcaccatcg tggcgcaagg ggcctcgcca tattgccggt aggggtccgc 4981 gcgacaccta cggataacac gtcgatcggc ccagaacaag gcgctccggt cccggcctga 5041 gagcctcgag gacgaagcgg atccgtatgc cggacgtcgg gacgcaccag gaagaaagat 5101 gtccgacgca cggcgcggtt agatgggtaa aaacgaggcc agaagatcgg ccctggcgcc 5161 cgatcacggt acagtggtgt gcgaccccct gcggcgactc aaccgcatgc acgcaacccc 5221 tgaggagagt attcggatcg tggctgccca gaaaaagaag gcccaagacg aatacggcgc 5281 tgcgtctatc accattctcg aagggctgga ggccgtccgc aaacgtcccg gcatgtacat 5341 tggctcgacc ggtgagcgcg gtttacacca tctcatttgg gaggtggtcg acaacgcggt 5401 cgacgaggcg atggccggtt atgcaaccac agtgaacgta gtgctgcttg aggatggcgg 5461 tgtcgaggtc gccgacgacg gccgcggcat tccggtcgcc acccacgcct ccggcatacc 5521 gaccgtcgac gtggtgatga cacaactaca tgccggcggc aagttcgact cggacgcgta 5581 tgcgatatct ggtggtctgc acggcgtcgg cgtgtcggtg gttaacgcgc tatccacccg 5641 gctcgaagtc gagatcaagc gcgacgggta cgagtggtct caggtttatg agaagtcgga 5701 acccctgggc ctcaagcaag gggcgccgac caagaagacg gggtcaacgg tacggttctg 5761 ggccgacccc gctgttttcg aaaccacgga atacgacttc gaaaccgtcg cccgccggct 5821 gcaagagatg gcgttcctca acaaggggct gaccatcaac ctgaccgacg agagggtgac 5881 ccaagacgag gtcgtcgacg aagtggtcag cgacgtcgcc gaggcgccga agtcggcaag 5941 tgaacgcgca gccgaatcca ctgcaccgca caaagttaag agccgcacct ttcactatcc 6001 gggtggcctg gtggacttcg tgaaacacat caaccgcacc aagaacgcga ttcatagcag 6061 catcgtggac ttttccggca agggcaccgg gcacgaggtg gagatcgcga tgcaatggaa 6121 cgccgggtat tcggagtcgg tgcacacctt cgccaacacc atcaacaccc acgagggcgg 6181 cacccacgaa gagggcttcc gcagcgcgct gacgtcggtg gtgaacaagt acgccaagga 6241 ccgcaagcta ctgaaggaca aggaccccaa cctcaccggt gacgatatcc gggaaggcct 6301 ggccgctgtg atctcggtga aggtcagcga accgcagttc gagggccaga ccaagaccaa 6361 gttgggcaac accgaggtca aatcgtttgt gcagaaggtc tgtaatgaac agctgaccca 6421 ctggtttgaa gccaacccca ccgactcgaa agtcgttgtg aacaaggctg tgtcctcggc 6481 gcaagcccgt atcgcggcac gtaaggcacg agagttggtg cggcgtaaga gcgccaccga 6541 catcggtgga ttgcccggca agctggccga ttgccgttcc acggatccgc gcaagtccga 6601 actgtatgtc gtagaaggtg actcggccgg cggttctgca aaaagcggtc gcgattcgat 6661 gttccaggcg atacttccgc tgcgcggcaa gatcatcaat gtggagaaag cgcgcatcga 6721 ccgggtgcta aagaacaccg aagttcaggc gatcatcacg gcgctgggca ccgggatcca 6781 cgacgagttc gatatcggca agctgcgcta ccacaagatc gtgctgatgg ccgacgccga 6841 tgttgacggc caacatattt ccacgctgtt gttgacgttg ttgttccggt tcatgcggcc 6901 gctcatcgag aacgggcatg tgtttttggc acaaccgccg ctgtacaaac tcaagtggca 6961 gcgcagtgac ccggaattcg catactccga ccgcgagcgc gacggtctgc tggaggcggg 7021 gctgaaggcc gggaagaaga tcaacaagga agacggcatt cagcggtaca agggtctagg 7081 tgaaatggac gctaaggagt tgtgggagac caccatggat ccctcggttc gtgtgttgcg 7141 tcaagtgacg ctggacgacg ccgccgccgc cgacgagttg ttctccatcc tgatgggcga 7201 ggacgtcgac gcgcggcgca gctttatcac ccgcaacgcc aaggatgttc ggttcctgga 7261 tgtctaacgc aaccctgcgt tcgattgcaa acgaggaata gatgacagac acgacgttgc 7321 cgcctgacga ctcgctcgac cggatcgaac cggttgacat ccagcaggag atgcagcgca 7381 gctacatcga ctatgcgatg agcgtgatcg tcggccgcgc gctgccggag gtgcgcgacg 7441 ggctcaagcc cgtgcatcgc cgggtgctct atgcaatgtt cgattccggc ttccgcccgg 7501 accgcagcca cgccaagtcg gcccggtcgg ttgccgagac catgggcaac taccacccgc 7561 acggcgacgc gtcgatctac gacaccctgg tgcgcatggc ccagccctgg tcgctgcgct 7621 acccgctggt ggacggccag ggcaacttcg gctcgccagg caatgaccca ccggcggcga 7681 tgaggtacac cgaagcccgg ctgaccccgt tggcgatgga gatgctgagg gaaatcgacg 7741 aggagacagt cgatttcatc cctaactacg acggccgggt gcaagagccg acggtgctac 7801 ccagccggtt ccccaacctg ctggccaacg ggtcaggcgg catcgcggtc ggcatggcaa 7861 ccaatatccc gccgcacaac ctgcgtgagc tggccgacgc ggtgttctgg gcgctggaga 7921 atcacgacgc cgacgaagag gagaccctgg ccgcggtcat ggggcgggtt aaaggcccgg 7981 acttcccgac cgccggactg atcgtcggat cccagggcac cgctgatgcc tacaaaactg 8041 gccgcggctc cattcgaatg cgcggagttg ttgaggtaga agaggattcc cgcggtcgta 8101 cctcgctggt gatcaccgag ttgccgtatc aggtcaacca cgacaacttc atcacttcga 8161 tcgccgaaca ggtccgagac ggcaagctgg ccggcatttc caacattgag gaccagtcta 8221 gcgatcgggt cggtttacgc atcgtcatcg agatcaagcg cgatgcggtg gccaaggtgg 8281 tgattaataa cctttacaag cacacccagc tgcagaccag ctttggcgcc aacatgctag 8341 cgatcgtcga cggggtgccg cgcacgctgc ggctggacca gctgatccgc tattacgttg 8401 accaccaact cgacgtcatt gtgcggcgca ccacctaccg gctgcgcaag gcaaacgagc 8461 gagcccacat tctgcgcggc ctggttaaag cgctcgacgc gctggacgag gtcattgcac 8521 tgatccgggc gtcggagacc gtcgatatcg cccgggccgg actgatcgag ctgctcgaca 8581 tcgacgagat ccaggcccag gcaatcctgg acatgcagtt gcggcgcctg gccgcactgg 8641 aacgccagcg catcatcgac gacctggcca aaatcgaggc cgagatcgcc gatctggaag 8701 acatcctggc aaaacccgag cggcagcgtg ggatcgtgcg tgacgaactc gccgaaatcg 8761 tggacaggca cggcgacgac cggcgtaccc ggatcatcgc ggccgacgga gacgtcagcg 8821 acgaggattt gatcgcccgc gaggacgtcg ttgtcactat caccgaaacg ggatacgcca 8881 agcgcaccaa gaccgatctg tatcgcagcc agaaacgcgg cggcaagggc gtgcagggtg 8941 cggggttgaa gcaggacgac atcgtcgcgc acttcttcgt gtgctccacc cacgatttga 9001 tcctgttctt caccacccag ggacgggttt atcgggccaa ggcctacgac ttgcccgagg 9061 cctcccggac ggcgcgcggg cagcacgtgg ccaacctgtt agccttccag cccgaggaac 9121 gcatcgccca ggtcatccag atccgcggct acaccgacgc cccgtacctg gtgctggcca 9181 ctcgcaacgg gctggtgaaa aagtccaagc tgaccgcctt cgactccaat cgctcgggcg 9241 gaatcgtggc ggtcaacctg cgcgacaacg acgagctggt cggtgcggtg ctgtgttcgg 9301 ccgacgacga cctgctgctg gtctcggcca acgggcagtc catcaggttc tcggcgaccg 9361 acgaggcgct gcggccaatg ggtcgtgcca cctcgggtgt gcagggcatg cggttcaata 9421 tcgacgaccg gctgctgtcg ctgaacgtcg tgcgtgaagg cacctatctg ctggtggcga 9481 cgtcaggggg ctatgcgaaa cgtaccgcga tcgaggaata cccggtacag ggccgcggcg 9541 gtaaaggtgt gctgacggtc atgtacgacc gccggcgcgg caggttggtt ggggcgttga 9601 ttgtcgacga cgacagcgag ctgtatgccg tcacttccgg cggtggcgtg atccgcaccg 9661 cggcacgcca ggttcgcaag gcgggacggc agaccaaggg tgttcggttg atgaatctgg 9721 gcgagggcga cacactgttg gccatcgcgc gcaacgccga agaaagtggc gacgataatg 9781 ccgtggacgc caacggcgca gaccagacgg gcaattaatc aggctcgccc gacgacgatg 9841 cggatcgcgt agcgatctga ggaggaatcg ggcagctagg ctcggcagcc gggtacgagt 9901 gttaggagtc ggggtgactg caccgaacga gccgggggcg ctcagcaagg gcgacggccc 9961 gaatgcggat ggcttggtcg accgtggggg cgcacatcgg gcagcgaccg ggccaggccg 10021 cataccagat gctggagacc cgccgccgtg gcagcgtgct gcgactcggc aatcccaagc 10081 ggggcatcgt cagccgccgc cggtatcaca ccctgagggg cgcccgacca acccgcccgc 10141 cgccgccgat gctcggctga atcgcttcat ctccggtgcg tctgccccgg tgaccggccc 10201 agccgccgcg gtcaggaccc cgcagccgga tcccgacgct tcgctggggt gtggcgacgg 10261 ttcccccgcc gaggcctatg ccagcgagct gcccgaccta tccggcccga ctccgcgggc 10321 cccgcaacgc aaccccgcgc cggcgcgtcc cgcggagggt ggcgcgggat cgagagggga 10381 ttcggccgcc ggttcgagcg gcggtcgttc gattaccgct gagagtagag acgcccgtgt 10441 ccagctgtcg gcgcggcgaa gccgcgggcc ggttcgagcc agcatgcaga tccgacggat 10501 tgatccatgg agcacgttga aggtgtcgct gttgttgtcg gtggcgctgt tcttcgtctg 10561 gatgatcacg gtcgcgttcc tctacctggt gctcggcggt atgggcgtat gggccaagct 10621 caacagcaac gtcggtgacc tgttgaacaa cgcgagcggc agcagcgcgg aacttgtctc 10681 cagcggcacc atcttcggcg gcgcattcct gatcggcttg gtcaacgtcg tcctgatgac 10741 cgcgcttgcc accatcggtg cgttcgtcta caacctgatc accgatctga tcggcggcat 10801 cgaagtgacg ctggcagacc gggactaatg ttttgagagt cgggcgccgg ttgcggtaat 10861 ctcgtcgctc ggccgtacgc gagtacgggc ctatagctca ggcggttaga gcgcttcgct 10921 gataacgaag aggtcggagg ttcgagtcct cctaggccca cgaccatgtg cccgtcacga 10981 cgttcggtga ggttcgcatt gccactggcc gcgatcgctg tggcggccat cgtcgtgcgg 11041 ttccgacgcg gagccgatgt ctggcatgtg gccggcgatc cacctcctga tcacataacc 11101 ggtgacgaag aggggcctta gctcagttgg tagagcactg cctttgcaag gcaggggtca 11161 ggggttcgag tcccctaggc tccacaagtg aaaagcgtag ctcggatact tcgaatgacc 11221 acgtttgatc acaatcgcga gtgaagaggg cgttgatggc cactccgacg gcctcgacac 11281 ccgacccgta caggtggcgg tagcggtcca aggtcaaccc ggcggagtcg tgttcgagca 11341 tgttctgaag tgccttgaat tcgccccggc ctggatcgcc aacgacgccg ggtgtgccga 11401 gctcatgcag ttttgaactc ctacaccacc gccggcttcc cggtagcgtc catcacagtc 11461 tgagggaaca gctgcgccgc ggtcaccgcc tgcgaccacc accggcgccg cacatggctg 11521 ccgcgcatgt agccgcccgc cgagtccggg aacgctagaa gctcagcaac ccatcgaacg 11581 cggtcggccg gttgtcggcg tccacgagca cgcaccctag agcgaaagtc atggatccgc 11641 cgttggcggg gtctccggta ttgccggact cgtctatgta agcgaccagc acgcgacgat 11701 gctggcacga ttcttgggcg attgaccaca gttacagata actactgtta accgcagttg 11761 tgtcctttcg caggtggact gagttgtaac ccattgatct gcatcatgat tcgcctgtgc 11821 aaggcggggg tcaggggttc gaatccctag gccccaccgt gtgacgaccg gcctcagggg 11881 cgcggttgca cctcgacgct cggtggtcgg ggcgacggct ccggtcgcga cgagcgccgg 11941 acgatgctga aggcgacggc accgccggcg aggatggccg ccgcgatccc cgcgaagatc 12001 cagaggtggt gtttgctgcg acgttgggtc cgggcgtcct gtagggcctg cggtaggttg 12061 gccaccacgt cctgggcagc ggtcagctct tgagcgagcg tctcttgggc ggcagcgacc 12121 tcgcgggcca atcggccttc ccggtaacgg cggcgaagcc cggccgcggt cgaccgggcg 12181 gactgaagcc caagtccgac cccgagttcg aggaggcctc gggtcacgtc caccggaccc 12241 accgcagagt aggccagacc ccgggtcagc cgctcgcgtg gggtcaaccg ggtttccacc 12301 tgctcactca ttttgccgcc tttctgtgtc cgggccgagg cttgcgctca ataactcggt 12361 caagttcctt cacagactgc catcactggc ccgtcggcgg gctcgttgcg ggtgcgccgc 12421 gtgcgggttt gtgttccggg caccgggtgg gggcccgccc gggcgtaatg gcagactgtg 12481 attccgtgac taacagcccc cttgcgaccg ctaccgccac gctgcacact aaccgcggcg 12541 acatcaagat cgccctgttc ggaaaccatg cgcccaagac cgtcgccaat tttgtgggcc 12601 ttgcgcaggg caccaaggac tattcgaccc aaaacgcatc aggtggcccg tccggcccgt 12661 tctacgacgg cgcggtcttt caccgggtga tccagggctt catgatccag ggtggcgatc 12721 caaccgggac gggtcgcggc ggacccggct acaagttcgc cgacgagttc caccccgagc 12781 tgcaattcga caagccctat ctgctcgcga tggccaacgc cggtccgggc accaacggct 12841 cacagttttt catcaccgtc ggcaagactc cgcacctgaa ccggcgccac accattttcg 12901 gtgaagtgat cgacgcggag tcacagcggg ttgtggaggc gatctccaag acggccaccg 12961 acggcaacga tcggccgacg gacccggtgg tgatcgagtc gatcaccatc tcctgacccg 13021 aagctacgtc ggctcgtcgc tcgaatacac cttgtggacc cgccagggca cgtggcggta 13081 caccgacacg ccgttggggc cgttcaaccg gacgccctca cgccaagtcc gctcaccttt 13141 ggccgcgacc ggcgtaaccg gcagcggtaa gcgcatcgag cacctccact gggtcgctgc 13201 cgagatccca gcgggacaaa atcagcagcc cccgctgacc gtttcgatct cgagcaggcg 13261 caccaggcgg ccgtaacggc gaaactcgtc gattcggatg atcttgatat tggaatgtcg 13321 taatagctgc gtccggaacc aacctcggat cgccaggccg tcgggggtaa ttgccagcct 13381 tggacgtgcg cgccaagtgg cgctcgcaaa caagatcaga cccagcgcgg caactccggt 13441 caacacccgc ccgggcgtgt ctgtgactaa ggtcacagac gcaatagcca tcacgactcc 13501 cccggctccg caaccagcga ttcccgaggt gcgaggcgcc catgctgttt gctgcatgta 13561 ttccttagac cctctcacca ctgcagacaa agttatccac agacgctatc aacagtgggg 13621 atgaatcaca tgcgtgtgat tgagtgacca aaaggttgct ggcacagtaa cgacccgacc 13681 agaatatgaa ttcattctat cggcggcgtg gatcaatgcc agcgcatcgt gagcaacaaa 13741 ccggtgatca tgaaagcgaa cgcgatcgca tagttccagg gaccgagttg cgccatccaa 13801 ttgagcgctg tgggggcttg gctgccaatg gctgccaact gaaacaccat taaccagatg 13861 agtccgatca gcatcagacc gatgaacaac gagacgaacc atacgctcga cggtccgacc 13921 ttcaccttca tcggcgtgcg gctcaccgcg ctgacggtga agtcgttctt cttgcggacc 13981 ttggacttgg gcatcacttt cctcgggatc tggcgggact acctcgacaa gacgacgaat 14041 ggcccggggt gcaacgatag aagttgcagc tgcaggcata ccttgttatg agactaaccc 14101 acccaacacc ctgcccggaa aacggagaga ccatgattga tcggcgccga tcggcgtggc 14161 gtttcagtgt ccccttagtg tgcttgctgg cggggctgct gctggccgcc acgcatgggg 14221 tgtcgggcgg caccgagatc cgccgcagcg atgcgccgcg actggtcgac cttgtccgtc 14281 gggcgcaggc atcggtgaac cgtctcgcca ccgaacgcga agcgctgacc accagaatcg 14341 actcggtgca cggccgatct gtcgataccg cgttggcggc catgcagcgg cggtccgccg 14401 agctggccgg tgtggcggct atgaatccgg tccatgggcc gggcctggtg gttaccctgc 14461 aagacgcgca acgcgacgcc aacggccggt ttccgcgcga cgcgtccccg gacgatctgg 14521 ttgtgcatca gcaagacatc gaggctgtcc tcaacgcgtt gtggaatgcc ggtgctgagg 14581 cgatccagat gcaggaccag cgcatcatcg cgatgtcgat agctcgttgt gtcggaaaca 14641 cgttgctgct caacgggcgt acctatagcc cgccctacac gatcgccgcg atcggagacg 14701 ccgccgccat gcaggctgct ctggctgcgg ctcccctggt gacgctctac aagcagtacg 14761 tggtccggtt cggcctcggg taccgcgaag aagtccatcc tgacttgcag atagtcggct 14821 atgccgatcc cgtccggatg cacttcgcgc agcctgcagg ccccttggac tactgaacga 14881 ctgccggcag ggtcaggcgg tagcctgtca cgatgcggat cctggtcgtt gacaactacg 14941 acagcttcgt gttcaacctg gtgcagtacc tcggccagct cggcatcgag gccgaggtgt 15001 ggcgcaacga cgaccaccgg ctatccgatg aggccgccgt cgccggccaa ttcgacggtg 15061 tcctgctcag tcccggtccg ggtaccccgg agcgcgcggg cgcgtcggtg agtatggtgc 15121 acgcgtgtgc ggcagcacac acccctttgc tgggggtctg ccttgggcac caagccatcg 15181 gcgttgcgtt cggcgccacc gtggaccgtg cgcccgagct attgcacggc aagaccagca 15241 gcgtattcca caccaatgtc ggtgtgctac aagggcttcc ggatcccttc acggccactc 15301 gataccattc gttgacaatt ctgcctaagt cgctgccagc ggtgctgagg gtcacggccc 15361 gcactagcag cggtgtgatc atggccgtgc agcacaccgg gctgccgatc cacggtgtcc 15421 agttccatcc ggagtcgatt ctcaccgagg gcgggcaccg catactggcc aactggctca 15481 cctgctgcgg atggacgcaa gacgacaccc tggtacgtcg gctggaaaac gaagtgctca 15541 ccgccatctc accgcacttc ccaacttcaa ccgctagcgc gggcgaagct actggccgaa 15601 cctcagcgtg atgatgccgt cccggttgac gccggtcccc gccggcgggt tttgatagac 15661 gacccggttg tgttgggagc caccggcgtc gacgtcggcc cctttgtcga gcatcccggt 15721 ccagcccagc gcgcgcaatc gtggttcggc gtcgacccag aacatgccgg ataggtcggg 15781 catgacgaat tggttgccct tggacacctg tagttcgatg actgaatcga ccggaactgt 15841 ggtgcctgcg ggtggattgg tgccggtcac ctcgccggcg ggacgggggc tgtccaccga 15901 ggcctgactg aatttggtga agccgtagac gttgaggttc ttctgcgcca cgtcgacggt 15961 ctggcccgcg acatcgggaa tgtctttggt cgccggacca gagccaacga tgatgatgac 16021 cacattggtg atggccgacg tctggttggc tggcgggttg gtcccgatga ccttgcccac 16081 cagttccggg gtggacggcg aattcgcttg cttgaagcgg ccgaatccgg cggcagtcag 16141 tttcttgacc gcttcggcgt atgtcagcgt ggagacgtcg ggtatttcgc gttgctcggg 16201 tccggtggac acgttgactg tgatctcgtc gcctgcactc accgacgtgt tggcggccgg 16261 gtcggtgccg ataacgtggt ccggtgggat tgtcgagtcc ggcttctgca aggtgcggat 16321 tttgaagccc cggttttgca gtgtggcgat ggcgtcggcg gaggattgac cccgaacgtc 16381 gggaacttga acgtcgcggg tgatgccgcc gaacgtgttg atggcgatgg ttaccacgac 16441 ggtcagcaca gcgagcacgg cgaccaccgc aacccaacgg cccaccgaac cgatgctgcg 16501 gtcacggtcg gtgtcgtcta agtcctggcg tggtagcgga tcggtgcgcg gaccgctaag 16561 gttgccggcc gcagacgaca gcagcgaggt ccgctcggca tcggtgagca ctttgggcgc 16621 ctcgggcggc tcaccgttgt gcacgcggac caggtcggcg cgcatctccg ccgctgtctg 16681 atagcggttt tccggatttt tggccagcgc cttgagaacg acggcgtcca ggtcggcgga 16741 gaggccttcg tgccgcgccg aaggtgggat cgggtcttcg cgcacatgtt ggtaggcaac 16801 cgagacgggt gagtcgccgg tgaaaggtgg ctccccggtg aggacttcat aaagaacaca 16861 gcccaaggaa tagacatcgg atcgggcgtc gacggaatca ccccgggcct gttcgggtga 16921 caggtactgc gccgtgccga tcactgctgc ggtctgggtc acgctgttgc cgctgtcggc 16981 aatggcgcgg gcgatgccga aatccatcac ctttactgca ttggtcgcgc tgatcatgat 17041 gttcgccggc ttgacgtcac ggtggatgat tccgttctga tgactgaagt tcagcgcttg 17101 gcaggcgtcg gcgatgacct cgatggcgcg tttgggcgtc atcggccctt cggtgtggac 17161 aatgtcgcgc agggtaacgc cgtcgacgta ttccatgacg atgtagggca atggcccggc 17221 gggcgtttcg gcttcaccgg tgtcgtagac cgcgacgatt gcagggtggt tcaatgccgc 17281 ggcgttttgc gcctcacgcc ggaagcgaag gtaaaaactg ggatcgcggg ctagatcagc 17341 gcgcagcacc ttgaccgcaa cgtcgcggtg caaccggagg tcgcgggcca ggtggacctc 17401 ggacatgccc ccaaatccaa ggatttcgcc aagttcgtag cggtcggaca ggtgggaagg 17461 ggtggtcatt gcgctatctc gtatcgggcc agcgacgcgc gcgaatgcgg tgtcggcggg 17521 acaacccagc tttgcagtcc agaatgacgt gtttccccgc gttccgtcca attgagtcgc 17581 gggctagcat cagtcccgcc agtgttgctg gccggagggt tcccggtggt ggtcacggtc 17641 ggcgtcggtg cctgctgcgg gctgttgtcc ccgggcgctt tgatgacgag cagcacggcg 17701 atgatgattg ccagcgcccc cagcaccccc gcggcccaga gcagcgcacg ctgaccggac 17761 gaaaacgtgc gccgcggcgg ccggtgacca cccgtggccg ggcgggatcg acgggatgcc 17821 gcagtccggc cagcagagtt ggccgcgacc ctggccgtcg tacccgacgg aatggccgcc 17881 ggggcggccc ggccaggggg gggtgtctgg ctgggccgcg ggggccggcg gccggcgcgc 17941 accgctgcca ccgcgtcggc gaacggtccc ccactgcgat agcgcatcgc ggggttcttc 18001 accagagtta tctcgatgag ttctcgcaca ttgggcggca ggtcgggagg cagcggcggc 18061 ggcggctcct tgatgtgctt cattgccacg gtcagggcac catcgccggc gaacggccgt 18121 ttacccgaaa ccgcttcata cccaacaact cccagtgaat agacgtcgct ggccgggctg 18181 gcgtcgtgac cgagggcctg ctccggcgcg atgtattggg cggtgcccat caccatgccg 18241 gtctgggtca cgggcgctgc atcgacggct ttggcgatgc cgaagtcggt gatcttcacc 18301 tgcccggtgg gggtgatcaa gatgttgccc ggtttgacgt cgcggtgcac caggccagcg 18361 gcatgcgcga tctgcagagc gcggccggtc tgctcgagca tgtccagtgc gtgccgcaac 18421 gacagccggc cggtgcgttt gagcaccgaa tttagtggct cgccgttgac cagctccatc 18481 accaggtagg ccgtgcgacc ctccccgttc atctggcttt cgccgtagtc gtgcacgctg 18541 gcgatgcccg gatggttcag catcgcggtg gtgcgcgctt cggcccggaa ccgttcgatg 18601 aactccggat cggaggagaa ctcgctcttg agcaccttca ccgcaacacg ccggcccaac 18661 cggttatcca cggcctccca gacttggccc ataccaccgg tggcgatgag gcgctgcagg 18721 cggtatctgc ccgacagcgt cacgccaact cgggggctca tggttccccc tgcagtgcgg 18781 cttcgatcac cgcccgcccg atcggtgccg cgagggcacc tccggtggcg gacagccgat 18841 cagccccgtt ctccaccagc acggcaacag ccaccttggg cgcttgtgcg ggcgcaaagg 18901 cgatgtacca agcgtgcggt ggagtgtgac gagggtcggt gccatgttcg gcggtgcccg 18961 tcttggatgc gatctgcacg ccggggattg cccctttctg ctgtgcgact ttctcggcgc 19021 cgaccatcag ctctgttagc ttagcggcga cctgcggtga caccgcgcgg cgctgctggt 19081 atccgacggt ggttgagata ttggctaggt ccggtccctt gaggctgccg actagataag 19141 gcctcatcgt aatgccgccg tttgcgatgg tcgcggctat ttctgcgttc gctagcgggg 19201 tcagcgcaac gtccttttgg ccgatactgg tcatccctag tgcggcgctg tccgggatag 19261 gcccgacggt tgattccgcc acttgcagcg gagttgggcg cggtgggcta tcgagaccga 19321 acgcgcgcgc catgctgcgc agggcgtcgg cgccggtgcg gatgcccagc tggacgaatg 19381 cggtgttgca tgatttgacg aatgcctcac gcagcgacac ggtgggttcg tccccgcacg 19441 gcgcaccgcc gtagttctct agctgggcgg tgctgcctgg caacggaatt gtgggcgccg 19501 cagtcagctg ttcggtctcg gtggccccgg cggccagcgc ggccgcagtg gtgatcactt 19561 tgaaagtcga acccggtgga tacgtctcag agatggcacg gttggtcagt ggagaggcgg 19621 gattgtcgcc aagccgctgc caggcttgcg cctgcacctc ggggttatgc gacgccagca 19681 ggttggggtc gtaggacgga gaagacacca acgccaaaat cttgccggtt gatggctcaa 19741 gggcgaccac cgctccctta cagggcccgt agcagccttg ctgcatcgcg tcccagccgg 19801 cttgctgaat gcgcgggttg atcgtggtat cgacattacc gccgcgtggg tcgcgaccgg 19861 tgaagaagtc ggccagccgg cggccgaaca gacggcggtc ggacccgttc aatatcgggt 19921 cctcggctcg ttctagggcg gtgctggaat agcgcaggga gtagaagccg gtaaccggcg 19981 cgtacacctc aggattggga tagacccgca ggaaacgaaa gcggccgtcg gtggctaccg 20041 agtacgccag cagttggcca ccagcggtga tctggccgcg ctgccgtgaa tactcgtcga 20101 gcaacactcg ctggttgcgg ggatcggcac gcagcccgtc ggcggtgaag acctgcgtca 20161 tggtcgcgtt gagcagtagc aacacgatca acgccatcac ggtcaccgat attcggcgca 20221 gagaggcgtt catacgcgtt cgatgacctc ggtgccggcc gccgtaatcg gcgacttatt 20281 tcgtgggcgg gtgcgcagtg ggcggcgggc tccgtgcgag atgcgtgcca ggatggccag 20341 caatatgtag ttggccagca gtgaagaccc gccgtaggac atccacggtg tggtcaaccc 20401 ggtcagcgga atgagtcggg tcacaccgcc gacgacgatg aacagctgaa tggctagcgt 20461 cgatgagagg ccggcggcca gcagcttgcc gaagctatcg cgggtggcga tggccgtgcg 20521 caaaccccgg atgatcacga tggtgtagag catcaggatg gccgtcaagc ccaccaaccc 20581 aagctcttcg ccgaacgcgg cgatgatgaa atcggtggat gccgcgggca cggtgtcggg 20641 ttgaccatta ccgagcccgg tgccgaagat accgcctgta gcgaagctga aaagcgactg 20701 cacgatctga tatccggtgc cgtctggatc tgcgaacgga tccagccagg tctgtacgcg 20761 gagccggacg tgctcaaaaa tgaagtacgc caccaaggtt cctgccgcga acagagtcag 20821 gccgatgacg acccaactga accgctgggt ggcgaggtaa accaccacca gaaacgatgt 20881 gtacagcagc agcgaagcgc cgaggtcttt ctcgaagacc atcacaccca ccgagatgac 20941 ccaggctgcc aacagtggcg cgaggtctcg cgggcgcggc agggtcattc cgagcaaatg 21001 tttgccggcg ctggtgaaca ggccgcgttt ggccaccagt accgccgaaa agaagatcag 21061 cagcagaatc tttgaaaatt cggcgggttg aatcgagaag ccgggcaacc ggatccagat 21121 cttggcgccg ttctgttcgg acagtgctgc cgggagcagc gcgggaactg ccaagaaaac 21181 cagacccgcg agcccgcaaa tgtagccgta gcgtgcgagc tgtcggtggt ccttgaggaa 21241 ggtcaccacg agcgcgaagg cagctacgcc caccagcgtc cacagcatct gctggtttgc 21301 gctggggtgc cgatgctcgc cgatctcgtt gtccaccaga tcgaggcggt ggatcattac 21361 caggccaagt ccgttgagca gtgccaccac cgggagcaac agcgggtcag tgtagggggc 21421 gaagcgccgg atggccagat gcgcggatcc gaacagggtc aggaaggcca gtccgtagct 21481 agtcaagtcc cagggcaccc cctggtcttg attggcctgc acgaccagca gtgcggcaaa 21541 cgtgattacg gcggcaaagc acagcagcag cagttcagcg ttgcgccgag tcggcaacgg 21601 gggcgttacg gccaccggcg cttgcagtcg tgtcgtcatg ccgccgcccg gcagtcgatg 21661 cccggctgag gcgggggtgg cggaagtgcg gccatcgtcg gcgagctggt gacgggccaa 21721 ggcgtcggcg gcgacgcggg cgctgccggg gaggcactcg tggggatggc aggagtagtt 21781 ccggtggggg ccgacgcgga ggtggtgggt gatggagcgg ctggcgagga ggtgacgttt 21841 ggttcggttg tctcgctggt ggtgggtggg gccgggcgcc cgggcgggga cgtggcacgc 21901 ggcgccgggc aaggcggcag cagggagttg gccgccagtt cgcgcaactg cccgatggcg 21961 tcatcgagag tgccggccgg gagaccggcc cgaacctgtg cgcgctccgg cggtcgcaga 22021 tcctccagtt tcatcagatg gcagtcgaga gggcccccag actgtccgta gctgatctgc 22081 gacagctcgt tacgcgggct gaggcagccc atcaggtaag gctggtgcag ggacatgccc 22141 agtagcgacc cttgaatccc ccgcatgatg gacacgctgc cggcgtagtc cgctacgtag 22201 tagttgctgc ggatgatcgc gcgaccaatg agcaggcccg cagtcatcag cacggtcacc 22261 agggcgacaa cgaatgctag ccgtcggccc gaccaccgtg gccgactgaa tgtatcggcc 22321 tgtggcggaa cgcgtttaac gatctccttg cgctggctga tggcagaggc ccggccggcg 22381 gcggtgttgg gcagggtcag ttggtcgtcg tcgcctgaga ccgccccggc cagaatcggt 22441 tgggtctggc cgtagtcgta gtcgacgacg tcggcgacga cgacagtgac gttgtcgggg 22501 ccgccgccgc gcagcgccag ttcaatgagg cggtgagcgc tctcggcaac ctcggggatc 22561 tgcagggcct cgaggatagt ttcatcgcta accggatcgg acaacccgtc cgagcacagc 22621 aggtaacgat caccggcgcg ggcttctcgc atggtcagcg tcggttcgac ctcatggccg 22681 gtcaacgccc gcatgatcaa cgagcgttgc gggtggctgt gcgcctcctc cggggtgatc 22741 cggccttcgt cgaccagcgt ttggacaaac gtgtcgtcct tggtgatctg cgtcagctca 22801 ccgtcgcgca gcaggtaacc gcgcgagtca ccgatatgca ccaggccgag ccggttgccc 22861 gcgaacagga ttgcggtgag cgtggtaccc atgccttcga gatcgggctc catctcgact 22921 tgcgctgcga tagccgagtt gccggcgcgc accgcggcat ccagcttggc cagcagatcg 22981 ccaccgggct cgtcgtcatc gagatgggcc aatgcggcaa tcaccaactg ggacgccacc 23041 tcgccggccg catgcccacc catgccgtcg gccagggcca atagccgtgc cccagcgtag 23101 accgagtctt cgttgttggc gcgtaccaag ccgcgatcgc tgcgcgccgc gtatcgcagg 23161 accagggtca cgagcgccac tctcccccgc aagcgggtgg gggtaccccc cacttgtggg 23221 ggcgcgcccc caccgcttct ctgcgctctg catcgtcgcc agcgcgggtc acgggcgcaa 23281 ctcgattgca gttttgccga tgcgaaccgg cgttccgatc ggaactcgta ccgcagtcgt 23341 caccttcgcc ctgtccaggt aagtgccgtt ggtcgatcct agatcttcga cgtaccactc 23401 ggagccgcgc atagacagcc gagcgtgccg cgtcgaggcg tagtcgtcgg tcagcaccag 23461 ggtcgagtcg tcggcgcgcc cgatcaacac cggctgttcg ctcagcgtga tacgcgcgcc 23521 agtcaacgca ccttcggtca ccaccaggta gcgtgcagcg tgccggcgct gacgcgcgcc 23581 taagagcgtc cctcgcagcg ccaggccgcg gcgcatcatg accgcgccgg tcggcgcata 23641 aatgtcggtc ttcaagatcc gtagcacgga ccagatgaat acccacaaca acatcaagaa 23701 tccggcacgc gttagttgca gtaccaaccc ctgcatctgg cgtcctttcc gtcctgcacc 23761 gtctgctccg gccccgcgct gccgagcacg tcagcaaagt cacgatactt tgacggtggt 23821 cggcgcgggt caaccccggc agcttcgagc ccagtaggtt cagtgcatgc ggacgatgat 23881 ctcggagtgt cccaagcgga tcacatcacc gtcggccaac tgccactcct gtaccggtgc 23941 attgttaaca gtggtgccgt tggtggagtt caggtctgcg agcaatgcga cctgcccgtc 24001 ccaccggatc tccaagtgac ggcgtgacac accggtgtcg ggcagccgga actgggcgtc 24061 ctgtccgcga ccgatgatgt tggagccctc gcggagctgg taagtgcgtc cgctgccgtc 24121 gtcgagctgc agcgtaaccg acgttccggc ggacccatag ccgccctgcc cgtaaccgct 24181 gtagccaccg ggcgctggct gaccgtagtc cggagcgcct gattggccgt agtcgtagtc 24241 tcggccggcg ggttcggcgt acccgccacc ctgaggagcg tatcccggga cctgcgggga 24301 ttcggtgtag cgggtgtagt cagcgccgcc gccatagtct tgccggccgt atgtcgtggc 24361 gccttgctgg tagccctggt cgtaaccgcc ttggtcgggg taagccggtc gttgctcggg 24421 cgggcccgga gggccagagg gcacatagct gccctcctcg tggcgagccg ggccacgccc 24481 gtactccccg tacccgccgt agccgggctg gccgccaccg ggtgaagggc cgcagccgcc 24541 gctttggcga tagccctggt cgtagccggg agcgccgtag ccggcagccg ggccgggaga 24601 aacaggaggg cgttgctcgt agggcggcgg atagcccccc tgcccttggt cggggtagcc 24661 tcgaccctgg tcctggtgcc cgcgctggtc ggggtagccg cgttgctcgg ggtaaccgcc 24721 ctggtcgggg tacccgattt gctcggggta gtcgccctgg cccgggtggc gcgggcgtgg 24781 gtagcccggc tggggcgggt agccgcccgt ctcgggtgga taccccccgc gggggtcaga 24841 tccgccttgc ggatccgggc caccacgcgg atcctcttgc ggacgcgcat agcggtcgtc 24901 gtaatactcg tcgggacgcc cctgcccctg accgccacgg tagctcgaat tgtcactcat 24961 tggtgctact cctggttctg cgccaaacgc gtggtttgat tgtggccggg cgcaatcgat 25021 gaccggcggg tgggtctcaa cgtcggggtt aacagtgccg cgggcgcgga actggccggt 25081 atgcaggttc gacgactgct cgaatcggac gaccacatca ccatacgttt gccacccctg 25141 ttcttggata tagtccgcca agtcccgagc aaaaccggtt gacttcagct caggatcagc 25201 gcccaacttc tcaaagtcgt gcacaccgag ggtaatgatg tattcgttgg gcgccaaaag 25261 gcgatttccc tgcagcgact ggatgcggtc ggccgcctcg cggcgcagca gggcttcgac 25321 ctcttgcggg acgatcgagc ctccaaagat gcgggcaaac gcatcgccaa ccgtctgctc 25381 gagtttgcgc tcaacgcgct gaaccagcct tttctggcta cccatctttc agcgctcgcc 25441 tcactgttct ggtgcatcgt cggcgcaagg caaacgactc gcctgtatgt cgtgtcaatc 25501 aatcatggta tcgggacagt gtgagcgagc ggaaagggcc ggccacgccc actgagcccg 25561 ccggcgcccc tggcagcgga tggggcctgc ggctactaca gtggtatggt cctccggttg 25621 ttgcgggcga gtggcggaat ggcagacgcg ctggcttcag gtgccagtgt ccttcgggac 25681 gtgggggttc aagtccccct tcgcccaccg tactgtgaga cgagtcgtga ccgacatcgt 25741 cgtcgaaacg gccgccatgg gcgtggttgg aacggctagc gcacgcccac ggccagccca 25801 gggcaacccc ggtatcgacg tgactatcgc cggcgctgtc cggttcagct cggtcgcggc 25861 cgcagccggg tggcggggcg cctcggtacc cttttacacg gcgcgcatcg cggtcagcgt 25921 cctcgccgct tgttgcgcca taccggttat cacgtcggcg gctggcagga cggcattgac 25981 caggcccgcg gcttgaccgg cggtgacatt ggcgatgctg tagtcacgcg caggaacggc 26041 tcgccaatat ctggccatgg cttcttcgcg atggagaatg tcgagttcgg tgtcctcgaa 26101 ttggtcggtg agggcgttgc ttagcacgct catcgtgtgt ccttgcggcc agggatagcg 26161 ccgtagctga tcgtagatag tggtgcggca catgtcgtcg ccagtggccg ccagcagcgg 26221 gtcccgcgcc tgcggtgtgg ataacgcttc gaccgtggcg tagaagcgcg taccgaccaa 26281 taccccggcg gcgcccaaca tcaacgcggc ggcaaggccc cggccgtcgg cgatgccccc 26341 ggcggcgatc accgggatat cagttccccg cgcggtgacc aggtcgacga tttcgggtac 26401 caaggtcagg gtggaacgtg gaccgtggcc gtgcccaccg gcctcggtgc cctgagccac 26461 caacacatcg gcgccgacct gcagggctcg ctcggcctgg gtccggtttt ggatctggca 26521 gaccaaccgc gttccggcgg acttgatggc gtcagcgaaa accgcggggt ccccgaacga 26581 cagcatcacc gccaccggct catactgcag cgcgaggtcg agcagctgcg gttggcgggc 26641 caaagaccag gtgatgaacc cgcagcccac cggcgctcca gcggcgagat cgaactgccg 26701 ggccaaccaa tcccggtccc catagccgcc cccgatgagg ccgagtcccc ctgcgccact 26761 taccgcggca gccagctcac cgccggcgat caagtccatt ggcgcggaca ctatcggata 26821 gtcgattccg aacatctggc taaaggccgt cgatagcacc acaacaacct ccttggcgag 26881 cgtcgtgatg acacgcagat cctggccgat ggtaggtgat caggcgagcc acttcttcgg 26941 cgaactcgcg agccgagcct gatcacgctg ggtttggcaa ctgccgggct tgccgaccgg 27001 gcatcaagcg gccggttgtg ggccaacctg tgcgatcggc aggtgcacca cgaccccggg 27061 caccggggtg acctcgagtc cttcgttgcg ggccagcaga gccgcattgt ccgggagctg 27121 cctggaattg atctcgccgc cggcaatccg acgcagcacg tcgtgggctg ccgcgagctg 27181 ttcgcgcttt cggtactggc cgccgggaag cttgatgccg gcccatacgc cgtactccac 27241 ccgatgctcg accgcgtgtt gagcacaccg gcgctgctgt aggagcgggc accggcgcag 27301 gcattggatc cgcgcttggg tggccgaccg ctcataggcg cgtgccttag cggcgccgtc 27361 gctgccgtcg tcatcggggt acccgaacca tagttccggg tcggttgcgc aggggtgtgc 27421 catgtgccgg cctccttgtt gaacgaaacg taggcaaaag cgtatatgtc tgtggcgggc 27481 tctgcaagag aatcgcgata aaaacgtata tacataaggg gtggccgcgg ccgagtcgta 27541 tccgggtagt atccggctta tggccggagc gtgcggtgag ccgtgagtcg gccggcgcgg 27601 ccattcgcgc acttcgcgag tcgcgtgact ggtccctcgc ggacctggcg gccgccactg 27661 gcgtaagcac catgggcctg agctatctgg agcgcggtgc ccgcaagcca cacaaaagca 27721 cagttcagaa ggtcgaaaat ggcctcggcc tgccgcctgg cacctactcg cggctgttgg 27781 tcgccgctga tcccgatgcg gagctggccc gactgatcgc cgcacagccg tccaacccga 27841 cggctgtccg ccgcgccggt gcggtcgtcg tggaccgcca cagcgatacc gacgtgctgg 27901 agggctacgc cgaagcacag ctcgatgcca tcaaatccgt catcgaccga ttgcctgcga 27961 cgacctccaa cgaatatgag acgtatattc tctctgtgat cgcgcaatgc gtgaaggcgg 28021 agatgctggc cgccagctcc tggcgggtgg cggtgaacgc cggcgccgac tcgaccggcc 28081 ggctcatgga gcatctgcgg gcgctggaag ccacgcgcgg cgcgctactg gagcggatgc 28141 cgacaagctt gagcgcccgg ttcgatcggg catgtgcgca gtcgtcgtta ccggaggcgg 28201 tcgtggccgc gctaatcggc gtcggcgccg acgaaatgtg ggatatccgc aatcggggcg 28261 tcatccctgc gggcgcgctc ccccgcgtcc gagccttcgt cgacgcaatc gaggcaagtc 28321 acgacgcgga tgaggggcag cagtgaatta cagcgaggtc gagctgttga gtcgcgctca 28381 tcaactgttc gccggagaca gtcggcgacc ggggttggat gcgggcacca caccctacgg 28441 ggatctgctg tctcgggctg ccgacctgaa tgtgggtgcg ggccagcgcc ggtatcaact 28501 cgccgtggac cacagccggg cggccttgct gtctgctgcg cgaaccgatg ccgcggccgg 28561 ggccgtcatc accggcgctc aacgggatcg ggcatgggcc cggcggtcga ccggaaccgt 28621 tctcgacgag gctcgctcgg ataccaccgt tactgcggtt atgccgatag cccagcgcga 28681 agccatacgc cgtcgtgtgg cgcggctgcg cgcgcaacga gcccatgtgc tgacggcgcg 28741 acgacgggca cgacggcacc tggcggcgct gcgtgcgctg cggtaccggg tggcgcacgg 28801 cccgggggtc gcgctggcca aacttcggct gccgtcgccg agcggtcgcg ccggcatcgc 28861 ggtccacgcc gcgctgtcgc gacttggccg tccctatgtc tggggcgcaa cggggccaac 28921 cagttcgact gttccggttt ggtccagtgg gcctacgccc aggcgggtgt tcacctggat 28981 cgcaccacct atcaacagat caacgagggg atcccggtgc cgcgctcaca ggtccggccg 29041 ggcgatctgg tcttcccgca ccccgggcac gtgcagctgg cgatcggcaa caatctggtc 29101 gtcgaggcgc cccatgcggg cgcgtcggtt cgggtcagct cgctgggcaa caacgtgcag 29161 attcggcgac cgctgagtgg cagataatcg cccaatcaga cgggcaggat gagaaggttg 29221 aaccatgtcg gagcaagccg ggtcttcggt agctgtcatc caggagcgcc aggctttgct 29281 ggcaaggcaa cacgacgccg tggccgaagc cgaccgtgag ttggccgacg tgctagccag 29341 cgcgcatgcg gccatgcggg aaagcgtccg tcggctggat gctatcgcgg ccgaactcga 29401 ccgcgcggtt ccggatcagg atcagcttgc cgtcgatacg ctcatgggag cgcgtgagtt 29461 tcaaacgttc ctggtcgcca agcagcgcga gatcgtagcg gtcgtcgccg ccgcccacga 29521 gctcgatcgc gcaaaaagcg ctgtgctaaa gcgcctgcgg gcacagtaca cggaaccggc 29581 ccgttagctg cggaccggat acgctggacc ggcaggcgtt gggtgaattg tcggcgacta 29641 cacacctagg tactgtcacg cggcatggaa gcgccgggga cagggcccgc agtgggtcgc 29701 agtggcgttt gacgcggcga tgtccacgca cgaagatctc cttgccacga tcaggtacgt 29761 ccgcgaccga accggtgacc caaacgcgtg gcagaccggg ttgacaccga ccgaggtgac 29821 cgcggtggtc acgtccacga cacgttccga acagctcgat gccattttgc gtaagatccg 29881 ccagcggcat tcgaacctgt actatccagc accgcccgat cgggaacaag gagacgccgc 29941 ccgtgccatc gcggatgcgg aagcagctct ggcacatccg aattcggcta ccgcgcagct 30001 cgatctgcag gtcgtctcgg caattctgaa cgcgcatctg aagactgtcg agggtggcga 30061 atcgctgcac gagcttcagc aagagatcga agccgcggta cgcattcgat ccgatctgga 30121 cactccggcc ggcgcgcgtg atttccagcg tttcttgatc ggcaagctca aggatatccg 30181 ggaggtggtt gcgaccgcga gcctggacgc tgcgtcgaaa tccgctctga tggccgcctg 30241 gacatcgctg tatgacgcat ccaagggcga ccgtggcgat gccgatgacc gcggaccggc 30301 gtcggtcggc tcgggcggcg cgcccgcacg cggtgccggt cagcagccgg agttgccgac 30361 acgagccgaa cccgattgcc tcctcgactc gctgctgctc gaggatccgg gtttgctggc 30421 cgatgaccta caggtgccgg gaggcacatc cgcggcaata ccatcagcgt cgtcgacgcc 30481 aagcctgccc aatcttggcg gagcaacgat gccgggtggc ggagcaacac cggccttggt 30541 ccccggtgtg agcgcgccgg gtgggcttcc gctctccggc ctgctgcgcg gcgtgggtga 30601 cgaaccggag ttgacggact tcgacgaacg gggacaagaa gtcagggatc cggccgatta 30661 tgagcattcc aacgaaccgg atgagcgtcg cgccgacgac cgagaaggcg ccgacgagga 30721 cgccgggctg ggcaaatcag aatcgccacc gcaggctccg acgaccgtga cgctgcccaa 30781 cggtgagacg gtgaccgcgg ccagtcccca gctcgccgcg gcgatcaagg cggcggccag 30841 cggcacaccg atcgcagatg cgttccaaca acagggaatt gccatcccgc taccgggaac 30901 cgcggtcgcc aaccccgtcg accccgcccg gatctcagcg ggagacgtag gtgtgttcac 30961 cgatcgccac gcccttgccc ttggccctag caaagctctt ctggacggcc agattcaaca 31021 catctcagcc gtgcgagggc gaaactttct aggctggata catccagcgg cgaccgcgac 31081 cgcgccggcg aggaccgaag caccgacacc aaccaggccg gcggccgctc gataggtact 31141 gaccgcccgg tcacaacaag aggagacagc ggatgacaga tcgaattcac gtgcagcctg 31201 cacatttacg tcaggccgct gcccatcacc agcagaccgc cgactacctg cggaccgtgc 31261 cgtcgtcgca cgacgcgatc cgcgaaagtc tggactcgct ggggcctatt ttcagtgagc 31321 tccgcgacac cgggcgtgag ctgctcgagc tcagaaagca gtgctaccag cagcaagccg 31381 acaaccacgc cgatattgcc cagaacctgc gaacgtcggc cgcgatgtgg gagcagcacg 31441 agcgagcggc gtcgcgcagc ctcggcaaca tcattgacgg gagccgatga cagggcgatg 31501 accgacgcca atcccgcttt cgacacggtc caccccagcg ggcacattct tgttcggtcc 31561 tgccgcggtg gatacatgca tagcgtctcg ctgagcgagg cggcgatgga gaccgacgca 31621 gaaaccctgg cggaagccat cctgctcacc gccgacgtgt cctgccttaa agcgttgctg 31681 gaagtacgca acgagatcgt ggcggcgggc cacaccccgt ccgcgcaggt tcccacgacc 31741 gacgacctga acgtcgcgat cgaaaagctg ctggcccatc aactgcgccg ccgtaaccgt 31801 tgaagtgcta gatgagccag gtcttggtgc tgtcgggatc gggtgcgatg tcggtgggcg 31861 gctcgatcgg attggggccg aacaattctc gcgctcgagt gagcagagcc cgcacctcgt 31921 cgagttgctg ctgcagcgca gaatcagcca taaccccacg ctacccaggc cccgtctgac 31981 acacaattca ccacccgctc accgcctgcg cgggccagat gatgccggta cgcttacccg 32041 gtggcgatct tcggtcgatg gagtgcgcgc cagcgactcc ggagagcgac ccgggaatcc 32101 ctcacgattc cgacgtttag ctcctcgctg gattgcacca cacgggtaat tggcgggctc 32161 tggcccgctg agctttcgtc taacaccgcc gaaaccgcca cgcttgcaga acatctgaaa 32221 gcggatctgc atcggatagt tggttctgcc aacgacgagc tgatggtcat ctggcgtgcg 32281 gggatggctg attcgacgcg acgcgcagaa gaagacagag tgatcgaccg cgcccgcgcg 32341 tcggcgatgc gtcgcgtcga gtcggcgatg cgcgagcttc ggcagataac ggggcgcgtt 32401 cccgtggaaa ttccgcgtat gcgcggcgcc ggcggctcgg atctggacac gacacgactc 32461 atgccggccg tcacggtagt tcagcccgct gaccaggcct gtacggattg gccggttgcc 32521 gccgccgagg atgacgaagc ccgactgcag cgcctcctgg cgttcgtggc tcgtcaggag 32581 ccacggctga actgggcggt cggcgttaac gcggacggca cgacggtcct ggtcaccgac 32641 gtcgcccatg gttggatacc tccgggcatc gcccttcccg aaggcgtgcg attgttggca 32701 ccggcgcgac gcgccggcag agcccccgag ttggtcggta tcacgacgtg ttgcaagacg 32761 tacacccccg gtgactcgct gcgtcgggcg gtcgattcaa ccgcgccgac gtcctcggtg 32821 cagccgcgag cgttgccagc gatcgccggc ctgagtgtgg agctgggcat agcgacccag 32881 cggcacgacg gcttaccgaa gatcgtgcac gccatggcca cggcggccgg caacggcgcc 32941 gccgccgagg aagtcgacct gttgcgggtg cacgtcgata ccgcgctcca ccacgtcttg 33001 gcccagtatc cccgggtcga tccggcgtta ctgctcaact gtatgttgtt ggccgccacc 33061 gagcgcagcg tcacgggaga cccgatcgcg gcgaactatc acttcgcgtg gttccgggaa 33121 ctcgattcac gccgatagct ttctcgaatc cccacggcaa gcgtccggcg atgaattgac 33181 gctggtgggg ggcgtggaca tactgtcatg gtgtcggggt cggacagtcg cagcgaaccg 33241 agccagctga gcgaccgaga cctcgtcgaa tcggttcttc gtgacttgag cgaggcggcc 33301 gacaagtggg aggcgctcgt cacgcaggct gaaactgtta cctacagcgt ggacttggga 33361 gacgttcgcg ctgttgccaa ttcggacggg cggttgctcg agctgacgtt gcatccgggc 33421 gtgatgaccg gctacgcgca cggggagctg gccgaccgag tgaacctggc gattacggcc 33481 ctgcgcgacg aggttgaggc cgagaaccgg gcacggtacg gcggccgcct gcagtgacat 33541 cggtatctgc gaggatcaag cccatttgct ggcaaggcat ttcggcgcgg ggcgcaaggc 33601 ccacagccgg gccgtggcca ccctgaaagc cgatatccaa gcctggcacc cggctggcat 33661 ccagaccccg aagccgcgat gcgaatcaga tgtgttcgcg cgaatcggtc acacgagcca 33721 cccatcaact cggaagagcc gggtggggcc gggagcatcc gaggcaccgc ttgcctgaca 33781 taacagcata accgccccgc cattgtcgct gtgatggaca tgccccagcc atttgtcggc 33841 tagctataca gcgaacgtca atttttcgtg aatcagcctg aggctattga taattcacgg 33901 cggcacgtcc tactcttagc ggcgctatgc gacccaatgc gcgtgcgatg ttgcgtttgg 33961 tgcattgtgg tgccggtgct ggtgggccgg cgataacgtc gaaaggtgcg gtattgggtg 34021 accgtgtcgg cgcgttgtcg cagtgccgat cggcggcagc gctgagtcga ttcgactttg 34081 caccccgtga ctctgttccc accgccacct tcggtggtgg atgcgctttc aggtccacca 34141 ataggctagc tgttttcgag cggtgtattt gcgtgggggg tgaatgtgga tacggacaat 34201 gacaggccca cgctggcgag ggtttaccgc agcctgcggg acatttgtcc ggacagctgg 34261 aatcttccgg gcggtcggat gcccactggc ttgggctatg actttctgcg ccctgtcgag 34321 gactcgggga tcaacgacct gaagcactat tacttcatgg cggatttggc cgatgggcaa 34381 ccgctaggcc gggcaaacct ctatagcgtc tgtttcgacc tggccaccac cgaccgcaag 34441 ctcactccgg cctggcgaac gaccatcaaa cggtggtttc cggggtttat gaccttccgt 34501 ttcctcgagt gcgggttgct caccatggtg agcaacccgc tggcgttgcg gtccgacacc 34561 gacttggagc gggtattgcc tgtgctggcc ggccagatgg accagttggc gcatgacgac 34621 gggtcggatt tcttgatgat ccgggacgtg gacccggaac actaccagcg ataccttgac 34681 atcctgcgcc cgttgggctt tcggcctgcg ctgggctttt cccgggtaga cacgaccatc 34741 agctggtcga gcgtggaaga ggcactgggc tgcctgtctc acaaaaggcg cctgccgttg 34801 aagacgtcgc tggagtttcg tgagcggttc ggtatcgagg tcgaggaact cgacgagtat 34861 gccgagcatg cgccggtatt ggcccggctt tggcgcaacg tcaagacgga ggcaaaggat 34921 taccagcgcg aggacctgaa ccctgagttc ttcgcggcgt gttctcggca tctgcatgga 34981 cgtagcagac tgtggttgtt ccgctaccag ggcacgccaa ttgccttctt tttgaacgtt 35041 tggggtgcgg atgagaacta catactgctt gagtggggca tcgatcgtga ttttgaacat 35101 tataggaagg cgaatctgta ccgggcggcg ctgatgctca gcctaaaaga tgcgatcagc 35161 cgagataaac ggcgaatgga aatgggtatt acgaactatt tcacaaaact tcgcattccg 35221 ggtgcccgag tcataccgac catctatttc ctgcgtcaca gcacggatcc ggtgcatacg 35281 gcaacgttag cgcgaatgat gatgcacaat attcaacggc caacgctacc cgacgatatg 35341 tcggaggaat tctgtcgctg ggaagagcga atacgtctgg accaggacgg gctacccgaa 35401 cacgatatct ttcgcaagat cgatcgtcag cacaaataca cggggctcaa actcggcgga 35461 gtctacggtt tttatccccg attcaccgga ccgcagcgat ccacggtcaa ggccgcggag 35521 ctgggcgaga tcgtgttgct gggcacgaac tcgtatctgg gcctggccac ccatccagag 35581 gtggtggagg cctcggcgga ggccacgcga cggtacggca ccggctgctc gggttcgccg 35641 ttgctgaacg gcacgttgga cttgcacgtc tcgcttgagc aggaactagc ctgttttttg 35701 ggcaaacccg ccgccgtgtt gtgctccacc ggatatcaga gcaacctggc ggcgatcagc 35761 gcgctatgcg aatccgggga catgatcatc caagacgcgc tgaaccaccg cagcctgttc 35821 gacgccgcca ggttgtccgg ggccgacttc accttgtacc ggcacaacga catggaccac 35881 ctggcgcggg tgctacgccg caccgagggg cgccgccgga tcatcgtcgt ggacgcggtg 35941 ttcagcatgg aaggcaccgt cgccgacctg gccaccatcg ccgagcttgc cgaccggcac 36001 ggctgccggg tctatgtgga cgagtcccat gcgctgggcg tgctcggccc cgacgggcga 36061 ggagcttcgg ccgcgttggg tgtcttggcg cgcatggacg tggtgatggg cacgttcagc 36121 aaatcctttg cctccgtcgg cgggttcatc gccggagatc ggcccgtcgt ggactacatc 36181 cggcacaacg gttcaggtca tgtgttttcc gccagcctgc cgccggccgc cgcggctgcc 36241 acccacgcgg ctctgcgcgt cagtcggcgt gaacccgacc ggcgggcgcg ggtgctggcc 36301 gcggccgagt acatggccac cggcctggca cggcagggct atcaggccga gtatcacgga 36361 accgcgatcg tgccggtgat cctgggcaac ccgaccgtgg cgcatgcggg ctatctgcgg 36421 ctgatgcgct ccggggtgta tgtgaacccg gtggcccccc cagccgtgcc ggaggagcgt 36481 tcgggattcc gcaccagcta cctagccgac caccgacaat ccgacctcga ccgggccttg 36541 cacgtgtttg ccggccttgc cgaggacctg accccgcaag gagccgcgct atgaaggagg 36601 ccatcaacgc caccatccaa cggatcttgc gaaccgaccg cggcatcacc gcgaaccagg 36661 tactcgtcga cgacctgggt tttgactcgc tcaagctgtt ccagttgatc accgagctag 36721 aagacgaatt cgacatcgcc atctctttcc gcgacgcaca gaacatcaaa acagtgggag 36781 acgtctacac cagcgtcgcg gtctggttcc ccgaaaccgc caagccggcc ccacttggga 36841 aaggaacagc atgaccgacg acgccgatct tgatctggtc cgaagaactt tcgccgcgtt 36901 tgcccgcggc gacctcgccg agctgacgca atgctttgcg cccgacgtgg agcagtttgt 36961 cccgggcaag cacgccctgg ctggggtgtt ccgcggcgtg gacaacgtgg ttgcgtgcct 37021 cggcgacacc gcggccgccg ccgacggcac catgacggtg acgcttgaag acgtgttaag 37081 caacaccgat ggccaggtga tcgccgtgta tcgattgcgg gccagcaggg ccgggaaggt 37141 cctcgaccag cgcgaggcga tcctggttac cgtcgccggt ggtcggatca cccgacttag 37201 cgagttttac gccgacccgg cggcgaccga aagcttctgg gcatgacggc ggccttgctt 37261 tcaccagcca tcgcctggca gcagatctcg gcttgcacgg accgcacgct gacgatcact 37321 tgcgaggatt ccgaggtaat cagctatcag gacctcatcg cgcgcgcggc ggcatgcatc 37381 cccccgctac ggcgtcttga cctcaaacgc ggtgaacccg tgctgatcac cgcccacacc 37441 aacctggaat tcctgtcctg ctttttgggc ctcatgctcc atggcgctgt gccggtaccc 37501 atcccgccgc gggaggcact gaagaccacc gagcgtttca tgactcggct cggcccactg 37561 ctgcgccatc accgcgtgct gatctgcaca ccggccgaac acgacgagat acgcgctgcc 37621 gccagcaccg actgccagat cagcagattt actgccctag ccgaggctgg cgacgagcag 37681 ttcggccgcg ccacggccca gcaactcgcc gacaccgcca ccgccgactg gccgctatgc 37741 accctcgacg acgacgccta cgtccaatac acctctggca gcaccgcagc accacgcgga 37801 gtggtcatca cctaccgcaa cctgctgtcc aacatgcgcg caatggccgt gggctcacaa 37861 ttccagcacg gcgatgtcat gggcagctgg ctgcccttgc accatgacat ggggctggtg 37921 ggcagcctat tcgccgcact cttcaacagt gtcagcgcgg tattcaccac gccacaccgg 37981 tttctgtatg acccgttggg attcctcaga ctgctcacca gctccggggc tacccacacg 38041 ttcatgccta acttcgctct ggagtggctg atcaacgcct accacaggcg cggcgccgac 38101 atcgaaggca tcgacctaca caaaatgcgc cgcttgatca tcgcctccga acccgtccat 38161 gccgagggca tgcggagatt cgccgccacc ttcgccggcg tcggacttgc ccccacggcc 38221 ctgggttcgg gctatggcct ggccgaagcg accgtcgccg tgtcgatgtc agcgcccaac 38281 acgggattcc gcaccgaaac ccacgccgcc gcggaggtcg tcaccggcgg ccgagtgctg 38341 cctggctacg aggtgcgcat tgacgccgca ccaggtgccc gggccggaac gatcaaactg 38401 cgcggcgaca gcgtggccgc caaagcctat gtgggcggga agaagctgga cgcgctcgac 38461 gaggaaggct tctgcgacac ccacgacttg ggttttcttg tagacgacga aatcgtcatc 38521 cttggccggc aggacgaggt gttcattgtc cacggagaaa acagattccc ctacgacatc 38581 gagttcatca ttcgcgggga atccgagcag caccggacca aagtcgcatg tttcggggtc 38641 aacgaacgcg tcgtggttgt gttggaaagc ccattggaca gcatcatcga caaggccgaa 38701 gccgaccgac tgagatgtca agtcgttgcc gcgactgggc tgcagttgga tgaactgatc 38761 acggttcggc gcggcgcgat tcccaccacc accagcggca agctcaaacg acgcgccgtc 38821 gcgcaggctt atcgagacgg cacactgccc cgtcttgcca cccacgcgtg gacggcggat 38881 cccgatagcg ctcccaaaac gacccggtcc agcctggaag gcgcccactg atcttccact 38941 gacgtctcat caaacccccg gggcgctcgc gcgctgggcg cgctcatcga ccggggcttg 39001 ggttgattgg ccccggctct cttcgcgcgc tgggcgcgct catcgaccgc ggccgggtgg 39061 cccggcgaaa gcttgggcga tcgtcagcca gcgttgtgcg tcctccccta ctgcgttgac 39121 gtcaagagtg ctcagcgcgc gccgctgggt gaccaggaag cagaagtcct cggcggaccc 39181 ggtgacccgc tgggccgcat cggatggccc ccaagaccaa gtgtcgccgc tcggtccccg 39241 cagctcgacc aggaacggct cggccggagg ggttaggttg ttgacgatga acgcgtagtc 39301 gcgggtgcgg acaccgagat gcgcaataga ccgcagtcgc tgggtggcgg gccggatgac 39361 gcccagggcg tcggcgacgt ccagtccatg tgcccaggtc tccatcaacc gcgctgttgc 39421 catcgacgcc gcgctcatcg gtggcccgaa ccaggccaat ttgcggccat cgggaaccgc 39481 cagcagttcc tcgtgcagcc gcccccgagt gacccgccag tctgtgagca gttcggcagg 39541 tgaaacggcc gccagttctg tcgcggcgtc gtcgacgaaa ccggccggat tggccgcggc 39601 ggcggtcatc agctcggcga acccggcctc gtcggtgacc gccgtcagcg ccactcgatc 39661 ggtccacagc aggtggccga tctggtgtgc gatggtccaa cccggcgcag gtgtcggatc 39721 ggcccagcga tccgctggca ggtgcgccac cagcgcgtcg aggtcgtcgc tttcggcacg 39781 caggtctgcc acgaacggcc caggatccgc catcaccacc tcctgaggta acagttcgtc 39841 gggaaaggca tgtttgtacc ctagcgaccg atcacaggct ggccgcggcg cccgacgatg 39901 gtgtgcacca ccagcccggc taggtagatc gccgacccga acagcacaaa tacgggcgcg 39961 tgcccgtgct cgggaatcaa ggctgcggcc acggtaatcg agaggatgta tgagacccaa 40021 aacagtgcat cctgcacggc gaacacgtgc ccgcgcaatg cgtcgtcgac gtccatctgc 40081 atcgccgaat cggcgcacag cttgaccacc tggccggcca cacctaaaag gaagccgcat 40141 accaccatca ccgggaccag cagcccggcg gccgcgacct ggatagtggc ggccgcagcc 40201 aacgcgccat ttgccgtggc gtagcgtccc cagcgccgga tcgcggtcgg agtcaagacg 40261 ttggccagga aggctcccag cccggtggcc gcgaagaaca gcagtgcggt acccaacccc 40321 ccaacggccc gggcggtcac gtggcggacc aggagcaaga tcagcagtga gttgataccg 40381 accaccatcc gatgcgctgc caaaccggac aggccggcag cgacggtcgg aagttgcacc 40441 acggtgcgcg ctccatgtag ccaaccggtg accacggcgt agacagcaga tccgtggatc 40501 gcgcgttcgg tgtcgtccgg gccgagtacc cgcgggccga accgcagcga ccaaagcaac 40561 gcgatcgata cggggatcgc caccaggaag acgatcgcgg aggccccctc gtcgccgctg 40621 ccgagcagcc aacgaggcaa cagcatgaag ttggcgccca ggaacgcgga gaccgccccc 40681 gacgcgatgg ccaccgagtt catcgtgacc acctgttcgc gcggcaccac gtggggcagt 40741 gccgccgaca gtcccgaggc gacgaatcgt gccaagccgt tggcgaccag cgctccgacc 40801 aacagcggca cgtcgccggc tccgaccgcg agtatcgtgc cgaccccggc gatcagggct 40861 agccggccgg tgttggcgcc aaccagcacc caccgccgat cccaccggtc cattagggcc 40921 ccggcgaagg gccccagcag cgaatagggc agaaacagca ccgcgaaggc ccccgcgatg 40981 gccatcgggt cggccgcccg gtccgggttg aacagcaacg ctccggccag ccccgcctga 41041 aacaacccgt cgccgaactg actcgcaacc cgcacctgca gcagacgcca gaagtcgggc 41101 aagctgcgca ccgaccgcca aacgtcgacg ggtgcgcgtg cgtgcatccg ggagtgaatc 41161 actaaaccca cttccaccct gggcacaggc aaggttcggt ccaccccgtg ccgccccaac 41221 cacagtaaaa atattcgccg accctgcttg ttcgccccgg gcgatgcgac ggtggtgcga 41281 tgatggtgtg gtggcgccgc acgaagaccc cgaggaccat gtcgcacccg ccgcacaacg 41341 ggtgcgagcg ggcaccttat tgttggccaa caccgatctc cttgaaccga catttcgccg 41401 cagtgtgatc tacatcgtgg agcacaacga cggcggtacc ctcggtgtgg tcctcaatcg 41461 gcccagcgaa accgcggtct acaacgtgtt gccgcagtgg gccaaactcg cggccaagcc 41521 aaagacaatg ttcatcggtg ggccggtgaa gcgcgacgcg gcgctgtgtc tggcggtatt 41581 gcgggttggc gctgacccgg aaggcgtgcc gggcctaagg catgtcgcgg gcaggctggt 41641 gatggtcgat ctggatgccg accccgaggt gctcgcagcg gcggtggaag gggtgcgcat 41701 ctacgccggg tactccggct ggaccatcgg tcagctcgaa ggtgaaatcg agcgcgacga 41761 ctggattgtg ttgtcggcgt tgccatctga cgttttggtg gggccgagag ccgacctgtg 41821 ggggcaggtg ctgcgacggc agccgctgcc gctgtcgctg ctggccaccc acccgatcga 41881 tctgagccgg aactaggcta ctccgccgcc gagcttgcca gagcagcgcg tcgcgtcgcc 41941 gcggtcgagc caggcgatcc ggcccagcct agtgggccac aggctgttca atgacaggcc 42001 tgggtgcaga ccgcgcagct gccaacgcag ttggcggtgg ggctagcggt ttcacggcgc 42061 agcgcgtact gggcgctctg ccacgacccc gcggccagcg tgccgaccgc gcccgcaatg 42121 cagacgatca ccaccatcaa ggcggtgtgc ccgggcgcgg ccaccgccac cactcccccg 42181 gcggccagca ttactgcggc tgccaactgc gtgggcgcca tcgcgcgcag cgccagcgcc 42241 gtggggtcgg cagtgggcgt atggaacagc gaccagctcc cgaacagggc ggacgccgcc 42301 gccgcacaca tgcacagcac acccgcgagg aacattcgtt caccatacga ggccgccgac 42361 gaatccgctc accgagctcc atgcgggccc gtgtttctgc tcggcctcat cgcgacctag 42421 cgcggcggga ctggtgtcag ggtgcccgcg ggcggatacc caggcgcctg cccgggtagt 42481 cccaccggtg ccgaaccggg tgccggggca ggcgcctgag cgggcgccgc atgcgcaacc 42541 acttggaatc cgttgacaat cgcatcggtg gccggcccgt cggtgaccgc ctgcgacagc 42601 gcggtggtca ccgacagcga aaccaggtac ttgtcggctc cggaggtggc gatgacgtgg 42661 cgccgggagg tgttgagggt catgtcgttt tcgcggtagg tgccctcgat gattgatgac 42721 ggaaagccgt cgaaattggc catcgaggcg tttgtggtct gccatgcgag caatttctgg 42781 ctgtcaatgt agccgtgtgt gatggcctca gcgggatcga agtcaccgat cagcctatac 42841 accaccagct gcgcattcga cgtgtagacg ctgttgccca accggtcggc gatcaccacg 42901 aacgcgtcgg gcacgttggg gtcgggcacc tgagtccagc gcggcggcat cggcagtgtg 42961 atgtcgagcg ccttgaatcc gtgcggtcgc tgtgcctcca gcttgacgcc cttctcccgg 43021 aggtggtccc gaagtgtgcc gctgatcgcg ggagtcactg gcggcggcag cgggggcaca 43081 gcggtggacc cgggtgctcc gaccggaatc ggcgacgcga tcggtgcggg tgctggcgcc 43141 ggtgagaacc tgttgctgct cccgcccgga agcgccgtga ggttctgcac gggcgggact 43201 gttgccggcg ccgagactgg ggcagggata ggcggcggtg gcagcagggg atccgctgag 43261 gccttcccgg cggtgaccag caccacgccg atgaaaccgg tggccatgcc gcctgcgaag 43321 acccgccagg tgcgcgcgat ctggatcatt tgcgtcggtc cctccgaatg gccgggcgac 43381 ggtgcccgtc gtcgaggctg aatgtaacca gcgctccatg gcagtgcaca ggcttgaaat 43441 gcagctggaa tgaacctctg atcgtggtgc aacggaaccg agaccaaccc gtggccggta 43501 gcgcggcccc ggaggttccc gggccaccct tataccctgt tgggcgtgac cgaatcgcca 43561 accgctgggc ctggcggcgt gccccgtgcc gacgacgcgg actccgacgt gccacggtac 43621 cgctataccg ccgagctcgc ggctaggctg gaacggacct ggcaggaaaa ctgggcccgg 43681 ctagggacgt tcaacgtgcc caacccggtc ggctcgctgg ccccaccgga tggtgccgcg 43741 gtgcctgacg acaagctctt cgtgcaggac atgttcccct acccctcggg tgagggactc 43801 cacgttggtc atcccctcgg ctacatcgcg accgacgtct atgcccgcta tttccggatg 43861 gtgggccgta atgtgctgca tgcgctaggg ttcgacgcgt tcgggctgcc cgccgagcaa 43921 tacgcggtgc aaaccggcac ccatccgcgt acccggaccg aagccaacgt cgtcaacttt 43981 cgccgccagt tgggccggct gggcttcggc cacgacagcc gacgaagctt ctcgaccacc 44041 gatgtcgact tctacaggtg gactcagtgg atcttcctac agatatacaa cgcgtggttc 44101 gacaccacag ccaacaaggc gcgcccgata tcagagctgg tcgccgaatt cgagtccggt 44161 gcaaggtgtc tcgatggcgg ccgggattgg gccaagttga ccgcggggga gcgagccgat 44221 gtgatcgacg agtaccggct ggtctatcgg gcggattcgc tggtgaactg gtgcccgggg 44281 ctaggtacgg tgcttgccaa cgaagaggtg accgccgacg gccgcagcga ccggggcaat 44341 tttccggtgt tccggaagcg gttgcggcaa tggatgatgc ggatcaccgc ctatgccgac 44401 cggctgctcg acgacctgga tgtgctggat tggcctgagc aggtcaagac catgcagcgc 44461 aactggatcg ggcgttcgac gggtgcggtg gcgctgttct cggcgagagc ggccagcgat 44521 gacgggttcg aagtcgacat cgaggtgttc accacgcggc ccgacacctt gttcggcgcc 44581 acgtatctgg tgctggctcc cgagcacgac ttggtcgacg agttggtcgc cgcgtcctgg 44641 ccggctgggg tcaacccctt gtggacatac ggcggcggca cacctggtga ggccatcgcc 44701 gcctaccggc gtgcgatcgc cgccaaatca gacctcgagc gccaggagag cagggaaaag 44761 accggcgtct tcttgggcag ctacgccatc aacccggcca acggtgagcc ggtgccgatc 44821 ttcatcgccg actacgtgct ggccgggtac ggtaccgggg caatcatggc ggtgccgggt 44881 catgaccagc gggactggga cttcgctcgg gcatttggtc taccgatcgt ggaagtaatt 44941 gccggcggca atatttcgga atccgcgtat acaggcgatg gcatcctggt caactcggat 45001 tacctcaatg gaatgagcgt gccagcagca aagcgggcca tcgtcgaccg gttggagtcc 45061 gcgggccgcg gccgggctcg aatcgaattc aaattgcgcg actggctttt tgcgcggcag 45121 cggtattggg gtgaaccatt cccgatcgtc tatgacagcg acgggcgtcc gcatgcactc 45181 gacgaagctg cactgcccgt cgagctgcct gatgtcccgg actactcgcc ggttttgttc 45241 gaccccgacg atgcggacag cgagccttcg cccccactgg ccaaggcgac tgagtgggta 45301 cacgtcgacc tggacctcgg tgatggcctg aagccctaca gccgcgacac caacgtgatg 45361 ccgcagtggg cgggcagctc ctggtatgaa ctgcgctaca ccgatccgca caactcagaa 45421 cggttctgcg ccaaggaaaa cgaggcctat tggatgggac cgcggccggc tgagcacggc 45481 ccggacgacc ccggtggcgt cgacttgtac gtcggcggtg ctgaacacgc ggttttgcac 45541 ctgctgtatt ccaggttctg gcacaaggtc ttgtacgacc tgggtcacgt cagctctcgc 45601 gagccttacc gcaggctggt caatcagggc tatattcaag cttacgctta caccgatgcg 45661 cgcggatcct atgtccctgc cgagcaggtg atcgaacgcg gtgacagatt tgtctatcct 45721 ggacctgacg gtgaggtcga agttttccag gaattcggca aaatcggtaa gagcctgaag 45781 aattcggtat cgccggacga aatctgcgac gcatacgggg cagatacgct tcgggtttac 45841 gagatgtcga tggggccgct ggaggcttca cgtccatggg ccacaaagga tgttgtcggc 45901 gcgtaccgtt ttctgcagcg ggtgtggcgc ttggtcgtcg acgagcacac cggcgaaact 45961 cgggtggctg acggcgtgga actcgacatc gatacgctac gggcgttgca ccgcaccatc 46021 gtcggcgtgt cagaagactt tgcggcactt cgcaataaca ccgcaacggc taagttgatc 46081 gaatacacga accacctcac caagaagcat cgtgatgcgg tgcctcgggc cgccgtggag 46141 ccgcttgtac aaatgctggc tccgctggcc ccacatattg ccgaggagct gtggctgcga 46201 ctgggcaaca ccacctcgtt ggcacacggc ccgttcccga aggccgatgc cgcctacctc 46261 gtcgacgaga cggtcgagta tccggtgcag gtgaacggca aggtacgtgg ccgggtggtg 46321 gtggccgccg acaccgacga ggaaacgctg aaagccgccg ttctgaccga cgaaaaggtc 46381 caggcattct tggctggtgc caccccgcgc aaggttatcg tggtcgccgg ccggctggtc 46441 aatctcgtca tctaggtcgt gtcggcggtg ccgacggtgg gcgaggtaat ccgcggggta 46501 gttcgttgta tgcgttacgc cgcgagagcc ggcggcgacc agattggttg atagcgtggt 46561 actttcacgc tcgtttgcga gcaggggagt tgcttgcagg gccactggcc ggttcgcccg 46621 aggcgagacg ctccagtggc gccagggcct tcctgagggt ttccaagtcg gagcggggaa 46681 gttggctgag cagcgcggcc agagccgcgc gccggttggc cagtgactca ccgtgaaccg 46741 cccgcccttg cggcgtgatg tctaccaaca ccgcccgcaa gtcggacggg tctcgcgagc 46801 gtttcaccag tccaatcttc tcgagccgcc ggatcgccac ggtggtggtg ggagttcgca 46861 cccgttcgtg agcggccagg tcggtcatcc ggatgggacc ttgatcgagc agggtgacca 46921 ggatcgacag ttgcgccagc gttaggtcgc cggctgcagc cccgttggga tccccgcggc 46981 gcagcattga aatcagcttg gacaatgcgc ggtgcagccc ctccgccagt tgggtcactt 47041 ccggtgcggt gaattcgctg tccgccataa accggcagtc taacctgaca tgcgtgtgac 47101 cgtagacttg tgtcgggcga cctttgaccg ccaatgcatt tggtcccgaa atccgctgca 47161 ttttcttgcc aatcgagcgg acaacactca tgtcatggct gactacctac attgtcagtt 47221 ctgccggatc catggtcagt gatgtcgaat gccactgacc gccaacggaa accggctctc 47281 gcgttaacgg gacagtcaat attggagacg ccggcagccg ctgctggctt caccatcgga 47341 tcggcgtaat tagggcaccg gtgaggaggg ctggtagctt ctggcgaagc cagggatcgg 47401 cgccccaaac gggccgggac aagcgccctc gggcgggacc aatactcggc ggcggaacag 47461 ttcggccagc atcgtctggg ccatcagctc ggaacggccg atgcaggcag ccctcgcagc 47521 ttcaggttcg cgccgatgga ttgcggcgtt ctcttcctcg tagaacggca acacgtcgtc 47581 gcggctgttt tggtatgtca tccagaacac tcgcggaatc aggttctgtg aggcccggat 47641 ggtggcgtgc agccgcggtc ctgcgtactc gtcgttgacc gtgcgccggt actcccacac 47701 gcattcggcg aaggcccgcg actccttgga gttgcgcagc gatcgcatga ctgcgtcgag 47761 ctggcccagg atccgaggcg tggggttggc ggctgcgcgg gcagaggcaa tgccgttgag 47821 caagccgtcg agttcgtgat gttccaggat ggtggcgacg tcgaaccgct cgatgaacgc 47881 gccgcggtga tagcgagtcg acacaatgcc gtcgtgttcg agttgaacca gcgcctcttg 47941 gatgggaacc cggctgaccc ccaggccgtg cgcgatttca ttgcggtcga cgcggtcccc 48001 gctgcgcagt ttgccggtca atagcaggtt gaggatgtgg gcgacaacct ggtccttttc 48061 cttaaccccg tacttttttg gcatcggtat ctagcatctc tttcagcccg ctgcagccat 48121 ccggcgctgg caagtttctc atgactcggc gtctgcgttg tggtgtttcc cagatgaagc 48181 cgggggtaac gcgatctgac agacgtcaac cggagttcac cggccatcgc gccacctgca 48241 aagcgcggcc gcagcgctca ggtcgtagtc gggaccgtca cagccaacgg tcaacagcgt 48301 gacaccgaga ccggcgaggg cttcggcgct ggcgatcagc ccgccgccgt cgaccgcggc 48361 ggagcgttcg atagtcgctg ggtttcggcc gacggtcgag cagtgcgtgc tcagcacggc 48421 cgacttcgct aggtagctgt ccccggcggt aaagctgtgc cagatatcgg catactcggc 48481 gaccagtcgc agggtcttac gctctccgcc gccgccgatc agcaccggga tgtcccgtgt 48541 cggcggcggg ttcagcttgc caagccgcgc cttgatccgg ggcagcgcag ccgccaggtc 48601 gtcgaggcgg ctgcccgctg tgccgaaccg gtagccgtac tcgtcgtagt ccttctgttt 48661 ccagcccgac ccgataccca ggatgagccg gccgccggag atgtggtcga cggtacgggc 48721 catgtcggca agcagctccg gattgcggta ggagttgcac gtcactagag cgccgatttc 48781 gatgtgcgac gtttgctcgg cccaggctcc caagacggtc cagcattcga agtgtgggcc 48841 gtcagggtcg ccgtagagcg gaaagaagtg gtcccaggta aaagcgatgt ccacaccgat 48901 gtcctcgcac cggcggacgg cgtctcggac ggcgcggtaa tggggggcgt gctgcggctg 48961 cagttgtacg ccgatacgaa cggggagatc gggacgcacg agtgaagtca tgggtccacc 49021 gtaggctcag cgtgtgtcga gcaccccgcg cacgatctcg atcagggcgc gcggttggtc 49081 actttgcacc gagtggcctg acttctcgac gatgtgaacg ccacggaaat gcgttgcacg 49141 cctgtggagt tcggcggtgt cctggtcggt gacgaagccc gacgagccgc cgcgcacgag 49201 tgtgatcggc gcggacaggg cgtcgacgtc gtcccagagc cctgcgaaat ctccgaacgt 49261 gcggatcgcg tcatagcgcc acacccagtt gccgttgtcc agccggcggg agttgtggaa 49321 cacgccgcgg cgcaacgact tgacatcgcg gtgcggggcc gcggcgatcg ttaggtccag 49381 catggcctga aagctgggga attcccgctc gccgtgcatc agcgccaccg tgccgcgctg 49441 ctcggcggtc agctcggcgt gccgttgcaa tgccgacggg gtgacgtcga cgagaacgag 49501 ttcgccgacc aggtcgggtg ccatcgcggc cagccgtatc gcagtcaacc cgcccagcga 49561 catgccgacc acgaattcgg cacccggcgc aagctcgcgt agcaccggcg ccaaggtctc 49621 ggagttgagc tgcggcgagt aattgccgtc ctcccgccaa gcggaatggc cgtgccctgg 49681 aaggtccacc gccagcgccg gctcacccag gccgacgatc acggtgtccc aggtatgggc 49741 gttctgtccg ccgccgtgca gaaagatcac ccgcggcgca gagccgcccc agcgcagcgc 49801 gctgatggct cccgcttgga cccgctcgac ttcaggcagt ggaccattga caccggcctg 49861 ctcagcgttc tcagccagca gggcaaactc gtccagtccg gtcagttcgt cgtcagatag 49921 cacgcagcgg acgttacccg cgtttgactc tgcggatacc aggcaattgt gcgagtggcc 49981 cgcgtggtaa gcgcagagtc aacgctaacc gatgatgaac tcttcgagtt gcgcgcgcgc 50041 gatgtcgtcg ggcagctgct cgggcgggct cttcatcagg taggccgacg ccgggatcac 50101 cggtccgccg atgccgcggt ccttggcgat cttggccgcc cgcaccgcgt cgatgatgac 50161 gccggccgag tttggcgagt cccacacctc gagcttgtac tccaggttca acggcacatc 50221 tccgaaggcg cggccctcaa gacggacata ggcccatttg cgatcgtcga gccatccgac 50281 gtggtcggac gggccgatgt gcacgtcctt ggtcttgaac tcgcgcttca gattcgaagt 50341 gacggcctgg gtcttggaga tcttcttgga ctccagccgt tcacgttcga gcatgttgag 50401 gaagtccatg ttgccgccca cgttgagctg catggtgcgg tcgagctgca cgccgcggtc 50461 ctcgaacagc ttggccagca cccggtgggt gatcgtcgcg ccgacctggc tcttgatgtc 50521 atcaccgacg atgggtaccc cggcgtcggt gaacttcttg gcccacaccg ggtcggaggc 50581 gatgaacacc ggcagcgcgt tgacgaacgc caccccggcg tcgatagcac actgggcgta 50641 gaacttgtcg gcttcctccg agcccaccgg caaataggag accagcacgt cgaccttggc 50701 ctccttgagc gcctggacga cgtcgacggg ctccgcgtcg gagagttcga tggtgtcggc 50761 gtagtacttg ccgatgccat cgagggtagg cccgcgctgc acgatcacgt tggtcggcgc 50821 cacatcggcg atcttgatgg tgttgttctc cgaggcgaag atggcgtcgg acaggtcgaa 50881 gccgaccttc ttggcgtcca cgtcgaacgc cgccacgaac ttgacgtcgc gaacgtggta 50941 cgggccgaac cgcacgtgca tgaggcccgg tacggtcgat gtgtcgtcgg cgttgtagta 51001 gtactcgacg ccctggacca gcgaggacgc gcagttgccg acgccgacaa tggcgactcg 51061 aacctccgtc gacgcctccg gcgccggtaa cgactggtgc tcactcatta aggcgttctc 51121 ctaacctcat aacctctggg gtgtcttggg tgttggttcg tgctgggttt acgtctgttc 51181 ggcggggttg ggtgctgccc gttccgcggc gatgagctcg ttgagccact tgacctcgcg 51241 ctcgctggac tcgagcccga gttgatgcaa ttggcgggtg tagcggtcga aggaactgct 51301 ggcccgcgcc accgcctcgc gcaagccttc ccggcgttcc tcgacctggc ggcgccggcc 51361 ttccaggatg cgcatccgcg cttcggccgg ggtgcggttg aagaacgcca ggtgcacccc 51421 gaaaccgtcg tcggtgtagt tgtgtgggcc ggtgtcggcc accagctcgc cgaatcgacg 51481 gcgacccttg tcggtcagtt ggtaaacgcg tcgtgctcgc cgcaccgggg tgcccgctgg 51541 ggcggcattc tcggcgatca acccgtcggc ctgcatgcgt cgcagcgccg ggtataacga 51601 accgtacgaa aatgcccgaa acgcgcccag caggccggtc agcctcttgc gcaactcgta 51661 gccatgcatc ggtgactcga tcaacagacc caggatggcg agctccagca tcgagtcacc 51721 tccttttgta tggcttttga atggccgtta cgacggttcg acgcctcgcg tcatcgtatc 51781 gcctcgatat atttgcgaca acatcaccgc gtcaagacgg gtagctgacg tgcttgatgg 51841 tgccgtcacc tgcgaaaacg aggtatccac cgccgtagtc gctagagaca tacaacgaca 51901 acgacaacgc agccggcgtg gtggggtcct tggccggttc gacgatcagg tacatgcttt 51961 tgacgtcgga ttgtttgagg ccgagggttt ccggggcgcc gcgcatgatg ctcacagcgg 52021 tcttcgcatc gaatttgctc aggtcaacca cggacacgtc ggcaatgctc ttggcggaac 52081 tggtcgcatc gccccagccg ccgcggtagg tatacgccag gactcggcgg tcgtccgccg 52141 ggtcgacgcg atcgagcgac gcatactccg ggtagatcac cagccggtag cccatggtgt 52201 cgccgaaccg cttgcgggtc tgctccagca ggccggtgag cccgccgagg gaatgcagct 52261 gcctgggcgg ggtcagcacc acgggggcga tcccgtcggg ctttgctccg ggatccgagg 52321 tgaagtccag cggagagcgg gtgttgccgt acacgcccca gccgatgccg acgcccagca 52381 gcaccgatgc gacaaacgca gcggccagca agcccaactc ggtgcgtttc gcccgcgatt 52441 tgagcgcggg catttgtgcg ggtgcgctct cgacctgcag gtcggccacc agacgctgca 52501 ggtcacctag ggtcacagcc ttggtagctg cgctgacgcg ctcccggtgt tcctccatcg 52561 agagctcgcc gtcacgcagg gcgtcgtcga gaatccggca ggcgtcctgc cggtcgctgt 52621 ctttggcgcg ggttgccgtc gatactccgc gcgcaagggg tgcgcccagc cacttcgcca 52681 cagggacgat agtaggagtc tggctgggaa tctgaactcg atcccgccgt acccgcgcaa 52741 caacggcgcc ggttgcgtat cggtggtgtg gatggcgtcg tactctggtc agcgtgcgac 52801 tgcagcgaca ggtagtggac tacacgctac ggcgacgctc cctgctggcc gaggtgtatt 52861 cgggacgcac cggtgtgtcg gaggtgtgcg acgccaaccc ctacctgctg cgcgccgcaa 52921 agtttcatgg gaagcccagc cgggtcatct gcccgatctg ccgcaaggag cagctcacac 52981 tggtgtcgtg ggtgttcggc gagcacctcg gtgcggtatc agggtccgcg cgcaccgccg 53041 aagaactgat cctgctggcg acccggttct ccgagttcgc ggtccacgtg gtggaggtat 53101 gtcgaacctg cagttggaat catctggtca agtcatacgt cctgggcgcc gcacgtccgg 53161 cacgcccccc tagggggtct ggcgggacgc ggacggcgcg caacggggcc cgcacggcca 53221 gtgaatagcg acgggcgtca ccatcagtcg tccagcggcg ccccgcgcgg gccggcgaat 53281 cccggccagc gtggtcaggt tccacccgac gacagactga ccgcgatcct cccgccggtg 53341 accgatgacc gatcggctcc gcacgcggac tccatcgagg cggtcaaggc cgcgctcgac 53401 ggcgcgccgc cgatgccccc gccgcgcgac ccgctcgagg aggtcacggc cgcgttggcc 53461 gccccgcccg gtaaaccgcc gcggggggat cagcttggtg gcagacgtcg cccaccgggg 53521 ccgcccgggc cccccggttc gtccggacag cctgccggcc ggctgcccca accgagggtg 53581 gacttgcccc gggtcggcca gatcaactgg aaatggatac ggcgttcgct gtacctcacc 53641 gcggcggtgg tgatcctgtt gctgatggtc accttcacga tggcctacct gatcgtcgac 53701 gttcccaagc caggtgacat ccgtaccaac caggtctcca cgatccttgc cagcgacggc 53761 tcggaaatcg ccaaaattgt tccgcccgaa ggtaatcggg tcgacgtcaa cctcagccag 53821 gtgccgatgc atgtgcgcca ggcggtgatt gcggccgaag accgcaattt ctattcgaat 53881 ccgggattct cgttcaccgg cttcgcgcgg gcagtcaaga acaacctgtt cggcggcgat 53941 ctgcagggcg gatcgacgat tacccagcag tacgtcaaga acgcgctggt cggttccgca 54001 cagcacgggt ggagcggtct gatgcgcaag gcgaaagaat tggtcatcgc gacgaagatg 54061 tcgggggagt ggtctaaaga cgatgtgctg caggcgtatc tgaacatcat ctacttcggc 54121 cggggcgcct acggcatttc ggcggcgtcc aaggcttatt tcgacaagcc cgtcgagcag 54181 ctgaccgttg ccgaaggggc gttgttggca gcgctgattc ggcggccttc gacgctggac 54241 ccggcggtcg accccgaagg ggcccatgcc cgctggaatt gggtactcga cggcatggtg 54301 gaaaccaagg ctctctcgcc gaatgaccgt gcggcgcagg tgtttcccga gacagtgccg 54361 cccgatctgg cccgggcgga gaatcagacc aaaggaccca acgggctgat cgagcggcag 54421 gtgacaaggg agttgctcga gctgttcaac atcgacgagc agaccctcaa cacccagggg 54481 ctggtggtca ccaccacgat tgatccgcag gcccaacggg cggcggagaa ggcggttgcg 54541 aaatacctgg acgggcagga ccccgacatg cgtgccgccg tggtttccat cgacccgcac 54601 aacggggcgg tgcgtgcgta ctacggtggc gacaatgcca atggctttga cttcgctcaa 54661 gcgggattgc agactggatc gtcgtttaag gtgtttgctc tggtggccgc ccttgagcag 54721 gggatcggcc tgggctacca ggtagacagc tctccgttga cggtcgacgg catcaagatc 54781 accaacgtcg agggcgaggg ttgcgggacg tgcaacatcg ccgaggcgct caaaatgtcg 54841 ctgaacacct cctactaccg gctgatgctc aagctcaacg gcggcccaca ggctgtggcc 54901 gatgccgcgc accaagccgg cattgcctcc agcttcccgg gcgttgcgca cacgctgtcc 54961 gaagatggca agggtggacc gcccaacaac gggatcgtgt tgggccagta ccaaacccgg 55021 gtgatcgaca tggcatcggc gtatgccacg ttggccgcgt ccggtatcta ccacccgccg 55081 catttcgtac agaaggtggt cagtgccaac ggccaggtcc tcttcgacgc cagcaccgcg 55141 gacaacaccg gcgatcagcg catccccaag gcggtagccg acaacgtgac tgcggcgatg 55201 gagccgatcg caggttattc gcgtggccac aacctagcgg gtgggcggga ttcggcggcc 55261 aagaccggca ctacgcaatt tggtgacacc accgcgaaca aagacgcctg gatggtcggg 55321 tacacgccgt cgttgtctac ggctgtgtgg gtgggcaccg tcaagggtga cgagccactg 55381 gtaaccgctt cgggtgcagc gatttacggc tcgggcctgc cgtcggacat ctggaaggca 55441 accatggacg gcgccttgaa gggcacgtcg aacgagactt tccccaaacc gaccgaggtc 55501 ggtggttatg ccggtgtgcc gccgccgcct ccgccgccgc cgtcggaggt accaccttcg 55561 gagaccgtca tccagcccac ggtcgaaatt gcgccgggga ttaccatccc gatcggtccc 55621 ccgaccacca ttaccctggc gccaccgccc ccggccccgc ccgctgcgac tcccacgccg 55681 ccgccgtgac cggcgcgctg tcccaaagca gcaacatctc gccacttcct ttggccgccg 55741 atctgcggag cgccgataac cgcgattgcc ccagccgcac cgacgtattg ggtgccgctc 55801 tggcgaatgt cgtcggtggc ccggtaggcc ggcacgcgct gatcggccgc acccggctga 55861 tgaccccgct gcgggtgatg tttgcaatcg cgttggtgtt cctggcgctc ggttggtcga 55921 cgaaagcggc ctgcttgcag tccaccggaa ccggtccagg tgatcagcgg gtggccaact 55981 gggataacca gcgtgcttac taccagttgt gctactccga tacggtgccg ctctatggcg 56041 ctgagttatt gagccaaggc aagtttccgt acaaatcaag ctggatcgaa accgacagca 56101 acggcacacc gcagctgcgc tacgacggac agatcgcggt gcgctatatg gagtatccgg 56161 tgctgactgg gatctatcag tacctgtcga tggcgatagc caagacctac accgcgttaa 56221 gcaaggtggc tcccctcccg gtggttgccg aagtggtgat gttcttcaac gtcgccgcgt 56281 tcggtttggc gctggcgtgg ctgacaaccg tctgggcgac ctcgggcctg gccggccgcc 56341 ggatatggga tgcggcgctg gtggccgcct caccgctggt gatctttcag atattcacca 56401 atttcgatgc gctggcaacg ggtttggcga cgagtgggct gctggcctgg gcgcggcgca 56461 gaccggtgct tgccggtgtg ctgatcgggt tgggctccgc ggcgaaactg tatccgctgt 56521 tgttcttgta cccgttgttg ctgctgggca tccgggccgg tcgcctgaat gctctggccc 56581 gcaccatggc ggccgcggcg gcgacctggt tgttggtgaa tctgccggtg atgctgctct 56641 ttccgcgcgg ctggtcggag ttcttccggc tcaacacccg gcgcggcgac gacatggact 56701 cgttgtacaa cgtcgtcaag tcgttcaccg gctggcgtgg cttcgacccc accctgggct 56761 tctgggagcc gccgctggtg ctgaacacgg ttgtcacgct cttgttcgtg ttatgttgtg 56821 cggcaattgc ttacatcgcg ctcaccgcac cccaccggcc gcgcgtggcg cagctgactt 56881 tcttgacggt ggccagcttc ctgttggtca acaaggtgtg gagtccccag ttctcgcttt 56941 ggctggtgcc gctggccgtg ctggctttgc cgcaccgccg gatcttgctg gcgtggatga 57001 cgatcgacgc gttggtgtgg gtgccgcgga tgtactacct atacggcaac ccgagccgct 57061 cgctgcccga gcagtggttc accacgacgg tgttgctgcg tgacatcgcc gtgatggtgc 57121 tgtgcggact ggtggtctgg cagatctacc gccccgggcg cgacctcgtg cgtaccggcg 57181 ggccaggggc actgccggct tgtgggggag tcgacgaccc ggtgggaggg gtctttgcca 57241 acgccgccga cgccccgcca ggtcggctac cgtcgtggct gcgtccccgg ctgggcgacg 57301 agcatgcgcg agagaggacg cccgatgcag gtcgcgatcg cactttttcc gggcaacacc 57361 gcgcttgacg cggttggccc ctacgaggtg ctgcagcggg tgccgtcgtt cgacgtcgtg 57421 ttcgtcggcc accgccgcgg ggaggttcgc agcgacaacg ccatgctggg tctgctgtgt 57481 gacgcggcat tcgacgagct aacccggccc gatgtggtga tctttccggg cggcatcgga 57541 actcggaccc tgatccacga ccagaccgtg ctcgactggg tgcgcgaagc gcaccggcac 57601 accctactca ccacctcggt gtgcaccggc gggctggtgt tggcggctgc cggactgctc 57661 aacggcttga ccgcgaccac gcattggcga gtacaggatc tgttcaactc gctgggcgcc 57721 cgatacgtcc cccagcgtgt cgtcgagcat ctgccagagc gggtcatcac cgccgccggg 57781 gtgtcgagcg ggatcgacat gggattgcgg ctggtggagc ttttggtcag ccgggaagcc 57841 gccgaagcga gccagctgat gatcgagtat gacccgcagc caccggtgga tgccggctcc 57901 ctggccaagg cctcgccggc tacccatcgg ctcgcgttgg agttctatca gcatcgtttg 57961 tgatctgttc gcgataggcc tcgccgttcg cgacactgac attgcgcaca cgacacgccg 58021 cggatcgtcg caccgggtta agcctggagt gcggtggtgc ctggtcggca ttttcgcagt 58081 cgagggctct cgtgtagcct gggcgagttg ccgacgcagg cgaccctcct gccacggatc 58141 gaccgtggcc gcacacgacc acaggaggtg atgaggttcc tatgcgtcca tacgaaatca 58201 tggtcatcct cgacccgacc ctcgacgaac gcaccgtagc cccgtccttg gagacgttcc 58261 tcaacgtcgt ccgtaaggac ggcggaaaag tcgaaaaggt ggacatctgg ggcaagcgtc 58321 ggctggcgta cgagatcgcc aagcatgccg aaggcatcta cgtggtgatc gacgtgaaag 58381 ccgccccggc gacggtgtcc gaactcgacc gccagctcag cctcaacgag tcggtgttgc 58441 gcaccaaggt aatgcgcacc gacaagcact aatcggcctg ccaggcactg gctgttcgct 58501 gtcggtgcgg ttacgtaggc tcggcgaaga agaacacgac cagccgccga acccaggcgg 58561 acgcaggagg aaattgtggc tggtgacacc accatcacca tcgtcggaaa tctgaccgct 58621 gaccccgagc tgcggttcac cccgtccggt gcggccgtgg cgaatttcac cgtggcgtca 58681 acgccccgga tctatgaccg tcagaccggc gaatggaaag acggcgaagc gctgttcctc 58741 cggtgcaata tctggcggga ggcggccgag aacgtggccg agagcctcac ccggggggca 58801 cgagtcatcg ttagcgggcg gcttaagcag cggtcgtttg aaacccgtga gggcgagaag 58861 cgcaccgtca tcgaggtcga ggtcgatgag attgggcctt cgcttcggta cgccaccgcc 58921 aaggtcaaca aggccagccg cagcggcggg tttggcagcg gatcccgtcc ggcgccggcg 58981 cagaccagca gcgcctcggg agatgacccg tggggcagcg caccggcgtc gggttcgttc 59041 ggcggcggcg atgacgaacc gccattctga ccccaagaac tgcaaatcaa gaaacggaaa 59101 gatagacact catggccaag tccagcaagc ggcgcccggc tccggaaaag ccggtcaaga 59161 cgcgtaaatg cgtgttctgc gcgaagaagg accaagcgat cgactacaag gacaccgcgc 59221 tgttgcgcac ctacatcagc gagcgcggca agatccgcgc gcgtcgggtc acgggcaact 59281 gcgtgcagca ccagcgagac atcgcgctcg cggtgaagaa cgcccgcgag gtggcgctgc 59341 tgccctttac gtcttcggtg cggtagcgcc gaatgtccaa cggagagtgc aaaataccat 59401 gaagctcatt ctcacggccg atgtcgatca cctcgggtcc atcggcgaca ctgtcgaggt 59461 caaggacggg tatggccgta actttctgct cccgcgcggc ctggcgatcg tcgcctcgcg 59521 cggagcccag aagcaggctg acgagatccg ccgggcccgc gaaaccaaaa gcgtacgcga 59581 cctagagcac gccaacgaga tcaaggcggc gatcgaggcg ctcggcccga tagcgctgcc 59641 ggtgaagact tcagctgatt ctgggaagtt gttcggctcg gtgaccgccg cagatgtggt 59701 tgctgccatc aagaaggccg gtggaccaaa cctcgataag cggatcgttc ggctgcccaa 59761 gacgcacatc aaggccgtgg gcacgcattt tgtgtcggtg cacctgcacc cggaaatcga 59821 tgtcgaggta tcgctggacg tcgtggcgca gagctaaggc aagctgaggc cacaacagtt 59881 tgcgcatgcc ggtggtgacc gcggtcggcc gccgccgggg tttcgccatg ccctgggtgt 59941 ccaccgcacg gtccggtgcg gtgatgctgg cgaactattc ggccggcgtt tgcgggcggg 60001 tgtcttcacc gggccttaac gtcaggaaaa tgtgtctgaa agccaacacg cccggcgcgg 60061 taacctggct cgacacgccg aagagattct tgtccacaca aacggcgtcg cgttgtatgg 60121 ccgttaacag cagtgatgtc gtaacgggcc gtattgatcc acaggttctc cacaccccgc 60181 tcaacacaga cgtcgacgga tatgcacatg cgatgcacag ctccataaac agtggcccct 60241 tggagtactt gccagcaacg tttagcgtct tcccggcgct aggcgatgtg ggtgacttgg 60301 gcggtggtgt cggtgcggcg acttacgctc tggataggtt gtcgaatatg cgttcgggtg 60361 cttgtgtcgg aggaggtgag agcccatggc ggtcgttgat gacctagcgc ccggcatgga 60421 ctcctcaccg cccagtgaag attacggccg tcaaccaccg caggatctcg ccgccgagca 60481 gtccgtgctg ggcgggatgt tgctgagcaa ggacgccatc gccgatgtac tggaacggct 60541 acggcccggc gatttttatc gtccggcgca tcagaacgtc tacgacgcca ttttggacct 60601 gtatgggcgg ggagaaccgg ctgatgcggt gacggtggcc gccgaactgg atcgccgtgg 60661 gctgctgcgc cgcatcggcg gtgctcccta cctgcacacc ctgatctcga cggtgccgac 60721 ggccgccaac gcgggctact acgcgagcat cgttgccgaa aaggcgctgc tgcgccggct 60781 ggtagaggcc ggaacccggg tggtgcagta cggctatgcc ggcgccgaag gcgcggatgt 60841 ggccgaggtg gtcgatcgcg cgcaggccga aatctacgac gtcgcggatc ggcggctgtc 60901 ggaagacttt gtggcgcttg aggacctgct gcaaccgacg atggacgaga tcgatgccat 60961 cgcttccagt ggcggcctgg cgcgcggggt ggctaccggc ttcaccgaac tcgacgaggt 61021 caccaacggc ctgcatccgg ggcagatggt catcgtggcg gcgcgcccgg gcgtgggaaa 61081 gtccaccctt gggctggact tcatgcggtc atgctcgatc aggcatcgga tggccagcgt 61141 catcttctcg ctggagatga gcaagtccga gattgtcatg cgactgctgt cggcggaggc 61201 caaaatcaag ctctccgaca tgcgttcggg ccggatgagc gatgacgact ggacccggct 61261 ggcgcggcgg atgagcgaaa tcagcgaagc gccactgttt atcgacgact cgcccaacct 61321 gaccatgatg gagatccgtg ccaaggcgcg ccgcctgcgg caaaaggcca acctgaagtt 61381 gatcgtggtc gactacctgc aactgatgac ctcgggcaag aagtatgaat cacggcaggt 61441 ggaggtgtcg gagttctcgc ggcatctgaa gctgttggca aaagagcttg aggttcccgt 61501 ggtcgcgatc agccagctca accgtgggcc cgagcagcgt accgataaga aaccgatgct 61561 ggccgacctc agggaatcgg gctgcctgac cgcgtccacc agaatcttgc gcgccgatac 61621 cggcgctgag gtcgccttcg gtgagctcat gcgaagcggt gaacgtccca tggtgtggtc 61681 gctggacgag cggctgcgca tggtggcccg gccgatgatc aacgtgttcc cgagcgggcg 61741 caaggaagtg tttcggcttc ggctggcttc cggacgcgaa gtcgaggcca ccggcagcca 61801 cccctttatg aagttcgaag gctggactcc cttggcgcag ttgaaggttg gtgaccggat 61861 cgcagcaccg cgccgggtac ctgagcccat cgacactcag cggatgcccg agtctgagct 61921 catttcgctg gctcgcatga tcggtgacgg gtcgtgcctg aagaaccagc cgatccgcta 61981 cgagccggtg gatgaggcga acctggccgc ggtgacggtc tcggcggcgc actcggatgg 62041 ggctgcgatc cgcgacgact acctcgcagc tcgagtgccg tcgttgcgcc cggcgcggca 62101 acgactaccg cgcgggcggt gcacgccgat tgcggcgtgg ctggctggcc tagggctatt 62161 cacgaaacgc agccacgaaa aatgcgtacc ggaggctgta tttcgcgccc ccaatgacca 62221 ggtggcgttg tttctgcggc atctgtggag cgctggtggc tctgttcggt gggatcccac 62281 gaatggtcaa ggccgggtct actacggctc aaccagtagg cgtctcatcg acgatgtggc 62341 tcaattgctg cttcgggttg ggattttttc ctggatcaca cacgccccaa agttgggcgg 62401 ccacgattcg tggcggctgc acattcatgg cgcgaaggat caggtcaggt tccttcgtca 62461 cgtcggcgtt cacggcgccg aagcggtggc ggcccaagag atgctgcgtc agctcaaagg 62521 accggttcgc aacccgaacc tggacagcgc gccgaaaaaa gtatgggcgc aagtccgcaa 62581 ccgactgtcc gccaaacaga tgatggacat ccagctccac gaaccgacga tgtggaagca 62641 ttccccgagc cggtcaaggc cgcatcgcgc ggaggcgcgg atcgaagatc gagcgatcca 62701 tgagctggcg agaggcgacg cgtactggga caccgtcgtg gagatcacca gcattgggga 62761 tcaacatgtt ttcgatggga ctgtaagcgg cacacacaat ttcgtcgcca atggcattag 62821 tttgcacaat tcgctggaac aagatgccga cgttgtcatc ctgctgcatc gacccgacgc 62881 ctttgaccgc gacgatccac gtgggggaga agcggatttc attctcgcca aacaccgcaa 62941 cggtccgacg aagacggtca ccgtagcgca tcaactgcac ctgtcacgct tcgccaacat 63001 ggctcggtga catgcggatg tgtggggttt cacggagcgt ggccgaatgt cacgaatgat 63061 ggggccatca gggcggaccg gtccacgcat ccgcggcggc gttgaagtcc ccgagcaaca 63121 cgcgtcgtgg ttgatgcgtg agatgagtca gatcagggcg acaggacgtc gaaccagtgg 63181 gactaatgca tgatcaccag atacaagcct gagtcggggt ttgtcgcccg tagcggtggt 63241 cccgaccgga agcgtcccca tgactggatc gtttggcact tcacccatgc cgacaatctc 63301 cctgggatca tcaccgctgg ccgtctgctg gccgattcag cagtcacccc gacgaccgag 63361 gtggcatata acccagtcaa ggagttgcgc cgccacaaag tcgtcgcccc cgacagcagg 63421 tacccggcgt cgatggcaag cgatcatgtg ccgttctaca ttgcggcgcg gtcgcccatg 63481 ctctacgtcg tatgcaaggg ccactccggc tactccggcg gtgccggccc gctggtgcac 63541 ctcggggtgg cgcttggcga catcatagac gcggatctga cgtggtgcgc cagtgacggc 63601 aatgctgcag ccagctacac caagttcagc cgccaggtcg acacgctcgg caccttcgtc 63661 gactttgacc tgctctgcca gcggcaatgg cacaacaccg atgacgaccc caaccgccag 63721 agccgccgcg ccgccgagat cctggtatac ggccatgtcc cgttcgagct ggtcagctac 63781 gtgtgttgct ataacaccga gacgatgaca cgggtacgaa ctctgctcga tcctgtcggt 63841 ggggtgcgaa agtatgtcat caagcccggc atgtactact aaggaaggag gaggccatat 63901 gatcacgtac ggctctggcg acctccttcg ggctgacacc gaagcgctcg tcaacaccgt 63961 caactgtgtt ggggtgatgg gcaagggaat tgcgctgcag ttcaaacgcc gctaccccga 64021 gatgttcacc gcctacgaaa aggcgtgcaa acgcggcgaa gttaccatcg gcaagatgtt 64081 cgtcgtcgac accggacagc tcgacggacc gaaacacatc atcaacttcc ccaccaagaa 64141 acactggcgt gcaccgtcga agctggccta tatcgacgcc ggcctcattg atctcatccg 64201 cgtgatccgt gaactcaaca ttgcttctgt ggcagttccc ccgctggggg tgggcaacgg 64261 aggtctggat tgggaagatg tcgagcaacg gctcgtatca gcattccagc agctgcccga 64321 cgttgacgcc gtgatctacc ccccatcagg tggatctcgc gccatcgagg gcgtcgaagg 64381 acttcggatg acctgggggc gcgccgtcat actcgaagcg atgcggcgat atctccagca 64441 gcgccgcgcg atggagccgt gggaagaccc tgcagggatc tcgcatctgg agattcagaa 64501 gctcatgtac ttcgccaacg aggccgatcc cgatcttgcg ctagatttca cgcccggccg 64561 atacgggcca tacagcgaac gtgtccgtca cttactgcaa ggaatggagg gcgcattcac 64621 agtcggcctg ggtgacggca ccgcaagagt tcttgcgaac caaccgatct cgttgactac 64681 taagggaact gacgccataa cggactatct ggccaccgat gcggcagctg accgggtgag 64741 cgccgcagtc gacacggtgt tgcgcgtcat cgaaggcttt gaaggcccat acggggttga 64801 gctgctcgcc agtacgcatt gggtggccac acgtgagggc gccaaggaac cagccacggc 64861 agcggccgcg gtccgaaagt ggacaaaacg caagggtcgg atctacagcg acgatcgcat 64921 cggtgttgcc ctcgaccgca ttcttatgac tgcctgaaag cgaccggctc gtcgttaagg 64981 atgtgcgccg acgcccagcc gtcagggagc gttgggctgc tcggacggaa ttgccccacc 65041 gcaaccaccc ggtggcggcg ggccggggag gggctcaccg ccgctgacac aatcgaagta 65101 aaactgtggg ccggtaaacc acgtttgcat ccactggtgc caaaacgagc cgtcggggta 65161 cttctcgccg tcgcacacgg ccaagtcgcc aaaaccccat cggccacccg ggcaatagcc 65221 tttcgtcatg tccggctgat gcgggtcagg tggatctgcg ctggcaaccg aggcaggaaa 65281 cacaagcgcc gctgcacaac ccagtatcgc agtactcagg cgagcaaact tcaacttcat 65341 ttcaaactcc gtcaaacgtt gaatcgactc ggcggactcc aagcgatggt cagcgcttgc 65401 ggatgagccg cggcaatgag tcgtagtggg cagacattcc cgagaacagc ctgaaatcct 65461 gttcggttga tgccgtgccg gcatcgacgt accaggacga ggcactgact cgggaaggca 65521 cagccgccgt ggcgattgta tatgacgcgt cggactgggc agcgatggcg cgggactctg 65581 cccgggcgcc ggccttggac acggccagcg cccgccacct gtcgtcggca tttggcgttt 65641 gtcgaattgc ggcattattt tgctcgggtg atgtcatcag ctattggttc ggtcgcgcgg 65701 tggatagtcc ccctcctggg ggttgcagcc gttgcttcca tcggtgttat cgcggacccg 65761 gtgcgggtcg ttcgggcccc ggcgttgatc ctggtcgatg cggcaaaccc gctggccgga 65821 aagcccttct acgtcgatcc cgcctcggcg gccatgatcg ccgcgcgcaa cgccaacccg 65881 ccgaacgccg agctgacctc cgtcgccaac accccgcagt cctactggct cgaccaggca 65941 ttcccgccgg cgaccgtcgg cggcacggtt gccaggtaca ccggagcggc gcaggcggcc 66001 ggcgccatgc cggttctgac gctgtatgga atcccccatc gcgactgcgg tagctacgca 66061 tccggtgggt tcgcgacggg cactgattac cgcgggtgga tcgacgctgt cgcatccggc 66121 ctgggctcat cgccggcgac gatcatcgtc gaacccgatg cgctggccat ggccgactgc 66181 ctgtcgcctg accagcgcca ggaacgtttc gacttggtgc gctacgccgt cgacacgctg 66241 acccgcgacc cggccgctgc cgtgtacgtc gatgcggggc attcgcgctg gctgagcgcc 66301 gaggcaatgg ccgccaggct caacgatgtc ggtgtgggcc gcgcgcgcgg gtttagcctc 66361 aacgtctcga acttctacac caccgatgag gaaatcggct atggcgaggc gatttcgggg 66421 ctcacgaacg gttcgcatta cgtgatcgac acgtcgcgca acggcgccgg acccgcgccc 66481 gacgccccgc tcaactggtg taaccccagc ggccgcgccc tgggcgcacc gcccaccacg 66541 gcgaccgcgg gcgcgcacgc cgacgcttac ctgtggatca aacgtcccgg ggaatcggac 66601 ggaacctgcg gtcgcgggga gcctcaggcg ggtcggttcg ttagccagta cgccatcgat 66661 ctggcccaca acgccggcca gtagagacct cacgcgcaga ccggctgagc gtgcggccgt 66721 tgggccgtcg gcgtcgggtt cggccaggtg gggtaacggt tcgggcacgt ttccactacc 66781 tcgtgacacg tcatgcggca ccgcggttcg ggtggtcgac aatgcgggac atgacccaaa 66841 attcggggtg ctgccggccc gcagcgtcgg gctgcgccgc gctggtgacc gtcgcgagac 66901 gggagcccga cgttggcgcg tgagatctca cgccagacgt ttctgcgggg tgccgccgga 66961 gcgttggccg ccggcgcggt cttcggctcg gtccgggcta ccgcggatcc ggctgcctct 67021 ggctgggagg ctctttcttc cgccctcgga gggaaagtgc tacaaccgga cgacggtccc 67081 caattcgcaa cggccaagca ggttttcaac accaactaca acggctatac gccggcggtg 67141 atcgttaccc cgacatcgca gctggacgtg cagaaggcga tggcgttcgc tgccgcgaac 67201 aacctcaagg tggccccacg cggtggcggg cactcctacg tgggggcgtc cacggccaac 67261 ggcgccatgg tgctcgacct acgtcagcta cctggggaca tcaactacga cgccaccacc 67321 gggcgggtca cggtgacgcc cgccaccggt ttgtacgcca tgcaccaggt gttggccgcg 67381 gccggccggg gcatcccgac cggcacctgc ccgacggtcg gtgtcgcggg acacgcgctg 67441 ggcggcgggc tgggcgccaa ttcccggcac gccggcctgc tctgtgacca attgacgtcg 67501 gcgtcggtgg tgctgcccag cggccaggcg gtcaccgcgt ccgccaccga ccaccccgac 67561 ctgttctggg cgttgcgcgg tggcggtggc ggcaacttcg gcgtgacaac ctcgctgacc 67621 ttcgcgacgt tccccagcgg ggacctcgac gtcgtgaacc tcaatttccc accgcagtcg 67681 ttcgcgcagg ttctggtcgg ttggcagaat tggctgcgaa ccgccgaccg aggcagctgg 67741 gcactggccg atgccaccgt cgacccgctg ggcacgcatt gccgcatcct tgcgacctgc 67801 ccggccgggt cgggcggcag cgtggcggcc gccatcgttt cggccgtcgg aacgcaaccg 67861 accggcaccg aaaaccacac gttcaactat ctggacctgg tcagatatct ggccgtcggg 67921 aacctcaacc cgtcgccgct gggatatgtc ggcggatccg atgtcttcac gacgatcact 67981 ccggcgaccg cccagggaat cgcctcggcg gtcgacgcct ttccgcgtgg agcgggccgc 68041 atgttggcga tcatgcacgc cctcgacggc gcgctcgcca ctgtgtcacc gggggccacg 68101 gccttcccgt ggcgtcggca gtcggcgctg gtgcagtggt acgtcgaaac atccggctcc 68161 ccgtcggaag cgactagctg gctcaacacc gcacatcaag cggtgcgagc gtattcggtt 68221 ggcggctatg tgaactatct cgaggtaaac caaccgccgg cacgttactt tggcccgaat 68281 ctgtcccggc tgagcgcagt acgtcagaag tatgacccca gccgggtcat gttctccggg 68341 ctgaacttct agcagccccg catgagtact agcccctagg acgggccatc ctcgtctacc 68401 ctgggaagtg atcatggaac tttccgtgtc tgttatcgcg gggttggtca tcgcactgct 68461 ggcggccatc acccctgctg cgggcgaacg cccggaaagc cgccgccagg cgctcgcaaa 68521 tgccgccgag gccggggagc atccggccac atcaccgttg cgacggtagc cgattcgtcg 68581 cgatacggct gtggagttag gaggcgcgga tggagacagg ttcgccggga aaacgtccgg 68641 tcttgcccaa gcgtgcccgc ctgctggtga cggcaggcat gggcatgctc gcgttgctgc 68701 tgtttggacc ccggctagtc gatatttacg ttgactggtt gtggtttggt gaggtcggtt 68761 tccgcagcgt ctggatcacg gtactgctga cccgcctggc gattgtcgca gcggtcgcac 68821 ttgtggtggc cggcattgtg cttgctgccc tactgctggc gtatcgctcg cggccgttct 68881 ttgtacccga cgagccgcag cgggacccgg tcgcgccact tcgcagcgcg gtgatgcgcc 68941 ggccgcgcct gttcgggtgg ggcatcgccg tcacgctcgg tgtggtgtgc gggctgatcg 69001 cttcgttcga ctgggtgaag gttcagttgt tcgtacacgg gggcaccttt ggcatcgtgg 69061 accccgaatt cggctatgac attgggtttt tcgtcttcga tctgccgttc taccggtcgg 69121 tgctgaactg gctgttcgtg gccgtggttc tggcgtttct agcgagcctg ttgacgcatt 69181 acctgttcgg cggccttcgg ctgacaaccg gcagaggcat gctgacccag gcagctcgcg 69241 ttcaactcgc agtgttcgcc ggcgcggttg tactgctgaa ggcggttgcc tactggttgg 69301 atcgctatga gctgttgtcg agtggacgta aggagccgac cttcaccggc gccggctaca 69361 ccgatatcca cgccgagctg ccggccaagc ttgtgctggt ggcgattgcg gtattgtgtg 69421 cggtgtcatt ctttaccgcg atctttttgc gcgacttgag gattccggcg atggccgccg 69481 cactgctggt gctgtcggcg atcctggtcg gtggactgtg gccgctgctg atggagcagt 69541 tctcggtgcg tcccaacgcc gccgatgtcg aacgcccata tatccaacgc aacatcgaag 69601 cgacccgcga ggcgtatcgg atcggtggcg attgggtcca gtaccgtagc tatccgggca 69661 tcggtaccaa acagccgcgc gacgtgcccg tggatgtcac cacgattgcc aaggtgcggc 69721 tgttggaccc gcatatcctg tcccgaacct tcacccagca acagcagctc aagaatttct 69781 ttagcttcgc cgagatactc gacatcgatc gctatcgcat cgacggtgag ctgcaggact 69841 acatcgtcgg cgtccgggag ctctcgccga aaagcctcac cggcaatcag accgactgga 69901 tcaacaaaca catcgtctac acgcatggca acggcttcgt ggccgccccg gccaatcggg 69961 tgaacgcggc ggcccgcgat gccgagaata tttccgacag caacagcggg tacccgatat 70021 acgccgtcag tgacatcgcg tcgctgggtt ctgggcgcca ggtcatcccg gtcgagcagc 70081 cgcgggtcta ctacggcgag gtgatcgccc aggccgatcc ggactacgcg atcgtgggcg 70141 gagccccggg gtccgcgccg cgcgagtatg acaccgacac gtccaagtac acctataccg 70201 gcgccggggg tgtgtcgatc ggaaactggt tcaaccgcac ggtgtttgcc accaagttcg 70261 cccagcacaa gttcctgttc tcccgggaga tcggctcgga gtcgaaggtg ttgatccatc 70321 gcgacccgaa ggaacgggtg caacgcgtgg cgccgtggtt gaccaccgac gacaacccct 70381 atccggtggt ggtgaacggg cggatcgtct ggatcgtcga cgcctacacc accttggaca 70441 cctatccgta cgcacaacgc agctcgctcg agggcccggt gaccagcccg accggcattg 70501 tgcggcaagg caagcaggtg tcgtacgtgc gtaactccgt caaggcaacc gtggacgcct 70561 acgacggaac cgtaacgctg tttcagttcg atcgagacga cccggtgctg cggacctgga 70621 tgcgtgcctt tcccggaacc gtcaagtccg aagaccagat tcccgacgag ttgcgtgccc 70681 acttccgtta tccggaggac cttttcgagg tccaacgtag cttgctggcc aagtatcatg 70741 tcgacgaacc gcgagagttc ttcaccacca acgccttctg gtcggtgccc agcgacccga 70801 ccaacgacgc taacgccact caaccgccgt tctacgtcct cgtcggcgac cagcagagcg 70861 cccagccgtc cttccggttg gcgtcggcga tggttggcta caaccgcgaa ttcctctccg 70921 cgtacatctc ggcgcactcg gatccggcga actacggcaa gctgaccgtg ctggagttac 70981 ccaccgacac cctgacccaa ggcccgcaac aaattcagaa ctcgatgatc tccgacactc 71041 gggtcgcctc cgagcgcacc ctgctggaac ggtcaaaccg gattcactac ggcaacctct 71101 tgtcgctgcc gatcgccgac ggcggcgtgc tctatgtgga accgctctac accgagcgga 71161 tctcgacaag cccgagcagt tcgactttcc cgcaactttc ccgggtgctg gtcagcgtgc 71221 gtgaaccccg caccgagggc ggggtccggg tcgggtacgc accgaccctg gccgaatctt 71281 tggatcaggt atttgggccc ggcaccggtc gggtcgccac cgctcccggc ggtgatgccg 71341 ccagcgcgcc accgccggga gccggcgggc cggcaccgcc gcaggccgta ccgccaccga 71401 gaacgaccca accgccggcc gccccgcccc gggggccgga cgtccccccc gcgacggtgg 71461 ccgaactgcg ggaaacgctg gccgatctgc gcgcggtgct cgaccggtta gagaaggcca 71521 tcgatgccgc cgaaacgccc ggtggataag ccggcattct tagccggtga actccgagcg 71581 ctgttctggc gctaatctga cgctagaata gcgctatggc taccattcaa gttcgggatt 71641 tgcccgaaga tgtcgccgaa acctatcgac ggcgcgccac cgcagcgggg cagtcgctgc 71701 agacgtatat gcgcaccaag ctcatcgaag gggtgcgggg ccgagacaag gccgaggtaa 71761 tcgagatcct ggaacaggcg ctcgccagca ctgccagccc aggcatcagc cgggagacca 71821 tcgaggcatc ccggcgggag ctcaggggtg gatgaatgtg tagtcgacgc ggcggccgtg 71881 gttgacgctc tcgccggcaa gggcgccagc gcgatcgttc tgcgcggttt gctcaaggag 71941 tcgatttcta acgcgccgca tttgctggac gcagaggtcg gacatgcact ccgccgcgcc 72001 gtgctcagcg acgaaatctc cgaagagcag gctcgcgccg cgttggatgc cttgccttat 72061 ctcatcgaca atcgttaccc gcacagccca cgactgatcg aatacacatg gcagctaagg 72121 cacaacgtca cgttctacga cgccctttac gtcgcactgg ccaccgcact ggatgtcccg 72181 ctgctcacgg gcgactcgcg gcttgcggcc gcgccgggcc ttccgtgcga aatcaaactc 72241 gttcggtgac atccctttgc gggacgccaa tggcgccgtc gtagccgggc cagcccgtcg 72301 tcagccttgg acagcctcca gcgctgcatt gaacgtcttg ctgggccgca tcaccgccgt 72361 agtcatgtcg ctgtccggcg cgtagtagcc gccgatgtcc accggttcgc cttgtacctc 72421 ggtgagctct cgcacgatga cgtcttcgtt tttggtcaac acatctgcca gcgaggcgaa 72481 gtgttcggcc agctgctggt cgtcggtctg cgcggccagc tcttgtgccc agtacatggc 72541 gaggtagaac tggctgcccc ggttgtcgag ttcaccggtt ttgcgcgacg gactcttgtc 72601 gttgtccagc agcttgccga tggcggcatc cagggtctta cccaagagtt tggcccgctc 72661 gttaccggtc ttgatgccga tatcctcgaa accggcgccc agcgcgagga actcacccag 72721 agaatcccag cgcaggtgat tctcctccac caattgtttg acgtgcttgg gtgccgaacc 72781 gcccgccccc gtctcgtaca ttccgccgcc ggccatcagc ggaacgacgg acagcatctt 72841 ggcgctggtg cctaactcca ggatcgggaa caggtcggtg aggtagtcgc gcaggatgtt 72901 gccggtcgcg gcgatggtgt ccagtccacg gaccaggcgc tcgcacgtgt agcgcatgga 72961 tcgcacttgc gacatgatct ggatgtccag accttcggtg tcgtgatctt tcaggtatgt 73021 cttgaccttc ttgatcagct cgttctcgtg cgggcggtac gggtccagcc agaacagcac 73081 cggcatcccg gagatgcgcg cgcgggtgac agccagcttg acccagtcac ggatcggtgc 73141 gtccttgacg atgcacatgc gccagatgtc gccggcttcc acgttctcgg tcagcagcac 73201 ctcgccggtg gcgacatcga cgatgttggc gacgccgtcc tcgggaatct cgaacgtctt 73261 gtcgtgcgag ccgtactcct cggcctgctg ggccatcaga cccacattgg ggacggtgcc 73321 catcgtcgtc ggatcgaact ggccatttgt cttgcagaag ttgatgatct cctgatagat 73381 gcgcgagaag gtggactccg ggttgaccgc cttggtgtcc ttgagctttc cgtcggcgcc 73441 atacatcttg ccgcccgcgc gaatcatcgc gggcatcgag gcgtccacga tcacatcgct 73501 cggcgagtgg aagttggaga tacctctggc cgaatcgacc atcgcgagct cggggcggtg 73561 ttcgtggcaa cggtgtaggt cctcgatgat ctcgtcgcgt tgcgacgccg gcagcgactc 73621 gatcttgctg tacagatcgg acaagccatt gttgacgttg acgcccaagt cgtcgaacag 73681 ctcctggtgc ttggcgaagg cgtccttgta gaagatcctg accgcgtggc cgaagacgat 73741 ggggtggctg accttcatca tggtcgcctt gacgtgcaag gagaacatca cgccggtctc 73801 gaacgcatcc tgcatctgct cttcgtagaa gtcgcacagc gctttcttgc tcatgaacat 73861 gctgtcgatg acgtcgccgt catccagcgg cacctcgggc ttgagcacga tcgtcttgcc 73921 gctcttggcc agcagttcca tcctcacgtt gcgcgcgcgg tccagtgtca tcgacttctc 73981 gccggcgtag aagtcaccgt gccgcatgtg cgctacgtgg gtgcgtgagg ccatcgacca 74041 ctcgcccatg ctgtgcgggt gcttgcgcgc gtactccttc accgccttgg gcgcccgacg 74101 gtccgaattg ccttggcgta gtaccgggtt caccgcgctg cccaggcatc tggcgtagcg 74161 ctctttgatg gccttctcct ggtcagtgtt cgggtccgcc gggtagtctg ggaccgcgta 74221 acccttgtct tgcagttcct tgatggcggc taccagctgt ggcaccgagg cgctgatgtt 74281 cggcagcttg atgatgttgg tgtcgggtag ctgagtcagc cggcccagtt cggcgaggtt 74341 atccggtacc cgctgctcct cggtcaggta atcggggaat tccgccagga tgcgtgccgc 74401 tacagagatg tcgctggcct cgatcttgat gcccgccggt tcggcaaagg cacgcacaat 74461 cggcagaaag gcgtaggtcg ccagcagcgg cgcctcgtcg gtcagcgtgt aaatgatggt 74521 cggctgttcg gcgctcatgg tgttctcccg gcgtcactgt cggtcagatg ctgaatcact 74581 ccgcgttgta gcggcggtta ccagtatcgc ggattgcgcc gcacatgatt cgggcggtgt 74641 tctgcgcgac gacgatcact ttctgtttgc ccgaaggccg tcgagggcga cgtcggtcac 74701 ctttgcggcc aactcagcgt tgtagctctg catcgcttgg cagccgacta ggagcgtctt 74761 gacttcaagc acgtctacgt ccggccgtac ggtgccggcg cgctgggcgg cgcgcaacag 74821 gtcggtgagc aggtccaaga aatctgcctc ggcttccggg gccgcgctgc tgatttcaat 74881 cccgacgccg gccagcgcct cgaccaggcc gcgatcggtg gcgccccact gcaataccat 74941 cgaccgcagg aatgcaaaca gcgcgtcgcc gggatgcttg gatttgagca gggcatgtcc 75001 cttgtcgatg atgcggtgca tccggtcggc gatcaccgcc tgaaacagcg cctccttggt 75061 cgggaaatgc cggtataccg tgcctgcgcc gactccggcg cgccgagcga tctcgtcaac 75121 gggcaccgat agaccgtcgg ccgcaaaggt ttggtaggca acctccaata cgcgtgcccg 75181 gttacgggcc gcgtcggcac gcacccgccg gtcagtagga gccaagtcgt acctccgaaa 75241 gccttgacaa agcggggcgc gcgttccgta tagttcggct aagcggagcg ctcgccccgc 75301 ttagtcaaag catagcgagg agccctcatg accaaatgga ctgccgccga cattcctgac 75361 cagaccggcc ggaccgccgt catcacgggg gccaacaccg gacttggatt cgagaccgcc 75421 gcagcgcttg ccgcccatgg tgcacacgtg gtgctggctg tgcgcaacct cgacaagggc 75481 aagcaggcgg cggcacgcat caccgaggcc acccccggcg ccgaagtaga gcttcaggag 75541 cttgacctga cctcgctggc gtcggtgcgc gccgccgcgg cacagctgaa gtctgaccac 75601 cagcgcatcg acctgctgat caacaacgcc ggggtgatgt atacaccccg acagaccaca 75661 gcagacggct tcgagatgca gttcggcacc aaccacttgg gccatttcgc gttgaccggc 75721 ctgttgattg atcgactgct gcccgtcgcc ggttcacgag tggtcaccat cagcagcgtc 75781 ggccatcgca tccgtgccgc aatccatttc gacgacctcc agtgggaacg ccggtacagg 75841 cgggtcgccg cctacggcca agccaagctc gccaacctgc tgttcactta tgaacttcag 75901 cgtcggttag caccgggcgg aaccaccatc gcggtcgcgt cgcacccggg agtgtccaac 75961 accgaactgg tccgcaacat gccacggccg ctcgtcgcgg tggcggccat actggcgccg 76021 ctgatgcaag acgccgaact gggggccctg ccgacattgc gtgccgccac cgatcccgcg 76081 gtgcgcggcg gccagtactt cggacccgat ggcttcggtg aaatacgggg ctacccgaag 76141 gtggtggcct ccagcgccca gtctcacgac gagcagctgc agcgccgcct gtgggctgtg 76201 tccgaagagc tcaccggggt cgtctatccc gtcggatgag ccggactcaa cggcaacggt 76261 tggtcaacac tcgacgatgt tgactgcgac gttgatggcg agcccgccgg ccgaggtttc 76321 cttgtacttg gtgtgcatgt ccgcgccggt ggcgcgcatg gtgtcgatga cctggtcgag 76381 ggtgacgcga tggatgccgt cgccgcgcaa tgccatccgt gcggcgttga tggccttgcc 76441 ggcggaaatc gcgttgcgtt cgatgcaggg gatctgcacc agcccggcga tggggtcaca 76501 ggtcaggccg aggctgtgtt ccatggcgat ctcggcggcg ttttccactt gtcgcggtgt 76561 gccgccgagg atttcagcca atccggcggc ggccatggcg gccgcggagc cgacctcgcc 76621 ctgacagccg acctcggctc cggagatcga tgctcgctcc ttgaacaacg atccgatggc 76681 tccagcagtg agcaggaatc gcacggtgac atcgtcgggg tcccccgcgc cggccgacgt 76741 gtagtggatt gcgtagtgca ggaccgccgg cacgatgccg gcggcaccgt tggtcggggc 76801 ggtgacgacg cgcccaccgg aggcgttctc ctcgttgact gccagcgcga ccaggttgac 76861 ccagtcctca gcgaattccg gcttgcgagt ggggtcttcg gcgttcaagc ggtcatacca 76921 caccttcgct cgccggcgca cccggaggcc gccaggaagc aacccttcgc gagcgatgct 76981 ccgctgttcg cactcaacca tgacgtcgcg caggtgcagc agcgcggcgc gtacctcgtt 77041 ctcggtgcgg caacatgttt cgttgcgcag cgccgcttcg ctaattgaca cgtcgaggcg 77101 gtcacagatg tccagcagtt cttgggccga cacgtaggga agggcaactg agcatggatg 77161 ttggccgctg ttgccgctgg tctgttccgt gacgatgaac cctccgccca ccgaaaaata 77221 agtctcggtg gccaagacgc ggccgtgtgg gcccgcggca gtgaacgtca ttccgttggg 77281 atgcgttggc agaacgatgt cgggatgcag gtcgatatca cgctcggtca gcgggaccgg 77341 aatgacaccg ccgattcgcg tcacgccgga cgctgcgatc tcggcgagcc ggcgttcctt 77401 gtgttcggtg gtaatcgttt ctggctggca gccttccagc cccagcaata tcgccgacat 77461 ggtgccatga ccggctccgg tggccgcgag cgagccgaac agatccactc gcatcgcctc 77521 gaggtcatcc aggtggcccc ggcggcgcag cgcaactacg aactggtttg ccgcgcgcat 77581 cggtcccacg gtgtgggaac tggacggccc gatgccgatg gtgaacaggt cgaagacgct 77641 gatggtcatg tccggtgcag ttccgggtag agcggatagc gtgcggccag ccgctggacc 77701 tgggcgcgca gcggacccag ctggtcgtcg ttggtggccg tcagtgccgc cgcgatgagg 77761 tctgccacgg cgcggaagtc gttgtgggag aagccgcgtg cggccagcgc cggggtgccg 77821 attcgcaggc ccgaggtgat catcggggga cgagggtcga agggtaccgc gttgcggttg 77881 acggtgatgt ccacggcggc caaccggtct tcggcttgct ggccgtcgag ttcggcgtcg 77941 cgcaggtcga ctaggacgag gtgcacatcg gtgccgccgg ttagcaccgc gatgccacgt 78001 tcggcgacgt cgggctgggt caaccggccg gcaaggatgc gcgcgccgtc gaggcaacgt 78061 tgttggcgct gcgcgaattc aggttgtgct gccatcttga atgcggtggc cttggctgcg 78121 atgacatgcc cgagcggccc gccctgctgc ccagggaaga ccgcggaatt gatcttcttg 78181 gcgatggccg ggtcattgca caagatgatg ccgccgcggg gcccgccgag cgtcttgtga 78241 gtggtggagg tgacgacgtg ggcgtgcggc accgggctgg ggtgcacgcc agcggcgacc 78301 aggccggcga aatgcgccat atccaccatg agcacggcgt cgacttcgtc ggcgatggcg 78361 cggaagcggg cgaaatccag ctggcgtggg tacgccgacc agccggcgat gatcattttg 78421 ggccggtgtg tgcgcgctgc ctcggcgacg gcatccatgt cgaccaggta gtcctctttg 78481 gacacctcgt aggcggtggc gtggtagagc ttgccggaaa agttgatccg catcccgtgg 78541 gtcaggtgac cgccatgagc cagcgacaac cccaggatgg tgtcgccggg gtttagcagc 78601 gcatgcatgg tggcggcgtt ggcggtggcc cccgaatgtg gttgcacgtt ggcgtattcg 78661 gcgccaaaga gcgctttgac gcggtcgata gccaactgct cgacaccgtc gacgaattca 78721 cagccaccgt agtagcgccg gcccgggtag ccttcggcgt acttgttggt caagaccgaa 78781 ccttgggcct gcatcacggc cagcggtgca tagttctccg aagcgatcat ctccaagccg 78841 gattcttgac ggcgcagctc gccgtcgatc agggcggcga tgtccgggtc gaaggcggtc 78901 agggagtcgt tgagggtgtt catcagctca gtccggtctg ttcggcgtac tcgggggcgg 78961 tcaagggtgt tcccggagca atcggctgcc cggccaaatg ggcatccggc ggccgcgaca 79021 tcgtttcggc cacggcgagg tcgccaacag ttcgatcgtg cggttcagac aagggccaac 79081 tccggtttcg acgagcccgg atcgcgccgg gctggttgcg ccctccccgc tctgtcctga 79141 aacctgagag tctgcggcgt cgcatcatgg cgccgctcta caccttcggt caggcacggt 79201 cggtgcgacc gtccctgtct ccagagttgc ctcggcggtg tggtgcttgg gcctgagaga 79261 ttctcgggga ggagattgct cctacggcgc ctcgacatgg aggttctccc acatcgcgtc 79321 agcggctgtt cgattgtgac ggaaagcaac atacacacca cgcatgtgtt ttgtcaccct 79381 gcggtcgctg gtagtcggac ggcccaatca gacagcgcgg gtcatatcac gcgttcgtgc 79441 acagttgggt gtttatccac aggggtgcgt ttgtcggcgg ctggcggggc gtggcggcga 79501 tagcattcga atatgagttc gatcacggtg tcggtggacc cggtggaccc ggtggacccg 79561 gtggacccgg tggacccggt ggacccggtg gacgccgtgg tcgccgcggg atcagacggg 79621 ctcactgtgg cccgcatcga gtccgagatc ggggccttgg agttcctgaa cgaactgcgc 79681 actgaactca agagtggaca gtttcgacct caaccggtgc gggaacgcaa gatccccaaa 79741 ccgggcgggt tgggcaaggt acggcggctg gggattccca cagtggccga ccgggtcgtt 79801 caggcggcgt tgaaactggt gctagaaccc atctttgaga ccgacttcga gccggtctcc 79861 tacgggtttc ggcccgcgcg acgcgcgcac gacacgatcg ctgagattca cttgttcggc 79921 acccaggagt atcgctgggt gctcgacgct gatatcaagg cgtgctttga ccgcatcgac 79981 cacgcggacc tgatggaccg ggtgcgtcac cggatcaaag acaagcgggt gttgcggctg 80041 gtgaactggc agcgcattcg gcatcgctgg aattggaccg acgtccgccg ctggctcacc 80101 gaccccaccg ggcggtggca ccccatcagc gcggacggga tcaccctgtt taaccccgcc 80161 gcggtgccta ttcggcgata ccgctatcgg ggcaacacga tccccactcc ctggactcag 80221 gctgtctgaa ccaccccatc ggcagattcc gtgaagagcc agatacggtg aaagtcgcac 80281 gtccggttcg aagggcggcc acgggaaacg gacccgcagc aacgcgggca ccgcacccat 80341 ggtcgaccca actgccacgc acccggtgac cggtgcgaag tccaccatat cgaccagtgg 80401 gcaaccggcg gctcaaccga tatcgacaaa ctcaccttca cctgcacacc caaccacaag 80461 ctagtcggga aaggctggca gacaaggaaa cggtccgacg gccaaacgga atggatcccg 80521 ccaccccacc tcgaccgcgg tgcccacacc aacgactacc accaccccga acgcctcttc 80581 gaccactagc gggccgcgcc ctgaccacaa aacgtcaaga ccaggcccca caagtgcgcc 80641 acgttggtag cgtctgggaa tgctcttcgc ggccctgcgt gacatgcaat ggagaaagcg 80701 ccgcctggtc atcacgatca tcagcaccgg gctgatcttc gggatgacgc ttgttttgac 80761 cggactcgcg aacggcttcc gggtggaggc ccggcacacc gtcgattcca tgggtgtcga 80821 tgtattcgtc gtcagatccg gcgctgctgg accttttctg ggttcaatac cgtttcccga 80881 tgttgacctg gcccgagtgg ccgctgaacc cggtgtcatg gccgcggccc cgttgggcag 80941 cgtggggacg atcatgaaag aaggcacgtc gacgcgaaac gtcacggtct tcggcgcgcc 81001 cgagcacgga cctggcatgc cacgggtctc agagggtcgg tcaccgtcga aaccggacga 81061 agtcgcggca tcgagcacga tgggccgaca cctcggtgac actgtcgagg tcggcgcgcg 81121 cagattgcgg gtcgttggca ttgtgccgaa ttccaccgcg ctggccaaga tccccaatgt 81181 cttcctcacg accgagggct tacagaaatt ggcgtacaac gggcagccga atatcacgtc 81241 catcgggatc ataggtatgc cccgacagct gccggagggt taccagactt tcgatcgggt 81301 gggcgctgtc aatgatttgg tgcgcccatt gaaggtcgca gtgaattcga tctcgatcgt 81361 ggctgttttg ctgtggattg tggcggtgct gatcgtcggc tcggtggtgt acctttcggc 81421 tcttgagcgg ctacgtgact tcgcggtgtt caaggcgatt ggcacgccaa cgcgctcgat 81481 tatggccggg ctcgcattac aggcgctggt cattgcgttg cttgcggcgg tggtgggcgt 81541 cgtcctggcg caggtgttgg caccactgtt tccgatgatt gtcgcggtac ccgtcggtgc 81601 ttacctggcg ctaccggtgg ccgcgatcgt catcggtctg ttcgctagtg ttgccggatt 81661 gaagcgcgtg gtgacggtcg atcccgcgca ggcgttcgga ggtccctagc ggtgggcgat 81721 ctcagcattc agaacctcgt cgttgagtac tacagcggtg gatacgcgct taggccgatc 81781 aacggtttga acctcgacgt ggcagccggg tcgttggtga tgctgctcgg acccagcggc 81841 tgcggcaaga cgacactgct ttcctgtctg ggcggcattc tgcgcccgaa gtctggggcg 81901 atcaagttcg acgaagtcga catcacgacg ctacaaggcg ccgagctggc gaactaccgg 81961 cgtaacaagg tcggcatcgt gttccaggcg ttcaatctgg tgcccagcct gaccgctgtc 82021 gagaacgtga tggtgccgtt acgctcggcc gggatgtcac gcagggcgtc gcgtaggcgt 82081 gccgaagaac tgctggcgcg cgtcaatctc gcggaacgaa tgaatcatcg acccggtgat 82141 ctgagcggag gtcagcagca acgagtcgcg gtggcacgcg cgattgcgct ggatccgcca 82201 ctgatcctcg ctgacgaacc gaccgcacac ctggatttca tccaggtgga ggaggtgctg 82261 cggttgatcc gcgaactggc cgatggcgag cgtgtggtcg tggtcgcaac ccacgacagc 82321 aggatgttgc cgatggccga tcgcgtcgtt gagctgacac ccgatttcgc ggagacaaat 82381 cggccacctg aaaccgtaca tcttcaggcc ggcgaggtgc tgttcgagca gagcacgatg 82441 ggcgacctga tctacgtggt gtcggagggc gagtttgaga ttgtgcacga ttggccgacg 82501 gcggtgagga attggtcaag gttgccgggc cgggggatta cttcggcgag ataggcgtgc 82561 tgtttcacct gccgcgctcg gcgaccgtgc gtgcccgcag cgacgcgacg gccgtcggct 82621 ataccgtgca ggcgtttcgt gagcggctcg gcgtgggggg tctgcgcgat ctgatcgagc 82681 atcgtgcgct tgccaacgac taacccggct tggccggaac tagccactgc cggggcagcg 82741 gtggcggttc acaccgcgtg cgcgtttgga ggtccctgag cgatgggcga tctgagcatt 82801 agccaggtgt cggcgcgtcc gggacggatc gggattcgcg ctaggcaaat gttcgacgga 82861 taccggtttc agcgtggtcc cgtgctggtc gtggtcgagg atggtcggat cagcgcggtc 82921 gattttgctg gctccgcctg ccccgatatg aacctggttg atctgggtga atcgactttg 82981 ttgccgggtc tggtggatgc gcatgcgcat ttgtgctggg accccgacgg taggccagag 83041 gatttggccg gcgaccccca tgcggtgctg gtgggacggg cgcgacggca cgccgcggcc 83101 gcgttgcgct ccgggatcac cacgattcgc gatctcggcg accgtgacta tgcggccttg 83161 gcgctgcggg aggagtatcg gcagaaaacg acggtggggc cggaactggt ggtttctggg 83221 ccaccattga ctcgcagcgg cgggcattgc tggttcctcg gcggcgtggc cgatagcgtc 83281 gaggagctgg ttgatgcggt gcaggagcgg gccgcgcggg gagcggattg gatcaaggtg 83341 atggccacgg gcggattcgt taccacagca tccgatccgt ggcagccgca gtacggcagc 83401 ggccaactgg ccgcggtggt ggcggccgcc gagcaggtag gtctaccggt gaccgcacat 83461 gcacatgcca ccgcagggat cgccgcggcg gtcgccgcgg gtgttgacgg catcgagcac 83521 tgcacgttct tgagcgaagg cagcgccgcc gccagcccgg atgttgttga agcgattgtt 83581 gcccaaggtg tgtggtgcgg tatgacgatt ccccgggtgt atccggagat gccggagaac 83641 cttgtcgcgg ttgtgcagga tggatggcga aacatccgcc ggctcatcga cgccggtgcg 83701 cgtgtcgccc tgtccaccga cgctggagtc gccccgggca gacgccatga cgtgctcccc 83761 gacgatttgg tgtatctgtc tcgacacggg ttcaccagca cagaggtgct gaccggcgcc 83821 accgcagcgg ccgctgccag ctgtgggctc ggccaccgca agggtcgcat cgcgccgggc 83881 tacgacgctg atctgctggc tgttgcggca ggtgtggacc atgaccccgc cggactctgc 83941 gacgtcaaag ccgtctggcg cagcggaacc caggtaccgc tacaagcatc cgctgtgggc 84001 tacaacaccc cgtcataacc ccgtcataaa atgcaggaca gcatcttcaa tctgttgacc 84061 gaggaacagc ttcggggtcg caacacgctc aagtggaact atttcgggcc cgatgtagtg 84121 ccactgtggc tggcggagat ggactttccc accgcaccgg ctgtgctcga cggggtgcgg 84181 gcgtgcgtcg acaacgagga gttcggctac ccgccgttgg gcgaggacag cctgccgagg 84241 gcgacggccg attggtgccg acaacgctac ggttggtgcc cccgaccgga ctgggtccgc 84301 gtcgtgccgg atgtcctgaa ggggatggaa gtcgtcgtcg aattccttac ccggccggag 84361 agtccggtcg cgttgccggt tccggcttac atgccgtttt tcgacgtcct gcacgtcacc 84421 ggccgccaac gagtggaagt cccaatggtg cagcaagact cgggacgcta cctgctggac 84481 ctggacgctc tgcaggccgc gttcgtccgc ggtgccggat cggtgattat ctgcaatccg 84541 aataacccac tgggtacggc gttcaccgaa gccgagctac gtgcgattgt ggatatcgcg 84601 gcccgccacg gcgcccgggt gatcgcggat gagatctggg caccggtggt ctacggatcg 84661 cgccatgtcg ccgccgcttc ggtgtcggag gcggcggctg aagtcgtggt cacgttggtg 84721 tcggcgtcca aaggctggaa cttgccgggt ctgatgtgcg ctcaggtgat cctgtctaac 84781 cgccgtgacg cccacgactg ggaccggatc aacatgttgc accgcatggg cgcatcaacg 84841 gtcggtatcc gcgcgaacat cgccgcctac catcatggcg aatcttggtt ggacgagctg 84901 ctcccttatc tgcgggcgaa ccgtgatcat ctggcacggg cgctgccgga gttagctccc 84961 ggggtagagg tcaacgctcc ggacggtacc tacctgtcgt gggtggattt ccgtgcgctg 85021 gctctgccgt ctgaaccggc ggaatacctg ctctcgaagg cgaaggtggc gctgtcgcct 85081 ggcattccgt tcggcgccgc ggtgggctcg ggatttgcgc ggctgaactt cgccaccacc 85141 cgcgcaatac tggatcgggc gatcgaggct atcgcggccg ccctgcgcga catcatcgat 85201 taagccaacc agtagattca caacgctgcg gcgtgttggg tcaggctgaa gaagatgtag 85261 gcgaggcaga tcaggaagtt cagtgccacg agaaccaaac ccagacagat tagtgaatgc 85321 gtggctcggc gttgtaggcg gtggaatttc gcgacgcgct tctcatggtt cagctgggtc 85381 acgatcagtg cgaacttgac gtcggtccat tcttcgtcgg cggcgggagc gcccaacagc 85441 atttcctgaa ggcgcttcgg cggggcttcg acgccgaccc gcgcgaagtg ctggctcagc 85501 cgccgggctt gcctgcggcc aagaatctga ccccgcaccg gtggctgatg cgagagcttc 85561 cttcgttcgt ccccccagtg gttggacggg gtcgtcacag cgggcattct aagtcccgcg 85621 ggccacaaaa ggcagtgccg cggaacttct tggcccaaac gggcacccgg ctacgtgcgc 85681 accgcgaccg tcgacaactg gtcggcgagc cggtccgggg aatccaccat cgagaacgtc 85741 cgtgctccct cgattacctc gaaacgggcg cgcgggatgg tcgcggcgag ccgttgaccg 85801 ttctcgagtg cgaagaacac gtcatccgcc gaccacgcga tgagcgccgg cttgtcgaat 85861 tcaggcagcc gggcggcgac tgcggtggtg acttcggtgc gcagcgatag cgagagctga 85921 cgcaggtctt cggcgatggc cgggttggat agcgccggac gaacccaggc ccgggtgaga 85981 tggtcgatgt tgtggtgcga caaaccggca tacgcgcggt tacgcgcggc cggtgcccgc 86041 atcacctgga tcgcggcccg gaacagggtg gccgatttcg cggccaggat caccggtttg 86101 aggatcggcg gcggaaagtg ttcgaacgca tcgcaactag tgaggaccag ggcaccgagc 86161 cgttcgggat agtggaccgc gacgagctgg gtgacgaccc cgccggtgtc gttgccgacc 86221 agcaccacgt ccttgagctc gagcgcggca aggacgtcgg cgacgatgcc ggcaaccccg 86281 ccgatggtct ggtcggcgcc ggggcgtagc ggcttaggat gcgcacccag cggccaggtg 86341 ggggcgatgc agcgcaggcc acgaccggcg agtcgctcac tgacccgtcg ccatagttga 86401 ccgcccatca tgtacccgtg cacgaacacg acaggcctgc cagtttcggg tccggttgct 86461 tcgtaatgaa tagttccggc actaatgtcg atcgtcgaca tggatgccca cccttcgagg 86521 tacatttaca agcagactgc cggtaactta ccaacagatt gtatggaaat caagagacgc 86581 acccaggagg aacgctccgc ggcgacccgc gaggcgctga tcaccggggc ccgcaagctg 86641 tgggggttac ggggttatgc ggaggtgggg acgccggaaa tcgcgaccga ggcgggggtc 86701 acgcgggggg cgatgtacca ccaattcgcc gataaagcag cactattccg cgatgtggtg 86761 gaggtcgtgg agcaagacgt gatggcccgg atggccacct tggtcgccgc ctcgggggcg 86821 gcgacgccgg ccgatgcaat ccgggcagcg gtcgatgcct ggctcgaggt atctggtgat 86881 ccggaggtgc gtcagctggt cctgctggat gcgcccgtcg tgctgggctg ggcgggtttc 86941 cgcgacgtcg cccagcgata cagcctgggc atgaccgaac agttgatcac cgaggcgatc 87001 cgggccggcc agttggctcg tcaaccggtg cggccgctgg cccaggtgct cattggcgcg 87061 ctcgacgagg cggcgatgtt catcgccacc gccgacgacc ccaagcgcgc ccgtcgggag 87121 accagacagg tgctgcgccg gctcatcgac gggatgctta acggctagcg ctgggcgcgg 87181 cctcggcaaa atggcttgcg gaccgggatc tgagttccag aactgggcgc aggactggct 87241 ggtcaccact tggcggcgag gcgtgtccat tccgctgcca ggtcgcggtc ccggtggaag 87301 ccgcgcaggg taatcagctc gatagctttg cgcgcatctt ggatatcttg aggcgatgcg 87361 gcgtccacga gcgcacgtag atcactgcga tcctggggtc gccgatcatc atctctcgca 87421 agaagtttca tcgcgatcag atgcgccgtt gtggccaccg gagcgactag atcgggcaag 87481 atctcgatct cctcggcagc ctccgcaatc tccggttcga tgccacagct cgcgaaaagg 87541 aggtccacca caacattcgc ggcagtgtct gcggttgctc cgagacggac cgctgccaac 87601 cgtctggccg cgtcctgctc taccgacgcc aggagatggt actgctgggt aagaagttga 87661 cggactaaag attccgcggc atcgtcgttt gccaccgcga caacaatgtc cacgtcacgg 87721 gtgaaacgtg gttcggatcg cgcagacacc gcgaaaccac caaccagcgc ccaccgctga 87781 cgcaatccgg tcaggtcctt ggcgacccta cggagtgtcg actccacagc gttcatgtga 87841 accgtgtgga cgtcgggcct gcgctgtcac cctcctccgc cccgggacgc gtcatcctcc 87901 acgcgtcgat agctgcttca atttcaacaa cgtccgcatt gggccgttca cgacccagcc 87961 tcatgcgctg catctgctcg ccaacctcgt acatgtccag agcgagcctc agcttctgcg 88021 cagcgacgga aactgccaca ctcaaagcct actgggcgca cgtgtggcaa cgagtcgatc 88081 cacacgaaat gccgccgttg ggccgcggac tagccgaatt ttccgggtgg tgacacagcc 88141 cacatttggc atgggacttt cggccctgtc cgcgtccgtg tcggccagac aagctttggg 88201 cattggccac aatcgggcca caatcgaaag ccgagcaggt ggaaccgaaa cgcagtcgcc 88261 tcgtcgtatg tgcacccgag ccatcgcacg cgcgggaatt cccggatgtc gccgtattct 88321 ccggcggccg ggctaacgca tcccaggccg aacggttggc tcgtgccgtg ggtcgcgtgt 88381 tggccgatcg gggcgtcacc gggggtgctc gggtgcggct gaccatggcg aactgcgccg 88441 atgggccgac gctggtgcag ataaacctgc aggtaggtga caccccatta agggcgcagg 88501 ccgccaccgc gggcatcgat gatctgcgac ccgcactgat cagactggat cgacagatcg 88561 tgcgggcgtc ggcacagtgg tgcccccggc cttggccgga tcggccccgc cggcgattga 88621 ccacgccggc cgaggcgcta gtcacccgcc gcaaaccggt cgtgctaagg cgcgcaaccc 88681 cgttgcaggc gattgccgct atggacgcca tggactacga cgtgcatttg ttcaccgacg 88741 ccgagacggg ggaggacgct gtggtctatc gggctggacc gtcggggctg cggctggccc 88801 gccagcacca cgtatttccc ccaggatggt cacgttgtcg cgccccagcc gggccgccgg 88861 tgccgctgat tgtgaattcg cgtccgacac cggttctcac ggaggccgcc gcggtggacc 88921 gggcgcgcga acatggactg ccattcctgt ttttcaccga ccaggccacc ggccgcggcc 88981 agctgctcta ctcccgctac gacggcaacc tcgggttgat caccccgacc ggtgacggcg 89041 ttgccgacgg tctggcatga gcccgggctc gcggcgcgcc agcccgcaaa gcgcccggga 89101 ggtggtcgag ctcgaccgtg acgaggcgat gcggttgctg gccagcgttg accatgggcg 89161 tgtggtgttc acccgcgcgg cgctgccggc gatccgtcca gtcaatcacc tcgtggtcga 89221 cggtcgggtg atcgtgcgca cccgcctgac ggccaaggtg tccgttgcgg tgcgatcgag 89281 cgccgatgcc ggtgtcgtgg tcgcctacga agccgacgac cttgatccgc ggcgtcggac 89341 ggggtggagt gtggtggtga cgggactggc gaccgaggtc agcgatcccg agcaggttgc 89401 ccgctaccag cggctgctac acccgtgggt gaacatggcg atggacaccg tggtcgcgat 89461 cgaacccgag atcgtcaccg gcatccgcat cgttgctgac tcgcgtacgc cgtagccgat 89521 tggccgcggg cggcccgcac gcatccgcac tatctgataa attcttcaat tcgtcaaccg 89581 atgtaacgct gaagctctca ggagacgcgg tggagtccga accgctgtac aagctcaagg 89641 cggagttctt caaaaccctt gcgcatccgg cgcggatcag gattttggag ctgctggtcg 89701 agcgggaccg ttcggtcggt gagttgctgt cctcggacgt cggcctggag tcgtcgaacc 89761 tgtcccagca gctgggtgtg ctacgccggg cgggtgttgt cgcggcacgt cgtgacggca 89821 acgcgatgat ctattcgatt gccgcacccg atatcgccga gctgctggcg gtggcacgca 89881 aggtgctggc cagggtgctc agcgaccggg tggcggtgct agaggacctc cgcgccggcg 89941 gctcggccac gtaacgccat gggttgggtt gccaagattt tccgtgttgg ccgggtggtc 90001 gagcccgcgg cccccttacc ggcggcgata gccgaaccac ccgccggggt acggggttcg 90061 ctgcagatcc gacatgttga cgcgggttcg tgcaacgggt gtgaggtgga gatttcgggc 90121 gcctttggcc cggtgtatga cgcggagcgg ttcggggcgc ggctggtcgc ctcgccccga 90181 cacgccgatg cgttgttggt gaccggcgtg gtcacgcaca acatggccgg cccactgcgc 90241 aagaccctgg aggccacgcc gcgcccgcgg gtggtaatcg cgtgcgggga ttgcgcgctg 90301 aaccgggggg tgttcgccga cgcctacggc gtggtcggtg cggtcggcga ggtggtaccc 90361 gtcgacgtcg agatcgccgg ctgcccgccg acacccgcgg ccatcatggc ggcgctgcga 90421 tcggtgaccg ggaaatgacc gctgcaccga cggccggcgg ggtcgtcact tcgggcgtgg 90481 gcgttgccgg ggtcggcgtg gggttgctgg gcatgtttgg accggtgcgt gtagtgcacg 90541 tcggttggct gcttccgctg tccggcgtgc acatcgagct cgaccggttg ggcggattct 90601 tcatggcgct cacgggcgcg gtagcggctc cggtcggttg ttacctgatc ggctacgtgc 90661 gccgtgaaca cctcggtcgg gtcccgatgg cggtggtgcc gctgttcgtc gcggcgatgc 90721 tgttggtgcc ggccgcgggc tcggtgacga cgtttctgct ggcgtgggag ctgatggcga 90781 tcgcgtcgct gatcctggtg ctctccgagc acgcccgccc gcaggtccgc tcggcgggcc 90841 tgtggtacgc cgtgatgact cagctgggat tcatcgcaat cctggtcggg ctggtggtgt 90901 tggcggcggc cgggggttcc gaccggttcg ccggcctcgg ggcagtctgc gacggggtcc 90961 gcgtcgccgt atttatgctc acgctggtcg ggtttggttc gaaggcgggc ctggtgccac 91021 tgcacgcctg gctgccgcgg gcccacccgg aggcgccgag cccggtgtcg gcgttgatga 91081 gcgcggcgat ggtcaacctg ggcatctacg gcatcgtccg tttcgatctg cagctgctgg 91141 ggccgggccc acgctggtgg gggcttgcgc tgctggccgt gggcggcacg tccgcgctgt 91201 atggggtgct gcaggcttcg gtggccgccg atctcaaacg gctgctggcc tattcgacga 91261 ccgagaacat gggcctgatc acgctggcgc tcggtgcggc aacacttttc gcggataccg 91321 gagcctacgg gccggcgtcg atcgccgccg ccgcagcgat gctgcacatg attgcgcacg 91381 cggcgtttaa gagcctcgcc ttcatggcgg ccggatctgt gctggccgcg accgggctgc 91441 gcgacctgga cctgctcggc gggctggccc gccgaatgcc ggcgaccacc gtctttttcg 91501 gggtggccgc actgggcgca tgtggtctgc cgttgggcgc cgggtttgtc agtgagtggc 91561 tgctggtcca gtcgttgatc cacgctgccc ccggacacga ccccatcgtg gcgctgacga 91621 caccgctggc ggtcggcgtg gtcgcactgg ccaccggtct gagcgtggcg gcgatgacca 91681 aggccttcgg gatcgggttt ctcgcccgtc cccgctccac ccaagccgaa gcggcgcgtg 91741 aggcgccggc cagcatgcgc gccggcatgg cgatcgcggc gggcgcctgc ctggtgctgg 91801 cggtggcacc gctgctggtc gcacccatgg tgcggcgggc cgccgcgacg ctgccggccg 91861 ctcaggcggt caagttcacc ggtctgggcg ccgtggtgcg gctgcccgcg atgtccgggt 91921 cgatcgcgcc cggcgtgatc gccgccgctg tgctcgccgc ggcgttggcg gtagccgtcc 91981 tcgcgcggtg gcgtttccgc cggcgcccgg cgccggccag gttgccgctg tgggcttgcg 92041 gcgcggccga tctcaccgtg cgcatgcaat acacggccac gtcgttcgcc gagccgctgc 92101 agcgggtctt cggcgacgtg ctgcgcccgg acaccgacat cgaggtcacc cacaccgccg 92161 agtcgcgcta tatggccgag cggatcacct accggaccgc ggtcgccgac gcgatcgaac 92221 agcgcctcta tacgccggtg gtcggggcgg tggccgccat ggccgagctg ctgcgccgtg 92281 cccacaccgg cagcgtgcac cgctacctgg cctacggcgc gctgggcgta ctgatcgtgc 92341 tggtggtcgc gaggtgtacg tgatgtccta cctagcgggc gccgcgcaaa tcggcggggt 92401 catggtgggt gcgccgctgg tcatcggtat gacgcggcag gtacgggcac gctgggaagg 92461 ccgggccggc gccggcctgc tgcaaccgtg gcgtgatctg ctcaaacagc ttggcaagca 92521 acagatcaca ccggcgggga cgacgatcgt gttcgccgcc gcgccggtga tcgtcgccgg 92581 gacaacgctt ttgatcgccg cgatcgcacc tctggtggcc accgggtcac ccctggaccc 92641 cagcgccgac ttgtttgccg tggtcgggct gctattcctg ggcaccgtcg cactgaccct 92701 ggccggcatc gacaccggca cctctttcgg cggcatgggc gccagccgcg agatcaccat 92761 cgccgcactg gtcgaaccaa cgatcctgct ggcggtgttc gcgctgtcca tccccgccgg 92821 atcggccaat ctcggtgcgc tggtggcgag tacgatcgac cacccgggcc acgtggtgtc 92881 gctggccggc gtactggcct tcgtggcgtt ggtgattgtc atcgtcgccg agaccgggcg 92941 gctgccggtg gacaacccgg ccacccacct ggaattgacg atggtgcacg aggccatggt 93001 cctcgagtac gccggcccac ggctggcgct ggtcgaatgg gcggccggga tgcggctcac 93061 ggtgctgctg gcactgctgg cgaatctgtt cctgccgtgg gggatcgccg gcgccgcgcc 93121 caccgcgctc gacgtgttga ccggcgtggt ggcggtggcg gccaaggtcg cgattctcgc 93181 ggtgctgctg gcgacgttcg aggtgttcct cgccaaactg cgattgttcc gggtacccga 93241 actgctggcc ggctcgtttc tgctggcctt gctcgcggtc accgccgcca acttcttcac 93301 ggtgggggcg tgaggggcca gcgatgagta acgccaactt ctcgatcctg gtcgacttcg 93361 ccgcgggtgg gctggtgttg gcgtcggtgc tgattgtctg gcgccgcgac ctgcgggcca 93421 ttgtgcggct gctggcctgg cagggtgctg cgctggccgc gatcccgcta ctgcgcggca 93481 tccgcgacaa cgaccgtgcg ctgatcgcgg tgggcatcgc cgtgttggcg ctgcgcgcgc 93541 tggtgttgcc ctggctgctg gcccgcgcgg tgggcgccga agcggccgcg cagcgggagg 93601 ccaccccgtt ggtcaacacc gccagctcgc tgctgattac cgccggactg accctcaccg 93661 cgttcgcgat cacccagccg gtggtcaacc tggaaccggg cgtcaccatc aacgcggtgc 93721 cggccgcgtt cgcggtggtg ctgatcgcgc tgttcgtgat gaccacgcgg ctgcacgcgg 93781 tctcgcaggc cgccggattc ctgatgctag acaacgggat cgcggcgacc gcattcctgc 93841 tcaccgccgg ggtgccgctg atcgtcgaac ttggtgcctc gctggacgtg ctgttcgcgg 93901 tcatcgtgat cggcgtgttg accggccggc tgcgccgcat tttcggcgat gccgacctgg 93961 acaagctgcg ggagttgcgg gattgatgac cggtttgctg cttgccgcga tcctcgcacc 94021 gctcgccgcg tcaatcgcct ccttgatcac cgggtggcga cgcacgacgg cgacgctcac 94081 cgcgctgtcc gccacgacgg tgctggcctg cgctgtggcg atggggtttt ggatggggtc 94141 gggggcgcag ttcgggctgg gcggtctgct gcgcgccgat gcgctgacgg tggtcatgct 94201 cgtcgtcatc gggatcgtcg gcacactggc caccgcggcg agcatcggct acatcgacac 94261 cgagctggca cacgggcata tcgacggacg tagcgctcgg ctgtatgggg tgctgacccc 94321 ggcgtttctt tgcgcgacgg ttctggcggt gtgcgccaac aacatcggcg tcatttgggt 94381 agcgatcgag gccaccacgg tgatcaccgc gtttctggtg gggcatcgcc gcacccgcac 94441 cgcgctggaa gcgacctgga aatacgtggt gatctgttcg gtcgggatcg ccgtcgcctt 94501 cttgggtacc gtgctgctgt atttcgccgc gcgggattcc ggtgccgctg ctgccggcgc 94561 gctgaacctc gatatcctgg ccgaacacgc cgccggccta gaccccgggg tcgctcgact 94621 ggccggcggg ttgctgctca tcggttatgg cgccaaggcg ggcctcttcc cgtttcacac 94681 ctggctggcg gacgcgcaca gccaagcccc cgcaccggtg tccgcactga tgagcggcgt 94741 gctgctggcg gtggcgttct cggtgctgat ccgattgcgg ccgatcctcg acgcggtcag 94801 cgggcccgcc tacctgcgca acgggctgct cgtggtcggg ttggcgacgc tgctggtggc 94861 ggtgctgatg ctgaccgtga ccggcgacgt caagcggatg ctggcctact cgtcgatgga 94921 gcacatgggc ctgatcgcga tcgccgcggc cgccggcacg acattggcga tcgccgcgct 94981 gctgctgcac gtgctcgccc acgggatcgg caagaccgtg ctgtttctgg cgggcggtca 95041 gctgcaggcc gcacacgact ccaccgccat cgccgatatc accggcgtga tgcgacggtc 95101 gcggctgatc ggcgtgtcgt ttgccgtcgg cctgatcgtc ctgcttggct tgccgccgtt 95161 cgcgatgttc gccagcgagc tggcgatcgc gcgctcattg gccaacgagc ggctggcctg 95221 ggtgctgggt gcggcgctgc tgctgatcgc catcggtttc acggctctgg cacgcaattc 95281 cggacgcatg ctgctcggca ccccggcggc gggcgcgccg gcgatcaccg tgccggccac 95341 cgcggcggcg gcgttgatgg tgggcatcgt cgtctcggcg gccctcggca tcaccgcggg 95401 cccactcgcc gacctgcttg gcatcgccgc cagcaacgtg ggtctaccgt gatgagtgcc 95461 agctggctgc gccaccgggt atccgagcgt ggactgatag cgacggccga acaactctgg 95521 gccgattcgt ttcgcctggc cctggtcgct gcccacgacg acggcgacag tctgcgtgtc 95581 gtgtaccttt tcttggcggg ctatccagat cgccgcgtcg agttggaata cgttgtgccg 95641 gcggataatc cagagatcag atcgttggcg tacctgtcct ttccggctgg ccggttcgag 95701 cgcgaaatgg cggacctgta cggaattcgc ccggtcggcc atcccaaacc ccgccgactg 95761 gtacggcacg cgcattggcc cgactggcat cccatgcgca ccgacgccgg gcccgcgccc 95821 gaattcactg atacgggggc cttcccgttc ctcgccgtcg aaggacccgg cgtgtacgag 95881 attccggtcg ggccggtgca cgccggcctc atcgaacccg gtcacttccg gttttctgtc 95941 gcgggcgaga cgatcgtgcg gctgaaggcg cggctgtggt ttgtgcaccg tggcatcgag 96001 aaactcttcc acggccgccc cgccacggcc gcggtcgatc tcgccgaacg catcagcggc 96061 gacacgtcgg cagcgcacgc gctcgcgcac agcctggcga tcgaagacgc tctcggcatc 96121 gagctgcccc acgagatcca ccggctgcgg gccctgatcg tcgaactcga acggctctac 96181 aaccacgccg ccgacctggg tgccttggcc aacgacgtcg gctactcgct ggccaacgct 96241 cacgcccaac gcatccgcga aaatctgttg cggcgcaatg ccgcagtcac cggtcaccgg 96301 ctactgcgcg gcgccatccg cgcgggcggg gttgcgctgc gtgcgctgcc cgataccgac 96361 gagcttgcag cgctcgccgt cgatctcgcc gaggtcgcca ccctgacgct ggccaactcg 96421 gtggtctacg accgcttcgc cggcaccgcc gtgctgcacc ccgacgacgc cagcgccctg 96481 ggctgcctgg gctatgttgc ccgcgccagc ggactgcgca gcgacgcccg ggtcgaacac 96541 cccaccatag tgctgcccat caccgagatc ggcgcgcctg acggcgacgt cttggctcgc 96601 tacaccgtgc ggcgcgacga attcgccgcg tctgccgctc ttgctcaaca cattgtcgaa 96661 tcacacaccg gtccaataga atacgccgct acactgcacc cggtgggcgc gcccagcagc 96721 ggtatcggca tcgtcgaagg ctggcgcggc actatcgtgc accgcgtcga aattgacgtc 96781 gatggccgca tcacccgggc gaaagtcgtc gatccgtcct ggttcaactg gcccgcactg 96841 ccggtggcga tggccgacac catcgtcccc gacttcccgt tggccaacaa aagcttcaac 96901 cagtcctacg cgggcaacga cctctaaccg tgagcgcgcc cagttgtacg gccctagcgg 96961 cgtgtcggtg tacaaacacg caccctcgcg ggttcggttg cgccaaacta gaagtaccgt 97021 ggtcaaggga cgttcgggga gcctgtcgtg gcgtcgagtg cgcaccggtg acctcggtct 97081 ggctgtttgg ggtggacgcg aggagtaccg ggcggtcaaa ccgggcacac cagggataca 97141 accgaaggga gacatgatga ctgtgaccgt tgtcgatgct ggacccggcc gggtgagccg 97201 ttcggtggag gtggccgcgc cggcggccga gttgttcgcc atcgttgctg atccccggcg 97261 ccaccgcgaa ctggacggat cgggcacggt tcgcggcaac atcaaggtac cggcgaaatt 97321 agttgtcggg tcgaagtttt cgacgaagat gaagttgttc ggcctaccgt atcgcatcac 97381 cagcagggtg accgcgctca aaccgaacga attggtcgag tggagccacc cgttaggcca 97441 tcggtggaga tgggaattcg aatcgctgtc accgacactg acccgcgtca ccgagacatt 97501 cgactaccac gccgccggtg cgatcaagaa cggcctgaag ttctacgaga tgacgggttt 97561 cgcgaagtcc aatgcggcgg gaatcgaggc cacgttggcc aagctgagcg atcagtacgc 97621 ccgcggtagg gcatgacgcc atgggggcgt gtcggtgtac cgacacgctc gctcacgggt 97681 tcggttgcac caagaaaaga tgtaccagat cacctgcctg aataggattt ctggcccgac 97741 gtagcttcgg gctagcgcga gcgacgactc cgccgtcgag caggatgtca ccgtggatca 97801 accgtggaac gccaacatcc actacgacgc tctgctggat gccatggtgc cgctcggtac 97861 ccagtgcgtg ctcgacgtcg ggtgcggcga cgggttgctg gctgcccggc tggctcggcg 97921 cataccctac gtcacggcag tggacatcga tgcgcccgtc ctgcgacgtg cgcagacacg 97981 gttcgccaac gcgccgatcc gctggctgca tgccgacatc atgacggctg agctgcccaa 98041 cgcgggcttc gacgccgtgg tctccaatgc cgccctgcac cacatcgagg acactcggac 98101 ggcgctgagc cggctcggcg ggctggtaac tcccggtggg acgctggccg tggtcacctt 98161 cgtgacgccc tcgctgcgaa acggcttatg gcacttgaca agctgggttg cctgcggcat 98221 ggccaatcgc gtcaagggca agtgggaaca ttccgctccg atcaagtggc cgcccccgca 98281 gacgttgcat gagctacgca gccacgttcg cgccctgctg cccggggcgt gtatccgtcg 98341 gctgctgtac ggccgggtgc tcgttacgtg gcgcgcaccc gtctaatcgg gagaacccaa 98401 tggcggcggc cgatatgacc aagtgcgcgt tagcttgcga gattggctgc ccgcatccaa 98461 tgatcggcgg atacgggtcg caaaccacct cagaccggca gctaaggagc gcaagtggcc 98521 aagaaccaaa accgcatccg caaccggtgg gagttgatca cctgtggtct cgggggacac 98581 gtcacctacg cgccggacga cgcggcactt gctgcgcggc tgcgcgccag caccgggctg 98641 ggcgaagtat ggcgctgctt gcgctgcggc gatttcgcgc tcggtgggcc gcaggggcgt 98701 ggtgctcccg aggatgcgcc gttgattatg cgcggcaagg cgttacgtca ggccatcatc 98761 attcgcgcgc tcggggtcga acggctagtc cgggcgttgg tgttggcgct ggccgcgtgg 98821 gcggtgtggg agtttcgcgg tgcgcgggga gctatccagg cgaccctgga tagggacttg 98881 ccggtcctgc gtgcggccgg attcaaggtc gatcaaatga cggtgatcca cgctctggag 98941 aaagcgttgg ccgccaaacc gtcgacgttg gccctgatca cgggcatgct ggcggcatac 99001 gcagtgctgc aggccgtcga gggggtcggt ttgtggctgc tgaagcgctg gggcgagtac 99061 ttcgcggtgg tggccacctc aattttcctg ccgttggagg ttcacgacct ggccaagggc 99121 atcacgacga ctcgggtcgt gaccttcagc atcaatgtcg ccgccgttgt ctacctgctg 99181 atttctaagc ggttgttcgg tgtgcgcggc gggcgcaagg cttatgacgt cgaacggcgc 99241 ggcgagcagc tgctcgacct cgagcgcgcc gcgatgctca cctgaccagc caaaatccca 99301 cctgtgcggg gcctgcgggt tgtgtcaaag gtcactagcg cctttttcgc actgtttact 99361 ccggcgcggc gtgcccgtaa agccgcccgg gtgaacttgg atcaggtggc gcaatgtcgc 99421 cggaccgacg aaggaccgac gctgtgtcaa cactgccaac ctgggtcagc cagagctcta 99481 ccgaccgcgg cgtggtcgcg ccaatcacag cgcgtgcccg cgacgcactg caggccgtgc 99541 tgcgcgccag gcgccgcggc cagcgctctg acttgcgcct tatgcgcaga ggcgtggagc 99601 gttgttgagg tcaggcccgc gccgagggcc gcgactttct cgctacaatc gcgcgcggcg 99661 cgggagagcc gctagccgcc ggtgaccggc gattggagat tgagttgcga ccgaacggat 99721 ggcggtgacg gtcggcgtca tttgtgcgat cccgcaagag ctggcgtatc tgcgcggtgt 99781 cctggtcgat gcgaaacgcc agcaggtcgc gcagatcctc ttcgatagcg gccaactcga 99841 cgcgcaccgg gtcgtgttgg ccgccgccgg catgggcaaa gttaacacgg gcctgaccgc 99901 aacgctgctt gccgatcgat tcggctgccg caccatcgtt ttcacgggag tggccggcgg 99961 gctggatccc gagctatgca tcggtgacat cgtcatcgcc gatcgggtcg tccaacacga 100021 cttcggtctg ctcaccgatg agcggctgcg cccctatcag cccggacaca tccccttcat 100081 cgaaccgacc gagcggctcg gatacccggt tgatcccgcg gtcatcgatc gggtcaaaca 100141 ccgcctcgac gggttcacgc tggcgccgct gtccaccgcc gcgggaggtg gtggccggca 100201 gccacgcatc tactacggca ccatcctgac cggtgaccaa taccttcact gcgagcgcac 100261 ccgcaaccgg ctgcaccacg aactcggcgg tatggccgtc gaaatggaag gcggtgcggt 100321 ggcgcaaatc tgcgcgtcct tcgatatccc atggctggtc attcgcgcgc tctccgatct 100381 cgccggagcc gattcggggg tggacttcaa tcggtttgtc ggcgaggtgg cggccagttc 100441 ggcccgcgtt ctgctgcgct tgctgccggt gttgacggcc tgttgaagac gactatccgc 100501 cggtgcgttc accgcgtcag gcggcttcgg tgaggtgagt aatttggtca ttaacttggt 100561 catgccgccg ccgatgttga gcggaggcca caggtcggcc ggaagtgagg agccacgatg 100621 acggcggccg tgaccggtga acaccacgcg agtgtgcagc ggatacaact cagaatcagc 100681 gggatgtcgt gctctgcgtg cgcccaccgt gtggaatcga ccctcaacaa gctgccgggg 100741 gttcgggcag ctgtgaactt cggcacccgg gtggcaacca tcgacaccag cgaggcggtc 100801 gacgctgccg cgctgtgcca ggcggtccgc cgcgcgggct atcaggccga tctgtgcacg 100861 gatgacggtc ggagcgcgag tgatccggac gccgaccacg ctcgacagct gctgatccgg 100921 ctagcgatcg ccgccgtgct gtttgtgccc gtggccgatc tgtcggtgat gtttggggtc 100981 gtgcctgcca cgcgcttcac cggctggcag tgggtgctaa gcgcgctggc actgccggtc 101041 gtgacctggg cggcgtggcc gtttcaccgc gttgcgatgc gcaacgcccg ccaccacgcc 101101 gcctccatgg agacgctaat ctcggtcggt atcacggccg ccacgatctg gtcgctgtac 101161 accgtcttcg gcaatcactc gcccatcgag cgcagcggca tatggcaggc gctgctggga 101221 agcgatgcta tttatttcga ggtcgcggcg ggtgtcacgg tgttcgtgct ggtggggcgg 101281 tatttcgagg cgcgcgccaa gtcgcaggcg ggcagtgcgc tgagagcctt ggcggcgctg 101341 agcgccaagg aagtagccgt cctgctaccg gatgggtcgg agatggtcat cccggccgac 101401 gaactcaaag aacagcagcg cttcgtggtg cgtccagggc agatagttgc cgccgacggc 101461 ctcgccgtcg acgggtccgc tgcggtcgac atgagcgcga tgaccggcga ggccaaaccg 101521 acccgggtgc gtccgggggg gcaggtcatc ggcggcacca cagtgcttga cggccggctg 101581 atcgtggagg cggccgcggt gggcgccgac acccagttcg ccggaatggt ccgcctcgtt 101641 gagcaagcgc aggcgcaaaa ggccgacgca cagcgactag ccgaccggat ctcctcggtg 101701 tttgttcccg ctgtgttggt tatcgcggca ctaaccgcag ccggatggct aatcgccggg 101761 ggacaacccg accgtgccgt ctcggccgca ctcgccgtgc ttgtcatcgc ctgcccgtgt 101821 gccctggggc tggcgactcc gaccgcgatg atggtggcct ctggtcgcgg tgcccagctc 101881 ggaatatttc tgaagggcta caaatcgttg gaggccaccc gcgcggtgga caccgtcgtc 101941 ttcgacaaga ccggcaccct gacgacgggc cggctgcagg tcagtgcggt gaccgcggca 102001 ccgggctggg aggccgacca ggtgctcgcc ttggccgcga ccgtggaagc cgcgtccgag 102061 cactcggtgg cgctcgcgat cgccgcggca acgactcggc gagacgcggt caccgacttt 102121 cgcgccatac ccggccgcgg cgtcagcggc accgtgtccg ggcgggcggt acgggtgggc 102181 aaaccgtcat ggatcgggtc ctcgtcgtgc caccccaaca tgcgcgcggc ccggcgccac 102241 gccgaatcgc tgggtgagac ggccgtattc gtcgaggtcg acggcgaacc atgcggggtc 102301 atcgcggtcg ccgacgccgt caaggactcg gcgcgagacg ccgtggccgc cctggccgat 102361 cgtggtctgc gcaccatgct gttgaccggt gacaatcccg aatcggcggc ggccgtggct 102421 actcgcgtcg gcatcgacga ggtgatcgcc gacatcctgc cggaaggcaa ggtcgatgtc 102481 atcgagcagc tacgcgaccg cggacatgtc gtcgccatgg tcggtgacgg catcaacgac 102541 ggacccgcac tggcccgtgc cgatctaggc atggccatcg ggcgcggcac ggacgtcgcg 102601 atcggtgccg ccgacatcat cttggtccgc gaccacctcg acgttgtacc ccttgcgctt 102661 gacctggcaa gggccacgat gcgcaccgtc aaactcaaca tggtctgggc attcggatac 102721 aacatcgccg cgattcccgt cgccgctgcc ggactgctca accccctggt ggccggtgcg 102781 gccatggcgt tctcatcgtt cttcgtggtc tcaaacagct tgcggttgcg caaatttggg 102841 cgatacccgc taggctgcgg aaccgtcggt gggccacaaa tgaccgcgcc gtcgtccgcg 102901 tgatgcgttg tcgggcaaca cgatatcggg ctcagcggcg accgcatccg gtctcggccg 102961 aggaccagag gcgcttcgcc acaccatgat tgccaggacc gcgccgatca ccaccggcag 103021 atgagtcaaa atccgcgtgg tgctgaccgc gccggacagc gcatccacaa tcacatagcc 103081 ggtcagtatg gcgacgaacg ccgtcagaac accggccagg ccggcggcgg cgctcggcca 103141 tagcgccgcg cccaccatga tcacaccgag cgcaatcgac cacgacgtgg actcgttgag 103201 caagtgggtg ccggcacccg tcgggtgctg atgggtcagg ccgacgtcta ggccaaaccc 103261 ctgcacggtg cccagggcga tctgcgcgat gcccacgcac agcaacgccc aacgtcgcca 103321 ggtcatcggt gaatgttgcc gccgcggcgc ccggcggatc ccgaggcgcc caacaggcgg 103381 gacaaccggg cgggactcgg cgagccgacg cagatcacca gcctggctgg ccacctgggt 103441 aaaccatgcg cgacaggcgc tgcactcgcc caggtgttca tcgactctcg ccgagggcac 103501 cggtgcgcgc tcgccgtcga gtcgtgccga cagcgcttcg cgcgcgacct cgcagtccat 103561 gccatcaata gtcgcgcaat gccgacggat tgctccagcg ggctcggacc acatcgccgc 103621 gggcacaccc ctgcggcctt gcaaaacggt tgatgcgtgg tggttaaagc tcccggccgt 103681 tgtggcttgt gcgagcacgg tggcccgggt ggtgcgtgag cgccgtgggg ctcgcgttca 103741 ggggtcaatc gggtttgtcg tcgtcgtctt ggttgtggag gaatcgttcg gggtggtgga 103801 aggtgttggt ccacggttgg ccgtggtcga ggtggggtgg tggtagccat tcggtgtggc 103861 cgtgggtgtt tttgcgggtg gtccagcctt tttcggcgag tcggttgtcg gggtcgcagg 103921 ccagggtgag gtcggtgatg tcggtgcgtc cggtgctggt ccagccggtg acgtggtggg 103981 cttggctgtg gtaggccggt gcgtcacagc cgggtttggt gcagccgcgg tcgttggcga 104041 acagcatgat ccgctgggcc ggggaggcta ggcgtttggt gtgatacagc gccaggggtg 104101 tgccgtggtc gaagatcgcc tgggggtacc tcccgcttgc gggggagtag tggtgggcgt 104161 ggctggtcat gcggatcaca tcggccatgg gtagcagggt gccgccgccg gtgaagccct 104221 tgccggcgcc ggtttgcagg tcggtcaggg tggtggtgac cacaatcgag acgggaagac 104281 cgttgtgttg gcccagttcc ccggaggcga tcagcgcgcg cagcccggcc agcagcccgt 104341 cgtggttgcg ttgggcttgg ctgcgggtgt cgcggtcgat ggcggccgca tcgggggtgg 104401 tgtcgatgac cggggtgtgg tcgtcggggt tggtcgcgcc gggggcggcc agtttggcta 104461 gcacggcttc aaaggtggcc cgcgcttggg gggtcaggta gccacttagc cgtgacatgc 104521 cgtcgtattg ctggttgctc agggtgatgc cgcgtttgcg ggcgcgttcg gtgtcggtga 104581 ggtcgccgtc ggggtgtagc cagtccatga cccgctgggc gtagcgggcc agctcgtcgg 104641 gacgatattg agcggctttg ccggccaggt cggcttcggc ggcctggcgg gtggacacat 104701 ccaccgcggc gggcaggtgg gcgaaaaagg gcgcgaatca ctttgatgtg cgcctcgccg 104761 atcaggccct ggcgttgggc ggtggcggtg gcggtcaact gtggggctag cggttcgccg 104821 gtgagtgctc gacgaggtcc gagatcggcg gcgtcggcga tgcgtagggc ggcgtcgggc 104881 ttggtgatgc gtaaccggtt ggccagcgcg cagcacagcg tgccgcccag ttcttcctcg 104941 ctggcttggg tgtcgagttg gttgatcaac gtgtgcccga ccgccggtag ccggcgcacc 105001 aagcattcca gacgttccag agaccgcagc cgttccgggg tggtcaacac ctcaaaagac 105061 acctcgtcca agcggtccag ctcggcatcc agcgcatcaa agacctcgac aagctcctcc 105121 cggctattcg ctaacatgtt cgaatcataa cgtcgggcac tgacaaagag cgccccgctg 105181 ataaccgtga aactgaagtg acacaaggga tttacccaga tcctacgagt tgatacggga 105241 aggtaccgca cctttcctgg gcgcgatggg aactttctgc ccgttatggc cgactaacac 105301 cgcgggtgaa gcaaagcgct gcctaggcaa ggaggtgagt cctggcggcc acgatatgga 105361 tggctatacc accggaggtg cactcgggcc tgttgagcgc cgggtgcggt ccgggatcat 105421 tgcttgttgc cgcgcagcag tggcaagaac ttagtgatca gtacgcactc gcatgcgccg 105481 agttgggcca attgttgggc gaggttcagg ccagcagctg gcagggaacc gccgccaccc 105541 agtacgtggc tgcccatggc ccctatctgg cctggcttga gcaaaccgcg atcaacagcg 105601 ccgtcaccgc cgcacagcac gtagcggctg ccgctgccta ctgcagcgcc ctggccgcga 105661 tgcccacccc agcagagctg gccgccaacc acgccattca tggcgttctg atcgccacca 105721 acttcttcgg gatcaacacc gttccgatcg cgctcaacga agccgattat gtccgcatgt 105781 ggctgcaagc cgccgacacc atggccgcct accaggccgt cgccgatgcg gccacggtgg 105841 ccgtaccgtc cacccaaccg gcgccaccga tccgcgcgcc cggcggcgat gccgcagata 105901 cctggctaga cgtattgagt tcaattggtc agctcatccg ggatatcttg gatttcattg 105961 ccaacccgta caagtatttt ctggagtttt tcgagcaatt cggcttcagc ccggccgtaa 106021 cggtcgtcct tgcccttgtt gccctgcagc tgtacgactt tctttggtat ccctattacg 106081 cctcgtacgg cctgctcctg cttccgttct tcactcccac cttgagcgcg ttgaccgccc 106141 taagcgcgct gatccatttg ctgaacctgc ccccggctgg actgcttcct atcgccgcag 106201 cgctcggtcc cggcgaccaa tggggcgcaa acttggctgt ggctgtcacg ccggccacgg 106261 cggccgtgcc cggcggaagc ccgcccacca gcaaccccgc gcccgccgct cccagctcga 106321 actcggttgg cagcgcttcg gctgcacccg gcatcagcta tgccgtgcca ggcctggcgc 106381 cacccggggt tagctctggc cctaaagccg gcaccaaatc acctgacacc gccgccgaca 106441 cccttgcaac cgcgggcgca gcacgaccgg gcctcgcccg agcccaccga agaaagcgca 106501 gcgaaagcgg cgtcgggata cgcggttacc gcgacgaatt tttggacgcg accgccacgg 106561 tggacgccgc tacggatgtg cccgctcccg ccaacgcggc tggcagtcaa ggtgccggca 106621 ctctcggctt tgccggtacc gcaccgacaa ccagcggcgc cgcggccgga atggttcaac 106681 tgtcgtcgca cagcacaagc actacagtcc cgttgctgcc cactacctgg acaaccgacg 106741 ccgaacaatg aacaaggaga aaagaaccga tgacgcttaa ggtcaaaggc gagggactcg 106801 gtgcgcaggt cacaggggtc gatcccaaga atctggacga tataaccacc gacgagatcc 106861 gggatatcgt ttacacgaac aagctcgttg tgctaaaaga cgtccatccg tctccgcggg 106921 agttcatcaa actcggcagg ataattggac aaatcgttcc gtattacgaa cccatgtacc 106981 atcacgaaga ccacccggag atctttgtct cctccactga ggaaggtcag ggggtcccaa 107041 aaaccggcgc gttctggcat atcgactata tgtttatgcc ggaacctttc gcgttttcca 107101 tggtgctgcc gctggcggtg cctggacacg accgcgggac ctatttcatc gatctcgcca 107161 gggtctggca gtcgctgccc gccgccaagc gagacccggc ccgcggaacc gtcagcaccc 107221 acgaccctcg acgccacatc aagatccgac ccagcgacgt ctaccggccc atcggagagg 107281 tatgggacga gatcaaccgg accacgcccc caataaagtg gcctacggtc atccggcacc 107341 caaagaccgg ccaagagatc ctctacatct gcgcgacggg caccaccaag atcgaggaca 107401 aggacggcaa tccggttgat ccggaggtgc tgcaagaact catggccgcg accggacagc 107461 tcgatcctga gtaccagtcg ccgttcatac atactcagca ctaccaggtt ggcgacatca 107521 tcttgtggga caaccgggtt ctcatgcacc gagcgaagca cggcagcgcc gcgggcactc 107581 tgacgaccta ccgcctgacc atgcttgatg gcctcaagac gccgggatac gcggcatgag 107641 ccacaccgac ttgacgccct gcacacgggt gctggcatcc agcggcacgg ttccgatcgc 107701 agaggaactg ctggccagag tgctcgagcc ctactcctgc aaaggatgtc gctacctcat 107761 cgacgcacag tacagcgcca ccgaggattc ggttcttgcc tatggcaact tcacgatcgg 107821 tgagtccgcc tatattcgaa gcacggggca cttcaacgcg gtcgaactga ttctgtgttt 107881 caatcagctc gcctacagcg ccttcgctcc ggccgtcctc aacgaggaaa tccgggtgct 107941 tcgcggctgg tcgatcgacg actactgcca acaccagctc tctagcatgc tgatcaggaa 108001 ggcatcatcg cggttcagaa aaccgctgaa cccgcaaaag ttctctgccc gcctcctgtg 108061 tcgagatctg caggtcatcg aacgaacctg gcgctatctc aaggtcccgt gcgtcatcga 108121 gttctgggac gagaacggcg gggcggcgtc cggtgagatc gaactagcgg ccctcaacat 108181 tccgtaatcc aatgggagga aagaagtttc aagctatgcc tcagttgcca tctaccgtgc 108241 tggaccgggt cttcgagcag gcacggcagc agccggaagc aatcgccttg cgtcgctgcg 108301 acggcactag cgcactgcgg taccgtgaac tcgtcgccga agttggtggc cttgccgcgg 108361 atttgcgtgc ccagtcggtt agccggggtt ctagggtgct ggtcatttcc gacaatggac 108421 ccgagacgta cctgtcggtg ctggcgtgtg caaagctcgg ggcgatcgcc gtcatggccg 108481 acggcaatct tccgatcgca gccatcgaac gattctgtca gatcaccgac cccgcagcgg 108541 ctctcgtcgc accagggagc aagatggcat cttccgccgt tcccgaggcg ctgcactcga 108601 taccagtgat cgcggtcgac atagccgctg ttacacggga atccgagcat tccttggatg 108661 cagccagcct cgccgggaac gcggaccagg ggagcgagga tccgctggcg atgatcttca 108721 ccagcggtac cacgggcgag cccaaggctg tgctactggc caaccgcacc ttcttcgccg 108781 tcccggacat cttgcaaaaa gagggtttga actgggtcac ttgggtcgtc ggcgaaacca 108841 cctactcgcc gctgccggcg acgcacatcg gtggactgtg gtggatactt acctgcctga 108901 tgcacggcgg gttgtgtgtc accggcggcg agaatacgac atcgttgctg gagattctca 108961 ccacgaacgc ggtggcgacg acgtgcctag tgccaacgct tctttcgaag ttagtttctg 109021 aactgaagtc cgccaacgcg acggttccct cgctgcgcct agttggatac ggtggttcgc 109081 gggcgatcgc ggccgatgtg cggtttatcg aagctaccgg cgtgcgcacc gcacaggtct 109141 acggattgag cgagaccggt tgcacggctt tgtgtttgcc gaccgatgac ggctcgatcg 109201 tcaagatcga agcaggtgct gttggccgtc cgtaccctgg cgtggacgtc tatcttgccg 109261 ctaccgatgg catcggccct accgcccccg gcgccggccc gtccgcctcg ttcggcacgc 109321 tatggattaa gtcaccggcc aacatgctgg gctactggaa caatcccgaa cgcaccgcag 109381 aggtgctgat tgacggctgg gtgaacaccg gtgacctgct ggagcgccgc gaggacggct 109441 tcttctacat caagggaaga tcctcggaga tgatcatctg tggtggcgtg aacattgcgc 109501 ccgacgaggt cgatcgcatc gcggagggcg tgtcgggcgt ccgcgaggcc gcgtgctacg 109561 agattcctga cgaagagttc ggcgcgctgg tgggcctggc cgtggtcgca tcggcagagc 109621 ttgacgagtc ggcagcccgg gcgctcaagc acacgattgc ggctcgtttt cgacgggagt 109681 ccgagccgat ggcgcggccg tcgacaattg tgatcgtcac cgacattcca cgaacgcagt 109741 ccggcaaggt catgcgggcc tcgcttgcag cggcggcaac agcagacaag gccagagtgg 109801 tcgttcgtgg ctgagccggt gcgggaccga atcctcgccg ccgtctgcga cgtgttgtat 109861 atcgacgagg cggatctcat tgatggcgac gaaacggatc tccgcgacct cgggctggac 109921 tctgttcggt ttgttctgct gatgaagcag ctaggcgtga accgacaatc cgaactgccg 109981 tcccgattgg ccgcgaaccc gtcgattgcg ggttggcttc gcgagctgga ggctgtgtgc 110041 accgagttcg gttaagccgc tcgcagcgca acctctacaa cggcgtgcgc caggataaca 110101 atcccgcgtt atatctgatc ggcaagagct atcggttccg ccggttggag ctggcgagat 110161 tcctggccgc tctgcacgca acggtactgg acaaccccgt gcaactttgc gtcctggaga 110221 attcgggggc agactatccg gatctggtgc cgcggctacg gttcggcgac atcgtgcggg 110281 tggggtcagc cgatgagcac ctgcagagca catggtgttc gggcatcctg ggcaagccac 110341 tggtgcggca tacggtgcac accgacccga acgggtatgt gaccggtctg gacgttcaca 110401 cccaccacat cctgctggac ggcggcgcga ccgggacgat cgaagctgac ctggcgcgtt 110461 acctgaccac cgacccggcg ggcgaaaccc ccagtgtcgg tgcgggtcta gccaagctca 110521 gggaggcgca ccgtcgtgag acggccaagg tggaagaatc gcgggggcgc ctgtcggctg 110581 tcgtgcagcg tgaactcgcc gacgaagcat accacggcgg gcacgggcac agcgttagcg 110641 acgctcccgg gaccgcggcc aagggcgtcc tgcacgaatc ggcaacgatc tgcggcaacg 110701 cgtttgatgc catcctgacc ctttcggaag cgcagcgggt cccgcttaat gtgctggtgg 110761 ctgcggcggc cgtcgcggtg gacgcgagcc ttcggcagaa caccgaaacc ctcttggtgc 110821 acacggtgga caaccggttc ggagattctg atctgaatgt cgcgacctgt ttggtcaatt 110881 cggttgccca gaccgtccgg tttcccccat ttgcgtcggt gtccgatgtc gttcgaacgc 110941 ttgaccgcgg ctatgtcaag gcggtaagac gccggtggct tcgtgaggag cattaccgcc 111001 gaatgtattt ggcgatcaac cggacatctc acgtggaggc gttgacgcta aatttcattc 111061 gcgagccatg cgcacctggc ctgcgcccgt tcttgtcgga ggtcccgatt gccacggata 111121 tcggtccggt cgagggcatg acggtggcgt ctgttctgga cgaagaacag cgcacactga 111181 acctagccat ctggaaccga gccgatctgc ccgcgtgcaa gacacacccc aaggtcgcgg 111241 aacggatagc ggcagcgttg gaatcgatgg cggcgatgtg ggatcggccg atcgccatga 111301 tcgtcaacga ctggttcggg atcggcccgg acgggactcg ctgccaaggc gattggccag 111361 cccgtcagcc gtcgacgccc gcgtggtttc tcgattccgc aaggggcgtc caccaatttc 111421 tcggcaggcg ccgcttcgtc tacccgtggg tcgcgtggtt ggtgcaacgc ggcgccgcac 111481 cgggtgatgt tctggtgttc accgacgacg acaccgacaa gaccattgac ctgctcatcg 111541 cgtgtcacct tgcgggttgc gggtacagcg tctgcgacac cgctgacgaa atttccgtgc 111601 ggaccaatgc gattaccgag cacggcgatg gcatcttggt gacagtggtc gacgtggccg 111661 ccacccagct ggcggttgtc ggccatgacg agctgcggaa ggtcgttgac gagcgcgtca 111721 cacaggtgac acacgacgca ctgctggcca ccaagaccgc ctacatcatg ccgacctcgg 111781 gaactaccgg acaacccaag ctggtgcgaa tctcacacgg ctcgctcgcg gttttctgtg 111841 atgcgatcag ccgcgcctac ggttggggag cccacgacac cgttctgcag tgcgctccgt 111901 tgacatcgga catcagcgtc gaggagattt tcggtggcgc ggcctgtggc gcgcgactgg 111961 tgcgatccgc ggctatgaaa accggcgacc tggcggcgct ggttgacgat ctcgtcgccc 112021 gcgagacgac aatcgtcgac ctgccgaccg ccgtctggca gctgttgtgc gccgacggcg 112081 acgccattga cgcgatcggc cgctcgcgcc tgcggcagat cgtaatcggc ggtgaagcca 112141 tccgctgtag cgccgtggac aagtggcttg aatcggctgc ttcacaaggg atctcgctgc 112201 tctcgagcta tggtccaaca gaagccacgg tcgtcgccac cttcttgccg atcgtttgcg 112261 accagaccac catggacggc gcactgctca ggctcggccg gccgatccta ccgaacacgg 112321 tgttcctcgc gttcggtgaa gtcgtcattg tcggggattt agtcgccgac ggctacctcg 112381 ggatcgacgg cgacggcttc ggcaccgtga cggccgcaga cggttcccga cgccgtgcct 112441 ttgccactgg cgaccgggtg accgtcgacg ccgaaggatt tccggtcttc tccggacgca 112501 aagacgccgt cgtcaagatc tccggcaagc gtgtcgatat cgctgaggta accaggcgca 112561 tcgccgaaga ccccgcggtg tcagatgtcg ccgtcgagtt gcacagcgga agcctcggag 112621 tgtggttcaa gagccaacgg acccgcgagg gcgaacaaga cgctgccgcg gcgacccgga 112681 tcaggctcgt cctcgtgagt ctgggagtgt cgtcgttttt cgttgtcggc gtgccgaata 112741 tcccgaggaa gcccaacggg aagatcgaca gcgacaacct gccgaggctg cctcagtggt 112801 cagctgctgg gctaaacacc gccgagacgg gtcagcgagc ggccggcctc tcgcagatct 112861 ggagccggca gctcggccgg gcaatcgggc cggactcgtc gctgcttggt gagggcatcg 112921 gctcgttgga tctcatcaga atactgcccg agacgcgtag gtatctgggg tggcgcctct 112981 cgctgctgga tctgatcggt gccgataccg ccgccaatct ggccgattac gcgccaacgc 113041 ccgacgcgcc gacgggcgaa gatcggttta ggccgctggt ggccgcgcaa cggcccgcgg 113101 cgattccgtt gtcgtttgcc cagcggcgac tatggtttct cgaccagtta cagcgacccg 113161 ctccggtcta caacatggcg gtggcgttgc ggctgcgcgg gtatctcgat accgaggcgt 113221 tgggcgcggc ggtcgccgat gtcgtgggcc gccacgaaag cctacggacg gtgtttccgg 113281 cggtcgacgg ggtccctcgg cagctggtca tcgaagcgcg gcgggcagat cttggctgcg 113341 acatcgtcga tgccaccgca tggccggctg accggctgca acgggccatc gaggaggcgg 113401 cgcgccacag cttcgatttg gcaaccgaga tacctttgcg gacgtggctt ttccggatcg 113461 ccgacgacga acatgtgctg gtggcggttg cacaccatat cgccgccgac ggctggtcgg 113521 tggctccgct gacggccgat ctgagtgcgg catatgccag ccgttgtgcg ggtcgggcac 113581 cggactgggc gccattgcca gtgcagtatg tcgattacac gctgtggcag cgggaaatcc 113641 tcggtgatct cgacgacagc gacagcccga tcgccgcgca gctggcctac tgggaaaatg 113701 cgttggccgg tatgccggaa cggctgcggc tgcccaccgc tcggccctat ccaccggttg 113761 ccgatcagcg cggcgccagt ttggtggtgg attggccggc gtcggtgcaa cagcaggtgc 113821 gtcggatcgc ccgccagcac aacgcgacca gcttcatggt ggtagctgcc gggcttgccg 113881 tgctgctgtc gaaactcagc ggaagccccg atgtggcggt cggatttccc atcgccggcc 113941 gcagcgatcc tgcgctggat aacttggtgg gcttttttgt caacaccttg gtgttgcggg 114001 tcaacctggc cggtgatccc agcttcgccg aactgctggg gcaggtgcga gcgcgcagcc 114061 tggccgccta cgaaaatcaa gacgtacctt tcgaggtgct cgttgatcgc ctcaaaccca 114121 ctcgagccct gacccatcac ccgctgatcc aggtgatgtt ggcctggcag gacaatccgg 114181 ttggacagct gaatttgggt gatctgcagg ccaccccgat gccgatcgac acccgcaccg 114241 cccgcatgga cttggtgttt tcgttagcgg aacgcttcag cgagggtagc gaacctgccg 114301 ggatcggcgg agcggtggaa taccgcaccg atgtgtttga agcccaagca atcgacgtgc 114361 ttatcgagcg gttgcggaag gtgttggtgg cggtggccgc tgctccggaa cggacggtgt 114421 cgtcgatcga tgcgctggat gggaccgagc gtgcccggtt ggatgagtgg ggtaaccgcg 114481 ctgtgctgac tgcgcccgcg cccacgccgg tgtcgatccc gcagatgttg gccgcccagg 114541 tggcacgtat ccccgaagcg gaggcggtgt gttgcgggga cgcgtcgatg acgtatcggg 114601 aactcgacga ggcgtccaac cggttagcgc atcggctggc aggttgtggg gccggcccgg 114661 gcgagtgtgt ggcgctgctg ttcgagcggt gcgcgccggc ggtcgtggcg atggtggcag 114721 tgctcaaaac cggggcggcg tatctgccga tcgatccggc gaatcctccg ccgcgggtgg 114781 cgttcatgct cggcgacgcg gtgcccgtgg ccgcggtcac cacggctggg ctgcgctccc 114841 ggttggcggg acacgacttg ccgatcatcg atgtcgtcga tgctttagcg gcatatccgg 114901 gcacgccccc acccatgccg gccgcagtga acctcgccta catcctgtac acctcgggca 114961 ctaccggcga gcccaaaggc gtggggatca cccatcgcaa cgtcaccagg ctgttcgcat 115021 cactgccggc acgcttgtcg gcggcgcagg tgtggtcgca gtgtcattcc tatggcttcg 115081 acgcctcggc gtgggagatc tggggcgcgt tgctaggtgg tgggcgactg gtgatcgtgc 115141 ccgagtcggt ggcggcctcg ccgaacgact ttcatgggct gctcgtggcc gaacacgtca 115201 gcgtgctgac tcagactccg gctgcggtgg caatgttgcc gacgcagggt ttggagtcgg 115261 tggcgttggt ggtggccggt gaggcatgtc cggcagcgct ggtggatcgg tgggcgcccg 115321 ggcgggtgat gctaaatgct tatggcccaa ccgagaccac gatctgtgcg gcgataagtg 115381 cgccgttgcg accgggttcg gggatgccgc cgattggtgt tccggtgtcg ggggcggcgt 115441 tgtttgtgct ggatagctgg ttgcgcccgg taccggccgg ggtggccgga gagttgtaca 115501 ttgccggtgc gggcgtcggt gttgggtatt ggcgtcgggc ggggctgacc gcgtcacggt 115561 ttgtggcctg cccattcggc ggttccgggg cacgcatgta tcgcaccggg gatctggtgt 115621 gttggcgcgc cgatggccag ttggagttcc tggggcgcac cgacgatcag gtcaagatcc 115681 gcgggtatcg catcgagctc ggcgaggttg cgaccgcgct ggccgagctg gctggggtag 115741 gtcaagcggt tgtaatcgcc cgtgaagacc gccctgggga caagcgccta gtcgggtatg 115801 ccaccgaaat tgcccccggg gcagtggacc cggccgggct gcgggcgcaa ctagcccagc 115861 gattgcccgg ttacctggtg ccagccgcgg tggtagtgat cgatgcgctt ccgttgacgg 115921 tcaacggcaa acttgatcat cgtgcgttgc cggcaccgga atacggtgat accaacggat 115981 atcgcgctcc ggccgggccg gttgagaaga ccgtggccgg catctttgcc cgggtgcttg 116041 ggcttgagcg ggtcggcgtc gacgactcgt tcttcgagct cggcggcgat tcgctggcgg 116101 caatgcgggt tatcgccgcg atcaacacca ccctaaacgc cgatctgccg gtgcgcgcgt 116161 tgctgcacgc gtcgtcgacg agaggtttaa gccagctgtt ggggcgagat gcccgaccga 116221 ccagcgatcc gcgcttggtg tctgtgcacg gcgacaaccc caccgaggtg catgccagcg 116281 acctcacgct ggaccggttc atcgacgccg acacgctggc caccgccgtc aacctgccgg 116341 gcccgagccc cgagctacgg acggtcctgc tgacgggcgc gacgggtttc ctcggacggt 116401 atctggtcct tgaattgctg cggcggctgg acgtcgacgg caggctgatc tgtttggtgc 116461 gggcggagtc cgacgaggat gcgcggcgtc gtctggagaa gaccttcgat agcggtgacc 116521 cggaattgct gcggcacttc aaggagcttg ccgccgaccg gctggaggtc gtcgcaggcg 116581 acaagagcga acccgacctg ggcctggacc aaccgatgtg gcggcggctg gccgaaaccg 116641 tggatttgat tgtcgattcc gcggcgatgg tcaacgcgtt tccctaccac gaattgttcg 116701 ggcccaacgt cgcgggcacc gccgagctga tccgaatcgc gcttaccacc aagctcaaac 116761 ccttcaccta cgtgtcaacc gccgacgtgg gtgctgcgat cgagccgtcg gcgttcaccg 116821 aggacgccga catccgggta atcagcccca cccgcaccgt cgacggcggc tgggctggcg 116881 gctacggcac cagcaagtgg gccggtgagg tgctgctgcg cgaggccaac gacctgtgcg 116941 cgctgccggt cgcggtgttt cgctgcggga tgatcctggc cgacaccagc tatgccggac 117001 agctcaacat gtcggactgg gtcacccgga tggtgttgag cttgatggct accggcatcg 117061 cgcctcgttc gttctacgaa ccggactccg agggcaatcg gcaacgcgcg cacttcgacg 117121 ggctgccagt caccttcgtt gccgaggcga tcgcggtgct gggcgcgcgg gtggccggct 117181 catcgttggc gggatttgcg acctatcacg tgatgaaccc gcacgacgac ggtatcgggc 117241 tcgatgagta tgtggactgg ctgattgagg ccggctaccc gatacgccgc atcgatgact 117301 ttgcggagtg gttgcagcgg tttgaggcca gcctgggcgc tctgccggat cggcaacgcc 117361 ggcactcggt gctgccgatg ctgctggcga gcaattccca gcgattgcag ccgcttaagc 117421 cgaccagggg gtgctccgcg ccgaccgacc gattccgtgc cgcggtgcga gcggcgaaag 117481 tcggctccga caaggacaat ccagacatcc cgcacgtgtc ggcgccgacc atcatcaact 117541 acgtcaccaa cctacaactg ctcggactgc tgtagttgct cggcgataaa gagcgcagcc 117601 atggtcgggg gagatcatgt ggtcactttc gggtcggcat cgattctgcg agcagaatat 117661 gtggttgatg gccactaggc cggtaccggg gaactggcgg ttcccggccg atgagcatcg 117721 gccctgacgc gcggccgtaa gctccaggaa tggggacgca cggggctacc aagagtgcga 117781 cgtcggctgt gccaacgccc cggtcgaact ccatggcgat ggtacggctg gcaattggcc 117841 tgctgggtgt gtgcgcggtg gtcgcggcct tcgggctggt gtcgggagcg cgccgctacg 117901 ctgaggccgg caatccctat ccgggcgcct tcgtcagcgt cgccgagccg gtcgggttct 117961 tcgccgcgtc gctggccggt gcgctgtgtc tgggcgcgct gatccacgtg gtcatgacgg 118021 ccaaacccga gccggatggc ttaatcgacg ccgcggcgtt ccggattcac ctgctggcag 118081 aacgtgtttc aggtctctgg ttggggctag ccgcgaccat ggtggtcatt caggccgccc 118141 acgatactgg agtggggccc gcgagactgc tggctagtgg ggcactatcg gactccgtcg 118201 ccgcctccga gatggcacgc gggtggattg ttgcggcgat ctgcgcgctg gtggttgcga 118261 cggcgctgcg gctgtacact cgctggctcg ggcacgttgt gctgcttgtc cccactgtgc 118321 ttgccgtcgt cgccaccgcg gtgaccggta acccgggaca gggacccgac catgactacg 118381 cgaccagcgc cgcgatcgtg ttcgcggtcg cgttcgccac cttgaccggg ctcaagatcg 118441 ctgcggcgtt ggcgggaacg acgccaagcc gcgctgtgct ggtaacgcag gtcacctgtg 118501 gagcgctcgc gttggcatac ggagcgatgc tgctttatct cttcatcccg ggctgggcgg 118561 tcgattcgga ttttgcccgc cttggtctgc ttgcgggggt aatcctgacg tcggtgtggt 118621 tgtttgactg ctggcggctg ttggtcaggc cgccacatgc gggccgtcgc cgcggtggtg 118681 gctccggtgc cgcactggcc atgatggccg ccatggcttc gatagctgcc atggccgtta 118741 tgaccgcgcc gcgatttctc acccacgcgt tcacggcttg ggatgtcttc ctcggctatg 118801 aactgccgca accgccgacc atagcccggg tgctcaccgt gtggcgcttc gatagcctga 118861 tcggagccgc tggtgtggtt ctcgcgatcg ggtatgcggc gggcttcgcc gcgctgcggc 118921 gccgaggtaa ctcttggccg gtgggcagat tgatcgcctg gctgactggt tgcgccgcac 118981 tggtattcac cagcggctcc ggtgtacggg cctatggttc ggcgatgttc agcgtccaca 119041 tggccgaaca catgacactg aacatgttca tcccggtcct gttggtgctc ggtggcccgg 119101 tcacgctggc gctgcgggtg ctgccggtaa cgggtgatgg acggccgccg ggggctcgcg 119161 aatggctgac ctggctgctg cactcccggg tgacaacttt cctgtcgcac ccgatcaccg 119221 cattcgtcct ctttgtggcc tcgccctata tcgtctattt cacaccgctg ttcgatacct 119281 tcgtccgcta tcactggggc cacgagttca tggcgatcca tttcctggtg gtcgggtact 119341 tgttctactg ggcgatcatc ggcatcgacc cagggccgcg ccgactgccc tacccgggcc 119401 ggatcgggct gttgttcgcg gtgatgccgt tccacgcctt cttcgggatc gcgctgatga 119461 cgatgtcgtc tacggtgggc gctacgttct atcgttccgt caatctgccg tggttgtcga 119521 gcatcatcgc cgaccagcat ctcggcggtg gaattgcttg gagcctaacg gaattgccgg 119581 tcatcatggt catcgtggcg ctggttaccc aatgggcgcg ccaagaccgc cgagtcgcgt 119641 cccgcgaaga ccggcatgcc gacagcgact acgccgacga cgagctggaa gcctacaacg 119701 cgatgcttcg cgagttgtcg cgaatgcggc gctgaatgtg cagatgattt tggaagcggt 119761 tggcgtatct gcccgtgctc ggctacacca ggaccgcggg gcgctggcac gcgaacgatc 119821 cggcgaggag gtgggccagc cggagattcc ctccacaggc tgcagcagaa gtcctggatc 119881 tgaccccgac ctgaaccctt gtcagtgcgg tccatcgacg gaaaattgct gttccgccat 119941 gctgggcatg ctattgagcg ccaaaattgc gtagccgcaa gctgtttgac acgacgaaaa 120001 atgacgagaa cgccatggcg gcaccggcga tcaaagggtt gagcagtccg gcggcggcaa 120061 tcgggatggc tgcgacgttg tacccgaacg cccagatcat gttcatccgg atcgtccgca 120121 tggttgcacg ggccaggtcc agcgcctgcg gaacagtatt cagatcatcg cgcaccagaa 120181 tgatgtcggc tgcaccgagc gcgacgtcgg tgccacgccc gatcgccaac cccaagtcgg 120241 cacccaccaa cgcgggaccg tcgttgatgc cgtcaccgac catggcgacg gtatgtcctt 120301 cctcgcggag ccgttggatc acgtcgacct tgccttcggg cagcatatcg gcgacagcgg 120361 agtcgatgcc gacctgcgcc gccaccgcgt cggcggcggc ccgattgtcg ccggtgagca 120421 gaatcgtccg cagcccgcgg ctgcgtagcg cagcgacggc ggcagccgct gaatccttga 120481 gggtgtcggc gattgtcagg gctgcgcgga cgacaccgtc gaccgacaca aaaacgacag 120541 tctcgcctcg ggattcgccg tccaggcgcg cggacaccag agccgcgtcg tggcagggcg 120601 tggtccgggt aatccaggat ggcttgccga cctcaacgtg atggccgccg acttcccccg 120661 atacaccgca gcccgcgacg gcgacaaacc cgttgactgg acccggatcc ggcgaagcgg 120721 caacgatggc cgccgccatc gcatgctcgg aagccgattc gacagcggcg gcgaggccaa 120781 gcacttcctc gcgatctcgc tcgctggtgc ctgaacctgc cattgttacg gtgctcaccg 120841 ccagctgccc aaccgtcaac gtgccggtct tgtcgaacac cacggtgtcg atgctccgga 120901 tggtttccag tgcccggtac cccttgataa agatccctag ctgcgctccc cgtccggaag 120961 caaccatcat ggcggtaggt gtcgcgagcc caagcgcaca cgggcacgcg atcaccaaca 121021 cccctagcgt gaccgagaac gcgcgatccg cgcctgcgcc gctgacgagc caggccgcac 121081 ctgcaagtcc agcaatgacg aaaaccaccg gcacgaacac gcccgcgatg tggtcggcga 121141 ggcgctgggc acgcgccttc tgcgtctggg cttgctccac gaggcggacc atcgcggcga 121201 actgggtatc ggcccctacc gcggtggcct cgatgaccag gcggccgtcc atcacgaccg 121261 tgccccccac gaccgaggcc gccggatagg cacggaccgg cttggcctca ccggtcatgg 121321 cgctcatatc gatcgccgcg ctgccgtcga caacgactcc gtcagctgcg atggtttccc 121381 ccggccgcgt cacgaagcgc tggcgcttct tgagttcgct cgccggtatc actagctccg 121441 cgccgtcggg cagcagcacc gccacattct tggcgcctag ctccgccagc gcacgcagcg 121501 cgctgccggc cttggacttg gctcgtgctt caaagtaacg accggcaaga acgaagacgg 121561 tcacaccggc cgcgacctcg aggtagatcg agtcgctgtt gagaatggcc cgccagattc 121621 ccgagccttc ccgtggcggc tgatcgccga agacggacga aagcgaccag gcggtggcgg 121681 ccacgatccc gaccgagatc agcgtttcca tggatgtcgt ccggtggcgc gcgtttcgca 121741 gcgcgaccga gtggaagggc catgcggccc aggtcacaac cggagcggcc agggccgtca 121801 atatgtatcc ccagccggga accctggcgc tggggacgat cgcgaacaac gtcgacaggt 121861 cagccagcgg cacgaacaac accgccgcga ctagcagccg ccgcagcagt ctgcgggcgt 121921 gggcgccgtc gggatccttt gtccgtttgt ctaggacggt tgtctcggtg tgcggtgccg 121981 cgtggtatcc ggctttctcg accaccccgc acagctcatc ggctgccatg cccacggcat 122041 cgatggtcgc gacgcgggtt gcgaagttga cggatgcgcg tactccgggg atcttgttga 122101 gcttcgtctc gacgcggctg gcacaggccg cacatgacat acccgaaaca tcgagccgga 122161 tccgccgcac cgactgcagg tcggcatctc ccacaactgg agccgccacg gccctcctcg 122221 gatcggcgta tttgcacccg tcagcctaca agtcgtaagc aggcggtaat cggttcccta 122281 tggcccgctg gatgcactgg cgatggattc ttttggtccg atttctgcgg ttggcgtgct 122341 aggtttccga ctgtgacgcc cgtcacaacg tttcctctcg tggacgcgat cctcgctggt 122401 cgcgaccgca accttgacgg cgttatcttg atcgccgccc aacacctgct gcaaacaacg 122461 cacgccatgc tgcgttcgct atttcgggtc ggcctcgatc cgcgcaacgt cgcggtgatc 122521 ggcaagtgct attccactca cccgggagtt gtcgacgcga tgcgggccga cggcatctat 122581 gtcgacgatt gcagcgacgc ctacgcaccc cacgaatcat tcgacaccca gtacacccgc 122641 cacgtagaat ggtttttcgc cgaatcctgg gcgcggctta cggccgggcg tacggctcgt 122701 gtcgtgctcc tcgacgacgg cggatcgctg ctagccgtcg ccggcgccat gctcgatgcg 122761 agcgccgacg tgatcggaat cgagcagacg tccgccggct acgccaaaat cgtcggttgt 122821 gcgctggggt ttcccgtcat caacatcgcc cgctcgtcgg caaagcttct atacgagtcg 122881 ccgatcatcg ccgcacgcgt gacacagacg gcattcgagc gcaccgcggg catcgactca 122941 agcgcagcga tcctgatcac cggcgcgggc gcaatcggca ctgccctggc cgatgtgctg 123001 cgtccgctgc atgaccgggt ggacgtgtac gacacgcgct ccggctgtat gacgcccatc 123061 gatcttccga atgcgatcgg cggctatgac gtgatcatcg gtgccaccgg cgccaccagt 123121 gtgcccgcca gcatgcacga attgctgcgc cccggcgtat tgctgatgtc ggcgtcttcg 123181 tccgatcgcg agttcgatgc cgtcgcgttg cgtcggcgca cgacgcccaa tcctgactgc 123241 catgccgacc tcagggtagc cgacggcagt gtcgacgcta ccttgttgaa ttcgggcttc 123301 ccggtcaact ttgacggttc gcccatgtgc ggcgatgcgt cgatggcgct cacgatggcg 123361 ttgttggcgg ccgcggtgtt gtatgcgtcg gtcgcggtcg ccgacgaaat gtcatccgat 123421 catccgcatc tcgggctgat cgaccagggc gacatcgtgg catcgtttct gaacatcgac 123481 gtcccgctcc aagctctcag ccggctaccg ttgctttcga tcgatgggta tcgccgcctt 123541 caggtgcgct ccggccatac cttgttccgc caaggtgagc gggccgacca cttctttgtc 123601 atcgaatccg gcgagcttga ggcgctcgtc gacgggaagg tcatccttag actcggtgcc 123661 ggagaccact tcggcgaggc gtgtttgctc ggtggcatgc ggcgcatagc gacggtgcgg 123721 gcatgtgagc catcggtcct gtgggagctc gacggcaagg ctttcggcga cgcgctgcat 123781 ggggacgctg caatgcgtga gatcgcctac ggtgtcgctc gcacccggct catgcacgcc 123841 ggcgcgtccg agtccttgat ggtgtaacgg tcttgcattc gtgggctgtc ggcggatcac 123901 gggatcgtta tgccggttct tgcgagtgac ataggttgac atacgtataa ccggtccctg 123961 cggtcgaaca cggcttgaca attggacgaa tctcgttgcg cgccatcagt tgtgctcaca 124021 ggatcgccgc cgttcggagc gatgagcccg cttggcgcgc gaagtgcgcc ggggcggatc 124081 ctgcccgagc cgcgcgacga cggcctcgat gcccgtcgcg gtcgatgacc ttgattcctt 124141 gggcgctgac ccgcaccttg atgcggcggt cctccgacgg taagtagtag gccttgagct 124201 ggatattggg cggccaacgt cgccgagtgc ggcggtgtga gtgcgacaca gccttaccga 124261 aacccacagt gcggccggtg atttggcagc gggcggacat ggcgaacctc ctcccggacc 124321 agcctgttga aaatagtttt cgacaaccgt tgcacggcac ggtagcgtgg gtgcagttta 124381 atggcaatca ttttcaataa ggtttggcga tgcgtactcc ggtgatattg gtggcaggtc 124441 aggatcacac cgacgaggtg acgggcgcct tgttgcgccg gaccggaacg gtggtcgtgg 124501 agcaccggtt tgacggccat gtggtgcgac ggatgactgc cacgctgagc cgtggcgaat 124561 tgatcaccac ggaggacgct ttggagttcg cccacggctg tgtgtcgtgc acaatccgcg 124621 acgacctgct ggtgctgtta cgcagactgc accgccgaga caatgtcggc cggatcgtcg 124681 tgcacctggc gccgtggctg gagccccagc ccatctgctg ggcgatcgac cacgtgcggg 124741 tttgcgtcgg acacggatac ccagacggac cagccgccct cgacgtgcgg gtcgcggccg 124801 tggtgacctg tgtggactgc gtaaggtggc tgccgcagtc actcggcgag gacgaactgc 124861 ccgacgggcg cacggtggcc caagtgacgg tcggtcaggc cgagttcgcc gaccttctgg 124921 tgctgaccca cccggaaccg gtcgccgtgg cggttctgcg ccgactggcc cctcgagcgc 124981 gaatcaccgg cggcgtcgac cgcgtcgagc tggcgctggc gcatctggac gacaactcac 125041 ggaggggtcg taccgatacc ccgcacacgc cattgctggc gggcctgcct ccgttggcag 125101 ccgacggtga ggttgcgatc gtggaattca gtgcccgccg cccgtttcac ccgcaacgtc 125161 tgcatgccgc ggttgacctg ctgctcgatg gcgtggttcg cactcgaggt cggctgtggc 125221 tggccaaccg gccggatcag gtcatgtggc tcgaatcagc cggtggcggt ctgcgggtcg 125281 catcggccgg aaagtggttg gcggcgatgg cggcctcgga ggtggcctat gtcgacctgg 125341 agcggcggtt gttcgccgac ctgatgtggg tctacccgtt cggagaccgg cacaccgcga 125401 tgacggtact ggtatgcggc gccgatccga ccgacatcgt caatgccctg aacgcggcgc 125461 tgctcagcga cgacgaaatg gcatctccgc aacgctggca gtcctacgtc gaccctttcg 125521 gcgactggca tgacgacccg tgccacgaaa tgcccgatgc ggctggggaa ttctcggcac 125581 accgcaactc aggagaatct cgatgaaacc ccggtatcca tcccgactac cagcccgtgg 125641 tacagacgcc gacactacgg ctcagcgcgc gctggatgct accgagggcg tcgataggtt 125701 ctatccaccc gcgtcagagt cttccgcgtc gtcagggcgt tcatcaggtt gcacgacacc 125761 gactgtgctt gccaaccact tcggtgccag cgctgagact gcggtggctc ctgccgtggc 125821 gctgaagacg cccgtccagg cgaccggtcc cagcggggta cacccgaaaa agtggctgat 125881 aaccggggtt tgaatgatgc cgaccaacac cccggcgctg cccagtgcgg tggcaatgac 125941 gagcggactg tgccggcgcg tcagcagtgt ctgcgctagc tgggtcatca cgagtgcagt 126001 caaacccatt gtcgccgtgc gtcgttcggt tccgggagtc cagcgcccga tggcccaggc 126061 tgccgttgcg ccggcggcgg tgacgacgcc gcggttaacg atctgacgca gcaatggcgc 126121 gtccagcgag ggcgtaggcc cgatcagcac cgcacgtcga tgttcgcgct gcgctcgctc 126181 ggccgcgtca tcggttgggt attcggcgtc gtcgggttcg gcaaactgcg aggtgacggc 126241 caccgcaagc gcgggaaaca tgtcggtgag cagattcacc agcagcagtt gacgagtccc 126301 caccggcgcc cgcccggccc cgaacgccgt cccgatgacg gtgaacagaa cttcgcccac 126361 attgccgccg accagaatcg tcaccgcgtc acgaacaccg gcccacatgc tgcggccctc 126421 gaccagcgcg tcgagcagca cgcccaggtc atcgtcggtc agcacgatat cggcggcccc 126481 acgggcggca gaggaaccgc gcccgctcac tccgatgccc acgtcggcca tccggatggc 126541 cgcggcgtcg ttggcgccgt cgccgaccat cgcggtcact cgcccgcagc gctgcagcga 126601 cgccacaatc tgaacctttt gttccgggct gacccgagca aagacttgca tgtcggcggc 126661 gagtttggca tgcgcctcct cgtccaggac ggcaagttcg gcaccggtca cgactcgcgc 126721 gtccgccggt agtcccagct ggcgggcgat cgcccgggcg gtgatcggat ggtcgccggt 126781 gatcagcacc acgttgcgct cggcgtccag caaggcttcg atcaacggac gcgaggaagg 126841 ccgggccgta tccgccaatc cgacatagcc gatcagctcg agatcgtgcg cgacggcgtc 126901 gacagcgtcg gcgtcggtct cgtcatcatg ggtggtcccg ttgtcccagg tgcgctgcgc 126961 gactgccaga acacgcaggc cctgctcggc gaggtggcgt accacggatt cggcatgttc 127021 gtggtcgacg cccgggtcgg cgagtcggca gcgcggcagg atcgtctccg gagcgccctt 127081 gagcatcaac atcggtatcc cgtcggtgcc cactctgccg atcgcggcgg cgtagccgcg 127141 actggactca aacggtactt cggccagcac cacccactcc gaatcgcctt ggctactaag 127201 cgaaccggcc agcgcactag ccgccgcgag gatcgcctca tcggtggcgt gcgcgtgccc 127261 ttccccgtta tggggctgcg tggacgcgcg cgcggcggcc cgcagcacct cggcggaggg 127321 cgcatcggtg gtctgcggca acggatcccg ttcggctgcg gtgctgctcg gtagcgcgca 127381 taccacccgc aggcggttct cggtgagtgt gccggtcttg tcgaaacata tggtgtcgac 127441 acggccgagc gcctcgatgg tgcgaggcga gcgcaccagc gccccacgtg ccgtcaggcg 127501 ctgggcggcg gcaagctggg agagggtggc caccaacggt agaccctccg ggaccgcggc 127561 caccgcgatg gcgacgccgt cggccaccgc ttgccgcagc gacgcccggc gcagcaacgc 127621 cagagctgtc accgcggcgc cgccggccaa cgtcatgggc agcactttgc tggtcagctc 127681 gcgcagccgg gcctggactc cggccgccgt ttcgacatcg gcgaccgccg agatcgcgcg 127741 atgtgcggcg gtgccgactc cggtggctac cacgatcgcg cgggcgtgtc cggcgacgat 127801 ggtgctgccc tcaaacagca tgctggcccg gtcggggtcg ttgacggcga cggggtccac 127861 ctgcttgtcc accggtagcg actcgccggt aagaaaggac tcgtcgacct cgaggtcttc 127921 ggccaccagc aggcgcgcat ccgccgggac cacctccggc gcggccaggt cgatgacatc 127981 gccgactcgc agcgacttcg ccgacaccgt ggccgtccgg gtggcgtgcc gggccgcctc 128041 cagtcgacgt cgggtagtcg ctaccgccgg caccaccacc cggcgcacca gctggtcctg 128101 ctcggcgaat agctcggcgg ccgccgcctc ggctcgcaat cgttgtaccc caccggtgat 128161 cgcgttgacc gtcatcacgc ccgctaccag tagcgcgtcg atattgctgc cgacaatcgc 128221 cgatgctgcg gcgcccaccg ccaggatcgg agtcagcgga tcggccagtt catggcgggt 128281 ggccaccgcc agctgcgcca aggttcgtgc cgggccgcgc agcggcgcca tcaccggttc 128341 gtaggacagg tcgtccagaa tgcgccgcca ggccgggatt ccgggttcga cggccaaggg 128401 tcgggagccg ccggctagcc gcgagtagac gatctcgggg tccagcgcgt gccaggcggt 128461 cagcggttgc ggggtggggt cgggcatccg cagcaccttg gcggccgacc acattccgga 128521 caccaaagcc gttgcggcag cggcattgac cggattgagc cagcgacgga agctggctgg 128581 gttggtggtt ttgtcctgct caccggtgac caacaacagc ccggccaagg tggtgccacc 128641 ttgggcgagg tgtaccgcgg attcactggc tgcccgggcc accggaagcg ctgacaggat 128701 ccgcaccgcc gcggccagat cggtgccggt gattaggtcg gcagtccatg gtgttgcccc 128761 gcggggatcg tcgagagcca caccgacgtc agcgatggcc aacgcggcca acgtatcggt 128821 ggatgcgaag tcccggtgca ccgcggtgat cagcaatacc ggtccgcgat ccgcgcgcaa 128881 ctcacgcacc aacttcagca acggcgtgcc aggcggatgc gtcgaaccga cgctggccga 128941 tagatcttcg gtgcccgcga catggcgcaa aaccacccgc gctccggttc ggtgcgcggt 129001 ctgcagcagc gggattgcgt atgggtcgac ttcccacccc acgtcgacgc tgcccacgca 129061 ttggccatcg accaccaggt cggcatgctc gaggccctga gccggcgttg ccgacggccc 129121 ttgagccgga gcccatctca agcgggcacc ggtagccggc aattcatcgg ggtcgggttc 129181 gggtgcctgc tcgccatgga gcaaggcgtc ggcgacctcg tagacgcggt cgtcgtccca 129241 gccgggttcg tctccctgtg catgcaatac ggcgcggttg tcaccgcgca gcgctgcgcc 129301 gtcgataacg accacccgca cccgatccag gcggcgcaac gcgccggggt cgaggactag 129361 ttgcccggtg ttggcgagcc cgcgacccag caccgccgcg aacgcctgtc ggcccatgtg 129421 cgcggcacgt ggtactccgg ccaggatcgc gccggccgcg tcctcggtac caccgccggc 129481 caccagtgca ctggccgcgg cgatcaacga accgtttgcg gcctggttga cgtattgctc 129541 caccggcccg gcccttgacc ctttggcggt gtcgatcgcg gcgtcgatgg atccgccgac 129601 cacgacgtgc gaagcctcgc ctgccgccgc agccgcccaa ctgtgtcgcg gctcctgcga 129661 tttggccccg gccgacgaga tgatgggcac caccggagcc tgcggccgtc tgggggaggc 129721 cagcgcgggt tcgcggtcac gccatacgcg acggtgcgcc gccgcttcgg agatttgcag 129781 gctgcgttgc accagatcga gcagcggtgt gcccagggac tgggttagcc cgttggccgc 129841 cgccgtcgta gcggcgagcg caatatcggt gcccactcga cccaaccgcg actccataag 129901 cgacaccatt cgcggttgat ggtttatcag agcggccagg gctctggtgg tttgcggtgc 129961 ggcgggcagt cgggcgaccc agccggtgac cgtggcaccc atcgctacca gatccattgc 130021 cgcagcggtc aacggcacca ggatcgccaa ggggttacct gggtcggcga atggtgccga 130081 gttcggcgac gacaccgacc cagccaggaa tatgtcggca gccaccgcgg aaaccacatc 130141 acgtacctcg tccacggcga tgtcgctatc cacatcaggt tcaagttcga ccaccagccg 130201 acccaatgag ccctcaacgt gggcctcggc cacgcctggg atcctgcgga ctggctcctc 130261 caccatggcg gcatgctcgt gccagcgagg gaatggcagt agcggatcca ggtcgaaatg 130321 cacgcgccgt ccgctgcgcc aacgcaccgg cggtgtcatt ccgtcaggcg attcgttgtg 130381 ggaaccgcgt accccgattg cgcgaccagt cgtttgcacc accgactgca ccaccgggcc 130441 ggtcagctcc agcaccgggc tggccagcgt ctgcaccgcc gcggccgcac tccccggcaa 130501 gcgggcgccg gctcgcactg tctgtgctac tccgttggtc acaccaccga ggacagtggc 130561 cacacccggg atcttcactg agtcaccctt caactaccga taccgcgcct aatcctgatg 130621 gcgtatcagc gccatgtcta ccgacttgcg catacttcgc cgggtgaggt cgccggtgaa 130681 ggcagtccgg acccctttgg tctgcgagcg atgaatgcag acgccgtgtc ggatctagct 130741 tgagtacggg cgggcccgtg acgcgccggt ggcgggcacg tgaaaccgac ccaaacgatc 130801 ccaacgacgc ggcaacgcct ggctaacggc tcacggatcg aatcagtgga tgcggtgggg 130861 tccgtgaatc agccggcaag cggccaagcg ttgcattgtg cggccgacat tggggggccg 130921 acgaaatcgg ctcacaaaat gcggtggtgg gccctgccga cgtggtaacc cgccgggaag 130981 gacttctcga tgacctgcag ctggtggttc actctcgact ccagctcgag caccacgcag 131041 ctgtcaccga cggtcttcga ctcaaggacg atgtaccgct gaccgccacc gttgacatcg 131101 gtgatcgggt caccggagtg caatgtttcg acgggcacca aatccggatt tccgtttgac 131161 tccacgcgaa acacagtaag gcaattaagc cgactaggga acactctgcg tgggtgcgcc 131221 acctcgacgc ggaggcacca gcgggttggc cgcggcgggc tcgctcttgg gtggtcgcca 131281 gtgattgtga ccagctgccg gagcgggaat gcgtgttgga cagccgaatc cccgcaattg 131341 gcgcaacgtg ccccggaagc cgcgctacat ttggctccta gccaccagac agcttgcccg 131401 aaaacggcag aggtccctga tgtcgctttt gatcacatca ccggcgacgg tggctgcggc 131461 ggcaacacat ctggcgggta tcggatcggc gctcagcaca gccaacgcgg cagcggccgc 131521 tccgacgacg gcgctatcgg tcgcgggtgc cgatgaggtc tcggtgctga tcgcagcgct 131581 attcgaggcg tacgcccagg agtatcaggc gctgagtgcc caggcactgg cgttccacga 131641 ccagttcgtg caggcgctca acatgggtgc ggtttgctat gcggccgcag agacagccaa 131701 cgcaactccg ctgcaggctc tgcagactgt gcagcagaac gtcctcaccg tggtcaacgc 131761 gcccacccag gcattgctag gtcgaccaat catcggcaac ggtgccaacg ggttaccgaa 131821 caccgggcaa gacggtgggc ccggcgggtt gctgttcggc aacggtggca acggcggatc 131881 cggcggggtg gatcaggccg gtggtaacgg cggtgcagcc ggcctgatcg gtaacggcgg 131941 gtccggcggc gtcggcgggc cggggatagc tggcagtgcg ggcggggcgg gcggcgccgg 132001 tgggctgctg ttcggcaacg gcgggcccgg cggggccggt gggattggca ccaccggtga 132061 cggtgggcct ggcggtgccg gcggtaacgc catcggtctg tttggcagcg gaggtaccgg 132121 cgggatgggc ggcgtcggcg gcatgggcgg tgtcggcaac ggcggcaacg cgggtaacgg 132181 cggcaccgcc ggactgttcg gtcacggcgg ggccggcggt gccgggggca tcggcagcgc 132241 cgacggcggg ctcggtggtg gcggcggcaa tggccggttc atgggcaacg gtggggtcgg 132301 cggtgccggc ggctacggcg ctagcggaga cggcggaaac gccggcaacg gcggcttggg 132361 cggcgtgttc ggcgatggcg gggccggtgg taccggcggt ctgggtgacg ttaacggcgg 132421 gcttgccggt attggcggta acgccgggtt cgtcggcaac ggcggagccg gcggcaatgg 132481 ccagctcggc agcggcgcag tctcctcggc gggtgggatg ggcggcaacg ggggcttggt 132541 gttcggcaac ggcggccccg gcggtctagg cgggccgggc acgtcggccg gcaacggcgg 132601 tatgggcggc aacgctgtcg gactgttcgg ccagggcggg gccggcgggg ccggcgggtc 132661 cggattcggg gccggtattc caggtggcag gggcggtgac ggcggtagcg gcgggctgat 132721 cggcgacggc ggcaccggtg gcggtgcagg cgcgggtgac gctgctgcat cggccggtgg 132781 taacggtggt aacgcccggt tgatcgggaa cggcggtgac ggtggcccgg gcatgttcgg 132841 cgggcccggc ggagctggcg gcagcggcgg cacgatattc ggcttcgccg gaacccccgg 132901 gccgagctag gcgtgttgca tcccgcccaa cggcgcaggc aacaatggtg cgatgagtgg 132961 cgccagctca tcggagtcgc ccacctgcta tcgccatccc gggcgccgga cctacgtccg 133021 ctgcacccga tgtgatcggt acatctgtgg cgaatgtatg cgcgtgggtc ccgtcggcca 133081 ccagtgcgcg gagtgtgtgc gcgaaggcgc ccgggcggtg cggcagcctc gtaccccatt 133141 cggcgggcgg cagcggtcgg caactccggt ggttacatac acgctgatct cgctgaatgc 133201 gctggtgttc gtcatgcaag tgaccgtgat gggtctggaa cggcagctcg ctttgtggcc 133261 acccgcggtc gccagcggtc agacctaccg gttggtgacc tcggcgttcc tgcactacgg 133321 ggcgatgcac ctgctgttga acatgtgggc gctgtatgtg gtgggtccgc cgttggagat 133381 gtggctgggc cggttgcggt tcggcgcgct gtatgcggtg agcgcgctgg gtggctcggt 133441 gttggtctat ctgatcgcac cgcttaatac ggcgacggcg ggggcatcgg gggcggtgtt 133501 cggtcttttc ggtgccacgt tcatggtggc caggcggctc caccttgatg ttcgttgggt 133561 cgtcgcgctc atcgtgatca acttggcttt cacgttcctc gcgccggcga tcagctggca 133621 ggggcacgtc ggcgggctgg taacgggtgc gctggtggca gcgacctacg tctacgcgcc 133681 cagggaacgt cggaacttga tccaggccac agtgacgatc accgttttgg ttgcgttcgt 133741 cgtgctgatc ggctggcgca cagtcgattt gctcgcactg ttcggtgggc gcctcaacct 133801 gagctgaaca catcaaaacc gatagccgct tgtcttcgcg tgtcttcggg gaatccgacg 133861 cggtcacatc taaacttgcc acgatcaaga ggaggggcag cgacgtatcg gcagcaagca 133921 ctgcgccgga cgacgaagtg gtcagggcgc gctaacagcg agagctgagc cgggcgggat 133981 tcactccgtg ccggcacgtt ctgttccccg gccccgttgg gtggccccgg tgcgccgggt 134041 cggtcggctg gccgtatggg atcggccgga gcggcgcagc ggaattccag cgttagatgg 134101 ccttcgtgcg atagcggtcg cgctggtact cgccagccat ggcggcatcc ccggtatggg 134161 cggcgggttc atcggcgtcg acgccttctt cgtcttgagc ggatttctca tcacctcgct 134221 gctgctcgac gagctggggc gcaccggtcg tatcgatctg agcgggttct ggattcgccg 134281 tgcgcggcgg ctgctgccgg cgctggtgct gatggttctc accgtgagcg ccgcacgcgc 134341 actatttcct gaccaagctc tcaccgggct acggagcgat gcgatcgccg cgttcctatg 134401 gacggcgaat tggcggtttg tggcccaaaa taccgattac ttcacccagg gcgctccacc 134461 ctcgccccta cagcacacct ggtcgttggg ggtggaggag cagtattacg ttgtctggcc 134521 actgttgctg atcggggcga cgctactgtt ggcggcccgg gcgaggcgcc gttgcagacg 134581 ggccacggtg ggcggggttc ggttcgccgc gttcctgatt gccagtctcg gcacgatggc 134641 ttccgccacc gccgcggtcg catttacctc ggcggccacc cgcgaccgga tttacttcgg 134701 caccgatacc cgtgcgcagg cgttgctgat cggctccgcg gcagcggctc tgctggtgcg 134761 ggattggcca tcgctgaacc gcgggtggtg cctgatccgg actcgctggg gacggcggat 134821 tgcccgtctg ttgccgttcg tcgggctggc tgggctggcg gtgacgactc acgtcgcaac 134881 gggcagtgtg ggcgagttcc gccatggtct gctgatcgtg gtggcaggtg cggccgtcat 134941 cgtggttgcc tcggtagcca tggagcagcg cggagcggtg gcccgcatcc tggcctggcg 135001 accgttggtg tggctgggca ccatatcgta cggcgtctat ctgtggcact ggccaatctt 135061 tctggcgctc aacggccaac gtacgggctg gtcgggcccg gccctgtttg ccgctaggtg 135121 tgcagccacg gtggtgctgg ccggtgcgtc gtggtggctg atcgagcaac ctattcggcg 135181 ctggcgaccg gcacgggttc cgctgttgcc gctggcagcg gcgaccgttg ccagcgctgc 135241 cgccgtgacg atgctcgttg ttccggtcgg agccggaccg gggctacgcg agatcggcct 135301 tccgcccggc gtttcggcgg tcgccgcggt ctcgccgtcg ccgccggaag cgagtcagcc 135361 cgcgcccggg ccacgagatc ccaaccggcc gttcaccgtt tcggtattcg gtgattcgat 135421 cgggtggact ttgatgcatt acctgccgcc gactcccgga ttccggttca tcgaccacac 135481 cgtcatcggc tgcagcctgg tacgcggcac accgtatcgg tacatcggtc aaaccctgga 135541 gcagagggcg gaatgcgacg gctggccggc cagatggtcg gcgcaggtca accgggacca 135601 accggacgtt gcgttgctga tcgtcggccg ctgggagacg gtagaccggg tcaatgaggg 135661 gcggtggaca catatcggcg acccgacctt cgatgcgtac ctcaacgccg agctacagcg 135721 agcgctcagc atcgttggat ccaccggggt tcgagtgatg gtcaccaccg tgccctacag 135781 ccgcggcggc gaaaagccgg acggccgctt gtatccggag gatcaacccg agcgtgtgaa 135841 caaatggaac gccatgttac ataacgccat tagccaacac tcgaacgtcg gaatgatcga 135901 cctcaacaaa aagctttgtc cagacggcgt ttacacggcc aaggtcgacg gcatcaaggt 135961 ccgcagtgat ggtgttcatc tcacccagga aggcgtgaag tggctgatac cgtggcttga 136021 ggattcggtg cgggtcgcca gttaatccgc cgtgtgctcc ggatgagcgc gacggtaacc 136081 ctggaattgt gctgtgtgct ggctgtgtcg ttgtgatgag cctgtctaag tggtgcgtaa 136141 ccgtttgacg agccgcggcc tcgctgcaaa cattgaagcc cgcacgtctg ggtttgtatt 136201 tacacaacga gggcgctccc cgatctggcg cgcgcaacga ggtgcgcact atccattcga 136261 ggtgaactgg actccttgat gctcaggccg gtgcggtttg tcgagaaagg cgaataggaa 136321 cagtccatga aagtgtggat cactggggct ggcggaatga tggggtcaca tctcgccgaa 136381 atgttgctgg ccgccggaca cgatgtgtac gctacctact gcaggccgac catcgatccg 136441 tcggacctgc aattcaacgg agcagaagtc gatatcaccg actggtgctc ggtctacgat 136501 tcgatagcga cattccgccc cgacgcggta tttcatctcg cggcccaaag ctatccggcg 136561 gtttcgtggg cccggccggt tgagacgctg accaccaaca tggttggcac cgccatcgtt 136621 ttcgaagcac tacgtcgcgt gcgaccgcac gcaaagatta ttgttgcggg ctcgtcggcc 136681 gaatatggat ttgttgaccc atccgaggtt ccgattaatg agcggcgaga acttcgcccg 136741 ctccatccgt atggtgtttc taaggcggcc accgacatgc tggcgtatca atatcacaag 136801 tcttacggca tgcacaccgt cgtcgctcgt atcttcaatt gcaccgggcc acgcaaagtc 136861 ggagatgcac tttccgattt cgtccgccgt tgtacatggt tggagcacca tccggaacaa 136921 agtgccatcc gggtgggaaa tcttaagacg aaacggacta tcgtggacgt ccgcgatctc 136981 aatcgggcgt tgatgctgat gctggataaa ggcgaggccg gggctgacta caatgtggga 137041 ggttcgatcg cctacgagat gggcgacgtt ctcaaacaag taatcgcggc ttgtaaacgt 137101 gacgatatcg tgccggaagt cgaccccgcc cttcttcggc ccaccgacga aaagatcatc 137161 tacggagatt gcagcaagct ggcggccata acaggctggc aacaagaaat ctgtttgact 137221 caaacgattg ccgacatgtt cgattattgg cgtagcaaat ccgagtccgc cctgatggtg 137281 tgaccgaatg tctttgtcct gccaacctga ggagcagata agattgaccg taacggactc 137341 tcagtatcga caaaaggtgt gcaccgcgag aactgctgag gagatctttg tagagacaat 137401 cgctgtcaag acacgcatcc tcaatgaccg ggtcttgctg gaagccgctc gcgcaattgg 137461 ggaccgcttg attgccggct atcgtgcggg agcacgcgtc ttcatgtgtg gcaacggtgg 137521 tagcgctgcg gatgcgcaac attttgccgc ggagctaacg ggtcacctga tctttgatcg 137581 gccaccgctt ggcgccgagg cactccacgc caattcgtcg cacctaacag cggtggccaa 137641 cgactatgac tacgacaccg tttttgccag ggccctcgaa ggatctgcgc gtcccggcga 137701 cacgcttttt gcgataagta cctccggcaa ttctatgagt gtactgcggg ccgcgaaaac 137761 cgcaagggag ttgggtgtga cggttgttgc aatgacgggc gaatccggcg gccagctggc 137821 agaattcgca gatttcttga tcaacgtccc gtcacgcgac accgggcgaa tccaggaatc 137881 tcacatcgtt tttattcatg cgatctccga acatgtcgaa cacgcgcttt tcgcgcctcg 137941 ccaataggaa agccgatcct tacgcggcca ttcgaaagat ggtcgcggaa cgtgcgggac 138001 accaatggtg tctcttcctc gatagagacg gggtcatcaa tcgacaagtg gtcggcgact 138061 acgtacggaa ctggcggcag tttgaatggt tgcccggggc ggcgcgggcg ttgaagaagc 138121 tacgggcatg ggctccgtac atcgttgtcg tgacaaacca gcagggcgtg ggtgccggat 138181 tgatgagcgc cgtcgacgtg atggtgatac atcggcacct ccaaatgcag cttgcatccg 138241 atggcgtgct gatagatgga tttcaggttt gcccgcacca ccgttcgcag cggtgtggct 138301 gccgtaagcc gagaccgggt ctggtcctcg actggctcag acgacacccc gacagtgagc 138361 cattgctgag catcgtggtt ggggacagcc tcagcgatct tgaattggca cacaacgtcg 138421 ccgctgctgc cggtgcatgt gccagtgtcc agataggggg cgccagttct ggcggtgtcg 138481 ctgacgcgtc atttgactcg ctctgggagt tcgctgtcgc agtcggacat gcgcgggggg 138541 agcggggcta atggcgatct tgcgcgggcg agcgccgttg cggctcggac tcggcggtgg 138601 cgggacagac gtggaaccgt actcgagcca gtttggcgga cgaattctta gcgtaaccat 138661 cgacaaatac gcctacgcgt tcgcggagcg cggaacagga gatgagatcg cctttcgctc 138721 gccggaccgc gaccgagccg gccaggcctc gatcgacgat ctggcgtctc tcgaagaaga 138781 ctttaccgtt gcacgtcgcc gtctaccggc gggtgattgc ggagttcaac ggtggtacac 138841 cgtttccgct ccagctggcg acgcaggtgg acgctcctcc cgggtcgggg ctgggctcgt 138901 cgtctgcttt ggtggtggcg atgcttctca cgacatgtgc gctcatcggc tcgtcgccgg 138961 gcccatacga gctggcgcga ctggcctggg aaatcgaacg ggttgatctc ggcatggccg 139021 gtggttggca agaccactac gccgcggctt tcggcggctt caacttcatg gagtcccgcc 139081 ccaacggaga agtcgtggtg aatccgcttc ggatacggcg ggaggtgatc gccgaactgg 139141 aagcttccct tcttctgtac ttcggcggcg tctccaggct gtcgtcggaa gtcatcgccg 139201 atcaacaacg caatgtcgtc gagcgagacg cggacgcgct tgcggccact cactcgatct 139261 gcgccgaggc actcgaaatg aaggatcttc tcgtggtggg tgacataccc ggcttcgccg 139321 attcactgct tcgcggctgg caagcgaaaa agcggacgtc aacccgaatc tcgaaccccg 139381 caatcgagca cgcttaccag gtcgcgcagt ccagcggcat ggtcgccggg aaagtctcgg 139441 gtgccggtgg gggtggcttc ctcatgatga tcgtggaccc gcgtcgccgt atcgaagtcg 139501 cacgcagcct cgaacgagag tgcggaggat cggtggctcc ttgcctgttt accaaaggcg 139561 gagcggtgac ctggcatatc ccagagtcca cggcaccccg taaggcgtgg agttgctgat 139621 gccgtggctt cagcgctcgg aaacgctgga atcttgctgt gtgctggctg tgtccttgcg 139681 acgagccact cgacttggcg cgtaccggtt tgacgatcgg ggagcccagt gcaagcatga 139741 gaccccgcaa gcaccgggcg ctgacgcctc ttcgtgaggt gactgagacg acacccccgt 139801 gtgtcctggc cgtgaggagg tgagggcgag atgagtccga gcgacagtcc cgatccgaca 139861 ttcgtcttgt cccgatctgg ctccggcatt ctttctgcct tctgagcttt cgcgagttac 139921 tgcgcatgtc cgatgtggcg cagttgtggc gttctgaatg acgcacgctg atcgggcttc 139981 ctgcaggaga agaacatgac cacgatgatc atgacttttg ttgttccaca acgtgttacc 140041 cgtgcgacga aagggcgggc acggtcgctg ctgcgggtga gtcggcgtct gacggacacg 140101 tttcgcgcac cgctcgcctg gaccccgcag gagcgggccg accggtatgt ggcacgtatg 140161 ccgatcgcgg tgattgcgga ctgagcgggc gtcggcgcgt ggcgcggtta cccgttggac 140221 cggcgctagc ccaacccgcg cgcgcgtgtt ggtacaccga cacgctgtct gggccctaca 140281 actgcgcacg ctcgcggcca gtgccgctag ccgaccacct caatgggatc accaacggtg 140341 acggcgtcga agtaccatgc cgcgttgtcc gggctgaggt tgatacagcc gtggctgacg 140401 ttggcgtatc cctgcgagtt gaccgaccag ggggccgagt gcacgtacac gccgctccag 140461 gtaacacgaa ccgcgtagtg ggcggtgagc agatacccgt ccgaggaatt cagcgggatg 140521 ccgatggtac gcgagtccat cacgaccgtg cgctccttgg acattgcgtg aaagctaccg 140581 attggtgtcg ggcggctggg cttgcctaac gacgcgggca tggtgcggag gacttctccg 140641 tttctgctga ccgtgaaggt atgtgccgag atgctggcaa ccccgatcag tgcgtcaccg 140701 gtctcgaatc cttcggtcag ttcctgcaca cccaccgaga cacgggtgtg aggtggccaa 140761 taccggtggg gcacccaccg cacgacattg ctagcgaccc actcgaagtg tccggtcgtg 140821 ttgtgcggtg tgctgatgcg gatggaccgc tcgacggcgc ggcgatcggt cacgggcgtg 140881 gtgaatgtca ccaccaccgg gtgcgccacc cccaccacgg caccattagc cggcgacacc 140941 gacgcaacgc ctgggatcgg ttggagtggc gggaccgcgg cggtcgctat gctgactgat 141001 tccgcggtga gcatcagcgt gatcgcgacc acaacggata gataacgaac cactcgacgc 141061 atggcgtcca ccctcccgag atggtgcgat cgacacacga cattctagtg accatcgacc 141121 cattgcgggc cgagcaagca gtttctggat agccccgccg ccccgcgggt gcggattggc 141181 aggccgcgcg gcctcgcgtt agcctcagcg gaatcggtgc caaggccgag gaggtgcggg 141241 tgctcttccg tcagctggag tacttcgtcg cggtcgccca ggagcggcat ttcgctcggg 141301 ccgctgagaa gtgctacgtg tcgcaacctg cgctgtcttc ggcaatcgcc aagctcgaac 141361 gcgaactcaa cgtcaccctg atcaatcgcg gacacagttt cgaaggcctt actcgcgagg 141421 gtgagcggtt ggtggtatgg gccaagcgga tacttgccga gcacgctgcg ttcaaggccg 141481 aggtggatgc ggtgcggtcc gggataaccg ggacgcttcg gctaggcacg gttcccaccg 141541 cgtcaacgac ggcatccctg gtgctgtcgg cgttttgctc ggcgcacccg ttggcgaagg 141601 tgcaagtctg ttcccggctg gctgcgaccg agctgtaccg acggctgcgc gaattcgagc 141661 tcgatgccgt catcgtgcac cccgagaccc aagacagtga tgatgttgat ctggtgccgc 141721 tctatgagga gcagtacgtg ctgttgtcgc cggcggatat gctgccgccg gggacatcga 141781 cgttggtgtg gcgggatgcc gcgcaactac cgttggcatt gctcactgcg gatatgcggg 141841 accgccaggt tatcgacgcc gcgttcgccg accacgcggt ctcggcgatc ccgcaggtcg 141901 aaaccgattc cgttgcttcg ctgttcgcac aggtggcaac cggcaactgg gcgtccatcg 141961 ttccgcacac ctggctatgg gcaatgccaa tgagcgggcc gacgggtggt gagatccgcg 142021 cggtcgaatt ggtcgatccg gtgctgaaag cccagatcgc cctggctacc aacgccttgg 142081 gaccgggatc tccggttgcc cgagcgctca taacatgcgc gcaggcgctg gcgctgaacg 142141 aattctttga cacgcagctg cgggggatca cccgtcgccg ctgatcgcgg gcgtcgctgc 142201 gctggtagtg ttcagcttcg ccaggtggcc gctctccacc ccgtctgcag ggtcgagttc 142261 gcagtcgatg agtgacggtc cgttcgatgc cagtgcatcg gtcagcgccg actccagttc 142321 ggttggggtg cttacgtgat atcctttgcc gccgaacgcc tctgcgatca gttcgtgacg 142381 tgcatgagcg ttcagcacgg tgggcgctgg gtcgtgtcgc cacaccgggg cggccgacct 142441 aaagatcgtt gcctcgtcgc cgcggtagac gccgccgttg ttgaggatga cgacggtgac 142501 cgggagtcgg tatcggcaga tggtctcgaa ctccatgccg ctgaagccaa atgcgctgtc 142561 gccctcgatc gccacgacag gtcgcccggt ctcgacggcg gccgcgatgg cgtagcccat 142621 gccgatgccc atcacgcccc aggttccgct gtcgagccgg tgccgcggta ggtgcatgtc 142681 gatgatgttg cgggcgaggt ccagcgcgtt ggcgccttcg ttgaccacat agacatccgg 142741 gttgcgttgc agcacagacc taatggcacc aagcgcgttg tagaaccgca tcggatgatg 142801 atcgtcggcc aaccgccgac gcatcttggc actgttgcgg gccttgcggt cggcgagctc 142861 gccggtccac gccgccgagg ccacgctcga acgatcggcc gcagcttcga ggagcgccga 142921 cattaccgag ccgatgtcgc cggtcagcgg tgccacgatc ggccggttgc tgtcaaactc 142981 cgacgcctcg atatcgacct ggatgaactt ggcatcggcc gaccattgcg gcgactctcc 143041 gttgcctagt agccaattca gccgagcgcc aaccagcagc accacgtcgg cgcgggccat 143101 cgccagcgaa cgagccgcag ccgccgactg cgggtgtgag tcgggcagca gccccttggc 143161 catcgacatc ggcaggaagg gaatgccggt gtgctccaca aactcccgaa taacgttgtc 143221 ggcctgcgca tatgccgcgc ccttgccgag cacgagcagc ggtcgctgcg cttgggcgag 143281 cacgtccagc gcgcgatcaa tcgcctccgg tgccggcagt agtcggggag ccgggtccac 143341 cggccgccaa atggcgccgg aagcagccga tgcctcaacg gcctggccca gcacatcgcc 143401 ggggatatcg aggtatacac cgccgggccg cccggaggtc gcggtgcgaa tggcgcgcgc 143461 gacgccgcgc ccgatgtcct ggacttggcc gatccgatac gccgccttca cgaacggtcg 143521 agcggcgttg agctggtcga ggtcctgata gtcgccgcgc tgcaggtcga ccatcggccg 143581 gctgctcgat ccggagatct ggatcatcgg gaagcagttc gtggtggcgt tcgccagcgc 143641 gggcaggccg ttgagaaagc cggggccgga cgtcgtcaga cacacgccgg gccgtgcggt 143701 gaggaacccc gcggcggccg ccgcattgcc cgctgatgct tcgtggcgga aaccgatata 143761 gcggatcccc gaggcttggg cggcgcgagc caggtcggtg atcgggatgc cgacaacgcc 143821 gtagatggtg tcgacgtcgt tggctttgag ggcgtccacc accaggtggc agccgtcggt 143881 cagcactgtg cagggagatg ccgatcgtgt ggtcatggtg ttcactgttg tccggggcgc 143941 cggccgtgtc caagaccgag tcactatgca gcgatttacg cggtctatca accgttagcg 144001 gatcggtatt ggacgccggg caggcgagcc cggcactgtg ctgatcgtgc cgaacccgca 144061 caccgaacac atggaaggag cgttcgcgat ggcatccgac ttcggcccgc gcatcgccga 144121 tcttgtcgag gtggcggcga cccggctgcc cgaggctccg gcgctcgtcg tcaccgcgga 144181 tcgcatcgcg atcagccacc gcgacctggc ccgtctggtt gatgagctgg ccggccagct 144241 gacgcggtcc ggcctgctgc ccggtgaccg ggtcgcgctg cgcatgggca gcaacgccga 144301 attcgtcgtc gccttgctgg cggcgtcgcg tgcggatctc gtcgtcgtgc cgctggatcc 144361 ggcgctgccc atcaccgagc aacgcgtccg aagccaggcc gcgggagccc gggtggtgct 144421 gattgacgcg gatgggccgc acgacagggc agaacccacc acccggtggt ggccgctcac 144481 ggtgaacgtc ggcggtgaca gcggcccctc gggtggcacc ttgtcggtcc acctggacgc 144541 cgccaccgag ccgaaccccg caacctcgac gcccgaggga ctgcgacccg atgacgccat 144601 gatcatgttc accggcggga cgaccggcct gccgaagatg gtcccctgga cgcacgcaaa 144661 catcgccagc tcggtccgcg ccatcatcac cgggtaccgg ctgagcccgc gggacgccac 144721 cgttgcggtg atgccgctct accatggcca cgggctgatc gcgtcgttgc ttgccaccct 144781 ggcgtccggc ggcgcggtgt cgctgcccgc acgcgggcga ttctccgcgc acaccttctg 144841 ggacgacatc aaagccgttg gagccacctg gtatacggcg gttccgacga ttcaccaaat 144901 cctgctggag cgatcggcaa ccgaaccgtc ggggcgcaaa cctgccgcac tgcgtttcat 144961 ccgcagctgc agcgcaccgc tcactgccca agccgcgcta gcactgcaaa ccgagttcgc 145021 ggcaccggtc gtgtgtgcct tcggcatgac cgaagccacc caccaggtaa cgacaacgca 145081 gattgagggt atcgaccaaa ccgaaactcc cgtcgtgtca accggtctgg tcggccggtc 145141 gacgggagcg caaatccgga tcgtcgggtc cgacgggctg ccactgcccg cgggcgcggt 145201 cggggagatc tggctgcggg ggaccaccgt ggtacgcggg tatctgggtg acccgacgat 145261 aaccgccgcg aatttcaccg acggttggtt gcgtaccggt gatctcgggt ccctgtcggc 145321 ggccggtgac ctgagcatcc gcggccgcat caaggaactc atcaaccgag gtggtgaaaa 145381 gatctcgccc gagcgcgtcg agggcgtgct ggccagccat ccaaacgtca tggaggcagc 145441 cgtattcggc gtcccgcacc agctctacgg cgaggcggtc gcggcggtga ttgtgcctcg 145501 tgagtccgcc ccgccgactc gcgaggagct tgtccagttc tgccgggaac ggttggcggc 145561 cttcgagatc ccggcctcct tccaggaggc cagcgggctg ccgcacaccg cgaagggttc 145621 gctcgaccgc cgcgctgtcg ccgaacggtt cggccattcg gtgtagctag ccggccccgg 145681 cctttacccg ggcggcggcg gattccggca tcggttcgta gcgggcaaac gaacgggtga 145741 aggatgcggc cccatgcgcc agcgagcgca aatcgattgc gtagcgggtc agctcgacct 145801 gaggcacctc ggccttgatc accgtgcggt cgtgccccgc ggtctcggtg ccgagcactc 145861 ggccacgacg actggacagg tcgcccaaca ccgcgccgac gaaatcgtcg ggtaccagca 145921 ccgaaatctc atcgattggc tcgagcaaga tcaccttcgt cgcggccgcg gcctcccgca 145981 atgcgagcgc gccggccatt tggaaagcga aatcggaaga gtcgacgctg tgggctttgc 146041 cgtcgagcaa cgtgacccgg atatcgacca ccgggtagcc ggcgtgcact cccttatcca 146101 tctgtgcgcg gacacctttc tccacactgg ggataaactg ccgcggcacc gccccgccaa 146161 ccactttgtc gaggaactcg aacccggagc cctccggcag cggctccacc tcgatgtcgc 146221 acaccccgta ctgaccgtga ccaccggact gtttgatgtg gcggccatgg cctttcgcat 146281 tgccggcgaa ggtttcccgc agcggcaccc gcagctcgat cgtgtctacg ctgacgccgt 146341 accggttggc cagtgtatcc aggacgacgc cggcatgggc ctcgcccata caccacagca 146401 cgacctgatg ggtctcttga ttttgctcga tccgcagtgt cgggtcttcg gcggccaacc 146461 ggcccaaccc gaccgacagc ttgtcttcgt cggtcttggc atgcgccgca atggcgatcg 146521 gcagcagcgg ctcgggcatg gtccagggtt tcagcaccag gggctcggcc ttatccgaga 146581 gtgtgtcccc ggtctcggcc cggctcagct tgccgatggc gcagatgtcg cccgcgacca 146641 cggctgctgc cgggcgctgt tgcttgccca gcgggaacga caagactccg atgcgctcgt 146701 cttcgtcgtg gtcggggtgc gtgttactag ttccgccgcc gaaaaacgat gagaaatggc 146761 ccgacacatg gaccgtcgtg tcgggcctga tggttccgga gaacacccgc accaagctga 146821 cccggccgac gtaggggtcc gacgtcgtct tcaccacctc ggcgagcaac ggcgcgtcat 146881 tgtcacaggc cagctccgca tgcgggacac cctgcggggt aaagacctcc ggcagtgggt 146941 gctccatcgg agacggaaat ccgcgggtgg ctacctcaag caattccagt gtgccgaccc 147001 cggtgctgct gcacaccgga atcaccggga agaacgagcc tcgggcgacg gctttctcca 147061 gatcctggat cagcaccgac tcgtcgatcg tctcgccgcc gaggtagcgc tccatcaagg 147121 actcatcctc ggattcctcg atgattcctt cgatcaaggc gccgcgcgcc tcctcgattc 147181 gctcggtgtc cgactcggcc ggggttcgtg tcgttcgctt gccgtcggcg tactcgtaca 147241 gtgcctgcga aagcaatccg atcaggccgt caccggacgg caggtagagc ggtaagacct 147301 tgtcgccgaa ggcgtcttgt gccgcggtca gcgcttcccg gtagttcgcc cgggcgtggt 147361 ccagcttggt gatgaccacc gcgcggggca tgccgacctg gctgcattcc tgccacaggg 147421 acttggtcgg ttcgtcgacg ccctcgttgg ccgcgatcac gaacagtgcg caatcggcgg 147481 cccgcaaccc ggccacagct cacccacgaa gtcggcgtac ccaggggtgt cgacgaggtt 147541 gaccttgatg ccgtcgtaag ccagcgaggc gaccgcaagg cccaccgagc gctgttgccg 147601 gatctccgcc tcgtcgaagt cgcagaccgt ggtgccctcg gtgaccgagc ccggcctgga 147661 caacaccttg gccgccacca ggagagcctc gatgagggtg gtcttgccgc cccccgaggg 147721 ccccaccaga accacgttgc gaacgccgcc gggcccgttt gcggtgggag cggcggccgc 147781 gccctgggaa gcattcactc tgtcggccat ggctttcctc cagttctccg gggtcggttc 147841 ccgtggtgtg gccagcagga cgtagtaggc aacttttctc ccaactgccg cccagcacaa 147901 gggtcgggtc cggtgagtag taggcaatcg gagccgtcgt tgtggtcagg cgtgccagct 147961 ggcccagcgc tggactgcta ttgcgattac cgggccgttc aacggaaccg attggtattg 148021 agcatatttg gcgcgcagca ggcggtatgc ggcgcgcatc acctcgccat cgcgatgaat 148081 cgcggcgacc ccgtcggccc ggacccacca caactgggtc caatcatcgg catagctgtc 148141 gacgagcacg ctggcccgtg gattgtgctc gagattggcg agccggcgca gccgctgcgt 148201 cgttttccgc ttcgcgtcga cggcggtgta gataacgtct gcaccggtcg cctcggccgg 148261 gcgcctagcg ccgagcgcga atacgaccgg caccaggtgg ggtgtgccgt cgggcgtgct 148321 ggtggccagt cgtgcgacgg gggactgggc aaacctgagc tttgggtcga attcccccac 148381 ggcgccagct tatgctcagc tgccgcccaa cgtcgcgcag tctggacggc cagacgtcgc 148441 ggccgtgaca gcggacatct cgggcagccc ggtccatggg gcgtgcgtgc taatggtgcc 148501 ggtggtaatc cagtggcgcg caaggtaatt ggccgggtcg gtctcggccg ccgcaggaat 148561 cggttgggtc ggtttgaacg tgacagagac gaacagagac cagtgctatc gcgtcgaacg 148621 gacgaccgtt gacgctttga cacatcccga gtatcgagta catactcgag gcgtgcagcg 148681 ggtcagggtc acgaggaacg cccggaagca ccgcgtgtcc aagcaccgca tcgtcgccgc 148741 tatgcgccac tgcggtgttc cggtcattca ggaagatggc tcgctgtact accagggccg 148801 cgatacgtcg ggccgtctta ccgaggtcgt cgccgtcgaa gccgacgacg gtgacctgat 148861 catcactcac gcaatgccga aggagtggaa gcgatgacga agaagccacg taaccccgcc 148921 gactacgtga tcggcgacga tgtcgaggtg tctgacgtcg atctcaagca agaggaggtc 148981 tatgtcgatg gcgagcggct aacggacgag cgcgtcgagc agatggcttc agagtcgctg 149041 cggctggcgc gcgaacgaga agccaacctg attcctggcg gcaagtctct gtccggcggc 149101 tctgcgcact cgccggctgt gcaggtggtc gtttcgaagg ctacccacgc caagctcaag 149161 gagctggcgc gcagccggaa gatgagcgta tctaagctgc tgcgtcccgt gctcgacgag 149221 ttcgtacagc gagaaacggg tcggattctc ccacggcgtt agcttgtgct cagccgccgc 149281 tcgacgtcgc gaagtctgga cagtcagctg tcgcagccgt gaccagcgga catctcgggc 149341 agctagcccg acagggtgcg cgtgcacctg gcccgggtgg taatccattg acgcgcacgg 149401 caattggccg gctcggtctc ggtctgcgga taccgcactg aagggcgaca attttggcga 149461 aaaggccgtg tgcggtgccg ggtcgcgcta cgttcagatt cacctaacaa tgtcgtccgc 149521 caacgagcgt gttcgccggt ggtggggcgg gcgggttggg gaggtgtgtg atgtcgtttg 149581 tcagcgtagc cccggagatt gtggtggccg cggcaacaga cctggcgggt atcggatcgg 149641 cgatcagcgc ggccaatgcc gccgcggctg cgccgaccac cgccgtgctg gccgcgggtg 149701 ccgatgaggt gtcggcggcg atcgcggcgc tgttttccgg ccacgctcag gcctatcagg 149761 cgctcagcgc ccaggcggcg gcgtttcatc agcagttcgt gcagacgctt gccggtggcg 149821 ctggagcata tgcggccgcc gaggcccagg tcgagcagca gctgctggcc gcgatcaacg 149881 cgcccaccca ggcgctgctg gggcgcccct tgatcggcaa cggtgccgat ggggcgccgg 149941 ggactgggca ggccggcggg gctgggggga tcttgtacgg caatgggggc aatggcggct 150001 ccggggcggc tgggcaggcc gggggtgccg gcgggccggc cgggctgatc ggccatggcg 150061 ggtccggcgg ggccggcggg cacggcggat ggctgtgggg caacggcggc gtcggcggat 150121 ccggcggggc gggtgtcggc gcaggcgtgg ctggcggtca cggcggtgcg ggcggtgccg 150181 ccgggctgtg gggcgccggc ggcggcggtg gcaatggcgg gaacggcgcc gatgccaaca 150241 tcgtcagcgg tggagacggt ggcctcggcg gtgccggtgg cggtggcgga tggctctacg 150301 gcgacggcgg ggccggcgga cacggcggac aaggcgcaat cggcctcggc ggcggcgccg 150361 gcggcgacgg gggccagggc ggcgccggcc gcggactgtg gggtactggc ggcgccggcg 150421 gacacggcgg gcaaggcggt ggtaccgggg gcccaccgct gcccggtcag gcaggcatgg 150481 gcgccgcggg tggcgccggt gggctgatcg gcaacggcgg ggccggcggc gacggcggtg 150541 tcggcgcgtc cggcggggtc gccggagtag gcggtgccgg cgggaacgcc atgctgatcg 150601 ggcacggcgg cgccggcggc gccggcggag acagcagttt cgctaatggc gcggccggcg 150661 gcgcgggcgg tgccggaggg cacctcttcg gcaatggcgg gtccggcggc cacggcggag 150721 ccgtcacggc cggcaacacc ggtatcggtg gcgccggcgg cgtcggtggg gacgccaggc 150781 tgatcggcca cggtggcgcc ggcggtgccg gcggggaccg cgccggagcc ttggttggcc 150841 gtgacggcgg gcccggtggg aacgggggcg ctggcggcca gctatacggc aacggcggcg 150901 acggcgggcc cggcggtcag ggcggtcagg cgttcggcgc taacaatatc ggcggcaccg 150961 gcggggccgg cggcaacggc gggccggcca tcctgagtgg caatggtggc aatggcggcg 151021 ccggcggcgc tggcggcgcc ggcggcgcag gtggtggggc cggtggggtc ggcggtgccg 151081 gcggcgcccc cggcaccggc ggaacactgc aggcggcggt gagcggattg gtgacggctt 151141 tgttcggtgc acccggccaa cccggcgaca ccggccaacc cggctagccc cgatcaacga 151201 gggtttcggt gccggtccgg ggcatggcca tccgctgagc tggcgatctg gactacgttg 151261 gtgtagaaaa atcctgccgc ccggaccctt aaggctggga caatttctga tagctacccc 151321 gacacaggag gttacgggat gagcaattcg cgccgccgct cactcaggtg gtcatggttg 151381 ctgagcgtgc tggctgccgt cgggctgggc ctggccacgg cgccggccca ggcggccccg 151441 ccggccttgt cgcaggaccg gttcgccgac ttccccgcgc tgcccctcga cccgtccgcg 151501 atggtcgccc aagtggggcc acaggtggtc aacatcaaca ccaaactggg ctacaacaac 151561 gccgtgggcg ccgggaccgg catcgtcatc gatcccaacg gtgtcgtgct gaccaacaac 151621 cacgtgatcg cgggcgccac cgacatcaat gcgttcagcg tcggctccgg ccaaacctac 151681 ggcgtcgatg tggtcgggta tgaccgcacc caggatgtcg cggtgctgca gctgcgcggt 151741 gccggtggcc tgccgtcggc ggcgatcggt ggcggcgtcg cggttggtga gcccgtcgtc 151801 gcgatgggca acagcggtgg gcagggcgga acgccccgtg cggtgcctgg cagggtggtc 151861 gcgctcggcc aaaccgtgca ggcgtcggat tcgctgaccg gtgccgaaga gacattgaac 151921 gggttgatcc agttcgatgc cgcgatccag cccggtgatt cgggcgggcc cgtcgtcaac 151981 ggcctaggac aggtggtcgg tatgaacacg gccgcgtccg ataacttcca gctgtcccag 152041 ggtgggcagg gattcgccat tccgatcggg caggcgatgg cgatcgcggg ccagatccga 152101 tcgggtgggg ggtcacccac cgttcatatc gggcctaccg ccttcctcgg cttgggtgtt 152161 gtcgacaaca acggcaacgg cgcacgagtc caacgcgtgg tcgggagcgc tccggcggca 152221 agtctcggca tctccaccgg cgacgtgatc accgcggtcg acggcgctcc gatcaactcg 152281 gccaccgcga tggcggacgc gcttaacggg catcatcccg gtgacgtcat ctcggtgacc 152341 tggcaaacca agtcgggcgg cacgcgtaca gggaacgtga cattggccga gggacccccg 152401 gcctgatttc gtcgcggata ccacccgccg gccggccaat tggattggcg ccagccgtga 152461 ttgccgcgtg agcccccgag ttccgtctcc cgtgcgcgtg gcatcgtgga agcaatgaac 152521 gaggcagaac acagcgtcga gcaccctccc gtgcagggca gtcacgtcga aggcggtgtg 152581 gtcgagcatc cggatgccaa ggacttcggc agcgccgccg ccctgcccgc cgatccgacc 152641 tggtttaagc acgccgtctt ctacgaggtg ctggtccggg cgttcttcga cgccagcgcg 152701 gacggttccg gcgatctgcg tggactcatc gatcgcctcg actacctgca gtggcttggc 152761 atcgactgca tctggttgcc gccgttctac gactcgccgc tgcgcgacgg cggttacgac 152821 attcgcgact tctacaaggt gctgcccgaa ttcggcaccg tcgacgattt cgtcgccctg 152881 gtcgacgccg ctcaccggcg aggtatccgc atcatcaccg acctggtgat gaatcacacc 152941 tcggagtcgc acccctggtt tcaggagtcc cgccgcgacc cagacggacc gtacggtgac 153001 tattacgtgt ggagcgacac cagcgagcgc tacaccgacg cccggatcat cttcgtcgac 153061 accgaagagt cgaactggtc attcgatcct gtccgccgac agttctactg gcaccgattc 153121 ttctcccacc aaccggatct gaactacgac aaccccgccg tgcaagaggc gatgatcgac 153181 gtcatccgct tttggctcgg cttgggcatc gacgggtttc ggttggacgc ggtgccctat 153241 ctctttgaac gtgagggcac caactgcgag aacctgccgg aaacacacgc ttttctcaag 153301 cgagtccgca aggtggtgga cgacgaattc cccggccggg tgctgctagc cgaagccaat 153361 cagtggccgg gcgatgtcgt cgaatatttc ggtgatccca acaccggtgg cgacgagtgc 153421 cacatggcct ttcacttccc gctgatgccg cgcatcttca tggccgtgcg ccgggagtcc 153481 cgttttccga tctcggagat catcgcccag accccaccaa tccctgacat ggcgcaatgg 153541 gggatatttc tgcgcaacca cgacgagctg acgttagaaa tggtcaccga cgaagagcgc 153601 gactacatgt acgccgagta cgccaaggat ccacggatga aggcgaatgt cggaatccgt 153661 cgtcggcttg cgccgctgct cgacaacgac cgcaaccaga tcgagctgtt caccgcgctg 153721 ctgctgtcgc tgcccggctc gccggtcctc tactacggcg acgagatcgg gatgggcgac 153781 gtgatctggt tgggtgatcg cgacggcgtg cgcatcccga tgcagtggac accggaccgc 153841 aacgcgggtt tctccaccgc caacccgggt cggctgtacc tgccgcccag ccaggacccg 153901 gtttacgggt atcaggccgt caacgtcgag gcgcaacgcg acacctcgac gtcgctgctc 153961 aacttcactc gcaccatgct ggccgtgcgt cgccgacacc ccgcgtttgc ggtcggcgca 154021 ttccaggaat tgggcgggtc caacccgtcg gtgctggcct acgtgcgtca ggtggccggc 154081 gatgacggcg acaccgtgct ctgtgtcaac aacctgtcgc gattcccgca gcccatcgaa 154141 ttggacttgc agcaatggac caactacacg ccggtcgagc tgaccgggca cgtggagttt 154201 ccacgcatcg gccaggtgcc ctatctgctg acgctgccag gacacgggtt ctactggttc 154261 cagttgacca cacatgaggt gggggcacct cccacttgcg ggggagagcg gcgcctatga 154321 ctcgcgccgg cgacgatgca cagcgaagcg atgaggagga gcggcgccta tgactcgcgc 154381 cagcgacgat gcacagcgaa gcgatgagga ggagcggcgc ctatgactcg gtcggacacg 154441 ctggcaacca agctgccatg gtccgattgg cttccgcggc aacgttggta tgccggacgc 154501 aaccgcgagc tggccacggt caagccgggc gtagtcgtcg ccctgcgaca caacctcgac 154561 ctagtcctgg tcgacgtaac ctacaccgac ggtgcaacgg agcgttacca ggtgctcgtc 154621 ggatgggatt ttgagccggc gtccgagtac ggcacgaaag ccgccatcgg cgtcgccgac 154681 gatcgcacgg gattcgatgc tctctacgac gtcgccgggc cgcaattcct cctgtcgcta 154741 atcgtctcgt ccgccgtctg tggcacatcc accggcgaag taacgttcac cagggagcca 154801 gacgtcgagc tgccctttgc cgcgcagccg cgggtatgtg acgccgaaca gagcaacacc 154861 agtgtgatct tcgatcggcg ggctatcctc aaggtgttcc gccgggtaag cagcgggatc 154921 aaccccgaca tagagctgaa ccgcgtgctt acccgtgccg gtaatccaca tgtggcccgc 154981 ctgctgggcg cttaccagtt tgggcggccc aatcgttcgc caaccgatgc tctggcgtac 155041 gccctgggca tggtgaccga gtatgaggcg aacgcggccg aaggctgggc gatggccacc 155101 gccagcgtgc gggacctctt cgccgaggga gacctctatg cccacgaagt cggcggcgat 155161 ttcgccggtg aatcctaccg gctcggcgag gcggtcgcct cggtgcacgc cacgctggct 155221 gacagcctcg gaaccgcgca ggcaacgttc ccggtggacc ggatgctggc gcggctgtcg 155281 tcgacggtgg cggtggtgcc cgaactgcgg gagtacgcgc caacgatcga acagcaattc 155341 cagaagctcg cggcggaggc aatcacggtc cagcgggtgc acggtgacct gcacttggga 155401 caggtgctgc gtaccccgga aagctggctg ttgatcgact ttgaaggcga gccgggccag 155461 ccgctggacg aacggcgagc gccggattcg ccgctgcgcg acgtggccgg tgtgttgcga 155521 tcgttcgagt acgccgctta cgggccgctg gtggaccagg ccaccgacaa acaacttgcc 155581 gctcgcgccc gcgaatgggt cgagcgcaat cgcgccgcct tctgcgacgg ctacgcggtc 155641 gcgtcgggaa tcgacccgcg agattcggcg ctgctgttgg gcgcctacga actcgacaag 155701 gcggtttatg agaccggcta tgagacacgg caccggccgg gttggcttcc gattccgctg 155761 cgttcgatcg cccgcctgac cgctagctga taccggccgg ggtgtccggc ttattgcttg 155821 gcgtgcgtgc gtcctgggcg tctggaagca tgctcgtgtg caacgagaga tttatgacgg 155881 tgaggcgcgg ctgtcatggg tgttggcggc gctggccggg atactggggg caaccgcgtt 155941 cacccactcc gcgggatact tcgttacttt catgaccggc aattcgcagc gcgcggtgct 156001 gggattgttt ggggacgacg cgtggatgtc tgtcaccgcg tcgttgctga ttctattctt 156061 cgtcgccggc gtggtgattg cgtcggtgtg ccggcggcat ttctgggcgg cgcatcccca 156121 cggcccgacc gtgctgacca ccttcagttt gatatttgcc gccggagtcg acattatgct 156181 gggcggctgg cacgagagca tgctcgattt tgtgccgatt ctgttcgtgg tcttcgggat 156241 tggcgccttg aacacatcgt tcgtcaagga tggcgaggta tcggttccgt tgagctatgt 156301 gaccggcaca ttggtcaaga tgggccaggg catcgaacgt cacctggccg gcggaaaagt 156361 ggaggactgg ctcggctact tcctgctgca cgccagcttc gtgctaggcg ccgcggccgg 156421 tggcgccatt agtatggtcg tcaccggacc ccagatgctc gcggtcgcgg cggtagtgtg 156481 cgctgcgaca accggctata cctacctgca cgctgaccgg cgagggttgg tcaatcaaaa 156541 gcggccccag ccgggaaagc ggctctttcg agcgctcagg cgaggcgaat tagattcggg 156601 aacctccacg cccgcaacca attacgggtc gagttagctt ggcttccagt ggcgctggcg 156661 aaggggtgac cacgccaact tcacccggaa ggtccgaccc agtgcggatg ttccacacat 156721 cggcagcagc gctggccgtt gcgctgctgc cgatgctggc ttgctggctc aggcggccgg 156781 cgcagcaggg gcggccgggg gtgtcgcgcc gttgagcaca tgctggatat cggccttcat 156841 ggcgaccagc tgctcgttcc agtagggcca cgagtgtgtt ccgttgggcg ggaagttaaa 156901 caccccgttg cgtccaccgt cggccgcgta ggtgtcccgg aaggtctggt tggtgcgcag 156961 ggtgaggcct tccaggaact tcgccggtat gttgtcgccg ccgaggtcgc tgggtgtgcc 157021 gttaccgcag tacacccaga tccgggtgtt gttggcgacc aggcggggaa tctgaaccat 157081 tgggtcgttg cgcttccagg ccgggtcgct ggacggaccc cacatgctgt tggcgttgta 157141 accgcccgag tcgttcatcg ccaggccgat cagcgtcggc caccagccct cggacgggtt 157201 gaggaagccc gacaacgacg cggcgtacgg gaactgctgc gggtagtacg cggccaggat 157261 cagcgcggaa ccgcccgaca tcgaaagacc caccgccgcg ttgcctgtcg gggacacgcc 157321 cttgttggcc tgtagccagg cgggcatctc tctggtaagg aaggtctccc acttgtaggt 157381 gtagttctgg ccgttgctct gcgagggctg ataccagtcg gtgtagaaac tggattggcc 157441 gcccacgggc atgatcaccg acaaccctga ctggtagtac tcctcgaagg ccggggtgtt 157501 gatgtcccag ccgttgtagt catcctgggc ccgcagaccg tcgagcaggt agaccgcgtg 157561 cggtccgccg ccctggaact ggaccttgat gtcgcggccc atcgacgcgg atggcacctg 157621 cagatattcc actggaagac cgggcctaga gaatgcgccc gcggtggccg gcccgccgaa 157681 ggtaccgacc agaccgtaaa ccaggacagc ccccatagcc gcgatagcca gccggcgcgg 157741 cagggttgtc gctgcgctcc gcaaccttcg cacctgttcg aagaacgtca tagctactac 157801 caatcccaac tctcatctgc cgcacgacgc ggtcgaatct gttctgggcg agtgaaacac 157861 accgaggacg ctcagttcga atgtcgtggc cgcagcgcga gatcgcggtt ggctaacgat 157921 tcagcgtcgg cccggacacc ttgggcgatt gacacacccg ggtcacggct ggctcccgag 157981 cggcgcaacg accgcacgca caacccctat gcttactgcc gaccagagga gagacccatg 158041 cgcaccttcg agtcggtcgc cgacctggcc gccgccgcgg gcgaaaaggt cgggcagagc 158101 gactgggtga ccatcaccca ggaagaggtc aatctgttcg ccgacgcaac gggtgatcac 158161 cagtggatcc acgtcgaccc ggaacgggcc gctgcgggtc cctttggcac caccatcgcg 158221 cacggattca tgaccctggc gttgctcccg cgcctgcaac accagatgta caccgtcaag 158281 ggcgtcaagc tggcaatcaa ctacggcctg aacaaggttc gcttcccggc accagtaccc 158341 gtcggctcgc gggtgcgtgc gacgagctcg ctggtcggtg tcgaggatct gggcaacggc 158401 accgtgcagg cgacggtgtc gacgaccgtc gaggtcgagg gatcggccaa gccggcgtgt 158461 gtggccgaaa gcatcgtgcg ctacgtcgcc tgaggcaact cgcggtcaga attcggcgat 158521 cgcgtgctcg aggcgttggg ccagccaggc ctcggcgtgc gcgcgccggg tcggaatgtg 158581 ctgtgacggg aaaagcgttg tcaccggctg gtattcgcgc agcgtacggc gggcgacggt 158641 catcttgtgc agctcagtgg cgccgtcggc gatgcccagt gactcggctg ccagcatcat 158701 cttgacgaac ggcatctcgt cggagacccc gagcgcgccg tgcaggtgca tggcccgctg 158761 cacgacgtca tgcagcacct ggggcatcgc caccttgacc gccgcgatgt cgcggcgcac 158821 cttttgatag tcgtggtgtt tgtcgataag ccacgcggtg cgcagtacca gcagccggaa 158881 ctgctcgatc tggatccaac tatcggcgat cttctcctgg gtcatctgca gatcggcgag 158941 ccgcccgtgt ctagtctggc gcgacagggc acgctcgcac atcatgtcga atgctctgcg 159001 cgccagcgcg attgtccgca tcgcgtgatg tattcggccg ccgcccaatc gggtctgcgc 159061 gatcatgaac gcttggccct cgccgccgag cacatgatcg gccggcaccc ggacgtcgtg 159121 gtagcggatg tagccgtggc tggcgtgccg ggtggactcg gctcccacac cgacgttgcg 159181 cacgatctcg atgcccgggg tgtcggccgg gacgatgaac agcgacatct tctcgtacgt 159241 acgggcttcc ggcttggtga cggccatgac gataaagaac gacgcatgct tggcgttggt 159301 ggaaaaccac ttctcgccgt tgatgatcca gtccccgttt cccgcggcat cgcgggtcgc 159361 cgcggtcaca aacagcccgg gatcggaacc accctgcggc tcggtcatcg aatagcagga 159421 ggtgatctcg ccgtcgagca gcggtcgtag atagcgggct ttctgctcgt cggtgccgaa 159481 cagcgccagg atctcggcgt tgccggagtc cggcgcctga cagccgaacg ccgacggcgc 159541 ccaccgggag cggccgatga tctcgttgag cagcgccagc ttgacctgac cgaagccctg 159601 tccgccgagt tcgggacgca aatgcgcggc ccacaacccc tggtctttca cctgccgctg 159661 cagcggccgc aggatcgcca tcgtgtcggc gttctttttg tcgtaaggat cgagggcgac 159721 cagatcgagc ggttcgagtt cctcggccat gaatttttcg acccaatcca gcttggactg 159781 gtattgcggg tcggtttcga agtcccacac cgtcggcaac cgttccccgg cgcggcgtcg 159841 caccggcatc gttgatagag caagaccatc gtaggtgcgg tctagcggct tcagcgcagt 159901 tcgggcagga cgttggtgcg gtagaagtcg atggcggtga tcgggtcgtc ctgggggaaa 159961 tgcaggaagg ggacggcgcc ggcgtcgaga accgcttgca ccgcaccgat gtggacgccg 160021 ggatcggtac cgaccgccca attggccagc actttctcga tcgggttcga ctcggcggca 160081 cgctggatct cgaccggatt gggctggtcg acggccccgg cggtgaatcg ccacaagtcg 160141 gcggcgcggg cggccgcctt gtcgtcgccg acgacggcga acagttcggc ccgcttaccc 160201 agggtggtgg gatctcgtcc ggccgcttga gcgcccgcgg cgaacgcggc gagcagcttg 160261 gcgtcgttga tgtcgcgggc ttgggcgatc caaccatcac cgtatcggcc ggccagggtg 160321 gcgctctggg ggccgctcgc ggcgacaaag atcggcggcg gcatcgccgg cgtgtcgtag 160381 agcttgagct cgtcggtccg aaaatagtgg cccgtgaacg agatccgctc accgctccac 160441 agctggcgga tcagtacgat ggcctcgatc agccggtcgt ggcgctcgcg gtagttgccg 160501 aacgtgtcgg tggcggcttg ttcgttgagc cgcttgccgg tgcccagccc cagaaacacc 160561 cgtccggggt tcaggatcgc cagcgaggca aacgcctgag cgacggtggc cggatggtag 160621 cggtatatgg gacaggtcac cccggtgccg aacaagatgc tgctggtgct gttgcccacc 160681 aacgccaggg tcagccaggg aaacatcgaa tggccctcgt tgtcttgcca tggctgtagg 160741 tggtcgctgg cccacacata ccggaagcca gcttgctcgg cggcttgggc gtgcgccacc 160801 agccgatcgg tgcggaattg ttcgtgggat aagacgacac ccaccccgcg gcttgccggc 160861 tctggggtcg gcgtcggacc gctgcgcgtg ctgcaaccgc cgcctagccc accggcgccg 160921 atcgcgccga acccggcggc cagaccgaac gtccgccgtg agatgccggt catcgggctg 160981 cactacccgc gtcgcgctgc agcacacctt cgagagtgca tcctgactca ccgtcggcgc 161041 caccggttag cctggcgaga tgaccccgca ggcacgccca gcgcgcaggg ccgatgtccg 161101 cgagctgtcc cgcaccatgg cccgggcgtt ctatgacgat ccgttcatga gctggttact 161161 gtcgaacgac aacgcccgca ccgcaaggct gacccggttg ttcgcgacga ttgtccgcca 161221 ccagcatctg gccggcggtg gtgtggaagt ggcccgcggc gcggcgggca tcggcggggc 161281 ggcgctgtgg gatcccccgg atcgatggcg ggagtcgcgc cgccagcaac tggcgatgac 161341 accggggttc ctgcgggtgt tcggctttcg gacggccaag gcccgcgcgg cgctggacgt 161401 gatgatgcgt gtgcatcccg aagaacccca ctggtatctg gccgccatcg gcagcgaccc 161461 gacggtccgc ggccaggggt tcggtcaggt gctgatgcgg tcacggctgg accgttgcga 161521 tgccgaacac tgtccggcct acctcgaatc caccaaaccc gagaatgtgc cctactatca 161581 acggttcggt ttccgggtga cccgtgagat cgctctgccc gacgcggggc cgccgctatg 161641 ggcgatgtgg cgggagcctc ggtagcggtt cttggcagct ggatcgttcg tccggccggg 161701 tgatcactgc gcgaccgtga atctggcgac gccgcaccgg cgtgtcgcgt cgccagactc 161761 acagtcgcgg caatctctga ccgccggtgc gctgagatag ctcccgaggt gcaaaagtgg 161821 tgcgcagatc gtcaggctga gcttgccggg atcgcgtggg tcggcacccg cagccgtcgt 161881 ctgccaccca atagtgtgtg cgacccgccc ggtacacgcg gaatcaacgg gtatgcggtt 161941 ctggcatagg cttgtcaggc aatgatcgct ctgcccgcct tggaaggtgt cgaacatcgg 162001 cacgtggatg tggcggaagg cgtcaggatc cacgttgcgg acgccgggcc ggccgatggt 162061 ccggcggtaa tgctggtgca cggcttcccg cagaactggt gggagtggcg cgacctcatc 162121 ggcccgctgg ccgccgacgg caaccgggtg ctgtgtcccg acctgcgcgg cgcgggctgg 162181 agttcggcgc cccgctcgcg gtataccaag accgagatgg ctgacgatct ggctgcggtt 162241 ttggacggcc tgggtgtggc caaggtcaag ctggtggccc acgattgggg tgggccggtc 162301 gcgttcatca tgatgttgcg ccatcccgag aaggtgaccg ggttttcggc gtgaacaccg 162361 tggcaccctg ggtgaagcgc gatcttggca tgctccgcaa tatgtggcgg ttctggtatc 162421 agatccccat gtcgctgccg gtgatcggcc ccgcgggtga tcagcgatcc taagggccgc 162481 tacttccggc tgttgaccag gtgggtcggg ggcggatttc gggttcccgg tgacgacgtg 162541 cgcctgtact tggactgcat gcgcgagccg gggcacgccg aggccggatc gcggtggtat 162601 cgcacctttc agaccaggga aatgctgcgc tggctgcgcg gcgagtacaa cgacgctcgg 162661 gtcgatgtcc cggtccgatg gctgcacggc accgagatcc ggtgatcacg cccgacctgc 162721 tggacggcta tgccgagcgg gccagcgatt tcgaggtgga gctggtcgac ggcgtgggcc 162781 attggatcgt cgagcagcga cccgagctgg tgctcgaccg ggtgcgtgcg ttcctagctg 162841 cggggaccga gcagcgcgat tgacgcatcc accgccggct cgacgatgtt ccggatcggc 162901 tggccgtcct cgacggtcag cgcggtcagt tcacgcaaac cgcccagcaa gattacggcc 162961 agtggcacat tcagtggcgg taggttagcc cgccggaacc cagggctggc gctgagctcg 163021 atcagcaggc tggttagctg ctccatgccg cggcgctgga cggggtaagc ggcggcaccg 163081 agcgacggga attcacggat ccaactcaac gtcaccgccg gcctggattc gatatgggtg 163141 acgtaggcct cgaccgcctg acgaatctgg tcgtgccagt cggcgtttgg atcgacggcc 163201 gcccggatgc tgttgcccaa cgtctcgttg tccgctagca ggagttccaa aaagcactgt 163261 tccttgctgg tgaaccggtc gtagaacgtg cgcttggatg tgcgggcgtg ccggacgatg 163321 tcggagacgg tggtggcgcg ataaccccgc tcaccgatcg aggcgaccag gccgtcgagc 163381 aaccgtagcc gaaacgagtc ggtctcgacg accaacgcgc cggcggcgac tgctgtcacc 163441 cgcgcctcct ctacctatcc cttgtcaggt ttggtaccaa agagtaccgt actggacaag 163501 ccacggtaca ccaccgtacc acgcccgatc cagggacgtt aggagcaaca ccgccatgag 163561 cgaagtcgtc accgccgcac cggcaccgcc cgtagtccga cttcccccgg cggtccgcgg 163621 gccgaagttg ttccagggat tggccttcgt ggtgtcacgg cgacggctgc tggggcggtt 163681 cgtgcgtcgc tacggcaagg ccttcaccgc caatatcctg atgtacggcc gggtcgtggt 163741 ggtcgccgac ccgcagctag ccaggcaggt cttcaccagc agtcctgagg agctgggcaa 163801 catccagccc aacctgagtc ggatgttcgg ttccggctcg gtgttcgcgc tggacggcga 163861 cgaccaccgg cggcggcgcc ggctactggc gccgcctttc cacggcaaga gcatgaagaa 163921 ctacgagacc atcatcgaag aggagaccct gcgcgagacc gccaattggc cgcaaggaca 163981 ggctttcgca acgctgccgt caatgatgca tatcacgctc aacgccatcc tgcgtgcgat 164041 cttcggggcc ggcggcagtg aactagacga gctgcgccgc ctcattccgc cgtgggtcac 164101 gctgggctcg cgcctggcgg cgctaccgaa acccaaacgc gactatggcc gccttagccc 164161 gtggggccgg ctggccgagt ggcggcgcca gtacgacact gtcatcgaca agctcatcga 164221 agccgagcgg gccgacccga acttcgccga tcggaccgac gtattggcgt tgatgctgcg 164281 cagcacttac gacgacggtt ccatcatgtc gcgcaaggac attggcgacg agctgctcac 164341 gctgctggcc gccgggcacg aaaccacggc ggcgacactg ggctgggcgt tcgagcggct 164401 cagccggcac cccgacgtgc tcgcggctct ggtcgaggag gtcgacaacg gcggtcacga 164461 gctgcgtcaa gcggcgatcc tggaggtaca gcgggccagg accgtcatcg attttgcggc 164521 tcgtcgcgtc aatccacccg tttaccagct cggcgagtgg gtgattcccc gcgggtattc 164581 gatcattatc aatatcgccc agatacatgg cgatcccgac gtcttcccgc agccggatcg 164641 cttcgacccg cagcgctaca tcggaagtaa gccatccccg tttgcgtgga tcccttttgg 164701 tggcgggacc cgccgctgtg tcggggccgc attcgccaac atggagatgg atgtggtgct 164761 gcgaacggtg ctgcgccact tcaccctcga gaccaccacg gccgcgggcg agcgcagcca 164821 cggtcgagga gttgcattca ccccgaagga tggcggtcgg gtggtgatgc gccgacgctg 164881 acggccagct cgggcccgcg ttcaggtccc gagttcgggt gaaaggctgg cccgcagtgc 164941 agattcggcg gtccgtcggg gtagcctcca gccgggccgg acgaagtggc acgtgtaccc 165001 gttggggtag cgctgcaggt agtcctggtg ctcgggttcg gcttcccaga aatccccggc 165061 cgggctgacc tcggtcacca ccttgccggg ccacaggccg gatgcctcga catcggcgat 165121 ggtgtccagc gcgatccgct tttgctgctc atcgaagtag aagatggccg accggtagct 165181 ggtcccccgg tcgttacctt gccggtcttt ggttgtcggg tcgtggatct ggaagaagaa 165241 ttccagcagg gtgcggtaat cggtgaccgt ggggtcgaag atgatttcga cggcttcggc 165301 gtgcgtgccg tggttacggt aggttgcgtt ggggatgttc ccgccgctgt agcccacccg 165361 cgtggagacc acaccgggct ggttgcggat cagatcctgc agcccccaaa agcagccgcc 165421 ggcgaggatc gctttctgat tgctcgtcat ttccggacct cccgatcagg ctacactccg 165481 gcgatggagt gtaacggcgc gaagaccgca ctgtgagcgc ttcggagttc tcccgtgctg 165541 aactcgccgc cgccttcgag aagttcgaga agaccgtggc ccgcgccgcc gcgacgcgcg 165601 actgggattg ctgggtgcag cactacaccc ccgacgtcga atacatcgag cacgcggcgg 165661 gcatcatgcg aggccgccag cgggtacgtg cctggattca agaaacgatg acgaccttcc 165721 cgggcagtca catggtggcc ttcccgtcgc tgtggtcggt gatcgacgag tccaccgggc 165781 gaattatctg cgaattgggc aaccccatgc tcgaccccgg cgacggcagc gtgatcagcg 165841 cgacgaacat ttcgatcatc acctatgccg gcaatggcca gtggtgccgt caagaagaca 165901 tctacaaccc gttgcggttc ctgcgggcgg cgatgaagtg gtgtcgcaag gcgcaggagt 165961 tgggcaccct cgacgaggac gcggcgcgtt ggatgcgccg gcatggaggt ccttaaatga 166021 acgcacccaa gctggtcatt ggcgcgaacg gcttcctggg ttcgcacgtg actcgccagc 166081 tcgtcgccga ctgcgcgccg cagaaaggtg aggtacgcgc gatggtgcga cccgctgcca 166141 acacccggag catcgacgat ctaccgctca cccgattcca cggcgacgtc ttcgacaccg 166201 ccaccgtggc cgaggcgatg gccggctgcg acgacgtcta ctactgtgtg gtcgacaccc 166261 gcgcctggtt gcgcgatccc tccccgctgt ttcgcaccaa tgtggcaggc ctgcgcaacg 166321 tcctcgatgt ggccacagac gccagcctgc gcaggttcgt cttcaccagc agttatgcga 166381 cggtgggtcg tcggcgtgga cacgtggcga ccgaagaaga ccgggtggat acccgcaagg 166441 tgactcctta cgtgcggtcc cgggtggcgg ccgaggatct ggtgctgcaa tacgcgcacg 166501 acgcaggtct gcccgccgtc gcgatgtgtg tgtcgacaac ctacggcggc ggcgactggg 166561 gccgcacccc acacggcgcc ttcatcgcgg gcgcggtgtt cggcaggctg cctttcacga 166621 tgcgcggcat ccggctggag gcggtgggtg tcgacgatgc tgcgagggcg ctgatcttgg 166681 cggccgaacg cgggcacaac ggcgaacggt acctcatctc cgaacgcatg atgccgttgc 166741 aagaagtggt gcggatcgcc gcggatgagg ccggtgtccc gccgccacga tggtcgatct 166801 cggtgccggt gctttacgcc ctgggtgcgt tgggcagttt gcgagcccga ctcacgggca 166861 aagataccga actcagcctg gcgtcggtgc gcatgatgcg ttccgaggcc gatgtcgacc 166921 acggcaaggc cgtccgcgag ttgggttggc agccacgtcc ggtggaggag tcgatccggg 166981 aggccgcccg gttctgggcg gcgatgcgca ccgtcgggaa ggaccccgcg gcctcgtgat 167041 ccgaaaaggc ctagggacgc tgccgggaat gttgatcgcc ggcacgtgtt gcacaggtca 167101 tgagcaaccg gattgtgtta gaacccagcg ccgatcaccc gatcaccatc gagccgacca 167161 accgacgggt gcaggtacgc gtcaatggcg aggtggtcgc ggacacggcc gcggcgctgt 167221 gcttgcagga agccagttac cctgcagtgc aatatattcc gttggccgac gtggtacagg 167281 ataggctgat ccgcaccgag accagcacct attgcccgtt caagggtgaa gccagctatt 167341 acagcgtgac taccgacgcc ggcgacatcg tcgacgacgt gatgtggacg tacgaaaacc 167401 cttatccggc ggtagcggcg atcgcggggc atgtcgcgtg ctatccggac aaagccgaaa 167461 tcagcatctt cccggggtag cgcaggctac cgggtatacc tcggccaacg actgggtgtc 167521 gctgtattcg cgcagcgaga tgatcatccc gtcacgggtc tcgaagatgc agacgaacgg 167581 gctgtcatat cgggtccggt cggcgctcac accgtcgcaa tgcccctcga ccactaccgt 167641 ttcaccctcg ttgacgcagc ggatgagttc gatgttgacc tcgaagacct gcttgcgccg 167701 ctcgactgct cgccgaaacg tcttcttgtc caattccgta cgggtgacga tgctccagta 167761 ggtgaagtcg ttgctgagca gcgcgaagcc ttcgtcgaga tctccgccct cgcagaggct 167821 ttgcaggaac atccaggcca gttcggcttg cgggtcgtcg aacggcgtca tcacatcgcc 167881 atcttgtctc gggagacagc gtgcggtcaa ttgacgtggt cgtcgaagcg gtggtcacct 167941 tcgcgggggc ggccggcttc gcgcacacct tggcgccgtt gcgtcgcggt cagcaggatc 168001 catgctttcg ggtccccggt gacggcacta tctggcggac cagcttgctg cccaccgggc 168061 cggtcaccgc gcggatcagc cgtgctgggc gcgacgccgc ccgttgcgtg gcgtggggca 168121 gcggtgccga ggagtttgtc gacatggcgc ccgccatgct gggcgccgcc gacgacgcca 168181 gcgatttcgt gccgctgcat ccggccgtgg ccgccgcgca ccgccggctg ccgaacttgc 168241 gcctgggccg caccggccag gtgctggaag ccttgatccc ggcggtcatc gagcagcggg 168301 tacccggcgc cgacgcgttt cggtcgtggc ggctgttggt gtccaagtac ggaacgcagg 168361 cccccggtcc ggcgccaccc ggcatgcggg tgccgccgtc ggccgaggtg tggcgtcaca 168421 tcccgtcctg ggagtttcat cgcgccaatg tcgacccggg gcgggctcgc gcggtggtgg 168481 gttgcgcgca gcgggcggcg tcgctggagc ggctggtgtc gctgcccgcg gctcgggcgg 168541 cggaggcgct gacatcgttg cctggagtcg gggtatggac cgcggccgag accacacaac 168601 gcgtgttcgg tgacgccgac gccgtgtcgg tcggcgacta ccacattccg aagatgatcg 168661 gctggacgct tgtgggccgg ccggtcgacg acgccggcat gctcgagctg ctggagccga 168721 tgcgcccgca tcgccaccgg gtggtccgct tgctcgaagc cagcggcttg gcgcgtgagc 168781 cgcgccgcgg gccccggctg ccggtacaga acatccgggc gctgtagggg agtttgacgg 168841 ggatcttgct cggtccggcg ccccgattcc cgccagatcg gctgccggcg ccgctaagcc 168901 gttgtcggcc gatcactgcc tccgcgttcg gcctcggcgg tctgccggtt cagtcgctgc 168961 gtctcgtaga tggtgacgtt ggtgcgagac aacaacagtg ccgcgatacc gacggcgatg 169021 atcgctccag gcaccaccga gaacgagccg gtcatctcag cgaccatgat catgacggcc 169081 agcggcgcgc gggagacact gccgaagcac gccatcattg cgaccacgac gaagatgccc 169141 ggctcgtggg gcaccccggg cagctcggtg agctcgccta gccgccagat cgccgctccg 169201 acgaaggcgc cgatcacgat tcccggcccg aatagcccgc ctgatccgcc ggtgccgatc 169261 gacagcgacg tcgcgaggat cttggcgatc ggcaagacga tgacgatcca caacgggatg 169321 ctcagcagcg tcccccgatc ggcggctagc tgcgcccagc catagccgct gctcaggatc 169381 tggggaatcg gcagacctaa cagcccgacc agcagtccgc cgatcgccgg tttgagcacc 169441 gggcccccgg gcagccggcg cgtaattgcc accgacgcgt gaaagactcg ggcatacaag 169501 tagcctacgg cggctgcgat cagcccgatc accacgaacc acagtagtgg ccacgccttt 169561 tcgaagcgat actcggcgtc gatgtagccg aacagcgggt cgaagcccaa gaaggcgccg 169621 agcacggcgt aggcggttcc cgaggcgatg aaacccggca gcaggttgcg gtagtcgaag 169681 tcgtcgcggt aggggatcga ggcgcccaac gccgctccgc ccagtggcgc agcgaagatg 169741 gcgccgatgc cggcgccgat acccagcgct accgcggtcc ggccgtcttc gttggacagg 169801 ttcagccggc gggtcagcag tgagcagaag ccggccgaga tctgcgcggt cgggccttcg 169861 cggccgcctg aaccgcccga gccgatggtc aaggcgctgg ccaccatctt caccagcacc 169921 gcccgacctc ggatggcgcg cggatcgccg tgcaccgact cgatcgcttc gtcggtgccg 169981 tgaccggtgg cctccggggc gagcttggcc acgatcaatg ccgacagcac cgccccgccc 170041 gtcgtcacca gcggaatcgc ccacggacgc gcgaaaccgg tggacccgcg gtggccgccc 170101 tccccaacgg gagtgggaat ctgatagtcc gcgaggtagc cgagcagaaa ctcgctggtg 170161 tatttcagcg cgaggtagaa gacgacggcg cccaggccgg caatgacacc gatcgtgatg 170221 cctagcagga accatttgcg caggtagccc gcgctcctga tcgatacgcc gagtcgtccg 170281 ccggcggcct cgttcccgat gtcttccgcc tccggcatgg tcgggaggtt agcagcatgc 170341 caagcgaaca ccgaccagtc gcccggcgcc atcccagagt tggccagcgc tatccgacga 170401 tcagcagcgc aaccatggcc caggtctgga cgtacgcgat caccgccgct gtgcggcgag 170461 gagatccgaa acggtgccgc actcttggac cccgacctct gtcatgacgc cgccgctcgt 170521 cgtggccgcg ttcaggccgg tcggccatta ccgactcgca acggacagag ccggtgggcc 170581 ctgctcgccc ccggcgaccg gagccaagct gacaagttcc gtagcatccc gcccaacggt 170641 aggtaccaag ccgcagtggt ggcacacttt agtgatgtca atgtcgctca cggccggtcg 170701 cggcccggga cgtcccccgg cggcgaaagc agatgagact cggaagcgta ttctgcacgc 170761 cgcccgtcaa gtgttcagcg aacgtggtta tgacggcgcg acttttcagg agatcgccgt 170821 ccgcgccgac ctgacccgac cggcgatcaa ccactacttc gccaacaagc gggtgctcta 170881 ccaagaggtg gtggagcaaa cccacgaact cgtcattgtg gccggcatcg aacgggcacg 170941 ccgcgagccg accttgatgg ggcggctggc ggtcgtcgtt gacttcgcga tggaggccga 171001 tgcccagtat cccgcctcga ccgcgttcct ggccaccacc gtgctcgaat cccagcggca 171061 tccagaattg agtcggaccg aaaacgatgc ggtgcgcgca acccgagaat tcctggtttg 171121 ggctgtcaat gatgcgatcg aacgcggtga actagccgcc gacgtcgatg tctcttcgtt 171181 ggccgagacg ctgttggtcg tgttgtgtgg cgtgggcttc tatatcggtt ttgtcgggag 171241 ctatcagcgg atggcgacca tcaccgattc gttccagcag ctgttggccg gcacgctctg 171301 gcggcctccg acctgaccga gacctaaccg gcggccccga agcgtagtga tgtgccacac 171361 aaatcgtata ggttacctaa cttacttagg tagcatggca tgccgtgacc gaactcgacg 171421 acgtgtcctc gttaccatcc tcgcgacgga ccgctggcga tacctgggcg atcaccgaaa 171481 gcgttggcgc caccgcgttg ggggtcgcgg cggcacgtgc cgtggaaacg gccgcgacca 171541 atccgctgat ccgtgacgag ttcgccaagg tgttggtgtc gtcggcgggt accgcctggg 171601 cacggctggc cgacgccgat ttggcctggc tcgacggtga tcagctcggc cgacgcgtgc 171661 atcgggttgc ctgcgactac caggcggtgc gcacccactt cttcgacgag tacttcggtg 171721 ccgccgtcga cgcaggtgtc cggcaggtgg tgatcctcgc tgccggactg gacgctcggg 171781 cctaccgcct gaactggccg gcgggcactg tggtttacga gatcgaccag ccttcggtgt 171841 tggagtacaa ggcggggatt cttcaatcgc atggcgcggt tccaacggcg agacggcatg 171901 ccgtcgcggt ggacctgcgc gacgactggc cggccgcgct gatagctgcc ggattcgatg 171961 gcacccaacc gactgcctgg ctagccgagg gcttgctacc ctacctgccc ggcgacgccg 172021 cggaccggct attcgacatg gtcaccgcgc tcagcgcacc gggcagccag gtcgctgtcg 172081 aggctttcac catgaacaca aagggcaaca cgcagcgctg gaatcggatg cgcgagcgac 172141 tcggtttaga catcgatgtc caggcgttga cctaccacga gcccgaccgg tcggatgccg 172201 cgcaatggct ggccacgcat ggctggcagg tgcacagcgt gagcaatcgc gaggagatgg 172261 cccgactggg ccgggcgatc ccgcaagacc tggtcgacga gaccgtccgc accacgttgc 172321 tgcgagggcg tctggtcaca cccgctcaac cggcgtgaca ccggcatcac gagaaccaga 172381 gggagcacag gatgagcgcc atgcgcaccc atgacgacac ctgggatatc aagaccagcg 172441 tcggcgccac cgcagtgatg gtggctgctg cccgggccgt cgaaaccgac cggcccgacc 172501 cgctgatccg cgatccctac gccagactgc tcgtcaccaa cgccggggcc ggcgccattt 172561 gggaagccat gctcgaccca acactggtag ccaaggcggc tgccatcgat gccgaaaccg 172621 cggccatcgt cgcctatctg cgcagctacc aagcggtgcg gaccaacttc ttcgatacct 172681 acttcgccag cgctgtcgcc gccggaatcc ggcaggtagt gattctggcg tccggactgg 172741 attcccgcgc ctatcgcctg gactggcccg ccggaaccat cgtgtatgag atcgatcaac 172801 ccaaggtgct ttcctacaag tccacgacgc tggcggaaaa cggggtaacg ccgtcggctg 172861 gtcgccgtga ggtgcccgcc gacctgcgcc aggactggcc cgccgcgctg cgtgatgccg 172921 ggtttgaccc gacggcacgc acggcgtggt tggccgaggg gctgttgatg tacctaccgg 172981 ccgaggccca ggaccggctg ttcacccagg tcggcgccgt gagcgtggcg ggcagccgga 173041 tcgcggccga gactgcgccg gtgcacggcg aagagcggcg agcagaaatg agggcacggt 173101 tcaagaaagt ggccgatgtg ctcggtatcg agcagaccat cgacgtgcag gaactggtct 173161 accacgacca ggatcgggcg tccgttgccg actggctcac cgatcacggt tggcgggccc 173221 gatcccaacg tgcgcccgac gagatgcgcc gcgtgggtcg ctgggttgag ggggtgccga 173281 tggcggacga cccgactgcg ttcgccgagt ttgtcaccgc agagcggttg tagcgagcgc 173341 atccgactga ccttatatat ccggatatat ggctggatct tttctattgc tggttcaacc 173401 gggtgactag gatcgcggtt atcaccgatg agtgaccgcg tcaaggcggt cgcgccgccg 173461 gacggaagga cgatgatgac caccgaatcg gttgcccgga agacccagaa atctgagacc 173521 gaggctccgc gcgaaccggc gcccgtttcg gatgaaaagc aaaccgatgt cgctaaaacg 173581 gtggctcggc tgcgaaagac ctttgccagc gggcgtaccc gcagcgtcga gtggcgcaag 173641 cagcagttgc gcgcgctaca gaagttgatg gacgagaacg aggacgcgat cgccgcggca 173701 ctcgccgagg atctggatcg caatccgttc gaggcatacc tcgctgacat cgcgacgacc 173761 tccgccgaag cgaaatacgc ggccaagcgg gtgcgcaggt ggatgcggcg ccgctacctg 173821 ctgctcgagg tgccgcagct gcccggccgc ggctgggtgg agtacgagcc atatggcacc 173881 gtgctaatca tcggtgcctg gaactacccg ttctacctga ccctgggtcc ggcggtcgga 173941 gccattgccg ctggaaacgc cgtcgtgctc aaaccgtcgg aaatcgccgc tgcatcggcg 174001 cacttgatga ccgaattggt gtatcgctat ctcgacaccg aagcgatcgc ggtcgtgcag 174061 ggcgatggtg cggtgagtca ggagctgatc gctcagggtt tcgaccgcgt gatgttcacc 174121 ggtggcaccg agatcggccg caaggtctac gaaggcgccg cgccgcacct gaccccggtc 174181 accctcgagc tcggcggcaa gagcccggtg atcgtcgcgg ccgatgccga tgtagatgtc 174241 gcggccaagc ggatcgcctg gatcaaactg ctcaacgccg ggcagacatg cgttgcaccc 174301 gactatgtgc tggcggatgc caccgtccgc gacgagctgg tcagcaagat caccgcggcc 174361 ctcaccaagt tccgctccgg tgcgccgcag ggcatgcgca tcgtcaacca gcgtcaattc 174421 gaccggctga gtggatacct cgccgcagcg aaaaccgacg ctgcagccga cggcggcggg 174481 gtcgtcgtgg gcggcgactg tgacgcatcg aacctgcgca tccaacccac cgtggtcgtc 174541 gatcccgacc cggacgggcc gttgatgagc aacgagatct tcggaccgat cctgccggtg 174601 gtcaccgtca aatctctgga cgacgcgatt cgcttcgtga actcgcggcc caagccgcta 174661 tcggcgtacc tgttcactaa gtcgcgtgcg gttcgcgagc gggtgatcag ggaggtgccg 174721 gcgggcggaa tgatggttaa ccatttggct tttcaggtgt cgacggccaa actgccgttc 174781 ggtggtgtcg gcgcatcggg catgggtgcc taccacggcc gttggggttt cgaggagttc 174841 agccaccgta agtcggtgtt gaccaaacca acccgacccg acctgtccag ctttatctac 174901 ccgccgtaca ccgagcgcgc catcaaggtg gctcgccggc tgttctgacc tgggcgcggg 174961 ttgtcgcccc gttgacaccc gactcgttat aaccccgaat tgtgattgcg gagaggagcc 175021 tgatgcccgg agtgcaagat cgcgtcatcg tcgttactgg agccggcggt ggcttgggcc 175081 gcgaatacgc ccttacgctc gccggggagg gcgccagcgt cgtggtcaac gacctcggtg 175141 gcgcccgcga cggcacgggc gccggttcgg cgatggccga tgaggtcgtc gccgagattc 175201 gcgacaaggg gggccgggcg gtcgccaact acgacagcgt cgccaccgag gacggcgcag 175261 cgaacatcat caagaccgcg cttgacgaat tcggcgccgt gcacggtgtg gtgagcaacg 175321 ccgggatctt gcgcgacggc accttccaca agatgtcgtt cgagaattgg gacgccgtgc 175381 ttaaggtgca cctttatggc ggataccacg tgctacgcgc ggcctggccg catttccgtg 175441 agcagagtta cggccgggtc gtggtggcga cctccaccag cgggctgttc ggcaacttcg 175501 gccagaccaa ctatggggcg gccaagcttg gtctggtcgg cctgatcaat acgctggcgc 175561 tggagggagc caagtacaac atccacgcca atgctcttgc cccgatcgcg gcgaccagga 175621 tgacccagga catcctgccg cccgaagtac tggaaaagct cacacccgag ttcgtcgcac 175681 cggtggtggc ctacctgtgc accgaggagt gtgccgacaa cgcatcggtg tacgtcgtcg 175741 gtggtggcaa ggtgcagcga gttgcgctgt ttggcaacga cggcgccaac ttcgacaaac 175801 cgccgtcggt acaagatgtt gcggcgcggt gggccgagat caccgatctg tccggtgcga 175861 aaattgctgg attcaagttg tagaagtaaa tgaaggcttg tgtcgtaaaa gaactttccg 175921 gcccgtccgg catggtgtac accgacatcg acgaggtatc cggtgacggc ggaaaggttg 175981 ttatcgacgt acgggccgcc ggcgtctgct ttccggacct gctgctgacc aagggcgagt 176041 atcaactgaa gctaacgccg ccgttcgtgc ccggcatgga aacggcgggt gtggtgcgtt 176101 cggcgccgtc ggatgcgggt tttcatgtgg gcgaacgtgt ttcagcattc ggagtgctcg 176161 gcggctacgc cgaacaaata gccgtaccgg tggccaatgt ggttcgcagc cccgtcgagc 176221 tcgatgacgc cggggcggtg tcgctgttgg tgaactacaa caccatgtac ttcgccctgg 176281 ctcggcgtgc cgcgctgcga ccgggagaca ccgtgctggt gctcggcgcc gccggcggag 176341 tgggcacggc cgccgtccag atcgcgaagg cgatgcaggc tggcaaggtg atagccatgg 176401 tgcaccgcga aggtgcgatc gactatgtcg cttcgctcgg tgccgacgtg gtgcttccgc 176461 tgaccgaggg ctgggctcag caggtgcgtg accacaccta cggtcagggg gtggacatcg 176521 tcgtcgatcc catcggcgga ccgacattcg acgacgcgct cggcgtgctg gcgatcgacg 176581 gcaagttatt gttgatcggc tttgccgcgg gtgctgtacc gaccctcaag gtcaaccggc 176641 tgctggtgcg caatatcagc gtggtgggcg tcgggtgggg cgagtatctc aacgcggttc 176701 ccggttcggc cgccttgttc gcctgggggc taaaccagct ggtctttctg gggctcagac 176761 cgcctccgcc gcaacgctat ccgttgtcgg aagcacaggc cgcgttgcag agtctggacg 176821 acggcggtgt gctcggcaag gttgtgctcg agccctaagc gcatgctcgc gattcggcga 176881 tacggtgatg ctgtgacgga tcggcgggcc aacacgagga attcgcaccc gctgccggcg 176941 tgaccaacgc cacgctggca gcaatcgggt atccgatcgc gttggccagc aagctgttgg 177001 cgatatcggc cgtcgaaagc acaaccgcgt agccgtccgc aaccacagtg gaaatggtgc 177061 tggcgatctt ggtgtgcgcg agcgcttcga tgccagggtc agggagcccg gtgggcgccc 177121 ggtcgtcggg caacgtcagc atcgacgagc cgccggtctg tgacaagttc gccaacaacg 177181 gattgggcag ccacgccggt acctggcgcg gatgtggtgc acgaacgcgt tgacatattg 177241 ggggctcttc gcggatgagg gtgtagggcg ggtcggcgcg tcgttgccgg gtaggggtcg 177301 cggtctttcg atgatgggcg gttccacgct gccgaaaagg aagacctcgg cgtgtctgcc 177361 cgaggcacta ggtcgcaagg gtaaccgagg gtgcacgttg acggggtgag gccaagcggg 177421 cgccgagcgt gaactgaggg cgagatttcg gccgattctc cgccctcagt tcacgctggg 177481 cgacggcgcc aacgggctgc ccctggccgg tcgcaccaag acgccgcata cgtaccaaac 177541 ttcccatact cacccatcgc ggtgaacccc aaacccagtg ccggccacca ttggccttcc 177601 cgatggattg gtgccagcag caaccggcat catcgaaaac cggctcttca tgatcgaggg 177661 ccggcagcgg ctcgagcagc ggcaggccgg ggtgatcacg tagtagtgct gaatgacccg 177721 agcatcgggc gatcagatgc tgaagctttg cagttgctga gtaatgtcgg ccaacgtcac 177781 cacaatcgcg atgaattcaa tcatgccgcc cagggcggcc aacccaatgg tggccgcgag 177841 cggcagctcg atcgcagcgc ggaggttgcc ggccgccagt tgattcacga acagggtgag 177901 gtcataggcg ggcaggatag tgacgaaggc aagacctaga tctgccgtcg gaagaagaat 177961 cgagtagccg gtcgacacaa cggaagcgaa agtgtccgcg atgttgatga gcgtcgccgg 178021 ttgtggcggc ggtggcggcg gtagcaacgt cggcacatac ggcgggaacg cgggcatcgg 178081 agtttggggc agggtgttca gggcggctgg caactcgacc atgaagtcgt tgacgccctg 178141 ttgcgttccg gcaaccaggg catcggcgac aacgctcgcc gggacatccg ggaagagccc 178201 gaatggggta ggcacgttcg ccgggctcgt cgaatagccg aacctcgggt cgccgtaacc 178261 cagattgacg attacttcca agttcggttc gaccagcgcc gccagcggtg ggccaatgac 178321 cgggattgcc cgcaacgggg ccagcagcgg cagatgctcg gtttcaatga tgtagtacgt 178381 gttcgacgtc gtgccctgtg tcggcaactg cgtggccgac gctatctgtg ccggtgtgag 178441 gtccgcatac gtggtgtgca ccgtgagtat cccgaatact gcgttgatat cggacaggac 178501 attgagtgga taccgcggga agtcggcgaa accgtcgtac tcgagggtgt aggtcgtcgt 178561 cggataggga ttgtccgggg tcgccccgta gaacggtagg ccgagggtgg tgacattcag 178621 accgggtatg cgcgcaagta tcccgccatt gggattcatc tcgttgccga tcaagatgaa 178681 attgagctgg ctggggctgg gagcgttggg acccagcgag atgaggtgct gcatttccag 178741 ggacgcgatg acggcgctct gcgaatagcc gaacacggtg acgtggtttc cggcgttgat 178801 ttgctcccaa atcgcgccgt cgagaatctg taggcccaac tgcaccgagg tttggaaggg 178861 cagggatttg acgccggtga tcggatatag ctcttcgggc gtcaccagcg ctttgacgac 178921 cggattcgag acgacggggt cgatgaacaa ggtcgtgatg gcgttgacat aactcggcgt 178981 gggtatcggt gacccggtgc cgcccatgat gatcgccgta ttttggttga acattggcgg 179041 tagcaccggg ggtgaggttg gcttaaagag tccggccgtc gcctcctgca ccagcgcgct 179101 cgtgttggtg gcctcggcat tgacaaatgc gtttgcggcc gccgccaacc tctgggtgaa 179161 ttcgttgtga aacgccgcaa cctgtgcgct gatcgcctgg aactgctggc cgtacgcgcc 179221 gaacagcgtg gcaagggccg tggacacttc gtccgcggca gccgccgcca ggccggttgt 179281 cggggccgcg acggccgccg tagcctggtt gatcgccgag ccgatcccgg ccaaatcggt 179341 agccgccgct gccaataccg acggctgcgc gaatacgtac gacaaacccc atccctcctt 179401 gtcgacgggg cccataaccc acctgtcgag ccgatacgtt gagcgtaaag cgactccgcg 179461 gttgtgtctg gcctttggag tgaacccaaa tggggccatg ctgcctcgtc attggcgagg 179521 tcggtaaacg gtagtcggtg gacgtcgatg ccgtcgggaa tccgttaggt gacgaggccc 179581 tcgatgtttc gaacggtgtc cgaggccgcc gcgaggaggg tgagcaattc cacgccgccc 179641 gctatcgatc gtgcctaaac ctacggtggc cgccagggga tagccgatcg cgttgatcag 179701 attgcccgca gcgagttgcc tgacgaacag ttgggtggtg tacagcggca gggtggtgac 179761 cagggcgagg gcgatgtcca cggtgggcag caggacggcg tagttggttg agatgatcct 179821 ggcgagcgtg ttcaccacct cggccggcgt cggtgcggcg gccaccgcgg ccaccagatc 179881 ggcgggttgc ggcagctgga tctgcgggag cgtgagcggt tgcgcggaca gcgcctgcag 179941 gtcggccgtg aagtcaagga tgccttcttg tgttccggcg gccagggcat cggcgatgac 180001 ctgaggcggc acgttcggcc acagcccgaa cggcgttcgc acatcggcgt agctcgtcga 180061 gtagccgtag ttcgggtcgc cgtagcccag gttgacgatc accttcaggt tcggctggat 180121 caggtcggcc agcggatctc cgatgaccgg caccgcccgc agcggttgca gcagcggccg 180181 attctcggtg cggatgatgt agtagtcggt gacccccgta tagcccggcg acgtcggtaa 180241 tttagtagcg ccctcgacct gcgcgggcgt gaggtccaaa tacttggtgt gtacgaatgt 180301 gatgcctgca accgcgttga ggtcggaaat gaagttgagc gggtatcgcg agaagtcggc 180361 gaacccgtcg tactcgagcg tgtagatggc cgtcggatag atcgtgtccg agggcgttgc 180421 gccatagaac gtcaggtcca gagtcggaag cgtcagatcc gggaaccgcg cgagcatacc 180481 gccattgggg ttcatttcat tgccgacaag cacgaaattg aggtcgctcg ccgaaggtgc 180541 ggccccgccc atcgccgtga acctctgcat ctccagcgac gcgattatgg cgctttgcga 180601 ccagccgaaa acggtgaccg cgtttccggt ggtcgcgagc tctaccatga tcgcgtcgtg 180661 caagatggtc aagccctctt ccactgacgt gttgaggacc aaacttctga caccggtgag 180721 tgggtacaac tcttcgggtg tgaagacggc ttgtagcgcc cgccgaacgg acctacagcg 180781 tattggcggc gtcaacatag acggcggtgg tagtggaatt ccggtgggcc caaagaacaa 180841 ggtggtcaag ttcgccggga atggcggaat catcgcggcc gccgcggggg ttggtgcggc 180901 ggcgggcaca gccagctgat tttgccgggt gctggcgatg gcggcctcgg catctgcgta 180961 gctgttcgcc gcggcggcca acgtctggtg gaacctaact gtgaaacgcc tcgacttgag 181021 cgagcacggc ctggtattcc tggccgtatg cgccgaacgg tttcgcgatg gcggccgaca 181081 cctcatcgcc ggccgccgcg gccagtgcac acgtcgggcc tgccgcggcc gcgccggccg 181141 tactcacggc cgaaccgatt cctgccacct cggcggcggc cgccgctacg atccgcggct 181201 cagcgatcag atacgacatc gtctcactcc cctagcacca ggtgtcggcc aaccgggtca 181261 acccggggtt ttggtcagcc cagagcggtc ccgctgccct ggtggtcgct tacgcgaatc 181321 ggattcgcgc gaaagcgttt cccctcatcc gagcagcacc ccgcgcatcc ggttgactgt 181381 ggcctggctg ataccggcgt cgcgcaggta gccgcccagc gatccgtagg tctcgtcaat 181441 ggtctggcgt gcggcggcca ggtactccgc gcggacaccc aggaccccgt cggacagccg 181501 ggccttggtg aacgtcacca cctcgggtgc cagttcggtg tcgaaacgct gctggatcat 181561 ctcggagatc cgggcccgca gttgtggcac ggagtcgttg ctgcgcaggt agtcggcgac 181621 gatgacgtcg cggtccaggc cgaccgcttc aagcaccagc gcgaccacga agccggtgcg 181681 atccttaccc gcgaagcagt gggtgagcac cgggcgtccg gcggcaagca gtgtgacgac 181741 acgatgtagc gcgcgctgtg ctccattgcg cgttgggaat tggcgatact cgtcggtcat 181801 gtagcgggtg gccgcgtcat ttatcgactg gctggattcg ccggactcgc cgttggaccc 181861 gccattggtt agcagcctct tgaatgcggt ttcgtgcggc gctgagtcgt cggcgtcatc 181921 atcggcgagg tcggggaacg gcagcaggtg gacgtcgatg ccgtccggaa cccgtcctgg 181981 accgcggcgg gcaacctccc gggacgaccg caggtcggca acgtcggtga tccccagccg 182041 gcgcagcgtt gcccggccgg cgtcgtcgag gcggctcagc tcgctggacc ggaacagccg 182101 ccccggccgc aatgcggttg cggtgtcggc gacgtcacga aagttccacg cgcccggcag 182161 ttcacggaca gccatctcag gtgaccgccg cagcgaaggt ggacttctcc ctcgacagct 182221 cggcgcgggc gatggagcgc aggtgcacct cgtcgggacc gtcgaagatg cgcatggcgc 182281 ggtgccagcc gtacaaccgg gccagcgggg tgtcgtcgct gacgccggcg gccccgtgga 182341 cctggattgc gcggtcgatg acatcgcagg ccacccgcgg ggccaccgcc ttgatcatgg 182401 cgaccaggtg gcgcgcctct ttgttgccat gttggtcgat tgtccacgcc gccttttcgc 182461 acagcagcct tgcctggtcg atttcgttgc gggactgagc aatcgcctgt tgcacgacgc 182521 cctgttcggc tagcggacgg ccgaacgcca cccggttgcg gacgcgattc accatgagtg 182581 ccaaggcgcg ttcggccgcg cccagcgcac gcatgcagtg gtggatacgg cccggcccca 182641 gccgggcctg ggctatggcg aatccgctgc cctcttcgcc gagcaggttg gtggccggga 182701 cccggacgtt gtggtagtcg atctcgcagt ggccgtgccg gtcctgccag ccgaacaccg 182761 gtgtggagcg aacgatcgtc acgccggggg tgtcgatcgg gacgaggacc atcgactgct 182821 gttggtgggc ggctgcgtcc gggttggtgc ggcccatcac gatgaggatc ttgcaccgcg 182881 ggtccgccgc tcccgacgtc caccacttac ggccgttgat gacgtagtcg gcaccgtccc 182941 gggagatggt ggtttcgatg ttgcgggcgt cgctgctggc caccgccggc tcggtcatcg 183001 agaaggcgct gcggatcttg ccgtcgagca gcggccgcag ccattgcgcc cgttgctgct 183061 cggtgccgaa catgtgcagg atctccatgt tgccggtgtc cggtgcggcg cagttgagtg 183121 cctcgggcgc gatttccatg ctccatccgg tcatttcggc cagcggcgcg tactccaggt 183181 tggtcaatcc cgactcggcc gacaggaata ggttccacag gccgcggtct ttggccttgg 183241 ttttcagttc ctcgatgatc ggcggcgcgg tgtggtcggc cggtccggcc gcgcggcgat 183301 agtcgtcgta atcggcctca gcgccgaaga cgtgctcggt catgaagtcg gacaaccgcg 183361 tgcggtagtc gatggccttg gccgacatcg cgaagtccat tccgccacga tatctaccgg 183421 cgctagcaga cgcataagtc cctcgacacg ccgacgagaa gggggttttg cgtctgctcg 183481 ccgtcgtttc gtgccaccgt tcaactgacc cgcaagtggc agcgcgagct cgactattcg 183541 ctacgcaaga gtttgtggag cttccacgac aaccgcattg cgatgcggtt ccagtacgaa 183601 tcccgtgacc gcaacggcca gtggtatcgc agctacggca ccgaactgtg gcgaagccag 183661 catcaacgac gtgccgatcg ccgaatccga gcgtcgctac ctcggtgcgc gctcggcatc 183721 cgagtatggc caggaaatac cgctctggta gcccggtagg gtgtctgagc aaatctatcg 183781 gcgttcagta aggaaagtgg atgtacgcgc catgacagat ccgcagacgc agagcaccag 183841 ggtcggggtg gttgccgagt cggggcccga cgaacgacgg gtcgcgctgg ttcccaaggc 183901 ggtcgcgtcg ctggtgaacc gtggtgtggc ggtcgtggtc gaggccggtg cgggcgagcg 183961 cgcgctgctt cccgatgagc tctacaccgc tgtcggtgcc agcatcgggg atgcttgggc 184021 cgccgacgtc gttgtcaagg tcgcgccgcc gacggcggcg gaggtcggcc ggttgcgcgg 184081 tgggcagaca ctgatcggct ttctagcgcc ccgtaatgct gacaactcga tcggcgcgct 184141 gacccaggcc ggggtgcagg cgttcgcgct cgaggccatc ccgcgcatct cgcgggcgca 184201 ggtgatggac gcgctgtcgt cgcaagccaa cgtgtctggg tataaggctg tgctgctcgc 184261 ggcctcggaa tcgacccggt tctttccgat gctgacgacg gcggccggaa cggtgaagcc 184321 ggccacggtg ctggtgctcg gcgtcggcgt ggccggcctg caggcgctgg cgacggccaa 184381 acggctaggc gcgcgcacca cgggctacga tgtgcgtccc gaggtggccg accaggtccg 184441 atcggtgggc gctcaatggc ttgatttggg catctcagcg tccggtgagg gcggttacgc 184501 ccgcgaactg accgacgacg agcgcgccca gcagcaaaag gcattggaag aagcgatcag 184561 tggcttcgac gtggtgatca ccaccgcgct ggtgccgggc cgcccggcgc caacgttggt 184621 gaccgccgct gcagtggaag cgatgaagcc tggcagcgtg gtggtggatc tcgccggcga 184681 gacgggcggc aactgcgaat tgaccgagcc cggccggaca gtcgtcaagc acgacgtcac 184741 cattgccgca ccgctgaacc tgccggccac gatgcccgag cacgccagcg agctctacag 184801 caagaacatc accgcgctac tcgacttgtt gatcaaagac ggcaggctgg ccccggactt 184861 cgacgacgag gtgattgccc agtcgtgtgt cacccgcggg aaggactcct agatgtgcaa 184921 cgaattgttg gagaacctgg cgatcctggt gctgtccgga ttcgtcgggt tcgcggtgat 184981 ctcgaaagtg cccaacacgt tgcacacccc gctgatgtca ggaaccaacg ccatccacgg 185041 cattgtcgtt ctcggcgcgc tggtggtttt cggcgaaatt gagcacccat cgctcgtgtt 185101 gcaggtcatc ctgttcgtcg cggtggtgtt cggcacgctg aacgtcatcg gcggattcat 185161 cgtcaccgac cgaatgctcg gcatgttcaa ggccaagaag cccgccgtgc cagccaagcc 185221 cgaccgcgac gaggcgctcc gatgaacctg cactacctgg tcgagattct ctacatcatc 185281 tccttttcac tcttcatcta cgggttgatg gggctcaccg gccccaagac cgcggtgcgc 185341 gggaacctga tcgccgcggc cggcatgacc atcgccgtgg cggccacgtt ggtcatgatc 185401 cgacacacca gccaatggcc gctgatcatc gccggtctgg tggtgggtgt tgtgctcggt 185461 gtgccgccgg cgcgactgac caagatgacc gccatgccgc agctggtggc attcttcaac 185521 ggcgtgggcg gaggaacggt cgcactcatc gcgctgtcgg agttcatcga taccaccggc 185581 ttttccgcat tccagcacgg cgagtcgccg accgtgcaca tcgtggtggc ctcattgttc 185641 gccgcgatca tcgggtcgat ctcgttctgg gggtctatcg tcgcgttcgg caagttgcag 185701 gagatcatct ccgggcggcc gatcggactc ggcaaggcgc agcagccgat caacctgttg 185761 ctgctggccg tggccgtggc cgccgccgtg gtgatcggac tgcacgcgca tcccgggagc 185821 ggtggggtcg cattgtggtg gatgatcggc ctgttggtcg ccgccggcgt gctgggtctg 185881 atggtggtgt tgccgatcgg tggcgccgac atgccggtgg tcatctcgat gctcaacgcc 185941 atgaccggcc tgtcggccgc ggcggcgggt ctggcgttga acaacaccgc gatgatcgtg 186001 gccggcatga tcgtcggcgc gtccggctcg atcctgacca acctgatggc taaggcgatg 186061 aaccgctcca ttccggcgat cgtcgcgggc ggtttcggcg gcggcggtgt ggcgcccagt 186121 ggcggcggcg acgacaaaca cgtcaaggcc acttcggccg ccgatgccgc gatccagatg 186181 gcatacgcca atcaggtgat cgtggtgccc ggctacgggt tggccgtcgc gcaggcgcag 186241 catgcggtga aggacctggc aaccttgctg gaggacaggg gtgtgccggt caagtacgcg 186301 attcacccgg tcgccggccg gatgcccggg catatgaacg tgctgctggc cgaggccgaa 186361 gtcgactacg acgcgatgaa ggacatggac gacatcaacg acgagttcgc ccgcaccgac 186421 gtcaccatcg tgatcggcgc caacgacgtc accaacccgg cggcccgcaa cgagacgtcc 186481 agcccgatct acggcatgcc gatcctcaac gtggacaagt cgaggtcggt gatcgtgctc 186541 aaacggtcga tgaattccgg gttcgccggc atcgacaacc cgctgttcta cgccgacggc 186601 accactatgt tgttcggtga tgcgaagaaa tcggtgaccg aagtctccga ggaactcaag 186661 gcgttgtagc gcgcgagcgc tggctcagac gggcggatac gccggcggcg ggtatccgtc 186721 gccggtttcg accccgcgta gaccccaggt gaggtaccgg aagaagaact cgatttcgtc 186781 gctcacgtcg tagtcaggac tcggatccat cacttcaccc tctcgactcg cgacttggtt 186841 cgcaacggag tttagtcaca tccgcgccgg tgcgacaggt tgtcgccgcc ttgcctaaac 186901 tgaacaacca gttgattgat acagcttcgg ccggggccca tgggctccac cggcagcgac 186961 gatagcgagt agcgatgcca tccgacacca gccccaacgg gctaagccgc cgtgaggagt 187021 tgctggctgt tgccaccaaa ctattcgcgg cgcgcggtta tcacggcacc cggatggacg 187081 acgtcgccga tgtgatcggg ctcaacaaag caacggtcta tcactactac gccagcaagt 187141 cgctgatcct gttcgacatt taccgtcagg cggccgaggg caccctggcc gccgtgcacg 187201 acgatccgtc ctggacggcc cgtgaagcgc tgtaccagta cacggtccgg ctgctcactg 187261 cgatcgcgag caaccccgag cgggccgccg tgtacttcca ggagcagccc tacatcaccg 187321 agtggttcac cagcgagcag gtcgccgagg tccgcgagaa ggagcagcaa gtctacgagc 187381 acgtacacgg cctgatcgac cgcgggattg ccagcggcga gttctatgag tgcgactcgc 187441 atgtggtggc gctggggtac atcgggatga cgctgggcag ctaccgctgg ctgcggccga 187501 gcgggcgccg aacggccaag gagatcgcgg cggagttcag cacggcactg ctgcgcgggc 187561 tgatccgcga cgaatcgatc cgcaaccagt ctccgcttgg aactcggaag gaaacgtgaa 187621 cctcacgcga tcggtggaat caatctcgct acggacccga gggcgccact gagcaccgac 187681 aactccgtca cactggattg accgaagttg aacatcaggc ccggattcgc cgacggaaga 187741 tacggatacg tattgggtag cgcggactgc ggtaacaatc cgatgcttac tagggcggct 187801 tgggggcctt gcacggtccc ggtcgccagg gccgaggcca cggcgatcgg gttgattggc 187861 gcgaacaggc tggccggggt gggtacgtcg gcgtagccgt agccatagcc caagtcgact 187921 agcacccgta ggtcgggctg aatcagctcg gctattgggg tccctacgaa ggggatggcg 187981 cgaatcggct gcaacagcgg caggtcctgg gtcagaaaca tgtagtaatg ggtgttgccg 188041 gtgtagcccg gagacgtggg caacggcacg gcattggcaa cctcggccgc ggtgaagggg 188101 tacgcgttgt gcacccatct gatgcccatg aaggcgttga ggtccgacaa gatattgagc 188161 gggtactgcg ggttgtgggc gtagccgtcg tattggccgg tgtacatgta ggtctggtag 188221 ggggaatccg gtggagtcgc accgttgaac gacatatcca agaacgtgag gtaaaggccc 188281 acgtaacgct cgaggacgcc gccgttgggg ttattgatat taccgatcaa cgtgaaagcc 188341 agccggcttg gatctggggc ttggcccggt ggtaacgcca taagagcgcg tatttcattg 188401 gtcgctaccg cggcgctttg cgagtagccg aaaacgacga cgtcatgccc attttgtagt 188461 tccgcgttga tgccgttgtt cagcagcgtg acaccctggg cgatggattg gtccagtgac 188521 aggttcccga taaacggcca ccactgctcg ggcgtgtact gggcgaccgg gttgttgggc 188581 ccgaaaatgg gccgaatgta tgcgctgtca atgatcgcca agacgcggtc actaaggatc 188641 ggttccccgg tgccgcccat catcaacgcg gttagcgggt tgcctgacag catcccgaca 188701 gaaccgaggg cgccgctgga cccggcggtg cccgacatag cagcggtgtt gctggcttca 188761 gcctgggcat aggcggcccc ggcggcagcc agcgcccggg tgaactcgcc atggaacgcc 188821 gcagcctgct ttaggacctc ttgacattcg cgcgcgtatt cgctgaacag cgctgcagcg 188881 gccgacgaca cctcatcggc ggccgcggcc agcagtccgg tcgttggacc cgcagcggac 188941 gcgctggccg ctcgtatcgc cgaaccgatc ccgtccacgt ccgcggccgc cgttgccaac 189001 atctccgggg ccgcgatgac gtaggacatc tggtctcctg ttcgacgctg gggcccttag 189061 agcctagagc gcgcccgccg ggaagcccgg cgttttcggc caatcgttat cgcggccgcg 189121 tcaggtgaag accggtggcg ggatcaggtg caggatgttg ccgagaccgc cactcatcag 189181 ggatagcagt gtcacctgtg gctggccgaa gtagaaattc aggcccgggt ttatcgacgg 189241 gacccacgga tagctgtccg ggaaccactc cggcccaatc aatccggctt ccaccccaat 189301 ctccacgatg gcgccatagg gcgcctgcag gctccctttg atcaggtaat acgtgacagc 189361 gaacgggttg gggatcgaga acagcccggc cggagtgggg atatccgcgt aattgccgcc 189421 cggcccgtag tcggcgtagc ccaagtcgac gagcacccgc agctgcggct ggaacaggtc 189481 ggcgatcggg ggaccggcgt aggggatgtc acggatcggc tggagcagtg gcagatcctg 189541 agtcaggaac atgtagtact gggtgttgcc ggtatagccc ggggaggtcg gcaacggcac 189601 cgcgttatcc acctgggtgg ccatgagttc cgggtacgtg ttgtgcacgt agaagtagcc 189661 catgaaggcg ttgatgtccg acaggatgcg cagcgggaat tgcggcgcgt gggcgatgcc 189721 gtcgtactgg gccgtgtaaa tgtgtgtcgg gtagggacta ttcgccgggg ttgcgccatt 189781 gaacggcacg tccaggaacg ggatgtagaa gccggggaag cgcgccagca gcccgccgac 189841 gggattgttg ccactaccaa tcatgacgaa ggagatatcg tccggattcg gcgaacccat 189901 cgccatcagc gaattgatgt agttgttgat gatcgtggcg ctctgcgagt agccgaacgc 189961 aacgaccttg ttgtcgaggg ccagttggtt gttgacggtg gtattcagca gcgccacgcc 190021 ttcggtgacg gactggttga acgtcagatt gccgaggtcg ggggtaaccg gccagaactg 190081 ctcgggcgtg aacaggcctt gcgagacagc acccgggaag agggtctgga tgaaagcctt 190141 gttgatgtct gtcacgtact cggggtcggg tagcgggtta ttggtgccgc ccataatcaa 190201 cgccgttatc ggactctccg cagccagctg cgcgatcgcc ggcagcccgc cggccccgct 190261 ggatccgttg gggctcaacg gcgtacggcc caacagcgtc cggatcggtg cgttgatagt 190321 gtccagcgcg tgcgataccc gggccgcatt ggccgcttcg gcgtgtgcgt aggcgttgcc 190381 ggcggcctcc aacgtccggg tgaactcgct gtggaacgcc gcggcctgct tgacgaccgc 190441 ctgatactcc cgcccgtatg cgctgaacag ggccgccgtt gccgccgaaa cctcatcgcc 190501 ggccgcggcc agcaggttac atgtcgggcc tgccgcagcc gcgttggcgg cccgcagcgt 190561 ggaagcgatc tcatccacat gggcagctgc cgtcgccagc atgtcagggg ctgtgaccag 190621 gtgcgacatc tccccgtcct tcccaacgga ccggcgcccg caccggtcac ttgggactga 190681 cccgctaccg cgggtattag gtacttaacg agagtaaggc ggtcctgccg ctacgtccgg 190741 cgtttggaca aacctcgatg actgcctgac ctatggcggc tgctataacc gcgagcatgc 190801 taaccagctt ggtgagtgcg gtcggatcgc atcacgtcac caccgaccct gacgtgctgg 190861 ccggccgcag cgtcgaccac accggccgct atcggggccg ggccagcgcg ctggtgcggc 190921 ccggctcggc tgaagaggtc gccgaagtgc tgcgggtgtg ccgggacgct ggagcctatg 190981 tcaccgttca aggcggccgc acctcactgg tggcgggcac cgttcccgaa cacgacgacg 191041 tgctgctgtc taccgaacgg ctttgcgtcg tcagcgatgt cgataccgtt gagcgccgaa 191101 tcgagatcgg tgccggggtc acactggccg cggtgcagca cgccgcgtca acggctgggc 191161 tggtgttcgg cgtggatttg tcggcccggg ataccgcgac cgtcggtggc atggcctcga 191221 cgaacgccgg cggattgcgc acggtccgtt acggcaacat gggcgagcag gttgtcgggc 191281 tagacgtcgc gctgcccgac ggtacggtgc tgcgccggca cagccgggtg cgtcgcgaca 191341 acaccggcta cgacctgccc gcgctgttcg tcggggccga aggcaccctg ggggttatca 191401 ccgcgctgga tctgcggctg caccccaccc cgtcgcatcg ggtgacagcc gtgtgcgggt 191461 tcgccgagct ggcagcgctg gtcgatgccg gccgaatgtt ccgcgacgtg gagggcatcg 191521 cggcgttgga attgattgac ggtcgggccg ccgcgctaac ccgtgaacat cttggcgttc 191581 gcccccccgt cgaggctgac tggttgctat tggtggaact ggccgccgac cacgatcaga 191641 ccgaccggct cgccgacctg ctcggcggtg cacggatgtg cggggagccc gcggtcggtg 191701 tggatgccgc tgcgcagcaa cggttgtggc gcacccgtga atcgctggcc gaggtgctcg 191761 gtgtgtacgg cccgccgctg aagttcgacg tctcgctgcc attgtcggcg atcagcggct 191821 tcgcccgaga tgcggtcgcg ttggttcacc gacacgtccc ggattctccg gaggcgttgc 191881 cgctgttgtt cggtcacatc ggtgagggca acctgcacct gaacgtgctg cgttgcccgc 191941 ctgatcggga accggcgttg tacgcaaaga tgatgggcct catcgccgaa tgcggcggta 192001 acgtcagttc agaacatggg gtgggcagcc gcaagcgtgc ctacctggga atgtcccggc 192061 aggccaacga cgtcgccgcg atgcggaggg tcaaggcggc gttggacccg accgggtacc 192121 ttaacgccgc ggtcttgttc gactgaccgg tgctgcgcaa gcattcagcg cctttaaaga 192181 tcaccggtga aactgatgag ctgacgcacc gcgatgccat cggcgaggtg gtccatcgcc 192241 tcgttgatat cgtccaaccg aatcgttgac gtcaccagcg actccaccgg cagacggccc 192301 gattgccaca acgacacgaa gcggggaatg tcgtggctgg gcaccgccga acccagatag 192361 ctgccgatca gtgaccggcc ttcggtgaca aaatccaacg gcgacaagct gatccggaca 192421 tccggtggcg gcaacccgac ggtgatggtg cgccctccgg gcgcggtaag cccgatcgcg 192481 gtgtgcagcg cggcaggatg accgacggct tcgacaacca cggcggcttt gaccccgccg 192541 gccgtggcct gctgcggtgt gtagatctca tgggcgccca aggcctttgc ggccgacagc 192601 ttttcgggta gctgatcgac ggcgaccaca cgaacgtctg tatacgtcaa agcggtgagc 192661 accgctgcca taccgacgcc cccgaggccg acgacggcga ccgactggcc gggctgcgga 192721 tcaccgacgt tgagtaccgc acccccaccg gtgagcaccg cgcacccgag tagggcagcg 192781 acggtgggcg gcacctcgtg cggcaccgga accacgctgg cccggttgac gacgacatgg 192841 gtcgcgaaac ccgagacgcc gaggtggtgg tacaccgggc ggccgccccg gctgagccgg 192901 ataccgccac cgagcagtgt gccggccttg ttggccgcgc tgcccggttc gcacggcgtc 192961 cgaccgtcgg tcgcgcacgc cgcgcactgg ccgcaacgcg gaaggaacac cagcacgact 193021 cgctgaccga ccgcgacccc gtcgacgccg tcgccgacct gctcgacgat tccagcggct 193081 tcatgaccga gcaagatcgg caccggccgt acccgggtgc cgtcgaccac cgacaggtcg 193141 gagtggcaca cgcccgcagc ctcgattcgg acaaggacct caccgcggtc gggcgggtcc 193201 aggtgcagct cgacgacgct gattggtttc gaccgccaat agggccgcgg cacaccgatc 193261 tggtctagca ccgcgccccg gatggcaggc atgttggaat acaaccatgg ctgcactgcc 193321 ggcaccggag aagctcctgc gcagcgactt tccggtgctg tggccggtgg gaactcgatg 193381 ggccgacaac gacatgttcg gccacctcaa caacgccgtc tactaccagc tgtttgacac 193441 cgcgataaac gcctggatca acacgagcac cggggttgac ccgctcgcga tgcctgtgct 193501 gggcattgtc gcggagtcgg gctgccgtta tttctcggaa ctgcgtttcc cggagagcct 193561 aatggtgggc ctggctgtga cgcggttggg gcgcagcagc gtcacctacc ggctgggtgt 193621 gtttaaggag cctgacgatg cgggggtgat caccgcactc gggcactggg tgcacgtcta 193681 tgtcgatcgg actagccgca ggccggttcc gattcccgag gccattcggt cgctgttgtc 193741 gacggcttgc gtaagcggat aagccgcgcc cagattgcgt tcagggctgt gattttcgcc 193801 gctccaacca cagccatgac ggcaatctcg tgctcaccgc gacccaggta tgcttcccga 193861 atgccagttt tgagcaagac cgtcgaggtc accgccgacg ccgcatcgat catggccatc 193921 gttgccgata tcgagcgcta cccagagtgg aatgaagggg tcaagggcgc atgggtgctc 193981 gctcgctacg atgacgggcg tcccagccag gtgcggctcg acaccgctgt tcaaggcatc 194041 gagggcacct atatccacgc cgtgtactac ccaggcgaaa accagattca aaccgtcatg 194101 cagcagggtg aactgtttgc caagcaggag cagctgttca gtgtggtggc aaccggcgcc 194161 gcgagcttgc tcacggtgga catggacgtc caggtcacca tgccggtgcc cgagccgatg 194221 gtgaagatgc tgctcaacaa cgtcctggag catctcgccg aaaatctcaa gcagcgcgcc 194281 gagcagctgg cggccagcta aggcatgtgc gggctcagcc gaagacttcg gtctcagcca 194341 gggcctccgt cagcctgcgt gccccatcgg tgaactgcca gacggtgtgc tcgattacgg 194401 cggctgtgtc gcggcggcgc agcgcggcga tcagctgccg atgactgttc accgcgtccg 194461 cgccccatcg cgggtcggcc gcgaacacct gcgccggcat atagcgcgcg gcattaagca 194521 ggaaccaggc caacttgatc cggcggctcg ctttgttgaa gacgcggtgg aacgcgaact 194581 cgatcgacgc gatggttttg gcatcaccgg acccgatagc accggccagc gcattgttga 194641 tgcggtccag ctcgtcgatc tcaacgtcgg tgatgtgagc ggtggccgat gtggcaagtt 194701 cttgggcaat ggtggcctgc agccagaaaa tgtcgtcgat gtcttggcgg gtcaacggca 194761 gcaccacgtg gccgcgatgt ggctccagcc cgaccatccc ctcaccgcgc agtttcagca 194821 gcgcctcccg caccggcgtg acgctgactc cgagctcggc tgccgtctcg tccagacgga 194881 tgaacgttcc agagcgcagg gcgcccgaca tgatggcggc ccgcaggtgg cccgcgacct 194941 cgtcggacaa ctgtgcccgg cgcaggggaa gctggctccg cggcttcgcc gatagaggtg 195001 cgttcacgtg gcttgccagg actttcaggg tcgggccggg attgccgggg acttgccggg 195061 ggcttggcgg gggcttgttg ttgggccgct caggccatag tgtgacccag acaacatcat 195121 gctttatcaa atatcaacct ggcgcaaggg atgcgcaagt gaaaggaagg gaaggaaggg 195181 atagttgacc gcgcaactgg ccagtcacct gacgcgggcg ctaacactag cccaacagca 195241 gccctacctt gctcgccggc agaactgggt caaccagctc gaacggcacg cgatgatgca 195301 gccagacgcg ccggcgctga ggtttgtggg caacaccatg acgtgggctg acctaaggcg 195361 ccgggttgcg gcgctggcgg gcgcattgag cggtcgcggg gtcggtttcg gcgatcgggt 195421 catgatcctg atgcttaacc gcaccgagtt cgtcgagtcg gtgctggccg ccaacatgat 195481 cggggccatc gccgtaccac tgaatttccg gctcacccca accgaaatcg ccgtcctggt 195541 cgaagactgt gccgcacacg tgatgctgac cgaagctgcg ctggctccgg tggccatcgg 195601 tgtccgcaac atccagccct tgctgagcgt gatcgtggtc gccggcggat ccagccagga 195661 cagcgtgttc ggctatgagg acctactcaa cgaggccggg gatgtccacg aaccggtgga 195721 catcccgaac gactcgccgg ccttgatcat gtacaccgcg ggcaccaccg gccgcccgaa 195781 gggcgccgtg ctgactcacg cgaacctcac cggtcaggcg atgaccgcgc tctacaccag 195841 tggcgccaat atcaacagcg acgtcggttt cgtcggcgtc ccgctgttcc atatcgccgg 195901 aatcggcaac atgctgaccg ggctgctgct cggcttgccc acggtgatct atccgctggg 195961 cgcgttcgac ccgggacagc tgctcgacgt gctggaggca gagaaggtca ccggcatctt 196021 tctggttccc gcgcagtggc aggcggtctg taccgaacag caagcacgac cacgtgactt 196081 gaggttacgg gtgttgtcgt ggggagctgc gccggcgccg gatgcgttgc tgcggcagat 196141 gtcggcaacc tttcccgaaa cccagatact ggccgcattc ggccagaccg agatgtcacc 196201 ggtcacctgc atgctgctcg gcgaagatgc gatcgctaag cgcggatcgg tcggcagggt 196261 gatcccgacc gtcgccgcaa gggtggtcga tcagaacatg aacgatgtcc ccgtcggcga 196321 agtgggcgaa attgtctacc gggcaccaac attgatgagc tgctactgga acaacccgga 196381 ggccaccgcg gaggcgttcg caggcggctg gttccattct ggggatctgg ttcgtatgga 196441 ctccgacggt tacgtctggg tggtggaccg caagaaggac atgattatct ccggcggtga 196501 aaacatttac tgcgccgagc tggaaaacgt tctggccagc catcccgaca tcgccgaagt 196561 cgcggtcatc ggccgggccg acgagaagtg gggagaggtg ccgatcgcgg tcgcggccgt 196621 aacgaacgac gaccttcgga tcgaagacct aggtgagttc ctgaccgacc ggcttgcgcg 196681 ctacaagcac cccaaggcgc tcgagatcgt ggacgctctg ccccgcaacc ccgcggggaa 196741 ggtgctcaag actgaactgc gattgcgcta cggcgcctgt gtgaatgttg aaagacgttc 196801 tgcatcagct ggtttcacgg agagaaggga aaatcgacag aaattgtaac gtttgcccgc 196861 tattgacgaa gggttaaatg tgcggatgcc ttacactcct ggctggccat cgggtagatt 196921 cctgtggtct ccgttactcc ctgtgagtaa cgaggtggcg gtcacacacc aagggtcggg 196981 gcaaggagga ggcgtgcgac atgatgcgcc gcggcgccgc gatacccagg tcggcggctt 197041 gagggagccg cggtgacgac gtcgacaacg cttggcggtt acgtccgcga ccaactgcaa 197101 accccgctga ccctcgtcgg tggattcttt cgcatgtgtg tgctgactgg aaaggcgctg 197161 tttcgctggc cgttccagtg gcgcgagttc attctgcagt gctggttcat catgcgggtc 197221 ggatttttac cgacgatcat ggtctcgata ccgctgacgg tgctgttgat cttcacgctc 197281 aatattctgc tggcccagtt cggcgcggca gacatctccg gttccggcgc ggcgatcggc 197341 gcggtcaccc agcttggccc gctgacaacg gtgctggtgg tcgccggcgc cggatccacg 197401 gccatctgcg ccgacctggg tgcccgcacc atccgcgagg aaatcgacgc gatggaggtg 197461 ctgggcatcg atcccatcca ccgtctggtg gtgccgcggg tgctcgcctc gatgctggtc 197521 gccacgctgc tcaacggctt ggtgatcacc gtcggcctgg tcggtggctt tctcttcggt 197581 gtctatctgc agaacgtttc gggcggcgcc taccttgcca cgctgacctt gatcaccggc 197641 ctgcccgagg tggtcatcgc aaccatcaaa gccgcaacgt tcggcctgat cgcgggcctt 197701 gtcggctgct atcgggggct gaccgtccgt ggcggttcca agggtcttgg caccgccgtc 197761 aacgagaccg tggtgctgtg tgtgattgcc ctgttcgccg tcaacgtgat cttgacgacc 197821 atcggtgtgc gattcgggac ggggcgctga catgtcgacc gctgctgtgc tgcgcgcccg 197881 cttcccgcgg gcggtcgcca accttcgtca atatggaggt gcggcggccc gtggattgga 197941 cgaggccggc cagctcacct ggttcgcttt gaccagcatc gggcagatcg cgcacgcgct 198001 gcgctactac cgcaaggaga cgctgcggct gatcgcccag atcggcatgg gtaccggcgc 198061 gatggccgtc gtcggcggca cggtcgccat cgttggcttt gtcacgctgt ccggcagctc 198121 gctggtcgca atccagggct tcgcgtcgct gggcaacatc ggtgtcgagg cgttcaccgg 198181 gttcttcgcc gcactgatca acgtgcgcat cgccggccca gttgtcacgg gtgtcgccct 198241 ggcggccacg gtcggtgcgg gtgctacggc cgagctgggc gcgatgcgga tcagcgagga 198301 gatcgatgcc ctggaagtga tgggcataaa gtcgatctcg tttctggcct ccacccggat 198361 catggccggg ctggtggtga tcatcccgct gtacgcgttg gcgatgatta tgtcgttcct 198421 gtccccgcag atcaccacca cggtgctcta cgggcagtcg aacggcacct acgagcatta 198481 ctttcaaacg ttcctgcgtc ccgacgatgt cttttggtcc ttcttggagg ccctcatcat 198541 cactgcgatc gtcatggtca gccactgcta ctacgggtac gccgccggtg gaggccccgt 198601 cggtgtcggc gaggccgtcg gccgatcgat gcgtttctcg ttggtctcgg tgcaggtcgt 198661 tgtcctgttt gcagcgttgg cgctctacgg tgtcgacccg aacttcaatc tcacggtgta 198721 gccgcatgac gacgccgggg aagctgaaca aggcgcgagt gccgccctac aagacggcgg 198781 gtttgggtct agtgctggtc ttcgcgctcg tagttgcctt ggtatacctg cagtttcgcg 198841 gggagttcac gcccaagacg cagttgacga tgctgtccgc tcgtgcgggt ttggtgatgg 198901 atcccgggtc gaaggtcacc tataacgggg tggagatcgg gcgggtagac accatctcgg 198961 aggtcacacg tgacggcgag tcggcggcca agttcatctt ggatgtggat ccgcgttaca 199021 tccacctgat tccggcaaat gtgaacgccg acatcaaggc gaccacggtg ttcggcggta 199081 agtatgtgtc gttgaccacg ccgaaaaacc cgacaaagag gcggataacg ccaaaagacg 199141 tcatcgacgt acggtcggtg accaccgaga tcaacacgtt gttccagacg ctcacctcga 199201 tcgccgagaa ggtggatccg gtcaagctga acctgaccct gagcgcggcc gcggaggcgt 199261 tgaccgggct gggcgataag ttcggcgagt cgatcgtcaa cgccaacacc gttctggatg 199321 acctcaattc gcggatgccg cagtcgcgcc acgacattca gcaattggcg gctctgggcg 199381 acgtctacgc cgacgcggcg ccggacctgt tcgactttct cgacagttcg gtgaccaccg 199441 cccgcaccat caatgcccag caagcggaac tggattcggc gctgttggcg gcggccgggt 199501 tcggcaacac cacagccgat gtcttcgacc gcggcgggcc gtatctgcag cggggggtcg 199561 ccgacctggt ccccaccgcc accctgctcg acacttatag cccggaactg ttctgcacga 199621 tccgcaactt ctacgatgcc gatccgctcg ctaaagcggc ggccggtggc ggtaacggct 199681 actcgctgag gacgaactca gagatcctat ccgggatagg tatctccttg ttgtctcccc 199741 tggcgttagc caccaatggg gcggcaatcg gaatcggact ggtagccgga ttgatagcgt 199801 cgcccctcgc ggtggccgca aatctagcgg gagccctacc cggaatcgtt ggcggcgcgc 199861 ccaatcccta tacctatccg gagaatctgc cgcgggtgaa cgctcgcggt ggcccggggg 199921 gcgcccccgg ttgctggcag ccgatcaccc gggatctgtg gccagcgccg tatctggtga 199981 tggacaccgg tgccagcctc gccccgtaca accacatgga ggttggctcg ccttatgcag 200041 tcgagtacgt ctggggccgt caggtagggg ataacacgat caacccatga aaatcactgg 200101 aaccgtcgtc aaactcggca tcgtctcggt ggtgctgctg ttcttcacgg tgatgatcat 200161 cgtgattttc ggtcagatgc gcttcgaccg gactaatggc tataccgcgg agttcagcaa 200221 tgtcagcggg ctgcgccaag gccagtttgt ccgtgcttcg ggggtagaga tcggcaaggt 200281 caaagcacta cacctggtcg acggtggccg tcgggttcgg gtggagttca atatcgatcg 200341 ttcggtgccg ttgtatcagt ccacgaccgc ccagatccgc tattccgacc tgatcggtaa 200401 ccggtacgtg gagctcaaac ggggtgaggg caagggggcc aacgatctgc tgccgccagg 200461 tggactcatc ccattgtccc gcacgtcacc ggccttggat ctggacgcgt tgatcggtgg 200521 tttcaagccg gtgtttcggg cgttggatcc cgcgaaggtg aacaacatcg ccaacgcgct 200581 catcaccgtc ttccaggggc aaggtggcac cataaacgac accctcgacc agaccgcgca 200641 actgaccagc cagatcgcgg agcgcgatca ggcgatcggt gaggttgtca agaacctgaa 200701 catcgtgctg gacaccacgg tcaagcatcg aaaagagttc gacgagacgg tcaataactt 200761 ggagaatctg atcactgggc tgaggaacca ctccgaccag ttggccggcg gcctcgcgca 200821 catcagcaac ggcgccggca cggtggccga cctgcttgcc gagaatcgca cgttggtgcg 200881 caaggccgtc agctacctgg acgctattca gcaaccggtc atcgaccagc gcgtcgagtt 200941 ggacgacctg ctccacaaga cgccgaccgc gttgacggcg ctcggacgcg ccaacggaac 201001 ctacggcgat ttccagaact tctacctctg cgacctccag atcaagtgga acggattcca 201061 agccggaggg ccggtccgca cggtgaagct ctttagccag ccgacgggta ggtgcacgcc 201121 gcaatgagaa cgctggaacc acccaaccga atgcgaattg ggctcatggg catcgtcgtt 201181 gcgctgctcg ttgtcgctgt gggccaaagc tttaccagtg ttcccatgct attcgcaaag 201241 ccgagctact acggccagtt caccgactcc ggcggactgc acaagggcga cagggtacgc 201301 atcgccggct tgggagtggg caccgtggag gggctcaaga tcgacggcga ccacatcgtg 201361 gtcaagttct ccatcggcac caacaccatc ggcaccgaga gccgcctagc catccgcacc 201421 gacaccatcc tgggtaggaa agtgctcgag atcgagccgc gcggcgccca agcgttgccg 201481 cccgggggcg ttttgccggt tgggcaaagc accaccccgt accagattta cgacgcgttc 201541 ttcgacgtca ccaaggccgc atccggctgg gacatcgaga cggtcaagcg gtcgctgaat 201601 gtgttgtcgg agaccgttga tcagacctat ccgcacctga gcgccgccct cgacggggtg 201661 gctaagttct ccgacaccat cggcaagcgc gacgagcaga tcacgcacct actagcccag 201721 gccaaccagg tggccagcat cctgggtgat cgcagtgacc aggtcgaccg cctattggtc 201781 aacgctaaga ccctgatcgc cgcgttcaac gagcgcggcc gcgcggtcga cgccctgctg 201841 gggaacatct ccgctttctc ggcccaggtg caaaacctta tcaacgacaa cccgaacctg 201901 aaccatgtgc tcgagcagct gcgcatcctc accgacctgt tggtcgaccg caaggaggat 201961 ttggctgaaa ccctgacgat cttgggcaga ttcagcgcgt cgttcggtga gacgtttgcc 202021 tctgggccct acttcaaagt gctgctggcc aacctggtgc cgggtcagat cttgcagccg 202081 tttgtcgatg cggcattcaa gaagcgtggt attagcccgg aggacttctg gcgcagcgcc 202141 gggctgccgg cataccggtg gcccgacccc aatggcaccc ggttccccaa cggtgcgccg 202201 ccgccaccac cgccggtgtt ggagggcacg cccgagcatc ccgggccggc ggtgccgccg 202261 ggatcgccgt gctcctacac cccgccggcg gacggtctgc cgcggccgtg ggatccgctg 202321 ccctgcgcta acctcactca aggtccattc ggtggccccg atttcccggc gccgctggat 202381 gtcgcgacgt cgccgccgaa cccagacggt ccaccgcccg ccccgggcct accaatcgcg 202441 ggacgtccgg gtgaggtgcc gccgaacgtt cccggcacgc cggtgccgat tccacaggag 202501 gctccccccg gggcacgcac gctgcccctc gggccggcgc ctggtccggc tccgcccccg 202561 gcggcgccag gcccgccggc accaccgggc cccgggccgc agttgccggc cccgttcatc 202621 aaccccggcg gcaccggcgg tagtggcgtg acgggaggta gcgagaattg agcaccatct 202681 ttgatatccg caacctgcgg ttgccgcagc tgtcgcgggc ctcggttgtc atcggatcgt 202741 tggtggtggt gctggcgctg gccgccggaa ttgttggtgt gcggctctat caaaaactga 202801 cgaacaacac ggtggtcgcc tacttcaccc aagccaatgc gctgtatgtc ggagacaagg 202861 tccagattat gggcctcccg gtcggttcga tcgacaagat cgaaccagcc ggcgacaaaa 202921 tgaaggtgac tttccactac cagaacaagt acaaggtgcc tgccaatgcc tccgcggtga 202981 tcctcaaccc caccttggtg gcgtcgcgga acattcagtt ggagccaccc tacagaggtg 203041 gtccagtgct ggccgataat gcggtgatcc cggtcgagcg cacccaggta ccgacggagt 203101 gggacgagct gcgggacagc gtttcgcata ttatcgacga gctcggcccg acacctgagc 203161 agcccaaggg gccgttcggc gaagtcatcg aggcattcgc cgacgggctg gccggcaagg 203221 gtaagcaaat caacaccacg ctgaacagcc tgtcgcaggc gttgaacgcc ttgaatgagg 203281 gccgcggcga cttcttcgcg gtggtacgca gcctggcgct attcgtcaac gcgctacatc 203341 aggacgacca acagttcgtc gcgttgaaca agaaccttgc ggagttcacc gacaggttga 203401 cccactccga tgcggacctg tcgaacgcca tccagcaatt cgacagcttg ctcgccgtcg 203461 cgcgcccgtt cttcgccaag aaccgcgagg tgctgacgca tgacgtcaat aatctcgcga 203521 ccgtgaccac cacgttgctg cagcccgatc cgttggatgg gttggagacc gtcctgcaca 203581 tcttcccgac gctggcggcg aacattaacc agctttacca tccgacacac ggtggcgtgg 203641 tgtcgctttc cgcgttcacg aatttcgcca acccgatgga gttcatctgc agctcgattc 203701 aggcgggtag ccggctcggt tatcaagagt cggccgaact ctgtgcgcag tatctggcgc 203761 cagtcctcga tgcgatcaag ttcaactact ttccgttcgg cctgaacgtg gccagcaccg 203821 cctcgacact gcctaaagag atcgcgtact ccgagccccg cttgcagccg cccaacgggt 203881 acaaggacac cacggtgccc ggcatctggg tgccggatac gccgttgtca caccgcaaca 203941 cgcagcccgg ttgggtggtg gcacccggga tgcaaggggt tcaggtggga ccgatcacgc 204001 agggtttgct gacgccggag tccctggccg aactcatggg tggtcccgat atcgcccctc 204061 cgtcgtcagg gctgcaaacc ccgcccggac ccccgaatgc gtacgacgag taccccgtgc 204121 tgccgccgat cggtttacag gccccacagg tgccgatacc accgccgcct cctgggcccg 204181 acgtaatccc gggtccggtg ccaccgacgc cggcaccggt gggggcgccg ttgcccgctg 204241 aggcaggagg gggtcaatga tgagcgtgct ggcgcggatg cgggtgatgc gccaccgagc 204301 ctggcagggg ctggtgttgc tggtgctcgc actcttgctg agttcgtgcg gctggcgcgg 204361 catctccaat gtggcgatcc ccggcggccc gggcaccggc ccgggctcct acaccatcta 204421 cgtgcagatg ccggacacgt tggcgatcaa cggcaacagt cgggtcatgg tggccgacgt 204481 ctgggtcgga tcgatccgcg cgatcaagtt gaagaactgg gtggccacgc tgacgctgag 204541 cctgaagaag gacgtcacgc taccgaaaaa tgccaccgcc aagatcgggc agaccagcct 204601 gctgggttcg cagcacgtcg agctggccgc gccgccagat ccgtcgccgg tgccgctgaa 204661 ggatggtgac accatcccgt tgaagcgctc ctcggcctat cccaccaccg agcagacgct 204721 ggccagcatc gccaccttgt tgcgcggcgg cggcctggtg aacctcgaag ggattcagca 204781 agagatcaac gccatcgtga cggggcgggc ggaccagatc cgggcctttc ttggcaagct 204841 cgacaccttc accgacgagc tcaaccagca acgcgatgac attacccgcg ccattgattc 204901 caccaatcgg ttgttggctt atgtgggcgg tcgttcggaa gtcctcaatc gggtgctcac 204961 cgacctaccg ccattgatca agcactttgc ggataagcag gaactgttga tcaacgcttc 205021 cgatgcggta ggccggctca gccagtccgc cgaccagtat ctttcggctg cccggggcga 205081 tctgcaccag gacctgcagg cgctgcaatg cccgctcaag gaactgcgtc gagccgctcc 205141 gtatctggtg ggtgcgctca aattgatcct cacccagccc tttgacgtcg acaccgtgcc 205201 gcagctggtg cggggcgact acatgaactt gtcgctgacg ctggacctga cctacagcgc 205261 catcgacaat gcgttcctta ccgggaccgg attctccggt gcgttgcgcg ccctcgagca 205321 gtcttttggc cgcgatcccg agacaatgat tcccgacatc cggtacacac cgaaccccaa 205381 cgatgcgccg ggcggcccgc tggtagaaag gggaaatcgc cagtgctgac tcgcttcatc 205441 cgacgccagt tgatcctttt tgcgatcgtc tccgtagtcg caatcgtcgt attgggctgg 205501 tactacctgc gaattccgag tctggtgggt atcgggcagt acaccttgaa ggccgacttg 205561 cccgcatcgg gtggcctgta tccgacggcc aatgtgacct accgcggtat caccattggc 205621 aaggttactg ccgtcgagcc caccgaccag ggcgcacgag tgacgatgag catcgccagc 205681 aactacaaaa tccccgtcga tgcctcggcg aacgtgcatt cggtgtcagc ggtgggcgag 205741 cagtacatcg acctggtgtc caccggtgct ccgggtaaat acttctcctc cggacagacc 205801 atcaccaagg gcaccgttcc cagtgagatc gggccggcgc tggacaattc caatcgcggg 205861 ttggccgcat tgcccacgga gaagatcggc ttgctgctcg acgagaccgc gcaagcggtg 205921 ggtgggctgg gacccgcgtt gcaacggttg gtcgattcca ctcaagcgat cgtcggtgac 205981 ttcaaaacca acattggcga cgtcaacgac atcatcgaga actccgggcc gattttggac 206041 agccaggtca acacgggtga tcagatcgag cgctgggcgc gcaaattgaa caatctggcc 206101 gcacagaccg cgaccaggga tcagaacgtg cgaagcatcc tgtcccaggc ggcccccacc 206161 gccgatgagg ttaacgcggt attcagcggt gttcgcgatt cgctgccaca gaccctggcc 206221 aatcttgagg ttgtgttcga tatgctcaag cgctaccacg ccggcgtgga gcaattgttg 206281 gtgttcctcc cacagggtgc cgcgatcgca cagaccgtac tcacgccaac tccgggtgct 206341 gcccagctgc cgctcgcgcc ggcgatcaac tatccgccgc cgtgcttgac gggttttctt 206401 cctgcatcgg agtggcggtc tccggccgat accagtccca ggccgttgcc gtcgggaacc 206461 tattgcaaga ttccccagga tgcccagctg caagtccggg gggcgcgcaa cattccctgt 206521 gtcgatgtcc cgggcaaacg agcggcgacg ccgaaggagt gccgcagtaa ggacccgtac 206581 gttccgctgg gtaccaaccc gtggtttggt gatccgaacc agattctcac ctgcccggca 206641 cctggagcgc gctgcgatca gccggtgaag ccgggtttgg tgattccggc gccctcgatc 206701 aacaccggtt tgaatccggc gcccgccgat caggtgcaag gaacgccccc gccggtcagt 206761 gacccgttgc aaagaccggg ttcgggtact gtgcagtgca acgggcagca gcctaacccg 206821 tgcgtctaca ctccaacatc gggcccgtcg gcggtctata gcccggccag cggtgaactg 206881 gtggggccgg atggtgtcaa gtacgccgtc gcaaactcga gcacaacagg agacgacgga 206941 tggaaggaga tgctggcgcc ggccagctga accctgccga tgcgaataag tcgtcgtcta 207001 cggaggtgaa ggcggcggat tcggcggaat ctgacgccgg agccgaccag actggcccgc 207061 aggtgaaggc ggcggattcg gcggaatctg acgccggaga gctcggcgag gacgcgtgcc 207121 cagaacaggc cctcgtcgag cggcgcccgt cgcggttgcg gcgaggctgg cttgttggca 207181 ttgcggcgac gctgctcgcg ttggccggtg gccttggcgc agcgggttat tttgcgttgc 207241 gctcacacca ggaaagccaa tcaatcgcgc gcgaggacct tgcggccatt gaggccgcta 207301 aggattgcgt tgcggccacg caggcacccg atgctggggc gatgtcggct agcatgcaga 207361 agatcatcga gtgtggcacc ggtgatttcg gtgcccaggc gtcgttgtac accagcatgc 207421 tcgtcgaggc gtatcaagcg gccagcgtcc acgtgcaagt gaccgatatg cgcgcggcgg 207481 tcgagcgcaa caacaatgac gggtcggtcg atgttctggt ggcgctccgg gtcaaggtgt 207541 ccaacaccga ctcggatgcc catgaagtcg gctaccgtct tcgggtccgg atggcactgg 207601 atgagggccg ctataagatc gccaaactcg accaggtgac gaagtgacgg tggtggtcga 207661 gaagacgccg accaccctgc cccaggcgac accgaacggt gcagcgccct ggcatgttcg 207721 ggcgggcgcc ttcgccatcg acgtgctgcc cgggctcgcc gtggcggcga ccatggcgtt 207781 gacggcttta acggtgccgc cgggcagcgc gtggcggtgg ttatgcgctt gtctgctcgg 207841 attgaccatt ctccttctgg ccgttaaccg gttgttgttg ccgacgatta ccggatggag 207901 tcttggccgc gctcttaccg gcatccgggt ggttcggcgt gacggctccg ccatcggtcc 207961 gtggcggttg ctggtccggg atttggcgca cttggtggac accctctcgc tgtttgtggg 208021 ttggctgtgg ccgctgtggg attcgcggcg acgcaccttc gccgacctgt tgttgcgcac 208081 tgaggtgcga cgtgtcgaac cggtgcagcg gcccgcggtg atacggcgac tgacggcggc 208141 ggtggcattg gcggcggcgg gcgcgtgcgc gagcgcaacc gcggtgggcg ctgcggtggt 208201 gtacgtcaat gaatggcaaa ccgatcacac tcgcgcgcag ctcgcaacgc ggggcccgaa 208261 gctcgtggtc gacgtcctga gctacgaccc cgaaacggtg cagcgtgatt tcgaacgggc 208321 gcgatcgctg gccaccgaca ggtaccgccc gcagctgagc atccaacagg attcggtgcg 208381 cgagtcggga cctgttcgta accagtactg ggttaccgac agcgcggtgc tgtcggcgac 208441 accagctcag gcgaccatgc tgttgttcat gcagggtgaa cgcggtacac cacccaatca 208501 gcggtatatc agcgcaactg tgcgggcgat cttccaaaaa tcgcgcgggc aatggcgcct 208561 cgacgatctg gcagtcgtga tgaaaccccg acaacccacc ggcgaaaaat gagcccccgt 208621 cgtaagtttg aacccggcga gggggcgctg ctggccccgc agtcaatcga accgtcgcgg 208681 cgatggggtt tgccgctggc tctgaccgca tccgctgtgg ttatggccgc ggcgatctca 208741 gcctgtgcgc tcatgcggat ctcccatgaa tcgcaccagc gagcagcgca caaggatatc 208801 gtgatgctca gtgatgtccg atctttcatg accatgttca cgtcaccgga tccgtttcac 208861 gccaacgaat atgcggagcg ggtgctgtcc cacgccacgg gcgacttcgc caagcagtac 208921 cacgaaagag caaacgatat cctgattcgc atctccgggg tggaaccgac cacaggaacg 208981 gttctagacg cgggcgtaca gaggtggaac gaggatggta gtgccaacgt gctggtggtc 209041 acccagatca cctcgaaatc cgcggacggc aagcgggtgg tctcgaacgc caatcgttgg 209101 ctggtaacgg ctaagcagga aggtaacgag tggaagatca gcagtctgct tccggtgatc 209161 tgacccaaaa gtccgttgcc aacggagagt ccaccgacac ggcatccgca gccaccgagg 209221 gccaccgggg cgagatcgac gccgcgggag agccggacga acgcggtgcc gccgtggctg 209281 acagccaagc tgacgaggat gattcggccg cgacggctgc caggggcggc aagacacggg 209341 caagacgatc gcgtggcagg cggttagcga tcacggtcgg cgtggccgct gcgttgttcg 209401 tgggctcggc agcgttcgct ggtgcgacgg tggagcccta cctctccgag cgcgccgtgg 209461 tggccaccaa gctcatggtc gcgcggaccg ccgccaatgc gatcacgacg ttgtggacct 209521 acacgccgga gaacatggac accctggccg atcgggccgc gaattacctc agcggtgatt 209581 tcgcggctca gtaccgcaga ttcgtcgacc agatcgccgc agcaaacaaa caggccaaga 209641 ttaccaacga taccgaggtc accggtgctg ccgtggaatc gctgagcggc cgggatgccg 209701 ttgccatcgt ctacaccaac accacgacca ccagtccggt gaccaagaac atcccagcat 209761 tgaagtatct gtcctaccgg ctgttcatga agcgttatga cgcgcggtgg ctggtgacca 209821 ggatgacgac catcacctcg ctggatttga cgccgcaggt gtagcgggac cgagcccgcc 209881 ggcgctgcga agccttagtt gaacgccagc cagctgggca gcgcccgctc atgggagtca 209941 cagagcacct gacgggtgtc gcacgatcct ttcggcgacc cagcgccggc ccacatgccc 210001 ccggtatcac ggcgcagaac gatcgccgat gaccccccgc cgtcgagcag aatcgcggtg 210061 tcactaccca ggccgcggaa caggtcttgg atgttgtccg gggtgtagtt gccgccctgg 210121 aagatgtaca tctcgtcctt ctgcttcgca taggcaagcg ccgttcgcgc ggcgctggga 210181 ccgccgtcgt ggagctggcc ggtattgccg ggggataaca gcccgattcc ggccacggcg 210241 acgaaccgtg cattcttgtt gagcaagtcc tcgatcaccg gagtggcaag atcgtagtcc 210301 tgtctgcttt tgggccgcaa aacatacggt gcaccaccga ccggaaggat catcgtcgtc 210361 agcgagctcc acaactcatt tcctccggaa aggccctgct ttccggcgta ggcgacggtg 210421 ccggtgaccg cttggttggc gcgtccttgt ccgcgggtgt tgtccacgta ggcgcccagc 210481 ggtgagctgc agccggtcga ccgccagctg cccccctttt gtccgcgaac gtcgaagaag 210541 ttggcgttga ccgcaatggt gggtcgcccc atacgctgcc acgccttaag cggcgggtag 210601 atctcggagg cttgccacaa gccttcaccg gtgtgagcac ctgggttgtg ttcgcagcgc 210661 gcctggtctc cagtgtgggt gtctaccagt agatgtggtg aaagccgttg ggaggcattc 210721 ttgatgatca tcagatggcc gccgttgttc atctcgtacc agtgaccacc tgcgttgagc 210781 agcggcatag gatgaccgcc gccgaagttg tacaccaggt atgagcccct ggtggtggct 210841 atcgcttggg cgagcatctc gcgcccgtcg gcggcgcggg cggccggctg cccggtggtg 210901 caggcgaggg cggcgcacac cgccaacgcg gcgtagcaag ccgtcaatcg gcgcaggctg 210961 gcagtcggtg tcagcacagc aaccctctcg gcccgaatcc acatgcaacc atcccagcat 211021 taggcacact gatcacactg tcaacttcag taacagctgc gtgacggttc ggccgcgttc 211081 gaattacggt tgttcgcttg agctttcgcg tgcgctttgg cgagcttggt attgagcttg 211141 gtgttccacg gcgatcgcca tctcgaccgc tcccggaatt cggtggaagc tgctgcggtc 211201 gtacaggtgt gtgatgaatc cacccagcaa cagcccgatg atcagcccga tcgacgtcat 211261 ggtcagggcc tgggacaggc cggcgtcggc gtttccgttc agatacagca gtgaccgcac 211321 acctaggaac acctggtgca tcggctcgaa ttgagccaac cagcggaaga acgctggtac 211381 ggcttccagc gggacggtcg cgcccgccga cggcaatccg aggatgacga agatcaacat 211441 gctgaccaac aggcccatcg agcccagcac cgcgatcagc gagctggacg tgacgccgac 211501 cgctatgatc gcgaatactc cgtagagcca tacttgccac ccgagcggaa tcggcatgcc 211561 caggccgtgg gcgatcgcca ggtagacacc cgaggtgagc aacgccagca ccaccatcac 211621 cgcccacttg accaacagcg tacggaagcg agagatgttg acctgctcgg cgaagcgata 211681 gacggggccg aattcggctg gtacatagcc aagcatcgag tccaccaggg tgctcaccac 211741 gatgctgccg gtaaagcccg ccaatagcag caagagggcg tagtaaaacg ccgacagccc 211801 gttgccggtg ccgttgggca gtgggttata ggcggtggat ttgacatcga tgggactggc 211861 cagcccggcc gccgccgccc cggccagtgc cacaccgccg gtctgggccg ctacctccgc 211921 ggtaagtcgc tcgcccactt tgccgttgac caccgtcagt gcccgggtca gcgtctggcc 211981 ggcgatgcta gctgccagcg tgcccgcccg cggattcgtt gagatcgtga tcgcgggccg 212041 gtctgtgcgg gttggcgtca ccgcactcgc cccgaagtcc cgtagctgcg acgagaaggt 212101 cggcggtatc agcgccgagc cgtacaccgc cgcggtgtcg agcagccgcc tggcctcgtc 212161 cggcgaaacc actcggatgt cgaacttgtt cttgtccaag ccggaaacca gaccgtcgac 212221 aatctgctgg cccgcgggcc cggcgtcctc gttcaccaac gcgattggga aatgccgcaa 212281 attggtcatg gggtttagga tgccgcccag atagagcgcg gccagcgccg acatcagggc 212341 caacgtggtg gcgatcggtg ccatccagaa acgcaccgtc cgaatcgctt tgacgttccg 212401 cttggggttg ggtgcggcgg ggcgcggctg cgcttgagac atgcgagctc ctgtctgtcg 212461 tggccactct atgttgccga atcgcccagc ttcgcgtgca tctcccagat cagtacttcc 212521 gacggctcat tggcggtcag gccgcgggcg tcagcgtcgg tgaaccgcac cgcgtccccg 212581 tcggcaagct cgccgccacc ttccagagtg aggcggccgt aggcgacgaa cagatgcagg 212641 aagggtgcgc agggcaggct gaccgtagcg ccgggccgca gccgcgcgcc gtgcaacgag 212701 gcgctgctgt tatgcagggt gagcgctgcg tcttgcccgg gtatgcccga cgcgatggtt 212761 accaggccgg cgcgcaacag ttcgtcgtct atctcctgct gttggtagct ggcagtgatg 212821 ccggttgcat cgggtattac ccacatctgc acgaaatgca ccggctcggt agcagaatcg 212881 ttcatttccg aatgcaagat tccggtgccg gccgacatgc gttgggccag accgggatag 212941 atcactccgc tattgccggc ggaatcctgg tgtctgagcg ctccccgcag cacccaggtc 213001 acgatttcca tgtcacggtg tggatgggga tcaaaacccg aagccggttc catttggtcg 213061 tcgttgttca ccaacaggag cccgtggtgg gtgttgtcgg gatcgtagtg gtcgccgaat 213121 gagaacgaat gccgggattt cagccaggac gtcgtggtga ccgcccggtc ggccgcacgc 213181 cttatctcga cggtggcggt catgacgtca cgttcgccat cacagcgaat cgggcaggcc 213241 gaatttcggg aacaaggtgg tgtcgaggaa ggccaccacg tgtgacacgc ggtcggcggc 213301 catgtccagc acgtgtagct gaaaaggcag gtgcacgtca ccggcacgca tgtacatggc 213361 cgcggcgggc tggccgttgg cgatcaacga aatcaggcgc atatcgccag gcgaataggc 213421 ggggcactgt tggtgaatga gggtgacgat ggcctgtgcg ccctggtacc agccggtata 213481 cggcggcatt tcccagatcg cctcggcggt gaacagctcg accaaccggt cgatgtcata 213541 agcctcgaac gcggcgatat agcgggccaa caggtcttgc gcctcgggtg aatccggcgc 213601 ggacaaccgg tcggcggcgc tgggccggac cgtctgcagc tgagagcggg cccgctgcag 213661 caggctattg acggcgacgg tgctggtacc gatcgcgtcg gccacctcgg ccgatttcca 213721 ctgcagcacg tcgcgcagca gcagtacggc tcgctgccgg ggtgagaggt gctgcagagc 213781 cgccacaaag gccaaccgca ccgattcccg gttcccgacg atcgttgagg gatcagcagg 213841 gtcgtccgtc acgtccggca gcggctccag ccaggacacc tctcgacgtt ccaccaactc 213901 cccggacgga tcggcactcg gccgcccgag ccccgtcggc aacggccggc gtcgacggcc 213961 ctccaacgcc gtcaggcagg tgttggtggc gatccgatgc agccaggtgc gtagcgagga 214021 cttgcccgcg aagccctcat aggccttcca ggcccgcagc agcgtctcct gaacaaggtc 214081 ttccgcgtcg tgcagcgagc cagtcatgcg atagcagtgt gcgagcagtt cacgccggta 214141 gggctcggtg tgggcggaga agtccccgcg ccgttcgtcg gcgggctcgc ggccagagtt 214201 ttctgcgagc acactcacgt caatgagcct acgcagagtc tccgacactc tcaccggagc 214261 agccgttacg ctcccggtaa tgactaccac ccggactgaa cggaatttcg cgggcatcgg 214321 cgatgtgcgc atcgtctacg acgtctggac gccggacacc gcgccgcaag cggtggtcgt 214381 gctggcccat ggtctgggcg agcatgcccg ccgctacgac catgtcgcgc agcggctcgg 214441 cgcggccggc ctggtcacct atgcgcttga ccaccgcggg catggccgct cgggtggcaa 214501 acgggtgcta gtgagagaca tctccgagta caccgctgac ttcgacaccc tcgttgggat 214561 cgccacccgg gaatatcccg ggtgcaagcg catcgtgctc gggcacagca tgggcggcgg 214621 cattgtgttc gcttacggtg tcgaacgtcc agacaactac gacctgatgg tgctttcggc 214681 gccggcggtg gcggcacagg acctggtgag cccggtagtg gcggttgccg ccaagcttct 214741 gggcgtcgtg gtgcccggcc tgccggtgca ggaactggat tttactgcca tctctcgcga 214801 ccctgaggtg gtccaggctt acaacaccga cccactcgtg caccacggac gggttccggc 214861 cgggattggc cgcgcgctgc tgcaggtggg cgagaccatg ccgcggcgag caccggcatt 214921 gaccgcgccg ctgctagtgc tgcacggcac cgatgaccgg ctgatcccca tcgagggcag 214981 ccgtcgcctg gtcgaatgtg tgggatcggc cgacgtgcag ctgaaggagt atcccgggct 215041 gtaccacgag gtgttcaacg agccggagcg caaccaggtg ctcgacgatg tggtcgcctg 215101 gctcaccgag cggttgtagg ccgagccgac ctgtcgcagc cctccactag ttttggcgcc 215161 atgaccaacg acaagatgct ggcccgcatc gcagccctgc tgcgccaggc cgaaggcacc 215221 gacaacccgc acgaggccga cgcgttcatg agcaccgcac aacggttggc cacggcggca 215281 tccatcgacc tggcggtggc ccggtcgcac gcgggcaacc gttcacccgc gcaggccccg 215341 acacagcgca ccatcaccat cggggcggcg ggcacccgcg gattgcggac ctatgtgcag 215401 ctcttcgtgc tcatcgcggc ggccaacgac gtgcgctgcg acgtggcatc gaattcgacg 215461 ttcgtgtacg cctacgggtt cgccgaggac atcgacacca gccacgccct atacgccagc 215521 ctggtggtcc agatggtccg ggcatccgac gcctacctcg cctcgggagc gcaccggccc 215581 acgccgacga tcaccgcccg actcaacttc cagctggcgt tcggcgcccg ggtcggccag 215641 cgcttggccg atgcccgaga gcagactcgg caggaagcca ccaaggaccg tgatcgtccg 215701 cctggtaccg caattgccct gcgggacaag gacatcgagc tgcatgagta ctaccgtcgt 215761 tcctctaagg cgcgcggcgc ctggcgagcc agccgggcca ccgcgggata ctcgtcggcg 215821 gcacggcgcg ccggtgatcg agcgggacgg caagcacgac tcgggaacaa ccccgagctg 215881 cccggggcac gggccgcgct gggccggtga tcggcgcgga cgttccgcgg gattcccagc 215941 gtgccagggt gtacgcggcc gaggcgttcg tccggacctt gttcgaccgc gtcaccgcac 216001 acggctcacc gacggtggag ttcttcggta cccagttgac gctgccccca gaaggtcggt 216061 tcggttcggt ggcatcggtg cagcgttatg tggacgacgt gcttgcgcta ccggcggtag 216121 ggcagaactg gccgacggtg tcgccggtgc gcgtgcgggc gcgccgggcg gccaccgcgg 216181 cgcactatga aaaccatggc ggcacaggca ctattgcggt acccgaccgg cacaccgccg 216241 gttgggcgat gcgcgagttg gtcgtgctac acgaagtggc gcatcatttg tgccaggtgc 216301 caccgccaca cggacccgag tttgtggcga cggtgtgcac cctgacagag ctggtgatgg 216361 gacccgaagt tggtcacgtg tttcgcgtcg tctacgcgca ggagggcgtg cgctgaacga 216421 gctagacgcc gacctgcggg cacgtgaggt cgaggcccag atgaccgacg acgagcgatt 216481 ctcactgttg gtcggcctga ccggggccag cgatctgtgg ccggtgcgcg atgaacgcat 216541 cccacagggc gtgccgatgt gtgccgggta tgtgccgggg attccccggc tcggggtccc 216601 ggccttgttg atgagcgatg ccggtctggg cgtcaccaac cctggctacc gccccggtga 216661 caccgctacg gcgctgcccg ccggccttgc cctagcggcc agctttaacc cggtgctggc 216721 ccggtcctcg ggcaaagcga tcggccggga ggcgcgcagt cgcgggttca acgtgcaact 216781 ggccggcgca atcaatctgg cgcgcgaccc gcgtaacggc cgcaacttcg agtacctttc 216841 cgaggacccg ttgttgagtg ccacgatggc cgcggagtcg atcatcggga ttcagcagca 216901 gggtgtcatt gcgacgacga aacacttctc gctgaactgc aacgaaacca atcggcactg 216961 gctggacgcg gtcatcgatc ccgacgcgca ccgcgagtcg gacttgttgg cgttcgagat 217021 cgtcatcgag cggtcgcagc ccggcgccgt gatggcggcg tacaacaagg tcaacggaga 217081 ttacgctgcc ggcaacgacc acttgctcaa cgacgtgctg aaaggtgctt ggggataccg 217141 cggttgggtg atgtcggatt ggggcggaac acccagctgg gagtgcgcgc tggccggcct 217201 ggaccaagag tgcggtgcgc agatcgatgc agtgctgtgg cagtcggaag cattcaccga 217261 ccgcctgcgt gccgcctacg ccgacggcaa tctacccaag gggcgcctgt cggacatggt 217321 acggcggatc ctgcggtcga tgtttgccgt cggaatcgac cgatggaaac cagcgccggc 217381 gccggacatg aacgcgcaca acgagattgc cgcacagatg gcgcggcaag gaatcgtgct 217441 gctgcaaaac cgagggctgc tgccgctcgc tcccgaatcg gccgggcgta ttgccgtcat 217501 cggcggctat gcacacctcg gtgtgccagc cggttacggt tcgagcgccg tcaccccgcc 217561 ggggggctat gcgggcgtga taccgatcgg tgggtctggc ttggcagccg ggttgcgtaa 217621 tctctacctg ctgccgtcaa gcccgctgag tgagttgcga aagcggttgc ccaacgcgca 217681 gttcgagttc gatcctggca tcaacccggc ggaggcggtg ctggctgcgc ggcgagcaga 217741 catcgcgatc gtgttcgcga tccgtgccga aggagagggc ttcgacagcg ccgatctgtc 217801 gctgccatgg ggtcaggatg cgctgatcgc cgcagtcgcg tccgccaacg cgaataccgt 217861 tgtggtgctt gagacgggca acccggtgac catgccctgg cgcgactcgg tgaacgccat 217921 catgcaggcc tggtatccgg gccaggcggg tggccaggcc gttgcggaga ttgtgaccgg 217981 gcaggtgaat ccttcgggcc ggctgccgat caccttcccg gtcgatctcg gtcagacgcc 218041 acgctcgcaa ccgcgcgagc tcggtgcccc gtgggggaca tcgaccacga tccactacac 218101 cgagggcgcc gatgttggtt accgctggtt tgccagcaca aatcagaccc cgatgttcgc 218161 gttcggtcac ggcttgtcct ataccagttt cgagtatcgt gacctggtgg tgacgggcgg 218221 ccacaccgtg cacgccagtt tcagcgttac caacacgggc gaccgcagcg gggcggatgt 218281 cccgcagctg tatatgatcg cagctcccgg cgaatcgcgg ttgcggttgc tgggattcga 218341 gcgggtcgag ctcgaacccg gccagactcg gcgggtaagg atcgaggcgg acccgcgact 218401 gctcgcccgc tacgacggcg aggccagaag ctggcgcatc gagccgggcg gttacacggt 218461 ggcggtgggc gcttcggcgg tagcgctgaa gctggcagcc aaggtcaagc tggccggccg 218521 tgggttcggg cggtgacggg ccggcccagc gaggcccgta cccacgaccg gcatgatagg 218581 tctacttgac cggggccaat tcgtcgccgc aggtgcagcg gtaggcgtca ccggcgccag 218641 cacagtggca tgggacttcg atgcgaaccc gacagccaca gccctcgtgg ctgcaggtca 218701 gcaaggtccc agcctcgtag ttcgtcattc gtatcaccct catccgtgtc ggggatcccc 218761 gaggaatccc aggtggtcag ctgtcggtaa tccagaacag ctacttaaat atatacccta 218821 tacgggtatc tggtaaaccc ccaggccggt gggcggttgc ctgctggcgc gcgacggtcg 218881 gtggtcgcgc tagcgtttgg gcatggacca gcaacccaac ccgcccgacg tcgacgcatt 218941 tttggacagc acactggtcg gcgacgatcc ggcgttagcc gcggcattgg cggccagcga 219001 cgcggccgag ttaccccgca tcgcggtgtc ggcacagcag ggcaagttcc tgtgcctgct 219061 ggccggtgcc atccaggcgc gccgcgtcct cgagatcggc acactcggtg gcttcagcac 219121 catttggctg gcgcgtggcg cgggcccaca gggacgggtg gtcacgctgg aataccagcc 219181 caagcacgct gaggtcgccc gggtgaacct gcagcgagcg ggcgtcgccg atcgggtgga 219241 ggtggtcgtc ggtccggcgc tggacacgtt gccgacgttg gccggtggcc cgttcgacct 219301 ggtgttcatc gacgccgaca aagagaacaa cgtcgcatat attcagtggg cgatccggtt 219361 ggcccggcgc ggcgcagtga tcgtggtgga caacgttatt cgtggcggcg ggattcttgc 219421 tgagtccgac gatgccgacg cagtggcggc acgtcggacg ctgcaaatga tgggtgagca 219481 ccccggccta gacgccacgg cgatccagac cgtcgggcgc aagggctggg acggtttcgc 219541 cctcgctttg gtgcggtagc cgctggtccg gcgcccaatt ttcgttgctg gcatcccgaa 219601 aacgggcgta atcttggagc agatggatgg gtggcagcga gcccaaaagt tttgctgcat 219661 aacagaaagg ttgcaaaatg agtacagtcc attcatcaat tgatcaacac cctgatttgt 219721 tggctctgcg tgccagcttc gaccgcgccg ccgagtcgac gatcgcgcat ttcacattcg 219781 gtctggccct gctggcgggc ctgtatgtgg ctgcatcgcc gtggatcgtc ggcttcagcg 219841 ccaccagagg gctgccaacg tgtgacctta tcgtggggat cgcggtcgcg tacttggcgt 219901 atgggttcgc gtcggccctg gatcgcacac acggcatgac ctggacgcta cccgtgctcg 219961 gtgtgtgggt cattttctcg ccgtgggtgc taccaggggt cgcggtgacg gctggcatga 220021 tgtggtcgca catcatcgca ggtgcggtgg tagccgtcct gggcttctac ttcgggatgc 220081 gcacgcgggc cgcggctaac caaggatagt tcgaagttcg cgagccagag ggcaactcgg 220141 gaatgtcctg gccggggcgg tcccggccag gcagcggcta gttgcggcta gccgcagacc 220201 gcgccgaccg cggcagagct gaccagcttg acgtacttgg acagtacgcc agtagtgtag 220261 cgcggcggtg gaggactgaa atcctgttgt cgggacgcga attcggccgg atcggccaac 220321 acatcgagaa cgcggccggc cacgtcgagc cggatccggt cgccgttgcg cagaagtgcg 220381 atcggtccgc cgtcgaccgc ctccggtgcg atgtggccaa cgcacaggcc ggtggttcca 220441 ccggagaacc ggccgtcggt cagcagtaga acatctttac cgagtcctgc gcctttgatc 220501 gcgcctgtga tggcgagcat ttcgcgcatc ccggggccgc ccttgggtcc ttcgtaccgg 220561 attaccacgg cgtcgcccac ggtaatggtg ccatcctcaa gggcgtccag cgcagcgcgc 220621 tcgccgtcga aaactcttgc ggtgccttcg aatacgtcgg aatcgaatcc ggcggtcttg 220681 accaccgcac cttcgggtgc cagcgatccg tgcaggatgg tgatgccacc gctcgggtgg 220741 atcgggtttg ccaacgcacg tagcaccttg ccatctggat ccggcggggt gatggcagcc 220801 agattctcgg ccatggtgtg accggtaacc gtcaggcagt cgccgtgtag cagaccggcg 220861 tccagcagcg ccttcataac caccggcaca ccgccgatgt gatcgacgtc ggacatcaca 220921 tggcggccga acggcttgac atcggccaaa tgcggcaccc ccgacccgat ccggctgaag 220981 tcctgaagcg atagtgcgac gttggcctcg tgggcgatgg ccagcagatg cagcaccgcg 221041 ttggtcgagc cgccgaacgc cattaccacc gcgatggcgt tctcgaacgc ctccttggtg 221101 aggatgtcgc gggcggtgat gccgcggcgc agcagctcga cgacggcctg accgctgcga 221161 cgcgcgaacc cgtcgcgccg gcggtcggtc gccggcggtg ccgcgctgcc cggcaacgac 221221 atgccgagcg cctcggcggc gctggccatg gtgttagcgg tgtacatgcc gccgcatgcc 221281 ccttcgccgg ggcagattgc ccgctcgatg gcatcgacgt cggcgcgact catcaaaccg 221341 cgagagcacg ctccgaccgc ctcgaaggcg tcaatgatgg tgacgtctcg ttcgctaccg 221401 tcggagagct tggcccggcc gggcaaaata gagcccgcgt agaggaacac cgccgccaga 221461 tccagtcgtg cggcggccat cagcattccg ggcagcgatt tgtcgcatcc ggccagcagc 221521 accgaaccgt cgagtcgttc ggcctgcatc acgacttcga cgctgtcggc gatcacctca 221581 cgggaaacca gcgagaagtg catcccctca tgacccatgg agatgccgtc cgaaaccgag 221641 atcgtgccga actcaagcgg atagccgccg gccgaaaaca ccccctcctt gaccgcgttg 221701 gccagccggt ccaatgagag attgcacggc gtgatttcgt tccacgacga cgcgaccccg 221761 atctgtggct tcgcgaagtc ttcgtcgtcc atgcccaccg ccctcaacat gccccgggca 221821 gcggccttct ccaggccgtc ggtgacgtct cgactgcggg gcttgatgtc ggcgaccgtc 221881 gagacggaag cggcttcgtc ggtggtttgc ggcattgttc aagtatgcgg cccaaggatg 221941 cgctcgccgc ggcacggttg ccaaattcta ggtccgatac cccgctgggg tacaagatat 222001 gatgggtagc atgcctgggc cctgctttcg ggttggcgag tatctctgga gatggcgagt 222061 aaatgacagc agcacacggc tacacgcagc aaaaggacaa ctacgccaag cggttgcgtc 222121 gcgtcgaggg gcaagtgcgc ggcatcgcgc gaatgatcga ggaagacaag tactgcattg 222181 acgttctgac ccagatcagc gccgtcacca gtgcgttgcg gtcggtggcg ctgaacctgc 222241 tggacgagca cctgagccac tgcgtcaccc gtgccgtggc cgagggcggt cctggggctg 222301 acggcaagct ggcagaggcc tcggcagcaa tcgcgcgcct ggttcgttcc tgatcgccgc 222361 gtgttgaagc gcaaacctgc ccaccacccg ttggtgcggt gcgtacggta ggggcagcgt 222421 aatcgtgccc tgaacgaccc cgaaccatcg aacttcgcgg ccgattccgc gcaggacgcg 222481 atgactgccc caaccggaac ctccgccact acgacgcgac cgtggacgcc acggatcgcc 222541 acgcaactgt ccgtgctggc ttgcgcggcc tttatctatg tcaccgccga aatcctgcca 222601 gtgggcgcgc tgtcggcgat agcgcggaac ttgcgcgtca gcgtggtcct agttgggacc 222661 ttgctgtcct ggtatgccct tgtcgcggcc gtgacaacgg ttccgctggt gcgttggacc 222721 gcacactggc cgcgccgccg ggccctggtg gtcagcctgg tctgcctgac cgtctcgcaa 222781 ctcgtctcgg cgctggcgcc caacttcgcg gtgctggccg ccgggcgggt gctctgcgcg 222841 gtcacccatg gcctgctgtg ggcggtcatc gcgccgatcg ccacccggct ggtgccgccc 222901 agtcacgccg ggcgcgccac gacgtcgatc tacatcggaa ccagtctggc gctggtcgtc 222961 ggtagcccac tcacggctgc catgagcctg atgtggggtt ggcggctggc ggcggtgtgc 223021 gtgaccggcg cggcggccgc ggtcgccctg gccgcccggc tggcgttgcc ggagatggtg 223081 ctgcgcgccg accagctcga gcacgttggc cgacgggctc gtcaccaccg taatcctcgc 223141 ctggtcaagg tcagtgtgct cacgatgatc gcggtaaccg gccatttcgt gtcctacacc 223201 tacatcgtgg tgatcatccg cgacgtcgtc ggtgtacgtg ggccgaatct ggcctggctg 223261 ctcgccgcct atggggtcgc cggcctggtg tccgtgcccc tggtggcgcg gccgttggac 223321 cgttggccca agggcgccgt catcgtcggt atgaccggac tgacggcggc gttcaccttg 223381 ctgaccgcgc tggcattcgg tgaacgccac accgcggcga cggcactgct gggcaccggt 223441 gcgattgtgc tgtggggagc cttggccact gccgtgtcac cgatgctgca atcggcggcg 223501 atgcgtagcg gcggcgacga ccccgacggg gcctcaggtt tgtatgtgac ggcgtttcag 223561 atcggcatca tggccggcgc tctgctgggt gggctgctct acgagcgcag cttggcgatg 223621 atgctgaccg cgtcggcggg tttgatgggt gttgcgttgt tcgggatgac ggttagccag 223681 cacttgttcg agaatccgac tctgagtccc ggcgacggct aacacagcag gtcagcggga 223741 ccagttggtg ccgctatgcc acactgggct gaagaacgtc accggaggga aagcaattat 223801 gtcgcgctgg aagcagggct ggacgagggg gagtctattc gccgctctga acatagccgc 223861 agtggttgcg gtgctgatgc tgggtgctgg cgttgccgtg gcggacccgg acgcggctcc 223921 cggcgatccc ggaggtcccg ggggccccgg gggcacagcg ggacccgtcg acccgccggc 223981 agttgacctg ttggcgccgc cacccgaccc gttggcgctg ccgccggcac ttgacccgtt 224041 ggcgccgccg ccacctgacc cgctcgcgcc gcccccgcct gacccgctgg cagtgccggt 224101 agcagcgggc cccgttgccg ggcaggatcc gacaccgttt gttggcccgc cgccgttccg 224161 gccgccgacg ttcaatccgg tcgacggcgc gatggtcggt gtggccaagc cgatcgtcat 224221 caacttcgcg gtgccgatcg ccgaccgggc gatggccgaa agcgccatcc acatttcgtc 224281 catcccgccc gtgccgggca agttctactg gatgagcccg actcaggtac gctggcgccc 224341 gtttgagttc tggcccgcca acaccgcggt aaacatcgat gcggccggca ccaagtcgag 224401 cttccggacc ggtgattcgc tggtggccac cgccgacgac gccacgcatc agatgacaat 224461 cacccgcaac ggcgtcgtgc aaaagacctt ccccatgtcg atgggcatgg tgtccggcgg 224521 ccaccagacc ccgaatggca cctactacgt gcttgagaag ttcgccaccg tggtcatgga 224581 ctcctcgacg tacggggtcc cggtcaactc ggcccaaggc tacaagttga ccgtctccga 224641 cgccgtccgg atcgacaaca gcggcaactt cgtgcacagc gcgccgtggt cggtggcaga 224701 tcagggcaag cgcaacgtca cccacggctg catcaacctc agcccggcca acgcgaagtg 224761 gttctacgac aacttcggca gcggtgaccc ggtcgtcgtg aagaactctg tcgggactta 224821 caacaaaaac gacggtgccc aggactggca gatctaacgg ccgcgcggtt gcccacgagt 224881 gacccgtagc caatcgcggc tccccttact ggagctttac tgaaagcagg tcagcgacag 224941 catcgtgtag tgccgaagca gccggcgggc gcagtctttc accaccaggt tgcgcctgcc 225001 gtcgagactg taggcggcgt cgaccgccca gaaggcgaac aagctggtcg ataggtaagc 225061 ggccatgtcc tcgcagcgca gcagtgcctc ggccacatcc tcgagcaggt gtttgaccga 225121 gcgcgcggcc tcggcatggc tcatgccgag caagtcgaag tgccaatcgg cgggcgggtc 225181 ttggtcttcg tccagcggca gcgtctttag cacctcggag attagcgcgt ggatctgcag 225241 cggacgtaga cagtcggccg ggcggggctg ctgaatgtcg gcaagccagc ccagcaaccg 225301 ctcgtatagc cgatctgctg acatctcgcg aatcaggttg cgcgcccact tccgcggttc 225361 ctcgcggttg tcgtggtaaa ggatcagatc cagcgggcca tgcttgtcga accgcatcag 225421 gcgccagatc aggtcgaaga actcctccgc tgacaggacc tcaccgtagg gaatcgtcag 225481 ttcgcggttg cggatctcgc gcacttcggg ggcctcggtg gccgccgaaa tccattcctc 225541 gaaccgcgcg ccgtcgatct gctttgtctg cagtggtagc gataccggac gttgcggggt 225601 tttcttccac ttgaggagca gttcggcaat ctcgtcagcg gcggcgcaca gggtggcaat 225661 aaatcgctgc tggtaagact cgtcgggtac acccctagtc aacagctcgc tcaaggagac 225721 atccgaactg tcctcgacat cgctgaccat ccgccctgga ttgtcggcga cgcggtggaa 225781 aaccagaatg atgcagctgc gagcacttcc ggaggcggtg cggaagcggt gatgcaccag 225841 cgagccagcg gcgaatacac tgctcagcgg tcgctcggaa tcggtcacct cgatcaccgt 225901 tgggtggtcc tgaccgccca gctggcgctg cgcgtcgaac accttgcgga tgctgtcggg 225961 ggtggtgaag atcgatgcat tttcactcga ccacggcacc ccgtcctcgt tgacgaagca 226021 ggtgcgggcg agtttatggg tgccgggtag gaaggtgaaa ttctgtcccg ctggcccctg 226081 cgcggttccg cgccgccagg taatgaggat cttgtattcg tcgttgaacg gggtgttgtc 226141 gatatgcagc atgttgtcct gggccaggac cgacagcggt tcggcgtcct tgccgcgtgc 226201 atcgatcatt ctgatcggtc caccgacggc ataggagatc aacgcgatca tcaaggggtg 226261 caccagcgcg ccattgaccg ctggatcggt cagcattccg gggcttcgcc gcaggtctag 226321 gaagcggtga atgaaactgc gactgccctc ccgggccatg agttcgtcgt agcgttttac 226381 caattccaaa aagtcgtccg actcgacgat gtcggcaagg acgactgcgc cctgctcggc 226441 catctgatcg atgaggtctc gtagtgccgc ggtggcgatt tcggcagctt ctgaaccctg 226501 ggccgccagc gcgtcggtca gctcctgcag cagtttcctg cggtaggact ccttgtcatc 226561 tagatcacgg aatcgtttgt aggcccaggc ggcgggcagc tggtcctgtg acaccagctc 226621 gggaccgatc ggtggtgcgt attccagcgg caatatgtca tcggcggcga acttggtcag 226681 ctggattccg tcggaattat ccggcaaggc ctgggtggtc gcagtctgac cgagtgagct 226741 catgtcccgg gaaatctgaa tcacctccgc tttcgcgtat tgcgcaagaa ctcggttcgt 226801 tgacccgtcg aggtcgactg cagaacgtac ctccggaggc ggcgttatcg ccagacctat 226861 tacctggggg tctgcccgaa agggaaaacc cggtgtcctt tctggttatc gaagtgaccg 226921 gaatattcgg tgccggcggc gcacacgcga gaatggatgc cgcgcacgag tttatgcgct 226981 tgttcgggtt ctgcccgaaa gggaagactt gatttcccgt tagttcaacc accgggtgat 227041 cggcgcactg aacgagaaag gatatggcga atgcgcacga attgctggtg gcggttgtcc 227101 ggctatgtca tgcggcatcg gcgcgatctg ctgttgggat tcggggcggc gctggccggc 227161 accgtcatcg ccgttttggt tccgctggta accaagcgtg tcatagacga cgcggtcgcg 227221 gccgaccaca gaccgctggc gccctgggcc gtggttctgg tcgccgccgc cggggcgacc 227281 tacttgctga cgtacgtacg ccggtactac ggcggtcgaa ttgcccacct ggtacagcat 227341 gacctgcgca tggacgcctt tcaggccctg ttgcggtggg acggccgaca acaggaccgg 227401 tggagcagcg gccagctcat cgtccgcacc accaatgacc tgcaactggt gcaggcgttg 227461 ctgttcgatg tgcccaatgt gctcaggcat gtgctgacac tgctactagg tgtcgcggtc 227521 atgacctggt tgtcggtgcc gcttgcgctg cttgcggtgc tgctggtacc cgtgattggc 227581 ctgatcgccc accgcagccg ccggctgctg gccgcagcca cccactgtgc ccaggaacac 227641 aaggccgcgg tcaccggagt cgtcgatgcg gcggtctgcg gaatccgggt cgtcaaggcg 227701 ttcgggcagg aggagcggga gacggtcaag ctggtgatgg catcccgcgc gctctatgct 227761 gcccagctgc gggtggccag gctcaacgca cacttcggtc cgctgctgca aaccctgccc 227821 gcgttgggtc agatggcggt cttcgcgctc ggcggatgga tggccgcgca gggcagcatt 227881 acggtgggca cctttgttgc cttctgggcc tgcctgacat tgctggcgcg gccggcatgc 227941 gatctggcgg ggatgctgac cattgcccag caggcgcgcg ccggcgcggt gcgggtactc 228001 gaactcatcg acagccggcc gacgctggtt gacggcacca agccgctgtc gctggaggct 228061 cggttatcac tggagttcca gcgggtgtcc ttcggatatg tggctgaccg ccccgtgctc 228121 cgcgagataa gcctgtcggt ccgggccggg gagaccctgg cggtggtcgg tgcgccgggc 228181 agcggcaaat ccacgttggc gtcgctggcg acgcgttgct acgacgtcac acagggcgcg 228241 gtgcggatcg gtggtcagga tgtgcgcgag ctgacgctcg actcgctgcg gtcagccatc 228301 ggcctggtac ccgaagatgc cgtcctgttc tccggaacga tcggtgcaaa catcgcctat 228361 ggccgcccgg atgcgacgcc cgaacagatt gccacggcgg cccgggcggc gcacatcgag 228421 gagttcgtca acactctgcc ggacgggtat cagacggccg tcggtgcgcg cggactgacg 228481 ctgtccggcg ggcaacgcca acgcatcgcc ctggcccggg cgctactgca ccagccgcgg 228541 ttgttgatca tggacgaccc gacctctgcc gtggatgcgg tcatcgaatg cggaattcag 228601 gaggtgctgc gggaggcgat cgcggatcgc accgcggtca ttttcacccg ccgccgatcc 228661 atgcttacct tggccgaccg ggtcgcggtc ctcgactccg ggcgcctgct cgatgtcggc 228721 acccccgacg aggtgtggga gcgctgtccc cgctatcggg aattgctgtc gcccgcgccg 228781 gatctcgccg atgacctggt tgtcgcggag cgctcgccgg tgtgtcgacc ggtggccggg 228841 ctcggcacca aggccgcgca gcacaccaac gtccacaacc ccgggcctca cgatcaccca 228901 cccggccccg acccgttacg ccgtctgctg cgtgagttcc gcggcccgct tgcgttgagc 228961 ctgctgttgg tggccgtgca gacctgcgcg ggtctgctgc cgcccctgct catccgccac 229021 ggtattgacg tcgggattcg ccgccatgtg ctctcggcgc tttggtgggc agcgctcgcc 229081 ggcaccgcca ccgtggtcat taggtgggtc gtgcagtggg ggagtgccat ggtcgccgga 229141 tacaccggtg agcaggtgct gtttcgattg cggtccgtcg tcttcgccca tgcccagcgc 229201 ctgggcctgg acgcatttga agacgacgga gatgcccaga tcgtcaccgc ggtcaccgcc 229261 gacgtcgagg ccatcgtggc gttcctgcgc acgggtctgg tcgttgccgt gatcagcgtg 229321 gtgaccctgg tcggcatttt ggtggcgctg ctggccatcc gcgcccggct ggtgttgctg 229381 atcttcacca ccatgccggt gcttgccctt gcgacctggc aattccgtcg ggcgtcgaat 229441 tggacctatc ggcgggcgcg gcaccggttg gggacggtaa ccgccacgtt gcgtgagtac 229501 gcggcggggt tgcggatcgc ccaggcgttc cgcgccgaat accggggact gcaaagctat 229561 ttcgctcata gtgacgacta tcgccgactt ggggtgcgcg ggcagcggct gctagccctg 229621 tactacccgt tcgtggcatt gctctgcagc ctggcgacca ccctggtcct gctcgacggt 229681 gcacgcgagg tgcgagcggg ggtgatctcg gtcggagcgc tggtgaccta tctgctctac 229741 atcgagctgt tgtacacgcc gataggcgaa ctggcgcaaa tgttcgacga ttaccagcgt 229801 gcggcggtgg cggccgggcg gatccggtcg ctgctgagca cgcggacacc gtcgtcgccg 229861 gcggcacgac cggtggggac gttgcgtggt gaagtggttt tcgacgccgt ccactattcc 229921 taccgaacac gagaagtgcc ggcactggcc ggcatcaacc tgcgaattcc ggccgggcag 229981 acggtggtgt tcgtcggctc caccggatcc gggaaatcca ccctgatcaa gttggtggcg 230041 cggttctacg atccgaccca tgggacggtc cgagtcgacg gatgcgacct gcgggagttc 230101 gatgtcgacg gctatcgcaa ccggctcggc atcgtgacgc aggagcagta cgtcttcgcc 230161 gggacggtcc gcgatgccat cgcatacgga cggcccgatg ccaccgatgc ccaggtcgaa 230221 cgggctgcgc gggaggtcgg tgcccatccg atgatcaccg cactcgacaa cgggtacctg 230281 catcaggtca ccgcgggtgg gcgcaatctg tccgccggtc agctgcagtt gctcgcattg 230341 gccagggcgc gtctggttga ccccgacatt ctgctgctgg atgaggccac cgtggccctg 230401 gatcctgcca ccgaggccgt ggtgcagcgg gccaccctca ccctggcagc ccgtcggacg 230461 accttgatcg tggctcacgg gctagccatc gccgaacacg ccgaccgcat tgtcgtgctc 230521 gagcacggca ccgttgtcga ggacggcgcc cacaccgaac ttctcgctgc tgggggccac 230581 tattcgcggc tgtgggcggc ccatactcga ctgtgttcgc cggaaatcac tcagcttcaa 230641 tgtattgacg catagacgtc accaagccac cgaatgggtg gcgagttgac cgggcgccgg 230701 atcccgacgg ttgtggttga tctgccgaat caacggcttc tggccacgaa catgtgtccg 230761 cgactggcgt tctgcgatac caacccaatc ggttactata gaaactgttc ccgccgacaa 230821 ctaactccct tgttcgcgtg gaggggttct cgggtccggt cagcgaggtc cggagcgggg 230881 cggaaatttc attgaacagc cgtagaagtt cagccaggac cggaacggat ccagcggcaa 230941 gcatgccttc aggagccatg ttgtcgaatc agtgcctagg gctgggggcg cccggaagga 231001 acaccacagg ggggaccgac attccgcatg tggtcaagcg cagcggagcg aaattccgcg 231061 aggagttcat cctccgtccg gaccgggtgc aaatggcacc ggtgaatgtc atttcggtcg 231121 cggtggtggc gagcgacccg ttgacccgcg atggagcttt ggcccgactc tcgtctcacc 231181 gggagctcga cgtgcgcgct tggcaggctg gatgcgaaac ctcggtcctg ctcgtgctgg 231241 ccaccacgat caccgcgcct cttctatgcc agatcgagga cgtgcagaag gatggcccca 231301 gtcacgcgcc gaaactggtc gtcgtcgccg acgaattctc cgctgaacaa gttttccgga 231361 tgatcaagct ggggttgacc gggttgttgt atcgcagcca gagcacgttc gactgcatcg 231421 tcgagacaat ccggttgtcc gccgaaggcc gcctgcgact ccccgaacgt gtccagcgtt 231481 acctggtcgg ccgcatcaag tccaccccga ccgccgaacc tgacacaccg tgcgccgccg 231541 ctcttgccga gcgtgaggtg gcggtgctgc gtctgctagc ggacggcttg agcacgcacc 231601 aagtggcggt gcagctcaac tattgcgagc gcacgatcaa gaacatcgtt catgacatag 231661 tgacgcggct gaagctccgc aaccgcacgc atgccgtcgc acatgcgctg cgcgcgggcc 231721 tcatttgatt gatggccggc gtccgacgta cgtgcggccg ggccgatccc aagcgagtgg 231781 tgcaacgtgc acggtagcca ttatgtatag caacatacat atgcctcgga tggagcggcg 231841 atgcaaggtc cacgcgaacg gatggtggtc tcggccgcgc tgttgattcg ggaacgggga 231901 gcccacgcca ccgccatctc ggatgtgctg cagcacagcg gcgcaccgcg ggggtcggcc 231961 tatcactact tcccgggcgg tcgtacccaa ctgctatgcg aggccgtcga ttacgccgga 232021 gagcatgtcg ccgccatgat caacgaggcc gaggggggcc tggagctgct ggacgcgctg 232081 attgacaagt atcgccagca gctgctcagc accgactttc gcgccggctg cccgatcgcc 232141 gcggtctcgg tggaggcggg cgacgaacaa gatcgcgagc ggatggcgcc ggtgatcgcg 232201 cgtgcagcgg cggtgtttga ccgctggtcg gacttgactg cccagcggtt cattgccgac 232261 ggcataccgc cggatcgggc gcacgagctg gcggtgttgg cgacgtcgac gctcgagggc 232321 gcaatcttgc tggctcgggt gcggcgcgac ctgacgccgc tggatctggt tcaccgccag 232381 ctgcgcaacc tgctgctggc cgagctgccc gaaaggagcc gatgatgacc agctctgatt 232441 ggctgcccac cgcgtgcatc ctctgcgagt gcaactgcgg catcgtcgtg caagtcgacg 232501 atcgccgact ggcccgcatc cggggcgaca aggcgcatcc ggggtctgcg ggctacacct 232561 gcaacaaggc gttgcggctg gaccattacc agaacaaccg ggctcgcctg agctcgccga 232621 tgcgccgccg agccgatggc acctacgagg agatcgactg ggacacggcg attgtcgaga 232681 ttgccgaggg attcaaacag atccgtgata cccacggcgg ggacaagatc ttctactacg 232741 gcggcggcgg acagggcaat cacctcggcg gcgcctacag cggcgccttt ctgaaggcac 232801 tggggtcgcg ctaccggtcg aatgcgctgg cgcaggagaa gaccggcgaa gcctgggtcg 232861 acttccagct gtacggcggt cacacgcgcg gcgagttcga gaacgccgag gtgtcggtgt 232921 tcgtcgggaa gaacccatgg atgtcgcaga gcttcccgcg ggcccgggtc gtgctcaacg 232981 agatcgccaa ggatcccggc cggtcgatga tcgtgatcga tcccgtcgtc accgacaccg 233041 cgaagatggc cgacttccat ctacgggtgc aaccgggttg cgacgcctgg tgcttggcgg 233101 ctttggccgc ggtcttggtc caggaaaacc tctgtaacga agcctttctt gccgcgcacg 233161 tgcacggagt ggacaccgtg cgcgccgccc tgcaagaggt cccggtcgcc gactacgcgc 233221 agcgttgcgg ggtggacgag gagttgttgc gtgccgcggc ccggcgcatc ggcaccgccg 233281 cgagcgtgtc ggtgttcgaa gacctgggaa tccagcaggc gcccaacagc accgtctgct 233341 cctatctgaa caagctgctg tggatcctga ccggcaactt cgcgaaaaag ggtggccaac 233401 acctgcattc gtcgttcgct ccgctgttca gccaggtctc cggccgcaca ccggtcaccg 233461 gtgcgcctat tatcgcgggc ctgatcccgg gcaacgtggt gcccgaggag atcctgaccg 233521 agcacccgga tcggtttcgg gcgatgatcg tcgagagcgg caatccggct cactcgctgg 233581 ccgattcagc cgcctgccgg gcggcattcc aggcgctgga actgatggtg gtcgtcgatg 233641 tcgccatgac cgagacggcc aggctcgccc actacgtgct gccggcggcg tcgcagttcg 233701 agaagccgga agccacattc ttcaatttcg agtttccacg caacggcttt cagttgcgcc 233761 ggccgttgtt tccgccactg cccggaacac tgcccgaacc cgagatttgg gcgcggctgg 233821 tgcgggcact tggcgtagtc gacgaagcgg acctgcggcc gctgcgagag gccgctgctc 233881 agggtcgcca ggcgtatacc gaggcgttcc tcgcggcggc ggcgaccaat cccaccgtgg 233941 cgaaactgac cgcctatgtg ctctatgaaa cgctcgggcc gacgctgccg gacggtctgg 234001 ccggggcggc cgcgttgtgg ggacttgccc agaagacggc gatggcctac cctgacgccg 234061 tccgccgcgc cggccacgcc gacggcaacg cgctgttcga cgcgattctc gagcgcccct 234121 ccggggtcac gtttaccgtg cacaactacg aagacgactt cgctttgatt agccaccccg 234181 atcacaagat cgccctggag attccggaaa tgctggcaga gatccggtcg ctgacccaga 234241 ccccgtcgcg gttgaccacg cctcaactgc cgatcgtgct gtcggtgggc gagcgccgcg 234301 cgtacacggc caacgacatc ttccgtgacc cgtcctggcg caaacgcgac gccaacgggg 234361 cgctgcgggt cagcgtcgaa gacgcccagg ccctgggact ggccgatggg tgcctggctc 234421 gtatcacgac cgcggcgggc agtgcggagg cgacggtgga ggtcaccgag acgatgctgg 234481 ccggacacgc cgcgctgccc aacggctttg ggctggacta caccggcgac gacgggcgca 234541 ccgtcgtcgc cggtgtcgcc ccgaacgcac ttacttcgac gagatggcgc gacccctacg 234601 ccggcacccc ctggcacaag cacgtgcccg ccgccatccg ccgagcagac gcagaatcgc 234661 ccatttggta gcccaaatgg gcgattctgc gtctgctcgc ggggtcttag cctagttcca 234721 gatccggacc ctgcgctgcg ggtccagaaa cagcgcgtca tcctcggtga cgtcgaaggc 234781 ctgataaaaa gcgtccacgt tgcgaaccac accgttgcac cggaactccg gcggggagtg 234841 cggatcgacc gccaaccggc ggattgcttc ggctgcacgc gatttggttc gccatatttg 234901 tgcccagccg aagaacaccc gttgcatgcc ggtcagcccg tcgataaccg gagcggggtt 234961 gccgttcagc gagagctggt aagccagcag ggcgatcgac agcccgccca ggtcgccgat 235021 gttctcgcct atggtgaacg cgccttgcac atgaggcggg ccggggtggt cgacgagatc 235081 gcgcggcgtg taagcgtggt actgctcgat caacgctttg gtgcgggcgg cgaactcggt 235141 gcgatcgtcg tcggtccacc aatcgaccag attgccgtcg ccgtcgtatt tggcgccctg 235201 atcgtcgaaa ccgtgcccga tctcgtgccc gatcaccgcc ccgatcccgc cgtagttggc 235261 ggcctcgtcg gcctgcggat cgaaaaatgg tggctgtaaa atcgctgcgg ggaagacgat 235321 ttcgttcatc cccgggttgt agtaggcgtt gacggtttgt ggtgtcatga accactcgtc 235381 gcggtcgacc gggccgaaaa gcttggctag ctcgcggtca tggttgacgg cgtagccgcg 235441 ctggacgtta ccgtagaggt cgtcgcggtc gatcgccagc ttcgagtagt cgcgccactt 235501 gatcggatag ccgactttgg cggtgaactt gttcagcttc gctagcgcgc gttgccgggt 235561 ctgcggcgtc atccaatcca gctcgctgat gctgatccga tacgcctcct gcaggttgtc 235621 caccagggtg tcgatgcggg acttggcatc cggcgggaaa tggcgttgta catagagctt 235681 tccgacggca tcgcccatca ggttctccac cagtgacacc ccacgcttcc aacggtcccg 235741 aagctgctgt gcgccggtaa gcgtgcggcc gtagaattcg aagtcctcgg cgaccagggc 235801 gcgggtcagc cagggggccc gggcgcggat caaacgccaa cgcgcccagc atttccagtc 235861 ttcaacgtta acgctcgccc acagcgaggc aaaggtgacg aggtaatcag gttggcgcac 235921 aaccagttcc gtcatggcgt ccggagcgct ccccaatgcg gtcacccagc tgacccagtc 235981 gaaacccgcc ccttcggtct gcagctgggc aaacgtgcgc aggttgtagc caaggtcggc 236041 gtcgcggcgc ttcaccacat cccaatgcgc gtcggcgagt ttggtctcca gcgcgacgat 236101 gcggtccgcg gttttggcat ggtcacggct ctcgcccccg tacaccaggc cgaacatccg 236161 ggcgatgtgc cccgggtagg ccgctagcac ggcggcgtgt tgctcgtcac ggtagtagga 236221 ctcgtcgggt aatccgatgc cggattgggt gaaatgcacc aagtaacggg tcgagtcttt 236281 ggaatcggta tcgacataga ctccgatgcc gccgcccacg ccggcacgtt gcagagtgcc 236341 aagggcggcg gccaattcgg tggcgtcggc cgcgctgtca atcgtggcca attcgtcgtg 236401 cagcggttgc acccctgcgc gctcgacggc ttcctcgtcg aggaagctgg cgtagaggtc 236461 gccgatgcgc tgcgcatcgg tgcctaccgc agcacctgct tggctggcct ggatgatcag 236521 gtctcgcact tgtgtctcgg cgcggtcgaa caggctacgg aaggcgccgt cggtcgctcg 236581 gtccgctggt atctcgtgtt cagccagcca gcggccgtta acgtggccga acaggtcgtc 236641 ttggggtcgg gcatcagcgt cgatgtggct caggtcgata cccgagggga tggcaagtgt 236701 caccccgcca tccttccacc tcttttcggg tgcaacgatc gggccatgcc tgacggggag 236761 cagagccagc caccggccca agaagatgcg gaagacgact cgcggcccga cgccgcggag 236821 gccgccgcgg ccgaacccaa atcatcagcc ggtccgatgt tctcgaccta cggtatcgcc 236881 tcgacactac tcggcgtgct atcggtcgcc gcggtcgtgc tgggtgcgat gatctggtcc 236941 gcacaccgcg atgactccgg cgagcgtacc tacctgaccc gggtcatgct gaccgccgct 237001 gaatggacgg ccgtgctgat caacatgaac gccgacaaca tcgatgccag cctgcagcga 237061 ctgcacgacg gaacggtcgg tcaactcaac accgacttcg acgctgtcgt gcagccctac 237121 cggcaggtgg tggagaagtt gcggacgcac agcagcggca ggatcgaggc ggtagcgatc 237181 gatacggtgc accgcgagct ggatacccag tccggtgccg cccgaccggt agtaaccacg 237241 aaattgccac cgtttgccac tcgcaccgac tcggtgctgc tggtcgcgac gtcggtcagt 237301 gagaacgccg gcgccaaacc ccagaccgtg cactggaact tgcggctcga tgtctccgat 237361 gtggacggca agctgatgat ctcccggttg gagtcgattc gatgagaaat gcttggcggc 237421 tggtggtgtt cgatgtcctg gcaccactgg ccacgatcgc cgccctggcc gcgatcggcg 237481 tcttgctcgg ctggcccctg tggtgggttt cgacgtgctc ggtgttggtg ctgctggtgg 237541 tcgaaggtgt ggcaatcaac ttctggctgt tgcgtcgtga ttcggtaacc gtcggtaccg 237601 acgacgatgc gcccgggctg cgactggccg ttgtcttcct gtgcgccgcc gcgatctcgg 237661 cggcggtggt gactgggtac ctgcgctgga cgacaccgga ccgcgacttc aatcgggatt 237721 cccgggaagt ggtgcatctt gccacgggga tggccgagac ggtcgcgtca ttctccccga 237781 gcgcaccggc cgccgctgtt gaccgggccg cggcgatgat ggtgcccgaa catgcgggcg 237841 ggttcaagga gcaatacgcc aagtccagcg ccgatctcgc acggcgcggt gttacggccc 237901 aggccgctac gctggcggcc ggcgtggagg cgatcgggcc gtcggcagcc agtgttgcgg 237961 tgattctgcg ggttagccaa agcattcccg gccagccgac cagtcaagcg gcgcgagcgc 238021 tgcgggtgac cttgaccaag cggggcagcg gctggctggt gctcgacgtg acgccgatca 238081 acgctcgcta agagtcggcg gcacgtacgg atttggctct gacgaaccgg tccgacagcc 238141 gccgcatccg gatcatcagc gaggccgacg ggctcacgat gccgtcgagg taggcggtca 238201 ggtcctgcgc tgtgacgcca atgcgcgacg cgaattcctg tcgttgcagg ccagagcggt 238261 ccaacaggag cccaacctga cgggccacct cggcgcgctc attggcgtct aggtgagtac 238321 gggcccggtc cagcacctcc caaaaggcgt tggcgatgcc ggtcgccggt atgccctcga 238381 ggacttcttc gacttggcgt gctgtccgcc cgtaggggtc gcgcttgagc gcggccgcta 238441 tgcgttgcca ggtggcgatg tcgccacttt ccagcgccga acgaatggcg acggtaggcc 238501 agaactcgac ccgccggtcg acgtccggct cgctccacgc gacggtgggt tgttgcggcg 238561 gtgcggggtg tggctcggct gccaacgtca cctcgcctcc tccaacatcg ccacggccac 238621 cgacaggcaa cgccgccgga cctcttccca ctttgcctgg gcatcagctc cgggcgactg 238681 gtcaccgagg tcagacggtt gcggatctgc caggcgacca accaactggg tggccatcca 238741 ttgccgcccg ggtgcttgac aagagtagta ccgatccatc ccagccagca ccgcggcggc 238801 ggtttcgggt gccatcgtat cgaccaggtc agcaaagtcg gcgtagtcgt ggctgctgtt 238861 tcgggacatg atcaggtagc ccttgaagcg cagcgtttcc gcgccggttg ggatctgcaa 238921 gcggtcaccg gtgggcaatg cgacgttggt cgtctccacc gggctgcgcc gccggtagcc 238981 ggggcccgcc cggtgagtct gcaccccccc gcactcccag gtggttgtct gaagcgcgtc 239041 gagggcgacc gcgagccgct tgcgccacac ggtgaccggg tgcaccgggc gctggcccca 239101 tgatatcgcc cgggcgatgc cgttgcggta ggccagctgc acctggtcga gcgccttacc 239161 gtcacatcca cacccggtga aggcgagcgg atcggtaacg caaatggcgt ccggcgcaag 239221 tcgcttgagc ttggccgccg acttgagcac catccgcaca tccgcgctgg gcgggatcgc 239281 cgcggcgaag tcgtcaggaa tgaccacgac gtcaccgagg tcgactttcg gcagcggtcg 239341 gtcgaagtcg accgacggca agatgtgggc cagccatcgg ggcaaccacc agttccatcg 239401 gtcaaacatc gccatcaatg ccggtaccag caccagccgc acgacggtgg cgtccacggc 239461 gatcgcgacc gcgcacgcca cgccgatctc ggccactagc ggcatgccgg cgaacgcgaa 239521 cccgcaaaac accgcgatca tgatcaacgc ggcgctggtg atcgtgcgcg cgctggtgcg 239581 cacaccgtac gcgaccgcgt cgcgggtctg gcccgtctgc aggaaccgct cccggattcg 239641 cgtaagcagg aagatttcat agtccatcga caacccgaac gtcatcgcca ggaccagcgg 239701 gggaacggtg ctgtcgatcg aatgaagcgc cgggaaaccg agcccccgtg cccagcccca 239761 ctggaagacc atcaccaggc tgccgtaggc ggcggccacc gacagcagcg tcatcagcac 239821 gcccttgaac gccaggaaca ccgagcggat tgagatcaac aacatcaaaa acgcgatcac 239881 cgccacgaag accagcacca gcggttgcgt cgcggacacc cggtcgtcga aatccttgat 239941 cagagcggtc ggcccgccga cgtccacttg tgccgcgccg gcaacccggg gtagctgggt 240001 ccgcatccag gtgatggtgt cgcgggcgcc caaatcctcg ggatcgaccg atagcaccgc 240061 gctgagcaaa gcgctgccgt tgtcgtcggc gaatcgcggt ggggccaccg aaacgacgtt 240121 gggcgcctgt gcgatccgat gacggattgc ggcgattgtc tggctatgtt cgggtgcgga 240181 cgcaccgccg gcgtcaaacc tgaccagcac ctgaaccggg cccagcgcgc ccggccccag 240241 cgcttgggcc gcggccgctg cgccggtgcg gatctcgtgt gacgagtcga actggcgcag 240301 caagctgttg cccagcacca tcaaggttgc cggtgccgcc atgacaagca gcacggtcga 240361 tgccgccagt gctgtgatcc agggtcggcg catcacccac ccgacccagc gggaccagaa 240421 ccaagattgc gtgcttgccg gccgccgcga ccagtgcact aacgctgacc gcttggccgc 240481 cgcgcgggca aatgttgcta gcacggcagg tgtcagggtg gccgacgtca gcatcgcaac 240541 cgcgaccgcg agaatcgccc cggtggccat cgatctcagc gccggggtgt tgatcaggta 240601 gatcccggtc agcgacgcga tgaccgtcat accggacaac accacagcca accccgaagt 240661 ggccatcgcg gcgtcgaccg cgtcgggcgg ccggcgtccg caacgcagtt cctcgcggta 240721 gcgcatcagg atgaacaggg agtagtcgac ggcaagcgcg atgccgaaca tcgaaacggt 240781 cgatgtcacg aacaccgaca tggtggtgtg catcgacaac acaaacacca ggcccatggt 240841 gatgacgacc gtgcaaacgg cgagtgccag cgggatcgct gcggcggcca acgagccgaa 240901 aaccgcaacc aggaccatca gaatgatagg caggttccag cgttcggcgt tggcaatatc 240961 gtgtttggtg tttgccgccg cggccgcgga cagcgcgccc tgcccgatga catagagccg 241021 cactttgccg ttggcagttt gcccggactg atcgcctttg acgcctattc ggtcgcgcag 241081 ctttttggcg acgtcactgg tgcccgcgtt gcgggcgtcc agccgcagcg acaccacata 241141 cggccggtcc ggttgcgggg gccgttgggt ggggttgggt gcctccgtca ccccaggcag 241201 ttcgctggct atttgtcgca gtagcgcgac ggcattgtcg atgtcttggt agctagcatc 241261 cggtcggggg gccgctacca gcgccagcgc cggggctccc cggtccgggt agtgcgcgtc 241321 gagttggtcg tggaccagca atgactgcga cccggcgact tcgaaaccgc caccggttag 241381 attccccgac tgcgtcatcg ccaggtaaac cgccggcact aacgccagca accaacccgt 241441 gaagaccaac caacggcacc tgcgcaggtt gcggctcaag cgcatcatga actgctggat 241501 ttcggactcc ccgtactctc gcgcagtgcg tgcccgcgag cctaccgaag atcgcgtgca 241561 tgcgttcggc gtggaccgca cagcacctgg agttggcggc gccgagggcc gagatggcag 241621 gatgacggat cgtcgggggc gggaactccc aggccgccgg gccgtcgcaa acccgtcgca 241681 aacccgtcgc aaaccgtaag gagtcatcca tgaagacagg caccgcgacg acgcggcgca 241741 ggctgttggc agtactgatc gccctcgcgt tgccgggggc cgccgttgcg ctgctggccg 241801 aaccatcagc gaccggcgcg tcggacccgt gcgcggccag cgaagtggcg aggacggtcg 241861 gttcggtcgc caagtcgatg ggcgactacc tggattcaca cccagagacc aaccaggtga 241921 tgaccgcggt cttgcagcag caggtagggc cggggtcggt cgcatcgctg aaggcccatt 241981 tcgaggcgaa tcccaaggtc gcatcggatc tgcacgcgct ttcgcaaccg ctgaccgatc 242041 tttcgactcg gtgctcgctg ccgatcagcg gcctgcaggc gatcggtttg atgcaggcgg 242101 tgcagggcgc ccgccggtag atgccggacc gccgccgggt ccggcgcagt cgagcgtgag 242161 gcagcggtcg cctaccgggg cggtgtctcg ccgccttctg gtcgcaggtc aggggtcggc 242221 gctggacctt gcggtgtggt ttcgaccggg tcgtcgcagg gtgtgccctg cggttggatg 242281 acaagtcgca ggtttggatc ggttggcggg tcgcgatcgt tgtcggaatc ggcggtgctc 242341 tcggtgcgga acatgaagaa gaacaccacc cagccgattg cggcgatgag cagccagctg 242401 atcagccggt agatcaacat cgccgagatg gcactcggca agggcatgcc gctggatacc 242461 aggccgggta ccagcaccgc ctcgaccacc aacagaccac cgggcatcag cggtatggtg 242521 ccgaccgcgc gggcggcggc gtaggcgacc gccagcccac cgaccgaggc atggtcgccg 242581 gcggcgtacg cggcgaaacc gaggcaggct acgtcggcga tccagttgaa caacgaccaa 242641 ccgaacgcca cgcccaggtc gcgcctgccc aggctgaccg attccagctg catgagcgtc 242701 tcgcgccact tcggtaggcc ggcatcggcc ggcctaccgc gaaccgagtt ggcccacgac 242761 aaaactctcc tgccgatccc ctcgatgagc tccggccgcg acgccaccgc ctgggccagt 242821 agcagcaatg tgacgaagcc gcccagggtg aacagcagtg agaacgggtt gttcttggcg 242881 cccaggaaga atgcgccacc caacccgagc aatgccaagc ccaccgcctg caacacgccc 242941 gacatgacca gctgccatga cgccaccacc gtcgaggcgc cccagatgcg ttgctgacgg 243001 agtaagaacg tagccgacaa caccggccca cccggcagcg tggtgctcag cgagttggcg 243061 gcgtagaagg cggcctccga ccgccattgc ttgacgtgca ccccggcgga tttcagcagg 243121 gttcgctgaa tctgggcgaa gctgtgcatc gaggcgcccg cggctgccac cgcggccagc 243181 aaccaccacc acttggcgcg atacaagctc acccaggcct tggcgagctg gtcccagccc 243241 aacgccacct ctatagcaag cacgattgcg acgatggcca gtaccgccca tcgcaaccac 243301 cagtacttgc cgcgcggggg tacgccctca gcggggggtg cccccacccg cgtgcgaggg 243361 agtgccccca cgcgctggcg gaggttgcgg gcgggggcgt cgtgcgacac gtgcttaagg 243421 gtaaccgtgc aggtggcgcc gtaatcgcga tacatcgcta accgtgtcag cctcgttggg 243481 gggtcgtgac cggatcgtgc cgcctggcaa agtaactatg cgggctcgac gcgacccgcc 243541 gcgaccttac gacgccgccg ttcccgttac gcttgccgga tgtcggcgag cctggatgac 243601 gcttcggtcg caccgctggt tcgcaagacc gcggcctggg cgtggcggtt cttggtcatc 243661 ctggccgcga tggtcgcgct gctgtgggtc ctcaacaagt ttgaggtcat cgtcgtcccg 243721 gtgttgctgg cgctgatgtt gagtgcgttg ctggtgccgc cggtggattg gctggactcc 243781 cggggcctgc cgcgcgctgt cgcggtgacg ctggtcttgt tgagcggttt cgcggttctc 243841 ggcggcatcc tgacgttcgt cgtcagccaa ttcatcgcgg ggttgccgca tctggtcacc 243901 gaggttgagc gcagcatcga ctccgcgcgc agatggctga tcgaaggccc ggcgcacttg 243961 cgcggcgaac agatcgacaa cgcgggcaac gccgcgatcg aggcgctgcg caacaaccag 244021 gcgaagctga ccagtggcgc attgtcgact gcagccacca ttaccgagct ggttaccgcg 244081 gcggtgctgg tactgttcac gctcattttc ttcctctacg ggggccggag catctggcag 244141 tacgtcacga aggccttccc ggccagcgtc cgtgacagag tgcgtgcggc ggggcgcgcc 244201 ggttatgcgt cgctgatcgg gtacgcgcgg gccaccttcc tagtggcatt gaccgatgcg 244261 gccggggtgg gcgcggggct ggcggtgatg ggtgtgccgc tggcattacc gctggcctcg 244321 ctggtgtttt tcggtgcctt cattccgttg atcggtgccg tggtcgccgg gtttctggcc 244381 gtggtggtgg ccctgctggc caagggcatt ggctacgcgc tgatcacggt cggtttgcta 244441 atcgcggtga accaacttga ggcccattta ctgcagccgc tggtgatggg tcgggcggtg 244501 tcgattcacc cgctggccgt ggtgctggcc attgccgctg gcggtgtgct tgccggagtc 244561 gtcggcgccc tgttggccgt cccgacggcc gctttcttca acaatgcggt gcaggtgctg 244621 ctgggcggga atccgttcgc cgacgtggca gacgtttctt ccgatcacct caccgaggtt 244681 taaaggcgtc cttcgcggcg aagcagatcc tgggcggaca gggcgccgcc gccgcggcgg 244741 cgctgacgcg tcttatcgct cgtgccgcgg gcattcagct gctcagtggc tgcctctgag 244801 tcgtcgccgt ccgaccgtat gattggcagg gccgcggtgg gttcggccgg gtcaccggct 244861 gcgtctgtgg agcggttcgc cgcaagcggc atagcccggg tctgaccggc agacggggcc 244921 gatggcgggg tcggggttgg tggcggcgtg gatgctggcg actgcacgga ccgaccggca 244981 gccgcgaggc gggttgtcgg cggctccgtc gacgacccga tctgcatccg tgtggtgctc 245041 gccccagacg gcgccgccgg cgcggcagtt gcttccaggg caggcgtgag ctccggtgag 245101 cttgctggac tcgagcgggc cggtcgaggt gactccgcca gcggatgggt cggatcgtgc 245161 ggtgggcgcg ggtccccagc ggcgcgcgcc gcaaccagcc cagctgtgac cggaggacgt 245221 gcgggacgcc cgttgctgac gggccgcttg cgctcgtcgg gcaggtggat ctcgcccagc 245281 ccgatgcggg tctgcaggcg tctggcccag cgcggtgccc accagcagtc atcgccgagc 245341 agcttcatca ccgatggcac taaaaacatc cgcaccacgg tcgcgtccag cagcagcgcc 245401 gccatcagtc caaaggccag atacttcatc atcaccaggt cggagaacac gaacgcgccc 245461 gcgacgacgg caacaatcag cgccgcggcg gtaatgatgc gtccggtggc tgcggtgccg 245521 atccggatcg cctcctgggt cgacatgccg cgctctcgcg cctcgaccat ccgggacacc 245581 aagaacacct cgtagtcggt ggataggccg aagaccagcg cgatgatcag cccgatcacc 245641 ggcgctgtca gcggggtcgg cgtgaaattc agccacttcg aaaagtgtcc gtcgacgaat 245701 atccacgtca ggatgcccat ggtggacccg agcgtcagag cgctcatcag cgtcgccttg 245761 attggcagca ccaccgagcc gaacgccaag aacatcaaga cgatcgtggt ggtcagcagg 245821 atgaccacca tcagcggcat cttcgcgaac aggccgtgga ttgaatccag ctccagggcg 245881 ggagttccac cgaccaagac cgtgattcct ttgggcgggg tgatcgcgcg cagctcggtg 245941 agcttcttcg acgcgtcagc cgggttgatc aacccgttct gcaggacgcg caccgatgga 246001 tctttagatg cgcctaccgc gtaggcacgc tcttgccaca tattcgccgg atcgttgtcc 246061 ggctcgatga atccgccgat cgccatcgcc ttgctgcgga tgtcagcgat ctgcgcctcg 246121 gtgaccggtt gatggttgct ggtctggatc accagtgtca gcggattggt gcggtatccg 246181 gggaagagtt tgtcgaactc ctcctgcgcc tggcgcaccg aattggtcgg cggcaagtac 246241 ttctcgctga tcccgcccaa tgacagcttg cccaccggga taatcagcaa aatcatgatg 246301 atgacgatcg gtgcggcgaa cagcactggg cgcttcatca cccggttaac cagcttgccc 246361 cagatgccgg cttcgacctc ttcgcgggtc ttggtccgct gcaggcggtc ggcgagccag 246421 ttcaggtagg cggccgaaat cttccagttc gccaggaagg gcacccggaa cagggtccgc 246481 acgccgagcg cgtcgacgtg tttgcccagg atccccagac aggccggcaa cacggtgata 246541 gacaggatgg ccgacagcat caccgatgcg atcgtggcgt aggtcagcga cttcaggaaa 246601 ccctgcggga agagcagcag accgatcgcc gacgcgacga tcaacaccgc cgagaacgtc 246661 accgtgcgtc cggcggtgat caccgtgcgc cgtactgccg tctcggtgtc gtagccttcg 246721 gcgatctctt cgcggaaccg gctcacgatg aacaacccgt agtcgatggc gatccccaga 246781 ccgatcagcg acaccacggg ctgggcgaaa tagtgcacgg gaccgaagat cgcgaggaac 246841 cgcatgatgc ccagcgcgcc ggcgatgcac agccctccga ccatcaccgg taggccggcg 246901 gcgatcacgc cgccgaacac gaagaacaac accaccgcca ccaacggcag cgccagcact 246961 tccattcgcc gttggtcggt ggcgatggtg ccggtcaacg cctcggccac cggttgcagc 247021 ccggcgagct tcaccgtgcc tccgtcgagc cgctgcaggt cgggtgcgat ggccttgtag 247081 ttgttgagga tggtgtcgtc gtcatcaccc ttgagcggga tggaaacgaa ggtgtacttc 247141 ttgtcggcgg tggccatgcc ggtcgcctga ctcgctctca ggtagccggc ccatcccaag 247201 acctggtcgg ggtgatcctg ctggaaccgg ttgagctcgt cgacgacctt ctttgaccag 247261 gccgggtcgt caacggtctt gccggctggg gcttggaaga tcgcgacgat gtgaccgctt 247321 cggtctcggc cgtagacctg gtcgcccagc accgatgctt gcaccgattg gctgccgtcg 247381 tcgtagaagc cgctctgcgt gacgtgcttg ccgaggctca gcccgaaaac gccgccgccg 247441 aggcatagag cgaccatgac cccgattacg atgaaccggt agcggtacac agttcgaccc 247501 caccaggcga acacgtaagc tccttactgg atcggcagcg acccgcgtat tgctttttgg 247561 ttgtcacaca cgtcggctgt cacactcgcg aggtcaacag cgaggacagc ggccggaacg 247621 gctgcagcca agccccctgc tcaggtagcg aatcgaggcc gattcgaggt agtggttccc 247681 ggaaaacacc agcgatgtcc tccaggtcga cgaactccaa ggtatccgac gctagcgccc 247741 aactggcgtg ttcacgaaat ccgagcactt gaactggggt tccgctgcgg gcgaccgcct 247801 ccaacggttg gcggaatgcc tgaccgtcgg ccgacgccac caccagcgcg gcgagccctt 247861 cgcggtagcg ctcgtcgatg tgcgccaaca tgtcgcggtc aacgtcgctg tcctcgtcta 247921 ctttcggttt ggcgaagacg gcgaatccga cattgcgcaa cgcgtccacc cacggccgga 247981 ccacctcggc gctgccaggg gcgatgttgg tgaagacggt ggcctccggt tcggtcgaga 248041 tgcctggacg gccggccaca atctctgcgg ttcgggccag cagccagcgt cccagggcgt 248101 cgaatcgtgg tcgttccagt gctgtcggcc ggcggcccaa gatggagccc aaacccatgt 248161 cgaggttggg agcgtcccac accagcaata cccgtgcccc cggcgcaccg agactggtca 248221 gcccgtcctg cgataagtct tccgccagta ccgagtgccg ggcgagggat tcagatgttt 248281 gtgaagtcac gtcttcggtc aggctcatca tcatctaatt ttcaggtctc tttcagagca 248341 accgtgcttt ttccataaca actcgatgac tgcgccgccc ccaagctggg ctttcctctc 248401 gtacttggta gccggtcgga cgaccgaaat cggcagcagt tcggtgtcgg ggtcgacgcg 248461 aaccagccgg ggttcggcgt cgccggccgc ggcgatgtgc tcggcgtagc cgggatggtc 248521 ggtcgcggcg tgcagcacac cactgggaac gagccggtct gcgatcaagg ccatggtggc 248581 cggctgtaac aggcggcgct tatggtggcg tgccttcggc cacggatcgg ggaagaagac 248641 tcgaacaccg cacaacgaat cgggggcgat caagtgttgc agcacgtcga cggcattgcc 248701 aaggatcagc cggatgttga tcccgtcgga gcccactttg tcaatcgcgc agagcagctg 248761 agccagcccg cggcgataga cgtccacagc gatcacgtcg acatggggtt cggccttcgc 248821 catcgccagc gtcgacgtgc cgctgccgga gccgatctcc aacaccaccg gcgcgtcacg 248881 gccaaaccag gcacgggtat ccaccggtgt cccgcgcggg gattgaggta gcgccaggag 248941 gccaagctcc ggccaaagtc gctcccaggt ctcgcgttgg gccttggaga tccccgaccg 249001 ccgcgaccgg atgctcgtgc tggggagctg gcccgatgcc accggtgtgt cgggacgtag 249061 ccctaccccg ggttgcgcat gcatttgtcc atggtggacc atcagcgccc ggcgtagccg 249121 cccctggtcc agattgatac ccaacagttg ccttcggcgg gtagcggaca actgctgact 249181 cgcgcctcgg cggcgagggt gccaccattc tgaacgaacc gatcgggtgg gagatgcgcg 249241 gacaagggca ccagattttc gtcgacgagc tggcgcgatt cgccaccagc tccgccgacc 249301 agcgggtagt ggcgatcgcg cagcgggccg ccgaaccgct gcgcgtagcg gtccgtgggc 249361 gtcccggggt gggttgccgc acggtggcgc gcgccctgca gggtgctggg agctcgtcgg 249421 gcatgacggt gacaccgcaa gcacgcgccg ccgactctga cgtcgacctg gtcgtctacg 249481 tcaccgtcga ggtagtcaag cccgaggacc gcgaagccat cgccgccacc cggcgcccgg 249541 tggtggcggt gttgaacaag gccgatctgg ccggcccgct ctcgggtgca ggtccgatcg 249601 tgatggcgca ggcccggtgc gcgcaatttt ctacactcct cggggtcccc atggagtcca 249661 tgatcgggtt gctcgccgtc gcggcgctcg acgatcttga tgacaccttg cgggccgcgc 249721 tgcgggcgct agccgcccac cccgacggct ttgacgctct cgaccgagcc gttgcggggt 249781 ttctggcggc agccctgccg gtccctaccg aggtacggtt gcggttgctg gacaccctcg 249841 acctgttcgg catcgcactg ggcatggcag cgttccggcc gggccggccc tcgcgaaccc 249901 cggcgcagct ccgcaccctg ttacgccggg tcagcggtgt cgacgccgtc atcgacaagg 249961 tcaccgccgc cggttctgag gtgcgctacc ggcggttgct tgacgcggtc gcggagctgg 250021 aggcgctggc cgcgcaggcc aaggagatcg gcggtccgat cggtgagttc ttgcgcgacg 250081 acgacacagt tctcgcccgg atggcggccg ccgtcgacgt agccctggcc gtcgggctag 250141 acgttggccc gttggacgat ccggccgccc acctgccgcg ggcggtgcgg tggcatcgtt 250201 acagcctgga caacggtgac atgcaccgca cgtgcggcgc ggatatcgct cggggatccc 250261 ttcggctgtg gtcgctggcc ggcggcatgc ccctgcaccg ataccggaag tcatcgtgat 250321 ccgcgcggct agtgatgacc cggccggggt ggacgagctg gtggcagcga tcgcgccggg 250381 gcttgccggg ctgggtttgc cggtcatcaa ccgccgcgag gtggtgctgg tgaccggtcc 250441 gtggctggcc ggggttagcg gtgtgcgcgc ggcactggcc gaaaggctgc cgcagcgtag 250501 gttcgtcgag acggcagagt tgggacccgg cgatgcgccg gtggcggtgg tgttcgttgt 250561 ttccgcggca accgcgctga ccgaatccga ttgcgtgttg ctggacaccg ccgcggagca 250621 caccgatgcg gtggtagctg tggtgtccaa gatcgacgtg caccgcggct ggcgtgacgt 250681 gcttaccagt aaccgcgaca ggctggccgc gcgcgcgtcc cgctacgccc gggtgccctg 250741 ggttggcgcg gccgccgcac ctgagctggg cgagccatac ctggacgact tggtcgccgc 250801 catccagaaa cagctcgccg atccggctgt cgcgcggcga aacatgttgc gtgcgtggga 250861 atcccggctt ctgatggtcg cgcggcggtt cgatggcgat gcacagagcg ccggtcggcg 250921 ggcacgggtc gacgcgttgc gccagcaacg gcgcacggtc ctgcggcagg ggcgtcaatc 250981 gaagtctgaa cacaccatcg cgctgcgcgc gcagatccag cacgctcggg tcaaattgtc 251041 ctactttgcc cgcaatcggt gttcgttgct gcgcgtcgag ctgcaggagc acgtcgccgg 251101 tctgtcccgg aaggacatcg ccaggttcgc ggcatacacg cgcggccggg tccaggaggt 251161 ggtcgccgag gtgggcgaag gtgccgtcgc gcaccttgcc gacgtcgcgc agctgttggg 251221 tgtgccggtg cagccaccgg tcctcgagaa cctcccggcg gtgctcccga cggttgtggc 251281 cccgccactg acatcacgac gattggagat ccggctgaca acactcttgg gcgccgggtt 251341 cgggctgggt atcgcgctga ccctgagcag gctggtggcg ggtcttactc ccggcctggc 251401 tgcatcgggg atggtggcgg gtgtggcgat cggcctggcg gtgaccgcct gggtggtgaa 251461 tgcccgcgcg ctgctgcacg accgtgtcgt ggtggaccgc tggacgggtg aggtgacggc 251521 atcgctgcgg tccgtggtgg agcagctggt cgccactcgg gtggtggctg tcgagacgct 251581 gctgagcacc gcgattagtg aacgcgacga cgccgagaac gcccgggtgg ccgatcaggt 251641 cagcatcatt gacggcgaac tgcgcgaaca cgccgtcgct gcggcgcggg ccgcggccct 251701 gcgtgaccgg gagatgccgg cggtgcgggc cgcacttgag gcggtgcgtg cagaactcgg 251761 cgagccgggt acgcccacaa caggcctgtt ctgaagcttc tgaatcgttg ttgtgagcag 251821 gcttataccc gcccaagtct tccctgacaa gttctgggcg ataacctgga taaaaagtgt 251881 ctcactaggt gagcggccgt atcagcctcg ccaccaagac gggcatacct aacccatacg 251941 taaccgcgag cacccgataa ctacgcagga gaattcgatg acctcagcga ccatccccgg 252001 tctggatacc gcgccgacga atcaccaggg gttgctgtcc tgggtcgaag aggtcgccga 252061 gctcacccag ccggaccggg tggtcttcac tgacggctcg gaagaagagt tccagcggct 252121 ctgcgatcag ctagtcgagg ccggcacgtt catcaggctc aaccccgaga agcacaagaa 252181 ctcctacctg gcattgtcgg atccgtccga tgtcgcgcgg gtggagtcgc ggacgtacat 252241 ctgctcggcg aaagagatcg acgccggccc caccaacaac tggatggatc ccggcgaaat 252301 gcggtccatc atgaaagacc tgtaccgggg ttgcatgcgc gggcgcacca tgtatgtggt 252361 gccgttctgt atgggaccgc tgggcgccga ggaccccaaa cttggtgtgg agatcaccga 252421 ctccgagtac gtcgtcgtct ccatgcgcac catgacccgg atgggcaagg ccgcgctgga 252481 gaaaatgggc gacgacggtt tctttgtcaa ggcgctgcac tcggtcggcg cgccgctgga 252541 accgggccaa aaggacgtgg cctggccctg cagcgaaacc aagtacatca cccacttccc 252601 ggagacccgg gagatctgga gctacggctc gggctacggc ggcaacgcgt tgctgggcaa 252661 gaagtgctac tcactgcgta tcgcgtcggc gatggcccac gatgagggct ggctggccga 252721 gcacatgctg atcctcaagc tgatttcgcc ggagaacaag gcttactact tcgcggccgc 252781 attcccgtcg gcgtgtggca agaccaacct ggcgatgctg cagccaacca tccccggctg 252841 gcgtgcggag acactcggag acgacatcgc atggatgcga tttggcaagg acggtcgcct 252901 gtacgccgtc aacccggaat tcggcttctt cggggtggcg ccgggcacca actggaagtc 252961 gaaccctaac gccatgcgca ccattgccgc cggcaacacg gtgttcacca atgtcgcact 253021 caccgacgac ggcgacgtgt ggtgggaggg cctggaaggc gacccgcagc acctgatcga 253081 ctggaagggc aacgactggt acttccgcga gacggaaacc aatgcggcac acccgaactc 253141 ccggtactgc acaccgatgt cgcagtgccc gatcctggcc cccgagtggg atgacccgca 253201 gggcgtcccg atctcgggga tcctgttcgg cggccgccgc aagaccacgg ttccgctggt 253261 caccgaggcg cgcgactggc agcacggggt gttcatcggt gcgaccctgg gtagcgagca 253321 gaccgccgcg gccgagggca aggtcggcaa tgtgcgccgc gacccgatgg ccatgctgcc 253381 gtttttgggc tacaacgttg gggactactt ccagcactgg atcaacctgg gcaagcacgc 253441 cgatgagtcc aagctgccca aggtgttctt cgtcaactgg ttccgtcgcg gtgacgacgg 253501 tcgcttcctg tggccgggct tcggcgagaa cagccgggtg ctgaagtgga tcgtcgatcg 253561 catcgagcac aaggccggcg gtgcgaccac cccgatcggc accgttcccg ccgtggagga 253621 cttggacctg gacggactgg acgtcgacgc cgccgatgta gccgcggcgc tggcagtcga 253681 tgccgatgaa tggcgtcagg aactgccgct gatcgaagaa tggctgcagt tcgtcggcga 253741 gaagctgccg accggtgtca aagatgagtt cgacgccctg aaggagcgcc taggttaggg 253801 cgagcagacg cataagcccc cgcacgcacg gcgtgtcgag ggctttagtg tctgctcgcg 253861 ctcgttagcg gcgggcacgc acaagttctt cgacagcgcg caaagacacc gaaagcctct 253921 cttcccaacc gcccgtgatc accacgaatg atcgtcccgc ggcgcggaga gcctgctcgc 253981 agcgggcgaa aaaggtaccg cgtgcgccgg ggacacagcg tccgtcgtcg gcgtcccagg 254041 gcacatcggg cgtggtgagc agtgtgagat cgtagggacg ccgagctaga tcacggagct 254101 cttgcgggca gccgcccgcc aggaactcgg cccacacggt cgtcgcgagc ggatccgtgt 254161 cgcagatcag gacgcgatcg gcgtcacgag ccaaggcttc ctccgacgcg atctgtccgc 254221 gaacgatttc ggcccactcc agtcctatca gtgagccgcc attgagctcc cgcaacattt 254281 tcgcccgctc cgggacccac ttcgttcgga gcttttccgc aaccgcctgt gccagcgtgg 254341 tcttcccggt ggattcgggt ccgatgatgc tcacgcgttt gacgaaggcc ggccgcacgc 254401 accgtgggat gtgttgccag tggccaagcg ggtccgcgcg gatgtcggtt gcagtcacgg 254461 gaacgacggt gcgaccgtga tcgaccgcca cgaaacgcgc tccgaggacc tgggcaaagt 254521 ccgcgttgta gggctcggca ccgaagacga agtcggggcg ggttgccagc acgccctgca 254581 ggctcgcctt ccagatgtcc cagaagtccg ggtgctccca cgggcgctgc gggttctcgt 254641 tggccagatg gaccacgcga tcgaagggga acagctcccg catccatgca acgcgctggg 254701 cgcccggaat cggctctgct gccgttgatc cgacgacgat ggtcagctca tccacccatc 254761 gccgcgcgaa ctcgcaaagg tagacgtgtc ccgcatgggg cggcatgaac ttgccgagca 254821 ccattccgtg tgtcacgacg tcgcctcagc gattcggccg gcgatgacat tctcccactc 254881 gtccagccgc gaaagaaagt acgcctcagc ctcatcgaag ctgattccga cgtcaggagc 254941 gaccttggct acgacgtcgg cgtagacccg ggaggcctct gggtcgacga attcccactg 255001 accgaggggc catttcgcgg tgagataacc cgcgtccgag tacgcttggt acaacggcgt 255061 tcccggatag gggacgatcc ggctcatgaa cttgaagccg accgtatact ttgttgcccg 255121 cagcaggcgg actgtttcgc gtagctcgtc gggctgcacg gtggggtgaa acataatggt 255181 gcccgggata acgtcaatgc cgagctgttg cagggcgttg atcgtgtcgg cggcatcttg 255241 tccacgagtg aggatctgct tgcggtaggc gcgcagttgc tcgtaggacc cagtctccac 255301 gccgatgaat acccgacgca ggcccgccct gtgcagatgt ttgaacaagt ccagatcgac 255361 aacggagtcc agccggatat cgaccatgaa gttgacgctg atccccctcc tgagtaccgc 255421 gttggcaaag tcagcggcgc gttgctgcga accggggtgt ttggagataa acaggtcgtc 255481 ggtgatggat aggaagttga cgtcgtagtc cgacaccaga taatcgatct cgtcgacgac 255541 cgcgtcaacc gacttcgccc ggtagctgtc cttccctagc atcgcggaca tggccccggt 255601 gccgcagaac gtgcagcggt aggggcatcc gcgggtggaa aagacggagg cggcgaagcc 255661 atcagcaagg acggtcggca actcgtcgcg agccgggcga ggcaactcgt caaggtcgac 255721 cagcgaggag ggtgtgcgca ggatctgtcc ctgctcacta cggcgggcta gtcccgggac 255781 gtcgtcaacc gcagcgtcat tcgccagggc caaggccagc ttggtgaacg ctacctcccc 255841 gtcgccaacg acgacgtagt cgaaacagtc atgctggcgc aggatgcgct cgtagttcag 255901 tgttgccatc gcattcccga tgacgatgcg cacgccatcc caggcctgtc tcgcgcgctg 255961 cgccaaccac aacacctccg gaaatgtgtc gatgcaggaa aagccgacaa gccggggcgt 256021 tcctgataag gcggcggcgc tttgcatggc cagccacgtc tcctgcacgg acccgtggcc 256081 ggcgaccagg ccgttgacgg aggtgactgc gatcccttgg gtcttggcgt atgccttgat 256141 cgacatcatc ccgaggtgct ccatgggcat ggagcaatac agccacggat ctccaagctt 256201 gagtccgtca acgtacgaca gcccgtcttg acggacgcct ggaggattga ccagaagagt 256261 tgccacgtgg agaactttac aaacgatttc ggctggtgat gggcggaatt gcgccctgcg 256321 gctctggtcg ccgggccgcg acgtaccctc ggcgcatgca gattcgcccg tatatcgccg 256381 ccgataagcc cgccgtcatc ctgtatccgt ccgggacggt catcagcttc gacgagttgg 256441 aggcccgcgc caaccggttg gcgcattggt tccgccaggc tggtctgcgc gaggacgacg 256501 tcgtggccat cctgatggag aacaacgagc acgtgcacgc ggtcatgtgg gcggctcgcc 256561 gcagcgggtt gtactacgtg ccgatcaata cccacctgac cgcctccgag gccgcctaca 256621 tcgtcgacaa cagcggtgcc aaagcaattg tcggttcggc ggcgctgcgc gagacctgcc 256681 acggcctggc cgaacacctt ccgggcgggc tgccggacct gctgatgctt gccgggggcg 256741 gtctggtcgg ctggatgacc tacccggaat gcgttgccga tcaaccagac accccgatcg 256801 aggacgaacg cgagggtgac ctgctgcagt actcgtcggg aacgactggc cgaccgaagg 256861 gaatcaaacg cgaattgcca cacgtctcac cggatgcggc acccgggatg atgccggcac 256921 tgctcgattt ctggatggac gccgactcgg tatatctgag tcccgcgccg atgtaccaca 256981 ccgctccgtc agtgtggacg atgagcgcac tggccgcggg cgtcaccacc gtcgtgatgg 257041 agaagttcga cgccgagggc gccctcgacg ccatccagcg ctaccgggtg acccacgcgc 257101 aattcgtccc ggccatgttc gtccggatgc tgaaactccc tgaagcagtt cgtaattcgt 257161 atgacatgtc cagccttagg cgagtgatcc acgcggccgc tccatgtcca gtccagatca 257221 aggagcagat gattcactgg tggggaccga tcatcgacga gtactacgcc tcctcggaag 257281 ccagcgggtc gacgttgatc acagccgagg attggttgac gcatccgggt tcggtcggca 257341 agcccataca gggcggggtg cacatcgtgg gcgccgacgg cagcgagctg ccgccgaacc 257401 agccgggcga aatctatttc gagggcgggt accccttcga atacctcaac gatccggcga 257461 aaaccgcggc gtcgcgcaac aagcacggct gggtaaccgt cggcgacgtc ggctatctcg 257521 acgacgacgg ctacttgttc ctgaccggcc ggcgccacca catgatcatc tccggcggcg 257581 tgaacatcta cccgcaggag gcggagaacc tcttggtcgc ccaccccaag gtgctcgacg 257641 cggcggtgtt cggcgttccc gacgacgaga tgggtcaacg tgtcatggcc gcggtgcaaa 257701 ccgtcgactc cgccgatgcc aacgatcagt tcgccggcga gctattagcc tggttacgag 257761 accgcttgtc acacttcaag tgtccaaggt cgatcgcgtt cgaaccgcaa ttgccgcgca 257821 ccgacaccgg aaagctctac aagagcgggc tggtcgaaaa atactcggtg tgaccgatgc 257881 tgccgggggc ccgacctgtc cacccagaca ccggctatat cccgccccgg gccaccagtt 257941 gtccggctat cacgttgcgc tggatctcgt tggtgccttc accgaggatc atcaagggtg 258001 cgtcgcggaa gtaccgttcc acgtcgtact cggtcgagta gccgtagccg ccgtggatac 258061 gcacggcgtt tagggcgatt tccatcgcga cctcggaggc gaacaacttg gccatcccgg 258121 cctccatatc gcagcgttgg ccgctgtcgt accgctcggc ggcatagcgg gtcagctgac 258181 gggccgctgt gagcttggtc gccatgtcgg ccaggtaatt gccgaccgcc tgatgctgcc 258241 agatcggtcg gccaaagctt tcccgttgct gagcgtaggc cagcgagtcc tcgagtgccg 258301 ccgtcgccac gcccagcgcc cgcgcggcca cttggatgcg acccgtttca agtcccttca 258361 tcatctgcga aaagccttga cccatggctc cgcccaggat cgccgagacc ggcacccgga 258421 ggttgtcgaa cgacagctcg caggattcga cgcccttgta acccaacttc ggcaagtccc 258481 gcgacaccgt gagtcccggc ccgggttcga cgagcacgat cgacatgcct tggtgccgcg 258541 gtgtggcgtt cgggtcggtc ttgcacagca ccgcgaaaag tccggaccgg cgggcgttgc 258601 tgatccacgt cttgcagccg ttgatcaaca acccggcaga gccttcaggg ccgtcggcca 258661 acgccgtggt cgacatgttc tgcagatccg agccaccgcc gggctcggtt agcgccatgg 258721 tggcccgcag ctcgccactg gccatcgggg gcagatatgt ccgccgctgt tcctcggtgc 258781 caaacagggt cagcaatttg gcgacgacgg tgtgcccgcc catcgcgccg gccaggctca 258841 tccagccgcg tgccagctcc tgggtgactt gcacatagca cggcatcgac accggcgacc 258901 cgccgtactg ttcgtcgatc gccaggccgt agatgccgat gtgtttcatc tgctcgatcc 258961 acgcctccgg gtagctattg gcatgctcga cctcacggac ggttggcttc acgtctcggt 259021 cgatgaatgc ccgcacggtg gcgaccagca tcgcttcgtc gtcgttgagc tcgttgcgca 259081 ccttttgtcg ccctccgtat tgaccccctg tccgatagcc tgccagaatg tggcgttgtg 259141 gctagcgggt atgggggcat ccgcgtcggc gggccctatt tcgatgacct gtcaaaaggt 259201 caggtgttcg actgggcgcc gggggtcaca ctgtcgctgg ggctggcggc cgcccatcag 259261 tcgatcgtgg gtaaccggct acgcctggct ctggactccg acctgtgtgc ggcggtgacg 259321 ggtatgccgg ggccgctggc gcatccgggc ctggtttgcg atgtggcgat cggccagtca 259381 actttggcga ctcagcgggt caaagccaac ctgttctacc gcgggctcag gtttcaccga 259441 tttccggcag tgggcgacac cctctacacc cgtaccgagg tggtggggct gcgagccaac 259501 tcgcccaaac cgggccgtgc gccaaccgga ttggcggggc tgcggatgac cacgatcgac 259561 cggaccgatc ggttggtgct cgatttctac cggtgcgcca tgctgcccgc cagccccgat 259621 tggaaacccg gcgctgtgcc aggtgacgac ttgtccagga tcggtgccga cgcgccggcg 259681 ccggccgccg atccaaccgc acactgggac ggtgcggttt tccgaaagcg ggttcccggg 259741 ccgcacttcg atgccggtat tgccggtgcg gtgttgcata gcaccgcaga cctggtcagt 259801 ggagcgccgg agctggctcg gctcaccctc aatatcgctg ctacgcacca tgattggcgg 259861 gtcagcggac gacggctggt ctacggcggg cataccatcg gactggcact cgcgcaggca 259921 acccggctat tgcctaacct ggcgaccgtc ctggactggg aatcctgcga ccacaccgct 259981 ccggtacacg agggcgacac cctctacagc gagctgcata tcgagtctgc gcaggcccac 260041 gcagacgggg gtgtgctggg actgcggtca ctggtctacg cggtcagcga ttcggcgagt 260101 gagcccgatc ggcaggtgct cgactggcgt tttagcgcct tgcaattcta ggttcggtta 260161 ctaagggcca gcgcggcacg caaactgttg cactgactag tgaagaacct ttgtgagacc 260221 ccaacattcg gggccacacg atcgaaaccg tggaaggcgc cttcgactac ttctacctgg 260281 catggcaccc cggctgctgt cagacgttcg gcataggcca gatcctcgtc gtggagcagg 260341 tcgtgggtgc cgacgccgat ccatgccggc gccagccctc ctaggtcgtc acgccgtccc 260401 gggaccgcga cccgtgcgtc cgcatcgcca agatatgccc gccagccgaa ccggttggcg 260461 cgcccgttcc atagccggta gtgcgggttg gcgggggcga tcgacgtccg gtcgtcgagc 260521 atggggtaca ccagcaactg aaatgccggt gtgatgccgc cacggtcgcg ggcaagcaga 260581 gccagcgccg ccgcgaggcc gccgccggca ctagcgccgc cgattgccac ccgcgcgggg 260641 tccaccgccg gcaggctggc cagccaggtc aacgccgagt agcagtcgcc cagggcggca 260701 ggatacggat tttccggcgc caggcggtag tccaccgatg cgacagtgat gcccagtctg 260761 ctgctgaacc ggaggcagag ccgatcgtcc tgttgcgcgg tgcccattac gtatccgccg 260821 gcgtggatcc acagcagcgc gggcgctggt tcgttgctgc cggcgggtcg gtatagccgg 260881 acaccgaccc cggattccag ggtgagcacc tcgatatcgg ggggtgtacg ggacattcga 260941 agccccgcga cgacgatcaa tgcccgcatg actggcaggg tgcgaggacc gaccagctgt 261001 cgtggggtga cgacggcgat gcgacgcagg tcggggtgga cttcgttgcc ggacaccggt 261061 ccagtatgcg tcggcgcaat ttcgcctcgg tacagcgatg gctttggcag gctgcggtta 261121 gtcgaacgag gatcgggatg gtggcctgat gagtgatcca gcaagagggg cggaagccga 261181 ggatgcctac ggttttcccg ccgggctgtg gcgctggctg cagcggcatc caccgccggc 261241 gttgcaccgg ctcacccggt ttcgcagccc gttgcgtggt ccgtggttga cgtcggtgtt 261301 cggcctggtg ctattggtgg cgttgccttt cgtcatcatc accgggctac tttcttatat 261361 cgcctatgcg ccgcagctgg gccaggccat ccccggtgac gtcggctggc tgcgactccc 261421 cgctttcacc tggcccaccc gtccgtcctg gctgtaccgg ttgacccagg ggctgcatgt 261481 ggggctgggg ctggtgatca ttcccgtggt gctggccaag ttgtggtcgg tgataccgcg 261541 gctgtttgtg tggccgccgg cgcgctcgat tgcccaggtg ctcgaacggt tgtcggtgct 261601 gatgctggtc ggtgggatcc tgttccagat cgtcaccggc gtgctcaaca ttcagtatga 261661 ctacatcttc gggttcagct tctacaccgg ccactatttt ggggcttggg tcttcattgc 261721 gggtttcctg ttgcatatcg tggtcaagat cccccacatg gtcaccgggt tgcgatcgat 261781 accgatgcga gaagtgttgg gtaccaacgt ggctgacacc cgggcgcagc cgtgcgatcc 261841 ggacgggctg gtgtcggtca atccgggcga ggccacgcta agcagacgcg gtgccctggg 261901 attggtcggt gccggggtgc tgctgatcgg ggtgctgacg gttgggcaaa ccctgggcgg 261961 gttcacccgc aaggccgccc tgctgctgcc ccggggccgt gtcgtgagcc cgggcgactt 262021 cccggtcaac aagaccgccg ccgccgccgg gatcaccgcg gaggccattg gccccgactg 262081 gcggctggtg ctgcgtggcg ggcctgcgga agtagtgctg gatcgcgcca cgctggccgg 262141 cctgccgcaa cgcaccgccc ggctgccgct ggcctgcgtc gaagggtggt cggccgtgcg 262201 cacctggagc ggcgtgccgc tggccgagct ggcgctgctg gcgggcgtgc cggcggcgcg 262261 ctcggcacgg gttacatcgc tgcagcgcgg cggggcgttc ggcgaggcga agctggcggc 262321 aaaccagatc gccgaccccg atgcgctgct ggcgttgcgg gtcgacgggg cggatctgtc 262381 gctgaatcat ggctacccgg cccgcatcat cgttcccgca ctgcccggtg tgcacaacac 262441 gaaatgggtc gctggcatcg aattccacaa gaggtgaaat gttcgacatt gcaacgcgtt 262501 tcaaaaactc ctacgggtca ggtccattgc acctgctggc gatggtgtct ggcttcgccc 262561 tgctgggcta catcgtggcc accgccaggc cctcggcgct gtggaaccag gccacctggt 262621 ggcagtcgat cgcggtctgg tttgtcgccg ccgtcgtagc ccacgacctg ctgttgtacc 262681 cgctctacgc gctggccgac cggatcctgg ccaggctagt cggcaggcgc gacgtctcgg 262741 cgccccgccg ccgcccggaa ctaccggtac gcaactacat tcggatcccg gcgctggcag 262801 ccggcttgac gctgctggtt ttcctgcccg gcatcatcag acagggtgcg ccgacatacc 262861 tggatgcgac cggacagacg caggaaccat ttctgggcag gtggttgctg ctcaccgcgg 262921 tcgcgttcgg gatcagcgcg gccgcttacg ccattcggct ggtggtggcg cacgtgaggc 262981 ggcgccgagc ggggtgttcg cgggtcgacg cgatcgacga ggagtaggct cccaccatga 263041 accagcgacg cgccgccggg tcaaccggtg tggcctacat cagatggttg ctacgtgccc 263101 gtcccgctga ctatatgctg gccttgagtg tcgccggggg ttcgctaccg gtggtgggta 263161 agcacctcaa gccgctcggc ggcgttactg ccatcggcgt ctggggcgcc cggcacgcat 263221 ccgatttctt gtccgcgacg gcgaaggatt tactgacccc cggtatcaac gaggttcgcc 263281 gtcgagatcg tgccagcacg caggaggttt ccgtcgcggc cttacgcggc atcgtttcgc 263341 ccgacgacct tgccgtcgaa tggccggcgc cggagcgcac gccgccggtc tgcggggcgc 263401 tgcgccaccg ccgttacgtc caccgccgtc gcgtcctcta cggcgacgac ccggcccagt 263461 tgctcgacgt atggcgccgc aaagatatgc ccaccaaacc cgcgccggtg ttgatcttcg 263521 tcccaggcgg tgcctgggtg cacggcagtc gcgccatcca ggggtatgcg gtgctgtctc 263581 ggctggccgc acaggggtgg gtgtgcctat cgatcgacta ccgggtcgca ccgcatcacc 263641 gctggccacg acacatcctg gatgtcaaga ccgccatcgc gtgggcacgg gccaatgtcg 263701 acaaattcgg cggtgaccgc aatttcattg cggtggctgg ttgttcggcc ggcggccact 263761 tgtccgcgct ggccgggctc accgccaacg acccgcaata tcaggccgag ctgccagagg 263821 gctccgacac gtcggtcgac gcggtggtgg ggatttacgg ccgctacgac tgggaggacc 263881 gctccacccc ggaacgtgcc cggttcgtcg attttctgga gcgggtagtg gttcagcgca 263941 cgattgatcg tcaccccgaa gtgttccgtg acgcgtcgcc gatccaacga gtcaccagaa 264001 atgcaccgcc attcctggtg attcatggca gccgtgactg tgtcatcccg gttgagcagg 264061 cgcggagctt tgtcgagcgg ttacgagcgg tctcccgctc acaggttggc tacctggagc 264121 tgcccggtgc gggccacggc ttcgacctgc tagacggcgc tcgcaccggc ccgacggcac 264181 acgcgatcgc gctgtttctc aaccaggttc atcgcagccg ggcacagttc gcgaaagagg 264241 tcatctaaac gccggccaat tgtatggtcg ccctatgagt agggggctgc ggtgaaacgg 264301 ctcagcggct gggacgcggt actgctttac agcgagaccc cgaatgtgca catgcacaca 264361 ctcaaggtcg ccgtgatcga attggattcg gacagacagg aattcggtgt cgacgcgttt 264421 cgcgaggtga tcgctggccg gctgcataag cttgagccat tgggctatca gctggttgat 264481 gtcccgttga agttccatca cccgatgtgg cgggagcact gccaggtcga tctcaactac 264541 cacatccggc cgtggcggtt gcgcgccccg gggggtcggc gcgaactcga cgaggcggtc 264601 ggagaaatcg ccagcacccc gctgaaccgc gaccacccgc tgtgggagat gtacttcgtt 264661 gaggggcttg ccaaccaccg gatcgcggtg gttgccaaaa ttcaccatgc gttggctgac 264721 ggtgttgcct cggcaaacat gatggcacgg gggatggatc tgctgccggg accggaggtc 264781 ggccgctatg tgcctgaccc cgctcctacc aagcggcagt tgctgtccgc ggcgttcatc 264841 gaccacttgc gccacctcgg ccggattcct gcaaccatcc ggtacaccac gcagggtcta 264901 ggccgggtgc gacgtagctc gcgcaagctc tcacccgcac tgaccatgcc atttaccccg 264961 ccaccgacgt tcatgaatcg gataaagaag ccgttgtcca agccctcggg acggccgccg 265021 ccgcacacca accgagcgcc ctcctcgatg cccttggcga tgtagcgctt caacgcgagt 265081 ccgctgcttc tccgagatca gcggcccgat ctgagctgcc gggtccgacg gcgggcccac 265141 cgggagagcc gttacgaaat tagttaccgc agccacgatt tcatcgtacc gggagcgcgg 265201 agccagaatg cgggtctggt tgacgcagcc ctgtccggcg ttcatgacgc cggagaacac 265261 catcatcgga atagctgcgg ccaggtcgac gtcctcgaga atgatggccg ccgacttgcc 265321 gccgagttct aaggtgcacg gcttgagcat ctcagcggca cgcctgccga cctctcggcc 265381 gacggccgag ctgccggtga aggtaaacat gtcgatgtcc gggttagacg tcagcgcctg 265441 accggtctca atccctcccg gcactaccga caacaccccc tcgggcaggc ccacctcggc 265501 gaacacctcc gccaaagcgt ttgcggtcag cggtgtttcg gcggcgggct tgagcacgat 265561 ggtgcagccg gccagcagcg ccggcgcaat cttgttgacg gccagaaaca gcgggacgtt 265621 ccaggccacg atcgcgccca ccacaccgac cgactcacgg ctgacaatgc tctgtccata 265681 ggagccggtg cgggtttcgg tccaggtgac cttgtccgct gcaccggcaa agtagttcat 265741 cgcccccatc gaacccatcc agtgcatcgt ctcgatgatg gtcggcggct ggccggtttc 265801 ggctgcgagc agcttggtga acaggtcctt gcgctcagcc agcatcttga ccgccgcagc 265861 gatcaccgcc gcacgctcgt gcggcggggt cgagggccag gggccgttgt cgaacgccgc 265921 acgtgctgcg gcgaccgcgg cgtcgacgtc ggcggcggcc gccatcggca ccttgccgac 265981 atattcccca gtggctgggc agcgtacctc gataacatcg gaggtcgacg gtttggtcca 266041 cttgccgccg atgaaaagct tgtcgtattc cgtggcactg tcagacatat gcgccgctcc 266101 tcctcatcgc tgcgctcggc atcgtcgccg gcggtcatgg cgtcacccta cccaagccga 266161 acgcgaaacg agaacgtgtt ccattattag ggtgtgagca ccaataccag attgctcacc 266221 aggaactcac gcagcaccgg gacggatgtc agccaccacg cccatctggg gtggtagcgg 266281 ggaaatacgg ctaacgcggc tccggtgccg gcagcccagc gcagaccctc ggcggcggac 266341 acggcaaaca acgacgaccc atagttgttc tttgccggat ggccgtgttt gcggacatat 266401 cgggcggcgg cgcgggcgcc gccgaggtag tggctgaggc ccatctcgtg cccgccgaat 266461 ggccccagcc aaaccgtgta ggacagcacg accaacccgc ctggcttggt cacccgcagc 266521 atctcggtgc caagctgcca ggggcgcggc acgtgttcgg cgacattgga ggacaagcag 266581 atgtctaccg agtcgtcggc caacggcagt gccatgcctg acgcccggac gaacatgccg 266641 ggccggccgg tgaacgcagg tccggcggca tgcatttcat cagggtccgg ctcgacgccg 266701 atgtagccga caccggcgtc ggagaacgcc gtcgcgaaat accccggccc gccgccaacg 266761 tcgagcagcg tacggccaac tggcggctcg ctatgtgtgg ccagccacag atcgccgatc 266821 attgctgcgg tgtcggccgc cagtgtgcga tagaaccgtg ccgggtcgcg ctgctcgtag 266881 cggaagtctg ccagcagtcg cagcgagcgc cgcagtgtcg cccgtcgcgc gaacacatcg 266941 gtgaccgcca cctggcacac cctacggccc gctaggctat cgaccaatgt ctgctctgcg 267001 ctcggtgttg ctgctgtgct ggcgcgacat cgggcacccc caggggggcg ggagcgaagc 267061 ctatctgcaa cgcatcgggg ctcagttggc cgcatcgggc attgcagtca cgttgcgcac 267121 cgctcgctat cccggtgcgc cacggcatga actggtcgac ggggtgcgga tcagtcgtgc 267181 cggcgggcgc tactcggtgt atctatgggc gttgctggcg atggccgcag cccgatgtgg 267241 gcttgggccg ctgcgccgag tgcgcccgga tgtggtcgtc gatacccaaa acggctggcc 267301 gtttgtggcc cggctgttgt atggccggcg gtcgctggta ctggtacacc attgccaccg 267361 tgagcagtgg ccggtggccg ggcggatgat gggtcggctc ggctggtatg tcgagtcgat 267421 gttgtcgcca cggctacacc ggcgcaacca gtacgtgacg gtgtcgctgc cgtcggcgcg 267481 ggatctgatc gccctcggtg tggacagcga gcggatcgct gtggtgcgca acggcctcga 267541 cgaggcgccg tcgccaacgt tgtccggccc acgtgcgccc acgccgcgtg tggtggtgct 267601 ctcccggctg gtgccgcaca agcagatcga ggacgcgttg gcagcggtcg cggagctaca 267661 gcctcggata ccgggcctgc acctagacat cgtcggcggt ggctggtggc ggcagcgcct 267721 cgttgaccat gtgcaccggc tcgacattgc tgacgccgtt acctttcacg ggcatgtcga 267781 cgatgtgacc aaacaccatg tgctgcaaag ctcctgggtg cacttgttgc cctcacgtaa 267841 agagggatgg gggctcgcgg tcatcgaggc ggcccagcac ggcgtgccca ccatcgggta 267901 cagatcctcc ggtggtttgg cggactcgat cgtcgacggg gtgaccggca tattggtcga 267961 cgaccgggcc gaattggtgg cttggctcga acaactgctg tccgattcgg tgctgcgtga 268021 ccaactcggc gccaaggcac aggcgcgtag cggtgagttc tcctggcggc aaagcgccga 268081 agcgctgcgc agcgtgttgg aggcagtgca ggccagccgt tttgtcagcg gcgtggtttg 268141 agccggcttc gacagactta atcctgggcg cggctcgccg gcgtgtcttc gcagtggtgt 268201 aagtgtcggc gcacccaata gccggccgcg ccagcgccgc cgaccagcag catcgaaagc 268261 cacgcccaat gcgcgagcat tgtcgctttg aggcgggccg acgatgcacc ggaggtttgg 268321 ccgccgaccc gataaagagc caattcgtcg tcgcggtgcg ccgctgctag ccggccgagg 268381 gtgcgtgcgg ccgcgcccat gtcgccggcg ctgtcggatt cgacgaccag ccacccgacg 268441 ccggccgcgg ccaaggttga cggatggggc ccggtgagca gcagctcctg gaccgcccgg 268501 gcgtgcgcgt cttcgccggg aacggtcacc ccggaaatga ccagatcacc tgtggtcagc 268561 acatcggcgc gaacccaacg ggggagcgga tcgagtaccg gtgccgaacc ggaccacgag 268621 aagcgccgca tggtgcccgc gggcaagacc gcaaccgtcc ggggatcggc attgatcgcc 268681 gctgccaccg ccgcccaacc ggacgggtag tgcacaggcg caaccttgcc ccacaccccc 268741 cacgccaagt caggcagcgt taggaccagc gccagacagc agaccaccgc cgccgttgcc 268801 ggtcgcagcc agcgtcgcag cgttagcacc gtgcccgcac cggagagtgt gtatccgggt 268861 accgccagcg cgacccactt ctgtccgtcg cgcagcacgc ccaggccggg tgcggcatcg 268921 accaccaccc gtagcgcgtg cagacctggg ccggtcgcaa ggacagccgg gaccatcacg 268981 gacaccgccg ctagtgtcag cagcggcact gccacgggcc ggcgcgccac agtcggtagt 269041 ccgatcgcca ccatggcgag tagtacgacg gcggatgcca ctgcgaaaag cgttgtccgc 269101 gagctaggta cggcctcgcc gttccagatc ccaccgagac tggccaagct gccaagcgtg 269161 cccagccccg gttcggcgcg tggcgcgaac gcggtaaccc caagctgatt ggctgccgtg 269221 tggctggtca acgacgagcc cagcgccgac gccgtcagcc agggcagcgc acccaccagc 269281 gcggagccca acgccgcgac cccacattgc cagcgcgggc ggcccgcgcc gggcatcgcc 269341 acgcacacca ccgcaactgt cgcggcgagt agcagcccgg acggggtcag gccggccagc 269401 gcaacccaga acgccagccc aaaaagcccg aaccaacccg cgccaaccgt tgttcgcatc 269461 gttaacatcg cggtcgcaac ccagggcaga cacccatagc cgaccagcag gctccaatgg 269521 ccctgcaaaa gtcgttcggc cacataggga ttccagatcg ccagcgtgat cgcgacaaac 269581 tggccggctg cccccgctgc gggtagtgcc gttgcgacca gtcgggccgc gccccagccc 269641 gccagccaaa gccccagcag cagcagcgct ttcaccacga cgccgccgtc gacgaggtgt 269701 gacgccaaag cgaccgcgaa gtcctgcgga gtcgcccggg gcgccgatgt cagccctagg 269761 gcgttggccg acacatacga ccgtggtgtg gacactgcat cgcgcagcag taggtatccg 269821 ggccgcagta gcggcgcggc caacagcagc accaagacca gcgcgtaccc cggtcggaac 269881 cagcgcacgt cgcctgatta gcgccgctcg ggcgggccgg ggtcgggatg ccccgcgtcc 269941 ggcggtgggg gcggctgcgc cgaaccgagt cggggcggat ccgagccatt cggctcgcgc 270001 gggaagtcgg ggcgctgcgt gggcagtttc tcggtctcag cctccgctcc aggcaccggt 270061 tcttcgaagc cgccgcggcg gtaatcgtgg tcgtcccgat ccccgctggc agccatcagc 270121 gcaccttcgg tccgaaggct aaacgacgcg aacagaccac cgccgaccag cgcgaccaag 270181 ccggccgcgg tgaatgtaat cggcagcacc cgcgaccaca gcgccagccg gtcgcgctcg 270241 tcgcgagccg cgttgacctg ggattcgacc gtctcttcgg tggaggtgac ctggtagtcg 270301 gcaaacgtga cctctggttt gagtgggtca cgagcgaagt agtggttggc gcgttcggtt 270361 tctttgacga tggtgccgga caccgggtcc acccagaatg ttcgctgcgc cgcgtaatag 270421 cgggtcatgg tgatttgctc gttcggatca ccgggtagcc cccacatcgc cgctgatgtg 270481 gtgactttgc cgtcctcgtc gccggcgtac agcgacgggt atttgagggg agccaccagc 270541 ttcccctcgg gggtgtagcc gacgttctgc gtgaagcggt atgtggttaa accgttgacg 270601 tcctcctcgc cttcgtagtt ggcgtcaaac gccttctgtg cgatggggtc gaaatagggg 270661 tatgtcttct tctcggtgtg aaacgggaac cggtaagaca gcccgtcgtg ccgcagcgga 270721 atggccgtcg gcgggttctc gtcattgagg ccccgcggtt tctggacggc gccgccggtg 270781 tgggtgtcgt cggagacagc catcgccgtc ttgcggttga gggtgaccgt gtcgacgatc 270841 gccagcagca gcccgctgtc cttctgcttg tcggtgcgcc ggagcgagga tccgacctga 270901 agtgtgacca cgtcggcgtt ggcgggcgat tcgacggtga cttgctgttg ggacaccagc 270961 ggcacgtcct ggttgaccac gatgtgctcg gtggctagcg acgccgagtc gagtgccgtt 271021 ccagtgccgt cgctgatcaa cgtggcatcg atatcgagtg ggatctcagc gatcctcctg 271081 gtggtatagg tcgacagcag cagggcggcg atcagtaggg cggctccgag tccgatagcg 271141 ccgcacgcgg cgaaccgcaa catgactgcc cggttcacct gcgccgctct cccccgcaag 271201 cgggtggtgc ccccacctca tcgcttcgtc ccccgcaagc gggcggtgcc cccactgcat 271261 cgtcgccggc gcggttcacg ttgctgtgac ctccttatgg tccatggact cgtcggtcgg 271321 gacccgctcc gacctgacca agcgaggcaa aacccgtttg accctaacag cagagcgtat 271381 gggcccggcg gacgaatcgg gtgcaccgat tcgcccgcaa acacctcaca ggcacactgt 271441 gttggtgacc aacggccagg tggtgggtgg gacccgtggc tttctgcccg ccgtcgaggg 271501 aatgcgcgca tgcgcggccg tcggcgtcgt ggtcactcac gtcgcgttcc agaccgggca 271561 ctctagcggt gtgggcgggc ggctgttcgg ccgcttcgat ctggcggtgg cggtgttctt 271621 cgccgtgtcg ggattcttgt tgtggcgcgg acacgccgca gcggcgcgag atctgcggtc 271681 acacccgcga accggtccgt atctgcgatc gcgggtggcg cgcatcatgc cggcctatgt 271741 ggtggcggtg gtcgtcatcc tgtccctgct gcccgacgcg gatcatgcca gcctgaccgt 271801 gtggctggcc aacctgacgc tcacccagat ctatgtgccg ctgaccctga ccggcggcct 271861 gacccagatg tggagcctgt cggtggaggt cgccttctat gcggcgctgc cggtcttagc 271921 gttgctgggc cgccgaattc cggtcggtgc ccgagtgccg gcgatcgcgg cgctggcggc 271981 gctcagctgg gcgtggggct ggctcccgtt ggacgccggg tcggggatca acccgttgac 272041 ctggccgccg gcgttcttct cgtggttcgc cgcgggaatg ttgctggcgg agtgggccta 272101 cagcccggtc gggttgccgc atcggtgggc gcgccgccgc gtggcgatgg cggttaccgc 272161 gctgctgggt tacctggtgg cggcctcgcc gttggcgggt ccggagggcc tggttccggg 272221 cacggcggca caattcgcgg tgaagaccgc gatgggctcg ctggtagcgt tcgcgctggt 272281 ggcgccgctg gtgctggacc ggcccgacac gtcgcaccgg ctgctgggca gccccgcgat 272341 ggtgaccctg ggccgttggt cctatggcct gttcatctgg catctggccg cgctggccat 272401 ggtgtttccc gtgatcggag cgttcccgtt taccgggcga atgccgacgg tgctggtgtt 272461 gacgctgatc ttcggtttcg cgatcgccgc ggtcagctac gccctggtcg agtcgccctg 272521 ccgggaagcg ttgcgccgct gggagcgccg caacgaaccc atatcggtcg gcgaacttca 272581 ggcggacgcg attgcaccct gactcggccg gctgacacct ggcgggcacc tagtcgatcg 272641 tgcccgctgg cacgatccac tgacagggct gaccggtcac ggcggcgatg agatcgaagt 272701 ccgcgtcgta atgtagaacg accagcccgt gttcctcgcc ggccgcggca atgagcaggt 272761 ccgggatttt gcgaccgcgc tgactacgcg cagcgagtag gcgctggatt ccaagcgcgc 272821 ggcgatgatg cgatgccgtc gattcgatga ggtcgaacgc gctcaatgcc accatgagcc 272881 gctgccactc ggtctcattg cgtgcggagt acccgacttc aaggtcggtt atttgcgtgc 272941 gagcgacggc accggcctca gccaacggtt ccaccgcccg ccgcacggcg ggccggctga 273001 gccttttgat cacgctggtg tcgagaagat atttcagcgc catgcttcgg cgcggtcctc 273061 tggcggtgcg gcggccagcg tgtcgagagc ggcggcgacg cgttgaactc gctgagacgt 273121 ggcttgccgc agggccgcgt tgacggtgtc tttgatcgtc gtcgtgccca attctgtacg 273181 agccatgttt aaagcctgct cgtcgatgtc gacgagatgt ttcgccatga atcggagtat 273241 atatcaataa ggagccgata tatatgcaca atgccaagcc catggcattc gcccggcgcg 273301 gctgtctcac tgatagccgc cctgccgctc gaagatgcgg cgcgggttgt cgacgagcat 273361 ggtgtgcagc tgctcgtcgg tgacgccgtg ctgcttcagt gcggggatga cgtcgttgtg 273421 gatgtggagg taatgccaat tcggcatcgc caccggcacc agctcctcgg gaagcgcgtc 273481 gaaatagcag caggcgtcgt gtgatagcac catcttgtcg gcatggccgc gctcgcacat 273541 tcgggccacg atgttcaccc ggtcctgaaa cggtgagatc acgtcgacgc cgaaccggtc 273601 catcccgagg taggagccgg cggcgatgag ctcttccagg tagccgacgt cggtgctgtc 273661 gccgcagtgt ccgataacca cccggctcag ttccaccccc tcctcggcga agatgcgttg 273721 ctggtcaagg ccgcgccgca gcccggcgtg ggtgtgggtg gagatcggcg ccccggtgcg 273781 tttgtgtgct tgggcgaccg cgcgcaacac ccgctcgaca ccaggggtga ggccgggttc 273841 gtcggtggcg cacttgagga ttcccgcctt gatgccggtg tcggcgatgc cgtgctcgat 273901 gtcgcggacg aacatgtcgg tcatgatctc cgggccgtcc agctgtgcgc ccggcccgag 273961 gtagtggaag tagaacggga cgtcgttgta ggtgtacaag ccggtggcca cgacgatgtt 274021 cagctcggtg gccgcggcca cccgggcgat gcgcgggatg tatcggccca gcccgatcac 274081 cgtgaggtcg acgatggtgt ccacgccgcg ggccttgagt tcgcctagcc gggcgatggc 274141 gccggccacc cgcttgtcct cgtcgcccca ggcttccggg tagttctgcg caatctcggt 274201 ggtcatgatg aagacgtgct cgtgcatcag cgtgacgccg agatcagcgg tgtcgatggg 274261 tccgcgagcg gtatttagtt ctggcacgtc actgatgcta ggccgcaatc ggtgtcttgc 274321 ggggccgcag tgcagtagcg tcaccctcgt cgttgaccga accgctcggg agccaattct 274381 tatgctgctc aaccccaacc atttgacacg caaataccca gaccgtcgct ccggggagat 274441 catggccgcg acggtggact tcttcgagtc cagggggaag gcccggctca agcacgacga 274501 ccacgagcgg atctggtact cggacttcct ggacttcgtc gggcgggaac gcatctttgc 274561 ttccctactg acgccggcct cctatggcgc cgatgattgc cgctgggaca cctaccggat 274621 cagcgagttc gccgagatca tgggcttcta cgggctgagc tactggtacc ccttccaggt 274681 gaccgcccta ggcctgggcc cgatctggat gagcgccaac gaggacgcca agcgcaaggc 274741 cgccgcgggg ctcgaggccg gcgaagtgtt cgccttcggc ctgtccgaac agacccacgg 274801 cgccgacgtc tatcagaccg acatgatcct tacccccagc gacggcggct ggaccgccaa 274861 cggcgagaag tactacatcg gcaacgccaa cgtggcccgg atggtctcca ccttcggcaa 274921 gatcgccggc accccagaaa gccaggagta cgtcttcttc gtcgccgact cccagcacga 274981 gcggtatgac ctgatcaaga atgtggtgaa ctcgcagaac tatgtggcca attacgcgct 275041 gcgcgattac ccggtcaccg aggccgacat cctgcatcgt ggcgccgaag ccttccacgc 275101 cgccctcaac acggtcaacg tctgcaagta caacctgggt tggggtgcca tcggaatgtg 275161 cacccacgcc ctctacgagt cggtcaccca cgcggccaac cgtcacctgt acggcactgt 275221 ggtgaccgac ttcagccacg tgcggcggct gctcaccgac gcctacgtgc ggctaattgc 275281 gatgaagctg gtcgccagcc gggccagcga ctacatgcgc agcgcgtcgg ccgccgaccg 275341 tcgctacctg ctctacagcc cgctgaccaa ggcgaaggtc accagcgaag gcgagcgggt 275401 catcaccgcc ctgtgggacg tcattgcggc caaaggggtg gaaaaggaca cgtttttcga 275461 gaccgtggct cgcgagattg gcctgctgcc caggttggaa ggcaccgtgc acatcaacat 275521 cgggctactc ggcaaattca tgcccaacta cctgttcgct cccgactcca cgctgccggt 275581 catcccgcgt cgcgacgacg ccgccgatga cgcgttcctg tttgcccagg gacccaccgg 275641 gggcttgggt aaggtgcgtt tccacgactg gcgcgcgtca tttgacacct gcgcgcatct 275701 gcctaatgtc gcactgctgc gcgagcaagt cgacgtgttc gccgagctgc tggccagcgc 275761 caccccggac gcggcacagc agaaggatat cgactttgcc ttcggcgtgg gacaactctt 275821 cgcgaacgtg ccctatgccc agctcatttt ggaggaggcc cggctatctg gtgtcgacga 275881 ggccttgatc gacgagatct tcggcgtact ggttcgggac ttcaacaccc atgccgtcga 275941 gctgcacggc aggtccgcca cgacagccga acaggctcgg ttcgccatgc gaatggtccg 276001 tcggccggtg cacgatcccg cccgctacga ccagatctgg aaggaccacg tgctcgcgct 276061 caacggcgca tatcaaatgg caccatagtg cgccgcgtcg agatcgacgc tgccgtgttg 276121 cccactcgca ctttcgcgcg ctggtgtcaa tctcgacgcc agccttgacc gtgatgcagc 276181 gcacattaga atgaccagtg gtcaccaacg caaggaggcc ccatgccgac ggtgacgtgg 276241 gcgcgtgtcg atccggctcg ccgtgccgcc gtggtggaag ccgccgaggc tgagttcggt 276301 gcgcacggat tctcccgcgg cagcttgaac gtcatagccc ggcgtgccgg agtcgccaag 276361 ggcagcctgt tccagtactt cgcggacaag cgcgacctct acgcgtttat tgccgacatc 276421 gccagccagc gagtccgctc ctacatggag gacctgatcc gcgagctgga cccgaaccgg 276481 ccgttcttcg aattcctcac cgacctgctc gatggctggg tcgcctactt cgccgagcat 276541 cctcgggaac gtgcgttgca tgctgcggcg accctggagg tcgacaccga tgcccgcatc 276601 agcgtgcgca gcgtcctgca ccgccactac ctggacgtgc tacggccgct ggtgcgcgac 276661 gcgcacgcgc ggggcgacct gcgcgcagat tccgacaccg gtgcattgat gtcgctgctg 276721 ctgctgatct ttccgcacct ggcgctggct ccatacatgc gtggtttgga tccgatcctg 276781 ggcctcgacg agcccacacc tgagcagccc gcgctggccg tgcgcaggct tgtcgccgtg 276841 ctggcggcgg ccttcgatgc ccagcacccc gcgaccaact cagcccagac ccgatcggag 276901 gagatcacat gacacgcaca cgttcgggct cgctcgccgc gggcggactc aactgggcga 276961 gcctgccact gaagctgttc gccgggggca acgcaaagtt ctgggatccg gccgacatcg 277021 acttcacgcg cgaccgggcg gactgggaga agttgtcgga cgacgaacgt gactacgcca 277081 cccgattgtg cacccagttc attgccggcg aggaggcggt gaccgaggac atccagccgt 277141 tcatgtccgc gatgcgggcc gagggacggc tggccgacga gatgtatctg acgcagttcg 277201 cgttcgagga agccaaacac acccaggtgt ttcgcatgtg gctggatgcc gtcggaatca 277261 gcgaagactt gcatcgctat ctcgacgact tgcccgccta ccgccaaatc ttctacgcgg 277321 agttgccgga gtgcctcaac gcattgtcgg ccgatccctc accggccgcc caggtccggg 277381 cgtcggtcac ctacaaccac atcgtggaag gcatgctggc gctcacgggc tactacgcct 277441 ggcacaagat ctgtgtggaa cgcgcaatcc ttcccggcat gcaggagctg gtccggcgca 277501 tcggtgacga cgagcgacgc cacatggctt ggggcacctt cacctgtcgg cgccacgtcg 277561 ccgccgacga cgccaattgg acggtgttcg aaacacggat gaacgagctc atcccgctgg 277621 cgctgcgcct catcgaggag ggctttgcgc tgtacggcga ccagccccca ttcgacctgt 277681 ccaaggacga tttcctgcaa tactcgaccg acaagggaat gcgccggttc ggcaccatca 277741 gcaacgcccg cggccggccg gtcgccgaaa tcgacgtcga ctactcgccc gcgcagctgg 277801 aggacacctt cgccgacgag gaccggcgca ccctggcagc ggcctcggcc taggcctggc 277861 gagcagacgc aaaatcgccc aatttcgtgc cgaattgggc gattttgcgt ctgctcgcca 277921 ggggaacgct aggcgatcca gacggtcttg atgttgcaga actcgcgtat cccgtgtgcg 277981 gacagttccc ggccatagcc cgagcgcttg accccgccga acggcaattc gggataggac 278041 accgtcatgc cgttgataaa aacctggccc gccacgatgt cgtcgatgaa gcgtcgttgc 278101 tcggtctcgt cgcgggtcca ggcgttggat cccagcccga aggtggtggc gttggcgatc 278161 tcgacggcct cgtcgatgtt cgccgcgcgg aacaccgagg cgaccggacc gaagacctcc 278221 tcggtgtaga gagccatgtc cttggagatg tcggtgatca cggtcggcgg gtagaaccag 278281 cccggccggt cgagacgctt tccgccgcac cggatcaccg cgcccgccgc ggcagcatcc 278341 tcgacttgct tggcaacctc gttgcggccc tgctcggtgg ccagcgggcc cacgtcggtg 278401 tccgggtcgg tcgggtcgcc gacccgtaac gccgccatcc gcgcgacgaa cttgtcgacg 278461 aaatcgtcgt aaatgtcggc gtggacgatg aaccgcttgg cggcgatgca ggattggccg 278521 ttgttctgca cccggccggt gacggcggtg ctgaccgcgg cgtccagatc ggccgacggc 278581 ataacgatga acgggtcgct gccgccgagc tcgagcacgg tcggcttgat ctcgttaccg 278641 gcgatagcgc ccaccgattg gccggccggc tcgcttccgg tcagcgtggc cgccgcgacc 278701 cggggatcac gcaggatggc ttcgacggct cccgagctaa caagcaacgt ctggaagcag 278761 ccgtccggga agccgcctcg ggcgatgacg tcggccaggt acagcgcgca ttgcggcacg 278821 ttcgacgcgt gcttgagcag gccgacgttt ccggccatca gtgccggtgc ggcgaaccga 278881 accgcttgcc acagggggaa gttccatggc atcaccgcca ggatcacccc gagcggctgg 278941 tatcggccgt aggccgccga cgccccgacc ttggccgcat cggcgggttc gtcggccagc 279001 aacgcctcgg cgttttcggc gtagtagcga aaacccttgg cgcacttcag tgcctcggct 279061 ttggccgcgg ccagcgtctt gcccatctcg agcgtcatca tcgcggcggc ctggtcggcc 279121 tcggcttcca gcaagtcggc ggtggcattg gcccaccggg cgcgctgggc gaagctggtc 279181 tggcggtagt cggcgaaccg ccggtgggcc cgggctattg ccgcgtcgac ttcgtcatcg 279241 gtcgccgcag tgaatgtctt gactgtttcg ccggtagccg ggttgatggt ggcgatgggc 279301 acgctgacat cctttgctgg gtgggtttgc acaaatcgtc cggtgtccag cctgccacta 279361 acgtggccag cgctcccgag caggaggtgt cggggcctcc tatcggctgg ggtgggctct 279421 atcacgggca ggaccagcgt ggcggaacat gtcaccgatc gcatgttcgt cgggagctaa 279481 tcggcccgtt caatcggccg gtggcgaggc gactttgcgt agcgacatcg gcgggacgta 279541 gcgcccgatc agcgtgcggt gccaccaggc tcggtcccgt cgcagctcgg ccacggtggt 279601 gaagcggtac tgatacagct gcgcgcgcac ataccgaggc ggagattgcg ggaaaggatt 279661 gtggcgcaac agcttcagcg tcgcaggatc attgcgcagc aaccggttta ggaatggcgt 279721 catccacggt agtgcgtagc ccggtgagat ggcggcgaac cacatgagcc agtccagccg 279781 cagatggtag ggggcccatt gccgcggcag ccggcgcgga tcaccgggct tgcccttgaa 279841 ttcgtatgct ttccagacgg tttgttcggt aatcggtgac tcgtcggtcc cttcgattac 279901 cacttcccgg cgggtgcggc agatgctgcc gaacgccccg taggtgttga ccaaatgaaa 279961 ggggttgaac gacatgttca ttcgttgatg agaggacagc agattgcgtg ccggccagta 280021 gctcagcaac agcaccgccg cggtgaatac gaccacgagg ccggcgaacc actgcggcgg 280081 tgccgacagc gccggctggg ccggcatcgg cagcagcgcc gcggccgaag atgtgtcgat 280141 cgcgctgcac gccaagagga tggtcagcca attgagccag gagaaatttc ccgatgccac 280201 cagccatagc tgggtaacca cgatgatcgc ggcggcgatg ctggctgcgg gctgtggtgt 280261 gaacaaccca aacggcacca cgagctgggc gaaatggttg cccgccacct caatccggtg 280321 caatggctta ggcaggtgat ggaagaacca gctcaacggg cccggcatgg gctgtgtttc 280381 gtggtggtag tacaggcacg tcagactgcg ccagcacgag tcgccgcgca tcttgatcaa 280441 tccggcaccg aattcgaccc ggaacagcag ccagcgcgcc aacaacaacg tcagaatcgg 280501 cggggcggtg cgctcgtttc cgaggaagat catcaggaag ccggtctcca gcagcagcga 280561 ctcccaaccg aatgagtacc acgcctgccc gacgttgacg atggacaggt agagcaccca 280621 cagcgtcagc cagatcagca tcgtggccca caacggcacg aaggaggccg caccggcgac 280681 gacggctgcc gacaacacgg cacccaacca gcagaccccg gcgaacaccc gatcggaata 280741 gcgaaagtga aagatgctcg gtgttcgcca gaaggactgt ccagccagat accgcggcac 280801 cggcagcatg ccgtgctctc cgatgagggg ccggaactgc tgtgcggccg cgacgaatgc 280861 gatcagataa ataatcgccg tgccgcgctc cagcgccagt ctgcccagcc aatattcggg 280921 cgctgaaaac catcccatgg ccgttactcc ttggacacgg cgttcacacc aactattgca 280981 tgcggtcttg accacgagac tctgatgtgg cgaccaccga tgccgccacc acggaaaccg 281041 aaatcagtgc cagcagttgc acactggccc agttcccggc gtatccgtcg accgaccgcc 281101 acggatgccg ggacagcgca gcgccggcca ggatcagccc gcccgcagcc agtccgacgg 281161 tgacccggtc acgtagtcgc tcccggcggc gcagtgcata ccgcacaccg agggccgtgc 281221 ccatcaccat gaccccggcg atgctggcga tcaccgcccc ggccgccagc accccggccg 281281 ccgcccaggc tccgggcctc cagggtggtg tgggccggtc ggcgagctgc cgtcgcccgg 281341 tccgccagaa cgccagcaga gctagcaggg gcagcagggc cagccctatc gccaggctcg 281401 cccgatacag cgagttcggt gcgaatgtca gcgtgatggt gccggggttc ccggcgggca 281461 ccacccaggc ctgctgccac ccgttgacgg cgatcggtgt cagccgggcc ccggtgctcg 281521 tgcgggccac ccagcccgag ttgatgcttt cgggtaccac cagcacccgg gaagtggccg 281581 actcgggaac ccgaacttcg cggtgggtgg gaccccacgc acccgtttca gcagaagtaa 281641 ctgtcgcgct tgacaatcca gcaccgggag ttgacaactg ggcaccgtcg accacgaacg 281701 cggcgccggg gctgatcagc aattcctgct gtcccgccgg cagcgctatc ggctcgcgtt 281761 cacacgggag cgcggcgacc ggttcaccgt ccagcaaggc gcccaccgtg gttcggatcg 281821 aggtgtgcac gaaccggccc gcgacggcga cgaccgggcc gtgatcgcaa tccacggtga 281881 gcgcacgcgc gcggttgcgg gcggcgtcgg ccggcgcaat cggggcgccg ccggcgccaa 281941 gcaccaccac ttcggccagc cccggcggct tgagctggtc gaagcccagc gcgttgcgat 282001 cgatgacatc gtcccagtcc agcaggctga ccgacaccgt gtcggtcacc cggggatgca 282061 gccatagcgt cgttagctcg ccgacctgca gttgtcggac ctgggggccg tcgcccaggt 282121 tgatggccac caccgtcgga tgggccggca acatcgaccg gctggcggcc agccgcagcc 282181 cggtcaccac ggtgggccgc ggcagggcca gcgtcagcgt cggcggggtt ttgtgttgca 282241 ccacccgctg cggcgcggtc caggcggtgg ccggatcgcc gtcggcggcc gcgtacgccg 282301 agccgaggat gtcgacaagg tcggaatcac cgctggcccg ggtggtggaa ggcgcggcga 282361 tcaagtcggc cagcttcggg ccctgccgtg gtcgcaccca caccatcggg gtcaccgaca 282421 ccgggcgggg tacggtcagt gtgcggctga gattggccgg ttcctcgggt gccagggcca 282481 tcgaggcggc gcagcgcacg ccgtcgggtc ccggggcgca gcccggtctg cccagcagtt 282541 cggatcccag gtcccagccc gcgatcgccg aacccggcgg cggcccgggc accagcacgg 282601 tgtgtcgcag ctgaaccgga tgggcgaaac cggacgcatc gtattgggtg atggccagat 282661 cggtgatgcc gaactgcaca ccggccgacc cgtcgtcggt ggcggccgcg gtaaaccgca 282721 cccagggggt ttcgccgtag ggcagtgcgg cggtgagcgg tttgcccgcc tcatcgaacc 282781 gcagggtggt gctgccgttg acggtctcga tcaggatgcg tcggacctgg gcgccgaccg 282841 cggtcgcgct gggtgtcagg gtgacgacgg cattggtcac cggacggtcg aaatccacct 282901 gcagccactg cccaacggcg gcctgcagcg cgttggacac ccaagcggtc gccgggtcac 282961 cgtcgacggc ggccgcgggt gcgctcgccg gggcgacgtc gggcatggcg gtggcatccg 283021 ccgaggagct cgacacggtg atccggccgc cggtccatcc accgacgacc ggctccgcgc 283081 ccggcaccgg gtagtcaggc acccggttgt aggtgtgccg ggcgtcgccg ggtgcccgga 283141 tcgccgacga gtggtggtcc acccggccgt aatcggtctc gcgggccacc ggggtgtcgg 283201 tgacggcgac ctggggcacc ggcaagccgg cagctcgagc gtccgcggtc atcagcaccg 283261 gacccagcgg gggctggccc tgcagccggc gtcgttcgtc cagacgcagc aggacctcgg 283321 gtccgccgtc gacgcgggcg agctggtcgg tcgcggcgaa gtagggcgca ccggggttgg 283381 cgggcgcgct cacccggtag atctcaatcg cgggatatcg gggtcgcagg ccgctgtcgt 283441 tgacgaaacc cgccagcgga tcaggaccca ccggcgcgcc gaactccgcc agcttcgcta 283501 gcccgggcga ccctgcgatg ctacggtgca gcagaatcgg tcgtgccgag cgcgacgtct 283561 cgggatccag atcgttgcgt accagcacat aggaaatgcc ttggcgggca agggtatcgg 283621 ccagccccgc cgacggtcgt ccggcggcga acaggcgttg cacggagtcc agcgctcgaa 283681 tggtctgcgg cggggtcagc ggaatggagt cgcgcacgcc ccacgggccg tcgccgagca 283741 cctgcagcgg ctcgtcgtgg ctggtgcccc acacctgggt ggcgaacggg gcgcccggga 283801 ccaccagcac ccgcccggga gtgggcgtcg cggcatggtg tgtgcgcagc cagtcggcgg 283861 cctcctgcca gtactgggga agcgcaccga acgtgccggg cggggcgacc cggccggtcc 283921 acgccagcga ggtgctgacc accagcgcgg tcagggctac caccgccacc gctactcgct 283981 tgtcccgctc ggggtgcgcg aacgcgcgca gccacgccgg cctcggcgcg ctgcctggca 284041 gcggaactcg gctcagcagc tgcgccaagc ccagcaccag gggcagccgg atcaccggcc 284101 ccaccttgtg tacgttgcgc aggggggtgc cggcggcgtc caggaacgcc tgcaccgggt 284161 gggcgaccgg cgaagccagc ccgccgcggt ggccgacggc cagcagcacc accccgacca 284221 acagcatcgt caccagccgg ccgcgcgccg gcatcgccgg gctagtcagt ccggccagcc 284281 cggccgctgc gaccaggcag gtgcccagga tggccgccga tccggtgacc aacggcgcgc 284341 ccgcggtcgc gttcggcgcc acgaacggcg tccagctgtc ggtgccgcgc agcacctcca 284401 ccagcgagga ccattgcgtg gtcacgccgg aagattcgat gaagtccaga aacggcggac 284461 tgaccccgtg cagctgcgtc agcgccatta cccaccacag tgtcgccagg gccatcgcca 284521 acagccacca cgcggtgtag cgccaccaca accgattcgg ccggtgacag gcccaccaga 284581 tcaccgccgg caggcaaccg gccagcgtcg cgatggcgtt gaccgcgccc atcagcgcca 284641 ccgccagccc ggcttgggcg gccagcgcgc gcaccgagcg gccagaagtc ccccgcagcg 284701 ccaggatcgt gggcagcagc acccacggcg ccagcatcat cggcaaggtt tccgacgaga 284761 tcgacccgag tgtggtcagc acccgtggtg acagcgcgaa cgccacggcg ccgaccaccc 284821 gcgaggacgg gccgccgacg cccagcgcct cggctacccg cagcaggccc cagaagccga 284881 ccgtgagcaa caccgcccac cacagccgct gagtgaccca gccgggcact cccagcaggt 284941 gaccgatcac gaagaaggtg ccgtgcggaa acagataccc gtaggcctgg ttctgcgcct 285001 gcccgaacgg caggtcgctg ttccacaggt tggtcgcacg cgccaggaag cgcagcgggt 285061 tggcggtgag gtccagcttg gtgtcggggg agacttgtcc gggggattgg gcgaacgtca 285121 gcgccaacgc taccgcgccg accaccggca gccatttgcg agacaacggc gccacctgcg 285181 acccggaggc cgcctcagcc tgcggcgccc gggtcgcggg gctagctacg gttaccgtac 285241 tcgacccggt tgagcactga tgacgacgga tcgcccccgg ggagtggcgg tttcttgtcc 285301 tgctgcacca tcagggtcac tccgaagatc gcggccgcgc ccagcaacag accaaccacc 285361 acgcttgcgg cggcgggcgc gacgatccgg ttcatcggtg gctcctcgac ggctgtgggt 285421 gcggcttgag aggctagagg caacttagca gaagcgtggg cctggccccc caacccggag 285481 cgtatgcgcc accgtgacag catgtccgga tggcttttcc acgcacactg gcgatactcg 285541 ctgcggcagc agcgttggtg gtggcctgca gccatggcgg cacacccacc ggatcgtcga 285601 cgacctccgg cgcgtcgccc gcaactccgg tagccgttcc cgtgccccgg agctgcgccg 285661 agccggcggg gatcccggcg ctgctgtccc cccgtgacaa gctggcccag ctgctggtgg 285721 tcggcgtgcg agatgctgcg gacgcccaag ccgtggtcac caactaccac gtcggcggca 285781 tcctcatcgg cagcgacacc gacctgacga tttttgacgg cgcgctggcc gagatcgttg 285841 ccggcggggg tccgctgccg ctggcggtga gtgtcgacga ggaaggcggg cggttgtccc 285901 ggttgaggtc gctgatcggc ggtacggggc cgtcggcccg cgaactggca caaacccgaa 285961 ccgtccagca ggtgcgcgac ttggctcgag accgcggccg gcagatgaga aagctgggta 286021 tcaccatcga cttcgccccg gtggtcgacg tcaccgacgc cccggatgac acggtgatcg 286081 gggaccggtc gttcggctcg gatccggcta cggtcaccgc gtatgccggg gcgtacgcgc 286141 agggtctgcg cgatgccggg gtgctgccgg tgctcaagca tttccccggt cacgggcgtg 286201 gctcgggtga ttcgcacaac gggggtgtca cgacaccacc gcttgatgac ctggtgggcg 286261 atgacctggt gccctaccga acgctggtga cccaggcgcc ggtcggtgtg atggtgggtc 286321 atctgcaggt tcctgggttg accggctccg agccggccag tctgagcaag gccgcggtga 286381 acctgctgcg caccggcacg ggatacggcg caccgccgtt cgatggtcca gtgttcagcg 286441 acgacctctc tggtatggcc gcgatctcag accggtttgg cgtcagcgag gcggtgttgc 286501 gcaccttgca agccggtgcc gatatcgcac tgtgggttac caccaaagag gtgcccgcgg 286561 tgctggaccg cctggaacag gcgctgcgcg ccggtgaatt gccgatgtcg gcggtcgacc 286621 ggtcggtggt gcgggtggcg accatgaagg ggcccaaccc ggggtgtggc cgttagcgat 286681 gtgcggctgg cgccccactg cttaccgtag ggttagatag acgggctaca ggggcccaaa 286741 aggggctggc gatggcaggt ggtaccaagc gactaccgcg tgctgtccga gagcagcaga 286801 tgctcgatgc cgccgtgcag atgttctcgg ttaacggcta ccacgagacc tcgatggacg 286861 cgatcgctgc cgaggcgcag atctccaagc cgatgctgta cctgtactac ggctccaagg 286921 aagacctgtt cggcgcctgc ctgaaccgtg agatgagccg gttcatcgac gcgttgcgtt 286981 ccagcatcaa cttcgaccag agcccgaaag acttgctgcg caacaccatc gtgtcgttcc 287041 tacgctatat cgatgccaac cgggcgtcgt ggatcgtgat gtacacccag gccaccagct 287101 cccaagcgtt cgcgcacacg gtgcgtgagg ggcgcgaaca gatcgtccaa ctggtggccg 287161 agttggtgcg ggccggcacc cgcggcccgc ttacggacgc cgaaatcgag atgatggccg 287221 tcgcgctggt gggcgccggc gaggcagtgg ccacccggct cggtatcggt gacactgacg 287281 ttgacgaggc ggccgagatg atgatcaacc tgttctggct cggcctcaag ggcgcgccgg 287341 tggatcggct cgagaccggg cactgacctg cgcggtatcg gccactgaga tgtgggtgta 287401 ttttagatgc agatgtaaat tcgatgtatg attcgaacgc aagtccagct cccagatgag 287461 ctttaccggg acgccaagcg ggtcgcgcac gagcacgaaa tgacccttgc cgaggtcgtt 287521 cgtcgcgggc tggagcacat ggtgcggatc tatccgaggc gcgatgcggc gtccgacacc 287581 tggcagccgc ccacgccgcg tcgactcggt ccgtttcgtg cgtccgaaga aacgtggcgc 287641 gagctcgcca acgaggcgtg agtagcccgt gctctcgatc gatacgaata tcctgctgta 287701 cgcgcagaac cgggattgcc ccgagcatga cgccgccgcc gccttcctcg tcgagtgctc 287761 tggtcgagcc gacgtcgcag tctgcgaact cgtgcttatg gagctgtatc aattgctgcg 287821 gaatcctacg gtggtgacgc gaccgctcga gggccccgag gcggcggaag tctgtcagac 287881 gttccgtcgc aaccggcggt gggcgctcct cgagaacgct ccggtcatga acgaggtgtg 287941 ggtgttggcg gccacgccta gaattgctcg ccggcgccta ttcgatgccc ggctggcact 288001 gaccttgcgc catcatggtg tcgacgaatt cgccactcga aacatcaacg gcttcaccga 288061 cttcggcttc tcacgcgtgt gggacccgat aacgtcggat ggctgaccac gccgggccga 288121 tccgcgtggc cccggctata gaccccgcac ggtagcggtc aggtgggggt atcccttggc 288181 catattgcgc agcgtgagat cccagccgcc atcgccttcg gcgacgtaga gtcccgcggt 288241 ggccggcagc agcaccggct tggcgaaccg aaccgaatag cgcaccgcgt ccggaaaacg 288301 ggcttcgata ttcgccaata ccgccgcggc agtgaacatc ccgtgcgcga tgacggtggg 288361 gaagccgaac agtttcgccg cgatcgggtt ggtgtggatc gggttgtgat cgccgccgac 288421 ggcggcatag cggcggatct tcgccggggt gatccgcagg accgcggcgg gcgggggtag 288481 cttgggcttt ttttgcggcg gcggtttggg ttcgccggac aagctggtgc gttgttgatg 288541 caggaacgtc gtcacctggt gccaggcgac atcgttgccg acgctgacgt tggtcaccag 288601 atcgaccagc aggcccctgc ggtgttcgcg cagattctcc gcgcgcaccc gcacgcccac 288661 cgcgtcggtg accgcgatcg gccggtattg cgtgatgtgg ttctcggtgt gtatcgctcc 288721 cattgcggcg aacgggaagt cgaagccggt caccaacgac atcaccgatg gaaaagttaa 288781 cgcgaacgga taggtcaacg gcacctggtt gccgtagcgc agaccggtga ccgccgcgta 288841 ggccgcgacg ttggcggggt cgatcggcag ctcctcgacg gtcaccgtcc ggttgggcag 288901 ctggtctgtc cggggcacca cgggtagcgc cccggccgcc gcgcgcagca ggttcttcag 288961 gccgctgggt tgagtcacta ctgtcccctc acgcgccgat catggcctgg ccgcagacac 289021 gaatgacgtt gccggtcacc gcgtttgacg ccgggctggc gaagtaggcg atggcctcgg 289081 cgacgtcgac gggctgcccg ccctgcagca gcgagttcag ccggcggcct acctcacggg 289141 tggccagcgg gatggcggcc gtcatctggg tttcgatgaa tcccggtgcc acggcgttga 289201 tcgtgatgcc tttcgcggcc aggccgggtg ccagcgcctg ggtgatgccg atcatcccgg 289261 ccttggtggt ggcgtagttg gtctggccgc ggttgccggc gatgccggcg atcgacgaca 289321 gcccgatcac ccgaccaccc tctccgatgc tgccgttgcc caccagaccc tcggtgagcc 289381 gcaacggggc aagcagattg acagccagga cggcgtccca acgcgcatcg tccatgttgg 289441 ccagcagctt gtcacgggtg atgccggcgt tgttgaccag gatgtcggcc ttgccaccgt 289501 ggtggtcgcg caggtgctcg ctgatcttgt cgacggcatc gtcggcggtg acgtcgagcc 289561 acagcgcggt gccgcccacc ttgctggcgg tttcggccag gttctcggcg gcggactcca 289621 catcgatggc gaccacgtgg gcgccgtcgc gagcgaacac ctcggcgatg gttgcgccga 289681 tgccgcgggc cgcgccggtc acaatggcga ccttgccgtc cagcggcttc tcccagtcgg 289741 ccggcggtgt ggaatcgtcc gccccgacag agaagacttg gccgtcgacg taggccgact 289801 tggccgacag caggaatcgc atggtcgact cgaggccggt agctgcgggc ttggcgtccg 289861 gcgacaggta gaccaacgcc gttgtcgcac cgcggcgcag ttccttgccc agcgagcggg 289921 tgaagccctc cagcgcgcgc tgcgcgatcc gctcgttcgt gctggcggcc gcttcgggtg 289981 tgccgccaac aaccaccacg cgcccgcaac ggccgagatt gcgcagtacc ggagtaaaga 290041 actcgtgcag ccccttgagc ccggccggct ctgtgatgcc ggtggcgtcg aagaccagcc 290101 cgccgaacga gtccgcccag cgcccgccca ggttgtttcc taccaggtcg tagtcctttt 290161 cgagtgccgc gcgcagtggt tcgacgaccc tgccggcccc gccgatcagc agcgacccgg 290221 tcagtggcgg ttcgcctgct cgatagcggc gaagcgtctc gggttgcgga acacccaatt 290281 gcctggccaa aaacgatcct ggaccggagt tgacaacctg cgagaacaga tcggacgaac 290341 gcttgggagc cacttcagct gccttccgta tcgtgtgggg gtcgggcgcg ccaatacacg 290401 taaccgtatc gaggactaac ttacttcaga gtaagaacag tgggtagtat ggccctcaac 290461 ggccgatccc ccgaactgat caacggagaa aacagtggcc cctgctgcta agaacacttc 290521 acagaccagg cggcgagtcg ccgtactggg cggcaaccgc atcccgttcg ccagatcgga 290581 cggtgcctac gcggatgcgt ccaaccagga catgttcacc gcggcgctga gcggcttggt 290641 ggaccgattc ggactcgccg gcgagcggct ggacatggtg gtgggcggtg cggtgctcaa 290701 acacagccgc gacttcaatc taatgcgcga atgcgtgctg ggctccgaac tctcgccgta 290761 cacgccggcg ttcgacctgc agcaggcctg cgggacgggc ctgcaggccg cgatcgcggc 290821 cgccgacggc attgccgccg ggcggtatga ggtggccgcc gctggcgggg tggacaccac 290881 ctcggacccg ccgatcggcc tgggcgacga cctgcgccgc accctgctca agctgcgccg 290941 atctaggtcc aacgtgcaac gcctcaagct ggtgggcacg ctgccggcca gcctgggcgt 291001 ggagatcccc gccaacagcg agccgcgcac cgggctgtcg atgggcgagc acgccgccgt 291061 caccgccaag cagatgggca tcaaacgcgt agaccaggac gagctggccg ccgccagcca 291121 tcgcaatatg gccgacgcct acgaccgggg tttcttcgac gacctggtca gtccgttttt 291181 agggctgtac cgagacgaca atctgcggcc taactccagc gtcgagaaac tggccacgct 291241 gcgtccggtc ttcggagtga aggccggtga cgcgacgatg acggccggca attcgactcc 291301 gctgaccgac ggcgcctcgg tggcattgct ggccagcgaa cagtgggcgg aggcacactc 291361 gctggctccg ctggcctatc tcgtggatgc cgagaccgcc gcggtcgact atgtcaacgg 291421 caacgacggc ctgttgatgg cgccgaccta cgcggtaccc cggctgctgg cccgtaacgg 291481 gttgagcctg caggacttcg acttctacga aatccacgag gcgtttgcct ccgtggtgct 291541 cgcgcatctg gcggcgtggg agtccgagga gtactgcaag cggcggctgg gcctggacgc 291601 cgcgctgggg tcgatcgatc ggtccaagct caacgtcaac gggtcgtcgt tggccgccgg 291661 gcaccccttc gcggcgaccg gtgggcggat tttggcgcag accgccaagc agctcgccga 291721 gaagaaggcg gcgaaaaaag gcggcggacc gctgcgcggg ctgatttcga tctgcgcggc 291781 cggcggccaa ggtgtggccg cgattttgga ggcctgacgc tgacggctcg gtaagtgcct 291841 cgcgggaagt cccgagtggc cggtgggccg cccaaagaaa tgtgttgcgg gtggtttgcg 291901 ccctgagcag atgggtaccc gatcactcgg atagccccgt gttgttgtct gaccccccga 291961 ccccgacggc aatgcggggc aatcccctgg aaagggccgc cgctggtggg aggggaccca 292021 gcggcggtct ttttgggctt gccccatcgt tcgttgactc tgcgtccacc acgcaaaagt 292081 gcgagtaacc cgtccggtgg acgcagagtc aacagataag gatcagaacg cggcctcgtc 292141 gagttccatg atgtcgttgt ccagcgtctc gatcacctcg cgggtgctgg tcaacagcgg 292201 caagaagttc ttcgcgaaga acgacgccac cgcgactttg ccttcgtaga aggaccgctc 292261 gtcgccggtg gcacccgcgt cgagtgccgc caccgccacc gcggcctgac gctgcagcaa 292321 ccagccgatg atgaggtcac cgacgctcat caagaagcgc accgaaccca agcccacctt 292381 gtagaggctg gtgacgtcct gctgcgcggc catcaggtag ccggtcagtg cggccgccat 292441 gccctggacg tcggtgagcg ccttggccag cagcgcgcgt tcggtcttca gccggccgtt 292501 gccagcaccg ctgtcgacga acgcctggat ctggcctgac acgtgcgcca acgccacgcc 292561 cttgtcacgg acgattttgc ggaagaagaa gtcttgtgcc tggatggcgg tggtgccttc 292621 gtacagggag tcgatcttgg cgtcccggat gtactgctcg atcggatagt cctgcaagaa 292681 gccggatcca cccagggttt gcaggctttc agtgagcttg gcgtaagcct gttcggagcc 292741 cacacccttg actaccggca acatcaggtc gttgaccctg acggccaact tggcgtccac 292801 accgtgcacc acctcggcga cagccgcgtc ctggaaagtg gcggtgtaga ggtagagcgc 292861 acgcaggccc tcggcgtaag ccttctgggt catcagcgag cggcgcacgt cggggtgatg 292921 tgtgatcgtc acccggggcg cggtcttgtc ggtcatctgg gtcaggtcgg caccctgcac 292981 gcgggacttg gcgtactgaa gcgcgttgag gtagccggtg gacagcgtcg cgatggcctt 293041 cgtgccgacc atcatgcggg cctgctcaat gacctcgaac atctgcgcga tgccgttgtg 293101 tacctcgccg accagccagc ccttggcggg gacgccgtgt tggccgaacg ccagttcaca 293161 ggtcgccgag acctttaggc ccatcttgtg ttcgacgttg gtgacgaaca cgccattgcg 293221 ctcgccgggt tcgccggttt cgacgtcgaa caggaacttg ggcacgaagt acagcgacag 293281 gcccttggtg ccgggaccgg cgccctccgg gcgagccagc accaggtgga agatgttctc 293341 gaacaggtcg ccggagtcac ccgaggtaat gaaccgcttg acgccgtcga tgtgccagga 293401 cccgtcggcc tgttggacag ctttggttcg ggcagcgccc acatcggagc cggcatccgg 293461 ctcggtgagc accatggtcg atccccagcc gcgttcggcg gctaggaccg cccacttctt 293521 ctgctcctcg gtgccgaggt ggtagaggat ctgggcgaag cccgcgccgc cggcgtacat 293581 ccataccgcc ggattggcgc ccaagatgtg ctcatgcagc gcccagacca ctgccttggg 293641 catcggcatg cccccgagtg cctcgtcgat gccgaccttg tcccaaccgg cttccagcat 293701 cgcgttgact gactttttga acgattccgg cagcatcacc gagtgggttt tcgggtcgaa 293761 aacgggcggg ttgcggtccc cttcgacgaa cgactcggcc accggcccct cggccagccg 293821 gctgacctcg gccagcatgt cgcgggcggt gtcgacgtcg acgtcgctga attcgccatg 293881 gcccaaagct ttgtcgacgc ccagcacttc gaacaggtta aaaacctggt cacggacgtt 293941 gctccggtag tggctcactg ccgatcctcc tcgttgagag tgccacctca gggttgggta 294001 gggttgggta ctcgaaacca agttacccac cagtaacacc gtcaaaatat atccgttgca 294061 taggtcaatg caagttgatg tgagctacat tgcaccaact aactaaccaa ccggttgggt 294121 tagcggtgat cctggccgtg tcggtcctct cacctgcggc gatagcgatc aaatgaagac 294181 tatgcggagt ctagggcggc agcgcctggc agcgtagatc atcggctcac gcggatgcgg 294241 cctcttggta cggacatgcg cgcggatgtc cggcgagtag ggtcggatgc gaaaactacg 294301 tcctcggctc taggggcgaa tgaagttcgg tgaactcaac gaacaacctg acgccgtcct 294361 cactgcggga ggccttcggc catttcccga ccggggtggt ggccatcgct gcggaggtcg 294421 acggagtgcg gcaaggcttg gcagccagta cctttgtccc ggtctcgctg gaaccgccgc 294481 tggtgtcgtt ctgtgtgcag aacacctcga cgacatggcc gaaactcacc ggcgtgccga 294541 tgctgggcat cagcgtgctc ggcgaggccc atgacgccgc agtgcgcaca ctggccgcaa 294601 aaactgggga caggttcgcc ggtttggaga cggtatccaa cgacgccggc gccgtcttca 294661 tcaagggcac cagcgtgtgg ctcgagagcg cgatcgagca gctggtcccg gcgggagatc 294721 acaccatcgt ggtcttacgg gtcaaccagg tcaaggtgga tcccaacgta gcgcccattg 294781 tgttccatcg cagcgtgctc cgccgactcg gcgtctaaac gtctatacgg acgcccactt 294841 ggtctgtccg gacaacatag cggtcagcgg cccattctgg ttgcgataaa tgatggtaga 294901 tcacgtcatt ttgcttccag tagtcgtgcc catgtttgag aggcacaact attggtcgct 294961 ttcattcgtt gcgcgcagac cggtctttgt atgacgatga tgggaagttc tatctgccgc 295021 caaaagcaga atggcaggac gcaggatgaa gcgatgagcc gacccgccgg aaccggtttc 295081 cgggaacggg tgggatgcat gcccacttga ggtctcgcgg caggcggtgg agcgtggcaa 295141 aaacgtcgca tcgggtgagc agcgccgatg gcatgagtaa gcgtattttg cgtttgataa 295201 tcgcgcagag cggcttctat agcgccgcac ttcagctcgg gaatgtctcg atcgttctac 295261 cgtttgtggt agccgagctc gacgccgaat tgtggatagc ggctcttatt tttcctgcat 295321 tcacggccgg tggggcgatc gggaatgtgg tcgcgccgcc ggcggtggcc gccgttccac 295381 gccgtcaccg attgttcatt attgtgtcct gtttggccgt cctggctggc gtcaatgcct 295441 tgtgcgcaac catcggcaaa ggaagcgtcg ctggaatcct attggtggtc aatgtgacgc 295501 tgatcggggt cgtttcggtg atctccttcg tcgccttcgc ggatctggtg gcggctatgc 295561 catcaggaac cgcccgagcc cgcattcttc ttaccgaggt cggagtaggg gcggctttga 295621 cggccgtggt ggcggcgacg ctgtcattcg tacccgacca acacccatta agcaggaaca 295681 ttcacctact gtggacggca gccgtggcaa tggctatctc ggcggccata tgccgggcat 295741 tgcctcaccg gatcgtcccc agggtccatg cggcgcccgg tctgcacaaa ctcgtgtacg 295801 tcggttggac ggctatccga accaatggtt ggtatcgtcg gtacctgctt gtgcaggtac 295861 tctttggctc ggtcgtgctc gggtcctcgt tccacagcat tcgcgtcgcc gccgtacccg 295921 gggaccagcc cgacgaggtc gttgccgtcg tccttttcgt ctgcgtcgga ctcttgggtg 295981 ggatcgcgtt gtggaaccgc gtccgggaga gatttggcct ggtcggtttg tttgtcggca 296041 gtgcactcgt tagcatcgcc gcggcagtgc tatccatcgc attcgatttg gccggagcgt 296101 ggcccaacgt cgtcgccatc ggtctggtga ttgcactggt atccatcgcc aatcaaagcg 296161 tattcaccgc aggccaactg tggattgccc gtgacgccga acccggcctg cgaacatccc 296221 tcatctcctt cggccagctc gtcatcaacg caggcttagt cggtatgggt ttggcgctgg 296281 ggttgattgc ccaggatcac gatgcggtgt ggccggtgat gatcgttctg ctgttgaacc 296341 tgacggctgc ctactcagcg acgcggttcg ctccagccaa gtccgtggat gttcgtggct 296401 tgcctcaggt ttcgcgcact tcccgaccta aaaccggggg ttagcggcga aacagcttgc 296461 tgcccagcca taccaccgga tcatacttgc ggtcggcgac ccgttctttc atcgggatca 296521 gggcattgtc ggtgatcttg atgttttctg ggcacacttc ggtgcagcac ttggtgatat 296581 tgcagtagcc caggccgtgc tcttcctgtg cttggctgcg tcggtcccgg gtgtccagcg 296641 gatgcatttc gagttcggcg attcgcatca ggaagcgggg gccggcgaac gcatccttgt 296701 tttcctcgtg atcgcgaact acgtggcaga cgttttggca caggaagcat tcaatgcact 296761 tgcggaactc ctgcgagcgt gcaacgtcga cttgcgccat tcggtactcg ctgggctgta 296821 gctccttggg tggcgcgaaa gacgggatct cgcgcgcttt ttggtagttg aacgagacgt 296881 cggtaacaag atcgcgaatc accggaaacg tccgcattgg ggtgaccgtg acgatctcgt 296941 cctcgtcgaa tgtcgacatc cgcgtcatgc acatcagtcg cggtttgccg ttgatctcgg 297001 ccgagcagga tccgcacttg ccagctttgc aattccagcg cactgcgaga tccggtgtct 297061 gcgtctgttg tagacggagg atgacgtcca gcacgacctc gccctcgttg acctccacgg 297121 tgaattcgcg gagttcgcca cagctttcgt ctccgcgcca cacccgcata ctcgcgctgt 297181 acgtcattta gcctctccgt cctggatgct cggccagctc ttcgtcggtg tagtatttct 297241 ccaactccga gatctcgaag agctccagca agtcgggtcg catgggcgtt tgcagctgct 297301 gggtgacgtt gatgtggcag ttggagtcgc cggacccgct gccaccggtg cccatggttt 297361 cggtggcccg gcataccagc aagatcctgc gccagttggg gtccataccg ggatggtcgt 297421 ctcgggtgtg gccgcctcgg ctttcggtgc gctgtagcgc agctctggcc acgcactcgc 297481 tgaccagcaa catgttgcgc aggtcgatgg acaggttcca gcccggattg tattgacggt 297541 gaccttcgac gagtacgttg tggtagcgcg accacagctc ggccaaaaga gtcagcgccc 297601 tggatatttc gtcggcgttg cggatgatac cgaccagatc gttcatcacg tactgcaagt 297661 ccatatgcag cgcgtacgga ttctccggcg ccgagccgtc tttcggtcct tcgaaggggc 297721 tcagcgcctg ctgggccgcc gcatcgatag cctccgctga aaccgctggc cggctgctca 297781 gtgcccgtac gtaatccgct gcgcccaggc cggcccgccg gccgaatacc agcagatcgg 297841 acagcgaatt gccgcccagc cggttggagc cgtgcatacc gccggcacac tcaccggcag 297901 cgaacaggcc tggcaccgtg gcggcgccgg tgtccgcgtc tacttcgaca ccgcccatca 297961 cgtagtgaca cgtcggcccg acttccattg cctgcgttgt gatatcgact tcagcgagct 298021 ctttgaactg gtgatacatc gacggcaatc gccgtttgat ctcggcgggt gtcagccggg 298081 atgcgatgtc gaggtagacg ccgccgtgcg gggtaccgcg gccggccttg acctctgagt 298141 tgatcgcgcg cgcgacctcg tcgcggggca gcaagtccgg ggtgcgtcgg gccgagtcgt 298201 tgtccttaag ccactggtcg gcctcttcct ccgtctcggc gtactggccc ttgaacaccg 298261 gcggaatgta gtcgaacatg aagcgagagt tctccgagtt tttgagcact ccgccgtcgc 298321 cgcgaacacc ctcagtgacc agaattccct tgacactggg cggccacacc atgcccgtcg 298381 ggtggaactg gacgaactcc atgttgatca gcgtcgcccc ggcccgcagt gccaacgcgt 298441 gcccgtctcc ggtgtactcc caggagttgg atgtcacctt gaacgacttg ccgatcccgc 298501 cagtggcaag caccaccgct ggcgcctcga acacgatgaa ccggccgctt tcccgccagt 298561 agccgaaggc tccggcgatc gcgccttggt ccttgagcag ttcggtgatg gtgcattcgg 298621 cgaacacttt gatccgcgct tcgtagtcgc cgagctcggc gtggtcctcc tgctgcagcg 298681 agacaacctt ttgctgcagg gtgcggatca actccaggcc ggtgcggtcg ccgacgtgcg 298741 ccagtcgcgg ataggtgtgt ccgccgaagt tgcgctgact gattcggcca tcgtcggtgc 298801 ggtcgaacag cgcgccgtag gtctccaact cccagacccg gtccggcgcc tccttggcgt 298861 gcagctcggc catacgccag ttgttcagga actttccacc gcgcatcgtg tcgccgaagt 298921 gagtcttcca attgtccttc gggttggcgt tgcccatcgc ggccgcgcag ccgccttcgg 298981 ccatgaccgt gtgggccttg ccgaataggg atttgcacac gacggctact ttcaagccgc 299041 gttcccgcgc ctcgatgacc gcgcgtaacc ccgcgccgcc ggcaccgatc acgactacgt 299101 cgtaggagtg ccgctcgacc tcaaccataa aacctcgctc agcttctgaa acgatccttc 299161 agccaataaa tctgagatct gtgatgctgc cactggccac cagcatgatg tagaaatcgg 299221 tgagcgccag ggtccccagc gtgatccacg cgaattgcat gtgtcgggta ttgagcttgc 299281 tgacctgtgt ccagatccag tatcgcactg ggtgcttgga gaaatgcttg agccgaccgc 299341 cggtggcgtg ccggcacgaa tggcacgaga tggtgtatgc ccacagcaga accacattga 299401 tcgtcaaaat gacattgccc aaaccgaagc cgaatccgga cggcgagtga aatgccgcga 299461 tcgcgtcata ggtgttgatc agcgacacca ccaccgcgat atagaagaaa taccggtggg 299521 tgttctggac gatcagcgga agccgggttt caccggtgta atgagcccgc ggctcgggca 299581 ctgcgcagct tgtcggcgac tgccataccg accggtagta ggccttgcgg taataatagc 299641 aggtgagccg gaatccaagc aggaacggta ataccatcgc tcccaacgga atccaccctg 299701 gaaaatgccc gaaccagacg ccgagatgac tggcgccggg ctggcaggac gcgctgacgc 299761 acggcgagta gaacggcgtc aggtaatgat atttttccac ccagtattgg ctgccccaga 299821 acgcccgagt ggtcgcatag cagatgaacg ccaaaagacc gaggttggtc agcagcggtg 299881 gcaaccacca gaggtcggtg cgaagcgtcc gttctgggat ttgtgcgcgg gtgggtgtga 299941 aaacgccgat cgcaggacgg ttcgccgtgg gtgcgctcat ctaatgtgat cctcttcgcg 300001 tgttatctcg ttgaagggta cacagagaac ggcccccttt ttctgggggg ctcggttgtt 300061 cagtacctgt gacctccgac accctcatcg tcgacatcgc gccaaaattc gcgatcgtac 300121 tcggtgtcgg ggatggcgat tttctcgctg ggttgcggta cggcggcccg ttccaggtct 300181 aattcagaga cgtcggtgtc gagcaattcg atgtcggtga ggatccggtc ggcatcgatc 300241 acgatgcgac gcgttgccgg gttgtcgccg aatcgcgcct tcagtgcggt cacgcaccgc 300301 cgcagtccgc cgacgaggtc gtgcagttcg gcgagttcgg cagtcgtgga caatgggtgc 300361 tccctgggct ggcggtgtta cagatcacag tacgctcccg atactagcta tcgacggacg 300421 gagtcgttgg gtctactcgg cccaatggca tgatccggcg gacccatcgg cccggccgga 300481 tcatgccgta tcgcgaacta cttcgtgatg gcgatgcgct gcgcctgagt ttcggctggg 300541 gccttgtagg cgccggcaac ccggacggtc agcacaccgg cgtcatagga agccgcgatg 300601 gcctcgctgg tgacgtgcgc gggcagccgg aacgagcggc ggaatgatcc gtagcggatc 300661 tcacgcaggg tgcggccgtc tttgtctccg gcgtcttgcg tgtgctcgtc gcggtgttcg 300721 ccgcggatca ccaggcggct caccggctgg ccagggtcaa gctcgacgtt gacgtccttg 300781 tcgacgtcaa tgccgggcag ttccaaacgg accaccgcgt cgtcgccatc cttgacgatc 300841 tcggcggccg gcgtgaagtc tccggcgacc gggcggtacc agtccgtcgt cgcggcaggg 300901 ccgaagaagt cacgtagcca gcggtcccag ggctcaacgt cccacaccgg acgcgaccac 300961 aatgcgagat tgttcatggt tatctcctca tgcttcgttg tgagttagct gtgtccggcg 301021 cgttgccggc ccgctatacc aagaacctga gtcgaccacg cttaagttcc acctcggcgt 301081 tcaccggaag cgaacactgt cacacagccg gtcgccaggt gtgatcacag cgtcatatgt 301141 gcgtcacatt cggcgatttt tcggtaattt gcccctcata ccctcagacc atgcctacgg 301201 ctgggagttc gcgcgcgcct gccgcggctc gcgagatcgt cgtggtcggc cacggcatgg 301261 tgggccatcg gctggtcgaa gcggtgcgtg cccgtgacgc ggacgggtcg ctgcggatca 301321 cggtgctggc cgaggagggc gatgcggcct atgaccgggt cggtctgacg tcctataccg 301381 aaagctggga ccgcgccctg ttggccttgc cgggtaacga ttacgccggt gaccagcggg 301441 ttcggttgct actaaacacc cgagtcaccc agattgaccg ggcaaccaag tcggtggtca 301501 ccgcggcagg gcaacggcat cgctacgaca ccctggtgct ggccaccggc tcctacgcat 301561 tcgtcccgcc ggtgcccggc cacgacctgc ccgcgtgcca cgtctaccgc acctttgacg 301621 atctcgacgc tatccgcgcc ggcgcccagc gcaccctgga cggcggtcac accgatggcg 301681 gggtggttat cggtggcggc ctgctgggcc tggaagccgc caatgcgctg cgccagttcg 301741 ggttgcagac acacgtcgtc gagatgatgc cacgattgat ggcccaacag atcgacgagg 301801 ccgggggtgc actactggcc aggatgatcg ccgatctcgg gatcgcggtg cacgtcggga 301861 ccggtaccga gtcgatcgag tcggtgaagc attcggatgg ctcggtgtgg gcgcgggttc 301921 gcctgagcga cggcgaggtg atcgatgctg gggtggtgat ctttgccgcc ggcatccggc 301981 cgcgcgacga gttggccagg gcggcggggc tggcgatcgg cgaccggggc ggtgtgctca 302041 ccgacttgtc ctgccggaca agcgatcccg atatctacgc ggtcggcgaa gtcgccgcga 302101 tagacgggcg gtgttacggc ctggtcgggc ccggatacac cagcgccgag gtggtggccg 302161 accgactgct ggacgggtcg gccgagttcc ccgaagcgga cctgtcgacc aaactcaagc 302221 tgttgggtgt cgacgtcgcc agcttcggcg acgcgatggg ggcaaccgag aactgcctcg 302281 aggttgtcat caatgacgcg gtgaagcgca catatgccaa gttggtgctc tccgacgacg 302341 ccaccacgct gctcggtggc gtgctggtgg gcgatgcctc gtcgtacggg gtgctgcggc 302401 cgatggtcgg cgccgaactg cccggggatc ccctggcgct gatcgcgccg gccggatctg 302461 gggccggcgc tggcgcttta ggtgttgggg cgctgccgga ttcggcccag atctgctcgt 302521 gcaacaacgt caccaagggc gagctgaagt gcgcgattgc cgacggttgt ggggacgttc 302581 ccgcgctgaa gtcatgcacc gcggccggca cgtcgtgtgg gtcgtgcgtg ccgctgctca 302641 agcagctgct agaagccgag ggtgtggagc agtccaaggc gctgtgcgag cacttcagcc 302701 agtcgcgcgc ggagcttttt gaaatcatca ccgccaccga agtccggact ttctccgggt 302761 tgcttgaccg ctttggacgc ggaaagggtt gcgacatctg caaacccgtg gtcgcctcta 302821 tcctggcatc caccggctcc gaccacattt tggacggcga gcaggcctcg ctacaagatt 302881 ccaacgacca cttcctggcc aacatccaga agaacggcag ttactcggtg gtgccgaggg 302941 tgcctggcgg tgacatcaag ccagaacacc tgattttgat cggccagatc gcacaggact 303001 tcggcctcta caccaagatc accggcggtc agcggatcga cttgttcggc gcccgggtgg 303061 atcagctgcc cttgatctgg cagcgactgg ttgatggcgg catggaatct gggcacgcct 303121 acggcaaggc ggtgcggacc gtgaagagct gcgtgggcag cgactggtgc cgctacggtc 303181 agcaggattc ggtgcagctg gccatcgacc tggaactgcg ttatcgcggg ctacgggcac 303241 cgcacaagat aaagctgggc gtctcgggtt gcgcgcggga atgcgccgag gcgcgcggca 303301 aggatgtggg cgtgatcgcc accgagaaag gctggaacct ttacgtcgcc ggcaacggcg 303361 gcatgacgcc caagcacgct caactactgg ccagcgacct cgacaaagag acgctcatcc 303421 gctacatcga ccgctttctc atttactaca tccgcacggc cgaccggctg cagcgaaccg 303481 cgccatgggt ggaatcgctt gggctggacc atgtgcgcga ggtggtctgc gaggactcgc 303541 tgggtctggc cgaggaattc gaggccgcga tgcaacgcca tgtcgccaac tacaagtgcg 303601 agtggaaggg cgtgctggag gacccggaca agctgtcccg gttcgtttcc ttcgtcaacg 303661 cccccgatgc cgtcgactcg acggtgacct tcaccgagcg tgccgggcgc aaagtacctg 303721 tgtccattgg tatcccgcgg gtccgatcat gaagtccggg aggacaaagg agggactgtg 303781 acgcttctca acgacattca ggtatggacc accgcctgcg catacgacca tctcattccg 303841 ggacgtggtg tcggggtgtt actcgatgac ggtagtcagg tggcactgtt ccggctcgac 303901 gacggctcgg tgcacgcggt cggtaacgtc gacccgttct ccggtgctgc ggtgatgtcc 303961 cgcggcatcg tcggtgatcg cggaggtcgc gccatggtgc aatcgccgat cctgaagcag 304021 gctttcgcgc tcgacgatgg ctcgtgcctc gacgatccgc gcgtttcggt gccggtgtat 304081 ccggcgcgcg tcacacccga aggccgcatt caggtcgcgc gggtagcggt ctagctcacc 304141 ccgcgaacct cacagcttga gcacacgtcc ggcgatgacc agatgtacct catcgcagac 304201 ggctgccacg cgtcggttga ttgtgcccag tagatcgcga aacagcacgc ccgaagaatg 304261 ggatggcacc accccgaggc cgacctcgtt cgtcaccacg atcgcagtgg gcaatccggt 304321 cagcgcggcg cacaacccgt cgagccgtgc ctcgaggacg gcgtagacgt ccgcggtcgc 304381 agcagaccac aacgcctcgc catccatgat ggccgtcagc caggtgccca agcagtccac 304441 gagcacggga cttcgtgcct cggacaaagc cgtcgcgacg tcggccgttt ccaccgttag 304501 ccaggtcggt gggcggcgag cgcgatgcag tgcgacccgg gcgtcccaat cgggatcgct 304561 gccagcggcc gggcggccag gcgcgacgta gacgacgtcg gccgcatcgc ccaacaacgc 304621 ttcggcgtgc gtggactttc ccgagcggac gccgccagtg accagtatcc gcaccgggtc 304681 atcgtaggtg gggcggcctc atggcgcgcc cggagcgaga aagggcaagg tcggcgggca 304741 accatggcgg gccaggttga gcagcgcatc gacgtcgagg tgtcgttcga cgagatcgcc 304801 gagcaggtcg aggcggcgct cgcgtgcggc caggaagcat gagcccgacg gggcgaggcc 304861 gagcgtctct cgcaggaagg cctcgcgcag ggcgtcgcct tccaacgagc cgtgccacat 304921 ggtgccgaac accggtccgt cgcgcgcgcc gccgaggaac tcctcggcgg tgtcaccgcg 304981 ggtgatccgg ccgtggtgaa tctcgtaccc cgacgcgggc acaccgagtc cttcgccgcg 305041 cggtagccgc agcaccttgt ggggggaaaa tgcggtctcc acgtcgagca aacccaagcc 305101 ctcgacctcg gtcacctgcc ctcccggacc ttcgatgccg tacgggtcgc gaatcacccg 305161 gcccagcatc tggaacccgc cacaaatgcc gagcagcggc ttgcccgccg caacatgcac 305221 cagcagcgca cgatctaggt ctcgcgccct cagccaggct agatcggcga tcgttgcccg 305281 ggtgcccggc aacacgatca gatcggcatc gtccagcgcg cgggggtcgg aagcgaacac 305341 gacatccaag tcgggctcaa gacccaatgc gtcgacatcg gtgaagttgc tgattcgtgg 305401 caggcgcacg acggctaccc ggcgggctcc ggtgcccgcc gcgcgccggc cctgtaggtc 305461 gagggcatct tcggagtcca gccagaggtc ggggtgccac ggcagggtgc cgtacaccct 305521 gcgcccggtg acccgttcca ggtcgcgcag acctggcgcc agcaggtcgg agtcgccccg 305581 aaacttattg accacaaacc ccgcgaccag cgcctggtcc tcggcagcca gcaacgcgac 305641 ggtgcccagg aacgcagcga acaccccgcc gcggtcgatg tcaccgacga cgatggtcgg 305701 cagtcccgca tgacgggcaa gccccatgtt gacgtagtca cctgcgcgca ggttgatttc 305761 ggccgggctg ccggcgccct ccgcgacaac gacgtcgtag cgggcggcga gggcgtcgaa 305821 ggcgcggcat gcggcctcgg cgagcgctcg ccgccccgca caccagcttg acgacgccac 305881 ctcgccccag ggcttgccca tcaacaccac gtggctgcgg tgatcactgg ccggcttgag 305941 caagaccggg ttcatcgccg cctcgggcgt ggtcctagcc gcgagtgcct gcacccattg 306001 cgcccgaccg atctccacgc ccgtgccgtc ggggcctcgg cagaccatcg agttgttgga 306061 catgttctgc gccttaaacg gcgccacccg cacaccgcgt cgggccaacg cgcggcacag 306121 ccccgcggtc acggcgctct taccggcgtc gcttgtcgta cccgcgacca gcagacccga 306181 catccgtctc ccgaaggttt ctcactccac ccgggtcgct gagtcggtgt cccaggttcc 306241 gggcatcatt ggcgtgcgtg ggctgccgcc gaacgcgtcg ttgggtaacg tgatcagtcc 306301 tgcgacttgt ccgggactgg ccttgtgggt tgttccggcg aaacccaggg ttcccgctcc 306361 ttgaggcgaa cccgtcgggt cgtggccggt ttcggggtct aaatccaggt attcgtaacc 306421 gcggccgagc tgtttgatct ttggccgccg acgccgctgc ggttgaacct gttcctcggg 306481 cgccgccgcg gccgctgggg cctcggcgct gtcgggttcc ggcgtcttct ttcgaacgcc 306541 ggtgccgacg gccttcctgg cctgcgccgc cgagttcagg tcacccacca ggtacccgaa 306601 gctttgtatg ccggctccgg tcaccggcgg cggggcggtc accggcggcg gtggcggccc 306661 gggcgggggc gtcggcgcgg tcacggccgt gggggccggg gctggggctg gagttggggt 306721 gggggtcggg atactcgggg caatcgccgc gaccggcggg atgacgggcg gcgcggatgg 306781 cgggatgcca accaggcccg ccagcccaga caagcccgcg aagccgcctg ctgcactcgc 306841 aggggcaagg gtcaacggcg ccagcggggc ggctagcagg ggcagcgccg ccgggagcaa 306901 cgcaagagtt tgctcgagca gcgttttaac cagcgcgatg gtatcggtga tgatcgtgcc 306961 aatggcttcg accgtggtga acatcagggt aaaagcgata gttgcgggat tgcccgacgc 307021 gaacgccgcc gccagatccg ccccgatgaa tgcgaaggtt tgcgacagga aggcgacata 307081 agatccgatg tccatggggt agccaagggc gaacgcaatg ttggccgggc ttaggaaggt 307141 tagcggatta cccagcgagg gcagccacgg atcaaatccg gaaaacatcg cttgcaaaaa 307201 ggggaggttg gtcagccagt tgatgaacgg ttgtataacg ttgttgtaga agtcggtata 307261 cccgatcttc tgcaaccatt gcagccattc ctggacttgg ttcggctcgt cggaagccgc 307321 tgtcggcgcg ttggctttca cgatctgggg ggctggggtg gtctgcggtg cggcggccac 307381 cgccgcggtc gagaccgctt gatagctggc catcgtggtg gcggcctgga tccacatccg 307441 cgcgtagtcg gactcgttga gcgcgatcgg gatggtgttg atgccgaaga agttcgtcgc 307501 catcagcacg ccgtggaggg cgtggttggc gcccagctcg gccaacgttg gcatcgcggc 307561 caaggcggtg ccgtaggcgg tggccgcggt ttcttgccgg gtggccatgg ccgcgctgtt 307621 agcgctggcc tgcaccagcc acgccagata aggggtatgg gcggccacgt aaaccgcggc 307681 ggtcgggccg tcccaggtgc cggcctgtac ggcggccaac agcgcggcca gctcgtcggc 307741 cgtctccgcg taggcgatgc tcaacgagtg ccacccctcg gccgacacca gcagcggacc 307801 gggcccaggc ccgctgctta gcagcgccga gtgcacctct gggggcgaag ccatccagat 307861 cggggcggtc atcggcggct gaccgccggc ggaggtgtcg tcgcgtcgcg agcagccacg 307921 ttaaggccca gcagcgtggt ggtggcccga ccgctagaca aggtttggag cgtcatgacc 307981 ggttagcttt ctcggggtac accgccccgg gtggcaggac gcgatgacgc gagtctcctg 308041 gctcccggat cgttgcttgc ctcgccttcc agcctgtggc cgtggcttat gagggtcgct 308101 ccccggtgac agtggcggga ccgcgccgga ttctcaccgg cttcctgcat cgtcatcgcc 308161 tgacgggaag aatattggca tgcagagcgt ggatttgcac gttgagcggc atttgccaag 308221 caggggtcgg tcacatcgca cggtcgcaac agtcacatgt gtcactgcac taggcgacat 308281 ccgatctgcc cagctctcag cgacaggcgc ctggccggcg gttttgttcc caagttggtc 308341 gtggctgtgc gggattggag gcggcgttga cctgcagaaa ccgagttgtc gcgcttagct 308401 gggcacagcg accatcgccg acggcggagc tcggcgtcgg tgagtcgctt cggtcggccg 308461 gggcggcgcg attcgggttc gaccacgtgg tcgtcgacca gctgacgcgc cgaacgtgca 308521 accacggcgg cagcgcccgg cgacgtgtcc ccgccaccag tacacgttcg gcgcagccag 308581 tgcacacacg gcacggagtt taggacttac tcatttggct atccgcgacc gatatcgccg 308641 accaggtagc gctgcattgt cgggccaatg gcgtcgacca gcatctccac cgacatggag 308701 tgcagcggct cagaacgcac cccgtagcgc atgatgccca aaccgacgag ttgagcggcg 308761 cacagcgacg ctcggatggc aatcttgtcg gccccgagca tcttgagcaa cgggttgaag 308821 accggtccga tgaacatgga ctgcacgatc tcggcggtct tggctagccc ggtggttgcg 308881 atggcgctcg ccgcaaaggg accgccgccg gccgcatccc aggtggtgat cagcacgtag 308941 agggttcggc ggcctacctg gttgacgctt ccggtgacga ttttttcgat gaaatccggt 309001 gtgccgaagg gcaaccgcag catcttcgct accgggtcga ggagtccgcg tgatggctct 309061 tggctacggg ccatggtgtc aggatcaccc cgctgtgatc aaagatcaag cgtcaccggt 309121 gtcggcgtgc catgccagcg gtgcagccgt tgctgacgtg ctaccgcgct gcgaaatcgg 309181 ttcgcgacca gctgtgccaa gcccggatgg gtgccgagcg gtcgggttac cacatcggca 309241 ccggatgccc gcagccgctc ttgaaaaagg ccttctgcca acaggaagga ggcgaccacg 309301 acgcggcgcg cacctcggtt ggcttcggcc cggtctcggg cccgctgcac agccgtgcgc 309361 acatccggac cgccggtgcc cgcaaatccc atgtccaccc atgatccggt cagttcggac 309421 actagcgtcc gagtggtgtg caggtcggca cgtgcccgcc tatccgacgc gccggccgct 309481 gcgaggatca ctgaatcgcc aggacgccaa ccggattcca ccagctgctg ggtgactatc 309541 tgcgcgatct cacggcatgg ccccaacgcg ggggtgaccg tgacatgcgg gtgcgcactg 309601 gctgcgacat gagcgggcag gtcggtgcga acatgatatc cgcgggacaa gaacgcgggc 309661 accacgattg cgggacggca ggaaagggcg gaaagcactt cgctgggtga gggtccgagc 309721 acatcaacga aggcgacctg cacagtgcgg tcgacgagcg cgctcacttg cgcggcgatg 309781 tccgctatca tcgcgacacc ggacggtctg cgggttccgt gggccgtcaa gatcaggttc 309841 atacgtcatc gtgccggctg tcaacggcga gacggtagcc acgtttcacc actgttgcca 309901 cgatgttctt gtcgcccaga gccgttcgta gccgcagtac ggcggtgtcc acggcgtggg 309961 tgtcgctgcc gtcgccgggt aggacgcgta gcaagtcgcc acgagagacg acgccgccgg 310021 ggcgatgtac caacgcgcgc aaaatcgcca ttccggacgg cgatagtggc ttcaccgaat 310081 catccaccag cacagaggtt ccacggatct cgatcacgtg gccggctgct ttgaacgtgc 310141 acgaacccag cagcggcagc tcctcggcaa tgtggcgggc taaggctccc aaccgcattc 310201 gctcgggagc cgacgtcggg acgccctttc ggatcaacgg ccgcgaagtt accgggccga 310261 cacacatcgc gtgcacgtcg gtacgcagcg cagccaacag ttggtcctcg atatccaatt 310321 cacggctgcg ttctagcacc gcggctgcgg caggtgccga cgtgaaggtg accgcgtcga 310381 attgtcgtcg cgcgatcccg gtgactaaat ggtcgaacac gccgcctagt ggcgccggct 310441 tccaccggta aacccggatc ggcaccactt gcgcgccggc gaaacgtaac ccgcccagaa 310501 attccggaaa cgggtcccag ctgtcggcgg caccgtgcag ctggacggca atacgcgtac 310561 gggacacccc cgattcgagc agatattcca gcacttcatg cgacgattca gagtcggggg 310621 accactcttc acgcaggccg gcggcacgca gcgcaccagt tgcctttggt ccgcgggaga 310681 tgatccgggc cgacgacaac gattccagga gctcgttggc cagcccccac ccctcggccg 310741 cggccaacca gccgcgaaat ccgatgccgg tgtgggcgac cagaatgtca ggcgggtcgg 310801 cgatcaacgc ctcggtgttg ttctgcagtt catcgtcgtc gggaagcgcg atcatcttga 310861 tcgctggggc actacagacc tcggcgccct ggcggcgaag caatgcgcac agctcttcgg 310921 cgcggcgagc ggatgtcacc gcgatccggt agccggtcag tggcgccgag tgtgcctggg 310981 ccatatgacg tgtctaggcc tgtgaggttt cagtcgcgtt accaggcaat tgctgccgga 311041 ttgcccattg ccgataccca cctctgtggc ttcgggcgtg gcgctagacg taggccaagc 311101 ccgcgggtgc ggtggtcgcc ggcacgagct cgccggcgct ctttaggccc cgacgcacat 311161 aaatcgccca ggtcaacacc gaggcgacca ggtagaacac cccgaaggcc caaaatgccg 311221 aggtggccgt gccactggtc aggtaggact ctcgcagagc caggttgacg cccactccgc 311281 cgagcgcgcc gaccgccccg gccaggccga tcagcgcgcc tgacatcgac cgcgaccact 311341 gcctgcgctc ggcttcactg atctgcagcg aatggctgcg cgcctcgaag atcgacggaa 311401 tcatcttgta cacagagcca ttgccgatgc cggacaaaat gaacagagcc gtgaagccga 311461 tgacgtagcc gaccatcgtc gcagtcggca tcggcccggc caggtggtca ccgaaagtgc 311521 ttgcgctgat gagtattccg gtggccagca gcatggcgca gaaggcagct agggtgactc 311581 ggccgccacc gatacggtcg gcgagcttgc cgccatatat tcgggacagc gatcccaata 311641 gcggccccag gaaggcgatc tgggccgcat gcagcgaggc ctgcgccgtg ctctgaccgc 311701 tggcgatgaa gttgatctgc agcacctgac cgaatgcgaa agagaacccg atgaacgagc 311761 cgaaagtgcc gatgtacagc agcgagatca cccaggtgtg cggctcggac actaccgcac 311821 gcatggtgtt cagctcgatg cgatactccg tcaggttgtc catgtacagt gcggcgccga 311881 ggccggcgac cgccagcagc accagatata tcgcgcacac ccagtagggc tcgcggtcac 311941 cggccgttgc gatcaccagc aggccgacca actgcaccat cggcaccccg aggttgccgc 312001 cacccgcgtt gagcgcaagc gcggcgccct tgagtcgttg cggaaagaaa gcgttgatgt 312061 tcgtcatgga ggcggcgaag ttgccgccgc cgaggccggc tagcgcaccg cacaccagat 312121 acggccacag tggcaaacca gggttggcca gcaacagaat gctgccaacg gtcggaatca 312181 acagcaccag tgcagaaaag atggtccagt tgcgcccccc gaactttgcg gtggcaaatg 312241 tgtaagggaa gcgcaggcat gccccgacca aggtcgcggt ggcgccgagc aggaacttgt 312301 cgccggcgga aaagccgtac accgatgtgg gcatgaacag caccatcacc gaccagaggg 312361 accagacgga aaatccgacg tgctcggcgg ccaccgacca gatcagattg cgtcgggcga 312421 tgaatttgtt gccggcctcc cacgccaccg agtcttcggg atcccagtcg gagatctggt 312481 gggaacggcc catactgacc cctatcgtga tcgacgttct cgatcacgct agaaatcctt 312541 tgttgcccgg gcgcttccgg tagtgacccc ggcgtgaact ttcgctcaca cggttaccgc 312601 cagcgtgtga gggcggccgt gcagcggagc ggattaccag acgtcgcccg cgcgccaatc 312661 gcacatcagc tccgccgagg tgtccaggct gatgtcgatg ggcaggacga acaccgttcc 312721 gtcgtcatcg ggtgtacgga ctggaccggt tggtgccagt accgatgtcg ggccgtgcca 312781 gggcagccag ccgcgtgagg cgtacagtct gcgggcccgc gccgaggaac tgagcgctcc 312841 gagctggtaa gcgccgcgca tcacctgctc gacggcgtcc aacagcgcgc tcaccaggcg 312901 ttggccccgc cagtccgccc gcaccgcaac gccttcgacg tacccgcagc gcagcgcgtt 312961 gccgcggtag atcagtcgcc gctggatcac cgcggcatgc gcgatgatcg ccccgtgatg 313021 ccagatcagg gcgtgcatcc cacccagcgt gtgctcccag tcggtctcgg tgaagtcacc 313081 ggcaaacgcg ccggtgacca tctgacggat gtcctggcgg gtctcgctgt caagatcggc 313141 ggtgtggacc aggcgggccg tgtgtacctg ggtgtgcaca gtccctgtct accaggcttg 313201 tgttacaccc tggccaggca accgagaccg gggtcgtgcc cagtgcagtc gcacatattg 313261 gccgggccgt atctgcgcga ccttgtcgat gtcctcgtcg gtgatgacgc cgacgaccgg 313321 atagcttccg gtgatcgggt gatccggccc caggatcacc ggtaatccgt tgggcggcac 313381 ctggattgcg ccgcgggtaa cgccttcgcc gggcagttgc cgatccggcc agcggtgctg 313441 tagcgggcgg ccctgtagcc gcattcctac gcggtcactg cggttggacg ccatccagat 313501 ggtatgcacc aacgcgtccg ggtccaccag ccagtcgtcg cgcggcccgg gcaccacccg 313561 cagctccacc agatgctcct cgatagcggc caccggtgcc tggtcgagtt cgggatagtc 313621 gtcggtgtgt tcgccgaccg gcagcacgtc tccggcccgt agcggcgacg ggccgatcgc 313681 cgacatcacg tcgtagctgc gtgaccccag cacgggctcc acacagacgc cgccgcgcac 313741 cgccagatag gtccgcagcc cggcccgtgg ggtgcccagt gagatcacct ggccgtcccg 313801 gacgtggtga atgctgttgg tgccgaccat gattccgttc acggtcggat cggtgtcggc 313861 gcccgtcacc gcgatgtcga cgtcgccgcc gcgaacccgc gccgagaagc cgccgaaggt 313921 cacttcgacc gtggcccaat cgtcggggtt ggcgactagc cggttggcca gcgtgtggga 313981 gcggcggtcg gcggcaccgg atcgaccgac accgagatgg gccagtccgg cacggccgag 314041 gtcttcgacg agggccagcg gtccgctgcg caggatttcc agtgttgtca tggctgcttc 314101 ctccagctca ggcggcccgg aactgaaccc acatgcccgg tgtgagcagc gccggctggg 314161 gtcggtcgac atcccacagg accgcgtcgg tgtggccgat gatctgccaa tcgctgggcg 314221 cttgagatgg atatatcgcg ctgaatccgt cggcgagggc gaccgatccg ggcggcatcg 314281 aggtgcgccg ttcgggccgg cgcggcaccc gcaggctcgg gtcgccgtcg atcaggtagg 314341 cgaaccccgg ggcggaccca ctgaatcccg cccgccatcc ggtggcggtg tgggcgttga 314401 tgaccgctgc ggtggtcagg ccggtgcagc gggcgacctc ggcgaggtct gggccgtcgt 314461 agacgacgtc gattaccagg tcgcatcggt gatcggccgc agccaccgcc tcgggggtga 314521 cccgcaacct gcgcagccgc tgacgggtga ccccttggta gcggggcgcg tccagcttca 314581 ccaatacggt gcgcgaggcc gcaacgatgt cgaccacacc gggtagcgcc gcggctcgca 314641 atgcatcggt ccatgccatt gcgtcagcgg tgctgtcaca ttgcagcatc agcgcatggt 314701 cgccgtagtc gagcacggtg caggccaatg ccgcgtccat aaagacgtcc atcacactca 314761 tgcgtcgacg gtagcgctgc aatcttcggc tcggccaggg attttcgaga ctgccagagg 314821 tgccttagca aatgctcatg cgcccaagat ctggctgatc tgtggcggca gttgctcggc 314881 caccacggga taactcagca ccgacgagaa ggcgatggct ccggcctgtt ccttggaagt 314941 gaagatgtgg cgccgctggg cggttgcctg cgacgccgca atctccgggt cggccaacaa 315001 cgccttctcg tcctcggggc tctcggtcat ccagatcagc acatcggcgg catcaagcac 315061 cgctttaatg tgatcgcgcg gaatgacgcc gcgctgatcg acggcgaagg gtttgatgct 315121 gtcggcgatc accagaccca tgtcgttgag gaagtcagtt cgccagcccg ccagggttgc 315181 gaccacgttg ccctgccaga ggcgaccctg cagcaacagc gccttcttgc cccgccagcg 315241 cggatgccgc tgcgccaccg cggcgaactt ctggtcgacg gcctcgatca gcgacctcat 315301 ccggtcggcc gcaaacaccg cctggccgat cgacctggcc tggtccttcc acggctcgaa 315361 gaatgcgtcg ccgccggact gggcgacggt cggggcgatc gccgacagct gctgataggt 315421 atcggcgtcc accccggcgt tgatcgccac gatcaggtcg ggttttaagg cggcgattcg 315481 gtcgatctga atcccgttgt ccaggttcaa taccgccggc cgcgccccgc cgagcttggg 315541 cgccgcccac ggccacaccg caaacggctg gtcaccgaac cagtcggtca ccgcgatggg 315601 caccacatcg accgcgagca agtcgtcctg ctcggtgtag ccggcgctga ccacgcgctt 315661 gggtggctct ttgatgacgg tctgaccgaa caggtgggtg atagttaccg ccgcgccgcc 315721 aggagtgccc ggcgggggtt tgggcgatga acagcccgcg aacagcccgg tggctgctgc 315781 agcctcggcg acctgcaaga atccccggcg gctgcatccc tgtcgcacag cgtgagggta 315841 tcgcgcgcgt taccgccggc gtcgggcgct ggtacttgct ggcccgtatc cgccgccgcc 315901 gggggtttcg atcaccagcg tgtcgcccgg ctcgacgtgc gttgagccgc atccggccaa 315961 ctcgacggtg ctgccgtcgg cgcgttccac tcggttgcgt cccagctctc cgggggagcc 316021 gccggccatg ccgtagggcc gaacccgccg atgaccggag agcgtgctga ccgtcatcgg 316081 ctcggtgaac tcgaggcgtc ggacggcgcc gtcgccgccc cgccagcgac cggcgccccc 316141 gctgccctga cgtacggcga actcgcgcag caacaccggg tagcgccact ccagcacctc 316201 gggatcggtg agccgggagt tggtcatgtg cgtctgcacc accgaggccc cgtggtaccc 316261 gtcaccggcc ccggagcccg atcctacggt ttcgtagtac tggtgccgct cgttgccgaa 316321 cgtgacgttg ttcatcgtcc cggatccctc ggcctgcaca cccaacgcgg cgaacagcgc 316381 gccggtgatc gcctgcgagg tttcgacgtt gccagcgacc accgcggcgg gatgggttgg 316441 tgcgagcatc gagccttcgg ggacgacgat acgcaacggg cgcaggcaac cgtcgttgag 316501 cgggatgtcg tcggcgacca gggtccggaa cacgtagagc accgccgcat tcaccaccga 316561 ggtcggtgcg ttgaagttgg tgtccagctg agccgaggtt ccggtgaagt cgatggtcgc 316621 gctgcgggcg gcgcggtcga cggtgatgcg cacggcgatc gtcgcgcccg aatccatgcg 316681 gtagcggtag gcgccgttgt cgagccggtc gatgacccgg cggaccgctt cctcggcgtt 316741 gtcctggacg tggcgcatgt aggccgccac cacgtcgcgg ccgaagtggt cgatcatttt 316801 tccgacctcg tcgacgccct tttggttggc ggcgatctgc gcgcgcagat cggcgaggtt 316861 ggtgtcggga ttgcgggaac cgaacggcgc ctcggtaagc aggcgccggg tttcggcctc 316921 gcggaaccgt ccgttctcgg cgagcagcca gttgtcgaac agcacgccct cttcgtggat 316981 ctcgcggctg tcggcgggca tggagccggg ggtgatgccg ccgatttcgg cgtggtgccc 317041 gcgagaggcg acaaagaata ggacgtcctc gccgccggtg ttgaacaccg gggtgatcac 317101 tgtgatgtcc ggcaggtggg tgccgccgtg gtacgggtcg ttgacggcgt atacgtcacc 317161 gggcttcatg ccgctcaagc gccggcggat cacttccttg acggtggtgc ccatcgagcc 317221 gaggtgcacc ggaatgtgcg gggcgttggc gaccaggttg ccgtccggat cgaacagcgc 317281 gcaggagaag tccagccgct cccggatgtt caccgactgg gcggtggctt ccagccggaa 317341 gcccatctgc tcggcgatcg acatgaacag gttgttgaag atctccaaca gcaccgggtc 317401 ggcctcgaaa ccggcctcga aaccggcccg agtggccgca tcgggccgcg gcggggtgac 317461 cactcgttgc gcgagcaggt gcccggtctc cgtcatcgtc gcctgccagc cgtcgtcgac 317521 gacggtggtg gcgttggcct cggcgatgat cgccggaccg gtcagcacgt cgcccggccg 317581 catcgcctcc ctacgccgca gcggtgcgtc gcgccacaat ccgttcgaat agatccgcac 317641 ggtttccgac gagccggtgg tgtcgttggc ctgatcgccc agctgggaca ggtcgggctg 317701 gtcggtgagc ccggtcgcct cgaccgagat cgcttcggcg atcagcggac gatccagcag 317761 gaacgtgtac agcgcgcggt ggctgctttc aaacgccgtg gccatggtct cgatctcggc 317821 cagttgcacg gggatcgcgg tatcggttcc ctcatagcgc aggtgcaccc ggcgaaccac 317881 ccggatgcgc tcacccggga cgccctcgtc cagcaactcg gcgcgggcgg ctcgttcgag 317941 ggattccgca acgctggcca aacgctgtgg cgcggcgggt ccgagcggga tctccaccga 318001 ttgttcgcgc attgcggtgg tgtcggccag gccgatcccc agcgcggaaa gcacgccggc 318061 cattggtggg atcagcaccg tgcggatgcc gagggcgtcg gccaccgcac atgcgtgctg 318121 accgccggcg ccgccgaacg tcgtcagcgc gtaccgcgtc acgtcgtgtc ccttttgcac 318181 ggagatcttt ttgaccgcgt tggccatgtt cgccaccgcg atccgcagat atccctcggc 318241 gacctgctcg ggtgaccggt cgtcgccggt ccgcgcggcg atgtcggcgg ccaggtcggt 318301 gaagccacgc cgcacggtcc cggcgtccag cggctggtcg ccggaaggac cgaatacgga 318361 cgggaagtgg gtgggctgga tgcggccgag catcacgttg gcgtcggtga cgcacagcgg 318421 tccgccgccg cggtagcagg ccgggccggg gtcggctccg gccgagtccg ggccgactcg 318481 gtagcggctc ccgtcgaaat gcaggatcga cccgccgccg gcggccaccg tgtggatgtc 318541 cagcatcggc gcgcgcagcc ggaccccggc aacctgggtc gtgaagacgc gttcgtactc 318601 gccggcgtag tgcgacacgt cggtcgaggt gccgcccatg tcgaagccaa taacatgatc 318661 gaagccggcc agcgccgaca tccgcaccat gccgacgatg ccgccggccg gaccagacag 318721 aatcgcgtcc ttgccgcgga agtgcccggc ctgcgccagc cccccgttgg actgcatgaa 318781 catcagtcgc acaccccgca tctggtcggc cacctggttg atgtatcggc gcagcaccgg 318841 ggacaagtag gcgtcgacca cggtggtatc cccgcgcggg accagtttca tcagcgggct 318901 gacctcagat gacaacgaga tctgggcgaa gccgatgcgc tgcgccagcg taccgatttc 318961 tcgctcgtgt cccgggtaga ggtaactgtg caggcacacc accgcgaccg cgcggattcc 319021 gtccgcatgg gcctgccgca tcttctcgcc caatgcctcc aggtcgggtg cccgcagcac 319081 ccggccgtcg gctgtgaccc gttcatcgac ctcgacgacc cgctcataaa gcatctcggg 319141 caacacgatc cgccggtcga agatgcgcgg acgattctgg taggcgatgc gcagggcgtc 319201 gccgaaaccg cgggtgatca ccagcagtgt gcgctcaccc gtgcgctcga gcaacgcatt 319261 ggtcgccacc gtggtgccca tccgcaccgc gtcgacgcgc gtgcccgcct cgccgttcgc 319321 tagcagcgca cggatgccgg ccaccgcggc gtcgcgatag cgtgccgggt tgtccgacag 319381 cagcttgtgg gtcagcagcc gtccgtccgg ccggcgcgcc acaacgtcgg tgaacgtgcc 319441 accccggtcg acccagaagt gccaccccgc gccaaccacc cggactcccc cttcacgctc 319501 gcagccggtc ccgtcctcac aacggcagac gggccgaagc cacctaaagg tatctccgct 319561 gtaacagcgc gcatccgggc cggtaacagg gtctctttag cgtcgagccg tcattaccgc 319621 tgatgtcgcc cgcttgtcga caggagacct aaccgatggc actcaccacc gccccggcaa 319681 tcgattatgc gctgccacgc cagcaggatg agggcgatca ctggatcgac gactggcgcc 319741 cggaagaccc ggtgttctgg gagacgatcg gcaggccgat cgcccgccgt aacctgatct 319801 tctccatctt cgccgagcac gtcggcttca gcgtgtggat gctgtggagc atcgtggttg 319861 tccagatgac cgccgccgct cccgggcacc ccgccgcgtc cggctgggcg ctgtccgcca 319921 gccaggccct atgtttggtc gccgtcccca gcggtgtcgg ggcgttcctc cggctgccgt 319981 acaccttcgc gatcccgatc tttggtggcc gcaactggac gaccgtctcg gcggcgctgc 320041 tggtgatccc gtgcctgctg ctggcttggg cggtgagcca cccttccctg ccgttcgcgg 320101 tgttggtggt gatcgcggcc accgccggtt tcggtggcgg caactttgcc tcatcgatgg 320161 ccaacatctc gttcttctac ccggagaagg acaagggttg ggcgctgggc ctgaacgcgg 320221 ccggaggcaa catcggggtg gcggtggtgc agaagatcat tccgcccatc gtggtcgccg 320281 gcagtggggt ggcactgtcg cgtgccggac tgttcttcgt gcccttggcc gtcgccgccg 320341 cggtgtgcgc attcctgttt atgaacaacc tcacggaggc caaggccgat gtgaagccgg 320401 tgtggcagtc gctgcggcat gccgacacct ggatcatgtc gctgctgtac atcggcacct 320461 ttgggtcgtt catcgggtat tcggcggcct tcccgacgtt gctcaagacc gtgtttggcc 320521 gtggtgacat cgcgttgggt tgggccttcc tcggcgcggg catcggttcc ctggtccgtc 320581 cgctgggcgg caagctcgcc gaccggatcg gcggtgcgcg gatcaccgcg gccagtttcg 320641 tcatgctggc ggccggggcg gctgcggcgt tgtggtcggt gcagtcggtc aatctgccgg 320701 tgttcttcgt cagcttcatg ttcttgttcg ttgccaccgg catcggcaat ggttcgagct 320761 accggatgat ctcgaggatc ttccaggtca aaggcgaagt cgccggcggg gatccggaaa 320821 cgatggtgaa catgcgccga caggccgccg gagcgctggg catcatctcc tcgatcggcg 320881 cgttcggcgg gtttgtggtg ccgctggcct acgcctggtc gaaggtgcac ttcggcaata 320941 tcgaacccgc cctgcacttc tacgtggcgt tcttccttgc cctgctcgtc gtcacctggt 321001 actgctactt gcgtagaacc acccccatgg gccaggtggg ggtgtagtta gcccggcggc 321061 ggtctcacgt tgtgagccac gcgcaaactc agactctgcc gatgtcaacg cccagctcgg 321121 caccgagctt gtccagcggc atggtgacgt gctctcgctc gtgcacgctg agccggtcgc 321181 gcaggtcttc aagctcttcc attaggctct cgtagcgctc agctgagatg aggatggctg 321241 cgggtcggcc atgattcatc aacacgacgt catcgtcggc ggattcacgc acaagcctcg 321301 ataggtgagc gcgggcttca ctaataggca ctagactgct ggtcatcggt agacccccct 321361 tcggtgtccg acgcgagtaa cggtgatgac acgcgctgcg tcgtcgacga tgtaaacaac 321421 gcgatagttg ccgaggcgga tgcggtaagt ggtgtcgaag ccactcatct tctcgcagcc 321481 acgcgggcgc ggttcgtcgg cgagcgcggc gacggcggtc agatgcgccg ctggtcgtgg 321541 cggtgcagcc gttggattgc tttagctgcc gagttctcga tttcgaccgc gtacccactt 321601 gccatacaaa aatgtacaga cttcagatgc gtaatataag cgctaatttt gccgacgcgc 321661 tctcaccgcg gccacgggct gtagtcggcg atcagctcct cctgcggcgg gcgctggtcg 321721 gccgggacgt gctgcaggtt gatccggatc cgataccaga tcgaactcgg cccgcgcatg 321781 ccgtcgacca gcacatcggc gggccgcagc aaagccgcgg cgccgggata ccggtcgcgc 321841 cagatgtcca acgccgccat cgcctcggcc cgcgtcttgg cgcgggcgat ctcgatcagc 321901 ggcttggcgg attgcgcctt ctgcggcgga cccagttcct cggccagcat caacagccgg 321961 tcgagccggc caaccgcgtc atccatcccg gcccaggggt caccgatgtc ggccaaccgg 322021 ctgggcacgg tggccatggt gaacaccgcc gggtcgcagc cgggcacctc ctcccagtgc 322081 agcggcgtgg acacccgggc atccggggtg gcccgcaccg agtaggccga cgcgaccgtg 322141 cggtccttgg cgttctggtt gaagtcgacg aacacgccct cgcgttcttc cttccaccaa 322201 cgactggttg ccgcgtcggg taggcgccgt tcgacctcac gcgcaacggt ctgggcggcc 322261 aggcgcacct ggggaaacga ccagcaaggc gcgatccggg catagacgtg aaagccccgc 322321 gacccggacg tcttcggcca tgcggtcaac ccgtaatcct ccagcacctc ccggaccacc 322381 aacgcgacct cgacgacccg ctgccacgcg accccgggca tcgggtccaa gtccacccgc 322441 agctcgtcgg gatgatcgag gtcgccggcg agcaccggat gcggattgag atccacacac 322501 cccaggttga tcacccacgc cagcccggcg gcgtcgtgaa tgaccgcctc cgcggcggag 322561 cggcccgacg catagtgcag ctcggccacg tccacccagt ctggccggtt tgccggtgcg 322621 cgcttctgaa acaccgcctc ggcggagatg cccttgacga aacgcttgag aatcatcggc 322681 cggccggcca ccccgcgcat cgccccctcg gccacggcga ggtaatagcg gaccagatcg 322741 aacttggtgt agccctttcg atcgttgtga gcggggaaga cgaccctgcc cggatgcgtg 322801 acgatgacct ggcgtccgtg cacgtccagc gacaccgggg cggccatgcg gctcatggta 322861 atttgcgacc cgcctcacat agggtgaggt catgcctaac ctcactgatc tgcccgggca 322921 ggccgtctcc aagctccaga agtccatcgg acagtacgtc gcgcgcggca ctgccgagtt 322981 gcattacctg cggaagatca tcgaatcggg cgcgatcggg ctggagccgc cgctgaacta 323041 cgccgcgctc gcagccgata tccgcaagtg gggggaagtc ggcatgctgc cgtcgcacaa 323101 tgccaggcgc gcccccaacc gggcggccgt catcgacgaa gaaggcacgc tcacgttttc 323161 cgaactcgac gaggccgcac acgcggtggc caatggccta ctggccaagg gtgtccgcgc 323221 cggggacggc gtcgccatct tggcgcgcaa ccaccgctgg tttgtcatcg ccaactacgg 323281 ggcggcccga gtgggggccc gcatcatctt gctcaacagc gagttctccg gcccgcagat 323341 caaagaggtg tcggaccgtg agggcgccaa ggtgatcatc tacgacgacg agtacaccaa 323401 ggccgtcagc ttggcccagc caccgttggg caagctgcgg gcgcttggtg tcaatcccga 323461 cgacgacaag ccgtcgggca gctccgacga aacgttggcc gagctgattg cgcacagcag 323521 caccgcgccc gccccgaagg cgagccgccg tgcgtcgatc atcattttga ccagcggcac 323581 caccggcacc ccgaaggggg cgaaccgtaa cacaccgccg acgctggctc cgatcggcgg 323641 cattttgtcg cacgtgccgt tcaaggccgg cgaggtgacg ctgttgccgt cgccgatgtt 323701 ccatgcgctg ggttacatgc acgccgcgct cgccatgttc ctgggctcga cgctggtgct 323761 gcggcggcgg ttcaagcccg cgttggtgct ggaagacatc gaaaagcaca aggcgacatc 323821 catggtcgtc gtaccagtga tgctgtcgcg gatcctcgac cagctggaga aaaccgaacc 323881 caagcccgac ttgtcgagct tgaagatcgt gttcgtatcc ggatcgcaat tgggtgccga 323941 gctggccacc cgcgcgctgg gggacctcgg cccggtcatc tacaacatgt acggctcgac 324001 cgaggtcgcg ttcgccacca tcgccggccc caaggatctg cagttcaacc ccagcacggt 324061 ggggcccgtc gtcaaggggg tgacggttaa gatcctcgac gagaacggca atgaggtgcc 324121 gcagggtgcc gttggccgga tctttgtggg caatgccttc ccgttcgagg gttacaccgg 324181 cggcggtggc aagcagatca tcgacggcct gttgtcgtcc ggcgacgtcg gctacttcga 324241 cgagcgcggc ctgctgtatg tgagcggccg cgacgacgag atgatcgtct ctggtggtga 324301 gaacgtgttt cccgccgaag tcgaggatct gatcagcggg catcccgacg tggtggaggc 324361 cgccgcgatc ggcgtcgacg ataaggagtt cggtgcccgg ctgcgcgcgt tcgtggtcaa 324421 gaagccggga gctgacctcg acgaggacac catcaagcag tacgtacgcg atcatcttgc 324481 ccgctacaag gtgccgcggg aggtgatctt cctcgacgag ctaccgcgca accccaccgg 324541 caaggtcctc aaacgtgagc tacgcaagct gtagctgctc gcgcgggtac ttacgggtcg 324601 cggggtaggc ccagcaaccg ctcggcgatg atgttgagct ggacctccga cgtgcccccg 324661 tagatggtgg tggcccggct ggctagcagg tactcgcccc acttgccggg caatcgctct 324721 gtgtcgccga tcaccgcatc ggtgccaaag gacgacaccg cgaattcggc ataaccctgg 324781 ccggtgcgca tggacaacag cttggagatc gccgccggcg ccatcgggtc acccccggcc 324841 agcgtcaaca gcgtggagcg caagttgagc agcttggtgg cgtggccctc ggcgatcaat 324901 tgcccggcac ggtgtcgcgc gacctggtcg aactgtcctt cgaaacggta atcgcgaacg 324961 aagtcgacga actcgcccag ggtggggagg aaggtcgaat cgctgccgcc gatcgacacc 325021 cgctcggccg tcagggtgtt gcggctgacc tcccaccccc ggttcacctc cccgagcacc 325081 aactcgtcgg ggacgaacac gtcgtcgagg tagacggtgt tgaaaaactc cttacccgtg 325141 agctcgcgca gcggcttcac ttgtacgcct tcgcttttca tgtccagcag gaagtaggtg 325201 atgccgttgt gcttgggcgc cgacgggtcc gtccgcgcca gcagcgcacc ccattgggag 325261 tactgcgcgc cggtggtcca gatcttctga ccagtgatgc gccagccacc gtcgacccgg 325321 gtggccttgg ttgccaggct agccaggtcc gatcccgcgc ccggctcgga gaacagctgg 325381 caccagaaaa tgtcgcctcg gaacgttggc ggcaggaggc gctgcttctg attgtcggtt 325441 ccgaacgcga cgatcgacgg cacgatccac gtcgcgatgg caatctgcgg ccgcttgacc 325501 cgcccggcgg tgaactcctg ggcgatgatg atctgctcga ccgggctggc ggcccgaccc 325561 cacggcttgg gcagatatgg cagcacccac ccaccttcgg cgatcgcgac agtgcgcggc 325621 tctcgcggca tcgccttcag cgcggcgact tcggcccgga tctgggcccg cagcttctcg 325681 gtagaggggt ccaggtcgat gtctaccgga cgcataccgg cagtcgtcgc ggtgtccacc 325741 acccgctgcg gatactccga gccgcggcca aagcacgcgg ccagcatcaa cgcccggcgg 325801 tagtagacgt tcgtgtcatg ctcccaggtg aagccgatgc cgccgtgcac ctgaatgcag 325861 tcctgcgtgc agcgctgagc ggtcgccggt gccagcgtcg ccgccaccgc cgccgcgaat 325921 tcgacgtcgg agctagattc gcccgcgtcg tctaaggctc gcgccgcgtc ccacaccgcg 325981 gcggtggccc gctcggtgtc agcgatcatc tcggcgcact tgtgcttgat cgcctggaat 326041 tgcccgatcg gccggccgaa ttgttcgcgg atcttggcat atgccgacgc ggtgtcggtc 326101 gcccaccgcg cgacgccaac ggcttcagcg gacagcaggg tggacatcag cgcgtgagcg 326161 gtcgtcatcg tgaggttgct cagcagggcg tcgtcgctga cgtcgaccgc gttggcccga 326221 acatgcgcga tgggccgcaa cggatccagg ctcttgaccg cttcgatctc gagctgatcg 326281 ttgcgcagta caacccactc gtcacggctt tcgatggcca ccggtagcac cagaacggag 326341 gcttgcgccg cggccggaac cgcgcggact tcgccccgga tcaccagcac gtcgccatgg 326401 cgggtggcgg tcagcccgga atctagcgcg taggcggcga tggccgcacc ggttgccagt 326461 tcggcgagga ctttggcttg cggatcatgg gctgcgatca gcgcgctggc gatcgccgac 326521 ggcacgaacg gcccgggcac ggcgccgtag ccgaactcgg caagcaccac cgctagctcg 326581 aggatgccga aaccctggcc gccgaccgac tcggccagat gcacaccctg caagccttgt 326641 tcggccgcgg cctgccagta aggcggcggg ttttcgaccg gtgattctag cgccgcgtgc 326701 agcacctcgg acggcgctac ccgcgccacc agggaacgca ccgaatcggc cagctcataa 326761 tgctcaggag taatagcgat cgacattgct cgccttccca tgctgttgga cgtttcggcc 326821 aagcaccttc caagctaaca accggtgggt cggttattaa cgttggctag cggatggccg 326881 gcgaaatggg tgagaacact cagcgccacc gcttggctat ccacttggcg atggtgtcgg 326941 cctgctcgct gcgggcgccc ggggtggtga agtaatggtc ggtgtcgatc gagacctgag 327001 tcttgtcgct gctggcgagc ccgtcgtaga tctgctgggc atccgacggg aagattccgg 327061 tgtcggcctc ggcgttgagc accagggccg ggcaggtgat ccgggccagg tggggtgcgg 327121 cacgggtttg ggccacccgc aggctccaca tgcccagcca gccgcgcagc gtgcaggccg 327181 cggcgatgcc gtgtgcggag cggttcgcct tcaccggcgt gcccgcgtag cactggttgg 327241 gccgacgctt ggtcggttcg atgctgggat cgaccatgcg cgggtcggcc caggtacgca 327301 tcacgctgaa cggccgatca gaaaagccag ctgcgcgaac acgtttgagt tcggattcgg 327361 cccagtcggt gatggtgtgg ttgcgtttga cctgcgcgga gcgataccgg ctgataaact 327421 ccggtgagta cggcggcccg ttgcgttcgt cgaacaggtc aagttcggga tcggttgcaa 327481 ccggatcatt ttcgtcaatg acggcggcgt ccatccaagc ggtgagcaca tccggacggc 327541 cgagatgagc tgcggcggca acgtatgcgt cggcggccgg caattcggtt accccggctg 327601 cgggtcgcat accgtccagg ggagtcacgt tcggatcgac cgcttgtgat tggtaggcgg 327661 ccatcaatga gccaccacct gaattgccaa gcaacaccac tgtttccacg ccctgaactt 327721 cgcggagcca gcgcaccccg acgccgatgt cgaccagtgc gtgatcgagc agaaagctgc 327781 tttcgaaacc acggaatcgg gtgttccagc ccagaaaccc gatgccgcgg atcgccatgt 327841 actcggcgag atagtgctcg gagaaatcga tctggtagtg cgcggcgatg agcgccacct 327901 tcggtttgcg tcccacgctg tggtggtaca gcccctggca tgggtgccca ccggcagccg 327961 cacgccccgc ggttcgcgac ggcagcccga cgaactctcg gatgaccccg ggcgtggcag 328021 cacgaccagt caatttgagc tgtcctcctt actgtagatg gcgcggtagt aaatgttggc 328081 caaggtctgg atgcacgcct gatcgtcagg ttgtccacgg cgtgaacgct taccgctgag 328141 ctgcaggtag caaaactggt tgaacattgc cacaatcgcc tcggccatca actgggggtc 328201 atcgccgacg caatagccgt gcgcctgagc gcgtttgacc gtctcggtga tgaacgatat 328261 tggaatctgg catatttcgg accagtattg cgcgaagtcg tcactgacca tcgccaactg 328321 tgacacgctg atcgcttctg cgaggcggtt gcggtaggtg taccaatggg cggcagcggc 328381 ttcatacgcg cgctcgcggt cggataggcc gtgccggatc accgacaatg cccgctggtt 328441 ggcgtcgtcg cggaagcgca gcgcccactg ccggaccatc gcctctttgg agtcgtagta 328501 gttgtaaaag gatgccgccg agcggccggc ttcggcggtg atgtcggcga cggtggtcgc 328561 caggattccg ttgcgcacca cgaccgtccg cgcggcggcg tcgattgcgg cctgggtccg 328621 ccgaccgcgt tgcgtcggga agtccggcac ctgggcacct ccctggaaca aaactgaacc 328681 tgatgttaga ttcagattca gagcttggcc aggccgccgt cccggggagc caatgggagc 328741 cgcacgatga tcaagccgca caacaccaac accgaattcg agcttggtgg gatcaaccac 328801 gtcgcgctgg tgtgttcgga catggcgcgc accgtggact tctacagcaa catcctgggg 328861 atgccgctga tcaaggcgct cgatctgccc ggcggccaag ggcagcactt cttctttgac 328921 gccggcaacg gcgattgtgt cgccttcttc tggttcgccg atgcacctga tcgggtgccc 328981 ggtctttcgt cgccggttgc catccccggc atcggcgaca tcaccagcgc ggtgagcacc 329041 atgaaccatc tggcgtttca tgtacccgcc gaaaggttcg acgcctaccg gcagcggctc 329101 aaggacaaag gcgtgcgggt cggcccggtg ctcaaccatg acgacagcga gacgcaggtg 329161 tccgcggtgg tgcatcccgg tgtgtacgta cgctcgttct acttccagga ccccgatggg 329221 ataactctgg aattcgcttg ctggacaaag gaattcacta cgagcgacgc gcaggccgtg 329281 ccgaagacgg cggctgaccg gcgacctccg gtggctgcgg atcgttagcc ccggatttgg 329341 cagctgttgc cgctacccgg ggacgggaca agtttgggtc ggtgagttca tcgagcagcg 329401 cagctagctg atcgaccagc tggtcgggat cgagtcgcac gtcaccggcc agccaggcgc 329461 tgatggtctg cccgacgccg ccgacggcga agtgtgcgac cgccttgacg tggtcatttg 329521 ccggtgcgtg cagggtgtcg acggcatgtt ggccggacag catggcgaac agggcgctgg 329581 attccgcacg cttgcgggtg atcactgcgt tggccagctg tgtgctgaac agcaggcgtc 329641 cgacgcgggc gtctgcggtg atggtccgca cgatgttggc catgcccgcg cgagtctgct 329701 cccgcgccgg taccgccgtg accgcggcct gagtggtggc gaccagctcg gccaccaccc 329761 agtcgaacac gcggccgacg aattcgtcct tgtcggtgaa gctttcgtag aagtagcgca 329821 ccgacaggcc ggcccgccgg caaatggtgc ggatggttag ctcggcgatg tcgtgctggt 329881 cggaccccaa caggtccagg ccggcagaga gcaactggcg acggcgcgtc gccagtcgct 329941 cggcggcctc gacgccgcgg tagggtcgat cactgcgcgt catacggatc atcttgacac 330001 tcgggcacga taccggccaa tatcaggata caggtgtttc cataattagc ggcagcgccg 330061 ggaggccttc ggatggcgat ttcgctggtg gctcaccagc ccatccccca cgtcgagcgt 330121 cccatggccg acccaccccg tctccagctg gccaggcgcc ggcgatcggc ggccggcccc 330181 ggcggtaacg aggacagctt gatgggagtg gcgctgctag ccggcccggc caacgtgatc 330241 atggagttgg cgatgccggg tgtcggctac ggcgtgttgg agagccgtgt cgaaagcggc 330301 cggctggacc gccatccgat caagcgggcg cgcaccacct ttacctacgt tgcggtggcc 330361 gttgccggca gcgacgacca gaaggcggcc tttcgtcgcg cggtgaataa ggttcacgcg 330421 caggtgtatt cgactccgga gagcccggtg tcctaccacg cgttcgatcc cgaactacag 330481 ctgtgggtgg cggcatgcct ctataagggc ggcgtcgacg tctaccgcac cttcgtcggc 330541 gagatggacg acgaagaggc cgaccatcat taccgcgcgg gcatggcgat gggcaccacg 330601 ttgcaggtgc cgccgcagat gtggccaccg gatcgggcgg ccttcgaccg ctactggcgg 330661 caatcactgg acagggtgca catcgatgac gtcgttcgcg actacctgta tccgatcgtg 330721 gcgctccgaa ttcgcgggat cgcactgccg ggtccgctgc ggcggctgtc ggagggtatc 330781 gcgctgctga tcaccaccgg tttcctgccg cagcggtttc gcgacgagat gcggttgccg 330841 tgggacgcga ccaagcagcg gcgctttgac gcgctcatgg ccgtgctgcg cacggtgaat 330901 cgcctgatgc cgcggtttgt ccgggagttc ccgttcaacc tgatgctctg ggacctggac 330961 cggcggatga ggcgcgggcg cccgctggtg taatcgccgg cttcgcgtgg accgttgccg 331021 gtagaccgct cgctagattg gcgggcgaat atggcgcaca gaggcaaacc gggcgaaatc 331081 cctatccagg ctcaccacgg cgcagtgatg ctccacggcg atggccccga gtaccgcgtc 331141 aggtatcaag tcgcccgatg cgtcggcctc gtcgcagagt tttcgcagca gcaccaggtg 331201 tctggggccg gggcttgtcg gaaggtgatg gggctgggcg ttgacggctt cgacgaatgc 331261 gaatgcatcc gctcgtggtg acggaatctc gaagatgcgt cgattcgttg ttagccggag 331321 gaacgacgcc cacactaggt tcggcactgt gaaggggtcg tcggccgcaa gcagtcgatc 331381 gaaccagggg cggacggttc ggtgattcgg atggtcaccg cggtgtgcag ccagcagcac 331441 gttgacgtcg atgaggaaca tcgcctattt gtgcctgtcc aggctcactt ccgcgagttc 331501 agttccagac cctcgtcgag cacttcggac aacaccctat tcgaggttag gtcgatacct 331561 ggccgcggac cggtgccggc gtcaaaaacg gggacggttg gccgggcgcc gccggtgcgg 331621 gcggcggcga gctcccgccg aagggcgtct tcgatcacag cgcccagcga ttaaccacgc 331681 tcgcgggccc ggcgtttggc ggtagccagt agttcatcgg agattgacac ggtggtgcgc 331741 atgatgctca ggatagcgca tctacggcat catctgcggt gagcaactga tgccctcaac 331801 gccgcgtgtg gtcgcaggtc tgcctgctat ggcaagccgt tgagtccgtt ctcgccgagc 331861 agcagcccgc cggtgccgcc ggcaccgggc gtggccccgg ctttgccggc gttgccgccg 331921 ttgccgccgt tgccgatcag cacggcgttg ccgccgacac caccgctgcc gccggtaccg 331981 gcgccaaacc cgccggcacc cccgtcaccg ccgttgccga acaccccggc gtggccaccg 332041 tcaccgccgg tgccgccggt accggcgcct agagcgttgg caccgctgcc gccggcgccg 332101 ccggcgccgg cggagccgaa gagcaagccg ccgttcccgc cggcgccgcc ggcgccgcct 332161 tgctggatgc tggtaagtgc tgccccgccg tgcccgccgg cgccgccggc gccgccgaag 332221 ccgaagagta aggcgccgtt cccgccggtt ccgccggccc cgccggcaag ggagctggcg 332281 ccaccgctgc cgccggcgcc accggaggcg ccgagggaga gtaggccggc gttgccgccg 332341 tgcccgccgc cgccggtggt gatcccggac cctcccgagc cggcggcgcc gccggtgccg 332401 ccggctccga acagtccgcc gttcccgccg ttcccaccgg ccccgaagtt cgtgccggcc 332461 ccgccggtgc cgccagttcc gaacagtccg ccgttcccgc cgttcccgcc ggctgcgttg 332521 aacccgccgg cccctccggc tccgccgttg gcgaacagtc cgccgttgcc gccggcgccg 332581 ccgacgccgg ccgggacacc gccagcggcg ccgtggccgc cggtgccggc cgcgccgaag 332641 agcaaaccgg cgtcgccgcc gcgcccgccg gccccgccga tgccagcgac gcctatggag 332701 ttcccaccgt tgccgccggt gccgccggag ccgatcagca aggagacccc accggcgccg 332761 ccggccccgc cgatccctcc agcaccggtg gctatcccgc cggtcccgcc attgccaccg 332821 gtaccgaaca agatcccgcc ggccccgccg gccccgcccg tagccgtggc ggcggtgttg 332881 gtcgcaccgt gcccgccgtt accgccgttg ccgaacaacc acccgccggc cccgccggca 332941 gccccggtcc ccggggtccc gttggcgccg ttgccgatca gcggacgccc agtcaatgtc 333001 tggaagggct cattcaccac attgagcgct gcctgctgca gggtatggac tgaggtgctg 333061 actggagcat tggatccgtc tagtcctatt agctgcccgc cgaggccgcc gaggccggcc 333121 ttgcccgcgg ctgcgccggt cccgccgctg ccgccgttgc cgccgttgcc gatctgcacg 333181 ccgttaccgc ccgctccgcc gttgccgccg gtgccggtca cgctggcgcc gccgtcaccg 333241 ccgttgccgc cgttgccgat caaaccgggt atgccaccgg ccccgccggt gccgcctgga 333301 ccggtgatag cagcgccgcc ggcgccgccg gcgccgccgg agccgttgag caggccggcg 333361 ttgccgccgg tgcctcctat cccgccggtg gtgcggccga acccgccggc gccgccggcg 333421 ccgccgtccc cgccggtggt agcgccgccc ccaccactgc cgccggtgcc gccggcgccg 333481 aacagtccgc cggccccgcc ggccccgccg gccccgccca tcccggcgac accggcgccg 333541 atggtgccgc cggccccgcc ggtgccgccg gcgccgaaca gtccgccgtt tccgccggcc 333601 ccgccggccc cgccgaaccc gccttttccg cccaccccac cgaccccgcc gtcggtgaac 333661 atcccgccgg ccccgcccgc accgccggcc ccgccggagg cggtggtggc caatgcgaac 333721 ccgccggcgc cgccgacccc ggcggcgccg aagagcaggc cggcgttgcc gccgtccccg 333781 cccgcaccac cggccccgcc ggtggtgttc gttgaggctc cgccggctcc gccggcgccg 333841 aacaacacgg cgttcccgcc ggtaccgccg gccccgccga tcccgccggt gttggtggcg 333901 accccgccgg cgccgcccgc cccgccgctg ccgagcagcc cactggcccc gccggccccg 333961 ccagtcccgc cggtgccccc aacggcgttg ttcgccgcgc cggatccgcc ggcgccgccg 334021 ttgccgatca accagccgcc gggcgcgccg tcggccccgg tgccgggggc gccgttggcg 334081 ccgttgccga tcagcgggcg cccggtattc gccaggaaga actcgttgat cggggcgagc 334141 agcggcgagg tggcggcggc ctcggcggcg gcgtacgcgc cgccaccgga ggtcaacgcc 334201 tgcgtgaact gggcatgaaa cgcctgcgct tgggcgctga gcgcctgata ggcctggccg 334261 tgggcgccga acagcgccgc aaccgccgtc gagacttcat cggcacccgc ggccagcagt 334321 gctgtggtgt tggccgccgc ggccgcgttg gccgcggcga tgctcgactc gagactggct 334381 aaatccgttg ccgctgccgc gataacctct ggcgccgcaa tcacaaacga catctgacac 334441 ctcccgacgg gcatcaccgc tctgtcgcgc cgacctgggc ctccccggtc accagtgaaa 334501 atcggctgta agaagcatcc catttcgagg ggcaacacct ggggggtttg tcgaattctg 334561 gtaggtaact tcgcgggccg ggtaaacccg gtgcgtcctg tcaggtgagt cgagggcagc 334621 ccgcaggctg atgctggggt atctgggccg gccgaccatg gctggccggc gggtgttctg 334681 agggccggtt cgctggctat ggcaggccgt tgagcccgtt ctcgccgatc agcagcccgc 334741 tggtgccgcc cgcgcccggc gtgccgccgg ctttcccgcc gttgccgccg ttgccgccgt 334801 tgccgatcag cacggcgttg ccgccgacac cgccattgcc gccggtaccg gcgccaaacc 334861 cgccggcacc gccgtcgccg ccgttgccga acaccccggc cgtaccgccg ttgccgccga 334921 ccccgccctt ggcggcaccg ctggcgccgc cggtcccgcc ggagccgccg gagccggaga 334981 gcatgccggc gttgccgccg gccccgccgg ccccaccagc gccgatgttg ctgtacccgc 335041 cggcgccgcc ggcgccgccg tagccgaaga gcaagccgcc ggctccgccg accccaccga 335101 tcccgccgct gtcggtcaag ctggagccgc cgctaccgcc ggcgccaccc gaagcgccca 335161 gggagaacaa gcccgcgttg ccgccggccc cgccggcgcc gccggggacc gcggcggggg 335221 taccgtggcc gccggagccg ccggtgccgc cggcgccgaa cagcccaccg ttcccgccgg 335281 caccaccggc ccccgggccg agggagccgc cggcgccgcc ggtgccgccg gcgccgaaca 335341 gcccgccgtt cccgccggcc ccgcctgtcc ccgcatctcc gcccaacccg ccgctcccgc 335401 cggccccgcc gttggcaaac agcccgccgt tcccgccggt gccgccggtg ccgccggctt 335461 tagtgctggc gccggccccg ccggttccgg cggcgccgaa gaacgatccg gcgttaccgc 335521 cgtccccgcc gtccccaccg gcggtgaggc cggcgccgcc gctgccgccg ttaccgccgg 335581 agcccaccaa gaaggccgcc ccaccggcgc cgccgtcccc gccggtgccc gtcgtgccga 335641 cgccgccggc cccgccggtg ccgccggtgc cgaagaagat cccgccggcg ccgccgtccc 335701 caccgtcccc gccgttggtt ccggtcgccc cggacccgcc gttgccgccg ttgccgaaca 335761 gccacccgcc ggccccgccg tcagccccgg ttccaggagt cccgttggcg ccgttgccga 335821 tcagcgggcg gccggtgagc gtctggaagg gctcgttcac cacattgagc acattttgct 335881 gcagggtgtg cagtggcgag gtgctcgcgg gagcattgaa tccgtctaga ccgagcagca 335941 gcccgctgac gccgcccact ccggccttgc ccgcgccaat cccaccgcta ccgccgttac 336001 cgccattgcc gatcaacacg ccggtgccgc cgatcccgcc gttgccgccg gtcaccgcgc 336061 tggcgccacc gttaccgccg ttgccgccgt taccgatcag cccgggggtg ccgccagccc 336121 caccgatccc gccggcgaag ccctggccaa ctccgccgtt gccgccggcg ccgccggagc 336181 cgaagaccgt gccggcgttg cccccggggc cgccttgccc gccgtcggcg aagccgaatc 336241 cgccggcgcc gccggagccg ccggagccga agagcagccc agcgttgccg ccggcgccgc 336301 cggcgccgcc tatgccgccg gccgtgagag taccgccgtc cccaccgatt ccgccggcgc 336361 cgcccgcggc gccgagggcg agcatgccgg cattgccgcc ggccccgccg tccccgccgg 336421 cgaccaggct gtgtccgccg ctgccgcctt ccccgcctgc gccgaacagc ccgccggccc 336481 cgccggcccc gccgactccg ccgaagctgc tgtcggcgaa cccgccatgc ccgccggtgc 336541 cgccggcgcc gaacagcccg ccagcgccac cggccccacc ggccccgccg gagctgccgg 336601 ccccaccgga tccgccgacc ccgccggtgg cgaacagccc gccggccccg ccggcgccgc 336661 ccgccccgcc gagtgcactg ccgttcgtga atccgccggc cccgcccgtg agggctacta 336721 cgccgccgcc ggcgccgccg gcgccgccgg cgccgaacag catggcgttg ccgccggctc 336781 cgccggaccc gccgatccca ctgctggcga ccccgccagc gccgccggcg ccgccgttgc 336841 cgatgagccc gccggcgccg ccgttgccgc cggcgccgcc gttgacgccg gccgcgccgg 336901 atcctccggc gccgccgttg ccgattaacc agccgccgtc cccgccattg gccccggtgc 336961 cgggggcgcc gttggcgccg ttgccgatca acgggcgccc ggtattcgcc aggaagaact 337021 cgttgatcgg atccagcagc ggcgacaccg cggcggcctc ggcggccgca taggcgccgc 337081 caccggaggt caatgcctgc acgaactggg catgaaacgc ctgcgcttgg gcgctgagcg 337141 cctgataggc ctggccgtgg gcgccgaaca gcgcggcgat ggctgtcgac acctcgtcgg 337201 cgcccgcggc catcagtgcc gtggtgttgg ccgccgcggc tgcgtttgcc gcgctgatgc 337261 tcgatccgag actggccaaa tccgttgccg ctgccgcgat aacctctggc gccgcaatca 337321 caaacgacat ctgacacctc ccaatacgca tgaccgctct gtcatgccga cccggggaac 337381 gtcaccagca aaaatcggca gtaagaagca tcccatttcc agcgacaaca cctggggggt 337441 tttggtcaaa ctctggtaag cgacttcgtg taccgggtga acccggtgtg tcttgaagga 337501 cagcccgcag gctgatgctg ggggatctgg gccggccgac catggctggc cggctgttgg 337561 tctgatggcc ggttcgcggt tacaggccgt tgagcccgtt ctcgccgatg atcagcccgc 337621 tggtgccgcc ggcgccgggt gtgccgccgg ctttcccgcc gttgccgccg ttgccgccgt 337681 tgccgatcag cacggcgttg ccgccggtgc cgccggtgcc gccggcaccg gtgccaaacc 337741 cgccggcgcc gccgtcgccg ccgttgccga acaccccggc cgtaccgccg tcaccgccgg 337801 tgccgccgct gctgccgatg ccgctggagc caccggtgcc gccggcaccg ccgaagccga 337861 agagcgagcc gccactgccg ccgttcccgc cgaccccgcc ggtcccgccg acatttaagg 337921 cgctgccgcc gctgccgccg gcgccgccgg aggcgccgag ggcgagtagg ccggcgttgc 337981 cgccgctgcc gccgttgccg ccgaaggtgc cgccgctgct gccgccagca ccgccagtgc 338041 cgccggcgcc gaacagcccg ccgtgccccc cggcgccgcc gtcggcgccg agcgtgcccg 338101 ccccgccggt gccgccggcg ccgaagagca atccgttccc cccggtcccg ccattcgcgc 338161 caaacccgcc ggccccgccg ttggcgaaca gcccaccggt accaccggct ccggcggtgc 338221 cgccggcacc gataaagttt tgggagaggg cggcctggcc gccggtccct gcggcaccga 338281 ggaacaagcc ggcgtcaccg ccgcgcccgc cggccccgcc ggtgtccagg ccaaacccgc 338341 cgctgccgcc ggtgccgccg gagccgatca gcaaggcggc tccgccggtc ccgccggtcc 338401 cgccttggcc cgtcgttccg atgccgccgg acccgccggt gccgccaata cctgacagga 338461 ttccgccggc cccgccggat ccgccgtctc cgccgtcggc gccggtcgct ccgtggccgc 338521 cgttgccgcc gttgccgaac aaccacccgc cggccccacc gtcggccccg gtccccggag 338581 tgccgttggc gccgttgccg atcagcggtc gcccggtgag ggcttgggtg ggctcgttga 338641 tcgcgttgag gatttgttgc tgcagggtgt gcagtggcgt gctggcgggg gcgttgaatc 338701 cgtctcgacc tagtagctgc ccgcctaagc cgccggcgcc ggccgtgccg gcgggtgcgc 338761 cagtgccgcc actaccaccg ttaccgccat tgccgatcag cacgccgctt ccgccggcgc 338821 cgccggcggc accggcgccg ttcgcgctgg cgccgccgtt gccgccgttg ccggcgttgc 338881 cgacgagccc gggcgcgccg ccggccccac cggttccgcc ggcgcccgcg aaggacccgc 338941 cgccggcgcc gccggcaccg cccgccccgc tgagcagacc cgcctttccg ccggcgccgc 339001 ccgccccgcc ggcgtcgaag cccagcccgc cgacgccgcc ggcgccgccg gagccgaaca 339061 acgtgccgcc gtcgcctccg atcccaccgg caccgccgcc accgtccggg ctggatccgc 339121 cgctgccgcc ggcgccgccg gcggcaccga ggctgagcat gccggcgtcg ccgccggccc 339181 caccgttccc gccgacgttg attatgctcg tcccgccact accgccggtg ccgccggcgc 339241 cgaacagccc gccagagccg ccatccccgc cggcgccgcc gccaaagatg ccgaatccgc 339301 cgggcccacc ggtgccgccg gcgccgaaca gcccgccgtt tccgccggat ccgccggccc 339361 cgccggtgcc ggcgtcggtt gccccgccgg cgccaccgac cccgccgtcg gcgaacagcc 339421 cgccgtttcc gccggccccg ccggcgccgc cggtggcgcc gaaggcggct gcgaatccgc 339481 cgggaccgcc gaccccggcg gccccgaaca gcatgccggc ggccccgccg gcgccgccgg 339541 tccccgtgct ggccctcccg ccggcgccgc cggcgccgcc gttgccgatg agcccgccgg 339601 cgccgccgtt gccgccggcc ccgccgttga cgccggccgc gccggatcct ccggcgccgc 339661 cgttgccgat caaccagccg ccgggcgcgc cgtcggcccc ggtgccgggg gcgccgttgg 339721 cgccgttgcc gatcagcggg cgcccggtat tcgccaggaa gaactcgttg atcggggcga 339781 gcagcggcga ggtggcggcg gcctcggcgg cggcgtacgc gccgccaccg gaggtcaacg 339841 cctgcacgaa ctgggcatga aacgcctgcg cttgggcgct gagcgcctga taggcctggc 339901 cgtgggcgcc gaacagcgcc gcaaccgccg tcgagacttc atcggcaccc gcggccagca 339961 gtgctgtggt gttggccgcc gcggccgcgt tggccgcggc gatgctcgac tcgagactgg 340021 ctaaatccgt tgccgctgcc gcgataacct ctggcgccgc aatcacaaac gacatctgac 340081 acctcccgac gggcatcacc gctctgtcat gccgacccgg ggaacgtcac cagcaaaaat 340141 cggcgggcta cagaataact ccggcccggg aaagggattt ggtatttccc aaaatatctc 340201 ccacatttat gcggtcggcg cgtcggccga cgggagctgg cagcacccgt gggccggcgc 340261 cgagcgttcg ctggtgtccg gctgggactt gcattgcggc gcgccgtggt gtggaatagt 340321 ggtaatgaaa atcatgttca tcagtcctct gtggtgttta cggctatgac gctgtggatg 340381 gcctcgccgc ccgaggtgca ttcggcgttg ctcagcagcg ggccggggcc gggctcggtg 340441 ttgtcggcgg ccggggtgtg gtcgtcgctg agcgccgaat acgccgcggt cgccgacgag 340501 ctcatagggc tgctgggcgc cgtgcagacc ggcgcttggc aggggcccag cgccgcggct 340561 tatgtggccg cccacgcgcc gtacctcgcg tggttaatgc gggccagcga aaccagcgcg 340621 gaagcggccg cccggcacga gaccgtggcc gcggcctaca cgaccgcggt ggcggccatg 340681 ccgacgttgg tcgagctggc cgccaaccac acgcttcacg gggtcttggt ggcgacgaac 340741 ttcttcggca tcaacaccat cccgatcgcg ctcaacgagg ccgactacgc gcggatgtgg 340801 acgcaggccg ccagcacgat ggcgacctat caagcggtcg ccgaggccgc ggtggcgtcg 340861 gcaccgcaga ccaccccggc gccgccgatc ttggcagccg aagcggccga cgatgaccac 340921 gatcatgacc acgatcacgg gggcgaaccg accccgctgg actatctggt cgcggagata 340981 ttgcgcatca tcagcggtgg gcgcctgatc tgggatcccg ccgagggcac catgaacgga 341041 atcccgttcg aagattatac ggacgcagcc caaccaatct ggtgggttgt tcgtgccatc 341101 gaattcagta aggactttga aacgtttgtt caggaactgt ttgtcaatcc ggtggaggca 341161 tttcagttct actttgagct tctattgttc gactacccga cccacattgt gcagattgtt 341221 gaggcgttga gccagtcccc gcagttgctg gcggtcgcac tcggttccgt catctccaac 341281 ttgggtgcgg tgaccgggtt cgccgggcta tccggcttgg ccggcatgca gccggcggct 341341 atcccggcgc tagcacccgt cgcggcggcc ccgccgacat tgccggcggt cgcgatggcc 341401 ccgaccatgg ccgcgccggg cgcggcggtt gcgtcggcag ccgcgccggc gtccgcgccg 341461 gcggccagca cggtggccag cgccacgccg gcaccgccgc cggcacccgg cgccgccggg 341521 ttcggctatc cctacgccat cgctccgccc ggcatcgggt tcggctcggg gatgagcgcc 341581 agcgccagcg ctcaacgcaa ggcaccacag cccgatagtg cggcggcggc ggcggccgcg 341641 gcggccgtac gtgaccaagc gcgggcgcgg cggcggcgcc gtgtcacgcg gcgcggatac 341701 ggcgacgagt ttatggatat gaacatcgac gtcgatccgg actggggccc tccgcccggc 341761 gaagacccag tcacatccac ggtggcctcg gatcggggtg ccggacatct gggctttgcc 341821 gggacggccc gcagggaggc ggttgccgac gcggccggga tgaccacgct ggctggcgat 341881 gatttcggcg acgggccaac gacgccaatg gtgccgggtt cgtgggatcc ggaccgggat 341941 gcgcctggct cggcggagcc tggagatcgg ggctgagcta gccgcgtagg gtcgattggg 342001 tgcgtaccga aggtgatagc tgggacatca caacgagtgt cggttcgacc gcgctgtttg 342061 tcgcgacggc gcgagcgctg gaagcccaga agtccgaccc gctggtcgtc gacccatatg 342121 cggaggcgtt ctgccgtgcc gtcggcggtt cgtgggccga tgtgctcgac ggcaagcttc 342181 ccgaccacaa gttgaagagc accgatttcg gcgagcactt cgtcaacttc cagggtgccc 342241 gcaccaagta tttcgacgag tatttccgtc gggccgccgc cgccggcgcg cggcaggtgg 342301 tcatcctggc ggcggggctg gactcgcgcg cgtaccggct gccttggccc gacgggacca 342361 cggtttttga gctggaccgc ccgcaggtcc ttgatttcaa gcgcgaggtg ctcgccagcc 342421 acggtgccca accgcgcgcc ctgcgccgcg agatcgccgt cgacctgcgt gacgattggc 342481 cacaagcctt gcgggacagt ggtttcgatg cggctgcacc gtcggcatgg attgccgaag 342541 ggctgctgat ctatctcccg gccaccgccc aggagcggct attcaccggc atcgatgccc 342601 tggccgggcg ccgaagccac gtcgccgtcg aggatggtgc cccaatgggg ccagacgaat 342661 atgcggctaa ggtcgaagag gagcgcgccg cgatcgccga gggagccgag gagcacccgt 342721 tttttcaact ggtctacaac gagcgatgcg cgccggccgc cgagtggttc ggcgagcgag 342781 gttggaccgc ggtcgctacg ctgttgaacg actacctcga agcggtgggt cgcccggtac 342841 ccggaccgga atccgaagcc gggccgatgt tcgcccgcaa caccctggtc agtgccgccc 342901 gcgtctgacg gcgcaccgtt cgcgctgccg gcaccccggg ctccataatg aaaatcatgt 342961 tcagtaagct acactctgca tatcgggcta ccaacgaaat ggagtatcgg tcatgatctt 343021 gccagccgtg cctaaaagct tggccgcagg gccgagtcga ttggtcgcgg tcgcctcgac 343081 agttagctta tgcaatgcta acttcggggc aaagttcagg cggatcggcc gatggcgggc 343141 gtaggtgcag gagacagcgg aggcgtggag cgtgatgaca ttggcatggt ggccgcttcc 343201 cccgtcgcgt ctcgggtaaa tggcaaggta gacgctgacg tcgtcggtcg atttgccacc 343261 tgctgccgtg ccctgggcat cgcggtttac cagcgtaaac gtccgccgga cctggctgcc 343321 gcccggtctg gtttcgccgc gctgacccgc gtcgcccatg accagtgcga cgcctggacc 343381 gggctggccg ctgccggcga ccagtccatc ggggtgctgg aagccgcctc gcgcacggcg 343441 accacggctg gtgtgttgca gcggcaggtg gaactggccg ataacgcctt gggcttcctg 343501 tacgacaccg ggctgtacct gcgttttcgt gccaccggac ctgacgattt ccacctcgcg 343561 tatgccgctg cgttggcttc gacgggcggg ccggaggagt ttgccaaggc caatcacgtg 343621 gtgtccggta tcaccgagcg ccgcgccggc tggcgtgccg cccgttggct cgccgtggtc 343681 atcaactacc gcgccgagcg ctggtcggat gtcgtgaagc tgctcactcc gatggttaat 343741 gatcccgacc tcgacgaggc cttttcgcac gcggccaaga tcaccctggg caccgcactg 343801 gcccgactgg gcatgtttgc cccggcgctg tcttatctgg aggaacccga cggtcctgtc 343861 gcggtcgctg ctgtcgacgg tgcactggcc aaagcgctgg tgctgcgcgc gcatgtggat 343921 gaggagtcgg ccagcgaagt gctgcaggac ttgtatgcgg ctcaccccga aaacgaacag 343981 gtcgagcagg cgctgtcgga taccagcttc gggatcgtca ccaccacagc cgggcggatc 344041 gaggcccgca ccgatccgtg ggatccggcg accgagcccg gcgcggagga tttcgtcgat 344101 cccgcggccc acgaacgcaa ggccgcgctg ctgcacgagg ccgaactcca actcgccgag 344161 ttcatcggcc tcgacgaggt caaacgccag gtgtcgcggc tgaagagctc agtggccatg 344221 gaactggtcc gcaagcagcg tgggctcacg gtcgcccaac gcacgcacca cttggtgttt 344281 gccggaccgc ccgggaccgg caagaccacc attgcccggg tggtcgccaa gatctattgc 344341 ggccttggct tgttgaagcg ggagaacatc cgcgaggtcc atcgcgccga cctcatcggc 344401 caacacatcg gcgagaccga ggcgaaaacc aacgcgatca tcgacagcgc gctggacggg 344461 gtgctgttcc tcgacgaggc ctacgccctg gtggccaccg gcgccaagaa cgacttcggg 344521 ttggtggcca ttgacacctt gttggccagg atggaaaacg accgcgaccg gctggtggtc 344581 atcatcgccg gctatcgcgc cgacctggac aaattcctgg acaccaacga gggacttcgg 344641 tcgcgtttca cccgcaacat cgactttccc tcctacacgt cccatgagct ggtggagatc 344701 gcgcacaaga tggccgaaca gcgagacagc gtcttcgaac agtccgcgct gcacgatttg 344761 gaggcgttgt tcgccaagtt ggcggcggag tcgacaccag ataccaacgg aatctcgcga 344821 cgtagcctcg acatcgcggg caatggtcgg tttgtgcgca acatcgtcga acgctccgaa 344881 gaagagcgtg aattccggct ggaccattcc gaacatgccg gatccggtga gttcagcgac 344941 gaggagctga tgaccatcac ggccgacgac gtgggtagat cggtagagcc gctattgcgt 345001 ggcctcgggc tctcggtgcg ggcatgacga accagcagca cgaccacgac ttcgaccacg 345061 accgtcgctc gttcgcctcc cgaaccccgg tcaacaacaa ccccgacaag gttgtctacc 345121 gccgcggctt cgtcacccgc catcaggtga cgggctggcg gttcgtgatg cgccgaatcg 345181 ccgccggaat cgcattgcac gacacccgca tgctggtcga cccgttgcgc actcagtcac 345241 gcgcggtgct gatgggtgtg ctgattgtga tcacggggtt gatcggctcc ttcgtattct 345301 cgttgattcg gcccaatggg caggcgggta gcaacgcggt gcttgccgac cggtccaccg 345361 cggcgctgta tgtgcgggtg ggcgagcagc tgcacccggt gctcaacctg acctcggccc 345421 ggctgatcgt cggccggccg gtgagcccga cgacggtgaa aagtactgag ttggaccagt 345481 ttccgcgcgg aaacctgatc ggcatcccgg gtgcgccgga gcggatggtg cagaacacct 345541 ccaccgacgc gaactggacg gtgtgtgacg gcctcaacgc accgtcgcgg ggcggtgcgg 345601 atggcgtggg tgtgacggtg attgccggcc cgctggagga caccggcgca cgcgcggccg 345661 cgctcgggcc cgggcaggcg gtgctggtcg acagcggcgc cggcacctgg ctgttgtggg 345721 acggcaagcg cagcccgatt gatctggccg atcatgcggt caccagcggc ctcggcctgg 345781 gcgccgacgt gcccgcgccg cggatcatcg cctcggggct gttcaacgcg atacccgaag 345841 caccgccact gacggcgccg atcatcccgg atgccggcaa cccggcgagc ttcggtgtgc 345901 cggcgccgat cggcgcggtg gtgagttcct acgccctgaa agactcgggc aagaccatat 345961 cggacaccgt gcagtactac gcggtgctgc cggacggttt gcagcagatt tcgccggtat 346021 tggcggcaat cctgcgcaac aacaactcct atggtctgca gcagccgcct cggctggggg 346081 ccgacgaggt cgccaagctg ccggtgtcgc gggtgttgga caccaggcgc tatcccagcg 346141 agccggtaag tctcgtcgac gttacccgtg accccgtcac ctgcgcgtac tggagcaagc 346201 cggtgggtgc ggccaccagc tcgttgactc tgttggcagg ctcggcgctg ccggtgccag 346261 atgcggtgca caccgtcgag ctggtcggcg ccggcaacgg tggtgtggca acccgagtgg 346321 cgttagcggc cggtactggc tacttcaccc agacggtggg cggcggccca gatgcgccgg 346381 gcgccgggtc gttgttctgg gtgtcggata ccggggtgcg ttacggtatc gacaatgagc 346441 ctcagggagt ggctggaggc ggcaaagcgg ttgaggccct tggcctgaac ccgcccccgg 346501 tccccatccc gtggtcggtg ctgtcgctgt ttgtgcccgg cccgacgctg tcgcgtgccg 346561 acgcgctgct ggcacacgac accttggtgc ccgacagcag gcccgctcgt ccggtatcgg 346621 ccgagggagg gtaccggtga gcagactgat ctttgaggct cgtcgccgac tggcgccgcc 346681 gagcagccac cagggcacca tcatcatcga ggcgcctccc gagctgcctc gggtgatccc 346741 accgtcactg ctacgacgag cgctgcctta tctgatcggg atcctcatcg tggggatgat 346801 cgtggcgctg gtcgccaccg ggatgcgggt gatttctccg cagacgttgt tcttcccatt 346861 tgtgctgctg ttggcggcca ccgcgctcta ccgcggcaac gacaagaaga tgcgcaccga 346921 ggaggtcgac gccgaacggg ccgactacct acgttaccta tcggtggtgc gggacaacat 346981 tcgggcccag gccgccgagc agcgggccag cgcgttgtgg tctcatcctg acccgacggc 347041 gttggcgtcg gtgccggggt cacgtcgcca atgggagcgt gacccgcacg accccgactt 347101 tttggtgttg cgggccggcc ggcacacggt accgctggct actacgctgc gagtcaacga 347161 caccgccgac gagatcgacc tggaaccggt gtcgcacagt gcattacgca gcctgctcga 347221 cacccagcgc agcattggcg acgtgccgac cgggatcgac ctgaccaagg tttcgcggat 347281 caccgtgctg ggggagcgcg cacaggtgcg cgcggtgtta cgcgcctgga tcgctcaggc 347341 ggtgacctgg cacgacccga cggtgctcgg ggtggcgctg gccgcgcgtg atctggaggg 347401 tcgcgattgg aactggctga agtggttacc gcacgtggac attcccggcc gcctcgatgc 347461 gctgggcccg gcccgcaatc tgtcgaccga tcccgacgag ctcatcgcgc tgctggggcc 347521 cgtcctggca gaccgcccgg cgtttaccgg gcagccaaca gatgcgttgc ggcacttgct 347581 gatcgtcgtc gatgacccgg actacgacct gggcgcatcg ccgctggcgg tgggccgcgc 347641 gggtgtcacc gtcgtgcact gctcggccag tgcgccgcac cgggaacagt attcggatcc 347701 ggaaaagccg atcctgcggg tggctcacgg cgctatcgaa cgctggcaga caggcggctg 347761 gcagccctac atcgacgccg ccgaccaatt cagcgctgat gaggccgccc acctggcgcg 347821 ccgactgtcg cggtgggact ccaaccccac ccatgccggg ctgcgctcgg cggccactcg 347881 cggcgcgagt ttcaccacac tgctgggcat cgaggacgca tcccgactgg atgtgcccgc 347941 gctgtgggcg ccgcgacgac gcgacgagga gttacgcgtg ccgatcggtg tcactggcac 348001 cggcgagccg ctgatgttcg acctcaaaga cgaagccgag ggcgggatgg gcccgcacgg 348061 gctgatgatc ggcatgaccg gttcgggcaa gtcgcagact ttgatgtcga ttctgttgtc 348121 gctgttgacc acacactccg cggagcggct catcgtcatc tacgccgact tcaagggtga 348181 ggccggcgcc gacagtttcc gagatttccc gcaggtggtt gcggtgatct cgaatatggc 348241 cgagaagaag tcgttggctg atcggttcgc cgacacgctg cgcggcgagg tggctcgtcg 348301 cgagatgctg ctgcgtgagg ccggccgcaa ggtccagggc agcgcgttca actcggtgct 348361 cgagtatgaa aacgccatcg ccgcagggca tagcctgccg cccatcccga cactgttcgt 348421 ggtcgccgac gagttcacct tgatgctggc cgatcacccg gaatacgcgg agctgttcga 348481 ctatgtggcc cgcaagggtc gctcgtttcg catccacatc ctattcgcgt cccagacact 348541 ggacgtgggc aagatcaaag acatcgacaa gaacaccgcc tatcggattg ggctgaaagt 348601 ggccagcccc agcgtttctc gccagatcat cggcgtggag gacgcctacc acatcgagtc 348661 gggcaaagaa cacaaaggcg tgggcttttt ggtgcccgcg cccggtgcca ccccgataag 348721 gttccgcagc acctatgtcg acgggatcta tgaaccgccg cagacggcta aagccgttgt 348781 cgtgcaatcc gttccggagc ccaagctgtt caccgccgcc gcggtggaac cggatccggg 348841 cacggtgatc gccgatactg acgaacaaga acccgccgac ccaccacgca aactgatcgc 348901 gaccatcggc gaacaactgg cccgctacgg tccgcgggcg ccgcagttgt ggctgccgcc 348961 actcgacgaa acgatcccac tgagcgcggc gttggcccgc gccggggtgg gcccccggca 349021 gtggcgctgg ccgctggggg agatcgacag gcccttcgag atgcggcgcg acccgttggt 349081 gtttgacgct aggtcgtcgg ccggaaatat ggtgatccac ggcggcccca agtccggcaa 349141 atccactgcg ctgcagacat tcatcctctc agctgctagc ctgcactcgc cgcacgaggt 349201 tagcttctat tgcctggact acggcggtgg gcagctgcgg gcgctacagg atctagcgca 349261 cgtcggcagt gtcgcctcag cgctggaacc cgaacgcatc cgccgcacct tcggcgagct 349321 cgagcaactg ctgttgtccc ggcagcagcg ggaagtattc cgtgaccggg gtgctaatgg 349381 ctcgaccccc gacgacgggt tcggtgaggt gttcctggtc atcgacaatc tctatggctt 349441 cggccgcgat aacaccgatc agttcaacac ccgtaatccg ttgctggcca gggtaaccga 349501 actggtcaac gtgggccttg cctacgggat ccacgtgatc attaccacgc cgagctggct 349561 ggaagtgccg ttggcgatgc gcgacgggct cgggctgcgt ctcgagctgc gactgcacga 349621 cgcgcgcgac agcaacgtgc gggtggtcgg cgccctgcgc cgcccggccg acgccgtccc 349681 gcacgaccag cccggccgcg gactgaccat ggccgccgag cacttcctgt tcgcggctcc 349741 agaactggac gcgcaaacaa acccggtggc cgcgatcaac gcccgctacc ccggcatggc 349801 ggctcccccg gttcggttgt tgcccaccaa ccttgcgccg cacgccgtcg gcgaactgta 349861 tcggggtccc gaccaactgg tgattggcca gcgcgaagaa gacctggcgc cggtgatact 349921 cgacctcgcc gccaacccgc tgctgatggt gttcggcgat gccaggtcag gaaagacgac 349981 gctgctgcgc cacatcatcc gcaccgtccg cgagcactcc accgccgacc gggtcgcgtt 350041 caccgtgctg gaccgccggc tacacctggt cgacgaacca ctgttccccg acaacgagta 350101 caccgccaac atcgatcgga tcatcccggc gatgctcggg ctggccaacc tcatcgaggc 350161 gcgccggccg ccggccggga tgtctgcggc cgagctgtcc cgctggacct ttgccgggca 350221 cacccactac ctgatcatcg acgacgtcga ccaggtaccg gattcgccgg cgatgaccgg 350281 tccctacatc ggacagcggc cgtggacccc gctgatcggt ctcctggccc aggccggcga 350341 cttggggcta cgggtgattg tcaccgggcg tgccactgga tcggcgcacc tgctgatgac 350401 aagtccgttg ctgcgccggt tcaacgacct gcaggcgacc acgctgatgt tggcaggcaa 350461 tccggccgac agcggcaaga ttcgcggtga gcggtttgcc cgattgcctg ctggacgagc 350521 aattctgttg accgacagtg atagtccaac ctacgtgcag ttgatcaacc cgctggtcga 350581 tgcggccgcg gtttctggtg aaacccaaca gaaggggagt cagtcatgac gttgcgagtg 350641 gttccggagg ggctggccgc agccagcgct gcggtggaag cgctgacggc gcggttggcc 350701 gccgcgcatg cgagcgcagc gccggtgatt accgcggtag tgccgccggc ggcggatccg 350761 gtgtcgctgc agaccgcggc cgggttcagt gcacagggcg tcgagcacgc ggtcgtcacc 350821 gccgaaggtg tcgaagagct gggacgcgcc ggcgttggtg tgggcgaatc cggcgccagc 350881 tacctggccg gtgatgcggc cgccgccgct acgtacgggg tcgtgggcgg ctgagcatgg 350941 ccgcgcccat ctggatggct tcgccgccgg aggtacattc ggcgttgctt agcaatggtc 351001 cgggcccggg ttcgctagtg gcggctgcca cggcctggag ccagctgagt gccgagtatg 351061 cctcgacggc agcagaactc agtgggctac tgggggcggt acctggttgg gcatggcagg 351121 ggcccagcgc ggagtggtac gtggccgcgc atttgccata tgtggcgtgg ctgacgcagg 351181 ccagtgcgga tgccgcagga gcagcggccc agcacgaggc cgccgcggcg gcctacacca 351241 ctgccttggc agccatgccg acattagcgg agttggccgc caaccacgtg attcacaccg 351301 tgttggtggc gacgaatttc tttgggatca acacgattcc catcacgctc aatgaggccg 351361 attacgtgcg catgtggttg caggcggccg ccgtcatggg tctttatcag gcggcttcgg 351421 gtgcggcact ggcttcggcg ccgcgcaccg tcccggcgcc gacggttatg aatccaggtg 351481 gcggtgcggc cagcactgtc ggggcggtca acccctggca gtggctctta gcgttgcttc 351541 aacagctctg gaacgcctac acgggtttct acgggtggat gttgcagctc atctggcagt 351601 tcctgcagga tcccattggt aactcgatca agatcatcat cgccttcctc acgaatccca 351661 ttcaggcact gatcacttac gggccgctgt tgttcgcgct gggctaccag attttcttca 351721 acctggtcgg ctggccgacc tggggcatga tcttgagctc gccgttcttg ttgccggccg 351781 ggctcgggct gggcttggca gcaatagcct ttctacctat tgtgcttgcg cccgcggtga 351841 ttccgccggc gagtactccg ctggctgctg ccgccgtcgc cgccgggtcg gtgtggccgg 351901 cggtcagcat ggccgtaacg ggggcgggca ccgctggggc tgcgacgccc gcggcgggcg 351961 cggctccgtc tgcgggcgca gcgccggccc cggcagctcc cgcgaccgcc agtttcgcct 352021 atgcggtggg tggcagcggt gattgggggc cgagcttggg gccgacggta ggtggtcgcg 352081 gtggtatcaa ggcgccggcc gctacggttc cggcggcggc cgcggcggcg gcaactcgtg 352141 ggcagtcgcg cgcgcggcgg cgccggcggt ctgaattgcg ggactacggc gacgagttct 352201 tggacatgga ttccgatagc ggtttcggcc cctcgacggg cgaccacggc gcgcaggcct 352261 ccgaacgggg ggccgggacg ctgggattcg ccgggaccgc aaccaaagaa cgccgggtcc 352321 gggcggtcgg gctgaccgca ctggccggtg atgagttcgg caacggcccc cggatgccga 352381 tggtgccggg gacctgggag cagggcagca acgagcccga ggcgcccgac ggatcgggga 352441 gagggggagg cgacggctta ccgcacgaca gcaagtaacc gaattccgaa tcacgtggac 352501 ccgtacgggt cgaaaggaga gatgttatga gccttttgga tgctcatatc ccacagttgg 352561 tggcctccca gtcggcgttt gccgccaagg cggggctgat gcggcacacg atcggtcagg 352621 ccgagcaggc ggcgatgtcg gctcaggcgt ttcaccaggg ggagtcgtcg gcggcgtttc 352681 aggccgccca tgcccggttt gtggcggcgg ccgccaaagt caacaccttg ttggatgtcg 352741 cgcaggcgaa tctgggtgag gccgccggta cctatgtggc cgccgatgct gcggccgcgt 352801 cgacctatac cgggttctga tcgaaccctg ctgaccgaga ggacttgtga tgtcgcaaat 352861 catgtacaac taccccgcga tgttgggtca cgccggggat atggccggat atgccggcac 352921 gctgcagagc ttgggtgccg agatcgccgt ggagcaggcc gcgttgcaga gtgcgtggca 352981 gggcgatacc gggatcacgt atcaggcgtg gcaggcacag tggaaccagg ccatggaaga 353041 tttggtgcgg gcctatcatg cgatgtccag cacccatgaa gccaacacca tggcgatgat 353101 ggcccgcgac acggccgaag ccgccaaatg gggcggctag ctcgcgctac atggatgcaa 353161 cacccaacgc cgtcgagctg acggtcgaca acgcttggtt catcgctgaa accattgggg 353221 cggggacctt tccgtgggtg ctggcgatca cgatgcccta tagtgatgcc gcccagcggg 353281 gtgcgttcgt cgaccgtcag cgcgacgagc tgacccggat ggggctgtta tcgccgcagg 353341 gtgttatcaa ccctgcggtc gccgactgga tcaaagtggt gtgcttcccg gaccgctggc 353401 ttgacctgcg ttatgtgggg ccggcctcgg ccgacggcgc ctgcgagctg ctacgtggca 353461 tcgtcgcgct gcgcaccggc accggtaaga cctccaacaa gaccggaaac ggtgttgttg 353521 cgctgcgtaa tgcgcagctg gtcacgttca ccgcgatgga tatcgacgac ccccgggcgc 353581 tggttccgat tcttggtgtc ggtttggcgc accggccgcc ggcgcggttc gacgagttca 353641 gcttgccgac gcgggtgggc gcgcgggccg acgaacggct gcggtccggc gtgccactcg 353701 gggaagtcgt tgactatctg ggtattccgg cgtccgcacg gccggtggtg gagtccgtct 353761 tctcggggcc gcgcagctac gtcgagatcg tcgccgggtg caaccgtgac ggccggcaca 353821 ccaccaccga ggtcggccta agcatcgtcg acacctcggc gggccgggtg ttggtgagtc 353881 cgtcgcgggc attcgacggc gagtgggtct ccaccttcag ccctgggaca ccgtttgcga 353941 tcgccgtcgc gatccaaaca ctgaccgcgt gcttgccaga cgggcaatgg ttcccgggac 354001 agcgggtgtc gcgggacttc tccacccaat cctcgtaatc agaaaccaga aagtgagcac 354061 gatgtcccag gaacggtccc gctgatgtcc ggcaccgtca tgcagatcgt ccgcgtcgcc 354121 attcttgcgg acagcaggtt gaccgagatg gccctgcccg cggagttgcc actgcgcgaa 354181 atcctgcccg cggtacaacg cttggtggtt ccctcggcgc aaaacggcga tggtggccaa 354241 gccgactccg gcgctgccgt gcaactgagt ttggcgcccg tcggcgggca gccgtttagc 354301 ttggatgcca gcctggacac cgtcggtgtc gtcgacggtg atctgttggt gttgcagccg 354361 gtgcccgccg gtccggccgc gccgggcatc gtcgaagaca tcgccgacgc cgcgatgatc 354421 ttttcgacgt cgcggttaaa gccctggggc atagcgcata tccaacgagg agcgctggcc 354481 gcggtgattg ccgtggctct gctggctacc ggtttgacgg tgacctatcg ggttgccacc 354541 ggtgtgctgg ccgggctgct ggcggtggcc gggatcgcgg tggctagcgc gctggccgga 354601 ttgttgatca ccatccgttc gccacgttcg ggtatcgcgc tgtcgatcgc cgcgctggtc 354661 cccatcggcg cggccctggc gttggcggtg ccaggaaagt tcgggccggc gcaggtattg 354721 ctgggtgcag ctggggtagc cgcatggtcg ctgatcgcgc tgatgattcc cagcgccgaa 354781 cgggaacgcg tcgtcgcctt cttcaccgca gcggcggtgg tcggggcgtc ggtggcgctg 354841 gcggccggtg cgcaattgct gtggcagctg ccgttgttga gcatcggctg cgggctgatt 354901 gtggcggcgc tgttggtcac catccaggcg gctcagcttt ccgcactgtg ggcgcggttc 354961 ccgttgccgg tgatcccggc gccgggggat cccaccccgt cggccccgcc gttgcgcctg 355021 ctggaggatt tgcctcggcg ggtgcgggtc agtgacgccc atcaaagcgg cttcatcgcc 355081 gcggccgtgc tgctcagcgt gttggggtcg gtggccatcg cggtgcgccc agaggcgctc 355141 agcgttgtgg gctggtatct ggtggcggcg actgcggccg cggccaccct gcgcgcgcgg 355201 gtgtgggatt cggccgcatg caaggcgtgg ctgctggctc agccctatct ggtagccggg 355261 gtcctgttgg tgttctacac cacgaccgga cgctatgtcg ccgcgttcgg cgcggtgctg 355321 gtgctagccg tgctcatgct ggcctgggtt gtggtggcac tgaacccggg catcgcttcg 355381 ccggagagct actcgctgcc gctgcgccgg ctgctgggtt tggtcgccgc cgggctggat 355441 gtttcgctga tccccgtcat ggcctacctg gtcggattgt tcgcttgggt gctcaacaga 355501 tgatccgtgc cgcatttgcg tgtctggcgg cgaccgtggt cgttgcgggg tggtggacgc 355561 cgccggcgtg ggcgatcggg ccgccggtgg tggacgccgc cgcgcaaccg cccagcggag 355621 acccgggacc ggtggcgccg atggaacaac gcggtgcgtg cagcgtctcc ggtgttatcc 355681 cgggcaccga tccaggcgta ccgacgccca gccaaacgat gctgaatctg cctgcggctt 355741 ggcagttttc ccggggtgag ggccagctgg tggcgatcat cgacaccggg gtgcagccgg 355801 gcccgcgact gcccaacgtc gatgccggtg gtgacttcgt ggagtcgacc gacgggctga 355861 ccgattgtga cgggcatggc accctggtcg ccggaatcgt cgccggccag cccggtaatg 355921 acggcttctc tggtgtggcg ccggcggcgc ggctgctgtc catcagggcg atgtctacga 355981 agttctcacc gcgcacatcg gggggcgatc cgcagctggc gcaggccaca cttgacgtcg 356041 cggtgctggc cggtgccatc gttcatgcgg ccgaccttgg tgccaaggtg atcaacgtct 356101 ccacgatcac ctgcctaccc gccgatcgga tggtcgacca ggccgcgctg ggcgcggcga 356161 tccggtatgc ggcggtggac aaggacgcgg tgatcgtggc ggccgcggga aacaccggag 356221 cgagcggatc ggtcagcgcg tcgtgtgatt ccaacccgtt gaccgatctg agccgcccag 356281 acgatccgcg gaactgggcg ggcgtcacct cggtgtccat cccgtcgtgg tggcagccct 356341 acgtgttgtc ggtggcgtcg ctcacatccg ccgggcagcc atcgaaattc agcatgcccg 356401 ggccgtgggt gggcatcgcc gcacccgggg aaaacattgc gtcggtgagt aactcaggcg 356461 acggcgccct ggctaacgga ctgcccgacg cccaccagaa actggtggct ctcagcggca 356521 ccagctacgc ggccggctat gtctccgggg tggccgcgct ggtccgcagc cgctatcccg 356581 ggctgaacgc caccgaggtg gtgcgccggc tgaccgccac cgcgcaccgc ggcgcccgag 356641 agtcctccaa catcgtcggc gccggcaacc tggacgcggt ggcggccctg acctggcaac 356701 tgcccgccga acccgggggc ggtgccgcac cggccaagcc ggtcgccgat ccgccggtcc 356761 cggcgcccaa agacaccaca ccgcgcaacg tcgcattcgc cggagcagcc gcgctgagcg 356821 tgctggtcgg gctcacagcc gcgactgtcg cgatagcgcg ccgacgaagg gagcccaccg 356881 aatgaacccg atcccttctt ggcccggcag gggccgggtc acgttggtgc tgctggcggt 356941 ggtgcctgta gcgctggcct acccctggca atcgacacgc gattacgtgc tgctgggcgt 357001 ggccgccgcc gtcgtgattg ggctattcgg cttctggcgc gggctgtatt tcaccacgat 357061 cgcgcgccgc gggttggcaa tcctgcgccg ccgacgccgg attgccgagc ccgcaacgtg 357121 cacgcgcaca acggtgctgg tgtgggttgg gccgccggca tcggatacga acgtgctgcc 357181 gctgacgctg atcgcccggt atttggaccg atacggcatc cgcgccgaca cgattcgcat 357241 caccagccgc gtcaccgcat ccggcgactg ccggacctgg gtcgggttga cggtggtcgc 357301 cgacgataac ctggcggcgc tgcaggcccg gtcagcgcgc atccccttgc aagagaccgc 357361 gcaggtcgcg gcgcgccggc tcgccgacca tctgcgcgaa atcggttggg aggctggtac 357421 ggccgcaccc gacgagatcc cagcgttggt ggctgcggat tctcgcgaga cgtggcgcgg 357481 aatgcggcac accgactcgg attacgttgc ggcatatcgg gtcagcgccg atgccgagtt 357541 gcccgatacg ttgcccgcga tccggtcgcg tccggcgcag gagacctgga tcgcgctgga 357601 gatcgcatat gccgccgggt catcaacccg ctacacggtg gccgctgcct gcgcattgcg 357661 gaccgattgg cggcctggcg gcaccgcacc ggtggccggc ctgctcccgc aacacggaaa 357721 ccacgtgcca gccctgacag ccttggatcc gcgatccacc cgccgactcg acgggcacac 357781 cgatgctcct gccgacctgc tgacccggct gcactggcct actcctaccg ccggcgccca 357841 ccgggcaccg ctgaccaacg ccgtcagtcg aacatgaggc cctgcaggaa cacggtcatc 357901 cgccgcagat agtccaactg gctcacatgc agcaggtggc tgccggggaa ccagtgcagc 357961 gcacagcgat cccactgctt ccacagcgtt accgcgtgct cgggtggagc cattcgatcg 358021 ccaaggccgg tgatgatcat ccgccggtcc ttaggtagca gcggccgata gttcagtggg 358081 ccgtggtagg ccagcccggc gatcagctca tcacggctga tgttggttag ccgcagtcct 358141 agcttgacga gcttattggc cggaaaccat tcgtcgaaca gcttggcggg catgacgacg 358201 gggcagttgg ggatgacagc ctcaagccga ctttcgaccg aagccagcag cgcagacgtg 358261 tagcccccca gggatatacc cgtcagggcg atacggtcga cgccgatgtg gcgcaggtag 358321 tccacgatgg aacgaaagtc atacactgcc tgcgccatcg cctcggcgaa gccgctcaat 358381 ccgctagtga aatagccgaa accgctaaac ggcgagaact tttcggcccg ctggccgtga 358441 aacggcaacg tgtacagcaa aacgtcgtag ccggaccggt aataccaagg cagcgaaaag 358501 aacagcccgt tgagcaagta tgacgatccc atgaagccgt ggatgacgca cagcgtagga 358561 cgcgggccgt cgcggtggcg ccagtgctgc gcgtgcacaa tgttgttggc ggtcaatgca 358621 ctccaccgct ggcgcatcgt ggggttgatc gcccggaagc cgctggcaaa tgcgatgttg 358681 tccacggtgc cgcgcgcaac ccattcggtg agcgggctgg ccggccgcga ggtgaccttg 358741 ggcaactccg tcggcgccgg aaaggacttc gccggatcat gcgctgccgc aagttcggcg 358801 tagaagttca ggttgctgcg ctcgctgcct tcgttgacgt gacgtagtgc gttggcgaca 358861 acggccggag tcaccgtcgc ggacagcacc gacgcgaccg cggtgcgcag cgcgacatcg 358921 gcgatcgccg aagactcgac gagtatccgc tggcgggccg acagcaccga gcgcgagggc 358981 aggccctcgg cgccggcatc cgcgccgggg acgtcgggaa tggggacggg cggaccgatc 359041 gcgtcggcag tgaacgtccc tgacatctcg gacatcaatg tcgatggtaa tcgccaatgt 359101 ggctgaccgc tgaaggtttc gactgtatcg tcaatttctc actcggtcga gcgcttgtcc 359161 aggagcacgt acatgtggga tcccgacgtc tacctggctt tttcgggtca tcgcaaccgc 359221 ccgttctacg agttggtgtc acgggtgggt ctcgagcggg cgcgccgcgt ggtcgacctg 359281 gggtgcgggc ccggccacct gacacgctac ctggcacgac gatggcccgg cgcggtgatc 359341 gaggctctgg acagctcacc ggagatggtc gctgccgcgg ccgaacgcgg gatcgacgcc 359401 accaccggtg acctgcggga ctggaaacca aagcccgaca ccgatgtggt ggtgagcaac 359461 gctgcgttgc attgggtgcc tgagcattcc gacctgttgg tccggtgggt cgacgagctg 359521 gcgccgggat catggatcgc tgttcagatc cccggcaact tcgagacgcc gtcgcacgcc 359581 gcggtacggg cgttggcccg ccgcgagccg tatgcaaagc taatgcgcga catacctttt 359641 cgtgtgggcg cggtggtcca atctccggcg tattacgcgg agctgctgat ggacaccggc 359701 tgcaaggtcg acgtgtggga gaccacgtac ctacaccagc tgaccggcga gcacccggtg 359761 ttggactgga ttaccggaag cgcgctggtc ccagtgcgtg agcggctcag cgatgagagc 359821 tggcagcagt ttcggcagga gctcattccg ctgctgaacg acgcctaccc gccacgggcc 359881 gacggtagca ccatctttcc cttccggcgg ctgttcatgg tcgccgaagt tggtggcgcg 359941 cgccgctcag gtgggtagcc ccagccgcgg cgcctccgct cggtaccggt cgacccactc 360001 atcagagcgc tggttggcct gccgttccag catcggtgcg ggcgccagtt tgggatcctg 360061 gccgatggcg tcaagcacac tggccacaat cgcggtcagg ttgcgccata acaccgggta 360121 ggcgatatcg atcggatcga tgccttcctc ggcaaaccaa gcgcgccagc cgttttcctg 360181 atcgcgcaga ttcctgatga tgtgggcgat ggcaccggcg tggtagacgg cctgcgagtc 360241 gcgcttgggg tccggatggc cccgccaaac ctgggtttgc acggcgcgcc agaacgacac 360301 cgcttgtgac accacatcgg gccggtggac gtgcacgaaa accggttcgt tgccaatgac 360361 gtcgcggatt gccgcgcgca agccatcccc ggagcgatcc ggcaattgtg ctgcgcgttg 360421 ctgcagcagc gcagtctgat tccacatcaa cttgccgccc cagacgccgt tgggcgtgcg 360481 accggaggtg cggacgtgct cacgccaggc aaccggcgtc gcggtgtccg gtgtaccggg 360541 gtccagcgga tcgagcaatt gcaggatcgt gtcatcgtcg accccagcga accactcccg 360601 gggctggggg gccatcccgg tgctaggcag gtattggaag aactcctgtg gttccccggc 360661 acagcccgtc gcgcgcagcg attccaccag cagcgtgctg ccgctgcgtt gggtggcgag 360721 caccagatac ggtctcacag cgcgggacat ccgatgagcc tagctgcagt gttcgtcgat 360781 gccgcggtcg gcggcgatcg ctgaccggcc cgttggcgtc ttgcggtgga tccgcagata 360841 cgtttcggtg tagcgctcgg cgatgcggga accggcgaag tccgacggaa tgacgtcggc 360901 cgtgcgctgt cgccaatcat gcagtcgcac ggccagatcg gccgcgatcg cggccacgcc 360961 ctgggtgctg tcgtcgccgg ctaacaggtt attggtctcg gtgggatcgg cgcgtagatc 361021 gtagagttcc cgctgcgggc ggggcgcctt gaccaacggt gcgacggcca tgccggccgg 361081 gctttcctgg atatcccacg gtaggtccag cagcggccgg ggcgcgtaat tctcgatgta 361141 gctgtattcc ttggtgcgga ttgcccgaat cggatcgaac gagtcgtgat aggtcttggc 361201 ggtgtatacg tggtcacgca ccgcagcgtt ttcagtgtcc ggcgcgagga gggccggtgc 361261 gtgtgacaca ccctcgacat cggcgggtac ctcgagtctc agcaggtcca atagcgtcgg 361321 aaccagatcg acgccgctga aaagctcgtc atagacgcga ggcgccatcg cccggcgagt 361381 gggcgggcgg atgatcagcg cgataccggt tccggcgtca tacagtgtgg acttcgcccg 361441 cggaaatgcc ggaccgtgat cggtgacgaa caccacccag gtgctggcgt ctaggccggt 361501 atcggccagt gtgtcaagta gccggccaac cgcctcgtcg gctgtggcga tagaaccgta 361561 gaactcggcg acgtcttggc gcacctcggg ggtatcgggc agatagtcgg gcagctcgac 361621 ggccgcgctg tcggccggcc ggtagcgctc atgcggatag ggccggtggg tttcgaagaa 361681 gccggcggtc aacaggaacc gttgtccgtc taacgcgggc acgcgattat gcagccagtc 361741 ctgggctttg gcgaccacgt attcgcagta ggagttcgac acgtcgaatt cgtcgaagcc 361801 cagccgcttt gggtaggacg tctcatgctg cataccgaaa agagctgagt accaacccga 361861 ttcggatagc aattgcggta gggtttggac cccggtgcgg tattcccagc cgtgatgggc 361921 caggccgacc aacccgttgc tttgcgggta gcggccggtg aacagcgagc cccgcgatgg 361981 tgtgcacagc ggcgcggtgg catgtgccct ggtgaacagg atgccctcgg cggcaagccg 362041 gtccagccgc gggctgtaga cgtccggatg gtggtagacg ccgagatagc gccccaggtc 362101 gtgccagtgc acgatcagca ggttctcgcg ctgccctgtg gcacgctcac tcgtcacctt 362161 tgccacctct ccagcgaacc gcacccggcg ccgaagccgg acaatagagc ctatacgtcg 362221 cgaggcacta gatacgccac cgatgatggc ggtaggctcg ctgattgaat cgcggcgacg 362281 gcgtaggcgt gttgtgtctt ggcgtccagg agtcacgagt cgacgggagg ttcccgtgtc 362341 ctttgtgatc gcacaaccgg agatgatcgc ggcggcggcc ggtgagttgg ccagcatcag 362401 atcggcgatc aacgcggcca atgcggcggc cgcggcccag accaccggag tcatgtcggc 362461 ggccgccgac gaggtgtcta cggcggttgc cgcgctgttt tcctcgcatg cccaggccta 362521 tcaggccgcc agcgcgcaag cggccgcctt tcacgcccag gtggtgcgga ccctgaccgt 362581 ggacgcggga gcgtatgcca gcgccgaggc cgccaacgcc gggccgaaca tgctggccgc 362641 ggtcaacgcc cccgcccagg cgctgttggg gcgcccactg atcggcaacg gtgccaacgg 362701 ggcgccgggc accgggcagg ccggcggcga cggtgggctg ttgttcggca acggcggcaa 362761 cggcgggtcc ggcgcacccg gacaggccgg cggggccggc ggggcggccg ggttctttgg 362821 caacggtggc aacggcgggg acggcggggc cggagcgaac ggcggcgccg gcggcaccgc 362881 cggctggttc ttcggcttcg gcggcaacgg cggggccggc gggatcggtg ttgccggcat 362941 caacggcggt ctcggcggcg ccggcggcga cggcggcaac gccgggttct tcggcaacgg 363001 cggcaacggc ggcatgggcg gggccggggc ggccggcgtg aacgccgtca atcccggcct 363061 ggccaccccg gtcaccccgg cggccaacgg cggcaacggc ctcaacctcg tcggcgttcc 363121 cggcaccgcc ggtggcggcg ccgatggcgc caacggcagt gccattggcc aggcgggcgg 363181 cgctggcggt gacggcggca acgcctccac gagtgggggc atcgggatcg cgcaaaccgg 363241 gggcgccggc ggcgctggcg gtgccggcgg cgacggcgca cccggtggca acggcggcaa 363301 tggtggcagc gtcgagcaca ctggcgctac cggctcctct gcgagcggcg gcaatggtgc 363361 caccggcggg aacggcgggg tcggtgcgcc cggcggtgcc ggcggcaacg gcggccacgt 363421 cagcggcgga tcggtcaaca cagccggcgc cggtggcaaa ggcggcaacg gcggcaccgg 363481 cggcgccggc ggcccgggcg gccacggcgg cagcgttcta tccggcccgg ttggcgacag 363541 tggcaacggt ggtgccggcg gggacggcgg ggccggggtt agcgccaccg atatcgccgg 363601 caccggcggg cgcggcggca acggtggtca tggcgggctg tggatcggca acggcggcga 363661 cggtggtgcg ggcggtgtcg gcggtgtcgg cggggccggt gcggctggcg cgatcggcgg 363721 ccacggcggc gatggcggct ccgtaaatac ccctattggc ggcagcgagg ccggtgacgg 363781 cggtaagggc ggcctgggcg gggacggcgg ggacggcgcc gccggtggcg acgggggcgc 363841 cggcggggac ggcggtgggc gcgggatatt cggccagttt ggggccggcg gggccggtgg 363901 tgccggaggc gtcggcggcg ccggcggggc tggcgggacc ggcggcggcg gcggcaacgg 363961 tggggccatt ttcaatgccg gtacccccgg cgccgccggc acgggcggtg acggcggtgt 364021 tggcgggacc ggtgcggccg gcgggaaagg cggggccggc ggtagcggcg gcgtcaacgg 364081 cgccaccggc gccgacggcg ccaagggcct cgacggtgcc accggcggca aaggcaacaa 364141 cggcaacccc ggctgagtcc ggattcaccg agtctgtaga taccgtggtc cgcattcgca 364201 gttttgtgcg ccaactacag cctcgatgac acgaccgcgg cgaatcccgt ttcccgggtg 364261 cggcgacacc gcgtcctacg attagtagga tctctggtat gacgaaagag aagatctccg 364321 tgacggtgga cgcggccgtc ctcgcggcga tcgacgcgga cgccagggcg gcgggtttga 364381 atcggtcgga aatgattgag caggcactgc gcaacgagca cctgcgtgtc gctctgcgcg 364441 attacacggc taaaaccgta ccggcgttgg acatcgatgc ctacgcacag cgggtgtacc 364501 aggcgaaccg ggcggccgga agttgatcgc tcccggcgac atcgcgccgc gccgcgacaa 364561 tgaacacgag ctctacgtcg ccgtcttgtc caacgcgctc catcgggccg cggacaccgg 364621 acgggtgatc acctgcccat tcattccggg ccgggtcccc gaggatctct tggcgatggt 364681 ggtggcggtc gagcaaccca acggcacgct gctgccggaa ctcgtgcagt ggcttcatgt 364741 tgccgcgctc ggtgcgccac tcggcaacgc gggcgtggcc gccctacgcg aggctgcctc 364801 ggttgtgaca gctctgctct gttagccctg tcaccggcga agatacctga tatcgccaga 364861 tatcatcgga agatgagtga tgtactgatt cgggacatcc ccgacgacgt gttagcaagc 364921 cttgacgcga tcgcggcacg cttgggcttg tcgcggaccg aatacatccg tcggcgttta 364981 gcccaggatg cgcagacggc tcgcgtcacc gtgacagccg cggatcttcg acgcctcagg 365041 ggtgcggttg ccggtctggg cgatcccgag cttatgcgtc aggcgtggag gtgactgacc 365101 agcgctggct gatcgacaag tcggcgctgg tgcggctcac ggacagccct gacatggaaa 365161 tctggtcgaa ccggatcgaa cgcggcctgg tacacatcac gggcgtgaca cgcttggaag 365221 tagggttctc ggccgaatgc ggggagatag cgcgacggga gtttcgtgaa ccgccgctgt 365281 ctgcgatgcc cgtggaatac ctaaccccga gaattgaaga ccgtgcgctc gaggtgcaga 365341 ccttgcttgc cgaccgcgga caccaccgtg gcccgtcgat cccggatctg ctcatcgccg 365401 cgacagccga actgtcgggc ttgacggtac tgcacgtcga caaggacttt gacgccatcg 365461 ccgcgcttac cggtcagaaa acagaacggc tcacgcatcg cccgccttcc gcttaaggag 365521 cccgaccaac ccttgtgatt ggcgtggggg ggcgctaacg taactgtctg taacgttcga 365581 tacagaactg gcgccggggt gcggccgcga ctctacgagc cgagacaagc cggcgcaagg 365641 atggcgcacc agtgggcgtt cccgccaaga aaaaacagca gcagggggag aggtcacgag 365701 aatcgattct cgacgcgacc gaacgcctga tggcgaccaa gggctacgcg gcgacctcga 365761 tcagcgacat ccgcgacgcg tgcgggctag cacccagctc tatttactgg cacttcggct 365821 ccaaagaggg cgtgctggcc gccatgatgg agcgcggcgc gcagcgcttc tttgccgcga 365881 tacccacctg ggatgaggcc catgggcccg tcgagcagcg atccgagcgc cagctgaccg 365941 agctggtgag cctgcagtcg cagcatccgg acttcctgcg cctgttctac ctgctgtcga 366001 tggaacgaag tcaggatccg gtggttgccg cggtggtgcg ccgggtccgc aacaccgcga 366061 tcgcccgatt tcgtgacagc atcacgcacc tgctgccatc ggacatcccg ccgggcaaag 366121 ccgatctcgt cgtcgcggag ctgaccgcgt tcgcggttgc gctgtcggac ggcgtctatt 366181 tcgccggcca ccttgaaccg gacacgaccg acgtcgagcg catgtaccgg cggctgcggc 366241 aagcgctcga ggccctgatt cccgtcctcc tggaggagac atgaacaccg gaaccgccgt 366301 catcaccggg gccagctccg gcctcgggtt gcagtgcgcc cgcgccctgc tacgtcgcga 366361 cgcatcgtgg catgtggtgt tggcggtgcg cgacccggcg cgcggccgtg cggccatgga 366421 ggaattgggg gagccaaacc ggtgttcggt tctcgaggtg gacctcgcgt cggtgcggtc 366481 cgtgcgcagt ttcgtggaaa ccgtgcggac cacgccgctg ccgccgattc gtgccctggt 366541 gtgcaatgcc ggcctgcagg tggtgtcggg catcgcgttc accgacgacg gtgtcgagat 366601 gacgttcggg gtaaaccact tgggtcactt tgctttagtg accgggattc tcgactggtt 366661 ggcccgtccg gcgcgcatcg ttgtcgtcag cagcggcacg cacgacccga gcaagcacac 366721 cggaatgccc gaccctcggt atacctgcgc cgccgacctc gcgcacccgc ccaccgatca 366781 gaacacgccg gccgaaggcc gccgtcgata caccacgtcc aagctgtgca acgtgctctt 366841 cacctacgag ctcgaccgcc gcctcgatca cggagaacag ggcgtgatgg tcaacgcgtt 366901 cgaccccggc ctaatgccgg gctccggctt ggcccgcgac tatccgccga tcctgcgact 366961 ggcgtaccgt ctcctgtcgc cgatgctgcg cgtccttccc ttcgttcaca gcacccgggt 367021 ctccggcgaa cacctggcgg cgctggcggt cgatccgcgg ttcgcgggcg tgacgggcca 367081 atatttcgcg ggcgccaagg cgatccggtc ttccgccgag tcctacgatc gggcaaaggc 367141 gctcgacctc tgggagacca gtgaacggct gctggcccag gtgacatagc tgcgcgttat 367201 cccctaaaga aacccgccag gttggtgcca aagttaccga tgccggaaag gaaccccggc 367261 gtcgcgagat ccagcgcgct ggcgttcaac cagcccgaga tggtgttgcc cacgttggcg 367321 acacccgatc ccagcgcgcc ggtattgagg tagcccgaca ggccggcacc ggagttcacg 367381 aatcccgata cgcttccggc gccgctgttg aagaagcccg acgacgggcc gccagtgagg 367441 ttgaagtagc cgggggtcac cggaatgccc aacagcggca ggccgatcag gccctgatag 367501 tcgccactca ccaagaagcc gttgctgtag ctgccggtaa tgaacgcacc ggtgtccaca 367561 tcgcccgtgt tcgccacgcc cgtgttgtag tcaccggtgt tgaggtagcc cgtgttgcca 367621 ctgcccgggt tgaagccgcc ggtgttgaag ctgcccgggt tgaagctgcc ggtgttgagg 367681 tccccggggt tgaacacgcc cgtgttggtg ctgcccgcgt tgccgatgcc ggtgttgaag 367741 ccgcccgagt tcgcgagacc gaagttgccg gtgccggcgt taaagatgcc gacgttgccg 367801 gtgcccgagt tgaagaaccc gatgtttccg ctgccggagt tgaacagccc gatgttgttg 367861 ctgcccgagt tgaggctgcc gatcccgatc tggccgttgc cggtgagccc gacgccgatg 367921 ttgttgttac cggtattcgc gaagccgatg ttgtagctgc cggtgttggc aaagcccagg 367981 ttgtcgctgc cgaagttcgc gaagccgatg ttgtagctgc cggcgttgcc aaagcccagg 368041 ttgtcgtcgc ccagatttgc caacccgatg ttgtagctgc ccaggttcgc caagccgata 368101 tcgaagatcc cggtgttggc gatgccgatg ttgttaccgc cgatgttgac cccgccgaag 368161 ttgaggtcgc cgaggttgcc gatgcccagg ttggagtcgc cggtattaac gaagccgatg 368221 ttgacgctgc ccaggttccc gatgcccgcg ttgaggccgc cctggtttgc gacgccgaag 368281 ttcagcgtca ggttgccggt gttgtcgagg aacaggccgg ccaggttggc gccgatgttt 368341 gcgatgcccg agccgaaggc gggcgtcgcg aggtccagcg tgctcgtgtt gtagacgccc 368401 gagatggtgt tgcccaggtt cgccagaccc gactgcagcg cgccgaaatt cagtaggccc 368461 gaattgccca agccgccggc gttaaagaag cccgaattgc cagcgccgaa gttcccgaag 368521 cccgacatgt tgccggcgcc gaagttcccg aagcccgata catggccggc gccggtgtgg 368581 aagaaacccg acgacgggcc ggtggtcgag ttgccgaaac ccggggtacc accgatgctg 368641 atgccgatgg ggatcgggcc gaagccgccg gtgccaacca tgctgatggt ttgctgaatg 368701 ggcgaatcga tggcgatgac ttgattgaca tcgatcgtga tggggccgat catctcgttg 368761 acaagcaccg ccgcaggacc aagcaagact cgtatctgga aaccgggaat ggtgaaactg 368821 tttggcgtgg tggcgacgac ggtgccggtg atgggtatgt cgattggaac actcaagtcg 368881 tagcggtagg ggatttcggg aatggtgatc gttgtggaaa ggccaatcaa cccctggtag 368941 tcacctcgcc agaagaaccc gttgctgtaa ttgccggaga tgaacgcgcc ggtgttgacg 369001 ttgcccgtgt tggccacacc cgtgttgtag tcacccgcgt tgaagtagcc cgtgttgtag 369061 tcaccggagt tgaagctgcc ggtgttgtga tcgccgaggt tgaagctgcc ggtattggtg 369121 ctgccagtgt tgaagctgcc ggtgttgatg ctgccggtgt tgccgacgcc ggtgttgacg 369181 ttgcccgggt tgaacaggct cgtgttggtg ctgcccgtgt tcccgaggcc ggtgttgaag 369241 ctgcccgagt ttgcgatgcc gaagtttgcg gtgccggtgt ttccgatgcc cacgttgccg 369301 gtgcccgagt tgaagaatcc tacgtttccg tcaccggagt tgaacaagcc gatgttgtgg 369361 ctgcccgagt tgaagctgcc gaacccgatc tgaccggtgc cggtgagccc gatgccgata 369421 ttgccgctac cggtgttggc gaagccgata ttggcactgc cggtgttagc aatgccgata 369481 tggtagttgc ccgagttggc gaagccgacg ctgtagttgc ccaggtttgc caagccaatg 369541 ttgtggttgc ccacgtttgc gaaaccgaca ttgaagattc cggtgttccc gatcccgaag 369601 ttagagcccc cgaggtttgc caagccgaca ttgaggttgc cgaggccggc caagtcgagg 369661 atcgtcgtgc cggcgccgcc ctgcagcagg ccggcgatgt ttgcgaggcc ggagccgaag 369721 gccggcgtcc cgaggtccag cgggctcgtg ttgtagatac ccgagatggt gttgcccaca 369781 ttcgccacac ccgatcccaa cgcgccgacg ttgagcaagc ccgagactcc tgaggctgcc 369841 gacgcaaggt tccacaggcc cgatgtgttg ccgccgacgt tgccgaaccc ggatgcggtg 369901 ccggcgccgg tgttgaagaa gcctgacgac gggctggtgg tcgagtttcc gatgcccggc 369961 gctgccggaa tgtcgatgat cgggatggtg atggggccga ggccggcggt ggcgctgatg 370021 ttgatcgcgg tcgtgggtcc gcccacggcg atcgcgaacg tgggaacgct gagcacgaag 370081 ctcgggacaa tgatgggacc gatgtccggc tcggtatgga tgtgaaagct aaacgcgaag 370141 gattcgaagc cgatgatggg gatagtgaaa ttgtccacca cgaggtcggt gaaactgccg 370201 gtgatcggta tgtcgattgg gatattgacg tccaagtgcg ccggaatctc cggaatagtc 370261 agcgcgtagg agtaaccgat caggccctgg tagtcgcccc gccacaagat gccgttgctg 370321 tagttgccgg agatgaaagc gccggtgctg acgtcgcccg tattcgcgat gccggtgttg 370381 tagttcccgg tgttgaagtg gccggtgttg gtgttacccg cgttgaagcc gccggtgttg 370441 aagctgccgg tattgaaatt gccggtgttg aagttaccgg ggttgaagcc gccggtgttg 370501 ccgtcgcccg agttgaacaa gcccgtgctg gtgctgcccg agttgccgat gccgaagttt 370561 ccggtgccgg tgttgccgat gccgaagttc ccggtgcccg agttaaagaa gccgatgttt 370621 ccgtcgcccg agttgaacaa gccgatgttt ccgctgcccg agttcagagc gccgatcccg 370681 atttggccgg taccggtgag cccgatgccg atattgttgc tgccggtgtt ggcaaagccg 370741 atattgttgc tgccggtgtt ggcaacgccg atgttgtagc tgcctaggtt ggcaaagcca 370801 ggttgtcgtc gccgaagttt ccgaagccga tgttgtagtt gcccagattc gccacgccga 370861 cgtcaaagat cccggtgttg ccgaagccgg cgttattgct gcccaggttt gccagcccaa 370921 ggttcagagt catcgtgccc ataccgtcgc gcatgagacc ggaggcaaag gccggcgtca 370981 cgaggtccaa cgcgctcgcg ttgtagaaac ccgagacggt gtgacccacg ttagtcacac 371041 ccgaccccag cgcaccgaca ttgagataac ccgaaatccc tgaggcgccg gcgaccacgt 371101 tcaaaaagcc cgacgcgctg cccgctccgg agttgaagaa gcccgacgac ggactagtgg 371161 tcgagttgcc gaagcccgat gtcgcgggaa tgtcgatgat cgggatggtg atggagccga 371221 taccggcgct ggcggtgata ccgatcgagg tggtgggtcc gcccacggtg atcgccgccg 371281 tgggcaaggt gatattgatg gtcgggatga tgatgggggt gaagtcgata ttattttcgg 371341 cagctacgat gctgaagccc tggagggtga cgaccccggc gtcgatgttg atgggtatat 371401 gtatcgggat gtcgacgcca aaggttaggg cgatttcggg aatcgctagc gccgcgtgca 371461 agccaatgag gccctgataa tttccactcc acaagaaccc gttgctgtag ctgccggaga 371521 tgaaggcgcc ggtgtcaaca tcgccggtgt ttccgagtcc cgtgttgtag ctgcctgggt 371581 tgaaatcccc ggtgttggaa tcgcccggat tgaagctgcc ggtgttgaag ctgccggtgt 371641 tgccgatacc ggtattgacg tcgccggagt tgaagaagcc ggtgttggtg ctgccggtgt 371701 ttccaagccc gaagtttgcg gtgccggtgt tgccgatgcc aacgtttccg ttgcccgagt 371761 tgaaaaagcc gatgtttccg ctgcccgagt tgaacaagcc gatgtttccg ctgccagaat 371821 tcagggagcc gaacccgatc tggccgtcgc ccgtgagccc aatgccgata ttgttgctgc 371881 cggtgttcgc gaacccgata ttgttgctgc cggtgttggc gaagcccagg ttgtcgttgc 371941 ccaagtttcc gaagccgacg ttgtagctgc cgaggtttcc gaagcccagg ttgtcatcgc 372001 cgaagtttcc gaagccgatg ttgtaactgc ccagatttgc caaaccgatg tcgaatattc 372061 cggtgtttgc gccgccgatg ttgttgccac cgatattggc gctgccgaag ttggcgctgc 372121 cgaggtttgc aaagccgatg ttgtagtcgc cgaggtttgc aatgccgacg ttgagggtgc 372181 cgtggtttgc caagcccagg ttgaggacca tggtgcccgt gctgtcgcgc agcaggccgg 372241 cgatactggt gctgatgttg gccaggcctg aattgaaggc cggcgtcgcg aggtccgacg 372301 tgctggtgtt gtagaacccc gagacggtgg tgcccacgtt cgccagacct gatcccaggg 372361 cgccgacgtt gaggagcccc gacgcccccg aggtcgcgga ggccaggttc caaaagcccg 372421 aattggcgcc gccgaagttg ccgaagcccg aggcgccgcc ggcgccggta ttgaagaaac 372481 ctgacgacgg gttggtggtc gagtttccga aacctggcgc cgccgggata ctgatgagcg 372541 ggatcctaat ggcgccaccg ccagttatgg tgatcgcggt attcggccct cctatggcga 372601 ctgtcgtcgt ggggccgaca accgtgatgt tcggaatggt aatggggcca aagtgggctc 372661 gctggcccgc tattgacgaa aggacgatat cgccggtggg cggaatcgtg acgcccataa 372721 gggtgatgtt gccggccgag gcggtgatcg ggatatcgat gggaatattc acgccgaggc 372781 ttatggggag aggcatatcg atcaccaggt tgaggccgac caggccctgg taatcgccgc 372841 ttaagaacaa cccgttgctg tagtttccgg tgatgaaagc cccggtgtca acgtcgccgg 372901 tgttggcgat gccggtgttg tagttgccaa tgttgaggta gccggtgttg gtattgcccg 372961 gattgaagcc accggtgttg aagctgccgg tgttgaagct accggtgttg aagctgcccg 373021 ggttgaaggc gcccgtgttg acgtcgccgg agttgaatag gccggtattg gtgttgccgg 373081 tgtttccgat gccagtgttg aagctgcccg agtttgcgat gccgaagttg ccgctgccgg 373141 aattgaagaa tccgatgttg ttgctgcccg agttgaacaa gccgatgttt ccgctgcccg 373201 agttgaagct gccgaacccg atctgtccgt tgcccgtgag cccgatgccg acattgttgc 373261 tgcccgtgtt cccaaagccg acattgttgc tgccggtgtt cgcaaagccg atgttgccgc 373321 cgcccgcgtt agcgaaaccc agattgtcgt tgccgacgtt gccgaagccg atgttgtagc 373381 tgccgaagtt gccgaagccc aggttgtcgt cgccaaggtt tccgaagccg atgttgtagc 373441 tgcccaggtt cgccaggccg acatcgaaga ttccggtgtt cccgatgccg acgttgttgt 373501 ggccgatggt ggcgccgccg aagttaaagc cgccgagact tgcgaagccc acgttgaggt 373561 tgccgtggtt ggccaagccc aagttaatag ccgcagtacc cgcgccgtcg cgcagcaggc 373621 cggcaatatt ggttccgata tttgccaacc cggagttaac ggcgggcgtc gagaggtccg 373681 acgtgcccac gttgtagata cccgagatgg tgttgcccac attcgccaca cccgatccca 373741 gcgcgccgac gttgaggaag cccgacattc ccgacgttgt ggagaccagg ttcataaagc 373801 ccgacgcggc gcccccgaag ttgccgaagc ccgaggcgct gccagcgccg ctattgaaga 373861 agcccgacga cagtccgccg gtcgagttgc cgaagcctgg agtcgctgga atatggataa 373921 tcgggatgtt gatggcgtcg acgcccacag tggcgccgat attgatcgcg gtagtgggtc 373981 cacccaccat gaccacaggt gtgggaccgg ttatccgtat cactgggacg gtgaaggggt 374041 cgatatcgac gggtccgaaa aaaataacag tgacggctgt gttcggtgga agatcgagcc 374101 cgctgtatac gatgtccgtg aagctggcgg tgatcggtat gtgaatcggg atattcacgt 374161 cgacgctcac aatcgggatt tcgggaatgt cgacgccgat ggcgaggtcg atcagaccct 374221 ggttgtcgcc ccgccacaga aggccgttgc tgtggttgcc ggagatgaaa gcgccggtgt 374281 tgatattgcc ggtgttcgcc atgccggtgt tgtagtcgcc ggtgttgaag tagccggtgt 374341 tgtagttacc tgcattgaag ccaccggtgt tgacgctgcc ggtgttgacg ctgccggtgt 374401 tatagctgcc cgggttcaag ctacccgtgt tgaggtctcc ggagttgaac aagcccgtgt 374461 tggtgctgcc cgcgttcccg atgccgaagt ttccggcgcc cgagtttccg atgccccagt 374521 ttccggtgcc cgcgtttcct atcccgaagt ttccgctgcc cgagttgaac aatccgatgt 374581 ttccgtcacc cgagttgaac aagccgatat tgtggctgcc cgagttcagg ctgccgaacc 374641 cgatctggcc gctgccggtg agcccgatcc cgatattgtt gctgccgata ttcgcaaaac 374701 cgatattgtt gtcacccgtg ttcgcgaagc caagattatt gctgccggtg ttcgcaaagc 374761 cgatgttgta gctgcccgcg tgggcgaagc ccaggttgtc gccgcctaga ttgccgaagc 374821 caatattgta gtcgcccagg tttgccaagc ccacattgaa gattccggcg tttgcaccgc 374881 cgacattgtt gccgccgacg tttgccaacc cgaagttaag ggtcatggtg cccacgctgt 374941 ggtgcagcag gccggagtta aaggctggcg tcgccagatt cgacgtgctc gtgttgtaga 375001 gacccgagat ggtgttgcca acgttagcta cacccgatcc cagcgcgccg acgttcccga 375061 agcccgaaag tcccgaggtt gccgaggcca ggttccaaaa gcccgaagcg ccgccgccga 375121 agttgccgaa gcccgaggcg gtgccggcgc cagcgttgaa gaagcccgac gacgggctgg 375181 tggtcgagtt cccgaagccc ggggccgccg gaatcttgat gagcgggatg ctgacgcccc 375241 ccaccatgcc ggtgaggttg ccgtcgatcg tggtggttgg tccgcccacg gtgatcgtca 375301 ccgtgggaag ggtgagcgtg gattgcggga gctcgaccgg gccgtagtaa acaacgaagg 375361 gaacaatgga tgtgaagggc aaacgcatgc ccggaatcgt catcacgctt ccgggcatga 375421 ccatcacctg atgtatcggc atgctgaata gctgcgcgtt tatcggaatg gcgggaatct 375481 cgagggcgat atcggcaccg atcaggcctt ggtagtcgcc ccgccacaag acgccgttgc 375541 tgtagttgcc ggcgatgaag gcgccggtgt tgacgttgcc ggtgttggcc actccagtgt 375601 tgtagtcgcc ggtgttgaag tagccggtgt tgtagttacc tgcgttgaag ctgccggtgt 375661 tgtagttgcc ggtgttgaag ttcccggtgt tgtagctgcc cgggttgaag ccgccggtgt 375721 tgacgtcgcc ggtgttgaac cagccggtgt tggtgctgcc cgtgttgccg aggccgaagt 375781 ttccggtgcc gctgtttccg atgccgaaat tgccggtgcc cgagttgaac aacccgacgt 375841 ttccgctgcc ggagttgaac aggccgatgt tgtggctgcc cgagttgaag ctgccgaacc 375901 cgatctgtcc gttgccggtg agcccgatgc cgacattgtt gctgcccgta ttcccaaagc 375961 cgacattgtt gctgccggtg ttcgcaaagc cgatgttgtg gccgcccagg ttggccaaac 376021 ccaggttgtc gctgcccagg tttgcaaagc cgaggttgta gctgcccaaa ttgccgaagc 376081 cgacgttgaa cacgccgacg tttccgttgc ccacgttgtt ggcgccgacg tttgccaagc 376141 cgagattgaa gcccgccgcg ctcggggggc cggcagcggc tgccgcggcg ctggtcagcc 376201 gctccgatag gcccgccagc ttcttcagct gctgggtgaa cggcatcaac gcggagacgg 376261 ccgccgacgc tccagcgtga tagccaacca tcgcggccac atcctgggcc cacatccgct 376321 cataggcggc ctcggtggcc gcgatcgccg gagcgttgaa tcccagcaga ttcgagctca 376381 ccagcgacac cagcacggcg cggttggccg cgacgatcgc cggatgcacc gtcgctgccc 376441 gcgccgcctc gaacgtggcc acggctaccc gtgcctgagc ggcggcctgc tcagcctggg 376501 ccgttgccga aatcaaccag cccaggtagg gggctaccgc gcgcgccatc gcaaccgccg 376561 cggggccgcg ccacgccgca tccgccaggc ccgaggtcac cgacccaaac cacgacgccg 376621 ccaccgccag ttcgtcggct agtccgtccc aggccgccgc ggccgccaac atcggccccg 376681 atccggcccc gagatacatc cgtaacgaat tgacctcggg cgccgacacc acgaaatcca 376741 tccgtcatac ccgttcgtca gctggccgtc ggaggtacgt tcaggctaat caatcgtcta 376801 ctactcgact agcccgtgaa cgggtgaaaa atgctaggac attcacgtat tggcccgagt 376861 ggggctggtc gagtatcagg ggaagcttta tggggcaaag tcaagtttgt ggttcgtcgt 376921 atcggggcga tccaaccgag cacatgttta gtgcaccaga acgacgggcc gtgtatcggg 376981 tgatcgccga acgccgagac atgcgccggt tcgtgcccgg cggtgtggtg tccgaggatg 377041 tgctggcgcg gctgttgcac gccgcacacg ccgcgcccag cgtcggtctg atgcagccat 377101 ggcgctttat ccgcatcacc gacgagacac tcaagcgacg catccacgcg ctcgtcgacg 377161 acgaacgcct actcaccgcc gaagccctgg gagcacggga agaagaattc ctggcgctga 377221 aggtcgaggg cattctcgac tgcgccgagc tgctggtggt ggcgctgtgc gaccgcagag 377281 ggtcctacat cttcggccgg cgcaccctgc cccagatgga tctggcgtcg gtgtcgtgcg 377341 ccatccaaaa cctgtggctg gcagcgcggt ccgaaggcct gggcatggga tgggtgtcgc 377401 tgttcgaccc acaacgttta gcggccctgc tggcgatgcc cgccgacgcc gaaccggtgg 377461 ccatcttgtg cctggggccg gtgcccgagt ttccggaccg gcccgcgctg gaactggatg 377521 gctgggccta cgcgcggcca ctcgcggaat tcgtctccga aaaccgatgg agttatccgt 377581 cggcgctggc cacagatcac catcacggcg aataggtcac gccgaccgcg aggttgacgt 377641 attcggccgg cacgtcaaag gccagcgatc gccgcggcaa gcggctcaac acatcttcac 377701 cgatagtggc cttgaagtgc agcgcgagct ggccgctgaa cttcgggtcg acatcggcga 377761 catcgagatc gagttgcaga cccgtcggcg agatttcggc ggtcgcggca ccggcctgcg 377821 gcccgtccca gcgcgcgtcg acgacccggc cggctagctt cggcaccatc gacagggttc 377881 ccagcacccg ctgttcggtg aacgccagcg cacccacgta gctggcgatg ctgtgcgatg 377941 cccgcagccc cgggataacg ccggtgaacc gcctggtgac ggccacgtat tcggcaaggt 378001 agatgagtcc ctcagcctca acctgacagc gtaggtcagc aggcagcctg ccgagaccaa 378061 accacttgcg aacaatgaca gccatgaggc cagtatggag tcgttttgtc ggtgccgcac 378121 cgatgctggt aggagttaga gcatgactcg cccgcaagcg cttctcgctg tttcgctcgc 378181 ttttgtcgca accgcggtgt atgccgtcat gtgggtgggg cactcccagg attggggttg 378241 gctgcatagt ttcgattggt cgttgttgaa cgcagcgcac gacatcggga taaagaaccc 378301 tgcgtgggtg cgcttctggg atggtgtatc cctgatcttg ggcccagtcg tgctgcggcc 378361 gctgggtttg ctggccgcga tggtcgcact ggcgaagcgc aagatacgga tagcgttgtt 378421 gctgttggcc tgtttaccgc tcaacgcgat catgacgatc gcggccaaat ccgtggccca 378481 ccgcccgcga ccggcgactg cgctggtatc tgcccattcg acttcgtttc cgtcagggca 378541 tgcgttggag gcgaccgcaa gcgtactcgc gctgctaacc gtcctgttgc ccatgctgca 378601 cagcaggttt actcggcaca tcgccatcac ggtgggcgcg ctgtgcgtgt tgacggtcgg 378661 tgttgccagg gtggcgttga acgtgcatca tccgaccgac gttgttgccg gctgggcgct 378721 ggggtacctg tatttcctcg tgtgcctgtg cgtatttcga ccgccgtcga tattcggtgc 378781 ccaacgcgcg tctcatgctt tgtcgccgcc agtggaggtg tcgagacaac ccgaaccgga 378841 agtcgacacg gcccgctaaa gccatggtgc gctgtgcatt tcgctttgtc accgcacagt 378901 gacccagccg gattctaacc ttgacttgac cacacgaggt gattgtctga cgattgagcg 378961 atgagccgac tcctagcttt gctgtgcgct gcggtatgca cgggctgcgt tgctgtggtt 379021 ctcgcgccag tgagcctggc cgtcgtcaac ccgtggttcg cgaactcggt cggcaatgcc 379081 actcaggtgg tttcggtggt gggaaccggc ggttcgacgg ccaagatgga tgtctaccaa 379141 cgcaccgccg ccggctggca gccgctcaag accggtatca ccacccatat cggttcggcg 379201 ggcatggcgc cggaagccaa gagcggatat ccggccactc cgatgggggt ttacagcctg 379261 gactccgctt ttggcaccgc gccgaatccc ggtggcgggt tgccgtatac ccaagtcgga 379321 cccaatcact ggtggagtgg cgacgacaat agccccacct ttaactccat gcaggtctgt 379381 cagaagtccc agtgcccgtt cagcacggcc gacagcgaga acctgcaaat cccgcagtac 379441 aagcattcgg tcgtgatggg cgtcaacaag gccaaggtcc caggcaaagg ctccgcgttc 379501 ttctttcaca ccaccgacgg cgggcccacc gcgggttgtg tggcgatcga cgatgccacg 379561 ctggtgcaga tcatccgttg gctgcggcct ggtgcggtga tcgcgatcgc caagtaaccc 379621 cggacctcga ttgtgaactg tgcgacgggt tttcggcgtg ttgcgtcgtg agattcacgt 379681 tcggcgtcaa tcggccagcg cgcggcccgg cctgatgttg aagttaaggc ccgccaacga 379741 catggtcgcc tcgtaggttc ggtcgtagcc ggtggcgctg atccgccagc cgtcggtggt 379801 tcgtcggtac tggtcgtggt agaacgcggc gccgatgagc atgaaattga actcggcgac 379861 gatgacccgg tcttgcaggt accagatgcc ggttgcggta tcgccggtca cggtgatttc 379921 cggatgggtg acccggtgtt cggtgatgac acccgggccg agtgcctggc gcaggtagtc 379981 gaccaggtcg gcgcggttgg tgaagtgcag ctccgtaccg accgatgacc cgtaatcgcc 380041 ggtgacatcc tcggccaggg tgtcggtgaa gtcgtcccaa tgcttggtgt ccaatgcccg 380101 cagataccgg tatttgagct gtttgatcgc tgcaatgtcg gctggatcac ccggagtcac 380161 cacgccattg cagcacaccg gctcacgggt agctttgggg tatgagccaa tcccggtacg 380221 cggggttgtc ccgcagcgag ctggcagttc tgttacccga gctgttgttg atcggccagc 380281 tgatcgaccg atcgggcatg gcctggtgta tacaggcatt cggccgccag gagatgctgc 380341 agatcgccat cgaggagtgg gcgggcgcca gcccgatcta caccaagcgc atgcaaaagg 380401 cgctgaactt cgagggcgac gacgtgccca ccatcttcaa ggggctacag ctcgacatcg 380461 gcgcgccgcc gcaattcatg gacttccgtt tcaccctgca cgaccgctgg cacggcgagt 380521 ttcacctcga ccactgcggt gcgctgctcg acgtggagcc gatgggcgac gactacgtcg 380581 tcggcatgtg ccacaccatc gaagatccga cgttcgacgc caccgcgatc gcgaccaacc 380641 cgcgcgcgca ggtgcgcccc atccaccggc cgccccgcaa gccggccgac cggcatccgc 380701 actgtgcgtg gaccgtcatc atcgacgagt cctatcccga ggctgagggt attccggcgc 380761 tggacgcggt ccgtgaaacc aaagctgcca cctgggaatt agacaacgtc gatgcgtctg 380821 acgacgggct ggtggactat tcgggtccgc tggtgtccga cctggacttc ggggcgttct 380881 cgcattccgc actggtgcgg atggccgatg aggtctgcct gcaaatgcac ctgctgaatc 380941 tgtcgttcgc cattgccgtg cggaaacggg ccaaagccga tgctcaactg gccatttcgg 381001 tgaacacccg ccagttgatc ggagtggccg ggctgggcgc agaacgcatt caccgtgcga 381061 tggctttacc cggcggaatc gaaggcgcgt taggtgtgct ggagctacac ccgctgctca 381121 acccggccgg ttacgtgctg gccgaaacgt cgccggaccg tctggtggtg cacaactcgc 381181 cagcccacgc cgacggcgcc tggatttcgt tgtgcacacc ggcatccgtg cagccgttgc 381241 aggccatcgc caccgctgta gacccgcatc tgaaggttcg gatcagcggg acggacaccg 381301 actggaccgc ggaactcatc gaggccgatg ccccagcgag cgaactgccg gaggtgttgg 381361 tagccaaggt cagtcgcgga tcggtcttcc agttcgagcc gaggcgctca ctgccgttga 381421 ccgtgaaatg agctcgatgc gatctgtcaa gtcggtggcg gtaccgcttc ggtgacacca 381481 ccgcatcgac cgcataccaa tgaggttgtc accgaaccgt atacggccca cccgccgcta 381541 tggttaacgc tggccaccga cccctattga cgaaagcctt ccgctatgta cgacccgctg 381601 gggttgtcga tcgggaccac aaacctggtc gcggcgggta acggaggtcc gccggttact 381661 cgtcgcgccg tgctgaccct gtacccgcat tgcgcaccga aaatcggtgt gcctagccag 381721 aacccgaact tgatcgagcc gggcgcccta atgagcggct ttgttgagcg cattggagat 381781 gcggtggcgc tggtgtctcc cgacggatcc gtgcacgatc cagacctctt gctggtcgag 381841 gcgctggatg cgatggtgct gaccgccggt gcggacgcga gttcctcgga gatcgccatt 381901 gccgttcccg cgcattggaa gcccggagct gtacacgcac tgcgtaacgg tttgcggacg 381961 cacgtcggct tcgtccgcag cggcatggcg ccgcgcctgg tttccgatgc gatcgcggcg 382021 ttgaccgcgg tgaactcgga attgggcctg ccccacggcg gtgtggtggg gttgcttgat 382081 ttcggtggct ccgcgactta cgtcaccttg gtggagacca agtcggattc caggacgtcg 382141 gatttccagc ccgttagtgc cacggcacgg taccaggact tttccggtag tcagatcgac 382201 caggctttgc tgcttcgggt catcgaccaa ttcgggtacg gcgatgacgt cgatccggcc 382261 agtaccgccg cggtcgggca actcggccaa ctcagggagc agtgccgtgc ggcaaaggaa 382321 cgactgtcca ccgacgttgc cacggaattg ttcgctgagc ttgccgggtg cagctcgagc 382381 atcgagatga ctcgggaaca gctcgaagac ctgatccagg atccattgac cggcttcatc 382441 tacgcgttcg acgacatgct ggcgcgccac aacgcgagct gggcggatct cgcggcggtg 382501 gtcaccgtcg gcggtggtgc caatattccc cttgtgactc aacgtctttc gttccacact 382561 cgtcgacctg tgctgaccgc gtcgcaaccc gggtgcgcgg cggcgatggg tgcgttgctg 382621 ctcgccaacc gtgggggaga gcgcgattcg cgaacgcgga cgtccatcgg cctcgccacg 382681 gccgcagccg ccggcaccag tgtcatcgag ctgccggccg gcgacgtcat ggtcatcgac 382741 catgaggcct tgaccgatcg cgagttggcc tggtcgcaga ccgacttccc aagcgaagct 382801 ccggcgcgtt tcgagggcga ctcgtataac gaaggcggcc cctgctggtc gatgcgtctg 382861 aacgcggtcg agccccccaa aggaccagcg tggcggcgaa tccgggtgtc gcagttgctc 382921 atcggggtgt cggcggtagt ggccatgacc gcgatcgggg gcgtggcatt gacgttgaca 382981 gccatcgaga gacgcccaag cccgctacca accccaattg tgcccggcct ggccccgatg 383041 ccgcccggat ccgtcgtgcc tagctcgcgc gcaccgaccc cgccgccacc gccgtcgacc 383101 gttgcgccgc ttcccagtgc ggcaccggcc ccgacgacgg tcgcgccggc accgccgccg 383161 cccacacagg tggtgacgac cacgacagcg ccacccgtca ccacgacgcc gaggccgtcg 383221 ccgaccacca caacgaccac cgcgccaccg tcgacaacga cgacaaccga gctgccggtg 383281 acgaccactt cgacgattcc aacgattccg acgactacga cgacggtgaa gatgaccacg 383341 gagtggttgc acgtcccgtt tttgcccgtt ccgatcccgg tcccgattcc gcaaaatccg 383401 ggtgccggcg aaccgcagaa cccgttcgga agccttggct ctgggtgagc cgcgttcccc 383461 ggagctggcc ccgtcggtgt caggtccgta gtatcggtat gggttgctga ggaggtcgtg 383521 tgggcgacta tggtccgttt ggattcgatc ccgacgaatt cgatcgggtg atccgggagg 383581 ggagcgaggg actgcgcgac gcgttcgagc ggatcggcag gttcctcagc tcatccggcg 383641 cgggaacggg ctggtcggca atcttcgagg acttgtcccg gcgctcgcgt ccggcgccgg 383701 agaccgccgg cgaggccggt gacggtgtgt gggccatcta tacggtggac gccgacggtg 383761 gtgcccgcgt tgaacaggtg tatgcgaccg agcttgacgc cctgcgcgcg aacaaggaca 383821 acaccgaccc gaaacgcaaa gtccgcttcc tgccatacgg catcgcggtc agcgtcctcg 383881 acgatccggt ggacgaggcc cagtaacgtc agccctgctg gacgctgttg gaaccgccgg 383941 cattgctgat cttcggcgag cccgagtgat acgtgacctc gttgttgaag ccggcggctt 384001 cgatggtgtc gacggagtcg gcggtgaccg agttcctcat gccggacacg gtgaggctgg 384061 tgcagtggcc ggtgatcacc accgtgttgg acatgccgct gacgctgaca atgctgtcgt 384121 tgcaggcgat tgtccggttc acgttgacgc cggagacgct caggctggcg ccggccggcg 384181 gaagagtggt ggccggttgc gcagtcgggg ttggaacagc ccgggagaca gacggggtgg 384241 gcgagagaac gacgaagttg ccttgggaaa gccgctgtgc gctgaatgcg gcgatgccac 384301 ccaccagaac cagcacgccg acaacgacga ccgcggccag gatccaccac gccctgttgc 384361 cggaggacga tcgcggcgat gggccgccga acgggccgcc atagctatac ggcggcggtg 384421 gcgggccggg tggataggtg tagccgcccg actgcgagcc gccgagttcg gaggcgcgtg 384481 ccacgtcggc tagcggccgc tccagttccc ggattcgcgc ctccgggtca tcctctgggt 384541 tcatgcacag atgctcccac acgacgatca tgccgcatag gtagttgcgc ccggcggcac 384601 cacacgattc ggcttggcct gctatcgtcc catgcttatg cctgagatgg atcgtcgccg 384661 aatgatgatg atggcggggt tcggcgccct ggctgccgcg cttcccgccc cgacagcctg 384721 ggccgacccg tcccggccgg ccgcgccggc tggtccgaca ccggcgcccg ccgcgccggc 384781 tgcggcaacc ggtgggcttt tgttccacga cgagttcgac gggccggccg gttcggtccc 384841 ggacccgtcc aagtggcagg tgtcgaacca ccggacgccc atcaagaacc cggtgggctt 384901 tgaccggccc cagttttttg ggcagtaccg cgacagtcga cagaacgtgt tcctcgacgg 384961 caactccaat ctcgtgctgc gcgctacccg agagggcaac aggtatttcg gtggcctggt 385021 ccacggcctg tggcggggtg gcatcgggac cacctgggag gcccggatca agttcaactg 385081 cctggctccg ggcatgtggc ccgcctggtg gttgtccaat gacgatcctg gtcgcagcgg 385141 cgaaatcgac ctgatcgagt ggtatggcaa cgggacttgg ccgtcgggaa ccaccgtgca 385201 cgccaacccg gacggcaccg cattcgagac ctgcccgatc ggtgtggacg gtggttggca 385261 caactggcgc gtcacgtgga atccgagcgg catgtacttc tggctggatt acgccgacgg 385321 cattgagccc tacttctcgg ttccggcgac cggaatcgaa gacctcaacg agcccatccg 385381 cgagtggccg ttcaacgacc ccggctacac ggtgtttccg gtgttgaacc ttgcggttgg 385441 cggttctggt ggcggcgatc ccgcgacggg ttcctatcca caggagatgc tcgtcgactg 385501 ggtgcgcgtc ttttaacgcc tcgcgctctt gcccggggtg ctacccggct tgctcggaga 385561 aagcatggag tttttggtca ccatgaccac ccgcgttccc gatagcatgc ccgcggacgc 385621 agtcgagcgg gtccgtgccc gcgaggctgc ccgctcgcgc gagctcgcgg cacagggaaa 385681 gctactccgc ctgtggcgcc cgccgctgcg gccgggcgaa tggcgcaccc tggggctgtt 385741 cgccgccgac gacaacggcg aactggagca gctgctggcc tcgatgccgc cgcggtcgtg 385801 gcgcaccgac gacgtcacgc cgctgggtgc tcacccgaac gacccggttg gccaggggat 385861 aaccatcgcg ccgggtaagg gtccggagtt tctgatcgcg acgaccatta tggtgccacc 385921 gggtaccccg gctcaggtgg tcgacgacac cgtggcgcgc gaggctcgcc gcgcgcccga 385981 gctggccggg cggggacacc tggtgcggtt gtgggcacta cccgacggac cggacggcca 386041 gcgcaccctg gggctgtggc gggctcgcga ccctggcgag ctgatggcca tcctggaatc 386101 gctaccgctt gctggctgga tgaccatcga gaccacgccg ctgagtccgc atcccgatga 386161 tccgatccgc atgccctgac cgtttccggt gtcgccgggc tcttaggcgc cgtcccactc 386221 gccgcgggcg atgagaacat cacgaagtag gtccgcgcga tcggtgatga tgccgtccac 386281 gtccatgtcg agaagggtgt gcatcacatc gggttcgtcg acggtccagg catgcacttg 386341 gcgtcccgca gcatgaaagc cgcggacccg tgccggcgta atgaccggta caccgccaag 386401 ccgtgacggt agttgcacgc agtcgatgtc gcgcatcatc cgccaggcat atgcccggct 386461 gcccagcgga cgcgcggtca gccacgccag cagcgcgccc gttcctgccg aactagcgac 386521 ccgcttggtc agcaggcgca atgcgcgccg gcgacggcgc tcggaaaacg aaccgatcag 386581 cacccggttg tgcgcgttgc accgctcgat gacgttgacg gtcggctcga tcgccgatgc 386641 ggctttaatg tcgatgttga cccgcatgtc tggcagcgcg gtaagcaggt cttccagggt 386701 tgggatcgac tgccccgcac ccagctgcgc cttgcggaca tcacgccaat ccaaccggtc 386761 gaccgcgccg gataacccca ccccgggcgc cagcctacgg tcatgcagga tcacggctac 386821 gccgtcccgg gtggcgcgaa cgtcggtctc gatgtagcgg aatccgagct tggccgcctc 386881 ctggaacgcc cccatgctgt tcatgggcaa tctgaacgac gtaaatcctc tgtgcgccat 386941 ggcaatccgc cccccatggc gaagaaattc cacggtaggt gcgccaccgt cgctcatcag 387001 gtcagtatca catagcctcg gccgccgggg gcgtccacgc cgggggcagc accgctctgt 387061 cggcgacggt tcctgtgcac cagccgcttc cgcatcgcag tggtaatggg cgctcccata 387121 cggcgcggtc ggcgacgacg gtgcatgggc cggccatcgt tttgggcctt ccccgccttg 387181 ccgccgggcc accgacgttc agcatcacca tcagcgtcga cgtgtcacat cggagccgat 387241 gacgggaatc gaacccgcgt attcagcttg ggaagctgat gttctgccat tgaactacat 387301 cggcacggtt gcctcgaaag gctagcatcc agaatcattc catcaccccc aggccgtaca 387361 agatcagaaa tccggcaaag aacatcacgg ccatccatgg ccgctccggc gtttcgtgcg 387421 cctcgacaag cagttcctcc acgaccagcc agagcagcgc cgccgccgcg aacgccaaca 387481 cgagggtcag gacggtattt cccgcccggc ccagcgccac ggcacctgac acaccgccca 387541 ccgcgatcac taggctcagg gcgcttgtgg tcgccgcggc ccggatccta ggcattccgg 387601 agccggccag gcgcagggcc accgccagac ccaggaacag cacctcgacc gtcagggcga 387661 tggtgatgat gatcgcggtg cgactggaca ccgtcgcgcc cgttgcgacc agcaacccgt 387721 cgatgaagag gtcaaccgcg actacggtga ggaacccgac gggcagttcg cccacgtcgt 387781 cgccgtcttg atgttccccg tggccgtcaa atcggcgcag tgcaacgagt accgcgacgc 387841 ctgcactgaa gcccacaacg atcagccaga gcggacctct gctgcgcagg tctggtagca 387901 cttccccggc cacggcggcc atgacaattc ccgcggcgaa atgttggacg ccgctgacca 387961 tcgccgccga cggcgtgcgc accgacggga ccacgccgcc gagaatcccg gcgagaaccg 388021 ggaaggtgac caacgaggcg gccgttgtga cgttgctgat gccaacctcc cggtttcggt 388081 cgaagatctc ggctcgggca cgcttgaaca ttgtgacggc tagtgacaaa tgcagcgact 388141 ttcggggaaa cgggcattga aataaggaag gaacagcatg tcgaaggtgc tggtcaccgg 388201 attcggaccc tacggcgtga cgccggtaaa tccggcacag ctcaccgccg aagagctgga 388261 tggtcgcacc atcgccggcg caacggtcat ctcgcggatc gtgcccaaca cgttcttcga 388321 gtcgatcgcg gcagctcagc aggccatcgc agagatcgag ccagcattgg tgatcatgct 388381 gggcgaatac ccgggacgca gcatgatcac cgtcgagcga ctcgcgcaaa acgtcaacga 388441 ctgcgggcgg tacggcctcg ccgactgcgc cggcagggtt ttggtcggtg agccaaccga 388501 ccccgccggc ccggtcgcct accacgcgac cgtaccggtt cgcgcgatgg tgctggccat 388561 gcgaaaggcc ggcgtgccag ctgacgtctc ggacgcggcg ggcacgttcg tgtgcaatca 388621 cctcatgtac ggcgtgctgc accacctcgc ccagaagggt ctgcccgtcc gcgccggttg 388681 gattcatctg ccgtgcctgc ccagcgtcgc cgcactggat cacaacctcg gtgttccgag 388741 catgtcggtc cagacggcgg tcgccggggt cacggctggc atcgaggcag ccattcggca 388801 gtccgcagat atccgcgaac cgatcccgtc gcgattgcag atctagggcg cagctgacgg 388861 cggtcttcta gagattagat atttattctt ccgttatctt gtcgtaatct gctcagcgtg 388921 ggccgacatg aattagctag ggaccggcga aagtcgtcag cggtcctggc tgcggtcctc 388981 gccccggccg ccgtgttctt cgccacgggc ggagatgtca gtacgcttgc cgcccgcgcc 389041 gatgccaacc cggttctcgg cgacgacgcg ccctgttgtg tgcagatcgt gccggttgca 389101 ccgctggctt tctcctcaca gatatccggc ggtgaaatcg ggacgggcct tgctgccagc 389161 cagttcgctt cggcatcgag atggcgcatc gtatctcggt atttgccggt aggggtggca 389221 cccgagcagg gtctacaggt caagaccgtc ttgacagccc gcagtatcag tgcggctttc 389281 cccgaaattc gcgaaatcgg cggcgttcgg ccggatgcgc tgagatggca tcccaatggt 389341 ttggcgctcg acgtgatggt tcccaacccc ggcaccgccg agggcatagc gctgggcaac 389401 gagatcgtcg ctttcgtact gaagaacgcg acccgatttg ggatgcaaga tgtgatttgg 389461 cgtggcgcct actacacgcc caacggcgcg cggacaaccg gggccggcca ctacgaccac 389521 atccacatca cgaccgtggg cggcgggtat cccaccggcg aggaactcta catccgctga 389581 gccagcgtgc ggcgacagat acgctcgtcg ggtgctgctc tccgatcgtg atcttcgggc 389641 cgagatctcc tccgggcggt tggggatcga cccgttcgac gacaccctgg tccagccgtc 389701 cagcatcgac gtccggctcg attgcttgtt tcgggtgttc aacaacactc gctacaccca 389761 catcgacccg gccaagcagc aggacgagct gaccagcctg gtgcaaccgg tcgacgggga 389821 acccttcgtg ttgcacccgg gcgaattcgt gctcggctcg acgctggagc ttttcactct 389881 gcccgacaac ctcgccggac ggctggaagg caagtcttcg ttgggccggc tgggcctgct 389941 gacgcattcc accgcgggct tcatcgatcc tggcttcagc ggtcacatca ccctggagct 390001 atccaacgtc gccaacctgc cgatcacttt gtggcccggc atgaaaatcg gtcagctgtg 390061 catgttgcgc ctgaccagcc cgtccgagca tccctacggc agttcccggg cggggtcgaa 390121 ataccagggt cagcgcgggc ccacgccgtc gcgctcctgc cagaacttca tcaggtctac 390181 ttagcatccg gcgcggctag gcctgtcgcg ggtagctgtc acctgccgtt tgcctggtgc 390241 tcagcgccgc gatgcggttc gctcatcgca gccacctaca cacagtggtg tgcgatgcag 390301 cgtcttcggc actgggtatc tgggtgccac ccacgccgtc ggtatggcgc aactgggaca 390361 cgaggtcgtc ggggtcgata tcgatcccgg taaggtcgcc aagctcgccg ggggtgacat 390421 tccgttctac gaacccggcc tgcgaaagct gttgactgat aacctggctg ccggccgctt 390481 gcggttcacc accgactacg acatggcggc cgatttcgcc gacgtgcatt tcctgggggt 390541 cggcacgccg caaaagatag gcgaatatgg cgccgacctg cggcatgtcc acgccgtcat 390601 cgatgcgctg gtgccgcgtc tggtcagggc gtcgattctg gtcggcaagt cgacagtccc 390661 agtgggcacc gcagccgaac tgggacatcg ggccggtgca ctggcacccc ggggagtcga 390721 cgtggaaatt gcctggaatc cggaattcct gcgcgagggc ttcgcggtgc acgacaccct 390781 caaccccgac cgtatcgtcc ttggggtaca agatgattcg acgcgcgccg aggtagccgt 390841 ccgcgagctg tacgcgccgc tgctggcagc gggcgtgccg tttctggtga ccgatctgca 390901 gaccgcggag ttggtcaagg tatccgccaa tgcctttctg gcgaccaaga tttcgtttat 390961 caatgcgatc tccgaagtgt gcgaggcggc gggtgccgac gttagccagc tggccgatgc 391021 gctcggatac gacccgcgga tcggacgcca atgcctcaac gcgggcttgg gtttcggcgg 391081 cggctgcttg cccaaggaca tccgcgcttt catggcccgc gccggcgaac tgggagccga 391141 ccaggcgttg acgttcctgc gtgaagtgga cagcatcaac atgcgccggc gcaccaagat 391201 ggtggaactg gccaccaccg catgcggtgg ctcgttgctg ggcgccaata ttgcggtgct 391261 cggcgcggcg ttcaaacccg aatccgatga cgtgcgcgat tcgcccgccc tcaatgtggc 391321 gggccagctg cagctcaacg gcgccacggt ccacgtgtac gatccaaagg ccttggacaa 391381 cgcccaccga ctgttcccta ccttgaacta tgcggtttcg gttgcggagg cctgcgagcg 391441 cgcggacgcc gtgttggtgc ttaccgaatg gcgggagttc atcgatctcg aacccgctga 391501 tctagccaac cgggtgcggg cccgggtgat cgtggacggc cgcaactgcc tcgacgtgac 391561 ccgctggcgg cgggcaggct ggcgggtgtt ccggctggga gtgccgcgat tagggcactg 391621 accggcgcag ccagcgcaag tactctcggt caccgagcag ttccagacga cgccacagca 391681 cggggttgtc ggcggactgg gtgaaatggc agccgatagc ggctagctgt cggctgcggt 391741 caacctcgat catgatgtcg aggtgaccgt gaccgcgccc cccgaaggag gcgctgaact 391801 cggcgttgag ccgatcggcg atcggttggg gcagtgccca ggccaatacg gggataccgg 391861 gtgtcgaagc cgccgcgagc gcagcttcgg ttgcgcgacg gtggtcgggg tggcctgtta 391921 cgccgttgtc gtcgaacacg agtagcaggt ctgctccggc gagggcatcc accacgcgtt 391981 gcgtcagctc gttgagcggg atctgcgcta gaccgttatc cgggtatgcg agtagttgca 392041 catgatcgac acccaggacc tgtgccgcag cggcgagttc ctcccggcgc acctcaccga 392101 ggtttcggtc ggtccggccg agtgtggagg cctcgccgtg ggtgaagcac aatcctcgca 392161 gccgcgttcc ctgcgccgtg aaatcaccca ataccgcccc gagcccgaag gactcgtcgt 392221 ccggatgggc gaacacagca agcacttcgt gtgcgcaggg gagacggttg cagctgttca 392281 tcgattcacc gtccggagga tccgtgcgcg cgggtggaca gccgccgcat attatgtagt 392341 tccaatgagc aatggaatta tattcccaag gatgactgga aatggctgga cagtccgatc 392401 gtaaggcggc gttgttggac caggtagcgc gcgtgggcaa ggcgctggcc aatgggcggc 392461 gattgcaaat cctggacttg ctcgcccaag gtgagcgcgc ggtagaagcg atcgcgacgg 392521 cgaccgggat gaacctgacc acggcatcgg cgaatctgca ggcgctgaag agcggcgggc 392581 tggtcgaggc tcgccgcgag gggacccggc agtactaccg gattgctggg gaagacgtgg 392641 caaggctgtt cgcgctggtg caagtggttg ccgacgagca tctggccgac gtggcggtcg 392701 cggccgcaga cgtgctcggt tcgccggagg atgcgatcac ccgtgcggag ctgctgcggc 392761 ggcgcgaagc cggcgaggtc accctggtcg acgtgcgacc gcacgaggaa taccaggccg 392821 gccatatccc gggcgccatc aatatcccga tagccgaact ggccgaccgg ctcgccgaac 392881 tagctggcga ccgcgacatt gtcgcctact gtcgtggtgc ctactgcgtc atggcccccg 392941 atgccgtccg catcgcgcgc gacgcggggc gggaggtgaa acgcctcgac gacggaatgc 393001 tcgaatggcg attggccgga ctgccggtcg acgagggtgc accggtcggg catggggatt 393061 gatcgcccgt ggggccgaag ggaagtctac gtttggtgaa gcggcagcca gaactgctcg 393121 ttgcccagca tgaacactgg caggacacct accgagcgca tccggtgctg tacggaaccc 393181 gcccgtcaga gccgggggta tatgccgccg aggtgttcaa tgccgacggc gtgcagcggg 393241 tgctggagtt ggcggccggt catgggcgtg acaccctgta tttcgctggc cagggcttca 393301 cggtggtggc caccgatttc agcgacgttg ccgtcgcgca acttcgccga agtgcccaag 393361 cgcgcggggt ctccgcgcgg gtgcaaccga ttgtgcacga tctgcgccag cctctgcccg 393421 tcaaaaccgg ttccattgac ggcgcctttg cacacatggc gttgtgtatg gcgttgtcca 393481 ccagcgaaat tcatgcagtc gttgccgagg tcggccgggt gttgaggccg ggtgggaagt 393541 tcatctacac cgttcggcat accggcgatg cgcactacgg cgccgggcag gcccacggtg 393601 acgacatctt cgagtgcgca gggttcgcag tgcacttctt ccgccgtgag ctggtagcgc 393661 gcctggctac cggttgggta ctcgaggagg tacacgattt cgaggaaggt gagctgcccc 393721 ggcggctatg gcgggtcact gtcaccaagc ccgcctagcc ggcgctgtgg gatcagccgc 393781 aggtgtgcac cgtgtttggg gacggtggtg atgttgcgca ccaacggagt ctcgcctttg 393841 gacgggccgg cggcggtgat ggtgaagcgg cggaaaatct cttgcaggat gaccgctccc 393901 tcggtgaggg cgaacccgaa gccgaggcat cggcgcacac cgccgccgaa tggcagccag 393961 gtgttgggtg ccacgctgcc gtcaaggaac cggctaggac gaaactctgt gggtttgggg 394021 tgcgatacct cgctggcgtg ggccaacagg atcgacgtgt tgaccaccgt ccccgctggc 394081 agtcgccaac caccgatctc tgccggcgcg gtgaccttgc gagcggtaga agcgatgacg 394141 gtgtgtcggc gcattccttc cttgaggacg gcctccaaga atccgtcgtc accgccgacg 394201 gcagcccaga ctacttggct ttggatttcc ggagcatggg caagttccca caacgtccag 394261 gacagggcgg cggcggttgt ctcatgaccg gccagcagca acgtgatgag ctggtcgcga 394321 agctcggcat cggtcagcgg cttagtaggc gtgtccttgg tttgcaaaag tctggatagc 394381 acgtcggttc gggcggtgag atcggaatcg atacggcggg aggcgatctc gcggtagagg 394441 atctcgtcta tcttggtttg gttatggaag aagcgcttcc agggattcat ccgcttgagc 394501 gacgggtacg gaacgcccgc gagaatcgcg ggatggatgt ttatgatctg ttgcagccga 394561 ctagtcaact cggccttgac ttttgggtca gtgaccccga aaacgacccg caggatgatg 394621 tcgagggtga gcgcattcat gtggtcaaga ctgttgatcg ttgcgtgggg ccgccagcgc 394681 gtgatgtgtt cacgcgcaac ggaggcgatc atgtcgcggt atccgcgcag cgcggcgcgg 394741 gtgaacgcgg gcatgagcag cgatcgcatc cgcgcgtatt cggcttcgtc ggtcatcaat 394801 accgagtgct cgcccatgac aaaaccaagg atgtggttgc cttcgcccgc gtgcagcgac 394861 ctcgggtcgg ccgcgaagat ctctttgatg tgttcggggc gggtatagac cacgaggttg 394921 tcggcatatg ggggcacccg caaggagaac acgtcgccgt acttgcgatg catcgctggc 394981 aggaaccatt cccgaaacct caggtacagc acgctctgca ggtagcgggg tagccgcggc 395041 ccgggtggca ggcccgtcgt caacgtgctt gccatggcgg ctcccttctg ataatcaaat 395101 gtttgatgta aacgaatgct tatcacgata ggatgcagct gtgcaacagc aacgcacaaa 395161 ccgcgacaaa ctgctcgacg gcgctctggc ttgtttacga gaacgcggct acggcaacac 395221 cagctcgcgc gacatcgctc gtgcggcagg ggtgaacatc gcgtcgatca actaccactt 395281 cggtagcaag gacgcgctgc tcgacgatgc gctcggccgg tgcttttcga cgtggaacca 395341 gcgtgtccag gaggcattcg atcactcccg cgccgccggt ccggccgggc agatcctggc 395401 ggtactcgaa gccaccgtcg attcgttcga gcagatccgc cccgccgtgt atgcgtgtgt 395461 ggagtcatac gctccggcgt tgcgctcaga ggccttgcgg gagcgcctgg ccgccggata 395521 tgccgacgtt cggcagcatt cggtcgatct ggctggcgct gcgcttgccg gtaccgacat 395581 agcaccgccg gagaacctgt cgaccatcgt ctcggtgttg atggcggtca tcgatggcct 395641 catgatccag tggatcgccg atccgtccgc caccccgcga tcgaccgagg taatccgagc 395701 gcttgccagc atcggcgcgg tcgtcacgtc gcagttgcgg tgaaccacac ggtcgccgga 395761 tggtctgcac tgcgcttgat gccgacgtcg atgaagccgg cagcgccaag ccacgcggcg 395821 gtgtcgaggg tggggggcac gcgatagatc gcgggatcga aacgggccgc cagtggttgg 395881 tcatcggata tcgatgtaag gacgagtcga cccccgggcc gcagggctcg agcgatgtcg 395941 caaaggctgg cgcggggatc gggcgagaag taaaagttgt gcacgccgag caccttgtca 396001 aggctgtggt cggcaaccgg cagggttact ccatcgccgt gataaagcga gatcaggccg 396061 gctgcaatgg ctttcgcgtt gtgatgggcc gcgattgcga tcatggtcgt cgacacctcg 396121 acgccgctca cttgcgcgcc ggcggcggcg agcagcccaa gggttcggcc ggggccaaag 396181 ccgatctcgc aaacccgctc gcccgggccg ggcgcgagca gctcgacggc gatgcgattg 396241 acgtcggcgg tctcggctcg ccagatccgt cccagtaggc ggccgaacgc gcctgttggc 396301 cgggcagcct gactggatag gtaccgtcgg gccggatgtg tgaggcgcat ggggacgacc 396361 tttcggttgc aagcggttag tccgaagaag ctgtggtggc ccgaacgaca aactcggcga 396421 gggtcgcagc gatcgcatcg tcatcgatca cgccaggttg cacgatcgac caaggctccg 396481 gtgacgggtc gaagtgaaga tgcaccgccc acaacgcgca caactcgacg atggtccggg 396541 ccaccatcgg tgccggcccg ggcaggatca gaaggccggc gcgctcgcgg tgcactaggt 396601 atgcctggac cgcatcgact tgggcgttcc ggccggtgcc gaaccaaacc tcggcgaggt 396661 cgggtagctc gggggcacag cggtcgacca gtttgagcgc gatccggtgc cgggccaggc 396721 ggctgtagag gtcggtgacg ataccggcga gttctgctcg cgcgtctcca gtcgtcgcac 396781 ccggcggcaa agtcgctcgc aacgcgtgcg tgagccgcat gtcggtgacc tcgccagcca 396841 gtcgggccga caccacagct gcgatctcgc ccgcaacggg agcggccacc ggcagttcgg 396901 atgccagcgg aagggcttcc tgagcgtcgc cgtagcgcac cgccgccgcg aacagcgcag 396961 ccttgccctg ggcgtagcca tacagcgtgc ctttggccag ggcgagtgcg tcggccacgt 397021 cctgcacctg ggtgcgctgg taaccgtggg cgatgaacac ccgcgccgac gcggcgacaa 397081 tcgcggaaaa ccggtccgcg ggaatgctgc gggccatggg ccgataatag tttgactgac 397141 tcggtcagtc accccaagac cttgcgcaag actgcggcgg aatctaatat tccaaagata 397201 tatggaactc gatgcgaagg aatcaggctc atgagcaaga cggttctcat ccttggcgcg 397261 ggtgtcggcg gcctgaccac cgccgacacc ctccgtcaac tgctaccacc tgaggatcga 397321 atcatattgg tggacaggag ctttgacggg acgctgggct tgtcgttgct atgggtgttg 397381 cggggctggc ggcggcctga cgacgtccgc gtccgcccca ccgcggcgtc gctgcccggt 397441 gtggaaatgg ttactgcaac cgtcgcccac attgacatcg cggcccaggt agtgcacacc 397501 gacaacagcg tcatcggcta tgacgcgttg gtgatcgcat taggtgcggc gctgaacacc 397561 gacgccgttc ccggactgtc ggacgcgctc gacgccgacg tcgcgggcca gttctacacc 397621 ctggacggcg cggctgagct gcgtgcgaag gtcgaggcgc tcgagcatgg ccggatcgct 397681 gtggctatcg ccggggtgcc gttcaaatgc ccagccgcac cgttcgaagc ggcgtttctg 397741 atcgccgccc aactcggtga ccgctacgcc accggaaccg tacagatcga cacgttcacg 397801 cctgacccgc tgccgatgcc cgttgcaggt cccgaggtcg gcgaggcttt ggtctcgatg 397861 ctcaaggatc acggtgtcgg cttccatcct cgcaaggccc tagctcgcgt cgatgaggcc 397921 gcaaggacga tgcacttcgg tgacggcacg tccgaaccgt tcgatctgct tgccgtggtc 397981 cccccgcacg tgccctccgc cgcggcgcgg tcagcgggtc tcagcgaatc cgggtggata 398041 cccgtggacc cgcgcaccct gtccactagc gccgacaacg tgtgggccat cggcgatgcg 398101 accgtgctga cgctgccgaa tggcaaaccg ctgcccaagg ctgccgtgtt cgccgaagcc 398161 caggccgcag ttgtcgccca cggcgtcgcc cgccatctcg gttacgacgt agctgagcgc 398221 cacttcaccg gcacgggcgc ctgctacgtc gagaccggtg atcaccaggc agccaagggc 398281 gacggcgatt tcttcgctcc gtcggcgccc tcggtgacgc tgtacccgcc gtcgcgggag 398341 tttcacgagg agaaggtcgc acaagaactg gcctggctga cccgctggaa gacgtgacac 398401 gccggtgggc gcggctccct accacggctc ctaccggcgc ccctgaaaca ccagactgtg 398461 gataaccgct gttgcgcaag cctgctagta gcctcgccaa ggtggactac tcgtcggcat 398521 acctggagca gacccacgcc ttcggcgaac tgatccgcaa cgtcgatcaa tccaccccgg 398581 tgccgacctg cccgggctgg agcctgggtc aactattccg ccacgtcggg cgcggggacc 398641 gctgggcggc gcagattgtc cgcgatcgac tcgaccattt cctcgatcca cgcagcgtcg 398701 agggcggtaa gccaccgccg gaccccgacg acgcgatctc ctggctgtac ggcggggcgc 398761 ggctgctggt cgacgctgtg gaacaaacgg gtgtggaaac gccggtgtgg accttcctcg 398821 gaccgcgccc ggcgggctgg tgggttcggc ggcggctaca cgaggtcgca gtgcaccgcg 398881 ccgacgtggc gatcaccgtc gggggcgaat tcacactgga accgaacgtg gcagccgacg 398941 ggatcagcga attcctggag cgcatagcgg tccaggccgg cagcggcggc acgccattac 399001 cgctcgaaga cgacgacacc ttacatctgc acgccaccga tccggggctt cttgaagccg 399061 gcggatggac ggttcgtcgc gacgagcgcg gcgtcacctg gtcgcatcgg cacggaaagg 399121 gtgccgtggc actgcgtggc ggcgccaccg agctgctgct ggcgatggtg cgccgactct 399181 cggttgccga caccggcatc gagctgttgg gggatgccgg ggtatggcaa aaatggctgg 399241 atcgcacgcc gctgtagccg ccgcacacgg taactttcag accatgacca catcggagat 399301 cgctaccgtg ctggcctggc acgacgccct caatgccgcc gacattgaga ccctcgtggc 399361 gttgtctact gacgacatcg acatcggtga cgcgcacggg gctgtacagg gccacgatgc 399421 gctgcgcggg tgggccagct cgctcaccac aaccgcagaa cttggccgca tgtacgtgca 399481 ccacggagtc gtggtcgtcg aacaaaagat caccagcggc gaagatccgg gcatcgccag 399541 gaccggcgcc gcggcgttcc gtgtggtcca agaccacgtc gcatcggttt tccggcacga 399601 agacttggcg tcggcgctgg cggccaccga actcaccgag gacgatttgg tcgattgagg 399661 tcggcgaacg gcagttagga gccagttatg cgcgggatca tcttggccgg cggttcgggc 399721 acccggctgt acccgatcac catggggatc agcaagcagc tgctgccggt ctacgacaaa 399781 ccgatgatct actacccgct caccacgctg atgatggctg ggatccgaga cattcagttg 399841 atcaccaccc cgcatgacgc gcccggcttt catcgactcc tgggcgacgg cgcgcacttg 399901 ggagtgaaca tcagctacgc cacccaggat cagcctgacg gtctggcgca ggcgttcgtc 399961 attggcgcca accacatcgg cgccgattcg gtggcattgg tgttggggga caacatcttc 400021 tacggcccag gtctggggac cagcctgaag cgcttccaat ccatcagtgg tggagcaatt 400081 ttcgcctatt gggtagccaa cccgtcggcc tatggtgtcg ttgagttcgg cgccgagggc 400141 atggcgctgt ctctggagga gaagccggtg accccgaagt cgaattacgc ggtgccgggc 400201 ctgtatttct atgacaacga tgtgatcgaa atcgccaggg gtttaaagaa atcagcgcgc 400261 ggggagtacg agatcaccga ggtcaaccag gtctacctca atcagggtag gttggcggtc 400321 gaggtgctgg cccgcgggac agcgtggctg gacaccggga cattcgactc gctgctggac 400381 gccgccgatt tcgtccggac cctggagcgt cggcagggcc tgaaggtcag catccccgaa 400441 gaagtggcgt ggcgcatggg ctggatcgac gacgagcagc tggtgcagcg agcccgtgct 400501 ctggtcaagt ccggatatgg taactacctg ctggagttgt tggagcgcaa ctgatttcgg 400561 cgggttattg tcggtgatta tggaaccccc tggtagcccg tcctggatga gcagcccacc 400621 ggaccagcca ttgccgaaca gcccgccgtt ggcgccgttg gcgatcagcg ggccccaaca 400681 gcgcctgggt cggcgcatcg gcggtggtct cggcgctggc acacgagccc gcacccacgt 400741 tcaggttctg tgcaaactgg ccatggaacg ccgccgcctg attgttgagg gagtgatgcc 400801 gccgaccgtg tgcggaaatc agtgccgcga cggccgccga cacctcgtct tcggccgccg 400861 ccagcacgcg ggtcttgtgg cgcttcggcg ggaagttgct gatccgagat gctggcggct 400921 ggtttccttg tggtggcctg ggccgggtgg tggcgcacag tgggcccggt ggggtcgcgg 400981 ccggccgggc aagaacgctg cgccctggcc gggccatgag cggagccggc aagctcgacg 401041 gcgcccggca tgcgcggtgc aagaacccca tggaccgcac cgagtgccgt gctcgccctc 401101 ggcggctacc gagccggtgt ctccctagtc atccacgtta tccacagcgc cttgggttac 401161 cgggcgccgg tcgggtagcg atggtagtat cgaaagtatg ttcgatcagg tgcgggggcg 401221 catgccttca ccggaggcga tcgctcattt tgatgagcgg tttgaatgcc atgctccgcg 401281 gaccacgagg gtgtcggcgg cgttcatcga tcggatctgc tcggcgactc gggccgaaaa 401341 ccgggccgct gcggcgcagt tggtggcgtt gggggagttg ttcgcctatc ggtggtcgcg 401401 ttgcgggggc cgcgaggagt gggtgatgga caccatggcg gcggtggccg ccgaggtggc 401461 ggcggcgttg cggatcagtc agggtctggc ggccagccgg ttgcggtatg cgcgggcgat 401521 gcgtgagcgg ctgcctaaga cggctgaggt gtttagcgcc ggcgacatcg gctatctgat 401581 gtttgccacg attgtgtatc gcaccgactt gatcgttgac cctgatgttt tggcggcggt 401641 ggatgcgcag ttggccgcca atgtggcgcg ttggccctcg atgaccaagg cccgcctggc 401701 tgggcaggtc gataagatcg tggcgcgtgc cgatgccgat gcggtgcggc ggcgcaagga 401761 gtatcaggcc cagcgccagt tctgggtcgg ggaaagccaa gacggtgtgt gccagatcgg 401821 tggcagcctg ttggccgtcg acgcacacgc cctcgatgcg cggttgagcg cgttggcggg 401881 caccgtgtgt gagcacgatc cgcgcagccg tgagcagcgc cgcgcggacg cgttgggggc 401941 gttggcgggc ggggccgatc ggctgggctg tggctgtggg cgcgctgatt gtgcggccgg 402001 gaagcggcct gcggccccgc cggtggtgat tcacctgatc gccgaggcgg ccacgatcaa 402061 tggcacgggc tcggcgccgg catcgcagat gaacgccgac gggctgatca ccgccgaact 402121 ggtggccgag ctggccaaga cggccacgct ggtgccgctg gttcatcccg gcgatgcgcc 402181 gcccgagccg gggtatgcgc cgtcgaaagc gctcgccgat ttcgttcgct gccgggatct 402241 gacgtgtcgc tggcccggct gtgatgagcc cgccaccaat tgcgacctgg atcatacgat 402301 cccgtatgcc gctggtgggc ccacccatgc gtcgaacctg aaatgttact gccgtaccca 402361 tcacctggtg aaaacgtttt ggggatggcg tgatcaacag ctacccgacg gcaccctgat 402421 tttgacctcc ccgtccgggc atacctatgt cagcaccccg ggcagtgcgc tgctgttccc 402481 cagcttgtgc cacttcagcg gcggcatccc ggcaccggaa gccgacccac cctacgacca 402541 ttgcgaccag cgcacagcga tgatgcccaa acgccggcgc acccgcgccc aagaccgggc 402601 ctatcgcatc gccaccgaac gtcgacaaaa ccacgccgcc cgccagcgcg cccaggtgct 402661 cacccagacc gccgcggcca ccgacaccca cggcccacca ccggatcaca acgacgaccc 402721 accgccgttt tgatgtggaa cggcctgtca agtggccgat tagtgcttgt tgcctcgggg 402781 ttgtttgggg tttctggctt tgatccgatg acgggaccct gcggcgctcc ctcgacgccg 402841 ccgcgccggc ttaagggcgc ccggccgcgc tgccaccccc agggcatcac gtgcgtcggc 402901 tgctattgcc ggtaactgac caggaagtta cccagccgct cgatggcggc cgccagatcg 402961 cgggaccatg gcagcgtcac caggcgcaga tgatccggtg cgggccagtt gaacccggtg 403021 ccctgggtga ccaggatctt ctccgacagc agcagatcga gcacgagttg ctcgtcgtcg 403081 tcgatgtcgt agacctcggg gtctagccgg ggaaacgcat acagcgcgcc cgccggtttg 403141 acgcacgaca cccccgggat ctcgttgagc ttggtccagg cgatgtcgcg ctgctcgagc 403201 agccggccgc cgggcagcac caggtcctcg atgctctgat ggccgcccag tgcaacctga 403261 atggcatgct gggccgggac atttgggcac aaccgcatat tggccagcag gccgatgccc 403321 tcgatgaagc tgctggcgtg ctccttgggt ccggtgatcg ccagccagcc ggcccggtat 403381 ccggcgacgc ggtaggcctt cgacagccca ttgaaggtca ggcacaacat atccggggcg 403441 atcgatgcca ggctgatgtg cttggcgtcg tcgtagagga ttttgtcgta gatttcgtcc 403501 gccaacagca gcagttgatg cttgcgggcc agatcgacca tctgggtgag gatttcgcag 403561 ctgtacaccg cgccggttgg gttgttgggg ttgatcacga ccagcgcctt ggtgcgctcg 403621 gtgatcttgg attccaggtc ggcgatatcg ggctgccagc cttgggtctc atcgcacagg 403681 tagtggaccg gagtgccgcc agccagcgag gtcgacgccg tccacagcgg gtagtccggt 403741 gatggaatca gcacctgatc gccgttgtcc agcagggctt gcagcgtcat cgtgatcagc 403801 tcggagaccc cgttacccag gtagacgtcg tccacgtcga atcggggaaa tccgggcacc 403861 agctcgtagc gcgtgaccac cgcacgccgg gccgacagga tgccctgcga gtcggagtac 403921 ccctgcgcgt agggcagcgc ctggatgata tcgcgcatga tcacgtcggg tgcttcgaag 403981 ccgaacggcg ccgggttgcc gatgttgagt ttgaggatgc ggtgaccttc ggcttcgagc 404041 cgcgcggcgt gctggtgcac cgggccgcgg atctcgtaca ggacgtcctg cagcttggcc 404101 gactgagcga aggcgcgctg ccgctgatgg ctggcggtgt gccagggcag ctggtgggtt 404161 gtcacgtcca caatggtgcc atcgttgtcc actggaattt gctgtcaggt gccaaatcgt 404221 gatcagcgtt tgcccggtgg acgggccccg cgcgcaatgc ccagcccttt caccggcgcg 404281 gccggtgcag ccggatcacc gtcggtttgc ggctttggcg gtgcggcggg ctccggttgc 404341 ggctttgctt cgggttgcgg ctgggctgct ggctcggcga gcccaggagc cgggggaggt 404401 gtctttttcg ccccgggccg cttggcgccg gcggcaatac ccagcccttt aacgggcgcg 404461 gcgggcgcag ccggggccgc gggcgttggg gccgctttct tggcgccagg ccgcttggcg 404521 ccggcggcca tgccgaggcc tttcacgggt gctgcgggag cggccggcgc tggcgcctgt 404581 ggtgcctcgg cgggtgcctc cacgggcgtc accggtgcgg cagccttagg agcggctttc 404641 ggggcgcgct cctgagcctg tttggcggcc gtacccttgg ccggcagctg cgccttgtcg 404701 tggtctagtg atccgagtag cacctgggcc acgtcgagca cctcgacgcc gctgcggccg 404761 gcttcttcct gccgatcgtt cacaccgtcg gtgaccatca cccggcagaa tgggcacgcg 404821 gtggcgattg cggtggcatc ggtggccagc gcctcatcga cgcgttcatg gttgatccgc 404881 ttgccgatgt gttcttccat ccacatgcgg gcgccgcctg cgccgcaaca aaagctgcgg 404941 tcggcatggc gcggcatctc ggtcaggctg gcccccgcgg caccgatcag ctcccgtggt 405001 gcctcgtaga ccttgttgtg ccgacccagg tagcacgggt cgtggtaggt gatgtcctga 405061 gaaaccggag tgacagggac cagcctcttg tcgcgcacca accgattgag cagctgggtg 405121 tggtgcagca cggtgtagtt ggcgcccagc tgccgatatt ccttgccgat ggtgttgaag 405181 cagtgcgggc aggtgacaac gatcttgcgg tcgacggtct ccacaccctc gaacaaaccg 405241 tccagggtct cgacggcctg ttgtgccagc tgctggaaga ggaactcgtt gccggagcgg 405301 cgcgccgagt cgccgttgca ggtttcccca gcgcccagca ccaagtattt caccccggcg 405361 acggcgagca gctcggcgac ggccttggtg gtcttcttgg ccttgtcgtc gtaggcgccc 405421 gcacaaccca cccagaacag gtactcgtag ccgtcgaagc tgtcgacgtc ctggccgtac 405481 acggggacgt cgaagtcaac ctcgtcgatc cagttggtgc gatctgaggc gttctgaccc 405541 cacgggttgc ccttggtctc caggttcttg aacagcaccg acagctcgga ggggaactcc 405601 gactccatca tcacctggta gcggcgcata tcgacgatgt gatcgacatg ttcgatatcc 405661 accgggcact gctcgacgca ggcaccacag gtcacacatg accacaagac gtcgggatcg 405721 ataacgccac cctgttcctc ggtgccgacc agcgggcgag tcgcctgctc cggtccatgc 405781 ccgggcactc gaccgaaccc cgattccggc acgtgatgat gctcttggtg accggcctcg 405841 ccgcccgcgc tggcatcctt ttggcccagg atgtagggcg ccttggccat ccaatggtcg 405901 cgcaggtcca tgatgaccag cttgggcgac aacggtttgc cggtgttcca ggccgggcat 405961 tgcgactgac agcgtccgca ctcggtgcag gtagcgaagt cgagcatccc cttccaggtg 406021 aagtcttcga tcttgccgcg gccgaatacg gcatcctcgc tgggattctc gaagtcgatt 406081 ggtttgccat cggcttcgag cggcaacagc gggcccagcc catccggcag ccgtttgaac 406141 gtgacgttaa tgggcgccag gaagatgtgc aggtgcttgg aatgcaaaac gaggatcagg 406201 aacgcaagca tgaccccgat gtgcagcaac agcgctgtgg tttcgatgat ttcgttggcg 406261 ggctgcccga gggggcgaag aatcgcgccg aatagctgcg ataggaaggc cccgttgccg 406321 tagggcaggg tgccgttgtt gaccgctgag ccgcggacca acacgtaggt ccagatgacg 406381 ttgaagatca tcaacaggac gagccacgcg ccgccgttgt gcgatccgta gaaccgggag 406441 ctccgaccga tctcgcgggg gttgcgcagg atacggatga tggcgaaggt cgtgataccg 406501 agaaagacgg cggtggcaaa gaagtcctgc aggaagccca acgcgtccca ccggccgatg 406561 accgggatgt ggaatctctc ctcgaacagc aggccgtaag cctcgatata gacggtgagc 406621 aggatgaaga agccccacat ggtgaaaaag tgcgccaggc ccgggatcga ccatttcaac 406681 agtcggcgct gccctagaac ctcggagatc tgggtccaga tgcgggtgcc gaggttgtcg 406741 gttcgcccgc tggccggctg cccggacatg accagcttgt aaagccacca gactcgccgc 406801 agagcgaaca cccccacaac cgcggtcatg ctcatgccca gtatcagcct gatgagcgtt 406861 tgcgtggtca cggaaggtca cccgaattcg tagcactcaa tggaacccct gcataacctg 406921 ctcatcctga catctgtgcg actttcgccg cgagaaaggc tgtcctaacc taccggtcgt 406981 caacgcctct catctgcggt taagctctcc ggggccagca tggcccgcag catcgacaac 407041 atctccgacc gggagccagc gcccagccgc tggcgtatcc gggcgacgtg gtgctcgacc 407101 gtcttcgctg agatgaacag ccgggcgcca atgtcgcgat agggcatgcc cagtagcagt 407161 agctcggcga cttcgcgttc gcgatcggat agcggcgagc ccgccggtgg ctggcgcggt 407221 gccggcgggg taccggaagc tggttccgtg tcgccggccc cgctgggggg ctcgccgaaa 407281 tcgttgccca gcttaagatc ccgtgccaac tgcagcatgg caccggacac ccgtgcgtcg 407341 gatgtttgca atgcggcctg acctgccagt cgggtcgcat ccgacgtcag gccgacgtgt 407401 gacagggacc gcgccgccgc ggtgacctcg tcggcgtcga cgttttcggc caggacccgc 407461 agccaggtgc gaccggcatc cgacagggcc tgcgcgagcg tgctgtgggc gaccattgca 407521 ccgagggcct gtccgtgcgg tgccaccgat tccggcgaat tggcgaggat tccagcgtgc 407581 actccagccc aatgcagtga gttcgaccac agggcggggt tgcccagcga atccagcagc 407641 gtgagcgcct gatccagggt gtgttgtagc tggtcaacct ggcgcattcg ggcggccgcg 407701 acccacagtt caccaagtgg cagcagggcg aacagatcga gcgaatactc ggccagcgct 407761 tccatcgccg cataccaatg ctgttgcagc gcaccgatat cgccggtgcg acgcgagatc 407821 gcggtttgca gtgccgcggc ccacaacgcg tcgcgccggt gcaggtgcgt gccggcgctg 407881 gccgccgcga cgtccgcgct tgccgacggc aattgcccct cttgcatttt gatccagccg 407941 gaaagcagca ggtgccgacg ctggaacagc gggtcggcgc cggctcgcac ggcacgcccg 408001 atcacactgc gggcgcggac cggatcgccg gcgtgtatcg cggccaaggt aaccagcgct 408061 gccgggctgt ccggaatgac ttggctgagc gattgttcgg tggcaatggc ttggcccagt 408121 tttgccatcg cgaccggata cggctgatcc atggtcagca gcagcccctc ggcgaggttg 408181 cgcgcgcaac gcgctgccat cgtcggtgga ccggcatcct tgagtcgcag ggtggcacgc 408241 gccgtcgcca ggtcgccgtt cgcggcgaac acgatcgtgg cggccgagct caccatcgtg 408301 tccgggtgtg ggcccagcca gccgaacaac tcggctgcgt gtcccgtgtt gccgtcgtgg 408361 accgcgacgc tggccgcaac ccgcaccgcg gcagcgcgtt cggtggcatc cggggagctg 408421 agcagatcgt cggctagtgt tgccgcggcc gtacagtcgc cggtgcgggc cagtgcgtcg 408481 gccaggcgga ccgtcaatcc tttggcgccg gcatggaccg cggcgcggta cagccgtgcg 408541 caacggaccg aagcgtcgcg ggtgtccgcg gcgtaccgcg tgaggatgtc cgccagccgc 408601 tcgtcgcgca gcccgtgttc ggccagtcgc agcgctagct ccgccgacac cggcgagata 408661 tcgagttgtg agcgtaacag cgaggtttcg acctcgtggt ggtgtgcatt gccgacgatc 408721 tgagcgatcg catcatggac tgactgcaga aacgccgcgg tgtgtgacga ctcgatcagt 408781 ccgctggcgt gcgcacgatc gaccaatccg cgggcatccg ttaccgaaat cccaagtgca 408841 gcagctacat cgctgacccc tagctcgtgg gttagcgaca tcatgagcag ggtgtccaga 408901 gtgggttcgt cgaggcggcg cagccgctcg atgagggcca ccttggccgc ttgcgcggga 408961 gcctgtgccc tggcggaaac cgcatgaatg aggaacggca gtcccgcggt gcaatctcgc 409021 aggtgctcgg caaccggaag tggaccgagc gagattcgtg gccggtcccg ttcgagcgcc 409081 atcgtcaggg ctcgtagtgc ccggtggtgc tcgcgggctt ccgcggccgc caccaccgtc 409141 agccgtgaat cggccacgcg ctcggtgagc cggagcaatt cggtatcggt gagcaactgg 409201 gcgtcgtcga tgacgagcgc ggtctccggc ggttcgccgt ctggcggcgg gcatgccagc 409261 acggtgagtc ccgagcggcg cagtgtgtcg cgggcggcag ccagaacggt ggtcttgccc 409321 gttccgatgc ccccggtgat caggaccttg accggtaccg tcggggcatt cgcgagttcc 409381 aggagggcac ggcgtgctgc cggcgggacc tcggtgaggg aatcggtcac cgatgcgtcg 409441 tatgcttggc cacggttctt gcaccccctg tgctgcacgg ctggtcggcg gcggctccct 409501 caccatagcc ccagcccgtc ccgcagcccc gcatttcccc taatgcggcc atcccctaac 409561 ggcgccccgg ggccggcggg ttccgcaccg aacacggacg cggcctcaac cgatagcatc 409621 gtgctaacac gggactaacg ggggtggggc aaggaggcgg gtagtggcaa actcgttgct 409681 cgactttgtc atctcgcttg tgcgcgaccc ggaagcggcc gcacgttacg ccgcgaaccc 409741 cgagcggtcg atcgccgaag ctcaccttac cgacgtgacc agagcggatg tgaacagcct 409801 gatcccggtg gtgtcggatt cgttgtcgat gtccgaaccc atcggagccg ctggcggggc 409861 acacgctggc gatcgtggca acgtttgggc gagcggcgcg gccacggctg cgcttgatgc 409921 gttcgcccca cacgccgatg cgggtgttgt ccaacagcac ggtgcggtcg gcagcgttct 409981 caaccagccg accccacccg gaccgggcgt gacacccacc gatccgcgcc ccttccgagc 410041 cggtccacat gagacgtcgg cgctgctcac gagcgctgaa atacccgaca cgaccagcga 410101 ggacggggga ttgccgacag accatccggc tgtctggaac cacccggtcg ttgacccaca 410161 taccgtcgag cccgatcatc acggctacga catccacgga taagttccgg accggcgtag 410221 gggtgcccca tttcccctaa tcccctaacg cggcggccag gccgatcccg ataggtgttt 410281 ggccggcttg cggatcagac cccgatttcg gggtgaggcg gaatccatag cgtcgatggc 410341 acagcgccgg tcacgccggc gaacagcttc ttcgattgaa gggaaatgaa gatgacctcg 410401 cttatcgatt acatcctgag cctgttccgc agcgaagacg ccgcccggtc gttcgttgcc 410461 gctccgggac gggccatgac cagtgccggg ctgatcgata tcgcgccgca ccaaatctca 410521 tcggtggcgg ccaatgtggt gccgggtctg aatctgggtg ccggcgaccc catgagcgga 410581 ttgcggcagg ccgtcgccgc tcggcatggc tttgcgcagg acgtcgccaa tgtcggcttc 410641 gccggtgacg cgggcgcggg ggtggcaagc gtcatcacga ccgatgtcgg tgcgggcctg 410701 gctagcggac tgggtgctgg gttcctgggt cagggtggcc tggctctcgc cgcgtcaagc 410761 ggtggtttcg gcggtcaggt cggcttggct gcccaggtcg gtctgggttt tactgccgtg 410821 attgaggccg aggtcggcgc tcaggttggt gctgggttag gtattgggac gggtctgggt 410881 gctcaggccg gtatgggctt tggcggcggg gttggcctgg gtctgggtgg tcaggccggc 410941 ggtgtgatcg gtgggagcgc ggccggggct atcggtgccg gcgtcggcgg tcgcctaggc 411001 ggcaatggcc agatcggagt tgccggccag ggtgccgttg gcgctggtgt cggcgctggt 411061 gtcggcggcc aggcgggcat cgctagccag atcggtgtct cagccggtgg tgggctcggc 411121 ggcgtcggca atgtcagcgg cctgaccggg gtcagcagca acgcagtgtt ggcttccaac 411181 gcaagcggcc aggcggggtt gatcgccagt gaaggcgctg ccttgaacgg cgctgctatg 411241 cctcatctgt cgggcccgtt agccggtgtc ggtgtgggtg gtcaggccgg cgccgctggc 411301 ggcgccgggt tgggcttcgg agcggtcggg cacccgactc ctcagccggc ggccctgggc 411361 gcggctggcg tggtggccaa gaccgaggcg gctgctggag tggttggcgg ggtcggcggg 411421 gcaaccgcgg ccggggtcgg cggggcacac ggcgacatcc tgggccacga gggagccgca 411481 ctgggcagtg tcgacacggt caacgccggt gtcacgcccg tcgagcatgg cttggtcctg 411541 cccagtggcc ccctgatcca cggcggtacc ggcggctatg gcggcatgaa cccgccagtg 411601 accgatgcgc cggcaccgca agttccggcg cgggcccagc cgatgaccac ggcggccgag 411661 cacacgccgg cggttaccca accgcagcac acgccggtcg agccgccggt ccacgataag 411721 ccgccgagcc attcggtgtt tgacgtcggt cacgagccgc cggtgacgca cacgccgccg 411781 gcgcccatcg aactgccgtc gtacggcctt ttcggactac ccgggttctg attcgcgagc 411841 cgatttcacg aaccggtggg gacgttcatg gtccccgccg gtttgtgcgc ataccgtgat 411901 ctgaggcgta aacgagcgag aaagtggggc gacacggtga cccagcccga tgacccacgt 411961 cgggtcggtg tgatcgtcga actgatcgat cacactatcg ccatcgccaa actgaacgag 412021 cgtggtgatc tagtacagcg gttgacgcgg gctcgccagc ggatcaccga cccgcaggtc 412081 cgtgtggtga tcgccgggct gctcaaacag ggcaagagtc aattgctcag ttcgttgctc 412141 aacctgcccg cggcgcgagt aggcgatgac gaggccaccg tggtgatcac cgtcgtaagc 412201 tacagcgccc aaccgtcggc ccggcttgtg ctggccgccg ggcccgacgg gacaaccgca 412261 gcggttgaca ttcccgtcga tgacatcagc accgatgtgc gtcgggctcc gcacgccggt 412321 ggccgcgagg tgttgcgggt cgaggtcggc gcgcccagcc cgctgctgcg gggcgggctg 412381 gcgtttatcg atactccggg tgtgggcggc ctcggacagc cccacctgtc ggcgacgctg 412441 gggctgctac ccgaggccga tgccgtcttg gtggtcagcg acaccagcca ggaattcacc 412501 gaacccgaga tgtggttcgt gcggcaggcc caccagatct gtccggtcgg ggcggtcgtg 412561 gccaccaaga ccgacctgta tccgcgctgg cgggagatcg tcaatgccaa tgcagcacat 412621 ctgcagcggg cccgggttcc gatgccgatc atcgcagtct catcactgtt gcgcagccac 412681 gcggtcacgc ttaacgacaa agagctcaac gaagagtcca actttccggc gatcgtcaag 412741 tttctcagcg agcaggtgct ttcccgcgcg acggagcgag tgcgtgctgg ggtactcggc 412801 gaaatacgtt cggcaacaga gcaattggcg gtgtctctag gttccgaact atcggtggtc 412861 aacgacccga acctccgtga ccgacttgct tcggatttgg agcggcgcaa acgggaagcc 412921 cagcaggcgg tgcaacagac agcgctgtgg cagcaggtgc tgggcgacgg gttcaacgac 412981 ctgactgctg acgtggacca cgacctacga acccgcttcc gcaccgtcac cgaagacgcc 413041 gagcgccaga tcgactcctg tgacccgact gcgcattggg ccgagattgg caacgacgtc 413101 gagaatgcga tcgccacagc ggtcggcgac aacttcgtgt gggcatacca gcgttccgaa 413161 gcgttggccg acgacgtcgc tcgctccttt gccgacgcgg ggttggactc ggtcctgtca 413221 gcagagctga gcccccacgt catgggcacc gacttcggcc ggctcaaagc gctgggccgg 413281 atggaatcga aaccgctgcg ccggggccag aaaatgatta tcggcatgcg gggttcctat 413341 ggcggcgtgg tcatgattgg catgctgtcg tcggtggtcg gacttgggtt gttcaacccg 413401 ctatcggtgg gggccgggtt gatcctcggc cggatggcat ataaagagga caaacaaaac 413461 cggttgctgc gggtgcgcag cgaggccaag gccaatgtgc ggcgcttcgt cgacgacatt 413521 tcgttcgtcg tcagcaaaca atcacgggat cggctcaaga tgatccagcg tctgctgcgc 413581 gaccactacc gcgagatcgc cgaagagatc acccggtcgc tcaccgagtc cctgcaggcg 413641 accatcgcgg cggcgcaggt ggcggaaacc gagcgggaca atcgaattcg ggaacttcag 413701 cggcaattgg gtatcctgag ccaggtcaac gacaaccttg ccggcttgga gccaaccttg 413761 acgccccggg cgagcttggg acgagcgtga gcaccagcga ccgggtccgc gcgattctgc 413821 acgcaaccat ccaggcctac cggggtgcgc cggcctatcg tcagcgtggc gacgtttttt 413881 gccagctgga ccgcatcggt gcgcgcctag ccgaaccgct gcgcatcgcg ttggctggca 413941 cactcaaggc cggaaaatcc actctcgtca acgcccttgt cggcgacgac atcgctccga 414001 ccgatgccac cgaggccacc cggattgtga cctggttccg gcacggtccg acaccgcggg 414061 tcaccgccaa ccatcgcggc ggtcgacgcg ccaacgtgcc gatcacccgt cggggcgggc 414121 tgagtttcga cctgcgcagg atcaacccgg ccgagctgat cgacctggaa gtcgagtggc 414181 cagccgagga actcatcgac gccaccattg ttgacacccc gggaacgtcg tcgttggcat 414241 gcgatgcctc cgagcgcacg ttgcggctgc tggtccccgc cgacggggtg cctcgggtgg 414301 atgcggtggt gttcctgttg cgcaccctga acgccgctga cgtcgcgctg ctcaaacaga 414361 tcggtgggct ggtcggcggg tcggtgggag ccctgggcat catcggggtg gcgtctcgcg 414421 cggatgagat cggcgcgggc cgcatcgacg cgatgctctc ggccaacgac gtggccaagc 414481 ggttcacccg cgaactgaac cagatgggca tttgccaggc ggtggtgccg gtatccggac 414541 ttcttgcgct gaccgcgcgc acactgcgcc agaccgagtt catcgcgctg cgcaagctgg 414601 ccggtgccga gcgcaccgag ctcaataggg ccctgctgag cgtggaccgt tttgtgcgcc 414661 gggacagtcc gctaccggtg gacgcgggca tccgtgcgca attgctcgag cggttcggca 414721 tgttcggcat ccggatgtcg attgccgtgc tggcggccgg cgtgaccgat tcgaccgggc 414781 tggccgccga actgctggag cgcagcgggc tggtggcgct gcgcaatgtg atagaccagc 414841 agttcgcgca gcgctccgac atgcttaagg cgcataccgc cttggtctcc ttgcgccgat 414901 tcgtgcagac gcatccggtg ccggcgaccc cgtacgtcat tgccgacatc gacccgttgc 414961 tagccgacac ccacgccttc gaagaactcc gaatgctaag ccttttgcct tcgcgggcaa 415021 cgacattgaa cgacgacgaa atcgcgtcgc tgcgccgcat catcggcggg tcgggcacca 415081 gtgccgccgc tcggctgggc ctggatcccg cgaattctcg cgaggccccg cgcgccgcgc 415141 tggccgcagc gcaacactgg cgtcgccgtg cggcgcatcc actcaacgat ccgttcacta 415201 ccagggcctg tcgcgcggcg gtgcgcagcg ccgaggcgat ggtggcggag ttctctgctc 415261 gccgctgacg cgtcaggccc tcgggtgtca cagtggtggg cgtgactggt ggcgccaacg 415321 caacggtgat cagccaccgg gtggaacatg ttttcgagcc caaggggcag cgacggcagc 415381 tcggggcaca agggtcataa gggcatgcgc tcagaatgtg tcgaccttct cgatgctgac 415441 gaacatgcca tggcccgtgc ggttgttcgt gaagcgggtg ccatcggtgg tggcgtcgat 415501 ggtccagccc tgcgcttcat aggttcggta gtcgatgctg acggtgggga tggcgcccag 415561 attgcccaac acccactgca cggagccgcc ggcggtgatg tggatgccgt tggcgtgctc 415621 accatctcgc aagggggagt tggtgaacgg tgcctcgcag ccaacgctgt ccctattgat 415681 ctggcaacgc gtcattccgg acttggtttc gatgaagacg taaccgttgg agtcaggcgg 415741 gagcggaatg gcgccggccg gcgcggtcgg gctgaccggt gtcgtcggta gcgtcggggc 415801 ggtggtcccc ggcggcgcgg tagttggcct aggcgtcgga aaagtcggct cggtaggtcc 415861 cgaccctggc gaagcgaccg gcctgccgtc gatggtggtg ttgcagccgg caactagcgc 415921 ggtagccgcc agcagggccg ccataccccg tgcaatgagc gatagccgca cgcgctactc 415981 cccggaaatc tgagatatcg ggagtaggtt acgcgcgagg tcccgcaatt tactgcagtg 416041 acgcgcttct gcaacggccc gcataatcgg agaatggcgt tgttgccgtc gacggtcgtg 416101 ggagtcttgc tggccgcggg tgcgggccgg tggtatggca agccgaaagt gctggttgac 416161 gggtggctgg acaccgcggt cggggcgttg cgcgacggtg gttgtaacga cgttttttgg 416221 tgctgggtgc tgtcgaggtg tcggcaccgg ccggtgtcac cgcgattacc gcgccggact 416281 ggcagcaggg gctgagcgcg tcagtgcgtg cgggtctggc ccaggccgac cgcgagcacg 416341 ccgactacgc cgtcctgcat gtgatcgaca cgcccgatgt caatgccaag gtggtggctc 416401 gagtccttgg ccgtgccttg gtatcccgca gcggtctggc agggcgcggc cgcatacctg 416461 cgcacagtgc ccgacgtcga ggctgttgag tgcggcgact tggctagtgg tcgcgatgtc 416521 gacgtggacc tcagattgga tccgccgaat ggacgaccgc gacactcttg gtgtggtcga 416581 tggcgtggtg cgcgacggcc gtcacacgat tgcggaccaa ataccagcca ccgatgaggg 416641 ccggtacacc gatcaccgtt gcggcgatca tccagggacc gtgttgttcg tcgaagtaca 416701 tcaggatcag cacgccggcc aggaaagcca gcgtcagata gccgctgaac ggtgaaagcg 416761 gcatccggaa tttcggccgc tgcagctgcc cggcgttcgc catccggtgg agccgcagct 416821 ggcaagccac gatcgtcgcc caggccgcga tgactcccgt cgcggcgatg tggagcacga 416881 tctcgaaggc ttggctcggt ttgatggcgt tgagaatgat gcccaacagg ccgataccgg 416941 cggtgagcag gatcccgccg tacggcacgc cggtcttcga cattggtgcg gtgaacctcg 417001 ggccgctgcc gttgatcgcc attgatcgca ggatccgtcc ggtggaatac agtccggcgt 417061 tgaggctcga cagcgcggcg gtgagcacga cgaggttcat cacgctgccc gccgcgtcga 417121 taccgatctt ggaaaagaag gtcacgaacg ggctgacatg ttctttgtag gcggtatagg 417181 gcagcagcag ggccagcagg acagtcgacc cgacgtagaa gcacgcgatg cgcaacacca 417241 cagagttgat cgcgcgcggc atgatctttg ccggttcggc tgtttccccg gccgcgatgc 417301 ccaccagttc gattgcggcg taggcgaata ccacccccga ggtgaccagc actatgggca 417361 gcagaccggt tggcacgatg cctccatggc tgctccacag ggagaccccg gtctcctggc 417421 cgtcgatctt gtagcgccca gcgagaaaga ccgtaccgac gatcagaaac gtaaccagcg 417481 cgatcacctt gatcaatgag gcccagaact ccagctcgcc gaagagcctg accgagatca 417541 ggttcatcga caacacgacc agcagcgcga tcaacgccag cgtccactgg gggatgggtt 417601 gaaacgcccg ccagtaatgg caatagtgcg cgatcgcggt ggtatcgacg atccccgtca 417661 tcgcccagtt caggaagtac atccacccgg cgacgaaggc caccttttcc ccgtagaact 417721 cgcgggcgta ggacacgaat gaccccgagg acggacggtg cagcaccagt tcgccgagcg 417781 cgcgcaggat caggaacacg aagatgccgc agatcccata gaccaggaac aaaccgggcc 417841 ccgccgatgc aaggcggccg ccggcgccga gaaaaaggcc ggtaccgatg gcaccaccga 417901 gagcgatcat ttgcagttgc cggctatgga ggcctttgtg atagcccgtg tcttcgcgcg 417961 tgagccgctc gtcggtgatg tctagcggtg gcattgagct ccctgggatg gtggcttctt 418021 gggacgcgcg tgagatgggg cacacccaac ggactggctg tcaggctatc ccacgcggct 418081 gcgaggtgcc gcttggcaac caatcggaaa caatcgatcg gtcaacggtg ctttgttgtc 418141 gtgccgaccg tcgcgggtgg ccgcgttgac agtcgatatt gcggtcacag gctgacgcgc 418201 ctggccagcc agacgctcgc gaagtgcggg tccgtcctgg ccgcgagggt gtcgtagccg 418261 cggtcgtagt gtgagactcc gacacctgat ccgccagcgc agagatgtga gatcaacgga 418321 cggaaggcga cggtgcccgg cgcgcgcgag ttgacgctgc gcgtcgagcg cggggctcta 418381 tttcggcgtc gatgggcagc atcggcagcg tcatcagctc gcgcagcaat tcgtcgtgat 418441 ccgcggcgct gcgcgctggg tacccggcct cgatgggtat catttttggt tatcgttctg 418501 gttatcatga atgttgtgac ggcccatccc aagtacccga atgaccctct tgcgctggta 418561 ttgattgaac tgcgccatcc gcggaccgag ccgccggtgc catctgctat ctccatcctg 418621 aaggaggagc tggcgcgatg gactcccata ctcgaacagg aggaggtgcg gcaggtcaac 418681 ctagaaacgg gcgaacatac cgcacactca cagaagaagc tcgttgcccg tgatcgccgc 418741 accgcgatca cgtttcgacc cgacgccatg accctcgaag tcaccgacta cccgggctgg 418801 gaggagtttc ggtccatcgt tcacgcgatg gtcacagccc gccaggacgt ggccccagtc 418861 gatggctgca tccggatcgg tctgcgctac atcaacgaga ttcgggcatc gctggcggag 418921 ccatccggct gggcgtactg ggtggcggaa agtctcctcg ggcctgggac acagcttgcc 418981 gatctcaaac tcaccaccac cgcgcaacgg cacgtcattc agtgcgaagg cccggagcca 419041 ggcgactcct tgacactgag gtacgccggt gcgcgcggcg cggtcatcca gtcaaccccg 419101 tttctccagc ggttgaaaga acctccggca gaaggagatt tcttcctcat cgatatcgac 419161 agcgcgtgga gcgacccctg caagggcatc ccagcgctcg acgcccacct ggtggacgag 419221 gtcgccgaaa ggctccacac acccatcggc ccactgttcg aatcgctgat aacttccgaa 419281 ctccgtacaa aggtgctgca acaacctggg caggagtgac catgaccatt tcgttctcta 419341 gctcgaatct ccgagacgac gccacctctg gcaacggcga ttaccgcctc gacaagctgc 419401 ccgaaaccac cccatcgacc tcggtgttcg accgcgccga tgtcacctac cgccaattca 419461 cggaactcca cgggcaagcc cgcgacacac ggcgggaggc gcacgtggtt gagctggagt 419521 ccaagaccgg cgagcgggct cggtgcgcac ccatgcatgc gcttgagcag ctcgcggact 419581 acggctttgc ctggcgggac atcgcacgcg ttgtcggagt gagcgtgccc gcaatcacca 419641 aatggcgcaa gggcgctgga gttaccggcg agaaccggct aaaaatcgcc cgtctactcg 419701 ccctcatcga catgctctcg gaccgattca tcggcgagcc cgcctcctgg ctggaaatgc 419761 cgatccaagc cggagtggga atcacccgaa tggacctcct ggagcgaggt cgatatgacc 419821 tcgtattggc gctggctagt acccacactg gggacggtac ggtcgaatac gtactgaacg 419881 agactgataa ggactggcga gagaccgttg tagacaacgc tttcgaatcc tacacagccg 419941 aggacggcgt gatctcgata agacccaagc ggtaaccgtg ccagagctgg agacgcccga 420001 cgacccagag tcgatatacc ttgcccgcct cgaggatgtc ggagaacaca gaccgacgtt 420061 cacgggcgac atctaccgac tcggcgatgg tcgcatggtg atgatcctcc agcacccatg 420121 cgcgctgcgg cacggcgttg acctccatcc gcgactgctg gtcgctcccg taagacccga 420181 ctcgcttcgt tccaactggg ctagagcccc gttcggcacg atgccgcttc cgaagctcat 420241 cgacggtcag gatcactcgg cggacttcat caatcttgaa ctcatcgatt caccaacgct 420301 tccgacctgt gagcggatcg cggtgctcag ccagtcaggc gtcaacttgg tcatgcaacg 420361 gtgggtgtac cacagcaccc ggctcgccgt gcccacgcac acctactccg acagcaccgt 420421 tggcccgttc gatgaggcag acctgatcga ggagtgggtg acggatcgcg tcgacgatgg 420481 ggccgacccg caggcggccg aacacgaatg cgcctcctgg ctcgatgaaa gaatcagcgg 420541 ccgcactcgg cgagcgctgc tcagcgaccg tcagcacgcc agttcaatac ggcgagaagc 420601 gcgttctcat cgaaagtcgg tcaagctggc ggactgagca ctgctctccg ggcttgaccg 420661 gggcctctcc cagctacgcc ccgagcgtgt gccctgccga cacgcgggaa caagacccgc 420721 acgaccagcg ttagcatgct cagtaagttg agtgcatcag gctcagctct gaattgacag 420781 cacaccgccg tcgaggcaag cttgagcggg gtgcactcat catagtgcag gaaagaagct 420841 ctacatattc aggaggattc accatggctc gtgcggtcgg gatcgacctc gggaccacca 420901 actccgtcgt ctcggttctg gaaggtggcg acccggtcgt cgtcgccaac tccgagggct 420961 ccaggaccac cccgtcaatt gtcgcgttcg cccgcaacgg tgaggtgctg gtcggccagc 421021 ccgccaagaa ccaggcggtg accaacgtcg atcgcaccgt gcgctcggtc aagcgacaca 421081 tgggcagcga ctggtccata gagattgacg gcaagaaata caccgcgccg gagatcagcg 421141 cccgcattct gatgaagctg aagcgcgacg ccgaggccta cctcggtgag gacattaccg 421201 acgcggttat cacgacgccc gcctacttca atgacgccca gcgtcaggcc accaaggacg 421261 ccggccagat cgccggcctc aacgtgctgc ggatcgtcaa cgagccgacc gcggccgcgc 421321 tggcctacgg cctcgacaag ggcgagaagg agcagcgaat cctggtcttc gacttgggtg 421381 gtggcacttt cgacgtttcc ctgctggaga tcggcgaggg tgtggttgag gtccgtgcca 421441 cttcgggtga caaccacctc ggcggcgacg actgggacca gcgggtcgtc gattggctgg 421501 tggacaagtt caagggcacc agcggcatcg atctgaccaa ggacaagatg gcgatgcagc 421561 ggctgcggga agccgccgag aaggcaaaga tcgagctgag ttcgagtcag tccacctcga 421621 tcaacctgcc ctacatcacc gtcgacgccg acaagaaccc gttgttctta gacgagcagc 421681 tgacccgcgc ggagttccaa cggatcactc aggacctgct ggaccgcact cgcaagccgt 421741 tccagtcggt gatcgctgac accggcattt cggtgtcgga gatcgatcac gttgtgctcg 421801 tgggtggttc gacccggatg cccgcggtga ccgatctggt caaggaactc accggcggca 421861 aggaacccaa caagggcgtc aaccccgatg aggttgtcgc ggtgggagcc gctctgcagg 421921 ccggcgtcct caagggcgag gtgaaagacg ttctgctgct tgatgttacc ccgctgagcc 421981 tgggtatcga gaccaagggc ggggtgatga ccaggctcat cgagcgcaac accacgatcc 422041 ccaccaagcg gtcggagact ttcaccaccg ccgacgacaa ccaaccgtcg gtgcagatcc 422101 aggtctatca gggggagcgt gagatcgccg cgcacaacaa gttgctcggg tccttcgagc 422161 tgaccggcat cccgccggcg ccgcggggga ttccgcagat cgaggtcact ttcgacatcg 422221 acgccaacgg cattgtgcac gtcaccgcca aggacaaggg caccggcaag gagaacacga 422281 tccgaatcca ggaaggctcg ggcctgtcca aggaagacat tgaccgcatg atcaaggacg 422341 ccgaagcgca cgccgaggag gatcgcaagc gtcgcgagga ggccgatgtt cgtaatcaag 422401 ccgagacatt ggtctaccag acggagaagt tcgtcaaaga acagcgtgag gccgagggtg 422461 gttcgaaggt acctgaagac acgctgaaca aggttgatgc cgcggtggcg gaagcgaagg 422521 cggcacttgg cggatcggat atttcggcca tcaagtcggc gatggagaag ctgggccagg 422581 agtcgcaggc tctggggcaa gcgatctacg aagcagctca ggctgcgtca caggccactg 422641 gcgctgccca ccccggcggc gagccgggcg gtgcccaccc cggctcggct gatgacgttg 422701 tggacgcgga ggtggtcgac gacggccggg aggccaagtg acggacggaa atcaaaagcc 422761 ggatggcaat tcgggcgaac aggtaaccgt cactgacaag cggcggatcg atcccgagac 422821 gggtgaagtg cggcacgtcc ctcccggcga catgccggga gggacggctg cggccgatgc 422881 ggcgcacacc gaagacaagg tcgccgagct gaccgccgat ctgcaacgcg tgcaggccga 422941 cttcgccaac taccgtaagc gggcgttgcg cgatcagcag gcggccgctg accgagccaa 423001 ggccagcgtt gtcagccaat tgctgggtgt actggacgat ctcgagcggg cgcgcaagca 423061 cggcgatttg gagtcgggtc cactgaagtc ggtcgccgac aagctagaca gcgcgttgac 423121 cgggctgggt ctggtggcgt tcggtgccga gggcgaggat ttcgaccccg tgctgcacga 423181 agcggtgcaa cacgagggcg acggcgggca ggggtccaag ccggtaatcg gcaccgtcat 423241 gcggcagggc taccaactgg gtgagcaggt gctgcggcac gccttggtcg gcgtcgtcga 423301 cacggtggtc gtcgacgcgg ccgaactgga gtcagtcgac gacggcactg cggtcgcaga 423361 taccgccgaa aacgatcaag ctgaccaggg caatagcgcc gacaccttgg gcgaacaggc 423421 agaatcagaa ccgtcgggca gttaacaaca aaagaggaag gcgagagggg gtgacgcgac 423481 atggcccaaa gggaatgggt cgaaaaagac ttctaccagg agctgggcgt ctcctctgat 423541 gccagtcctg aagagatcaa acgtgcctat cggaagttgg cgcgcgacct gcatccggac 423601 gcgaacccgg gcaacccggc cgccggcgaa cggttcaagg cggtttcgga ggcgcataac 423661 gtgctgtcgg atccggccaa gcgcaaggag tacgacgaaa cccgccgcct gttcgccggc 423721 ggcgggttcg gcggccgtcg gttcgacagc ggctttgggg gcgggttcgg cggcttcggg 423781 gtcggtggag acggcgccga gttcaacctc aacgacttgt tcgacgccgc cagccgaacc 423841 ggcggtacca ccatcggtga cttgttcggt ggcttgttcg gacgcggtgg cagcgcccgt 423901 cccagccgcc cgcgacgcgg caacgacctg gagaccgaga ccgagttgga tttcgtggag 423961 gccgccaagg gcgtggcgat gccgctgcga ttaaccagcc cggcgccgtg caccaactgc 424021 catggcagcg gggcccggcc aggcaccagc ccaaaggtgt gtcccacttg caacgggtcg 424081 ggcgtgatca accgcaatca gggcgcgttc ggcttctccg agccgtgcac cgactgccga 424141 ggtagcggct cgatcatcga gcacccctgc gaggagtgca aaggcaccgg cgtgaccacc 424201 cgcacccgaa ccatcaacgt gcggatcccg cccggtgtcg aggatgggca gcgcatccgg 424261 ctagccggtc agggcgaggc cgggttgcgc ggcgctccct cgggggatct ctacgtgacg 424321 gtgcatgtgc ggcccgacaa gatcttcggc cgcgacggcg acgacctcac cgtcaccgtt 424381 ccggtcagct tcaccgaatt ggctttgggc tcgacgctgt cggtgcctac cctggacggc 424441 acggtcgggg tccgggtgcc caaaggcacc gctgacggcc gcattctgcg tgtgcgcgga 424501 cgcggtgtgc ccaagcgcag tgggggtagc ggcgacctac ttgtcaccgt gaaggtggcc 424561 gtgccgccca atttggcagg cgccgctcag gaagctctgg aagcctatgc ggcggcggag 424621 cggtccagtg gtttcaaccc gcgggccgga tgggcaggta atcgctgatg gcgaagaacc 424681 caaaggacgg ggaatcccgg acgtttttga tctcggtagc cgccgagcta gccggcatgc 424741 atgcacagac cctgcgtacc tacgatcgtc ttgggttggt cagcccgcgg cgcacctccg 424801 gtggcgggcg ccgctattcc ctgcatgacg tcgagttgct gcgccaggtg cagcacctct 424861 cgcaggacga gggggtcaac ttggccggca tcaagcgcat tattgaactg accagtcagg 424921 tcgaggcgct gcagtccagg ttgcaagaga tggctgagga gttggcggtg ttgcgtgcca 424981 accagcgccg cgaggtcgcg gtggtgccga agagcaccgc cctggtcgtc tggaaaccgc 425041 gccggtgagc gagcgcgcgt agcgggggag cgaacggcgc agttggcacc agccggtgag 425101 cgagcgcgcg tagcggggga gcgaacggcg cagttggcac cagccggtga gcgagcgcgc 425161 gtagcggggg agttagggtc cgctaccgtt gttgaggatg ccggagagtc gggctccgtg 425221 gttgccgaag ccggagataa gggcttgggt cgcgaggtcc agcatgctcg tgttgtagaa 425281 accggagacg gtattgccta ggttcgccca gcccgacagc aggttgccga agttttggaa 425341 gcccgaattc cctacgccgc cagcattgaa gaagcccgaa gtctcggtga agacgtttcc 425401 caggcccgac acggcggctg cggcgtcgtt gaggaagccc gatgcgccac cggcgccgga 425461 gttgaagaag cccgacgacg gggttgtggt cgagttgaag aatcccgggc tctgctgcca 425521 gccgaagccg aaggggaacg cgcccacggt gccgctgccg gcgaaatcga gggtttgggt 425581 gaaagccgtg tcgatgggct ggtcggggtt gatcgtgctg gcatcgattt cgtaggggcc 425641 gagatgttcg gtggtgatgg gtatggtgac cgagacatgc tttacacacc ccttgaaagg 425701 gatgtagatc acgcagaccg acacccgcaa cttgatgggt atttcgaatt cgtcaatagt 425761 gaacgcgtcc tgggtgatgg cgttgatgtc gccctcgatg ggtatttcaa tgttggaacc 425821 tatgtcgtag ctccacggga tttcggaaac ggcgctctgg taggcgaaac cgcctaggcc 425881 ctggtagtcg ccccgccaga agaagccgtt gctgtagttg cccgaattga aggcgccggt 425941 gttgacatcg ccggagttgg cgaagccggt gttggagttg cccgggttga agtagccggt 426001 gttgtagtta ccggggttga agccgccggt gttgtagtcg ccggggttga agctgccggt 426061 gttggtgttc ccggcgttga ataggccggt gttgtagtca ccggagttgc caatgccggt 426121 gttgaccagg ccggtgttga agaagccgga gttggtgctg ccggtgttgc cgatgccggt 426181 gttgtagtca ccggagtttc cgatgcccca gttgccggtg ccggagttgc cgaagccgat 426241 gttgccggtg ccggagttga atagcccgat gttgccggtg cccgagttcc agccgccgaa 426301 cccggtcatg gtgtcgccgg tcagcccgat gccgatgttt ccgtttccgg tgttgccgaa 426361 gccgatgttg ccggtgccgg tattgccgat gccgatgttt ccgtttccgg tgttgccgaa 426421 gccgatgttg ccgatggccg ccgtcagccc cggaccgacg ttgccgaacc cgatgttgga 426481 gctgccgatg ttggcgctgc ccaggttgaa gtcgccgatg ttggcgctgc ccaggttgaa 426541 gtcgccgatg ttggcgctgc ccaggttgta gacgccgatg ttggcgctgc ccaggttgta 426601 gttgccgatg tttgcgctac ccagattcca gaacccgagg ttggccaagc ccacgctgaa 426661 ggtcgtctcg gtcgggccgt tctgcaacca cccagccagg ttggtgccga tgttgagcaa 426721 gcccgagaca tttgccggtg ctcccaagcc ggtgttgtag atgcccgaga tgctgttgcc 426781 caaattcgcc cagcccgact gcagcgaccc atagttttgg aagcccgaat ttcccatgct 426841 gctagtggcg aagttgtaga ggcccgagtt gttgccgaag ttcagcaagc ccgatgcgct 426901 gccagcaccc cagttgagga agcccgacga cgggccggtg gtggcgttga agaagccggg 426961 cgctggccgc aagtcgatga tcggaatgct gatcgggccg gcgccggcgc cgcccacgat 427021 gttgatcacg gtggagccgt cgggcttgcc gatgttgagg ttgatcgccg gcgaggggcc 427081 gaagtcaatt tcgatgggtg tgtccagcgg ggccgacgcg tccccgccat gcagggtgat 427141 cggaccgacc ggggccaaga cggtgccact gaggatgcct atgtcgacgc ttccgctggc 427201 gtcgattctg gggaacgtaa tggcggggat ggagacattg gtgatgtcgc cggtgatggg 427261 gatgttgacc gggatgtcga cattgaggaa cgcggcaggt cgctcgatgg tgatggtgta 427321 gttggccgcc agcaggccct gccggtcggc gcgccagagc aagccgttgt tcatgctgcc 427381 ggtgatgaac gcgccggtgc cgtagtctcc ggcgtttgcc atgccggtgt tgtagctacc 427441 ggtgttgtag gagccggtgt tgtagtggcc cgggttggcg atgccggtgt tgaaggtgcc 427501 gatgttgaac aggccggtgt tgtggttgcc cgggttggcg atgccggtgt tgaccaggcc 427561 cgcgttgagc aagccggtgt tgtagttgcc gctgtttccg atgccggtgt tgccggtgcc 427621 ggtgttgccg atgccgacgt tgccggtgcc cgagttgccg atgccgacat tgccggtgcc 427681 cgagttaaac aagccgatgt tgaaactgcc cgagttggtg ccgccgatgc ccacctggct 427741 gtcgccggat aggccgacgc cgaagccgcc ggtgcccatg ttgccgatgc cgacgttgcc 427801 ggtgcccgag ttgaacaagc cggtgttgcc ggtgccggag ttgaacaagc cggtgttggc 427861 ggtgccggag ttgaagaaac cggtgttgcc ggcgccggag ttcagggagc tgaagccgga 427921 caagccgtcg ccggtcagcc cgatgccgac gttgttgttg ccggtgttga acaagccgat 427981 gttgttgttg ccggtgttgg caatgccctg gttgaagttg cccgcgttgg ccatgccaaa 428041 gttgttgtcg cccaggttga acaggcccat gttggcgatg ccggcgttca gcgggccgaa 428101 cccgatctgg ttgtcgccgg acaggccgat gccgatgttg ttgttgccgg tgttgccgaa 428161 gccgatgttg tagttaccgg tgttgccgac accgatgttg tagttgccgg tgttgccgat 428221 accgatgttg ttgacagccg ccgttagccc cggaccggtg tttgcgatcc cgacgttaaa 428281 gtcgccgatg tttgcgccgc cgatgttcgc gttgccaatg ttgccgaagc cgacgtttga 428341 attgccgagg tttgcactgc cgaggtttgc actgccgagg tttgcactgc ccacgttgaa 428401 ttggccgagg tttgccaggc ccgcgttgaa ggtcgacccg ttcgggccgc ggagcacgcc 428461 ggtcaggtcg gcgccgacgt tggagaggcc cgagaggtta gccggcgtcg cgaagtccgc 428521 cgcactggtg ttgtagaagc ccgagacggt gttgcccaag ttcgcccagc cggattgcag 428581 cgagccgaag ttctgcaagc cagagtttcc tatgccggcg aaagcggtgt tccagaagcc 428641 cgaattgttg gcgccgacgt ttccgaatcc agacgagctg ccggtgccgg agttgaagaa 428701 gcccgatgag gggccggtgg tcgagtttcc gaaaccgggg gccggcggga tgtccagtag 428761 cgggatgacg accgggccgg cgccgctggt gatcgtgatg ggtatcgagg atgaaccgcc 428821 ggggtcgccg atgttgatat cgatcaccgg taaggtgccg gcgatcctcg gaacgatgat 428881 gggtccgacg ctgaagtggg tgaccccgag ggcgcggagg gtgatagccg gaatctggaa 428941 gccgttgacg gcaagggtac cgaagtcgag atggatgggg atgttgaccg gaatgttgag 429001 gctaaagttc agtagcggga tctggggaac agtgattgcg tagtgtgcgc cccactggcc 429061 ttggtgatcg ctctgccaga aggcgccgtt gctgaagttg cccccgatga aggcgccggt 429121 gttcacgtcg ccggtgttgt agaagccggt gttgtagtcg ccggtgttgt agaagccggt 429181 attgaagtcg ccgtcgttga agctgccggt gttggtgctg ccggtgttgt agctgcccgt 429241 gttggcgatg cccacgttgg cgatgccggt gttggtgctg cccacgttgt agccaccggt 429301 gttgtaggtg cccacgttgg cgacgcccat attgccggtg cctgggttcc acaggcccca 429361 gttgccggtg cccgagttcc ccaagccggt gttgccgaca cccgggttgc cgatcccggt 429421 gtttccgatg cccgagttgc caatgccgat gtttccggtg cccgagttga acaaaccgat 429481 gttgttggtg cccgagttga acaggccggt gttggcggta ccggagttcc aggagccgaa 429541 cccctgttga ttgtggccgg acagcccgat accgatgttg ttgttgccgg tgttggcaaa 429601 cccgatgttg ttgctgccgc tgttggcgaa gcccaggttg aaatcgccgg cgttgccgaa 429661 tccgaagttg tagctgccca ggttgcctag gccgatgttg tagttaccca ggtttgccgg 429721 gccgatattg tatgagcccc ggtttccgga gaagacgttg aagctgccga tgttgccatg 429781 gccgacgttc gcgttgccga ggtttccgag gccgacgttg gagtcgccga tattgccgtg 429841 gccgacattc ccgctgccta cgttgccaaa cccgaggttg aggatctggg tgttgttgac 429901 gaagccggcg accgtagcac cgacattccc tccgccggag agattggccg gcatcgaaag 429961 cccaagcgtg ctcgcattga caaggccgga gacagtgtta ccgaagttga acccgccgga 430021 gatcagctcg ccgaagttct ggacgcccga gattcccgag cttccagagg tgaggtcgaa 430081 gaagccggaa ctattgctgc ccacgttgcc aacgcccgac acggtaccgg taccggcgtt 430141 gaagaatccc gacgacgggg tggtcgtcga gttcccgaat cccggggccg ccggtatatc 430201 gatgagtgga atcttgatcg gcaatagacc gccggtgccg gcgatatcga tcagcgggtc 430261 cggcccgcta cccaggttga tgccgatatt gggaaggaca atcgagatgt tcgggaaact 430321 gaatgcatcg agtgtggcgg cattgaacgg tatgccgatc aagaagatat cgccggtgat 430381 ctccgggaat ctgaagccat gaacggtgaa cgtgccaagt gtgccggtga ccgggatatc 430441 gaggaagatc ggcacgtgca gtttcaccgg aacggcggtg tcgggcacgg tgatcgtttg 430501 gctgatcccc gccaggcctt ggtaatcgcc ccgccaccag aacccgttgc tgtaattgcc 430561 ggtgatgaag ccgccggtgt tgacatcgcc cgagttggcc agtccggtgt tgtagctgcc 430621 ggtgttcaaa taacccgtgt tgtagctacc cgggttgaag ccggccgtgt tgaagctgcc 430681 ggcgttgaag ctgccggtgt tgtaatggcc ggtgttgaag atgccggtgt tgacgttgcc 430741 ggcgttgagc aagccggtgt tgacgtcgcc ggtgttgaag atgccggtgt tggtgtcacc 430801 ggtgttggcg atgccccagt ttccggtgcc cgagttgccg atgccgatgt tgccggtgcc 430861 cgagttgaac aaaccgatgt tgttggtgcc agagttgaac aggccgctgt tgccgctgcc 430921 ggagttccag ccaccggcaa aattgaagcc ctgctggttg tcgccggaca ggccgatacc 430981 gatgttgttg ttgccggtgt tggcaaaccc gatgttgttg ttgccggtgt tggcgaagcc 431041 caggttgaag tcgccggcgt tgccgaatcc gaagttgtag ctgcccaggt tgcctaggcc 431101 gatgttgtag ttacccaggt tcgccgggcc gatgttgtat gagccctggt ttccgccgaa 431161 gacgttgaag ctgccgaggt tgccgctgcc gaggttgaag ctgccgatgt tcgccaagcc 431221 ggcgttgctg tcgcctacgt tggagaagcc gacgttgaat tggccgatgt ttcccaggcc 431281 gaggttgaac atcgacatcc cggtcgcctg gtcgtggaag aaccccgcga ggttgctgcc 431341 gatgttgaac atgcccgaga cgttggccgg tgccccgatg ccggtgttga atacgcccga 431401 gacggtatcg cccaggttcg ccagtcccga ttgcagcgag ccgtagttgt tgaagcccga 431461 ggtcgcggag ttcgcgacgt tctggaagcc ggaaatgttg gcgccgatgt tggcgatgcc 431521 cgatacggtt ccggggccgc cgttgaagaa gcccgaggac ggatcggtgg tggcgttgaa 431581 aaagcccgtg gtagccgcaa tgttgacgaa cgtgacatcg aagggaccga cgcttgcggt 431641 ggccgggatc ctgatcgcgg tcgaaccgcc agggtcgccg atgttgaccg tgatcgcggg 431701 accggtcccg gtgatgggcg ggagaacggc cttgctgatt gcaccggcca gcagggggat 431761 ccctgcgatg tcgatggtga aaccgaagtt gatttgctca agcgttatgc cgctgtagac 431821 ggtgttggtg aagctggcgg tgatggggat gttgacggga acttccacgg tgacgtgtgc 431881 gggtatttcg ggaacatgga cccgatagcc cgcgctgaat aggccctgct ggtcgccgcg 431941 ccagaaggcg ccgttgccca tgtcgccggt gatgaaagcc ccggtggcga tatcaccctg 432001 gttggcaaag cccgtactga aattgccggt gttgtagaag cccgtgttga agtcgcccag 432061 gttggcgatg ccggtgttgg tgtcgccggt gttgtaccag ccggtgttgt agctgcccgc 432121 gttggcgaca ccggtgttga cgatgccggt gttgaagaag cccgtgttag tgctgccggt 432181 gttgccgatg ccggtgttgc cgctgccgga gttgccgata ccccagttgc cggtgcccga 432241 gttgccgatg ccgacgttgt tggtgccgga gttgaacaag ccgatgttcg cggtgcctga 432301 gttccagccg ccagcaaaat tgaagccctg ctggttgtcg ccggacagcc cgatgccgat 432361 gttgttgttg ccggtgttgg caaatccgat gttgttgttg ccggtgttgg caaagccttg 432421 gttgaaatcg ccggcgttgc cgaagccgat gttgtagtta cccaggttcg cgaaaccgat 432481 gttgtagtta cccaggttcg ccggaccgat attgtatgag ccctggtttc cggagaagac 432541 gttgaagctg ccgacgtttc cgctgcccag gttgaagtcg ccgagatttg cgctgccgat 432601 gttcaactgg cccaggttgg caaggcccgc gttgaagatc gtcccggtcg gaccgcggaa 432661 cacgccggac aggttggtgc cgatgttgtt caggcccgag acattggccg gcgtggagag 432721 gttcaccgta ctggtgttga aaaagcccga tacggagttg cccaggttcg cccagcctga 432781 ctgcagcgag ccgaggttct ggaaacccga attccctatc gcgctgctca aaccactgtt 432841 ccagacgcct gaactgccgc cgccgacgtt ttggaagcca gatgtgccac cggtgcccga 432901 gttgaagaag ccggacgagg ggttggtggt cgaatttccg atgcccggcg ccggatcgat 432961 cttgaggaag gtaatcgtgc ggctctccag agcaccgaca atgctgatgg ggacggtcac 433021 cgtcggtccg ccgatggtga gggtgatcgt cggaacggtc agcgtggatg cgctgagatt 433081 gaccgggccg aagaagaaca aaccgctcag atagaaggtt tgggggaaaa cggtcgaggc 433141 ctcggtgacc gtgatcatgt tgccgccgaa ggtcattacg ttgtgtacgt caatgaccat 433201 ctgctcgttt atggggatga atggagtggt gaccgagaga tcgatggcaa tctggccctg 433261 gttatcgccc gccaccaaga agccattgtt gaagtcgccc gtgtcgaaag cgccggtatt 433321 gacgttgccg ggattgaaga agccggtgtt ggtgtcaccc gggttatagc tgccggtatt 433381 ggtgtcaccc acgttgaagt tgccggtgtt ggtgttaccg acgttgaagc cgccggtgtt 433441 gtagctgccc gtgttgtaga agcccgtgtt gaagtcgccg gcgttgagga tgcccgtgtt 433501 gtagctgcca gcattgagga tgccggtatt gtcggtaccc gggttcccga taccccagtt 433561 cccggtgccc gagtttgcga tgccgacgtt tccggtgccc gcgttgaaga tgccaacgtt 433621 attggtgccc gaattgaaca ggccgctgtt gccggtgccc gagttccagc cgctagcaat 433681 attgaagccc tgctggttgt cgccggacag cccgatgccg atgttgttgt tgccggtgtt 433741 ggcgaacccg atgttgttgt tgccggtgtt ggcaaagcct tggttgaagt cgcccgcgtt 433801 cccgaagccg acgttgtagt cgccgacgtt tccaaaaccg atgttgtaga tcccgaggtt 433861 tccggatccg atgttgtagt ttcccaggct tccggaaccg acattgaata ctccgatgtt 433921 tccactgccg atattgaagc tgccgacgtt gccgctgccc aagatgtttt ggctgccgag 433981 gttgccgctg ccaaggatgt tgaagtcacc gacgtttccg ctgccgagaa tgttgtaatt 434041 gccgatgttg gcgttgccga gaatgttcac gacgccccgg tttgccaggc cgagattgaa 434101 gaccggtggg ccaccgaaaa atcccgacat gttgcttccg gtgttgaaga agcccgagat 434161 caaggccggc gttgtgatgg ccaccaggct catgttgaac aaacccgata cggtgttgcc 434221 cgagttgatc acgcccgata ccagcacgcc cgcgtttgcc aggccggagt taccgatggc 434281 ccccgacgaa gagttgaaga agccagaatt gttggcaccg gagttcagga agccggacgc 434341 gctaccggca ccgctgttga agaatcccga cgacggcgca ctggtcgagt tgaagaagcc 434401 ggggctcccg aaaatcaggc cttggtggtc gccgcgccac aagaagccgt tgttgaagtt 434461 gccagtaatg aaggcgccgg tgttgacatt gccggagttt gccaagccgg tgttgtagtt 434521 gccgctgttc aggtagcccg tgttgtactg gcccatgttg aagccgccgg tattgctgtt 434581 gcccgggttg tagctaccgg tgttgtagtt gccggcgttg ccgacgccgg tgttggctat 434641 tccggagttg aagaagcccg tgttggcgtc gccggagttg ccaaaaccgg tgttgtagct 434701 gttgcccgag ttgccaatgc cccagttccc ggtacccgag ttgccgatgc cgacgtttcc 434761 ggtgcccgag ttgaacagac cgatgttgcc ggtgcccgag ttcaggccgc cgaaccccaa 434821 caaaccgcta cccgtgagcc cgatacctcg gttgccgtct ccggtattgc cgaagccgat 434881 gttgttgctg ccggtgttgc caaacccgat gttgttgctg ccggtgttgc cgaaaccgat 434941 gttgttcagc gctgcggtca acccaggacc cacgttgcca aacccgatgt tggagctgcc 435001 gatgttgccg ctgccgatgt tgccgttgcc gatgttggcc gagccgagat tgaagttccc 435061 gacattaccg ttgccgacgt tgccctcgcc gacgttcgcc aagcccaggt tgcggaagac 435121 ccgcgtggtc acctgagccg cggccgcgct gaccagcgca ccgccgcccg ccacggtcgg 435181 cagcgcctgg ccgaacggtg tcaacgccga gacggccgcc gaagccccgg catggtagcc 435241 aaacatcgcc gccacgtcct gggcccacat ctgctcgtag gcggcctcgg tggccgcgat 435301 cgccggggcg ttttggccca gcaggttcga gaccaccagc gacacgaaca gtgcccggtt 435361 ggccgagatg atcgccggat gtaccgtcgc cgccagggct gcctcgaagg cggccgccgc 435421 cagccgggtt tgggtggccg cctgctgggc ctgcgccgcc gccgcgctca accagcccag 435481 atagggggcg gccgctcccg tcatcgccgt cgacgccgcg cccagccacg aggaacctgc 435541 cagccccgcc gtcaccgccg aaaacgaggc cgcggccgaa cccaattcgt cggccagtcc 435601 atcccaagcg gccgccgcgt ccagcatcgg cgccaacccg gcacccacgt acaggcgcgc 435661 cgaattgatc tccgggggca gcaccgcgaa gctcatctag cgtccctaac cggaaccgct 435721 gaccaccacc gcgtggtggg tggagccaaa cgtcccgttc cgcgcttggg tgtcttgaca 435781 gtgacgatta ttcaacagac gcctgacgca ggtttggctt tggagtgtcg agacagaaaa 435841 tctcagctag ggctggccgg gcagtagccg caccatcagg ccgttgcctt cggccaacag 435901 cgtctcgtcg ctgtcaaaca gttccgcgca cacaaacgcc tttcggccct cggtattggt 435961 gactcgtccg cgtacgatca acggcacatc aatcggggtg attcggcggt aatcaacgtg 436021 cagaaaggcg gtccggctga tcggccgtcc cgccgcatgc gagatcatgc cgaacatgtg 436081 atcaaacaac agcggcaaca cgccgccgtg caccgcggag ttgcccccga cgtgaaaccg 436141 gctaaacgac ccccgcatct caacaccgtc ggtgccgtac cgggtcaccg tccatggcgg 436201 tagcagcagg ctgcccatgc cgggcaggcc gggggtccgc ccggccggcg ccttgccttc 436261 gtcggcctca aatgggctca gcaactcgac gagcgcggcg gcgcgctcgg ccgcctcgtc 436321 ccacacggcg tcgccggggt ccgccgcgac cgccaggtcc tgcaaccggc gcatggtcgc 436381 cacgaactgg ccgaaccccg caccgggact ggccggaccg tactccggaa atccaccgtg 436441 gtggtgatac tcgggatcga gttcgtcggg gtgcactgac gcatctgtca cgggcgatcc 436501 tgcaggacgt cccggcgcac gatggtctgt tcccgccccg gaccgactcc aatgcacgaa 436561 accggtgctc cggcaagctg ttccagtcgc agcacataat cacgcgcttt ggcgggcagg 436621 tcgtcgaact cgcgcgcccc ggagatgtct tcccaccagc ccggcagctc ctcgtaaacc 436681 ggcttggcgc ggcaaagatc ccgctgggtc atcggcatat cgcgggtgcg ccggccgtcg 436741 atctcatatc cgacgcagac cggcaccgat tccaggctgg acagcacgtc gagcttggtc 436801 aggaagtagt cggtgatgcc gttgacccgg gcggcgtagc gggcaatgac ggcgtcgaac 436861 cagccgcagc gccggcgccg gccggtggtc acaccgaact cgcggccagt cttggacagg 436921 tattcgccgt gttcgtcgaa cagctcggtg gggaacgggc cggagcccac ccgagtggtg 436981 taggccttga gaatccccag cacggtgccg atgcgggtcg ggccgatacc agagcccacg 437041 gccgcgccac ccgccgtcgg attcgacgat gtcacatacg ggtaggtgcc gtggtcgaca 437101 tcgagcaggg tgccttgaga gccttccagc agcaccgttt cgccggcctc cagggcagca 437161 ttgagtagca gccgggtgtc ggcgatgcga tgcttgaaac cctcggcctg ctccagcagc 437221 gcgtcgacca cctgcgcggg gtccagggcc ttgcggttgt agatcttgac cagcacttgg 437281 ttcttgaact cgcacgcggc ctcgaccttg tgggtcaatt gttccgggtc cagcacatcg 437341 gcgacccgga tcccaatacg ggcgatcttg tcctggtagc acggcccgat accacggccg 437401 gtggtgccga tcttcttgct gcccatatag cgctcggtga ccttgtcgat agcaatgtgg 437461 taaggcatca gcagatgggc gtcggcggag atcaacagct tggcggtgtc cacgccgcgg 437521 tcttgcagtc cccgcagctc attgagcagg acaccgggat cgatcaccac gccgttgccg 437581 ataacgttgg tgaccccggg cgtcagcaca cccgacggga tgagatgcaa tgcgaaattc 437641 tcgccggtag gcaagacgac ggtgtgcccg gcgttgttgc ccccctgata gcgcaccacc 437701 cactgcacgc ggccacccaa caggtcggtg gccttaccct tgccctcgtc gccccattgg 437761 gcgccgatga ggacgatcgc cggcatgagt tgctcccacc tggtctcgca ggctatgccc 437821 gcttattgtg gtccagccgg tgacctaccc tacccagcag gttgcgagga gctgtcatgt 437881 atacggccga gaacgcaccc ggcgtcgcgg tgttgctctc cggtgatgcc gacgtgcccg 437941 gcccgttgac cggcttgcct acccatcaag acaacctgga caccgtcatc ggacggtatt 438001 cgcggctcat cgtcgtcggc gccgacgcgg acctgggggc ggtactgact cggctgttgc 438061 gcaccgaccg gctcgacgtc gaggtgggtt atgtgccgcg ccggcgcagc cccgcgaccc 438121 gggcctaccg cttgccggcc gggcgccggg cggcgcggcg cgcccggtgt ggcgtcgctc 438181 ggcgggtgcc gctaatccgt gacgagaccg ggtcggtaat cgtcggccga gcacagtggc 438241 tgccggccga agagcaggcc ctgatccacg gcgaggcggt cgttgacgac accgtgctgt 438301 tcgatggcga tgtggccggg gtgtgcatcg agccgacgct gaccctgcca ggcctgcgag 438361 ctgcggtaga cggcgccgga aagtggcggc ggtggatcgg cgggcgcgcc gcgcagctag 438421 gcaccaccgg tgctgcggta cttcgggacg gtgtcgcggc gccccgcccg gtgcgccgat 438481 ccacgtttta ccgcaacgtc gagggttggc tgctggtccg gtagttttcg accggtgagc 438541 gagacgggcc agcgcgagtc ggtgcgaccc agcccgatct ttctgggcct gctcggattg 438601 acggccgtcg ggggcgcgct ggcctggctg gccggggaga cggtgcagcc gctggcctac 438661 gccggggtgt tcgtcatggt gatcgccggc tggctggtgt cgctgtgcct gcacgagttc 438721 ggtcacgcgt tcaccgcttg gcgtttcggt gaccacgacg tcgcagtgcg cggctacctg 438781 acgctggatc cccgccgcta cagccatccc atgctctcgc tcggtctgcc gatgctgttc 438841 atcgccctgg gcgggatcgg tctgccgggt gccgcggtgt atgtgcacac ctggttcatg 438901 acgacggcgc gccgcaccct ggtcagtttg gcggggccga cggtcaacct ggcgctggcc 438961 atgttgctgc tggcggcgac ccggttgttg ttcgacccga tccacgcggt gttatgggcc 439021 ggggtggcgt tcctagcatt ccttcagctc accgcgctgg tgttaaacct gctacccatc 439081 ccgggtctgg acggctatgc ggccctggag ccgcacctga gacccgagac gcagcgcgcc 439141 ctggcgccgg ccaagcagtt cgctttggtg tttctgctgg tcctgttcct ggcgccgacg 439201 ctgaacgggt ggtttttcgg ggtggtgtac tggctcttcg acctgtctgg cgtgtcgcac 439261 cggctggccg ccgcgggcag cgtgctgacc cgtttctgga gtatctggtt ctgaccgttc 439321 agagcccaag cgccggacgg gccgcggggt cacagtcgtc aagcagatcc aggcagcgtc 439381 catactcgtc ggtctcgccg atagcggctg cggcgcgcgc cagcgccgcc acacaccgta 439441 ggaaaccccg gttgggctgg tgggaatacg gcaccgggcc gaagcccttc cagccatggc 439501 ggcgcagctg gtccaggccg cggtggtacc cggtacgcgc gtatgcgtag gccgtgacgg 439561 tcttgtcgtc ggccagcgcc ccttcggcga gcaccgccca ggcgaccgac gccgacggat 439621 gcgcggccgc gacgatgctc ggactttcgt tggcaagcag ctccgcttcg gcgtcgctgt 439681 cgccaggcaa caggattggc tcaggtccca agagatcacc catcgacgtc atgggagtta 439741 ttgtgcgctt ggtcacgtca cctcgacgat ggggccaacc gaaggctggg tcgctaagct 439801 ccaaagagcc actcgatacc gggaggacag cagcacccat gtccaacgca cccgagccag 439861 accgctcagc cggtgaatcc gggagcgaac cggccggcga gcggtccgcc gatcctggcg 439921 aggaacgcac cgaaagctac cccctggtgc ctcacgacgc cgaaaccgag accgtggtga 439981 tcaccacctc cgacaacgat gccgcggtta cgcaaccgga agcgcagcgc gaacgccgtt 440041 tcaccgcgcc cggcttcgac gccaaggaga cccaggtgat cgtcacggcc cacgaggcag 440101 ccaccgaggt tttccaaacc aaccaggcgc cgaccacccc gccgcggatg ccaaccggaa 440161 tgcccccgaa aactgctgtg ccacaatcaa tcccgccacg gacggaggcg acgtcagtcc 440221 ggcaacgcac ctggggctgg gcgctggcgg tggtagtgat cgtgctggcg ttggcggcaa 440281 tcgcgatcct gggcaccgtg ctgctgaccc gcggcaaaca ttcgaagatg tcgcaggaag 440341 atcaggtgcg gcaggccatc cagagcttgg acatcgccat ccagaccggc gacctgaccg 440401 cgctgcgttc cctgacttgt ggctccaccc gcgatggcta cgtggattat gacgagcgtg 440461 attgggccga aacctatcgc cgggtttcgg cggccaaaca atatccggtc atcgccagca 440521 tcgaccaggt cgtcgtcaac ggcgcgcacg ccgaggccaa tgtcaccact ttcatggcgt 440581 tcgatcccca ggtccgctcg acccgcagcc tcgacctaca gtttcgcgac gatcagtgga 440641 agatctgcca gtcctccagc aactgaagcc aggattggct ggtttgcccg cattttggcc 440701 attggtcagt gctaggaccg gtccgcatca ccggcacgtc accaggaccg actagtccga 440761 acaccgaaac gagcaaccgt agccgaaatg cggctggatc ccgtctgtgg caatgtactg 440821 gcggcctgtt cccgcagaga cggcggcata gcgtctcgat cgtcaacgag aggcaggtga 440881 tcgccaggtg agcatccgcc ccgccgagaa ctcaacactc gacatccgcc acgtcatcgg 440941 tatcggcacc ccgaaagccg tcgatttgtg gctcgacgtc gtcaccgagc tgccggatcg 441001 cgcccgcgaa ctcgggtcgt tatccaaagc cgaactcgga aagcttggcc cactgctcga 441061 cggcaccaac gccgtcgagc tattcgagtc gatcgacgac aagctggccg cagaggcact 441121 gcacgcgatg gatccgtcgc tggccgccac cttcctcgag gccctcgact ccgaccacgc 441181 cgccaacatc ctgcgcgaat tcaaggagcc caagcgggag gcgctgctga cgttgctacc 441241 gctggagcgg gcgatggtgc tgcgtggctt gttgagctgg ccggaggact gcgccgcggc 441301 ccacatggtg cccgaaacgc tgaccgtacg cccgaacatg acggtgtcgc aggccgtcgc 441361 cagcgtgcgg gaacgcgcct cgggcctgcg cagcgatgca cgaaccaccg cctacgtcta 441421 tgtgacagac gccgactccc acctgctggg tgtgatcgcc tttcgcgccc tggtgctggc 441481 caatcccgaa cagcgagtcc gtgagctgat gggtgacgac ctcatcgtcg tgtcgccgtt 441541 gactgacaag gagctcgcgg cgcagacaat catgggccac aacctgatgg cggttcccgt 441601 cgtcgatgcc gacaaccggc tactgggcat catcgccgaa gacgaagcca tcgacattgc 441661 cgaggaggaa gcaaccgaag acgccgagcg ccagggtggg tcggccccgc tcgaggtgcc 441721 ctacctgcgg gcgtcgccgt ggctgctatg gcgcaagcgg gccgtctggc tcctggtact 441781 ttttgctgcc gaggcctaca ccggcagcgt cctgcgggcg ttctccgacg aaatggaggc 441841 ggtgatagcg ctcgcgttct tcatcccact gctgatcggc accggcggca acaccggcac 441901 ccagatcgcc accactctgg tccgcgcgat ggccaccggt caggtccggt ttcgcgatgt 441961 gcctgcggtg ttagccaagg agctgtcaac cggtgtgctg gtcggcctca ctatggccgc 442021 cgccgcggtg gtgcgcgcct ggacattggg cgtgggcccg caggtgaccc tgacggtcgc 442081 gctgacggtg gccgccatcg tggtgtggtc gtcgctggtg gctgccgtcc ttccgccgct 442141 gctgaagaag ttgcgcatcg acccggccat cgtttcgggg ccgatgatcg ccaccatcgt 442201 tgacggcacg ggtctgctca tctacttcct ggtcgcgcac ctgacgctga ccgagctgca 442261 cggcttgtga gcggccccgg tttagtgggt tagggacttt ccggcgcagt gcaggtcatt 442321 gcacgcctga acgacccgct ggctcatcga agcttcggcc ttcttgaggt agctgcgcgg 442381 gtcgtagacc ttcttgacac ccacctcgcc atcgaccttg agcactccgt cgtagttggt 442441 gaacatgtga ccggcgatcg ggcgggtgaa cgcgtactgg gtgtcggtgt cgacgttcat 442501 cttcaccacg ccgtagcgca gcgcctcctc gatctccgac ttaagcgaac ccgagccgcc 442561 gtggaacacg aagtcgaacg gcttggcgtc ggccggcagt ccgagcttgg ccgccgccac 442621 ctgttgccct tgcgcaagga tgtcggggcg aagcttgacg ttgccgggct tgtagacgcc 442681 atgcacgttg ccgaacgtcg cggccagcag gtatttgccg tgctcaccgg cgcccagcgc 442741 ctcgatggtt ttctcgaagt cctccgggct ggtgtacagc ttctcgttga tctcgttcgc 442801 cacgccgtcc tcttcgccgc cgacgacgcc gatctcgatc tccagaatga tcttggcggc 442861 cgccgccgcc ttgagcagct cctgggcgat ggccaggttc tcatcgattg gcactgccga 442921 gccgtcccac atgtgcgact ggaacaaagg attgccacct ttgctcacgc gttgcgccga 442981 gatcgccagc aagggccgga catagctgtc caacttgtcc ttggggcagt ggtcggtgtg 443041 cagcgccacg ttgaccgggt acttggccgc gataacgtgg gtgaactccg ccaaggcgac 443101 cgcaccggtc accatgtctt tgaccccgag gccggagccg aattctgcgc caccggtcga 443161 gaactggatg attccgtcac tgccggcgtc ggcgaaacct ttgatcgcgg cgttgacggt 443221 ttccgaggag gtgcagttga tagccgggaa agcgtacgag ttttgtttgg cctgaccgag 443281 catctccgcg tagacctcgg gcgttgcgat aggcatgaaa cgttcctcct gacgactccg 443341 atccacccag tatcgcaaca ccgcaaccga gcttgtcggc ctgtgcgtga tggccggtat 443401 gttgggacgt catgagcacc gccgtgacgg ccatgccgga catcctcgac ccgatgtact 443461 ggttgggcgc caacggcgta ttcggttccg cggtgctgcc cgggattttg atcatcgtct 443521 tcatcgagac cggtctgctg tttccgctgc tgccgggcga gtcgctgttg ttcaccggcg 443581 ggctgttgtc cgctagcccg gcaccaccgg tcaccatcgg ggtgctcgcc ccgtgcgttg 443641 cgctggtcgc ggtgctcggc gatcagaccg catatttcat cgggcgacgg atcgggccgg 443701 cgctgttcaa gaaggaagac tcccggttct tcaagaaaca ctatgtgacc gagtcccacg 443761 cgttttttga gaagtacggg aaatggacga taattctggc tcgattcgtg ccgatcgcgc 443821 ggacttttgt gccagtcatt gccggggtgt cctacatgcg gtatccggtg ttcctcgggt 443881 tcgacatcgt cggcggagtc gcctggggtg cgggtgtgac gttggcgggc tactttttgg 443941 gcagtgtccc gttcgtgcac atgaactttc agctcatcat cctggccctc gtgttcgtct 444001 cactgttgcc cgcactggtc tcggcggcgc gggtctaccg ggcgcggcgt aacgcacccc 444061 agagcgaccc cgacccgttg gtgttacccg agtgagctga ccgctgcggc gctgtgggcg 444121 gcttccatca gcatccaacc cgatagctgc accgacagat ctcgctcggc aatcgccgag 444181 ctatgcaccg ctcctcggac ggaccgcgcc tgctcaccgc cggcggtggg caactcggct 444241 tcgcgatccc agaacgcccc gaacaccggc aacccgtcca cggtttgccg gtaatcccac 444301 gccgattgcg cgctagccag cactatcgcg cgggcggtgt cgcgggcggc ggcgtcgtcg 444361 gccgagtcgc ccggcaacgt ggtggcgacc aaggcgaggt atcgggcggt gatccccgcg 444421 aacaggccac cgtccccgcc gccggcgccc cgtaacacac ccaatggagc catgtgctcg 444481 ttgacggccg cgaccaagcg atgaacgcga gcgcagtgcc gcgctctggc tgccggaccg 444541 gtgcgcaccg ccagctcggt ttccagcccg agcacaaccc cttggcagta ggtgtactgc 444601 gcgcggacca acgacccggc cttgatgccg tcgaatacca ggtgtgtctc cggatcgatc 444661 agcgtgcgat cgatccagtc ggccatctgt tctgcgcgct tgagcctttt cccgtactgg 444721 tctgggtagc gggccaggaa tagcccggcc gggccgttgg ctggggcgtt gaagaactgg 444781 tcctgcttgc gccacgggat gccgccgccg tcctcgggca cccaggcttc gacgaactgg 444841 ttggtgagct tgggcagtgc gcgccggcgt cgtaccccgg cgacccggtc ggcacgttcc 444901 agcgctaacg ctagccacgc catgtcgtcg taatagctgt tgagccacga gaaattgttg 444961 cggacccggt gcgagcggac ctggcggttg atccgggcgc gccgctgcgg ctgcgggtcg 445021 cgcagctgcg cgtcgaccag gcaatccagc aggtgtgcct gccaccagta gtgccagctg 445081 ccgaacaacc ggtcgcgccg ggttgacggc caagccacca ccgccaactg ggtgcccggc 445141 aacgcccaaa gccgtctcag atgccgttgc gtgacggcgg tttcggcgct ggctgcccgg 445201 tttgccagat tcataatgcg atcctgccct agcctgtctt acgccgtctc aggcctgtta 445261 ctccagcgtg acatcaaggg tggcggcgtc cacgaaggcc agcatgcgcg cccgatgatg 445321 ccacccccgc tgaactgagc gacgatccgc gggcccgcga gccggctgtt gtcatagacg 445381 gtggcgccgt cggccagcgt gatggcctgg gcgacaagct cggccagccg gcggtgacgc 445441 tcgcggatct tggtctccgg cacatcgtgg ccgcccgcgg cgacgcgatg cctgacgcgc 445501 tcgaccgcca ggccttcggg gataaccaac acgtgcagta cgacggtgta gccggccgtg 445561 cgcgcggtgc ggatgagctc gagcttcgat gggtgcgaga acaccgtctc ggcaatgaac 445621 ggccggccca agtcgatgag cctcgcgcgg gtgtcggcgg cgacctgcgc cgcctggtag 445681 gcgtgcgatg ttgggtcgtc gggccagcgt tgtttggcga tttcgtcggc gttgacgaag 445741 acgatgccgg gcagcaaggg cgccagcgtg agggcgacga acgtcgactt gccggcgccg 445801 ttgggcccgg cgaccagatc gagccgcttc acgcgtggcg cgttgtcttc gagctggcga 445861 tcacggcgtg gccgccagca cgaccgaggt gccgtctggc cggtgctcga cgatgtcgcc 445921 cgcgtcgttc agggcgaccg tggtgatgcc ctgcgcggcg agcacgtcgc cgtagttggt 445981 tcgcgacagg cgctcctcga tagctgcgga gatctcggcg ttgaacacca cgccctcctc 446041 cagcgtcagg tccgtcatcg gcagatgccc cgcgagcgca gcttccacgc ggcgccgcga 446101 cgccgtgtgc tggttcgaca ccgcccgacc gacgcgggcc cagtggtcga gctgctgctt 446161 ggccgaacgg ctctgacgag caccctcggc cgccgcgctg tccaccagat ccgcggcgac 446221 gcgcgtgacg cggtcgacgg ctttgggcac gacatctctc ctcgggtgta gcgatctgtt 446281 acagcttata gcaaagtgct acaccgagct gtggtgaggg gcgcacacgg ctagcgggca 446341 ccggccagcg ccagcagcaa ctggtgcagg ccggccagcg agtgcgccgg caggaacagg 446401 tcgcaatagg gcagcgccgc cgccatcgaa ctggcacgcg gctggaactc cggatgtgcc 446461 gcgcgagggt tcagccagac cagcaactcg gcgcggcgac gcaccctggt cagtgcgtgc 446521 accaacacgt cgggcggatc gctgtcccag ccgtcggagg cgatgatcac caccgcgccg 446581 cgtaacgcgt tgccatgcgg cggggccagc agggcggcga cactacggcc gatgaacgta 446641 ccgccgtagc ggtcggtcac cctagcgttg gcccgatgta gcgccatctc ggccgagcga 446701 tgagacagca ccgaggtaag tcgagtcagc gacgtcgaaa acgcgaaaac ctccgggtgg 446761 ccccctgccc ggcgcagcac cgccgcccgc atcagatgca gatagatggc ggcgtagggc 446821 tgcatcgagc ggctcacatc gcagagcagg agcacccgcc tggggcgtcg gcggggccgg 446881 atccgtgcca acagcaccga ctcccagcca gtcgaccgcg acgcgttcat cgtcgcccgc 446941 aggtcgatgc gcttgccgtg cgggctggac tcgaatcgca tgctgcgccg ccgcggccag 447001 cgcgccatcg tggcctccag ccaggcgccg agcagacgca gatcgtcggg atcgaactgg 447061 tcgaatggct cgtcggcccg ggcgacaatg cggctgggca ggacatcggg cagtgtgcgg 447121 ctgggtccgc cctgaccggc gctggccatc gtcagcgagc gagtatccca gggcagattc 447181 tgggcttggg cggcacaaga tcgccgcttg gcgcggtgcc cgacgccggc caccggtgtg 447241 cgcgggcctg caatgggcgg tggtgggcgg ttggcaccgt cgggttcggc gctgccaaat 447301 accccgaaca gcgaagcgaa taccgcatcg aacgtggcca gttcgtctac acggctgacc 447361 agggtcaacc gcgcgcccca atacagcgcc gccggcgtac gcggcaccaa ctgctgcaac 447421 gcctgcacca aactcgcttg accgctggcg gacaccggta tcccggcgtc gcgaaggcgc 447481 gctgccagcg ctgccgcgaa cgccgcgagg tcgacgcccg gcaacagtgc aggggtggcc 447541 atcccattca tcgccgccgg cgcagcaccc agatcagcag cagcacggtc agcgccgcca 447601 gcagcgccga accgtacttt ttgagctggc cgccgtcggc cagctgcagc aagtcgatgg 447661 gcgccgcttc ggtagcaggg ggtgtgcctt gcgggctttc tgaactctgg gccgcgagct 447721 cggcttctag cgaatccacg aactggccca gcagcttctc cgacacctgc tgcagcatgc 447781 cactgccgaa ttgcgccagt ttgccgacaa tcttcagatc ggtgtcgacg gtgacgcggg 447841 tacgctctcc gacctcgtgc agctgggcag cgaccgtggc ggccgcgttg ccggtaccgc 447901 gcgcctcctt gcctttggcg tcgaaaacgg cgcggtgctg gttgcggtcc tgctcgacaa 447961 agtgcacctt gccgctgaac tcgctggtga ccggcccaac cttgaccttg accttaccga 448021 ggtactcgtc gccctcatgg ccgatcaact gggctccagg catcagcgga atcatctgct 448081 ccaggtcgca tagcctgctc caggcctgct cgatcggagc gctgacggtg aactcgttgg 448141 cgatcttcat cctgtgcgtc ctctcatgcg tggctgcact cagtaaaagc ttggtacgca 448201 tcgcgaatct gcgtacggtc gtcgggcgtt ttggccaggg ccccaaggct ggccagagcg 448261 ggactcgaat ccgctgcggt gaggtctgcg accccgagtg ccaccaaagc cgccacccag 448321 tcgatagtct cggccacacc gggtggcttg tccagatcga gatcccgtgc agtgcaaacg 448381 aattgagtgg cgttctcgat caacggcgcg gtagccccgg gcaccgtgcg gcgcacgatc 448441 gcggccgccc ggtccggctc cgggtagtcg atccagtggt agaggcagcg ccgccgcagt 448501 gcgtcgtgca ggtcacggct gcggttggac gtgagcaccg cgatcggcgg gcactccgcg 448561 aggaaagtgc ccagctcggg aacggtcacc gcggactcac caaggaactc cagcagtaac 448621 gcctcgaatt cgtcgtcggc ccggtcgatt tcatcgatga gcagcactgg aggggtcggt 448681 ccgcggtgcc gcacgcaccg caggatgggc cggtccacca gatacgcctc ggtgtacaga 448741 tccgcttccg atatgtctga gatacccttg ccgcgcgcct cggccagcct gatggacaat 448801 agctggcgtt ggtagttcca gtcgtagagc gcctcgttgg ccgtcagccc ttcatagcat 448861 tgcagccgga tcagcgtggt atccaacacg actgcaaggg ttttcgcggc tgttgtcttg 448921 ccaacaccgg gctcaccctc caacaacagc ggcctgccca gcgtaaccgc cagatagatt 448981 gccgacgccg tgccggtatc cagcaggtag ttctgttcgt cgaaccggcg gatcacgtcg 449041 tcgggacttg cgaaggtcac gagggcaccg attccagcag ccgtcggtag tcgtcccagg 449101 tgtccacgtc aagcggcacg cagccgtcca cggcgagttc gcgcactggg tggcggccgg 449161 agtgcaccag cttccagaca cccttgtcgc cgtgcagtcg cgcgagttcg ccgaacacgg 449221 tgcggctaaa ccagaatgga tgcccgacgc cgtcggcgta gcggcacacc atgatctcgg 449281 tggccggccc gacgtcgatg atccgccgca gtgtcgccgg cgccacctga ggctggtcgc 449341 ccagcatcag cacgatcccg gtggcccgcg gatgcacccg tgccaacgcg acgcgcagcg 449401 atgccgcaca cccgcgctcg acatcctcga cgaccaccac gtcggtcccg tccagcgcca 449461 tcgcggcacg caccgccgac gccgcaccgc ccagggtgag gatcagctgg tcgaatccgg 449521 cttgccgggc aacgtcgagg gtggccccaa gcaccgtggt atcccgatat ggcagtagct 449581 gtttgggcgt gcccaaccgg ttggagcgcc cggcggcgag taccacaccg gtgatctggg 449641 tcgcggtcat gcgccgccgt tctcgtccgc caacgccttc cggcctctgg ggccgccgcc 449701 gcgcagcgtg gcgatcagtt ccgccgcaat cgacaccgcg atctccgccg gagttttggc 449761 gccgatggcc aatccgaccg gggtatgcac ccgggcccgc tcggcatcgg acaggtccag 449821 cgaatccagg atggacgcgc cgcgtaccgt gctggccacc agcccgacat acccaacgcc 449881 gttatccagc gccgtgcgga tgatttcggc ttcgggcccg ctgtggctgg cgatcacaat 449941 cgcagttggc aaggcgtcgg tgtcggccgg atcggtgtcg cggcgcgcgt cgtagcccaa 450001 caggccgcac agttcgatca acgcgtcggc gatcggggtt tcgccgtaaa tctggatcag 450061 cggggccggc agctgcgggg tcaggaagat ctccagggat ccgccggcca ggcacgggtt 450121 gaccaccaca cacgccccgg gagcttccgg gaagtgcacg tcaccgtcgg gcagcacgcg 450181 cagcagcacg ctctcgccgg cctgcaacac gcccatcgcc gccttgcgga ccgagttctg 450241 cgcgcagtgg ccgccgacaa agccctcgat ggtgccgtcc gccaacagga ttgcctcatc 450301 gcccgggcgg gccgacgtgg gctgctgggc ccgcaccacg gtcgcgcgca cgaacggtgt 450361 ccgcgcggcc accagctgtg cggcccggtc actgatggac atcgacgccc tcgagctccc 450421 ctagatcggt ggtgtggccc ggccctgcat ggcctcccag acccgcgacg gcgtcaacgg 450481 catgtcggcg tgccgaaccc cgaacggcgc caacgcatcc accaccgcgt tcaccaccgc 450541 cggcggggaa cccaccgtgg ccgactcacc gatgcccttg gcgccgatcg ggtgatgcgg 450601 cgacggggtc acggtgtgcc cggtctctag gtgtggcacc tcgagcgcgg tcgggatcag 450661 gtagtccatc aacgatccgc ccagacagtt gccgtcctcg tcgaaggcaa tcatctccat 450721 cagcgccatg ccgatgccgt cgacgatgcc gccgtgtacc tgaccctcga tgatcatcgg 450781 gttgatccgg gttccgcaat catcgacggc caaaaagcgc cgcaccttca ccaccgcggt 450841 gcccgggtcg atgtcgacca cacagaagta ggcgccgtac gggtaggtca gattcgacgg 450901 gttgtagcag acctcggcat ccagcccgcc ctcgatgccc tcgggcagat cgccggcgcc 450961 gtgcgcgcgc atcgcgatgt cggcgatggt caccgcggcc gacgggtcac ccttgacgtg 451021 gaacttccct ttctcccact gtaagtcggc gaccgaaacc tcgagcatgc ccgaggcgat 451081 gatcttggcc ttgtcgcgca ccttgcgggc gaccagcgcc gcggcaccac ctgagacggg 451141 tgtggaccgg ctgccgtagg tgcccaaccc gaacggtgtc tggtcggtgt cgccgtgcac 451201 cacctcgatg tcgtcgggcg caatccccag ctcctcggcg acgatctgcg cgaacgtcgt 451261 ctcgtggccc tggccctggg tctgaaccga aagccgcagc acggctttgc ccgtcgggtg 451321 cacgcgcagc tcgcagccgt cggccatgcc caggccgagg atgtccatgt ccttgcgcgg 451381 cccggcgccc acggcctcgg tgaaaaatga catcccgatg cccatcagct cgccgcgcgc 451441 tcgccgctgc ttttgttcgg cgcgtaacgc ctcgtagccg atcatgttca tcgccttacg 451501 cattgtggtc tcgtagtcgc ccgagtcgta cacccaacca gtcttgctct gatacggaaa 451561 ctggttgggc cgcaatagat tccgcaagcg cagctcggct ggatccatct tcagctcgaa 451621 ggccaggcag tccaccagcc gctcgacgaa gtagaccgct tcggtgatgc ggaacgaaca 451681 cgcgtaggcg accccgccgg gcgccttgtt ggtatacacc gcggtcatgt gacagtaggc 451741 ggcctcgatg tcgtagctgc cggtgaacac cccgaagaac ccggctgggt acttcgccgg 451801 cgcggcctgg gcgttaaacg caccatggtc ggccagcaca ttggaccgga tcgccaggat 451861 cttgccgtca cggttggcgg caatctcgcc gaccatgatg tagtcgcggg cgaatccggt 451921 ggacgtcagg ttctcgctgc ggtcctccat ccatttgacc ggcttgtcca gcagcagcga 451981 cgcgacaatg gcacagacat aaccgggata gatcggcacc ttgttgccga agccgccgcc 452041 gatgtcgggc gagatcaccc gaatcttgtg ttcgggcaac ccggccacca gcgcgtatag 452101 cgtgcgatgc gcgtgcggcg cctggctggt ggtccacagc gtcagctttc cggtgaccgg 452161 atctagatcg gccaccgcgc cacaggtttc catcggcgcc gggtgcaccc gcgggtagac 452221 gatctcctgc tggacaacga cgtcggcctt ggcgaacacc gcctcggtcg ccgccgcgtc 452281 gccggtctcc cagtcgaaga tgtgattgtc gctctttccc tccagatcgg tgcggatgac 452341 cggcgccgac gggtccagcg ccgtgcgggc atccacgacg ggatcccgcg gttcgtagtc 452401 gacgtcgacc aactcgcatg catcgcgggc cgaataccgg tcctcggcaa ccacggacgc 452461 cacctcttgg ccctggaagc gcgtcttgtc ggtggccagc acggcttgta cgtcgttggc 452521 tagtgtcggc atccaagcca ggcccttggc ggccagatcg gcgccggtca ccacggcctt 452581 gactttcgga tgtgcctgcg cggcagtcac atcgatgcgc acgatgcggg catgcgcata 452641 cggcgaacgc aggatggcca gatgcaacat gcccggcagc gcgacgtcgt cgacgtaggt 452701 tccgcgcccg cggatgaatc gcgggtcctc tttgcgcatc atccggccgt gcccgcacgg 452761 ctgctgagcg ttgtcggcta ggtcttccgg cgacggaggg cgtgactcga tcgttgtcat 452821 gactgcgcct ttacggtctg gtgtgctgcc gcccactgaa tggagcgcac gatcgtggtg 452881 tatccggtgc accggcagat ctgccccgag atcgcttccc ggatggtctg ctcgtcggga 452941 tccgggttgc ggtccagcag ggcgcgcgcg gtaatcagca ttcccggggt gcagaagccg 453001 cattgcagcc cgtggcagcg catgaaccct tcctgcaccg ggtcgagctg gccgtcgggc 453061 ccagccaagc cctctaccgt gcggatgctg tgcccggagg ccatcacggc gagcatcgtg 453121 caggatttca ccggcacgcc gtcgacctcc accacgcatg tcccgcagtt gctggtatca 453181 cagccccagt gagttccggt gagccgcagc tgatcacgga gaaaatggac cagcagcatc 453241 cggggttcga cctcggcggt gacgggctcg ccgtttaccg tcatgttcac ctgcatggtt 453301 ggttcccctc tcaggcctcg ggggccgccg gcgcgccgag cacgcgcccg gcggcggtgc 453361 gcagcgtgcg aacggtcagt tcaccggcga ggtgccgctt gtactccgcg gtgccgcgga 453421 cgtcggtcac cggcgtgcaa gcttgcgcgg cgcgccggcc cgcctcagcg aacacctctt 453481 cggtagcggg ttggccgacc agtcccgcgg acagctccgc cagcgcgacc gggtcgggat 453541 tcaccgcggt caaacccacc cgagcggcga ggatcgtctg gccgtcgagc gtgaccgcgg 453601 caccggccgc ggtgatggcc cagtcgccga cccgccgttc caccttggcg tacgcgctgg 453661 aggtgttgtg ccgcagcgga atccgcacct caattaggac ctcgttgtgg gcgagcgcgg 453721 tttcgtacgg cccgaccagg aagtcgtcga tcgctatctc acgttcaccc gagggccctt 453781 tcgccaggca caccgcatcc agaacggtgc acacggtcga caggtcctcg gccggatccg 453841 cctggcagag cgaaccgccc agggtgccgc ggttgcggac caccgggtcg gcgatcaccc 453901 gctcggcatc gcggaagatc gggcacaccg ccgccagcgc atcggagtcc agaatctctc 453961 gatggcgggt catcgcaccc agccgaacca ggttgggatt gttgattccg ccgaccacga 454021 cgtagccgag ttcgggggcc aggtcgttga tgtccacgag gtactcgggg ttggcgatgc 454081 gcagcttcat catcggcagc aggctgtgcc cgccggcgac cacccgcgct ccctccccca 454141 accgatccaa caatccgatg gcgtggtcca cgctggtggc acgttcgtat tcgaaaggcc 454201 caggtacttg catgcgcccc agtgtcggcc gcccgcgaaa agggcgtcaa tgtcgagtta 454261 agtaatcctt gaactcgccc gctacctgcg catcatggtg gatccgtccg gcaatatcgg 454321 ccagcgggcg tccccctccg ccccaacgtc gggcgatgat gtcggccgcg atcgagaccg 454381 cggtctcctc gggggttcgg gcaccgagat ccagcccgat cgggctggac aaccggctca 454441 gctcggcgtc ggtcaggccc gccgcgcgta gccgatccat ccggtcgtcg tgcgtcttgc 454501 gtgatcccat cgcccccacg tatccgacac ccaggcgcag cgccacctcg agcaccggga 454561 cgtcgaactt cggatcgtgg gtgagcacgc agatcaccgt gcgctcgtcg ataccacccg 454621 cctccgcctg ggcagccaga tagcggtggg gccatgcgac gacgacgtca tcggccgtcg 454681 gaaagcgcgc tggcgtggcg aataccgcgc gggcgtcgca gacggtgacc cggtagccga 454741 ggaacgaacc ctgccgcgcc agcgcggcgg cgaagtcgat ggcaccgaac accagcatcc 454801 gcgggcgcgg cgcgtggctg gacacgaaga cctccatgcc ctcgccacgc cgctgcccat 454861 cgggcccata ttcgaggatc tcgctgcggc ccaccgcgag cagaccccgc gcatcgtcga 454921 taaccgccgc atcggcacgc gccgaaccca gcgaacccgt cacggggctc tttgtgtcgg 454981 gccggatcac cagtcggcga cccacccgcc gctcgtccgg atgggcgatg acggtcgcga 455041 tggcgaccgg gcgttgcgcg ccgatgtcgt cggccagctc gcccagctcg ggaaacgtgg 455101 cccgcgatac gggctcgacg aagacgtcga tgatgccgcc acaggtcagg cctaccgcga 455161 atgcggtatc gtcgctgact ccgtagtgtt ccagccgcgg tatcccggtt tgggccacct 455221 cggcggccag ctcatatacc gcaccctcca cgcagccgcc cgacaccgac ccacttaccg 455281 aaccgtccgg ggctaccacc atcgcggccc ccgggggccg cggcgctgac cgcaaggttc 455341 gcaccaccgt cgcgaccccc gcggtgtcac cggcggccca gatcgccatc agctcggcaa 455401 gcacttcacg cacgcttccc aaagtaggct tcagtgcatg accccggctc aacttcgggc 455461 ctattcggcg gtggttcgcc tgggctcggt acgggcggcc gccgcggaac tcggtctttc 455521 cgacgccgga gtctccatgc acgtcgcggc gctgcgcaag gaactcgacg acccgctgtt 455581 taccaggacc ggtgccgggc tggcgttcac gcccggcggg ctgcggctgg ccagccgcgc 455641 ggtcgaaatc ctgggcctgc aacaacaaac cgcgatcgag gtcaccgagg ccgcccacgg 455701 gcgtcggttg ctgcgcatcg ccgcctccag cgccttcgcc gaacacgccg cgccgggcct 455761 gatcgagctc ttctcgtctc gggccgacga cctttcggtc gagttgagcg tgcatcccac 455821 cagccggttc cgcgaactga tctgctcgcg cgccgtcgac atcgcgatcg gcccggccag 455881 tgagagctcg atcggttccg acggctcgat ctttctacgg cccttcctga agtatcagat 455941 catcaccgtc gtcgcgccga atagcccact ggccgcaggc attccgatgc ccgcgctgtt 456001 gcgtcaccag caatggatgt tgggtccgtc cgccggcagc gtagatggtg agatcgcaac 456061 catgttgcgc ggcttggcga ttccggagtc ccagcaacgg atcttccaga gcgatgccgc 456121 cgcgctggag gaggtcatgc gcgtcggggg cgccacgctg gccattggct ttgcggtcgc 456181 caaggatctt gccgccggac ggttggtgca cgtgaccggt cctgggctgg atcgcgccgg 456241 cgagtggtgt gtggcgacat tggcgccttc ggcccgccaa cccgccgtct ccgagcttgt 456301 tggcttcatc agcaccccga ggtgtattca ggcgatgatc cggggcagcg gggtcggggt 456361 gacgcggttc cgcccaaagg tccacgtcac cctgtggagc tagctacttc gacttgaaag 456421 gctcggcgcg ccggtccgcc cgttgacggg gcccggctgc gaggattagc cagttccctt 456481 gtcgcacagg agcgttgagg ctatcgccgt acgcctactg cgtgcgatca gcgcttgctc 456541 gttccatacc acagggtgcg gcccaggtgc aaggttcact gtgcatcgtg cgctggagcc 456601 tttggtgcct gttgcccgtt gaaccgtgat ccagcgcggc tgagggtgtg gtggtgtcgg 456661 gccgctggga ggccgggaat gcggacggta acggtggctc cgcggggttg atcggcagcg 456721 gcggggccgg cggcgacggc ggtagcggcg gggccaccgg cgccggtggc gaaggtggcg 456781 atgctggagc aagcgggtcc ataaacggca acgccggcga ccccggcaac agcggagaac 456841 gcggcgcagt gggcaagccc ggcgcacccg gctgacccga aaatcaccgc atcaccgggc 456901 tcgctcacaa ccgagagcgg acgcgggctc ggcgggctag acgaatcgac gcgccaactt 456961 tctcggatcg aagaagctat acgctttacc cccatgagtg tgtacaaggt gatcgacatc 457021 atcgggacca gccccacatc ctgggaacag gcggcggcgg aggcggtcca gcgggcgcgg 457081 gatagcgtcg atgacatccg cgtcgctcgg gtcattgagc aggacatggc cgtggacagc 457141 gccggcaaga tcacctaccg catcaagctc gaagtgtcgt tcaagatgag gccggcgcaa 457201 ccgcgctagc acgggccggc gagcagacgc aaaatcgcac ggtttgcggt tgattcgtgc 457261 gattttgtgt ctgctcgccg aggcctacca ggcgcggccc aggtccgcgt gctgccgtat 457321 ccaggcgtgc atcgcgattc cggcggccac gccggcgtta atgcttcgcg tcgacccgaa 457381 ctgggcgatc gacaccgtga ccgccgcgcc ggcacgggcg tcgtcggtaa tgccgggccc 457441 ttcctggccg aataacagca ggcattcccg cggcaacgcg gtctgctcca ggcgcgccgc 457501 acccgggacg ttgtccaccg ccaccacggt caagccggcg cccgccgcga actccagcag 457561 cccggtggtg ctgtcgtggt ggcataaccg ctgatagcgg tcggtcacca tggcgccgcg 457621 ccgattccac cgccgacgcc cgacgatgtg cacggtgtgc acggcgaatg cattggcggt 457681 gcgcaccacc gagccgatat tggcatcgtg tccgaagttc tcgatcgcca cgtgcaaggg 457741 gtgacggcgc gtatcgatgt cggcgatgat cgcctctcgg gtccagtacc ggtaggcgtc 457801 gacgacgttg cgagcatcgc cgtcgcgcaa caacaccggg tcgtatcggg ggtcgtccgg 457861 gaggtcgcct gcccagggcc ccacgccgcc ggtcggcgcg ccccattccg taggcccggg 457921 cccaagcgca ctcatcgcga ggtccacaac gcggcgtggg ttcccactgt cgcgaccgtc 457981 gcgtacagca acgcctcgtt gatctcgccc tgcgtacaca gcgacgcgct aatgtggacc 458041 gcgtccagta gcgcgtcagg aaggatcagc accgaggatc catacgtcgc gcacgccggg 458101 ctcagtccgc ggccgggcag tgtcggtgcg gcgagcagca tcatgggacc gccgcgggca 458161 gcggagtcgt cccgataccg atccgggacg gcgtcgacct ccagccccag cagcatgtag 458221 ccattaccgg tgaacgccac cggatctccg agctggggct tacccagctg catgccgtcg 458281 gcgcgccaag ccgagacgct ggtggtcttg accaccagac cggcgtcgtt ttgattggtc 458341 ggcagcatcc ccaccggaaa cgcggccgga taggccgcgg cagtacccgg gatccgatcg 458401 cggggcgagt acgtgtacac gccgcggacg gcgcttcgct ctttcagcgg acccaggcag 458461 accgtgccgg tgagccggcc ggcaggcgcc gatagcggcg acacgacgtc gcgcacgtgg 458521 gccatcgcgt cgccgcagct gccgagcgca gcggattcca tcggatgggc caacgcgccg 458581 tacagcccga agcgaatgtc ttcgggcttg gcgtgcggcg catgcgggtc cgtcggggat 458641 gcatccacgt cgatgagcac atagtcgccg gaccagcgca gattcgacac cgacatgttc 458701 cagcccagca ccgccagcga ttcgccggtc cgggcgctct gggcgccgta ggtgcgaccg 458761 gaatgactcg aacccgagca gcccgtcaga ccggacagga cgacggcccc gcaggtggcc 458821 caggcaacga gaatgcgcac agcgatgccg ccgacgccta atccagcccc agatcggcca 458881 ggcccagcac gctgcggtag cgcagtccct cggcttcgat agcctctgcg gccccggtgg 458941 cgcgatccac cacggtagcc acgccgacaa cctcaccacc cacgtcttgg acggcgtgca 459001 ccgccgtcag cgcggagtta ccggtggtac tggtgtcctc taccaccagc acccgctgcc 459061 cggtaacctc cgacccttcg ataagtcgct gcatgccatg ggctttcgcc gacttgcgga 459121 ccacgaacgc gtcgatcgga cggcccgggg catgcatgat ggcggtcgcc acgggatcgg 459181 ccccgagtgt caggccgccg acaaccgaat agtcccagtc ggcagtgagt tcgcgcatta 459241 gccggccgat cagcgcggac gcccgatggt gcaaggtggc gcgacgcagg tcgacgtagt 459301 agtcggcctc ccggccagac gacagcgtga cgcggccgtg caccaccgac agccggcgca 459361 ccaactcagc caactctgcg cggtcaggtc cggccacggc ttctcctcac gccgccacgc 459421 gggaggccga tcacatgcgg cgtcaccgcg gtggcctcgg gcgtgacatc cgcggtctca 459481 gtgttggtag ttggtggcct gctggccgtt gcgtccggcc ggcggcgggc gccgaacgcc 459541 ttcggagcgg ccttcatcgc ggcggatcgg ctcgggggcc cgacgagccg gatcgggcag 459601 caccgtggtt gccggatccg gctgggcgcg ccgcggcggt aactccgcgg ggccgcccgg 459661 ggccacgggt cggccaggtg ccgcgccgcg cggtcccacc ccggtttgtt ggggcatctc 459721 ctgcggcagc ggcggcaaca cgcgcagcaa gtcgttgaat tggcgcacgg tgcgcaggcc 459781 ctcgtcccac tgggcacggg tgctggcaat cggcatgctg accagcgtcc agttctgctc 459841 gttccacatg atttcggcgc agtcgggcgc ggtgtgcgcg aaggtgacca tccgccgatc 459901 gcaggcgcgc cgggccgcgt ctagattggt ggagtacacc atccgtggcc cgatcgcgcc 459961 cagcagccag atgtcgcttt ctcgtggctc tttcaggcct ttgagccgca ggtcgaccac 460021 gacattggtg cccaccttgc gatgcagcgc gatcacggtg gcgacttcct cgagatcgaa 460081 gatgtacacc gcctcgccgc ggatctgacc cagcaccacg ttatgggcgg caacatcgcc 460141 aactgtggac atcacgccgc gcgtccagcg cttgagtatc tcggtggatt cccgttcgta 460201 gtcgaacccg tgcgatcgcg cccacgactt gcggcgtctg ctgcgcccgc ggcgtcgatc 460261 gatgtcgacg tacagcaaca ccacggcacc gacgaagcac agtgccgaga gcgtgaacca 460321 aagcgggacc atcggtgctt agcctatccg ctggcggccc ggaaccgaga atgcgaccag 460381 gtcacaaccc agtcaccttc cacgccgagc agacgcggaa tcgcactgcg cggacctcac 460441 gcgtgcgatt ccgcgtctgc tcgtcagaca aatcagccca ggatcagcga gtcggcgtcg 460501 gggctgacgt tgaccggcac ggtatcgccg tcgtgcacct ggccggccaa cagcatcttg 460561 gccagctggt caccgatggc ctgctgcacc agccggcgca acggccgcgc cccgtacacc 460621 gggtcgaatc cgcgctgcgc caaccagcgc ttggccggca gcgagacctg cagctgcagc 460681 cgccgctgcg ccagccgctt gcccagctgc gccagctgga tgtcgacgat gcgcaccagc 460741 tcttcggggt tgagaccctc aaagatgagc acgtcgtcga gccggttgat gaactccggc 460801 ttgaacgtag cgcgcaccgc ggccagcacc tgctcggcgc tgccacccga ccccaggttg 460861 gacgtcagga tcaagatggt gttgcggaag tcgaccgtgc ggccgtgccc gtcggtgagc 460921 cggccctcgt cgaggacctg cagcagcacg tcgaacacgt ccgggtgcgc cttctcgatc 460981 tcgtcgaaca gcaccaccgt gtagggacgc cggcgcaccg cctcggtcag ctgaccgccc 461041 gcctcgtatc ccacatagcc gggcggggcg ccgatcaacc gagccacggt gtgcttctcg 461101 ccgtactcgc tcatgtcgat gcggaccatc gcccgctcgt cgtcgaacag gaagtcggcc 461161 agcgccttgg ccagctcggt cttgccgaca ccggtcgggc cgaggaacat gaacgccccg 461221 gtgggccggt tggggtcgga caccccggcc cggctgcgcc gcaccgcatc agagactgcg 461281 gtaaccgcgg ccttctgccc gatgacccgc ttgcccagct cgtcttccat gcgcagcagc 461341 ttggcggtct cgccttccag cagccgaccg gccgggatgc cggtccacgc cgacaccacg 461401 tcggcgatgt cgtcgggacc gacctcctcc ttgagcatca cctgctcccg ggcctgcgcc 461461 tgcggcaacg ccgcgtcgag cttcttctcc acctcgggga tgcgtccgta gcgcagctcg 461521 gcggccttgg ccaggtcgcc gtcgcgttcg gcccgctcgg attccccgcg cagggcttcc 461581 agctgctcct tgaggtcgcg gacgatttcg atcgcgttct tctcgttctg ccagcgggtg 461641 gtgagctcgg ccaacttctc tttctggtcg gccagctcgg agcgcagctt ggccaaccgc 461701 tccgccgacg cctcgtcttc ttctttggac agcgccatct cttcgatctc cagccggcgc 461761 accagccgct cgacctcgtc gatctcgacg ggccgcgagt cgatctccat ccgcagccgg 461821 ctggccgcct cgtcgaccag gtcgatggcc ttgtcgggca ggaagcgggc ggtgatatac 461881 cggtcgctca aagtggcagc tgccaccagc gccgagtcgg tgatgcgcac cccgtggtgc 461941 acctcgtagc ggtctttgag cccgcgcagg atgccgatgg tgtcctccac cgacggctcg 462001 ccgacgtaca cctgttggaa acggcgctcg agcgcggcgt ccttctcgat gtgcttgcgg 462061 tattcgtcca gcgtggtcgc cccgaccagc cgtaactcgc cgcgggccag catcggcttg 462121 atcatgttgc cggcgtccat cgccccctcg ccggtggcgc cggcgccgac gatggtgtgc 462181 agctcgtcga tgaacgtgat gatttggccg gccgagttct tgatgtcgtc gaggacggcc 462241 ttgagccgtt cctcgaattc gccgcggtat ttggagccgg cgaccatcga gccgagatcg 462301 agcgcgacga tggtcttgtc gcgcaagctc tccggcacgt cgccggccac gatgcgctgc 462361 gccaggccct ccacgatcgc ggtcttgccg acgccgggct caccgatcag caccgggttg 462421 ttcttggtgc gacgggacag cacctgcacc acgcggcgga tctcgttgtc gcggccgatg 462481 accgggtcga gtttgccttc gcgggcgcgg gcggtcaggt cggtggagta cttctgcagc 462541 gcctgatagg tcgcctccgg ttcggggctg gtgacccggg cgctgccgcg caccttgacg 462601 aacgcctccc gcagcgcctg cggcgaggcg ccgtggccgg tcaacagctt ggcgacgtcg 462661 gagtcaccgg tggccagccc gaccatcacg tgctcggtgg agacgtactc gtcgtccagc 462721 tcggtggcca gctgctgcgc ggtggtgatc gccgctaacg actcgcggga cagctgcggc 462781 tgcgtgctgg ctccagtcgc ctgcggcaaa cggtcgagca ggcgctgggt ttcggcgcgg 462841 acggtggcgg gctcgacacc gacagcctcc agtagcggtg cggcgatacc gtcgttttgg 462901 gtcagcagcg ccatcagcag gtgagcgggc cggatctcgg gattgccggc ggtcgaagcc 462961 gcctgtaacg ccgcggttag cgccgcctgc gtcttggtcg tcgggttaaa cgagtccacg 463021 acacctccat tcggggtccg ttcgaaatgc ttgtcgggtt gttcaacgcc gtcaatgttg 463081 agtctgttcc gctcaatttt acccacttgt gcatccgccg ccgtttcgcc gcgagcttag 463141 aatcgaggtc cgtgggcctc gaggaccggg acgcgttgcg ggtgttgcaa aacgccttca 463201 agctcgacga cccggaactg gtccgccgct tctatgccca ttggtttgcc ctcgacgcct 463261 cggtacgcga cctgttccca cccgacatgg gcgcccagcg agccgctttc gggcaggcgc 463321 tgcactgggt gtacggcgag ctggtggcgc agcgcgccga ggaaccggtg gcctttcttg 463381 cccagctcgg ccgcgaccac cgcaaatacg gtgtgctgcc aacccagtac gacacgttgc 463441 gccgcgcgct gtatacgacc ctgcgtgact atctgggcca tccaagccgg ggcgcctgga 463501 cggacgccgt cgacgaggcc gccggccagt cgctcaacct gatcatcggg gtgatgagcg 463561 gtgccgcgga cgccgatgac gcgcccgcct ggtgggacgg cacggtcgtc gagcacatcc 463621 gggtgtcacg cgaccttgct gtcgctcggc tgcagctgga ccgcccgctg cactattacc 463681 ctggccaata cgtcaacgtg catgttccgc aatgcccccg ccggtggcga tatctcagcc 463741 cagccattcc ggccgacccg aacgggcgga tcgagtttca cgtccgggtg gttcccggtg 463801 gcctggtcag caacgccatc gtgggtgaaa ctcggcccgg tgaccggtgg cgattgtccg 463861 gtccgcacgg agcctttcgg gtggaccgcg acggcggcga cgtgctcatg gtcgccggta 463921 gcaccgggct ggcgccgctg cgggcgctga tcatcgacct cagccgcttc gcggtgaatc 463981 cgcgcgtgca cctgttcttc ggagcacgct atgcctgcga actctacgac ctgcccacgc 464041 tgtggcagat cgcggcgcac aatccgtggc tgtcggtctc gccggtgtcg gagtacaacg 464101 gtgatccggc ttgggccgcc gactatcccg acgtgtcggc gccgcgcggt ctgcacgtgc 464161 gccagaccgg ccgactaccc gatgtggtct cccgatacgg cggctggggc gatcggcaga 464221 ttctgatctg cggtggaccg gccatggtcc gcgccaccaa ggccgccctg atcgccaaag 464281 gcgcgccacc ggagcgcatt cagcacgacc cactgtcgcg ctagccgggc ggaaatccac 464341 cgtccggtgg cgtcgcttcg acatggcata cggcctttgc tacccggtca ccgctggcta 464401 gcatgagtgc gactgagtgg agcggggatg agcaagttgc tgccacgggg cacagtgaca 464461 ttgctgttgg ccgacgtcga gggatccacc tggctgtggg agacccatcc agacgacatg 464521 ggtgctgccg tggcgcgcct cgacaaagcc gtgtctggtg tgattgccgc ccatgacggc 464581 gtacgcccag tcgagcaggg tgagggtgat agctttgtcc tcgcgttcgc ctgcgcgtcg 464641 gatgccgtgg ccgccgcgtt ggacttgcag cgagcgcggc tcgcaccgat ccggttgcgc 464701 ataggcgtgc acaccgggga ggtcgcgctc cgcgacgaag gcaactatgc cggtccgacc 464761 atcaaccgga ccgcgcgcct gcgtgacttg gcgcatgggg gccagacggt gctctcgggc 464821 gtgaccgaaa gcctggtcat cgatcgcctc ccggacaaag catggctggt tgacctgggg 464881 acgcacgcgc tgcgggatct gtcgcgtccg gagcgggtaa tgcagctgtg tcatcccgaa 464941 ttgcgtatcg atttcccgcc gctgcgggtg gccaatgacg atgtggccca tggtcttccg 465001 gtgcacctga cgcgttttgt ggggcgcggc gcgcagatca ccgaggtgca ccggttggtg 465061 accgataacc ggttggtgac cctgaccggc gccggcggcg tgggcaagac acggctggcg 465121 gcgcagctcg cggcgcagat cgccggtgag ttcggtcgcg cgtggttcgt ggatctggcg 465181 ccgatcacgg accccgactt ggtgccggtc acggtggcgg gcgcgctggg actgcacgac 465241 cagccgggcc gctccacgac ggacaccgtg ctgcgctttc ttggcgggcg tccagccctg 465301 gtggtgctgg ataactgcga gcacctgctg gatgcgacgg cggccttggt gttagcgctg 465361 gtgaaagcgt gccggggggt gaggttgctg gcaacttgtc gtgagccgct ccgggtcgag 465421 ggtgaggtga gctaccgggt gccgtcgctg tcactgagcg atgaagccgt tgagatgttt 465481 tgctaccggg ctcagcgagt ccggccggac tttcgcctca ccgacgacaa ctccgccgca 465541 gtgaccgaga tctgcaaacg gctggacggt ttgccgctgg cgatcgagct ggcggctgcg 465601 cggctgcggt cgatgacgct tgacgagatc atcgatggct tgcgtgaccg gttcgcgctg 465661 ttgaccggcg gtgcgcgcac ggccgcgcac cggcagcaga cgctgtgggc ctcggtggat 465721 tggtcgtaca cgctattgac cgagccggaa cgtaccttgt ttcgccggct tgcggtgttt 465781 gtgggttgct tttttgtcga cgacgcacag gcggttgcct gcagcggcga tgtgcagcgc 465841 taccaggtcc ttgacgagat caccctgctg gtcgacaagt cactggtgat ggccgacgac 465901 aacagcggcc ggacgtgcta tcggttatgc gagacgatgc gccactacgc gttggaaaaa 465961 ctctccgagg ctggcgaggt ggacgccgtg tttgcgcggc accgtgacta ctacacggcg 466021 ctggctgcca gggtcgacaa tcccggaccc tccgattatt cgcactgcct cgaccaagcc 466081 gaaaccgaga tcgacaacct acgtgccgcc tttgtgtgga accgggaaaa ttccgacacc 466141 gagggcgcct tggcgctggc gtcctccctg ttgcgggtat ggatgacgcg ggggcgcatc 466201 caggaggggc gcgcctggtt tgacagcatt cttgccgacg agaatgcgcg tcatctcgag 466261 gtggcggccg cggtgcgcgc ccgggcattg gccgacaagg ccctgctcga catcttcgtc 466321 gacgccgccg ccggtatgga gcaggcccaa caggctttgg tgatcgcgcg cgaggtcgat 466381 gaaccggcgc tgctgtcccg ggcgctcacg gcctgcggct tgatcgcggt agcggtagct 466441 cgcgccgatg cggccgcgtc ttatttcgcc gaggcgatcg acctggcacg agcggtagac 466501 gaccggtgga ggctggccca gatccttacc tttcaggcgg tcgatgcggt cgtggcgggt 466561 gacccggtcg cggcacgccc ggccgcccaa gaggcacgcg agctggctgc cgcgatcggt 466621 gaccactcca atgcgctgtg gtgccgctgg tgtctcggct acgcccagct gatgcggggg 466681 gagctggccg cggccgccgc ccaattcggc gaggtggtgg acgaggccga ggcgtctcag 466741 gaagtgctgc acaaggccaa cagcctgcag ggcctggcct tcgcgctcgc ctaccagggt 466801 gaattgagtg cggctagggc ggcggccgac gccgctctcg aggccgccga gctgggcgag 466861 tacttcgcgg gtatgggcta ctcggcgttg accacggccg cgttggccgc cggcgacgtg 466921 cagacggctc aacatgccag cgaggcggcc tggcggaact tgagtttggc gctgcccctc 466981 tcggcagcgg tgcagcgcgc gttcaatgcc caggctgcac tggctggtgg tgaccttagc 467041 gcagcgcgtc gttggtgtga cgatgccgtg cagtcaatga ccggccatca tctggcgatg 467101 gcgctggcga ctcgcgccag gatcgcggtc gccgagggca agcgggaaga agccgaacgc 467161 gacgcgcata aggcgctcgc gtgcgcggcc gagagcgggg cacacctgga tctccccgac 467221 gtgctcgaat gccttgccgg cctggccagc gacgccggca cccaccatgc ggcggcacga 467281 ctcttcggcg ccgccgaggc tatccgacag cagatcggct cggtccgctt cgcgatttac 467341 cgctcggact atgtgcagtc ggtgacggct ctgcgagatg cgatggggga gaaagacttc 467401 gccgctgcat gggccgaagg tgccgcgttg tcgatcaagg agacgatcgc ctatgcgcaa 467461 cgtggccact cctggcgcaa acgaccggcc accggttggg aatcgcttac tccgaccgag 467521 attgacgtcg tgcgactggt tggcgaggga ctggccaaca aggacatcgc gacgcggctt 467581 ttcgtctcac cgcgaacagt gcaaacgcac ctgacgcacg tctacaccaa actcggcttc 467641 acctcgcgac tgcaactcgc tcaagcggcc gcccgccgta cctgagtgct attgattggc 467701 gttcggggac ggcggtacca cgatgatggt cgctccgggg atcgccgcca gggtcgccgc 467761 aaggttggca accacgccgg gcggcaaacc gggcggtagg ccaggtatgg ctgacccggc 467821 ggccgccgcg gggaccgcgg gcgtctgctg ggcggcgggc ctggtggcag ccgggccggc 467881 tccgccggct ccggccgggg cggctgcagc cggggtcggc gcggagccac cgcgggcggc 467941 aagaccggcc agaccggtgc cggccaaacc cgccgctgcc aggccggcgt aggagccgtc 468001 cgaactcccc gtcatataca tcggagtcgc gccggggaac atcgccgcaa cttggcgcac 468061 cgctggcgga agattccagc tgtgcggcac ggagagtccg ccgatgttag cggagtagcc 468121 ggtgagcgcg gcgaccggcg gctgtggcgg gctggtgacc cgtcctccga cctcagggag 468181 atccccgtca cccgcggccc cagcgggtgt ctggtcggat tcggctgggc aatgcggcgc 468241 cttttgggcc tcgtcgacca cgtcgtggcg gtagatctcg ccgagctgcg ccgccgatac 468301 ccccaaactg ccagccgcga cggctacagc tgcggctacg agaacgtcaa gatcctcgat 468361 ggggttcgga atggcgtcga agggcggcgg taggaaggac tgcagggttg gtagcaggct 468421 cacgtcggtc gccgcggcgg ccggtactac cgcctgagcc gctgtcgcgg cggcgtcact 468481 cagcgacccg gccccgctgg tggtcgccgg tggcggggtg aacggcggca actgcgacgc 468541 ggccgccgag gcgcccgcat agccctccat cgccacgatg tcctgggccc acattgcgcc 468601 gtactgcgct tcactagtcg cgatcgccgg ggtgttctgg ccaaaaacat tggtcttgac 468661 cagtgacagc attgtgcggc ggttggccgc gattaccgtc gggggcacgg tcgccgcgta 468721 cgccgactcg taggcgttcg ccgcggccac ggcctgagcc gcggcctgct cggccgaggc 468781 ggcggtggcc ctcatccacg cgacataggg gacggccgcg gccgccatcg acagtgccga 468841 cgggcccagc cagtcatcgc cggtgagccc ggaaatcacc gaggagtagg aagccgccgt 468901 cgcggtcagt tcgttggcca gccgttgcca ggctgcggcc gcttgcatca ggggcctgga 468961 gccgggaccg gaatagattc tggcggagtt gatttccggt ggtagcgcac cgaaatccat 469021 gactagccgc tcctcacacc ggcagcagcc tcagcgctgc gtggctgggt cgtcacgaaa 469081 gacacggatt ctcctttgcc gaagctgtcc ggtccgcgca gggttcgtcg ctgccgcgag 469141 ccaggcgact gggcgcatac ctattcgggt ggcggcaacc atgtcggagc cggatggatg 469201 gctaagcggt catcaagttc ggatggcttg ggttatcagg tcactcagtt gcccccacct 469261 cctcatagca aaagtacaca ggcagatgtg agcggagttg cgaaaataga caaataattg 469321 agccgagcaa cgaccgagcg agagggtgag ctggtgatcg acggctggac ggaaggacag 469381 cacgaaccca ccgttaggca tgagcgccca gcagctcccc aagacgttcg gcgggtgatg 469441 ttgctgggtt cggccgaacc cagccgggag ctggcgatcg cgttgcaggg cttgggcgcg 469501 gaggtgatcg ccgtcgacgg ctatgtcggc gcgcctgccc accggatagc cgaccagtcg 469561 gtggtggtca ccatgaccga tgctgaagag ctgacggcgg tgatccggcg gctgcaaccg 469621 gatttcttgg tgacggtcac cgccgcggtg tctgtggatg ctctcgatgc cgtcgagcaa 469681 gccgacggcg agtgcactga gctggtgccg aacgcccgtg ccgtccggtg cacggccgac 469741 cgggagggcc tgcgccggct ggccgccgat cagctcggcc tgcccacagc cccgttctgg 469801 ttcgtcggat cccttggcga acttcaagcg gtggccgtcc atgctgggtt tccgttgctg 469861 gtgagcccgg tggcaggggt ggctggccag ggtagctcgg tggtcgccgg gcccaacgag 469921 gtcgagcccg cctggcagcg cgcggcaggc catcaagtac agccgcagac tgggggagtg 469981 agccctcggg tgtgcgccga gtcggtggtc gagatcgagt ttttggtcac catgatcgtt 470041 gtgtgcagtc agggcccgaa cgggccgctc atcgagttct gtgcacctat cggtcatcgc 470101 gacgccgatg ccggtgagtt ggaatcctgg caaccgcaga agctgagcac ggcggcgctg 470161 gacgcggcca agtcgatcgc cgcgcgcatc gtcaaggcgc tcgggggacg cggggttttc 470221 ggcgtcgaat tgatgatcaa cggcgatgag gtgtatttcg ccgatgtcac cgtgtgtcct 470281 gccgggagtg cctgggtcac cgtgcgcagc cagcggcttt cggtgttcga actgcaggcc 470341 cgggcgaccc tgggtctggc ggtggacacc ctgatgatct cgccgggtgc cgcgcgggtg 470401 atcaacccgg accacacggc aggccgggca gcggtcggcg ccgcaccacc tgccgatgcg 470461 ctgaccggtg cgctcggtgt gccggaaagc gacgtcgtga tattcggccg cgggcttggg 470521 gtggcgctgg ccaccgcacc cgaggtggca atcgcccgcg aacgcgcccg cgaagttgca 470581 tctcggctaa atgtgccaga ctcacgcgag tgagctacgc cggagatatc acgccacttc 470641 aggcctggga gatgctcagc gataatccgc gggcggtcct ggtcgacgtg cgctgcgagg 470701 cggaatggcg cttcgtcggt gtgcccgact tgtcgagcct tggtcgtgaa gtggtctatg 470761 tcgaatgggc gacgtccgac gggacgcaca acgacaactt cctcgccgag ttgcgggacc 470821 gcatcccggc ggacgctgat cagcacgagc ggcccgttat tttcttgtgt cgctccggta 470881 accgctccat cggcgcggcc gaggtcgcga ccgaggcggg catcacgccg gcctataacg 470941 tgctggacgg cttcgaaggg catctcgacg ctgagggtca tcgaggcgca acgggctggc 471001 gggcggtggg actgccgtgg agacagggat gaccgacgag tcttcggtcc gcaccccgaa 471061 ggcgctgccc gacggcgtca gccaggccac cgtcggggtg cgcggcggga tgttgcggtc 471121 ggggttcgaa gagaccgccg aggcgatgta cctgacgtcc ggatatgtct acggctcggc 471181 ggcggttgcc gagaagtcgt tcgctggcga gctggaccac tatgtgtact cccgctacgg 471241 caacccaacg gtgtcggtgt tcgaggagcg gctgcggctg atcgagggtg ccccggcggc 471301 gttcgccacc gccagtggca tggccgcggt attcacctcg ctgggcgcgc tgctgggtgc 471361 cggagaccga ctggttgccg cgcgcagcct gtttggctcg tgtttcgtgg tgtgcagcga 471421 gatcctgccg cgctgggggg tgcagaccgt cttcgtcgac ggtgacgacc tctcgcaatg 471481 ggagcgggcg ctttcggtac ccacgcaggc cgtgttcttc gagacgccgt ccaatcccat 471541 gcagtcgctg gtggatatcg ctgcggtgac cgagctggca catgccgcgg gtgcaaaagt 471601 ggtgctggac aacgtatttg ccacaccgct actgcagcag ggctttccgc tgggggtcga 471661 cgtggtggtg tactcgggca ccaagcacat cgacggtcag ggtcgggtgc tgggcggggc 471721 catactcggt gaccgggagt acatcgacgg tccggtgcaa aagctgatgc gccacaccgg 471781 tccggcgatg agtgcgttca acgcctgggt actgttgaaa ggccttgaga cgctggctat 471841 tcgggtgcaa cacagcaatg cctcggcgca gcggatcgcg gagttcctca acggccatcc 471901 ctcggttcgg tgggtgcgtt acccgtacct gccgtcgcac ccacaatatg acctggccaa 471961 gcgtcagatg tccggtggcg gaaccgtcgt taccttcgca ctcgactgcc cggaggatgt 472021 tgccaaacag cgggccttcg aggtgctcga caagatgcgg ctgatcgaca tctccaacaa 472081 cctcggcgac gccaaatcgc ttgtcaccca ccccgccacc acgacgcacc gggcgatggg 472141 cccggagggc cgggccgcga tcgggctcgg tgacggtgtg gtccgcatct cggttgggtt 472201 ggaagacacc gacgacctga ttgccgatat cgatcgggcg ttgagctaac ccgctgcctc 472261 ttgctcggcg tgctcggcct gttcggcggc tgccagcgct ccttgtgcct gctgttccat 472321 caaggtcatc actaacctgg cgtagatcat ctggctggtg atggccatct ggccgcgggc 472381 gcggcccatg aaggagatcc cccaggcgaa cagggctgcg atgcggttcc gatagccgac 472441 caggtagacc aggtgcagca ccagccacgc cagccaggcg aagtacccgg caaactccag 472501 cttgccgacc tgcgcgacgg cgctgtggcg ggagatcgtc gccatgctgc ccttgttgaa 472561 gtaatggaac ggcttgcgat tggctgggtc gtcattgccc ttgaccatgt gtttgatcac 472621 cgtggtggcg tatcgggccc cctggatcgc gccctgagcc accccgggta cgccgggcac 472681 gaacatcaga tcgccgacta cgaagacgtt cggatgtccc ttgacggtga gatcgggttc 472741 cacgatcacc cttccggccc ggtcgatttc ggttccgtcg gatccctcgg cgatcatctt 472801 gcccagcggg ctggccgcca cgccggccgc ccaaaccttg cacgcgcatt cgatgcggcg 472861 ttcgccgccg tccttttcct tgatggtgat gcctttgtag tcgaccgcgg tcaccatcgc 472921 gttgagttga acctcggcgt ccatcttttc cagccgccgt tgtgccttga gacccagctt 472981 tggacccatc ggcggcaaca ccgcgggtgc ggcgtcgagc aggatcaccc ggcactcact 473041 gggcgtgatg gtcctaaacg cgcctgccag ggtgcgctcg gcgagctcga cgatctgccc 473101 agccacctcg acgccggtcg gcccagcgcc gacgacgacg aacgtcaggc gccgctcccg 473161 ttcggcatgg tcggtgctga cctcggcggc ctcgaacgcg cccaggatgc ggccgcgcag 473221 ctccagcgcg tcgtcgatgg tcttcattcc gggcgcgaag gtggcgaatt cgtcgttgcc 473281 gaagtaggac tgctgtgcgc cggcggccac gatgaggctg tcgtacggcg tcaccgtggt 473341 catgtccatc aatttcgacg tgaccgtctg cgctttcagg tcgatcgcgt tgacctcgcc 473401 cagcaacacc cggacgttct tttgccggcg caggatcagc cgggtggtcg gggcaatgtc 473461 gccctcggac aagatcccgg tggccacttg atacagcagc ggctggaaca ggtgggtcgt 473521 tgtcttggag atcagcgtga tgtcgacatc cgcccgttta agcgccttgg ccgcattcag 473581 gccgccgaat ccactaccga tgatgaccac gcgatggcgc ccgccgacgg ccgagggttc 473641 accagatgag agcgtcatgg tcctccttca gtctggtcgc tgtggcgcag ctacacagta 473701 cgactcccgt catgccaacg gcgtaacttt ttgtgggcct tgtgggcctt gtgggccttg 473761 tgggcctttg tcgggccgcc ttcggatcgg acgctcggga tggctgttgg gcgctgcgca 473821 atcccgcgct tcgatcaggc agcgtccggc agtgccatca atggcggcca ggtacacctc 473881 tccgacggct cgacatcgcc ggcccggcag ttacctgcac catggccggg cgatgcggga 473941 gcggctgccg aaggtcgggc aggtgtttgc tgccggggaa atcgactacc acatgtttca 474001 gacgttggtg tatcgcaccg atttgatcac cgacccgcag gtgttggcgc gggtggatgc 474061 cgagctggcg ctgcgggtgc ggggctggcc gtcgatgacc cggggcagct ggccgccgcg 474121 atagatcgga tcgtggcggt ggccgacccc gatgcggtgc gccaggtgcg ggagcgggcc 474181 cgcgatcggg aggtgtcgat ctggaattcc gcggacggca tgggcgaggt gtacgcccag 474241 ttgtatgcca ccgacgccca agccctggat gcgcggctga acgccttggt ggccacggtg 474301 tgtgccggtg atccgcgcag cacagatcag cgccgcgccg acgcgctggg cgcgttggcg 474361 gccggggcgg atcggctggc ctgccgctgc gacaatcccg actgtgccgc cgaggggcgc 474421 ccggtgtcgg cggtggtgat tcatgtggtg gccgagcagg ccagcgtcaa gggccacggc 474481 caggcgccgg cagcgttgct gggcggcgac gggctgatcc cggccgagct ggtggccgag 474541 ttggccaaga ccgccgggct gcagccgatc ccggtcccgg ccgggaccga gccgggttat 474601 cggccctcgg tgaagctggc ggcgtttgtg cgggcccggg atctgacctg tcgggcgccc 474661 ggttgcgacc gcccggccac ccagtgcgac ctggatcaca ccatcgcatt cgccgacggt 474721 ggggccaccc acgcggccaa cctcaaatgc ctgtgccgtc ttcatcattt gctggccacc 474781 ttctgtggct ggcgcgccca gcaactgccc gacggcacgg tgatttggac gctgccgggt 474841 aaccagacct acgtcaccac cccgggcagc gcgctgctgt tcccggcgct gtgcaccccc 474901 accggtgacc cgcccgcacc cgagccggcc cgcgccgacc gccgcgggca gcgcaccgcg 474961 atgatgccgc gccgggccag cacccgcacc caaaaccgcg cccattgcat cgccgccgaa 475021 cgccaccgca accaccaagc ccgccggatt gcccaagcgg ccgtcatcgc caccgagacc 475081 cacggcccac cacccgatcc cgacgacgac ccgccgcctt tttgatgaag tgagtccgaa 475141 tcatctcgac gtggacgggt gcggcgtcgg gtggtcgccg gttggcgcag accctccaga 475201 ggggaggatg aggagctcgg cacctgcgtc ggcggccctg agataggcca gcaggcggtg 475261 gccgaagtcg ctgacgtcgt ggataaggat gtggccgctt tcttctgccg gagtcccggt 475321 gccgttgccg tgccaaacgg ttgttgtcgc gatcgcgacg ccggtttgaa tgagggccgc 475381 tagcaccggc acgggctcga ccttactcgc cgccctcatc acctcgcgtc gctggatctc 475441 gtcctggtcc ggtgaggact tggcggcttt ggccagccga acgagtgcat ggatatgcac 475501 gggctcaagt tgggaaagcg tggccacgat gagtgatgcc ggctcgacct tctggtcatc 475561 ctcgagcgcg gcggcggcag cttgcgcgag gagccggcgc ttggcctcca tactggtgcg 475621 agtggcggcc tcgatcgcct ggctgagaag cggctcgagt tcgggatttt tgtcaatgcg 475681 gctcaacacg gtgtccgcgc cgccgacgct ctcgcatatc tcgcgcgtgg ttgtctcggc 475741 gcggtgccgg gtgcgttcct cgatggcgtc gaacacggtt tgtagcgggc cgccgaccat 475801 cgggatggcg gataggccgg cgctgatcac gacagcgaag acaggtctgg gctcagtcat 475861 agctcgaaca gtagaggccg tcgcggcaag gacggccgac ggcgtgtttt cggcgttgcg 475921 gggtggtcgc cggacacgag gaggcagacc gaggctcgat ggattggatg ccgctcggcg 475981 actacgagac tttccggcat tggtcgggga agccccgcgc atgggggccg caagagtcgg 476041 ggtggcgcgc gtggttcggc gggaagatag tcgatgggct ctgcgaggta ctcgacgagc 476101 acctcgcggt gcggcgtcgt ggtgttccag ccgcgatcgg ctgcgtgccc tggctgagta 476161 gcgaggcggt cgccgagacg ctgctcgcat tgagcgcctt ttgcgtggtg atcgacaagg 476221 gaacctcgtt cccgtcgcga ctgcgtaacc ctgacaaagg gtttcccaac gtcgccctat 476281 tgcggcttcg cgacatggcg ccctccgagc atggctcacg ctgctcctcg gcccgtggtc 476341 gtctatgcct gagcatgagc taggtccggt gcgggcgctc ggctggctac gagaggaccg 476401 caagccgctg ctgaatgcca aattgctcgt gctcggtcat ctggctttga acgtctacga 476461 ccccgataac ggttacggcg aagaggtgtt ggactttgag ccgcggacgg tgtggtgggg 476521 atcggccaat tggaccgtgc gggccgggtc acacttggaa gttggctttg catgcgacga 476581 cccaaccctc gtcgaagaag ctacagcgtt tgtcgctgac gtgatcgcgt tctccgaacc 476641 gatcgacacg acctgtgccg gtcccgaacc gaacctcgtg caggtggagt tcgacgacgc 476701 cgcgatggct gaggcgatgg aggagatggc cgagcccgat gatgacgggg aggattggta 476761 gcgatgctgc ttgatgaacc caaaggtcgt cacgggcacg ctcaaaacgt tcttcgtgta 476821 atcggtgcca tcatttgctg gccaccttct ggggctggcg cgcccagcaa ctgcccgacg 476881 gcacggtgat ttggacgctg ccgggtgacc agacctatgt caccaccccg ggcagcgcgc 476941 tgctgttccc ggcgctgtgc acccccaccg gtgacccacc tcgacccgac ccggcccgcg 477001 ccgaccgccg cgggcagcgc accgcgatga tgccgcgccg ggccagcacc cgagcgcaaa 477061 accgcgccca ctacatcgcc gccgaacgcc accgcaacca ccaagcccgc cggattgccc 477121 acgtggtcac ccaaaccgcc acaaccgccc ccgagactaa cggcccacca cccgatcccg 477181 acgacgaccc gccgcccttc taaccggtag gcgcctgccc aaaacacggg tattgggtaa 477241 aggcacgggg tcctgatgtt gttgtatttc aatgcgattc agctaaggcc cggagcccat 477301 ggctcgtccg gatggtcggt tggggtgatg tgtatgcccc tcctgctcca tcccgtttcc 477361 ttgtatcctc aagtttgtcg tttggcgctg ttgcgacagg aaggcgtcga tcatgcacgc 477421 actgaggttg gtcggcttgg cgatattgac ggcgatcgct ccaatcgcgg tcctcatcgg 477481 aagtagccca gcgcatgccg ataccgatat tggtcaaccg tgctcgccgg aaggcgcgaa 477541 actctggggg aaccccggcc cgatatattg cgagcgcacg gcggacgggc aactgcaatg 477601 ggtatcaatt cctgcttggg cattgtgtgt ggcgttctgc gaccggcctg gcgggccata 477661 ggggcccacc agcggacccc cacggtccgc cggcctgcta gcccggccat gagctcgcgg 477721 tggttcggta gttcgcgttg ggcgcactgc agaagtccga ggccgtgccg gccagcaaga 477781 cgaaatagcc ctcttcgccg cggcgggtgt cggcgaacgg cgggtaggtc caaccgttgt 477841 tgcacatcga agtcggcgtg cccgccgtgg ccttgcgcca accggacttg acgcaggcct 477901 tgagggcgtc gacgtcggaa acggtgctct tggcgacgac cagcatcgcg cctccccatt 477961 gcccgtcggt tcgccggtag gcgcgcgccc cgaagccggc ccccgagccg gctgtgttct 478021 gttggcagcc caacacccgc tgcagcggca agaacaccga cgccacatcg cggtccgcga 478081 cagcggctgc ggccgtgatg aattgagccg cgtccgagcc gtcaacctcc tggacggtgg 478141 gatctccgaa gctaaacagc tgctgtgaga tccgctgtcg cagcgcggcg tcaccgtcgc 478201 cgatgaccgg tccgctgccg ctggatgtca tcgggggcag cgcgccggtg ggttccgcgc 478261 ccgcggtggg cgcggcggcc agcacggcca gggacaaacc gcacgcggcg acaccgacaa 478321 cgcgggcaat gactcccatg gctacctacc tccccggcgg catgggtggg gcgtcgttcg 478381 gtgctacctc ggcaccgatc ttgcgaaata gtatgtcggc ctggttgcgg tagttgccct 478441 ggtcatcgaa ggcctcgggg gcgtaggtga ctgcgaccgc caccgcgacc cgttgcgacg 478501 gcagataggc ctccaccgcg gcgtaaccgg cgaacatggg attttgcagc agccaatggc 478561 cggatatgac gatcccgaga ccatagctgt agccgtcgtt ctgctcgaag caggtggggc 478621 agcccggctg ggcgcgggtc ttgccgcgca gctcggtcga caccatcttc ttgtacgaat 478681 ccgccgagag cagcctgccc gacccgatcc ccaccgcggt ggcctccatg tcgtagatgg 478741 tggtggtttg gatggcgccg tgggtgatgg tccacgacgg attccagaag gtcgattcct 478801 cgtaaaacgg cacgccggca ggaattttca aggccgctcg gcgctcggag gtgaatgcat 478861 gcaaggcggg ctcggggatg gcgggggtat cggagttggc ggtggccgtg aggcccaggg 478921 gggaaaggac cttgcgctgc agcagggttg gcatgtcttg gccggcggcc ttctccaacg 478981 ccagccccag caagaggtaa ttggtgtgcg cgtagttcca gttggtgccc gggtcgtaaa 479041 gcagtggccg tgaagagatt tgatcgagta actcttgtgt tgtccactgc cggaacggat 479101 tagcgtaaag ctcggcatca aacgcctcgt tgccgaggac gtagtcgggg tagccggatg 479161 tcatctgcgc tagttgaccc agcgtgaccc ggtcggcgtg cggaaagtcg ggaagccacc 479221 tggacagctt gtcgtccagg cgcagctttt tttcgtcgac cagtttgagc aacagcgtcg 479281 cgacatagga gattgcgacc gcgccgttgc gaaagtgcat ggcggtggtg gccggcacgc 479341 cggtcatcga gtcgccgacg gcccgcgtca cgacctcctt gccggccacg gtgacccgga 479401 ccagcaccgc cttcagatgc gcttgcgtca tgaagtcacg cacaatccgg atgaccgcgt 479461 cggccttggc cccgttgttg gtcggcgacg aagccggccc ggtgcggggt ggggcgcagc 479521 cggccagcag cccgagagcc aggaccgaac acccgaggcg ccgcaagacg ggcatgcgac 479581 ggtcctaccg gaaggcggcc aagcccgtga aggcctgacc gagcaccagc tgatgcatct 479641 cgggcgtgcc ctcgtaggtg agcaccgact ccaggttgac catgtgccgg atgaccgggt 479701 actccagcga tatcccgttg ccgcccagta ttgttcgagc ggtccggcag attttgagcg 479761 cttcccgggt gttgttgagc ttgccgaagc tgacctgatc ggggcgcagg cccaccctgt 479821 ctttgaggcg ccccagatgc aacgacagca gctgaccctt gtgcagttcc acggccatgt 479881 cgacgagctt ggcctgggtc agctggaagc cggcgatcgg acgtccgaac tgggtgcgct 479941 gtctcgcgta gtcgagcgcg cactgccagg ccgacctggc cgcgcccatc gctccccaga 480001 cgatcccgta gcgcgcctcc gacaggcatg ccagcggcgc cctgaggccg gtcgcgccgg 480061 gcagcatggc gtcggcgggc agccggacat tgtcgagcac cagctcgctg gtgatcgacg 480121 cccgcagcga cagcttgtga ccgatggtgt tggcggtgaa acccggggtg tcggtgggca 480181 cgatgaatcc gcggattccg tcgtcggtgg cggcccacac gatcgccacg tcggcgaccg 480241 agccgttggt gatccacatc ttgcccccgg tgatcaccca gtccggacca tcgcgtcgcg 480301 cccgggtttt catcgcggcc gggtcggagc cgacgtcggg ctcggtgagc ccgaagcagc 480361 cgagcaggtc accggtggcc atgccgggca gccactgccg cttttgctcg tcggagccaa 480421 agctcgcgat ggcgaacatc gccagcgaac cctgtaccga caccagcgac cggatgccgg 480481 agtcggcggc ctccagctcc cggcaggcca ggccatagtg caccgccgac gcgccgccac 480541 agccgtggcc gtgcagctgc attcccagca gtccgagttc gccgaactgt ttggccaaat 480601 cgcgcgcgac cggtaggtcg ccgtcctcga accacgccgc gacgtgcggg gtgacgtgtt 480661 cggcgcagaa ccgcctgacg gtgtcgcgga cggcgatctc gtcgctggat agcgacgcgt 480721 ccagtcccag cgggtcgtcg cggtcaaggg cgggtggtgt cggggtgctc atcactcaat 480781 actgccccgg cccggtagcc tcgcggcatg cgaccacggc gcgcgctggc ggggctggcc 480841 gccgacgtcg tcgccgtgct ggtgttctgc gcggtgggac gtcgcagcca cgccgaagga 480901 ctgagcgtca ccggcctggc ggctacggca tggccatttc tcaccggcac tggtatcggt 480961 tgggtgctgg ctcgcggctg gcggcggccg accgccctcg cccccacggg ggtgatcgtg 481021 tggctgtgca ccatcgtggt cggcatggtg ttacgcaagg tcagttcggc gggtgtggcc 481081 gcgagtttcg tcgtggtcgc gtccgcggtc accgcggtgc tgctgctggg ttggagagcc 481141 gccgttgcgc tgatggcacc gcaccgcgcg gacggctgag aaggccaaat gtcgtcgggg 481201 tgttcgccga ccccgggatt tccgacgtcc gcctccgtgc cctcgaagtc tcagtaccga 481261 gccagatttc acggtcgaga ccccaaccaa caggtcagcg cggtgccacc gcgatcgtga 481321 tgttggcgca ggtatgggcc gcgacctgtc gagctacgac ccgggccgtg ccgctactgc 481381 agcagcgctg cgcgttccgc actgagcgca gtggtgcgac gaggcccgag cacctggggt 481441 tgtcgggcta gccgatccac ccgacgtggc caccagaacc agcggccgag cagagttgcc 481501 agcgcaggcg tcatgtaaga ccggacaacg agggtgtcga acagcaatcc gatcatgatc 481561 gtggtgccga tttggccgac gacgcgtaga tcgctggcca ccatcgagcc catggtgaag 481621 gcaaacacca gtccggcgat ggtgaccact cggccggtgc cggccatggc tcggatcatg 481681 ccggttttga ggccggcccc gatttcttct tggaatcggg ctatcaagag caggttgtag 481741 tcggatccga cggccaacat gacaatgatg gccatgggca gcacgagcca gtgcagtggc 481801 atatgcagga tgtgctgcca gatgaggacc gacaatccga aggctgagcc cagcgaaagg 481861 gcgaccgtgc cgacgatgac ggcggatgcg accacgcttc tggtgatgcc gagcatgatg 481921 atgaagatca ggcaaagcga cgccacgacg gcgatcatga cgtcatacag ggtgccctcg 481981 tggatgtctt tgtaggtgga tgaggtgccc gccagataga tgctggcagc ctgtagcggg 482041 gttcctttca cggcttcgtc ggcggcctgc atgatggggt cgatgtgtga gatgccttca 482101 gcgctcgcgg gatcaccccg atgggtgatg acgaatcgag cgcaggtgcc atcgggggat 482161 aggaagagtt tcagaccccg ctggaagtcg gggttttgga aggcctccgg tggaaggtag 482221 aacgagtcgt cgttgttggc ggcgtcgaat gttcgaccca tgacggtggc gttgcgggtc 482281 atgtcttcca tttgagtgac cagtccggag aacgcgctgg tcagtgtttg ggcaaggtct 482341 ttgacggttt gcatggtggc gatcgtgggg tccagttggg cgagtagttg ccgctgtgtg 482401 gtgtccatgc gttcggtgtc gtcggtgagg ttggcaaggt cctcggtgag cttgtcgacg 482461 ttatccatgc tgttcaacaa ggagcgcatc gaccagcaga tgggaatgtc gaagcagtgg 482521 cgctcccagt acgtgaaact tctgaggggc gccagaagtc gtcgaaatcg gcgattcgat 482581 cgcgtagttc gttggcgttg tctcgcatct gcctggtatg agcgttcata tcatgggtgg 482641 catcggttag ctgtcgcgtc agctcctggg ttcgctgggt gatgtcgatc atgcgttgca 482701 gttgatcggt aagggtggat agatcagcca cgcggtcctt gaggttctgc aggttttcga 482761 tggtcatcgt gctttgcatg ccgagctgaa acgggatcga cgagtggtcg atcggagccc 482821 ccaacggtct ggtaatgctt tgcacccgcg cgatccccgg cgtatggaag acggttttgg 482881 cgatcctgtc caggatgggc atgtcggtcg ggttacgcag gtcgtgatcg gcctcgacca 482941 tcaggacctc cggttccatg cgggcttgcg gaaagtgacg gtctgatgcg aggtaaccga 483001 tgttggatgg cgccgcgctg gggatgtagt agcgctcgtt gtagttggtc tggtatttcg 483061 gcaaggcgag cagtccgatc agcgctatca gcagggtggc ggccaagacg gggccgggcc 483121 atcgcacgac gaccgtgccg atccggcgcc aacgccgttt cgttgtcgct cgtttggggt 483181 cgaatagccc gaatcggctg gcaacggcga tgatcgcggg cgccagcgtc agagacgcca 483241 acatcaccgt gaccaaaccg atcgcgcatg gcgatgcgag ggtattgaag tagggtagcc 483301 gggtaaagcc gaggcagtac atggcgccgg cgacggtgag gccagatgcc aggaccacgt 483361 gtgccgtccc accaaacatg gtgtagtaag cggcttctcg gttctggcca gtcgcacgtg 483421 cctcttgata gcgtccgacg agaaagatga tgtagtccgt cgaagccgcg atcgtgagcg 483481 ccaccaacac gttgacagtg aatgtagaca ggcccatgag gtcgttgacg gcaaaagtgg 483541 agatgatgcc gcggaccgcc agcagctcga gcccgaccgt cagcagcatg atcagagcag 483601 cagaaagcga gcggtaggcg atgaacaaca tgatcgcgat caccgcaatg ctgatgccgg 483661 taatcgtgtg aaggctgcgg tcgccgtata caactcgatc ggcgccgagt ggacccgggc 483721 ctgtcacgta agccttgatc cccggcggcg gtggcacgct gtccacgatg cgttgcacgg 483781 cggcgacaga ctcgttggcc tgcgagccgc cctgatcacc agtgaggttc agctggacat 483841 atgctgcctt gccgtcagcg ctctgcgatc ccgccgcggt cagcggatcg ccccagaagt 483901 tctcaatgtg ttggacgtgg gtggtgtctt gtgacagctt ggtcactagc acgtcataga 483961 agcggtgcgc ctcatcaccc agcttctctt ggccctccag cagcaccatt gcagtggtgt 484021 cagaatcgaa ttgctgaaag tccttgccga tgcgcttcat ggcgatcagt gacggagcat 484081 cgtgggggcc taacgccacc gaatgtgtcc tagcgaccga ctgtagctgc ggcgcaacga 484141 cgttcaccac aatagtcagc gccacccaga acagaatgat gggtagcgac aacgcatgga 484201 tcgtccgggc ggcggccgac aggtgcccgg ctagacgttg gctcctcacg cggatttcac 484261 caggcaactg gtgtgcgcgt ggtaagcatt cacaatgcgc tcctcgcgga tcacctcgtt 484321 gacagtgatg cgacagccca ggcttgcacc gtcaccgcgg gcaaccacgt tggcgactac 484381 ggcggtcaag gtggtcacga tggtaaatga ccacgggacc gcggcattga cgacctcatg 484441 cggctgggca tcggcatcca ggtaattgat gctggcgacc gtccctggcg ggccgaagac 484501 ctcgtagaga acatgcttcg ggtaaaacgc gatgatcggg tcgaggttgc cggtgtcggg 484561 cgcatgttga tgtgagccaa acaccgagtg cagccgcgag accgtcacgg ccgcgacagc 484621 cacaacgatg actatcacca tcgggatcca gaagcgtttg gcaacgccga acatttacct 484681 tcctgattcc atcgcttcaa caagccgccg cgtgaggacg aaccctaccg gggagacgcc 484741 actcgttggg gcagttttgt acactccgtt tacatcgttt acggcgaggt caaaaaattt 484801 cggttaatcg tacaggctgc cgctcggtca tctatagtca tcgatccaga gccgcttcga 484861 ccagcctgtg gtcgaagcgg atcagttgaa ccggaggagt ggaaacatga gcggcccgac 484921 gggaaattcg atgcccagac agctcggcgg ccgggtggcc aggatcgtta ccgggtaagg 484981 gatcgccact cccaatgtct gttatatcca cgttgcgcga ccgtgcgacc acgactccaa 485041 gcgacgaagc ctttgtgttc atggattacg acacaaaaac cggcgaccaa attgaccgaa 485101 tgacgtggag tcaattatat tctcgcgtca ccgccgtgtc tgcgtatcta ataagttatg 485161 gccggcatgc tgaccgacga aggaccgcag cgatatcagc tccgcaaggt ctggactatg 485221 ttgcaggatt tctaggagca ctgtgcgccg gatgggcgcc ggttccgtta ccagaaccgc 485281 tgggcagcct acgcgataag cggactggac tggctgtact cgactgtgcc gccgacgtcg 485341 tgctgacgac gtcgcaagcc gaaacgcggg tcagggccac gatagctaca catggggcgt 485401 ctgtaactac gccggtcata gcgttggata cattggacga gccatccgga gataactgtg 485461 atctcgattc tcaactatca gactggagtt cgtatttgca gtatacttcg ggttcaacgg 485521 ccaacccccg tggtgtggtt ttatccatgc gtaacgttac ggaaaatgtc gaccaaatta 485581 tccgtaacta ttttcgccat gagggcggcg cgccgaggtt gcccagctcg gtcgtttcgt 485641 ggttgccgct ttaccatgac atgggtttaa tggttggcct ctttattccg ttgtttgtcg 485701 gatgtccggt tatcctgacg agcccagagg catttatccg taagcctgcc agatggatgc 485761 aactgcttgc taaacaccag gcgccatttt cggccgcgcc gaacttcgca ttcgatttgg 485821 ccgtcgctaa aacttccgaa gaggacatgg cggggctgga tttaggccac gtaaatacaa 485881 taatcaacgg cgcggagcag gtacagccaa atacaataac caaattcctc cgccggttcc 485941 gtccctacaa tttgatgccc gcagcggtca agccatcata cgggatggct gaagcggtgg 486001 tttacctggc gacgacgaag gcgggatcac ctccaacgtc aaccgagttc gatgctgata 486061 gcttggctcg aggccacgcg gagctaagta ctttcgaaac tgagcgtgca acgcgtttaa 486121 tacgctacca cagcgacgac aaggaaccgt tgcttcggat tgtcgatccg gactcgaata 486181 tcgagctcgg accgggacgt atcggcgaga tttggattca cggtaagaat gtgtctaccg 486241 gatatcacaa tgcagacgac gcgctcaatc gagataagtt ccaggccagc atccgggagg 486301 cctctgcggg aacgccaagg tcgccgtggc ttcgcacggg agacttggga ttcatagtag 486361 gagatgagtt ctacatcgtc ggccgtatga aagatctcat tatccaagac ggtgtaaacc 486421 attatcccga tgatatcgaa actacggtca aggagtttac cggtggccgg gtcgcggcat 486481 tttcagtatc cgacgacggg gtggagcatt tggtcattgc ggccgaggta aggactgagc 486541 atgggcccga taaagtgact attatggatt tctcgacgat caaaaggctg gtcgtatcgg 486601 cgttgtcgaa attacatggc ctgcatgtaa cagattttct tctggtaccg cccggggcgc 486661 taccgaagac caccagcgga aagattagcc gggcggcatg cgcaaagcag tacggagcaa 486721 ataagttgca acgagtagca acgttcccat gacagacggt tcggtcactg cggataagct 486781 tcaaaaatgg tttcgagagt acttgtccac gcatatcgag tgtcatccaa atgaggtcag 486841 cctagacgtt ccgattagag atttaggttt gaaatcgatt gatgtcttag cgattcccgg 486901 cgacctcggt gacagatttg ggttttgtat tcccgatttg gccgtttggg ataatcctag 486961 cgctaatgat ttgattgata gtctgttgaa ccagcgtagt gctgactcgt taagagagag 487021 tcatggacac gccgacagga acacgcaggg tcggggcagc ataaacgagc cggttgcggt 487081 catcggagtg ggctgtcgat ttccgggaga tattgacggc ccggaacggc tatgggactt 487141 tctgaccgag aagaagtgtg cgataacagc gtatccagat cgtgggttca cgaatgctgg 487201 aactttcgcg gagtccggag gctttttaaa ggatgtcgcg ggtttcgata atagattttt 487261 tgatatcccg ccggacgagg ctctgcgaat ggatccgcaa caacggttgt tactggaggt 487321 ctcttgggaa gcgttagagc atgcaggaat tattcctgag tcattaagac tttcacgtac 487381 gggcgtattc gttggggtgt cgtcaactga ctacgtccgg cttgtgtcag ctagcgctca 487441 gcaaaagtct actatttggg ataacaccgg cggttcttcg agtattattg ccaatagaat 487501 ctcatacttt ctcgatattc agggtccgtc cattgtcatt gacacggcat gctcgtcatc 487561 cctggtcgcc gtgcatctag cctgtcgaag tctcagtacc tgggactgcg atatcgcact 487621 tgtcggtggg acgaatgttc ttatttcacc agaaccatgg ggtgggttta gggaagcggg 487681 catcttgtcg cagacaggct gctgtcacgc gttcgataaa tccgccgacg ggatggtacg 487741 cggtgaggga tgcggagtta tcgtgctgca gcgcctcagt gatgcacgcc ttgagggccg 487801 gcggatatta gcgattctga cgggttcagc ggtcaatcag gacggtaagt ccaacggtat 487861 tatggcgcca aatcctagtg cgcaaattgg tgttcttgaa aatgcatgca agagcgctcg 487921 cgtcgatccg ctggaaatcg gctacgtcga ggcccacggg accggaacgt cgttagggga 487981 taggatcgag gcgcacgcct taggcatggt ctttggtcgc aagagaccgg gatctgggcc 488041 cctgatgatc gggagcatca agccgaatat cggccatctg gaaggtgcgg ctggcatcgc 488101 cggattgatc aaggcgggtg ttgatggttg agcgtggctc gctgcttccg agcggggggt 488161 ttacggagcc aaatccagct atcccattca cggaattggg cctgagagtt gtagacgaac 488221 ttcaggagtg gccggtggtg gcgggtcggc cgcgccgggc tggggtgtca tcgttcggct 488281 ttggcggcac caatgcgcat gtgattgtcg aggaagctgg ttcggttggg gcggacacgg 488341 tttcgggccg cgcggatgtt ggcggttccg gtggtggggt ggtggcgtgg gtgatttcgg 488401 ggaagacggc ttcggcgttg gctgctcagg cgggtcggtt ggggcggtat gtgcgggctc 488461 ggccggcgct tgatgttgtt gatgtggggt attcgttggt gagcacgcgg tcggtgtttg 488521 atcatcgggc ggtggtggtc ggccagactc gcgatgagtt gctggctggg ttggctgggg 488581 tggttgctgg tcggccggag gctggggtgg tctgcggtgt tggcaagccg gcgggcaaga 488641 cggcttttgt gtttgccggt cagggctcgc agtggctggg tatgggtagc gagctttatg 488701 ctgcctaccc ggttttcgcc gaggccctcg atgctgtggt ggacgagttg gaccggcacc 488761 tgcggtatcc gctgcgcgat gtgatctggg ggcacgacca agatctgttg aataccaccg 488821 aattcgccca gccggcgctg tttgcggtgg aggtggcgct gtatcggctg ctcatgtcgt 488881 ggggggtgcg gccggggttg gtgctgggtc attcggtggg cgagttggcc gcggcgcacg 488941 tcgccggggc gctgtgtttg ccggatgcgg cgatgctggt ggccgcgcgt ggacggttga 489001 tgcaggcgtt gcccgccggc ggcgccatgt ttgcggtgca ggcccgtgaa gacgaggtag 489061 cgccgatgct ggggcacgat gtgagcatcg cggcggtcaa tggtccggct tcggtggtga 489121 tctctggtgc ccacgatgcg gtgagcgcga tcgctgatcg gctgcgcggc cagggccgtc 489181 gggtccaccg gttggcggtc tcgcatgcct ttcactcggc gttgatggag ccgatgatcg 489241 ctgagttcac agccgttgcg gccgaactgt ctgtgggctt gcccacgatc ccggtcattt 489301 ccaatgtgac cgggcagttg gtggccgacg acttcgcctc agctgattac tgggcccggc 489361 atatccgggc ggtggtgcgg tttggcgaca gtgttcgtag tgcccactgc gccggtgcca 489421 gtcgtttcat cgaagtcggg cccggtggcg gcttgacgtc gttgatcgag gcatcgctgg 489481 ccgacgcgca gatcgtgtcg gtgcccacgc tgcgcaaaga tcggcccgaa ccggtcagtg 489541 tgatgacggc ggcggcccag ggcttcgtct cggggatggg cctggattgg gcctcggtgt 489601 tttccgggta ccggcccaag cgggtggagt tgccgacgta tgccttccag catcaaaagt 489661 tctggctcgc accagcccca tcggtcagcg accccaccgc cgccggccag atcggggcta 489721 gcgatggtgg tgctgaactc ttggcgtcct ccgggtttgc cgcccggctg gccggtcggt 489781 cggccgacga gcaactcgcc gcagcgatcg aggtggtatg tgagcatgcc gcagcggtgc 489841 tggggcgcga cggcgctgcc ggactcgacg ctggccaggc gtttgccgat tcgggattta 489901 attccttgag tgccgtggag ctacgtaacc gcttaacagc cgtcaccgca gtaacgctgc 489961 cggccaccgc gatcttcgat caccccaccc cgaccgaact agcccagtat ctgatcactc 490021 aaatagacgg tcacggcagc tccgccgccg cagcggcaaa cccggcggag cgaatcgatg 490081 cgctcaccga tctttttcta caagcttgcg atgcgggtcg ggatgccgat ggttggaaga 490141 tggtcgccct ggcgtcgaat acgcgcgagc gcatgagctc accggttcgg aacaacgtat 490201 cgaagaacgt cgcactgctg gcagatggta tctccgatgt ggttgtaatt tgtatcccaa 490261 ctctaactgt gctatcggat cagcgtgaat atcgagatat tgcgaatgcg atgacaggcc 490321 gccattcggt ttattcgctt acgcttcccg ggttcgattc gtctgatgca ctgccgcaaa 490381 acgcggatat gattgttgaa accgtatcta acgcaattat tgatgtggta ggcggcagct 490441 gccgttttgt gctgtcgggc tattcatcgg gtggggtgtt ggcctatgcc ctctgctccc 490501 atctgtcggt caagcaccag cggaatcccc tcggagtcgc actcatcgat acatatctgc 490561 ctagtcagat cgccaatcct tcaatgaatg aagggttcag ccccaacgat actgggaagg 490621 gcctttcccg tgaagtaatt cgagtggcca gaatgttgaa tcggttaact gccacccgac 490681 tcaccgcggc agccacctat gctgcaatct ttcaggcctg ggaaccaggt agatcaatgg 490741 ctccggttct taacatcgtg gcgaaggacc gaatagctac cgtcgaaaat ttacgcgaag 490801 aacgaatcaa ccggtggcga actgctgctg cagaggcggc ctattctgta gccgaagtac 490861 ccggggatca tttcggagtg atgagcacct cgagtgaggc aatagctacc gaaatacatg 490921 attggatttc tgggctcgtt cgagggcctc atccgtagct ttgcgaatcg gcccgtgcca 490981 cagctcgccg tgaccaggtg ccaggatgtt ggtctctagc aaagccagcg cagccaggct 491041 gcggatactg ttctgctggc tgtggctgaa caccgcgggc agtagctgtg gcccgcggtg 491101 acgcaacatc ggatgaccag tgatcagcgc atcgccgctg gccagcacac cgtcgacgac 491161 atacgagcag tgaccgctgg tgtgtcccgg ggtgaaaatc gccatcggtt gacccggcag 491221 cccggcggcc gcttcggcgg tcagcggctg ggcggtcgga atgccgtcgc cggtcaggcc 491281 gccgcggcga agcaagtgaa taccccagac cgccacacgg ggccgccagc tgcgcagcgc 491341 aacatcgaaa accgaggcat tctcccggta ttcccgcttg gcgtgaccta cctcctcggc 491401 gtggcagtac accggcgtgc tgtgctcacg agcaaaccag attgccgagc ccaggtggtc 491461 gatgtgcgcg tgggtgagca cgatggcgcg cacgtcaccc ggtgtgtagc ccagtttgtt 491521 cagcgaggcc agcacctccg cacggtcgcc gggatagccg gcgtcgatca gcagcacgcc 491581 ggtgtcgtcg gtgactagca cccagttgac cgcgtggccg cgagcgaggt gaaccttgtc 491641 ggtgatctga acaagctccg ccatgcccgc gagtctagga gcgagcgcga gcgcggcaag 491701 ccgggtgccg cgggtcgcga ccatgggata tggagcgatc gcgagcgcgg cgaagccggg 491761 cgtggcgggt cgcgtttatg gcataggagt agaaagaact ggtggctgaa ctgaagctag 491821 gttacaaagc atcggccgaa caattcgcac cgcgcgagct cgtcgaacta gccgtcgccg 491881 ccgaagccca cggcatggac agcgcgaccg tcagcgacca ttttcagcct tggcgccacc 491941 agggcggcca tgccccgttc tcgctgtcct ggatgaccgc tgtcggcgaa cgtaccaacc 492001 ggctgctgct gggcacttcg gtgctgaccc ccaccttccg ctacaacccc gccgtcatcg 492061 ctcaggcttt cgccaccatg ggatgcctgt acccgaaccg tgttttcctt ggcgtgggca 492121 ccggtgaggc gctgaacgaa atcgccaccg gatacgaggg cgcctggccg gagttcaagg 492181 agcggttcgc ccggctgcgt gaatcggtgg ggctaatgcg gcagctgtgg agcggtgacc 492241 gcgtcgactt tgacggcgac tattaccggc tcaagggtgc ctcgatctac gacgtgcccg 492301 acgggggcgt gcccgtctac atcgccgccg gcggcccggc ggtggccaag tacgccggcc 492361 gcgccggtga cggcttcatc tgtacgtccg gcaagggcga ggagctctac accgagaagc 492421 tgatgccggc ggtacgagaa ggcgccgctg ccgctgaccg atccgtcgac ggcatcgaca 492481 agatgatcga aatcaagatc tcctacgacc ccgacccgga gctggcattg aacaacaccc 492541 ggttttgggc gccgctgtcg ttgacagctg agcagaagca cagcatcgac gacccgatcg 492601 agatggagaa ggccgccgat gcgctgccaa tcgaacagat cgccaagcgc tggatcgtgg 492661 cgtcggaccc cgacgaagcc gtcgaaaagg taggtcaata cgtgacatgg ggcctgaacc 492721 acctggtatt tcacgcacca ggacatgacc agcgccggtt cctggagctc ttccagtcgg 492781 acctggcacc caggttgcgg cgacttggct gactcctcgg cgatctacct cgccgcacca 492841 gaatcgcaga cgggtaagtc gacgattgca ctggggcttt tgcaccgact gaccgcgatg 492901 gtcgccaaag tcggtgtgtt ccggccgatt acgcggctct ctgcggagcg ggactacatc 492961 ctggaactac tgctcgcgca caccagtgcg ggcctgccct atgagcggtg tgttggcgtg 493021 acctaccagc agctgcatgc tgaccgcgac gacgcgatcg ccgaaattgt cgattcgtat 493081 cacgcaatgg ccgacgagtg tgacgcggtg gtggtcgtcg gcagtgacta caccgacgtc 493141 accagcccca ccgagctctc ggtcaacgcc cggatcgcgg tgaacctcgg cgcgccagtg 493201 ttgttgacgg ttcgggcgaa ggaccgcacc cccgatcagg tcgccagcgt cgtcgaggtc 493261 tgcttggccg agctggacac ccagcgcgct cataccgcgg cggtagtggc gaaccggtgc 493321 gagctgtccg cgataccggc cgtgaccgac gcgctgcgca ggttcacccc gcctagctat 493381 gtagtgcccg aggaaccact gctgtcggcg ccgaccgttg ccgagttaac gcaggctgtg 493441 aacggggcgg tggtaagcgg tgatgttgcg ctgcgcgaac gtgaggtgat gggcgtgctg 493501 gccgcgggta tgaccgccga ccatgtgttg gagcggctga ccgatggcat ggcggtgatt 493561 actcccggcg accgctcgga cgtggtgttg gccgtcgcta gcgcccatgc ggccgaaggg 493621 tttccgtcat tgtcatgcat cgtcctcaat ggcgggttcc agttgcatcc ggcgatcgcc 493681 gccctggttt ccggcctgcg attgcggtta cctgtcatcg ccaccgcgtt gggcacctac 493741 gacaccgcca gcgctgccgc gtcggcccgc gggctggtaa cggcgacgtc gcaacgcaag 493801 atcgacaccg cgttggagct gatggaccgc cacgtggacg tcgccggtct attggcgcag 493861 ctgaccattc ccatccctac ggtcactaca ccacagatgt tcacttatcg gctgctgcag 493921 caggcccgtt cggacctcat gcgcatcgtc cttcccgaag gggacgacga tcgcatcctc 493981 aaatcggcgg gccgcctgct tcagcgcggc atcgtcgacc tgaccatcct gggcgatgaa 494041 gccaaagtcc gtctgcgggc agcggaactc ggtgtggacc tggacggcgc cacggtaatc 494101 gagccatgcg caagcgaact gcacgatcaa ttcgccgacc agtatgcgca gttgcgtaag 494161 gcgaagggaa tcaccgtgga gcatgcccgc gaaatcatga acgatgccac atatttcggc 494221 accatgctgg tgcacaactg tcatgccgac ggcatggtat cgggtgctgc tcacaccacg 494281 gcgcacaccg ttcgtccggc gctggagatc atcaagaccg ttccgggcat atccaccgtg 494341 tccagcattt tcctgatgtg tctgccggat cgggtactgg cgtacggcga ctgcgcgatc 494401 atcccgaacc cgacggtgga gcagctcgct gatatcgcca tctgctcggc acgcaccgcc 494461 gcacagttcg gcatcgagcc ccgggtggcc atgctgtcct actccaccgg tgactcgggg 494521 aaaggtgccg acgtcgacaa ggtcagagcg gcaacggagt tggtgcgcgc tcgggagccg 494581 cagctgccgg tcgagggtcc cattcaatac gacgccgcag tggaaccgtc ggtcgcggcc 494641 accaagttgc gcgattcgcc ggtggccggc cgcgcgacgg tgctgatctt ccccgatctc 494701 aataccggca acaacaccta caaagcggtg cagcgttctg cgggtgcgat cgcgatcggc 494761 ccggtgctgc agggcttacg caagccggtg aacgacctat ctcggggtgc actggtcgac 494821 gacatagtca acaccgtggc catcacggcg attcaggcgc agggcgtcca tgagtagcac 494881 cgtgctggtg atcaattccg gctcgtcgtc gctgaagttc cagctcgtcg agccggtcgc 494941 cggcatgtca cgtgccgccg ggattgtcga gcggatcggc gagcggtcat ccccggttgc 495001 cgatcacgcc caggcgctgc atcgcgcatt caagatgttg gccgaggacg gaattgacct 495061 gcagacctgc gggctggtgg cggtcggaca ccgggtggtc cacggcggca cggagtttca 495121 ccagccgacg ctgctggatg acacggtgat cggcaagctt gaggagctgt cggcgctggc 495181 cccgttgcac aacccgccgg cggtactggg catcaaggtg gcacgcagat tgctggccaa 495241 tgtcgcgcac gtcgcggtgt tcgatacggc ctttttccat gacttgcccc cggcggccgc 495301 gacctatgcc atcgaccgcg acgtcgccga cagatggcat atccgccgct acggatttca 495361 tggcacttca caccaatacg tcagcgagcg ggccgccgcc ttcctgggcc gcccgctcga 495421 cggtttgaat cagattgtgc tgcatctggg taacggtgcc tccgcctcgg cgattgcccg 495481 cggccggccg gtggaaacgt cgatgggcct gacaccgctt gagggcttgg tgatgggcac 495541 ccgcagtggc gacctggacc cgggcgtcat cagctacttg tggcgcaccg cgaggatggg 495601 tgtcgaggac atcgaatcga tgctcaacca tcggtccggg atgttggggt tggcggggga 495661 gcgggatttt cgccgtctac gactagtgat cgaaaccggg gaccggtcag cacaattggc 495721 gtatgaggtg ttcatccacc ggttgcgcaa gtaccttggt gcctatctgg cggtgttggg 495781 ccacaccgat gtggtgagct ttaccgccgg gatcggcgaa aacgatgcgg cggtgcggcg 495841 ggacgcgttg gctggccttc aggggctagg tatcgcactc gaccaagacc gcaacctggg 495901 cccggggcac ggcgcccggc ggatttcgtc agacgattca ccgatcgccg tgctggtggt 495961 tcccacgaat gaagaactgg ccatcgcccg cgattgcctg agggtgctgg gcggacgccg 496021 agcgtgaatc atacgacagc ccgccggcgt gtcgcgtcgt gcgattcaca ctcgggcggc 496081 ttagaacgtg ctggtgggcc ggaccttgtt ggccatgtcc accagcgtgt agcgatgccg 496141 ttgagtggga gctacccggg ccaggctgcg cagtgacgcc tcgacaccca gccgcagccc 496201 gtgactggtg aacgggaaac cgaggatgtg gttggtgctg gccttgttgt ccttcagcca 496261 gtccagcgcg ccacccagca ccagggcgcg gatctgcagc acgcgtggtt cggtcggggg 496321 cagcgcctcc actcttcggg cggcgtcgcg gatctgttcc tcggtgactt cactcgttga 496381 ccggccggac aacagagtca ccgcgctggt cagccgtgcc gtggtgaaat gccgagaagt 496441 gggcggtacc tcgtcgagcg tgcgcacggc gccgacccga tcaccttcgg ccgaccgggc 496501 tctggccagt ccgaaagccg ccgagatcac gccgtcgttg gtgctccaca ccgtctgata 496561 gaacttgtgt tcgtcggtgt tgccggctag ttcggcggtg gcggccaggg cgagcttggg 496621 cgccagctcg ccgggaaagg tatccagcac ctcggtgaaa tgtttggtgg ccgagtcata 496681 gtcgccggtg agcagctcgg cgacggcccg gtaccagacc aatcgccatc gccagccaac 496741 gcgttcggcc agatcgtcga gttttcgggt ggccttggcc acatcgccga gatccagcag 496801 cgcgcggact tccattagcg gcagctccac tgactcggag aagtcgacgc cgtcggcgtc 496861 cagcgcaccg tggcgggccg cgcgcagcga gtctagggtc tgcaccggct gggagagcac 496921 cgtggcctgc aggaccgaag ctgcgacgtc ggtcggatcg accagcggca ccgacagcgc 496981 ggtcacgatc tcgttggcgg tcagcttctc cgcgtgcacc tgcccgtcca gatacacgtc 497041 ggtgtgcgcc accagcaggt ccactccaaa tgtcgaccga ctgggactga agatcgttga 497101 tagccctggc cgcggcaccc cggtgtcctg ggcgaccacc tcccgcaaca cgcccgtcaa 497161 ttgcgcggac atctcttcgg cggtggtgaa ccgttgccgc ggatcggggt cgatggccct 497221 gcgcagcaac cggccgtaag agtcgtaggt tttcagcacc gggtcgtctt cgggtagccc 497281 atccacataa cggccattgc gggtgggcag gtccagcgtg agcgccgcga gcgtgcgtcc 497341 cacggtgtag atgtcggtgg ccaccgtcgg accggtccgc acgatctcgg gcgcctggaa 497401 gcctggggtc ccgtagaggt agccgaacga gttgatccgc gataccgcgc ccaggtcgat 497461 cagcttgagc tgttcctcgg tcagcatgat gttttccggc ttcaggtcgt tgtagaccaa 497521 gccgatggaa tgcaggtagc tcagcgccgg caggatctcc agcaggtagg cgatggcctc 497581 cgcgacgggc agtttctgac ccttgctgcg tttgagcgat tgcccgccga cgtattccat 497641 cacgatgtag ccgaccggat ccccgtgcct gtcggtgtgc tcgacaaagt tgaagatctg 497701 cacgatcgac gggtgcacca cctcggccag gaactggcgt tcggccatcg ccattgcctg 497761 cgcttcggca tcaccggaat gcaccaggcc cttgagcacc accggacggc cgttgacatt 497821 gcggtcgaga gcgaggtaga tccagcccag tccgccgtgc gcgatgcagc ctttgacctc 497881 gtactggccg gcgacgatgt ccccgggatt tagctgcggc aggaacgaat acgggctgcc 497941 gcaataggga caccagccct ctgaagctcc cttggtctcc gagtcggacc ggccgacggg 498001 acgtccacag ttccagcaga accgcttgga ctccggcacc accgggttgg tcatcagggc 498061 ctcaagcgga tcgatatcgg gcgcccgcgg gatttccacc aggccgccgc ccagccgtct 498121 gaccggtggg cgcacccggc tggtggtggc catccggtct tgcggctcgg tgtccgggcc 498181 gagcgtcgga tgggggaagt tgtcctcatc gccgaaatcg gggcggaaca ccgcctgggt 498241 gctcaggggt cgaaccgtcg cggacgtcgc ggtctgggcg tccgccggtt gggtgccggg 498301 gcccgaacgt tcggtctctg acgctttggc catcagtcca catacctcgg cgtgggcggg 498361 gcgggggctg ggccgagcac cgtcagccac ttgcgataca acgtgttcca ggtgccgtca 498421 ttgcggatgc gttcgagcgt gccgttgacg aaccggacca atccggtgtt gtccaggttg 498481 atcccgacgc cgtagggctg gtcggccatg tcgggcccga cgatatgcag gtaggggtct 498541 tcctctacca gcccggccag gatggtgtcg tcggtgctga cagcgtcgat ctcgcgctgc 498601 tgcaaggcca ccaagcagtc cgcccagttc accaccgaca caatgacggg aggcggtgcg 498661 atctcccgga tacggcgcaa cgatgtggtg cccctggcca cacagacccg cttgcccgac 498721 aggtcggaca cctttgtgat cggcgagtca cgcggggcga ggatgcgttg gttggcgtcg 498781 aggtagacgg tggagaagtt gaccagcttg cgccgctcgc aggtgatcga catcgtcttg 498841 acgacgatgt cgacctgcga cttctgcagc gcggtgaccc gctccgcggc cgacaggatc 498901 cggtactcga catgtgacgg gacaccgaag atgtcgcgtg ccacttcgcc ggcgatgtca 498961 acgtcgaagc cggtgatctc gccggtgatc gggtcgcgga agctgaacag gttgctgccg 499021 atgtcgagtc cgacgatcag cctgccgcgc gcgcggatgt cggccaccgc ggcgtcggcc 499081 tcggccttgg tggcaaaggg gcgcaggctg gcggtgggat cgcagtcctg gctcgaactg 499141 tccggcggca gcgggggttg cggtggcatg atctccatgc cgaccggtgt gggcagcggc 499201 agcgtcggcg tcgcctccac ccccagcgtt tccgagtggc cgcaactggc cagcaccatc 499261 gccaaggcca gcggcgcgag tggggctgcc gcccgcgcca ggagggcccg gcgcgtcatc 499321 accgatactc tttcagccgg ggccacaggc cgagcgcgac ggcaatggcg gcacctaagc 499381 tgagcaccac gccgcccacc tgcgcgcctg ccagcccgcg atgcgcattg aggatgtcgt 499441 ggcgcagttg ggtgcggctt tgtcccatgg ctttggtcag tgcttcgtcg agcttgtcga 499501 atgcgggggt agcatcgtcc tcgcctttac ccagtgccac ctgagtggcg gcccgatagt 499561 tgccgacgga gatgtaggaa ttgatccggt cgttggcctg ccgccagcgc accaacagct 499621 ggtcggcgcc ctgcagatcg ggtttgtcga cggcgtggcg gcgggccatg tagtcgttga 499681 gctggcgttg catggcgtcg atgcgctgat agaaggcctg cttgcggacc tcttcgtcgc 499741 cgcgccggat cagcgacagt gtctcgtcgg cccgtgcctg ttgggcggtg atcgccaggt 499801 tggtgatggt cttgagtgac tcagccgcgg tatctttcgc gctacggctg gccgttgtag 499861 agatggtcag cgcagttccc acccacacca ccatgacgag aataccgagc gcgcccacga 499921 caagaccggg gttaatccgt cgcctggtgc gccgggccag ccagcgatgt gcgaacgcac 499981 cgaagaccac ggtggtggcg accaccagga tcaccggggc cgggatctgg gtcgacgcgg 500041 tggtttcccg atctacccgt gctgatgtcg cctggtagag ccgttgcgcg tcgggcagga 500101 tcgtcgattg catcagcccc gacgcctctg acagatatga cgacccgacc gggttgcccg 500161 cccggttgtt ggcgcgggcg atctcgacca ggccggtgta gacggccaat tcggcgttga 500221 tccggcccag caattgcacc aacgattcgt cggtgagccc gctcgaggcc cgggttaccg 500281 ctaccgaggc atcggtaatg gcctgctcgt agcgcagccg aacgccgccc ggctcggctt 500341 gggctatgaa cgcggtggcg gccgcggcat cagccaccga cagcgtggtg tacagccgtc 500401 cagccgcgaa cgacagcggc tcggtgtggt cgagcaccgc ggtcaacacc tgctgccggt 500461 gttcgatggt ggtggaggta gcgaaggcgc tggccacgcc gagagccgcc aacacgatgc 500521 cgatcgtcat gattcggccg ggtgtcgtcg agatgaacca ccgccgggga tgtgccggtt 500581 cggcgggcga gcgtgatccc agcggctcgg tcgacgggtg cgccagctca accgtcacgt 500641 ctgttaggac ctcatctttc ggctaacgca acgaaactct ataagcgaat tctaagagaa 500701 ggttccgaca gatggtgtta ggcatacgca attgcccagt tgcccgcctg catattctga 500761 acaggtgcgg ggcgacggtg acggatgggt ggtgtccgac agcggcgttg cgtactgggg 500821 ccgctacggt gcggccggtc tgttgcttcg ggctccgcgg ccggacggca cccccgcggt 500881 gctgctgcag caccgcgcgc tgtggagcca tcagggcggc acctggggct tgccgggcgg 500941 tgctcgagac agccacgaga cgccggaaca gaccgcggtc cgcgaatcga gcgaggaggc 501001 gggcctgtcc gccgagcgac tcgaggtgcg ggccacggtg gtcaccgccg aggtgtgcgg 501061 ggtcgacgac acgcactgga cctacaccac cgttgtcgcc gatgccgggg agttgctgga 501121 caccgtgccc aaccgggaaa gcgccgaact gcgctgggtg gccgagaacg aggtggccga 501181 cttgccgtta catcccggat tcgccgccag ttggcaacga ctgcggaccg ctccggcgac 501241 cgtgccactg gcccggtgcg acgaacggcg gcagcggctg ccgcgcacca ttcagatcga 501301 ggccggggtt ttcctctggt gtacgccggg cgacgcggat caggcgccct cgccgctggg 501361 taggcggatc agttcgctgc tgtaagcgcc gaccggagct gctcggccgc cgcacgtggg 501421 tcgtcagccg aggtgatcgc ccgcaccacc acgatccggc gagcgccggc atcgagcacg 501481 gccggcagcc gttgcgcgtt gatgccgccg atagcgaacc acggcttgtc gtcgccgccg 501541 agttcggcgg cgacccgtac cagccccaga cccggcgccg cacggccagg cttggtcggt 501601 gtcggccaac atggtccgac acagaaatag tcggcgtcgc cggcggcggc cgcagcaacc 501661 tggtcggggt cgtgggtgga ccggccgatg agggtatccg gtgccaggat ctgtcgtgcg 501721 acgttcacgg gcaggtcgcg ttgacccaga tgcagcacgt cggcgccggc cgcgcgggca 501781 atatcggcgc ggtcgttgac cgcgaatagg gcgccgtacc ggtgcgctgc gtcggccagg 501841 atctcgcagg cggccagttc gtcacgcgcc tgtagcgggc cgaaccgcag ctcaccgggt 501901 gagcccttgt cgcgcaactg gatgatgtcc actccgccgg ccagggcggc ctcggcgaac 501961 tgagccaagt cgccgcgttc ccgacgggcg tcggtgcaca gatacagcct tgccgatgcc 502021 agacgggatt cgtgcacatc gtgacgctag cgcgctagcg tggaaccctg tagacacggg 502081 agtcccggga gcggggtctg agagtgggcg cgcctgccct taccgtcaca cctgatccgg 502141 atcatgccgg cgaagggagg tcaaggatgg cgtccgacct acacaccggg tcgctggctg 502201 tcatcggcgg cggtgtcatc gggctgtcgg tggcccgccg tgccgcccaa gccggctggc 502261 cggtgcgggt gcaccgcagc gacgagcggg gggcgtcctg ggttgccggc ggcatgctgg 502321 ccccacacag cgaaggctgg cccggcgagg aacggttgtt gcggctaggc ctgcagtccc 502381 tgcggctttg gcgtgagggc agctttctcg acgggctggg cccgcaactg gtcaccgcgc 502441 acgagtcgct ggtggtggcc gtcgaccggg ccgacgtcgc cgacctgcgc actgtcgcgg 502501 actggttgtc cgcacagggg cacccggtga tctgggagtc ggctgcccgt gacgtcgaac 502561 ccctactggc gcaaggcatc cggcacgggt ttcgggcgcc caccgaactg gccgtcgaca 502621 accgcgccct gctcgacgtg ctgtgccgtg actgcgagcg actcggagtt cgctggagct 502681 cacaggtgag cagcctgtcc gacgtcgatg cgcacacggt ggtgatcgcc aacggcattg 502741 acgccccggc cttgtggccc ggcctgccga tacgcccggt gaagggtgag gtgctgcggc 502801 tgcgatggcg accaggttgt atgcccttgc cgcagagagt gattcgtgcc cgtgtgcgtg 502861 gacgacaggt ctatctggtg ccacgttcgg acggggtggt cgtcggcgcc acccaatacg 502921 agcacgggcg cgacaccgcg ccggtggtat cgggagttcg tgacctgcta gacgatgcgt 502981 gtaccgtgct gccggcgctg ggtgagtacg agctggccga gtgtgaggcc ggactgcgcc 503041 cgatgacacc cgacaacttg ccgctggtcc aacgcctgga ttcgcggacc ctggtcgcgg 503101 ccggtcacgg ccgatccgga ttcctattgg cgccgtggac tgccgaacag attgtgtccg 503161 aactcgtttc ggttggggcc gcctcatgat cgtcgttgtc aacgagcaac aggtcgaggt 503221 cgacgagcag accaccatcg ccgcgctgct ggattcgctg ggcttcgggg accggggtat 503281 cgctgtggcg ttgaactttt cggtgctacc acgatcggac tgggccacca agatctgtga 503341 gctgcgtaag ccggtgcgac tagaggtggt gacggcggtg cagggtggct gagtccaagt 503401 tggttatcgg tgaccgcagc ttcgcctcgc ggctcatcat gggtactggg ggtgcgacca 503461 atctggcggt gctagagcag gctctgatcg cctcaggtac cgagctgacc accgtcgcga 503521 tacgccgggt cgacgccgac gggggaaccg gcctgctcga cctgctcaac cggctcggca 503581 tcacaccgct acccaacacc gcggggtgcc gcagcgccgc ggaagcggtc ctgacagcgc 503641 agttggcccg tgaggcgctg aacaccaact gggtcaagct cgaggtgatt gccgacgaac 503701 gcaccctgtg gcctgatgcg gtcgaattag tccgggctgc agaacaattg gtggacgacg 503761 gatttgtggt cctaccgtac acaaccgacg acccggtgct ggcccgccgg ctagaagata 503821 ccggttgcgc agcggtgatg ccgctgggtt cgccgatcgg caccggcctt ggtatcgcca 503881 acccgcacaa tatcgagatg atcgtcgccg gtgcccgcgt tcccgtggtg ctggacgcgg 503941 gcatcggtac cgccagcgat gccgcgttgg cgatggagtt gggttgcgat gccgtgttgt 504001 tggccagtgc ggtgacccgg gccgccgacc cgccggcgat ggccgcggcg atggccgccg 504061 cggtgaccgc cggatatctg gcgcgttgcg cggggcggat cccgaaacgc ttctgggctc 504121 aggcttccag cccggcacga taaccaaaac ggtgaagcca cggggtgcgg gcggcccgct 504181 accggtccga ttgccccgga tgtggcagct tgcgcataca gtgcagcctt atacacgccg 504241 acctgttggc tgccgccgac tacaacgttg tgggattggc ggcggcggtg ctatcggtgt 504301 gggcctactt ggcgtagacc tatggccgac tggtgggacg acgagtccgg agttggcagc 504361 accatcgcca gtcttccgta gcggcattgt cgctggtagt gctttggttt gtgctgtgta 504421 acctccggtt taggccattc aacgctctgt tcgtttgatt ggtcggtggg atgcgaaagc 504481 tgcgcggcga caggcgcggt ctaatctggg cgcgatggtg aacaaatcca ggatgatgcc 504541 ggcggtgctg gccgtggctg tggtcgtcgc attcctgacg acgggctgta tccggtggtc 504601 tacgcagtcg cggcccgttg ttaacggccc cgctgccgca gagttcgccg ttgcgttgcg 504661 caaccgggtg agcaccgacg cgatgatggc gcacctatcg aaactgcagg acatcgccaa 504721 cgccaacgac ggcactcgcg cggtgggcac ccctggctat caggccagcg tcgactatgt 504781 ggtaaacaca ctgcgcaaca gcggttttga tgtgcaaacc ccggagttct ccgctcgcgt 504841 gttcaaggcc gaaaaagggg tggtgaccct cggcggcaac accgtggagg cgagggcgct 504901 cgagtacagc ctcggcacac cgccggacgg ggtgacgggc ccgctggtgg ctgcccccgc 504961 cgacgacagt ccgggctgca gtccgtcgga ctacgacagg ctgccggtgt ccggtgcggt 505021 ggtgctggta gatcgcggcg tctgtccttt tgcccagaag gaagacgcag ccgcgcagcg 505081 cggtgcggtg gcgctgatca ttgctgacaa catcgacgag caggcgatgg gcggcaccct 505141 gggggctaat accgacgtca agatcccggt ggtgagtgtc accaagtcgg tcggattcca 505201 gctacgcgga cagtctgggc caaccaccgt caagctcacg gcgagcaccc aaagtttcaa 505261 ggcccgcaac gtcatcgcgc agacgaagac ggggtcgtcg gccaacgtgg tgatggcagg 505321 tgcgcatttg gacagcgttc cggaaggacc cggcatcaac gacaacggct cgggagtggc 505381 tgcggttctg gaaacggcag tgcagctggg gaactcaccg catgtgtcca acgcggtacg 505441 gttcgccttc tggggcgccg aggaattcgg cctgattggg tcacgaaact acgtcgagtc 505501 gctggacatc gacgcgctca aaggcatcgc gctgtatctg aacttcgaca tgttggcgtc 505561 gccgaacccg ggttacttca cctacgacgg tgaccagtcg ctgccgctag acgcccgcgg 505621 tcagccggtg gtgcccgaag gctcggccgg tatcgagcgc acgttcgtcg cctatctgaa 505681 gatggccggc aagaccgcgc aggacacctc gttcgacggt cggtccgact acgacggctt 505741 cacgctggcg ggtatccctt cgggtggcct gttctccggc gctgaggtca agaagtccgc 505801 cgagcaagcc gagctctggg gcggcaccgc cgacgagcct ttcgatccca actatcacca 505861 gaagacagac accctggacc atatcgaccg caccgcgctc ggtatcaacg gcgctggcgt 505921 cgcgtacgcg gtgggtttgt atgcgcagga cctcggcggc cccaacgggg ttccggtcat 505981 ggcggaccgc acccgccacc tgattgccaa accgtgatcc gggcctgatc tcgccactga 506041 ccccgcaccg accgatctag aatgggattt ccttggtgca tgccgggcgg gacggggtta 506101 ggagatgcat ggtcgcgggc ggtatcgacc tctggtccgc tgtgttcgcc ctcgccgggt 506161 ggccgcgtcg gtgcggaccc cgatcgcctg tctagcggcg gtggtcgtga tagccggctg 506221 cacgaccgtc gtcgacgggc gggcgctgtc catcctcaac gacccgttcc gggtgggggg 506281 tctgcccgcg accaacggtc cgagcggcgc ccgccccgac gcaccggctg cgtcgggcac 506341 ggtgatcaac accaacaacg gagcgatcga caagttgtcg ttgttgtcgg tcaacgacat 506401 cgaggactac tggatggcgg tctacagcga atcgctgaag ggcaccttcc ggccggtcgg 506461 caagctggtg tcctacgatt ccaacgaccc aagtagtccg atcgtctgcc acattgacac 506521 ctatcagctc gtcaacgcct ttttcagctc tcggtgcaac ttgattgcct gggatcgagg 506581 ggtcttcatg gcggtcgcgc aagaatactt cggcgacatg tccgtcaatg gtgtgctggc 506641 acacgaattc gggcatgctc tgcaagtgat ggcgaatttg gttaccagga aagatcccac 506701 catcgtccgc gagcagcaag cggattgctt cgccggggtc tatctgtggt gggtggccga 506761 aggtaagtcg acacgcttta cgctgagcac cgcggacggg ctcgaccacg tgctcgccgg 506821 catcatcacc acccgagacc cggtgatgga agccgatgcg gaaaacgacg acgaacatgg 506881 gtcggccttg gatcgggtca gcgcgttcca gctgggcttc atcaacggca cgccggcgtg 506941 cgcggcgatc gacgaggacg aagtcgagcg gcgccgcggt gacctgccga cgacgttgcg 507001 ggtcgatgcc agcggcaacc cagagaccgg cgaggtcgga atcaacgaag agaccctctc 507061 gacgttgatg gagttgatgg gcaagatctt ctcgccgaag aatccgccca cgctgtccta 507121 ccagccggcc ggttgcccag acgccaagcc cagcccaccg gccgcctact gtccggccac 507181 caacaccatc gtggtcgacc tgcccgccct ggcgaggatg ggcaaggtgg cctcggcagc 507241 ggaacacagc ctgccgcagg gcgatgacac gtcgttgtcg attgtgatgt cgcggtacgc 507301 gttggcggtg cagcacgaac gcgggctgcc gatgcagagc ccgtggaccg ccttacggac 507361 ggcgtgcctg accggcgttg cgcaccgcaa gatggccgtg cccatcgacc tgccctccgg 507421 ccagcaactc gtacttaccg cgggtgatct cgacgaagcg gtttccgggt tgctgaccaa 507481 ccgcatggtc gccagtgacg ccgacggtgt cagcgttccg gccggtttca ctcggatagc 507541 cgcgttccgt gccggcgtgg gcggcgacat ggacgcatgc tatgcccggt atccgggata 507601 ggactggccc tgatgttgat cgttgtgcac ccacatcacc aaaaacccgg tgaccagcaa 507661 ccaccccagg gcaacggacg ggatcgccca ggcgcgacgt acgagtagtg cggtgtcaca 507721 acgcgtgacc agggcctgag tgttgtcgtt gccgtcggcg tacagcgcct gggtgaggtt 507781 ggagcgccag ccgctgccac aggtgacctt gatgccatac gcatcgtatt gatccaggta 507841 gaccggaaac cacagcgcca tcagaccaat gaccgccagc agcaggccag taattccgat 507901 gaacatctgg cgacgattca cggcttcttc atgtcttgcg atgtgcattc gggattcggg 507961 cgccgcagcg ctcgcgtcat gcaagcgcaa atgcgggctt tgccaacaaa ggccgggtgg 508021 ccacgcccag gcaagttgtg agggaggccc cccggggccg caaccatgtt aacgcgcgtc 508081 cgcctaagca ttcagcgcgc cgtgccctac cggcactacg cccgggcgtg cgtgcggaac 508141 ctgacagagc tcacgctatt tggcccgccg acagacgtag cgccgcatcg accgccagcc 508201 tggcgacatc gagggtcttc gagcccaagt catgtcgggc gccggtgatc tcgacgacct 508261 cggtcggtgc cgagaccatc gccgcggcgg aacgcacctg ggccagcgtg ccgaacgggt 508321 ccgccgttcc gtgggtgaac accgtcggca ctgcgatccc cggcaagtgc tcggtacgga 508381 cgcgttccgg ctttcccggc ggatggaccg gataggagaa cagcgtcagc acgtcgaccg 508441 gtgcctgccc ggccgccacc accatggacg tctgccgacc gccgtaggaa tgtcccccgg 508501 cgatcagcgg accctcggca aggccgcggc acagctggat cgcttcgacg atgccggcac 508561 ggtcgcctga ccccgagccg gatggcggac cggtgggtcg gcgtcggcgg tagggcaggt 508621 tgtagcgcac ggccagccat cctcggcggg tccattcggc gcaaacctgt tgcaacagtg 508681 tggattcgcg gctaccgccc gcgccgtggg taaggacgac taccccgtgt ggtgggccgg 508741 ccggttggtg tgcaacgccg gcgatctgat caaggttcat gacagccgaa acagcggcga 508801 aacgggcccg tggccgcggc ccagtggata ggccgcgcgc aggcattcgg taacccatcg 508861 cttcccgaag tccaccgcgt cgggcacggt gaagccgtgc gccaacgcgg cggcgatcgc 508921 ggtcgccagc gtgtcaccac cgccatggtc atcgccggtg ggtagtcgct gcgcgtcgaa 508981 ctggtagcag ctgacgccgt catagagcag gtcgcagctg ccgtccgacg agcgcaggtg 509041 tccgcctttg accagcaccc actgcggccc cagcgcatgc agggctttgg ccgccgcacg 509101 ctgcgactcg gcgtcgacta cctcgatatc taccagcagg cgcgcctcgt caaggttggg 509161 ggtcagcagc gtcgccaacg ggaacagctg accgcgaagc gaatccaggg cagacggtgc 509221 caacagcggg tctccgtgca tggatgcgca taccgggtcg acgacgagcg gaacggacag 509281 ctcgagccga cgccaggtcg cggccacggt cgcaacgatg cgcgacgagg ccagcatccc 509341 ggtcttggcg gcttgaacgc cgatgtcggt gacgaccgcc tcgatctggc cggccaccac 509401 atcgttggga acttcatgaa tatccttgac tcccaacgtg ttctgtacgg taaccgccgt 509461 gactgcgacg cacgcgtgca ctcctagcag tgccatcgtg cgcatatcgg cttggatgcc 509521 ggcaccgccc ccggagtccg atccggcgat gctcaacacc cgcggcggcg tcattcccgg 509581 cggtgccagc gggaggtagt tcactgggtt atcgggagat acacccgatt gccgtgctct 509641 gcgaattcac gtgacttttc ggccattccg gcggcgagca cggcttcgat gtccgcttcg 509701 gtctcaagcc cgtgttcggc ggcgtactca cggacgtcct gggtgatgcg catggagcag 509761 aacttcggtc cgcacatcga gcagaagtgc gcggtcttgg ccggctccgc cggcagggtt 509821 tcgtcgtgga attcccgtgc ggtgtcggga tccagcgaca gtgcgaactg gtcgttccag 509881 cggaactcga aacgcgccgt gctcaaagcg tcgtcgcgct cctgggcgcg cggatggccc 509941 ttggccaaat cggccgcatg cgcggcgatc ttgtaggcga tcaccccgtc cttgacgtcc 510001 ttgcggtccg gcaacccgag gtgctccttg ggggtgacgt agcacagcat cgcggtaccg 510061 gcttgggcga tgatggccgc accgatcgcc gaggtgatgt ggtcgtaggc cggcgcgatg 510121 tcggtggcca gcggacccag cgtgtagaac ggggcctcct cacacagttc ctcttccagc 510181 cgcacattct cgacgatctt gtgcattggg atatggcccg gcccctcgat catcacctgt 510241 gcgccatggg ctttggcgat cttggtgagc tcgcccaggg tgcgcagctc ggcgaactgc 510301 gcggcgtcgt tggcatcagc gatcgaccct ggtcgcagcc cgtcaccgag tgagaaggtg 510361 acgtcgtagc gggcgaaaat atcgcagagc tcctcaaagt tggtgtacaa gaacgactcc 510421 cgatgatgtg ccaaacacca cgcggccatg atcgaacccc cgcgggacac gatgccggtg 510481 acccgcttgg cggtcagcgg cacataccgc agcagcaccc cggcgtgcac cgtcatgtag 510541 tccacgcctt gctcacactg ctcgatcacg gtgtcgcggt agatctccca ggtcagctcg 510601 gtcggatcgc ccttgacttt ctccagcgcc tgatagatcg gcacggtgcc gaccggcacg 510661 ggagaattgc gcaggatcca ctcgcgggtt tcgtggatgt tcttgccggt ggacaggtcc 510721 atgatggtgt cggcccccca gcgggtggcc cacaccatct tgtcgacctc ctcggcgatc 510781 gagctcgtca ccgccgagtt gccgatgttg gcgttgactt tcaccgcgaa cgccttgccg 510841 atgatcatcg gctcgctctc ggggtggtgg tggttggccg ggatcaccgc gcggccgcgg 510901 gcgacctcgt cgcgcactag ctcggcggac atgtcttcgc gggcggcgat gaacgccatc 510961 tcggcggtga tctccccggc gcgggcccgc tgcagctggg tgccccgatc gcgaaccact 511021 ccgggcctat gcggcagccc cgcggtcagg tcgatcaccg tgtccgtgtc ggtgtagggc 511081 ccggaggtgt cgtagaggtc gaagtggtct ccggtggaca agtgcacccg tcgaaacggg 511141 acttggagag tagctccgct gccgggagcc tcgatttcac ggtaggcctt ggcgctgccc 511201 gcgatgggac ccgtggtcac cgacggttca acggtgatgg tcatttgcaa ctccctacgc 511261 cggcattacc cggtcaggtt cgtacggtcg acggccccga gccgtcctct cagcgcactc 511321 ggcgtgcgct cccgcgtggg tacccccacg ctagcgcagc gcggcgccgg tgtgcacgga 511381 cggcccgatg ccgcgttagg cctcttccat cgcctcgccg agttcctcga ggacccggtt 511441 gtggtggtta tttgccaaga tatgtccggt ggcgataacc agggcgaccg gccagtcgat 511501 gagttcgagg gcagcgagtg ccgccagacc tccgtagtag gccaggtgct ctggccgcgg 511561 aatcttcact tggccgcaga tcgggaggtt catcacaata gtctccgctt cgcggatctt 511621 agccacggcc tcacgctgag atgtcgctcg acgcgtattc ttttcggcca tcatttgcct 511681 ttcagtaacg aagggtttgc cgttgtgcag ggtggtgcgg tcaccgtcgg gggtatgtcg 511741 acgaggaccg atgcgctggc ggaccctttg gcttcgtcgt ttgccttgtt gctgacggtg 511801 cctttactgg agctttacgc cgtgctgtgg cgcgtcggcg tcgtcgaggt ccggggggcg 511861 caccggggga cgcgtcgcgg gaaagcgcat cggtctcggg tggttgcggg ttcggctggc 511921 ccgatttgtc ccgacccgtc agcacacggt tcagcacggc gacggcgacg gtcgcagcag 511981 tcgcggctgc tgtcgcctgc gcccagccga gtgggtcgag gggggtgcag ccgagcaact 512041 ggctgacgac ggggatgctg atcaaggttg ccagcgcggc cagtgagccc agtgcggtga 512101 gcacaaccag ccaggcatgc gagtccacca aggtttgacc caactgagcg gccaccagcg 512161 ccaccagcgc caccgtggat gcgcggcgcg gcaagccggt gaaccccgcc atcacccagg 512221 ccacggtggc cgcggccgcc gtggtcgccc cgcggatacc gacggcacgc catagctcgc 512281 gttgatcggg accgcgggtt gccggcgtta ccgggtcgct tggcttgctg accgcgagcg 512341 ccgccgcggg cagtgcgtcg gtcagcatgt tcaccagcag cagctgacgg gtgttcaacg 512401 gcgaggtccc ggtgatagcg ctgccgatga tggcaaaggc cacctcgccc gcgttgccgc 512461 cgagcagcac agacactgcc gcttgcaccc gctgccaaag ctggcgtcct tccaggattg 512521 cgggcagcaa tgactcgatc cggccgtcga ccaacaccag gtcggcggcg actcgggccg 512581 ggtcgctgcc gtgggcgacg acaccgatgc cgacggtggc ggcgcggatc gcggccgcgt 512641 cattggagcc gtcgccgacc attgcgcaca cccggccgct gtgttccagc gtctgcacga 512701 tctgtacctt gttctccggt gtcatccggg cgaagatcac ccgctcggct accgctcgct 512761 cctggtcctt gcgtgacagg gcatcccact cggcaccgct aatgacctgc tcagggctca 512821 cttgcatgcc gagctcctcg gcgatggcgg cggcggtaat cgggtgatca ccggtgatca 512881 gccggatatc cagatcgtgc tcgtgcaggt ccgcaagtag ggccgccgcc tgggcgcggg 512941 gggtgtcgga caacccaaga aaccccacca gactcaactc gtcgcggcac aatctcgcga 513001 tctcgtcggg gtcgtccacg accgactgtg cctgttgcgc ggtcagctgg cggtgggcca 513061 ccgcgatcac ccgcaatccg ttggcggcca gttcagcgac cgcgtcgtcc atgctcgagc 513121 cgatgccttc gcacgccgcc agcaccactt cgggcgcacc tttgacggtc agctcggtgc 513181 cggacaccga ggcggaaaac gacctaccgg agcgaaatgg caggtgggcg gcgggttctg 513241 cggcgccggg ttcggcaccg tcggtgccac tggcggcagc cgctgccgca gcttgcacga 513301 tcgcgacgtc ggtggcgtgc acctgcgggc cgttcgacgc cggcgcagcg tgcgccgcgc 513361 agcgcagcac ttcctcgcgc gagtgccccg ccaccggccg cacctgcgcc acccgcaaac 513421 ggttctcgct gagcgttccg gtcttgtcga agcagaccat gtcgacacgg ccgagcgcct 513481 ccaccgagcg cgggatgcgg accagcgcac cgaagtgact tagccgtcgc gcggatgcct 513541 gctgggccag tgtggccacc agcggcatcc cttccggcac tgcggccact gtgactgcga 513601 taccgctggc caccgcttgg cgtaggcccc gccggcgcaa cagcccaagc ccggtgacca 513661 gtgcgccgcc ggtcatgctg accggccagg cctggttggt gagccgactc agctgatgct 513721 gcaggccgac gctggacaga tcaccggaca cgagctcggc cgcgcggcgc tcctgagtgt 513781 caggacccac cgcggtcacc accgcgaccg cggtgccgga cacgacggtc gtcccggcat 513841 agagcatgca gcgacgttcg atcaggtcga cacccggcgt gggttcgact tgtttggtca 513901 ccgacagcga ctcaccggtg agcgcggact cgtcgacctc cacgtcgacc tcctcaatca 513961 cccgggcgtc ggcgggaacc acctcgtggg tccgcacctc gatgatgtcg ccgggacgca 514021 gctcctcggc gcggacttcg atgtacctcg gctggtcgtc cgcgccggcc agcaccttcc 514081 tggcgggtgg aatctgctga gccaacaagc gattcagccg actttcggca cgcagccgct 514141 ggctggccgc gagaatagag tttccggtga gcaccgaacc gaccatcacc gcgtccaccg 514201 gcgaacccaa caccgcactg gccattgcac caagcgccag cataggcgtc aacgggtccg 514261 acaactccgc gcgcacggcc ttggtcaact gccaaagcgc gttcaagggt gcctgggtta 514321 tttgtgcgcc gcgcttcgcc gtgtgcaggc caccggccag cgcacgggcc ggatacggcg 514381 aaggcggtgc cttcgccggt gcctgctcgt ccggcgacgg caaagctttg cggacttgct 514441 cgaccgacat tgcgtgccat tcatgagcgg gtgccggtcg cggtgcttgc gcgtcgacga 514501 ccttgcgtgc cagcaggtat cccgagagca gtccggccgc cgcgccggtg gtcaccgggc 514561 cgggccccag tccgcggacc cctggcagca tcaacagggc tcccaaagcc gatgcaccac 514621 cggaaatttc gttacctcgc tggcgtgcgg ccctggccgc cggaatcgcg tgcagcaccc 514681 tccaggcggc tccaagatcg ggcagcagga catctgcgta ccagggcggt gcaccggctc 514741 caggtggtgg cagcacacca agcgccacat cggcagccga aagcgcttgc ttaccaaccg 514801 atgacaacac cgcgacggtg cggcccgcct ggcgcagctc ggccaccgca cgggctaggg 514861 cttcgtcgag ggacccgctg gcaccgtcgt cgaggggccg gatgtcgtcg aacaccggtc 514921 gcaactcgcc cagggcgtcg acatcgaccg aaaccaggtc cgccccggtg cgatgtgcct 514981 cggcaaccac cgcggaggcc agccggtcgt gcatggggcg gaaaagtgcc tcgactgccg 515041 aatccgaacc gctggccgag acaccgggta ctcggtgcca gcctgggcgc aggccactct 515101 ccgtcaaaac gagttgcgcc cgattccacg ctgtggacag ctcgtctgcg ccgcaaccgc 515161 ggatgcgcgc cacgcgcagg tcatcggtgc acagcacgcg ggggtcgatg acgatcgcat 515221 cgacccgatc caatcggcgc aaactctccg gccgtaacgg caacaccgcg tgctgatcag 515281 ccaaaccttg gccgagcgcc gcggcgaacg cctccggcgt ggtccggctg gctttggggg 515341 tggccaccag cgtcgcggtc gcggccatgt ccgcgtcgcg ggtcccggcg cccacgagca 515401 ccgcgctcag cgcttggatc agcgcgaaac gcgcgacgct gcgttggaca ggttgcgtcg 515461 accgtgcggg acgcggccaa agggattggg gttggtcggc cggttcgtcg gcgtgcagcg 515521 cgagctgtgg ttcatgccgg cgccaggctc tggctccggc acggcattcc gcggctttca 515581 gcgcctggat cgtcagatcc accgacaacg ccgccggcga cagcgtgacc gtgtgtgcgg 515641 cggccatggc cagctcaagg acggtggcgg tcgccgccgt gcctattcga tcctcgagta 515701 ggcggcgcag caacggctgg tggtccacgg ccgccaccgc tgcctcgatg acgagcggaa 515761 atcggggcca gcgcagcgcc cggccgccga gcgctaagcc cagcccggcc gcggtggcgg 515821 ctaccgttac agctctgacc gccagcagca cgccgtcgcc cggcaggctc cccggtgatt 515881 gcgccagctg atcggcggct tgatctgggt ggcggtgtct ttcggctttt tcggcgtcat 515941 cgacaatgcg gcaaagttcg cgcagtgatg tgtcgggatc gtcgatagcg acgacgacac 516001 gggacaacgg gtagttcagg ctggccgacc cgaccccggg gtgggcttgg attgcgttga 516061 gcacgacgcg cccaagttcg tcgtcgcctc cgctgcgcaa gccgcgcact tcgatccagg 516121 cgcgacgctc gccacgccaa cagttccggc cgagtgtctc gcgggacagc tcgccggaaa 516181 gtgccttggc tcctgcgcgc agcgggatga tggcgacctt cataccggtt ccgaccccgg 516241 tcttggccag ggttgccgag accgctgtcg cggcagtgat cgatgcgccg gtaagggtgg 516301 cggtcgcccg gaagcccgtg gcgacagcac gcaccggcat agcccgtgcg atggatgcag 516361 caatgctcaa gagttgctca acgccgccag actagttggt gctgcgcagc tcagcggtgc 516421 ccgctcggcg accgctcgtc ttcctggctg ttgtcttgct cgctttggct ggcgcttcct 516481 ttgcggcagc cggcttgtcg ggcaccggcg caagtttcgc cttcaccggc ggtgcggcca 516541 cctcgggggt gcggttgagc ttccgcaata gcaacgctcc gccgcccacg gccaacagga 516601 tcggccaatc gacgagtcca gcgacaccga tcgcaccgat ggccaacgcg gctgccgcgg 516661 tcgacttgct gccgctgcta agccccttct gaatgccttt ggcggctccg gtcacaccgc 516721 cgacgatgcc gctcacggcg gcgccaccca ccgcacccgc ggccgctgtg gtcgccgttg 516781 ctgcaccact caccgttcgc cccacggttc gaactgtccc gccgaccaca ctcatgatga 516841 ctccctggcc caaactgcat tcgtttacaa atggtttagc tacagttcta cactcgttaa 516901 cccgcaccct gcattcgcac cgctgacgag atttctgttc agcgctctcg aaatgcaagc 516961 ctgccacgcc gccctgactg agacaacgcg caactgccgc gtgcggcgcg actgccgact 517021 accgccgtac gccgcctacc cggcgtgcag gtcgacgagc accggagcgt ggtcgctggg 517081 cgctttgcct ttacgctcct cgcgtacgat ctgggcgtcc atcacccggg cggccaacgc 517141 cggcgagccg aggatgaagt cgatgcgcat gccctgtttc ttcgggaacc gcagctgcgt 517201 gtaatcccag taggtgtaaa ccccgggtcc cggggtgaaa ggccgtacta catcggtgaa 517261 ttgcgcgtcg acaatggcgt tgaacgcctt gcgctcgggt tcggaaacgt gcgtgcagcc 517321 ggcgaagaat tcggtgctcc agacatcatc atcggtcgga gcgatgttcc agtcgcccat 517381 cagtgcgatt ggtgcggcgg gatcgtcacg tagccagcct tcggccgtat cacgcagcgc 517441 ggcaagccaa tccaacttgt aggtgtagtg cggatcgtcc agggcgcgcc cgttgggcac 517501 gtagaggctc cacacccgga tgccgccgca ggtggcgccc agggcacggg cctccgtcgt 517561 ggcggccact tccggcttgc cgctccagct gggctggccg tcgaacccaa cccgcacgtc 517621 gtcgaggccg acgcgggatg cgatcgccac gccgttccac tgatcgaagc cgacgtgtgc 517681 gacgtcatag ccgagttcga acagcggcaa ggccgggaat tggccgtccg ggcacttggt 517741 ctcctgcatg gccaacacgt cgacatcggc gcgcccaagc caatcgagga cacgatccaa 517801 ccgggtgcga atcgaattca cattccaggt ggccagccgc agcagcggcg atcgcaagcg 517861 cggcgaagcc gggcgttggg ggtggccgcc gtcaattgtg ccgtcgggca tggctagaag 517921 gtatcccagc cgaccgactg ggcaggaaga tagcggcagt gatggtgcag ccggaagccc 517981 agggactcgg cgaggacgga tgtcgcggtg tcgtgcacgc gcacgtagcc gcgggtcgcg 518041 ccgcggcccg ctccccagcc caacagcgct tcccacaatt ggcggccagc ggagccggtc 518101 gcggattgct cgtcggcggc acgcattgcc gacagaccca cccaccgggt gccgtcgggt 518161 gcgtcggtta ccgctgcacg tgcgaccgcc acacccaggt agctgccgaa tgccaactcg 518221 ccgtcgatga cgggggtcgc catgtcgagg ggtaggcgtt ggtggtagag ccgcagccag 518281 gtgtcgtcgg ggtggtccag caacgtgacc gaccggtcgg gttcaccggt ggacacgtca 518341 cgcaccaaca cttgctctcg gcgctcacct gccaggccgg ccggtagtgg cagcaagcgg 518401 tccgggacgg ccagccatgg ctgcagatcg cggctcgcat accatgcgct gatttctgtg 518461 atggtgttcg tgtgtgccga gatatccagc ggtactgctg aattagcggc cagtacggcc 518521 ccgtgtccgg ctcgcaggag ccagccgtcc agccaggttc gttcaacgcc gggccaggcc 518581 gccgcggcgg cgtgttcaag tgcgcggatc gcggcggtgc gcaccggcgc atcggtcagg 518641 acccgcaggg ccaccacatc gacgggcgag aactcgacga tggtcccggt cttggtctgc 518701 actcgcaccg tcggatcgac ggctagcagc cgacccaccg catcggtcag cggtggcatc 518761 gatccggcgg gccggcggta gcgcaccgtt acccgtgtcc caagccccgg ccacgagacc 518821 attagtgacc gaacgggtcg gggtcctcgc cgggcagcca cgacagtccg ggaacgcccc 518881 agccatgtga cttgacggcc cgtttggcgt tgcgggcgta ccggccgatg aggcggtcca 518941 ggtacaggaa tccatcaagg tgcccggttt cgtgctgcag catccgcgcg aacaggccgg 519001 tgccctcgat actgaccgga ctgccatcgg cgtcgagtcc ggtgactcgt gcccacttcg 519061 cgcgtccggt aggaaatgac tcgccgggaa ccgacagaca gccttcgtcg tcggtgtccg 519121 ggtcgggcat ggtctcaggt atttcggagg tctcaagcac cggattgatg accacaccgc 519181 gtcggcgggc ggtcattgcg cggtccgcgg cgcaatcgta gacgaagagc cgcaggctgc 519241 agccgatctg gttggcagcc aggccgactc cgttggcggc gtccatggtg tcgtacatgg 519301 tggcgatcaa ctgggcgaga tccgccggga gtgaaccgtc ggcggcgacc gtcaccggtg 519361 tggtcgcagt gtgtaagacg ggatcgccca cgatgcggat gggtacgact gtcatggtgg 519421 gctagcttaa gcgcgccgac gatacgcgcc gcgaggcggc gggctgagga ggcgggcaat 519481 cggcttaggc gcgccgcggg gcggcgggca tcatcgccgg gtgtgaacca cacgacggct 519541 ggccggcatg tcgcgtcgca ggattcacac tcggagcatg agccggcgcg ccgcgatcgg 519601 cagtcgggtg caagcaagtc ggccgactcg cgggcaggat taccgcccga cggttcctgg 519661 cgtggttcaa tattcgccga agaagcgcct acgtaggcca agtcattcgt acacattgag 519721 aattcgccgg aagggcccag gggaaagcga tatggacagc gccatggcgc gggcaattcg 519781 atcgggggac gacgccgagg tcgccgatgg gctgacccgg cgcgagcacg acatcctggc 519841 gttcgaacgt cagtggtgga agtttgccgg tgtcaaggaa gaagccatca aagagttgtt 519901 ctccatgtcg gcgacgcgct actaccaagt gctcaatgcg ctggtggatc ggcccgaggc 519961 gctggccgcc gacccgatgc tggtaaagcg gttgcggcgg ctgcgcgcca gtcggcagaa 520021 ggcgcgggcc gcgcgacgcc ttggcttcga ggtgacctga cactctcccc gcttttgccg 520081 gttgtgtccc ggtgctggtt acagtgggct cgatgaatga gcgtgtaccc gactcttccg 520141 ggcttcccct gcgggccatg gtgatggtgc tgttgtttct cggcgtcgtc ttcctgctgc 520201 tcggctggca ggcactgggt tcgtctccga actccgagga cgactcgtca gcgatttcca 520261 ccatgaccac caccactgcg gcgccgacgt cgaccagcgt taagcccgcg gcgccccggg 520321 ccgaggtgcg cgtctacaac atctcaggcg cagaaggcgc cgccgcgcgg acggccgatc 520381 ggctcaaggc ggccggtttc acggtcaccg acgttgggaa tctatcgtta cccgacgtcg 520441 cggcgaccac ggtgtactac accgaagtcg aaggcgaacg ggccaccgcc gacgcggtag 520501 gccggacgct aggagcagcg gtggagctgc gactgccaga gctgtccgac cagccgcccg 520561 gggtcatcgt cgtggtgacc ggctgacgct gattcgaacg ccaggttagg ctctcgctat 520621 gccaaagccc gccgatcacc gcaatcacgc agctgtcagc acgtcggtcc tgtccgcgtt 520681 gtttctgggc gccggtgccg cgctgctgag cgcatgctcg tcgccgcagc acgcgtctac 520741 agttccgggt accacgccgt cgatttggac cggatcgccc gcgccgtcgg gactttcggg 520801 tcacgacgag gagtcgcccg gtgcacagag cctgaccagt accctgacgg cgcccgacgg 520861 cacgaaggta gcgaccgcga agttcgagtt cgccaacggc tatgccaccg tcacgatcgc 520921 gacgaccggc gtcggtaagc tcacgcccgg cttccacggc ctacacatcc accaggtggg 520981 taagtgtgag cccaactcgg ttgcccccac cggcggtgcg cccggcaact ttctgtccgc 521041 cggcggccac taccacgtgc cagggcatac cggcaccccc gccagcggcg acctggcctc 521101 gctgcaggta cgcggtgacg gttcggcgat gctggtgacc accaccgacg ccttcaccat 521161 ggacgacctg ctgagcggcg cgaaaaccgc gatcatcatt cacgccggcg ccgacaactt 521221 tgccaacatt ccgccagaac gctacgtcca ggtcaatggg actccgggtc ccgacgagac 521281 gacgttgacc accggcgacg ccggcaagcg ggtggcgtgc ggtgtcattg gttccggcta 521341 gcttgcctgc ccgcaggtcg gccgcccgaa ttgatttcgc aggctcaccg cggcccaccc 521401 tcggtgtgga gtgggagttc gcgctcgttg actcgcagac ccgcgatctg agcaatgaag 521461 ccaccgcggt tatcgccgaa atcggcgaaa acccgcgggt ccacaaggaa ttgctgcgca 521521 acaccgtaga gattgtcagc ggtatctgcg aatgtaccgc cgaggcaatg caggatctgc 521581 gcgataccct gggccccgcc cgtcagatcg tgcgcgaccg cgggatggag ctgttctgcg 521641 cgggtaccca ccccttcgcg cggtggtcgg cccagaagct caccgacgcg ccgcggtacg 521701 cggagctgat caaacgcacc cagtggtggg gccggcagat gctgatctgg ggtgtacacg 521761 tgcatgtcgg gattcgctcg gcgcacaaag tgatgccgat catgacgtcg ctgctcaact 521821 actacccgca tctgttggcg ctctcggcct catcaccctg gtggggtggc gaagacaccg 521881 ggtatgccag caaccgggcg atgatgttcc agcagttgcc caccgccggg ctgccgtttc 521941 actttcagag gtgggcggag ttcgaaggtt tcgtgtacga ccagaagaag accggcatca 522001 tcgaccatat ggacgaaatc cgttgggata taagaccctc accccatctg ggcaccctgg 522061 aggtgcggat ctgcgatggc gtgtccaacc tacgagagct cggcgcgctg gtcgcgctga 522121 cgcattgcct gatcgtcgat ctggaccgcc gcttggacgc cggcgaaacg ctaccgacca 522181 tgcctccctg gcacgtccag gagaacaagt ggcgtgccgc ccgctacggc ctggacgcgg 522241 tgatcatctt ggacgccgac agcaacgaac ggctggttac cgatgacctc gcggatgtgc 522301 tgacccggct ggagccggtc gccaagtcgc tgaactgcgc cgacgagctt gccgcggtct 522361 ccgatatcta ccgcgatggc gcctcctacc agcggcagct gcgagtggcg cagcagcatg 522421 acggcgattt gcgcgcggta gttgacgcgc tggttgccga gctggtgatt tagccgatgc 522481 gggctggctg agtgtgacgt ccgccagccg cgaggagatt gaggtttagg tgatggccga 522541 tttcgcgccg gttgagttgg cgatgttccc gctcgagtcg gcgccgctgc ccgacgaaga 522601 tctgccgttg cacatctttg agccccgcta cgcggcgctg gtccgtgact gcatggacac 522661 cgcggatcct cgcttcggtg ttgtactgat ctcgcgtggc cgcgaggtcg gcggcggcga 522721 tacgcgatgt gatgtcggga cgctggccag gatcaccgaa tgcgcggacg cgggttcggg 522781 tcgctatatg ctgcgctgcc gggtgggcga acggatccgg gtgtgcgact ggctgcccga 522841 cgatccgtac ccgcgtgcga aggtacggtt ctggcccgac cagccggggc acccagtgac 522901 ggctgcccag ctgctggaag tcgaagaccg ggttgtggcg ctattcgagc ggatcgctgc 522961 cgcccgggga gttcggctgc cggcccgtga ggtggtattg ggctacccgg tggttgaccc 523021 agccgatacc gggcagcgtc tgtacgcgct ggcatgtcga gtgccgatgg gcccggccga 523081 tcggtacgcc gtgctggcgg cgccgtcggc ggccgatcga ttggtccgct tgggtgacgc 523141 gctggactcg gtggccgcga tggtggagtt cgagttgtcg acgtaactgc cctacgcggt 523201 gcgtctgacc cactgggcct gaaccacatt cactgcgccg agcaccatat acggacccgt 523261 caccgccggc aagcgcatcc gggtgcggaa ccggctcgac aatggtcaac gccttcgcac 523321 cattgccgac cagtacccgc aattgctcga cttcatcagt ggtcgctagg accgaaggtc 523381 acccttggtg ccgaacttac gcagcgacgc cacctgcagc ggatccagcg acgcgcgcac 523441 ggtttctcgc gcggtcgcca ggtcggcggc ggtgacgttg gcggcatcga tggaacgccg 523501 catcgcggta agcgcggctt cgcgcagcag cgccacacag tcggcggcac tataaccgtc 523561 gagtccggct gccacctcgt ccaggtcgac gtcggagctc agcgggatcg acttgccagc 523621 ggtgcgcagg atttcgcggc gagcggcagc gtcgggcggt tcaacgaaca ccagccgttc 523681 tagccgcccc gggcgcagca gcgccgggtc tatcagatcg ggccggttgg tcgcgcctag 523741 catgacgaca tcccgcagcg ggtcaatacc gtcgagctca gtcagcagcg cggccaccac 523801 ccggtcggag acgcccgagt cgaagctctg accgcgccgt ggcgccagag cgtccagctc 523861 gtcgaggaac accagtgacg gcgcggagtc gcgggcccgc cggaatagct cgcggactgc 523921 cttctccgag gagcccaccc acttgtccat cagctccgac cctttgacgg catgcacgct 523981 caactgtccg gtgctggcca gggcacgaac cacaaaggtc ttgccgcagc cgggcgggcc 524041 gtacagcaac accccgcgcg gcggttcgac acctagccga gcgaaggtgt cggggtgctg 524101 cagcggccac agcaccgcct cggtcagtgc ttgtttggcc gcggccatgt caccgacatc 524161 gtcgagcgtc acgtcaccca cggtgacttc gtcgctggcc gagcgggaca gcggccggat 524221 gacggtcaac gcaccgagga ggtcgtcttg gtgcagcatc ggtggtcggc cgtcggcact 524281 ggctcgagac gctgcccgca gcgccgcctc gcgaaccagc gcagccaggt cggccacgac 524341 gaaacccggt gtgcgggagg cgatttcgtc gaggttgagg tctccggtag gaaccggatt 524401 cagcagcgcc tccagcagcg atttgcgggt ggccgcgtcg ggcagcggca ggccaagctc 524461 ccggtcgcac aactcggggg aacgcagccg ggcatcgagt tgatcgggcc gtgctgaggt 524521 ggcgatcaat accacaccgg cggtggccac cgcggtacgc agctcggaca ggatcagcga 524581 ggctaccggc tcggcggcgg ctggcagcag ggcgtcggca tcggtgatca gcaacacacc 524641 gccctcatgg cgaaccgcct gcactgccga ggccacggct ttgacccggt ctccggcggc 524701 cagagctcca atctccggac catccagtgt caccaacctt cggccgtcgc acaccgcgcg 524761 caccagcgtc gccttgccca ccccggccgg acccgacacc agcacaccca aattggtgcc 524821 ggcgcccaag gtctgtagta ggtgcggctc atcgagggca agcttgagcc attcggtgag 524881 cttggcagcc tgcggctggg cgcccttgag ctcttcgatc tggatctccg gactcgagat 524941 gctcacttgc ccggccgtgg acgtacccat tgcggccggg accccagcgc cccaggtgac 525001 cagcgagttg ggctgcacgc tgaccggccc gtcggggtcg acgccggtaa cggtcagcag 525061 ctccgaggtc caactgatcc cgaccgcagc tgccaatgcg cggctggcag ccgacgtgga 525121 tgtgccgggg cctagatcgc ggggcagcag cgagaccgcg tcaccgacgg tcatcacctt 525181 gccgagtagg gcctgccgca gcgtgaccgg cggcaccgac tgggtggcca gcgttgaacc 525241 gctcagcgtc accgatcgcg ctccgtagac ggtgaccggg ctgacgatca cctcggtgcc 525301 ttcgcgaagg cccgcattgg acagtgtgac gtcatcgagc agcaccgtcc cgaccgcggt 525361 gtctgccgcg gccaggccgg cgaccgcggc ggttgtccga gagccggtca gcgacaccgc 525421 gtcccactcg cggatgccaa gggcagcaat ggcattgggg tgcaaccgaa cgacgccgcg 525481 gcgtgagtcg acggccgagg tgttcagccg ggcggtaagg gtgagttggc gggccgggtc 525541 cgggtgggtc acagccgtcg acccggcttg cgcaggccca gccgcgccat cgacggccgg 525601 tagggatgcg cccggcggct cgcgcgccgc accgcgcgcc gttgcttggg cttgtcgtcc 525661 cacacctcag ggtgttgggc aagccagcgc tggctgcgca ccgcgaaagg aatatggcac 525721 atgtaggcga tgatgatcac ccagatcaac aagtaggggg ccaggactgc ggccgccgcg 525781 cagatagcca gcaccgccag cagggcggcc gcgtagttgg gtggtaccga cacggcgtgc 525841 atctttttca tcgggatccc gctgaccaag agtatcgacg ttcccgtcac ccaaaagctg 525901 aggaaccaga ccgaggtcca ccatccttcg ccgaactgca ttttgagggc tagcaggccg 525961 atcatggaaa ccgcgcccgc cggcgcgggc attccgacga agaattcatg cgcgtaggcg 526021 ggctgggttc cgtcgtcctg cagtgcgttg taccgcgcca gccgtaatac cacgcacacc 526081 gcgtagagca gcacgaccac ccaaccgacc ggccacttcg acaacatcga cacgtaaagc 526141 accagcgcgg gtgtcactcc gaagttcacc gcgtcggcca gtgagtcgat ctctgcgccc 526201 atccgcgact gggcatccag gatgcgggcc acccggccgt cgagcccgtc gaggatggcc 526261 gctgcggcga tcagtgccat cgcggccttc ggctggtgct cgagcgcaaa cttgattgcg 526321 gtcagtcccg cgcaaatgga cagcaccgtc atcgcgctgg gcagtatctg caggtttacc 526381 cctcgcctgc cgcggggctt tccgatcatc gacattcggc cagcacggtc tcgccggcga 526441 ccgcgcgctg gccgacgttg acgatcggct ctgcgcccgc tggcaggtag gtatccagcc 526501 gggagccgaa ccggatcagg ccgtaggtgt caccgatggc cagcttgtct ccgacgtgtg 526561 cgtcgcacac aatgcggcgc gccaccagcc cggcgatctg caccgcgacc acctcggcgc 526621 cgttgggcat gcggatccgc acactggtgc gctcgttgtc gtcgctcgcc tccggtaggt 526681 cggccgaccc gaaccggccc ggccggtgtt gcacggcgat cacttccccg ctcaccgggg 526741 cacgttgcac gtgggcgtcc aatatcgaca ggaagatgct gactcgcggt aacggcgtgt 526801 cacccatgct gagttcggcc ggtggggccg ctgagtcgat cgcgcagatc acgccgtcgg 526861 cgggcgcgac aatggcagcc ggcctggtgg gcggtacccg ctgcgggtgc cggaagaagc 526921 ccgcgcaggc agcggccgcc agcagacccg tgccgcgcaa ccaccggtag cggtgtccga 526981 cggccgcaat cgcaaggccg gcggcaatga acggccgccc ggccggatga accggtggaa 527041 cggcggaccg caccagggcg agcagatgtt gcgggccgtc ggggcggggg cgtctggcca 527101 cggggtcatc ttacggagct tcgtgccgca ggttgggtgc acggcactag gatcggtccg 527161 gttaggtcaa gtcccagact tgcagctgcg ttccggcagc cacctccacg acgtcctccg 527221 ggatgtccag aagtccgttg gccgatgcca accaacgcaa atggtgcgac gccggtgggc 527281 cgtagctgat gaccgtgcct gcctggtgat cgagtattgc gcgtcggaac tgacgtttgc 527341 cgcgcggcga tgtcaggctc gcggtgagta ccgcgcttcg gtgcggccgg tacggatccg 527401 gcaggcccat ggccatgcgc agcgggggac ggatgaacac ctcgaaggac accagcgcgc 527461 tgaccgggtt gccgggaagg gtgacgatcg gcgtacctgc cacccgcccg acgccctggg 527521 gcattccggg ttgcatcgcc accttgacga attcgacacc gtggtcgcct ccccggtagt 527581 cagcgctgcc gaacgcgtct ttgaccacct cgtaggctcc ggcactgaca ccgccgctgg 527641 tgatgatcag gtcggcgtcc accgcgtacc ggtcaaggat cgcgccgaac tgcgcgacgt 527701 cgtcgccggc ggttgcggtg gcgaccacag cggcgcccgc atcgcggacg gcagcggcca 527761 gcatgatcga gttggactcg tagatctgac ccggttgtag gggcgtgcct ggcgacgcca 527821 gctccgaccc tgtggagatc accagcaccc gctgacgggg gagcaccggc agctcggcca 527881 aacccagcgc ggcggccagg ccgagcaccg ccggggtcac gatctggccg ttgtgcagca 527941 ccgtggtacc ggcggcgacg tcttcgcccg accgtcggat gtgcttgcct ggggtggcct 528001 gttggcggat cgccaccgaa tcgacgccgc cgtcggtggc ttcgaccggc acgatcgccg 528061 tcgcaccggt gggcactggc gcaccggtca tgatccggtg cgcagtcaca ggctgcagcg 528121 tcagcatgtc ggcgcgcccg gcgggaatgt cctcggcgac cggcaacatc accggatttt 528181 gcggtgtggc acctgaggtg tcttcggcgc gcaccgcata gccatccatt gcggagttgt 528241 cgaaaaccgg cagcgacagc ggtgcgacca cgtcgccgcc caggaccaga ccttgagcct 528301 gggtcagcgg aaccgtaatc gggcgacagg cgcggatcat ctccgctacg acacgttgat 528361 gctcctggac tgaccgcacc cggccattat cggtcgttca gactccgaag ctgacgccgg 528421 tgagttcttc ggagacggtc cagaggcggc gctgcagatc tttgtcgtgg gactgcgcgc 528481 tggattggac caccttcggg tgaccgcgct gctcgccgaa cccgtccggg ccgtagtatt 528541 gcccgccctg cgtggtcgga tcggtggcgg cacgcagtgt tggcagggcg cccatctctg 528601 ggctttggaa aagcaacggc ccgagcacgg tagcgacggg ccggataagt cgcggcaggt 528661 tgcgagtcag ctcggtgttg gagccgccag ggtgagcggc gacggcgatg gtggatttgc 528721 ccgcttcgcc cagccggcgt tgcagctcgt aggtgaacag cagattagcc agtttggctt 528781 gtccgtaggc ggcgacgcgg ttgtaacggc gttcccactg caagtcgtcg aagtggatgg 528841 cagcgtgaat ccggtggccc tggctgctga cggtcaccac ccgcgaaccg ggtaccggca 528901 gcatgtggtc gagtaccagt ccggttagtg cgaaatgacc gagatggttg gtaccgaact 528961 gcagctcgaa accgtccttg gtgacctgct tcggcgtcca catcacgccg gcgttattga 529021 ttagcacgtc gatgcgcgga taggccgtgc gtaacgcgtc ggcggctgcg cgcaccgagt 529081 ccagcgagca cagatcgagt tgctgcagcg tgacgtgggc gcctgggcgg gcggccatga 529141 tgcgggcccg ggcggcgttg cccttctcga gattgcggac ggccaacact acgtgtgcac 529201 cgcggtcggc aaacacggcg gcggtgtggt agccgatgcc ggtgttggcg ccggtgacca 529261 caacgacgcg cccgctttga tcggggacgt ctgcggccga ccatttacgg gtcttgttgt 529321 cgttggcggt catgggccga acatactcac ccggatcgga gggccgagga caaggtcgaa 529381 cgaggggcat gacccggtgc ggggcttctt gcactcggca taggcgagtg ctaagaataa 529441 cgttggcact cgcgaccggt gagtgctagg tcgggacggt gaggccaggc ccgtcgtcgc 529501 agcgagtggc agcgaggaca acttgagccg tccgtcgcgg gcactgcgcc cggccagcgt 529561 aagtagcggg gttgccgtca cccggtgacc cccgtttcat ccccgatccg gaggaatcac 529621 ttcgcaatgg ccaagacaat tgcgtacgac gaagaggccc gtcgcggcct cgagcggggc 529681 ttgaacgccc tcgccgatgc ggtaaaggtg acattgggcc ccaagggccg caacgtcgtc 529741 ctggaaaaga agtggggtgc ccccacgatc accaacgatg gtgtgtccat cgccaaggag 529801 atcgagctgg aggatccgta cgagaagatc ggcgccgagc tggtcaaaga ggtagccaag 529861 aagaccgatg acgtcgccgg tgacggcacc acgacggcca ccgtgctggc ccaggcgttg 529921 gttcgcgagg gcctgcgcaa cgtcgcggcc ggcgccaacc cgctcggtct caaacgcggc 529981 atcgaaaagg ccgtggagaa ggtcaccgag accctgctca agggcgccaa ggaggtcgag 530041 accaaggagc agattgcggc caccgcagcg atttcggcgg gtgaccagtc catcggtgac 530101 ctgatcgccg aggcgatgga caaggtgggc aacgagggcg tcatcaccgt cgaggagtcc 530161 aacacctttg ggctgcagct cgagctcacc gagggtatgc ggttcgacaa gggctacatc 530221 tcggggtact tcgtgaccga cccggagcgt caggaggcgg tcctggagga cccctacatc 530281 ctgctggtca gctccaaggt gtccactgtc aaggatctgc tgccgctgct cgagaaggtc 530341 atcggagccg gtaagccgct gctgatcatc gccgaggacg tcgagggcga ggcgctgtcc 530401 accctggtcg tcaacaagat ccgcggcacc ttcaagtcgg tggcggtcaa ggctcccggc 530461 ttcggcgacc gccgcaaggc gatgctgcag gatatggcca ttctcaccgg tggtcaggtg 530521 atcagcgaag aggtcggcct gacgctggag aacgccgacc tgtcgctgct aggcaaggcc 530581 cgcaaggtcg tggtcaccaa ggacgagacc accatcgtcg agggcgccgg tgacaccgac 530641 gccatcgccg gacgagtggc ccagatccgc caggagatcg agaacagcga ctccgactac 530701 gaccgtgaga agctgcagga gcggctggcc aagctggccg gtggtgtcgc ggtgatcaag 530761 gccggtgccg ccaccgaggt cgaactcaag gagcgcaagc accgcatcga ggatgcggtt 530821 cgcaatgcca aggccgccgt cgaggagggc atcgtcgccg gtgggggtgt gacgctgttg 530881 caagcggccc cgaccctgga cgagctgaag ctcgaaggcg acgaggcgac cggcgccaac 530941 atcgtgaagg tggcgctgga ggccccgctg aagcagatcg ccttcaactc cgggctggag 531001 ccgggcgtgg tggccgagaa ggtgcgcaac ctgccggctg gccacggact gaacgctcag 531061 accggtgtct acgaggatct gctcgctgcc ggcgttgctg acccggtcaa ggtgacccgt 531121 tcggcgctgc agaatgcggc gtccatcgcg gggctgttcc tgaccaccga ggccgtcgtt 531181 gccgacaagc cggaaaagga gaaggcttcc gttcccggtg gcggcgacat gggtggcatg 531241 gatttctgac cccggcgaga agtcgcagcg aggagcccgg tccctttgtg gggccgggct 531301 cctctggttg ggagctacgg taccgagaac accacgcagt cgtgtaggca acctttggcc 531361 gctgtgggcg agtcgggggc cgcgtctcgg tgcagcagcg cgcggatggg tacgacaccg 531421 cagcgggcgg tgtcgtcatc ggggcctgcg tccgacgcct gggcacggcc gtcgacgatc 531481 agcgagtagc cgctaggatc ggatggcggc cacaacaggg tgacttcgct gcggtgggcc 531541 aggttttgcc gcgtacgacc cccgatcagg ccgacgtcga ccactgcccg gggtccatcg 531601 gggccgtcgg ggagttcgcg cagcaccggc tcgactgcca ccgtgtgcac gcgatggcca 531661 tcatcgacgg tgatcaggta agcgaacggg tagtcgggca aggcggcggc cagccgtttg 531721 aggtctacct ttttggcacc cacggattcg aggataggcg cccgatgtgt tactccgaac 531781 cgaccggctg cccgatccgc gggctggcgt aggcggattc gcggtcgggg ctcgggtaga 531841 agttcgactt ggggatgccg gagccggggg tactcggctc acgcacggcg gtattccgca 531901 agcccgagtc gttgctgccc gagttgacga agctcgggta gctggtgcca gggcttctaa 531961 ggcccgggtt tgcgcccgag ccagccgcgg cactgccgct accggggttc gggttgcctg 532021 agtccaggcc gccaacagga gcactggccg gggcggcgac gggcgtgttg gtcaggcccg 532081 agttgaggac gttcgccagg ccgtgttgga gaccgcccgt tgatccgagg gcggaggcga 532141 ggatgcccga actcaaagcc gccgtgctca tgccgccggt ggcgtagccg gcggagctga 532201 ccaaggccgc ctccgagcca gccgcgcttc ctaaggcggc gttttgcatc cccgcgttcc 532261 agaagctggt gttgaggctg cctgcgctgc cgaggcccgc gttgattgtc cccgaggtcc 532321 cgatgccgct gttcagggag cccgaattcc cgatgccgat gtttccgctg ccggagttga 532381 ataagccgac gttgccggtg cccgagttcc cgaagccgat gttgccgcta cccgagttga 532441 agccgccgaa acccatctgg tgatcaccgg tgatcccgaa cccgatattc ccgctaccgg 532501 tgttgccgaa gccgatattc ccgtcgccga ggttgccgag gcccaggttg ccgctgccgg 532561 tgttgccgct gccgatgttg ccggtgccgg tgttgccgct gccgatgttg ttgttgccga 532621 tgttgttgtt gccgatgttg ccgctgccgg tgttgccgaa gcccagattg atctggccgt 532681 tcttgccgat gtcgatgccg aggttccgca agacctgctg ccagggcgcc agttgtgcga 532741 cggccgcaga cgcatcgaag tggtaaccag ccatcgccgc cacgtccaat gcccacattt 532801 gctcgtatgc cgcctcgacg tccatgagcg ccggagcatt ctgcccaaac cagttcgtag 532861 ctgccagcag ctgcatcagg ccacgattgg ccgctaccac tgccggctgc acggtggccg 532921 ccagcgccgc ctcgaacgcg gtcgctattg ccatggcctg tgcggccgct tgttccgcct 532981 gcgctgccgc cgtgctgagc caggctaggt actgggttgc gacggccatc atcgccgccg 533041 cggacggacc cagccaggcg ccactagtca gttcggatgt gacggagcca agcgacgcta 533101 ttgacgcgag caattcttcg gccagctcgc cccaggcggt ggccgcagca attagcggtc 533161 ccgacccggg accggcaaac atcagtgccg aattgatctc tggcggcaac tacgcaaaat 533221 gcgggcttgt cactgatcca acttaactgt cagcgaccgt tgccgtggcg gtatcggcac 533281 ttcaatacca ctcatctttg gggtcatctt tggagcgccc ctaggaaccg ccagcttacc 533341 tagtcccggg taggggccga ctggcggccg ggatgcagct gagggtctgc cacctgcccc 533401 gtaatgtcgc tggtatggca agcaccgacg ccgcggccca agagttgctc cgcgacgcgt 533461 tcacccggtt gatcgaacat gtcgacgaac tcaccgacgg cctcaccgac caactcgcct 533521 gctaccgccc gacccccagc gccaacagca ttgcgtggct gctctggcac agcgcccggg 533581 tgcaggatat acaggtcgcc catgtggccg gcgtggaaga ggtgtggacc cgcgacggtt 533641 gggtggaccg ctttgggtta gatctgccgc ggcacgacac cggatatgga caccgtcccg 533701 aggatgtggc gaaggtacgg gcacccgccg acctgctgtc ggggtactac cacgcggtgc 533761 ataaactgac cctggaatac atcgctggca tgaccgcaga tgagttgtcc cgtgtggtgg 533821 ataccagttg gaatccgccg gttaccgtca gcgcacggtt ggtgagcatc gtcgacgact 533881 gcgctcagca cctcgggcag gccgcctacc tgcgggggat agcccgataa cggcgacatc 533941 cgccggatcg ctgaggcgat ggtcagctac gccgaagatc gcctgcaccg atggttacct 534001 gacgctagcc ggcagcgccg ccctagtggt acccggcgtg ttcgtcgcga tgctgggcac 534061 cattgtcgcg ccgagactgc ggtgaggggc cggggtgtgc gtcctcggct cacccgagcg 534121 gcagctcggc caagatggta ccggtgggct gtggtgatcc ggtgccgggt tcgacggtga 534181 atgccagtgc ggtcgaggct ccgagatcgg tcagcgtcgc cgtcgtcgag ggcgtcaccg 534241 ccgcggtgcc catcgtctcc gccgacctcg gccctttggc ccctcccagc agccacatct 534301 gatacacggt tccccgggat ggtggcgcca cattgttcat caccagcaga cctgtgttgc 534361 ggtcgcggga gaacaccacc gtggccgtcc cggcgcccag tgggcgagag accgtccgta 534421 cgtccggcgc cgtcagaact tgctcggcca cggtgggggg tggcgatggc cgggtcagca 534481 cccccaggtc gaacgccccc agccccacag cgatcgccgc tgcggacgca aaggctgccg 534541 tacgccagcg tgattggcgc ctaacctcgg gcttggtcgc atccaggatg gccgtccgca 534601 gatgtgctgg cggctcggcg gtggtggccg ccgagacgac ggccatcgtc tcgcggacgg 534661 ctcgaacttc gtcgttgaaa gccgcggcta ccggcgaggg cgcggcggcc acccgtcggt 534721 cgatgtcggc tcgttcatcg tcggacacag cgttcagggc atacggggta gccagctcga 534781 gcagctcaaa atcggtatgt tcagtcatga gcgccgctct cccaacgcat cgcttcgctc 534841 ggccggcgca gtcatgacac gtccaggcag ttgcgcaggc tgcgcagggc gtcgcgcatg 534901 cgggatttga tggtcgacag attggccgct aaccgccgcg aaacttcgac atacgtcagc 534961 ccgccgtagt aggccagttc gatgcactgc cgctgcgtgt cggtcaacgc cttgaggcac 535021 tcggtcaccc ggcgccgctc atcaccggcg atcgccaggt cggcgacgac gtcactcgcg 535081 ggatcgacgt tggccgcacc atagcgcact tcccgctggt tgccggcttg ctcgcaacgg 535141 actcggtcga cagcgcgccg gtgggccatg gtcaaaagcc aggccaacgc ggaacctttg 535201 gcggagtcaa actccgacgc gttccgccac acctcaagat agatctcctg ggtggtttct 535261 tcgctgtagc cggtatcacg cagcacccgc atcaccagtc catacacccg cgacttggtg 535321 tggtcgtaga attcggcgaa tgcggcctgg tcgtgaccag cgacccggcg caacagggcg 535381 tccaggtcgc tgctcagccg tggcggtccg gtcatcgatg ggtagcctat cgccagccgg 535441 cgccgagatg gtcaagccgg tcatcaccga cgcgccgatc gcggtggccg gggcacgaaa 535501 taggctgttc gcctttgata ttcggcgaaa ccggggcgac ccttcaggta tctctcagtc 535561 agccgggctc cgctgacgtc caccagcagg taggtcatca gcagcggcga acccaccgtg 535621 gccagcggcg cccagtcgtt gatcgtgatc aaccacaacc cccaccagac acaggcatcg 535681 ccgaagtagt tggggtgacg cgtccaggcc cacaggccgc ggtccatgat gaccccgcga 535741 ttggccgggt cggatttgaa tacccacagt tgccaatctc ccaccgcttc gaaggtgata 535801 ccgaccagcc acacggctaa gcccacgccc ccaacagcca gtaacggctt cggcgtcggc 535861 ccggtgactg cggaaagctg cagcgggaat gagacgaaca acgtcaggag gccctgtaat 535921 ccgaagacct tgcgcaatgc ctgcacaggc gtggcaccgc gcagcaggtc ggcgtagcgg 535981 ggatcctccc cctgaccggc tgtcttgcgg tacatgtgcc agctcagccg cagaccccag 536041 gtcgacacca acgctagtag cagccatcgg cgaaccgggt cgccgtggcc gagcgtcgcg 536101 gcggcgacgg cgacggcgac gaaacccaag ccccatacca cgtcgacgac gttgtaccgg 536161 ccgatgcggc ggccgatcgc aaacgccacc gaatgcacca cggccacagc caaagccgac 536221 acgctggtta ccacgacgat gttcacgggg ggccctcgcg gatcaacgtc cactggtaga 536281 cgtccagata gcccgaccgg aagcccgcct ccgagtacgc caggtacagc tcccacatcc 536341 gtgcaaacac ctcgtcgaaa cctaaatgcg ccagcccatc tcgccgctgc ataaatcgtt 536401 cccgccagag ccgcagcgtc tcggcgtaat gcggtcgcag cgaggccgcg tcgacgatgc 536461 gcagcccggt gtgttgcccg gtgatgtcga tgatggcctg cgtggacggt agcagtccgc 536521 cagggaagat gtacttctgg atccaggtct gggtgtggcg ggtggccagc attcggtggt 536581 gcggcatggt gatcgcttga atcgctaccg ggccacccgg gcgcaccaac tgttctagcg 536641 cggcgaagta ccgtggccac gaacggtatc ccaccgcctc gatcatctcg actgagacta 536701 ctgagtcata ctgcccgtcg acgtcgcggt agtcgcacaa gtcgatctct acccggtggc 536761 caaagccggc cgcggcgacc cgctgccgag ccagccgttg ctgctccacc gatagggtca 536821 ccgagcggat gtgggccccc cgtgcggccg cgcgaatgca cagctcgccc catccggtgc 536881 cgatctcgag aacgtggctg ccctgctgga ccccggccac gtcgagcagc cggtcgatct 536941 tgcggcgttg ggctgcggcc aactcggtcc aggcgggagt tggctgggcc agcaggtcgg 537001 tgaacattgc gcacgaatac gtcatggtct cgtcgagaaa cgcggcgaac aggtcgttcg 537061 acaggtcata gtgcacggct atattgcgcc gggcctgatc tcggctgtgg tctggccaac 537121 taggtcgaaa ggtcggcgtg atcggccgca gccagtgcag cgagcgcggt accagctcgt 537181 ccaccgaccc tgccagcacg gtcaacaccc gcgtgagctc cttcgacgac cattcgccgg 537241 ccatgtagga ctcgccgaag ccgatcaagc cgtggcgccc gatccggcgt gcaagtgcgt 537301 ccggccgatg gatgaacagg ctgggtgcgc gcggatcggc ggcacctgtt gccgttccgt 537361 cggagtagac caatcgcagc ggcaagtgag tggccgtgcg ccgaagcagc cggttggcga 537421 ttgccgccga tgccgcggct aggggaccgc gcggcacctt ggcaaccgct ggccagcgat 537481 ccgaatcgat tgctgccgac ggtgtctggc tggtttcgac ggtcatcgcg gcaccaccgg 537541 aactcgacgt agccacagtc tgatcccctg tatcctgatg cgcgcggcca ccaccatcgg 537601 cgccagcggt gaaatgattt gcatcatcgc gatctgtctt gtcgttgccg gtcgccgctg 537661 cccacgcagg gtggctgtga attccgggca cacctgccgg cggtcacggt gcagcgtcac 537721 cgtgacgtcg agttcgcggt cgggccgtgg tgcccgtatc aggtagtagc cggctagctg 537781 atgaaacggc gaaacgtaga agttcttggc cgtcaccacg ggcaggtcgg ccggcggtag 537841 caggtaagca tggcgtccgc cgtaggtgtt gtgcacctcg gcaatgacat ggcgcagttg 537901 gccgtcgcgg tcgtggcacc agaagatgct caacgggttg aagacatagc cgagaacgcg 537961 tgcttgcagc agcgcggtga tacggccgtc ggggacggca aggccgcgag cggcaaagaa 538021 ggcgtccagc cggtcacgca gcgagctatg cggcggacac gagaacgggt cagcgaagtg 538081 gtcgtcggcg tggaaccgtg cgaacggtcg cagccaccag ggcagctggg ggaggttgtc 538141 gacatccacg taccagctgt agctgcggta tgcgaacgag tggtgcaccg ggacttgtct 538201 gcagtggctg atcgtggtgc ggtagatcgc cggcgtcagg gtttgagtca gcacgcgacc 538261 atcgcctcct gtgggatcgc tgccggccag tcggcgccaa ggcgccgggc cgcccgcaga 538321 cccgaggcgg cgccgtcctc gtggaatccc cagccgtggt aggcgccggc gaataccacc 538381 cgattgtcac ccagcgtcgg caataagcgt tgggctgcaa ccgattccgg tgtgtacagc 538441 ggatggctgt aggtcatctc ggcgatcacc gagctgggat caacccggtc gtggccgccg 538501 agggtgacca gataccggcg gccaccgtcg aggcgcatta gcctgctgat gtcgtagctg 538561 accacgacct ggtgctgccc gggtgtcacc aggtagttcc aggatgcgcg ggcgcgatgg 538621 tggcggggca ggaccgactc gtcggtgtgc agctgggcgc tgttggtgga gtatgcgatc 538681 gcgcccagga ccgcgcgctc ggccggtgtc ggctcgtcga gcaacagcag cgcctggtcg 538741 ggatggaccg cgacgacggc cgcatcgaaa cgccgcgacg gcccatcacc cgcgcccacc 538801 aataccccgt ccggcagccg gcgcagcgag tgcactggcg tgcgggtcga cacctcgtcc 538861 agctgagctg cgatcgcctg cacgtagttg gcggaacctc cggtgacggt acgccaggtt 538921 ggcgacccga acaccgacag catgccgtga tggtcgagga agacgaacag ataccgggcc 538981 ggatagcgca aggcgtcggc cccgccgcag gaccacacgg cggcgaccaa gggtgtgatg 539041 aagtaatcga cgaaatactg cgagaagtgg tgccggctca ggaaggcttc cagcgtctcc 539101 ggtttgtctt ccgcgttgtc ggtctcctca cgcagcaggc gagccgcggc gcggtggaag 539161 cggagaatct cggcaagcat gcacagatac cgtggccgca gcgattgccg gcaagcgaac 539221 agcccgcgcg ctcccagtgc gccggcatat tcgagtccga tgtcgtcggc gcgcaccgac 539281 atcgacattt ccgactcctg ggtggccaca cccagttcgg cgaacaatcg gcacaacgtt 539341 ggataggttc ggtcgttgtg caccaggaac gccgagtcga cgccgacgac gtcggtgccc 539401 cgggggccgc caccgttgtc cagatagtgg gtgtgggcat gaccgcccag ccggccgtcc 539461 gcctcgtaca gggtgactcg gtcccgtcca gacaggatgt aggcggcggt gaggccggcg 539521 accccacttc cgacaacagc caccgatcgt cggagtgatt gctgcacatc ctgtattcgg 539581 agcggccggc tagacggacg ggcggttcag ccgaggcggt cgctgctcat cgccaagggc 539641 cggcccgcgg gctgggtttc gctgggtacg gtcggggtcc gggcgggccg ggaacgcacc 539701 cgcagcggcc accagaacca gcggcccagt agtgcggcga tggatggcgt catgaacgat 539761 cgcacgatca acgtgtcgaa cagcaaaccc agaccgatgg tggtgcccac ctgtccgata 539821 acccgcagat cgctgacggc catggacgcc atggtgacgg cgaataccag ccctgcgttg 539881 gtcacgacct tgccggtgcc gcccatcgac cggatgatgc cggtcttcag ccccgctcct 539941 atttcctgtt tgaaccggga gaccaagagc agattgtagt cagatcccac cgccaacaga 540001 acgatgaccg acatcgcaag cacgagccaa tgcagatgga ttgcgagaat atgctgccag 540061 agcagcaccg atagtccgaa agaggcaccc agtgaaagtg cgactgtgcc cacaatgacg 540121 gcggcggcaa taaaggcccg tgtgatgatc agcatgatga taaaaatgag acagagggac 540181 gaaattgccg cgataagaag gtcccattgg gcgccctcgg agatgtcgtg gaagacggcc 540241 gccgtgccgg ccaggtagat cttggcgtct tctagtggag ttcccttgag cgattcctcg 540301 gccgcggtac gaatcgcgtc gatacttttg atgccctcgg gtgattgcgg atcccccctg 540361 tgcaggatga taaaccgggc cgcgtgtccg tccgaagaca ggaacgactt catggcgcgc 540421 tggaagtctt tgttcttgaa aacctcgggt ggaaggtaga acgagtcgtc gttcttggcg 540481 gcgtcaaaag ccttacccat ggctgtggca ttgtcgctca tttcgagcat ctggtcgaag 540541 attccggtca tggtgctgtg catggtaaga atcatggtcc gcatgttttc catggcctca 540601 atctgcggcg ggatctgcgc gaccatttgt ggcatgaggc gatccatctc gcgcaagtcg 540661 cccaagagga cgcctatttg ctcgctgagc ttgtcgattc cgtccagtgc atcgaatatc 540721 gatctgaacg accaacagat cggaattccg tagcagtgct tttcccagta gaaatagctt 540781 cgaattggtc tccagaaatc atcaaaatcc gcgacgtggt cgcgtaattc ttcggtgatc 540841 tccttcatct cttcggtgtc gccgaccatg cggtgggtag tgctggccat ctccgccatc 540901 aggctatgca tccgcgtcaa caccgcaatc gtcgtggcca tctcgtcggc ctgcttcagc 540961 atgtcgttcg cccggtcgcg ctggtacttt atggtctgca gctgaccggc attttgcatg 541021 ctgatctgga acgggatcga cgtgtggtcc atcgtcgttc cttcgggccg ggtaattgct 541081 tgcacacggg aaatgcccgg gacccggaag atgcctttag ccagcttgtc caggaccaga 541141 aaatctgccg gattccgcat atcgtgatcg gattcaatca ttaggatctc gggcttcatc 541201 ctggcctgag agaaatgacg atccgcggcc gcatatcctt ggttggcggg tatgaagtcc 541261 ggtaggtagt cacggtcgtt gtagctggtt ttgtatccag gcagggcgag cagaccgact 541321 agggcgatcg cgcaggtggc gacgagaacg ggcagcggcc agcgaaccac cacggtaccc 541381 acccgccgcc agccacggac tttgaggagc cgcttagggt cgaacaggcc gaaccggctg 541441 ccgacgtgta ggacggccgg acccagcgtc aacgcgaccg ccactgcgac tagcatcccc 541501 accgcgcagg ggatgcccag ggtttgaaag tagggcatgc gggcaaagct caggcaaaag 541561 gtagctccgg tgatggtcaa tccagagccc agaatcacgt gggcggtccc gcggtacatg 541621 gtgtagtagg cggcctcttt gtcctcgccg gcttggcggg cttcctggta gcgcccgatg 541681 atgaatatcc cgtagtccgt accggccgcg attgccagcg aagtcagcaa gctcaccgca 541741 aaggtggtaa gtccgatagc cccgctatgc cccagaaccg ctacgactcc gcgcgcagcc 541801 gtcaattcga cccccaccgt gatcagcagg agaaccacgg tgattatcga ccggtagacg 541861 agcaacaaca taataaagat cacggcgacc gtaaccatgg tgatcctggc catggatcta 541921 tcgccactgt ggtgcatatc cgcggcgagt gcggatggtc cggtcacata ggcctttatg 541981 cccggcggcg cgggcgtgct ttcgacgatg ctgcgtactg cctcgacgga ttcgttggcc 542041 agcggcgtgc cttggttgcc ggcaagtgac agttgaacat aggcggcctt gccgtcgtta 542101 ctttgcacgc ccgcggcggt gagtgggtcc ccccataaat cttggacact ttgcacgtgc 542161 ttcttatcgg ccctcaattg agcaaccagg ccgtcgtaat acttatgggc agcgtcgccg 542221 aggggttggt taccctctat tatgaccatc gcgaaactgt cggaatcgcc ttccttgaac 542281 accatgccga tacgtcccat cgcctcaaac gacggtgcat ccttgggact cagcgacacc 542341 gatcgctctt ggccgacagc ttccagtgac gggacaaata cggtgacaac gacgcaaact 542401 gccagccagc caaggatgat cggtaccgca aaggcgtgga tcatcctggc gatgaatggc 542461 ttttcggggc gagcgttggt attggagtcg ttcgcgaatt tagtactcac gcggacttca 542521 ccaagcagta agtataggcg ttgacttcgt tggaaaccct ctcggccctg accttgccgt 542581 ctaccgtgat tcggcagcca atgctgtcgc tattaccttg tgccacgata tttcccatca 542641 ccgccgcgtc gtttgtcgtg atatgcaatg accacggtag caccgctcca tcgacccgtt 542701 gcggctcgga attgacgtcg aaataactaa tgtcggcgac tgttccgggg ggtccgaaga 542761 tctcgtaagt caggtgttta gggttgaatg gtttgctgtt ttccaggttg gtgtcggagt 542821 acgacgggcg gttttcggag ccgaagaagc cgcggatccg gtgcacggtg aagcccccga 542881 cgatgaccac caccaggatg accagtggaa tccaagtccg cattagcacc ttgaaaatct 542941 cagatcccct tcaccggttg gcagtggtac ggcggacgat acccaacttt caaaatccgt 543001 tcgagctggt cgctacttga acgcaactaa gcctagccta agtaaaacat ggttttaggc 543061 ccgagctctc gactccttac ctcgttcgct ggagtgtaac gcatatcacg tgcgtaacgg 543121 cacgctacgt tatcggcagc cctcttacaa atcacacggt gtgcgttatc ctctggcggt 543181 ggcgcaactc ggcttccagc gcgcccgcac cgaggaaaac aagcgccaac gtgcggcggc 543241 gctggtggaa gccgcgcggt cgctggcgct ggagacgggc gtggcatcgg tgacgttaac 543301 ggctgtcgca ggtcgtgccg ggattcacta ctctgcggtg cgccgctact tcacctcgca 543361 caaagaagtg ctgctgcacc tcgccgccga gggttgggcg cggtggtcgg gcacggtatg 543421 cgagcagctg ggcgagccgg ggccgatgtc ggcaccgcgg gtggccgagg cactggccaa 543481 cggtctggcc gccgatccgc tgttttgtga tctgcttgcc aatctgcatc tgcatctcga 543541 gcaggaggtg gatgtcgacc gggtcatcga ggtcaagcgg accagcatcg cagccgtgat 543601 agcgctcgtc gacgcgatcg aaagcgcatt gccggcactc gggcgttctg gggcattcga 543661 catcctgctg gccgcttact cgctggcggc caccctgtgg cagatcgcca atccgccgga 543721 gcggctcacc gacgcctatg ccgaggagcc agagttgctc ccaccggagt ggaacctcga 543781 ctttgctgcc gcgcttactc gcctgctcac cgctacgctt ctcggcctgc tcgccggatc 543841 cccatgcgaa tgccggtcgc caacgcgctg aagcgggtgc gggacgaagg gggcgccgga 543901 cttgggcccg cttggcggcg gtaggtgacc aaactcacgc ttcttgggcg tgcgccgcag 543961 ccgaaccacg actattgcta gttgcaaacg atagtcatag tcaattgttg ccagacgcac 544021 agctggtgtt ggcgggagtc gccgatagag gagtgttcga catgacgttg cacgtcggtg 544081 ccgacggcct agagaccgca actacggcgc gcgccgtggc ggtcgctagg tccggaatgg 544141 attgtgtggc cggtgatgcg tcaggggcga cttcgtgccc acgcggtgag ctatgacgag 544201 cgcactgata tggatggcct ctccgccgga ggtgcattcg gccttgttga gtagtgggcc 544261 ggggccgggg ccggtactgg ccgccgccac agggtggtcg tcactgggcc gtgaatacgc 544321 cgcggttgct gaggaactcg gggcattgct ggctgcggtg caagccgggg tgtggcaggg 544381 gcccagcgcc gaatcatttg ctgccgcgtg cctgccgtat ctgtcttggt tgacgcaggc 544441 cagcgccgac tgcgccgcgg cggctgcccg gctggaggcg gtgaccgccg cctacgccgc 544501 ggctttggtg gccatgccca ccctggccga gttggcggct aaccacgcga cccacggggc 544561 catggtggcg accaatttct tcgggatcaa caccataccg atcgcggtca acgaggccga 544621 ctacgtgcgg atgtggcttc aggcggccac cacgatggcc acctatcaag cggtcgcgga 544681 ctcggcggtg cgctcgatcc cggacagcgt gcctccgccg cgaattctga aatccaatgc 544741 ccaatcccaa cactcgagct cgaataattc cgggggcgcg gacccggtgg acgacttcat 544801 tgcagagatc ttgaagatca tcaccggcgg tcgcgtgatc tgggaccccg aagccggcac 544861 tgtcaacggc ctcccctatg acgcttatac caaccccggc acactcatgt ggtggattgc 544921 cagaagtctg gaacttcttc aagactttca agagttcgcc aagctgctgt tcaccaatcc 544981 ggtgaaggct tttcagttcc ttgtcgacct catcctgttc gactggccta cacacatgct 545041 gcagctggct acctggctgg ccgagaaccc gcagttgctg gtggctgcgc tcaccccagc 545101 catctccgga ctgggagcgg tatcggggtt ggccgggttg accggcctag tccctcagcc 545161 ccccgtcgtg cccgcgccgg cacccgatgc ggtcgtgccc accgtgttgc cactcgccgg 545221 gacggccacg ccgactaccg cgccggccag cgccccggcc gccggagcgg cgcccgggcc 545281 cccggccggt accgccactg ccacatcggc gtcggtgcca acgagcgccg gcggctttcc 545341 cccttacctc gtgggcagcg gtccaggcat cgacttcgac gcggggacgc ccgccggttc 545401 caggagagcg cagcccgccg cggataacgt cacggccgtg gcggcagcgc aggtgtcggc 545461 ccgtcatcag gcacgtcggc gccgacgagc ggcggcgaag gaacgtggca acgccgacga 545521 gttcgtcgat atggactccg gcccggcgat tccgccgtcg ggcgagcggg acgcttgggc 545581 gtccaattcg ggcgtgggcg ggctggggtt tgccggcacc gcaagcaacg agacggtggc 545641 agcgccggcc ggattgacca cgctggccga cgatgagttc cagtgtggcc cacggatgcc 545701 gatgctgccg ggcgcttggg acttgggaac ttgggaccgc ggggactgat taccctacaa 545761 cgcagcgacg tcgcgcatga tgtcggtggg ttcgcgcacc ggcgccccac aggtcaggca 545821 gaacgcgccc ggggaacggg tgagccgacc gacttgaagc aggactttgg cctcgacgtg 545881 ccacaagcag gcaatgcaca gaatttcgac ggtgttcccg aatgggtcca ggtcggggtc 545941 gttacattcg tctaccgcat gcagatgcac cacgtaactc gcccggttgg tgcacccggc 546001 tccggactgg caggtgattc caccccagtc cagggccgcc agcgtgtgtg ggatctcgtt 546061 gccgggcgct tgactcatgc gccgcgctcc agtgtccagg ccatgcggcc cacgatgttt 546121 acctctgccc cgcaacggca tggtatcccg gcgcgtggcc ggtggtggct gggctaccaa 546181 gagcgaagtc gggcatggcc ttagtcctag tggtacgcga taggtcgtcg aattccgtgg 546241 gtgatggata tgactatttc gtagctggtc gccagaatca atccgccgaa cggcggctga 546301 tgggcccaac gggctgtccc ccgaatggtg gacaacattt ccgggttcgt tgcaaacgac 546361 cgcgctttga cgccggttag ctttaggccg gacttaggcc cagttccaca ccgacatgtc 546421 gccggctggg tatccattgc acacctcggt ccctttagcg acgacgccct tgttgttgaa 546481 gaagattttc atgtgattga cccaggcaaa cgtcagcgga tcgccattgt aaaagtgttc 546541 ggagtagtct cggcgctccg ccggtgacag cgagaagaac cagtgcgcct tgttgatcgt 546601 cgcttgctga aggtttgcat ggttgttgaa gtcgatcatg taccgctggt agtacaccgg 546661 actggtatcc cgcaccgccg ccagatattg ttcggcgtcg caggtggttg cgatcatccg 546721 gcgaggtatt ggaaagtctt ccgtggagtc ggctgccgcg ctttgtggaa atgtcgcagc 546781 ggcgatgccg agaaccagaa atgccgcgcc ggcacgcagg atggaactca gccgagacat 546841 agtggttacc gtagcacttt tggggcgcct cgaggcgggc agacgacaag gttcatagtc 546901 tgtctcacta catgctccca tcaggagtga tgacgtgcgt ggggtcgggt cgcagttccg 546961 gtggggcttg gctgtagtcg ccgaacgggc cgtcgcggcg ctcgaccgcg gctcgcacac 547021 cctgggtttg ggcggtccgg atgaactcga gcgcgtcggg ggtgttgcgc atcagcccgt 547081 cgagaatgcc gcccagcagc tgggtggagg ccaggcccat gttctcgtag gcctggttga 547141 cgatcagttt ctgggcttgc aactgtgaca acgggattcg tgccagctcg gtggcgatct 547201 cggcgacgcg agcctcgagc cgctcgaacg gcaccgcctc gttgatcagc tcggcttcgg 547261 cggcctgcac accggtcagc ggccggcccg tcagcgagtg ccatttgacc ttggcaaggc 547321 tgagtcgata cagccacatc ccggtcaaat aggctcccca catgcggcta tacggggtcc 547381 cgatcacggc gtcctcgctg gcgatcacaa tgtcggcaca cagcgcgtag tcgctggccc 547441 cgccgacgca ccaaccatgc acttgcgcga tcaccggttt ggacgcccgc cagatggcca 547501 tgaatttctg cgtcggtccg gtctcccgcg cggtgaccat ggcgaaatcc ttgcccggat 547561 cccatcggcc gtcggtcatc atggcatcgc cccaatgctg gaagccgccg ccgaagtcgt 547621 aaccgccgga gaaggcgcgg ccggcaccgc gcagcacgat gaccttgatg tcctggtcgc 547681 gctcggccaa cccgatagcg gcctcgatct cgtcgggcat gggcgggacg atggtgttga 547741 gctgttccgg gcggttgagc gtgatggtgg ccaccggccc ggccgtcgtg tacagcagcg 547801 tctggaaatc gggtgtcggc atagcagcag cgaagtcact tcggccctaa gggtcaagtg 547861 tctcagcggg gatcgtgata acgccgctgg ttcgaagctt cggccaaccc gggcgcaggg 547921 tttcgctagc tggcatttgc atgcctcggg catcggtgtc cggttgcgct ctttgctccg 547981 acgttagccg cagggccctg cggctaggcg cggccggtgc cgttggccgc ggcggcaatc 548041 gatgttgcag cagttacaac gccaaatgga gtctgagcgc atcgtcgagt tcgatcagct 548101 cggcagggga gacgttgcgc agcgacggat ccaacctgct gggcctgcgc cttcgaatcg 548161 acggccaggc caccgctcgc tgccggcaac aacacctgga atggggacct tttcggtgtt 548221 gctggtaacc gggacaaccg gcaccacgcc tcggtcgaga cgtatcgcgg cagcgttggc 548281 cctgtcgttg ctgacaatta ccgctggccg ccgcatattt gccgcgctgc cgcgggccgg 548341 atccaggtcg acctgccaga tctcaccgcg cagcatctac gccgttcgct gcaaaccgcc 548401 gactgcgacg gcaggcccac tctcttggca tgcgtccaat gctgcgacgt cctcggtaga 548461 caagctcacg cttggcttca tgccgcagtc ctacccatgt agtaacagat agtaatacgt 548521 agtaataggt agtaatgcag tatcaatcgg ctacaactcg atagccacgt tatttgggct 548581 aagtccaccg ttcgtgaatg ccggttagcc ggccagcatc cgccatagga acgcgaaact 548641 cagcgccgat ttgaatgcga tctgtgcgtt gtcggctgcg ccggcgtgcc caccctcgat 548701 gttttcgtaa taccagacgg ggtggcccgc agcctgcagg gccgccgtca ttttgcgggc 548761 gtggccgggg tgcacccgat cgtcgcgggt agaggtcgtc atgagtactg gcgggtattt 548821 ccggttcgcc gaaatgtttt ggtatggcga atattcagag atgaacttcc agtcatccgg 548881 gttatccgga tcgccgtatt cggccatcca ggaagcgccg gccagcagca ggtggtaccg 548941 cttcatgtcc agcagcggca cgtcgcagac cagcgcgccg aacttctccg ggtacccggt 549001 caacatgatg cccatcagca gcccaccgtt gctgccgccc cgcgcgccga gctgctcagc 549061 ggtggtgatg ccgcgggtca ccaaatcggt tgccacggcg gcgaagtctt gggcgacctt 549121 gtcccggccc tcgcgcatcg cctgcgtgtg ccagccaggc ccgtactcgc cgccgccgcg 549181 gatgttggcc aacgcatagg tgcccccgcg ggccagccac agccggccca ggacgccgtc 549241 atacgtcggc gttctggatg tctcgaatcc accgtagccg ttcaacaatg tggggccggg 549301 attgtccgcg tcggtgcgtc gcacgacgaa atacgggatc gatgtgccat cgtctgatgt 549361 cgcgaaatac tgtgttacag ccatgttttc cgcgtcgaag aaagctggcg cagatttgat 549421 ctctgctagt cggccgtcat cggtgccgcg catcagccgc gacggcgtat cgaatccact 549481 ggagtcgagg aagaactcgt cgccgtggct gtcggcggag acgatgacgg tgttggtggc 549541 ggcggggata cctgagagtg gctcacgtcg ccagctgccg ggagttgcga tctcgacgcg 549601 gctcgccacg tcggccaggg tgacgatcaa cagccggtct cgggtccagg cgtattggta 549661 cagcgcggtg tgctcgtcgg gttcgaacac cacctgtaat tccgctgagc cggcaaggaa 549721 ttcgtcgtat tcggcggcca gcagtgagcc ggcagtgtac ctggtggtgg ccacggtcca 549781 gtcggtgcgc agctcgatca acagccagtc gcggtgaatt gacacgctcg cgtcggtggg 549841 ggcttcgatt cggatcagct ccgaaccacg caattcgtag acctcttcgt tccagaagtc 549901 gagggcccgt cccagcaggg tgcgctcgaa tccgggcgtg cgatccgctg acgcgttgac 549961 gcggacgtcg gtgcccgcgc cctcgaagat tgtctccgca tcggccagcg gtttgccccg 550021 gcgccatcgc ttgatcactc gcggatagcc ggaagtggtg agcgagtcgc cgccgaagtc 550081 ggtgcccagc aagacagtgt ccgggtcctc ccaggtaatc tgggatttgg ccggtggcag 550141 ctggaaccca tcctcgacga attcgcgtgt cagcatgtcg aattcacgca caatggatgc 550201 atccgagccg cccggggaca ggccgatcag cgcgcgcgtg tagtcgggtt cgatgacacc 550261 ggcgccgccc cacacccact tctggtcgtc ggcgcggccc agttcatcaa catcgatcag 550321 cacatcccag cccggcgagt cggtgcggta gctgtccagc gtggtgcgcc gccacaaccc 550381 gcgggggttg gcggcatcgc gccagaagtt gtagagatag ttgccgcgcc tgttcacata 550441 ggggattcgg gcatcggtgt cgagcacctc gagcgcctcg acgcgcatcc gctcgaactc 550501 tgcgtcgcag aacgccgccg ttgtcggctt gttgcgcgcg cgtacccaat ccagcgcttc 550561 cgcaccggtg acgtcctcga gccataggta ggggtcagcg ccgtctgggg caggctcaaa 550621 tgtcatggaa gccattgtgg ccccggcggt agtgtgagct gtattacatg attttgacga 550681 ggagccgaat acgatgactg tcttttcccg tcccggttcc gccggggcgc tgatgtccta 550741 tgaatcccgg taccaaaact tcatcggggg ccagtgggtc gcgccggtcc atgggcgcta 550801 cttcgagaac ccgacgccgg tgaccggcca gccgttctgc gaggtgccgc gctccgacgc 550861 ggccgacatc gacaaggcgc tcgacgccgc gcacgcggcg gcgccggggt ggggcaagac 550921 cgcaccggcc gaacgggcgg cgatcctcaa catgattgcc gaccgcatcg acaagaacgc 550981 cgccgcgctg gcggtggccg aggtctggga caacgggaaa ccggtccggg aagcgctggc 551041 cgccgatatc ccgttggcgg tcgatcactt ccggtacttc gccgcggcga ttcgcgccca 551101 ggagggcgcg ctgagccaga tcgacgagga caccgtggcc taccacttcc acgagccgct 551161 cggcgtggtg ggccagatca ttccgtggaa cttccccatc ctgatggcgg cctggaagct 551221 ggcgccggcg ttggcggccg gcaacacggc ggtgctcaaa cccgccgagc agacacccgc 551281 ttcggtgctc tacctgatgt cgctgatcgg tgatctgttg ccgcccgggg tggtcaacgt 551341 ggtcaacgga ttcggcgccg aggccggcaa gccgttggcc tccagcgacc gcatcgccaa 551401 ggtcgcgttc accggggaaa ccaccacggg gcggctgatc atgcaatacg cctcgcacaa 551461 cctgatcccg gtcaccctgg aactcggcgg caagagcccc aacatcttct tcgccgacgt 551521 gctggccgcc cacgacgact tctgcgacaa ggcgctggaa ggcttcacca tgttcgccct 551581 caaccagggc gaggtgtgca cctgcccgtc gcgcagtctg atccaggccg acatctacga 551641 cgagttcctg gagctggcgg cgatccggac caaggcggtc cggcagggcg acccgctgga 551701 caccgaaacc atgctgggtt cccaggcctc caacgaccag ctggaaaagg tgttgtccta 551761 catcgaaatc ggcaagcaag agggtgcggt gattatcgcc ggaggcgagc gcgccgaact 551821 aggcggcgac ctgtccggcg gttattacat gcagccgacg atcttcaccg gcaccaacaa 551881 catgcggatt ttcaaggagg agatcttcgg gccggtggtc gcggtgacgt cgttcaccga 551941 ttacgacgac gcgatcggca tcgccaacga caccctctac ggcttgggtg ccggtgtgtg 552001 gagccgcgac ggcaacactg cctatcgggc cgggcgggac atccaggccg gccgggtgtg 552061 ggtcaactgc taccacctct accccgcgca cgcggcgttc ggcggctaca agcagtccgg 552121 catcggccgg gagggccacc agatgatgct gcagcactac cagcacacca agaacctgct 552181 ggtgtcctac tcggataagg cgctggggtt cttctgatga acgctcccgc gggggtgctc 552241 atcaccgccg aggccgccgc gctgctggct gggttacagg accggcacgg tccggtgatg 552301 ttccaccaat ccggcggctg ctgcgacggg tccgcgccga tgtgctaccc gcgggcggac 552361 ttcctggtcg gtgaccgcga catcttgctg ggtgtgttgg acgtcgggga agacggcgtg 552421 ccggtgtgga tttcgggccc gcagtaccag gcctggaagc acacccagct gatcatcgac 552481 gtggtgccgg gccgcggtgg cgggttcagt ctggaagcgc ccgagggcgt gcgctttctc 552541 agccgaggtc gggtgttcag cgacgccgaa aaggcgatgc gggaggctgc gccggtgatc 552601 accggcgcag cctacgagtg cggcgaacga ccgttagtgc ggggtcttgt cgtcgatctc 552661 gacgatccag atgccacgcc gggagtgtgc cgcgccagtc ggcggtagcc gcagtaaggt 552721 cgtagaccgt gatccccctt ccgcggtcat ggcagctgac cagcgcgatg ctggttggta 552781 atgcgatcgg actgctagcg ggggtggcgt gcagcgtgct ggtgcatgcc cggatccgtc 552841 cggacatcgt catcgcaatg gtagtcggga ttcccagcgc gatcgggctg ctggtcatcc 552901 tgttctccgg acgtcgatgg gtgacgatgc tgggcgcgtt catcctggcg ttggcgccgg 552961 gttggtttgg tgtgctggtt gcgatccagg tggcgtccag tggctgacaa cgattaccgg 553021 tcggcacccg gaaccgagcc gtttgtgccc gatttcgaca ccggcgcaca ctcgcagcgg 553081 ttcctctcgt tggccggcca gcaggacagg gcggggaaat cctggccagg ctcgacgccg 553141 aagccgcagg aggaccccgt gggtgtcgcg ccttcggcca gcgtcgaggt gctggggtcc 553201 gagccggccg ccacgctagc gcactcggtt acagtacccg gtcgatatac ctacctgaag 553261 tggtggaagt tcgttctagt ggtcctcggc gtatggatcg gtgctggcga ggtcggcctg 553321 agcttgttct actggtggta tcacacactc gacaagacgg ccgccgtgtt cgtcgtcctg 553381 gtctacgtcg tcgcgtgcac cgtcggtggc ttgatcctgg cgctggtgcc gggcaggcca 553441 ctgatcacgg cgttgtccct cggagtgatg tcggggccgt ttgcctcggt cgccgccgcg 553501 gcgccgctct acggctacta ctactgcgag cggatgagtc attgcctggt cggcgtcatt 553561 ccgtactagt cggttgtcgg acttgaccta ctgggtcagg ccgacgagca ctcgaccatt 553621 agggtagggg ccgtgaccca ctatgacgtc gtcgttctcg gagccggtcc cggcgggtat 553681 gtcgcggcga ttcgcgccgc acagctcggc ctgagcactg caatcgtcga acccaagtac 553741 tggggcggag tatgcctcaa tgtcggctgt atcccatcca aggcgctgtt gcgcaacgcc 553801 gaactggtcc acatcttcac caaggacgcc aaagcatttg gcatcagcgg cgaggtgacc 553861 ttcgactacg gcatcgccta tgaccgcagc cgaaaggtag ccgagggcag ggtggccggt 553921 gtgcacttcc tgatgaagaa gaacaagatc accgagatcc acgggtacgg cacatttgcc 553981 gacgccaaca cgttgttggt tgatctcaac gacggcggta cagaatcggt cacgttcgac 554041 aacgccatca tcgcgaccgg cagtagcacc cggctggttc ccggcacctc actgtcggcc 554101 aacgtagtca cctacgagga acagatcctg tcccgagagc tgccgaaatc gatcattatt 554161 gccggagctg gtgccattgg catggagttc ggctacgtgc tgaagaacta cggcgttgac 554221 gtgaccatcg tggaattcct tccgcgggcg ctgcccaacg aggacgccga tgtgtccaag 554281 gagatcgaga agcagttcaa aaagctgggt gtcacgatcc tgaccgccac gaaggtcgag 554341 tccatcgccg atggcgggtc gcaggtcacc gtgaccgtca ccaaggacgg cgtggcgcaa 554401 gagcttaagg cggaaaaggt gttgcaggcc atcggatttg cgcccaacgt cgaagggtac 554461 gggctggaca aggcaggcgt cgcgctgacc gaccgcaagg ctatcggtgt cgacgactac 554521 atgcgtacca acgtgggcca catctacgct atcggcgatg tcaatggatt actgcagctg 554581 gcgcacgtcg ccgaggcaca aggcgtggta gccgccgaaa ccattgccgg tgcagagact 554641 ttgacgctgg gcgaccatcg gatgttgccg cgcgcgacgt tctgtcagcc aaacgttgcc 554701 agcttcgggc tcaccgagca gcaagcccgc aacgaaggtt acgacgtggt ggtggccaag 554761 ttcccgttca cggccaacgc caaggcgcac ggcgtgggtg accccagtgg gttcgtcaag 554821 ctggtggccg acgccaagca cggcgagcta ctgggtgggc acctggtcgg ccacgacgtg 554881 gccgagctgc tgccggagct cacgctggcg cagaggtggg acctgaccgc cagcgagctg 554941 gctcgcaacg tccacaccca cccaacgatg tctgaggcgc tgcaggagtg cttccacggc 555001 ctggttggcc acatgatcaa tttctgagcg gctcatgacg aggcgcgcga gcactgacac 555061 cccccagatc atcatgggtg ccatcggtgg tgtggttacc ggctacatcc tctggctggc 555121 ggcgatctcc gtcggcgatg gtctgacgac ggtgagtcaa tggagtcgcg tggtgttatt 555181 gctgtcggtc ctggtggcgg tgtgcggcgc ggcgggcggc ttgcggctgc gcagccgcgg 555241 caagctcgcg tggtcggcgt ttgctttcag tttgccgatt cctcccgtgg tgctgaccgt 555301 ggcggtgctg gccgacatct acctttgacg gctactgtgg gttgtccggc gggatggcca 555361 gggcggtgat cgttgcggcg atcgcgtcgt attgggttgc gagtaaacag aattcgatca 555421 acaggcgcgg atcgaggtga gttgccagcc gctcccaggt gcccgcggtg atcgtgcgat 555481 ccttgatcaa ttcatcggta gcctgtagca gcgcctgttg gcgggcgctg agcacttttc 555541 gcggtccgtc tccatctgga acgtcgggcc aggcgaatat cgtggcctgg gtgttggcgt 555601 ctaggccccg acggcgcgcc attcggcgat gatgctgaag ttcgtattcg caagatcgta 555661 ggtgtgcgac ccgaaggatc accaactcgg tatcgacgcc gggcagccgc ccgtgcagta 555721 gtcggccggt gtagatggca aaggtccaga acaagtactg gcggtagccc agcgtggtga 555781 acaggtgcat ctgcggtgcc ccaaccgcac gtgcggccag cttggccacc agccagttga 555841 ccggccccag ctggcggaac ttccccgggg agatacgcgc gacttggccg ttctgaccgg 555901 tcatagttgt ttcaccagat acggggacac cgtgctgcgg tgttcgtcga gatccagtgc 555961 ccgccccaag gcggggaagg cgcgttgcgg acagttgtcg cgttcgcaga cgcggcaacc 556021 ggcgccgata ggtgtggccg cagtattcgg gtcacccgac aagtcgagtc cttccgagta 556081 gacgagccgg tgcgcgtggc gaagttcgca gcccagcccg atcgcgaagg tcttaccggg 556141 ctgaccatac cgggcggccc ggagctcaac ggtgcgggcc acccacaggt agttgcggcc 556201 gtcgggcatc tgggcgattt gcaccaagat cttccccggg ttggcaaacg tttcgtagac 556261 gttccacagc gggcaggtgc cgccgctgga ggagaagtga aagccggtgg ccgactgacg 556321 ttttgacatg tttcccgctc ggtccacctg gacgaaggtg aacgggaccc cgcgcatcga 556381 aggccgttgt agtgtcgaca gccggtgggc gatggtctcg tagctcaccg agtagaacgc 556441 cgacagccgc tcgacgtcgt agcggaaatt ctcggcgacg tcgtggaact ggcggtaggg 556501 cagcacggtg gccgcggcga agtaattagc caggcccagc cgggccaacg tccgcgactc 556561 ggcgctggtg aacttgccgt cggtgaccat ggcgtcgatg aggtcgccga actcgagata 556621 ggccaactcg gcggccatct tgaacacctg ctggcccggg gagaggtgac tgctgatctc 556681 cagcgtgttg gtcgcggggt cgtagcggtg cagcacggtg tcaccgaggt cgatgcgctt 556741 gttgatgcgt actccgtgca cctcggtgag ccggcgggtc aattcgcggg ccaggtcgcc 556801 gtggtgcatc cgcatctggg ccgtgaggtc ttcggccgcg gtgtccagcg catgtagata 556861 gttctggcgt tggtagaagt agtcgcgcac ctcttcgtgc ggcatggtga tcgaccctcg 556921 gccactgccg tcggagaacc gctcctcggt cgcggcggcc agctgcgcgg tggtgatccg 556981 gtagcgccga tgcaggttga ccaccgcgcg ggccagcccg ggatgagcgc tgaccatttc 557041 ggccacttca tgcgggtcga tggcgatgtc tagatcgcgg tccagggtca cctccctgag 557101 ttcggcaacc agccgggtgt cgtcctggga ggcaaagaac gtcgcgtcca ccccgaacac 557161 ttcggtgatg cgcagcagca cggccacggt cagcggccgg acgtcgtgtt cgatctggtc 557221 cagatagctc ggcgagatct ccagcatctg ggccagcgcg gcctggctga acccgcgctc 557281 gttacgcagt tggcggaccc gcgagccgac gtaggtcttg gacacccaac cgagcgtacc 557341 gggtgttgtg aagacgccat tcgcagagtt agcaagcgtg ctgcgattgg tgtttccgcc 557401 acggcgttgg catgattcgc accgggactc aagggtgagc ctgaggtaca cgcgaggagg 557461 aaatggggag aacgccgtga gcctcgacaa aaaattgatg cccgtgcccg acggtcaccc 557521 cgacgtgttc gaccgagaat ggccgctgcg cgtcggcgac atcgaccgcg cgggccggct 557581 gcggctggac gcggcttgtc ggcacatcca ggacatcggt caggaccaac tgcgcgagat 557641 gggcttcgag gagacccacc cgctgtggat cgtccgcagg accatggtgg accttatccg 557701 gccgatcgag ttcggcgaca tgctgcggtg tcggcgctgg tgctcgggca cctccaaccg 557761 gtggtgtgag atgcgagttc gtgtcgatgg ccgcaagggc ggcctgatcg aatccgaggc 557821 gttctggatc cacgtcaacc gggaaaccga gatgccggcc cgcattgccg acgacttcct 557881 cgcgggtctg caccggacca cgtctgttga tcggctgcgc tggaagggct atctgaagcc 557941 gggcagccgg gatgatgcgt cggagatcca cgagttcccg gtccgggtca ccgatatcga 558001 cttgttcgac cacatgaaca acgctgtcta ttggagtgtg atcgaggact acctggcgtc 558061 gcatgcagag ctgctgcggg gccctttgcg ggtgaccatc gagcatgagg cgccggttgc 558121 gctcggcgac aagctggaga tcatctccca cgttcacccg gctggttcga ccgagatatt 558181 cggcccgggg ttggtcgacc gcgctgttac aacgctcaca tatgtggttg gcgacgagcc 558241 caaggcagtc gcctcgctgt tcaatctgtg accggatccg caggacgtcg atccgtgggt 558301 ttacctgcgg atttgtcatt actggcgggt agcttctgaa acggttcagt ttttgggcga 558361 cttcgcaaaa tttgcaaaaa gtccgcaggc cgttgccgaa attcgcaagt gaaatgggtg 558421 gaccagcgtt gacacgctgt gccatggtcg agttagcaca ccagtgaagc tgcgccgttg 558481 acaccgcctg gacgacggta gggcgtcagc gttttcggca atgaaagacc gttaaggagt 558541 tgtctatgtc tgtcgtcggc accccgaaga gcgcggagca gatccagcag gaatgggaca 558601 cgaacccgcg ctggaaggac gtcacccgca cctactccgc cgaggacgtc gtcgccctcc 558661 agggcagcgt ggtcgaggag cacacgctgg cccgccgcgg tgcggaggtg ctgtgggagc 558721 agctgcacga cctcgagtgg gtcaacgcgc tgggcgcgct gaccggcaac atggccgtcc 558781 agcaggtgcg cgccggcctg aaggccatct acctgtcggg ctggcaggtc gccggcgatg 558841 ccaacctgtc cgggcacacc taccccgacc agagcctgta tcccgccaac tcggtgccgc 558901 aggtggtccg ccggatcaac aacgcactgc agcgcgccga ccagatcgcc aagatcgagg 558961 gcgatacttc ggtggagaac tggctggcgc cgattgtcgc cgacggcgag gccggctttg 559021 gcggcgcgct caacgtctac gagctgcaga aagccctgat cgccgcgggc gttgcgggtt 559081 cgcactggga ggaccagttg gcctctgaga agaagtgcgg ccacctgggc ggcaaggtgt 559141 tgatcccgac ccagcagcac atccgcactt tgacgtctgc tcggctcgcg gccgatgtgg 559201 ctgatgttcc cacggtggtg atcgcccgta ccgacgccga ggcggccacg ctgatcacct 559261 ccgacgtcga cgagcgcgac cagccgttca tcaccggcga gcgcacccgg gaaggcttct 559321 accgcaccaa gaacggcatc gagccttgca tcgctcgggc gaaggcctac gccccgttcg 559381 ccgacttgat ctggatggag accggtaccc cggacctcga ggccgcccgg cagttctccg 559441 aggcggtcaa ggcggagtac ccggaccaga tgctggccta caactgctcg ccatcgttca 559501 actggaaaaa gcacctcgac gacgccacca tcgccaagtt ccagaaggag ctggcagcca 559561 tgggcttcaa gttccagttc atcacgctgg ccggcttcca tgcgctgaac tactcgatgt 559621 tcgatctggc ctacggctac gcccagaacc agatgagcgc gtatgtcgaa ctgcaggaac 559681 gcgagttcgc cgccgaagaa cggggctaca ccgcgaccaa gcaccagcgc gaggtcggcg 559741 ccggctactt cgaccggatt gccaccaccg tggacccgaa ttcgtcgacc accgcgttga 559801 ccggttccac cgaagagggc cagttccact agtctgccga gcagacgcaa aagcaccctt 559861 ttgcggcgca aaagtggcgc ttttgcgtct gctcgcgcat ttgaggagga acagtgagcg 559921 atgcgatcca gcgggtaggg gttgtcgggg ccgggcagat ggggtccggc atcgccgagg 559981 tctcggctcg cgccggcgtc gaagtgacgg tgttcgagcc ggccgaggcg ttgatcaccg 560041 cgggacgcaa ccgcatcgtg aagtcgctgg agcgggccgt cagcgccggc aaggtaaccg 560101 agcgcgagcg tgaccgcgcc ctcggcctgt tgaccttcac caccgacctc aacgacctat 560161 ccgataggca actggtgatc gaggccgttg tcgaggacga ggccgtcaag tccgagatct 560221 tcgccgagct cgaccgggtc gtcaccgatc cggacgcggt gctggcgtcg aatacctcca 560281 gcatcccgat catgaaggtc gccgcggcca ccaagcagcc gcaacgggtt cttggcctgc 560341 atttcttcaa tccggtcccg gtgctgccgc tggtcgagtt ggtgcgcacg ctggtcaccg 560401 acgaagccgc cgccgcgcgc acggaggagt ttgccagtac tgtgctgggc aaacaggtcg 560461 tgcgttgctc cgaccgctcc ggattcgtgg tcaatgcgct cctggtgccg tatttgctgt 560521 cggcgattcg gatggtcgag gccgggtttg ccaccgtcga agatgtcgac aaggccgttg 560581 ttgcggggtt atcgcacccg atgggtccgc tgcggctttc cgatcttgtc ggcctagaca 560641 ccctcaagct gatcgcggac aagatgttcg aagaattcaa agaaccgcac tacgggcccc 560701 ctccgctgtt gctgcgtatg gttgaggcgg gccagttggg aaagaaatcg ggtcgaggtt 560761 tctacacgta ctgaagtgta tgaacggccc ccaggcttga cgcaaggcga gatcacagac 560821 cgagacggtg tggttacgat cgtgtgacag ccgttgcgta catcgggtag tatttccgcg 560881 atcaacagat gagaggttcg gccggcatga ctgagttaag gcccttttac gaagagtcgc 560941 aatcgattta cgacgtttcc gacgagtttt tctcactgtt tctagacccc acgatggctt 561001 acacctgcgc gtacttcgag cgtgaggaca tgactctcga agaagcgcaa aacgcgaagt 561061 tcgatttggc gctggacaag ttgcatcttg agcccgggat gacgctgctc gatattggct 561121 gcggctgggg tggtgggctg caacgagcga tcgagaacta cgatgtgaac gtcatcggta 561181 tcacgctcag tcgcaatcag ttcgagtaca gcaaagcgaa attggcgaaa attcccaccg 561241 aacgcagcgt ccaggtgcgg ctgcagggct gggatgagtt cacggacaag gtcgaccgta 561301 ttgtcagcat cggtgccttc gaagcattca aaatggagcg ttatgcggca ttctttgagc 561361 gttcctacga catacttcca gatgacggcc ggatgctgct gcacacaatt ctgacctata 561421 cgcagaagca gatgcatgag atgggcgtca aggtgacgat gagcgatgtg cggtttatga 561481 aattcatcgg cgaagaaatt tttccgggcg gacagttacc ggcgcaggaa gacatcttca 561541 aatttgcgca ggcggcggac ttttcggtgg agaaggtgca attgctgcag cagcattacg 561601 ctcggacgct aaacatctgg gcggcgaatc tggaggctaa caaggaccgc gccattgctc 561661 ttcagtccga ggagatttac aacaaataca tgcactatct gaccggatgt gagcacttct 561721 tccgcaaggg catcagcaac gtgggacagt tcacactgac caagtagccc atcgccgccc 561781 gagcacccca ggggttgcgg agctcacgcc gggtgtggct tgacgcccgg gcaccggccg 561841 gtgggtagcc agcgcgcttt gtccggttac ttttcgagtg tgaactggtc gacgtcggtg 561901 taaccctggc ggaacagctt cgcgcagccg gtcaggtact tcatgtagcg gtcgtagaca 561961 gtctgcgact ggatcgcgat ggcctgatct ttgttggcct cgagcgctgt ggcccacatg 562021 tccagcgtcc tggcgtagtg cagctgcaat gactggaccg cggtgacccg gaagccgacc 562081 ttctcggcgt actcgtgcac cgtcgggatg gacggcagcc agccaccggg gaagatctcg 562141 gccaggatga atttggtgaa gtgaaccagt tcgtgggtca acgtcaggcc cttttccctg 562201 ccttctttga aggtggggcg cacgatggtg tgcagcaaca tcttgccgtc ggccggcaac 562261 gtgcggtggg tcacctcgaa gaaatggtgg tagcgctggt ggccgaagtg ctcgaatgcg 562321 ccgatcgaga cgatgcggtc gacgggctcg tcaaatttct cccatccctc cagcaacact 562381 cgtctggagc ggggggtgtc catttggtcg aacattttct ggacatgacc ggcctggttc 562441 tccgacaacg tcaggcccac gacattgacg tcgtatttct cgatggcgcg ccgcatggtc 562501 gcgccccagc cgcagccgat gtccagcaac gtcatcccgg gttcgaggtt cagcttgccc 562561 agggccaggt cgatcttggc gatctgggcc tcctgcagcg tcatgtcgtc gcgttcgaag 562621 taggcacagc tgtaggtctg ggtggggtcc aagaacagcc ggaaaaagtc gtcggagagg 562681 tcgtaatgag cttgcacgtt tccaaaatgc ggcgtgagct gcacggacat accgattgag 562741 cctttctgtg ttccgaggcc cgcatccgct tgcctcgacg cacccctgat ctatccccga 562801 tgcatccctt gcatgctagc tgctgaaagg cggcccagtc gcaatcggcg ccatgaccag 562861 ctgtcgcagc cgtcagcgaa aatcaccagg cgcgccgcca ggcaccgatc gccaggccca 562921 caaccagcag cgcaccggcc tgacgcacgt gcagccaggc caacgcggca taccacagcg 562981 gccacaccgg aaacgccggt ggcggctgct ggggccgccg tcgcaggaaa tagggccaca 563041 ctttcgccag ccggggcagc gccccggcga ccagcaacgc ggccagggca tggcagccag 563101 catgacgttt acggcgataa gtaggtagaa gccgaccatc atggccagtg tcacggtgcg 563161 cgcgcacgtt tcgccgagta gcaccggcag cgttcggata cccagcggtt cgtcgtaacc 563221 gatcttgtcg atgtgcttac ccatcagcac cgtggtgcac aacagcccgt aggggagcga 563281 cgccagcacg acctcccaac cgcccgcgcc caccgcggcg tagtaggttc cagcgcacgc 563341 tcgggtgagc cgcagctggt ggttcgtcga ggcgtggtat atgcggcacg gttggccccg 563401 gtagcggccg ggtgctgggc atagcgggcg cgcgcgtagg tggcgctgtc agtaccgaca 563461 tcggtgtcgt agagatcgtt cataaggttg ttggcgatgt gcggcgcatg tgattcccac 563521 cacaggacga gccagcgcca atccaagcca ggctcgccga tcgccaacag ccccgcgacc 563581 aggccggaga ccagggtcat cggcagcact gcggcccggg tgacgacgag ccaccgggtg 563641 accgtgtcgg tcggcccgtc agctggcggg ttggtggtgc gaagtgcgta ggcccacgat 563701 ctgagccggg agcccgcgcc cgcgtcgggc atccctaaag cctagacctg cccccaggca 563761 ggcacgatcg gcgaaggatg cggctgctcg cgaaacttct ccaacgatcc gccggcctcg 563821 acgatgccgc acagtgcgct ccagctcagc atcgtcaggt agtcgatcag ttcgtcactg 563881 ctcatgcgcg ggtctgacat ccaggagtgg gtggccagct gcacgccgcc cacgatcaga 563941 tatgcccacg gctcgactcc gccggtgtcc atcccggctt cttgcatgcg gcggcgcagc 564001 atcaccgcga gcatgcgggc aatgattcgc tccgagtcgg caatcacttt gcttttgctg 564061 gccgagctat tcgccatcac gaaccgatac ggctccggtt gggccgccac ggtctcgaca 564121 tagacccgga tgatttcgcg ggtcagttcg aaaccatcca tatcggccga cagcgcagcg 564181 atcatgttgg ggatcaaggt ggtctgcgtg aaccgcatca tcacggcggt cgtcaggtcg 564241 tttttgtcga cgaagtagcg gtagagcacg gtcttggaga ccccgatctc ggccgctatc 564301 tcgtccatgc tgaggaagcg gccatgccgg cgaatcgcct caatcgtgcc gtccaccagc 564361 tcattgcggc gctccacctt gtgctggtgc cagcgtcgct tgcgaccatc cgtcttcacg 564421 gtcacggccg ggatacgctc tgccactgtt gccaattccc attcactaga cgctcccgat 564481 actacggcca attgggggtc ctgctggcac attggacgcg cgcgcggggt gcgcaggaca 564541 gtgtcgtcac attaactggt gccggtgata gcggatgatg gtgtggtggc acatagagcc 564601 gaggtgtcgg gctcgccgcc gccacggctg aatttgagca cccagccgac ggtggcgcgg 564661 cgtgtccgcg cctccttcgc ggaatccttc gccgcagccg atccggaggc ggatgccgcc 564721 cggcggatgg cgctgcgtcg gatgaaagtg gtggcagtgg ggtttttggt aggcgccacc 564781 ggcgtgttcc tcgcttgtcg ctgggcacag gccgatggcg ctgaccacgc gtggctgggt 564841 tatctgggcg ctgcggcgga agccggtatg gtcggcgcct tggcggactg gttcgcggtg 564901 accgcgctgt tcaagcatcc gctaggcatt ccgatcccgc atacggcgat catcaagcgc 564961 aagaaggatc agctgggcga gggcctgggc accttcgtgc gggagaattt cctgtcgccg 565021 ccggtcgtgg agaccaagct gcgtgatgcg cagataccga gtcggcttgg caagtggttg 565081 tcagaggcca cgcatgccca gcgggtggcg gccgagaccg caacggtgct gcgggtgctg 565141 gtggagctgc tgcgtgacga ggacatccag caggtgatcg accggatgat tgtgcgtcgt 565201 atcgccgaac cgcagtgggg tccgccggcg ggccgggtgc tggcgacgtt gctggccgag 565261 aatcggcagg aagcctttat ccaattgttg gccgatcggg cgttccagtg gtcgctcaac 565321 gccggggtgg tgatccagcg ggtggtggag cgtgactcgc cgagttggtc gccccgattc 565381 atcgaccacc tggttggcga ccgtatccac cgtgagttga tggaatttac cgacaaggtg 565441 cgccgcaacc ccgatcacga gttgcgccgt tcggctaccc gcttcttgtt cgatttcgct 565501 gacgacctgc aacacgatcc ggccactgtc gcgcgcgccg acgcgatcaa agaggagcta 565561 atggcgcgcg atgagatcgc cactgcggcc gcggcggcgt ggaagacact gaagcggttg 565621 gtgctcgagg gtgttgacga cccgtccagt gcgttgcgca cccgcatcac cgatgcggtc 565681 atccggatcg gcgaatcgct tcgtgacgat gccgacctgc gtgacaaggt agacagttgg 565741 acggtgcggg cggcccaaca tctggtctcg gagtacgggg tggagatcac cgcgatcatc 565801 accgagacga tcgagcgctg ggacgccgag gaagccagcc ggcgaatcga actgcacgtc 565861 ggccgagacc tgcagttcat tcggatcaac ggaacagtgg tcggggcgat ggcagggttg 565921 gcgatctatg cgatcgcgca actgttgttc tgacgggtgc taacaaacgc ttgcaatagc 565981 aagcacttgg acgtactctg gtggccgttg caccgatcac cccgagctag gagtagccaa 566041 tgtcgtcgga ggagaagctg gccgccaagg tgtccaccaa ggcctccgat gtggcttccg 566101 acatcggcag cttcatcagg tcgcaacgtg agacggcgca cgtctcgatg cggcagctcg 566161 ccgagcggtc cggagtcagc aatccgtacc tgagccaggt tgagcgcgga ttgcgtaagc 566221 cgtccgccga cgtgttgagc cagatcgcaa aggcgctgcg ggtctcggcc gaagtccttt 566281 atgtgcgcgc cgggattctc gagcccagcg agaccagtca ggtgcgtgac gccatcatca 566341 ccgatacggc gatcaccgag cgtcagaagc agattctgct cgatatctac gcgtcattta 566401 cccaccagaa cgaagccacc cgtgaggagt gtccgagcga tccgacaccg accgatgact 566461 agccgttggc cggctgtttt gcgcaccggc tggcgggtaa tcaaacctga aggacagtca 566521 tctgggtgag gtcgaccgca ggctgatcca gccgatcggc cgcgctggcc aacagcgact 566581 ccgtcgatga cgtgcagcaa aggagacatg tagtgaccgg atcagctggg cctgacatct 566641 acgaactcga ccgacaaccg acccgacgat caggaggttt ccccggcaag tcgcgtgcca 566701 tgtcaatccg cgggtcttga ctagtcctcc ctggaggagc cgacgcttgc cccaacgtcc 566761 agaccaaaga tgtaagaacg ccgatatcag aaaatagtta atgaaaggaa tacccatggc 566821 tgaaaactcg aacattgatg acatcaaggc tccgttgctt gccgcgcttg gagcggccga 566881 cctggccttg gccactgtca acgagttgat cacgaacctg cgtgagcgtg cggaggagac 566941 tcgtacggac acccgcagcc gggtcgagga gagccgtgct cgcctgacca agctgcagga 567001 agatctgccc gagcagctca ccgagctgcg tgagaagttc accgccgagg agctgcgtaa 567061 ggccgccgag ggctacctcg aggccgcgac tagccggtac aacgagctgg tcgagcgcgg 567121 tgaggccgct ctagagcggc tgcgcagcca gcagagcttc gaggaagtgt cggcgcgcgc 567181 cgaaggctac gtggaccagg cggtggagtt gacccaggag gcgttgggta cggtcgcatc 567241 gcagacccgc gcggtcggtg agcgtgccgc caagctggtc ggcatcgagc tgcctaagaa 567301 ggctgctccg gccaagaagg ccgctccggc caagaaggcc gctccggcca agaaggcggc 567361 ggccaagaag gcgcccgcga agaaggcggc ggccaagaag gtcacccaga agtagtcggg 567421 ctccgaatca ccatcgactc cgagtcgccc acggggcgac tcggagtcga cgtgttggat 567481 gcaaaccgca tagtctgaat gcgtgagcca cctcgtgggt accgtcatgc tggtattgct 567541 ggtcgccgtc ttggtgacag cggtgtacgc gtttgtgcat gctgcgttgc agcggcccga 567601 tgcctatacc gccgccgaca agctgaccaa gccggtgtgg ttggtgatcc tgggcgcggc 567661 cgtggcgttg gcctccatcc tgtatcccgt tttgggtgtg ctcgggatgg cgatgtccgc 567721 ctgtgcgtcc ggcgtgtatc tggtcgacgt gcggcccaag cttctcgaga ttcagggcaa 567781 gtcgcgctaa cggaatgaaa gccctggtgg ccgtgtcggc ggtggccgtc gtcgcactgc 567841 tcggtgtatc ttccgcccaa gctgatcccg aggcggatcc cggcgcaggt gaggccaact 567901 atggtggccc cccaagttcc ccacgtcttg tcgatcacac cgaatgggcg cagtggggaa 567961 gtctgcccag cctccgggtc tacccgtccc aagttgggcg tacagcctcc cgccgcctcg 568021 ggatggccgc tgccgacgcg gcctgggccg aggttctcgc gctgtcaccg gaggccgaca 568081 ctgccggcat gcgcgcgcag ttcatctgcc actggcagta cgccgaaatc agacaacccg 568141 gcaaacccag ctggaacctc gagccgtggc ggccggtcgt cgacgactcg gagatgttgg 568201 cttccggctg caatccgggc agccctgaag agtcgtttta gtgctcggcc aaccgactcg 568261 ggcgcagttg gccgcgctgg tagaccacac cctgctcaag cctgagacca cccgtgccga 568321 tgtggccgcg ctggtcgccg aagccgccga actcggcgtc tacgcggtct gcgtgtcgcc 568381 gtcgatggtg ccagttgcgg tccaagccgg tggtgtgcgg gttgcggcgg tgacgggctt 568441 cccgtcgggc aagcacgtgt cctcggtcaa ggcgcatgag gcggctgcgg ccctggcatc 568501 cggcgccagt gagatcgaca tggtcatcga catcggggct gcgctgtgcg gtgacatcga 568561 cgcagtgcgc tccgacatcg aggcggtgcg tgccgctgcg gccggggctg tgctcaaggt 568621 gatcgtggag tcggcggtgc tgttgggaca gtcaaacgcg cacacgttgg tggatgcgtg 568681 tcgtgccgcc gaggatgccg gtgccgactt cgtcaaaacc tcgactgggt gtcatccggc 568741 cggcggggcc acggtgcgtg ccgtcgagct gatggccgag acggtcggcc ctcggctagg 568801 ggtcaaagcc agcggtggga tccgcaccgc cgccgacgcg gtcgcgatgc tcaacgccgg 568861 tgccaccagg ttgggcctgt ccggcacccg ggcggtgctc gatgggctca gctgacagct 568921 gagcgcgcgg gtggcggcgt caaatgtgcg agaagcaggg attctggatg ccggtgggga 568981 tagccgcgtc gcgagttgag aaccggctca ccacgccggt cgaggtgact tgcacgctgt 569041 ccgcgtgaat ccccaacggg tagttcttgg tcaggctgga ggtgaactcg ttcagcgtcg 569101 actgaacggt ttctttcggc agcgagaacc cgagcgtgtt gaaattgatg atctgcagct 569161 ccaatccttt gccagccact atcggcttgg ctgtgatgtt gttcagcagg cccttcagtt 569221 cgacggtgcc gtctgcgggg tgagtgacca cgctgctggt gacgaaagcg cccaggatcg 569281 gaatcgcgtt ttgcaccgat tccttgatgc cttccgacga ccaggtaatg gtggcgtcca 569341 gggcgccgat cgtgccccta gagttggggg tgttcttgag ccggacgttc tggatcgtga 569401 gctttatctg catgcccttg gcatcgcgga tctgattgcc cgcggtttcc accgagatat 569461 tggtgaagtg ccgcgtagcg acctgccaca gcagcagcgg cgccacaccg aaggatgcgg 569521 tggcttggtc tttgaccacg catgcgaccg cttgggcgac cttgctattg gcaacatggc 569581 gagcgtatag ctcgcctccg atcagcccgg cgaggacgag cgaaaacacg atgatcagga 569641 caagaaagac ggttagcggg tcgcggcggg cacgtcgttt ggtcttcacc gccgctggct 569701 cttcctcttg cgcagccagc aggcccgttg ggtcccacgc ctggtgcgca gcttggcggc 569761 cggatcgccg tgtgtgggca tgcgacgcag ccagatgctc agtttgcggc tgctgctccg 569821 gttgggtggg cggactcacc ggttcttgga tatgacccgc gggctcgccg gggcgcagtc 569881 gaccggtgga tgcttcggac gatgccgggg gtcgggcgag cggaccttga tccccagggc 569941 gcgcccaggg cgacgggtcg ttcggtggac cttgcgggtt ggtcacccac gcgattgtgc 570001 cttatcgatc tgaacgaagt ctgtctggtt gcgtagcacc gcaatgcggt cgcgagccgc 570061 ggccacattg tcgacatcga tgtcggcgac cagcagttgc ggctgggtgc cagctgacac 570121 caccacctcg cctagcggcg aggccaccag gctgccgcct accccggtcg gtgcagccga 570181 gctcgccccc acgccggtgc gggcatcacc tgggtctgct tggccggccg cggcgacgta 570241 actcatggag tctagcgccc gggcgcgggc cagcaacgtc cactgttcga gtttgcccgg 570301 accggaaccc caggatgcac agaccgcgat cagttgggcc ccgcgccgcg ccagctcggt 570361 ataaagggcg ggaaagcgaa tgtcgtagca aacggtcaaa cccacccgca cgccgtcgac 570421 cacgactacc accggttcgc gcccgggtgc gacggtacgt gactcggtga agccgaacgc 570481 gtcatagagg tggatcttgt ggtagtgcgc gtccggctga ttgggcgtgc ccgggccggc 570541 tgcgatcagc gtgtttgtta cccgcccgtc gccggtcggg gtgaacatgc cggcgatcac 570601 ggtgatgccc gcctcggtcg cgatccgtcg gactccgttt gcccagggtc cgtcgacggg 570661 ctcggcgacc tgccgcagcg ggacaccgag ccggcacatg gtcgcctcag gaaacaccac 570721 cagctgtgcg cccgcggtgg cggcttcgcc ggcgtacttg ccgaccagtt gcagattggc 570781 ggcggggtcg gtaccgctgc ggatttgcgc caacgcgatt cgcatgcgcg ccagcctagg 570841 cccggcgacg agcgcgccgc accggcgcgc gcaggagccg ggcaatccag cttgcgcccg 570901 gcgacgagcg cgccgcaccg gcgcgcgcag gagccgggca agctggcacc tcagacgttg 570961 ttcgtgatcc acagcgtggt gaagcgctgt tcgatggtca ctagctggct taattgggtg 571021 ccgataagcc tctccagctt cccgccaatg aacgggatac gcacctggat ggtgacctgc 571081 agcgtcattc gggagccacc cgactccggt atgggcgaga gcacggcggt gccccacaag 571141 ttcaccggag cgtccacgat cgatcccgca atggacgcgg tcgcgatgcc ttccttgacc 571201 gggccccagg tctcctcgcg ccgtaccgaa agatcgcccc ggtgcaactg tgtgaccagg 571261 ccgggcagat tgtgactgcg caccatctgc agggtgacga cttcgatggt gccgtcgtct 571321 ccggagtcgc cacctacgcg tatcgactca agggtcgcga cgtcgaccgg cgtttcggcc 571381 agtctggctt tccagtagtc cgcctcgtag aaagcccgat gaacctcctc gacgctgccc 571441 tcgtagtcgg ccgacatgtc gaatgaacgc ggcatagcag gtcaggctac ccttacgggc 571501 catgaaacgg agcggtgtcg gttcgctctt tgccggtgcg catattgccg aggcggtccc 571561 gttggcgccg ctgaccactt tgcgtgtggg cccgatcgcc cgacgtgtca tcacttgcac 571621 cagcgccgaa caggtggtgg ctgcgctgcg gcacctggat tcggcggcca agaccggagc 571681 tgaccgcccg ctggtgtttg ctggtggctc caatttggtg atcgccgaga acctgaccga 571741 cctgaccgtg gtgcggttgg ccaatagcgg catcaccatc gacggtaact tggtgcgggc 571801 cgaggccggt gcggtcttcg atgacgtggt ggttagggcc atcgaacagg gtctgggcgg 571861 actggaatgc ctgtctggca tcccaggatc ggccggggcg acacccgtgc agaacgtggg 571921 ggcgtatggc gcggaggtgt ctgacaccat cactcgggtt cggcttttgg atcggtgcac 571981 gggtgaggtg cgttgggtat ccgcgcgcga cctgcgcttc ggctatcgca cgagcgtgct 572041 caaacacgct gatgggcttg cggtgcccac cgtggtcttg gaggtggagt ttgcgctgga 572101 tccgtcgggc cgcagcgcac cgctgcgcta cggcgagctg atcgccgcgc tgaatgcgac 572161 cagcggcgag cgcgccgacc cgcaagcggt ccgcgaagcg gtgctggccc tgcgggcacg 572221 caagggcatg gtgctggacc cgaccgacca tgacacctgg agcgtgggat cgttcttcac 572281 aaacccggtg gtcacccagg atgtttacga acggctggcc ggtgacgcgg ccaccagaaa 572341 ggacggtccg gtcccgcact atcccgcgcc cgacggcgtc aagctggccg ccggctggct 572401 ggtggaacgg gccggcttcg gcaagggcta tccggatgcc ggcgccgccc catgccggct 572461 ttccaccaaa catgcgctgg cgctgacaaa tcgtggcggg gccaccgccg aagatgtggt 572521 gacgctggcg cgcgccgtgc gcgatggggt ccatgatgtg tttggtatca cactaaaacc 572581 cgaacccgtg ctgatcggct gcatgttgta gctgcgtttt cgcggcgggg cggcgtggcg 572641 cgcattgctt agggctggtt gccaggcgtt ctgtggtcat tcgtgtgctg tttcgcccgg 572701 tatctttgat acccgtgaat aactccagca ccccccagag tcaggggccg atcagtcggc 572761 gtctggcgtt gacggccctt gggtttgggg tgttggcacc gaacgttctg gtcgcgtgcg 572821 ccggcaaagt gaccaagctg gccgagaaga ggccgccacc ggcgcctcgt ctgactttcc 572881 ggcctgccga ctctgccgcc gacgtggtgc cgatcgcgcc gatcagcgtc gaggtcggtg 572941 acggctggtt tcagcgggtc gcgctgacca attcggcagg caaggtcgtc gccggggcat 573001 acagccggga tcgcaccatc tacacgatca ccgagccgct gggctacgac acgacctaca 573061 cctggagcgg ttcggccgtc ggccatgacg gcaaggcggt tccggtggcg ggcaagttca 573121 ccaccgtggc acccgtcaag acgatcaacg cgggattcca gctcgccgac ggccagaccg 573181 tcgggatcgc ggcgccggtg attattcagt tcgattcacc gatcagcgac aaggccgccg 573241 tcgagcgggc actaaccgtg accaccgacc cgcctgtcga gggcggctgg gcctggctgc 573301 ccgacgaggc gcagggcgct cgcgtgcact ggcgtcctcg ggagtactac ccggcgggta 573361 ccaccgtcga cgtcgacgcc aagctgtatg ggctgccgtt cggcgacggc gcgtacggcg 573421 cgcaggatat gtcgttgcac ttccagatcg gtcgtcgtca ggtggtcaag gccgaagtct 573481 cgtcgcaccg catccaagtc gtcaccgatg ccggcgtcat catggacttc ccgtgcagct 573541 acggcgaggc cgacttggcg cgcaacgtca cccgcaacgg catccacgtc gtcaccgaga 573601 aatactcgga cttctacatg tccaacccgg ccgccggtta cagccatatc cacgaacgtt 573661 gggcggtgcg gatttccaac aacggcgagt tcatccatgc caaccctatg agcgccggtg 573721 cccagggcaa cagcaatgtc accaacggct gtatcaacct gtcgacggag aacgccgaac 573781 agtactaccg cagcgcggtc tacggtgacc cggttgaggt gaccggcagt tcgatccagc 573841 tgtcctacgc cgacggtgac atctgggact gggcggtgga ctgggacacc tgggtgtcga 573901 tgtcggcgct accgccaccg gcggccaaac cggcggcgac gcaaatcccg gtcaccgccc 573961 cggtcacgcc gtcggatgcc cccaccccgt ccggcacacc cacgactact aacggaccgg 574021 gtgggtagcg cgacggctag ctgatgcctg gtcgcggggc cggatgacga tctggtcaag 574081 gttgacgtgt gagggccggg tggccacgaa tccgatcacc tcggcgacgt cggcggctac 574141 tagcggtgtc atgccggcat aaaccgcgtc cgcgcgttgc tggtcgccgt cgaagcggac 574201 cagcgaaaat tcggtctcga ccgcgcctgg agcgatctcc gtgagccgga ccggcttccc 574261 cagcagttcg ccgcgcagcg tgcgatgcag cgcgccctgc gcgtgcttgg cagcggtgta 574321 gccggcgccg ccgtcgtaca cctcgagcgc ggcgatcgag gtgacggtga cgatcaggcc 574381 gtcgccggag tcgatcagct tgggcagcag cgcgcgggtt acccgcagcg tgcccagtac 574441 gttggtgtcc cacatccatc gccagtgctc caaatcggca tcggcgacga actgaagccc 574501 cttggcgcca ccggcgttgt tgaccagcac gtccacccgg ctcagcgcgc gggccaacgc 574561 ttcgacggcg gcgtcgtcag tgacatcggc cacaattgcg gttccgccga tctggttggc 574621 cagcgcggtg atccggtccg cccgacgcgc caccgcgacc acgtgaaacc cctgggccgc 574681 aagggttctc gcggttgcct cgccgatacc ggaactggcg ccggtgacca cggcgactcg 574741 cttgcgggtg ccgattgtcg tcatcgggac aactctaata aacgtgctaa attctcggtg 574801 tgtaccacag cgccttgttc cgcacgacga ccgcgtgtct tttcgcgggc gcgtgttgtt 574861 gccgccccct ttgccgcgcc tgaccgatac acgtcagcag gtgtggccaa caggacccgg 574921 ccattggaac tcggagaaga acgcccgtgt actcgactaa ccgcacctca cagtcactca 574981 gccgcaagcc cggccgcaag caccagctgc gatcgcaccg ttacgtcatg ccgccgtcgc 575041 tgcacctgtc cgattccgcg gctgcgtccg tcttccgggc cgtgcgtttg cgtggtccgg 575101 tcggtcggga cgtaattgct ggatctacgt cgctgagcat cgcgacggtg aaccgccagg 575161 tcatcgcact gctggaagcg ggcctcctgc gtgagcgggc ggacctggcg gtttccgggg 575221 ctatcgggcg cccacgcgtg cctgtcgaag taaaccacga gccttttgtc accctgggca 575281 tccacatcgg tgcccggacc accagcatcg tggccaccga cctgttcggc cgcacgctcg 575341 acacggtgga gaccccgacc ccgcgtaacg ctgccggggc cgcgctgacc tcactggccg 575401 acagcgctga ccgatacttg cagcgctggc gccggcgccg tgcgctgtgg gtcggggtga 575461 cgcttggtgg tgcagtcgac agtgccaccg gtcatgtcga ccatccgcgg ctcggttggc 575521 gtcaggctcc ggtcggaccc gtgctggcgg atgccctagg cctgcccgtg tcggtggcgt 575581 cccacgtcga cgccatggcc ggggccgagc tgatgctcgg catgcggcgg ttcgcaccga 575641 gctcgtcgac gagcctctac gtctacgccc gcgaaaccgt aggctatgcg ctgatgatcg 575701 gtgggcgggt gcactgcccg gccagtggtc ccggcaccat cgcgcccctg cccgtccact 575761 ctgaaatgct cggcggtacc gggcagctgg agtccactgt cagcgacgag gcggttttgg 575821 ctgctgcccg ccggctgcgg atcatccccg gcatcgcttc gaggacccgg accggtgggt 575881 ccgctaccgc catcaccgac ttgctgcgag tggcacgagc cggtaatcag caagccaagg 575941 agctgctggc ggagcgggcc cgcgtgctcg gtggggcggt cgcgctgctg cgtgacttac 576001 tcaatcccga cgaagtggtg gtgggtggcc aggcgtttac cgaatatccc gaggcgatgg 576061 agcaggtgga ggcggcgttt acggcagggt cggtgctggc gccgcgtgac atccgcgtga 576121 ccgttttcgg caaccgggtg caggaggccg gggcaggcat cgtgtcccta agcgggctct 576181 atgccgatcc attgggtgcc ttgcggcgat cgggcgcgct ggatgcccgg ctgcaggaca 576241 ccgccccgga ggcgctcgcg tgatcggctg acgagccgcg tccgcgcgtg tcacttcggt 576301 tcctgcaagg atggcaggtg tgcggcacga tgacggttca gggttgatcg cccagcgccg 576361 tccggtccgc ggcgagggtg ccacccgctc gcgcggccca tccgggccat ccaatcggaa 576421 tgtttcggca gcagacgacc cgcgccgggt tgcgctgctg gcggtgcaca cctcaccgct 576481 ggcacagccg ggcaccggtg acgccggcgg catgaacgtc tacatgctgc aaagtgcgct 576541 gcacctggcc cgtcggggca tcgaggtgga gatcttcacc cgggccaccg catcggcaga 576601 tccaccggtg gtgcgggtgg cacccggggt gctggtgcgc aacgtggtgg cggggccctt 576661 cgagggtttg gacaagtacg acctgcccac ccagctttgt gcgttcgccg ccggggtgct 576721 gcgcgccgag gcggtccacg aaccgggtta ctacgacatc gtgcactcgc actactggct 576781 gtcgggtcag gtcggctggc tggcgcgcga ccgctgggcg gtgccgttgg tgcacaccgc 576841 acacacgctg gccgccgtga agaacgcggc actggccgac ggcgacggac ccgagccgcc 576901 gctgcgtacg gtcggggagc agcaggtcgt cgacgaggcg gatcggttga tcgtcaacac 576961 cgacgatgaa gccaggcaag tgatttcgct tcatggtgcc gatccggcac gaatcgacgt 577021 ggtccatccc ggtgtcgatc tggacgtgtt ccgcccgggt gatcggcgcg cggcccgggc 577081 cgcgctagga ctaccagttg acgagcgcgt ggtggccttc gtcggacgca tccagccgct 577141 gaaggcaccc gacattgtgc tgcgtgcggc cgccaagttg cccggggtgc gcatcatcgt 577201 ggccggcgga ccgtcgggca gcggtctggc ttcaccggac ggactggtcc ggctcgccga 577261 cgaactgggc atctctgcac gggtgacgtt tctgccgccg cagtcccaca cggatctggc 577321 caccttgttt cgggcggcgg acctggttgc ggtgccgagc tactccgagt cgttcggcct 577381 ggttgctgtg gaggcccaag cgtgcggcac accggtggtg gccgcggcgg tgggcgggct 577441 gcccgtcgcg gtgcgcgacg ggatcaccgg caccctggtg tccgggcacg aggtcggtca 577501 gtgggccgac gccatcgatc acctgctgcg gttgtgtgcc gggccacggg gacgggtgat 577561 gagccgggcg gcggcacggc acgccgccac gttctcgtgg gagaacacca ccgacgcgct 577621 gttggccagt tatcggcgtg cgatcggcga gtacaacgcc gagcgccagc gccggggcgg 577681 cgaggtgata tcggacctgg tagcggtggg caagccccgc cactggacgc cgcgtcgcgg 577741 ggtgggcgcg tgacttcctc cttgccgacc gtgcaacgtg tgatccagaa tgcgctcgag 577801 gtcagccagc tgaagtactc ccaacacccc cgcccgggcg gggcgccgcc cgcgctgatc 577861 gtcgagctgc cgggcgaacg caagctcaag atcaacacca tcctgagcgt cggcgagcat 577921 tcggtgcgtg tcgaggcgtt cgtgtgtcgc aagcctgacg agaaccgcga agacgtatac 577981 cggttcctgc tgcggcgcaa ccgccgcctg tatggggtcg cgtacacgct ggacaatgtc 578041 ggcgacatct acctggtggg ccagatggcg ctgtccgcag tggacgccga cgaggttgac 578101 cgggtgttgg ggcaggtgtt agaggtggtg gattcggact tcaatgcgtt gttggagttg 578161 ggatttcggt cgtcgattca acgagagtgg cagtggcggt tatctcgcgg tgagtcgctg 578221 cagaacctgc aggccttcgc tcacttacgc ccgacgacga tgcagagcgc gcagcgcgat 578281 gagaaggagt tgggcggtta ggtcgagccc gacgacgatg cagagcgcgc agcgcgatga 578341 gaaggagttg ggcggttagg tcgagcccga cgacgatgca gagcgcgcag cgcgatgaga 578401 aggagttggg cggttaggtc gagcccgacg acgatgcaga gcgcgcagcg cgatgagaag 578461 gagttgggcg gttaggtcga gcccgacgac gatgcagagc gcgcagcgcg atgagaaata 578521 gcactcgtgg aggtcaagac gcccgccggt gatgggctgg tggcgctcac cccgttccgg 578581 actcagaaat tcgcgatcac aatttgcgcg ttcaagtcat tggcatgcat gtgatggttt 578641 agcgttccgc tgtgcctctt caggtgtttg tcggcttcgt tgccatgatg acgctcaagg 578701 tcgcgatcgg cccgcaaaac gcatttgtcc tgcgccaagg aattaggcga gaatacgtgc 578761 tggtcattgt ggcgctgtgc gggatcgctg atggggcact gattgccgcg ggcgttggcg 578821 gcttcgctgc gctgattcac gctcatccca atatgacttt ggttgcccga tttggcggcg 578881 cagcgttctt gattggctac gcgctattgg ccgcgcggaa cgcgtggcgc ccgagcgggc 578941 tggtgccgtc ggaatcgggg ccggctgcgc tgatcggcgt ggtgcaaatg tgcctggtgg 579001 tgacctttct caacccacac gtctatctgg acactgtggt gttgatcggt gccctcgcca 579061 atgaggaatc agatctgcgg tggtttttcg gagccggtgc ctgggccgcc agcgtcgtat 579121 ggttcgccgt gttgggattt agcgcgggcc ggctacagcc attcttcgca actccagctg 579181 cttggcgcat tcttgatgcg ctggttgccg tgacgatgat tggggtcgcc gtcgttgtgc 579241 tcgtcacgtc accaagtgtg ccgacggcca atgtcgcact gatcatttga ccacctcgta 579301 ggccgcccat gtatcggcct tggtgaaccg gccgttacgg tgccgaccac ctcggcggta 579361 tgaacgcgct gcgcagcgga ccgaggagaa ttcgggcatt ttggtccacg atgaggagtg 579421 cgggagtgcg tgagagactt gccggtatgg caaacactgg cagcctggtg ttgctgcgcc 579481 acggcgagag cgactggaat gccctcaacc tgttcaccgg ctgggtcgat gtcggcctga 579541 cggacaaggg ccaggcagag gcggttcgaa gcggcgagct gatcgcggaa cacgacctat 579601 tgcccgacgt gctctacacc tcgttgctgc ggcgcgcgat caccaccgcg catctggcgt 579661 tggacagcgc cgatcggctc tggattcccg tgcggcgtag ctggcggctc aacgaacgcc 579721 actacggcgc gctgcagggt ttggacaagg ccgagaccaa ggcccgctat ggcgaagagc 579781 agttcatggc ctggcggcgc agctatgaca cgccgccgcc gccgatcgag cggggcagtc 579841 agttcagcca ggacgccgac cctcgttacg ccgacatcgg cggtggcccg ctcaccgaat 579901 gtctggctga cgtggtcgcc cggtttttgc catatttcac cgacgtcatc gttggcgact 579961 tgcgggtcgg caagacggtg ctgatcgttg cccacggcaa ctcgttgcgc gcgctggtca 580021 agcacctgga ccagatgtct gacgacgaaa tcgtcggact gaacatcccg accggaattc 580081 cgctgcgcta cgacctggat tccgcgatga ggccgctggt gcgcggtggt acgtatctgg 580141 acccggaggc ggcagccgcc ggcgccgccg cggtggccgg ccagggccgc gggtaattgt 580201 ttgagatccc acctgccggc ggtttcggcg gctgatggtg tgctttggtg cgctgtttgc 580261 caaacagcat gtgaacggta accgaacagc tgtggcgtag tgtgtgactt gtccgatttt 580321 ggccttgccg cgctagggcg acgttcaccg gatttgtagg attttccttg tgactgtgtt 580381 ctcggcgctg ttgctggccg gggttttgtc cgcgctggca ctggccgtcg gtggtgctgt 580441 tggaatgcgg ctgacgtcgc gggtcgtcga acagcgccaa cgggtggcca cggagtggtc 580501 gggaatcacg gtttcgcaga tgttgcaatg cattgtcacg ctgatgccgc tgggcgccgc 580561 ggtggtggac acccatcgcg acgttgtcta cctcaacgaa cgggccaaag agctaggtct 580621 ggtgcgcgac cgccagctcg atgatcaggc ctggcgggcc gcccggcagg cgctgggtgg 580681 tgaagacgtc gagttcgacc tgtcgccgcg caagcggtcg gccacgggtc gatccgggct 580741 atcagtgcat gggcatgccc ggttgctgag cgaggaagac cgccggttcg ccgtggtgtt 580801 cgtgcacgac cagtcggatt atgcgcggat ggaggcggct aggcgtgact tcgtggccaa 580861 cgtcagtcac gagctcaaga cgcccgtcgg tgccatggct ctactcgccg aggcgctgct 580921 ggcgtcggcc gacgactccg aaaccgttcg gcggttcgcc gagaaggtgc tcattgaggc 580981 caaccggctc ggtgacatgg tcgccgagtt gatcgagcta tcccggctac agggcgccga 581041 gcggctaccc aatatgaccg acgtcgacgt cgatacgatt gtgtcggaag cgatttcacg 581101 ccataaggtg gcggccgaca acgccgacat cgaagtccgc accgacgcgc ccagcaatct 581161 gcgggtgctg ggcgaccaaa ctctgctggt taccgcactg gcaaacctgg tttccaatgc 581221 gattgcctat tcgccgcgcg ggtcgctggt gtcgatcagc cgtcgccgtc gcggtgccaa 581281 catcgagatc gccgtcaccg accggggcat cggcatcgcg ccggaagacc aggagcgggt 581341 cttcgaacgg ttcttccggg gggacaaggc gcgctcgcgt gccaccggag gcagcggact 581401 cgggttggcc atcgtcaaac acgtcgcggc taatcacgac ggcaccatcc gcgtgtggag 581461 caaaccggga accgggtcaa cgttcacctt ggctcttccg gcgttgatcg aggcctatca 581521 cgacgacgag cgacccgagc aggcgcgaga gcccgaactg cggtcaaaca ggtcacaacg 581581 agaggaagag ctgagccgat gacctgcgcc gacgacgatg cagagcgtag cgatgaggtg 581641 ggggcaccac ccgcttgcgg gggagagtgg cgctgatgac ctgcgccgac gacgatgcag 581701 agcgtagcga tgaggtgggg gcaccacccg cttgcggggg agagtggcgc tgatgacctg 581761 cgccgacgac gatgcagagc gtagcgatga ggtgggggca ccacccgctt gcgggggaga 581821 gtggcgctga tgacctgcgc cgacgacgat gcagagcgta gcgatgaggt gggggcacca 581881 cccgcttgcg ggggagagtg gcgctgatga cctgcgccga cgacgatgca gagcgtagcg 581941 atgaggagga gtggcgctga tgaccagtgt gttgattgtg gaggacgagg agtcgctggc 582001 cgatccgctg gcgtttctgc tgcgcaagga gggctttgag gccacggtgg tgaccgatgg 582061 tccggcagct ctcgccgagt tcgaccgggc cggcgccgac atcgtcctgc tcgatctgat 582121 gctgcctggg atgtcgggta ccgatgtatg caagcagttg cgcgctcggt ccagcgttcc 582181 ggtgatcatg gtgaccgccc gggatagcga gatcgacaag gtggtcggcc tggagctggg 582241 cgctgacgac tacgtgacca agccctattc ggcacgcgag ttgatcgcac gcatccgcgc 582301 ggtgctgcgc cgtggcggcg acgacgactc ggagatgagc gatggcgtgc tggagtccgg 582361 gccggttcgc atggatgtgg agcgccatgt cgtctcggtg aacggtgaca ccatcacgct 582421 gccgctcaag gagttcgacc tgctggaata cctgatgcgc aacagcgggc gggtgttgac 582481 tcgcggacaa ctgatcgacc gggtctgggg tgcggactac gtgggcgaca ccaagacgct 582541 cgacgtccat gtcaagcggc tgcgctccaa gatcgaagcc gacccggcta acccggttca 582601 cttggtgacg gtgcgcgggc tgggctacaa actcgagggc tagcggacgc cgacaacctt 582661 ggcgactgtc tggtcggcta cggccagtgc catcgccatg atggacagct gcgggttcac 582721 ttccgggcag ctgggcagga tcgaggcgtc ggcaacccac acgccctcga cgccgcgcag 582781 ccggcccgtc gcgtcgaccg gacaaagctg ctcgtcggcg ccggcggccg cggtgcccgt 582841 cggatggaag gcggccaggt gcaggcttct ggggttggct cggcgcagca catcctgcag 582901 ctcgggcagg gaccgcatcg gtggggcgcc ggggataccg gtcagcacct ccaccgcgcc 582961 ggcggcaaag aacagccggc caatggcctg cagcgcgacc cgtagcttgg cgatctcacc 583021 tggagctatg tcatagcgca ccaccgtctc gccgcgcacc gaccgcaccg tgccgacgcc 583081 ccgatcggcc accatcgccc cgaatgttgc gatctgcggc gcccggtcga gccagcggag 583141 cagctcggcc ccgtagccgg ggaagaccat cgaccccatg cccggcggtg tggaggtggc 583201 ctcgatcagc acgccgtcgg attcgtgaaa ctcgtgaacc gccgcgctct gcagcacccc 583261 gcgccacgcg aagacgtcgt cgtcgaagag cccggccagc atagttgccg ggtgcagcgc 583321 aaggttgtgg cccagtcgcg ggtgcccacc aagaccgctg cgccgcaaca gccctggcgt 583381 ctccgtcgca ccggcggcga cgacgaccgc gtcggccagc acgtcgagtg tggtgccgtc 583441 gggccggcgg gctcgcacgc cataggcccg cccggcgcgg tgcaggatcc gttcgacccg 583501 cgcccaggag atgatccgcg cgccggccgc gcaggcttgc ggcagggcgt tgaggtgcac 583561 gccgaacttg gcgttgctgg ggcagccgat cgcgcactgg caacagccac ggcaccccgg 583621 cgcattgcgc gggatgggcg ccgcccgcca gcccagcgac ttggcggcct gcagcaacag 583681 gcgcccgttg cggcccatga tctccagcgg caccggcgca acccgcagtg tttgctccgc 583741 atcgtcaaga cgacgtccca gctggtcggg gtcggccagg ccgagaccga actcgtcacg 583801 ccagcgccgc tgcacggcaa gtgaaggccg aaagcaggtg ccggagttga cgacggtggt 583861 gccgcccacc gcccggccca tcggcagcac caccgccggt cgcccgagcg cgacggtggc 583921 cccggcgcca cggtacaacc cggcataacg gtcgaccggg tgggtgctac ggaactcctc 583981 gaccgtccag cgccgtccct cttcgagcac gaccacgtca aggccggccc gggccagcgt 584041 gcgcgcgacc atcgcgccgc ccgccccgga gccgacgacc accgcatcgg ccctggtgac 584101 ggatgggctg tccgccgaca agatgacggt caactccgcg tcggggcgcg ccgcgtcatg 584161 ttcctgggcg cgggcgagca attcgtgcgc gtaggtgtcg gcgccgttgg ccaacagcac 584221 gatcgccttc aacccctcca cggccgcagc gacttccggg ctcagtgcgg cgatccggtg 584281 cagcacccgt gcccgctcgt ccgggtgcag tcgcggtagc gaccggccgg tggtgaggta 584341 gctggccgcc gccagtgaag ccagcccggc gcgcaccgcg aatcgtgagg tcgccggcag 584401 tcgtgtgacg tagcggtcaa cgcgctgcac gaattgagcc ggcaacgggc cgccgagctc 584461 cggcggcagc agcgcggcgc cgaacgaggc caacggatag gacttagccc gatcggcgag 584521 ccggctcata tccggcgccc gagccggcgg ccgagcttta tgaagaacgg atacgttgcg 584581 aagatggcag cggccatcgc gtgcagcggc cactccgcgc gggccacatc cacgctgaaa 584641 accccgctgt tccacatgaa atcgcggccg ttctgtgcgc gaaatggccg ccacagcatc 584701 ccgagcccgg gtacgttgtg gtacagcccg aacgaagcgc cgaagaagac gcccagggcg 584761 gcggcctcgg cggcatcgcg gcggtccacg ggcaggcgtc gctcgatgag cactccgcag 584821 acaaacagca gcggcgggtc gagcaggaaa ctcatgctgg cgtcccttcc ttgatagccg 584881 gtgccgcggt tccccgcagg ccgacttcgg cgtgtccggt gcccagcacc gaccagtgcc 584941 ggccgccgag ctcgatgtgg atgtcggctt gctcggtgtt ggtgcacacc gccttggccc 585001 cgtcgggatc ggtgtatccc aggcttacgc accgctccgg cggctggtct acccggatta 585061 gcgcctcccg gccgccgatg cgtccttcca gttgccagtg ccgcacgccg agcgttgtcc 585121 gcattcgcag cgacggtaaa ggacttgcgg gccaatcctt tccgtcgatg cggaagcgaa 585181 cgaacgctag cggcgcgagc ctgcgtaggc ccggcttgtg tgataccgcg gtcaccacct 585241 ctaggacgtc gccgtcgccg agatcggcat ggatccatcc ccaccgcttg gcattgccat 585301 gtccgtagat gtgggccaca ccgccgcgcc agctgtcgac gcggtgggtg gtttcgccga 585361 cggccaagga gccagcgaag acggcggtgg gtgcgatcac cacttgggcg ccgggcagca 585421 actcgcgctc ccaggccacg cgaggaaacg tccacagtgg cgccgcggtg tccttccagg 585481 acagctccca tgcgagtgat cgggtacgtc cggtcagctc cgctggcgcc attcgtacac 585541 cggcgatgtc gaaccaggcg gggccggccg cgggttgggc gggctggggg ccgaagcgct 585601 cggtgcccgg cggggcatcc ggtggaaacc aggtcaccca gccgtgcgcg tagggcccgc 585661 ctgtcgtcgg ggccaccgtc tcacagtgca cccataggcc ggtacgcgtc agtggatccg 585721 acagagtcgc ataccagact tccaggcgcc cggctgcacc gcgccaccgc ggcaaggccg 585781 ccgaccgcgt ttcatcgtcc actgcggcac ctcctgctgg ctgagttgtc gattcgccca 585841 ctatattggt tgagccaatg aaccagtcaa gtgtctttca gccgccggat cggcagcggg 585901 tggatgagcg gatcgcgacg acgatcgccg acgccatcct cgacggcgtc ttcccgccgg 585961 gctcgaccct gccgcccgag cgagacctgg cagagcggct cggtgtcaac cgcacctcgc 586021 tacgccaggg tctggcgcga ctgcaacaga tgggcctgat cgaggtgcgg cacggcagcg 586081 gcagtgtggt ccgtgacccc gaggggctca cccatcccgc ggtggtcgag gcgctggtgc 586141 gcaaactggg ccccgacttc ctcgtcgagt tgctggagat ccgcgcggcg ttaggcccgt 586201 tgattggccg cctggcggcc gcccggagca cgcccgagga tgccgaggcg ttgtgtgcgg 586261 cgctggaagt ggtgcaacag gcggacacgg ccgcggcgcg gcaggcagcc gatcttgcct 586321 acttccgggt gctcatccac agcactcgca accgcgcatt ggggttgctc taccgctggg 586381 tggagcacgc cttcggcggc cgcgagcatg cgctcaccgg ggcctacgac gacgcggacc 586441 cagtgttgac cgacctgcgg gcgatcaacg gggcggtgct ggccggtgac ccggcggccg 586501 ctgccgcgac cgtcgaggcg tatctgaacg ccagtgcgct gcgcatggtc aagtcctacc 586561 gcgaccgcgc ttagctactg ggccgcacgc gtcgccggat gtacggcgat gagccctaat 586621 tgactgcggc gcttgcacat tgctgcgagt tccccatagg ccttctcccc gagtaattcg 586681 gtgagttcgt cggcaaggct ctgccacacc tgcttggttc cgacatgggc ggccggatcg 586741 ccggtgcaat accagtgcag gtcagcacca cccgagcccc aaccgcggcg gtcatattcg 586801 gtgagggtgg tcttgaggat ttcggtgccg tcggggcggg tcacccattc ctggctgcgc 586861 cggatcggca actgccagca gacatcgggt ttcatcgtca acggcggcac gcccagcttg 586921 agggctttgc tgtgcagcgc gcagccggcg ccaccggcga acccgggccg gttcaagaag 586981 atacacgcgc ccttgtgttt gcgggtgcgg tgctggggtt ggccgtcgtg ctcgtcgagt 587041 tccaggtagc ccttgcggcg caggcccttt gcccggaact gccagtcgtc gtcggtcagc 587101 ttgtgcaccg cgtcggccaa ccgggtgcgg tcgtcgtcgt cggacaggaa cgcaccgtgc 587161 gaacaacagc cgtcgtttgg ccggcccgcg acggtgccct ggcaggcggg tgtgccgaat 587221 acacacgccc agcgcgacag caaccaggta aggtcggccg cgatcaggtg ctcgggattg 587281 tccgggtcgt agaactccac ccactcacgg gcgaagtcca actcgacttc ttgccccggg 587341 tgcaccggtc tccgtcgcga atttgccacg gattcaacgt tagaccacga agcccgccgc 587401 gggattccgc catagcccag cacggccggc acatgccacc gggcgccttg cgcgggtcgc 587461 cacacgcccg tatcttcgcc cggctagttt gttttcgtgc gattgggcgt gctggacgtg 587521 ggtagcaaca cggtccatct gctggtggtc gatgcccacc gcggcggcca cccgaccccg 587581 atgagctcga cgaaggccac gctgcggctg gccgaggcca ccgacagctc gggcaagatc 587641 accaagcgcg gagccgacaa gctgatttcc accatcgacg aattcgccaa gattgccatc 587701 agctcgggct gtgccgagct gatggccttc gccacgtcgg cggtccgcga cgccgagaat 587761 tccgaggacg tcctgtcccg ggtgcgcaaa gagaccggtg tcgagttgca ggcgctgcgt 587821 ggggaggacg agtcacggct gaccttcctg gccgtgcgac gatggtacgg gtggagcgct 587881 gggcgcatcc tcaacctcga catcggcggc ggctcgctgg aagtgtccag tggcgtggac 587941 gaggagcccg agattgcgtt atcgctgccc ctgggcgccg gacggttgac ccgagagtgg 588001 ctgcccgacg atccgccggg ccggcgccgg gtggcgatgc tgcgagactg gctggatgcc 588061 gagctggccg agcccagtgt gaccgtcctg gaagccggca gccccgacct ggcggtcgca 588121 acgtcgaaga cgtttcgctc gttggcgcga ctaaccggtg cggccccatc catggccggg 588181 ccgcgggtga agaggaccct aacggcaaat ggtctgcggc aactcatcgc gtttatctct 588241 aggatgacgg cggttgaccg tgcagaactg gaaggggtaa gcgccgaccg agcgccgcag 588301 attgtggccg gcgccctggt ggcagaggcg agcatgcgag cactgtcgat agaagcggtg 588361 gaaatctgcc cgtgggcgct gcgggaaggt ctcatcttgc gcaaactcga cagcgaagcc 588421 gacggaaccg ccctcatcga gtcttcgtct gtgcacactt cggtgcgtgc cgtcggaggt 588481 cagccagctg atcggaacgc ggccaaccga tcgagaggca gcaaaccatg acgggaccac 588541 accccgaaac agagagctcc ggtaaccggc agatctcggt ggccgagttg ctggccaggc 588601 aaggggtcac cggcgccccg gcccgacggc gccggcggcg acgcggcgat agtgacgcca 588661 tcacggtcgc cgagctgacc ggtgagattc cgatcattcg tgacgaccat caccacgccg 588721 gcccggacgc gcacgcgagc cagtctccgg cggctaacgg gcgagtccag gttggcgaag 588781 ctgccccaca gtcgccggcg gaaccagtcg ccgagcaggt tgccgaagag ccaacgagaa 588841 ccgtgtactg gtcgcaaccc gagccgcgct ggcccaagtc ccccccgcag gaccggcgcg 588901 agtccgggcc cgagcttagc gagtacccgc ggccactgcg ccacacgcat agcgacagag 588961 cacccgcggg gccgccgtcc ggtgccgaac acatgagtcc ggatccggtc gagcactacc 589021 ccgatctctg ggtggatgtc ctggacaccg aggtgggcga agcggaagcc gagaccgagg 589081 tgcgcgaagc gcaacctggg cgcggcgagc gccacgccgc agcggcggcg gccggcaccg 589141 acgtcgaggg tgatggtgcg gccgaggcgc gggttgcccg tcgtgccctg gacgtggtcc 589201 cgacgctgtg gcgcggcgcg ttggtcgtgc tgcagtcgat cctggccgtt gccttcggtg 589261 ccgggttgtt catcgccttc gaccagttgt ggcgctggaa cagcatagtg gcgctagtgc 589321 tatcggtgat ggtcatcctt ggcctagtgg tctcggtgcg ggcagtccgc aagaccgaag 589381 acatcgccag tacgttgatc gcggttgcgg tgggggcgct gattaccctg ggaccgctgg 589441 ccttgttgca atcgggctag ccgccaccac acacagtgcg cccagcaatc aaagtcggct 589501 tgtcgacggc ctcggtgtac ccgttgcggg ccgaggccgc gttcgagtac gccgacaggc 589561 ttggctacga cggggtcgag ctgatggtct ggggtgaatc ggtcagtcag gacatcgatg 589621 ccgtccggaa gctgtcgcgc cgctaccgcg tgccggtgtt gtcggtgcac gctccgtgcc 589681 tactcatctc gcagcgggtg tggggcgcca atccgatcct caagttggac cgcagtgtgc 589741 gggccgccga acaactgggc gcgcaaacgg tcgtcgtgca tccgcctttc cgctggcaac 589801 gacgctacgc cgaagggttc agcgatcagg ttgccgccct agaagcggcc agcaccgtga 589861 tggtggccgt tgaaaacatg tttcccttcc gagcggaccg gtttttcggg gccggccagt 589921 cccgggaacg gatgcgtaag cggggtggtg gcccaggtcc ggcgatctcg gcgttcgcgc 589981 cgtcctacga cccgctggac ggcaaccacg cgcattacac gctggacctc tcgcacaccg 590041 cgactgcggg caccgactcg ctggatatgg cgcggcggat gggcccaggg ctggtgcacc 590101 tgcacctgtg tgacggcagc ggcctgcccg ccgacgagca cctggtgccc ggccgcggta 590161 cccagccgac cgccgaggtg tgccagatgc tggccggcag cggcttcgtc ggccacgtcg 590221 tgttggaggt gtccacctca agcgcgcgtt cggccaatga acgcgaatcc atgctggccg 590281 agtcgttgca gttcgcccgc actcacctgc tgcgttgata tgccgggaac actatgaacg 590341 cgttgttcac cacggcgatg gcgctgcgcc cgcttgactc cgatcccggc aatccggcgt 590401 gccgggtttt tgaaggcgag ctgaacgagc actggaccat cgggcccaag gtgcacggcg 590461 gtgcgatggt ggcgctgtgt gccaatgccg cccgcaccgc ttacggcgcg gccggacagc 590521 agcccatgcg gcaaccggtc gcagtgtcgg cgagctttct gtgggcgccg gatccgggga 590581 cgatgcggtt ggtgacgtcg atccgcaagc gtggtcgccg gattagcgtg gccgatgtcg 590641 agctcaccca gggtggccgc acagcggtgc acgccgtggt caccctgggt gagccggagc 590701 attttctccc cggcgttgat gggagcggcg gggccagtgg aaccgcgccg ctgctgtcgg 590761 cgaatccggt ggtggagctg atggcaccgg aaccgcccga gggagtcgtg ccgatcggtc 590821 ccggccatca gctggccggg ctggtgcact taggcgaagg ctgcgatgtc cggccggtgt 590881 tgtcgacgtt gcggtccgcg accgatgggc ggccaccggt gattcagctg tgggcgcgtc 590941 cacgcggcgt tgctccggac gcgctgttcg ctctgttgtg cggggacttg tcggccccgg 591001 tgaccttcgc ggtggaccgc accggctggg cgcctacagt tgcgctcacc gcctatcttc 591061 gggccctgcc cgccgacggc tggctgcgag tgctctgcac ctgcgtcgaa atcgggcagg 591121 actggtttga cgaggaccac atcgtcgtcg accggttggg ccgcatcgtg gtgcagacgc 591181 gccaactggc gatggtgcct gcccagtagc acggatcggc cgagctgtct gcgatgcttt 591241 tcggcatggc aaggatcgcg attatcggcg gcggcagcat cggtgaggca ttgctgtcgg 591301 gtctgctgcg ggcgggccgg caggtcaaag acctggtagt ggccgagcgg atgcccgatc 591361 gcgccaacta cctggcgcag acctattcgg tgttggtgac gtcggcggcc gacgcggtgg 591421 agaacgcgac gttcgtcgtc gtcgcggtca aaccagccga cgtcgagccg gtgatcgcgg 591481 atctggcgaa cgcgactgcg gcggccgaaa acgacagtgc tgagcaggtg ttcgtcaccg 591541 tggtagcggg catcacgatc gcgtatttcg aatccaagct accggccggg acgccagtgg 591601 tgcgtgcgat gccgaacgcg gcggcattgg tgggagcggg ggttacagcg ctggccaaag 591661 gccgctttgt caccccgcaa cagcttgagg aggtctcggc cttgttcgac gcggtcggcg 591721 gcgtgctgac cgttccggaa tcgcagttgg acgcggtgac cgcggtgtcc ggctcgggtc 591781 cggcctattt ctttctgctg gtcgaggccc tggtggatgc cggagtcggg gtgggcttga 591841 gccgtcaggt ggccaccgat ctcgccgcgc agacaatggc tggctcagcg gcgatgctgc 591901 tggagcggat ggaccaagac cagggtggcg ccaatggcga gctgatgggg ctgcgcgtgg 591961 accttaccgc atcacggctg cgcgccgcgg ttacctcgcc gggcggtacg accgccgctg 592021 cgctgcggga actcgaacgc ggcgggtttc ggatggctgt cgacgcggcg gttcaagccg 592081 ccaaaagccg ctctgagcag ctcagaatta caccggaatg attcacgaat tttgaactga 592141 ttatccctca ccagtaccag taaccccact agtcccgcta ttctcctctt tgtaagcgcg 592201 tgtgggtgcc agcggagggg aagccgctgg gactgcgcgt gcctgacacg attgggttgc 592261 gatgacgtct acgaacgggc catcggcgcg ggataccggt tttgttgagg gccagcaggc 592321 caagacacaa cttctcaccg tggccgaagt ggcggccctg atgcgggtgt ccaagatgac 592381 ggtgtaccgg ctggtgcaca atggcgaact gcccgcggtt cgggtcgggc ggtcattccg 592441 ggtgcatgcc aaggccgtcc acgacatgtt ggagacttcg tacttcgacg cgggctagtt 592501 gccggccgca cgcggccgga gtccgcctga ccgatctggc aatgctcggg cgctgccggt 592561 ttggtgttcc gtgcgaccgc ccgggtagag tgtccgggtc agatagccgt atagatggcg 592621 gggtcatggg ttcagtaatc aagaagcggc gcaagcgcat gtccaagaaa aagcatcgca 592681 agctgctgcg tcgcacccgg gtgcagcgca ggaaactggg caaataggtt gcgagcagac 592741 cccgccagct cgaccgtcac gcgcttgtaa cgccgccgtt tcgcctggcc gttaggctgt 592801 cggagtgagt tcgtcgaacg ggcgcggtgg cgccggagga gtcggcggca gcagtgagca 592861 cccgcagtac cccaaagttg tgctggtgac cggtgcttgc cgtttcctag gcggctacct 592921 gaccgcacgg cttgcccaga acccgctgat caaccgggtc atcgcggtgg acgcgatcgc 592981 gccgagcaag gacatgctgc gccggatggg ccgagccgaa tttgttcgcg ctgatatccg 593041 aaacccattc atcgccaagg tgattcgcaa tggcgaggtg gacacggtgg tgcacgccgc 593101 ggcggcctcg tatgcgccgc ggtccggcgg cagtgcggca ttgaaggaac ttaacgtgat 593161 gggcgcgatg caactgttcg ccgcctgcca aaaggcgccc tcggtccgcc gggtcgtgct 593221 gaagtcgacc tctgaggttt acggatcgag cccacacgat ccggtgatgt tcaccgagga 593281 cagcagcagt cgacgtcctt tcagccaagg tttccctaag gacagtctcg atatcgaggg 593341 ctacgtgcgc gcgctgggcc gacgccgccc cgatattgca gtgactatcc tgcggctggc 593401 caacatgatc ggcccggcga tggacaccac gctttcacga tatctggccg ggccgctggt 593461 cccgacgatc ttcggccgtg atgcgcgact gcagttgctg cacgagcagg atgcgctggg 593521 tgcgttggag cgcgcggcga tggccggcaa ggccggaacg ttcaacatcg gagccgacgg 593581 catcctcatg ctgtcgcagg cgatccggcg ggccgggcga attccggtgc cggtgccagg 593641 gtttggggta tgggctctgg attcgctgag gcgagcgaat cactacaccg agctgaatcg 593701 tgagcaattc gcttacctga gttatggccg ggttatggac accaccagaa tgcgcgtcga 593761 actgggttac cagccgaagt ggacgaccgt cgaggcgttc gatgactatt ttcgcggccg 593821 cggcctgact cccattattg acccacatcg ggtacgctcc tgggagggtc gcgccgtagg 593881 tttagcgcag cgctggggta gccgaaatcc aattccatgg agcggactca gataggtttg 593941 gatgggtaac gtggcgggcg aaaccagagc gaatgtcatt ccactgcaca caaatcggag 594001 ccgggtagcg gcgcgcaggc gtgccggtca acgggcagag tcccggcagc atccgtcgtt 594061 gctgtccgat ccaaatgacc gggcgtcggc cgagcagatc gccgccgttg tccgggaaat 594121 cgacgaacac cggcgcgctg cgggtgccac gacctcgtcc accgaggcca cgcccaacga 594181 ccttgcgcaa ctcgtcgccg cggttgctgg atttctccga cagcgcctga ccggtgacta 594241 cagcgtcgac gaattcgggt tcgacccgca cttcaacagc gccatcgtac gacccttgct 594301 gcgattcttc ttcaagtcat ggtttcgggt cgaagtcagt ggtgtcgaga acatcccgcg 594361 cgatggtgcg gcgctggtgg tggccaatca cgcaggtgtg ttgccgtttg acgggttgat 594421 gttgtcggtg gccgtccacg acgagcaccc ggcgcatcgg gatctgcggc tgcttgccgc 594481 cgacatggtg ttcgacctcc ccgtgatcgg cgaagccgcc cgcaaggcgg gtcataccat 594541 ggcgtgtacg acggatgcgc accggttgct tgcctccggc gaactcaccg cggtgttccc 594601 cgagggatac aaggggctgg gtaagcgttt cgaggaccgt taccggttac agcggtttgg 594661 tcgcggcggc ttcgtatcgg ccgcgctacg gaccaaggcg ccgattgtgc cgtgttcgat 594721 catcggctcc gaagagatct accccatgct gaccgatgtc aagctgctgg ctcggctgtt 594781 cggcctgccg tacttcccga ttacgccgtt gttcccgttg gctggaccgg tcgggctagt 594841 gccgttgccc tcgaaatggc gcatcgcgtt cggtgagccg atctgcaccg ccgactacgc 594901 ctccaccgac gccgacgacc cgatggtgac gttcgagttg accgatcagg tgcgcgagac 594961 gatccagcag acgctatacc gactgcttgc cggccgtcgc aacatctttt tcggctgacc 595021 cttatttgac cagagtgaac tggcagacgt ccgtgtactt gtcgcggaac aggtctgagc 595081 agccacgtag gtagtgcatg tagatgtcgt acgtctcctg gcccttgagg gcgatcgcct 595141 catctttgtg cgcctgtagc gcatccgccc aggcgttcag ggtcggcacg tagttggccc 595201 cgatccggtg gtagcgctcg accttccatc cggcgttgga ggagtaatag tccacctgcg 595261 agatcctggg cagccgcccg cccgggaaga tctcggtcag gatgaacttg atgaagcgca 595321 gcaggctcat cggagacgtc aagcccagct cctgggcttc ctctttgtcc gggatagtga 595381 tggtgtgcag cagcatccgg ccgtcgtcgg gcgtcaaatt gtagaacttc ttgaagaagg 595441 tgtcgtagcg ctcgaacccg gcgtccccgg caccgtcggc gaaatgctca aacgcaccga 595501 gtgacacgat gcggtcgacc ggctcgtcga actcctccca gccctggatt cgcacctctt 595561 ttcggcgggg gctgtcgacc tcatcgaaca tcgccttgtc gtgggcgtac tggttttcgc 595621 tcagggtcaa gccgatgacg ttgacgtcgt actcggcgac cgcgtgtcgc atggtggaac 595681 cccagccgca gccgatgtcg agcagcgtca tgccgggctc aaggttcagc ttgtccagtg 595741 ccagcttgcg cttcgcgtac tgcgcctctt ccagcgtcat atcgggacgt tcgaagtagg 595801 cgcagctgta cgtcatcgat gggtcaagcc agagcttgaa gaactcgttc gatttgtcgt 595861 agtgggatcg aactgcttcg accggcggct tgagctgcgt gccgcttgtc gtgtcgccct 595921 gtgacgtcat tgaacggacc ctactttccc cactagatcg atgcaatcgc cgccaccgtt 595981 gcatcggcat cggcttcgtg gtgggccgct tctcccaaca tggtgacgac actggtgacc 596041 acaggctttc cttcggcgtc ggtaacttcg cttcggatct cggcgagcac cgtgccgtgg 596101 gattcgatga cggagtcaag ataggtgtcg aagtacagct tgtcgttggc caggatcggc 596161 cggtggaagc ggaacttctg gtcgcgatga aagacccggg cgatgttgat cgggatattg 596221 aacttggtga agatctccag ctgcacgcgc cggccggcga tcgccaggaa ggtcagcggg 596281 gctaccagcg cggggtaacc ggccgctgcg gcatccggct cgctgtagtg ggtcgggtgg 596341 tcgtctttga ccgcgaccgc gaactcgcgg atcttctcgc gccccaccag aaagtggtcc 596401 ggcgcccgat aatgcttgcc gatcagtgtc tgggcttctt cgggaactgt catgccgctg 596461 ccgccctccg ctcgaatagt tgctaagccc tattgcccgg ctcctcctcg ccccgctgcg 596521 cgggtcgcat cgtcgccagg ctgggcccta ttgcccggct cctcctcgcc ccgctgcgcg 596581 ggccgcatcg tcgccaggct aacggcgcag cttatcagcg tgattggcgt ctagaggcta 596641 gagccgccaa cgcgccgccg gccgcaccca gcgccagggc cgacggaacc ccgatccgag 596701 cggccttgcg ggcgattcgg aaatcacgga tctcccaccc ccgttcccgg gccaggctgc 596761 gcaggcgggc gtcggggttg atggcgaccg cggtgcccac cagcgacagc atcgggacgt 596821 cgttgtagct gtcggagtag gcggtgcagc gtttgagatt gagtccctcc cggatggcca 596881 gcgaccgcac cgcgtgtgcc ttgccggtgc cgtgcaggat ctcgccgacc agtctgccgg 596941 tgaatatccc gtcgaccgac tcggcgacgg tgcccagggc gccggttagg ccgagccggc 597001 gggcgatggt ggccgcgagt tcgtatgggg tagcggtgat cagccatacc tgctggccgg 597061 cgtccaggtg catctgggtg agttcgcggg tgccgtccca gatcttgtcg gcgatgatct 597121 cgtcgtaaat ctcctctccc aaggccacca actccgcgac ggatcggccc tcgatgaacg 597181 cgagcgcctt gcgccggcca gcggcgacgt cgttgctgtt ctccttgcca agtagctgga 597241 acttggcctg agcgtaaaga aatccgagga cgtcgcggta ggtgaagtag tggcgagcgg 597301 ctagcccgcg gccgaagtgc accgccgacg agccctgaac caaggtgttg tccacgtcga 597361 agaaggcggc tgcggtcagg tcgatcggcg gctgccgatc gctgccggcg gcggcgacgg 597421 gggccggcat gtccaccggc gagtggctgg cgctggcatc gggtggcggc gggtcggccg 597481 gcgaagccag gtcgacgtga ccggcctggt ctgggctacc caggtgggag gaaaccatca 597541 ttactcctaa tcgcggtgcc tgcccggtgg ccgatgctgc ggccgttatc aaccctatcc 597601 ggcaaatgcg cggcggagct cttggctggc gcggattgat ctgcaagccc agcgcggtat 597661 cgaaattcgc gaggccgcag cgactttcgt cgtgaacacg acccgcagcg gttcggggcc 597721 aacatgtcag ccccataccg gtacgcgcaa agctgggtac gtgaaatcct gaattcttca 597781 gcctgtcaac ggtagcgtct acgctagcta acgcaacgag acatccgatt actacgcacg 597841 ttaggacatt tcaggaggta tcgggaggcc taagggtcac taggtccgcg cgatgggcgg 597901 aacacgaggg tgaggatgat ttcggttagc ggcgccgtga aacgcatgtg gttgctgctg 597961 gccatcgtcg tggtggccgt tgtcgggggg cttggtatct atcggctgca cagcatcttc 598021 ggtgttcacg agcaacccac tgtcatggtc aagcctgatt tcgacgtccc gctgttcaac 598081 cccaagcggg tgacctacga agtctttggc cccgccaaga ccgcaaagat cgcctacctg 598141 gaccctgatg cccgggtgca tcgactcgat agcgtgtccc tgccgtggtc cgtcacggtc 598201 gagacgacgc tgcccgcggt cagcgtcaac ctcatggcgc agagtaacgc cgacgtgatc 598261 agctgccgga tcatcgtcaa cggcgccgtt aaggacgaaa ggtctgagac ctcgccgcga 598321 gcgctaacct cctgccaggt gtcatccgga tgagcgaaag acacgccgca ctgacgtcac 598381 tgccgcccat tctgccgcgg ctgatccgcc ggtttgcggt ggtgatcgtc ctgctctggc 598441 tgggcttcac cgcctttgtc aatctcgccg taccgcaact ggaagtggtc ggaaaagcac 598501 actcggtatc gatgagcccc agcgacgccg catcgattca ggcgatcaag cgcgttggtc 598561 aggtgttcgg tgagtttgat tccgataacg cggtaacgat cgtgctggaa ggcgaccagc 598621 cactcggtgg ggacgcgcac cggttctata gcgatctgat gcggaagctt tccgccgata 598681 cccgccatgt cgcgcacatc cagcacttct ggggggatcc gctgacagcg gcgggatccc 598741 aaagtgcgga tgatcgggcc gcctacgtcg tggtgtacct cgtcggtaac aacgaaaccg 598801 aagcgtatga ctcggtccac gcggtgcggc acatggtgga caccacaccg ccaccgcacg 598861 gggtgaaggc ctatgtcacc ggtccggcag cactcaatgc cgaccaggcc gaggccggag 598921 acaaaagtat cgctaaggtc accgcgatca cgagcatggt gatcgcggca atgttgctag 598981 tgatctatcg ctccgtaatt accgcggttc tcgtcttgat catggtcggc atcgacctcg 599041 gcgcaatccg cggattcatc gccttgctcg ccgaccacaa cattttcagc ctttcaacat 599101 ttgcgaccaa cctgctcgtt ctcatggcga ttgcggcgag cacggactac gcgatattca 599161 tgctcggccg ttaccacgaa tcgcgctacg ccggcgagga tcgggaaacg gccttctaca 599221 cgatgtttca cgggaccgcc cacgtgatct tgggttcggg tttgaccatt gccggcgcca 599281 tgtattgcct cagctttgcc cggcttccgt attttgaaac gctcggcgcg cccattgcta 599341 tcggcatgct ggtcgcggtc ttggcggcgc tcacgctcgg cccggccgta ctgaccgtgg 599401 gcagcttctt caagctgttc gatcccaagc ggcggatgaa cactcggcgg tggcgccggg 599461 tgggaacggc aattgtgcgt tggccggggc cggtgctcgc ggcgacatgc ttggtcgcct 599521 ccattggctt gctggccttg cccagttacc ggacaacgta tgatctgcgc aagttcatgc 599581 ccgccagcat gccgtccaat gtgggggatg cggcggctgg tcgacacttt tcacgggctc 599641 ggctgaaccc tgaggtgctg ttgatcgaga ctgaccacga tatgcgtaat ccggtggaca 599701 tgctggtgtt ggacaaggta gccaaaaata tctaccacag tcccggtatt gaacaagtga 599761 aagcgataac ccggcccttg ggaacaacca tcaagcacac ttcgataccg ttcatcatca 599821 gcatgcaggg cgtgaatagt agcgagcaaa tggaattcat gaaggaccga atttatgaca 599881 tactggtgca ggtggccgcg atgaatacct ccatcgagac gatgcatcgc atgtatgcac 599941 tcatgggcga ggtcattgac aacaccgtcg acatggatca tctcacgcat gatatgtcgg 600001 acataacggc tacgctaaga gatcatctcg cggatttcga ggatttcttc cggcctattc 600061 gcagctactt ctactgggaa aaacattgtt tcgacgttcc gctctgctgg tcgataagat 600121 cgatattcga tatgtttgac agtgtggacc agctgagcga aaagctcgag tacctggtca 600181 aggatatgga tattctgatt acactgttgc cgcagatgcg cgcgcagatg ccgccgatga 600241 tatctgcgat gacgacgatg cgggacatga tgcttatctg gcatggcacg cttggcgcgt 600301 tctataagca acaggagagg aataacaagg accccggcgc gatgggccgg gtttttgacg 600361 ccgcccagat cgatgattcg ttctatctgc cgcagtcggc ttttgagaat ccggatttca 600421 agcgggggct gaagatgttt ttgtctccgg acggcaaggc agcccgcttt gtcattgctc 600481 tggagggaga tcccgcaacg cccgagggca tctgtcgggt cgagccgatc aagcgggagg 600541 ctagagaggc cataaaggga actccattgc agggcgctgc gatctatctg ggtggcaccg 600601 cggcgacgtt caaggatatt cgagagggcg ccagatacga tctgctgatc gccggagtgg 600661 cggcgataag cttgattttg atcatcatga tgatcatcac ccgaagtgtg gtagccgcag 600721 tggttatcgt gggtaccgtc gtgctttcca tgggcgcctc tttcgggctt tccgtattgg 600781 tctggcagga cattctgggt atcgagttgt actggatggt gttggcgatg tcggtgatcc 600841 tgctcctggc ggtgggatcc gactacaatc tgctgctgat ttcccggttg aaagaggaaa 600901 ttggggccgg attgaacacc ggaattatcc gtgccatggc tggtaccggg ggagtggtga 600961 cggctgccgg catggtgttc gccgttacca tgtcgttgtt tgtgttcagc gatttgcgga 601021 ttattggtca gatcggtacc accatcggcc tgggcttgct gttcgacacc ctcgtcgtgc 601081 gctcgttcat gacaccgtcc attgctgcgc tgctgggacg ctggttctgg tggccgctac 601141 gggtgcgccc gcgcccggcc agtcagatgc ttcggccgtt cgcgccgcgc cgattggttc 601201 gcgccttgtt gctgccgtcc ggccagcacc cgtcagcgac tggcgcccat gagtaggccc 601261 caggtggagc ttttgactcg cgccgggtgc gcgatctgcg tgcgggtagc ggagcagctg 601321 gccgaactgt ccagcgaact gggcttcgac atgatgacga tcgacgtcga tgtcgcggcg 601381 tcgacgggca atccagggct gcgagctgag tttggcgatc ggttgccggt ggtcctgctg 601441 gacggccgcg agcacagcta ctgggaggtc gacgagcacc ggctgcgtgc ggatatagcc 601501 cgcagcacat ttggtagccc acctgataaa cgtctaccgt agacaccagt tttactgggg 601561 tagtcgaggg agctggccag gtggtgctgc cgtgagcgtg ctgctcttcg gggtgtcgca 601621 tcgtagcgcg ccggtcgtcg tccttgaaca actcagtatc gacgaatccg atcaagtcaa 601681 gatcatcgac cgagtgctgg cttcgccgct ggtgaccgag gcgatggtgc tgtcgacttg 601741 caaccgcgtc gaggtctacg ccgtagtgga cgcgttccat ggcggcctgt cggtgatcgg 601801 gcaggtgctt gccgaacact ccggtatgtc gatgggggag ctgaccaagt acgcatatgt 601861 ccgctacagc gaggcagcag ttgagcacct gttcgcggtt gccagcggcc tggactcggc 601921 ggtgatcggc gagcagcagg tgcttggtca ggtgcgccgc gcctatgccg tcgccgaatc 601981 caaccgcacg gtcggccgcg tgctgcacga attggcccag cgggcgctgt cggtgggcaa 602041 gcgagtgcac tccgaaaccg ccattgacgc tgccggtgcc tccgtggtgt cggtcgccct 602101 gggaatggcc gagcgcaaat tgggctcgtt ggcgggcacg accgcggtgg tgatcggcgc 602161 cggggcgatg ggcgcgctgt cggcggtaca tctgacccgt gccggcgtcg ggcacattca 602221 ggtgctcaac cggtcgttgt cccgggcgca gcggttggcc cgaaggatcc gcgaatctgg 602281 cgtgccggcc gaggcgctag cgctcgaccg cctggctaat gtcctggccg atgccgacgt 602341 ggtggtcagc tgtactgggg cggtgcgtcc ggtggtgtcg ctggccgatg tgcatcatgc 602401 gctggccgcc gcccgccgtg acgaggccac ccgtccgttg gtgatatgcg acttgggcat 602461 gccgcgtgac gtcgatcctg cggtggccag attaccgtgt gtgtgggtcg tggacgtgga 602521 tagcgtgcaa catgaaccct cggcacatgc cgcggctgcc gacgttgagg ccgcccgcca 602581 catcgtcgcc gccgaagttg ccagctatct ggtggggcag cggatggccg aggtcacccc 602641 aaccgtgacg gcgttgcgcc agcgagccgc cgaagtggtc gaagcggaat tgctgcgcct 602701 ggacaaccgg ctgcccggcc tgcagagtgt ccagcgcgag gaggtggccc gcaccgtacg 602761 gcgagtcgtg gacaagctgt tgcacgcgcc taccgtgcgg atcaagcagc tcgccagtgc 602821 gcccggcggt gacagctacg ccgaggcgct gcgcgaactc ttcgagcttg accagaccgc 602881 cgtcgatgcc gtcgccactg caggtgaatt accggtggtg ccaagcggat tcgacgctga 602941 aagtcgccgc ggtggaggcg acatgcaaag cagcccgaag cgatcgccga gtaactgatt 603001 ggcgcacgtg atccggatag gtacccgggg cagcttgctg gccaccactc aggccgccac 603061 tgtcagagac gccctcatcg ctggtggcca ctccgcggag ttggtgacca tcagcaccga 603121 gggtgaccga tccatggcgc cgatcgccag tctcggggtt ggcgtcttca ccacggcgtt 603181 gcgcgaggcg atggaggcag gcctcgtcga tgcggcggtg cattcgtaca aggatttgcc 603241 gactgccgcc gatccaaggt tcacggttgc ggcgataccg ccgcgcaatg acccccgcga 603301 cgcggtggta gcccgtgacg ggctgacgct gggggaattg ccggtcggat cgttggtggg 603361 cacatcctcg ccgcggcggg ccgcacagct tagagcattg ggtctcggtt tggaaatccg 603421 ccccctacga ggcaacctag ataccaggtt gaacaaggta agtagcggcg atcttgacgc 603481 catcgtggtg gcccgggctg gtctggcgcg gctgggccgc ctcgatgacg tgaccgagac 603541 gttagagccg gtgcagatgt tgcccgcgcc ggctcagggc gcgctcgcgg tcgaatgccg 603601 cgccggcgac agccggttgg tggcagtgct ggcggagttg gatgacgccg acacgcgtgc 603661 ggcggtcacc gccgagcgag ccctgcttgc cgacctggag gcaggttgct ccgcaccggt 603721 gggagcgatc gcagaagtgg tcgagtccat cgatgaggac ggccgtgtct tcgaggagct 603781 gtcgctgcgc gggtgcgtgg cggcgctgga cggatccgac gtgatccgcg cgtccggcat 603841 cggcagttgc ggtcgggcac gggagctggg gctctcggtc gccgcggagc tgttcgagct 603901 gggcgcccgg gagctgatgt ggggagtgcg gcattagccc gcatgaagaa gtgactggga 603961 gtgacaatca tgacgcgagg gcgtaagccg agaccgggcc gcatcgtttt cgtgggctcc 604021 ggtccgggcg accccggctt gcttacgaca cgggctgccg cggtgctggc caacgccgcg 604081 ctggtgttca ccgatcccga cgtaccggag ccggtggtgg cgctgatcgg cacggatctg 604141 ccccccgtgt ccggcccggc gcccgccgag ccggttgccg ggaacggcga tgcggccggc 604201 ggaggaagtg cgcaggaaca cggccgggcc gcgtccgcgg tagtctccgg tggtcctgac 604261 atccgcccgg cgctgggcga tcccgccgat gtggccaaga cgctgaccgc cgaggcccgt 604321 tcgggtgtcg acgtggtgcg gctggtggcg ggcgatccgc tcacggtgga tgcggtaatc 604381 agcgaggtga acgccgtcgc acgcacccac ctgcacatcg aaatcgtgcc cggcctggcc 604441 gccagcagcg cggtcccgac ctatgccggg ttgccgctgg gttcgtcgca caccgtcgcc 604501 gacgtgcgta tcgaccccga aaacaccgac tgggacgcgc tggctgccgc acccgggccg 604561 ctgatcctgc aggccaccgc atcgcatcta gccgaatcgg cccgcagcct gatcgatcac 604621 cagctggccg agtccactcc gtgcgtggtg accgcacacg gcaccacctg tcagcagcgt 604681 tcggtcgaga ccacacttca gggattgacc gacccggccg tcctgggcgc taccgacccc 604741 gcgtgctccg caaacgggag ggactcccag gccggaccgc tgatagtgac catcggcaag 604801 acggtgacca gtcgggcaaa gctgaactgg tgggagagcc gcgccctcta cggctggacg 604861 gtgttggtgc cgcgcaccaa ggaccaggcc ggcgagatga gcgagcggct cacgtcgtac 604921 ggcgcgctgc cggtggaggt gccgaccatc gccgtcgagc cgccgcgcag ccccgcgcag 604981 atggagcgcg ccgtcaaggg cctggtcgat ggccgattcc agtggatcgt gttcacctcc 605041 accaacgcgg tgcgtgcggt gtgggagaag ttcggcgagt tcggtctgga tgcccgcgcg 605101 ttctccgggg tgaagatcgc ctgtgtcggc gagtcgacgg ccgaccgggt gcgcgccttc 605161 ggaatcagtc ccgagctggt gccctccggg gagcagtcct cgcttggctt gctagacgac 605221 ttcccgccct acgacagcgt tttcgacccg gtgaaccggg ttttgctgcc gcgcgccgac 605281 atcgccaccg aaacgctggc cgagggactg cgagagcgtg gctgggagat cgaggacgtc 605341 accgcctacc ggaccgtgcg ggccgcgccg ccgccggcca ctacccggga aatgatcaag 605401 acgggcgggt ttgacgcggt atgtttcacc tccagctcga cggtgcgaaa cctggtcggc 605461 atcgccggca agccgcacgc gcggacgatc atcgcctgca tagggccaaa gaccgccgag 605521 accgcagccg agttcggctt gcgggtcgat gtccagccgg acaccgccgc catcggcccg 605581 ctggtcgatg cgctggccga gcatgccgcc cggttgcgcg ctgagggtgc gctgcccccg 605641 ccgcgcaaga agagccgcag gcgctagtgg cccaccctcg tcaggtgagc gtgcgtgtct 605701 gtacaccgac acgccgaccg agctggcatt ttgcgtacgc tcgcggctac gaatgagcat 605761 gagttcctat ccgcggcagc gaccgcgccg gctccgctcc accgtcgcga tgcgccgtct 605821 ggttgcgcaa acctcgttgg agccaaggca tttggtgctg ccgatgttcg ttgccgacgg 605881 cattgacgag ccgcggccga ttacctccat gccgggcgtg gtacagcaca cccgggattc 605941 gctacgtagg gccgcggcag ccgcggtggc cgccggcgtg ggtgggctga tgcttttcgg 606001 cgtgccgcgc gaccaggaca aggacggtgt cggttcggcg ggcatcgacc ccgacgggat 606061 cctcaacgtc gcccttcgcg atctggccaa ggacctgggt gaggccacgg tgttgatggc 606121 cgacacctgt ctggacgagt tcaccgacca cgggcactgc ggtgtgctcg atgaccgggg 606181 ccgggtcgat aacgacgcca ccgtggcccg ctatgtggaa ctggctgtgg cgcaagcgga 606241 atcgggcgcc cacgtggtcg gacccagtgg gatgatggat ggccaggtag ccgcgatccg 606301 ggacggtttg gacgccgccg gctacatcga tgtggtgatc ttggcctacg ccgcgaagtt 606361 tgcttcggcg ttctacggcc cgttccgcga ggcggtgagc tctagcctgt ccggggatcg 606421 gcgcacctac cagcaggagc cgggcaacgc cgccgaggcg ctgcgtgaga tcgagctcga 606481 tctcgacgga ggcgccgaca ttgtgatggt caaacccgcg atgggctacc tcgatgtggt 606541 ggcggccgcg gcggacgtct cgccggtccc ggtggccgcc tatcaggtct cgggagagta 606601 cgcgatgatt cgtgcggcgg cggccaataa ttggatcgat gagcgtgccg cggtgctaga 606661 gtcgctgacc ggtatccggc gtgccggcgc cgacatcgtg ctcacctact gggcggtaga 606721 cgcggcgggc tggcttacgt gacggaggcc tgacatgaca ccaaccgggg ataccaagcc 606781 caagttgttg ttctacgaac ccggcgcgag ctggtactgg gtgctgactg gtccgcttgc 606841 ggcggtgtcg gtgctcctcc tcgagatatc cagcggcgcc ggggttgggt tgataacgcc 606901 ggcgatcttt ctggtgatgg tgtcggcgtt cgtggcattg caggtgaagg cggcgcggat 606961 tcacacgtcg gtcgagctga cgcatgatgc cttgcgccaa ggcaccgaga ccatcaggct 607021 ggccgaaatc gtcaaaatct atccggaggc agacggccgc gagacgtccg gggaagagcc 607081 ggcaaagtgg cagtcggcgc ggaccctggg cgagctcgtc ggcgtaccgc gcggccgggt 607141 gggaatcggg ctgaagctga ccggaggccg caccgcccag gcctgggcgc gtcgtcatca 607201 acagctgcgg gcggcgctga ctccgctggt tcaggagcgg ctcgggcccg tggattctga 607261 tgtcgccgac gtcaacggtg acgacgccgg gccagcgcgg tgatcgcccg ctaccgggcc 607321 ggggccgaac tgttcctggc ttgtgccgcg cttgccggat ctgcggcgag ctggtcgcgg 607381 acccgctcca ccgtggccgt cgcgcccgtc atcgacggcc agccggtcac cctgtcggtg 607441 gtctatcacc cgcaaccgtt ggtgctgacc ctgctgctgg cgacgatcgc cggcgtgttg 607501 tcggtggtgg ggacggccag gttgcggcgc gcgcgagctg gcttgaacgc acatccggac 607561 ggcttgaacc agcgtccgcc cggcggttgg tgtcattgag ccgtttgcgt ggatcacttc 607621 cgctgctgct tgatcgggcc ctggtctgtg tcggcagcgg ctggtagtat cgaaagtatg 607681 ttcgatcagg tgcgggggcg catgccttca ccggaggcga tcgctcattt tgatgagcgg 607741 tttgaatgcc atgctccgcg gaccacgagg gtgtcggcgg cgttcatcga tcggatctgc 607801 tcggcgactc gggccgaaaa ccgggccgct gcggcgcagt tggtggcgtt gggggagttg 607861 ttcgcctatc ggtggtcgcg ttgcgggggc cgcgaggagt gggtgatgga caccatggcg 607921 gcggtggccg ccgaggtggc ggcggcgttg cggatcagtc agggtctggc ggccagccgg 607981 ttgcggtatg cgcgggcgat gcgtgagcgg ctgcctaaga cggctgaggt gtttagcgcc 608041 ggcgacatcg gctatctgat gtttgccacg attgtgtatc gcaccgactt gatcgttgac 608101 cctgatgttt tggcggcggt ggatgcgcag ttggccgcca atgtggcgcg ttggccctcg 608161 atgaccaagg cccgcctggc tgggcaggtc gataagatcg tggcgcgtgc cgatgccgat 608221 gcggtgcggc ggcgcaagga gtatcaggcc cagcgccagt tctgggtcgg ggaaagccaa 608281 gacggtgtgt gccagatcgg tggcagcctg ttggccgtcg acgcacacgc cctcgatgcg 608341 cggttgagcg cgttggcggg caccgtgtgt gagcacgatc cgcgcagccg tgagcagcgc 608401 cgcgcggacg cgttgggggc gttggcgggc ggggccgatc ggctgggctg tggctgtggg 608461 cgcgctgatt gtgcggccgg gaagcggcct gcggccccgc cggtggtgat tcacctgatc 608521 gccgaggcgg ccacgatcaa tggcacgggc tcggcgccgg catcgcagat gaacgccgac 608581 gggctgatca ccgccgaact ggtggccgag ctggccaaga cggccacgct ggtgccgctg 608641 gttcatcccg gcgatgcgcc gcccgagccg gggtatgcgc cgtcgaaagc gctcgccgat 608701 ttcgttcgct gccgggatct gacgtgtcgc tggcccggct gtgatgagcc cgccaccaat 608761 tgcgacctgg atcatacgat cccgtatgcc gctggtgggc ccacccatgc gtcgaacctg 608821 aaatgttact gccgtaccca tcacctggtg aaaacgtttt ggggatggcg tgatcaacag 608881 ctacccgacg gcaccctgat tttgacctcc ccgtccgggc atacctatgt cagcaccccg 608941 ggcagtgcgc tgctgttccc cagcttgtgc cacttcagcg gcggcatccc ggcaccggaa 609001 gccgacccac cctacgacca ttgcgaccag cgcacagcga tgatgcccaa acgccggcgc 609061 acccgcgccc aagaccgggc ctatcgcatc gccaccgaac gtcgacaaaa ccacgccgcc 609121 cgccagcgcg cccaggtgct cacccagacc gccgcggcca ccgacaccca cggcccacca 609181 ccggatcaca acgacgaccc accgccgttt taggctgacc tgctgattag cggtagcacc 609241 agctgacggc ggcggtcgat ggcgtcagcc aggtcgtgga gcgctttatg caccgagcgc 609301 gccatcggga acatggattc atgctcgccc tggtcacagc ggccacctag ctgttcgact 609361 actgcggggc tcgcgactaa tgcccactgg acgccggcgg ctcggcagtc ctcatcgagg 609421 atgcacaaga gcgagatgcc ggccccactg aagtgactca actcgctcag gtcgagcacc 609481 atcggatttg ttccgaggct gaaacgccgg acgtgctcgc tgatctgctc gacattggcg 609541 gcgtcgatct cgcctcggat ggtcaccact gtcgccaggt gatgcaggta ggcccgaatc 609601 tgagcgccac cgtagtcaac ggcggcattt ccgggccgcg tcgtgacgct gcaagccgat 609661 tttgacgtcg ggatcgtggt agtcatcaat agcctcgttc tccgtcgcgt tgcgggccga 609721 ccgatcgccg gctaaagctg cctttaacca aacccgcaaa atctaagggg agcgaaagcc 609781 gcctctaact ctttgctaag aagcgatttt cggggtgctc ccggcgaccc acgccgtcgc 609841 ggccatggcg ctgttaggct gcgatggctg ccggttgcta gtcgggggct gatgatatgg 609901 ccggtggtat ggatcagccg cccggtcagc ctagaaggcg gaccagacag cagagttcag 609961 acggaaagaa cggcgtgcgc gctgcagaga tcaccggaga aattagggcc ctgacaggat 610021 tgcgcatcgt cgcggcggtg tgggtagtgc tgtttcactt ccgaccgatg ttgggtgatg 610081 cgtcaccggg cttccgcgac gccctcgcgc cggtgctcga ctgcggcgcg cagggtgtag 610141 acctcttctt catcctcagt gggttcgtgc tgacctggaa ctacctcgac cgcatgggcc 610201 ggtcgtggtc ggtccgtgcc aacctgcact tcttgtggct gcggctggcc agggtgtggc 610261 cggtgtacct ggtcaccttg cacctggccg ccgtgtgggt catctttacg ctgcacgtcg 610321 gtcacgtgcc gtctccggag gcaggccagc tgaccgcgat cagctatgtg cgccagatcc 610381 tgctggtgca gctgtggttt cagccgtatt tcgatggatc cagttgggat ggaccggcct 610441 ggtcgatcag tgcggaatgg ttggcctact tgctgttcgg tctgctcatt ctggtcatct 610501 tccggatgaa gcatgccacc agggcgcggg gcctgatgtg gctggccttc gcggcgtcgt 610561 tgccgcccgt ggtgctgctg ttggccagcg gccagttcta tacgccatgg agctggctgc 610621 cccgaatcgt gacgcaattc gccgcgggag cgctggcgtg tgccgccgtc cgcaggttgc 610681 ggccgaccga tcgcgctcgc cgcatcgccg ggtacctttc cgtgctggtc ggcgtcgcga 610741 ttgtcggcat cctctacctg ttgcacgcgc atccgctcgc cggggtcgag gacagcggcg 610801 gggtggtcga cgtgctgttc gttccgctgg tgatcagcct ggcgattggc gtcggcagcc 610861 tgccggcgtt gctgtcgacg cggttgatgg tttttggcgg gcagatctcg ttttgcctct 610921 acatggtgca cgagctggtg cataccgcct ggggatgggc cgtgcaacaa tacgagcttg 610981 cgctgcagga tcagccgtgg aaatggaacg tcgtcggtct gctcgcgatc gccctggggg 611041 ctgcgatctt gctgtatcac ttcgtcgaag aaccggaccg ccgatggatg cgccggatgg 611101 tcgacgtcaa agccgcgagt gcgagaagcg agcccgggga gccggtaggc agcacgcgtt 611161 atcaaatcga cgatgcgctg gaaggggttt cggcccgcgc ggtgtgacgg ttgagtgggg 611221 ctgcagcggg tcgacgcgag ttcacatcgg tttcctcgta cgattccctt gatttggacg 611281 cggcgcacga cccgttcaac tttgagccga gtccagtgga gccatcagtg gagtcagtgt 611341 gagtcgcccg ggtacatacg tcattggtct cactctcctg gtcggcctgg tcgtcggcaa 611401 tccagggtgc ccgcggtcct accgcccact gaccctggat taccggctta acccggtcgc 611461 ggtgattggc gactcctata ccaccggcac cgatgagggc ggtctgggct cgaaatcatg 611521 gaccgctcgc acctggcaga tgctcgctgc acgtggcgtg cggatcgcag ccgacgtggc 611581 cgccgagggc cgggccggct acggggtgcc cggcgaccac ggcaacgtgt ttgaggatct 611641 gaccgccagg gccgtccagc ccgacgatgc actggtggtg ttctttggct cccgcaacga 611701 ccaaggcatg gatcctgagg atcccgagat gctggccgaa aaggtccgcg acactttcga 611761 tctagcgcgc caccgcgcac catccgcgag cttgctggtg atcgcaccgc cgtggcctac 611821 cgccgacgta cctggcccaa tgctgcggat tcgcgacgtg ctgggcgctc aggcgcgggc 611881 cgcaggagca gtgtttgtcg acccgatcgc cgaccactgg tttgtcgaca ggcccgagct 611941 gatcggcgcg gatggcgtgc atcccaacga tgcgggacat gagtatctgg cggacaagat 612001 cgcgccgctg atcagcatgg agttggttgg atgagttggg agtcacgagc cacgcaaagg 612061 gttagcgtga cgacggtcga cgtgctagtc ctctgcgtgc cgttcgtaat cccaacgctc 612121 aaggcgcgcc tgcaactgca ggagaccaag tccggcgagt ggcgccgcgg cggtgaggaa 612181 ggccagcagc atcggactca tctcagaacc tccaaaacca tttcattcgt accacgttcg 612241 tcgtcgaggg gtggttcttt cgcgaaacat gtccgtccga attcagctgt cctcagccac 612301 cgccacgctg cgccacgtca gctaggacgc catccaagcc agttcgccgg gcaactgttc 612361 gcgccagtac gacgcgtcgt gtcctccggg cgagaagctg ccggcaggcg gttggtgcag 612421 ttggttgacg aattggcgag tggcgaagta gaagcggtcg ctggtgccgc aatccacccg 612481 tagcgggatt gagttcagcg cgggcaggcc caacacgctg tgctgcacat agtcgtcgta 612541 gctgtcgaac gccccgggtg tgctgccggt gaacgacgtg aacaatgccg ggctgatggc 612601 acagatcccc gcggttctgg ccggacccaa ccgggcaccc aggagcagcg cgccgtatcc 612661 ccccatcgac caccccagga atcccacccg ggaggtgtcc atacccatcg aggtcagcat 612721 cggcagcagc tcgtcgagca ccatcgcacc cgagtccccg ccggaagagc gacggtgcca 612781 gtaggtgttg ccgccgtcga cgccgaccac cgcgaacgct ggcttgccct ccttgaccag 612841 gcgggccaac ccctgctcga cgccgagatc cagcatcatg ccggcgttgc cgtccttgcc 612901 atgcagtgcg atcactggcc gcagctgccc gctctggccg ggcggcatgg agatcaccca 612961 gttggtcttg atgcctccgc gagccgccga gatgaacgag ccggagatcc tggtcggcaa 613021 gctgctgccc gccgtcgggg gctcgaacgg cgccggggcc gcctgcggct caagtgggtc 613081 caccagggcg ccgaaggccc acacgccggc ggctcccgcg ccggcgccgg caccccaacg 613141 gagcagggca cggcgggtca ggtctgccat gggcgtcatg atgccgcgcc gatcggtgtt 613201 gcccgcacag ccacgccgta gcaccggcca atcgtgacac cggtaacggc tggcgagtcg 613261 ccgtagtggg ggcccggctg cgcagcagtg acggcatgaa gaactttcgc aaaactggaa 613321 acggctggta ccggaagtcg gtattctttg cgcggcagct gcgtgtcaat gatgaccgag 613381 cggtagcccg gtcgtccctg gtgtatggga gggtgttcga tcacctgcct caacatctcc 613441 gaagtgccga acgagaccaa ccgtaagaag aaccgtcagg ccggactcga ccgcagtatc 613501 cgggtgattc atggcagctt cgacgacatt cccgagccgg acagcggcta tgacgtcgtc 613561 tggtcacaag atgcgatcct gcacgcgccc gaccgccgaa aggtgctcga ggaggcattc 613621 cgggtgttgc ggcccggcgg cgaactgatc ttcaccgatc cgatgcaggc cgacgatgtt 613681 cccgacggtg tgctgcagcc ggtctacgac cggctcaacc tgcgtgacct tggctcgatg 613741 cgcttctatg cgtgaagccg cacaggcact cggtttcgag gtgctcgacc aaagagacct 613801 ggttcgcaat ctgcggacgc actacagccg agtgttcgag gaactcgaag cccggcgtct 613861 cgaactcgag gggaagtcct cccaggagta cctcgacaag atgcgggtag gcctgaagaa 613921 ctgggtcgag gccgccgaca acggtcactc tcgcgtgggg catccaacat ttccgagaac 613981 ccgcctgact ccgatatgcc agctgcccac ggccgcgatc gactcgacgg ctggtcgtcg 614041 ccggtatcgt tgaccccacg gactgcgtga cagccggggg cacggagttg cccggcggcg 614101 ccagtactgc ccccgacgga ccggaaggca ggtgccatag ctaccacttc aggactgcgc 614161 ccaggactgt cgcagcgtca gctcaacatg atcgctatcg gcggcgtcat cggtgctggc 614221 ttgttcgtcg ggtctggtgt ggttatccgt gcgaccggtc cggcggcatt cctgacctat 614281 gcgctgtgcg gcgcactgat cgttctggtg atgcgcatgc tgggcgagat ggccgccgcc 614341 aatccgtcga ctggagcgtt cgccgactac gcggcaaaag ccctgggcgg ctgggcggga 614401 ttctcggttg gctggctgta ctggtacttc tgggtaatcg tcgtggggtt cgaggcggtt 614461 gccggcggga aggttctaac ctactggatc gatgcgccgc tgtggttggc gtcgctgtgt 614521 ctgatgatga tgatgaccgc gacgaacttg gtctcggtgt catccttcgg tgagttcgag 614581 ttctggttcg ccggagtcaa ggttgccacc atcgtcggct tcctggtcct tggcaccgct 614641 ttcgccttcg ggctgctgcc gggccatggc atggatttca gcaacctcag cgcgcacggt 614701 ggcttctttc ccgacggggt aggtgccgtc ttcgctgcca tcgtggtcgc gatcttctcc 614761 atgactggca cggaagtagt caccatcgcc gcggctgaag cgccggaccc tcaacgagcg 614821 gtccaacgcg cgatgagcac ggtggtggca cgcatcgtga tcttcttcgt cggctcggtc 614881 ttcctgctca cggtgatcct gccgtggaac tcgttggagc ttggcgcctc cccgtacgtt 614941 gccgcgctgc ggcacatggg tattgggggt gctgatcaga tcatgaatgc cgtcgtgctt 615001 accgcggtgc tgtcctgctt gaactcgggc ctgtataccg cgtcgcggat gctgttcgtg 615061 ctcgccgccc ggcaggaggc gccggcccag ctggtcaaag tcaaccggcg tggagtcccc 615121 accttcgcga tcatgggatc gtccgtggtg ggattcctgt gcgtgatcat ggcatgggtc 615181 tcacccgcaa cggtattcgt tttcctgctc aactcgtcgg gcgctgtgat tttgttcgtc 615241 tacctgctta tcgcgctgtc gcagatcgtg ttgcgtcgcc agacatctgg ccaaaatctg 615301 ggggtacgga tgtggctttt cccggggctg tcgatcgtca cggtgaccgg aattgtcgcc 615361 gtgctggcgc ggatggcgtt cgactacgcc gcgcgcagcc agctctggct cagcctgctg 615421 tcctgggcag tggtcgttgg gtgttatttg gtcaccacat tggtgcgacg tccccttaat 615481 cggccttggt gagcagtacg gcctcgtcga acggcagtct ggcaaagacc ggccgccatc 615541 ggctgctgac atacggcgcc gcctcggcct tggtgagccg ccgcgggttg gcgacaccaa 615601 aggttttgcc gtagcggcgc atccggccac cgccggccgc ggtgatgttc ttcagccagt 615661 cgcggttcgg accgtaggtg agcaaaatcg ccacgcccgc ccggccgtcg acgtccgcgc 615721 tgaacacgtt caacggggta cggtacggct tgcccgagcg gcggcccacg tgctcaagaa 615781 tcgcgaacgc cgggagccag ccggcccata gccgctgaat ggggttggtg acatatcgat 615841 tgaaccgagc cagccactgc ggtagttgca tgcccaccat ccaactcgtg gaccggccgc 615901 ggcatcaagc aaacctctgg tggctgcggc aaactcttac accctgtagt tgagcgacct 615961 gggcaggctg gaacactagt cgtcatgggc agcacggaac aggccacctc gcgggtaagg 616021 ggagccgcgc gcacatcggc gcagctgttc gaggccgcat gcagcgtcat acccggcgga 616081 gtgaactccc cggtgcgggc gttcacggcg gtgggcggca ccccgcgctt cattaccgaa 616141 gcccacggct gctggttgat tgacgccgac ggcaaccgct acgtagacct ggtctgctca 616201 tggggcccga tgatcctcgg tcacgcgcat ccggccgtcg tcgaggcagt ggccaaggcc 616261 gcagcccgcg gcctgtcctt cggggccccg actcccgccg aaacccaact agccggcgag 616321 atcatcggcc gggtagctcc cgtcgagcgg atacggctgg tgaactccgg caccgaggcc 616381 actatgagcg ccgtgcggct ggcccgcgga ttcaccggcc gggccaagat cgtcaagttc 616441 tccggctgct accacggaca cgtcgacgca ttgctcgccg acgcgggttc gggagtggcc 616501 accctgggct tatgtgacga cccccagcgc ccggcttcgc cgcgctcgca atcgtcacgg 616561 ggcctgccgt cctcccccgg ggtcactggc gccgcggcag ccgacacgat cgtgttgccc 616621 tacaacgaca tcgatgccgt acagcagacc ttcgcccggt tcggcgagca gatcgccgcc 616681 gtaatcaccg aggccagccc cggcaacatg ggagtcgtcc cgcccgggcc cggcttcaac 616741 gcggcgctgc gcgcgatcac cgccgagcac ggcgccctgc tcatcctcga cgaggtgatg 616801 accgggttcc gggtcagccg aagtggttgg tacggaatcg atccggtgcc cgctgacctg 616861 ttcgccttcg gcaaggtgat gagcggcggg atgcccgccg ccgcgttcgg cgggcgcgcc 616921 gaggtgatgc agcggctggc gccgctgggg ccggtgtatc aggccggcac gttgtcgggt 616981 aacccggtgg cggttgccgc cgggctggca acgctgcggg ccgccgacga cgcggtctac 617041 accgcattgg acgccaacgc tgaccgcctg gccggcctgc tctccgaggc actgacggat 617101 gccgttgtgc cacaccagat ttcgcgggca ggcaatatgc tcagtgtgtt cttcggcgaa 617161 acaccggtga ccgacttcgc gtccgcgcgg gccagccaga cctggcgtta tccagcgttc 617221 tttcatgcca tgctggacgc cggtgtctac ccgccgtgca gtgccttcga ggcatggttc 617281 gtctcggccg ctttggacga cgcggcgttc ggccggatcg ccaacgcgct gcccgccgcg 617341 gcccgagcgg cggcccagga aaggcccgcc tgatgcccga ggaaacccaa gtccacgtgg 617401 tgcgccacgg tgaggtgcac aaccctaccg gcatcctgta cgggcggctg cccggattcc 617461 acctgtccgc aaccggcgcg gcgcaggccg ccgccgtcgc cgacgcgctg gccgaccgcg 617521 acatcgtcgc ggtaatcgca tcgcccttgc agcgtgccca ggagaccgcc gcgcccatcg 617581 ccgcccggca tgaccttgcg gtggagacag acccggatct gatcgaatcg gccaacttct 617641 tcgagggccg ccgcgtcggc cccggtgacg gggcatggcg cgacccgcgg gtgtggtggc 617701 agctgcgtaa cccgttcacc ccgtcgtggg gtgagcctta cgtggatatc gctgcccgaa 617761 tgacgaccgc ggtggacaag gcacgtgtcc gcggcgccgg ccatgaggtg gtgtgcgtca 617821 gccatcagct gccggtgtgg acgctgcggc tgtatctgac cggtaagcgc ctctggcacg 617881 atccgcgccg tcgggactgc gcactggcct cggtgacgtc gttgatctac gacggcgacc 617941 gcctggttga cgtggtgtat tcgcagccgg cggcgctttg accgcgccgg cgacgatgca 618001 gagcagagcg accagaagga gcggcgcttt gaccatgcgc cggctggtga tcgccgcagc 618061 ggtatcggca ttgctgctca ccggctgttc cgggcgcgac gccgtcgccc aaggcggcac 618121 gttcgaattc gtctcgcccg gcggaaagac cgacatcttc tacgatccgc ctgccagccg 618181 cggccgcccg ggcccactgt ctgggccgga gctggcggat ccggcgcgca gtgtgtcgct 618241 ggacgacttc cctgggcagg tcgtcgtcgt caacgtgtgg gggcaatggt gtgggccgtg 618301 ccgggccgag gtcagccaac tacagcgggt gtatgacgcc acccgaggtg cgggtgtgtc 618361 gttcctcggg atcgacgtgc gcgacaacaa ccgccaggcg ccccaggact tcatcaacga 618421 ccggcatgtg acgtacccgt cgatctatga cccggcgatg cgcaccttga tcgcattcgg 618481 tggcaaatac cccaccagcg tcattccgtc cacgctggtg ctggaccgtc agcaccgggt 618541 cgcggcggtg tttctgcgcg aattgctggc tgcggacctg cagccggtgg tcgagcgggt 618601 ggccgaggag gagccgtcgg gtcgggctcc ggtgggggcg caatgaccgg gttcaccgag 618661 attgccgcgg tggggccact gctggtggcg gtgggggtat gtctgctggc tggtctggtg 618721 tcgttcgcct caccatgtgt ggtgccgctg gtgcccggct acctgtcgta tctggcggcc 618781 gtcgttgggg tggacgagca gctgccggcc ggcgtcgtca aacccccggt ggctgcccgc 618841 tggcgggtcg ccggatcggc ggcgctgttc gtggcggggt tcacgacggt gttcgtgctg 618901 ggcaccgtcg ccgtcttggg catgaccacc acgctgatca cgaatcagct gctgctgcag 618961 cgggtcggag gcgtgctgat cgtcgtcatg ggcctggtgt tcgtggggtt catcggagcc 619021 ctgcagcgcc aggcgaggtt cacgccgcgc cagttgacga gcgtagcggg ggcgccggtg 619081 cttggcgcgg tgttcgcgct cggctggaca ccgtgcctgg ggccgacgct gaccggggtg 619141 atcaccgttg cctcggccac cgagggtgcc agcgtggcgc gtgggatcgt gctggtgatt 619201 gcctattgcc tggggctggg gattccgttc gtgcttttgg cgttcggttc ggcgtgggcg 619261 gtggcgggcc tgggctggct gcgccggcac accagggcca tccagatctt cggcggggcg 619321 ctgctgatcg cggtcggtgc cgcgctggtc accggggtgt ggaacgacgt cgtgtcgtgg 619381 ctgcgcgacg ccttcgtttc cgacgtgagg ttgccgattt gagtgggcag ggtgccgcgc 619441 aaaaggcgcg caacatgtgg cggtcgttga cgtcgatggg caccgcgctg gtgctgctgt 619501 ttttgctcgc gctggctgcc atacccgggg ccctgctgcc gcagcgtggc ctcaacgccg 619561 ccaaggtgga cgactacctg gccgcgcacc cactcatcgg tccgtggctg gacgagctgc 619621 aggccttcga cgtgttctcc agcttctggt tcaccgccat ctacgtgctg ctgttcgtgt 619681 ccctcgtcgg ctgtctggcc ccgcggacga tcgagcacgc ccgcagcctg cgggctacac 619741 cggtcgccgc cccgcgcaac ctggcccggc tgcccaagca cgcccacgcc cggctggccg 619801 gcgagcccgc cgccctggcc gccaccatca cgggccggct gcgcggctgg cgcagcatca 619861 cccggcaaca aggcgacagc gtggaagtct ccgccgagaa gggctacctg cgcgagttcg 619921 gcaacctggt gttccacttc gcgctgctgg gtctgctggt ggcggtggcc gtcggcaagc 619981 tgttcggcta cgagggcaac gtgatcgtga tagccgacgg cggacccggt ttttgttcgg 620041 cgtcgccggc cgcgttcgac tcgtttcgcg ccggcaacac cgtcgacggc acgtcgttgc 620101 acccgatctg tgtgcgggtc aacaacttcc aagcgcacta cctgccgtcc gggcaggcca 620161 cctcgttcgc cgccgacatc gactatcagg ccgacccggc cactgctgac ctgatcgcca 620221 acagctggcg gccctaccgg ctgcaggtca atcacccgct gcgggtcggc ggcgaccggg 620281 tgtacctgca gggccacggc tatgcgccca ccttcaccgt gacgttcccg gacgggcaga 620341 cccgcacgtc gaccgtgcag tggcgacccg acaacccgca gaccctgctg tcggcgggcg 620401 tcgtgcgcat cgacccgccg gccggcagct accccaaccc cgacgagcgt cgcaaacacc 620461 agatcgccat ccagggcctg ctggctccca ccgagcagct cgacggcacc ctgctgtcgt 620521 cgcgtttccc cgcgctcaat gccccggcgg tggccatcga catctaccgc ggcgacaccg 620581 gcctggacag cgggcggccc cagtcgttgt tcaccctgga ccaccggctg atcgagcagg 620641 gccggctggt caaggaaaag cgggtcaacc tgcgcgccgg tcagcaagtc cgcatcgacc 620701 aaggcccggc ggccggcacg gtggtccggt tcgacggcgc ggtgccgttc gtcaacctgc 620761 aggtctccca cgaccccggc cagtcctggg tgctggtctt cgcaatcacg atgatggcgg 620821 gactgctggt gtcgctgctg gtgcgcaggc gccgggtgtg ggcgcggatc acgccgacga 620881 ccgcgggtac ggtaaacgtc gagctgggcg gcctgacgcg caccgacaac tccgggtggg 620941 gcgccgagtt cgagcggctg accgggcggt tgctggcggg ttttgaggcg cggtccccgg 621001 acatggccga agcggccgca gggaccggaa gggacgtcga ttgaacacgc tgcacgtcaa 621061 cgtcggcctg gcccgctact ccgactgggc gttcacctcg gccgtggtgg cgctggtggt 621121 cgcgctgctg ctgctggcgt tcgagttcgc ccaggttcgc ggtcgcggac tcgcgccgct 621181 ggccgtgccg gccggatcgg tggccaccga tagcgctacc cctgggatcg tggcggacca 621241 acggcaccgg ccgttcgacg aacgcgtcgg gcggggcggg ctggccgtcg cctatctggg 621301 catcgggcta ctgctggcgt gcgtcgtgct gcgcggcctg gccacccagc gggtgccgtg 621361 gggcaacatg tacgagttca tcaacctgac ctgcttgtcc gggctcatcg ccggcgcggt 621421 cgtgctgcgc cgtgcgcgat accggccgct gtgggtcttc ctgctggtcc cggtgctgat 621481 cctgctcacc gtgtccggac gctggctcta cgccaatgcc gccccggtga tgccggcact 621541 gcagtcctac tggctgccca ttcatgtgtc ggtggtcagc ctcggttctg gggtattcct 621601 ggtcgccggt gtcgccagca tcctgttcct tgtgcgcaca tcgcggctgg gtgagccaac 621661 cggtgaaggc gcgctggcgg gtatggtgcg gcggctcccc gatgcccaaa ccctggacgg 621721 aatcgcctac cggaccacga tcttcgcctt ccccgttttc ggcttcgggg tgatattcgg 621781 tgccatctgg gccgaggaag cctggggccg ctactggggc tgggacccca aggagacggt 621841 gtccttcgtc gcgtgggtgg tgtacgcggc gtacctgcac gcgcggtcaa cggcgggttg 621901 gcgggaccgc aaggccgcct ggatcaatgt cgccggcttc gtggccatgg tcttcaatct 621961 gttcttcgtt aacctggtga ccgtcggcct gcactcgtat gcgggcgtgg gctgaccgtt 622021 cgtctgcaac cgacccgagg accgcagcaa gggggagtgc tggtgaccga gcatccgagg 622081 acgggcgtgg gagcccccga tagcggcaac ggcggcacgg atcatccgac cgtgcagttg 622141 ccgcccgtgc catccgtggg ggcaccaccg gctgcggccg gtggtgaaac accgactagg 622201 tcagttgcgg gattccgcac ccagcggctc gacccgacgg cctacggcgc ctactacagc 622261 ggccccgatg agggcccggc cagcccggct gaaaggccgc cgtatcgtct cgagccggtg 622321 ccccatacgc cgtatccgga actggccacc accacgctgc tgaggccggt caagccgcca 622381 ccgtccgaag gctggcgtcg gttgctctat ctgctgtcgg gtcggctgat caacgccggg 622441 gaaggccctc gggccgcgca cctcaacgac ctggtcgctc aggtcaaccg cccgctgcgc 622501 ggctgctacc ggatcgcggt gttgtcgttg aaagggggtg tcggcaagac cacgatcacc 622561 gcgaccctgg gggccacctt tgccgacctg cgcggtgacc gggttgtcgc ggtcgacgcc 622621 aatcccgacc gcggcacact gagccaaaag gtcccgctcg agacgccggc cacggtgcgg 622681 cacctgctgc gcgacgccga cggcatcgag cgctacagcg acgttcgcgg ctacacatcg 622741 aagggaccca gcgggctgga agtgctggca tcggacagtg atccggcctc ctcggacgca 622801 ttcagcgccg acgactacac ccgcaccctg gacattctgg agcggttcta cggcctggtg 622861 ctcaccgact gcggtaccgg gttgctgcac tcggcgatgt cggcggttct gcctaggtcc 622921 gacgtactgg tcgtggtcag ctcggggtcc atcgacggcg cccgcagcgc cgcggcgacg 622981 ctggactggc tgcaggccca cggccacgac gaccaggtgc gcaactcgat cgccgtcgtc 623041 aacgcggtgc ggccgcgcgc gggcaaggtc gacgtgggca aggtcgtcga gcacttctcc 623101 aggcgttgcc gtgcggtgcg cgtggtgccg ttcgacccac acctcgaaga aggcgccgaa 623161 atcgcgctgg atcggttgcg gcgggagacc cgcgaagcgc tcaccgaact ggcagcggtg 623221 gtggccgctg gattccccgg cgacccgcgg cgctgcaaac cgagcttcac ctaggaacgg 623281 ttattgtccc cgtgccccaa ccgccgcagg aactctggat cgtcgtcggg cccgatgacg 623341 cgagtcttgg gccggttcat ctgtgcccgt gcagcgcgcc agccaaggta gatcagcgtc 623401 gccaaaatca gcacgaggag caggtagagc actcgacacc tccttggacc gaatataccc 623461 gcgccgtagg ctcaggctgt gtcagaagcg cctaacgaca agaccactcg gggtgttgtc 623521 gacatactgg tctatgcgac ggcgcggctg ctgctggtgg tggcggtcag cgcagcgatt 623581 ttcggggtcg cgcgactgat cgggttgacc gaattccccg ttgtcgtggc cacgctgttc 623641 gggctgatca tcgcgatgcc gttgggcatt tgggtgttca gcccgctgcg gcggcgcgcc 623701 acggccgcgc tcgcggtggc cggtgagcgt cggcgcgccg agcgggaacg gctgcgggcc 623761 cggctgcgtg gcgagtcgct acccgaagaa cagtgagcgc ggggcgcctg gtagtcggca 623821 ttgtgcacaa gtgggttggg cattcagcac agtgtttgcg ctgatcgtgg cgattcgcct 623881 cggccgcgat tggcggctcc taacgttggc tgcaccgggt gtgggttgcg ggaaggtgtg 623941 cgatgtctaa tttgctggta accccggagc tggtggcggc tgcggcggcg gatttggcgg 624001 gtattgggtc ggctatcggt gcggccaatg cggcggccgg ggccccgacg atggcgctgt 624061 tggccgccgg tgccgatgag gtgtcggcgg cggtggcggc cgtgttttcc tcctacgccc 624121 agcaatatca ggcgctgagc gctgcggcgg cggcgtttca cgaccagttc gtgcgggcgt 624181 tggccgcggg tgcgggtgcg tatgcgggcg ccgaggccgc caacgtggag cagcagttgc 624241 tgaacgcgat caatgcgccc accctcgcgt tgttggggcg gccgctgatc ggcaacggcg 624301 ccgacggggc ggccgggacc ggtcaggccg gcggggcagg cgggctgttg tacggcaacg 624361 gcggtaacgg cgggtcgggt gcggccgggc aggccggcgg ggccggcggc gccgccgggc 624421 tgatcggcca cggcgggacc ggcggggtcg ggggtaccgg tgcggccggc ggggccggcg 624481 ggaccggcgg gtggctgttc ggcaacggcg gggccggcgg gaccggcggg gccgtcaccg 624541 gggtcagcac caccggcggg ccgggcggtc acggcggtga cgccggcctg tacgggtttg 624601 gcggggccgg tggcgcgggt gggttcggcc agagcggggc ggccggcggg gccggcgggg 624661 ccggtggggc cggtgggtgg ttgtacggcg acggcggcga cggcggcgca ggcggcaacg 624721 gcggtaacga gtccggcacc ggcgtcagtg gcgttggggg tgtgggtggg gccggtggtg 624781 ctggtgggtt gttgttcggt aacggcggcg acggcggcgt cggcggcgac ggcggcgacg 624841 gcagcagcac ccaggattcc ggtggtgatg ggggtgcggg tggggccggt ggtgctggtg 624901 ggtggttgct tggtaatggg ggggccggcg gggccggcgg ggccgcctca atcaaggttg 624961 ccactggtgg tctgggtggt gatggtggcg atgccgggct gttcgggttt ggtggggacg 625021 gcggctgggg cggacgcgga gtggatgctc gattcggtgc ggctgggggt gccgctgggg 625081 ccggcggtgc gggcgggtgg ttgtacggcg atggcggcgc cggcggcgtc ggcggtgtcg 625141 gcggtgctgt cttcagcctt tcctccggtg acggcggggc cggcggggcc ggtggcggtg 625201 gtgggtggtt gttcggtaac ggcggcgacg gcggcgccgg tggcggcggc ggtggccgct 625261 tcggcagcgg cagcggtgcc ggtggtgatg gggctgtcgg tggggccggt ggtgcgggcg 625321 cgtggttcgg caacggtggc gccggcggcg tcggcggcgg cggtggccgc ggcaccaccg 625381 ccatcggtgg cgacgggggt gccggtgggg ccggtggtgc gggtgggtgg ttgtacggcg 625441 acggtggcgc cggcggtgcc ggcggcggtg gtggccgcgg cggcaccggc aacgatggtg 625501 gcgacggcgg ggacggcggc cgcggcggtg atgcccagct gcttggcaac ggcggtgacg 625561 gcggggccgg cggggccggc gggcccgccg ggtttggcgc ttcccccggg gccggcgcgg 625621 ccggcggggg cggcggtgcc ggcggttcgc tgttcggcag ccccggcacg accggcccgc 625681 acggctgatc cctggctagc gccgatcttc gcgcgctcaa cccttcggca ttcgcaccac 625741 ctgggcggca tagctcagac cggcgccgta gccgatcaac agggccagat cgccgggctt 625801 ggccgcgccg gtcgtcagta attcggccat cgcgagcgga atggaggccg ccgaggtgtt 625861 tccggtgtgc tcgatatcgt tggcgaccac cgcgtcgggc cgcaactgca ggttcttgac 625921 cagcagctcg ttgatgcggc tattggcctg atgagggacg aacacgtcta tctggtcggg 625981 tcgcaccccg gcggcgtcca tcgcgcgccg accgacgtcg cccattttga acgctgccca 626041 acggaagacc gcgggacctt cgagccgcac aaacgggcgt gggccgctgg gattctgggc 626101 gaaagtgatc cagtcgatgt cctgccgtat ggcatcggcc tgttcgccgt cgctacccgc 626161 cacggttggt ccaatgcctt gaaacggtgt ctcgcccacc accactgcgg ccgcgccgtc 626221 ggcgaagatg aagcagttgc cgcggtcgta catgtctatc gtgggggaca gtttttccgt 626281 gccgaccacc agcatcgtgg ccgcacctcc gccccggatc atgtcggccg ctgcgccaag 626341 cgcatatccg aatccggcgc accccgccga aagatcgaac ccgagtatgc ccttggcgcc 626401 cagcgacgcc gcgaccattg gggcggccgg cggggtttgc aggaaatggg tgttggtggt 626461 gacgatcacg ccatcgatgt cggccgccga caggccggcg ttcgacagtg cccgtcgaca 626521 ggcctcagtc gccatggaag ccgccgactc gtcgtcggcg gcgaatcggc gggtcttgat 626581 gccggttcgg gtgtagatcc actcgtcgga cgagtcgatg tgctggcata tctcgtcgtt 626641 ggtgaccacg cgttcgggcc ggtacgcccc gacactgagc agcccgacgc tcctggcgcc 626701 gctggtcgtg gcgatctccg tcatacccgt cctatctgtt ctcgtcgagt gtgcacctac 626761 ggcgacgaca cgccgacgga gcccgccctg agtgcacgtt cgaagttagc tcaactgacc 626821 aaacgccaat gcccccgcca ccgccaacgc ccacaccagc atggccagcc cagtgtcacg 626881 cagtaccggg atcagctcgc gcccgccgcg cccggatcgc accggtccgg cagcgcgcag 626941 cgccaaaggc gcggccacca agcccaccac acaccacggc gtggccagca ttagcacgaa 627001 cgtcagcacc ccggcgaccg ccagcaggcc ctggtaaagc atccgggtcc gggcgtctcc 627061 cagccgcacc gccagcgtga tcttgtcggc ccgcgcgtcg gtggggatgt cgcgcaggtt 627121 gttggccacc agcaccgagc acgacaacgc acccgttgct accgcctgtg ccagccccac 627181 ccagtccacc cgcaatgcct gcgtgtactg ggtaccgagc acggcgaccg gcccgaagaa 627241 cacaaacacc gccagttcgc cgaagcccgc atagccgtag ggttttgacc cgccggtgta 627301 gagccaggcc ccggcgatgc agatcgcacc caccgcaatc agccacggcg cgctgagcag 627361 cgccaaaacc agcccggcca gcgcaccgag cgccaggctc gtcatggcag cggtcagcac 627421 cgagcgcggg gtcgccagcc gcgagcccac caaccgcacc ggacccaccc tgtcgtcatc 627481 ggtgccgcgg atgccgtcgg agtagtcatt ggcgtaattg accccaatga ccagcgccac 627541 cgcaacagcc agtgccaaca gcgctttcca ccacacggcc gcgtgcagcc aggccgcggc 627601 gccggtgccg gcaaccactg gcgcgatcgc gttcggcagc gttcggggcc gcgcgccgga 627661 gacccactgt gcgaaactgg ccaccagggc atcctgccct atgcacaaca atgggcgcat 627721 gctcggagtg atcggcggca gcggcttcta caccttcttt gggtcggaca cccgcacagt 627781 caattcggac accccctacg gtcaacccag cgccccgatc acgatcggca ccatcggggt 627841 gcacgacgtc gcgttcttgc cccgccacgg cgcccatcac cagtactcgg cgcacgccgt 627901 gccgtatcgg gccaacatgt gggcgctgcg cgcgcttggt gtgcggcggg tcttcgggcc 627961 gtgtgcggtc ggcagcctgg accctgaact cgagcccggc gcggtcgtgg tgcccgatca 628021 gctggtcgac cgcaccagcg gccgcgccga cacctatttc gacttcggcg gtgtccatgc 628081 cgccttcgcc gatccgtact gccccacgct gcgggccgcg gtgaccggcc tgcccggtgt 628141 tgtcgacggc ggcaccatgg tggtgatcca gggtccgcgg ttttccaccc gcgcggaaag 628201 ccagtggttc gccgctgccg ggtgcaatct ggtcaacatg accggctatc ccgaggcggt 628261 gctggctcgc gaactcgaat tatgctacgc agcaatcgct ttggtgacag atgtggatgc 628321 cggcgtcgct gctggcgatg gcgtgaaagc cgccgacgtg ttcgccgcat tcggggagaa 628381 catcgaactg ctcaaaaggc tggtgcgggc cgccatcgat cgggtcgccg acgagcgcac 628441 gtgcacgcac tgtcaacacc acgccggtgt tccgttgccg ttcgagctgc catgagggtg 628501 ctgctgaccg gcgcggccgg cttcatcggg tcgcgcgtgg atgcggcgtt acgggctgcg 628561 ggtcacgacg tggtgggcgt cgacgcgctg ctgcccgccg cgcacgggcc aaacccggtg 628621 ctgccaccgg gctgccagcg ggtcgacgtg cgcgacgcca gcgcgctggc cccgttgttg 628681 gccggtgtcg atctggtgtg tcaccaggcc gccatggtgg gtgccggcgt caacgccgcc 628741 gacgcacccg cctatggcgg ccacaacgat ttcgccacca cggtgctgct ggcgcagatg 628801 ttcgccgccg gggtccgccg tttggtgctg gcgtcgtcga tggtggttta cgggcagggg 628861 cgctatgact gtccccagca tggaccggtc gacccgctgc cgcggcggcg agccgacctg 628921 gacaatgggg tcttcgagca ccgttgcccg gggtgcggcg agccagtcat ctggcaattg 628981 gtcgacgagg atgccccgtt gcgcccgcgc agcctgtacg cggccagcaa gaccgcgcag 629041 gagcactacg cgctggcgtg gtcggaagcg agtggcggtt cggtggtggc gttgcgctac 629101 cacaacgtct acggccccgg catgccgcgc gacaccccct actccggagt ggccgcgatc 629161 ttccgctcgg cggttgaaaa aggcaagcca ccaaaggttt tcgaagacgg cggccagatg 629221 cgggacttcg tgcacgtgga cgacgtggcc gcggcgaacc tcgccgcggt gcatctgggt 629281 gaagcggacc gcgacgggtt taccgcggtc aacgtctgtt ccgggcgccc catctcgatc 629341 cttcaggtgg caaccgcgat atgcgacgcc cgcggtggct cgatgtcccc ggccatcacc 629401 gggcactacc gcagcggcga cgtgcgccac attgtcgccg atcccgcgcg ggccgcccgc 629461 gtgctcgggt tccgcgcggc cgtcgatcca ggcgaaggac tgcgtgagtt cgcgttcgcg 629521 ccgcttcgct gaccgctcga gctacgacga gtggtccggc ggccggtaga tcttcggccg 629581 cactgggtgc gtcgacccag ctgacctgaa aatccggggg gatccagcag gccgggacag 629641 cgccggggtg tgcgggggtt gcggcagctg gcgcagcctg ccgatgacga tggccgccgc 629701 gaggatgctg agcgccaggc cgcacagcac cacatcgaag gtgctggtga tctgctgggc 629761 aagatcgttt ccgtagacgg ggtgagccga cccgcccgaa gatgcctgga atcgcggagc 629821 caggacgacg ccgacgatga cgcgcgaaac ggtgaatgtg catagcaatc cgcacagcga 629881 cgagatcagc ctcgcgggca gggcctcggt cgccatgttg aagcgaagcc agatcgctag 629941 gaccgcggtg gccagatcga acgcagccag cgccatgctg tcgtagcgcg agaacggata 630001 attcaacagc accgcgacga cgaggtcggt caaccgcatc gctaccgagc ccaggcacca 630061 cacggcgatg agcagcaagc cgtttcggga cgctgcccgc cacacggttt tgtcgatcgg 630121 tggcgcgttc ggggacctga acagggtcag cggcgcacac attgccgcgg ccgcggccca 630181 caccagataa ccctcgtatc ccacgccggc cgtcgacgtg ttttgggcaa tgccgtggaa 630241 cgcgtcgatc tcgcggccga tgggaaggct ccacacgatg gacccggcga tcagggtgga 630301 gccacccagc gccacggttg agagcgcttc ggccgctgta ggccgaagca gccaccggga 630361 cgcgaccagc acggcggcca gggctaccac gccgtacacc accgccgtgt cgatgaccgc 630421 gaggttctgt ttgccaaaac cggacgcgcc ggccgcgggc tccaacgcgt acctgacccg 630481 ccagctcagg ttgaaaccgg tgctgagggc cgcacccaac atagacgcgt agccgaggaa 630541 ctgggtggcc cgcagccacc tgctgtggct gccctcatcg gtggtagcgc cggttagcgc 630601 cggttgcgcg ctcaacagcg cgccggtgat ccccagccat ccccccggcc cgacaccacc 630661 gggcacgtgg acggtgccgc cgagtcgaat cgtctggatc gcgtcgaaca ccacgaaggc 630721 cagcaccagc agcagatagg ggacgttgag gcccaggcga agctgtgagc gcctcccggc 630781 gaaggtcacg gcaagcgatg ccaaagagag cgatgtcacc gccagcagca acccgaacac 630841 ggtcttgctg ctgtccggga ttcggaaacc gaaatacagg ttccatggga aaaacagcgc 630901 accgatgagc agggcaccag cggccaagtc gcggacgacc tcgcgtcgtc gggtgtcgtc 630961 gctgctcagg cccacgatgc cccccgggaa tcaagaacgg ttggcgccga gtcggtcctg 631021 tggtggcgtg ggtgcacccg gccgggccga ctgcgttgct cgcttgcgaa catagtctcc 631081 gttccgacga cgcggcagtg gcgcagaaca cgcggttggg cggatctcgt ttgcccggtg 631141 accgtcccgc tgtttgcgaa cccggttacg ctgcggtcat aggcgaacgc tgtcgccgaa 631201 ttaccgatac tgccgacggt atcgcagtgt aacgatgccg ggacattgct ggttgtgggg 631261 tggccagccg aaggagagcc gcgatggacg tcgctttggg ggttgcggtc acggatcggg 631321 tcgcgcgtct ggcgctggtc gactcggctg cgcccggcac cgtgatcgac cagttcgtgc 631381 tcgatgtggc cgagcacccg gtcgaggtgt taaccgagac cgtggtgggc acggatcggt 631441 cattggccgg cgaaaaccac cggctggtcg ctacccggct gtgttggccg gatcaggcca 631501 aagctgacga gctgcagcac gcactgcagg actccggggt ccacgacgtt gccgtgatat 631561 ccgaggcgca ggccgccacg gcgctggtcg gggcggcaca tgccggctct gccgtgctgt 631621 tggtgggtga tgagacggca accttatcgg tggttggtga cccggacgcg ccgccgacga 631681 tggtggccgt cgcgccggtg gcgggcgccg acgccacatc gaccgtcgat accctgatgg 631741 cccggctcgg cgaccaggcc ctcgccccgg gggatgtctt cctggtgggt aggtccgccg 631801 agcacaccac ggttcttgcc gaccagctgc gcgcggcgtc gacgatgcgc gtgcagactc 631861 ccgacgaccc cacgttcgcg ctggcccgtg gcgcggcgat ggcggccggc gccgctacga 631921 tggcgcaccc tgccctggtc gcggatgcga ccacttcgct ccccccggcc gaggcggggc 631981 aatcgggttc tgaaggcgag cagctggcgt actcgcaggc cagcgattac gagctgcttc 632041 cggtcgacga atatgaggaa cacgacgaat acggggcagc cgcggatcgc tcggcgccgt 632101 tgagccgacg gtcgctgctg atcggcaacg ctgtcgtggc ctttgcggtg atcggtttcg 632161 cctcgctggc ggtggcggtg gcggtcacca tccgaccgac cgcggcctca aaaccggtag 632221 agggacacca aaacgcccag ccagggaagt tcatgccgtt gttgccgacg caacagcagg 632281 cgccggtccc gccgcctccg cccgatgatc ccaccgctgg attccagggc ggcaccattc 632341 cggctgtaca gaacgtggtg ccgcggccgg gtacctcacc cggggtgggt gggacgccgg 632401 cttcgcctgc gccggaagcg ccggccgtgc ccggtgttgt gcctgccccg gtgccaatcc 632461 cggtcccgat catcattccc ccgttcccgg gttggcagcc tggaatgccg accatcccca 632521 ccgcaccgcc gacgacgccg gtgaccacgt cggcgacgac gccgccgacc acgccgccga 632581 ccacgccggt gaccacgccg ccaacgacgc cgccgaccac gccggtgacc acgccgccaa 632641 cgacgccgcc gaccacgccg gtgaccacgc caccaacgac cgtcgccccg acgaccgtcg 632701 ccccgacgac ggtcgctccg accaccgtcg ccccgaccac ggtcgctcca gccaccgcca 632761 cgccgacgac cgtcgctccg cagccgacgc agcagcccac gcaacaacca acccaacaga 632821 tgccaaccca gcagcagacc gtggccccgc agacggtggc gccggctccg cagccgccgt 632881 ccggtggccg caacggcagc ggcgggggcg acttattcgg cgggttctga tcacggtcgc 632941 ggcttcacta cggtcggagg acatggccgg tgatgcggtg acggtggtgc tgccctgtct 633001 caacgaggag gagtcactcc cggcggtgct ggccgcgatc ccggccggct atcgggcgct 633061 agtggtggac aacaacagca ccgatgacac cgcgacggtg gccgcccgcc acggtgccca 633121 ggtggttgtc gagccgcggc ccggatacgg ctcggcggtg catgccggtg tgctcgccgc 633181 gaccaccccc atcgtagcgg tcatcgacgc cgacggctcg atggatgccg gcgacttgcc 633241 caagctggtc gccgaactcg acaagggcgc cgacctggtg accggtcggc ggcggccggt 633301 ggcgggcctg cactggccat gggtcgcccg ggtgggcacc gtggtgatga gctggcggct 633361 gcgcacccgc caccgcctgc cggtgcacga catcgcgccc atgcgggtcg cccggcgaga 633421 ggccctgctg gatctgggcg ttgtcgatcg acgctcgggt tacccgctgg agctgctggt 633481 ccgggccgct gcggcgggct ggcgtgtcgt cgaactcgac gtcagttacg gtccccggac 633541 cggcggcaaa tccaaggtca gcggttcgct gcggggcagc atcatcgcga tcctggactt 633601 ctggaaggtg atctcgtgag ctgcctgccg gtcagcgtgc tggtggtcgc taaagcgccg 633661 gagccgggcc gggtcaagac ccggctggcc gcggcgattg gcgataaggt cgccgccgac 633721 atcgccgcgg ccgcactgct ggacaccctg gatgcggtgg ccgctgcgcc ggtcaccgcc 633781 cgggcggtgg cgcttaccgg cgacctggac tccgcggccg attccgcgga gatccgccga 633841 cggcttaagt ccttcacggt atttcggcag cgcggtgacg ccttcgccga ccggctcgcc 633901 aacgcacacg tcgacgcggc cgacggctat ccggtgctgc agatcgggat ggacacgccc 633961 caggtgaccg ccgagctgtt ggccgattgt gcacgcctgc tgcttcaaat ccccgcggtg 634021 ctcggcctgg cgttcgacgg cggttggtgg gtgctgggga tacgcacgcc tactgcggcc 634081 gagtgcctgc gcgccgtccc gatgtcacag ccagacaccg gcgagctcac cttgaaggcg 634141 ttgcgcgaca acggcattga tgtgacgcta gtgcagcgtc tgggcgactt cgacatcgtg 634201 gacgacatcg cgctggtacg cgattgctgc gctccgggga gtcggttcgc gcaggctacc 634261 cgcgcggctg gactctgagg ccgcgccggc gcatttgctt accagttggt gaagatgatg 634321 ctgttcagca gtagggcccc ggcggcgttg accgccagcc agagtcggtg cgagcggggc 634381 ggcagcagtg cgggcgccgc ggtcagccaa atggtgaagg gcagccagat tcgttcggtc 634441 tcggctttgc tcagcatgct caggtcggcc aaggcgatgg cggccagcac cgccagcagc 634501 agcagatggc agccggatcg acgactgatc gcggcccggt cgaatacccg gctgagacct 634561 gcgacgctgc ctaacccgat agcgcagacc acgcacgcca agtttgccca ggaccaatag 634621 ccgaacggcc gatctttggc gatcccctgc caatagcgtt gctggacaag ggtataaccg 634681 tcgaaccagg agaatccggc aaccgcgaag ctcaccgcga ccaccagcgc cgccagcacg 634741 gccggcccca gtgcccgcag gacgggccgc caatctgcgg cggccaacac cgccatcccc 634801 ggcagcacga tcagcacgag cccgtagttg agaaagacac cccagccgag cagtagcccc 634861 gctccggccg ccaccagcgc cgggaagcga gtggcaccat gcaccgccac cgccaacagg 634921 gcgatacccc acgccgccac accggcgaaa tacccgtcgg ccgaaaccgc gacccagatc 634981 gccgtcggcg ccaccgcgac gaatggtgcc gtccgccgcg ccatctgctc actggccagc 635041 acccgcacgg cgatcagcac cgccgccgcc gcgctggatc ccaccagcag gcacaccagc 635101 cccgcccaac cgccaccgcg cagcccgatc cgatccagcc agacaaacgt cagcagcgca 635161 cccggcgggt gcccggagac gtgagtcacc caggaattgg gctggaagtc gagaatccgg 635221 ctggtgaacg tccgcaacgt cgccgggatg tcggcaatgc cgggcacctg ccacaggtac 635281 tcgtcacggg tggtcaatcg gccggcaaag ccgcgctgcc agccgtcgat catcgccagt 635341 gagaacgccc aggcggcggc ggtggcccag gtgctcagcg tcagcacccg ccaggggagc 635401 cggtgcgcca ctaccggccc ccacgccaca acggccactg cggtaagaac cgccggggcc 635461 gtgccccagc caacatgggc gtcccagtag ccgaagatcg gcgcggcgcc ggcgcgcgtg 635521 gcaaaccgct ccaagccgat atcggatcgc ggtttgattc ccaggttcaa ccgcggcagt 635581 acgaacgcgg cgccgaccag gacaaacccg atcgcgacgg ccaatccctc gcggcgaccg 635641 atcctcacga ccgatcagcc tattgatcgg cttcaccggc gaaccggcgc accaacgctg 635701 cccggtccac cttgccgatg ccgcgtcgcg gtagcacgtt cacgacatgt agctctcggg 635761 gcgcggcggt gacgtccagg gtgcgcgcga catgcgcccg cagcgcttct agcgttggtg 635821 gtgggcatcc gtcgccgacc acaatcgcgg cgaccactcg ctgaccgagt cggtcgtcgg 635881 caagtccaaa aaccgcgcag tcacgcaccg cagggtgggt gcccagtgcg gcctccactg 635941 gctgcggcag cacggtgaat ccgcccgtgc tgatcgcttc gtcggctcgg cccagcacgg 636001 tcagcacacc cgaatcaccc gattcaaggg cgccaaggtc gtcggtgtga aaccagcctg 636061 gctcggcgaa cggatcgggc gagaccgggt tgcgatagcc cttggccagg gtcgcaccgc 636121 cgatagctat gcggccgccg gccagcaccc tcagccggac cccgtcgagc ggaacgccgt 636181 cgtagacaca gccgcccgag gtctcgctca tgccgtaggt gcgcaccacc gtgatgccgg 636241 cggcggccgc ggcgtccagg atgggccggg gggccggccc gccgccgatc agcaccgcgt 636301 ccaattcggc cagcgcggcc gtggccgccg ggtcggtaag tgccttggcc aactgtgcgg 636361 cgaccagcga cgtgtatcgc cggccagaac ccaatctctt tatcgcgttg ggtaattcgg 636421 tgacatcgaa tcccgcggag acgttcagtt cgacaggaac tgatccggcg atcacgctgc 636481 gcaccagcac cgccagcccg gcgatgtgat acggcggcac agccaacagc cagctgcccg 636541 gtccgccgag ccggtcgtgg gcggccgagg cgctggcggt caaggccgcc gcggtcaaca 636601 tggcgccctt gggcggtccc gtggttcctg acgtcgtcac taccagggcg acgtcgtcgt 636661 caatctgctc gcccactcgc aaagcgccca gcaaggactc atgctgggtg ggcaccgcga 636721 ccaatgccgg gtcgctgcca cccagcactc gttgcagggc aggcagcagc agcgcggtag 636781 cagaaccggc cgggacgtgc agcgcacgca ggatggctat gcgtgctcct cgcggtcgcg 636841 gacgtcatcg agtggccatc cctgggcggc caaccgcgcc cggacgcgct caacatcctc 636901 cggtgacggc aactcgtcgg tgaaatgggt gatcaccacg ccgatgtcga tctggtcgaa 636961 gtcaccgagt cgcatcagtt cgttagccac cgccttgacc tcatcgtggc tcagccggcg 637021 gcaaagcagg gcgagcaccg caaaggagtc ggtcggcgga atgccctcgg gatatcccgc 637081 gcgcaaccac gcgacgatcg aggtgagaaa ccggttcacg ctgttatatc ttcccgtcgg 637141 ggccgtcgcc aaaccctatg tcgcggccat ctgcgactct acttgggtgt ggcgcccagg 637201 aaggcccagc cggtgtgatg cgcgatgaag tcacgggcga tatacagcac tccaatgatg 637261 actaccgcaa gcaccagcgc gaagatcgcc caactgacgg caaccagcag cgggcgccgg 637321 cgcgcggtgg cgtcggcacc gtcgccggcg gcctgcagcc gcacgcccac cgcgaacagg 637381 cccggtagca gcgcgccggc tagcagactg aagatcagga tcttcagggt ggccgtgtag 637441 ttgaaccagg cactcacggg gcgttcctcg tggtgacgcc gaactgtggt gctcggtggt 637501 tcggtggggg agtcggagcc ggcggtgcag gcaccggcgg cctctgatcc gcaagcggct 637561 gcgcacccgc ttccaggccg gccgtcaggt tgccttccca gtcggcgttg acgttggtgt 637621 ggtcgatcgg cgccctgcgc gaccgcagcc agatggcggt ggcggtcagc cacaacagtg 637681 cgaaaccgag gatcgcaccg gggtagccac cgatgaaatg caccagcccg taggtgaagg 637741 ccccgaccag cccggccaac gggagcgtca ccagccacgc gaccaccatg cggccggcta 637801 ccccccagcg cacctcggcg ccgggcttgc cgacgccgct gcccagcacg gacccggtcg 637861 cgacctgcgt tgtggacagc gcatagccga agtgcgcgga caacagaatg acggcggccg 637921 atgacgactc ggcggccata ccctgcggtg gtttgatctc gaccagccct ttgcctaggg 637981 tgcggatgat gcgccagcca cccaggtagg taccggcggc catggccacg gcgcaactca 638041 cgatcaccca cagcggcggc accgatgccg tcgtgctgac cgcgccgtag gacatcaacg 638101 ccaggaagat cacgcccatc gtcttctgcg cgtcgttggt gccgtgcgcc agcgagacca 638161 gcgacgccga gccgatctgg ccgcgccgga aaccgcgttc cgtacgcttt tcggcaaccc 638221 cgcgcgtcgt ccggtagacc agccaggtgc cgactgctcc gaccagcgtg gccagcagcg 638281 cggctaccac ggccggcacg atcaccttgg acaccactcc gctccagatc accccacgca 638341 ggccgacggc ggcaattgtg gcgccgacga tgccgccgat cagcgcatgt gaggaactcg 638401 acggaatgcc cagcaaccag gtcaacaggt tccagacgat cccgccgacc aggccggcga 638461 acaccaactc cagcgtcacc agattcgcgt cgatcagacc cttggcgatt gtggccgcca 638521 cggcggtgga caaaaacgca ccgatcaggt tcagcacggc agaaagtgct accgctaccc 638581 gcggtgccag ggcgccgctg gcaatcgagg tcgccatggc gtttccggtg tcgtggaacc 638641 cgttggtgaa gtcgaacgcc aatgccgtca cgacgacaat gagcaaaagg aacaactgaa 638701 ggttcacagg gcctgattct gctggtcggg atattgcgtt gtcgatcaaa cgagtacgcg 638761 aaatgcgggt gtatctcgac tcgtcgtcag atgttaccaa tcacgtaacc cagcgttttg 638821 cggagttcac gcccgggtgt ctgtacgcag cgggtgaccc tcgggaacct cgacgaatat 638881 cagtgtgatc ccgtctgggt cggtcacatg catctcgtgc aggccccacg gttcgcggcg 638941 gggctcgcga gcgatcgaca cgcctcggct gaccagctcg gtctgggtag cctcgaggtc 639001 gcgcacctgc agccacagcg cgccgggaaa aggtccccgc gaatggtccg gctcgccgta 639061 accggccagt tcgagcagtg actgaccggc gaaaaacact gtgccggccc cgtattcacg 639121 ggcaatcgcc agcccgatct ggtcacggta gaagctcagc gaccgctgat agtccgccgg 639181 ccgaagtagc atccggctgg ccaggatttc catggccctg tgtctatcac gtagcggcac 639241 gccggcggcc gagggtcggc aggccgggac ccggttcaag ggttgagctg ttcgttgcgg 639301 cgctgcatga gtgcattgac ccagcgggga ccgatgctgt ccagcgcgtt gacggcgacg 639361 gcaacccgag gcgcgatccg caccggtcgg gtgcgggcgg cggtgaccat ccactcggcg 639421 gcttccgcgg cggtcagcgc cggcagcccg tcgtaggcct tcgtcggcgc aatcatcgga 639481 gttgccacca gcgggtagta cagcgtcgtc gaatgcacgc cctgactacc ccactcggtt 639541 tcgatgatcc ggctcaccgc cgacagtgcg gccttcgatg cgttgtacac cgagaacagc 639601 ggcgaagcct ccgacaacac gccccaggtg gcgacattga tgatatggcc gtcgccacgc 639661 tcgagcatcc cgggtgcaag cccgcggata agccgcagcg gggcatagta gttgagcacc 639721 atggtgcgct cgacgtcgtg ccagcgttcc agcgactcgg ccagcggccg ccggatcgac 639781 cggccggcat tgttgatcag gatgtcgatc ccgccgatgc gcttttcgac gtcttcgacc 639841 agcgcgtcga tcgcttccat gtccgagagg tcgcagggga gcgacatcgc cgtgccgccg 639901 tcgccggtga tccggtccgc caccgcatcc agcagatcct tacggcgcgc gacggcaacc 639961 acgacggcgc ggtgcagtcc gaactgtttg gtcgcggccg caccgatgcc tgacgacgcg 640021 ccggtgagca ggatgcgctt gccggtgagg tcgacgggtt gcatcgcggg ccggttgatc 640081 agcagttgcg gcgaaattgg tggccgcatg ccggccaatg tgatttgttc agtcaaccag 640141 cgcagcggtc ttttgctcac agctggggag tctagttttg ccgagcctgt agttactgtg 640201 gtgtcccact cgtcgggctt ctgctcggca actacagcct cggcgaacgg ccgcgttaga 640261 aatagcgcgg aaacgggctc cagtcggggg gacgcttctg taggaaggcg tcgcggcctt 640321 cgacggcctc gtcggtcatg taggccaggc gggtggcctc accggcaaac agttgctgac 640381 ccaccagccc gtcgtcgagc aggttgaacg cgaacttcag catccgttgc gcctgaggcg 640441 atttcgcgtt gatctcggcc gcccactgca gccccactgt ctccagctcg gcgtgttcgg 640501 ccaccgcgtt gaccgcgccc atctggtgca tctgctcggc ggtgtaggtg cggcctagga 640561 agaagatctc gcgggcaaac ttctggccga cctgacgggc cagatatgcg ctgccgtaac 640621 cgccatcgaa gctgccgacg tcggcgtcgg tctgcttgaa gcgggcgtac tcgcggctgg 640681 ccagggtgag atcgcagacc acgtgcaggc tgtgtccccc gccggccgcc cagccattga 640741 ccagacaaat gaccaccttg ggcatgaacc ggatcagccg ctgcacctcc aggatgtgca 640801 accggccggc gcgggcgaca tcaaccgtgt ccgcggtgtc tccgctggcg tactggtaac 640861 cgctgcgccc acgaattcgt tggtcgccgc cggagcagaa cgcccagccg ccgtccttcg 640921 gggacggccc gttgccggtc agcagcacca ctccgacgtc gggcgacatt cgtgcatggt 640981 cgagcacccg gtacagctcg tcgacggtgt gcgggcgaaa tgcgttgcgc acttcagggc 641041 ggttgaacgc cacccgcacc gtggcatcgt cgacgtggcg gtggtaggtg atgtcggtca 641101 gatcgtcgaa cccgtccacg agccgccacg ccttggcatc aaaagggttg tcactcaagg 641161 ctgttgaact ccgtccttgt tcgccggctg gagccaccac ggcgatctga tccgttcacc 641221 catgcctgcc acagtaatca tggccgctgg gcgtcagccg gacggtatgg tgcccggggc 641281 cttggtcaca tgtggtcgtg agttggcgcc cgggcggctt tctgtggagg gtcaccgcgt 641341 actcgatcat ggcgctgctc gcagttcatc actgaaccgg cacagtgtcg ctcggcaatg 641401 cggtctactc gtggggcatg ttaagcgctc aacagggcgg cgcgcccacc tttctccaag 641461 ccccgctgga ctcagccgat ggcgtgagcc gagggccagg cgcgtgccaa tctttcgtcg 641521 gtggtcaaca acaccagacc tgccgtttcg gccagctcga cgtagagggc atcggtcagg 641581 cggagggtgt cgcggcgcga ccacgctccg caagcagcga cgaaagaccg tgtcgagtca 641641 ccggcacctg tcgcaactcc tccagtgccg catcgacata ggcaacggtg agtgcgccgg 641701 cgcgctgcat gcgccccagc gccgacaaca cctctgcatc gaagtgcgcc ggcgcgtgca 641761 tcgcggtccg agccagccgc gcgcgcaccg cagagcaccg atcgctagtg cgagccagta 641821 gatccaccat ggcactcgcg tcgacgacca cctgctccgg cggcgaagtg ggcgatgctc 641881 tcacgcttcg aactcatcgc gagcggcatc gatcgcaccc agcacgtcat catgccgagc 641941 gccggtgctt ctgggttcca acccctcaag ccacgcatcg gttgcggagt tctccaactc 642001 ggcactgatc gcggcctgag tcagcgccga gacgttcaag ccccgcgccc tggcgcgctc 642061 cgccaattcg tcgggcacat acacgttcaa ccgagccata cacaccaatg tacacacaac 642121 gatcgttttc gtgcgccggc tcaacaaggc cttcggcggg ttctttcgcc cgccgcagac 642181 cgcgaaaccc gctgtgaagg tgggttatcc cgagcatcgc cggcatatct gcacggcatc 642241 ggcggcgtca agtgcgccgg catcgccagg ccgaaggccg gggtgaagac taatccagat 642301 cagatgcgag ggaccagact tcatgcaacg gccaagccct agccgaccgc gcgcccagcg 642361 ccttcccaga accgtgcgcg cacggccttc ttgtccggct ttcctagacc ggtcaacggc 642421 aaagagtcga cgaccaccac ccgcttgggt gcctgcaccg atcccttgcg ttgtttgacc 642481 gctgcctgga tctcggcggt catggcctcg atcgcgggct catcgcgggc cgcgttggag 642541 cgcaacacca ccaccgcggt gacggcctcg ccccacttct catccggcgc gccaaccacg 642601 cacacctgag caaccgccgg atgctcggcc accacgtcct cgacctcccg ggggaacacg 642661 ttgaagccgc cggtgacgat catgtccttg acgcggtcga cgatgtagta gaagccatcg 642721 gagtcctcgc gggccaggtc gccggtgtgc agccagccgt ctttaaaagt ccgcgacgtc 642781 tcgtctggca gattccagta accgcccgcc aacagcggtc cgctgacaca gatttcgccg 642841 acttcgccct gcttcaccgg cttgccatgc tcgtctaaca gcgcgacgcg ggcgaacagc 642901 gtcggccgcc cacatgaggt cagccgcttc tcgtcgtgat cgcccttggc cagataggtg 642961 atcaccatgg gcgcctcgga ttgcccgtag tactgggcga agattgggcc gaaccgccgg 643021 atcgcctcgg ctagtcgcac cgggttgatc gccgaggcgc cgtagtagac ggtttccagc 643081 gacgacaggt cccgggtgtg cgaatccggg tggtccagca gcgcgtacag catcgatggc 643141 accaacatgg tcgctgtaat gcgttgctcc tcaatgattc tgagtacctc ggccgggtcg 643201 aacttcgcca gcactatcat ctcgccgccc ttgatcaccg tcggcgtgaa aaacgccgcg 643261 ccggcgtgcg acagcggggt gcacattaag aaccgcgggt tggccggcca ctcccattcg 643321 gcgagctgga tcgaggtcat ggtggcgatc gactgcgcgg tgcctatcac gcccttaggc 643381 ttgccggtgg tgccgccggt gtaagtcagg ccgataactt ggtcgggtgg caggtcggcg 643441 gcgaccagcg gctgcggctg gtatttggcg gcctcggcgg ataggtcgac tgccacatgc 643501 ttgagcgcat cgggcaccgg cccaatggtg aggatttgct gcagcgagtc cacctgctcc 643561 agcagagcca gtgcgcgctc gacgaacatc gggttggggt cgatgatcag tgagctgatg 643621 ccggcgtcgt tcagcacgta ggcgtgatcg gccagcgagc ccaacgggtg cagcgcggtg 643681 cgccgataac cgcgggcctg cccggcgccg atgatcatca aaacttcagg acggttgagc 643741 gacagcagac cgaccgccac cccggtgccg gcacctagcg cctcgaatgc ctggatgtac 643801 tggctgatac ggtccgccag ctggccaccg gtcagcctgg tgtcgccgag gaacagcacc 643861 ggcttgttct ggtggcgctt gagcgctccc actagcagat ggccgttgtg ggtcgggctg 643921 cgcaacagct cgcccgaaca atcctggtca cgcatggcgc cgctctccct cgctagctgg 643981 ggtaccccca ccgcatcgct tcgtcccccg caagcgggtg gtacccccac tgcatcgtcg 644041 ccggcggtgc tcatctggca agactagaac gtgttgcaat ttggatctgc cgtgccctcg 644101 taatctcgaa ggatcactac gcttggagcc catggccgat gcagacctcg tcatgaccgg 644161 aaccgtgctc accgtcgacg atgcgcggcc aacggccgag gcgatcgcgg tcgccgacgg 644221 ccgggtcatt gccgtcggtg accgctccga ggttgccggc ctggttggcg ccaacacccg 644281 ggtcatcgat ctgggtgccg ggtgcgtcat gccaggattt gttgaggcac acggccatcc 644341 gctactggag gcggtcgtgc tgtcggaccg gttcgtcgat atccgtccgg tgacgatgcg 644401 ggacgcggac gacgtcgttg ccgcgatccg cggcgaggtt gcacggcgcg gcccggccgg 644461 cgcctatctg gtcggctggg atccgctgct gcagtccggt cttggcgagc cgacgctgac 644521 ctggctcgac agcctcgcgc cgaacgggcc gctggtgatc atccacaact ccggacacaa 644581 ggcttacttc aactcgcacg ccgcctggct caatgggctc acccgagaca ccgcggatcc 644641 caagggcgcg aagtatggcc gcgacggcaa tggcgaactc gacggcaccg ccgaggaaat 644701 cggcgcgatt cttccgcttt tggccggtgt agccgacccc agcaacttcg gtgccatgct 644761 gcgcgccgag tgtgctcggc tcaaccgtgc cggcctgacc acatgctcgg agatggcttt 644821 tgacccaggg tatcggccga tggtcgaggc ggtgcgcgcc gaactgacgg tccggctgtg 644881 cacctacgag atctccaatg cgcggatgtg caccgatgcg acgcctgggc aaggtgacga 644941 catgctgcgc caggtgggca tcaagatctg ggtggacggc tcgccgtggg tcggcaatat 645001 cgatctgacc tttccctacc tggacacccc cgccacccgt gccatcggtg taccgcccgg 645061 ttcccgcggg tgcgccaatt acacccgtga acagttggcc gaaatcgtcg gggcctactt 645121 tccgcggggc tggcagatcg cctgtcacgt gcacggcgac ggcggtgtgg acaccatcct 645181 cgacgtctac gaagaggcac tgcgccgcaa tcctcgagac gatcaccggc tgcggctcga 645241 acacgtcggg gccatccggc ccgaccaact gcggcgcgcc gccgaactcg gtgtcacctg 645301 cagcatcttc gtcgaccaga tccattactg gggcgatgtg atcgtcgatg acctgttcgg 645361 ggcacagcgc gggtcccggt ggatgccggc tggatccgcg gtggccgccg gcatgcgtat 645421 ctcgctgcac aacgacccgc ccgtcacacc ggaggagcca ctgcgcaaca tcagcgtggc 645481 cgcaacccgg gtggcgccca gtggccgggt gctggcaccg gaggagcgcc tgacggtcga 645541 gcaggcgatt cgcgcgcaga ccatcgatgc cgcctggcaa ctgttcgctg aggacgcgat 645601 cggctcgctt caggtcggca agtacgcgga tatggtggtg ctgtcggcgg atccccggac 645661 ggtgccgcca gagcagatcg ctgacctggc ggtgcgggcg acgtttctgg ccggtcgcca 645721 ggtttatcgg cggtgatacc cgtgctgccc cccctagaag ccctgctgga ccgcctgtat 645781 gtggtggccc tgccgatgcg agtgcgtttc cgcggcatca ccacccgtga agtggccttg 645841 atcgagggtc cggccggttg gggcgaattc ggtgcgttcg tggagtacca gtccgcgcag 645901 gcgtgcgcgt ggttggcgtc ggcgatcgag accgcctact gtgcgccgcc gccggtgcga 645961 cgtgaccgcg ttccgattaa cgccactgtg ccggccgttg ccgccgccca ggtgggcgag 646021 gtgctggccc ggtttcctgg ggcccggacg gccaaggtga aggtcgccga gcctgggcag 646081 agcttggccg acgacatcga gcgtgtcaac gcggttcggg agctggttcc catggtgcgg 646141 gtggacgcca acggtggctg gggtgtcgcc gaggcggtgg ccgcggcggc cgccctgacc 646201 gccgacggcc cgctggaata ccttgaacaa ccctgtgcca ccgtcgccga actcgccgag 646261 ttgcgccggc gggtggatgt gccgatcgcc gccgacgaaa gcatccgcaa ggccgaggat 646321 ccgttggccg ttgtccgcgc tcaggccgcc gatatcgcgg tgctgaaggt cgccccgctg 646381 ggcggtattt cggcgctgct tgatatcgcg gcgcggatcg ccgttccggt ggtggtctcc 646441 agcgcgctcg attccgccgt cggaatcgcc gccggcctga ccgccgccgc ggccctgccg 646501 gagctcgacc acgcgtgcgg gctgggcacc ggcgggctgt ttgaagagga cgtggccgag 646561 cccgcagcac ccgtcgacgg ctttctggca gttgcgcgga caacgcccga cccggcgcgg 646621 ttgcaagccc tgggtgcacc gccgcagcgg cgacagtggt ggatcgaccg ggtcaaggcc 646681 tgctactcgt tgcttgtacc gtctttcggg tgatcaacct ggcctacgac gacaacggga 646741 ccggtgaccc ggtggtcttt atcgccggcc gcggcggcgc cggacgcacc tggcacccac 646801 atcaagtccc ggcctttctg gcggctggat atcggtgcat cacgttcgac aatcggggca 646861 tcggcgccac cgaaaacgcc gaaggcttca ccacgcaaac catggtcgcc gacaccgcgg 646921 cgctgatcga aaccctagac atcgccccgg cgcgcgttgt cggggtgtcg atgggggcat 646981 tcatcgcgca ggaactcatg gtggtcgcac ccgagctggt cagctcggcg gtgctgatgg 647041 ccactcgcgg ccgcctggac cgcgcccgcc agttctttaa caaagccgag gccgaactct 647101 atgactcggg tgtccagctg ccacccacat acgacgcgag ggctcgctta ctggagaact 647161 tctcccgaaa gacgctcaac gatgacgtgg ccgttggcga ctggatcgcg atgttttcca 647221 tgtggccgat taagtccacc cccggactgc gctgtcagct agattgcgct ccgcagacca 647281 accggctgcc cgcctaccgc aacatcgccg cgccggtgct ggtgattggt ttcgccgacg 647341 acgtggtgac gccgccctac ctgggtcggg aggtcgccga cgccctgccg aacggccgtt 647401 acctgcagat acctgacgcc ggtcatctcg ggttcttcga gcggccggaa gccgtcaaca 647461 ccgcgatgct gaagttcttc gccagtgtca aggcctgagc gcggcccggc catacggtcc 647521 ggctgtgaca ctctgtactg gtgaacccct cgacgacaca ggcgcgcgtc gtcgtcgacg 647581 aactgatccg cggcggcgtt cgcgacgtgg tgctgtgtcc gggctcgcgc aatgcgccgc 647641 tggccttcgc gctgcaggac gccgaccggt ccggccggat ccggttgcac gttcgcatcg 647701 atgaacgcac cgccggctac ctggccatcg ggctggcaat cggggcgggc gcgccggtgt 647761 gtgtcgcgat gacatccggc accgccgtgg ccaacctcgg tccggcggtg gtggaggcaa 647821 actacgctcg ggtgccgctg atcgtgctgt cagccaatcg gccctacgag ctgctgggca 647881 ccggcgccaa ccagaccatg gaacagctgg gctatttcgg cacccaggtg cgcgccagca 647941 tcagcctggg gctggccgag gacgcacccg agcggacctc ggcgctcaac gcgacctggc 648001 gatcggctac gtgccgagtg ttggcggccg ccacgggtgc tcgcaccgcc aacgcgggcc 648061 ccgtgcactt cgacatcccg ctgcgcgaac cgctggtgcc cgatcccgag cccctcggcg 648121 cggtcacccc gccgggccgg cctgctggca agccgtggac ctacacgccg ccggtcacct 648181 tcgaccagcc actggacatc gacctgtcgg tcgacaccgt ggtcatctcc gggcatggcg 648241 ctggcgtgca ccccaacctc gcggcgttgc cgaccgtcgc agaaccgacg gcgccgcggt 648301 ccggggacaa cccgttgcac ccgctggcgc tgccgctgct gcgccctcaa caggtgatca 648361 tgctgggccg gccgacactg catcgtccgg tatcggtgct gctggccgac gcagaagtgc 648421 cggtattcgc attgacaacc ggtccacgct ggccggatgt ctcgggtaac tcgcaggcca 648481 ccggcacgcg ggcggtcacc accggcgcgc cgcggcccgc gtggctggac cggtgtgcgg 648541 cgatgaaccg gcacgcgatc gcggcggttc gggaacagct cgcggcgcac ccgttgacca 648601 ccgggctgca tgtcgcggcg gcggtgtcgc atgcgctgcg gcccggtgac cagctggtgc 648661 tcggggcatc caatccggtg cgggatgtgg cgttggccgg tttggacacc cgcggcatcc 648721 gggtacggtc caaccgtggg gtcgccggca tcgacggcac cgtgtccacc gcgatcgggg 648781 cggccctagc ttatgagggg gctcacgagc gcaccggcag cccggactcc ccgccccgca 648841 ccatcgcact gatcggcgac ctgacgttcg tgcacgacag ctccgggctg ttgatcgggc 648901 cgaccgaacc gataccgcgg tcattgacca tcgtggtgtc taatgacaac ggcggcggca 648961 tcttcgaatt gctcgagcag ggtgatccca ggttctccga cgtgtcatcg cgaatcttcg 649021 gcaccccaca cgacgtcgat gtgggcgcat tgtgccgcgc ctaccacgtg gaatctcgcc 649081 agatcgaggt cgacgaactc ggaccgaccc tcgatcaacc cggtgccggc atgcgcgtgc 649141 tcgaggtcaa ggccgaccgg tcgtcgttgc gacaattgca cgccgccatc aaggcggctc 649201 tgtgatatca ccgaaacccc tgctgcacat cctgattcat gggcgcagtg atgaactgcc 649261 cgatactcga ggcaggatcg tgctgcgctg gttacgaatc gccgtcctga tagtgaccgg 649321 tttggtcacg ctgcagtcgg tgcttctggt ggctggtgcg tggcgcaatg acattgcgat 649381 ccaacgtaat atgggggtcg cgcaggctga ggtgctcagc gccgggccgc ggcgttcgac 649441 gatcgagttt gtcacaccgg atcggatcac ctatcggccg caactcggtg tgctgtatcc 649501 gtccgaatta tccacgggca tgcgaattta cgttgagtac aacaagaggg atcccaacct 649561 ggtcagagtg cagcaccgta acgccggact ggcgatcatc ccggccgggt ccatcgcggt 649621 ggtggcctgg ctgatcgccg ccgccgcgct ggtcgtgcta gcggtgctgg acaagcggtt 649681 ggaacgtcgt gaaaattcgg cgtctgcaac gggctgagca gcagagttcg cacgccgtat 649741 gccgctacgc aaccatttcg acagccggcg ctgacagtgt gtgtggcgtg cgcgttgcga 649801 tcgtcgccga gtcgttcctc ccgcaggtga acggcgtcag caactcggtg gtcaaggtac 649861 tcgaacatct gcgtcgaacc ggtcatgaag ccctggtgat cgcgcccgat acgccgccag 649921 gtgaagaccg cgccgagcga cttcacgacg gtgtccgggt gcaccgggtg ccgtcgcgga 649981 tgttcccaaa ggtgaccacg ttgccgctcg gcgtgcccac cttccgaatg ctgagagcgc 650041 tgcgcggatt cgatccggat gtcgtgcatc tggcgtcgcc ggcgctgctt ggctacggcg 650101 gactccatgc cgctcggcgg ctaggggtgc ccacggtcgc ggtctaccaa accgatgttc 650161 cgggtttcgc gtccagctac ggcattccga tgacagcacg ggcggcgtgg gcatggttcc 650221 gccacttgca tcgcctggct gaccgcactc tggcgccgtc cacagcgaca atggaatccc 650281 ttattgccca gggcattccg cgagtacacc ggtgggcacg cggggtggac gtgcaacgtt 650341 tcgcgccgtc ggcgcgaaac gaggtgttga ggcgacggtg gtcaccggac ggcaaaccca 650401 tcgtcggctt tgtgggtcgg cttgctccgg agaagcatgt cgaccggctc acgggtctgg 650461 cggcctccgg cgccgtgcgg ctggtgatcg tcggcgacgg catcgaccgg gcaagattgc 650521 aatcagcaat gcccacagcg gttttcaccg gagcacggta tggcaaagag ctcgccgagg 650581 cgtatgccag catggacgtc ttcgtacatt ccggtgagca cgagacgttc tgccaagtcg 650641 tgcaggaagc gctggcgtcg gggctaccgg tgatcgctcc ggacgccggc ggaccgcgtg 650701 atctgataac cccgcaccgc accgggctgc tgttgccggt cggcgagttc gagcaccggc 650761 ttcctgacgc cgtcgcccac ctggtgcacg aacgccagcg ctacgcgctg gccgcccggc 650821 gcagtgtgct gggccgcagt tggccggtgg tctgcgatga gctgctcggc cactacgagg 650881 cggtgcgagg tcggcgcacg acccaggccg cgtaacggta gcgtcgaggc tatgagtcgc 650941 gccgccttgg acaaggatcc ccgcgacgtg gcgtcgatgt tcgatggcgt cgcccgcaag 651001 tatgacctga ccaataccgt gttgtccctg ggccaggacc ggtattggcg gcgagccact 651061 cggtcggcgc tgcggatcgg gcccggccaa aaggtcctgg acctggccgc gggcaccgcc 651121 gtgtccaccg tagagctcac caaatcgggc gcgtggtgtg tggctgccga tttttcggtc 651181 ggcatgcttg cggcgggcgc tgcgcgcaag gttcccaagg tcgccggtga cgccacccgg 651241 ctgccgtttg gtgacgacgt gttcgatgcg gtcaccatca gtttcgggct gcgtaacgtc 651301 gcaaaccagc aagcggcgct gcgggaaatg gctcgtgtca cccggccggg cgggcggcta 651361 ctagtgtgcg aattctccac gcccaccaat gcgttgttcg ccaccgccta caaggaatac 651421 ttgatgcggg cgctgccccg ggtggcgcgg gcggtgtcta gcaaccccga ggcctacgag 651481 tacctcgcgg agtcgatcag ggcctggccc gaccaggcgg tgctggcgca ccagatttcg 651541 cgggccgggt ggtcgggggt gcggtggcgc aacctgaccg gcggcatcgt agctctgcat 651601 gccggataca aacccggcaa acaaaccccg cagtgaccgg taggaagact tagcgggtgc 651661 cagcccgttg caggacgccc acatgctcag ggcagtagtg atcgatcgcg gcgcccagga 651721 actgaaacgc ttggccctgg gtggttccgc gcggcaggtt gcgttgcagg aaagtggccg 651781 acttgtacgc atcgccgtca acgcctctgc tcagccgttc gcagctgatc ttggcaagcc 651841 aagcgttgta gtcctgcggg ccgtagatcc cgaagcgatg gatcgtgttg ttgaaggggg 651901 cgtcgtagtc gtcggcctgc gccggcgctg ccaaactaac ggcagccacc gtcatgccga 651961 cgacaacagc cagctttgtt cccttcatta gccggactat acgcgtcgtt tgggtgcgcc 652021 gtcagcccag gtgggccgag agcagccagc caccgatcga ctgcagcccg ttgggctctt 652081 cgcggatgtc caggagtgcg ggcatgccgg cgaagccggc cgggaacctc gcgtacagcc 652141 gcgcgggctt gatctcatcg atgatccagt acttggacac cgccgcgcgc agctcgtcct 652201 cggtgaccgc attgatcggc ccctcgggta tcgccgcccg gtcgaatacc aacacgaagt 652261 aggaggcgcc cggtgccgcc gcacgcacga tcgattgcag atagccctcc cgggactcga 652321 ccggcatgga gtggaacagc gtgctgtcga cgatggtgtc gaacctgccg tcatagccgg 652381 taaacgaact ggcgtcggcc acctcgaagc tggcattggc caggccgcgc ttcgctgctt 652441 catgccgagc cagttctacg gcggcggggg agaggtccag tccgaccgtg gtgtgtcccc 652501 gttcggccag tgccagcgaa atcgcggcct ccccgcagcc cacgtcgagg acgtcgccgc 652561 ggaacttgcc ctgcacgatc agggcggcca gctcgggctg gggttcgccg atgctccatg 652621 gcggtcggac tccctccccg aaggcgacgg attcaccgcg gtaggcggat tcgaactcaa 652681 gatccagcga ttcagtcatg tgttcatata tatcaacggc cctgatatat gtcaacacag 652741 ttgacattcg cgcacccttg gttgccggcc gtcagctgaa cggcggtcgt cgatcgacga 652801 gccgggacaa ttgaccgcca ccgcgccaca cccgcgccac ccagtcgcgg tcgtcgtcgg 652861 tgaccagatt ggacatcacc cgcacggcga tgttcatcaa tgcggtggag cgcatcgtga 652921 tgggcccagt cgtgggtagg aaccgtggga aggtcagtaa caacgctagc cggcgcgcaa 652981 ccgagaagcc gcgaccgtag cggtcggcca gcagcgacgg ccacagccgt gccaggtcac 653041 gcgaatccag cagttcggcg gccagccgcc cggtttccag cccgtagtcg atgccctcgc 653101 cattgagcgg gttgacgcag gccgcggcgt cgccgatgag catccagttg gacccggcca 653161 ctccagaaac cgcgccgccc atcggcaaca gcgccgacga caccgcgcgc ggctggccgg 653221 tgaagcccca ctcgtcacgg cgcaggtcgg tgtagtagga gatcagcggg cgcagggcca 653281 gatcggctgg ccgtcttgag gtcgacaacg ctcccacgcc gatgttcact tcgccgttgc 653341 ccagcggaaa gatccagccg tagccgggta gcacggcgcc gtcgggggag cgcagttcca 653401 gatgcgacgt cagccacggg tcatcgctgt acgccgtgct caggtacccc cggaccgcga 653461 cgccatagac cgtctcccga tgccatcgcc ggcccagctt gcgtcccagc ggggatcggg 653521 ccccgtcggc aacgatcagc tggcggcagc ccacctcagt gccgtcggcc agggtcagcg 653581 ataccacccg cctcgatgaa tcatggtgaa cagcaacggc tttagcgcca agtagcatgc 653641 gcgcaccggt gtcctcggcg acctttcgga tccggtcgtc cagctcgaga cgggccaccg 653701 cgctgccgta cgacgggaag ctcggaccgg gccagtccac ttccacctcg cctccgaagc 653761 cgctcatccg caacccacga tgccggatgt ggtccgccag ccacttacct agtcccagct 653821 ggtgcagttc ggcgaccgcg cgtggtgtca gcccgtcgcc gcaaggcttg tcgcggggga 653881 aggtggcggt gtcgatgacg aggacgtcgc ggcccgcgcg ggcagcccag gcggccgcag 653941 ctgacccggc cggtccggcg cccacgacca ccacgtcggc actgtcatcc acgctcacca 654001 gtatgttggt cgagtgagga ctccggcgac ggtggtggca ggcgttgacc tgggcgacgc 654061 tgtctttgcc gcggccgtgc gtgctggtgt cgcgcgagtc gagcaactca tggacaccga 654121 gctgcgccag gccgacgagg tgatgagcga ttcgctgctg cacttgttca atgccggtgg 654181 caagcggttc cgtccactgt tcaccgtgct gtcggcgcag atcgggccgc agccggatgc 654241 cgcagcggtg acagtcgccg gggcggtgat cgagatgatc cacctggcga ccctctacca 654301 cgatgacgtg atggacgagg cccaggtccg ccgcggcgcg cccagcgcca acgcgcaatg 654361 gggtaacaac gtcgcgatcc tggctggcga ctacctactg gccaccgcat cgcggctggt 654421 ggcgaggttg ggaccggagg cggtgcggat catcgccgac accttcgccc agttggtgac 654481 cgggcagatg cgtgagacgc gcggcacgtc ggagaacgtg gactccatcg agcagtacct 654541 gaaggtggtc caggagaaga ccggcagtct aatcggggcg gccggccggc tgggtgggat 654601 gttctccggt gccaccgacg aacaggtcga acggctgagc cgcctcggcg gcgtggtggg 654661 caccgcgttt cagatcgccg acgacattat cgacatcgac agcgagtctg acgagtcggg 654721 caagctgccc ggtaccgatg tgcgcgaagg agtacacacc ctgccgatgc tctacgcgtt 654781 acgggaatca gggcccgatt gcgctcggtt gcgcgcactg ctgaacggac cggtcgacga 654841 cgacgccgag gtgcgcgagg cgctgacatt gttgcgggcg tcgccgggca tggcccgggc 654901 caaagacgtc ctggcgcagt acgcggctca ggcacgtcac gagctggcct tactgcccga 654961 cgtcccggga cggcgtgccc tggcggcgct ggtcgactac accgtgagcc ggcacggcta 655021 ggttgcccgg ccaggctcga ttgcggaacc agcggatacc cctcaggcgt tgaaccagca 655081 gtaatctccc aagttgaggt gttctaggag gacacgcact gatgacttgg catccgcatg 655141 ccaaccggct gaagacgttc ctgctgttgg tcggtatgtc cgcgttgatc gtggccgtcg 655201 gcgcgttgtt tggcaggacg gcgctgatgc tggcggcgct gttcgccgtc ggaatgaacg 655261 tctacgtcta cttcaatagc gacaagctgg cgctgcgggc gatgcatgcg caaccggttt 655321 ccgaactgca ggcgccggcg atgtaccgga tcgttcgaga gctggcgacc agcgctcacc 655381 agccgatgcc ccggctgtac atcagcgaca ccgccgcacc caacgcgttc gccaccggcc 655441 gcaacccgcg caatgccgcg gtgtgttgca cgactggcat cctgcgtatc ctcaatgagc 655501 gtgagctgcg tgccgtgctg ggccacgagc tgtctcacgt ctacaaccgc gacatcctga 655561 tctcttgtgt ggcaggtgcg ctggcagcgg tgattaccgc gctggccaac atggccatgt 655621 gggccggcat gttcggcggc aaccgagaca acgccaatcc ctttgcactg cttctggttg 655681 cgctgctggg cccgatcgcg gcaaccgtga tacggatggc cgtgtcgcga tcgcgggagt 655741 accaggccga cgagtcgggt gccgtcctga ccggggaccc gctggcgttg gcgtcggcat 655801 tgcgcaagat ctccggcggc gtccaggcgg cgccgctgcc gccggagccg cagctggcca 655861 gccaggcgca cctgatgatc gccaacccgt tccgggcggg tgagcggatc ggatcgctgt 655921 tttcgactca cccaccgatc gaggaccgca ttcgccggct ggaggcgatg gcgcgcggct 655981 gataactgtg ggtatcgaga tgccatcggt gatgagtcag gcgccgctat cgaggaggcg 656041 gtcgatcagt tcgtggctgg catgccggcg tgcggcgagg acgcgcccgt cccactgacc 656101 gaacgcaaca gccgaactat cgttgtgcgg acatcaccgg catgcgtgcc ggcagcggtg 656161 gcaagcctaa aaccccgagc cgtgcacctc gtgtccgggg acctcggcga tcaagcctct 656221 atacgcctgc tccaccgtcg aaccatggtt gatgacggcg tcgacctcgc gggcgatcgg 656281 catgttcagc ccgaactcgt tggcgaactc catcaccaca ccggcagctt tgacgccctc 656341 ggcgacctgg ctcatcgatg cgatgatttc gtcgatcggc ttgcctgcgc cgagttgttc 656401 gcccacatgc cggttgcggc tgcgttggct ggtgcaggtg acgatcaggt cgccgagacc 656461 ggccagtccg gggaacgttt cgcttttccc acccattgcc acacccagct tcgtcatctc 656521 gcgcagcgcg cgggcgatca ccagggcgcg ggtgttttcg ccgataccca gcgaatagcc 656581 catcccgacc gcgatggcga agacgttctt gagggcgccc gccgtctcga caccgacgac 656641 gtcgtcagtt gtgtacacgc ggaagcgccg ggtgcgaaac attgctgata gccgggtcgc 656701 caggtgctgg tcgggcatgg ccagcaccgc cgcggccgcg tagccctcgg ccacctcgcg 656761 ggcgatgttc gggccggcca ggatgcctgc cggatgaccg ggcagtacct cctcgatgat 656821 ctgcgacatc cgcatattgg tgccctgttc gagccccttg accagggaca ccactggcac 656881 ccagggtcgc agctctttgc tcagctcgac aagcactccg cggaaaccgt gcgagggcac 656941 ccccatgacg acgacgtcgg cgcagttggc ggcctcggtg aagtctgtgg tggcgcgcag 657001 ggtgtcgctg agcaccacgt cgttgccgag gtatcggcta ttgcggtggt tgtcgttgat 657061 gtcctgcgcg gtgaccgccg agcgcaccca ctgcaaggtt ggtccgcggc gcgcacagat 657121 ggaggcgacg gtggtgcccc aggaaccgcc gccgaggaca acgactttgg gttcgcgctt 657181 gttggctgcc atggcgttca gcgtattgcg gcaaccggac atttgatagc cgtcgacgaa 657241 ccgcaggagc aatcatgccg cgccgaacac cattgcctcc tcgatgcggt cgaatcggta 657301 gtcgatggcg tcggccaagt agttctgtcg tacattccac ggccgcttgg tgccggactt 657361 gggcagcgcg tacggcgccc gcttcacata gccggcctga atgtcccagg acggtttctc 657421 gtccatcggc tcgtcgccca ggtgcggggc ggcgcgcgtg tgtccatggg cggccatgtg 657481 tgccagtagt tttgccgtcg cccgggccgt catgtcggcg cgcagcgtcc aggacgcgtt 657541 cgtgtaaccc acacaccaga acaggttggg cacgtcttcg agcatgtgcg ccttgtagac 657601 aaagcgatcc cgagggtcga tctcgacgcc gtcgaggctg atcgcggccc cgccaagcgc 657661 ttgcaactgc aggccggtgg cggtgacgat aatgtccgca tcgaggtgcc caccggattt 657721 gagtgcaata ccggtggcgt cgaagtggtc gatatggtcg gtgaccacct cggcgcggcc 657781 gctggtgatg gcgttgtaca ggtcggcgtc cgggatcagg cacagtcgct gatcccacgg 657841 gttgtaccgc ggcgtgaagt gggtttcgat gtcgtagccc tcgggcagat ttttgatcgc 657901 ggtacggcgc agcagccatt tcacgaacac cggtgtcttg cgggacaaga accagaacac 657961 cgcttccaat aacgcgttgt acattcggac aatcaagtga gaagttttgg gaggcaacgc 658021 tttacgaaca acggcggcga acgtgctgta tttggatgcc gagatcaggt aggtcgggga 658081 tcgctgcagc atggttacct tttcggcccg gtcggtcagc gaggggatca gtgtgaccgc 658141 ggtggccccg ctgccgatca ccacgatctt cttgccggtg tagtccagat cctctggcca 658201 gtgctgggga tgcactaccg cgccgccaaa cttctcgatg cctccgaagt cgggggtgta 658261 gccctcgtca tagttgtagt agccgctgcc gaagaacacg aaccggctgc ggtagtgctt 658321 gtgcacgccg ttctgctcga aggtgacggt ccaggtatcg gtggatgagt cccagtccgc 658381 tgcgcgaacg tagctgttga actcgatgtg gcgatcgatg ccgtacttgt gggccatgtc 658441 ggtgaggtac tcgcggatgt gggcgccgtc ggcgatgcct tcttcgcggg tccacggctc 658501 gtagggaaac gacagcgtga agatgctgct gtcggagcgc acgccggggt agcggaacag 658561 atcccaggtg ccgccgatcc gcgcacgcct ttccaggatg gtgtaggtca gctgcgggtt 658621 gcgttcgatg atccggtagg ccgcgcccag tccggagatg ccggcgccga cgatgacgac 658681 gtcgacacag ccggcgtttg gagtcacgct catcgtgaac ctcgcttgaa atcctggatc 658741 agcgaccagg gtagccagga catccagcca gcccctccag atcgccgcga ctagcggtag 658801 ttcacaaact gcaatgccac gtccaggtcg gccttcttca gcatggcgat gacggcctgc 658861 aggtcgtcgc gcttcttgct ggtgacccgg acctcgtcgc cctggatctg ggttttgacg 658921 ttcttggggc ctgcgtcgcg gatgagcttg gtgatcttct tggcgttctc gctgctaatg 658981 ccctgtttga gggcgccggt aactttgtac gtcttacccg aggcctgcgg ttctccggcc 659041 tcgaaggcct tcagcgagat gtcgcggcgg atcagcttct ccttgaagac gtcgacggcg 659101 gccttgacac gctcctcggt ggacgaggtg agctcgacgg cctcgtcgcc cttccacgcg 659161 atcttggtgt cggtgccgcg gaagtcgaag cgcgtggcca gctccttggc ggcctggttg 659221 agtgcgttgt cgacctcctg ccggtcgacc ttgctgacga tgtcgaacga tgagtccgcc 659281 attcggttcg tccctccttc gcgagatagc cgtgtgtgct ctgtctaccc ggtcgttgta 659341 ccctgctagg cggcaggttg cccgagcggc caatgggagc ggactgtaaa tccgtcgcga 659401 aagctacgca ggttcgaatc ctgcacctgc caccacggtc aagctggtat ccgggcatgg 659461 gcgccgggca tggccacgcc cgcgcgttgg tgccccaacg tcgcctacgg tcggtagaca 659521 gcggcgcgac acccgcactc caacaatttc gggaggtcaa gtggtggagt tgagcccgga 659581 tcggatcatg gcgatcggcg gcgggtacgg cccgtctaag gtactgctta ccgcggtcgg 659641 gcttgggctg ttcaccgaac ttggcgatga ggccatgacc gccgaggcca ttgccgaccg 659701 cctcgggttg ctaaagcgac cggcgattga cttcctcgac gccttggtct cgctggactt 659761 gctggcgcga gacggcgacg gacccgggtc ccactaccgc aatacaccgg agacagcgca 659821 ctttctggac gaggcccgtc ccacctacgc gggcggcctg ctgaagatct ggaacgaacg 659881 caactaccgc ttctgggcgg atttgaccga ggcgctcaag accgggaagg cacaaagcga 659941 ggtcaagcaa accgggcggc ccttcttcga ggcgctctat gcagatcctc ggcggctcga 660001 ggcgttcatg gcggctatgg acgcggcgtc gcgacgcaac atcgagctcc tcgcgaaacg 660061 ctttccgttc gagcgctacc ggcgtctctg tgacgtgggc tgcgcggacg gtctgttgtc 660121 acgaatcgtc gcggcggctc acccgcactt gcagtgcgtc agcttggact tgcccgcggt 660181 gaccgagatc gctcgacgca agctgacagc cgagggtttg ggtgagcggg tgcaggcgtg 660241 cgccggtgac tttttggccg accctctgcc ggcggccgat gtcatcacga tgggccagat 660301 tctgcacgac tggaacctcg accgtaaaca gcagttggtc gctaaggcct acgaggccct 660361 gtccaaggag ggggctttca ttgtgatcga gacattgatc gacgacgcgc gacgcgaaaa 660421 cacaaccggc ctgatgatgt cactgaacat gcttatcgag ttcggtgacg cgttcgacta 660481 ctccgccgcc gacttccggg ggtggtgtgg cgaggcggga ttccgttcgt tcgaggtgat 660541 cccgcttgcc ggcggctcca gcgcggcggt ggcctataaa tagcgggcaa tgacatggtg 660601 ggtggccgac caacgtgaac tgaggacggc aaatcggcct cagttcacgc tcggcgcttt 660661 gagcaacaaa ttgaacacat agaatcgtgt cgatgagcgg cacatcgtcg atgggattgc 660721 cgccgggacc tcgactttcc ggctcggtgc aggccgtgtt gatgttgcgc catgggctgc 660781 gttttttgac ggcctgtcaa cgccgttacg gcagtgtttt cacgctgcat gtcgcggggt 660841 tcggccacat ggtgtatctg tccgatccgg ccgccatcaa gacagtgttt gccggcaacc 660901 cgagtgtctt tcacgccggc gaagccaact cgatgttggc cggactgctc ggcgacagct 660961 cactgctgtt gatcgacgac gacgtgcacc gcgaccggcg tcgcctgatg tcgccgccgt 661021 tccatcgcga cgcggtcgcg cgccaggccg ggccgatagc cgagattgcc gccgccaaca 661081 tcgccgggtg gccgatggct aaggcgttcg cggtggcgcc caagatgtct gagatcaccc 661141 ttgaggtgat cctgcggacc gtcataggcg ccagcgatcc ggtccggctc gccgcgctgc 661201 gcaaggtcat gccgcggctg ctcaacgtgg gcccgtgggc gacgctcgca ctggccaacc 661261 cgagcctgct gaacaatcgg ctctggagca ggctgcgacg gcggatcgaa gaagccgacg 661321 ccctgctgta cgccgagatc gccgaccgcc gagccgatcc cgatctggcc gcacgcaccg 661381 acacgctggc catgctggtt cgggccgccg acgaagacgg acggacgatg accgagcgcg 661441 agctgcgcga ccagctgata acgttgctgg tcgcaggtca cgacaccacc gcgacgggac 661501 tgtcgtgggc actggagcgg ttgacccgcc acccggtcac cctggccaag gccgtgcaag 661561 cggccgacgc cagcgcggcc ggcgatccag ccggcgacga gtacctggac gcggtggcca 661621 aagagacact gcggatccgc ccggtggtgt acgacgtggg ccgggtcctc accgaggcgg 661681 tggaggtggc cggttaccgg ctgccggccg gggtcatggt ggtcccagcg atcgggctgg 661741 tgcacgcgag cgcgcaactg tatccggatc cggaacggtt cgaccctgat cggatggttg 661801 gcgccacttt gagcccgacc acctggttgc cgttcggcgg cggcaaccgc cgctgcctcg 661861 gcgccacctt tgccatggtc gagatgcggg tcgtccttcg ggagatcctg cgccgcgtcg 661921 agttgagcac caccacgacc tccggcgaac ggccgaagct aaagcacgtc atcatggtgc 661981 cgcaccgcgg cgcgcgcatc cgcgtccggg caaccaggga cgtttcggcc acgtcgcaag 662041 cgacagccca gggtgccgga tgcccagccg ctcgcggtgg cgggccgtcc agagccgtcg 662101 gcagccagtg accagctggg gtatccgcat ggggtcgccc agcgggtccc gaggggactt 662161 ttggccaccg gcgctggtgg cctactgccc tcccgccgtt gcgccgggtg cgtgcacgat 662221 tgaagtcccc aaggaaggga cgctcatgaa ggcaaaggtc ggggactggc tggtgatcaa 662281 aggcgcgacg atagatcaac cggaccaccg agggttgatt attgaggtgc gctcatccga 662341 tggttcgccg ccgtatgtgg tgcgctggct cgagaccgac catgtggcga cggtgattcc 662401 gggtccggat gcggtcgtgg tcactgcgga ggagcagaat gcggccgacg agcgggcgca 662461 gcatcggttc ggcgcggttc agtcggcgat cctccatgcc aggggaacgt aggcgattcg 662521 ctcaagcgac gaagtcggtg ggtgtcagct ggccggcgaa agtccggcgc cgggatggaa 662581 cgctggtgcc gttcgacatc gcgcggatcg aagcagcggt gacgcgggca gcgcgcgagg 662641 tggcttgcga cgaccccgat atgccgggca ccgtagcgaa agccgtcgcc gacgcactcg 662701 ggcgcggtat cgctcccgtt gaggacattc aggactgcgt ggaggcccgg ctgggggaag 662761 ccggtctgga tgacgtggcc cgtgtttaca tcatctaccg gcagcggcgc gccgagctgc 662821 ggacggctaa ggccttgctc ggcgtgcggg acgagttaaa gctgagcttg gcggccgtga 662881 cggtactgcg cgagcgctat ctgctgcacg acgagcaggg ccggccggcc gagtcgaccg 662941 gcgagctgat ggaccgatcg gcgcgctgtg tcgcggcggc cgaggaccag tatgagccgg 663001 gctcgtcgag gcggtgggcc gagcggttcg ccacgctatt acgcaacctg gaattcctgc 663061 cgaattcgcc cacgttgatg aactctggca ccgacctggg actgctcgcc ggctgttttg 663121 ttctgccgat tgaggattcg ctgcaatcga tctttgcgac gctgggacag gccgccgagc 663181 tgcagcgggc tggaggcggc accggatatg cgttcagcca cctgcgaccc gccggggatc 663241 gggtggcctc cacgggcggc acggccagcg gaccggtgtc gtttctacgg ctgtatgaca 663301 gtgccgcggg tgtggtctcc atgggcggtc gccggcgtgg cgcctgtatg gctgtgcttg 663361 atgtgtcgca cccggatatc tgtgatttcg tcaccgccaa ggccgaatcc cccagcgagc 663421 tcccgcattt caacctatcg gttggtgtga ccgacgcgtt cctgcgggcc gtcgaacgca 663481 acggcctaca ccggctggtc aatccgcgaa ccggcaagat cgtcgcgcgg atgcccgccg 663541 ccgagctgtt cgacgccatc tgcaaagccg cgcacgccgg tggcgatccc gggctggtgt 663601 ttctcgacac gatcaatagg gcaaacccgg tgccggggag aggccgcatc gaggcgacca 663661 acccgtgcgg ggaggtccca ctgctgcctt acgagtcatg taatctcggc tcgatcaacc 663721 tcgcccggat gctcgccgac ggtcgcgtcg actgggaccg gctcgaggag gtcgccggtg 663781 tggcggtgcg gttccttgat gacgtcatcg atgtcagccg ctaccccttc cccgaactgg 663841 gtgaggcggc ccgcgccacc cgcaagatcg ggctgggagt catgggtttg gcggaactgc 663901 ttgccgcact gggtattccg tacgacagtg aagaagccgt gcggttagcc acccggctca 663961 tgcgtcgcat acagcaggcg gcgcacacgg catcgcggag gctggccgaa gagcggggcg 664021 cattcccggc gttcaccgat agccggttcg cgcggtcggg cccgaggcgc aacgcacagg 664081 tcacctccgt cgctccgacg ggcaccatct cactgatcgc cggaaccacc gcgggcatcg 664141 agccgatgtt cgccatcgcg ttcacccgcg ccatcgtcgg ccggcatctg ctggaggtca 664201 atccgtgctt cgaccgactg gcccgcgatc ggggctttta tcgtgacgag ctgatcgccg 664261 agatcgctca gcgtggcgga gtccgtggct atccgcggct gcctgctgag gtgcgggccg 664321 cgttcccgac cgcggcggag atcgcgccgc agtggcatct gcgcatgcag gccgcggtgc 664381 agcgccacgt cgaggccgcc gtgtccaaga cggtcaactt gcccgccacg ggacggtcga 664441 tgacgtccgc gccatctatg tggccgcctg gaaggcaaag gtcaagggca tcacggtgta 664501 tcgctacggc agccgggaag gacaggtact gtcctacgcc gcgccgaaac cgctactggc 664561 gcaggctgac acggagttca gcggcggctg tgcgggccgc tcctgcgagt tctgacggcg 664621 gctcccatgg cgcgagcaga cgcagaatcg cacaaaatca gcgattttga tgcgattctg 664681 cgtctgctcg cgcagggatc gcagggatca ccccggccgg ctagcggttt agccgcttgg 664741 gcctgggccg cacaagtggt cgatgaacca atcgcacgcc agcttggcaa cctgttccag 664801 cgtgcctggt tcttcgaata ggtgtgtggc gccggggacc acggtgagtt ggcatttccc 664861 gggtattacc gcttgcgctc gttggttcag ctcgaggacc acctggtcgc gtccacccac 664921 gatcagcagc gtcggtgcca ccacgctccc cagcgaatca cccgcgagat cgggccggcc 664981 gccgcgggac accaccgccc gcacgttcac gcgcggatcg gcggccgcga ccagcgccgc 665041 acccgctccc gtgctggcgc cgaagtagcc gaccggcagc gatgcggtgt cgggctgggt 665101 ggccaaccaa ccggtcacgt cgatgagtcg ggaagcgagc agctcaatgt cgaagacgtt 665161 ggcgcggttg cgttcttctt cgggcgtgag caagtcgaat aacagcgtcg caaacccggc 665221 cccggtcaag acctctgcaa cgtaccgatt gcggatactg tgccggctgc tgccactgcc 665281 atgtgcgaaa accacaattc ccctgggttt ttcggggaca gtcaggtgcc ctgccaccgg 665341 tactggaccg gcaacgacct ggacctcctc atcgcgaagc ggtgggtcag cggcggcatc 665401 gatcgcacct gcctcggcga agtcgcggtg agcacgatcc agaaacgcca ccacctcgtc 665461 gtcggaggtc tgggtgaagt tgcggtaacc ctgcccgacg gcgaagaaca acgccggcgt 665521 cgccaaacac accacctcat cggcgtaccc ggcgaatctc gccacgatgt cgtctgggcc 665581 gatcgggacc gccagcacca ccttgtccgc accgtgcgcc cgggcgacct ggcacgccgc 665641 cttggccgtc gctccggtgg cgatgccgtc atcgacgatc accgcgatcc gcccggtcaa 665701 cgggatgcgg tcacgcccgc ggcggaagcg ttccgcgcgg cgttgtagct cgatcagctg 665761 cttgcgttcg accgcgtcca tggcggcagc atcgaggtgt gtcccgcgga cgacgtcgtc 665821 gttgagcacc cgcacgccgt cctcaccgat ggcgccgaaa gccaattcgg gttggaacgg 665881 cacgccaagc ttgcgcacga ccaggacgtc gagtggcgct tgcagtgact tggcgacctc 665941 aaaggccacc ggtaccccgc cgcgcggcaa gccaaggacg acgacggcct tgccggatag 666001 ctgcgccagg cgttgcgcca actggcgtcc agcgtcgcca cgatcgtcaa agagcttcat 666061 ctgccgagtg tgtcgccatc tcatggctcc aaatatggaa ttaggtccct gggccgactg 666121 acgacagtcc ctcagcgacc ggattgcgca tcccgccttg tacgctactc cgcaaatccc 666181 gggcttgcgt ccgcggaagc gaactcggcg gcgctacggt ggtggctcac ttcggccgtg 666241 cgcactcgga tcgacgggcc gatggcggcc gggcccgcgc gcttcatagg tcatcggatt 666301 gaggtgatcg actcggcgat gagtgttcga aagatgactc agtggtgtgc cttccgtcgg 666361 tgagctgcac gacatatgtg cggtcgtcgg cgtcgtactc aaccgtgccg cccgagacaa 666421 ccatcccaac caacgcattg ccgtcctgga agtagtacgc gttcttttct cggcgcatgg 666481 aatccaggtg gcaaccgggc actatgagga cgctcctgcg gggatcgtcg gtgaggacca 666541 gaatccgcgc gccccgtttc tgggctaggg gatgcttcgt aggcttccgt tgccgcatgt 666601 gccgcttgat ggcgtgctca cccattttgg ttttgccctc tcacttgacg ctgcgttgcc 666661 tagcatgcca accggctagc ttcgcggaac gtgctccccg gggtgcgggc attcaccggg 666721 cacgtgaatc agtactgcgc cgtcatcgac gatcccggct tgaccgcggc gatgggcggt 666781 gatgtatagc catccgggtt cgatggtttg cttgcagcgt gtacattgcg ggtcggcggc 666841 catgtgctcc tcgcttccct agcctcacgg tttgcgccgt cggtcgacag gcgaactgct 666901 cttcgccgat gtacgtcact gcttcggcgg ctaaacccct tgtccagcac gacaagtcca 666961 accggcctgc gtcggcggag tttggcctcg tgctcggctg gcggtgctca tggtgtccct 667021 ccggaactcg gggtaacggc aagctttcga tgcgtcggca gtccgaaatc tagagacgac 667081 gaacttgttg ttctagggtc gtttggcctt cgccccgacg acgttggacc cggggtgggc 667141 ttcggccgtg tcggcgtgcc gcagccgggc gagttcgccc acgatcctgt cgctgaccgc 667201 caccggatac gagtagccgg tgtcctcgag cgagcgcagc tccggcggga gcgcgtcgat 667261 ctgctggcgg gcccagtccc gcgcgccgtc caatgtgggt gcatgctgcc ggatgcgtcg 667321 gccgttggtc atgatgggca ccagcaacgg gtccccggga aggttttcac cgtgctcgcc 667381 gagcgtgtcg ccgcaaaaga ctccgtgctc gagcttacgg aacacctgct tgcgtcccgg 667441 gtagatcacc ttgccgctgg agaacttggt gcgcccgctg ccgtcgtatg ccaccagctt 667501 gtaggccatg tccagcgcgg gcgcgtcttg agccacgacg agctgggtgc ccacgccgaa 667561 gccgtcgatc ggacagcggg cagccaaaag cgcggcgatg cggttttcgt cgaggcccga 667621 cgacgcgaag atctcgacct gctcgagacc ggcggtgtcg agccgtgcac gggtcgcctt 667681 ggacagctca tcgaggtcgc cggaatccag ccggaccgcg cgcacatcga agcgattgcc 667741 cagccgcttg gccaactcga tgacgtgatc gacgccgcgt agcgtgtcgt aggtgtccac 667801 gagcagcatg gtggctgggt agagccgggc gaacgcctcg aacgcggcca cctcactgtc 667861 gaaggcttga acaaagctgt gcgccatggt gccgaacgtc gggatcccat attggcgggc 667921 cgcgagcaga ttcgacgtgc ccgcagcgcc cgcgagataa ctggtgcgcg cgaccttgca 667981 ggccgcgtcg gtgccgtgag cgcgccgcgc gccgaaatcc accaccggtc gtccgcgcgc 668041 ggcggcgacc acccgcgcgg ccttgctcgc gagcacgctt tgcagatgaa tctggttcag 668101 cacgaacgtc tcgacaagct gggcctcgat gattggcgcg atcagctgga ccgcgggttc 668161 gttcggaaaa atcacggttc cttccggcgc ggcccagaca tctccggtga aacgcactcc 668221 ggccagccac ctcaggaact cgtcggaaaa ctggcccagg ccacgcaggt aacgcagatc 668281 ctgctcgtcg aatcgaaacg cttcgaggaa ctcgaccaca tcggccagcc cggcggccat 668341 gatgtaggac ctgccaggcg gaagcttgcg gaagaatatc tcgaaaaccg ctgtgcccga 668401 cattctttcg gcccagtagg cctgggccat cgtcacctcg tacaggtcgg tgaacagcgc 668461 gccgacgtgt tggcggatcg ccatggttgc cggttactcc ttgctcgtta ggttggcagc 668521 gggaacgacc tccagcaggt tgtcgggtcg agtcacgact cgaatcccga accggcggct 668581 gatgcgctca atggtgttgc ggagccattc ggtgtcggtc tgggaggcac gctgtaggcg 668641 catccgcgac actcgcagtg gaagcatctg cagcgagatc aggttcccgc tggcgggatc 668701 ggtgacggtc agatacagca gtcgcagttc actgcggaac gactcgtgcc cgccgatgcc 668761 ttcgtagtcg tcaacgacgt caccgcatcc gtacaggatc ggtttaccgc gatatatctc 668821 gattggccgc ggatggtgcg aggaatgtcc gtggaccatg tcgatgccgg cgtcgatcag 668881 tcggtgcgcg aacgcgacgt cgccgggtgc ggtcgcatag ccccaattgg atccccaatg 668941 catcgagact atggcgatat cgccggggcg tttgtccgcc agcacctgtg ccgccacatc 669001 gtcggcgacg tcgcgttgcg ccggatcccg gatcaaccac actccgggcc ggtcgcggcg 669061 ggcggcccag gattcgggga cgccgctgga ttccgccgct accgagccga cgatcacccg 669121 gcgttcatgg ccaaccgtga ctagcgccga gcggcgagcg gcgagcaaat cggctcccgc 669181 cccgacactc tggatccccg caccggcgag agccgcgacc gtatcggtca gcccctggta 669241 gccgaaatcg agaatgtggt tgttggccag cgcgcacacg tgcggccgca atgccgtcag 669301 cgccggcacg ttatccgggt gcatccggta gcagaccggt ttgcggtcgg cgaattcacc 669361 gtcggcggtg atcgtcgtct ccagattgat caaacagacg tcggtcgcgg tgttctcaag 669421 gaccgccaac gcctcgcccc agggccagcg ccaatccacg gggagcggaa tgcgcccgtt 669481 cacccgctcg gccaggcgaa catagccggt cgcatcccgc atataccgtt cgcgcaattg 669541 cggtttgccg ggatgaggca ggatctgatc gacgccacgg ccgagcatga cgtcaccgcc 669601 cagcagcacc gtcaccacat caggcttgcc agccactccg gaccaccgcc gccttcaggt 669661 aatcgccgta acacgcaccc tatggcgtac attgcacgtc atacgatcgg ccggcggcgg 669721 cctcgtgggt ggggccgaag gtcctcaaga ccgcgcccaa aggtcacatt gccggcgaca 669781 aaccgtgcct acctggcgga gaggtgcccg tcggcggtgg tcaccaggtg tagtcgggca 669841 gctcgaagtc gtcacgcacg ctgccggcga acagcgtcgc cagcgggccg aagttcatcg 669901 tgcgcatcgc aacgttgcga aaccacaggc cgaatcgggt tcgggtggcg gaaaaaccag 669961 atgaacttcg ccgcactggc ttgcttgccc tcgatgaagg gacgcaggcg cttctcgtag 670021 gcgtcgaagg cgcgacggtg gtcgcccccg gcgcgggcga gctccccggc cagcacgtag 670081 gcctcggtga tcgccaggcc ggtgccctcg ccgccgagca gcgagatgca cccggccgcg 670141 tcgccgatca gcagcacccg accgcgtgac cagcggtcca tccggatttg gctgaccacg 670201 tcgaagtaca ggtcctcgac gtcgtcgagg gcggccagaa tgtcccggct ttcccagccc 670261 acgtcgccga attggtcgcg cagctcatct ttgggtgcca cgccggggtt gtcgtgttcg 670321 gcgcggaaga cgaacaagaa catggtgcgg tcgccgcgca gcgcgaaccg cgccagctgt 670381 cggtcgacgg tgttgtagag gacatagctg cgctcgtcgc ggggccggta gccgtcgacc 670441 acgcaggccg cgaccttgca gcccaggtag tgctcgaaat cccgctccgg cccgaagacc 670501 agccggcgca cgttggagtg cagtccgtcg gcaccgatga ccaggtcgaa atcgcgcggg 670561 gcggtccttt cgaaggtgag ccggacgccg tcgcggtgct cgtcgatggt ggcgatgctg 670621 tcgtcgaaga tcgtttccac ctggtcttcg atcgtcgtgt agatcgcggc ggcgagatcg 670681 ccgcgcggca agctggtgaa gtcgtcgccg accatgcggc gaaagacgtc gacgcccagg 670741 tcggctttga ccttgccggt gggaccgacg gagcggacgt gttccatgtg gtaacccgcc 670801 gctgcgatct ggtccgtgat gcccattcgt ttggccacct ggtagccgac gccccagaag 670861 tcgatcatgt agccgccggt gcggaacttc ggcgcccgct cgatcactgt cggggtgtgg 670921 ccggtgcgct gcagccagtg ggcgagcgcc gctcctgcca cgccggcacc gctaatcgct 670981 actttcacac tgcaattgtg ctcttcggca atagtttaga acaagaccgg tcgctcgttg 671041 ccccttgatc aatacgttag tgagcgctaa cgtattggcg tgtgcccgac atgctggaag 671101 tcgcggcaga gccaacccgg cgccggctgc tacagctcct ggcaccgggt gaacgcaccg 671161 ttacccagct tgcgtcgcag ttcacggtca cccgttcggc gatatcgcag cacctcggca 671221 tgctcgccga agcgggattg gttaccgccc gcaaacaggg ccgggaacgg tactaccggc 671281 tcgatgagcg cggggtgctg cggcttcgtg cgctcatgga gtccttctgg agcgacgagc 671341 tggaccgtct tgtcgccgat gccgcccact acccgccgtc acaaggagac tgtgccatgc 671401 cgttcgagaa agcggtcgtc gtgcccttgg atccgaccag caccttcgcg ctcatcaccc 671461 agcccgacag gcttcggcgc tggatggccg tcgccgcgcg tatcgagctg cgcaccggtg 671521 gcgcttatcg ctggacggtg actccggggc atagcgcggc cggcaccgtc atcgacgtcg 671581 accccggcaa gcgggtggtc ttcacctggg gttgggagga ccacggcgac cccccgccgg 671641 gcgggtcgac ggtgaccatc acgctgaccc cggtcgacgg cggcaccgag gtccggctgg 671701 tccacgacgg gctgaccgcg cagcaggccg cccggcacgc caaagggtgg aaccacttcc 671761 tggaccggct ggtcgtcgcc ggccaacacg gtgacgccgg tcccgacgaa tgggccgcag 671821 cgcccgatcc gctcgacgaa ttatcttgtg ccgaagcaac attggccgtt cttcagcacg 671881 tactgcgcgg gataggcgcc tctgacctga ccaggcagac accgtgtacg gaatatgacg 671941 tttcgcaact ggcggatcat ttgctgcgct cgctggcgat catcggcgct gcggcgggcg 672001 cgcagctggc gccccgcgat gtggacgcgc cactggaaac ccaggtggcc gacgcggcgc 672061 aggccgtgat ggaagcctgg cggcggcgtg gcttggcggg cacggtggag ctgaactcga 672121 accaggtgcc tgcgacggtg ccggtcggca tcctgtgcct agaatttctg gtccacgctt 672181 gggatttcgc gattgccacc ggttctcagg tgatcgcgtc cgagccggtg tcggagtacg 672241 tactggcggt ggccggcaag gtcatcaccc cggcaacccg taactccgcg ggcttcgccg 672301 cgccggcggc ggtcggttcc tttgccccag tcctcgatcg cctcatcgcc ttcaccggcc 672361 gccagccgac cgcaggccac gtgtccgcca cctaacgaaa ggatgatcat gcccaagaga 672421 agcgaataca ggcaaggcac gccgaactgg gtcgaccttc agaccaccga tcagtccgcc 672481 gccaaaaagt tctacacatc gttgttcggc tggggttacg acgacaaccc ggtccccgga 672541 ggcggtgggg tctattccat ggccacgctg aacggcgaag ccgtggccgc catcgcaccg 672601 atgcccccgg gtgcaccgga ggggatgccg ccgatctgga acacctatat cgcggtggac 672661 gacgtcgatg cggtggtgga caaggtggtg cccgggggcg ggcaggtgat gatgccggcc 672721 ttcgacatcg gcgatgccgg ccggatgtcg ttcatcaccg atccgaccgg cgctgccgtg 672781 ggcctatggc aggccaatcg gcacatcgga gcgacgttgg tcaacgagac gggcacgctc 672841 atctggaacg aactgctcac ggacaagccg gatttggcgc tagcgttcta cgaggctgtg 672901 gttggcctca cccactcgag catggagata gctgcgggcc agaactatcg ggtgctcaag 672961 gccggcgacg cggaagtcgg cggctgtatg gaaccgccga tgcccggcgt gccgaatcat 673021 tggcacgtct actttgcggt ggatgacgcc gacgccacgg cggccaaagc cgccgcagcg 673081 ggcggccagg tcattgcgga accggctgac attccgtcgg tgggccggtt cgccgtgttg 673141 tccgatccgc agggcgcgat cttcagtgtg ttgaagcccg caccgcagca atagggagca 673201 tcccgggcag gcccgccggc cggcagattc ggagaatgct agaagctgcc gccggcgccg 673261 ccgcccccgc ctgcgccccc ggccccgccg cggccgtcgg cgccggggct gccgaactgg 673321 ccaggctggc cggattggcc gatgatggcc aggggcccga ggtgtgcggt gccgccggtg 673381 ccaccggtgc cacccttacc gccagcccca gggatcggga ataaaccgcc ggggtcggcc 673441 cctttgccgc cgtccccacc tcgcccgccc gccccagcgg tcctgaagcc gtcgccaccg 673501 tgcccgccgt ccccgccatt cccaccggaa ctggcatcaa ggccgtcgcc gccgaagccg 673561 ccccttccgc cgtcaccgcc ggcgctgacg gtgctggtgc cgccggcgcc gcccatgccg 673621 ccggtgccgc cggggccaaa ggcggagcca aggccgccac tgccgccgac gccaccgttt 673681 ccggcgcggc cggccgcccc tgtcgcaccg gtcgcgccca gggtggaacc ggtgccgccg 673741 gcaccgccgg caccaccggt gccgccggtg ccgccggtgc cgccatttcc gccagtcccg 673801 ccagtgccag cgaggctgct gaagagagtg ccgtgggcac ctctgccgcc gtcgccgccg 673861 gtgccgccgg tgccgccggt gccaccggcc ccaccatctc cgccggcgcc ttggctgccg 673921 ttgttgcccg ttggcgacag cgctttgccg ccggccccgc cgttgccgcc gccgccgccg 673981 gcgccgccgg tcccgccaac cccgccggtg ccaccgttac cgccgtgacc gtccgcgcca 674041 gcgtcgaatg tgccggtcgc accggtggcg ccggtggtgc cccgcaggcc cgtcccgccc 674101 gtgccgccgg ccccgccccg gccgccgtca gcgccgtcgc cggcgacgct cccaccttgc 674161 ccgcctacgc cgccgtcgcc gccgcggccg ccgctgccgg taatggctcc gggattgccg 674221 tcactaccgg tgccgccgtc tccgccattg ccgcccgctc cgccgttgcc aatctgcccg 674281 gcgtttccgc cggcgccacc ggttccgccg tcaccgccca tgcccctgct ggcattgccg 674341 ccgttgccgc cgtggccgcc ggccccaccg ctgccgcgca ggctgccgtt gccgccgttg 674401 ccgccgttgc cgccggccgc gccgttgccg ctgagggcat ggtcgccgtt gccgccgttg 674461 ccgccgttgc cgccgttgac gtgaatgctg ctgcttgagc cggtcgcacc gaaagtggag 674521 ccggcgccgc cactcccgcc ggccccgctg gggccggcgt tgccgccgtt gccgccgttg 674581 ccgccgatgc cgttgttggt gaacacgctg ccgttagcgc cgttgccgcc gtcaccgggg 674641 tccccgccgg tgccgccgct gccgccgttg ccgccggcgc cttggctgcc ggttgtgccc 674701 gccggcccgg ccccgcccgg cccgccggtc ccgcctcggc cgccctttcc gccggccccg 674761 ccggcgccgc catcctggcc gcgggcaccc gcggtggcgc cgtcggcgcc gtcaatgccg 674821 cggccgccgt taccgccaac tccgccggtc ccaccgtcgc cgccggcacc gccggggcct 674881 tggctgccgg cgacgccgtt gggtgcggcc ccgccgtccc cgccgtcccc accttttccg 674941 ccggtaccgc caactccgcc ggtgccgccg gggtgcccgt ccgcgcccgc gctggaaccg 675001 ttgacaccgt cgctgccgga ccctccagtc ccgccgacgc cgccggtgcc gccggccccg 675061 ccggtgccac cgttgcccgc ccaggcgccg ccggatccac cggccccacc gtttccgccg 675121 gtgccgccat ccaggccggg gttgccgagc ctgcccagac cgggcaggcc tttgctgccg 675181 ttgccgccgg cgccgccggc gccgccgttg ccgaccaaac cgccatcacc gcccctgccg 675241 ccggacgcgc cggtctggcc aaagccggtg gcatcggcgc ctctgccgcc gttgccgccg 675301 ttgccgccgc tggtgggggt gttgccgggt gcgccgttgg caccgggggt ggagccgctt 675361 ccgccctggc cgccggcacc gccgacaccg ggatcaccgc cgtggccacc ggcgccacct 675421 acaccaccgt tgacaccgag cgcgccggcg gcgccgtgac cgccgttgcc aggagtcccg 675481 ccgttcccgc cggctccgcc gtcaccgcca gcgccctggc tgccgttctg gcccgaggcg 675541 gccaacgcga gaccgccggc cccgccctcg ccgccggctc cgccaggccc accgttaccg 675601 ccattcccgc cgggtgagcc tgcggccccg ggagcggacg cattgaagcc gatgctgcca 675661 gcacctccgg atccgccatc gccgccggcc ccgccagcac ctccggtgcc gccgtcaccg 675721 gcctgagttc cgccgttgcc gccggccccg ccggtgccgc cggccccgcc ggggcgaccg 675781 ggcgcttcgg atccaaatcc gagaccgccg gccccgccgc ggccaccggc cccaccggca 675841 ccgccattac ccacctgacc gccgtcgcca cccctgccac cgttcgcgcc ggtctgtccg 675901 ctgctgatag cgtcggcgcc tttgccgccg tcgccgccgt taccaccgct ggtggaggtg 675961 gtgccgggcg cgccgttcgc gccatgcgcg ctgccgccga cgctggcgcc accggcgcca 676021 ccggccccac cggcgcccgg gttgccgcca ttgccaccgg tcccgccggc accaaggttg 676081 tgaccccacg tcccggtagc gccgttgccg ccgtcaccgg gagctccgcc gtcaccgccg 676141 ctaccgccag ccccgccggc gccgtggctg ccgccgaggc cgagcagacc gtggccgccg 676201 ccgggcccgc cgaccccgcc ggtcccgcca gccccaccat tcccgccgtt tccgccggct 676261 tgaccgtcag cgcccaagtt ggtggcgtgg gcgccgctgg cgcccgcacc gccggcgccg 676321 ccgggcccgc cctcgccgcc ggccccgccg ttgccgccgt tgcccatcag caccccgccg 676381 gccccgccgg ccccgccgtt gccgccgatc ccgccggccc cgccagcggt gccggatcca 676441 cccggtgtgc tggccgacgt acccgtgaca ccggcgatgc cgttgcctcc ggccccaccg 676501 gccccgccga caccgaacaa cccggcggta ccgccggccc cgccgttgcc gccgaccgcc 676561 ccggccccgc caaaaccccc ggcgcctccg ttgccataca gccacccgcc cgcgccgccg 676621 tgaccaccgg ccccgccggt ggtacccacg ccgccggctc caccgttgcc gccgttaccg 676681 attaggcccg ccgccccgcc ggccccgcct cgttgtcctg gcgccccaga cccgccgttg 676741 ccgccgttgc cgtacaagat gccgcctggc ccgccggcct gcccggtccc gggggagccg 676801 tcggcgccgt tgccgatcag cgggcgtccg aacagcgcct gggtgggcgc attgaccgcg 676861 gctagcaaac tctgttcaac gttgaccgcc tcggcggcca cgtacgagct cgcggccgcg 676921 gacagggtct gcacgaaccg gtcatgaaac gtcgccactt gggcgctgac ggtctgatat 676981 tcctgggcgt gcgtgccgaa caacgccgca acggccaccg acacctcgtc ggctgacgcg 677041 ggcagcactt tcgccaccgc ggccgctgcg gtgttggccg cagtgatcgt cgaaccaatt 677101 ttcgccaaat ccgttgccgc cgtggtcagc atctccggcg tcgcgattac gaacgacatc 677161 tcgctcccca ggtcaggtca gcccggtgtt gcccggcgtg gcaaggaatt gtgtggctgt 677221 cccggcgatc taccatgtgg agcgaatctt cgggatccca actccaacga tcccttgttg 677281 acgctatcgt caaaagggca aaaccccaaa ctttacgcga acgaactatc cacagtgcac 677341 cctcgatttc cgtcgacacg tgcaaacggc cagacctcga cggtgctagc cccgcggcga 677401 tattgcaggt cttcgagccg gtcgcgcccc ggggcgcgaa ctccgttgcc ctcccgcgac 677461 cctgcgggag aggataagga atggtcggct atgtggatgt ccgggcatac gccgagctca 677521 acgagttcgt ggagctgcag gcgcgcggtc tgacggtgcg ccggccgttc cgcagccatc 677581 agacggtcaa agatgtgctg gaggcgatgg gcattccgca taccgaggtg gatctcatcc 677641 tggtgaacgg cgatcccgcg gacttttcct accggccggt cgccggcgac cgcattgccg 677701 cctaccctat gttcgaggcc ctcgacatcg ggtcgaccgc caggttgcgc ccagcgccgt 677761 tgcgtaaccc gcgcttcgtc gtcgacgtca acctcggcca gctggcgcgg ctgcttcggc 677821 tgttgggctt cgacacacgg tggtcgagtg ccgccgatga tccgacgctg gccgatatca 677881 gcctgggcga gcagcgaatt ctgctgaccc gcgaccgcgg cctgttgaag cgccgggcaa 677941 tcacccatgg tctgttcgtc cactcccagc acccggagga gcaggcgctc gaggtgctgc 678001 ggcggctaga cctcaacggg cggctggcac cgctatcccg gtgtctgcga tgcaatggtg 678061 agctggccgc ggtttccaaa gacgaggtga ttggccagct ggagccgttg acccgccggt 678121 actacgagtc attcagccgc tgcttcggtt gcgggcggat ctactggccg ggatcacacc 678181 acgcacggtt ggttcgcctc gtcgaacgac tgcgggacca gctaactact tcgacctgac 678241 ccgcacggtg gtgcgcgcgt cgatcgtcgc cagctgacac gccgaaggtg caaccacggc 678301 ggcatcgagc ggcgtgtccc cgccaccaat gcacgttcgg cgcggccggc gcacgctcgg 678361 cgcggagcta cgaattgtcg gccggagtca accgaatggc taccagcttg agccggtcca 678421 ccgcctcggc gaactcctcg agcgtgggta tgcgccggtc gcggaagctc aaccccagca 678481 tgcgttggcc acgcttgacg ccataggcct gtgcggcgcg caggaaaagt tcagaaacga 678541 ccgcacggtc ccggatgagc tcgccacgca tagcggtcgt cttgccgtcg tagacgacct 678601 gggcggcggc accgtcggag aagttgtgct tccatccggc ctcggtcagc gcgtagaggt 678661 cgttgtcgat gacgtgcgcg ctcaagggaa tcgagaagtg ccgcccagtc tttcgcccgg 678721 tgaagctcac caccatcagc tgtgtgcgta gcgggccggc aagcggggtg tgcagcaggg 678781 agcgcaggat cgggttgacg aggcgaagga gggccgccgg tgggtgtgcg atgtctaccg 678841 catacgactg atctgtcatg ccttcaccgt agatccgatc ggggttcgcg gctacgccga 678901 caagttggtg acgcaacaag atatatggcg ccaccggtag taccatacgt atgtggacaa 678961 gacgacggtc tacctgccgg atgaactcaa ggcggccgtg aagcgcgccg ctcggcagcg 679021 cggagtctcc gaagcgcagg taatccggga gtccatccgg gcggcggtcg gcggcgccaa 679081 gccgccgccg cgcgggggtc tatatgcggg ttcggagccc atcgcgcggc gagtcgacga 679141 gctgctggct ggcttcggtg agcggtgatc atcgacacga gtgcgctgct tgcctatttc 679201 gacgccgccg agccagacca cgccgcagtg tctgagtgca tcgatagctc cgcagacgcg 679261 ctcgtcgtat ccccttatgt ggtagcggaa ctcgactatc tcgtcgccac ccgggtaggt 679321 gtcgatgccg agctcgccgt cctgcgtgaa ctcgccggcg gggcctggga gctcgccaac 679381 tgcggtgccg ccgaaatcga gcaggccgcc cgcatcgtca cgaaatacca ggatcagcgg 679441 atcgggatcg cggatgcggc caacgtcgtg ctggccgacc gataccgcac gcgcacgatc 679501 ctcaccctgg accgtcggca cttctcggcg ctgcggccga tcggcggtgg gcgcttcacc 679561 gtcattccgt aaaccgcaac cgattcggtg ctgcaccgcg gcgtgttcgt cttccgcgtg 679621 cgatccgtcc cttagggcgt gatggtcgtc tgctcgtcga tgacgttggc ggcgtccatc 679681 aacgtcatcg tctcgtcgtc gagcgcgtcg gcgttgagtt gaagcacgaa caccgcacct 679741 tggctgggaa tcaccaccgt cttctgcgcg acggtccgca acttgccgtt cttgctgtat 679801 gaaccaccga gctgccatgc tgaaaagccg ccgagcgtgg ctgcacttcc gtcgccgctg 679861 ccttggaagc cgggcaggtt tttcaactcg ccgggtgcga attggaggac cttcgcgggg 679921 tcgatgtcac cggtgagttt ggagaggatc gcaacgatgg tggggggatc gttgggatcg 679981 gcgggctggg tgtagacgat gccgccatag ggtgcgcggg agctttccgg aagcagccgc 680041 caatcgtcgg gcaccggcag gtcgatggtc ggggagccgg ggtcgccgtg gtgcactggg 680101 gtctcctgga tgtggttgtc ccggatatag tcggcgatgg tgtagttggg ccccgctgcc 680161 tgagccgagg tggttgccga cgtagtcgtt gtcgacgtcg tcggggacgt cgtggttggc 680221 gacgtggttg gcgcgctgtc ggtcttgatg ttgaaactgc agccagccag tgccaggctc 680281 agcgccaccg tcgcgacggc cgccgtgaag tgcttcattg cgcgctcccg aagattggac 680341 cggcacttcc ggccggtgag gtcggattga gactagtcca actggtgtgc gcgcgaccct 680401 atcactgcaa tcccacctcg attgaccgca aaacaccgcg ggaacaggcg tctatgcagt 680461 aagagacagc tatgcgggca cgcaggttgc gcagagccct ggccgcgctc ttggcggtgg 680521 cgggtctgtt tgttccgttc attgttggcg tgcccacggc ctacgacggt gagccggtgt 680581 tcgtcgccat tccggtcgag catgtcaata cgctcatcgg caccggcacg ggagccgcga 680641 tagtggggga gatcaacaac tttcccggcg cctcggtgcc gttcggcatg gtgcagtact 680701 cgccggacac cgtcgacaac tacgccggct acgactacgg caacccgcat tccaccggat 680761 tcagcatgac gcacgcgtcg gtgggctgcc cggcgttcgg cgacatctcg atgttgccca 680821 cgaccacccc gctcggctcg cagccgtgga gcgcctggga ggagatcgcc cacgacgaca 680881 ccgaggtcgg cgtgcccggc tactacaccg tacggttccc cggtaccggg gtgatcgccg 680941 agctcaccgc caccacccgc acgggcgtcg gccggtttcg ctacccccgc aatgggtggc 681001 cggcgctgtt tcacgtgcgc tccggcgcat cgttggcggg caactacgcc gcgacactgc 681061 agatcgagga caacaccaca atcaccggct cggcgaccag cggcgggttc tgcggcaaga 681121 agaacctgta cacggtgtac ttcgccatga agttcagcca gccgttcagc tcgtatggca 681181 cctgggacgg ctacgcggtc tatcccggtt cacacagcat gaattcgagt tacagcgggg 681241 ggtatgtcgg gtttccggcc ggctcggtgc tcgaggtgcg gaccgccctg tcctatgtga 681301 gcgtggacgg ggcgcgagcc aacctggacg ccgaaggcgg agcaagcttc gacgacatcc 681361 gtgcggcgac atcgagcgaa tggaacgccg cgctatcgcg aatcgcggtg gccggcaggg 681421 ggcctggcga cgtggacacc ttctacactt gtctttaccg gtcactgttg caccccaaca 681481 cctttaacga cgtggacgga cgttacatcg gattcgacgg tgtcatccac agcgttgcca 681541 gtgggcacac ccactacgcc aatttctccg actgggacac ctaccgcagc ctcgccccac 681601 tgcagggact gttgttcccg caacgggcca gcgacatgat ccagtcgttg gtgaccgacg 681661 cggagcagag tggtgcgtat ccgcgttggg cgctggcgaa ttccgcaacc ggcatgatga 681721 gcggagacag tgtggtaccg ctcatcgtaa acctctacgc cttcggcgcc agggatttcg 681781 acctcaaatc cgcgctgcac tacatggtga atgcagcgac ccagggcggt gtcggacttg 681841 acggtttcct ggagcggccg ggaatcgccg cctatctgag gctcggctat ggaccacaaa 681901 cggcggaatt ccgcgccaac ggtcgtatcg ccggcgcctc ggtcacgctg gagtggtcgg 681961 tcgatgactt tgccatctcc cgattcgctg attcgttggg cgataccgca actgccgccg 682021 tcttccagaa ccggtcgcag tattggcaga acctgttcaa tcccaccacc ggctatatct 682081 cgccccggag cgcggccggt ttcttccccg acggtcccgg gttcgtggca tacccctcgg 682141 gctttgggca ggacggatac gacgagggca acgccgaaca atacctgtgg tgggtgccgc 682201 ataacgtggc cggtttggtg accgcgcttg gtggccgcac ggccgtcgtc aagcggctcg 682261 accgctttac caaaaagctc aacgtcggcc ccaacgaacc ctatctgtgg gccggtaacg 682321 agcccggttt cggggtgccc tggctgtaca actacatcgg ccaaccgtgg aaaacccagc 682381 ggacggtcga ccgggtccgc gggctgttcg gcccgacacc tggcggtgcg ccgggcaacg 682441 acgacctcgg cgccctgtcc agctggtatg tctgggctgc ccttggcctg tatccgagca 682501 ccccgggaac caccatcctg accgtgaaca caccgctttt cgatcgcgcc gtgatcgcgc 682561 tccccaccgg aaagtccatt cagatcaccg cgccgggcgc atccgggcgg aaccgcctga 682621 agtacatcga cggcctgacc atcgaccgcc aaccgagcaa ccagacgttt cttccggagt 682681 cgatcgtgcg caccggaggc gacctgacct tctcgctcgc cggcacaccc aacaaggtct 682741 ggggaaccgc ggcgtctgcc gcgccgccgt cattcggtgc gggcagctcg gcggtgacgg 682801 taaatatcgc ccggcccatc atcgggatcg tgccgggagc gaccgggacc gtgaccgtcg 682861 acgcgcaacg gatgatcgac ggcgtcgacg actacactgt caccccaacg tcctacgttg 682921 ttgggattgc ggcggaaccg ttatccgggc aattcgacga tgacggagcc gtgagcgcgt 682981 cggtcgcgat caccgtagct cgatcggtgc cgtcggggta ttacccgatc tatgtcacca 683041 ccagcgccgg ggatagtgcc cggacattga tcgtgctggt cgtggtcgcc gaggcggtgg 683101 aatgatcatt gcgcaagcgc agaggagtta gatcatttcg tgtctggtca gccagtgcat 683161 cacctgccag ccggcgaata ccggtagcca acaggtcaat agtcgataca gcagcaccga 683221 cggcacaccc aatgctgcag gtacaccgaa ggcggcgagc ccaccgatca gcgccgcctc 683281 caccgcgcca accccgcccg gggtgggggc ggccgaggcg agggtgccgc cgaccatcgt 683341 caccacggtc accgtgacga acgtcgttcc gccgccaaag gcttcgatac tggcccacag 683401 tgccaacgca gctccgagcg tcgttccggc acaaccgagt acgatcaacg ccagtcgctt 683461 cggctcccgg gccaacgcaa tgaggtcatt cgttacctcc ctgagcttcg gccgcaccgc 683521 cgtcgctagc cagcgtcgca gcttcggcac gaagaggaat gtcccgacaa tgcctagggc 683581 cacaccggca atgaggtaga gcaccgtggc attcgggacg aaatgagata ggtcggtcga 683641 ggtgccggcc agggcgctga acaggatcag cagcacgagg tggacgatca cctgtaccga 683701 ctgctgcagt gccaccgccg cggtggcccg cactgcggtc agccctccct tctgcaagaa 683761 ccgggtactc aacgctagcc cgccgacgcc ggccggggta gtcgttgcag caaaagtgtt 683821 ggctacctgc atgattgaca gcttccagaa gcccaccagc ccatcagcgc aggcccacaa 683881 cgccgctgcc gcaccgacat acgtcagcgc cgacaccgct aggcccagta gcgcccacca 683941 ccagttcgcg gttcgcagct gggaaaagaa cgtgggcacg gtactgatga aagggtaagc 684001 gacatagacc agagcaccga ttaacaccag ttgaatgagc tggccgcggc tgaaccgggt 684061 gatcgtttcg gctttgatct gatccgcgcc cgtttgccgc atcacctcgg cgcgtgtgct 684121 ggcgatgacc gcatttgggt cggttatcga ctctcggatt cgttttggca cagcggattt 684181 ggtaagtctt cgcgatgccg ccaggatggc ttgcttgccg aacgtgtcaa tggctgcggt 684241 cacggcggcc tcggcgtcat acagcgccga cgtcgtcacc aagagttggg ccaggtcgga 684301 ttggagttgg gcgtcggtgg cgccgtactc ggcctcaccg aacccgccga acagcaccgc 684361 gccgttgtcg acggtgatct cggcactaca caggtccccg tgggagatct gctggtcgtg 684421 cagggtccgt agcgcctccc agacatgggc agtcggcgtg gttttggtgc attcgctgat 684481 gccgattccg cgagcgggcc ggtgtgcata caacgtccat ccccggtcga gcggggacac 684541 cgcgatcacc gtcgtgttgg ccatgcctag atcgccgaag gcaatggcca tcagcgcgcg 684601 atgctcgacc gcacggcgca tggaggcttg caggggtgcg gtctcggtgc cgcgcaacgt 684661 cagcttcagc cagagttggc gcagcgcgcc gccgccactt tggtgcgggc cgtacaactc 684721 gatcaatgcc tcgctgcacg ccccggcgtt gggctgctcg caagcggccg acagtaccag 684781 tggcccgggc ccggccggcc gcacaaccgc gagcccggac accgcgaatc cgcgttttgc 684841 caacgcgcga atggcaccat ccagtggcac ttcaagcgct ggtgtgccga cgaccaggac 684901 caccaacgcg ccgaccaacc accccaccgc cagccccaac aatgagcggg ccggcacaat 684961 cgcgctgaca accagatgga tcggcacgaa tgccaacagc agcgcccacc accagtgccg 685021 ccagcgcgcg ggcagccagg gacccgacac ggtgagcacc gccgcgagta tcgcgatcca 685081 tcgcgggtca tcgagaaact gggccagcaa tgtggcgagc cggtcggaaa ggtcaaagtg 685141 ccatcggggt gccgcgatgc ggctactgct gatcgacaac gggagaacgg ccataagtcc 685201 ggcggccgca tacgcgccca gcagcttcca ctgccgggaa acgatcaggc caatcaggat 685261 cacgaacggc aacgccaaaa tcgccaggcc gtaccccagg tacaccagat cggattgcga 685321 cggggacagc accccgacga tctccgagat ggatttctcc agcgccaccc actgcgggcg 685381 ggtgatcagc gaactcgtga tcaccgccac gaggtagatc gccgccagca ccgcccggat 685441 gatgtcgttg gtgcgccggg tcagtggttg cagcaagtta ccggaaacgc cgatgtcgcg 685501 tccgtcaact cgcatgttct aacgatcttc cggatcaggg cccgcggtgt ctggtgccgt 685561 ttcgcggctc cgcggacaac ttagcccgat aactgcgtgg ggtgtcggtc tgaccacttg 685621 acgtcttacc aatcttcatt cacactgggc gcatggcgct gcagccggtg actcgccgat 685681 cggtgcccga agaggtcttc gagcagatcg ctaccgatgt gctcaccggc gagatgccgc 685741 ccggcgaggc gttgcccagc gagcgtcggt tggctgagtt gctcggagtg tcgcgacccg 685801 cggtccgcga ggcgctcaaa cggctgtcgg ccgcaggtct ggtcgaggtg cgtcagggcg 685861 acgtcaccac cgtgcgtgac ttccggcggc acgccggcct ggatctgttg ccccgattgt 685921 tgtttcgcaa cggtgagctg gatatctccg tcgtccgcag catcctcgag gcccggctgc 685981 gcaattttcc gaaggtcgcg gaactagcgg ccgaacggaa cgagcccgag ttggcggaat 686041 tgctgcagga ttcgctgcgt gcgctggaca ctgaggaaga tccgatcgtg tggcaacgcc 686101 acacgctcga cttttgggat catgtggtcg acagcgccgg ttcgatcgta gatcgattga 686161 tgtacaacgc atttcgtgct gcttacgagc cgacgctagc tgctctgacc accacgatga 686221 ccgctgcggc taagcgtccg tcggactacc ggaaactcgc ggatgcgatc tgctcaggtg 686281 atcccaccgg agcgaagaaa gccgcccaag acctactcga acttgcgaac acatcgttga 686341 tggccgtact cgttagccag gcgagtcggc aatgaccacc cacgccgtga tcatcaccta 686401 tctccgcgac cagacgcagc ccgccgtcga tgcgatcggc gggttctacc ggacatgcgt 686461 actgactggc aaggcgctgg ttcggcggcc cttccattgg cgtgaggcga tcgagcaggg 686521 ctggttcatt accagcgtct cgttgctgcc aaccctggcg gtgtcgattc cgttgaccgt 686581 gttgatcatc ttcacgctca atatcctgct ggccgagttc ggcgccgccg acatctccgg 686641 cgccggcgcg gcgctaggcg cggtcaccca gctgggcccg ctgaccaccg tgttggtgat 686701 tgcgggcgct ggagccacag cgatctgcgc cgacctgggt gcccgcacca tccgggaaga 686761 gatcgatgcg atggaggtgc tgggcatcga ccccatccac cggctggtgg tgcctcgggt 686821 cgttgccgcg accatcgtcg ccgcactgct caacggcgcg gtgataacca ttggcctggt 686881 tggtggtttc gtcttcagtg tcttcatcca acacgtctcg gccggcgcct acgtgggcac 686941 gctcaccttg gtcaccggtc tacccgaggt gatcatctcg gtggtcaagt cggcgacgtt 687001 cggcctgatc gctggcctag tcggctgtta ccgcgggctg accacgaaag gcggccccaa 687061 gggagttgga accgccgtca acgaaaccct ggtgctgtgc gtgatcgcgc tgttcgcgac 687121 caatgtggtg ttgaccacga tcggcgtgcg gttcgggacg ggacactagc atggtggagt 687181 cttcaacggc atcagcggca gccgtattgc gggcccgcta cccacgcaca gccgccagcc 687241 ttgaccgcta cggcggcggc acggcccgaa gacttgagcg gacagggact ttcgcgagat 687301 tcacccggat cagcgtcgtg cagatcggct gggcactgcg tcgctatcgc cgggagacgc 687361 tgcgcctggt cgccgagatc gggatgggca ccggcgcgat ggccgtcgtc ggcggcacgg 687421 tcgcgatcat cggttttgtg acgctgtccg gcggctcgct gatcgccatc cagggcttcg 687481 cgtcgctggg caacatcggt gtcgaggcgt ttaccggatt ctttgccgca ctggccaaca 687541 cacgcgtcgc tgcgcccatt gtctccggtg tcgcgctggc cgcgacggtg ggcgccggcg 687601 ccaccgcaca gttaggtgcc atgcggatca gtgaggagat cgacgcgctg gaagtgatgg 687661 gcatcaagtc gatttcgttt ctggtctcca ctcggattct aggagggctg gtggtgatca 687721 tgccgctgta cgcgctcgct ctcgacatgg ctttcacctc tggtcaggtg gtcacaaccg 687781 tgttctacgg ccagtccaac ggcacctatg agcactactt ccgcaccttc ctgcgcccag 687841 aggatgtggg ttggtcggtc gtggaggtgg tgatcatcgc ggtggtggtg atgatcaccc 687901 attgctacta cgggtacacc gccagcggtg gcccggttgg ggtcggccag gcggttggtc 687961 gatcgatgcg tttctcgctg gtctcggtgg tggtcgttgt cctgctggcc gagttggcgc 688021 tctacggcgt cgacccgaac ttcaatctca cggtgtagcc gcggtgccaa cgctggtgac 688081 gaggaagaac cgacgtgcgt ggctgtatgt ggagggtgtt gtcctgctgt tggtgggcgc 688141 gttggtgctc gtattggtgt acaagcagtt tcgtggggaa ttcacgccga agaccgagct 688201 gactatggtc gcctcccggg ctgggctggt tatggaagct ggatccaaag tcacctacaa 688261 cggggtggag atcggccggg tgggcagcat ttcggagatt gagcgtgacg gccggccggc 688321 ggcgaagctg gttttggacg tgaatcctcg ctacatcagc ctgattccgg tcaatgtggt 688381 ggccgatatc gaggcggcca ccctgttcgg caacaagtat gttgcgctgt ccgcgccgaa 688441 aattcctcaa cagcagcgga tttcctcaca tgacgtgatt gatgtggggt cggtgaccac 688501 cgaattcaac acgttgttcg agacgatcac ctcgatcgcc gagaaggtgg atccgatcga 688561 gctgaacgcg acgctgtccg cggtagcaca ggcgccggat gggctgggcg gcaagttcgg 688621 tgagtcgatc gttaatggca atcagattct ggcgcaatta aatccgcggc tgccgcagct 688681 cggctatgat gttcggcggt tggcggatct cggtgaggtc tatgtcgatg cttcgccgga 688741 tctgtggtcc tttctgcaga acgcactgac cactgcgcgc acattgacca gccaacagcg 688801 cgatctggat gccgcgttgt tggcggctac gggtgcgggc aacaccggtg aagacgtttt 688861 tgctcgaggc gggccgtatc ttgcgcgcgc agccgccgat ctggtgccca ccgctacgct 688921 gctggacacc tacagtcccg aactgttctg catgatccgc aactttcacg acgctgcgcc 688981 caaagtcgcg gacgcggtgg gcggcaacgg ctattcgcta gcggccgccg gaacgatttt 689041 gggagcaccc aatccctatg tctatccgga caatctgccg cgggtgaatg cccacggtgg 689101 acccgggggc cgaccgggct gctggcagac gatcacccgg gagctgtggc cggcacccta 689161 tctggtgatg gacaccggtg ccagcctcgc accgtacaac cacgtcgagc tcggccaacc 689221 gatgttcact gaatacgtat ggggacgcca atacggagag aacacgatca acccatgaaa 689281 accacaggca caactatcaa actcggcatc gtctggttgg tgctgtcggt gttcaccgtg 689341 atgatcatcg tggtgttcgg gcaggtgcgg ttccatcaca ccaccgggta ctccgcggtg 689401 ttcacccatg tcagcgggct gcgggccggg caatttgtcc gcgctgcggg cgtagaggtc 689461 ggcaaggtcg ccaaggtaac gctgatcgac ggggacaagc aagtattggt ggacttcacc 689521 gtggatcgct cgctgtcact ggatcaggcg acgaccgcct cgatccgcta cctcaacctg 689581 atcggcgacc ggtaccttga gctcggccgc ggtcacagcg gtcagcggct ggcgccgggt 689641 gccacgatcc cgctcgagca cacccatccg gccttggatc tcgacgctct gctcggcggg 689701 tttcgcccac tcttccaaac gttggaccca gacaaggtca acagcatcgc ctcctcgatc 689761 atcaccgtgt tccaagggca aggcgccacc atcaacgaca tcctcgacca gaccgcctcg 689821 ctgacggcaa cgctggccga ccgggaccat gcgataggtg aggtcgtcaa caacttgaac 689881 accgtgctgg ccaccaccgt caagcatcaa acggaattcg accgcacggt cgacaagcta 689941 gaggtgctga tcactggact gaagaacagg gcggacccgc tggccgcggc ggcggcacac 690001 atcagcagcg ccgcgggaac cctagccgac ctgctggggg cggatcgtcc attgctgcac 690061 agcagcttcg ggcacctcga gggcatccag cagccgctca tagacgagct ggcagaactc 690121 gaccacgtgt tgggcaagct gccggacgcc taccggatca tcggccgcgc cggcggcata 690181 tacggtgact tcttcaactt ctatctgtgt gacatctcac tgaaagtcaa cggattacag 690241 cctggaggtc cggtacgcac cgtcaagttg ttcggccagc cgaccggcag gtgcacaccg 690301 caatgagaac gctgaccgag ttcaaccgcg gccgtgtcgg gatgatgggt gcggtggtca 690361 cggtgctcgt cgttggtgtt gcgcaaagct tcaccagcgt gccgatgctg ttcgccacac 690421 ctacctacta tgcgcaattc gccgacatgg gtggcatcaa cacgggcgat aaggtggaaa 690481 tcgctggggt gaacgtcggg ctggtgcgct cgctggcaat ccgcggcaac cgcgtgttga 690541 tcggattctc gttgcccggc aagacaatcg ggatgcaaag ccgggcagca attcgcaccg 690601 acaccattct tggccgtaag aacctggaga tcgaaccccg cggttcggag ccgttgaaac 690661 ccaacggttt cctgccgttg gcgcagacca ctacgccata ccaaatctat gacgcgttcg 690721 tcgatgtcac gaaggcggcg acgggctggg acatcgatgc cgtcaaacgc tcgctaaacg 690781 tgttgtcgga gacattcgat cagaccgccc cgcatctaag tgccgccctc gagggtgtca 690841 aggcattctc cgacaccgtc ggccggcgcg gcgagcagat cgagcaactg ctggcgaacg 690901 ccaacaggat cgcgcgcgtg ctcggcgacc gcagcgagca ggtcaacggg ctgctggtga 690961 atgccaagac gctgctggcc gcgttcaagc aacgcagcca ggcactgcgc attctgctaa 691021 ccaacgtgtc ggaggcatca gcccaggtat ctggcctgat cacagacaac cccaacctca 691081 accatgtgct ggcccagttg cgcacggtca gcgaggagct ggtgaagcgc aagaacgaat 691141 tggccgatgt agccgtcttg ctcggcagat acaccgcggc cctgacagag gccgtcggtt 691201 ccggaccgtt cttcaaggcg atggtggtca atctgctgcc ctaccagatt cttcagccct 691261 gggttgacgc ggcgttcaaa aagcggggca tcgacccgga gaacttctgg cgcagtgcgg 691321 gtctgccgga attccgctgg cccgacccca acggcacccg gttccccaac ggcgcgccgc 691381 cggcggcgcc accggtgcgg gagggtacac ccaagcatcc gggaccggcc gtcccgccgg 691441 gaacgccgtg ctcctacaca ccggcggcgg gcgcgttgcc acggcccgac aacccactac 691501 cctgcgcggg cgccaccgtt ggcccgttcg gtggacccga cttcccggca ccgctcgatg 691561 tccagccgtc gccgcctaat cccgatgggc cgccgccgac gccgggcatc ctaagtgctg 691621 ggcggccggg cgagccggct ccggctgttc cgggcatacc gatgccgctg ccgccgaacg 691681 cgccgccggg tgcccgcacc caaccgctgg agccgtttcc tgacgggacg ggaggtagca 691741 accaatgagc accatcttcg acatccgcag cctgcgactg ccgaaactgt ctgcaaaggt 691801 agtggtcgtc ggcgggttgg tggtggtctt ggcggtcgtg gccgctgcgg ccggcgcgcg 691861 gctctaccgg aaactgacta ccactaccgt ggtcgcgtat ttctctgagg cgctcgcgct 691921 gtacccagga gacaaagtcc agatcatggg tgtgcgggtc ggttctatcg acaagatcga 691981 gccggccggc gacaagatgc gagtcacgtt gcactacagc aacaaatacc aggtgccggc 692041 cacggctacc gcgtcgatcc tcaaccccag cctggtggcc tcgcgcacca tccagctgtc 692101 accgccgtac accggcggcc cggtcttgca agacggcgcg gtgatcccaa tcgagcgcac 692161 ccaggtgccc gtcgagtggg atcagttgcg cgattccatc aatgggatcc tccgccagct 692221 cggcccgacg gagcggcagc cgaaggggcc gttcggcgac ctcatcgaat cggccgcgga 692281 caacctggcc ggcaagggca ggcagctcaa cgaaacgctg aacagtttgt cgcaggcgtt 692341 gaccgcgctg aacgagggcc ggggagactt cgttgcgatc acgcgaagcc tggcgctatt 692401 tgtcagcgcg ctctaccaga atgatcaaca gttcgttgcg ctcaacgaaa accttgccga 692461 gttcaccgac tggttcacca aatccgacca tgacttggcc gacacggtgg aacggatcga 692521 cgacgttctc ggcaccgtcc gaaagttcgt gagcgacaac agatccgtgc tggctgccga 692581 tgtcaacaac ctcgccgacg cgaccactac actagtgcaa cccgagccgc gggacggtct 692641 ggaaaccgcg ttgcacgtgt tgccgaccta cgccagcaac ttcaacaacc tttactatcc 692701 actgcacagc tctctggtgg gccagttcgt gttccccaac ttcgcgaacc caattcagct 692761 catttgcagc gctattcagg ccggcagccg actcggctat caggaatccg ccgagctgtg 692821 cgcgcagtac ttggcaccgg ttctggacgc tctcaagttc aattacttgc cgttcggctc 692881 aaacccgttc agttcggcgg ccactttgcc caaggaggtg gcttactccg aggagcggct 692941 ccgcccgccg cccgggtaca aggacaccac tgtcccaggg atcttctcgc gggacacacc 693001 gttttcacac ggcaaccatg aaccgggctg ggtcgttgcg cccgggatgc agggtatgca 693061 ggttcagccg tttaccgcga acatgctcac cccggaatcg ctggcagagc tgctgggtgg 693121 tccggatatt gccccccccg ccgccgggaa ccaacttgcc cggaccgccg aatgcgtatg 693181 acgagtccaa tccgttgccg ccgccgtggt acccgcagcc cgcgtccctc ccggctgcgg 693241 gcgccacagg acagccaggc ccgggccagt gaggtgcggc gtgagcgcgg gtagcgcgaa 693301 cggcaagccg aaccgttgga ccctgaggtg cggcgtgagc gcgggtcacc gtggatcggt 693361 gttcttgctg gcggtcttgc tggccccggt ggttttgact tcgtgtacct ggcgtggcat 693421 cgccaatgtg ccgctgccgg tcggccgggg tatgggtccg gatcgcatga cgatctacgt 693481 gcagatgcct gacacgctgg cgctgaacac taacagccgg gtcagggttg ccgacgtctg 693541 ggtcggtacg gtgcgtgaca tcagcctgag gaactggatc gcgaccctga cgctggagct 693601 cgagccgacc gtgcggctac cggcaaatgc gaccgcgaag atcggccaga ccagcctgtt 693661 aggcacacaa catgtcgagc tggccgcacc gccaatcccg tcaccgcagc cgctgaaaag 693721 cggcgacacc atcggcctga agaactcctc ggcctaccct accgtcgaac ggaccttggc 693781 cagcgtcgcg ttgatcctca ccggcggcgg catcgtcaac ctcgacgtga tttaaaccga 693841 gatcctcaac atccttgacg gccatgccgg tcagattcgc gaattcctcg agcggctagc 693901 cactttcacc gccgagctga acaaccaacg cggcgatctg actcgcgcaa tcgactcaac 693961 caaccaactc ctgaccatca tcgccaaccg caacgacacg ctggatcggg tgctcactga 694021 cgtcccaccg ctgatcgagc atttcgccga caccggtcag ctgttcgctg acgccaccga 694081 atccttgggg cggttcagcg aagtcgccaa ccgggcgctg gcggctaccc ggcctaacct 694141 tcaccagacg ctgcagtcgt tgcagcggcc gttaaggcaa ttggaacggg cttcgccgta 694201 tgtggtcggc gcgttgaagc taggcctcac cgctccgttc aacatcgacg aggtgccaaa 694261 cgttatccgc ggcgactacg tcaacgtgtc cgcgacgttc gacgtgacgc tttctgcact 694321 cgacaacgca ctgctgagcg gaacgggcat ctcgggaatg ttgcgtgcgc tcgagcaggc 694381 gtggggacgg gatccggaca ccatgatccc ggatgtccgc tacacgccga acccgaatga 694441 cgcgccgggc ggaccgctgg tggaaagggc tgagtgagga gatgctgact cgcgctatca 694501 agacccagct ggtgttgttg acggtgttgg cggtcatcgc ggtggtggtc cttggttggt 694561 atttcctgcg gatacccagc ctggtcggca tcggtcgata cacgctttat gccgaattgc 694621 ctcggtccgg gggtctatac cgaacagcca acgtcacata tcggggcatc accataggga 694681 aggtcaccgg cgtcgaacca accgagcggg gcgcgcgagc aaccatgagc atcgacaatg 694741 gctaccagat ccccaccgac gcctcggcca atgtgcactc agtgtcggcg gtcggcgagc 694801 agttcgttga cctggtgtcg acccgcacca gcggtccgta tctgcggcat gggcagacga 694861 tcaccacgac tacggtcccc agccagattg gcccggcgct ggacgccgcc aaccgtggat 694921 tggcagtgct gcccaaagac cgggtcgcgt cggtgctgca cgaggcgtcg gaggccgtgg 694981 gcgggctggg atcctcactg aatcgcctca tcgaagccac ccaggcaatc gcccacgatg 695041 tcaggggcag cctcgaggac atcgacgaca tcatcgagcg ttcggcgcct atcatcgata 695101 gccaggtcaa ttccggcaac gagatcgccc gctgggccgc caacctcaac acgctggccg 695161 ctcagaccgc gcagaccgat ccggcggtgc gaagcattct ggccaacgcg gcaccgactg 695221 ccgatcaggt caacgccacg ttcagcgacg tgcgggagtc gttgccgcag acgctggcca 695281 atctcgaggt cgtaatcgat atgctcaagc gctaccacaa cggcgtcgag caggcgttgg 695341 tgttcttgcc gcagtccggc gcgatcgccc agtcggttac tacagagttc cccggccagg 695401 ccggactggg tgtcggcggc ctggcgctca accaaccacc gccgtgcctg accggcttcc 695461 tgccggcgtc ggagtggcgg tcacctgctg acaccagcac cgcaccgcta cccaagggca 695521 cctactgcag gattccgatg gacgcgagca atgtggttcg tggagcacgc aacaacccgt 695581 gtgtagacgt gcccggcaag cgggcggcga ccccgcggga atgccgcagc aatgaagctt 695641 atgtgcccgg gggcaccaat ccctggtatg gggaccccaa ccagatgctc agctgtcccg 695701 cgccggccgc gcgttgtgac cagccggtga agccaggcca ggtgatcccg gcgccgtcag 695761 ttaacaatgg catcaacccg ctgcccgccg atcagctgcc aggcacacct ccaccggtca 695821 acgatccttt gcagcgacct gggtcaggca ccgtccagtg caatgggcaa caacccaacc 695881 cgtgcgtcta caccccgagc acatttccta caaccattta cgacgtgcag agcggcaaag 695941 tcgtagcacc cgacggtgtg gtgtattccg ttgaggcttc gactcatgcc ggagccgacg 696001 gatggaaggt gatgctggca ccaaccggct gagccggcgc gatcaggtac cggcggattc 696061 gcgctggtca agaaaggcaa ccgtcagatc gttatgacct cgacgtcggg catggcggcg 696121 tagtcgttgt cttgggtcag gatcgcaatg ccgtgcgcca cggctgtggc cgcaatccag 696181 ctgtcgttga tcggcacgcg cagtttggcg gcgcgcagct tggacaccag taatgcccat 696241 gcttcggaga ccgcctcgtc gatgcctagt ggttcgaacc gttgcgcaag ctggtaggtg 696301 gagagccgac gtgcggcggc ctcggggccg gaggcttgca acaccccgag ccgcagctcg 696361 ccgagtgtga ctaccgagac gccccattcg tatcccgcaa accggtccgg gtcgaatcgt 696421 gtcgcctcga tgccaatgaa aacggatgtg tcggcgaggg cgcgccgtac gttcaccacc 696481 gcacatcgtc cgtggtttgc gtcagcgtct ctcgcagctc ctcgcccaga ttggtggtat 696541 cggggcccaa gcgcaccagt tcgccgatca cctcggcagc tggcaaccat tggcggcgcc 696601 gcttgagcgg aacgatgcgc gctacggggc gattgtcctt gagcacctcg atttcctcgc 696661 cggcggcaac tcgccgcagt acctcggcgg tgtggttgcg aagatcgcga gcgggtatcg 696721 tagcagacat gctacgagtg tagcggagct gctgtcgcgc cgcctcgtct cgatgtctgc 696781 ggtcacgatc tccgcaggtt acggccgctg ctgtgcccgc agtcgcccgc gatggtgggc 696841 ccgtcggggt agattgcgag cgcgcccgga cggaggccgc cgatgccgaa gtgccgtgat 696901 ttgttcgaag agttagccgg cccagagcgt gctaacgggt aacgccgcga gcgttgggcc 696961 gaacggggtg gcttccggcg cggcggtgag cagcacgccg aactgaaatc gatcatcgag 697021 ccgctcggcc agataacgta ggccacgcag gtcctcggct cgcggcgtgc tggtcgcctt 697081 gacctcgatg ccgcagaccc gaccatcggg atgttcgagc accagatcga cctcggcgcc 697141 gccgcggtcg cgaaaatgcc acagactcgg ccgttcggtc gaccaggtga gctgtttgcg 697201 aatctcgttc gccacgaaag tctccagtag cgggccgagt ggacggccgg ggcgatccag 697261 cgtcgcaccg gtaacgccga gcaggtgaca cgccaggcca ctgtccgaga ccaccagttt 697321 cggtcggcga atcaccttgc ggctcaggtt ggtcgaccag gccggcaccc ggtggataag 697381 gaacgccgct tccagcaggg ccagatagcc agcggtggtg cgagccggga tcgacaggtc 697441 gttcgccagt gcgctcacgt tgagctcggc gccggtacgc gcggcgcaga gccgaagcac 697501 acgcggcatt tcggcaagcc gctcgatcgg cgaaatctcg cggatcaccg actgcgtcgc 697561 cgtcgtgaga tagttgtcga accacgcgcg acgcctcgac ggcgatcggg cgacgatgtc 697621 cgggaagcct ccggtggcga tcctgtcgac cagatcggcg cggcgcatat cggagccgtg 697681 gatcagctcg cgtggtgcgg tgaacagcgc atcgacgaaa ccgtccgcga ttccggcccg 697741 ctcaccttgc gagaacggcc agagttcgat gatttcgacc cgcccgacga gcgcgtcggc 697801 catgtcagga gccgagagca gcctcgctga acccgtgagc aggaacctgc ccggcctgcg 697861 atcccggtcg acctctgcct tgatcgcccg aaacagcccc ggctcgagct gggcttcgtc 697921 gatgacgagc gtgtccaccg gccgggatac gaatgcgcgg ggatcgtcgc gggcggcgtc 697981 gcggttggcg acgtcgtcaa gcgagacgac ttcgctggat cccggatagt caagtcgcgc 698041 gaccagtgtt gttttgccga cctgacgcgc gccgttgaca acgacgaccg gggtgtcggc 698101 gagcgcggcc agcaccgagg gcgcgatcgc gcgttcgacg actcccatgc ggacagaata 698161 cgctgccgat ttgtctacct attggctgcc gattcgtccc cattagcggt gcggattagt 698221 ccacatcatc gctgcggatc cgtccgacgg cggccctgag ccacgtggcc gacgaagaac 698281 gctcggaggc gctgctgcgg gagcgctcgc tgcactacgt ggccagtagc cgggcggggg 698341 acatgttggt ggtgacctgg agcggacagc ggtcggagtt gttgagtcag ctgaagattc 698401 acgcggcgac aacgacgtgg acgccgatct tttcgtaagt gtccttggct cgagcatcgc 698461 gtgtggcgag ttcagcccgg tgctcggcgg cggcaagggc cacgagagcg tcatagaccg 698521 cgccaccagt gatctcgaat tgggccagca cgcgtgggag atgttcagtg gtgcgggaac 698581 tcaacaacag cggtgccgca aagcgttcgg taagaagccg cgcggcgtcc atcggtgcca 698641 gtcgtaggtc acgcggcagg cgggtcagca cggagtaggt ttcggccagg gcgtgcccgc 698701 acagcgcggc ctcccgatgt gcccaccagg cgacaaccgc cgcatgcgcg gtatgggtcc 698761 gtaccagcaa cggaatcgcg acgctggtgt ccactgccag cggcggtttc acttccggcc 698821 gctatcgata aggccgaaca cgacctcatc gtcgatcgtg gtctcaccgg tggccaccag 698881 tacgccattc tcctcttcga gacgcgctgt tcgtccggtg ggaatcaggt ggagaccagc 698941 gccatagcgg gatatttcca cggtggaccc cggttgcagc cccaaggctt cgcgcagcgg 699001 tttgggtacg acgatgcggc cagccgcatc cacaacagcc ttcatgggaa tacgatacca 699061 atggcttccc actcaggtgc ggaagtcgac tcaccgccgt taccacgacc ccgacgacca 699121 caccgtcagc tgcggcgcgg cgtggcgact attggtcgct agtgtcgtgc tcccgattcg 699181 ggtcctttgt atctgatgac gtgatccgcg gaagctcctc gtggaagggc ggccgcggtg 699241 tggggagggt gatgcggagt tcggccccgc cgtcggggtg gttggtggcg ttggcatgtc 699301 cgccgtgggt ggtggtgagg gcggcgacga tggccaggcc gaggccgctg ccgcgaccgc 699361 cgcgggcggt gtcggcgcgg gtgaatcggt cgaaggcgac gggaagaaag tggtcggcga 699421 atccggggcc gtggtcgcgg acgccgatgt cgactgcacc gtcgcgggcg tgcgcggtga 699481 cagcgatttc accgtccccg tgggtgatgg cgttgtcgag cacggcggtg aggattcggc 699541 gcaggtggtc cggatcgatc gagacgaaca ggtccggttc cgcgcgtgtg gtgatgtccg 699601 ctccagtagc ggcgaagcgg gccacgctct cgtgcagcag gggagtgatc ggcaccgctt 699661 tggcggaggg gtgggattcg gggcggtcgg cgcgggccag ggtgagcagt tggtcggcca 699721 gtccgctgag ccggcgggtt tcttcgagcg cggagcgcag ggcggcgctc agctggtcgg 699781 cgggtctggg ccggcgcagc gcagttcgag ttcggtggtc agcagtgcca acggggtgcg 699841 taattcgtgg ctggcgtcgg cgacgaactg ttgttcgtgg gcgagggccc gttgcagtcg 699901 ggtgagcatg gtgttgagag tcgttgctag ccaagcgatc tcgtcgtcgg tgggaggtac 699961 cggcagcggc gcgtcggtgt cggggtgcgg cgtggtggtc agtgtttgcg ccgccgcgcg 700021 gatccggtcg acgggccgca gcgcggcgcg gctgagcagg taggcggcca ccgcggcgat 700081 gacgagcacg atcggcagga tggtcaccaa ttcccggacc agatcggcgg tgatgtcgtc 700141 ggtgagcccg cgcagcgcgc catcggggtc ggcttcgtga gcggcgtcgc ggaactggac 700201 gacggtgacg gcaccggctg ctgccagaac gagcgccatg gcggcgctga agacgagggt 700261 gagtcgccat cggatgggcc actcagcggg ggagcgcatg ccgtcctccg tccttgcgca 700321 gccggtatcc ggcaccgcga atggtttcca gcgaggtgac gccgaagggc cggtcgatct 700381 tgtcgcgcag gtagcggatg tagacgtcga cgatgttgga gcgggcctcg taggcggcgt 700441 cccagcagcg ttccagcagc tgggcgcggg tgtggacgat gccgggacgg cggatcaggg 700501 cttccagcag ggtgaattcc ttgtgactga gccggatttc ggtgtcggca cgccagactc 700561 ggtgttcgct cgggtccagg cgtagatcgc cggcctccag cgtcggtggg cgtgggatgg 700621 gcccgcgccg tgacagcgcg cgcaaccggg cgaacagttc gtcgaggttg aacggtttgg 700681 tgaggtaatc gtcggcgccg ccgtctaggc ccgcgatgcg gtcggtgacc gcgccgcggg 700741 cggtaagcat cagcaccggt gtccacaccc gctgccgtcg cagccgcgcg catacctcga 700801 acccgtcgat accgggcagc atcacatcca gcaccaccgc gtcgtagtca ccgccgtcga 700861 cggccgccac cgcatggcgg ccgtcggcaa cggtgtcgac cgtgtggccc tcctcggtca 700921 gcgcccgcgc cagcagcgcc gtcatcttgg gctcgtcctc gatcaccagg atgcgcacac 700981 ccgacaccct gccgcatgcc cggcccgggc cgcgaccagc tctcatcgtc gtttcatctg 701041 ccacccctac cgtcggagcc gcacaccgtc acagcgaggt agacagatca ggagaaagcg 701101 atgaatcgca tcgtgcagtt cggagtttcc gccgtggccg cggcggcgat cggcatcgga 701161 gccgggtcgg ggatcgcggc ggcgttcgac ggcgaggacg aggtgaccgg ccccgacgcc 701221 gaccgcgcgc gcgccgccgc ggtgcaggcg gtcccgggcg gcaccgccgg agaagtcgag 701281 accgagaccg gcgaaggcgc cgccgcctac ggcgtgctgg tcacccgggc cgacggcacc 701341 cgtgtcgagg tccacctgga ccgggatttc cgggttctgg acaccaaacc ggccgacggg 701401 gacggcggtt agcatcggcg catgcccgca ccgggccacc gatagcctcc gggtgcgcac 701461 cgatgagatc tagcgaggag accatgatca ggcgacgagg cgcccgtatg gccgcgctgc 701521 tggcggcggc cgcgctggca ctgaccgcat gcgcgggcag cgacgacaag ggcgaacccg 701581 acgacggcgg ggaccggggc gcatccttgg ccaccaccag cgatgcggac tggaagccgg 701641 tggccgacat tctcggccga accggcaagc tgaacgatgg cagcgtctac aaaatcgggt 701701 ttgcgcgctc ggatctgagc gtgcagacca agggggtgac cgtcgccccc gcgctgtcac 701761 tcgggtcgtg ggtcgcgttc gcccgcaccc ccgacgggca gaccatgctg atgggagatc 701821 tggtggtcac cgaagacgag ctggcctcgg tgaccgacgc cgtgcaggcc ggcggcctgc 701881 agcagaccgc gctgcacaag cacctgctcg agcagtcgcc gccgatctgg tggacccaca 701941 tcgccggcca cggcgacgcc gccgacctgg cccgtgcggt ccggtcggcg ctggatgcca 702001 ccgacacacc accgcccgcc ccggcaactt ccggccagac cagcttggac ctggacaccg 702061 cggccatcga tgaggcgctg ggccgctccg gcaccatcgc gggcggggtg tacaaattct 702121 tcatcgcccg ccgcgatccg gtcaccatgt ccggcatgct catccccccg tccatgggtc 702181 tggctaccgc cctcaacttc cagcccaccg gcaacggccg cgcggcgatc aacggcgatt 702241 tcgtcatgac cgccgccgag gtccaagacg tcgtccaagc actgcgcggc ggcggaatcg 702301 acatcgtcgc catacacaac cacgggttcg acgaacaacc acgcctgttc tacatgcact 702361 tctgggccga gaacgacgcc gtcgcactcg cccgcacgct acgcgccgcg gtggacgcca 702421 ccgcggcccg gtgaccccgc gccccggcgc ataccgaccc gccgcgaacc accggtggcg 702481 gacgtggtca tgcaggcgtc gtgcgatgac gtcctcgttc aatgggccat gttcggccgg 702541 gatcctcgcc acggcacggt cgcatggaac gcttcggcca cggtggccac cctatgccgc 702601 gtcgagccgg ggctgccaac tgttgcgcgg tgagtggtcg gtagttgtcg gtggcgtgct 702661 gtaggaacag aggtatgaat ctcgcggcgt gggccgagcg caatggcgtc gcgcgggtga 702721 ccgcgtatcg ctggttccac gctgggctct tgccggtccc ggcccggaag gttggtcgac 702781 tcattctggt cgacgagctg gctagcgagg ctggcgcgca gccaaagact gcggtgtacg 702841 cgcgggtgtc gtcggctgat cagaagtctg atttggatcg gcaggtggcg cgggtgactt 702901 cgtgggccac agccgaacag atcccggtcg acaaggttgt caccgaggtc gggtcggtgc 702961 tcaacgggca ccgacgtaag ttccctgcgg tgctgcgcga tctgtcggtc acgcggattg 703021 tggttgagca tcgggatcgg ttctgccggt tcggttcgga gtatgtccac gctgcgctgg 703081 ccgctcaggg tcgggagttg gtcgtggtgg actcggccga ggttgacgat gacctggtat 703141 gggatatgac cgagattctg acctcgatgt gcgcaaggtt gtatggcaaa cgtgctgctc 703201 agaaccgggc caagcgggcc gtcgcggctg ccgctgtcga tgatcatgag gcggcctgag 703261 atgccgcgtt tggagatccc caacggctgg tgtgtgcaag cgttccggtt cacactcgat 703321 ccgaccgccg agcaggcaca cgcgttggcg cggcatttcg gcgcccgccg caaggcctac 703381 aactggaccg tcgcgcagct gaaagccgat atccaagcgt ggcgcgcgac cggcgcccag 703441 acggcgaagc cgtcgcttcg ggtactgcgg aaacgctgga acacggtgaa agacgaggtg 703501 tgtgtcaacg ccgagactgg caccgtgtgg tggccggaat gctcgaaaga ggcctacgcc 703561 gacgggatcg cgggcgcggt cgacgcgtac tggaactggc agcagaggcg tgctggcaag 703621 cgcgacggca agagaatggg cttccctcga ttcaagaaga agggccgcga cgccgatcgc 703681 gtgtcgttca ccacgggtgc gatgcgcgtt gagcccgacc gtagacacct cactttgccg 703741 gtgatcggct gcgtgcgtac gcatgagaac acccgccgca tcgagcgcct catcgccaaa 703801 gaccgggcgc gggtgctggc gatcacggtg cgccgcaacg gcacccggct ggatgcgagt 703861 gtgcgggtac tggtgcagcg cccccagcaa cccaacgtgg aactgcctga gtcgcgaatc 703921 ggtgtcgacg tgggtgttcg tcgtctggcc acggtcgcca ccgcggacgg cgcatgctgc 703981 ccggtcctgg tgccagacgg ctaacgctgg gcattatccc cgagggcggc gcccatatcg 704041 acgtgccccg aaagaccgtg ggcgcctggc aaacagccga caccatgggc atcttccagg 704101 cccttcccga cgtctggggc gggtggcgga ccgaatgctg ggaagaccgc ttcgaagagc 704161 agctgattcg atgcaacggg gcgctgcggc ttcccgagct ggatttggcc gcgggcatgg 704221 acagcgcccg ggagtggctc cgtgacagga tatttcagcg cttctcggac agcccggcag 704281 gccaaattct gaaactctcc gagctgctgg ccgatgtcgg acccggtctg gtcgtcagcg 704341 acgatgccgt gacgaatggc ggggctcgcc caaacaacga agagtgggcg cgtttcgttg 704401 cggcgtgcga tctggtgcgt ggggctcacg ccgaatcggc ctgacttcgg ggatagtggt 704461 accatcactt tggtagaagg gtactaacat ggcgttgaac atcaaagatc cgtcggttca 704521 ccaggcggtc aagcagatcg cgaaaatcac cggcgaatct caggctcggg cggtggcgac 704581 cgcggtgaac gagcgtctgg ccagactgcg cagcgacgat ctcgccgccc ggctcttggc 704641 tatcggccac aagaccgcga gcaggatgag cccggaagca aagcgcctcg accacgatgc 704701 tctgctgtat gacgagcgag ggctgccggc gtgatcgtcg acacgtcggc gatcatcgcg 704761 attctgcgcg acgaggacga cgccgcggcc tacgccgacg cgctcgccaa cgccgatgtc 704821 cgcagactgt ctgcggccag ctacctggaa tgcgggatag tccttgactc ccagcgtgat 704881 ccggtcatca gcagagcact ggatgaactt atcgaagaag ccgagttcgt cgtcgagccg 704941 gtaaccgagc gccaggcccg cctggcccga gcggcctacg cggatttcgg cagaggcagc 705001 ggccaccccg cgggcttgaa tttcggcgac tgcctgtcct atgcactggc gatcgatcga 705061 cgtgagccgc tgctgtggaa gggcaacgac tttgggcaca ccggcgtcca aagggcactg 705121 gatcggcggt gatcgacgtc agcctggcgc ggcggtgcga ggctcacggg tacgactatt 705181 ttcgttccga cgatccggtg gcagcggcgg gctttgtggt gtccgctgtg tggagttgtg 705241 ggcgtggacc tgggaacgcc acgggttccg ggcgtttgcc gaaaccgctg cgccacagtt 705301 gatttggcgg gagtacagac ccggctggac ccgatacggc gacggatctg tggcgcaggt 705361 caaatcgatc ttcgacgctc cgcgcggtta cctcaatgcg gcgtgtcgtc ggcgtgttgt 705421 acattgggca tcgggactcc tgagaaggat cctgtaggcc gcagccccac ccacgggtgg 705481 ggctgacgtg cgtccaaggg ggccagatct ggcagacctt catcttgttt gcgacgatgt 705541 cccataatcg ttggtggtct tcaccgaccg ggcgtctttg acgtctgacc gacgcctccg 705601 aaagtggagg taggacacaa ggtcggcagc ttgcagcagg cgacggtgtt tcgagggcgc 705661 gaaatgcagt gcgtcgacgc ccgctattcc tcaccgccgc ggtttcctcg gtggcaatct 705721 cacttcgtcg agccgcgggc acggctttcg agatagaggt cgatatgccc acaagtctcg 705781 caggcaacgg cgttgacctg ggtgccgcga ttgaagtggc cggcaccttc gcgcttgaaa 705841 cgcagcggcg cgttccagac gacggcccct tcgacgagct ggtcgccccc gcatctcacg 705901 cacttctcgt cggtcacgac gcctcccctt ctctgcggct ggccaggcta cgcccagcgc 705961 ttgatgccca ggaaatccac ggcgccgccg ctagtttcac ctgaacgacg ccgcgcgatc 706021 acgaagcttt cggatcgccc gtgcggtaaa cgcttgcggc tccagatgcc acaggtgcgc 706081 gccttcaggt gtgcgcaacg ccgcgaaagg aacccgctca ccacacacga gcttctccag 706141 ctcgagtgcc cagcctacgg ccagggcggc acgctgccgg tgccaggcgt cagcgcgcca 706201 cgtcagccgg ggcaagccgg cgagtttgct ctggatgctt tggcggagtt tgccggcgca 706261 gaggatccgg gacggaaccg agcccagacc ttcggtgtcg gacgccggag cgacgcgggt 706321 gaacgcgatc tcccggcggt cgtagttcca ccaggcgcgc gcgacggtga cctcgccgac 706381 cggggtcgca cggttggaca gttcgcgctt ttcttgacgc gccccatgct gatccagcca 706441 cagatggatt tcggcaaccg tgcgggcggg cagtggctcg gcgttcgggc gcggatcgca 706501 tccggcgacc agcatcgcgg ggacgtgcgg cggccacagc tgggcgagtg gacgcagcgg 706561 attgagccga acgccgtcga agtcgctgcg tttggcgagg tcgccccaga gtgcgggacc 706621 caaccgcacg acgtcgagtc cgcggtgtcg ctcggttcgc cggaatgtcg ccgccgcgtc 706681 cggcgcggtc atgatcgtga tgacgtccaa cccgtcggcg cggtgaacgg cgggccggcc 706741 ggtgcccggg gagacggcaa cccaccacgg cccttcgtac agcaaatccc gctggccggg 706801 gcccggtttg gcaacggcat cctcgagctc gaccgtgcgc gcccagctct gcagctcggg 706861 cagcgcaccc ggccggaacg ccgaacccca cggttcgtcg ggatcgaatg acaggccacc 706921 gacgagcgga gcgatgtcgc gaaccagttc caggccggaa tattcgcggc cgtccgacgc 706981 cgaactctcg gcgatgagca tcggtgtgcc gtcatcgagc gccacggtct cgagttcgcc 707041 gtctatctcg gcgacccgcc acttcgggta gcgcagtatc gcgcgccgcg cgtcgtcggc 707101 ggagagttct ccgcgcgcat atcgggccag taagccccgt agctcgtcgt ccatcggcca 707161 tcacccggtc gggttgcagc atccgccaca gaacaaagcg gacgactacg ccacctcgcg 707221 gacatgcgga atctcccgcc gccgtcgtgg tcggatatcg tcgccggcca acgtgacgac 707281 cgctaccgtg cagccgttcg cggcggtaaa gtcgacttcg tagccacccg cggcgtagcg 707341 cccgacgacg gcaccgacgt ctccggcgat gagggatttg tcgggaacat cccgtgttag 707401 caccacaaca tcgtgttctg cgtacatcgg tccgctccta gcgtggatag gcggtaacca 707461 atcgaggcac gccgtcgggt tcgtcgctga tccacactgt acgcaatgca accatccggc 707521 cgcaccgtga ttccacgaca ccatcgacga ttgccgtgac gccgtagggt gttggggccg 707581 atccggcaac cgcgcctgac ggtgcggcca gggcggctgc cggggatgat cgccggcgtg 707641 gcggcgaaac gaatgaaccg cgaacagttc ttccgcgcgg cgtcggggct cgatgaggat 707701 cgcctacgga aggcgctgtg gaacctctac tggcgcggca ccgcaaacat gcgggagcgc 707761 atcgaggccg agctggccag cgccgggcgc gctcgcccgg cgcgcaaaat aaagccgccg 707821 gccgatccgg acatcgtggg ttgggaggtc gacgagttcg tgtcactggc gcggtcgggt 707881 gcctacctgg gcggggaccg gcgggtgtcg ccgcgggaac gatcgcgctg gcgtttcacc 707941 ttcaagcggc tcgccgcgga agcccaggac gccctgcgag ccgaggacgc cgagcccgcg 708001 gcatccgcac tggagcaact gatcgacctg gcgcgcgagg ccgacgggta cgactacttc 708061 cgctccgacg atccggtggc agcggcgggt ttcgtcgtgt ccgatgtggc ggcggcgggc 708121 cacccacact tccgtgagtt cgccgccgag atcggtgcgg cgatcccgcc gtgagtaccg 708181 cccgcccggc tactacaagc ccaaagcggt gcgcagccgg tcggcgtcca tcccgccacg 708241 ggcgcccgcg ccggcgggaa acgtgtccag gagcttgatc aggtcggcgc ggcgggtggg 708301 gtcgtcggcg gcctgccgcg gcgtgtgccc gtccagcgcg gggatgggtt gatcgagcca 708361 gctggtctcg tagtcgcgga tgaattcctc gagcgcggcg gccagctcgg ggctgtcggg 708421 gtcgggcgcg cccgcgccgg taactggcat ctgctcggcc agcgcggcgg cctcgcgggt 708481 gttgcgcagc ggacggcggt cgtcgtcgag caccgtcatc gccgggtcga ggcgggtcag 708541 cgtggccagc acgcgatcca tccgcggttc gctgttggtt tccacccgca gcgtgtcacc 708601 gtcgaggacc agcgtggccc ggacccgcag catgccgtcg ttggtgacgt gttcgatcca 708661 ccgcggcggc tcctcgccgt caacccggtc gtagaccccg tcgagcgcgc cctggatccc 708721 ggccggatcg tcgactcgca cgctggcctc gcagattgcc agcgagtcgc cctcggtgtt 708781 gaccagtgtc ggcggcgcga accggcggct cagctgggcc accagtgtca ccgggtcggg 708841 ctcgtcatcg agcagctcga tcagcacggc acgctcgtgc agcgcgaccg gctcgatccc 708901 gccgaagaac accatggtgt ccccggcggg caccgggcgc gcgcagatca gctgcccggc 708961 tcgcagctgg cggctggccg cccgctcatg cacctcatgg gtgtcgccgg tgcgtacgtc 709021 gcgcacgatc acgccctcgc caggttgcac gtgctcgacc tcgaacaccg accgctccac 709081 gagcagccat tgctcggcaa gcagccgctc gtcgtcgggt agcagcgaac cgcgcacttc 709141 gaggaactcc gcgaacgcgc cgccctcgaa caacaccgcg tccagcacca gcggatcggc 709201 cagcgccgcg gccagcgcgt cctcatcgtc agagtcggca taccggaagc gctcatagct 709261 gacttcggcc agcaggccgg tccagtcgcc cgacagtgcg tgctgggatg ccttggcata 709321 cagccagtcc acccgctcgg ccagcggcag cgcctcacgg ccgagatggc atttcttgta 709381 cttgcggccc gacccgcacc agcacgcctc gttgcggccc aggtcgcggc gcggctgggc 709441 tcggtgccgc tccaggagcc gcaccagcgg gtggtcgggt tcggtgccgg cgcggcgcag 709501 cagtgccaac ccgcgctcgg cgtcgccgcg atcggaggcg atgcgggcca ggtcgagcaa 709561 cggcagcggc cactcggtgt ccatcgactc ggccgccagc agctcacgtt cggccgcctc 709621 gacatcaccg atccggtcca gcgcgaccgc gcgcagccag cgcaccgcca cccgcgccgc 709681 gcgcggcacc ttgggctcca gcatctcggt gagcaggccc agcgcggccg ccccgccgga 709741 gtcggtgccc accgtctctg ccaccagcag ctcggccagc agcgggtcgg ccagcgccgc 709801 cccaatgtcg ccgagcagat cgaccaacga gtcggagccg gtttccgtcg cggtctcggc 709861 ggctgtggcg agcacatccc gcggcaactc gtccgggtcg gttgcttcga gcagcagcga 709921 catcgtctcg tgcagtttga tcagcgtgta cagcgcgacc gcgtcgttgg ggtcgaggtc 709981 gtggcgaaag gccagcagtt cgcatcggtt ctcgaaacgc caagcgtcga aattgaatcc 710041 gccaggtgct agccagtcgt cttcgtgcgt gaggccgtgc tggtcgagga tctcgcgcag 710101 cggtgccact ggctcggtaa atgccgccgg gtcgtcgacg cacgccgtcc agaccgccgc 710161 ggggaagaac gcgggctcgt cggggtcgac cagctcggcc agccgggcgc cgacggaggt 710221 gtccgcaccg gctgtgccga tccgctcgag caccagccct gcggcggtca gccgcacacc 710281 gaccagatcc cccgcggcgg cccccaacgt cgccagcgtg cccggctcca gcagcagtgc 710341 cccgccgggg tcgatggcct cgtccgggat gccccgtcgt tcgagcagct cctcgtcgta 710401 tccggccagc acgatccgcg ccgccgaacc gtcggccagc cggccatact cctcgtgctc 710461 gcagagcgtg gtgatcgggt ccaggtccgg ggtcacgccg agcatgtcgt ggaccgcctc 710521 gtccgcgccg agccgatggg tgaatacccg cccggctagc agcgtcggca gccacaccca 710581 ccgatcgtcg accaactgcc ttgccggcca ttccgtttca aggcgaagcg cgcgcaggac 710641 ggcgtccggg tcggccacgc cgctgtccag caggcgtcgt gcgatgtcgt cctcgctcaa 710701 tgggccatgt tcggccagga ttctcgccac ggcttgggtc gcatcgaacg cttcggccac 710761 ggtggccacc ttatgccgcg gccagccgag gcttgacgtc gggcaccagc cgatggggct 710821 ggcctcgcct agggttcggc gttgtgacgg cgccgacgcg gtggaccctg gccgacggac 710881 gtgagctgct gttcttttcg ctgcccgggc cccgcaccag cggcaccgcc gcagaacggg 710941 tggctcgcca cgctcaagcg caaacgttcg ccggcgatat ccgccagcgc gccatacagc 711001 tggtcgtgtc cgaacaagaa gtggcaagca aaatcaccgc cgctaccgcc ggaatcgcca 711061 ccaccacctt cccggaaaca cccagcatcg acgacaccat catcggcaac gacaaccgcg 711121 acactggggt ccggttggtc gacgtcaaac aagatggcgg cactagtccc ccgcccccat 711181 ttgcgccgtg ggacacccct gatggaacac cgccgccggg cactggccta agccctacgc 711241 tgcagcagat gatcctcggc ggtgatccag ctaatctgac cggccagggt cttgcggaca 711301 acgtgcaacg gttcgtacag tcgctgcccg caaacgaccc caacacagcg tggttgcgcg 711361 gtcaggttgc ggatctgcag gcgcacgtcg ccgatattga gtacgcccgc acccattgca 711421 gcaccaacga ctggatcgac cggaccgccc agttcgcctc gggcgccata gtcttcagca 711481 tcggcgtgtt gaccgcagag accggggcgg gggtcgtggc tgccgcggcc ggtggtgtcg 711541 gcgcggccac ggcgggcgtg agtcttctac aatgcctggt ggggagcaag tgatggacgt 711601 attggctgct gggatcgcgg ctggcgcgct cacgctggcg gcgtggggcg cctggcgccc 711661 gcactaccgg gcggcgtcct acctcgtggc cggtgccgta gagctggcac tgatcgggct 711721 gctggtggtg accgggcaaa cattgatggc catctcggtg gccttccttg tggcgctggg 711781 cggtccgttg gtggtggtca accaccgcag agctgaacgc agccgaggtt agatgaacga 711841 agagggcctg taggtcgcac tcatcgcgcg gctagcctgt gaggccagcc ctcgggccgc 711901 cacccaacac ggctcgtgcg ctgtctcggc cggctcgtct gccgcacggc cagcatgatc 711961 agtcccgttg gaataccggt gagcgtcggc gcgcgcatca cgatgcagcg atgttaggat 712021 gaggcggtgc gcactaccat cgacctgccg caagacctgc acaagcaggc actggcgatt 712081 gcccgggata cgcaccgcac gttgagtgaa acggtcgccg acctcatgcg acgaggcctg 712141 gccgccaacc gccctaccgc gttgtcctca gaccccagaa cgggattgcc tttggtgagc 712201 gtcgggaccg tcgtgacctc cgaggacgtg cgttcattag aggacgagca gtgacggtgc 712261 tgctcgacgc caacgtgctg atcgcattgg tggtcgccga gcatgtgcat catgatgctg 712321 cagcggactg gctcatggcg tccgacaccg gatttgcgac ctgcccgatg acacaaggaa 712381 gcctggttcg attcctggtg cgctcgggac agtccgcggc ggcggctcgg gatgtcgtca 712441 gtgcggtcca gtgcacgagc cgccacgaat tctggcccga tgcactctct ttcgccggtg 712501 tcgaggtcgc tggtgtggtt gggcaccggc aggtgaccga tgcctacctt gcccagctcg 712561 cgcgaagcca cgacgggcag ttggcgacgc tcgacagcgg cttagcacac ctgcacggcg 712621 acgtcgcggt actcattcca acgaccacct gatgtgcatc gtctcccggc ggcgcggcga 712681 gccgccccaa aaccaacgat tgggccacga tgcgtaggca tagctgaggt ggcgtcgcgg 712741 ccctcaccgg cgacaccaca gaggatctcg ggccgatccg atgagcgcca cgccaccgcc 712801 cggaggactc gacgcgtcgg tgttcatcgc gaacgaacgc ggtcggcaac tcgacgaggc 712861 gctcccagta gggttctgcg ttgtgacggc gccgacgcgg tggaccctgg ccgatggccg 712921 tgacctgctg ttcttttcgc tgcccggaca cgtcccggcg ccggtgtcgg atcgtcggcc 712981 gctgcccgaa cgtgacccgg ctccctcgcg gctgcggttc gaccgggcca ccggccagtg 713041 ggtgatcgtc gccgcacagc gccaggatcg cacctacaag ccgccggccg cgcgctgccc 713101 gctgtgtccg gggccgaccg gtctgagtag cgaggtgccc gcccccgact acgacgttgt 713161 cgtcttcgag aaccggtttc ccagcctggc cggggccggc atcgccccaa tcggcgcgcc 713221 cgacggtgac gggttcgtat ccgctccggg gcacggacgc tgcgaggtga tctgcttttc 713281 ggccgatcac accggttcgt tcgcgggcct ggacccggcg catgccgggc tggtcgtgca 713341 cgcgtggcgg caccgcaccg ccgaattgac ggccctgccc ggggtagcgc aggtgttctg 713401 cttcgagaac cgtggtgagg agatcggggt gaccctgacc cacccgcacg gccagattta 713461 cgcctatccg tatctgacgc cgcgcaccgc ggcgatgctg cgccaggctc gtcggcaccg 713521 aaagcgtcac ggtgacaacc tgtttgccag cctgctggca cgcgaggtcg ccgacggcag 713581 ccgcatcgtg gtacgcggcg agctgttcac cgcattcgta ccgttcgccg cacgctggcc 713641 ggtggaggtg cacatttacc caaaccggtt ggtgcgcaac ctcaccgagc tcaatgacgg 713701 ggagttggat gagttcgccc ggatctatct ggacgtgctg cagaggtttg atcggatgta 713761 ttcttcaccg ctgccgtaca tgtcggcgct gcaccagttc agcgaggtcc agcgcgatgg 713821 ctactttcac gtcgagctca tgtcgatccg gcgcagcgcc accaaactga aatatctggc 713881 ggccgccgag tcggcgatgg acgcgttcat cgccgacgtt atcccggaga gcgtggccgc 713941 ccggctgcgc gagctgggcc catgacggtc agctacggcg cacccgggcg ggtcaacctg 714001 atcggcgaac acaccgatta caacctgggt ttcgcgctgc cgattgcgtt gccgcggcgc 714061 accgttgtca cgttcacccc cgagcacacc ggcgcgatca ccgcgcgcag cgaccgcgcc 714121 gacggctcgg cgcggatccc gctcgacacc acgccggggc aggtgaccgg ctgggcagcc 714181 tatgcggccg gggcgatctg ggcgctgcgg ggcgccggcc acccggtgcc cggcggggcg 714241 atgtcgatca ccagcgacgt cgagatcggg tcggggcttt cgtcgtcggc ggcgctgatc 714301 ggcgcggtgc tgggcgcggt cggcgccgcc accggcaccc gcatcgaccg tctcgagcgg 714361 gcccggctcg cacagcgagc cgagaacgac tacgtcggtg ccccaacggg tttgctcgac 714421 cacctggccg cgctgttcgg agcgccgaag accgcgctgc tgatcgactt tcgcgacatc 714481 accgtgcgcc cggtggcctt cgacccggac gcctgcgatg tggtgctgct gttgatggat 714541 tctcgagccc gacaccgtca cgccggcggg gagtatgcgc tgcgccgggc gtcgtgtgaa 714601 cgggcggccg ccgatctggg ggtgtcctcg ttgcgcgctg tgcaggatcg cgggctggcg 714661 gcgctgggcg cgatcgccga tccgatcgac gcgcgccgcg cccggcacgt gctgaccgag 714721 aatcagcggg tgctggattt cgcggccgca ctggctgatt cggatttcac cgccgccggg 714781 cagctgctga ccgcgtcgca tgagtccatg cgcgaggact tcgccatcac caccgagcgg 714841 atcgatctga tcgccgagag cgccgtacgg gccggtgcgc tgggcgcccg gatgaccggg 714901 ggcggcttcg ggggcgccgt gatcgcactg gtgcctgccg atagggcgcg cgacgtggcc 714961 gacacggtgc gacgggcggc ggtcaccgcc ggctacgacg agccggcggt gagccggacc 715021 tatgccgcgc ccggcgcggc cgagtgctgt tgagcgggtt ggcgaagcgt catgtccaca 715081 gtgagcagat cggtgcgggt ccgccactgc ccttgacctc gaagccgaac accggctacg 715141 agaccgctgc cgcggtggat ctggtgtctg gggcttagcc cgcttcgctg atacccagaa 715201 ccagtgcgag cgcgtggtcg gtctggcgca tcctgccagg tgccagggct cccaatcgtt 715261 cgagcaggcg ggtgacgagg atcgccgact ggtcctgcgc gcgagtattg gttgtcgccg 715321 cggtatcccg gggccatgaa gacgcagcct tcgatcttgg cttcgctcga ctcccgatgc 715381 cgccccaact cgatcgctct tgggtttgtc cgaccgcggg cgtagccttt gcctcgaggt 715441 gcagccgatg gcaggcgatc gaggcgctga ccccggtccg gcgaatgtga ctccgggtgc 715501 ggatgaccat gcacagcatg cgtcgccgac ggtgctatgt ccccagggtc acgtgaacgc 715561 atgggactac aggttctgtg agcggtgcgg ctcgccgatc ggcgtggtgc cctggccgtc 715621 ggaggaatca ggcacacgcc agacggcgcc cgcgcgatcc ttcgtccccc tcgtcgtcct 715681 cgcggcgacg ctgctcgtgg tcgccgtcgt cgtgacggcc gtcggctacg cggtgacgcg 715741 accggctcgc aacgaccgtg aggagcccag ttccgcgcgg ggcgccgcca cgacgggtgt 715801 gccgttcgca caggccgagg ccgcgagttg cccggacgat ccggtgcttg aagcggagtc 715861 gatcgacctg acgtccgacg ggcttgcggt gagtgccgcg ttcatgtcgg catgcgccgg 715921 cggcgatgtc gagtcgaact cggcgctcga ggtcaccgtc gccgacggac ggcgcgacgt 715981 ggcggccgga agcttcgact tctcggcaga tccgctgagg atcgagcccg gcgtgcccgc 716041 ccgtcgaacc ctggtctttc cgcccggaat gtattggcga acgcccgaca tgttgtccgg 716101 cgcaccggca ttggcggcca cacggaaggg caggtccgat cgttcggccg cacgaggcgg 716161 atcggcacgg acgaccatgg tcgcggccgc gtccgcggca ccggcttacg gcagcatcaa 716221 cgccgttgcc ggggcggtgc tggtggagct acgtgactcg gacttcccct acgtgcgagt 716281 cggtatcgcc aatcgctggg tgccgcaggt gagttcgaag cgcgtcggcc tggtcgccgc 716341 ggggaaaacg tggacgagcg ccgatattct tcgcgatcac ctggccctgc ggcagcggtt 716401 cgggggcgcc cgcctggtgt ggtcggggca ctggaccacc ttcagcggac ccgatttctg 716461 ggtgacggtg gttgggccgg cgcagcccac cgcagctgag gccaatcgct ggtgcgactc 716521 gaacgggttc ggcgccgatg actgtttcgc gaagttcatc agcaccctcg ttggcgcgaa 716581 gggcacgacg gtgtaccgga agtgacgacg ctgccatgag tttctgcgtg tattgcggtg 716641 ccgagcttgc cgacccgacc aggtgcgggg cgtgcggcgc atacaagatt ggttcaacct 716701 ggcatcggac cacgacgccg acggtcggcg ccgcgacgac ggcaacggga tggcgacccg 716761 atcccaccgg tcgccacgag ggacgctact tcgtcgccgg gcagccgacc gacctcgttc 716821 gcgagggcga cgccgaagcc gttgacccac ttggtcagca gcagctggat cagtcaggtg 716881 ccgttggtgt ttcgccgtca gcggtgtcgg ggtgggtgcg ttctgggcac cgtcgactgt 716941 ggtgggcgct tgcgggcgtg gtggcgtttc tcgggctggt gggagccggt gtcgtcggga 717001 cgctgttcct gaatcgagac cgggagtcca tcgacgacaa gtacctcgcc gccttgaggc 717061 ggtccggact caccggtgag ttcaactccg acgcgaacgc catcgcccgc ggcaagcagg 717121 tgtgccgcca gttgcaagac ggtggcgaac agcaggggat gccggtcgat caggtcgccg 717181 tgcaatacta ctgcccgcag ttcagcgatg gcttccatat cctggaaacc ataactgtca 717241 ctggaagttt caccctcaag gatgaatcgc caaacgtgta cgcaccggcg atcaccgtgt 717301 cgggctccgg gtgctcaggg tcagccggct acgccgacat cgaccgggga acgcaggtga 717361 cggtgaaaaa cggtcagggg gacatcctgg ccacggcctt cctgcaggcg ggtcagggcg 717421 gccgattctt gtgcaccttc cctttctcgt ttgaaatcac cgagggcgaa gaccgctacg 717481 tcgtgtcggt cagtcgtcga ggcgaaatga gttactcgtt cgccgatctg aaggccaatg 717541 ggctatcgct cgtcttgggc tgagtcaccg cggtattcgg cacggcgcac cgctgcgcaa 717601 ccagctagcg ctgaccgtgt gatctagaat ctagctacta gtatagaatc gagacatggc 717661 gctgagtatc aagcacccgg aagccgaccg gctcgcgcga gcgcttgcgg cgcgcaccgg 717721 cgagacgttg accgaggcag tggttaccgc gttgcgcgag cggctcgctc gtgagactgg 717781 gcgtgcccgt gttgtcccgt tgcgcgacga gcttgccgcg attcggcacc ggtgcgcagc 717841 gttgccggtg gtcgacaacc ggtccgctga ggcgattctc ggctatgacg agcgcggatt 717901 gccggcctga tggtgatcga cacgtccgcg ctcgttgcga tgctcagcga cgagccagac 717961 gcagagcggt tcgaggccgc cgtcgaagcc gaccacatcc ggctgatgtc gacggcgtct 718021 tacctggaaa cggcactcgt gatagaagcc cgcttcggtg aaccgggcgg acgtgagctg 718081 gatctgtggc ttcatcgcgc cgcggtcgac cttgttgccg tgcatgccga ccaagcggat 718141 gccgcgcgcg ccgcctaccg cacgtacggc aagggaaggc atcgtgcggg gctcaactac 718201 ggcgactgct tctcatacgg cctcgccaag atcagcggcc agccactcct gttcaagggc 718261 gaagatttcc aacacaccga catcgccacg gtcgcgctgc cctaattctt agtcagccag 718321 gtgttcgccg caccggcttt cggcagcgtc aacggtgttg ttaagtgcgg cagaaggttc 718381 acaaggcatg tcgaccgctc agcgtgctcc gacttcgcga tccggatcct cgacgccgcc 718441 gtccgcgccg tcgccacggg cgtgtgcacg ccactggcgg tacccgtgtc gcgccgcgaa 718501 cgcaccgatg atggcggtga cgcaccacac cgcgatcgcg caggacgcca gcagcggtga 718561 gcgatcgccg atcgccgcac ccaatgcggt gtaggcgaat gcccgcggcg cggaaccgat 718621 gaatgcaccg acggccatct gccacaacgg aactccgaac gtcccgaacg cataggaggc 718681 gaacgcatcc gatatgccgg ggacaaagcg ttggccgacg acggcccaca ggccgcatcg 718741 ttcgatcagc gcgtcggtgc gatcggcacg ttccccgccc agcagggctc gcgcgctggc 718801 ccggccggct cgacggccga ccaggctcgc gacgacggcg gtgcccaccg tggcacccag 718861 cgtcacgaag acccccacta gcggaccgaa cagcagcccg ctgcttgcgg ccaggatcgg 718921 gcccgggacg aacaacgcgc cgagcacggc cgacactacg acataggtca gcggcgccgc 718981 cggcccggtc gccgagaccg cgccccgcac cgcggccaca tcgatgacgt ccgtggcggc 719041 taccaggtag aacattccta caaggaagcc ggcgaacacg acaagccgca cgatgtggcg 719101 tcgccgggat gtcggtgcgg aatcgttgtg agtgctcatg ctgaccgtga ttgttccgca 719161 ccgacgctgg ccgcgcccgt cgtccccggc gttggctggg gaacctcggc tgcgcgggcg 719221 ccgtccggcg agcaacccgt ttgtcctacg attgagctac gatcgtaggc atgtctgagg 719281 tggcctcgcg tgagctgcgt aacgatacgg ccggcgtgct gcgccgcgtg cgggcagggg 719341 aggacgtcac catcaccgtc agcggccgtc cggtcgcggt gcttaccccg gttcgtccgc 719401 ggcgccggcg ttggctgagc aaaacggagt tcctgtcgcg gttgcgcggc gctcaagccg 719461 atcccgggct ccgtaacgac ctcgcggtcc ttgccggcga cacgaccgag gatctcgggc 719521 cgatccggtg agcacgacgc cggccgccgg agtgctcgac acgtcggtgt tcatcgcgac 719581 cgaaagcggc cggcaactcg acgaggcgct gatccccgac cgggtcgcca ccaccgtcgt 719641 caccctcgcc gaactgcgcg tcggcgtgct ggccgcggcg acgaccgaca tccgggctca 719701 acgcctggcg accctggaat ccgttgccga tatggaaacg ttgcccgtcg acgacgatgc 719761 cgcccgaatg tgggcccgat tgcggatcca tcttgccgag tccggtcgcc gggtgcggat 719821 caacgacctg tggatcgcgg ccgtcgcggc atcgcgagcg ctgccggtca tcacccagga 719881 cgacgacttc gccgccctcg acggtgcggc cagtgtggag atcattcggg tctgactcgg 719941 tggccacgcg tctctcgcgc tgttgtccgc acccgcaggg cgtcccggtg ggtcaacgcg 720001 gcggcctcag tcgacgaaca gcgccatcga cgcggtaaac ccgtgcaacg cgttgtggcc 720061 cgcgaccggg ccgatctccc cggcggcgaa gaaaccggcc agcggaatcc cgcccagcag 720121 gtcctcgatc gtcgacgcgt cgtggtcggt gaccccgaac attcgtcgtc cgcgcccgtt 720181 gcaggtgaac agcagcccac cgaccggggg cccgggcagc tccgccgccg cccgctcgac 720241 ggccaggcgc aggtccttgt cggccgccgc cgcgtcccgg acctggaatt gcacggtcgc 720301 gccgacctcg acaacctcgc cgatcccgat cgcccccgtc gttgggtcgg cgccgagcag 720361 cccgcggatc aaaaagtcgc cctgacccgg caccgccagg tgctcgtcga cgacgattcc 720421 gatctgcagg ccgcggctga ccagttcctg ctcgtcgggc gccatcccca agacgatctc 720481 ccgcaggcgg tgcagcggcg gtcggccgcc cagctcggtg atcacggcac cgtccgcgcc 720541 ggtgacaatg tacggttccc cgatcggccg gcagccctgc gacaccacgg aaacgctgtg 720601 cgcgccgggc aggcgcacgc cgaccagccc ggaggtgagc acgtcgcggt cacgaaacag 720661 ccgggtgtcg ccccgccgac gcccaccgct caccaccccg ccgacgacgg tcgttcccgg 720721 caggtcggtg ttgaggtgct cgatgagcag attcgacggg aacgagtacg ggtccggcag 720781 cagcaggtgc aagtcgtgcg cggtccggtc gaagcggtaa ccggtgatca gagcgcccga 720841 gccggtgcga acgaagtcca ggtggaatgt ctccgcgggt gggccggacg ccagccacac 720901 cgctaccgcg ggctcgttct ccagctcgtg gcgaccggcg acgatgcctt gggccacgca 720961 accgatcagc gcggccggct cgaccgacgc ctgcaccgca gccagcaggt ccacggcctg 721021 gtcggtgtgt gaccgcgatc cgaggagcac ggccagcgcc ggcgtcccac ccgcgagctc 721081 ctcgcgcgcg tgcgcggcag cctccgccgc ggcccggcgc acgtccggcg cggtggaaac 721141 cccgactccg atccgcacac atccatgatg cgccgtcgcc gtgctgttcg tgtatgcgat 721201 gtcaaagtcc gggcgcggtt acccgacgag ccgagcacat ccccgacgag tcagccacac 721261 cccgtcgact gtaaccgcat ccgcaacccg ctggcccgca ccgcccggcg tgcgatcgcg 721321 gcccgcaccg aagcttcgga cccgaccacc cggaccttgc gtttggcccg ggccaccgcg 721381 gtgtacagca actcccgggt cagcaaccgc gaatcctctt gcggcatcag caccgtcacc 721441 tcgtcgacct ggctgccctg actcttgtgg atggtcatcg cgtgcatggt ctcgacgtcg 721501 ccgaggcggc cggtggcaac gtcaagtggc ccggatgcac cagaaatgac ggcccgcaga 721561 ccggtggggc cggccagcac gacaccggtg tcgccgttgt agacgcgaag gccgtagtcg 721621 ttggccgtca ccagcagcgg acgcccggcg taccacggcg tccagggcgg ctggccggtc 721681 tcctcggcga gccaggcttg aacccggcgg ttccagtgca gcacgccggt gggcccgtcc 721741 cgatgcgcac acagcagccg gtgctcgtcc agggtggcca acgcgacgtc ggaggcaccc 721801 aacagcgccg cctcgcgcag ccgcaacgcg tgtggcacca gcaccgcgcg caaccgcggc 721861 gccggatcct cgtcgtcgac gaactcgatc cgctcctcac ccgagcgcag caggcccagt 721921 acggcatcgc catcgccggc ccggatcgct tcggccaagg taccgatcac cttgccgaac 721981 cgatgcgacg ttcgcagctg cgccaccagc gcgtcgtcgc gtaccgagaa gccatcgacc 722041 aaatccgcca gcaccgctcc ggcttccacc gacgccaact ggtcggcatc gccgacgagg 722101 atcaaccggg cgcccgggcg caccgcctcg gccagccggg ccatcagcgt cagcgacacc 722161 atcgaggtct cgtcgaccac gatcacgttg tgaggcaacc ggttctggcg atcctggcga 722221 aaccgcgctc ccggtttggc acccagcaga cgatgcagcg tgaccgcgtg caggtcgccg 722281 agccgtgccc ggtcggtggc gtcgagcttg gccatctcgc gccgtaccgc ctcggccagc 722341 cgggccgccg ccttgccggt gggtgcggcc agcgcgatcc gcggccgcgg ctcaccggcc 722401 agctccgcct gctcggcaac caacgccagc agccgcgcga ccgtcgtcgt ctttccggtg 722461 ccaggcccgc cagtcaacac cgtaacacct tgcgagagcg cgatttccgc cgcgcgccgc 722521 tgctcgtcaa agccggtcgg gaacagtcgc cgcaagtcgg gtaccccggc cggtcgcctg 722581 gatgtcagca acgcgagcag gtccgcgcac acctgctctt cttcgcgcca gtagcggtcc 722641 agatagagca gccgatcgtc atacaggtgc agcacgggtg gatcggcgag caacggactg 722701 gcccgcaccg ccgccaacca gtccgccgga tccggccacg gcaggtcgtc gtgtccagca 722761 acccgcgcga tcgacaacag atccacacac accgaaccgg cccgtagcgc gcggaccgcc 722821 accgctaccg ccaacgccac ccgctcgtcg ctctccccgg ccagtgcaca gagacgttgc 722881 gccacatgca catccgacac gtccagcaca ccggcctggt tgaaggcccg caccatcccg 722941 gaggcctcga cggcaaaatc gacgtcggtg agcttcacga ctgcagcctt ccccggtcga 723001 gcagatccga gagcgccacc accaacgccg tgggcgggtt ccaggtgaac acaccggccg 723061 gatgcccggc cgtcaccggc gtcgccgcac cgcacatgcc ccgcacaaac aggtacagca 723121 ccccgccgag atggcgcgcc ggagcgtaat cccgctgccg ccaccgcagg aagcggtgca 723181 gcacaacaac atacagcagc gcctgcagtg ggtagtccga atgcagcatg gcctcggtca 723241 accgctcgaa gccgtaatcg gcggcggtgt caccaaggtg attggtcttg taatcgacca 723301 ccagatatcg ctgcccgggt agccgcagca ccacgtcgat cgaccccgcc aggtagccac 723361 gcagcggttg atcacccaac ccggccgaac caagccgatc ggcgtagggc gacaacgggt 723421 cgtcgccggg caggtgcgac gccagcagct cacccacgtc ggccagcgac acgtccgggg 723481 accggccgcg cagatcgccc ccggccagcg gcatctcgaa gtccaactcc cgcagacgat 723541 cacgcacacc gatctgccgc aatgtcagtg cggcggcggc gggtcccagc ggcgtgtcgt 723601 gcatcggcag caacgctcgg gccagttcgg gagccagctg cgcgtggtcg acgtccacgg 723661 tccaccacgg cgcgtgccgg gcacctgggc ttccagttcg gcagccagat cgggagcggc 723721 tgggtccgcg gtctcgagca ccgcgtgcac cagcgagccg aacgacgccc ccgacggcag 723781 cgcggccagc ggtgatgtca gatcggcgcc ggaaccgggc gcggcgaaga cggcgatctc 723841 cacctcgtcc gcacggccgc cggccgccgg ctcgctggtg acggtgacgg cttccgagcc 723901 ccgcaccaga tccgagtacg aggtccgccg ccacgtggtg tcgatccggc ggtgaaagtg 723961 ccgaacctcg aaaccgggta cgggcaccgg cttttcgagg gaactgcgag caccgatgac 724021 cgattcctcg accgacggcc cgcccgcggc ctcccactgc gcgaacaccg cccaggcctg 724081 ctcgtcggtg acgcgtggtg tacaccggtc cggtacctgc gactggccgg gccggcgccc 724141 gcgcagcaac cgcgacaacc cgccgttgac ctcgtcgaac gtcggtgccc accacgcgac 724201 gacctgcgat tgcgcgcggg taagcgcgac ataggtgagc cggaggttgt cgtgggccgc 724261 ctcgacgcgg ttcagcccct caacggtgcg ccgctgagca ccgccgtcct tgccgccgat 724321 gtacaggcag cgggtgccgt cgtcgtgata cagcaggatg tcgtcgctgc ggacgttgcg 724381 gttgaaggcg aacggcagat acacgatggg aaactgcagt cccttggcca cgaagacggt 724441 catgatctgc accgccgcgg cgtcgctgtc caaccggcga ttgtgttccg gcgggccggc 724501 acccgccttg gcctggcggc gcagccaatc gcgcagcccg ggcaggccga gccgctcgcg 724561 atgagcggcc tcgtgcagca gctgcgcaat gtgcgccagg tctgtcaggt cccgttcgcc 724621 gccgcgctgg ctcagcacgc gccggcccat cccggccagc tgagcggcct gaaacaccgc 724681 ggccacaccg cgatggcgtg cgtggtcggc ccactcgcgc aacgtgccgg ccacccgatc 724741 ggtcagcgca tcgccctcgg cggcaagcga ttccgcggtc tcaccgaaga acatcgtgca 724801 cgcggcggcg cggaccagcc cgctgcgctg cggcgcgtcg aacgcctcca gcaggcacag 724861 ccagtccttg gcggcctgcg aggcgaacac gtcggtgtca ccggtgtaga tcgccgggat 724921 gcccgcctcc gccaacgcat tccggcacgc ccgcgcgtct ttgtgatgct cgacgatcac 724981 cgcgatgtct gcggccacca cgggccgccc ggcgaaggtg gccccgctgg ccagtagcgc 725041 cgcgacgtcg gcggccaggt cgtcggggat gtgccggcgc agcgcctcga tcgggacgtg 725101 ggcggtcccg tcatacccga gcgtgtgccg tttgaccacg cgcaaccgaa acggcgccgg 725161 gcgcggcgcc gaggccaggc ggtgcccggc gtggtgggcg tcggtgccgc ggacgacgat 725221 gtcggcgtga cccagggtcg catcgcgcag caccgtctgc aggctctcga ccagcgcccg 725281 gtcgctgcgc cagttgacgc ccaacgtgta gcgggcatcg gcggtgccgg ccgccttgag 725341 gtaggtgtgg atgtcgccgc cgcgaaagcc gtagatcgcc tgcttgggat cgccgatcag 725401 gatcagcgcc gaatgccggc taaacgcgcg ctcgagcacc cgccactgca tggggtcggt 725461 atcttgaaac tcgtccacca gcacgatccg ccagcgttcc cgcatccgat cgcgagctgg 725521 cgagtcggcc gcctcgaggg ctgtcgccaa acggatcagc agatcgttga atccttgcgc 725581 acgcagccgg cccttgcggc gctcgagttc ctcgagcacc tcggcggcaa agcgcagccg 725641 caccgctgcc ttgctgccgg gctcgggatc aggcgggcgc agttgggcgc acgggtcgtc 725701 gacgacggca agggccaggg ccagcgcctc ggcgtaggtc agctccggat cggtctcctg 725761 acgaccgaag ttcgccagat agcgatcgtc cacgatctca gtgaccaggt cggtaaggct 725821 ctccttgagc tccacgtcgg cggcgttgtc accggccaca ccgagggatt tcaacaccga 725881 gccgcagaac tcgtgggtgg tggcgatggt tgccgcgtcg aagttggcca gcgcgtcacg 725941 cagccgcgac cgcttctggg cgcgctcggc gtcgctgccg cgcagcaggt gctcgacgag 726001 ctcgccgctc ggcggcgcgt cgccttgtag cgcgcccaag gcctcgacga tctgcccgcg 726061 cactcgctcg cgtaactccc ggctggccgc acggttgaac gtgatcaaca acatctcgtc 726121 gagcgtcgcg gcggtttcgg ccagatagcg ggtgaccaga ccggccagcg cgaacgtctt 726181 accggtgccg gcgctggctt ccagcacggt ggtggtgccc tccctcggca acgggcccag 726241 cagctcgaag cggtccatca gaccgaccct tcggcggcca acagcggcag ccatagccgg 726301 gcggccagcg cccctagccg ggtctcttcc ccggcgacct cttcgcccgc gcggggcttg 726361 ccgagcaaca cctcgaaggg tgcgcgcggg ccccaggctc gcacgtgggc gggcgcgtcg 726421 tcgtcgcccg gccggaacct gttggtctgc cagcattcgc gggcgggcgg gtaggggtct 726481 tggccgtctc ggcgtgcctg ggcccacgcg caggacgtct tcagcggcag cggcagtggt 726541 tcgcgccggc cggcgtcgta cagcaacacc agctcccgca ataccgccac cgggtccggc 726601 ggcggcacga aaagccttct ggcgatgtgg ttcctggtct tgctgcggcc gatgcacagc 726661 gccgaccact cgcggccagg ctcttgggcg gccagcgtaa ccaggccgat ccacgccggc 726721 aacacatgct tgggcgccag ctttgagtag gtcaccgaca ccgtgcgccc gccgaacacg 726781 ggtgtcaccg tgccgctcag tcgccgcccg tcgccgaggt cgacgtcgac gtcgtgcgcc 726841 tggccgtggc cgtcgcggtg cgccagcgcg gcggccgcca gatcgcgcgc gcggttccgg 726901 atttccttcg cccgtcgcac gccgaggcgc ccgggcggca acgtgccgcg acgccattcg 726961 gagtgagcgg cgtcgtcgag gtgcaggccg cggagcatgt cgcgcaacat ccgctcgccc 727021 accgtccact cggccaaggc gtcgacctgg accggtatcg agtcctcgac ggtgtcgacg 727081 tcccagggca gcgtgtagtc cagcgcccgg aagaacccct tgaccggatc cttgaagaag 727141 tcgagcaggt ccgccagcgt cacgtcggcc gcgggtggtg cgggcagccg accggagatg 727201 aaagccgttg gtggacagcg cttcccggcg gcggcctggg cggcggcgag cgcggcgggg 727261 tcgaacgtga acggcttggc gcccagcagt gcgccggggg tgacgttctt ccggtcgaac 727321 ggctgcagtg ggtgtgtgac caggatccgc tcacgcaccg gcgctgacgt cgtctggtcg 727381 agcgcgtcga gcaactcggc cagcggcacc gcgggtgggc gcggttgccc ggtgcgctcg 727441 tcggcgccgg tgtaagtgat caccagggtc tgggtggccg cacctatcgc gtccagcagc 727501 aattgccggt cctccgaacg gatgtcacgt tcacccgtca tcggttctcg ggccagcacg 727561 tcgtccccgt cgggatggct cagccgcgga aacacgccgt cgtccagacc caccaggcac 727621 accacccggt gcggcaccga gcgcatcggg accatcgtgc agacggtcag cgtgccggtg 727681 cgaaagttgg cccgggtcgg gcgcccggcc agctgcgcgt ccaaaagcgc tcgcacgtcg 727741 ggcagccgca acagcggcgc cgcgcgcgaa ccggcgcgcg ccagcacgtc ggcgaactcc 727801 cgctgcacct gcgcgcgttg ccagccgtcg ttacaggcgg tcagcagatc gatccccgtg 727861 gccagcgcat ccagccatgc gaccaacggc cgtgcaccgc tgagtccgcc gacgacatga 727921 tgcaaccgtt cgacgaactc ggccagcctc ccggccagct cgacccgatt gctgccgacg 727981 tcatcaaggg gcagcgcggt atccagccac gcttgggaat cctcggacat ggccaccccg 728041 gtgaggatgc ggtcgagtcc gaaccgccac gtgttgtgca cgacggtgtc gaggccatag 728101 cgtcgccggt gcgtcgggtc gaagccccag cggatgttcg attcgcgcac ccacgtggtg 728161 atggtgtcca ggtcgtcgtc ggcgaacccg aatttggcgc gcaccggagc ggcctgcgcg 728221 aggttgagca gttggctggc ggtggcccgg gtttcggcga tggtgagcag ttcggcggcc 728281 accgagagca gcggattggt ctgggtcagg gcgcggtcgg ccagacgcac ccgcagccgg 728341 tgtgcggggt ggcagtcgcc ggccacctca ccgaggccga agccggcgac gatcaacggt 728401 gcgtaggtgt cgatgtcggg gcacatcacc acgatgtcgc gcggttgcag cgtcgggtcg 728461 tcctcgagga ggccgagcag cacctcgcgc agcacatcga tttgccgcgc cgggccgtga 728521 caggcatgga cctgcaccga tcggtcggca tccgacaagc tacgctcggc gggtcgcggc 728581 gcgttgccgg cgatgtcggc ttgcagccat cccagcaacg tgtcgggttt ggttgtggca 728641 ccaaggaatt cgtcggtggc ccgggcggcg ggcagcgcgc gctgcagttc gcgcacgtcg 728701 cggcccagcg tttccagcag cgggtgctgg gcggcccgcc ggctggtgtc ctgccgccgc 728761 ggcagcaggc catcagcgcc ctggaagccg gccagcgccc gccacaactc gtcgctgggg 728821 tgcggcagcc acaggtgcag gtcgtggtgg acggccagcg catccagcag ctgcacgtcg 728881 gtgcaggcca ggcgggtgtg gccgaacagc gaaagccgag ccggcaggtc ggcggggccg 728941 tcgcgcagcc gggcgatggt cttgtcgtgg cggacatgcg ggggatcggc cccgaccgtg 729001 gtcaccaggg cgcgccacag tggcggttgc caggccaagt cgccgggcag ctcgccgagg 729061 tcgccgtcca gccaagcggc cagcaacccg ggacgctggc gtgcatagga cgcgaacagc 729121 ccggctagcc ggcgcgccac cgaatagcgc cggccgcggc gcagctccgc ctcggcatcg 729181 gtcgtcgcga agtgccccaa gtgggatgcc agcgtgcggc accacggttc gtcgaggctg 729241 gcgtcgatca ccgccagcag cggccacgcc agggcttccg gcgaccacgg gtcgtcgtcg 729301 agggtgccgg tgatctcggc gatcagggac tgcggattgc ggaacgcgat gccggcgcac 729361 accccgtcgg cgcggcccgg cccgcagccc aacacgagcg aaagccgttg gctcagccag 729421 cgttccacgc cgcgggcagc gaccagcacc agttcctgcg cgaaagggtc gggctgggga 729481 tcggccagca gcgcgccgag cccgtcggca agcagatcgg tgcgctcggc acggtgcagg 729541 tgaagcgcca tcgggcgtca ccctagtcga gcggccggcc gccgacatgc atgctggcgt 729601 gcataaacag acgcgagatc accgaacgac aagggctacc agtcggtacg ggccttcttg 729661 tcgacctgga actgggtcag atatcgaacc gttccgggat ttcatcaacg cgctggggcg 729721 tgccgcgatg tcggcatgac gagccgcctc ggacgtgaca cacttcgaga tggaggaggc 729781 ggtgtaggtg tgaggcggtt gcccaaagca accgcgtcac ccaccactta cagcccgaac 729841 tcggctgcta tcccgtcgat cccggcccga atcgcagtga gcgcgtcggc gcgggagcgc 729901 aacttggtcg cggcatgggc gtgttggttg agaccggcga actctcgtgc ggcttcctcg 729961 gcgcggctga ccaccacctc cggcagggcg atctcgtcga taaacccggc ggccagcgcg 730021 gtttccccga agaacgtctt ggccagcccg gttgcctgct ggtatgccga ccgggtcagt 730081 cgcagcttca tgatctctaa cgccgcgtac ggaatggtca tgccgatcgc gacctcattg 730141 gcctggatgt tgtatgcgtg ggccgccacc cgatgatcgc cgcaggacaa cagaaacgcg 730201 cccatggcga tggcgtgacc ggtgcacgcc atcaccaccg gtttggggta ggacaagagg 730261 cgatacgcca gctcgaagcc gcccctgagc atgtcgatcg cgggctgcac ttcaccggag 730321 gtgaggatct tcaggtcgaa gcctccgctg aatacccgac cattaccggt gatcaccagc 730381 gccccaacat catcacggtc cgcgttgtcg atcgctgcat tgagggcttg ttgcatcgcc 730441 gggcccagtg cgttgacctt gccgtcgtcc atactgatga cggcgatgga atccttgcgg 730501 gtatagctga ccgggtcgct catgctctcg attgaatcag atcagcattg ggggatcttg 730561 tgcgcccgca gttagcctgc cggtatccgc gtgggctgtg gcccttgccc ctccgagcgc 730621 tggctgacct cggtgggcac ctcgacctgc cgagcgcgcc acctgtcctg ggtttcggcc 730681 gcggcgcggt tgatccggtc gatctcgctg cggaacacgt cgggacgcgt ctccttgctc 730741 gcgtagtgaa aatacgtcaa cagactacgt aacagctcca gcttctgctt ttcccgcttc 730801 gccaggtcgt gggtggtcat cagctcgatg cgcgcaacgg gcggcagggc atcccagtcc 730861 agcaccactt tggtctccaa cggacgccgt ccgcgccgcc agaaccgcca tcggcgcggc 730921 tgctcgggtc ggtcgtagta ggtcaccgtg cccgggaagc gagactcgat gccccgcccg 730981 atctcggcgc gatcgagcgc tgaatcccac accatccgcc actcctgacc gggagccagc 731041 atcggcaact cttggggcag ccgaagttcc acgacatcgg cgtagccgtt ggcggcattc 731101 tcgtattggg ccacggttgg tgggttgggg aacgagaacc ggacgtcgta ggcggctgtg 731161 cgaccgaagt tgcggactac cagctcgatc acgtgccagt ccgcgacgtg gggctccata 731221 aacatggcca cgtagggccg agtctgctcc gcagccagtc gacgattgcg ttggatttgc 731281 cgcttggtca ccaccagggc gaccacaccg agcccaagcg ccgcccacgc caaccaggtg 731341 ccggagtcga cgccggtgac ctcatgccag ctgctcagga cccaccccat ggaatccacc 731401 atccgcttat accacagtga catcggaccg agaagttagc tgacaggatc ccagaggcgc 731461 ctgggcactg gtcgctggct gccgaatcgt tggcggaagc gccgctggac acgtcgctgg 731521 acccgggccg gaacgggaga ggcttgccca gtccttcagc cgcccatcaa cattcgccat 731581 tgatcgagac ttgcggggcg ataaacgtaa ttggaacgct tgacctccga cagcgacgca 731641 cttggctcgg ccgaatacca gtgcccggga aagacggttg ggtcacccgg aagctcggcg 731701 agctgtcgca ggctgcggta catctcgtcg gaatcaccgc cgggaaagtc tgtgcgtcca 731761 cagccttcca ggaatagcgt gtcaccggcg accagccggc cgtcgagtag aaagcactga 731821 ctgcctgggg tatgcccggg tgtgtgcagc agctcgatgt cgatgtcgcc gacgctgacc 731881 ttgtccccat gctcatgggt gatcaggtcg ccgacaggaa tcccagtgac tcgcgaaacc 731941 cacagcgctt catgggtgtt cacgtgcacg ggtacagatg cccgctccag cagctcagcc 732001 agtcccggca gctgaaaacc catcatcgag ccgcccacat ggtctggatg atggtgggtc 732061 accagcacac ccgatagctg catatcgtcg gattcgagcg cgtcgagcag atccccggca 732121 gcgtaggccg ggtcgaccac cacgcagtcc ccggttgtgc gatctccgat caggtaggca 732181 aagttgcgca tttgcgtcgc gaacatgtcg ccgacggcga aatcgcgacc ggagagcagt 732241 tgacggaagt acagccggtc cttggacacg caaccagcct atgtcttgtc catcgccgcc 732301 cagaccgcgt cttggcgttt gcagcccggg acacgttaat gcggagtctt ggggtctgac 732361 tgtgggtgcg gtgggtatct ttggtccatg ctgaagaggg tcgagataga ggttgatgac 732421 gaccttatcc aaaaggtcat ccggcggtac cgtgtgaagg gtgcgcgcga ggctgtcaac 732481 cttgcgctgc gaacgttgct cggcgaggcg gataccgcgg agcatgggca cgatgacgag 732541 tacgacgagt tcagcgatcc caatgcctgg gttccgcggc ggagccgcga cacagggtga 732601 tcccgtccaa tcttggacga cttggtccgt agctgcatgg gtggcaccgg tggtttggtg 732661 gcgttgcgcg ccaggctgta ccctctttta ggcccgcggc acgacccgac tggtcgctac 732721 gggtgagcgg cccccttagc tcagtcggca gagcgtttcc atggtaagga aaaggtcaac 732781 ggttcgattc cgttaggggg ctcggcggac gccgggcagg ctggcggtgc gtaccagagg 732841 cgatgtagct cagtcggtta gagcgaacga ctcataatcg ttaggtcgcc ggttcgagtc 732901 cggccatcgc tacaacacaa cagcaagact cgttagagag aacggatatg gcttccagta 732961 ccgacgtgcg gccgaagatc actttggcat gcgaggtgtg caagcaccgt aactacatca 733021 ccaaaaagaa ccgccgcaac gacccggacc ggctggagct gaagaagttc tgcccgaatt 733081 gcggcaaaca ccaggcgcac cgcgagacgc ggtaaccgcc gacccgcgag cagttgctga 733141 gactgactag gtaggttcta cagccgtggc gttgagcgca gacatcgttg ggatgcatta 733201 ccggtatccc gaccactacg aggtggagcg ggagaagatt cgcgagtacg ccgtcgccgt 733261 tcaaaacgac gacgcgtggt atttcgagga ggacggcgcc gccgaactcg ggtataaggg 733321 cttgctggct ccgttgacgt ttatctgtgt gttcggctac aaggcccagg cggcgttctt 733381 caagcatgcg aacatcgcga ccgcggaggc gcagatcgtc caggtagacc aagtgctgaa 733441 attcgagaaa ccgatcgtgg cgggcgacaa gctgtactgc gacgtctatg tggattcggt 733501 gcgtgaggcg cacggcaccc agatcatcgt gaccaagaac atcgtcacca acgaggaagg 733561 tgacctcgtg caggagacct atacgaccct ggcgggccgt gccggcgagg atggagaggg 733621 attttctgat ggcgctgcgt gagttcagct cggtgaaggt cggagaccag cttccggaga 733681 agacctaccc gctgacccgc caggatctgg tgaactacgc cggagtttcg ggtgacttga 733741 acccgattca ctgggacgac gagatcgcca aggtcgtcgg gctggacgcc gcgatcgctc 733801 acggcatgtt gacgatgggg atcggcggtg gctacgtcac atcctgggtt ggcgacccgg 733861 gcgcggtcac cgagtacaac gtgcggttca ctgcggtggt tccggtgccc aatgacggca 733921 agggcgccga gctggtgttc aacggtcggg tgaaatcggt tgatcctgag agcaagtcgg 733981 tgaccatcgc actcaccgct actaccggcg gcaagaagat tttcgggcgg gccatcgcct 734041 cggcgaagtt agcgtagttt atggcgctca agaccgatat ccgcgggatg atttggcggt 734101 acccggacta cttcatcgtg ggccgtgagc aatgccgcga gtttgcccga gctgtcaagt 734161 gcgaccaccc ggcctttttc agcgaggaag cggccgccga cctcggttac gacgcgctgg 734221 ttgctccgct gaccttcgtg acgatcctcg ccaaatatgt gcaactggac ttcttccgcc 734281 acgtcgacgt gggcatggag acgatgcaga tcgttcaggt cgaccagcgg ttcgtgttcc 734341 acaaacccgt gctcgccggg gacaagttgt gggctcggat ggacatccat tcggtggacg 734401 agcggttcgg cgcagacatc gtcgttacca gaaacctctg caccaacgac gacggtgagc 734461 tggtcatgga ggcctacacc acgctgatgg gccagcaggg tgatggttcc gccagactca 734521 aatgggacaa ggaatccggg caggtcatca ggaccgcgta attagcaact ggccgctgcg 734581 gccatgtaca ctcggacctc ggggttttcc caacatcggc gcgctttccg tgagttcaac 734641 gagcggagtg tcgtctccac tttcggttcg cgatcaccga acggagggcg cgcgtgtcat 734701 gtgagccccg gcgtagtggg ttggccaggg cctggtctgg tcttgcctgc caaccgcgaa 734761 ggggcgtagc tcaactggca gagcagcggt ctccaaaacc gcaggttgca ggttcaagtc 734821 ctgtcgcccc tgctgaaggc gaacgttcga cgacgatgca ggcacggcct gaagaggaga 734881 cggaccatag gtatgtgcca tggtggacac tggaaggtgc cccaccagag cggaacggct 734941 cgcggggtag ctagtaaacg aaggagcatg cggtgagcga cgaaggcgac gttgccgacg 735001 aggccgtagc cgacggcgcc gagaatgcgg acagccgcgg gagcggtggc cggacggccc 735061 tggtgacaaa gccggtggtg cggccgcaac gtcccaccgg caagcggtcg cggtcgcgtg 735121 cggcaggagc cgacgcagac gtcgacgtcg aagagccgtc gaccgcggct tcggaagcta 735181 ccggggtcgc caaggacgat tcgaccacca aggccgtgtc gaaggctgcc agggcaaaaa 735241 aggccagtaa accgaaggcc cggtcggtta acccgatcgc attcgtctac aactacctca 735301 agcaggtcgt tgccgagatg cggaaggtaa tctggccgaa ccgcaaacaa atgcttacct 735361 acacgtcggt ggtgctggcg tttctggcct tcatggtggc gctggtcgcc ggtgctgact 735421 tgggcctgac caagctggtg atgttggtgt tcggctgagg ctcgagagtg acagagagga 735481 ctgaaaaccg tgactacctt cgacggtgac acgtccgcgg gtgaggcggt cgatctaaca 735541 gaggccaacg ccttccagga tgcagcggcc ccggctgaag aggtcgatcc ggccgccgcg 735601 ctcaaagcgg agctgcgcag caagcccggc gactggtacg tcgttcactc ctacgcaggg 735661 tacgagaaca aggtcaaggc caacctggaa acccgggtgc agaaccttga tgtcggcgac 735721 tacatcttcc aggtggaggt gcccaccgaa gaggtcaccg agatcaaaaa cggccaacgc 735781 aagcaggtca accgtaaggt gctgcccggc tacattctgg tgcggatgga cttgaccgac 735841 gactcctggg ccgcggtgcg taacacgccg ggggtcacgg ggttcgttgg ggcaacatct 735901 cgcccgtcag cgctcgccct cgacgacgtg gtgaagtttc tgcttccgcg ggggtcgacg 735961 aggaaggctg ccaagggtgc ggccagcacg gctgccgccg ccgaggcggg cgggctagag 736021 cgtccggtcg tcgaggtcga ctacgaggtg ggcgaatcgg taaccgtcat ggacgggccg 736081 tttgccacat tgccggccac gatcagcgag gtcaacgccg aacagcagaa actcaaggtg 736141 ctggtctcca tcttcggccg cgaaacaccg gtggagctga cctttggcca agtctccaag 736201 atctagccca gcagggcagg ccacacaggc tgaaacaagg aaggacatcg acacgtcatg 736261 gccccgaaga agaaggtcgc cgggttgatc aagctgcaga tcgtggcggg ccaggccaac 736321 cctgccccgc cagtgggccc cgcgctcggt cagcacggcg tcaacatcat ggagttctgc 736381 aaggcgtaca acgccgcgac ggagaaccag cgcggcaacg tcatcccggt ggagatcacc 736441 gtttatgaag accgtagctt cactttcacg ctgaagacgc cgcccgccgc caagctgctg 736501 cttaaggccg ctggtgtggc gaagggttcg gcggagccgc acaagaccaa ggtcgccaaa 736561 gtcacctggg atcaagtccg cgaaatcgcc gagaccaaga agacggacct caacgccaac 736621 gacgtcgacg ctgcggccaa gatcatcgcc ggtaccgctc ggtcgatggg catcaccgtc 736681 gaatagggcc ctacccgtgg gagggccagc ttcggcccgc tgagtaacca cgacccatag 736741 attggatatc aaatgagcaa gaccagcaag gcatatcgcg ccgccgccgc gaaggtggac 736801 cgcaccaacc tctacacccc gctgcaggcg gccaagcttg ccaaagagac ctcgtcgacc 736861 aagcaggacg cgaccgtcga ggtggcgatc cggcttggcg tcgacccgcg taaggcagac 736921 cagatggttc gcggcacggt caacctgcca cacggcactg gtaagactgc ccgcgtcgcg 736981 gtattcgcgg ttggtgaaaa ggccgatgct gccgttgccg cgggggcgga tgttgtcggg 737041 agtgacgatc tgatcgagag gattcagggc ggctggctgg aattcgatgc cgcgatcgcg 737101 gcaccggatc agatggccaa agtcggtcgc atcgctcggg tgctgggtcc gcgcggcctg 737161 atgcccaacc cgaaaaccgg caccgtcacc gccgacgtcg ccaaggccgt cgcggacatc 737221 aagggcggca agatcaactt ccgggttgac aagcaggcca acctgcactt cgtcatcggg 737281 aaagcgtcgt tcgacgagaa gttgttggcg gagaactacg gcgcggcgat cgacgaggtg 737341 ctgcggctca agccgtcctc gtcgaagggc cgctacctga agaagatcac cgtgtcgacg 737401 acgacgggcc cgggcattcc ggtcgaccca tccatcaccc gcaacttcgc gggggagtag 737461 tttccccggc gagcagacgc ataagccccc gcacgcacgg cgtgtcgggg gcttatgcgt 737521 ctgctcgccg ggcttaggcc gcggcacctg gcttgaggta ggtcaccagg ctgcagtcga 737581 gcatctcgtc ggtgaagtag tgctcgcagc cacgcaaata cttcatgtag cggttgtaga 737641 cctcttcgga ggtgacctcg atggccttgt ccttattgga ctgcagcgtg tccccccaga 737701 tccgcagcgt cttgatgtaa tgcgggcgca acgagagcgg ctccgggacg gtgaaaccgg 737761 ccttctcgcc gtgttcgacc atcatctcgg tggacggcag gcggccgccg ggaaatatct 737821 cggtgacgat gaacttgatg aaacgcgccg tctcgaagct cagcttctta ccgcgggccg 737881 ccatctcgta ggggtggtag ctgacgctgc tctggacggt catccggccg tcggcgggca 737941 tgatgttgaa acaccgcttg aagaagtcgt cgtagttctc gtgcccgaag tgctcgaagg 738001 cttcgatcga cacaatccgg tcgacgggtt cggcgaaatc ctcccagcct tgcagcagca 738061 cttgacgtga gcggttggtg tcgatcgaag ccagcacttg ctcgcagcgg gcgtgctggt 738121 tcttggacaa cgtcaggccg atgacgttaa cgtctaaccg ctcgacggcg cgcctcatgg 738181 tggtgcccca accgcaccca atgtccagca gcgtcatgcc cggcttgagg tccagcttgt 738241 ccaggttgag gtcgaccttg gcgtattggg cttcttcgag cgtgagctcc ggtggctcga 738301 agtaggcaca gctgtaagtt cgggtcgggt cctggaacag ggcgaagaaa tcatcggaga 738361 cgtcgtagtg cgcttggatg tcttcgaagc gtgtccgtgt cttggttggg ctaatcggtt 738421 tctcggccat tctcgtcatg ttctcctgga tggtgtcagt taccggtggc tgtgcaccca 738481 tagcccgtcg gtggcacgaa agtctacttg gccagcgtga actggttgca gtcgatgtag 738541 cccattcgga acgccttggc gcagccggtc aggtatttca tgtaccgctc gtatacctct 738601 gcggactgga tctcgatggc ctcgtccttg tgcgcttgaa gcgcctcggc ccacaggtca 738661 agagtcctgg cgaagtgcgg ttgcagggac tggatatcgg taatggtgaa accggccttc 738721 gtcacatgct cctcgatcgt ctcgatcgtc ggcaaccggc cgcccgggaa gatgtcggtc 738781 acgatgaacc ggatgaattt ggccatctcc atggtcaacg gtatgccgcg ctcgatgacc 738841 tgctttacgt gcaagccggt gatcgagtgc agcagcatca cgccgtccgc gggcatcgcg 738901 ttgtaggcga acttgaagaa gtcatcgtag cgctcgaaac cgaagtgctc tatcgcttcg 738961 atggttacga tgcggtccac cggctcgctg aagttggccc agtcgctcag cagtacccgg 739021 tgcgagcggt tggtgtcgac cttgtcgagc acttgctggc agtaggcgtg ctgatttttc 739081 gacagggtca agccgacgac gttgacgtcg tagcgctcga cggcacgctt catgaccgaa 739141 ccccagccgc agcccacgtc gagcagtgtc attcccggtt ctagccccag cttgcccagg 739201 gttaggtcca gcttggcgac ctgtgcttcg tgcaaggtca tgtcgtcgcg ctcgaagtag 739261 gcgcagctgt aggtccgagt cggatcctgg aacagcgcga agaacgcatc tgaaaggtcg 739321 taatgggctt ggacgtcgtc gacattggac cgagactttg tggtgcccgt tgagttatca 739381 gacatgtgtc ctcccactgt gaggggcacc ttcagcaggt ggccatcccc ggcaccctac 739441 acggtgcatg gcacatcgcc cgcattcgcg ctcgcatgcg ccggtctttc tcgatcggga 739501 tttgccagat atcaccctgg ccggcgcaat cactacttcg ccagcgtgaa ctggttgacg 739561 tcgatgtagc cgacccggaa cagcttggcg cagccggtca ggtatttcat gtaccgctcg 739621 tagacctctt cggactggat cgcgatggcc tcgcttttgt gttcctgcag cgcctcggcc 739681 cacaggtcga gggtcctggc gtaatgcggc tgcagcgact ggcggcgagt cagcgtgaaa 739741 cccgtcttcg ccgactgttc ctcaaccatt tcaatcgtcg gaggttggcc ccccgggaag 739801 atttcggtcg cgatgaactt gagaaagcgg gccagccaca acgtgagcgg caagccgtgg 739861 tcgaccatct gctgcctggt caggccggtg atcgtgtgca gcagcaacac gccatcgggc 739921 ggcaggattt tgtgggcccg ggcgaagaag tcggcgtgac gatcgtggcc gaagtgctcg 739981 aacgcgccga tcgacacgat gcggtcgacg ggctcgttga actgctccca tcccgccagc 740041 aacactcgcc tgtcgagcgg ggtgtccatc tcgtcgaacg acttctgcac atgggcggcc 740101 tggttcttcg acaatgtcag gccgacgacg ttgacgtcat actgcgcgat cgcgcgccgc 740161 atggtggcgc cccagccgca accgatatcg agcagcgtca tgccgggctg cagacctagc 740221 ttgcccagcg ccaggtcgat cttggcgatc tgggcctctt ccagcgtcat gtcctcgcgt 740281 tcgaaatgcg cgcagctgta ggtctgggtc ggatccagga acagccggaa gaagtcgtcg 740341 gacaggtcgt agtgtgcctg cacgtcctcg aagtgcggcg ttaggtcgtt gaccatgagg 740401 tgtaatgcct ttccggaccc taggtggcct ttcggtgctt gcacggaacg caccgatgct 740461 tccccctccc cgcatgctcg aggcatgcta tccgatacag ggccgccgca ctaaaccgcg 740521 atcgaatttg cccaggtcag ggaacggata tgagcggacg agctacttgg tcatggtgaa 740581 ctgggcgacg ttgattaggc ctctgcggaa gcgctccgcg catccggtca gatagtgcat 740641 gaagttgttg tagacctctt cggactgtac ggcgatggcg cgttcgcggg cagcctgtag 740701 gttggcggcc catgcatcga gagtccgtgc gtagtgctgc tgcagcagct ggacatgctc 740761 gatggtgaag cccgcggcct gcgcattgtc gacaatgtcg ggctccgatg gcagctcgcc 740821 gcccgggaag atcgactccc gcaggaattt gaggaatcga aggtcgctca tcgtcagcgc 740881 aatgccctgt tcgtgcagcc acctgcggtc gtaggtgaac aggctgtgca gtagcatccg 740941 cccgtcatcg ggcaggatgt cgtaggagcg ttcgaagaac gtcagatacc gctccttttt 741001 gaacgcgtcg aatgcctcaa agctgacgat ccggtcgacg ttctcttcaa actcttccca 741061 gccctgcagc cgggcctcgg cgcgccgttg cgttccgatt gcggccaggc ggtctttgct 741121 gcgttcatag tgattccggc tgagcgtgag gccgatgaca ttgacgtcgt acttctccac 741181 ggcccgaacg agcgccccgc cccacccgca acccacgtcg agtagcgtca tccccggttc 741241 gaggttcagc ttgtccaacg ccagatccac cttggccagt tgcgcctctt ccagcgtcat 741301 atcgtcacgc tcgaaatagg cgcaggtgta gacccaggtg ggatcgagga acaacgcgaa 741361 gaagtcatcc gaaatgtcgt aagccgactg tgactcttcg taatatggtc tcagcttggc 741421 cataggcgac aacctcccgc gccaaccgta caacgcctcg ccgaccggct cagccggcct 741481 cagagaagtt gcgcgtcaac tcgccgatca cccgatccca cagctgtctg ggcaggtcat 741541 ggcccatgcc gtcgatgagc accaggcgcg cgccgttgat tgctcgcgcg accgcgcggc 741601 cgccgaacgg ccgcatcagc ttgtccgcgc gcccgtggat gacgacggtc ggtgcgacga 741661 tgcgccggtc gtagcgcagc aggctgccgc tgcccagtat cgcgctgaac tgctgggcga 741721 ttccccaggg atggaagttg cggtcgtagc tttcggcggc ctcggctcgt acctggtctt 741781 cgggaatcgg gtaggccggg ctgccgatga tcttgctgac ccggacggcg ttgtcgacaa 741841 tgacgtcgcg tggcgaatcc ggcggcggac ccgtgagcag cgccagcagc gcgcgtggcg 741901 ccggcggtgg cagaaaccgg tgattgttgc tggagaagat gaccgccagg gttttcgtcc 741961 gctgcgcgaa tcgcgcggcg aaaatctggg cgatcatgcc gcccatcgac gccccgacga 742021 cgtgcgcgtg cttgacgtcg aggtgatcga gcaacgccgc ggcgtcggcg gccatgtctt 742081 ccaacgtgta ggcagcctgg ctgggcagac cgagccagga ccggaccaac cgcgtggcca 742141 gtggctgtcc cgggcggtgg cgctcggtct tggtggacag gccgacatcg cggttgtcgt 742201 agcggatgac gcgcaggccc ttcgcgacga gccgcgcgca gaagtcggtc cgccacagca 742261 gcatctgggc gcccaggccc atgatcagca acaccggcgg gtggtcgagg tcacccatgt 742321 cctcgtagta cagcttcaca tcaccggaga ccgcggtgcc gctacggatg tccaccgaga 742381 cctcgcctaa acctcgatgt cggattgatg ttcgcggctg acctcgacca tgaagttggc 742441 gaaatatccg gtcagctgcg ggtccgacat catctgccac ctcggcgcca gcagcttcat 742501 gtagcgctcc acgtacagga actgcttgcc gatcagcacc agctcgcggg gcagcttgac 742561 gtcgtaggcg tcggccagcg ccgagagctg gcggccgatg tcggcatatg acatgtcgcc 742621 cagcgattgc atggtcagcg gggtggcgaa gcgctccagg tctttggcgg cctgggtctc 742681 gggcttcatg gtgccgacgg cgcccatgag cacgacgatc ttgccggcgg ctgcgtggtc 742741 cttcttcacc agcagcgcat acaccagctc gcggagtagc cagcgggtgc gtggatcgat 742801 gcggcccatg atcccgaagt cgaagaacac gatgcggccc gcctcgtcga cgtagaggtt 742861 gcccgcgtgc aggtcgccgt ggaacagccc gtgccgcagg ccgccctcga acaccgaaaa 742921 cagcagtgcc ttgaccagct cgacaccgtc gaacccggcc ttgcggatcg cggcggtgtt 742981 gtcgatgcgg atgccgtgca cccgttccat cgtcaacacc cgctcggtgg tgaagtccca 743041 gtgcacctgc ggcacccgga tgtttttgcc cagcggcgag gcgtgtaggt gggagaccca 743101 ggcctccatg gactgcgcct cgaggcgaaa gtccagctcc tcggccaggt tgtcggcgaa 743161 gtcggcgacc acgtcttgtg ccgagagccg ccggcccagc ttggccagtt cgacggtctg 743221 cgcgaagcgc ttgaggatct gcaggtcggc ggcaacgcgg cggcggatgc ccggccgctg 743281 gatcttgacc accacctcct cgccgctgcg cagggtcgcg tagtgcacct gggcgatgga 743341 cgccgacgcg aacggctctt cctcgaagga ggcgaacagc cgggccggct cgtcgccgag 743401 ttcctcgacg aagagcttgt gcacctcgtc ggtttttgcg ggcggcaccc ggtcgagcag 743461 gccgcggaat tcccgcgaca gcgactcacc gaatgctccc gggctggacg cgatgatctg 743521 gccgaacttc acgtatgtcg gtcccagatc ggcgaaggtc tgcgggagct ccttgatcac 743581 cttctgttgc cagggccctt ttcgggggag cctgccgatg aaccggacgg cggtgcgggt 743641 gacctgccaa ccggtggccg ccacccgggc agcttcgacc ggcagcggta cccggtcaag 743701 cttggccacc tcgcggtgtg tggtggaacc catctgagca gtgtgccaaa ccggggcaga 743761 cagctcccaa ttgacgtgag cccgctcact tgctgggtaa gcgtcgccga atgtgtaatg 743821 agggcggaaa tccggcccga tttccgccct cattacacat tcggcgacgc tcggttggcc 743881 cgctgctggg ccggaggcgt tcctttatcg caacgattgg tcgccgtaag gtgcggtcat 743941 gcgggtggtt tcggcggaat cgaccgagct tttcgtcggc ccgtcggatg cgccgctgca 744001 gctggtccgc gtcgcggtca ccgggtgcac cgaaccgccg ccgatccgca tccatggtga 744061 cggcttggcc ggcgaggcgg ccgcccgccc cggcgacgac gtcatcgagg ttgcggtcgc 744121 ggttgagtcc ccggtcgtcg gtgagcggcg gaccgcgcgg gtacacaccc ccgatggtcc 744181 gagcctggcg ttcgagttca ccgtggccga gcccggctgg acgatgttca tgatcagcca 744241 cttccactac gacccggtct ggtggaacac ccagggcgcc tataccagcc agtggcgcga 744301 agacccgccc gggcgagccc gccaggccaa cggcttcgag ctggtgcgcg cgcatctgga 744361 gatggcgcgc cgcgagcccg agtacaagtt cgtgctagcc gaggtggact acctcaagcc 744421 gtactgggat acccacccgc aggaccgcgc cgacctgcgc cggttcctcg ccgatggccg 744481 tatcgaagtg atgggcggaa cctacaacga acccaacacc aacctcacca gcccggagac 744541 caccatccga aacctggtgc acggcatcgg ttttcagcgt gacgtgctgg gcgccgagcc 744601 ggccaccgcg tggcagctcg acgtgttcgg ccatgacccg caatttcctg ggctggccgc 744661 cgatgccggg ctgacgtcga gttcctgggc ccgcgggcca caccaccagt ggggtccggc 744721 ccaaggcggg gtagaccgca tgcagttttg cagcgagttc gagtggatcg cgccgtcggg 744781 tcgcggcctg ttgacccatt acatgccggc gcattattcg gcgggctggt cgatggactc 744841 gtccacctcg ctggccgacg ctgaggccgc cacctacgcg ctgttcgacc agctcaaaaa 744901 ggtcgcgctg acccgcaacg tgctcctgcc ggtgggcacc gactacaccc cgccgaacaa 744961 gtgggtcacc gccatccacc gcgactgggg tgcgcgctac acctggccgc gcttcgtgtg 745021 cgcgctgccc aaggagttct tcgccgcggt gcgcgccgaa ctggccaagc gtggttgggt 745081 gccgttgccg cagacccgcg acatgaaccc gatctacacc ggcaaggacg tctcctacat 745141 cgacaccaaa caagccaacc gggccgccga gaacgccgtc ctggaagccg agcggttcgc 745201 ggtgttcgcc gcgctgctga ccggcgccga gtatccgcag gcggcgttgg ccaaggcgtg 745261 ggtgcaactg gcctacggtg cgcaccacga cgccatcacc ggctcggagt ccgaccaggt 745321 ctacctcgac ctgctgaccg ggtggcgtga cgcgtgggag ctgggccgcg cggcccggga 745381 caactcgctg cggttgctgt ccggcgcggt cgccgcgtcg cacgatcgcg tcgtcgtgtg 745441 gaacccgctg acccagcggc gcaccgacat cgtcactgcc agggtcgacc cgccgctgca 745501 ggccggcgtg cgggtgttcg atcccgacgg ggctgaggtg gccgcgctcg tcgagcacga 745561 cggacggtcg gtcacctggc tggcgtgcga cgtgccctcg ctgggctggc gggtttaccg 745621 gttggtgccc gccgacgagg cgccaggctg ggaattggta cccggcaccg acatcgccaa 745681 cgagcactat cggctggccg tcgaccccga gcgtggcggg gcgttgtcgt cgctggtgca 745741 ggacggccgc cagctgatcg ccgccggccg ggtagccaac gagctggccc tctacgagga 745801 atacccgtcg cacccgactc agggggaggg tccgtggcat ctactgccca cggggccggt 745861 ggtgtgctcc tcggcatgcc cggcgcaggt gcaggcatac cgcggcccgc tcggtcagcg 745921 gttggtcgtg cgggggcgga tcggcaccct gctgcgctac acgcagacac tcaccttgtg 745981 ggacggcgtc gaccgggtgg actgccgcac cagcatcgac gagttcaccg gggaagaccg 746041 cttgctgcgg ctgcgctggc cgtgtccggt acccggcgcc atgccgatca gcgaagtggg 746101 ggacgccgtc gtcgggcggg gtttcgcgtt gctgcacgag gggcccgaat cggtggacac 746161 cgcccagcat ccgtggaccc tggacaaccc ggcctacggc tggttcgggt tgtcctcggc 746221 ggtgcgggta cgcgccggcg atggggtgcg cgcggtgtcg gtggccgagg tggtgtcgcc 746281 gacggagacg gtgtccggcc cgatggcgcg cgacctgatg gtcgcgctgg tccgcgcggg 746341 cgtcaccgcg acctgcagcg gcgccgacaa gccgcgctac ggccacctcg atgtcgattc 746401 caatctgccg gacgccagga tcgcgctcgg tgggccggac cgcaacacgt tcaccaaggc 746461 cgtgctggcc gaggccgccc cggcctacac cgccgaactg cagcggcagc tggcgaagac 746521 cggcacggcc agggtgtggg tgccggccgc gaacccgttg gcgcgggcct ggctgcccgg 746581 cgcggacttg cgggcaccgt gcgcgctgcc ggtgctggtg atcgacggcc gagacgagaa 746641 gcacctgcgc gccgcggtgg cgtcgctggc cgacgacctg gccgacgccg agatcgtcgt 746701 gcaccagcgg gccgcgccgc aaatggagcc gttcgaggat cgcacggtcg cgctgctcaa 746761 ccgtggggtg cccagcttcg ccgtcgactc cgagggcacc ctgcacaccg cgctgatgcg 746821 gtcgtgcacc ggctggccct ccggggtctg gatcgaccag ccgcgacgca ccgccccgga 746881 tggctcgaat ttccaactcc agcactggac ccaccacttc gactacgcgc ttgtctgcgg 746941 cggcggcgat tggcggcgcg ccggcatccc ggcgcgcagc gcgcagttct cccacccact 747001 gcttgcggtg gcgccgcgac ggccacaggg cgagctgccg gcggtcggct cgctgctgca 747061 cgtcgagccg gccgactcgg tgcagctggg cgcgctcaag gcggccggca accggctggc 747121 agccggcagc gcgcggccgg tccaacccgc cgcggtggcg ctgcgattgg tgcaaacgac 747181 aggagccgac accccggtca ccatcggctg cgagctgggc aaggtaggcg ccctccggcc 747241 ggccgacctg ctggaaacgc cgctcgcaat ggcaagggcg cgcaagtcgt ccatcgacct 747301 gcacggctat caggtcgcca ccgtgctggc ccggctcgac gtggccgctg atatggctaa 747361 cgtgctggcg gccgacgacg tggcgttggc gccgcacgcc gagaccgctc agccgcagta 747421 cgcgcgctat tggctgcaca accgcggccc ggcgccgctg ggcgggctgc ccgcggtcgc 747481 ccacctgcac ccgcggcggg tgcgcggcca gcccggtgac gacgtggtgc tgcgcctgac 747541 cgcggccagc gactgcaccg attcggtgct gggcggcgtg gtcgacgtcg tgtgtccgct 747601 cggctggccg gccacaccgg ctcggttgcc gttcacgctg ggcgccgggg cgcacctgca 747661 ggccgacatc gcgttgagca ttcccgccgg cgcgccgccg ggaccgtatc cggtccgcgc 747721 gcagctgcgc gtcgtcgaca cggcggtacc ggccgcctgg cgccaggtgg tcgaggacgt 747781 gtgcgtggtc accgtcggcg ccgactccga tctggaggag ctggtctacc tcgtcgatgg 747841 gccggccgac atcgagctgg ccgccggcga ccgggcccgg ctggcggtga cgatcggcag 747901 ccgcgctcac gccgagctgg ccctggatgc gcactcgatc agcccctggg gcacctggga 747961 gtggatcggc ccgcccgcgc tcggcgccgt gctacccgcc cggggcatgg ccaagctggc 748021 tttcgatgtg accccgccgg cctggctgga gcccgggcag tggtgggccc tggttcgggt 748081 cggttgcgcg ggtcagttgg tctattcgcc ggcggtgaag gtgagcgtga catgagcggg 748141 cgaagccgat tgcccggctc ctcctcacgc cgcgacgcgg cgcgcatcgt cgccgagcgg 748201 gtggtcgcga ccgtcgccgg tgtcgcggta gcggtcgacg aggtcgacgc ggccgaagcg 748261 cggctgcgcg acggaccgcg cgcggccgcg ctgccggcga gcggcaccag cgagggacgc 748321 caactgcggc gctggctcac ccaactgatc gtgaccgagc gggtggtagc cgccgaggcc 748381 gccgcacgtg gtctgaccgc ggcgggcgcc cccgccgagg cggacctgct gcccgacgcg 748441 acggctcggc tggagatcgg cagcgtcgcc gccgcggtgc tggcggatcc tttggcgcgg 748501 gcgttgttcg ccgccgtcac cgcgcgggtc gcggtcaccg acgacgccgt ggccgactac 748561 catgcccgca acccgctgcg gttcgccgcg ccatgtcccg gccagcacgg ctggcgtgcc 748621 ccggcggcgg ccgccccacc gctggatcag gtgcgccgcg cgatcaccga gcatctgttg 748681 ggggccgcgc gccgccgcgc cttccgggtg tggctggacg cgcgccggaa cgccctggtg 748741 gtgctggccc ccggctatga gcaccccggc gacccgcgcc aacccgacaa cacccgccgg 748801 cactgatgct caccctttgc ctcgacatcg gcggcaccaa gatcgccgcg ggcctggccg 748861 acccggccgg cacgttggtg cacaccgccc aacgtcccac cccggcgtat ggcggagccg 748921 aacaggtctg ggccgcggtc gccgagatga tcgccgacgc gctcggcgtg gcggggggcg 748981 cggtcggtgg tgtggggatc gcctcggccg gtcctatcga cctacacagc ggccgcgtca 749041 gcccgatcaa catcggatcc tggggcggct ttccgctgcg ggatcgggtc gccgccgcgg 749101 tcccgggggt tccggtgcgg ctggggggtg acggggtgtg catggcgctc ggcgagcact 749161 ggctgggagc cggacggggt gcgcgctttc tgttgggttt ggtggtgtcc accggggtgg 749221 gcggcgggtt ggtgctcgac ggcgccccct gtctcggccg caccggcaac gccggtcacg 749281 tcggccacgt ggtggtggat ccggatggct cgccgtgccc gtgcgggggg cgtggctgtg 749341 tggagaccat cgcgtccggc ccgtcgctgg cgcgctgggc gcgggccaac ggctggtccg 749401 cgccgcccgg ggccggcgcc aaagagctgg ccgaggcggc tggggccgga gacccggtgg 749461 cgctgcgggc cttccgccgc ggcgccgcgg cgctggccgc gatgatcgcc tcggtgggcg 749521 ccgtgtgcga cttggatctc gccgtcatcg gcggcggcgt ggccaagtcg ggtcgcctgc 749581 tgttcgagcc gttacgtgcg gcgctagccg accacgcccg gctggacttt ctggccggcc 749641 tgcgggtggt gcctgccgag ctgggcggcg ccgccggcct ggtgggtgcg gccaggctcg 749701 cggccatcgc ataatgccga ttgtgaatct ggcgacgcga cacgccggtg cggcgtcgcg 749761 ggattcacac tcggcgatac gtgtcgccgt tttggctgac cggaccgggc caggctattg 749821 tggttgccga tccaccgaag accgtcggtc accgagcaat cggttgaagg tccgggagca 749881 tcccggcgac ccacgcagga ggacgaggca gcaccgccgg cgcgcgccgg cctagttcca 749941 cgccccgacc gcttcctgcg tcggggcgtt cgtcgttccc gggtggtcgc agacggcacg 750001 tcgtaccccg actgccacca gacttgcacc gtcaggaggt atgcatggcc agggctgaca 750061 aggccaccgc cgtcgcagac atcgcagcgc agttcaagga gtcgaccgcg acgttgatca 750121 ccgaataccg cggcttgacg gtggccaacc tggccgagct acgcaggtct ctgacggggt 750181 cggcgaccta cgcggtggcc aaaaacacac tcatcaagcg ggcggcctcc gaggccggca 750241 tcgagggcct cgacgaactg tttgtgggcc ccaccgcgat cgcgttcgtc accggtgagc 750301 cggtcgacgc cgccaaggcc atcaagacct tcgccaagga gcacaaggcg ctggtcatca 750361 agggcggcta catggacggc cacccattga ccgtggccga agtcgagcgc atcgccgacc 750421 tggagtcccg cgaggtgtta ctggccaagc tggccggtgc gatgaagggc aacctggcca 750481 aggcggccgg gttgttcaac gcgccggcct cgcagctggc ccggctcgcg gccgccctgc 750541 aggaaaagaa ggcctgccca ggcccagact cagccgagta gtcacccagt accccacacc 750601 aggaaggacc gcccatcatg gcaaagctct ccaccgacga actgctggac gcgttcaagg 750661 aaatgaccct gttggagctc tccgacttcg tcaagaagtt cgaggagacc ttcgaggtca 750721 ccgccgccgc tccagtcgcc gtcgccgccg ccggtgccgc cccggccggt gccgccgtcg 750781 aggctgccga ggagcagtcc gagttcgacg tgatccttga ggccgccggc gacaagaaga 750841 tcggcgtcat caaggtggtc cgggagatcg tttccggcct gggcctcaag gaggccaagg 750901 acctggtcga cggcgcgccc aagccgctgc tggagaaggt cgccaaggag gccgccgacg 750961 aggccaaggc caagctggag gccgccggcg ccaccgtcac cgtcaagtag ctctgcccag 751021 cgtgttcttt tgcgtctgct cggcccgtag cgaacactgc gcccgctcgg gtgaatctcc 751081 cagcgcgaca agcaggttca ccgtcatcgc ggcgagcacc ggttcgacgg ccgcgcctcg 751141 atcgccgtag aagccggcca gctcgagcat cacgaagccg tggatctgtg accaaaactg 751201 cgccgcggtg gcaactattg ccgtgtcgtc gtcggctcca agcgcggtcg cgaaccggcc 751261 ggccagcagg caccggtgca ccgctcgcac cacatgcgcg aaactggggt gctggtgttc 751321 gatctcggca accttgaggg tcaacacgtc gcgcgctggc acgttgatgc cgtgtgcgct 751381 ggtgctgccg aacattagcc ggtacatgtg cgggcgctcg atggcgtagc gccggtaggc 751441 ggtgccgatg gccagcaggt cggcgaccgg atcggcggtc tgcgggaccg tcagcgcgac 751501 atcgaactgg cgtagccctt cttcggctat ggcggcgatc agtccgcgca tcccgccgaa 751561 atgggtgtac accgccatcg tcgaggtgcc tgctgcggcg gccaccttgc gggtctgcag 751621 cgcgtcgggc ccgtgatcgt cgagcagtcg cacgccggcg tgcagcagct cgtcgcgaac 751681 accggtctgc gaggtcatcc ttgccatgtt ctcaccaagg gcgtaccgtt ccaatatcag 751741 tgaaataaca atgttatagg agatcggcat gaccaccgca caagccgccg aatcccaaaa 751801 cccatatctc gagggcttcc tggcgccggt gagcaccgag gtaactgcca ccgacctgcc 751861 ggtcaccggc cgcattccgg aacacctcga cgggcgttat ctgcgtaacg gccccaaccc 751921 ggtcgcggag gtcgacccgg ccacctacca ctggttcacc ggcgacgcca tggtgcacgg 751981 agtcgcgctg cgcgacggga aggcccgctg gtatcgcaat cgctgggtcc gcacacccgc 752041 ggtgtgcgcc gccctgggcg agcccatttc ggcccggcct cacccgcgca ccgggattat 752101 cgagggcggt cccaacacca acgtgctgac ccacgccgga cgcaccctgg ccttggttga 752161 ggccggcgtg gtcaactacg aactcaccga tgagctggac accgtgggac cctgtgactt 752221 cgacggcacc ctgcacggcg gttacaccgc ccatccgcag cgtgatccgc acacgggtga 752281 actgcacgcg gtgtcctact cgttcgcccg cggacacaga gtgcagtact cggtgatcgg 752341 caccgacgga cacgctcgtc ggacggttga tatcgaggtg gcgggatcgc cgatgatgca 752401 cagcttctcc ctgaccgaca actacgtggt gatctacgac ctgccggtga ccttcgaccc 752461 aatgcaggtg gtgccggcgt ccgtgccacg ctggctgcaa cggcccgcca ggttggtgat 752521 ccagtcggtc ctgggccgtg tccgcatccc cgacccgata gcggcgttgg gcaaccggat 752581 gcagggtcac tccgatcgcc tcccgtacgc ctggaacccc agctacccgg cgcgcgtcgg 752641 tgtcatgccg cgcgagggtg gcaacgagga cgtgcggtgg ttcgacatcg aaccctgcta 752701 cgtataccac ccacttaacg cctactcgga gtgccggaac ggcgctgagg tgctggtgtt 752761 ggacgtggtg cgctactcac ggatgtttga tcgcgaccgg cggggtcccg gcggtgacag 752821 ccggccctcg ctggatcgct ggaccatcaa cctggcgacc ggtgcggtga ccgccgaatg 752881 ccgcgacgat cgggcgcagg agtttccccg catcaacgag actctggtgg gtgggccgca 752941 tcgcttcgcc tacaccgtcg gcatcgaggg tgggtttctc gtcggcgccg gcgctgcgtt 753001 gtcgactccg ctgtataaac aggactgcgt gaccgggtcc agcacggtcg cctcgctcga 753061 tcccgacctg ctgatcggcg agatggtgtt cgtgccgaac ccgtcggcgc gtgcagaaga 753121 tgacgggatt ctcatgggct acggctggca ccgcggccgc gacgaaggcc agctgctctt 753181 gctggatgcc cagactctcg agtcgatcgc caccgtgcac ctgccacagc gtgtgccgat 753241 gggcttccac ggcaactggg cgccgaccac ctgacggcgc ctcgggtgcg atacagtgac 753301 tcataccaca caacgggccg gtggcagcca cgagcgtcga cagaagggtt tcccatgggc 753361 gtcagcatcg aggtcaacgg actaacgaag tccttcgggt cctcgaggat ctgggaagat 753421 gtcacgctaa cgatccccgc cggggaggtc agcgtgctgc tgggcccatc gggtaccggc 753481 aaatcggtgt ttctgaaatc tctgatcggc ctcctgcggc cggagcgcgg ctcgatcatc 753541 atcgacggca ccgacatcat cgaatgctcg gccaaggagc tttacgagat ccgcacattg 753601 ttcggcgtgc tgtttcagga cggtgccctg ttcgggtcga tgaacctcta cgacaacacc 753661 gcgttccccc tgcgtgagca caccaagaaa aaggaaagcg agatccgtga catcgtcatg 753721 gagaagctgg ccctagtcgg cctgggtggg gacgagaaga agttccccgg cgagatctcc 753781 ggcgggatgc gtaagcgtgc cggcctagcg cgtgccctgg tccttgaccc gcagatcatt 753841 ctctgcgacg agcccgactc gggtctggac ccggttcgta ccgcctacct gagccagctg 753901 atcatggaca tcaacgccca gatcgacgcc accatcctga tcgtgacgca caacatcaac 753961 atcgcccgca ccgtgccgga caacatgggc atgttgttcc gcaagcattt ggtgatgttc 754021 gggccgcggg aggtgctact caccagcgac gagccggtgg tgcggcagtt cctcaacggc 754081 cggcgcatcg gcccgatcgg catgtccgag gagaaggacg aggccaccat ggccgaagag 754141 caggccctgc tcgatgccgg ccaccacgcg ggcggtgtcg aggaaatcga gggcgtgccg 754201 ccgcagatca gcgcgacacc gggcatgccg gagcgcaaag cggtcgcccg gcgtcaggct 754261 cgggttcgcg agatgttgca cacgctgccc aaaaaggccc aggcggcgat cctcgacgat 754321 ctcgagggca cgcacaagta cgcggtgcac gaaatcggcc agtaaggcgc gcggggatgc 754381 gaccgccgga ccgccgcaat cggatgattt cgcgtaactt gccgcatatc acccggagac 754441 cgaatcgggt cggccgctgg aggcggcgcc tgttcgggag ctgatcacgc aacgtttgta 754501 tctgctgccg accttccgtt ggcggctcgc gtaggtggca cagtccgcga agtgcttggg 754561 ccgctgatca aggcgctccc ggagcacaat ccagacatgt caggccgtca ccgacgcaca 754621 ggcgacggcc ctcgagcagc gtgggaagag ccgggctcgt cgagtggatc acacatttcg 754681 aggcgctctc gtgtatcgag cggcacatca gccatgcgtg tctccttgtc ctgccttctc 754741 cagaggaaac cgctagtcgt cggcgctgac gaccctccgc actctgatgt cgggaaggtg 754801 acgctctgcg agttcgtagt cggcatcgtc gtggaggact actaggcccc tggccgccgc 754861 agtgtcgcag atcagcagat cgacaaccga cagggcaccc accgctcccg cccgggcgag 754921 gcggtgctgt gccgaatcga tccaccgcca cacggatttc ggcactggca catcggggta 754981 gacgtcacca aacatccggc tcatctggtc gaactcgtcc gcattccgcg ctgatcggca 755041 gaactcggct cgttgcggtt cgcacgaccc gacggcccgc tgagcagcgc ggagttccag 755101 gcctcggtgg gttccggttg tcgttgcagc cgccaaaccg ctgaggaatc caccaggaaa 755161 tagatcaaat cccgagggcc ttctcgtcgt cccgcgcggc cacccagccc ttgtagtccc 755221 agcctttcgc ctactcgcgc gagcgggcca gggcctcgat gcgccgaaac cgttcgacgt 755281 aatcgcgcat cgcgaggttc acggcttcct tctttgtgtg cacggcggcg atgcgcatca 755341 catcggccag cgcttcgtcg tcgaggtcga tctgggtcac cgacacgacg gcctcctatg 755401 ttgaagacat atcacataaa catacgtaac caacatcgcg aggagaccgt ctcgcgcctg 755461 ctcagggcaa cgatatggcg ccagtcagac caagcagcaa tacgatcccg ggcaataggt 755521 tggtcacttg gtgcgtgacg atgctggcca gtagaccgcc ggaatagaac cgtgccagcg 755581 cgatcgggat ggccaccacc accagcagtg gagctcgggc gaactcgaga tgggccaatg 755641 cgaagaccac ggtggtaacc accagcgccg cccaccgacc ccagcgccga tccacagcac 755701 cccagagcag cccgcggtag atgatctctt cgcacagtgg cgcgacgaac accacgacca 755761 gaaagacgac cagcgcccac ggccaggacg cccgaacgcc accgaaaatc cttactacag 755821 cggaattcgc ttctggccca acgatagcgg tgtagaccag cgacgccgga atcgtgacca 755881 gcattccgcc gaaaccgaac atcaacccga gccgcagtcc gcgccacgac cagcgcagcc 755941 gcaagtcggt gcgggggccg ttgccgcgga gcctggtgat gaggatggcc agcccggcgg 756001 cgaccaccgt gggggcggct agcgcaaggg ccagcacccc ggcagacacc gggccgtgac 756061 cggtaaggac aaccgctaac gaagtcgagg cgaccaggaa taccagctcg acgaccaaga 756121 aggccccaag tccccagcgg tgactggggg ctacggtatc ggcacggccc gcttccacgg 756181 ctccgacggt atcgaagtgt caccgccacc ggcgctgacg tcgagccggc ggacggccgg 756241 ctgctacgcg cgcggtacct cgtcgggcgg atcgggttcg gtcgatcgac gcgaaatatg 756301 ttggcgactg gcaacttccg gtgcttgcca cggtctcaac tctcaccgcc gtgttgatcc 756361 ggaccgacgg atgtgtcgcc gaaccgacca tatcgtgggc ttgctgacgc gctcatcagt 756421 cggttcgggt ggccacgtgc aaccagcccc cgctcaacac cccgtgctcg cccggagtgt 756481 ttgacaggct tcgtgcaggc gggccgggga cagccgggtg atgcggcgtc ggaatgcggt 756541 gcgtggcaac gtatgaatgt tgtcgaagtt gacgacgcag tcgctcggaa cacggttttc 756601 gacggccgtg agctccaatt ccgacaccag gcctcggcgg gtgcgggtta gggccaccac 756661 aacgaccgcg ccgatgcggt ctgccaccgg atctctggta aggacaagta ctggtctgtc 756721 accaccaggt gtggcggcaa accacaattc accgcgccgc atcggcccag tcggcccagt 756781 cctccgccgg cccccagtcg gcgatctcag ccagtgcgtt ctcgtcgtcc gtcaacggtc 756841 gctcggtgta ggcctggaca tcctggtccg cggccagcgc ggccaagtga cgtcgcagcg 756901 catcgcgcag cagctcggag cggccgatgt gtaggcgacg cgcccacgcg tcggccaggt 756961 cgacgtcgtg gtcgtcggcg cggaagctga gcatcgtcat acatcgagtt tagagcgtat 757021 gacattgtcg gccggcgagc agacgcataa gcccccgcac gctcggcgtg tcgggggctt 757081 atgcgactgc tcgcccgggg ccgtcagcgg tcgccgagca ggctgaccat cccggccgcg 757141 tcgggaatga cgtgcacgac atccgacaga tcggcgaacg ccgggtcagc cgagacaaga 757201 gcggtcgcgc cggcgcttgc ggcaaccgcc gcgagcaccg cgtcgcaggc ttcaagccct 757261 ggcgttgtct cgaacagcgt caggccgcgc ttcgaggtgg cctcgattga tggtgagtag 757321 cggcgagagc agttcggcat agtcacacgg cccagcgcgg cggcgtcgct gcggtcgcgc 757381 cggcgggcgc gtacgtggac gaactcctgg atcacctcgg cggtggtggt cgcagcgatg 757441 cgttcgtcgg cgattgccgc gacgagatcg cggcagggat cgcggagtgg atgctcggcg 757501 cctttggcat agacgaggac ggtggtgtcg agcactatca tccgcggcgc gcccggaggg 757561 cctcgagttc ctgcttcagc tcccgcggct cgggaacgga catgtcggcg gcgtcgagca 757621 ggcgcctgcc cgcggacttg cggcgaccgg cggggctgac gaggcctcga tcaatggcct 757681 cacgcacgac ggttgcgacc gggacgcctc gctcgcgcgc caccgcggtg atgcggcggt 757741 ggcactcgtc gtcgagcagg atctggagcc gatgcgccag acgcatgctc atacatttag 757801 catgctgaaa tttgggcggc ggctgccatt gcggtcgcgt tgacccgcgg acggcccaga 757861 cgctgcggtt gtagcgtcga taggcacgcg tattagggag gaacaatgcc gcagccaaga 757921 acgcatctgc cgattcccag tgctgctcgc accgggctga tcacgtatga cgcgaaggat 757981 cccgacagca cctatccgcc gatcgagcag ctgcgcccac cggcgggtgc cccgaatgtg 758041 ttgctgatcc tgcttgacga tgtcgggttc ggtgcgtcga gcgcgttcgg aggcccatgc 758101 aggacgtcga cggcggaact gcttgccggt aacgggttgc ggtacaaccg gtttcacacc 758161 accgcgctgt gctcgccgac gcgtcaggcg ttgttaactg gacgcaacca tcactccgcc 758221 ggcatgggcg gtatcaccga aatcgccacc ggtgcaccgg gatacagctc agtactaccg 758281 aacaccatgt cgccgatcgc gcggacgcta aagctcaacg gctacaacac cgcccagttc 758341 ggcaagtgcc acgaagtccc ggtctggcag accagcccgg tcgggccgtt cgacgcgtgg 758401 cccagcggcg gcggtggttt cgaatacttc tacgggttta tcggtggcga ggctaaccag 758461 tggtatccga gtctgtacga gggcaccacg ccggtcgagg tgaaccgcac gcccgaggag 758521 ggttaccatt tcatggcgga catgaccgac aaggccctcg gctggatcgg acagcagaag 758581 gcactggccc ccgaccggcc gttcttcgtg tacttcgccc cgggcgccac ccacgcgccc 758641 caccacgttc cgcgggagtg ggccgacaag taccggggcc gcttcgatgt gggctgggac 758701 gcactgcgag aggaaacctt cgcccggcaa aaggaactcg gggtgatccc ggcggactgc 758761 cagctgaccg cgcggcacgc cgaaatcccg gcgtgggacg acatgccgga ggacctcaaa 758821 cccgtgctat gccggcagat ggaggtctac gcgggctttc tggaatacac cgaccaccac 758881 gtcggccggc tcgtcgacgg cctgcagcgc ctcggtgtgc tcgacgacac gctggtgttc 758941 tacatcatcg gcgacaacgg cgcctcggcc gagggcacga tcaacggcac ctacaacgag 759001 atgttgaact tcaacggcct ggccgacatc gagacgccgc ggttcatgac cgaccggctc 759061 gacaagttcg gcgggccgga gtcctacaac cactattcgg tgggttgggc gcatgcgatg 759121 gataccccct atcagtggac caaacaagtg gcctcgcact ggggtggcac gcgtaacggc 759181 acgattgtgc actggcccaa cggaattgcc gccaaggggg agatgcgctg gcagtttcac 759241 cacgtcatcg acgtggcgcc gaccatcctg gaggcggcgg ggttgccgga accgttattc 759301 gtcaacggcg tgcagcaaca ccccatcgaa ggggtcagca tggcctattc gttcgacgac 759361 gcgcaggcgc cggatcggca cgagacgcag tatttcgaga tgttcggaaa ccggggcatc 759421 taccacaagg gttggaccgc ggtgaccaag cacaagacgc cgtggatttt ggttggcgag 759481 cagaccgtcg cgttcgacga cgacgtgtgg gagctctacg acaccaccaa ggattggagc 759541 caggccaaag acttggccaa ggagatgccg gaaaagctgc atgagctgca gcggctgtgg 759601 ctgatcgagg cgacgcgcta caacgtgctt ccgctggacg acgacaccgc cagccgcatc 759661 aaccccgatc tggcgggcag gccggtgctc atcaggggca acacccaggt gctgttttcg 759721 aacatgggcc ggttgtcgga gaactgtgtg ctcaacctca agaacaaatc gcacacggtg 759781 accgctgagg tcgaggtgcc cgagaccggt gctgagggcg tgatcgtcgc gcagggcgcc 759841 agcatcggcg gctggagcct gtatgccaac gacggcaagc tcaagtactg ctacaacctg 759901 ggtggtatca agcacttcta cgccgagtcc gccgacccgc tgccggccgg cgcccatcag 759961 gtgcgcatgg aattcgctta tgccggtggc ggtttgggca agggcggcga ggtaactctt 760021 tatgtcgacg gccaacaggt cggcgaagga catgtcgaag ccacccttgc catcgtcttc 760081 tcggccgacg acggctgcga tgtcggcatg gattcgggct cgcccgtctc acccgactat 760141 gccccgggga gtaacgcgtt caacgggcgg atcaagggcg tgcagctcgc gatcgccgag 760201 gccgccgctg ctgcgggcca tctggtcgac ccggagcacg cgatccgcat cgcgctggcg 760261 cgccaatagg gccgcacagt caaacgggga ggggacggcg atggaaaagt cacggtgcca 760321 cgctgtcgca catggaggtg ggtgtgcggg atctgcgaaa tcgcacaagt caggtggtcg 760381 atgcggtcaa ggccggggtg ccggtgactc tcacggtaca cggggagccg gtcgccgata 760441 tcgtgccgca tcggcgccgc atccgctggc tgtcggggcg catctgcgcg atgagctcgc 760501 caagcgctcg gccgacccgc gcctcaccga tgaactcaac gacttggccg gtcataccct 760561 cgacgacctg tgaccgaggg cgaggtcggg gtaggcctgc tagatacgtc ggtcttcatt 760621 gcgcgcgaga gcggcggtgc aatcgcggac ctgcctgaac gcgtggcgct ttcggttatg 760681 acgatcggtg agctgcaact cggtctgctc aatgctggcg attcggcgac ccgatcacga 760741 cgcgccgaca ccctcgcgct agcgcgcacg gccgatcaga tccctgtcag tgaagcggtg 760801 atgatttcgt tggctcgact cgtcgcggac tgccgagccg cgggcgtgcg gcggtcggtg 760861 aagctgaccg acgctctcat tgcggcaacc gcggagatca aggtgtgaca ccgaggactg 760921 atgaaggtgc cgctgcaccc tgcctgatgc ctgacgtcac gatgcccgtg aagcgtggtg 760981 atgcccgggg agctttgggt gtgggtccag ctttgttcgt ggtgagcgtg agcagctcgc 761041 tggtgagggc caggagctgt cgttgcacgg cggattgatc gattcgaccg catccatctg 761101 gagctactgc cccagaccgg actcgcagcc ttggcaagcc gctacgcggg cattctcacc 761161 tgaggcaacg aagggcgcta tgcgcgcatt gtgggtgagt caacgcgagg acttgacggc 761221 agacgctaaa cgggtcaatc tgttgggcag catgcgccgc atgtggccaa aggaagtcga 761281 gatcgccagc tagcgccgat atccggggat ggttattgcc gggtatttga ggaatgcgcc 761341 gtcctgcgct attgttggac gttgcgctgg ctacttcctg cccacctcac ccgccacttg 761401 acaccgtggt cttagtctga gcccagtttg cggctcagcg gtttagttgc gtgcgtgaga 761461 tccggacaga tcgttcgccg gccgaaaccg acaaaattat cgcggcgaac gggcccgtgg 761521 gcaccgctcc tctaagggct ctcgttggtc gcatgaagtg ctggaaggat gcatcttggc 761581 agattcccgc cagagcaaaa cagccgctag tcctagtccg agtcgcccgc aaagttcctc 761641 gaataactcc gtacccggag cgccaaaccg ggtctccttc gctaagctgc gcgaaccact 761701 tgaggttccg ggactccttg acgtccagac cgattcgttc gagtggctga tcggttcgcc 761761 gcgctggcgc gaatccgccg ccgagcgggg tgatgtcaac ccagtgggtg gcctggaaga 761821 ggtgctctac gagctgtctc cgatcgagga cttctccggg tcgatgtcgt tgtcgttctc 761881 tgaccctcgt ttcgacgatg tcaaggcacc cgtcgacgag tgcaaagaca aggacatgac 761941 gtacgcggct ccactgttcg tcaccgccga gttcatcaac aacaacaccg gtgagatcaa 762001 gagtcagacg gtgttcatgg gtgacttccc gatgatgacc gagaagggca cgttcatcat 762061 caacgggacc gagcgtgtgg tggtcagcca gctggtgcgg tcgcccgggg tgtacttcga 762121 cgagaccatt gacaagtcca ccgacaagac gctgcacagc gtcaaggtga tcccgagccg 762181 cggcgcgtgg ctcgagtttg acgtcgacaa gcgcgacacc gtcggcgtgc gcatcgaccg 762241 caaacgccgg caaccggtca ccgtgctgct caaggcgctg ggctggacca gcgagcagat 762301 tgtcgagcgg ttcgggttct ccgagatcat gcgatcgacg ctggagaagg acaacaccgt 762361 cggcaccgac gaggcgctgt tggacatcta ccgcaagctg cgtccgggcg agcccccgac 762421 caaagagtca gcgcagacgc tgttggaaaa cttgttcttc aaggagaagc gctacgacct 762481 ggcccgcgtc ggtcgctata aggtcaacaa gaagctcggg ctgcatgtcg gcgagcccat 762541 cacgtcgtcg acgctgaccg aagaagacgt cgtggccacc atcgaatatc tggtccgctt 762601 gcacgagggt cagaccacga tgaccgttcc gggcggcgtc gaggtgccgg tggaaaccga 762661 cgacatcgac cacttcggca accgccgcct gcgtacggtc ggcgagctga tccaaaacca 762721 gatccgggtc ggcatgtcgc ggatggagcg ggtggtccgg gagcggatga ccacccagga 762781 cgtggaggcg atcacaccgc agacgttgat caacatccgg ccggtggtcg ccgcgatcaa 762841 ggagttcttc ggcaccagcc agctgagcca attcatggac cagaacaacc cgctgtcggg 762901 gttgacccac aagcgccgac tgtcggcgct ggggcccggc ggtctgtcac gtgagcgtgc 762961 cgggctggag gtccgcgacg tgcacccgtc gcactacggc cggatgtgcc cgatcgaaac 763021 ccctgagggg cccaacatcg gtctgatcgg ctcgctgtcg gtgtacgcgc gggtcaaccc 763081 gttcgggttc atcgaaacgc cgtaccgcaa ggtggtcgac ggcgtggtta gcgacgagat 763141 cgtgtacctg accgccgacg aggaggaccg ccacgtggtg gcacaggcca attcgccgat 763201 cgatgcggac ggtcgcttcg tcgagccgcg cgtgctggtc cgccgcaagg cgggcgaggt 763261 ggagtacgtg ccctcgtctg aggtggacta catggacgtc tcgccccgcc agatggtgtc 763321 ggtggccacc gcgatgattc ccttcctgga gcacgacgac gccaaccgtg ccctcatggg 763381 ggcaaacatg cagcgccagg cggtgccgct ggtccgtagc gaggccccgc tggtgggcac 763441 cgggatggag ctgcgcgcgg cgatcgacgc cggcgacgtc gtcgtcgccg aagaaagcgg 763501 cgtcatcgag gaggtgtcgg ccgactacat cactgtgatg cacgacaacg gcacccggcg 763561 tacctaccgg atgcgcaagt ttgcccggtc caaccacggc acttgcgcca accagtgccc 763621 catcgtggac gcgggcgacc gagtcgaggc cggtcaggtg atcgccgacg gtccctgtac 763681 tgacgacggc gagatggcgc tgggcaagaa cctgctggtg gccatcatgc cgtgggaggg 763741 ccacaactac gaggacgcga tcatcctgtc caaccgcctg gtcgaagagg acgtgctcac 763801 ctcgatccac atcgaggagc atgagatcga tgctcgcgac accaagctgg gtgcggagga 763861 gatcacccgc gacatcccga acatctccga cgaggtgctc gccgacctgg atgagcgggg 763921 catcgtgcgc atcggtgccg aggttcgcga cggggacatc ctggtcggca aggtcacccc 763981 gaagggtgag accgagctga cgccggagga gcggctgctg cgtgccatct tcggtgagaa 764041 ggcccgcgag gtgcgcgaca cttcgctgaa ggtgccgcac ggcgaatccg gcaaggtgat 764101 cggcattcgg gtgttttccc gcgaggacga ggacgagttg ccggccggtg tcaacgagct 764161 ggtgcgtgtg tatgtggctc agaaacgcaa gatctccgac ggtgacaagc tggccggccg 764221 gcacggcaac aagggcgtga tcggcaagat cctgccggtt gaggacatgc cgttccttgc 764281 cgacggcacc ccggtggaca ttattttgaa cacccacggc gtgccgcgac ggatgaacat 764341 cggccagatt ttggagaccc acctgggttg gtgtgcccac agcggctgga aggtcgacgc 764401 cgccaagggg gttccggact gggccgccag gctgcccgac gaactgctcg aggcgcagcc 764461 gaacgccatt gtgtcgacgc cggtgttcga cggcgcccag gaggccgagc tgcagggcct 764521 gttgtcgtgc acgctgccca accgcgacgg tgacgtgctg gtcgacgccg acggcaaggc 764581 catgctcttc gacgggcgca gcggcgagcc gttcccgtac ccggtcacgg ttggctacat 764641 gtacatcatg aagctgcacc acctggtgga cgacaagatc cacgcccgct ccaccgggcc 764701 gtactcgatg atcacccagc agccgctggg cggtaaggcg cagttcggtg gccagcggtt 764761 cggggagatg gagtgctggg ccatgcaggc ctacggtgcc gcctacaccc tgcaggagct 764821 gttgaccatc aagtccgatg acaccgtcgg ccgcgtcaag gtgtacgagg cgatcgtcaa 764881 gggtgagaac atcccggagc cgggcatccc cgagtcgttc aaggtgctgc tcaaagaact 764941 gcagtcgctg tgcctcaacg tcgaggtgct atcgagtgac ggtgcggcga tcgaactgcg 765001 cgaaggtgag gacgaggacc tggagcgggc cgcggccaac ctgggaatca atctgtcccg 765061 caacgaatcc gcaagtgtcg aggatcttgc gtaaagctgt cgcaaaatta ctaaacccgt 765121 taggggaaag ggagttacgt gctcgacgtc aacttcttcg atgaactccg catcggtctt 765181 gctaccgcgg aggacatcag gcaatggtcc tatggcgagg tcaaaaagcc ggagacgatc 765241 aactaccgca cgcttaagcc ggagaaggac ggcctgttct gcgagaagat cttcgggccg 765301 actcgcgact gggaatgcta ctgcggcaag tacaagcggg tgcgcttcaa gggcatcatc 765361 tgcgagcgct gcggcgtcga ggtgacccgc gccaaggtgc gtcgtgagcg gatgggccac 765421 atcgagcttg ccgcgcccgt cacccacatc tggtacttca agggtgtgcc ctcgcggctg 765481 gggtatctgc tggacctggc cccgaaggac ctggagaaga tcatctactt cgctgcctac 765541 gtgatcacct cggtcgacga ggagatgcgc cacaatgagc tctccacgct cgaggccgaa 765601 atggcggtgg agcgcaaggc cgtcgaagac cagcgcgacg gcgaactaga ggcccgggcg 765661 caaaagctgg aggccgacct ggccgagctg gaggccgagg gcgccaaggc cgatgcgcgg 765721 cgcaaggttc gcgacggcgg cgagcgcgag atgcgccaga tccgtgaccg cgcgcagcgt 765781 gagctggacc ggttggagga catctggagc actttcacca agctggcgcc caagcagctg 765841 atcgtcgacg aaaacctcta ccgcgaactc gtcgaccgct acggcgagta cttcaccggt 765901 gccatgggcg cggagtcgat ccagaagctg atcgagaact tcgacatcga cgccgaagcc 765961 gagtcgctgc gggatgtcat ccgaaacggc aaggggcaga agaagcttcg cgccctcaag 766021 cggctgaagg tggttgcggc gttccaacag tcgggcaact cgccgatggg catggtgctc 766081 gacgccgtcc cggtgatccc gccggagctg cgcccgatgg tgcagctcga cggcggccgg 766141 ttcgccacgt ccgacttgaa cgacctgtac cgcagggtga tcaaccgcaa caaccggctg 766201 aaaaggctga tcgatctggg tgcgccggaa atcatcgtca acaacgagaa gcggatgctg 766261 caggaatccg tggacgcgct gttcgacaat ggccgccgcg gccggcccgt caccgggccg 766321 ggcaaccgtc cgctcaagtc gctttccgat ctgctcaagg gcaagcaggg ccggttccgg 766381 cagaacctgc tcggcaagcg tgtcgactac tcgggccggt cggtcatcgt ggtcggcccg 766441 cagctcaagc tgcaccagtg cggtctgccc aagctgatgg cgctggagct gttcaagccg 766501 ttcgtgatga agcggctggt ggacctcaac catgcgcaga acatcaagag cgccaagcgc 766561 atggtggagc gccagcgccc ccaagtgtgg gatgtgctcg aagaggtcat cgccgagcac 766621 ccggtgttgc tgaaccgcgc acccaccctg caccggttgg gtatccaggc cttcgagcca 766681 atgctggtgg aaggcaaggc cattcagctg cacccgttgg tgtgtgaggc gttcaatgcc 766741 gacttcgacg gtgaccagat ggccgtgcac ctgcctttga gcgccgaagc gcaggccgag 766801 gctcgcattt tgatgttgtc ctccaacaac atcctgtcgc cggcatctgg gcgtccgttg 766861 gccatgccgc ggctggacat ggtgaccggg ctgtactacc tgaccaccga ggtccccggg 766921 gacaccggcg aataccagcc ggccagcggg gatcacccgg agactggtgt ctactcttcg 766981 ccggccgaag cgatcatggc ggccgaccgc ggtgtcttga gcgtgcgggc caagatcaag 767041 gtgcggctga cccagctgcg gccgccggtc gagatcgagg ccgagctatt cggccacagc 767101 ggctggcagc cgggcgatgc gtggatggcc gagaccacgc tgggccgggt gatgttcaac 767161 gagctgctgc cgctgggtta tccgttcgtc aacaagcaga tgcacaagaa ggtgcaggcc 767221 gccatcatca acgacctggc cgagcgttac ccgatgatcg tggtcgccca gaccgtcgac 767281 aagctcaagg acgccggctt ctactgggcc acccgcagcg gcgtgacggt gtcgatggcc 767341 gacgtgctgg tgccgccgcg caagaaggag atcctcgacc actacgagga gcgcgcggac 767401 aaggtcgaaa agcagttcca gcgtggcgct ttgaaccacg acgagcgcaa cgaggcgctg 767461 gtggagattt ggaaggaagc caccgacgag gtcggtcagg cgttgcggga gcactacccc 767521 gacgacaacc cgatcatcac catcgtcgac tccggcgcca ccggcaactt cacccagact 767581 cgaacgctgg ccggtatgaa gggcctggtg accaacccga agggtgagtt catcccgcgt 767641 ccggtcaagt cctccttccg tgagggcctg accgtgctgg agtacttcat caacacccac 767701 ggcgctcgaa agggcttggc ggacaccgcg ttgcgcaccg ccgactccgg ctacctgacc 767761 cgacgtctgg tggacgtgtc ccaggacgtg atcgtgcgcg agcacgactg ccagaccgag 767821 cgcggcatcg tcgtcgagct ggccgagcgt gcacccgacg gcacgctgat ccgcgacccg 767881 tacatcgaaa cctcggccta cgcgcggacc ctgggcaccg acgcggtcga cgaggccggc 767941 aacgtcatcg tcgagcgtgg tcaagacctg ggcgatccgg agattgacgc tctgttggct 768001 gctggtatta cccaggtcaa ggtgcgttcg gtgctgacgt gtgccaccag caccggcgtg 768061 tgcgcgacct gctacgggcg ttccatggcc accggcaagc tggtcgacat cggtgaagcc 768121 gtcggcatcg tggccgccca gtccatcggc gaacccggca cccagctgac catgcgcacc 768181 ttccaccagg gtggcgtcgg tgaggacatc accggtggtc tgccccgggt gcaggagctg 768241 ttcgaggccc gggtaccgcg tggcaaggcg ccgatcgccg acgtcaccgg ccgggttcgg 768301 ctcgaggacg gcgagcggtt ctacaagatc accatcgttc ctgacgacgg cggtgaggaa 768361 gtggtctacg acaagatctc caagcggcag cggctgcggg tgttcaagca cgaagacggt 768421 tccgaacggg tgctctccga tggcgaccac gtcgaggtgg gccagcagct gatggaaggc 768481 tcggccgacc cgcatgaggt gctgcgggtg cagggccccc gcgaggtgca gatacacctg 768541 gttcgcgagg tccaggaggt ctaccgcgcc caaggtgtgt cgatccacga caagcacatc 768601 gaggtgatcg ttcgccagat gctgcgccgg gtgaccatca tcgactcggg ctcgacggag 768661 tttttgcctg gctcgctgat cgaccgcgcg gagttcgagg cagagaaccg ccgagtggtg 768721 gccgagggcg gtgagcccgc ggccggccgt ccggtgctga tgggcatcac gaaggcgtcg 768781 ctggccaccg actcgtggct gtcggcggcg tcgttccagg agaccactcg cgtgctgacc 768841 gatgcggcga tcaactgccg cagcgataag ctcaacggtc tgaaggaaaa cgtgatcatc 768901 ggcaagctga tcccggccgg taccggtatc aaccgctacc gcaacatcgc ggtgcagccc 768961 accgaggagg cccgcgctgc ggcgtacacc atcccgtcgt atgaggatca gtactacagc 769021 ccggacttcg gtgcggccac cggtgctgcc gtcccgctgg acgactacgg ctacagcgac 769081 taccgctagg tgggcgagca gacgcagaat cgcacgcgaa atgcctgcgc gatgcgattc 769141 tgcgtctgct cgccgtggtg gatgagccgg tcttgcatcg ccaatgcggg aaacccatgc 769201 atgcgttggg gcacgacgcc ggcctggccg ccagattggc gctgcccgcc gccccgttca 769261 acatgagtgg caatcccgcc atctgcctgc ctgcggggga cacgtcgtga ggaaccccgg 769321 tcggggttta gtttatcggc cgtgaattcg ccgaacggtt gctcgtccaa gccggccacg 769381 cattccagca ggccactgcg ttccatcgcc gacgcccagg catggcctgg gttggtgatt 769441 gcggccgccg agtcaaacaa ccgtgaactc gcgcgtcgtc gcgctgaacg ctgtaagcat 769501 gccgttgcga tcgcgtgcgg tgccgtggtg gacgatgcgg tattgcccgg gcgtggtatc 769561 gccgggaaca tcccagcgaa tgctgacatg cgatccggcc cgcccttggc gctgccagcg 769621 aaagctcgtg gcccagtcgc cgtcgtcagc aatccgcacc cagctggcac cttcccggcg 769681 gaccacttcg aggtaggtgc cgccgcggcg cagatcgtta ttgggcagcg cgctgacgaa 769741 aacggcttcc accgcctgac ccggtcggta cgtcgccgag ggctcggcga tgaccgctcc 769801 gaacgacccg gcatcggcgg gcgcgccgcg cacccagctc agctcccggg tgggccgcgg 769861 ccggcgaccg agcgtcaccg gacggccgtc gcgcatggcc tcggcgagtt cggccacggt 769921 ctgcatgagg gcgcacagtt cccatcgacc gaacaacgtg ctgccgccct cgtagcgctg 769981 ttcgagatac tcttcgggcg ttgtcacgta atggatgtag gcgttggtgt agcccacgca 770041 gagcacgtcg gccaggtcgg cgccaacaat cgaagccacc atgcggcgca gcctaagccc 770101 cgcgacgatg gtcggttcgc ccggaatacc gatcagatag aggcgaccga ttcgcacgag 770161 ctgaacggga acaatttcct ggacaaaggg gtgtatccgg ttcggcaggc gtgcgggcat 770221 cacaatgcct ttgggggcct gtgccgctgc cgtcggcctt gccagccggt acatggcgcg 770281 ggatagtctg tcccagaacg ggtttcgccc ttggcgaaag ccatggaagc ccgggccctc 770341 gtcggtgcct gccatggccc cggcgccaaa catcggacgc ccggtgcggc gctcttcacc 770401 gtctggtgtg tactcgccgc gcacgagcac agaaccgaga tcgacatagg tgaaccgggc 770461 atcaatgcca gcgccgatgg gcgtcgctcc gctcaactgc gtgaaagcat cctcgaactg 770521 gcacaacccg gtacgacggg tgttgtcgaa ttcccggtct ggtggggcct cgggagaaag 770581 gggcccgtcg acattcgggc tcatgtcgcc cggattcgtc tgtgcgaagg cggcgatgaa 770641 gtcgggctgg ccggcgagat aatccgcgcc gcccacggtg cgttcccagt gataggccgc 770701 gaaacccttg ttgtctccgg agatgaggtg gttgcgattc gtcatgctcg taccgtgggt 770761 agcgaagaaa tggatcacgc ccacggtggc ctcgccccgg tcgatacgca cgagcgtggt 770821 atgcgggtcg acgcgtttcg ggaagaacgc cttgtcggcc ggcgggttgc ggtcgaacgc 770881 tgatggggat cgattgatgc ttgcgccgta cagctcgccg tgcgagagcg gaacctcggc 770941 gggcgccaca tcggcatgcg catgttccac cgattcgaca attccgtcga cgatcgccgc 771001 aaaggttgcc ggccgaaagc cgctcgtggt caggttgtac agcaggtatc cgcagtaccc 771061 gccaggcccg gcgtgggtgt gggtcgccgt gatcagtgtg ttctgctccg agtaggtatc 771121 gccatacaaa tcggccaacc ggcgcagcac ttcctcattc acgttttgca tgggcagcgg 771181 cagttcggcg acaatcagca gcaaccgcgc gtccccgtcc tgggaatcgt cccggaacac 771241 aaacgcccgt gacctaagtc gctggtgaat gccggcggtg cgctggtcgg acttgccgta 771301 gccgagcatg ccgcagtccg ccgcctcacc agtgatgtcg gcgatgccgc gccctacact 771361 aagcattgcc taatcctccg caccagcagc aaatttcacg agcgctgact acgcgctgct 771421 ccggggaaac gtatcccaca aggagaacca ctttatgcgc cggggcccac gaatacggac 771481 ggcagcatcc cgtgccgcgg ctggccggac gtgatccgag ggtgtgggtc tcaccagatc 771541 ggtctcacta gacttgggtt gtgctcattg gttcgcatgt cagcccaacc gatccgctgg 771601 ccgcagcgga ggccgaaggc gctgacgtag tgcagatttt ccttggcaat ccgcagagct 771661 ggaaggctcc caagccgcgg gacgacgccg ccgcgctgaa agccgcgacc ctgcccatct 771721 acgtgcatgc gccctacctg atcaaccttg cgtcggcgaa caatcgcgtg cggatcccgt 771781 cgcgcaagat cctgcaagag acctgtgctg cggcggccga cattggcgca gcggcggtga 771841 tcgtgcacgg tgggcacgtc gccgacgaca acgacatcga caagggcttc cagcgctggc 771901 gcaaggcgct ggaccggctg gaaaccgagg ttcccgtcta cctggaaaac accgccggcg 771961 gcgatcacgc gatggcgcgc cgcttcgaca ccatcgcccg gctctgggac gtcatcggcg 772021 acaccggaat cgggttttgc ctggacacct gccacacctg ggcggccggc gaggcgctga 772081 ccgatgccgt cgatcggatc aaagcaatta ccggccgcat cgatctggtg cactgcaacg 772141 actccaggga cgaagcagga tcgggccgtg accgccacgc caacctcggc agcggccaga 772201 ttgatcctga cctgctggtg gctgccgtca aggcggccgg cgcgccggtg atctgcgaaa 772261 ccgccgacca aggtcgcaag gacgacatcg cgtttctgcg ggaaagaacc ggcagctgac 772321 ttcaagcccc gcggcaccta ccgttgactt atgctccgca gggtcgccat actgctcgcc 772381 gctgtgcttg cgttcgcggg ctgctcgggg ggaacgaggt tggcggcggg cttcggcaat 772441 ggcaatagcg tgcacaccct cgatgtcgat ggagccggcc gcagctaccg gctttataag 772501 cccgtcgggt tgccgtcctc ggcgccgctg gtcgtcatgt tgcacggcgg gttcggcagc 772561 gccaagcaag ccgaaaggtc ttatggctgg gacgaattgg ccgactccga gaagttcctc 772621 gtcgcctacc ccgatggcta tcacagggct tggaatgcca atggcggagg ctgctgcggc 772681 cggcccgcac gtgaaggcgt cgacgacatc ggcttcgtcc gcgcggtcgt cgccgacatc 772741 gccaacaatg tcagcatcga ccccgcccgg gtctacgtca cgggcatgag caacggtgcc 772801 atcatgtcct acacgctggc ctgcaacacc agcatcttcg cggcgatcgg cgtcgtttcg 772861 ggcacgcaac tagacccctg tcagtccccg cgtccggtgt cggtcatcca catccatggc 772921 acggccgatc cgctggtccg ctaccacggc gggcccggcg ccgggttcgc gcgcatcgac 772981 ggtccgccgg tgcccgatct caatgcgttc tggcgcgagg tcaaccggtg cggcgcgctg 773041 gataccacga ccgaaggtcc ggtcaccaca tcgggcgcca catgcgccga caatcgccgt 773101 gtcgtgctgc tcaccgtcga tgacgccggc caccgatggc cgtcatttgc cacccagaca 773161 ctgtggcgat tctttgcagc gcacttcaga tgaggacaaa accatccgtt acattctctt 773221 gtgcagttgt agaaaaaacg taacatggtg gcatgtcaga tacgcatgtc gtcaccaacc 773281 aggttccgcc cttggagaac tacaatcccg cgtcatcccc ggtgctcatc gaggctctga 773341 tccaggaggg tggccagtgg ggcctggatg aagtaaacga ggtcggggca atttctgcca 773401 gctgccaagc ccaacgctgg ggagagcttg cagaccgcaa ccggcccatc ctgcataccc 773461 acgacgctta cgggtaccgg gtcgatgagg tggagtacga cccggcctac cacgagctga 773521 tgcgtaccgc gatcacccat ggcatgcacg ccgcaccgtg ggctgacgac cgcccgggtg 773581 cgcacgtggt gcgagcggcc aagacatcgg tgtggaccgt cgagccgggc catatctgcc 773641 ccatctcgat gacctacgcc gtcgttccgg cgctgcggta taactccgag ctggctgcgg 773701 tctacgagcc gctgctgacc agtcgtgagt acgacccgga gctgaagccg gcgaccacga 773761 aggccggcat caccgccggc atgtcgatga ccgagaagca gggtggctcc gacgtgcgcg 773821 ctggcaccac ccaggcgacc ccgaatgcgg acggcagcta cagcttgacc ggccacaagt 773881 ggttcacttc ggcgccgatg tgcgacatct tcctggtgct cgcgcaggca ccggacgggc 773941 tgtcgtgctt cctgctgccg cgggtgctgc ccgacggcac ccgcaaccga atgttcttgc 774001 agcggctcaa ggacaagctc ggcaaccacg caaacgcctc gagcgaggtc gaatacgacg 774061 gtgccgtcgc gtggctggtg ggcgaggagg gccgcggcgt gccgaccatc atcgagatgg 774121 tcaacctcac ccggctggac tgcgctctgg gcagtgccac cagcatgcgc accggcctaa 774181 cccgcgccgt ccaccatgcc cagcatcgga aggcgttcgg cgcctacctg atcgaccagc 774241 cgttgatgcg caacgtgctg gccgacctgg cggtggaggc cgaggccgcc accatcgtgg 774301 caatgcggat ggccggtgcc accgacaacg cggtgcgcgg gaacgagacc gaagcgctgc 774361 tgcgtcgcat cggcctggcg gccgccaagt actgggtgtg caagcgctcc accgctcacg 774421 ccgccgaagc gctggagtgc ctgggcggca acggttatgt cgaggattcc gggatgcccc 774481 ggctctaccg ggaggcgccg ttgatgggca tctgggaggg ctcgggcaat gtcagcgcgc 774541 tagatacctt gcgcgccatg gcaacccggc ccgcatgcgt cgaggtgctg tttgacgagc 774601 tggcccgcag cgcaggccag gaccccaggc tggacggcca cgtcgaaagg ctgcgtccgc 774661 agctaggcga tcttgacacg atcggttatc gagcccgcaa gattgccgaa gacatctgcc 774721 tggcgttgca gggatcgttg ttggtgcgcc acggacatcc cgccgtcgcc gaggcgtttc 774781 tggccactcg gctcggcggc cagtggggcg gagcgtacgg caccatgccg gccggtctgg 774841 atctcgcgcc catcctcgag cgtgcgctgg taaaaggctg agcggccgct gatgacacac 774901 gcgatcaggc cggtcgattt cgacaacctg aagacgatga cctatgaggt caccggtcgg 774961 attgcgcgga tcaccttcaa ccggccggag aagggcaacg cgatcatcgc agacaccccg 775021 ctggagttgt ctgctctggt ggagcgtgcc gatctggatc caggcgtgca tgtcattctg 775081 gtgtccggtc gcggcgaggg attctgtgcc ggcttcgacc tgtccgccta cgccgagggg 775141 tcgtcgtcga ccgggggcgg cggcgcatac caaggcacgg tgctagatgg caagacccag 775201 gccgtcaacc acctaccgaa ccagccgtgg gacccgatga tcgactacca gatgatgagc 775261 cggttcgtgc gcggattcgc cagtctgatg catgccgaca agccgacggt ggtcaagatc 775321 cacggctact gcgtggccgg cggcaccgac atcgcgctgc acgccgatca ggtgatcgcc 775381 gccgccgacg ccaagatcgg ctacccgccc acccgggtgt ggggggtgcc ggcggcgggc 775441 ctgtgggcgc accggctcgg cgaccagcgg gccaaacggc tgctgttcac cggcgattgc 775501 atcaccggcg cgcaggccgc cgagtggggc ctggcggtcg aggcgccgga gccggctgac 775561 ctcgacgagc ggaccgagcg actggtggcc cggatcgccg cactgccggt caatcaattg 775621 atcatggtca agctcgcgct caattccgct ctgctgcaac agggtgtggc caccagcagg 775681 atggtcagca ccgtgttcga cggcgccgct cggcacacac ccgaggggca cgcgtttgtc 775741 gccgacgcgg tcgagcacgg cttccgggat gcggtgcggc gccgtgacga gccgtttggc 775801 gactacggcc gtcaagcatc gcgggtgtaa ccatgccggc catgaccgcc cgttcggtgg 775861 tactcagcgt gctgctcggt gctcatcccg cgtgggccac cgcaagcgaa ttgatccagc 775921 tgacagcgga tttcggtatc aaggagacga cgttgcgggt cgcgctgacc cgcatggtcg 775981 gtgccgggga tctggtccgg tccgcggacg gctaccggct ctcggatcgg ttgctggccc 776041 gccagcgccg acaagatgag gccatgcgcc cacggacccg cgcttggcac ggaaactggc 776101 acatgctgat tgtcaccagc atcggcaccg atgctcgtac ccgggccgca ctgcgaacct 776161 gcatgcacca caagcgtttc ggtgaattgc gggaaggggt gtggatgcgg ccggacaatc 776221 tcgacctcga cttggagtcc gacgttgcgg cccgggttag gatgctgacg gcccgcgacg 776281 aggcccccgc cgacttggcc gggcagctgt gggatctgtc ggggtggacc gaggccggcc 776341 accggttgct cggcgacatg gcagcggcca ccgacatgcc cgggcgattt gtggtggctg 776401 cggcgatggt gcgccacctg ctcaccgatc cgatgttgcc cgctgaactg ttgcccgccg 776461 actggccggg cgccgggtta cgggcggcgt accacgactt cgccactgca atggcgaaac 776521 gacgcgatgc aactcaactc ctggaggtga catgagtgat ctggtgcgtg tggagcgcaa 776581 aggtcgggtg accacggtga ttctgaaccg gccggcctcc cgcaacgcgg tcaacggccc 776641 gaccgccgcg gcgttgtgcg cggcgttcga gcaattcgac cgggacgacg ccgcgtcggt 776701 ggccgtactc tggggtgcgg gtggaacctt ttgtgcggga gccgatttga aggcctttgg 776761 cacaccggag gccaactctg tgcaccggac gggtcccggc ccgatggggc cgtcacgaat 776821 gatgctgtcc aaacctgtga tcgccgccgt cagcggctac gccgtcgccg gggggctgga 776881 attggcactg tggtgcgacc tgcgggtggc cgaggaagac gccgtgttcg gtgtgttttg 776941 ccgtcgctgg ggggtaccgc tcatcgacgg cggcaccgtg cgactgccac ggctgatcgg 777001 gcacagccgc gcgatggaca tgatcctcac tggccgtggg gtgccggccg acgaagcgct 777061 ggccatgggg ttggccaatc gggtggtgcc caagggtcaa gcccgacagg cggctgagga 777121 gttggcggcg caattggccg cgctgccgca gcagtgtctg cgatcggatc ggctgtcggc 777181 gctgcaccag tggggcctgc ccgagtccgc ggcgctcgac ctcgagttcg ccagcatcgc 777241 gcgggtggcc ggcgaggcgc tagagggggc gagacggttc gccgcgggtg ccggtcggca 777301 tggggccccg gcacctcggg ccgaacaggg cgacacgctt taggcgggta cggctcagac 777361 caaggcgaag gtccgtgccg atgccggcga gggccacggc tgcggaacgg gtcgttgccg 777421 gacaacctgg ggccaccaga accactttcc gaggagggcc gcgatcgacg gtgtcatgaa 777481 cgaccggacg atcagggtgt cgaagagcag gcccataccg atggtggtgc caacctgggc 777541 catcacggtc agctcgctga cggcaaacga catcatggtg aaggcaaaca ccagcccggc 777601 ggcggtcacc accgacccgc tgccacccat cgcacggatg atgccggtgt tgattccggc 777661 gtggatctcc tccttgagcc gggcaaccag cagcaggttg tagtccgcgc cgacggccag 777721 caggatgatg accgccatcg ccaacaccaa ccagtgcagc tcgataccca ggatgtgttg 777781 ccagatcagc accgacagcc cgaacgaggc gcccagcgac aacaccacgg tgccgacgat 777841 gacggcggcc gcgacgacgc tgcgggtgat gatcagcatg atgatgaaga tcaggcagag 777901 tgcggagatt ccggcgatca tcaagtcata ggtgttgccg tcggacaagt ccttgaacat 777961 cgccgcggta ccgcccaggt agatcgcgga tccctccaac ggtgtgccct tgatggcttc 778021 cttggcggcg gtcttgatct tggcgatgcg cgcgatgccc gcctggctca tcgggtcgcc 778081 ttcgtggctg atgatgaacc gcaccgcgtg cccgtccggc gagaggaact gttccaggcc 778141 gcgttggaag tcgggattgt cgaaaacctc gggaggcaga tagaacgagt cgtcgttgcg 778201 cgaagcatca aaggcttcgc ccatcgccgc cgaatcctcc tgcatcgcgg ccatctgatc 778261 ctgcagccct tcctgggtgg aatgcatgct cagcatctgc gccttcatgc tcttcatggt 778321 ctggatcatc tcgggcatca tcgcggtcag ctggggcatg agcgtgtcca ggcgctgcat 778381 gagcggcagc aggttgttga tgtcttcggt catgacgtcg attccgtcga gggtgtcgaa 778441 caccgaccgc agcgaccagc agaccgggat gtcgtagcag tgcttttccc agtagaagta 778501 gctgcggatg gggcggaaga aatcgtcgaa atccgcaata tggttgcgca actcctcgac 778561 atcgaccacc atccccgtca tctgaatgac catttcgtgg gtgacatcgg ccatctgctg 778621 ggtgaggctg tgcatccgct ccatctggtc gatgttggac tgaatgtcgt tgacctgctc 778681 cagcatcctg gccgtcaggt cctggttgta tttctcggtc agtttctggc tggtgccctg 778741 catgctgatc aggaacggga ttgaggtgtg ctcgatcggt ttgccgtccg gccgggtgat 778801 ggcctgcacc cgggatatcc cctccacggc gaaaatggcc ttggcgatct tgttgatcac 778861 caaaaagtcg gccgaattac gcatgtcgtg gtcgctttcg accatcagca cctcggggtt 778921 catccgggcc tgggagaaat ggcgctccgc ggccgcatag ccttcgttgg ccggtaggtc 778981 ggcgggcagg tagttgcggt cgttgtagtt ggtccggtag cccggcaggg tcagcagacc 779041 gacgagcgcc agggccaccg caccgaccag gatggggccg ggccagcgga cgatggcggc 779101 cccgaccttg cgccagcccc gcacccgcgc catccgcttg ggctcgagca gcttgccgaa 779161 ccggctcgtc acggcgatta tcgccgggcc cagggtgagt gcggcggcga cgacgatgac 779221 catcccgatc gccaacggca caccgagggt ctgaaagtac ggcagtcggg tgaagctcag 779281 acagaacgtg gcacccgcga tggtcagacc cgagcccagc acgacatggg cggtgccgcc 779341 gaacatggtg tagtacgccg actcccggtc ctggccgagc ccgcgtgctt cctggtagcg 779401 gccgatcagg aagatggcgt agtcggtggc ggccgcgatc gccagcacca cgagcaggtt 779461 ggtcgcgaag gtcgagagcc caatgatccg gtggaaaccg aggaaagcca cgcccccgcg 779521 ggtggcgagc agcccgagca ccaccatcgt cagcatgatc gccgacgtga tgatcgaccg 779581 gtagaccagc agcaacatca cgatgatcac ggtgaacgtg accgcctcga tcacctgcag 779641 actacggtcg ccggcctgct gctgatcggc gaccagcgcg gccgaaccgg tgacgtacac 779701 cttgacaccg ggtggcggcg caaggcgctc gacgatggtc ttgaccgctt ccacggactc 779761 gttggccagt gactcgccct gattgcccgc gagtttcacc tgaacgtagg cggccttgcc 779821 gtcgctgctc tgggcgccgg tggcggtcag tggatccccc caaaagtcct gcaaggactg 779881 gacgtgggtg gtgtcggctt gcagtctgcc gatcatctgg tcgtaaaacg catgggcggc 779941 gtcaccgagc ggccgctggc cctccagcac gatcatcgcc gcgctgtcgg agtctccctc 780001 ctcgaacacc ttgccgatgt gtttcatcga gatcatcgac ggtgccgcgt cggggctcat 780061 cgacaccgcc tgtatctgtc cgaccgtttc cagttgcggc acagtgacgt tgaggacggc 780121 gatggtgacc aaccacccaa ggatgatcgg caccgcgaag gtacggatca ttctggggat 780181 gaacggtcgc gccgcgtgcc tgtcgggcgg gacggagccc gtcggcgcag ctgtcctttg 780241 cacgatcatg cggatttcac aaagcagtag gtcagggcat ccacgccggt tgcggtccgc 780301 tcgtccttca cttcgccatc gacggtgatt cggcaggtga tggaagtgcc gtcgccttgc 780361 gcgaggatgt tgggggccgc ggacggcgcc gtggtcttca aggtgagcga ccacggcagg 780421 gctgcgccgt cgatccgctg tggcttggcg tcgaggtcca ggtagttgat gttgacgtaa 780481 ctaccggagc cggaaacttc gtactccacc accttggggt cgaacggctc cgggtcatcg 780541 gcgaagacct tcggcgtcac caagatgcct tcggaaccaa agaaagtgcg gatccgctgc 780601 accgtgaagc cggcgatggc gaccacaacc aggatgagca gcggtatcca ggcacgcttg 780661 agagttccaa tcatcgccct ccgcctctgc cgcatgaagt tcacgccggt ctggtgacgc 780721 ataccgaacg tcacagattt cagagtacag tgaaacttgt gagcgtcaac gacggggtcg 780781 atcagatggg cgccgagccc gacatcatgg aattcgtcga acagatgggc ggctatttcg 780841 agtccaggag tttgactcgg ttggcgggtc gattgttggg ctggctgctg gtgtgtgatc 780901 ccgagcggca gtcctcggag gaactggcga cggcgctggc ggccagcagc ggggggatca 780961 gcaccaatgc ccggatgctg atccaatttg ggttcattga gcggctcgcg gtcgccgggg 781021 atcggcgcac ctatttccgg ttgcggccca acgctttcgc ggctggcgag cgtgaacgca 781081 tccgggcaat ggccgaactg caggacctgg ctgacgtggg gctgagggcg ctgggcgacg 781141 ccccgccgca gcgaagccga cggctgcggg agatgcggga tctgttggca tatatggaga 781201 acgtcgtctc cgacgccctg gggcgataca gccagcgaac cggagaggac gactgatgag 781261 caacctcgca atctgaccga ggtggcgagc aagacggcga ttggcctgtg gtcactcctt 781321 gttgatgcgg ttgcccgcgc cgaggttatc gattgtgggg tcaccgtttt tgtaggtgac 781381 cgtgttgtcc agcccaacaa caacgaggcg ctcgtcgatc ctgtcgaagg cgatcttgtt 781441 gttcgcacca ccgacggtca ccgtttcgca ggtgccgttg acggtcagcg tgttgtccga 781501 gccggccacg ttcagtgact tgccgtcagc gcagtcaagg gtggcggtag tcccgatgga 781561 tccgtaggtc agcatgtcac cgatctggat cgaagcggtt gtggattctc cggtcgtcac 781621 ggtcggcgcc gcggtcgggc cgctcgtcgc tgtcgtggtg gtggcggtcg cgggcgtcgt 781681 ggtagctgcc ggcgggttgg cagtggaact gcagccggcc agcggcaacg ctgcggcagc 781741 cagcgccaga gcaaaggtcg ccaaccggga gtgggtagcg cgatcggcgc gcaacggttt 781801 ctcgaccacc tcagccgacc cgctgcagtc ggtttaccat tcctagttcc cggccacggt 781861 cccagatgaa cggatcacca ttgcggaaga acaccgtttc gtcccagccg tagacggtga 781921 tgtcgttgat gatcgtgtcg gcgacaacgg tgttggacga gcccatcacg gtcaccgccc 781981 agcaggttcc cagcgcggtc acgatgttct gagtgccgtt gaccaacaag gtggattcgt 782041 tgcagtccag cgtccgctcg atgccctgcc cggtgacatg ggtgtcgccg ttcttggcgt 782101 gtgcggccgg cggtggggcg gccaaggcga cagcgatggt gatgacaccg gcagccagcg 782161 acgcggcgac ggtgttccac ttcacggcgg gccccccttc gactgggcgg gtgatgcttg 782221 actgagcctt ggtcgggcct tgattgagcg tacgtgcatt cgcccgggcg acgacagacc 782281 tgagtgcatt tgccgggcag gcaccccgcg tctgatgtca gctactccac aacccggtcg 782341 ctagagtcat tagttggccc taacgtcccc cgaagaccgg tgcggaccca aagccgatca 782401 ccccaaccga agggcgaacc gccatggcag ctcagccgca agcaccgtca gcgggcggcc 782461 gcccgcgcgc ggggaaagcg gtgaagtccg tggctcgccc ggccaaactg agccgtgaga 782521 gcatcgtcga gggcgccctg acctttttgg atcgggaggg gtgggactcg ctgaccatca 782581 atgcgctggc gacccagctc gggaccaagg ggccgtcgct gtacaaccac gtggacagcc 782641 tcgaggatct acgccgggcg gtgcggattc gggtgatcga cgacatcatc acgatgctga 782701 atagggtcgg tgcgggtcgc gcacgcgatg acgcggtgtt ggtcatggcc ggtgcctacc 782761 gcagctacgc ccaccaccac ccgggtcggt actcggcgtt cacccggatg ccgctgggcg 782821 gtgacgatcc cgaatacacc gctgcgacta ggggcgcagc cgcgcccgtc atcgccgtgc 782881 tgtcctcgta cggcctcgac ggtgagcagg ctttctacgc ggcgctcgag ttttggtcgg 782941 cactgcatgg gtttgtgttg ctggaaatga ccggcgtcat ggacgacatc gataccgatg 783001 cggtgttcac cgacatggtg ctgcggctgg cggcgggcat ggaaaggcgc accacacacg 783061 gtggtaccgc gtcaacgtag cgccctgctt cggccgcaac gcccgctttg acctgccaga 783121 ctggcggcgg gtattgtggt tgctcgtgcc tggcggctta cgcctgatgt aggggcgtgg 783181 atgccgggcc aattcgcatg tccgcgatgc ctcggatgag acgaatcgag tttgaggcaa 783241 gctatgcgac acacccggcc gcgggtaacc gtggcggggc atggccgaca aacagaacgt 783301 gaaagcgccc aagatagaaa gccggtagat gccaaccatc cagcagctgg tccgcaaggg 783361 tcgtcgggac aagatcagta aggtcaagac cgcggctctg aagggcagcc cgcagcgtcg 783421 tggtgtatgc acccgcgtgt acaccaccac tccgaagaag ccgaactcgg cgcttcggaa 783481 ggttgcccgc gtgaagttga cgagtcaggt cgaggtcacg gcgtacattc ccggcgaggg 783541 ccacaacctg caggagcact cgatggtgct ggtgcgcggc ggccgggtga aggacctgcc 783601 tggtgtgcgc tacaagatca tccgcggttc gctggatacg cagggtgtca agaaccgcaa 783661 acaggcacgc agccgttacg gcgctaagaa ggagaagggc tgatgccacg caaggggccc 783721 gcgcccaagc gtccgttggt caacgacccg gtctacggat cgcagttggt cacccagttg 783781 gtgaacaagg ttctgttgaa ggggaaaaaa tcgctggccg agcgcattgt ttatggtgcg 783841 cttgagcaag ctcgcgacaa gaccggcacc gatccggtga tcaccctcaa gcgggctctc 783901 gacaatgtca aacccgccct ggaggtgcgc agccgtcgcg tcggcggcgc gacctatcag 783961 gtgcctgtcg aggtgcgccc cgaccggtcg accacgctgg cgctgcgctg gctcgtcggc 784021 tactcgcggc aacgccgtga gaagacgatg atcgagcgcc tggcaaatga gatcctggat 784081 gccagcaatg gccttggggc ctccgtcaag cggcgtgagg acacccacaa gatggccgag 784141 gcgaaccgag cctttgcgca ttatcgctgg tgagaagcgc cggttagcca gccagggcgc 784201 aaaccgacag tgatagacag ctaactagca accgaaagag tgggaagact tctgtggcac 784261 agaaggacgt gctgaccgac ctgagtaggg tccgcaactt cggcatcatg gcgcacatcg 784321 atgccggcaa gaccacaacc accgagcgca tcctgtacta caccggtatc aactacaaga 784381 ttggtgaggt gcacgacggc gcagccacca tggactggat ggaacaggaa caggagcgcg 784441 gcatcaccat cacctctgcg gccacgacca cgttctggaa agacaaccag ctcaatatca 784501 tcgacacgcc agggcatgtg gatttcaccg tcgaggtgga gcgcaatctg cgcgtgctcg 784561 acggcgcggt cgcggttttc gacggcaaag agggtgtcga accgcagtcc gaacaggtgt 784621 ggcggcaggc cgacaaatac gatgtccccc gaatctgctt cgtcaacaag atggacaaga 784681 tcggtgcgga cttctacttc tcggttcgca cgatggggga gcggcttggg gccaacgccg 784741 tgcccattca gcttcccgtc ggtgcagagg ccgacttcga aggcgtcgtc gacctggtgg 784801 agatgaacgc caaggtgtgg cgcggcgaga cgaaactcgg cgaaacctac gacaccgtgg 784861 aaataccggc cgacctggcc gagcaggctg aggagtaccg gaccaagctg ctcgaggtgg 784921 tcgccgagtc cgacgagcac ctgttggaga agtacctggg cggtgaggag ctcaccgtcg 784981 acgagatcaa gggcgcgatc cgcaagctga caatcgccag cgagatctac ccggtgctgt 785041 gcggcagcgc gttcaagaac aagggcgtgc agccgatgct ggatgccgtc gtcgactacc 785101 tgccgtcgcc gctggacgtt ccgccggcga tcgggcacgc gcccgccaag gaggacgagg 785161 aggtggtgcg caaggcgacc accgacgagc cctttgcggc cctggcgttc aagatcgcta 785221 ctcacccgtt cttcggcaag ctcacctaca tccgggtgta ctcgggcacc gtcgagtcgg 785281 gtagccaggt catcaatgcc accaagggca agaaagaacg gctgggcaag ctgttccaga 785341 tgcactccaa caaggagaac ccggtcgata gggctagtgc cggtcacatc tacgcggtga 785401 tcggtctcaa ggacaccacc accggtgaca ccttgagcga cccgaaccag cagatcgtgc 785461 tggagtcgat gaccttcccc gacccggtga tcgaggtggc catcgagccg aagaccaaga 785521 gcgaccaaga gaagctgagt ctgtcgatcc agaagctcgc cgaagaggat ccgaccttca 785581 aggtgcacct ggattccgag accggccaga ccgtcatcgg cggcatgggc gagctgcatc 785641 tggacatcct ggtggaccgc atgcgccggg aattcaaggt cgaggccaac gtcggcaagc 785701 ctcaggttgc ctacaaggag accatcaagc ggctcgtgca gaacgtcgag tacacccaca 785761 agaagcagac gggtggctcg ggccagttcg ccaaggtcat catcaacctc gagccgttca 785821 ccggtgaaga gggcgcgacc tacgagttcg agagcaaagt caccggcggg cgtatcccgc 785881 gggagtacat cccgtcggtg gatgccggcg cacaggacgc catgcagtac ggcgtgctgg 785941 ccggctatcc gctggtgaac ctgaaggtca cgctgctcga cggcgcctac cacgaggttg 786001 actcctcgga aatggcgttc aagatcgcgg gctcgcaggt gctcaaaaag gctgccgcac 786061 ttgcgcagcc ggtgatcctg gaaccgatca tggcggtcga ggtgaccaca cccgaggact 786121 acatgggtga cgtgatcggc gacctgaact cccgccgtgg ccagatccag gccatggagg 786181 agcgggctgg tgcgcgcgtt gttagggcgc acgtgccgct gtcggagatg ttcggctacg 786241 tcggtgacct tcggtccaag actcaaggcc gggcaaacta ctccatggtg ttcgactcgt 786301 actccgaagt gccggcgaac gtgtcgaagg aaatcatcgc gaaggcgacg ggcgagtgag 786361 cgcaagctca cgagtgagga gccgagcaat gggtacagcg aaggcgacgg gcgactaggc 786421 gatgcgaaga cgaccgctag tgagcgaagc tcacgagcaa tgagcagcgc gaaggcgact 786481 ggcgagtaga tacaaccata cgagtaggct ggcccggtta cgaccgcggc ataactgaaa 786541 acatcaacac tgcttttata agcactaaca agtccaggag gacacaaaag tggcgaaggc 786601 gaagttccag cggaccaagc cccacgtcaa catcgggacc atcggtcacg ttgaccacgg 786661 caagaccacc ctgaccgcgg ctatcaccaa ggtcctgcac gacaaattcc ccgatctgaa 786721 cgagacgaag gcattcgacc agatcgacaa cgcccccgag gagcgtcagc gcggtatcac 786781 catcaacatc gcgcacgtgg agtaccagac cgacaagcgg cactacgcac acgtcgacgc 786841 ccctggccac gccgactaca tcaagaacat gatcaccggc gccgcgcaga tggacggtgc 786901 gatcctggtg gtcgccgcca ccgacggccc gatgccccag acccgcgagc acgttctgct 786961 ggcgcgtcaa gtgggtgtgc cctacatcct ggtagcgctg aacaaggccg acgcagtgga 787021 cgacgaggag ctgctcgaac tcgtcgagat ggaggtccgc gagctgctgg ctgcccagga 787081 attcgacgag gacgccccgg ttgtgcgggt ctcggcgctc aaggcgctcg agggtgacgc 787141 gaagtgggtt gcctctgtcg aggaactgat gaacgcggtc gacgagtcga ttccggaccc 787201 ggtccgcgag accgacaagc cgttcctgat gccggtcgag gacgtcttca ccattaccgg 787261 ccgcggaacc gtggtcaccg gacgtgtgga gcgcggcgtg atcaacgtga acgaggaagt 787321 tgagatcgtc ggcattcgcc catcgaccac caagaccacc gtcaccggtg tggagatgtt 787381 ccgcaagctg ctcgaccagg gccaggcggg cgacaacgtt ggtttgctgc tgcggggcgt 787441 caagcgcgag gacgtcgagc gtggccaggt tgtcaccaag cccggcacca ccacgccgca 787501 caccgagttc gaaggccagg tctacatcct gtccaaggac gagggcggcc ggcacacgcc 787561 gttcttcaac aactaccgtc cgcagttcta cttccgcacc accgacgtga ccggtgtggt 787621 gacactgccg gagggcaccg agatggtgat gcccggtgac aacaccaaca tctcggtgaa 787681 gttgatccag cccgtcgcca tggacgaagg tctgcgtttc gcgatccgcg agggtggccg 787741 caccgtgggc gccggccggg tcaccaagat catcaagtag gtctaccggc caccagacgc 787801 aaaagaacat gatgggcgca ccagcgccca tcatgttctt ttgcgtctgc tcgcgaaaat 787861 gcccagcgtg cggcgctacg ctgacatgga ccctccgacg aggcaaggag caggcacgtg 787921 ttagcgcgct acatcaagat gcagttattg gtgctgttgt gcggtggtct ggtcgggccg 787981 atcttcttgg tcgtctactt cacgctcgga ctgggcagcc tgatgtcgtg gatgttctat 788041 gtcggtctga tcattaccgt tgctgacgtg ctggtcgcgc tcgcattgac caactacggg 788101 gcaaagaccg ctgccaagac cgcggcactt gaacggagtg gagtgctggc gctcgcccaa 788161 agcaccgggc tcagcgagac agggacccgg atcaacgatc aaccgctggt aaaggtgcac 788221 ctgcacatct cgggacccgg catcactccg ttcgacacgg aagaccgggt catcgccagt 788281 gtgacccggc tgggcaatct cacggctcga aaactggtgg tattggtgaa tcccgccacg 788341 cagcaatacc tgatcgactg ggaacgaagc gctttggtca acggcctggt gcccgcccaa 788401 ttcaccgtcg ccgaagacaa caagacctac gacttgagtg ggcaaaccgg cccgctgatg 788461 gagatcttgc agattctgaa ggcaaacaac gttccgctga accggatggt tgacatccgc 788521 tcgaatccgg cactgcgtca gcaagtccaa gcggtggtgc ggcgggcagc cgagcggcag 788581 gcgccggcgg ccgagccagc gtcgcaagga tcgatcgccg agcggcttgc ggagctggaa 788641 tcgctgcgcg ccagcggtgc ggtcaacgcg gcggaatacg agagcaagcg cgcccagatc 788701 atctccgaaa tctgaggcga gctggggcac catccgcggc gagcagacgc gaaagcccgc 788761 gacacgccga ggcatcgggg gattttgtct ggtgggcggg aatctggggc acgttagaac 788821 acgttacagt ttcgctgcta gcctgacagt cggcgagagg ggcgtatgtg tctgcgcggg 788881 gaggatcact gcacggccgg gtggcatttg tcaccggcgc cgcccgcgcc caaggacggt 788941 cgcacgcggt gcggctggcg cgcgaggggg ccgatatcgt cgcgctggac atctgcgcgc 789001 cagtatccgg cagcgtgact tacccgccgg ccacgtccga agatctcggc gagaccgtcc 789061 gcgcggtgga agccgaaggc cgcaaggtgc tcgcccgcga ggtggatatt cgcgacgacg 789121 ccgagttgcg gcggctggtg gccgatggtg tcgagcagtt cggccggctc gacatcgtgg 789181 tggccaacgc cggggtgctg ggttggggca ggctctggga actcaccgat gagcagtggg 789241 agaccgttat cggggtcaac ttgacgggta cgtggcgcac cttgcgggcc accgtgcccg 789301 cgatgatcga tgccggcaat gggggttcga ttgtggttgt cagctcgtcg gcggggttga 789361 aggcgacacc gggcaacggc cactacgcgg ccagcaagca tgcactcgta gcgctgacca 789421 acacgttggc gatagagctc ggtgaattcg gcatacgggt caactccatt catccttact 789481 cggtcgacac cccgatgatc gaaccggagg caatgattca gacgttcgcc aagcatcccg 789541 gatatgtgca tagctttcca ccaatgccgt tgcagcccaa aggttttatg acaccagacg 789601 agatatccga cgtcgttgtc tggttggccg gcgacggctc gggcgcactg tcgggcaatc 789661 agatcccggt cgataagggt gccttgaagt attgacgcgc gatcgtgtat gaacgcacac 789721 gtgaccagtc gtgaaggcgt caatgagttt gacgatggaa ttgtgatcgt cggcggcgga 789781 ttggcagctg cgcgcaccgc cgagcagttg cgtcgtgcgg gctattcggg tcgcctcacg 789841 atcgtcagcg acgaggtgca tctgccgtac gaccgtccgc cgctatccaa ggaggtgctg 789901 cgcagcgagg tcgacgatgt ggccctcaaa ccccgcgagt tctacgacga aaaggacatc 789961 gcacttcggc tggggtcggc tgccgtcagc ttggacacgg gagaacagac ggtaacgctg 790021 gccgacggta cggtgctcgg ctacgacgag ctcgtcatcg cgactggttt ggtgccccgg 790081 cgtattccat cgcttcccga ccttgatggc attcgggtgc tccggtcgtt cgacgagagc 790141 atggcactgc gcaagcatgc atccgccgca cggcacgccg tggtggtggg ggccggtttc 790201 atcggctgcg aggtggctgc cagtctgcgc ggtctcggtg tggatgtggt gctggttgag 790261 ccgcagccgg cgccgttggc ctcggtgctg ggcgagcaga tcggccagtt ggtgacgcgg 790321 ctccatcgcg atgagggcgt tgatgttcgc acgggtgtga cagtggccga ggtacgtggc 790381 aaggggcatg tcgacgcggt ggtcctgacc gacggtaccg aactgccggc tgatctggtg 790441 gttgtgggca ttgggtcgac cccggcgacc gaatggctag agggtagcgg cgtcgaggtc 790501 gacaacggcg tgatctgtga caaagccggg cggactagcg cgccgaatgt gtgggcgctc 790561 ggtgacgtcg cctcctggcg agatccgatg ggacaccaag cacgcgtgga acattggagc 790621 aacgtcgccg accaggcccg agtcgtggtg cccgcgatgc tcgggaccga tgtgcccacg 790681 ggcgtggtcg tcccgtattt ctggagtgac cagtatgacg tcaaaatcca gtgcctgggg 790741 gagccgcacg ccaccgacgt tgtgcatctg gtcgaggacg acgggcgcaa gttccttgcc 790801 tattacgagc gcgatggcgt gctggttggc gtggtcggtg gcgggatggc cggcaaggtc 790861 atgaaggtgc gcggcaagat cgccgcgggc gcgcccatcg ccgaagtgtt agaccaaact 790921 caggcctaga gctgacctag gtggcagcgg gcgccctggt cgtcggcgca ttcggcggac 790981 atatcgtctg gctgtcggga cggctcggcc agcgcgccgg ccgcacgcac ccgtgcgacc 791041 gccgcgtcga tatcgccacc gtccatatcg gcacggcgct cggatgcggg ctgcggcccg 791101 ctgcaccggg ccatcagatg cacgcccggc gcctgccagc catccgcgac gcggcccggt 791161 ttgacggtcc accccagcac gtggcgtaga agtcgcggat gacgtcggaa tcggctacct 791221 cgtagctgac attcgccgtt tcactgagcc ggcatctgtt gagctctggg cgtttccggc 791281 acgggctcgg cttcaagacc gcgaatgccg cgcttcggtt gtcggtagca tcgaggatgg 791341 ctccgtgctc ggattgacct gtttgtcccg cggcgccgct ggctgcggtg atcgcctcgc 791401 gcgtggcgac caggcctgcg acggcctagc agcaaccagg gtgggacaac cggagccggc 791461 gaatatgccg gtcgggagct cggttgttcg tgggagctcg gtgaccggca tggggacatg 791521 acggcggctg atcgtggtca gccagcttgt gtcgcgccac gcagccatga aatcgcgatt 791581 ggcggcggat cgcttccatg cgcggcatcc gttgcagccc gaaatgtctc cgacgtcagg 791641 tcctcggccg tgccacgggc gccgcagagc cgcccatacc gtcgccgcgc tggcgtgaat 791701 tccgacgggt tggtcaggaa aatatccggc gagctgctac cgtggcggcg ccaacactgt 791761 gccgacccaa ctcggggatc gcagatcgct cgtcattgcc aggtcaccgg tgggccgtgt 791821 ggatggcatt cacccaagac ccgggcgtga ccacccggcc aactgcgcat acgtaccagg 791881 tacttgattt gggcgccggg ccgctggtgg gcaggctcca aggtgaggtg cacgaacggg 791941 cagtgggcat cggcttgcgc ggctaatgcg tcgattccgg cgcggatcgc tgcgcgttcg 792001 tcggcgggca ggtactgcca ggtgatcgaa tgccacaaca cggtgagtgc atcgtcggtc 792061 agagtcatgc cggcgactgc ggcgtgcgcc gcctgccgat ggaggtccgc gggaatgttg 792121 cgggcgacgg cgatggcgcc ccgcaaccgc tccaaccgat cggtctggtc cggccagatg 792181 tagctcaacg cgttcagctc cccgtcgggg ctggtgacgt cgatgggcgc gatgtcgtat 792241 ccgtgtcgtt cgacgatccg caccgtggcc gtcggcggca attcgcccag ccaggcattg 792301 tcgattcgca ccggtgagtc ggccaggccc cattcgccgc cgagataacg gtagcggtac 792361 cgatctggtc gcaggttcag ccctgcactg gaccctatct cgaaaagcct tattggcaag 792421 tcgaattgga ggcaggcgat gagaagtcca ccgatcaacg ccgccgagcg ccctacctcg 792481 ttggtctgcg gtggccgatc gagagccgca cgcagcgact ccggctggtc ggtcgcggtg 792541 cggacgatat cgggccaggc tgcctccgcc tgccaggtgc cgccggtgct ggggtaccag 792601 cggcgcaaca ccggtgcgcg gccgtcgagc accatccggt gcaatccgcc gagcagccga 792661 agcggcaccg cctggccctc cggagcaccc ttctggtcgg ccaagatgga cgcgaagacg 792721 ccgccgcttt cgacgtcagc tgccacgagc tcaagtagct cgcggtacat cggggagccg 792781 gaggaggtgc acacccgccc ctgtgaccgc agggtgtgga ccaggtgttc ggtgcccgtc 792841 actggttgag tcggtccaga ccggcgccga cgacgtcaaa cgcggccccg agcgcttcgg 792901 tcaaggagac ggattcgtcg cgcagccaat gttcataggc gctcagagcg acccccagca 792961 ttgtccaggc gacggtttgg ggcataaagt ctgtcgtctt tccacccgat ctgcgggcaa 793021 cgaatttggc gatcacctcg cgccagccag catacatggt catcgaatag gcctgcagtt 793081 caggagtttg caagatgacc cgcatgcgct tgcggtgtcg gatggtttcg gattcgtcaa 793141 aggtgttgaa ggccaacagc gctgcgcgca acgcgtccct cagctgaatc cgtgaatcga 793201 tattgtcgag tagaccttgt agctgtgcaa ggtgggtgct gaagtcaccc caggggatgg 793261 cgttcttgga ggcgtagtag cgaaacaacg ttctgcgggc gatgccggcc gcccgggcga 793321 tgtcgtccac gctgacatcg gtgaaaccgt gggcagcgaa cagttcgatg gcaacatcgc 793381 tgatgtggtg cggtgtggtt gagcgccgtc ggcccacccg cgactcgtgc ggcatcacat 793441 tcgcccttcc atttcggcac tcgatgccat attgtgtcca gatcgacgga tcgctgtcga 793501 gacctgctgg cgaaaggcaa tccagatgga ctacgaaacc gataccgaca ccgagcttgt 793561 caccgagacc ctggttgaag aggtgtccat cgacggaatg tgtggggttt actgaccgtg 793621 ccggcgcccg cgcaggctcg ccgggctgat tccagcgaat tcgatcccga tcgcggctgg 793681 cgactacacc cacaggtggc ggtccggccg gagccttttg gcgcgctgct ctatcacttc 793741 ggcacccgta agttgtcatt tctgaaaaat cgcaccatcc tcgcggtggt gcagacgctg 793801 gcggattatc ccgatatccg gtcggcctgc cgcggcgccg gcgtcgacga ctgtgaccag 793861 gatccgtacc tgcacgccct gagtgtgctc gccggttcga acatgctggt tcctcggcag 793921 acaacatgac gagccccgta ccccgactca tcgagcagtt cgagcggggg ctcgacgcgc 793981 cgatctgcct tacctgggag ctgacctacg cctgcaacct agcttgcgtg cactgcctgt 794041 cgtcctcggg caaacgcgat cccggcgagt tgtccacccg ccaatgcaag gacatcatcg 794101 acgaactgga acgcatgcag gtgttctacg tgaacatcgg cggcggcgaa ccaaccgtgc 794161 gcccggactt ttgggagctg gtagattacg ccaccgcaca ccacgtcggg gtgaaattct 794221 ccaccaacgg ggtccggatc acccccgagg tggccacgcg gctggcagcc accgactacg 794281 tcgacgttca gatctcactc gacggcgcca cggccgaggt caacgacgcc atccgcggca 794341 ccgggtcgtt cgacatggcg gtgcgcgcgc tgcagaacct ggcagcggcg ggatttgccg 794401 gcgtcaagat ctcggttgtg atcacccggc gcaacgtcgc ccagctcgac gaattcgcca 794461 cgctggcaag ccgttacgga gcgacgttgc ggataaccag gttgcgaccg tccgggcgcg 794521 ggactgacgt atgggccgac ctgcacccca ccgccgacca gcaggtgcag ctttacgact 794581 ggctggtttc caaaggagag cgggtgctca ccggcgattc cttcttccac ctggcgccgc 794641 tcggccagtc gggggctctg gccggcttga acatgtgcgg agccgggcgg gtagtgtgcc 794701 tgatcgaccc ggtgggtgac gtgtatgcgt gcccattcgc cattcatgac cacttcttag 794761 ccggaaacgt gttgtccgac ggcggatttc aaaatgtctg gaagaactcg tcgctgtttc 794821 gcgagctccg ggagccccag tccgcaggcg cctgtggcag ctgcggacac tacgacagct 794881 gccggggcgg ctgcatggcg gcgaaattct tcaccggcct gccgctggac gggccggatc 794941 ccgaatgcgt gcaaggccat agcgagccgg cgctggcgcg cgagcgccac ctaccgcggc 795001 cccgcgccga ccactcccgc ggtcggcgcg tcagcaaacc ggtgcccctg acgctgtcga 795061 tgcggccacc caagcgcccg tgcaatgaaa gtccggtgta gccgtggccg aagcgtggtt 795121 tgaaacggta gccatcgcgc agcaacgcgc gaagcggagg ctgccgaaat cggtttactc 795181 gtccctgatt gcggccagtg aaaagggaat cacggtcgcc gacaatgtcg cagcattcag 795241 cgagctcggg ttcgcgccgc acgtcatcgg ggcgacagat aaacgtgact tgtcgacgac 795301 cgttatgggg caagaagttt cgttgccagt gattatttcg ccgaccggtg ttcaggcggt 795361 cgatcccggc ggtgaagtcg ccgtcgcgcg ggccgcggcc gcccggggta ctgtgatggg 795421 attgtcctcg tttgccagca agccgatcga ggaggtcatt gccgccaacc ccaagacctt 795481 cttccaggtc tactggcagg gcgggcgcga cgcgctcgct gaacgcgtcg aacgggcgcg 795541 gcaggccggc gcggtcggcc tggtcgtcac caccgactgg acgttctcgc acgggcgcga 795601 ctggggcagc cccaagatcc ccgaagagat gaacttgaag accatcctgc ggctatcccc 795661 ggaggcgatc acccggccga ggtggttgtg gaagttcgcc aagacgctac ggccaccgga 795721 cctacgggtg cccaaccagg gccggcgcgg cgagcccggc ccaccgttct tcgcagccta 795781 cggcgaatgg atggcaacac ctccgccgac ctgggaagat atcggctggc tgcgcgaact 795841 gtggggcgga ccgttcatgc tcaagggcgt catgcgggtc gacgatgcca aaagagctgt 795901 ggatgccggg gtttcggcga tctcggtatc caaccatggt ggcaacaatt tggatgggac 795961 gccagcatcg atccgggccc tgcccgcggt ctcggcggcg gtcggcgatc aggtcgaagt 796021 gttgctcgac ggcggcatcc ggcggggcag cgatgtcgtc aaggcggtgg cgctgggcgc 796081 gcgcgcggta atgattggtc gcgcttacct gtggggcttg gccgccaacg gccaagccgg 796141 ggtcgagaat gtactcgaca tcctgcgcgg tggtatcgac tcggctctga tgggtctcgg 796201 gcatgcctct gtccatgacc tcagcccagc cgacatcctc gttcccaccg ggttcatccg 796261 cgacctgggt gtgccctccc gacgggacgt ttagccggat gttgagctgg gcccaaattg 796321 gggttggccc tcccattacc acagagatgc tcgcgacgga atgacgtttt tagaaattct 796381 gatacgggcg tggcagccgt ggcgggcgag caggatgtgt ggccggtaag tcatcacgac 796441 gaaaaaaatt tcggtagaag acataacaat tggtgcacgc caggtgaatt cgtcctacca 796501 tcggcgagtg ccggtagtcg gggaactcgg gagtgcgacg tcgagccagc taccaagcac 796561 gtcgccgtcg atagtgatcc cgctggggtc caccgagcag cacggtcccc acctgccgtt 796621 agataccgat acccggatcg cgaccgccgt ggcccggacc gtcaccgcga ggctgcacgc 796681 cgaggacctg cccattgctc aggaggaatg gctgatggcg cccgccattg cctacggcgc 796741 cagcggcgaa caccagcgtt tcgctggaac gatctctatc ggcactgaag ccctgacgat 796801 gttgctcgtg gagtatggca ggtcggccgc ctgctgggcc cggcgcctgg tcttcgtcaa 796861 cgggcacggc ggcaatgtcg gcgctttgac ccgagcggta ggcctgctgc gcgctgaagg 796921 tcgcgacgcc ggatggtgcc cgtgcacctg cccgggcggt gacccccacg ccggccacac 796981 cgaaacatcc gtgctgctgc atctttcgcc ggccgacgtg cgcaccgaac ggtggcgcgc 797041 gggtaatcgc gcaccgctgc ccgtgttgtt gccgtcgatg cgccgaggcg gggtcgcggc 797101 cgtgagcgag acaggagtgc tcggggatcc gaccacggcg accgcggccg aggggcggcg 797161 gatcttcgcg gcgatggtcg acgactgtgt gcgccgagtc gcccggtgga tgccacagcc 797221 cgacgggatg ttgacatgac cgcgccggcg acgatgcaga gcgaagcgat gaggagaagc 797281 ggcgcagatg accgcgaccc gactgcctga cgggttcgcc gtccaggttg accgtcgcgt 797341 gcgagtgctt ggcgacggct cggccctgct cggtggctca ccgacccggt tgctgcggct 797401 ggctcccgcc gcacgaggcc tgctctgtga cggccgcctt aaggtccgcg acgaggtcag 797461 cgcggagctg gcccgcatcc tgctggacgc cacggtggcg catccacggc cgccgagtgg 797521 gccgtcacat cgtgacgtca ccgtcgttat accagtacgg aacaacgcat ctggtctgcg 797581 gcgtctggtg acctcgttac gcggattacg cgtcatcgtg gtcgacgacg gttcggcgtg 797641 cccggtcgag tcggacgact ttgtcggcgc acattgcgac atcgaagtac tccaccaccc 797701 ccacagcaag gggccggccg cggctcgcaa caccgggcta gcggcctgca ccaccgactt 797761 cgtggcgttc ctggattccg acgtgacgcc gcggcgggga tggttggaat ccttactcgg 797821 ccacttctgc gatcccaccg tcgcactcgt cgcacctcgc atcgtcagct tggtggaagg 797881 cgagaacccg gtagctcgct atgaggccct gcactcgtcg ttggaccttg gtcagcgcga 797941 agcgccggtg ttaccgcata gcacagtctc ttacgtgccg agcgccgcca tcgtttgccg 798001 gagttcagcc atccgcgacg tcggcggctt cgacgagacc atgcactccg gggaagatgt 798061 cgacttgtgc tggcggctca tcgaggctgg tgctcggctg cgctacgagc caattgcgct 798121 ggtcgcccat gaccatcgga cccaattgcg ggactggatc gcgcgcaagg cgttttacgg 798181 cggttcggcg gctccgctag ctgtgcggca cccggacaag accgcgccgc tggtgatttc 798241 gggcggggcg ctgatggcgt ggatcctcat gtcgatcggc acaggccttg gtcgactggc 798301 gtcgttggtg atcgcggtgc tgactggtcg ccggatcgcc agggccatgc gctgcgccga 798361 gacgtcgttc ttggatgtgc ttgccgtcgc cacccgcggg ttgtgggcgg ccgcgctgca 798421 gctggcgtcg gccatctgcc ggcactattg gccactggca ttgctcgcgg ccatcctgtc 798481 gcgccgctgt aggcgggtgg tgttgattgc ggcggtagtg gacggtgtgg tggattggct 798541 tcgccgcagg gagggcgccg acgatgatgc tgaaccgatt gggccgctga cctacctagt 798601 gctgaagcgc gtggacgact tggcttatgg cgctggcctg tggtacgggg tggtgcgcga 798661 acgtaacatc ggcgcgctca agccgcagat tcgtacctag tgtgactgcg gcggtccggc 798721 atagcgatgt gctggtcgtc ggtgctggaa gtgctggatc ggttgttgcc gagcgtcttt 798781 ccatggactc gagctgtgtg gtgaccgtgc ttgaggctgg ccccgggctg gccgatccgg 798841 ggttgctggc tcagacggcc aatgggttgc aactgccgat cggagctggc agccctctgg 798901 ttgagcgtta tcggacgcgg ctcaccgatc gaccggttcg ccacttgccg atcgtgcggg 798961 gtgcgacggt cggcggttcc ggcgcaatca acggcggcta tttctgccgc ggactgccca 799021 gcgatttcga ccgtgcctcg ataccaggct gggcatggtc tgacgttctg gagcacttcc 799081 gggctatcga gacagatctg gatttcgaga cgcctgtgca tggccgtagt ggccccatcc 799141 cagttcgccg cacacacgaa atgactggca tcactgaaag tttcatggct gccgcagagg 799201 acgcagggtt cgcttggatc gctgacctca acgatgttgg gccggaaatg ccttcgggtg 799261 taggcgcggt cccgctcaac atcgttaacg gcgtacgcac cagctcggcg gtcggctatc 799321 tgatgcccgc gctgggacgg ccgaatctga cactgctggc ccggacgcgg gcggtgcggt 799381 tgcgcttttc cgccaccacc gcggtgggtg tcgacgcgat cggcccagga ggcccggtaa 799441 gcctgagcgc tgaccgaatc gtattgtgcg ccggagcgat tcagtcagct catctgttga 799501 tgctctcggg cgtcggcgag gaggaggtgt tgcgatccgc cggtgtgaag gtgcttatgg 799561 cgttgccggt tggcatgggc tgcagtgacc acccggaatg ggtgatgccg accaactggg 799621 cggtggctgt cgatcggccg gtgttagagg tgctgctgag cactcatgac ggcatcgaaa 799681 taaggccgta cacaggcggc ttcgttgcga tgaccggcga cggtacagcc gggcatcgcg 799741 attggccgca tatcggggtg gcgctcatgc agccgcgggc acgcggacgc atcacgttgg 799801 tctcgagtga tccccagata ccagtccgca tcgagcaccg atacgacagt gaacctgccg 799861 atgtcgcggc cctgcgccag ggtagcgcat tggcccacga attatgcggt gcggcaacgc 799921 gcatcggtcc agccgtatgg gcgacatcgc agcatctgtg tggtagtgcc ccaatgggca 799981 ccgacgatga cccacgagcc gtcgtcgacc cgaggtgtcg ggtccgcggc atcgaaaacc 800041 tatgggtgat agacggatct gtccttccgt cgatcaccag tcgcggtcca cacgcaacga 800101 tcgtaatgct gggccaccgc gcggccgaat ttgttcagtg actttcgtcg agtggggcga 800161 ccacagcggt cgctgccgaa tgtgcatttc ggtcaggcat tgagcagggg accgaatagc 800221 gtagctccgc atcggactgc agtcgtcagg tcgacgatga tggcgctgac atcggaggtg 800281 ggccgcggcc caggcttcgc ggtttggcgg cctgcgaaga agtggctctt ctgacacttc 800341 cgtgggtgga cttctggttt gagtaggcgc acgtcgttgt cgcttagggt ttctggcttg 800401 tcaaaggaca ggaccagcgc agatcactgt agtcttagct gatgctgccg cccggattgc 800461 cgacgtcgtg gcccagcggt gccccaacgc ggtccgccgc gtcgatcctt tccacgtggt 800521 ggcctgggcc accgaggctc tagaggctga acggcgccgg gcctgaaacg acgcgcgagc 800581 gcctgcccgg accccgagtc attgggtcgc aggggtaacc gaagggtgca cgttgaccgc 800641 gtgaggctaa ccggcaccga gcgtgaactg agggcggaga atcagagccc cccgattttc 800701 cgcccgcaga acacgttggg cgacggcgcc aacgggctgc cactggccgt gtgcaccacg 800761 acggctcaca cgtgccacac ttcccatact cacccatcgc ggtggacccc aaacccagtg 800821 ccggccacca agggcgtccc cgctggattg gtgcaagcaa ccttcatcat cgaaaacctt 800881 gaccccggca acaacgacac gccgaccccc ctacacccaa actgcgatta gcccgaaaac 800941 ctgggcacca taggcgatct gaatacgatg cggattcggt gctgcggaga aaggatacat 801001 cgcgccgatg cgtccaggcg gatgacgtcc gatgcgtgca gctggtccag gatccgcggc 801061 gcggacgtgt cgaactcggt ggttaccgcg ccgagcttac tgttggccga cgggcggcgg 801121 tgaattgcca acgcccgcaa tatggtgcgg atggatggcc cgttcggttg ggttgcgggg 801181 taggcggcgc cgcgcgaggc gatcagcgct gaggtcggga attcacctcc ggtcgcggga 801241 gtacagcggt cggctggggt gccgccggtg tctgtcgggt agaggcggca ggacacgctc 801301 gccgtcaaaa cggcttcggc aaacgggtct tcgccgtcga caggcagggt tggtgatccc 801361 ggcctcggcg gcgacggtct ggtcattgat tgtgcgatgg gtgatcgtcg tgtcgatctg 801421 ctcgcggcga aggactcgga gatccggcgc tcgatggggg cagtaccggt cggcgcggga 801481 agctcgcagg tggcgacgag ttgggcgagt gatcgttgca tccgctgtcg ggcggcgatt 801541 ctgtcggccg actgtgccaa cttggctagg gccaattcgc ggggcggtct ggcagtcggc 801601 gggtccgctg tcagctagct gcagcagctc ctcgatctcg tcgaggctga atccatggga 801661 ctacgcgcgt ctgacggacg agaccaccga taccgcatcg gcgcgatacc gcgatagccc 801721 tgagaacgac cgggccggcg cggctagcag gtcctccgct cgtaatagcg cagcgtctgg 801781 ccgttgatcc cagcccgcgc ggcaacctgg ctactccgca ttccgccatt ccgaaccctg 801841 tactcgactg tcgagtcaag ggtgtgtgtt gtcagtgccg ggtcaggtgc cgatagcaac 801901 cggccgcccg ccgctgcacc cgagccagcg gcgatttggc cgaagcggtg atctggggca 801961 tactgttcag gttgccctgc gccaggtttg cgtctggccg ggcatacgac cagcgcccac 802021 acgggcgcgt ccggtcccac ccttgaagcg cgacgatttc ggccttgaaa ttgatcgcga 802081 caaggctgtg tgcgggcgac acgcccgagc gcggggccgg tggacctacg acaggtaaac 802141 agcggcgcag tattcggcgc aacgctagat cggtccagaa ggaccgggtc gatcggcgcg 802201 ccggggagca ccggacccgg atacgggctc gagtgggagt gaggtaggag aagcgtggcg 802261 ggacagaaga tccgcatcag gctgaaggcc tacgaccatg aggccattga cgcttcggcg 802321 cgcaagatcg tcgaaaccgt cgtccgcacc ggtgccagcg tcgtagggcc ggtgccgcta 802381 ccgactgaga agaacgtgta ttgcgtcatc cgctcaccgc ataagtacaa ggactcgcgg 802441 gagcacttcg agatgcgcac acacaagcgg ttgatcgaca tcatcgatcc cacgccgaag 802501 accgttgacg cgctcatgcg catcgacctt ccggccagcg tcgacgtcaa catccagtag 802561 gagattggac agagcaatgg cacgaaaggg cattctcggt accaagctgg gtatgacgca 802621 ggtattcgac gaaagcaaca gagtagtacc ggtgaccgtg gtcaaggccg ggcccaacgt 802681 ggtaacccgc atccgcacgc ccgaacgcga cggttatagc gccgtgcagc tggcctatgg 802741 cgagatcagc ccacgcaagg tcaacaagcc gctgacaggt cagtacaccg ccgccggcgt 802801 caacccacgc cgatacctgg cggagctgcg gctggacgac tcggatgccg cgaccgagta 802861 ccaggttggg caagagttga ccgcggagat cttcgccgat ggcagctacg tcgatgtgac 802921 gggtacctcc aagggcaaag gtttcgccgg caccatgaag cggcacggct tccgcggtca 802981 gggcgccagt cacggtgccc aggcggtgca ccgccgtccg ggctccatcg gcggatgtgc 803041 cacgccggcg cgggtgttca agggcacccg gatggccggg cggatgggca atgaccgggt 803101 gaccgttctt aaccttttgg tgcataaggt cgatgccgag aacggcgtgc tgctgatcaa 803161 gggtgcggtt cctggccgca ccggtggact ggtcatggtc cgcagtgcga tcaaacgagg 803221 tgagaagtga tggctgcgca agagcagaag acactcaaaa tcgacgtcaa gacgccggcg 803281 ggcaaggtcg acggcgctat cgagctgccg gccgagctgt tcgacgtccc ggccaacatc 803341 gcgctgatgc accaggtggt caccgcccag cgggcggcgg cacgccaggg tacccactcg 803401 acgaagacgc gcggcgaggt cagtggcggt ggccgcaagc cctaccggca gaaggggacc 803461 ggtcgtgccc ggcagggctc gacgcgggcg ccgcagttca ccggcggtgg cgtggtacac 803521 ggtcccaagc cgcgcgacta cagccagcgc acacccaaga agatgatcgc cgcggcgctg 803581 cgcggggcgc tgtccgaccg ggcccgcaac gggcgtatcc acgcgatcac cgagctagtg 803641 gaaggtcaaa acccgtcgac caagagcgcc agggcatttc tggccagcct gacagaacgt 803701 aaacaggtgc tggtggtcat cgggcgcagc gacgaggccg gcgcgaaaag cgtgcgcaat 803761 ctgccgggcg tgcacatcct ggcgccggac cagctcaaca cctatgacgt gctgcgtgcc 803821 gacgacgtgg tgttcagcgt tgaggcgctg aatgcctata tcgcggccaa caccacgacg 803881 tccgaggagg tttcggcctg atggcgacgc tcgctgaccc ccgcgacatc atcctggccc 803941 cggtgatctc ggagaaatcc tatgggttgc tggatgacaa cgtgtacacg tttttggtgc 804001 gcccggattc caacaagacg cagatcaaga tcgccgtcga gaagattttt gccgtcaagg 804061 tcgcatcggt gaacaccgcg aaccggcagg gcaagcgtaa acgcacccgg accggatacg 804121 gcaagcgcaa gagcaccaag cgcgccatcg tcaccctggc gccgggcagc aggccgatcg 804181 acctgttcgg ggcaccggcc tagcccggcg acgatgcaga gcgaagcgat gaggaggagc 804241 agggcaatgc ggcctagccc ggcgacgatg cagagcgaag cgatgaggag gagcagggca 804301 atgcggccta gcccggcgac gagagcgtga gagaaagacc tgattagaca tggcaattcg 804361 caagtacaag cccacgacgc ctggtcgtcg cggcgccagc gtatctgatt tcgccgagat 804421 cacccggtca accccggaga agtcgctggt gcgcccgctg cacggtcgcg gtggacgcaa 804481 cgcgcatggc cggattacca cccggcacaa aggcggcggt cataagcgcg cttaccggat 804541 gatcgacttt cgccgcaatg acaaagatgg tgtcaacgcc aaggtcgcgc acatcgagta 804601 cgacccgaac cgtaccgcac ggattgcgtt gctccactat ctcgatgggg agaagcgcta 804661 catcattgca cccaacggac tttcgcaagg ggatgtggtg gaatccggcg ctaacgccga 804721 catcaagccg ggcaacaacc tgccattgcg caacatcccg gccggtacct tgatccacgc 804781 cgtggagctc cgcccgggag gtggcgctaa gcttgcgcgc tcggccgggt cgagcatcca 804841 gctgctcggc aaggaggcca gctacgcgtc gctgcgtatg cccagcggtg agatccgccg 804901 ggtcgacgtc cgctgccgcg cgaccgtcgg cgaagtgggc aatgccgagc aggcaaacat 804961 caactggggc aaggccggtc ggatgcggtg gaagggtaag cgcccgtcgg tccggggcgt 805021 ggtgatgaac ccggtcgacc acccgcacgg cggtggtgag ggtaagacct ccggcggccg 805081 tcacccggtt agcccgtggg gcaagcctga ggggcgtacc cgcaatgcga acaagtcgag 805141 caacaagttc atcgtccgac gccggcgcac cggcaagaag cactcgcgtt agccgcgcaa 805201 tcagatctag ggagtttcag gagtagccaa ccatgccacg cagcctgaag aagggcccgt 805261 tcgtcgacga gcatctgctc aagaaggtcg atgtccagaa cgagaagaac accaagcagg 805321 tcatcaagac ctggtcgcgt cggtcgacca tcattccgga cttcatcggc catacctttg 805381 cggtgcacga cggccgcaag cacgtccccg tgttcgtcac cgaatcgatg gtgggccaca 805441 aacttggtga gttcgcgccg acacgcacct tcaagggcca cattaaagac gaccgaaaga 805501 gcaagcggcg atgactgcgg ctactaaggc taccgagtat ccctcggcgg tcgccaaggc 805561 ccgatttgtg cgggtgtcgc caagaaaggc gcgccgggtg atcgatctgg tgcgtggcag 805621 gtcggtgtca gacgcgctcg acatcctgcg ctgggcgccg caggccgcca gcggtccggt 805681 ggccaaagtg atcgccagtg cggcggccaa cgcgcaaaac aacggcgggc tggacccggc 805741 aaccttggtg gtggccaccg tgtacgccga ccagggaccg accgccaagc gcatccgtcc 805801 gcgcgcccag ggccgcgcgt tccgcatccg ccggcgcact agccacatca cggtggtggt 805861 ggaaagccgg ccggccaaag atcaacggtc ggcgaaatcg tcgcgggccc gccgcaccga 805921 ggccagcaag gccgccagca aggtcggggc tacggcgccg gccaagaaag cggccgccaa 805981 agcgcccgcc aagaaggcac ccgccagttc cggcgttaag aagacacccg caaagaaagc 806041 gcccgccaag aaggcgcccg ccaaggcttc tgagacttct gcagcgaagg gaggctcaga 806101 ctagtgggcc aaaagatcaa tccgcatggc ttccggctgg gcatcaccac cgactggaag 806161 tcgcgctggt atgccgacaa gcagtatgcc gagtacgtca aggaggacgt ggcgatccgc 806221 cggctgctgt ccagtggcct agagcgtgct gggatcgccg atgtagagat cgagcggacc 806281 cgcgaccggg tccgggtgga cattcacacc gcgcgtccgg gcatcgtcat tggtcggcgt 806341 gggaccgagg ccgaccggat tcgtgccgac ctggaaaagc tgaccggcaa gcaggtccag 806401 ctcaacatcc tggaggtcaa aaacccggag tcgcaagcgc aattagtggc ccagggggta 806461 gccgagcagt tgagcaaccg ggtggcgttc cgccgcgcaa tgcgcaaggc gatccagtcg 806521 gcgatgcgtc agcccaacgt caagggaatc cgggtgcagt gctcgggccg cctcggcggc 806581 gcggaaatga gccgctcgga gttctaccgc gagggccgcg tcccgctgca caccttgcgg 806641 gcagatatcg actacggcct atacgaggcc aagaccacct tcggccggat cggtgtgaag 806701 gtgtggatct acaagggtga catcgtgggc ggcaaacgtg aattggctgc cgccgcgcca 806761 gcgggcgccg accgtccgcg ccgtgagcgg ccgtcgggca cgcgcccccg tcgcagcggt 806821 gcttcgggca ccacggcgac cggtaccgac gcgggtcggg ccgcgggtgg cgaagaggcc 806881 gcgcctgacg ccgcagcgcc cgttgaagcg cagagcacgg agagctgaat catgttgatt 806941 ccccgtaagg ttaaacatcg caagcagcac catcctcgcc agcgcggcat cgccagcggc 807001 ggcaccacgg tgaacttcgg cgactacggc attcaggccc ttgagcacgc ctatgtcacc 807061 aaccggcaga tcgaatcggc gcgtatcgcc atcaaccggc acatcaagcg tggcggcaag 807121 gtttggatca acatcttccc tgaccgcccg ctgaccaaga agcccgccga aacccgcatg 807181 ggttcgggca agggctcgcc ggagtggtgg gtagccaacg ttaagccggg ccgggtgctg 807241 ttcgagctca gttaccccaa tgaaggtgtc gcccgggccg cgctcacccg agcgatccac 807301 aagctgccga tcaaggcacg cattattact cgagaggagc agttctgatg gcagtgggtg 807361 tctcgccggg cgaactgcgt gagctcaccg acgaggagct ggccgagcgg ttgcgcgagt 807421 ccaaggaaga gttgttcaac ttgcgtttcc agatggcgac cggccagctc aacaataacc 807481 gccggctccg tacggtgcgt caggaaatcg cgcgcatcta caccgtgctg cgcgaacgag 807541 aactgggtct ggcgactggg cccgatggta aggaatcgtg atggcagagg ctaagaccgg 807601 cgcgaaggcg gcgcctaggg tggctaaggc cgccaaggcg gcccccaaga aggccgcacc 807661 caacgacgct gaggccatag gtgcggccaa cgcggcaaac gttaaggggc ccaagcacac 807721 tccgcgtact ccgaagccac gcggccgccg caagacacga atcggctatg tggtgagcga 807781 caaaatgcag aagaccattg tggtggagct ggaagaccgc atgcggcacc cgctatacgg 807841 caagatcatc cggaccacta agaaggtcaa ggcacacgac gaagacagcg ttgccggcat 807901 tggcgaccgt gtctcgctga tggagacgcg tccgctgtcg gcgaccaagc gctggcggct 807961 cgtcgagatc ctcgagaagg ctaagtaagc ctgacgagca gtcgcaaaag cccccgacac 808021 gcgcggcgtg cgggggcttt tgcgactgct cgcccaacca gcgcggcgtc agtgcggaaa 808081 tcctcagctg attcctaccc tgtgcgtgta gtgtacacaa ccgttcatta actccacggg 808141 gaagtgaggc tggcttatgg cacccgaggc caccgaggcg ttcaacggca ccatcgagct 808201 ggatattcgt gattcggagc cggattgggg cccatacgca gcgccggtgg caccggagca 808261 ctcaccaaac atcctttatc tggtctggga cgacgtcggc atcgcgacct gggactgctt 808321 tggcggcctg gtcgagatgc ccgcgatgac gcgcgtcgcc gagcgtggcg tgcgactgtc 808381 gcaatttcac accaccgcac tgtgctcgcc gacccgggcg tcgctgctga ccggtcgcaa 808441 cgccaccacc gtaggcatgg ctaccatcga agagttcacc gacgggttcc ccaactgcaa 808501 cgggcggatc ccggctgaca ccgcgttgct cccagaggtg ctggccgaac atggctacaa 808561 cacctactgt gtgggcaagt ggcacctgac gccactcgaa taatccaata tggcgtcgac 808621 gaagcggcac tggccgacct cgcgtgggtt cgagcggttc tacggattcc taggcgggga 808681 gaccgaccag tggtatcccg acctggtata cgacaaccac ccagtgagtc ctcccggcac 808741 acccgagggt ggctaccacc tgtcaaaaga catcgccgac aagacgatcg agttcattcg 808801 tgatgccaag gtgatcgcgc ccgacaagcc gtggttcagc tacgtgtgcc caggcgccgg 808861 gcatgcgccg caccacgtct tcaaggaatg ggcggacaga tacgccggcc gattcgacat 808921 ggggtatgag cgctatcgcg agatcgtgct ggaaaggcaa aaggcgctag ggatcgtgcc 808981 acccgacacc gaactgtcgc ccataaaccc ttatctggat gtgccggggc caaacggcga 809041 gacctggccg ctgcaggaca cggtgcggcc gtgggactcg ctgagcgatg aagaaaagaa 809101 gctgttttgc cggatggccg aggtgttcgc cggctttctg agctacaccg acgcccagat 809161 cggacggatc ctggactacc tcgaggaatc cggccagctg gacaacacca tcatcgtggt 809221 gatctccgac aacggcgcca gcggcgaggg cggacccaac ggatcggtca acgaaggcaa 809281 gttcttcaac ggctacatcg acaccgtcgc tgaaagcatg aagctcttcg accacctcgg 809341 tggcccgcag acctacaacc actaccccat cgggtgggca atggccttca acacccccta 809401 caagctgttc aagcgctacg cctcgcatga aggcggcatt gccgacccgg caatcatctc 809461 ctggcccaac ggcattgccg cacacggtga aatccgcgac aactacgtca atgtcagcga 809521 catcacgccc accgtctacg acctgttggg catgacaccg ccggggaccg tcaaggggat 809581 tccgcagaaa ccgatggacg gcgtgagctt catagcggcc cttgccgacc cggccgccga 809641 caccggcaag accacccagt tctacaccat gctgggcacc cgcgggatct ggcatgaagg 809701 ttggttcgcc aacaccattc acgcggccac gcccgccggc tggtcgaatt tcaacgctga 809761 ccgctgggaa ctgttccaca tcgcagcaga ccgcagccag tgccacgacc tggccgccga 809821 gcatcccgac aaacttgagg agctcaaggc gctgtggttc tccgaagccg ccaagtacaa 809881 cgggctgccg ctggccgatc tgaacctcct ggaaacgatg actcggtcgc ggccttacct 809941 ggtcagcgaa cgagccagct acgtctacta tcccgactgc gctgacgtcg gcatcggcgc 810001 ggccgtagag attcgcgggc gctcgttcgc cgtgctggcc gatgtgacca tcgataccac 810061 cggcgccgag ggcgtgctgt tcaagcacgg cggcgcccat ggcgggcacg tgctgttcgt 810121 ccgggacgga cgcttgcact acgtctacaa cttcctcggt gagcgccagc agctggtcag 810181 ctcgtcgggt ccggtcccgt cgggaagaca tctactcggg gttcgttatt tgcggaccgg 810241 aaccgtgccc aacagtcaca cgccggtggg cgatcttgag ctgttcttcg acgagaacct 810301 ggtcggcgcc ctgaccaatg tgctgaccca ccctggaacg ttcgggttgg ccggcgccgc 810361 tatcagcgtt ggccgcaacg gcggttcggc tgtgtccagc cactacgaag cgccgttcgc 810421 gttcaccggc ggtaccatca cccaggtcac cgtcgacgtg tcaggccgac cgttcgaaga 810481 tgtggaatcc gatcttgcgc ttgctttttc gcgtgactga gcggtctgct gtgacgcggg 810541 acggcgtggt cggcatacgc tgaagtcgtg ctgaccgagt tggttgacct gcccggcgga 810601 tcgttccgca tgggctcgac gcgcttctac cccgaagaag cgccgattca taccgtgacc 810661 gtgcgcgcct ttgcggtaga gcgacacccg gtgaccaacg cgcaatttgc cgaattcgtc 810721 tccgcgacag gctatgtgac ggttgcagaa caaccccttg accccgggct ctacccagga 810781 gtggacgcag cagacctgtg tcccggtgcg atggtgtttt gtccgacggc cgggccggtc 810841 gacctgcgtg actggcggca atggtgggac tgggtacctg gcgcctgctg gcgccatccg 810901 tttggccggg acagcgatat cgccgaccga gccggccacc cggtcgtaca ggtggcctat 810961 ccggacgccg tggcctacgc acgatgggct ggtcgacgcc taccgaccga ggccgagtgg 811021 gagtacgcgg cccgtggcgg aaccacggca acctatgcgt ggggcgacca ggagaagccg 811081 gggggcatgc tcatggcgaa cacctggcag ggccggtttc cttaccgcaa cgacggtgca 811141 ttgggctggg tgggaacctc cccggtgggc aggtttccgg ccaacgggtt tggcttgctc 811201 gacatgatcg gaaacgtttg ggagtggacc accaccgagt tctatccaca ccatcgcatc 811261 gatccaccct cgacggcctg ctgcgcaccg gtcaagctcg ctacagccgc cgacccgacg 811321 atcagccaga ccctcaaggg cggctcgcac ctgtgcgcgc cggagtactg ccaccgctac 811381 cgcccggcgg cgcgctcgcc gcagtcgcag gacaccgcga ccacccatat cgggttccgg 811441 tgcgtggccg acccggtgtc cgggtagtgc caacttcgca tgaggaactg cacacccagc 811501 agggcgtcag tcggcgcgac gagtcactcc cgggggctac gcatgaattc gactaccgga 811561 gcgggcctgg ctgggcgtgg gcgcgcgcag ttgtacggcc ccaacggcgt gtcgctgtac 811621 aaacacacgc cctcgctggt ccggttgccc caaaaagcca agccccccaa accagttgct 811681 cgccagcaat gacgccggtt gctaccatct gactccgtgt cgcttcccgg ggcaggcctg 811741 gggcagtggg ttatccggtg atgaccgatg gccggtagcg acccaccaac aggtgggccg 811801 gcgtcgcagg cgggttcaga cgcgggagcc tcgccagaac acaaacacat gtcgcggcga 811861 aagcacctcg tgctcgatgt ctgcatcatc ctgggtgttc tcattgccta cgtcttttcg 811921 ctgctcggct acgactggtt ggcccacaca ccgggtccgc ttccgcagcc ggacgtgggc 811981 acgactgacg acaccgtggt tttgatccgc ttcgaggagc tgcacactgt ggcaaatcgc 812041 ctcgatgtga aagtgctggt gctgcccgac gattcgatga tcgaccatcg cctccaagtg 812101 ttgactaccg acacctcggt gcggttgtat ccggagaacg aactcggaga tctgcagtac 812161 ccggtaggaa agctgcccgc gcaagtagcg accacgatcg aggcgcacgg caacccgggc 812221 gcctggccat tcgatacata caccaccgat acggtccagg ccgatgtgct cgtcggcgct 812281 ggcgacaacc gtcaatacgt acccgcccgg gtcgaagtga ccggatcgct ggaaggctgg 812341 gacatcagcg ccgtccgcgt cggggaaagc agccaaacct ctgatcgccc ggacaatgtc 812401 atcatcaccc tgaagagggc caagggtccg ctggttttcg acctgggcat ctgcctggtg 812461 ctgatcacat tgccgacgtt ggccttgttc gtggccatcc agatgattac cggccgcaga 812521 aaattccaac caccgttcgg cacttggtac gccgcgatgt tgttcgctgt cgtgccgctg 812581 cgcactattc tcccgggctc gccgccggcg ggtgcgtgga ttgaccgggc cgttgtgatc 812641 tgggtgctca tagcgctggc ggcggcgatg gtggtgtaca tcgtcgcctg gtaccgagaa 812701 tcggactagg gcgggcgtca gatggcttct gtcgacgcgt ccggagggtt tccgctggat 812761 ttcataaaca ggcgctagcg cggtgtccaa cgatacgatt ggggcccatg cggcccgacg 812821 agatcggctc gctgcgggcc ggcctggcgg ctgttgcgcg gtgaactcaa aacgcgttga 812881 cgccggatca gctatccgat gattcaggcg gagatctcga cgatcgtggg cgctaccgcc 812941 aatccggtat ccgggtagat catgatcgac atgggttgat ctgccctggt ggggcggact 813001 cacattagcg aaattttgcg ctgagtaggt cgtcccctaa acttcagggg ttgccgtgag 813061 cagacctcgg ccggcgcgca taagctttgc ttggtcggcc ccgcgtgccc gtcggcgaca 813121 aagaccgcgc acgtcaggga tggtcctggc tggctcctcc taccgtgcac acgtcaacca 813181 ggtcaggaga tctagtgatt cagcaggaat cgcggctgaa ggtcgccgac aacaccggcg 813241 ccaaggagat cttgtgcatc cgggtgctgg gcggttcgtc gcgacgctac gccggcatcg 813301 gtgacgtcat cgtcgccacc gtgaaggacg ccattccggg cggcaacgtt aagcgggggg 813361 atgtcgtcaa ggccgtcgtg gtgcgcacag tcaaggaacg ccgacgtccc gacggcagct 813421 acatcaagtt cgacgagaac gccgcggtga tcatcaagcc cgacaacgac ccgcgcggca 813481 cccgcatttt tggaccggtc ggtcgcgagc tgcgggagaa gcggtttatg aagatcattt 813541 cgctggcccc ggaggtgttg tagatgaagg tccacaaagg cgacaccgtg ctggtgattt 813601 cgggcaaaga taaaggggcc aagggcaaag tcttgcaggc gtatccggac cgcaaccggg 813661 tattggtcga gggtgtcaac cggatcaaga agcacaccgc gatctcgacc acccagcggg 813721 gcgcgcgttc gggtgggatc gtcacccagg aagcgccgat ccatgtctcc aacgtgatgg 813781 tggttgactc cgacggcaag cccacccgaa tcggctatcg ggtcgacgag gagaccggca 813841 agcgcgtccg tatctccaag cgcaacggca aggacatttg atgaccactg cacagaaggt 813901 tcagccgcgc ctcaaggagc gctaccgcag tgagattcgg gatgcgctgc gcaagcagtt 813961 cggctacggc aatgtcatgc agatcccgac ggtgacgaaa gtcgtcgtca acatgggtgt 814021 cggcgaggcc gcccgggacg ccaagttgat caacggggcg gtcaacgatt tggcgctgat 814081 caccgggcag aagccggaag tccgccgggc gcgcaagtcc atcgcgcagt tcaaattgcg 814141 tgagggcatg ccggtgggcg tccgagtcac gctgcgcggt gaccggatgt gggagttcct 814201 tgaccggctc acgtcgatcg cactgccacg catccgtgac ttccgtgggc tttcgcccaa 814261 acagttcgac ggtgtgggca actacacctt cgggctggcc gagcaggcgg tattccacga 814321 ggtcgacgtg gacaagattg accgggtccg tggcatggac atcaacgtcg tcacttccgc 814381 ggcgaccgac gacgaaggcc gagcgctgtt gcgggccctc ggctttccct tcaaggagaa 814441 ctgagcagat ggcgaagaag gcactggtca acaaggccgc aggcaaaccg aggtttgccg 814501 tgcgcgccta cacccgttgc agcaagtgcg gccgcccgcg tgcggtctac cgcaagttcg 814561 ggctgtgcag gatttgcctg cgcgagatgg cgcacgcggg tgagttgccc ggcgtgcaga 814621 agagcagctg gtaacgggac acggggacta gaacatatga ccgcgctgac gacgatgcag 814681 tgggggtacc cccagacgcg cagcggcgag ggggccgcaa gcgatgagga ggagtagcgc 814741 tcgatgaccg cgctgacgac gatgcagagc gcaagcgatg aggaggagta gcgctcgatg 814801 acgatgacgg acccgatcgc agactttttg acccgtctgc gtaacgccaa ctcggcgtat 814861 cacgacgagg tcagcttgcc gcactccaag ctcaaggcca acatcgcgca gattctcaag 814921 aacgaggggt acatcagcga cttccgaacc gaggacgctc gggtcggtaa atcgctggtt 814981 atccagctca agtacggccc tagccgggag cgcagcatcg ccgggttgcg gcgggtgtcc 815041 aagcccggcc tgcgggtgta cgcgaaatcc accaatctgc cgcgggtgct cggcggcctg 815101 ggcgtggcga tcatctcgac ctcctcgggc ctgctgactg accggcaggc agctagacag 815161 ggcgtgggcg gcgaagtcct cgcatatgtc tggtgagagt gtggtgagag gaagcaacca 815221 tgtcgcgtat tggtaagcag ccgattccgg tgcccgccgg ggtcgacgtc acgatcgagg 815281 gacagagcat ctcggttaag gggcccaagg gcaccctagg actgacggtc gccgagccaa 815341 tcaaagtggc acgcaatgac gacggcgcta tcgtggtcac ccgtcccgac gatgagcggc 815401 gtaatcgctc cttacacggg ctgtcccgta ccctggtgtc caacctggtc actggcgtga 815461 cgcaggggta caccaccaag atggagatct tcggggttgg ctatcgggtg cagctcaagg 815521 gctccaatct ggagtttgcg ctggggtaca gccacccggt ggtgatcgag gctcccgaag 815581 gaatcacgtt cgccgtccag gcaccgacga agttcaccgt ttccgggatc gacaaacaaa 815641 aagtcggcca gatcgccgcc aatatccgcc gtcttcgccg tcccgatccg tacaagggca 815701 agggcgtgcg ctacgagggc gagcagatcc gccgcaaggt cggaaagaca ggtaagtagc 815761 catggcgcaa tcagtttccg cgactcgacg aatctcccgc ctgcgccggc acacgcggct 815821 gcggaagaag ctctcgggca ccgcggagcg cccgcggctg gtggtgcatc ggtccgcgcg 815881 gcacatccac gtgcaactgg tgaacgacct caacggcacc accgtggccg ccgcttcgtc 815941 gatcgaggcc gatgtgcgcg gcgtgccggg tgacaaaaag gcccgcagtg tgcgggtcgg 816001 ccagttgatc gccgagcggg ccaaagccgc cggcatcgac accgtggtat tcgaccgcgg 816061 cgggtatacc tacggcggac gaatcgccgc gctggccgac gccgcacgcg agaacggatt 816121 gagtttctga tgaacgggag gaccgcataa tggcggagca gccggccgga caggcaggca 816181 ctaccgacaa ccgtgacgca cggggtgatc gggagggccg gcgccgcgac agcggccgcg 816241 gcagtcgtga acgggatggc gagaagagca actatctaga gcgggtcgtc gccatcaacc 816301 gcgtctccaa ggtggtcaag ggtggtcggc gcttcagctt caccgctttg gtcatcgtgg 816361 gcgacggtaa cgggatggtc ggtgtcggct acggcaaggc caaggaagta ccggccgcga 816421 tcgccaaggg cgtcgaagag gcgcgcaaaa gcttcttccg ggtaccgctg atcggcggca 816481 ccatcacgca cccggtgcag ggcgaggcgg ccgccggtgt ggtgttgcta cggccggcca 816541 gcccgggtac cggtgtgatc gccggtggtg cggcccgcgc ggtgctggaa tgtgcggggg 816601 tgcacgacat cttggccaag tcgctgggca gtgacaacgc gatcaatgtg gtgcacgcca 816661 ccgtggccgc gctcaagctg ctgcagcgtc cggaggaggt ggcggcgcgc cgcggtttgc 816721 cgatagagga cgtcgccccg gccgggatgc tgaaggcgcg tcggaaaagt gaagcgctgg 816781 ccgccagcgt tttgccggat agaacgatat agccatgtca cagctgaaga tcacccaggt 816841 gcgcagcacc atcggagcac gctggaagca gcgcgagagc ctgcgcactc tgggcttacg 816901 aaggattcgt cattcggtga tccgtgaaga caacgcagcg actcgcggac tgatcgcggt 816961 ggtgcgtcac ctcgtggagg ttgagcccgc gcagaccgga gggaagacat agtgacgctc 817021 aagctgcatg acctgcgccc cgcgcggggg tccaagaccg cccgcacccg agtcggtcga 817081 ggtgacggct ccaagggcaa gacggccggc cgtggcacca agggcaccag ggcccgcaag 817141 caggtgccgg tgaccttcga gggcgggcag atgccgatcc acatgcggct gcccaagctc 817201 aagggcttcc gtaaccggtt tcgcaccgaa tacgaaattg tcaacgtcgg cgacatcaac 817261 cggctgtttc cgcagggtgg tgccgtcggc gtggacgacc tggtggccaa gggggccgtc 817321 cgcaagaacg ctctggtcaa ggtgttgggt gacggcaagc tgaccgccaa ggtcgacgtg 817381 tccgcgcaca agttcagcgg cagcgcgcgc gcgaagatca ccgcagcggg cggttcagcc 817441 accgagctct agtttcgggc gagcagacgc aaaatgcccc cgaaatgccc attttcgggg 817501 gcttttgcgt ctgctcgcgg gcccttggcg gccggtgggt acgctgggtg aatatggttg 817561 cctttctgcc ttccattccc gttgtcgagg acctacgcgc cctggtcggc cgggttgata 817621 ccgcccgcca ccacggtgta cccaacggct gcgtgctcga attcaacctg cggtcggtgc 817681 cgccggagac gacgggcttc gaccctctta cggtgctcac cgggggtggg cggccgatgg 817741 cgctgcgcga tgcggtcgcc gcgatccacc gtgccgccga ggacccccgg gtagccgggc 817801 tgatagcccg cgtgcagctt ccgccctcgc cggcgggggc ggttcaggag ctgcgggagg 817861 ccatcgcggc cttcagtgcg gtcaagccgt cgctggcctg ggccgaaact tatccgggca 817921 ccctgtccta ctatctggct tcggcgttcg gtgaggtctg gatgcaaccc tcggggagtg 817981 tggggctggt cggcttcgcc accaacgcca cattcctgcg cgacgccctg cacaaggcgg 818041 gcatcgaggc ccagttcgtc gcccggggcg aatacaagtc ggcggcaaac cttttcaccg 818101 aggatggctt cacagacgcc caccgcgaag cggtcacgcg gatgctggac agtctgcagg 818161 accaggtgtg gcaggcggtc gccaagtcgc gcaatatcgg cgtcgatgcg cttgatgagc 818221 tggctgaccg ggctccgcta ttgcgggacg acgccgtgac ttgcggtctg atcgaccgga 818281 tcggatttcg cgaccaagcc tacgcccgta tggcggaatt ggttggtgtg gaaaaaggtt 818341 caccggaatc cagtggctcg caaacaagcc cagacgaaaa gccgccgcgg atgtacctgg 818401 cgcgctacgc cagttcggcc cggccacggc tgacgccccc cgtcccatcg attcctggtc 818461 gccggtccaa gccgacgatc gcggtggtga ccctggaagg cccgatcgtc aacggtcgtg 818521 gtgggcccca gtttctgccg ctcggtccgt cgagcgccgg cggtgacacc atcgcggcag 818581 cgctgcggga ggtggccgcc gacgattcgg tgtcggcgat agtgctgcgg gtcgacagtc 818641 cggggggctc ggtcaccgca tcggagacta tctggcgtga ggtggccagg gcccgcgacc 818701 gtggcaaacc ggtggtggcg tcgatgggtg cggtcgccgc ctccggtggc tattacgtgt 818761 cgatgggtgc cgacgccatc gtggccaacc cgggcaccat caccgggtcg atcggtgtga 818821 tcaccggaaa gctggtggtt cgggatctca aggaccggtt gggtgtcggg tcggatgcgg 818881 tgcgcaccaa cgctaatgcc gatgcctggt cgatcgacgc acccttcacc ccggaccagc 818941 aggcccatcg cgaggcggag gcggacttgt tctacagcga cttcgtggaa cgcgtcgccg 819001 agggccgcaa gatgactacc gacgccgtgg acgtcgttgc gcgaggccgg gtctggaccg 819061 gtgccgacgc tctcgatcgc ggcctggtcg acgaactcgg cggccttcga accgcggtgc 819121 gtcgcgcgaa ggtgctagcc ggactagatg aggacaccga ggttcgcata gtcagttatc 819181 cggggtcgtc actctgggac atggtgcgac cgcgtccgtc gtcacgaccg gcagcggcat 819241 cgctgccgga tgctatgggt gcgctgcttg cccgttcgat cgtcggcatc gtcgagcagg 819301 tggaacagac tctcagtggt gccagcgtgt tgtggctggg ggagtcgcgc ctctagccgt 819361 tcaaacgacc gctgatgaag atgatttcgc cgagcggatc gtcgtcgtgt ggggcgggaa 819421 cgggcaaacc attgcgcctg aataggtcgg tccgcactgt gccctcaacg tcccagccct 819481 tggcgcgcag gtagtcgacg acgtggctgc gttcgccgga atacaccagc gacgccatgt 819541 cgatgtccac gccgtgcttg cgaaacgaat ccgccatttc tcgtacccgg cctgcgtcga 819601 aatccacaat gcccgggaca agttcggtag cgatcgtgct gcccgcaaca ctgagttcgg 819661 tgctgttgtc gaacaaccgg tcctgggatc cggcggcagg tagatcagca tgccttcggc 819721 caaccatgct gtcggtgccg tcgagtccag gccggcagct tgcagtgccg ccggccagtc 819781 cgcgcgcaag tcgatgtaca ccgtgcgccg aatggcggtg ggcttggcgc cgatgccggc 819841 caaggtggtt gtcttgaagt cgatcacctg tggttggtcg atctcgtaga ccacggtgcc 819901 ggccggccac ggcaaccgat aggcgcgcgc gtccaacccg gctgccagga tcaccacttg 819961 tcgcactccg ccgtccgtgg cagtgcggaa gtagtcgtcg aagtacttgg tgcgcaccgc 820021 tatcccgtcg atcatcgcct gtgcccgccc cggcgaaagg ttcccggtcg tcgcgatatc 820081 gagctcgccg tcgatcaact tggtgaagaa atccagcccg accgcgcgca ccagcggttc 820141 ggcgaacggg tcgttgatca aacctcgtgg atccttggtc gccaacgcgc gtccggcagc 820201 aaccatggtc gcggtagccc cgacgctgga ggctagatcc cagttgtcgt cgtgagcgcg 820261 cggcatctgc gccctatgtc cgggtcgcag cgacgtagtt cattgtgccg ggacggccgc 820321 tgcagcgctg aggtcggcca gtgtacgcga ccgccaactc agccggtaag ccctggcggc 820381 ggtggagcag tcgtcgaagc ctggtgagca tcactgcgag tcatcgtgta ggcggccgat 820441 ttcgacagtt cattgacggg gcaagcggta tggcgccacg aaggtgctgg cttgcggtgt 820501 gctgggtact gtctgtgttt ccgttgcaac ctggcgctga catagaagaa atcaggcaac 820561 ggcacttctt cgtcctcgaa cggttgaaat ccgttggcgg tcagcaggtc ctgggatttg 820621 atctcggtca gcagccaacc attgtcggat aggtacgagg cgggctcgtt gcggtcgccg 820681 aagtacacca gctcattcat gtctagatcg aaaccgtatg cgcgccaacg attggcgagg 820741 atcgtcatgc gctccctcat ccgttcctca tgatgcggct tgaagttgcg tatgctctcg 820801 gttgcaaacc tgctgtccgg cacactgagc gcggtgacat tgtccaacaa gcggtcctgc 820861 gcttccggcg ggaggtagcg gagcaaccct tcagcgctcc acgcggtggg ctgggtcggg 820921 tcgaatcccg ccgcgcccaa cgcggtgggc caatccgcac gcaaatcggc ggtgaccacg 820981 cgccggtcgg cggtgggcgt ggcgcccagt tcggcgagtg tgcgagtttt gaactccatg 821041 acttgcggtt ggtcgatctc atacaccacg gtctgggcgg gccaggccag ccggtatgcc 821101 cgggaatcca atcctgaggc caggatcacg acctgcctga tgcccgcgcg tgtcgcatcc 821161 atgaagaact cgtcgaagaa cttggtgcgg acggcatggt gttcggccat acggaccatg 821221 gacgcattcg ggcgttccgg atcgtcgatg tctgaggccg tcaattcccc gctcgcgagc 821281 cgggtcagaa cgtccacccc caccgcccgg accagcggct cagcgaactg atcgttgatc 821341 agtgggttgg cggcgcgggt cgccatcgcg cgagccgccg caaccatcgt ggcggtcgcc 821401 ccgacgctgg atgccagatc ccaggtgtcc ccttcgcacc tgatggaacc ggtgtatgtc 821461 atgcacggcc tctcttcaaa aagcggggat aattccttag taaagttaac aacaggcgac 821521 aaattccgcg acttggaaag gctggcgcga tcggcggcgt cggggtgccg ccataggggg 821581 cgcacgtggg ggtcctggct gttgagcgtg aataccgcga tgggttttcg gcgtgtcgcg 821641 tggtgcgatt cactctcggt gcggctagag cggattcgcg cgcagatagc cgtagacgcc 821701 cgtgaagtta cggcacacgt cctcaggaat tggcaccggt ccaccgagag cgcgggcacc 821761 ccaaacgatt tgtgcggtgc gctcaacaag ggcggtgacg cgcagcacct ggtcggggcg 821821 gggccccacg gccaccaggc cgtggttggc gatcagggcg gcggcgcggc cctcaagcgc 821881 gcgcaccgcg ttgcggccga cctcgggtgt accggacgcg gcgtactcgg tgcagcgaac 821941 gtccccgccg cagtagatcg cgaactcgtc gatgcaggcg ggaatcggct catgggcgac 822001 ggcgaacatg gtcgcccaca ccgggtggct gtggatcacg ctgccaatgt cgtcgaatgc 822061 gcgatagcac gccaggtgta ggtttagttc ggtcgacggc gaccggccgt ccttggcgtg 822121 cagcaccgca ccgccggcgt cgactagcac cagatcgtgg agcagcatct cggcgtagtc 822181 gaccgaggac ggcgtgatga ccacgttgcc gtccgagcgc ctggctgaga tatttccggc 822241 ggtcccctcg accaggcccc gacgcaacat gtccttggcg gccgccagca ccgcggattc 822301 cgggtcgtca acgaagttca tgagcccaat acctccgggt tgacgacatg ggcgggcctg 822361 ttgccggaca gcagtgcgcc caggtcgtcg gcgaccatcc gcgcctgccg ggcctcggtg 822421 ttccaggtgg ccccgccgat gtggggggtg aggacgacat tgggcatgct caccaaaggg 822481 tgatcggtcg gcagccattc accggtgaag tggtccaggc cggcggcggc cagcttgccg 822541 ccacgcaggg cgtcgacgag cgcatcggtg tcgtgcagct gggaccgggc ggtgttgaga 822601 aacaccgcac cgtcgcgcat ggccgcgaac tgctgggcac cgatcatccc gatcgtgtcg 822661 tcggtgaccg ccgcgtgcat ggagacgatg tcagcctcgg ccagcagctc gtcaaggctg 822721 tggccggcgt cgtcgcggta aggatcgtgc gcgatgaccc gcaggcccag cccggacagc 822781 ctccagcgca ccgcgcgacc gacggcaccc aggcccacca gcccggcagt cagcccggcg 822841 atttcggcac cgcggaaccg ctgatagggg atggtgccgt cgcgaaagat gttgccggac 822901 cgcacatctg cgtccgcggg aatcaggtgc cgggcgacgg ccagcaacag ggccaccgtc 822961 atctcggcga cagcgtcggc gttgcgagcc ggggtgtgca gcaccggtat gccggccgcg 823021 gtggcgccgg ggatgtcgac gttgctggga tccccgcggg tggcggcgac cacccgcaac 823081 ccccgctcga acaccgggcc accgaccgag tcactttcca ccacaagaac atcggcggcg 823141 acggcggtga tccggtcagc tagctgctcg gcgctgtaga ttcgcagcgg tcgctgatcg 823201 atccacgggt cgtataccac gtcggctagc cgccggagct gggcgaaccc cggtccacgc 823261 aatggagccg tcaccagagc acgcggtcga ggcgtcacgt ttgccaatgc tggcgtacgg 823321 tggcgcccgt gtcacgcgac gacgtcacaa tcggcatcga tatcggcacc accgccgtca 823381 aagcggtggc cgccgacgac aacggtcggg tgacggcgcg ggtacggatt ggccaccagc 823441 tggcggtgcc ggcccccgac cggctggagc acgacgccga cgaagcgtgg cggcggggac 823501 cattggcagc actggaccgg ctggtcggac ccgacacccg ggcactggcc gttgccgcga 823561 tggtgccatc gctgaccgct gtcgatcccg ctggccggcc gatcacaccc gggctgctgt 823621 acggcgacgc caggggtcgg gtaccgaacg cctcggtggc acgggcgcag tcggtgccgt 823681 cggtgggtga gaccgccgag tttctgcgct ggacggccgg ccaagcgccg gatgcgtccg 823741 ggtactggcc ggcgccggcg gtggccaatt acgccttgtc gggcgaagcg gtcatcgact 823801 atgccacggc cgtcacgact ctcccgttgt tcgacgggac gggatggaac gcgaccgctt 823861 gcgccgactg cggtgtgacc gttgaccgga tgccgcgggt ggagacgttc ggagtgggag 823921 tggggcaggt gcgcggcacc ggcgcggtgc tggcggtcgg tgccgtcgat gccctgtgcg 823981 aacagatcgt ggccggcgcc gaccgcgacg gcgacgtgtt ggtgctatgc ggcgccacct 824041 tgatcgtgtg gaccaccatc tccgcggctc gtcaagtgcc gggtttgtgg accatcccgc 824101 atacggcacc gggcaagagc cagatcggag gggccagcaa cgctggtggg ttgttcctca 824161 actgggtgga tcgtgttatt ggaccgggcg atccagcgct agccgatccg cggcgggtgc 824221 cggtgtggct gccctatata cgcggcgagc gcaccccgtt ccatgagccc gatcgccggg 824281 ccgtgctcga cggtgtggat ctctcccagg acgccgcatc ggtgcggcgg gccgcctacg 824341 aggcgtcggg cttcgtcgtg cgccagctca tcgagctaag cggggcgccg gtggcgcgca 824401 tcgtggcggc aggcggcggc acccggatac agccttggat gcaggctatc gccgacgcga 824461 ccggccggcc ggtggaggtg tccagggtgg ccgaaggggc ggcactggga gcggctttcc 824521 tcggccgctt ggcggccgga ttggaatcgt cgatcgccga cgctgcccgg tgggcctcaa 824581 ccgaccgcat tgtcgaaccc agtgccgact gggcggggcc gaccaaggaa cgctatcgcc 824641 ggttcctggc gctcagcggc tcgaagttgg cctgacggtg gaccaagatg catggcgcaa 824701 gaactggtgt gtcgttctac gcttatgcaa tgacagatca cgaccagacc gcggcccgtc 824761 gagagatcgc cgatgccctg ctcgccgcgc tggaacgtcg gcatgaggtc gcagacgcca 824821 tcgtggaggc cgccaacaag gccgccgccg tcgaggcgat cgtgaacttg ctgggcacct 824881 cgcacttggc cgccgaagcg gtgatgagca tgtctttcga tcagctcacc caggatgcgc 824941 gcacaaagat catcgccgag ctcgacgacc tgaacaaaca gctgagcttc accgtcaagg 825001 agcgtccagc cagctctggt gagggcctgg agctgcggcc gttctcccca gatgaggacc 825061 gcgacatctt cgctcgacga accgaagaaa tgggcgccgc cggcgatgga tccgggggac 825121 ccgccggcag cgtcgacgac gagatccgag ccgcacagaa gcgcgtcgac gacgaggagg 825181 cggcttggtt cgtggctgtt gattccggcg tcaaggtcgg gatggtgttc ggcgagcttg 825241 tccacggcga ggtggacgtc cggatctgga ttcaccccga tcatcgaaaa aagggttacg 825301 gaaccgcggc attgcgcaag tcgcgctcgg agatggcctg ggcgttcccg gccgtgccga 825361 tggtcgcccg cgcgcccgcg gcccaacccg cccagccggg aagtgccggc cggtagcatc 825421 cggttcggtc tggcaggcgg tcgccaggcc gatcggcggc gaatccgcgg cgccaacgct 825481 gccgccggat cccaactggc ttaatcagcg tgtgtcttgg tgtttctgct tcagttcggc 825541 ggagacatag atcacctcgc cgaacggggc gtcgtcgccg tcgatcggcg gcagtccgtg 825601 ttcagccagc aagtcggtgg tgctcgcgct ggcggttcgc cagccgtggt cggccagata 825661 ggtccgcgcg tccgtgcggt cgccgaaata caccaggccc gacatgtcga ggtcgaggcc 825721 atggcgcctg aaccgctccg ccaggcgccg catgcgtccc cgcagttctt cttcgttgag 825781 ccgattgatg tcgcgcagga cttcggtggc gaactggctg cccggtacgc tctgggcggt 825841 gatctggtca agcagccggt cctgcgcctc ggcggacaga tagcccagca gcccctcggc 825901 gatccaggcg gtccgctgcg cgttgtcaaa gccggctttt tgcagggcgg tgggccagtc 825961 gtcgcgcaaa tcgaccgcca cggtgcgccg gtcggtcgtg ggtgccgcac ccaggccggc 826021 cagcgtcgtg gtcttgaagt cgatcacctg cggctgatcg acttcgaaga cgatggtgcc 826081 ggctggccag cgcagccggt aggcgcggga atccaggccg gaagccaaga tgacggcctg 826141 ccgaatcccg gctcgggtgg catccagaaa gaagttgtcg aagtagtgag tgcgaatggc 826201 catcgcgtcg gcgaaccgcc gcaggccgtt ggcctcgtct tcggctagct cgtcgggatc 826261 cagttcgcca ctggccatgc gtacgaagaa gtcgacgccg accgcgcgga ccagcggttc 826321 cgcgaactgg tcgttgacca gcgcgccggg agcccggccg gctaccgctc gggccgccgc 826381 caccatggtg gccgtcaaac ccacactgga cgccaagtcc cacgaatcgc cctcaaagcg 826441 ggcactgccc gtttgcgtca tctgtaaccc cttcgatagc tcgcaccgtg gcggcccgga 826501 acgggccagt ccataccagc tgttagtctc ttacacgatt tggcgcgcga cgccgtacgt 826561 cctggcctgc gggtgttggg cgcgtgatgc aagatgaccc cgggctgcgc aggaggatag 826621 agtgctttcg gctttcatct cgtcgctgcg aacagtcgac ttgagacgaa agatcctctt 826681 cacgctgggc atcgtcattc tctaccgtgt cggtgccgcg ctgccgtccc ccggtgtcaa 826741 ttttccgaac gtgcagcagt gcatcaaaga agccagcgcg ggcgaagccg gacagatcta 826801 ttccctgatc aacctgttct ccggcggtgc gttattaaag ctcacggtgt tcgcggtggg 826861 ggtgatgccc tacatcaccg ccagcatcat cgtgcagctg ctcaccgtgg tcatcccgag 826921 gttcgaggaa ctccggaagg aaggccaggc gggtcagtcg aagatgaccc agtacacccg 826981 ttacctagcg atcgcgttgg ctatccttca agccaccagc atcgtggcgt tggctgccaa 827041 cggcgggttg ctacaaggtt gctcgctgga catcatcgcc gaccagagca ttttcacact 827101 ggtcgtcatc gtgctcgtga tgacgggcgg cgccgcgttg gtgatgtgga tgggcgagtt 827161 gatcaccgaa cgcggcatcg gcaacggcat gtcgctgctg atcttcgttg gcatcgctgc 827221 ccgcatcccg gccgaaggtc aaagcatcct ggaaagccgc ggtggagtcg tcttcaccgc 827281 ggtctgcgcg gccgcgttga tcatcatcgt cggtgtggtg ttcgtcgaac agggtcagcg 827341 ccggattcca gtgcaatacg ccaagcgcat ggtgggccgg cggatgtatg gcgggacttc 827401 gacttatctg ccgctcaagg tcaaccaggc cggcgttatc ccggttatct tcgcgtcgtc 827461 gctgatctac attccgcacc tgatcaccca gctgattcgc agcggcagcg gtgtcgtggg 827521 aaacagctgg tgggacaaat tcgtcggcac gtacctgtcc gacccgagca acctggtcta 827581 catcggcatc tacttcggcc tcatcatctt cttcacctac ttctacgtgt cgatcacctt 827641 caaccccgac gaacgtgccg acgagatgaa gaagttcggc ggcttcattc cgggaattcg 827701 gccgggccgt ccgaccgcag actatctgcg ctatgtgctg agccggatta ccttgccggg 827761 ctcgatttac ctcggcgtga tcgccgtgct gcccaacctg ttcctccaga tcggcgccgg 827821 tggaaccgtg cagaacctgc cctttggggg taccgcggtg ctgatcatga tcggtgtcgg 827881 tttggatacg gtcaagcaga tcgagagtca gctcatgcag cgcaactacg aagggttcct 827941 caagtgagag ttttgttgct gggaccgccc ggggcgggca aggggacgca ggcggtgaag 828001 ctcgccgaga agctcgggat cccgcagatc tccaccggcg aactcttccg gcgcaacatc 828061 gaagagggca ccaagctcgg cgtggaagcc aaacgctact tggatgccgg tgacttggtg 828121 ccgtccgact tgaccaatga actcgtcgac gaccggctga acaatccgga cgcggccaac 828181 ggattcatct tggatggcta tccacgctcg gtcgagcagg ccaaggcgct tcacgagatg 828241 ctcgaacgcc gggggaccga catcgacgcg gtgctggagt ttcgtgtgtc cgaggaggtg 828301 ttgttggagc gactcaaggg gcgtggccgc gccgacgaca ccgacgacgt catcctcaac 828361 cggatgaagg tctaccgcga cgagaccgcg ccgctgctgg agtactaccg cgaccaattg 828421 aagaccgtcg acgccgtcgg caccatggac gaggtgttcg cccgtgcgtt gcgggctctg 828481 ggaaagtagt catgcgccca ctggcacggc tgcggggtcg cagggtcgtg ccgcagcgca 828541 gtgccggcga actcgacgcg atggccgcgg cgggcgccgt cgttgccgcc gcgctgcggg 828601 cgatccgtgc ggcagcggct cccggcacat ccagcctgag tctcgacgag atcgccgagt 828661 cggtgatccg cgaatccggc gccaccccgt cgtttctggg ctatcacggc tacccggcct 828721 cgatctgcgc gtcgatcaac gaccgggtgg ttcatggcat cccgtcgacc gccgaggtgc 828781 tcgcgcccgg tgatctggta tccatcgact gcggtgcggt gctggacggt tggcatggcg 828841 atgcggcgat cactttcggg gttggcgccc tgagcgacgc cgacgaagcg ctgtcggagg 828901 cgacaaggga atcgcttcag gccggcatcg ccgcgatggt ggtcggcaat cggttgaccg 828961 acgtcgcgca tgccatcgaa acgggtaccc gtgccgccga gctccgttat ggacgctcgt 829021 tcgggatcgt cgccggttac gggggccacg gcatcggccg ccagatgcat atggatccgt 829081 tcttgccgaa cgagggtgcg ccggggcgcg gtccgctgct ggctgccggc tcggtgctgg 829141 ccatcgaacc gatgctgacc ctcggtacca ccaaaacggt ggtgctcgac gacaaatgga 829201 cggtcacgac cgccgatggg tcacgtgcgg cacactggga acacaccgtg gcggtaaccg 829261 acgacgggcc ccgaattctg acgctcggtt agcgcggctg ccggcgcggg cagtggtgaa 829321 ccaaactctt actcgactcg tgtcagtaag cgggaggtga tcgcgtggct cgtgtgtcgg 829381 gcgccgcggc cgctgaagcc gcgttgatga gggcgctcta cgacgagcat gccgccgtgt 829441 tgtggcgtta cgcgctgcgc ttgaccgggg atgcggccca agccgaagac gtcgtccaag 829501 agacgctgtt gcgggcgtgg cagcatccgg aggtgatcgg cgacaccgcg cggccggcaa 829561 gggcgtggtt gttcaccgtc gcgcgcaaca tgatcatcga cgagcggcgc agcgcccggt 829621 tccgcaatgt ggtcggttcg accgaccaat cgggcacacc cgagcagtcg acgccggacg 829681 aggtgaacgc cgcactggat cggctgctga tcgccgatgc gctggcccaa ctgtccgccg 829741 agcatagggc cgtgatccag cggtcctact accgcggatg gtcgaccgca cagattgcca 829801 ccgacctcgg aattgccgaa ggaacggtga agtcgcgatt gcactacgcc gtgcgcgcgt 829861 tgcggctcac tctgcaggaa ctgggagtta ctcgatgacg gcagagccca ttcgcatggc 829921 tgccggctcc ggatacgtga gggtgacagg agagagatga catgacgatg ccgctacgag 829981 gacttggccc gcccgatgac accggtgtgc gcgaggtgtc gacgggtgat gatcaccact 830041 acgcgatgtg ggatgcagct tacgtgttgg gagcattgtc tgcggccgac cgccgcgaat 830101 tcgaagcgca cctggccggt tgccccgaat gccggggggc cgtcaccgaa ctctgcgggg 830161 tgcccgccct gctgtcccag ctcgatcgtg acgaagtggc cgcgattagc gaatccgccc 830221 cgactgtggt ggcttcgggg ctgtcgccgg agttgttgcc gtcgttgctg gcggcggtgc 830281 acaggcgtcg gcgccgtacc cggctgatca cctgggtggc ctcgtccgcc gctgccgcgg 830341 tgctggcgat cggtgtgcta gtcggtgtgc agggccactc cgcggcaccg cagcgggcgg 830401 ccgtgtcggc gctgccgatg gcccaggtcg gcacgcagct gttggcgtcc acggtgtcga 830461 tcagcggcga gccttggggg acgttcatca acctgcggtg cgtctgcctg gcgccgccgt 830521 atgcttccca cgacacgctg gccatggttg tggtgggtcg tgacggcagc cagacacggc 830581 tggcgacttg gttggccgaa cccggtcaca ccgcgacacc cgccggcagc atttcgacac 830641 cggttgacca gatcgccgcc gtgcaagtgg ttgccgccga taccggccag gttctgctgc 830701 agcgttcgct ctaagactga gctttaggca cctggcgccc tgctattggc acgccctaca 830761 agcaccaggt ggtcgggcgt cgaccacctg ctcggagtgg gctgcatgat gccgcgcatc 830821 ttcagtcgtc gatcaccgtg gtgctggcca aacacgagtt ctccgctgcc acggtggccg 830881 acgggtacag ccgcagcggg gccgggttcg gggtcgcggc ggcggcctcc ggtggcggca 830941 ctttcctcgg tcagaaatgc gccgcagcaa cggcaagctg aattccgtaa ggttggcccg 831001 cgtcgacgca tgtgcgataa gaaggggcgt ggcctcagat aatcgcgacc ccatcgccgc 831061 agcacgggcc aactgggagc gttccgggtg gggtgatgtg tcgctaggca tggtggcggt 831121 gacgtcggtg atgcgtgcgc atcagattct gctggcccgc gtcgagacgg cgctgcgccc 831181 ctatgacctg agtttctccc gcttcgagct gctgcggctg ctggcgttca gccgtatcgg 831241 agcgctaccg atcaccaaag cgtcggaccg attgcaggtt cacgtgacca gcgtcaccca 831301 cgcgatccgc cggctggagg ccgatggatt ggtgcggcgg gttccgcacc ccaccgacgg 831361 gcggaccaca ctggtgcaga tcaccgagct gggtcgctcc acggtcgagg acgccaccgt 831421 caccctcaac gagcaggtgt tcgccaacgt tgggatgggc gccgaggaat cgcaggcgct 831481 ggtgtcggcc gtcgaaacgt tgcggcgcaa cgccggcgac ttttgagggc gggcagacgc 831541 gtaagcgccc aatgtcgtgc cgaaatgggc gcttatgcgt ctgctcgcgc ccggcttggc 831601 gcgcagccgg cgacattcca tgaccagttt gtgcgggcct tgacgcgggc gcgggctcgt 831661 atgcgaccgc cgaggccggc cggcttgctg ctgggcaatg gcggggctcg gcggtatccg 831721 gcggcgggca gctaaccgga ctgccccgaa acccactgcg tggtcaacga tttcaggaca 831781 agctgttagc aggacgtgcc cgcgctgcgc tatccaaaaa cgtcatgggc acgcatgatg 831841 gtgaaatgcg gcggacacca attcaaccgc gaaaggcagg acagtggacc cactgatggc 831901 tcaccagcgc gctcaggacg cgttcgccgc gctcctggcc aacgtccgcg ctgaccagct 831961 cggcggcccc acgccctgct cggagtggac gatcaacgat ctgatcgagc acgtcgtcgg 832021 cggcaacgag caggtcgggc gatgggcggc cagccccatc gagccacccg cccggcccga 832081 tggcctcgtt gccgcccacc aagccgcggc cgcggtcgcc cacgagatct tcgcggcgcc 832141 gggcgggatg tccgccacat tcaagctgcc gttgggcgag gttcccgggc aggtgttcat 832201 cgggttacgc accaccgatg tgctgaccca cgcgtgggat cttgccgccg ccaccggcca 832261 atccaccgat cttgatcccg agttggccgt cgagcggctc gccgccgcgc gtgccttggt 832321 ggggccgcag ttccgcgggc cgggaaagcc cttcgcggac gagaagcctt gcccgcgtga 832381 gcgcccgccc gccgatcagc tggcggcatt tttgggccgc acggtgcggt gaacccgcga 832441 attcggctgc cgcgcaacgt gtggatcacc gcgctgcggt ccagggcgcc gtggtcggcg 832501 gcgaatctgg cgtagatttc ggcggggtgg ccacctagcg gcgccgccgc accggtcgag 832561 gccaccgctt ccatggccag gcccacatct tgatcggcgt ggtggccacg cccggtgtga 832621 agtgctgttg gccgtgatgt cggattacag tctcggcgtg cccgacgaga caggccttgg 832681 tgctgacgcg gcgcgcgcgc gtgaagtggc gctgacacag cacattgggg tatccgcgga 832741 gaccgatcgg gccgtcgtcc ccaagctgcg ccaggcctat gacagcctgg tgtgcggtcg 832801 ccgccggctt ggcgccattg gagccgagat cgagaacgcg gtggcccatc agcgcgcgct 832861 gggccttgac accccggccg gtgcccgtaa cttctcccgg tttctcgcca ccaaagcaca 832921 cgacatcacg cgagtgctgg cagcaaccgc cgcggaatcc caggccggcg cggcgcggtt 832981 gcgatccctg gcttcgtcct atcaggctgt gggatttggc cccaaacccc aggagccgcc 833041 tccggatcca gtgccatttc cgccctacca gccgaaggtg tgggcggcgt gccgggcgcg 833101 tggccaagac ccggacaagg tcgtcaggac gttccatcac gcgccgatga gcgcgagatt 833161 ccgctcgcta ccggccggag actccgtgtt gtactgcggc aatgacaagt acgggctgct 833221 gcacattcag gccaagcatg gacgccaatg gcacgatatt gcggatgcac gatggccgag 833281 tgcaggcaat tggcgctatc tcgccgatta cgcaatcggt gccacactgg cctacccgga 833341 gcgagtggag tacaaccaag acaacgacac gttcgccgta taccggagag tgtcgttgcc 833401 agacggcaga tacgttttca caacccgcgt cattatttcg gcacgcgacg ggaagatcat 833461 tacggccttc ccgcagacga cgtgatgcgt cggttgggaa ctaagggaag gtgatggcgt 833521 gaccgggcca ccgcgaagct atacagggcg ccgggatctc atcgcggaga agctggagcc 833581 gtactttcag atcagcgcca tgctgccgaa gaacaccaga cccacctcgg aaaccgccga 833641 agagttctgg gacaactcgc tgtggtgcag ctggggcgac cgagaaacgg gatacacccg 833701 caccgtcacg gtttcgatct gccaggtggc ggacggcgaa cgtgaggccg aaggggttcg 833761 ggacatgatg cggctggagt gtccggctgg gctggatcta cggacaccca acccggaggc 833821 atacgagatt accggtcagc ggcccggaga attcgtgttc gtgctcggct atctggggca 833881 tgtgcgggcc atcgtgggca actgttacat cgagatcatg ccgatgggca ccagggtcga 833941 gctgagcaag ttggccgatg tggcattgga tatcggccgc agtgtcggat gctcggccta 834001 cgagaacgac ttcacgctgc cggatattcc aacgcagtgg cgcaaccagc cgctgggctg 834061 gtacacgcag ggccttgccc cctacctgcc ggggctgtcg gacccgaaag acgccgccga 834121 gggctgatgg gtgtgccggc gacctctgag ggcgagcaga cgcataagcg cccaatttcg 834181 ggctcttctg acccttccgt gggtggaacc ttggtctgag taggcgcacg tcgttgtagc 834241 ttaaggttgc tggtttgtca aaggtccgaa accaagggga gcgagcaacg acgtgcgcaa 834301 tgcgaggttg tggcgtgaac tgctgggtgt tgataagcgg acggtggcct acgccaggtg 834361 ttttcggtca aaggcgaaga aggcaagcag gcactggatc ggtggatctc ctgggcgcgg 834421 cgctgccgca tccccgtctt cgtggagctg gccggcggca tcgtgcgaca ccgccaagcc 834481 atcgacgccg cccttgacca cggcctatgg caaggactga tcgaatccac caacaccaag 834541 atccgactcc taacccggat cgcgttcgga ttccgctccc ccgaagcact catcgccttg 834601 gccatgctcg ccctcggcgg ccgccgcccc gccctaccgg gcagaaccaa acacccacgg 834661 atcagtcagt agagccggaa aacctgggat ttcgctgccc gttggacggt gcaatgcgct 834721 tctgtccatg agtcgctgga agacctgggc atctcgcccg ggttgtcctg gcttattggg 834781 ccatgacctc ttgggaggtg tcacatatcg tttgtgatcg cggcgccgga ggccatcgcg 834841 gcagcggcca cggatttggc aagcatcggt tcgacgatcg gggcggccaa cgccgcggcc 834901 gcggccaaca cgacggcggt gctggccgcg ggcgccgatc aggtgtcggt ggccatcgcg 834961 gcggcttttg gggcgcacgg ccaggcctat caggcgctca gcgcgcaggc ggcgacgttt 835021 catatccagt ttgtgcaggc cttgaccgcg ggcgcgggct cgtatgcggc cgccgaggcc 835081 gccagcgccg cgtccataac cagtccgctg ctcgacgcga tcaacgcgcc cttcctggcg 835141 gcgttggggc gcccgctgat cggtaacggc gccgacgggg cgccggggac cggggccgcc 835201 ggcggggccg gcggattgtt gttcggcaac ggcggcgcgg gcgggtccgg cgcgcccggc 835261 ggggccggcg gattgttgtt cggcaacggc ggcgccggcg gccccggcgc gtccggcggc 835321 gcgctgggct gatcggcaac ggcggtaacg gcggtaaggg cgggcttggg gtcccgccgg 835381 gtgtcggtgg taccggcggc gccggggggc tgctgctcgg cctggatggg ttgacgtagg 835441 cggcggcccg cagcccgccg ggctccacgt catctggcgc tgctggcaga ccaacgctcc 835501 ctacgagccc acgcgccacc gagccctcca gggccctgct ggcccaacat caacgaacgg 835561 atacctggga caggacgact ggaaggcggg cagttgaccc atgccgaata ccggtggcag 835621 cctgctgcac atcgcatcca cttccgggcg accaacacgt cgagcagccg cgacatccgc 835681 ggcatgcaat gctggcggcg cgacaggtgc taggaggagt ggttgcccgc accgtagtag 835741 ttcagccagg ccgcaatgcg ttgaccgata cgcgggtcgg tttcctccgg cagcagcaaa 835801 acccgagcct ggatcacacc cacgtcgagc aggccgcttc ggatgagggc ggccacgaaa 835861 gctttgtcct tctcgcgtcc ggcggcaagt ttcgccaccg cgaggtcgtg cggttccaga 835921 aagcgtggtt tcgccgggcg cgaggattcg acggtccaac tgaccagccg gtcccgccac 835981 ccgttaggca ggatcgcggt gtcgatatgt acgccctcgg cataaacgcc attgctgcgg 836041 tgaaaatcgg acatctcgcc gattgccacg tcgacatgat ccgctttgtc ccgcgccggg 836101 tcgttgacaa acgcgatgtc ggcctcctgg gaggcggtgg cctgcggcgg tagttcgttt 836161 tcatcaaatg accccaggat cgactgcgac ccgagtacca gcacgtccac atcgcccaca 836221 acagcacagg cgcggcggag gagatgtgca agttgctgac gcgtcattcc gtcatggccc 836281 gctcgtgctc gcgatcccag tgatccttga acgaccgcag caccgcgacc ctcgtcgcct 836341 ccggcagtat gcccgcgaac ggggagttct gccgcatctc ccgagcgtcc tccgaagggc 836401 tggtcaatac gtgcatgacc gcgtcgaggc cgtcgttaag gacacgctgc cacttcgtga 836461 aataccaccc cgccatgccg tcccgacgat gcatacccga ccagcgacgc aagttctctc 836521 gtgcggcgga gacgaccgta tccggttcgg tcaacagcgg gctcagcagg gcgcgatgca 836581 gccacagcga cctttcctcc tcgcgggtca accggcggct cgtgacgcgc tcgacttcgc 836641 tactgggcac ccgccgatgg ctgccgacgt gcacgcacac catctcgccg cggtcacaca 836701 tgttgacgac atgctgccgc gataccccga gtatctgcgc ggcctcactc gtcttcagca 836761 gagtctccat gtcccaatgc tggctagtaa acccaaaaaa cacaacatcg ttgcggagcg 836821 tgatcgcacc ggctgtcgct agagcgaggg ccccagttcc gcggccgaca ggtggcagtg 836881 agctagctgc cgcgcagcgt ctcgatcact gcgctgaagt ccaagtcggc gtgatcggcg 836941 gcgaacttgg cgtagatctg ggcggcgtgg ctgcccagtg gggccgccgc accggtcgag 837001 gccaccgctt ccatcgccag gcccaacatg ccaggtgctg ccgactacgg cggtgatcca 837061 cacggtcacg gcggaagcat tgggccgcat cggtattgat gcgccgcgga ttcctggatc 837121 gttggacgtc gccgcgcatg cggcgatcgg gctgctgccg ttggtggccg gctgcgaccg 837181 ccgacatcgg cggcctgtcc gcggtgctcg ggccggacgg gctgcccaag tgtctttgtg 837241 tatgacggct atccgggtgg agccggtttc gtcgaacgcg gtttgcaccg gccccgcggc 837301 gcaggtgggt gaccagtcac gctcaccgca gcgcgattac gcgcaccagg ccttgcaacc 837361 cgatgtgccg cggcgccgcg cgcggcggca cagaccccgc cggtgttcgg caaaaacggg 837421 gtcgtcgtct tcgacgatgc ggtgtacttg tcatcagaat cagtgtctat ggtcatcggg 837481 ggtgtcgtgg gcgctggccc gctgactcgg gtgggaggtg gcacatgtcg ttcgtgctgg 837541 cgatgccgga ggtgttgggg tcggcggcaa cggatctggc cgctctgggc tcggtgctgg 837601 gcgcggccga tgcggccgcg gcggctacga cgacgggcat cgtggccgcg gtccaggatg 837661 aggtgtcggc ggcgatcgcg gcgttgtttt ccgcccacgg ccgggcctat caggtggcca 837721 gtgcgcaggc ggcggcggtt cacgcccagt tcgtggaggc gttgagcgcg ggtgcggggg 837781 cctacgccag cgcggaggcc gccggcgcgg cggtgctggc caacccggcg cagagcgtgc 837841 agcaggacct gctggccgcc gtcaatgcgc aaagtgtcgc gctcacgggg cgcccgttga 837901 tcggcaacgg cgccaacggg gccccgggca cgggggccaa tggtgcgccg ggcgggtggt 837961 tgctcggtaa tggtggggcc ggcgggtccg ccgccgctgg ctcgggcctg cccggcgggg 838021 ccggcggggc cgccgggttg ttcggcaccg gcggggctgg tggggccggc gggagttcca 838081 cggtaggtga tggcggggcc gggggtgccg gtgggtcagg tggctggttg ttgggcaccg 838141 gtggggtcgg cggggtcggc gggctcgggg ccggcgccgg tggggccggc ggggttggtg 838201 gggccggcgg gctgttgggt gctggcgggc acggcggcgc cggcgggctc ggcgccgtca 838261 ccggtggggt cgggggagct ggcggagccg gtgggctgct ggccgggctg ctggccgggc 838321 cgggcggggc cggcgggacc ggcggacgtg gctttctcaa cgacggtggg gtcggtgggg 838381 ctggcggcaa cgccgggctg ctgttcggtg ccggcggcac cggtggatcc ggcggagccg 838441 gcctaggtgg tgacggtggg gccggtgggg ccggcggcaa cgccggtgtg ctgttcggca 838501 acgccggatc cggggggacc ggcgggttcg gcgataccga cgggggagcc ggcggtgccg 838561 gcggtgacgc cggctggttg ggctccggtg gggtcggcgg ggccggcggg ttcggcgaaa 838621 ccggtgacgg gggtgtcggc ggggccggcg gcaaggccgg gttgctgatc ggtaacggcg 838681 gggccggcgg cgccggggtg ctgatcggca acggcggcaa cgccggcatc ggcggaaccg 838741 gaccgaccgc gggtgatacc ggcgcgggtg ggatcagtgg gctgctgctg ggcgccgacg 838801 gcttcaacgc cccggccagc gcctctccgc tgcacaccct gaaacaacag gcgctggccg 838861 cgatcaacgc gccgacccag acactgaccg ggcgaccgct gatcggcaac ggcacccccg 838921 gggcggtcgg cagcggggcc accggggccc ccggtgggtg gctgctcggc gacggcgggg 838981 ccggcgggtc cggcgcggcg ggctcgggcg cgcccggcgg ggcgggcggg gctgccgggc 839041 tgtggggtac cggcggggcc ggcggggccg gcggctggct gctcggcgac ggcggggccg 839101 gcgggatcgg cggagccagc accgtactcg gcggcaccgg cgggggaggc ggggtcggtg 839161 ggctgtgggg cgccggtggg gccggcgggg ccggtggaac cggccttgtt ggtggcgacg 839221 gcggggccgg tggggccggc gggaccggcg gactgctggc cgggctgatc ggtgccggcg 839281 gaggtcacgg cgggaccggc gggctcaaca ctaatggcga cggcggggtt ggcggggccg 839341 gcgggaatgc cggaatgctc gccgggccgg gcggcgccgg cggagccggc ggtgacggcg 839401 aaaacctgga caccggtggg gacggcgggg ccggcggtag cgcagggctg ctgttcggca 839461 gcggcggcgc cggcggcgcc ggcggatttg gtttcctcgg tggggacggc ggggccggtg 839521 gcaacgccgg gctgctgttg tccagcggcg gggccggcgg gttcggcggg ttcggcaccg 839581 ccggtggggt cggtggggcc ggcggcaatg ccggctggct gggcttcggc ggggccgggg 839641 gcatcggcgg aatcggcggt aacgctaacg ggggcgccgg tgggaacggc ggcaccggcg 839701 gtcagttatg gggtagcggc ggcgccggcg gcgaaggcgg cgcagcctta agcgtcggcg 839761 acaccggcgg ggccggtggc gtcggcggca gcgccgggct gatcggcacc ggcggcaacg 839821 gcggcaacgg cggcaccggc gccaacgccg gcagccccgg aaccggcggc gccggcgggt 839881 tgctgctggg ccaaaacggg ctcaacgggt tgccgtagcc gggcggcacg gcatggcttc 839941 cgggcgtcaa ccactcgccg gtgatgcaga tcggctgcgg agcgggccgc caaaatgggg 840001 gccgccgcgc caggtatctc ggcgaagatc cccggcgctc gagcgctttg tcagaggccc 840061 gtcgcgggtc gtcgtgacga cggctatccg ggcggtgcgg gtttcgcggc gcgccctgtg 840121 cccggcaccg ccgcccgttt gtcggcaacg ccgccgcgac ccgtgagccg tccagcagct 840181 ggcgcctgcg aaacgtgtgg aagcgctgca tgcggtgccg gatcgcgata tcgttgattt 840241 ctgcaattaa ttcctacccg tacgggtgtg tcgctggtag tcgggcacca ggccgtgagg 840301 ggttgggagg catgcgatgt catgggtgat ggtttcgccg gagctggtgg tggcggcggc 840361 agcggatttg gcggggatcg ggtcggcgat tagctcggct aatgcggcgg cggccgtcaa 840421 cacgacggga ttgttgaccg cgggtgccga tgaggtgtcg acagcgattg cggcgttgtt 840481 cggtgcccaa ggccaggcct accaggcggc gagcgcacag gcggcggcgt tttacgccca 840541 gttcgtgcag gccctgagcg ccggcggagg cgcgtatgcg gccgccgagg ccgccgccgt 840601 gtcgccgctg ctggccccga tcaacgcgca attcgtggcg gccaccgggc gcccgctgat 840661 cggcaacggc gccaacggcg cccccgggac cggagccaac ggcgggcccg gcgggtggtt 840721 gatcggcaac ggcggcgccg gcgggtctgg cgcccccggc gctggggccg gcggtaacgg 840781 cggggccggc gggctgttcg gcagcggcgg ggccggcggg gccggcggaa acgccttcgg 840841 cgctggcgag gtcggcgggg ccggcggggc cggcgggaac gccatgctgt tcggcgccgg 840901 cggggccggt ggggcgggcg gggccggcgg aaacgccggc atgctgttcg gcgccgccgg 840961 ggtcggcggc gtcggcggat tctcgaacgg cggtgccacc ggcggggcag gcggggccgg 841021 cggggccggc gggctgttca ccaccggcgg cgtcggcggg gccggcgggg caggcgggga 841081 cggcggggac ggcggggccg gagggttgtt cggtgccggc ggcaccggcg gggccggcgg 841141 attcggaacc gccgttctgg ctggcaccgg tggggccggc gggcccggcg gggcgggcgg 841201 gctgtttggc gccggagggg aaggcggcag cggcgggtcg ggcaacctca ctggcggggc 841261 cggcggggcc ggcggcaacg ccgggacgct cgccactggt gatggcgggg ccggcgggac 841321 cggcggcgct agtcgcagcg gcggattcgg cggggccggc ggagccggcg gcgacgccgg 841381 catgttcttc ggctccggtg gctccggcgg cgccggcggg gccggcggct ccggcgggtt 841441 tggcctcccc agcgggggga aagggggggc cggcggcgac gccggcatgc tcttcggctc 841501 cggcggctcc ggcggcgccg gcggcattag tagaagcgtc ggggacggcg ccgccggcgg 841561 ggccggcggg gcccccgggc tgatcggcaa cggcggcaac ggcggcaacg gcggcgcgag 841621 caccggcggc ggggacggtg ggcccggcgg ggccggcggc accggcgtgt tgatcggcaa 841681 cggcggcagc ggcgggaccg gcgcgaccct gggcaaggcc ggcatcggcg gtaccggggg 841741 ggtgctgttg ggcctggacg gctttacggc ccccgccagc acctcgcccc tgcacaccct 841801 gcagcaggac gtgatcaata tggtgaacga ccccttccag acgctcaccg ggcgtccgct 841861 gatcggcaac ggcgccaacg gcactccggg gaccggggct gacggcggag ccggcggctg 841921 gttgttcggc aacggcggaa acggcgggca gggaacgatc ggcggcgtca acggcggggc 841981 cggcggggcc ggcggggccg gcgggatctt gttcggcacc ggcggcaccg ggggcagcgg 842041 cgggcccggc gccaccggcc tcggcgggat tggcggggcc ggcggagccg ccttgctctt 842101 cggctccggc ggggccggcg gaagcggtgg tgccggcgcg gtcggtggca atggcggggc 842161 cggcggcaac gccggtgcgc tcttgggcgc cgccggggcc ggcggggccg gtggtgccgg 842221 cgcggtcggt ggcaatggcg gggccggcgg taacggcggg ctgttcgcca acgggggagc 842281 cggcgggccc ggtgggtttg gcagccccgc tggggctggc gggatcggcg gggcaggtgg 842341 gaacggcggg ctgttcggcg ccggcgggac cggcggggcc ggcgggggaa gcaccctcgc 842401 cggcggcgcc ggcggggcgg gcggcaacgg cgggctgttc ggcgccggcg gcaccggcgg 842461 cgccggcagc catagcaccg ccgccggagt ttccggaggg gccggcgggg ccggcggcga 842521 cgccggcttg ctctccctcg gcgcctccgg cggggccggc ggcagcggcg gttccagcct 842581 gaccgccgcc ggcgtggtcg gcggcatcgg cggcgccgga ggcttgctct tcggctccgg 842641 cggcgccggc gggagcggcg ggttcagcaa ctctggcaac ggcggggccg gcggggccgg 842701 cggcgacgcg ggtttgctcg tcggctccgg cggggccggc ggggccggcg cctccgccac 842761 cggcgccgcc accggcgggg acggcggggc cggcggcaag tccggagcgt tcggtctcgg 842821 aggtgacggc ggcgccggcg gcgccaccgg tttgtccggt gctttccaca tcggcggcaa 842881 gggcggcgtc ggcggcagcg ccgtgctgat cggcaacggc ggcaacggcg gcaacggcgg 842941 taacagcggt aacgccggga aatccggggg tgcacccggc cccagcggcg ccggcggcgc 843001 cggcgggctg ctgctcggtg agaacgggct gaacggcttg atgtagccgg cgggcctgcg 843061 accgcgcgcg gcgttgacag catcgcttcg gccgctcgac cgcagatgat gctgttgatg 843121 cgttaccgtg tgcatcatgc gcaccacggt gtcaatctcc gatgaaatac tcgctgccgc 843181 caaacgccgg gcccgcgagc gtggtcaatc gctgggcgct gtgatcgagg acgcccttcg 843241 gcgggagttc gccgccgccc acgtcggcgg cgcccgcccg accgtcccgg ttttcgacgg 843301 cggcaccggt ccgcggcgag gcatcgacct gacctcgaat agagcgttgt ccgaagtgct 843361 cgacgagggc ctggaactga actcccggaa gtaaccccca ataggcgcag aacggcaatg 843421 ttccttctcg acgccaacgt gctgctggct gcacaccgcg gtgaccaccc gaatcaccga 843481 accgtccgcc cctggttcga tcgactgctc gcggctgacg accccttcac agtgccgaac 843541 ctggtatggg cgtcgttcct ccggctggca acgaatcgac gcatcttcga gattccgtca 843601 ccgcgagcag aggcattcgc attcgtcgaa gccgtcaccg cccagcccca tcaccttccg 843661 acgaaccccg gtcccagaca cctcatgctg ctgcgaaaac tctgcgacga ggccgacgca 843721 tcgggcgact tgatacctga cgcggtactc gcggccatag cagtggggca tcactgcgcc 843781 gtggtgagcc tggacaggga tttcgcccgg tttgcctcgg tgcgccacat tcgcccgccg 843841 ctctagcgag cggtcctcaa gtacagtcgg cgaccggaca aaccgctgcg ccagacgatt 843901 caccgtcctc gcgtcaattc gagcagctac ggccgaaagc caagggcctt cttcgtcggg 843961 gtgaaaaagt tcagacgcag cgacaccagc tgccacagct ggttgagcaa ctccagttcc 844021 tcggtgctgt cgtagcgcca gtggaacgcg tgtttgcgca ccacacggtt gttctttcga 844081 ctccacgtgc gcctggtcgt tcgtctggta caccaggcta ccgggctatc ggattcggcc 844141 ccaaacctca ggagccgcct ccggatccgg tgccgtttcc gccctaccag ccgaaggtgt 844201 gggactaaac tatctagggc aagtgcgggc catagtgggc gactgcgtca tccacatcat 844261 gccgatgggc accggggtcg agctgagcaa gttggccgat ctggcattgg atatcggccg 844321 cagtgtcgga tgctcggcct acgagaacga cttcacgctg ccggacattc caacgcagtg 844381 gcgcaaccag ccgctgggct ggtacacgca aggccttgcc ccctacctgc cggggctgtc 844441 ggacccgaaa gacgccgccg agggctgatg ggtgtgccgg cgacctctga gggcgagcag 844501 acgcataagc gcccaatttc gtgtcgaaat gggcgcttat gcgtctgctc gcgcgcgcaa 844561 cgtgtggatc accgcgctga agtccaggtc ggcgtggtcg gcggcgaatt tggcgtagat 844621 gtcggcggcg tggctgccca gcggggccgt cgcaccggtg gcggccaccg catccatcgc 844681 caggcccagg tccttgttca tcaacgcggt cgaaaacccg ggcttgaagt cgttgttggc 844741 cggtgaggtg ggcaccgggc ccggcaccgg gcaattggtg tgcaccgccc agcaattgcc 844801 ggtcgcgccg gtgatgacgt cgaacaacga ttgtgcggac agcccgagct tctcggccag 844861 cacgaacgcc tcggcgatcg cgatctgctg caccgccagc accatgttgt tgcacacctt 844921 ggcggcctgt ccggcaccgg cggcgccgca gtgaatgatc ttgcccgcca tgggctctag 844981 taccgggcgt gcccgccgta gcgtggactc gtcgccgccg accatgaatg ccagcgtcgc 845041 ggcggcggcg cccttcaccc cgccggagac cggcgcatcc agttggagca tgccgtgcga 845101 ttcggccagc gcgtgcacct cacgggcatc ggtgaccgag atcgtggagc tgtcgatgaa 845161 cagcgttgcc ggacgcgcgg cggccagcac gtcggtgtag cagcgccgga ccacctcgcc 845221 ggtgggcagc atggtgatga ccacgtcggc ctcggccacc gcttcgggcg cgctacgaaa 845281 caccgcgaca ccgtgcgcgg cggcgccgga cgccgccgtg ggtgccgggt cgaatccacg 845341 cacgacgtgg cccgcaccaa ccagattcgc cgacatcggc gcacccatgt tgcccaaacc 845401 taggaaggcg atggtcgtca tctgagcctc tctaaacggt ggcgcggaac cgcgcggcct 845461 cggcccgacc gatgaccagc cgcatgatct cgttggtccc ttccaggatg cgatgcaccc 845521 gcaggtcgcg gacgatcttc tccagaccat actcgcgcag atagccatag ccgccgtgca 845581 gctgcagggc ctggtcggcg acctcaaagc aggtgtcggt gacgtagcgc ttggccatcg 845641 cacacagctc gaccttgtcg gcgtcgtcgt catcgagcgc acttgcggcc cgccacaaca 845701 acattcgcga cgtctgcagc ccggtagcca tgtcggccag ggtaaaccgc acggtgggct 845761 cgtcgagcag cgatccgccg aaggcctgtc ggtcgcgaac gtaggcgccc gctttgtcaa 845821 aggcggcctg cgcgccaccc agcgagcatg ctgcgatatt gagccggccg ccgttgaggc 845881 cgctcatcgc gataccgaag ccggcgcctt cgccgtcggc gccgcccagc atggcctcgg 845941 cgggtacccg caccccgtcc agcaccacct gcgcggtggg ttgggcatgc caacccatct 846001 tcgcttcggg cgcgccgaaa ctcagccccg gtgtgccctt ttcgacgacg aacgccgaca 846061 cgccgcgcgg accctcggcg cccgtgcgcg ccatcaccac atacacgtcc gatgctgcgg 846121 ccccggaaat gaattgtttg acgccatcga gcacgtagtc gccgcctttt cctgagccgt 846181 gcctgacggc gcgggtgctc agtgcgccgg catcggatcc ggcgcccggt tcggtcaggc 846241 agtagctggc gatgacgccc atggtggcca gtcgcggaat ccagtccttg cgttgctcgt 846301 cggtgccgaa gctgtcaatc atccacgcgc acatgttgtg gatggacaaa aacgcggcgg 846361 tcaccgggtc ggcgatcgcc aactgctcga agatgcgcgc gccgtcgagc cggcgcagcc 846421 cactgccgcc gacgtcgtcg cggcaataga tcgcggccat gccgagttcg gccgcttccc 846481 gcaacacgtc caccggaaag tgtttggcgg catcccattc cagggcgtgc ggagccaggc 846541 gtttgccggc gaaggcggcc gccgtctcga cgatcacccg ttcgtcgtcg ttaaggacaa 846601 acatgacacg ctaactcatt gtggggatga cgaattcggc accgtccttg atgcctgacg 846661 gccatcgcga cgtgacggtc ttgaccttgg tgtagaactg gattgccgcc gggccgtgct 846721 ggttgaggtc gccgaagccg gagcgcttcc agccgccgaa agtgtggtag gccaccggca 846781 ccgggatcgg cacgttgacg ccgaccatgc ccacctgccc ccgggagacg aagtcgcggg 846841 ccgcgtcgcc gtcgcgggtg aagatcgcca ccccgttgcc gtattcgtgc tccgacggca 846901 gccgcaacgc ctcttcgtag tcgcgggcgc gaaccatgca caacaccggc ccgaagattt 846961 cgtcggtgta gatcgacatg tgggcagcga catggtcgaa cagggtcggc ccgatgaaga 847021 agccgccctc caggttcgca tcgccttcag gcagcccaaa ggtcaggtcg tcgctggcgc 847081 ggtcgcggcc gtcaacgacc agctcggcac cggcggccac accctggccg atgtagtcgc 847141 gcacccgcgc cagcgccgcc ccggtgacca gcgggccgta gtccgccttg gggtccaggc 847201 tgtgtcccac ccgcaagtta ttgatccgct cgatcagcct ggcgcgcaac cgctccgcgg 847261 tctgatcgcc caccggcacg gcgacgctga tcgccatgca gcgttcgccg gcgctgccgt 847321 atccggcgcc gatcagtgcg tccacggcct gatccaggtc cgcgtcgggc atcacgatca 847381 tgtggttctt ggcaccgccg aaacactgcg cccgcttgcc ggtggcggcg gcaccagcgt 847441 agatgtactg agcgatatcc gagctgccga cgaagccgac ggccttgatg tcggggtggt 847501 gcaggatggc gtcgacggcc tccttgtcgc cgtgcaccac ctggaacacg cccgccggca 847561 ggcccgcctc gatgaacagc tcggccagcc tcaccggaac cgacgggtcg cgctcacttg 847621 gcttgagcac gaaggcgttg ccgcacgcta gggccgggcc ggccttccac agcggaatca 847681 tcgccgggaa gttgaacggg gtgatccccg cgaccacacc caggggctgc cgcagcgaat 847741 agacgtcgat gccggggccg gcaccctcgg tgtactcgcc cttgagcagg tggggaatgc 847801 ccaggcagaa ctcgattacc tcgatgccgc gctggacgtc gccgcgggcg tcggccagcg 847861 ttttgccgtg ctcacgcgac aacagctcgg ccaactcgtc gatggtgtcg ttgaccagtt 847921 cgataaaccg catcaacacc cgggcacggc gctggggatt ccatgcggcc cagccctttt 847981 gggcctcgac cgcggaggcc acggccgcgt cgatgtctga cttgccggcc atcggtacct 848041 tcgcctggat ctggccggtg ttggggtcga agacgtcggc cgagcgcgtg gactggccgg 848101 cggtgcgttg tccgtcgatg aaatgtgaaa tctgtgtggt catggttgtc ctgtgcaagc 848161 cggtggcggc ggggaatccc gatacttgga tatcctagta actgtggcgg atggctcgca 848221 aggcgaccga gccgacagcg tcctagcggg agacgcttgg atgctcgttg cattttggcc 848281 gatacccgca tctgttccgg cgctgcgctc catcatggct agtacgcgac aacacccggg 848341 ggtaagcgat gtcatttgtg atcgtggcgc gggacgcgtt ggcggcggcc gcggcggatc 848401 tagcgcagat cggttcggca gtgaatgcgg gcaatctggc cgcagccaat ccgacgaccg 848461 ctgtggcggc ggcggccgcc gacgaggtat cggcggcact cgcggcgctg ttcggcgcgc 848521 atgcccggga gtatcaggcg gcggcggcgc aggcagcggc gtatcacgag cagtttgtgc 848581 accgattgag cgcggcagcg acatcgtatg cggttaccga ggtgaccatc gcgacgtcgc 848641 tccggggggc gctgggctcg gcgcccgcgt ccgtttccga cgggttccaa gcgttcgtct 848701 atggtccgat tcacgcgacc ggccagcaat ggatcaacag cccggtcggc gaggcgctcg 848761 ccccgattgt caatgcgccg acaaacgtgc tgctcggccg cgatctgatc ggcaacggcg 848821 tcaccgggac ggcggcagct cccaacggtg gccccggcgg tttgctattc ggtgacggtg 848881 gggccggcta taccggcggt aacggtggga gtgccgggtt aatcggcaac gggggtaccg 848941 gtggcgccgg ctttgccggc ggagtgggcg gcatgggcgg caccggcggc tggttgatgg 849001 gcaacggcgg catgggtggc gcgggcggtg tcggcggtaa cggcggcgcc gggggccagg 849061 cgctgttgtt cggcaacggc ggcctgggcg gagccggcgg ggctggcggg gtcgatgggg 849121 ctatcggtcg tggcgggtgg ttcatcggta ccggcggcat ggccacgatc ggtggtggcg 849181 gcaacgggca gtcgatcgtc atcgacttcg tgcggcacgg ccagacgccg ggcaacgccg 849241 caatgttgat cgacacggcg gtgcccggac ccggactcac cgcgctgggc cagcaacagg 849301 cgcaggccat cgccaacgcg ctcgcggcca agggccccta tgccgggatc ttcgactcgc 849361 agttgatcag aacgcagcag accgccgcgc cgttggcgaa cttgctgggg atggccccgc 849421 aggtattgcc cgggctcaac gagatccatg ccggcatctt cgaggacctg ccgcagatca 849481 gccccgcggg cctgctgtat ctcgtcggcc cgatcgcctg gacgctcgga tttcccatcg 849541 tgccgatgct ggccccgggc tccaccgacg tcaacgggat cgtcttcaac cgagccttta 849601 ccggtgcggt tcaaacgatc tacgacgctt ccttggccaa tccggtcgtg gccgcagacg 849661 gcaacatcac gtcggtcgct tactccagcg cattcaccat cggggtcggg acgatgatga 849721 acgtcgacaa tccccatccg ctactgctgc tcacccaccc ggtgcccaac accggcgccg 849781 tcgtggtaca gggcaatccc gagggcggct ggacgctggt cagctgggac gggatacccg 849841 tcgggccggc gtcgctgccg accgcgttat tcgtcgacgt gcgcgagctg atcacggcgc 849901 cgcaatatgc ggcctacgac atttgggagt ccctgttcac cggcgatccg gcggcggtca 849961 tcaacgcggt gcgagacggt gccgatgagg tcggcgcggc tgtggtccag ttcccacatg 850021 cggtggctga cgacgtgatc gacgctacgg gccaccccta tctaagcggc ctgccgatcg 850081 gtctgcccag cctgatccca tgaccgcgag cgaccaatag gtccccacat ggcccggagg 850141 ccgctgccag cattgacccg acgatgccgg cccgcaggct tccccgatcg tgcggaacct 850201 gctcggccgt gcatgggaca tccagatcgg attgcctccg ggtacggcgt acgccggacc 850261 cggtcgccgg gacgataccg ggctagtgtt agctagcggt ggaaaaagcc cgacacgaaa 850321 tcgatcgaat taaagccacc agaatcctgc tttccagagt tcccgaaacc cgatgtggcg 850381 ctgttgttgt cgggattggc ttcaagatta ccgaagcccg actgtagaaa acccgtattg 850441 ccaaagcccg acatgaatcc actgaacagg ccggttccct tgttcccgaa acccgatgtt 850501 acactaaccg aattgttgta acccgttacc gattggcccg agttggcgaa gcccgagatt 850561 tgtaaattac caacgttttg ggcgccggag tttcccttac cagaattatt gaaacccgaa 850621 tttccactgc cggcgtttcc gaatcccgag ttttcgccca gcccatcggt agtattgccg 850681 aaaccggtgt tcaggttgcc cgcgttaaag ccgcccgtgt tgatattgcc agaatttgcg 850741 aagccggtgt tcgtcaggcc agagttcaag aaaccagaat tagcgtctcc tccgttgaag 850801 ctgcctgagt tgaatgcacc cgagttgaag ctaccggtgt taatgatgcc gccgttgaag 850861 ttgccggtgt tgaaatcgcc cgcgttccct atgccggtat tggcctgacc tgagttgcca 850921 aagccagtgt tgacgcttaa cgcgttcccg aagccggtgt tgataaagcc ggagtttccg 850981 aagccggtgt tgatgttgcc tgagttggct acgcccgtgt tggtgacgcc cgagttgccc 851041 acgccgaagt tgccgctgcc cgagttgaag aagccgatgt tcccggtgcc cgagttacca 851101 aatcctatat taccgctacc ggaattcagt ccgccaaagc cgatctgatt gctgccggtt 851161 aacccaatgc cgatattatt gttgccggtg ttcccgaagc cgaagttgta gctgccgctg 851221 ttcccgaagc caacgttgcc gtcgcctacg tttcccagac cgatatttgc gttgcccgtg 851281 ttaccaccgc cgaagtttcc attgccaccg ttgccaatcc cgacattccc attgccgggg 851341 gtgggcacgg cgggggaact catgtttgga cctgcatttc cgataccaat gttggcattg 851401 ccaaagttcc cgaagccgaa gttgttgtcg ccagagttgc cacccccgac gttcgcatta 851461 ccgatgttgc cgctacccac gttaaagctg gacggtccga tgccaacgtt tccgtttccg 851521 ctacccaggt tgaagttgcc gatgttgccg ctacccacgt tgaagctgga aatctggccg 851581 tggaaggcgc ttccgtttcc gccgccaaag ttggcgttac cgaggtttcc attacccagg 851641 ttgtaatcgc cggtattacc attgccgaca ttgaggttgc cgatattgcc gacacccaaa 851701 ttgatgtttg gcaacgccgg ctgccacgac gccagctgtg ccgctgccgc cgaggatgcg 851761 gcatggtagc cggccatcac cgccacgtct tgtgcccaca tctgctcgta ggtggattcc 851821 atggccgcta tggccggcgc gttttgcccg aacacattcg acaccgccaa caaccacgtc 851881 cgaacacgat tggcctgcac caccgccgga tgcaccacgc cggccagcgc ctcctcaaat 851941 gcacccgccg ccgcccgagc ctgccgcgct gccagctcgg cctgggctcc agccgcggtc 852001 aaccagctcg catacggccc cgcggcattc gccatcgcgg ccgacgccgg tccctgccaa 852061 gcgccgccgg ccaactccga cgtcaccgac ccgaacgacg acgccgccgc gtgcaactcc 852121 tcggccaacc cgtcccaggc ccccgccgcc gccaatagtg gccgtgaccc cgcacccaga 852181 tacatccgta gcgaattggt ctccggaggc aaccacgcga aaccgaccat cacggccccc 852241 tcacaccatt gacaaaccag gacgcctcga gcctaactac acaacgcgaa gggattggga 852301 cttctatcgg aattgcgccg cgtgcactgg ccgccggcct ttccccgcca gcctcggtgt 852361 ttcatgccgc ttgccgtggt ctgccacctg cgagttcgca tttgtgcaga gtcccgtcgg 852421 gagttgtcaa aactaaaacg ggcgatcttg atcgcatcgg aagcgcgaga ttgcgccctg 852481 agctgcgcct tcgtggagcc cccggtcagg attgaacgga cgaccgctcg cttataaggc 852541 tgttccggta ccgatcctta agccatcgag gccctcggcc tcatagcggg ccaaccaggt 852601 atgcagcgtc tgccgcgaca ccccaacttt ctcggcaacc tgcgagatcg acaacccgtc 852661 gctgatcacc gccaacacgg cttgataccg ctgttctgcc acactcaact ccttcatcga 852721 aggagtgtca aggatcagcc gaaccaactg tcaagcatca gccgaaacat cgtcaggcat 852781 cacccgaacc caaaacgtca agcatcagcc gaggtactac acgaacgctt gagccccctg 852841 tcaggattga actgacgacc gctcgcttac aaggcgagtg ctctaccact gagctaagga 852901 ggccgatgaa atcgctgtga gtctagccgc tcactcgctg tcgacgacgc gttgcgaacg 852961 caccgaccgc gacgacgagc ggcgcgcggg acggcgcccg ggcagtggaa tgcgctcggc 853021 gatgctgctc agcgggttga ccaccatggt aagtgcgatc acagcgtctt gcagcgtcgc 853081 gatggccggc tcgagcgcct ccatcccggg tgtcagccgc gccaacgtgt cggcgacgtc 853141 ggcgagctgt tcgagcggtc cgtccttggc cgttatcttg tcgatcagtc cgccttcggc 853201 cagcagccgg tcggccagcc cgtcttcgga gagcacccgc tcgataagtc cgtcctcggc 853261 gagcagctgg tcggccagtc cgcccggttg cagtgcgcgc tgcatggcgc cgccttcagc 853321 ggtcaggcgg tcgagtaagc cgccgggctg ggtcagcagg tcgaccaccc cgccgggccg 853381 cagcatccgg tccatcggcc cgttgggcgc gatggcgcgt cccagcggca tatcgtcgtc 853441 caatagcctg gccagccggt tggcgcgggc aatcgtgtca tcgattccca gcatgttggc 853501 cattgaggtc gacccgcttg cgccgccggc atcacccaac gcttgtttgg ccatgtcaac 853561 cgccgcgccg gccatgttca aaccggtgtc ggcggcggcg agccccgctc gtgcgggcca 853621 ggtcgcaata cccacgaggg tttggccgag gttcattctg cgagtgtatt cacggcgcgc 853681 cgtggattga gcggcaacgg tccaagctga tttggcgatt cctggcagac tgttagcaga 853741 ctactggcaa caagctttca ggaattacac aatgactgtg aaggtaacgt tcaaccaatg 853801 cggaaagggg ttgatctcgt gacggcggga accccaggcg aaaacaccac accggaggct 853861 cgtgtcctcg tggtcgatga tgaggccaac atcgttgaac tgctgtcggt gagcctcaag 853921 ttccagggct ttgaagtcta caccgcgacc aacggggcac aggcgctgga tcgggcccgg 853981 gaaacccggc cggacgcggt gatcctcgat gtgatgatgc ccgggatgga cggctttggg 854041 gtgctgcgcc ggctgcgcgc cgacggcatc gatgccccgg cgttgttcct gacggcccgt 854101 gactcgctac aggacaagat cgcgggtctg accctgggtg gtgacgacta tgtgacaaag 854161 cccttcagtt tggaggaggt cgtggccagg ctgcgggtca tcctgcgacg cgcgggcaag 854221 ggcaacaagg aaccacgtaa tgttcgactg acgttcgccg atatcgagct cgacgaggag 854281 acccacgaag tgtggaaggc gggccaaccg gtgtcgctgt cgcccaccga attcaccctg 854341 ctgcgctatt tcgtgatcaa cgcgggcacc gtgctgagca agcctaagat tctcgaccac 854401 gtttggcgct acgacttcgg tggtgatgtc aacgtcgtcg agtcctacgt gtcgtatctg 854461 cgccgcaaga tcgacactgg ggagaagcgg ctgctgcaca cgctgcgcgg ggtgggctac 854521 gtactgcggg agcctcgatg agtcttggta gttaatcgga tcggcagccc gaggagaacg 854581 cggcaatggc cagacacctt cgaggaaggc tgcccctacg ggtacgcctg gtcgcagcca 854641 cgctgatcct ggtggccact ggacttgtgg cctcggggat cgcggtcacc tcgatgttgc 854701 agcaccggct gaccagccgg atcgatcggg tgttgctcga ggaagcccaa atctgggcgc 854761 agatcacgct gcccttggcg ccggacccct accctattca taaccccgat cggccgccgt 854821 cgaggttcta cgttcgggtg atcagccccg acggccagag ctatacggca ctcaacgaca 854881 acactgccat accggcggtg cccgccaaca atgatgtcgg ccggcacccg acgacgctgc 854941 catcgatcgg cggatccaag actttatggc gcgcggtctc ggtgcgcgcg tcggatggct 855001 acttgaccac cgtcgccatt gatctggccg acgtccggag caccgtgcgg tcactggtgc 855061 tgttgcaggt cggcataggc agtgcggtgc tggttgtcct cggggtggcg ggctacgctg 855121 tggttcgccg cagcctgcgg ccgctggcag aattcgagca gacggccgcg gcgatcggcg 855181 cggggcagct ggatcgccgg gtcccgcagt ggcatccgcg aactgaggtc ggccggcttt 855241 cgttggcgct caacggaatg ctggcacaaa ttcagcgggc ggtggcgtcc gcggaatctt 855301 ccgccgaaaa ggcccgggat tcagaggacc ggatgcgaca gttcatcacc gacgccagcc 855361 atgaactgcg taccccgttg accactatcc gcggcttcgc ggagctgtac cgacaaggag 855421 ccgcccgcga cgtgggcatg ctgctgtcgc ggattgagag cgaagcgagc cggatggggc 855481 tgctggtgga cgatttgctg ctgcttgccc ggctagatgc gcaccggccg ttggaactgt 855541 gccgggtgga cctgctggcg ctggccagtg atgccgcgca cgacgcgcgg gcgatggacc 855601 ccaaacgcag gatcaccctg gaggtccttg acggccccgg caccccggag gtcctcggcg 855661 acgaatcgcg gcttcggcag gtgctgcgca atctcgttgc aaatgccata cagcacaccc 855721 cggaaagcgc cgacgtcacc gtgcgagtcg gcaccgaggg cgacgacgcc atcctcgagg 855781 tcgccgatga cggtccgggc atgagtcagg aggatgcgct gcgggtgttc gagcggttct 855841 atcgcgccga ctcgtcgcgg gcgcgcgcca gcggcgggac cggactgggg ttgtcgatcg 855901 tcgactcttt ggtggcggcc catggcggag cggtcaccgt gacgaccgcg ctcggggagg 855961 gttgctgctt tcgtgtctcg ctgccgcgcg tcagtgacgt ggaccagctg agcctcacgc 856021 cagttgtgcc agggccgccc tgatcttggc ctgcgcttcg tccagcgatc ccggtgaggg 856081 gttgcggtcg acgttggcaa agccgaaatc actgaggctg cgggtgggaa acacgtggat 856141 gtgtaggtgg ggcacttcca gcccggcaat gatcatcccg gcgcgttggg ttgaaaacgc 856201 ccggcacacg gccttgccga tcagctggct caccgacatg acgcggccaa ataacgcggg 856261 atccacgttt tgccagtggt cgatttcggc gcgtggcacc accaaggtgt ggccttgcgt 856321 catcggctca atcgtcaaga acgccacgac gtcgtcgtcc tcgtagacga aacggccggg 856381 cagttcacgg ttgatgatct tggtgaagat cgacacccgt tgagcatatg acgtcgcaac 856441 ggcccccccc aggtttcatt cctggttacc gaaggtcatc atgtcgaggt tccagtaccc 856501 acgcatgttg gtgatcaaac cggccttatt cacccggtag gtgaacacgc cgcggacctc 856561 actggtaaag ccgccgtcaa actcgctgtg caacaccaga atgtgggcga tctcgtccgg 856621 tgagctggac gggaacgtct cctcgcaggt gaccgtcaac cgattggccg caatgtgtgt 856681 gtcgaaaaag gcgccgacgg cctccttacc tttgatgccg ctgccatcgg gattggtgac 856741 ggacttgccg atcggatcct cgatgacgac gtcgtcggcc atcagcgcca gccagccctc 856801 ccggtcgtgg gcttggacgc accgccacga cgactgcgac gcgatcaggg ccggggattg 856861 ggtcgtttgg gtcatggcta tctccggcta gcggtcgtcg tccgtgtacc ggatcacgcc 856921 gcgaatgttc ttgccgttca gcatgtcctg gtatccgtcg ttgatctgct ccagcttgta 856981 cgcagtggtc accatgtcgt cgaggttgag tttgccggcc ttatacatcg acaacagctt 857041 cggaatgtcg tagtgcgggt tgccgccgcc gaagatggtg ccctggatgt tcttttgcag 857101 cagggtcaac atcgcgaggt tcagcgtcac ctgggtgtcg accaggctgc cgatggccgt 857161 cagcacgcag gtgccgccct tggccgtgat ggtcagatag ctgtcgacgt cggcgccatc 857221 gagcttgccg acggtgatga tcaccttctg cgccatcagg ccgtaggtga cctcggcaat 857281 gcccatcagc gcggcgttga tgtccgggta gacgtgggtg gcaccgaatt tcagagcctg 857341 atcacgtttc cattccaccg gctccaccgc gaagacgtag cgggcgcccg cgctgaccgc 857401 gccctgcaac gccgccatgc cgaccccacc caagccgacg atggccacgt cgtcgcccgg 857461 ccggacgtcg gccgtgcgga ccgccgaacc atagccggtg gtgacgccgc aaccaaccag 857521 gcaggcgact tcgaagggca ccgacgggtc gatcttcacc accgagctgc ggtgcaccac 857581 catgtacggt gaaaacgttc cgagcagggt catcgggtag acgttctggc cgcgagcctg 857641 aatccggaag gagccgtccg tcacagattc cccggcgagc agccccgccc ccaggtcgca 857701 cagattccgc attccagcct ggcaggacgg acacttgccg caggacggga tgaatgccaa 857761 caccacgtga tcgcccgggg cgaagtcgtc gactcccggg ccgacctcgg tgacgatgcc 857821 cgcgccctcg tgtccgccca gaacgggaaa gcccgccatc gggatgtcgc ccgtcaccag 857881 gtgatggtcg gagcggcaca tcccagccgc ttccatctgg atcttgactt cgtccttgcg 857941 cgggtcgccg atttcgatct cttcgacgga ccatggctgg ttgaactccc agatcagtgc 858001 gcccttggtc ttcaccgcaa acctgctttc atcgttgaac ttcggctacg agtggtccct 858061 agcctcggcc ggaacgccga ctggctgagt gtaggtcaac ggcgctaggg cgtttaccac 858121 agtggcaccg gcgtcttgcc gagcgggtaa tagcccggca ctttgttgcc cgacacggcg 858181 cgttcgatgc gcttctgcat gcctggcgat agcttgccgg ccttgatcaa ttccaggtag 858241 agcgcggaaa catgaccgaa gtcgaagaaa tcgcgttgcc agttccattt cccgccgccg 858301 gcataccgga accagctgcc gccaatgccg tacacctcct gctcggcgcc attggcgtcg 858361 gtggcaacct gtttccagaa cccaaccacc tcgccctgtt tctcgtcgat gacgacccgt 858421 tgataggggt agcgccagcc ctgcaggccg tccatttcct ggcccagcgc aatgtcgcgg 858481 atctcgtcga tgccgacgca catcacgtcc tcgttgggac cgacgttcca gccgtaggtg 858541 gcgtcgtcgg tgtagaagtc ggccagcaac gtccagtcgc cgcgccgctc cgccgtacgg 858601 ttggcctgta accagcggtg aaccacatct tcgagttcgt cgcgaggata gccggccacg 858661 gttactctcc cgtttctcgg atggacagtg cctgggtggg acaggcccac acggcatgct 858721 tgatcacgcc gcgggcttcc tcgggcggct cggggtcgag gatttcgacc tggccgcgct 858781 tgggcacccg gaaatactcg ggtgcctcca gctcgcacat cgcgtgtcct tggcacagat 858841 cccggtcggc ttcgactcga tagcccatcg ttaaactccc gttcgccggc ggtagcgcac 858901 gcaagcgggc tgggccaact gcaccaccat cttcgaatgg tcgttacgat agctttctgg 858961 cggttgcgcc atctcaaact catactcgcg caacaacacc gagaagatcg ctttgatctg 859021 catgatggcg aacgccgccc ccacgcaacg atgccggccg gcgccgaacg gaatccacgt 859081 ccagcggttg agcagatctt cctggcgcgg ctgctcgtat cgtgctggca cgaagtcgtg 859141 gggatcgggg aagtcttcgg ggatccggtt ggagatcgcc ggggaggccg ccaccagatc 859201 gccctcatga atccggtggc cttgcacctc gaactcgccc ttggccactc gcatgaggat 859261 gatcagcgga gggtgcaggc gcagcgtctc tttcagcacg ttttccagct gcggaatctg 859321 gcgcagcgca tggaaactca ccgatcggcc gtcgccgtac agctcgtcga gttcgtcgat 859381 cacggccgcg taggcgtcgc gatggcgcat caactcgatc agcgtccacg aagccgtacc 859441 cgagctggtg tgatggccgg cgaacatcat cgagatgaac atgccggtga tctcgtcggc 859501 cgagaaccgg ggagtgccgg tctcagcctt gacggcgatg agcacgtcga gcatgtcacg 859561 gtcgctcttg tcggtgggtg ggttggcgat ccggccgttc atgatgtccg caaccagtgc 859621 caccagacca ttgcgggctt cgtcgcggcg acggaagctc tcgatcggca gatacgggtc 859681 gacgtaggct agtgggtcgg tgccgcgctc caactcgtga tagagcttgg cgaatcgccc 859741 gtcgagctgg tcgcggaact tcttgccgat caggcaggcc gaggaggtgt agatggtcag 859801 ctcggcgaag aagtccagca gatcgatctc gccggcctca ccccagtcgg cgatcatccg 859861 tcggacttga tcttcgatgg tggcggcgtg gcccttcatc tgctcgccgc gtagcgcggc 859921 attgtgcagc atctctttac gccgttccgg gctggcgtcg aacaccacgc cctcgccgaa 859981 gatcggcgtc atgaacgggt atgccttggc ctggtccagg tcgtcgtcgc ccgcccggaa 860041 gaagaattcg ttggcgtgcg agccggacag cagcacgacc tgcttcccgg ccagctggaa 860101 ggtaccgacg tctccgcatt cgtcgcggac ccgttgcatc agcccgatcg gatcggtgcg 860161 gaactcctcg aggtggccgt gttcgtcgtg gccacccgaa acccggggta gtgcaacagc 860221 gctcattagc ccggcatccc ctcttcgccc agtactagct tctgacggtg cgcgggcgca 860281 tccctcagcg gggcctccgg ttgaatctcc atgttcacca cgacacaccc gcgcggcgtt 860341 tctgcgacga acgcgatagc gcgtgccagg tcgctgggtc gcaagaagta gttgtgccgg 860401 gcctgccccc actttgccca gtccgccagc attgggccga cttgttcggc cgacagctgc 860461 cagcccatac cggtcagcgt gggtcccgga tgcacgatcg atgcgcgaac accggtgcct 860521 tccaactcca tctgcaggtt ggtgaccata gcggccagac cggccttggc ggcgccgtag 860581 gcacccatat gcgggcgttg gcgcaggccc acatcggatc cgacgaagat gaggtcacct 860641 cgccggcgtg ccaccatggc cggtagcacg gccgtggcca gccggttggc accgaccagg 860701 tgtatctgaa cctgctctgc aaaggcctcg gtgctgacct cgtgcagctg tcccgggagc 860761 atgtcgcctg cactggacac cagcagttcg acctcgccga gtgcctcgac cgtttgcgcc 860821 acaaacgatt tcaccgactc gggatcggtc acgtcgaggg ggaaggctac cgcctcgcca 860881 ccgtcggcgc ggattttgtc gaccagctcg gccaacttgt ccatgcggcg ggcccccaag 860941 gcgaccggaa acccgcggcc ggcgagttcg gttgcggtgg ccgcgccgat gcccgacgat 861001 gcgccggcga cgacggtggt ccgccgggcg gggtgaggtt cgaagcgtgg cattacctgg 861061 cctgcacgct gatcggcaga tgggcaaatc cgcgcacgtt gctggaatgg acgcgcacga 861121 cgttgtcgtc gtcgacttcg tagttgcgga tccgacgcag cagcgcgccc agggccaccc 861181 gggcttccat ccgggccagg tgagccccca gacagaagtg ggcaccgctg ccgaaactga 861241 ctagtttgca gccgatttcg cggccgatgc gatagtcgtc cgggtcgtcg aacacccggt 861301 cgtcacggtt ggccgatccc ggtagcagca gcaacacctc accctcgggg atcgtggtgt 861361 cgtacaacgt gagatcgtgc gcgacggtgc gggccagaat ctggctggac gtgtcgtagc 861421 gcagggtttc ctccacccac atcggaatcc gggagtggtc ggcgaatacg cgggccagct 861481 ggccagggtg gtgggcggcc cagtagacgg cattggccag tagcttggtg gtggtctcgt 861541 tgccggcgat caccatgaga aacaggaacg ccatgatttc ctggtcggaa agccggtcgc 861601 cgtcgagctc ggctgccagc agtgccgacg tcagattgtt cgcgggccgc cgccggaatt 861661 ccgcgatcag gtcagcgtaa tatctcatca gctcgatcga cgccgccatc gccggcgggg 861721 gcacatcggc cacgccgtcc tcgcggtgca gcaccgcatc ggccagcgcg cggatgcggg 861781 cccggtcggt gtcgggcacg cctatcagct ctgaaatcac atccatcggc agcttgccag 861841 cgaattctgc tacgaaatcg aaactttcgg tttgcagggc cgaatccagg tgaatgcggg 861901 caagttcgag cacctgcggc tcgagttcac ggatccgccg tggggtgaag cccttggaca 861961 ccaaggtacg catccgcaga tgtgcggggt cgtccatggc cagcatcgac attacccggt 862021 acgcctcaga agtgcgtgag gacggatcca gggatacccc ataggcattc gacaacgccg 862081 tgctgtcccg gaagccttgc agcacgtcgt ggtgccgcga caccgcccag aaattgcgtt 862141 cctcgttacg gtacagcggg gcctcgtccc gcagccgacg ataatacggg tacgggtctt 862201 cgtgaaagtc gtagtcgtag gggtccagga ccagttcggg gtcaccgacg cggacggtca 862261 ttcgctgcca ccagtgctcg gctcgttagc tccggccaag atcaggccca ccacgtaccc 862321 gagtcgatcg gcgatctcgt ggtaggtgaa ggtgccgctg ccggcctgta cgagcgctcc 862381 gaagaacgcc atctcgagtg cgaacacggt accgggatcg gcgccaggtc cgatcgccga 862441 tgtgatgcgg cggtggatct cggcgccgat tcggtcgcgc accgcacgca ccgcggggtc 862501 ggcgccgccg tcgagcagcg ccgccgtgca cgccgcgccg atttcgggtt cgtcggcaac 862561 caccagcgcc aggtgtcgca acgagctcgt cacccggata ggcatcggga cgttgacgtc 862621 ggtgacgcag gggacctggc ggaccaggtc gaggtagacc tcggcgatca gatggttctt 862681 cgacgagaag tatgtgtagg ccgtcgccgg ggctaccttg gcgcgggccg ccaccaggcg 862741 caccgtcagg tcggcgtatg acttctcccg cagggtcgcc atggcggcag ctagcacctt 862801 gcggaaggtt gcctgctggc ggcggttgcg tgacaccgct tcggcgtggg gttcggtttg 862861 gcgctgggcc ggggtggtaa ccagtacatc gctggacaca tgtccaagct atcggatggt 862921 cgcggcagga ggcaagccag tctgctaaac atgcagctaa catgggactg tccgcgacgc 862981 gacgtggccc ctggtgcatc ggtcaggacg gtgtagcggc cttgcggata cggtctcgat 863041 gaggcaatat cggacaagtg tccaatcgat gatgagagtc ggagaagttg ggagcggtag 863101 atggccctgt ggggcgacgg aattagtgcg ctgctcatcg acggcaaact atcggacggc 863161 cgtgcgggca ccttcccgac ggtcaatccg gccaccgagg aagtgctggg agtcgccgcc 863221 gacgccgatg ccgaggacat gggccgcgcc atcgaggccg cgcggcgggc gttcgactcg 863281 accgactggt cccgcaatac cgaacttcgg gtgcggtgtg ttcggcaact gcgcgacgca 863341 atgcaacagc acgtcgaaga actacgcgaa ctgacgatct ccgaggtggg cgcgccgcgg 863401 atgctcaccg ccagcgccca gctggaaggc ccggtcgggg atctatcgtt tgcggcggac 863461 acggccgagt cctacccgtg gaagcaggac ctcggcgagg catcgccgtt gggcatcgcc 863521 acccggcgca ccctcgcacg ggaggccgtc ggtgtcgtcg gcgccatcac cccgtggaac 863581 ttcccgcacc agatcaatct cgccaagcta ggtccggcgc tagccgcggg taacaccgtc 863641 gttttaaagc cggcgcctga cacaccgtgg tgcgcagcag cgctcgggga aatcatcgtc 863701 gagcacaccg acttcccacc gggcgttgtc aacatcgtca cctccagcag tcacgctttg 863761 ggggcgctgt tggccaaaga ccctcgggtg gacatgattt cgttcaccgg ttctactgcg 863821 accggccgtg ccgtaatggc cgatgccgcg gccaccatca aaaaggtttt tctggaactg 863881 ggtggcaagt cggcgttcgt cgtgctcgac gacgctgacc tagccgctgc cagcgcggta 863941 tcggcgttct cggcttgcat gcacgccggg caggggtgcg caatcacgac ccggctggtg 864001 gtgccacggg cccgttatga agaggcggtt gccatcgcgg cagccaccat gtcgtcgatc 864061 aggcccggcg atcccaacga ccccggaacc gtttgcgggc cgttgatttc ggcccgacaa 864121 cgggatcgtg tgcagggcta cctcgacctg gcggtcgccg aaggcggaag gttcgcatgc 864181 ggtggcgcgc ggccggcgga tagagaggtc ggtttctaca tcgagcccac ggtcatcgca 864241 gggttgacca atgacgccag agtcgcccga gaggagatct tcggaccggt gctcacggtg 864301 attgcccacg acggtgacga tgatgcggtg cgcatcgcca acgactcgcc atacggcttg 864361 tcgggcaccg tgtatggcgc cgacccgcag cgcgccgcga ggattgcctc gcggctgcgg 864421 gtaggcaccg tcaacgtcaa tgggggtgtc tggtactgcg ccgacgcgcc gttcggcggc 864481 tacaagcaat ccggtatcgg acgcgagatg ggtctcctcg gcttcgagga gtacttagaa 864541 gccaaactca ttgctaccgc tgcaaattag ctagcgggtt gacagcgcag aaaggaagcc 864601 atgttcgaca gcaaggtggc tatcgtcacc ggggctgccc agggtatcgg gcaggcctac 864661 gctcaggcgt tggcccgcga aggtgcctcg gtggtcgtcg ctgacatcaa cgccgacggt 864721 gccgcggcgg tagccaagca gattgtcgcc gacggcggta ctgtgattca tgtgcccgtt 864781 gacgtgtccg acgaggattc cgctaaagcc atggtcgacc gcgccgtcgg tgctttcggc 864841 ggcatcgact atctggtgaa caatgcggcg atctacggtg gcatgaagct cgatctgttg 864901 ttgaccgtgc cgttggacta ctacaagaaa ttcatgagcg tcaaccacga cggcgtgctg 864961 gtgtgtaccc gcgcggtgta caagcacatg gccaaacggg gcggcggcgc gattgtcaac 865021 cagtcctcga ccgcggcctg gctgtattcc aacttctacg gcctggccaa ggtcggtgtc 865081 aacgggctga cgcagcagct ggcccgcgag ctgggcggaa tgaagataag gatcaatgcg 865141 atcgcacccg gaccgatcga caccgaagct acccgcaccg tcacccccgc agagctggtc 865201 aagaacatgg tgcagaccat cccgctgtcg cggatgggta caccggagga tctggtgggc 865261 atgtgcctgt tcctgctgtc ggattcggca tcgtggatca ccgggcagat cttcaatgtc 865321 gatggcggac agatcatccg gtcatgaccg gcgccggcgc cgatgcagag cggggcgatg 865381 aggtgggggc acgcccccac aagtgggagg tacccccatc cgctggcggg ggagagcggc 865441 gctcatgacc gctcacccgg agacaccacg cctgggatat atcggcttgg gtaatcaagg 865501 cgcgccgatg gctaagcgtc tgctcgattg gcctggcgga ctgaccgttt tcgatgtgcg 865561 ggtcgaggcc atggcaccgt tcgtcgaggg cggcgccacc gcagcggcaa gcgtctccga 865621 cgtcgccgaa gccgacatca tcagcatcac cgtgttcgac gacgcgcagg tgagttcggt 865681 gatcaccgcc gacaacggac tggcgacgca cgccaagccc ggcactattg tcgcgattca 865741 ctccaccatc gccgacacga cagcagtcga tctggccgaa aagctcaagc cgcaggggat 865801 ccacatcgtg gatgcaccgg gcagcggcgg cgcggcggcg gccgccaagg gtgagttggc 865861 cgtgatggtc ggcgctgacg acgaggcgtt ccagcggatt aaagagccat tttcgaggtg 865921 ggcttcgctg ttgattcatg ccggggaacc gggcgctggc acccggatga aactggcgcg 865981 caacatgttg actttcgtct cttatgccgc cgccgccgag gcgcagcggc tggccgaagc 866041 ctgtggctta gacctcgtgg cgctcgggaa ggtggtgcgg cacagcgact cattcaccgg 866101 cggcgcggga gcgatcatgt tccgcaacac cactgcgccg atggagccgg ctgacccgct 866161 gcggccgttg ttggagcaca cccgcggcct gggtgagaaa gacctgagtc tggcgttggc 866221 cctgggcgag gtggtatcgg tcgacctgcc gctggcccag ctggcgctgc aacggctggc 866281 cgccggcctc ggggtaccgc acccggacac cgagccagca aaggagacat gatggacgag 866341 ctgcgccgca ccggcctgga caaaatgaac gaggtttacg cctgggacat gcccgacatg 866401 ccaggtgagt tttttgccct gaccgtcgat cacctattcg gcaggatctg gacccgtccc 866461 ggcctgtcca tgcgggaccg gcggatggcc gtgatcgcgg tgctgaccgc tcaaggccag 866521 tcggatctgc tcgaggtcca agtcaacgcc gtcctgcata acgacgaact caccatagac 866581 gagctgcgtg aactcgctgt gttcattacc cactatgtcg gcttcccgct gggctcgcgg 866641 ctgaacagtg cgatcgagcg ggtagcggcc aagcgtaagc aggcggccga gaacggctcg 866701 ctgcccgaca cgaaagccaa cgtcgccgaa gttcttgcta aggaatctgg taaatcgagc 866761 tagtctgacg tgtcgtgcgc gtcctggtaa tcggttcggg tgcccgcgaa catgcgctat 866821 tgctggcgct cggcaaagac ccgcaggttt cggggctaat cgttgctccc ggcaatgcag 866881 gcaccgctcg gatcgccgag cagcacgacg tcgacatcac ctccgccgag gcggtggtcg 866941 ccctggctcg cgaagtcggc gctgacatgg tggtgattgg ccccgaggta ccgttagtgc 867001 tcggggtggc cgacgccgtg cgcgcggccg gcatcgtgtg tttcgggccc ggtaaggacg 867061 cggctcgcat cgaaggctcc aaagcattcg ccaaggacgt catggcggcg gccggtgtgc 867121 gcaccgcgaa cagcgaaatc gtagacagcc cagcgcactt ggacgcggcc ctggaccggt 867181 tcgggccgcc tgccggtgac ccggcctggg tggtcaaaga cgaccggcta gccgccggca 867241 agggtgtggt ggtgacagcg gaccgcgatg tcgcgcgcgc acacggagct gccctgctcg 867301 aggccgggca cccggtgttg ctggagtcct acctggacgg cccggaggta tcgctgttct 867361 gtgtcgtcga ccgcaccgtc gtggtgccgc tgctgccggc acaggacttc aagcgagtcg 867421 gtgaggacga caccggactt aacaccggcg gtatgggcgc ctacgcgccg ctgccgtggt 867481 tgcccgacaa catctatcgg gaggtggtca gccggatcgt cgaacccgtt gcggccgaac 867541 tagtccggcg tggaagctcg ttttgcggat tgctgtatgt tggtctcgcg attaccgccc 867601 gcgggccggc ggtggtcgag ttcaactgcc gattcggcga tccggagacc caagccgtgc 867661 tggccttgct ggagtctccg ctcggccaac tgcttcatgc cgccgctacc gggaagctgg 867721 ccgatttcgg cgagttgcgg tggcgtgacg gtgtggccgt aacagtggta ctggcggccg 867781 aaaactatcc cgggcgcccc cgggtcggcg acgtcgttgt cggctccgaa gccgaggggg 867841 tgctgcacgc cggaaccacg cggcgcgacg atggcgcgat cgtttcgtcc ggtggccggg 867901 tgctgtcggt ggtgggcacc ggtgccgact tgtccgcagc acgcgcacac gcgtatgaaa 867961 tcctcagttc aattcggttg ccaggaggtc atttccgcag cgatatcggt ttacgggcgg 868021 ccgaggggaa gatcagcgtc tagcaggctg cggcttggcc atcacggcgg ggatcgctgg 868081 ccgcgaggta cccatcgtcg agccgccaga ttgcctgaca actcccgaac tggctgtagt 868141 ccgctactgc gaccaagtca tgcccacgct gccgcagttc atcgagagtt gaatccggga 868201 agccgttttc gaaactgacc cgcataccgt tcacccagcg gaaccgaggg ccgtcacagg 868261 ccgcctgggg gttctggccg tagtcggcga tgcgcaccag cacctgcacg tgaccctggg 868321 gttgcatcat gccgcccatc accccgaagc tcatcaccgg cgcaccgtcg cgggtcacaa 868381 aacctgggat gatcgtgtga taggggcgct tccgtggccc aacccggttc ggatgtctcg 868441 gcaccacagt gaaatccgag ccgcgattgt gcagcgaaat gccggtgccg ggcaccacca 868501 caccggagcc gaacccaagg tagttcgact gaatcatgga caccatcatt cccgcagcat 868561 cggcggcggc cagatagacg gtgccgcctc gcgggatgcc ggtggccgcc ggcattgccc 868621 tctttggatc gatcagcgtg gcgcgctgcc gcagatactc cttgtcgagc aggcgcttcg 868681 ggtgcaccgg catgtagtcg atgtcggcga cacacgcttg cgcgtcggcg aaggcaagct 868741 tcagtgcttc gatctgcacg tgcacacttt cagcggaatc cactgaccac gatgacatat 868801 cgaaatgctc gaggattccg agggcgatca aggccacgat gccctggccg ttgggcggta 868861 tctggtggat ggtgtacccg cggtaggttc ccgtgatcgt gtcgacccag tccacgcgat 868921 gggcggcgag gtcgtcggca cgcatcaccc cgccgtttgc cgccgagtgc gcctcgagtt 868981 tggcggccag ctctccccgg tagaactcct caccgttggt cgccgcgatc ttctctagcg 869041 tcgccgcgtg gtcaggaaag gtaaacagct caccgggttt cggcgctcgt ccgccgggca 869101 tgaacgcatc ggcgaatccg ggctgggatg cgaacaacgg cacctgtgcc gcccattgtg 869161 ccgcgacggt cggtgagacc agaaagccgt tgcggccgta cgagatggcg ggctcgaaga 869221 gtgtttcgaa tggtagcctg ccgaacctgg cgtgcagttc cacccaggcc gacaccgcac 869281 cgggcaccgt cacggagttc cagccgagca cgggaacggc gttgccgccg aagtactctg 869341 gcgtccacgc cgagggtgag cggccggacg cgttcaggcc gtgcagtttt tgcccgtccc 869401 agacgatgct gaaggcgtcc gagccgatgc cattggacac cggttccacc acggtgaggg 869461 tgatggctgt ggcgacggcg gcgtcgaccg cgttgccgcc gtcggccagc atccgaagac 869521 ccgcttgcgc ggccagcggt tgtgacgtgc acacgacgtt tgtcgccagg atgggcatgc 869581 gcggccaagc gtaggggaag gtccaaccaa acggcgtgct cacgccgctt aacctgtgag 869641 cagcggcgcg aaccaggtca gctcggcggg tagctgtgcg ctccagaacc caccgttgtg 869701 cccgccaggg gagaagccgc ccgccggcgg gtggggcagc tgcgccacga actgcttggt 869761 tgcggcataa aacggatcgc tgttgccgca atcgacccgg atcgggatgg accccaatgc 869821 ggggagtccg aaaaccgagt tcgccgacca gtcgtcgggt ccgtcgaagg agccgggtgc 869881 gacggcaccg gcggatagcc acagtgccgg gctgaccgcg cagatcgctg cggtgcgtgc 869941 cggtccaagg cggctgccga gcagcaaagc gccgtagccg cccatcgacc agcccagaaa 870001 cgctacccgg gaggtgtcca gccgctgggt gtccaatagc ggaatgagct cgttgagcac 870061 cattgccccc gcgtcctcgc cagaagcccg ctggtgccag tagctgctgc ctccggccac 870121 ggagaccacc gcgaacggtg gcaacccggc gttgacggcc tgggccaggc cctgctcgac 870181 gccgccgtcc atcacggccg atgcgctacc gcccaagccg tgcagtgcga tcacgggccg 870241 caacgcctgg gtctggccgg gtgggcgggc gatggcccag ttggtcatct tcccggcgcg 870301 cgctgccgac acgaacgagc cggtggacat cgtcggcgcc gcctgagccg ggggggccgg 870361 atcgagtgct ggtgtcggcg ccaatggaac gtttgtgcca atcgccgccg ccggtgcggc 870421 atgtgaagtt cggggctgca acagcatgtc gatcgcatac gctgaggtag cgccaaggac 870481 cgtgccggcg ccgagaccga gcacggcgcg gcggctcaac tctggcatgc gggccatcat 870541 gccatggacg tttggccgaa ttggcaatgc agtaccactt tgactggcag catggatggg 870601 cgtgacagca gcggtcactc caaaaggaga acgtcggcgg tatgcgttgg tcagcgccgc 870661 cgcggagctg ctcggcgagg gcgggttcga ggcggtacgc caccgggcgg tggcgcggcg 870721 ggccggtttg ccgttggcgt ctaccaccta ctacttctcg tcgctcgacg atttgatcgc 870781 tcgcgcggtc gaacacatcg gaatgatcga ggtggctcag ctgcgagccc gggtcagtgc 870841 gctgtcccgg cgacgtcggg ggcccgagac caccgccgtt gtgctggttg acctgctggt 870901 gggggaaatg tccagtccgg ggcttgccga gcagctgatc tcacgatacg agcgccatat 870961 cgcctgtacc cgcctgcctg acctgcgcga aagcatgcgc cgcagcctgc gtcagcgcgc 871021 tgaggccgtg gccgaggcca tcgagcgctc cggccgctcc gcacagatcg aactggtgtg 871081 tacgttgatc tgtgcggtcg acggatcggt ggtctcggcg ctggtcgaag ggcgggaccc 871141 gcgtgccgct gcgctggcga cggtggtcga cctcatcgac gtgctcgcgc ccgtcgacca 871201 gcgtccggtg ccgttctgaa gtcggtgggc agcgacggcg tgacaatgta cccggtggtg 871261 aagtccccat agatcgtgac atcggcgggc cggcgttggg cgtacaacgc cacgtaggcg 871321 catacgacgg cgtcgatcgg atcctcggcg gcccgcaggt cgctttttcg ctgcgcgacc 871381 gtcacctgcc ggcgcaacga gacccaatcc ggctgaccgg ctacctgcat ccgaaccccg 871441 gcctgggcga gcccctcgac gccgtccatc agtcgcaata gctccgattt gagcaggtca 871501 acgctgcgtc ccggcttggc cttgtacttc agcgcgcggg gtagccgaaa cagcgccacc 871561 gtagccgggt gcggatagac ctcgatggcc cgccgcgtgg cggacgaaag aggatccata 871621 tccagcgcca gttggcgggc cagccgggcg gcgcgtggaa cgtcggcaaa ctcgggcttt 871681 tcggtgttgg ccggatacgc gccggcctcg aattgtcgga agtctcgatt cagtgcggcc 871741 tccgccggcc gctggccggt gcggttggcc accaccagcg gcgcgtcgaa ggcgaccagg 871801 caatcgccca caacgtaggg ccgcagcgcc gccagcacgg aggcatcgtc gcgagcggca 871861 ccgaccccca ccagacaccc gtccgcgtcg acagccgcga caccggtcgg attgcggccg 871921 gcccaggcga ggtccacgcc gacgaagtac atctgcccag ggtatggcgg ggccgcggcg 871981 tatgtgctgt ggtgtcacat ccgtcacttg cgcctctgtc agagggatgc gcgttgtgcc 872041 cgtctcatag cgacatcgcc cgggcggcac cgggaccggg cgttgccgag ttgtcgcgat 872101 gagtcgggca catcgggtgc tccctggcgc cgggactcgt gtgacaactg cgactactag 872161 gcccgcgacc gtaagctgtg tctttgtgag ggccaagtga gcattcccaa cgtgctggcc 872221 acccgatacg ccagcgccga gatggtcgcg atctggtcgc cggaggccaa ggtggtctcg 872281 gagcggcggt tatggctggc cgtattgcgg gcacaggcag agctgggggt agcggttgcc 872341 gattcggtgc tcgccgacta cgaacgtgtg gtcgacgatg tggacttggc ctcgatctca 872401 gcccgggagc gggtgctgcg ccacgatgtc aaggcccgca tcgaggaatt caacgcattg 872461 gccggtcatg agcacgtgca caaggggatg accagccgcg acctgaccga gaacgtggag 872521 caactgcaga ttcggcggtc gctggaagtg attttcgccc atggggtggc ggcggtggcg 872581 cggctggccg agcgggcggt gagctaccgt gacctgatca tggccgggcg cagccacaac 872641 gtggccgctc aggccaccac cttgggcaag cggttcgcct cggcggccca agagatgatg 872701 atcgcgttga ggcggttgag ggagttgatc gaccgctacc ccctgcgtgg catcaagggc 872761 ccgatgggca ccggtcagga catgctcgat ctgctgggcg gtgaccgtgc ggcgctggcc 872821 gatctcgagc ggcgcgtcgc cgacttcttg ggctttgcaa ctgttttcaa cagcgtgggg 872881 caggtgtatc cgcgttcatt ggaccacgac gtggtttcgg ctctggtgca gctcggcgcg 872941 gggccgtcat cactggcaca cacgattcga ttgatggccg gccacgagct cgccaccgag 873001 ggtttcgcgc cgggtcaggt cggttcgtcg gcgatgccgc acaagatgaa cacccgcagc 873061 tgcgaacggg tcaacgggct gcaggttgtg ctacgcggct atgcatccat ggtggccgag 873121 ttagccggtg cacagtggaa cgagggtgat gtgttttgct ccgtggtgcg ccgggttgcg 873181 ttgccggaca gcttctttgc cgtcgacggg cagatcgaga cgtttttgac ggtgctggac 873241 gagttcggcg cctacccggc ggtgatcggc cgcgagttgg atcgttatct gccgttcctg 873301 gccaccacta aggtgctaat ggcggccgtg cgcgcgggga tgggtcgcga gtccgcgcac 873361 cggttgatct ccgagcacgc ggtggcgacg gcgctggcca tgcgagaaca cggcgcggag 873421 cccgacctgc tggaccggtt ggccgccgat ccgcggctgc cgctgggacg agacgctttg 873481 gaggccgcgc tggccgacaa gaaggcattt gccggtgccg cgggtgacca ggtcgatgat 873541 gtggtcgcga tggtggacgc gctggtgagc cgttacccgg acgcggctaa atacacgccg 873601 ggtgcaattc tttagtgtca tgactaccgc cgccgggctt tcgggcatcg atctgaccga 873661 tctggacaac ttcgccgacg gcttccccca tcacctcttc gccatccacc gtcgtgaagc 873721 gccggtgtat tggcatcggc cgaccgagca caccccggac ggggagggct tctggtcggt 873781 ggctacctac gccgaaaccc ttgaggtgtt acgtgatccg gtgacctatt cgtcggtcac 873841 cgggggccaa cgtcggtttg ggggcacggt gctgcaggat ctgccggtcg ccggccaggt 873901 gctcaacatg atggatgatc cccggcacac ccgtatccgg cggttggtca gctcgggctt 873961 gacaccacgg atgatccggc gggtcgaaga cgatctgcgc cgccgggcgc gtggattgct 874021 cgatggcgta gaacccggag cgcctttcga cttcgtggtc gagatcgctg ccgaattgcc 874081 catgcagatg atctgcattc tgctgggtgt gccggagacg gatcgacatt ggttgttcga 874141 ggcggttgag ccgggattcg atttccgcgg ctcccgcagg gcgacgatgc cgaggctgaa 874201 cgtcgaggat gccggatcgc ggttatacac ctacgcattg gagctgatcg ccggtaaacg 874261 cgccgaacct gccgacgaca tgctgtccgt cgtcgccaac gctaccatcg acgatccgga 874321 cgcgccggcg ctgtccgacg ccgaactgta cctgttcttc catctactgt tcagcgccgg 874381 cgcggaaacc acccgtaact ccattgccgg cgggctgctg gcgctggccg agaaccctga 874441 ccaactgcaa acgctgcgaa gcgattttga gttgttgccg actgcgatcg aagagatcgt 874501 gaggtggacg tcgccgtcac catcgaagcg gcgcacggcg tcccgtgcgg tcagcctggg 874561 cggccagccg atcgaggcgg gtcagaaggt tgtggtgtgg gagggctcgg ccaaccgtga 874621 tcccagcgtg ttcgaccgcg cggacgagtt cgatatcacc cgaaaaccca atccgcacct 874681 gggtttcggt cagggggtgc actattgcct gggcgccaat ctggctcggc tggaactgcg 874741 ggtgctgttc gaggaactct tgtcccgctt tggctcagtg cgggtggtgg aacccgcgga 874801 atggacacgt agcaaccggc ataccggcat ccggcaccta gtcgttgaat tgcgcggagg 874861 ctagtccccg cgcagcggga ttccggcggc ccgcaactcg agcgcggcca gcgcacgcat 874921 ggtggcggga tcctctcgtc gccaggcgcc gaccggatcg gtgctgacgg cggccagttt 874981 gcccggcggc cggttcgcca atgcgcgcag cgccagcagc tggcgaccgg ctggggtcgc 875041 cgccagggtg gtaacggtcc acttgcgccg gcagaaccgc agccgcagga acagccaggg 875101 catggccacg gcaagaatcg gcgtcgcggc gaccgccagc gcgagcacta ccgcaagcca 875161 gccggccgtg gtgtccaggt tgtggccggc gccggcgatg tcaagggcgg cctggcttgc 875221 ggcggtgatg gggttgctga gcgcgtcgcc caccaccggg atacgctggg cgtcctggcc 875281 cgcggccgcc aggttgccgg caatcccgtg cgagccgatt tcgatttggc ggccggcctc 875341 gccgattatc gagatggcgt cgtgcacggc gaggccgacg agcatccata gcgtcgtcca 875401 caccgcgaca gtgatatcgc tgatcagttg ggccagcagt cggccgggcg tggtggcata 875461 cggcaagaag cgcgatctca taccagagat accagcacag ggcgccgtcg tgcggcggat 875521 aggctggcgc gatgcgcccc gcattgtccg actaccagca tgtggccagc ggtaaggtcc 875581 gcgagatcta ccgtgtcgat gacgagcacc tgctgctggt tgccagcgac cggatctcgg 875641 cgtacgacta cgtcctggac agcaccatcc cggacaaggg ccgcgtcctg accgccatga 875701 gcgcattctt cttcgggctc gtcgatgccc ctaaccatct ggccgggccg ccggacgacc 875761 cgcgtatccc cgacgaggtg ctgggccgcg cgctggtggt gcgtcggctg gagatgctgc 875821 cggtggaatg tgtggcccgt ggctacctga ccggttcggg gttactggat taccaggcaa 875881 ccgggaaggt atgcggtatc gcgctgccgc cgggcctggt cgaggccagt cggttcgcca 875941 caccgctgtt caccccggcg actaaagccg cgttggggga ccacgacgag aacatctcgt 876001 ttgaccgggt ggtggagatg gtaggcgcgt tgcgtgccaa ccagctgcgt gatcgtactc 876061 tgcagacgta tgtgcaggcc gccgatcacg ctctcacccg cggaatcatt atcgccgaca 876121 ccaagtttga atttggcatc gaccgccacg gcaacctgct gctggccgac gaaatcttca 876181 caccggactc gtcgcggtac tggcctgccg acgactaccg ggccggcgtg gtccagacca 876241 gcttcgacaa acagtttgtc cgcagctggc tcaccggctc cgagtccggc tgggatagag 876301 gcagcgatcg gccgccgcct ccgctccccg agcatatcgt cgaggccacg cgtgcccgtt 876361 atattaatgc atacgaacgg atttccgaac taaaattcga cgactggatc ggccctggcg 876421 catgatgcac cgaaccgcac taccctcacc gcccgtggcc aagcgggtgc agacccgccg 876481 ggagcaccac ggcgacgtct ttgtcgaccc atatgaatgg ttgcgcgaca aggacagccc 876541 tgaagtaatc gcctacctcg aagctgaaaa cgactacacc gaacggacca ccgcgcacct 876601 tgagccattg cggcaaaaga tcttccacga aatcaaagcg cgtaccaagg aaaccgactt 876661 atcggtgccg acgcgacgtg gcaactggtg gtactacgcg cggacctttg agggaaagca 876721 gtatggcgta cactgtcgtt gcccggtaac cgatcccgac gactggaacc caccagagtt 876781 cgacgagcgc accgaaatac ccggtgaaca gcttctgctc gacgagaacg tggaagctga 876841 cggccacgac ttcttcgcac tgggcgcggc cagcgtcagc ctggacgata acctcttagc 876901 gtattccgtt gatgtcgtag gtgacgaacg atataccttg cggttcaagg atttacgcac 876961 cggagaacag tacccggacg agatcgccgg gatcggagcg ggagtcacct gggcagctga 877021 caaccgcact gtctactaca ccaccgtgga cgcggcctgg cgtccggaca cagtgtggcg 877081 ataccgacta gggtccggcg aatcgtcgga gcgggtttac cacgaagccg atgatcggtt 877141 ctggctcgcg gtggggcgta ctcgcagcaa cgcctatctg ctgattgcgg cggggtcgtc 877201 catcacttcg gaggtccgtt acgcgcacgc ggcagatccg acagcgcagt tcagcgtggt 877261 gctgccgcgc cgcgacggcg tcgagtactc ggtggagcat gcggtcatag ctggccagga 877321 ccggtttctg atcctgcaca acgacggcgc ggtgaacttc acactggtag aggccccggt 877381 cgaggatcct gcgcggcaac gcaccctcat cgcccaccgc gacgacgtcc gactcgacgc 877441 ggtggatgcc ttggccggcc atctggtagt cagctatcgg cgcgaggcgc tgccgcgggt 877501 tcaactgtgg ccgatcgggc ctgacggaaa ctatggtgag cccgaagaga tctcgttcga 877561 ctccgagctg atgtcggccg gactggggcc caaccccaac tgggattcgc ccaaactgcg 877621 ggtcggtgcc ggatctttcg tcaccccggt gcggatctac gacatcgacc tggtcactgg 877681 cgagcgtacc ttgctgaaag aacagcccgt actgggcggc taccgccgcg aagactatgt 877741 ggagcggcgt gactgggcgt acggagacga cggcacccgg atcccggtct cgatagtgca 877801 ccgagccgat atcgaattcc cggcacctgc gttgatctat ggctacggcg cctacgagat 877861 ctgtgaggat ccgcggtttt ccatcgctcg gttgtcgctg ctggatcgcg ggatggtgtt 877921 cgtcgtcgcc cacgttcgcg gcggcggtga gatgggcagg ctgtggtatg aaaacggcaa 877981 gctactggac aagaagaaca cgttcaccga cttcatcgcg gtggcaagac atctggtgga 878041 cacgggactt acttcccagc agcagctggt ggcattgggg ggtagcgcgg gcggtctgct 878101 gatgggcgcg gtggccaaca tggcaccgga tctcttcgcc ggaatccttg cgcaggtgcc 878161 gttcgtggac ccgctgacca ccatcttgga tccatcgttg ccgctgaccg tcaccgagtg 878221 ggacgaatgg ggaaatccgt tgaacgacag cgatgtctat gcctatgtga aatcgtattc 878281 gccgtacgag aacgtcacgg cccaaaagta cccggccatc ctggcaatga cgtcgctgaa 878341 cgacaccagg gtctattacg tggagccggc caagtgggtg gccgcgttgc ggcacgccaa 878401 gaccgacggc aattccgtgc tgttgaagac ccagatgcac gccggtcatg gtgggatcag 878461 tggccgctac gagcgctgga aggagaccgc gtttcaatac gggtggttgc tagctactgc 878521 cgacagcgac cgttacggcg gcggccaggg aaacgacctc gatggcgctg cgccagcata 878581 gccggtggga tcggccattc gggatgcgta gacattggct ccgaacatgg ccagcatcag 878641 cgccagcgag cataccgccg ctgccatgcg ggtgtcgggc agcaacaggc ccacggcgac 878701 caggagcctc cagcgcaccg gtgatggtga ccagcaggcc gggcgcaagc agcccgggtg 878761 aaacgatggc gatgaggtgg ccgcgcaggg gcggcgtgaa gtgagctcgg ctgctcgacg 878821 gctccgattc cgaactggtc gacgccgaga ccgccgctgc cgccgagctg gcgcgcgggg 878881 tggcggcgct gcgcgatccc aacgcccggg cgaatccggc gggtgccgag ctggcgacct 878941 ggtcgctggt gcacggcttt tcgacgctgt ggctcgacga tgcggtcaac gctgacgtga 879001 agcagacgtc atgcggatag caacggtgct cttcgatgac tagcctgctg tttcggcagg 879061 aatgccgcgg ggatcagcgt cgagaccact agcgcggtcg ctatcacgaa taccaccgcg 879121 taggcgtgcg aaaggtcatg cagcagttgg gccgcgaagt tggtttggcg cggtagcgag 879181 gaagggtcaa ccgccgcccc ccgcccggcg ccactctctg gggtcagtgc gactttcttt 879241 gcagtagcga tgatttcgct gtgattgaac tggtaggtga gcagcaccga catcagtgcg 879301 gtccctatcg aaccgcccac ctgctggttg acgctgatca gcgtcgaacc gcgagcgatc 879361 tgatgtgggg ccagggtctg cactgccgcc ccggacagtg gcatcatgga gcagcccatg 879421 cccatgccca tgattgccag cccggtcggc agaatgggta agtagtccgc ttgccgcgcg 879481 acaccaaagg cgaaggtgcc caaccccgca gcgatcagca tgatcccaac cagcacgatc 879541 ttggccggtc cccgtcggtc catcatcgct ccggcgatcg gcatcgccag catggcaccg 879601 aggccctgtg ggatgatatg cacccccgat tgcatcggtg attggtgcaa cacttgctgg 879661 aggtagctcg ggagcagcaa gaaggagcca aacagcccga gggagagcac cgtcatcgtc 879721 atgttggcct gcgcgaccgc tcggttctgg aacaagcgca tgtctatgag cggatgttct 879781 gtgcggtacc acgaatgtgc gacgaatgcc gcgatcaacg ccaggccggt gatcgccggt 879841 atcaacacgt gccgatcggc catcgttcca cgggcggggc tagatgacac cccgaacagg 879901 aaggtcgcca aacccggcga cagcaacaag aggcccatgt agtcgaagtt ttccgacgct 879961 gccgggcgat ctcttgggaa cacgatcgcc gccaagacga gcgcggacag cccgaccggc 880021 aggttgacca agaaaatcca acgccagccg taggccccga tgagccaacc acccaggatc 880081 ggcccaccga ccgggccgag cagcatcgga atgcccacca ccgccatcac gcgccccagc 880141 cgcttcgggc ccgcctcacg ggccaagatg gcaaaggaca ccggcgtcag catgccccca 880201 ccgaaaccct ggacaacacg aaatatgatg agcagcaaga tgtttggtgc tactgcgcac 880261 agcagtgagc cgagggtgaa cgccaatacc gaacccatga aaagccgcct ggtgccgaac 880321 cggtcggccg cccaaccggc tgtcgggatc acagtggcca acgcgagcat gtagccggtc 880381 atggtccagg ccacgacggc ctgggtggac ccgaaatcgg caacgaaggt gcgttgcgcg 880441 acgctgacca cggtgacgtc cacatgtgcc atcaccgagg ccaggacaca cactccggcg 880501 gtccgaagca accccacatc gagcctatcg ggatagctgc gttggccaga gcggggccgc 880561 cccgcggggg tgatgggcac cggggcatcg ccttccgcgg gacacgcttc aaccatggcg 880621 ttgccgagca tatcgatacc ggtcacgggt accgcgcgag gatgtcgggc ggtgcttggt 880681 tccggcgtcg ggtcatggcc ctggcgccga gccgacgtgc gctcgttctg cgctggtcag 880741 ggtccagata tacgcctgct gtccgcgtgt ccttcaccgt ccggaaacct ggaatcggca 880801 gactgcaagc gtgtctggaa aactgctcgt gtcggtctcg gggataggtg agagcaccct 880861 ggccgatgtc gacgcgttct gcgcggaaat ggacgcccgc tcggtgccgg tatcgttgct 880921 ggtggctccg cgtatgcgcg atgactaccg gctcgaccgc gacccacgca ccgtcgactg 880981 gctgaccggt cgccgggccg ccggcgacgc tctggtactg catggctacg acgaagcggc 881041 caccaagagg cggcgcggcg aattcgcaat gctgcgcgca cacgaggcca acctgcggct 881101 gatggccgcc gaccgggtgc tcgaacacct tgggctgcga acccgactgt ttgcggcacc 881161 gggctggctg gtatcaccag gtgtccgtac agcgttgccg gccaatggat ttcggctgct 881221 tgcggatctc catggaatca cggatctggt tcggctcacc accgtgcgtg cccgcgtgct 881281 gggcatcggc gagggtttcc tggcggagcc ctggtggtgc cggatggtgg tgatgtcggc 881341 cgagcggatc gcccggcgtg ggggcgtcgt ccggattgcg gtggccgccc gtcatttgcg 881401 caagtccggt ccgctgcagg cgatgctcga tgccgtcgac ctggcgatgc tgcaggggtg 881461 cacaccgatg gtgtaccggt ggcgagccga tgcggcggta ctcgacgcgg cctgaccgag 881521 cgcctgatcg gtggcgttaa cctgtaccga catgagcgat gctgtagccg gttcagatgc 881581 cgaggggctc accgctgatg ccattgtcgt gggagccgga ttagcgggcc tggtagccgc 881641 ttgtgagttg gccgaccgcg gcctacgggt gctgatcctc gaccaggaga atcgggccaa 881701 cgtgggcggg caggccttct ggtcgttcgg cggtttgttc ttggtcaaca gtcccgagca 881761 gcgccgcttg ggcatccgtg atagccatga gcttgctctg caggattggc tggggacggc 881821 ggcgttcgac cggcccgagg actactggcc cgaacaatgg gcgcatgctt acgtcgattt 881881 cgcggcgggg gagaagcgca gctggctgcg ggcccgcggg ctgaagatct tccgctggtg 881941 ggctgggccg agcgtggtgg ttacgacgcg caggggcacg gcaactcggt gccccgtttc 882001 cacatcacct ggggtactgg gccggctctg gtcgacatat tcgtgcgtca gctgcgtgat 882061 cgccccacgg tgcgctttgc gcaccgccac caggtcgaca aactgatcgt cgagggtaac 882121 gcggtgacag gcgttcgggg taccgtgctg gagccctcgg atgagccgcg cggcgcgcct 882181 tcgtcgcgaa agtctgtggg gaaattcgag tttcgcgcgt cagcggtgat cgtcgccagt 882241 ggtggtatcg gtggcaatca tgagctggtg cgcaaaaact ggccgagacg gatgggccgc 882301 attcccaagc aactgttgag cggggtgccc gcgcacgttg atggcaggat gatcggcatc 882361 gctcaaaagg ccggggctgc ggtgatcaat ccggaccgga tgtggcatta caccgaaggc 882421 attaccaact acgacccgat ctggccgcgg cacggtatcc ggattattcc ggggccgtcg 882481 tcgctatggc tggatgccgc gggcaagcgg ttgccggtac cgttgtttcc cgggttcgac 882541 accctcggca cattggagta catcaccaag tctggacatg actacacctg gttcgtgttg 882601 aatgccaaga taatcgagaa ggaattcgcg ctgtccggtc aggagcagaa ccctgacttg 882661 accggtcggc gcctgggcca gctgttgcgc tctcgggctc acgccggccc gcccggaccg 882721 gtgcaggcat tcatcgatcg tggtgtggac ttcgtccacg cgaactcgtt gcgcgagttg 882781 gtggccgcga tgaacgagtt gcccgatgtg gtgccgctgg actacgagac ggtggcagcc 882841 gcggtcactg cgcgcgatcg tgaggtggtc aataagtaca gcaaggatgg acagatcacc 882901 gcgattcgtg ccgctcgccg ctaccgaggc gaccgatttg gccgggtggt ggcgccacat 882961 cggttgaccg atccgaaggc cgggccgctg atcgcggtca agctgcacat cctgactcga 883021 aagacgttgg gtggcatcga aactgactta gatgctcggg tgctcaaggc cgacggtacg 883081 ccactggccg ggttgtatgc agccggcgag gtcgccgggt tcggcggggg cggtgtccat 883141 ggctaccggg ccttggaggg caccttcctg ggtggatgca tattttccgg ccgcgctgcc 883201 ggccgcgggg ccgccgagga tatccgctag ttgtggccgc ttgacatagg agctattgct 883261 cgcgctagaa ggtgaccgcg ctttcctcgg gcaacacctg aaagtcggtg gtggtcatct 883321 cggtgagccg gccgtagtag atacccctgg cgtccggagc gacgatggct tggtggatgg 883381 gtaccgctcg tgccggagct acggcccgca ggtagtcgac cgcctcggag atcttcatcc 883441 atggggccgc ggcgggagtg gccagtacgt ccacctgctc gccgggaacg aacaacgcgt 883501 caccgggatg catcagtctt gcccgatgtt tactgtcgcc caccagatac gaaatgttct 883561 ctatcacagg gatttccggg tggatcaccg cgtggcaacc gccgaccgca cggacggtca 883621 gctccgctaa cggcagctcg tcgccaacgt gcaccgcccg ccatggctcg cccagctgcg 883681 ccgccgtctg cggatcggcg tacagctcgg cagccgggtt gtcctcgagc agggtcggca 883741 gccgcgtgac gtctatgtga tcggggtgct ggtgggtgat caagatcgcg gacaaaccgg 883801 tgattccctc gaagccgtgc gagaaagtac cgggatcgaa gagcaggcgg gtttgaccga 883861 actcagcgag gaggcaggaa tggccgaaat gcgtgagttg catgtttacg attgtgccct 883921 tatgggggcg tttccgatgc ggttgatcct ggcgacgatg ctggtcgccg gtcgcttgtt 883981 ggcgacgctc atggccgcgc ctagcgccca ggctgagccg gaaacctgcc cgccgatatg 884041 cgaccagatt cctgctaccg cgtggatcag cacccacgcc gtgccgttga actcgcaata 884101 ccgttggccg gcaatggccg gcgcggcagt ggcggtgacc agggcgacac cacgtttcgg 884161 gttcgagcag gtgtgcgcca cgccggcgtt cccgcacgac agccgcgatt gggcggtcgc 884221 gggccgggtc acggtggtcc accccgacgg ccagtggcag ttgcaggctc aggtgctgca 884281 ctggcgcggg gacaccgccc gcggtggcca gatcgcggcg tcggtgtttg gcaccgccgt 884341 cgccgcgtta cgcgcctgcc agctgggcgc accgctgcag tcgccgtcgg tcaccgacga 884401 cgaaccgacc cggatggccg cggtgatcag cgggccggtc atcatgcaca cctacctggt 884461 cgcgcacgta tcaagcagca cgatcagcga actcaccttg tggtcgtccg ggccgccaca 884521 agttccgtgg cctacggttg cggactccgc ggttctggac gccctgaccg cgccgttatg 884581 cgaagcctac atcggctcgt gcccgtgacc aggcggggca cctgccgccg gtagagttgg 884641 cgcgggaatc attgcccggc tcctggcggc cgctgtcgcc gggcgcggcg ggcagatctg 884701 aggaggagcg ccggtggcca gggtggtcgt gcatgtgatg cccaaggcgg agattcttga 884761 cccgcagggc caggcgattg tcggtgcgct ggggcggctt gggcatctcg gaatatcaga 884821 tgtgcgtcag ggcaagaggt ttgagctgga ggtcgacgat acggttgatg acaccacgct 884881 tgccgagatc gcagaatcac tgttggccaa caccgtgatc gaggactgga cgatcagccg 884941 ggacccgcag tgacggcgcg catcggtgtc gtcacgtttc ccggcacgct cgacgacgtc 885001 gacgccgcgc gcgcggcgcg gcaggtgggc gccgaggtgg tcagcctgtg gcatgccgac 885061 gccgacctta agggtgtcga cgccgtagtg gtgcccggcg gattttccta cggtgactac 885121 ctccgggccg gagcgatcgc cagattcgct ccggtgatgg acgaagtggt agctgccgcg 885181 gaccgcggca tgccggtgtt ggggatttgc aacggctttc aggtgctgtg tgaggccggg 885241 ctactacctg gtgccctgac ccgcaacgtg ggattgcact tcatctgccg ggatgtgtgg 885301 ctgcgggtag cgtcgacgtc gacggcgtgg acatcgcgtt tcgagcctga cgccgacctg 885361 ttggttccgc tgaagtccgg cgagggccgt tacgtggcgc cggagaaggt gcttgacgaa 885421 ctagaaggcg aaggccgggt ggtgttccgc taccatgaca acgtcaacgg ctcgctgcgc 885481 gacatcgccg gcatctgctc agccaacggc cgtgtcgtcg gcctgatgcc gcaccccgaa 885541 catgcgattg aagcgttgac cgggccgtcc gacgacggac tgggtctgtt ctattcagcg 885601 ctggatgccg ttctgacggg ctgaggtcac ccgctcacgc tcacccggcg tctcgcagca 885661 acggcggcgt cgcggttgga ggtaatccgg ctgccgtcag ctgaccgaag agctccgtcg 885721 cggccgagac ggcgttgtcg acgaaggtgg cgaaatcgtc gaaccggatg cggtccctga 885781 tcaaggaccg ctctgcggcc acgccgatgc ggtgcggatc agaggacccg tgcacgatcg 885841 cggtgacctc gtggttctgc aggttccacg cgttgacgat ctccgccaac cgggtgtggt 885901 cggtggcggg gaagaagtat gcgggactga ccctgatcgt gaacacgtcg cggtaggcgg 885961 gagagatttc taggtggacg tgcagccgca ggtgggcgtt ggcgacgaag aagaactcgg 886021 cgtcgtggtg gccacggaag tatcgccggc cgcgggcgcg caggtagcgc tcgatcaggt 886081 tggtgctcag cggctcgcct atcgactcag tcatgaactc atgatgcggc cggcgccttg 886141 gtgaatcctt tgagctggga acccggttgc gaagaacaag atgagaattc cctgagcgac 886201 gcggggcagc ccggccactg tgaatggcac gacgcgacac gcggcggagg cgtcgtgaga 886261 ttcacagtcg gtgggttgcg tcggccaatt caaccggggg gccggtccac agttcctcgt 886321 cagcggctac caaggcgtgt acttcggtgg actgcaacgc cttcaggacc gactgagcga 886381 ttcgttcgta ccattgcgcg accgcgctgg gatagtcggc gtagctgccg ttttcccgca 886441 ggatggttcc ttccaccgtc gggatcgggg tagctccgtc gaattcttgc cggtagggct 886501 tgccgaggcg ggcggcggtg ggtgcgtcga tggtggcgtc cagcttgacc catcgccgac 886561 caagatatgc ctcacccagc gagtgccacg ggaagggccg gccagttcgg cctccccata 886621 gggcacgtac ctgcggggac agaaactcct tatcgggggc gtcgatcgtc tggaacgcga 886681 tacgggccgg gacaccggcg gctcggcaca gggcgacgaa ggaacttgcc ttgcccatgc 886741 agaaggcgac cccgtggccg atcacgtcgc tggcgcggtg atgtccctgc gcgaggtagc 886801 gaaaggacgc gaggacgtcg tatggcacgt cgcgcacgta gtagtagatc cgcctgaccc 886861 gctcggtatc cgacaccgcg tcccggatga gggttgctgc cgtcgtacga acgagcggat 886921 ggcccgcgtc gaggtactcc gtgggcgtca gaaagtggtc catgccggtt ccattgttgg 886981 ctagcgtcat ggaatcgtga cctcagtttt gacccgcgga atgatgtcac tgccgatgat 887041 gtgcagataa tcggacttca cgacgtgtgg aatcgtgaag agaaaatggc cgacaccgcg 887101 gtcctggtat tcacgaatgc gctcgacaca cctgtcgggt gtcccgacga tgagccccgg 887161 ctcggggatg gacgcgaatt cttcgcggat ccggacttct tcctcgccgg actgggtggg 887221 tgccagcagc agcgtgaccg acagtcgcag cgtgtcgggg tcacgcccgg ccgcctccga 887281 cgcctgggtg agaaatccgc ggcgttgggt gacttgctgc ggcgaccacc agcgcacgtt 887341 caggccctgg gcatgcttag cggcgatgcg ctggacccgg tcgccttccc cgccgatcca 887401 caacggagga tgtggccgtt gcaccggcgg cggatcgcag gtggcgccgt ccaaggtgta 887461 aaaccggccg gcgtaggtgg ggtttggctc ggtccacacg gccttgatga cctgcagcga 887521 ctcggcaagc gcggagactc ggtcgccaac cggcgggaac gggatgccgt aggcttgcga 887581 ctcgcgccga aaccagccgg cgcccaatcc cagatcgaga cgtccctggg aaatgacgtc 887641 cagcgtcgca gccatcttgg ccagcacgga aggatgacgg taggaattgc acagcacgct 887701 ggtgcccaac cgcagcttcg tggtgtcgcg ggacaatgcc gcaagtgcgg tccagcactc 887761 gagcaggggc agcgacctcg aaggggcgca ctggcccgcc ccgccggttt cggtgccggt 887821 cgcggagccg gtgtcggcgg cgatgccggc gaccttcgca tactcgccgg ggcttatcgt 887881 caggaagtgg tcgcataacc acactgaatc gaatccgtat tcttccgccg tctgcgagac 887941 gacaaccatt tcgcggtaac tgccgaccgc caggccatta accgtcgcag ccaacatgag 888001 tccgaagtgc gggtcgtctt tggcgttcat gcgaaatctc gtttctcgat aattccggca 888061 cctgatccgg gcaacgttcg gggtaacgtg acggagaact ggtaccgctc ggggcgatgg 888121 tggaacacga ccacttcaag gggcttgccg tcattggtgt agctggtgcg gtcgacgacc 888181 agtaccggcg aacccaccgc cagacccaac gcgtcggcta cgtcggggga ggccccggcg 888241 gcatggattt cgtgggtagc ctgtgcaatg cgtacaccca gtcgccgctc ccacatcgca 888301 tatgtggttt cggtgtccgc gctgcccgat agcaacggct cgacggctgg gcccacgccg 888361 ggcggaagat aggccgtgac cagggccaag ggttgatcgc cagtgcggat gcgccggcga 888421 atacagagga cctcaaccaa acccagcgtc tcggaaatcc gttgcggcgc cggtccggtc 888481 tggtgtgaca gcacgtcgac ctgcggggta acaccacagc tcaacaacac ctctgtgatg 888541 gtgcgcacgc cgcaactgag ctcctgttcc accggatcgg cgacgaaggt acccaagcct 888601 tgccggcgca ctagccatcc ctgacgttgc agcatgccga ccgccgcgcg cacggtcacg 888661 cggctcaaac cggaacggtc gatcaattct cgttcgctgg gcaagcgccc gccgcgcggc 888721 agccgctgct ggatgatctg ggcctttagc gcctcggcaa gctgggtact cgccggcacg 888781 ctgccacgcg atatccgcag atcggcagcg tccaggtcca gcttgacaga tgtcataaga 888841 cgtattaaaa cgtcttatac tcaccacgtc aagcgtgcgt gcgcggtagc agcggaagaa 888901 ggtcagccat gacgtcaccc gtcgcggtca tcgcccggtt catgccacgg cctgacgcta 888961 ggtcggccct gcgcgctctc ttggacgcaa tgattacccc gacacgggcc gaggacggat 889021 gccgtagcta cgacctctac gagagcgccg acggcggcga gctggtgctt ttcgaacggt 889081 accgcagccg catcgcgctc gacgagcacc gcggttcgcc gcactatctg aactaccggg 889141 cacaggtcgg tgaattgctg acccggcccg tcgcggtgac tgtgctcgcg ccgctcgacg 889201 aggcttctgc ttagagcggg tagcacccag gcagcttgat ccacgcccgg caccggccga 889261 gcgctcggga accgccgcag accaccgcag tccccccgtg ggttcagcgg cgcggcggcg 889321 ggttggctat accagcaggt aaaacgaatc tcggtaggat tcaagaagtc tcagccacag 889381 ttcgctgatg gtcgggaagc acggaacggc gtgccacaac cgatcgattg gcacctggcc 889441 ggcgacggcg acggtggccg aatgcaacag ctcggcggcg cccgggccaa ccatggtcac 889501 gcccagcaga tggccccgat cgacgtcgac caccatgcgc gccctgccgg tgtatccgtc 889561 ggcaaagagc ttggctccca taacgacatc gccgatttcg acatcgatcg ctttgatccg 889621 gtgaccagcc tgtgcggcct gatcagctgt caggccgacc gctgcggctt cggggtcggt 889681 aaagaatgcc tgcggcaccg cgtgatggtc ggcggtggtc gcgtgcatgc cccacgacgt 889741 ggtgtctagc ggtcgtccgg cggcacgggc gccgatcgcg gtgccggcga tccgcgcctg 889801 gtatttgcct tggtgggtca gcaacgcgcg atggttgacg tcgccggcgg catagagcca 889861 gccgtcgtca acagcccgca ctcggcaggt gtcatcgacg tccagccagc tgcccggcgt 889921 cagtcctatt gtctccaagc cgatgtcgtc ggttcgcggt gctcggccgg tggcgaagag 889981 tacctcgtcg acccgcagct cggtaccgtc gtccagctcg aggaccactg ggccagtggg 890041 ttggggcggc ccagcgcgcg taccgatact cccacgcgca cgtcaacgcc ggcgtcggcc 890101 agtccgcgac cgatgagttc ccccacaaac ggttccattc ggggcagcag gccagatccc 890161 cgagccagca gggtcaccga ggcgcccagt ccctgccagg cggtcgccat ctccacaccg 890221 acgccgccgg cgccgacgat cgcaagccgg tcggggaccg tactgttgtc ggtggcttgg 890281 cgattggtcc atggccgggc ttcggtgatg ccaggaaggt cggggagtgc tggccggctt 890341 ccggtgcaga tgacaacggc atgccgggcg gtcagcgcca cgctttcgcc gctcgacttg 890401 gtgacgacga cgcggcgcgg accgtccaat cgcccgtcac cgcgtatcag cgtcgcgccg 890461 attccactca cccagtcggc ctggccggtg tcgtcccagt gggccacata gcggttgcgg 890521 cggccaaaga cgccggctgt gttgatcgag ccgtcgactg cttcgcgcgc gccgtcgacc 890581 cgtcgggcgt cagagatcgc gatgaccgga cgcagcaagg ctttgctggg cacacaggcc 890641 caataggagc attcaccccc gacgagttcg cgctccacca ccgcgacacg caggcccccc 890701 gcgcgggcac gatcggcgac gttctgtcca acgggtcccg cgccgagcac gacgacgtca 890761 tacgtttcac cctcacggca gccgggtgtt gccattggcg cctggtcctg ttgggccgcg 890821 gtcataatca aagatccttt cgtcggactc tgccagcgac gctacgcgcg cctagcgccg 890881 gtgagccgtg ccggcctatc gcccaccaga cgcaaaagct ctcgacacgc cgtgcgaaaa 890941 gggaccttta tgtctcagtg tcggtgttgt gtgtgccgcg aggtgggtgt gtcggtgtga 891001 cagacgccgt gtcgcggtgg tttgttccgg atcacctggt gtctggctca ctttgcgtct 891061 gccgtcctct tggggttggc gttgagcagt attgccggca ctaggtgaga aggaccggcc 891121 ggcgtgactt gataggagcg tggctttcgc cccgactgag atgtgtccgc cgaccggccc 891181 aacctcaaca ccccctcaag tgaaggaggc aaccaccatg gttgttgttg gaaccgatgc 891241 gcacaagtac agccacacct ttgtggccac cgacgaagtg ggtcgccaac tcggtgagaa 891301 gaccgtcaag gccaccacgg ccgggcacgc cacagccatc atgtgggccc gtgaacagtt 891361 cggcctcgag ctgatctggg gcatcgagga ctgccgcaac atgtcggcgc gtctggagcg 891421 tgacctactg gcggccggcc agcaggtggt gcgggtaccc accaagctga tggcccagac 891481 ccgcaagtcg gcgcgcagtc ggggcaagtc ggatccgatc gatgcgctgg cggtggcgcg 891541 ggcggtgctg cgtgaaaccg acctacccct ggccacccac gacgagacgt cgcgggagtt 891601 gaagttgttg actgaccgtc gagatgtcct tgtggcccaa cgcacgtcgg cgatcaaccg 891661 gttgcgctgg ctcgtccatg aactcgatcc cgagcgggca ccggcagcac gctcgctcga 891721 tgccgccaag caccagcagg ccctgcggac ctggctggac acccagccag gattggtcgc 891781 cgaactcgcg cgcgccgagc tgaccgacat catccggctc accggcgaga tcaacaccct 891841 ggcccagcgc atcagcgccc gagtccacca ggtcgccccc gcactgctgg aaatccctgg 891901 ctgcgcggag ctgactgcag ccaaaatcgt cggcgaagcc gccggagtga cccggttcaa 891961 aagcgaagcc gccttcgcct gccatgccgc agtggctccc atcccggtgt ggtcgggcaa 892021 caccgccggc cagatgcggc tcagccgctc gggcaaccgc cagctcaacg ccgccctaca 892081 ccgcatcgca ctgacccaaa tccggatgac cgacagccgg ggccaggcct actaccaaag 892141 gctgcaagac gccgggaaaa ccaaacgcgc agcactacgc tgcctcaaac gccgcctagc 892201 ccgcaccgtc ttccaggccc tgcgcaccgt ccatcagccc agctccgaac acacccaacc 892261 cgcggccgct tgccatagga gctattgctc gcgctcgtgc cttagtggct gagcgcgacc 892321 gacgcctcgg cggtgtagca aaggaacgtc agcgtctcct gcaggtagag gcgcacggtg 892381 tccgtgtcgt ggctggcgta cccgattgca acgtcggtgc ccagctgtag gtcgaagtcg 892441 ccgcctcgag tggtcagcac gaacgcgccg tcgatggccg gggcccaaat gatgtccccg 892501 tccaccagcc ggttcagatg ctcacggatg ggatagccgt gatcggaagt ctcgctaacc 892561 ttggtgtaga cgtcagcaga gagcaacacc gaatacggtc cgtccacacc ggccaaccgc 892621 agttcggaca atgcctggga gatgacatca gggatttcac ggggatcctc gggcaacgtc 892681 agcgccgggt tcgaactcgc gctgcggatc ccttcgattg atgcggcgct gtagccttcg 892741 aatattgtgc ggtcctcgac gaaggccagc ttcttggccg cctcctttac cggttcccaa 892801 tcggagtcct tagagccacg ttccacgtcg tcgatctcgt tgcgcgacag ggtaaacgga 892861 acccgtagcc ggacaagggg tttgctggcc cgcaggtggg cgatcacgcc gttggttggt 892921 gccttaacat cgatcagccg gccggtgctg accgccgcgg tgacgggccc cccgggatca 892981 ctgacatcga ccacccggcg cccggcgatg tgtcgcttga acgtccgcgc cgcctccaat 893041 tcgatttccg cccaagcggc ttcggtgacc ggtgccaaat cgcggtagag attgttcatc 893101 gggggcttcc tttcaagctg ccgatcgata gcgacccggc tgccagagtt ggcgtcgccg 893161 cctgcggtag gggcggtgga tggtcgagaa agtcgatggt gggtgagaag aacagtccgc 893221 cggtcaccgc ggtggaaaag tcaagcactc gatcggtgtt gcctgccgga tcgccgagaa 893281 acatgttgcg cagcatctgc tcggtcaccg ttggcgtgcg cgaatatccg atgaagtaag 893341 tgccgtactc gcccttgccg acttcgccga acggcatgtt gtgtcgcacg atcttgcgct 893401 cggtgccgtc gtcgtcggtg atgacgttga gcgctacgtg tgaattggct ggcttcgcgt 893461 tgtcgtcgag ttcgatgtcg tcgagcttgg tccggccgat cacacgctcc tgctcggtga 893521 ccgagaggga ttcccacgag gccatatcgt gcacatactt ctgcacgtgc acataacacg 893581 agccggcgaa atttcgatcc tcgtcaccga tcgtggtggc cttgatggcg attgggccac 893641 ttgggttttc ggtgccatcg acaaagccca gcagatcacg gttgtcgaaa aaccggaagc 893701 cgtgcacttc gtcgacaacg gtcaccgcat cgcccatcga cttgagaatg cggccagcca 893761 actcgaagca cacgtccatg gtctcggccc ggatgtggaa caacagatcg ccgggagttg 893821 ccggggcggt atgccgtggt ccggtcagct cgacgaacgg atgcagctcg gtgggtcgag 893881 gtccggcgaa caagcggtcc caggcgtcgg acccgatcga gacgaccacg gacaagtgtt 893941 tggtcgggtc acggaagccg atcgcacgca ccaggccgga gatcttcgac agtgcgtcgt 894001 gcaccgtcgc ctcgccgtcg gcgccgatgg tggcgaccag gaagatcgcg gccggagtca 894061 acggcgccag aatcggctgc ggagagacag caggcacagc cacgacccta acgtccctgc 894121 aataccggtg atgctagaca tggctacatg gcggccacgg cacacggcct gtgcgaattc 894181 atcgacgcgt ccccgtcgcc gtttcacgtc tgcgcgacgg tggcgggacg gctgctcggc 894241 gccggatacc gcgagctgcg cgaagcggat cgctggccgg acaaaccggg ccggtacttc 894301 accgtccggg ctggctcgct ggtggcgtgg aacgccgagc agagcgggca cacgcaggtc 894361 ccattccgga tcgtcggcgc gcacaccgac agccccaatc tgcgggtcaa gcagcatccg 894421 gacaggctcg tcgccggctg gcacgtggtg gcgctgcaac cgtatggggg agtttggctg 894481 cactcctggc tggatcgcga tctgggcatc agcgggcggc tatcggtgcg tgacggtacc 894541 ggggtcagcc accggctggt ccggatcgac gacccgatcc tgcgggtgcc gcagctggcg 894601 attcacctgg ccgaggaccg caagtcgctc acgctcgatc cgcaacgaca catcaacgct 894661 gtatggggcg tgggagagcg ggtggagtcc tttgtggggt acgtcgctca gcgcgccggg 894721 gtggcggcgg ccgacgtgct ggccgcggac ctgatgaccc atgacttgac cccgtcggcg 894781 ctgatcggcg cttcggtcaa cggcactgcc agcctgctca gcgcgccgcg gctggacaac 894841 caggccagtt gctatgccgg gatggaggca ctgctggccg tggacgtgga ctcggcgtcg 894901 agcggattcg tgcccgtgct ggcgattttc gaccacgagg aggtgggatc ggcctcgggc 894961 cacggcgcac agtccgatct gctatccagc gtgctcgaac gcatcgtgct cgcggcgggc 895021 ggcacccggg aggacttcct gcgccgactg accacctcga tgctcgcctc ggccgacatg 895081 gcgcatgcga cgcaccccaa ctacccggac cgtcacgagc cgagccaccc gatcgaagtc 895141 aacgcgggtc cggtgctcaa ggtgcaccca aatctgcgct acgccaccga cggacgcacc 895201 gcggcggcgt tcgcactggc ctgccagcgc gcgggagtgc ctatgcagcg ttacgaacat 895261 cgcgccgatc tgccgtgcgg gtcgacgatc gggccgttgg ccgcggcgcg caccggaatc 895321 cccacggtcg acgtcggcgc cgcccagctg gcgatgcact ccgcgcgaga gttgatgggc 895381 gctcacgacg tagccgccta ttcggcggca ctgcaagcgt ttctttccgc cgagctatcc 895441 gaggcatagg gtcgggcggt atggcactca aggtagagat ggtcactttc gactgcagcg 895501 accctgcgaa gcttgccggc tggtgggccg agcagttcga tggcacgacg cgtgaactgc 895561 tgcccggcga attcgtcgtg gtcgcccgga ccgatggacc gcggttggga ttccagaagg 895621 tgcccgatcc cgcccctggg aaaaaccgcg tgcacctcga cttcacgacc aaggacctgg 895681 atgccgaggt gttgcgcctg gtcgccgccg gagccagcga ggtcgggcgg catcaggtcg 895741 gcgagagctt tcgctgggtg gtgctggctg accccgaagg caacgctttt tgcgtggcgg 895801 gtcaataacg aggcggttcc aaggggccga aaagcggccg gcagcggtcg aacccgtcca 895861 cccgaacctc aacagtgcga tggcgctgcc aatcgtcgcg ggtcagccgg aataacagcg 895921 cctctgccat agccccttcg cgtgccacgc gatctaggcc gttgtcgcgg tatccgttac 895981 ggcgggatac cgcgatcgag gccgggttat ccacgaacga cctcgacgtc gcgacctgcg 896041 cctccagctc ggcaaacgcg aaatacagta cagccgcccg catctcggtg ccgtagccgt 896101 gaccttggta acgcaacccg agccatgatc cagaatccac ctgacgggtg attgggaaat 896161 ccttggagct cagggcctgt acgcctacgg ccctaccgtc gacgaggacg gccagcggca 896221 gcgaccagtc atcccgcttg aacccggcca gttgctgcca taggtgcgac agcgtgttga 896281 acggcaggtc ctcgcgcgat gctcgcgtcc acggaaccga aaacggcatt cggtcggggt 896341 cgtggactcc ctccaggatg gtgtcgatca gctggtcgca caactcctcg gtgggcagtt 896401 gcaactggag ccgcggcgtg gtgatgcgca ggtcgaacaa cggccagtga cgagacatgg 896461 ttccattttg cgcaccacca tcctgagcgc ccgccccgat gtcagcccga cggctgatgc 896521 caccggggtt cttgccgcgg gcatacctat ccgtcggctt gtccgtgtca acgcggccgc 896581 agcgcgatgg ggcctagcta gactgcctcc gtgatgtctc cgctcgcccg gaccccgcgc 896641 aaaacgtcgg tgctggacac cgtcgaacac gccgcgacca cacccgacca accacaaccg 896701 tatggtgagc tgggcctcaa agacgacgag taccggcgga ttcgccagat cctgggccgc 896761 cggcccaccg acaccgagct ggccatgtac tcggtgatgt ggagcgaaca ctgttcgtac 896821 aagtcctcca aggtgcacct gcgctacttc ggtgagacca cctccgacga gatgcgcgcg 896881 gccatgctgg ccggcatcgg cgagaacgcc ggcgtcgtcg acatcggcga cggctgggcg 896941 gtcaccttca aggtggagtc acacaaccac ccgtcctacg tcgagcccta ccagggcgcg 897001 gccaccgggg tgggcggcat cgtccgcgac atcatggcca tgggcgcccg accggtcgcc 897061 gtgatggacc agcttcggtt cggcgccgcc gacgcccccg atacccgccg cgtgctcgac 897121 ggcgtggtcc gcggcatcgg cggatacggc aactccctgg gcctgcccaa cattggcgga 897181 gagaccgtct tcgacccgtg ctacgccggc aaccccttag tgaacgcgtt gtgtgtcggc 897241 gtattacggc aggaggacct gcatttggcg ttcgcctccg gcgccggcaa caagatcatc 897301 ctgtttggcg cgcgcaccgg gctcgacggt atcggcgggg tgtcggtgct ggcgtcggac 897361 accttcgatg ccgagggatc ccgcaagaag ctgccctcgg tgcaggtcgg cgacccgttc 897421 atggagaagg tgctcatcga atgctgtctc gagctctacg cgggcggcct ggtgatcggc 897481 atccaagacc tgggcggagc cggattatct tgtgccacat cggagttagc atccgccggt 897541 gatggcggaa tgacgatcca gctggacagc gtcccgctgc gggccaagga gatgacgccc 897601 gccgaggtgc tctgcagcga atcgcaggag cggatgtgcg cggtggtctc cccgaagaac 897661 gtcgacgcat tcctggcggt gtgccgcaag tgggaggtgc tggcgacggt gatcggcgag 897721 gtcaccgacg gcgaccggct gcagatcacc tggcacggcg agacggtggt cgacgtgccg 897781 ccgcgcaccg tagctcacga aggtccggta tatcagcgcc cggtcgcccg ccccgatacg 897841 caggacgcgc tgaacgccga ccgctcggcc aagctgtcac ggccggtcac cggcgacgag 897901 ctgcgcgcga ctttgcttgc gttacttggc agcccgcacc tgtgcagccg cgcgttcatc 897961 accgagcagt acgaccgcta tgtgcgcggc aacacggtgc tcgccgagca cgccgacggc 898021 ggcatgctgc gcatcgacga gtcgaccggc cggggcatcg cggtatcgac cgacgcgtcg 898081 ggacgctaca cgctgctgga tccctacgct ggcgcgcaac tcgcgttggc cgaggcgtac 898141 cgcaacgtcg ccgtcaccgg cgccaccccg gtcgcggtga ccaactgcct gaacttcggt 898201 tcccccgagg accccggcgt gatgtggcag ttcacgcagg cggtccgcgg tctggccgat 898261 ggctgtgcgg acctcgggat tccggtgacc ggtggcaacg tgagtttcta caaccaaacc 898321 ggttcggcgg caatcctgcc cacgccggtg gtcggggtgc tcggcgtcat cgacgatgtg 898381 cgtcggcgca tccctaccgg cctgggcgcc gagcccgggg aaacgttgat gctgttgggc 898441 gacacccgcg acgagttcga cggttccgtg tgggcgcagg tgaccgcaga ccacctgggt 898501 ggattgccgc cggtagtcga tctggcgcgg gagaagctgc tggccgcggt gctgagctcg 898561 gcgtcgcggg acgggctagt gtccgcggcg cacgatctgt ccgagggtgg gctggcccaa 898621 gccatcgtgg aatcggcgtt agcgggtgaa accggttgcc gcatagtgct tcccgaaggg 898681 gctgacccgt ttgtgctgct gttctccgag tcggcgggtc gggtgctggt cgcggtgcca 898741 cgcaccgagg agagccggtt tcgcgggatg tgtgaggcgc ggggacttcc cgcggtccgc 898801 atcggcgtcg tcgatcaagg ttcggacgcg gttgaggtgc agggcttgtt cgcggtgtcg 898861 ttggccgaac tgcgtgcgac atccgaggcg gtgttgccgc gatacttcgg atgagtcggc 898921 ttcgcgccct gtctttggcc gccggcctgg tcggctggag tctggtcagc ccgcggctgc 898981 cggcgccgtg gcggattccg ttgcaggcgg ggctggggag cgtgttggtg ctggttactc 899041 gtgcgacgat gggcctttgg ccgccgcggc tgtgggccgg gctgcggctg ggctgggccg 899101 cgggggcggc ggcggcgacc gcgatcgcgg caacgacgcc ggtgccgatg gtgcggttgt 899161 cgatgtcggc tcgtgagttg ccggcgtcgg tgccggtctg gctggtatgg cacatacctg 899221 gcggcacggt gtgggccgag gaggccgcgt ttcgcggggc gctggccact atcggtgccc 899281 gggccttcgg tcggtcgggt ggacggatac tgcaggccgg cgcctttggt ttgtctcaca 899341 tcgccgacgc gcgcgcgacg ggcgagccgc tggtgctcac ggtgttggcc accggtatcg 899401 ccggctggat gttcggttgg ctggccgacc ggtccggcag tctggcagca ccgctgctga 899461 cgcacttggc catcaacgag gccggtgcgg tcgccgcggt gctggtccag cggcgttctg 899521 gtatctcgac tcgactgtga tcgcggggtc gggcccctgg tgatcgtgga acggctcaca 899581 acagcgcgga cctggtcggc ggcgccgcta tactgattgg tcactgtcta accaatcaat 899641 ggagagggtt ggcacctcag gtgcatagac ttagggccgc ggagcatccg cggccggatt 899701 acgttctctt acatatcagc gacactcatc tcatcggggg ggatcgtcgg ctctacgggg 899761 cggtggacgc cgacgaccgg ctgggcgaac tgctcgaaca gttgaaccaa tccggccttc 899821 gtcccgatgc gatcgtcttc accggcgatt tggccgataa gggcgaaccg gcggcatacc 899881 gcaagctccg aggcctggtc gagccgttcg cggcgcagtt gggcgccgag ctcgtctggg 899941 tgatgggtaa ccacgacgac cgggccgaac tacgcaaatt cttgctggac gaagcgccat 900001 cgatggcgcc gctagaccgg gtgtgcatga tcgacggtct gcgcatcatc gtgttggata 900061 cctcggtacc cggacatcat cacggcgaaa tccgcgcgtc ccaattgggt tggcttgctg 900121 aagagttggc cacgccagcg ccggacggca ccattttggc gttgcatcat ccgccgattc 900181 cgagtgtttt ggatatggcc gtcacggtgg agctgcgcga ccaggctgcg cttgggcgag 900241 tgctgcgggg cactgacgtt cgcgccattt tggccgggca cctgcactac tcgacgaatg 900301 ccaccttcgt cgggatccca gtgtcggttg cctcggcgac ttgctacacc caggacctga 900361 ccgtcgctgc tggaggaacg cgtggcagag acggcgccca aggttgcaac ctggtgcacg 900421 tctatccgga caccgtcgtg cattcggtga ttccgctggg cggcggagaa acggtcggca 900481 cctttgtctc acccgggcag gcgcgacgca aaatcgccga gagcggcatt ttcatcgaac 900541 cgtcgcgtcg cgattcgcta ttcaagcacc ctccgatggt gctgacgtcc tcggcaccgc 900601 gaagtcccgt cgactgacgt ccgcggcgat cttctcccag ggagccggta tcgggaaata 900661 gcgctccagg aaactgacga ctcgttctgc gcgctgcgct gcggggactt caggaaagct 900721 accgtcgttg aggcagaaga aatcgtatcc gcggtgcttc cgcaacttag gaagtagccg 900781 aagacccgca tagctggtgg tgtcgacata gaggacttta gccttttcct gcgggacggc 900841 gcgtccggtc atcagcgcgt aatagtggta gaacgagttg gtcaccgaga tgtcggtgtc 900901 ggagcggaac gggctggccg cggtgcgggc gaattcctcc gggaattccc gctccatctc 900961 gatcagcaca ctcttgcgca acggtaccgc ggtgtgctcg agatgacggg taatcacctg 901021 cccgaaccgg tcgaagagca gctggcggtt cacccgggcc gcgttttcaa agccactacg 901081 cgctgggttg ttggcgccga gcccgatccg ggtcttggct tcgatgaacc tggtgactcc 901141 accgggagag aagaacatac tggccttgag cggccggccg aagaacatgt cgtcgttgga 901201 gtacaagaag tgctcgctga gccccgggat gtggtgcagc tggctctcca ccgcatgcga 901261 gttataggtc ggcaacgcgg aacggtcgga aaagtggtcc tcggcgcgaa cgatggtgat 901321 tttaggatgt tcggccaacc atggcggcgg ggttgaatcc gtcgcgatga agatgcgacg 901381 tatccacgga gcaaacatgt tcaccgaccg cagcgcgtat ttcaactcgt cgatttggcg 901441 gatccgcgct tcggcgtcgt cgccctcgcc caccacgtac tgcgacattt gagccatgcg 901501 gcgcgcccgg aactcggggt cactaccgtc cacccaggag aacaccatgt ctatgtcgaa 901561 cacgacgtcg ctggcgtgcg gggcaaacat cccgtcaagg gtcggccatt tgtacccgta 901621 gagtttgaca tttgtcggcg ttatttcgtt tcggggcagc actttgcggc taagcgagtt 901681 ttcgacaggg cagcggatca ccgtctcctc gtatacccag aattgcagtt ccacaccgaa 901741 cgccgggccg tagcgaaatc cgcccggcgc gatccggcgt cgatacaacc gcacgacacg 901801 cgggtcaacc agctgcgaca gcccgtcggt ggcgaccaaa acaggagaaa ggccaggctc 901861 atcaatagtt ttggcgtaca tcggttcggt tgcacatgcg gccgcaagag cgcgctcgag 901921 gccggcacgt agttcgatgt tgatggcaag caccggccgg ttcttgtggt ttcggatcag 901981 tagataggga atatcagccc tgtttaacac ctttcgcaga aagaccagat cttcgatctg 902041 ggcctcctgg ggggtcaggc cggattccag gcgggcgatc ttgccgcgcc gggtaacgat 902101 gatgggattc acggtgcgct gagcgggccg accgccgtcg cgcgaagaga ttttgggcat 902161 cgggtcaccg ccttgggaac tcagggagaa atgattaggt caccgaaaga atctcacaga 902221 tcgcgggtcg gcgcaggttg accgcgctgg cgcggggtcc atacagaatt gtgcggtcaa 902281 ggcgataact cttgcaagac accagatcta gcgatctaag aacatcggcc ggaaacctgg 902341 ttgttgcggc cgcgccatgt caagttcagt tcggaactgg gctcgcatac aacccgatcc 902401 cagtctcagc agcggcgctt ggccgccatc tggatggatc caccgattct tgagacccta 902461 aggtatgagc gctcgtgatc gagtcgatcc ggcgaagact cggcaggtcg tgttggccct 902521 cgcggactgg ttgcgcgacg aaacgttgcc agcacccgac accgacgtgt tggcggcggc 902581 ggttcggctt acggcgcgca cgctcgctgc gctggcccct ggcgccagcg tcgaagtccg 902641 gatcccaccg tttgctgcgg tgcagtgcat ttctgggccc cggcacactc gcggcacacc 902701 ccccaacgtc gtgcagaccg acccacggac ctggctcctg gtggctaccg ggctgtcggg 902761 ggtggcgcag gcccggggca gtggcgcgct gcagctctcc ggctcgcggg ccggtgagat 902821 cgaggcctgg ttgccactgg tggatctcgg ctgattccgg cgtgctgagc tgcggctatg 902881 gtgtgtgagg gtggcgccgg ggtgcccgac acgtaagccg aattcggcgg tgcagacgtc 902941 gtggccgtag actcggatta cgtcaccgac cgcgccgcag ggagccgcca aaccgtgacc 903001 ggccagcaac ccgagcaaga cctgaactcg ccccgggaag agtgcggtgt cttcggggtc 903061 tgggccccgg gtgaagacgt cgccaaactc acctactacg gcctgtacgc gttgcagcat 903121 cgcggccagg aagccgccgg gatcgccgtc gccgacggct cccaggtgct ggtcttcaaa 903181 gacctcggcc tggtcagcca ggtgttcgac gagcagacgt tggcggccat gcagggccat 903241 gtcgccatcg ggcactgtcg ttactccacc accggggaca cgacgtggga gaacgcccag 903301 cccgtgttcc gcaacaccgc cgctggcacc ggtgttgcgt tgggccacaa cggaaatctg 903361 gtcaatgccg ctgcccttgc cgcccgcgcc cgcgacgcgg ggttgatcgc cacccgctgc 903421 ccagccccgg cgacgacgga ctccgacatt ctgggggcgc tgctggccca cggtgctgcc 903481 gattccaccc tcgaacaggc ggcgctggac ctgctgccca cagtgcgggg agcgttctgt 903541 ctgacgttca tggacgaaaa cacgctttat gcgtgccgcg acccgtacgg ggtgcgcccg 903601 ctatcgctcg ggcgtttgga ccgtggctgg gtggtggcct ccgaaacggc cgcactcgac 903661 atcgtcggcg cctcgttcgt ccgtgatatc gaaccgggcg aattgctggc tatcgacgcc 903721 gacggggtgc ggtccacccg ctttgccaac cccacgccca agggctgcgt attcgaatac 903781 gtctacctgg cgcggccgga cagtacgatc gccggccggt cggtacacgc cgcgcgggtg 903841 gagatcggtc gccgactggc tcgggaatgc ccggtcgagg ccgacttggt gattggtgtg 903901 ccggaatccg gcacacccgc cgcggtcggc tacgcgcagg agtccggcgt tccatatggg 903961 cagggtctga tgaagaacgc ctatgtcggg cgcaccttca tccagccgtc acagaccatc 904021 cgtcagctcg gcatccggct gaagctcaac ccgctcaaag aggtgatccg cggcaagcgg 904081 ctcatcgtcg tcgacgactc gatcgtgcgg ggcaacaccc agcgtgcgct ggtacgcatg 904141 ctgcgcgagg ccggtgcggt cgaattgcat gtgcgcatcg cctcgccacc ggtgaagtgg 904201 ccgtgcttct acggtatcga cttcccctcg ccggccgagt tgatcgccaa cgccgtggaa 904261 aacgaggacg agatgctgga ggcggtacgg catgccatcg gggccgacac gctgggatac 904321 atctcgctgc ggggcatggt ggcggcgtcc gagcagccca cgtcgcggct gtgcaccgcc 904381 tgcttcgacg gcaagtatcc aatagagctg ccccgcgaga ccgcgctagg caaaaatgtc 904441 atcgagcaca tgctcgccaa tgcggcccgc ggagccgcgc tgggcgaact cgccgccgac 904501 gacgaagtcc ccgttgggcg ctgacaaaac gcacgcgcgg tagcctttat cgcgatgacg 904561 gatctcgcaa aaggccccgg aaaagacccg ggtagtcggg gtatcaccta cgcgtcggcc 904621 ggggtcgaca tcgaagccgg tgaccgcgcc atcgacctgt tcaagccgct cgcttcgaag 904681 gccaccagac ccgaagtgcg cggcgggctg gggggattcg ccggactgtt cactctccgc 904741 ggcgactacc gcgaaccggt gctggcggcc tccagcgacg gcgtcggcac caaactcgcg 904801 atcgctcagg cgatggataa gcacgacacg gtgggcctgg acctggtggc gatggtggtc 904861 gatgacttgg tggtttgcgg cgccgagccg ctgttcctgt tggattacat cgccgtcggt 904921 cggatcgtgc cggagcgact cagcgcgatc gtcgccggta tcgccgatgg gtgcatgcgt 904981 gccggctgtg cgctgcttgg cggcgagacc gcagaacatc cgggcctgat cgagcccgat 905041 cactacgata tctctgccac cggcgtcggc gtcgtcgagg cggacaatgt gctgggtccc 905101 gaccgggtca aacccggcga cgtcatcatc gcgatgggct cgtcgggtct gcattccaat 905161 gggtactcgc tggtccgcaa ggtgttgctg gagatcgacc ggatgaatct ggccggtcat 905221 gtggaggagt tcggtcgcac cttgggcgaa gagttattgg agccgactcg catctacgcc 905281 aaagactgtt tggccttggc cgccgaaacc cgtgtccgga cgttttgcca cgtcaccggc 905341 ggcgggctcg ccggcaacct gcaacgggtc atcccgcatg gcctcatcgc cgaggtcgac 905401 cgcggcacct ggacacccgc gccggtattc accatgattg cccagcgcgg ccgggtcagg 905461 cgcacagaga tggagaagac gttcaacatg ggtgtcggca tgatcgccgt cgttgccccc 905521 gaagacacga cgcgcgccct ggccgtcctg accgcgcggc acctggactg ctgggtattg 905581 ggaaccgtct gcaaaggcgg aaaacaaggc ccgcgggcaa aactggttgg gcagcacccg 905641 agattctaag aaccagacct aaccgggtct aatgaggtca acgccacgcc gatgggaacc 905701 gaatcggcac cgtgcggggg gcagctccgt ggtgctagcg ccgccagtcg tcctcatcgt 905761 tccacgagtc gtcgtccgac gggccgtcgc cgtccagtcg gtcggtaccg gtacctgaca 905821 gctcacgctg aagccgctgg aagtcggtct gcggggagct gtatttcaat tctcgagcaa 905881 ccttggtctg ctttgcctta gcccggccgc gccccatggg ggaaccccct cgcgaaataa 905941 cggagcggcc taacgagtag gcggctccga tctctggtgt cgtttattgt cctgccgaca 906001 gtttaccgtg ccgcccggtc gggcgcgggg cggcctgccc gccgttacgg agggcacggg 906061 taatcaccga ataccgccgc gcagccgctc gacggcccgc cgtccggcgc ccacatcgtc 906121 ggcgggcggc agcgagtcga cgtcgatcac cgcggcaacc tcggcttccg gaccggttac 906181 cagggcggtg tcgcccggca gaccccgttt gagcagggcc agtgccaccg gccctagctc 906241 tacgtgctcg accaccgttc ccagtcgtcc caccgtgcga ccgccggcca gcaccgcatc 906301 gcccgtcgac ggccgctgca ctgactcgtc cagatgcaac aacaccagca tccggggtgg 906361 tctacccagg ttgtgcaccc gtgcgacggt ctcttgccct cggtaacagc ccttgttcag 906421 gtggacggct ccggcgccgg ggccaccgat ccaacccact tcgtgaggga tggtgcgttc 906481 atcggtgtca acgcccagcc gcgggcgcct agccggcacc cggtgagcca ctcgatgggc 906541 ttcataggcc cagatgccgg ccgggcgcac acccgcctga gtcaggcgac gctgccagtc 906601 ggcacgatcg ccgcgcttca ccaccacgtc cagttcgatt tggcccgcta ggccgtcggg 906661 catccggcgg acaatcccgc cgccggcaag cggcacggcc agccactcag cgggcaagac 906721 atctagaccc agcgcgtcga gcactcgttc ctcagccagc cgcggcccca atagcgacaa 906781 caccgccata tcagcggcac gaggagtgac catcgaccaa aaaaccatct tgcgcaaata 906841 ggccagcagc ggttcacccc gccacggctc ggtatcgaga taggtcgtgc cacccagctc 906901 ggtctgtatc cagtgatcct caactcggcc ttgtccgtcc aggctgagat tttgggtgct 906961 ggcgccctca ggcaggtcgc tgacgtgttg tgtggagatg ctgtgcagcc aggtttgccg 907021 atcgccaccg tcgagggtga gcacggcgcg gtgcgagcga tccaccagca cggcatcggc 907081 ttgccccgcg cgttgctcgc ccagcgggtc gccgtaatgc cagatcgcac ccgcgtcggg 907141 tccggggtct ggggcaggga ctgcggccac acaacaactc tacgaaaagc cgcgctcggc 907201 ctcgttgacc agcgtgcagc taggctgcag ggacatgttg aggcagacgg gcgtggtggt 907261 cacgcttgac ggtgagatcc tgcagccggg tatgccgctg ctgcacgccg atgatcttgc 907321 cgctgtgcgg ggggatggcg ttttcgagac actgctggtg cgcgacggcc gagcctgtct 907381 ggttgaagcg cacctgcagc ggctgaccca atcagccagg ttgatggacc ttcccgaacc 907441 ggatctcccc aggtggcgcc gcgcggtcga ggtggcaacg cagcggtggg tggctagcac 907501 cgctgacgag ggcgcgctgc gcttgatcta cagtcgcggt cgggagggcg gctcggcgcc 907561 gacggcctat gtcatggtca gtccggtccc ggcgcgagtt atcggggccc gccgcgatgg 907621 tgtgtcggcg atcacgctgg accgcggttt gccggctgac ggtggcgacg ccatgccgtg 907681 gctgatggcc agcgccaaaa cactgtccta tgcggtgaac atggccgtcc tgcgtcatgc 907741 cgcccggcag ggcgccggcg acgtcatctt cgtcagcacg gacggctacg tcctggaagg 907801 ccctcgctcg acggtggtga tcgccaccga cggtgaccaa gggggcggga acccctgctt 907861 gctgacgccg cctccgtggt atccaatcct gcggggaacc acgcaacaag cgctcttcga 907921 agtggcccgc gcgaaaggct acgactgcga ctaccgtgcc ctacgcgtcg ccgatctctt 907981 cgattcccaa ggtatttggt tggtatcgag catgactctg gccgcccgcg tacacaccct 908041 ggacgggcgg cgattacccc gcaccccgat cgctgaggtg tttgccgaat tggtggacgc 908101 cgctattgtc agcgaccggt gatacggcaa cctctgttgt ggtcagcgcc ggccataccg 908161 ctcgccgtta tccgacgaac cgggacaacc gcgccgacag atgtggtacc agcccgccgt 908221 cggcatcgac gcgttcctcg acgtaggcca ggtcgccacc ttcgacgatg ccgtagagtc 908281 gtttggcgcc gccgaccaga acgccagacc gactgcgggc cagcgcatcg gtcaccaact 908341 cccacgagga ctgggtgcgc ggccgcccgt agaacagttc gacataaccg gccgaatgcg 908401 ccaatagcaa ctcgatcgcc tgagactcgc tcggatcgta cgggtcggcg acgaaccgcc 908461 agaatcccgc ttctcgtaag cctggttcct ggtagtcgcc cgtggcggtg agccgccagg 908521 accgggattc ccaattcaga tagtcgccgc cgtcgtgtga cacaacgatc tgctggccga 908581 accggtagtc gccgtcgggt ccgcggccct cgccttcgcc gcgccacacg ccgaccagtg 908641 gcagcagcgc cagcagtgca ttgttcaggt cggcaccttc gcgcaggttt gcggtatctg 908701 cgggaaccgg caaatcgtcg aaggcaggga tattgcgcgc ggcggtcgcc ttggcccgct 908761 cgacggcagc ggcgaccgca cggtcgccgg agcccgcagc atggacgccg ccggccccgg 908821 tcgcatccga gcccgcgccg gaactcacga ctcgtcggta acgagccggt acagcgtgta 908881 cagcgcgaac caggagataa ccacggtcgc caagaccagc atgatctcga agaacagcac 908941 cacggggacg agtgtatgcg gccgcggccg cttccgtggc cctggctcgt ctgggcctga 909001 ttggggccgg tcaggtgatc ttgacgtcta cctcgtggat gcccgcgccc gagggctgca 909061 ccaccgcgtc gccgttgccg gccgccgaca gcgcgcgcag cgtccaggat ccgggcgcgg 909121 cgaagaaccg gaaatcgccg gtggccgacg cgacgacctc cgcggtgaac tcgtcggagg 909181 agtccagcag ccgcacgaac gcgccgccca cggcctggcc gtcaccgtcc actacgcggc 909241 cggtgatcac cgtttctttt tccaggtcga cgctggccgg caatgtcagt ccttgcttgg 909301 gtccagagca catatcagct tcccaactcg atcggggcgc ccaccaggga gccgtattct 909361 gtccaactgc cgtcgtagtt cttgacgttt tggtgtccga gtaattcccg caacacgaac 909421 caggtgtgcg aggaccgttc cccgattcgg cagtaggcaa tcgtttcctt gctgttgtct 909481 aggccggcgt cggcgtaaag cttggccaac tcctcatcgg acttgaaggt gccgtcctcg 909541 ttggcggccc tgctccacgg cacgttgatg gcaccaggaa tgtgtccggg ccgctggctt 909601 tgttcctgcg gcaggtgcgc gggggccagg atcttgccgg agaactcgtc gggagagcgc 909661 acgtcgatga ggttcttgac gttgatggcc gccaggacct cgtcgcggaa tgcccgaatc 909721 gtgttatccg gcggggaggc ggtgtaggag gtcaccggcc ggctgaccgg gtcgctggac 909781 agcgggcgtc cgtcgagctc ccacttcttg cggccgccgt cgagcaactt gaccttctca 909841 tggccgtaga gcttgaaata ccagtacgcg taggcggcga accaattgtt gttgccgccg 909901 tacaggatca ccgtgtcctc gttggcgatg ccacgctcgg acagcagctt ggagaattgc 909961 tgggcgtcga cgaagtcacg tttgaccgga tcctgcaggt cggtgcgcca gtccaacttg 910021 atcgcgccgg caatatggtc acggtcatat gcactggtgt cctcgtccac ttcgacgaaa 910081 acgaccttcg gcgcgtgcag attgctctca gcccagtcgg cggagaccag gacatcgcag 910141 cgtgccatgg cgggaatcct ttcgcatagt tcggtgacca gcgtggtcaa ctggttaggc 910201 gggacgggga gtgttactgc ttgactgctc cttgggacgt ctgttgcaca gaaacggcgg 910261 gcgacacgct acggtggggc tcctaggctg ctctaagtgc tgcgcggacg tgcgcggcta 910321 ctcagcagct acagcaacag caacaacccg ctaggcggca cagatcaact gcgcgacgct 910381 tggtgagcat gggctcgatg cgggctgaca cgtcggacag cttacccaat cgcatagtgc 910441 tcaagccaac agtggtttca gggcagagcg caggtcggcg gccttgggga ccccggaggt 910501 ccggtagcgc tgtcgcccgt cgacatcgaa gatcaacgtg gtgggcagcg aaagcaccga 910561 aaatcgccgc gctgcctgcg ggttggagtc caggtcgacc tcgatgtgag caacatctcc 910621 cagatcggcg cagacgtcgc cgacccctcg gcgtacccgg tcgcagggcg cacaccctgg 910681 ggccctgaaa tgcacgacgg tcggcccggc cccggacagg cccagttccg cggtgcgcgc 910741 cggagccgcc ggtgtcgttt ccggaccaac ctcccgcagg atcactgacc gccgggtcag 910801 caaccaccgg gcaatggtcg ccagcgcacc tgtagcaacg gaagcgacga tcatggtcgt 910861 catgactgtt tgaactcgtc gagcgagatc gttactcccc gggtaatgcc ttcgatgatg 910921 acgtccgatc cgcgcgcccc cacggtgttt ggcaccaccc cgaacggcag cttctggttg 910981 ggcagcttgc tggcgaaggc gtgcagcacc gcatcccgct tgtcatccgg aaccggttgg 911041 tccgcggtgt cgggtccggt cacgacggcg gtgggggtga taaccaaggt cgcgcggtcg 911101 tccgaggcaa cggacaggtc caccaagacg ctgacccggt gagcgaagtt ggccgatatg 911161 ggcgtgccgc tgaacaccag cccgcggctg ccagatatcc cggactcggt agtgccgccg 911221 gtggcgtcgt tgctctcctg acggggcgcg gcgaccataa ggtcgctaat gcccaggtag 911281 cggcccaggt gcatggagtc gatgatgatg cgactctcca gctcgccgac cgggagcttg 911341 gcatcgggcc tgatcagcca ggacgcgtag gacaagtcga tcgaatgcat agtggcctcc 911401 agagtggccg tgccacttcc ggcgtgctcc acggcaaatg ccttgatttc tagctccgcg 911461 tagtgttccc gcatcgcctg cgggatgaat gggaaccgca ggatggcgac gaacgggtct 911521 gacctcaggt ttgccgcttt gcgcacagtg gtcgacagcc ggtactcggc atagatgctg 911581 gccccgaaat cggcgccgac ggcgcccacg atgagaacgg ccacgacgat cgccgccccg 911641 gtcaccccga ccagcacctt gcgcatcggc atattgtcgc ccagcgctcg agcccgtccc 911701 ggagcgcctc gtcaggcggc acgttatcgt tagatgagct gccgctaccg tcacatggcg 911761 cgatgaactg ggagacgcct ttcccacgac gctggagggg cttgttggag ttattactgc 911821 tgacctcgga gctgtatccg gatccggtcc tgccggcgct gtcgctgctg ccccacaccg 911881 tgcggacggc gccggccgag gcgtcttcgt tgctggaggc gggaaacgca gacgctgtgc 911941 tcgtcgacgc gcgcaacgac ctgtcgtccg ggcgaggcct gtgccgcctg ttgagctcga 912001 ccggccggtc gatcccggta ctggcggtgg tgagcgaagg cgggctggtg gcggtcagcg 912061 ctgactgggg gctggacgag atcctgctgc ccagcaccgg gcccgctgag atcgacgcca 912121 gactgcggct ggtggttggc cggcgcggag atctggctga ccaggagagt ctgggcaagg 912181 tgagcctggg cgagctggtg atcgacgaag gcacctacac cgcccggctg cgtggccgcc 912241 cgctggatct cacctacaaa gagttcgagc tgctgaaata cctggcgcag catgccggcc 912301 gggtgttcac tcgggcgcag ctgctgcacg aagtatgggg gtatgacttc ttcgggggca 912361 cccggactgt tgatgtgcac gtgcggcggt tgcgggccaa actcggcccc gagcatgaag 912421 cgctgatcgg cacggtgcgc aacgtcggat acaaagctgt tcggccggcg cgcggccgac 912481 cgccggccgc ggaccccgac gacgaagacg ccgatcccgg ccgggatggt atgcaagaac 912541 cactggtcga cccgttgcgc agtcagtgac ggcgcttgac tggcgctccg ctctgaccgc 912601 cgacgagcag cgcagcgtgc gtgcactggt cacggcgaca acagcagtcg atggggtagc 912661 acccgtgggt gaacaggtgc tgcgggaact gggccagcaa cgcaccgagc atctgctggt 912721 ggccggttcg cgaccgggcg gcccgatcat cggctacctc aacctcagcc caccccgggg 912781 cgcgggtggt gcgatggcgg agttggtggt gcatccgcag tctcgacggc gcggtatcgg 912841 caccgccatg gcccgcgcgg cattggccaa gaccgccggc cgcaaccagt tctgggcgca 912901 cggcacgctg gatcccgctc gggcgaccgc gtccgcgctg ggtctggtcg gcgtccgcga 912961 actgatccag atgcgacgcc cgctgcgtga tatccccgaa ccgacgatcc ccgacggggt 913021 ggtgatccgc acctacgcgg gcacgtccga cgacgctgag ctactccggg tcaacaacgc 913081 cgcgttcgcc ggacacccgg aacagggtgg gtggaccgcg gtccagcttg ccgagcggcg 913141 tggcgaggcg tggttcgatc cagacggcct gatcttggcc ttcggtgatt cgccacgtga 913201 acggcctggc cggttgctgg gtttccattg gaccaaagtg catcccgatc acccgggatt 913261 gggcgaggtg tacgtgctgg gcgtcgatcc ggcggcgcag cgccgcggtc tcggccagat 913321 gttgacgtcg atcggtatcg tctcgctggc ccgtcggctg ggcggtcgga agaccctcga 913381 ccctgcggtc gaacccgccg tgctgctcta cgtggagtcg gacaatgtgg cggccgtgcg 913441 aacctaccag agcctgggct tcaccaccta cagcgtcgat accgcctacg cgctggctgg 913501 cacggataac tgaccgaaga tgttcccccc caagaagtcg taagcaggag cttaagtggc 913561 caagcggttg gacctcacgg acgtcaacat ctactacggg tcatttcatg cggtcgctga 913621 tgtgtcgctg gcgattctgc cccgcagcgt cacggcgttg atcggtccct cgggctgcgg 913681 caagacgacg gtgctgcgca ccttgaaccg gatgcatgag gtcatccccg gagctcgagt 913741 cgagggtgcc gtactgctcg atgatcaaga tatctacgcc cccggtatcg acccggtcgg 913801 tgtccgccgg gcaatcggga tggtgtttca gcggccgaat ccattccccg ccatgtcgat 913861 tcgcaacaat gtggttgccg gcctgaagct gcagggtgtg cgcaatcgca aggtgctcga 913921 cgatacggcc gaatcctcgc tgcgcggcgc aaacctgtgg gacgaggtca aggatcgact 913981 ggataaaccc ggcggcggat tgtctggggg gcagcagcag cggttgtgca tcgcacgggc 914041 aatcgccgtg caacccgacg tgttgctgat ggacgagccc tgctcctcgc tggacccaat 914101 ctcgaccatg gccatcgaag acctgatcag cgagctcaag cagcagtaca ccatcgtcat 914161 cgtcacccat aacatgcagc aggctgcccg ggtgagtgat cagacggcat tcttcaacct 914221 ggaagcggtg ggaaagccgg ggcggctggt agagatcgcc agcaccgaga aaatcttctc 914281 caacccgaac cagaaggcca ccgaggacta catctccggg cgcttcggct aggcccgatg 914341 ccctcgatgg ccaggctggc gtcaccgcgg gtggatgttt gctcggccta gggaaaggcg 914401 ccggtcgcct ggaagatcac gcgtcgtgcc acttccacgg cgtggtcggc aaagcgctcg 914461 tagaatcggc tcagcaacgt cacgtcgacg gcggccgcca ctccgtgctt ccattcgcgg 914521 tccatcagca cggtgaacaa atgccggtgc aggtcgtcca tcgcgtcgtc ttcttcgcgg 914581 atctgggcgg ccttttccgg gtcgtgcgac aacacgacct cttgggcact gttgcccaat 914641 tcgactgcaa ctcttcccat ttcggcaaaa taaccgttga cctcttcggg cagcgcgtgc 914701 tgtggatgcc gacggcgggc gatcttggcg acatgcagcg ccaacgcccc catccggtcg 914761 atgtcagcca ccatctggat ggcgctcaca atggctctga ggtcaccggc gaccggtgcc 914821 tgcaacgcca gaagaacgaa tgcactctcc tcggcccggg cgcttagcgt cgcgatcttt 914881 tcgtggtcgg agatcacttg ctcggccagc acgagatcgg cctgcagcaa ggcttgggtg 914941 gcccgctcca tggcgatgcc tgctagcccg cacatttccc cgagacgctc ggataattcc 915001 gagagttgct catggtaggc ggtccgcatg tgctaaagcc tacgttcccg accttggaaa 915061 atgccgtaag cgtcgtgtca atgcggctac tcgcaggtgg tgtcggcggc gttggtgacc 915121 gtcaggtcct cgggcagctt ggtcggtggg ctggaggagt tgcggcttat ctgcacgctg 915181 acggtggagc cactcggcag gggagcgcgc accgcgctga agtcttggcc cagcaccacc 915241 tggaccagtt ggccgatccc ggtcacccgc tcgatctttg actggccgaa cacggcggcc 915301 acggtggcgg cagcctgttc gttgccgggc gaaaaaaaca ctgtggtggc cagcagcgaa 915361 ctcgggtagt cgtccggagc catcacgttg aagccgttcc gcttgagctg atcggtggcg 915421 gtggtggcca aaccggcctg gccggtcgag ttagagacct gcactgtgac ctcttttggc 915481 gaggtcgtcg taacctgctg gtgctgaatc tcgttggtca gacccgcctg cggcgccttc 915541 ttggtggtgg tcggcggggt cgacggcgtg ttgcccagac gctgggcgtt gtgatcgttt 915601 tccaggggca gcggatcgtc gtcgatgatg gcggtgaaaa gcgccttcat gtcggaggta 915661 cgcgggggct cgtcgccgtt ctggtcggtt ataccggtcg gaacggtcac gaacgtgacg 915721 tgcccggccg ccatatgctg caacgatcga ccgagttcga ccaggtcttt ggtcttgacg 915781 ttgtccacgt agctgttacc gatgaacatg ttgacgacgt tgttgagcct gctgaggttg 915841 aacaaggtgt ccgtcgagat catcgaacgc agcagcgacg acaaaaacaa ctgctggcgt 915901 ttgatgcgcc cgtagtcgcc attgctctcg gtggtgacct ggcgagcgcg cacatagttc 915961 agcgcggtcg gcccgtcaat gacctggcgt ccggcgtgct ccagcaccgt gcccagttcg 916021 tagtcccgca acggggtggt gctgcatacc tcgacgccgc cgagggcctc gaccatccgc 916081 gcgaaaccga cgaagtcaat cgcgatgaac cggttgatgc tcaagcccga cagtttctga 916141 atgaccttca ctagacactt aggcccgccg aaggagaatg ccgagttcag cttggtctcc 916201 gtgtacacca gtctgggacc catcgttccc gtcttctcgt cgtagatggg tccgtactta 916261 ccggtctcgg ggttccacgc ctcgcattgg attggagtga tcgccaggtc gcgggggaac 916321 gacaccgcga cgacccgctc gcggctggcc ggaatgttga ccagcatgac ggtgtccgaa 916381 cgtgcgccgc cggcgtcctc ggcgtcgccg gcgccgatat tggcgttcgc cccggcacga 916441 gagtccatac cgacgagcaa gaagttctcg tcgccatgct gcccgctggg gttgacgatg 916501 tcgcccgaat gcgggtcgag cgcgcttacc atgttcagcc ggctgttctt cgacgcgctc 916561 cactgccatg ccccgccggt cagcgccaac gccagagcgg caaacagagc cgccagcgag 916621 cgcgcggcca gcaccatcgg gcgccggccg gagttcggcg ctggcttggc gggcgcgggc 916681 gacgttcggc ggatccgcaa tggccgcact cgagccgatc cggttagctg cttgccgggt 916741 agctcgggtt cacggcgggc gtggtcggcg cgcggatagt tggctgcccg gaggtcggga 916801 agctccgaga ggaactcgag cgagtgggcc gggatggcga tagcctcggt gtcctgctgg 916861 tcgtcggcgt cgtcggggac cttcgggccg cggccggatg gctcgggttc gggggcgaca 916921 tggcggtgcg tggggaggtc aggaaaagcg gggccgagcc tggcgatcag atcggccaca 916981 ctaacggcgc cggtggcatg acagccgaca ttctgggtgt cccgcggacc ctgggctgcc 917041 acccatgtgg cgggcggtac cgtgatccat cggtcaacac catcggggaa tgctgactcg 917101 gagagccgtg cccacggcgc ggcgctctcg ccgtcactca tgtcctaccg gcctccgaga 917161 gtctaggtgg cggacgccca cggtgttggc tgcgtgtcct acgcgcacct tcgcgcagca 917221 ccgccacgag tcggcgccgc acaatgcagc aaggcccaca tcgtactgat ttatcggtcc 917281 agacgcgatt tcgacagggt ctcgattcag ccacccgacc ccatggcgtc cgccccttcc 917341 ggcactcggc agtcgtcggg gtcggttagc cagccgtcgg gaagggccac ccgggcgggg 917401 gaaccctgcc ggccccgggc gccagtcgcg gagtccggga acggtaccgt gccgtccaac 917461 cggtccagca ggcagtcgag ctcgtcgagc gtcttgacca ttgctaacgc ccgccggagc 917521 gcggagcccg ccgggaagcc atgtagatac caggcgatgt gcttgcggat atcgcgcatg 917581 cccttgtcct cgccgaagtg tgcggccagc aaggtgccgt gacggcggat gatgtcggcg 917641 acttcgccga gcgtgggtgg ggtgggggcc gggctgccgg tgaaagccgc ggacaactcg 917701 gcaaatagcc agggacggcc caggcagcca cggccgatga ccacgccgtc acagccggtg 917761 gtggacatca tggccagtgc gtcgccggca tcgtagatgt cgccgttgcc gagcaccgga 917821 atcgtccgga catgctgctt gagccgggcg atctgttccc agtcggcggt gccggaatag 917881 cgttgtgccg cggtacgggc gtgcagcgcg accgcagcgg ctccttcggc ctcagcgatg 917941 cggccggcat ccagatgtgt gtggtgggcg tcatcgatgc caatgcgaaa cttgaccgtc 918001 accggtatat cggtgccttc ggtggcgcgc acagccgcgg ccacgatctg accgaatagc 918061 cgccgtttga acggtagcgc cgccccgcag ccgcgcttgg tgactttggg cactgggcag 918121 ccgaaattca tgtcgatgtg atcggctaac ccttcgccag cgatcatccg agcggccgca 918181 tacgtggtgt ccggatcgac ggtgtacagc tgcagcgagc gtggtgattc gtccgcggag 918241 aacgttgtca tgtgcatggt gaccgggtgc cgctcgatga gcgcacgtgc ggtcaccatc 918301 tcgcagacat acagtccgct gaccgtgccg accttcgact gttccagctg acgacacagc 918361 gcccggaatg cgacgttcgt cacaccggcc atcggagcca gcacaaccgg gctggcgagc 918421 tcgatcgggc cgatgcgcaa cgccgggctg ggttggattg cccgcctcct gctcatcgcg 918481 ctgcgcgctc tgcatcgtcg ccgggctggg ttggattgcc cgcctcctgc tcatcgcgct 918541 gcgcgctctg catcgtcgcc gggctaacga cggctcatcg ccagtttgcc agcggtttta 918601 tgcagctcgt gtgcgctgac cttcttgccc gtacgggctt cccggtcgag ttggcgttgc 918661 ttggacacct cgaacttgtc gcaggccagc tcgaggtcct tgatcaccag ggccagctcg 918721 tcgcgcagct tagccccctc gccggtgaag tcctcgcgct cgaagatacg ccatttcttc 918781 agtaccggca tgacgacttc gtcgaggtgg atgcgcgggt cgtagacacc cccgacggcg 918841 atgaccacgg ctttgcgccg gaactcgggt acttggaagc cgggcatctg gaagtggctc 918901 aaaatcaggt gcagcgactt catggcctgg ttgggcacga ggtcgaacgc ggcctcgctg 918961 acgtcgcggt agaagatcat gtgcagattc tcgtctgccg agatcttggc catgagctgg 919021 tcggcgacgg ggtcgttaca tgccttgccg gtattgcggt gcgaaatccg ggttgccagt 919081 tcctggaaac tgacatagag gacggagtcg gtgaggctct ccgcgaaata gtggccctgg 919141 tggttttggc ctgggctgaa gccccggttg actacctcga ggcgaagttt ctccaactcg 919201 acagggtcga ccgatcgggt caccaccagg tagtcgcgca gcgcgatgcc gtgccgattc 919261 tcctcggcgg tccaacggtt gacccactgc ccccacgcgc cgtccatgcc catgttcatc 919321 gcgatctcgc ggtgatacga cggcaggttg tcctcggtga ccaggttctg caccatcgcc 919381 acctgggcga catcagaaag cttgctctgg tcggggtccc aatcctgccc gccgagcgcg 919441 tagtagttct tcccgtccga ccacgggatg tagtcgtgcg ggttccaggg cttgtgcatg 919501 ctcaggtgcc ggttcaggta cttctcgacg accggttcaa gttcgtgcag cagctgcagg 919561 tcggtcagct tggctgacat ggcgcctcca gttatctgtg tctaatggtt gcagtcaata 919621 tatctgtgcc tctcggtagc atcaagtttg ggcttcgcgc ggcatgttga gctgccagca 919681 gcgggcagga tgctggcatc ggcgggcccc ggtggccgcg tggggtgaac cccagtcgtc 919741 ctcagttgtg cggcccggct gggatggagt gttcggattc tccccgctcg cggtgcggtg 919801 cgtaggtggc ggcggtgctg agcaacatgt tgacgcagta gtcgatgaat tgcttgcggg 919861 tggctcccag ccgtccgttc agatatgcgg tgaacagacc ggtaagagcg ccgatcaagc 919921 tggtggcgac cagtttctgc agaactggat caacgatgcg ggacaacttg cgttgcagca 919981 actcgatgaa gttgggcatc cactccgcgc ccgaccgggt cagggccggt tctaccgccg 920041 gcgccagcaa cagcacgcgc ccgcgcaccg gatcgtcgac catcagctcg acgaattgct 920101 ctacggcctc gcgcggggtt tgcgcggacg tgagggttgc catcgctcgt gtgcagacgt 920161 cgtcgtagac cgcgcgaacg aaatgttcac ggtcggcgaa gctttcgtaa aagtagcgtt 920221 ctgtcaggcc ggcgtggcgg cacactgcgc ggacggtgag tgcgggtccg cctgcgccgc 920281 cgagcaactg cacgccggcg gcgacgaggt tgtctcgacg tagggcgtgc cgactttcca 920341 aggggacacc ggaccagcgg ccccggtttt gaccggtctg cacagctctc ctaaactcca 920401 tagtgacaac gtgcgtagtc agaattcgtg tggccaatga agattcagca ggcaaaacca 920461 ccagtgaccc aagatacgtc tgctacctgt ccgctgacca gcaccgtgca ggattcctcg 920521 ccggttgcgg gccagcttgg caggcctata gggttccgcg gactggccgg cggttgcccc 920581 gtgtcaccgc tgggttacga atcgccgccg ctgccgctgg ggccggattc gctgacgtgg 920641 cgatacttcg gtgactggcg tgggatgctg cagggaccgt gggcgggatc catgcagaat 920701 atgcatccgc agctgggcgc ggcggtcgaa gatcattcga cgttcttccg gggacgctgg 920761 ccacggctgc tgcggtcgtt gtacccgatc ggcggagttg tcttcgacgg cgatcgagcc 920821 ccagtcaccg gtgtgcaggt gcgtgactac cacatcacca tcaagggtgt cgacggtgcg 920881 ggccgtcgct accacgcgtt gaatcccgac gtcttctact gggcgcacgc caccttcttt 920941 gtcggcacgt tgcatgtggc cgagcggttc tgcggtggcc tgaccgaggc gcagcggcgc 921001 cagctatttg acgagcacgt ccagtggtac cgcatgtacg gcatgagcat gcggccggtg 921061 ccggcgacct gggaggagtt tcaggactac tgggaccaca tgtgccgcaa cgtgctggag 921121 aacaacttcg cggcgcgtgc cgtgctcgac ctgaccgaac tacccaaacc gccattcgcc 921181 caacgagttc cggattggct gtgggccgcg ccgcgcaagt tgctggcccg gttcttcgtc 921241 tggctgaccg tcggactcta cgatccgccc gtgcgcgagc tgatgggcta ccggtggttg 921301 cgccgcgacg aatggttgca ccgccgcttt ggcgacatcg tccagctcgt ctttgccttg 921361 gtgccattcc ggtttcgcaa gcacccgcgg gctcgcgccg gctgggaccg tgccaccggc 921421 cgcatccccg ccgatgcgcc gctagtacag acgcccgcgc gcaacctgcc gccgcccgac 921481 gagcgtgaca acccgacgca ctactgccct aaggtctgac cccggacctg cggcgcaacc 921541 ggggcgtggt tgtgctcacc gttaattggc ttacccgaca tccttggtag ccgatgcctt 921601 agcgaccgac tgcagtccgc cggcagcacg gtggtggcgg ggaatcccgg gaccggcgtg 921661 ctcggcgttg aaaacggcgt cgatgacgag ctggcgcacg tgctcgttct ccagacggta 921721 aaagatcgtg gttccatcgc ggcgggtgcg caccagccgc gccattcgta gctttgccag 921781 gtgctgggag accgacggcg cgggcttgcc cacctgctcg gcgagttcat tgaccgacat 921841 ttcgcggtct gccagcgacc acagcacctg cacgcgggtc gcgtcggcga gcattcggaa 921901 cacctcgacc accaagcaga cctgatcgtc aggcaacggg tcaggtccac tatctgcgta 921961 catacgcaaa caatagaacg cgggcgtggt gggctgtcaa ggtcgcgggt cggcgcccgc 922021 tcagcccgtc ggagcggcga tcgcgctgcg ctcaccgccg ttgggttcct gccggaaccg 922081 gtagacatcc accgcgccag ccctgatatc gggccggtgc tcttggcgca tcggcaggcg 922141 ccggtcctgc cattccttgg cgaactcgtc gtagaaggtg gcgggctcga agtacctgcg 922201 gtcatcgacg taatgcggtt cataggcgtc acgcgacgtc aggaagacaa cctcatcggg 922261 tgagcagtag tacagcgatc catagcacat cggacacgga tgggccagca cgttgagagt 922321 ggtaccgacc aggtgctcag tgcccagctt ggtgcacgcg gcacggatgg caaggctctc 922381 ggcgtgggcg gtcggatcat tggtttgggc ccatcgtcga aaacctgcca tgcctgccgg 922441 catgtgcaag acatcggctg ggacgaaaaa tggcaatgcg acggctgttc gatcacgcac 922501 caacgtgacg acaacgccgc gatcaacctc gcacgctacg aggaaccacc tagcgtcgtc 922561 ggcccagttg gggccgccgt caagcgtgga gccgaccgta agaccgggcc tggcccggcg 922621 ggtggccgtg aagcgcggaa gggaaccggc cacccggctg gcgaacaacc ccgagacggg 922681 gtgctagtcg cgtgaccact aaagatcact cacttgcaac ggtagttcgc agtggagacc 922741 acggtagtag ctagactatc tacatttatc gcatatccgt tttgcttgag ggggcaacga 922801 tggtacgcgc cgatcgtgat cgctgggatc tcgcgacgag tgtcggggcg acggctacca 922861 tggtcgccgc ccagcgcgcg ctggctgccg acccgcgata tgcgctgatc gatgatccat 922921 atgcggcgcc gttggtgcgt gccgttggta tggacgtcta cacgcggctg gtggattggc 922981 agatccccgt cgagggggat tccgagttcg atccgcagcg aatggccacg gggatggcct 923041 gccgcaccag gttcttcgat cagttcttcc ttgatgccac ccacagtggc atcggccagt 923101 tcgtcatcct ggcgtccggg ctggacgccc gggcttaccg ccttgcctgg ccggtgggca 923161 gcatcgtcta cgaagtggac atgccggagg tgatcgagtt caagaccgcc acgctgagcg 923221 atctgggcgc cgagccggcc accgaacgcc ggactgtcgc ggtcgacttg cgcgacgact 923281 gggccaccgc acttcagacg gcgggttttg atccgaaggt gccagcggcc tggagtgctg 923341 aagggttgct ggtatacctg ccggtcgaag ctcaggatgc gctgttcgac aacatcaccg 923401 cgttgagtgc tcccggtagt cggctggcgt tcgaattcgt gccggatacc gcgatttttg 923461 ccgatgagcg atggcgcaac tatcacaatc ggatgagcga gctcggattc gacatcgacc 923521 tcaacgagct ggtgtaccac ggtcagcgtg gtcacgttct cgactattta acccgcgatg 923581 gctggcagac ctcggcgctt acggtcacgc agttgtacga ggcaaacggc tttgcctatc 923641 ccgacgacga gctcgcgacg gcgtttgccg acctcaccta cagcagcgcg acgctcatgc 923701 gctaaagcaa gcgatctgac cgcttactgg cgaagcagct catctttcag gcgactggtg 923761 atcatctcct gaaacacgac ctgggccgga ccgtacaggt cctggaatgt cgacactaag 923821 gcgtccctgt tgtactcggg aatggagccg ccactgggag tccaaaagct atcgatgtcc 923881 agcaggaaga atggtccggt ttgggcgggt gttattcggc gcagatggta attgggatca 923941 agcgcttggc ccatacccgg gccgtagcgc acgatgagcg atttgcctgg ttgtagctca 924001 cggtagactg cggcaccctg ccactcggtc aggaccaggc cgccgggagt gaaacgctgc 924061 ggcccgagca gctgctcgtc gatccagttg ctccacgtga tccggccgtc gacacccgcg 924121 gggacgcgga tctccagaac aaagcgaaga ccgatacgct ccaacccaac gattgacgag 924181 acctgcgcgc gagcatccac gacccgcatc acaacgtcgg taaaggcctc aaagctgcgg 924241 taggcggtgg tctccacgac tatcgcctgg ttcttcagtg aagcggcggt ggtgttatcg 924301 cgattgacat aacgaacgaa acgatccgcg accggggtgg gggctccacc gggcgccgtc 924361 atcccccagc tgacgtcctg cgcctggcgt tcgatcggta gatcattgat aagcaggtgt 924421 ttgagctccc ggttcgctga ttcggtgagc gaatccgttg tcgggtgacg gatttccacc 924481 gtcaccaggg caacgggtgc gttgggctgg acctcatcct gatttgtctc ggggagcata 924541 gacagcaagc atagccaggt tgctttgctc agatcgccgg accgtgcatc gggagggaat 924601 cggcgatgcg cacggcttcg tgcccctgtt tgtgccccca ccaggactcg aacctgggac 924661 ctgcggatta aaagtccgta gctctaccaa ctgagctata ggggcgcgaa gactcaggat 924721 actgcgttgg cgtcggccgc tcgtttgagg aataggctgg gggtgaccta agctggcgtg 924781 gctcccaacg gtcaccacgt tgcgagtgcc ccggagagat tcggttctgc ccccttcgtc 924841 tagacggcct aggacgccgc cctttcaagg cggtaacgcg ggttcgaatc ccgtaggggg 924901 tacctgcgac gcggtatcgc ggagcacaca acacagcaag gccctgtggc gcagttggtt 924961 agcgcgccgc cctgtcacgg cggaggtcgc gggttcgagt cccgtcaggg tcgccaggac 925021 ggtgaggcac atgctgcctt ccggccaggt agctcagtcg gtatgagcgt ccgcctgaaa 925081 agcggaaggt cggcggttcg atcccgcccc tggccaccat ggtctaccta gataggcact 925141 gtggcggcac tgctacgtag ccgacctccc tgggtctggg tgattggtcc cgggctgcga 925201 tggtcgtgag cacacgcccg gatcaccgat gccgtcccgc cccggtaggc catcgcggcg 925261 atgatcgaga ttgccggccg ggttgatcgc tgcggattcc acccgggtcg aacggcgggt 925321 ccatctgctc ctcgatcgct cgtgaaagac ctgattgttc agccatttcc agcatcacag 925381 gcgccaaacc cattggccga catcaaattc cgctcgtcaa ccaccgccgg ctcggtggtg 925441 aacgcatgca gtgaatgggt caaaagtgtg gtcttggact gtagagaaat gcgacgtgag 925501 cgctggtgtt gtcccaggcc agaaggccca gaagacttgt cgcggttcgc acgccgatcg 925561 agtcaccgga ccatccatgg gcgatgcgcc ggaaaaccag acgcgcgcaa gcctcgaagg 925621 ccttggcgtg gcgaagggcc gccggctagg gcaaccctcg tattcccgga tgttggcggc 925681 ccgacgggat tacactgctt cctgctgatt cctccctgcg atcggtcgat cgcaggatcg 925741 gttggcatcg aggtcatgtc gctgtgggag gagatgtcgc gtgtcttatg tgagcgtgtt 925801 gcccgctacg ctggccacag cggcaacaga ggtggcccgc atcggctcgg cgctcagttt 925861 ggctagcgcg gtcgcggcgg cccagaccag cgcggtgcag gccgcggccg cggatgaggt 925921 gtcggcggcg atcgctgcgc tgttttccgc ccacgggcgg gattttcagg cgctcagcgc 925981 gcgggcggca gcgtttcatc acgagtttgt gcaggccctg gccgcgggtg cggggtccta 926041 tgcggtcgcc gagattgccg ccgcatcgcc gttgcagagc ctgatcgacg tgttcaacgc 926101 gcccatccag gccgccaccg ggcgcccgct gatcggcaac ggcgccaacg gccagccggg 926161 caccggggcc ccggggggcc cggcgggtgg ttgatcggca acggcggggc cggcgggtcc 926221 ggggcgcccg gcgccatcgg tggggccggc gggcccgcgg ggttgatcgg tgtcggaggt 926281 gccggcgggg ccggtggaga ctccgcggtc gcgggtgtca tcggaggggc cggtggggca 926341 ggcggggctg ccctgctgtt cggtgccggt ggggccggcg gggccggggg ttccggcggt 926401 tccggcgcag ctggtggggc cggtggcgcc ggtggggccg gcgggctgtt cgccagcggc 926461 ggcagcggcg ggttcggcgg gttcgcatcg acgggcaccg gtggggccgg cggcaccggt 926521 ggggctggtg ggttgttcgc cagcggcggg gtcggcggta ctggcggggg agccgggtcc 926581 ggcggtaccg gtggggttgg tgggacgggt ggggccggag ggctgttcgc tagcggcggc 926641 gctggcgggg ccggcgggtc cggcggtacc ggtggggctg gtgggacggg tggggccggc 926701 gggctgttcg gagccggtgg cgctggcggg ctcggcgggc aaggcaacca caccggcggg 926761 cacggtgggg ccggtggcag cgccggcctg ctcgcccttg gcgacggcgg cgctggcggg 926821 gccggcgggg ccgctaccac cggaaccggc ggggccggcg gggcgggtgg caaggccggc 926881 ctgctgttcg gctccggtgg ggccggtggg tccggtgggg ctgccggcac cttcggtgac 926941 accggtaact ccggcggggc cggtggggcg ggtggcaagg ccggcctgct gttcggctcc 927001 ggtggggccg gtgggtccgg cggcgctggg ggcttcgcca acggctctac cggcggtgcc 927061 ggcggggccg gcggcggggc cgggctgatc ggcaacggcg gcaacggtgg cagcggcggc 927121 acgtcggttg ccaccggggg ggccgggaac ggcggtgccg gcggcgccgg cggcggggcc 927181 gggctgatcg gcaacggcgg caacggcggc agtggcggaa tgggcgatgc cccgggcggc 927241 accggcgtcg gcggcatcgg tgggctgttg ttgggtttgg acggcgccaa cgccccggcc 927301 agcaccaacc cgctgcacac cgcgcagcag caggcgttgg ccgcagtcaa cgcgcccatc 927361 caggccgtga ccgggcgccc gctgatcggc aacggcgcca acggcgcccc gggcagcggg 927421 gcccccggcg ggcacggcgg gtggttgttc ggcggcggag ggaccggcgg gtccggcgtc 927481 agcggcgggg cgggcggaga tggcggggcc ggcgggatct tgttcggcgc cggcggggcc 927541 ggcggcgcgg gcggggccgt cacgggaacc ggcgccaccg gcgggtccgg tggggccggc 927601 ggtggagcct tgctgtttgg ggccggtggg gccggtggag ccggcgggtc cagcgggatt 927661 ggcgggttcg ccgcgggcgg ggccggtggg cccggagggg ccggtgggct gttcaacggc 927721 ggcggggccg gcggggccgg cgggtccggc gtcagcggcg gggctggcgg ggagggcggg 927781 gccggcgggg ccggtggcct gttcgccggt ggcgggatcg gcggggccgg cggattcggc 927841 ggattccgcg gcggggaggg cggggccggc ggggccggtg gcctgttcgc cggtggcggg 927901 gccggcgggg ccggcggatc gggcaacaac gtcggggggg ccggcggggc cggtggggtc 927961 ggtgggctgt tcggggccgg cggggccggc ggatccggcg gcggcggtag cgttgctggc 928021 gacggtgggg ccggcggcaa cgcgggcttg ctcgcccccg gtctcgccgg cggtgccggc 928081 ggtggcggcg ggcagggttt tgacaccggc ggggccggcg ggcccggcgg cgacgccggc 928141 ctgctggtcg gctccggcgg ggtcggaggt gccggcggat tcggcctcac tacgggtggg 928201 cctggggcgg ccggcggcga cgccggcctg ctgttcggct ccggcggcgc tggcggggcc 928261 ggcggctccg gccgaaccga cctcggcggc gctggcgggg ccggcggcaa ggccgggctg 928321 atcggcaacg gcggtaacgg cggggccggc ggggccggcg gggccggcgg gcccggtgga 928381 gccgccttcg ggctcggtaa cggcggcaac ggcggcaacg gggggaccgg cacgtccgcg 928441 ggcagccccg gtgccggcgg cgccggtggt tcgctgatcg gcgcggaggg gctgcccggg 928501 ctgctgccct agccggcccg gttggaccac gtgatcgacg accgtcacaa gtcgacacgc 928561 cgaacgtgca accacggcgg catcacctgg cgtgtcgccg ccaccagcgc acgctcggca 928621 cggagtttag caactactca tccagaagcc ggccactacg gcctggccac ctggtttacc 928681 cgcatggacg cgatgaccgc accgacctga gtcggcattg ctggttgcgc tcacccggtt 928741 atggcaagcc gttctgtccc ggcgcgccaa acaccccggc cttgccaccg gtaccgccgg 928801 ctccgccgtt gacgccgttg ccaccgtagc cgagggtaga cggggcgagc atgccgttga 928861 caactatcgt cgtgtcgccg ccgttgccgc cggtgttagc cccgaagccg gtgccggcgt 928921 tgccgccgtt cccacccacg ccgactagtc cgagggcgtc gccgccgttg ccgccggcgc 928981 cgccgatgcc gatgcccagg atcacgggtg agctcaaccc gccaccgcca ccggcccccc 929041 cgttgccgcc ggtcccggtg gcggtgccgc cagctccgcc ggcgccgccg tgcagcacgg 929101 agaaccctag gaagtttgcg atgccagcgc cggcgccgcc gaagcctccg gctccgccgg 929161 tggcgccgtc cccggtggcg gcaccaccag ccccggcggc gccgccaaag cctaggccga 929221 ggacagcaat gccctcgaag acgccgtcac cgccggctcc gccggtggcg ccgctagtgc 929281 cggcgccgcc ctgcgcgccc gcaccgccga tggcgatggc gatcccgaag gggctgctgg 929341 cggtgcccgt gccacccgga ccgccgggtc cgccgactcc cgtggaagcg tcgccgccgg 929401 cggctccagc gccgcccagg gcaaagatca ggccgcgggc gctgccgcca gcaccgccga 929461 acccgccggt tccgctggtg gcgtccccgc cggccgcgcc ggccccgccg acggccgcga 929521 gtgcgccggt agcgctgccg ccgttgccgc cgttggcgcc gttaaccccg actccggtgc 929581 cggcgttgcc gccgttgcca cctgcgccga cgaatccgaa gccgtcaccg ccggcaccgc 929641 cgctgccgcc ggtaccaacc gaagccccgc cgccgtgccc accggcgccg cccacgccgc 929701 ccagcagccc ggtcccgctg cctccggcgc cgccgttgcc gccggtgtcg gtggcggctc 929761 cgccaacccc cccgacgccg ccgatgccgg cgccgatcaa tccgagggca tcgccgccgg 929821 tcccgccatg gccgccgcta ccagccgaag cggcgccgcc gggaccgccg gcgccgccgg 929881 cgccgcccag cagcccgacg cccaatccgc cggcgccgcc gatgccgccg gtctcggtgg 929941 cggccccgcc agccccgccg gcgccgccga cgcccacgcc cagagccgcg aagccgccgg 930001 caccaacgcc accggtcccg ccggtgccgc cggcaccggt cgcagcccca ccaagcccgc 930061 cggccccgcc gtaggccgcg ccgaacccga tgaagtcggg ggcaacagcg aagccgccag 930121 tgccgccggc cccgccggtc ccagtggtag ctgcgccacc attgccgcca gcaccgcccc 930181 agctcaagtc gagcgcgaaa acggtgcccg aggaaccgcc ggcaccgccg gcgccgccgg 930241 caccgccgtt agtacctgcg ccgccgtgcc cgccggcacc gccgatgccg atgtcgatcc 930301 cgaaggggct ggcggcgcca ccagagccac cggcaccgcc ggcaccgccg cttcccatgg 930361 ccgagtcgcc gccctgaccg ccggacccgc ccaggccaag gaacagcccc aatgcgttgc 930421 tgccggcgcc gccggcaccg ccggttccag tggtagcggc cccgccggcg ccaccggcgc 930481 caccgatggc taccagtgcg ccgccggctc caccggcgcc gccgaccccg ccgttcccga 930541 ctccgctggc ggccccgcca gctccgccgt tgccgccaat gccgaacatc agcgcgttgc 930601 cacccgcccc accggacccg ccgccggacc cgccggcccc gccagctccg ccgctacccc 930661 acagccaccc gccggtgccg cctttgccac ccgaggcgcc ggtgcctccg gccccaccgg 930721 tcccgccatg gcccagcagc ccggcggcac ccccggcacc cccggtctgc cccggagcac 930781 ccgaaccgcc gttgccgccg ttgcccaaca accagcctcc aggcccaccg gcctccccgg 930841 tccccggggc gccgttggtg ccgttgccga taaaaggtcg gcccgacaac gcggcggcgg 930901 gtgcattgat ggcgcctagc aagccctgct cgagggtctg caacggcgac gcgttggccg 930961 cttcggcggc cgcataggcg cccatagccc cggccagtgc ctgcacaaac cgggcatgaa 931021 acgccgccgc ctgcgcgctc atcgcctgat actgctgacc gtggctggaa aacaacgccg 931081 cgatggccgc cgacacctca tcgccagcag cggccaacag cccgcttgtc gggacggcgg 931141 ccgccgcatt ggcagcggtc agggacgcgc cgatgcctgc aagatcctcc gtggccatcg 931201 ccaccaagtc cggcgctgca atcacgaaag acatccgaca cctcccagct ggccggtgtg 931261 atctgactgt cgcccatcgt tacgatacgc gcatatagcg cctaccggga gacgaagttg 931321 acactcgtca acatccgatg gccgccggag atccggcacg gctcggcggt cgtttgggcg 931381 ggcgttggcc ccgcacgttc gacagattcg acaagttcgt gcgcctcgcg caacgagaca 931441 accggcgacg ccgcctaagg tcaagggcgg cgtgcgttag cacttccgtc actcttgtca 931501 attagccgca gcaaacgcca gtcgcccgta cgatggcggc aacggcgtcg gcggagcggt 931561 ttcccgcttg gccaacgccg aagtcccagc atgaccgatc gcggacgcca gtccgcagaa 931621 gccggcttat cgacaatgag gccaaagagc tcaacccgtc agcggacatg tggcgcgcgc 931681 tggccagtgt ggcgatcagt cgtgtgttgc tccactgctg ccaagtcggc cgtcatcgtc 931741 tgctgtgcgg ccatcgcgac cacggcatgc tcgtttcaag ccacatcgac ccagccgagc 931801 accgcacccc cgacatcgcg ggtcgattcg ttgatcgtca gcatcgaaga cgtacggcgc 931861 atcgccaact acgaggagct cgccgcacat tttcagaccg acttgcgtga accgccggag 931921 gcggacacga acgttccggg cccctgtcgt gtggtgggca gcagtgatcg caccttcgga 931981 accgactggt cagagttccg tagcgcgggt taccacggcg ttaccgacga cctcagaccg 932041 ggcgggccgg tcatggtcga gacggttagc caggcgatag cgctgtaccc ggacccgagt 932101 acggcgcgcg gtgtgttcca tcggctcgag tcgtcgctgg cagaatgtgc tggcttgcat 932161 gacccctact tcgatttcat cctcggcagg ccggacgcct ccaccgtgag gatcggcgct 932221 gcgggttgga gtcatgtgta tcgcctgaaa tcgtcggtat tcatatccgt tggcgtgttg 932281 ggtattgaac cggcagagcc gatcgccaac gtcatcttgc agacgatcag cgatcgcatc 932341 cagtagttag ccgaggactg gaaagcagca gcggcggcga cgagcgcagc gtgttgaggg 932401 ctgttgacgc cacgacgccc accgttgcga agaagaaggc gagaagcgtc gcttcggcac 932461 cgactgctgt caccgcaacc gagctgtaac tacggggatc tattggatgc gaggcgtaat 932521 caagcagcgt ggcgatgggt ctggtgtcca ccgcaaagga gaagacatgc catatggggg 932581 aaagcttgac ccacgagagc gctgtcccga acaactcggt gcctgtcaga aggatcagcg 932641 cactggccgc tatccaggcc gtcaggcgtg ccggggatat cacgacgccg aatgtctttc 932701 gttggttatc ccagactgtc gagcgacgtt gtttttgcac tgaacgtcga atcttctgag 932761 actgccgccg ctttcgccgg cgccaagtct cgggcttact taaccaggcg agccgccacc 932821 gtacgacagt cgcagtcgct aagacttgct gctgcatcca actcgtggcg gccttcattg 932881 atcccgacta ccaccctgcc taaccaattc tgtatgacgc gccgtttgag aacgtacatt 932941 tgtgattgcg gttcgcattt aggagcccgg cgtgagctgg tcgagtaacg cctcgaccag 933001 agggcgccgc gaagctgtgg tggtgggcca gcccggtcga cccacggcga agtgctgggc 933061 cagcaggtcg tggtcggcct gtgtggcgcg cgtcgccagc acggcggcct ccggtgcgct 933121 gacactggcg cgcatgtcgt ggccgagcag cgcggcagcg gcggtgcgta ggtcgaagtc 933181 gtgacggcgt agcgcccact ggtctggttt ggcgtaaagc cggtcgaggt cgccggcgta 933241 ccagtgcacc accaaggcca gatctgggcc gtctttgtag tcgtggtccg cggaccgatc 933301 gagccatgcg tgcagtttga ggaccgcata gttcggcggt tggggaaggt ggactgtcag 933361 gccgccaggg agaggcagaa catcggcacg caggtaggcg tcggtgcatc cgtggacgtt 933421 catgagctgg ttgcctgggg gatggcgggt tgtgccggtg ggcgactcca cctcgccgaa 933481 cgggagggca tcgacggcgc ggtcggcgat caggaatcgg tgcccggtgc tgcccagggc 933541 gcggaaggtg gcccgaattg cctcgaagtg gtcccaattg ttcagggtcc ctgcgatatc 933601 ggtgtcgttg gtggcccgcg gcggcacccc gcggcagaag cgccagtgca gtagatcgcg 933661 gcactgtgcc ccgacgagca tcagctgttc agccggcacg acgtcggcaa gtgctgtgac 933721 gatcggtgtc acccaggcca ggaggaccgg gtcataatcg ggcgagtcgc tcatcctgcc 933781 ttctcatgag gtgggcgact tcgacctggc gcggctcgcg cgaggcaagg aggtcggcat 933841 agatcaaggc cgtgggagcc aaccccggtt gctcgtcagg taggttgcgc cagaatagct 933901 ttcggatcac gatgctgccg tgtgggtcgc ggtgccagcg gttgtgtata agcaggtcgg 933961 cgggtagccc gggcgctggg gtgtcgacgt agagcatcag tgattcggga ttgcggattt 934021 cgtcgggcag ggcctgttcc ccgctgaccg ccactgcgag tccgtcgggt gcggaccacg 934081 tgtggatatc accactggcg accaggagtt tgttggcccg gcccagaccc cccggatagg 934141 cagccgccca caggtccagc agctcatcgg tgcgcaccag cctgcggcgg gagccgaggt 934201 gttcgaagaa gccggtagtg cgcaacgtat ccatcgtctc cttggccata ccgaccgaga 934261 cgccggcgct cgcggcgatc gcacgcagcg gcgcgtcgac cagttgcggt gcgtcaagca 934321 gtacgcagac aacctgcgcg cgcttggggg taaacgggtt acgcggtcca tcgctgtgca 934381 gtccgtcacc gagggtgccc ggttgtgcgg acacagctga ccgtcggccg cgcacgtcga 934441 tgagcagtcc accctggtgc cgcaaataag cgttcccagc tccgtcgatg taccagagtc 934501 cgcgagcccg cagcgtttca gcgctcgacg gatgcagacg cgggcccacc acaagcagcg 934561 gcgaaccagc gccggcggta tcccaggcct gcagtgctgc cgttgccgac aggtgaggaa 934621 ggtagagggc agtgatcgtg agggggtgag cgtcgatctc aaggtctagt gattcgggat 934681 gcgcggagtt caatgctgat aggccaccga gcacccgcac tccgtattcg gtgaggtgac 934741 gctcgacggc ctcagcgagg tcagccccga tctgatccat gcgttcagta tatccgtacg 934801 ttcagtttta ttgaacataa tgatttattg aacatatcag gtcggagctg gtcgacttgg 934861 aaggtgtagc ggtatccgag tcgcactcac tgcctcctgc catgactcac cccaagggtg 934921 caggttgtgc ggcagtctga tgagttgccg cagcatcgtt gccgcggcct cctcgttggc 934981 tatctgaaac ctcgtctgca gtcgaggggt ggtcagcacg cgccgggcca gacggactgg 935041 tctactgcgc caaagcttgt cgctgcgctt ggaggtcagg ccgagcaggc gcgaggaacg 935101 acgaacccaa caagccatgg tggttggcgc cgtcgagagg tcggcggtcg ccacaacggg 935161 aagatcgcct tgagcgtcgc tcgaccgccg cctcgagttg ggtcataacg aagtagctga 935221 tgccgatcat gtcgacgttt ccgtcgcatc agcgtgcagc ggcgacccac tcgacgaggt 935281 ctcggtgccg ccgcggccag ggcaccagca gtgacgagtc caggcgccgt cgggccaagc 935341 agtcgcggtg ccagccgtgt tgggtcgggc gatggttggg tgtgctcatt tcgggaacgc 935401 cagggcgatc agcgtcggca aactcgcgtc gatgtgcccg cggcgcaaca atccgcgaca 935461 atgatcgggt gcgtctgatc gggcggctcc gtctgctcat ggtggggctg gtcgtcatct 935521 gcggggcttg cgcatgtgac cgcgtgtcgg ccggccgttg gtccgagtcg ccgagtgcga 935581 cctcgtggcc cgtccggccg gtaaacacca caacgccatc cggtcctgtg ccgccagtca 935641 gcgaggcggc gcgggcagcc gggttggtcg atgttcgcgg tgttgttccc gatgccgcca 935701 tcgacctgcg ctacgcgacg gcgaacaatt tcaccggcac acagctgtac ccgcccgggg 935761 caagatgcct ggtgcacgag tccatggccg agggtctcgc ggccgccgcg gcggtgctgc 935821 gcccacacgg gcaggtgctg gtcttctggg actgctatcg gccccacgac gttcaggtca 935881 ggatgttcga tgtggtcccc aacccggcct gggtggcgcg gccgggcaag tacgcgcata 935941 gccatgaggc ggggcgttcg gtcgatgtga cgtttgccag cgctcagcgg cagtgcccat 936001 cagtgcggcg atccggcgaa ttgtgcctgg ccgacatggg caccgacttc gacgactttt 936061 cttcgcgggc gacagcgttt gcaacgcagg gcgtcagtgc tgaggcccag gccaaccgtg 936121 cccacctgcg agccgccatg caggccgggg ggttgacggt gtactccggt gagtggtggc 936181 atttcgacgg ccccggcccc ggcgccggcg tcgatcgccc gattctcgaa gtgccagttg 936241 actgacgtct catatagtga aataaatgtc cactatttgg gcgcagtggc ggtaggcttt 936301 gagccgaaca cctcgaccat gggaccgcac ggtgaacgac aaacgtcggg cgatttatac 936361 gcacggatat cacgagtcgg tgctgcgcag tcaccggcga cgcactgcgg aaaactccgc 936421 cggctacctg ctgccctact tggtgccggg gttgtcggtg ctcgacgtcg gttgcggccc 936481 cgggacgatc accgtcgacc tcgccgctcg ggtcgtgccg ggatccgtga ccggcgtcga 936541 gccaaccgat gacgccttaa gcctggcccg cgccgaggcc cagctgcacc gcctgtcaaa 936601 catttcgttc accacttccg acgtgcataa gctcgacttc cctgacgacg cgttcgatgt 936661 cgtccacgca caccaggtgc tgcagcacgt cgccgatccg gtacgggcac tacaggagat 936721 gaggcgggtg tgtacaccag gcggcatcgt cgcagctcgc gatgccgact attcggggtt 936781 catctggttc ccgaagcttc cggcgctgga ccggtggttg gacctttatg aacgggcggc 936841 tcgagccaac ggcggcgaac cggatgccgg ccggcggctg ctgtcctggg cccgtgcggc 936901 aggattcgac gacgtcacgc cgacggccag tgtctggtgt ttcgcgacgg cctcggcccg 936961 cgaatggtgg ggcctagtgt gggccgaccg gattctgcaa tccgatctgg ctcaccagct 937021 ggtggattcg ggtctggcca ctgccgcgca actcgaggag atctccacgg cgtggcgaga 937081 gtgggccgcg gccccggacg gttggctggc gataccccac ggtgaaatcc tttgccgggc 937141 ataaactcag gcacacgcgc gaggctcgcg cggttggttg ccgacgacgg gcaggacgtg 937201 gcccggcgag atcaaatatc gtgcagccga aggaattcac gcatcacccg gtcgaatcgc 937261 gccggctctt cgatgaacgg catgtgggaa ctggactcga agaattccaa tcgcgagccc 937321 gcaatccggc cctgcatttc tcgcatgtgc tcaggcgaac attcgtcgaa acggcccacc 937381 accagcaagg tcggcaccgc gatgtcggcc aaccggtcga cgacgtccca gtctcgaaca 937441 ttcccaacga tgcgaaagtc gctgggccca aacatcgtct cgaagatctc ggttcccatg 937501 ttggcgaatg cttccgtgag ttcccggggc caggggcggg tgcggcacag ataagtctcg 937561 ttccaggttc tgatcgcggc ctggtattcg gcggaatggg tggtgccggc cgcctcgtga 937621 cggtcaattg ccgagcgagt tgccacgtcc aagcacgact tcaagctgac cagactggcc 937681 gaaaattcgg gtatcgaagc cgtgctgttc gcgatggtca gactgacggc gtcaggcgcc 937741 ttgtcgagca cgtactgctg tgccagcatc ccaccccacg aatggctgaa gatgtgaaag 937801 cgggtaaggg caagggcttc cgccacggtt gccatctcgg ccactgagcg gttcatcgtc 937861 caaaggtcta cgtctgacgg acatgcggaa tttccgcaac cgagctggtc ccagaagatg 937921 acctcccgct catcagacaa ccgtcgcagt ggggccaagt agttgtgcgg caagcccggc 937981 ccaccgtgca ctacaagcag cggacgacca ggaccgccac caatccgctg gaaccagacg 938041 cgtccacccg ggaccgcgat tgtcccctcc acttgacctc cgatttcggt tgaccaacag 938101 acgcagaatc gcacattcgc cccttcgggg gagtgcgagt ttgcgtcgcc tcgccgggca 938161 tgtcggtcag cgatggcgcg gtcgagacca gacggcccga ggcggtttgg gtggatcgac 938221 agtatcggtc gcgcagttac cggcggactc ggcttctgct ggccggccgg tcgggtgtgc 938281 ccgtgcatac cgctctcggc ttcaccgtgg ctgtggccgt gtgcacaccg ggtgagacgc 938341 ccggttcgtg gttgcggcca gcatcgtgca ccacagcgct gcgccggcca accgcggtcg 938401 ctaccacgga atctggtcga tgacccctgt atttgcttcg gtggttgtgc caatcatggc 938461 ttcctacggc ccgattcatg gtgctcatct cttggccgcg gtggtcgtgg ggtcggccgg 938521 tgccgcgctg tgcctgccgt tggcgcgggc cctgcgccga ccgaccccca gtgcaatgac 938581 gacggattga cggtgcggag cccggggatg tgctgagggc accaatgtgg tgaaagttgc 938641 acgcaagcag cacaatcgga gcccagaatg ggcactgggc gcagaacccg agccgcagaa 938701 gtaatgtgct ggaggggtta ctgcagcaac cacacccccg ggtgtcctcc gatcggggga 938761 aggggctttc gtcatcgttt caggccgatc ggaggacgcc ggcacaggtc aacgatccta 938821 acttgagtta gtgaccacag cggcggccat cgcccgcgag gaccggttgc gttacaccgg 938881 tccggagcgc tgctcggggg acggacaagt tcgagcggcc ggggatcgct attcgacggt 938941 gatctggctg ctgggcggca acttgctggt gcgctcggcc ggattcggct atccgttcct 939001 ggcctaccac gtggctggac gaggacatgg tgcgggagcg gtcggcgcgg tcgtggcggc 939061 ctacggcctg ggttgggcgg tggggcagct gctgtgtggg tggttggtgg accgtgtcgg 939121 ggcgcgggtg acgctggtat ccaccatgct ggtggccgcc gccgtgctgg tgctgatggc 939181 cgggctacac accgtgccgg gattgctggt tggggccatg atcgccggcc tggtttgcga 939241 tgccccgcgt ccggtgttgg gtgcggtgat cgcggagttg gttgccgacc cacagcggcg 939301 ggcacaactc gacggctggc gatacggttg ggtgctcaat atcggtgctg cgatcaccgg 939361 cggggtcggc ggtgtggtcg cgggctggtt ggacaccccg gtgttgtact ggatcaatgg 939421 catcgggtgt gcgatcttcg cggggttggc aggccgctgt atacctgccg atgtgtgccg 939481 taggaccgag tccggccttc gagcttgcac cgccatgtcg aaagttggct atcggcaggc 939541 actctcggac aagcgcctgg tcctgttggc cgtctcgggt ctggcaacgc tcacgacgct 939601 gatgggtttc ttcgcggcgg taccgatgct gatgagcgcg agtggactgg gtgtcggggc 939661 gtacggctgg gtgcagttga tcaacgccct agcggttgtc gcggtgaccc cgctgttgac 939721 gccgtggctg agcaagcagc tcgcacttgg tccacggcca gacattctgg ccggcgcggg 939781 agtgtgggtg actctttgta tggcggctgc cgggctcgcc cgcaccacgg tcggtttcag 939841 tgtggccgcg gctgcctgct cgccgggcga gattgcctgg ttcgtggttg ccgccggcat 939901 cgtgcaccgg atcgcccctc ccgcgcacgg tgggcgctac cacgggatct ggtcgatggc 939961 cgtcgcggcg tcgtcggtgg ccgcgcctat cctggctgct ttcaacctgg ctaatggtgg 940021 gcgcctagtg ctggcggcca ccacggtgac ggttggtttc ttcggggccg ctttgtgctt 940081 gccgctggct cgtgttctgg cagctgccag ttgcggtccg ttgagcagca aggagccgtc 940141 gcgtgactcg taccagtgaa gggttggctg cgttcgtggt cgatcagctg gaggagctgt 940201 atcgccggat gtgggtgttg cgactgctcg atatggcgtt ggagcagttg cgcatcgaag 940261 gcctgatcaa cgggccgctg cagggtggct tcggccagga agcagtaagt gtcggtgccg 940321 cggcggcgct gggcgaaggc gatgtcatca tcaccaccca tcgtccgcat gcccaacacg 940381 ttggtactga cgctccgctg ggcccggtga tcgccgacat gctgggtgcg accgcaggcg 940441 atctagaagg cgctgacgag gatgcgcaca ttgccgatcc tcgggccggg ctaccggctg 940501 caatacgcgt ggtcaagcaa tcgccgctgt tggctatcgg acacgcctac gccctgtggc 940561 tgcgcgacac cggacgggtc acactctgcg tgacccaaga ctgtgatgtt gatgccgatg 940621 ccttcaacga ggccgcggac ctagcggccg tgtggcaact tccggtggtg attctcgtcg 940681 aaaacattcg tggtgcccta agtgtgcacc tgggcaggta cacgcacgag cctcgggttt 940741 atcgccgggc tgtggcctac ggaatgccgg gggtatcggt ggacggcaac gacgtcgaag 940801 cggtccgtga ctgtgtggcc aacgcggtgg ttcgggctcg cgctggtggc ggccccacgc 940861 tggtccaagc catcacctac cgcaccaccg atttctctgg atctgaccgc ggcggctatc 940921 gcgacctggc cggatccgag cagtttctgg atccgctgat cttcgcgaga aggcggctga 940981 ttgctgctgg cacgacccgc ggtcggctcg acgagcagga gcgggcggca tgccaacagg 941041 tggccgatgc cgtggcgttc gccaaggaca gggcgcggcc caacggcggt gggccaatca 941101 gccgaccaac atccggctgg caccaacaac caaagacccg gttctgaggc ctagatgtac 941161 gttggccgcg gacaacgcgg tcggtacatg ccgtcgcgcc gcggccccag ctagtcgagc 941221 agcctctgcc gcatcgcctc ggcgaccgcg gcagctcggt cgctgacgcc gagcttctcg 941281 tacaaccgtt gcacgtgggt ctttaccgtc gacggcgcca catatagctc ggctgcgatc 941341 gcggggatgc tttgaccgca cgcaatgcga ttgagcacct cgcgctcgcg cgcgctgagc 941401 accggggcca cgggtgccgc gcgctggcga atctccccgg cgaggccccc gaccagcgag 941461 ggcgccacca cgtcgcggcc cttcgcgcaa tcgagcaccg ccttgacgat ctcggtgcga 941521 gtcgaatcct tgagcaggaa tccggcggcg ccctgttgga gtgcctggta gacgatcgcc 941581 ggctcgtcgt gcgcggaaat aagcagcacc cgggttggca actcgtagct gcgcaccgcc 941641 gccggaacct gcgcgccgtc catgccgggc atgcggtagt ccagcaatgc gacgtcgggc 941701 aaatgggcct tgatcaactc cagggccgcg gcgccgtcgt cggcctcgcc gaccacgttc 941761 accgagccac tcaacgaaag cgctcgcaca acgccctcgc gaaataacgg gtggtcgtcg 941821 ccgaccacca cgcgcacttt ctccggctgc ggattgctca tggcgcgccg accatggcga 941881 tgagtttagc tgctcgtcgg caaccagccg ctggcagtcg ctggacattg atttgcactc 941941 cgacgtgccc agctacggca acctcggacg tttgggcggt cgccatgagt acggtgtcct 942001 agtggcaatg accagctcgg cggaactgga ccgggttcgt tgggcgcacc agttgcgctc 942061 ctaccgaatt gcttcggtat tgcggatcgg tgtcgtgggg ctcatggtcg ccgcgatggt 942121 cgttggaacc agccggtccg aatggccaca gcaaatcgtg ttgatcggcg tctacgcggt 942181 cgctgcattg tgggctctgc tgttagcgta ttcggcgtcc cggcgattct tcgctttgcg 942241 acgctttcgc agtatgggcc ggttggagcc atttgctttc accgccgtcg acgttttgat 942301 attgacgggc tttcagctgc tgtccaccga cgggatctat ccgctgctga tcatgatcct 942361 gctgccggtc ctggtgggcc ttgacgtgtc gacgcgacgg gcggcggtgg tgctggcctg 942421 tacgctagtc ggattcgcag tcgcggtgct gggagacccc gtgatgctgc gcgcgattgg 942481 atggcccgag acaatatttc ggttcgcgct ctatgcgttc ctgtgcgcca cggccttgat 942541 ggtggttcgc atcgaggagc ggcatacccg ttcggttgcc ggcctgagtg cgttgcggga 942601 ggaactgctt gcccagacga tgacggcctc ggaggtgctg cagcggcgga ttgcggaagc 942661 cattcacgat ggaccgctgc aagacgtgct ggccgcgcgt caggagctca tcgagttgga 942721 tgccgtaacc cccggcgacg agcgcgtcgg acgcgcgttg gccggactgc agagcgcgtc 942781 ggagcggctg cggcaggcca ccttcgagct gcatccggca gtgcttgagc aagttgggtt 942841 ggggccggcg gtaaaacagt tggcggcctc taccgctcag cgttcgggta tcaagatctc 942901 caccgatatt gattacccaa tacgtagtgg gatcgacccc atcgttttcg gtgtggttcg 942961 cgaactgctg tccaacgtcg tgcggcattc cggagctacc accgcctcgg tcaggctcgg 943021 aatcaccgac gaaaaatgcg ttttggatgt ggccgacgat ggcgtggggg tcaccggtga 943081 cactatggcg cgccgcctgg gtgagggaca catcggtctg gcttcgcatc gggctcgggt 943141 ggatgccgcc ggcggagttt tggttttcct ggccaccccc agggggaccc atgtctgcgt 943201 ggaactacca ctgaaacggt gaatggccgt tgttgccggt caaccgatgt gccggtggca 943261 gcgacgtgac ccccgcgcag gtcgaaagcc ttgctggatc gatggttccg ccggtgcccg 943321 ccatgggccc ggccggtcac gccggccagt ccgcaaccgg ctgtccaggg ccatctcacg 943381 ggcaacgtcc tgggaggcgc tggcagcggc ccggttcagc ccacaagccg cctgtcacag 943441 aatgtagtcc aggcgggtcg ccattccggc gacctggtga tagttgttgt ggcagtgcat 943501 cacccacacg ccaggattgt cggcgaccag gacggcgcgc atcttctgct tgggcagcac 943561 tatcacggtg tccttgcggg cgccggggct gccgtcggcc ttgatcatct gaaaggtatg 943621 gccgtgtagg tggattgggt gatacatcat ggtggtgtta tcgaacatca gggttggccg 943681 ttggcctagc cgcacgtgca gtggattggt cgtgctgtgg ggttccccgt tgattgtcca 943741 gtcgtacttg gccatggtgc cgcccaaggt gaccgggagg tcgtgggtgg gttcgggccg 943801 gcccaggttg gcagtcgttg cggcggtgaa catttccacg gtacccactc gccagttgag 943861 ttcatccggc cgaaactgcg ggtcgggtgg gctgccggcg ccggtagaca gcagcgcacg 943921 cgccagcgcg ttcttgcctt ccgcgagtgc gaccagggga aagacgccgc cagcggcggt 943981 caccatgacg tcgtagcgtt cggccatgcc gatcagcaga gcgtcgactt cggtgggaat 944041 cactgggtaa ccgtcggtgt gggtgaccgt catcgaatgc ccggccagcg cgatgcggaa 944101 cgcggtgtcg gcggcgctgt tgatgatgcg gatccggatt cgctggccag gcttggcctt 944161 aaaagacgtg gccgccacgg ggattcgccc gttgatcaga tagtacgggt aggcgatgtc 944221 ccctccgtcg ccgccgagca ggttgctgtc aacgccttcg ccttcgggca tacctgttgt 944281 gttttgcatg gtgggtttgt tcgggtcggt cagctcgccg tagagctgtt gcggggactt 944341 cccgatgccg tccgtccaat cgtcgaggat gatgatccat tcggcgtcgt agtggcctgg 944401 ctcagtcgga tcgtcgacga cgacaggcag atataggccg tggtcgcctt gaagaccgac 944461 gtgcggatgg gcccagtagg tgcccggatc cggcacggag aaccggtacg taaagtcacc 944521 gccggggccg atgttcgcag tcgcgggctc ggtgccatcc atatcgttgc gcagcgcgat 944581 gccgtgccaa tgcaccgacg tcggatcacc cagacggttg gtcaccgaga cgacaatctc 944641 atccccgacg gtggcccgga tcagtggtcc ggggatggtg ttgccgtagg tcagcgtgct 944701 gacgatcggc ccacccaggt cgatcctcgc cggctggggg gtcagcgtgg cggtaaccgt 944761 tcgcccactg tgcggccggg ccgcctcggc cgcgtcgatt gcagcggtca tcccggcggc 944821 gccggatgcc gtgggcttcg aggcgcaagc ggctagcgca aagccgctgg cgatgccggc 944881 gccgaggaag ccgcgccggc tgaaccgcct cttgtcgaag gcgttaccgc tcgtggccag 944941 ctcgggcatc gatcgctcct cgtctggatt tggtctcgct cttcgtaccc tgcccagaca 945001 tcgggcagta cgcaacggtt gatgatcacc acgccatcat ccgcccttac gccctacccc 945061 tatagggtat atagtgggcc acgtggaaag cgggcacgtg gtgtggatgc gatcggcgat 945121 tgtcgcggtc gcgctggggg tgacggtagc cgccgtcgcc gctgcatgct ggctccccca 945181 gctccaccgt catgtggctc acccaaacca cccgttgacg acgtccgtag gtagcgaatt 945241 cgtcatcaac accgaccacg ggcacctggt ggacaactcg atgccaccgt gcccggaacg 945301 gctcgcgacg gcggtgctgc cgcgctccgc cactccggtg ttactaccag acgtcgtggc 945361 ggctgcgccc ggcatgacag ccgcgcttac cgaccccgtc gcgccggccg cgcgcggtcc 945421 gccggcggcg cagggatccg ttcgcaccgg tcaagacctg ttgacccggt tctgcctggt 945481 tcgtcgctga ggggtcagcg ccaggcggtg gtggccattc gccatcgccg gtgaccgctg 945541 acccccatcc agtgccgcgt gtgacttccg gccccgatgc agaagcgacg atcactatga 945601 acaacaacct gccgctggca aatccggtaa acccaacaag catcacctcc aacccgcaga 945661 tactcctggc caaccgggcg caccgcacct tggtgaggtc gcggcagacc cgcgaccggt 945721 accgcctcct cccggaggga tatcaagtca ctcctggccg gaatcgccac ccgggcacca 945781 tggttggcaa taccccggtg ctttggatac ctgagctgtc ggggacctca gaccctgacc 945841 gtggattttg ggccaagcta gaaggattca atcccggggg tatgaaagac cgccccgcgc 945901 tgtacatggt cgaatgcgcg cgcgcccggg gcgatatcgc gcccggtgcc gcgatagtcg 945961 aatcaaccag tggcactctg ggattgggcc tagccctcgc tggtaaggtg taccggcacc 946021 cggtcaccct ggtcaccgac ccggggctgg aacccatcat cgcgcgcatg ctgaccgcct 946081 acggcgccgg cgtcgatatg gggacgcagc cgcacccggt cggcggatgg caacaggcgc 946141 gcaaggaccg ggttgcgcag ctgatggccg aataccccgg cgcgtggaat ccgaaccagt 946201 acggcaaccc cgacaacgtc ggcgcctacc ggtcgttggc gctggagctg gtcgctcagc 946261 ttggccggat cgatgtcctg gtgtgctcgg tggggacggg tggacattca gcaggtgtcg 946321 cccgagtgct acgggagttc aacccggaca tgcggttgat cggcgtggac accatcgggt 946381 ccacgatctt tgggcagccc gcgtcgaaca ggctgatgcg cgggctgggc tcgagtattt 946441 atccgcgcaa tgtcgattac cgtgcattcg acgaagtgca ctgggttgct ccccccgaag 946501 ccgtctgggc gtgccgctcc ctggccgcaa cccactacgc cagcggcggc tggagcgtcg 946561 gggcggtcgc cctggtagcc ggctgggcag cacgcaactt gccggcggac accacgattg 946621 ccgcggtctt tcccgacggc ccacaacgct acttcgacac catctacaac gacgcgtact 946681 gcaacgaaca cgaactgcta ggcggacaac ctcccaccga gcccgacgag attgcctcgc 946741 cgctagacgc cgtcgtcacc cgatggacac gcagcaccac ggtgatcgat ccaacccagg 946801 tggtgtcgta atgggagcgc gcgctatatt ccgcgggttc aaccgcccga gccgggtgtt 946861 gatgatcaac cagttcggca tcaacatcgg cttctacatg ctgatgccgt acctggccga 946921 ctacctagcc gggccactgg ggctagccgc gtgggcggtg ggtctggtga tgggcgtgcg 946981 caatttctcc cagcagggca tgttcttcgt gggtggcacg ctggccgatc ggttcggcta 947041 caagccactg atcatcgccg gatgtctgat ccgcaccggc gggtttgcct tgctggtggt 947101 cgcccagtcg ctgcccagtg tgctgatcgc cgcggctgcc acgggctttg ccggcgcgct 947161 gttcaatccc gcggtgcgcg gctatctcgc ggccgaagcc ggggaacgca agatcgaagc 947221 gttcgcgatg ttcaacgtct tctaccagtc ggggatcctg ctcggcccgc tggttggatt 947281 agtattgctg gcgctggatt tccggatcac ggtgctggcc gccgccggtg tgttcggcct 947341 actcaccgtc gcgcagctgg tcgcactgcc ccaacaccgg gccgactcgg agcgcgaaaa 947401 aacatcgatc ctgcaggact ggcgggtcgt cgttcgcaac cgtccgtttc tgacgttagc 947461 cgccgccatg accggatgct atgcgctgtc gttccagatc tatctggctc tgcccatgca 947521 ggcgtcgatc ctcatgccac gcaaccaata tctcttgatt gcggcgatgt tcgcggtatc 947581 gggtctggtc gccgtcggcg ggcagctgcg catcacccgc tggttcgccg tcagatgggg 947641 ggccgagcgc agcctggtag tcggcgcgac gattttggcg gcctcgttca tcccggttgc 947701 agtcatccca aacggccagc ggttcggcgt cgccgttgcg gtcatggcat tggtgctgtc 947761 ggcgagtctg ctggcggttg cctcggcagc gttgtttcct ttcgaaatgc gtgccgtggt 947821 cgcactgtcg ggcgaccggc tggtggcgac ccactacggg ttctacagca ccatcgtggg 947881 cgtcggagtc ctcgtcggaa atctggcgat cggatcgctc atgagcgccg cgcgccgctt 947941 aaataccgat gaaattgttt ggggcggatt gattctggtg ggcatcgttg cggtggccgg 948001 gctccgtcgg ttggacacat ttacctcggg ttcccagaac atgaccggcc ggtgggctgc 948061 accccggtga cccgcgatcc acacagcccg gactgcgggc gcgagggcag ctaccgcgac 948121 accatcaccc gcccgttgac cgacctaccg gtggccggct atccgttggt gccgcgggtc 948181 gcgtcgcccc gctaccggtg cacaacgccg cagtgcgggc gtgcggtatt caatcaggat 948241 ctcgctaacg tcgaccagta cctcgttgtc aatcaactgg cgcaccaact catcgacggt 948301 tcttccctca tacccgatgc tgacaagaga tgggatgcgc gacgacatgc cgacatgacg 948361 caccatctga catcgagcct taaggaaaat caaagctaat gccgccaccc ctcggcggcc 948421 tgttcgtcga aggtgcggtc aatgcgctcg aacctgcggc ggatcgaagc gcgcgaggcc 948481 gcatgcggaa ggacgtagag gcggttggcc agaatcgcat cggctgttag ctgggcgata 948541 tcgtcgacgc ccaggttgtc gtcctgcagg gggagtggac cgggcgatcc cgtcgttgag 948601 gactgcgcgc aagccgcgcc tcggattcgt tcagagttgg caaccagatt ggtttcgacg 948661 accatcgggc agagcaccga caccccaatg ccgtcggcgg tgacctcgcg ggccagcgtc 948721 tccgccagac cgacaacccc gtacttggca acgccgtatg cgccgagtcc ggcattgggc 948781 accagcccgg caaaggacgc ggtgaacacc acatgcccgc ccgtgccctg ctcaagcaac 948841 ctcggcagga acgcttcgac cgtatggatc gagccccaca ggtcgacgtc gatcacccaa 948901 cgccagtcgt cgtgcgtcat ctccacgatc ggaccgccga caacgatgcc ggcgttgctg 948961 aatacgacat cgaagtggcc gagcaggcgg aaagcctcgt ccgcgaggtg agtgacctct 949021 tctcgatgcc ggacgtcgca catcacgccg tgcacatcga acccctcggc acgcaggtgg 949081 ttcaccgcct gccgaagtcc cggcttgtca acgtccccta gcacgactct ggctccgcgg 949141 cgggcgaact cggtgccggt agccaacccg atgccactgg caccgccagt gatgaccgca 949201 ccgcgcccgg gaaacccgtc cacagcacgc aaccctattt caggcagtca cccgcgtcga 949261 ctgcgccggg cgagcgtgat tctggcgacg ccacagcggc atgttgcgtc gcggtgttca 949321 caatcggtta cagctgcgct agtcgcggcg cagattcatg gttgatccgc aggtgcagtg 949381 tcgtgcaagg ttgtctcgac gatccaggtg ccactgtgga ggcaatcgat gacgacggat 949441 ggccgcacac cggcgatcct tgcagcccga attcggcggc ctccggcaaa tatggtgaaa 949501 gaccagcttc ggtgagtacc ggcgacattc attcgttggt gatcgcttcg gactatcggg 949561 tccctgatcc cggtagagtg tggccgctgc tgcagcgcaa caaatcggct ctggccgaca 949621 tcggcgcaca ccacgttctg atctacgcgt caacgcacga ctctggccgt gtgctggtaa 949681 tgatcggagt acgcagtcgt gagccgatcg tggaattgct ccgctcacgg gtcttcttcg 949741 actggttcga cgccatgggc gtcgacgata tcccggcggt cttcgccggc gagatcgtcg 949801 accgatttgt cgcggcgcct actacgactc agtccactcc acgggttcct ggcgttgtgg 949861 tggccgcgtt cgcgtcggtg aacaacgtgt ccaacctgac cgccgaggtc cgttctgcga 949921 tagccaggtt taccgccgcg gggattcgaa agacctgggt tttccaggct ttcgacgatg 949981 cgcacgaggt tttgatcctg caggagtttg ccgatgaggc gggcgcgcgg cagtggatcg 950041 agcatcccga cgccgccgcc gaatggatga gcggggcggg agtgggagcc tacccaccgc 950101 tgttcgtcgg ccggttcttc gacatgatgc ggatcgaggc gctgcagtga gcgcatcgct 950161 gggcactcgg cccggcccgg gtcagcgacc tcactgcggc gccatggatc ccacgagttg 950221 gccaagcagg cgggggatct cgagccgcgg caacaccacc tcgacgagca ccatccggtc 950281 ccgccgtgcc gcggcgacgg tgagggcgtc gtcgagttgg ccataggttt gggcacggaa 950341 cgcgaggtga ttggtcacac ccagcgcgct gggaagctcg gtccaattcc agctcacgat 950401 gtcgttgtac ggggccgtct cgccgtggat ggcccgttcg accgtgtaac catcgttgtt 950461 gaccaccacg atgaccgggg acagcccttc gcgggagaac gtgccgagtt cctgcacggt 950521 caattgtgcg gccccgtcgc cgatcaacag caccgtacgg cggtccggat gcgcaaccgc 950581 ggccccgact gccgcgggca gcgtgtaacc gattgagccc cacaagggtt ggccgataaa 950641 ggtcactcct tgcggcaacc ggtggtccgc catgccgtag aacgacgtcc cctggtcggc 950701 gagcaccacg tttccgggtg tgagcgctga gcaaacccgg tcccacacca tctgctgggt 950761 gagcggctca tcgcgcgccg gcatcgccgg cggcggttcg gcgggcggcg gtaccaccgg 950821 cggcgaactg attccgcgcc cggtcaggat ggtggccagc gcctgcagcg cggcactcat 950881 ttccagtggt gcgaacacct ggtcggccac gctgctctgg tattgcccga tgtcgatggt 950941 ccgggccggg tcgatccgct ggctgaagaa gccgctgacc atgtcggtga acaccactcc 951001 ggcggtcacc agcaccggcg ccccttcgat cgcggcgcgc acccgttcgg cgctggccgc 951061 gccggcgtag attcccagga agttcggcga gctctcgtcg agcaggctct tcccccacat 951121 caacgtggcg tgcggcacca cgtcggcggc caacagcgcc tcgagttctt tgacggcctg 951181 caggcgatga accaacagat cggcgagcac cgtcaactgg tggtcggcaa tgagttcgat 951241 ggcggccttg gtgaacagcg acagcgcgcg cgggctggtg ccgccggggt agcggggcaa 951301 cggcgcagcg ggcggttcag tggggaagcg tgctacgtcg ctggacagca atatatatcc 951361 tggacgcttc tgctcccgta cctcggacag cacccgatct atttctctac cggccgttgc 951421 cggcatgaga ttggcttggg cacaggtgat ttcacggctg atccggagaa agtgctcgaa 951481 gtcgccgtcg ccgagggaat gatgcaatgc ccggcgagtg ccctgggcgt ctttggtcgg 951541 gccgccaaca atgtgcacca ctggcacatg ctcggcgtaa ctgcccgcga tcgcattggt 951601 caccgagagc tcgccgaccc cgaatgtcgt taccaccgct gacatcccac gcagccgccc 951661 gtacccgtcg gcggcatacc cggcattcag ttcgttggcg ctgcccaccc accggatggt 951721 cgggtgggcc acgatgtggt cgaggaattg caggttgtag tcgccgggaa cgccgaagat 951781 ctcagagacg ccgagttcgg cgagccggtc gagtaggtag tcgccgacgg tgtagacggg 951841 atcgctgcag gcatcgctct tctggggtgt cacgaagacg accgtacgcc ggattgaggc 951901 tattcccgac tggacgccga ttcgctatcg tgcggccatg gccatcaagg agtcgcgcga 951961 catagttatc gaagcaagtc ccgaggagat cctggatgtc attgccgact tcgaagcgat 952021 gaccgaatgg tcgccagccc atcagagcgt cgaaatactc gagaccggag acgacgggcg 952081 gcccagcaag gtgaagatga aagtcaagac cgccggcatc accgacgagc aggtggtggc 952141 ctatagctgg accgacagat cagtgcggtg gacgctggtc agctccaccc agcagcgctc 952201 gcaggatgga aagtacgagt tgacacccaa gggcgacaac accctggtcc agtttgagat 952261 caccgtcgac ccgcaggtgc cactgcccgg cttcgtgctg aaacgtgcga tcaaagggac 952321 gatcgacacg gccaccgagg cgttgcgcag ccaggtgttg aaagtgaaga agggtcaata 952381 gtcgcggtga cgaccggggg gcccctggcc ggggtgaagg tcatcgaact cggtggtatc 952441 ggaccggggc cgcacgccgg gatggtgctc gccgacctgg gtgctgacgt ggtgcgggtg 952501 cgccgcccgg gtggcctgac gatgccgtcc gaagaccgcg acctgctgca ccgtgggaag 952561 cggatcgtcg acctggacgt caaaacgcaa ccgcaggcga tgctggagct ggccgccaag 952621 gccgatgtgc tgctggactg tttccggccc ggcacttgcg agcgcctcgg catcggaccc 952681 gacgactgtg cgtcggtcaa tccgcgactg atcttcgccc gcattaccgg ttggggacag 952741 gatggcccgt tggcctcgac ggcgggtcac gacatcaact acctgtcgca gaccggtgcg 952801 ctggcggcgt ttggctacgc cgaccggcct ccgatgccgc cgctaaacct ggttgccgac 952861 ttcggcggcg gctcgatgct ggtgctgctg ggcattgtgg tggccctcta cgaacgggaa 952921 cgttcgggtg tgggtcaggt cgtcgatgct gcgatggtcg acggggttag cgtgttggcg 952981 cagatgatgt ggaccatgaa ggggattggc agcctgcgcg accagcgcga atctttcctg 953041 ctcgacggcg gcgccccgtt ctaccgctgc tacgaaacgt ccgacggcaa gtacatggcc 953101 gttggggcaa tcgagccgca gttcttcgcg gcgttgctga gcgggctcgg cttgtcggcc 953161 gctgacgtgc cgactcagct cgatgtggcc ggctacccgc agatgtatga catcttcgcc 953221 gagcgatttg ccagccgaac ccgcgacgag tggacgcggg ttttcgccgg cactgacgca 953281 tgtgttacgc cggtgctggc gtggagcgaa gccgccaaca acgatcattt gaaggcacga 953341 tcgacggtga tcaccgccca tggtgtccag caggccgcgc ccgctccccg attttcccgg 953401 acaccggccg ggccggtcag gccgccgccg gccgcagcca caccgatcga cgaaatcaac 953461 tggtaaccac ggtggctgcc gaacaccgcc caccaacggc gcggcgttgc tagcgtcaac 953521 gtcagtggcc gtaaaagcat cgcgggaatt tgtcatcgac gcgcctccag aagtggtgat 953581 ggaggcgctg gcagatgtcg gcgtcctggc ttcgtggtca ccgctgcaca aacaggtgga 953641 agtgatcgac tactacccgg atggccggcc gcaccatgtg agggcaaccg tcaagattct 953701 ggggctcgtc gacaaagagg tcctcgaata tcactggggc ccggactggg tgtgctggga 953761 tgccgatcag accttccagc aacatggaca gcacatcgag tacaccgtga aacctgaggg 953821 tgtcgatagg gcccgggtgc gcttcgacat caccgtcgag ccggcgggac cgatccccgg 953881 cttcatcgtc aagcgggcaa gtgagcatgt gttggatgcc gcggcgaaag ggctgcagaa 953941 gttgatcgcg ggtgccggcg atcaaggaaa cgcgaaatcg tgacgatgtg acgggtccgc 954001 gtagcggatc gtgattgcta atttggtagc agtggctatc cgagcatcgc gcgaagtcgt 954061 catcgaagcg cctccggaag tgatcgtgga ggcgctcgcc gacatggacg ctgtgccgtc 954121 ttggtcttca gtgcacaaac gggtcgaagt cgtcgacact tactccgacg gtcgaccaca 954181 tcacgtgaag gtcaccatca aggtggcggg catcgtcgac acggagttac tggagtatca 954241 ctggggaccc gactgggtgg tgtgggatgc cgccaagacc gcgcagcaac acggccagca 954301 cggcgagtac aacctgcgcc gtgaggataa cgacaagacc cgagtgcgat tcaccctcac 954361 ggtcgaaccc tcggcgcccc tgccggcgtt ttgggtcaac attgcccgca agaagatcct 954421 ccatgcggcg acggaaggac tgcgaaagca ggtggtgggg cgccgacggt tcacgtcggg 954481 ctaggtagcg ggtcgctcgg cgagcacgct cagtcgcctg attgcctcgt cgagggtgtc 954541 gtctcgtttg cagaaggtga agcgcaccag gtggttccac acatcggctt gttgtgaggc 954601 ctgtcctgcg gcggggtcgc agaacgccga catcgggatg gcggccaccc cgactttctc 954661 cggtagcgcc gcacagaatt cggtgctgtc gtcataaccc aacgggcgcg ggtcggcgca 954721 taggaagtac gtgccgtagc tgtcgtgcac tgcgaagccg atctccgtca ggcccgctgc 954781 cagccggtcg cgccgggccc gcaacgagtt ccgaagggcc gccacccagg cgtcttcggt 954841 gtctagcgcg agggcgaccg caggctgaaa cggtgcgccg cccacatagc tcaggtactg 954901 ttttgcggcg cgcaccccgg cgatgagttc ggctgggccg caagcccatc cgattttcca 954961 gccggtgcag ttgaacatct tggccgcact ggaaatggtg atcgtgcgct cggccatgcc 955021 gtcgaaaccc gccagcggca ggtgtctggc gtggtcaaac actaggtgct cgtacacctc 955081 gtcggtgatc accacaaggt tcgccgccac cgcgatctcg gcgatggctg cgagttccgt 955141 cgcgctcagc accgcaccgg tcggattgtg cggcgagtta atgatcagcg cccgagttcg 955201 cggggtcacc gcgcgtcgca gcgcgtcggc gtctagggcg aagccgcggc catcgggcac 955261 cagcggtacg gtcacgcggt gggcgccggc catcgccacc accggcgagt aggagtcgta 955321 gaacggctcg atcagcaaca cctccgagcc cggttcgacc agtccgagca ccgctgcggc 955381 gatggcctcg gtggctccga ccgtgaccag cacctcggtc tcggggtcgt agtcgacgcc 955441 gaaatggcgc cgccgctggg cggcgatggc ccgccgtagc ggagcgcttc cagggccggg 955501 cgggtactgg ttgacgccgc cggcgatggc gtcttgggcg gcctgcagca tcttcggcgg 955561 gccgtcctcg tcgggaaagc cctgtcccag gttgaccgcg ccgatacggg tggccagcgc 955621 ggacatttcg gcgaacaccg tggtcgcata cggccgcagc cgcgacaccg tcatggcggt 955681 cgagcctatc cgggcgacga tgcgcgccgc agcgatacct tgcccaacca acaggttggc 955741 cgggggccct gttagggtgc cggtacggga cctagtcttg aagaaggatc caaaccccct 955801 tttgtggaat ttgtggaaca ggaaatcgac atgtccgaag aagccttcat ctacgaggcc 955861 atccgcaccc cgcgcggcaa acaaaagaac ggatcgttgc acgaagtcaa gccattgagc 955921 ctggtcgtcg gcctgatcga cgagctgcgc aagcgccatc ccgacctcga cgagaacctg 955981 atcagcgacg tcatcttggg ctgcgtctca ccggtgggcg accagggcgg cgacatcgcc 956041 cgcgccgcag tgctggcatc gggcatgccg gtcacctccg gcggtgtgca gctcaaccgg 956101 ttctgcgcgt ccggcctgga ggccgtcaac accgccgcgc agaaggtgcg ttcgggctgg 956161 gatgacctgg tgctggccgg cggcgtggag tcgatgagcc gggtgccgat gggctccgac 956221 ggcggcgcta tgggcctgga cccggcgacc aactacgacg tcatgttcgt cccgcagggc 956281 atcggcgccg acctgatcgc caccatcgag ggcttctccc gcgaagacgt cgacgcctac 956341 gcgctacgca gccagcaaaa ggccgccgag gcgtggtcgg gcggctactt cgccaagtcg 956401 gtggtgccgg tgcgcgacca gaacggcctg ctgatcctcg atcatgacga acacatgcgg 956461 ccggacacca ccaaggaggg tctggccaag ctgaagccgg ccttcgaagg cctggccgcg 956521 ctgggcggtt tcgacgacgt ggcgctgcag aagtaccact gggtggaaaa gatcaaccac 956581 gtacacaccg gcggcaacag ctcggggatc gtcgacggtg ccgcgctggt gatgatcggt 956641 tccgcggccg ccggcaagtt gcagggcctg actccgcggg cgcgcatcgt cgccaccgcc 956701 accagcggcg ccgacccggt gatcatgctc accggcccca ccccggccac ccgcaaggtg 956761 ctcgaccgcg ccgggctgac cgtcgacgac atcgacctgt tcgagctcaa cgaggcgttc 956821 gcgtcggtgg tgctgaagtt ccagaaggac ctcaacattc ccgacgagaa gctcaacgtc 956881 aacggtggcg ccatcgcgat gggccacccg ctgggtgcca ccggcgcgat gatcctgggc 956941 accatggtcg acgaactgga gcgccgcaac gcccgacgtg cactcatcac gctgtgcatc 957001 gggggcggca tgggtgtcgc gacgatcatc gagagggttt aacagcatgc cagacaacac 957061 aatccagtgg gacaaggatg ccgacggcat cgtcacgctg accatggacg atccctccgg 957121 gtcaaccaac gtgatgaacg aggcctacat cgagtcgatg ggcaaggccg tcgatcgcct 957181 tgtcgccgaa aaggattcga tcaccggagt ggtagtcgcc agcgcgaaga aaaccttctt 957241 cgccggcggc gacgtcaaga cgatgatcca ggccaggccc gaggacgccg gcgatgtatt 957301 caacaccgtc gagaccatca agcggcagct gcgcaccttg gagacattgg gtaagccggt 957361 cgtcgcggcc atcaacgggg cggcgttggg cggcggcctg gagatcgcgc tggcgtgtca 957421 tcaccggatc gccgccgacg tcaagggcag ccagctcggt ctgccggagg tgacgctggg 957481 tctgctgccg ggtggcggtg gggtgacccg cacggtacgg atgttcggca tccagaacgc 957541 gttcgtgagc gtgctggcgc aaggtacccg gttcaagccg gccaaggcca aggagatcgg 957601 tctggtcgac gagctggtgg caacggtcga ggagctggtg cccgccgcca aggcttggat 957661 aaaggaggag ctcaaggcca accccgacgg tgccggggtg cagccgtggg acaagaaggg 957721 ctacaagatg cccggcggca ccccgtcgtc gccgggtctg gcggcgattt tgccgtcgtt 957781 cccgtcgaac ctgcgcaagc agctcaaggg tgccccgatg ccggcgccgc gggccatcct 957841 ggccgccgcg gtcgaggggg cacaggtcga cttcgacacc gccagccgca tcgagagccg 957901 ctacttcgcg tcgttggtca ccggccaggt cgccaagaac atgatgcagg cgttcttctt 957961 cgacctgcag gccatcaatg ccggcgggtc tcggcccgaa ggcatcggca agaccccgat 958021 caagaggatc ggtgtgctgg gtgcgggcat gatgggcgcc ggcatcgcct acgtctctgc 958081 caaggccggc tatgaggtgg tactcaaaga tgtcagcctt gaggccgccg ctaaaggcaa 958141 gggctactcc gaaaagctgg aggccaaggc gctggagcgg ggccgcacca cacaggagcg 958201 cagcgacgcc ctgctggcgc gcatcacccc gaccgccgac gccgccgatt tcaagggcgt 958261 tgatttcgtg atcgaggcgg tttttgaaaa ccaggagctc aagcacaagg tgttcggcga 958321 gatcgaagac atcgtcgagc ccaacgcgat cctgggatcc aacacctcga cgctgccgat 958381 caccggtctg gcgaccggcg tcaagcggca ggaagacttt atcgggatcc acttcttctc 958441 gccggtcgac aagatgccgc tggtggagat catcaagggc gagaagactt ctgacgaggc 958501 cctggcccgg gtgttcgact acaccttggc catcggcaag accccgatcg tggtcaacga 958561 cagccgcggc tttttcacct cgcgggtcat cggcacgttc gtcaacgagg cgctggcgat 958621 gctcggtgag ggtgtcgagc cggcttctat cgagcaggcg gggtcgcagg ccgggtatcc 958681 ggcgccgccg ctgcagctgt ccgacgagct caacttagag ctgatgcaca agatcgccgt 958741 cgccacccgt aagggtgttg aggacgccgg cggcacgtac cagccgcatc cggcggaggc 958801 cgtggtggag aagatgatcg agctcggccg gtccggccgg ctgaagggcg cgggcttcta 958861 cgagtacgcc gacggcaagc gatccgggtt gtggcccggc ttgcgcgaga cgttcaagtc 958921 gggctcgtcg cagccgccgc tgcaggacat gatcgaccgc atgctgttcg ccgaggcgct 958981 ggaaacccag aagtgcctcg acgagggggt gctgacgtcg acggccgacg ccaacatcgg 959041 ctcgatcatg ggcatcggct tcccgccgtg gacaggtggc agtgcccagt tcatcgtcgg 959101 ctactccggc ccggccggta ccggtaaggc ggctttcgtg gcccggaccc gcgagctggc 959161 ggccgcctac ggcgaccgct tcctgccgcc ggagtcgctg ctaagctgag cgcgagcaga 959221 cgtaaaagcc cccgcacgct cggcgtgtcg ggggctttta cgtctgctcg cgcaacctaa 959281 attgccgggc ccagcaggtc gtcggcgtcg cggatgatgt aaccgtagcc ctgctcagct 959341 aaaaaccgct gccggtgtgc ggcgtactcg gcatccaggc tgtcgcgggc caccaccgag 959401 tagaagatgg caccgccccc gtcggccttg ggtcgcaata tccggccgag ccgttgcgcc 959461 tcttcctggc gtgagccgaa tgttcccgaa acctgtaccg ccacggcggc ttccggcaag 959521 tcgatggaga agttagccac cttggacacc acgagcgtag cgacctcgcc gcggcggaag 959581 gcgtcgaaca gtgcctcgcg ttcgctggtc cttgtcgacc cctgaatcac cggagcgccg 959641 agctcggcgc ccagctcgtc gagctgatcc aagtacgctc cgatgaccag ggtctgctca 959701 tccgggtgct tcgccagaat cgacttgacc acagcaattt tggtgtgcac cgtcgagcag 959761 atccggtagc gttcttcggg ttcggcggtg gcgtacatca tccgctcgct gtcggtcatc 959821 gtgacccgga cttccacgca ctcagctggc gcgatccagc cctgcgcctc aatgtccttc 959881 cacggcgcgt catagcgctt tggtccgata agggaaaaca cgtcgccctc gcgtccgtct 959941 tcacggatca acgtggcggt cagccccagc cgccgtttgg actgcaggtc agcggtcatc 960001 cggaagaccg gtgccggcaa caggtgcacc tcgtcataga tgatgagccc ccagtcgcgg 960061 ctgtcgaaca gttccagatg gcggtactcg cccttagtgc ggcgggtgat catctggtat 960121 gtcgagatgg tggcaggtcg gatttccttg cgttctcccg agaattcgcc gatctcattc 960181 tcggtgagcg aggtgcgcgc gaccagctct cgtttccatt gccgggccgc gacgatattg 960241 gtgaccagga tcaacgtcgt cgcgccggct ttggccattg cggccgcacc gaccagcgtc 960301 ttgccggccc cacatggcag caccaccacc ccggagccgc ccgcccagaa cgagtccgcg 960361 gccagccgct ggtaatcgcg cagctgccag ccctcctggt gcaggctgat cgggtgcgct 960421 tcaccatcga cgtagccggc gagatcctct gcgggccaac cgatcttgag cagcagctgc 960481 ttgacccggc cgcgttcgct ggggtggacg acgacggtgt cgtcatcgat gcgggcgcca 960541 agcatcggcg cgatcttctt gttgcgcagc acttcctcaa gcaccgcgcg gtccaggctc 960601 accagcgtca ggccatgggc cgggttcttg accaactgca gtcgtccgta gcgggccatg 960661 gtgtcgacga tgtcgacgag caagggttgc ggcaccgcgt agcgggagta actgaccagc 960721 gcgtcgacga cttgctcggc atcatggccg gcggcgcgag cattccacag tgccagcggt 960781 gtgatgcggt aggtgtggac atgttcgggt gcacgttcca gctcggcgaa cggcgcgatg 960841 gcggcgcgtg cagcgccggc cagttcatgg tcgacttcca acagcaccgt cttatcggac 960901 tgcactatca atggtccgtc agtcaatggc gccgctcctc ctcatcgctg cgctctgcat 960961 cgtcgccggc ggtagtcaat ggcgccgctc ctcctcatcg ctgcgctctg catcgtcgcc 961021 ggcgcggggg tcatgggctc cattatcggt cgtgggccga caccaccgac gtgatgcggt 961081 ggatggcgaa gtcacgcagt cgcccggatg acgagtcgaa cgccaccagc tggccgcccc 961141 gtagcgtgat cggtgcgacc acccgctgag tggcaacgcc ggcggcatcg aggtagctga 961201 tcaccaaggt ggcctggtcc ttggccgcgc gctgcaacag cgacatggtg accgccgggt 961261 cgacgcggac attagcgaac ggcgctgcgg tcacctcacg cagcacggca accacggctt 961321 tcaacgcctc gctattgggt ctcggcggcg gtcggtatgg ccggcgccgt tgcggtgtgg 961381 gcacccgggc gccgcgggtt cgcacgtcga caacggctcc ggtggaatct tcggcggccg 961441 gggcaaagcc cgcgccgcgc aacgtgacga ggacttcgga tatcggagcg ggggacaccg 961501 ccaccgttgg ggccagggcc cgcagtgcca gcccgtcggc ttcgggcgcc gccacgacct 961561 gggccagtag cgttgggtcc tcgcaccgca cgaacgatgc ggccatgccg atccgaagct 961621 ggccgtgccg gcgcgcgaca tcgtcgatga gatatgtaag cccttgtggt acaggagttt 961681 tagaacgatt tgcgaagaat tcctgcaacc agtcgcggga cttgccgaca tcgagggcat 961741 gccggatcga ctgctcgctg acgcggtaca ccatcgccgt gccggccgat tccacggtgg 961801 cgacggtggt caggtcgtcg gccagttcgc gctgcagcgg ccctggcacc acgacggtca 961861 ggtcggcctg caccaggaag tgatcgatgg gcttgggcag cgcccgagcc atcacgccga 961921 ccgcggcggc aggggcagta gctggctcta aggcctcgtc caacagtgcg cgagcaggcg 961981 tgctgatcgc cccgcgcccc accagaccca gcgcatggcc ctctgtcagc agatccgcga 962041 taggcgcagg ttgcaatcgc ctggcccaac gtgggcggcg ccagatcagt gtcgccgacg 962101 cccgggacgc atcgacgccg gcgccggcgg gcagctcggc gagcatgcct agcaatagcc 962161 ggcgatccag tggggccgcc gtggagaaca gcgaatccga cagggcgcca tagggtttgg 962221 cgtcgggtcc gcgggtaccg attaacgccg gccggcccgg aaggtcaagc caggcgctgg 962281 ccagcaagtg ccaacgctcg gcgggtgaca tcgtggcgaa tcgatcggcg gccaccgttg 962341 gcgcccaaaa aggtccgtca ctgtggggcg gttcgggatc gggcatgccg ctggcgatca 962401 gtccagccgc ggccgcaatc tcgaggatta ggcccagccg cggctcgtcg attcccgttg 962461 ccttggccag ccgcttgaat tcacgaaccc ccagtccgcc gctgcgtagt tcggcaaccg 962521 gtgtggcgcc gaggttttcg agcagtacgt cgacttcacg cagtaggtcg atgacggctc 962581 cggccgccgc agcgtcggcg tcgtcgggtg tggtggtgga aactaccggg tccggcgcgg 962641 tcaactccat cggaccgggt tgttcgccgc gcagcacctg cccgacgtgg cggggcaaga 962701 tcaccgtttc ggcatcgatt cgtcgcagca agcccatcgc cagcaaccgc ggcacgggtc 962761 gatcagatgg cgcgccgggt gcggcgtcgc gagtgcgccc cacgggtgac ccttggagca 962821 atttgtccag aacgtcacgc tgcgcggggt cgaggccggc gatcaggtcg gcgagctgat 962881 ccccggaacg cgaacttccc tcgagggtga cctggccggg atgccacggc aacgccgtac 962941 ctgcgtctgt cgccacccgg actgcggtct cgccccaggc cagggcacgt tgtttaaggt 963001 cagccagcgc gccaagcacg tcggcttggg cggcgcggtc gccgatcact gccagcagcc 963061 ggacgatcgg caccggtgcg gtatctgcct gcagcaccag cagtgcgtcg aacaccgcca 963121 gccgcaggaa gtcgagctcg tcggtggccg ccttgaccga ctggcgggcc tgggcacggg 963181 cggccagcgc ggcgatgctg ccgggtggtg gctgggcaag gtcgggccgc agctccaaca 963241 gctgggtcag ccgttcatcg ggcaaggcgg ccagccagga ccccagcggg atatccgggg 963301 tgtgttcggt cattgctgat cagcgtaggc cggaccagcc ttgtggcgtg ggcgggtgca 963361 agacctgtca gaatggtttc gtggctgaca ttgctgaagg taaggcacgc aagaccaggt 963421 acgtggacca tggttggccg accaccgatc cagacgacca tgcggtgagc gaactcgtga 963481 ccgaccgcac gggtgcgcta tcacccttcg gtgaattgac gttcccggta ccgtccgacg 963541 acctgcccta catccacccg gtgaccgtca tcaatcggta agccgccagg atggccaggg 963601 cttctggggc atccgactac cgctcgggcg agctgtcgca ccaggatgag cgggggcagc 963661 gcacatggtc gatatcaccg agaaggcaac cacgaagcga acagccgttg ccgcgggcat 963721 cttacgtacc tcggcgcagg tggtggcgct gatctcgact ggcgggctgc ccaaagggga 963781 tgcgctggcc accgcgcggg tggcgggcat tatggcggcc aagcgcacca gcgacctgat 963841 cccgctgtgc catcaactcg cgcttaccgg agtcgacgtc gatttcaccg tcggccagtt 963901 ggatatcgag atcacagcga cggtacgcag taccgaccga acgggcgtcg agatggaagc 963961 gctgaccgct gtcagcgtgg ccgccctcac gctctacgac atgatcaagg cggtcgatcc 964021 gggcgcgctt atcgatgaca tccgggtgct ccacaaagaa ggcggtcgtc gcgggacctg 964081 gacgaggcga tgagcacccg gtccgctcga attgtcgttg tgtcgagccg cgcggcggcc 964141 ggtgtgtata ccgatgattg cgggccgatt atcgctggat ggcttgaaca gcatgggttt 964201 tcgtccgtcc agccgcaggt ggttgccgac gggaacccag tcggcgaggc gctacacgac 964261 gcggtcaacg ccggagtcga cgtgatcatc acttccggcg gcaccggtat ctcgcccacc 964321 gataccacgc ccgaacacac ggtcgccgtg ctggactacg tcattcccgg gctggccgac 964381 gcgatccgcc gctccggcct gcccaaggtg ccgacatcgg tgctgtcgcg cggggtgtgc 964441 ggcgtggctg ggcggaccct gatcatcaat ctgccgggat cgcctggagg tgtacgtgac 964501 ggcctcgggg tgctcgccga tgtgctggac catgctctcg agcagatcgc cggtggagat 964561 cacccgcgat gacgcaggtc ctgcgcgccg cgctgacaga tcaaccgatc tttctggccg 964621 agcacgagga gctggtgagc catcggtcgg ctggcgccat tgtcgggttc gtcggaatga 964681 tccgcgaccg tgacggtgga cggggggtgt tgcggctgga gtactccgcg cacccgtcgg 964741 ccgcacaggt ccttgcggat ttggtggcgg aggtagctga agagtccagt ggcgtgcgtg 964801 cggtggcggc cagccaccgg atcggcgtct tgcaggtcgg ggaggccgcc ctggtggcgg 964861 cggttgccgc cgatcaccgg cgggcggcgt ttggcacctg tgcgcacctg gtggagacca 964921 tcaaggcgcg gcttcccgtg tggaagcacc agttcttcga ggacggtacc gacgaatggg 964981 tgggttcggt ttaaagtccg gcctcagccc gtcagccgat gacgtacggc tgtgcgagcg 965041 agtccagcgc atcgttgccg cagacgtcct gggcccgaat cgcctgccac agcttcttcg 965101 tataggcgat gttcgagact tggggcgttt cggcgggcgc ctcggtgacg tcgccgggcg 965161 gcggtgcgtc agctggttgg gggtcgggct cggggagttc caaatcggtg gcaaggccaa 965221 ccgggccgcc tggagctgtg gcgggctgat cgcccggcgc ggtttgctcg ttcaccgcag 965281 cgggtggtgc caggtcggcg ggcgcgggtg gcgccagttc ggcgggcgcg ggtggcgcca 965341 ggtcggcggg cgcgggtggc gccaggtcgg cggacgcggg tgccagatcg gcgggtggcg 965401 ccagttcggc gggagctgcc gggaggggtt cacccagcgg cgcgggcagg tcgtttacgg 965461 caagttccac gggtggtgcc aggtcggcgg gcgcgggtgg cgccaggtcg gcgggcgcgg 965521 gtggcgccag gtcggcgggc ggcggggcca gcggtgctgg ttcgccgttg accgcggccg 965581 cgtccaacgg agcgtccatc gctgccgaag cgggaagcac ttcgcggggt gttgcgttcg 965641 ataacccgcg gccgcacacc ggccaggcgc cgcgaccctg ggtggccagc acccgctcac 965701 cgacggcaat ctgctgctcc cggctggcca gctgagccga cggggcgaac tcgccgccac 965761 catgtgcggc ccaggtgctt tgagtgaact gcaagccacc gaggtaaccg ttgccggtgt 965821 tgatcgacca gttgccgccc gactcgcagc gggccacctg atcccattcc ccgtcggtgg 965881 ccgcggtcgc ctgagcggcc atggcgatgc cgccgccacc gagtactgcg ccggtaaagg 965941 cgatcttggc gacgctgacg ttggatgtgg tgggcttacg gtggcgtcca ctcatacgtt 966001 aggtaattcc tctcggtaca cgcctacgag gtcagctgtc gggttcgggt tggattcgcc 966061 gtggagagga tcacccggcc gcggtcgtac atcggcgaac gacgttggct tcaccccaag 966121 gagccgtatg cggctccggt ccgatctcgg cggacctggt gggtcccccg cctccatccg 966181 cggtcggaat ccctcgccca ctggatggag ttcggcgtgc tatcggcgag ggagggcacg 966241 tcattttggg ttaggttgac gagcctcccg agacggtagc ggtttcaggc gattccgtca 966301 cgtttaagaa aagtcggcgt ttccgtcaca atcgccggca agaacgccaa gaaatatagg 966361 catttgcgca ggtagtaagc cctcgcaatc ggagcgtgtc cgccccgtta tcgttccgtt 966421 atgtgggtaa tgtcacatgg ccttagccgc cggcgaaagg gggtagtacg tcaatcgtgt 966481 cgccggcgga taacgcgacg gcgtcatctc ggacgacaat cccgtcgcgc aggtaggagc 966541 atcgactcaa caccgtcgcg aggcgaacat cgcggaccga caggccgtct atcagctcgg 966601 cgactgtggc gccagatcgc agggtgactt tctccgaccc ggcaccagcg gccgctcggg 966661 cggccgcgaa gtagcggaca gtcacctgaa ttccggcgga ttcgtcggac acctgcgtca 966721 ccggttagcc accgatcgcg ctcatcgggc ggtcgggctg aatgaaatcg ggggcgttga 966781 tgccgtggcc ggcgggtttg ctccacatcg cggcacgcca tgccgcctcg atcgcgtcgt 966841 cgtcagcacc gccgcgcagt aggcggcgca ggtcggtctc ctcggtggag aacagacagc 966901 tgcggatctg gccatcggcg gtcagccggg tgcggtcgca cgtcgaacag aaggcgtgcg 966961 acaccgaggc gatgacaccg aaccgtccgc gtggcgtgtt cggtccggcg tcgaccagcc 967021 agagttcggc cggggccgaa ccgcgcggtg ccgggtcggg ccgtagccgg aagtggggcc 967081 gcagcgccgc cagcacgtcg tcggcgctca gtgcgatgtt ccgccgccag ctatgccccg 967141 cgtccagcgg catctgctcg atgactcgca attgataacc gcgctctagg cagaacctca 967201 gcaggtcgac gacatcctcg cggccggtcg tggggtcgag gacggcgttc accttgacgg 967261 gtgtcaaccc ggctgccttg gcggcggcca agccggccag cacatgcgca agccggtccc 967321 gacgggtgat agcagcgaag tgggcgcggt cgatgctatc cagcgagacg ttgacccggt 967381 ccaggcccgc ttcggccagg gcgcccgccc gccgcgccag tcccaccccg ttggtggtca 967441 gcgagatctc cgggcgcggc cgcagcctag ctgtcgctgc gaccacctcg tcgaggtggt 967501 gggccaatag cggctcgccg ccggtgaacc gcacgctggt gacgccgagc cgagttaccg 967561 cgatgtgtat cagcctggcc agttcgtcgg gccgcagcag ttgctcgccg ggcagccacc 967621 tcagccctcg ctcaggcatg cagtagctgc accgcaggtt gcagcggtcg gttagcgaca 967681 cccgcagatc gttggcgacc cggccgaacg tgtccaccaa agggccagtg gtgggtacga 967741 cccgcgggtc ggcaatgccg ttggtgcggc tgcgcagcgc cggcatgccc agcgcggtca 967801 gtgtcatgtg ggcacctgtg agttgacccc gacgatgtcc ttgcccagcg gcaccagcga 967861 caccgggatc agtttcaagt tggccagtgc tagcggaatg ccgatgatcg tgactgccat 967921 tgccgcggca ctcaccaaat gcccgagggc cagccagatc ccgaacagca gcacccagat 967981 gacgttgctg atcaaggccc cggtcccggc ggttggcttt tcgacgatcg tccggccgaa 968041 cggccacaac gcgtacgacg cgatgcgcag cgccgcgaag ccaaacggaa tggtgatgat 968101 gagcaggaag cagacaagcg acgccagcag gtacccgagg gccagccaga ggccaccgaa 968161 caccaaccag ataacgttca ggattagtcg catatcgcct ccagcggtag cgcaagccta 968221 ccgcgtgagg ggtaagcagg ggtgctcggc ggccgacgat ccgagtagga tcttcagatc 968281 gtcatcgcgt cccgcgcagg cgggacgcgt cttctgttgc caatccgagc gatccgtcag 968341 acaagcaggt gagaccagtg ccgaccggca aggtgaagtg gtacgacccc gacaaggggt 968401 tcggcttcct gtcacaggag ggtggcgagg atgtctacgt ccgctcctcg gcgttgccca 968461 cgggtgtcga ggcactcaaa gccgggcagc gggtggaatt tggcatcgcc tccgggcggc 968521 gcggaccgca ggcattgagt ctcagattga tcgaaccgcc gcccagcctc tcccggccgc 968581 gccgtgagcc ggcggccgag cacaagcaca gccccgatga gctgcacggc atggtcgagg 968641 acatgatcac gttgctggaa agcaccgtgc agccggagct gcgtaagggg cgctacccgg 968701 atcgcaagac tgctcgccgg gtcgccgagg ttgtccgggc ggtggcgcgg gagttcgagt 968761 cctaacgggg tcgggtggtg cgctggccca attgcgccga gctggcaacg ccgcgccgtt 968821 ccagggtcac ggtcggatcg attccgccgc actcggtttg acgagacgac gaccggctag 968881 cagttagccg ggttggccgg gttggccggc gctgcccggg ctgccgccca tgccgggatt 968941 gccgtcggtg gttttcccgt cgccgcccgt gccgccctca ccgcctgtgc cgccggcgcc 969001 acccgcgccg cccgctcctc cggcaccaac gccaccgtcg ctgcccgcgc ggccaatcag 969061 cccgccggac ccgcccttgc cgccgagggc gccgggggcg ccgccgttgc cgccgttgcc 969121 gccggtgttg ccgttaccgc caaacgctgc gcccggacca ctgttggtgc cgccaccacc 969181 ggcgcctccg gcacctccag caccaccgga accaccagta ccaccggcac cggccgtgcc 969241 gccgacacca ccggcgccgc cgttgccgat gagcagcccg ccagcacccc cgttaccgcc 969301 ctggccgccg ataccgcctt cgccaccggt gccgagggcg ctcgcgccgt caccgcccac 969361 accgccatcg cccccgaacg ccttgcgtga actgccggcg ctgtcggtgt tggcggcacc 969421 gccgtcgccc ccggcgccgc cgccgccggg ggcaccgcta ccaccggtac caccgactcc 969481 gcccgcgccg ccggcgccgc cgtcgccgat cagcagcccg ccgcggccac cggcgccccc 969541 ggttccgccc gctccgccgg taccgccgga agcgaagctg tcgaagccgt tgccgccgct 969601 gccgccgtta ccggcttccg cgttgctggg agaattggtg cccacctggg catccccgcc 969661 ttcgccgccg gcgcccccca caccgccggg ggcctgggcg ttcccgccgg caccgccgtt 969721 acctcctatg gctgcattgc cgatggaagc ggcgctgccg ccggcgccgc cagcgccgcc 969781 ataggcgccc tcaccccctt tgcctccctc accgcccacg gcgtcgccgt tgacggttga 969841 gacggcgttc ccgccagacc cgccagcgcc gccggacgtc atcgcgtcgc cggcatcgcc 969901 ggggccaccg ttaccgccta tggcgttgcc gcccagcgtc tggtcggtgc cggtactgcc 969961 ggcatcagcg gtgccggtgg gtgtggggtt gacgccgtct gccccggctg cccccatacc 970021 gccctgcccg cccgcgccgc cagaaccgaa caggtaggcg ttgccgccgg cacctcctgc 970081 tgcgcccata ccgccattga cgccggccac cccggtcccc ccgagcccgc cggccccgcc 970141 attgccgtat aaccaccccc cgttgccgcc gacgccgccg accgcgccgg cccctccggc 970201 ccccccggtc ccgccgatgc cgatcaaccc ggcgctgccg ccggctccgc cggcttggcc 970261 gaccccgccg gacccgccat tgccgccgtt gccatacaac aagccacccg gcccgccggc 970321 ctgcccggta cccggcgcac cattggtgcc gttgccgatc agggggcgcc caagcagcgt 970381 ctgcgtgggc gcgttgatca cattcaacgc ttgctgcagc ggggaggcat ttgcggcctc 970441 ggcgctacca tatgcgcccg cagccgtgct caaggcctgg ataaaccgtt catgaaacgc 970501 ggccgcctgc gtgccgagtg tctgataggc ctgggcgtgc ccggaaaaca gcgacgccac 970561 cgctgccgac acctcgtcgg cgcccgcggc cagcactccg gtggtcgggg ccagcgccgc 970621 ggcattggcc gcgctcaacg tcgagccgat ctgcgccaaa ttgtttgctg ccgctgccac 970681 catctccggc gtcgccaata catacgacat cgctgtcctc ccgcagggtc ttcgttgacc 970741 gatcggctgt tactaacgtt agcgcgaacg cgggtcggcg tctccagttt ctatttcttg 970801 acatggaaaa acggcggccc cgaccctgcc tcagcgtcgc agccgtcgtt ggcggcgagc 970861 accggtgacc gtgactttgg tagcggcccg tccgcagtgg gtgccacgta gtattcggac 970921 agataggtag tggtaggcaa ccttcgtgat tcgtcagcga ggaggcggcg atggcacagc 970981 aaactcaggt caccgaggag caagcgcggg cccttgccga ggaatctcgc gaaagtggtt 971041 gggataaacc gtccttcgcc aaagaactct ttctgggccg ctttccctta gggctcatac 971101 acccatttcc caagccgtcg gacgccgagg aggcccgaac cgaggcgttt ctggtcaaac 971161 tgcgggaatt cctcgacacc gtggacggca gcgtcatcga gcgtgctgcc cagatccccg 971221 acgagtacgt gaaaggcctg gccgagctgg gctgtttcgg cttgaagatt ccgtccgagt 971281 acggcgggtt gaacatgtcg caagtcgcct acaaccgcgt gctgatgatg gtcacgacgg 971341 ttcattccag tcttggcgcg ttgttgtcgg cgcatcagtc gatcggggta cctgaaccgc 971401 tcaagcttgc cgggactgcg gaacagaagc ggcggttcct accgcggtgt gcggccggcg 971461 cgatatcggc ctttttacta accgaacccg atgtgggctc cgatccggcg cgcatggcat 971521 cgacggcgac gccgatcgat gacggccagg cttacgagct tgagggtgtg aagttgtgga 971581 ccaccaacgg tgtggtagcg gacctgctag tggttatggc gcgggtaccg cgcagtgaag 971641 ggcaccgagg gggaatcagc gcctttgtcg tcgaggctga ttcgcccggg atcaccgtgg 971701 agcggcgcaa caagttcatg ggactgcgtg gcatcgaaaa cggcgtgacc cggcttcatc 971761 gcgtcagggt gcccaaagac aacttgatcg gcagggaagg cgacggtctg aagatcgcgc 971821 tgaccacact caacgccgga cggctgtccc taccggcgat cgcaaccgga gttgcgaaac 971881 aggcgctgaa gatagcgcgg gaatggtccg tcgagcgagt gcaatggggc aagccggttg 971941 gccaacatga agcggtagcc agcaagatct cgttcattgc cgccaccaat tacgcgctcg 972001 atgcggtggt cgagctgtcc agtcagatgg ccgacgaagg ccgcaacgac atccggatcg 972061 aggctgcgct ggctaaattg tggtccagtg agatggcctg cctggttggc gatgagttgc 972121 tacagatccg cggtggccgc ggatacgaga ccgccgaatc cctcgccgcg cgcggtgagc 972181 gggcggtacc agtggagcag atggtgcggg acctgcggat caaccggatc ttcgaagggt 972241 ccagtgagat catgcggctg ctcatcgcgc gtgaagcggt cgacgcgcac ctcactgccg 972301 cgggtgatct ggcgaaccct aaggccgatc tgcggcagaa ggccgcggcg gcggccggcg 972361 ccagcggatt ctacgcgaag tggttgccga agctggtttt cggcgaaggc caactaccca 972421 cgacgtaccg cgagttcggc gccctggcga cacatctgcg ttttgtcgaa cgctcgtcac 972481 gcaaattggc ccgcaacacc ttctacggga tggcgcgctg gcaggccagc ctggagaaaa 972541 agcaagggtt cctcggccgc atcgtggata tcggcgccga gctattcgcc atctccgcgg 972601 cgtgtgtgcg cgccgaggcg cagcgaacgg ccgatccggt cgagggtgag caggcatacg 972661 aactggccga ggcgttctgc cagcaggcca cgttgcgggt ggaggcgctg ttcgacgcgt 972721 tgtggtccaa caccgacagc atcgacgttc ggctggcaaa cgatgtgctg gagggccgct 972781 acacctggct ggagcaaggg atactcgatc agtccgaagg caccggaccg tggatcgcgt 972841 cctgggaacc gggtccatcc accgaggcca atctggctcg gcggttcttg acggtgtcgc 972901 catcgagcga agcgaaactt tagggcgccc gcgtggccgg tcacgtccgc gggggaccgc 972961 ccgagtctcg tcgggtacca cgctggcgcg tatcgcgtct gggtgcaggt tctattccat 973021 gtcgtcgaca aacagcgcca tcgatgcggt gaatccgtgc agggcattgc ggcccgcgat 973081 cgggccgatc tctccggcgg cgaagaagcc ggcaagcgga attccgccta ggagttcctc 973141 gatcgtcgac gcgtcgtggt cggcgacccc gaacatccgc cgcccccgcc cgttgcaggt 973201 gaacaacagc gctccagccg cgcgtccggg cagccgcgcc gcggcccgct ccacggtcag 973261 gcgtaggtcc ttgtcggccc cggccgcgtc acggacctgg aactgcatgg tggcgccgac 973321 ctggacaacc tcgtcgatct cgatcgaccc ggtcgacggg tcggcgccga gcagcccgcg 973381 gatcacgaaa tcgccctgac ccggagccgc caggtgctcg tcgacgacga tcccgatctg 973441 taggccgtgg ctgacgagtg ccctttcgtc gggcgacagc ccctcgacga tctcacgcag 973501 tcgctgcaac ggcggacggc cgccgagctc ggtgatcagt atgccgtccg cgccggtgac 973561 gatgtatggg tagccgatcg gccggcaacc ctgcgacacg accgggacac cgcgcatccc 973621 gggcaggcgc acgccgacga cgccggaggt gagcacgtcg tgatcgcgga acagccgggt 973681 gtcgccccgc cggcgcccgc cgctcaccac gccgcccacg acggcggtgc ccggcaggtc 973741 ggtgttgggg tgctcgatga gcaggttcga cgggaatgtg tacgggtccg gcagcagcag 973801 atgcagatcc cgggcggtgc ggtcgaaccg ataaccggtg atcagggcac ccgagccggt 973861 acggacaaag tccagctgga atgtctcggc ggccaagccg gacgccagcc acaccaccac 973921 cgcgggctcg tcctcgatct cgtggcggcc ggcgacgatg gcctgggcga tgcaaccgac 973981 aagcgcgggc ggatcgatca tctgcagcac cgcgctcagg acgtcggcag cccggtcggt 974041 gtgtgcacgc gatccaagca acaccgccag cgacggcgcc tcacccgcca gctcgtcgcg 974101 cgcctggccc gcagcctcca ccgcggcctg ccgcgcatcg ggcgtggtgc aaaccccgac 974161 tccgatccgc acagttccat gatgcgccga tgtgccccgg gtgtcggcgg ctcttcggac 974221 cgttggcgcc gaccgcgctt aagcgcggtc ggccgtcgag ccgcggcctc gtcaaaagat 974281 aaggcgcacc gaccattccg cgtgcggaac gtcgcgtagt tcacccgagt ggtcgaccac 974341 caacgtcagc aactgcacga caatcccggt cagccgcccg cgctgcgggt cgacagtggg 974401 gatggtgacc gccaaccggg tgtccggccg aaacaaggtg ctggtggtgt tggcggggtc 974461 ctggtatacc tgcagcaaac gccacggcgc ccgggaaatg acttcgggta ccgagagctg 974521 cacgggatag cgttcgctta ccggcaattc gccctgcgcc tgcggggtct gacagtcgtc 974581 gaggtcgacc acgttgcagt acagataggg ccccacgcgg gtcaggtgcc cgtgcgagta 974641 agcgctgatc tcgggttgct gcggaccgtg tccgcgtact agcagccatg caccggcccc 974701 ggccgccacc gagagcagaa tcaccaggat caccggcagc gttgcgacac cgcgcttcac 974761 tgcggcgcca ccgccgcacc acgacgggtg gtttcttgct cggccatcac gggccgatta 974821 ccgcccaggc cagggatcag cgaatcgccg cggaagctga cgatggtctg agccagaccc 974881 aggatcagca gcgcgctcac cgcagtgaag cccacccaca gctcggtgta caccaacacg 974941 cccaccgcgc cgcccagcac ccaggccagc tgaagagtcg actcggaacg cccaaacccc 975001 gatgcccgcg actcctcggg caggtcgtgc tgcaacgagg cgtccagcga ggctttagca 975061 atggcactgg accctgccgt gatcagggtg gcaatcgctg tcgctgccag gctgccggcc 975121 accgcggccg cgatggctaa cacggtaact agcacggtgc agcgcaccac cagcacagct 975181 ggcctgccta gctgcaggcg tgcgctggtg aaattgccgg cgaagttgcc gaccgcggcc 975241 gccgcgccga tcaggcccag catgcccaat tgcacccacc cgttggcttc gtgcgccttg 975301 gcgacaaacg ccggatacaa gaacagaaag ccgaccatca ccttgatggt gcagttaccc 975361 cacagggagg taatgatgtt gcggcccaac ggttgtcgga gtgttccgcc gaggttcttg 975421 acttcctccg gccagcgtcg ccgtagtctg cccctatccc ggtggtagct caatgtggcc 975481 gggacctcac cgctggtcac ctcgacccag cgcggaatgc gcatcgacag cgaagcgcca 975541 gcgatggtga tcgcgacgac gacgaacaac gcgcccggca gctggaacag gtgggtgcag 975601 acgaattcga ctccggccgc aatcgcgcca ccagcgatgg tgccgccgag caggccgaac 975661 acggtcagcc gtgagttgac ccggaccaag tcgatggttg gcggcatcac cctcggtgtc 975721 actgcgctgc gcagcacgct gaacgacttc gagaacacca tcatggccag cgcacaggga 975781 tagagcaccc atgacgggaa gctgccggtg gcgccgtcgt agttcatgat cagcaccacc 975841 gccaacgcgg tccgaagtcc gaatgacagc gccaaggcga cgcgacggcc atgctgcagc 975901 cggtcgagtg ccggaccgat gagtggagcg atcacggcga acggcgcgat ggtgatcaac 975961 aggtacaagg cgaccctgga cttgctctcc ccgctggcgg ccgcaaagaa tagtgtgttt 976021 gccagtgcta ccgccattgc cgagtcgacc gcgaagttcg ccattaccgg ccaggtcaat 976081 gccgtcagtc cagacttgtc ggcgccgtct gcggtagcgg cccggtgcac cagcaagtac 976141 atccgagaac ccatttcgcg gctgcgcatg gccgcggcgc gggtgacggt gatccgctcg 976201 cccgcccttg tagtccgtgg cggtacgcgg ctgcgttcag gttcgggctg ctcgcccagc 976261 ggcgggagat agcggttggc actgggcatc ggcggaggtc gacgcgatcg gcgatagttg 976321 gcgtcgccgg gagggtagtt ggccatgcca gggtgcccgt tgaccgatcc gtttcgggtc 976381 cggcggcccg gggttggggc catccggccc ggatgatcac ctcgccgtcc ggacacaaat 976441 caattctgtc ctatccggac tcctggcgta gccaaccggg tgtggcttgc cggccgtgtc 976501 ttccggcagt attggaagcg cgttacagag aggggacagc gtgaccgggc ccaccgagga 976561 gtctgccgtg gcgactgtgg ccgactggcc cgaggggtta gcggcggtgc tcaggggtgc 976621 ggccgaccaa gccagggccg ccgttgtgga gttcagcggc ccggaggcgg tgggagacta 976681 cctgggcgtc agctacgagg atggcaacgc cgccacccac cggttcatcg cgcatctgcc 976741 tggctaccag ggatggcaat gggccgtcgt ggtggcgagc tattccggtg cggaccatgc 976801 cacgatcagc gaggtggtgc tggtcccggg gcctaccgca ctgctggcgc cggattgggt 976861 gccgtgggag caacgggtgc ggccgggaga cttgagcccc ggagatctgc tggcgccggc 976921 gaaggatgat ccgcggctgg ttccgggtta caccgccagt ggtgatgcgc aggttgacga 976981 gaccgccgca gagatcgggt tgggtcggcg ctgggtgatg agcgcctggg gtcgcgccca 977041 gtcggcccaa cggtggcacg acggcgacta tggtcccggc tctgctatgg cgcggtcgac 977101 gaaacgcgtc tgccgcgact gcggtttctt cctgccgctg gccgggtcgc tgggcgcaat 977161 gttcggggta tgtggtaacg aactgtccgc tgacgggcat gttgtcgata ggcaatacgg 977221 ctgtggcgcc cattccgaca ccactgcgcc ggccggtggc agcacaccca tttatgagcc 977281 gtacgacgac ggtgtgctcg acatcatcga gaagccggct gaatcatagg ttttctctca 977341 cccgctgttc cctacttttt ttgggggggg gcaccagtcg aagaaacccg actgattatc 977401 acccgtattg aacactcccg agctgttgtc gcccgagttc gccacacctg aagtctggag 977461 ggtgcccgta ttggccaagc ccgaggtgaa tctgcccacg ttgaagaagc ccgagttaac 977521 gcggtgcctg cattctggaa gccggagcta gtgtcgctcg cgtttccgaa gccggagctg 977581 ccgttgccca agttctggaa gccggagcca ctattgccgg agttaaagaa gcccgagtga 977641 ccggtgcccg tgttgccgaa gcccgagttc gcgacgcctt gagtgaccgg gctgccgatg 977701 ccggtgttca ggtcccccga gttgaagccg cccgtgttgg tgtcgcccga attaccccat 977761 cccgtgttgt cgtcgccggc gtttccgaag ccgaggtttc cagaaccttc gtttccactg 977821 ccgatgttga ggaagccggc atttccgctg ccgaagttgg tgttgccgtc gtttccgttg 977881 ccgaagttga agaagccggc gtttccgctg cccaggtttg cattgccgat gttgccgttg 977941 ccgaggttgg cgttgccgat attcccgatg ccgaagttgt agtcgccggt gttgccgctg 978001 cccacgttgt tgttgccgat gttgccgatg cccaagaagt tccccacgcc gatgttctcg 978061 acgcccaggg ccgggatcgc gagcgctgct gggaccccca ccgcaccggc cggcgcggcg 978121 gtcaccacgg gcggcaagac actcagcagc tgctgccacg gggtcaactg cgaagccacc 978181 gtcgaggctc caccgtgata acccaccatc gccgccacat cctgtgccca caactgctca 978241 taggacgcct cggtcgctgc gatcgctggc aaattctgcc caaacagatt cgagagcacc 978301 aacgacagca actggttgcg attggccgcc accaatgctg gatgtgcggt cgctgcccgc 978361 gcggcctcat atactgcggc cgcagccttg gccccggcag ccgccccctc agcgcgtgcc 978421 gtcgccgcgt tcagccaact cagatacggt gcggccgcgg ccgccatcgc ggccgccgcc 978481 ggaccctgcc acgccgaacc cggcccagcc gttaggcctg agatcagcaa cgaaaacgag 978541 gccgctgcca tccccagctc ggcggccagc ccatcccagg ccaccgccgc cgccagcatc 978601 ggggccggcc ccgcaccggc gtagatccgc gccgaattaa cctccggcgg cagcaccatg 978661 aaattcatca cgccatccct tctcagctgg ccacccccgg cctagccacc acgacggcgg 978721 gacccggctg ccgcgatccg cgccggcggg cctcggtcga ctacagtggc gcgatcgctc 978781 gacaacttga gcaccttggc aaacgacggt atgtccaatc gcggcacatt gtcggggttt 978841 tcatcgaaat cctgtcgcca accccgacag ccgggttccg ggaagccggg tgtcgcagtg 978901 gtttaggtgt cgacgttgaa cacccgggca ggcaaccggc cgtggctatt tcgggtcgag 978961 ataggtttcg agtccggctt gtgcgccgcg tgcgccacgg cgggcagcgg cgagctgcca 979021 gacgaagatg gtcgtgccga gcaaccctgt tgccagcccg gccactgtca ccggacgcca 979081 acttgcgagg ccaggcacga cgaatgcggc caccgcggcg accagccagg cgagcgcgcc 979141 gaccgcgatc accggccaca cctcgagcag cacgggtggt agcggtggcg gctcgcgaat 979201 ctgactattt tcgacgctca tcccgagtca acatagcgcg gcgatgatgc gtcggcgaac 979261 ggcccggggt gggtggcttc cgcaccagcg ggaggtacca ccacctgctg gtgggtcgtc 979321 ggccggcaat gggtggaacc gaaatcgtcg ttcgccgttt cagatgccct agtctgaact 979381 tccgttgtaa cctcagctgt gcttgacagc gatgcgcggc tggccagcga cttgtcattg 979441 gcggtcatgc ggctctcccg ccaactgcgg tttcggaacc cgtcatcgcc ggtctcgctg 979501 tcccagctct cagcgttgac gacgctggcc aatgagggcg cgatgacccc gggtgcgttg 979561 gcgattcgtg aacgggtccg gccaccgtcg atgaccaggg tgatcgcctc attggccgac 979621 atgggttttg tagaccgcgc cccacacccc atcgacggtc ggcaggtgct ggtctcggtg 979681 tcggaatcgg gcgccgaatt ggtcaaggcg gcacggcggg cccggcagga gtggctggct 979741 gagcggctcg cgacgctgaa ccgcagcgag cgtgacatcc tgcgcagcgc cgccgatctg 979801 atgctggctc tggtcgacga aagcccgtga ccgaaggccg ttgtgcccag caccccgacg 979861 gcctcgatgt tcaggacgtc tgcgatcccg acgacccacg gctcgacgat ttccgtgacc 979921 tgaacagcat cgaccgtcgt cccgatctgc cgaccggcaa ggcgttggtg atcgccgagg 979981 gtgtgctggt ggtgcagcgc atgctggcct cacggttcac gccgctggcg ctgttcggca 980041 ccgaccgccg gctggccgag ctcaaggatg atctggccgg tgtcggcgcg ccgtactatc 980101 gagcgtcggc tgatgtcatg gcacgggtga tcggcttcca tctcaatcgt ggggtgttgg 980161 cagccgcgcg ccgggtgccg gagccgagcg ttgctcaggt ggtcgccggg gcgcgcaccg 980221 tcgcagtgtt ggaaggcgtt aacgaccatg agaacctggg ctcgatcttc cgcaacgcgg 980281 cagggctgag cgtggacgcg gtagtgttcg gcaccggctg cgctgatccg ctctaccgtc 980341 gtgcggtccg ggtatccatg ggacacgcgt tattggtgcc atatgcacgc gcggccgact 980401 ggcccaccga acttatgacg ttgaaagaga gcggctttcg actgttggcg atgaccccac 980461 acggcaacgc gtgcaaacta ccggaggcca tcgccgcggt gtcgcacgaa cggattgcgc 980521 tactggtggg cgcggagggc ccgggcctaa cggcggccgc actgcggatt agcgatgtgc 980581 gggtgcgcat tccgatgtcc cgagggaccg actccctcaa cgtcgcgacg gcggccgcat 980641 tggctttcta cgagcggact aggtcgggcc atcacattgg gcccggcacg tgaacgatca 980701 gcgcgaccaa gccgtgccct gggcaacggg tttggcggtc gccggcttcg tcgccgcagt 980761 catcgcggtt gcggtcgtgg tgctgagcct cggcctgatc cgcgtgcatc cgctgttggc 980821 cgtcggtctc aacattgtgg cggtcagcgg gttggcccct acgctgtggg gctggcgccg 980881 caccccagtg ctgcgctggt tcgtgcttgg cgcggcagtg ggcgtggcgg gcgcgtggtt 980941 ggcgctgctc gccttgacgt tgggggacgg ctagcgacgc ccgcctgagc gcaccccgag 981001 cagcacatct tcccaggcag gtatggcggg tttgcctcgt cggttgctga ccggctgtgc 981061 ggacggcacc gtgagcgtcg gctgtgcggg ctcgggctca tcgaagtcga gatgggcgac 981121 cggcgccaac ggccgtagcg ggcgattgaa ggtggggttg atcagctcat gggccgtgtc 981181 gtcgatcgcg gtggcggttc cgccgtgggc gccgggggtg aagcggaaat gcgccaggtt 981241 gtcggagcgg ccagccttcc aggcaagctg caccgtccag cgactgtcct cgttgcgcca 981301 cgcgtcccag gtgaggctgt cggggttaag gccgcgtgcc accagggccg cggcgacggt 981361 ctcctgcatg gtcagcaccg ccgggccgtc ggccaggacc gggtgcgccg cggttgccag 981421 ctcggccgcg cgcgagcgtt ccaacagtac cgggtgggca aaccggcgga tacgggcgat 981481 gtcggagccc gatgccgcag cgacctgttc gacagacgcg ccggcccgaa ttcgggcctg 981541 aatctccttg gggctcagca cgttggtgac ctcgatgtcc agctgggctt gctccggctg 981601 gacggagtcg tcccgtagcg ccgcccgcag tcggtcgtcg accggcagct tgaactgttc 981661 ggacgggatg gcaccctggc agatgatgtt tttgccgtcg gcatcgagcc caacgacttt 981721 gagttcccgc atggcttctc ctcgcaggct ccgggcagga caacgccgga cctgttacgt 981781 gcgcactcta gtgcggtaaa cgccgttagc ctcgttgaca cgcggaggtg tcttgccggc 981841 atggcgctgg tgaccggaat gcccggtcac agccgcacta aggcagcgct aaagccgctc 981901 gaccacccag tcgacgcact cggtgagggc gctgacgtcg tccggctcga ccgcggggaa 981961 catcgcgacc cgcagctggt ttcggcccag tttgcgatac ggctcggtgt cgacgatgcc 982021 gttagcccgc aggatcttcg cgacggtccc ggcgtcgacg tcgtcgacga agtcgatcgt 982081 gcccaccacc tgcgaccgca acccggggtc ggtgacaaat ggcgtggtgt agggccgctc 982141 ttgcgcccac gagtacaacc gctgcgacga gtccgcggtg cgtttgaccg cccagtccaa 982201 gccaccgtta cccaccagcc agtcgatctg ttcggccagc agcgccagcg tggcgatggc 982261 cggtgtgttg tatgtctggt tcttcaagct gttctcgacc gcgatcggca gggacaggaa 982321 atcaggaacc cagcgaccgg tcgcggcgat ggcctcgatc cggctcaggg cggccgggct 982381 catgatggcc agccacaggc cgccgtcgct ggcgaagttc ttctgcggtg cgaagtagta 982441 ggcgtcggtc tcggcgatgt cgaccggtag gccgccagca ccggaggtgg cgtcgatgac 982501 gaccaaggcg tcatcggagc cctccggacg gcgcaccgca accgcgaccc cggtcgaggt 982561 ctcgttgtgg gcccaggcga tcacatcgac tgacgggtcg gtttgcggct ccggagcact 982621 gccgggatcc gacgtgatga tgatcggctc gccgacgaac gggttcttgg aaacggcgga 982681 agcgaacttc gcgctgaact cgccgtaagt caagtgcagt gagcgtttgt caatcagccc 982741 gaaggcggcc gcatcccaga acgccgtggc accaccattg cccagtatca cctcatagcc 982801 gtccggcaac gagaacagct cggccaggcc tgaccgaacc ctgcccacca gattcttgac 982861 cggcgcctgt cggtgcgacg tgccgaacaa tgccgctgcg gtggtggtca gcgtttgcag 982921 ttgctcaagc cggaccttcg acgggcccga cccaaagcgg ccgtcgcggg gtttgatggc 982981 ggtgggaatt tccaggtggg gggtgagctg gtcggccatg ccatcagggt agtgaggggt 983041 accgaaccgc ggcgactcga gcggaacgaa agcctgccgg cacaggcgcg tagtgtgaac 983101 aagctcacat gcaagccctg gctggtggct gggtcatagt gtcgccaagg gtctggataa 983161 ttcccggtac cagcggtacc gtgttcgata cccgtgcgga cgcacacctc ggtggggagg 983221 cttcgaatgg acaggacgcg catagttcgg cggtggcgcc gcaacatgga cgtggccgac 983281 gacgccgagt acgtggaaat gctggccaca ctgtccgagg ggtctgtgcg gcggaatttc 983341 aacccgtaca ccgatatcga ctgggagtcg ccggagttcg ccgtcacgga caacgatccc 983401 cggtggatcc tcccggcgac cgatccgttg ggccgccacc cctggtacca ggcgcagtcg 983461 cgggaacgcc agatcgagat cgggatgtgg cgccaggcca acgtggccaa ggtcgggctg 983521 cacttcgaat ccatcctgat tcgcggcctg atgaactaca cgttctggat gcccaacggc 983581 tcaccggaat accggtattg cctgcacgaa tcggtcgaag agtgcaacca caccatgatg 983641 ttccaggaga tggtcaaccg tgtcggcgcg gacgttccgg ggctgccacg gcggctgcgg 983701 tgggtttcac cgctggttcc gctggtggcc ggaccattgc cggtggcctt cttcatcggc 983761 gtgctcgctg gggaggagcc catcgaccac acgcaaaaga acgtgttgcg cgaaggcaag 983821 tcgctgcatc cgatcatgga acgagtgatg tccattcacg tggccgagga agcgcggcac 983881 atctcgttcg cccacgagta cttgcgtaag cggctgccgc gcctgacccg gatgcagcgg 983941 ttctggatct cgctctactt ccccctgacg atgcggtcgt tgtgcaacgc gatcgtggtg 984001 ccgcccaagg cattctggga ggaattcgac atcccgcgcg aggtcaagaa ggagttgttc 984061 ttcggctcgc cggagtcgcg aaagtggttg tgcgacatgt ttgccgacgc ccgcatgctg 984121 gcccacgata ccggattgat gaacccgatc gctcggctag tgtggcgact ctgcaagatc 984181 gacggcaagc cgtcgcgcta ccgcagcgag ccgcagcgtc agcacttggc tgccgcgccg 984241 gccgcatagc ttgctacgag tgcacgcatg ccgcacgtaa ttactcagtc gtgctgcaac 984301 gacgcgtcct gcgtcttcgc atgtccggtg aactgcatcc acccgacgcc ggacgagccg 984361 ggcttcgcga cctcggaaat gctctatatc gatccggtgg cctgcgtgga ctgtggtgcc 984421 tgcgtaaccg cctgcccggt cagcgcgatc gcgccgaaca cccggttgga cttcgagcag 984481 ctgccgttcg tcgaaatcaa tgcgtcgtat tacccgaagc ggcccgccgg cgtgaagcta 984541 gcgccgacgt cgaagctggc tccggtgact ccggccgccg aggtgcgtgt gcgccggcag 984601 ccgctgacgg tagccgtcgt cgggtccggg cccgcggcga tgtatgccgc cgatgagctg 984661 ctggtccagc agggagtgca ggtcaacgtc tttgagaagc tgccgacacc ctacgggctg 984721 gtgcgctccg gggtggcgcc ggatcaccag aacaccaagc gggtcacgcg actatttgac 984781 cggatcgccg gtcatcgccg cttccggttc tatctcaacg tcgagatcgg caagcatcta 984841 ggccatgccg agctattggc ccaccatcac gccgtgctgt acgcggtcgg agcgcccgac 984901 gaccgccggc tgacgattga cgggatggga ctgccgggca ccggtaccgc cacggagctg 984961 gtcgcgtggc tcaacggaca tcccgacttc aacgatctgc cagtcgatct cagtcacgaa 985021 cgcgtggtga tcatcggcaa cgggaatgtc gcgctcgacg tggcgcgcgt gcttgcggcc 985081 gatccgcacg agctggccgc caccgacatc gccgaccacg cgttgtccgc gttacgcaac 985141 tcggcggtcc gtgaggtggt ggtcgccgcc cgccgcggtc ctgcccattc ggcgttcacc 985201 ctgcccgagc tgatcgggct cacggccgga gccgacgtcg tgcttgaccc gggagatcat 985261 cagcgagtac tcgatgatct ggcaatcgtt gccgatccgt tgaccaggaa caagctggag 985321 atcttgagca cgctggggga cgggtcggcg cctgcgcgac gagtcgggcg cccgcggatc 985381 cggctggcct atcggctcac gccgcggcgc gtcctcggcc agcggcgggc cggcggagtt 985441 cagttctcgg tcaccggaac cgacgagctg cgccaactgg atgctggcct ggtgctgacg 985501 tcgattggct accgcggcaa gccgattccc gacctgccgt tcgacgagca ggccgcgctc 985561 gtgcccaacg atggtggacg ggtcatcgac ccgggcaccg gcgagccggt gcccggcgca 985621 tacgtcgcgg gttggatcaa gcgcgggccc accgggttca tcggcacgaa caagtcctgc 985681 tctatgcaga ccgttcaggc gttggtggcc gacttcaacg acggccggct gaccgatccg 985741 gtggctacac cgacggcgct ggatcagctg gtgcaggccc gccagcccca agccatcggc 985801 tgtgcgggat ggcgggccat cgacgcggcc gagattgcgc gcggcagcgc cgacggccgg 985861 gtccgcaaca agttcaccga cgtcgccgag atgctcgcgg cagcaaccag cgcgcctaag 985921 gaaccgcttc ggcggcgcgt gctggcccgg ctgcgtgacc tggggcagcc gatcgtgcta 985981 accgtcccct tgtgatgaca tggcggcttg gatctcatcc atgttgacct cgcgcaccgg 986041 ctggcccagc gaccagtggt ggccgaacgg gtcggcgacc accccgtagc ggtctcccca 986101 gagctggtcc tccaaggcgg tcaccaccgt ggcgcccgcg ttcagggcac gctggaactt 986161 ggcgtcgaca tcggtgacgg tcaaatgaat ggtgaccggt gttccgccca gcgaggtggg 986221 cgtcatcgac ttgccgccgc acatctgcgg gacgtcgtcg ttgagcatca ccgtaaagcc 986281 gttgatgcgt agtgcggcgt ggatcagttt gccatcggga ccggggacgc gccccagttc 986341 gacggcgtca aaggccttga cgtagaagtc gatcgccgag gcagcgtcgt cgacgacaag 986401 gtgtggtgac agagcgggtt cgacgttgat cgccatggtg tctccttgtt gttggtgtgc 986461 tcggccaatc cggggcccgg acaggctcac ggatattgac tcccggcgcg atggaaaatc 986521 atcgcggtgc cgtcattcaa tcgccggaca cgtggccacc gcccagcggt gtggccagca 986581 agccgaatct caaccgcagg tgtgttcaat gaatactttt ccgtcacaac gtgattgctg 986641 ctttgtgtcg acaagcgcac ttttcggtct cgacacgaat gctcttccgt tacagcgcaa 986701 gttgaaactt tctgcacgca acccatgccg accatgtccg cgccacccgc tcaagcgccg 986761 gtatgtggcg ccttggcggc taggccaacc gcccccggca acgccagctg cacacgccca 986821 gcgaagcgcg attgtcggta cgggtcgcgc tgcgaaacct gcctcccatt cgcactagca 986881 aaagactgtc gacaagcgag cagtcgactt caggccgcga ccgaacccga cgagacgaca 986941 acaacatctg tcatctcaat gcgctcacca ggatcgctac aatatcagcc agctacatga 987001 gccgatgtat atccaggaag gctctgccgc cgacatgttg gatcgctcgc gcggacagct 987061 gtaccggctc tacctggcta gtaggtgaat tcaatggcgc gttcgctcat tactcaccca 987121 tgtgcacaat aggttcgcgt gcggctcgcc ggcaacgttg gcaacatccc gattcccatt 987181 gattgcacgt tgcgcggcct aacccaatat tcccggacga acaacgccga ggtcgtgcag 987241 agcgtcgaga cacaccaccg tcccgctaac tttgatgccc tcacctgagg aaaaccacag 987301 gagcgtcagg tactcaccca ctgcgggaat tgcgatgacg ttcaaaccga tcgaggccgc 987361 gcagctacgc cagcgcgcag tgaacaggcc gtaactggac cgcgcttgcg caacgttcga 987421 aaagggatcc ggtggagcgg cccgacgaca ccaaataggc catatccccc aaagactggt 987481 attgacaacc gttctgatgc cgcgtcagac ttcccaccac gccacggacc gtccaacgcc 987541 agaactcaat accgtctcgt cccaggcgaa accgtgagcc tagccgatga tctcctggca 987601 ttggtcggac tggacttgat ctgctcgctg acaagcatac gtatcagtgc tacgaaccgt 987661 tcacgcggtg aacctgctgg gcgcacaagg agaatcgatg gattacgcca aacgcatcgg 987721 ccaggttggg gcgttagccg ttgtcctggg ggtgggggcg gcggtgacta cccacgcgat 987781 cggctctgcc gcgccgacgg atccgagctc ctcgagcacc gattcgccgg tcgacgcgtg 987841 ctcgccgttg ggtgggtccg ccagttcgtt ggctgcgata ccgggcgcca gtgtgccaca 987901 ggtcggcgtg cgacaggtag accccggaag catccccgat gacttgctca atgccctgat 987961 cgactttctg gccgcggtac gcaacgggtt ggtgcccatc atcgaaaacc gcactccggt 988021 agcgaatccg caacaagtca gcgtccctga ggggggcacc gtcggcccgg tccggtttga 988081 cgcctgcgac cccgatggca accggatgac cttcgcggtg cgcgagcgcg gtgcacccgg 988141 tggaccccag catggcatcg tgaccgtcga ccaacgaacg gccagcttca tctacacagc 988201 cgatccgggt ttcgttggca ccgatacctt cagtgtgaac gtcagcgatg acaccagcct 988261 gcacgtgcac ggtctggcgg gatacctggg tccgttccat gggcacgacg acgtcgccac 988321 cgtgaccgtg ttcgtcggca acaccccgac cgacaccatc agcggcgact tcagcatgct 988381 cacctacaac atcgcggggc tgcccttccc gctatccagc gcaattctgc cccggttctt 988441 ctacaccaaa gagattggga agcggctcaa cgcctactac gtcgcgaacg tccaggagga 988501 tttcgcctac caccaattcc tcatcaagaa atccaagatg cccagccaga ccccgccgga 988561 gccgcctacc ttgctgtggc ctatcggtgt gcccttctcc gacgggctca ataccctctc 988621 ggagttcaag gtgcagcggc tggaccggca gacatggtat gagtgcacat ccgacaactg 988681 cctcaccttg aagggcttca cctacagcca gatgcggctt cccggcggtg acacggtcga 988741 cgtctacaac ttacatacca acaccggtgg agggccgacc accaacgcca acctcgcgca 988801 ggtcgccaac tacatccagc agaactcggc gggccgcgcg gtcatcgtca ccggcgactt 988861 caacgcgcgg tactccgacg accaaagcgc tctgttgcaa tttgcgcagg tcaacgggct 988921 caccgatgcc tgggtgcagg tagaacacgg ccccaccaca ccgccgttcg cgcccacttg 988981 catggtcggc aacgagtgcg agctgctcga caagatcttc tatcgaagcg gccagggagt 989041 gacgttgcag gccgtcagct acggcaacga ggcgccgaaa ttcttcaatt ccaagggtga 989101 gccactgtcg gatcacagcc cggcggtggt cggcttccac tacgtcgcgg acaacgtggc 989161 cgtacggtga cagcggttga tcgccaactg gtttgccgtc ggcctcaggc ggtggtgagt 989221 acccgctccc agccgtcgac cgattccggg ctgcgcgggc ccggtcccac gtaaatggcc 989281 gacgggcgga ccagcttgcc gagtcgcttc tgctcgagaa tgtgggcaca ccagccggca 989341 gtgcgcccac aggtgaacat tgctggcatc atgttggccg gtacccgggc aaagtccagg 989401 accactgcgg cccagaattc gacattggtc tcgatcgccc gatccggacg gcgctctcgc 989461 agttctgaca gcgcagcctg ctccaccgcg accgcgacct cgtagcgggg ggcgcccagc 989521 cgctcggcgg ccgcccgcag cacccgcgcc cgcgggtcct cggcgcggta gacccggtgc 989581 ccgaacccca tcagtttctc gccgcggtcc aggattccct tgaccacgct gcgggcatcg 989641 ccggcgcgtt cgacctcgtc gagcatcggc aggacgcgcg ccggcgcgcc accatgcagc 989701 ggtccgctca tcgccccgat tgcgcccgac agcgctgctg ccacatccgc cccagttgag 989761 gcgatcacac gcgcggtgaa tgtcgaagcg ttcatgccgt gctcggcggc cgacacccag 989821 taggcgtcaa tggcctcgat gtgtctgggg tctggctcgc cctgccagcg cgtcatgaaa 989881 cgtgctgtga ccgtcgagca ttcatcgatg attcgctgcg ggaccgccgg ctggtagatg 989941 ccccgtgcgg attgcgcgac ataggacagc gccatcaccg atgcccgggc cagctgttgg 990001 cgggcggtgg cgtcgtcgat gtcgagcagc ggcgcatatc cccagatggg cgccagcatc 990061 gccaggccgg cctggacgtc gacgcgcaca tcgccggagt gaatcggcag cgggaacggt 990121 tcagccggcg gcagcccgct gccgaagttg ccgtccacca gcagcgccca cacatcgccg 990181 aaggtgaccc gctgacttac caggtcttcg atgtcgacgc cacggtagcg cagggccccg 990241 ccgtctttgt ccggctcggc gatctcggtc gtaaaggcca ccacgccgtc gaggccgggg 990301 acgaaattct ccgggaccac tgtcatacga gaattctcac acctggcccc ggcaacgacg 990361 ctaccggctg gtgccaatca cggtgccggc gatgagcgtg ccgcgagaat cgtcacgagg 990421 gtgagccgcg gcgtgccgcc tcgtctacca gttgtactcg ggaggccaag ccaagtttgg 990481 cgtagacgtg ggtgaggtgg gtttgcacag tgcgcggcga gacgaaaagc cgttttgcaa 990541 tgtccttgtt ggataacccc tcgctgacca accgcacgac gtcgcgttcg gtcggggtca 990601 acgagcccca cccgcgggcc ggtcgcttgc gttcaccgcg accgcgttgt gcatatgcga 990661 tcgcctcgtc ggtggacaag gcggccccct cggcccaggc gcggtcgaaa tcctcatcac 990721 ccatcgcctc acgaagcgcc gtcaccgagg cctggtagcc ggcatcccaa atcttgaagc 990781 ggacctgacg tgtctgttgc cgaagggcgg ctgcggcacc gagaaggcgg acaccttcgg 990841 agtgactgcc gacctcgccg gccaggccgg cgaggagttc catggcatct ggcatgccct 990901 ggtagatgtg cagctcggcg ccgcacgcca gcgcagcatg agcatcatcg cgcgccagtt 990961 ctggttcgcc ccgtgcggtg gctacgcgcg cgcgtattgt caacgccacc attcggtgcc 991021 acccattggt cgcatcgacg gcgtcgttgg cgaactgtcg tgcggcgatc gcatcacctc 991081 ctgccagggc taactgcgcc atcaggacct ggtgcatggt cacctggtcg ggctgggccc 991141 taagaatcgg ccgcgccgcg tcgctggcct cgagcgctgc cgtgacatca ccggcggcca 991201 gcgcggcgta cgtcatcgcc gcataaccaa tgccttggta cacaccgcct aactccgtcg 991261 cggctgcaat gcacgccccg gctatggcgt gggccgcgct ggcgccgcaa tacgccagca 991321 cctgggcttg ggtatatagg ccgagaacct ttgtcggcac atcgttggat gcctcggcct 991381 cggcagtgat ttccctggat agctcgaggg cttcggtcag attgccagcc cacatctgcg 991441 ccaaactaag ccacaagctg cagtgacgtg agacgaaccg gtcgccgatg gtgtcggcca 991501 ggtcgcggca ttcttctgcc gcggctcgca aagcattcgg gtcacctgat atgcaggtcc 991561 ccaccccccg ccagtagagg atttgacaca gcgtccattt gtcgtcaata gcgcgtgcca 991621 ggtcggtcgc ttcggcgaaa tagggcgcag cggcctccgc gttgtagcca ctgctacagc 991681 cgcaggcggt gagcgcccgc accaacgcgg cggggtcgcc cacctcacgt gccatcgcca 991741 gcgcttgttg tgcgggagcg atgatgtcgg tggcgcctac cggactggtg gccagccagg 991801 tactgagcat tgccttgtca gcgagcgctc gcgcccgtac tgctgttgac acagcgagcc 991861 ggtggaacct ttggtcttcc aggatcgagt tgaaccagga caacccctcg cgcaggtgcg 991921 cccgcccgaa ccagattggt tgcagcgaag atgcgagctg taacgcttcg gtgatatggc 991981 cattttcccg gctccaggcg aacgcggcgc gcaggttgtc gatctcggtc tcagcccggg 992041 cgacaagccg ttggtgatcg ttgtccgcag gagtgttgag tgaggcggcc agcgccgtgt 992101 agtagtcacg gtgacgtgcg tgcacatcgg cctcgccgga gtcgcccagt ttttccagcg 992161 cgtaccgacg caccgtttcc agcagccggt accgcgtgcg gccctggcag tcgtcggcca 992221 ccaccagcga cttgtctacc agcagggtca gctgatcaag caccgaaaac ggatccaggt 992281 cgctaccggc ggcgaccgcc cgcaccgcgg cgaggtcgaa cccgccgaca aatggcgcca 992341 gtcgccgaaa caagatttgc tcggtctcgg tcagcagtgc atgcgaccaa tcgatcgagg 992401 cgcgaagtgt ctgctggcgc tgcaccgcgc cccgcacacc gccggccaac agccggaaac 992461 agtcgtccag accgtcggca atctcgagcg gtgacatcga ccgcacccgt gcggcagcga 992521 actcgatcgc cagcggtatg ccgtctagcc gccggcagat ctcgccgacg gccgcggcgt 992581 tgtgattggc gatggtgaac ccgggctgaa ctcggctggc tcggtcagca aacaattcga 992641 ctgcttcgtc ggttatcgac atcgacggta cgcgccaggt gatctcgccg gccatcccga 992701 tcggctcccg gctagtcgct aagatcgtca gctccggaca ggcccccaat agctcaacga 992761 ccaacgctgc gcacgcatcg agaagatgtt cacagttgtc caacaccatg agcatgcggc 992821 gattgccgat gaatcggcga agactatcca tggttgaacg gcccggctga tcgggcagac 992881 ccacggcgcg cgcagccgtg gctgcgacga tcccggattc agtgatcggg gccagatcga 992941 caaagcacaa accgtcgcga agttcggatg cactcgcgat ctggattgcc agacgggtct 993001 tgccgacacc gccggttccg catagcgtca cgagccggtt ctgcgccaac agtgcccgca 993061 cctcagctta tttgcgcacg gcggcccaca aatgtggtga actgcgccgg gagaatcgat 993121 gtcgggctgg atttggccgt gcgcagtggg ggaaactttt cgcgaatgtc ggggtggcac 993181 aactgcatga cccattcggg acgaggtaga ccgcgcagcg ggtggcggcc gagatcgaca 993241 agccatgcat cggctgggag ccggccagtc actaaatcac ctgtcgcagc tgacaggaca 993301 acctgacccc cgtgtgccaa atcgcggaga cgcgccgtcc ggttgatagt ggggccgaca 993361 tagagttcgt cgcgcaactg tacctcgcct gtatgaagac ctatacgtag tcggatcggc 993421 gcgagcgagg tccgctgcag atccagcgcg catgcagcgg catcgctagc gcgagtgaaa 993481 gccgcaacga agctatcacc ctcgtaccgt ttgaccggct gcaccccacc gtgattcgtg 993541 atagcttccg acacagtgtg atccaagtgc gcgatggcgg tcgccatgtc ctctgggcac 993601 atttgccata ggtgggtcga ttcctcgacg tcggctaaga gcaatgtcac cgtgcccgtc 993661 ggcggcaatc tgctcacgtc taatccctgg ttggctataa ggacgcgtct gcgtggggga 993721 acgaactcac atcggccaac atctggtgga gccgcatagc agcggagcga atggtaccgg 993781 agatccagcg atcctagcgc agatatacga cccctggcga cgcactttgc gcatgttggc 993841 ggatgatctt cgccccgcag gatcgcatgg tcgatgtcga tgttgggagg aaggctgtta 993901 tgaactgcgt tgaagagcac gatacgtgtc tgaccactgc tatcacgtca tcgcaacacc 993961 ttcgcggcgc cgcgaagcca ataagcacac tacagttcgg ggaagacacc tggcccatcc 994021 tcgaaacagg cctctcgcag cgatgttcat taccgcccaa agagattgtc ttcggcgctg 994081 cacggtgggc gctcgcggcg gcccgcggga tgctaccgcg gcccacgacc gacagcccac 994141 cgcagcgtca gcgctacccg aagcgctacc gattcctgga gcactcctgc ctagaacgcg 994201 agatgcgtcg actatagaac agcgtcgcgt gtttgtctcg gtagctgctc tgtatagtat 994261 gcgttgctta accgcatgtg ggagggtgat tttgggctgt tctggggggt cggagcgatg 994321 accgggcgat gtccgacggt tgccgtggtc ggagcgggta tgtccggaat gtgcgtcgca 994381 attacgttgc tgagcgcagg gattactgat gtctgcatct atgaaaaggc cgacgatgtt 994441 ggcggaacgt ggcgcgataa cacctatcca ggtctgacat gtgatgtgcc gtcccggctc 994501 tatcagtaca gctttgccaa gaatccgaac tggacccaga tgttttcacg cggaggcgaa 994561 atccaagatt acttgcgtgg gatcgccgag cgctacgggc tgaggcaccg gattcggttt 994621 ggcgccacgg ttgtcagcgc ccgattcgac gacggccggt gggtgttgcg caccgattcc 994681 ggaacggagt cgacagtaga cttcttgatt tcggccaccg gcgttttaca tcatccccga 994741 ataccgccga tcgctggttt ggacgacttc agggggacgg tgtttcactc ggctcgctgg 994801 gatcacacgg ttccgctgct gggacgccga atcgcggtga tcggtaccgg gtccacgggc 994861 gtacaactcg tctgcggcct ggctggggtc gcgggtaaag tcaccatgtt ccagcgcacc 994921 gcacaatggg tgctgccgtg gcctaaccct cgatactcga agctggcgcg tgttttccac 994981 cgcgcttttc cgtgtctggg ttcgctggcc tataaggcat atagcctttc cttcgaaacg 995041 ttcgcggttg cgctcagcaa tccaggtttg caccgaaagc tggtaggggc cgtgtgtcgc 995101 gccagcttac gtcgggtgcg tgacccccga ctgcgtcggg cactgacgcc tgattacgag 995161 ccgatgtgca aacggctagt gatgtccggc ggattctatc gggcgattca gcgtgacgac 995221 gtcgaattag tcaccgccgg tatcgatcac gtcgaacatc ggggcatcgt caccgatgat 995281 ggtgtgttgc acgaggtgga cgtcatcgtg cttgccacgg ggtttgactc tcatgcattt 995341 ttccggccga tgcagctgac cggtcgcgac ggcatcagga tcgacgatgt gtggcaagac 995401 ggtccgcatg ctcatcaaac cgtcgcaata cctggatttc cgaacttctt tatgatgttg 995461 gggccacaca gcccagtggg aaacttcccg ctgacagcgg tcgccgaatc tcaggctgaa 995521 cacatagtgc agtggataaa gcgatggcgc catggtgaat tcgacaccat ggaaccgaag 995581 tcagctgcta ccgaagcata taacacggtg ttgcgggccg cgatgccgaa caccgtctgg 995641 accaccggct gcgacagctg gtacctgaac aaagacggta ttcctgaggt ttggccattt 995701 gcaccggcca aacaccgcgc catgctcgct aacctacatc ccgaagaata cgacctgcga 995761 cgctatgctg cggtgcgcgc aactagtcgg cctcaaagcg cttgaagcct atcgaggtgc 995821 tggacggtga cgttcgcgcg ggatcggcca ctaatcccgt tctgacggcg ctgacaaagg 995881 ttatagcggt gaccattggc gcagcttcgg tatcggcttc gggcaccgct cggccgacgc 995941 ggcgcagata ctcggccaat ggagtagcgg tcgcgcgcca gcctcgctca tcgaaccatt 996001 ccgtggcccg cgcccaccgc tcgttgtaga ccatttgaaa gaacctgcgc gggtcgcctt 996061 gggcgttcgc ggcgcgttct cgttcgagct tcgctgcgaa ttcgcaaggg tccagtggag 996121 ttgcttcctc gacagcaacg tggctaccgg ggctagccaa ggtgtcgatg ccgataaaca 996181 ggcgctgctg ggcctcggcc gagagataga ccagcaggcc ctcggcgatc caagccgacg 996241 gccggttggc atcaaatccg ttgttacaca aggctatctg ccactcatcg cgcagatcga 996301 cagcaaccga ccgacgttgg gctcgcggcc gtatgtgata gtcggcgagc accgcgttct 996361 tgaagtcgag gacctgaggt cgatccaact cgaagattgt tgtcccgatt ggccattgca 996421 atcggaatgc acgggaatcc aatcctgcag ccaagatgac cacctgcttc atgccggcgg 996481 ccgttgcccg ggagaaatac tcgtcgaaat acctggtgcg ggcaccttgg aagttgacga 996541 aatgctcacc gaagtccccg gttgtcagat agtgatcggg cagcttgccg tccaatacgt 996601 cggcccattc accacctgcg gcacggcaga aaacctcggc atagggatcg atggccagcg 996661 gatcggcctt ctgcgtctcc aatgctcttg cggcggctac caatagtcct gtcgaaccaa 996721 cactcgtggt gacatcccag ctatcgtcct cggtccgcat tcatcgaact ctagttgctc 996781 cagtccgccc accgctgtcg gtatcccagc gcagtcggcc gtgcacacat atctgcgcgg 996841 tggacttggt acttctacgc gcattcgccg atgttttgcg atccgcggcg ggtctatggt 996901 gccatttatg tgccaggatc ggtcttcaat aacaacgtcg cgaagcgagg ggtcgtgacg 996961 tgagagggct cgcttatgcc ggcggtggat gcccagtagg gcgacggtcc aggaattctc 997021 agacagttat ccgttctgcc acaatggatt ccggccgatc atgatgccaa agatcgtctc 997081 cgtccaacat tccactcgcc gccacttgac gagctttgtc ggtcgcaagg ctgagctgaa 997141 cgacgtgcgg cggctcctgt ccgacaaacg actggtgacg cttaccggtc cggatgggat 997201 ggggaaatcc cgtctcgcgc tgcagatcgg cgcccagatt gcacacgaat tcacttatgg 997261 ccgttgggat tgcgacttgg ctacggtcac tgaccgagac tgcgtgtcca tctcgatgct 997321 gaatgccttg ggcttgcctg tccagccggg tttgtctgcg atcgacacgc tcgtcggtgt 997381 catcaatgat gctcgggtgc tgctggtgtt ggaccattgt gagcatttgc tggacgcgtg 997441 tgccgcgata attgattcgc tgttacgttc ctgtccgaga ttgacgatcc tgacgacaag 997501 taccgaagcg atcgggttgg cgggcgagct gacctggcgg gtgcccccgt tgtcgctgac 997561 caacgatgcc atcgagctgt ttgtcgaccg ggcacgccga gtgcggtcgg attttgcgat 997621 taatgccgat accgcggtga cggtcgggga aatctgccga cgcttggacg gtgtgccact 997681 ggcgatcgag ctggccgcgg cgcgaacgga caccttgtcg ccggtggaga tccttgctgg 997741 tctaaatgac cgattccggc tggtggccgg tgctgcgggc aacgcggtgc gccccgaaca 997801 gacgctgtgt gccacggtgc aatggtcgca tgctctgttg agtggacctg agcgtgcgtt 997861 gttgcaccgg ttggcagtct tcgccggcgg gttcgacctt gacggcgccc aggcggtcgg 997921 tgccaatgac gaggacttcg agggctacca gacactcggc cggtttgccg agttggtgga 997981 caaggcattt gtcgtcgtcg aaaacaacag gggccgagcg ggataccggt tgctgtattc 998041 ggtgcgtcag tacgcgttgg agaagctcag tgagtcggga gaggccgacg ccgtgcttgc 998101 gcgttaccgc aagcacctca aacaacccaa ccaggtagtg cgtgctgggt caggcggggt 998161 tcggtactga tgcgtgaacg taccttaacc gtcggtggga attgaccgcg ccacccatag 998221 cagtcgagag gaacacccgc agcaaagtgc gccaacaaca ggaggctgac gtcgttgccc 998281 tgggtcgaaa gccagggctg ctatgtgtgc cggaaaggtt ccgtgcaatg gatcttccga 998341 tggcagccgc cgatgcctta ttcctatggg ccgagacgcc gacgcggccg ctgcatgtcg 998401 gcgcgttggc cgtgctgagt cagcccgaca acgggaccgg gcgttacctg cgcaaggtgt 998461 tctccgccgc ggtggcccgt cagcaggtgg cgccgtggtg gcgccgacgc ccgcaccggt 998521 cgctcacctc gctcgggcag tggtcttggc gcaccgagac cgaggtggac ctggattacc 998581 acgtgcggct tagcgcattg ccgccacggg ccggtaccgc cgagctgtgg gcgttggttt 998641 ctgaactaca cgccggcatg ctggaccgct cccgcccgct atggcaggtg gacctgatcg 998701 agggtctacc tggcgggcgg tgcgcggtct acgtcaaggt ccaccatgcg ctggcggacg 998761 gagtctcggt gatgcggctt ttacaacgga tcgtcaccgc ggacccgcat cagcgtcaga 998821 tgcccacctt gtgggaggtg ccagcgcagg cgtcggtggc caaacacacg gcaccgcgcg 998881 gttcgtcgag accactgacg ttggccaagg gggtgctggg tcaagccagg ggcgtcccgg 998941 gcatggtgcg cgtagtggcc gataccacgt ggcgggcagc gcaatgtcgc agcgggccgc 999001 tgacactggc cgcaccacac accccgctga acgagccgat cgccggggcc cggtccgtgg 999061 caggttgttc ctttccgatc gagcggctgc gacaggtcgc cgaacacgcc gatgccacca 999121 tcaacgatgt cgtgctggcc atgtgcggcg gggcgttacg tgcgtacctg atcagccggg 999181 gagcgttacc gggtgcgccg ctgatagcga tggtgccggt ttcgctgcgc gataccgcag 999241 ttatcgacgt gttcggccag ggtccaggca acaagatcgg tacgttgatg tgttcgctgg 999301 cgacgcacct ggccagtccg gtcgaacggc tgtcggcgat acgggcaagt atgcgcgacg 999361 gcaaagccgc gatcgccggc cgaagccgaa accaggcgct ggctatgagc gcattgggcg 999421 ccgccccgct cgcccttgcg atggccctgg ggcgcgtgcc cgcgccgctg cgcccaccaa 999481 atgtgacgat ctccaacgtg ccgggcccgc agggcgcgct gtactggaac ggcgctcgcc 999541 tggacgcgct ctacctgctc tcggcacctg tcgatggcgc ggcgttgaac atcacctgta 999601 gcggcaccaa tgagcagatc actttcggtt tgacgggctg ccgtcgtgcc gtccccgcgc 999661 tgagcatcct gaccgaccag ctcgcccacg aactcgagct actcgttggc gtcagtgaag 999721 ccggcccagg gaccagactt cgaaggatcg cagggcgccg ttaaacggac gccgcgagtc 999781 atcacccggc cgagcgcgca gcggcttacc ttacgcgcgg ccgcccatgg tgccagagac 999841 cccaccccgg gcaggcgggt catcccgata gcgactacct tcagctataa gcacttagtg 999901 gggcagccat atcagccaaa gcgcgaaggg gttctcgtgg ccgacaccga cgacaccgca 999961 accctccgtt acccgggagg cgagatcgac ctgcagatcg tgcacgccac cgaaggcgcc 1000021 gacggcattg cgctcgggcc gctgctggca aaaaccgggc acaccacgtt cgacgtcggc 1000081 ttcgccaaca cggccgccgc taaaagctcc atcacctaca tcgacggaga tgccggcatt 1000141 ctgcgttatc gcggctaccc gatcgaccaa ctggcggaga agtcaacctt catcgaggtc 1000201 tgctacctgt tgatttacgg cgagctgccc gataccgacc agcttgccca gttcaccggc 1000261 cggatccagc gccacaccat gctgcacgag gatctcaagc ggttcttcga cggctttccg 1000321 cgcaatgccc acccgatgcc ggtgttgtcc agcgtggtca atgcgctgtc ggcgtactac 1000381 caggatgctc tggaccccat ggacaacggt caagtcgagc tgtcgaccat tcggctgctg 1000441 gccaagctgc ccaccatcgc cgcgtacgcc tacaagaaat cggtcggcca gcccttcctc 1000501 tacccagata actcactgac gctggtggag aacttcctac ggttgacgtt cggatttccc 1000561 gccgagccct accaggccga ccccgaggtg gtgcgggcgc tggacatgtt gttcatcttg 1000621 cacgccgacc acgagcagaa ctgctcgacg tcgacggttc ggctggttgg ctcgtcgcga 1000681 gccaacctgt tcacctcgat ctcgggtggc atcaacgcac tatggggtcc gcttcatggc 1000741 ggcgccaatc aggctgtcct ggagatgctc gagggcattc gcgacagcgg cgacgacgtc 1000801 agcgagtttg tacgcaaggt caagaaccgc gaggccgggg tcaaattgat gggtttcggt 1000861 catcgtgtct acaagaacta cgatccgcgg gcccgcatcg tcaaggaaca ggccgacaag 1000921 atcctggcca agctcggcgg cgatgactcc ttgctgggca tcgccaagga gctcgaagag 1000981 gcggcgctga ccgacgacta cttcatcgaa cgcaagcttt accccaacgt cgacttctac 1001041 accggcctga tctaccgggc cctcggcttc ccgaccagga tgttcaccgt gttgtttgcc 1001101 ctgggcaggc ttcccggctg gatcgcgcac tggcgtgaga tgcacgacga gggcgacagc 1001161 aagatcggcc ggccccgcca gatctacacc ggctacgcgg agcgcgacta cgtcaccata 1001221 gacgcgcggt aggccggcga gcagacgcaa aagcccccta aaccggcagg tattaggggc 1001281 ttttgcgtct gctcgccagg caagccagca ctgccatcgc ggcgttgtga ccgccgatgc 1001341 ccgacaccgc cccgccgcga cgggcacccg agccgcacag catgatccgc tcgtggtcgg 1001401 tggctacgcc ccactgccgt gccggtgtgt ccagcggatc gtcgttgtca gcgaacggcc 1001461 aggacaacgc accgtggaag atgttgccgc cggtcatccc aagcgtccgc tgcaggtcca 1001521 gggtggtcgt cgtctcgatg catggcttgc tctgcgcatc ggtccaaagc acgtcctgaa 1001581 tcggttcggc cagaacggaa ttcagcgacg ctaggacggc tgccgtcagc cgttcggcta 1001641 agccttcggt gtcgccgaac accgagtgcg gtgtgtgcaa gccgaacacc gtcagcgtct 1001701 gagcgccggc atcgcgcaac cgggcggaca ggatgctcgg gtcggtcagc gaatggcagt 1001761 aggcttcgca gggtagggga tccggcaacc gcccgctggc tgcttgcgag tacgcggcat 1001821 ccaattggct ccatgtctcg ttgacgtgga acgtcccggc aaatgcttgc tgcggtgtga 1001881 cactgtcgtc gcgcaaccgg gggagtcggc gcaccaccat gttgaccttg acctgtgcgc 1001941 ccggggccag tgccgcaacc ggttcaccga gcaggctggc cagcaccgcc ggtgtgaccc 1002001 cgaccagaac gaaccggccc cggaccaaat gctcggcacc gtcgctaccg tcgctgtggt 1002061 agcgcaccgt accgtctgga tcaagggcga aaacgtctgc accggtgact atttcggcgc 1002121 cgtggcgggc agctgccgtg gccagggccg aggtcaccga ccccatgccg ccgattggga 1002181 cgtgccagac tccggtgccc ccaccgacca ggtgatacag gaagcagatg ttctgcatca 1002241 gcgacggttc gtgcatgcgg gcgaaggtgc cgatcagcgc gtcggtggcg atcaccccgc 1002301 gtagcaggtc attggccacc gcgccggcga tggcatgccc gatcggctcg tcgaccatgg 1002361 cttgccaggc agcggccgcc tcgtggccgc cgtattccac aatgtcgcgg cgggcctgct 1002421 cgcgggtgcg cagcggctcg atcagggtgg gccacagccg tgcggtcacc agccggcagc 1002481 gccggtagaa cgcggcgaag ccgtgcgcat ccggcgcggc gccgatcgcc gcgaggtgcg 1002541 ctgcgcgtgg ttcgccggtg ggcccgatga gcaggccaga gcgcccggcc gtggctgggg 1002601 caggggtgta tgaggaaaat ggccgccgcg ccaaccgcac cggagcgccg aggtcggcga 1002661 cgatgcgcga cggcagcaag ctgaccaggt acgagtagcg tgacagcgcg acctcgacac 1002721 cgtcgaaggc ctgtatcgac accgcggccc ccccagtctg tgccagccgc tcgagcagtc 1002781 gcactcgaag cccggcccgg gccaggtagg cggccgcgac caagccgttg tgaccgccgc 1002841 caaccacgac aacgtcgaag tccctgtcgt gatcgctcat agtgacggcg gctatcgaga 1002901 cggatctagc cggtgtaccc ctcgacttgg tcggcgggac gcacgactgc ttcgcgcggg 1002961 tcaccaccgg tttggcgcaa tgcccgtcgc tgtcggagca ggtcccagca ctggtcgagt 1003021 tcgatctcga tgcggcgcag ttgctgctgc tcctcggact cgctgatgcc accgtgccgc 1003081 agctgcgctc gcaacgcctt ctcctcggcc accaggtcac ggatgtgtgc cagggtctcg 1003141 ctgtctgtcg gtttgcgtcc cttgcccatg gctccagtgt gcccgatttg acgcggtgtc 1003201 ccggcaccga ctcggtaggc tgcatatcgc ctgcagcacg gacgagacgc gttcgacgac 1003261 ctgagggagt ggcgtagtgg cttctaaggc gggtttgggc caaacacccg cgaccaccga 1003321 cgcgcgacga actcagaaat tctaccgggg ctcgccgggc cgtccgtggc tgattggcgc 1003381 ggtggttatt ccgttgctga tagcggcaat cggttacggt gcattcgagc ggccccagtc 1003441 cgttaccgga ccgaccggtg tgttgccgac actgacaccg accagcaccc ggggcgcttc 1003501 tgcgttgtcc ttgtctttgc tgtcaattag ccgcagcggc aacaccgtta ctctgatcgg 1003561 tgacttcccc gatgaggccg ccaaggcggc cttgatgacg gcgctcaacg gcttgcttgc 1003621 tccgggcgtg aacgtcatcg accagattca cgtcgatccc gttgtgcgat cacttgattt 1003681 ctcaagtgcg gaaccagttt tcaccgccag cgtgccgatt cctgattttg gcctcaaagt 1003741 cgaaagggac accgtcacct tgaccggaac tgccccttca tccgagcaca aggacgcagt 1003801 gaagcgcgcg gcgaccagca cctggcctga catgaaaatc gttaacaata ttgaggttac 1003861 ggggcaggca ccgccaggac ccccggcctc cggcccatgt gccgacctgc aatcagccat 1003921 caatgccgtg acgggtggac ccatcgcgtt tggcaacgac ggggctagtc tgatcccagc 1003981 cgactatgaa atcctgaacc gggtagccga caagctcaag gcatgtccgg acgctcgggt 1004041 gacgatcaac ggctacaccg acaacaccgg cagcgaaggt atcaatatcc cgttgagcgc 1004101 tcagcgagcc aagatagtcg ccgactacct ggttgcccgc ggagttgccg gcgatcacat 1004161 tgccaccgtg ggtctcggtt cggtgaatcc gatcgccagc aacgccacac ccgaggggcg 1004221 cgccaagaat cgtcgcgtcg agatcgtggt caactaagga gaacccagca tggattttgt 1004281 gatccagtgg tcgtgctacc tgctggcgtt cctggggggc tcggctgttg cctgggtagt 1004341 cgtcactctg tcgatcaagc gcgccagccg tgatgagggt gctgcggagg cgcccagtgc 1004401 agccgagaca ggcgcacagt gatggaacac gtgcactggt ggctggcggg cctggcgttc 1004461 acgctcggga tggtgctgac gtcgacgctg atggtccggc ccgtcgaaca tcaagtgctg 1004521 gtaaagaaat cggtccgcgg gtcaagcgct aagtccaagc cgccaacggc gagaaaaccc 1004581 gccgtcaagt cgggcaccaa gagagaggag tcgccgacgg cgaagaccaa ggtggcaacg 1004641 gagtctgctg cggagcagat cccggttgcc ggggagcccg cggcggagcc gatcccggtc 1004701 gccggcgagc cggcggcgcg tattccggtg gttccgtacg cgccgtacgg cccgggctcg 1004761 gcgcgcgctg gtgccgatgg cagcggaccg caggggtggc tggtgaaggg ccgctcggac 1004821 accaggctct actacactcc cgaagatccg acgtacgacc ctactgtcgc ccaggtttgg 1004881 ttccaggacg aggagtcggc agcgcgggcg tttttcacgc cgtggcgcaa gagcacacgg 1004941 cggacatgag gtcagggccg cagggctaac tgggcccggg aaggcgcaac acgaggcgcg 1005001 cgccacccag cgggctgttc tccagcgacg cggtgccgcc gtgcaactgg gcctgttggg 1005061 ccaccaacgc cagcccgaga cccgaccccg aatgagatgc cgtggacccg cgggagaacc 1005121 gctcgaacac cacttggcgc tcaccttcgg gcactccgct gccgttgtcg tcgatggcga 1005181 tctccacgcc ggcccgcgag ctgaccgcgg agagttgaac cagggtggcg ccgccgtgct 1005241 tgaccgcgtt ggcgatggcg ttgtcgacgg ccaggcgcaa cccggccggc aaacccacga 1005301 tgatgcaggt cggcgacggc accagcgata catcgagatc ggggtagatc cgggccgcgt 1005361 cgtgggcggc gcggtcgagc aggtcggtga tatcgaccgg cacgtgatcg tccgaggtcg 1005421 acagttcgcc ctgggccaac cgctccagcg cgctcagggt ggcctcaatg cgcgactggg 1005481 tgcggatgac gtcgttgagc acttctttgc gctggtcgtc gggcagatcc agggtggaca 1005541 gcacctccag gttggtgcgc atcgcggtca gcggagtgcg cagctcgtgg gaggacaccg 1005601 ccgcgaagtc acgcgccgac gcaagcgcct ccttggttcg gttctgctcg ttccagatgc 1005661 gctgcagcat gccgcgcatc gcctcggcga tctcgatggc ttcgctggcg ccgtgtactt 1005721 ccacgcgtgg cgcctcgtcg cccgcgtcga tggaccgggt ctgctcggcg agctgcttga 1005781 acgggcgtac cgcgaacgcg gccaacagcc aggcgaacac cgccgccgcg ccgatggcga 1005841 aggtacagat cagcagcacc cggcggtgca ggttgttggt ctcggctacg gtggcgtcat 1005901 acgtcgcgcc caccgccacc gacgtcggct cgggcccggg gatctccacc gtgcgcacgc 1005961 ggtagcgcac cccgcggacg taggtgtcgg cgtagtcgtc ttgcagtttg ggcagcgtga 1006021 tgtcggaatt cgacttgatc acgttgccac ggcggaccgt gatgagggcg tcctggtcgt 1006081 tcggtgagcg cgggatctcg tcgaggccac gcggcacgaa cgggatcgcg aaacccgcgg 1006141 cctcgtcgag ccggcggtcc agccgctcct tgcggtcgtt ggtgatcccg acccagacga 1006201 cggtgccgac aatgagtacc gggatcgcgg cgccgatcgc cgtcgcgacc accacccggg 1006261 ttcgcagcga gggcgtacgg gcgaagatcc gcgacagaat attcatgcat gccccgtcac 1006321 tgcatacgca gcacgaatcc gactccgcgg acggtatgca gcagcctagg gccaccgccg 1006381 gcctccagtt tgcgccgcag gtacccgatg aagacgtcca ccacgttggt gtcggcggcg 1006441 aagtcgtagc cccacaccaa ttccaggagt tgcgctcggg agagcaccgc ggtcttgtgc 1006501 tcggccagca ccgcgagcag gtcgaattcg cgcttggtca ggtcgacgtc gacgccgttg 1006561 acccgggccc gccggccggg gatgtccacc tccagcgggc ccaccgtgat ggtttccgag 1006621 gacgacgttg cagtggagcc gcggcggcgc agcagcgcct tcacccgtgc caccagctcg 1006681 gccagcacga acggtttcac caggtaatcg tcggcgccgg cctccaatcc ggccactcgg 1006741 tcatcgacag agctgcgtgc ggatagcaca cagaccggga cgtcgttgtc catcgcgcgt 1006801 agtgccgtca cgacgctgac tccatcgagc actggcatgt tgatgtcgag cacgatcgcg 1006861 tccggccggt tctcggtggc gctgcgcaag gcctcggcgc cgtccaccgc ggtcgctacc 1006921 tcgaatccgg acagccgtaa gccgcgttcc agcgaggcga gcacatcgga gtcgtcgtcg 1006981 acgaccaaca cccgaggtga ggtcacacca gtgtccatgc cgcccatttt gcctgattac 1007041 cgtccagcag ggtgggaggg tgagccgccg ggtcgcgtgc tgggcgagca gacacagagt 1007101 cgcatcaaaa ccgccgattt tgtgcgactc tgtgtctgct cgcggggtgc gcgcgggtta 1007161 gtcgcggggc aacccgatcc ggcggtagcg ttgcaaccga gtcgcgaggc gttccggggc 1007221 cggtatcttc cgtaacgcgt gcacttcggc ggcgatggcg ttcgacagtc gtagggcgaa 1007281 ctcgatcggc tcgtctgcgg cgtcggggta ctccggcacg atggtgtcga caatccccga 1007341 cttcagtagg tcggccgacc ggatgccttg ggcggcagcg agttcggcgg catgagcagt 1007401 gtctcggaac acgatcgcgc tggctccttc gggaggcaag ggcgccagcc agccgtggag 1007461 tgcggccagc acccggtcgg cgggcaacat cgccagcgcc ggcccgccgc tgccctggcc 1007521 cagcaggatc gacacggtcg gggtatccag cgtgacgagc tcggccaggc aatgcgcgat 1007581 ctggccggcc agcccgccct gttcggctgc ggccgacaac gcgggtccgg ccgcgtcaat 1007641 gaccagcacc agcggcaggc acagctcggc ggcgagcgcc atcccgcgtc gggcttcgcg 1007701 taacgcagcg ggcccgacag tgcttccccc gccgcctact gccctttgct ggccgaggac 1007761 caccgtgggt tggccgccaa agcgggccag cgccagcagc gtggtcgccg cttcgccttg 1007821 atcggttcct gacaacaaca cccggtcggt ggcgccgtgt cgcagtagct gcctgacgcc 1007881 cggccggtcc ggccggcgcg atgccaccac cgagtcccac gtgggcacat cgggtacggg 1007941 cgcgggcgtc tgcggtgccg gaagcggttc gggagcgtcg atgagcaccg tcaacgcacg 1008001 atccagcatc ggtcgtagcc ggtccagtgc aacgacgccg tcgatgatcc catgccgccg 1008061 tagattctcg gcggtttgga cgccggatgg gaaggggtcg ccatagagca actcatagac 1008121 ccgtggtccc agaaagccga tcagggcgcc cggctcggcg acggtgagat gccccagcga 1008181 gccccacgac gcgaaaactc cacccgtggt cggatggcgc aaatagacca ggtagggcag 1008241 gcgcgcctgg ttgtgcagct ggatggccgc agcgatcttc accatctgca gaaacgcgac 1008301 cgtgccttct tgcatgcggg tgcctcccga gcttggtgac gccagtagcg gcagccgctc 1008361 ggcggtcgcc cgctcgacgg cggcggtgat ccgttcggcc gctgccaccc caatcgagcc 1008421 gcccaggaag tcgaactcac aggccaccac ggccacccgc cgcccgaata cgcgtccctc 1008481 accggtctgc accgattcgt ccgcgccggt ggccgcccga gcggcggcca gctcccgcgc 1008541 ataggagtcg gctaccggca ccgccagcgg ctcgctatcc cagctgacga aagatccccg 1008601 gtctagcacc gcgtgccgca gttggtcggt cgtgatacga ctcacgcgat gaggctatat 1008661 aggctgaccc aatgatcggt atcacccagg cagaagccgt gctgaccatt gagctgcaac 1008721 gcccggagcg ccgcaacgcc ttaaattccc agctggtcga ggagcttacg caggccatcc 1008781 ggaaagccgg ggatggatcg gctcgggcga tcgtgctgac cggccaaggc accgcgttct 1008841 gcgctggcgc ggacctgagc ggagacgcat tcgccgccga ttatcccgac cggctcatcg 1008901 agctgcacaa ggcgatggac gcctccccga tgccagtggt cggcgcgatc aacggtcccg 1008961 ccatcggcgc cggcttgcag cttgccatgc aatgcgacct gcgggttgtc gcgcccgatg 1009021 ccttcttcca gtttccgacg tcgaaatacg gtctggccct ggataactgg agcatccgcc 1009081 ggctgtcgtc gttggttggg cacggacgtg cccgcgcgat gctgctcagc gcggaaaagc 1009141 tgaccgccga gatcgcactg cacaccggaa tggcgaatcg cattggcact ttggccgacg 1009201 cccaggcctg ggccgccgag atcgccaggc tggcaccact ggctatccag cacgccaagc 1009261 gggtgctcaa cgacgacggc gctatcgagg aagcgtggcc ggcccataag gaactcttcg 1009321 acaaagcctg gggcagccag gatgtcatcg aagcgcaggt tgcccggatg gaaaagcggc 1009381 cgccgaagtt ccaaggggct taaccgccat ggtgcgccga gcgctacgac tggcggccgg 1009441 caccgcctcg ctggccgccg gcacgtggct gttgcgtgcg ctgcacggca cgccggccgc 1009501 gctcggtgcc gacgcggcgt cgatcagggc tgtgtcggag caatcgccga actatcgtga 1009561 cggcgccttc gtcaacctgg atcccgcgtc gatgttcacc ctggatcgcg aggagcttcg 1009621 gctcatcgtg tgggagttag tggccagaca cagtgcgagc cggccggcgg cgccgatccc 1009681 gttggcctcg ccgaatatct accggggtga cgccagccgg ctcgccgtca gctggttcgg 1009741 tcactcgacg gcgctgctgg aaatcgacgg ctaccgggtg cttaccgatc cggtgtggag 1009801 cgatcggtgc tcaccgtccg acgtcgtcgg cccccagcgc ctgcatccgc cgccggtgca 1009861 actggcagct ctcccggccg tcgacgccgt ggtcatcagc cacgaccact acgaccatct 1009921 cgatatcgac accgtggttg cgctggtcgg catgcaacgg gccccgttcc ttgtgccgct 1009981 cggggtcggc gcccaccttc ggtcgtgggg tgttccgcag gatcgcattg ttgagctcga 1010041 ctggaaccag agcgctcagg tcgatgagct caccgtggtc tgcgtgccgg cacggcactt 1010101 ctcgggacgg ttcctgagcc gcaacaccac actgtgggcc tcgtgggcgt ttgttgggcc 1010161 gaaccatcgc gcctacttcg gtggtgatac cggatacacc aagagcttca cccagatcgg 1010221 cgcggaccac ggaccgttcg acctgaccct gctgcccatc ggggcctaca acacggcgtg 1010281 gccggacatc cacatgaacc ccgaggaggc ggtccgggcg cacctggacg tcaccgattc 1010341 gggctcggga atgctggtgc cggtgcactg gggcaccttc cggctggccc cccatccgtg 1010401 gggcgagccg gtcgagcggc tactcgcggc ggctgaaccc gagcacgtca cggtagccgt 1010461 gccgctaccc ggtcagcggg tcgacccgac cgggcccatg agattgcacc catggtggcg 1010521 gctgtaattc cccgcagcgc ccggctaatg gtgctagggg gcgagccgag gcgatcaaac 1010581 caccgagtgt tccggccgcg ttggctacta tctgcggcca tgaccaaacg agcggcaacg 1010641 gccgccatgg tgatgttgct gacgttaacg ggttgcggat ccacgcacca ggcacttggc 1010701 ccgccgtccg ggttgcccga tgcctctccc aatgagaggt cagcgataca gatccccgct 1010761 ggccgcatcg acgatgccgt ggcaaaggtc gacggcctgg tcggcgagct gatgcagaat 1010821 accggcatac ccggaatggc agtggcgata gtccatggcg gaaagacgtt gtatgccaaa 1010881 gggttcggtg tcagagacgt gggcaaaggt ggtggtccgg acaacaaggt ggacgccgac 1010941 accgtctttc agttggcgtc ggtgtccaaa tcggtcggcg ccacggtggt ggcgcatgcg 1011001 gtaaccgaca acgtcgtgac ctgggatacg cccgtcgtat cgaagctgcc gtggtttgcc 1011061 cttcgcgatc cctacgtcac cggccaggta accattgctg acctctactc gcatcgctcc 1011121 ggcctgcccg accatgcggg cgatctgttg gaggatttgg gttatgaccg tcgacaggta 1011181 ctgcagcggc tgaaatacct gccgctggca ccgtttcgaa tcagctatgc ctacaccaac 1011241 tttggtgtga ccgcggcggc cgaagcggtc gcggccgcgg ccggccagtc ctgggaggac 1011301 ctgtccgacg aggtgctcta ccgcccgttg gggatggggt ctacgagttc ccggttcacc 1011361 gactttctgg ccaggcccaa ccatgcggtc aaccacgtca aggtcgcaga ccgatgggag 1011421 gcgcgctacc agcgcgatcc cgacgcccaa tcacctgcgg gcggggtgag ttcgtctctt 1011481 aacgacatga cgcactggct ggccatggtg ctggccgacg gcgtgtacaa cggccgtcgg 1011541 atcacgtcgc cggaggccct gctcctcgtc tacacgccgc aggtgatctc tcgacacccg 1011601 gtgtcaccga gagcgcgggc cagcttctat ggctacggat tcaacgtggg ggtaacctct 1011661 tcgggacgca ccgagtacag ccattccggc gccttcgggc tgggtgccgc ggcgaatttc 1011721 gtggtgctgc cctccgaaga cctggccatc atcgcgctga ccaacgccgg gcccatcggc 1011781 gtgccggaga cgctgaccgc cgaattcatg gacttggtgc agtacggcca ggtacgcgag 1011841 gactgggcgg ccctgtacaa gaaggcattt gccccgctga acgagctcgc gggctcgctg 1011901 gtcggcaagc aatccccggc caacccagcg ccgagcagac cgctgaacga ctacgtcggc 1011961 gtgtacgcca acgactactg ggggcccgcc accgtgacct accacgacgg ccaactgcgc 1012021 ctgtcgctgg ggccgaagaa ccagacgttc gatttgacgc actgggacgg cgacactttc 1012081 acgttcacgt tgtcgaccga aaacgcattg cccggatcga tttccaaggc caccttcgcc 1012141 ggcgacacgt taaacctgga atactacgac gccgacaagc tgggaacgtt tacccgatga 1012201 cccgttcggc ttcggcgaca gccggtttga ccgatgccga agtggcgcaa cgggtcgccg 1012261 aaggcaagag caacgatatc ccggaacggg tcacccgcac cgtcgggcag atcgtccggg 1012321 ccaacgtatt cacgcggatc aacgcgattc tgggcgtttt gctgctcatc gtcttggcga 1012381 cgggctcgtt gatcaacggg atgttcggcc tgctcatcat cgccaacagc gtcatcggca 1012441 tggtccagga gatccgtgcc aagcagacgc tggacaaact cgcgatcatc ggacaggcga 1012501 aaccgttggt gcgcaggcaa tccggaacgc gcacgcggtc gaccaacgag gtggtgctgg 1012561 acgacatcat cgaacttggg cccggggacc aggttgtcgt cgacggcgag gtcgtcgagg 1012621 aggaaaactt ggagatcgac gaatcattgc tgaccggcga ggccgacccg attgccaaag 1012681 acgctggcga taccgtgatg tcgggcagtt tcgtcgtctc cggtgccggc gcctaccgcg 1012741 ccaccaaggt cggcagcgaa gcatatgcag ccaaactggc cgccgaggcc agcaagttca 1012801 ccctggtgaa atccgaattg cgcaacggca tcaacaggat tctgcagttc atcacttact 1012861 tgttggtgcc ggccggcctg ctgaccatct acacccagtt gttcaccaca cacgtgggat 1012921 ggcgggaatc cgtgttgcgg atggtgggcg cgctggtgcc gatggttccc gaaggcctgg 1012981 tgctgatgac ctcgatcgcc ttcgccgtcg gggtggtcag gctcggccag cgtcaatgcc 1013041 tggtgcaaga gttgcccgcc atcgaggggt tggcgcgggt ggacgtggtc tgcgccgaca 1013101 agaccggcac actgaccgaa agtggcatgc gggtctgcga ggtcgaagag ctcgacgggg 1013161 ctggtcgaca ggaaagtgtc gccgatgtgc tggccgccct ggccgccgcc gacgcccgtc 1013221 ccaacgcgag catgcaggca atcgccgagg cctttcactc gccgccgggc tgggtcgtgg 1013281 ccgcgaacgc gcctttcaag tcggccacca agtggagcgg cgtctccttt cgcgatcacg 1013341 gtaactgggt gatcggcgcg cccgacgtgc tgctcgatcc ggcttcggtg gcggccagac 1013401 aggccgagcg gatcggagcg cagggattgc gggtgctgct gctggctgct ggcagtgtgg 1013461 ccgtcgacca tgcccaagcg ccgggtcagg tcaccccggt agcgctggtt gtgctggagc 1013521 agaaggtgcg gcccgacgcc cgtgaaacgc tggattattt tgctgttcag aatgtttcgg 1013581 tcaaggtgat ctccggtgac aacgcggtgt cggttggtgc ggtcgccgac cggctcgggc 1013641 tgcatggcga ggcgatggat gcgcgtgcgc tgccgacggg ccgcgaagaa ctggccgaca 1013701 cactggactc ttacaccagt tttggccgtg tgcggccgga ccagaagcgt gcgatcgtgc 1013761 atgctctgca atcacacggg cataccgtgg cgatgaccgg cgacggcgtc aacgacgtgc 1013821 ttgccctcaa ggacgctgat atcggtgtgg cgatgggctc gggcagcccg gcctcgcgtg 1013881 cggtggcaca gatcgtgttg ctgaacaacc ggtttgccac gctgccccat gtggtcggcg 1013941 aggggcgtcg ggtcatcggc aatatcgaac gggtcgccaa tctattcctg actaagacgg 1014001 tgtattccgt gttgctggcg ctgctggtgg gtattgagtg cttaattgcc ataccgctgc 1014061 ggcgtgatcc gctgttgttc ccgttccagc cgatccacgt caccatcgcg gcctggttca 1014121 ctatcgggat cccagcgttc atcctgtcct tggcgcccaa caacgagcgg gcctatccgg 1014181 gcttcgttcg gcgagttatg acgtctgcgg tgccgttcgg actagtcatc ggtgtcgcga 1014241 ctttcgtcac ctatctggcc gcttaccagg gtcgctacgc ctcgtggcag gagcaggaac 1014301 aggcgtcgac cgctgcgctg atcacgttgt tgatgaccgc gttatgggtg ctggcggtga 1014361 tcgcacgccc ctatcagtgg tggcgactgg cgctggtgct tgcctccgga ctggcctatg 1014421 tggtgatctt cagccttccg ctggcgcggg agaagttcct gctggatgcc tcgaacctgg 1014481 cgacgacgtc aatcgcgctg gcggttggcg tggtgggtgc ggcgaccatt gaggcgatgt 1014541 ggtggatccg aagcaggatg ctcggtgtga aaccgagagt gtggcgataa ccgcgaatcg 1014601 ccgcgcatta gcgcccgcag ttcgggcaat ccgagggcgt tgcggcgtag tgcatccagg 1014661 cggccattga tggcttcggt agggctggtc ttgccgcggc gccggtcggg gtgggcgtag 1014721 gcggcgatca ggcgctggga gaagccccag cacattttgc cgtgtgttga gcggtgggta 1014781 gcgcgtgcgc ggggtgtgtc gtactcggta gagcggatct ccgcgcggcc ccggtgaccg 1014841 cccgtgagct gttggatggt tggtggtgca tatcgtcggt ctgtcgatcg agaccacagc 1014901 accgaccgac tccgcgatta ctcccatcat ggtccgggaa atcaacatcg gtgagatccc 1014961 cctaggcctc aggctgggca gcgacaccac actgctcgac gccgctctcg cgggtgggta 1015021 acaccggcag ccagctttcg ggcttttccc gaccggctct aagggctggt tgcagtcaac 1015081 cgcaccgcga caagtagggt tcaccagagg atactggggc caagctcgtg gcaagaaacg 1015141 gtacgcatgg gaatcctgga caaggtaaag aacctgctgt cgcagaacgc cgacaaggtc 1015201 gagacggtga tcaacaaagc gggcgaattc gtcgacgagc agacgcaagg caattattct 1015261 gacgccatcc acaagctgca ggacgcggcc agcaacgtcg tcggcatgag cgaccagcag 1015321 agctagcacg catggcgaaa ctgtccggat ccatcgacgt accgctgcca ccggaggaag 1015381 cctggatgca cgcctccgat ctgactcgtt accgagagtg gctgaccatc cacaaggtat 1015441 ggcgcagcaa gttgcccgaa gtgctcgaga agggcacggt cgtcgagtcg tatgtcgagg 1015501 tcaagggcat gcccaaccgg atcaagtgga cgatcgtgcg gtacaaaccc ccggagggca 1015561 tgacgctcaa cggcgacggt gtgggtggtg tcaaagtcaa gctgatcgct aaggtagcgc 1015621 cgaaagagca cggctccgtc gtcagcttcg atgtgcacct cggcggcccg gccctgctcg 1015681 ggccgatcgg catgatcgtc gccgctgcat tgcgagccga catccgcgaa tcgctgcaga 1015741 acttcgtcac ggtgtttgcc ggctgaccgg cgaacgtgat cggtgtcgat gagtttcaga 1015801 ctccggggcg gtcggtacct gtgaaccctg atccagggcc cgacacagac taggaggtca 1015861 tccgtgccta ctcgtagtag cgcgccgctg ggcgcaccct gctggatcga cttgacgact 1015921 tcggacgtcg accgtgccca agatttctac ggcacggtgt tcggctgggc gttcgagtcc 1015981 gcgggacccg actacggcgg atacatcaat gccgccaagg gcggtcaccc ggtcgccggc 1016041 ctgatggcca atcggcccga gtttcagtct cccgacggct gggccaccta ctttcatacc 1016101 gtcgacatcg gtgcgaccgt ggccaagttg gctgccgcgg gcggttcgtc gtgcctggac 1016161 ccgatggaag tacccggcaa gggcttcatg agcctggcgg tcgatccgtc gggtgcggcc 1016221 ttcggcctgt ggcagccgct gcagcaccac ggcttcgagg tgatcggtga agccggctcg 1016281 cccgtctggc atcagctgac gacgcgcgac taccgttccg tcatagactt ctaccgccag 1016341 gtcttcgggt ggcgcaccga acagatttcc gacactgacg aattctgcta caccacagca 1016401 tggttcgacg atcagcaatt gctcggtgtg atggacggca gctcctgtct ccccgaaggc 1016461 gttccgtcga attggaccat attctttggt gccgaggacg ttgacgagac gttgcgggtg 1016521 atctgcgaca acggcggaag tgtggtgcgg gccgccgaga acaccccgta tggccgattg 1016581 gccgcggcag ccgacccgat gggcgttgtc ttcaatttgt cgtctctgca ggcgtaatgg 1016641 cgaatcgggc tgccgcgtgg cgcgcggcga cccgcccatg cgcagtatta gtgtcacaac 1016701 catgacgcgc cgcctgcgcc ctggttggct cgtggcactt tccgccgcgg tcatcgcggc 1016761 cagcacctgg atgccttggc tgacgacgac cgtcggcggt ggaggctggg tcaacgccat 1016821 tgggggcaca cacggcagcc tggagctccc gcacgggttc ggcccgggtc agctcatcgt 1016881 cttgctttcc tcgacgctgc tggtggttgg cgcgatggcg ggacgcggcc tgtcggtgaa 1016941 gctttcctcg attgccgcgc tggtcgtctc gctgctcatc gtggcactca cggtgtggta 1017001 ctacaagctc aacgtcaacc cacccgtgtc agccgaatac gggctgtact tcggtgccgc 1017061 cggcggggtg tgcgcggtgg gttgctcgtt gtgggctgcg gtgtcggccg cttcgcctgg 1017121 gcgtcgtcgc catcgtgaag tggtgcggta gaacatttca gcccggcgga actcgtgttt 1017181 tccccgtgcg gggctggctc ccgattgggt agccccgtac acgaaaggcg caaacacaac 1017241 ctcgcggcca tccgggtcgc gatagatgac ggctccggta gcttctcaaa gggggcgttg 1017301 ttccaccggc tgggtcgcca cctcttgcag gccagtaagt gcggcttgct gcccccgctt 1017361 gtccgagcaa taccggcaca gctgttcgtg atggtctgtt tcgacgattt ccagcgtcag 1017421 gctcgagctg ggaaggccga atatcgcgcc gttgcttccg tagctttcgg cgaaggtctg 1017481 gtcgagcatt cccaccagat cacggtagaa ccgcactgtc tcttccaagt tcgacgggcg 1017541 gggccgattg acgccaatcc ctcgggccag tgcgttgttc ggcggcgctt ttcgttgtcc 1017601 accgctactc cgtttcgccg aggctgcact tctgcaggcc gctactcgca gttttgctgc 1017661 ggattttccg gctcggtgca attcatagcc caacggcagc cgccggcgac tctgcgtgat 1017721 cccagcgacg caactcggcg cccggcaccc acgccgaatg cgtgccgctg gaaatacgtt 1017781 ccggcagtgc aagcttgcat atcgggccat cgccggggcg cgccgcgtcg aaaaccaggc 1017841 aatacgatgc gtcgtcgttc atgtcggtgg tgagggtgac cagatagccg tcgtcctcgg 1017901 cgctgctgcc cacccgtgga gccatcgcgg tctcacttcc gtagacgccg tcaccgaacg 1017961 agtaacactc gtggttgccg gtgagcagat cgtgcttaac cagtccgtcg aacaggaacc 1018021 aactcggttt gccggtagcg gcataggtgt aacggtagct gctggccgcg taatcggcgt 1018081 tgatggttcc gaactcggtg atggactcgg acagttgctc ctcgtggact gccccggtca 1018141 ccatattgag ccgccaccga tgtagccggg actgcagccg atccagagcc aggaaccgaa 1018201 acagcttctc ccacttcgtt cctccggtgt caagtggctg cggatcgcct tcgtagaagc 1018261 cgtcgagcac gatctcgtcg ccctgctcgt aggcgttggt gaagtgcaac acgaacgttg 1018321 gatcggcttc gaaccagcga atgtcgttgc ctcggcgagc aacaaccgca aaccgagatg 1018381 gaatctccgg atagaagcgt ggtaggtgca cgtcgcgctc gagcagcctg ggatcccaga 1018441 acagtggaaa atcgttgagg attacgtaat tttcggtgaa cgccatgtca tgcagtagcc 1018501 gcggcccggg cagcggaaca tcgacatagt gcacaagctc attgttctgg tcgacaacgc 1018561 cgtagcgcat atacggctct tgcttgctgt agttgaagaa caacagttcg ccggtcttgt 1018621 tgtctacctt cggatgtgcc gacacgcccc agtcgaacgg aaaccttccg tgccagctct 1018681 ccttgccgag cgtattggcc gagtacgggt cgatccgata cagatcgccg cactggtaga 1018741 agctagtcag cgcgatacct cggtggacga tgacgtcggt gctcgacgcg tccttcatga 1018801 ggccacgagc gccccagccg tgttcccgct tggccagttg caccggttct gccagacccg 1018861 gccacagcgg cccgccggcc tcgttctcgg ccaagaatcc atcggtgcga ataaatcggt 1018921 tgcggtagaa ggcttttcca tcacggaagc cgacgacatg gatcatgccg tcgccatcga 1018981 aggggtggta ggtcgcgaat gccgggtgta gcgggttctc ggtgttgcgc aggtagatgc 1019041 cgtccaggtc ggcggggact tcgcctgtca cggtggtcag gtcgtcggca tcccattcgg 1019101 tggtctgtgg tcgccacgga ccggtgcgat aggggtggtc gtcgtcttcg ggaagggtcg 1019161 acaagtactt gccgacaatc gtgatgtcca tttcacgatc ctcgtgtggt gctgacaacg 1019221 aaactgaccg tggtggccgt gctgccaccg aaattcagcg tgccgaacgc ttcggcgttc 1019281 tcgacctgat agtcaccggc aatgccgctc acctgtttgg ccgcgtcgag cagcatccgc 1019341 acaccggaag ccccgaccgg atgtccgcca ccgatcagtc ctccgctggg gttgatgggt 1019401 agccgcccgc cgatctcgat ctctccgttc tcgatggcct tccaagattc cccggggccg 1019461 gtcaacccga tgtgatcgat ggccaggtat tcgctggggg tgaagcagtc gtgcacctcg 1019521 atcccgtcca gatcgtcgag ggtcacccgg gcgcggcgca gggcgtccag cactgtggcc 1019581 cgcacgtgcg gcagtaggta gggggccgag tcgccctggg cgacgcggtc cagtttctgc 1019641 cgcagaccca acccgacggt gcgatgtccc cagccgtcga tgcggccgat cgggcgcgcg 1019701 tcgcgatggt cgcgcagata ggcatcgctg accaggacca atcccgcgcc gccgtcggtc 1019761 atctggctgc aatcaaaccg tcgcagccgg ccttcggtaa gagggttggt cgcgtcgtcg 1019821 tcggtgatcg ggtcggggat cgtccagccg cgggtctgcg cgttggggtt gcggcgcgcg 1019881 ttggcgaagt tgagttgagc gatggcccgc aggtgagtgt catccaaacc gtatcgccgg 1019941 tcgtattcgt cggcgacctg agcgaacatc gacggccata agtagcgggc ctcggctcct 1020001 tcgtgcccgg tccaggccgc ggcactcaga tgctcggccg cggtgtcgcc gggcacggtc 1020061 ttctccagct ctaggcccac gacgagcgcg acacggtacg cgcctgatcg caggtcggcc 1020121 atcgccgcga gcgtcgccac gctgccggat gcgcacgcgg cctcgtgccg ggtggccggc 1020181 gtgtcccaga gatcgtcgca gacagtggcc ggcatcgcgc cgaggtggcc ttgacgggcg 1020241 aacatctcgc cgaaggcgtt cgcgacgtgg acgactcccg cagcggctag gtcggcggcg 1020301 tccaccttgg ccgcggtgag cgtgccgtcg acgacctccc tagtcaggtc ggcgaagtcg 1020361 cggttctctt tgctgaggtt gcgagcaaaa tcgctctgat agccgccgag aatccagaca 1020421 ccgtcgtcca tagccgtacg ctactacaag cggtgtgaac ggcccgtcgg atagccacgc 1020481 tcaccaggca ttttccgcgc ggcgacgaac ggttgccgga cttttaccgc ggggggtttc 1020541 cgggcggcgg ctgctctcta atcacaacta ccgggggttt gcggccgtcc tcttggccgt 1020601 cagtgctggt gccgctacgg gtgccgccac cgcccgtcgt gccgcgtgcg gccaggctcg 1020661 ccaaagccat cccgctgagc aggcctgccg gcatcccgtt tagggccgtc gggtcggcgc 1020721 cggcgctgga gctgaaggtg ggtgttgcct gaacggcgag ctggatctcc ggggcggccg 1020781 tggtccagct gtgcggcacc gacaacgctc cgactaatgc tgcgtggccg acgcccgcgg 1020841 acaccggcgc cgcgcccccg aaggggcccc agtgcggctc cggctcgtcg gtcgccgaac 1020901 tcagtggatg gccctgcgtc ggtcccagcc cgccggcgtt cccgtatagg ccgatgtgcc 1020961 agggtctggc cgtgttcgtg atcgcgagcg caatgctgcc ggtcgcgatg gatgcaatgt 1021021 agagcgcgat cacgtccaat tcccctatcg gggtggggat cactatcggc tgagcggatc 1021081 cgacttgcgg gttgagggtc gacgcgatcc ccaacagtcc cgatgtcagc ggatcagcgt 1021141 tggcggccaa tgcggacaga atgtcgctca ggatccccgg gggcagctgg gccagtgtcg 1021201 cctgtgcatc cgcaacggcg cccgcaccgg cggcttgggt cgccgcggct gcggccgcgg 1021261 gcccggccgg gccggtgcct tgcacgggtg gagtgaacgg cggcaacgcc gacgcggccg 1021321 cagatgcccc ctcatagctg tacatcacgg cagcgtcttg ggcccacatt tcggcatact 1021381 cggcctgggt agccgcgatc gccgcactgt tttgccccag aatgttcgcc gcgaccagcg 1021441 acatcaaccg gctgcggttg gccgcgacga gggatggtgg caccgtcatc gcgaacgccg 1021501 tcccaaacgc ttccgccgct gccctcgcct gtgtggccgt ctccttcgcc agcgccgccg 1021561 tggcggccag ccaccccaca tacggcgttg ccgcggccgc catcgcggcc gccgccggcc 1021621 ccatccacgg ctcaacgatc agcgtcgaca ccaccgatcc atacgagacc gcggcggaag 1021681 tcaactccgc ggccacaccg tcccaggcgg ccgcggcggc tagcatcgac tccggccccg 1021741 gaccggaata cattcggctt gaattcactt ccggaggtaa aagcccgaaa tccattgcca 1021801 gcaacctcct taaccggtcg cgaccacatt gacggcctcg gtggtcgcat acgcatcggc 1021861 ggtggccgcc gggagggcca cgaacatgcc atggaccagc gcggccggct tactcaccac 1021921 tcggtagtgc ttggtgtgcg cggtgaaccg ggccgccgtc aggaccgaca cgtcattggc 1021981 agcagggggt aacacccccg tcgtcggggc acagacggct gtgttccgag cactcacggc 1022041 ggtaccgatc gtcggcaagt cccccgtcgc ggctgccaag accaccggct ggatggtcac 1022101 aaaagacatc ggataccacc tgacgcggat cgcttcatct gatcggtcga catcttctac 1022161 ataaccacgg aaatgtctgc tttataacgg aattagacta ctttgtgttg tctggcgttg 1022221 ctctgcaccg acggcatggg taaacgtctg agatgcgggt gtcggcggta gctgaaaaac 1022281 cgtgctgaca accatgattc gccattcccg aacgacctgc gaactttgtc gcctagcgta 1022341 acgccgtggc gagatttggc tcgattgttc gcagtggcgt tacgctcgcc acgcgtgagc 1022401 ctggatcagg caaacgcggc tccacctggc catttgctgt ccgagacggt agttactcag 1022461 catggtgcac aggtctgtgc ttgtctggtt gatggtgatt tggcgttgcg gtggccgtga 1022521 tgaggacgcg gtgagaaacg gagcttgaag atatgtcagc gaaagaacgc ggtgaccaga 1022581 acgccgtcgt cgacgccctg cggagtattc agcccgcagt cttcattccg gcttcagtgg 1022641 tcatcgtcgc catgatcgtc gtttccgtgg tgtactcgag cgtcgccgag aatgcgttcg 1022701 ttcggctgaa ctccgcgatc accggcggcg tcgggtggtg gtacatcctg gttgccaccg 1022761 ggtttgtggt attcgcgctg tactgcggca tttcccggat tggcactatc cggctgggcc 1022821 gcgacgatga gctccccgag ttcagcttct gggcatggct ggcaatgctg tttagtgccg 1022881 gtatgggtat cggcctggtc ttctacgggg tggccgagcc gctcagccac tacctgcggc 1022941 caccgcggtc acgcggcgtg cccgcgctta ctgatgcggc ggctaaccag gcgatggcgc 1023001 tgacagtgtt ccactggggc ctgcacgcct gggcaattta tgtcgtggtt ggcctcggta 1023061 tggcgtacat gacctatcgg cggggtcgcc ccttgtcggt gcgctggctg ctggagccgg 1023121 tcgtgggtcg gggccgtgta gagggcgcct tggggcacgc ggtggacgtc atcgccattg 1023181 tcggaacact ctttggtgtc gccacgtcac tgggcttcgg tatcactcag atcgcctccg 1023241 gcctggaata tctcggctgg atccgggtgg acaactggtg gatggtcggc atgatcgccg 1023301 ccatcaccgc cactgcgacg gcgtcggtgg tcagtggggt cagcaagggt ttgaagtggc 1023361 tgtcgaacat caatatggcg ctggccgccg cattggccct gttcgtgttg ttgctcgggc 1023421 cgacactttt cttgctgcag tcgtgggtgc aaaatttggg aggctacgtc cagtcgcttc 1023481 cgcaattcat gctgcgcacc gcgccgttct cgcacgacgg ctggctcggc gactggacta 1023541 tcttctactg gggttggtgg atcagctggg ctccgtttgt cgggatgttc atcgcgcgga 1023601 tttcgcgggg acggacgatc cgggagttca tcggggcggt gctgctcgtt cccaccgtga 1023661 tcgcctcgct atggtttacg atcttcggtg actcggcgtt gttgcggcaa cgcaacaacg 1023721 gcgacatgct cgtcaacggg gcggtagaca ccaacacatc gcttttccga ttgctggacg 1023781 gtttgcctat cggggctatt accagcgttc ttgctgtgct ggtgatcgtg ttcttcttcg 1023841 ttacgtcgtc ggactccggt tcgttggtca tcgacatctt gtcagcgggt ggtgagctgg 1023901 acccgcccaa gctgaccagg gtctactggg cggtgttgga gggggtagcc gcggccgttt 1023961 tgctcctgat cggaggtgct gggtcactga ccgcgttgcg gacggccgct attgccacgg 1024021 ccctgccgtt ctcaatcgtc atggtggtgg cgtgctatgc gatgaccaaa gcgttccact 1024081 tcgacctggc cgccacacct aggctgctgc acgtcaccgt gcctgacgtg gttgcggcag 1024141 gaaaccggcg acgccacgat atctcggcga cgctgtcggg gctcattgcc gtccgtgatg 1024201 tcgatagcgg cacatatata gtccaccccg acaccggcgc tctcaccgtc actgcaccac 1024261 cagatccgtt ggacgatcat gtttttgagt ctgatcggca cgtaacgcga agaaacacaa 1024321 catcatcgag atgatgtgtt atcgacctgc cgggtcgccg ctgcctggac cggagccggc 1024381 tacttccggt aaacgcgcac cgctggatga atcgccgcgg catgagaagc tcgacggtgg 1024441 tgccgggatc gtcgcgcacg atgtcatgct ccagggtgct ggtcagccga tggcctttgg 1024501 tgtgccactg accgggtcga tctccgcggc cggcgaccac gccacggtcg cgtccatagc 1024561 acaggtcgcg cggcgcgcga cggcgtgacc cgacatcaag tccttatcgg aggagcttgg 1024621 cccctcgcgt tggtccgcgg caggctcggt cggcaaatcc tcaaatcggc cccaagttgc 1024681 accgagcggg agcggcggtg acggccaacg tgtggtgtcg tgcgggcggc attcggatgg 1024741 cgccacggcc ggtcatcccg gtggctacgc agcagcgcct gcggcggcag gcggatcgcc 1024801 agagcctggg tggtagcggc ttgccagcgt tgaattgtac gcctatcagg cacacaattg 1024861 atgtcatggc taccaagcct gagcggaaga ccgagcgtct tgcagcgcgc ctgacccctg 1024921 agcaggacgc gctgattcgt cgtgctgccg aggccgaggg gactgacctc accaatttca 1024981 cggttacagc ggcgttggcg cacgcgcgcg acgtgctggc cgaccgccgg ctcttcgtac 1025041 tcaccgatgc cgcgtggact gagttcctcg ccgcgctgga ccggcccgtc tcacacaagc 1025101 ctcggttgga gaagctgttc gccgcgcggt ccattttcga caccgagggg tgagcggcta 1025161 cagcgcgccg cgacgtatca gcgacgccga tgacgtcacg agcttcagca gcggcgagcc 1025221 cagtctggac gattacttgc gcaagcgggc gttggccaac catgtgcagg gagggtcgcg 1025281 ctgtttcgtg acgtgccgtg acggtcgggt agtcggcttc tatgcgctag cgtcagggtc 1025341 ggtcgcacac gctgatgctc cgggacgggt gcgccgcaat atgcctgacc ccgtgccggt 1025401 gatcctgctg tcgcggttgg cggttgatcg caaagaacag ggcaggggcc tgggcagtca 1025461 tctgctgcgt gatgcgatcg gtcgctgtgt ccaggctgcg gactcgatcg ggctgcgggc 1025521 gattcttgtt catgcgttgc acgatgaggc ccgcgcgttc tacgtccact tcgacttcga 1025581 gatctcgccg accgatccgc tgcacctaat gctgttgatg aaagacgctc gcgcgctaat 1025641 tggcgactga tgctacgcga ttgactatcg agagccaggc tacgtcatct gataccaacc 1025701 aatcaccgac cacagcaccg accagaacaa gccacgacca ctcggctgac acctgaaaac 1025761 catggctgaa ctgcgcaaac acagagtgcc cccggcagga ttcgaacctg cgacaccggc 1025821 tttaggagag ccgtgctcta tcccctgagc tacgagggcg gggacgcctt tgaatacctg 1025881 actaaaacct agccgttcgc cgcgccggcc gggactgtcc gatattcggt gtaagtggcg 1025941 tttctcggga tttttctttc ggtcagcgtt cttcggcggc tggcatgcga tcggcgaacg 1026001 tgatcgccag ggcgttgagc gctggcttcc agcgtacggc ccacttggtt tgcccggtgc 1026061 ccttgggatc cagggagcgg gtgaccaggt agagcgtctt gagtgctgac tgttcgttcg 1026121 ggaagtgtcc acgtgcccgc accgcccgcc ggtagcgcgc attgagactt tcaattgcgt 1026181 tggtagaaca cgggactcgc cgtatttcga catcatagtc caggaacgga atgaactctt 1026241 cccacgcgct gtcccacagc cgtgtgatcg ccgggtaagg cttaccccat ttctcggcga 1026301 actcctcgta gcgcaacctg gcctcagcgg cactggctgc ggtgtagatc ggcttgaggt 1026361 cgacgctgat cttgtcccag tacttgcggg aggcataccg gaaagtgttg cggatcagat 1026421 ggatgatgca ggtctgcacc gtggccaacg ggaacgccgc ggacacgctg tcgggcaacc 1026481 ctttgaggcc gtcgcagacc aggaagaaga tgtctttgac cccacgattg cgcagttcgg 1026541 tgagcactgc cagccaaaat ttggctgact caccgtcgcc ttcgccggcc cacatcccca 1026601 ggatgtcctt gtggccgtcg aggtcgacgc cgatcgcggc gtagaccggc cggttgcgga 1026661 cctgcccgtc gcggatcttg accatgatcg cgtcgatgaa caccgcggcg tagaccttct 1026721 ccagcggcct ggaccaccac gcctgcatct cctcgatgac ccggtcggtg atccgcgaga 1026781 tggtgtcctt ggacaccgac accccgtaaa cgtcggcgaa gtgagccgcg atctcgccgg 1026841 tggtcaggcc tttggcgtac agcgacaaca ccacccggtc cacatcggtg acccggcgct 1026901 tacgtttgcc cacgatcacc ggctcgaagg tgccgttgcg gtcacggggc accgcaatct 1026961 cgacctgtcc gcacgcatcg gttatcacct tcttgttacg agatccgttg cgtgagtttc 1027021 cacttccacg cccggctgcg gcgtgcctgt cgtagccgag gtgttcggtc atctcctctt 1027081 gcagggcggc ttcgagcacc gtcttggtca gcgccttgag caacccgtca gggccggtca 1027141 atgcgacccc ctcagcgcgt gcctggcgta ccagatcacc caccagcgcc cgctcggcac 1027201 cggagagctc acgggccgca acggccgcct catccacgtc ctggccggcg tgagccggct 1027261 ctatcacctg agcagcatcc atgcccttga gtgtgtttgg tcatagcagt gattccttct 1027321 gccccacgcc gggggcggtc agaaccactt acaccgaatc agcgatagac ccctccggcg 1027381 gcgggggggt tggcggtgtt tgtggcgtcc ggtcgtcggg gtgcggcggg tgtgagtgta 1027441 gcgggcgcaa cgagggccac ctgacgctcg ggcgtgtgtg gtgggcgctt gtcggccaac 1027501 gctctggggt tcagagctgt tgcgtgttga gtgtgtttta gtgtgcgtta gtgtgttcta 1027561 attggcggcg tgaatctggc ggattgggcg gagtcggtgg gggtgaatcg acataccgct 1027621 tatcgctggt ttcgggaggg gacgttgccg gtgcccgcgg agcgggttgg ccggttgatc 1027681 ctggtcaaga cggccgcctc ggcgtcggcc gcagcggcgg gagtggtgct gtatgcgcgg 1027741 gtgtcaagcc atgataggcg ttcggatctg gatcggcagg tcgcgcgtct aaccgcgtgg 1027801 gccaccgagc gtgacttggg ggtggggcaa gtggtgtgcg aggtcggttc cggcctgaac 1027861 ggcaagcgac ccaagctgcg gcgcatcttg tcggaccccg atgcgagagt catcgttgtg 1027921 gagcatcggg atcggctggc gcgtttcggg gtggagcacc tcgaggcggc gctgtctgct 1027981 cagggccggc ggattgtggt cgccgatcct ggtgagacga ccgatgatct ggtgtgtgac 1028041 atgatcgagg tcttgaccgg tatgtgcgcg cggctgtacg ggcgtcgcgg tgcgcgcaac 1028101 cgggcgatgc gtgcggtcac ggaggccaag cgtgagccgg gggcggggtg atgatcgtca 1028161 ggatgcgtag ctgcgctcag gccgcgaagg tggccgaggc caccggtggt gtgcagctgg 1028221 cgggcaagcc gaaacccgat gggacaccga cgttctcccg gtatgtggag atcggcgtgg 1028281 attttgaggc gcaccggccg gtggtggagt cggtttcggt gctgttcgag ctttatgacg 1028341 gcgacgccaa cagttatgcc gcgaccgggg ggccgggtgc ccaactgccg tcgggctgga 1028401 tggtcacggc ggcgaaattc gaggtcgagt ggcccgccga cccgcagcgg gcgggtttgg 1028461 tgcgttcaca tttcggcgcc cgccgcaaag ctttcaactg gggcctggcc caggtgaagg 1028521 ccgacctcga cgccaaagcc gctgatccgg cacatgagtc ggtggactgg gacttgaagt 1028581 cgctgcgatg ggcgtggaac cgagccaaag atgacgtggc gccgtggtgg gccgagaatt 1028641 ccaaggagtg ctactcgtcg gggttggccg atctggccca gggcctggct aattggaaag 1028701 ctggcaagaa cgggacccgc aaaggccggc gggtgggctt cccgcgattc aaatccgggc 1028761 ggcgtgatcc tggcagggtg cggttcacca ccggcaccat gcgcatagag gatgaccggc 1028821 gcacgatcac ggtcccggtg atcgggccgc tgcgggccaa ggagaacacc cgccgggtgc 1028881 aacgccacct cgtgagcggg cgcgcgcaga tcctgaacat gaccttgtcg cagcggtggg 1028941 gccggttatt cgtggcggtc tgctacgcgc tgcgcacccc gaccaccaga tcaccgctca 1029001 cccagccgac tgtgcgcgcc ggaatggacc tgggagtccg gaccctggcc acggtcgcca 1029061 ccctcgacac cgccaccggc gagcagacca tcatcgaata cccaaacccg gccccgctca 1029121 aggcgacact cgtcgcccgt cgcagggccg gccgagaact ttcccgccgc atccccggct 1029181 cccatgggca tcgggcagtg aaagccaagc tggcccgcct ggatcgccgg tgcgtgcacc 1029241 tacggcggga agcagcccac cagctcacca ccgagttggc gggcacctat ggccaggtcg 1029301 tgatcgaaga cctcgacgtg gccgcgatga aacgcagcat gcgccggcgg gcgtttcgcc 1029361 gatcggtctc cgatgccgca atgggtttgg tcgcgccgca gctggcttac aaaacggcca 1029421 agtgcagcgg cgtgctgacg gtggcggacc gctggtttgc ctccagccaa atccaccacg 1029481 gctgcaccag ccccgacggc acaccgtgcc ggctgcaagg caagggccgc atcgacaaac 1029541 acctgctctg ccctgtaacg ggcgaggtag tcgaccgcga cagaaacgct gctttgaatc 1029601 tccgtgactg gccggataac gccagtcgtg gtccagtcgg gaccacggcc ccatcggcac 1029661 ccgggccaac caccacggtt ggtacaggcc atggcgcgga caccggatca tccggcgccg 1029721 gcggagcatc cgtaagaccc cgcccacgca gggccggacg cggcgaggcc aaaacccaaa 1029781 ccccgcaagg ggacgccgca tgagagtgca actaaaacac actcaacggc aacggtgtcg 1029841 tcgggatgcc agcgccgccc acgcatcttc acttgatcga gatcgatcag gtgatcggcc 1029901 gctcattggc ggccgcggca tcatgcagat ggttgacgag ctgcgtgcgg ccgcttccgg 1029961 tccaaaatcg ccagacagct accaggaacg ggccgcagtt accaggccct gtaccagggt 1030021 agcggtgacc ggtgacatgc cgccgacgcc ggggagggta ctgcgtgggc ccagacccct 1030081 tacccgaatc gatagttcca gctgggtccc gccgtcgcgg acccggttga ccggattgtc 1030141 tggatgcagg ccgcggagct cctccgggat ggcggccaga tcggtgacta cccgatagcc 1030201 gggcagctgg atgtgccgcg cgagatgggc ggcaagcgcg cggttgcggc ccccggccaa 1030261 cagctgggtg ctgcgcccga tgcggtcgta gccgtgcagc gacacggcga cgtcaacgtg 1030321 gtcaaggaat tcggcgaggc gcgccgattc cgcagggtcg aaccgggccg acggcaggtg 1030381 gtgcgggtag ttgtccggat ggcgcagcag gtacaccgaa gcgcccgcag cctcggcgga 1030441 gcgttcggcg atcaggtcgg tcacctgctc caggccgccc ccgtggatgg cgaggaagcc 1030501 gaagcgggac cgcagctggc tcgtctcgat gacgccgggc tggcttagca actccgaaag 1030561 tgattgtggc gcaggcccag atctcgatga cggtaacact ggcaggggcc accgcgcggg 1030621 gtcccagcgg tgcagatagt cgatccagcg ttgcggcagc ccgtggtgtc gagcgccgtc 1030681 gatgacgcgc ggtagatagc ccggccgcgg ccggcccggc atcacccggt ggtcaatgta 1030741 gacccaggcc ggcaacgctg tgtcgtcggt gtgcacggtc aaccgttcgc gccggtagcg 1030801 caccggcacg ccttcggcgc tgtccaacct gaccaggtcg cgctcggaga gctgccatag 1030861 cacgccatgc accttgtttc cggcgaaggg ttcgacggtg gccacgccgc gctggttgat 1030921 cagccagttg tgatcgctga gcactgccgg ccgcggagca ccggcgtcgg gacagcgcga 1030981 cgccatctgg tgggcgcaca ggttggaccc gtaggcgaag tagggatgcc ggcggtccgg 1031041 cattcagccg gtcaccgtga gatagatcag catcacgttg agcagactaa ccatcaccgc 1031101 gaccacccag ccaacccaag tcgtggcgcg atggttggtg tcgccgccca tcaccgcggg 1031161 gctgccggtg agtttgacca gtggaagtac cgcaaacgga ataccgaacg acagcaccac 1031221 ctgtgagagc accaatgtgc gggtggggtc gaagcccagc gtaagtatcg ccaacgcggg 1031281 gcccagcgtg attaggcggc gcaccagcat gggaacgctc cagtgcagca gcccctgcat 1031341 gatcatcgcg ccggcgtaag cacccaccga cgacgacgcc aagccggacg ccagcaaccc 1031401 gaccgcgaag agcaccgcga tcgtcgcccc caaggtgtcg tggacggcgt ggtaggcgcc 1031461 ttcgatcgag gcggtgtccc cacggccccg catgttcagc gcggcaacca gcagcatcgc 1031521 ggcgtttacc ccgccggcta tcagcatcgc caggccgaca tcccagcggg tgacgcgcag 1031581 cagccggcgc cgctgagggc ccggatcggg atgcccgtgc cggtcgcgcg cgagacctga 1031641 atgcaggtag acggcgtgcg gcatgacggt cgcccccatg atcgccgcgg ccaaaagaac 1031701 gctctcggtt ccctgaaagc gcggtgccaa accgccgagg accgcattgg ggggtggtgt 1031761 cacgacgaag aaactggcgg tgaagccgat ggcaatcacc agcagcaagg cggtgatgac 1031821 gcgctcgaac aaacgttgac cgcgccgatc ctggatcgtc agcagcagca gcgagaccac 1031881 cccggtgatg atcccgccga tcggcagcgg caggttgaac atgatccgca atgcgatagc 1031941 tccgccgatc acttcggcca catcggttgc catcgcgacg atctcggcct gtgcccagta 1032001 ggccagccgg gccgggcgtc ccattcgctt gccgatcgct tccggcagtg agcgtccggt 1032061 caccagcccg agctttgccg acaggtactg caccagggcg gccatcacgt tggcggcgac 1032121 gatcacccat aacaacaggt agccgaactg ggcgccggag ctgacgttgg ctgccacgtt 1032181 cccggggtcg acgtaggcga tggccgcgac aaaggctggc ccgagcagat accagctcgt 1032241 cttcagggaa gtccgggtgt cctgggccaa ctcaccgact ttcgatccac gcgaacaaag 1032301 atgcgagagt aaccgaaatt cgcccgccac caaccaccgg gctactcggg acctccgctg 1032361 gctatcggta gtcggggttg gcgaagtccg gccggcagcc ggcgtcccac ttggtgcgtt 1032421 gattgccgta ggccgggatg ccgccggcgc ccgcaacatc tgcgcgatgt gcatcagatt 1032481 gaacgtcatg aatgtggtgt tgcggttggt gaagtcgttc tctggaccgc cggatccggg 1032541 gtcgagatac gacggtcccg gccccgcttc accgatccag ccggcatccg cttgcggcgg 1032601 gatggtgtat cccaggtgtt gcaggctata gagcacattc atcgcgcaat gcttgacgcc 1032661 gtcctcgttt ccggtaatga ggcaaccacc ggcgcggccg tagtaggcgt actgtccatc 1032721 ctcgttgagc aggctcgagc atgcgtacag gcgctcgata acccgtttca tcaccgagct 1032781 gttgtcgccc agccagatcg gcccgcacag caccaggatg tgcgcatcga ggacacgccg 1032841 atacagggcg ggccattcgt cggtcgccca accgtgttcg gtcatgtccg gccatacgcc 1032901 ggtcgctatg tcatggtcaa ctgcgcgcag agtgtcgacc tggacgccat gctcacgcat 1032961 gatccccgag ctgcgctcaa tgagcccgtc ggtatggctg agctctggcg agcgcttcag 1033021 tgtcgcgttg atgaacagcg cacgcagccc gtcgaatcgg ggtggggccg cggcgttctg 1033081 gtcagaggtt gtggtcatac gtcataccca cctgcctgtc atcgtcgtgc cgggttgccg 1033141 ctgggcggcg gtgctggtgc caagaaatga ccgatcaggc agcagcgtac cgcccttcac 1033201 cggtgatcag gggtaggtcg agggttgtcc ggatacccgg ttcggcggcc accactgcag 1033261 ggatcgcgtt gacgatgcgc atcgcggtgg cgaccagtcc ggcgtggttg tggtccccgt 1033321 ggcggctgct caggcagatg tccatggcgt agcagggctc gccggagatt tcgatgcggt 1033381 acgagccgcc cggctgggcg ggctgcggcc actcgggaca taggtccgcg cgcaaccggg 1033441 tcacgtgttc caggactacc gctggcacgc cgtcgaccag gccgagcacc tcgaagcgca 1033501 gggcggcggc gctgccctta ggaatatggc ccgatgcaat gttgaaggcc tccggcgccg 1033561 gctcccggac atacatttcc tcgaccccgt caagtgaaat gccaaggccc gcagcaagtt 1033621 gtcggaccac tgatccccag gccaggctga gcacacctgg ctgcagcagc atcgggatct 1033681 ggtccatcgg cttaccgaag cccatcacgt cgaacatgac tacggcgctg tcataggtgg 1033741 cgtagtcgac gatctccatg cagcgtatct gctcgatgct ttcacaggtg ccggccaacg 1033801 ccatcggcaa caggtcgttg gcgaaacccg gatcgatgcc gttcacgtac agacttgaat 1033861 ttcctgcgcg cgcagcgtct tgcaaaggct tgatgatctc gtcggggatc acctgccacg 1033921 gatattgcaa gaacaccggg ccgctgccga cgatattgat ccctgccgcc aagattcggc 1033981 ggtagtcttc cagcgcctcg ggcagccgat tgtcggccat cgcgttgtag acggcgcacc 1034041 gcggcccggt ggcgagcacg gcgttcagat cggtgctggc ccgcacaccc gtcgaatccg 1034101 ccagcccggc aagctctgcc gcatccttgc cggctttggc gtccgatgac acccagacac 1034161 cggtgagctc gaactccggg tcggcgatga gcgcacgcaa cgagtgcacg ccaacgttgc 1034221 cggtgcccaa ttgaacgacg ggtatggcca tggcgggctc cttagcggta ggggtcagac 1034281 tgcgactgct cgcgcatcat cggttcacag gtccggaatg ggaaggtcga gattggggaa 1034341 ggtgagtccg ccgtcgacct ccaacgtctt gccggtcagg aagctgcccg ccggagaggc 1034401 caaatacact gccgcagctg caatgtcgac ggggtcaccg agccggcgca gtggtgtcgc 1034461 ctgctccatc ggcgcacgca gctcgtcgtt ggcggctacc acctccagcg ccgaggtcag 1034521 gatggaaccc ggcgcgatcg cattgacccg gacgcgtggg cacaggtcca gcgccgccag 1034581 ccgggtgtag tgggccagtg cggccttggc ggtgccgtag gcggcgaaac cccgcgccgc 1034641 cagccggccc atggtggagc tgatgttgat cacgctgccg ccgccggagt gttccagcat 1034701 cagcggcacc gccgcgacgg tcagcgcgtg ggcggtgccc acgttgaagg cgaaggcgtc 1034761 cgcgaggtcc ttggtcgagg tgcttagcag cgtgttgggc atggtgccgc caacgttgtt 1034821 gacgacgatg tcgagcttcc cgaaagctcc gacggcctga ccagccagct gcgcggtcac 1034881 ctcgggatgg gccagatcgg cggcaacggt gtgggcgcgg cggccggcag cgcggatctg 1034941 ttcggcgaca gcgtcaagct cggatgatgt tcgtgaagcg atgaggacat ccgcgccggc 1035001 ctgggcgaaa gccaatgcga tggctgctcc caggccgcgg ccgccgccgg tgatgacggc 1035061 aaccttgtcg tcaagacgga acatatccag gatcatggcg ccctcttttc cggctgtcgg 1035121 ccgaaacggt aacaagcttg ctgcagcttc ctgtgactgc tcccgaaacc tgggggtgtg 1035181 cctgctgtgt atgcacggca tacggacatc cttcccctga gacccgcggt cgaaccagcc 1035241 acgtgtccat catcaggggt caaccccggc caagggcgac ggcacgccaa gttcgccgac 1035301 cgttaaccta gtgctgttag cttcatttgc tgcgagcaaa acagctggtc ggccgttagg 1035361 aactgaattg aaactcaacc gatttggtgc cgccgtaggt gtcctggctg cgggtgcgct 1035421 ggtgttgtcc gcgtgtggta acgacgacaa tgtgaccggg ggaggtgcaa ccactggcca 1035481 ggcgtcggcg aaggtcgatt gcggggggaa gaagacactc aaagccagtg ggtcgacggc 1035541 gcaggccaac gcgatgaccc gctttgtcaa cgtgttcgag caggcctgcc ccggccaaac 1035601 cctgaactac acggccaatg gttcgggcgc tggaatcagc gaatttaatg gcaaccaaac 1035661 cgatttcggt ggctcagatg tacccctgag caaggacgag gccgcagcgg cgcagcggcg 1035721 ttgcggctcg ccggcgtgga atctgccggt ggtgttcggc ccgatcgcgg ttacctacaa 1035781 cctcaacagc gtttcctcgc taaatttgga cggccccacg ttggcgaaga tcttcaacgg 1035841 ctccattacg cagtggaaca atcccgcgat ccaggcgctg aaccgcgact tcacgctgcc 1035901 aggtgagcgg attcacgtgg tgttccgcag cgatgagtcg gggaccacgg acaacttcca 1035961 gaggtacctg caggccgcgt ccaacggtgc gtggggtaag ggcgctggaa agtcgttcca 1036021 aggcggcgtc ggtgagggcg cgcggggtaa cgatggcacg tcagcggccg cgaagaacac 1036081 cccggggtcg atcacctaca acgagtggtc gttcgcccag gcgcagcacc tgaccatggc 1036141 caacatcgtc acttcggctg gtggggaccc ggtggcgatt actatcgact cggtcggcca 1036201 gacgatcgcc ggggccacca tctccggggt gggcaacgac ctggtgctcg acacggactc 1036261 gttctaccgg ccgaagcgtc ccggctccta tccgatcgtg ttagcgacat acgaaatcgt 1036321 ttgctcgaag tatcccgact cgcaggttgg cacggctgtg aaggcgttcc tgcagagcac 1036381 tatcggcgcc ggtcaaagcg gcctggggga caacggatac atcccaattc cggacgagtt 1036441 caaatcgagg ctgtcgactg cggtcaacgc gatcgcctga tctgaggttg acgtggtcac 1036501 cgagccgctc acaaagccgg cgctagtggc ggtcgacatg cgccccgcgc ggcgcggcga 1036561 gcggctgttc aagctggccg cgtcggccgc cggttcgacg atcgtcatcg caatcctgct 1036621 gatcgcgata ttcctgttgg tccgcgccgt gccgtcgttg cgggcgaatc acgccaattt 1036681 cttcaccagt acccaattcg acacgtcgga cgatgagcag ctggcgtttg gtgtccggga 1036741 cttgttcatg gtcacggcgt tgagttcgat aacggctctg gtgttggcgg tgccggtggc 1036801 tgtcgggatc gcggtgttcc tcacccacta cgcgccgagg agactgtcgc gtccattcgg 1036861 cgcgatggtg gatctactgg ccgcagtgcc gtcgatcatc ttcgggttgt gggggatctt 1036921 tgtgctggcg cccaagctcg agccgatcgc gaggtttctc aatcgcaact tgggctggtt 1036981 gttcctgttt aagcagggca acgtgtcgtt ggccggcggc ggcacgattt tcaccgcggg 1037041 catcgtgctg tcggtgatga tcctgcctat cgtcacatcg atatcacgcg aagtgttccg 1037101 gcagactccg ctgatccaaa tcgaagcagc gctggcgcta ggcgcgacga aatgggaggt 1037161 agtgcggatg accgtgctgc catacgggcg aagcggggtg gtcgcggcct ccatgctggg 1037221 tttggggcgg gctctgggcg aaaccgtggc cgtgctggtc atcctgcgct cggccgcgcg 1037281 gccggggacc tggtcgctgt tcgacggcgg ttatacgttc gcttccaaga tcgcctccgc 1037341 tgcttcagaa ttcagcgaac cgctgccgac cggagcctat atttcggcgg gatttgcgtt 1037401 attcgtgctg acgttcctgg tcaatgcggc cgctcgcgca atcgccggcg ggaaggtcaa 1037461 cgggtgagtc cctcaacgag catcgaggcg ctcgaccagc cggtaaagcc ggtggtgttt 1037521 cgtccgctta cgctgcgacg gcggatcaaa aacagcgtcg cgacaacgtt tttcttcacc 1037581 tcgttcgtgg tcgcgttgat accgttggtc tggctgcttt gggtggtgat tgcccggggt 1037641 tggtttgccg tcacccgatc gggctggtgg acccactcgc tgcgcggcgt gctgccagag 1037701 caattcgccg gtggggtgta tcacgccctg tacggcacgc tggtgcaggc cggggtggcc 1037761 gccgtgctgg ccgtgccgct gggcttgatg accgcggttt acctagtgga atacgggact 1037821 ggtcgaatgt cgcgggtgac taccttcacc gtcgacgtgc ttgccggcgt gccctctatc 1037881 gtggcggcgt tattcgtctt cagcctgtgg atcgccaccc taggatttca gcagagcgcc 1037941 tttgccgtgg cgttggcgtt ggtcctgctg atgttgccgg tggtggttcg ggcaggcgag 1038001 gagatgctca ggttggtgcc cgatgaactg cgagaagcca gctacgcgtt aggcgttccg 1038061 aaatggaaga cgatcgtgcg gatcgtcgcc ccgatcgcga tgccgggcat cgtgtcaggc 1038121 atcttgttgt ccatcgcgcg cgtcgtcggt gaaaccgcac cggttctggt gctggtcggg 1038181 tacagccact ccatcaacct cgacgtcttc cacggcaaca tggcctcgct gccgttgctg 1038241 atctacaccg aactcaccaa tcccgagcac gccggcttcc tgcgcgtctg gggcgcggcg 1038301 ctgaccctga tcatcgtggt cgccacgatc aacctggccg cggcgatgat ccggttcgtc 1038361 gcaacccgac ggcggtgact cccgttatga cgtgagtttc accactcggt cgttgccgcg 1038421 gtcggcgacg tagacggtcc ggtcgctgtc cactgccacc gcgagggggg tgttgaggcc 1038481 ggtgaacggt agcactgtcg aggtggtcga cccggccagg agtttgacca cctggtttgt 1038541 gttgtgctcg gtgacgtaga cggttccggc ttcgtccacc gcgatgcccc acggtgcggt 1038601 gatatccgtg aatggcagca cgacctggtt attcgactcg gcctctagct tgacaaccct 1038661 gttgttgtcg gtgtcggtga catagacgtt gccggagttg tcgacggcca ccccgtcggg 1038721 gtcgttgagg ccggtgaacg gcagcacggt ctgggtcttg gatccggccg ccaacttcac 1038781 caccctgttg ttgccccggt cggcgacgta taccgcaccc tgggtatcca ccgcgagacc 1038841 ttcggggtag ttgaggccgt cgaacggtag cacggtctgg ttgttggacc cggccgctaa 1038901 cgtcaccacc cggttgttga aatcggtgac gtatacggtg ccagcgccgt ccaccgccaa 1038961 cccctgcggc tggtacagcc cgttgaacgg taacaccgtc gtgccggttg acccggtggc 1039021 caacttgacc actcggccgt acatgccctc actggtgacg tacacgttgc cggcgctgtc 1039081 cactgccacc ccactcggcg agaggcggaa gtcgatgccg gtgaacggca acacggtctg 1039141 tccggatgcc tgcgtcggcg accacgaagg tcgtaagacc aggtagccgg cggcggcgac 1039201 gatggccacc agtacgatcg cggcagcgcc gacgacggcc cacaccttcc gtttgttgcc 1039261 ggccggcggc acagcgtgtc ccagggaggc ctggagcgca ttcgggacgg caggggagtg 1039321 tccggtttgg ctgggccagt tcccgccgcg gctgtccgct gccaggggtc cggccacggt 1039381 cgcggagtcc ccgggcgacc atcgggcagc acccggggtc ggtgggccgg tgcccgcccc 1039441 ggcaatgccg gactcggact ggctcaagcc cgtatcggcc ggagtggcca gcaaggttgc 1039501 gttgtcaccg cgccgcagaa tcgtcgtggc ctggtgttgc tcggatgtgg ttgagtgcgt 1039561 catgggcggc gatggccaga tcaccagcgc tcataaagcg ctccgcgggg tttttggcca 1039621 tgcctttggc gatcacctga tccagggccg gcggcacgcg cccgggccgt agctggctgg 1039681 gctgcggggc agggtccatt agatgcgcgg cgatcaaccg ctcaacgctg tcggcccgat 1039741 acggtggggc accggtcaaa cactcaccca acacgcacgc caacgcatag atatctgcgc 1039801 gataggtgac ctcatcgccg gtgaaccgct ccggggccat gtagttgtag gttcccacgg 1039861 cggtcccggt ctgggtcagc cccgggtcgg aggcggcacg ggcaataccg aaatcgacca 1039921 gataggcgaa gtcgctcgcg gtgaccagaa tgttttccgg ttttacgtcg cggtgcgtta 1039981 cgccgttggc atgcgcggca tccaaagcgg cggcgatctg gcgcacgatg gccacagctc 1040041 gggccggggt cagcggacca tactgtttca atagggcgcg taaagaggtg ccgtcgatca 1040101 tgcgcatttc gacaaagaac tgtccgttga tctcgccgta gtcatggatc ggcacgatgt 1040161 gtggctcggt cagccgtccc gcggtgtcgg cctcgcgttg catccgtgct cgaaacaccg 1040221 cattgtcgga gtactgcggc gagatcaact tcagcgccac cacccggtgc ttgcgggtgt 1040281 cctcggcctc ataaacctcg cccatcccgc ctcggcccag cagccgcaat agctgatacg 1040341 gcccaaattg cgaccctacc tgcggaacgg catcgctcac cgtcgaattc ccttcactag 1040401 gtcaagaaat agcattcacc gcggccgcca attttgcttg gaacgatttg ggcaacggaa 1040461 tggagccgta ttggtccagg ccttcttggc ctggaccaat cgcggcttgc ataaacgccc 1040521 ttaccgcagt accggtcgtc gcatccgggt atttcgagca gacgatctca taggtcgcca 1040581 gcacgatcgg gtaagagcca ggctgggtgg gcctgtagaa cgacgacgtg tccaatacca 1040641 ggtcgttgcc ttgtcccatg atcttggccc cggcgattgt cttgccgacc gactcggtgg 1040701 tgatcgccac tggatccgga cccgccgacg tgatgatctg ggccatgttc aactgcttac 1040761 ccaccgcaaa cgaccactcg ttgtaggtga tcgacccgtc ggtcgtctgc agtagggccg 1040821 acgtgccgtt gttcccgctg gcgccgacgc cgacgccccc gttgaacgtt tcgctggcgc 1040881 ctttgcccca cgccccgttg gatgcgccgt cgaggtattt ctggaagttg tccgacgtac 1040941 cggacttgtc gctgcggaag ataacgctaa tcggtgttgg cggcaggtcg gtgccggagt 1041001 tgagggcttg gatctgtgga tcattccaca cggtgatggt gccgttgaaa atcttggcgg 1041061 tagtgggtcc gtcaagattc agcgtgctca cgcccttgat attgtaggtg atcgcgatcg 1041121 ggccgaacac cgtcggcagg tcccatgccg gggaaccgca ccgctccgcc gcccggtcag 1041181 gttgaccggt cgacggattc aacgggacat ccgagccggc gaaatcggtt tcgttgttga 1041241 gaaactgggt caccccggca ccggacccgt tggcgttgta gtccaacgtg tagcccgggc 1041301 acgatcgcac gtaggcatag acgaactgct ccatggcatt ttcttgtgcg gtcgagccgc 1041361 tggagtggag ctccttcttg ccgccgcagt gcaccgaccc agacgtgccg cctgcgcctg 1041421 acgacgagct gttggtgcca ccgccgcatg ctgtcaacac cagtgtgccg gcggccaaca 1041481 ggcttaccgc tgcgccggat cgggcgaact tcacgcaact cctctcgagg gggtcgtggt 1041541 ggcggatcca ctcgccaccg gtggtcgccg agccaccgac ccggggtcgg tattcgagcc 1041601 gtcaccgttg tgcatcgaaa gaggtctgat cattgaaatc ctagcgttca ggaggggccg 1041661 ctgatactga gggtcgacgg cgcgctttgt ccaaggagca tcccaaggag catgtagtac 1041721 cctgcgccga tggcgtgtga acggctcggc ggccagagcg gtgctgctga tgtcgacgcc 1041781 gctgcgccgg cgatggcggc ggtgaacctc accctgggtt tcgctggcaa aaccgtgctc 1041841 gaccaggtga gtatgggctt tcccgctcgt gcggtgacgt cgttgatggg accgaccggt 1041901 tcaggtaaga cgactttttt tgcgcaccct aaaccggatg aatgacaagg tctccggtta 1041961 ccgctacagc ggtgatgtgc tgttgggcgg acgcagcatc ttcaactacc gcgacgtgct 1042021 ggagtttcgc cgccgggttg gcatgctgtt ccagcgcccg aatccgttcc cgatgtcaat 1042081 catggacaac gtgctcgccg gcgtgcgtgc ccacaaactg gtgccgcgca aggaattccg 1042141 tggcgtcgcg caggctcggc ttaccgaggt cggcctctgg gacgcggtca aggatcggct 1042201 cagcgattca ccgtttcgac tctctggtgg tcagcagcag ttgttgtgcc tagcccgtac 1042261 gcttgcggtg aatccggagg tgttgctgct cgacgagccc acctccgcgc tggacccgac 1042321 taccaccgag aagatcgaag agttcatccg atcgctcgct gatcgcctca cggtgatcat 1042381 cgtgacccat aaccttgccc aggccgcccg catcagcgac cgggcggccc tgttcttcga 1042441 cggcaggctg gtggaggaag ggcccaccga acagctgttc tcctcgccga agcatgcgga 1042501 aaccgcccga tacgtcgccg gactgtcggg ggacgtcaag gacgccaagc gcggaaattg 1042561 aagagcacag aaaggtatgg cgtgaaaatt cgtttgcata cgctgttggc cgtgttgacc 1042621 gctgcgccgc tgctgctagc agcggcgggc tgtggctcga aaccaccgag cggttcgcct 1042681 gaaacgggcg ccggcgccgg tactgtcgcg actacccccg cgtcgtcgcc ggtgacgttg 1042741 gcggagaccg gtagcacgct gctctacccg ctgttcaacc tgtggggtcc ggcctttcac 1042801 gagaggtatc cgaacgtcac gatcaccgct cagggcaccg gttctggtgc cgggatcgcg 1042861 caggccgccg ccgggacggt caacattggg gcctccgacg cctatctgtc ggaaggtgat 1042921 atggccgcgc acaaggggct gatgaacatc gcgctagcca tctccgctca gcaggtcaac 1042981 tacaacctgc ccggagtgag cgagcacctc aagctgaacg gaaaagtcct ggcggccatg 1043041 taccagggca ccatcaaaac ctgggacgac ccgcagatcg ctgcgctcaa ccccggcgtg 1043101 aacctgcccg gcaccgcggt agttccgctg caccgctccg acgggtccgg tgacaccttc 1043161 ttgttcaccc agtacctgtc caagcaagat cccgagggct ggggcaagtc gcccggcttc 1043221 ggcaccaccg tcgacttccc ggcggtgccg ggtgcgctgg gtgagaacgg caacggcggc 1043281 atggtgaccg gttgcgccga gacaccgggc tgcgtggcct atatcggcat cagcttcctc 1043341 gaccaggcca gtcaacgggg actcggcgag gcccaactag gcaatagctc tggcaatttc 1043401 ttgttgcccg acgcgcaaag cattcaggcc gcggcggctg gcttcgcatc gaaaaccccg 1043461 gcgaaccagg cgatttcgat gatcgacggg cccgccccgg acggctaccc gatcatcaac 1043521 tacgagtacg ccatcgtcaa caaccggcaa aaggacgccg ccaccgcgca gaccttgcag 1043581 gcatttctgc actgggcgat caccgacggc aacaaggcct cgttcctcga ccaggctcat 1043641 ttccagccgc tgccgcccgc ggtggtgaag ttgtctgacg cgttgatcgc gacgatttcc 1043701 agctagcctc gttgaccacc acgcgacagc aacctccgtc gggccatcgg gctgctttgc 1043761 ggagcatgct ggcccgtgcc ggtgaagtcg gccgcgctgg cccggccatc cggtggttgg 1043821 gtgggatagg tgcggtgatc ccgctgcttg cgctggtctt ggtgctggtg gtgctggtca 1043881 tcgaggcgat gggtgcgatc aggctcaacg ggttgcattt cttcaccgcc accgaatgga 1043941 atccaggcaa cacctacggc gaaaccgttg tcaccgacgg cgtcgcccat ccggtcggcg 1044001 cctactacgg ggcgttgccg ctgatcgtcg ggacgctggc gacctcggca atcgccctga 1044061 tcatcgcggt gccggtctct gtaggagcgg cgctggtgat cgtggaacgg ctgccgaaac 1044121 ggttggccga ggctgtggga atagtcctgg aattgctcgc cggaatcccc agcgtggtcg 1044181 tcggtttgtg gggggcaatg acgttcgggc cgttcatcgc tcatcacatc gctccggtga 1044241 tcgctcacaa cgctcccgat gtgccggtgc tgaactactt gcgcggcgac ccgggcaacg 1044301 gggagggcat gttggtgtcc ggtctggtgt tggcggtgat ggtcgttccc attatcgcca 1044361 ccaccactca tgacctgttc cggcaggtgc cggtgttgcc ccgggagggc gcgatcgcgc 1044421 tggggatgtc gaattgggag tgtgtccgca gggtcaccct gccgtgggtg tccagcggca 1044481 tcgtcggtgc ggtggtgcta gggcttggcc gtgcgctggg ggagacgatg gcggtagcca 1044541 tggtgtccgg cgcggtgctg ggggccatgc ccgccaacat ctacgcgacc atgaccacca 1044601 tcgccgccac catcgtgtcg cagctggatt cggcgatgac cgattccacc aacttcgcgg 1044661 tgaagacgct cgccgaggtg ggtttggtgc tgatggtgat cacgttgctg actaatgtgg 1044721 ccgcgcgcgg gatggttcgt cgggtgtcac gcaccgcgct tccggtggga cgcggcatct 1044781 gacatgggcg aatcggctga gtccgggtcc cggcagctac cggcgatgtc cccgccgcgg 1044841 cgatcggtag cctatcggcg caagatcgtc gatgccctgt ggtgggcggc gtgcgtgtgt 1044901 tgtctggcgg tggtgatcac cccgacgttg tggatgttga tcggagtcgt cagccgcgct 1044961 gtaccggttt tccactggag tgtgctggtg caggactccc agggcaatgg cggcggcttg 1045021 cgcaacgcca tcatcggtac cgcagtgttg gccatcgggg tgatcctggt gggtggcacg 1045081 gtgagtgtgt tgaccgggat ttatctgtcc gaattcgcca ccggcaaaac acggtccatt 1045141 ctgcgcggcg cctacgaggt gttgtccggt attccgtcga tcgtgctcgg ctacgtcggc 1045201 tatttggccc tggtggtgta cttcgattgg gggttttcgc tggcggccgg ggtgttggtg 1045261 ctgtcggtga tgagcattcc ctacatcgcc aaggccaccg agtccgcgct ggcccaggtg 1045321 ccgacgtcgt atcgggaagc ggctgaggca ctcgggttac cagccggctg ggcgctgcgc 1045381 aagatcgtgc tgaagacggc gatgcccgga atcgtcaccg ggatgttggt cgcgctggcc 1045441 ctggcgatcg gcgagacggc gccgctgctg tacacggcgg ggtggtcgaa ttcgccgccg 1045501 accggacaac tcaccgactc gccggtcggc tacctgacct acccaatttg gacgttctac 1045561 aaccagccat ccaagtcggc tcaggatctg tcctatgacg cggctctctt gctgatcgtg 1045621 ttcctgctgc tattgatctt cattggccgg ttgatcaact ggctgtcacg gaggcgttgg 1045681 gacgtttgag ttggccttcg agcgcgcctt cacgctggcc tccagcttgg cgagcaggtc 1045741 ggagacgtct tcgggctcgt ccagcaacct cggttggtcc tcggcggtaa atgcctgccc 1045801 accttcgagt ttggtgtcga tcagctcctg taactgctcc tggtaggtgt cgtggtagcg 1045861 gtccggattg aagtcgtcgg ccatcgagtc caccacctgg ccggccatct tgagttccgc 1045921 gggtttgatc tccaccttct ggtccagcac cgggaagtcg gggtcgcgga tctcatcggg 1045981 ccacagcaac gtgtgcacca tcatcacctc tcgcttgccg aaatccttga cgcgcaacgc 1046041 cgccagcctg gtcttgttgc gcagcgtgaa atgcacgatc gccatccggt cggtctcggc 1046101 gagtgtctta gccagcagca catacgattt cgacgacttc gaatcaggct ccaaaaagta 1046161 gctgcggtcg aacatcatcg ggtccacgtc ggcggcgggg acgaactcca acacctcgat 1046221 ctcccggctg cgttcttcag gcaagctggc gatgtcgtcg tcggtgatcg ccaccatttg 1046281 gccgtcgccg gactcgtagg cccgggcaag atcgcggtag tcgaccacct cgccacacgc 1046341 ctcgcagacg cgcttgtacc ggatgcgtcc gttgtccttg gcgtgcacct ggtggaacct 1046401 gatgtcgtgg tctgcggtag cgctgtacac cttgaccggc acgttcacca gcccgaaggc 1046461 gatcgaaccc gtccaaatgg ctcgcatgta agtgagtatg ccttgattgt ccgcgagcgg 1046521 aacgtcacgg cgaaattcca cgcgatattt gaccgtgacg ttacgctcgc gacttgtgtg 1046581 accgacaggc tacgttgaaa gcatgggttc ggcgtcggag caacgggtga cgctgaccaa 1046641 cgccgacaag gtgctctatc ccgccaccgg gaccacaaag tccgatatct tcgactacta 1046701 cgccggtgtt gccgaagtca tgctcggcca catcgcggga cggccggcga cgcgcaagcg 1046761 ctggcctaac ggcgtcgacc aacccgcgtt cttcgaaaag cagttggcgt tgtcggcgcc 1046821 gccttggctg tcacgtgcaa cggtggcgca ccggtccggg acgacgacct atccgatcat 1046881 cgatagcgca accgggctgg cctggatcgc ccaacaggcg gcgctggagg tgcacgtgcc 1046941 gcagtggcgg tttgtcgccg agcccggatc aggtgagtta aatccgggcc cggcaacgcg 1047001 tttggtgttc gacctggacc cgggcgaagg cgtgatgatg gcccagctgg ccgaggtggc 1047061 gcgcgcggtt cgtgatcttc tcgccgatat cgggttggtc accttcccgg tcaccagcgg 1047121 cagcaaggga ttgcatctgt acacaccgct ggatgagccg gtgagcagca ggggagccac 1047181 ggtgttggcc aagcgcgtcg cgcagcgatt ggagcaggcg atgcccgcgt tggtcacctc 1047241 gaccatgacc aaaagcctgc gggccgggaa ggtgtttgtg gactggagcc agaacagcgg 1047301 ctcgaagacc accatcgcgc cgtactcact acgtggccgg acgcatccga ccgtcgcggc 1047361 gccacgcacc tgggcggagc tcgacgaccc cgcactgcgt cagctctcct acgacgaggt 1047421 gctgacccgg attgcccgcg acggcgatct gctcgagcgg ctggatgccg acgctccggt 1047481 agcggaccgg ttgacccgat accgccgcat gcgcgacgca tcgaaaactc ccgagccgat 1047541 tcccacggcg aaacccgtta ccggagacgg caatacgttc gtcatccagg agcatcacgc 1047601 gcgtcggccg cactacgatt tccggctgga acgcgacggc gtgctggtct cgtgggcggt 1047661 accgaaaaac ctgcccgaca acacatcggt taaccatcta gcgatacaca ccgaggacca 1047721 cccgctggaa tacgccacgt tcgagggcgc gattcccagc ggggagtacg gcgccggcaa 1047781 ggtgatcatc tgggactccg gcacttacga caccgagaag ttccacgatg acccgcacac 1047841 gggggaggtc atcgtgaatc tgcacggcgg ccggatctct gggcgttatg cgctgattcg 1047901 gaccaacggc gatcggtggc tggcgcaccg cctaaagaat cagaaagacc agaaggtgtt 1047961 cgagttcgac aatctggccc caatgcttgc cacgcacggc acggtggccg gtctaaaggc 1048021 cagccagtgg gcgttcgaag gcaagtggga cggctaccgg ttgctggttg aggctgacca 1048081 cggcgccgtg cggctgcggt cccgcagcgg gcgcgatgtc accgccgagt atccgcaatt 1048141 gcgggcattg gcggaggatc tcgccgatca ccacgtggtg ctggacggcg aggccgtcgt 1048201 acttgactcc tctggtgtgc ccagcttcag ccagatgcag aatcggggcc gcgacacccg 1048261 tgtcgagttc tgggcgttcg acctgctcta ccttgacggc cgcgcgctgc taggcacccg 1048321 ctaccaagac cggcgtaagc tgctcgaaac cctagctaac gcaaccagtc tcaccgttcc 1048381 cgagctgctg cccggtgacg gcgcccaagc gtttgcgtgc tcgcgcaagc acggctggga 1048441 gggcgtgatc gccaagaggc gtgactcgcg ctatcagccg ggccggcgct gcgcgtcgtg 1048501 ggtcaaggac aagcactgga acacccagga agtcgtcatt ggtggctggc gcgccgggga 1048561 aggcgggcgc agcagtggcg tcgggtcgct gctcatgggc atccccggtc caggtgggct 1048621 gcagttcgcc gggcgggtcg gtaccggcct cagcgaacgc gaactggcca acctcaagga 1048681 gatgctggcg ccgctgcata ccgacgagtc ccccttcgac gtaccactgc ccgcgcgtga 1048741 cgccaagggc atcacatatg tcaagccggc gctggttgca gaggtgcgct acagcgagtg 1048801 gactccggag ggccggctgc gtcaatcaag ctggcgtggg ctgcggccgg acaagaaacc 1048861 cagtgaggtg gtgcgcgaat gaagtgggtg acgtatcgaa gtgaccacgg cgaacgaacg 1048921 ggagtgcttt ccggtgacgc catctacgcg atgccgccgg acgtgtcgtt gctggatctg 1048981 gtcgggcgcg gcgccgacgg tctgcgcacg gcgggcgaac gggcagtgcg ctcaccggcc 1049041 gcggtggtag cgctcgacga ggttacgctg gcggcgccga ttccgcgccc gccgtcgatc 1049101 cgggactcgt tgtgctttct ggaccacatg cgtaactgcc aggaagcgat ggggggcggc 1049161 cgggtgctca tggatacttg gtaccgcatc ccggcgttct acttcgcgtg cccgtcaacg 1049221 gttttgggac cgtacgacga cgcacccacc gcacccggaa gtgcgtggca ggacttcgaa 1049281 ttggagatcg cggcggttat cggaaccagc ggcaaagact tgaccgtcga gcaggccgaa 1049341 cggtcgatca tcggctatac cattttcaac gactggtccg cacgggacct gcagatgctg 1049401 gagggccagc tgcgcatcgg acaggccaag ggcaaagaca gcggtatcac cctgggcccc 1049461 tatctggtca caccggatga gctggagccc tattgccggg gcgggaagct aagcttgcgg 1049521 gtgatcgcct tggtcaacgg caccgtgatc ggatcggggt cgaccgcaca gatggactgg 1049581 agcttcggcg aagtcatcgc ctatgcctcg cggggggtga cgctgacccc gggtgacgtg 1049641 ttcggctcgg gcacggtgcc cacctgcacg ctcgtcgagc acctcaggcc accggaatca 1049701 ttcccgggct ggctgcacga cggcgacgtg gtcaccctcc aggtcgaagg gctgggcgag 1049761 acgaggcaga ccgtccggac gagcggcact ccttttccgt tggctcttcg gccgaatccg 1049821 gacgccgaac ccgaccggcg cggggtcaac ccggcaccga cgcgggtgcc gtttacccgc 1049881 gggctgcacg aagtcgccga ccgggtatgg gcgtggacgc tgcccgacgg gggatacggc 1049941 ttcagcaacg ccgggctggt cgccggggac ggcgcgtcgc tgctcgtgga taccctgttc 1050001 gacctggcac tgacacgcga gatgttggcc gcgatgaagc cggtcaccga gcgggcgccc 1050061 atcaccgacg ccctgatcac gcactccaac ggcgaccaca cgcacggcac tcaactgttg 1050121 gaccgctcag tgcgcatcat cgccgccaag ggcacctccg aggagatcga gcatggcccg 1050181 gcaccggaga tgctagcccg gatccaaacc gccgacctgg gccccgttgc gacgcggtat 1050241 ctgcgtgatc gcttcggtca ctttgacttc agcggcatca agctgcgcaa cgccgacctg 1050301 acgttcgacc gcgacctggc catcgagctc ggcggccggc gagtcgacct gctcaacctc 1050361 ggtcccgcgc acaccaccgc cgactcggtc gtgcacgtgg ccgacgccgg tgtgctgttc 1050421 gccggggatc tgctgttcat cggttgcacc ccgattgtgt gggcgggccc gatcgccaac 1050481 tgggtggcgg cctgcgacgc gatgatcgcg ctggacgcgc ccacggtggt gcctgggcat 1050541 ggtccggtca ccggcccgga cgggatccgt gccgtccgtg gctatctggc gcacatcgcc 1050601 gaacaggccg aggcggccta ccgcaagggg ctatcgttgc ccgaggccgt cgagaccatc 1050661 gacctgggcg agtacgcgag ctggctggac tccgaacggg tagtggtcaa cgtctaccag 1050721 cgttaccgcg aattggatcc cgacaccccg cgccaggact tgctggcgtt gctggtgatg 1050781 caggccgaat gggcggcgcg ccactgtacg tagccactcg ggcgcgtttg tcacgggaat 1050841 ctgcggaccg gcgggcgcat ggtttgcctg tccacgagcg acaaagccag cgcgccaagg 1050901 attcccgatg gcagccatca ctttgtcgcg ctgaggcggg cacgaagaac atcccgtcca 1050961 gacagcggcc aatgtggcgg gtgtgaaagg cgccgccgag catggcaccg ggtccaacgg 1051021 ctctcacgaa gctgatcggg gatcgatccg ttgtgatgct taaactttcg cgatgacgtt 1051081 ctcggcgaac atctccagat tgcggatctt ggtctgcagc ggctcggtgt cggggcccat 1051141 ggtgtacggg acacggaaac cgacgatgac gtccgtcacc cctttgtcct cgagccgctt 1051201 gacgccgtcc acggtgaaac cgtccaggga gatcacgtgg atttcgaacg ggctggtttt 1051261 ccccgcttcc tcgcgaagcc gcttgaccct ggcgatcagc cggtcgagtt cgtccggatc 1051321 gccgccgcca tgcatccatc catcggcgcg cgccgcccgt cgcagtgctg catcggcgtg 1051381 gccaccgacc aggatcggga tcggctgggt gggcgccggg gtcatcttgg tcttgggtat 1051441 gtcgtagaac tcgccgtgga actcgaagta atcgccggtg gtaaggccac gcacgatctc 1051501 gatgcattcg tcaatccgct tgccgcgctt agcgaacggg acgcccatca gctcgtaatc 1051561 ctccggccac gggctagtgc cgacacccag cccgacccgg ttgccgatca gggcggctag 1051621 ggaaccggcc tgctttgcca ccagagccgg cgggcggatg ggcagcttga ggacgaagaa 1051681 gttgaaccgc agcctcgtcg tgactgcgcc caatgctgct gtcaggacaa aggtttcgat 1051741 gaaaggcttg ccgtccatga attcgcggtt gccgtcgggt gtgtacgggt acttcgagtc 1051801 ggattcgaag gggtaggcga tgctgtcggg aatcgtcatg ctgctgtatc ccgccgcttc 1051861 ggctgccttg gccagcggga tgtagaacgt gaagtcggtc attgcctccg cgtagctgaa 1051921 ccgcacgtga ttgccttcct cgaagtggcc gtccccaacg agattagaac gtgttctaat 1051981 ttgacgtgca agcggggcgc aacggcttgg tcagagttgg ttctccggcc caataattgc 1052041 ccagaccgtc ttgcccgacg aagtgggact gctgccccag gcgcgggaca acgcggcaac 1052101 gatcgccagg ccggaaacgt cgatgccctt cggtggggac gccagccgaa ccgccggagc 1052161 gctgctgccg tcggaaaccg cgatggttgc cgttgggcca tcgctttcga tccgcatcac 1052221 cgggtcgctt ccggtgtgtt tcagcacgtt ctccacgaat acgttgacga cgaccaacgc 1052281 gactggaata agcccgggac gtgaccattg ggtgagccat tcgcggacca actggcgtga 1052341 ctcgcgaagg ctgttcaggt tggcgggcag ttgtgcgtcc gaacgcttga aattgcggcg 1052401 cgcgagccga ccgatggcct tgctcgccgc tttttcggtc gggtacaccg gcatgaagcg 1052461 ggcgaccccg gtgcgggtga ccgccgcgcg gccggcccga tggccgcaga ccagcaagac 1052521 cggtacatcc gctcggaagt cggcctgcca gcgggcgctg ataaagaccg accatgccga 1052581 ttcctcggcg acttgcagct cggtgacatt gacgataacg gcggacggct gctcgagcgt 1052641 cgccctcgtg aggctgtccc ggagcagtgc agaactgctg gagtcaagcg caccgtcggc 1052701 ggtcaagatg accaccgaat cctgtgtacg taccgcaatg gccagcgctg tcggtgactt 1052761 ggctgccgtg ctcaccgcga ccacttcctt gcgtcccttg ccccggcgtc aggtgcacat 1052821 cgcaacttgg gtcggagtgc caccatagcc atggttccga aacggcggga cgccatgaac 1052881 cggcattccg gtcccatcct gtcgtccggt ttcatagcca gctcctcgaa ctcctgtccc 1052941 gccaatagct tgaggatgcc gtccgccttg gcggcagaaa ccctatcttt tgatgatcgc 1053001 gccgtccggc gcagcaccca tcacccaggg ggtggttacc cacaaaaaca cgcgatcaac 1053061 ctccagtccg ggctatgccc agcctatgca aacgccagca ggtagggccc gggaatccgg 1053121 ccaacaaaga tcaacgaacg ccgcgccggc gccgggatgc gttcaagtgg tggccgaggc 1053181 tgggccgctt cgggcatagg gcggtgggcc cactccggcg accgagtggg taccccacgg 1053241 tgtttgttca gtgatgcgtg cgggtgcgct acgtccgccg atggttaacg tcgccgcccg 1053301 ggcatgggtg agtgaagtct cgggcaagga atcgaatacg gtgccctgcc agtggtagtt 1053361 gccgtcgatc ggatcgaggt gaccggtaag ccggacgcgg acccgaaagc gggcaccagc 1053421 gagcgttagc gtcgccgcac cgtcgtaggt ctgatcgtcc tcggtcgccg cggatgacaa 1053481 gtcgaacgct tcgaggcccc cagtctgccg atggggctga gcgggtttga gttgggcgcg 1053541 ctcgttgaat acctgctggc tgctgcggcg cacctcgatg cggcggctgg ccgtgcgctc 1053601 catgagcttc atgcattcga cgacgcagcg tgcctgcgcg gcggtatcgg gcccggtgat 1053661 gaagaagtag ttggggaaac cgtgaacggc gacgccgagg tagggctcca tgccatcgtc 1053721 ccaggcttgg cggatggtca caccgccggc accgaccagg gtctgatcgc cgacctgatc 1053781 ggcgatcgcg aacccggtgc cgtagatgat ggcgtcgacg gggtgttcca cgccatcgct 1053841 ggtgcggatg cccgaggagg tcagcgcgtc gatcgccgcc gtcgcccagg cgaccgctgg 1053901 atgctcagcc ccggtgcggc gacgtagcca gcgtttggcg cgtgtcgtcc acagtggtac 1053961 tccggtgacg acgcggcgcg gtgcctgggt gaagaccgtg accgacgccg ccgattcaga 1054021 caaccggctg atgtagtggg cggcggcggc atcggtgccg accaccgcga tgcgtttgcc 1054081 ggccgggtcg aaatcgcggt cccatgccgc cgaagtgggc ctgatgggcc cgatcgcccg 1054141 tcgcgcctgg aaacgcacca actttctgtg accgcgacgc tcggcctcgc tgacgccggc 1054201 caccgcattg tcatcgtcgg caggggtgct ggtggccggg acgccgcagc cgcgcgcgct 1054261 cgggcccgat gcgctggacg tcagcaccga cgacctggcc gggctgttgg ccggcaacac 1054321 cggccggatc aagaccgtca tcaccgacca gaaggtaatt gccggcatcg gcaacgccta 1054381 tagtgacgaa atcctgcacg tcgcgaagat ctcgccgttc gccacggccg gcaagttatc 1054441 cggcgcacag ctcacctgcc tgcatgaggc gatggcgtcg gtgctgtcgg acgcggtgcg 1054501 ccggtccgtc ggccagggcg cggccatgct caaaggggag aaacgttctg ggcttcgagt 1054561 acatgcgcgc accgggttac cctgcccagt gtgcggtgac actgtgcggg aggtgtcctt 1054621 cgcggacaag tcttttcagt actgtccaac gtgtcagacc ggtggcaagg cgctggccga 1054681 ccggcgtatg tcgcggctgc tcaagtagtc gatatgctca ccggagtgac tcgccagaag 1054741 atcctgatca ccggcgccag ttccggcctg ggcgccggga tggcccgatc cttcgccgcc 1054801 cagggccgcg acctggcgct ctgcgcccgc cgcacggatc ggctgaccga actgaaagcc 1054861 gaactgtcgc aacggtatcc cgacatcaag atcgctgtcg cggagctgga cgtcaacgac 1054921 cacgagcggg tgcccaaggt attcgccgaa ctcagcgatg agattggcgg cattgaccgt 1054981 gtgatcgtca acgccggaat cggcaagggt gcccggctgg gctcgggcaa gctgtgggcg 1055041 aacaaggcaa ccatcgaaac caacctggtc gccgcactcg tgcagatcga aacggcactg 1055101 gacatgttca accagcgcgg ttcggggcat ttggtgctca tctcctcagt gctcggcgtc 1055161 aaaggggtgc cgggcgtcaa agccgcgtat gcggcaagca aagccggtgt gcgctcgcta 1055221 ggcgaatcgc tgcgcgccga gtacgcccaa ggccccatca gggtcacggt gctggagccg 1055281 ggttatatcg agtcggagat gacggccaaa tcggcgagca caatgttgat ggtggacaac 1055341 gcaactggcg tcaaggcgct ggtggccgcc atcgagcgcg agcccggacg cgccgcggtc 1055401 ccctggtggc catgggcgcc actggtgcgg ctgatgtggg tgctgccgcc gcggctgacc 1055461 agacgcttcg cctagcgggc gctcggccac ctagcccgcg cggccacgtt cggtgcggta 1055521 gcggcgcacc agcccgtcgg tcgagctgtc cgactgcggt ggcggtgaac cggcgccggt 1055581 gattaccgga agcagcgcct tggcctgcgt cttgcccagc tccacccccc actggtcgaa 1055641 cgagtcgata ccccacacca caccctcggt gaacacctga tgctcgtaga gcgcgatcaa 1055701 ctgccccagc accgacggcg tgagccgact ggccagaatt gaggtggacg gccggttgcc 1055761 gggcatcacc ttatgcgcta ccacgtgggc gggggtgccg tcggcggcga tctcctcggc 1055821 ggtcttgccg aacgccagca cctgggtttg ggcgaagaag ttgctcatca gcagatcatg 1055881 catgctgccg gtgccctcgg cggtcggcag gtcgtcgagg ggttgagcaa agccgatgaa 1055941 atcggctggc accagccggg tgccctggtg cagcaactgg tagaaggcgt gctggccgtt 1056001 ggttcccggt tcaccccaaa agatttcacc ggtgtcggcg ctgaccgggc tgccgtcggc 1056061 gcgcgtggac ttgccgttgg attccatggt caactgctga aggtaggccg gaaaacgcga 1056121 caagtcattg gaatacggca gcacggtgcg tgattgcgca ccgaagaaat tggagtacca 1056181 cagtccgatc aggccaagca gcaccggcgc gttggattcc agcggagcgg tcgcgaaatg 1056241 gcggtcgatg atgtggaatc cggccaagaa atcggcgaag gcgtcgcggc cgatcaccgt 1056301 catcaacgac agcccgatcg ccgaatccac cgaataacgc ccgccgaccc aatcccaaaa 1056361 accgaacatg ttgtcggtgt tgatgccgaa gtcgtcgacc aggcgcttgt tggtggacac 1056421 cgcgacaaaa tgccgcgaca ccgcggcgtc gcccagcgca tcggtcagcc agcgacgcgc 1056481 cgcggtcgca ttggtcaatg tctccagcgt cgagaacgtc ttcgacgcga cgatgaaaag 1056541 cgttgtggcg gggtctagat cggcgagcgt ggcgatcagg tcggcgggat cgacgttgga 1056601 cacgaagcgc gcggaaatgc ccgcgtcggc atagtggcgc aacgcttggt acaccatcac 1056661 cggacccaaa tccgaaccac cgatgccgat gttgacgacg gtgctgatcc gctttccagt 1056721 tgctccggtc cactcgccgc tgcgcaggcg gtcggtgaag gcgcccatcg cgtcgagcac 1056781 ggcatgtacg tcggtgacga cgtcttggcc gtcgacgacg agttcggcgt ctcggggcag 1056841 ccgcagcgcg gtgtgcaaca ccgctcgatc ctcagaggtg ttgatatgca caccggcgaa 1056901 catctggtcg cgacgctctt cgaggtgggc cgtccgggcc agatcgatca gcagcgccag 1056961 cgtctcgcgg gtgacgcggt gtttgctgta gtcgatgtag agatcgccga cgctgacggt 1057021 gagctcccgg ccgcgacccg gatcgtcggc gaagaactgg cgaagatggg tgtttccgat 1057081 ctgatcgtga tgtctgcgca gggcgtccca tgccggggta gcggtgatgt cggggattgg 1057141 cgcggaggtc atggttcgac cctaatgccg tggagtggcg tcgatcagag ccgctgtctt 1057201 cgccgagcct ttagttatcg tgctcggcgg cactcgccgt ttgtcgcggt atctacaggc 1057261 tcggcgatgc gggcctgcgc tctcgcggcc tcggcccccg ccgaggccgc tgaccgtcgc 1057321 ccagcacccg ctgcagatca ggcagcatgg cctgcaatgg cgcacgccag tacgcccagg 1057381 tgtgggtttc gccatccggg aagttgaacc gccgttgcgc cgcttggtac ttgctttgca 1057441 aagcctgtcg cctattgccg ctcaactccg ccgagggcgt gccgttgccc gaatacgccc 1057501 agatacgggg tgctgttggc caccagcttc gcggcattga ccgttggctc gctgtgggcc 1057561 cacgccggat cggtcggcgg gcccccggat caatgacgcg gatccggcaa ccacgccatt 1057621 tccactcggg atgatcccct cactcgccgc cagccagtcg gccagctctt gggccatatc 1057681 ccgccgttgc cgaccgccgg ccggtgccag ttggaataga actcgccatg ccaccggtgg 1057741 gcatggcgag cgacagaccg gtctggtcaa ccgaataccg gcaggtgtat atcccggccg 1057801 ttgtagtcgt ctcgggcgag tatgccgtcg gacaagtacc acgcatgagg gccgccacct 1057861 tgaaactcca ccttgattag gcgatgcatc gaccgcgacg ggaccatcag tcgatgggta 1057921 gaccccgccg cgactacgtt gacgaggttg tcgagctggc gccttgcccg agcagcgcga 1057981 tcaaagctgc cgcccatatg accatcaacc ggcgcaattg tgaaactcca gctgcctttg 1058041 ctgcatccat ttcggcggaa attcagcgca gcgatgcaga aattcccggc aaacagcggc 1058101 ggaagtgacc cattagtgac cgaggcggcc cctgcccaat cgcaaaagca ggatggccag 1058161 atccttaccg tcgggtccca gctcgctgta gcgttcgatg accttcatct cccggctgtg 1058221 taccagccga gtgccaccgg acgccatccg ggccttgccg atggccttgg aaacctcagc 1058281 gcgtcgcttg actaacgcga ggatttcggc gtctagccgg tcgatctctt cgcgcagcgt 1058341 gtcgatctcg gggacaggtt gggactcgag catttccagg ttcatggctg ctaactccgc 1058401 gttctcgtga tgtgggggtt ctggtctcat ccggtactgg gcctcacaca agagacgagc 1058461 cccgaatccg gaagcggacc acggggctct gcgaaagcag ctagaccacg ggcaccgctg 1058521 gccggtaccc gtagaaaaat cggcgctgcg cgttgagcac gaaccgagtg tgccatcaac 1058581 ggacgcgccc gcgcaaaaac ttggcgggaa aagtgcaccc aaaattgggt ggtggcgccg 1058641 aaggacctgc cgcgtggcga tgagcctggc caggctatgc cgcggtccgc cgactcgtcg 1058701 ccgcgcggcg gtaagtttgg accgacatga gtgtgcacgc gaccgacgcc aagcctcccg 1058761 gtccatcccc agcggaccaa ctgctcgacg gcctcaaccc gcaacagcgc caggcggtcg 1058821 tgcatgaggg ttcgccgctg ctgatcgtcg cgggcgcggg ttcgggtaag accgcggtgt 1058881 tgacccgccg cattgcctat ctgatggcgg cccgcggcgt cggggtgggc cagattctgg 1058941 ccatcacctt caccaacaaa gccgccgccg agatgcgcga acgggtggtg ggcctggttg 1059001 gggagaaggc ccggtacatg tgggtgtcga cgtttcactc cacctgcgtg cgtatcctgc 1059061 gcaaccaggc ggcgctgatc gagggcctca actccaactt ttcgatctat gacgccgacg 1059121 attcgcggcg gttgctgcag atggtgggcc gcgacctggg cctagacatc aagcggtact 1059181 cgccgcgact gctggctaac gccatctcca acctgaagaa cgagttgatc gacccgcatc 1059241 aggcgctggc cggcttaacg gaggactccg atgacctagc gcgcgccgtg gcgtcggttt 1059301 atgacgaata ccagcggcgg ctgcgggcgg ccaacgcgct ggacttcgac gacctgatcg 1059361 gcgagaccgt cgcggtgctg caggccttcc cgcagatcgc ccagtactac cgtcggaggt 1059421 tccggcatgt cctggttgac gaataccagg acaccaacca cgcccagtac gtattggtgc 1059481 gcgagctggt cggccgcgac agcaatgacg gtattccccc cggcgagttg tgcgtcgtcg 1059541 gggatgccga tcagtcgatc tatgcgttcc gcggcgccac catccgcaac atcgaagact 1059601 tcgaacgtga ctaccccgac accagaacca ttctgctgga acagaattac cgctcgacgc 1059661 agaacatcct gtcggcggcc aactcggtga ttgcccgtaa cgcggggcgc cgggagaagc 1059721 ggttgtggac cgacgccggc gccggggagt tgatcgttgg ctatgtcgcc gacaacgagc 1059781 acgacgaggc ccggttcgtg gccgaggaga tcgatgcgct cgccgagggt agcgagatca 1059841 cctacaacga tgtcgccgtc ttctaccgca ccaacaactc gtcgcggtca ctggaagagg 1059901 tgctgatccg cgccggtatt ccgtacaagg tcgttggggg agtgcgcttt tacgagcgca 1059961 aggagattcg cgacatcgtt gcctacctgc gcgtgctgga caacccgggc gacgcggtca 1060021 gcctacggcg catccttaac accccgcgcc gcggtatcgg ggatcgtgcc gaggcgtgtg 1060081 tggcggtgta cgccgagaac accggcgtcg gcttcggtga cgcgctcgtc gccgcggccc 1060141 aaggcaaagt accgatgctg aatacccggg cggagaaggc gatcgcgggt ttcgtcgaga 1060201 tgttcgacga gctgcggggc cgcctcgatg acgacctggg ggagctggtc gaggcggtgc 1060261 tggaacgcac cggataccgc cgcgagctgg aagcgtccac cgatccacag gaattggccc 1060321 gcctggacaa cctcaacgaa ttagtcagcg tcgcacacga attcagtacc gaccgggaga 1060381 atgccgccgc acttggccca gacgacgaag acgtccccga caccggtgtg ctggcggatt 1060441 ttctggaacg ggtgtcgctg gtcgccgacg ccgatgagat cccggagcat ggcgcgggtg 1060501 tggttacctt gatgaccttg cacaccgcca agggtttgga gttcccggtg gtgtttgtga 1060561 ccggctggga ggacgggatg ttcccgcaca tgcgggcgtt ggacaacccg accgagttgt 1060621 ccgaggagcg gcggctggcc tatgtcggca tcacccgcgc ccggcagcgg ttgtacgtga 1060681 gccgggcgat cgtgcgttcg tcttggggcc agccgatgct caacccggag tcgcggtttc 1060741 tgcgggaaat cccgcaggag ctcatcgact ggcggcgcac cgccccgaag ccgtcgttca 1060801 gtgccccggt gagtggcgcc ggtcggttcg gtagcgcgcg tccatcaccg acccgctcgg 1060861 gggcgagcag gcgcccgctg ctggtgcttc aggtcggcga ccgcgtgacc catgacaaat 1060921 acggcctggg ccgtgtcgag gaggtctccg gtgtcggcga atcggcgatg tcgctgatcg 1060981 acttcggtag ctcggggcgg gtgaagctga tgcacaacca cgcccctgtc accaagctct 1061041 gagatttcgc gccgagcgtg aagtcacggc ggctatttcg cggatttctc gccctgagaa 1061101 cacgttcggc gtcgttgccg ggtcaaccgg tgtaattgcc gacgctaagt ccccgcttgg 1061161 cgagccacgg cactgggtcc acgcgctcgg tgccgcccag gagcacctcg aagtgcaggt 1061221 gcgggccggt ggaaaagcca cggctgccca tggtggcgat ctggtcgcct gccatcacgc 1061281 gctcaccgac gctgaccaac gtggtattga cgtggccgta tagcgtgacc gtgccgtcgg 1061341 cgtgcagcag cttgacccac attccgtagc cggcggtggg gccggcgtcg atgacgacgc 1061401 cgtcggacac cgcataaatc ggggttccga tcgcgttagc caggtcgata ccggcgtgca 1061461 gtacacccca tcgataaccg aaactcgacg tgaagatgcc cttcgtcggc atgacataca 1061521 gcgggcgctg tagtcgcgcc tcgcgctcgg cgcgctcctc ggcgaaggca acccccctgg 1061581 cgaactccgc gttgtgcacc gcagcactcg ccgccggctg ggcggcgatg acctggacgc 1061641 cccgcggtgg gttgcttccc gacccttcgt tgagcgccga tgcatgagcg gtcagcacgg 1061701 tctcggtgcg tggggtttcc gactgttgga tcgccgtatg cgctgctgcg gccgccgcgc 1061761 ccgcggccat cgccgagatc agcaggcgcc cccgggccgc accgatcggt tgcttgcggt 1061821 gctgcccgac gcgccgggac accggggtga cctccggggt cagcacgacc gtgggggcta 1061881 ccagccattc gggggccaga tcgtcggcgt cgtccaggtc gtcgagttct ggagccgcta 1061941 gcaactgcgc ttcgtagtcg aagacgcagt cgtcccctag gtccaggtca tcgagctctg 1062001 cgaaatccag ctcatcgtag agtgctaagc cgtcgaggaa tccgtccagc gggatgattt 1062061 cggtgacttc gttacggtga tgatgcggcc aacgatcgcg aggtgtgcga atcgctgcca 1062121 tggcagcaga acgggcgata cggtgctggg acaaatctga aatgtcctcg gatcgtgacc 1062181 ataacgttat ctggaccctg agacgttatc cgcaaccgga tggtagtggc aacttcagcg 1062241 cggaattcgg ctgtgattgt gagttggatc acgtttcggc tggacaaaca tatcggtgag 1062301 ctgtgccaca ccgggtggat gcggccgcgg agttaatcgg cggtctcgat acagttctcc 1062361 gtgcgagtcg ccgatttcgg caccgcctac ctattggtcg agcagtaagc cgagcgaaga 1062421 cggtgagccc atggatcttt tcgagtatca agccaaggag ttattcgcca agcacaacgt 1062481 gcccagcacg ccgggtcggg tgaccgacac agccgagggt gccaaggcta tcgccacgga 1062541 gatcgggcgt ccggtgatgg tcaaagcgca ggtcaagatc ggcggccggg gcaaggccgg 1062601 tggcgtcaaa tacgccgcga ccccacaaga cgcgtacgag cacgccaaga acatcctcgg 1062661 cctggacatc aaaggacaca tcgtcaagaa actgctggtc gctgaggcta gcgatatcgc 1062721 cgaggagtac tacctatcct tcctgctcga ccgggccaac cgcacctacc tggcgatgtg 1062781 ctcggtggag ggcggcatgg agatcgaaga ggtagcggcc accaaacccg agcggctcgc 1062841 caaagtcccg gtgaatgccg tcaagggcgt tgacctagat ttcgcgcggt ccatcgccga 1062901 acagggtcat cttccggccg aggtgctcga caccgcagcg gtcaccatcg ccaagctgtg 1062961 ggagctcttc gtcgccgagg acgcgacgct ggttgaggtc aacccgttgg tgcggacgcc 1063021 tgaccacaag atcctcgcgc tggatgccaa gatcaccctc gacggcaacg ccgatttccg 1063081 tcagcctggc catgccgagt tcgaggatcg agctgccacc gatccactgg agttgaaggc 1063141 caaggagcac gacctcaact acgtcaagct ggacggtcag gtggggatca tcggcaatgg 1063201 cgcgggcttg gcgatgtcga ctctcgacgt cgtcgcgtat gccggtgaga agcacggcgg 1063261 agtcaagccg gccaacttcc tggatatcgg cggcggcgct tcggccgagg tgatggccgc 1063321 gggtctggac gtggtgctgg gcgaccagca ggtcaagagc gtgttcgtca acgtcttcgg 1063381 tggcatcacc tcgtgcgatg cggtggcgac cgggatcgtc aaggcgctgg gcatgctggg 1063441 tgacgaagcc aacaagccgc tggtggttcg gctcgacggc aacaacgtcg aggaaggccg 1063501 tcgcatcctg accgaggcca accaccccct ggtgacactg gtggcgacga tggacgaagc 1063561 cgccgacaag gccgctgagc tggcgagcgc ctgagcgaaa ggacccatga ctcacatgtc 1063621 catatttctg agcagggaca acaaggtcat tgtgcagggc atcaccggca gtgaggccac 1063681 cgtccatacc gcgcgaatgc tgcgggcggg cacgcaaatc gtcggcggtg tgaacgcacg 1063741 caaagcgggc accaccgtca cgcatgagga taagggcggc cggctgatca agctgccggt 1063801 gttcggcagt gtcgcggagg cgatggaaaa gaccggcgcc gatgtgtcga tcatcttcgt 1063861 gccgccgacg ttcgccaagg acgccatcat cgaggccatc gacgccgaaa ttccgctgtt 1063921 ggttgtgatc accgagggaa ttccggtgca ggacaccgcc tatgcctggg cctacaacct 1063981 cgaggctggc cacaagaccc gcatcattgg ccccaactgt cctggcatta tcagtcccgg 1064041 tcagtcgctg gccggtatca cgccggccaa catcaccgga cccggtccaa ttggtctggt 1064101 gtccaagtcg gggacgttga cctaccagat gatgttcgaa ctgcgcgacc ttggattctc 1064161 cacggcgatc ggcatcggtg gtgatccggt gattggcact acccacatcg acgccatcga 1064221 ggccttcgag aaggatccgg acaccaagct catcgtgatg atcggcgaga tcggtggtga 1064281 cgccgaggag cgggccgcag acttcatcaa gaccaacgtg tccaagccgg tcgtcggcta 1064341 tgtcgccgga tttaccgcac ccgaaggcaa gacgatgggc cacgccggcg ccatcgtctc 1064401 cggctcgtct ggcacagcgg cggccaagca agaggccctg gaggccgccg gtgtgaaggt 1064461 cggcaagacc ccatcggcga ccgcggcgct ggcccgggag atcttgctca gtctctaggg 1064521 cgagcagacg cataagcccc cgcacgctcg gcgtgtcggg ggcttatgcg tctgctcgcc 1064581 ctatacgcaa caggccaact tggcggccag ccgctccacg tacgcggctg cgtcgtctgc 1064641 agacctgtcc ggcataccga acagcacctc cgtaacgcca agctcggccc agcgcgccag 1064701 cttgtcgggc accggtttga cgtccagggc cacgatctgt ggaagcccgt cgcggccggc 1064761 ggccgcccag atgtcttgca gtaacttcac cggctcgtcg atgtcgacgt cgcgtggagt 1064821 ggtgatccag ccgtcggcgc tgcgcgcgat ccacttgaag ttcttctccg tccccgcagc 1064881 gcctaccagc accgggatgt gcggctgcac cggcttgggc caggcccagc taggtccgaa 1064941 cttgacgaac tcgccgtcat agcaggcctc ctcttgggtc cacaacgccc gcatcgcctc 1065001 gaggtattcg cgcagcatgg tgcggcggcg tccgggtggc acaccatgat cgacgagctc 1065061 gtcggtgttc cagccgaacc cgaccccgac gctgacccgg ccgtgcgaca aatgatccag 1065121 cgtcgcaatg cttttcgcca gcgtgatcgg atcatgctcg accggcagcg ccaccgcggt 1065181 ggcaagccgg atccgcgacg tcaccgccga tgctgctccc aggctcaccc acgggtccaa 1065241 cgtgcgcata tagcggtcgt ccggcagcga agcgtcaccc gtcgtcggat gggccgcctg 1065301 gcgcttgacc gggatgtggg tgtgttcggg cacgtaaaac gtgcgaaacc cgtggctttc 1065361 agcaagtctg gcggccgcgg ccggggtgat gccgcggtcg ctggtgaaca gcacaagtcc 1065421 gtagtgcatg caccgaatta gaacgtgttc cacctgcgcc gggcaagcgg ccgtccagtc 1065481 gttaatgtcg cgagcgccgg tcgctccggc agcggcaccc gaacgtgcgc tagcgtggtt 1065541 gatcgaatcg cgtcgccggg agcacagcgt cgcactgcac cagtggagga gccatgacct 1065601 actcgccggg taaccccgga tacccgcaag cgcagcccgc aggctcctac ggaggcgtca 1065661 caccctcgtt cgcccacgcc gatgagggtg cgagcaagct accgatgtac ctgaacatcg 1065721 cggtggcagt gctcggcctg gctgcgtact tcgccagctt cggcccaatg ttcaccctca 1065781 gtaccgaact cggcggaggt gatggcgcag tgtccggtga cactgggctg ccggtcgggg 1065841 tggctctgct ggctgcgctg cttgccgggg tggctctggt gcctaaggcc aagagccatg 1065901 tgacggtagt tgcggtgctc ggggtactcg gcgtatttct gatggtctcg gcgacgttta 1065961 acaagcccag cgcctattcg accggttggg cattgtgggt tgtgttggct ttcatcgtgt 1066021 tccaggcggt tgcggcagtc ctggcgctct tggtggagac cggcgctatc accgcgccgg 1066081 cgccgcggcc caagttcgac ccgtatggac agtacgggcg gtacgggcag tacgggcagt 1066141 acggggtgca gccgggtggg tactacggtc agcagggtgc tcagcaggcc gcgggactgc 1066201 agtcgcccgg cccgcagcag tctccgcagc ctcccggata tgggtcgcag tacggcggct 1066261 attcgtccag tccgagccaa tcgggcagtg gatacactgc tcagcccccg gcccagccgc 1066321 cggcgcagtc cgggtcgcaa caatcgcacc agggcccatc cacgccacct accggctttc 1066381 cgagcttcag cccgccgcca ccggtcagtg ccgggacggg gtcgcaggct ggttcggctc 1066441 cagtcaacta ttcaaacccc agcgggggcg agcagtcgtc gtcccccggg ggggcgccgg 1066501 tctaaccggg cgttcccgcg tccggtcgcg cgtgtgcgcg aagagtgaac agggtgtcag 1066561 caagcgcgga cgatcgggcg gccggcgctc gtccagctcg cgacctcgtc agggttgcgt 1066621 tcggcccagg tgtggtggcg ttgggcatca tcgccgcggt gacgctgctc caattgctga 1066681 tcgccaatag cgacatgacc ggtgcgtggg gcgccatcgc cagcatgtgg ctgggcgtgc 1066741 acctggtgcc gatctcgatc ggtggccgcg cactgggcgt catgccgctg ttgccggtcc 1066801 tgttgatggt gtgggccacc gcgcgcagca cggcgcgggc cacatcccca cagtcgtcag 1066861 ggctcgttgt tcgctgggtc gtcgcgtcgg ccctgggcgg accgctgctg atggcggcga 1066921 ttgccctggc ggtcattcac gacgcgtcat cagtggtcac cgagctgcag acgcccagcg 1066981 ccctgcgcgc gttcactagt gtgctggttg tgcattccgt tggggccgcg accggggtgt 1067041 ggtcccgggt aggtcgacgg gcgctagccg ccacggcact gcccgattgg ctgcatgatt 1067101 cgatgcgtgc cgccgccgct ggggtgctgg cgttgctcgg gctttccggc gtggtgacgg 1067161 cggggtcgct ggttgtgcat tgggcgacga tgcaagagct ctacgggatc accgattcga 1067221 tattcggcca gttcagcctc actgtacttt cggtgcttta cgcacccaac gtcatcgtcg 1067281 gcacctcggc catcgcggtt gggtccagtg ctcacattgg cttcgcgacg ttcagttcgt 1067341 ttgcagtttt gggcggcgat atcccggcac tgccgatcct ggccgcggcc ccgacgccgc 1067401 cgctcggccc ggcatgggtt gccttactca ttgtgggtgc ttcgtcgggt gtggcggtcg 1067461 gtcagcagtg cgcccgccgc gccctgccgt ttgttgcggc tatggccaag ctgctggtcg 1067521 ctgccgttgc cggggcattg gtaatggcgg ttctgggtta cggcggtggc ggccggctgg 1067581 gcaatttcgg cgatgtcggc gtggacgagg gcgccttggt gttgggcgtg ctcttctggt 1067641 ttacgttcgt aggatgggtc acggtggtga ttgccggcgg gatcagccgc cgccccaagc 1067701 ggctccggcc ggccccgccg gtcgagctgg acgccgatga atcttcgcca ccggtagaca 1067761 tgttcgacgg ggcagcgagc gagcagccgc ccgcttcggt cgcggaagac gtcccgccta 1067821 gccacgacga catcgccaac ggcctcaagg cccctactgc cgacgacgag gcgctgccct 1067881 tgtccgacga accgccgccg cgggccgact aatctgcggt tggtgaggcc gcaactgtct 1067941 gaggccttta ctcacggtac tgagtctgca ctgggatgca ggctggtggt gctcacacgc 1068001 tttgaggagc cagactaggc tcgccgtgtg caggaaccgc ttcgtgtacc cccgagtgca 1068061 cctgcgcggc tggtagtact cgcgtctggc accggttcgt tgctgagatc tctactcgat 1068121 gccgctgtcg gcgactaccc ggcacgggta gtcgccgttg gtgtggatcg cgaatgccgg 1068181 gccgccgaaa tcgccgcgga agcatcggtg ccggtgttca ccgttcggct cgccgaccac 1068241 cccagtcgcg atgcctggga cgtcgccatc accgccgcca ccgcagccca tgagcccgac 1068301 ctcgtcgttt ctgcgggctt tatgagaatc cttggaccgc agttcctttc acgattctac 1068361 gggcgcaccc tcaacaccca cccggcgctg ctgccggcct tccccggcac gcacggtgtc 1068421 gctgacgcgc tggcctacgg ggtgaaggtc accggcgcta cggtgcacct ggtagacgct 1068481 ggcacggaca ccgggccaat actggcgcag caacctgtgc cggtgctcga cggtgacgac 1068541 gaagagactt tgcatgaacg aatcaaggtc accgaacgac ggctgttggt agcggcggtg 1068601 gccgcactgg ccacccacgg cgtgacggtg gtcggacgaa cagcgacgat gggacgaaag 1068661 gtaaccatag gatgagcacc gacgacggaa gacggccgat ccgccgtgcg ctgatcagcg 1068721 tgtacgacaa gaccgggctg gtagacctgg cacagggcct gagcgcggcc ggcgtcgaga 1068781 tcatctcgac tgggtcaacg gccaagacca ttgccgacac cgggattccg gtgacccccg 1068841 tggagcagct gaccggcttt cccgaggtgc tcgatggccg ggtcaagaca ctgcacccgc 1068901 gagtgcatgc cgggctgctg gctgacctgc gcaagtccga gcacgccgcg gccctcgagc 1068961 aactcgggat cgaggctttc gaactcgttg tagtcaactt gtatccgttc agccagaccg 1069021 tcgaatccgg cgccagtgtc gacgactgcg tcgagcagat tgatatcggc gggccggcga 1069081 tggtgcgggc cgccgccaaa aaccatccca gcgcggcggt ggtcaccgat ccgcttgggt 1069141 accatggcgt gcttgccgca ctgcgcgccg gcggattcac cctcgccgag cgcaaaaggc 1069201 tggcgtcgtt agcgtttcag catatagccg agtacgacat cgccgtcgcg agctggatgc 1069261 aacagaccct agcgcccgaa catcctgttg ccgcctttcc gcagtggttc ggccgaagct 1069321 ggcgccgcgt ggcgatgctg cgctacggcg agaacccgca ccaacaggcc gctctctacg 1069381 gcgaccccac cgcctggccg gggctggccc aggccgagca actgcacgga aaagacatgt 1069441 cctacaacaa cttcaccgat gcggacgcag cctggcgggc cgccttcgac cacgaacaaa 1069501 cgtgcgtggc gatcatcaag cacgccaacc cgtgcggcat cgcaatctcg tccgtttcgg 1069561 tcgccgacgc gcatcgcaag gctcacgaat gcgatccgct gagcgcctac ggcggggtca 1069621 tcgccgccaa taccgaggtc agtgtcgaaa tggccgagta tgtgagcacc atcttcaccg 1069681 aagtcatcgt cgcgcctggc tacgcccccg gggccctcga tgtgctggcc cgcaagaaga 1069741 acatccgggt gctggtagcc gccgagccac tggccggtgg cagcgagttg cgtccgatca 1069801 gcggtggact gctgatacag cagagcgacc agcttgacgc gcacggtgac aacccggcga 1069861 actggacctt ggcgaccggg tcacctgcgg accccgcgac gctgaccgac ctggtcttcg 1069921 cgtggcgagc ctgccgtgcg gtcaagtcga acgcgatagt gatagctgcc gacggcgcca 1069981 ccgtcggcgt cgggatgggt caggtcaacc gtgtcgacgc cgcccggttg gccgtcgaac 1070041 gcggcggcga gcgggttcgc ggcgcggtgg cagcctcgga tgcgttcttc ccctttcccg 1070101 acggcctgga aacgttggcc gccgcggggg tcaccgcggt cgtccacccc ggtggctcgg 1070161 tgcgcgacga ggaagtgacc gaagcggcgg ccaaggccgg tgtcacccta tatctcaccg 1070221 gggcgcggca cttcgcgcac tgaggccgct ggccgcgaca gtgaaatcca cgacgtgaca 1070281 cgccggaaac gcgtcgtgac attcactctc gtggccagaa gaaagacggc gtcgtagcgt 1070341 ggaacggtga tgtcacccag taacctgccc cgcaccgtgg gcgagctgcg tgccgccggt 1070401 catcgggaac ggggggtcaa gcaggaaatc cgggaaaatc tgctgaccgc gctggccgac 1070461 ggcgacaacg tctggccggg catcctgggt ttcgacgaca ccgtgattcc ccaggtggag 1070521 cgggccttga tcgccggtca cgactttgtc ctgctcggcg aacgcggcca gggcaagacc 1070581 cggctgctgc gcgcactcgc gggtctgctg gacgagtgga cgccggtgat cgccggcgcc 1070641 gaactgggcg agcaccccta cacgccgatc acgccggagt cgatccggcg ggccgcgcag 1070701 ctcggcgacg acctaccggt ggcgtggaag caccgcagcg agcgctacac cgagaagctg 1070761 gccacccccg acaccagcgt cgccgacctg gtcggcgacg tcgacccgat caaggttgcc 1070821 gagggccgca gcctcgggga tcccgaaacc atcgcctacg ggctcatccc gcgggcgcac 1070881 cgcggcatcg tcgcggtcaa cgagctgccc gacctcgccg aacgcatcca ggtgtcgatg 1070941 ctcaacgtca tggaggagcg cgacatccag gtccgcggct acacgctgcg gctgccgctg 1071001 gatgtgttgg tggtcgccag cgccaacccc gaggactaca ccaaccgtgg ccgcatcatc 1071061 acgcccatca aggaccggtt cggcgccgag atccgcaccc actacccact ggagctggag 1071121 gcggagatgg gcgtcatcgt ccaggaggcg cacctgagtg cacaggtgcc cgactacctg 1071181 atgcaggtgc tcgcgcggtt tgcccgttac ctgcgagaat cccgctcgat cgatcagcgc 1071241 tccggggtgt cggcgcggtt tgccatcgca gcggccgaaa ccgtggcggc tgccgcccgg 1071301 caccgcgggg cggtgctggg ggagacagac ccggtggccc gggtggtcga tttgggcacg 1071361 gtgatcgacg tgctgcgcgg caagctggaa ttcgagtccg gcgaggaggg ccgcgaacag 1071421 gcggtgctcg agcatctgtt gcgtcgcgcc accgccgata ccgcgtcccg ggtgctgggc 1071481 ggtatcgacg ttggctcgtt ggtgaccgcg gtcgagggcg gttcggcggt gacgacgggc 1071541 gagcgggtct cggccaagga tgtgctggcg gcggtgccgg gcctgccggt ggtggacagg 1071601 atcgcgcgca agctgggcgc cgaatccgag ggggagcgtg ccgcggcact ggaactggcg 1071661 ttggaggcgc tatacctggc caagcgcgtt gacaaggtct gcggggaggg ccagaccgtc 1071721 tatggctaag tctgatggtg acgacccgct gcgcccggct tcgccgcgct tgcgatcgtc 1071781 acgacggcac tcgctacgct actcggcgta caccggcggg cccgacccgc tggccccgcc 1071841 ggtggatctg cgggatgcgc tggaacagat tggccaagac gtcatggcgg gcgcctcgcc 1071901 gcgccgggcg ctgtccgagc tgctgcggcg gggcaccagg aacctgaccg gcgccgaccg 1071961 gctggcggcc gaggtgaacc gccgccgacg ggagttgttg cgccgcaaca acttagatgg 1072021 caccttgcag gagatcaaga agctgctcga cgaggccgtg ctggccgaac gcaaggagct 1072081 ggcccgcgcg ctagacgacg acgcccgctt cgccgagctg cagctggacg cgcttccggc 1072141 ctcgccggcc aaggcagtac aggagctggc cgaataccgc tggcgcagcg ggcaggcccg 1072201 cgaaaagtat gagcagatca aggatttgct cggccgtgag ctgctcgacc aacgctttgc 1072261 cggcatgaag caggcgcttg ccggtgccac cgacgacgat cgccggcggg tcaccgagat 1072321 gctcgacgac ctcaacgacc tgttggataa gcacgcccgc ggtgaagata cgcagcggga 1072381 cttcgacgag ttcatgacca agcacggcga gttcttcccg gagaacccgc gcaacgtcga 1072441 ggagctgctg gactcgctgg ccaagcgagc cgccgccgcg cagcggttcc gcaacagcct 1072501 gagccaggaa cagcgggacg agctggacgc gttggcgcag caggcatttg gctctccggc 1072561 gttgatgcgg gcgctggacc gtttggatgc gcatctgcag gccgcccgtc ccggcgaaga 1072621 ctggaccggc tcgcagcagt tctccggtga taatccgttc ggcatggggg aaggcaccca 1072681 ggcgctggcc gacattgccg agctggagca gctggccgag cagctgtcgc agagctatcc 1072741 gggcgccagc atggacgatg tcgacctgga cgcgctggcc cgtcagctcg gcgaccaggc 1072801 cgccgtcgac gcccggacgc tggctgaatt ggaacgcgcg ctggtcaatc agggcttcct 1072861 ggaccgcggt tccgacggcc agtggcggct ctcgccgaag gccatgcgcc gcctcggcga 1072921 aacggcgtta cgcgatgtgg cgcaacaact ttccgggcgc cacggcgagc gtgatcaccg 1072981 gcgtgccggc gccgcgggcg agctgaccgg tgcgacgcgg ccctggcagt tcggcgacac 1073041 cgagccgtgg cacgtcgccc gcacgctgac caatgccgtg ctgcgccaag ccgcggccgt 1073101 gcatgaccgc atccggatca ccgtcgagga tgtcgaggtc gccgagaccg aaacgcgcac 1073161 ccaggccgct gttgcgttgt tggtggacac ctcgttttcg atggtgatgg agaatcgctg 1073221 gttgccgatg aagcgcacgg cgctggcgct gcaccacctg gtgtgcaccc ggttccgctc 1073281 ggatgccttg cagatcatcg cgtttgggcg ctacgcccgc acggtgacgg cggccgagct 1073341 gacggggttg gcgggtgtct acgagcaggg caccaacctg caccatgcgc tcgcgctggc 1073401 cggccggcac ctgcgccggc acgcaggcgc ccagcccgtg gtgctggtgg tgaccgacgg 1073461 cgagccgacc gcccacctgg aggacttcga cggcgacggt acgtcggtgt tctttgatta 1073521 cccgccccat ccgcgcacca tcgcccacac cgtgcgcggg tttgacgaca tggcgcggct 1073581 gggtgcgcag gtgacgatct tccggttggg cagtgacccc ggtctggctc ggttcattga 1073641 ccaggttgcg cgacgggtgc agggccgcgt ggtggtgccc gatctcgacg ggctgggcgc 1073701 ggcggtggtg ggcgactacc tgcgcttccg gcggcgctag tttgttgcaa tcatggtgct 1073761 agcatcgtgc tagcaatatg ctaacatagt gcgatgaaga cgctgtatct gcgcaatgtg 1073821 ccggacgacg tggtcgagcg actcgagcgc ctcgccgaac tcgccaagac gtcggtgtcc 1073881 gcggttgctg tgcgtgagct caccgaggct tctcgccgcg ccgacaatcc ggcgcttctt 1073941 ggggacttgc ccgatatcgg catcgacacg accgaactga tcggtggtat cgacgccgag 1074001 cgcgccggtc gatgatcgtc gttgacgcct cggccgcgct ggccgcgctg ctcaacgatg 1074061 gacaagctcg acaattgatc gctgccgagc gcctgcatgt cccgcatctg gtcgattcgg 1074121 aaatcgcgag cgggctccgc aggctagcgc agcgggatcg gctgggcgcg gccgacggac 1074181 ggcgggccct ccaaacgtgg cgccgcctcg cggtgacgcg ttatccggtg gtgggccttt 1074241 tcgagcgtat ctgggaaatc cgcgcgaacc tgtcggcata cgacgccagc tatgtggcct 1074301 tggcggaagc cctgaactgt gcgctcgtca cagcggatct gcggctcagc gacaccggcc 1074361 aagcccagtg tccgattacc gttgtgccca ggtagccgtg gcacggatgt tcgaggatcc 1074421 gtatatcaca acgcgatagg tcctgttgac acaagggaag cgcggggcgc cgtcggcggt 1074481 tcgtctcgtc gaaatgcgac aacaacgccg tgcgcggcac atcccagttt gtgagacact 1074541 gtgcgcgtgc cctcgcagtg gatgatctca tcccgggtaa cggtagcctg gaacatcgtc 1074601 ggctacctcg tgtatgcggc cctggctttt gtcggcgggt ttgcggtttg gttctcctta 1074661 ttcttcgcga tggccaccga tggttgtcac gactcagctt gcgacgcaag ctatcacgtg 1074721 ttcccggcca tggtcaccat gtggatcgga gttggcgcgg tcttgctgct caccttggtg 1074781 gtcatggttc gcaactcgtc gcgaggcaac gtcgtgatcg gatggccttt tgttgggttg 1074841 ttggcgcttg gccttgtcta cgtggctgcc gatgcggtct tgcactgatc gacgtggggt 1074901 tctgcgtcag taggcgtcgc gggttcggcc gccgggggat ccgtacaggt acgggtagtg 1074961 cacgtcgggg tcgttggccg gtcgcatgtt cagcggtggc ggcgcggtgc gccaggcggc 1075021 cgggagatgg caaccggtgt tgtaggagta gccgagtggg ccgcggttgt cgccattgac 1075081 gaggtttatc gtgacgccgt catcgcggat ttgagagtaa ttcggggcgc ccagcggctg 1075141 tggcgggtcg ccgggtacgg agttgttggg ccgaaaaccc gccgccttga acaccggggc 1075201 tagctcggtg acaatttgca gccattgctg cggagtgggc gctggacggc caaagaataa 1075261 ctcgcttgcc tcttgtcgac cgatggtgcg ggtgaacggg tcgttgcagc cgttggtgag 1075321 atggctcact gtgacgcctg ttgagaaccg ggtctgcggt gaatatttcg cgatcatcgc 1075381 cctgatggtg gcgtcgaggt tggcgagctg ctgctggact gtctcaagat cgggtcgtcc 1075441 gttgacgatt ttctgccggc ggtccagctc gccccgcccg ggattggcgt aagggtcgaa 1075501 cgtattgggt ttgatacacc cggccagcag tgcggcgatg ccgaggagcg ccgcggtcag 1075561 gctgcgggat gttcgcttca tgggtgatag ttcgggttgg gtattggcaa tccgaacggg 1075621 ccgggcccac tgggcatcgt cggtggcggc agcacagggg gtttgatgag gtcgtcgggc 1075681 aatcccgcca gcacggcggc caagttgtaa ccactcatcc gaagctgatc attgtctccg 1075741 ttacgcgcgt actcagagtg tccttatgct cgctcgtgaa gttggccgtc gcccaagagc 1075801 gggccgggcg ccaaaccggt gttgaccgac agttgtgtca tgccgggtac gtcctggggt 1075861 gcggagccga atgcgccgaa ttcggggatg gtattggcga catggtcgtt gacgccgatc 1075921 atgtagaagg catgccctgg ctcaacgccc agctgcgacg cgtgcgtgag ctcggtgccc 1075981 ggtgaaccgt acaaaacgac gtcgctgacc ggtgccccct gctgcagcgc cagactggtt 1076041 accagggatc cgtaggaatg cccgaacgcg gtgatgtgct gatcgctgac attcgtggta 1076101 gcggccaagc ctttgtcgaa gcggttcaac ggccccgcgg catcgcgagc cgaccagtcg 1076161 tgcatcacgt ctttgaggcc gtccggcgcg tcatagccca gccacgcaat ggatgccacc 1076221 gcatcgtaat ttggccatcc ggctcgttcc ctcagttcgg ctgcctttgc gcgctgaatt 1076281 ccagcttcct tgaccatgtc cccaacgctc gaactcaccc gcgtgttcag gccgcccatc 1076341 gtgacgccga cgcgttcggc gttgtcgacg tcgccaactc ccacagccgc caacaccttt 1076401 cgcgggtcac tggcggtgtc caacagaatg aggctggtgc cgggatgggc tgccagagta 1076461 tcccgcaacg ctcgcagatc cgccagcttg tcggtgtcgg tgtgccagac tccatctctg 1076521 ctcaaccagc cgttctgcag ccgggtgagt tctcgttgca gcactgaaag attcagttcg 1076581 ttgcgaacgg cgatgggaat gccgtcgcga ttacgcaggg tattggggaa ccattgcttg 1076641 acccgatcct gctggcccgg ggtcagcgaa tgccaccacc gcttgacctc ctcagggtcg 1076701 ctgtccggcg gcggcatctg cggcattgtg ggtggcgcat ggctgagttg ggcattgacc 1076761 tgctcgcgcg acaagtcccc ggcggcgcat cgaatcgccg cggccagatc ctcatcggcg 1076821 gtctcggcgt cggccagcag acgtttgatg ccctccggat tgcggtgttg aggatggcct 1076881 gctgatcggc cggggaatac gacgacaagt cgggtggtgg caacgccgta ccggtcgcgt 1076941 aagcgatcgt caggtgatgc tcacgcgcgg catcgcggat cacttgtagc cgcatcttga 1077001 tcgcggcgac ctcctcggcc gccttttccg cggcccgcgc gaccgcttca cacgcgccgg 1077061 catggtgatc gagcagcacc gttgtgtgat gggttgctac ctgtgccgcc tcggcggccg 1077121 caccgccaaa gccgagcagc cccatggtgt cacgcagcgc cgccgatgcg gtgcgtgtgc 1077181 cgtgcgcgcg gtcgatcgcg gcctgaacac cgtcctgatc gctgtgggat cccagcgctc 1077241 aatgtcagct aacgtcaacg ccgtcgcatc gggcgcttcc accgcctgtt ggataaaccg 1077301 cggccagtgc cgcggcgttg tgttcctcca tctcggcgaa cccgaccgcg gccagatgca 1077361 tgccgtaaga atggtccccg atgtgggccg cgtgggcggt gctggcctcc gcccagctat 1077421 ccagcaatcc cgacagcgcg cccgccgacg aaccgaccca tcccggccgc gccccttcgg 1077481 ccgcacccag gcagcagtgg tgtgacgtca gcaaagactc gccgtggtca gcctgctgat 1077541 aaccgacctg ggacaggact tcaggaatcg cccgcaacgt tctgccatac caactcgctt 1077601 ccacacgaac caaactttcg gcggagtatg gcacacgagc acattgcggg cgattcaccc 1077661 gcatcgagct gaccgggcgg cgcaccttgc tatttgcggc tatttgcgtg gcttgcgggg 1077721 tttgcgcttg atgcccacat caccccccag cgagaagccg cggatcctca ccgtcggcgc 1077781 gccacgggtg ccctccccga ccaccttgcg gtcgaagccg cccatcactc ggtgaccgtg 1077841 gatctccacg ttgacttcgg gtggcagcag aattgtctgc gcccccatga tcgagtacgc 1077901 acggatgtcc acctcggtcg aggtgaagtc ggcgtagcgc agatccagca ccccgctgcc 1077961 ccacaaggtg aacgtggtca gcttcttcgg cacgttccag cggccgcgtc gttcgaatcc 1078021 gcccagtagc gccagcagca gcgtggacgg cgccggattg cattcgccac ccctgcgcgg 1078081 gcctatcgcc gcccccggca gatcggcccg cagccgatcc agctcctggt aggtggttgc 1078141 cgcataggcc cgcgccagcc ggtcttcata atcggtcagc tgcaggcggc cctgctcggc 1078201 cgcgtaggcc agcaactgcg caatctgtat ccggtcggtg tccgacgcac gcgcggactc 1078261 gtcgcgcgag ttcctcgcgt cacgctgcgc cgagttgctc atcgtccacg agcctacgac 1078321 gtcaagaatt tgcttcaaga ggtgttggcg aaactgcaaa tgttgccagg ttcgactcct 1078381 tgggtagccc acccccagtg gggtgggata ccatgaacgg gtgagggatt aggggcaagc 1078441 catgagcaag gaattgaccg caaagaagcg cgcggcgctg aaccggctga agacggttcg 1078501 gggccatctt gacggaatcg ttcggatgct ggagtccgac gcctactgcg tggacgtgat 1078561 gaagcagatt tcagcggttc agtcctcgct ggagcgggcc aaccgggtga tgctgcacaa 1078621 ccacttggag acgtgctttt ccacggcggt gctggatggt catgggcaag cggccatcga 1078681 agagctcatt gatgccgtca aattcacgcc ggcgctgacc ggtccacacg cgcggctcgg 1078741 cggtgccgcg gtcggcgagt cggccaccga ggagccgatg ccggatgcca gcaacatgtg 1078801 acgagcgccg gactccggtg tttctcggga caacgacata cgaaaggagc atccgcgatg 1078861 gtgtggcatg gattcctagc gaaggcggta cccaccgtgg tcaccggcgc ggtgggggtc 1078921 gcggcgtatg aggcgctgcg caagatggtg gtgaaggctc cgctgcgggc ggcaaccgtg 1078981 tccgttgccg cctggggcat acgcttagca cgtgaagccg agcgcaaggc cggggagagc 1079041 gccgagcaag ctcgactgat gttcgccgac gtgctagccg aagccagcga gcgcgccggg 1079101 gaagaagttc caccactggc ggtggcgggt tcggacgacg gtcatgacca ctgacgttct 1079161 ttctgacacc gacgtctcgc tgaaggtggt ctccaacgcg tcggggcgga tgcgcgtgtg 1079221 cgtcaccggg ttcaatgtcg atgcggttcg ggccgtcgcg attgaggaga cggtctccca 1079281 agtgaccggg gtgcacgccg tgcacgccta tccgcgaaca gcgtcggtgg tgatctggta 1079341 ctcgccagag ctcggtgaca ccgccgccgt gctgtcggcg atcaccaaag cgcagcacgt 1079401 cccggcagaa ttggtgcccg cccgtgcccc gcactcagcg ggtgtgcgcg gcgtgggcgt 1079461 ggtgcggaaa atcaccggcg ggatccgccg catgctaagt cgcccgccgg gcgtcgacaa 1079521 gcccctgaag gcgtcgcgtt gcggcggccg cccgcgcggg ccggtccgcg ggagcgcctc 1079581 gtggccgggc gagcagaacc ggcgcgagcg gcggacgtgg ttgccgcggg tgtggttggc 1079641 cttgccgttg gggctactgg cgctgggttc gtcaatgttc ttcggtgctt acccgtgggc 1079701 ggggtggctg gccttcgccg cgacgctgcc ggtgcaattc gtggccgggt ggccgattct 1079761 gcggggggcg gtgcaacagg cgcgggcgtt gacctcgaac atggacacgc tgatcgcgct 1079821 gggtacgctg accgcgtttg tctactccac gtatcagttg tttgccggtg gacctctgtt 1079881 cttcgacacc tcggcgctga tcatcgcgtt cgtggtgttg ggccgccatc tcgaggccag 1079941 agcaaccgga aaagcgtccg aggcgatcag caagctgctg gagctgggcg ccaaggaagc 1080001 cacgctgctt gtcgacggcc aagagctcct ggtgccggtc gatcaggtcc aagtcggaga 1080061 cctggtgcgg gtgcggcccg gagagaagat cccggtcgac ggtgaggtca ccgatgggcg 1080121 cgccgccgtc gacgagtcga tgctcaccgg cgaatccgtc ccggtcgaga agacggcggg 1080181 tgaccgcgtt gccggcgcaa cggtcaacct cgacgggctg ttgaccgtgc gcgccaccgc 1080241 cgtcggggca gacaccgcgc tggcgcagat tgtgcgactg gtcgagcagg cacagggcga 1080301 caaggcgccg gtgcagcggc tggccgaccg ggtttcggcg gtgtttgtcc cggccgtcat 1080361 cggcgttgcc gtcgcgacct ttgcgggatg gacactgatc gccgccaacc cggtggctgg 1080421 tatgaccgcc gcggtcgcgg tgctgatcat cgcgtgcccg tgtgcgttgg gcctggctac 1080481 ccccacggcc atcatggtcg gcaccggccg gggcgccgaa ctggggatcc tggtcaaggg 1080541 aggcgaggtg ctggaagcgt cgaagaagat cgacaccgtg gtgttcgaca agaccggcac 1080601 cctcacccgc gcccggatgc gggtgaccga tgtgattgcc ggccagcggc gccagcctaa 1080661 tcaggtgctg cggctcgccg ccgcggtcga atcgggctcc gaacacccca tcggtgcggc 1080721 gatcgttgcc gctgcacacg agcgcgggtt ggcgataccg gccgccaatg cgttcaccgc 1080781 cgtcgccggg cacggggtgc gggcgcaggt caacggcggg ccggtggtgg tcggacggcg 1080841 caagctcgtc gacgaacaac atttggttct gcccgaccac ctcgctgcgg cggccgtgga 1080901 gcaggaagag cgcggccgca ccgcggtgtt cgtcggccaa gacggccagg ttgtgggtgt 1080961 gctcgcggta gcggacacgg tcaaagacga cgccgcggac gtggtcggtc ggctgcacgc 1081021 catggggcta caggtagcca tgatcaccgg cgacaacgcc cgcacggctg ccgcgatcgc 1081081 caagcaggtc ggcatcgaga aggtgctggc cgaggtgttg ccgcaggaca aggtagctga 1081141 ggttcggcgg ctgcaggacc agggccgggt ggtcgcgatg gtgggtgacg gcgtcaacga 1081201 cgcgcccgcc ttggtacaag ccgatctggg cattgcgatc ggcaccggta ccgacgtggc 1081261 catcgaggcc tccgacatca cgctaatgtc cggccggctc gatggtgtcg tgcgcgcgat 1081321 cgaactctcc aggcagaccc tgcgcaccat ctaccagaat ctcggctggg ccttcggcta 1081381 caacaccgcc gcgatcccac tggccgcgct gggcgcgctg aacccggtcg tggcgggcgc 1081441 ggcgatgggg ttctcctcgg tcagcgtggt gaccaactca ctgcggttac gccgcttcgg 1081501 ccgcgacggc cgaaccgcat gatccatgac ctgatgcttc gttgggtggt taccggcctg 1081561 ttcgtgctga ccgccgccga atgtggtctg gcaatcatcg ccaaacgccg accgtggacg 1081621 ttgatcgtca accacgggtt gcatttcgca atggccgttg cgatggcggt gatggcctgg 1081681 ccgtggggcg cgcgggttcc gacgacggga cctgcggtat ttttcttgct ggcggccgtg 1081741 tggtttgggg cgacggccgt cgttgcggtc cgcgggaccg ctacgcgtgg actgtacgga 1081801 tatcacggct tgatgatgct ggccacagcc tggatgtatg ccgccatgaa tcctcgtttg 1081861 ctccctgtcc gctcgtgcac cgaatacgcc accgagccgg atgggtcaat gccggctatg 1081921 gacatgactg cgatgaacat gccgccgaat agcgggtcac ccatctggtt cagcgcggtg 1081981 aactggatcg gtacggtcgg cttcgcggtt gcggcggttt tctgggcatg caggtttgtc 1082041 atggagcggc ggcaggaggc gacccagtcc aggttgccgg gcagcatagg ccaagcgatg 1082101 atggcggccg gtatggcgat gttgttcttc gccatgctgt ttccggtctg aggcagttcg 1082161 ccgcctgtgt gtccgaaccg caaggtaatt cggaataggc tgttcccaac ctcctgcgtc 1082221 gtaggcgggg gcccggcggg cctagtcagc ggcccgcatc gtcgccggct ggacccagcg 1082281 gggcggacgt ttctgcagga aggccagcat cccttcgcgc gcttcgtcgg agacgaacag 1082341 cctggccgac tcctcggtca ggcgttcggc gtcgcggtcg aacccttcga gcacggcggc 1082401 cgtggtcagc gccttcgacg cggccaggcc ttgtggcgag ccgcggccca cgtcggcgac 1082461 cagcgcggcc accgcggcgt ccacgtcgtc ggccgccatg gtgatcagtc cgatgtcggc 1082521 ggcttcgcgg gcgccgaact tctcgccggt caggtaatag cgggccgcgg cgcgcggcga 1082581 aagcttgggc agcagcgtca gcgagatgat cgccggtgcc accccgatcc gtgcctcggt 1082641 cagcgcgaac gtgctttccg gtccggcgac caccatgtcg cacgcaccga ccaggccgaa 1082701 cccgccggcc cgcacatgcc cgttgatggc gccgaccacc ggcagcggcg actcgacgat 1082761 ggcgcgcaac agcgccgtca tttcccgcgc ccgcgccacc gccatccggt acggatcacc 1082821 accaccaccg ccggcctcgc tgaggtccgc gccggcgcag aacgttccgc cggtatgccc 1082881 cagcacgacc agccgcaccg ccggatctgc ttcggccgca ctcagccctt gatgtagttg 1082941 gctgaccagc gtgctcgaca gcgcgttgcg gttgtgcgga gagttcagtg tcagcctggc 1083001 gaaggggccg ccgcaggcgg ccgggccagc gtagtcgacg gggctgtcca tcagtaggac 1083061 cggggcagac ccagcgatgt ctgcgcaacg aagttcagca ccatctcgcg gctgatcggg 1083121 gcgatccgcg ccaagcgggc cgaggtcatc atcgctgcca cgccatattc cttggtgagg 1083181 ccgttgccgc ccatcgactg tacggcctga tcgaccgcgc ggctggatgc ctcggccgca 1083241 gcgtatttgg ccatgttggc cgcctcggcc gcaccgaagt cgtcaccatg gtcgtagagt 1083301 gtggcggctt tctgggtcat cagcttggcg agttcgacct caatgtggca ctgcgccaac 1083361 ggatgtgcca ggccctggtg cgcgccgatc ggggtggacc acaccttgcg ggttttgacg 1083421 tagtcgacgg ccctgccgag tgcgaaccgg cccatgccca ccgcgctagc cgcacccatg 1083481 atgcgctcgg ggttcaggcc cgcgaaaaga tgtgcgatcg ccgcgtcttc ggctccaacc 1083541 agcgcatcgg cgggtagccg gacgtcgtcg aggaaaacct ggaactggcg ttcggggctg 1083601 accagctcca tctcgatcgg ggtgtagctg aacccgggag cgtcggtggg caccacgaac 1083661 aacgcggggc gtagcttgcc ggttttggct tcctcgctgc ggcccacgac cagcaccgcc 1083721 tgcgcctggt cgatgccaga aataaagact ttctggccct tgatgatcca gtcgctgccg 1083781 tcgcgacgcg cggtggtggt gatcttgtgt gagttggagc cggcgtcggg ctcggtgatg 1083841 gcgaacgcca tggtcaacga gccgtcggcg atgcccggca accagcgctt cttctgatcg 1083901 tcggtgccga acttggcgat gatggttccg ttgatggccg gtgacaccac catcagcagc 1083961 agcgccgagc cggcggcggc catctcctcc atcaccagcg acagttcgta catgcctgcg 1084021 ccgccgccgc cgtactcttc gggcagattc acccccaaaa aaccgagttt gcctgcctcg 1084081 gcccataact cgctggtgtg ttcgtgtttg cgcgccttgt ccaggtagta ctcgtggcca 1084141 tagttggcca cccaagaggc caccgccttg cgcagcgcct gacgttcctc gctttcgata 1084201 aagctggtgt ctgtcacggt gaatctcctt ctgctgggcc attttgaggt gcttctactc 1084261 gtgcgagaat ggcgcctact tcgacctgtt gacccgtgtt gacgctgacg tgggtgagca 1084321 cgccgtcggc aggcgcggcg atggtgtgtt ccatcttcat ggcctccagc cagatcaacg 1084381 gctgaccggc cgtgaccgtg tcgccaacct cggcgccgat ccggatgacg ttgccgggca 1084441 tgggggccac cagcgagcct tgctcgacgg ccgagctcgg ctcggggaag cgtgacagtg 1084501 ccaccaggtg aacgggtccg cgcgccgagt cgacgtagac gtcggggccg tggcgggcaa 1084561 ccgtgaagcc gtgtgcgacc ccgtcctggg cgagcaccac ctggtccacg tcagccgaga 1084621 ccagctgtac caccggatcg ccgggaagcg ccagacccgt tctggtgaac cggtattcga 1084681 cgcggtgttc ggtgtccgcg tcgtcacgat aggtcttgac ctgatagccc gaggccaggt 1084741 tgcgccagcc gctgggaatc gagctgaaca cgcccgcgct cgcccgattg tgctcggcgt 1084801 cggccagcgc ggcggcgatc gccgacaacc ggagggtcgc ggtgtcggcc agcggtgtcg 1084861 acaactcggc catgccgtgc gtgtcgaaaa acccggtgtc ggtggcgccg tcgaggaacg 1084921 ccggatgacg cagcacgttg accaagagct cacggttggt gcgcagaccg tgcagccggg 1084981 cgcgtaccag cgcatcggcc aacacaagcg cggcctgccg gcgggtggca ccgtaggaga 1085041 cgaccttggc cagcattggg tcgtagtgga tcgacactgt ggaaccgtcg acgatcccgg 1085101 aatccagccg gatgccggtc cgctgtccca acgagtcgaa ctgcgcccga acccccggaa 1085161 cctcaatcgt gtgcatcacg cctgcctgtg gctgccagcc atgcgcggga tcctcggcgt 1085221 agaggcgggc ctcgatcgaa tatccctggg cggggggagg ttcggtgtcg agtcgcccgc 1085281 agtcggcaat catgagctgc agttcgacca gatccagccc ggtggtctct tcggtgaccg 1085341 ggtgctcgac ctgtagccgg gtgttcatct ccaggaagta gaactcacct tcccggccag 1085401 gtgagtcatc ggcgaggaac tccaccgtgc ctgccccggt gtagccgatc gcgctggccg 1085461 ccagccgggc cgcgtcgaac agcttggccc gcatccccgg tacgcgttcc accagcggcg 1085521 acggtgcctc ttcgatgatc ttctggtggc ggcgctgaaa cgagcattcc cgttccccga 1085581 ccgcccacac ggtgccatgg gtgtcggcca tgacttgcac ttcgacgtgg tgcccggtgg 1085641 gcaggtagcg ctcgcagaat acggtcgggt cgccgaacgc ggattgggct tcacgtcgcg 1085701 cggcttcgac ttcggccggc agggccgata attcgtgaac cactcgcatg ccgcgaccgc 1085761 caccgcccgc cgacgccttc accagcaccg gcagctgcgc ggtggtgacg gcgtcggggt 1085821 cgagttcctc gagcaccggc accccggcgg cggccatcag cttcttggac tcgattttgg 1085881 agcccatcgc gcgcaccgcg tccaccggtg gcccgaccca ggttaggccg gcctcctgca 1085941 cggcggccgc gaattcggcg ttctccgaga ggaatccgta gccgggatgc accgcgtcgg 1086001 ctccggctgc ctgcgcggcc gcgatgatcg cctcggcgtt cagatagtcg gtggtctgcg 1086061 gcagccggac ccgggcgtcg gcctcggcga catgcggtgc cgcggcatcc gggtctgtgt 1086121 agacggcgac ggtgccgagc cccagccggc ggcaggtggc gaacacccgc cgggcgatct 1086181 cgccgcggtt agcaaccaat actcgagtga ttcccatcag catcacatcc ggaagacgcc 1086241 gaagttcgac gtccccttga tcgggccatt ggcgatggcg gacaaacaca ttcccagcac 1086301 ggtgcgggtg tcgcgcgggt cgatcacccc gtcgtcgtaa agcatcccgg acagcaccaa 1086361 cggtagcgac tcggcttcga tctggccctc gacggcggcc cgcatcgccg cgtcggcggc 1086421 ttcgtcgact tgctgcccgc gggcttcggc tgccgcccgg gccacgatgg acagcacgcc 1086481 cgacagctgg gcgccgccca tcaccgcgga cttggcgctg ggccaggcga ataggaagcg 1086541 cgggtcgtag gcgcgcccgc acatgccgta gtgcccggcg ccgtaggacg cgccgatcag 1086601 cagcgagatg tgcgggacgg tcgagttgga cacggcgttg atcatcatcg agccatgctt 1086661 gatcatcccg ccttcctcgt agtccttgcc caccatgtag ccggtggtgt tgtgtaagaa 1086721 caacagcggc gtgtcggccc ggttggccag ctggatgaac tgggtggcct tctgtgattc 1086781 ctcgctgaac agcacgccgc gggcgttggc caggatgccc agcggatagc cgtgcaaccg 1086841 agcccagccg gtcaccagag acgacccgta cagcggcttg aattcgtcga actcggagcc 1086901 atcgacgatg cgggcgatca cctcgcgcgg gtcgaatggg atgcgcagat ccgggggcac 1086961 gatgccgatt agctcctcgg cgtcgaacag cggctcggtc accggagcgg gtgcgggtcc 1087021 ctgtttgatc cagttcagtc gcgccacgat gcggcgtccg atgcggatcg cgtcgagctc 1087081 gtcgagcgca aaatagtcgg ccaaacccga tatgcgggcg tgcatttcgg cgccgcccag 1087141 cgactcgtcg tcggactctt cgccggtggc catcttcact agcggcgggc cggccaaaaa 1087201 caccttggag cgttccttga tcatcaccac gtgatcggac atgccgggga cgtaggcacc 1087261 gcccgcggtg gagttgccga aaaccagcgc aatggtcggg atcccggccg ccgacagccg 1087321 ggtcaggtcg cggaacatct gtccgccggg gatgaaaatc tctttctggg tgggcagatc 1087381 ggccccgccg gattccacca gcgaaatgac gggaagccgg ttttcgaagg cgatctggtt 1087441 ggcccgcagt atctttcgaa gcgtccacgg attgctggtg ccgcccttga ccgtcgggtc 1087501 gttggcgacg atcatgcatt ccacgccgca gaccgcgccg atgccggtga ccaggctggc 1087561 gccgatctgg aagttgctgc cgtaggcggc cagcgggctc agctccagga acggggagtc 1087621 cgggtcgacg agcagctcga tgcgttcccg tggtgtcagc ttgccgcggg cgtggtgccg 1087681 gtcgacgtat ttggggccac cgccggcgag cgccttggcc agttcggcgt tgatctcgtc 1087741 gagcttgccg ctcatcgtcg cggccgcctc gtcgtaggcg gaagcgttcg ggtccagtgt 1087801 ggattgcagc acggtcacga ttgatacccc agggttttgg cggccaaagc ggtcagtatt 1087861 tcggtggtgc cgcctccgat accgaggatt cgcatgtccc ggtattggcg ttcgcttcgg 1087921 attcggccat gtaacccatg ccgccgaaca gctgtacggc ctggttggca acccactccc 1087981 cggcctgcac ggcggtgttc ttggcgaaac acacctgcgc gatcaggtcg gtctcgccgg 1088041 cgagctggcg ttccaccaca tggtgcgcat agacccgggc gacgtcgatg cggcgggcca 1088101 tctcggccag cgtgttctgc accgactggc gtgaaatcag cggccgaccg aacgtctcgc 1088161 ggtcccggca ccactgcgcg gtgaggtcca ggcaccgctg ggcgctcgaa tacgcctggg 1088221 cggcaaggcc gatgcgctcg gaaacaaatg cccgggcgat ctgggtgaag ccgctgttct 1088281 cggcgcccac gaggttagtc gccggcacgg ccacgtcggt gtagcacagc tcggcggtat 1088341 ccgaggaacg ccagcccatc ttgtccagct tgcgggtcac ctcaaagccg ggggtgtcct 1088401 tttccaccac cagcagcgaa accccggcgg caccgggtcc accggttcgc accgcggtga 1088461 ccacgtagtc ggcccgcacg ccggaggtga tgtaggtctt ggcgccgttg atcacgtaat 1088521 ggtcgccgtc ccgtaccgcg ctggtccgta gatgcccgac gtcggagccg ccgccgggtt 1088581 cggtgatggc cagcgcgccg atcttctccc cggccaaggt gggccgcacg tacgtggcga 1088641 tcagccgttc gtcgccggat gcgaccatgt gcggtacggc gataccgcag gtgaacaggg 1088701 acgcatacac cccgcccggg gcgccggcct ggtgcatctc ctcgcagatg atgacggggt 1088761 cggcgccgtc accgccgcca ccaccgaccg cctcgggaaa gccggcgccc agcagcccgg 1088821 cggccccggc gagccggtgc aggccgcggg gcaactcgcc gatcctttcc cactcgtcga 1088881 cgtgcggcag gatctcgcgc tcggcaaagg cgcgcaccgt ttttcgcagc tgttggcgct 1088941 ccggtgtggt ccagatgttc acaacagggt ctccgggatc tcgacgtggc ggctgcgcag 1089001 ccactcaccc agtcccttgg cctgcgggtc gaagcgggcc tggtaggcga cgccctggcc 1089061 gaggattgcc tcgatgacga agttcagtgc ccgcagattc ggcagcacgt gacgggtgac 1089121 gaccaggcct gccgtttctg gcagcagctc cttgagtagc tcgacggtca gcgtgtgcgc 1089181 cagccagcgc cactgctcgt cggtgcgtac ccacacgccg acgttggccg atccgccctt 1089241 gtcgccgctg cgggcgccag cgatcaggcc cagcggtacg cgccgggtcg ggccagccgg 1089301 cagcgggtcg ggcagcgccg ggggatgtgc cggcgccagc tccaacgtct cagtggcgca 1089361 gggaatctcg gtgcgggtgc cgtcggcgtg cacggcgatg tgcgccacct tgccggcgtc 1089421 gacgtagccg ggggtgaaca cgccatacac ctggccgtca ccgggcgggg cggtggcggt 1089481 gaaccccggg tagctggcca gcgccaattc gaccgcggcc gaggagaatt gccgacccac 1089541 attggcaggg tcgggatcgc gggcgacgca ggtgagcagc gcgctggcgg tttcttcggt 1089601 gtcggcgtcg gggtggtcgg tgcgggccag cgtccattgc agctcagcgg gtttgacggt 1089661 cagcgcggcc tcgagctggc gtcgcaccaa gtcggccttg gcatcgatgt ccaggccggt 1089721 cagcacgaat gtcatggcgt tgcggaagcc gccgatgctg ttcagcgaca ccttgtaggt 1089781 cggcggcggc ggttcgccga tcacgccgct aatgcgcact cgatccggcc cgtcgggcga 1089841 cagttcgacg ctgtccatcc gggccgtcac atccgggttg gcataccgag cgcccgtgat 1089901 ctcgtagagc agctgcgcgg tgatggtgtc gacgctgacc aggccgccgg tgccgtggtg 1089961 cttggtgatc accgacgagc cgtcggcagc gatctcggcc agcgggaagc cggcgtgagt 1090021 gaggtcgcct atctcggtga agaacgcgta gttgccgccg gtggcctgga ctccgcattc 1090081 gatcacgtgc ccggccacca cggcgccggc cagtcggtgg tagtcggtgc ggccccagcc 1090141 gaagtgcgcg gccgccgccc cgacgaccac cgaggcgtcg gtgacccggc cggtgaccac 1090201 gacgtcggcg ccgcgctcga agcagtcgac gatgccccat gcgcccaggt aggcgttggc 1090261 cgtcagtggc gtccccagcc ccagttcggc cgcccgtggt tgcaggtcgt cgccttccac 1090321 gtgggcgacc tgcgccggaa tgcccaggcg cgcggccagc gcccgcaccg cgttggccag 1090381 cccggcgggg ttcaggccac cggcgttggt gacgatgcgc accccgcggt catgggccag 1090441 gcccaggcag tcctcgagct gggccaggaa ggtcttcgcg tagccgcgat cggggttttt 1090501 catgcggtcg cgaccgagaa tcaacatggt cagctcggcc aggtagtcgc cggtgagata 1090561 gtccagctcg ccgccggtca gcatctcgcg catggcggag aggcggtcgc cgtagaagcc 1090621 cgagcagttt ccgatacgca cggcaccaca gtcaggggcc atgcgattcc tcccttggga 1090681 tcggcgacgc taccaaccaa ccggtaggtt agcactgccc tgtttcgcga cggagatcgc 1090741 ttcctgagtc gaagcggccc ggtctgcgcc gtccattgga gtagagtccg tttcgctacg 1090801 ggacgccggg tgctttgccg gccccaggag gtcagcgcca tgtccttcgt ggtcacagca 1090861 ccgccggtgc tcgcgtcggc ggcgtcggat ctgggcggta tcgcgtccat gatcagcgag 1090921 gccaacgcga tggcagcggt ccgaacgacg gcgttggcgc ccgccgccgc cgacgaggtt 1090981 tcggcggcga tcgcggcgct gttttccagc tacgcgcggg actatcaaac gctgagcgtc 1091041 caggtgacgg ccttccacgt gcagttcgcg cagacattga ccaatgcggg gcagctgtat 1091101 gcggtcgtcg acgtcggcaa tggcgtgctg ttgaagaccg agcagcaggt gctgggtgtg 1091161 atcaatgcgc ccacccagac gttggtgggt cgtccgctga tcggcgatgg cacccacggg 1091221 gcgccgggga ccgggcagaa cggtggggcg ggcggaatct tgtggggcaa cggcggtaac 1091281 ggcgggtccg gggctcccgg acagccgggc ggccggggcg gtgatgccgg cctgttcggc 1091341 cacggcggtc atggcggtgt cggggggccg ggcatcgccg gtgccgctgg caccgcgggc 1091401 ctgcccgggg gcaacggcgc caacggcgga agcggcggca tcggcggcgc cggcggcgcc 1091461 ggcggcaacg gcgggctgct attcggcaac ggtggtgccg gcggccaggg tggctccggc 1091521 ggacttgggg gctccggcgg gacgggcggc gcgggcatgg ctgccggtcc cgccggcggc 1091581 accggcggca tcgggggcat cggcggcatc ggcggcgcgg gcggggtcgg cggccacggc 1091641 tcggcgttgt tcggccacgg gggaatcaac ggcgatggcg gtaccggcgg catgggtggc 1091701 cagggcggtg ctggcggcaa cggctgggcc gctgagggca tcacggtcgg cattggtgag 1091761 caaggcggcc agggcggcga cgggggagcc ggcggcgccg gcgggatcgg tggttcggcg 1091821 ggtgggatcg gcggcagcca gggtgcgggt gggcacggcg gcgacggcgg ccagggcggc 1091881 gccggcggta gtggcggcgt tggcggcggc ggcgcaggcg ccggcggcga cggcggcgcg 1091941 ggcggcatcg gcggcactgg cggtaacggc agcatcggcg gggccgccgg caatggcggt 1092001 aacggcggcc gcggcggcgc cggtggcatg gccaccgcgg gaagtgatgg cggcaatggc 1092061 ggcggcggcg gcaacggcgg cgtcggtgtt ggcagcgccg gaggggccgg cggcaccggc 1092121 ggtgacggcg gggcggccgg ggcgggcggc gcgccgggcc acggctactt ccaacagccc 1092181 gcgccccaag ggctgcccat cggaaccggc gggaccggcg gcgaaggcgg tgccggcggc 1092241 gccggtggag acggcgggca gggcgacatc ggcttcgatg gcggccgggg tggcgacggc 1092301 ggcccgggcg gtggcggcgg cgccggcggt gacggcagcg gcaccttcaa tgcccaagcc 1092361 aacaacggcg gcgacggtgg tgccggcggt gttgggggag ccggcggcac cggcggcacg 1092421 ggtggggtcg gggccgacgg gggtcgcggg ggggactcgg gccgcggcgg cgacggcggc 1092481 aacgccggcc acggcggcgc cgcccaattc tccggtcgcg gcgcctacgg cggtgaaggt 1092541 ggcagcggcg gcgccggcgg caacgccggt ggcgccggca ccggtggcac cgcgggctcc 1092601 ggcggtgccg gaggtttcgg cggcaacggt gccgatggcg gcaatggcgg caacggtggc 1092661 aacggcggct tcggcggaat taacggcacg ttcggcacca acggtgccgg cggcaccggc 1092721 gggctcggca ccctgctcgg cggccacaac ggcaacatcg gcctcaacgg ggccaccggc 1092781 ggcatcggca gcaccacgtt gaccaacgcg accgtaccgc tgcagctggt gaataccacc 1092841 gagccggtgg tattcatctc cttaaacggc ggccaaatgg tgcccgtgct gctcgacacc 1092901 ggatccaccg gtctggtcat ggacagccaa ttcctgacgc agaacttcgg ccccgtcatc 1092961 gggacgggca ccgccggtta cgccggcggg ctgacctaca actacaacac ctactcaacg 1093021 acggtggatt tcggcaatgg ccttctcacc ctgccgacca gcgttaacgt cgtcacctcg 1093081 tcatcaccgg gaaccctggg caacttcttg tcgagatccg gtgcggtggg cgtcttggga 1093141 atcgggccca acaacgggtt cccgggcacc agctccatcg ttaccgcgat gcccggcctg 1093201 ctcaacaacg gtgtgctcat cgacgaatcg gcgggcatcc tgcagttcgg tcccaacaca 1093261 ttaaccggcg gtatcacgat ttctggagca ccgatttcca ccgtggctgt tcagatcgac 1093321 aacgggccgc tgcaacaagc tccggtgatg ttcgactccg gcggcatcaa cggaaccatc 1093381 ccgtcagccc tcgccagcct gccgtccggg ggattcgtgc cggcgggaac gaccatttcg 1093441 gtctacacca gcgacggcca gacgctgttg tactcctaca ccaccaccgc gacaaacacc 1093501 ccatttgtca cctccggcgg cgtgatgaac accgggcacg tccccttcgc gcagcaaccg 1093561 atatacgtct cctacagccc caccgccatc gggacgacca cctttaactg acggcccctc 1093621 cctggctcgt gatagggaag gggcgtctgc agcgggcgtt ctcgattgtc gccgcgctca 1093681 tctgcgcgcg gaagctcata ccaaagagga aggcccacca tggctgtgcc cacgcgcaga 1093741 aagtcgcgcg cgaacacccg aagccggcgc tcgcagtgga gggcccggcc ggacgggtgc 1093801 gggccgaaca caccgggcgg gctggtgtca gctgattacc gacaccgtgt cgccggcgaa 1093861 gttggtgaca tagacctcgc cggtgacggg gttgaccgcc accccggtcg gagcggtgcc 1093921 gacggtgatg ggggagccgg tgacggtgtt ggtggtcggg tcgatcaccg acaccgtgtt 1093981 gctgtcgaag ttggtcacga agaccaggcc ggtgacgggg ctgaccgcca ccccgcttgg 1094041 accgttgccg atggtgatgg gggagccggt gacggtgttg gtggcggggt tgatcaccga 1094101 caccgtgccg ctgccgaaat tggtgacgta gacgttgccg cccgggttga ccgccacccc 1094161 gtgcggatcg ttgaagctgg cgtgggtgat ggtggtgacg gcgccggcgg ccccaccggc 1094221 accgccgacc ccacccgcac cgccgatacc gccgaccggg ccgcggccgg caccgccggc 1094281 cgtgccggcg cgggcgaggc tgaccgcgcc gccggtgccg ccggccccac cgttgccgat 1094341 caacccggcc gccccgccgg cgccgccggc ctgtccgggt gcccccgacc cgccattgcc 1094401 gccgttgccc cacaagatgc cgccggcccc gccggcctgc ccggtgccgg gcgccccgtg 1094461 ggcgccgtca ccgatcagct tgcgccccac cagcgcctca gtcggcgtgt tgatcacccc 1094521 caacagggcc tgctcgatct gctgcagcgg tgttgcgctg gccgcttcgg cgaccgcgta 1094581 ggtgctgcca gcttggctta aggccagcac gaaccgttcc tgataggccg cgacctgcgc 1094641 gctgatcgct tgatagtgct ggccgtggct gccgaacagc gccgcgatcg ccgttgacac 1094701 ctcgtcttgg gcggcgacca acacctgggt ggtcgccgcc gccgcggtgt tggcggtgtt 1094761 gatcgccgag ccgatccgcg ccgcatcggc cgcggctgtg gacactaact gtggggccac 1094821 gttgacaaac gacatcgaaa tcctcctgac cgccacgatg ttgagatgcg ggcggcccac 1094881 cgcctgttac ccctgcggtg ggtaaccgtt tattcggacg atccctgccg ttccacgcct 1094941 gggcgcaggc gcaaaccgca ccaacattgg tggaacgtgg tgcacactgc acctggggtt 1095001 ctgccctcat cgtgtgtcag caggcgaaac ccgcgcggac gagaactcct gcgttaagca 1095061 gcacaaatcg ctgctcacgc tcaccggtca gcgcactgaa ccggccccat gtcgacgacc 1095121 ggtgaggcga ccgctcaact cgtcggcgtc aactcggcca ttgccaccct ggtcgccgat 1095181 tcctgtccca cagccccacc accatcgggg cgacaaccgt gaactgacgg tcacgcccgg 1095241 gcccaacccc ggcccggaat tgggccgggc cgtcttcaac cggtatcctc cacgtcattg 1095301 tcgacgcgat tgtcgccgcg cccacctgcg tgcggaagcc cataccaaaa gaggaaggcc 1095361 caccatggct gtgcccaagc gcagaaagtc gcgctcgaat acccgaagcc ggcgctcgca 1095421 gtggaaggcc gccaagaccg agctggtcgg tgtgaccgtc gccggtcacg cccacaaggt 1095481 gcctcggcgc ttgctcaagg ccgcccggct cggcctcatc gatttcgata agcgctgacg 1095541 cgccggcggc cgacgatcat atggccgccg aacacaccga gcgcgccggc tctccggtga 1095601 tcaccgacac cgtgtcgtcg agagagttag tgacgtagac cacgccggtg acggggttga 1095661 ccgccacccc tgtcgggtcg agtccgacgg ggatggggga gccggtgacg gtgttggtgg 1095721 ccgggtcgat caccgacacc gtgttgctga actggttggt gacgtagatg ttgccgcctg 1095781 ggttgaccgc caccccatac gcaccggtac cgacggggat ggagccggtg acggtgttgg 1095841 tgttcgggtc gatcaccgac accgtgttgc tgtcgaagtt ggtcacgaag accaggccgg 1095901 tgacggggct gaccgccacc ccgcttggac cgttgccgtc ggtgatggag ccggtgacgg 1095961 tgttggtgac cgggtcgatc accgacaccg tgttgctgcc ctggttggtg acgtagatgt 1096021 tgccgcccgg gttgaccgcc accccgtgcg gatcgttgaa gctggcgtgg gtgatggtgg 1096081 tgacggcgcc ggcggcccca ccggcaccgc cgaccccacc cgtaccgccg ataccgccgg 1096141 ccgggccgcc gccggcgccg ccggcggtgc cggcgcgggc gaggctgacc gcgccgccgg 1096201 tgccgccggt cccgccgtcc ccgccgtgtc cacccacacc gattaacccg ccgtgaccac 1096261 caaccccgcc ggtgccaccg tcaccgccgg ccacaccgaa ggttgtgccg gctccgccgg 1096321 ccccgccgac accaccggcc ccgccgttgc cgaacagcca tccaccggcg ccgccggctc 1096381 cgccgttcgc gccggcctca aagggtaggc cctggccgcc agctccgccg gccccaccgt 1096441 tgccgatcaa cccggccgca ccgccggccc cgccggcctg cccgggtgcc cccgacccgc 1096501 cgttgccgcc gttgccccac agccacccgc cgttaccgcc ggcttgcccg gtcccgtcga 1096561 tcccgttcgc gccgtcgccg atcaatgggc gcccggtcag cgactgaacg ggtgcgttga 1096621 tcgcatcgag cacgttctgc agcggtgttg cgctggccgc ttcggcgacc gcgtaggtgc 1096681 tgctagcttg gcttaaggcc agcacgaacc gttcctggta ggccgcgacc tgcgcgctga 1096741 tcgcttgata gtgctggccg tggctgccga acagcgccgc gatcgccgtt gacacctcgt 1096801 cttgggcggc ggccaacacc tgggtggtcg ccgccgccgc ggtgttggcg gtgttgatcg 1096861 ccgagccgat ccgcgccgca tcggccgcgg ctgtggacac taactgtggg gccacgttga 1096921 caaacgacat cgaaatcctc ctgaccgcga cgatgttgag atgcgggcgg cccaccgcct 1096981 gttacccctg cggtgggtaa ccgtttattc ggacgatccc tgccgttcca cgcctgggcg 1097041 caggcacaaa ccgcaccaac attggtggaa cgtggtgcac actgcacctg gggttctgcc 1097101 ctcatcgtgt ggcagcaggc gaaacccgcg cggacgagaa ctcttccgcc aagcagcaca 1097161 aatcgcccta ctcttgacca ccaaacaaaa cccgtccatg gggccaatgt ggctgatgtg 1097221 gctaaacctc gtcgaacaaa cccgcatacc acggcgcgcc tctcaggcca gtctcaggcg 1097281 ctgcgacgac actggtgtcc gtgcgaattc ttgtcgttga cgacgatcgt gcggtgcgcg 1097341 agtcgctgcg ccggtcgctt tccttcaatg gctattcggt cgaactggcc cacgacgggg 1097401 ttgaggcgct cgacatgatt gccagcgatc gccccgacgc gttggtcctg gatgtcatga 1097461 tgccgcggct ggacggcctc gaggtgtgcc gtcagctccg cagcaccggc gacgacctgc 1097521 cgattctggt gctgaccgcg cgcgactcgg tgtccgagcg ggtggccggg ctggacgccg 1097581 gtgccgacga ctacctacca aagccgttcg ccctcgaaga gctgctggca cggatgcggg 1097641 cgctgctgcg ccgcaccaag cccgaggatg ccgccgagtc gatggccatg aggttctccg 1097701 acctgacgct ggacccggta acccgcgaag tcaaccgtgg acagcgccgg atcagcctga 1097761 cccgcaccga atttgcattg ctggagatgc tgatcgccaa tccgcggcga gtgctgacgc 1097821 gcagccgtat cctggaagag gtatggggat tcgactttcc cacctcgggc aacgcgctgg 1097881 aagtctacgt cgggtatcta cgccgcaaga ccgaggccga cggcgagccg cggctgatcc 1097941 acactgtgcg cggagtgggt tacgtgctac gtgaaacacc accctgatgt ggtggttccg 1098001 ccgccgagac cgggcgccgc tgcgcgccac cagctcatta tccctgcggt ggcgggtcat 1098061 gctgctggcg atgtccatgg tcgcgatggt ggttgtgctg atgtcgttcg ccgtctatgc 1098121 ggtgatctcg gccgcgctct acagcgacat cgacaaccaa ctgcagagcc gggcgcaact 1098181 gctcatcgcc agtggctcgc tggcagctga tccgggtaag gcaatcgagg gtaccgccta 1098241 ttcggatgtc aacgcgatgc tggtcaaccc cggccagtcc atctacaccg ctcaacagcc 1098301 gggccagacg ctgccggtcg gtgctgccga gaaggcggtg atccgtggcg agttgttcat 1098361 gtcgcggcgc accaccgccg accaacgggt gcttgccatc cgtctgacca acggtagttc 1098421 gctgctgatc tccaaaagtc tcaagcccac cgaagcagtc atgaacaagc tgcgttgggt 1098481 gctattgatc gtgggtggga tcggggtggc ggtcgccgcg gtggccgggg ggatggtcac 1098541 ccgggccggg ctgaggccgg tgggccgcct caccgaagcg gccgagcggg tggcgcgaac 1098601 cgacgacctg cggcccatcc ccgtcttcgg cagcgacgaa ttggccaggc tgacagaggc 1098661 attcaattta atgctgcggg cgctggccga gtcacgggaa cggcaggcaa ggctggttac 1098721 cgacgccgga catgaattgc gtaccccgct aacgtcgctg cgcaccaatg tcgaactctt 1098781 gatggcctcg atggccccgg gggctccgcg gctacccaag caggagatgg tcgacctgcg 1098841 tgccgatgtg ctggctcaaa tcgaggaatt gtccacactg gtaggcgatt tggtggacct 1098901 gtcccgaggc gacgccggag aagtggtgca cgagccggtc gacatggctg acgtcgtcga 1098961 ccgcagcctg gagcgggtca ggcggcggcg caacgatatc catttcgacg tcgaggtgat 1099021 tgggtggcag gtttatggcg ataccgctgg attgtcgcgg atggcgctta acctgatgga 1099081 caacgccgcg aagtggagcc cgccgggcgg ccacgtgggt gtcaggctga gccagctcga 1099141 cgcgtcgcac gctgagctgg tggtttccga ccgcggcccg ggcattcccg tgcaggagcg 1099201 ccgtctggtg tttgaacggt tttaccggtc ggcatcggca cgggcgttgc cgggttcggg 1099261 cctcgggttg gcgatcgtca aacaggtggt gctcaaccac ggcggattgc tgcgcatcga 1099321 agacaccgac ccaggcggcc agccccctgg aacgtcgatt tacgtgctgc tccccggccg 1099381 tcggatgccg attccgcagc ttcccggtgc gacggctggc gctcggagca cggacatcga 1099441 gaactctcgg ggttcggcga acgttatctc agtggaatct cagtccacgc gcgcaaccta 1099501 gttgtgcagt tactgttgaa agccacaccc atgccagtcc acgcatggcc aagttggccc 1099561 gagtagtggg cctagtacag gaagagcaac ctagcgacat gacgaatcac ccacggtatt 1099621 cgccaccgcc gcagcagccg ggaaccccag gttatgctca ggggcagcag caaacgtaca 1099681 gccagcagtt cgactggcgt tacccaccgt ccccgccccc gcagccaacc cagtaccgtc 1099741 aaccctacga ggcgttgggt ggtacccggc cgggtctgat acctggcgtg attccgacca 1099801 tgacgccccc tcctgggatg gttcgccaac gccctcgtgc aggcatgttg gccatcggcg 1099861 cggtgacgat agcggtggtg tccgccggca tcggcggcgc ggccgcatcc ctggtcgggt 1099921 tcaaccgggc acccgccggc cccagcggcg gcccagtggc tgccagcgcg gcgccaagca 1099981 tccccgcagc aaacatgccg ccggggtcgg tcgaacaggt ggcggccaag gtggtgccca 1100041 gtgtcgtcat gttggaaacc gatctgggcc gccagtcgga ggagggctcc ggcatcattc 1100101 tgtctgccga ggggctgatc ttgaccaaca accacgtgat cgcggcggcc gccaagcctc 1100161 ccctgggcag tccgccgccg aaaacgacgg taaccttctc tgacgggcgg accgcaccct 1100221 tcacggtggt gggggctgac cccaccagtg atatcgccgt cgtccgtgtt cagggcgtct 1100281 ccgggctcac cccgatctcc ctgggttcct cctcggacct gagggtcggt cagccggtgc 1100341 tggcgatcgg gtcgccgctc ggtttggagg gcaccgtgac cacggggatc gtcagcgctc 1100401 tcaaccgtcc agtgtcgacg accggcgagg ccggcaacca gaacaccgtg ctggacgcca 1100461 ttcagaccga cgccgcgatc aaccccggta actccggggg cgcgctggtg aacatgaacg 1100521 ctcaactcgt cggagtcaac tcggccattg ccacgctggg cgcggactca gccgatgcgc 1100581 agagcggctc gatcggtctc ggttttgcga ttccagtcga ccaggccaag cgcatcgccg 1100641 acgagttgat cagcaccggc aaggcgtcac atgcctccct gggtgtgcag gtgaccaatg 1100701 acaaagacac cccgggcgcc aagatcgtcg aagtagtggc cggtggtgct gccgcgaacg 1100761 ctggagtgcc gaagggcgtc gttgtcacca aggtcgacga ccgcccgatc aacagcgcgg 1100821 acgcgttggt tgccgccgtg cggtccaaag cgccgggcgc cacggtggcg ctaacctttc 1100881 aggatccctc gggcggtagc cgcacagtgc aagtcaccct cggcaaggcg gagcagtgat 1100941 gaaggtcgcc gcgcagtgtt caaagctcgg atatacggtg gcacccatgg aacagcgtgc 1101001 ggagttggtg gttggccggg cacttgtcgt cgtcgttgac gatcgcacgg cgcacggcga 1101061 tgaagaccac agcgggccgc ttgtcaccga gctgctcacc gaggccgggt ttgttgtcga 1101121 cggcgtggtg gcggtgtcgg ccgacgaggt cgagatccga aatgcgctga acacagcggt 1101181 gatcggcggg gtggacctgg tggtgtcggt cggcgggacc ggggtgacgc ctcgcgatgt 1101241 caccccggaa gccacccgcg acattctgga ccgcgagatc ctcggtatcg ccgaggccat 1101301 ccgcgcgtcc gggctgtccg cgggaatcgt cgacgccggg ttgtcgcgcg gcctggcggg 1101361 tgtctccggc agcacgctgg tggtcaacct cgcgggttcg cgttatgcgg tgcgcgatgg 1101421 aatggcgacg ctgaatccgc tagcggcaca gatcatcggg cagttgtcga gcttggagat 1101481 ctgaatccgg atcgagtgtc gggctattgc gattctgtgc tcgcgcgagg cccgtcggtt 1101541 ggcgatggtg tcccacggcc gccgtgcctc cccggcgagt ccccgttcgt ttgcgcgagc 1101601 agatcgcgga tttcggtgag cagcacgact tgggtgtcgc ccggctgctc gacctccccc 1101661 ttcttgcgta gtgtgttgta gggcagcacg actaggaagt acaccgcgaa cgcgatcagg 1101721 aaaaagttga tcgctgccga caacaagacg ttcaagtcaa tggtctgacc accgccgata 1101781 ccgatccgca agatgccgac gtcggactgt gcgttgacgc cgatccggtt gatcagcggc 1101841 gtaatgatgc tgtcggtgaa cttggtgacc aacgccgtga acgctgtgcc gattaccacc 1101901 gcgacagcca ggtcgacgat attaccccgc gcgagaaact ccttgaatcc tttgagcatg 1101961 cgatgtcctt tctgcagtcg gcggccggca gtccgcgagt ggaacaccta gaaaaactag 1102021 accaggtggt gtcaatggcc acgacgctgg gatcgccgtt gccatgggga gctgacgctg 1102081 ccgggatccg gtgctgttgt ttgttgacgg gatgcccttg acttcgctga ccgtggtgtg 1102141 cgcgtaaccg gccggtcggg aacgcggcga cggatggcgc ggtggccagg acagtgatcg 1102201 agatgacatc acgccaacaa cgccttcagc tgtgagcgat ccgggctaga ctaccgccga 1102261 aatatccaac aaaggaccta catgaaccgg caacctatcg ttcagctgag taacttgagc 1102321 tggacattcc gagaaggcga aacccgacga caagtcctag accacatcac cttcgatttc 1102381 gagcccggtg agtttgtcgc gctgctgggg caaagtggaa gtggtaaaag cactttgctg 1102441 aacctcatca gtggcataga aaagcccacc acaggtgacg tcacaattaa tgggttcgct 1102501 atcactcaga aaaccgagcg agaccggacg ttgttccggc gcgatcagat tggcatcgtc 1102561 tttcaatttt tcaacctgat tcccactctt accgtgttgg aaaatattac gctgcctcag 1102621 gaactggccg gagtttctca gaggaaagcg gccgtggtcg ctcgtgacct tctcgaaaaa 1102681 gtgggcatgg ccgaccgtga acgcaccttt cccgataaac tctccggcgg agaacaacaa 1102741 cgggtcgcta tttccagagc gttggcgcat aatcccatgc tggtgttagc cgatgagccg 1102801 accggcaacc tggactccga taccggggat aaagtcttgg atgttctgct tgatctcacc 1102861 cgccaagcag gtaaaacctt aatcatggct acgcatagcc cgtcgatgac gcagcatgcc 1102921 gaccgggtag tcaacttaca gggcggcagg ttgatacctg ccttgaaccg agaaaatcaa 1102981 accgaccagc cggccagcac gatcctattg cccacgtcat atgaatgacc aagctcccgt 1103041 tgcttatgca ccactatggc gcacggcgtg gcgtcggctg cgtcagcggc cgtttcaata 1103101 tattctgctg gtcctgggaa ttgcgctagg cgttgccatg atcgtggcta tcgatgtatc 1103161 cagtaattcg gcgcaacgtg ccttcgatct ctctgccgcg gccatcaccg gaaaatctac 1103221 tcaccggctg gtcagtggcc ccgccggggt ggaccaacag ctttatgtcg atctgcgccg 1103281 acacgggtac gatttttccg ctccggtaat cgaaggctat gtgttggccc gcggactggg 1103341 aaaccgagct atgcagttca tgggcaccga cccatttgcg gagtcagctt ttcgctcgcc 1103401 tttatggtcc aaccaaaata tcgccgagtt gggtggcttt ttgactcgac ccaacggtgt 1103461 cgtgttaagc cgacaagtgg cacagaagta tggcttggct gtgggcgatc gcattgctct 1103521 gcaagtgaaa ggtgcgccta ccacagtaac cctggtggga ttgctgacac ctgcagatga 1103581 agttagcaat caaaaattgt ccgaccttat cattgctgat atttccacgg cccaagagtt 1103641 gttccatatg cccggaagac tgagccacat cgatttgatc atcaaagatg aggccactgc 1103701 aacacgcatc caacaaagac tgccggccgg tgtgcgtatg gaaacgtcgg atacccaacg 1103761 ggacaccgtc aaacagatga cggacgcttt tacggtcaat ttaaccgctc tcagtttgat 1103821 tgccttgttg gtgggtatct ttttaatcta caataccgtg acatttaatg tcgtgcaacg 1103881 gcgaccgttt ttcgccatat tgcgctgttt gggtgtaacc cgagagcagt tattttggct 1103941 gataatgacg gaatccctcg ttgccgggct gattggtacg ggcttgggcc tcttgattgg 1104001 aatttggctc ggcgaaggtt tgatcggcct ggtgactcaa accatcaatg atttctattt 1104061 tgtcatcaat gttcgcaatg tgtccgtctc cgccgaaagc ttgttgaagg ggctgatcat 1104121 cggcatcttt gccgccatgt tagccacact gccaccggct atagaagcga tgcgcaccgt 1104181 ccctgccagc acattgcggc gctcctccct ggaaagcaag ataaccaagc tcatgccgtg 1104241 gttgtgggtg gcgtggtttg gtttgggtag ctttggtgta ttgatgctgt agttgccggg 1104301 caacaacctg gttgtggcct ttgtcggtct ctttagtgtg ctgattgccc tggcgcttat 1104361 tgccccgccg ctgacccggt ttgtaatgtt gcgcttagct cctggcttag gacggctgct 1104421 cggtccaata ggtcgaatgg cgccacgcaa tattgtgcgc tcgttgagtc gcacctctat 1104481 cgccatcgcc gccctgatga tggccgtgtc cttgatggta ggcgtctcca tatcggtggg 1104541 gtcgtttcga cagacgctgg ccaattggct agaggtgact ttgaagtcgg atgtctatgt 1104601 gtctccgccg accttaacat ccggtcgccc cagcggtaat ctgcctgtgg atgccgtccg 1104661 gaatataagc aaatggccag gagtgcgtga cgcagttatg gctcggtata gttccgtttt 1104721 tgccccggac tgggggcgtg aggtggaact aatggcggtg tcgggtgata tttccgacgg 1104781 caagcgacca tataggtgga tcgacggcaa taaagacacg ctctggccac gtttcttggc 1104841 ggggaaaggg gtgatgctat cggagccaat ggtatcgcga caacacttgc agatgccgcc 1104901 aaggccgatc acgctaatga cggattcggg gccacaaacg ttccccgttc tggcggtttt 1104961 ctctgactac acctcagatc aaggtgtgat tttgatggat cgcgccagtt atcgggccca 1105021 ttggcaggat gatgacgtga cgaccatgtt tctttttttg gcatcgggtg cgaatagcgg 1105081 tgccttgata gatcaactac aagccgcgtt cgcgggtcgg gaagacattg ttattcaatc 1105141 gactcatagt gtccgcgaag catcaatggt catatttgat cgtagtttta ccattaccat 1105201 cgcgttgcaa ctggtggcca cggtggtggc ttttattggc gtactgagcg cgctgatgag 1105261 tttggaattg gaccgggctc atgagttggg tgtttttcgc gccattggca tgactacccg 1105321 ccaattatgg aagctgatgt tcattgagac cggcctaatg ggcgggatgg ccggcttgat 1105381 ggccttgcca actggttgta ttctagcgtg gattcttgtc cgcattatca atgtccgctc 1105441 attcggctgg accttgcaga tgcactttga gtcggcgcat tttcttcgag ccctgttggt 1105501 agcggtggtg gccgccctgg cggcgggtat gtaccccgct tggcgtttgg ggcggatgac 1105561 gattcgcacg gcgattcgtg aggaatgacg gtacatgaga aaagcaggat tgaccggtgt 1105621 tgtactggtt ctgacgctga cgctggtggc tttctggtgg tggcaacgtc cgcgaacgaa 1105681 tgctgtggct gctgactctt tagttggcgt tttggtcgat gagaataacg ccggatattc 1105741 cttggccaca gtgccgggag ccattcggtt tccccgggat ttgggtcctc attacgatta 1105801 ccagacggaa tggtggtatt acaccggtaa tctggaaact gctgacggtc ggcttttcgg 1105861 ctaccagctt acttttttcc gcagggctct cgcaccaccc ggcgaggggg tcgccatagc 1105921 ggatgcttct tcatggcgca cgacccaggt ctatatggcc cacttcgcga taagtgatat 1105981 ttcgaacagg ggctttgatc cggctgagaa attcagtcgg caggcgttgg gtttggctgg 1106041 tgctagctcg gagccgtatg cggtgtggct agacgattgg tatgcgcgtg aatccaacaa 1106101 caattcggtg caattgtttg ctcgaactca gaacacggtg ttggatttga cattgacgca 1106161 aacgctgccg cctatcttgc aaggaaatgc tgggttaagt gtgaaaggcg cgcaaccggg 1106221 aaacgcgtcc aactactact cgttagttcg tcaagaatcg cggggcactg tcagtgttaa 1106281 tggcgacaca ttcatggtta gtggtttgag ctggaaagat catgagtaca tgaccagtgc 1106341 gctggcccct gaagatgtgg gttgggattg gttcgggctc caattttaca atggcaccgc 1106401 tttgatgctt tttcagattc gacaggcgga tgggagtgtg acccgatttt ccagcggtac 1106461 ctttgttgcc ggggatggtg gcgtgatccc tctcgagtcg tccgatttcc gcatcaagac 1106521 gactgatcgt tggaccagtg accagagtgg cgccacctat ccgattgcat gggaaatcga 1106581 aattgaacgg ataggtttga cgctgcgcgg ggccgcatta atggctaatc aagaactgcg 1106641 gttatcgagg acttactggg aaggggcggt tgcccttgag ggtcgttatc aaggaatgcc 1106701 gatcagtggt cggggatacg ttgaaatgac cggctatgta caacggctgt cttgaagtcg 1106761 ggtaattgcc ggtgattctt ggtttagagg ctctcgaatg gtcgtcgggc agttgtgata 1106821 tcgctgcaaa ccctagagta cttattcgtc gttgtgtcaa caggtagttg ctggggtgtg 1106881 tcgctagtcg cacgcagata ccgcgtggtc gatcaatgtc gcaagggctc ggcgaggttg 1106941 gcggtcaggc aaatagggga gctcctctcg cgcctgtgcg gcataggcgg ctaccacatt 1107001 cttggccttt cctatgcccg gtgagcaacg cagcagtgtg agggcttcgg cgacgtggtc 1107061 gtcgtggatc ggtccggcca gcaactcacg cagccggctt gtgtcgggtg tctgctcacg 1107121 cagcgcgtag agcatcggca gcgtgtggac agcttggcca aggtcggcgc ccgatagcgt 1107181 agcggagtca ccggagatgg cgatgatgtc gcgcgagatc tcaaacgcag caccgatcat 1107241 gcgccccaag cgcgctacgc ggcggatctg ctcttcggcg gcgccggaga gtgccgctcc 1107301 gagctgtccg gatgctgcga tgagagagcc ggtcttctcg tgcacgactc ggaggtaatg 1107361 ctcgatcgtg tcgatatgcg aggcggggcc ccgggtcgcg cgcatctgcc cggtgatcag 1107421 ctcggcgaac gcctcggcga cgaccgcgaa ggcctcgggg tccagccgcg aggctagctg 1107481 tgaggccgtc gcgaatcggt agtcaccggc gaggattgcg aagttgttgg tccagcgtgt 1107541 gttgtcgcta ggtgtcttgc ggctcatgtc ggactcatcc acgactctgt cgtgacaaag 1107601 cgtccccagg tgcatcaact cgatggctgc ccccgcgacc gtgacctccc atccgtcggg 1107661 gtcggagccc agttgcgccg caagcaccgt gaaaagcggt ctaaacgggg tgccgccggc 1107721 gtcgacaagg tgcgccaccg tgtcgcgcat aacctcgtcg gcctgggaga gttcgctatt 1107781 gatcagctct gtaatccggg caatcccgtc gtggacgttg gcggtgaatt gcgggtcacc 1107841 caggctgact gccgggatca tgctcgtggc cgtaggcatg cgcacaacat tgacacgtgt 1107901 acaagataag gtatggcgtg ttcagtgcag ggtcagcgtc accgtctgac ccagcgccgc 1107961 accggctacc gtattggcca gccgggctgg cagcgccacc aaaacgaccc ggtcactatc 1108021 agccgcctgg gccttctgct gggccgagac caggaccacg atggcgtcgg tggccaagag 1108081 ccgtagagct gccggcgaat cggttaccgg cgcggccagc acgtcgacca catccccgac 1108141 ccgaacaagg tcgaccaaag cgctgtcagc cagatgcagc ggcacgatgc gggcgtccgg 1108201 gccggcagtc gactcggcca accggctgcc cagtaaacgc acgtcggtga gcacctcgcc 1108261 acggcgtgtc gggctggcca gcgtcgaacc caccactgcg tccaggtcag cttgcgaccc 1108321 gtcgggaagc gtggtggccg aacgtttttc cagcctgaca tcaccgggag tcaatgcggt 1108381 accggggcgc agatcgtgcg cggccaccac cacctcggcg cgatcatcct ctggattgga 1108441 ccgcagcgcc gcaacgccgg ccagcatgac cagcccggcc gcggcgaagc gccgggcccg 1108501 cacggtccgg gtccagtccg ggcgcaaaaa cgccgatatc cggctgacca ggctcggatt 1108561 cagggaggat tccgccacac cgcaaacggt aggcgcagcg ccgtgctagg cagcgccggt 1108621 cagaaatccc cttgtggata acctctcaac tcagacggcc gcggcggcgg ttgtggagct 1108681 ggttgacttc tcggttgacc ccgaagcctt gctttcactc gagccagaac tccccgaact 1108741 ccccgaactc ttcgtcgact cgctggtcga ggatccgttg gtctggctct tggacttctt 1108801 gcccgactcg cggctgtcgg tgcggtagaa gccggtgcct ttgaacacca cgccgaccgc 1108861 attgaacagc ttgcgcagcc ggccagaaca ccgctcgcac gtggtcagcg catcgtcggt 1108921 gaaggcctgc acaacatcga agcggttggc gcactgggtg cactcgtagc tgtaggttgg 1108981 cacaagaacc tccggaaatg tcactcggcg ttagcactct accgtctcaa gtgctagaac 1109041 cgctaggtga gttccgtcat tccccgcacg gcagcgcgat cagcccgcgc tccggtgtga 1109101 gcgcatgagt catgggtacg tcgtgcggct ccgacggcaa cacgtcgacc agttcgacag 1109161 tgcgcaccac cgcgactaga cgagcgtgcg ggtcgcggca ccgcagcgag cgatcgtaga 1109221 agccgcgacc tcggcccagt cgcacgccct ggcggtcgac agccagcgcc ggcaccagca 1109281 ccaagctggc ctgcgccagc gcggcttccg gcagccaagg ttcgggtggt tcgagcagtc 1109341 cccagcgtgc gcgcgcgagt ccgccggcac ggtactcgcc ccaccgcaac ggcaacggga 1109401 ggtcaccgcc ggcggtgcgc gccaccggca acagcactcg ccccgcgcgg cgcagcaaca 1109461 catccaacat ctcgattgac cccggctcgc cgcctaccgg cacatacgcg cagacggtgc 1109521 tgtcgctggt gaccatgcgc tccaggtgtc cacgcaacat ccgggcctcg gcggcgcgca 1109581 cgtcgtcggc aacgcggcgt cgggccgcca ggagctggtc gcgcaacgcc gacttgctcg 1109641 cgatcgccat gtcctcaacg atgacacagc cccggccgtc ccgcgcgagc gccgggacag 1109701 cgccaacgaa gaggcgggca atcagcacgc tgcgggttat cgtgtgaacg atgtcacgcc 1109761 cagaagtact aacgccgttc acggcaatcg tcccggcagc cggcctgggt acgcgctttc 1109821 tgccggccac caagacggtg cccaaggagc tgctgcccgt cgtcgacact cccggtatcg 1109881 agctggtggc cgccgaggcg gccgcggccg gtgccgaacg gctggtgatc gtcacctccg 1109941 agggtaagga cggggtggtc gcgcatttcg tggaagacct ggtgctggag ggcacgctcg 1110001 aggcccgagg caagatcgcc atgctggcca aggtgcgtcg cgccccggca ctgatcaagg 1110061 tcgaatccgt ggtgcaggcc gagccgctgg gactgggaca cgccatcggc tgtgtggagc 1110121 cgacgctgtc gcccgacgaa gacgctgtcg cggtgctgct gcctgacgac ctggtgctgc 1110181 cgaccggcgt cctggagacg atgtcgaagg tgcgagccag caggggcggc accgtgctgt 1110241 gtgctatcga ggtggcgcgc gaggagatca gtgcctacgg ggttttcgat gtcgagccgg 1110301 tccccgatgg tgactacacc gacgatccca acgtgctgaa ggtcaggggc atggtcgaaa 1110361 agcccaaggc cgaaacggcg ccgtcgaggt atgcggcggc cggccgctac gttctagacc 1110421 gtgccatctt cgatgcgtta cgccgcatcg accggggtgc aggcggtgaa gtgcagctca 1110481 ccgatgcgat cgcgctgctg attgccgagg gccatcccgt ccatgtcgtc gtccaccaag 1110541 ggtcccgaca cgacctggga aatccgggcg ggtacctcaa ggctgcggtt gactttgcat 1110601 tggatcgtga cgactacggc ccggacttgc ggcgatggtt ggtggcgcga ctgggtctga 1110661 cagagcagta gcctggcgac gatacggcac ggacggttcc ggggtggggg atgcccggcc 1110721 ccatggctcg acggaaaggc gggcgctgtg cgttctgtgg aggagcagca ggctcggata 1110781 tcggccgctg cggtagcccc gaggccgata cgcgttgcga tcgccgaggc gcagggattg 1110841 atgtgcgccg aagaagtggt caccgaacgt ccaatgcccg gttttgatca ggccgccatc 1110901 gacggctacg cggtgcgcag tgtcgatgtg gccggtgtcg gtgataccgg tggtgtccaa 1110961 gtctttgccg accacggcga tcttgacggt cgcgacgtgc tgaccctacc ggtgatggga 1111021 accatcgaag ccggagcgcg caccctgagc aggttgcagc ctcgccaagc ggtccgggtg 1111081 cagaccggcg cgccgcttcc caccctggcc gatgcggtcc tgccgttgcg gtggaccgat 1111141 ggcggaatgt ctcgggtgcg ggtgctgcgc ggggcgccgt cgggcgccta cgtgcggcgt 1111201 gcgggcgacg acgtgcagcc cggtgatgtg gcggtgcgcg cggggacgat catcggcgca 1111261 gcccaggtgg ggttgctggc ggcggtcggc cgtgaacggg tgctggtgca ccctcgtccg 1111321 cggctgtcgg tgatggccgt cgggggcgag ttggtcgaca tctcgcggac cccgggcaac 1111381 gggcaggttt atgacgtcaa ctcctatgcc ttggctgcgg cgggccggga tgccggtgcg 1111441 gaggtgaacc gggttggcat cgtcagcaac gaccctacgg aacttggcga aatcgtcgag 1111501 ggccagctca atcgggctga ggtcgtggtg atcgccggcg gggtgggcgg tgcggcggca 1111561 gaagcggtca ggtcggtgct ttccgagctc ggtgagatgg aggtcgtgcg ggtcgccatg 1111621 catccgggat ccgtgcaggg cttcggacag ctcggccgtg atggtgtacc gacctttctg 1111681 ctgccggcca acccggtcag cgccctggtg gtcttcgagg tgatggttcg gccgctgatc 1111741 cggctgtcgc tgggtaaacg gcatccgatg cgacggatcg tgtcggcgcg cacgctgtcg 1111801 ccgatcacgt cggtggccgg gcgcaagggc tacctgcgtg gccagttgat gcgtgatcag 1111861 gacagcggcg agtacctggt gcaggcgctg ggcggcgctc cgggggcgtc atcgcacctg 1111921 ctcgcgacgc ttgccgaagc gaactgtctg gttgtggttc ccaccggggc cgagcagatt 1111981 cgcacgggtg agatcgtgga tgtcgccttc ctggctcagc acggctgagc cgaaccacgg 1112041 cgactctggt gaacttatgg cgctcgaatc cccggcatcc gggatggccg atggccgtcg 1112101 ggccgctgcg ggtctcggca ggcgtgattc ggctgcggcc ggtgcggatg cgtgacggcg 1112161 tgcattggag ccggatccgg ttggccgacc gtgcacatct tgagccgtgg gagcccagcg 1112221 cggacggcga gtggaccgtc cggcacacgg ttgctgcctg gccggcggtg tgttcgggtc 1112281 tgcgttcgga ggctcgcaac ggccgcatgc tgccgtacgt gatcgagctg gatgggcagt 1112341 tctgcggcca gttgaccatc ggcaatgtca cccacggggc cttgcggtcg gcctggatcg 1112401 gctattgggt accaagcgcg gccactggcg gaggggtggc caccggagcg ttggcgttgg 1112461 gtctcgacca ctgcttcggt ccggtcatgc tgcatcgagt cgaggccacc gtgcgcccgg 1112521 agaatgcggc cagtcgcgcc gtgctggcaa aggttggctt ccgcgaggag gggctgttgc 1112581 gccgttacct tgaggttgac cgggcatggc gagaccatct gttgatggcg atcaccgtcg 1112641 aagaggttta cgggtcggtg gcctcgacgc tggtccgtgc cgggcatgcc agctggccct 1112701 aacgcggaat cgcaaccaaa ctgtgactgg cgcgacacgt gtggcgtgtg gtgcttgtga 1112761 gagatgaatt acaggtgtgt aattgccctg ggcgctttga cccggccgcg ctggccaacg 1112821 atggggcctc gcggggatcg gaaccgaaga gagcaggtca tcatgccaag catcccgcag 1112881 tcgttgttgt ggatatcgct cgtggtgctc tggctgttcg tgctggttcc catgctgatc 1112941 agcaaacgtg atgccgttcg gcgcaccagc gatgtggctt tggcgactcg ggtactcaac 1113001 ggtggcgctg gtgcgcgcct gctcaagcga ggtggtcccg ccgcgggaca tcgctggggg 1113061 tacctcccgc ccgaagggca gggggacgac ccggactgga agccggagga agactggcgc 1113121 gacgacccgg tcgagggcgg gttcgccgac gtcgagcatg acatcgacga ggaccaggag 1113181 gccgacgatg cgcgccgtcg gggtgcggtt gtcatgaagg ttgccgctcc gcagaccgca 1113241 ggtgccgacg agccggacta cttagacgtc gatgtggtcg aagaagactc ggaggcgctt 1113301 ccggtggggg ctggcgctgc ggtcggcgag tccgccgacg aggccgatgc cgaagctgct 1113361 gacggagttg cgggccacgc cgacccggag gccgacccgg tcgaatacga atacgaatac 1113421 gaatacgtcg aggacacctg cggtttggag ctcgaggagg acgaccagga agcgccaccg 1113481 accgtcgcat ccggcacgtc acggcggcgc cgattcgaca ccaagaccgc cgccgcggtc 1113541 agcgcccgca agtacacctt ccgcaaacgt gcgttgatcg tgatggcggt gatcctggtt 1113601 ggctctgccg ccgcggcctt cgagctgacc ccggtcgcgt ggtggatctg tggtagcgcc 1113661 accggtgtga cggtgctcta cctggcatat ttgcgtcggc aaacccgcat cgaggagaag 1113721 gtgcgtcggc ggcggatgca gcggatcgcg cgggcgcggc tcggtgtaca gaacacccgt 1113781 gaccgcgagt acgatgtggt gccgtcgcgg ctgcgccgtc cgggcgcggt ggtcctggag 1113841 atcgacgacg aggacccgat cttcacgcac ctggagagcg cggccccgat acggaactac 1113901 ggctggccca gggacctgcc ccgggcggtg ggtcagtagg gcgcgcagtt cggccatcgg 1113961 cgccgctgct ggtagcctgc taccgatcag gggctatggc gcagttggta gcgcgactcg 1114021 ttcgcatcga gtaggtcagg ggttcgaatc cccttagctc caccatctaa tcagtagcca 1114081 tcggcagcct cgttggctgt gccgccgcgg acgtggttga gacggcgagc acagccctcg 1114141 gggcaatcct ggcaggtcgc aatgcggtgg tgccgccacg gtgtccacgt cgaggcgccg 1114201 gccttgtggt accggtaaag tgctgtggcg accgcgatct ggcgcgaagc ctgatgaaga 1114261 tcgaatattc ggctgaatat tcgctaagac atgtgtggcg gcgtccgatc ctgtcacaac 1114321 ctgcccctag ggtcggtgca tgagcacgaa atactacctg cagaaggtcc ctgtcgaagc 1114381 cgtccagccg ggcttttcgc tggccattcc acacgatggc gactatcgcc ttttccaggt 1114441 cgactgcacg caaatgtgcc agcgaagtgg ccagccggtg atgatcagac tcatgtcgga 1114501 gtccgtcgat ggtggccagc cgtgggtctt ggaatatgaa gcgggcacgg cggtaatccg 1114561 gcttctcggt gtttgccagg ccgcttcgta gggtggcgtg tgctcgctaa ccgggcttgg 1114621 cggcggctac aaacggcaac gcgcgttgtg tctactgctc gacgtccact agcccggccg 1114681 accgagacag gttgacgaag gcattccggt caaacatcgt gagtccgatg ttgccggcgg 1114741 cggcgttcgg cgcgtagcgc atcggcgggc attggccggc atagctggtg tggatcgtga 1114801 tccgcccggc tggccgcagc actcgcacct tctcgcgggc gatccggaac ggttccggca 1114861 tcagctacag cgcgccgaaa caacaaacag catcgaatgt ttcgtcgccg aatggcacca 1114921 tgcgggcgtg gccgcggata tgacacgtcc gtggcccacg gttgtccagg gcggtgctgg 1114981 tcagcgtcgg cgcagagatg tcgaacccga ccgcaagacc cccgtccggt ggatgtccgg 1115041 acagcggctc agtgaaatta cctggcccac aaccgatatc gagcactctg tgggcgcggc 1115101 cgaggtgcag agacaccgcg gcgcggtgcc gctcggttcg ggtggtgatg cggctggcaa 1115161 ggtggaagga ggccggacgc cacaaccgtt cgtacaaccg taagctgggc ttggcgccgg 1115221 gccggattgg acgggatagc cgaattgacc ggcgcacgag tcgaagatct tgcggggatg 1115281 gacgtctttc agggatgtcc ggccgagggt ctggtgtcat tggcggcgag cgttcagccg 1115341 ttgcgggccg ctgccggcca ggtgctgctg cggcagggcg agccggcggt ttcgtttctg 1115401 cttatctcgt cgggtagcgc agaagtcagc catgttggcg acgatggtgt tgcgatcatc 1115461 gctcgggcgc tgccgggcat gatcgtcggc gaaatcgcgc tgctgcgcga tagcccgcgc 1115521 agcgcgacgg tcaccaccat cgagccgctg accggctgga cgggtggccg cggcgctttc 1115581 gccacaatgg tgcacatccc cggggtcggt gagcgattgc tgcgcaccgc caggcagcgt 1115641 ctcgccgcct tcgtctcccc gattccggta cggcttgccg acgggactca actgatgcta 1115701 cgccccgtgc tgcccggtga ccgcgagcgg accgtgcacg gacacatcca gttctccggc 1115761 gagacgctgt atcgacggtt catgtcggct cgtgttccca gtccggcgtt gatgcactac 1115821 ctgtcggaag tcgactacgt cgaccacttc gtctgggtgg tgaccgacgg aagcgacccc 1115881 gtagccgacg cgcgttttgt gcgggatgaa accgatccga cggtcgccga gatcgcgttc 1115941 acggttgccg acgcgtatca gggcaggggg attggaagct ttctcatcgg tgcgttgtcc 1116001 gtggccgccc gggtcgacgg cgtcgaaagg tttgccgcgc gcatgctttc cgacaatgtg 1116061 ccgatgcgaa cgatcatgga ccgctacggg gcggtgtggc agcgcgagga cgtcggagtc 1116121 atcaccacca tgatcgatgt gccgggtccg ggtgagctga gcttggggcg cgagatggtc 1116181 gaccagatca accgggtagc ccggcaagtg atcgaggccg tcggctgatc accgaccccg 1116241 ggtcggtgcg tccgccgctg gcaccgcagt tcgccgctga tctgctagtc aaaacggtgt 1116301 cgacgttgcg cagctcaggg gctgcgttgg gtagattgac cacgatgcgc aaggcggtac 1116361 tggcagtcgg atcggtgtgc tggcttgtcg gctgctcatc aggggccagc tccaccaccg 1116421 cctcgaccgg cgacatcgcc aaggtggccg aagtgaagtc gggctttgga cctgaataca 1116481 ccgtcaccga tgtcactccc agggccatcg atcccgggtt cttttccgcc cgcaaactgc 1116541 ccgacgggct gagtttcgat ccggcgaact gtgcgcaagt ggcggccggg ccccagctgc 1116601 cgaccgggtt gcagggcaac atggccgccg tctccgccga gggcaacggc aaccggttcg 1116661 tcgtcatcgc ggtggagacg tcccagccgc tgccggcccc cagccccggg aaagactgca 1116721 gcaaggtgac tttttccggg acgcagctgc ggggcggcat cgaggtggtc gatgtaccgc 1116781 acatcgacgg gacacagacg ctgggcgtgc atcgcgtgtt gcaggcggtc gtcggcgggt 1116841 cagcgcgcac cggcgagctc tatgactatt ccgctcggtt cggggactac caggtgattg 1116901 tcatcgccaa tccactggta atccctggac ggccggttgc gcgggtcgat acgcaacgcg 1116961 cccgcgatct gctcgtacag gcggtggccg cggtccgggg ttgaccgagt tagcggacgt 1117021 cgcgcggccg gaactggatg ctcacgcgcg gacccgtcgg cgccgatgtc ttgggcaccg 1117081 catgctcgaa ggtgcgttga cacgatccgc ccatcaccaa tagatcgcca tgcgccaacg 1117141 gcagtcgcaa cgatggaccg cggccacgcg gccgcagcgc gaagacgcgg gtggcgccga 1117201 ggctgacgat cgccaccata gtgtcctcag tgctgccgcg accaatggtg tcgccatgcc 1117261 aggcgacgct gtcagagccg tcgcggtagt agcacagccc ggcggtggtg aagggctcac 1117321 ccagttcgcc gccgtagatg tcgttgagcc gccggcgcat ccgcgccagc tgcggatgcg 1117381 gcggatcttc gatggtcagg tcgtgaaaac tcaccagccg cggcacatcg accacccggt 1117441 cgtacatctg acggcgctcg gctcgccacg gcaccgtcga caacaacgcg tccagcagtt 1117501 cttcgccgcc ggtcagccag cccgaacgga tgtcgataaa ggctccgtcg ccgagctgtc 1117561 ttcgctcgtt gtgctcgaag agcgcgcctt gaaccgcgat cgccacgccg ccaagcttat 1117621 cgcacattcg ttcgatggcg ccgccccggc tacggtttga cctgtgggtg tcgaattggg 1117681 gtcaaattcc gaggtcggcg cgctaagagt ggtcatcctg caccgcccgg gggccgaact 1117741 gcgccggctc acaccgcgca acaccgacca gctgctgttc gacggcctgc cctgggtatc 1117801 ccgcgcgcag gacgagcacg acgaattcgc cgagctgctg gcttcccgcg gtgcggaagt 1117861 gctgttgctg tcggacctgt tgactgaggc actacatcac agcggggccg cccgcatgca 1117921 ggggatcgcc gctgccgtcg acgcaccgcg gctgggactg ccgctggcgc aagagctttc 1117981 ggcctacctg cgtagtctcg acccaggcag gttggcgcat gtgctgacgg ccggcatgac 1118041 cttcaacgag ctcccgtcgg acacgcggac cgacgtgtcg ttggtgttgc gtatgcacca 1118101 tggcggagac ttcgtcattg agccgttgcc gaacctggtg ttcacccgcg actcgtcgat 1118161 atggatcggg ccgcgggtgg tgatcccgtc gctggcatta cgggcacggg tgcgcgaagc 1118221 gtcgctgacc gacctcatct atgctcatca cccgcggttc accggtgtgc ggcgtgccta 1118281 tgaatcgcgc accgctccgg tcgagggtgg cgacgtgttg ttgctcgccc cgggtgtggt 1118341 cgctgtcgga gtgggcgagc ggactacacc agcaggcgcg gaagcattgg cgcgcagcct 1118401 ttttgacgat gatcttgcgc ataccgtgct cgccgtgccg atcgctcagc agcgcgcgca 1118461 aatgcatctg gacacggtgt gcacgatggt cgacaccgat acgatggtga tgtacgccaa 1118521 cgttgtcgac acgctcgagg cgttcacgat ccagcgcaca cccgacggcg tgaccatcgg 1118581 cgatgcggcc ccgttcgcgg aggcggctgc caaggcgatg ggaatcgaca agctgcgggt 1118641 aattcatacc ggaatggacc ccgtcgtcgc tgaacgcgaa cagtgggacg acggcaacaa 1118701 cacgttggcg ttggcgcccg gtgtcgttgt cgcctacgag cgcaacgtac agaccaacgc 1118761 ccgcctgcag gacgcgggca tcgaagtgct taccatcgcc ggctccgaat tgggtaccgg 1118821 ccgtggcggg ccccgctgca tgtcctgtcc ggccgcccgc gatccgcttt aggagtggcg 1118881 atttcggcgc ctggcggcgc cgcagatcac cgccagctgg gcagccagat ctccaggttc 1118941 caggtctgtt gtgagattgg cagaccggtg agcaccggat acagccacgc aaagttcgtc 1119001 accacgaggg ccacgtagca gcagacgacg atcagcccca gtgtgcgtcg ttcggagccc 1119061 tgaccggggt gatagaggat atcgccgaga accagcgaaa tgcccatcac cagaaatggc 1119121 gccatggtcg ctgcgtagaa gaagtacatc tgccggtcga tgtcggcgaa ccacggcagc 1119181 caaccggcgc agtagccgac caggaccacc gcataacgcc agtcccggcg cacaaacata 1119241 cgccaccccg cgtatgccag gactggcacc gccagccacc acatcgcggg cgtgccgacc 1119301 agcatctcgg ccttgacgca cgactgtgcg ccgcagcctg caacgtcttg ctggtcgatg 1119361 gcgtacagca ccggccgcaa cgacatgggc caggtccacg gtttggattc ccaagggtgg 1119421 tagttgcctg cggaattcgt caggcccgcg tggaagtgga acgctttggc ggtgtagtgc 1119481 cagagcgagc gcacggcgtc gggcagcgga acaaccgagt tgcgaccgac cgcttgaccg 1119541 accgcatgcc gatcgatcgc ggtctcggac gcgaaccacg gagcgtaggt ggccagatag 1119601 accgcgaacg ggatcaaccc cagcgcatac ccgctgggaa gcacgtcacg ccgcactgtc 1119661 cccagccacg gtctttgcac ttggtactga cgtcgcgccg ccacgtcgaa cgccagcgcc 1119721 atcgcgccga agaacagcac gaagtacacg ccggaccact tggtggcgca agccaatccc 1119781 agcagcaccc cggcgccgaa ccgccaccag cgcacaccca cccgcggtcc ccacacggtg 1119841 gcggcgctgc ggccggccag cagagcgatg tgcatccgtt cgcgaacctg atcgcggtcg 1119901 acgatgagcg cgccgaacgc cgcgacgacg aagaacgtca ggaagccgtc cagcagcgcg 1119961 gtccgcgcgg tgacgaagct gaccccgtcg cagatcagca gcaccccggc gatggcgccg 1120021 accaatgtcg accggctgat ccgccgcacg atccgcacca ccagcgccac caggagcaca 1120081 cccagcaggg cgccggtgaa ccgccagccg aatccgttgt aaccgaagat ggcctccccg 1120141 atcgcgatca gctgcttacc gaccggcggg tgaaccacca ggccgtaccc ggggttgtct 1120201 tccaccccat ggttgttcag cacctgccag gcctggggtg cgtaatgctt ctcgtcgaag 1120261 atgggggtgc cggcatcggt cagcgagccc aggttcagga accgggtcac cgtggccagc 1120321 agcgtgatca ggccggtcac gatccagccg cgtaaccggt ccaggggccc gaaatccgcg 1120381 accggcacca gcgggccggg gctgacgacg ggtaccacag gctcctcggg gcggtccttg 1120441 gccaggacac aggattctgg gggccgggcg gtcatcggtg tcgatcgtag gctgtccgtc 1120501 atgtcctctg gtcgcctgtt gctcggcgcc accccgctgg gccagccgtc ggatgcgtca 1120561 ccacgcctgg cggccgcgtt ggccaccgcc gatgtggtgg cggccgagga cacccggcgg 1120621 gtgcggaaat tggccaaggc tcttgacatc cggattggtg gacgggtggt cagcctgttc 1120681 gaccgggtgg aggcgttgcg cgtgacggcc cttctcgacg cgatcaataa cggtgcgacg 1120741 gtgctggtgg tcagtgacgc cgggaccccg gtgatcagcg atcccggcta tcggctggtc 1120801 gcggcgtgca tcgacgcggg ggtttcggtg acgtgtttac ccgggccgtc cgcggtgacc 1120861 accgcgctgg tgatgtccgg tctgccggcg gagaagttct gcttcgaggg tttcgccccg 1120921 cgcaagggtg cggcgcgccg ggcctggctg gccgaactgg ccgaggagcg gcgcacctgt 1120981 gttttcttcg aatccccgcg ccggttggct gcgtgcctta acgatgccgt cgagcagctc 1121041 ggtggtgccc gtccggcggc gatctgccgg gagctgacca aggtgcatga ggaagtggtg 1121101 cgcggatcgc ttgacgagtt ggcgatctgg gcggccggtg gtgtgctcgg cgagatcacc 1121161 gtggtggtgg cgggcgccgc cccccacgcc gaactgtcgt cgctgatagc ccaagtggag 1121221 gagttcgtcg cggcgggtat tcgtgtcaag gacgcctgca gcgaggtagc ggcggcacat 1121281 ccgggggtgc gcacccgcca gctttacgac gcggtgctgc aatcacggcg ggaaaccggc 1121341 gggccagcgc agccgtagtc ggtcaggtta ggggatacac accccgatgg gaccgaatcc 1121401 gggtgtgcac agacgcgacg ggagcgccgg cagcggaggc ggtcccggca gtgggggagg 1121461 tgccggcaat gcgggcacgg ccggtaacgg cggcagaccc gctggcagtg ccggcaggcc 1121521 ggccgccagc gccggcagcg ccgcagccag cgccgcagga tccacgcctg gcagcgccgg 1121581 gaggcccggc ggtagaccac ctgccgccag cgccggcagc gccgcagcca gcgtcgccgg 1121641 atccaccccc gccagaccag ccggcagacc agcggccgcc agcatcggca gaccacctgc 1121701 caccagcgcc gtcagctccg ccggcgacat ccccgccaga cccggcaaac tcgtcggcag 1121761 accggccgcg gccgccatcg ccatcaggtc ggtcggcgac acacctggca gactcggaaa 1121821 acccacgcct ggcaggccgg cggccgccgc catcgcgagc agactcgccg gcgtcacgcc 1121881 cggcagggcg ggcagcccca cagccggcag agcggccgcc gccgattgcg ccccgggcag 1121941 cagcagggag gccaccgtgc tggctgttcc gcgggccgtc ggcaggatgc cggaagactc 1122001 cagcgcgtta accgcaagca cgagataggt gacggccacc gcgctcgcgg tcaccacccc 1122061 cgccgccgtg cctcccacac ccaggacacc gttcacgacc gcggcagcgg tgttcacccc 1122121 ggtgattggg tcgggtacgc cggggatgcc gatgcccggg atgccgatgc ccgggatgcc 1122181 gatgcccggg atgccgacgc caggtacgct cggcgcggcc aggttcggca gggccggtgg 1122241 gggaggcagg gccgggccgg ccgcaccggg aatgttgggt aggccgggaa cggctggcgg 1122301 gcccaccctg ggcacggcgg cagccgccgg tacacccggc cgcggaacca atcccggcgc 1122361 gatcgtgtcg gcgaccggct cgaacggcgc cggcaccgcg tgctccgctg cggccggccc 1122421 cgatggtgtc gcggtgtcgg gcatcaggac cgcagccagc cgatcgcact gctggccggc 1122481 gctgcaggtg ccgcccgcga cccgctccgg ggtggacaac gtcaacggtg ccaccgccag 1122541 cgttccccct atcgcggcag cagccgcggt gcccacgatc gcgagtctca ttacaaaccc 1122601 ctctcgaact cgacacgaga tagacacgcg tcgatggccc gagcttaggc gcacccggca 1122661 caccatgtgg gcgttatgcc aatttccgcc gcccgctggg ctaccgcact ttgctggcta 1122721 accgagccgg ggtggtgcgc gtggcggccg gcagcccgac gatgggagcg gctttgtgca 1122781 ggcactccgc ccattcggcg tccggatcgg agtcggcggt gatcccgccg ccaacgccca 1122841 gcacggcgtt gcctgcggta tcgaattcga cggtgcggat tgcgacgttg agctcgcatc 1122901 cggcgaccgg tgacgccaaa ccgactgtgc cgcaatatat cccgcggcga tatcgctccc 1122961 attgtgaaat caattggcga gcccgcagtt taggtgtgcc ggtgaccgag gccggcggga 1123021 aggcggcgtc gagcagcgct gacatcggtt cctcgagcgg aacccgcgcc gacaccgtgg 1123081 acaccaggtg ccacactccc ggcgctggtc gcaccaccaa cagctcgggc accgtcacgg 1123141 taccggtaac cgctacccgg ccgaggtcgt tgcggaccag atccacgatc atgatgttct 1123201 cggccacctc tttggccgat gcccgcagcg ccgacggcgg ggcgtccagc ggcagcgtgc 1123261 ccttgatcgg gctcgatgtc accacggacc cgcggcggcg caggaatagc tccggggata 1123321 gcgatgcgac ggctccccac ggtccggcga caaaggcgga ccgggacgga gcggtacgac 1123381 cgaacccgtc gatgaagaag tccagcgggg atccggtgac cgtcccggcg aattgggtgc 1123441 acacgcacgc ttgatagacc tcgcccgcgc cgatagcttc cagacacgcc agtaccccgt 1123501 cgcggtgcgc tgcccggtcg gccggttccc agtcgatccg gcatgccggt gccggtctgg 1123561 cgaccgatgc ccgagtggtc gccaacgcgc tggccagcca gtccgctatc ggcgcaccgg 1123621 acaggctctc ataccaccac tggccgtcgc ggtcgcggcg cagcacgcaa tcggtccagc 1123681 cgccggcggc ctcggggatc cggtggggtc gcccgtcggc gccggcgtcc gggtaggaca 1123741 ggtagccgac ccagccgccg cccaccgccc cggtggcatc gggcccgccg gtgcccggcg 1123801 ggcccgagaa cacgtcgtcg ccgctgaccg gttgtataga cacactcggt gcgatcaccg 1123861 ccagcgcacc gaaccattcg ccggtcagcg ccgccggtgg tggcaagtcg agtcgactgg 1123921 tggcgcggcc gaccgcccgc agcaccgcag gcgctccgcc aagatcgccg agtcggtcga 1123981 ttcgcaccgt tctagcttga cagaactgtg gattttcgca gcgcaagtgg ctgcgtgggg 1124041 atttcgtccg cgtgctaagc tcccacgcta agttcgatcc gtgaccggct ccggtctccg 1124101 tcccgggggg tgttgctgtg cgagcagcca atgccaatgc cgtttctcgc tgaccgcgag 1124161 acgttgacgc tcggtgtgat cttgaagtag cgatggtttt aagaagtagg aaaagcacgc 1124221 tcggcgttgt cgtgtgctta gcgctggtgc tcggtgggcc gctcagcggt tgcagcagca 1124281 gcgcgagcca ccgcggtcca ctgaacgcaa tgggaagtcc ggccataccg tcgacggcgc 1124341 aggagatacc caacccgttg cgcggtcagt acgaagacct catggaaccg ctgtttccgc 1124401 aggggaaccc cgcgcagcaa cgctatccgc cttggcccgc gtcctacgac gcgagtttgc 1124461 gagtctcctg gcggcagctg cagcctacgg atccgcgcac tctgcccccg gatgctccgg 1124521 acgaccgcaa gtacgacttc agcgtgatcg acaacgcgtt gaccaggctc gccgaccgcg 1124581 gcatgcggct gacgctgcgg gtgtacgcct acagctcgtg ctgcaaggct tcctatccgg 1124641 acggcactaa catcgcgatt cccgactggg agcgcgctat cgccagcacc aacaccagtt 1124701 atccagggcc ggcgaccgat ccctcgaccg gggtggtgca ggtggtgccg aatttcaacg 1124761 attcgaccta tcttaacgat tttgcgcagt tgctcgccgc gcttggtcgc cgctacgacg 1124821 gtgacgagcg cctcagcgtg ttcgagttct ccgggtacgg ggacttcagc gaaaatcacg 1124881 tcgcatacct gcgcgacacg ctcggtgcgc cgggtccggg cccggatgaa agcgtggcga 1124941 ccctgggcta ttacagccag ttccgtgatc agaacatcac caccgcgtcc atcaaacagc 1125001 taatcgcggc gaacgtcagc gccttcccgc atacccaact ggtgaccagt cccgctaatc 1125061 cggaaatcgt gcgagaactg ttcgccgacg aggtcaccaa caagcttgcc gcgccggtgg 1125121 gtgtccgctc ggattgcctg ggcgtcgacg cgccgttgcc ggcctgggcc gagtccagca 1125181 cttcgcacta tgtgcagacc aaagacccgg tggtcgccgc gctgcggcag cggctggcaa 1125241 cggcgccggt gatcaccgag tggtgcgagt tgccgaccgg cagttcgccg cgggcttact 1125301 acgagaaggg cctgcgcgac gtcatcaggt atcacgtgtc gatgacgtcg agcgttaact 1125361 tccccgacca gacggcgacc tcgccgatgg accccgcgtt gtacctggtg tgggcgcaag 1125421 ctaacgccgc cgcaggctat cggtactcgg tcgaagcgca gccggggtcg caagcgctag 1125481 cgggcaaggt cgcgacgatc tcggtcacct ggaccaacta cggcgctgct gccgccaccg 1125541 aaaagtgggt gcccggctac cggctggtgg attccaccgg acaggtggtt cggacgctgc 1125601 cggcagcggt ggacctgaag acgctggtct ccgaccagcg cggcgatcgc agcagcgacc 1125661 agccgacacc ggcgtcggtc gccgagacgg ttcgcgttga tctgtccggc ttgcccgcgg 1125721 gccactacac gctgcgggcc gcgatcgact ggcaacagca caaaccgaac ggctcccatg 1125781 tggtgaacta tccgtccatg ctgttgtccc gcgacggccg cgacgattcc gggttttatc 1125841 ccgtcgccac gctcgacatc ccacgcgacg cgcagaccgc ggtcaacgct tcgtaggtgg 1125901 ctttcccgtc gctgcggtcc gctcacttgc cttcgggtgg ttgcggcggc tggtagcggg 1125961 gaaatacccc ggtgggcggc ggcagcgctg tgccgggggt cagccgaaca cctacggcgg 1126021 cgaacgaccg ctggtttggg gcctggccga gcaggtccaa aattttgccg gccgactccg 1126081 gcatcaccgg ctggatcagc agtgccgcga tgcggactac ctcgcaggtg acgtagagcg 1126141 tggtgcggaa ccgggcctga tcggcttcgg actcgctctt gcgcagtacc cacggctgct 1126201 gcaccgaaaa gtacttgttc gcgtcgccga gcatcagcca gatcgcctcc agcgccaggt 1126261 gcatcgcctg tgcgtcgaag tgaccgcgca ctcgctccaa caagccatcg gcggtcgcaa 1126321 gcagcgcggc gtcggcgtcg gcgaactcac ccgggttggg caccctgccg tcaaggtttt 1126381 tggccaccat cgacaacgag cgttgggcca agttgccgag ctcgttggcc agatcggtgt 1126441 tgatccgagt gacgatggcc tcgtcgctgt aactgccgtc ctggccgaac gggacctccc 1126501 gcaacaggaa gtagcggacc tggtccaccc cgagcgcttc cgccagggca accgggtcga 1126561 cgatgttgcc caccgattta ctcatcttct cgccgcggtt gtgcaagaac ccgtgcgcga 1126621 agatccttcg cggcaactcg attccggctg acatcaaaaa cgccggccaa tagacggcat 1126681 gaaacctgat gatgtccttg ccgatcatgt gcaaatcggc gggccagtag cggcggaaca 1126741 actccgagtc ggtatccggg aagcccgccc cggtcaggta attggtcagc gcgtcgaccc 1126801 agacgtacat gacgtggtcg gggtgctcgg gcacctgcac accccagtca aacgaggtgc 1126861 gcgagatcga caggtcgtcc aggccgccgg agacgaagct gatcacttcg ttgcgccgcg 1126921 tctccggcgc gatgaagtcg gggttggcgt gatagtgggc cagcagcttg tcggtatagg 1126981 ccgacagccg gaagaagtag gtctgctcct cggtccaggt caccggcgtg ccggtctcta 1127041 ccgtcaggcg cgtgccgtcg acaagttggg tctccgattc gacgaagaac cgctcgtcgc 1127101 gcaccgagta ccacccggaa tagttgtcca gatagatgtc gccggccgcc gacatccgtc 1127161 gccagagttc cttggacgcc tcgtggtggt cggcatcggt agtgcggatg aatcggtcga 1127221 aggagatgtt cagcgcctcc tgcatgcgct gaaacacgtc ggaattgcgc cgggcaagcg 1127281 ccgcggtggg cacgcccgct gccgcggcgg cttgtgcgac cttcaggcca tgctcgtcgg 1127341 tcccggtcag gaagcgcacg tcatagccat ccagccgttt gaaccgggcg atcgcgtcgg 1127401 tggcgatgta ttcgtaggcg tgacctacgt ggggtgcagc gttgggatat gcgatcgcgg 1127461 tggtgacgta atagggcttc atttcgacac caccctattg tgtgcgggtg agctccgacc 1127521 gcccagccag acgagatcca ccgcccgctc cggaacccct ggcgccgttg gtcgacgccc 1127581 acacccatct cgacgcgtgc ggtgcacgag acgccgatac ggtgcggtcg ctcgtcgagc 1127641 gagccgccgc ggccggcgtg accgcggtgg tcaccgtcgc cgacgacctg gagtccgcgc 1127701 gctgggtcac ccgcgcggcc gaatgggatc ggcgagtcta tgccgcggtg gcgttgcacc 1127761 cgacccgcgc cgatgcgctc accgacgctg cccgtgccga gctcgagcga ttggttgccc 1127821 accccagggt ggtggccgtc ggtgagaccg gaatcgacat gtactggccg ggtcgcctgg 1127881 acgggtgtgc ggagccgcac gtccagcggg aggcctttgc ctggcatatc gatctggcca 1127941 agcggaccgg taaaccgctg atgatccaca atcgtcaggc cgaccgcgac gtgctggacg 1128001 tgctgcgggc cgagggcgcg ccggacaccg tgatcttgca ctgcttctcg tcggacgcgg 1128061 cgatggcccg cacgtgtgtg gacgccgggt ggctgctcag cctgtccggg acggtgagct 1128121 tccgtaacgc ccgtgaacta cgggaagccg tcccgctgat gccggtggag cagcttttgg 1128181 tggaaaccga tgcaccgtat ttgaccccgc atccccaccg gggcttggcg aacgaaccgt 1128241 actgcctgcc ctataccgtg cgggcgctgg ctgaactggt caatcggcgc cccgaagagg 1128301 tggcgctcat caccacaagc aacgctcgcc gagcttatgg gctagggtgg atgcgccaat 1128361 gagcgcgccg agcggcccat aacacccgcg cgccggagtt gctcaacatt ggccggttcg 1128421 ttaccgtctt gtgatcgaac gggtggggcc tctaggtttc ggagggccca ttttgctttt 1128481 tgttcgctgt gtaggtggtt gagtgttgcc gaggtcgggg atatagcgcg ttgactctac 1128541 ttaccaaact tcatcagacc caatcaccga tgttgcgcct ggtagtcggt gcgctgctgc 1128601 tggtgttggc gttcgccggt ggctatgcgg tcgccgcatg caaaacggtg acgttgaccg 1128661 tcgacggaac cgcgatgcgg gtgaccacga tgaaatcgcg ggtgatcgac atcgtcgaag 1128721 agaacgggtt ctcagtcgac gaccgcgacg acctgtatcc cgcggccggc gtgcaggtcc 1128781 atgacgccga caccatcgtg ctgcggcgta gccgtccgct gcagatctcg ctggatggtc 1128841 acgacgctaa gcaggtgtgg acgaccgcgt cgacggtgga cgaggcgctg gcccaactcg 1128901 cgatgaccga cacggcgccg gccgcggctt ctcgcgccag ccgcgtcccg ctgtccggga 1128961 tggcgctacc ggtcgtcagc gccaagacgg tgcagctcaa cgacggcggg ttggtgcgca 1129021 cggtgcactt gccggccccc aatgtcgcgg ggctgctgag tgcggccggc gtgccgctgt 1129081 tgcaaagcga ccacgtggtg cccgccgcga cggccccgat cgtcgaaggc atgcagatcc 1129141 aggtgacccg caatcggatc aagaaggtca ccgagcggct gccgctgccg ccgaacgcgc 1129201 gtcgtgtcga ggacccggag atgaacatga gccgggaggt cgtcgaagac ccgggggttc 1129261 cggggaccca ggatgtgacg ttcgcggtag ctgaggtcaa cggcgtcgag accggccgtt 1129321 tgcccgtcgc caacgtcgtg gtgaccccgg cccacgaagc cgtggtgcgg gtgggcacca 1129381 agcccggtac cgaggtgccc ccggtgatcg acgaaagcat ctgggacgcg atcgccggct 1129441 gtgaggccgg tggcaactgg gcgatcaaca ccggcaacgg gtattacggt ggtgtgcagt 1129501 ttgaccaggg cacctgggag gccaacggcg ggctgcggta tgcaccccgc gctgacctcg 1129561 ccacccgcga agagcagatc gccgttgccg aggtgacccg actgcgtcaa ggttggggcg 1129621 cctggccggt atgtgctgta cgagcgggtg cgcgctgacc atccggctgc tcgggcgcac 1129681 tgagatcagg cggctggcca aagagctcga ctttcggccg cgcaaatctc tcggacagaa 1129741 cttcgtgcac gacgccaaca cggtgcgacg ggtggttgcc gcctccgggg tcagccgttc 1129801 cgacctggtt ttggaggtcg ggccgggcct gggatcgctg accctggcac tgctcgaccg 1129861 cggcgcgacc gtcaccgcgg tcgagatcga tccactactg gcttctcggc tgcaacagac 1129921 cgtggcggag cactcgcaca gcgaggttca ccgactaacg gtggtcaatc gcgacgtcct 1129981 ggccctgcgc cgggaggatc tagccgcggc gccgaccgcg gtggttgcca atctgccgta 1130041 caacgtagcg gtaccggcgt tgttgcatct gcttgtcgag ttcccgtcga tccgtgtcgt 1130101 gacggtgatg gtgcaggccg aggtcgccga acggctcgcc gccgagccgg gcagcaaaga 1130161 gtacggcgtg cccagcgtta agctgcgctt cttcgggcgg gttcgccgct gcggcatggt 1130221 gtcgccgacc gttttctggc ccattccgcg tgtctattcc gggctggtac gcatcgatcg 1130281 atatgagacc tcgccctggc ccaccgacga cgcttttcga cggcgggtat tcgaactcgt 1130341 ggacatcgca ttcgcgcagc ggcgcaagac ttctcgcaac gcgtttgtgc agtgggcggg 1130401 ctcgggaagc gagtcggcga atcgattgtt ggcggccagc atcgaccccg cccgtcgcgg 1130461 tgagacgctg tccatcgacg acttcgtgcg gctgctgcga cggtccggcg gctccgacga 1130521 ggccaccagc accggccggg acgccagggc gccggacatt tcggggcacg cgtcggcgag 1130581 ctgacggggc gccgccgcgt gtggtcggcg cgtcacagcg atagtctgct gcggtgtccg 1130641 catctgacgg caacaccgct gaattgtggg tgcccaccgg gtcggtcacc gttcgggtgc 1130701 ccggaaaggt caacctctat ctggcggtcg gcgatcgccg cgaggacggc tatcacgagc 1130761 tgaccacggt atttcatgcc gtctcgctgg tcgacgaggt aaccgttcgt aacgctgatg 1130821 tgctctcgct cgagttggtc ggcgaggggg ccgaccagct gccgaccgac gaacgcaatc 1130881 tcgcctggca ggcggccgag ctgatggccg aacacgtggg ccgggcgccg gacgtctcga 1130941 tcatgatcga caaatccatt ccggtcgccg gcggcatggc cggtggcagc gcggacgctg 1131001 cggcggtcct ggttgcgatg aactcgttgt gggaactcaa tgtgccccgc cgcgacctgc 1131061 gcatgctcgc cgcgcggcta ggcagcgatg tgccgtttgc cctgcatggt ggtaccgcgc 1131121 tggggacggg tcgcggcgag gagttggcca ccgtgttatc ccgcaacacc ttccactggg 1131181 tcctggcgtt cgccgacagc gggttgctca cctccgcggt gtacaacgag ctcgaccggc 1131241 tcagggaggt gggggatccg ccccggcttg gtgagcccgg gccggttctg gctgccttag 1131301 ctgcgggtga tccggatcag ctggcgccgt tgctgggtaa tgaaatgcaa gcggccgcgg 1131361 tgagcctgga cccggcgctg gctcgtgcgt tacgcgccgg tgtggaggcc ggcgcgctcg 1131421 caggcatcgt gtccggttcg ggtcccacgt gtgccttcct gtgcacctcg gcgagctcgg 1131481 cgatcgatgt cggcgcgcag ctgtcggggg cgggagtttg tcgcaccgtt cgagtcgcca 1131541 ccgggccggt acccggcgcc cgcgtggtgt ctgcgccgac cgaagtgtga ccgaattctt 1131601 gggagcatgc ctcgggcggc caggggtatc cgcgcgtgcc gaggccggtg ggtcgatcgg 1131661 ctggcgcacc agcatgccag cggtagggcc gcaggcatcc gccctcgcgg aggtcggtgg 1131721 cgcgcatcaa agccaggcgc aaaagccata ccatgatgcg acagagccgc tcggcgagaa 1131781 cctccgctac cggccagctc acggcgatag ctgcatcaac ggccatcgag acaacccgtc 1131841 ggcacgggaa tcctcgcagt tcaccgcggg gagtacggca aaggctgtga ccaagctgtg 1131901 acatcgccct caaacctcgg cagagtttgg cagctactta agagttgctt aagataatcc 1131961 gcggtgttgg gtcgtgggct catcaccgaa ccgagaccca accgctcccc aactgtgtgc 1132021 gcgcgcctgt cgcgatgtgg catccggtag gcggaccatg aaaacccgga ccttggggac 1132081 agcaccggaa ccgaggaggt tgccttgagc aggttcaccg agaagatgtt ccacaatgcc 1132141 cgcaccgcga cgacgggcat ggtcacaggt gaaccgcaca tgcccgtccg ccacacctgg 1132201 ggcgaggtcc atgagcgtgc tcgttgcatc gcgggcggcc tggccgccgc gggtgtcggt 1132261 cttggtgacg ttgttggggt gctggccggc ttcccggtgg agatcgcccc cacggcgcag 1132321 gccctgtgga tgcgcggggc cagcctgacc atgctgcacc agcccacacc gcgcaccgac 1132381 ttggccgtgt gggccgagga caccatgacc gtcatcggca tgatcgaggc caaggccgtg 1132441 atcgtctccg agcccttcct cgtggccatt cccatccttg agcagaaagg catgcaggtc 1132501 cttaccgtcg ctgacctttt ggcgtcggat ccgatcggcc ccatcgaggt cggcgaggac 1132561 gacctggcgt tgatgcagct gacgtccgga tctaccggct cccctaaagc cgtccagatc 1132621 acccaccgca acatctactc caacgccgag gcaatgttcg tcggcgccca gtatgacgtc 1132681 gacaaggacg tcatggtcag ctggttgccc tgcttccatg acatgggcat ggtgggcttc 1132741 ttgactatcc cgatgttctt cggtgcggag ctggtcaagg tcacgccaat ggacttcctg 1132801 cgcgacacgc tgctgtgggc gaagctcatc gacaagtacc agggcaccat gaccgcggcg 1132861 cccaacttcg cctacgcgct gctcgccaag cggttgcggc gccaggccaa gcccggcgac 1132921 ttcgatctgt cgaccctacg cttcgcgctg tccggcgccg agcccgtcga acccgccgac 1132981 gtcgaggacc tgctcgacgc gggcaagccg ttcggcctga ggccctcagc gatcctgccg 1133041 gcctacggca tggccgagac cacgctggcg gtgtccttct cggagtgcaa cgccggcctc 1133101 gtcgtggacg aggttgacgc cgacctgctg gcggctctgc gccgggccgt tcccgccacc 1133161 aaaggcaata cccgcaggct ggccacgcta ggtccgctgc tgcaggacct agaggcccgc 1133221 atcatcgacg aacagggcga tgtcatgccc gcccgcggcg tgggtgtcat cgagctgcgc 1133281 ggcgagtcgc taactcccgg ctacctgact atgggtggct tcatcccggc ccaagacgag 1133341 catggctggt acgacacggg cgacctcggc tacctcaccg aggagggcca cgtggtggta 1133401 tgtggccgcg tcaaggatgt catcatcatg gccgggcgca atatttaccc gaccgacatc 1133461 gagcgggcgg ccggccgcgt cgacggcgtt cgtccgggtt gcgcggtggc cgtgcgtctc 1133521 gatgccggac attcgcgcga atcctttgcc gtcgcggtcg agtcgaacgc cttcgaggat 1133581 cccgccgagg ttcgtcgcat cgagcatcaa gtggcccacg aggtggttgc cgaggtcgac 1133641 gtgcggcctc gcaacgtcgt ggttcttgga cccgggacca ttccgaagac gccgtcgggc 1133701 aagctgcgtc gggccaactc cgtcaccctg gtcacctaag gccgccgagc agacgcaaaa 1133761 tcccctcgac acgccggttg cgaggggatt ttgcgtctgc tcacgcgggt cgttaccagg 1133821 cgtggacgcg gttttgtgcg ggctccatgc cctgttcgat aagcagctcg gtggcatcgg 1133881 cggcctgctc gcagatcgtg gggacctcgg cgcgctcggc cggggtaaag ttctccaaca 1133941 caaacgccgc cgggtccttg cggccgggcg ggcggccgat cccgatacgc acccgctgaa 1134001 agtctttggt acccagcgcg gccaccaccg agcgcaaccc gttgtggccg ccttcgccgc 1134061 cgccgatctt gagccggatg cggccgaact cgaggtcaag gtcgtcgtgg atgacgatga 1134121 tgttggccgg cgccaccgag tagaacttcg ccagcggccc tatctggcgg ccggactcgt 1134181 tcatgtagca gcgcggcttg gccaaaacca gggagcgccc ggctgatcta ccagtggcga 1134241 cttcggcgcc ggaacgcttg tgtgccttga acttcgcgcc tagtcgcgcg gcgagcagat 1134301 cggcgaccac gaacccgagg ttgtgccggg tacgggcgta attggctcca gggttgccga 1134361 ggccgaccac gagcaacggc tcggccatgt cgcaagccgt ctactcggac tcgccagcgg 1134421 cctcggcttc gccggcttct accgcggctt cctcggcttc ctcggctcct gcgacttcgc 1134481 cctccagctc ctcggcggtt ggcgccttca ccacgttgac caccaacaga tcagggtcag 1134541 aaatcaggct gacaccggcc ggcagcgcga tctgcccggc ggtgagctgg gtgcctggtt 1134601 cggcaccttc gatggacacg gtcaactgct cgggaatcga cagcgcctcg gcctcgatct 1134661 cgatgctgtt ggtctcttgg gtgaccaggg tgtcgggtcc ggcctggccc tcgacgacca 1134721 cgctgacttc gacgacgacc ttctcgccac ggcgcacgac cagtaggtcg gcatgctgga 1134781 tggtgcggcg gatcggatgg atatgaagtg ccttggtcag tgccagctgt tccttaccgg 1134841 cgatgtcgag ggtcaacacc gcgttggtgc cggaatgccg cagtacggcc gcatagtcgt 1134901 gtccgggcag ctccaggtgc tgtggctcgg cgccgtggcc atacagcaca gcgggtatct 1134961 tgccggcgcg ccgggcccgc cgggacgcgc ccttgccggt ctcggtacgc accgtgacgc 1135021 gcagctggtt gcttgcggat ttggccatat gtcgctcctg ggtggctcgg ttacctcgtt 1135081 tgggggcacg gccagggtcg cgacagcttg tcggcctccg tcgataacgg tgttctgccg 1135141 gcctgctgta gaccgccgac caccctcgcc gtgacgcccg gctaggctaa cccatggcta 1135201 ctgcattggg gaaattcgat ccttgtgagc tgctcggata gctgtgcccc aaccgtgcgg 1135261 acaattactt tgccgcgacg acgaatccgg cgatgatcgc ctcgatgtcg gaagcgtgct 1135321 tgacggcctc gttggccaga ctcgtgatgg tgagctgcac caggtagcgc tgcttggccg 1135381 gcgccggttg ggaagacgat ccggttccag gtgtgcagtc gcctgccgtg caggtcataa 1135441 ctgccctgaa tcatcgagga cggaaacccg ttgaagtctg ccgtcgagga gtccaattcg 1135501 gtgaagttcg tcgacagccg ggcatcggca gtgccatgct tgagcgcttc ggcgatatcg 1135561 aagtcccggt gcagcttgaa caccatgagc atggccgttg gatagctttc gcccttggcg 1135621 atcatctccg tgttcggggt gatgttcgga tttttcatcg gtgcccagcc cggtggtgtc 1135681 ggaatcgaca cggtcaggtc ggtcaggctg ctcggtgcca ccggctctcc ggtgacgccg 1135741 acgctttcca gatacttcca cagcgggacc ggcacttccg tcgtggtcga gacggcgctg 1135801 gtggttgggc tcgtggacaa aatcgactgg aagtcaggcg atttcggtcc gcaagcgacc 1135861 gctgacattg ccagcgtggc taccgcgacc gcgaccgcca agggtctcac agaatcttgc 1135921 ggacagcgtc gaccggccaa gcccgccgga tgccctcaag gatgacggct gccatctatg 1135981 cgtccccgtc gaaaagtcct gttactgagc cgttttcgaa gaccgcccgg attgtgctgg 1136041 ccagcagcgg cgcgatggac aaaacggtga gctgggggaa gcgcttgtct tcgccgatcg 1136101 ggagcgtgtt cgtgacgatc acttcgcggg cgccgcagga ggccagccgc tgcgcagcgg 1136161 ggtcggagag cacgccgtgg gttgccgcga tgatcacgtc accggcgccg tcgttgtgca 1136221 gcaatgccac cgcgccggcg atggtgccgc cggtgtcgat catgtcgtca atcaggacac 1136281 aggtgcgccc ggccacgtcg ccgacgacgc ggttggacac cacttggttg ggtacccgcg 1136341 gatcacgggt cttgtggatg aaggcgaggg gaacaccacc taatgcgtcg gcccacttct 1136401 cggcgatgcg tacccggccg gagtcagggg agacgaccac catgttgccg tccgggtagt 1136461 tgtctctgat gtaaccggtc agcaggttct gaccgcgcat atgatcgacc ggcccgtcga 1136521 agaaaccctg gatctggtcg gtgtgcaggt cgaccgtcac gatccggtcg gcgcccgcgg 1136581 tcttgagcag gtcggcgatc agtcgcgcgg agatcggttc gcggccacgg tgtttcttgt 1136641 cttgccgggc atacggatag aacggcatga cggcggtgat ccgtttggcg ctgccccgtt 1136701 tgagcgcgtc gatcatgatc agctgttcca tcagccacct gttcaccggt gccgggcagg 1136761 attgcaggac gaaggcgtcg caaccgcgta ccgattcgtg gaagcgcacg aagatctcgc 1136821 cgttggcgaa ctcccgcgcg tcctgagagg tgacgtggac gtcgagctct ttggctacct 1136881 gctcggccag ctccggatgg gcgcggccgg caaagagcat caggtttttg cgattatcgg 1136941 tccagtcgtg gctcaacgcg ctgccctcgc cgtttgggat cgaattggat tacccatggt 1137001 acgtagcgca ccgcccggat ttgtcgccgg gtagccggga tgcgacttca cggtgtctga 1137061 tcagcgtcgg gtggttgtgt gggctgttgg caggccattt ctgaggctct ttttgaggcc 1137121 tgagccgctg ggctgccggg gcgtttgcgc tgcacccagt tctcgatgtt gcgttgcgga 1137181 cccgccgaca ctgccagcgc ccccggcggg acatcctccc gcaccactgt gccggccccg 1137241 gtatacgcgc cgtcgccgat ggttactggg gccacgaaca tggtgtcgga cccggtccgt 1137301 acgtgcgaac cgacggtggt gcgccgtttg gacgtaccgt cgtagttgac gaacacgctg 1137361 gaggcgccga tgttgctgta ctcgccgatg tcggcgtcgc cgacgtaggt caggtgcggc 1137421 accttggtgc cggtgccgat ggtggagttc ttgacctcga cgaacgcgcc cagcttgccg 1137481 tcggcgccca acgcggttcc gggccgcagg taggtgaagg gcccgaccgc ggcgccatcc 1137541 ccaatcgacg acgacgaacc gtgggtgcgc accaccgagg caccgtcgcc gacggcgacg 1137601 tcggtcaggg tggtgtcggg accgacgaca cagcgaccgc cgatctgggt gcggcccagc 1137661 aactgggtac ccgggtgaat gacggtgtcg cggccgatgg tgacgtcgac gtcgatccag 1137721 gtggtagccg ggtcgacgac ggtgacgccg gccagctggt gagcggccac cacccgccgg 1137781 ttgagttcgg aggccagctg ggccagctgg acgcgattgt tgacgccggc caccaacgcg 1137841 ctgtcgtcga cgtggctggc atgtacggtc tggccgtcgg agcgcaagat ggcgatgacg 1137901 tcggtgaggt agagctcctg ttgggcgttg ttggagctca gccggctcag tgcggaccgc 1137961 agcgcggcga tgtcgaaggc gtagacgccg gcgttgactt cgcggatttc ccgctgcgat 1138021 ggtgtcgcgt cggtttgctc cacgatcgcc atgactccgt gatcctgggt gcgcaggatg 1138081 cggccgtagc cgaagggatc atccagcgtc gtggtcagca ccgtcaccgc agccgacacc 1138141 gcgcggtggg tggcgatcaa gtcggccagc gtgtcggcgt ccagcagcgg ggtatctccc 1138201 gaggtgacca cgacgttgcc ggcgtagtca tcgggcagcg cggacagccc gcagagtacc 1138261 gcatgcccgg tccctagcgg tcgatcctgc agggcgacgt cgatcgttcg gcctagggtg 1138321 tcggcgagtt caccgactag cggcgcgatg cgctggtgat cgtgtcccag caccacgatt 1138381 agacgctgcg gcgccagctt ggcgatcgca tgcagtacat gcgacagcat gctgcgaccg 1138441 gcgagtgtgt gcagcacctt gggggtgtcc gaacgcatcc gggtcccggg cccggccgct 1138501 aggaccagga ccgcggtgtc accaggaaac gtcatcaacc ctccttgaag ctccgtcgcc 1138561 aggactcgaa cctgaactat ctgaaccaaa atcagaggtg ctgccgatta caccacgacg 1138621 gattgcacat cgatgtgact ttagacggtg tcaacgccgt cagcacagtc aacgctgtcg 1138681 ccgtctaccc accggcccca cgcaaaccga tacccttgtt gatgtggccg gaccggataa 1138741 agggccggat aaggcgccgg aaaacccgac gcgggtgacg cgcgccagga tgacggggac 1138801 cgagcgccgt caccagctca tcggcatcgc gcgatcgctg tttgccgaac gcggttacga 1138861 cgggacgtcg atcgaagaga tcgcgcagcg cgccaacgta tccaagccgg tcgtctacga 1138921 acatttcggt ggcaaggagg gcctgtacgc ggtggtggtc gatcgggaga tgtcggcgct 1138981 gctggacgga atcacctcgt cgctgaccaa caaccgatcc cgggtgcggg tggagcgggt 1139041 cgcgctggcg ttgctgacct acgtcgagga acgcaccgac ggcttccgca tcatgattcg 1139101 cgactcgccg gcctcgatca gctcgggcac ctattccagc ctgctcaacg acgccgtcag 1139161 ccaggtcagc tcgattctgg ctggagactt cgcccggcgc ggcctggacc cggacctggc 1139221 accgctgtat gcgcaagcat tggtgggttc ggtgtcgatg acggcgcaat ggtggctcga 1139281 tgcgcgcgaa ccgaagaagg aagtggtggc cgcgcacctg gtcaacctgg tctggaatgg 1139341 cctgacccac ctggaggccg atccgcggct acaggacgag tagcgggcgg ggaagccggg 1139401 cccaatgttg actaacctcg gcgccctaga atggccgcat catgaccgca ccggggcctg 1139461 cctgctcaga taccccgatc gcggggcttg tcgaattggc gctgagcgcg ccgacattcc 1139521 aacagctcat gcagcgcgcc gggggtcgac ccgacgaatt gacgctcatc gcgccggcca 1139581 gcgcgcggct gttggtcgcc agtgcgctgg ctcggcaggg gccattgctg gtggtcaccg 1139641 ccaccgggcg ggaagccgac gacctggccg ccgaactgcg tggtgtgttc ggggatgcgg 1139701 tggcgttgtt gccgtcctgg gagacactgc cgcacgaacg gctctcaccc ggtgttgaca 1139761 ccgtcggcac tcgcctgatg gcgctgcgcc ggctggccca ccccgacgat gcccagctgg 1139821 gcccaccgct gggggtagtg gtgacctcgg tgcgctcgct gctgcagccc atgacgccgc 1139881 agctgggcat gatggagccc ctcacgctga ccgttggcga cgaatccccc ttcgacggcg 1139941 tggtggcgcg gctggtcgag ctggcatata cccgggtgga tatggtcggc cggcgcggcg 1140001 agttcgctgt gcgcggcggg attctggaca tctttgcccc gacggccgaa catccggtgc 1140061 gggtcgagtt ctggggcgac gagatcaccg agatgcggat gttctcggta gccgaccagc 1140121 gctcgattcc ggagatcgac attcacacac tggttgcctt cgcctgccgt gaactgctgc 1140181 tgagcgagga cgtgcgggcg cgggccgccc aactggccgc acggcatccc gcggccgaga 1140241 gcaccgtcac cggcagtgct tccgacatgc tggcgaagct cgccgagggc atcgcggtcg 1140301 acggcatgga ggcggtgttg ccggtgctct ggtccgacgg gcacgcgttg ctgaccgatc 1140361 agctgcccga cggcacgccg gtgttggtgt gcgacccgga aaaggtgcgc acccgcgccg 1140421 cggatctgat caggactggc cgtgaattcc tggaagcctc gtggtcggtc gcggcgctgg 1140481 gaactgcaga aaatcaagcc cccgtcgacg tcgaacaact gggtgggtcg gggttcgtcg 1140541 aactggacca ggtgcgggcc gcggcggccc gaacgggtca tccgtggtgg acgttgagcc 1140601 aattgtccga cgagtcggcg atcgagttgg acgttcgggc cgcgccgtcg gcgcgcgggc 1140661 accagcgtga catcgacgaa atcttcgcga tgctacgtgc ccacatcgcg accggcgggt 1140721 acgccgcgct ggtcgcgccg ggcaccggaa ccgcacaccg cgtggtggaa cggctgtccg 1140781 agtccgacac ccccgcgggg atgctcgatc ccggccaggc gcccaagccg ggagtcgtcg 1140841 gggtgctcca gggcccgctg cgtgacggcg tcatcattcc cggcgccaac ctggtcgtca 1140901 tcaccgagac cgatttgacc ggcagccggg tcagcgccgc cgagggcaag cggctggcgg 1140961 ccaagcggcg caacatcgtc gacccgctgg cgctgacggc cggtgacctg gtggtgcacg 1141021 atcagcacgg catcggccgg ttcgtggaga tggtcgagcg cacggtcggg ggcgcccgcc 1141081 gggagtatct ggtgctggag tatgcctcgg ccaagagggg tggcggggcg aaaaatactg 1141141 acaagctcta tgtcccgatg gattcgctgg accagctgtc gcggtatgtc ggcgggcagg 1141201 cgccggcgct gagccggctg ggcggcagcg actgggccaa caccaagacc aaggcgcgcc 1141261 gcgcggtgcg cgagatcgcg ggcgagctgg tctcgctgta cgccaaacgg caggccagcc 1141321 ccgggcatgc gttctcgccg gacacgccgt ggcaggccga gctggaggac gcgttcggct 1141381 tcaccgagac cgtggaccag ctcaccgcca tcgaagaggt caaggcggac atggaaaagc 1141441 cgatcccgat ggaccgggtg atctgcggcg atgtcggcta cggcaagacc gagatcgcgg 1141501 tgcgggcggc gttcaaggcg gtccaagacg gtaaacaggt cgcggtgctg gtgcccacca 1141561 cgctgctggc cgaccagcat ctgcagacgt tcggcgagcg aatgtccgga ttcccggtga 1141621 ccatcaaggg tctgtcgcgg ttcaccgacg ccgccgagtc ccgcgccgtg atcgacggcc 1141681 tggccgacgg gtcggtggac atcgtgatcg gcacccatcg gctgctgcag accggggtgc 1141741 gctggaagga tctgggcctg gtggtggtcg acgaggagca gcggttcggc gtcgagcaca 1141801 aggagcacat caagtcactg cgcacccatg tcgacgtgct gaccatgagc gccaccccga 1141861 tcccgcgcac gttggagatg agcctggccg ggattcgcga gatgtcgacc atcctgacgc 1141921 cgcccgagga gcgctacccg gtgctgacct acgtcggacc gcacgacgac aagcagatcg 1141981 ccgcggcgct gcgccgggag ctgctgcgcg acgggcaggc gttctacgtg cacaaccggg 1142041 tcagctcgat cgacgcggcc gccgcccggg tgcgtgagct ggtgcccgag gcgcgggtgg 1142101 tggtcgcgca cgggcagatg cccgaggacc tgttggagac caccgtgcaa cggttctgga 1142161 accgcgagca tgacatcctg gtttgcacca ccatcgtgga gaccggcctg gacatctcca 1142221 acgccaacac tttgatcgtc gagcgcgccg ataccttcgg gctgtcccag ctgcaccagc 1142281 tgcgtggccg ggtgggccgc agccgggagc gcggctacgc ctatttcctc tatccaccgc 1142341 aggtgccgct gaccgagacc gcttacgacc ggttggcgac gatcgcgcag aacaatgagc 1142401 tgggcgcggg catggccgtg gcgttgaagg acctagagat ccgcggtgcc ggcaacgtgc 1142461 tcggcatcga gcagtccgga cacgtcgccg gcgtcggatt cgacctgtac gtgcggttgg 1142521 tcggcgaggc cctggagacg taccgggacg cgtaccgggc ggccgccgac ggccaaaccg 1142581 tgaggaccgc cgaagaaccc aaggatgtgc gaatcgacct gcccgttgac gcgcacctgc 1142641 caccggacta catcgccagt gatcggctgc ggctggaggg ctaccggcgg ctggcggccg 1142701 cctcctctga tcgcgaagtg gcggccgttg tggacgagct aaccgatcgg tatggggccc 1142761 tgccggagcc ggcccggcgg ctggcggcgg tggcacggct gcggctgctg tgccgtggct 1142821 ccggcatcac cgacgtgacg gcggcgtcgg cagcgaccgt gcggctgtcc ccgttgacgc 1142881 tgccggactc cgcccaggtg cggctgaagc gaatgtatcc cggagcgcac taccgtgcca 1142941 cgacggccac cgtgcaggtt cccattccgc gagccggtgg cctcggcgcg ccgcgaatcc 1143001 gcgacgtcga gctggttcag atggtggccg atttgataac cgcgctcgct gggaaaccgc 1143061 gccagcatat tggtataacg aaccctagcc cgccaggcga agacggccgt ggtcgcaaca 1143121 cgacgattaa ggagcgacaa ccgtgatgat tgtcgtcctg gtcgaccccc ggcgtccgac 1143181 actggtgcct gttgaagcga tcgagttcct gcgcggcgag gtgcaataca ccgaggaaat 1143241 gccggtcgcg gtgccctggt cgctaccagc ggctcgttcg gcgcacgccg gaaacgacgc 1143301 gccggtgttg ctgtcgtctg accccaacca tcctgctgtc attactcgac tggccgccgg 1143361 tgcccggctg atctcggcac cggattctca gcgtggcgaa cgactcgtcg acgccgtcgc 1143421 gatgatggac aagctgcgca ccgccggacc gtgggaaagt gagcagactc acgactcgct 1143481 gcgcagatac ctgctggagg agacctacga gctgttggac gcggtccgca gcggcagtgt 1143541 tgaccagctg cgcgaagagc ttggtgatct cttgctgcag gtcctctttc acgcccggat 1143601 cgctgaggat gcgtcgcaat cgccgttcac catcgacgac gtcgccgaca cactgatgcg 1143661 aaagctcggc aatcgggcgc caggagtact tgcgggcgaa tcgatttcgc tcgaagatca 1143721 actggcgcaa tgggaggcag ccaaggcctc ggaaaaggcg cgaaagtcgg tagccgacga 1143781 tgtccatacg ggccagccgg cattagcgct ggcgcagaag gttattcagc gtgcccaaaa 1143841 ggctgggctg cccgctcacc tgatccccga tgagatcact tctgtttcgg tttcagctga 1143901 cgtagatgcg gaaaacacgc tgcgcactgc cgttttggac tttattgaca ggctgcgctg 1143961 tgccgagcgg gcaattgccg tcgcacgccg gggcagcaac gttgccgagc agctcgatgt 1144021 gacgccgctg ggtgtgatca ccgagcagga gtggctcgcg cattggccaa ctgctgtcaa 1144081 cgattcccgc ggcgggtcca agaaacgtaa aggcatgcga taaccgcccc gagtgcgacg 1144141 gggtagtcaa caaacccatg ggacgatgat cgtgacggaa gccggtatag gtgccctacg 1144201 agggagagtt gtgtcgccga gacgctggtt gcgggcggtc gccgtgatag gggcgaccgc 1144261 gatgctgttg gcgtcgagct gcacttggca gctgagcctt ttcatccccg acggcgtgcc 1144321 gcctccgccc ggcgatccgg tgccgccggt ggatacgcac gccggcggcc ggcccgcgga 1144381 tcagttgcgc gaatgggcgg agaaacgtgc tgcggcattg ggaattccgg tcatcgcgct 1144441 ggaggcctac gcctacgccg ctcgcgtcgc cgaggtcgag aatcccaagt gtcatcttgc 1144501 gtggaccacg ctggcgggca tcgggcgggt ggagagtcac cacggaacct accggggcgc 1144561 cacgattgcg cccaatgggg atgtaagccc cccgattcgg ggcgtccgcc tcgacggcac 1144621 cggcggcacc ctgcgcatcg tggacaggga cgggggcggc ctggacggtg acgccgcggt 1144681 ggagcgtgcg atggggccaa tgcagttcat ttcggaaacc tggcggttgt acggggtcgc 1144741 tgccagaaac gacggcatcg ccaacgtcga caacatcgat gatgctgccc tctcggcagc 1144801 gggctattta tgctggcgtg gaaaggatct cgcgacaccg cgagggtgga taaccgcgct 1144861 gagggcctac aacaactccg ttatctatgc gcgggcggtc cgggactggg cgaccgcgta 1144921 tgcggcgggt catccgctgt agcaggatga accgctaacc caggctttac gctaacagcg 1144981 gtcggggcca gccaacccaa gaccgtccgt gcagcagcta cgacgcaagg agaacccagt 1145041 gccgattatc gagcaggttg gggcccgaga gatcctcgat tcccgcggca acccgacggt 1145101 ggaggtcgag gtggcgctta tcgacgggac attcgcccgg gccgcggtgc cgtcgggcgc 1145161 ctcgaccggg gagcacgagg ccgtcgagtt gcgcgacggc ggcgatcgct acggcggcaa 1145221 aggcgtgcaa aaagccgtgc aggctgttct tgatgagatc ggcccggccg tcatcggact 1145281 caacgccgac gaccagcgat tggtcgacca ggcgctggtg gacctagacg gcacccccga 1145341 caagtcccgg ctgggcggca acgcgatctt gggtgtctcg ctcgctgttg ccaaggcggc 1145401 ggcggattcg gcggagctgc cgttgttccg ttatgtcggg gggccaaacg cgcacattct 1145461 gccggtaccg atgatgaaca tcctcaacgg cggcgcacac gccgataccg ctgtcgacat 1145521 tcaagagttc atggtggcgc caattggcgc gcccagcttc gtcgaggcgt tgcgctgggg 1145581 cgctgaggtg taccacgcgc tcaagtcggt cctgaaaaag gaggggctgt ccaccggcct 1145641 gggcgacgaa ggcggcttcg ccccggatgt ggccggcacc accgcggcgt tggacctgat 1145701 cagccgggcc atcgagtcgg cgggcttgcg acccggcgcc gacgtggcgc tggccctgga 1145761 cgcggcggcc accgagttct tcaccgacgg caccggctac gtcttcgagg gcaccacccg 1145821 taccgcagac cagatgaccg agttctacgc gggcctgctc ggcgcctacc cgctggtgtc 1145881 gatcgaagac ccactgtccg aagacgattg ggacggctgg gccgcgctga cggcctcgat 1145941 cggtgaccgg gtgcaaatcg tcggcgacga catctttgtc accaatcccg agcggctcga 1146001 ggagggcatc gaacggggcg tggcaaatgc gttgctggtc aaggtgaacc agatcgggac 1146061 gttgaccgag acactcgacg cggtcacgct ggctcaccac ggcggatacc gcacgatgat 1146121 cagtcaccgc agtggcgaga cggaggacac catgatcgcc gacctcgcgg tggccatcgg 1146181 cagcgggcag atcaagacgg gcgcgcctgc tcgcagtgag cgcgtcgcaa aatacaacca 1146241 gctgctgcgg atcgaagagg cgcttggcga cgcggcccgc tacgcgggcg acctggcatt 1146301 tcctcggttc gcgtgcgaga cgaaataggt acatgcccga agcgaaacgg cccgaatcga 1146361 agcgccggtc gccggcatcg cgcccgggga aggccggcga ctcggttcgg ggcggtcgcg 1146421 ccaccaagcc ttccgcaaaa ccctccacgc ccgcaccgca cgccagccgc aagaccactc 1146481 gcacgccgca tgagcacatt gtcgaaccca tcaaacgggc gatcaccgaa tcggtcgaga 1146541 agcgctccga acagcggctg gggttcaccg cgcggcgcgc agcgatcctc gccgcggttg 1146601 tatgcgtgct gacgctgacc attgcgaggc cggtacgcac ctacttcgcg cagcgcgccg 1146661 agatggaaca actggctgcg accgaggcca tgttgcgccg ccagatcgct gacctggagg 1146721 aacagcaggt taagctcgcc gatccggcgt atattgcggc tcaggcccgc gaacggctcg 1146781 gctttgtgat gcctggagac atcccgtttc aggtccagct tccgtcgacg ccgttggcgc 1146841 cgccgcaacc ggggtcagac gcggctactg cgaccaacaa cgaaccctgg tacaccgcgc 1146901 tgtggcacac gatcgccgac gacccgcacc tgccgcctgc cgcgccaccg gcaccggagc 1146961 ccggacgtcc gggcccgctg ccgccggcct cgccaaaccc cgagcagccc ggtggttgat 1147021 cgtgccgatc tggaggtggt cacgcggcaa ctcggccgtg caccccgggg tgtgctcgcg 1147081 atcgcctatc gttgccccaa cggtgaaccc ggcgtcgtga aaactgcgcc gagactgccc 1147141 gacggcacgc cgtttccgac cctgtactac ctgacgcatc cggtgctcac ggcggcggcc 1147201 agcaggttgg agaccacggg actcatgcgc gagatgaacc ggcggctggg ccaggatgcg 1147261 gagttggccg ccgcctatcg acgggcacac gagtcgtatc tgtccgagcg tgacgctctc 1147321 gagccgctcg ggacaacggt ctccgcgggg ggcatgcccg accgggtcaa gtgcctgcat 1147381 gtgctgatcg cgcattcgct ggccaagggc ccggggttga acccattcgg tgacgaggcg 1147441 ctggcgttac tggccgccga gccacggacg gccgcgaccc tggtggctgg gcagtggcgc 1147501 taacccgggt cgccgcgatc gactgcggta ccaactcgat tcgcttgctg atcgccgacg 1147561 tgggagccgg gttggcgcgc ggagagctgc acgatgtgca tcgtgagacc cggatagtgc 1147621 gcctgggcca gggagtcgac gccaccggtc ggttcgcgcc ggaggcgatt gcgcggaccc 1147681 ggaccgccct gaccgactac gccgaactgc tgacgtttca ccatgccgag cgggtgcgga 1147741 tggtcgccac gtcggccgcc cgcgatgtgg tcaatcgcga cgttttcttt gcgatgacgg 1147801 ccgacgtgtt gggcgccgcg ctgcccggct cggccgcgga ggtgattacc ggcgccgagg 1147861 aggccgagct ctccttccgt ggagcggtgg gcgaattagg cagcgccggt gcgcctttcg 1147921 tcgtcgtgga cctcggtggc ggttccaccg agatcgtgct gggcgagcac gaagtggttg 1147981 ccagctactc ggcggacatc ggatgcgtcc ggctgaccga acgctgtttg cactccgacc 1148041 cgccgacgtt gcaggaggtg tccacggccc gccggctggt tcgcgagcgg ctcgagcccg 1148101 cactgcgcac cgtgccgctg gagctggccc ggacctgggt cgggctggct ggaacgatga 1148161 ccacactgtc cgcgctggcg cagtccatga cggcgtatga cgctgcggcc attcatcttt 1148221 cgcgggtgcc cggtgctgat ctgctcgagg tttgccagcg gctgatcggc atgactcgca 1148281 agcagcgggc cgcgctggcg ccgatgcacc cgggccgggc cgacgtgatc ggcggtggcg 1148341 cgatcgtggt cgaagagttg gcgcgcgagc tgcgcgagcg ggccggcatc gaccagctga 1148401 ccgtcagcga acacgacatc ttggacggca tcgcgttgtc actggccgga taagtcacat 1148461 ctgccacacg cgtatctgcg cggggggaca ctcttctgcc cgcctcgtag cgacaacctt 1148521 ggccgatgtc agacccgcat gggaatgttc ggccatgacc agacaactgc atggaattga 1148581 gcttcgatac gtgctcaccc tgcacctggc cgtccatgga ccggcggcca ttaccgaaat 1148641 gatctaaggc ctgggctggc acggctttgg agtccggggc agggcatcca aggtggtgtc 1148701 ggaggcactg cgctgggaaa tcggacgggg ccgggtatac cggctcgggc gcggacgcta 1148761 cgggccgggg tacatcccgc gctccaccga ataccggatt caccaacgcg tgttggcgtt 1148821 gcgggcatcc gccaacgtgt cgctgcgagg cgggcaaagt gtacatccgc tcccagcgga 1148881 aacgcctgtg gcagatgtga tttaggcttc gaagcggtag cccatccctg attcggtcag 1148941 cagatgtttg gggtgcgacg ggtcatcctc caatttgcgc cgcagctgcg ccagatacac 1149001 ccgcaggtaa tgggtttcag tcgcatatgc cggtccccac acttctttga gaagctcccc 1149061 gcggccgacc aacttgccgc ggttgcgggc cagcatttcc agcatgcccc actcggtcgg 1149121 cgtgagatgc acttcggcac cgtctttgat gaccttcttg ccggccagat cgacggtgaa 1149181 tgaatcggtt tcgatcaccg gctgctccaa ctcggcggcc gcggtgttac gccgtaccgc 1149241 tgcgcgcagc cgagccagaa actcgtccat tccaaacggt ttcgtcacgt aatcgtcggc 1149301 gcccgcatcg agggcctgga ccttgtccga cgaatcggta cgcgccgaca acacgatcac 1149361 cggtgccgtc aaccagccac gcagcccgcc gagcacgtcg atacccgaca tgtccggcag 1149421 gccgaggtcg aggatcacca catcgggcgg atgctcagcg gcggcgcgca gcgcacccgc 1149481 acccgtcgag gcggtgatga cctggtagcc acgcacggtc aggttgatac gcagcgcgcg 1149541 caggatctgg ggttcgtcgt caatcaccaa gacgagggtc atgggcggtc ctcgggagcc 1149601 gctagatcga tcaccactgt gagcccgccg cccggggtat cggtagccga aatcgtgccg 1149661 cccatagcct cgacgaagcc gcgtgccacc gacatcccca gaccgacacc ggtggtgttg 1149721 tcgtgatccc ccggccgctg gaacggggca aagagttgct cctcggtccc gcgcgggacc 1149781 cctgggccct cgtcgatgac attaatcagg acccgctcac gcacccgtcc cgcgttgacc 1149841 cggaccacgc agtcgggcgc atatcgcagc gcgttgtcga tcaggttggc tagcacccgc 1149901 tccagcaacc cggcgtcggc catcgccacg gcgtctccca cgtcgacctt gacccggtcg 1149961 atgccggatc ggtaaaaacc ggtggcgccc ttgccgatgc tgaccaaggc ccgttgcacc 1150021 gcttcttcca ggtatgcccg gcgcagctgg gggcgaatca cgccggcagc caaccgcgac 1150081 gaatcgagca ggtttgcgac cagggcggtg agttggtcga tggactcctc gatggtggcc 1150141 aacagctcgg cggtatcctc gggggagaaa gcgacgtctt cggtgcgcaa gctggacacc 1150201 gcaaccttgg ccgccgccag cggggtgcgc aggtcgtggc tgaccgccga cagcagcgac 1150261 cggcgcagct catcggccct agcgatggcc tcggcctggc cggcctcttc cgccagctcg 1150321 cgctgcttca ccagacccgc ggcctgtgtc gcgaccgcgg tcagcactcg gcggtcgcgg 1150381 gcggccaact tgcggcctgc catcagcatc caaaactcgt cgtcgccgac ttcgattgcg 1150441 gtgtcggcgg agtcgacgtc ccgacacggg tttgtcccga cgcacgcgac ggtttcgcct 1150501 gtcgatgcgc cctgccggac acgcagcatg gtcacggccc gttgggaata cgtttcgcgg 1150561 acccgctgca gcagcgtggc aaggtctgcg ccgcgcaaca ccgaaccggc aaacagggcc 1150621 agcaactcag cctcctggga tgcgcgccga gcctcacggg ttcggctagc cgcgccgtcc 1150681 accaacaccg ccaccgcaac ggccatcgcc aacaacacga attcggttac tgcggcgtcc 1150741 ggttcggcga tggtccaggt gtagcggggc tcggtcagaa agtagttcag cagcatgccc 1150801 gacagcaagg ccgacaatgc ggcgggggcg acgccgccca gcaacgccac gatcagcacg 1150861 ccgatgaaga acaacgcgct ctcgccgccg atgcccatga atcggtcgag ccaggccacc 1150921 gtgatggcgc agatcaccga gggcaccacc agcgcggcca gccacgacgc gatatgccgc 1150981 tcgcgcgggg agacccgcga ccacccggag gcccggctgg ccgcgggatg ggtgaccatg 1151041 tgaacgtcga tgccgccgga ctcctggacg gtgcgggcgc cgatcccctc gtcaaacagg 1151101 cgtgcccatc gcgatcgccg cgatgtgccg acgacgagct gcgtggcgtt catctcgcgg 1151161 gcgaagtcca gcagcgcggt gggcacgtcg tcgccgacca cggtgtgcat ggtcgcaccg 1151221 aggcttgtcg ccagctcgcg gaccctgccc agctgcggcg cggacacccc cgccaggtcg 1151281 tcgccacgga taacgtgaac caccatcagc tcggcgctgg acttcgacgc gatccgcgat 1151341 gcccgtcgca ccaacgtctc cgactccggg ccgccggtca cggcgacgac gacgcgttcc 1151401 cgcgcctccc acgtggcggt gatctttttg tctgcgcggt acttctccag ggccgcatca 1151461 acttggtcgg ccagccacag caacgcgatc tcgcgcagcg cggtcagatt gcccgtgcgg 1151521 aagtagttcg acagcgcggc atcgacccgt tcggctgcat agacgttgcc gtgagcaagc 1151581 ctgcgccgca acgcttccgg tgtgatgtcg accagctcga cctgatcggc cgcgcggacg 1151641 atctcgtcgg ggatcttctc cttctgctcg atgccggtga tttgctccac gacatcgttt 1151701 aggccctcca agtgctggat gttgaccgtc gagatcaccg tgatgccggc gtcgaggatt 1151761 tcctgaacgt cctgccagcg cttggggttc ttgctgccag gtgtgttggt gtgggcgagt 1151821 tcgtccacca gcaccacctg aggatgacgt cgcagtactg cctccacatc gagttcggga 1151881 aacctggcac cccgatattc gacgtagcgc ggcgggatca tctcgatgcc ctcgagcagt 1151941 ttcgcggtct tgttgcgtcc gtgtgtctcg acgaccgcgg cgaccacgtc ggtgccgcgc 1152001 tccagcctgc ggtgcgcctc gccgagcatg gcgtaggttt tgcccacgcc gggggccgcg 1152061 cccagataga tccgcagctg cccgcgcttg gtggtcacat gctcaatcat ccaccggtag 1152121 ggcgtaaaga tcgcgcaaag atcggcgaag agcaacgtca cggtcgtgtt cctggggggc 1152181 tcggcaacta ccatcctgct gggctatctg atgcgctgcg atgccggtgc acaagaatcg 1152241 agaggactca catggccgac ttggtgttgg tgctgaccgt gatggccttt gccgggcttt 1152301 gcctgctcta cgtccgtggc tgtgaacgga tcattcgccg cgacgaaatc ggggaaacaa 1152361 cagtcgaact cacgcgagcg ccggccgaat ggcgatgact acggtcgaca acatcgtcgg 1152421 gttggtgatc gcggtggcgc taatggcgtt cctattcgcg gcgctgctgt ttccggagaa 1152481 gttctgatgt ccgggacgag ttggttgcag ttcgcggcgt tgatcgcggt gctgttgctc 1152541 accgcgccag cgctgggcgg ctacctggcc aagatctacg gcgacgaggc caaaaagccc 1152601 ggcgatcggg tgtttgggcc gatcgagcgc gtgatctacc aggtatgccg agtcgatccc 1152661 ggcagcgagc aacggtggag cacctatgcc ctgtccgtgc ttgcgttcag tgttatgtcc 1152721 ttcctgctgc tgtatgggat cgcgcggttt cagggcgtgc tgccgttcaa tccgacggac 1152781 aagccggcgg tgaccgacca tgtcgccttc aacgccgcgg tcagcttcat gaccaatacc 1152841 aactggcagt cctacagcgg cgaagccacg atgagccact tcacccagat gaccgggctg 1152901 gccgtgcaga acttcgtctc cgcgtccgcc ggcatgtgcg tgctggcggc cctgatcaga 1152961 ggtctggccc gcaaacgggc gagcacgctc ggcaacttct gggtagacct cgcccgcacc 1153021 gtgttgcgca tcatgtttcc gctgtcgttc gtggtggcga tcctgttggt cagccagggc 1153081 gtgatccaga acctgcatgg tttcatcgtc gccaacacgc tggagggcgc cccccagctc 1153141 attccaggcg ggccggtggc cagccaggtc gcgatcaagc agctcggcac caacggcggc 1153201 gggttcttca acgtgaactc cgcgcatccg ttcgaaaact acacgccgat aggcaatttc 1153261 gtcgaaaact gggcgatcct gatcatcccg ttcgcgctgt gcttcgcctt cggcaagatg 1153321 gtgcacgacc gtcgtcaagg ctgggcggtg ctggccatca tgggcatcat ttggatcgga 1153381 atgtcagtcg cggcaatgtc attcgaggcc aagggcaacc cgcggctgga tgcgctgggg 1153441 gtgacacagc agacgacggt cgaccagtcc ggcggcaacc tggagggcaa ggaggtgcgc 1153501 tttggcgtcg gtgcgtctgg gttatgggcg gcgtcgacga ccggcacctc caacggctcg 1153561 gtcaactcga tgcacgacag ctacacacca ctgggcggca tggtcccgct ggcgcacatg 1153621 atgctcggcg aagtcagccc gggcggcacc ggcgtcggat tgaacggcct actggtcatg 1153681 gcgatcctgg cggttttcat cgccggcctc atggtaggcc ggacaccgga gtatctcggc 1153741 aagaagatcc aggccaccga gatgaagctg gtgacgctct acatcctggc gatgcccatc 1153801 gccctgctga gtttcgccgc cgcgtcggtg ctgatctcct ccgcgctggc gtcgcggaac 1153861 aaccctgggc cgcatggtct ttcggagatt ctatacgcct acacgtcggg cgcgaacaac 1153921 aacgggtcgg cctttgccgg tctgaccgcg tctacctggt catatgacac cacgatcgga 1153981 gtggcgatgt tgatcggtag gttcttcctg atcattccgg tgctggcgat cgccggctcc 1154041 ctggcacgta aaggcacgac gccggttacc gccgccacct tcccgacgca caagccgctc 1154101 tttgttggcc tggtcattgg ggtcgtactg atcgtcggcg gcctgacgtt cttccccgcc 1154161 ctggcgctgg ggccgatcgt cgagcagtta tcgacccagt gatgatcgca cgcatggaga 1154221 cctccgcaac cgccgcggca gcgacgtcgg caccccggct ccggctggcc aagcgctcgc 1154281 tgttcgatcc gatgattgtg cgctcggcgc tgccccagag cctgcgcaag ctggctccgc 1154341 gggtacaggc ccgtaacccg gtcatgttgg tcgtgctggt cggtgccgtg atcaccacac 1154401 tggcgttcct gcgcgacctc gcatcctcga cagcccaaga gaacgtcttc aacggtctgg 1154461 tcgccgcgtt cctctggttc accgtcctgt ttgccaactt tgccgaggcc atggccgaag 1154521 gacgcggcaa ggctcaggcg gcggcgctgc gcaaagtccg gtccgaaacg atggccaacc 1154581 ggcgcacggc tgcgggcaac atcgaatcgg tcccttcgtc gcggctggac ctcgacgacg 1154641 tggtggaggt ttcggctggc gaaacgatcc cgtcggacgg cgagatcatc gaaggcattg 1154701 cctccgtcga cgagtctgcg atcaccggcg aatcggcacc ggtgatccgc gagtcgggcg 1154761 gcgaccgttc cgcggtgacg ggtggcaccg tggtgctgtc ggatcggatc gtcgtgcgga 1154821 tcaccgccaa gcagggacaa acattcatcg accggatgat cgcgctggtg gagggcgccg 1154881 cacggcagca gacaccgaac gagatcgcgc tgaacatcct gctggctggg ctgacgatca 1154941 tctttttgct cgcggtggtg acgctgcagc cgttcgccat ctattccggc gggggacagc 1155001 gggtggtcgt gctggtggcg ttgctggtgt gtctcattcc gaccacgatc ggtgcgctgc 1155061 tgtccgcgat cggcatcgcg gggatggacc ggctggtgca acacaacgtg ctcgccacat 1155121 ctgggcgggc ggtggaggcg gccggcgacg tgaacacgct gctgctggac aagaccggca 1155181 ccatcaccct cggtaaccgg caggccaccg agttcgtgcc gatcaacggt gtgagtgccg 1155241 aggcggtcgc cgacgccgcc cagctgtcga gcttggccga cgaaactccg gagggccgct 1155301 cgatcgtcgt gctggcgaag gacgagttcg ggctgcgcgc ccgcgacgag ggcgtgatgt 1155361 cacacgccag gttcgtgccg ttcaccgccg aaacccggat gtccggggtc gatctcgccg 1155421 aggttagcgg catccgtcgg atccgcaagg gtgccgcggc tgcggtgatg aagtgggttc 1155481 gcgatcacgg tggccacccc accgaggagg tgggtgccat tgtcgacggc atcagctccg 1155541 gcggggggac acccctagtc gttgcggaat ggaccgataa cagcagcgcg cgggccatcg 1155601 gcgtcgtcca tctgaaggac atcgtcaagg tgggcatacg ggaacgcttc gacgaaatgc 1155661 gccgaatgag catccgcacc gtgatgatca ccggtgacaa cccggcgacc gccaaggcga 1155721 ttgcacagga ggccggcgtc gacgatttct tggccgaggc cacgcccgag gacaagcttg 1155781 cgctcatcaa gcgcgaacag cagggcggtc ggctggtcgc catgacgggt gacgggacca 1155841 atgacgcacc cgcgctcgcg caagccgatg tcggggtggc gatgaatacc ggcacccagg 1155901 cggcccggga agccggcaac atggtcgatc tcgactccga ccccaccaag ctcatcgagg 1155961 tcgtggagat cggcaagcag ctgctgatca cgcggggcgc gctgacgacg ttttcgatcg 1156021 ccaacgacgt cgcgaagtac ttcgccatca tccctgccat gttcgtcggc ctgtatccgg 1156081 tgctcgacaa gctgaacgtc atggcgctgc actcaccaag gtcggcgatt ctgtcggcgg 1156141 tcatcttcaa tgcgctggtg atcgtcgcct tgatcccatt ggcgttgcgg ggcgtgcggt 1156201 ttagggcgga aagcgcgtcg gcgatgctgc ggcgcaacct gctgatctat gggctgggcg 1156261 gtctcgtcgt cccgtttatc ggcattaaac tggtcgatct cgtcatcgtc gccctcgggg 1156321 tgtcctgatg cgtcgtcaat tactgcccgc gctcaccatg ctgttggtgt tcaccgtcat 1156381 caccggcatc gtctacccgc ttgccgtgac cggcgtcggg caactgttct tcggtgacca 1156441 ggcgaacggc gcgctgctcg agcgggacgg gcaggtcatc ggctccgccc acatcggcca 1156501 gcagttcacc gccgcgaagt acttccaccc gcgcccctcg tcggcaggcg acggttacga 1156561 cgctgcggcg agctcgggct ccaacctggg accgacgaac gagaagctgc tggcggccgt 1156621 cgctgaacgg gtcaccgcct accgcaagga aaacaatctg ccggccgata cgctggttcc 1156681 ggtcgacgcg gttaccggct cgggttccgg gctggacccg gccatatcgg tggtcaatgc 1156741 caagctgcag gcaccgcggg tggcgcaggc gcgcaatatc tcgataaggc aggtcgagcg 1156801 tctgatcgag gaccacaccg acgcgcgtgg tctcggcttc ctgggcgagc gcgcggtgaa 1156861 cgtgctcagg ctgaacctcg cattggatcg cctctgactc tcaggcggta gtggcgatct 1156921 gctgctcgat catcgggagc cgcacccgaa acaccgtctg gccgttgccc gactcggccg 1156981 tgaccgagcc gcgatgcgcc ttgacgatcg agctgacgat ggccaggccc aagccgtggc 1157041 cggacccatt ggaccgagac ttgctggccc gcacgaaccg gtcgaagagg tggggcagga 1157101 tctccgggtc gatgtcgggg ccgtcgtcgg tcaccgacaa ttcaacacac ggcgcgttgg 1157161 gaccagtgcg gtggcaggtg atcccgatgg tcactgtgac gccgggctgg gtatgcaccc 1157221 aggcattggt gagtagattg ctgacgagtt gatgcaagcg ggcatgatcc ccgttgaccc 1157281 agaccggctc gtcgggcaga ttcttcaccc aacggtgggt gggcgccgca accgccgcgt 1157341 cattcaccgc gttgatgacc aggtcggtca ggtcgaggtc ctcggtttct agatcttcgc 1157401 cctcgctgag acgggagagc agcagcagct cgtcgaccag cagcgtcatc cgccgcgcct 1157461 cggattcgat gcgggccagc gcgtattcgg tggtgggcgg taggtccgag ctatcctgac 1157521 gtgtcagttc ggcatagccc tggatcgccg ccaggggagt acgcagctcg tggctggcgt 1157581 cggtgatgaa ctgccgcatc cgcagatcgg aatcgacgcg atgcgccagc gcaccatcga 1157641 cgttgtccaa caagcgattc agcgtgtgcc cgacgattcc gacctcgtta tccgggtcgg 1157701 tatcccccgg acggactcgc acgctgatct ggtggtcgtc atcggtaagt ggcatggtgg 1157761 cgacctcggc ggcggtcgcg gcgacccggc gcagcgggcg tagcgcatat cccaccaccc 1157821 acaccgtcag tgctgcggta accaccagtg cggccccaac aagcgcgacg gtggtgactt 1157881 tcttgcgggc gatgatctgg ttggccaggc ttagcgatac gccgacgaac agtcgatcgg 1157941 cgccagcggc gctgctgtca acctggtagg cgcccaggct gcccaggctt tcgacacgcg 1158001 gcgggccgcc gtcccacact tgcgcttcga tcgcgcggat gacgtcgggc ggagcgggtc 1158061 gtgctccgtc ttcggagaaa acggccgatc cgatcaccac gccgtcgtgc agcacggcaa 1158121 tgaggtttcc gggcgtctgg ccggtgaact ccagcaccgc ttgtgacatc gggaggttgc 1158181 cggtgggcgt ggatgtttgc gcactgtcgc ggtatctggt gtaagagtgg ttcaacgcgt 1158241 gcagggattc gactagctcg gcgtcgttca tcgcggtgac atagccgctt aggctcagca 1158301 cggagacgac accgacggcc accagcacaa cggtaacgac cgccaacacg ccgagcagca 1158361 attgctggcg taacgagcgg ggtcgccagc agggggcttt tctggaccga gtgtttcggt 1158421 ccgggatcat gccaggctca ttccggcgga cgcagcatgt atccaatgcc gcggaccgta 1158481 tggatcattg gctcccggtc ggagtcgatc ttcttcctca gataggagat atacaggtcg 1158541 acaatgctgg tgcggcctgc gaagtcgtag ttccaaaccc gatccaggat ctcggtacgg 1158601 ctcagtgctc gtcggggatt gcgcatcagg aatcgaagca gttcgaactc ggtcgaggag 1158661 agcgagatcg gcgtaccgtc gcgggttacc tcccggctgg ccccgtcgag cgtaaggtct 1158721 ccgacccgga gtgcctcatc ggcgggcctt tccagatggc tggagcggcg cagcaacccg 1158781 cgcaaccggg cgaccagctc ctcgaggctg aacggctttg tcatgtagtc gtcggcgccc 1158841 gaggtcagac cggtgacccg gtccatcacg gaatcgcgcg cggtgaggaa cagcgtgggt 1158901 gtgtagacgt cggattctcg gacccgtcgc aggatttcca acccgtccac atcgggaagc 1158961 atgatgtcga ggaccagcac atcggggccg accttgtcga acttggctat ggcctcttgc 1159021 ccgtcgtggg cgacttcgac atcccagcct tcgtagtgca gcgccatctt gaccagattg 1159081 gtcagcgctg gttcgtcatc gaccaacaac acccggatcg gtgatccatc cgcgcgatga 1159141 atccgtggca gctgccccag gatggcttgc cgcggacgtt gactgcgcgt gtaccccgac 1159201 atcgtcgtca tgctcccgta tcctctcaag tcctgtgcaa gcgcacatgc agttgtcacg 1159261 ggattcataa atttttcaaa tgtcgcttat gtagttactt cggcctgaaa aggtgaccgg 1159321 gcgggatgtc gggcttcggc ggtgagaaag cggatctcgg tttccgggta tacggagccc 1159381 ccggtggacc ggttatgcgg ggagggcgct gatcgtgacc aggttgtggg cgaacacgcc 1159441 gtgtccgacc caggtccggg tgccttcgag accgccgatc cggccgcggt cccagccgta 1159501 gccgcgtttg aggtggctga tccggccttc gcatccggtc cgccatttga tggtgcggcg 1159561 gaacgctttt cggtgttctt cggcgcgtcg atcctgcgaa ggtttgcctt tgcgcgggat 1159621 cagcacattc ttgacgccca cctcggtgag ctgctggtcg acggcggctt cgccatagcc 1159681 gcggtcggcg gtgacggtgc gcggcgtgcg tccggcgcgc tttttcaccc acgccaccgc 1159741 tggcgccagc tgcggcgcat cgggtgggtt gccctgctgc acagtgtgat ccagcacaat 1159801 cccgtcatcg ttgtcgacga cctgggcctt gtgctcaaac tcgaccggct taccgagccg 1159861 acccttggtg atcggggcgg gcatcaccgt cgtgcaggct gacccgtcga ctcgccccgt 1159921 ccgaagtgat gcccgcgacc cgctggcggg tctgcgccac aatctgacgc gtcgcgttga 1159981 gcagctcggt taggtcgttg accgcgcgca ccagcccacc acagcggcga cccgcgaccg 1160041 catcacgctc accgcgggcg gccagcgcgg cggccttggc cttggcccgg agcaccgcct 1160101 gcttggcgtt gtccagcagc tgctgggcct cctgagcagc ggcttgggcc agctcggcca 1160161 gctcgccggt gaacctcagt accgcggccc gcgcttcgtc acgccccagc tccgcacgcg 1160221 agcgcagttt cgctgcgacc gcgtgcgcgc gccgaccggc cgcgcgggag cggtcgccaa 1160281 cccgggtgcg caccgcgccg ccagcggcct gaatccgttt gccggttgcg gcgatccggc 1160341 gcattgcctt ggccaacaga cccaagtcgg tcggataaga cacgttcgcc cgcgccaccg 1160401 tggtatcggc ccggatccga ttggtgccca gcagcttggc ctcggccgcc ttggccaaca 1160461 atgcctcgtt gagcccgtcg atcgccgccg atccgcaacg cgtggtgagc ttcatcaatg 1160521 tggtcggatg cggcaccgac ccgtccagcg caatgcggca aaaccgccgt caggtgatcg 1160581 aatcagccac ctcccggcac agcgactcat agcccagccg gtagcggaac ttcacaaaca 1160641 tcaactgcag atagacctcc atcggcgtcg acggccggcc cctgcgcggg tcgaagaacg 1160701 gcacgaacgg ggcgaagaac gccggatcgt ccaacaatgc gtccacccgg gccagttcct 1160761 cgggcagtcg gcgcacctcg tcgggcagca gcgactccca caaccagcac tgatcgccta 1160821 aagtacgaaa cacgatggcc tcaatccctt ccgcaacaag ggcattgagg ccatcttccc 1160881 agttcagcac catccgaccg gggatcaacg cgccgacttt agcaggtcga agtagttagt 1160941 cgttcagata acaacgtggc cacacaccaa ccggtgtgcg gccacgttgt aattgacggc 1161001 gcgggcctta agccagcttt aggcccagct ggagccgacg gcgctgtcgg tttgtgccat 1161061 gttgttgccg gcagcctgca ccttctgccc gtgggcgttg gcctgctcgt agatcacctg 1161121 gaagttacgg cccagctggg taatgaaccc ctggcaggcc gccgaaccgg cgccgcccca 1161181 aaagtcactc gcggtcaaca catcagaaat gatggcctga tgctcggcct ccagcgaccc 1161241 ggccagagcg cggatcatgg cgccgtgagc gtcgacgtcc ccgaattgat agttgatggt 1161301 catgtgtcct cctgagtcgt cgggccgggt cagctgctga ggatctgctg ggaggcctgc 1161361 tcttgctgtt cgtagttgtt ggcgtcgcga accagcccgt cacgcacccc gtgcagcatg 1161421 ttcacgatgt tgcgaaacgc ctgattcatc tgggtcatgg tgtctagcga ggtcgcctcg 1161481 gccatgccac tccagcccgc gcccgagatg ttttgcgcgg acgcccacat ccggcgagcc 1161541 tcgtcctcca ccgtctgggc gtgcacctca aaacggcccg ccatgtcccg catcgcgtgc 1161601 ggatccgtca taaaacgcga ggccatgctg ctgtctcctt gtctcgaagt cgtcacgttg 1161661 ttgaagttct agcggctgtg atcggcgcgg tggtggccgc gtggcggaca ggttatgact 1161721 caacggttaa ttgctggcct caaacgagtg agatgtcccc ctttgtccgc atcacacgac 1161781 gacctgtttg ggcatgacag tgggcttgaa tccgtaccgc ggcccggcat aggcaccggt 1161841 gcccttggcg gccgaggcca ttcccggcat catcccggta actgggccgg cttcttcggc 1161901 ggcgacggtc cagccgctgc cttcgagcgc tgtggcgccg gcggttgtcg ccggtgcggc 1161961 cgtagaccag gccgccggca ctgacagccg gccgaccagg gtggcctcgc ctaaacttgc 1162021 gccgagcccc gctggcgtca ccgagtccgc caaccccgcg gcggccgcac tggcggcacc 1162081 ctcggcagcc tcgatggcgc cttcggcgat cgctaccggc gccccactgt tcagggcatt 1162141 tgctaggaat atcgcggtgg ggatggcggc gttgacatac caagcggcgg tgttgactgc 1162201 gctgttgatg atgtttgcca cgaacggggt cgcgagcagg gcgtcgatgt cggcaatgat 1162261 tccgctcagc cccgtcgagt cgagaaccga tgtgactggg gaggcgagcc cactcaccgc 1162321 gttgggcagg ctactgatca ggtccgctac gctcacctgg ttgacggcgg cggtggcggc 1162381 agccgagccg accgcggcgg actgggcggc cagcccgccc gggttggtgg tctgcgacgg 1162441 cgggcttaac ggttgcagca tcccggcggc tcccgaagcg gccgcgtagc cgtacatagc 1162501 cagagcgtcc tgagcccaca tctcggcata gagggcttcg gtcgccatga ttgccggtgt 1162561 gttgatcccc aggacgttcg tcgcgaccag ggccgccagc agcgcccggt tggccgcgac 1162621 cacctccggc ggcactgtca tcgcataggc cgcctcgtag gcggccgccg acgccatggc 1162681 ctgcgagccg gcatgcgcag cggcttcggc ggtgtaggtc aaccaagcca gatagggctg 1162741 ggctgcggcg accatcgcca tcgaggccgg acccatccac gactcggtgg tcagccgggt 1162801 gatcaccgac tcatacgacg cggccgtcgt acccaactcg gcggccaggc cgttccatgc 1162861 ggccccggcg gccatcatcg gtcctgcacc cgcgccggcg tacatgcgtg cggagttgat 1162921 ctcagggggt aaagctccga aatccatggg gtattccgtt tccgtggagt tatttggctg 1162981 aatttcgttg ttggttgagc gtggccgccc gtacgtctgc cgcctagacg gttgctggct 1163041 tgggcatgac gatgggtttg acgccgtagc gcggtgcacc gaagccggcg ctgttgcgtg 1163101 cggccgaggc cacccctggc atcccgggga tgaacgtccc cgcggcggcc tgcggcgcgg 1163161 cggcggtcca gcccgcgccc ggcagtgtgc tggtggtgga taccagggtc gcctgtccgg 1163221 cccaggcggg cggcaccgac aacatgccga tcgcggatgc gctgcccagg ccggccgcaa 1163281 ttccggcctc gccgagggcg gcttcggccg cgcccaatgc acccaattcg cccaaggcgg 1163341 cttccccacc gagagccgag gcggcctcgg ccgcctcctc tgcaggcagg aggccgccgc 1163401 cggcaaggcc gatcagcgta gacgtggcgg aggcccagtt cccggccccg atgttgagga 1163461 tgttgccaat gccacccgag agttcgggcg ggaacaaccc cgtcgtcgct tggatgatgg 1163521 ccgacgcttc ccctgtgatc cccgagagtg gcgaggcggc agccgatgag ttgagcgact 1163581 cggtgacgcc gtaggtaccg gcgctgatcc ccagagtgtt gacaaacatg tcatgcatag 1163641 cctgagcttc ggcgctgacc tgctggtaga aggtgccgta cgcagtgaag agcgccgcct 1163701 gcagcgccga aacctcatcg agggccgccg gagcgatggc tgtggtgggc gccgcggcgg 1163761 cagcgttttg ggctgccatc gcagcaccga tggtcccgag ttgcgcggcc gcagccgtca 1163821 actcttcagg cactgtcttg aggaatgaca tccattgctc cttgtgtgtg aaacctgccg 1163881 gccgctagca ccccgggccg accctgtgtg tttgcgtacg gctgcctgtg gattggcgta 1163941 acgctaaccg gccaagcctc cacagtcgcg accgaaaggc atgggacgcc cgacgtttac 1164001 ggttttttaa cgtttacgtc agcatcctta acaaggtctt ggcggctgac atggcggtgt 1164061 gatctggtgc ccgggctagc acacttcggc acacaaatga gacgcgcggc gcgcggattc 1164121 taggcgaatg acggctcttt cgcacctggc gtgtcgcggt agggttggtg cactggatcg 1164181 ggtccaagcg ctacattcgc cgtcaagcct ccacagcccg attagcagag gcagcggaca 1164241 atccgcgctc acgggtgctg gcgtttgcta gtgccggtaa tcttcgaaag agtcgcttct 1164301 aactgccaat atgccgggtc gaagccactg tccagcactg tcggcatcca gatgggggcg 1164361 ttggcgcgct gatcgatacg ccgtctgcgc ggctcccggc acaatgagtt cgtgcccgat 1164421 tcctggccgg tcgtttgcgt tgacgactgg tctgttgccg gcctggagac ccaggggcaa 1164481 cacccgcacg attggctcaa acattcttcg cagaagcgga cgtggctctt caagccggcg 1164541 cgaccggagc gcgatcgttt actcggcgaa gacgtggcag aaaagctcgc cagcgagttg 1164601 gcgcggctac gcgatgtctc cacaacaaga ggggaagctc acccgtcgtg caaatgctga 1164661 gcgggggtct ggtcggtcag cgtgaacccg aggctggccg cgtggtcgaa cgatgggcac 1164721 agcgcctcca catactttgt ctccagtggc ggcacatgga ccgcccagtt gcggtcgtga 1164781 cgatcaccgt gggcgatcaa tgcgtcgaac acgaggtagg tcgaaagcgc ggaacgtggg 1164841 taggagcgct tggtcggcag gtgctgcgaa ccgagcaagc gcctgctgga tcgcctcgac 1164901 gttgtgccca cgttgcccgg gatcgtcccg gtcgcagttg agcacaacct cgggcatcaa 1164961 tgcttgcggc aaccgcacgt ccttgaccag cgcaccgcgc acgccgtcac ggacagccag 1165021 ctggaccggt gccgcaggta tcccggctag ggcgtgtctc ccaatttcgg agttcccact 1165081 cgggcgtgga tgacggcgca ggccagcagg acgccgccga ggtaggtcag ggcgtatttg 1165141 tcgtagcggg ttgcgatgcc gcgccactgc ttgagtcgat ggaagccgcg ttcgacggtg 1165201 ttgcgtagcc cgtagagcgc ggcgtcgaat gctggtggcc gcccgccggc agaccccttg 1165261 gccttgcgcc ggtcgatctg atcttggcgt tcggggatgg tgtgcttgat cttcttagac 1165321 cgtaatgcgg cacgggtact tgggtgtgag taggccttgt cggcgagtaa gcggaaatcc 1165381 gtgctgccca gggcgtattc ggtgctggca tggcgatagt cgtcgagcag gggcagcagt 1165441 tgcgggttgt cgccggcctg gcctgcggtc aaccggatcc gcaccggggc ttcgcgctga 1165501 tcggtcaggg catggatctt ggtggtcagc ccgccgcgcg agcggccgat cgcatgatcg 1165561 tcgggttcat cggcggattt cttgtaattc gacagtgccc cctgtggcga gcgtgtccga 1165621 gcaggcgccc gccgaatgct ggtgtgcccg cacgttcgtg gaatccaccg acagcagctt 1165681 ctcgatatcc tcggccacct cagcgtccac cccgaacacc gcggcaacgt gggcgaacac 1165741 ctcgtcgcag gtaccatcca gcgaccaacg gtgatggcgc ttccacaccg tttgccacgg 1165801 cccgaactca gcgggcaggt cccgccacgg acttcccgta cggaaccgcc acgcgatccc 1165861 ttccaggata agccggtgat cgctaaaccg tctgccgggc ttgccctcat gcgacggcat 1165921 caacggctcg accacggccc agaactcgtc cgaaatcaca cccactcgcg tcaccggcca 1165981 atcctcgctg gccagtaacc taaaaatttg ggagacacgc cctaggcgcg ggctgcagcg 1166041 gtagtacttt ggcctgttcg gcgcatctcc tatggctgcg gcccgctggc tcaaaccttg 1166101 ccttgccacg ccaagccatt cctagccttg cctagccaca ccatgccctg cctagacaca 1166161 gcgagcctac gccgcgtcga gttcggcgaa aatcaaactg acccactacc accggattga 1166221 agggtttggt gcgtgttgat acgtcccggg ttgtgcctat gggagggtgt ccatctccac 1166281 gatgccgccg aagtcgagtt cgtcgagtgc tcggatcact tcgctcgacg ggatgccgcg 1166341 atagaagggt gccgcgttcg gtccggtgcc cgtcgacggt gcttccgcag agtcctcgac 1166401 cacgaggccg atcacacggc cgtcttgcgc aacgatcgga ccgccgctgt tgcccggccg 1166461 cgcgattgcc gagtagagga aaatcttctg ccggcagggg atagtcgtcg cggccgggtt 1166521 gaccacctcg ccacgctgca ccgtgatcgc catctccgca gtcatcggca cccgcgggta 1166581 accgaacacg tagacctcat ccgcccagtc gggatcacgg aacgccatgc cgccaagccg 1166641 cgggatgtac ttgccttcgg gcatctcgaa tttgattact gcgacgtcga gcgtggggtg 1166701 cgggtgagcg gtgcccgaga agttcaccaa ctcggcttcg gcgtggttgc ttgacggata 1166761 gacggacaga cctgcgctcg tgcccgcgag cccggtcacg acatgtttgt tggtgatgac 1166821 gtgattgtgg tcgacgacga ggccggttcc ccaactatcc accggattgc cagcgtcgtc 1166881 gtgaccggcg agttgaacgg tcaccgcgtt gtagctcggg atgatgagct cggcaccgaa 1166941 cacctcggac aaccagaggt tgccgccacg ctgtcccttc gatatcgtcc cctgcgagat 1167001 gtacttctgc cccatgactg gcaatcgcgg gtcccaaccg agcggcagca gaagtcccgc 1167061 gcgttccatc gagctgagga tgcggtggag ggtcaccgcg tcgcccgcgg cgggcaggcc 1167121 gagggtgctc aggtatcggg agaaatctgc gaccgaccac ggttcgaagg gcaccgtcgt 1167181 cggcagaccg atatccgagt caaccggtgg tggttcgggt ttgccgatcg ccgccgcaac 1167241 caccgggttg tggaccagcc cgaagaattg atgggcgcac atcgccacgt tcacacgcca 1167301 cgcaggagtc ccgggcttca ggtcggccgc cgtgagctgt cgcggtcagg tgctttccgc 1167361 gccatccgcc gtcacctctg ccatggtcca tctacggtat ctgcgacaag ggcagcgtcg 1167421 atgcctcgac atgcagagtc ggtgttcgct tcacgcgaac taggcgcgcc tagcctggac 1167481 gagtccccgg gccgacattc gcccgaggcc ttggcctcca tcacctaatt gtgtgcaaaa 1167541 ccgtatctaa ttgatacgat tgcgcacatg gctatctggg atcgcctcgt cgaggttgcc 1167601 gccgagcaac atggctacgt cacgactcgc gatgcgcgag acatcggcgt cgaccctgtg 1167661 cagctccgcc tcctagcggg gcgcggacgt cttgagcgtg tcggccgagg tgtgtaccgg 1167721 gtgcccgtgc tgccgcgtgg tgagcacgac gatctcgcag ccgcagtgtc gtggactttg 1167781 gggcgtggcg ttatctcgca tgagtcggcc ttggcgcttc atgccctcgc tgacgtgaac 1167841 ccgtcgcgca tccatctcac cgtcccgcgc aacaaccatc cgcgtgcggc cgggggcgag 1167901 ctgtaccgag ttcaccgccg cgacctccag gcagcccacg tcacttcggt cgacggaata 1167961 cccgtcacga cggttgcgcg caccatcaaa gactgcgtga agacgggcac ggatccttat 1168021 cagcttcggg ccgcgatcga gcgagccgaa gccgagggca cgcttcgtcg tgggtcagca 1168081 gctgagctac gcgctgcgct cgatgagacc actgccggat tacgcgctcg gccgaagcga 1168141 gcatcggcgt gaccaagccc tattcgtcgc cgccaacgaa cctgcgctca ctacgagatc 1168201 ggctcaccca agtagcggaa cggcaaggtg tcgtgttcgg tcgactgcag cggcatgtcg 1168261 cgatgattgt tgtcgcacag ttcgcggcca cgctcaccga cgacaccggc gctccgctgc 1168321 tgttggtcaa aggcggatcg tcgctggaac tgcgccgggg aattcccgat tcgcggacct 1168381 ccaaagactt cgacacggtc gcacgtcgcg atatcgaatt aatccatgaa cagctcgctg 1168441 acgcgggcga gacggggggg gaaggattca ctgcaatctt caccgccccc gaagaaatcg 1168501 atgttcctgg tatgccggtc aagccgcgcc gattcaccgc caagctgagc taccgaggcc 1168561 gggctttcgc aactgttccg atcgaggtct cctccgtcga agccggcaat gccgaccaat 1168621 tcgacaccct cacctcagac gcgctcggcc tcgtgggcgt acccgcagca gtcgccgtac 1168681 cctgcatgac cattccctgg caaatcgcgc agaagctgca cgcagtaact gccgtgctcg 1168741 aagaaccgaa ggtcaacgac cgcgctcacg acctggtgga cttgcagctt cttgaaggac 1168801 tgttgctcga tgccgacctc atgccgacgc gcagcgcgtg catcgcgata ttcgaagcgc 1168861 gcgcccagca tccttggcca ccgagagtcg ccacgctgcc gcactggccg ctgatctatg 1168921 caggtgcgct ggaggggctt gaccaccttg aactcgccag gacggtcgac gcggcggccc 1168981 aggcagtgca gcgattcgtt gcgcggattg atcgggcgac gaaaagatga gtgctggcgc 1169041 ggcctgcggc gcacgggaga acacagggac caccccggtt ccatagtcaa cgtcagcggt 1169101 gcgggtgtcg atcagacgac gaatggaatc gccctcgcat tcctcgcgat cgagtgccta 1169161 tgagccgcgc tcctgcggcc taggcgagcg ctttccgggg ctctcagaca tcggcctcgt 1169221 ggcggtgtgc gcggcggcat gtggctctgt gatctcttgc gcgagcgccg attgcgaatt 1169281 tcgtccggcg aaaagtgacc gctccgtgac cttaatgcaa gaggtgtgtg gtgtggagag 1169341 gggcgggagg aagggagtga ggcgacggtg tcgagatgca gcgaggattg gtggacttcc 1169401 ggtagttgtt taacaaggcc ccggagacca gggggcgagg gagagcgcgg gccgacttgg 1169461 gtgggtgagc ctggcttggg ctggtgcgtg agcggaggat cgctggtggc cccgtagttg 1169521 gcgttggcct gcggacgtgc cgcgcctgcg agggattcgt caatcttcct gttgatgtcg 1169581 cccgtgccac gtcggtgaga tgtcgaaggg atgtgacctg gtgcgttcgc gaacagctgc 1169641 tgaccacggc caccgacggc gctcaactgt cgtcgattcc atcccacccg tgcttggact 1169701 ttcaaactgt ccggcgccga tggggaaacc tggtgtttgg ccggaacgtg gcgccgagcc 1169761 tcgataatat cagcagttac gtccaggggt gtggtgtacg ggcaggtaag gccggtgggc 1169821 gtgtcgtagc ccagtagtgg gcggtcatcg cgtgatcctt cgaaacgacc agcaaaagtc 1169881 aatcgaagga aatgacgcaa tgacctcttc tcatcttatc gacaccgagc agcttctggc 1169941 tgaccaactc gcacaggcga gcccggatct gctgcgcggg ctgctctcga cgttcatcgc 1170001 cgccttgatg ggggctgaag ccgacgccct gtgcggggcg ggctaccgcg aacgcagcga 1170061 tgagcggtcc aatcagcgca acggctaccg ccaccgtgat ttcgacaccc gtgccgcaac 1170121 catcgacgtc gcgatcccca agctgcgcca gggcagctat ttcccggact ggctgctgca 1170181 gcgccgcaag cgagctgaac gcgcactgac cagcgtggtg gcgacctgct acctgctggg 1170241 agtatccact cgccggatgg agcgcctggt cgaaacactt ggtgtgacaa agctttccaa 1170301 gtcgcaagtg tcgatcatgg ccaaagagct cgacgaagcc gtagaggcgt ttcggacccg 1170361 cccgctcgat gccggcccgt ataccttcct cgccgccgac gccctggtgc tcaaggtgcg 1170421 cgaggcaggc cgcgtcgtcg gagtgcacac cttgatcgcc accggcgtca acgccgaggg 1170481 ctaccgagag atcctgggca tccaggtcac ctccgccgag gacggggccg gctggctggc 1170541 gttcttccgc gacctggtcg cccgcggcct gtccggggtc gcgctggtca ccagcgacgc 1170601 ccacgccggc ctggtggccg cgatcggcgc caccctgccc gcagcggcct ggcagcgctg 1170661 cagaacccac tacgcagcca atctgatggc agccaccccg aagccctcct ggccgtgggt 1170721 gcgcaccctg ctgcactcca tctacgacca gcccgacgcc gaatcagttg ttgcccaata 1170781 tgatcgggta ctcgacgctc tgaccgacaa actccccgcg gtggccgagc acctcgacac 1170841 cgcccgcacc gacctgctgg cgttcaccgc cttccccaag cagatctggc gccaaatctg 1170901 gtccaacaac ccccaggaac gcctcaaccg agaggtacga cgccgaaccg acgtcgtggg 1170961 catcttcccc gaccgcgcct cgatcatccg cctcgtcgga gccgtcctcg ccgaacaaca 1171021 cgacgaatgg atcgaaggac ggcgctacct gggcctcgag gtcctcaccc gagcccgagc 1171081 agcactgacc agcaccgaag aacccgccaa gcagcaaacc accaacaccc cagcactgac 1171141 cacctagact gccacccgaa ggatcacgcg aggaaccttc actcgtacac cacgtccctg 1171201 gccttggccg aaggtagaac gccagcacga cttgctgttg tcaactcttg cgagttacgt 1171261 gagtgcggcc ggagcacacg ctcgtatcgt cgtcacagtc gaagggcgcg atcttgagtt 1171321 cgacatatcg accttcgccc ttgtgggccc gcagcagctg cccgaagtcg agccgtcgca 1171381 gtagtgaccg ggggcccagt tagcgatggc tttgtcactg tggagggtct ccctcccgta 1171441 gtgatgcacc actcgcacga gagccaattc ggccgcccgt cgcgccgcag cagagcgcgg 1171501 tggctcttcg tcgttcattt ggtcatcgcc tcgcgtagat gttccgccgc gtcttcgccg 1171561 cggacgccgg ccgtgcgtag gtcggcgtat accctcggcc ataacatcga cctaaacccc 1171621 tgcaggttct gttcggtcag ccgggcgcac gctggggtcg ggaagaagcg caatatcaac 1171681 cgaccgccgg cgatttcctg caatcccgca gccattgctg cacggcgaag atcgctccat 1171741 gatcgcccgg gcacgtagat ttccatcgga gctatttccg tctgcattgg cgcgagcagg 1171801 ctcgccgaca acgcgcttgt ggctgcccac tcaatgcctg cggcatccca caactggccg 1171861 gccttgacca cgccggcagt cggatcgcgc cacagcacac cagtcgaaat agaaatcggc 1171921 gaccgaagct tgtctgctgc ctcagcgtat gcatccaaca gcgcatcgcg atcaacgatc 1171981 aggcgcgccg atttcgggcc gcgggcagtg gcactggcca gatggccgtt tttttcgaga 1172041 aacttcaacg cctgagcgct gcttcccatc gagagaccgg tggcctctac aaccgaggcg 1172101 acagttggac cggcgatgtt cgccagcagc gcttcacata cggcaagtgt ggcgcggcgc 1172161 cagcctatgc gcgcgtcgag tggggcaggt ggcgcgcctt tcgtctcgat caccagagtg 1172221 gttccagttg atgtgtttcg gtagtggata tcggcagcgc ccgactcgtc cacccaccca 1172281 acaccggcgt catgtgccgc cttccgggca ccaggagaca tcgtgggtgc agccaaaatg 1172341 tcgggccggg atgtggcgtg gagtgcctcg gctacctgac ggggccaacc agtcgtgagc 1172401 cagcgaacca ggaactctgc gccgtcgagc gacacaatca cgtcgcgatg agggccgttt 1172461 acgcgtcgtg cgcgcacttc gctgcgaaac gcgccttcca gcgcactcac ggtgcgttcg 1172521 tcccaagaca tggaggcatc atacttcact aagggacgat actctactgt ttcagtgaag 1172581 taccatctac ggatgaagtt cgattgccac gtgcgatccg acgcttgcac ttcgctggcg 1172641 ggccgcgaac ccgatcagct cctccaggtc gtcggcacgg gtcagcaagg cggcgctgtc 1172701 cgggtgggcg cgcatgccaa acaccagtcg ctcaccgtcg gggtcaagca acaccaggtc 1172761 gcggagcatg gtggcgggcg gaacccacac ttcgggctag ctctaggggg cagggctttg 1172821 acgggtcttg acaaatacgt gtagctacac gagtctggag taatgggcaa aggggcggcg 1172881 ttcgacgaat gcgcttgcta caccacccgg cgggcggccc gacagctcgg ccaggcctat 1172941 gatcgcgcgc tgcggccgag cgggttgacg aacacccaat tcagcacgct ggccgtgatc 1173001 tcgctgtcgg aaggcagcgc cgggatcgac ctcacgatga gcgagcttgc cgcccgcatc 1173061 ggcgttgaac gcacgacgct aacccgcaac ctcgaggtga tgaggcgcga cggactggtg 1173121 cgggtcatgg cgggtgccga cgcgcggtgc aagcgcatcg agctgaccgc gaagggccgc 1173181 gcggcactgc aaaaggcggt gcccctatgg cgcggggtgc aggcggaggt gaccgcaagc 1173241 gtcggtgact ggccacgggt gcgacgcgac atcgcgaatc tgggtcaggc ggcggaggcg 1173301 tgtcggtgat ctttttgcgc atatatgtgt agttacaccc aactgaggag caaatgatgg 1173361 ctaggcagag atttcgtgac caggtggtgt tgatcaccgg tgcctccagc ggcatcgggg 1173421 aggcgaccgc gaaggcattc gcccgtgagg gcgccgtggt cgccttggcg gcgcgccgcg 1173481 agggtgcgtt gcgccgggtt gcccgggaga tcgaggccgc gggtgggcgg gcgatggtcg 1173541 ccccgctcga cgtctcgtcg tcggagagcg tgcgcgccat ggttgccgac gtggtcggcg 1173601 agtttggtcg cattgacgtc gtgttcaaca acgccggcgt ctcgctggta ggcccggtcg 1173661 acgcagagac cttccttgac gacactcgcg agatgctgga gatcgactac ctcggcacgg 1173721 tgcgcgtggt gcgggaggtc ttgccgatca tgaagcagca acgatcggga cggatcatga 1173781 acatgtcgtc ggtggtgggt cgcaaggcct ttgcgcgatt cgccggctac tcctccgcca 1173841 tgcacgcgat cgccggtttc tccgatgcgt tgcgccaaga gctgcggggt agcggaatcg 1173901 ccgtctcggt gatccacccg gcgctgaccc agacaccgct gttggccaac gtcgaccccg 1173961 ccgacatgcc gccgccgttt cgcagcctca cgcccattcc cgttcactgg gtcgcggcag 1174021 cggtgcttga cggtgtggcg cggcggcgcg cccgcgtagt cgttccattt cagccgcggc 1174081 tgctcatggt gggtgacgcg ttctcgccgc ggtacggcga ccgggtggtc cgcttgctcg 1174141 agagcaagat attcggtcgc ctgatcggtt cctatcgggg ttcggtatac cgccatcagc 1174201 cgaccgaatc agcgaaggca caggcggccc agcccgagcg cgggtactcg tcggcccggt 1174261 gaggttggtt ggagccaggc tccacgtcgc tgaggcgagc ggcgtgcgca gcgcgtagcg 1174321 gctcgtcggc acggtgtcga tggtctcctt ggcgctgaat cgcgacgtgc tggcgatcac 1174381 ccgggcaagc cgatcacgca gttcgtcgtc gggcgccagc tcaacgtcga gttgcgctct 1174441 cccgctttga gatccagcgc gacacctcgt cgcgccggta cacgacgcgc cgtcccaagg 1174501 tgaagctcgc cggtccgatg tccgagtgcc gccagtgccg tagagtgccg acgggaacgc 1174561 cgatcatctc cgaaacttgt tttgcgtcca gcagatccat gtttctcctc ccgacatggg 1174621 ctggtttcca atgtctccaa cagtgctggc agcgtccgtg ttcggtcgcc catttcgctt 1174681 gcgcgactgc gccataaccg gccaggtgag gcgcgacggg ttcgagagtg gcgccgcggt 1174741 attgtgcgac tgccctggcc gagcggagca gctcgtcgtc gtcgatgccc cggcggtcga 1174801 gttggacgat gtcgacgtag tcccgccagc gggtgctggt gatgccgcgt tcgaggatgg 1174861 tcactccctt ctcggcgatg atggtctcgg gcgcgtagcc caggagtgtg atcggctcgc 1174921 cgaggatccg gtcgatggtc acccgtgtgg gccacggcgc gatcggttcg ccggtggaca 1174981 catcccaggc cgcgatgccc tgccacggtc cgaccgacat agcgactcgc acgcgcaggc 1175041 ccgggtagtc ggcccgctcg cgaatttcct gcacgctgct cgtgtcgagg ttgaacgcca 1175101 ccccgtcgtc gatgtcgatc acggcgatgt cgcgaaccac ctgggtgaga tgctcggcgg 1175161 tgacgtcggc gcgcatggcg ttggagtcgg tgtccttcgt cgggtgccga acgccgtagg 1175221 cggccagcag gatccggcct ttgaggacga agtctgcagc atgcgaggtg cgggtgagcc 1175281 gatccaggaa cgattcgagg gtgtgtcgag tcaggtactc ctgcgtcggt gcgccggtcc 1175341 cgcacttcga ggcagtagaa cgagcgagga ttggatccgg cgggacaccg tgtcgccgga 1175401 gctcacgcca gcatcgccaa tgtttgtaag accggggact tctcccgcgg taggcgggtg 1175461 gcgatctcga tcagccgggc gggtttgccg cctcggcgca gccactctcg cagcgcgtca 1175521 cgcgccagtt cgtaaccgac ttcgtagcgg agccggaatg tatcggcgat cgagcgctcg 1175581 ggtgagttag attccgattg tctgatccga tcccgggatc gtgatctcgt cgcgtccgat 1175641 ctaaaatgtg gcccggtcga agtggtgcca cgcaatcgcg cctgtgctgg ccggtgtcct 1175701 cgaccagcgg gggatggcga tgtccagcgc ggcggggatc gcgtcggtca ggtcgtggtg 1175761 cgtgagtgcg gaggccaggc agatcgtagc gtcggggcgg cgcgtggcgg cctcgatccg 1175821 atcccagtcg gcggtcgacg cgtctacggg taggtagatg ccgcgggcga tgcggtccca 1175881 gcggcctgcc tgcgcgccgc ggtaaagcgc gctgcgcgag gccggcccgc cccgcagtgc 1175941 tcggtgtcag ggcttccacg gcgcctatcc cactcgtctt tggtacggaa cgtaggcaga 1176001 taactctatg tgtagacgtt tcgtatcgat gcctgaggaa atcgggaaca ggccccgcgg 1176061 gcgcatggct attgggagta cggcggggca ctgacattgc gaggccaccg tcggttggcg 1176121 ccggtagcat ggggatttgt cgatgcttgg tgaaggagca accgtgggcg gtgagacgcc 1176181 taagaaggtg gtcgtctcat ggactgctgt gaagaacgcg gggtcgcgcg ccacaagggc 1176241 ctcagccaag ttggaacgcc gggttgtccc cgttggtcac aagcggtcag ctgccgttgc 1176301 agcgcatatc gagaagcagc ggtcaccgca gtccagatgc cgctgacgcc cggctacggt 1176361 gagaccccgc ttccgcacga cgaactggcc gcgttgctcc ccgaggttgt cgaggtgttg 1176421 gacaagccga tcacgcgcgc tgatgtttat gacctcgaac agggccttca ggaccaggtt 1176481 ttcgatctat tgatgccgac ggctgttgaa ggctcgttgt cgcttgatga gcttctcagt 1176541 gaccatttcg tccgcgatct ccacgcgcgt atgtttggtc cggtatagga ctgggccggg 1176601 cggtggtgac gacgtgaact caacatcggt gttgcaccgg agcaggtcgc cgtcgaggta 1176661 cgcaacgcgc tcgacaccat cgcgtaccgc tgggtgcaca ccgatgattg gaccggtcgg 1176721 caactgggta ttgttgttca tgcagacctt gtgcgaatcc atccgttcac cgatggaaat 1176781 gggcgcacca caaggcttct cgctgatttg gtgtacgcga cggttcagaa tcccaccgag 1176841 ctgcagtatg actgggagct cgataaactg cgcttacgtc gaactacttc gcggctacga 1176901 ccgagaccgg gacattgcgg cgctcgccgc cttcatcggt gtgcggccca tcgagacata 1176961 ggcaggctgt cttgttgaag ccggcgaccg ggcgacccaa gcggaggagg taccgcggat 1177021 cactgcggta ccgtcgacgc ggtggcaacc aggcatcaac gggcggggat tgacgaccgc 1177081 tggcataagc gggtcaaagg gccggacggg aacaggcgaa ccgtgcggtc tgctgtctgc 1177141 ggcagggttt cgcgctggcg cgtcaggtgg gttgacggcg gcggagagga gcacagcaag 1177201 agcttccagc gcaaacctga cgcgcaggca cctgacccat gccgaactgt tgatgctcgc 1177261 cagggccacg ggccggttcg aaacgctcac cttggtgctc ggctactgcg gcttacggcg 1177321 gtttacggtt cggtgaggct gttgccctgc ggcgcaagca tgtgggggat cgcgtgctga 1177381 ccgtccgatc gtcccctacg gcggtgaccg gcaagggcat cgttgagtcg acgaccaaga 1177441 cgaagcggga tcgtcacgta ccagtgcctg agcctgtttg gcgcaggctc catgccgagt 1177501 tgcccaccga cccgaacgcc ttggtgttcc ccggccgtaa gggcggattc ctgcctctcg 1177561 gtgaataccg ctgggcattc gacaacgccg gcgaccaggt cgggatcgaa ggctggtacc 1177621 gcacggtctg gggcacacca cggcctcgct ggcgatcagc gcaggcgcta acgtcaaggt 1177681 cgtgcaacgg ctccttggac acgcagcagc ggcgatgacg ctcgaccggc acggccatct 1177741 gctcaacgac gatctagcgg tgtggccgat gcgctgtgca aagtcatcga gaacactgcg 1177801 gtatcactgc ggtatgcgga gacggaacag agtcgggctc cgggcatgag atagcgcgtc 1177861 tgaactgcaa cgcccccata gcccaattgg cagaggcagc ggacttaaaa tccgtcaagt 1177921 gtcggttcga gtccgactgg gggcacgggg aaatcgttgt tggcaagtca tggcgttggg 1177981 cactgctgct gctcgccgct caagccagca acccaacctg gcgatacgtt ggtttgagcg 1178041 gggcgactcc cgtcgggcca cctacgcccc gcctgttgct atggccggac aaggagcatc 1178101 gcgatgagcg tggattaccc ccaaatggct gctacccggg gaagaataga accggccccg 1178161 cggcgagttc gcggctatct cggacatgtg ctcgtcttcg acaccagtgc ggcgcgctat 1178221 gtctgggagg ttccctacta cccgcagtac tacatcccgc tggcggatgt ccgcatggag 1178281 ttcctgcgcg acgagaacca cccgcagcga gtgcagctgg gtccgtcgcg gctgcactcc 1178341 ttggtaagcg ccggtcagac ccaccgatcg gcggcgcggg tattcgatgt cgacggcgac 1178401 agcccggtgg cgggcaccgt gcgtttcaac tgggatccgc tgcggtggtt cgaggaggac 1178461 gagccgatct acggccatcc gcgcaatccc tatcagcggg ccgatgcgct gcgctcgcac 1178521 cgacacgtcc gtgtcgagct ggacggcatt gtgctcgctg acacccgatc gcccgttctg 1178581 ctattcgaaa ccgggatacc cacaaggtat tacatcgatc cggccgacat cgctttcgag 1178641 catctggagc ccacctcgac gcagacgttg tgtccgtaca aggggacgac gtcgggctat 1178701 tggtctgtgc gcgtcggcga cgccgtgcac cgcgacctgg cctggacgta tcactatcca 1178761 ctgcccgccg ttgccccgat cgccggcctg gtggcgtttt acaacgagaa ggtcgacctc 1178821 accgtcgacg gcgtcgccct gccgcggccg cacactcagt tcagctagtg cttggtttgt 1178881 tcgccggttg gcggccgcca gcatggtcaa cctcatctag ggcgtgggtg tcggggcgca 1178941 gcaggctgcc ggcgatctcg cggacaccgt cttggctgtg cccaatctag attccgatcg 1179001 gcctgagtct tcttctgccg gcgcagcgca tcggcgcggg ccacgattgc atcgacgtgg 1179061 acggccagcc ggcgctgggt catcgacggc cagcgagccg ccctgagagc gagctcggcg 1179121 gccacggcgc caacacctca ccgtcgacgg tcagatcgct gcgacacacg atcgtttgaa 1179181 acatccggta gtcgatgtcg ccggcggtga agacttggcc gaccatggct aggcactcgc 1179241 gcataccgca gctggctggc cgttggcccc tggctgatcc gcaaggccgc accgacctca 1179301 gcgatcaccg ccgcctgtga ccactaacca gtctcatcga aaatatattc gatacagcca 1179361 cttgccgtcg acattgacca tgaggcgttc acgtcgcagg gccgacgaaa tatgctgaga 1179421 cctgcctact cgtgtgcaat gtgatattag cctcattttg atttgaatta tgagaatttc 1179481 ttatttccca gttatgggga gcgtgtgctg gttgttagcg aagtacgcta aaactgcagt 1179541 tactgctcat agcactggtt tgccacatac cccgtatcgg gatacgtcat gatcggtatc 1179601 ctgagcggaa cataaggcgg tcacgtgacc taggtaacag cgtctaattc gtgaaatttt 1179661 tgatcagaat ttggtcgcta gacttattcc agcccagtat gaatcagcgc ttttggtgcc 1179721 gaaatgcggc gaatcccggg cagtcggcgt cgcacagcac ggttgctgtg ctgtcgcaag 1179781 cctggaggcc cgcagacaca gcaagcgagg agcggcgcgt atgagccgcg ccggcgacga 1179841 tgcggaacga agtgatgagg aggagcggcg catgagcgtt atgaacggcc gggaggtcgc 1179901 tcgagagagc agagatgccc aggtcttcga gttcggcacc gcaccgggct ccgccgtggt 1179961 caagattccg gtgcagggcg gtccgatcgg tggcatcgcc atcagccgcg acggcagtct 1180021 gctggtagtg accaacaacg gcaccgacac cgtctcggtc gtcggcaccg acacctgccg 1180081 ggtcacccag accgtcacca gtgtcaacga accgttcgcg atcgccatgg gcaatgcgga 1180141 agccaaccgc gcgtacgtca gcacggtgtc gtcggcgtac gacgcgatcg cggtcatcga 1180201 cgtggccacg aacaccgttc tcggcaccca tccgctggcg ctcagtgtga gcgacctgac 1180261 actcagcccg gacgacaagt acctgtacgt cagccgaaat ggcactcgcg gtgctgacgt 1180321 tgcggtgctg gacacgacga cgggcgcact gatcgacgtc gtagacgttt cccaggcgcc 1180381 gggcaccacc acgcaatgcg tgcggatgag cccggacgga agtgtcctgt acgtcggcgc 1180441 caatgggcca tccggcggcc tgctcgtcgt gatcacgacc cgcgcgcagt ccgacggggg 1180501 acgcatcggg agtcgctcgc gttcgcggca gaagagctcc aaaccccggg gtaaccaggc 1180561 ggcggcgggc ttgcgcgtgg tggcgaccat cgacatcggg tcatcggtcc gcgacgtcgc 1180621 gctcagcccc gacggtgcca tcgcctacgt cgccagctgc ggctccgact tcggggcagt 1180681 ggtcgacgtc atcgacactc gcacccacca gatcaccagc tcgcgcgcga tcagcgagat 1180741 cggcgggttg gtcacccggg tgagcgttag cggcgacgcg gatcgcgcct acttggtcag 1180801 cgaggatcgg gtgaccgtgc tgtgcacccg tacgcacgat gtcatcggca cgatcaggac 1180861 cggccagccg tcgtgcgtgg tcgagagccc ggacggaaag tacctgtaca tcgccgacta 1180921 ctccggcacc atcaccagga cagcggttgc ctcgaccatc gtgtccggga ccgagcagct 1180981 ggcgctacag cgccgcgggt ctatgcagtg gttctcgcct gagctgcagc agtacgcgcc 1181041 ggcgctcgcc tagctcgaac gcgcttctcg ggggaacccg tttctcatga cttctcgcgg 1181101 cgatagcatt cgcccgagga ggacatgagg cgcgccgaga cccgtaaggc ggtacatcga 1181161 tgtacggcac gatgcaggac tttccgttga cgatcaccgc gatcatgcgc cacggctgcg 1181221 gtgtccacgg gcgacgcacg gtcaccaccg cgacgggtga gggctatcgg cacagtagct 1181281 atcgcgatgt ggggcaacga gctggccagc tggcaaatgc gttgcgccgc ctcggtgtta 1181341 ccggggacca gcgggttgcc acgttcatgt ggaacaacac cgaacacttg gtgacctact 1181401 tcgcggtccc gtcgatgggc gcggtgctgc ataccctcaa catccggctc ttccccgagc 1181461 agatcgccta tgtcaccaac gaggccgaag accgcgtcat tctggtcgac ttgtcattgg 1181521 ccagactgct cgcgccggtg ctgcccaaac tcgacaccgt gcataccgtg atcgcggtag 1181581 gagagggcga cacgacgccg ctgcgggaag ctggcaagac cgtgctgcgc ttcgccgaat 1181641 taattgacgc cgaatccccc gacttcgggt ggccgcagat cgatgagaac tccgcggccg 1181701 caatgtgtta caccagcggt actaccggca atcccaaagg cgttgtatac agccatcgtt 1181761 cgagctttct gcacacgatg gcggcctgca ccacaaacgg tatcggggtc gggtccagtg 1181821 acaaggtgct gccgatcgtg ccgatgtttc atgccaacgg gtgggggcta ccgtatgcgg 1181881 ccttgatggc gggtgcggac ttggtgctac ccgatcggca tctcgacgcc cgctcgctga 1181941 tccacatggt ggagacgctg aagccgacgt tggccggcgc ggtgccaacc atctggaacg 1182001 acgtcatgca ttacctagag aaggaccccg atcacgacat gtcatcgctg cgtctggtcg 1182061 cctgcggcgg atcggcggtt ccggaatcgc tgatgcgcac cttcgaggac aagcacgatg 1182121 tccagattcg gcagctgtgg ggcatgacgg aaacatcgcc gctggccacc atggcctggc 1182181 cgccacctgg caccccggac gaccagcatt gggcattccg catcactcag ggccaaccgg 1182241 tgtgcggggt ggagacccgg atcgtcgacg acgatggcca ggtgctgccc aacgacggca 1182301 acgccgttgg cgaggtggag gttcgcgggc cctggattgc tggctcgtat tacgggggac 1182361 gtgacgagtc caagttcgat tccggctggt tgcgcaccgg tgacgtcggc cgcatcgacg 1182421 agcaaggctt catcaccctg accgaccgcg ccaaagacgt catcaagtcc ggcggtgaat 1182481 ggatctcctc ggttgagttg gagaactgcc ttatcgcgca cccggacgtg ctcgaggccg 1182541 cggtcgtcgg cgttcccgac gagcgctggc aggaacggcc gctggcggtt gtcgtagttc 1182601 gggaaggggc caccgttagt gctggtgatc tgcgagcatt cctggcggac aaggtcgttc 1182661 gctggtggtt gccggagcgg tgggcgtttg tcgacgagat tccccgcacc agcgtgggca 1182721 agtacgacaa gaaggccatc cgttctcgct acgccgaagg tgcctaccag atcaccgagg 1182781 tgcacacttg acccgcgcga gcagacgcaa aatcgcccat tttcgtgtcg aaatgggggc 1182841 ttttgcgtct gctcgcgggt agaaaggtga ccatgagcct gcgggtcatt caatgggcga 1182901 cgggatcggt cggtgtggcg gcgatcaaag gcgtgctgca gcatcccgaa ctcgaactcg 1182961 taggctgctg ggtgcattcg gcggccaaga gcggcaaaga cgtcggcgaa atcatcggtt 1183021 caccaccatt gggcgtgatc gcgactaaca gcatcgacga cgttttggcg ctggacgccg 1183081 acgcggtgat ctacgcgcca ttgctgccca gcgtcgacga agtcgccgcg ctgttgcgtt 1183141 cgggcaagaa cgtggtcact ccgcttgggt ggttctatcc gagtgaaaag gaggccgccc 1183201 cactggaagt cgccgcgcag gccggcaatg cgacgctgca cggcgccgga attgggcccg 1183261 gggctgtcac cgagctgttc ccgttgctcc tgtcggtgat gtccaccggt gtgacttttg 1183321 ttcgctccga agagttttcg gatctgcgca gctatggagc gccggacgtg ctgcgctatg 1183381 tgatgggttt cggcggcaca ccggacagcg cgttgaccgg accgatgcag aaaattctgg 1183441 acgggggctt cctgcagtcg gtacggctgt gtgtcgaccg gttgggcttt gccgccgacc 1183501 cccagatccg cacttcgcag gaggtggcgg ttgcgaccgc cccgatcgac tcgccgatcg 1183561 gagtaattga gcccggacag gtggccggac gccgcttcca ttgggaggcg ctggtcgagg 1183621 acacagtggt cgtccagatc gccgtgaact ggttgatggg atcggaaaat ctggatcccc 1183681 cttggtcatt cgggccggcc ggagaacgct acgagatcga agtgcgcggc agcccggaca 1183741 cctgcgtcac catcaagggt tggcaaccgc agaccgtggc ggccggcttg aagagcaacc 1183801 ccgggatcgt ggcaaccgcg gcgcactgcg tcaacgcgat cccggcaacc tgcgccgccc 1183861 cggcggggat ccagagcttt ttcgacctgc cgctcatcac cggccgggcc gctcccgggc 1183921 tggcacgcta gagttgctgg cggcgtcccc ggccgggatg tcgagaatcg gacgggtaat 1183981 ccaatggcaa agtctgtcgt cgtcgagcaa tcgcgagcga ttccggtgca atccgaggat 1184041 gcgttcggtg gcacgctggc ggcagcgctg ccggtgattt gttcgcactg gtacggcctg 1184101 atcccaccaa tcaaggaggt ccgggatcaa acgggtgctt gggattctgt cggacaggcc 1184161 cgtgtcatca cgatggtcgg cggcgggcgc gtgcgcgagg agctgaccag tgtcgacccg 1184221 ccgcggtcgt tcggctacac gctcaccgac atcaagggcc cgttggcgcc gctggtcgcg 1184281 ttggtggagg gcaagtggag cttcgctccc gcggataccg gaaccacggt gacctggcaa 1184341 tggaccatcc atcctagatc ggcgctggcc gcgccggtgt tgccggtgtt cgccaggatg 1184401 tggcggggct acgcgcgcgg ggtgctcgag aagctttccg ctttgttggt gggctgagcg 1184461 gcgctgccgg cttcgtctac cgtcggggtc atgtgccgac tctttggctt gcactccgga 1184521 accgatgctg tcaccgcgac gttttggttg ctgaacgcct cggatagcct ggccgagcaa 1184581 agccgacgaa accccgacgg caccggcctt ggtgtattcg acgaacacca ccagccgcgg 1184641 ctacacaagc aaccaatagc ggcctggcaa gacgccgact tcgccaccga agcccacgag 1184701 ctgaccggca cgacgttcgt cgcccatgtt cgctacgcga cgaccgggtc gctcgacatc 1184761 cgcaataccc acccattcct gcaagacggg cggatcttcg cacacaatgg ggtggtcgaa 1184821 ggactggatg tcctcgacga acggctgcgc gaggtcggcg ccgatgacct ggtgttgggc 1184881 cagaccgact ccgagcgcgt attcgctttg atcaccgctt cgatccgcgc ccgggacggc 1184941 aacgaatcag ccggtctgat tgacgcgctg aggtggctcg cggcgaatgt gccgatctat 1185001 gccgtcaacg tgttgctcag caccgcgacc gatgtatggg cactgcggta tccggagtcc 1185061 cacgagctgt atatcttgga ccgccgcggc gacggtgcgc ccgagttcca cttgcgaagc 1185121 aagcgaatcc gcgcacactc gacgcacttg cgcgaacggt cgtcggtggt gttcgcgact 1185181 gaaccgatgg atgacaaccc gcgttggcgc ctgctggacg cgggggagct ggtccacgtg 1185241 gacgccgccc tgcgggtcaa caggagtctg gtgctacctg atccacccag acatccgatt 1185301 cgccgggaag atctcagcga gccggtactg catgcgcaac acacgtcggc gtgaactcgt 1185361 gacaactaga cgcgcgctgg tattggccgg cggaggactg gccggaatcg cctgggaaac 1185421 aggtgttttg cgcggcatcg cggacgaatc gccggcggcg gcccggctgc tactggattc 1185481 ggatgtgttg gtcgggacat cggccggtgc aacggtcgcc gcgcagatca gcagtggctg 1185541 cccgctcgac acgctgtacg aacggcagct cgccgagacg tcggccgaga tcgatcccgg 1185601 tgtcgacatc gatgccatca ctgatctttt cctgactgcc gtgaccgagc cgcacatttc 1185661 gacgcgccgg cggctacaac ggatcggtgc cgtggcgttg gcggtcgaca ccgttccgga 1185721 gtccgtccgc cgtcaggtga tcgcccagcg cttgccgtcg cacgactggc cggaccgggt 1185781 gttgcgggtc accgcgatcg acatcgccac cggcgaattg gttgttttcc atcgcgagtc 1185841 gaatgtggcg ctggtcgacg cggtggcggc cagttgctcg gtgccggggg cgtggcctcc 1185901 ggtgacaatt gccggccgcc gctacatgga tggcggggtg gccagctcgg tcaaccttgg 1185961 tgtcgccgac gattgtgatg ccgccgtggt tttggtgccc gccggcgccg acgcgccgtc 1186021 gccctttggc ggcggggcgg ccgcggagat cgcggcagcc accggcatgg tgtttgccgt 1186081 gttcgccgac gacgactcgt tggcggcttt cgggcccaac ccgctggatc cgctctgccg 1186141 tgtgaactcg gcgatggccg gacgtcagca gggccgccgc gaagcgcaag ccgttgccag 1186201 gctgctcggc gtttgatcag ccctcgatgg tcgcagcggc agattcgtcg tcgtcgatct 1186261 cgaatgcttc caaggcttgg gtggccagcg cgcggccgac ggcgatcacc tccaccgcgc 1186321 ggtgaaattc caggcttcgg cacgttgaac gcggtacctc gatcagcagg tcggccggat 1186381 agcccgccag cgtatggcgc gccagtgcgg attgggcgat atcgatcgtc cgattcatca 1186441 cctcgaaact gcccattttg ggtagcccgg gtgtgtcagc ggcttcctcg cggtcagctg 1186501 gtgggccggc tggacgctgc tcgatctccg gagcttgcga ccaggaatcc gattccgcgg 1186561 ccgccgcgcc gaagcgactc agcaccgccc gcgccgtagg ccggtcgagc agcgaccggg 1186621 cggcgctgac gtcaaacagc gcggaagtgc tgcgcaccat gcggttcaac cactcggcgg 1186681 tgacgttggg ctccgcatcg cgagcggggc cggcctcact gccgttaagg ctgaccgcga 1186741 tggtcaggtc ggcgttgacc ccggcgatcg gcgccatcgg cagtggatcc aggattccgc 1186801 cgtcggccag caggcgtccg tcgacttcgt gtggggcgat caccccgggt atggcgatgg 1186861 acgcccggat cgccgcgtcg agggggccgc gctgaaacca caccgacttg ccggccagta 1186921 ggtcggtggc caccgcggta taggggatcg gcagctgctc gatggcgacc gggccgacga 1186981 tgtcgcgcac cgcgtcgaga atcttttctg cccgcaggat gccggccgcg ctaatagacg 1187041 gatccagcag ccgcaagatg gtgcgctgcg tcagggactt ggcccagtgg gcgaactcgt 1187101 cgagtcggcc ggccgcatgc accccaccga ccaccgcgcc catcgacgag ccggcgatcc 1187161 caacgatgtc atagccgcgc tcccgcagcg cctggatcac tccgatgtgg gcgtaacccc 1187221 gggcgccgcc gctgccgagc gccagtgcga cgcgcggcga agacgaccct cgcacccgga 1187281 gggcagctgg tgcgggcatg ctttcattct gctcggcgag gtgcccttat cgggatccgg 1187341 ccactagttt cttgcacccc tgatctcaat tgccgagcgt tatccgcatt ccgcgttggc 1187401 ggcggcgcgc gccgcgacga tcacggccgc ctgccgtgcc ggggtcagcg ccgcccagcg 1187461 gatgtgccag ctgccggcaa ctccggatgg cgatgcttgg accaccgcca gatacggctc 1187521 gattaacgac tcgccggagc cgggctgggc atccatccac agccttgcgg cgtgacaggc 1187581 ctggtaatac tcctcctcgg tcgattccgc gggagcgtca accctggtgg tcacgccggc 1187641 cggggagacg ccgacgacgc ccgccggcaa cgtaccggca acgcttgacg aacgtccggc 1187701 tttgctgctg ccgccgcgag agcagccggc aacggccgac aaccacgcca gcgccaaaac 1187761 cattgcgcac agcagggggg cataacggct cgggcgcacc gtcccaatct atgcaagact 1187821 gaccgcgtga tggagcgcta cggattttgt gggtgttgtc ggccctgacc tgccgtccgc 1187881 cctgtccgtt cgactctttg gagttctccc gtggttatgc ctcttgtcac gccaaccacc 1187941 gcggttccat caccgggacc cacacggctg cgtgtagccg atctcctgcg cgccaccgac 1188001 caagccgcag acgacgtgct tggcgggcgc tgcgaccacc tgctacccga cggtggtgtc 1188061 ccgcagacgc agcgctggta cacccgcatc cacggtgacg aggagctgga tatctggctg 1188121 attagctggg ttcccggtca accgaccgag ctgcacgacc atggcgggtc cctgggagcg 1188181 ttgaccgtgc tgagcgggtc gctcaacgaa tatcgttggg acggccgtcg gttgcgacgg 1188241 cgccgcctcg atgccggtga tcaggcaggg ttcccgttgg gttgggtgca cgacgtggtg 1188301 tgggcgcccc ggccgattgg ggggcctgat gcggccggga tggctgtggc gccaaccctg 1188361 agcgtgcacg cctactcgcc gccgctgacg gcgatgtcgt actacgagat caccgaacgc 1188421 aacacgctgc gccgccagcg caccgaattg accgaccagc ccgaagggtc gggatgagcc 1188481 gaatcgaccg ggtgctggag gccgctcgcc gccggtatcg gcgccttgcg gccgaccagg 1188541 tgcccgaggc ggcgcggcgc ggcgcggtgc tcgtcgacat ccggccccaa gcccagcggg 1188601 cccgggaggg cgaggtgcca ggggcgctag tgatcgagcg caacgtcttg gaatggcgct 1188661 gcgatcccac cagcgacgcc cggctgcccc aggccgtcga cgacgacgtc gagtgggtga 1188721 tcctgtgctc ggagggctac acctcgagcc tggcggcagc gtcgctgctg gacttggggt 1188781 tgcaccgggc caccgatgtc gtcggtggct atcgtgcgct ggcggccggc ggcgtgctgg 1188841 ccgagcttgg tggtgccgtg ggcgggtagt ttggctcgcc gctgctggct gggtcgttac 1188901 tgccccggcg tgccggcgtt gccgaagatg agtcctcgag ttccgccggc gccgccggcg 1188961 ccgtcgagtc cggcgatcag gccggcgcca cccttgccgc cgttgccgcc gttgccgccg 1189021 tcaccgacca actgggcgtc gccgcccttg ccgccgttgc cgccgttgcc gccgttgccg 1189081 tcgacaccgc cggcggcggc gcccagaccg ccttggccgc cgcccccgcc gttgccgccg 1189141 gtgccgccgc cgagcaggcc ggcgccgccg ccgttgccgc cgtgaccgcc cgcgtgaccg 1189201 ctgccgccgt taccgccggc ggcttgaagc ccggtcggcg ggttggtgcc gccgctgccg 1189261 ccgctgccgg cggtgctgcc cgttccgccg gcgccgccgg cgccgccgcc accgaacagc 1189321 ctggcggccg atccgccgtt gccgccgttg ccggcgttgc cggtgtcccc gccgttgccg 1189381 ccgataccgg gattgatggc cagaccgttg ggggtgtcgc cgccctttcc gccggcgccg 1189441 ccggctccgg cgctgccgcc gctaccgccg gcgccgccgt tgcccgacag ccagccggcc 1189501 gacccgccgg tgccgccggc cccggcgttg ccgccgacac cgccaccgcc accgttacca 1189561 cctagtgcgg cgttgagccc ggtgccgccg tcgcccccgg agttgccggc gccgccggcc 1189621 ccgccgttgc cgccggcccc gccgttgcca tacagcagcc caccgccacc accggcgccg 1189681 ccgccgccgt cgccgccgac accacccgta ccacccttac cggcggtggc catgacatgt 1189741 tcgccgccgg cgccggcggc cgcgccgttg ccgccggcgc cgccgtgccc ggcgttacct 1189801 ccgtgaccga acagcacggc gcctcgtccg ccgttgccgc cggcgccggc ggtgccgccg 1189861 gtgccgccgg tgccgccgtc tccaccgaat tggccgccgt tgccggcacc gccggcggtg 1189921 ccgccgccgc cgccgttgcc ggcgtcaccg ccgttgccgg acagccagcc ggccgacccg 1189981 ccgtcgccac cgcggccacc ggcgccgccc gcaccaccgg cgccgcccgg ttggctgggt 1190041 gggccggggg cgccgggact ggcttgtccg ccggccccgc cggcgccgcc gtcaccgccg 1190101 gcgccgccgt ggccgtggat ccagccgccg gcaccgcccg ccccgccggc gccggcgtca 1190161 ccgcccttgg tgccgctggc cccggcgccg gcaccgttgc cgccttgtcc gccgtcaccg 1190221 ccgacgccgc cgacaccgcc gttgccgaac aatccggccg ttcccccggc cccgccggca 1190281 ccacccgcga cgcccggcgc gccgatggct ccggcggggc cggcgccgcc ggcgccgcca 1190341 ttgccgccgc tgccgtagag ccagccgccg ttgccgcccg cgccgccgtt ggcgccggct 1190401 ccgccggccc ctccgttgcc gccgttgccg atcaacccgg ccgacccgcc ggtacccccg 1190461 gtgagcccgg cggtggtttg ggaaaacccg ttgccgccgt tgccgtacaa caacccaccg 1190521 gcgccaccgt tgggattggc cgcggtccca tcggcgccgt tgccgatcag cggacgcccc 1190581 cacagcgcct gggtgggcgc gttgatcaaa cccagcacct gctgctcgac attggtcgcc 1190641 tcggcgctgg catacgcgct cgccgccccg gtcaatgcct gcacgaactg ctcgtgaaac 1190701 agcgctgcac gcgcgcccag ctgttgatac tggccggcgt gggcggaaaa cagtgccgcg 1190761 accgccgcgg acacctcatc ggcaccggcc gcggccagca ccgacgtcgg ggccagggcg 1190821 gccgcgttgg ccgcgctgat tgccgaaccg ataccggcca catcggccgc cgcggccatc 1190881 agctgcgacg gagacaccaa cacaaacgac acggtttcct ctccctgatt tgctgatatg 1190941 tagttgcgat gttaactagc gcacaccgca actggggcgg ttttccgcca ttgtctggtc 1191001 gcacgtatac atttttgtga attctttgag cggaattgct cgtgcgatcc ggctacgttt 1191061 tcgaggtgag atctgggtgg gcggcgatgc cccgtgcttc gatgatcaat ttggggatct 1191121 gaaatgtcaa atgtgttgac attcattggg tgatctttcg cgccacccgg cgacgtcaaa 1191181 tacttggaca taagccactc gtcgttgtgt gatacgtcgt cacaccggat ctggccgtgc 1191241 gggtttattg cccgggcgtg ccggggttgc cggagatctg cccgcgacta ccgccggcgc 1191301 ctccagtgcc gttgattccg ggcatcaggc cggtgccgcc tttgccaccg ttgccgccgt 1191361 taccgccgtt accgatcaac tgggcgtcgc cgcccttgcc gccgtcgcca ccgttgccgc 1191421 cgttgccttt ggcgccgctg ccggcgccca gaccgccgtt gccgccgtcg ccgccggtgc 1191481 cgccgctgcc gagcaggccg gcggtgccgc cgctaccgcc ggcaccgccc gcgtggccgt 1191541 tgccaccgtt gccgccggtg ccgccgccga agccgccgcc accgccggtg ccgccggtgc 1191601 tgcccatccc accggcgccg ccggcgccgc cgtcgccgaa cagcttggcg gccgatccgc 1191661 cgtggccgcc gttgccggcg ttgccggtgt ctgcgccctg gccgccgtta ccgggatcaa 1191721 taccgctgtt gccgttgccg ccttggccgc cggcgccggc ggtgccgccg cctccgccgg 1191781 tgccgccgtt gcccgacagc cagccggctg acccgccgtt gccgccgttg ccggcgttgc 1191841 cgccgacacc accggcgcca ccgtcaccac cttgcgcggt gccagacccc gcgccgccgt 1191901 cgccgcctct ggcgccggcg ccgccgctgc ccccggtgcc gccggccccg ccgttgccat 1191961 acagcaggcc ggcgtcgccg ccggtgccgc cggcaccgcc gccgccgcca ccacccgtac 1192021 cacccttacc accgacggct gtgatgctgc ttccgccgtt tccgctgacc gcgccgttgc 1192081 cgccagcgcc gccgtggccg gcgttaccgc cgtggccgaa cagcacggag cctcgtccac 1192141 cgttgccgcc gttgccggcg ttgccgccgc taccaccgtt gccaccgtca ccaccttgtg 1192201 cggtgctaag cccgggggca ccgatcccgc ctttgccggc ggcgccgccg ttgccgccgt 1192261 tgccgccggc cccgccgttg ccatacagca gcccgccgtc gccgccggtg ccgccggcac 1192321 cggcgccgcc gccgccacca cccgtaccac ccttaccgga aatgcctagc tgagtgtctg 1192381 cgccgtttcc gccggcggcg ccgttgccgc cggcgccgcc gtggccggcg ttaccgccgt 1192441 gaccgaacag cgcggcgccg cgtccgccgt tgccgccgcc gccagcggtg ccgccggtgc 1192501 cgccgttgcc gccgttgccg ccgaagatgc cgccctctcc gccggcaccg gcgttgccgc 1192561 cgccgccgcc ggtgccggcg tcgccgccgt tgccggacag ccagccggcc gacccgccgt 1192621 cgccaccgcg gccgccggcg ccgcccgctc cgccggcggc gccggtgtcg ccgggctcgc 1192681 cctggacgcc gtccccgccc tgaccgccgg ttccgccgtc gccgccgacg ccgccgtggc 1192741 cgtggatcca gccgccggca ccgcccgccc cgccggcgcc ggcgtcaccg cccttggtgc 1192801 cgtcggcccc gccgatgccg gcgccgcctt gtccgccggc cccgccggca ccgccggcac 1192861 cgccgttgcc gaacaatccg gccgatcccc cggccccgcc ggcaccaccc gcgacgcccg 1192921 gcgcgccgat ggctccggcg gggccggcgc cgccggcgcc gccattgccg ccgctgccgt 1192981 agagccagcc gccgttgccg cccgcgccgc cgttggcgcc ggctccgccg gcccctccgt 1193041 tgccgccgtt gccgatcaac ccggccgacc cgccggtacc cccggtgagc ccggcggtgg 1193101 tttgggaaaa cccgttgccg ccgttgccat acaacaaccc accggcgcca ccgttgggat 1193161 tggccgcggt cccatcggcg ccgttgccga tcagcggacg cccccacagc gcctgggtgg 1193221 gcgcgttgat caaacccagc acctgctgct cgacgttggt cgcctcggcg ctggcatacg 1193281 cgcccgcact tgacgtcagg gccagcgtga actggtcatg aaacgctgcc atctgccggg 1193341 cgatcgcctg atagccctcg ccatgtccgc taaacagtgc cgcaatgtgg gccgacactt 1193401 cgtcggcggc ggccggcaac aacctcgtcg tcgcggccgc ggccgccctc gttgaggcat 1193461 tgatcgacga tccgatgctg gccaaatccc cagccgccga gctgagcatg tctggcaccg 1193521 caatcatgta ggacatttcg cgcatctccc tcatcgccgg gcgacggata tcgggaccgg 1193581 agtcaacgtg atggcgcgag tctaagcacg cccggaacgg aaatgcagag tgttcgacaa 1193641 atctttcccc aagacatttt tattggtcgc acgatgggcg tcgtcgtcga gcggtatggc 1193701 agcaccgatt tgtcttccag gggaatgttc gtaccgtttc atgacgtcga ctgtgtccaa 1193761 tagctttaca tttcccgttt ttatttgctg atgatgtcta acacctagac aaacaccgtc 1193821 ttgtcgtcca tcgatatggg ctcgggctag ccgccacgcc gacggcgcac gccaaaccgg 1193881 ccgacccgct gcccgcccta cgagccgaag ggcttggcgt tggcgtgcag caatggctgc 1193941 agccgctccg tcttctgctg tgtccagccg ggcggcgaga gcaccgcggc ccagccgtcg 1194001 gccacggtgg cgacgtagcg gtgaccatgc ccgtccggca catgcgtagc caccgccata 1194061 tcggccgaaa cctggacgaa cgtcaccacc gggatccagc gtgtctgggg aaggacgtcg 1194121 tagccccgtt gctcccgcag ccagtccggc tcccgaaaca gcaggcgggg agtccaccag 1194181 gcgatcggat cagaggcatg ctgcagatac accacccgcg gtctgcccca cggcgcatca 1194241 gggcgttgca ggtcgcgtgc gcgggccacg aaacgcacgt tgcggccgtc gtcgtagatg 1194301 ggcagccact gcggtgatcc ggcatcgcgg ttcgcagtca aggagttcca aacggtgttg 1194361 ttgaacgtcg gtccgctgaa caacgcgccg tcggtgcggg cgaggatgtt gttgaggttc 1194421 atgaacggcg cttcaccgcc gaacgatccc aggctctcgc cgaacacgac cagcttcggg 1194481 cgctgcgact cgggcagttg acggatcagc ttgtcgaccg cctcgaacag cgcctcgccg 1194541 gcgtgccggg cattctcctt gtccaccagg aaagacagcc agctcggcaa gaacgaatac 1194601 tgcatgctca cgatcgcggt atcgccgttg tacatgtact ccagcgcgga ggcttccgcc 1194661 tcgttgatcc aaccggttcc ggtgctcgtg gccactgcca caacggcgcg gcgcaagcca 1194721 ccggtgcgcg ctagctcgcg cgccgccagc tccgcggtgg ccatgatgcc gtccgccgag 1194781 ttcaaccccg cataggttcg gatcggctcg acggccgggg tgccgttgaa cgcggtgagg 1194841 tcggcgatgg tgggaccgct gtggacgaaa attcggccct gatggcccag cgactcccac 1194901 gacaccagcg atcccgggcc acccgatcgc agcggggttt tcggcggtgc cgaatccgga 1194961 ttcatctcat tgttgaccgc agcgaacgtg ctgttcatgg aattcatcgc gaacttgagc 1195021 accacaccgt tgagcagtgt gatggtcagc accacgagca gcaccaccac aatggccgcc 1195081 gaaactcgga atggcgcaat gcgatcgacc tgtcccacca gaaaacggaa cagccatcgg 1195141 atgaactggc cgatttcgac cagcgtgaac agcacgacca gcgacaatgc ggcggccagc 1195201 gggtagtcgt accaccgcag gtgctcgaca cccattaggt cgcgcacatc gtcttgccag 1195261 acatgaaact gcactgccat acccaccatg ccgaccgcgc cgactgcgat cagcggcggc 1195321 cacgcccagc gtggtggcgg cgggctggaa ttgtgcgagc gcatgtagcg gaccagccag 1195381 acggcgaaga ctcccaagcc gtatccgaag gcgccgcaga ttccgctgac cagtccctga 1195441 aacagcggac cacgcggcag cagcgacggc gtcatcgaga accacacgaa aacgaggccc 1195501 atcgcggtgc cggtgaatgt gtagtggcga atccaccaag tgctgcggat cggttgcggt 1195561 tcaggggttt gtggagttgc tgcggtgtcg accgcctgct cagcgccggt agctggttcg 1195621 tcgctggcgt tggtggtcgt cgctgcagcc ggttccgtca tcggtgggtg aactggggag 1195681 cgcgtttctc gatgaacgct gccatacctt cggattggtc ttcggtcgcg aaagccgaat 1195741 ggaaaagccg gcgttcgtag agcagcccct cggacaaact ggattcgaaa gcccggttga 1195801 cggcctcctt ggccatccgg gccgccgagg ccgacatctg cgaaatggtc gtggcagtgg 1195861 ccctggcttc ggtcagcaag tcgtcggccg gcaccacccg tgaaaccaga ccgctgcgct 1195921 cggcctcggc ggcgtccatg gtgcgcccgg tcaggatgag gtccatcgcc ttagccttgc 1195981 cgatagcccg ggtcagccgc tgggagccgc ccatgcctgg cagcacgccc agctttatct 1196041 cgggctgtcc gaacttcgcg gtgtcggcgg cgatcagcac gtcgcacatc atcgccagct 1196101 cgcagccacc gccgagcgcg tatcccgcca ccgcggcgat cgtcggggtg cgcacggcgg 1196161 ccagcttgcc ccaggtggcg aagaagtcgg cggtgaacgc gtcggcgaac gtcaggtcgg 1196221 ccatttcttt gatgtcggct ccggcggcaa acgctttggc cgaaccggtg atgatgatcg 1196281 ccccaatgtc cgggtcatcg tccagttcgg ttgcagcgct ggtgacctcg ttcatcacct 1196341 ggctgttgag cgcgttcagt gcctggggac ggttcagcgt gataatgcca actcgctgat 1196401 cgcgctcgac caggatggtt tcgtacgtca tgcgctacct ctctagaaac tcaagtcatc 1196461 gtcgaccggt tcgaaatagg cttcgatgtc ggccgccgtg atcgcgtcca gggttgccgg 1196521 cgaccagttc gggttgcgat ccttgtcgat caactgcgcg cggatgccct ccaccaggtc 1196581 atgcgagcgc agcgacgccg atgacacccg atagtcctgg atcaacacgt cttctagcgt 1196641 gtcgagtttg gcggcgcgac gcactgcctg caacgtcacc gacagcgcga tgggggagcg 1196701 gctggcaatc aggtcggaag catttacggc tggttcgccg ccctgtttcc gcagcgccgc 1196761 aacgatgtcg gcgacgctgt cgccggcata gcattcgtcg atccaatcac gttgggcggc 1196821 aagcgtgctc ggtggaggtt cgacggcgtg ggcggccaat gcgctctcca cgccgccggt 1196881 gacgatcttc tgcgtgaacg catcgaggtc gccgtgtggc acgaagtggt cggcgaatcc 1196941 cagcgcgatg gcgtcggcgc cggaaaacgg cgctccagtc agggcggcgt gcagacccag 1197001 cgcgccgggt gcacgcgaca gcaaatacac cccgccgacg tcggggatga acccgatgcc 1197061 cacttcgggc atcgcgacct tggaggtatc ggtaaccacc cgggtgttcg cgtgtgcgct 1197121 gacgccgacg ccgccgccca ttacgatgcc gtccatcaac gccacgtagg gcttggcgaa 1197181 ccggccgatc agggcgttga gcagatactc gtggcgccag aaccgccgcg cctcgacccc 1197241 gtccttgcgg gcactgtggt agacggccac cacgtccccg ccggcgcaaa gtccgcgttc 1197301 gccggctccg gagagcacca ccgcgtgcac cgcgtcctca tgctcccagc tcatgagcac 1197361 tgtggccagc aggtcgacca tggtttggtt cagtgagttg atcgccttgg ggcggttgag 1197421 cgtcacgaat ccgacaccgc cctcgacgtt tgtcaggacc tcatgcgatt cgccggtcac 1197481 gggcctcgcc tcccctgaag agtttgacca gcaatctaga tcgtggctcg cccagcggtg 1197541 cccgcggggg ctaaggttta tcgtgtaccc ggatgacaac gctggccggg aacccgggcc 1197601 tactactgat cgttgagcgg atgttcgcac agctcgtagc catagccatc aagagaggat 1197661 ccgacggtgc gggagacaag caacccggta tttcgttcgt tgcctaagca gcggggcgga 1197721 tacgcgcaat tcggaactgg caccgcccag cagggattcc cagccgatcc ctacctggcg 1197781 ccctatcggg aagcaaaggc cacccgcccg ctgaccatcg acgatgtcgt gaccaagacg 1197841 ggcctgacgc tggctatgtt ggcgggcacc gccgtcgtct cctacttcct ggttgcgtcg 1197901 aacgtcgcac tggccatgcc gctgaccttg gtgggggctt tgggtggttt ggcgctggtg 1197961 ctggtggcca ccttcggccg caagcaggac aacccggcga tcgtgctcag ctacgcggcg 1198021 ctcgagggcc tgttcctggg tgccatctcg ttcgtcttgg ctaacttcac ggtggcgtcc 1198081 gcgaatgctg gggtgctgat cggggaggcc atcttaggga cgatgggtgt gttcttcggc 1198141 atgctcgtcg tctacaagac aggggccatc cgggtcaccc ccaagttcac ccgaatggtg 1198201 gtcgctgcgc tgttcggcgt gctggtcttg atgctcggca acctcgtgct ggcgatgttc 1198261 aatgtcggcg gcggtgaagg cttgggctta cgcagccccg gaccgctggg gatcatcttc 1198321 tcgctggtgt gcatcggcat cgcggcgttc agcttcctga tcgacttcga tgcggctgat 1198381 cagatgattc gcgcgggagc accggagaag gcggcatggg gcgtcgcgtt aggcctgacc 1198441 gtaacgctgg tctggttgta catcgagatc ctgcgcctgc tcagttatct acagaatgag 1198501 tagcgctcgt tggccgttga ttctgcgtcc accaggctga ccactcgcac ttttgcgtgg 1198561 tagacgcagg atcaacggct gtgtcggtgg gtgctgacac catgcccgca tgcgggagat 1198621 gggggcgcag ccgttcatcg gcagcgaggc gttggcggcg ggactcatca gctggcatga 1198681 gctgggcaag tactacaccg cgatcatgcc caacgtctat ctggacaagc ggctgaagcc 1198741 ctccctgcgg caacgcgtta tcgcggcctg gctgtggtcg ggccgcaaag gggtgatcgc 1198801 cggcgcttcg gcatcagcgc tgcacggcgc gaaatgggtc gatgaccacg cattggtgga 1198861 gttgatctgg cgcaacgcca gggcgccgaa cggggtgcgg actaaggatg agctactgct 1198921 cgacggcgaa gtccagcgct tgtgcgggct tactgtgact accgttgaac gtacggcctt 1198981 cgacttgggc aggcgtccac ccttaggtca ggcgataacc agactggatg cgcttgccaa 1199041 tgccaccgat ttcaagatca acgatgttag ggagctcgcg aggaagcacc cccatactcg 1199101 cgggctgcgt caactagaca aggcgctgga tctcgtcgac ccaggtgcgc agtcgccgaa 1199161 ggagacgtgg ctgcggctct tgctgataaa cgccggcttt ccacggccgt ccactcagat 1199221 ccccttgctc ggcgtctacg ggcatccaaa gtatttcctc gacatgggat gggaggacat 1199281 catgctcgcg gtcgagtacg acggcgagca acaccgtctc agccgagacc agttcgtcaa 1199341 agacgtcgaa cgcctggaat acatccggcg cgccggctgg actcacatca gggtgctggc 1199401 agaccacaag ggacccgacg tcgtccgccg ggttcggcag gcttgggaca cgttgacatc 1199461 acgacgttga ctctgcgccc accacgtgtc ctactcgcac ttttgcgtgg tggacgcaga 1199521 gtcaacgcga tcgagcgcct cgctcacgcg aggcgctcga tcaccatcgc catgccctgg 1199581 ccgccaccga cacacatggt ttccagaccg aacgtcttgt cgtaggtctg caggttgttc 1199641 aacagcgtgg tggtgatgcg cgcacccgtc ataccgaacg ggtgacctag ggcgatcgcg 1199701 ccacctgaga tgttgagctt gtcctcgtcg atgcccagct cgcgcgccga gcccaggacc 1199761 tgcaccgcga aggcctcgtt gatctcgacc aggtcgatgt cggtgatcgc catcccggct 1199821 ctttccagcg ccttcttgga cgcctcgatc ggccctaagc ccatgatctc cggggacagc 1199881 ccgctgaccc cggtggacac aatgcgcgcc agcggtgtca agcctaattc cttggccttg 1199941 gtgtcgctgg tgatcaccac cgcggcggcc ccgtcgttga gcggacaggc attccccgcg 1200001 gtcacggtgc cattcggccg gaaagccggc ttgagctcgc tgaccttttc gtaggtggta 1200061 cccggtcgcg ggccgtcgtc ggtgctgacc gtggtgccgt ccggaagggt gaccggcgtg 1200121 atttctcgtt cgaagaaccc gttcttgatc gcctcttcgg cccggttctg gctgcgcacg 1200181 ccccagcggt cctgttcttc gcggctgatg ccggtcatga tggcgacgtt ttccgcggtc 1200241 tggcccatcg caatatagat gtccggcagc ttctgatcgg tgcggggatc gtgccattcg 1200301 tcggcgccgg cggctgccgc ggccgaacgt tcctgagccc cgtcgaacag cgggttcttg 1200361 gtgtccggcc aggagtcgga gtttcccttg gcgaaccggg agacggtttc cacgcccgcg 1200421 gagatgaacg cgtcgccctc accggccttg atcgcgtgga aggccatccg ggtggtctgc 1200481 agcgacgacg aacagtaccg gttgaccgtg gtgcccggca ggaagtcata gccgagcgcg 1200541 acggcgacga cacgggcgat gttgaaaccg gactcaccgc ctggcaggcc acagcccatc 1200601 atgaggtcgt cgatctgatg ggggttcagt gccggaacct tgtcgagcgc ggcgcgcacc 1200661 atctggacgg ccaggtcgtc gggccgcatg ccgaccagcg atcctttcat ggcccggcca 1200721 atcggcgagc gggcagtcga gacgatgaca gcttctggca tgacggctcc cggcatggac 1200781 aagacgtggt gaagtttagg tcaaatgtag tcgctaccca ccggtcggca cggcccgggc 1200841 cggccggggc cgccgcagcc gcgacatcat gctgtgtcgc gtgtggcccg gctcgagggt 1200901 ggccgttcca ggccgggacg gcgtttcatg aattgggata tcgagctttt cggtcagcgc 1200961 atcgcgcagc gcaaggaaca acagatcggc cgccagggcg tacgcgggcg ccgacgggtg 1201021 gtagcggtcg gcggagaaca tcagctcggg cattgcccgg aatttgggag ccagtagatg 1201081 tcctagcggc accggcaccc caccggccgc cttgacggct gccgtttggg cgcgggccag 1201141 ccgcacacca cgggtgtgcg ctagcgcgcg cagcggctgc gggatggcgg taatgacgcc 1201201 gaggtcgggg caagtgccga ccaccactac cgctccgcgg gtgcgcaacc tgcgtacgca 1201261 gtcggccagc cgttgcgcag aggggccaat gccgttgagt gccgttatgt cgttggcgcc 1201321 aatcatgatt accgccgcat ccggcggcgg accgaccacg aacatcgcat cgacttgacc 1201381 gcagacgcct ttcgaggtgg cgccgacgat ggctttggtg ctcagccgga tccgcttgcc 1201441 ggtctgctcg gcgagtccgc gggcgatcaa cacgcccggt acttcctcag cgctagcgca 1201501 gccgtatccc gtcgccgtcg agtcaccaaa gatcatcagg tgcacgtcga agggcacttc 1201561 gcgtcgccac cgttgcacgg gcccaccgcc gcgggtgtat acgccgtcgg cgcggggcgg 1201621 tgcgtcgaag gatttgggaa ttaccgtgcg cgcgtgggtc gcctgaccga ccagcaggtt 1201681 gcgtgcgccc agataggccg tgcccgtcga ggcgagtgca cccgcggtgg ccaaagcgat 1201741 cgtggaacgc cgtggcacgc gcatgctcac gggatcagtt taggacggtt gtgccgattt 1201801 cgtgggtagc tgacgaacaa acccgtcacg gtgtggacca aatgtggtat cgaatcagac 1201861 tctttggctg tggcacctaa aaaagactgt caagctaagt tcgcggggtt ggctgagcca 1201921 gaggctcagc cgcttcgtca catgctgtat cggactacaa cggcgtagga agtgttgggc 1201981 atgactgcac ccagtaaggt atccggctca cccagagttg tcatttcgcc gcgcgacgtg 1202041 ttgaaggcac gtagactcga ggcacgcaag tttgcgatca gcgacggcgc cccggtggag 1202101 gtcgtcgagt ctggtccaag tcttgttgcg cgattagctg cgctggcgtc acgagtggcg 1202161 gtccggccgg tgctagcggt cggtagctat cttccgcatg cgccctggcc gtggggtgtc 1202221 atcgaccagg ctgcccgggt tctgctccca gcgtcaacga ccgtaagggc cgcggtgagc 1202281 ctgcctaatg cgtccgccca actggttcgg gcgtcgggtg tgttgccggc ggacggcact 1202341 cgacgcgccg tcctgtacct gcacggcggc gcgtttctga cgtgtggagc aaactcgcat 1202401 ggacgactcg tcgagttgct ctctaagttc gctgactcgc ctgttctggt ggtcgactat 1202461 cggttgattc ccaagcactc gatcgggatg gcgctcgacg actgtcacga cggctaccgg 1202521 tggctgaggc tgttgggcta tgagccggag cagatcgtgc tagcgggcga ttccgcgggc 1202581 gggtatcttg cgctcgctct cgcgcagcgg ctacaggaag tgggggagga gccggcggct 1202641 ctagtcgcga tctcgccact gctgcagcta gcaaaggaac acaagcaggc gcatcccaac 1202701 atcaaaaccg atgcgatgtt cccggcaagg gcgttcgatg cgcttgacgc attggttgct 1202761 agcgcagcag cgaggaacca ggtagacggc gaacccgaag agctctatga gcccttggag 1202821 cacatcacac cggggctgcc gcggacactg attcacgtgt cgggctccga ggtattgctg 1202881 cacgacgctc agttggcggc ggccaaactg gcggcggccg gggtgccggc cgaggtccgg 1202941 gtatggccgg gccaggtcca cgactttcag gttgcggcgt cgatgctgcc cgaggcgatc 1203001 cgctcgttgc gtcagatcgg ggagtacatc cgcgaggcca ccgggtagcg ggatgccgac 1203061 ggagcgcgtg tgcctggccg gcaggcgcct gagacgatga acgcatgcgg atcgcgcaac 1203121 atatcagtga actcattggt ggtaccccac tggttcggct gaactccgtg gtacccgacg 1203181 gcgccggaac cgtggccgca aaggtcgagt atctcaaccc tggcggcagc tccaaggatc 1203241 ggatcgcggt gaagatgatc gaagccgccg aggccagcgg tcagctgaag ccgggtggca 1203301 ccatcgtcga acccacgtcc ggcaataccg gcgttggtct ggcgttggtc gctcagcgcc 1203361 gcggctacaa gtgcgtgttc gtctgcccgg acaaggtcag tgaggataaa cgcaatgtgt 1203421 tgatcgccta cggcgccgag gtcgtggtgt gcccgacggc ggtcccgccg cacgatccgg 1203481 ccagctacta cagtgtgtcg gaccggttgg tccgtgatat cgacggtgcc tggaagcccg 1203541 accagtacgc caacccggag ggaccggcaa gccattatgt gaccaccggc ccggaaatct 1203601 gggccgatac cgagggcaag gtcacccatt tcgtggctgg catcggcacc ggcggtacca 1203661 tcaccggcgc tggccggtac ctcaaagagg tgtccggggg ccgagtacgc atcgtcggcg 1203721 ccgacccgga gggatcggtc tattcgggcg gtgccggccg accgtatctg gtcgaggggg 1203781 tcggcgagga tttctggccg gcggcctatg acccgagcgt gcccgacgag atcatcgcgg 1203841 tgtccgactc cgactcgttc gacatgacca ggcggctggc ccgcgaagag gcgatgttgg 1203901 tcggcgggtc gtgcgggatg gcggtggttg ccgcgctcaa ggtcgccgag gaagccgggc 1203961 ccgacgcgtt gatcgtcgtc ctgttgcccg acggcggccg gggctacatg tcgaaaatct 1204021 tcaacgacgc gtggatgtcg tcctatgggt tcctgcgcag ccgccttgac gggtcgaccg 1204081 agcaatccac cgtcggtgat gtgttgcgcc gcaagtccgg cgcgctgccc gccctggtgc 1204141 acacccatcc gtcggagacc gtgcgcgacg ccatcgggat tcttcgcgag tacggggtgt 1204201 cgcagatgcc ggtggtcggc gccgagccgc cggtgatggc cggcgaggtc gccggtagcg 1204261 tctcggaacg cgagctgctc tcggccgtgt tcgagggccg cgccaagttg gccgacgccg 1204321 tgtcggcaca catgagcccg ccgctgcgga tgataggcgc cggtgaattg gtcagtgcgg 1204381 ccggcaaggc gttgcgtgat tgggatgcgt tgatggtggt ggaggaaggc aagccggttg 1204441 gggtcattac ccggtacgac ttgttgggct tcttgtcgga gggggcggga cggcggtagt 1204501 cgcgcaggca ggcgcgccgc aatttagttc ggctacaaac aattacggca ggcggccagt 1204561 gccgcacagg tcgtgggcac tgacccattg ggccccgtgg ctcatctcac cgccgggcgt 1204621 tccggtgaat ccggtcctca ggtactgtag tcccgcctag ttcaccctag ttcagctgaa 1204681 cctcagtgga aggtgtgccc atgaccgaac agccgccccc cggcgggtcg tacccaccgc 1204741 ccccgccacc gcctgggccg tccggtgggc atgagccacc tcccgctgca ccacccggcg 1204801 gcagtggtta cgctccgccc cctccgccct cgagcggcag tggctacccg cctccgccgc 1204861 caccgcctgg cgggggggcc tacccgccgc ctccgccgtc ggccggcggt tacgcgccgc 1204921 cgccgcccgg accggcgatt cgtacgatgc cgaccgagtc ctacacgccg tggattaccc 1204981 gggtgctggc ggcattcatc gactgggccc catacgtagt gctggttggc atcggttggg 1205041 tgatcatgct ggtcactcag acgtcgtcgt gcgtcaccag cattagtgag tacgacgtcg 1205101 gccagttctg cgtttcccag ccgtcgatga tcggccagtt ggtgcagtgg ttgttgtcgg 1205161 tgggcggatt ggcttacctg gtctggaact acggctatcg ccagggcacc accgggtcga 1205221 gcatcggcaa gtcggtgctg aagttcaagg tggtcagcga gaccaccggg caaccaatcg 1205281 gcttcgggat gtcggtggta cgccagcttg cccactttat cgacgcgatc atctgcttcg 1205341 tcgggttcct gtttccgctg tgggacgcta aacggcaaac gttggcggac aagatcatga 1205401 cgacggtgtg cgtgccgatc tgatccggga ctgcactgcc cacccgaccg tccgatgagc 1205461 gaagaccgca cgggacacca gggaatcagc ggaccggcca cccgcgccat ccacgctggc 1205521 taccgcccgg atccggcgac cggggcggtg aacgtgccga tctacgccag cagcaccttc 1205581 gcccaagacg gcgtcggcgg tctgcgtggc ggtttcgaat acgcacgcac cggcaacccc 1205641 acccgggccg cattggaggc ctcgctggcg gcagtcgagg agggtgcttt cgcgcgggca 1205701 ttcagttccg ggatggccgc gaccgactgc gccctgcggg cgatgttacg gcccggagac 1205761 cacgtcgtca ttcccgatga cgcctacggc ggcacattcc ggttgataga caaggtgttc 1205821 acccggtggg atgtccagta cacgccggtg cggcttgccg atctggatgc ggtgggtgcc 1205881 gcgattactc cgcgcacccg gctgatttgg gtggagacgc ccaccaatcc gctactgtcg 1205941 atcgccgata tcacggccat tgccgagctg ggcacagaca gatcggcaaa agtattggtg 1206001 gacaatacct ttgcctcacc cgcgttgcag cagccgttgc ggctgggcgc cgatgtggtg 1206061 ttgcactcga ctaccaagta catcggcggc cattccgacg tggtgggagg tgcgctggtc 1206121 accaacgacg aagagctgga cgaggagttc gctttcttgc agaacggcgc cggcgcggtg 1206181 cccggaccat tcgacgccta cctgaccatg cgcggcctga agaccttggt gctgcggatg 1206241 cagcggcaca gtgaaaatgc ctgtgcggta gcggaattcc tcgctgatca tccgtcggtg 1206301 agttctgtgt tgtatccggg tttgcccagt catcccgggc atgagattgc cgcgcgacag 1206361 atgcgcggct tcggcggcat ggtttcggtg cggatgcggg ccggtcggcg tgcggcgcag 1206421 gacctgtgtg ccaagacccg cgtcttcatc ctggccgagt cgctgggtgg ggtggagtcg 1206481 ctgatcgaac atcccagcgc catgacccat gcgtcgacgg ccggttcgca attggaggtg 1206541 cccgacgatc tggtgcggct ttcggtcggt atcgaagaca ttgccgacct gctcggcgat 1206601 ctcgaacagg ccctgggtta actaccgcga gcagacgcga aagcacccca aaaccgccgg 1206661 tttgggggct tctgcgtctg ctcgcgggta cctaggagtg gtacggctcg gcgctgacta 1206721 gggtcaccga cacggtgctg ccgttgggca ccgtgtagct gcgggtctcg ccgaccttgg 1206781 cgtcgatcag ggccccaccg agcggtgaat tcggcgagta gacctcgagc ttgccgtcgc 1206841 tgacgccctc ctggcgggtg gcgatgagga acgtttcgct gtccgacttg tcgccattgt 1206901 agtacacctt gaccacagaa ccgggtaatg cgacgccgga ttgcttgggt gcctcgccaa 1206961 cctttgcgtt gctgagcaag tcctgcagct ggcgaatgcg ggcctcctgc tggccctgct 1207021 cctcgcgggc ggcgtggtat ccgccgttct cgcgcaggtc gccttcttcg cggcggtcgt 1207081 tgatttcggc ggcgatgacc gggcgattcg caatcagctg gtcgagctct gctttgagtc 1207141 ggtcatgtga ctcttgggtc aaccaggtga cttgagtatc cgtcatctcg tcgcgctcct 1207201 cgtgttgtcg ttcccgcgta gtcgggcaag tttcggatcc ctgccagcag cactgtcggg 1207261 aatatttggg gtctcacccc gggttgccgc cgctccgttc tgcgtacggc cgttaatgca 1207321 gcaatacacg gccccggcag gaccgtgcat cgatccatgc taccaccacg gtcaggggag 1207381 gcgcaggtag ctgggcactt cggtgccaca accgtatacg tccgccatca ccggcggctg 1207441 ggaggatttc acggtcgtcg tcacctgcac ggtggttgcc tcggacggtg ggactagcag 1207501 ctcacgtctg ccggtctcgc tgccgtttgt tgcccgaact cgcacgatgc aggccaccgg 1207561 tcgggacggg tccgaacgtg tcacgctgat ggtgaccgat gccgtctcgt cgtcgaccag 1207621 tcgatagccc accagcgaac cggtgacggc gctggtgctg atccgttggt agccgatgac 1207681 ggcaatgacg atgccggccg cggcgaccag cacccccagg gcgatcgcga cacggcgccg 1207741 cgctcggcgg gacagtcgcg ggcgtccgta gcgggcgtct ggtcgcggaa tgggggtgtg 1207801 ggtcatgcct gggttcacgc cggcgggatg caacgcttcg acaaaccgga attatagggt 1207861 cacttatagg cttaaggggg cagccaggcg gacggacaag ggggcacgtg agcgaactgc 1207921 ggttgatggc ggtgcacgcc caccccgatg acgagtccag caagggcgcg gccaccctgg 1207981 cgcgctacgc cgacgagggt catcgcgtgc tggtggtgac gttgaccggt ggtgagcgcg 1208041 gcgagatcct caacccggcg atggacctgc cggacgtgca tgggcgcatc gccgagatcc 1208101 ggcgtgacga gatgaccaag gcggccgaga tcctcggtgt cgagcacacc tggctgggct 1208161 tcgtcgactc cgggctacct aagggtgatt taccgccacc gctgcctgat gactgcttcg 1208221 cgcgggtacc gctggaggtg tccaccgagg cgctggtgcg ggtggttcgc gagtttcggc 1208281 cgcacgtgat gaccacctac gacgagaacg gcggctaccc acatcccgac cacattcgct 1208341 gccatcaggt ttcggtggct gcctacgagg cggccggtga cttttgccgg tttcccgacg 1208401 cgggtgagcc gtggacggtg tccaagctgt actacgtcca cggcttcctg cgggagcgga 1208461 tgcagatgtt gcaggatgag ttcgcccggc acggccaacg cggcccattc gaacaatggc 1208521 tggcgtactg ggaccccgac catgactttc tcaccagccg agtgaccacc cgggtcgagt 1208581 gctcgaaata cttcagccaa cgcgacgatg cgttgcgcgc gcatgccacc cagatcgacc 1208641 cgaacgccga attcttcgcc gccccgcttg cctggcagga gcggctgtgg ccgaccgagg 1208701 aattcgagtt ggctcgctcg cgtatccccg cgcgcccacc ggagaccgaa ttgttcgccg 1208761 ggatcgagcc gtgaaccaga ttctgctcag cgtgattgct gagggcgggc ccggtaacac 1208821 cggacccgat ttcgggaagg ctagcccggt ggggttgctg gtgatcgtgc tattggtgat 1208881 cgccacgttg tttctggtgc gttcgatgaa ccagcaactg aagaaagttc ccaagtcgtt 1208941 cgaccgggat caccccgagc tcgaccaggc agccgacgag ggcaccgacc gcgacggacc 1209001 ggcccgacca ccgggacccc cgcatgagtc cggctaatcc gtccgggacg aataccctcg 1209061 cgctggccac cagcccgtac ctgcgccagc acgctgataa cccggtgcac tggcagcagt 1209121 ggacgccgca ggcactggcg gaggcggccg cgcgcgcggt gccgatcctg ctgtccgtcg 1209181 gctacgccgc ctgccactgg tgtcacgtca tggcccacga gtcattcgac gacgacgagg 1209241 tggccgcggc catgaacgcg ggcttcgtct gtatcaaggt cgaccgggag gagcggcccg 1209301 acatcgacgc ggtctacatg aacgccaccg tcgcgctcac cgggcagggc ggctggccga 1209361 tgacatgctt tctcaccccc aacggccggc cgttcttctg cggcacctac tacccgaaag 1209421 cggctttcct gcaacttctt tcggccatat ccgaaacctg gcgggaacgc cgcgctgagg 1209481 tggagcaggc atctgaccat atcgctgccg agttgcgctc gatggcttcg gggctgcccg 1209541 ggggtggccc ggaggtggcg ccggagctgt gtgacgacgc ggtggcagga gtgctgcgtg 1209601 agcaggacac ggcgcacggc ggatttggcg gtgcgccgaa attcccgccg tcggcactgc 1209661 tggaagcgct aatgcggcac tacgagcgca cccgatcacc ggcggcgctg gaggcggtcg 1209721 cacgcactgg aaacgccatg gcccgtggcg gcatctatga ccaactcggc ggcggtttcg 1209781 cccgatacag cgtcgacggt gcctgggtgg taccgcattt cgagaagatg ctgtacgaca 1209841 acgcgctgct gctgcgcgcc tacgcgcact gggcccgccg taccggggat ccgttggccc 1209901 gccgggtcgc cgcccagacc gcgcgatttc tgctcgacga gttgggcagc aaagcaccgg 1209961 ccgacatgtt cacctcgtcg ctggatgccg acgccgacgg ccgcgagggt tcgacctacg 1210021 tttggacgcc ggtgcaactg accgaggtgc tcggcggcga cgacggccgt tgggcggcag 1210081 aggttttcgg ggtgaccgag gccggcacct tcgagcacgg gacgtctgtg ctgcagttgc 1210141 ccgccgaccc cgacgacgcg gcgcgtctgg accgggtccg cgccgcgttg ctggtggccc 1210201 gcctggcccg ggcccagccc gcccgcgacg acaaggtcgt cacgtcctgg aacgggttgg 1210261 cgatcaccgc gctggccgaa gccagcgtgg ccctggacga ccccgcgttg gcgcacgccg 1210321 cgcggcgctg cgcgaccagg ctgctggacc tgcacgtcgt cgacggccgc ctgcgccggg 1210381 ccagcctggg cggggtggtc ggcgacagcg ccgccatcct ggaggaccac gcgatgctgg 1210441 ccaccgggct gctggcgctc taccagctga cctccgaggg cgcgtggctg acggcggcta 1210501 ccggattgct ggacaccgcg gtggcgcatt tcggcgaccc gcagcgcccc ggtcgctggt 1210561 tcgacaccgc cgacgacgcc gagcggctga tgctgcggcc ctccgatccg ctggacgggg 1210621 cgacaccgtc gggcgcttcg tcgatcgccg aggcgctgct gacggcgggc catgtggtcg 1210681 acggtgctcg cgccgagcgg tattggcagc tggcggccga cacgctgcgg gcgcatgcgg 1210741 tgctgctggc tcgggcgccg cggtcggccg ggcattggct ggcggtcgcc gaggcggtgg 1210801 tgcgcggacc gctgcagatc gccgtcgcgt gcgacctgcc gcggtcgtcc ctgctggccg 1210861 acgcgcgccg gctggccccg ggcggggcga tcgtcgtggg cggcgcggcg ggttcgtcgg 1210921 cgctgctggt cggccgggat cgggtggccg gcgccgacgc cgcctacgta tgccggggcc 1210981 gggtctgcga tctgccggtg accagcgcgg ccgaactcgc caccgctttg ggcgtacccg 1211041 gctagcggac tcgggtggca cccgtccacc gtgaaatccg cgacgcggtg tcggcgtgtc 1211101 gcgtcgcaat tttcacgctc gcgaccgccc tgggcgtgcc gggtcagaac accacgaacc 1211161 acatcgcgat gtagtggcag atcgccgcca ccgcggtgca ggcgtggaag aactcgtggt 1211221 agccgaacgt cgtcggccac gggtcgggcc agcgtaccgc gtagagaatg ccgccgatgc 1211281 tgtacaacgc gccgccaaca aacagcaaca ccaacgcggt caccccggcg ttgtgcagga 1211341 tcgtcgcggt gtaccagacc gccacccaac ccagcaacag gtacagcgga accccgaccg 1211401 agcgcggcgc cgccggccaa cacatcttca gcaagattcc ggcgatcgca ccgccccaaa 1211461 caatcgacaa caccacgcgc ccgtcgtggg ccggcaaggc cagcagcgcg aacggcgtgt 1211521 agctgccggc gatgaacacg aagatcatcg agtggtcggc ccgcttcatc cagttgcggg 1211581 ccgtcgcgga tttccaattg acccggtgat aagtggcgct gacggtgaac atggtgatcg 1211641 tggccgcggt gtaggccagc gtcgtcaggc ccgccttggc ggaacccacc gcccacgaca 1211701 ccgcgaccag cgacgcaccg gccaacaccg cggtgccggc ggaatacacg tggatccagc 1211761 cgcggaagcg cggtttggtc aggacacggg cgacaccttc gacgaggtgg tgggcagcgt 1211821 gggccggcgt ccttgcttcc gcggtggtgg cggtgtcggc ctggccgctc atttcgcctg 1211881 ttgcctcgtc ttgtgcttgc cggtgggtgt cgtcgaacac agtagtcggg ccaggtagcg 1211941 gacatctgac tcgacgtctg ggtcacagta gtctgggtat ctgtggagat catcccgccg 1212001 cggctcaaag agccgttgta ccggctctac gagctgcgcc tgcggcaggg cttggccgcc 1212061 tcgaaatccg acctgccccg gcacatagcc gtgctgtgcg acggcaaccg gcgatgggcg 1212121 cgcagcgcgg gctacgacga cgtcagctac ggctaccgga tgggtgcggc caagatcgcc 1212181 gaaatgctgc ggtggtgcca cgaagccggc atcgaactgg ccaccgtcta tctgctgtcc 1212241 accgaaaacc tgcagcgcga tcccgacgag cttgcagcac tcatcgagat catcaccgat 1212301 gtcgtggaag agatctgcgc accggccaac cactggagtg tgcggacggt cggggatctg 1212361 gggttgatcg gcgaggaacc ggcccggcgg ctgcgcggtg cggtggaatc caccccggag 1212421 gtggcctcgt ttcatgtcaa cgttgctgtt ggctacggcg ggcgccgcga gatcgtcgac 1212481 gctgtgcgcg cgttgttgag caaggaactc gccaacgggg ccaccgcgga ggaactcgtc 1212541 gacgcggtga ccgtcgaggg tatctcggaa aacctgtaca cctcaggcca acccgacccc 1212601 gatttggtga tacgcacctc cggcgagcaa cgcttgtccg ggttcttgct gtggcaaagc 1212661 gcctactcgg agatgtggtt caccgaggcg cactggccgg cgtttcgcca cgtcgatttt 1212721 ctacgcgcgc tgcgtgacta cagtgcgagg catcgccgct acggcaggtg aatccggcgc 1212781 aggacgccta tgttgcgctg ttcggctgcc tgcgcagagt gcacattagc cggctcgtca 1212841 tgctgtgcaa tctgcccagg tgaaacccgg tgtttgggat cctggatagc gataccatcg 1212901 actgatccat gcgggacatc cgatgctgga ctgatcggag taaggcgatg tcgtttgtag 1212961 tcgtggcgcc ggaggtgttg gcggcggccg cttcggatct agcgggcatc gggtcgacac 1213021 tggcgcaggc caacgccgcg gcgttggcgc cgaccaccgc ggtgttggcc gcgggtgctg 1213081 atgaggtttc cgcggcaatc gcgtcgctgt ttggggcgca tggtcaggcg tatcaggcgg 1213141 tgagcgccca aatgtcggcg tttcacgccc agttcatgca ggcgttgacg ggtgccggcg 1213201 gggcttatgc ggctgcggag gcggtcaacg tctcggcggc gcagagcgtg gaacaagacc 1213261 tgttggccgc gatcaacgct cgcttcgagc ggatttttgg gcgcccgctg atcggtgatg 1213321 gcgccaacgg cgggccggga caagacggcg ggcccggcgg gttgctgtac ggcaacggtg 1213381 gcaacggcgg caccagcacg accgtgggga tggccggcgg caacggtggt gccgccgggc 1213441 tgatcggcaa cggtgggttc gggggcggcg gcgggcccgg cgcggccggc ggcaacggcg 1213501 gcgccggcgg gtggctattc ggcaacggcg gcgccggcgg tgccggcggc ctcggcgtag 1213561 cgcccggcgt gcccggcggc gccggcggtg ccggcggcgc cggcggtgtc ggcggacccg 1213621 ccgggttgtg gggccacggg ggtgccggcg gggcgggtgg tgccggcgtg gctggcgccg 1213681 gcggcttcga ggggacgatc ggtgccggcg gtgccggcgg tgtcggcggt gccggcggtg 1213741 tcggcggtgc cggcggtgcc ggcgggtggc tgtacggcga cgccggtgcc ggtggggatg 1213801 gtggtgtcgg cggtgccggc ggcaccggcg ggttaggcaa ccgtggcggc gccggtggcg 1213861 ccgggggcgc cggtggtgtc ggcggcgccg ggggtgccgc cgggctgtgg ggcggcggtg 1213921 gtgccggcgg ggtgggtggg accggcggcg gcgccggcct cggtgctcag agcgtcacct 1213981 tcagtagtag cttaagtggc ctttccggtg gcgacggcgg cgccggcggg gccggtggcg 1214041 ccggtggcgc cggtggcacc ggtgggtggc tgtatggcgg cggtggtgcc gccggatccg 1214101 gcggggacgg tggtaccggc ggtcagggcg gcgccggcgg cgccggtgta tttagcctat 1214161 tcggatccgg tggcggcccc ggcggcaacg gcggcgtcgg cggcgtcggc ggtgtcggcg 1214221 gtgctggcgg gcgtgccggc ttgttcggcg tcgggggcct cggcggcgcg ggtggcgacg 1214281 ccggtgactc cggcgaaggc ggcttcggcg ggccggggct cgccggcggg ctgttcggca 1214341 accccggcaa cggcggcgtc ggcgggatcg gcggcgacgc cgcagccggc ggcgccggtg 1214401 gggccggagg caacggtggg tggttgttcg gcaatggtgg tgccggcggc tccggtggcg 1214461 acggcggcgc cgccggccgt ggcggtgccg gcaacttggg ctcggccggg ggtatcaacg 1214521 cccccgccgg taaccccggt agcggctcgg tcggcatcgg cggtgccggt ggtgccggcg 1214581 gcaccgccgg gctgttcggc gacggtgggg ctggtggtgc cggcgccgcc ggcggcttcg 1214641 gcggcatcag cgccgccacc ccctcggcgg gcagtgaggg cgccatgggt ggggccggtg 1214701 gtgttggcgg caacgccagg ctgttgggca ctggtggcgc cggtggagtc ggcggcggcg 1214761 gcggggccgg cggcgacgga ggccgcggcg gagtcgcaac ccccggcggt cagggcggtg 1214821 acgctgggga cggtggcgcc ggcggggccg gcggcaatgg cggcggcggc gccagcggcg 1214881 ccggcgggtg gctgttgggg accggtggtg ccggtggtgc cggtggtaac ggcggcaatg 1214941 gcggaaaagc cggttttagc cctgggccga ccaacttcgg tctcaacggc gccggtggtg 1215001 gtggtggtgt cggcggcaac ggcgccaccg gaccctggct gttcggcgac ggcggcgccg 1215061 gtggcggcgg cggggccggc ggcatcggcg gcgacggcgg ccccacccca ggcagcaccg 1215121 gtgccggtgc ggccggtggt cacggcggcg acgcccagct gatcggcaac ggcggccacg 1215181 gcggggccgg cggcaccggg gtgccgaacg ggtcaggtgg tgccggcggc ctcagcgggc 1215241 tgctgttcgg cgagccgggg gcgaacgggt aggttcggcg ccgctgccgt gatcgcggcg 1215301 aggcgtcggt gtccgcgtcc gtgcgggcga atccagtccg gtctgagtgc gtctactaca 1215361 gcttgcgcag ccgtagccgc ttgatggcat cggactggtt accgtctgcc tgctgtccac 1215421 agaaaacctg tgtgcgatcc cgacgagctt gccgtgcgtg ggctacggcg accgtcgcga 1215481 attcgtcgac gcggtggccg tagaagccat ctgcgaaaac ctgaatacct cggggcaacc 1215541 cgatcccgac ctggtgatcc gcacctcggg ggaacaacgc ttgtccggcc accgagggcc 1215601 cactggcgga gtttcgcgac gtcgacttct gcgcgcgctg cgtgactaca gtacgccaca 1215661 cgcgtcgatc ccctacgttc cgccgcccta tcgaagcgac gggatccacg cttcccggct 1215721 ggcggttgaa tcggttttcg atgcattggc tgggcgcgtc gaactctaaa gactttatgg 1215781 aaattagttg tacagtgata aaaccgttat agggtccgtt gtcaaacaat gataatcacg 1215841 tgataggaac gtgattcatc ggtctgaagt gcttatgatg atttatatat aaaaccgtta 1215901 tatgtgggta aaggattgcg gatgtcatac atgattgcca caccagcggc gttgacggcg 1215961 gcggcaacgg atatcgacgg gattggctcg gcggttagcg ttgcgaacgc cgcggcggtc 1216021 gccgcgacaa ccggagtgct ggccgccggt ggcgatgaag tgttggcggc catcgctagg 1216081 ctgttcaacg caaacgccga ggaatatcac gccctcagcg cgcaggtggc ggcgtttcaa 1216141 accctgtttg tgcgcacctt gactgggggg tgcggagtct ttcgccggcg ccgaggccgc 1216201 caatgcgtca cagctgcaga gcatcgcgcg gcaggtgcgg ggcgccgtca acgccgtcgc 1216261 cggtcaggtg acgggcaatg gcggctccgg caacagcggc acttcggctg cggcggccaa 1216321 cccgaattcc gacaacacag cgagcatcgc cgataggggc acaagcgcca tcatgaccac 1216381 ggcaagcgcg accgcgtctt ccacgggcgt cgatggcgga atagcggcga cgtatgcggt 1216441 cgcctcgcaa tgggatggtg gctacgtggc caattacacg atcacccaat tcgggcgcga 1216501 cttcgatgac cgattggcgg ttgcaattca ctttgcctga aaatgcctct atttcgaacg 1216561 cgtgctgcgc tcaacttgcc cagtcgggca cgcagtacac tcttgacgcc cgagagctat 1216621 aacggcaccc cccgtggact cgatcaccgt cggctaccaa gcagcgcaaa ccggcggcta 1216681 ctcgccaccg acaaatctgc tgatcaacgg tcaagccgtc accatcgacc agacccccat 1216741 cacctcgtcg ccaacgactc cgccacccac cacaccaccc gagatcccga ccggtggaac 1216801 ggtgatctcc acctagttcg ggacgactac ggtcaccgga ggctacgtgg tgcagaacaa 1216861 cgcgtggaac aacccccgcc gggcagaccg tcaacgtcag ccaaaccggg ttcaccatca 1216921 ccgagatgaa cggtgctgcc ccaaccaacg gcgccccgct gagttacccc tcgatctgcg 1216981 agggcgtgca ctggggccac ctcgtcggtg ggcaccaacc tgcctactga ggtgggccag 1217041 attttgtcgg cgccgaccag catcgactac aactacccga cgaccggggt atgggacgcc 1217101 tcctacgaca tctgcctgga ttccacaccc aagacgaccg gggtcaacca gcaggagatc 1217161 atgatctggt tcaaccacca gggctccatt cagccggtcg gctccccggt gggcaacacc 1217221 accatcgagg gcaagaactt cgtggtgtgg gatggcagca acggcatgaa caacgcgatg 1217281 gcctatgtcg cgaccgagcc gatcgaggtc tggagcttcg acgtgatgag tttcgtcgac 1217341 cacaccgcca ccatggagcc gatcaccgac tcgtggtacc tcacgagcat ccgggccggc 1217401 ttggagccct ggagcgacgg tgtgggtctg ggggtcgatt cgttctcggc gaaagtcaac 1217461 taaagaccac gttgacaccc aaccggcggc ccggcatggg ccgtcgcggc gtagaagctt 1217521 tgaccgcggc gcgaaacgtt cgctgctgcg gcccatgcag atcgcacacg cttgcttgaa 1217581 catcgggtgg agccggtggt aacgccaggc tttgggtgtc ggcgcggctc ggcggtcagc 1217641 tgcgcggacg cggtcggcca tcgtgacgac gagatgctgg cggcatgtac ggcaaccgct 1217701 ggctcgtctt agagccattt gctgaggcgc atgctttgcg tcatgcaaag tgcatatgcc 1217761 gccagcggga tggtgtgcat tctgtccatg ggaaaccggg ttgatggtgg gcgcgtcagc 1217821 gatacgatct gtgcaccctg acgacatggc cgatgcatga ttgatcggag gtaaacgatg 1217881 tcgtttgtga ttgctgcgcc ggaggcgttg gtcgcggtcg cttcggatct ggcgggcatt 1217941 gggtcggcgc tggcggaggc caacgccgcg gcgttggccc cgacgacggc gttgttggcc 1218001 gcgggtgccg atgaggtgtc ggcggcgatc gcggcgctgt ttggcgcgca cgggcaggcg 1218061 tatcagacgg ttagcgccca ggcgtcggcg tttcatgccc agtttgtgca ggcgttgact 1218121 ggcggcggcg gggcgtatgc ggctgccgag gccgccaacg tctcggcggc gcagagcacc 1218181 gaccagcggc tgctcgatct gatcaatggg cccacccagg cgttgttggg gcgtccactg 1218241 atcggtgatg gcgccaacgg cgggccgggg caagacggcg ggcccggggg gttgctgtac 1218301 ggcaacggcg gcaacggcgg cactagtacc accgccgggg tggccggcgg caacggtggc 1218361 gccgccgggc tgatcggcaa cggcggggcc gggggcggcg gcggggccgg cgcggccggc 1218421 ggcaatggcg gtgcgggcgg gtggctgtat ggcaacggcg gcgccggcgg ggccggtggg 1218481 acatcggtga tacccggtgt cgccggcggc aatggcgggg ctggcgggtc cgcgggactg 1218541 tggggtaccg gcggggccgg tggcgacggc ggcaacggcc ggtcggggcc agtcaacgtc 1218601 gccggcagcg cgggcggcaa cggtggcgct ggtggcgccg ccgggttatt cggtgacgcc 1218661 ggggccggtg gcaacggcgg caagggcggt gctggcggcg ccgcctttag cattaacttc 1218721 accgcaggcg atggcggtgc gggaggtgcc ggtgggtccg gcggccacgc attgctgtgg 1218781 ggcgccggcg gagccggggg taacggcgga tccggcggca cggggggtgc cggcggcagc 1218841 accgctggcg ctggcggcaa cggcggggcc gggggtggcg gcggaaccgg tgggttgctc 1218901 ttcggcaacg gcggtgccgg cgggcacggc gccgccgccg gaaacggctt agccgcgggt 1218961 aatggcgtca gcagcagcgg cggcggcggt gccggtggga ccggcggggc cggtggggac 1219021 ggtggcgccg gcggggccgg aggcaacgcc aggctgtggg gcgtcggtgg cgccggcggg 1219081 gccggcgggg acggtggcgc cggcggggcc ggcggcaaag gcggctctgg cctcagcggt 1219141 aacgccaacg gcggggccgg cggcgacagc ggccgtggcg gcacgggcgg cgccggcggc 1219201 gagggcggcg ccgccgggct gctggtgggc accggcgggc acggcggtga cggcggggcc 1219261 ggcggcgccg ccgtcaaggg cggtgacggc ggggccgccg ccggcacggg catcgccggc 1219321 gctggcggcc gtggcggcgc gggcggcagc ggtggcagcg gtggtgacgg cgggggcggg 1219381 gccgccggcc ccgccgggtg gctgttcggc gatggcgggg ctggcgggaa cggcggggcc 1219441 gcggccgccg gcggcgccgg cggccaagcc ggcggtggcg gcgggaacgg cggcaatggc 1219501 ggcaacggcg gcaatggcgg caatggcggc aacggcgcca ccggggggtg gctgtacggc 1219561 aacggcgggg ccggcggcca gggcgccacc gccggagccg gcggagccgg cgctaacggc 1219621 gtcagcagca ccaatggcgg cggcaacggg gggatcggcg ggaccggtgg gtccggcggg 1219681 gccggtggca acgccgggct gttgggcgtg ggcggcgccg gcgggcacgg cgcctccggc 1219741 ggcgccggcg ataggggcgg cgctggcggt accgggttca taagcagtga cggcggtgct 1219801 ggcggtgatg gcggtgatgg cggcaacggc ggggccggcg gcaccggtgg gctgttgttc 1219861 ggtgccggcg gcaatggtgg ccccggcggg tctggcggtg ccgccgatat tggcggcaac 1219921 ggcggcgccg gtaacggcgg gggcaccgac gggaacggcg gtaatggcgg gtccggcggc 1219981 ggcgccggca gcggcggtga cggcggcggg gctggcggca acggtgcgtg gctgttcggc 1220041 aatggcggcg ccggcggggg cggcggaaaa ggcggcaacg gtgccggcgg cgggcttggc 1220101 ggcggttcat tcggcctccc cggcctgaac ggcagcggcg gcgacggtgg cgacggcggt 1220161 aacggtgccc ccggcggggt gctgtatggc aatggcggcg ccggcggcca ggggtcaagc 1220221 ggtggcatcg gcggccccgg cgccaccggc ggtgccggcg gcaaaggcgg tgatggtggc 1220281 gatgcgcagc tgatcggcga cggcggcaat gggggcaacg gaggcgcggg cggcaccggg 1220341 ggcaccccgg ggcccggcgg acccggcggg tccggcgggc ttggaggcct gctgttcggc 1220401 caaaccggca cggctggcgt gtcgccgtag ccggtaggct ggccgcctcc gcggcattgg 1220461 cgtcgtcgca aacttcgcgc acgccctggt gtcgatcgtt gccgctgaat tggcgccgat 1220521 gaccgcaacc ggtatcgccg ctacgccggc ccgaggcggg tacaccacgg ttttcgaggg 1220581 atggcaatat ccgggagtgc gccggctggc ggcctaactc gcctgcaccc ggcgattgga 1220641 ccgccaatta cagcttgcgc agccgcagcc ggttaatgga atgatcggcg tccttgcgca 1220701 gcaccagggt ggcccgggga cgggtcggca gaatgttctc cacgaggttg ggccggttga 1220761 tggtccgcca gatctcgcgc gcggcgacga cggcctgcga gtcagaaaaa gccgcgtagt 1220821 ggtggaagtg tgattccggg tcggcgaacg ccgtggtgcg catggccaaa aaccgtgata 1220881 cgtaccactg ctcgatgtcc tcgatccggg cgtctacata caacgaaaaa tcgaacagat 1220941 ccgacaccat gagcgtgggg ccggtctgca agacgttgag cccctccagg atcaggatgt 1221001 cgggatggcg gaccacttgt tctgcccccg ggatgatgtc gtagtgcaaa tgcgaataca 1221061 ccggcgcaca tgcgtagtcg gagccggact tcaccgaggt gacaaaccgc atcagtgccc 1221121 ggcggttata gctttccgga aaacctttgc gatgcatgag gtttcgccgc tgcagctcgg 1221181 cgttggggta gagaaagccg tcggtggtca ccagatctac ccgggggtgg tgatcccagc 1221241 gagccagcag cgcctgcagc acgcgggcgg tggtggactt gccgaccgcc acactgccgg 1221301 ccacaccgat gatgaacggc accggccggt ccgggttttg ttggggctcg ccgagaaatt 1221361 ccgcggtggc cgcgaacagc cgttggcggg cggcgacttg caggtgaatc agccgggcca 1221421 gcggtaggta gacctcttcg acctccaaca ggtcgatctg ctcaccgaga ccgcgcaggc 1221481 caaccagttc ttcttcggtg agggctagcg gagtcgacat acggagcgcg cgccactgcc 1221541 ttcggtcgaa ctcgacatat gggctcggct cgctaagccg cgacatggtg tcagtcttgc 1221601 agggacgggt gcggggcctg atggctgggc tggcgaagtg cggtgctggc agactccgtg 1221661 tcggtgccga gggccggggg taccccctgg gcttagctgg gcactggggc cagggcgcgg 1221721 tgtttcgatg gaattcagct gtggccctgt gaatttcgca cgctgacgcc ggttgatgct 1221781 gtgagtcggg cacaaaccgc ccaccgctac tcgtgaccta cgtggcagct ggggcactag 1221841 tggctgccgt ttgcggtgca gacgtgcaac ggtggatggc gtgtgctgca ttaagggtaa 1221901 tcagcccggg agcggctcgc tggatacact ggcgcccgtg actgctgcac ctgacgctcg 1221961 cactaccgct gtaatgtctg ccccgctcgc tgaggttgac cccgatatcg ccgagttgct 1222021 ggccaaggag cttggtcggc aacgagacac cctggagatg atcgcctcgg agaacttcgc 1222081 accgcgcgct gtgctgcagg cccagggcag tgtgctgacc aacaagtacg ccgagggact 1222141 gcccgggcgg cgctactacg gcggttgtga gcacgtcgac gtggtggaaa acctcgcccg 1222201 cgaccgagcc aaggcgttgt tcggtgccga attcgccaat gtgcaaccgc attcgggcgc 1222261 tcaggccaac gccgcggtgc tgcatgcgct gatgtcaccc ggcgagcggc tgttgggtct 1222321 ggacctggcc aacggtggtc acctgaccca tggcatgcgg ctgaacttct ccggcaagct 1222381 ctacgagaat ggcttctacg gcgtcgaccc ggcgacacat ctgatcgaca tggatgcggt 1222441 gcgggccacc gcactcgaat tccgcccgaa ggtgatcatc gccggctggt cggcctaccc 1222501 gcgggtgctc gacttcgcgg cgttccggtc gatcgccgac gaggtcgggg ccaagttgct 1222561 cgtggacatg gcgcatttcg cgggtctggt cgccgcgggg ttgcacccgt cgccggtgcc 1222621 gcacgcggat gtggtgtcca ccaccgtgca caagacgctc ggcggcggcc gctccggcct 1222681 gatcgtcggt aagcagcagt acgccaaggc gatcaactcg gcggtgtttc ccgggcagca 1222741 gggcggtccg ctcatgcacg tcattgccgg caaggcggtc gcgttgaaga tcgccgccac 1222801 acccgaattt gccgaccggc agcggcgcac gctgtccggg gcccggatca ttgccgatcg 1222861 actgatggct cccgatgtcg ccaaggccgg tgtgtcggtg gtcagcggcg gcaccgacgt 1222921 ccacctggtg ctggtcgatc tgcgtgattc cccactggat ggccaggccg ccgaggacct 1222981 gctgcacgag gtcggcatca cggtcaaccg caacgccgtc cccaatgatc cccgaccgcc 1223041 gatggtgacc tcgggcctgc ggataggcac gcccgcgctg gcgacccgcg gcttcggcga 1223101 caccgagttc accgaggtcg ccgacattat tgcgaccgcg ctggcgaccg gcagttccgt 1223161 tgatgtgtcg gcgcttaagg atcgggcgac ccggctggcc agggcgtttc cgctctacga 1223221 cgggctcgag gagtggagtc tggtcggccg ctgacgcggg cctgtcgttg gcgcgcataa 1223281 gcgcgagagc gccgatcacc gcgcgacacg gcggcgcccg atttcacgaa atctgtgtat 1223341 gcgagttaca gttaccgcat ggcacagaaa cctgtcgctg atgcgctgac ccttgagctc 1223401 gagccggtgg tcgaagcgaa catgacccgc cacctcgaca ccgaggacat ctggttcgcc 1223461 cacgactacg tcccgttcga tcagggggag aacttcgcat tcctcggcgg acgcgattgg 1223521 gatccatccc agtcgacgct gcccagaacg atcaccgacg catgcgagat cctgctgatc 1223581 ctcaaggaca acctggccgg tcatcaccgt gagctcgtcg agcacttcat actcgaggat 1223641 tggtggggcc gctggctcgg ccggtggacc gcagaggagc acctgcacgc catcgcactg 1223701 cgcgaatacc tggtggtgac ccgggaagtc gacccggtcg ccaacgagga cgttcgagtc 1223761 caacacgtga tgaagggcta ccgagccgag aagtacacgc aggtcgagac cctggtgtac 1223821 atggcgttct acgagcgctg cggcgcggtg ttctgtcgta atctggccgc gcagatcgaa 1223881 gagcccatcc tggccggact catcgaccgc atcgcccgag acgaagtgcg acacgaggag 1223941 ttcttcgcca acctcgttac gcactgcctg gactacacgc gtgacgagac gatcgcggcg 1224001 atcgccgccc gtgccgccga cctcgacgtc ctcggggccg acatcgaggc ctaccgagac 1224061 aagctgcaga acgtggccga cgctggcatt ttcggcaagc cgcagctacg gcagctgatc 1224121 tcggaccgca tcacggcatg gggcctggct ggggagccct ccctcaagca attcgtcacg 1224181 ggctagacac ccgtcggcgc gcctgccctg cgggggtacg gccggcggag tagcgtcgca 1224241 ctcgatggct agcgacatgc tctgctgcca gggcggcacc ttccgtcacg acggctgtca 1224301 tgacaagggc aggaccggcc ccggtcctgg tgtcgctgcc cccgccgaca tgctcgggtg 1224361 ggtccgctcg agcgccgtta gctcgaggag cgctccgtga ccgatacccg cacgtacgtg 1224421 ctcgacacct ctgtgctgct gtccgatccg tgggcgtgca gccggttcgc cgaacacgat 1224481 gtggtggttc cgttggtggt gatcagcgag ctagaagcca agcgccacca ccacgagctg 1224541 ggatggttcg cccgccaggc gttgcgtctg ttcgacgatc tgcgcctaga acacgggcgg 1224601 ttggatcagc cgattccggt tggcacccaa ggcggtacgc tgcacgtcga actcaatcac 1224661 accgacccgg cggtgctgcc cgcaggcttt cgcaccgaca gcaacgactc gaggatcttg 1224721 agttgcgccg ccaacctcgc cgccgagggc aagcgggtca cgttggtcag caaggacatt 1224781 ccgctgcgcg ttaaggccgc cgcggtgggg ctggccgccg acgagtacca cgcgcaggac 1224841 gtcgttgtgt ccggatggtc ggggatgcac gagctcgaga ccgcttccgc ggatatcgat 1224901 gcgttgttcg ccgatggcga gatcgacctg gtcgaagccc gggacctacc gtgtcacacc 1224961 gggattcggt tgctgggcgg cggttcccac gcgctgggcc gggtcaatgc gcataaacgt 1225021 gttcagctgg tgcgaggtga ccgtgaggcg ttcggtctgc gtggccgctc cgccgagcag 1225081 cgggtggcgc tggatttgct gctcgatgag tcggtgggca tcgtgtcgct gggcggcaaa 1225141 gccggcacgg gcaagtccgc tttggcgttg tgtgcgggtc tggaagccgt gctggagcga 1225201 cgcacccacc gcaaggtggt ggtcttccgc ccgctgtacg cggtcggcgg ccaggagctg 1225261 ggctacctgc ccggtagcga gagcgagaag atgggcccgt gggcgcaggc ggtcttcgac 1225321 accctcgagg ggctggccag cccggcggtg ctcgaggaag tgctgtcccg tggcatgctc 1225381 gaggtgctgc cgctgaccca catccggggc cgctcgttgc atgactcgtt cgtcatcgtc 1225441 gacgaggcac agtcgctgga gcgcaatgtg ttgctgaccg tgctgtcccg gttggggacc 1225501 ggttcccggg tggtgttgac ccacgacatc gcccagcgcg acaacctgcg ggtcggccgc 1225561 cacgacgggg tcgccgcggt gatcgagaag ctcaaaggtc atccgttgtt cgcccacatc 1225621 accttgctgc gcagtgagcg ctcgccgatc gccgcgctgg tcaccgagat gctcgaggag 1225681 atcaccgggc cgcgctgagt gcgcctcccg cgagcagaca cagaatcgca ctgcgccggc 1225741 ccggcgcgtg cgattctgtg tctgctcgcc ggtagacttc ctgggtgccg aagcgacccg 1225801 acaaccagac ctggcgctac tggcgcacgg ttaccggtgt cgtggtcgcc ggtgcggtgc 1225861 tggtggtggg cgggcttagc ggccgggtca cacgggcgga gaacctgagc tgttcggtca 1225921 tcaagtgtgt cgcgttgacc ttcgacgacg gtccggggcc ctataccgac cggctgctgc 1225981 acatcctgac cgacaacgac gccaaagcca ccttcttcct gatcggcaac aaagtggccg 1226041 ccaaccccgc cggcgcccgg cgcatcgcgg acgcgggcat ggagatcggt agccatacct 1226101 gggaacaccc caatatgacc acgattccgc ccgaggatat ccccggccaa ttctccaggg 1226161 ccaacgatgt gatcgccgcg gcgaccggcc gcacgccgac gttgtatcgc ccggccggcg 1226221 gactgtccaa cgatgcggta cgccaggccg cggccaaggt tgggcaagcc gaaatccttt 1226281 gggacgttat acctttcgac tggatcaacg actccaacac ggcagcaacc cggcacatgc 1226341 tgatgacgca gatcaagccg ggttcggtgg tgttgttcca cgacacctac tccagcaccg 1226401 tcgacgtggt gtaccagttc atcccggtgc tcaaagccaa cggctatcgc ctggtgaccg 1226461 tcagcgagct gctcgggccg agggcgccag gaagcagtta cggcagccgg gaaaacggtc 1226521 cacccgtcaa cgaactgcgt gacattccgg ccagcgagat cccgccgttg cccaacacct 1226581 catcgcccaa gccgatgtcc aacttcccga tcaccgatat tgcgggtcag aattcgggcg 1226641 ggccaaataa cggtgcgtaa cctcaggact tgttgacctt cagcgcctca atgaccctct 1226701 cgacggtggc gcgcgaggtt gcatcaccga tgggggtggc gcccaggaag acggtgaccg 1226761 gcttggtgtc gaccgcgatg atcgtgaccg aatcaccttt gacgttgcgt gaactgtcgg 1226821 cgattgtgat atcggcgtct acccgggcgg ccctgacccc gtcgacggtg atcgacgacg 1226881 tcttggtcgg gcccagggtg ggcgacgagc ctgcgtagcc ggggccgtcg gccacgcatt 1226941 gcatcaactt cgatgcttgc gcggcgacgt ccatggtggt gacgaagttg gttatcgcaa 1227001 cctcggcttg catcatccac tggtcggcac cggccacctc gtggccgacg cccaccgcgt 1227061 cgatgaggtt cgggttctgg tcgtcggaga acgccgacca cccgggtgcc gcgctggtcg 1227121 ggaacgacag cttacccgca ctgatcgaat cgccgatggg ctgcacaccg ccggacacat 1227181 ttggggtaca accggttgcg gtttgctggg aaaacggttg cgacgtggga gcactcgtcg 1227241 ccggagaggt tgccgtggtc gacttgttgt cgccgcggag gccgatcacc aggatcacca 1227301 ccagtaggat gacacccagc accgcgaggc cggcgaggat cagccacggt gtcttcgatc 1227361 ctggcccggg cggaggtggt cctggcggat agggccccgc cggccagccg ggcggatact 1227421 gctggggtgg gtaggccggc ggataggagc cgccctgcgg ttggcctccc caatacaggt 1227481 cctgcccata cgtattcggg ccgtaggggt agttgccgta ggggccagcg ggaggaaccg 1227541 tcatagccga tcgctgtcga gctgctcggc cttggccatt gccagcacgt ccagacggcg 1227601 gtccagatcc tcgatcgaca gcctgtcgcc gatcaggcca cggtcgatca cggtttggcg 1227661 aatcgttttg cgttccttga gtgcttgctt ggcgacggcg gccgcctcct cgtagccgat 1227721 ggccgaattc aacggtgtca cgatcgacgg tgaggactcg gccagccgcc gcaggtgctc 1227781 gacgttggcg gtcagccctg ctatgcagcg ctgggcgaac agccgtgaca cattggtcag 1227841 cagcttgaag gactcgagga tgttgcgggc catcatcggg atgtagacgt tgagttcgaa 1227901 tgcgccgttg gccccacccc aggcgatggc ggcgtcgttt ccgatcacct gcgcggcgac 1227961 ctgcgtaacc gcctccggca gaaccggatt cacctttccc ggcatgatcg agctgcccgg 1228021 ctgcagatct ggcagttgga tctcggccag gccggtcaat gggcccgatc ccatccagcg 1228081 gatgtcgttg gcgatcttgg tcagcgatac cgcgatcgtg cgcagcgccc cggacgcctc 1228141 caccagcccg tcgcgggcag cctgagcttc gaaagaatta gccgccgtac gcaattccga 1228201 cagaccggtc tgcgcgacca gcaccgcgac cactctgacg ccgaagtcgt cgggagcgtt 1228261 gaggccggta cccaccgcgg tgccgccgat cgccagctcg cccagcctgg gcagacacgc 1228321 gcgcacccgc tcgatgccgg cctcgatctg gcgggcatat ccgctgaact cctggccgag 1228381 tgtcaccgga acggcgtcca tcagatgcgt tcggcccgac ttcaccaccg tgtgccaatc 1228441 aagagccttg gcggccaatg cgtcgtgcag ctgctgcagc gctgggatga gatgagcgac 1228501 cgcggcctcg gtggccgcga tgtgggtggc cgtcgggaag gtgtcgttgg acgactgcga 1228561 catgttcacg tcgtcgttgg gatgcaacgt gaccccgccc ttggccgcga tggacgcaat 1228621 cacctcgttg gtgttcatgt tggagctggt gcccgagccg gtctggaaga cgtcgatggg 1228681 aaactggtcg tcgtgttgac cgtcggcgat ctcggcggcc gcggcgatga tggcgtcggc 1228741 tttctccggc gccagcaacc cgaggtcgga gttcacctgc gcgcaggcgc ctttcagcag 1228801 gcctagcgcg cggatctggg tgcgctccaa cccgcggccg gatatcggga agttctccac 1228861 cgcgcgctgg gtttgcgcgc gccacaacgc ttttgccggc acccggactt cgcccatggt 1228921 gtcgtgctcg atgcggtaat tggcgctgtc ggcgtcaacg gccattgatc gggttccttg 1228981 tgtgtcgtgg gtgtgttagg gcaatgggta cacggcgctg ctgtcgccgg tgaagtcgat 1229041 cgcggagtat tcgttgagct ttgaaagccg gtggtaggcc tcgatcatcc ggacggtgcc 1229101 ggacttcgag cgcatcacga tcgaatgggt ggtgcagccg ccggggtagt aacgcactcc 1229161 cttgagcagg tcgccgtcgg tgaccccagt ggcgcagaag aagacgtttt ccccggacac 1229221 cagatcttcg gtggtcaaga cctggttcag gtcgtaaccg gcttctaggg ccttgcggcg 1229281 ttccgcgtcg tcgcgcgggg cgagctgcgc ctggatcgcc ccgcccatgc agcggatcgc 1229341 cgcggcggcg atgattccct ccggggtgcc gccgatccca gctagcaggt cggtgccgga 1229401 gtgcggtcgg cacgccgaga tcgcgccggc gacgtcgcca tcggtgatca gccggatccg 1229461 ggccccggtg gcgcggacgt cgtggatgag ttgcgcgtgc cgcggcctgt ccaggatgca 1229521 caccgtcatg tctcgcaccg acaggtcctt gaccttggcg accgctcgga tgttttccga 1229581 gatcggcgcg gtgatatcca gcacgtgtgc ggcatcgggg ccgacggcga ttttgttcat 1229641 gtagaacacc gccgacgggt cgaacatggt gccgcgatcg gctaccgcca gcaccgagat 1229701 ggcgttggtc atgcccttgc tcatcagcgt ggtgccgtca atggggtcga cggcaaagtc 1229761 gcattccggt ccgtcgccgt tgcccacttc ttcgccgttg tagagcattg gtgcgtggtc 1229821 cttttcgcct tcgccgatga ccaccacccc gcgcatggaa accgagttga ccagttcgcg 1229881 catcgcgtcg accgccgcgc cgtcgccgcc ctccttgtcg ccgcggccta cccagcggcc 1229941 cgcggccatg gctccggcct cggtcacccg gaccagctcc atggccaggt tgcggtccgg 1230001 ggcttcccgg cgcgatggcc tggtgtgcga cgggtcgtgg ctggccaccg cggccgtcga 1230061 cgaaccggat ccctcagctg tcatggttgg tgattgtccc agaagccgaa ccgtgcgctg 1230121 gagctgggat actggccatg tgaccgccga gccgcagccg acccctaggc cggctaaacc 1230181 gcggttgctg caggacggcc gcgacatgtt ctggtcgctc gcgccgctgg tcgtggggtg 1230241 catcctgttg gcgggcctgg ttgggatgtg ctcgtttcaa ctgggcggga ccaagcgggg 1230301 accgatcccg tcctacgatg cggcccaggc gctgcgggca gacgccaaga cgctgggatt 1230361 cccgatacgg ttgccgcaat tgccaggcgg ctggacgccc aactccgggg gtcgcggcgg 1230421 catcgagaac gggcgagcgg acccggcaac cggtcaacgc cgcaacgcgg cgacctcaat 1230481 cgtgggattc atcagcccga ccgggagata tctgagcttg acccagagca acgccgacga 1230541 ggacaagctg gtcggctcca tccacccgtc gatgtacccg acggggacgg tcgacgtggg 1230601 cggcacccgt tgggtcgttt acgagggttc ggacgaaaac ggtgccgtcg agccggtatg 1230661 gacgacacgg ctcaccggac cgggcggggc cacccagctg gcaatcaccg gtgccggcag 1230721 catcgatcag ttccgcacgc tggcgtcggc gacgcaatcg cagcccccgt tgcccgcacg 1230781 atagcgggtc tcactcagcg gttgacggag gcggggcgtt tcttgacgtg gccgggcctc 1230841 gacgcggcag ccacctgcgg cggacgggtg gtgcgtcgaa ctgttccagt tcgacgcctt 1230901 tgtacaccgc gaggtagacg tcgatggtgg tgacgatgag gatcatcagc accgggccga 1230961 tgatgatacc ccaggggccg aacatggtga taccggcgaa caccgacagc aacatcagcg 1231021 ccgagttcag ccgcgcgtcg cgcggcacca ggatcggccg caggacgttg tcgatgttgg 1231081 taaccaccag cagatgccac agcagcacga agattccccc ggcgatattg ccgtagaaga 1231141 tcatcccgat gccgaacgga atcgtcacga tgccgccgcc cagcgggatg atcgacaacg 1231201 cggtgagcac gatggcgaag atgaagaagc cgtggtgaaa tccggcgatg tagatcgatg 1231261 cggcgccggc gactccctgg cacgccgcga tgacgaactg gccgttcacc gtgccgcgga 1231321 ccatcgagcc catcttctgc aggtacagat ccgtgacgtc ttcgccgagc gggttgagct 1231381 ggccgatcag tgtccttagc ttctcgcggt tcaccaagag cgcgacgaac acgtacacaa 1231441 agatgatggc cgacgtgatg acaccggcga ggcttccggc ggcgtcgcgc aggaagtgca 1231501 gcagccattc gccgacgttc tgtgctaccg aaatcatcgc tttgcgcagt gcgtccgcgg 1231561 taaccgtgat gtgcaggaac ggcacccggt caaacaagcc gttgacgaat tgcaggatct 1231621 tgtcgccgag ggtgctcaga tcggtcgtcc gcacccagtc ggcgacggag tcgaccatgc 1231681 gagcgatctg cacgatcgcc agccccacca aggctcccac cggcacgacg acggcggcca 1231741 gcgccgacaa caacgtgcag gcggccgaca ggccggtatt gaagcgcttg gtgaaccact 1231801 tgaaaagtgg cgtgaacaat aggcgccgac ggctgccacc acgatcagaa cgaaatagtt 1231861 acgcaggaag tacgcaccga acagcaaagc gatcaacgtg aggatcgcca gggcgcgctt 1231921 ctgagtgagc gtgaattcgg tgttcaaagc gggtccgccc ttcgcttctt ggtgctgact 1231981 ctgcgtccag caggcgggtt actcgcacta ttgcgtggtg gatgcagagt caacggatgt 1232041 cggtgcagtg ctgtagacct atgccaccac ccaatcgagg tcgaacgcgt tgccgatggc 1232101 ctcggctaga gccggctcct gcgacgcgag caggtagccg atttgacgac caaggtcaca 1232161 caccgggatc gtttggatgt tgtcgcagct gacgactgac ggttgattca gcccgttcac 1232221 cgcgtctacc gggacttcgg tggctagccc acgcacggtt gtcgtgatcg gggcgacggt 1232281 gacgttcgtg aggtgcggac gtacgacctc gcgggtaagg atcaggacgg gtctagcctt 1232341 gtcaagctgt gcgatgtgga taggtcgcat cagtcgatgt cgagggcggt acgagcgcag 1232401 tggccggcca gcgtatccag atcacccgtg gctgacgtgt tggtggcgag gatctccgcg 1232461 tcgcgttcgg cgagacgacg gcggcgttcc cgttccagcg cccgcagcac gacagccgca 1232521 cggctacggg catgctgtcc ccggacttcg tcgtcgatga acgcgacaat ctcatcgggc 1232581 aagcgaaccg caatctgtgt actcacttca cagatggtac cagtttggta tgcacccgcc 1232641 ccaaaaccgt tcgcgccgcc ggcgaggacg accccccagg gtaggtacat tccagaagta 1232701 tggtcgtcga cagctgcgtg gccgaatccc gctatggtcc ggtccggggc gccgatgatg 1232761 gccgcgtcaa agtgtggaaa ggcatccggt atgccgcgcc accactaggt gacctgaggt 1232821 tccggacgcc cgaacctccc gaacggtgga ccgaggtcgc cgacgccaca accttcggtc 1232881 cggcctgccc gcagccggcc atccccaaca tgccgctcga tttaggggcg tcgcagagcg 1232941 aggactgttg gagcctgaac atttgggcgc cggcggacac cgagcccggt gacggaaaac 1233001 ccgtgatggt gtggctgcac gggggcgcct acatcctggg atcgggcagc cagccgctct 1233061 ataacggccg caggttggcc gccagcggcg acgtggtcgt ggtgacggtc aactaccggc 1233121 tcggagcgct tggcttcctg gacttgtcgt cgttcaacac gtcacggcga cggttcgact 1233181 cgaatatcgg cctgcgtgac gtgctggccg tgctgcgctg ggtagcagac aacatcgcgg 1233241 tgtttggcgg cgatcccgag aaggtcacgc tgttcggtga atccgcgcgg gaatcgtcac 1233301 gaccctgctc gccaccccgg cggccgcggg tctgttcgcg gcggcgatcg cccagagctc 1233361 accggcgaca tcggtctacg accaggtgag ggctcggcgc gtcgcggttt gcgtcctcga 1233421 caagctggga atcgacccgt ccgatgtgca caggttcatg aagtgccgac cgcggcaatc 1233481 ctttccgcgt ccagcgaagt gttcaacgaa gtgccggttc gtaaccccgg cacgctggcg 1233541 ttcgtcccga tcgtcgacgg cgatctgctg cccgactacc cggtcaagct ggcgcaggag 1233601 ggccgctcac acccggttcc cttgatcatc ggcaccaaca agcacgagtc ggcgctcttt 1233661 cggttgatgc gctcgccgct gatgccgatc accccgcgcg atcacgtcga tgttcaccca 1233721 gattgccgcc gaacagcccg atctgcaagt gccaaccgag gagcagatcg gctccgcgta 1233781 ctcgcgatgg cggcgcaaag cacgctcatt gagtatggct accgacgtcg gcttccggat 1233841 gccgtcggtg tggctcgctg aagggcacag cggggtggcg ccggtgtatc tgtatcggtt 1233901 tgactactcg actccgctgc tgaagctgct gctggtccgg gccgcccatg ccaccgaatt 1233961 gccttacgtc tggggcaatc tcggaggatc ccaggaccct gcattgaagt tgggcgacgc 1234021 caaagccgcc atagcggtgt cccggagggt acggacgcgg tggatcaatt tcgcgacgcg 1234081 gggcaaaccc acgggtcccg atggcgagcc agactggcca tgttacgagg aggcccatcg 1234141 tgcctgcctg attatcggca ggcgagacgc cgtcgtgcac gacgtcgacg cacacatccg 1234201 agcgacctgg ggcagcaagt ggtgagtttc agataattct ggctacggct tgactgtggc 1234261 ggccgttttt tccgcccggg cctcgttctt catctgctca aacagactca cgtagtacgg 1234321 caggcattcg gtcagcgcct gctgggtggt gaacagcggc tcatagccca ggtcgcggcg 1234381 tgccttagcg atcgaaaagt agttgtccag gtacagtcgt tcgacggcca gcggctcgag 1234441 cagcggcgcg gggaatccga accggaagtg cagccgctgc caccccgtca ttacccagcg 1234501 gaccgcgggg ccggaaatcc gcatcttcgg ccagcgctgc ccgcacgcct cgagcaccgg 1234561 ccgagcgaac tcgaacatat tgatcggctc tgcgtcgttg atgaagtaag cctgcccggg 1234621 cgctgtgccg tccggcacca gatgggcagc ggccaagatg aaaccgtgaa tcaggttgtg 1234681 cacgtaagag ttatccagcc gggccgactt gcgcccgacc agcaccttga cgtggccctt 1234741 gagcacactt tcgaacagct tgcggaacat cgtctgatcg ccgtttcccc agatgccgct 1234801 gggccggatc gcgcacgtca gcatgccgtc gacaccgttc tgggccaaca cgaatcgctc 1234861 ggcaaccacc ttggtctcgg tgtagaggtc gttgaaccgg tcggtatagg gcagcgtctc 1234921 gtcaccgccg gcgatgttct ggccgcccat caccacactg ttggatgacg tgtagacgaa 1234981 ccgctgcacc ccggcccgct ggccggcgtg cagcaggttc tcggtgccgc cgacgttgac 1235041 cgcaaagcta cgttggcggt actcgtcggt gaccgacgcg ccgcccatca gctcgatgat 1235101 cgctgcggtg tggaagatcg tgtcgatgcc gtccacggcc gcggcgcaga cgtccgcgtc 1235161 ggtgatgtcc ccttgcagca cctccagttg cggatgcgca ggcaacagcg acggcgcgcg 1235221 gtcgaaggaa cgcacccagt gcccgcggtc cagcaaggtg gtcaccaggt tggcgcccac 1235281 gaagcccgcg ccgccggtga ccagaacgcg gccgagctcg gttgtcagcg atgcatcacc 1235341 catgcggcga agcataacct tgccttagcc gttttgggcc tcgtcgccgg ccagcacatc 1235401 ggacacccgc tggcgtgcac cagctaagtg ctcctcgcac cttttggcga gttgctcccc 1235461 tctttcccac agtcgcagcg acgcatcgag gtccaatccg ccctgctcca gaagccgcac 1235521 gacttccatc agctcgtccc ggcaggcttc atagccaagc tgactgacag gcacagttgc 1235581 gtgggttctg cccgtgtcat cgccgttggg gtcacagacc attggtttgt ccttcactga 1235641 ccgccgctag ggctccgtcg gcaacccgca cgcgcagctt ggtgccttcc ggtgcgtcgt 1235701 ggaccgaccg cagcacctgt ggttcggatc cgccctcggg tcccgtctga gcaacggtct 1235761 gcactatggc atagccgcgg gcgagcgtgg cggccggacc cagcgtggcc aggcgtgcgg 1235821 ccagatgacc gatgcgttcg gtctcggcgg cgaccatcag ggtgaggttg cgacgaagcg 1235881 tcgagcgggc tcggtggacc tcctcggcgc gcacgctgac catcgtcatc ggatcggcca 1235941 gcaccgggcg gctacgcaac tgcgcgactg cccgttgctc gcgggaaacc cagttgcgca 1236001 acgcctgggc gctgcgccgg cgcagatcgt cgatcagccg ctgctcggct gcggtatcgg 1236061 gaaccacttt cttggcggcg tcggtggggg tggcggcgcg caggtcgacg accagatcgc 1236121 acagcggatt gtcgggttcg tgaccgacgg cgctgaccac gggcgtacgg caggccgcga 1236181 tcgcgcggca caacgtctcg tcagaaaacg gcagcaggtc ctcgacggag ccgccgcccc 1236241 gggccagcac gatcacgtcg acgtccgggt ctcgatcgag ctcgcgcagc gcctcgacga 1236301 tctggccgac ggcgttgggg ccctgcacgg cgacgttgcg gacggcgaaa cgtgccgctg 1236361 gccagcgcgc cgaggccacc gtcgtaacgt cacgttcggc ggcactcgca cggccggtga 1236421 tcagaccgat catgttgggc aggtacggga tcggccgctt gaggcggggg tcgaagagcc 1236481 cctcggcgtc cagcagccgg cgcagccggt cgatgcgtgc cagcagctcg ccgatgccga 1236541 cagcgcgaat ctcgctgagc cgcaaggaga atgtgccacg tccggtgtag aacgagggct 1236601 tgccgcagac cactacctga acgccttcgg ccagcttcac cggcgcggac agcaccaggt 1236661 cgcgggaaca cgtcacggtc agcgacatgt cggccgcagg atcgcgcaat accatgaaca 1236721 ccgtcttggc gtctgggcgc attgtgatct gggccaattg cccctccacc cagaccgcgc 1236781 ccagcttgtc gatccagccc gcgacccgga ttgccaccgc gcgaaccggg aacggattct 1236841 ccgccgaatt ctgggtcact tcgcagtcgc gcgggtgatc ctgttggcga gcagcgtctg 1236901 gaacggggca cgggccttgg tggcctgctc gtaggccagc agggcctcga gctcggggac 1236961 atcgagcgtg tgcagcctgg cccgcagctg ggccagcgtc agcgccggat agtcgagttc 1237021 ggctgccacc gcaggcgtcg gaactgtcgg cttggccgcc gacttggggt gcttggcggt 1237081 tttgggattg gtcgagcgat ccgcactcct ggatgctgtc gtcgtttccg gcgtatcgga 1237141 taccgagtac aacgcgaacc gcccgtcgga ccggcgatcg tcgttcttgg cttcgctggc 1237201 atccgacaag ccgagcaatg gaatcgaagt cccttcgagc gcgtcgggca agtcctcgtc 1237261 gaatgttgcc cactccggct tctcgtcctt gggcggaaac agcgtctcca gggtgttgtc 1237321 gcccttgatc accagttcgg ccaggccctg ttggaatcgc atcaccacgt gcgccgcctg 1237381 gctggccagg gtcattgggt acatcaggat ggttcgtggc agcttcatcg tctcctcaac 1237441 ggcgactgtc gccgcgccga ccaatagccg aaccccatac ggtgcagtag ccatggatcc 1237501 aagactgcct caagcagcgg ctaactccaa gccggtggcc gtgagctggc gggttcgtgt 1237561 cggcccaaag taccctgaat gccatggttc cgacggtcga catggggatt cccggggctt 1237621 cggtatcgtc gcgatcggtg gccgaccgtc ccaaccgtaa gcgggtgctg ctggccgagc 1237681 cgcgtggcta ctgcgctggc gtggatcggg ccgtcgaaac ggtcgaacgc gcgcttcaaa 1237741 aacacggccc gcctgtctac gtgcgtcacg agatcgtgca taaccgccac gtggttgaca 1237801 ccctggctaa ggccggtgcg gttttcgtcg aagagaccga gcaggttccc gagggagcga 1237861 ttgtggtgtt ctccgcgcac ggggtcgcgc ctacggtgca cgtcagcgcc agcgagcgca 1237921 acctgcaggt cattgacgcc acctgcccgc tggtcaccaa ggtgcacaac gaggccaggc 1237981 ggttcgcccg ggacgactac gacatcttgc tgatcggtca tgagggccac gaggaagtcg 1238041 tcggtactgc tggggaagct cccgatcatg tgcagctggt cgacggggtg gacgccgtcg 1238101 accaggtgac cgtccgtgac gaggacaaag tggtttggct gtcgcagacc accctgtccg 1238161 tcgatgagac catggagatt gtcgggcggt tgcgtcggcg tttccccaag ctgcaggatc 1238221 cgcccagcga cgacatctgc tatgcgaccc agaatcggca ggtcgcggtc aaggcgatgg 1238281 cgcccgagtg cgagctggtc atcgtggtcg gctcgcgcaa ttcgtcgaat tcggttcggc 1238341 tggtcgaggt ggcgctgggt gccggggcgc gggccgccca cctggtggac tgggccgacg 1238401 atatcgactc ggcctggctg gacggcgtta ccacggtcgg cgttacgtcg ggggcatcgg 1238461 tccccgaggt gctggtgcgc ggtgtgctgg agcggctggc cgaatgcggc tacgacatcg 1238521 tgcaaccggt gacaacggcc aacgagacgt tggtgttcgc attgccccgg gagctccgct 1238581 cacctcgctg agcacatccg ctcacggtta gacgtcgtat tcccaggatt cagccggtgg 1238641 tctgcgcggt gcccgcgaac gatcccgccg atcgaaccgc tgctcctcgc ggtagttgtc 1238701 ccgccgcgcg tcgcgagtag ctgacccgcg gtagcggacc tgcgagatcg gatggtgtgt 1238761 cgggttggtt ggctcgctgg gacgggcgcg tcggcgttgg ggctcgtagg tgggctcgta 1238821 gcgcgcatag ggctggtatc gagcaccccg acgttcgtac cggttgacgg gctcggcagg 1238881 cccggagggt tcggacggct ggtagctgcg gtaagaatcg aagcggctgc tgcgcggtgc 1238941 cggccgttcg tgggcattgc ggcgcgggtg cggatcattc tgcggcctag ggcggcggcg 1239001 cgatcgacgc tcggctatgg gttcgcggtt gtcctccgac gggggtcggg cgtgtcggga 1239061 acgagtacgg gcaggccgct gggccgaccg ccggccgccg tcgtcgtccg aatcgccggt 1239121 catcagcgag ctgagcttcc tggcgatgct gtcgaacaga gccgtcccga gataccacct 1239181 gaccagtccg atcagcagca cgccggcagc cgtgcccagc atcagcggga aacgttcgat 1239241 gagcgagtag ccgcagttga tcaagaggtc tttgaacttg ccgatcgtgc ccccgtggaa 1239301 cagccagtag gccccgggca cggcgcagaa aagtatcagt ggcggctgga cgagcgcggt 1239361 gaacaggtcc gactgccgga cggccaggac cgcccccacg cagccggcga tatagcagcc 1239421 ggtaaagacg agggttagcg ccttgtggcc cgatccggcg tcgattgcat acccgatcgc 1239481 cgtcgcggtg acggcgatca ggatggcagc ccaccacggc acacctggga tgtgggggtg 1239541 aatcgagcgg tgacttgcct gtaccgccga cctcgcccgc tgcgctgaca cacgtcgacc 1239601 gtaccggcaa tggcgccgaa ggcggcaccg cctcgcctta aacttggctc tctgtgagct 1239661 tgagcctggg gatcgtgggc ctgcccaacg tcggcaagtc gacacttttc aacgcgctga 1239721 cccgaaacaa cgtggtcgcg gccaactacc cgttcgcgac gatcgaaccg aacgaaggtg 1239781 tcgtctccct gcccgatccc cgcctggaca agcttgctga gcttttcgga tcgcagcgag 1239841 tcgtacccgc gccggtcacc ttcgtggata tcgccggcct gatcaagggg gcgtccgagg 1239901 gagccgggct gggtaacaag ttcctggctc atatccgcga atgcgacgcc atttgtcagg 1239961 tggtgcgggt gttcgtcgac gacgacgtga ctcatgtcac cggacgggtc gatccccagt 1240021 ccgacattga ggtcgtcgag accgagctga tcctggcaga tctgcaaacc ctggagcggg 1240081 ccacgggccg gctggagaag gaagcgcgca ccaacaaggc gcgcaagccg gtctacgacg 1240141 cggcactgcg tgcccagcag gtgctcgacg ccggcaagac gctgttcgcc gcgggggtgg 1240201 atgccgccgc gttgcgcgag ctgaacctgc tgaccaccaa gcccttcctg tatgtgttca 1240261 acgccgacga ggcggtgctc accgacccgg cgcgagtcgg tgagctgcgc gcgttggtgg 1240321 cgcccgccga tgcggtgttc ctggacgccg ccatcgagtc ggagttgacc gaactggacg 1240381 acgagtcggc cgcggagctg ctggagtcca tcgggcagag cgagcgcggg ctggacgcgc 1240441 tggcccgggc gggttttcac accctgaagt tgcagacctt tttgaccgcg ggccccaagg 1240501 aagcgcgggc gtggaccatc catcaaggcg acaccgcgcc gaaggcggcc ggggtgatcc 1240561 acagcgactt cgagaagggt ttcatcaagg ccgagatcgt gtcctacgac gacctggtgg 1240621 ccgcgggttc gatggcggcg gccaaggcgg ccggcaaggt ccggatcgaa ggcaaggact 1240681 acgtgatggc cgacggtgac gtagtggagt tccgattcaa cgtgtaggcg ggaaagccgg 1240741 gacgcagcca gagcccagat cccatggcat cattgcttgc atcgagtgat gcatgtattg 1240801 atgggagttg gtgaatgagg acgacggtga ccgttgacga cgccttgtta gccaaagcgg 1240861 ccgaattgac tggggtgaaa gagaagtcga cgctcctgcg cgaggggttg cagacactgg 1240921 tccgggtgga gagcgcccgg cggttggcgg ctctcggcgg caccgacccg caagctaccg 1240981 cggcgccgag acgccggacg tcgccccggt gatcctggtc gacacttcgg tatggattga 1241041 gcacctgcgc gccgccgacg cgcgactcgt cgagctgctg ggcgatgacg aggccggttg 1241101 ccatccgctc gtcatcgagg agctggcgct tggctcgatc aagcagcgag acgttgttct 1241161 cgatctgttg gccaacctct accagtttcc ggtggtgacc cacgacgaag tgttgcggct 1241221 tgtcggtcgg cggcggttgt ggggtcgggg actcggtgcc gtcgatgcca accttcttgg 1241281 ttcggtggct ctggttggcg gcgcgcgact atggacgcgg gacaagcggt tgaaggcggc 1241341 gtgcgcggaa agcggtgttg cgctggctga ggaagtgtcc tgagttgtat accgtcagcg 1241401 ttgctgggag taatcgaccc ggtgccgcgt ggcgcatgtt cggccatgtt cattgcccga 1241461 tttggcgcga tagcgtgatt tatgttgatt tgttacattc gcactgaacc cttccgtatc 1241521 tatttttata ttgttgcgtg acatatccgc tgtacgcgtg ggacgggcca ttatttggat 1241581 aatgcgtgat aagcaccaca agaattgatt tcctatggat attgtcggta gcgttcgcgc 1241641 ccatgattgc tcttgcaacg ctgttgacgc ttatcaatca agtcgtcggc actccgtata 1241701 ttcccggtgg cgattctccc gccgggaccg actgctcgga gctggcttcg tgggtatcga 1241761 atgcggcgac ggccaggccg gttttcggag ataggttcaa caccggcaac gaggaagccg 1241821 ccttggcggc tcggggcttt caacagggaa ccgcccccaa tgccttggtg atcggttgga 1241881 atggccacca cacggcggtg acgctgcccg atggcacgcc cgtatccagt ggtgaaggcg 1241941 gtggcgtgcg ggtcggtggc ggtggcgcct accagcccaa attcacccac cacatgtatc 1242001 tgccgatgga tgtggacgcg ggagaagacc agccgccggc gccagatgag ccggtcaccg 1242061 cggtcgacga cgtggaaccg gaaatgcctg caccgtgccc gacccagcgc ccgccggtga 1242121 ccccgagaca taacctgtgc aacaaactcc ggactatgcc aggggcgctc tcggccgcgc 1242181 tggccgcggc ggcgccggtc tggccggccc ctataagcgg ctgccgcggg ttcagcacgt 1242241 ccctcttagc aaaaagaaat cacccagtaa tcgtcgggaa atagagtgta cccaaaccaa 1242301 tccttccgtg gcggaaatat tcttggcgct tctccaacgc cttcgccaaa tcgttgtcca 1242361 cggaacgatt tcacttatgc aagcacggcg ctgccatacg gatgtgtagt cgaatggccg 1242421 acgaaccgcg cttagaagcc ggcgcgcacc ccttcgaaga gggccgggac aaggcccccg 1242481 aacttcgtgc cactcagatg gaccatgtcc ggttcaccga aggtcggcgt gaacgtaacc 1242541 gtgaccggct cgagcggagc cagcagttcc gccaaccggg tcgctgacag cgaccaactc 1242601 gtatccgtac tccggtgaca cgtcaatcga ctgcgatatc gacgtctggc cgaacaagaa 1242661 accgttgacg gcgttgcccg gagcctgcag aaccgcaccg gtggccccca acaggttccc 1242721 gctatgcagc gcaccgacaa atgccgtgtt gctgtcctgc aaccccctga ccgtgcccag 1242781 ggcacccata cgtcatcgtc gagcacacag cgtagccgcc gggcgctccg gctctgggtg 1242841 aaatgacgct ggggcctcaa ggccagcacc ggttacccac ttctcggccc cgggagcgca 1242901 ccatgcgcac ggcgatgtcg ccccgtcagg catgtgccca aaccgtggac aacgcacgtt 1242961 gtcaccgttt atcgtgagcg caaagtggga gtatggagtg tacgtgcccg gcccgggtac 1243021 cctgagcggc aatgatcttc atcgtcgtca agttcgagac caaacccgag tggaccgagc 1243081 gctggccgga tttggtcgca tcgttcaccg cggccacgcg tgccgaagag ggcaacctat 1243141 ggttcgagtg gtcccgcagc ctcgacgacc cggccgagta cgtcctggtc gaatccttcc 1243201 gtgacggcga ggccggcggc gtacacgtca acagcgatca cttcaggcag gccatgcggg 1243261 aactgccgaa ggcactggcg tccaccccca agatcatcag ccaaaccatc gatgcgacgg 1243321 gttggtcggc gatgggggag atgacggtcg ggtaaccggc gaggcccgat cagccgccca 1243381 cgtcgaccgc gatttcgtga cccagccgat aacccggcgc caggggcagc gagtcaccgc 1243441 tccagaactt gccggggtcg aaccagtgcg cgtccttgtc ggtgaccagc aatcccattt 1243501 cctcataggt gatggccacc gtctccgcgc aatatgccgt cgccaggccc atcgtgcgct 1243561 gctgttgctt acgtcgctgg gtttgttcgc gcaccttgcg gtccagcacc ggtatgccgc 1243621 gcagccaatc gttgagggtc ggaagccggc cgcgcagcca ccggccggtc aaccgggcgg 1243681 tggttgggaa aggcgtgccg ttcatccgcg cgatgacccg cagcagtttg tcctcctggt 1243741 cgcgattggc gtgcggtgtc agttgacgca gccagcaccg ctgccgataa cggccggccc 1243801 actgctgcac gacttggcgg gcgtcgttga gctgcacgcc gcggtggttg gtgccggtcc 1243861 atacgtcgag cagcttgtcg cccagttcgg catgccagat cagcggcggc aagtcgtcga 1243921 tggccaccgt catgccgacg tggttcaccg gggcgttcgt caaggtctgg atcgcccggt 1243981 cgggtcggga acggccgcga aacagccaga ggtcgccggt gcgggtttcg ttcagcgctc 1244041 gatccagcgc tagcgtgctc gggtccaccc catgcaccat aggcggatat agcctgtcgg 1244101 ggtgcgcaac gtgtggaagt gggtcgggct ggccggtgtc gccggcgtcg tcgcgggtgg 1244161 cgccctggtg gcgcgcgatc aacggaaacg acgtgcctac acgcccgacg aggtgcgggc 1244221 ccgattgcac cagaggctgg acgaatccga cgtcgacggt tatcagtcca ggtccggccc 1244281 gggtgccgcg tcgagcgaga acaggcgata gctgccgaaa cggatatcgg cacagtcgct 1244341 gacggcgtcg tgcaccggtt cgcctaccag gatctggccg ccgaccgctt gccccgcaac 1244401 ccgagcggtc attgcgacgt tgcggccgaa cagatcgtca ccgtgccgca ccgagcgccc 1244461 catgtggtgc cgatccgcac ccgaattccc tggttccgct tacgctttgc gctgttgcgc 1244521 agcgcgtcct ggatgtcgat gccgcaccgc accgcctgtt cggcgcgggc gaacgcgatc 1244581 atgaacccgt caccctgact cgtgaccatg tgcccggacc agcgccgcac cagctcatga 1244641 accagcttgt catgcgcgcc aatcaacttg acccatgtgc gatccccgat tcgttcgtcg 1244701 agcgcggtgg actcctcgat gtcggagaac aggatcacca cccgggcgtc cggggttacc 1244761 cgagccaggt cgggacgctc tacctcggcc cagtcggcgg ggtcctcgat cgagctgcgc 1244821 acggccgctc cgaacccttc tttgcgcacc aggttcgcgg tctgccagac cgtctttacc 1244881 gcttcacgac cacccgacag cattgcgggc gtcgagccgc tcccgcggtt gctcggtttc 1244941 ctggcgtatc cgcctcagcc ggatgcgcat cgggacgagt ccgccggcct cgatcgtggc 1245001 gatcccggcc aggatgtaga ccgcgatctg cagcgtcggg ttgtcgggcc aatgggctag 1245061 ggttgagttc ggccgccgcg ggaaagcaag tctggaggtg cgggtttggt tgacggcgga 1245121 ggtggcgcgt cagatctgtt ggtgatcttc ggaattaccg gtgacctggc ccgcaagatg 1245181 accttccgcg cgttgtatcg gctcgagcgc caccagttgc tggactgccc catcctgggt 1245241 gtggccagtg acgacatgtc cgtcgggcag ttggtcaagt gggctcgcga gtccatcggt 1245301 cgtaccgaaa agatcgacga tgcggtgttc gaccggttgg cgggccggtt gtcctacctg 1245361 cacggtgacg tcaccgacag ccagctctac gattcgctgg ccgaactgat tggctcggcc 1245421 tgtcggccgc tgtattacct ggaaatgccg ccggcgctgt tcgcgccgat tgtcgaaaat 1245481 ctcgcgaacg tgcggctgtt ggagcgcgca cgcgttgccg tggaaaagcc gttcggccac 1245541 gacctggcct ccgcgctcga actcaacgcc cggctgcgag cggtgttggg cgaagaccaa 1245601 atcctgcgtg tggaccactt tctgggcaag cagcccgtcg tcgagctgga gtacctgagg 1245661 ttcgccaatc aggcgttagc cgagctctgg gatcgcaaca gcatctccga gatccacatc 1245721 accatggccg aggacttcgg ggtggaggac cgcggcaagt tttacgacgc cgtcggtgcc 1245781 ctgcgtgacg tggtgcaaaa ccatctgctg caggtgctgg cgctggtgac gatggaaccg 1245841 ccggtcggtt ccagcgccga tgacctcaac gacaagaagg ccgaggtctt ccgggcgatg 1245901 gcgccgctgg atcccgatcg gtgcgtgcgt gggcagtacc tcggctacac cgaagttgcg 1245961 ggcgtagcaa gcgattcggc gaccgaaacg tatgtcgcgc tgcgaaccga gatcgacaac 1246021 tggcgctggg ccggggtgcc gatcttcgtg cgggccggaa aagagctgcc cgcgaaggtc 1246081 accgaagtac ggctatttct acgccgagtt ccggcattgg cctttctgcc caaccgccga 1246141 ccggccgagc ccaaccagat tgtgctgcgt atcgaccccg atccgggtat gcgactgcag 1246201 atttcggccc acaccgacga ctcgtggcga gatatccacc tggactcctc gttcgcggtg 1246261 gacctcggtg aaccgatacg accctatgag cggctgctgt atgccggatt ggtcggcgat 1246321 caccagttgt tcgcccgcga ggacagcatc gagcagacgt ggcggatcgt gcagccgctg 1246381 ctcgacaacc cgggtgaaat ccatcggtac gatcgcggtt cctggggtcc ggaagccgcg 1246441 cagtcgttgc tgcgcggtca ccgcggttgg cagtcgccgt ggctgccccg cggcacggac 1246501 gcatgagttc aaggagacga aaaggcgatg caactaggaa tgatcggtct gggccggatg 1246561 ggtgcgaata tcgtccgccg cttggccaaa ggtggacacg actgcgtggt ctacgaccac 1246621 gaccccgacg cggtcaaggc gatggccggg gaggaccgga ccaccggggt ggcctcgttg 1246681 cgtgagttgt ctcagcggct ctccgccccg cgagttgtct gggtgatggt gcccgcgggg 1246741 aacatcacca ccgcggtgat cgaagagctg gccaacacgc tcgaggccgg cgacattgtg 1246801 atcgacggtg gcaacaccta ttatcgcgac gatctgcggc acgaaaagct gttgttcaag 1246861 aagggaattc acctactcga ctgtggcacc agcggcggtg tgtggggtcg ggaacgtggc 1246921 tactgcctga tgatcggcgg ggatggcgac gcgttcgcgc gcgcggagcc gatcttcgcc 1246981 accgtcgcgc cgggggtggc ggccgccccg cgcaccccgg gccgagacgg tgaggtcgcg 1247041 ccatcggaac aaggctattt gcattgtggg ccttgcggtt cgggtcactt cgtgaagatg 1247101 gtccacaacg gcatcgaata cgggatgatg gcctccttgg cggagggatt gaacatcctg 1247161 cgcaatgccg acgtcggcac ccgcgtgcaa cacggtgacg ccgaaaccgc gccgctgccg 1247221 aatcccgagt gctaccagta cgacttcgac atcccggagg tcgccgaggt atggcggcgg 1247281 ggcagcgtga tcggctcctg gctgctggat ttgaccgcga tcgcgctgcg cgaatcacct 1247341 gacctagcgg aattctccgg acgggtctcc gactctggcg agggccggtg gaccgccatc 1247401 gcggcgatcg acgagggcgt gcccgcgccg gtgctgacca ccgcgctgca gtcccgcttc 1247461 gcctcgcgtg acctcgacga cttcgccaac aaggcgctgt cggcgatgcg caagcagttc 1247521 ggcggacacg ccgagaaacc ggctaactaa gtcgcctgac gaagtccacc acgacgtcgg 1247581 tgaacgcgtc gttgtcgtcg ccggcggcgg tgcgccccgc gttggacaat tcgacgaact 1247641 ccgcgttggg caccttggcc aggaagtccc gggcaccgtc ggaactgacc acgtcggaca 1247701 gctttccgcg aatcaacagg accgggatcg tcaggcccat ggcagcccgt tcgaagttct 1247761 cggtgcgcag ctgcgggtcg tgccccggcg cggtcatcat ggccggatcc cagtgccagt 1247821 gccagcgtcc gtctcgcagg cgcagattcc tcttcaggcc ctcgggactg cgcggcttgt 1247881 cgcggtgcgg cagatactcg gcgactgcgt cggcggcttc ctcgagcgaa ccgaagccgt 1247941 cgatgttgcc cagcatgaag tcccggatac gggcgttgcc ctccttctcg taacgcggca 1248001 ccacgtcgac caataccagt ccgttcaccg tctgcggacc ggcgcgctcg gcgaccagga 1248061 tgccagtcag tccgcccatg ctggcctcga ccaccaccac acggcggccg atcgcctcga 1248121 cgacgtgtag cacatcggtg gtcggggtct ccacggcata gtcggcgccg ggagcgcggt 1248181 cgctgtcacc gggtccgcgg gtgtccagcg caacgacgtg gtgcccctcg tcggccagga 1248241 tctggccggt gtttttccag gaaaaccggt tttggccgcc accgtgcaac atcaggatcg 1248301 tcggccgatc ggccgctgcg gcgccccgat tccactcgtc ggcgaccagg gtaatcccac 1248361 gagcaccgga aaacgcgacc gcttggggac tgctgctcac ggcgctcacg ggtcctgacg 1248421 ttaccttgct gggcacgcgc caaatcgtca tcgccgacct ggaggatgcg gtgatcaagg 1248481 tgccctagct actggcctct tgggttccgc cggttacgtt ggaccatgcg ggctggacgc 1248541 ggcgaacggg agtcaacatg gcggacgaca atggctgaac cacactggat tgacgtgaag 1248601 ggtcccaacg gcgacctgaa agccttgacc tgggggccgg ccggcgcgcc agttgcgttg 1248661 tgcttgcacg gctttccgga taccgcctac gggtggcgca aggtcgcacc ccggctggcc 1248721 gagtccggct ggcacgtcgt ggcgccgttc atgcgtggtt atgcgccgtc ttcgattccg 1248781 gccgacggca gctatcacgt cggtgcgttg atgcacgacg ccctgcgggt gcgctcggct 1248841 gccggtggca ccgagcgcga tgtgatcatc ggccacgact ggggcgcgat cgccgctacc 1248901 ggcctggccg ccatgcccga cagcccgttt gccaaggcgg tgatcatgtc ggtgccgccg 1248961 tcggcggcat ttcgcccgct gggccgggtg cccgagcgtg gccggttgct gcgtgagttg 1249021 ccgcatcagc tgctgcgcag ctggtacatc ctgtacttcc agttgccctg gctgccggag 1249081 cgatccgcct cctgggtggt gccgctgctg tggcggcgtt ggtcgccggg ctatcacgcc 1249141 gaggaagacc tgcggcatgt cgacgccgcg atcgggacgc cggagggccg gcgggcggcc 1249201 ttgggaccgt atcgcgccac catgcgcaac acccgggccc cggcggacta tgccgacttg 1249261 aatcggctgt ggaccgaggc gccgaagctg ccggttctgt acctgcatgg ccacgacgat 1249321 ggctgtgcca catcggcatt cactcattgg acggcaaggg tgttgcccgc cggcagtgag 1249381 gtggccgtag tggaacacgc cgggcacttc ttgcagctcg agcagccgga caagattgca 1249441 gagttgatcg tggcgttcat tggctcaccc ggctgaagtc gtggccgggc accggatggc 1249501 ggccgtcgac gcgcagttct actggatgtc ggccaaagtc cccaacgacc agttcctgct 1249561 gtatgcgttc gatggtgaac ccaccgatct ggaacgtgcc gtcgcgcagg tctaccgtcg 1249621 agcccgtggg tgtccgggct tagggatgcg agttcaggac cgtggtgctc tggcctaccc 1249681 gcagtgggtg cccacacccg tgcaacgtga ccaactggtc tgccacgacc tggccgatcg 1249741 cagctggcaa ggttgtctgg cggccgttgt cggcctcgcc ggcaagcagc tggatatgcg 1249801 ccggatgccc tggcggctgc acgtgttcac cccggtgcac gacgttccgg gcgtcagcgg 1249861 cctcggcacc gtcgccgtca tgcagttcgc gcatgcgctg ggcgacggcg cgcgggcttc 1249921 ggcgatggcc gcgtggctgt tcggccggcc ggccgcggtt cccgaaatag ccaggtcgcg 1249981 tgcgggtttc ctgccgtggc gggccgccca tgcggcccgc gctcatctcc gactggttcg 1250041 tgataccaat gccgggctgg tagcgccagg tgtcggatcc cggccgccgc tgtccacgaa 1250101 tgcccgcccc gaaggtgtcc gcgcggtgcg caccctgctg cggcggcgct cgcaactagc 1250161 cggtcccacg gtgaccgtca cggtgctcgc cgcggtgtcc accgggctgt tgggtctgct 1250221 tggcggggat gtggacacgc taggcgccga agtacccatg gccaaaccgg gtgtgccacg 1250281 gtcatataac cacttcggca acgttgtcgt tgggctgtac ccgcggctgg agcccgatga 1250341 gcgggtgcgg cggatcgcaa ccgatttggc caacgcccgc cgtcgctttg aacatccggc 1250401 gatgctctcc gctgaccggg cctttgcggc ggtaccggcg gcgctgctgc gttggggcgt 1250461 atcgcagttc gacgctgagg tgcggccggt gcgggtggcc ggcaataccg tggtgtccag 1250521 tgtttatcgc ggggctgccg atctgagctt cggggacgct ccggtggtgc tgacggccgg 1250581 gtatccggcg ctgtcgccgg cgatgggtct aacccatggc gtgcacggca tcggtgatac 1250641 cgtcgcgatc agtgtgcacg cggccgagtc tgcggtgtct gacatcgacg cctacatgcg 1250701 gctgctggac gcggctctgc agtgaaaact actgggcatc accggattta gccgcttcgt 1250761 ctcgtgtcag cccgacggcc tggatcagct cctcgtgtag ttcgaaccac acggtgtggt 1250821 aggagtcgat gagtgggcgc gtcagccagg cgatgtcgcc cgctttgacc ttgtccagcg 1250881 ccgcacgcaa tttcaccggg tacctgctca accgcggcag ctgcatggcc accgtaccga 1250941 tgatcgggcc cacccgccgg tgtacgccat cgaggcggga cagcaccgcg gcgtcgtatt 1251001 cggcgtcgtc gtgtgtgtta ggcttttcgc ccttgagctg ccagtcggtg accagcctct 1251061 tgaaatcggc gttgacggaa cggaaatcgc ggtaagcggc agccagcacg gtcgaatcgg 1251121 cccggttgcg ctcctcggca agcaagtcgt cgagcctcat ccggccgctg ggactgatcc 1251181 gcaacggcgt ggcgtcgacc aggaggccgg ccgcggtcag cctgtcgacg gtcgcggcga 1251241 cgtcggcaag gtcttcaccc aaggtctgcg ccaggtcggt ggtgatcacc cggcccttga 1251301 gccgcacggc ctgcagtacc gtcaactcgc tcatgaactg atccgttgcg cgatgtcggc 1251361 cagctcgcgc aactccggcg tgtcactttc cgaccaggca gacaatgcca gaacgccttg 1251421 gcgcacttcg ccttcatagc cgtcgacggt gatctccttg ccggccagtg ccgccgcgac 1251481 cccgggaccg caacccacca cggccactcg accgagctcg cggctaacca ccgccgcatg 1251541 actggcggca ccccccacct cggtgacaat gccttgcgcg gcaagcatgc ccatgacgtc 1251601 ctccggtctg gtgtgatctc gcaccaagat gaccggctcg ccccggtccg cagcgtccag 1251661 cgcctcgtcc acctcggtgt aggcggtccc ggataccacg cccgggcaag cgggcaggcc 1251721 cttggccaaa agcggtgcgg ccaaccgtgt ttccggctgc agcgacggcc gtagcaaagt 1251781 ctcgatgtgc gtcggagtca cccggcgcag tgtctcggtg tcgtcgatga gtccctcgtg 1251841 atgcagttgc agcgccagtc gcacggcggc ctgcgccgag cgttccgccc cgcgggtctg 1251901 cagcagccac agctggctgt cctccacggt gaattcgatc tcctggacgt cgcctgccat 1251961 gcgctccaaa ctgcgggcgg ccgccatcag ttggtcgtag acggccggct gctggtcgcg 1252021 cagggcggtg atcggtgcga cggcgaccaa tccggacacc acgtcgtcgc cttggccgcc 1252081 gggtagccat tcgccgaacg gttcgttggc tccggtgatc gggttgcgtg aggacagcac 1252141 cccggcgccc gagttcgcgg tgaggttgcc gaataccatc gcctgcacca ccaccgccgt 1252201 accgccttgg tcgtcgaggc cgtgatggtc gcgataggca acggcgcgag gtgagttcca 1252261 ggaggcgaat accgcctcga tgctcgcgcg caactgggca tacgggtcgt cggtaatggg 1252321 accggcgctg ccgacgatgc gccgatacat gctggtgaat cgccgtctgg tgtcgtgggc 1252381 gaagtcggcg gcacccggcc tggcaagtac tcgttcgacc gcgtcggtca tgcccacgtc 1252441 cagaatcgtg tccatcatgc cgggcatcga ctgggtggct cccgagcgca cgctgaccag 1252501 cagcggattc gggccacggc cgaacgtgca cgaggtttct gtttccagcc agctcatccg 1252561 atccagcacg tcatcccaga tcgcggcgat cgtggattcg ggcgcggcga gatagcgcac 1252621 gcccacctcg gtggtaatgc agaatgcagg cggcaccggc agatggtgcc ggcgcatcat 1252681 gtcgatgccg tggcctttgt tgcccaggat ctcgcgtggg tagttcgcgc cgccgtccag 1252741 cgccacaacg gcgttttcga gagttccgtc ggggcaacca ttggctcggg tgatacgagt 1252801 catgggcacc ccttgatgct acttatgggc aacgccagac cgcccactgt gggcccacag 1252861 ggggcgcctt ggtcagcggt cggactactc agcttgtgtc tggtgttggg ccttacccat 1252921 gctgcgagac aacgccggct gccggtgatg gtggctggcg gcgtggacag cgcaccggcc 1252981 caacggcttg gttcgaccgg ctcccccgcc taacgctacg ggtcgccttc gtcgtctgcc 1253041 aggagctttt ccgggtgatg gaacgtattg actcgaggtt ggccgtggtc gagatgtggc 1253101 ggcggtagcc actcggtgtc gccgtgggcg ttcttgcggg tcgtccagcc acgttcggct 1253161 aacggatgat ggccaccgca gccgagtgtc aggtcattga cgtcggtgtt gcggcactgg 1253221 gcgtacggcg tgacatgatg gacttcacag taatagccgg gcacgtcgca accaggtgcg 1253281 ctgcagccac tgtccttggc gtacaacata attcgctgcg ccggggaggc caggcgcttg 1253341 gtgtggtaga gcgccagggc cttgcctcga tcgaatatcg cgaggtagtg gtttgcgtgg 1253401 cgggccagcc ggatcacatc cgatatgggc aagatcgtac ccccgccggt gaggcccgcg 1253461 ccggccgcgg cctccaagtc cttcagcgtg gtggtcacga tgatgctggc cggtaatccg 1253521 ttgtgctggc ccagattgcc acttgtcaac aaactacgta atccggcgtt gagcgcgtcg 1253581 tggttccgct gtgggcagct gcgggtgtct cgccgcgcct gctccttcga gggcgcgccg 1253641 ttcacacacg gtgccttctg ctcggggttg cacatacccg gggcggccag cttggcccac 1253701 accgcctcga tagtggcgcg cagctcgggg gtcacatatc cgctgagccg cgacatccca 1253761 tcgacatctt gctttcctaa cgtcaagccg cggcggcggg cgcggtcctc gtcggtgtag 1253821 tcgccatcgg ggttgaggca gtccatgatc cgcgcggcca atttggccag ctggtcggga 1253881 cggtactggg tggcctgctt agccaagtcc cgttcggcct tctccagggt cttgaggtct 1253941 acccaggatg gtaggcggtg cacgaaagca cggattactt caacatggcc gtcaccaatt 1254001 aacccgtggc gctgtgcctt tgcggtggcg gtgagtagcg gtggcagcgg ctcgccggtc 1254061 agcgcacggc gctggccaag gtcggcggcc tcggccactc gccgcttggc ctcgctgcgg 1254121 gtgatgcgca accggtcggc cagcgtcaat cccagcttgc cgcccagctc ctcctcggtg 1254181 gattgttcgc cgatctgatt gatcaacgtg tgttcgacgc tgggcagctg gcgtcgcgcg 1254241 gtctcgcagt gctccagcag cgccaggcgc tccggggtgg tcaatgcgtc aaaggtcagc 1254301 cccagcacgc gggacagcgc ggtagccaat gacgcgaagg cctccgtgat ctcctcccga 1254361 gtggaacaca tgactgaatg ctatgtgcag gcaccgacaa caatgcttgc ccagagcctg 1254421 ctgaaaccac agtaatataa ggggtttcgt tgtctgctgt ggcgtcgggc ggtcaaaccg 1254481 attgctcggt cgacgaataa ggcaagctgc tgcccgcgtt ctcgtcgacc gcgacgcgac 1254541 caccgagata ggggaacgca cgttgggcgc acgacgttcg gttgcagatc ttgcagcccg 1254601 ccccgatcgg gacctccgtg ctcgggtcgt ccaggacgac accggtggag tagacgagtt 1254661 tatgggcgtg cgcgaggtcg cagcccagcc cgaccgcgaa gttcttgtgc gggcccagat 1254721 acccgagccc gtcggcagcg gtggtcttgg ccacccagaa gtacgacctg ccgtcgggca 1254781 tttgcgccac ctggcggacg atcctctctg gctgggcgaa cgcgtcgtgg accacccaca 1254841 gcgggcagct gccgccgacc cggctgaagt gaaacgccgt cgcggactgt cgctttgaga 1254901 tgtttccggc cttgtcggtg cggacgaaga tgaacggtat ccctcgctgc cgcgggcgct 1254961 gcagtgtgga gagccggtgg cagacggttt cgaagcccac tccgaaccgg cggcccagca 1255021 ggtcgatgtc atagcgtaac tgctctgcgg cacggtggaa ttcgcggtag gggagcagga 1255081 aggcgccggc gaagtagttg gccagtccga tgcgcgcgac gccgcgggct tcggtgctga 1255141 gctggtcatc ggtggccacg atcgacgaga tcaggtctga ctggcccacc agcgccagtt 1255201 gggtggcgat ctggaaggcg cgctgtccgg gcatcagcca gtgggcgacc cgaaggacct 1255261 tggtgtcggg gtggtagcgg cgcttggcgg tgtcgggcag attgtcatcg atcaccaccg 1255321 agatgccgaa ccggtcccgc atcagctcgg ccagctggat gtccaatccg ccggtccgca 1255381 tcccgctttc ggtaaacatc cgctccgccg ccatgtccag gtcgtggatg tagttgttgc 1255441 ggtcgtagaa gaagtcgcgg acctcctcga acggcatcgg ccgcgcgggc ggtagctcgg 1255501 tttcggcggt cgcacgagat cggtagccct ctagttcctc ggtggcggcg cgcaaccggc 1255561 ggtgcacggc aaccaggctg tggccgacct cgggcatccg ggcgacgaat tcttcgatct 1255621 gggcgccgct gaccgcgtgc tcgacgccga tgtcggtgaa gacgtcggac aggtcggcca 1255681 ccaaccgtgc gtcggaatcc gaggagaaat actgcgccga caggtcaaac cgctcggtaa 1255741 gcagaagcag cacgggcacg gtgatgggcc gctggtcatt ctccaactgg ttgacatagc 1255801 ttgtggataa gtccagggcc ttggccagcg ccacctgggt gagcccgcgc tcttgacgta 1255861 accgccgcag gcgggcaccg gaaaacgtcc tcgaatacgt cctagccacc ggtaagacat 1255921 tactccgcgt catgttcgca aaatttgcaa aatgtgccgg gtcaggacac aaaagtacgc 1255981 cttttcaggg tcttttgttg gtgtcctgtg ctgcgtatgg tgcggattat gttgatgcat 1256041 gcggtccggg cgtggcgcag cgccgacgat ttcccgtgca ccgagcacat ggcctacaag 1256101 atcgcccagg tggctgccga tccggttgac gtcgacccgg aggtagcgga catggtgtgc 1256161 aaccgcatca tcgacaacgc tgcggtgagc gccgcatcaa tggtgcgcag accggtcacc 1256221 gtggcccgcc accaggcact ggcgcatccg gtgcgacacg gggcgaaggt atttggcgtc 1256281 gagggcagct actcggcgga ctgggcggcc tgggccaacg gcgtcgccgc gcgtgaactt 1256341 gactttcacg acacgtttct ggccgccgac tattcgcacc cggcggacaa cataccccca 1256401 ctggtggcgg tcgcccagca gctcggcgtg tgcggcgcgg agctgatccg cggtctggta 1256461 accgcctatg agatccacat cgacctaacc cgcggaatct gcttgcacga gcacaagatc 1256521 gaccatgtcg cccacctggg cccggcggtg gccgccggca tcgggaccat gctgcggctc 1256581 gaccaagaga ccatctacca cgcgatcggc caggccctgc atctgaccac cagcacccgt 1256641 caatcccgca agggcgccat ctccagctgg aaggcgttcg cgccggcgca tgccggcaag 1256701 gtcggcatcg aggcggtcga tcgggcgatg cgcggcgagg gctcaccggc tccgatctgg 1256761 gagggcgagg acggggtgat cgcctggctg ctggccggac ccgagcacac ctaccgggtg 1256821 ccgttgcccg cacctggtga acccaagcgc gccattctgg acagctacac caagcaacac 1256881 tccgcggagt accagagcca ggcgccgatc gacctggcct gccggctacg tgagcgtatc 1256941 ggcgatctcg accagatcgc gtcgatcgtg ctgcacacca gccaccacac ccatgtagtg 1257001 atcggaacgg gatccggcga tccgcagaag ttcgacccgg acgcgtcacg cgaaaccctc 1257061 gaccactcgc tgccctacat cttcgccgtg gcactgcagg acggctgctg gcaccacgag 1257121 cgctcctacg cgcccgagcg ggcgcgccgt tccgacacgg tggcactgtg gcacaagatt 1257181 tccaccgtcg aggatcccga gtggacccgc cgctatcact gcgccgatcc ggccaaaaag 1257241 gcgttcgggg cgcgcgcgga ggtgacgctg cacagcggtg aagtgatcgt ggacgaactg 1257301 gcggtggccg acgcccatcc gctgggcacc cggccgttcg agcgcaagca gtacgtagag 1257361 aagttcaccg agctcgccga tggtgtagtg gaacccgttg aacagcaacg tttcctggcc 1257421 gtagtagaga gtctcgccga tctcgagagc ggtgccgtgg gtgggctgaa cgtgttggtc 1257481 gatccgcggg tgctggacaa agcgccggtg attccaccag gaatctttcg atgaccgggc 1257541 cgctcgcggc ggccaggtcc gtcgctgcca cgaaatcgat gaccgcgccc accgttgatg 1257601 agcggcccga catcaaaaag ggcctcgccg gcgtggtggt ggacaccacc gccatctcca 1257661 aggtggtgcc gcagaccaat tcgttgacct accggggata tccggtccag gatctggcag 1257721 cccgctgcag tttcgagcag gtcgccttcc tgctgtggcg tggtgagttg cccaccgatg 1257781 ccgagctggc gttgttcagc cagcgcgaac gagccagccg tcgggtggac cgctcgatgc 1257841 tgtcattgct ggccaagctg ccggacaact gccacccgat ggacgtggtg cgcaccgcga 1257901 tcagctatct cggtgccgag gacccggacg aggacgacgc cgcggccaac cgggccaagg 1257961 cgatgcgcat gatggcggtg ttgccgacga tcgtggcgat cgacatgcgg cgccgacgcg 1258021 ggttgccccc gatcgcaccg cacagcgggc tcggttatgc gcagaacttc ctgcacatgt 1258081 gcttcgggga ggtacccgaa accgccgtcg tgtcggcgtt cgagcagtcg atgatcctct 1258141 acgccgagca cggattcaac gcgtcgacgt tcgccgcccg ggtggtgacc tcgacccaat 1258201 ccgacatcta cagcgcggtg accggcgcga tcggcgccct caaggggcgg ctacacggcg 1258261 gcgccaacga agccgtcatg cacgacatga tcgagatcgg cgatccggcc aacgcgcggg 1258321 agtggttgcg cgccaagctc gcccgcaagg aaaagatcat gggcttcggg catcgggtgt 1258381 accggcacgg cgactcccgg gtgccgacca tgaaacgggc gctggagcgc gtggggaccg 1258441 ttcgcgacgg ccagcgatgg ctggacatct accaggtgtt agcggccgag atggcgtcgg 1258501 ccaccgggat cttgcccaac ctcgattttc cgaccgggcc cgcgtactac ctgatgggat 1258561 tcgacatcgc cagcttcacc ccgatcttcg tgatgagtag gatcaccggc tggaccgcac 1258621 acatcatgga acaggccacg gccaacgcgc tgatccggcc gctgagcgca tattgcgggc 1258681 acgagcagcg ggtgttaccg ggcaccttct agtcttatgg gccatgggat ttctccagcc 1258741 ccgacttccc gacatcgacc tggccgaatg gagccagggc tcccgcagcc agaagatccg 1258801 gccgatggcc cagcattggg ccgaggtggg ttttggcact ccggtgctgc tgcacctgtt 1258861 ttacgtcgcc aagatcctgt tgtacgtcct tgtcggctgg ctgatcgtgt tgaccaccaa 1258921 ggggattgat ggattcaccg atgcggcagc gtggtacgcc gagccgatcg tgttcgagaa 1258981 ggtcgtgctc tacaccatgc tgttcgaggt gatagggctg ggctgcggct ttgggccgct 1259041 gaacaaccga ttcttcccgc cgatgggctc gatcctgtac tggatgaggt tcggcaccat 1259101 ccggctgccg ccgtggccgg atcgagtgcc gtggacccgc ggcaccaagc gcaagccggt 1259161 ggacgttgcc ctctacgcac tgctggtgat gatgttgctg tcggcgctgt tcaccgatgg 1259221 cgccggcccc ataccggagc tgggcaccat ggtcgggctg ctgcccgcct ggcagatcgt 1259281 gctgatcctg ctgcttctcg gtgtgctggg cctgcgcgac aaggtgatct tcctggccgc 1259341 ccgcggcgag gtctacgcga cgctgacggt gacgtttttg ttcggccgct tgaacggtat 1259401 agacatgatc gtggccgcca aactggtgtt cctggtgatc tggatcggtg cggcgacatc 1259461 gaaactcaac cggcacttcc cttttgtgat ctccacgatg atgtccaaca acccgctgtt 1259521 tcggccgcgg ttcatcaagc ggatgttttt caagaagttc cccggcgacc tgcggcccgg 1259581 gctgttgtcg cggattgtcg cccacgtcag cactgttatc gagatgtgtg tgcccgtggt 1259641 gttgttcgtt gcgcacggcg gctggccgac ggtggtggcc gcgacgatca tggtctgctt 1259701 tcacctgggg attctgacgg ccatcccgat gggggtgccg ctggagtgga acgtgttcat 1259761 gatcttcggc gtcctgtcgc tgttcgtcgg ccacgcctgc ctcgggttag cggacgtgaa 1259821 aaacccggtg ccgctggcga tcctgatcgc cgttgtcgcg ggaatcgtca ttgcgggcaa 1259881 cgtgtttccc cgcaagatct cgtttctagc cgccatgcgc tattacgccg gcaactggga 1259941 taccacgctg tggtgcatca agccctccgc ggaggacaag atcaaccggg gcatcgtcgc 1260001 gatcgccagc atgccggccg ctcagctgga gcgcttctac ggcaaggacc gagcccagat 1260061 cccgatgtat ctgggatacg cgtttcgtgc gatgaactcc catggcaggg cgctatttac 1260121 gctggcgcat cgggcgatgg ccggccatga cgaagacgac tacgtcatca ccgacggcga 1260181 acgggtctgc agcactgccg tcggctggaa cttcggcgac ggccacctgc acaacgagca 1260241 actgatcgcg gcgatgcaac agcggtgcgg cttccaaccc ggtgaggtgc gggtggtgct 1260301 gctcgacgcg cagcccatcc atcggcaaac ccaggagtac cggttggtag acgcggcgac 1260361 cggggagttc gagcgcggct atgtccgggt ggccgacatg gtgaaccggc agccctggga 1260421 cgacgacgtg ccggtccacg tgctgccggg ctagctgctc gtcagctagc ccgcgcgcac 1260481 ctcccgggcg gcggcgacca tgttgtgcag cgacgcggtc acctcgtcga cattgcgggt 1260541 cttcagtccg cagtcggggt tgacccacag ccgctcggcc ggcaccgcgc gcaacgcggc 1260601 ccgcaacgag tcggccatct cctcagcgga gggcacccgt ggcgagtgaa tgtcatagac 1260661 gcccgggccc acaccgttgg cgaagccgat cgcgttcagg tcgtcgagca cctccatgtg 1260721 tgaccgggcc gcctcgatgg acgtgacgtc cgcgtccaga tcggcgatcg cgccgatcac 1260781 ctcgccgaac tccgagtagc acagatgcgt gtggatctgg gtggcgtccg agacgccgga 1260841 ggtggccaac cggaaagccc ctaccgccca acgcaagtac tcggcctggt cggcgcgacg 1260901 cagcggcagc agttcacgca gcgcaggctc gtcgacctgg atgaccgcga tgccggcgga 1260961 ctgcaaatcc acggtctcgt cgcgaatcgc cagcgccacc tggttggcgg tatcggccaa 1261021 cggctggtcg tcacgcacga acgaccacgc cagaatcgtc accggcccgg tcaacatgcc 1261081 cttcaccggt ttgtcggtca gcgactgcgc gtaggtgatc cactcgaccg tcatcgcccg 1261141 cggccgggac acgtcgccgt acaggatcgg cggacgcaca cagcggctgc cgtaggactg 1261201 cacccagccg ttctgggtag cgaagaaacc cgccaattgc tcggcgaagt actgcaccat 1261261 gtcgttgcgc tccggttcgc cgtgcaccag cacgtcgagc ccgagccgct cctgtagcgc 1261321 gatcacctcg gtgatctctt gccgcatccg gcgcacgtac tcggcctcgt cgatctcacc 1261381 ggcccgcagc gccgcacgcg caacgcggat cgccgaggtc tgcgggtagg agccgatcgt 1261441 cgtggtcggc agcggcggca ggtgcagtcg cgcgtcttgg ctggcgcggc gctgggcggc 1261501 attgccgcgg tgggctccgg acgcgacgat cgcctcgatg cgcgcccgga tttgcccatt 1261561 gtgtaaccgc gggtcgcgct tgcgggacgc gatggcggcg cgggacgacg cgatctcgtc 1261621 ggcgaccgcg tcgtgtccgt cgcgcagggc acgcgcgaga acgacgactt cgcgcacctt 1261681 ttcggcaccg aacgccagcc agctccgcaa cgcgtcatcc aggtcggttt ccggttccag 1261741 cgagtacggc acgtgcagtg tcgagcacga cgtcgagacg gccacggtag ccgccgaacc 1261801 cagcagggtc gccaacgtgc ccaacgccgc ctccaggtcg gtgcgccaga cgttgcgccc 1261861 gtcgacgacc ccggccacca gcgtcttgcc ggccagctcg ggtaccccgg ccaccgaggt 1261921 gtcggcaccg gccactaggt cgacgccgat ggcttcgacc ggggtgcgag ccagcgccgg 1261981 tagggccgcg cccgggtccc cgaagtaggt ggcgacatag atcgcaggcc ggttgctcac 1262041 cgagcacagc gcggtgtaca ccgcttcagc cagggcgggc gcgtcggggg agaggtcggt 1262101 caccagcgcc ggctcgtcga actgcaccca ctgggcgccg ccgtcggcaa gcagcgacag 1262161 cagctccgaa tagaccggaa ccaactcttc gaggcgttcg atcggcgccc ccgcgccgtc 1262221 gacggccttg ctcagcagca ggaaggtgat cggcccgatg atcaccggac gtgcgggaat 1262281 gccttgccct aacgcctctt tgagttcggc gagcaccttg ccggggtgca gcgtgaacgt 1262341 ggtcgacggc ccgatctcgg gtaccaggta gtggtagttg gtgtcgaacc acttcgtcat 1262401 ctccagcggc gcgatctggt cggtgccccg cgccgcggcg aaatagcggt ccagcccgtc 1262461 ggaaaccggg ctcactcggg gcggcagcgc gccgagcagc accgcggtat cgagcatttg 1262521 gtcgtagtag gagaaggtgt tcaccggcac cgagtccaga ccggccgcgg ccagggccga 1262581 ccaggtgtcg cggcgtaacg tggcggcgac ggcctccagc tcggatcggc tggtacgtcc 1262641 ggcccagtag ccttcggtgg cgcgcttgag ttcgcggcgc gggccgatgc gcggggagcc 1262701 ggtgatggtt gcggtaaagg gttgacgacg tacaggctgg gtcacgtgct gtccttcgat 1262761 cgacgggtgg ttcaccgccc gcggacgcgc agccgatccg attgaggtgc acaccgatgc 1262821 acccggcaac aggcacggcc aaacgcccat tccacgaggc gatgagccgc cgggcgcggc 1262881 gcgtccggca cggctggcag gtcttcggac tcgcaggctc gcacccggtg ggtgctccta 1262941 ctggccgtcg cttcccagtc gttgagacca gtgcttgtct acttccaaga cggcggtcgt 1263001 tcctgcatac cgctgcggga cagtcccgga ttctcaccag gttccctctc gcgaagcatc 1263061 gttgccccgc tcgatgccga cgccctttcg gacgccagca gaccagctgc gtggtcaagg 1263121 ctactccggt gacatcggcc ggcatggccc ggccggcggc aaaatcgctc ggcgccggat 1263181 gtcctcatcg ggcccgccgc gatcgtcatg tgggtgagat tcgggatagg cccggaccat 1263241 gatgggtcaa caggccgcaa tacgccgcac tcacctgcac cagagacgtc gactggtcgg 1263301 cccccgagca ggccgctgac atggccgcct accagaagtt cgggcaggag cacgccgccg 1263361 cgatccgtgg cggcgccgtg ctgcacccga cggccaccgc cacgacggtc cgggtaaccg 1263421 gcgcccgcgg cggcgacgtc gtcaccggcg acggtccgta cgaggcggcc gacctggacg 1263481 agcaagggcc attcccgatg gagacggtct acctgtggga ggacggcccg aacggtacga 1263541 cgaggatgac gctgtaaaac cgtggtgagc cttcccgctt cgcgggaatc gccgcacccg 1263601 ccatgacggt ggcggtcagg cgggccaacg cgaaggatct cgcgcggcgc aggctgctgg 1263661 aatccggggg ctaaccgtcg aagaacccgg actggtcatt accggcgttg aacccgcctg 1263721 agctgttgtc gccggagttg gccaccccgg aggtggtggt gaaggcggcg ttggtggccg 1263781 agtttccgat gccggtgttg ttgaagccgg tgttgaacag gcccgtgtta aagccggttc 1263841 ccgagttgct gatgcccacg tgctggccgc cgccggaatt gagcagaccc gagtgaccga 1263901 tgaagaaggc gccggtgttg gtgttctgga agccggagtt cgcgtcgccg gagttattga 1263961 agcccgagtt gccggtgccg atgttgccga agccggagtt catcaccggt tggtccaccg 1264021 ggctgccgaa cccggtgttc aggtctccgg agtggaagcc gccagtgttg atatcgcccg 1264081 agttggccca gccggtattg aagtcgcccg agttcaagtc tccggtattc aaggtgcccg 1264141 agttgaagct gcccgtgttg taggcacccg agttgccgac acccatgttc tcaaatcccg 1264201 agttgccgaa cccgaagttg ttgttgcctg cgttcccgaa accgaagttg ccgctgcccg 1264261 cgttcccgaa gccggtgttg gtgaagcccg cgttcccgaa accggtgttg gtgtcaccgg 1264321 agttgaagaa gccgaagttg ctgtcgccgg agttgaagaa gccgatgttg ttgttgccgg 1264381 agttgaacaa gccgatgttg ttgttcccgg agttgccgaa gccgaggttg ccgatgccgg 1264441 agttcagtgc gccgatgccg atcatgttgt cgccggtgag cccgataccg atgttgttgt 1264501 tgccgagatt cgcaatgccc aggttgttgt tgccgagatt cgcaatgccc aggttgttgt 1264561 tgccgagatt cgcaaagccc acgttgggag agccgtgatt tgcgctgccc acgttgaagg 1264621 aaccggcgtt ggcggtgccg aagttgaagc tgccgacgtt cccgctgccc cggttgccat 1264681 cgccgatatt ccccaggccg aagttaccgt tgccgtcgtt gccgctgccc aggttgaggt 1264741 tgccgaggtt cccgctaccg aagttggtgt tggcggtgtt gccgctgcca aagttgaaga 1264801 aaccggtatt gccgctgccc aggttggcct ggcccgtgtt tccgctgcct aggtttgcgt 1264861 tgccggtatt gccgttgcct aggttgtagt cgccgatgtt gccgatgctg aagatgttgc 1264921 cgatgccggt gttgccgata cccaatgccg ggatggccag ggcagcgggg ccggaggcca 1264981 gtgcgggcgc cgtcgggttg ggcagggcgc gcaccgcctg tgcccacggg gccagctggg 1265041 cggccaccgc cgaggccccg ctgtggtagc tcaccatggc cgcgacatcg gcggcccaca 1265101 attgttcgta ggctgcctcg gtcgccgcga tcgccggggc gttttggccg aacaggtttg 1265161 atagcaccag cgacaccagc tggtggcggt tggcggcgac cagcagtgga tccacggtgg 1265221 ccgcccgcgc ggcttcatac accgccgcgg ccgccttggc ctgtgcggcc gcgcttagcg 1265281 cgcgtgttgc tgccgtgctt agccagctgg catagggggc tgccgcggcg gccatcgccg 1265341 tcgccgccgg accttgccag gcggtgtcgg ccagcgctgc ggtggccgac gaaaacgagt 1265401 tcgccgcttg gcctaactcg gcggccagcc cgtcccaggc cgccgcggcg gccagcgtcg 1265461 ggcctgagcc cgcaccggca aacatcaacg cggaattgac ctcgggaggc aacaccagaa 1265521 aactcatcac gccatccctt ccgcagctgg acgtgcccgg gccatcccct cccgtgacca 1265581 caaacctccg ctggctgaat acgcacagcc cgatcctccc ggcgcgaagc agcgccgcgg 1265641 tcccgcctgc ttgaccccag attccatggc gcgcctccca ccaccaacac tgggccgatc 1265701 gctcgacacc tcatgcagct tggcaatcaa aacactatga gattcgcagg gcggcctcag 1265761 cgttttcgcc aaagcgctta ccccctgttc aaccccaaca gcgcgatcgc gcttggccac 1265821 ccattcggcg gctcgggggc acggttgatg actacagtgc tacaccacat gccggacaag 1265881 ggaattcgct acggcttaca gacgatgtgc gagggccgcg gccaagccaa tgccaccatt 1265941 gtggagttgc tgtgacagcg accgatagcc agccggcggc gttgtcgagt accgcgacaa 1266001 tgtcatggtc attacgatca atcggccgga agcccgcaat gcggtcaatg gtgccgtcag 1266061 catcgtggtt ggagacgcgc tggaagaagc gcacgacaac cccgatgtgc gggccgtggt 1266121 gatcaccggc gccggcgaca agtcgctttg cgccggtgcc gacctcaagg cgatcgcacg 1266181 ccgggagaac ccgtaccacc cgcatcacgg cgagtggggc atcgccggtt acaggcacca 1266241 tttcatcgac aagccgacca gcgccgcggt cagtggcacg gccttggacg acggtgccga 1266301 gccagcgctg gccagcgacc tggtggtggc cgacgagcac acctaattcg ggtttgccgg 1266361 aggtcaaacg cgggctgatc gccgccgccg ggggtgtacc ggtgagccgc tgaccgcatc 1266421 cgacgactgg gagtggggcc tgatcaaccg ggtcgtcaag gagggttcgg tcgtcgaggc 1266481 cgccctcacc tggccgtgcg ggtgaccgtc aacgcgtcgc tgtcggtgca ggccagcaag 1266541 cggatcgcct gtggtgtcga tgacggggtc gtcgtcgacg aagggactcc gcacccagcg 1266601 cgagatgggt tccctgatga gatcgcagga cctcgggcgt tcgccgagaa acaggaaccg 1266661 gtgtggcggg cccgctgcat cgtctcggcg ccttggatgg gcttggcggg cgtaccgtca 1266721 gccagcactg tcgcattgcc aacgtttgtg ggacttatcc cgatgccggg gcgcagtgtc 1266781 gcgctgaggt gggcacaacg agcatccttc ccgggagaac caatgtggcg gatgtgacaa 1266841 cgcgccgaca acaccagatc ctgggctgtc tcagtacgcc aggatgttca ccccgtaccg 1266901 gaatgccgtg ggcagaagtg cgcacagcgg cacgatggca cggcgtgccg cgcgtggcgt 1266961 actggccagc accaacccgc gggtgactag ccggtaatca cgagtgatcc ggtgccacgc 1267021 ggcctcatac gacgccggtg tgtcgtcgac gatggcgctc accgccgcgg cggcctgctt 1267081 gacggcaagg ctgatgcctt cgccggttag ggcatcttcg tacccggccg cgtcaccgac 1267141 caaaagcacc cgccccgcga cgcgccggga gaccacctgg cgcaagggac cgcagccacg 1267201 tgcgtgtccg cggctcgcgt cttgcagatg gtgtgcaagg ctgggaaacc aggcaagttc 1267261 gggtcgttgg cgggacaaga tcgcgacgcc gaccagatcc ggttccaccg gagtcacata 1267321 agcctcaccc caacgggacc aatgcacttc gacgaagtcc gaccacaccg gcagccggta 1267381 atgccagcgc accccgtatc gccgtggtgt cccggcggtg gctttgatcc cgacggcgcg 1267441 ccggacggcc gaatgcagtc catcggctgc caccaaccat ttcgcgcgaa cgccggcggc 1267501 ggtcacacca tgtgcgtctt gctgaatagt ggctacccgc gaccggatcc attcagtgtc 1267561 ttgctctttg gctcgtgccg ccagtgccgc atgcagcgtg gtgcgtcgca cgccccgccc 1267621 cggcccggtg cgaaaccgcg cctgcacccg acgatgttca ccaacgtagg caatcccatg 1267681 aaagggcaga ccgaccgggt ccacgcctag cgaggtcaat tcggccaggc caccgggcat 1267741 cagcccctcg ccgcacgcct tgtcgatggg attctcgcga ggctcggcca cgatcaccga 1267801 aagtccacgc gcgcgtgcgt gcaatgccgt ggcgagtccg ccggggccgc cgccgacgac 1267861 caacaggtcg gtgtcgtagc tggtcatatg tagcccagaa cggagttctc cacccgcaga 1267921 cgaacggtca gcaaggtcgc attggccagg gtgaaaacca gtgcggtcaa ccacgccgtg 1267981 tgcaccagtg gcaacgcgaa cccttcggcc accaccgcaa cataattcgg atgccgcatc 1268041 caccggtagg ggccccgccg caccaacgtg gcgtgcggca acacgattac ccgggtgttc 1268101 caccgcttgc ccagcgattt gacgcaccac cagcgcaggc cctggcttgc caccactacg 1268161 gccagcatcg gccagccgag ccacggtatg aaaggccggt gcaaggccca cggttcgacg 1268221 acgcagccca gcagtagggc ggtgtgcagg ataaccatca ccacatagtg tgggcggcca 1268281 aactctttgc cgccctgcgc gaaagaccac cgcgcgttac gctgggccac caccagctcc 1268341 gccagccgtt cgaagacgac cgccaggatc agcaggtagt acacggccct accacctaag 1268401 aagcaccgac tcggaggaaa aacccgggcc tatgcacact atcctggcgg gaccggcgat 1268461 gcagcccagc ccgaacggtg gaatagctcc ccctgcgatc gactccatat tgtcagccac 1268521 gttactggcg ccggatgggt tcagattctg gcgagtggga ccgccattgc cgggccgttc 1268581 cacggcccgt atcgtcgccg cgctgtgctg gattgcgcgg cttctcctcg ggccgttcca 1268641 cggcccgtat cgtcgccgcg ctaggttgga cgctgtgcgg atcgtggtga gcagtgccac 1268701 cagaaatgcg ggttcgtaca cctgtgtcag caccggcagc gctggatgcc gcgagattac 1268761 accgcccctc gctgggccca cgcctgggcc ggtgaacccc ggcccgcccg ctggcaccct 1268821 gcgaaccagc ctgcacatcc tgaccactcc aaccgcgaaa gtccggcctg catgagccaa 1268881 tccaccactc cataccgcag cagcgtgctt gccgagtttc gtcgtgcgat caccaatgtc 1268941 gctgtgcccc atcatgaacc gccgggaatc gtgcgccgcc gccgtgtggt cgtcggcgtc 1269001 acgttggtta tcggcgctgt gatgctgggc ttttcgctga ggcggacgcc cggcgagtcg 1269061 agcttttact ggctgacgct cgcgctggca gccgtgtgga tcgccggcgc actgatgtct 1269121 ggaccgctgc atctgggtgg catctgttgg cgcggtcgca atcagcgtcc ggtcatcacc 1269181 gggaccactg tcgggctgct gctagcaggc atcttcgggg tgggtgcaat gatcgtcagg 1269241 gcaattcctg gcgcagctga accgatagcc cgcgtcctgc aattcgccca tcagggaact 1269301 ctgctgccga tcctgctgat caccttgatt aacggcatcg ccgaggagat gttctttcgc 1269361 ggtgcgctct acaccgcgct gggacgacgc tatccggtga ccatctcaac cgtcctgtac 1269421 gtcggcgcca ccatggccag cgcgaatctg atgctcggct tcgcagcgat cttcgtcggt 1269481 acggtgtgtg cgttggagcg ccgggccagc ggtggagtgc tggcaccgat cttgacccac 1269541 ttcgtgtggg gcctgatcat ggtgttcgcg ctgcccccgc tgttcgcggt ctgacgcgcg 1269601 ttcaggaacc ggtgaagttg ggggtgcggc gttgcaggaa cgccgctgcg ccctcggcga 1269661 agtcgtgtgt tcgcagcagg acttcctgtc catccaattc gcgcgcgaac gtgggttcca 1269721 attcggtgag ggcggctgca ttgatggcgt ttttggcctg ggcgaacgcc agcgccgggc 1269781 cggccagcaa ccgtgaaatc accttgtcca cctcggcctc gaagtcgctg tccggatata 1269841 ccgcgctgat caggccccag gccagtgcct cgcgggccgg cagttgctcg gccagcagcg 1269901 ccagccgcat cgcccggatc cggccggtgg ccgcggcgac taacgccgat gcgccgccgt 1269961 cgggcatcaa cgctaccttg gtgttggcga gcatgaaaaa tgcactatca gaagccaata 1270021 tgaagtcaca cgccagcgct agcgagacag cgacgccgac cgctggtcct tgaacgacag 1270081 ctacaaccgg gtgcggtagc gcggccacgg cgcgtactgc gcggttggcc tcttcgacga 1270141 tggcggtcgg cggccctccg ccccacacat cgtccacaga catagacact ccggagctga 1270201 aaccgcggcc caccccgcct aggcgcacca ccttgaccac gggatcggcc gccgcgcgct 1270261 ccagcgtgtc ggcgatcccc gtcaggattg gcacggtcag cgagttgaga ctgctagggc 1270321 ggttgatgcg caccgacaac actctgtcgg tcagggtgac gttgaggcct gtgaccggcg 1270381 ttaatgcggc aatcccggaa tctggcatgt gcagcatcct aaatgagggc cagctacaca 1270441 gagtggttaa tgatgctccg caaacatgcc caaccagcag ttggagtaat cggtgagtac 1270501 acgggcatcg acgcggccca gtcgcgggac cgctagcggg ccgagagcgc tcaacggccg 1270561 gtgaacatgg gggtccggcg ctgctggaat gccgttgcgc cctcggcgaa gtcgtcagta 1270621 cgcaggagga gggcctggcc atccaattcg cgcaggagag tgggtgccaa ctcggtgagc 1270681 gtggccgcat tgatcgcgtt cttcgtcttg gcgatagcca gcgctgggcc ggccaacagc 1270741 cgtgagatca acttgtccac ctcggcatcg aagtcggcgg ccggatagac ggcgctgacc 1270801 aggccccagg acaaggcctc ggcggccggc acccggtccg gcagcagcgc catatgcatg 1270861 gcgcggatgc ggccgatcgc ggcctgaacc aacgccgacg cgccgccgtc gggcatcaac 1270921 cccacgttgg tgtgagcgag catgaaaaac gcattgtcgg aggccaatac gaggtcacaa 1270981 gcgagcgcca gggagacgcc acagccgacg gttggtccct gcacgacggc aacgaccggt 1271041 tgtggtagtg ccacaatggc acgcaccgtg cggttggcct ccgcgacggt gtcggtaggc 1271101 gggccactgg cccacacatc gtcaacgctg attgcccctc cggagctgaa gccgcgaccg 1271161 gcgcccccga ggcgcaccac cttcacccgt gggtcggtgg ccgcgccctc gatcgcgtcg 1271221 gccatccctg ccagcaccgg cttggtcagc gagttgagac tctccgggcg atcgatggtc 1271281 accgacagca ccccgtcggc cagggtgacg gcgagacccg ggacaattgt ccgagtgtcg 1271341 atccggtagt tcgacatgtg gttaacacta atcgacgacg ccgtcaccga gctgcggcga 1271401 catgatcttc gtcgatacgc cgtcgagggc gtcaatggga gacgaaaggc cggtacattc 1271461 atggcgggtc cgctgagcgg gttgcgagtt gtcgagctgg cgggcatcgg gccgggcccg 1271521 cacgcagcga tgatcctggg ggacctcggt gccgacgtgg tgcgcatcga tcgcccgtca 1271581 agtgtcgacg gtatttcgag agacgccatg ttgcgtaacc ggcgtatcgt gaccgccgac 1271641 ctgaagtccg atcagggact cgagcttgcg ctcaaactca tcgccaaggc cgacgtgttg 1271701 atcgagggtt accgtcccgg cgtcaccgaa cggctgggat tgggtccgga agaatgtgcg 1271761 aaggtcaacg accggctgat ctacgcgcgg atgaccggct ggggccaaac cggcccgcgt 1271821 agtcagcagg ccggtcacga catcaactac atctcgctga acggcatttt gcacgccatt 1271881 ggccggggcg acgagcgacc ggtgccgccg ctgaacctgg ttggtgactt cggcggcggc 1271941 tcgatgttcc tgctggtcgg catcctggcc gcgctatggg agcggcagag ctccggcaag 1272001 ggccaggtcg tcgatgcggc gatggtcgac gggtccagcg tgctgattca gatgatgtgg 1272061 gcgatgcgag cgacgggcat gtggaccgac acaagagggg ccaacatgct cgacggcggg 1272121 gcaccctact acgacaccta cgaatgcgcc gacggccgct acgtcgctgt cggcgccatt 1272181 gagccgcagt tctatgcggc catgctggcc ggattgggtc tagacgccgc cgagctgccc 1272241 ccgcaaaacg accgcgcccg ttggcccgaa ctgcgggcgc tgctgaccga agcgttcgcg 1272301 agccacgacc gtgaccattg gggcgcggtg ttcgccaatt ccgatgcctg tgtgacgccg 1272361 gtgctggcgt tcggtgaggt gcacaacgag ccgcacatca tcgagcgaaa caccttttat 1272421 gaagccaacg gcggatggca acccatgccg gctccgcggt tctcccgcac cgcttcgagc 1272481 cagccacgcc cgccggccgc cacgatcgac atcgaggcag tgctcaccga ctgggacgga 1272541 taggaaggat tcgtatgaag accaaagacg ccgtagccgt tgtcaccggt ggcgcctcag 1272601 gcctgggtct ggccaccacc aagcggctat tggacgctgg ggcacaggtg gtcgtcgtgg 1272661 acctccgcgg cgacgacgtg gttggcgggc tcggcgatcg cgcgcgtttt gcgcaagccg 1272721 acgtcaccga cgaagccgcc gtcagcaacg cgctagagct ggcggattcg ctcggcccgg 1272781 tgcgggtcgt cgtcaactgc gccggcaccg gcaacgcgat tcgcgtactg agtcgcgacg 1272841 gcgtgttccc gctggccgcg ttccgcaaga tcgtggacat caacctagtc ggcaccttca 1272901 acgtgctgcg actgggcgcc gagcggatcg ccaagaccga accgattggg gaagagcgcg 1272961 gcgtcatcat taacaccgcc tcggtggcgg cattcgacgg tcagatcggc caggccgcct 1273021 actcggcgtc caagggcggc gtagttggca tgaccctgcc gatcgcccgc gatctggcca 1273081 gcaagctgat ccgggtggtc accattgcgc cgggtctgtt cgacaccccg ctgctggctt 1273141 cattgccggc ggaggccaag gcctcactgg gccaacaggt gccgcatccc tcgcggctgg 1273201 gcaaccccga cgagtacggg gcgctagttc tgcacatcat cgaaaacccg atgcttaacg 1273261 gcgaggtcat ccgtctggac ggcgccatcc gcatggcgcc gcgctaagcc gcaccaaaag 1273321 aaagaccccc gcgttgcggg ggaccggaat cgggaacaag aacttaccga cgaaaccatc 1273381 ggctgacggc tggttcggcc atgaggagcc gtgcaagcat gcccatggtg tcgctcagct 1273441 cgcggtgggc agcgggtgca agtcttcgag ctgctcggag gtgtcgccct ctaccagcat 1273501 gtcgccgtgg tagagagcct cgaagtcagc cttgatgacg tcggcactcg agtcgtcgat 1273561 ccacatgaca gcgagcctaa aagccgccat taaggaatta gtgagtcacg attcggaaaa 1273621 cagtggcaat tcctaccggt cggtagggtg ctgcgccggc atggtggccg gcatcgcggg 1273681 catgcggcag gtgaaccact cgagcgcccg catccgtatc tatggcaggc gttgtttgac 1273741 agttgtaact tatcgcagat aagtcatcgc ggatttggtg cgggtccgcg cgaccagcac 1273801 cggctgcgga ggaaacgcaa catgctgcag aggatcgctc ggctcgccat cgctgcgccg 1273861 cgccgaatca tcgggtttgc ggtcttcgtc ttcatcgccg cagcggtctt cggtgttccg 1273921 gtggctgaca gcctgtcgcc cgggggtttc caagatccgc gatcggagtc ggcacgggca 1273981 atcgaggtgt tgaccgacaa gttcggccag agcggtcaga aaatgctgat cgtggttacg 1274041 gcagccgcgg gcgccgacag cccacctgcc cgcgaggtcg ggactgacat cgtcgaggtg 1274101 ctgcggcggt cgccgttggt ttacaacgtg acctcgccgt ggactgtgcc accgactgcc 1274161 gccgccgacc tgctcagcac cgacggaaaa tcggggttga tcgtcgtcaa cgtcaaaggc 1274221 ggcgaaaacg acgcgcagaa ccacgcccaa accctgtcag acgaagtcgc ccatgaccgc 1274281 gacggcgtca ccgtccgtgc cggcggctcg gcgatggagt acgcccagat caatcggcag 1274341 aacaaagacg acctgctggt gatggagttg atcgcgattc cgctgagctt cctggtgctg 1274401 atctgggtgt tcggtgggct gttggccgcc gggctgccga tggcccaggc cgtactggcc 1274461 gttgtgggat cgatggccgt attgcgactc gttacgtttg ccaccgaggt gtcgaccttc 1274521 gcgctcaacc tgagtacagc gttgggcctc gcgttggcta tcgactacac gctgctcatc 1274581 gtcagtcgct atcgcgacga gctcgccgag ggcagtgatc gagacgaagc actgatccgg 1274641 accatggcga cttcggggcg cacggtgttg ttttcggcgg tcaccgtggc gctgtcgatg 1274701 tcggcgactg cgctgttccc gatgtacttt ctgaagtcgt tcgcctacgc cggcgtggct 1274761 accgtggcat tcgtcgcgac cgcgtcgatc gtgatcaccc cggccgcgat tgtgttgcta 1274821 ggtcctcggc tagatgcgtt ggacgtgcgc cgactggtgc gtcggctgct gggccggccc 1274881 gatccggtgc acaaaccggt caagcaactg ttctggtacc ggtcgagcaa gttcgtgatg 1274941 cgccgttggc tgccggtcgg tacggctgtt gtcgcgctgc tggtgctgct cgggctgccg 1275001 ttcttgtcgg tgaagtgggg tttcccggac gaccgggtgt tgccgcggtc ggcgtcggcc 1275061 cgtcaagtcg gcgatatctt gcgcgatgac tttggccacg atcctgcgac gcagataccc 1275121 atcgtcgtcc cggacgctcg tggtctcggc ccggtcgaac ttgacagcta cgcagccgag 1275181 ttgtcccggg tgcccgacgt atccgcggta gccgccccga cgggcacgtt cgtagacggc 1275241 agctgggtgg gaacgccgcg cggggccacc gggttggctg agggcagcgc gttcctgacg 1275301 gtgagcagca cggcgccgct gttttcgcga gcctccgata tccagctcaa gcggttgcac 1275361 caggtggcag ggccggccgg tcgatccgtc gtgatggccg gtgtcgcgca ggtcaaccgc 1275421 gacagtgtcg acgcggtgac cgatcggctt ccgatggtgc tagggctaat tgccgcgatc 1275481 acctacgtac tgttgttcct gctcaccggc agcgtggtgc tgccggcgaa agcgttggtt 1275541 tgtaatgtgt tatcgctgac cgcggcgttt ggcgcgttgg tgtggatctt ccaggaaggc 1275601 catttcggtg ccctgggaac gactccgagc gggacgttgg tggcgaatat gccggtccta 1275661 ctgttttgca tcgcattcgg tttgtccatg gactacgagg tgtttctggt ctccaggatt 1275721 cgggagtact ggttggaatc cggagccgcg cgacccgcgc gaagaagcgt cgcagaggtg 1275781 cacgccgcca acgacgagag cgtcgcgctc ggcgtggccc gcaccggtcg ggtgatcacc 1275841 gcggcagcgt tggtgatgtc catgtcgttc gccgcgttga tcgctgcgca cgtgtcgttc 1275901 atgcggatgt tcggcctcgg cctgacttta gccgtggctg cagacgccac actggtgcgg 1275961 atggtcgtgg tcccagcatt catgcatgtg acgggccgct ggaattggtg ggcaccgaga 1276021 cccctggcgt ggctgcatga gcggttcggt gtcagcgagg cagcagagcc ggtttcgagg 1276081 agacgttccc acgccggtgg gttgggcaag attgccggac gaagcgacgg tcagacgatc 1276141 cctgcctcgc tgacgcgcaa tggttgacgt ctcgatgaat ggtcttcgcc ggcaacgtgc 1276201 ccggcggggc cccaacgcca cattacggca gctggcggac tgggtgcagg cacgtcgccc 1276261 atcggagaaa cgacgaggac catcggagga atcctggcca tgacgtcagg cgcggccgct 1276321 tcggcgtcca gggtcgacca cccgcttttc gcccggatct ggcccgtggt cgccgcacac 1276381 gaagccgaag caatacgagc cctccgccgg gagaatctgg ccggtttgtc ggggcgggtg 1276441 ttggaagtcg gggccggcgt cgggacgaac tttgcctact acccggtggc cgtcgaacag 1276501 gtcatcgcca tggagcccga gccgcggctt gctgccaagg cccgcatcgc ggccgctgac 1276561 gcacccgttc cgatagtcgt gacggacaag acggtcgagg agttccgcga caccgagacg 1276621 tttgacgcgg tggtttgctc gctggtgctg tgctcggtga gcgacccggg cgcggtgctg 1276681 gcgcacctgc gttcgctact acggcgaggc ggggagctgc gctatctcga gcatgtggcc 1276741 agcgccggcg ctcggggccg ggtgcagcgg ttcgtcgacg cgacattttg gcccaggctg 1276801 gcgggcaact gtcacacgca tcgccatacc gaacgcgcga tcctcgacgc cggattcgtg 1276861 gtggacagct cccggcggga gtgggcattt cccgcctggg tgccgctacc ggtgtcagag 1276921 ttggctctgg gccgcgcgca ccggacctag ctatagctag tactgcagcc gtagataggg 1276981 attgctgatg ctggcgtgtc tgcgctggtc agggcggtga ccgcggcatt gttttcagtt 1277041 tgtgacaact tctcaatatg ccgcggtcgc cgcggctcat agcgtagacc ctgatcggtg 1277101 gcaggcggag ttctcggcgg tgctggatcg gatcgcgccg cgtttcgccc ggcaccagcc 1277161 gttgcgccat gccggtgaac tcatggccgg gatggtttcg ggcttggacc gcaagaattg 1277221 ctggaccatc gccgagcacc gcggtgatac caccccgatg ggttgcagca tctgttggca 1277281 cgggccagct gggacgccga cgatgtccgt gacgatctgc gtgactatcg ccattgatcg 1277341 atggcgaagg accaggccac cagtatatcg atgatttgaa tagtccagcg ccgacattga 1277401 tgatatctgt tgacgaatac gcttgattta cgatgttcgg ccgcgggcag cgcgctccac 1277461 cagaccgagc acagcgagga cgcgacggcc gtcagcggcg tgctgtgcct caacagcgcc 1277521 gaccaatagc gaagaaatca agtccgtgct cacccgtgac cagggtgtca tgttcgtcga 1277581 cgggtagaag cttgtcgccg cggcgatcgg ctgctctggt gccggctgtg ccgacgggtc 1277641 ggtccgcatc tgcttcagtg attctgtgat gcgaccggca acgtcttcgt tgttgggtgt 1277701 caatgtggtt cgtcgtcgtc ttgttcgcac aggattttcg cggggtggtg gtatcgattt 1277761 attcgcggtt ggccgtggtc gaggtgtggt ggtggtagcc attcggtgtc gccgtgggcg 1277821 tttttgcggg tcttccagcc tttttcgaca aggcgattgt cggggccgca ggccagcgtg 1277881 aggtcgttga tgtcggtacg gtgggtggtt gtccacggcg ttacgtggtg gacctcactg 1277941 tggtaggccg gggcgtcgca acccggcctg gagcagccac gatccttcgc gtacaacatg 1278001 attcgctgcg ccggggaagc taaccgcttg gtgtgataca acgccaacgg cttagcgccg 1278061 tcaaacaatg ccagatagtg gttggcgtgg ctcgccatcc ggataaggtc cgacatcggc 1278121 acccgcgaac caccaccggt tacccccttg ccggtggcgg cttccagctc ctttagcgtg 1278181 gtgctcacca cgatcgttac cggcagcccc ttgtgttggc ccagctcacc ggaggccaac 1278241 aggccccgca gcgcggccaa aaacgcatca tgattgcgtt gcgcctggct gcgggtgtcg 1278301 cggcgcaccg cgtccgcatc cggtgtgtca tccacgagcg gggtctggtc atcggggttg 1278361 cacgcccccg gtgcggccag tttggccaac accgcctcga tggtggcccg caactccggg 1278421 gtcagcagac cgctgatacg tgacatcccg tcaaattcct gcttacccat cgtgatgccg 1278481 cgcttgcggg cacgctcctg gtcggaaaag ttgccgtcgg ggtgcagcca gtccatcagc 1278541 tgcgtggcca ggccatgcag gtgatcggga cgccgactgg tggccagttc ggccagctgg 1278601 gcctcggcgg cctcgcggat acccagatcc accgcggcgg acaactcctt gaagaaggcc 1278661 tggatctcct taatgtgttc tcggccgatc ttgccctcac gttgagcggc cgcggtcgcg 1278721 gtcaactgcg ctggcagcgg ttcaccggtc agggcgcggc gctcaccgag gtcttcggct 1278781 tcggcgatgc ggcggctggc ctcaccggga gtgatgtgta gccggttggc caacgccgtg 1278841 cgcagcgtcc cgccgagctc ttcctcgcag gcttgcccag cgagttggtt gatcaaggcg 1278901 tgctcggcgg cgccctggcg gcgccgttcg acctcgagtc gctgcaaaca ggccagcaat 1278961 tccggggtgg tcaacgcatc gcacttgaga tcgagcaccc gcgacaacga ggcgtggtag 1279021 gcatccaacg ccgcggagat ctcctcgcgc gtgtccgacc tcatgcctcg gattctacga 1279081 agcaccactg acaagaaccg ggccgtcata ggctcggaat gatcagtgag gcagaacgtt 1279141 tcgctcacag cgaaaacagc cgcgccatag cgactgccgc caccaaatgc cgcgtgcacg 1279201 cagacacgcc agcgtcagca atccctatcc acggctgcag tactagggcg tgtctcccaa 1279261 atttttaggt actggccagc gaggattggc cggtgacgcg agtgggtgtg atttcggacg 1279321 agttctgggc cgtggtcgag ccgttgatgc cgtcgcatga gggcaagccc ggcagacggt 1279381 ttagcgatca ccggcttatc ctggaaggga tcgcgtggcg gttccgtacg ggaagtccgt 1279441 ggcgggacct gcccgctgag ttcgggccgt ggcaaacggt gtggaagcgc catcaccgtt 1279501 ggtcgctgga tggtacctgc gacgaggtgt tcgcccacgt tgccgcggtg ttcggggtgg 1279561 acgctgaggt ggccgaggat atcgagaagc tgctgtcggt ggattccacg aacgtgcggg 1279621 cacaccagca ttcggcgggc gcctgctcgg acacgctcgc cacagggggc actgtcggat 1279681 tacaagaaat ccgccgatga acccgacgat catgcgatcg gccgctcgcg cggcgggctg 1279741 accaccaaga tccatgccct gaccgatcag cgcgaagccc cggtgcggat ccggttgacc 1279801 gcaggccagg ccggcgacaa cccgcaactg ctgcccctgc tcgacgacta tcgccatgcc 1279861 agcaccgaat acgccctggg cagcacggat ttccgcttac tcgccgacaa ggcctactca 1279921 cacccaagta cccgtgccgc attacggtct aagaagatca agcacaccat ccccgaacgc 1279981 caagatcaga tcgaccggcg caaggccaag gggtctgccg gcgggcggcc accagcattc 1280041 gacgccgcgc tctacgggct acgcaacacc gtcgaacgcg gcttccatcg actcaagcag 1280101 tggcgcggca tcgcaacccg ctacgacaaa tacgccctga cctacctcgg cggcgtcctg 1280161 ctggcctgcg ccgtcatcca cgcccgagtg ggaactccga aattgggaga cacgccctag 1280221 ccgagaccgg cgagcgtgca tccagggcga gattccgccc ggcaaaccgt cgccctgagt 1280281 tcacgttcgg cgcccatagg cgactatttc agcagggcgg gcaggcgctc caacagcccc 1280341 ggcaacgctt ggctggccga ctcgcggatg ctgatcgtcg cgctgccgga caacggcgtg 1280401 ggctcgggat tgacttcgat cacggcagtg ccgcgcgcca gcgccaggtc gggtaaaccg 1280461 gccgccgggt agacgatcgc cgaggtcccc accacgacca tcacgtcggc gctccctgtc 1280521 gcctcgaccg cgctccgcca cggctcctct ggcagcggct caccgaacca tacgatgtcg 1280581 ggccggatca gaccgccgca gtcgcagacc ggcggctcca cttcgatcgc aggctcgggc 1280641 atctccggaa gggcgtcggt gtagggcaca ccacaacgtg cacaacgaaa ttcgaaaagg 1280701 ctgccgtgca ggtgatgcac cgcaccgctg ccggcgcgct cgtgcagatc gtcgacattc 1280761 tgggtgatga cgctgacctc agcatggtcc tgccaggcgg cgatcgcgcg atgcccgtcg 1280821 ttgggttcga cgttggccac cagataatgg cgccataggt accatcccca gacccgctcg 1280881 gggttgcgca gccagccttg cgtgctggac agctcgtaag ggtcgaatcg ggcccacaat 1280941 ccgttcttgt catcgcggaa cgtcggtaca ccgctttccg cggagatccc cgcgccgctg 1281001 agcaccgcca ctcgcatccc acaaacatag ctgtgcttgg tagatactgg gtacgtggag 1281061 ctgcgggatt ggttacgggt cgacgtgaag gcgggaaagc cgttgttcga ccagctcaga 1281121 acccaggtga tcgacggagt ccgcgccggc gcattgccgc ccggcacccg gctcccgacg 1281181 gtgcgtgact tggccgggca gctgggcgtg gcggccaata ccgtggcccg cgcctaccgc 1281241 gagttggaat cggcggcgat cgtcgaaacg cggggacgct tcggcacttt catttcccgc 1281301 ttcgatccga ccgacgccgc gatggctgcc gcggccaagg aatatgtcgg cgtggcgcga 1281361 gcgctggggc tgacgaagtc cgatgcgatg cgctatctca cccacgtgcc ggacgactga 1281421 attccagcaa agtcaggcac ggccgcagcg gatcgaatac gggcaggcgg taaacggtcg 1281481 acagcgccat attgacccac aggccacggc ccggtggcac ccgcagatcg cggaccgcga 1281541 cgacaccagg caccttattt accaggtccg cggcctgcgc gacggacatg ctgaacggca 1281601 tgcgcggcac cttgtagcgc agcgaggttc gtaacccgag ccggctccac ccagcgaacc 1281661 agcgcggcgg caggtcgaag agcatttggc cgccaggaaa cgtttgagcg cattgggcga 1281721 ttaaccccag tgcctgttcg ggttgtaggt acatcagtaa tccttcggcg gtgatgaaca 1281781 ccccgccggc gggatcgacg gaatccatcc agctgtagtc cagcgcagac tgggcacaca 1281841 ccgacacgcg cggcgagctc ggcagcagcc gtgtccgtaa atcgacgatc ggtggcaggt 1281901 caactgtcag ccaacggaac tggccgcccg ggatggccac gtccaaacgc caaaagctgg 1281961 tttgcaagcc ctccgccaac gccaccacgg tggccgctgg gtgctgatcg agataatgct 1282021 gtgccgccat gtcgaaggcc cgtgctcgta gggcgaagcc ctggccggta gggccgaact 1282081 tcgcgaagcc gaagtcgatc gactcgacca gggctaccgc catcggatcg tcgataatgg 1282141 tatcgcggcg gcgggcctct gcggcccggg cgttcagcgt cagcaaggcg gtctcggaga 1282201 ctccggtgag tgcgacccgc tgtttggcgg gcttatgggc actcaccgca acaccttagc 1282261 cagcgtgcgc aggttgcggg tcgtggtcga cgacttgtag cgcttcttgc ccatcgtctg 1282321 gccgatggtg ctgtccaggg tgctgccctt gggtacctgc cagtagagga cgccaagagg 1282381 gtcgggtcca cgactgatgt tctcgtcagg gccggctgtg tcggcgagtg cggatagctc 1282441 gtcgagtatc gcggcgtcgg caacgaaggt gacgtacgac tggtatccct cgagctcgca 1282501 ttcaaatggg tatgccgcca cgatggtgcg caccgtatcg acgtcgtaga tcaacgccca 1282561 cgcgtcgtag ccgaatcgtt cgcgtagcgt ggcttcggtc ttctcgcgca cttccgcggc 1282621 accgcacgtc gactccagca acacgttgcc gctggccagg atggtgcgca cattgcagaa 1282681 tcccgcatcg gtcaacgccg tcgccacctc ggccatcttg aggttgacgc cgccgacgtt 1282741 gacaccgcgc agaaacgccg cgaacttggc catacccgat tgcaccaggc cgccggagaa 1282801 tgacgcaacg gcgacgtagg ctcttggcat ggcccgccaa gtcttcgacg acaagctgtt 1282861 ggccgtaatc agtggaaact ccattggggt gctggccacc attaagcacg acgggcgccc 1282921 ccagttgtcc aacgtgcaat atcacttcga cccgcgcaaa ctgctgatac aggtatcgat 1282981 cgccgagccg cgagccaaga ctcgcaacct gcgtcgcgac ccacgggctt cgatcctggt 1283041 cgacgccgac gacggatggt catacgccgt tgctgagggc actgcgcaac tgacacctcc 1283101 tgcggcggcg cccgatgacg acaccgtgga ggcgctgatt gccttgtatc gcaacatcgc 1283161 tggcgagcat ccggactggg acgactaccg gcaggcgatg gtcaccgatc ggcgtgtgtt 1283221 gctgacgctg ccgatctcgc acgtatacgg cctgccgccc ggtatgcgct aacccccggg 1283281 gctgcggacc tacggactgg gtcggattgc ctcgctgctc ggcgggccgc atcctgcggc 1283341 ccgcatcgtc gcgaggctgg gtcggattgc ctcgctcctc gccgtgccgc atcctgcggc 1283401 ccgcatcgtc gcgaggctag gctgcgggta tgggtgaatc gaagtccccg caagagtcca 1283461 gctcagaggg tgagaccaag cgcaagttcc gggaagccct cgaccgcaag atggcacagt 1283521 cgtcgagcgg atccgatcat aaggatggcg gcggcaagca gtcgcgggcg cacggtccgg 1283581 tggcgagccg tcgggaattc cgccgcaaga gcggctagcc acggggcgcg gctgctcagc 1283641 ggcgacccga acgttgccga agatgctcat caagaggtcc gtcccgacag ctctacactg 1283701 aggacgtgcc aaatctgcag cttgtccaag agccggcagc cgacgcgctg ctgaacgcca 1283761 acccattcgc gttgctggtg ggcatgttgc tcgaccagca ggtgccgatg gagaccgcct 1283821 tcgccgggcc gaagaagatc gccgatcgga tgggtagctt tgacgccggc gacatcgccg 1283881 actacgaccc ggataagttc gtcgcactgt gctcggaaag gcctgctata caccgatttc 1283941 cgggctcgat ggccaaacgc atccaggcgc tcgcgcagat catcgtggac cgctacgacg 1284001 gggatgcggc cgcattgtgg accgccggcg aacctgacgg gaacgagttg ctgcggcggc 1284061 ttaaggggtt acccggcttc ggtgagcaga aggcgcggat ctttctcgcg ttgcttggca 1284121 agcagtacgg agtgacgccg aagggttggc aggtggcagc cggggagttc ggtcagcccg 1284181 gcacctatct atccgtcgcc gatatcgtcg acgccgggtc gcttgggcag gtgcgatcgc 1284241 acaagaggca aaggaaagcg gcggccaagg cagagggaaa ggcgccaacg tgaagacaca 1284301 cctgacgtgt ccgtgcggcg aagccatcac cggcaaggac gaggacgagc tggtcgagct 1284361 gactcaggcc caccttgcca gcgttcatcc cggcctggag tacgaccgcg acgccatatt 1284421 gttcatggcg tactgatgga ccattcccgc tggtgctagg gcaccaccgt tgagccgatc 1284481 gtcggcatga actggcactg ccggtccttg gtggtcacct gcccgaagat cgttgacatg 1284541 atgctgcctg aaccggtgtc ggcgattacg gtcaacgtgg tcggtccgtc cggattgatg 1284601 tccgaacgcg gccgcagggt ggcgctgccg gactttcccg tggtcaggtt cacccacgtg 1284661 acgttcagcg gcaacctctg cacgtcggcg ggccccggcg tgccgacggc cgtgaacacg 1284721 taggcggtct ggccgggtcc gggaccgggc agcgggatct tggcgggccc cgccaccgac 1284781 agcgcggtcg cgatggaatt gctgccgtcc gccacacaat tggggccgat cgaggggtac 1284841 atgaagtcct gtgtgggcgg agcgtcggcg ccgaagcccg aggcaggcgc cggagcggcc 1284901 gctggcgccg gtgccgccga ggccggcgcc ggagcggcgg cgcgcggcgc aggtggcgcc 1284961 ggtggcggcc ccggtaccgc aaccggttgg gctgcgtcag gtgctggtgc cggcgcggaa 1285021 gccggcggcg cggcgaccgg cggcgtaacc gtaggtgcca cagcgggtgc ggggcccgcg 1285081 gcgtgggatg gatcgatgcc ggtcggaaga tgtgcctgca cacctggctc ggcgcccagt 1285141 gggggcacat gcggcaccgg gatggcctcg ggcagggcga caccatgagg cgccggcaca 1285201 cccagggcag cagagtcggg attggtcggc tcagccacaa actggttcac cgaggaagcg 1285261 acgttcttgg attcggtggg cactgccgga ttgccggcga acgcagacgc tgcggccatg 1285321 agcagttgcg tcgcctgtgc cgggttcatc gccgcttgct ggattatcgg actcagctgg 1285381 gccaacgccg gcaagcccgg tagctgttga gtggggttgg gctgcggcgt cgccgggtcg 1285441 gctgccgcgt tcggacacag cgcgaacgcg gcagccgagg tgatgacgac ggcggccaaa 1285501 cctttgcaca cactccaagt gcttgccacg gtggtgttct cccggtgttc ggtgttggtc 1285561 agccttctca cagatgcgtc agggcagcgc ggcgagcaac gacggcggcc cgggcggtaa 1285621 cgcgggcgcg ccgggagccg gcggcgtcgg cgcgatgggc gctgccggaa tgacccccga 1285681 tgcgagcgcc ggtaggtcgg ctggcagcga aagctgttgc ggcacttgga gcggaaggta 1285741 gggcagctgc ggtaggtcga ccttcgccga cggcacgccg ggaacggagg ccggcaccgc 1285801 ggccgccgcc ggggccggtg cggttatccc cgggatcggg gcgttcactc cgggaatggt 1285861 cggagctgcc gccggggcgg tgacggggag cgccggtgcc gccggggtta tccccgggat 1285921 cggggcgttc actccgggaa tggacggagt tagcgcgggg gccgcggctg ccgccggtgc 1285981 ggccggcgtc agtcctggaa aggtggcggt tatcccgggg gcggcgggtg cgggttccgc 1286041 aactttgggg gcgctcagcg gcggagtggc gcccagagcc gtcgcgaggt tttgcaggat 1286101 ttgcggtgcg ttggcggccg agctgatcag ctgctgcgga atgttgggag caggcgccgg 1286161 cgccggtgcc ggatctgcgt gagcgatacc gcccgtaagt agtgcggcgg acgaaccgac 1286221 caagacggcg gcggcgcgga caaacgtcca gatggttggc atgtctctcc ctggttagcg 1286281 gtgacgggtc tcgccgaacg tatcgcggtg cagatgtgac tcaagtgaca cgtgtggcat 1286341 ttatgtgatt gttacggata cgagtggttg tggtgaccgg gcacccgagt gatgtgccgc 1286401 accctgatcg acggcccggt gcgctcggcg atcgctaaag tcaggcagat agacaccacc 1286461 tcatccaccc cggcggccgc caggcgcgtg acctcaccac cggcccggga gacacgcgcc 1286521 gccgtgctgc tactggtcct cagcgtcggt gcgcgactcg cctggaccta tctggcgccc 1286581 aacggcgcaa acttcgtcga cctgcacgtt tacgtgagcg gtgcagcgtc cctcgaccat 1286641 cccggcaccc tgtatggcta cgtctacgct gatcagaccc cggacttccc gctgccgttc 1286701 acctatccgc cgtttgcggc tgtggtcttc tacccgttgc atttggtgcc gttcggtctg 1286761 atcgcgctgc tgtggcaagt agtgacgatg gccgcgctct acggcgcggt tcggatcagc 1286821 cagcgcctga tggggggcac cgctgagacc ggtcatttcg ccgcgatgtt atggacggcg 1286881 atcgccatct ggatcgagcc gttgcgcagc acctttgact atgggcagat caacgtgctg 1286941 ctgatgctgg cggcgctttg ggcggtctac accccgcggt ggtggctatc gggactgctg 1287001 gtcggggtgg cctcgggtgt caagttgacg ccggcgatta ccgctgtcta cctcgtcggc 1287061 gttcggcggt tgcatgcggc cgcattttcg gtggtcgtgt tccttgccac cgtcggcgtg 1287121 tcgctactgg tcgtcggcga tgaagcccgc tactacttca ccgacctgtt gggcgacgca 1287181 ggccgggttg ggcccatcgc cacctccttc aatcaatcct ggcgcggcgc gatttcccgg 1287241 attctcggtc acgacgccgg ttttggtccg ctggttctgg ctgcgatcgc cagtacggcg 1287301 gtattggcca tcctggcctg gcgtgcgctc gacaggtccg atcggctggg caaactattg 1287361 gtggtcgagt tgttcggcct gctgctctcg ccgatctcct ggactcacca ctgggtgtgg 1287421 ctagtgccgc tgatgatctg gctgattgac gggccagcgc gtgagcgccc gggcgcccgg 1287481 attttgggct ggggctggtt ggtgttgacc atcgtcggcg tgccgtggtt gctgagcttt 1287541 gctcaaccga gcatctggca aatcggccgg ccgtggtatt tggcctgggc cggtctggtc 1287601 tacgtggtgg cgacgctggc gaccttgggc tggatcgccg cctccgagcg ttacgtgcgc 1287661 attcggccgc ggcgcatggc caattaggcc ccaaacattg cgtcgatatc gtgcgccatc 1287721 gcaatgtcgt tttccgtgat accacctacc gcatgcgtaa ccagcgcgaa agttactgtt 1287781 cgccaacgga tatcgatgtc cggatgatga tttacctcct cggctcgctc ggccacccgg 1287841 cgtacggcgt cgataccggc cataaacgtc ggaaacttga ttgacctacg caggacacca 1287901 ccggcgcgct gccagccgtt gaggtcgtgc agtgcggcgt cgacctgctc atccgttaac 1287961 acagccatac ctcgacggta taccgtcaca ggtcatgctg aatcagatcg tggttgccgg 1288021 agccatcgtc cgcggttgca cggtcttggt ggcgcaacgc gttcggccac cggagttggc 1288081 gggtcgttgg gaacttcccg gcggtaaggt cgccgccggc gaaaccgagc gcgccgcgct 1288141 ggcccgagag ctcgccgaag aactgggact cgaggtcgcc gacctcgcgg tgggcgaccg 1288201 tgtgggcgac gatattgcgt tgaacggcac gacgacgctg cgggcctatc gcgtgcatct 1288261 gcttggcggc gaaccgcgtg cgcgtgacca ccgggcgctg tgctgggtga cggcggccga 1288321 actgcacgat gtcgactggg taccagccga ccgcggctgg attgcggacc tggcgcgaac 1288381 cctcaacggg tccgccgcag atgtccaccg tcgctgttag gaaaccgacg gtgtggttga 1288441 cggtggccgc cgtcaacttg gttagaacaa cgtgacaaaa cgttaacttg ggtttgcatg 1288501 cccgtagcga tcacgatggt tttctggacg cgtggcgaca acttccgggc aggacgctga 1288561 cgcccatcca tcgagatacc cgatgttgac gagaggggtc cccgacccgg cggaccgggg 1288621 cttgacgggc gcaatgcggc gcggccggcc agcccgtaac gtccagcgag tgcggtcgcg 1288681 cgccgacggc ccggccccac accgctcatg acgaggaggg tcatcccgtg accgttacac 1288741 ctcacgtcgg tggaccgctc gaagagctgc tggagcgcag cgggcgcttc ttcaccccag 1288801 gtgagttctc ggccgacctg cgcaccgtaa cccggcgcgg cggccgcgaa ggtgacgtgt 1288861 tctaccgcga tcggtggagt cacgacaaag tggtccgatc cacgcacgga gtcaactgca 1288921 ccggatcctg ctcatggaag atctacgtca aagacgggat catcacctgg gaaacccagc 1288981 agaccgacta cccgtcggtg ggcccggacc ggcccgaata cgagccacga ggttgtcccc 1289041 gtggcgcgtc gttctcctgg tacagctatt cgccgacgcg ggtgcgctat ccgtatgccc 1289101 ggggcgtgct ggttgagatg taccgggaag ccaagacccg cctgggcgac ccggtgctgg 1289161 cgtgggccga cattcaggcg gatcccgagc gcagacgccg ctatcaacag gcccgcggca 1289221 agggtgggct ggtccgggtg agctgggccg aggccagcga gatggtggcc gccgcccacg 1289281 tgcacaccat caagacatac ggcccggacc gggtcgccgg cttctcgccg attccggcga 1289341 tgtcaatggt cagccatgcc gcggggtccc ggttcgtgga gctgatcggc ggcgtgatga 1289401 cgtcgttcta cgactggtac gccgacttgc cggtggcctc gccgcaggtg ttcggcgacc 1289461 agaccgacgt gcccgaatcc ggcgactggt gggatgcgtc gtatttggtc atgtggggct 1289521 ccaacgtccc gatcacccgg acgcccgacg cacattggat ggcggaggcc cgttaccgcg 1289581 gcgctaaagt cgttgtcgtc agcccggact acgccgacaa caccaagttc gccgacgagt 1289641 gggtgcggtg cgccgccggt accgataccg cgctggcgat ggcgatgggc cacgtgatcc 1289701 tgtcggaatg ttacgtccgt aaccaggttc cgttctttgt cgactatgtg cgccgctaca 1289761 ccgacctgcc gtttttgatc aagttggaaa agcggggcga cctgctggtt cccggaaagt 1289821 tcttgaccgc ggccgacatt ggtgaagaaa gtgagaacgc ggcgttcaaa cccgccctgc 1289881 tggatgagct tacgaatacc gttgtcgtgc cgcagggctc actgggattc cgtttcggtg 1289941 aggacggtgt tgggaagtgg aacctggacc tgggttcggt ggtgccggcg ctaagtgtgg 1290001 agatggacaa ggctgtcaac ggcgatcgca gtgctgaact ggttacgctg cccagctttg 1290061 acaccatcga cgggcacggt gagacggtgt cgcgtgggct gccggtgcgc cgggcgggca 1290121 agcatctggt gtgcacggtg ttcgatctga tgttggccca ctacggggtg gcgcgtgcgg 1290181 ggctgcccgg cgaatggccg accggctacc acgaccgaac ccagcagaac accccggcct 1290241 ggcaggagtc gatcaccggt gtgccggccg cgcaggcaat ccggtttgcc aaggaattcg 1290301 cccgcaacgc gaccgaatcc ggaggacggt cgatgatcat catgggcggc ggaatctgtc 1290361 actggttcca cagcgatgtc atgtaccgct cggtgttggc gctgctcatg ttgaccggat 1290421 cgatgggacg caacggcggc gggtgggcgc actacgtcgg ccaggagaag gtgcgtccgt 1290481 tgaccgggtg gcagacgatg gcgatggcca ccgactggtc gcggccgccg cgtcaggtgc 1290541 ccggcgcgtc gtactggtat gcgcacaccg accaatggcg ctacgacggc tacggcgcgg 1290601 acaagcttgc cagcccggtg ggtcgcggca ggttcgccgg caagcacacc atggacctgc 1290661 tgacctcggc cacggcgatg ggctggagcc cgttctatcc acaattcgat cggtccagtc 1290721 tcgatgtcgc cgacgaggcc cgcgccgcgg gccgcgacgt gggtgattac gtcgccgaac 1290781 aacttgccca gcacaagctg aagctctcga ttaccgatcc ggataacccg gtcaactggc 1290841 cgcgggtgct caccgtctgg cgggcgaacc tgatcggctc gtcgggcaag ggcggcgagt 1290901 atttcttgcg gcatctgctg ggcaccgact ccaacgtaca gtccgaccct cccaccgacg 1290961 gtgtgcatcc ccgggatgtg gtgtgggaca gcgacattcc agagggcaag ctcgacctga 1291021 taatgtcgat cgacttccgg atgacgtcga cgacgctggt gtcggatgtc gtgttgcccg 1291081 ccgcgacctg gtacgagaaa tccgacctgt ccagtaccga tatgcacccg tacgtgcact 1291141 cgttcagtcc ggcgatcgat ccgccgtggg aaacccgttc ggactttggc gcattcgccg 1291201 ccatcgcgcg tgctttcagt gcgctggcga aacgtcatct gggcactcgc accgatgtgg 1291261 tgctgaccgc gctgcagcac gacaccccgg atgagatggc atatcccgat ggcaccgaac 1291321 gtgattggct ggcgaccgga gaagtcccgg tgccaggcag gacgatgagc aagctcactg 1291381 tggtggagcg ggactacacc gcgatctacg acaagtggct gaccctggga ccgctcatcg 1291441 accagttcgg gatgaccacc aagggatata ccgtccatcc cttccgggag gtcagcgagc 1291501 tggcagccaa cttcggggtg atgaattccg gtgtggcggt gggtcgtccg gcgatcacca 1291561 cggctaagcg gatggctgac gtgatcctgg cgctgtccgg cacatgcaac gggcgactcg 1291621 cggtcgaggg attcctcgag ctggagaagc gtaccgggca gcggctggct catctggccg 1291681 agggcagcga ggaacgccgc atcacctacg ccgataccca ggcgcgtccc gtgccggtga 1291741 tcaccagccc ggaatggtcg ggcagcgaga gcggtggccg ccgctacgcg ccgttcacga 1291801 tcaacatcga gcatcttaag ccgtttcaca cgctcaccgg gcgtatgcac ttctacctgg 1291861 cgcatgactg ggtcgaagaa ctcggcgagc agttgcccgt ctatcggccg ccgctggaca 1291921 tggcgcggct gttcaaccag cccgagctcg gaccgaccga cgatggactc gggctcaccg 1291981 tgcgctatct gacgccgcac tccaagtggt cgtttcactc gacctaccag gacaacctat 1292041 acatgttgtc gttgtcccgt ggcggtccga cgatgtggat gagcccgggt gacgcggcga 1292101 aaatcaatgt gcgcgacaat gattgggtag aggcggtcaa tgccaacggc atctacgtgt 1292161 gccgggcaat cgtcagccac cggatgcccg agggtgtggt gttcgtctac cacgtgcagg 1292221 agcgcaccgt ggacacgccg cgcaccgaga ccaacggcaa acgcggcggc aaccataacg 1292281 cgctgacccg cgtacgaatc aaacccagcc acctggccgg tggctacggc cagcacgcgt 1292341 tcgcgttcaa ctacctgggt ccgaccggta accagcgtga cgaggtgacc gtggtgcgcc 1292401 gccgcagcca ggaagtgcgg tactgaccaa tgaagggccc gagcgacgct tgcggagcga 1292461 gacgatgaag gtcatggcgc agatggcgat ggtgatgaac ctcgacaaat gcattggttg 1292521 ccatacctgc tcggtgacct gcaagcaggc ctggaccaat cgctcgggaa ccgagtacgt 1292581 gtggttcaac aatgtcgaaa cccgtccggg tgtgggctac ccgcgcacct acgaggatca 1292641 ggagcggtgg cgcggggggt gggtgcgcga caagaagggc cggctgcggc tgcgcgacgg 1292701 cggccggatc cataagctgt tgcgcatctt tgccaacccc aagctgccca ctatcggcga 1292761 ctactacgag ccgtggacct atgactacga aaacctgaca tcggcgccgg cgggtgacac 1292821 ctttccgacc gcggcgccgc gaagcctgat cagcggcaat ccgatgaagg tgtcgtgggg 1292881 atccaactgg gacgacaacc tggccgggtc gccagagatc gtgccgaacg acccggtgct 1292941 aaagaaggtc aaccaagtca accaagaggt caagctgaag cttgaagaga ccttcatgtt 1293001 ttacctgccg cggatctgcg agcactgcct gaacccgtcg tgtgtggcgt cgtgtccgtc 1293061 gggggcgatg tacaagcgca ccgaggacgg catcgtgctc gtcgaccagg accgctgccg 1293121 cggctggcgg atgtgtgtgt ccgggtgccc atacaagaag gtgtatttca accacaagac 1293181 cggcaaggcc gaaaagtgca ccctgtgcta tccgcgcatc gaggtggggt tgccgacggt 1293241 gtgctcggaa acgtgtgtgg ggcggctgcg ctatctgggt ctggtgctct atgacgtcga 1293301 tcaggtgctg caggccgcgt cggtggaaag cgacaccgac ctctacgagg cgcagcgccg 1293361 gatcctgctg gacccgcacg atccgcgggt gatcgccggg gcgcgcgcgg aaggcatcgc 1293421 cgacgagtgg atcgaggccg cccagcggtc cccggtgtac gcgttgatca acacctaccg 1293481 ggtggcgctg ccgctacatc cggagtaccg gaccatgccg atggtctggt acatcccgcc 1293541 gctgtcgccg gtggtcgacg cggtcagccg cgacgggcac gacggggagg acctgggcaa 1293601 tttgttcggc gcgctggacg cactgcggat tccgattgcc tatctggccg agctgttcac 1293661 cgcgggcgac accgaggtgg tcgcgggcgt gttgcggcgg ctggcggcga tgcgctgcta 1293721 catgcgcgac atcaacctgg gccgggagac ccagccccac atcccggaat cggtcgggat 1293781 gaccgaggag cagatctacc agatgtaccg actgttggct gtggcgaaat atgaagagcg 1293841 ctatgtcatt ccgacgtcgt acgcggggga gctgccggcc gcggcgatga ccgacgatat 1293901 ggggtgctcg ttgtcggtcg acggcggacc gggaatgtac gagtccggtc cgttcgggca 1293961 gggcagccct actccggtgc caatcgccgt ggagagcttc cacgctctgc agcatgccgg 1294021 tagcgcggcc accggcggcg ctggccgatc ccgggtcaac ctgctcaact gggaccccaa 1294081 cggcgcagcg gcggggctct tcccggagcc tcagcccagc aaggatgtgg tccagcgatg 1294141 aagttgctgt ctcgtgtccg agagcggtcg agcgccacca caatgaggga ccgactggtg 1294201 tggcagtcgg cctcgctact gctggcctat ccggatgacg ggctggccga gcggctgcac 1294261 atggtcgatg cgctgcgcgc ccaccaaacg ggcccggcgg cggcgctgct agggcgaacg 1294321 gtagcggagt tgcgtgccct ggcgccgatg gccgcggcgg cgcagtacgt cgagaccttc 1294381 gatatgcgac gccgatccac gatgtatctg acgtactgga ccgccgggga cacccgcaac 1294441 cgcggccggg agatgctggc gttcgccacc gcctatcgag acgccggcgt caagccgccg 1294501 cgtaccgagg cgcccgacta cctgcccgtc gtgctcgagt tcgccgccac cgtcgacccc 1294561 gaggccggac gtcggctgct gaccgagcac cgtgtgccga tcgacgtgtt gcgcggcgcg 1294621 ctggccgacg ccaagtcacc ctatgagtac accgtggcgg cgatctgcga gacactgccc 1294681 gctgccacca accaggaagt gcgtcgggca caacgcctag ctcagtcggg gccgcccgcg 1294741 gaagccgttg gtttgcaacc gtttaccttg accgtcccgc ccaagcgcgc cgagggggcc 1294801 tgaccttggc cgtcttggac ttggttgaga tcttctggga tgccgcgcct tacgtcgttg 1294861 tggcgatcgc ggtggtcggc acctggtggc ggtatcgcta cgacaagttc ggctggacca 1294921 cacgctcgtc gcagctctac gagtcgcggt tgctgtcgat cggcagcccg atgttccatt 1294981 tcggcagctt gctggtgatc atgggccacg tgatgggcct gttcattccg gattcctgga 1295041 ccagagcgtt cggcatgagc gatcacctgt accatctgca ggcgctgctg cttggcgcgc 1295101 ccgccggttt cgccactctg ctcggtatcg ggttgctgat ctatcggcgg cgcatccaga 1295161 caccggtgtg gctggctacc actcggaatg acaagctgat gtacctggtg ctggtgtgcg 1295221 cgatcgtggc tggcctggca tgcacgctga tgggcgccac ccatgagggc gatatgcacg 1295281 attaccggcg ctcggtgtcg gtctggttcc gctcgatctg gatgctagcg ccgcgtggcg 1295341 atctgatggc ccaggcgacg ctgtactacc aggtgcatgt gctgatcgcg ctcgcgctgt 1295401 ttgtgctctg gccgtttacc cgattggtgc acgcgttcag cgcgccgatc gcctacctgt 1295461 tccggcccta catcgtgtac cgcagccgcg aggtggcggc caagcacgaa ttgatcggtt 1295521 ccgcgccgcg tcgtcgtggg tggtagttct ctgccacaat caccgtcgtg ccattccgca 1295581 acgttgccat cgtcgcgcac gtcgaccacg gcaagaccac cctggttgac gccatgttgc 1295641 ggcagtccgg ggcgctgcgt gaacgcggtg agctgcagga acgggtgatg gacacgggcg 1295701 atctggagcg ggagaagggc atcaccatcc tggccaagaa caccgccgtg caccgccatc 1295761 acccggatgg aaccgtcacc gtaatcaatg tcatagatac cccggggcac gcggacttcg 1295821 gtggcgaggt ggagcgcggg ctgtccatgg tggacggggt gctgctgctg gtcgacgcct 1295881 ccgagggtcc attgccgcag acgcggtttg ttctgcgtaa agcgctggcc gcccatttgc 1295941 cggtgattct ggtggtcaac aagacagacc ggcccgacgc ccgcatcgcc gaggtcgtgg 1296001 acgccagcca cgacctgttg ctagatgtcg cgtccgacct tgacgacgaa gcggccgcag 1296061 cggccgaaca cgcgctgggc ctgccgacgc tgtacgcatc cgggcgcgcc ggggtggcga 1296121 gcaccacggc gccgcccgac ggccaggttc ccgacggcac caacctggat ccgttgttcg 1296181 aggtgctcga aaagcatgtg ccgccgccga aaggagagcc ggacgcaccg ctgcaggcgc 1296241 tggtcaccaa cctggacgcg tcgacctttc tgggtcggtt ggcgctgatc cgcatctaca 1296301 acggccgcat ccgcaaaggc cagcaggttg cgtggatccg tcaggtggat ggtcagcaga 1296361 ccgtcaccac tgccaagatc accgaattgt tggccaccga aggcgtggaa cgcaaaccaa 1296421 ccgacgctgc cgtcgccggc gatatcgtcg ccgtcgccgg cctgcccgag atcatgatcg 1296481 gcgacacgct ggccgcttcc gcgaatcccg ttgccctgcc caggattacc gtggacgagc 1296541 cggcgatctc ggtcaccatc ggcaccaaca cctcgccgct ggcgggcaag gtgggtggtc 1296601 acaagctcac cgcgcgcatg gtccgaagca ggctggatgc cgagctggtg ggcaacgtgt 1296661 cgattcgtgt cgtcgacatc ggcgccccgg acgcctggga ggtacagggt cgcggcgagc 1296721 tggcgctggc ggtgctggtc gagcagatgc gccgagaggg tttcgaattg accgtgggta 1296781 agccacaggt ggtgaccaag accatcgatg gcacgctgca cgagccattc gagtcgatga 1296841 ccgtcgactg ccccgaggag tacatcggcg cggtcacgca attgatggcc gcgcgcaagg 1296901 gccgcatggt ggagatggcc aaccacacca ccggctgggt ccgcatggac ttcgtggttc 1296961 ccagtcgcgg cctgattggg tggcgcaccg acttcctcac cgagacccgt ggctccggtg 1297021 tcgggcatgc ggtgttcgac ggataccggc catgggcggg ggagatccgg gcccgccaca 1297081 ccggttctct ggtatcggac cgggccggcg ccatcacacc gttcgcgttg ctgcaactcg 1297141 ccgatcgggg gcagttcttc gtcgagcccg gccaacagac ctacgagggc atggtcgtcg 1297201 ggatcaaccc ccgtccggag gacctcgaca tcaatgtcac ccgggagaag aagctgacca 1297261 acatgcgctc atcgaccgcg gatgtcatcg agacgctggc caagccgctg cagctggatc 1297321 tcgagcgcgc catggagtta tgtgcgcccg acgaatgcgt cgaggtgacc ccggagatcg 1297381 tgcggatccg caaagtcgag ctggccgccg ccgcccgggc tcgcagccgg gcgcgcacca 1297441 aggcgcgtgg ctagcaactt ggcgcgctgg ccgcgcgagc gtaacgccac tgcgaaatcc 1297501 agcccggctt ttcgcagccg ggttacgctc gtgggggtac tggatagcct gatgggcgtg 1297561 cccagcccag tccgccgcgt ctgtgtgacg gtcggcgcgt tggtcgcgct ggcgtgtatg 1297621 gtgttggccg ggtgcacggt cagcccgccg ccggcacccc agagcactga tacgccgcgc 1297681 agcacaccgc ccccgccgcg ccgccctacc cagatcatca tgggcatcga ctggatcggc 1297741 cccgggttca acccgcattt gctgtccgac ctgtcgccgg tgaacgccgc aatcagtgcg 1297801 ttggtgttgc ccagcgcgtt ccggccgatt ccggatccca acacgccgac cggttcgcgc 1297861 tgggagatgg acccgaccct gttggtttcc gccgacgtga ccaacaacca cccgttcacg 1297921 gtgacctaca agatccggcc cgaggcgcag tggacggaca acgccccgat cgccgccgac 1297981 gacttctggt atctgtggca gcagatggtc acacagccgg gcgtcgtcga ccccgccgga 1298041 taccacctga tcaccagtgt ccagtcgctc gagggcggta agcaggccgt cgttacgttc 1298101 gcacagccct accccgcttg gcgtgagttg ttcaccgaca tcctgccggc gcacatcgtc 1298161 aaggacatac cagggggctt cgcgtccggt ttggctcgag cgctgccggt gacaggtgga 1298221 cagtttcggg tggaaaacat cgacccacag cgcgatgaga tcctgatcgc ccgcaatgac 1298281 cgttactggg gcccaccttc caaacccggc atcattctct tccgccgggc cggggcgccg 1298341 gccgcgctgg ccgattcggt acgtaacgga gacacccagg tcgcccaggt gcatggtggc 1298401 tcggcggcct tcgcccagtt gtcggccatc cccgacgtgc ggaccgcccg gatcgtgaca 1298461 ccgcgggtca tgcagttcac gctgcgggca aacgttccca agctggccga cacccaggtt 1298521 cgcaaggcga ttttggggtt gctggacgtg gacctacttg ccgccgtggg cgccggcacc 1298581 gacaacaccg tcaccttgga ccaggcgcag attcgttcgc cgagtgaccc gggttatgtt 1298641 ccgaccgcgc ctcccgcaat gagcagcgcc gccgcgctgg gtctgctgga ggcatcggga 1298701 ttccaggtcg acaccaacac gtcggtgtcg ccggcgccgt cggtccccga ttcgacgacc 1298761 acgtcggtga gcaccgggcc gccggaagtc atccgcggcc ggatcagcaa ggacggcgaa 1298821 cagttaacgc tggtcatcgg ggtggccgcg aacgatccga cctcggtggc ggtcgccaac 1298881 actgctgccg accagctgcg cgacgtcggc atcgccgcga ctgtgctggc gttagacccg 1298941 gtcacgctct atcacgacgc gctgaacgac aatcgggtag acgccattgt gggctggcgc 1299001 caagccggcg gaaacctggc gacgctgctg gcctctcgtt acggctgtcc cgcattgcag 1299061 gcgacgacgg tcccggctgc gaatgcgccg acgacggccc cgtccgctcc cattggccct 1299121 acgccgtccg ccgcgcccga caccgcgaca ccgccaccaa cggcgccgcg ccgcccatcc 1299181 gacccgggcg cgctggtaaa agcgccgtcg aatctcaccg gcatctgcga ccgcagcatc 1299241 cagtcgaaca tcgatgccgc actcaatggc accaagaaca tcaacgacgt gatcaccgcg 1299301 gtcgaaccgc gactgtggaa tatgtcgacc gtgttgccga tcctgcagga caccacgatc 1299361 gtcgcggccg gcccgagcgt gcagaacgtc agcctgtctg gtgcggtgcc agtgggcatc 1299421 gtcggcgacg ccggccaatg ggtgaagacc gggcaatagc cctggtcacg ccggcggaat 1299481 cgtcggctag ctctcgcggc gttcgccggt ggtgaggatc atggcgtcga taatgcgtgt 1299541 gagctgctca cggtccggcg gggatccggt aaacaagaca tgctgatgga tcaaggccgg 1299601 tccgattcgt gcggtcatcg gagtcagagt tgccgggtcg atttcgccgg aacgcacgcc 1299661 cgcctgcagg atggactcga caattcgcag ccgcggggcc cacaccgagt tgatgaagat 1299721 ggcgcgcagc tcgggctcgt gtaggagctg gctgacgatt tccatgctgg ggagggccgt 1299781 cttgccggcc aggatttcgc agttggcggt gaacaccgcc agcagattct cccttgccga 1299841 ccggtcagcg cgcggctcgg gtaccggcgg caaagcgtat tgcaccgcgg ccagcaccag 1299901 ctcacgtttg ccggcccacc gccgatacaa cgcggctttg ccggtttggg cgcgtgccgc 1299961 gatgccttcc atggtcagcc cgccgtatcc ggcggattcg agttcggcca gcgtcgcatc 1300021 gtagagcgca cgctcaagca cctcgccgcg ccgccggtac gggttggcct ttgcgggtgc 1300081 gctcaccgtc atgctgcgat actagccaac tgcggctttt ccgccggcgc ggttcgatcg 1300141 atgcatcagg tgaggccctt ttgctagccg gcggcgggtg accgcagtat cactccggaa 1300201 cgggttcttg ccgcgacggc gcccacagcg cccccggcca ggcttgccaa tcccagctgg 1300261 gcccacgagg ttcccgacgg accgcccggc gcggaggtcg ggacggcttt ggccaacggc 1300321 gtaatcgatt tgtccgcaac ccagctgggg ggcaccgaca accccccgat cgtatccgca 1300381 tttcccaatg ttgccgacac cgcgggcctg accgcgttgg gtagcaactc cagcgggccc 1300441 ttgagcgtgg gtgtcaggct ccgcatggcg aggttgccca tcatctggcc catgttgctg 1300501 ccctcgacaa aggccaacac gtattccacg ggtatcgacg agattcgcgc ggctgtgagc 1300561 gggtaggtgc ccttcgtcaa cagcgtccac agcttcgaag gcagattctt ggtcgccggc 1300621 gcggcgaatg accccaccgt ctcggacacc gaaccgatca gttcataaag tctggccagc 1300681 gcgcccgggt tggcgatcgg cgccgggggg cgagaacggt gtcagccgtg cggcggccgc 1300741 cgccattgtg gcgtagaggt tcatcgcctc gccatcttgg gaccagtact gggcatacag 1300801 tgcatcgagg gcaaagatcg cgggggtgtg aatcccgaaa atgttggtcg tggcgagagc 1300861 gaggcgcgtc agtcggttgg tctcgatcac cggcagcggc acgtgtgctg cgtgcgccgc 1300921 ttcataggcg cctgccacga cgctgatgtg gtcggcgacg agttcagcca gggaagcggt 1300981 cgtgacaatc caggcccgaa atggggcgac cgcagctgcc atgatcgtcg acgatggccc 1301041 ccgccacgat gtgatcagcc cgttgatctc actctcgaac cgactggccg cgtagctcag 1301101 ctcgttggac agattcttcc aggcgttcgc ggctactaga aacggacgag cgctaccttg 1301161 gatgttgagg gagttgaact ccggcggaaa aattgtgaaa tccattgtcg ctcaaccgct 1301221 gtctaggtgg aggtgcccgc gcggttggct aattcggtga gccaatacga agtcttgctg 1301281 gtctgaagtg tttggacaaa tgactcgtgg atcacatggg cctggcgcgc gatcgccttg 1301341 tacagctcgc cgtgcatgga aaacagcatc gacgtcacga tggacacaag atcgtgggcg 1301401 ggggattcca cattggtgat cagcggcgtg accccgtcat catgggcgct catcgtcacc 1301461 ccgatctcgt ggaggttggc ggccgtttcc ccaatcgaat cgggccgtgt ggtgacaaaa 1301521 gacacgcgtg catctccttc cactgacgtg gtctgatggt gggggtcagc gacgacttgg 1301581 ggttccgcac ggcattgtag acggaatcgt tcactaaggt attttcacca taacggcttc 1301641 ggtcacaaaa cggtagcgat tctgttgagg aattttttcg acgctcgccc ggtagggtgc 1301701 ctccatgtct gagacgccgc ggctgctgtt tgttcatgca caccccgacg atgagagcct 1301761 gagcaacggc gcaaccatcg cgcactacac ctcccgtggc gcacaggtcc atgtcgtcac 1301821 gtgcaccctg ggtgaggagg gcgaggtcat tggcgatcgc tgggctcaac tcaccgccga 1301881 tcatgcggac caactcggtg gctaccgcat cggcgagctc accgcggcgt tgcgagcgct 1301941 cggggtcagc gcaccgatct accttggcgg cgcgggtcgc tggcgcgact ccggcatggc 1302001 cggcacagac cagcggagtc agcggagatt cgtcgatgct gacccccggc agaccgtcgg 1302061 ggcattggtc gcgatcattc gcgagctgcg gccgcatgtc gtggtgacct atgaccccaa 1302121 tggcggttac ggtcatcctg accacgtgca cacccacacc gtcactaccg ccgcggtggc 1302181 cgcagcgggt gttgggtccg gtaccgcaga tcaccccggc gacccgtgga cggtgccgaa 1302241 gttctactgg acggtcttgg gtctgagcgc gctcatttcg ggcgcgcgag ccctggtccc 1302301 cgacgatctg cgacccgaat gggtgttgcc gcgggccgac gagattgcat tcgggtactc 1302361 cgacgacggt atcgacgccg tcgtcgaggc cgatgagcag gcgcgagccg ccaaggttgc 1302421 ggcactggct gcccatgcca cccaagttgt cgtcggcccg accggccggg ccgccgcctt 1302481 gtcgaacaac ctggcactgc ccatcctggc cgatgagcat tacgtgctcg ccggcggctc 1302541 cgcgggcgcc cgcgatgaac gtggctggga aactgatctg ctcgccggtc tgggcttcac 1302601 cgcgtccggc acgtaggctg ccaaccaggc agccacggaa ggaaccccat ggaccccgac 1302661 ctggacccta acctgcagca ttggcaggac cgactcgaca gcctgcagtg ggtcatcggg 1302721 tcgatactct ctcagatcga cagcgtgcca acctgaccac cggcgcgaca gatcgagcaa 1302781 tccgtttggt tgtcctggcc ctgttgactg tcgacggggt cgtgtctgcg cttgccgggg 1302841 ctctgctgat gccctggtat atcggctcgg ctccgtttcc gatcagtgcc ttgatcagtg 1302901 gattggtcaa tgctgcgctg gtgtgggccg cagcgcgatg gaccacatcg tcgcgggtgg 1302961 ccgcgctgcc cctgtgggcg tggctactga cggtagcggc gatgagcttc ggcggccctg 1303021 gcgacgatgt cattctgggt ggccagggcc tgctggtcta cggcgcgctg gtgttcgtcg 1303081 tggcaggggc cgtgccaccg gcgtgggtgc tgtggcggcg cagggtccaa gctgacggat 1303141 ctggctagtc cgaagttagg gcaaagacgg gaatcccggc gggctgattg gcggcaacgg 1303201 cggcaggaag ccgcgtatcc agttgatctc ggtgttgatg aattggttga tcgatgccgc 1303261 ggtggccgtt tcgatattgg ttagtgcctg gctgaaagtg actgtcccgt ccacgaagtc 1303321 gatcgcattg aacaggactg cctgcacgat gggctcgccg aggtaataga agaagttgat 1303381 ctgcggtgcc agtatgccga tgtagggcag ccatcccacc gcccatgcgg tgaggttgaa 1303441 gccgtactgc acccacggtt cgacggcgtt gtagagattc ttgattgcgt tgccgatcga 1303501 ctcggcggcc agagccggca gcgccgcggc ggcggccgct ccggcgcgcg gcagcagggc 1303561 gccgacggcg gacaaaccgc tggccccacc ggtgggttgg agctgcagtg caccgctggc 1303621 agcggcattt gcagcggcgc taccgccaag tgcactggag ccgccagtgc cgaggaacat 1303681 gctgtggacc cggctcagcg cgttcgagcc cccacccgtc gaggcgtcgg cgctaatcag 1303741 tggatgcccc agcaacgctc ttgcgggcgc attcacggcg ttcaccatgg tctgcgcggc 1303801 gctggcctcg gcggttgcat acgcgtctgc acttgctctc agcgcctgca cgaactggtc 1303861 gtggaaggct gtcatcatct gccggctgag ctgctgatac ccctgagcat gcgcggaaag 1303921 cagcgcggcg acctgagtcg agacctcgtc cgcggctgcg gccaggactc cggtggtggg 1303981 aaccgccgca accacattgg cggcgttaag agtcgaaccg ataccggcca tgtccgcagc 1304041 ggccgccgcc agtgcctctg gcgccgcgaa cacaaacgac atctcgtacc ttctcctggt 1304101 tcaccacgcg gcggctgtcg ccgggggctt gttcagacgc tggcctctca cggatggtat 1304161 cgcgatcggc tgtgacctgc gccttactcc accaaaccgt tggtgccgga cggtcgacgg 1304221 cgtgccgagc tcggcctggc gctactgttg cgcttatggc gccaaggttg gccagcatct 1304281 cacctggtgg ggcgtgcggg tgatatcaga ttgcagggaa ggtataccaa cgtgccgcag 1304341 cctgtaggtc ggaagtccac cgctctgccg agtcccgttg taccgcccca ggcaaatgcc 1304401 tcagcgttgc ggcgggtact gcgacgggcc cgagatggtg tcacgctgaa cgtggatgag 1304461 gcggccatag cgatgaccgc acgcggtgac gagctggccg acctgtgcgc gagcgccgcg 1304521 cgggtgcgcg atgcgggtct cgtgtcggcc ggccggcacg ggcccagcgg caggttggcg 1304581 atcagctatt cgcgcaaggt gtttatcccg gtcacccggt tatgccggga caattgccac 1304641 tattgcacgt tcgtcaccgt gccgggcaag ctacgcgccc aaggttccag cacgtatatg 1304701 gaacccgacg agatcctcga cgttgcccgc cgaggtgccg aattcggttg caaggaagcg 1304761 ctattcactc tcggtgaccg tccggaggcg cgttggcgcc aggcacgcga atggctcggc 1304821 gaacggggct atgactccac gttgtcctac gtgcgcgcga tggcaatccg tgtgctggag 1304881 caaaccgggc tgttgccgca cctgaacccg ggtgtgatga gctggtcgga gatgtcgcgg 1304941 ctcaaaccgg tggcgccgtc gatgggcatg atgctggaga cgacctcgcg acggctgttc 1305001 gaaaccaagg ggctcgccca ctacggcagc cctgacaaag acccggcggt gcggctgcgt 1305061 gtcctgaccg acgccggccg gttgtccatt ccgtttacca ccggtctgtt ggtcggcatc 1305121 ggcgagacgc tatccgagcg cgccgatacg ttacatgcga ttcgcaagtc gcacaaggag 1305181 ttcgggcata tccaagaagt gatcgtgcag aacttccgcg ccaaggaaca caccgcgatg 1305241 gccgccttcc ccgatgccgg aatcgaggat tacctggcga cggttgcggt ggcgcggctg 1305301 gtgctgggcc cgggcatgcg catccaggcg ccgccgaacc tggtgtctgg cgacgaatgc 1305361 cgggcgctgg ttggcgccgg ggtcgacgac tggggcggtg tctcaccgtt gacgcccgac 1305421 catgtcaacc ccgaacggcc ctggcccgct ttggacgagc tggcggcggt caccgccgaa 1305481 gccggctacg acatggtgca gcggctgacc gcgcaaccca aatacgtaca ggcgggcgcg 1305541 gcgtggatcg acccgcgggt gcggggacat gtggtggcgc tggcggatcc ggcgaccggc 1305601 ctggcccgcg acgtcaaccc ggtgggcatg ccgtggcagg agcccgacga cgtggcgtcc 1305661 tggggccggg tcgatctggg cgcagcgatc gacactcagg gccgcaatac cgcagtgcgc 1305721 agcgacctgg ccagcgcctt cggtgactgg gaatcgatcc gcgagcaggt gcacgagctg 1305781 gcggtccgcg ctccggaacg cattgacacc gatgtgcttg ccgccctgcg atcggcggag 1305841 cgtgcgcccg ccggctgcac cgacggcgag tatctggcgc ttgccaccgc cgacggtcct 1305901 gcgctggaag ccgttgccgc actggctgat tcgttgcgcc gcgatgtcgt cggcgacgag 1305961 gtgacctttg tggtcaaccg taacatcaac ttcaccaaca tctgctacac cggttgccgg 1306021 ttctgcgcgt tcgcccagcg aaagggtgac gccgacgcct actcgctgtc ggtcggagag 1306081 gtcgccgacc gggcatggga ggcccacgtc gccggggcca ccgaagtatg catgcagggc 1306141 ggtatcgatc ccgagctacc ggtcaccggc tacgccgatc tggttcgtgc cgtcaaggcg 1306201 cgggtgccct ccatgcatgt gcacgcgttt tccccgatgg agatcgccaa cggcgtcacc 1306261 aagagcgggc tgagcattcg cgagtggctg atcggcctgc gcgaggccgg gctggatacc 1306321 atcccgggta ccgccgcgga aatcctggac gacgaggttc gctgggtgct gaccaagggc 1306381 aagctgccga cgtcattgtg gatcgaaatc gtgacgaccg cccacgaggt gggtctgcgg 1306441 tcatcatcga cgatgatgta cgggcatgtg gacagtccac ggcactgggt cgcccatctt 1306501 aacgtgctgc gcgatattca ggaccgtacc ggcggcttca ccgagttcgt cccgttgccg 1306561 ttcgtgcacc agaattcacc gttgtacctg gccggtgcgg cgcgccccgg gcccagccat 1306621 cgcgacaacc gcgcggtaca tgctttggcg cggatcatgt tgcacggccg catctcgcac 1306681 attcagacca gctgggtgaa acttggagtg cggcgcaccc aggtgatgct cgaaggtggc 1306741 gccaacgacc tgggcggcac gctgatggag gagaccatct cgcggatggc cggttccgaa 1306801 cacggatcgg ccaagaccgt cgctgagctg gtcgcgatcg ccgaaggcat cggccgcccg 1306861 gcgcgccagc gcactaccac atacgccctg cttgcggcct agccccggcg acgatgccgg 1306921 gtcgcgggat ggggcccgca tgggcttaat agttgttgca ggagccggca accgactcga 1306981 caaggccgat gtactgtgcc gcccccggca cagcttgcaa ttgcgcggcc atggcagcgc 1307041 gctgaggtgg cggtgcggcg aggaaattgc gcaaatagga ctgcgccacc ggtgaggcgt 1307101 tgaactgtgc ggcagccccc ggatccgtcg cgttgagcgc agctactacc tgcccgtaat 1307161 tgcaggtggt gttaatgacc gcgtccacgg gatctgcgga ggcgaccccg gccccgacgg 1307221 tcaacgacat tgccacggcg cctacaccgg cgctcaatgc ggtcaacgac agcctcattt 1307281 atggacacct tccccaaact attgcaccgt cgttaagacg gcgacgacat ctgcccagcg 1307341 gttgccgtct gcggtcgagg gtaccaggcg ccgtgggctt gcttctctca aactggttat 1307401 cgggcgacac tgcgcggcca taccaatctg caggtcagca gcgatgaaac aacgttgttt 1307461 acagcccgag aaatgagttt atagcctggc cgcaagttcg gtgccttgct tgatggcgcg 1307521 cttggcgtcc aactcggcgg caaccgccgc gccaccgctg atgtgcgggt taatgccgtg 1307581 ccggcgcagt tcactctcca gatctcgcac cggttcctgg ccggcgcaga ccactacgtt 1307641 gtccaccgcc agcagctggg gccgcctgcg cttcgggccg aagctgatgt gtaggccgtc 1307701 gtcgttgatc tgttcgtagt tcaccccaga cagctgatga acgcccttgg ccttcaacga 1307761 cgcccggtgg acccatccgg tggtcttgcc gagccgcttg ccctgcgggc ctttggtgcg 1307821 ctgcagtagg tacacctcac gggcgggcgg cgccggcagt ggagtcgtca acgctccgcg 1307881 ggcttctcgc ggatcagcga ccccccattc ggccttccac tctttgaggt tgagggtggg 1307941 tgaggagtcg gtgaccagca gttcggtgac gtcgaagcca atgccgccgg cgccgacgac 1308001 agccacggtt cgcccgaccg gtctgacacc ggtgatggct tcggcgtagg ttaacaccat 1308061 ggggtggtcg atgccgggga tggccggaat gcgcggtgcc acgccggtgg ccaagacgac 1308121 ctcgtcgtag ccggtcaact cctgggcggc cacccgagtg cccagtcgca cctcgacacc 1308181 gtgtttggcc agaatcgtcg agaaataccg gatggtttcg ctgaattcct ctttgccggg 1308241 aatgcggcgg gccatgtcaa actgtccacc gataaagtcg ttggcctcga acagcgtgac 1308301 ccggtgaccc cgttgcgcgg cgttggccgc cgtggccagc ccggctggtc cagccccgac 1308361 gacggccacc gagcgggcgc gccgggtcgg ggacagcacc aactgcgtct cgcgcccggc 1308421 gcgtggattg agcagacacg acaccgtttt cctggcaaat gcgtggtcca ggcaggcttg 1308481 attgcaggag atgcaggtgt tgatttcgtc gacccgattg gactgcgcct tgagcaccca 1308541 gtccgggtcg ctcagcatcg gccgggccat tgatatcagc cgcacctggg tttcggccag 1308601 aatccgttcc gcggcctgcg gcatgttgat ccggttggac gccaccaccg ggatagtgac 1308661 gtgttcggcg acggcgctgc tgatgtcgac aaacgcgccg cccggcactg aggtgacgat 1308721 agtgggcacc cgggcctcgt gccagccgaa gccggagttg atgatggttg cgcctgcccc 1308781 ttccacttcg gttgccagcg cgacgatttc atcccaactc tggccttctg caacgtagtc 1308841 ggccattgac agccggtaac agatgatgaa gtcggatccg acggcggcgc ggctgcgtcg 1308901 gatgatctcg accgggaacc ggcgacggtt ggccggtgtg ccgccccacg agtcggtgcg 1308961 cttgttggtg cgcggcgcca ggaactgatt gagcagatac ccttcgctgc ccatgatttc 1309021 gacgccgtcg tagccggcat cgcgggccaa ctgcgcgcag cgggcgaaat ccgcgatggt 1309081 cgcttcgacc ccgcgagccg atagtgctcg cggacgaaac ggggtgatcg gcgccttgat 1309141 cggcgaggcg ctgaccgcaa gtgggtggta ggcgtagcgt ccggcgtgca ggatttgcag 1309201 caggatcttt gcacccgaat cgtggaccgc cctgttgatt cggcggtgcc gtcgggcttg 1309261 cgccgaagtg acgagttcgg aggcgaacgg cagcagccat ccggtgcggt tgggcgcgta 1309321 gccaccggtg atgatcagcc cgacgccgcc gcgtgcacgt tcggcgaagt agtcggcgag 1309381 ccgatcgata tggcgggccc ggtcttccag tccggtgtgc atcgaaccca taaccacccg 1309441 gttgcgcagc gtggtaaacc caaggtccaa cggggacagc agatttgggt atggatttgt 1309501 catcgcttct cctggagcgc ttcagctact tcgtcgagcc aatcgatggc actttcttcg 1309561 gctcggattc cgccgcgcag cacgaggtat tgatgcagtg cggcgccatc gagcgccgac 1309621 ggatctgcga aggtgcgctt ctcgataccg cgataggtgt ccagtgactt gacacgctcg 1309681 gcgcgcagcg cggtgacttg ggtatacagc gcggcaacgt ctccgtagcc ggcgccacgc 1309741 agcttgacgg cgatatcgcg cgtgctgctg tcggtcagcg cactgccgcg gccgggcctg 1309801 gtcgggctga gcggctcggc gatccagcga gccagctcgg cccggccgct gtcggagatc 1309861 gcgtatacct tcttgtcggg ccggccatgc tggagcacgg tcgtcgcgcg cacccagttg 1309921 ttgttctcca tcacccgtaa cgtccgatag atctgctgat gggttgcggt ccagaaatag 1309981 ccgatggagc gatcgaatcg gcgggccaac tcgtagcccg agctggcctg ttcacacagc 1310041 gacaccaaga tcgcgtgggg tagcgccatc cgggcagcat agacggcaag ccggattgct 1310101 atgcaactag gtgcatattg accgtgtacg ccgacgcatg tgccaagtgg tcgacgtgta 1310161 tgtgcaacgt ctagtatcag taaccgaacg cattgcctca gcagggcccg gaggaagcct 1310221 tggcgaggtg gacagcagcc cacacatagc ggtatctgga agacatgttg aggagacgtc 1310281 cgtgacgtac acgatcgccg aaccctgtgt cgacatcaag gacaaggcat gcattgagga 1310341 gtgcccggtc gattgcatct acgagggcgc ccggatgctg tatatccacc ccgacgaatg 1310401 cgtcgactgt ggggcttgcg agccggtctg ccccgttgaa gctatcttct acgaagacga 1310461 tgtgcccgaa cagtggagcc attacaccca gatcaacgcc gatttcttcg ccgagctggg 1310521 atcgccgggc ggtgcggcca aggttggcat gaccgagaac gacccgcaag cggtcaagga 1310581 tctggcgccg cagagcgagg acgcctgagc cggctggggg cagcacccgc tcgcggcgga 1310641 gtgtcggcgt ctctgcccgt cttcccctgg gacaccttgg ccgacgcgaa agcgctggcc 1310701 ggggcccatc cggatggcat cgtcgacctc tccgtcggca ctccggtcga cccggtcgca 1310761 ccgctgatcc aggaggcgct ggcggcggcc agtgccgccc ctggctatcc ggcgaccgcc 1310821 ggcaccgcac ggttacgtga gtctgtggtg gcagcgctgg ctcgccgcta cggcatcacc 1310881 aggctgaccg aggcggccgt gttgccggtt atcggcacca aggaactcat cgcctggttg 1310941 ccgacgttgt tgggcctggg cggtgcggat ctggtcgtcg tgcccgaatt ggcatatccg 1311001 acttatgacg tcggcgcccg cctggccgga acgcgggtgc tgcgtgcgga tgcgctgacc 1311061 cagctgggtc cgcaatcccc ggcactgctc tacctgaact cgccgagcaa cccgaccgga 1311121 cgggtgctgg gtgtcgacca tttgcgcaag gtggtcgagt gggcccgggg cagaggcgtt 1311181 ctcgtggttt ccgacgagtg ctacctggga ttgggctggg acgccgaacc ggtttcggtg 1311241 ctacatccct cggtgtgcga cggcgaccac accgggttgc tggctgtgca ctcactatcg 1311301 aagagctcat cgctcgccgg ctaccgagcg ggtttcgtcg tcggtgacct cgagatcgtt 1311361 gccgagctac tagcggtgcg caaacacgcc gggatgatgg tgccggcgcc ggtacaggcg 1311421 gctatggtgg ccgcgctgga cgacgacgcg cacgaaaggc aacagcggga gcgctacgca 1311481 caacggcgtg ccgcgctgtt gccggcgctg ggctccgcgg gttttgcggt cgactattcg 1311541 gacgccggat tgtatctatg ggccactcgc ggcgagccgt gccgcgacag tgtcgcgtgg 1311601 ctggcgcagc ggggcatcct ggtggcaccg ggtgatttct acggcccggg tggggctcag 1311661 cacgtgcggg tggcgctgac ggccaccgac gagcgggttg cggcggcggt cggacggctc 1311721 acctgttagc gcgaacagac gcaacttgcg gccgggtcac cgccaggtcg tgcgcagctg 1311781 ggttgtcacc gagagcgggt tatcgccgcg gaacagatcg aggatggctt gcccttgtgg 1311841 ggagtctgct ggcagttgtc ggggtgggcc gatgtgcttt cgccatgcct gtgccagatg 1311901 ttgccgccga tccttgtttc gtgcgaacca gcggggcacg gcgtgccagg caaccgtgcc 1311961 gggcagcgat agcccgacga cggcacgaac cgcgaacagc ctccgggcaa cgggccgggc 1312021 gggcggcgtg aggatcttgc gtccgatgag gtagcgcggt tcggcaagcg gcgccagtag 1312081 ctcgtcgagg gccgcggtga accgcaggga ctgctcggtg ggcacgccgt cgagttgaca 1312141 ccggatccag ccttcggggt cggaggccaa ccgtagtgcc gcggatcctc gctgtgcgcc 1312201 gcccgcggcg tacagcgcat ccgcgacgac ggcggccagt tgctcgagcg cgttgggcgc 1312261 gtggtccagg cggcggcttt cggccgccgc agcggttgcc accaggccaa cacccgccgc 1312321 gacgatggcg ccggccgtgc cggcacccgc cagcatgccg agattggcgg aggcaactgc 1312381 ggtggcggtg ctggcgccga ccacggaaac ggcggccacg gcaccccttg ccaggcggac 1312441 cggactgaac tgtcccggca ccggtggggt caatgccgag gcggggatgc ggggtgcggc 1312501 gaccccgagc ggctggcggg agcgcacgcg gatggttgcg acgtcgactc cttcgtaggg 1312561 ctcgccgatt cgccaccagg atctcgcctg ggcgcgttcg gcgacgcgct gcagcgctcg 1312621 cgccgtgatg gcgtgggtat cggtgaccgg aggaccgtac ggcgacagcg acggatcgca 1312681 gtgcgtcaca cccgattcga tgagcccctg cggggttgcc gcgtagtacc cgtcatgttt 1312741 gcgcaccagg cgcaggtagt cggcatcacc gcgcgggtgt tctgtggcga tacagcagac 1312801 cgaccagttg tccgccacct tgtgaccgtc cgaggggtcg ttgcggatgg cgcggccgcg 1312861 catctgagtg atcgctgcct gggtggttgc gctcgtcagg tcgatattga cgttgaccgc 1312921 cgcgcagtcc cacccttcac ctagtagcga acgggtgccg accaggacgc gggcgcggcc 1312981 ggccaggaag tattcggtag ccagcgcgac ccacgtacgt ggcgtgaagc cgccggtgcc 1313041 gcgcatgacc cgcagactag ggtgggcgtc aagcggctcg gcggtgacga gcgcgccgcg 1313101 ctcggcgcag aaggcgatca ggtcatcttc gatcgcggcc gggcaggcga aggtttgacc 1313161 tgttaccaga agggcgtgca gtggggtgcg gcggcggtga tccgacgcgg cgagcatggc 1313221 ggcaaccagc tgggccgaac ccgactgctc gctgacgggt gcgcccttca gcgatgtggg 1313281 aagggcgccg gtcatcgatt cgaaatcgca gagcaccagc gcccgcaacc gcgcccccaa 1313341 gacggcgtcc tcggtgtcga ggatgtgcgc ggtcgcggcg atcttggatt cggacagcgc 1313401 gcacagtctg tctactggcg aggtcgcgac gcgtacgccg cgactggtca gccggtagcc 1313461 caggccgggt agcacccgct tgatcgcggt cagcgcgtgc gcgtcgcgcg gatccgcgct 1313521 ttgttgcagg tgcccgacgc tgaagtcggt caatacgtta acccagtcct gggcatcggg 1313581 cgcaattcgg tgctgctcgc gcaggcgcac gccgtcgggt agtggaatca ggccgtcgta 1313641 ggcgaagcgc aggccgctgc acgcgaggtc gggttcggca cgctcgaacg tcgaccaggc 1313701 gatctgattg ccctcgcgcg tcgctcgatc cacgatccgg gtgtgcagcc acgcggccaa 1313761 cgacatgctg cccacctttt ggtcgatgag ggccagcatg aggtcggcga agcgcgcccg 1313821 gtgggtgccg atccaggcct gctcttcggg cgtcggttgg gtcagataga ccaactcttg 1313881 gtagggagcc aggtcgcctt ccctaaccag agcgggtgtc gggatcacga agtcggcggt 1313941 gccgaacagc tcatcatgca gggtgtgctg ccacgcggtg agctctgtgg ccggggtcgc 1314001 cgttagaccg atcagcgcgg tctgcgctcc gaggaccgac gccaacgcac tgaccagggc 1314061 gccccacgta gctagcagat ggtggcactc atcgagcacc agcgtccacg ggcctagcgt 1314121 cgccgcccgc tcgatcaccg ccctcccgtt ggggtgcagg agatccagca acgcttgctg 1314181 gtcgcggttg cgcaggactt cccgccggac tgtcgaatcg gtttcggcgt cgatgacggc 1314241 aagcgactga tacgtcagga cgttcatcgc cgaggcaagg ccacgctcgg ttccacactt 1314301 cgatgccgac cggtccgacg acggaaaact gttatcccac gcggcggccc actgcgcctg 1314361 caccgccgtg ttgggaacca acaccaaact ccggcgcccc agccggcgcg ctgcttccag 1314421 gccgatcatc gtcttgcccg cacccggcgg cagcaccaga taggcacggt tgtcgccggc 1314481 agcgacgtcg gcgtcgaacg cgtccaacgc ttgctgttgg tatacccgcc agttgccggc 1314541 aaaggcccgc gattccaggt cgcggtgagg atccacaagg attcacccta gccaagcacc 1314601 cacgttgggc gcgagcagac gcaaaaggcc ccgaatccaa cggatttcgg ggccttttgc 1314661 gtctgctcgc gcccgtgcgg ctcgtgcgga tcacacgcgc ggtgcatgct gctgtggctg 1314721 tcgagcagtg ttgctacctt aactttccca ggcctacgac gtctggtagc ggcatggcaa 1314781 cggcctgtga gttggctgga taatgtgttc ttcgtcgtgc tgtggcctgc agattaacaa 1314841 gtcccacaac agttttcccg ttgtatcgga ccttgcagca tgcgatgctt tcgtcttgag 1314901 ccactaccat gaagttagta cgctaaacaa tcctgagccc gaatgtgttg gtaaatgggg 1314961 tttgggagca ttcacccacg gctggtacag ggggactgcg tagtgcgcac cgcaaccgcc 1315021 acatcggtcg ccgttatcgg catggcttgc cggctcccgg gcggcatcga ttccccacaa 1315081 cgcctctggg aagcgctgtt acgcggcgac gatttggtgg gtgagattcc cgctgaccgg 1315141 tgggacgcga acgtgtacta cgaccccgaa cctggtgtcc ctggtcgatc ggtatcgcgt 1315201 tggggcgcct ttctggacga cgtcggcggg tttgactgcg atttcttcgg cctgaccgag 1315261 cgggaggcga ccgcgatcga cccacagcac cgcttgctgc tggaagtgtc gtgggaggct 1315321 atcgagcacg cgggtgtgga cccggcgacg ctcgctgaat cacaaacagg tgtcttcgta 1315381 ggactgacac acggcgacta cgagctgctg tccgcggatt gcggcgccgc ggaaggaccg 1315441 tacggattca ccggcaccag taacagtttc gcgtccgggc gagtggccta cacactcgga 1315501 ctgcatggcc ccgcggtcac ggtggacacc gcgtgctcgt ccgggttgac ggctgtgcat 1315561 caagcctgcc gcagcctgga tgacggtgaa agcgatctcg ctcttgccgg tggtgtggtt 1315621 gtcacgctag aaccgcggaa gtccgtctcg ggttccctgc aaggcatgtt gtcgcctacc 1315681 gggcgttgcc atgccttcga cgaagcagct gatggcttcg tgtccggtga ggggtgcgtg 1315741 gtcctgctgc tgaagcggct accggatgcg gtgcgcgacg gtgatcgtgt gctggcgatc 1315801 gttcgtggca ccgcagccaa ccaggatggc cgcaccgtga atatcgcggc gccgtcggcg 1315861 caggctcaga tcgcggtgta tcagcaagcg ttggctgcag cgggcgtcga agcgtcgacg 1315921 gtggggatgg tcgaagccca cggcaccggc acccccgttg gagatccggt cgaatacgcg 1315981 agcctggccg cggtgtacgg aaccgagggt ccgtgcgcgc tgacgtcggt gaaaacaaac 1316041 ttcggtcacc tgcagtcggc atcggggccc ctggggttga tgaagacaat cctggcgttg 1316101 cggcatgggg ttgtgccgca gaacctgcac ttctgccggc tgcctgatca gctggctgag 1316161 attgacactg aactctttgt gccgcaagcg aatacatcct ggccggacaa caccggacag 1316221 ccacgtcgcg ctgcggtttc ctcgtatgga atgtcgggta ccaacgtgca tgccatcttg 1316281 gagcaagcgc cggtatcaga accagcggct tcgggacctg agctcactcc cgaagccggt 1316341 gggctggcgt tgtttccggt gtcggctacc tcggctgagc aactacacgt cacggccgcc 1316401 cggctggcgg attgggtcga ccagaacggc aacgcgggca gtcgagttag catgcgggac 1316461 ctgggctaca cgctgtcctg ccgccgtgca caccgacccg tccggacggt tgtgacggcg 1316521 agcagttttg acgagctgag cgcggcgctg cgggacgtcg ctggcgatca gattccctat 1316581 cagcccgcag tggggcacga cgaccgcggg ccggtgtggg tgttctccgg gcaaggctct 1316641 cagtggcccg ggatgggcac tgaactgctg gtagccgaac cggtgttcgc cgccaccgtc 1316701 gcggcgatgg agccggtgat cgctagggag tcagggtttt cggtgaccga agcgatgtcg 1316761 gcgccacaga cggtcagcgg tattgaccgg gtgcagccca ccatcttcgc ggtgcaggtc 1316821 gccctggccg cggccctgaa gtcgtatggg gtacgtcctg gtgccatcat cgggcactcg 1316881 ctcggcgagg ctgcggcagc cgtggtcgcc ggagcactgt cgctgcacga cggattgcga 1316941 gtcatctgcc ggcgctcgcg gctgatgtcg cgcatcgccg gtagtggcgc gatggcatcg 1317001 gtggaactgc ccggccaaca agtgttgtca gaacttgcga ttcgtgggat ctccgacgtc 1317061 gtgctctcgg tggttgcctc tccgacctca accgtcgtcg gcggcgccac gcagtcgata 1317121 cgtgacctgg tggcggcctg ggagcagcag gatgtgctgg cacgcgaggt agctgtggac 1317181 gtcgcttcac atacaccgca ggtcgatccc atcctggacg agttgctcga ggtcctggcc 1317241 gaggtcgatc cgacggcgcc ggaaattccg tattactccg caacgttgtg ggatccgcgc 1317301 gagcgaccgt cgttcaccgg cgagtactgg gtggaaaacc tgcggtacac ggtgcgattc 1317361 gcggcggcgg tacaggccgc gctcaaggac gggtaccgag tgttcggcga gctggctccg 1317421 catccgctgc tcacctacgc ggtcgagcag aacgccgcca gtctcgacat gccgatcgca 1317481 acgcttgccg cgatgcggcg cggggaacag ctgccgttcg ggttgcgcgg cttcgtcgcc 1317541 gacgtgcaca acgccggcgc caaggtggac ttctctgtcc agtaccctga tgggcgcttg 1317601 gtggatgcgc cattgccgag ctggacgcac cgcaccctga tgctcagccg tgaggattca 1317661 caccgctcgc acaccggcgc ggtccaggcg gttcatccgc tgcttggggc ccatgtgcac 1317721 ctgttggagg aaccggagcg tcacgtctgg caggccgggg ttggcaccgg ggcgcatccg 1317781 tggctcggtg accatcggat acacaacgtg gctgcgtttc ccggtgcggc ctactgtgag 1317841 atggcattgg ccgcggcgcg caccactctt ggcgagctgt cggaggtgcg cgacatcaag 1317901 ttcgagcaga cgctgttgct ggacgagcag acggtggtct catcggccgc gacgatcgcc 1317961 gcgcctggga tcctacagtt cgcagtcgag agtcatcagg aaggcgagcc cgcacggcgg 1318021 gccagcgcga tgctgcacgc attggaggag atgccgcagc cgcccgggta cgacacgaac 1318081 gctctgaccg ccgcccatga gtccagcatg agcggtgagg aactgcgaaa aatgtttaac 1318141 agcttaggta ttcagtatgg tccggctttt tcaggcctag ttgcggtgca cacggcgcgc 1318201 ggggccgtca ccacagtgct cgccgaggtc gcgctgcctg gagccatccg atctcagcag 1318261 tcggcatatg ccagccaccc ggccctgctt gatgcgtgtt tccagtcggt gcttgttcat 1318321 cccgaggtcc agaaggcgac tgtcggtggt ctgatgctgc ccgtgggcgt gcgtaggctg 1318381 cgcaactatc actcgacgcg cagcgcgcac tactgcctcg cccgggtcac gtcatcgtcg 1318441 cgagccggcg aatgcgaagc cgatctcgac gtgttcgacc aggccggaac ggtacttttg 1318501 accgtcgagg gattacggct ggccgcaggg atttccgaac atgaacgcgc gaaccgggtg 1318561 ttcgacgagc gattgttgac catcgagtgg gagcggggtg agctgcctga ggtgccgcag 1318621 atcgatgcgg gatcctggct gctgctcagt gcgtccgaag ctgatccgct gaccgcgcaa 1318681 ctcgccgacg cgttgaatgc cgttggtgcc cagagcacta gcgtggcttc ggcgtcggat 1318741 gtcgcacaat tgcgttcgct gctcggaggc aggctcaccg gtgttgtcgt ggtgactggc 1318801 ccgccaacgg gtggtttgac acagtgcggc cgcgactatg tgtcacagct ggtgggtatt 1318861 gcccgcgagc tcgcggagct gcccggtgag ccgccgcggc tgttcgtggt gaccaggagc 1318921 gcggcgagcg tgctgccgag cgatcttgcc aacttggaac aggcgggatt gcgtggactg 1318981 atgcgggtga tcgattccga gcatccgcac ctgggtgcca ccgcaatcga cgtcgacaac 1319041 gacgagaccg tcgctgccct ggtggccagc caactacaga gcgggtcgca ggaggacgaa 1319101 accgcttggc gcaatggcat ttggtacacc gcccggctgc gtcccggtcc gttacgcccg 1319161 gccgaacggc gaaccgccgt cgtcgaatac agacgcgacg gtatgcgcct gcagatccgc 1319221 actcccggcg acctcgagtc gttggagttc gtcacattcg accgggtcgc gccgggaccg 1319281 ggcgagatcg aggtcgcggt gaccgcatcg agtgtcaact tcgccgacgt tctggtcgct 1319341 ttcgggcggt atcccacctt cgagggctac cgacagcagt tgggcatcga cttcgccggt 1319401 gtggtgaccg cggtcgggcc ggatgtcacc gagcatcgga tcggtgatca cgtcggcggc 1319461 atgtccgcca atggctgctg gagcacattc gtcagatgcg atgcccggct ggcggtgacg 1319521 ctcccgcccg agctgccggt ggccgccgcc gccgcggtac cgaccgcctc cgcgacggct 1319581 tggtacgccc tgcacgatct ggctcgcatc tgctcggacg acaaggtgct gattcactcg 1319641 gggaccggtg gtgtcgggca ggcggcgatc gcgatcgcac gggccgccgg atgcgagatc 1319701 ttcgccaccg cgggcagtgc ccagcggcga caactgctgc acgacatggg tgtcgagcat 1319761 gtctacgact cacggagcac cgagttcgcc gagcagatcc gaggcgacac cgatgggtat 1319821 ggtgtcgacg tcgtactcaa ctcgctgccc ggcgccgcac aacgtgctgg gatcgaattg 1319881 ctggcctttg gcgggcgatt cgtggagatc ggcaaacgtg acatctacgg cgacactcgg 1319941 ctcgggttgt tcccgttccg ccgcaacctg tcgctgtatg ccgtcgactt ggcgctgctg 1320001 acacacagcc acccgcacac cgtccggcgc ctgctgaaaa ccgtctacca acacacggtc 1320061 gagggcacgc tgccggtgcc gcagaccacg cactatccca ttcacgacgc tgccgttgcc 1320121 attcgtttgg tcggcggagc cgggcacacc ggaaaagtgg tgctcgatgt gccgcgtacc 1320181 ggtgaaggcg tggccgtggt gccccccgaa caggtccgca cgtcccggcc cgacggcgcc 1320241 tatctcgtca ccggtggttt gggcggcctc ggcctgttcc ttgccggcga gctggcggcg 1320301 gcgggctgcg gacgcatcgt gctcaactcc cgttcgacgc ccagcccgca cgccaccagg 1320361 gtcatcgagc ggctccgcgc cgccggtgct gatatccagg tggaatgcgg tgacatcgct 1320421 gatgccgcaa cggcccaccg agtggtggcg gtggccaccg cctcgggctt gccggtgcgc 1320481 ggcgtgctgc acgcggcggc ggtggtcgag gacgctacgt tggccaatgt caccgacgaa 1320541 cttatcgacc gctgttgggc gccgaaggta cacggcgcgt ggaacattca tcgggccacc 1320601 gccgcgcagc cactggagtg gttctgcttg ttctcctcgg ccgcggcctt ggtgggctcg 1320661 ccgggtcaag gcgcatatgc ggcggccaac agctggttgg acgcttttgc ccactggcgg 1320721 cgggcgcagg gccttccggc tacctcaatc gcctggggag catgggccga gattggccgc 1320781 gctaccgcgc tggccgaagg caccggcgca gcgatcgcgc ccgccgaggg tgctcgagcc 1320841 ttccagacgc tgcttcgcta cggccgggcg tactccggct atgccccgat catgggtacc 1320901 ccatggttga cggcctttgc gcaacgtagc cgatttgccg aagcgttcca cgccacgggc 1320961 caaaatcaac cggccaccgg gaaattcctc gccgaactgg gcagcttgcc ccgcgaagag 1321021 tggccccgca cagtcaggcg gttggtatcg gaccagatca gcctgctgct gcggcgaacc 1321081 attgatccgg accggccgct gtccgactat ggtttggatt ccttgggcaa cttggagttg 1321141 cggacccgca tcgaaaccga aacgggtata cgcgtcagtc ccacaaagat caccacggtt 1321201 cgcggcttgg ccgagcacgt gtgcgacgag ctggcagccg cccaatctgc gccggtctga 1321261 tgacggcccg ggtgaagtcg ttgcggaagt ttgagatcga gccgaggagg gcatgttgcg 1321321 ggttggaccg ttgacaatag gcacgctgga cgactgggcg ccgagcacgg gttcgactgt 1321381 gtcatggcga ccttcggctg tcgcgcacac gaaagcgtcg caggcgccga tcagcgatgt 1321441 tccggtcagt tatatgcagg cgcaacatat tcggggctat tgcgagcaaa aggcaaaggg 1321501 actcgactac tcgcggttga tggtcgtcag ctgccagcag cccggccagt gcgatatccg 1321561 ggcggccaac tacgtgatca acgcccatct ccgacggcac gatacctatc gcagctggtt 1321621 ccaatacaac ggcaacggac aaataatccg gcgtacgatc caggatcccg ccgacatcga 1321681 gttcgtacca gttcatcatg gtgagctcac gctgccgcaa attcgcgaga tcgtgcagaa 1321741 cacgccggat cccctgcaat ggggttgttt tcggtttggg atcgtgcaag gctgcgacca 1321801 tttcacattc tttgcaagtg tggatcatgt gcatgtggac gcgatgatcg tcggtgtcac 1321861 gctcatggag ttccacctga tgtacgcagc gctggtgggc ggccatgccc ctctcgagct 1321921 accgccggca ggcagctacg acgacttctg ccgccgacaa cacacgttca gctccaccct 1321981 cacggtggag tcgccccagg ttcgcgcctg gacgaagttc gccgaaggta ctaacggtag 1322041 ctttcctgat tttccactcc cacttggtga cccatcgaaa cccagtgacg cggatattgt 1322101 caccgtgatg atgctcgatg aagagcagac ggctcaattc gagtccgtct gcacggctgc 1322161 cggcgctcgg ttcatcggtg gcgtactagc ctgctgcggc ctggctgaac acgagttgac 1322221 cggtacgaca acctattacg gactaacgcc gcgcgacacg cgccgcactc cagcggatgc 1322281 catgacccaa ggttggttca ccggcctaat tccgatcacc gtccccatcg ccggctcggc 1322341 gttcggcgat gccgcccgag ccgcgcagac ctcgttcgac tcgggcgtga agctcgccga 1322401 agtaccctac gaccgcgtcg tcgaattgtc gtccacgcta accatgccac gaccgaactt 1322461 tcccgtcgtc aacttcctcg acgcaggcgc ggctccgctt tcggtactgc tcaccgcgga 1322521 gttaaccggt acgaacatag gagtgtacag cgacggtcgc tactcttatc aactgtccat 1322581 ctacgtcatc cgcgtcgagc aggggacggc agtggcggtc atgttccccg acaacccgat 1322641 cgcccgggaa tcggttgccc gctacctggc aacgctgaag tctgtgttcc aacgagtcgc 1322701 cgagagcggg cagcagcaga atgttgcctg attcattccc ggtggtgaac ccatcttcgc 1322761 gcggctaggt gaactcgtcg cccggcggcc ttgggttgtg gtcggctgtt gggtcgcgct 1322821 cgccctggta ctgccgatgg cggtgccttc actggcggag atggctcagc gacatcccgt 1322881 cgcggtcctg cctgccgacg cgccctccag cgtcgctgtt cgccagatgg ccgaggcgtt 1322941 ccacgaatcc ggctccgaga atatcttggt agtgctgctc accgacgaga aaggcttggg 1323001 agcggcggac gaaaacgtct accacacatt ggtggatcgt ctgcgaaacg acgctaaaga 1323061 cgtcgtgatg ctgcaggact tcctgactac tccgccattg cgtgaggtgc tcggtagtaa 1323121 agatggcaag gcatggattc tgccgatcgg tctcgcgggc gacctgggta cacccaagtc 1323181 ctaccacgct tacaccgacg tcgaacgcat cgtgaaacga actgtggccg gaaccacgtt 1323241 gacggcaaac gtgacaggac ccgcagccac ggtggcagac ctgaccgacg ctggggctcg 1323301 ggatcgggct tcaatcgagc tggcgatcgc cgtgatgttg ctagtcatct tgatggtcat 1323361 ctatcgcaac ccggttacca tgctgttgcc cctggtgacg attggcgcat ccttgatgac 1323421 cgcgcaggcg ttggttgccg gcgtgtcgct cgtcggcggt ctagccgtat ccaatcaagc 1323481 gatcgtgttg ctcagcgcaa tgatcgctgg tgcgggaacg gattacgccg ttttcctaat 1323541 cagccgctat cacgagtatg tgcggctcgg tgagcatccc gagcgtgccg tccagcgggc 1323601 gatgatgtcc gtcgggaagg tgatcgccgc gtccgcggca acggtcggaa tcaccttcct 1323661 cggcatgaga ttcgccaaac tcggtgtgtt ctcaacggtt ggcccggctc tggcgatcgg 1323721 gatcgcggtg tcgttcttgg ccgcggtcac cctgctgccc gccatcctgg tgctggcctc 1323781 accgcgcggg tgggtcgcac cgcgcggtga acgcatggcg acattctggc ggcgggccgg 1323841 aacgcgaata gtgcggcggc ccaaagctta tctaggcgcc agcttgattg gtctggttgc 1323901 attggccagc tgcgcgagcc tggctcactt caactacgac gaccgcaaac aattgccgcc 1323961 ttcggatccg agttcggttg ggtacgcggc aatggagcac catttctcgg tgaatcagac 1324021 tattcctgag tacttgatca tccactctgc acacgacctg cgaaccccgc gcggccttgc 1324081 cgacctggag cagctggcgc aacgtgtgag ccagatccca ggcgttgcca tggttcgcgg 1324141 tgtgacccgg ccaaacgggg aaacccttga acaggcccgg gcgacatacc aagccggcca 1324201 agttggcaac cggctgggcg gcgcgtcgcg aatgatcgat gagcgcaccg gcgacctgaa 1324261 tcggctggca tcgggtgcca acctgttggc cgacaatctc ggtgacgttc gcggtcaagt 1324321 cagccgggcc gttgcgggtg tccgcagcct tgtcgacgcc ctcgcttaca tccagaacca 1324381 gttcggtggc aacaaaacat tcaacgaaat cgacaacgct gcaaggcttg tcagcaatat 1324441 ccacgcgctc ggtgacgctc tgcaggtaaa ctttgacggt atcgccaaca gtttcgattg 1324501 gcttgactct gttgtcgccg ctttggatac cagcccggtc tgtgacagca accctatgtg 1324561 tggcaacgcg cgcgttcagt ttcacaagct gcaaaccgca cgtgacaatg gcactctcga 1324621 caaggttgtc ggcctggcgc gtcagctgca gtccacgcgg tcaccgcaga ccgtgtcggc 1324681 ggtggtgaac gatctggggc gatcgctgaa ttcggtagtc cgctcgctga aatcactggg 1324741 gttggacaat ccggacgccg cccgggcgcg cctgatcagc atgcaaaatg gagctaacga 1324801 cctcgccagc gccggtcgtc aggtcgcaga cggcgtccag atgctggtcg accagaccaa 1324861 gaacatgggc atcgggctga accaggcgtc agcctttctg atggcgatgg gcaacgatgc 1324921 gtcgcaaccg tcgatggcgg gtttcaatgt cccgccgcaa gtgctgaagt ccgaggagtt 1324981 caaaaaagtc gcccaggcgt tcatctcgcc agacgggcat accgtgcggt acttcattca 1325041 gaccgacctc aacccgttca gcactgcggc catggatcag gtcaacacga tcattgacac 1325101 agccaaaggt gcacagccaa atacctccct ggctgacgcg tcgatatcaa tgtcgggtta 1325161 cccggtcatg ctgagggaca tccgcgatta ctacgagcgc gatatgcggc tcatcgtcgc 1325221 tgtgaccgtc gtcgtggtga tcctgatcct catggcactg ctgcgtgcga tagtggcgcc 1325281 gctgtacctg gtcggttcgg tggtcatctc gtacatgtcg gcgatcgggc ttggtgtggt 1325341 ggtgttccag gtgttcctgg ggcaggaatt gcactggagt gtgcccggcc tagcgtttgt 1325401 ggtgctggtc gccgtgggtg cggactacaa catgctgctg gcgtcgcggt tgcgggacga 1325461 gtcggcattg ggagtgcgtt ccagcgtgat tcgcacggtg cgttgcacgg gcggagtgat 1325521 cacggcagcg ggtctgatat ttgccgcttc gatgtccggc ctgctgttct ccagcatcgg 1325581 aaccgtcgtc caaggcggct tcatcattgg ggtcgggatc ctgatagaca cgttcgtggt 1325641 gcggaccatc accgtgcctg ccatggccac gctgctcgga cgcgcaagtt ggtggcccgg 1325701 acacccttgg cagcggtgcg cacccgaaga aggccagatg tcagcccgga tgtcagcgcg 1325761 cacgaagacg gtatttcaag ccgtggcaga cggatcaaag cggtagtgtt tagccgccga 1325821 aggcggggga gcccagtaag ccacgggcac cttccacgat cgagcccgga gcggtcagcg 1325881 gatccaggcc tcgcaccgga tccacggaga ccggccgggt gaaccaattg tcgttgcgtg 1325941 cataggccgc gtcgatctgt ggttgcagca cactgtcgat ttgatcgact tcggcgtcgg 1326001 acatgccgag gtaacgcagc ggcaaggtca acgggaggtg gttcacgggg accagatacg 1326061 tcgtcgtggt agcgcctcgt gagttgacgg tggtcctgat gttctgcggg ggtacgtcac 1326121 cgggtccggt gaacccgatt ggggtgtgcg cgatggcagc gccgatggcc gcattggcga 1326181 ccgctaacag attgtccggc cggtccggga agtcgctgaa gccgtcgtat gcggtgacga 1326241 catggttggt gtcgtactgg ctatccacct gctggggcat cgtatattcg atgaagggaa 1326301 tcggaatgtg gctaccgggg ggaaaaattc gggccaggaa gctcgctccg aacgcatgac 1326361 gtccggtggg gtcgccgaac gtcgtgaact gcagcttgtc cggtgcaggt gccgtcgggt 1326421 cgttggcgag ccgcgcctgc tcctggtcga gcacgaggga accctgggat aggccgacgg 1326481 ccgcggctgg atcggttccg tgatgaattg cgttatcaag gctgtttgtc ccatctttga 1326541 ccgccacgcc caccgtcatg ttgtcttggt ggctccctgg cggcaaaagc atggtgggcc 1326601 accagctgaa ggccgctccg gcgggatagt cgatgagatc gtgctttgcg tttgggaaat 1326661 attgagagcc agcctggttc gtgtactcgt accagggaat gcccggcatt cgcgcgcccc 1326721 cgagggcgta gacgactttg gcggttgaag cgtcgcccac cggagagggg gacggaggtg 1326781 gcccaggagc ccacgggtac gcgggttcgc ttgccgcaat agcggttccg aatccaccgg 1326841 cccaacccac gagccagacc gcgaatgctc ccgcaatcac tcgcttcatc tgcctctgca 1326901 tcgagaatcg cgtgcgtgaa agcataggaa agcagctatc gttcggcggt tttcgggcgg 1326961 ttatgtcgcc atatcttagt cagccacgtc ccggccgaca ttaaagttgg cagccaacaa 1327021 gctgtgaatc gccctgggtc agccccgact agctcagccg tccaaccggg tgaattgctg 1327081 cagccggtat tgctctacac aggcggccct tctgatcttg ccgctggttg tggtggggat 1327141 cgacccgggc gggaccaaga cgaggtccgc cacgttgaga ccgtgcgagc gtgatatcgc 1327201 ggctgtgacg ttgttcttga tgacatcgag ttcgtccatc gcttcgccgg cggaatcgcc 1327261 gaggagcttg agctcgatga cagtgactaa cttctctgtg tgatcgaccg gaactgaaat 1327321 cgcagcgacc cgaccaccag tgatctcctg gacggtcgac tcgatgtcct cggggtagtg 1327381 attgcgcccg tatacgatca gcatgtcctt catacggccc acgatgaaca tctcgtcctc 1327441 ggagaggaat ccgaggtctc ccgttcgcaa ccaggatcca tcaggagtac ctgccgaggg 1327501 gtggaccagc attgcgccaa aggtgtgccg tgtctcgtcc ggtttgttcc agtagccttc 1327561 ggcgacgttg tcgcccttca cccagatctc gccgatcgtt cccgcggggc actcaatgca 1327621 ggtgtcggga tccacaattc gcactgttgg tgatgtcggc atgccatagc tcagcagcgg 1327681 tgtgccggtc ttgggttcac atcgattcgc actgcccgtg gacagcttgt caggttcgaa 1327741 gtagacgact tctggcttgt cacccgaatt gcggctggcc acataaagag tcgcttccgc 1327801 cagaccgtac gaaggccgta tcatgtcttc gcggaaattg tacggtgcaa accggttgca 1327861 gaatctactg agcgtgttgg ggtggactcg ttcagcacca ctggtgatgc ccaggacgtt 1327921 gccgaggtcg aggccttcta tgtcggcatc tgttgtcttg cggacggcca attcgaaggc 1327981 gaaattcggt gcggccgacc acgaaggact tccgttggcc agcgaatgta gccaacgcgc 1328041 tggccgttgc aggaacgcca gcgggctagt gagttcactg cggtagccgc ccaggatcgg 1328101 tgcgatgatg ccaaggacca agcccatgtc gtggtagaac ggcagccacg acacgatggt 1328161 agtgtcaggt ggcgccacac cgttgcggtc gccgaagtag ttcgacatca gctgttggaa 1328221 attcgcctga aggttccgat gcgagatcat gaccccagcc ggagcgcggg tggagccaga 1328281 ggtgtactgc aagtacgcgg cgcttggcag atccttcacc cgaaagctcg gtgaattccc 1328341 ggtcaagtcc aatgaatcga tttcgatgat cggccctacg ttgttcgtgt tcggccggtg 1328401 gatgtgctcg gcaaccgctt ctgcgaccgc agatgttgtc aggatgaccg aaggtgacgc 1328461 gtcggcaagc accgcgctga cacgttcgtc gtgagagccg atctgcggga ctgacaacgg 1328521 aaccgctatc gctccggcct gcatcgaacc taggaaagcc gcgatgtagg ccaggccctg 1328581 cggagccaga atcacggctc ggtctccggt cgtgcaatgc cgcctgactt cgtgagcaac 1328641 gatgcgggtc cgtcgaaaca cctctgacca cgtgagcgtc tcggtgatgc cggcccaatc 1328701 ctgttcgtag tcgatgtacg tgaacgcggc gtcgtcgggc tgcaggccgg cacgctcgcg 1328761 cagcaaggac aagacagaag agtcggacat tggtgctaca ttaccgtttc gcgcgatctc 1328821 cgataaccca agcgggcagg gggatggttg gcgatagcga tgctgatcat aacgttctgc 1328881 aatgctgtgc atgtgctgaa acaggttgac gcagagtcga agtcggtgta cgcaggggcg 1328941 ccgtgagggg cgtcacggtc gagttgctaa gccgtgcgtt ccatggcccg cagccccagc 1329001 gaaaagagca gccgcacatc cggatcgccc agcgaggtcg acaacagctg ctcgatccgc 1329061 cttatccggt agcgaacggt gttgggatgc acttgcagtg accgtgcggc ggcgccgatg 1329121 tcgccgaagg catccaggta ggcacgcagg gtctgagcca gcaccggatc ctgggcgccc 1329181 aggtcacgta tccgaggatc gacgagccgc tggtcggtgc cgaccagggt gacgatttcg 1329241 tcgagcagaa cggtggtgcg tgcctcggcc agcgatgtca cctgccccaa gatcgggtgg 1329301 cgctcggcac tctcgagtac ccgatccacc tcgacgcgtg ccgggttgac ttcggcaagt 1329361 cccgcgaccg gccccgcgat ggctgcccgt agtgctactc ccagctcggc gcgcagtgcg 1329421 ctgattgtgc cgcggaccca cgaggtgaca gctcggccgg tcgtggtttg gggcagcagc 1329481 acatagatcc gtgagccgtt ggcggcaacc tgagcgtcgt ggcgaaaagc gctggcgctc 1329541 aatgccatga cgtcaacaag ccgaacatgg cggactgcgg tatcgcggtt ttccgcggtg 1329601 tcgaaaccga tcagcgttgc gttgccctcg gcggcgacgc cgagttcacg ggcgatggtc 1329661 gatacgtcga cgggtgctgt ggttgcgttc agctcggcca ggcccagtag ttgctgtacc 1329721 cgcagcgcgt gcgtattggg ctgggtcgcc agtcgcgaca tgatccgggc ggccagcacc 1329781 gcagcacccc gcaacatctc ctcggcatcg tcggccaacg gctgcgagcc ttgctggacc 1329841 cagatcgtgc cggcgaacac cggtggccgc agcgcaccga cccccggctg atgaatcccg 1329901 atggctagcc gaggacgcaa ccccagctcg gggcgctcgg ccacccgcac cacctcacgg 1329961 ccggcccgca gggcatcgaa gatgccccat tgacctatcc actgcagatg ctcgggcggg 1330021 ccggcgcggc ccaggatgga cagccgacgc agctcgtcgg cctcgtcgtt ggaggccgag 1330081 taggcgagca cgtgcgactg ggcgtcctcg atgctgatca tgccgtggat gcggtcggcc 1330141 agggactgtg ccaacccgaa caggtcggtt ccggaatcgt cggtggggtc ggcccggtca 1330201 ccatgatgct ccaagacatg attcaccaag tggtacagcc gttcccagcg ggcccgcggc 1330261 tccacggcta ccaccgccga gccggcgcgg acggccccgg ccaccaccga gtccgacggg 1330321 tgcttgacga agatcgccac cggcgcccgt tggcgtgcct gatcgtcgac ccagcgcacc 1330381 gcctcgtcgt cggtgacccc gatcaggaag aacacatcgg ccgagcccgc cgcggccgcc 1330441 aggcccagcc gcacgtcgtc ggaatcgatc agcgccgtcg acgccaccgg caggtccagg 1330501 ccgcgcgggg cgtccaccag gctgaccacg gtcgcatcca gcgccaggag caactggccg 1330561 agccccacgc cggcgatccg catgttgtcc gatcctacta gcaagtccgc cagatcttgt 1330621 ctgatcggcc aaacatttgc gatgcctggg cggggatgct ggcaggcatg gacgcgatca 1330681 cccaggtgcc ggttccggcc aacgagccgg tgcacgacta tgcgccgaaa tccccggaac 1330741 ggacccggct gcgcaccgaa ctggcctccc tggccgatca ccccatcgac ctgccgcacg 1330801 tcatcggcgg ccgacaccgg atgggcgacg gcgagcgaat cgacgtcgtg cagccgcacc 1330861 ggcacgccgc caggctgggc accctgacca acgccaccca cgccgacgcc gcggccgccg 1330921 tcgaagccgc catgtctgcc aaaagtgact gggcggcact gccgttcgat gaacgtgccg 1330981 cggtgttcct gcgcgccgcc gatctgttgg ccgggccgtg gcgggaaaag atcgccgccg 1331041 caaccatgct cggccaatcc aagtcggtgt accaggccga gatcgacgcg gtctgcgagc 1331101 tgatcgactt ctggcggttc aacgtcgctt tcgcccgaca gattttggag cagcagccga 1331161 tcagtggccc gggggaatgg aaccggatcg actaccgccc gctggacggt ttcgtctacg 1331221 cgatcacgcc gttcaacttc acctcgatcg ccggcaatct gccgaccgcc ccggctctga 1331281 tgggcaacac cgtgatctgg aagccgtcga tcacccagac gctggcggcc tatctgacca 1331341 tgcaactgct cgaggccgcc gggttgccgc ccggggtgat caacctggtc actggcgacg 1331401 gattcgcggt ttccgatgtg gcactggccg atccacggct ggccggcatc cacttcaccg 1331461 ggtcgacggc taccttcggc cacctatggc agtgggtggg taccaatatc ggccgctacc 1331521 atagctatcc gcgactggtc ggcgagaccg ggggcaagga cttcgtggtg gcgcacgcct 1331581 cggcccgccc ggatgtgctg cgcacggccc tgattcgcgg agcattcgat taccagggcc 1331641 agaagtgctc ggcggtgtcg cgagcgttta tcgcgcattc ggtgtggcag cggatgggcg 1331701 atgagttgct ggccaaagcc gccgagctgc gctacggtga catcaccgac ctgtccaact 1331761 acggtggtgc gctgatcgac cagcgcgcct tcgtcaagaa cgtcgacgcc atcgaacggg 1331821 ccaaaggcgc ggccgcggtc accgtcgccg tcggcggcga atacgacgac agcgaaggct 1331881 atttcgtgcg ccccacggtg ttgctctccg acgacccgac cgacgagtcg tttgtcatcg 1331941 agtacttcgg tccgctgctg tcggtgcatg tctaccccga cgagcgctac gagcagatcc 1332001 tcgacgtcat cgacaccgga tcccgctacg cgctgaccgg cgcggtcatc gccgacgacc 1332061 ggcaggccgt gctgaccgcg ctggatcggc tgcggttcgc ggcggggaac ttctatgtca 1332121 acgacaagcc gacgggggcg gtggtggggc gtcagccgtt cggcggtgca cgcggatcgg 1332181 gcaccaacga caaggccggt tcgccgttga acctgctgcg gtggacgtcg gcgcgcagca 1332241 tcaaggagac gttcgtcgcg gccaccgacc acatctaccc gcacatggcg gtcgactgat 1332301 ggccggctgg ttcgcgcaca cgctgcgccc ggcaatgctt gccgccggcc gctcggatcg 1332361 gctgggccgc atcgtcgagc gctcgccgct cacccgcggg gtggtgcgcc ggttcgtgcc 1332421 cggcgacacg ctcgacgacg tggtggatat cgttaccgcg ctgcgggatt cgggccgcta 1332481 cctcagcatc gactacctgg gcgagaacgt caccgatgcc gacgacgctg ccgccgccgt 1332541 gcgggcgtac ctggggctct tggacgtgct gggccgccgc ggcgatatcg catgcgacgg 1332601 ggtgcgaccg ctcgaggtgt cgctcaagct gtcggcgctc gggcaggccc tcgatcgcga 1332661 cggccagaag atcgcgctgg acaacgcccg cgccatctgt gagcgggccg agcgggtggg 1332721 cgcctgggtc acggtggacg ccgaagacca caccaccacc gattccacat tgtcgatatc 1332781 gggcgatttg cgcgtcgact ttccttggct gggcacggtt gtgcaggcct atctgcggcg 1332841 cacgctggcc gattgcgcgg agttggcggc cgtgggcgcc cgagtccggt tgtgcaaggg 1332901 cgcctatgac gaacccgcat cggtggccta ccgagacgcc gcgcaggtca ccgactccta 1332961 tctgcggtgc cttcgggtat tgacggcggg gcgaggctat ccgatggtgg ccacccacga 1333021 cccggtgatc atcgcggcgg taccggggat cacgcgcgaa tcagggcgta gtcaaggtga 1333081 tttcgaatac cagatgctct acggcgtccg cgacgacgaa caacgacgac tgaccggcgc 1333141 cggtaaccac gtgcgggtgt atgtgccctt cggcacccgg tggtacgggt atttcctgcg 1333201 gcggctggcc gaacgcccgg ccaacctggc gttcttcctg cgggcgctga ccgaccgccg 1333261 acgcgcgcgg gggtgcgccg agcgctgaaa tcgccggttg ctgtcacatt cggcggggct 1333321 gtctcgtcct tgatgttatg aattccagca tgggtcggcg ggaggacaca tgtcgcaaca 1333381 cgacccggta agtgcggcct ggcgggcgca tcgggcctac ctggtggacc tcgcgtttcg 1333441 tatggtaggt gacatcggcg tggccgaaga catggtgcaa gaggcatttt cccgcttgct 1333501 gcgggctccg gtcggcgaca tcgacgacga gcgtggctgg ctgatcgtgg tcaccagccg 1333561 gctgtgcctg gatcacatca agtcggcgtc gacacgccgg gagcgcccgc aggacatcgc 1333621 cgcatggcac gacggtgacg ccagcgtgtc atcggttgac ccggctgacc gggtgactct 1333681 cgacgacgag gtccggctgg ctttgctgat catgctcgag cgcctcggcc ccgcggagcg 1333741 ggtggtgttc gtgctgcacg agatctttgg gctgccctac cagcaaatcg ccacgacgat 1333801 tggcagccag gcctccacat gccggcagct ggctcatcgg gcccgtcgca agatcaacga 1333861 atcgcgcatt gcggccagcg tggagccagc ccagcatcgc gtcgtcacca gagctttcat 1333921 cgaagcctgc tccaacggag acctggacac cctgctcgag gtgctggatc cgggtgtcgc 1333981 cggcgagatc gacgcccgca aaggcgttgt cgtcgtgggc gcggatcggg ttggcccgac 1334041 catcctgcgc cactggagtc accccgccac cgtcctggta gcccagccgg tgtgcggtca 1334101 accggcggtg ctggcctttg tcaaccgagc gcttgccggc gtgttggccc tgtcgatcga 1334161 ggccggcaag atcacaaaaa tccatgtctt agtgcagcct tcaacattgg acccgttacg 1334221 ggccgaactc ggcggcggtt agttaggtat cggaggtatg accatgaaat cacttgccgc 1334281 gcttgaccgg ccgagctggt tgtcatcgtc ggcgtggccc tggcagccct acctgctgag 1334341 ccaccatcag ggcggcatcg cggttaccga tatcggcgac gggccggcgg tgctgttcgt 1334401 tcacgtcggc agctggagct ttgtctggcg tgacgtgttg ttgcgtctag ccaacgattt 1334461 tcggtgtgtt gccatcgacg caccgggttg tgggctcagc gaccggctct caaccccgcc 1334521 aacacttgcc caggcggccg atgcaatcac ctcggtcatt gatgcgctgc agttacgtga 1334581 cctcaccctg gtagcccacg acctgggcgg cccggccggc ttcctggccg ccgcccgtcg 1334641 cggcgaccgc gtcgcggcac tggccgcggt caactgcttc gcatggcggc ccacgggtcc 1334701 gctgttccgg ggcatgctcg cggcgatggg cagcgccccc gtgcgtgaac tggacgcggc 1334761 catcaatgcg cttgcccgcg cgacgtcgac gcggttcggg gccggtcggc actggagccg 1334821 cgcagaccgc gcggcttttc gggcgggaat cgatgcgccg gcccgcaggg cgtggcatgc 1334881 ctacttccgc gatgcgcgcc gtgcccatgc cctctatacc gacgtcgacg ccgcgttgcg 1334941 ggggggtctg gccgatcggc cactgctgac catcttcggt cagttcaacg atccgctgcg 1335001 gtttcagccg cgctggaaag agttgtttcc gacggcacgc caactgcagg tccgccgggg 1335061 caaccacttt cccatgtgtg acgacccaga cttggtggcc ggggcactca cgtctttcgt 1335121 gcaacggtca acgtgagccg ccgactgccg tcacacctgg tacaccttgc ggtttgccgc 1335181 cgcgccgcca catgccaagc tactcgccat ggccgtcgct attgcccgtc cgaaattgga 1335241 aggaaacatc gccgtcggcg aggaccgccg gatcggcttc gccgagttcg gcgccccgca 1335301 gggtcgtgcg gtcttctggc tgcatggcac cccaggggcc cggcggcaga tcccgaccga 1335361 agcccgggtc tacgccgagc accacaatat tcgtctgatt ggcgtcgatc ggcccggcat 1335421 cggcgcctcg acgccgcatc agtacgaaac catcttggcg ttcgccgacg atctgcggac 1335481 catcgccgac acgctcggca tcgacaagat ggccgtggtg ggcctgtcgg gcgggggccc 1335541 atacaccctg gcgtgcgccg ccgggctgcc cgaccgggtg gtcgccgccg gtgtcctcgg 1335601 cggcgtcgcg ccgacgcgcg gcccggacgc gattagcggc ggtttgatgc gccttggttc 1335661 ggcggtggcg ccgctgctgc aggtgggcgg caccccgctg cggctgggtg cgagcttgct 1335721 gatccgggcg gcccggcccg tcgcgtcccc tgccctcgac ctgtatggcc tgctctcacc 1335781 gcgggccgac cggcatttgc tggctcggcc cgagttcaag gcgatgttcc tcgacgatct 1335841 gctcaacggt agtcgcaagc agctcgctgc gccgttcgcc gatgtcatcg cctttgcccg 1335901 cgactgggga ttccggctgg acgaggtgaa agtccccgtc cgctggtggc acggagacca 1335961 cgaccacatc gtcccgttct cccacgggga acacgtcgta tcccggcttc ccgacgcgaa 1336021 gttgttgcac ttgcccggcg aaagtcatct cgctgggctt ggccgtggtg aagagatttt 1336081 gagcaccctg atgcagattt gggaccgcga cctgcggaaa tgatcgggcg tgtgaccgag 1336141 ctcgcatggg cgggccgcac tgctttgcat cgccatttgt gcctattgac ggccttaata 1336201 tgacatgctg ttgcctgtgt tagagcccgc tgaccgcccc tgtgatgccc ccggatggtt 1336261 tctctacctc accgacatac cgcgcgcggg tgtcgagtac gggcaattgc tcgccgtgct 1336321 gccgctgcag cggatgctgc cggccggcga cggacatccg gtactggtgc tacctggcct 1336381 gctggccggc gacggttcca cctggatcct gcgacggatc ttgcgtcgcc tcgggtacgc 1336441 ggcctacggc tgggggctcg gccgcaacat cgggccgacg gccaaagcgg tatccgggat 1336501 gcgggacctc ctcgacaagc tccactcccg gtaccacacc ccggtgagcc tgattgggtg 1336561 gagcctgggt ggcatcttcg cgcgcggcct cgcccgcgac catccgtcgg cggtgcgcca 1336621 ggtgatcaca ctgggcagcc cgtttggcat gagggacacc tgtgagacgc gctccgcgtg 1336681 gagcttcaac cggtatgcgc atctgcacac cgagcggcac gagttgccgc tggaaatgga 1336741 aagtgaacct ttgccggtgc cgaccaccgc gatctactcg cgctgcgacg gcatggtcgc 1336801 ctggcagacg tgcatgaatt cgccatcgga gcgcgcggaa aacatcgcgg tgcgcagcag 1336861 ccacatcggc tacggccaca atccgccggt ggtgtgggcc atcgccgacc ggctggcaca 1336921 gccccagggt gcatgggcgc cgtttcggcc gccgaaggtg ttgagcccgc tgtttccgcg 1336981 accggataca ccggcagagg cggtcagcac cccccagacg cgaccggcct gacggggcag 1337041 gcgatcacgg cgccggggta gcctcgctca cgtgctgctg gcctccctga atcctgctgt 1337101 cgtctccgcc gccgatatcg cggacgcggt ccgcatcgac ggcgacgtgc tgagccgtag 1337161 cgacctggtc ggcgcggcaa cgtcggtggc cgagcgggtc gccggtgcgc accgggtcgc 1337221 cgtgctggcc acgccgaccg cgtcgacggt gctggcgatc accggctgcc tgatcgccgg 1337281 cgtgccggtt gtgccggtac ccgccgatgt gggcgtcacc gaacgccggc acatgctcac 1337341 cgactccggc gtccaggcat ggctgggccc gttgcccgac gacccagcgg ggctgccaca 1337401 catcccggtg cgcacgcacg cgcggtcctg gcaccgttat ccggagccct cacccggggc 1337461 catcgccatg gtggtctaca cgtccggcac caccgggccg cccaaaggcg tgcagctgag 1337521 ccggcgggcg atcgccgccg acctcgatgc attggcagag gcctggcagt ggacggccga 1337581 ggacgtgctg gtccacggtc tgccgctgta tcacgttcac ggcctggtgc tgggcttgct 1337641 cgggtcgctg cggttcggaa atcgcttcgt gcacaccggt aaaccaacgc cggccggcta 1337701 cgcccaggcc tgttatgaag cgcacggcac gttgtttttt ggggtgccga cggtgtggtc 1337761 acgagtggcg gccgaccaag ctgccgccgg ggcgctcaaa ccggcgcggc tgctggtgtc 1337821 cgggagtgcg gcactacccg tgccggtgtt cgacaagctg gtgcagctca ccgggcaccg 1337881 gcccgtcgaa cgctacggtg cttcggagtc gctgatcacc ctatcgacgc gggctgacgg 1337941 tgagcgtcgc ccgggctggg tcggcctgcc gctggccggt gtgcagaccc gactggtgga 1338001 cgacgatggc ggtgaggtcc cgcacgacgg ggaaaccgtt ggaaagcttc aggttcgcgg 1338061 tccgaccctg ttcgacggct acctgaatca acccgatgcc accgccgcgg cgttcgacgc 1338121 cgacagctgg taccgcaccg gcgacgtcgc ggtggtcgac ggcagtggga tgcaccgcat 1338181 cgtgggacgc gagtcggtcg acttgatcaa gtcgggtgga taccgggtcg gcgccggtga 1338241 aattgaaacg gtgctgctcg ggcatccgga cgtggcggag gcggcagtcg tcggggtgcc 1338301 cgacgatgat ctaggccagc ggatcgttgc ctacgtagtc ggctcagcga atgtcgatgc 1338361 ggacgggctt atcaactttg ttgcccaaca actttcggtg cacaagcgcc cgcgcgaggt 1338421 gcgtatcgta gatgcgctgc cgcgcaacgc gttggggaaa gtgctcaaga agcagttgct 1338481 gtcagaaggc tgagctacgg cgaattatcg tgtaccgctg gacagttacg ctggcacact 1338541 gttactccga cggcccggtg agcttagcgc atgggccttg ttgccgcgcc actgtagggc 1338601 ttccagggcg acggccacat ggacggaggt gtggtcgagc ggtcgcggta gcagccgctg 1338661 agcggactcg agtctgcgca gaaatgtatt gcggtgagtg tggagacgtt ttgcggcccg 1338721 ggaggcgttg cactgctcgt tgatgaaggt cagcagggcc gtttgtagat ctgggctggc 1338781 agactcgagg tctccaagcg tactcgtgat gaattcgctt gcagcatctg gattttggct 1338841 gatcaatgcg accatcttaa cgtcggcaaa gaaggcgacc cgctgggtcg accgtagccg 1338901 tgacaaggtg cgctgggtga tgagcgcttc gaggtggctg cgccggaacc cctccacccc 1338961 gttggcggtg gtcccgatgg cgatgcgcgc cccgggtgcg ttgtccaccg ccgcctgcac 1339021 tgtgtcgatg tcgagtccgt cggcgtcggt cacccacgcc cagcggctcg ccgccccggc 1339081 gaccaccgtc agcggtcgtg tcgatcccac ggcgtggcag aacagatcag ccgcccggtc 1339141 gaggtagctg tggtcaccgt cgagctcgtc gctccagatg atggcagcgg tatgggcacg 1339201 actcagcggg tagcccaatt tcgcttcggc ccgttcgggg ctgatagggg cgccatcgag 1339261 aatcagcccg acgacctcga ggcgttcggc atgggtgctg cgggtcagtt cgtcgtgttc 1339321 cgactgcact tgcgcggcga taccggtcag cgtggcctcg atgaagtcgt tgacggagcg 1339381 ggccgacacg tctagcagct cgcgcagctc ttgggggtcg gaagtgagtt cgaacgcaat 1339441 ccccatccag aaccgccacc cgatgtgctc accggttcga tagatgttga acgctactgt 1339501 gtccagcccc cggcgcacca ggtctcgggc catccgcagt ggctcggtgc cgagattggc 1339561 gggcacccga gcaccagggt cacgcaggtt ggccgcagcc cagtacacca ggttggcgcg 1339621 attggccgtc tggacaacct tcgcaagcac cggatcgttg gcgatcgccg gattggccgc 1339681 aatcgtggca cggtccagtt cctcgatcca ctccgggctg ggattgaggg cgatgcgtgc 1339741 tccctcgcgg atcagctcac gaattcgcgg cgaaggttgt tgccatgcca cgcgccgatc 1339801 ttagggccag cgggtgcaat ttgcacacta tgttggcact attgtgccgg attcacactg 1339861 cacggccggt gtgtgcgcga aatcacggtg tgggtctgct ggatgagtcg accgtgttga 1339921 acaacttgcg acacaccgca atttgcgaaa tccgccaccg accgggcata gtaacccagc 1339981 tagtcgtcgt tgtcgcgtcg aaccacatgg tgaactgtgc ggcgggtgca ttttgcacat 1340041 caagtgggcg ctgattggga agatttaccc ttcggcggcg gcggtaggtg cagattgcac 1340101 tttggctcct gctgattgaa attttttgac ctgttgcggt ccttgcgggc tcgccatcat 1340161 tggcggcagt tcgtcaccga cgaatcgggg ccaaggacgt aggcgaccag ttcgcttgac 1340221 tgctaaccgc tcctgatcgt acccgtgcga gtgctcgggc cgtttgagga tggagtgcac 1340281 gtgtctttcg tgatggcata cccagagatg ttggcggcgg cggctgacac cctgcagagc 1340341 atcggtgcta ccactgtggc tagcaatgcc gctgcggcgg ccccgacgac tggggtggtg 1340401 ccccccgctg ccgatgaggt gtcggcgctg actgcggcgc acttcgccgc acatgcggcg 1340461 atgtatcagt ccgtgagcgc tcgggctgct gcgattcatg accagttcgt ggccaccctt 1340521 gccagcagcg ccagctcgta tgcggccact gaagtcgcca atgcggcggc ggccagctaa 1340581 gccaggaaca gtcggcacga gaaaccacga gaaataggga cacgtaatgg tggatttcgg 1340641 ggcgttacca ccggagatca actccgcgag gatgtacgcc ggcccgggtt cggcctcgct 1340701 ggtggccgcg gctcagatgt gggacagcgt ggcgagtgac ctgttttcgg ccgcgtcggc 1340761 gtttcagtcg gtggtctggg gtctgacggt ggggtcgtgg ataggttcgt cggcgggtct 1340821 gatggtggcg gcggcctcgc cgtatgtggc gtggatgagc gtcaccgcgg ggcaggccga 1340881 gctgaccgcc gcccaggtcc gggttgctgc ggcggcctac gagacggcgt atgggctgac 1340941 ggtgcccccg ccggtgatcg ccgagaaccg tgctgaactg atgattctga tagcgaccaa 1341001 cctcttgggg caaaacaccc cggcgatcgc ggtcaacgag gccgaatacg gcgagatgtg 1341061 ggcccaagac gccgccgcga tgtttggcta cgccgcggcg acggcgacgg cgacggcgac 1341121 gttgctgccg ttcgaggagg cgccggagat gaccagcgcg ggtgggctcc tcgagcaggc 1341181 cgccgcggtc gaggaggcct ccgacaccgc cgcggcgaac cagttgatga acaatgtgcc 1341241 ccaggcgctg caacagctgg cccagcccac gcagggcacc acgccttctt ccaagctggg 1341301 tggcctgtgg aagacggtct cgccgcatct gtcgccgatc agcaacatgg tgtcgatggc 1341361 caacaaccac gtgtcgatga ccaactcggg tgtgtcgatg accaacacct tgagctcgat 1341421 gttgaagggc tttgctccgg cggcggctca ggccgtggaa accgcggcgc aaaacggggt 1341481 ccgggcgatg agctcgctgg gcagctcgct gggttcttcg ggtctgggcg gtggggtggc 1341541 cgccaacttg ggtcgggcgg cctcggtcgg ttcgttgtcg gtgccgcagg cctgggccgc 1341601 ggccaaccag gcagtcaccc cggcggcgcg ggcgctgccg ctgaccagcc tgaccagcgc 1341661 cgcggaaaga gggcccgggc agatgctggg cgggctgccg gtggggcaga tgggcgccag 1341721 ggccggtggt gggctcagtg gtgtgctgcg tgttccgccg cgaccctatg tgatgccgca 1341781 ttctccggcg gccggctagg agagggggcg cagactgtcg ttatttgacc agtgatcggc 1341841 ggtctcggtg tttccgcggc cggctatgac aacagtcaat gtgcatgaca agttacaggt 1341901 attaggtcca ggtttaacaa ggagacaggc aacatggcct cgcgttttat gacggatccg 1341961 cacgcgatgc gggacatggc gggccgtttt gaggtgcacg cccagacggt ggaggacgag 1342021 gctcgccgga tgtgggcgtc cgcgcaaaac atctcgggcg cgggctggag tggcatggcc 1342081 gaggcgacct cgctagacac catgacccag atgaatcagg cgtttcgcaa catcgtgaac 1342141 atgctgcacg gggtgcgtga cgggctggtt cgcgacgcca acaactacga gcagcaagag 1342201 caggcctccc agcagatcct cagcagctaa cgtcagccgc tgcagcacaa tacttttaca 1342261 agcgaaggag aacaggttcg atgaccatca actatcaatt cggggatgtc gacgatcatg 1342321 gcgccatgat ccgcgctcag gccgggttgc tggaggccga gcatcaggcc atcattcgtg 1342381 atgtgttgac cgcgagtgac ttttggggcg gcgccggttc ggcggcctgc caggggttca 1342441 ttacccagtt gggccgtaac ttccaggtga tctacgagca ggctaacgcc cacgggcaga 1342501 aggtgcaggc tgccggcaac aacatggcgc aaaccgacag cgccgtcggc tccagctggg 1342561 cctgacacca ggccaaggcc agggacgtgg tgtacgagtg aaggttcctc gcgtgatcct 1342621 tcgggtggca gtctaggtgg tcagtgctgg ggtgttggtg gtttgctgct tggcgggttc 1342681 ttcggtgctg gtcagtgctg ctcgggctcg ggtgaggacc tcgaggccca ggtagcgccg 1342741 tccttcgatc cattcgtcgt gttgttcggc gaggacggct ccgacgaggc ggatgatcga 1342801 ggcgcggtcg gggaagatgc ccacgacgtc ggttcggcgt cgtacctctc ggttgaggcg 1342861 ttcctggggg ttgttggacc agatttggcg ccagatctgc ttggggaagg cggtgaacgc 1342921 cagcaggtcg gtgcgggcgg tgtcgaggtg ctcggccacc gcggggagtt tgtcggtcag 1342981 agcgtcgagt acccgatcat attgggcaac aactgattcg gcgtcgggct ggtcgtagat 1343041 ggagtgcagc agggtgcgca cccacggcca ggagggcttc ggggtggctg ccatcagatt 1343101 ggctgcgtag tgggttctgc agcgctgcca ggccgctgcg ggcagggtgg cgccgatcgc 1343161 ggccaccagg ccggcgtggg cgtcgctggt gaccagcgcg accccggaca ggccgcgggc 1343221 gaccaggtcg cggaagaacg ccagccagcc ggccccgtcc tcggcggagg tgacctggat 1343281 gcccaggatc tctcggtagc cctcggcgtt gacgccggtg gcgatcaagg tgtgcactcc 1343341 gacgacgcgg cctgcctcgc gcaccttgag caccagggcg tcggcggcga ggaaggtata 1343401 cgggccggca tcgagcgggc gggtccgaaa cgcctctacg gcttcgtcga gctctttggc 1343461 catgatcgac acttgcgact tggaaagctt tgtcacacca agtgtttcga ccaggcgctc 1343521 catccggcga gtggatactc ccagcaggta gcaggtcgcc accacgctgg tcagtgcgcg 1343581 ttcagctcgc ttgcggcgct gcagcagcca gtccgggaaa tagctgccct ggcgcagctt 1343641 ggggatcgcg acgtcgatgg ttgcggcacg ggtgtcgaaa tcacggtggc ggtagccgtt 1343701 gcgctgattg gaccgctcat cgctgcgttc gcggtagccc gccccgcaca gagcgtcggc 1343761 ttcagccccc atcaaggcgg cgatgaacgt cgagagcagc ccgcgcagca gatccgggct 1343821 cgcctgtgcg agttggtcag ccagaagctg ctcggtgtcg ataagatgag aagaggtcat 1343881 tgcgtcattt ccttcgattg acttttgctg gtcgtttcga aggatcacgc gatgaccgcc 1343941 cactactggg ctacgacacg cccaccggcc ttacctgccc gtacaccaca cccctggacg 1344001 taacttgaca ccaatccaca gcaccgagca gtgacagaag gtgccccaag gtgtggtgaa 1344061 actcgctgga cggtccccag gatgttggca gcacattcac cggacatgac cggagcaaga 1344121 ccggacatcc tcccataccg tcgtcgccgt gtacatccgt agcccgtcct ggcaggtgct 1344181 gggttgacca aaatcagccc aacacctgcc acgacgatga agcgggttgc gctggcatgt 1344241 cttgtcggct cggcgatcga attctacgac ttccttatct acggcaccgc tgcggcgctg 1344301 gtgtttccca ccgtgttctt cccacacctg gatcccacgg tggccgccgt ggcctcgatg 1344361 gggacatttg ctgtggcgtt cctatcccgg ccgttcggcg cggccgtctt tggatacttt 1344421 ggagaccgcc tcggccgcaa gaagaccctg gtcgccacac tgttgatcat gggcctggca 1344481 accgtgactg tcgggctggt tccaacgaca gtggccatcg gcgccgcggc cccactgatc 1344541 ctgacgacca tgcggctgct gcaagggttc gcggtcggcg gcgagtgggc cggttcggcg 1344601 ctgctgagcg ccgagtacgc gcccgccagc aaacgtggct ggtacgggat gttcaccgtt 1344661 gtgggtggcg gcatcgcgct ggtactgacc agcctgacct ttctgggcgt gaactacacc 1344721 attggcgaaa gcagccccac attcatgcag tgggggtggc gcataccgtt tctggtcagt 1344781 gcggcgctga tcgccgtcgc cctatacgtg cggttcaaca tcgacgagac cccggtgttc 1344841 gcccgggaaa gggcagacga aaaaacccgt ttgggcccag ccgaaacgcc gattgcccaa 1344901 gtactgcggc ggcagcggcg agagatagtc ttggccgccg gcagcgccgt ttgctgcttc 1344961 ggcttcgtct acctggccag cacttacttg gccagctacg ctcaaacccg actggggtat 1345021 tcgcgcggca gcatcctgtt cgacagtgtg ctgggtggac tgctgtgcat cgtgttcacc 1345081 gcgctttctt ccgctctttg cgaccaactc gggcgccgcc gcgtcctatt ggccgggtgg 1345141 gcggtggctc taccctggtc gctgttggtc atgccgctga tcgactccgg cagccccagt 1345201 ttgttcgcgg tggctgtcgt cggcatgtat gccatcggcg gattcggttt cggacccacg 1345261 gcatcgttca tcccagaact gtttgctact agctaccgat acacgggcag cgcgctcgcg 1345321 gcgaatctcg ctggggttgc cggcggcgcg ctaccgccgg tgattgccgg cgcgctggtg 1345381 gcaacctatg gcagctgggc gatcggtgtc atgctggcca tcctcgcgtt gatcagcctg 1345441 gtatgcacct atcggttgcc cgaaaccgcc ggatcggccc tcgtcagccg ctagttggcg 1345501 tgcaggtcct cgttgagggc aatgccctga ccgtcgcggg ccagcacttc gaccgccccg 1345561 ctgacggaat tgcggcgaaa cagcaggttg ctgctcccgg agagctcacg cgccttgacc 1345621 gaattgctgt cgggcatggt gaccctcgtg ccggcggtca cgtacagccc ggcctccacc 1345681 acgcagtcgt cgcccagtga gatgcccaga ccggagttgg cgccgagcag acaacgcttg 1345741 ccgatcgaaa tgacgtgtgt tccaccgcca gacagcgtgc ccatgatcga cgctccgccg 1345801 ccgacatcgg agccgtcgcc caccaccaca cccgccgaga tgcggccttc caccatcgag 1345861 gcgcccaggg tgccggcgtt gtagttgacg aagccctcat gcatcacggt ggtgcccggc 1345921 gccaggtgag cgcccaaccg cacgcggtcg gcatcggcga tacgtacgcc ggtgggcacg 1345981 acgtagtcga ccatccgggg aaacttgtcg acgccgtaca cagtcaccgg tccgcggcgg 1346041 cgcagccgcg cccgcaccgc ctcgaaaccg tctatggcgc agggtccgtg attggtccac 1346101 accacattgg tcagcacccc aaacaagccg ccggcgttca acccatgggg cgccaccagg 1346161 cggtgcgaca agaggtgaag ccgcaggtaa gcatcgtatg ggtcagcggc gacatcgtcg 1346221 agcgagccga tgaccgtacg gaccgcgatg gtctcggtgc ggcggtcgtc atcgcggccg 1346281 atcagcgcgg ccagctcgac aggaacgtcg gacaccgcca gtcgtgacgt cgcgctggtg 1346341 cccgattcgg tcagttccgg cgcgggaaac caggtgtcga ggaccgatcc gtcagcggcg 1346401 agggtagcca ggccgatgcc tgctgctcca gtcacggtcg acacgctact tgtgccgccg 1346461 aacagacaca aaaccaccct atttcgacca gaatcgggtg cttttgcgtc tgctcggcca 1346521 actaagctag cgccgtgctg gatttgcgcg gggacccgat cgaattgacc gcggcgctga 1346581 ttgacatccc cagcgagtcg aggaaggagg cacgcatcgc cgacgaggtg gaagcggcgt 1346641 tgcgcgctca ggcatcgggg ttcgagatca tccgcaacgg caacgcggtg ctggcgcgta 1346701 caaagctgaa ccggtcctcg cgggtgctgt tggccggaca cctggacacc gtgccagtgg 1346761 ccggcaacct gcctagccgc cgcgagaacg accagctgca cggctgcggc gcagccgaca 1346821 tgaaatccgg cgacgcggtc ttccttcatc tggccgctac actggccgaa ccgacgcacg 1346881 atctaacact ggtgttctac gactgcgagg aaatcgattc ggcggcaaac ggtttaggcc 1346941 gcatccagcg cgagctgccg gactggctat ccgcggatgt agccatcttg ggtgagccca 1347001 ccgccggctg catcgaggct ggttgccagg gcacgttgcg tgtcgtcctc agcgtgaccg 1347061 gaactcgcgc gcattcagcg cgttcgtggt tgggtgacaa cgcaatccac aagttgggtg 1347121 ctgtgctgga ccggttggcc gtctaccggg cacgcagcgt cgacatcgac ggttgcacct 1347181 atcgggaggg cctctcggcg gtgcgcgtag caggcggcgt cgccggcaac gtgatccctg 1347241 acgcggcctc ggtcacgatc aactaccgct ttgcccccga ccggtcggtg gccgcggcat 1347301 tgcaacatgt ccatgacgtg ttcgacgggc tcgacgtgca gatcgagcag acggacgccg 1347361 cggccggtgc gctgcctggc ctgtccgagc ccgcggccaa ggcgctggtc gaggccgccg 1347421 gcgggcaggt ccgggccaag tatggctgga ctgatgtgtc gcgctttgcc gctttgggca 1347481 taccggcggt caattacggc ccgggtgatc ccaacctggc gcactgccgc gacgaacggg 1347541 tgcccgtcgg caacatcacc gcggccgtgg acttgctgcg ccgatacctg ggtggctagc 1347601 gctgctgtgg ccccaagcgt gctgccgcct tggtcgcgtc ggctgccgcg gctgccatcc 1347661 cgatcccggc cagctcctca gccaccgcgg tcagctcggc agcatctccg tcggccaggc 1347721 cacgggcgtg cttgacgagg atatttccta cggtgcagtc gatttcggcg gcgaggcgag 1347781 tcaccgggtc caccgcacgg atgtcgccca accgaaccgc gttatgccag gcgcataggg 1347841 ccaccgccgc ctgcccggcc cgctcagccg tccgggcggc ctcccgggcc gccgcgatgg 1347901 cccctgtcat gtcctgggcc gccgccctgg tccaggccct ggccagcccg agctcgggtg 1347961 cgaacaacgc ggacttcgtt ccgtgccgag cttcagcgcg ctgcagtgtt tttgcagact 1348021 cggcgatatg gccttgctgc gcgatggccg ttgccaacaa catcagcgac agcggacccc 1348081 acgagtagcc ggttcgttcc agtgtggcgg cggccggctc cagcatcgat gccgcggcgc 1348141 cgaattcgcc tttggtgatc agtacgtacg ccaacaacac ttcaccgatg gaccggccag 1348201 gttgctgcag ctaggcgaag tcggtgaacc gcttggccag ctcctgagcc ggcgcgacgt 1348261 cgcctgccag cagcagcgac gtgatctgag ccaggcccac ggtgaaccgc agcagccccg 1348321 gatgttcggc ggccgacgcc cgttcggcca gccggtcaac gtcgccgaac cggcccattc 1348381 gtgccgatga taacgcggca gcgctggcgg cccaggccac ggccatgtcg tcggcagccg 1348441 gtccggatag cacctcggtg gccagcgtga tggcccgcgg caagtttccg gagttcatcg 1348501 caaacgtggc cgccagcgca tccagggtgc tgcgggccgt gggctcggtc actcggctgc 1348561 gggtcgtctg cagaaacgcc gtggcgcgct cgggctcgtt gagcatccag aaccgattcg 1348621 ccgcccgggg tatcgcccag gccatcagct cggtctcggt caattcggcg ggattcaccg 1348681 ccgccagcac cgcgtcagct tcgcgaccgc gaccctgcca accgagtgcg taagccaagg 1348741 gcaggcgtgc cgccagggcg tccgacctat ccagcgctgc ccgcgccaac cgttcggcaa 1348801 gccggacgtc gccgagccgc agggcctgcc cggctgcggt cgccgcatcc gtgaccgcgg 1348861 ccggggtagc actggcgggg acgtcgatgg ccagtgagga cagccgtaac tgatcgctga 1348921 catggtcgga tgggtgcttg gccagctgcg cgaccagcga cacgcgcaat gcatgcgcgt 1348981 gctcggccgt caatacggcg cgtgcgcggt cggcgtacag cggatggccg acaaaaatct 1349041 cgctggtatc gctgtcggga cccacccgca ccgcgccggc ggcttcggct tggccgagcg 1349101 tgtccaactg ctcgccaccg accagggcca ccaggtcggt gcgcgccaac ggttcggcga 1349161 tggcgaggta gtcgacaacg gcgcgggccg gttccggcag ggcgcacagg tactcgtcga 1349221 tcacgccgga cagcggccga cgatcctcgt ctcgacagcg ccaccggccg tccacgtgtt 1349281 cgagaccacc gccgtcgatg aggtggcgca gatacaacgg gttgccaagg ctgcgccgaa 1349341 agagctcgtc ggcgtcggcg acgtccagtg tcgcgtccag cgccgactcc acgaacgccg 1349401 cggtttgggc cctgtcgagc ggctcgatgg cgacccgggt gagcaggtca tcggaccaga 1349461 gcgcagctat agcgtccggt ggctcggcct ccgaggcgac ggtgaccacc agccgcgccg 1349521 ccccggcccg cgccagctgg tacaccaagg tggccgacag cggatccagg ttgtgcgcgt 1349581 cgtcgaccac cagcagcaga tcgccagcat caccggtcag ggaactacgc gccgcccgca 1349641 gcagcgccgc gggccgccca atgtcggctc cggaggcggg caggctgatc aaatggcgga 1349701 aagcgccgaa cgggatggcc cgccctggag cggttcccac cacccagcga gcccggccgc 1349761 tcctgccgtc ctcggacatg acctgctcgg cagccagttg cgccagcagc gtcttgccga 1349821 cgccgtgtgg cccgaccagc accaccccgc accgatccgg actgtcgacg gccgcctcca 1349881 cgtgtttcca gacgcgcatc gccggatttt atggcggttg cgcccaacga cattcgagcg 1349941 ggggataggc caaaaatgta cgcggttcac atcggtggtc tacgttctgg tgtatgtcgg 1350001 cgaaaatcga cattaccggt gattggactg tggccgtgta ttgcgcggcc tcgccaacgc 1350061 acgcggagtt gctagagctg gccgccgaag tcggcgcggc aatcgccgga cgtggctgga 1350121 cgctggtgtg gggaggtggc catgtttcgg cgatgggggc tgtcgcctcg gcggcgcgag 1350181 cctgcggcgg ctggaccgtc ggcgtgattc ccaagatgct ggtgtaccgc gaactggctg 1350241 atcacgacgc cgacgagcta atcgtcaccg acaccatgtg ggagcgcaag cagattatgg 1350301 aagatcgctc agatgcgttc atcgtgttgc cgggcggtgt cggcacccta gacgagctgt 1350361 ttgacgcatg gaccgacggg tatctcggta cccatgacaa acccattgtg atggtagatc 1350421 cctgggggca tttcgatgga ctgcgggcat ggctgaacgg attgctcgac accggttacg 1350481 tctcacccac ggcgatggaa cggctggtgg tagtcgataa cgtcaaggac gctctgcggg 1350541 cctgcgcacc ttcctgaggt tggtcgacaa ccaattcgac atttcgcaaa cgaatcgagg 1350601 gcttacgtgt ccgattacta cggcggcgca cacacaacgg tcaggctgat cgacctggca 1350661 actcggatgc cgcgagtgtt ggcggacacg ccggtgattg tgcgtggggc aatgaccggg 1350721 ctgctggccc ggccgaattc caaggcgtcg atcggcacgg tgttccagga ccgggccgct 1350781 cgctacggtg accgagtctt cctgaaattc ggcgatcagc agctgaccta ccgcgacgct 1350841 aacgccaccg ccaaccggta cgccgcggtg ttggccgccc gcggcgtcgg ccccggcgac 1350901 gtcgttggca tcatgttgcg taactcaccc agcacagtct tggcgatgct ggccacggtc 1350961 aagtgcggcg ctatcgccgg catgctcaac taccaccagc gcggcgaggt gttggcgcac 1351021 agcctgggtc tgctggacgc gaaggtactg atcgcagagt ccgacttggt cagcgccgtc 1351081 gccgaatgcg gcgcctcgcg cggccgggta gcgggcgacg tgctgaccgt cgaggacgtg 1351141 gagcgattcg ccacaacggc gcccgccacc aacccggcgt cggcgtcggc ggtgcaagcc 1351201 aaagacaccg cgttctacat cttcacctcg ggcaccaccg gatttcccaa ggccagtgtc 1351261 atgacgcatc atcggtggct gcgggcgctg gccgtcttcg gagggatggg gctgcggctg 1351321 aagggttccg acacgctcta cagctgcctg ccgctgtacc acaacaacgc gttaacggtc 1351381 gcggtgtcgt cggtgatcaa ttctggggcg accctggcgc tgggtaagtc gttttcggcg 1351441 tcgcggttct gggatgaggt gattgccaac cgggcgacgg cgttcgtcta catcggcgaa 1351501 atctgccgtt atctgctcaa ccagccggcc aagccgaccg accgtgccca ccaggtgcgg 1351561 gtgatctgcg gtaacgggct gcggccggag atctgggatg agttcaccac ccgcttcggg 1351621 gtcgcgcggg tgtgcgagtt ctacgccgcc agcgaaggca actcggcctt tatcaacatc 1351681 ttcaacgtgc ccaggaccgc cggggtatcg ccgatgccgc ttgcctttgt ggaatacgac 1351741 ctggacaccg gcgatccgct gcgggatgcg agcgggcgag tgcgtcgggt acccgacggt 1351801 gaacccggcc tgttgcttag ccgggtcaac cggctgcagc cgttcgacgg ctacaccgac 1351861 ccggttgcca gcgaaaagaa gttggtgcgc aacgcttttc gagatggcga ctgttggttc 1351921 aacaccggtg acgtgatgag cccgcagggc atgggccatg ccgccttcgt cgatcggctg 1351981 ggcgacacct tccgctggaa gggcgagaat gtcgccacca ctcaggtcga agcggcactg 1352041 gcctccgacc agaccgtcga ggagtgcacg gtctacggcg tccagattcc gcgcaccggc 1352101 gggcgcgccg gaatggccgc gatcacactg cgcgctggcg ccgaattcga cggccaggcg 1352161 ctggcccgaa cggtttacgg tcacttgccc ggctatgcac ttccgctctt tgttcgggta 1352221 gtggggtcgc tggcgcacac cacgacgttc aagagtcgca aggtggagtt gcgcaaccag 1352281 gcctatggcg ccgacatcga ggatccgctg tacgtactgg ccggcccgga cgaaggatat 1352341 gtgccgtact acgccgaata ccctgaggag gtttcgctcg gaaggcgacc gcagggctag 1352401 cggattccgg gcgcagtctc gatacccgca ctggacgctc gacggtgacc aggcactatg 1352461 gatgcgtgcg ttcaacaccg ccggcctcag ccggtcgttc aacaccgccg gcgttagccg 1352521 gccattcaac accgccggcg ttagccggcc attcaacgct gtgcggccgt ccagtcgcag 1352581 gtgatcgtgc gctgatcatg gcgatcgtca accgcacccc ggattcgttt tacgacaagg 1352641 gtgcgacttt cagcgacgcg gctgccagag acgcggtcca ccgggccgtc gccgacggtg 1352701 ccgacgtcat cgacgtcggc ggtgtcaaag ccggcccggg tgaacgcgtc gacgtcgaca 1352761 ccgagatcac gcggctggtg ccgttcatcg aatggctccg cggtgcttac ccggaccagc 1352821 tgatcagtgt cgacacctgg cgcgcgcagg tggcgaaggc ggcctgcgcg gcgggggcgg 1352881 acctgatcaa cgacacctgg ggtggcgtcg acccggccat gcccgaggtg gccgccgagt 1352941 tcggcgcggg cctggtgtgt gcgcacaccg gcggcgcgct gccacgcacg cgacccttcc 1353001 gggtgagcta cggtacgact acccgcggtg tggtggatgc tgtgattagc caggtcacag 1353061 ccgccgccga gcgggccgtc gcggccgggg tggcccgcga gaaggtgttg atcgacccgg 1353121 cacacgactt cggcaagaac accttccatg ggctgctgct attgcgacac gtggccgatc 1353181 ttgttatgac cgggtggccc gtgctgatgg ctttgagcaa caaggacgtt gtcggggaga 1353241 ctctgggcgt ggatttgacc gaacggcttg agggaacgct ggcagccacc gcgttggctg 1353301 cggccgccgg ggcgcgcatg tttcgggtgc atgaggtcgc cgccacccgg cgggtgctgg 1353361 aaatggtggc atcgattcag ggggtccggc cgccgacgcg cacggtgaga ggactcgcat 1353421 gacagcatcg gagctggtcg ccggcgatct cgccggtggc agggcccctg gcgcgctgcc 1353481 cttggacact acttggcacc gtcccggctg gacgatcggg gagttggaag cggcaaaggc 1353541 cggacggacg atttcggtgg tgctgccggc cctcaacgag gaagcgacca tcgaatcggt 1353601 gatcgacagc atctctccgc tggtcgatgg cctggtcgat gaattgatcg tgctggactc 1353661 cggttccacc gacgacaccg agatccgggc catcgcctcc ggcgcccggg ttgtcagccg 1353721 tgaacaggcg ttgcccgagg tgccggtacg gcccggcaaa ggtgaggcat tgtggcgttc 1353781 actggcggcc accagcggcg acatcgtggt gttcatcgac tcagacctga tcaacccgca 1353841 ccccttgttt gtgccatggc tggtcggtcc gctgctcacc ggcgaaggca ttcagctggt 1353901 caagagcttt taccgacggc cgctgcaggt cagcgacgtg acgagtgggg tgtgcgccac 1353961 cggcggcggg agggtcaccg agctggtggc gcggccactg ttagccgcgc tgcggcccga 1354021 gctgggttgt gtactgcagc cgctgagcgg tgagtatgcg gccagccggg agctgctgac 1354081 atcgctgcca tttgcccccg gctacggcgt ggagatcggc ctcttgatag acacgttcga 1354141 ccggttgggc ctggacgcaa tcgcccaggt caacttgggc gttcgggcgc accgtaaccg 1354201 gcccctagac gagctcggcg cgatgagccg ccaggtcatc gcgaccctgc tgtcgcgctg 1354261 tggaattccc gattccggtg tcgggctgac ccagttcttg cccggcggcc cggacgatag 1354321 tgactacacg cggcacacct ggccggtatc actagtcgac cggccgccga tgaaggtgat 1354381 gcggccgcgc tgaccgacac cgcgtcggcg ccttagggca agatcgatga cgtggcgttg 1354441 gtgttggtgt acctggtggt gctggtcctg gtggcgatcg tgctgttcgc tgcggcgagc 1354501 ttgctattcg gccgtggcga gcagttgccg cccctgccgc gggcgacgac ggcgacgacg 1354561 ctgccggcgt tcggggtcac ccgcgccgac gtcgacgcgg tcaagttcac gcaggtgctg 1354621 cgcgggtaca agaccagcga ggtggactgg gtgctggaac ggctcggccg tgagctcgag 1354681 gcgctacgct ctcagctcgg ggcgatccac gcctcgtcgg aagacgccga ggccgagtct 1354741 gacgcgtcaa acccttcgcg cggcgagacc gtcgtgcact accgttctga ccccgcgtga 1354801 gcggcgacgg gctggttcgc tgcccctggg cggaggttcg tccagggccc gatgcccagc 1354861 tgtaccgcga ctatcacgac aacgaatggg ggcgtccgct gtacggccgg gtggctttgt 1354921 tcgagcgaat gagcctggag gccttccaga gtggcctgtc atggttgata atcctgcgca 1354981 agcgggagaa tttccggcgc gcattctctg ggttcgacat cgacaagatc gctcgctaca 1355041 ccgataccga tgtgcgacgg ctactcgccg atgacggaat cgtgcgcaac cgcgccaaga 1355101 ttgaggcgac gatcgccaac gcgcgcgcag ctgccgatct ggggtcgtcc gaagacctat 1355161 ccgagctgct gtggtcgttc gcgccaccgc ctcggccccg gcccgtcgac ggttccgaaa 1355221 ttccctcggt cagcacggaa tcgaaggcta tgtcgcgtga gttgaagcgg cgcgggttcc 1355281 gtttcgtcgg gcccaccacc gcctatgcgt tgatgcaggc gaccgggatg gtcgacgacc 1355341 atatccaagc atgctgggtg cccactgagc gaccttttga ccagccgggc tgcccgatgg 1355401 cggcccggtg aagtcattgc gccggggctt gtgcacctga tgaacccgaa tagggaacaa 1355461 taggggggtg atttggcagt tcaatgtcgg gtatggctgg aaatccaatg gcggggcatg 1355521 ctcggcgccg accaggctcg cgcaggcggg ccagcccgaa tctggaggga gcactcaatg 1355581 gcggcgatga agccccggac cggcgacggt cctttggaag caactaagga ggggcgcggc 1355641 attgtgatgc gagtaccact tgagggtggc ggtcgcctgg tcgtcgagct gacacccgac 1355701 gaagccgccg cactgggtga cgaactcaaa ggcgttacta gctaagacca gcccaacggc 1355761 gaatggtcgg cgttacgcgc acaccttccg gtagatgtcc agtgtctgct cggcgatgta 1355821 tgcccaggag aactcttgga tacagcgctg gcgtccggca tgcccgtagc gctccgccgt 1355881 tgccgggtcg gcgaccaagg cattgaccgc ctcagccaat ctggcctggt aaccggtcgc 1355941 gtcgtcggcg tcgtaatgca ccagtgagcc ggtgatcccg tcggcgacca cctcggggat 1356001 cccgccgacg tcggaggcca ccacggcggt tgcgcacgcc atcgcttcca ggtttacgat 1356061 acccagcggc tcgtacaccg acgggcacac gaaaactgtt gctgccgaaa gtatttctcg 1356121 tagttgtccg atggtaagcc ggtcttggat ccaaaacacg ccagtgcgat tgcgggccag 1356181 ttcggccacc gcgacgcgca cttcgtcggc tacttccggc gtgtccgcag cacccgcgca 1356241 gagcactagc tgtacgtccg atctgaatcg gtgcgcggct gttaccaggt ggacgactcc 1356301 cttttgccgg gtgattcgcc cgacgaacac cgccatgggc cggttcggat cgaccccgag 1356361 ctcggccagc accgacccgg tacgcgcggg cccggccgga taccacgtct cggtgtcgat 1356421 cccgttccgg atgacgtgca ccaggttcgg atccaggctg ggatagaccc gcaacatgtc 1356481 gttgcgcatt gcagaactga ccgcaatgac cgcgttggcg gccagcaccg cggtctgctc 1356541 gacccatgtc gatacctggt agccgccgcc gagttgctcc ttcttccatg gccgcaacgg 1356601 ttcgagcgaa tgtgcggtca aaatatgcgg gatgtcgtag agtatcgcgg ccagatgccc 1356661 cgccagagcg gtgtaccagg tgtgtgaatg cacgacggtg gccgcgctgg cggcattggc 1356721 catcaccagg tccgcggaca aggtggacag cgccgcgttg gcgctgccta gcctcgggtc 1356781 gggccgatag gcaaatgcgc ccgggcgggg tgcgcccatg cagtgcacgt cgaccgcgca 1356841 cagccggcgt aggtaggcaa ccagttcggt gacatgtacc ccggctccac cgtaaacctc 1356901 cggtgggtat tcccgagtca acatcgccac ccgcataccc cgcaccgtag tgcggtgacg 1356961 gggcggcccg cgtggcgggc cgaggaggag gcggaggcgg cacagcaccc gtcgaacggg 1357021 gccaaacacc ttgacggaca gcccgtcaga gcagtagcca ggggcggatt ccccttggca 1357081 gtggtttgcg ggggccgata ggtttgagcc atgagagaag tgccgcacgt gctgggcata 1357141 gtcttagccg gcggtgaggg caagcggctt tatccgctga ccgcggaccg ggccaagccc 1357201 gcggttcctt ttggcggcgc ctatcgattg atcgatttcg tactctcaaa cctcgtcaac 1357261 gcccggtatc tgaggatctg tgttctcacc caatacaagt cgcattcact ggaccgccat 1357321 atctcgcaga actggcggtt gtctggtctg gcgggtgagt acatcacccc ggtgccggca 1357381 cagcagcgcc tcggcccgcg ctggtatacc ggctccgccg atgcgatcta tcaatcgctg 1357441 aacttgatct acgacgaaga tccagactac atagtggttt tcggcgccga ccacgtctac 1357501 cgtatggatc ccgaacagat ggtccggttc cacatcgaca gcggtgccgg cgcgacggtg 1357561 gccggcatac gggttccacg tgaaaatgcg accgcgttcg gttgtatcga cgccgatgac 1357621 tccggccgta ttcgcagctt cgttgagaag ccgctggagc cgcccggaac ccccgacgac 1357681 cccgacacca cgttcgtctc aatgggcaac tacattttca cgaccaaggt gcttatcgac 1357741 gcgattcgcg ccgacgccga cgacgaccac tcggaccacg acatgggtgg tgacatcgtt 1357801 ccgcggttgg tggccgacgg tatggcggcg gtctatgact tctccgataa cgaagtgcct 1357861 ggtgccaccg atcgcgaccg agcatattgg cgcgacgtcg ggacgcttga cgcgttttac 1357921 gacgcacata tggacctggt gtcggtgcac ccggtgttca acctgtacaa caagcggtgg 1357981 ccgatccgcg gggagtcgga gaacctggcg ccggcgaagt tcgtcaatgg cggctccgca 1358041 caggagtcgg tggttggtgc cggcagcatc atctcggcgg cctcggtgcg taattcggtg 1358101 ctgtcgtcga acgtcgtggt cgacgacggc gcgatcgttg agggcagtgt gatcatgccc 1358161 ggcacccgcg ttgggcgcgg ggcggtggtg cgccacgcga tcctggacaa gaacgtcgtc 1358221 gtcgggcccg gtgagatggt cggcgtggat ctggagaagg accgggaacg cttcgcgatc 1358281 agcgccggcg gcgtggtcgc cgtgggcaag ggtgtttgga tctaggtccg gttagcggcg 1358341 cgagcagaca cagaatcgcc catttcggca cgaaattggg cgattctgcg tctgctcggc 1358401 gcggtggggc gcgccggcta gggccctggc ggcccgggtt ggccgaacag ctgcccgcca 1358461 gcgccgccgc gagcgccggc cgcggcggcc ccgcgccacc tcccacgccg ccgttgccga 1358521 tcaacccccc gggcccgccg tcttggcccg gtccgccatt ggcgccgtca ccgatcgaac 1358581 agtgcctggg tgggagcgtt gatcacattc agcacgtctt gctgcacgct ctgcgccaca 1358641 gcagcgttga cggcttcggc agccgcatag gccccgccag cgccggtcag ggacgaactg 1358701 ctgatgaaac gccgtcgcct gcaagctaag cgcctgatag gcctgagcgt gtctggcgaa 1358761 cagtgacgcc acgaccgccg atacttcatc ggcacaggcg gccagcatcg cggtggttgg 1358821 ggctgccgcg gcggcattgg ccgcgctcaa tgccgagccg atgcccgcca aatccgttgc 1358881 cgccgatgcc agcacgtccg gggcgccacc agatacgaca tggccacacc ttatcgtggg 1358941 ctcgttacgg catgcggtgt tttcgacgga ctcgtcaccg acgccgcgcg tgtgacgcgc 1359001 gccgtcagcc agcgctcggc aacccgggct acccagggac ctccggtatc agcaggtgcg 1359061 cgtcgtagcg tgggccccag tgcagcgtga cacgaccacg cggcgggcgt gggtaggcgg 1359121 ccgggaattg gccggtgagc gggttgcggg gggacaacca gcgtccgcca accaccagtc 1359181 gtaactgttc gccggcgcgg aacaatgtcg ccgacgggcc aagcgcgaca tcgacggcga 1359241 cgacctcgcc ggcggtgacc ggccggggcc gagcacacgc cgggaccggc tcccatggct 1359301 gcgagagctc ggggtcgagc tcgcgcagcg agacccgctg ccagccggtg gtcacccggt 1359361 cacggcccca gccgtaggac ccctcaaacg caacgaactg gccatcgcgc cacttctcca 1359421 ctccgacgaa caggttcgcg tcgtcgcagc catccaattg aacccacagg cgggcggcca 1359481 tcgggccggt caactcgatg tcttcgggga tcgtccaatt gaatgctgct gcccgagagc 1359541 gagtttggaa cctgatgctg cccgccgtcg gcggcggctc ggttgccagc agccccggcc 1359601 cggcgagata cattggccgc caacgcgtgc cggcaagcgg ccactgggtc tcttcacgca 1359661 ccgcggtgat ggtgtcgcga tcctcacgca cctcgaggcg aacgctgcgc gaaccggagg 1359721 agccggccag cgcgtctcgc aagaacttca gctgctcgga cagcgcggtc gctgagtaga 1359781 aggtctccca tttgcccccg cgatgggtat acagccgggc gtgaccgcag ccgctgcggg 1359841 taaaagcgcg gatcgacccg cggctgtgca agttgttgtc cgagaagcta ccgcagacca 1359901 gcatcggaac cttgatcgcc gacaggtcgg gtactcgcga gcgccagaaa tcgtcgcgca 1359961 gcgggtgagc ctcttgcatc tgctccatgt cgtaggtctg acgtgtgcga cgtcgcaccc 1360021 cgcgcgacca cagccgggtg aaccctgact cccggatgcc gccgggaaag gccaagtcgc 1360081 ggtaggcgtc ggtgaaaccc tcccacgggc agatcgcccg cagcgccggc ggttgcagcg 1360141 cggccacggc gtactggcta atggccagat aagacacccc cagcatgacg acgcgcccat 1360201 cactccatga ctggtcggcg agccatccca ccaggtcgta ggtgtcctcg gcttcctggt 1360261 gtgacagcag gtctccggta ccgtcggagc ggccgcagcc gcgcgaatcc gcattgacca 1360321 cgacgaagcc ctgcgcggtc caccacgccg ggtccggcgc ctcccagccg gtcagcgccg 1360381 agaaggtcag cggcttcggc tggcgcagca tccggtattg tggtgagaac gtccaccggt 1360441 tgccccgccg ccgcggcagg gcgtccttgc cgtagggatg gatgctcgcg atcaccggcc 1360501 tagccccacc ttcggcgcta cgaaagacgt tgatccgcag cagcgttccg tcgcgggtag 1360561 gcacctcgac gtcgcgttct atgacgacgt cggccggcgg atcggtgacg gtgatcggcg 1360621 gcttggcgac gccgcgaacc cgctccagcg cataccggag agcaccggga cgtcgccacg 1360681 gccggtccaa ggcaggtgac gggtttctgg ccacgcccgt taccctaaag ctattcgacc 1360741 gctaccacac gtagggcacc aaccggtagc gcaccagttg ccggtattcg cggtacccgc 1360801 tgagttcttg cgtcagtagt ttttcctcgt cgaggatgcg gaacaccaac accagtgtgc 1360861 cggggacgag gatgaacatc gcccagtaag agcccagtgc cagcggtatg cctgtcatca 1360921 tgaccacgtt cccggcgtac atcgggtgtc ggacaatttt gtagagaccg tcggaggcca 1360981 atatctggcc cgcctccacc ctgaccgtcg aggcggcata cctgttctgg atgaccacca 1361041 gcatggcgat gccaaggccc gtcatcacta ggacgtcgcc gatcacgcac accgcggctg 1361101 gcactgacga ccaaccataa cgatggtcgc acgcgctcag caccatcatc gcgaagaacc 1361161 ccagaaaagc gccgatgacg atgaacttct gaatcgttcg gccctccgcg agcggaccgc 1361221 tgcgcatgcg acgttgaagg gccgcgggat cgttgcgagc cagatagatt gtggggccaa 1361281 tcgtggtgct cacaaatgcg gcgaggaaca cccacgcctg ccaatagtcg aacgtgccgg 1361341 ctggcccgaa taggagcgcg ccgaaaacga cgagtcctaa cacgccccat atgaatatct 1361401 tcagcccaat gtgcatggct cctcctagca gcgaacgtca cgccgtcgga aggccatggc 1361461 gcccagggtg atcagggccg catctatggc cagcagccac agcaacggca ccgcggtgaa 1361521 atcgccgccg ccgacccgcg ggatgtgggc gaacggctcc aggttgagca gcatctgcgg 1361581 gaaccccgcc aacgagccga gcaggtacag cgcgatgaac ccgaccagca cgccccacgc 1361641 caccggcgtg aaccgcggcg ccaacccgaa caatcccacg gtcaccgccg ataacaacca 1361701 cacggccggc agttgcacgg ccgcggtgcc gaccacggtg ggcagcttgc cgccgacgtc 1361761 accgacggtc atgccgtagg cgagtccggc cgccacgccg gagatcaggg tcgccaccgc 1361821 cgatccggcc agcgccatcg ccagatggct tgccagccaa tgggtccggg aaaccgcccc 1361881 ggcgagcagg gtctcggccc gcagcccggt ttcctcttgg tgcagtcgta gggtcagcga 1361941 gacggcgaat gcggcggcga ccatgccgat catggtgaag gccagcgcaa ggaaggcctg 1362001 ttccagtgcg ccggtgccgc ccatccgggt gacgatgtca cgcaccgcgg tgttatcgcc 1362061 cagctgatcc ccgatgccgt gcaccacact gcccatcacc agcccgtaca ggcacaggcc 1362121 gacggtccac aacagcaggg agccgcgatt gagccgccat gccagcccga agggctcgct 1362181 cagcatgggc ccggcggtgc cggcgccggg gcgttcggcg atcagtccgg caccgacatc 1362241 acggccggcg cgtaatcgat aggccagcac ggtaagcacg gccgcggtcg ccagcgacag 1362301 cagcagcacc caccaacgct ctcccgcgta gggtctgacc tgcagcgacc accccagcgg 1362361 cgagcaccag gacagcgtgc ccgagccggc atcaccgatg gcacgcagcg cgaacgcggt 1362421 gcccaggacg gcgaacgcga ccgcgcgggt gaatcgggcg ctcggcgaca gctgcgcggc 1362481 caccgcggcc accgccgtga agaccatccc ggaggccgcc agcgccacgc caaacgctac 1362541 cgacccggcc ggagccacat cggtggcaag cagacccaat gcaccgatcg cgccggtcgc 1362601 gatcgacgca ccgaacgaca gcagcagcgc gccggtgagg ttggtgtagc gcccgaccac 1362661 ggtcgaatcg atcaattcgg cacggccgct ttcctcgtcc gcgcgggtgt gccgaatcac 1362721 cgtgaggatg accgccaccg cgatgagggt gtgaaacatc ccggctttcc agattccgac 1362781 cgcacccagg ctgtcgttgt agaccggccc gtagagcgcg cgctgtgccg ggctggccat 1362841 aatggcggcc gccgcggcgg cgcgggcgga ccggtcgggg taaaccgttt cgacgctggc 1362901 gatgtacacg gtggccagcg gcaccgacag cagcagcacc cacagcggca acgacacccg 1362961 gtcgcggcgc aggtacaggc gcagcaaccc cagtgtgccg gtgaagcccg aaccgcggtg 1363021 tggtgcacgg tgtcctgcgg gtctcgcgcg atcgatgacc gtactgctca cggcgttgcc 1363081 acctgttgct cggctgcgac ctcggggccc aggctgtagt ggcgcaggaa cagctcctcc 1363141 agggtgggcg gctgactgac caggctgcgc acaccggcgt ggccgagcac ttggatgagt 1363201 tctctcaggc tttcgctgtc gacctgggcg cgcactgtgg tgccctcgat gctgatgtcc 1363261 tcgactccct tgattcggct gaggtctcct ggatcaccga tcatttcggc cttgatcgag 1363321 gtgcggctga ggtgccgcaa ggcgtctagt gaaccgcttt cgacggtctt gccggctcgg 1363381 atgatggtca ccttttcgca cagcgcttcg gtctcggcca gaatatggct ggacaacagc 1363441 accgtcacac cgcgttggcg tgcttcgccg atgcactgct gaaacacgtt ttccatcaac 1363501 gggtccaggc cgctgctcgg ctcatccaag agcagcagag tggcgtgcga cgacaatgcc 1363561 gagatcaggg agaccttttg gcggttgccc ttggagtagg tgcgcgcctt cttggttggg 1363621 tccaggccga agcgctcgat cagttccgcg cgacgagcgt tgtcgatgcc gcctcgcatg 1363681 cgggccagca ggtcgatggt ctcaccaccg gtcagcgacg gccacaatgt gacatcgcct 1363741 ggaacatagg cgatgtggcg gtgcaggtcg acggcgtcgg tccaggggtc accgcccagc 1363801 aaccgcacgc ttccgccgtc ggccttcacc aggcctagca ggatgcgcag ggtcgtggac 1363861 ttgcccgcgc cgttggggcc gaggaagccg tgcacttcgc cctcgcgcac cgtgaggtcg 1363921 agcccgtcga gcgcccgcac cgacccgaag tgcttggtca gtccgcgaat ctcgatgggc 1363981 acctggtggt tgtcagccga catgtgcttc tccttgttga gcttcggcca ggaaggcctc 1364041 gtacatggcg cggtcggcca gcaggccttc ggtgtagacc tccagggaag gcagcaccat 1364101 gtcgtgcgcg tagtcgcgta acgctgcacg gagatcggtt gggttttcgt gcatttgcag 1364161 ataaagcagg aagcctccgc ctccggtgat cgccagaaac cgagcacggg cgcgcgggtc 1364221 gcggctgggc ttgaccgtac cggcgcgtac tccttcgtcc aggtactcct cggcgttgtc 1364281 gatcatcttc tgccacagca tcttcgccag ctcgccgccg gattgcatgc tgcgcaccag 1364341 gtatgccatc agcggtgcgt aggattcgat ctcggccatc tgcgcgagcc aggtggtcgg 1364401 gtcgttggac ttcagtgccg cagccttgct gctgcggatc tcttcggcga cgaagtcgtc 1364461 gcaggccttg cgcagacctt ccttggaacc gaaatggtgg atgaccaatg ccgcgctcac 1364521 ccccgccgct tcggcgatgg ctcgcagccc gacaccgaat ccgtgccgac cgaactgttc 1364581 gatggccgcc tctctgatcc tggcgtgcgc ggtcagatcg gctgaacgca tgttcaggat 1364641 attaaacgta cgttcatccc cggtcaaggg agggcgccgt tgggaatccg tgaaggccgc 1364701 gaactttgcc gagcagacgc aaaatcgccc tggaacgcac ggttcagggc gattttgcgt 1364761 ctgctcgccg aattagtccc gcacggctgc cagcacgccg tcgcccagcg gcaccagtgc 1364821 cggagtgagc cgttcatcct cggcgataag ccgggccgcc tcgcgaaccg cgatcacctc 1364881 ggcgtcgcgc gccccgggat caccggcccg accgcccagc gccgcccggt gcacgacgat 1364941 gaccccgccg gatcgcagca gccgcacccc ctcggcgacg taatctggct ggtcgatcgg 1365001 gtcggcgtcg atgaatacca ggtcgtagga tgcgtcggcg agccgggtca gcacctcttg 1365061 ggcgcggccg ctgatcagcc tggtacgcga cggcccgatg cccgcctcgg caaaggcctg 1365121 cctggcaagg cgtagatgct cgggctcgat atcgatggtg gtcaagacgc cgtcgtcgcg 1365181 catgcccgac aacagccaca ggccgctgac gccggccccg gtacccactt cggccaccgc 1365241 cttgcctccg ctgagcttgg ccagcaagca cagcaacgca cccaccgccg gtgttaccgc 1365301 cccggccccg atgtcggttg cgcgctcgcg ggcgccggcc aggatcacgt cttcagatat 1365361 tgacccctcg gcgtgcgccc agagtgattc gcctcggctg ggggccggct ggccaggcat 1365421 gtcgtcgtgt ccgggggtgc cgtccatgcc cgcagcgtat gtccaattgg cgacgccgtc 1365481 gggcaggcgc gcctggttcg aacgccggcc gagcaccgag ctggacgctt gcggctgtac 1365541 ccgacacgcc cggcgtgccg gacgcgacga aggtcacttt gactcgatat tccctggaca 1365601 gcgcaggtaa cggtatggtt tctaagccaa agctcagatt gctcatatat ggcccatacg 1365661 ccggtacgcg acggtaattc ccatggaact cctcggcgga ccccgggttg ggaatacgga 1365721 atcgcaactt tgcgttgccg acggtgacga cttgccaact tattgcagtg caaattcgga 1365781 ggatctcaat atcacgacca tcacgacctt gagtccgacc agcatgtctc atccccaaca 1365841 ggtccgcgat gaccagtggg tggagccgtc tgaccaattg cagggcaccg ccgtattcga 1365901 cgccaccggg gacaaggcca ccatgccgtc ctgggatgag ctggtccgtc agcacgccga 1365961 tcgggtgtac cggctggctt atcggctctc cggcaaccag cacgatgccg aagacctgac 1366021 ccaggagacc tttatcaggg tgttccggtc ggtccagaat taccagccgg gcaccttcga 1366081 aggctggcta caccgcatca ccaccaactt gttcctggac atggtccgcc gccgggctcg 1366141 catccggatg gaggcgttac ccgaggacta cgaccgggtg cccgccgatg agcccaaccc 1366201 cgagcagatc taccacgacg cacggctggg acctgacctg caggctgcct tggcctcgct 1366261 gccgccggag tttcgtgccg cggtggtgct gtgtgacatc gagggtctgt cgtacgagga 1366321 gatcggcgcc acactgggcg tgaagctcgg gacggtacgt agccggatac accgcggacg 1366381 ccaggcactg cgggactacc tggcagcgca ccccgaacat ggcgagtgcg cagttcacgt 1366441 caacccagtt cgctgaacta ctcaacggcc gccgagcgcg tcggttcggc taccgcatgg 1366501 ttgccaatcg gtcccgaatc ctggggtttt accggctggc gatggttttc cggcaccgcg 1366561 ccgcgctaca ttcgagatac cggtggctcg ctaggtggcg gaaggaggtg gtgatggccg 1366621 accccggaag cgtgggacat gtgttccggc gcgcgttttc ctggctcccg gcgcagttcg 1366681 cctcccagag tgacgcgccg gtcggcgcgc cgcggcagtt ccgttccacc gagcacctgt 1366741 caatcgaggc catcgcggct ttcgtcgacg gcgagctgcg gatgaacgcg cacttgcggg 1366801 ccgcgcatca cctttcgctg tgtgcccaat gcgcggccga agtggacgac caaagtcgtg 1366861 cccgcgccgc tctgcgcgat tcccacccga tccgcatccc cagcacgttg ctcggattac 1366921 tgtccgagat cccgcgttgt ccacctgaag gtccatctaa aggttcgtct ggaggttcat 1366981 cccagggccc gcccgacggg gctgcggcag gcttcggcga ccgcttcgct gacggcgatg 1367041 gcgggaatcg gggccggcaa tcgcgggtgc gtcgctagcc ggtgagccac ttgtcgcagc 1367101 gcatggcggg gggttgctgc gagttcatgg cgagtggtcg cgatccgtgg atactagggt 1367161 ggacacggac aacgcgatgc ctgcacgttt tagcgcccag attcagaatg aggatgaggt 1367221 gacctccgac caaggcaaca acggcggccc gaacggcgga ggccgcctgg cgccgcgccc 1367281 ggtttttcgg ccaccggtcg acccggcgtc gcgtcaagcg ttcgggcgtc cgtccggggt 1367341 ccaagggtcc tttgtggccg agcgtgtgcg cccgcagaag taccaggacc agtctgactt 1367401 cacaccgaac gatcagcttg ctgacccggt gcttcaggag gcgttcggtc gtccgttcgc 1367461 gggcgccgaa tcgctgcagc gccatcccat cgatgccgga gcgctggcag ctgagaaaga 1367521 cggtgccggc cccgacgagc ccgacgatcc gtggcgcgac cccgcggccg cggccgcgct 1367581 ggggacgcca gcgctagccg cgccggcacc gcacggtgcg ctggccggca gcggcaagct 1367641 gggtgtgcgc gacgtgctgt ttggcggcaa ggtgtcctac ttggcgctgg gcatcttggt 1367701 cgctatcgca ctggtgatcg gcggcatcgg cggtgtcatc ggccgcaaga ccgcggaagt 1367761 agtcgatgcg ttcaccacgt cgaaggtgac cctgtcgacc actggcaatg cccaggaacc 1367821 ggccggccgg ttcaccaagg tggcggccgc cgtggccgat tcggtggtga ccattgagtc 1367881 ggtcagcgac caggagggca tgcaaggttc cggcgtcatc gtcgatggcc gcggctacat 1367941 cgtcaccaac aatcacgtga tctctgaggc ggccaacaat cccagccagt tcaagacgac 1368001 cgtggtgctc aacgacggca aggaggtgcc cgccaatctg gtgggtcgtg accccaagac 1368061 cgacttggcc gtcctcaagg tcgacaacgt cgacaatctg accgtggccc ggctcggtga 1368121 ttccagcaag gtacgggtcg gtgacgaagt cctcgcggtc ggcgcgcccc tggggctgcg 1368181 cagtacggtg acccagggca ttgtcagcgc gctacaccgc cccgttccgt tgtcgggcga 1368241 gggctctgac accgacaccg tcattgacgc aattcagacc gacgcctcga tcaaccacgg 1368301 taactccggc ggtccgctaa tcgacatgga tgcccaggtg attggcatca acaccgccgg 1368361 taagtcactg tcggatagcg ccagcgggct gggctttgcg atcccggtca acgagatgaa 1368421 attggtggca aattctctga tcaaagacgg aaagatcgtg catccgacgt tgggcatcaa 1368481 cacccggtca gtaagcaacg cgatcgcgtc gggcgcgcag gtggccaatg taaaggcggg 1368541 aagtcccgcg cagaagggcg ggatcttgga gaacgatgtg atcgtcaagg tcggtaaccg 1368601 cgcggtcgcc gactccgacg agttcgtcgt cgccgtgcgc cagttggcta tcggccagga 1368661 cgctccgata gaggtggtcc gcgagggtcg gcatgtgacg ctgacggtga aaccggaccc 1368721 cgatagcacc tagagtgttc gccaacatcg gttgggggga aatgctcgtc ctcgtcatgg 1368781 tcgggctggt ggtgcttggc ccggagcggc tcccgggtgc catccgctgg gcggcaagcg 1368841 ctctgcggca ggcgcgcgac tatctcagcg gtgtgaccag ccagctacgt gaggacattg 1368901 gacccgaatt cgatgatctg cggggacatc tcggtgagct gcagaagcta cggggaatga 1368961 ctccgcgggc tgcgttgacc aagcacctac tggatggcga tgattccctg ttcaccggag 1369021 acttcgaccg accgacgccg aagaaaccgg atgcggcggg ctcggcgggg ccggacgcta 1369081 ctgagcagat cggtgcgggg cccatcccgt ttgacagcga tgccacctag atcggtgacg 1369141 gccggcggtc gggcccggcg agctaacacc cgagcaacgg cggcaggccg gccaccgagt 1369201 cgatcacgtg gtgcggccgg gtcgcgctgg cgccggccag ccagcgatcc agcgtttgct 1369261 ggcggaactt gccggtgcgc accagcacac ccgtcatgcc caccgcctgg gcggccagca 1369321 cgtcgttgtg cagatcgtcg ccgatcatga ccatctgctg tggatcgaca ccgacgcggt 1369381 cggcggccgc caggaatccc tcggccgcag gcttgccgat ggcggtggcg gtcttgccgc 1369441 aggcctgttc cattccggtc aggtacatcc cggtgtcgat gcgcagcccg tcggtggtgt 1369501 tccaggtcat attgcggtgc atcgccacca ccggaacgcc gtcgagcatc cacccataga 1369561 cccggctgag cgtgcggtga tcgaactggg ggccggcact gccgagcacg acgacgtcgg 1369621 gggcttcggg gcaatcctcg ggaccgatct cggtcgacaa gacgacgtcg atgccgggca 1369681 agtcctcggt gatgtcgccg ttgttcacca ggaagcaccg cgcgccggga taggcgccgt 1369741 gcaggtactc ggccgtcagc accccggccg tgatcacgtc gtcggcggcg acggggatcc 1369801 ccgcggcacc cagcgcctcg gcgatctgcc ggcgggtgcg cgtcgtggtg ttggtcagat 1369861 acgcgcaggc gattccccga tgggtcagtt gccgcacggt ctcggcggcc ccgggaatcg 1369921 cgcgccacga cagcaccagc acgccgtcga tgtcgaacag caccgccgcg gccatcagat 1369981 gcgccacgtc cacacgatat ccgtcagtta gaccgtcgac atcgacacca gcgcggaaaa 1370041 accccagtga gcatcgcgct gacgtcgatc tcgacggtga ggttcatcct ggctcaggat 1370101 ccctcaagat ccgtggcgca accacacact gtcggccacc cagggcgacg cggcgccggc 1370161 caccgaccac gccagctccg cgggcacatc gagcacctga taacccttgc ggcccgccgc 1370221 ggtggccgcc acgagcgtcg ccacccccgc cctccgctgg aacagtgtct ggcgcaccgt 1370281 ccagccgatg atgccggtgc aggcgatgca atcgcggcga cgctgtaggc tgccggcgcg 1370341 cgcaaccaac cagccgtcgg cgacgcggtg cccgagtgat cggacccgat cgacggccag 1370401 cccagcgcaa cccgcggtca acaccgccca cagtgtccac gcccaccccg gcacgccgag 1370461 aatcggcgcc gctgcgatca gcgcaactcc ggccagcgtc gggaccaaca gcgcccgggt 1370521 ccacctgcgc cgggcggcgg ccgggccgtg ccggcgcagc ggccccgctg ccgcgtcggt 1370581 gttgtcgatc aggtcggtca gcacggccgt cgcggtctcg aacggacatg gtggcagcag 1370641 catcgacgac tggccctcgc catgcacgcc ggtcatcact gcgtccagcc gagcaccgcg 1370701 caataaccgc accagcagtg gttcacgcaa ggtggcgcca cgcagccggc gcatgtcgta 1370761 ggtgtgctcg cgcacccgca gcagcccgtg ccgcaggtgt agcacccctt cttgaccgct 1370821 gccgccgcgg cgcagcagca gattgccgta ggtcaaccag gagaacagca ccgccaacag 1370881 tgccgataca cccaccacca gcagcacagt gaccgccacc accagtacca ccccggcgcg 1370941 ttgcgcggcg tccaccgcgg acctggcgaa accggattcc gggagtcgca cggccagtcc 1371001 cgtttggtag ccaagcccga tcaccgcccc gatcatcacc aggcccgaaa agctcagcgg 1371061 cgcataccgc aaccacgacg actgccaccg ggccagcacc cgaccggtcg gctcgacggg 1371121 tgccagcgac tcggccagca gcagcgcgcg cagcctgggc acccgtgccg agtcgaccgc 1371181 gtccagttcg aaggcggcct caccgcgggc ctcctggccg gtgcccaccc gcagcaccgt 1371241 caaccccaac agccggtgca acagccgcgc ctcggtctgc accgagcgaa tccggttgcg 1371301 cggcacggag accgcgcgcc ggctgagtat gccggtacgc agcgacacgt tttcgtcgtc 1371361 gatgcggtag gtggtgaaaa accaacgcag cacgccgaat acgaccgtca cgccgagcgc 1371421 cgccagcggc cagaccgggt tgccggttgc cgaccccagc accacggacc cgatgagtac 1371481 cgggagctgg cgcagcatct cgtgcaccgg atgcaccagc agcatccgcg ggctgaggcg 1371541 gtgccaatcg tgtggccggt cggtcatgtc gcgtcctcgc cgcgcagcgc ggcgatgtcg 1371601 gtcagctgcg ccaccacccg atcggcgacg tcggtgtcca acgcctcgat gtgcaccgcg 1371661 cccgccgagg acgccgtggt tacggtgacg ttggccagcc cgaacagccg gtccatcggg 1371721 ccgcggtagg tgtcgacggt ctgcacccgg gaaatcggtg tgatgcggcg ctcctgcacg 1371781 agccaaccgg tgcgggtgaa tacggcctgc gggctgatct cccaacggtg tacccggtaa 1371841 cgccagagcg ggaccacccc gatgtgcacc accatcgcca ccgcggtgag agcggccgcg 1371901 gccaggtgcg gccagggcgg ctggggatgc accgcccacc acaccagctg cgcgatcacc 1371961 gggagtatcc agcccagcga cgcggacagc gcccacatca ccggcgcctg gctgctcggt 1372021 cgatgggccg gctcggcgag cgcgaggtga tttctctgcg gtccggttgc gcttggcaca 1372081 tttcgagcat ggtccaacgg aaaccgaaca cagtgatcgg gggtcgtggt tatcgtttga 1372141 gctagcgctc aacaagatgc gtgccaactc accctgcccc ggggaggcgc gatgagtcga 1372201 cagtggcact ggctggcagc gacgctgctc ctgatcacca ccgccgcgtg cagtcgtccg 1372261 ggcaccgagg aaccggattg cccgacgaaa ataaccttgc cgcccggtgc tacgcccacc 1372321 acgaccctcg acccgagatg catagtgcgc gcgaccacca ccggcacagc cgacggcgat 1372381 gcggcgtcgc gctggaccgg aaccgtgcgg atcgccgggt tctatgcctc gatctgcaac 1372441 gcggtatggg acgggaacgt cagccttgcg ggaaaggacg agctgaccgg caaggctacg 1372501 cttatcctcg tcgaaaccag ttgcccgggc aaggttgtcg ccggcgaact cgtgctgaag 1372561 gggaacgtcg gttcggacag cctcgcgatc acctgggcgc accccgaact cccgcagcgg 1372621 gcgttcgacc tcggcgccgg acagggcacg atccgccgat cgggcgaccg tgccgaggga 1372681 acgttcaact cggatatggg tgggggcacc gagttcttct tgacgtggtc gctgacgatg 1372741 cgtaactgac gatcacaacg tgcccaccaa aaacagagta gacaacagtc gacaattccc 1372801 ttgtactccg gcgctatgaa gtcgatctcc gtcggtgagc tgcgccagaa tcccgctccc 1372861 atgatcgccg acctcgaccg gggtgagcca tacgcgctga cccgccacaa ccaccggatc 1372921 ggaacgatca ttcctgccgt ctcgtcggca acactcattc cccggaaagc ctagtacgcc 1372981 gagcagacgc aacggcaccc aatttcgacc agaatcgggt tcttttgcgt ctgctcacgc 1373041 ggtcaacgct agcgtcgtgt cgggtccaac cccagcgaca tgcccgccaa tccgcgtcgt 1373101 cgagtcgaca agccgtcggc gatgctatgc agttccttgc cgatcgccga gtccggcgag 1373161 ctcaacacga gcggtacgcc cgaatcgccg gcggccacca gtgcggggtc cagcgggatc 1373221 tgacccagca gcggcacgtc ggcgccgacc gcacgcgaca accgctcggc gaccagccgg 1373281 ccaccgccct cgccgaacac ctgcatcgtg gtgccgtccg gcagcgtgag ccccgacatg 1373341 ttctccacga cgccgacgat gcgttggcgg gtttgcagcg cgatgctgcc ggcccgttcg 1373401 gccacctccg cggcggccag ctgcggggtg gtgaccacca ggagttcggc gttggggatc 1373461 agttgagcca ccgagatggc gacgtcgccg gttccgggcg gcaagtccag cagcagcacg 1373521 tccagatccc cccagtacac gtcggccaga aactgctgca acgcccggtg cagcatcggc 1373581 ccgcgccaca ccaccggggt gttgccctgg gtgaactggg ctatcgagat gaccttcacc 1373641 tggtgggcga tcggcggcag gatcatcgac tcaacctggg taggccggtc ggtggtgccc 1373701 atcatccggg ggatagagtg gccgtggata tcagcgtcca gcaccccgat cgacaggccg 1373761 cggacggcca tcgcggcggc caggttgacc gtgacggtgg actttccgac tccgccctta 1373821 ccggaagcca cggcatacac ccgggtcaag gaatcgggtt gcgcgaacgg gatgacgggt 1373881 tcgcgggtat cgccacgcaa ctgcttacgc agctcggtgc gctgctcgtc gctcatcacg 1373941 tccaagctga cccgcaccgc cgaagtgcct ggcacgtcgg cgaccgcccg ggtgacacgc 1374001 tcggtgattt cggacttctt cgggcagccg gcgatggtca ggtagatctc gacgtgcacg 1374061 ctcccatccg ggccggtgtc gatgcttttg accatcccca gttcggtgat ggggcgccgc 1374121 aattcggggt cgattacctt gcccagcgcg gtgcgtatcg ccgcgttcag gtcgccatca 1374181 cgagttccgg acatcaccgc cgagtgtagg cggcttggca tacggccgag tggtcagccg 1374241 gcaggagccg gcgccggcgg cgccaggccc gcgtcgccag gcgggccggc caatggatcc 1374301 ggaggtgggg gagcggcagg taggaatgga ggtgggggag cggtaggcgg gaacggcggc 1374361 gcgcccactg gcgggccatg tgagccaatg cagatcagcg tgcagccggg catcggcgcc 1374421 gatgggtcag gtgccatcca cgggaacatc ggcggtggat tgagcgccgc ctggcgcggg 1374481 gtcaagtcga tcagcggcag gtgcgccatg gggccatcgg cggtcaggcc gttgacattg 1374541 atcggcaagc cgggcccgag accctccgga ttctcgaggt gcgcgtcgcc gagtggtggt 1374601 ggcggaccgg tgatcggggg caagtcaacc gggaacacac cggtggcgta gccggcggcc 1374661 cagcccagta cgttctgggc gtaaggcatc gagttgttgt agcgcaggag cgcggccatg 1374721 acctgcgccg ggtcgcgcag gttgagccca ccgctacaca ggtagcgggc tgcggccaac 1374781 gtggagtcga acaggttctg cgggtcagcc acaccgtcgt catcgccgtc ggtggcgtac 1374841 cgagcccaag tgccgggcaa gaactgcatt ggccccatcg cgcgggcgta cgtgacgcga 1374901 ttgccgacgc tgctttggat gatgatctcg ttgcctggca gggtgccgtc cagcgttggg 1374961 ccgtagatcg gctggatcgc ggtgccgcgc gcgtcggtgg cgccgccgtt tgcgtgcatc 1375021 gactcgatgc gcccaatccc ggccagcaag ttccaactga cgccacagcc aggggcggca 1375081 gcggccatct tcagctcggc gttgcggtag gcggacagtg ccatggccgg aatgccaagc 1375141 gcaccaggcg aattcacgat catcggtggt ggtggagccg gtatggtagc taccgccacg 1375201 cggaagctgg tcggcgggcg cttcatggcg atgacgaccg gaccggacag gtctatgccg 1375261 gacgcggcga ccgcggccac cggggtgata acggcgtgca ccggcgcggt tctcccgggg 1375321 aataccggag ccgcgccgcc gaccgcactg gcgaatacca acggggcaat cgctgccacg 1375381 ccgaatgccg gcgcccgcgt taggcgacaa gctccccgcc gcactgcagc gacggccggg 1375441 cgtgcacccc agcgtccccc aatgtgcact cgaccgtcct cagtgtgtga gccgtcggaa 1375501 acctatgtct tcttagcttc tttcttcgtt tcgtgaacta gatcaccata cataactctt 1375561 gtcacgggag tggcgcaatg gccgactcgg taatcacccc gatttcttgg cgtgctgctc 1375621 cgcctcgtcg gccacccgcg gctgcgccac atccggatcc gtcggctgca gctccgccaa 1375681 cagagcgcgc aggctgtcca gttcgtggcg caggtagtcg cgcgtgggga cctcgccgat 1375741 ggccagccgc agcgctgcca gctcgcgggc gttgtactcg gtgtcggcct tggtctgtgc 1375801 ggcccgccga cgatcctctt cgaacaccgc gcggtcacgc ttttcctgac ggttctgggc 1375861 gagcagaatc agcggtgcgg cgtacgaggc ctgcgtggag aaggccagat tgagcaggat 1375921 gaaggggtac ggatcccagc gcaagccgac cgcaaacagg ttcagcacga tccatgtcag 1375981 tacgagcagc gtctgcacca gcaggtaacg gccggttccg aaaaaccgtg cgatggattc 1376041 ggttgtcctg ccgacggcct cgggatccag ccgcggggcg agcgtgcgcg atgtgcgtgg 1376101 ggtgtacaga cggcgcggcg cgaagggttt gctcaccgtg gtcctccggg tctgtccggt 1376161 gctccggagg ggtcgagctc cggcatatct acacgccagt catgcggcaa tagatggtcg 1376221 agcaggtcgt ccacggtcac cgctcccagc aggtggttct cgtcgtcaac caccggtccg 1376281 cacaccaggt tgtaggcggc gaagtagcga gtcaccgcgg ccagcggggt ctccggagtg 1376341 agcgtgagca ggtcagtgtc cacaactccg ccgaccagct cggccggcgg gtcacgaagc 1376401 agccgctgca aatgcacaca acccaggtag tgcccagtgg gcgtggccgt gggcgggcgc 1376461 gcgacgaaca ccattgacgc cagggcgggg gtgagatcgg gatcgcggac ccgcgccaac 1376521 gcctccgcaa tcgaggtgtc cggggtcaac accaccggat cggaagtcat caatccgccc 1376581 gccgtgtcgg gggagtgcgt cagcagcctt cgcacctgcc cggagtcgcc gggatccatt 1376641 cgtgtcagca gcaactcggc ttcggtcgga ttcaggaccg cgagcagatc ggcggcgtcg 1376701 tcgggatcca tctcctccag cacgtcggcc gcgcgttcgg tgcccagttg cgacaacacc 1376761 tcggcctgat ccagttcggg cagctcctgc aggacgtcgg ccaagcgctt gtcgtggagc 1376821 gccttgaaca cctcgtggcg gcgcttcggc ggcagcccgc ggatggcgtc ggccacgtcg 1376881 accgctttcc atccctcgaa ctggtcgagc agctgtgcca cgtcttgacc cggcatcgcc 1376941 aaggccgacg gcgtcaaccc cgccacgttg tgccagtcca cgacgtgcac tgggccgcgc 1377001 cgtcggagcc gacgttgggt gcggacggcg accctagtca ccatccagtc gcgacttcgg 1377061 gtttgctcga cacccaggtc ggtgaccacg acgtcgacgc cggccagctc gggtagtgcg 1377121 ggatcgttga ccttcaccag ggtgtcgagc acttgaccca gcgccagagc ctcgcctggc 1377181 cgctgctcga agcggtgcag tgacacgttg ccggtgctca gtgtcaccgc gtgcggctcg 1377241 atcgcggcga cccgcagaat cggtatgaat atcttgcggc gggtcgccaa atcgaccacc 1377301 agcccgagca ctcgcggttg ttggcggaca atgctgatgc tgatcacgac atcgcgaacg 1377361 cgcccgaagg attcgccgag cggtcccagc accgacatcc gcgagagccg cgccaggtac 1377421 accctgttga ccgatcccat gattgagagc ctaggcagct gccttccgga tcaaccgagg 1377481 gtgggccaat gtcgcctaat gctaagggat agcgaagatc cccgcgatca tgtagaccag 1377541 cagggtcgcg atgccaatca caatgccggc caccgccagg ccgtagcctt cttcgcgtgt 1377601 ctgcttgatc tggttgatgg cgatcgcgcc gaacacgatg cccacgatcg agccgatgca 1377661 gcaaagcaca ccgacgagcg ccgagatcag tgagacgagc gccatggtgt tcatgccggg 1377721 ctgcgatggg ccgtagccgt ctaggtagcc cggctccggg tagtagccgc ccggagatcc 1377781 accgtatggc ggaggcatgg gcgggtatgg tatgtcgccg tagcctgctg aagaagtgcc 1377841 ggggggtgga tatccgggcg gcgcatagcc cccgggtggc atcggcggtg gatagtcggt 1377901 cggatacccg ggctggtaag caggcgggta accggacggc ggatacgccg ggggcgggtg 1377961 gtcggccatc ggcgaagatg ccggcggcgc ccaaggagcg tcagcaatgg gctgttcggg 1378021 gggccgctca ccgaccggag gcggtccacc cgcggcgtcg tgcgcactct cgccagagga 1378081 gccgctggga gccgtcatgg tgatcaacct atcccggcaa cgatgctcgc cgttcggtgg 1378141 gcctcggtcg ctcgcgggtt gagtggatag tgtgccggga gtagctggac ctgactggac 1378201 atgaaacgat ggcgctgaaa aaggggggcg gaggagaatg agaaccgatg actagcccat 1378261 tccagcccag acaggttccc ggttcaacac ccgccgccgc aggtgcgggt cgacgtggtg 1378321 tgcccgcatt gcccaccccg ccgaaaggtt ggccagtcgg gtcgtatccc acctatgccg 1378381 aggcgcaacg tgcggtcgac tatctatccg agcagcagtt cccggtccag caggtgacca 1378441 tcgttggcgt ggacctcatg caggttgaac gggtcacagg ccggctgacc tggcccaaag 1378501 tgcttggtgg cggcgtgctg agtggcgcct ggctgggcct gttcatcggg ttggtgctcg 1378561 ggttcttcag tcccaatcca tggtccgcgc tggttaccgg cctggtggcc ggggtgttct 1378621 tcgggctgat cacctctgca gtgccgtacg caatggctcg cggcacaagg gatttcagct 1378681 cgaccatgca actggttgcc ggtcgctacg acgtactttg tgatccgcaa aatgcggaaa 1378741 aggcacggga tctgctggcg cgtctggcga tctgaagccc ggacgagagg caaatgtggt 1378801 catgagtcgc gggcggatac cgaggctggg cgctgccgta ctggtggcgt tgacgaccgc 1378861 ggcggcggcg tgcggggccg atagccaggg gctggtggtc agcttctaca caccggccac 1378921 cgacggcgcg acgttcaccg caattgccca acgctgcaac caacagttcg gcggccggtt 1378981 caccattgcg caggtcagct tgcccaggtc ccccaatgag caacggttac agctggcccg 1379041 acggttgacc ggtaacgacc gcaccctgga cgtcatggcg ctggatgtgg tgtggacggc 1379101 ggagttcgcc gaagcggggt gggcgctgcc gctgtcggac gacccagcgg ggctggccga 1379161 gaacgacgcc gtcgccgata ccctgccagg cccgcttgcg acggccggct ggaaccacaa 1379221 gctgtacgcg gcacccgtca ccactaatac tcaattgctt tggtaccgac cagatttggt 1379281 aaatagcccg ccaacggatt ggaatgccat gatcgctgag gcggcccggc tgcacgcggc 1379341 gggcgagcct agctggatcg cggtacaggc caatcagggc gagggcttag tggtgtggtt 1379401 caacacgctg ctggtgagcg ctggtggatc ggtgctctcc gaggacggcc ggcacgtcac 1379461 cttgaccgat actcccgcac accgagcggc tacggtcagc gcgctacaga tcctcaaatc 1379521 ggtggctacc acgcccggcg ccgacccctc gatcacccgc accgaagagg gcagcgcgcg 1379581 gttggccttc gaacagggca aggccgcgct cgaggtcaat tggccgttcg tgtttgcgtc 1379641 catgctcgag aacgcggtga agggtggtgt gcccttctta ccgcttaacc ggattccgca 1379701 gttggccggc agcatcaacg acatcgggac gttcacgccc agcgacgagc agttccgcat 1379761 cgcgtatgac gccagccagc aggtgttcgg tttcgcgccc tatccggctg tagcgcccgg 1379821 ccagccagcc aaggtgacga tcggcgggtt gaacctggcg gtggccaaga cgacccgcca 1379881 tcgagcggag gcattcgaag cggtgcgttg tctgcgtgac cagcacaatc agaggtacgt 1379941 ctcgctcgag gggggtctgc ccgcggtgcg ggcgtcgctg tactccgatc cgcaattcca 1380001 ggcgaagtat ccgatgcacg ccattattcg gcagcaactc accgatgccg cggtgcggcc 1380061 ggcgacgccg gtgtaccagg cgttgtccat ccggctcgcg gcggtgctga gcccgatcac 1380121 cgagatcgac ccggagtcca cggccgacga acttgccgcg caggcgcaga aagccatcga 1380181 cggcatgggc ctgctcccgt gacctccgtt gaacagcgga ccgccaccgc ggtcttttcc 1380241 cgtaccggga gccgcatggc cgaacggcga ctggcgttca tgctggtcgc acccgccgcg 1380301 atgttgatgg tggcggtgac ggcctatccc atcggttacg cgctgtggct tagcctgcag 1380361 cgcaacaacc tggccacccc gaacgacacc gcgttcatcg ggctgggcaa ctatcacacg 1380421 atcctgatcg accggtattg gtggacggcg ctggcggtga cgctggcgat cacggcggtt 1380481 tcggtgacga tcgaattcgt cttggggtta gcgctcgccc tggtaatgca ccgcacgctg 1380541 atcggcaagg ggttggtgcg caccgcggtg ctcattccgt acggcatcgt cacggtggtc 1380601 gcctcgtata gctggtacta cgcctggacg ccgggcaccg ggtatctggc caacctgctg 1380661 ccgtatgaca gtgcgccact gacgcaacag atcccgtcgt tgggcatcgt ggtgatcgcc 1380721 gaggtctgga agacgacgcc gtttatgtcg ctgctgcttt tggccgggtt ggcgctggtc 1380781 cccgaggatc tgctaagagc agcgcaggtt gacggcgcca gcgcctggcg gcggttgacg 1380841 aaggtcatct tgccgatgat caagccggcg atcgtggttg ctctgctctt caggaccctg 1380901 gacgctttcc ggattttcga caacatctat gtgctgaccg gcggcagcaa caacaccgga 1380961 tcggtgtcga tcttgggcta cgacaacctg ttcaaggggt tcaacgtggg ccttggttcg 1381021 gcgatcagcg tgctgatctt tggctgcgtg gccgtcattg cgttcatttt catcaagttg 1381081 ttcggcgccg cggcgcccgg gggtgagcca agtgggcgtt gaacgggtgg gcgcgcggcg 1381141 cgccacgtat tgggccgtcc tggacacttt ggtcgtgggg tacgcgttgc tcccggtgct 1381201 gtggattttc agcctgtcac tcaagccgac gtcaacggtc aaggacggca agctgattcc 1381261 gtcgacggtg actttcgaca actatcgtgg catcttccgg ggcgacttgt tcagctcagc 1381321 gctgatcaac tccatcggaa tcggcctgat caccaccgtg atcgcggtgg tgctcggcgc 1381381 gatggcggcc tacgcggttg cccggctgga atttccgggc aagcggctgc taatcggggc 1381441 tgccttgctg atcacgatgt tcccgtcgat ctctttggtc acaccattgt tcaacatcga 1381501 acgtgccatc ggcctgttcg acacctggcc ggggttgatc ttgccgtaca tcaccttcgc 1381561 gttgccgctc gcgatctaca ccctgtcggc gttcttccgg gagatccctt gggatctgga 1381621 aaaggcggcc aagatggacg gtgcaacgcc cggtcaggct ttccggaagg tgatcgtacc 1381681 gctggcggcg ccgggcttgg tgaccgctgc aatcctggtg ttcattttcg cctggaacga 1381741 tctgctgctc gcgttgtcgc tgaccgctac caaggcggcg attaccgcgc cggtggccat 1381801 cgccaacttc accggcagtt cgcaattcga ggagccgacc ggctcgatcg cggccggcgc 1381861 gatcgtgatt acgatcccga tcatcgtctt tgttttaatc ttccaacgac ggattgtcgc 1381921 cgggttgacc tctggcgctg tgaagggata gcgcgatggc cgagattgtg ttggaccacg 1381981 tcaacaagag ttaccccgac ggtcacacag cggtgcgcga cctcaacctc accatcgccg 1382041 acggcgaatt tctgatcctg gtagggcctt ccggttgtgg caagaccacg acgctgaata 1382101 tgattgctgg gcttgaagat atctcgtcgg gagaactgcg catcgccggt gagcgggtaa 1382161 acgagaaggc gccaaaggac cgtgacatcg cgatggtgtt ccagtcgtac gcgctttacc 1382221 cgcatatgac ggtgcgccag aacatcgcgt tcccgctgac cctggcgaag atgagaaagg 1382281 ccgacatcgc gcagaaggtc tccgagactg caaaaatcct tgacctgacc aaccttctgg 1382341 atcgcaagcc ctcacaattg tcgggtggtc agcgacagcg ggtcgcgatg ggcagggcaa 1382401 tcgtgcgcca tcccaaagca ttcctgatgg acgagccgct gtcgaacttg gacgcgaagt 1382461 tgcgggtcca gatgcgcggc gagattgccc agctgcagcg gaggctgggt accaccaccg 1382521 tctacgtcac ccacgaccag accgaggcaa tgacgctggg cgatcgcgtg gtagtgatgt 1382581 acgggggcat cgcacagcag atcggcaccc ctgaggagct ttacgaacgg cccgccaatc 1382641 tgtttgtcgc gggctttatc ggctcgccgg ccatgaattt cttccctgcc aggctgaccg 1382701 cgatcggact gaccctgccg ttcggtgagg tgacgctggc ccccgaagtc cagggggtga 1382761 tcgcagcgca cccgaaaccg gaaaacgtca tcgtaggcgt gcggccggag catatccagg 1382821 acgcagcatt gatcgacgcg tatcaacgca tcagggcgct gaccttccag gtgaaggtca 1382881 acttggtcga gtctttaggc gccgacaaat atctgtattt cactaccgag agcccggctg 1382941 tgcactcggt tcagttggac gagttggcgg aggtagaggg ggagtcggcg ttacacgaaa 1383001 atcagttcgt ggcaagggtt cccgccgagt ccaaggtagc catcgggcag tcggtcgagt 1383061 tggctttcga taccgccaga cttgccgtct tcgacgccga ctccggtgcg aacctgacca 1383121 ttccgcaccg cgcctaatgg cggcgagcgg acacataagc ccccgccacg ccgaaggatt 1383181 tggagctttt tgcgtctgtt cgccgacgcg aagctagagc cagtttctgt tgcggaagac 1383241 gtggtagagg aacagacaga taaggaccat cccgccgatc actgtcgggt aaccccacct 1383301 ggagtccagc tcgggcatga agtgaaagtt catgccatag atgcccgcga tcatggtggg 1383361 gaccgcgatg atacctgccc acgcggatat cttgcgcatg tccatgtttt gctgcatgcc 1383421 gacccgggcg agcgcggcct gcaccagcga gttgagcatg tcgtcgtagc tggcgatctg 1383481 gtcggcggcc tcggtctggt ggtcggcgac gtcgcgcagg tagcgccgca cttctttcga 1383541 aatgaggtct ttgctctcgg tctgcatgcg ctggaatgcg gtcgatagcg gattcacgca 1383601 ccggcgcaac tcgaccactt cccgcttgag cagatagatc ggttcgatgt cgagcttgcg 1383661 gcccggcgcg aacgctactt cctcgatgct gtcgatatcg gtctccatga gattggtcac 1383721 ctcgaggtag cggtcgacca cgtagtcggc gatcgcgtgc atcaccgcat acggtcccaa 1383781 ccgcaaatgt tcggggtcgg catccatccg cttacgcacc tcggataacc cgccgtgttc 1383841 gccgtggcgg acggtgacca cgaaatcctt gccgacgaag atcatgatct cgccggtttc 1383901 gacgatctcg cgggccagta ccaccgattc gtgcgggacg tagttgacgg tcttgaggac 1383961 gaggaacagc gtctcgtcgt agcgctccaa cttgggtcgc tggtgcgcgt gcacggcgtc 1384021 ctcaacggct aacgggtgca acccgaaaac gtctgctacg tcctgcatct ggttttcatc 1384081 gggctcgtgc agcccgatcc agacgaacgc ctcctgcccg gtcagttcga tctcgcgcac 1384141 ctcgcgcagc gcggcggcgt aggtgtactt gccgggcagt cgctggccgc agacgtagac 1384201 accgcagtcg accaaggctt gggccggtgg ctgggcaacg gggtgtgcgt tcggcggctg 1384261 gggtcgcgcg accggtcgca gcacttcggg caatgcgtca aaccctggga acacgtcaac 1384321 ctccgatcgc ggtggatctg atcgggcggt gctccaggtt acgcgtcccg gtatggaact 1384381 tggtaaacgt cagtcgtagc tgtgggggtt ggaccccaga tgtccgtccg gtgccggtgc 1384441 gctagtttca acccgaagcc aagtccgtaa ggagcagaac cgacgtgagc gctagtcctc 1384501 tcaaggtcgc cgttaccggc gccgccggcc aaatcggcta cagcctgttg ttccgcctgg 1384561 ccagcggctc tttgctgggc cctgaccgtc cgatcgagct gcggctgctc gagatcgagc 1384621 cggcactgca ggcgctcgag ggtgtggtga tggaactcga cgactgcgct ttcccgctgt 1384681 tgtccggggt ggagatcggt tccgatcccc agaagatctt cgatggtgtg agcctggccc 1384741 tgctggtcgg agcccgcccc cggggcgcgg gcatggagcg aagtgacctg ctggaggcca 1384801 acggcgcgat cttcaccgct cagggcaaag ccctcaacgc tgtcgccgcg gatgacgttc 1384861 gcgtcggggt gaccggcaac cccgccaaca ccaacgcgct gatcgcgatg accaatgcgc 1384921 ccgacattcc ccgcgagcgg ttctcggcgc tcacccggct ggaccacaat cgggcgatct 1384981 cgcagctggc cgccaagacc ggcgcggcgg tcaccgacat caagaagatg acgatctggg 1385041 gcaatcactc ggccacccag taccccgacc tgttccacgc ggaggtcgcc ggaaagaacg 1385101 cggccgaagt ggtcaacgac caggcctgga tcgaggatga attcatcccg acggtcgcca 1385161 agcgcggtgc ggcgatcatc gatgcgcgcg gcgcgtcgtc ggccgcctcg gccgcgtcgg 1385221 caaccatcga cgctgcccgg gactggttgc tggggacgcc ggcggacgat tgggtctcga 1385281 tggccgtcgt ctccgacggg tcctacgggg tgccggaggg cttgatctcc tcgtttccgg 1385341 tcaccaccaa gggcggcaac tggacgatcg tgagcggctt ggagatcgac gagttctccc 1385401 gcggccggat cgacaagtca accgccgagt tggctgacga gcgcagcgcg gtcaccgagc 1385461 tcggcctgat ctgagcgcag gtcagccgcg cactgagcgg agcccgagtc atcttgacgt 1385521 gtgtttgtcc aggcatcatg atgacctgta tgcgcaccac cttgacgctc gatgacgacg 1385581 tcgtccggct ggtcgaagac gcagtgcatc gcgaacgccg cccgatgaag caggtcatca 1385641 acgatgcgct gcgcagagcg ctggcgccgc cggtgaaacg gcaggagcag tatcggttgg 1385701 agccgcatga gtcggctgtg cgttccgggt tggatctggc cggcttcaac aagttggccg 1385761 acgaactgga ggatgaggcg ctgctggatg ccacgcgtcg ggcccggtga tcatccctga 1385821 catcaatctg ctgctctacg cggtcatcac cggattcccg cagcaccggc gcgcgcatgc 1385881 gtggtggcaa gacaccgtca acggccacac ccgtatcggg ctgacgtatc cggcgttgtt 1385941 cgggttccta cggatcgcca ccagtgcccg cgtgctcgcc gcgccactgc caaccgcgga 1386001 tgcgatcgcc tatgtgcgcg agtggctttc gcagccgaac gtggacctac tcacggcggg 1386061 tccgcgccac ctggacatcg cgttgggcct gctcgacaag ctcggcacag ccagccacct 1386121 aaccaccgat gtgcaactgg ccgcctacgg catcgaatac gacgccgaga tccattccag 1386181 tgacaccgac tttgcccgat tcgccgatct gaagtggacc gacccgttgc gcgaataatg 1386241 actgccgctc tgccctcggg tcagccgttc aggccgtgct gaccgttggc gccggtagcg 1386301 ccttgagtac cgggatcgcc gggggcgccg gggttgaacc cggtcccgcc gccgccgccc 1386361 gcgccgccgt tgccgcccgc gccgccgagg cccccggccg cgccggagcc ggggctgccc 1386421 gactgtccga acagtccgcc cgcaccgccg gtcccgccgt ttccgccgac gccaccggcc 1386481 ccgccggccc cgccgtcgcc gccgttgccg ccgtcaccgc cgtcgccgtc ctggttggcc 1386541 atgccgtcgg cgccgatccc gccgttgccg ccgttgccgc cgctgccgcc ttgagcgccg 1386601 atgcccccgt cgcccccgac gccgccgtcg ccgccggcgc cgcccgtgcc gagcagtagc 1386661 ccgccgcgac ccccgctgcc cccaaagccg ccggcgccac caacgtcagc cgaggcaccg 1386721 acgccgccgt cgccgccggc accaccattg cccccggtgg agttgccccc aggaggatta 1386781 tcttgattgg catttcctcc ggcgccgccg gcaccaccgg gagcgccgat accgccgttc 1386841 ccgccggcgc caccgttgcc ccctatgctg ttgccagcat ttgcaacatt ggcgctgcca 1386901 cccgctccgc ccagcccccc gccgccgccg gctccgccgt ttccgccggc gccgccattg 1386961 ccgccgacag cgtcaccaaa gccgctttga gcggcgccac cgttaccgcc ggcacctccg 1387021 gaggcgaagt tggcgccgtc gccgccgtcg ccgccggcac ccccggacac gtcggtctgc 1387081 ccaaggttgg ttccatcccc gcctatgccg cctgcaccac cgcccacgcc ggggttgact 1387141 gcgttgctgc ccgagccggc gtcggtcccg ttgccatcgg gtccggtagt gccgtcggcg 1387201 ccatcggtcg cgtgcgtgac ctgatgggac accgggtttt gcccgttggc gccggccgct 1387261 cctgccgctc cggctccacc cgcccccccg ttgccccata gcccggcgtt gccgccgtgg 1387321 cctccgttgc cgccattgcc cccgatctgg gtggccgccc caccgttgcc gccgagccca 1387381 ccgttgccgt atagccaccc gccgttgccg ccggcaccgc cgttcgcgcc gggcccgccg 1387441 gctcccccgg cgccaccgtt gccgatcaac ccggccgcgc cgccgcgacc gccgggctgg 1387501 cctggggtgc tactcgagcc gccgttgccg ccgttgccgt acaacaagcc accgtcgccc 1387561 ccgttttgtc ccggcccgcc gttggcgcca tcgccgatca gcgggcgccc cagcaacagc 1387621 tgggtgggcc cattgaccac atccagcacc gcttgcatcg gggaggcatt ggcggcctcg 1387681 gccgccgcat acgagccggc cgccgaactc agtgcccgca caaactgctg atgaaatgcc 1387741 gccgcctgtg cgctcagcgc ttgataggcc tgggcgttcc cggaaaacag cgacgcgatg 1387801 gccgctgata cctcgtcggc acccgcggcc agcagccccg tggtcggggc ctcggccgcc 1387861 ctgttagctg cggccagcgc cgagccgatg ccctccaaat cagcggccgc ggccaccaac 1387921 acgtcctgcg ctgcaatcag atactccatc gcggggcctc tctcgcggcg agattgacca 1387981 acgggtcggc acgaagcgtg tcccgttgct tgacggtgca ttgcgtgttt gcctggatcc 1388041 ccgcgccgac ggtgtggatc gggcccagta ccctcaagcc cgtgccaact gcatctgtcg 1388101 cggtgactat cggctcagac acttcggtgt gagaatcacc aggatcctcg cgctgctgct 1388161 tgccgtcctg cttgcagtgt ctggcgtggc tggttgctcg gccgacaccg gcgatcgcca 1388221 cccggagttg gtggtcggat ccacgccgga ctccgaggcg atgctgctgg ccgccatcta 1388281 cgtcgcggcg ctgcggtcgt acggttttgc ggcgcacgcc gaaaccgccg ccgacccggt 1388341 ggcgaaactg gactcgggcg cgttcaccgt cgtacccgct ttcaccggtc agatgttgca 1388401 gaccttgcaa cccgatgcgt cggtgcgctc ggatgcccag gtataccgcg ccatcgtctc 1388461 ggcccttccc gagggcatag ccgcaggcga ctacaccacc gccgcagaag acaaacccgc 1388521 gttggtggtg actcaatcca ccgccaaggc ctggggcggc ggcgatctca gcgagctgcc 1388581 cagccactgc cgcgggttgt tggtcgggcg cgttgccggc gcccacacac ccgcggccgt 1388641 gggaccgtgc cggctgcccg ccccgcgtga gtttcggaat gacgcaacaa tgttcgccgc 1388701 gctgcgggcc ggacagctgg tcgcggcctg gaccaccacc gccgaccccg acatccccgc 1388761 ggacctgatc atgctgaccg acggcaagcc cgcgctgatc cgggccgaga acatcgttcc 1388821 gctgtatcgt cgcaacgcgc tgaccgagcg gaaactgctg gccgtcaacg aggtcgccgg 1388881 cgtgctggac accacggccc tgatcgggat gcgccgccag gtggccgcgg gggccgaccc 1388941 ggcggcggtg gccgccggct ggctcgccga acacccgctg ggacgttgag ccgccacgag 1389001 cgtccgggtc gacgcgatga cacaccgcgt cggccgaaca accttcgggc gcgctttcct 1389061 caccagccgt cagcgcgggc ggggtatcaa ccggccggtg atgatcggaa agatccgctg 1389121 atatccggaa ccggtcagcc ggaccaccag gtccagtacc ttggcgtcga cacccaccaa 1389181 cacccgggcc ttgttcttgg ccacccccgt caggatgatc tgcgcggccc gctgtgggct 1389241 gagatgggcc acccgcttat cgaacgtctc ggccagctcg gcctggtcaa gtccctcggc 1389301 ggcggtggcg ttacgggcga tcgcggtctt gacaccgccg gggtgcaccg tcgtcacctt 1389361 caccgggtga cccgccaacg ccatttcctg gcgcagcgcc tcggtaaagc cgcggacggc 1389421 gaacttggcc gagttgtagg ccgcctgacc cggcgccgaa aacaacccga acacgctgga 1389481 gatgttgatg acgtggccgt ccccggaggc gatcaaatgc ggcaggaacg ccttggtgcc 1389541 gttgaccaca ccccaaaaat cgacgtccat cacccgttcg atgtccttga actggctgac 1389601 ctcgatatcg ccggtaaagg cgatgccggc gttgttgtag atctggttca cagtgccgaa 1389661 gtgctcgttg accgcatcgg cgtaggctag gaaggcttcg cgttcggtta cgtcgagtcg 1389721 gtccgtcttg accggcgtgc tgatcgcctt tagccggtgc tcggtgtctg ccaggccgtc 1389781 ggtgtcgacg tcgctgatgg ccaccttggc gcccgagcgg gccagctcga ttgccagcgc 1389841 ctgcccgatg cccgatcccg cgccggtgac aacggcgacc tttccggcga acccctccat 1389901 gacgtaccct cccttgtctc ggctgccatc aggttagccg gtacccgggg tacggcttaa 1389961 cgtggccggc acgggttcat tcggtagctg gcactgcgac gagcgatgtg gatgatctcg 1390021 actcggtggt ggccgtcgtc gatggcgtag acgacgcggt aatcaccgcg gcgggctgag 1390081 tggaggcctt caaggtcatt gcgcagcggc ttgcccaacc tatgcgggtt gttaagcagc 1390141 ggtccgaaaa caaactcgac acatgcggcg gcgatctttt cgggtaagcg ttgcaggtcg 1390201 cgtgccgctg tcgcggtgat cgccacgtgg tagggatggt cgtcgctcac cgcgcggtgt 1390261 aacggttgcg gatctcgtcg ttgctcacga agcgccctgc ggcaacatcg gcgaggcctt 1390321 cacgaatggc ctcgctggcg ccaggggtgc gtagcacctc cagcgtttcc tcgatggacg 1390381 ccaggtcatc ggccgagatc aataccgccg ccggatgacc gtgccgggtt atcgtgatgc 1390441 gctcgtgtgt cagctcaact tcggcgacgt actcagagag gcgattgcgg acttcgccca 1390501 gtgggacaac agccataacc gcgattgtag ctaaaagtat ggctaaaccc tgtacgccga 1390561 gcatcggctt accgagccga acgcctcgtc gctgtttgat gtctcctcga gcgttcggct 1390621 gagcgaactc agccgaacgc ctcgtcgagg atctcctgct gttcgacggc gtgcaccttc 1390681 gacgagcctg acgacggggc tgacatcgcc cggcgcgaga ttcgcttgat cccggccaac 1390741 ttgtcaggca gcagctcggg tagttcgagc ccgaatcgcg gccacgcacc ctggttggcc 1390801 ggttcctctt ggacccagaa gaactccttg acgttctcgt agcggtccag cgtttcacgc 1390861 agtcgacgcc tgggcagcgg ggcgagctgt tcaagccgca cgatcgcgag gtcattgcgg 1390921 ttgtccttgg ccttgcgggc ggccagctcg taatacagct tgccactggt cagcaggatc 1390981 cggctgacct tgttgcggtc tccgatgccg tcctcatagg tgggttcctc cagcactgag 1391041 cggaacttga tctcggtgaa gtccttgatt tcgctgacgg cggccttgtg acgcaacatc 1391101 gacttgggcg tgaacacgat cagcgggcgt tggatgccgt ccagggcatg ccggcgtagc 1391161 aggtggaagt agttcgacgg agtcgacggc atcgcgatgg tcatcgaacc ttccgcccac 1391221 aactgcaaga agcgttcgat ccgggcagaa gtgtggtcgg gtccctgccc ctcgtgcccg 1391281 tgcggtaaca gcagcacgac gttggacaat tggccccact tggcctcacc ggagctgatg 1391341 aactcgtcga tgatcgactg tgcgccgttg acgaagtcgc cgaactgcgc ctcccagagc 1391401 accacggcgt ccggattgcc cacagtgtag ccgtactcga agccgacggc ggcgtactcc 1391461 gacagtggcg agtcgtagac caggaacttt ccgccggtcg ggctgccgtc ggagttggtc 1391521 gccagcagct gcagtggtgt gaactcctcg ccagtgtggc ggtcgatgag aaccgaatgc 1391581 cgctgggaga aggtgccgcg gcggctgtcc tgccccgaca agcgcaccag cttgccttcg 1391641 gccaccagcg agcccagcgc cagcagctcg ccaaaggccc agtcgatctt gccttcatag 1391701 gccatctccc ggcgcttctc cagcaccggt tggactcgcg ggtgcgcggt gaagccgttc 1391761 ggcaaggcga ggaacgcatc gccgatccgg gccagcagcg acttgtccac cgcagtggcc 1391821 agccccgcgg gaatcatctg gtcggactcg accgactcgc tcggctgcac accgtgcttc 1391881 tccagctcgc gcacttcgtt gaacacccgt tccagctggc cctggtagtc gcgcagcgcg 1391941 tcctcggcct ccttcatcga gatgtcgcca cgtccgatca gggcttcggt gtagcttttg 1392001 cgggccccgc gcttggtgtc gacgacgtcg tacatgtagg ggttggtcat cgacgggtcg 1392061 tcaccctcgt tgtgcccgcg gcggcggtag cacagcatgt cgatgacgac gtccttcttg 1392121 aaccgttgtc ggaagtccac cgccaaccgc gccacccaga cacacgcctc cgggtcgtcg 1392181 ccgttgacgt gaaagatcgg tgccccgatc atctttgcga cgtcggtgca gtactcgctg 1392241 gacctggaat actcgggcgc ggtggtgaag ccgatctggt tgttgacgat gatgtggatg 1392301 gtgccgccga cgcggtagcc cggcagattc gccaggttca gcgtctcggc gaccacaccc 1392361 tgaccggcga acgcggcatc gccatgcaac atcagcggca ccaccgagaa cgcccgttgg 1392421 ccgtcgctgt cgatgcttcc gtggtcgagc agatcctgct tggcccgcac caatccctcc 1392481 agcaccgggt cgacggcctc cagatgcgac gggttggcgg tcagcgacac ctgaatgtcg 1392541 ttgtcgccga acatctgcag gtacagcccg gtggcgccca ggtggtactt gacgtcaccg 1392601 gagccgtgcg cctgcgacgg attcaggttg ccctcgaact cggtgaagat ctgcgagtac 1392661 ggcttgccga cgatgttggc cagcacgttg agccggcccc ggtgcggcat cccgatgacc 1392721 acctcgtcga ggccgtgctc agcgcactgg tcgatcgccg cgtccatcat cgggatcacg 1392781 ctttcggcgc cttccagcga gaaccgcttc tggccgacgt acttggtctg taggaacgtt 1392841 tcaaaggcct cggcggcgtt gagcttgctg aggatgtatt tctgttgggc cacagtgggt 1392901 ttgacgtgct tggtctcgac ccgttgttcg agccactcct tttgttcggg gtcgaggata 1392961 tgggcgtact ccacgccgat gtggcggcag taggcatcgc gcagcaagcc cagcacgtcg 1393021 cgcagtttct tgtactgcgc accggcaaag ccgtcgacct tgaacacccg atcgagatcc 1393081 cacagcgtca ggccgtgggt cagcacttcg aggtcggggt gactgcggaa ccgagctttg 1393141 tccaaccgca gcgggtcggt atcggccatc agatggccgc ggttgcggta ggccgcgatc 1393201 aagttcatga cgcgagcgtt cttgtcgacg atcgagtcgg ggttgtcggt gctccagcgc 1393261 accggcagat atgggatgct cagttcgcgg aagacctcgt cccagaagcc atccgagagc 1393321 agcaactcgt ggatggtgcg caggaagtcg cccgattccg cgccctggat gatgcggtgg 1393381 tcgtaggtgg aggtcaaagt gatcaatttg ccgatgccca gctcggcgat gcgttcctcg 1393441 ctggcgcctt gaaactcggc ggggtattcc atggcgccca cgccgatgat ggcgccctgg 1393501 ccgggcatca gccgcggcac cgaatgcacg gtgccgatgg ttccgggatt ggtcagcgaa 1393561 atcgtcacgc cggcaaagtc ttcagtggtc agcttgccgt cgcgggcccg gcgtacgatg 1393621 tcttcgtagg ccgtgacgaa ctgcgcgaat cgcatggtct cgcaccgctt gatgccggcc 1393681 accaccaggg aacgcttccc gtccttgcct tgcaggtcga tcgccaggcc gagattggtg 1393741 tgcgccggcg tgaccgcggt gggcttgccg tcgacttcgg tgtagtgccg gttcatgttc 1393801 gggaatttct tcaccgcctg caccagggcg tagcccagca aatgcgtgaa cgagatcttg 1393861 ccgccgcggg tccgcttcaa ctggttgttg atgacgatcc ggttgtcgat cagtagcttg 1393921 gccgggaccg cccggacgct ggtcgccgtc ggcacctcca acgacgcgga catgttcttg 1393981 acgacggccg cggcggcgcc gcgcagcacc gctacctcgt caccttcggc tggcggggga 1394041 acggcagttt tggcggccag tgcggcgacc acgccgttgc ccgcggccgc ggtgtcggcc 1394101 ggcttggggg gtgcctgcgg ggcggccgca gcggcccgct cggcaacgag tggcgaggta 1394161 acccgggttg gttcggcagc tggttgggag gtgggttcgg ggctgtagtc aaccaggaac 1394221 tcgtgccagc tgggatcgac cgaggagggg tcgtcgcgga acttgcggta catcgcttcg 1394281 accagccatt cgttttgccc gaatggtgaa cttatgttgg ccacggccgc tgttcgcctc 1394341 gattcttctg ctagttgaag tcctgcaagc gcattgcgcg gcgcctgctg gcagtcggtg 1394401 aacggtctgc cccataaagg ctaacgcttt gccagcgatt cgccagagag accgggcaac 1394461 gcgcgctagc tggcatcccg aacggtcggt agcacgtgca gggtgaccgg ccagcgcgcc 1394521 ggcggggtgc cgaatgccga tcgcgcatta cggacgagct tcttgccgac cagccgattg 1394581 ccgatggcgc cgatgatcgc gccgataccc atcggcacca gcttgccaaa catgagcgcg 1394641 ccgcgtttca gcgcgaatcg tttgacgacg tatttgagca ttcgcgagtt caacgacgat 1394701 atcgccggca gcggcagcga ggccatggtc tccgacaccc agccgccgct ggttcggccc 1394761 ggaccgagca gatcggccac cgcagtagtg ttgtcgccga ccagcaccgc caagaccagg 1394821 gcacggcgcc gttctcggtg gtcgagggga atggcgtgta ccgaggccag cgccagcacg 1394881 aacagcgcgg tggcctcgag gaacacgaca acctctccgg ccgcggcgaa ccatgcggcc 1394941 agggtgccga tccccggtaa ggtcgcggcc gtacctaccg ccgctccact ggccgtcacc 1395001 accgacaaga agcgtttctc gagcttggct acgatcttgg cggggctggc ccccgggtgg 1395061 gcgcgacgca ggcgggccac atacgcctgt gctgccgggc cctgtatccg cgaactccgt 1395121 tcgatgacct gcgccaatgc ccgcgtggac actttgggcc gcccgccggt cccggccagc 1395181 tgcgggtccg gctcagctgc atttgcggat cgattgtcga accttttcca agacctgatt 1395241 cgtcgagcgc tcatcttctc tcctgcgaat ggcgtcccct caggctaatg ccggttcaac 1395301 gatccgagca tgtgtttcgg tagcggcgcg gttcaccgct cgaagcggaa taatgcggcg 1395361 tggacattgg tgacgatacg ggttgccctg gtgcatgccg tgacgcccgt gacccaatgc 1395421 caccgctagc aagccaaacg aggtgcgtgt atgactacgg cgatacgccg ggcggccggg 1395481 agcagctact tccgaaaccc ctggcctgcg ctgtgggcga tgatggttgg cttcttcatg 1395541 atcatgctcg actccaccgt cgtagccatc gcgaatccga ccatcatggc ccagctacgc 1395601 atcggttacg ccaccgtggt ttgggtgacc agcgcctatc tgctggccta cgcggtgcca 1395661 atgctggtgg ccggccggct tggcgaccgg ttcggcccga agaatctcta cctgattggc 1395721 ctgggggtat tcaccgttgc gtcgctgggg tgcggtctgt cgagcggtgc cggcatgctg 1395781 attgccgctc gagtggtgca aggcgtcggc gccggattgc ttaccccgca gacgctgtcg 1395841 acgataacgc ggatcttccc ggctcatcgc cgcggtgtcg cgctgggcgc atggggcacc 1395901 gtcgccagtg tcgccagcct ggtgggaccg ttggccggcg gcgcgctggt cgacagcatg 1395961 gggtgggagt ggattttctt cgtcaacgtt cccgtcggcg tcatcggcct gatcctggcg 1396021 gcctatctga ttccggcact accccaccac ccgcatcggt tcgattggtt cggcgtcgga 1396081 ttgtctggtg cgggaatgtt tctgattgtc ttcggactac agcagggcca gtccgccaat 1396141 tggcagcctt ggatttgggc ggtgatcgtc ggcggtatcg ggtttatgtc gctgttcgtt 1396201 tactggcagg cgcggaacgc ccgcgagccg ctgatcccac tggaggtctt caacgaccgg 1396261 aacttcagct tgtccaacct cgggatagcg atcatcgcct tcgcggggac ggggatgatg 1396321 ctgccggtga cgttttatgc gcaggcggtg tgtgggttgt cgccgaccca cacggccgtg 1396381 ctgttcgcgc cgacggcgat cgtcggtggc gtgctggccc cgttcgtcgg catgatcatt 1396441 gacaggtccc atccgttgtg cgtactgggt ttcggcttct cggtgctggc gatcgcaatg 1396501 acatggctct tatgcgagat ggctccgggc acgcccatct ggcggctggt gttgccgttc 1396561 atcgcgttag gcgttgctgg ggcgttcgtg tggtcgccgc tgaccgtcac cgcgacccgc 1396621 aatctacggc cgcacctggc cggtgcgagc tcaggtgtgt tcaacgccgt ccggcagctg 1396681 ggggctgtgc tggggagcgc gagcatggcc gcgttcatga cgtcgcgcat cgccgccgag 1396741 atgcccggtg gtgtggacgc ccttaccggt cccgccgggc aggacgctac cgtgttgcag 1396801 ctgcccgagt tcgtgcgcga acccttcgcg gccgcgatgt cgcaatcgat gctgttgccc 1396861 gccttcgtcg ccctattcgg gatcgttgcc gcgttgttcc tggttgactt caccggtgct 1396921 gcggttgcca aagagccgtt gcccgaatcc gatggcgacg ctgacgacga cgactatgtc 1396981 gagtacatcc ttcgtcggga accggaagag gattgcgaca cccagccgct gcgggcgtcg 1397041 cgcccggcag cggccgcagc gtcacgcagc ggtgctgggg gtccgctggc ggtcagctgg 1397101 tcgacgtcag cccaaggaat gcccccaggt ccaccaggcc gtcgggcgtg gcaggcagat 1397161 actgagtcaa cagctccgag cgcactataa ccgcggcata ctgtgcccga ctgaccgcga 1397221 cgttgagccg attccggttg agcaggaacg agattccgcg tggaacatcg tcggcggacg 1397281 aggccgtcat cgagatgaag accaccggtg cctgcccgcc ctggaatttg tcgacggtgc 1397341 ctacccgtac tccgtcagcc ccgccaagtc cggcagacgc caaccgccga cggaccagcg 1397401 ccacctgggc gttgtacggc gcgagcacaa gcacatcgga agcggccagt ggccgggtgc 1397461 cgtgctcgtc ggtccacggc gagccgagca gctgccgcag ctcggcgagg atcgcctcgg 1397521 cctcttcggg gctttcgatc gaattgccct tgtggtgcac gccacgcgta tgcacccccg 1397581 ggggataccc gtcgaggcgg cgcacggcgg tgcgctcggt gtgggaacac agcctgccct 1397641 cgtaggacaa cgccgacacg gccgcgcaca ccgccgggtg catccggtac gagcggtcta 1397701 agaagtagcc gcgttcgtcg ggcagcgtgt gttgcccatc taccagccac gacaatgcgg 1397761 aggtgtcgac gggttcggga tgtgtgccct gacttacctg aggcagttgc tgtggatcgc 1397821 caagcagcaa caggtttgtg gccgcgggcg ccacggcgat ggtattggcc aggcagaact 1397881 ggccagcctc gtcgatcacc agcagatcca ggctggcttt cggcacccga ttgccgttgg 1397941 cgaagtccca cgccgtgccg ccgatcacgc atccggcggt gtcgcggatg aattctgtgt 1398001 actggctccc gtcgatcgac tgccagcgcc cagcggtgtg gtcgtgcggc tttttggcga 1398061 cctgccccgg gtccaggcca gcgctgatca caccttccaa caggttctcc accgtggcgt 1398121 gcgactgggc gacaacgcca atacgccagg catgctcggt gaccaactcc gcgatcaccc 1398181 gggccgcggt gtacgtcttg ccggtccccg gagggccgtg caccgccagg tatgacgagt 1398241 ccaagtccag cgccgccgcg gcgatatcgg tgactgggtc actgctgcgg ggcaatgcgg 1398301 cgccgctgcg cgtgcgagga gggcgacgca gcagcacgtc cattagcgcg gtgctgggca 1398361 gttgcggcga tccggaagcc acggcagcgg ccgtcgattc gatcgattcc cgcagggccg 1398421 tcgtcggcac cggcggcccg ggagcgagcg cgaacgggag ctgctgaaat gtgttgccgt 1398481 cactgccggt tcgttcgacg atgaccacct cggtgggcac agtggggtcg tcggtctcaa 1398541 ccactgcggc ggggcccgcg gctcggcgat caggattgtc ggtcatgccc ggcggcgccg 1398601 ggggttcgta gagggcaaac acattcccgt tgaggtcccc acgtgccagt tcaccggtaa 1398661 gccggacccg ccgctgcggc ttgcgcgcgc gaggcggcat atgccagtcg acggtgaccg 1398721 aagcctcgct ggcaaggaag acgtccgtgc tgtccgacca ttcgtcgacg gggtagttga 1398781 gccggtcgaa gtgcgcccac cagaacggct tgtcctcgcg gcgatgatag ccgcgggcag 1398841 cggccagcaa ggcgaccgct gtctgttccg gcgtgcgctc gccggcggcg gcatcgccgg 1398901 tgaacttgga cagtaccgac gccagcgagt caccgtcgtc gatagggtcg gcgtccggaa 1398961 ctggttgagc gccaatgggt gtgacgccgg cttcccaggc gcgcatgagc agccagtcac 1399021 gcagcgcgcg ggtggaccgg cagtcgtagt ggttgtagcc ttcgatctct ttgagcacgg 1399081 ttgccgcctc atcgatgcgg ccggccgcgc gcagttcgca gtaccgggca taggagttga 1399141 tcgagtcggc ggcggtggtg acgtcgccgg agcgtggctg cgtcccgagg tacagcggct 1399201 ccagcgcctt caagctgaac gagtcggtgc ccacccgaat gctcttgcgt accaacgggt 1399261 ataagtccac caggactccg ttgcgcagca agtcgtcgac gtcgtcctcg ccgatgccgt 1399321 agcgtccgac cagccgcagc agcgcggtct tctcgtaggg cgcgtagtgg tagatgtgca 1399381 tgttggggtg gcgccggcgc cgtctggcga ctatcgccag gaaatcggtc agcgcctggc 1399441 gttcggctgt ccggtcatgc gcccacaatg gtcggaatac tcccgcccgt ccggcttcca 1399501 gcaccccgaa caggtattcc aggccccact gtttgccgtc ggcggtccac agcgggtcac 1399561 cctcgaagtc gaagaacagg tcgccggggt ttggctccgg cagcagtgtc agcggccgcg 1399621 ggtcgacgat ctcgaactgt ggtgctcccg tatcgcgttg gcggatttgc agtttggcct 1399681 gtgcggtcag cttgcccagc gcgttcgtgg tcaggccggg aaccggcgcg gtgtgatctg 1399741 ccagttcggc gatcgtggtg atgccggcct caaggagctt gtcgcgctgg cggactcgca 1399801 tccctccgac cagtagcaga tcgtcgctgg cgcgcagccg ctcggtgcac tgcggacagc 1399861 ggaagcacgc ctgcacgcgt tcgtcgtccc agcgcaccgc ggtgcccgcg gtgtagtggc 1399921 cgtccagcaa tcgctgtaaa agcgcacgct gggaccggta gaccgggatg agctcgccga 1399981 cgcggtagcg cacgatcgtg ccgtcgccga gttcgagctc ggcgtcggca gccaccggaa 1400041 cgcccgagtg aaccagcgca tcggcatagg ccgccagctg tagcagcgcg gtcacggttg 1400101 gcgagcgggc gagcttggtg tcggcgaccc ggtaccggtg accgtcgcgg atcaggaagt 1400161 cggcgaaccc gacgaagcgg ccgtcgaaca tggcggcctg atacaccacc ggggcgtggt 1400221 tggcgatggc acgtcgcgtc gcgtcggcgg ctgccgccag cccggcgggc gtgtaggccg 1400281 gccggccaat gatagccacc gcgtcgccga actcgtggcg cagttggtcg agtcggcgtc 1400341 cttcatgcgc gctaccgaga acggcggctc gcgccatcag ttcgtcgtca actgcgacgg 1400401 ccggtccccg gcctagtttc gcgtcgaatt cacggagcag tgcgtactgg caccgggcgg 1400461 cggctgcgag atccgaagca ctgtagacga tgctgtcacc ggtgacgaac acagcagcaa 1400521 ctcctcggtg agacaacgga caggcaaact gggctgcacc cgtcggctta accgccggtg 1400581 gtgttgccga tcagctcgac gccgccgccg ttccagcgga acttgacaac gttgttcaac 1400641 ccgatgccgc tggcatacgt caatgccacc gtgtctcccg tgcactgcga ggtgtcgatg 1400701 ccggtgaacc cataggtatc gggcaccccc tgcggtatgt acttgccgag gtggaacatc 1400761 accgcgcggg tggtcggatt gccggcgttc gtgttggcct tgatgaccac cgccgacagc 1400821 tgggcacact cgttgtagtt gccggccagc ggttctgggt tccagggctg ctcactgcgc 1400881 ggatcgcgag gaagttcgga gacgactttg gcgattgtgg gcgaggcgag gttcaccgca 1400941 cacgggtcga ccggcgcggc gctgtggttg ctgggtgggg cagctgtcgc ggacggcggg 1401001 ctcggttcgc tgctcggcgg ggccgggtga gcagttgaca gggatggggt ggcctccggc 1401061 gtcttagcga ccgtggagtc gcccgaaccg caaccggtca acgtcgcggc gaccaatgca 1401121 gcgaccacgc caacacgcgg cgtggtgggg cagggtggtg accacacacc gggcaccgta 1401181 ccgccatcgg gcccgcgggt gcggtaggcg tggccgggtc accactaaac ttgacggcct 1401241 gatggccttc ccggaatatt cgcctgcggc gtccgctgcg acgtttgctg acctgcagat 1401301 tcatccccgc gtcttgcggg cgatcggcga cgtcggttac gagtcaccga cggctatcca 1401361 ggcggctacg atcccggcgt tgatggcagg ctccgacgtg gtggggctgg cgcagaccgg 1401421 caccggcaag acggcggcat ttgcgattcc gatgctgtcc aagatcgaca tcaccagcaa 1401481 ggtgccccag gcgctggtgc tggtgcccac ccgggagctg gctctgcagg tggccgaggc 1401541 gttcggccgc tacggtgcct atctgtcgca actcaacgtg ctgccgatct acggcggatc 1401601 gtcgtatgcc gtgcaactgg ccggattgag acgcggcgcg caggtggtgg ttggcacccc 1401661 cggtcgtgtg atagaccatc tcgaacgggc gaccttggac ctgtcgcggg tggactttct 1401721 agtgctcgat gaggccgatg agatgctgac catgggtttc gccgacgacg ttgagcgcat 1401781 tctgtccgag acccccgaat acaagcaggt cgccctgttt tccgcgacca tgccgccggc 1401841 gatccgcaaa ctcagcgcca agtatctgca cgatccgttc gaagtcactt gtaaggcgaa 1401901 aaccgctgtg gccgagaata tttcgcagag ctacattcag gtagcacgga agatggacgc 1401961 gctcaccaga gtgctcgaag tcgagccgtt cgaggcgatg atcgtctttg tccgcaccaa 1402021 gcaggcgacc gaggagattg ccgaaaagct gcgtgcccga gggttttccg cggctgccat 1402081 cagcggtgac gtcccgcagg cgcagcggga gcggaccatc acggcgctgc gggacggcga 1402141 catcgatatc ctggtcgcca ccgatgtggc ggcgcgcgga ctcgacgtgg agcggatatc 1402201 acacgtgctt aactacgaca tcccgcacga caccgagtcc tacgtacacc ggatcgggcg 1402261 caccggcagg gccgggcgtt cgggagccgc gctgatattc gtctcgccac gggagcttca 1402321 cctgctcaag gcgatcgaaa aggctacgcg gcaaacgctt accgaggcgc aattgcccac 1402381 cgtcgaggat gtcaacaccc agcgggtggc caagttcgcc gattccatca ccaatgcgct 1402441 gggcggtccg ggaatcgagc tgttccgccg actggtcgag gagtatgaac gcgagcatga 1402501 tgtcccgatg gctgacatcg ccgcggcact ggccgtgcag tgccgcggcg gtgaggcatt 1402561 cctgatggca cccgacccgc cgctttcgcg gcgcaaccgc gaccagcgtc gggaccgtcc 1402621 gcaaaggccc aagcgtagac cggacttgac cacctaccgc gtcgccgtcg gcaagcggca 1402681 caagatcggt ccaggcgcca tcgtcggcgc catcgccaat gagggtgggc tgcaccgcag 1402741 cgacttcggt cagatccgta tcgggccaga cttctcgcta gtagaattgc cggcgaagct 1402801 gccccgcgcg acgctcaaaa agcttgcaca gacccgtatc tcgggtgtgc tgatcgacct 1402861 tcggccatac cggccgcccg acgcggcgcg ccggcataat ggcggcaaac cacggcggaa 1402921 acacgtcgga tgaccctgcc caaggaaaga gccgcccagg gcggactcga gcggatcgcc 1402981 cacgtggacc gggtggcgtc gttgaccggg atccgtgctg ttgccgcatt gctggtcgtc 1403041 ggcactcatg cggcctacac caccggcaag tacacccacg gctattgggg cctgatgtcg 1403101 tcccgcatgg agatcggcgt tccgatcttt ttcgtgctgt cggggttcct gctattccgg 1403161 ccatgggtta agtccgccgc taccggcggc cccccgccgt cgttgagccg ctatgcgtgg 1403221 caccgggtcc ggcggatcat gcccgcctac accgtcaccg ttctgttggc ctacctcgtc 1403281 tatcacttcc gcacggcggg gcccaacccc gggcacacct gggtcgggct gttccgcaac 1403341 ctcaccttga cgcagatcta taccgacggc tatctgggtg cgttcctgca tcagggtctg 1403401 acccaaatgt ggagcctcgc ggtggaggtt gccttctacc tggcgttgcc ggcgttggca 1403461 tacctactgt tggtgctcgt ctgccggcgg cgatggcagc ccaggttgct gttggccacc 1403521 atggcggggc tgacgatgat cagcccggca tggttgatcc tggtgcacaa cacgcactgg 1403581 atgcccgacg gcgctcggct gtggctaccc acctatctgg cttggttcgt cggcggcatg 1403641 atgctggccg tgctggcggc gatgggcgtg cgctgttatg cattcgtggc cataccgttg 1403701 gcggtcatct gctacttcat cgtctccact ccgatcgcgg gcgcgcccac gacgtcgccc 1403761 acagcgctgg ccgaggcgct ggtcaagacc gccttctatg ccgtgatcgc cgtgctggcg 1403821 gtggcaccgc tggccttggg tgaccagggg tggtatgccc agttgctggc cagccggccg 1403881 atggtgtttc ttggtgagat ctcctacgag atcttcctga tccatctggt gaccatggag 1403941 atcgccatgg tggacgtgct cgggtatcgg gtttacacca gttcgatggt gaacctttgt 1404001 ctcgtgacgc tggtgctgac gatcccattg gcgtggttgt tgcaccgttt cactcgggtc 1404061 cagggtgacc ggccttccta gcggcggcag aagcaggtgt cacgatcggg acgacgaact 1404121 ccgcgatcat cgctcgttcg tcggcttcgt cacggccggg gaacatcagc agcgatgtga 1404181 gcatccggac cacccagcgg gcgcgccgtg gccggatgat tgccagcggt ttgccggccg 1404241 aagggtcaaa ggcccggtct tgccggtagc cgtcggtgac ggcggggtcg gtgaccacca 1404301 tcccctcggg cagctcggcc atcaggccag ccagcacatc ggtattcact gagccgatcc 1404361 tacgggccga tcgatgtccg cttggggcgc cagatccagt tcgcgcagcg cgggcagccg 1404421 gatcgcgacc agcccggtgc acacgatggg cagtgccaac gcgagaaacg tggcatgcag 1404481 tccagcggcg tcggtcagtg gaccggccag caacagaccc aacgggccgg cggcgtaggc 1404541 cagcgacgtc atcaccccga ctacccggcc gcgcagatgc tgtgctgccc gcgtctgtat 1404601 cacgtagtta tagatcggct ggatgggtcc gtacaccagg ccgaccaccg cgcacaacac 1404661 catgatgacc ggcagtggcg gcaggaacgc gatgaccatc gatgccaaac ccagggtaag 1404721 aaccgcggtc gacatggtca cgcgacgggg aacgcggata gccaacacgg cataccccag 1404781 cgctcccacc aggccgccgc cggcgatcgc catcaacgcc caacccagct gcaccggttg 1404841 ctggtggtcg gtgaagtatt tcgggaacag cacgctctcc atcggcagat acagcgcggt 1404901 gacggtcagg tcaatcatcc cgagggtgcg caatacccgc aggttccaga cgaagcgcag 1404961 cccctcggcg atcccggata ccaacccttg gggccgcgag gtgtggtgcg gcttgccggc 1405021 accctcgagt tgcagggcgg caatcgcgag gatggacaac ccgaatgccg tcgcggtaat 1405081 ccacattgtg gtgatgccgc caaccgtcgc gatcatcaag ccaccgatgg ccgggccgac 1405141 aataaaggcc aggttgagga tcgcctcgta ggcgccgttg atgcggtcca acgaccagcc 1405201 tgcccgagcg gcggcctcgg gcagcatcga gtcacgagcc gtcatgcctg ccgggccgaa 1405261 ggcggccgcc agggcggcca atacggccag caccagcacg ttgaccgcgt cgccgccgta 1405321 cccccacgcc accaggggga cgccggccac cgccgcaccc gacagcgcat cggccaccat 1405381 cgacacccgg cgacgcccga agtagtcgac cgcggtgccg gcgaccagcg tggcgaacaa 1405441 cagcggcagc atggtcgcac tggccacgat cgaggcctgc ccagcgctgc cctcgcgctg 1405501 caacaccagc cacggaaacg cgactatcga gacgccatca cccgcggccg ccatcagcgt 1405561 tgcgaacagg atcaggaatg ccgggccgcg gttgctgttt ctcatgaata tcgcggctga 1405621 atctagcgcc aaaccggtat gggggccacc gaatttctgc gctgccgcag cccggatgca 1405681 ggatgttcgt gtgctcatgc atccgaagac cggccgggcg ttcaggtccc cggtagagcc 1405741 cggttccggc tggccaggtg atccggcgac accgcagacc ccggtggctg ccgatgccgc 1405801 gcaggtgtca gcgctggccg ggggcgctgg ctcgatctgc gaactcaacg cgctgatcag 1405861 cgtgtgccgg gcgtgtcccc ggctggtcag ttggcgtgag gaggtcgccg tcgtcaagcg 1405921 ccgtgccttc gccgaccagc cctactgggg gcgcccggtg ccggggtggg ggtcgaagcg 1405981 gccgcggttg ctgatcctcg ggctggcgcc cgccgcgcac ggggccaacc ggaccggacg 1406041 aatgttcacc ggcgatcggt cgggagatca gctttatgca gcactgcata gggccggcct 1406101 ggtgaactca ccggtcagcg tcgacgccgc ggacgggctg cgggccaacc ggattcggat 1406161 caccgcaccg gtgcggtgtg cgcccccggg caactcgccg acaccggccg agcggctgac 1406221 atgctcaccc tggctaaatg cggaatggcg gctggtgtcc gatcacatcc gtgcgatcgt 1406281 cgccctcggc gggttcgcct ggcaggtcgc gttgcgcctg gcgggcgcgt cggggacacc 1406341 caagccgcgg ttcggccacg gcgtcgttac cgagctggga gccggtgtgc ggctactggg 1406401 ctgctaccac ccgagccagc agaatatgtt caccggtagg ttgactccta cgatgctcga 1406461 cgacattttc cgtgaggcca agaagctggc cgggattgag tgacgtgaag acggttgtgg 1406521 tttccggcgc cagtgtggcc ggtacggcgg cggcgtactg gcttgggcgg cacggctatt 1406581 cggtaacgat ggtggagcgc catcccgggc tgcgaccagg ggggcaggct attgatgtcc 1406641 gaggtccggc gctggatgtg ttggaacgta tggggttact ggcagccgcc caggaacaca 1406701 agacgaggat tcggggcgcc tccttcgtcg atcgtgacgg caatgagctg ttccgggaca 1406761 ccgaatcgac gcccaccggc ggtccagtca acagtcccga tatcgagctg ctacgtgacg 1406821 atcttgtcga attgctctac ggggcaactc aacccagcgt tgaatacctg ttcgacgaca 1406881 gcatttccac attgcaggac gacggcgact cggtgcgggt gacctttgag cgcgcggcgg 1406941 cccgcgagtt cgacctcgtt atcggtgccg acggactgca ttccaacgtg cgcaggttgg 1407001 ttttcggtcc ggaggagcag tttgtcaagc gattaggaac tcacgcggcg atttttaccg 1407061 tgcccaactt cctggagttg gactactggc agacctggca ttacggtgac tccaccatgg 1407121 ctggcgttta cagtgcgcgc aacaacaccg aagcccgcgc tgcactagcc ttcatggaca 1407181 ccgaactgcg gatcgactac cgcgacaccg aagctcagtt cgccgaactg caacgtcgga 1407241 tggccgagga cggctgggtg cgcgcgcaac tgctgcacta catcgcagcg caccggattt 1407301 ctatttcgac gaaatgtcgc agatcctgat ggatcgctgg tcgcggggca gggtagcgct 1407361 cgttggcgac gctggttatt gctgctcgcc cttgtcgggg caggggacca gcgtcgccct 1407421 gctgggtgcc tacatcctgg ccggcgaact caaggcggcc ggtgacgact accaactcgg 1407481 attcgccaat taccacgccg aatttcacgg ctttgtcgag cgcaaccaat ggttggtcag 1407541 cgacaacatc cccggtggtg cgccgatacc gcaggaggag ttcgaacgaa tcgtgcattc 1407601 catcacgatc aaggactact gagcgccttc acccgggcgc agccaggatg gcgctcgtcg 1407661 gccgcttcac cgaacctgaa gatctgcaga cgaagtacga gtaggggccg gcaaatttac 1407721 cggctcgacg cgcagaagcg ccgagattta gcggcgggtc aatacgacga ccgggattgg 1407781 ccgtgacgtc cggctctggt agttggtgta tcggttggcg ttgttctcgt tgacgatctg 1407841 ccagagccgc gcgtagtccg ggtcgtgggg ctgcaccggt ttcgctgtca caccgaatcg 1407901 cttgggcccg acgttgattt cgacgtccgg gttggccttg aggttgtggt accaacccgg 1407961 cgagcgggga tcgccacctt tggacgccac gatcaggtac gcgtcgccgt cgcgagcata 1408021 ggtgagtgac gtggttcgcg gctggctcgt cttggcgccg gtggtatgca gcagcaaact 1408081 cggtggcgcg ccggggattc ggtgtccgat ccgaccgtta gtgcctcggt agatcgcgtc 1408141 gtgcagcctg agcagctgca cgcctacgtg gcgctcaagc catcgggaaa tgtccatggg 1408201 gtcagtcttg cgcagcggca tcctgttgcg ccagcgcctc ccgcaggatc cgtccggtgg 1408261 cttcccggtc cgggtcgcgg cgcagcatca ttcccttggc gaccgacagc ttgtcgccgt 1408321 tgcgccgcgg taatacgtgc aagtgaacgt ggaacaccgt ctgaaaagcg gcacggccgt 1408381 cgttgatggc gatgtgtgtc gcgtcagcca acttcgtggc gcgggccgcc cgcgcgatgc 1408441 gttggccgat ggcgaccatg tcagccaacg cctccggcgg ggtgtcggtg aggtcaacgg 1408501 tgtgtcgctt gggcagcacc agcgtgtggc cgcgggtgaa cgggcggatg tcgaggatcg 1408561 cgagatagcc gccgtcctcg tagatccgga tggccggagc ctccccggcg atgatcgcac 1408621 agaacacgca gggcatgtcg ctacggtact ggacctctcg gagaccgccc aagtgaacgg 1408681 gatacgctgc cgccgtggac cctactgacc tggccttcgc cggtgccgcg gcacaggcgc 1408741 ggatgctggc tgacggtgca ctcaccgcgc cgatgctgct cgaggtctac ctgcaacgaa 1408801 ttgagcgtct ggacagccac ctgcgcgcct accgggtggt gcagttcgac cgggcgcgtg 1408861 cggaggccga ggccgcccag caacgcctcg acgccggtga gcggctgccg ctcctgggcg 1408921 tgccgatcgc catcaaagat gatgtcgaca tcgccgggga ggtgacgaca tacggcagcg 1408981 ccgggcacgg tccggccgcg acgtccgacg cagaggtggt tcgccggctg cgcgcggcag 1409041 gcgctgtcat catcggcaaa accaacgtgc ctgagttgat gatcatgccc ttcaccgagt 1409101 cgctggcctt cggggccacc cggaatccgt ggtgcctcaa tcgaacccct ggcggcagca 1409161 gcggcggcag cgctgcggcg gtagcggccg ggctggcgcc agtggcactg ggatccgatg 1409221 gtggcggatc gattcgtatc ccgtgtacct ggtgcggtct gtttgggctg aaaccacagc 1409281 gcgatcggat ttccttggag ccgcacgacg gggcctggca ggggctgagc gtcaatggcc 1409341 cgatcgcgcg gtcggtaatg gacgcggcgt tgctactgga cgcgaccaca acggtgcctg 1409401 gtcccgaagg cgagtttgtg gccgcggccg cacgccaacc cggccggctg cgaattgcct 1409461 tgagcaccag ggtgccaacc ccgctgcccg ttaggtgcgg caagcaagaa ctggcagccg 1409521 tccaccaggc aggtgcgttg ctacgtgatc tgggccacga cgtcgtcgtc cgcgatcccg 1409581 actatccggc ttcgacctat gccaactacc tgccccgctt tttccgcggt atcagcgacg 1409641 acgcggacgc gcaggcgcac ccggaccgcc tcgaagcacg tacccgagcc atagcgcgtc 1409701 tagggtcgtt cttctccgac cggcggatgg cggccctgcg ggccgccgag gtggtgctga 1409761 gcagccggat ccagtcgatc ttcgacgatg tcgacgtagt tgtgacgcca ggcgccgcga 1409821 ccggcccgtc ccgcatcggc gcctaccaac gccggggtgc agtttcgacg ttgctgctgg 1409881 tggtgcagcg ggttccgtac tttcaagtct ggaatctgac cggccagccc gcggccgtgg 1409941 tgccgtggga cttcgacggc gacggcctgc ccatgtcggt tcaactcgtc ggccggccgt 1410001 atgacgaggc gacgctgctg gcactggccg cacagatcga atctgccaga ccctgggccc 1410061 atcggcggcc gtcggtgtca tgacattgca gtcgcccgct cgtttttcac gtttttgccc 1410121 ggccgcagga catgtgcggc ggcgttaacg ttgactggtg acagaccacg tgcgcgaggc 1410181 ggacgacgcg aacatcgacg atctgttggg cgacctgggc ggtaccgcgc gcgccgagcg 1410241 tgcgaagctt gtcgagtggt tgctcgagca gggcatcacc cccgacgaga ttcgggcgac 1410301 caacccgccg ttgctgctgg ccacccgcca cctcgtcggc gacgacggca cctacgtatc 1410361 cgcaagggag attagcgaga actatggcgt tgacctcgag ctgctgcagc gggtgcagcg 1410421 cgctgtcggt ctggccagag tggatgatcc tgacgcggtg gtgcacatgc gtgccgacgg 1410481 tgaggcggcc gcacgcgcac agcggttcgt tgagctgggg ctgaatcccg accaagtcgt 1410541 gctggtcgtg cgtgtgctcg ccgagggctt gtcacacgcc gccgaggcca tgcgctacac 1410601 cgcgctggag gccattatgc ggccgggggc taccgagttg gacatcgcga aggggtcgca 1410661 ggcgctggtg agccagatcg tgccgctgct ggggccgatg atccaggaca tgctgttcat 1410721 gcagctgcgg cacatgatgg agacggaggc cgtcaacgcc ggagagtgtg cggccggcaa 1410781 gccgctaccg ggagcgcgac aggtcaccgt tgccttcgcc gacctggtcg gtttcaccca 1410841 gctaggcgaa gtggtgtcgg ccgaagagct agggcacctc gccgggcggc tggccggcct 1410901 cgcgcgtgac ctgaccgctc cgccggtgtg gttcattaag acgatcggcg acgcggtcat 1410961 gttggtctgt cctgatccgg cgccattgct ggacaccgtg ctgaagctgg tcgaggtcgt 1411021 cgacaccgac aacaactttc cccggctgcg agccggcgtc gcctccggga tggcggttag 1411081 ccgggccggc gactggttcg gcagcccggt caacgtggca agccgggtga ccggggtggc 1411141 gcgcccgggt gccgtgctgg tcgcggattc ggtgcgggag gcccttggtg atgcccccga 1411201 agccgacgga tttcagtggt ccttcgccgg cccccgtcgc ctcaggggaa tccggggtga 1411261 cgtcaggctt tttcgagtcc ggcgaggggc cactcgcacc ggctccggcg gcgcggccca 1411321 agacgacgat ttggccggct cgtcaccgta ggcaggcaca ccggtacaca tgggcagacc 1411381 cggcgtgact ctcggggggc gtctgacacc gtcttctgcg ggtcttgcgc ggccggcctt 1411441 caccccgtct tccggcactt tcgattggtc actaaccggg cctgcttcga taccaaaaat 1411501 acaacgtcga atggctgatc acaatggttc tcgccaggcc ggacgctgtt ttcgcgccgg 1411561 ccaggaaccg gtgtcacgtt tcgctgccgg tgaacgcgat gtcattaaag atgaaagtat 1411621 gtaatcatgt aattatgagg caccatcaca tgcacgggcg gcgctacggt cgccccggcg 1411681 gctggcagca agctcagcaa ccagatgcca gtggggcggc ggaatggttc gctggccgcc 1411741 tgcccgagga ctggttcgac ggcgacccca ccgtcatcgt cgaccgtgaa gaaattacgg 1411801 tgattggcaa gctgcctgga ctcgagagcc ccgaggaaga aagtgcggcc cgagcctcgg 1411861 gccgcgtgtc gcgattccgc gacgaaaccc gaccggagcg aatgactatc gccgatgaag 1411921 cccagaatcg ctacggacgc aaggtgtcct ggggcgtcga ggtcggtggt gagcgaatct 1411981 tgttcacgca catcgcagta ccggtgatga cgcggttaaa gcagccggaa cggcaggtgc 1412041 tggacacctt ggtcgacgct ggcgtggctc gttcccgctc ggatgccctc gcgtggtcgg 1412101 tcaagctggt cggcgagcac accgaggagt ggctggccaa gctgcgcacc gccatgtcgg 1412161 cggtggacga tctgcgcgcg caaggcccgg atcttccggc ctaaacggcc accgccgaat 1412221 gcgtcattcc ttgttgactt tgtcaacgat cttggcggcg atctggcctg cttgattggt 1412281 gatctggtac ccgcatgcgt tgacgtcgac gaccacattg ttggccacgc tcatcgcgcg 1412341 ttggcattcc cagccctcag cgccttcttg ggtgtctatc accgtgatcg tcggcgggct 1412401 gcctttgacg tcggcaaacg tccaccggta ggtcttggcc ttattcgtga cggtgaccgt 1412461 cttgcctgcg cagttcttcc atttgtcggc cgaagtctgc acgaacgcgc gggctttgtc 1412521 ggcggtcgga aaggcgacga cggcttggtt cacccaatgt tcgtagttgt cgcccggctc 1412581 ggatgaaatc aagccgttga tggcggtgta gccggtgccg gcatacaccg gatcctggct 1412641 ggtatacagc gcgccctggc agtccggcag ggacaccgtc accggcgaag agtccatcga 1412701 tgtgatcggt ttgcccggct gcatggacga cgagcccatc acggcgttga cttctgagga 1412761 gttcagcagt agggcgctaa ggcgctcctc cgcaaccggc tgaggcggct gtaccggctt 1412821 gggtcggttg gcgatccaga tgccgatggc gcccaacacg aggacgagca cgacggcggc 1412881 ggcgccggcc actaagggcc acgggttggt tttgcgtggg gtctgggccc aggggctggg 1412941 gccgccggac ggcggtgcgc cccagccgcc gccctggtag tactgcgggg tgggagtggg 1413001 tccgctggcc ggcatcgggc cgctattggg cgcccaggac ggctggccgg tcggcgcggc 1413061 ctggatgggc ggcggtgtga cggtgggcat ggtcggcggc tgcgcggtta ccgccgcggt 1413121 gccgggcagg gtggattctt ggctgcggcg caggatgtcg gcggcgtggt cttggtcggg 1413181 gtcgctgagc gcttcgtggg cggccagggc caggtcgccg gcgctggcgt agcggtcttc 1413241 gggctttttg gccatgccgc gggcgaccac ggcgtcaaag gctttgggga tgcccgggcg 1413301 gatggcgctg ggctggggga tgggtcccat caggtgggag ctgaccagtg tgccggcgct 1413361 gtcggcgcga tacggcgggg ccccggtcaa gcattcgtgc agcacgcagg ccagcgcgta 1413421 gatgtcggcg cggtaggtta cctcgtcgtt ggagaaccgt tcgggggcca tgtatttcca 1413481 ggtgcccacc gcggtgccta actgggtcag tttctcgtcg gtggtcgcac tggcgatccc 1413541 gaagtcgacc agataggcaa agtcgtcgcg ggtgatcaga atgttttgcg gtttgacgtc 1413601 gcggtgcatc accccgtcgg cgtgtgcggc atcgagcgcc gaggcgatct gggtgatgat 1413661 ggccaccgcg cgcggtgggg tcagcgggcc gaagcgtttg agcacgctgt caaggtcggt 1413721 gccctccacc aggcgcatct ccaaaaacat ttggccgtcg acttcgccgt agtcgtggat 1413781 gggcaccacg tgaggttcct gcaaccggcc ggcgatgcgg gcttcgcgtt tcatccgctc 1413841 gcgaaacacc gggtccttgc tgaattccgc ggtcatcagc ttgacggcga cggtccactc 1413901 cttgacggtg tgctcggcct cgtagacctc gcccatcccg ccccggccca acagccgttt 1413961 gaggtggtag ggcccaaaca tcgagcccac ccgcgagtcc tgtgcgtcgc tcatcgctga 1414021 tcctcccaac caacccgctg ccgccgacac tatcaacaac gtcaccaggt tccgtgtggc 1414081 ttgtggcaat cgcgcccgcc gaaagctgac tgctggacac cgcgccgagg tttgctagcc 1414141 agcgcgcagc caaggattcg tcaaggatgc ggaaagtgcg ggggcggacg cgttcttagg 1414201 cgctgaaccg aggcttgcgc gttgaacaaa tgaccggctg gtctacgcct ggtcggttca 1414261 cttacgagct ttgcgttgcg gctcgatgcg tttgagctgc agcgcgatca agatcaccag 1414321 cacgagcgcc tgtacggcgc aggcccccgc ggccatcaac cagctaccga cgttgtagtc 1414381 ccacagtgga tcttggtctc caccggcggt ccggcgcagg tcgttgaggt cgacggtggc 1414441 tgcggccatg gcgtaggccc accgcgacgg ggacagccac gacagctgct ctagcggggg 1414501 tcgaccactc acaccgaaca tgcccccgca cagcaccaat tgggccatga ccaccaatac 1414561 cagcagcggc atgccgcggt cggcgttgcc gatcatcgcc gagatcagca ggccgatcat 1414621 catcgagacc acggttacgg cgacgacggc cacggccact tcgacgctgg gccaaggcaa 1414681 tatcaccgat tgatcgggtg ggggcagcaa cgcgacaccg aggaatccca ggatcagcgc 1414741 ttgcaggctg gtcagtgctg taaggaccac caatttggat gccaggtagg cgccgcgtga 1414801 caagccgatg ccgtgttcgc gccgatatat tgctcgttct ttgacgattt cacggatcga 1414861 ggcagcacag cccatgagtg caccgccgat gatcagcaac accagtagtt gcgacggctg 1414921 ggtcgacttc agctcgatgg ccttagccag cgacaacccg gcctggcccg ggacggcatg 1414981 ggcaaacaga ctcaacagca gcggcaggac tagcaagaac accgcatact ggcggtcggc 1415041 ggcgatgacg gccagatacc gtcggcacag gatggcgaat tgagcgaacg cactttgctg 1415101 agcgacgggc ctggcgtgcc gcgcggcggc cggccgcgcg gggcgcatgg cggggtggcc 1415161 gatgagggcc tcccgtaggg gtgaggcgtt gaaccggccg gtccagtcgg tggaggtgtc 1415221 gtgttcgagg agggtgaaca ggtcggcgaa gtcggtgcag ttgaagtagc ccagggcctg 1415281 ttgcggcgga ccaaagtacg cgaggcgacc tccgggggcc aggatgagca gccggtcgca 1415341 catgttgagg tgggcgatgt tgtgggtgac caccaccacc gagcggccgt cgtccgccag 1415401 tttgcgtagg gtctgcatga cggacttttc atagcccggg tccaggccgg atgtcggttc 1415461 gtcgaggaac aacagcgacg gtttggtcag cagttccaac gcgacgctag tccgtttccg 1415521 ctgaccaccc gacaggctgt cgattcgttg atcggcttgg gtggaaaggc cgagttcgac 1415581 cagcacctcc tcgatgcgct ggttgcgttc gtcgacggag acatcctgcg ggaatcgaag 1415641 ccgcgccgcg tagttcagcg ctcgccgcac cgtcagcggg gtgtgcagga tgtcgtcctg 1415701 tggcacgaac ccgatccggt gccgcagctc ggcgtagttg tcatacaggt cgcgctcgtc 1415761 gtagcgcacg gttccgttgc cggccggccg gaacccggtc agcgcgccca gcagtgtcga 1415821 tttcccggca ccgcttggcc ccactaccgc caacaagctg cgttgcggca gaacgaaact 1415881 gacatcggcc agcaacacac gacccttgtt ggtgaccacc cgcagatttg acgcctggta 1415941 ggagatatcg ccggtgtcga cgtattccac gagccgatcg gcggataggt gcagcagctg 1416001 atggccgatg ccgacgatgt cagtcggtcc gatgaccgca cggctgatgc ggtggccgtt 1416061 gacgtaggtt ccgttggcac tggcattgtc gctgagctcc catcggttac cggttcgccg 1416121 caggatggcg tgccggcggg agaccagcaa gtcgttgagg accacggtgt tctcgggtgc 1416181 gcggccgatc gtgacgacca actggtcaat ggcatggaat gcggtcggtg gccgggcgac 1416241 cgtcgtctca ccctgccgtt gagccggttt gggcgttgcc ggccgtggcg gggctggtgg 1416301 gtgcgacgca ggggtgggcg tgggtgactg ggcaaccggg tacagttgca cccgctgccc 1416361 ggaagacgcc gaaccaagaa aaatcgtgat gggctgacgt accgtcaggc gttccacccg 1416421 ctgtccgtcc acgaatgtgc cattggtgct caggttgacc agaacccatc cctcaggtgt 1416481 ggcttccagc actgcgtgtt ggcgcgacac ccgcggattg tccagtcgga tatcggcttc 1416541 actggcgcgg ccaatgctcc actcgcggcc ggcgacggca tgccaggtgc gccccgcggc 1416601 ccgcagttcc agccgcggtg cattgggggt gatcatgtcg gccgtgtctg tcatgcctag 1416661 cctcttaccc gtacttggcc caccagttgt gcagatcctc aatggtcgcg ggatccccga 1416721 agacgtcgct gagcgttagc ttggcctcgt tgctccagat cacgttcggg cgattcttat 1416781 aggtgccgca cgcgatcatg cctgcggtca cgtctggggt ctggttgtaa tgccagccat 1416841 ccggtgatgg tccttcaccg ggacagttca tcagctccac ggcggcgata tcgtcgttga 1416901 aggcctgttt cagcttgtcg ggattggcga acaatccata gatggcgcga cttggcccac 1416961 cctggttggt gttttgcccg cagtcgacca tcgccacggc gttcacccat atgctgttcg 1417021 gcttcggcgt ggtcggttta caggtgccgg tcggatagcc cgacggcaac atgctgagca 1417081 gcctggtctg cgggtcgctg gccggtgctg tggtcggcgt tgtggtcgcg ggtagcgagg 1417141 tcgttgccgt ggtggtgggg gtgcctgggg aggtcgcgat gttccgtttt gggttgtcgt 1417201 ccggtcggtt ggcgatccag atgccgatgg cgcccaacac gaggacgagc acgacggcgg 1417261 cggcgacggc cacaaagggc cacgggttgg ttttgcgtgg ggtctgggcc caggggctgg 1417321 ggccgccgga cggcggtgcg ccccagccgc cgccctggta gtactgcggg gtgggagtgg 1417381 gtccgctggc cggcatcggg ccgctattgg gcgcccagga cggctggccg gtcggcgcgg 1417441 cctggatggg cggcggtgtg acggtgggca tggtcggcgg ctgcgcggtt accgccgcgg 1417501 tgccgggcag ggtggattct tggctgcggc gcaggatgtc ggcggcgtgg tcttggtcgg 1417561 ggtcgctgag cgcttcgtgg gcggccaggg ccaggtcgcc ggcgctggcg tagcggtctt 1417621 cgggcttttt ggccatgccg cgggcgacca cggcgtcaaa ggctttgggg atgcccgggc 1417681 ggatggcgct gggctggggg atgggtccca tcaggtggga gctgaccagt gtgccggcgt 1417741 gtcggcgcga tacggcgggg ccccggtcaa gcattcgtgc agcacgcagg ccagcgcgta 1417801 gactgtcggc gcggtaggtt acctcgtcgt tggagaaccg ttcgggggcc atgtatttcc 1417861 aggtgcccac cgcggtgcct aactgggtca gtttctcgtc ggtggtcgca ctggcgatcc 1417921 cgaagtcgac cagataggca aagtcgtcgc gggtgatcag aatgttttgc ggtttgacgt 1417981 cgcggtgcat caccccgtcg gcgtgtgcgg catcgagcgc cgaggcgatc tgggtgatga 1418041 tggccaccgc gcgcggtggg gtcagcgggc cgaagcgttt gagcacgctg tcaaggtcgg 1418101 tgccctccac caggcgcatc tccaaaaaca tttggccgtc gacttcgccg tagtcgtgga 1418161 tgggcaccac gtgaggttcc tgcaaccggc cggcgatgcg ggcttcgcgt ttcatccgct 1418221 cgcgaaacac cgggtccttg ctgaattccg cggtcatcag cttgacggcg acggtccact 1418281 ccttgacggt gtgctcggcc tcgtagacct cgcccatccc gccccggccc aacagccgtt 1418341 tgaggtggta gggcccaaac atcgagccca cccgcgagtc ctgtgcgtcg ctcatcgctg 1418401 atcctcccaa ccaacccgct gccgccgaca ctatcaacaa cggtcaggta tcacgtcggc 1418461 tgcgatcgcc gggcccagca accttgccag gcaacaatga cgctaggcct tcgccggctc 1418521 gaccgcacga aaatctgcca catcttcgcg ggatgtcggc gactgcggtg gctgtgccat 1418581 tcgctggtac gcgccgctgt tcggctaccg aaaagtgttg tggtaattgg ttaccgcagc 1418641 ccagcgccgg cggccagcgc gcgacgttgc cacgaaaagc tttgtgtagc agtcatatcc 1418701 gtggacatcg gtgttaaggg cttgtgtcca cggatctacg tgccgccatg cgtccccgcg 1418761 ctgatctgga acgtgaattc atggtcacag atgcgaatgt ggtcgccgtc gttcagcgtg 1418821 accgcggagc ggattcgctc gtgctgcaca tgcacgccgt tggacgatcg gaggtcgttg 1418881 atgacgtagt tggtgcccgt gtcgacgatg acggcgtggt ggcggctgac gttggcgctg 1418941 tctaggacga tgtcgttgtc atgcagacgc ccgatccggg tcgccgcggc ttgcagtggg 1419001 tagccgcgac ccgaggcgat gtcgtgcagg taggccaccg cctgctggcc cgacgccatg 1419061 gtgcgctgat cgagcaccgt gacggtgccg gcagcggtgg ttttggcgga cttcttggca 1419121 tccagcggtt gctgacgcag aatccgctcg ttgagagcgc gcaacgtcgg accggggtcg 1419181 atgccgaggt cgtcggccag tgttgtcttc acccggcgat aggcgcccag cgcatcggat 1419241 tgccggtcgg agaggtagta ggcggtgatc agctgtgtcc acagcggctc ccggtagggg 1419301 tgttcgaatg tcagagcctc gagctcggcg atcactgcgc tggcccgccc acacgcgatt 1419361 tcggcctccg ccttggcggt atgggcaaga accttgtctt ctaccagcgc cgtggcaaag 1419421 ggttcgacga actggaagtc gcgcaggtca tcgagcaccg gcccacgcca ttctctcaat 1419481 gcggccgaca ggtggcggct ggcttgttcg aaccggccgg cggcggccgc gtgcacgccc 1419541 gcggtttttt cggcaacaaa ccgccccaga tcgcaagtgt tgtcggggat gctgagccga 1419601 taacccggcg gcgctgcggc caacaccacc cgtgggtcga tcccggcgcc accgaggagc 1419661 ttacgcagat tagacacgta ggagtggata ctcgcgcgtg cgcccgaggg tggccactcc 1419721 tcccagaggg cggtgattag ggcgtcgact cctacgggcc tgttgcggtt gatgaccaac 1419781 atggctagca cagcccgttg cttgggggtg cccgatggca ccggggtgcc gtcgatagtc 1419841 atctgcaatg gtccaagcag gccgaagtcg agccgcttct ccactgtcgc gctaccagcc 1419901 attgcgggtc ctccgtggct tgcggtgcca aggtgccaat agggtgtcgc taccggtcat 1419961 tgtgatacca cgtttcgccg atgcggtaag aacccaggat ctcggcacgc cgtgcgatgt 1420021 accgggtcgg tggcccttga cagcggcatc ggctgtttcc atgcgggtga aatgctggcc 1420081 ctgtaaagat gatcgtgaat gtcccacgcg aatcctgttg gtgctcatcc aaacatgcga 1420141 tcggcgggca gccgacccgg tgttcttgca acgagtggct gcccgctgtg gtgatcgaca 1420201 ttcgagcgcg gttcaggtgg tgacggccat gaagtcgtgg ctggtggccc acgcctcgac 1420261 gaaggtttcc atcgggatct gctcgtcgcg gcccgtgggg gtaccgctgt cgttgaggtg 1420321 aacaatgccg ttttcggtat cgacaccggt caccaccacg gcgtggtcag accgcgggtt 1420381 gccggcactg tcggtttcct cgacgggctg gccccagatc atctcggcgt tgatgctgac 1420441 gatcacggcg tgcccgctgc ccagatactg ctcgagggcg gccatgccgg tggcgactcc 1420501 ggtggctgtg gcgtggtcct cgtcggtgat aacggcgtcg acgccgtaat gcgccagcag 1420561 cgtcggtatg tcggccacgc tggtacccat tcccgagttc gggtgctcgg cgtcggccgg 1420621 ctttgtgtag atggacccgg ggtgcacgac gctgggtgtc gactgggcca ctttgatgat 1420681 ggcgcgctcg gaaggctccc tgccggtcac ttgaccgatc acgtccgcgg ccgacatcag 1420741 gacgcagtcg tcgtatgtct gctggcgcca gtacttggcg gcggctgccg ggtcgccata 1420801 catggtgccc gccgctgcgt cggcggggct ggccaatccc agtgcaacgg caccggcggc 1420861 cagcgcgaag gtggcggtct tgaaggcggt ggcgattttg ctggtcgtca tcgtcggtcc 1420921 ttttctcgtt ccgctatgcg gagtggatgt tgagaaaagg ttccgatggt gacctttttg 1420981 ttatctctag gaattcttgg agtgatctgc agtggtcagc cgaggttcac cggtcgcggg 1421041 caggccgatc tgcgcgggcg cagtcgacag cgttgctacc gggatgcacg gcggtaccga 1421101 cgatcggtcg gctgcctaag cgggcgtgcg ggattagttg caggcccagg tgtcgatgta 1421161 gccgccgccg agcttggtca gggcgtcctt catggcggcg gccaaggtgg gtccaactcc 1421221 tccctggtat gccctatcgt tggcggcgac ggcgccgcag gcggtgaaac tggtgagcac 1421281 cttgcagtcg gagtagccac acgacttgac ggcggtggct tcggcagccg cccgggttgg 1421341 gtagtcccac gatcggcccc acgagccgtt gccggagtag gcaattgcgc catagacatc 1421401 ggcggcattt gctggtgcgg gagccagggt gacggtcgtc gcggcggcag tggcgacgcc 1421461 ggcgacggcc accgcgaacc gtcgccgaag agtaatcatc gtcgtcattg gtgagtcctt 1421521 tccgaatgcc ggcggtgcgg cggtttcaac aagcaattag gacgatggct agaccggttt 1421581 ggtggcggtg acctgcttac cccagtcgga catcgtcaac gtcaccgacg tgtctttggt 1421641 gggagcgatc tggatctgga ccaagtgcga ggatccatcc gaagcgatcc agacggtggt 1421701 gggcaccgtt ttgacgtctt cggaggtcag acgtgagccg gccagcgtcg cgatgtcgtc 1421761 agcagacgag ttcccggtga tcttggtggt cgcgacaccg tccgcctgct ggctgccggc 1421821 aaccgacgcg tccttgaggt tagccaacag gttggccagg cccttgttgg ggtcgaggag 1421881 caccgacacg ttgtagatcg aggtgccgtt gccgaaatcg gtgtaggtgc cgggctggcc 1421941 taggtcggag tacaggtgac cgtcaacata gacgaacttc gcgtcttcgc tcttgttgcc 1422001 gacgagcaat gtcgcgctac cggtggcaac cgtctgcggt gtgttggaga tatcgccttc 1422061 gagcttggtc acccgcaggt ttggcacgtc gcctgtcacc gcaagtctga cgtgcattcc 1422121 ggtgaccttg cgcatcgcat cggtggcctg cttgagtagc atggccgcat cgccgttgga 1422181 tgccgtggcc gcggtgtcag acgctttgcc ggcgtcccct tcggttgagc agccgccgat 1422241 cgccaggacg acggcgagta tggcggtggc ggcggcaaca acggaacaag gtggatgctt 1422301 catcgaaatc tcctcatgtt ggcccacagc ttcgtactgc atagcaatcc cgttgcggca 1422361 gagtcaacag ccgacaccga gtccgagtga gcgccgcacg gcaccgcgag tcgaatcggc 1422421 cgaattgaat ggcgtttcaa acgctttcgt tgtccggcgg caaagcgaat gcggggatcc 1422481 cggttgacgg gatccccgca tcgggtgggc agcggctagg tgagctggct ggcgtattgc 1422541 gggcagtagg ccttggttgc gtcgacgacg aagtaggctg cctgcttagt ggtcaggttg 1422601 gtttggctga ggacctcctc ggcgatctcg gtgccggttt cgccgctggc cagcttcttg 1422661 cagaccagct gggcttgctg ggtggccacc tgcggtgagg agaaggtgac gccaatggac 1422721 tccatctgag caatgaaggc ttcgtctttg gtgttggcgc cggcggtgcc ggcggtggcg 1422781 acggcaagtc cgatggcggc ggcgccgact gcagtggtga acgctgcgat aatgcgaggc 1422841 gataacggcg ataacatggt caagatcctt cgcggtcggg atttccctgg atgacctcag 1422901 cttgcggggg gcgccttggc ggattctcaa caacttcttg gtaacctcgt gggcccgcgt 1422961 cgggctaggc ccgcgtcatc tggtaataga ccccgcgccg ggccaacagc tcggcgtggt 1423021 tgccgcgttc gacgatctgg ccggtctgga ccaccaggat gtggtcggca tcgcgaatcg 1423081 tcgaaagtcg gtgggcgata atgaaactcg tacgatcccg gcgaagctcg cgcatcgctc 1423141 gctggatgag cagctcggtg cgggtatcga ccgagctggt cgcctcgtcc aggatcaaca 1423201 gctgcgggcg ggcaagaaag gcgcgggcga tggtaatgag ttgcttctcg ccgacgctga 1423261 tgctgccgcc gtcgccgctg acccgtgtct ggtagccagc aggcagtgtg ttcacaaacc 1423321 ggtcgacatg ggccgccctg gcggcttcta ctatctcgtc tgtggtggcc tccggccgtc 1423381 cgtaggcgat gttctccgcg atggtcccgt cgtagagcca ggtgtcttgc aacaccatgc 1423441 cgattcgcga tcgcagcgac tgccggctta ccgaggcgat atccaccccg tcgatcagga 1423501 ttcgtccgga accgatctcg tagaaccgca ttagcaggtt caccagcgtg gtcttgccgg 1423561 ctcccgtcgg tccgacgatc gccaccgtgc tacccggttc ggccaccagc gacaggtcgc 1423621 ggatcaccgg cgtgcccggg aggtaagcaa agttcacgtg ctcaaactcg acccgtccgg 1423681 ttaggttcgg cagctccggc tcaggctccg gcgactcctc gggctcgtcg agcacgtcga 1423741 acacccgctc cgcgctggcc accccggact gcagggcgtt gtacatcccg gccagctggc 1423801 tcagcggcat gttgaactgg cggatgtact ggatgaacgc ctggatgctg ccgagcgtga 1423861 tctgcccggt ggctacctgc aggccaccgg ccaccgcgac cgcgacgtag ccgaggttgc 1423921 cgatgaacgc cgtcgccggc tgcacgagac cagagaggaa ctgggcgccg aaaccggcct 1423981 ggtagacgtc gtcattcaac tcgtggaacc gttctcgtgc ggccgcttgg tggccgaacg 1424041 tcttgactac cgtgaacccg ctgtaggtct cttcgagatg ggcgttgagg cgcccggtgc 1424101 tggtccagtg agctacgaat aggggctgtg accgccgggt gatcgcgcgt gtcaccagca 1424161 gcgacagcgg caccgtcagc agtgtgatca gcgccagcag gcccgagatc gacaccatca 1424221 tggccagcac cgccaccatg gtcagaatcg acgtcaccag ctggctgatc gtcattgaca 1424281 gcgacgactg gaggttgtcg atgtcattgg tgacccggct cagcagctca ccgcgctgtt 1424341 gtccgtcgaa gtaggacagc ggcagccggt gcaccttgtc ttcgacatcg gtccgcaacc 1424401 tgaccatcgt tttctgcacg gtgaggttga gcagccgggc ttgtgcccaa atcatcagcg 1424461 ctgcagccag atacagcgcc aacgccagcg ccagtgttcg ctccaccgcg gcgaagtcca 1424521 caccttggcc cggcaccacg ttcatcccgg acagcaggtc ggcgaaggtg ttgtcaccac 1424581 gggcccgagc cgaagcgacg gcctgtgcct tggtgattcc ccccggtagc cctcgcccga 1424641 tcacgccgtt gaacagcaaa tcggtggcat ggccgaggat ccgtggaacg atgacgccga 1424701 tcgtcgtgcc ggcgattccc agtgtgatca ccgcgatgct cagccggcgt tgtggcgcca 1424761 gccgtttcac cagtcgggct gccgatcccc agaagtcgcg ggaccgcatg ttcgggggcg 1424821 ggcttgcggc acgggggcgt gcgcccggtg gcgcggtcac cctacacccc cgaccgtggc 1424881 gctcagcgat tgtgaggcgg cgaattcggc ataggtgggg caatcggcca gcagcgtttc 1424941 gtgggtgccc gtgccgacga tcttaccgtt atcgacaacg atgacctggt cggcctgagc 1425001 ggcattcgaa atccgttgtg taacaacaat gatggttgca tcaccagata cctgtcgcag 1425061 cgatgcgtgg actttggcgt cggtgtgcac gtcaagtgcg gagaacgcgt cgtcgaacac 1425121 atagatggcc ggacgtcgga tgaccgctcg ggctatcgcc agccgttggc gctgcccgcc 1425181 ggagaagttg acaccacctt gggcgacacg cgtctgcagc ccgtctgttt gtacaaagcc 1425241 gtcggccgcg gcgacccgca gcgcctccca catctcctgc tcggtgacta cctggtctgg 1425301 gcccccgccg tagcgcaggt tgtccgcgac ggttccggag aagaggtagc tgcgctgggg 1425361 caccagcccg atcgctgacc agagccgctc ggtgtggtac tcgcggacgt cgataccgtc 1425421 aaccaagacc gcgccagcgg tgacgtcgta gagccggcag atcaacgaca ccagtgtcga 1425481 cttgcccgaa ccggtactgc cgacgatcgc ggtggtggta ccgggccgcg cagtcaacga 1425541 aatgtcctgc agcaccgggc agtcggcgcc aggataggta aaggttgcgc cagccaagcg 1425601 cactacgccc gtgaccccgt ccgtcgggaa cttgggattg tcggggttac cgagtgcggc 1425661 gggcgtggaa agcacctcgg tgatgcgttc ggcgcagacc gacgctcgtg gcagcacggc 1425721 cagcgtcatg gtcgccatca acaccgccat caggatctgg gcgaagtagg acaggaaggc 1425781 gatcagggag ccgacctgca tctggccgct gtcgatgcgt agcccaccga accagatcag 1425841 tgcgacgctg gatgcgttga tggtcagcgt ggtcaccggc agcatcagtg cttgccagtt 1425901 gccggcgctc agtgcggcat tcgacagcgc cgtattggcc tgcgcgaact tgtcgcgttc 1425961 atagccttcg cgggtgaagg cgcggaccac tcgcaccccg gacagctgat cgcgcatcac 1426021 ccggttgatg ccgtcgatca ggctctgcat gcggcggaag agcggcagca tgtgggagat 1426081 gatccagtag tttgctacgg ccagaatcgg aacgctgacc agcagcagcc atgtcagcgc 1426141 ggcctcctgg tggatggcca tgatgattcc gccgacgcac atgatcggtg cggtgaccag 1426201 cacggtggcg gtcatctgga ccaggaacag gatctgccgg acgtcgttgg tgctgcgggt 1426261 caacaacgtc ggagcgccga atcgggcggt ctcgcgttcc gagaaggtga tgatgtgttc 1426321 gaacattgcc gagcgcaggt cacggccgaa acccgccccg gtccgggagc ccagatagac 1426381 tgccccgatc gcgcacagca cctgcaatcc ggtcacccca agcatcaccg cacccagccg 1426441 tacgatggtg gcggtgtcgc ccttggcgac gccgtcgtcg acgattgcgg cgttgaccgt 1426501 cgggaggtat agcgaagcca gggtgctgac cagctgcagc atcatcagca tcgcgaccag 1426561 ccggcggtac ggtcggatgt gctggcgcag cagggccagg agcattgggt aactgtcgca 1426621 cactgcgcat gctgcctacc cgcgccaggc atgagtctta ggccgaaatg cctggttaac 1426681 tggcgtgtcg tggttgaccc gcgggcctgc ggctacagtg catgctgtga tcggcagtgg 1426741 gagaggtagc ggtgcggcgt aaggtgcgga ggttgactct ggcggtgtcg gcgttggtgg 1426801 ctttgttccc ggcggtcgcg gggtgctccg attccggcga caacaaaccg ggagcgacga 1426861 tcccgtcgac accggcaaac gctgagggcc ggcacggacc cttcttcccg caatgtggcg 1426921 gcgtcagcga tcagacggtg accgagctga caagggtgac cgggctggtc aacaccgcca 1426981 agaattcggt gggctgccaa tggctggcgg gcggcggtat cttgggcccg cacttctcct 1427041 tctcctggta ccgcggcagc ccgatcgggc gggaacgcaa gaccgaggag ttgtcgcgcg 1427101 cgagtgtcga ggacatcaac atcgacggcc acagcggttt catcgccatc ggtaacgagc 1427161 ccagtttggg tgactcactg tgtgaagtcg gaatccagtt ctccgacgac ttcatcgaat 1427221 ggtcggtgag tttcagccag aagccgttcc cgccgccgtg cgacatcgcc aaagaactga 1427281 cccgccaatc gattgcgaat tcgaaatgag acgtgtcctg gtcggtgcgg ccgccttgat 1427341 caccgcactg cttgtcttga ccggctgcac gaagtcgatt tcgggtaccg ccgtcaaggc 1427401 gggtggggcc ggtgtcccgc gcaacaataa ctcccaggag cgctacccca acctgctcaa 1427461 ggaatgtgag gtcctgacca ccgacatcct ggccaagacc gtcggtgccg atccgctcga 1427521 catccagagc acgttcgtcg gcgcgatctg ccggtggcag gcggccaacc cggccggtct 1427581 gatcgatatc acccggttct ggttcgagca gggcagtctg agcaatgagc gcaaggtcgc 1427641 cgagggcctg aagtaccagg tcgagacccg cgcgatccag ggcgtggact cgattgtgat 1427701 gcggacgggc gatcccaacg gcgcctgcgg cgtcgccagc gacgcggcgg gagtggtcgg 1427761 ctggtgggtc aatccccagg ctcctggtat cgacgcctgc gggcaggcga tcaagctgat 1427821 ggagctgacg ctggcaacca acgcctagcg ctgggcgagg cgggagcgtg ggcatgagcg 1427881 cgcgcagttg tacggcacta acggcgtgtc ggggtacaga cacgcgcgct cgcgggttcg 1427941 gctgccttca aaaggaagta cgcggctgac ggtttgcgga gcaagagcac ctctaccgtg 1428001 gcacgtgaaa gccgaccagc gcggcacacc ccggttcgac gtctgcccag tgtccggcga 1428061 cgcgtagcac ggcgatcccc gacgtcggga acttctccga gatgcgttcc gcgacagcgg 1428121 cgtcggtgcc gctgatgctg gccaggacga tggcaagggc cgacgtcgtt ggctcgtgcc 1428181 cgaccacaag cagtgaggtg acgttgtcgc caacccggtt gatctcctcg atcactgttc 1428241 cgggtgccgc gccgtagagc cgctcggcgt agcgagcggg tgcgtcgatg ccggtgtgcg 1428301 ccaaggtctg ccgggcgcgc gtagccgtgg agcacagcac ggcatcgacg gccggcaggt 1428361 tggcgcgcag ccagccaccg gccagcccgg cctcccggat accccgcggc gctagcggcc 1428421 ggtcatggtc ggcgatcccg tccgggtacg cagacttcgc gtgtcgcatc agcaccaggt 1428481 tgcggtattg ctcattcact gggctgacgt tagttcagtg acgtgcccgg gatcgctacg 1428541 gttggtcgtc gtcctggtcc ccgccgcgct ccgctggcat gggacagact tcgttgcgat 1428601 cgcctagctc gagccgaggc gtcagccata gggcgctgat aggtagggcg agcattctgt 1428661 gcccaaagga tagggctggc atcgcccggg caagcacggg cggcatgctg ccccgccggt 1428721 gagtccgcgc ccgggacctg ccgggcgagg tcccgcgcct tgtcggtgtg cagacctaca 1428781 ctcgctttgc gttgacagcc acgcactcag gagggatggg atgcgattcc tgcacactgc 1428841 cgactggcag ctcggcatga cgcgtcactt tctcgccggt gacgcccagc cgcgatattc 1428901 tgctgcccgc cgtgacgcag tcgctggact aaaagcgctg gccgccgatg tgggcgccga 1428961 attcgtcgta gtcgccggtg acgtcttcga acacaatcag ctcgcgccac agatagtcgg 1429021 tcaatccttg gaagccatgc gcgtgatcgg ccttccggtc tatctgctgc cgggtaacca 1429081 tgacccgctg gacgcttcgt cggtgtacac cagcacgctg tttcgagccg aacggccgga 1429141 caacgttgtg gtgctcgacc gagctggcgt ccacgaggtc cggccgggag tccagatcgt 1429201 cgcggcgccg tggcggtcca aggcgcccac caccgacccg gttgccgagg tgctggccgg 1429261 cctgcccaca gacgccgcta ttcggctgct cgtcgcccat gggggtgtcg acgcgctgga 1429321 ccccgaccac gacaaaccgt cgctgatcag gctcgccgca ctcgacgacg cgctgactcg 1429381 acaggcgatt cattatgtgg ccctaggtga caaacattcg cttacccagg tcggcagcag 1429441 cgggcgggtc tggtactccg gtgcaccgga agtcaccaac ttcgacgacg tcgaaccgga 1429501 ccccggtcac gtcctagtgg tcgacatcga cgaaagcgac ccgcgacatc ccgtcaccgt 1429561 cgacgcccgt cgcatcggcc gctggcggtt cgttacgttg caccaccagg tcgacaccag 1429621 ccgggacatc gccgacctgg acctgaacct ggatctgatg acggacaagg accgcaccgt 1429681 ggtgcggctg gccctgaccg gttcgctgac ggtcactgac cgcgccgcat tggatacctg 1429741 tctggacaag tacgcgcggt tgttcgcctg gctgggtctg tgggaacgtc acaccgacct 1429801 agcggtgata cccgtcgacg ccgagttcac cgacctcggc atcggggggt tcgccgccgc 1429861 ggccgtcgac gagctagtcg cgaccgcgcg cgggggtgac gacgagtccg ccgtcgatgc 1429921 ccaggcggcg ctggcactgt tgctgcggct cgctgaccgg ggagcggcgt gaagctgcac 1429981 cggctggccc tgaccaatta ccgcggcatc gcacaccgtg acgtcgaatt tcccgatcat 1430041 ggagtggtgg tggtgtgcgg cgccaacgag atcggcaagt cctccatggt cgaggcgctg 1430101 gacctgctgc tcgagtacaa ggaccgctcg acgaagaagg aagtcaagca ggtcaagccg 1430161 accaacgctg atgtcggctc cgaggtcatt gccgaaatca gcagcggccc ttatcgtttc 1430221 gtctaccgca agcgtttcca caagcggtgc gagacggagt tgaccgtgct ggcaccgcgc 1430281 cgcgagcagc tgaccggcga cgaagcgcac gagcgggtcc ggacgatgtt ggccgaaacg 1430341 gtcgacaccg aactgtggca tgcccagcgg gtgctgcagg ccgcctcgac ggccgcggtg 1430401 gatctgtctg gctgcgacgc gctctcgcgt gcgctcgatc tcgccgccgg tgatgacgcc 1430461 gcgctgtcgg gcaccgagtc gctgctcatc gagcggatcg aggccgagta tgcgcgctac 1430521 ttcaccccga ccgggcgccc caccggagaa tggtccgcgg cggtctctag gctggcggcc 1430581 gccgaggccg cggtggccga ctgcgcggcg gcggtagccg aggtcgacga cggggttcgt 1430641 cgccacaccg agctcaccga gcaggtggct gagctgtcgc agcaactact tgctcaccag 1430701 ctgcggctcg aagctgcgcg agtcgccgcc gagaagatcg ccgcaatcac cgacgacgcc 1430761 cgcgaagcca agctgatcgc tactgccgcg gccgcgacca gcggcgcttc caccgccgca 1430821 cacgccggac ggctgggcct gctcaccgaa atcgacacgc gcactgcggc cgtcgttgct 1430881 gcggaggcaa aagcgcggca ggccgcagac gagcaggcga cggcgcgcgc ggaggccgag 1430941 gcctgcgatg ccgcgctcac ggaggcaacc caggtattga cggccgtccg ccttcgcgcc 1431001 gagtcggccc ggcgcaccct cgaccagctc gccgactgcg aggaggccga ccggttggcc 1431061 gcccggctgg ccaggatcga cgacatcgag ggtgatcgcg accgggtctg cgcggagctg 1431121 tccgcggtca cgctgaccga ggagctactg agtcggatcg aacgtgctgc ggcagccgtc 1431181 gatcgcggcg gtgcacagct ggcgtcgatc tccgcggcgg tggagttcac cgccgccgtc 1431241 gacatcgagc tcggcgtcgg cgatcaacgg gtgtcgctgt ccgcgggcca aagctggtcg 1431301 gtcactgcca ccggccccac cgaggtcaag gttcccggcg tcctgaccgc acggatcgtc 1431361 ccgggcgcga ccgcactcga ctttcaagcc aaatatgctg cagcacaaca ggaattggct 1431421 gatgcgctgg cggctggaga ggtcgctgac ctagccgccg cacgctccgc cgatctgtgc 1431481 cgacgcgaac tgctgagccg ccgcgatcag ctgaccgcca ctctggccgg cctgtgtggc 1431541 gatgaacagg tcgaccaact gcgttcccgc ctggaacagt tgtgtgccgg tcaaccggcc 1431601 gagctcgatc tggtttcgac ggataccgct acggcccgcg ctgaattgga tgcggtcgag 1431661 gcggctcgaa tcgccgcgga gaaggactgc gagacccgcc gtcagatcgc tgctggcgcc 1431721 gctcgccggc tcgcggagac atccacgcgg gcaacggttc tacagaacgc agcggccgcc 1431781 gaaagcgccg agctcggtgc ggccatgact cggttggcct gtgagcgggc gtccgtgggc 1431841 gacgatgagc tcgccgccaa ggccgaggcc gacctgcggg tactgcagac ggccgagcag 1431901 cgagtgatcg acctggccga cgagctcgca gctacggcgc cggacgcggt agccgccgag 1431961 ctggccgagg ccgccgacgc cgtcgagttg ctgcgcgaac gtcacgacga ggccattcgc 1432021 gcgttgcacg aggtcggcgt cgaactctcg gtgttcggca cccagggccg caagggcaag 1432081 cttgatgccg ccgaaaccga gcgtgagcac gccgccagcc accacgcgcg ggtcgggcgc 1432141 cgggcccggg ccgccaggct gctccgctcg gtgatggcac gccaccgcga caccacccgg 1432201 ctgcgctacg tcgagccata ccgggcggag ctacatcggc tcggccgccc agtgttcggg 1432261 ccctctttcg aggtcgaggt cgataccgat ttgcgcatcc gcagccgcac cctggacgac 1432321 agaaccgtgc cctacgagtg cttgtcgggc ggggccaaag aacagcttgg catcctggcg 1432381 cgattggccg gcgcggcgct ggtcgccaag gaggacgccg ttccggtgct gatcgacgac 1432441 gcgctggggt tcaccgatcc ggagcgacta gccaagatgg gggaggtctt tgacaccatc 1432501 ggcgccgacg gacaggtgat cgtgctgacg tgcagtccca cccgatacgg cggtgtcaaa 1432561 ggagcgcacc gcatcgatct ggacgccata cagtgagccc gaaacgggga catgcgatgg 1432621 acactcagag cgactacgtc gtggtcggta ccggctcagc cggggcggtt gtggccagcc 1432681 ggcttagcac cgatccggcc acgacggtgg tggccctgga ggcggggccg cgtgacaaga 1432741 acagattcat cggcgtccca gcggcgtttt ccaagctgtt ccgcagcgag atcgactggg 1432801 attacctaac cgaaccgcag ccggagctcg acggccgcga aatctattgg cctcgtggca 1432861 aggtgctcgg tggctcgtcg tccatgaacg caatgatgtg ggtgcgtgga ttcgcatcag 1432921 actacgatga gtgggccgcg cgagccggtc cgcggtggtc gtacgccgac gtgctcggct 1432981 actttcgccg catcgagaac gtcaccgctg cctggcactt tgtcagcggt gacgacagcg 1433041 gagtaaccgg tccgttgcat atttcccggc aacgcagccc aagatcggtg accgcagcgt 1433101 ggctggcagc cgcacgtgag tgcggatttg ccgctgcgcg gccgaattcc cctcgaccgg 1433161 aaggcttttg cgagaccgtc gtcacccagc gccgcggtgc tcgattcagt actgccgacg 1433221 cctatctgaa gcccgcgatg cgccgtaaaa acctccgtgt gcttaccggc gccactgcta 1433281 cccgggtggt catcgacggc gaccgggccg tcggcgtgga ataccaaagc gacggtcaaa 1433341 cccgcatcgt ctacgcccgc cgcgaggtgg tgctctgcgc tggtgccgtc aacagccctc 1433401 agctgctgat gctctccggc atcggcgacc gcgaccacct cgccgaacac gacatcgaca 1433461 ccgtttacca cgcgcccgag gtcgggtgca acctgctcga tcatctcgtc acggtgctgg 1433521 gtttcgacgt cgaaaaggac agcttgtttg ccgccgagaa gcccggccag ttgatcagct 1433581 acttactgcg acgccgcggc atgctcacct ccaacgtcgg cgaggcgtac ggatttgtcc 1433641 gcagccgacc cgaactgaag ctgcccgatt tggagttgat ttttgccccg gcgccgtttt 1433701 acgacgaagc gctggttcca ccggctggtc acggtgtggt attcggcccg attctggtcg 1433761 cgccgcaaag ccgtggccag atcacgctgc ggtccgccga tccgcatgcc aagcctgtca 1433821 tcgaaccgcg ttacctgtcc gatctcggtg gcgtagaccg ggccgccatg atggcgggcc 1433881 tgcggatatg cgcgcggatc gcgcaggccc gcccgctcag agatctcctt gggtccatcg 1433941 cgcgaccgcg caacagcacc gagctggacg aggccactct cgagttggcg ctggccactt 1434001 gttcgcacac cctgtaccac ccgatgggca cctgccgcat gggcagcgac gaggccagcg 1434061 tggtggatcc gcagctgcgg gtccgcggtg tcgacggact ccgcgtcgcc gacgcgtcgg 1434121 tgatgcccag cacggttcgt gggcatacgc atgcgccgtc ggtgctgatc ggggagaagg 1434181 ccgccgactt aatccgcagc tgagctggtc gccgccggct cagcgtcgca tgaacccgat 1434241 ggcggtgtag tccaggtctg ccagacccgt cgcgccgaag ttggccagcg tgctgcggac 1434301 cgcaacggtg ccgggcgact gggtaagcgg caggctgaat ccttcggccc agatcagctc 1434361 gtcgacctgg ttggccaagg ccctcgcctt gccgggatcg agttctgcca gcgttcgctc 1434421 gatcgcggcg tcgatttgcg ggctaccgat cttgccgaag ttgctttccc cgtccgaagc 1434481 gtagatctgg gtgagcgatg acagcggaaa cgcgtcgccc acccagccga actgtgcgat 1434541 gtcgaaagcc cccacgttga cgtagtcgct gaagaaaccg ctgccggact tggcctgaag 1434601 ttcgagtttg acgccgatct gcgccagggt gtgttgggcg atctgggcga actgccgggt 1434661 gctttgtgcg tcgtagaaca gatcgcggat gacgagctgg cgaccgtcct tctcccggaa 1434721 cgcgccgctt cgcctccagc ccagggcgtc cagctcccgt ttcgcttgtt ccgggttgta 1434781 ggcgacaacg ccgctgttgt cctggtagcc gtcttggccg gcgacgaaga cgtggttgtt 1434841 cagtggcacc gggtcgctgg tgaggccgta ttgggcgacc ctggcgatgg tgtatcggtc 1434901 gatgcccttg gcgatcgcca ggcgcagcgc cttgtcggcg aggatcgacc caggcgcacc 1434961 gttgagggtg aagtgatacc agctgggccc gggggcgcgc cggatcgaga tgcccttggt 1435021 gcgcgccgcg atggtcagct ggtccagtgt gccgacgccg gtggcgtcga ttgtgttgtt 1435081 ctgcagcgcc ggcagccggg cggcatcatc gagcaccagg tatgtgatgc tgtccaggcg 1435141 tggccgtgcc ccccaccatc tcgggttacg ggtcaacacg attcgctgcg cggtgcggtc 1435201 cagggcagac acgacgaacg gacccgccga cggaccgggc ccatcgagtt gacccttatt 1435261 gaatgcctcg ggtgtggcgg tcatactggc cggcagcagc atgccgttgc ccgcgaacat 1435321 accgcgccac tccgcgtacg gcttggcgaa cgtcaccacg gcctgccggt cgtcgacccc 1435381 tctggttacc gacgccacac gctcggcgcc gctgctagaa gcgatctcga atgccttgtc 1435441 ggcgccgctg atcgcatgaa tctggctggc gatgtcccgc caggtgatcg gggtcccgtc 1435501 ggaccacacc gcctcgggat tgatggtgta ggtgaccacc tgcggggcgg tcctggtcag 1435561 ctcgatgctg gtgaagtagt tggtgtcgac cgtcgtcgag ccgtccggtc cgatgatgaa 1435621 cgcgcgcggc aaggtggctt tcatcatcgc cgcgacctcg gcgttgttgc cgtcgatgtg 1435681 caagatgttg aagttgggcg gaaagtcggt gagcgacagg cgaagattgc cgccgtcttg 1435741 caacgtggcg ggatcctgct gattgatgtc gctggtggtg ccaaccgcgg ccctgcggtc 1435801 cgcagtgggc gcgagttcga gttgggtacc ggaggccgag catccggtga gcaccatagc 1435861 cacgacgagc ggtgttaata acgcgaaagc ccaatatcga gtctgcgtcc agggtctgga 1435921 tttcccctga aacgacgccc tgagcgcaga cgcgatgccc ggggcgcagc ctcgtcgctg 1435981 gccacggtca gccacgacgg gccggatccg gttgcggtac cgcgcccagc agtcgcctgg 1436041 tgtactcgtg tttcggattg ccgaagacct cctcactgtc gccctgctca acaacggtac 1436101 cggcaagcat gaccgccacc tggtgggcga ggtgtttgac caccgaaaga tcgtgggaaa 1436161 caaataaata tgacaacccg aactgctctt ggaggtcgag cagcaggttg atgatcccgg 1436221 cctgaatgga gacatcgagt gccgacaccg gttcgtcgag tgccaggatc ttgggttgga 1436281 gcgccagtgc ccgcgcgatg ccgatgcgct gcttctgacc gccggagaac tcggcgggat 1436341 aacgactggc gtcgccgtgg cgcagtccga cgatatcgag cagctcggcg acccgcgcgt 1436401 gagtctcgtt cttgccgaac ccattggcct gcaatggttc ggcaatcaga tcgaagaccg 1436461 gcagccgcgg gtctaaggac gccaccgggt cttggaagac cacctggatg tcgcggcgca 1436521 gcgatcggcg ttccgctgtc cccagcgtgg cgacgtcagt gccgaggact tcgatcgatc 1436581 ccgattgcgg cgcagccagc tccaggatct cgtgcagggt ggtcgacttg cccgaaccgg 1436641 attcgccgac gatacccaac gtgcggccct gccggagttc gagactgatg ccgtcgaccg 1436701 cgcggacctc gccgatcgcc cggcgcagca ccacgccctt ggccagccgg taggttttga 1436761 ctagatgacg tacccgcacg accaccgagg cgtcgccgag tgcagccggg cgggcctcgg 1436821 ttttgacccg gtagatgtcg gcggcgctgc gcccggtgac cagctcggtg cggatgcagg 1436881 ccgcccggtg atcggtagcg acgtcaagca attcgggttc cgcggtaagg cattcgtcga 1436941 tgactagcgg gcagcgcggc gcgaacgggc aacccggtgc caagcccgcc agcgacgggg 1437001 gcgcacccgg tatcggcacc agccgggtgc cctgcgcggc atccagccgg gggaccgagc 1437061 ctaaaagccc cacggtgtag ggcatccggc gatcgcggta cagatcattc accccggccg 1437121 actcgacgac ccgtccggcg tacatcacca gcgcccggtc ggcgaactcg gccacgacgc 1437181 cgaggtcgtg ggtgatgatc agcaccccgg cgccggtgac gtcgcgcgcc gccttgagga 1437241 cgtcgaggat ctgcgcctgc accgtgacgt cgagcgccgt ggtcggttcg tcacagatca 1437301 acaggtcggg atcgttggcg atcgcgatgg cgatcaccac gcgttggcgt tcgccacctg 1437361 aaagctcatg cggaaacgca cgggaacgcc gctgcggctg cgaaataccg accaggtcaa 1437421 gcagttccac cgcacgccga cgagcggcct tcttgccaac acggggctgg tgcacctcga 1437481 tggcctcggc gatttggtcg ccgacggtgt agacaggggt gagcgcagac atcggatcct 1437541 ggaacaccgt gccgatcgcc ttgcctcgaa accgggacat cgcgttgtcg gcaagcccca 1437601 acagttcggt accctgtagc cgaaccgaac cacgcacctg cgcgtactcg ggcagcaggc 1437661 ccaccaccgc catcgccgct gcggacttac ctgaacccga ttcgcccacc atcgcgacca 1437721 cctcgccggg ctcgacgcgg tagctgatcc cgcgcaccgc ggtcaccgga tcgccatcgg 1437781 tcctgaaggt gacggccaaa tcggtcacct cgagcagggg gctcatcgca caccacggcg 1437841 cagggatctg ctggctgggt ccagcgcgtc gcgcaggcca tcgccggtca ggttggcgca 1437901 caccagaatc aacaccagga tactggcggg aaacaagaac acccacggga acgcggtcgc 1437961 ggatgcggtg ccgtcggcga tcagggtgcc cagcgacaca tccggcggtt gaataccgaa 1438021 accaaggaag ctcaacccgg tttcggccag gatggcggcg gcaacattga gggcggcgtc 1438081 gatgatcaag atggatgcga cgttgggcac cacatggccg acgatgatcc ggcggctgga 1438141 gacacccata tatcgtgcgg ccctgatgaa ttcgcgttct cgcaagctca tcgtcatccc 1438201 gcgcaccatg cgagagctga tcatccagcc gaagccggcc aacaacaaga caagaaacat 1438261 gatgtttgcc gagttcttgg ttcgcggggt aacgatggcg atcaggatga agctgggcac 1438321 tactagcagc agatcgacca cccacatcag tgtccggtcc cgccagccgc cgaaatatcc 1438381 cgagatcgct ccaaccgtgg cagcgatacc agtcgagatc accgcaacgc aaacaccaat 1438441 cagcatcgac ttctgcatgc cacgcagcgt ctgcgccagc agatcttggc ccagcgcgtt 1438501 agtgcccagc cagtgcttgg tgcccggcgg ctgcagcaat gcgttgaaat caaggtcgtc 1438561 gtaggagtag ggcaatagtg ggggcagcgc ataagcgctg acgaacagca ggagcagcgc 1438621 cgccagcgac gccaccgcgg cccgattgcg taggaacctg cgcaccacta gggtgcgccg 1438681 cgaggcgaat tccgtcatga cacccgtacc ctcgggtcca aagccgcgta gatcacgtcc 1438741 gagagcaaac cggccagcaa cacgaccgcg ccggagaaca cggtaattgc cgcgacgatg 1438801 ttggtgtcct gagtcgagat accgcggacc atccattcac ccatgccgtg ccagccgaag 1438861 atcttctcga cgaaaaccgc tccggtgacc aacccggcca ccccgtaggc gaacagcgtg 1438921 gccatcggta ttagcgccgt tcgcaggcca tgcttgagta gggcccgtcg tcgggtcagc 1438981 cccttggcgc gggcggtgcg aatgaaatcc tggccgagga catccagcat cgcgttgcgc 1439041 tggtagcggc tgaacccggc ggcggccgcc agcgccaacg tcagcgatgg caggatcaaa 1439101 tgctgcaacc ggtcgcctag ccgatcccac accccgccgg caacgccggg tgacgtctcc 1439161 ccggtgtagt cgaaaagctg gatgcccact gcccagttga cccgcagggc gcccaggatc 1439221 aacaggttgg ccaccacaaa cgtcggtgtg ctcaacacca gcagcgccag cgtggtcatg 1439281 acgcggtcgc tgagccggta ctgccggatg gcaccccacg ccccgatcac cacaccggcc 1439341 accgtgccga ataccgatcc aacgaccagc agccgcaggc tgactccgat ccggcgcccc 1439401 agttcggtac cgacaggctg gccggtgatg gtggttccga agtcgccacg gacggcatgc 1439461 gatacccagt tggcgtagcg ggccagtatg ggtctgtcca agccgagatc gtgtgccttg 1439521 gcatcgataa ccgcttgcgg tgggcgcgga ctgcgttgca tcaggctttc cagcggcgag 1439581 aacgccagcg aggtcaggca gtacgtcaaa aacgacgcca gcgccagcag caccaggtag 1439641 ttgagcaacc ggcgggccag atagcgcgtc atgcccaacc accgcgtcgc attgggacag 1439701 ggtagcgagc ccggcgatgg cgtgccgcca gcgcgccggt tgatggggtc acccgtgatc 1439761 cggatggttc cgctcgggcc gattctgatg cgtgaaaact gggtaaccgg ttgttaaaat 1439821 tcaccgcggc gtcgatctga gtagcaaagt ccacaccgcg atacccgagg aggcccgcgt 1439881 gacggttacc gacgactacc tggccaacaa cgtggactac gcgagcggtt tcaagggccc 1439941 gctaccgatg ccgccgagca aacacatcgc aatcgtggcg tgcatggacg ccaggctgga 1440001 cgtctaccgc atgctgggca tcaaggaggg cgaggcacac gtcatccgca acgccggatg 1440061 cgtggtcacc gacgatgtga tccgttcact ggccatcagc cagcggctgc tgggaacccg 1440121 cgaaatcatc ctgctgcacc acaccgactg tgggatgctg actttcaccg acgacgactt 1440181 caagcgcgcc atccaggacg agaccggcat cagacccacg tggtcgcccg agtcgtaccc 1440241 cgacgccgtc gaggacgtcc gtcagtcgct gcgccgcatc gaggtcaacc cgttcgtcac 1440301 caagcacacg tcgctgcgcg gcttcgtctt cgatgtcgcc accggcaaac tcaacgaggt 1440361 cacgccctag cagcccgagc cgtcagccta gggcgcactg gcgcaccggc agcccgccga 1440421 gatggggctg cgttgacagc gatagggaag cctggttgca tagatggcaa taaccataaa 1440481 tatggtcaat cctaccggat ttatcaggta tgaggacgtg gaacaggaag ccatgaccag 1440541 cgatgtgacg gtgggccccg cacccggcca gtaccaactg agccatctgc gcttgctgga 1440601 ggccgaagcc atccacgtca tccgggaggt ggccgccgag ttcgagcggc cagtgctgtt 1440661 gttctcgggg ggcaaggact ccatcgtcat gctgcacctg gcgctgaagg cgtttcggcc 1440721 cgggcgactg ccgttcccgg tcatgcacgt cgacaccggt cacaacttcg acgaagttat 1440781 cgctacccga gacgagttgg tcgccgcggc cggggtgcgg ctggtggtgg cgtcggtgca 1440841 ggacgatatc gatgccggtc gggtcgtcga gaccatcccg tcgcgaaatc cgatacagac 1440901 cgtgacgctg ctgcgggcca tccgggagaa ccaattcgac gcggcattcg ggggagcccg 1440961 gcgcgacgag gagaaggccc gcgccaagga gcgggtgttc agcttccgcg acgagttcgg 1441021 ccagtgggac ccgaaggctc agcggccgga actgtggaac ctctacaacg gacggcacca 1441081 caagggcgag cacatccggg tcttcccgct gtccaactgg accgaattcg acatctggtc 1441141 ctacatcggc gccgagcagg tcaggctgcc gtccatctat ttcgcccacc ggcgcaaggt 1441201 gtttcagcgc gacggcatgt tgctggccgt gcaccggcac atgcaaccgc gagccgacga 1441261 gccggtgttc gaggccacgg tgcgattccg caccgtcggg gatgttacct gcaccgggtg 1441321 cgtcgagtcg tcggcatcga cggtcgcgga agtcatcgcc gaaactgcgg tggcccgctt 1441381 gacggagcgc ggggcgacca gggctgacga ccggatctcg gaggctggaa tggaagaccg 1441441 caagcggcag ggatacttct gatgacgacg ctattgcggc tggcgacagc gggttccgtc 1441501 gacgatggca agtccacgct gattgggcgg ctactctacg actccaaggc tgtgatggaa 1441561 gaccagtggg cgtcggtgga gcaaacgtcc aaggaccggg gccacgacta caccgacctg 1441621 gctctggtca ccgacggcct gcgggccgag cgggaacagg gcatcaccat cgacgttgcc 1441681 taccgctact tcgccactcc caagcggaaa ttcatcattg ccgacacccc gggacacatc 1441741 caatacaccc gcaacatggt gaccggtgcg tccaccgccc aactggtgat cgtactggtg 1441801 gatgcccggc acggcttgct ggagcaatcc cgccggcacg ccttcctggc gtcgctgctg 1441861 ggcatccgcc acctggtgct cgcggtcaac aagatggact tgcttggctg ggaccaagag 1441921 aaattcgacg cgattcgaga cgaattccac gccttcgcgg cccgcctcga cgtgcaggac 1441981 gtcacctcca tcccaatctc cgcgctgcac ggcgacaacg tggtgaccaa atccgaccag 1442041 acgccctggt acgagggacc gtcgctgctg tcgcatctcg aagacgtcta catcgccggt 1442101 gaccgcaaca tggtcgacgt gcgattcccg gtccagtacg tcatccggcc gcacaccctc 1442161 gagcatcaag accaccgcag ctacgcgggc accttggcca gtggggtaat gcgttcaggc 1442221 gacgaagttg tcgtgctgcc gatcggtaag accacccgga tcaccgcgat cgacggcccg 1442281 aacggcccgg tggcagaagc gtttccgccg atggcggttt cggtgcggct cgccgacgac 1442341 atcgatatct cgcgtggtga catgatcgct cgcacccaca accagcccag gatcacacaa 1442401 gaattcgacg cgaccgtgtg ctggatggcc gacaacgcgg tgctagagcc cggccgcgac 1442461 tacgttgtca agcacaccac ccgaaccgtc cgtgcgagga tagccgggct ggattaccgg 1442521 ctcgatgtca acaccctgca tcgcgacaag accgcaacgg cgttgaaact caacgaactg 1442581 ggccgtgttt cgctgcgcac ccaggtgccg ttgctgcttg acgagtacac ccgcaacgct 1442641 agcaccggct cgttcatcct cattgacccc gacaccaacg gaacggtggc ggcgggcatg 1442701 gtgttacgcg acgtctcggc ccgcacgcct agcccgaaca cggtgcggca cagatcgctc 1442761 gtcactgcgc aagatcggcc gcccaggggc aagacggtgt ggtttaccgg actgtccggc 1442821 tccggcaagt cgtcggtggc catgctggtt gagcggaagc tactcgaaaa gggcatctcc 1442881 gcttacgttc tggacggcga caacctacgg catggcctca acgccgacct gggcttttcc 1442941 atggccgacc gcgcggagaa cctgcgccgg ctgtcgcatg tggccacact gctcgccgat 1443001 tgtggccacc tggtgctggt gccggcgatc agcccccttg ctgagcaccg tgccctggct 1443061 cgtaaagtgc acgctgatgc gggaatcgac tttttcgagg tgttctgtga caccccgctg 1443121 caggactgtg agaggcgtga tcccaaaggg ttgtacgcca aagcgcgtgc gggtgagatc 1443181 acgcacttca ccgggatcga cagcccatat cagcggccca agaacccaga cctacggctt 1443241 acgccggatc gcagcataga cgagcaggcg caggaggtta tcgacctgtt ggagtcatcg 1443301 tcttaggccg gcctggttgc tctgctgtcc ctggcaagcg ggtggcacaa tcctgaagca 1443361 tgcggatgtc agctaaggcg gagtacgcgg tgcgggcgat ggtccagctc gccacggccg 1443421 ccagtggcac cgtggtcaag accgacgatc tggctgcggc ccaaggcata ccaccgcagt 1443481 ttctcgtcga tatcctgacc aacctgcgca ccgaccgcct ggtgcgaagc caccgcggtc 1443541 gcgagggtgg ttatgaattg gcgcgtccgg gcaccgagat cagcatcgcc gacgtattgc 1443601 gctgcatcga cggaccgctg gctagtgtcc gcgatatcgg acttggcgac ctgccctact 1443661 cgggccccac taccgcgctg accgacgttt ggcgcgcgct gcgcgccagt atgcggtcgg 1443721 tgctggagga gaccacgctg gctgacgttg ccggtggcgc gctgcccgag cacgtcgccc 1443781 agctcgccga cgactatcgc gcgcaggaga gcacgcggca cggcgcctcg cgccatggtg 1443841 actagccgcc agagccatcg gcagggcctg cctgagccag gtgcaaccga aggagtcaac 1443901 gaatggtcag cacacatgcg gttgtcgcgg gggagacgct gtcggcgttg gcgttgcgct 1443961 tctatggcga cgcggaactg tatcggctga tcgccgccgc cagcgggatc gccgatcccg 1444021 acgtcgtcaa tgtggggcag cggctgatta tgcctgactt cacgcgatac accgttgttg 1444081 ccggggacac gctgtcggca ttggctgcgc gcttctatgg cgacgcctcc ctatatccgc 1444141 ttatcgccgc cgtcaatggc atcgccgatc ctggcgtcat cgacgtcggg caggtactgg 1444201 tcatattcat cgggcgtagc gacgggttcg gcctaaggat cgtggaccgc aacgagaacg 1444261 atccccgcct gtggtactac cggttccaga cctccgcgat cggctggaac cccggagtca 1444321 acgtcctgct tcccgatgac taccgcacca gcggacgcac ctatcccgtc ctctacctgt 1444381 tccacggcgg cggcaccgac caggatttcc gcacgttcga ctttctgggc atccgcgacc 1444441 tgaccgccgg aaagccgatc atcatcgtga tgcccgacgg cgggcacgcg ggctggtatt 1444501 ccaacccggt cagctcgttc gtcggcccac ggaactggga gacattccac atcgcccagc 1444561 tgctcccctg gatcgaggcg aacttccgaa cctacgccga atacgacggc cgcgcggtcg 1444621 ccgggttttc gatgggtggc ttcggcgcgc tgaagtacgc agcaaagtac tacggccact 1444681 tcgcgtcggc gagcagccac tccggaccgg caagtctgcg ccgcgacttc ggcctggtag 1444741 tgcattgggc aaacctgtcc tcggcggtgc tggatctagg cggcggcacg gtttacggcg 1444801 cgccgctctg ggaccaagct agggtcagcg ccgacaaccc ggtcgagcgt atcgacagct 1444861 accgcaacaa gcggatcttc ctggtcgccg gcaccagtcc ggacccggcc aactggttcg 1444921 acagcgtgaa cgagacccag gtgctagccg ggcagaggga gttccgcgaa cgcctcagca 1444981 acgccggcat cccgcatgaa tcgcacgagg tgcctggcgg tcacgtcttc cggcccgaca 1445041 tgttccgtct cgacctcgac ggcatcgtcg cccggctgcg ccccgcgagc atcggggcgg 1445101 ccgcagaacg cgccgattag ccgcaccacg tatacgccgc gggcaggtgg ccgctggccg 1445161 atagcctcat gtgtgtgagc gtgggcgagt cagttgcgca gtcgctgcaa cagtgggatc 1445221 gcaagctgtg ggacgtggcg atgctccacg cgtgcaacgc cgtcgacgag accggcagga 1445281 agcgctatcc cacgctgggc gtcggcactc gattccggac ggcgctacgg gattcactcg 1445341 acatttacgg agtgatggcc acgcctggcg tcgacctgga aaagactcgc ttccctgtcg 1445401 gggtgagatc ggacttgctg ccggataagc gccccgacat cgccgacgtc ctgtatggaa 1445461 ttcaccggtg gttgcacggt catgctgacg aatcctcggt tgaattcgaa gtaagcccgt 1445521 acgtgaacgc cagtgccgca ctccgcattg ccaatgacgg caaaattcag ctgccaaagt 1445581 ccgcaatact gggtttgctg gccgttgccg tgtttgcgcc ggagaacaag ggcgaggtca 1445641 ttcccccgga ctatcagctc agctggtatg accacgtgtt cttcatcagt gtttggtggg 1445701 ggtggcaaga ccatttccgc gaaatcgtca acgtcgaccg ggcatcgctg gtcgccctcg 1445761 acttcggcga cctgtggaat ggctggacgc cagttgggta atcctggtcg cttgtcgccc 1445821 cgccgggctg ggttagattg cccggctcct caacccgccg tttcggcgtg catcgtcgcc 1445881 gggctaggtt agattgcccg gctcctcaac ccgccgtttc ggcgtgcatc gtcgccgggc 1445941 tagccgtctc ggtcagcgga ccggatcgtc gacgccgccg cctgcgcggc ggctacctgg 1446001 ccgaacgtgg acggcggcgg cgctagagtc ccggggcgct cgacgacctc ggtcgcccgc 1446061 gccgcggcac cgagaaccat ggcccggtcg gattcgtccg cgaactcgcg ctgtgctgcc 1446121 cgcacgacca gggcaatttg ggtttgcacc gctacacggc gcgacgggtc gacgcagttc 1446181 tgggcgaccg cgctgagcag ctgcagcagc gcagtgagca ccagcggctc acgcgagccg 1446241 tagcggcgga tctgggcaca tccgacgtgc aggtaggtgg cgaagctggg gtacggcagc 1446301 cagaagagga gctccccggc gcggtcgcgg cgcacgtcgt ccggcagcgc ccgcgatgcc 1446361 agcaccgact ccacggccga aagatggtgc acgacttgga tcgccgtgta cgggtcgttg 1446421 agtgcgggcg atagtgcccg cagcgcgata tccaccatct gccgcaatcc gaagcggatg 1446481 tcctgctgca gggtgcgctc gaatccgatg tgcacatgac gtaagcagcg ttgcgggaag 1446541 tcagaccctg gcgcgcccgg cgcggtgccc ctgcgccagc accagccgag caggcccccg 1446601 gcggtgacgt aatcgccgac gaaggtaacc agcagcgccg tataccggct ggctgccgcc 1446661 aattcggcga tgtcgtcgac gtcgacggtt tgtaggtaac ccgagtgcgg ggccaacagc 1446721 ggcaccgcat cagccggggg gctgggcggt gtctctactt gtcgatccgc cgtatccgat 1446781 tccggataca actggtcaac cagccccagc gtgcgcagcc gcaccttgtc catgatcgtg 1446841 tctatctgga tcgagtgcat gaggtggtgc aggaagtaga tcagcgcggc gatgctgacg 1446901 aatgccagcg cgagtgaccc ggtgaccgcg actttgggaa tgaacgcccc gccgtcgcgg 1446961 tgctccccga cggtgtgtag cccaccggtg ctgtaggcga aggtgcaggc aaagatcgcc 1447021 agcaccacct ggttgggcac atcgcgcagg aaggttcgta gcaaccgcac cgagaactgg 1447081 ctggaggcga tctgtaggga cagcaccgtc agcgagaaga cgatgccgat ggtggtgatc 1447141 atcgtggccg acaccacgat cagcacgcct cgggcgtcgc ctggggtgcc ctgaaacatc 1447201 agcttgtcga tcagcgtgcc ggatttcacg ggaatcatcg acaggaccgc tcccgacccc 1447261 agaccgatcg caacgccgaa tgtcggcagc acccagactg cgccctgtaa gtaatccagt 1447321 atggctttgc gacggttgag catgctggtt gcggtcaccg aataagcatg cacccatccg 1447381 cgagcactag gcggaactac gtaacacttc gatgcggcag tagaagcatt tttccgctct 1447441 cgcttcgccg agcgtgcact catggcgagt ttccggccgt taaccccaag tgatcgctgc 1447501 aacacttggc cagaggtgtt ggcgctgcat gggttatcag aaggggtttc ggggtcgggg 1447561 ggatcgggtg gccgatgggg tgcaggggaa gttctggaag gcgctcgaat cggggttatc 1447621 gccgacggtg tgtcctgctt tcctaccaag gccgactgca ggcggatccg tggcgtgccg 1447681 gtgttcgacg gctatacgcg gatggtcgcc cggctgatgg gatcgctcgc cgtgttgcgg 1447741 tcggtgagca ttccaaaggg ctaccgggac ttcggctttg gcagtctacg tgcggtggcg 1447801 ccgaaaaact gcccggacgt gagtggctga ggcggcccaa tttcggacta ggatttctgg 1447861 ccgctggaag tcactgatga caccgtacgt cacccttgat cgacaagtgc ggatgtgggg 1447921 acccgtccgg ggtccccaca tcgtggtggt cgctgtttag ctcgaggtca cgtactgcgg 1447981 gcagtaggcc gacgcggcgt caacggcgaa cgtcttggcg cccttggcgc tcagaccggt 1448041 cgccttggcc accgccttga tgaccgcttt ggccgagtga ccctcgtcga gggcgtcgca 1448101 gacggcgtgc gcgtccttga tggcgcgcgc tgcgctcggc ggagtgatcc cgtccgcctg 1448161 cagctgcgcg aggaacgctt cgtcggtcga gcttgcgctg gcggtcccgg cgaagccgag 1448221 tgcggccagg cccaaagtgg cggcagtcaa ggtggtgcca accatggagg cggcgaaacg 1448281 gcgagtgaac attgatgatc tccttgtgct gatgtcatcg gaggttgcgc tggtttgcgt 1448341 gccctcagaa tcagcaccgg gccttgacag attctcaata aatccttggc aatatcgata 1448401 ccggttcgac ggtgtcccga cagtgcaagg agaacggtcc gccatggctg tgccggagcg 1448461 cgtcaggcga atgagacaac acggaacgtg cactcggcgc accgggtcgc cagcaacgcg 1448521 gcacgcgggg cgccctggtt cttaccccga cgaatttgag agcgagacca cgaagccaac 1448581 tatgcggccg ccctcgcggg tggcgccgat cacattgttg tagccatgcg tgaggctaga 1448641 tcaacccttg tgcccccggc aggattcgaa cctgcggcct tctgctccgg aggcagacgc 1448701 tctatcccct gagctacggg ggcgcacgac gacacgttgc gccatggggc cccgccagag 1448761 tagcgcatcg cggctaccca ctgaccaccg caacggattc gaagcccaac cacctcagcc 1448821 cataggatgg acgttcgtga cccccgctga cctggctgag ctgctcaaag cgaccgcggc 1448881 cgcggtgctg gccgagcgcg gcctcgatgc ctccgcgttg ccgcagatgg tcacggtgga 1448941 acgcccgcgc attcccgagc acggcgacta tgccagtaac ctggcgatgc agctcgccaa 1449001 gaaagtcggc accaacccgc gtgagctggc cggatggctt gccgaggcac tgacaaaggt 1449061 cgacggtatc gcctcggcgg aggtggccgg gccgggcttt atcaacatgc ggctggaaac 1449121 cgccgcccag gctaaagtcg ttaccagcgt tatcgacgcc ggccacagct acggtcactc 1449181 gctgctgctg gccgggcgca aggtcaacct ggaattcgtc tccgccaacc ccaccggacc 1449241 gatccacatc ggcggtaccc gttgggccgc ggtcggtgac gcgctgggcc gtttgctcac 1449301 cacccagggc gccgacgtgg tccgcgaata ctatttcaac gaccacggcg cccagatcga 1449361 ccgattcgcc aactccctga tcgccgcggc caagggcgaa cccacgcccc aagacggcta 1449421 cgcgggcagc tacatcacca acatcgccga gcaggtgctg cagaaggcgc ctgacgcgct 1449481 gagtctgcca gacgcagagt tgcgcgagac cttccgcgca atcggcgtcg acttgatgtt 1449541 cgaccacatc aaacagtctc tgcacgagtt cggtaccgac ttcgacgtct acacccacga 1449601 agactcgatg cacaccggcg gccgggtcga gaacgccatc gcccgactcc gcgaaaccgg 1449661 caacatctac gagaaggacg gcgcaacctg gttgcgcacc agcgcatttg gtgacgacaa 1449721 ggaccgcgtc gtgatcaaga gcgacggcaa accggcatat atcgccggtg atctcgccta 1449781 ctacttggac aaacgccaac gcggttttga cttgtgcatc tacatgctcg gcgccgacca 1449841 tcacggctac atcgcccggc taaaggccgc ggccgccgcc ttcggtgacg acccggccac 1449901 cgtcgaggtg ctcattgggc agatggtgaa cctggtccgc gacggccaac cggtccggat 1449961 gagcaaacgt gcaggcaccg tgctcaccct cgacgacctg gtcgaggcga tcggcgtgga 1450021 cgccgcacgt tacagcctga tccgctcctc ggtggacacc gcgatcgaca tcgacctggc 1450081 gctatggtcc tcggcgtcga acgaaaaccc ggtctattac gtgcaatacg cgcatgcccg 1450141 gctctcagcg ctggctcgca acgccgccga actcgccctg atcccggata caaaccacct 1450201 cgaactgctt aaccacgaca aggagggcac gctgctgcgc accctcggcg aattcccgag 1450261 ggtgctcgag accgcggcct ccctgcggga accgcaccgg gtctgccgct acctggaaga 1450321 cctggccggc gactatcacc ggttctacga ctcgtgccga gtgttgccgc aaggcgacga 1450381 gcagcccacc gacctgcaca ccgcgcgcct agcgttgtgc caggccaccc gtcaggtcat 1450441 cgccaacggg ctggcgatca tcggcgtcac cgcaccggag cgaatgtgaa cgagctgctg 1450501 cacttagcgc cgaatgtgtg gccgcgcaat actactcgcg atgaagtcgg tgtggtctgc 1450561 atcgcaggaa ttccactgac gcagctcgcc caggagtacg ggaccccgct gttcgtcatc 1450621 gacgaggacg actttcgctc gcgctgccga gaaaccgccg cggcctttgg aagtggggcg 1450681 aacgtgcact atgccgccaa ggcgttcctg tgcagcgaag tagcccggtg gatcagcgaa 1450741 gaagggctct gtctggacgt ttgcaccggt ggggagttgg cggtcgcgct gcacgctagc 1450801 tttccgcccg agcgaattac cttgcacggc aacaacaaat cggtctcaga gttgaccgct 1450861 gcggtcaaag ccggagtcgg ccatattgtc gtcgattcga tgaccgagat cgagcgcctc 1450921 gacgccatcg cgggcgaggc cggaatcgtc caggatgtcc tggtgcgtct caccgtcggt 1450981 gtcgaggcgc acacccacga gttcatctcc accgcgcacg aggaccagaa attcgggtta 1451041 tcggtggcca gcggcgcggc catggcagcg gtgcggcgcg ttttcgccac tgatcacctg 1451101 cgcctggttg ggctacacag ccacatcggt tcgcagatct tcgacgtgga cggcttcgaa 1451161 ctcgccgcgc accgtgtcat cggcctgcta cgcgacgtcg tcggcgagtt cggtcccgaa 1451221 aagacggcac agatcgcgac cgtcgatctc ggtggcggct tgggcatctc gtatttgccg 1451281 tccgacgacc caccgccgat agccgagctc gcggccaagc tgggtaccat cgtgagcgac 1451341 gagtcaacgg ccgtggggct gccgacgccc aagctcgttg tggagcccgg acgcgccatc 1451401 gccggaccgg gcaccatcac gttgtatgag gtcggcaccg ttaaggacgt cgatgtcagc 1451461 gccacagcgc atcgacgtta cgtcagtgtc gacggcggca tgagcgacaa catccgcacc 1451521 gcgctctacg gcgcgcagta tgacgtccgg ctggtgtctc gagtcagcga cgccccgccg 1451581 gtaccggccc gtctggtcgg aaagcactgc gaaagtggcg atatcatcgt gcgggacacc 1451641 tgggtgcccg acgatattcg gcccggcgat ctggttgcgg ttgccgccac cggcgcttac 1451701 tgctattcgc tgtcgagtcg ttacaacatg gtcggccgtc ccgctgtggt agcggtgcac 1451761 gcgggcaacg ctcgcctggt cctgcgtcgg gagacggtcg acgatttgct gagtttggaa 1451821 gtgaggtgac ccgtgcccgg tgacgaaaag ccggtcggcg tagcggtact cggtttgggc 1451881 aacgtcggca gcgaggttgt ccgcatcatc gagaacagcg ccgaggatct cgcggctcgt 1451941 gtcggtgccc cattggtcct gcggggcatc ggcgtgcgcc gcgtgacgac cgatcgcggc 1452001 gtgccgatcg aattgttgac cgacgacatt gaagagctcg tggcccgcga ggatgtcgat 1452061 atcgtggtgg aagtgatggg gccggtggaa ccgtcgcgca aggcgatcct gggcgccctt 1452121 gagcgcggca agtccgtcgt tacggcgaac aaggctttac tcgccacctc caccggcgaa 1452181 ttggcacagg ccgccgaaag cgcccatgtt gatctgtatt tcgaggcggc cgtggcgggc 1452241 gccattccgg tcatccgtcc gctcacccag tcgctggccg gcgacacggt gctgcgagtg 1452301 gccgggatcg tcaacggcac caccaactac atcctctcgg cgatggacag caccggcgct 1452361 gactatgcca gcgccctggc cgacgcaagt gcgctgggct atgcggaggc tgatcccacc 1452421 gcagacgtcg aaggctacga cgccgcggcc aaggcagcga tcctggcatc cattgccttc 1452481 cacacccggg tgaccgcaga cgacgtgtat cgcgaaggca tcaccaaggt cactccggcc 1452541 gacttcggat ccgcgcacgc gctgggttgc accatcaaac tgctgtcgat ctgtgagcgc 1452601 ataaccaccg acgaaggttc gcagcgggta tcggcccgcg tctatccggc cctggtacct 1452661 ctgtcgcatc cgcttgccgc ggtcaacggc gcgttcaatg ccgtggtggt cgaggccgag 1452721 gccgcgggcc ggctgatgtt ctacggccag ggcgcgggcg gcgcgccgac cgcctctgcg 1452781 gtgaccggtg acctagtgat ggccgcccgc aaccgggtac tcggcagccg cggcccccgt 1452841 gagtctaaat acgctcaact tccggtggca ccaatgggtt tcattgaaac gcgctattac 1452901 gtcagcatga acgtcgccga caagccgggc gtcttgtccg cggtggcggc ggaattcgcc 1452961 aaacgcgagg tgagcatcgc cgaggtgcgc caggagggcg ttgtggacga aggtggtcga 1453021 cgggtgggag cccgaatcgt ggtggtcacg cacctcgcca ctgacgccgc actctcggaa 1453081 accgttgatg cactggacga cttggatgtc gtgcagggtg tgtccagcgt gatacgactg 1453141 gaaggaaccg gcttatgacc gtcccgccga cggccactca ccagccgtgg ccgggagtga 1453201 ttgccgcgta ccgtgaccgg ctgccggtgg gtgacgactg gactccggtg accctgctcg 1453261 agggtggtac tcccctcatc gcggcaacta atctctccaa gcagacgggc tgcacgatcc 1453321 acctcaaagt ggagggcctc aaccccaccg gctccttcaa ggatcgtggc atgacgatgg 1453381 cggtcaccga tgcccttgcc catggtcagc gggcggtctt gtgcgcatcg accggaaata 1453441 cctcggcgtc ggcggcggcc tatgccgccc gggccggcat cacctgcgcg gtgctgatac 1453501 cgcagggcaa gatcgcgatg ggcaagctcg cacaggcggt catgcacggc gccaagatca 1453561 tccagatcga cggtaacttc gacgactgcc tggaactggc gcgcaagatg gccgcggact 1453621 tcccgacgat ttcgttggtc aactcggtaa acccggtgcg catcgagggc cagaaaacgg 1453681 cagcgttcga gatcgtcgac gtgctaggta ccgcgccgga cgtgcatgct ctgccggttg 1453741 gcaacgccgg caacatcacc gcgtactgga agggctacac cgagtatcac cagctgggcc 1453801 tgatcgacaa gttgccccgc atgctgggca ctcaggccgc gggcgcggcg cccctggtgc 1453861 tcggcgaacc ggtgagccac ccggagacca tcgcaaccgc gatccgcatc ggctcgccgg 1453921 cgtcgtggac ttcggccgtc gaggcacagc agcagtccaa gggccgcttc ttggccgcct 1453981 ccgacgagga gatactggcc gcatatcacc tggtggctcg tgtcgaaggc gtattcgtgg 1454041 agcccgcgtc cgcagccagc attgcgggtc tcctcaaagc gatcgacgac ggctgggtgg 1454101 cgcgtggttc gacggtggtg tgcacggtaa ccggcaacgg tcttaaggat cccgacaccg 1454161 cgctcaaaga catgccgagc gtgtctccgg ttcccgtgga cccggtagcc gtcgtcgaga 1454221 agctagggct ggcctagtgg cgatcgcaag cgcggcggag ccgggtgcgg cgggtcggca 1454281 cggtttggat tgggtggcga tcgcaagcgc ggcggagccg ggtgcggcgg gtcggcacgg 1454341 tttggattgg gtggcgatcg caagcgcggc ggagccgggt gcggcgggtc ggcacggttt 1454401 ggattgggtg gcgatcgcaa gcgcggcgga gccgggtgcg gcgggtcggc acgcatggtg 1454461 actcaagcat tgttgccttc tgggctggtg gccagtgcgg tggtggcggc gtccagtgca 1454521 aacctgggcc cgggcttcga cagtgtcggt ttggcgctga gtctctacga cgagatcatc 1454581 gtcgagacaa cagattccgg cttgacggtg actgtagacg gcgagggcgg cgaccaggtg 1454641 ccgctgggcc ccgagcacct cgtggtccgc gccgtgcagc acgggttaca ggcagcgggg 1454701 gtcagcgccg ccggcctggc ggtgcgctgc cgcaacgcca tcccgcactc ccgcggcctc 1454761 ggctcctccg cggcagcagt tgtgggcggt cttgcggccg ttaacggtct tgtcgtacaa 1454821 acggattcgt caccatcgag cgatgctgag ctgattcagt tggcttcgga gttcgagggt 1454881 catcccgaca acgcggcggc cgcggttttg ggtggtgccg tggtttcgtg gactgaccac 1454941 agtggtgacc ggcccaacta ttcggccgta tcactgcggc ttcatcccga tatccgcctg 1455001 ttcactgcga ttcccgagca gcgttcgtcg accgcggaaa cgcgggtgct attgcccgcg 1455061 caggttagtc acgacgacgc acggttcaat gtcagtcgcg cggcgctgct ggtggttgcg 1455121 ctcaccgaac ggcccgatct gctgatggcg gccaccgaag atctgcttca tcagccgcaa 1455181 cgtgccgcgg caatgacagc ctccgcggaa tatcttcggc tgttgcggcg tcataacgtg 1455241 gcagcagcac tgtccggggc aggtccttcg ttgatcgccc tgagtacaga ttcagagttg 1455301 ccgaccgacg ccgtggagtt cggagccgca aagggatttg ccgttaccga gctgactgtt 1455361 ggcgaggcgg ttcgctggag cccgacagta agagttcccg gttaatccgc aaggttgcgg 1455421 gggtttgctt gcttccggcc aggaagcggg ctatcctcgg agccgtccag caatcgcagc 1455481 atctgcatac gtactgcctt gccgctagga cagccaccaa ttcttcttgt ggacgaggtt 1455541 cgccgtattc gccgctgatg gcgatcaccg ttgcaaagtc gatgattggc gcactcggcg 1455601 atttggctga ctgcaacaaa accccgtatg acgtgatcag cgggggaagg aaaggaaatc 1455661 cgtgaccgat acggacctca ttacggctgg cgaaagtacc gacggcaagc cgtcggatgc 1455721 cgctgccaca gatcccccag acctcaacgc cgacgagccg gccggctcgc tggccaccat 1455781 ggtgctgccc gaactgcgtg cgctggctaa tcgagccggc gtgaagggaa catcgggtat 1455841 gcggaagaac gaactgatcg ctgcgattga ggagatcagg cgacaggcca acggcgcccc 1455901 agccgttgac cggtcggctc aagagcacga caagggcgac cggccgccca gttccgaggc 1455961 accggccacc cagggggaac agaccccgac cgaacagatc gattcccaaa gccaacaggt 1456021 ccgcccggag cggcgcagcg ccacccgtga agcgggaccc tccggctccg gtgagcgtgc 1456081 gggcacagcc gcagacgaca ccgacaaccg ccaaggcggt caacaggacg ccaagaccga 1456141 ggagcgtggc accgacgcgg gtggcgacca agggggtgac cagcaggctt cgggcggtca 1456201 gcaggcgcgc ggcgacgagg acggagaagc gcgtcagggc cggcgcggac gccggttccg 1456261 cgatcggcgg cgccgcggtg aacgatccgg cgacggcgcc gaggctgaac tgcgtgagga 1456321 cgacgtcgtc cagccggtag ccggcatact cgacgtcctg gacaactacg cgtttgtgcg 1456381 cacctccggc tacctacccg gtccgcacga cgtgtatgtg tcgatgaaca tggtgcgcaa 1456441 gaacggcatg cgccgtggtg atgcggtgac cggtgcggtg cgggtgccca aggaagggga 1456501 gcaacccaac cagcggcaga agttcaaccc gctggtccgc ctggacagca tcaacggcgg 1456561 atcggtcgaa gacgccaaga agcggcccga gttcggcaaa ctgacgccgt tgtaccccaa 1456621 ccagcggctt cgtctggaaa ccagtaccga gcggctgacc acccgggtca tcgacctcat 1456681 catgccgatc ggcaagggtc aacgcgcgtt gattgtgtcg ccgcccaaag cgggcaagac 1456741 aacgatcctg caggacatcg ccaacgcgat caccaggaac aacccggaat gccacctcat 1456801 ggtcgtgctc gtcgacgagc ggcctgagga ggtcaccgat atgcagcgct cggtcaaagg 1456861 cgaggtcatc gcttcaactt tcgaccggcc gccgtcggac cacacgtcgg tcgccgagct 1456921 ggcgatcgaa cgcgccaagc ggctggtgga gcaaggcaag gacgtcgtgg tgctgctcga 1456981 ttcaatcacc cggctaggcc gcgcttacaa caacgcgtcg ccggcgtcgg gccggatcct 1457041 gtccggtggt gtcgattcca cggcgttgta cccgcccaag cgcttcctgg gggccgcgcg 1457101 caacatcgaa gagggcgggt cgctgaccat catcgccact gcgatggtcg agaccgggtc 1457161 cactggtgac acggtcattt tcgaggagtt caagggcacc ggcaacgccg agctcaagct 1457221 ggaccgcaag atcgccgagc ggcgggtttt ccctgcggtc gacgtgaatc cttctggaac 1457281 ccgcaaggac gagctactgc tgtcgcccga cgagttcgct attgtgcaca agctgcgccg 1457341 cgtgctatcg ggcctggatt cccaccaggc catcgacctg ctgatgtcgc agctgcgtaa 1457401 gacgaagaac aactacgaat tccttgttca ggtgtccaag accacgccag ggtccatgga 1457461 cagcgactga tccggcgaga cggctcgccg ggaatgtccg cacgcatctc ggtgtttggg 1457521 gtgatagcgg ttgacctggc ataatcgatg ctcaacgagt tggaaccgga ccaggttctc 1457581 ggcacgccac gacgggcggc caccgatcac agagggcagc atgaaatctg acattcatcc 1457641 ggcatatgag gagaccaccg tggtctgcgg atgcggcaat accttccaga cgcgtagcac 1457701 caagccggga ggtcgtattg tggttgaggt ttgttcgcag tgtcatccgt tctacaccgg 1457761 caagcagaag atcctcgaca gcggcggccg ggtggctcgc ttcgagaagc ggtacggcaa 1457821 gcgcaaggtc ggagctgaca aggcggtttc aaccggcaaa tagctggctt accgacgccc 1457881 gaactgtgca ccagcggtac aggacgggcg tcggttcgcg ttagggtccg cgctcgcggg 1457941 aagaaggttg acatgacgca gccagtgcag acgattgacg tgttgctcgc cgaacacgcc 1458001 gagctcgagc ttgcgctggc agatcccgcg ctgcacagca atccggccga ggcgcgcaga 1458061 gtcgggcgcc ggtttgcccg attggccccg atcgtcgcaa cccaccgcaa gctgacgtcc 1458121 gcgcgcgacg acctcgagac cgcgcgcgag ctggtggctt ccgacgagtc gttcgccgcc 1458181 gaggttgccg cattggaggc tcgggtgggc gaactggatg cccaactcac tgacatgttg 1458241 gcaccgcgtg acccgcacga tgccgatgac attgtgctgg aagtcaaatc cggcgagggg 1458301 ggcgaagaat ccgcgttgtt cgccgccgat ttggccagga tgtatatccg ctacgccgag 1458361 cggcacggct gggcggtgac ggtgttggac gagaccacct cggatctggg tgggtacaag 1458421 gacgcgacgt tggcgattgc cagcaaagcc gacacccccg acggggtgtg gtcgcgcatg 1458481 aagttcgagg gcggggtgca ccgcgtacaa cgggtcccag tgacggaatc ccaaggccgc 1458541 gtgcatactt cggcggcggg tgtgctggtc tatccggagc ccgaggaagt cggccaagtg 1458601 cagatcgacg agtcggatct gcgtatcgac gttttccggt cgtccggcaa gggcgggcag 1458661 ggagtgaata ccaccgactc cgcggtgcgt atcacccatc tgcccactgg aatcgtcgtc 1458721 acctgtcaga acgaacggtc gcagctgcag aacaagacgc gtgcgttgca ggtgctggcc 1458781 gctcggttgc aggcaatggc cgaggagcag gcgctggccg acgcgtcggc cgaccgggct 1458841 agccaaatcc gcactgtgga ccgtagtgaa cgcattcgca cctacaactt cccggagaac 1458901 cggatcaccg accaccggat cggttacaag tcacacaatc tcgatcaggt gctggatggc 1458961 gatcttgacg cgttgttcga cgctctgtcc gccgcggaca agcaatcccg gttgcgacaa 1459021 tcatgacctc cgcgccggcg acgatgcggt gggggaacct cccgcttgcg ggggagagcg 1459081 gcacaatgac cctgcgtcag gcgatcgact tggctgctgc gctattggcc gaagcggggg 1459141 tcgactcggc gcgttgcgac gctgagcagt tggccgctca cctagcgggc acagaccgcg 1459201 gtaggctacc cctgttcgag ccgcccggcg acgagttctt cgggcgctat cgcgacatcg 1459261 tcaccgctcg tgcgcggcgg gtgccgttgc agcatctcat cgggactgtg tcgtttgggc 1459321 ccgtggtgct gcatgtcggc ccgggtgtgt ttgtaccgcg tccggagacc gaagccattt 1459381 tggcctgggc caccgcgcag tcgctgccgg cgcggccgct gattgtcgac gcatgcacgg 1459441 gatctggcgc gttggcggtc gcattggccc agcaccgggc caaccttgga ctaaaggccc 1459501 gcatcatcgg cattgacgac tccgactgcg cccttgacta tgcccgccgc aatgcggcgg 1459561 gtaccccggt agagttggtg cgtgccgacg tcaccacgcc ctgcctgctc cccgaactcg 1459621 acggacaagt cgacctgatg gtttccaacc cgccctacat ccctgatgct gctgttttgg 1459681 aacctgaagt agcgcaacat gacccgcatc acgcgttgtt cggcggtccc gacgggatga 1459741 cggtgatatc cgcggtcgtc gggcttgctg ggcgctggct gcgtcccggt ggcctgttcg 1459801 ccgtcgaaca cgacgacacc acgtcgtcgt caactgtcga tttggtcagc agcacaaaac 1459861 ttttcgtgga cgtacaagcc cggaaagatc tggccggacg gccgaggttt gtgacggcga 1459921 tgaggtgggg gcacctcccg cttgcagggg agaacggcgc cattgacccg cgccagcgac 1459981 gatgcagagc gaagcgatga ggagaagcgg cgccattgac tgagacgttc gactgcgccg 1460041 accccgagca gcgttcgcgt ggaatcgtct ctgcggtagg ggcaatcaag gcgggccaac 1460101 tggtggtgat gcctacggac acggtgtatg ggatcggcgc cgacgccttc gacagctccg 1460161 cggtggccgc gttgctgtcg gcaaaggggc ggggtcgcga tatgccggta ggtgtgctgg 1460221 tcggctcttg gcacacgatc gaggggctgg tctactctat gcccgacggt gcccgcgaac 1460281 tgattcgcgc attctggccc ggcgcgctca gcctggtggt cgtgcaagcg ccgtcgctgc 1460341 aatgggatct tggcgatgcc catggcaccg tgatgctgcg aatgccgctg cacccggtcg 1460401 ccatcgagtt gttgcgtgag gtgggtccga tggcggtatc cagcgccaac atctcgggcc 1460461 acccaccccc ggtcgacgcc gaacaggcac gctctcaact cggcgaccac gtcgcggtct 1460521 atctcgacgc ggggccatcc gaacagcagg ccggctccac gatcgtcgat ctgaccggag 1460581 ccaccccacg cgtcctgcgg caggggccgg tcagcaccga gcggatcgcc gaggtacttg 1460641 gtgtggacgc ggccagcttg ttcggctagc cgccgaacgt gcacgcactg cgaagattcg 1460701 gccaattgtt cgcagctgtt gcacgttcgg cgagtgttca gctctcaggt tggtgcagta 1460761 cggtctcgag gtgtccagcg atgtggccgg cgttgccggt ggcttgctcg ccctgtccta 1460821 tcgcggcgcc ggtgtcccgc tgcgtgagct tgcgctggtc gggctgaccg cggcgatcat 1460881 cacctatttt gcgaccggtc cggtgcggat gctggccagt cgcctgggag ccgtcgccta 1460941 cccgcgggag cgagatgtgc acgtcacgcc tacccctcgg atgggtgggt tggcgatgtt 1461001 cctgggcatt gtcggcgccg tctttcttgc ctcccagctt ccggcactca cccgggggtt 1461061 cgtctattcc accggcatgc ccgcggtgct ggtggccggt gcggtgatca tgggcatcgg 1461121 cctgatcgat gatcgttggg gtctggatgc actgacgaag ttcgccggcc agatcacggc 1461181 ggcgagcgtt ctggtcacca tgggtgtcgc ctggagtgtc ctgtacatcc cggtgggtgg 1461241 tgtgggcacc atcgtcttgg accaggcttc ctcgatcctg cttaccctgg cgctgaccgt 1461301 ttcgatcgtc aacgcgatga actttgtcga cggtctcgac gggctggccg ccggcctggg 1461361 cctgataacg gcgctggcaa tctgcatgtt ctcggtgggt ttgcttcgtg accacggtgg 1461421 tgacgttttg tactacccgc cggcggtgat ttcggtggtc ctggccgggg cctgcctggg 1461481 ctttctgcca cacaacttcc accgggccaa gatcttcatg ggcgattccg ggtcgatgct 1461541 gatcggcctg atgctggccg ccgcttccac caccgcggcc gggccgatct cgcagaacgc 1461601 ctacggcgct cgtgatgtat ttgctttgct gtcgccgttc ctgctggtgg tggcggtcat 1461661 gtttgtgcca atgctcgacc tgctgctagc gatcgtccgt cgcacccgcg ccggccgcag 1461721 cgcgtttagc ccggacaaaa tgcacctgca tcaccggctg ctgcagatcg gtcattccca 1461781 tcggcgcgtg gtcctgatca tctacctgtg ggtgggcatc ggtgccttcg gcgccgcgag 1461841 ctcgatcttc tttaacccgc gcgacaccgc ggcggtgatg ctgggcgcga tcgtggtcgc 1461901 cggcgtcgcg acactgatcc ccctgttgcg ccgcggcgac gactactacg acccggacct 1461961 ggactagccc ggagccgaga actacgacaa ggagtagtag tggtgtctac cttgtggtac 1462021 ggtgcggcta gaaccccgaa ggagacctcg cgggttgccg gcccccggcc catcggatgc 1462081 gtatccggtc gcgccgattc acgaccgaca tagggagcta ccccttgggt gattccggtg 1462141 cgacgactgc gatacgctcg gcgggccacc gatcagtcga tcgggtggtt tccgctccat 1462201 cagcccggaa ttgaggtgcc gcagtgacga caccagcgca ggacgcgccg ttggtgtttc 1462261 cctctgttgc tttccgtccg gttcgccttt ttttcatcaa cgttggactg gccgcagtgg 1462321 cgatgttggt cgccggcgtg ttcggtcacc tgacggtcgg gatgttcttg ggtctcgggt 1462381 tgctgctggg tttgctcaat gccctgctgg tgcggcgttc ggccgagtcg atcaccgcca 1462441 aagagcaccc gttaaaacgg tcgatggccc tcaactcggc atcgcgactg gcgattatca 1462501 ccatcctcgg gctgatcatc gcctacattt tccggcccgc tggattgggc gtcgtgttcg 1462561 ggctggcatt cttccaggtg ctgctggtgg caacgacggc cctgccggtc ctgaagaagc 1462621 tgcgcactgc gaccgaggaa ccggtcgcaa cttattcttc caatggccag accgggggat 1462681 cggaaggaag gagcgccagc gatgactgag accatcctgg ccgcccaaat cgaggtcggc 1462741 gagcaccaca cggccacctg gctcggtatg acggtcaaca ccgacaccgt gttgtcgacg 1462801 gcgatcgccg ggttgatcgt gatcgcgttg gccttttacc tgcgcgccaa agtgacttcg 1462861 acggatgtgc caggcggggt gcagttgttt tttgaggcga tcaccattca gatgcgcaat 1462921 caggtcgaaa gcgccatcgg gatgcggatc gcacccttcg tgctgccgct ggcggtgacc 1462981 atcttcgtgt tcatcctgat ctccaactgg ctggcagtcc tcccggtgca gtacaccgat 1463041 aaacacgggc acaccaccga gttgctcaaa tcggcagcag cggacatcaa ttacgtgctg 1463101 gcgctggcgc ttttcgtgtt cgtttgctac cacacggccg gtatttggcg gcgcggtatt 1463161 gtcggacacc cgatcaagtt gctgaaaggg cacgtgacgc tcctcgcgcc gatcaacctt 1463221 gtcgaagaag tcgccaagcc aatctcgttg tcgctccgac ttttcggcaa cattttcgcc 1463281 ggcggcattc tggtcgcact gatcgcgctc tttcccccct acatcatgtg ggcgcccaat 1463341 gcgatctgga aagcatttga cctgttcgtc ggcgcaatcc aggccttcat ttttgcgctg 1463401 ctgacaattt tgtacttcag ccaagcgatg gagctcgaag aggaacacca ctagtaccgg 1463461 atgctggtaa cggctaccag agccatcaag gaggataagg aaatggaccc cactatcgct 1463521 gccggcgccc tcatcggcgg tggactgatc atggccggtg gcgccatcgg cgccggtatc 1463581 ggtgacggtg tcgccggtaa cgcgcttatc tccggtgtcg cccggcaacc cgaggcgcaa 1463641 gggcggctgt tcacaccgtt cttcatcacc gtcggtttgg ttgaggcggc atacttcatc 1463701 aacctggcgt ttatggcgct gttcgtcttc gctacacccg tcaagtaatt cgacggcaaa 1463761 tggttgcaat aggtagcaat gggtgaagtg agcgcgattg tcctggccgc cagtcaggcg 1463821 gcagaggaag gcggcgagtc cagcaacttc ctcattccca acggcacgtt tttcgttgtg 1463881 ctggccatct tcctggtggt gctcgctgtc attggcactt tcgtggtgcc gccgatcttg 1463941 aaggtcttgc gggaacgtga cgctatggtc gccaaaacgc tggccgacaa caagaagtcg 1464001 gacgagcagt tcgccgccgc acaggccgat tacgacgaag ccatgacgga agcccgagtc 1464061 caggcgtcgt ccttgcgcga caatgcccgg gcagatggcc gtaaagtcat cgaggacgca 1464121 cgcgtccggg ccgaacaaca ggtggcatcg acgttgcaga ccgcccatga gcaattgaag 1464181 cgggagaggg acgccgtgga actcgatctg cgtgcccacg tgggcaccat gtcggcgact 1464241 ctggccagtc gaattctcgg tgttgacctc accgcttcag ccgcgacgag gtaaccacga 1464301 atgtcgacgt ttatcggaca gctgttcggg ttcgcggtca tcgtttatct ggtgtggcga 1464361 tttatcgtgc cgctcgtagg gcgtttgatg tccgcacggc aggacacggt gcgccaacag 1464421 ctggcggatg cggcggcggc cgccgaccgg ctggcggagg cgagtcaagc tcacacgaag 1464481 gcgctggaag acgccaagtc ggaagcgcac cgtgttgtgg aagaggccag gacagatgcc 1464541 gaacgcatcg cagaacaact agaggcccag gccgacgtcg aggcggagcg catcaaaatg 1464601 cagggtgccc gtcaggtcga cctcatccgg gcacagctga cccgtcagct tcgcctcgag 1464661 ctcggtcacg aatcggtccg ccaggcaagg gaattggtac gcaatcacgt ggccgatcag 1464721 gcacaacaat cggccaccgt cgaccgcttc ctggatcagc tcgatgcgat ggcgccggct 1464781 acggccgatg tcgattaccc actgctggcc aagatgcgct cagccagccg gagggcatta 1464841 accagcctgg tggattggtt cggcaccatg gcccaggacc tcgaccatca aggtctgacc 1464901 accctcgccg gcgagctggt gtcggtagca agactgctgg accgcgaggc cgtcgtcacc 1464961 cgctatctca ccgtgccagc cgaagatgcg acgcccagga tccggctgat cgaacggctg 1465021 gtgtccggca aggtcggcgc gccaacgctc gaggtgttgc gcacagccgt atcgaagcgc 1465081 tggtcggcca attccgattt gatcgatgcg atcgaacacg tgtcgcggca ggcgctgtta 1465141 gaactcgccg aacgtgcggg tcaggtcgac gaggtggaag accagttatt ccggttttcc 1465201 cgcattctcg acgtgcagcc ccggcttgcc atcctgttgg gtgactgtgc cgttccggcc 1465261 gaaggccgag tccggttgct gcgcaaggtg cttgagcgtg ccgacagtac cgtcaacccg 1465321 gtcgtggtcg cgctgttgtc tcacaccgtc gagctgctgc ggggtcaggc agttgaggaa 1465381 gcggtgctgt tcctggccga agttgcggtg gctcgccgcg gcgaaatcgt cgcgcaggtc 1465441 ggcgcggcgg ccgagctcag cgatgctcag cgcactcgcc tcaccgaagt gctgagccgt 1465501 atctacggtc accccgtgac cgtgcagctg catatcgacg ccgcgctgct gggcggattg 1465561 tccatcgcgg tcggtgacga agtgatcgac ggtacgctct cgtctcgtct agctgcggcc 1465621 gaggcacgac tgcccgactg aacccgaact agtcagcaca aaccgaagta ggaagacgaa 1465681 aagctatggc tgagttgaca atccccgctg atgacatcca gagcgcaatc gaagagtacg 1465741 taagctcttt caccgccgac accagtagag aggaagtcgg taccgtcgtc gatgccgggg 1465801 acggcatcgc acacgtcgag ggtttgccat cggtgatgac ccaagagctg ctcgaattcc 1465861 cgggcggaat cctcggcgtc gccctcaacc tcgacgagca cagcgtcggc gcggtgatcc 1465921 tcggtgactt cgagaacatc gaagaaggtc agcaggtcaa gcgcaccggc gaagtcttat 1465981 cggttccggt tggcgacggg tttttggggc gggtggttaa cccgctcggc cagccgatcg 1466041 acgggcgcgg agacgtcgac tccgatactc ggcgcgcgct ggagctccag gcgccctcgg 1466101 tggtgcaccg gcaaggcgtg aaggagccgt tgcagaccgg gatcaaggcg attgacgcga 1466161 tgaccccgat cggccgcggc cagcgccagc tgatcatcgg cgaccgcaag accggcaaaa 1466221 ccgccgtctg cgtcgacacc atcctcaacc agcggcagaa ctgggagtcc ggtgatccca 1466281 agaagcaggt gcgctgtgta tacgtggcca tcgggcagaa gggaactacc atcgccgcgg 1466341 tacgccgcac actggaagag ggcggtgcga tggactacac caccatcgtc gcggccgcgg 1466401 cgtcggagtc cgccggtttc aaatggcttg cgccgtacac cggttcggcg atcgcccagc 1466461 actggatgta cgagggcaag catgtgctga tcatcttcga cgacctgact aagcaggccg 1466521 aggcataccg ggcgatctcg ctgctgctgc gccgtccgcc cggccgtgag gcctaccccg 1466581 gcgatgtgtt ctatctgcat tcgcggcttt tggagcgctg cgccaaactg tccgacgatc 1466641 tcggtggcgg ctcgctaacg ggtctgccga tcatcgagac caaggccaac gacatctcgg 1466701 cctacatccc gaccaacgtc atctcgatca ccgacgggca atgtttcctg gaaaccgacc 1466761 tgttcaacca gggcgtccgg ccggccatca acgtcggtgt gtcggtgtcc cgagtcggcg 1466821 gcgcggcgca gatcaaggct atgaaagagg tcgccggaag cctccgcttg gacctttcgc 1466881 aataccgcga gctagaagct ttcgccgctt tcgcttctga tttggacgcc gcatcgaagg 1466941 cgcagttgga gcgcggcgcc cggctggtcg agctgctcaa gcagccgcaa tcccagccca 1467001 tgcccgttga ggagcaagtg gtttcgatct tcctgggcac cggcggtcac ctggactcgg 1467061 tgcccgtcga ggacgtccgg cggttcgaaa ccgaattact ggaccacatg cgggcctccg 1467121 aagaagagat tttgactgag atccgggaca gccaaaagct caccgaggag gccgccgaca 1467181 agctcaccga ggtcatcaag aacttcaaga agggcttcgc ggccaccggt ggcggctctg 1467241 tggtgcccga cgaacatgtc gaggccctcg acgaggataa gctcgccaag gaagccgtga 1467301 aggtcaaaaa gccggcgccg aagaagaaga aatagctaac catggctgcc acacttcgcg 1467361 aactacgcgg gcggatccgc tcggcagggt cgatcaaaaa gatcaccaag gcccaggagc 1467421 tgattgcgac atcgcgcatc gccagggcgc aggctcggct cgagtccgct cggccctacg 1467481 cttttgagat cacccggatg cttaccaccc tggccgctga agccgcactg gaccatccgt 1467541 tgctcgtcga gcgcccggag ccgaaacgag ccggcgtgct ggtggtgtcg tccgatcgtg 1467601 gtttgtgcgg cgcatacaac gccaatattt tccgtcgctc cgaggagctg ttctccctgc 1467661 tgagggaggc cggaaagcag ccggtgctgt atgtggtggg ccgtaaggcg cagaactact 1467721 acagttttcg gaactggaac atcaccgagt cgtggatggg tttctccgag caacccacgt 1467781 acgagaacgc cgccgagatc gcttcgacct tagtggatgc gttcctgctc ggcaccgaca 1467841 acggcgagga tcaacggtcc gacagcggcg agggcgtcga cgaactgcac atcgtttaca 1467901 ccgagttcaa gtcgatgctg tcgcaatcgg cggaggctca ccggatcgcc cccatggtgg 1467961 tggagtacgt cgaggaagac atcggaccgc gcacgctgta ctcgttcgag cccgacgcga 1468021 cgatgctgtt cgagtcattg ttgccgcgct acctgactac ccgggtgtac gcggcgctgc 1468081 tggagtccgc ggcgtcggag cttgcctcgc ggcaacgtgc gatgaagtcg gccaccgaca 1468141 acgccgatga cctcatcaag gccctgacgc tgatggcaaa ccgcgagcgg caggcccaga 1468201 tcacccagga gattagtgaa atcgtcggtg gcgcaaatgc gctcgccgaa gcccgctagg 1468261 cccaagctag gttagcccca cgaggaagcg aagaagatat gactaccact gccgaaaaga 1468321 ccgaccggcc gggaaagccg ggaagctccg acaccagcgg ccgcgtggta cgggtcactg 1468381 ggcccgtcgt cgacgtcgag tttcctcgcg gttccatccc cgagctgttc aatgcactgc 1468441 acgctgagat caccttcgag tcgctggcga aaaccctcac cttggaggtg gcgcagcacc 1468501 tcggcgacaa cctggtgcgc accatctcgc tgcagccgac cgacggcttg gtgcgcggcg 1468561 tcgaggtgat cgacaccggg aggtcgatct cggtgccggt cggtgagggt gtgaagggcc 1468621 acgtcttcaa tgcgctggga gattgcctgg acgagccggg atatggcgaa aaattcgaac 1468681 actggtcgat tcaccgcaag ccgccggcgt tcgaggagct ggagcctcgg accgagatgc 1468741 tcgagaccgg tctgaaggtg gtcgacctgc tgactccgta tgttcgtggc ggcaagatcg 1468801 cactgttcgg cggtgccggg gtgggcaaga cggtgctgat tcaggagatg atcaaccgca 1468861 tcgcccgtaa cttcggtggt acgtcggtgt tcgccggagt gggcgagcgc acccgcgagg 1468921 gcaacgatct gtgggtcgag cttgccgaag ccaacgtgct caaggacacc gcgctggtat 1468981 tcggacagat ggacgagccg ccgggcaccc gtatgcgtgt tgcgctgtct gcgctgacga 1469041 tggcggagtg gttccgtgac gagcagggtc aagacgtatt gctgttcatc gacaacatct 1469101 tccggttcac ccaggctggg tcggaagtgt cgacgcttct cggccggatg ccgtcggccg 1469161 tgggatacca gcccacgctg gccgacgaga tgggcgagct gcaggagcgc atcacctcga 1469221 cgcggggacg ctcgatcacg tcgatgcaag ccgtctacgt gcccgccgac gactacaccg 1469281 acccagcgcc ggcgaccacg ttcgcccacc tggacgccac gaccgagcta tcccgtgcgg 1469341 tgttctccaa gggcatcttc cccgccgtgg acccgctggc gtccagctcg accatcctgg 1469401 accccagcgt tgtcggggat gagcactacc gcgtggccca ggaagtcatc cggatcctgc 1469461 agcgttacaa ggaccttcag gacattatcg cgatcctcgg tatcgacgag ttgtcggagg 1469521 aggacaagca gctggtgaac cgcgcccggc gtatcgagcg gttcctatcg cagaacatga 1469581 tggcagccga acagttcacc ggccagccgg gttcgaccgt cccggtgaag gagaccattg 1469641 aagcgttcga ccgcttgtgc aagggcgatt tcgatcacgt acccgaacag gccttcttct 1469701 tgatcggtgg ccttgatgac ctggccaaga aagccgagag tctcggcgcc aagctgtgac 1469761 gggagttgtg gcatggccga attgaacgtt gagatcgtcg ccgtcgaccg gaacatctgg 1469821 tcgggtacgg cgaagtttct gttcacccgc accaccgtcg gtgagatcgg catcctgccc 1469881 cgccacattc cgttggtggc ccaattggtc gatgacgcca tggtgcgggt cgagcgggag 1469941 ggagaaaagg acctgaggat cgcggtcgac ggcgggttcc tgtcggtgac cgaggagggc 1470001 gtcagcattc tcgccgaatc tgccgagttc gagtcggaga tcgacgaggc cgccgccaag 1470061 caggattccg aatccgacga tccccgcatc gctgccaggg gccgcgccag attgcgcgcc 1470121 gtcggcgcga tcgactaacc cgccgatgag cgtgcccatg atcggcatgg tcgtgctcgt 1470181 cgttgtcctg ggattggccg ttctcgcact gagttatcgt ctgtggaagc tgcgccaggg 1470241 gggaacggct gggatcatgc gggacatccc tgcggttgga ggtcacggct ggcgccacgg 1470301 cgtaatccgc tatcgcggcg gcgaagccgc gttctaccgg ctttctagtc tgcgcttgtg 1470361 gccggatcgc cggctcagta gacggggtgt ggagatcatt tcccggcgcg cgccccgtgg 1470421 cgacgaattc gacatcatga ccgacgagat tgtcgttgtg gaactgtgcg acagcaccca 1470481 ggaccgaagg gtaggttacg agatcgcgct cgacaggggc gcgttgaccg catttctgtc 1470541 gtggttggag tcccggccgt cgccgcgcgc gccgccgtag tatgtgacgc actggtcagc 1470601 agacgcaaaa gcccccattt cgggctctac tgactgatct gtgggtggtt gtgtcggcct 1470661 ggcagggtgg ggcggtggcc ggcgagggtg agcatggcta gggcgatgag ggcttgtggt 1470721 gagcggaatc cgaacgcgat ccgggtcagt aggcggatct tggtgttggt ggattcgatc 1470781 aggccttggg ataggccgtg gtcgagggcg gcgtcgatgg ccacccggtg gcgtttgatg 1470841 cgggcggcaa gctcgacgaa taccgggatg cgacagcgct gggcccagga gatccaccgg 1470901 tccagggcct gtttaccttc ctcgcccttg accgaaaaca catgccgcag gctctctttg 1470961 agcaggtagg cgcgatacag acggggatcg gtcttggcga tccaggccag tttggcgctt 1471021 tggcgttcgg tgaggtcctc ggggttcttc cacagcgcgt agcgggcgcc cttgagccgc 1471081 cgtgcccgct cgcggcccgg acgtggtgcg gcgttcttac cgggccggcc ccggccccac 1471141 ttgggttcgg tgcgcgcgat cgcccgtgcg tcgttccagg ctcggcgccg ctcgacgtcg 1471201 agcgcctcgg tggcccaggc caccacatga aacggatcgg cgcattgaat cgcatccggg 1471261 cagcgctcgg tgaccacgtc agcgatccag tccgcggcat cggccgaaac gtgagtaatc 1471321 tgggcggccc gctcagcgcc cagggcatcg aagaacaagc ccagggtggc tcggggcggc 1471381 ccacaccaac cggccgctgt cgtgatcgac gaccaccgtc aggtaccggt ggtggcgctt 1471441 gtaggagatc tcatcgatac cgatgcggcg caagttcgcg aaccggtcaa tgcgcttttc 1471501 ggtgtcggcc cagacccggg ccacgatcgc cccgacggtg cgccaggcga tccgcatcaa 1471561 ctcgcacacc gcggtcttcg aacacgccac cgccagccag gccaccgtgt catcgaaagc 1471621 atacgtgtgc ccggcatgat gacgcgccca cggcaccgcc accaccgtcg gcccatgggt 1471681 ggggcagttc acccgcggcg cctcggcctc caagaacacc tcgacggtgc cccaatccag 1471741 actgcgccat tggcgcaggc ccgcaccgcg gtcataccag gacgccttgc gaccgcagcg 1471801 accacagcgg cgcaacactg cacttcgtgg ccgcacccgg gcgatcaccc gcgcaccgtc 1471861 tccggcgtca tcctcctcga attcgatgtc ctcaatcacg gtgcgcttgt cgacacccag 1471921 cagcgcacga aatagcctca cattgcgcac gtcgttgtcg gctccttgtg tttctgatcc 1471981 ttgacaagcc agaaacctta agccacaacg acgtgcgcct actcaggaca caaactcacc 1472041 cacggaagtg tcagaagagc ccaaaaaccg tgggtattgg gggctttcgc gtctgctcgc 1472101 acgcggaagg tgccgctagc tcgccgtcct atcaccaccg ggccgccaca gcacgtcacc 1472161 gtcgggattg gctacccgcg acaggatgaa cagcagatcc gacagccggt tcaggtattt 1472221 cgccggcagt acgctgacgc cttccgggtg agcgtcgacc gcggcccacg cggatcgctc 1472281 ggcccggcga acgacggtgc gagcgacgtg caacagcgcc gacagcggtg aaccaccagg 1472341 tagtacaaag gattttagtg caggcaggcc cgcgttgtat gcgtcgcacc acccttcgag 1472401 ccgatcgata taggactgtg cgattcgcag cggagggtgc ttcgggtttt ccactatcgg 1472461 agtcgacaga tccgcaccgg catcgaacaa gtcgttctgg atctgccgca gcacatccgt 1472521 gatttgagtg tccgggtggc ccagcgccag ggcggccccg atcgcggcgt tggcctcgtc 1472581 gcaatccgcg tatgccacca gtcgggcgtc ggttttggcg acacgggaca tatcgctcaa 1472641 tcccgtcgtt ccgtcatcgc cggttcgggt atagatgcgg gtcaggtgga ctgccatgag 1472701 caaacggtac tcgctgactg gcttggctca ctgacaaggc aaaacccctt tactacactg 1472761 accgggtggc cgagcgtttc gtcgtgactg ggggcaaccg gttatcaggc gaagtggccg 1472821 tcggcggcgc caagaacagc gtgctcaagc tcatggctgc gacgttgttg gccgagggca 1472881 ccagcacgat caccaactgt cccgacatcc tcgatgtgcc gctgatggcg gaggtactgc 1472941 gtggtctggg cgccaccgtc gaactcgacg gtgacgtggc ccggatcacc gcacctgacg 1473001 agccgaagta cgatgccgac ttcgctgcgg tgcggcaatt ccgcgcctcg gtctgtgtgc 1473061 tgggaccgct ggtcgggcgg tgcaaacggg ccagggtcgc gctgccgggc ggtgacgcga 1473121 tcgggtcgcg tccgttggat atgcaccagg cgggcctacg gcaattgggt gcccactgca 1473181 acatcgagca cggctgcgtg gtagcccgag cggaaacgtt gcgcggtgcg gagattcagt 1473241 tggagttccc ctcggtggga gccaccgaga acatcttgat ggccgccgtg gtggccgagg 1473301 gagtcaccac tattcacaat gcggctcgag aacccgacgt cgtcgacttg tgcacgatgt 1473361 tgaaccagat gggcgcacag gtcgaaggtg cgggttcgcc gacaatgacc atcaccggtg 1473421 tcccgcggct gcatccaacc gagcaccggg tgatcggaga ccgtatcgtt gccgccacat 1473481 ggggcatcgc tgccgcaatg acccgtggtg atatatcagt ggcgggcgta gacccggcgc 1473541 atctgcagct ggtgctgcac aaattgcacg acgcgggcgc aaccgtcacc cagactgacg 1473601 ccagcttccg ggtgacccag tacgagcgtc cgaaggctgt caacgttgcg accttgccgt 1473661 ttcccgggtt tcccacggat ctgcagccga tggctatcgc tttggcgtcg atcgccgacg 1473721 gcacatcgat gatcacggag aacgtgttcg aggcgcggtt ccgcttcgtt gaagagatga 1473781 tccggctcgg tgcagacgct cggaccgacg ggcaccacgc cgtggtgcgg ggcctcccgc 1473841 agctgtcgag cgctccggtg tggtgttcgg acatccgtgc cggggccggc ttggtgctgg 1473901 cggggctcgt tgccgacggc gacaccgagg tccacgatgt attccacatc gatcgcggat 1473961 atccgttgtt cgtggagaac ctggtgagtc tcggtgccga gatcgaacgg gtatgctgtt 1474021 aggcgacggt cacctatgga tatctatgga tgaccgaacc tggtcttgac tccattgccg 1474081 gatttgtatt agactggcag ggttgccccg aagcgggcgg aaacaagcaa gcgtgttgtt 1474141 tgagaactca atagtgtgtt tggtggtttc acatttttgt tgttattttt ggccatgctc 1474201 ttgatgcccc gttgtcgggg gcgtggccgt ttgttttgtc aggatatttc taaatacctt 1474261 tggctccctt ttccaaaggg agtgtttggg ttttgtttgg agagtttgat cctggctcag 1474321 gacgaacgct ggcggcgtgc ttaacacatg caagtcgaac ggaaaggtct cttcggagat 1474381 actcgagtgg cgaacgggtg agtaacacgt gggtgatctg ccctgcactt cgggataagc 1474441 ctgggaaact gggtctaata ccggatagga ccacgggatg catgtcttgt ggtggaaagc 1474501 gctttagcgg tgtgggatga gcccgcggcc tatcagcttg ttggtggggt gacggcctac 1474561 caaggcgacg acgggtagcc ggcctgagag ggtgtccggc cacactggga ctgagatacg 1474621 gcccagactc ctacgggagg cagcagtggg gaatattgca caatgggcgc aagcctgatg 1474681 cagcgacgcc gcgtggggga tgacggcctt cgggttgtaa acctctttca ccatcgacga 1474741 aggtccgggt tctctcggat tgacggtagg tggagaagaa gcaccggcca actacgtgcc 1474801 agcagccgcg gtaatacgta gggtgcgagc gttgtccgga attactgggc gtaaagagct 1474861 cgtaggtggt ttgtcgcgtt gttcgtgaaa tctcacggct taactgtgag cgtgcgggcg 1474921 atacgggcag actagagtac tgcaggggag actggaattc ctggtgtagc ggtggaatgc 1474981 gcagatatca ggaggaacac cggtggcgaa ggcgggtctc tgggcagtaa ctgacgctga 1475041 ggagcgaaag cgtggggagc gaacaggatt agataccctg gtagtccacg ccgtaaacgg 1475101 tgggtactag gtgtgggttt ccttccttgg gatccgtgcc gtagctaacg cattaagtac 1475161 cccgcctggg gagtacggcc gcaaggctaa aactcaaagg aattgacggg ggcccgcaca 1475221 agcggcggag catgtggatt aattcgatgc aacgcgaaga accttacctg ggtttgacat 1475281 gcacaggacg cgtctagaga taggcgttcc cttgtggcct gtgtgcaggt ggtgcatggc 1475341 tgtcgtcagc tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttgtc 1475401 tcatgttgcc agcacgtaat ggtggggact cgtgagagac tgccggggtc aactcggagg 1475461 aaggtgggga tgacgtcaag tcatcatgcc ccttatgtcc agggcttcac acatgctaca 1475521 atggccggta caaagggctg cgatgccgcg aggttaagcg aatccttaaa agccggtctc 1475581 agttcggatc ggggtctgca actcgacccc gtgaagtcgg agtcgctagt aatcgcagat 1475641 cagcaacgct gcggtgaata cgttcccggg ccttgtacac accgcccgtc acgtcatgaa 1475701 agtcggtaac acccgaagcc agtggcctaa ccctcgggag ggagctgtcg aaggtgggat 1475761 cggcgattgg gacgaagtcg taacaaggta gccgtaccgg aaggtgcggc tggatcacct 1475821 cctttctaag gagcaccacg aaaacgcccc aactggtggg gcgtaggccg tgaggggttc 1475881 ttgtctgtag tgggcgagag ccgggtgcat gacaacaaag ttggccacca acacactgtt 1475941 gggtcctgag gcaacactcg gacttgttcc aggtgttgtc ccaccgcctt ggtggtgggg 1476001 tgtggtgttt gagaactgga tagtggttgc gagcatcaat ggatacgctg ccggctagcg 1476061 gtggcgtgtt ctttgtgcaa tattctttgg tttttgttgt gtttgtaagt gtctaagggc 1476121 gcatggtgga tgccttggca tcgagagccg atgaaggacg tgggaggctg cgatatgcct 1476181 cggggagctg tcaaccgagc gtggatccga ggatttccga atggggaaac ccagcacgag 1476241 tgatgtcgtg ctacccgcat ctgaatatat agggtgcggg agggaacgcg gggaagtgaa 1476301 acatctcagt acccgtagga ggagaaaaca attgtgattc cgcaagtagt ggcgagcgaa 1476361 cgcggaacag gctaaaccgc acgcatgggt aaccgggtag gggttgtgtg tgcggggttg 1476421 tgggaggata tgtctcagcg ctacccggct gagaggcagt cagaaagtgt cgtggttagc 1476481 ggaagtggcc tgggatggtc tgccgtagac ggtgagagcc cggtacgcga aaacccggca 1476541 cctgcctagt atcaattccc gagtagcagc gggcccgtgg aatccgctgt gaatccgccg 1476601 ggaccacccg gtaagcctaa atactcctcg atgaccgata gcggattagt accgtgaggg 1476661 aatggtgaaa agtaccccgg gaggggagtg aaagagtacc tgaaaccgtg tgcctacaat 1476721 ccgtcagagc ctccttttcc tctccggagg agggtggtga tggcgtgcct tttgaagaat 1476781 gagcctgcga gtcagggaca tgtcgcaagg ttaacccgtg tggggtagcc gcagcgaaag 1476841 cgagtctgaa tagggcgacc cacacgcgca tacgcgcgtg tgaatagtgg cgtgttctgg 1476901 acccgaagcg gagtgatcta cccatggcca gggtgaagcg cgggtaagac cgcgtggagg 1476961 cccgaaccca cttaggttga agactgaggg gatgagctgt gggtaggggt gaaaggccaa 1477021 tcaaactccg tgatagctgg ttctccccga aatgcattta ggtgcagcgt tgcgtggttc 1477081 accgcggagg tagagctact ggatggccga tgggccctac taggttactg acgtcagcca 1477141 aactccgaat gccgtggtgt aaagcgtggc agtgagacgg cgggggataa gctccgtacg 1477201 tcgaaaggga aacagcccag atcgccggct aaggccccca agcgtgtgct aagtgggaaa 1477261 ggatgtgcag tcgcaaagac aaccaggagg ttggcttaga agcagccacc cttgaaagag 1477321 tgcgtaatag ctcactggtc aagtgattgt gcgccgataa tgtagcgggg ctcaagcaca 1477381 ccgccgaagc cgcggcacat ccaccttgtg gtgggtgtgg gtaggggagc gtccctcatt 1477441 cagcgaagcc accgggtgac cggtggtgga gggtggggga gtgagaatgc aggcatgagt 1477501 agcgacaagg caagtgagaa ccttgcccgc cgaaagacca agggttcctg ggccaggcca 1477561 gtccgcccag ggtgagtcgg gacctaaggc gaggccgaca ggcgtagtcg atggacaacg 1477621 ggttgatatt cccgtacccg tgtgtgggcg cccgtgacga atcagcggta ctaaccaccc 1477681 aaaaccggat cgatcactcc ccttcggggg tgtggagttc tggggctgcg tgggaacttc 1477741 gctggtagta gtcaagcgaa ggggtgacgc aggaaggtag ccgtaccagt cagtggtaac 1477801 actggggcaa gccggtaggg agagcgatag gcaaatccgt cgctcactaa tcctgagagg 1477861 tgacgcatag ccggttgagg cgaattcggt gatcctctgc tgccaagaaa agcctctagc 1477921 gagcacacac acggcccgta ccccaaaccg acacaggtgg tcaggtagag cataccaagg 1477981 cgtacgagat aactatggtt aaggaactcg gcaaaatgcc cccgtaactt cgggagaagg 1478041 gggaccggaa tatcgtgaac acccttgcgg tgggagcggg atccggtcgc agaaaccagt 1478101 gaggagcgac tgtttactaa aaacacaggt ccgtgcgaag tcgcaagacg atgtatacgg 1478161 actgacgcct gcccggtgct ggaaggttaa gaggacccgt taacccgcaa gggtgaagcg 1478221 gagaatttaa gccccagtaa acggcggtgg taactataac catcctaagg tagcgaaatt 1478281 ccttgtcggg taagttccga cctgcacgaa tggcgtaacg acttctcaac tgtctcaacc 1478341 atagactcgg cgaaattgca ctacgagtaa agatgctcgt tacgcgcggc aggacgaaaa 1478401 gaccccggga ccttcactac aacttggtat tgatgttcgg tacggtttgt gtaggatagg 1478461 tgggagactg tgaaacctcg acgccagttg gggcggagtc gttgttgaaa taccactctg 1478521 atcgtattgg gcatctaacc tcgaaccctg aatcgggttt agggacagtg cctggcgggt 1478581 agtttaactg gggcggttgc ctcctaaaat gtaacggagg cgcccaaagg ttccctcaac 1478641 ctggacggca atcaggtggc gagtgtaaat gcacaaggga gcttgactgc gagacttaca 1478701 agtcaagcag ggacgaaagt cgggattagt gatccggcac ccccgagtgg aaggggtgtc 1478761 gctcaacgga taaaaggtac cccggggata acaggctgat cttccccaag agtccatatc 1478821 gacgggatgg tttggcacct cgatgtcggc tcgtcgcatc ctggggctgg agcaggtccc 1478881 aagggttggg ctgttcgccc attaaagcgg cacgcgagct gggtttagaa cgtcgtgaga 1478941 cagttcggtc tctatccgcc gcgcgcgtca gaaacttgag gaaacctgtc cctagtacga 1479001 gaggaccggg acggacgaac ctctggtgca ccagttgtcc cgccaggggc accgctggat 1479061 agccacgttc ggtcaggata accgctgaaa gcatctaagc gggaaacctt ctccaagatc 1479121 aggtttctca cccacttggt gggataaggc cccccgcaga acacgggttc aataggtcag 1479181 acctggaagc tcagtaatgg gtgtagggaa ctggtgctaa ccggccgaaa acttacaaca 1479241 ccctcccttt tggaaaaggg aggcaaaaac aaactcgcaa ccacatccgt tcacggcgct 1479301 agccgtgcgt ccacaccccc caccagaaca aatttgcata gagttacggc ggccacagcg 1479361 gcagggaaac gcccggtccc attccgaacc cggaagctaa gcctgccagc gccgatgata 1479421 ctgcccctcc gggtggaaaa gtaggacacc gccgaacata caaaaacacc ccggtaacgg 1479481 tggtgttttt gtatgtttat atcgactcag ccgctcgcga gcgggcgaat tatggcttcg 1479541 attttcgcaa tgacgatacc ctcgcgggcg ggggcgctca gtcgaagagc gtcaagtctg 1479601 cgggcgcccg gcttttctcc aactcgagca gagctcgttt ccggttgatt ccaccgccgt 1479661 acccggtgag ctttccgctg gcgccgatca cgcggtggca cgggacgatg atggcgatgg 1479721 gattgtggcc gttggccaat cccacggcgc gtgcggcgcc gggggcgccg atctggtcgg 1479781 cgatttcccc gtaggaccgg gtttccccgt acgggattgt cagcaatgct ttccatactc 1479841 gttgctgaaa gtcggttccc cggaggtcaa gttccacatc gaattcggtg agctcgccgg 1479901 cgaaataagc gttgagttgg tcgacagcgc cagaaaatgc gccggggtcg ggtgtccagt 1479961 gtgtgcggct tggctcatac gtctgctcga gcatccgcag gttcgtcaac accgagccat 1480021 gcccggccag ggttaatggc ccgatggggc tatcgatggt gcggtagtga atcatgcgat 1480081 cttctcctgc ggtggccatt ggtttaccgg atgttccagg gtggtccaca ggtgctgggt 1480141 ggcataggag cgccaggggc gccagcgagc gctgtgcacc gtcagggctc gtcgttgtgc 1480201 aggcaggccc agctttttgg cggccagccg caggccgaga tcactggccg gaaaggcgtc 1480261 cgggtcaccg aggccgcgca tggcgatgac ctccgcggtc caggggccca ctccgggcag 1480321 cgctagcaac tgcccgcggg cgcgttgcca gtcacatccg gcgtccagga ccagactttt 1480381 gtcggcaagg ctggcgacga gcgcgtttat ggtcctttga cgcgccttgg ggacggccag 1480441 atggccggga tcgatctcag cgagctgctc gatcgacggg aaggtgtggg tcaaagcgcc 1480501 gtggcgatcg tggaccggcc gtccgtaggc ggcgaccagt cggcccgcgt gagtgcttgc 1480561 ggccttcgtc gatacctgtt gggcgaggac cgcccgcacg gcgaattctg cctcgtcgac 1480621 tgtgcgggga atgcgttgcc cgggtgcctt gcccaccact gcgcgcagat ccggatcggc 1480681 gcccagcgcc tcgacgatcg cttcgggatc ggcgtcgagg tccagcagcc gtcggcaacg 1480741 tgcagtggcc gtcattaggt cgcggaaatc atcgagcaca agcaggcagc gcacatgatc 1480801 gggtgccggc gtcaggctga cgatgccgtt gccccatggg agccgtagcg tgcgtcggta 1480861 cgcaccatcg cggacctctt cgcaacccgg caccgcggtg gcggccagat ggccgaaaac 1480921 accctcgaag gcgaatggtg cacggacggg tagccgcagc gacaccgtgc ccgctgatgc 1480981 ggtggcagac tcgaatcggg cggccgcgcg cgcacgcaat gccgtcggtg tgccgtcgca 1481041 cgccaggcga acggtgtcgt tgaactgacg gatgctggaa aacccggcgg cgaatgcgac 1481101 atcgccgaac ggcaggttcg tggtctcgat cagcacccgg gcggtctgca tgcgttgggc 1481161 gcgggccaac gcgagcggac cggcgccgac cacggcctgc aacagccgct ccagctggcg 1481221 aatggtgtaa ccgagctggg ccgcgaggcc gctgacaccg tcgcggtcca ccgttccgtc 1481281 ggcaatcagc cgcatcgccc gcgccacgac gtcactacgc acattctatt ccggagaccc 1481341 aggcgaggcg tcggggcggc accgtttgca ggcccggaat ccctccccct gagcggccgc 1481401 cgcagtcggc aggaaccgga cattgcgcgc gaacggtggc cggacggggc aactcggccg 1481461 gcagtagaca ccggtggtca aaaccgcgac gacgaaccag ccgtcgaacc gggcgtcttt 1481521 ggactggacc gcccggtagc agcgttcgaa gtcgtcgtgc acccttcaac aattacaccc 1481581 gcccaccgac atgactggcg gaaaaacgac attgtgatgg ggtcgtcgtg ggttcgggca 1481641 ggttacctac gcggcttggt cagcccgacc ggcttggcca gccggaccgg ttggtcgtgc 1481701 ccacgaagtt tcacatgcct acccaaagac caacgggcgc gctcctcttc gcttgcggcg 1481761 tccacggcct gtgccgaagc cagcaacttg ccgggacgcg atttggccag ttcgcacaat 1481821 cgggccgcct cgttgaccgg ctccccgatc acggtgtact cgaaccgttc tcgggcaccc 1481881 acgttgccgg caatgacctg ccccgccgcc acgccgatcc cggcctggca ctcgggcatt 1481941 tcgttgacca gccgatcggc tatcgcccgc gcggcggcca gtgccttgtc ttcgggacag 1482001 ggaagccggt tcggggcgcc gaagatggtt agcgacgcgt ccccctcgaa cttgttgacc 1482061 aatccgtggt ggcggtcgac ctcgtcgacg acaatcgcga agaacttgtt gagcagcttg 1482121 acgacgtcgg ccggcggccg gctggtcacc aattgcgtcg agccgacgat gtcgatgaac 1482181 acgacggcga cgtggcgttc ttcgccgccc agtttcgaac gttcacgctc ggcggcggcg 1482241 gcgacttcgc gtccgacgtg gcggccgaac agatcgcgca ctcgttcgcg ctcccgcagt 1482301 ccggcgacca tcgcgttgaa accacgctgc agctcgccga gttcggtgcc gtcgaagacc 1482361 accaggttgg tccgtagctc gccccgctcg acgcgccgca gcgccgcacg caccacccgc 1482421 accggggtcg ccgtcagcca ggccaggatc cacatcagga tgaacccgaa caccaatgtg 1482481 accatcgaga tgatcagcac gcccgtcgcg aactgcatcc gagtgagatt gagcagcacc 1482541 atttcgaaca tcgccatcag ggcgatgccg acgacgggta ctcccgaacc gagcagccac 1482601 accaccatgg tccggcccag gattcccggc gccaaccggc gtggcggcgg cccggcctcg 1482661 agcgcctggg cggcgaacgg gcgcaatgcg aactcggtat gcagataggt tgcggttgcg 1482721 accaatacgc cgcaaaagct gaccgcgaac aggaatcgcg ggatgaacgc gttgttgatc 1482781 aggccgtaga gtgtcgtcaa gagcgccgtg ccaacacccc agaacatgag gtggcccacg 1482841 gcgactcgcc agggggccag gaaggtgcgg cgctcctcct cacgagtcgg tttccgtcct 1482901 tcgatcgccc agcgcagggc ttgcacggtc tgcctggtca gtgcgtagct acccaaagcg 1482961 agggctagca ggacatagcc cggtaccacc ccgaacgtga gccaccgtgg cgtgtcgcga 1483021 acgatgctcg gttcggggat ggcgatcgtc accaatagca gggcaacccc gatgccgagc 1483081 aggttcgcgg tcacgaccag cgcggtcagc atgacctgga tccgtacccg tcggcgccgt 1483141 tggctttccg aaacccgccc aagcagccag gagccgtacg cgggagtttc tggcagccgg 1483201 ccgctctgcc gggtcaccgt ctccagcacc cgacccaagc gttgcgccgt gctcttcttg 1483261 gccgacattg tggcgtcaga ctagtttgtc gaagagtcgg gtgcgaccgg ttggcgcgct 1483321 cgtgttgttt gcccggctta ggtgggcacg gccagccgag tcggctgctc atgtccgcgc 1483381 agcgtcaccg tctcgcccaa agaccaatgg gcacgctcgg tttcgctggc agcgtgcagt 1483441 gtgtccgagg atgctagcaa tcgcgcgggg tgtgatttgg ccagttcgca caatcgggcc 1483501 gcctggttga ccggcttgcc gaccactgtg tattcgaatc tttgcttggc gccgacattg 1483561 ccggcgacga tctggcctgc cgccaccccg atgccggctt ggacctcggg catctcgttg 1483621 gccagccgat cggctatggc ccgggcggcg gccagcgcgg cgtcttcggg acggtcgagg 1483681 cggttcgggg ctccgaagat ggccagggcg gcgtcgcctg cgaacttgtt gatcagtccg 1483741 tggtgacggt cgacctcgtt gacgacgatc gcgaaaaacc ggttgaggag cttgaccacg 1483801 tgggcggcag gttggttgtc caccagctgg gtggagccga cgatgtcgac gaagacgacg 1483861 gcggcgtggc ggtcttcgcc gcctagctgt ggtcgttcac gctcggcggc ggcggcgact 1483921 tcgcgtccga cgtggcggcc gaaaaggtcg cgcacgcgtt cgcgctcgcg caggccgttg 1483981 accatcgcgt tgaaaccacg ctgcagctca ccgagttcgg tgccgtcgaa caccaccaga 1484041 tcccctcgca gatccccctg ctcgacacgc ttgagcgcag cgcgcaccac tcgcaccggc 1484101 gccgccgtca gccaagcaag aatccacatc acgagaaacc cgaagatcaa cgtggttatc 1484161 gacaggatca acaccgccga cgcgagctgc gtttcggtca gattgtgcac caataggacg 1484221 tagagcgctg tcgtggcgat gccggtcaca ggcacgcctg aacctagcga ccacaccgtc 1484281 atcgttcggc ccatgatgcc cggtgcaaac cgtcgtggcg gtcgtcccgc ttcgagtgct 1484341 ttagcggcta cgggtcgcag cgcgaactcg gtgaacaagt agcaattggt ggctaccaaa 1484401 acgccgcaga tagtcaccga gaacaaaatt atggtgacga atacgcggtt ggccagcccg 1484461 tagagcgtgg ccaacaacgc cccgccgata tcccacagaa tgaggtggac ggctgccact 1484521 cgaaacggga gcagcaaggt gttgcgcccg tcggcctggc tcggcgcgcg ttcctcgatc 1484581 gcccaccgta tggacgctct gacgattcgc gtggttatcc agtaggtgcc gatggccagt 1484641 gcgagcgtcg cataggccgg tgcgaccccg aaggtgaccc accatggggc gtcggtgtag 1484701 atgctaggca ccggaaaagc gaaggtcacc actagcagcg cgaccacgat cccggtcagg 1484761 ttcgccgtca tgatatagac ggtcacgatg cgcttgatgc gtacccagcg acgcgacggg 1484821 ctttctgaca cccgcccaag caaccaggag ccatacgccg gggtctcggg cagctggccg 1484881 cactgacggg tcatcgtctc gagtgcctgg cccaggcgtt gcgccatggt ctttttcgcc 1484941 ggcatggtgg cgtcagccta atctgtcgga tgcgccccac ggtaaatcgt gtgggtctgg 1485001 tgatcgccca gtgcaccgcc gactatgtcg gccgactgag caggcatctg cagctgttga 1485061 actggcgacg ccagccggat gggctggtcg tgcccgcgaa gtgtcacggt ctcgcctaaa 1485121 gaccaacggg cacattcgtt ttcactggca ccgcgcaacg tttgcgacga cgccaacaat 1485181 cggctcgggt atgattttgc cagttcgcac agtcgtgcag cctcgttgac cggttcgccg 1485241 atcacggtgt attcgaaccg ttcgtgggcg ccgacattgc cggcgacaac ctgacctgcc 1485301 gctaccccga tgccggcttg gcactccggc atttcgctgg ctagccggtc ggcgatggct 1485361 cgtgcggtgg ccagcgcggc atcttcggga tggctcaggc ggttgggggc cccgaagact 1485421 gccagcgagg cgtctccctg aaacttgttg acaagtccac ggtgatggtt cacttcatcg 1485481 acgattaccg tgaagaaccg gttgagtagc atcacgacct ctgccgcagg ccggctggtg 1485541 accaattgag ttgaaccgac gatgtcgacg aagacgacgg cgacatggcg ctcttcgccg 1485601 cccagttttg gtcgctcgcg ttcggctgct gcggcgacct cgcgaccgac gtggcggccg 1485661 aagagatcgc gtacgcgttc gcgctcgcgc aggccctcga ccattctgtt gaaaccacgc 1485721 tgtagctcac cgagttcggt cccgtcgaat acgaccagat cgccgcttag atcgccctgc 1485781 tctacgcggt tgagcgcctc gcggaccacg cgcacaggcg tggccgtcag ccaagcgaga 1485841 atccacatca ggatgaatcc gaagatcaac agtggtgccc acaggatcag cactgtgatc 1485901 atgaattgat cattggagag ttcccaaaac gtatcgtcga agatggcggt gagggcgaca 1485961 ccgacattgg gtacgcctga acagagcagc cacaccagca tggttcggcc cacgatgcct 1486021 cgcaccagcg atcgtggtgt tgctcccact tcgagcgcct gggcggccat cgggcgaagc 1486081 gcaaactcgg ttaacagata gcagctggtg gctgcgacaa cgccgatgac gcccatcgaa 1486141 aacaggaacc gcgggataaa caaccggttg gccaggccgt agattatcgt ccacaacgct 1486201 gcggcggcgc cccacaggaa aagaactgcc aacgccactc gcagtgggac taggaaagcg 1486261 ctgcgcgcct catcatggct gggggtgcgt tcctcgattg cccaccgcaa cgctcgagcc 1486321 gtttgcctgg tgagccagta ggtgcccagt atgaaggcga gcacgcagta tcccggaacg 1486381 atcccgaacg acacccaatg cggggcgtcc aaaatcacgc ttggtttcgg aaaggcgacc 1486441 gtcagtagca tggcaccgac aatgagcccg atcacgttcg tgaccaaaat ggcgacggtc 1486501 agcatgccct ggatacgtac ccgccgcatc cgtgggctct ccgacacgcg cccaagcagc 1486561 catgagccgt atgcgggcgt ctctggccgt cgtccagtgc gcgggctgag agtctcgacg 1486621 gcccctggca agtgtcgagt ggtggccttc tcggatggca tggtgacgtc agcgtagtgt 1486681 gtcggtcacg ctctaaggaa caacgtcgtt gcgcgctcta aggtgagtcg ggtgcgtcta 1486741 gtcatcgccc agtgcactgt cgactacatc ggccggctca ccgcgcatct gccgtccgcg 1486801 cgccggctgt tgctgttcaa ggccgacgga tcggtcagcg tacatgctga cgaccgcgcc 1486861 tacaagccgt tgaactggat gagtccgccg tgctggttga ccgaagagtc cggcggccag 1486921 gcgccagtgt gggtggtcga gaacaaggcc ggcgagcagc tgcgcatcac tatcgaagga 1486981 atcgagcacg acagtagcca cgagctgggc gtggaccccg ggctggtcaa ggacggcgtc 1487041 gaggcccact tgcaggcgtt gctcgccgag cacatccaat tgctgggcga agggtacacg 1487101 ctggtccgcc gcgagtacat gaccgcgatc ggacccgtcg acctgctgtg ccgcgacgaa 1487161 cgaggtggct cggtcgcggt ggaaatcaag cggcgtggcg agatcgacgg cgtggagcag 1487221 ctgacccgct acctcgagtt gctcaaccgc gacagtgtgc tcgcgccggt caagggggtg 1487281 tttgccgctc aacagatcaa gccgcaggct cggattctgg ccaccgaccg cgggatccgt 1487341 tgtttgacat tggattacga cacaatgcgc gggatggata gcggcgagta ccggctgttc 1487401 tgagttgcgc gattaaactg atgcgatggc tcggcgccgc aaaccgctgc accggcagcg 1487461 gccggaaccg ccgtcgtggg ccctgcgccg agtggaagcg gggcccgatg gccacgagta 1487521 tgaagtacga ccggtcgctg cggcccgcgc cgtcaagacc tatcgctgtc cggggtgtga 1487581 tcacgaaatc cgttccggta ctgcacatgt ggtagtgtgg ccgactgact tgccgcaagc 1487641 cggcgtcgat gaccggcgtc actggcacac cccgtgctgg gcgaaccgag caacccgcgg 1487701 tccgactcga aaatggacct aggcttttgg cggctggtgc gccctgctgg tgcgccttag 1487761 ggggccggct ccaccaactc gatcagaacc ccgccggcgt ctttcgggtg gatgaagttg 1487821 atccgtgagt tcgcggtgcc acgcctggcc gtctcgtaga ccagccggac gccctgggag 1487881 cgcagccgcc gacacatggc gtcaagatcg ctgacccggc acgccagctg ttggatgcct 1487941 ggcccgcgct tgtccaggaa cttcgctatc accgaggatt cgtcgagcgg ggccatcaac 1488001 tggatttgcg ccgcggagcc cggcaccgcc agcagtgcct cgcggatgcc ctgatcgtcg 1488061 ttgatttcct cgtggaccag gatcatgcca aggtggtcgt gataccactc gatggcaacg 1488121 tccaggtcgg cgaccgcaat accgacgtga tcgagtccag ttaccaacga ggtagccagc 1488181 atgtgacggg cgtggacttg atcggtcgtc atcacacaac ggtaacctga agggaaagaa 1488241 tctgcttctc cgggtcggtc agatcggctt tcgggtgcgc tgaggaggta gtcataacga 1488301 catcggtgat tgttgctggc gcgcgtacac ccatcggcaa gttgatgggc tccctgaagg 1488361 atttcagcgc cagcgagctg ggtgccatcg ccattaaggg cgccctggag aaggccaacg 1488421 tgccggcgtc cttggtcgag tacgtgatca tgggccaggt gttgaccgcg ggtgccgggc 1488481 aaatgcccgc acggcaggcg gcagtggcgg ccggcatcgg ttgggatgtc cctgcgctga 1488541 cgatcaacaa gatgtgcctg tccggcatcg acgcaatcgc gctggctgat caactcattc 1488601 gggccagaga gttcgacgtg gtggtggccg gcggtcagga gtcgatgacg aaggcgcccc 1488661 acctgttgat gaatagccgg tcgggttaca agtacggcga cgttacggtt ttggaccaca 1488721 tggcctacga cggtctgcac gacgtgttca ccgatcagcc gatgggcgcg ctcaccgagc 1488781 aacgcaacga cgtcgacatg ttcacccgct ccgaacagga cgagtacgcg gctgcgtccc 1488841 accaaaaggc ggccgcggca tggaaggacg gcgtattcgc cgacgaggtg atcccggtga 1488901 acatcccgca gcgcacgggc gatccactgc agttcaccga ggacgagggg atccgcgcca 1488961 acaccaccgc cgccgcgctg gccggtctga agccggcgtt ccgtggcgac ggcaccatca 1489021 ccgccgggtc ggcgtcacag atctccgacg gtgcggccgc ggtggtggtc atgaaccagg 1489081 aaaaggccca ggaactgggg ctgacctggc tagccgagat cggcgcccac ggtgtggtgg 1489141 ccgggccgga ttccacactg caatcgcagc cggccaacgc gatcaacaag gcgctggatc 1489201 gcgagggcat ctcggtggac cagctcgacg tggtggagat caacgaggcg ttcgctgcgg 1489261 tggcattggc ctcgatacgc gaactcgggc tgaaccccca gatcgtcaac gtcaacggtg 1489321 gtgcgattgc cgtcgggcat cccctcggca tgtcagggac gcgaatcacg ctacatgcgg 1489381 cgctgcagtt ggcacgccgg ggatcgggcg tcggggttgc cgcattgtgc ggggctggcg 1489441 ggcagggcga cgcactgata ttgcgggccg gatagcggtt gaggggtcgg tggcggccag 1489501 tgtgatcttg gtcataccaa ccgatcgcgg tatgtcggct cctgccgcag ggtcggcgcc 1489561 accgggtgga tcgatgaccg cagcggcatg acagacttga cggcgtgacg cgtccgcgac 1489621 ccccgctcgg gccggccatg gccggtgctg ttgacctctc cggcatcaaa caacgtgccc 1489681 agcaaaacgc tgcggcgagc acggatgccg accgggcact gtcgacgccg tccggtgtga 1489741 ccgagatcac cgaggcgaac ttcgaggacg aggtgatcgt ccggtccgac gaagtgccgg 1489801 tggtggtgtt gctgtggtca ccccgcagcg aggtatgcgt cgacttgctt gacacgctgt 1489861 ccggcttggc cgctgccgct aagggcaagt ggtcgctggc gtcggttaac gttgacgtcg 1489921 cacccagggt ggcacagata ttcggcgtcc aagcggttcc gaccgtggtg gccttggctg 1489981 cgggacagcc gatctcgagc ttccagggcc tccagcccgc ggaccaactg agtcgctggg 1490041 tggattccct gttgtctgcg acagccggaa agctcaaggg cgcagcgagt tccgaggagt 1490101 ccaccgaagt cgatccagcg gtggcacagg cgcgccagca gctcgaggat ggcgactttg 1490161 ttgccgcgcg caagtcatat caggcgattt tggatgccaa cccaggaagc gtcgaagcca 1490221 aggcggccat ccgccagatc gaattcctca tccgcgcaac cgcacaacgg cccgacgccg 1490281 tctcggtcgc cgacagcttg tcggatgaca tcgacgccgc gtttgcggca gccgacgtgc 1490341 aagtcctcaa ccaggatgtg agtgcggcct tcgagcgcct gatcgcgttg gtgcgtcgga 1490401 catctggaga agagcgcacc cgggtgcgca cccggctgat cgagctgttc gagctgttcg 1490461 accccgccga tcccgaggtc gtggccggtc ggcgcaacct cgccaacgcg ctgtactgag 1490521 gccggctggc gagcagacgc agaatcgcct aaacccgcac gggtttaggc gattctgcgt 1490581 ctgctcgcgc tgggcggcta cgacaacccg ggtgatccgt tcaggccgag cagcccggcg 1490641 gtgccgccgg cgccaccctt acccggcgca ctgccgactc cgccgtttcc gccgttgcct 1490701 ccattgccga tcagggtggc gttgccgccg gagccgccga caccgccgtt tgccaccacg 1490761 cttgcgccgc cggcgccgcc gtcgccgccg ttgccgacca tcccggcctt gccacccttg 1490821 ccgccgttcc ccccgtcgcc ggcgatgccg agtccgccgg cgccgccggc gccgccatcg 1490881 ccgttgagca ggccggcgtt gccgccggcc ccaccgtcgc cggcggtctc gccgaacccg 1490941 ccggctccgc cggccccgcc cgcacccgag agcccggcgg cgttgccgcc ggccccgccg 1491001 gcccccccga ccgacccgaa ttcgatgcca gtcccgccgg cgccaccagc gccgccgtca 1491061 ccgatcaacc cgccggtgcc gccggtgccg ccgctaccgg ccgcgccccg gacgctgtcg 1491121 ccgccggtac cgccggcgcc gccgtcgccg atcagcttgg cggccccgcc gctgccaccg 1491181 gacccgccga tgccgccggc catttggtcc gcactggagg cgccgaaccc tccggtgccc 1491241 ccggcgccgc cgggaccgaa cagtgcgccg gcaccgccga tgccgccgac gcctcctttg 1491301 ccgccggtgc cgccggggtc gggcgcgccg agaccggttc cgccggtgcc gccaatgccg 1491361 ccagcgccga agaggatgcc ggcgttgccg cccgccccgc cggccccgcc ctcaccgccc 1491421 acgagttggt taccggtgcc agttccgccg gcgccgccgg tcccgccgtt gccggtgaag 1491481 atgccgccgt cgcctccggt gccgccgttc cctccggccg agccggagac tccgaacccg 1491541 ccggcgccgc cggcaccgcc attggagaac agcccgccgc ccccgccggt gccgccggtg 1491601 ccaccggtgc ccccgttgac gctcaacccg ccggtgccgc cggcaccgcc ggcggccaac 1491661 acctcgaaca gcccgctgcg accgccggcc ccgccggcgc caccgttgaa gcctggcccg 1491721 ccggccccgc cgatgccacc ggctccgaac agcccggccg ccccgccgtc gccgccgagc 1491781 ccgcctgtgc cggtgttccc gactccgccg ggcccaccgg cgccgccgga gccgaacagc 1491841 agcccgccgg ccccgccggc cccgccagcc gcgccgttgc ccgggccgtc acccccggcc 1491901 ccacccgccc cgccgttgcc gaatagcccc gcggctccac ctgggccgcc ggcctggccg 1491961 ggcgccccgg acccgccggc cccgccgttg ccgtacagca gcccgccggc cccgccggct 1492021 tgcccggtcc ccggtgcgcc gttggcgccg ttgccgatca gcggacgccc cagcaacagc 1492081 tgggtgggcg tgttcacaat gttgagcagg ccctccaacg gcgccgcggc ggcggcctca 1492141 gcgctcgcgt acgcgccagc gcccgcagta aaggtttgaa cgaactgggc atgaaacgcc 1492201 gacatctgag cgctcaccgc ctgataggcc tggccatgcg cggcgaacag cgacgcgatg 1492261 gccgccgaca cctcatcagc accggctgcc agaagttccg tcgtcgggcc caatgccgcg 1492321 gcgttggcgg cccccagcgt cgacccgatg ttcgccaaat ccgaagcggc cctcaccagc 1492381 gtctcgggag cggcgattac aaacgacatg ctttcctccg atcagctgtg cgtcgagtat 1492441 ccagctcgag ttagcacagg gtagcgctat cgcttagcct ttctgatcaa tctcggagtg 1492501 cagtgtgcag agtgcatcga atcggctcat caggcatgtg caatctgctc atggcaggcg 1492561 ctaggcgggc gtcagccaca gcgccgaagt gggcggcagc accagcaccg cggacgccgg 1492621 gcggccatgc caggggtcgt cggtggcgtc cacgccgccg aggttgccga tccctgagcc 1492681 gtggtagatc gtcgcgtcgg tattgagcac ctcgcgccag cggcccgcgc gcggcagccc 1492741 gagtcgatag tcacggtgtt cggcacctgc gaaattgaac acgcaggcca gcaccgagcc 1492801 gtcgctgccg tagcgcataa agctcaacac attgttggcg gagtcgttgg cgtcgatcca 1492861 agaatagcct tcgggggtgg tgtctaagct ccacagcgcc gggtggcatc ggtagatgtc 1492921 gttgatgtcg cgcaccagcc gctgaatccc gttggagaag ccgttttcgt cgagttggaa 1492981 ccagtccagg ccgcgctgct cggaccattc ggcgcgttgg ccgaattcct gacccatgaa 1493041 cagcaattgc ttgccggggt gtgcccattg gtaggcaagc aggctacgca ggccggcggc 1493101 cttgacgtga ttgttgcccg gcatccgccc ccacagcgtg cctttgccgt gcaccacctc 1493161 gtcatgactg agcggcaaca cgtaattttc gctgaacgca tacagcatcg agaacgtcat 1493221 ctcgtggtgg tggtagctgc ggtacaccgg atctcggctg acgtagtcga gcgtgtcgtg 1493281 catccagccc atgttccact tcatcgaaaa gcccaggccg ccaatgttgg tcgggcgggt 1493341 caccccaggc cacgacgtgg actcctcggc gatggtgacg attcccggcg cgaccttgtg 1493401 cgccgtggcg ttcatctcct gcaggaactg cactgcttcc aggttctccc ggccgccgtg 1493461 gacgttgggg gtccagccgc cctcgggtcg cgagtagtct agatagagca ttgaggccac 1493521 cgcgtccacc cgcaggccgt cgatgtggaa ctcctgtagc cagtacaacg cattggctac 1493581 cagaaagttg cgcacttccg ggcggccgaa gtcgaacacg tatgtgcccc aatccagttg 1493641 ctcgccgcgt ttgggatcgg aatgttcgta gagcggagtg ccgtcgaacc gtcccagggc 1493701 ccacgcgtcc ttcgggaagt gcgctgggac ccaatccacg atgacgccga tgccggcctg 1493761 gtgcagggcg tcgaccagcg cccggaagtc gtcgggtgtg ccgaatcgtg atgtcggcgc 1493821 atagtaggac gtgacctgat acccccatga tccggcgaat ggatgctcgg cgacgggcaa 1493881 cagctccaca tgggtaaacc cttgatccac aatgtaatcc gtcaactcac gagcaagctg 1493941 gcggtagctg agtccaggcc gccacgaacc gagatggact tcgtaggtgc tcatcgcctc 1494001 gttcaccggg ttgcgcagcg cacgcccagc catccagtcg tcgtcacccc aggtgtagtc 1494061 actcgacgtc acccgcgatg cggtctgcgg cggcacctcg gtgccgaacg cgaacgggtc 1494121 ggcccgatcg gtaaccacgc cgtcggcgcc gtgcacgcgg aacttgtaca gaccgtcgca 1494181 agggaagtcg ggccagaaca attcccatac ccctgatggg ccgagcaccc gcatgggggc 1494241 ttcgtggcca ttccaaccgt tgaactcgcc gatcaagctg acgcccttgg cgttgggcgc 1494301 ccacacggcg aacgacacgc cactcaccac accgtcggcc gtggtaaacg agcgggggtg 1494361 ggcacccagg acttcccaaa gccgttcgtg gcggccctcg gcgaacaggt gcaggtcgac 1494421 ctcgcccagg gtgggcagga atcggtacgc atcggccacg gtgtgtggct cgcaaccttc 1494481 ataggtcacc tgcaggcggt agtcgatgag gtcgacgaac ggcaatgcga cggcaaacag 1494541 gccagaatcg aggtgctgca acgagaaccg gtccttacca acgagcgcga cgacctcgac 1494601 ggcatgcgga cggaacgctc ggatgacggt atggtcgccg tattcgtggg cgcccaggat 1494661 gccgtgcggg ttgtgatgtg tacccgccac caagcgcgcc atttcggccg gctcgggtgc 1494721 aaggtgctcc ccggtgagtt tctcggatcg actcatgagc ccgtcacctc ctgcgcagca 1494781 gcgtgtttcg gctctcgtag ggcacggctg gcatgttgat gatgtgggcg actgcccgtg 1494841 ctgggtcgat gcggatgtaa ttggcttgcc cccattggta ttcttcgccg gttatctcgt 1494901 cgcgcaccca aaaccggtcg tagtcctcca tgcccaacgc cgccatgtcc aaccacagcg 1494961 tagcttcttc aggaccaaat gcgttgagtg tcaccaccac caacacgcag tcgccggtgg 1495021 ccgggtcgaa cttgctgtag gccagcaatg cgtcgttgtc aacgtggtga aaatgaatgg 1495081 tacgcaactg ttgaaacgcc gggtgcagcc ggcgaattat attgagccgt gtgatgaacg 1495141 gctgcaaaga tctaccctgg tccagcgcgc tggcaaagtc gcggggacgc aattcgtact 1495201 tctccgagtc caggtactcc tcgctgccct cgcgcaccgc acggtgctcg aaaagctcat 1495261 aaccgcagta catcccccag gctgggctca tggtggcggc cagcaccgcg cggatggcga 1495321 acatgcctgg accgttgtgc tgcagcaccg cgtgcaggat gtccggggtg ttgacgaaca 1495381 ggttgggccg acggtagtcg gcgagttcgg ctatctggtt gccgaattcg gtgagctccc 1495441 acttggtcgt gcgccaggtg aaatagctgt aggactgcgt gaagccgagc ttggccagcc 1495501 cgtactggcg ggcgggcggg gtgaaagcct cggacaggaa cagcacgtcg gggtcgacgg 1495561 tcttcacctg cgcgatcagc caggcccaga agttgggtgg tttggtgtgg ggattgtcga 1495621 cgcgaaagaa cttgacgccg tggttaaccc aatgttgcac cacgcgcagc acttcgtcgt 1495681 acaggccctc gggatcgttg tcgaagttga gcggatagat gtcctggtac ttcttcggtg 1495741 gattctccgc gtaggcgatg gtgccgtccg gcagctcggt gaaccactgc cggtgttcgc 1495801 gggcccacgg atgatccggt gcgcattgca gcgccaggtc cagcgcgacc tccatgccca 1495861 gatcgcgtgc cgcggagacg aagtcgtcga agtcgtcgat ggtgcccagg ctgggatgaa 1495921 cggtatcgtg accgccctca tcgctaccga tcgcccacgg cgatcccacg tctgtcggtg 1495981 cggcggtggg cgagttgttg cgacccttgc gatgcacctt gccaattgga tggatcggcg 1496041 gcaggtacac cacgtcgaac cccatgccgg cgatgcgcgg aagttctgcc gcagcggtgg 1496101 cgaaggtgcc gtgtaccggg ttgccgtcgt cgtcccaccc gccggttgag cgcggaaaca 1496161 tctcatacca agcgccgaac cgggccaacg gccgatccac ccagacgccg aattgctcgc 1496221 cccgggtgac caggtcccgc agcggatagt cggccagcag ctcttcgatt tccggtgtca 1496281 gggccaacgc ggtgcgggtc accgggtcac cgggggtccg cagcgctgcc gcggccgcca 1496341 ggaggggatc gcgtaacccg cgcggcacac cggtcgccgc gcgctccaac agcaccgcgc 1496401 ctaccaacag gtcgttggac agctcggtct ctccctggcc ggcatctagc ttggctatca 1496461 gcccatggcg ccaggtgtgg atcgggtcac cccaaccatc cacccggaag gtccacaatc 1496521 cgacccggtc gggggtgaac tggccgtgga aaacgaaggg ctcctggccg ctcgtcatcg 1496581 ggatcagcag cggcttgacg cgttgttggg gctcgctcgg cgtcggaagc accctggccc 1496641 ggggtctgtc ggtgaggtgt gggtaacgca ctccgaggta gcgcacgacc agcgtcgctg 1496701 cgacggcctc gtggccttca cgccagaccg ccgcgctgac cgggaccacc tcgccgacca 1496761 ccgccttggc gggatatacg ccgcacgaaa cgacgggcgc gacgtcatcg atttcgacac 1496821 gaccgggcac ccaccactcc gtttccgttc cgattgcccg gccactcacc gggacatctt 1496881 gtatgtgtcg ttccttgtgt gtccttcttg cgcccgatac ccaccctagt atccgatcac 1496941 acccgcgaag gcacagcggt cggcgggcgc actgcacgcg gtggcatcct cagtaaggta 1497001 aggacgcgtg aaagcccttc gccggtttac cgtccgagcc cacctacccg aacgtcttgc 1497061 cgccctggac cagctgtcta ccaatctgcg gtggtcctgg gacaaaccga cacaggatct 1497121 gttcgcggcg atcgaccctg cactgtggga gcaatgcggt catgatccgg tggcgctgct 1497181 gggcgcggtg aacccagcgc gtctcgacga acttgcgctg gacgcagaat ttttgggcgc 1497241 cctcgatgag ctggcggccg acttgaacga ctacctgagc cgtccgctgt ggtatcagga 1497301 gcagcaggac gccggggtag ccgcacaagc cctgccgacc gggatcgcgt acttctcgct 1497361 ggagttcggg gtagccgagg tgttgcctaa ttactcgggc ggtcttggga ttctcgccgg 1497421 cgaccatctg aaatccgcgt ccgatctggg cgtgccgctg atcgcggtgg ggttgtacta 1497481 ccgctccggc tacttccggc aatcgcttac cgcggacggc tggcagcacg agacctaccc 1497541 atcgctggac ccgcaagggc tgccgttgcg tctgctcacc gacgccaacg gggatccagt 1497601 gctggtcgag gtcgccctgg gagacaacgc cgtgttgcgc gcccggatct gggtagcgca 1497661 ggtgggtagg gttccgttgc tcttgttgga ttctgatatc ccggagaacg agcacgacct 1497721 gcgcaacgtc accgaccgcc tctacggtgg cgaccaggaa catcgcatca aacaagagat 1497781 cctggccggc atcggcgggg tgcgggcgat tcgtgcgtac accgccgtcg aaaagctcac 1497841 cccgcctgag gtcttccaca tgaacgaggg ccacgccgga ttcctcggca tcgaacgcat 1497901 ccgtgaactg gtcaccgatg cgggtttgga tttcgacacc gcattgactg tggtgcggtc 1497961 cagcacggtg ttcaccactc atactcccgt ccccgccggg atcgaccggt tcccgctcga 1498021 gatggtgcag cgctacgtca atgaccagcg cggcgatggc cggtctcggc tgttgcctgg 1498081 gttgccggcc gaccgcatcg tcgcgttggg cgccgaggac gatccggcca aattcaacat 1498141 ggcacacatg ggcctgcggc tggcgcagcg ggccaacggc gtctcgttgc tgcatggccg 1498201 ggtcagtcgt gccatgttca acgagctgtg ggcgggattc gaccccgatg aggtgccgat 1498261 cggctccgtc accaacggtg tgcacgcgcc cacctgggcg gcgccgcagt ggttgcagct 1498321 gggccgcgag ctggccgggt cggactcttt gcgcgagccc gtcgtttggc agcgactgca 1498381 tcaggtcgat cctgctcatc tgtggtggat ccgctcacaa ctgcggtcga tgctggtgga 1498441 ggacgtccgg gcgcggttgc ggcaatcatg gctggaacgt ggtgcaacgg atgccgaact 1498501 gggttggatc gcgacggcat tcgatccgaa tgtgctcacc gtcggcttcg cccggcgggt 1498561 cccgacctac aagcggctga cgttgatgtt gcgcgatccc ggtcggctcg agcaactgct 1498621 gctcgacgaa cagcggccga tccagctgat agtggctggg aagtcgcacc cggccgacga 1498681 cgggggcaaa gcgctgatcc agcaggtggt gcggttcgcc gaccggccgc agttccgcca 1498741 ccgcatcgcc ttcctgccga actacgacat gtcgatggcc cggctgttgt actggggctg 1498801 cgacgtctgg ttgaacaacc cgctgcggcc gctagaggcg tgtggtacct cgggcatgaa 1498861 aagcgcgctt aacggcgggc tgaatttgtc gatccgtgac ggctggtggg acgagtggta 1498921 cgacggcgaa aacggttggg agataccgtc tgccgacggt gtggcggacg agaaccgtcg 1498981 cgacgacctg gaggccggcg cgctctacga cctgctggca caagccgtgg caccgaagtt 1499041 ctacgagcgc gatgaacgcg gggtgccgca gcggtgggta gagatggtcc ggcataccct 1499101 acaaacgctc gggcccaagg tgctggcttc tcgaatggtg cgcgactacg tcgagcatta 1499161 ctacgcgccg gcggcgcagt cttttcgccg gaccgcgggc gcccagttcg acgcggcccg 1499221 cgagctggcc gactaccgcc ggcgcgcgga agaagcgtgg cccaagatcg agattgccga 1499281 cgtcgacagc accggtctgc cggatactcc actgctcggg tcccagctga ccctgacggc 1499341 aaccgtgcgg ctggccgggc tgaggccaaa cgacgtgacg gtgcaggggg tgctgggcag 1499401 ggtcgactcc ggcgatgtgc taatggatcc ggtcaccgtc gagatggcgc ataccggcac 1499461 cggcgacggc ggctacgaga tcttctcgac gacgacgccg ctgccgctgg cggggccagt 1499521 cggatacacc gtgcgggtgc tgcctcgcca cccgatgctg gccgccagca acgagctcgg 1499581 cctggtcacc ctggcctgac ccgccgagaa gacgcaaaag ctcctaaatc tggccgattt 1499641 agtgggcttt tgcgtctgct cgcgcaaggc gccgcagggc cgcgcgcact tgcgtggcgt 1499701 tggtggtctg ccaaaagggc ggcagcgagg ctcgcaggaa ttcgccatag cgggcggtag 1499761 ccatccgtga atcgagcacc gcaaccacgc cccgatcggt gacgcgccgt aacagccggc 1499821 cggatccctg tgccagcagc agcgccgcgt ggctggcggc gaccgtcatg aagccgttgc 1499881 cgccacgggc ggccaccgca cgctggcggg cactcagcag gggatcgtcc ggccggggga 1499941 acgggatgcg gtcgatcaac accaacgaca gcgacggtcc cggcacgtcg accccctgcc 1500001 acagcgacag cgtgccgaac agggaggtcg ccgcatcggc ggtgaacttc tccaccagcg 1500061 tggacgtact gtcgtcgccc tgacacaaca ccggcgtgga cagccgttcg cgcatggcct 1500121 cggtggctgc ccgggcggcc cgcatggacg agaacagccc cagggtgcgc ccacctgcag 1500181 cggtgatgag ttcggcgatc tcggtcagtt gttcggccga gccgctgccg tctcggcccg 1500241 gcggcgggag atgggcggcc acgtagagga ttcccgactt tgcgtgctgg aaaggcgagc 1500301 ccacgtccag gccacgccag ggcgtgtctg cagtcaggcc ccatgccgtg gccatcgcgt 1500361 caaacgaccc gccgattgtc agcgttgccg aggtcaatac ggtcgttgca cgggcgaaca 1500421 cctgggtggc caacagctcg gccaccgata gcggagccac ccgcagcacc gcgcgagccg 1500481 attcgtggtt gtcctcgtgc tccagccaaa ccacgtcgct gcggtcaggg atagcggggg 1500541 cgaacgacgc caggattcgt gacgcggtat cggatatttc ggtcagtacc gcgcccgctt 1500601 cggcgcgcac ggacgccgtc gtggtgtcgc tgccggtatc gatcgctgag cgcgccgcac 1500661 tggccgcatc gcgcagcgcg ctcagatagg tcgccatctc gtcatcgagg caatcaatgc 1500721 ggcccggtct ggcgtcgtga atcgccgaac tgaaggtagc cgaagccgcc tgaagccgct 1500781 gggtcacttt cgggtcgacc agccgggtga tccgtcgtgc ggccataccg agcgtggcag 1500841 acgtcagctc agcggcggct accgaggtca cccggtcggc caattcgtga gcctcgtcga 1500901 caaccagcag ccgatgttct ggcagtaccg ccgattcggc gacggcatcg atggccagca 1500961 gcgcgtggtt ggtgacgacg acatcggcca ggccggccgc tccacgagcc cgttcggaga 1501021 agcactccga gccaaacggg cagcgggcca cgccgaggca ttcccgcgcc gaaacgctga 1501081 cctgcgacca ggatcggtct cccacaccgg gcttaaggtc gtcgcgatca ccagacacgg 1501141 tcgtcgaagc ccaggcggtt agccgttgca catcgcgtcc cagcgcggtg accgccaccg 1501201 ggtcgaagag ctcctcctgc ggccgctcgt cgtcatggtc actggctgtg actgagttgt 1501261 ggatcttgtt caggcacagg tagttccgtc gacctttgag cagggcgaac ttcggtcggc 1501321 gggggagcgc attggtgagc gaatctacca gctggggcag gtcacgatcg acgagttgac 1501381 gttgcaaagc gatcgtcgcc gtcgacacca cgaccggcgc gtcgtcgcaa agagcgcgga 1501441 tgatcgcggg aaccagatac gccagcgact tgccggttcc ggtgccggcc tggaccacca 1501501 agtgctcacc ggtttcaaac gcatgcgcta ccgcggcggc catctcttgc tggccgcgac 1501561 gccgggtgcc gccaagtgcc gccacggcga tggcaagcag ctcaggcaca gacatggata 1501621 ccgactcgga cacgggacgt ggtcacatcc ttgcgctcag gccgggatcg tgcgtgtcgg 1501681 aatcgccggc tcgccgggtg ccaatttcag cccgtcggct ggcaggctgc gcaatccgga 1501741 tgccaccagc tgccgtgccg cggccaggct ggtatcggct accggttgcc cggcgcggac 1501801 cagtggcagc gtcaaaaccc ggtgcggctc gacaatgacc ggcggacggc ccgccggatg 1501861 cacgagctcc tcggtgatgg tgcccgtcgc acgggagcgc cgcagtgcct ctttgcggcc 1501921 gccgggggat tttttgtagc tgctgcgctt ttgcaccggt acaccgtcta cctcgaccag 1501981 tttgtagacc atgttggcgg tcggcgcgcc cgacccggtg accagcgacg tgcccacgcc 1502041 gtagctgtcg acgggttcac cgcgcaacgc ggcgatgctg aactcgtcaa ggtcgccgga 1502101 caccacgatg cgcgtccggg tggctcctag ccggtcgagc tgctcccgcg cttggcgggc 1502161 cagtacccca agctcaccgg aatcgatgcg gatcgcgccg agctcagcgc cggcggcggc 1502221 aacggcattg gccacaccgg tcgtgacgtc ataggtatcc accagcagcg tggtaccggg 1502281 tcccagcgct tcgacctggg cgcggaatgc ggctcgctcg gctagttcgg tggggccgcc 1502341 atgctgggcg tgcaacatgg tgaatgcgtg tgccgcggtg ccgtgcgcgg gcactccgta 1502401 gcgtcgctgc gccgccaagt tggatgacgc ggcgaaaccg gcgatatacg ccgcccgggc 1502461 cgctgccacc gcggcgcgtt cgtgggtgcg ccgcgagccc atctcgatca gtgggcgccc 1502521 cccggcggcg ctgaccatgc gcgccgctgc cgaggcgatc gctgtgtcgt ggttgaagat 1502581 tgacagcacc agcgtttcga gcaggacgca ttcggcgaag ctgccgcgta ccgagagcac 1502641 cggtgacccg ggaaaataca gctccccctc ggcatagccg tcgatatcgc cgcggaaccg 1502701 gaattcgcga agataccgca ccgtggccgg gtcgaggaat tgggccagca actcgcacgc 1502761 gtcagcgtcg aacctgaact gcggcaacgc ttccagcaac cggccggttc cggcgacaac 1502821 tccgtagcga cggccggtgg ggagtcggcg agcgaacacc tcgaatgtgg tggggcgatt 1502881 ggcgctgccg tcgcgcaggg cagccgccag catggtcaac tcgtacttgt cggtcaacag 1502941 cccggctggg tcttgattgt cgggctctcc ctctcgccgc ctggcggctg ggggtggccc 1503001 cacagcggtc cgacgcggtc cgcagcgtcg cccggttggg acccagtcgt tcacaccgcc 1503061 acggtatcgg ctcgcggcca cggtgcgctg ggtatcctgg ggccatggct gttgtgtcag 1503121 cgcccgccaa gccaggtacc acctggcagc gcgagtctgc tccggtcgac gtgacggaca 1503181 gggcatgggt caccatcgtg tgggacgacc cggtcaactt gatgagctac gtgacttacg 1503241 tgtttcagaa gttgttcggc tacagcgagc cgcatgccac caagctgatg ttgcaggtgc 1503301 acaacgaagg taaggcggtg gtgtccgcgg gcagccgaga gtccatggaa gtcgacgtgt 1503361 ccaagctgca tgccgccggt ttgtgggcga cgatgcagca ggaccggtga gattcgagga 1503421 tattcgggat ccatcgtgcg caggtggaag cgcgtcgaga cccgcgatgg tccccgcttt 1503481 cgatcgtcgt tggctccgca tgaggccgcc ctgctcaaga acctggcagg cgcgatgatc 1503541 gggctgctcg acgatcgcga ctcttcttcg ccgtcagacg aactcgagga gatcaccggc 1503601 atcaagaccg ggcatgcgca gcgtccgggt gacccgacct tgcgtcggct gttgccggat 1503661 ttctaccggc ccgatgacct ggatgacgat gatccgacgg ccgtcgacgg ctccgagagc 1503721 ttcaacgctg ccctgcgcag cctgcacgaa cctgagatta tcgacgccaa acgtgttgcc 1503781 gcgcagcagt tattagacac ggttccggac aatggcggcc ggttggagct gacggaatcc 1503841 gacgccaatg cttggatcgc cgccgtcaac gaccttcggc tggcgctcgg agtgatgctt 1503901 gagatcggcc cgcgtgggcc ggagcgcctg ccggggaacc acccgttggc cgcgcacttc 1503961 aatgtctacc agtggctgac agtcctgcag gaatacctcg tgctggtgct gatggggtct 1504021 cgatgatctg cgcggcggcc cgatgaactc catcaccgac gtcgggggca tccgggttgg 1504081 ccactaccag agactggacc ccgacgcgtc cctcggcgcc gggtgggctt gtggcgtcac 1504141 ggtggtgttg ccgccgcccg ggacggtcgg tgcggtcgat tgccgcggcg gcgcccctgg 1504201 aacccgcgag actgatctgc tggacccggc caacagcgtg cgcttcgtcg acgccctgtt 1504261 gctcgccggc ggcagcgcct acggtctggc cgccgccgat ggcgtcatgc gctggctaga 1504321 ggaacaccgg cgcggcgtcg cgatggacag cggcgtggtg cccatcgtgc cgggcgcggt 1504381 gattttcgac cttccggtcg gcggctggaa ttgtcggccg acagccgatt tcggctattc 1504441 ggcctgtgcg gcagccggag tcgacgtcgc ggtcgggacg gtgggcgtgg gggttggggc 1504501 gcgcgccgga gcgctcaagg gcggtgtcgg gactgcatcg gctaccctgc agtccggtgt 1504561 gaccgtcggt gtccttgctg tggtaaatgc cgctggcaac gtcgtcgatc cagccaccgg 1504621 cttgccgtgg atggccgacc tagtcggcga gttcgcgttg agggccccgc cggccgagca 1504681 gattgctgcg ctggcgcagt tatcgtcccc gctgggagcc ttcaacaccc cgttcaatac 1504741 gacgatcggt gtgattgcgt gtgacgccgc gctgagccct gcggcttgcc ggcgcatcgc 1504801 gattgccgcc cacgacgggt tggcccgcac catccggccg gcacacaccc ccttggatgg 1504861 cgacacggtt ttcgcgctgg ccaccggcgc ggtagcggtg ccgccggagg ccggcgtgcc 1504921 ggccgcattg tctccggaga ctcagctggt caccgcggtc ggtgcggcgg cggctgattg 1504981 cctggctcgt gcggtgctgg ccggcgtgct caatgctcag ccggtagccg gaataccgac 1505041 ctaccgtgac atgtttcccg gagcattcgg gtcctgaaac ttcggtgttg cttaggaaag 1505101 gaaccgtcta cgtgctggtg attcgcgcag acctggtgaa tgcgatggtg gcccatgcgc 1505161 gtcgcgacca ccccgacgaa gcctgcggag tgctggccgg acccgagggc tctgaccgtc 1505221 ccgagcggca tatcccgatg accaatgccg agcgctcgcc gaccttctac cggttggatt 1505281 ccggtgagca actgaaggtg tggcgggcta tggaagatgc cgacgaggtc ccggtcgtca 1505341 tctatcactc gcacactgcg accgaagcgt acccgagccg tacggacgtg aagcttgcca 1505401 ccgaacccga cgcgcactac gtgctggtgt ccacccgcga cccgcaccgg cacgagctac 1505461 gcagctaccg catcgtcgat ggcgctgtca ccgaggaacc tgtcaatgtc gtcgagcagt 1505521 actgaaccgt tccgagaaag gccagcatga acgtcaccgt atccattccg accatcctgc 1505581 ggccccacac cggcggccag aagagtgtct cggccagcgg cgataccttg ggtgccgtca 1505641 tcagcgacct ggaggccaac tattcgggca tttccgagcg cctgatggac ccgtcttccc 1505701 caggtaagtt gcaccgcttc gtgaacatct acgtcaacga cgaggacgtg cggttctccg 1505761 gcggcttggc caccgcgatc gctgacggtg actcggtcac catcctcccc gccgtggccg 1505821 gtgggtgagc ggagcacatg acacgatacg actcgctgtt gcaggccttg ggcaacacgc 1505881 cgctggttgg cctgcagcga ttgtcgccac gctgggatga cgggcgagac ggaccgcacg 1505941 tgcggctgtg ggccaagctc gaggaccgca atccgaccgg gtcgatcaag gaccgcccgg 1506001 ctgtgcggat gatcgagcag gccgaggccg acgggttgtt gcggccgggc gccaccatcc 1506061 tggagcccac cagcggaaac accggcattt cgctggcgat ggcggcccgg ttgaaggggt 1506121 accgattgat ctgcgtgatg ccggagaaca catcggttga acggcggcag ctgctcgagc 1506181 tctacggcgc gcagattatc ttctcggcgg ccgaaggcgg gtccaacact gcggtggcca 1506241 ccgccaaaga gctggccgcg accaacccgt catgggtgat gctgtaccag tacggcaatc 1506301 ccgccaacac cgactcgcac tactgcggca ccggccccga gctgctggcc gacctgcccg 1506361 aaatcacgca cttcgtcgcc ggcctaggca ccacgggcac gctgatgggc actggccgtt 1506421 tcctgcgcga gcacgttgcc aacgtcaaga tcgtggcggc cgaaccccgc tacggtgagg 1506481 gggtatacgc cctgcgcaac atggacgaag gctttgtgcc cgagctgtat gacccggaaa 1506541 tactgaccgc gcgatattct gtcggcgcgg tggacgcagt gcgccgcacc cgcgagttgg 1506601 tgcacaccga aggcatcttt gcgggcatct caaccggcgc ggtgctacac gccgcactcg 1506661 gagtcggggc cggcgccctg gcggccggcg agcgggccga cattgcgttg gtggtcgccg 1506721 acgccgggtg gaagtatctg tccaccggcg cctacgccgg tagcctggat gacgccgaga 1506781 ccgctctgga agggcaacta tgggcatgac cccgcgccgg aagcgacggg gaggagcggt 1506841 gcagataaca cggcccacag gccgtccgcg aacaccgaca acgcagacga cgaagcgccc 1506901 gcgctgggtg gtcggcggga cgacgatcct caccttcgtc gcgctgctct atctcgtcga 1506961 actgatcgac cagctgtccg ggagtcggct ggacgtcaac ggcatcaggc cgctgaaaac 1507021 agacggcctg tggggcgtca tctttgcgcc acttttgcac gcgaactggc accacctaat 1507081 ggccaatacc atcccgctgc tggtgctggg gtttcttatg acgctggccg ggctgtcccg 1507141 gtttgtctgg gccaccgcga tcatttggat tctgggcggc ttgggcactt ggctgatcgg 1507201 caatgtgggc agcagctgtg gcccgaccga ccatatcggc gcctctggcc tgatctttgg 1507261 ctggctggcc ttcctattgg tgttcgggct ttttgtgcgc aagggatggg atatcgtcat 1507321 tgggctggtg gtcttgtttg tctatggcgg catcctgctc ggcgcgatgc cggtgctggg 1507381 ccagtgtggt ggcgtgtcat ggcagggtca tttaagtggt gcggttgctg gcgtcgtggc 1507441 ggcgtatctg ttgtccgctc cggagcgtaa ggcccgtgca ctgaaaaggg ccggcgcgcg 1507501 ttccgggcat ccgaagttat gaattcgccg ttggcgcccg tcggagtctt tgattccggc 1507561 gtcgggggac tgacggtcgc gcgggccatc atcgaccaac tgcccgacga ggacatcgtc 1507621 tacgtcggcg acaccggcaa cggcccgtac ggtccgctga ccatcccgga gatccgggcg 1507681 cacgcgctgg ccatcggcga cgatctggtc ggccgaggcg tcaaggcgtt ggtgatcgcc 1507741 tgcaactcgg cgtcgtcggc gtgcctgcgg gatgctcgcg agcgctacca ggtgcccgtc 1507801 gtcgaagtga tactgccggc ggtgcggcgt gcggtggccg ccacccgcaa cggccgcatc 1507861 ggggtaatcg gcacgcgggc gaccatcact tcacacgcct atcaggacgc gttcgctgcg 1507921 gcccgcgaca ccgaaatcac cgcggtggct tgccctcgct tcgtggactt cgtcgagcgc 1507981 ggcgtcacca gcggtcgtca ggtgctcggt ctggcgcagg gctacctgga accgctgcag 1508041 cgcgccgagg tcgacacgct agtgctgggc tgtacgcact atccactgct gtccggactg 1508101 attcaactgg cgatgggcga gaacgtcacg ctggtctcca gcgccgagga gaccgctaag 1508161 gaagtggtcc gggtgctcac cgagatcgac ttattgcgtc cgcatgacgc gccgccggca 1508221 actcggatat ttgaagctac gggcgacccc gaagcgttta ccaaattggc cgcacgattc 1508281 ctgggtccgg tgctcggtgg tgtgcaaccc gttcacccat cgcgcattca ttaggccatg 1508341 gaagagattc tcgtcaccga atgcgtcgat gtattccgca tcgttgtatc gggcatggca 1508401 cagtagtgtc cgtgcgaata accgtgctcg gatgctccgg tagcgtcgtg gggccggatt 1508461 cgcctgcgtc ggggtatttg ctccgagcgc cgcacacacc gccgttggtt atcgacttcg 1508521 gcgggggtgt gctcggcgcg ctgcaacggc acgcggatcc cgcgtcggtg catgtgctgc 1508581 tgtcgcatct gcatgcggac cattgtctgg acttgccggg actttttgtg tggcggcgtt 1508641 accacccgtc gcgtccctct ggcaaggcat tgttgtacgg ccccagcgac acctggtcgc 1508701 gattgggggc ggcgtcgtcc ccgtacggtg gggagattga cgactgttcg gatatcttcg 1508761 atgttcacca ctgggccgac agtgagccag tgacgttggg cgcccttacg atagtgccgc 1508821 ggctggttgc ccacccgact gagtcgtttg gcctgcggat caccgatccg agcggtgcgt 1508881 cactggctta tagcggcgac accggcattt gtgaccagct cgtcgagctg gctcgcggcg 1508941 tcgacgtttt cctctgcgag gcctcctgga cacactcgcc caaacatcca cccgatctac 1509001 acctgtcggg caccgaagcc ggtatggttg ccgcgcaagc cggcgttcgt gagctgctgc 1509061 tgacgcatat cccgccgtgg acttcgcgtg aggacgtcat cagcgaggcc aaggccgagt 1509121 tcgacggccc ggtgcacgcg gtggtatgcg acgagacgtt cgaagtccgg cgagccggct 1509181 aggtctaggg ttggcgtcgt gtccaagcga gaagacggcc ggctcgacca cgagcttcgc 1509241 ccggtgatca tcacccgcgg tttcaccgaa aacccggcgg gatcggtgct catcgaattc 1509301 ggtcacacca aggtcctgtg caccgccagc gtcaccgaag gggtgccccg gtggcgtaaa 1509361 gcaaccggtc tggggtggct caccgcggag tacgccatgc tgccgtcggc cacccacagc 1509421 cgctctgatc gcgagtcggt gagaggcagg cttagcgggc gtactcagga aatcagtcgg 1509481 ctcatcagcc ggtcgctgcg cgcatgcatc gacctggcgg cgctggggga gaacacgatc 1509541 gctatcgatt gtgatgtgtt gcaggccgat ggtggcactc gaaccgcggc catcaccggc 1509601 gcctacgtgg cattggccga cgcagtgacc tacttgtcgg cggcgggtaa gttgtccgac 1509661 cccaggccat tgtcgtgtgc catcgccgcg gtcagcgtcg gtgttgtcga cggcaggatc 1509721 cgggtggatc tgccctacga ggaagattcg cgcgccgagg tcgacatgaa cgtcgtcgct 1509781 accgacaccg gaaccctggt agagattcag ggcaccggcg aaggcgcgac gttcgcacgt 1509841 tcgacactgg ataagctgct ggacatggca ctgggcgcct gcgacacgtt gtttgccgca 1509901 caacgcgacg cgttggcgct gccgtatccg ggtgtgctgc cgcagggacc gccaccgccg 1509961 aaggcgtttg gcacctgacc gcgccgcgac gatgcagagc ggagcgatga ggaggagtgg 1510021 cgcttgtgac caagcttctg gtcgccagcc gcaaccgcaa aaagctggcc gaactgcgcc 1510081 gggtgttgga cggcgccgga ctatcgggtt tgacgctgtt gtcgctgggc gatgtgtcgc 1510141 cgctgcctga aacaccagaa accggtgtga cattcgagga caacgcgctg gccaaggcgc 1510201 gcgacgcgtt ctccgcgacc ggacttgcca gcgttgccga cgactccggt ttggaggtgg 1510261 ccgcactggg cggcatgcct ggcgtgctgt cggcccggtg gtccggcagg tatggcgacg 1510321 atgccgcgaa caccgcgctg ttgctggcgc agttgtgcga tgtgcccgat gagcggcgcg 1510381 gagcagcgtt cgtgtcggcc tgcgcgttgg tctcggggtc cggcgaagtt gtcgtgcgcg 1510441 gtgaatggcc cggcacgatc gcccgtgagc cgcgcggtga cggcgggttc ggctacgacc 1510501 cggtcttcgt cccgtacggt gacgaccgca cagcggccca gctgagcccg gcggaaaagg 1510561 acgcggtatc ccatcgcggt cgcgcgttgg ctctgctgct gccggcgctg cgctccctgg 1510621 cgacaggcta aagcccgaag cgggccttga tctctttggt ctggaagtgc tcgacgacga 1510681 tgccgagcag cggaattgtg ccggcgagca gaacaccggc tgttttgccg agcggccagc 1510741 ggaccttgac cgccaggttc aacgtcagaa gcagatacgt gaagtacacc cagccgtgca 1510801 ccacaccgat ccacgtcggc ggattgtcaa ccttgacgac gtagcggacc acgatctcgt 1510861 agcacagtgc gatgagccag aggcccgtcg tccacgccat gatccggtag ccgagcaaag 1510921 cggtgcgaat cctctcgacg gcgatggcag gctcggcgtg ctgcgccgcg ggcgtttcgg 1510981 gtgcggtcat gcggtggtcc tgttctgctt cctggcatcg tccttggcta gctcggctag 1511041 gtaggcgttg tattcccgta gtacgggatc gtcgggtggc tgctgcgccg gcttcggccg 1511101 ctcgggcagc aatccggcag gtatcgcggc ggcggcgccg ccggtgggcg gttgcggggg 1511161 cgtctcttca taccgaacga agttgcggta cgcgtagacg cagaaccaag caaacaatgg 1511221 ccactgcaac gcgtaaccca gattttgaaa ggtgcccgag gtcgattgaa acctggtcca 1511281 ctgccaccaa cccagggcca ggcaaccaca ggtcgcgatg atcaccaacg cgatcagcgc 1511341 gggtctgcga cggcgggtag tggacacccc acgacgttac cgcgcactgc tctattgggc 1511401 gcccgggcgc gatgtggcga tatccactaa gtacaaggct agccttgcct aataccccag 1511461 gtgtagcctc cttcgccatg acctcatcgc cgtccaccgt cagcactacg ctgctgagca 1511521 tcctgcgcga cgacctcaac attgacctga ctcgagtcac gcctgatgcc aggttggtcg 1511581 acgatgtggg actggattcg gtggccttcg cggtcggtat ggtggccatc gaggagcggc 1511641 tcggagtcgc actgtccgaa gaggagctct tgacgtgcga cacggtcgga gaactggagg 1511701 cagcgatcgc ggccaaatac cgcgatgagt gagctcgcgg ccgtgctcac gcggtccatg 1511761 caggcctctg ccggcgactt gatggtcctc gaccgcgaga cctcgctgtg gtgtcggcac 1511821 ccgtggcccg aggtacacgg gctggccgag agcgtagcgg cctggctgct agaccatgac 1511881 cgacccgccg cggtgggtct ggtcggcgaa ccgacggtcg agttggtcgc cgcgatccag 1511941 ggtgcctggc ttgccggcgc tgccgtgtcg atcctgcccg ggccggtacg tggcgccaat 1512001 gaccagcgat gggcggacgc gacgttgacc cgtttcctcg ggattggggt gcgcaccgta 1512061 ttgagccagg gttcctacct tgcccgcctg cgatcggtcg atacggccgg cgtaacgatc 1512121 ggagatctca gcacggcggc gcacaccaat cgttcggcca caccggtggc gagtgaaggg 1512181 cccgcggtcc ttcaaggtac cgcgggatcg acgggcgcgc cccgtaccgc catcctttcg 1512241 ccgggcgcgg tgctcagcaa cttgcgtggg ctcaatcagc gcgtgggcac cgatgctgcg 1512301 accgacgtcg gttgctcatg gttaccgctg taccacgaca tggggctcgc tttcgtgctc 1512361 tctgctgcgc tggccggtgc gccgctctgg ttggccccga cgacggcgtt cacggcgtcg 1512421 ccgttccgtt ggttgagttg gctctcggac agtggtgcca ccatgaccgc ggcaccgaac 1512481 ttcgcctaca acctcatcgg caaatacgcc aggcgggtat ccgaggtcga cctgggtgcc 1512541 ctgcgagtga cgctcaacgg tggagagccg gttgactgcg atgggctgac gcggttcgcg 1512601 gaggcgatgg caccgttcgg attcgatgcc ggcgccgtgt tgccctccta cgggctcgcc 1512661 gagtcgacgt gcgcggtgac cgtgccggtc cccggaattg ggttgcttgc cgaccgtgtc 1512721 atcgacggca gcggtgcgca taagcacgcg gtcctgggta accccatccc cggtatggag 1512781 gtacggatct cgtgcggtga tcaggcggca ggcaatgcga gccgtgaaat tggcgaaatc 1512841 gagattcgcg gtgcgtcgat gatggcgggt tacctgggtc agcagccgat cgaccctgac 1512901 gattggtttg ccaccggcga cctcggctat cttggcgctg gcggcctggt ggtgtgtggt 1512961 cgcgcgaagg aagtcatctc catcgcggga cgcaacatct ttccgacgga ggtcgagctg 1513021 gtggcagcgc aagttcgcgg agtgcgcgaa ggcgccgtgg tcgccttggg caccggtgat 1513081 cgctcgaccc gccccggtct ggtggtcgcg gccgagttcc gcggcccaga cgaggcgaac 1513141 gcccgcgccg aactgatcca acgcgttgcg tccgagtgcg gtatcgtccc gtccgacgtc 1513201 gtcttcgtgt cgcctggatc actgccccgg acgtcgtctg gaaaactgcg ccgcttggca 1513261 gtccggcgct ccctggagat ggcggactga tgacggccgg ctccgacctc gacgacttcc 1513321 gcggtttgct cgccaaagcg ttcgacgagc gggtggtggc atggaccgca gaagcggaag 1513381 cgcaggaacg ttttccgcgc cagttgatcg aacacctggg tgtctgcggc gtattcgatg 1513441 cgaagtgggc gaccgacgcc cgtcccgacg tcggtaaact cgtcgaactc gctttcgcgt 1513501 tgggccagct ggcctctgcc ggcatcggtg tgggtgtcag cttgcatgac tcggcgatcg 1513561 cgattttgcg ccggtttggt aagtcggact acttgcggga tatctgcgat caggcgatcc 1513621 gtggcgccgc ggtgctgtgc atcggagcct cggaggagtc cggcggatcc gacctgcaga 1513681 tcgtcgaaac cgagatacgg tcccgtgacg gtggtttcga ggtccgcggc gtcaagaaat 1513741 tcgtgtcgct gtctccgatc gccgaccaca tcatggtggt ggcccgcagc gtcgaccacg 1513801 atccgaccag taggcacggc aatgtcgcgg tcgtggccgt gccggccgca caagtcagcg 1513861 tgcagacccc ctaccgcaag gtcggtgcgg gaccgctgga taccgccgcg gtctgcatcg 1513921 acacctgggt accggccgat gcactggttg cgcgggccgg cacggggctg gcagccatca 1513981 gttggggact ggctcatgag cggatgtcga tcgccgggca gatcgcagcg tcgtgtcaac 1514041 gggcgatcgg aatcaccctg gcccgcatga tgagtcgacg tcagttcggt cagacgctgt 1514101 tcgaacacca ggcgctgcgg ctgcgtatgg cggacctgca ggcgcgtgtc gatctgctgc 1514161 ggtacgcgct gcacggcatc gctgaacagg ggagactgga actgcgcacg gcggcagcgg 1514221 tcaaagtcac cgccgcccgg ctcggtgagg aagtcatctc cgaatgcatg cacatcttcg 1514281 gtggggcggg ttatcttgtc gacgaaacga cgcttggcaa atggtggcgg gacatgaagc 1514341 tcgcccgggt cggcggcggc accgacgagg tgctgtggga attggtggct gccggcatga 1514401 cgcccgatca cgacggttac gcagccgtgg tcggagcttc caaagcgtag agcgccatgc 1514461 gccggtttgt cgtgtcatgc tcaccgagga acttgcatcc ggcccactca cacaaccgac 1514521 gggtcgcggt gttgcggtga tcggggtcga acatgatccg ccggcaacgc ggctcgttgg 1514581 caaagacgct ggccacgatc cgcggtagca gcagcgggcc gaagccccga ttgaccttcg 1514641 acaagtccgc gatggccgcg tgcagcccca aatcgtaggg gtctgcgtcg tagtagtgag 1514701 aaatcaaatc ctttgctgcc cagtataatt cgagataacc accatctgtt ccgtgccagc 1514761 tgccgatcaa tggcaacgaa taggttccct caagttgggc gttcaggtgt tgacgccaac 1514821 gtgacgccgg ccagtcgtac tcccaggccg ccgccagatg aggacggttc atccactccg 1514881 ccaacatctc cgcgtcggtc agctgtgcga cccgcaaccc gtatggcggc tccaacgatg 1514941 gaacgggcgg gcgggcgagg cgtcgtacct ggtcaggtag gtcgaatcgc tcgcgggcta 1515001 gccgaaccag cgcgtcgtcg gcctggccag cggatgtggg tttggtcatt gcgggccgag 1515061 cttaccggag ggctcgctgc ttaggttagg catgccatac atgcgtgagc cgggatcacg 1515121 tcgcccgctg cccggctgtc cgggggtcga ggcggtacga tcgctacgcc cgcgggcgtg 1515181 atgaaattgg caaacatgcc ggttttaggt gccggtgctc gaaagagttt gagggttcga 1515241 gtccctccgc ccgcactcca tggtccccga gtttgacctt cggtaaggca acccttagtt 1515301 tggacgagat cgtccgactg gggccgactg ggttgtatgc gcgggctgag tatcagcgcg 1515361 gtcgcggcgc agctcggggt atcggcggag cgcgacgccg ttgcacgccg gttggccggt 1515421 aacccagcgt tcgtggtcgc ccgatctgag aagtcgtggc ggattaggcc gccgcgagag 1515481 aggaccgctg atggcacgcg ggttgcaggg tgtgatgttg cgcagtttcg gcgcgcgcga 1515541 ccacaccgca acggtgatcg aaaccatttc gattgcaccg catttcgtgc gggtccggat 1515601 ggtttcgccg acgctcttcc aggatgcgga ggctgagccc gccgcatggc tgcggttctg 1515661 gttccccgac ccgaacgggt ccaacaccga gttccagcgc gcctatacga tctccgaagc 1515721 tgaccccgcc gcgggccgct tcgcggtcga cgttgtattg catgacccgg cgggtccggc 1515781 ctcgtcgtgg gcgcgcaccg tcaaacctgg cgcaaccata gcggtcatgt cgctgatggg 1515841 ctcatcgcgg ttcgacgtgc ccgaggagca gcccgccggg tatctgctaa tcggcgactc 1515901 ggcgtcgatt ccggggatga acgggatcat cgaaacggtc ccgaacgacg tcccgatcga 1515961 gatgtacctt gaacaacacg acgacaacga cacgttgatc ccgctcgcaa agcatccccg 1516021 gctgcgggtg cgctgggtta tgcgccgcga cgagaaatcg ctggccgagg cgatcgagaa 1516081 ccgcgactgg tcggactggt atgcgtgggc gacgccagag gctgccgcgc tgaaatgcgt 1516141 ccgggtgcgg ctgcgcgacg agttcgggtt ccctaagtcc gagatccacg ctcaggctta 1516201 ctggaacgcc gggcgtgcca tgggcaccca ccgagcaacc gaaccggcgg ccaccgaacc 1516261 tgaggtgggc gcagccccgc agccagaatc ggcggtgcct gccccggcgc gtggcagctg 1516321 gcgcgctcag gctgccagcc ggctgctggc gccgctaaag ctgccgctgg tgctctcggg 1516381 tgtgcttgcg gctctggtca cgctggcgca gttggcgccg ttcgtgctgt tggtcgagct 1516441 gtcaaggctg ctggtctccg gcgccggcgc gcaccggttg ttcacggtcg ggttcgccgc 1516501 ggtggggttg ctggggaccg gggccttgct ggcagccgcc ctcacgctgt ggctgcacgt 1516561 gatcgatgcc cgcttcgcca gggcgttgcg cttgcggctg ctgagcaagc tgtcccggtt 1516621 gccgctgggc tggttcacca gccgcgggtc cggatcgatc aaaaaattgg tcaccgacga 1516681 cacgctggcg ttgcactact tggtcaccca tgccgttccg gacgcggtcg ccgcggttgt 1516741 cgccccggtg ggggtgctgg tctatctgtt cgtcgtggac tggcgagtgg cgctggtctt 1516801 gttcgggccg gttctggtct acctgaccat cacgtcatcg ctcacgatcc aatccgggcc 1516861 ccgcattgtt caagcgcagc ggtgggcaga gaagatgaac ggcgaagcgg gtagttacct 1516921 cgagggtcag ccggtgattc gcgtcttcgg cgccgcgtca tcgagcttcc gtcgccggtt 1516981 ggacgagtac atcggattcc tggtcgcctg gcagcggccg ctggccggca agaaaaccct 1517041 gatggatctg gccactcgcc cagcaacgtt cctgtggctc atcgccgcta ccggcacctt 1517101 gttggtagcc acgcatcgaa tggatccggt gaatttgttg ccgttcatgt tcttgggtac 1517161 cacgttcggt gcccgcctgc tcgggatcgc ctacgggctc ggcggcctac gcacgggact 1517221 tctggcggcc cggcacctgc aagtcacact cgacgaaacc gaactcgccg tgcgggaaca 1517281 tccgcgcgaa ccgctcgacg gcgaggcgcc agcaactgtg gtgttcgacc acgtcacctt 1517341 cgggtaccgc cctggagtgc cggtgatcca ggatgtatcg cttacgctgc ggccgggcac 1517401 ggtcaccgcg ctcgtcggcc cgtccggctc cggcaagtcg acactggcca ccctgctggc 1517461 tcgattccac gatgtcgagc gaggtgcgat acgcgttggt ggacaggata ttcgatcact 1517521 ggccgcggac gagctgtaca cgcgagtcgg ctttgtgcta caggaagccc agcttgtgca 1517581 tggcaccgcc gccgaaaaca tcgcgctggc ggtaccggat gcccccgccg aacaggtcca 1517641 ggtcgcggcc cgcgaagcgc aaatccacga ccgggtgctt cggctgccgg acggctacga 1517701 taccgtgctc ggagccaaca gtggtctttc gggcggggag cgacagcggc tcaccattgc 1517761 ccgtgccatc ctcggcgaca ctccggtcct catcctcgac gaggccaccg cgtttgccga 1517821 tccggaatcg gaataccttg tgcaacaggc gcttaaccgg ctgacccggg accgcaccgt 1517881 gctggtaatc gcccatcgac tgcataccat cacccgggcc gaccagatcg tcgtgctcga 1517941 tcatggtcgg atcgtcgaac gcggcaccca cgaggagttg cttgccgcgg gcggacgcta 1518001 ctgccggctg tgggacaccg gccagggcag ccgggtggcg gtcgccgcag cgcaggacgg 1518061 cacccgatga tccgcacctg gatagccctt gttccgaacg accaccgcgc caggctaatc 1518121 ggctttgcgc tgctcgcgtt ttgttccgtt gtcgcgcgag cggtgggcac cgtgttgctg 1518181 gtgccgctga tggcggcgtt gttcggggag gcgccgcagc gcgcgtggct gtggctgggc 1518241 tggctgtccg ccgcgaccgt ggccgggtgg gtgctagacg ccgtgaccgc acgcatcggt 1518301 atcgagctgg gtttcgccgt ccttaaccac acccaacatg atgtggcgga ccggcttccg 1518361 gttgtccggt tggattggtt taccgccgaa aacaccgcga cggcacggca ggcgatcgcg 1518421 gccaccgggc cggaacttgt tggcctggtg gttaatctgg tgacaccgtt gaccagcgcg 1518481 atcctgctgc cggcagtgat cgcgctggcc ctgttgccga tctcctggca gctcggcgtg 1518541 gctgcactgg ccggcgtgcc gttgctgctg ggggcgctgt gggcctccgc agcctttgcg 1518601 cggcgtgccg ataccgcagc agacaaagcc aataccgcgc tcaccgaacg gattatcgag 1518661 ttcgctcgga ctcaacaggc attgcgggcc gcccggcgcg tcgagccggc tcgaagtctg 1518721 gtcggcaacg ctctggccag ccagcacacc gcgacgatgc ggttgctggg catgcagata 1518781 ccgggccagc tgttgttcag catcgccagc caactggctt tgatcgtgct cgccggcacc 1518841 accgcggcgc tgaccatcac gggaacgctc acggttcccg aggccatcgc cctgatcgtg 1518901 gtgatggtcc gttacctcga gccgttcacc gctgtcagcg agttggcgcc ggccctcgag 1518961 agcacccgcg cgaccctggg gcgcatcgga tcggtgctta ccgcaccggt catggtggcc 1519021 gggtctggca cgtggcgtga cggcgccgtg gtcccgcgta tcgagttcga cgacgtcgcc 1519081 ttcggctacg acggcggcag cgggccggtc ctcgacgggg tcagcttctg cttgcagccg 1519141 ggaaccacga cggcgatcgt cggaccgtct ggctgcggaa agagcacgat cctggcgctg 1519201 atcgcgggcc tgcaccagcc cactcgcggt cgtgtcctca tcgacggcac cgatgtcgcg 1519261 acgctggatg cccgggcgca gcaggcggtc tgcagtgtcg tgttccaaca tccttacctg 1519321 ttccacggga cgatccgcga caacgtgttc gctgcagacc cgggcgctag tgacgatcag 1519381 tttgcgcaag ccgtccggct ggcgcgggtg gacgagctca tcgccaggct gccagacggc 1519441 gcaaacacaa tcgttggcga agccggctcg gcgctgtccg gcggcgagcg gcaacgcgta 1519501 agcatcgcac gggctctgct gaaagccgct ccggtgctac tggtcgacga ggcgaccagc 1519561 gcactggacg ccgagaatga ggccgcggtg gtcgacgcgc ttgcggccga tccgcgatca 1519621 cgcacccggg tgatcgtcgc ccatcggttg gcaagcatcc gtcatgccga ccgcgtcctg 1519681 tttgttgacg atggccgagt ggtcgaggac ggttcgatct ccgagttgct caccgcgggt 1519741 gggcgtttca gtcagttctg gcgccaacag cacgaggccg ccgagtggca gatcctcgcc 1519801 gagtaacgcg agaaaccacc gcgccacgca gatagccact tcctccgtga atctgcatcg 1519861 cgaggtcggc caccttgcca gctagttcgg tgtagaagag cttcgccgcc gacggtgcaa 1519921 aatatgatat tcgcatggcg tcattgctga acgctcggac tgccgtaatt accggcggtg 1519981 cacaagggct ggggttagct atcggccagc gattcgttgc cgagggtgca cgggttgtgc 1520041 ttggtgatgt gaatctcgaa gcgaccgagg tcgcagccaa gcggctgggc ggcgatgacg 1520101 ttgctctggc ggtgcggtgc gatgtgactc aagccgacga cgtcgacatc ctcatccgga 1520161 ccgctgtcga gcgtttcggc ggtctggatg tcatggtcaa caacgccggg atcacccgcg 1520221 acgcaacgat gcgcacgatg accgaagagc agttcgatca ggtcatcgcg gtgcatctga 1520281 agggaacatg gaacggtacc cggctggcgg cggcaatcat gcgggaacgc aagcggggcg 1520341 ccattgtgaa catgtcttcg gtgtcaggca aggtcggtat ggtcggccaa accaactact 1520401 cagcggccaa ggccggcatc gtaggaatga ccaaggcggc cgccaaagaa cttgcacacc 1520461 tcggcattcg ggtaaacgca atagctccgg ggttgatccg ttcagcgatg acagaagcta 1520521 tgccgcaacg catttgggac cagaagcttg ccgaagttcc gatgggtcgc gccggcgagc 1520581 ccagcgaagt cgctagcgtg gccgtgttct tggcttcgga tctatcctcg tacatgaccg 1520641 gcaccgtgtt ggacgtgact ggcggccggt tcatatgaca ccgagatcat tgccacggta 1520701 cggcaattcg tcaagaagga aatctttccc aatgcaccgg ccctcgaacg tggcaacagc 1520761 tacccgcaag aaatcgtcga tcggctgggt gttattggct tgctcggtcg ccggctgcaa 1520821 gggtatcgac accaccgagt tcattctcgg gcgtgccggc gcattcgagc tggcggtgcg 1520881 cgctgcccag caccgtcata ggtacttgac gatggtcaac gtcggacgag cgccaccacg 1520941 tcgctgccga acggtatgca tggcggctac cgatactccg cggaatatca gattgaacgg 1521001 ctgatgcctg atgcgcccgt tgctgctcag cggagcggga accagcgcga tccagaagcc 1521061 tctgaggact cgaaggctgg cctccggagt ccatcgatga tgtgcagttg catcgcgatt 1521121 gccgccaggg gcgttgtcgc ttgagcacat ctgggcatag gctgccatct tggagggcag 1521181 gcaacctgca tgatagggag gagaatatgg cccgcacgct tgcgttgcgc gcatcggcgg 1521241 gactcgtcgc gggtatggca atggccgcga tcacgctcgc acctggggcc cgcgccgaaa 1521301 ccggtgagca attccccggg gatggggtgt ttctcgtggg aactgacatt gcgccaggca 1521361 cctaccgcac ggaggggccg tcgaatcccc ttattttggt gttcggcagg gtgtccgagc 1521421 tctcaacctg ctcatggtcg acacacagcg cacccgaggt gagcaatgag aacattgtcg 1521481 acaccaacac ctctatgggc ccgatgtcag tggtgatccc gccgaccgtg gcagccttcc 1521541 agacgcataa ctgcaagctt tggatgcgga tctcataggg gccggcgtac ccggtaccgg 1521601 ccgcgggcct accacgtgcc ggaactggaa gcgcagtaag ccctcaacgc gccaccgctt 1521661 tggcccgcgc gcccggcgta ggcgcatcgg cggtggccgt ggggcggcgc actgcgacct 1521721 caccagcggc tttcgagctt tgttcgatca accggccagc atggtcgagg atgcattcga 1521781 gaccatattc gaaattggtt tcatcggggg ccccgatccg atgccccctc ccagttgcgt 1521841 gagcaagcag cggagtcgtc gcgggatcga tggccacggg gtgttcaatg gcggatggtc 1521901 cgctgcccgc cgactggctc ttgcgggaga gccgatctag caccaccgat ccgcgcacgt 1521961 ggaccgaaac cgccgagtag atgtcgaaag cgtcttcgag cgacaggccc gccgtcacca 1522021 gattggcgat ggccttctcc atctcttggg cgcccaaccg cgccgttttc ggggacagcg 1522081 ccgctcgaat cagtatcaga tcgcacagta cggggttgtc cgcgaacgtc ttccgcatcg 1522141 agcgggcatg attgcgcaac gtttcgcgcc agtcgccggc ttcgatgtac ggggtagcga 1522201 acacgtactt gctcaaagcg cggtcggtca tcgcgttgag cagatcgtcc ttcttgcgga 1522261 agtaccagta gatgctggtg accccgacgc caaggtgttt gccgagcaat ggcatgctca 1522321 agttgtctat cgatacctgc tgggcgagtt cgaatgcgcc gctgatgatg tcctcggggt 1522381 tgatggatcc gcgctgccgt cgttgacgct tgcctggggt tgtctgcatt gccgttacgg 1522441 cacctccatc aagataacgc cgggtcagtt gcaggtatgc aggtcggcgg tagtcgtcgt 1522501 gcggacaaca tgtgccgcat ggcctccccg gggacaggcc gggagaacaa gaagccttgc 1522561 gcacggtaac agcgctgatc caatagaatt ctggcggcag cctcggtctc gacgccttcg 1522621 gctactacat cgagttggaa gccttcggcg agtgtcatga tgccgcgcac aatgaccaga 1522681 tcgctagtgt tggttccgag ttgccgcacg aatgttttgt cgatcttgag cgtgtcgatc 1522741 ggtagcgtct gcaacagtga tatggcgcta tagccggtgc cgaaatcgtc gatagcgatg 1522801 tgaacgccga cttctttgag tcgagccagg gtggctctgg cggtatgtag gtcttgcacc 1522861 acaacgtttt cggtgatttc caaacacacg gacgaggcgt ccagaccgtg ctggccgatc 1522921 gtgtctgcga cgaagtcaac aaacccgccc gtcaccagct gtccagctga gacgttgata 1522981 cgcagcagcg cgtcgtggcc caaaccggct gactgccact cggagaattc attgcaggcc 1523041 ctccgcagca cccatctatc caattcgcct gcaaggttga tggattcggc cacagggatg 1523101 aagcagcccg gtgccagcag cccacgggtg gggtgctgcc accggaccaa tgcctcggtc 1523161 ccgacaatgt cgccggtccg taggtcgacc tcgggtaggt agaccaggcg aagggcgtcg 1523221 gattcgatac cacgtcgaag gtgtagttca atatcgttgc gcagttcgcc gctgaccgac 1523281 atgtccgcgg tgaaaatcgc gacgctatct ccgccggcgt gtttggctgc cagagcggct 1523341 tggtcggctc ggcgcaggag gtccgacggt gtgtgctgtc cgggagtccc tgaggcgaca 1523401 ccgatactga cggtgcgggt gagcacctca ccgccgatag cgacgtggtc cttgagctgg 1523461 tcgcgaagac gttcggcgag cggttgagcg gcatcggcac tcattggaga tgcgggtatg 1523521 aggacgaatt cgtcgccgcc gagtcgggcg atcaggctct cgccaacgag tgcgtcaccg 1523581 atccgttggg cgaacacatg gatgaactgg tcaccggcgg cgtggcccag gtagtcgttg 1523641 atggccttga ggcggtccaa gtcgagaaat agcgccgcga ccgggccagg ttgtccgggg 1523701 gccagtcttt ggtccaggtg ctgcagcaac gcgcgacggt tatgcagtcc ggtcagatcg 1523761 tcatggtcgg ccagatagcg aagccgcgcc tcggcggcga cgcgagcctg cacctgggcg 1523821 aagagtgtag cgatggtcat gagggcgtta agctcggcct cgtgccattt ccgatcaccg 1523881 aacttgatga accccagcag tccagtggtg atctcgccag ataccagcgg cacggcggca 1523941 gccgacgtta ccggaacccc gcgggcttct tcgatgaggc gttgatagtc ctcggtggcc 1524001 ggctcgggcc ggaacacgag aggctctttg gcgtgttcgc atagcgcaaa caccgggtcg 1524061 gcatcagcga agtagatcag cctgagcgga tcggggtccg gtatgttgag gcgaggtggc 1524121 cattcggcca ccagcctcgt cgcgcgcctg tcgcgatcgt tatgacgcaa aaagctgaca 1524181 tctacgccca gctgttccac tagataggcc aaaacgcgct gactgacttc ggctgacgtg 1524241 gcagcgtcga ctgtcatgag ctggttggct acggtggtga cgagctcctc aagctgcggc 1524301 gtcgcggtgt cgttgcacat ctcggatgct atctgtgcgg ctctggtatg gcgtgccgta 1524361 cgcgtcggcg gctacacacc gacggcggtg gcgcgtggaa caacctgaag atcaacacct 1524421 cgtgcccttc tttgcccggc ttgaccagtt cccgaaagtc gagttgcagg cggtgcagct 1524481 gtgcggcgaa atggggtgac gcttggtcga ggtcgtggcg gccacgtgca tacaggaaga 1524541 tcggtgacat cggttgtacg gccagtccat gttgttgggc cacaatccac accgcctgca 1524601 tggctgatcc gccacgcgca aaatcggtga gcgtggcgcc atcaacgtag acgattgcga 1524661 gcgctgaact cgccgacacg cgctcattgg tgttgtcttc gagggctgtt ccgcaatccc 1524721 attgcgctag ccgtgccacg acgtcggagc gtcgcaggat atcgagaacc cgcaattcgc 1524781 cggaatccag ttcgaggctt cggacatcga tgcccgcatc gagcgaaggg tcgcccggcc 1524841 accggagctc ggacatcatt tcctcatgta gcctcggggt gagatagcgg attcggtctg 1524901 cagccgctaa aattgttgca gcccggtcga tctcgtttcg tgacagcaac agctgtaacc 1524961 gcgcaccctc agccgcggcg gtgttcgtta acaactcaac ggtcgcgggg tggacgtgac 1525021 cgggcatacc gtggtggcga ttggtcgttc tgagcagcat cggccggtaa agggccgcaa 1525081 ggcttggatc atcaccacgg ccaaaatgca ttgtcgcttg cagcggcgag tcgggctggg 1525141 attcgtcgaa ctctactgat cccaggaccc ggtgtgcagc ggcagcgaca cgcgcgttaa 1525201 acatggccgc gccgacggcc actgcgctac cacgaaacgc gatatccatt gcgctggtgt 1525261 gctcaggtgc tagtcggatg gtcagcgaat gctgtttggc cacaacatgc catggctgaa 1525321 cgttgccccc tgaaggcgcg cgaatcgccg cctgagccac gatttcgctg gttggctgcg 1525381 gctcggctgg cgctgttggc ggcacggact cgagcaacca tccgttcccg cgagacggca 1525441 tgggcggttg atcgaggcga tctagcgctg cggacacatc cacccgtacc cggccagact 1525501 caagtggttc tcccagaccg attctgcgta ccgcttcagc taccgtcgct gcgcccaccc 1525561 agatatcgcc tgccaactgc ggccatcccc acaacgtctg gtcaacttcg atcatcgacg 1525621 ccgcacaacg cgccgagagc tcttggcaat caaggatgtt gagaacgtgg gggactttgt 1525681 cttttgtggt cagtccacac agcttgtcgg cgtcgatgtc gcccaatagc ccatgaaaga 1525741 tcggtcgccc aggttcgacg tcgtagcgtt cgacatcgac caggccgcgg tcactggtcg 1525801 ccatcagtac ggggacacca cgggcgcacg cggcttgtcg cagtatcact ttgatatcca 1525861 gcgagtcgca ttcttcgata acgacgtcaa ggccgtcgag gaactcgtcg acggattccg 1525921 gcgagagccc ggatgtaacg aggtccacgg ccaggtaggg atccagctcc gcgatcctgc 1525981 gcgccgcaat catcgccttg ttgaggccaa tgtcgaagac gccgaccggc acgcgattca 1526041 ggttcgacag ctcaattttg tcgaaatcgg ccaaccgcag tgtgccacag gcaccttcgg 1526101 cggcaagggt gtatgcgatc gcatggccgg cgctgagtcc gacgacgccg acccgtagcg 1526161 cgtgcagtgc gcgttgttcc tcagcggtga tgaggtgcct gttgcggtcc aagcgcacgg 1526221 cacggaaccc ccggggaccc agaatggcaa caaccatgcg ccgccaggga taataggccc 1526281 atcgcttcgc ttcttctagc agatctggat caggctgtgg cagcaggcgc cgcacgcccg 1526341 ctagctgttc tgcgaatcgg tcgacgaact cgatgctcgg atctgagcgt agtcgatcga 1526401 gcaccaggac atcgtcgtgg tcatcgtcac gaaggacgag aatgccggtg ctgccgccct 1526461 cgtgtgggat ggtcactgtt cggctccagc ggtcgctgcg gtggttgcgc tcaacgcttc 1526521 tacatcgcgc agaagcttgc gcgactcgac aagcattctt gacagttgtt ttggctcggc 1526581 atggttagcc aaggttctgc ggtcccacca gatcatcttg gtccggtagc gctcgtccgg 1526641 gtatgctgcc gccgggattc tcgctgctat tactcccccc gaagaacgcc accggtccag 1526701 cgcgtgggcc gccgcggtcc ccatcacaaa ctgaaccccc aacagggaca tgcttagcgg 1526761 tagggcgcgc gccaaggcgg cagcaatcgc atcactgcgc tgcgcgtcac tattaaccca 1526821 cccggacttc acttccacga ccccgaatgg cgcccggtca ttgatcatct tgcgcaccgc 1526881 ggataatccg ggattgccag cccattcgac taccgcatgc gagtcatcgg ctgaccgcag 1526941 cggtccgatt acccgagcgc ccccgactac atctcctcca atatcaatgg cggcaaagaa 1527001 caactgtgta tcggaaccgt cactgatggc gtcgagatct aaggtacact cgactccgtg 1527061 cttactatag gcgcgaagcg caccctgaag gtatgtattc cacaacgtgg gatcgagcgc 1527121 gggttgcgat accacaagcc ggcattgcgc atcagaaacc cacacactga gattttccga 1527181 aaaatgcaac ttctgcggtg cgataggacg aagttgagcg gtggtcatga ttctccaatc 1527241 tgttaggtat ccggcaatta acacgagatt tgctgcccct gtatcgagca gcgcagacgt 1527301 tggggctgcg cccggagaat tgctgccgtt gcgcagaacg gcgccgcacg gcagggttca 1527361 acgcccggcc gcgctggtat ttatcgagtc gctgcgagag ccgtgaacaa attgtcacga 1527421 aatcgtgcgc acgcgcgttc acaaatacca cgcgcacgcg ctcgaaaact acatgaccag 1527481 atagccagat ttttccggac cggcaaagcg ttgttcagtg ttggtcacgg ctcttgatcg 1527541 tatttacccc gggtggcgta gaccctatcg atggtggacc ccgttcatcg gggtaatcga 1527601 atggatcatg caaaatatta ctttgacgag tattcattcc gattcaaccg gtcccaccca 1527661 cctctcatgc gtgcggggca tactgttcta cggcttggtg aaacacgccg ttgccatcga 1527721 tctacaccca cgcgacctac tctctgaaaa agtcgaccgg cagtgccttg gcaaagtgcc 1527781 agccttgtgc ggctttacag ccgaaggcgc gcaaccgggc ggcttggctg ggggtttcga 1527841 ctagctttgc agtgacggtg ataccgagct tgtcgccaag gtcgatcatt gcccgggtga 1527901 tctgttcgtt ggccagccga gcttgaatgt cgccatcgag gcactcgatg aactttcccc 1527961 cgagtttgac cacgtcgacg gggaggcggg gaaggtaggc gaggctggag aatccaatgc 1528021 cgaagtcgtc gatggcgatg ccgacgccga gagcggacaa ttcttgtagc ctggtcaccg 1528081 ccttctcgtc tctgctaagg cgcgcgtcct cggccagttc gagctgcagg gcatgggcgg 1528141 gcaggccggt ttcgccgagc acaccttcga ccagcaccag gaagccggga tcgcagatgg 1528201 tgctggcgga gacgttgacg ctgacaaacg gttgcgggtc ggtgctgtgg tcacgccaac 1528261 tgcggacgtg gcggcaggcc tgctcgagca cgaaggccgt gagcggcacc atcagtccgt 1528321 tgttctcggc acggtcgatg aaccggcccg ggagtagcgt gcccaacgtc gggtgttccc 1528381 agcgcagcag ggcctcggcg ccgatgatgc ggttgtcggc aagccggatg attggctggt 1528441 agacgaggaa gaattcaccg cgatccagtg ccacgcgcat cgaagtggac agataatggc 1528501 gagtgttgac ctggtcgcgg tcggagtccg cccattggtc aggattggct accatcgctc 1528561 gcgcttgcat cgcgccccta aacatctctt cgtagtcgat caacttggtc ggcctgagcg 1528621 cgcaagcgaa cgctgtagcg cgctgacaac aacgatccat ccaagggctg catcaggatt 1528681 cacagcccgg tgggcacctc gccgaccgcg gtggcaacgc gaagcacacc accgaagtcg 1528741 tcttgacccg aaccgtcgca gtagattctg gagtcctggg aggcaaagat cgtcagcgat 1528801 aatgcgtaaa agtccgtcac gtactacgta gaaggtccgt gagtgcagcc gttccgggca 1528861 tgcacgaacc ggcgcttaca cgtcgaaggc ggctgcgcgg caatcagtct cggtgggtaa 1528921 cccattgtcg gcgggcgatc ggttacctct cgaatcgacg gccgcccgca tctgagttag 1528981 ccaggccagc ggtttcctac gggcgctggg tgcaaagata cgacttccgg gtgcaatagt 1529041 tacgcgctat cgctgatgtt cttgtccgca ccggccttca gagttgagcc aacgcgtagt 1529101 cgccactcgg cactacggtg ggcgcgtcat cgacgcttcg ctgacgtccc gaggtggcag 1529161 atgttgcgct cgctgcagat cgccgatcaa atcgctcgta cgggtcacat gccagtgagg 1529221 cgtcttgatc tgatctggat cagcgcacga aacgccgcga gaagggagct tgatctgggc 1529281 gtggctgcgc tggtggaggc tgtgacgttg ctcactgctg acgtcgaggg ctcgacacgg 1529341 ctgtcgcaga cgcgactcaa cgagctagcg gccgattacc caaccttgga tcagaacata 1529401 tcggaagctg tcgcggccca tggcggggtg acgcgaccgg tagaccagga ggtgggtagc 1529461 ggtctcgtcg tcgcgttcct gcgtgctggc gacgcgatcg cgtgcgcttt ggaactgcag 1529521 ctctcaacgt tggcgcctat gcggccgcgt gtcggtgtgc acaccggcga tgtccggctg 1529581 cgcggcgacg gcaccatcac cggctccgcg atcaacgaga gtgcgtgtct gcgcgacctc 1529641 gcacacgaag gccagacttt gctttcagcc gccactggcg atctggtcat cgaccagctt 1529701 ccggcaaata cctggctgac cgacgtcggc aagtaccccc tgcggggttt gcatcgccaa 1529761 gaacgggtta tccagttgtg tcatcgagac ctacgcaatg agtttccgcc gctgcggatg 1529821 tcggtcggta acagatccag ccttccggcc cagttcacca cttttgtagg ccgtgacgca 1529881 cagatcaacg aggtgcaaga ggtcctgacg aactaccggc tggtgacgct gcgcggcgag 1529941 ggcggtgtag gtaagacgcg tctggcgatc cagatcgcgg ccgcgtcgga atttcgcgat 1530001 ggtctgtgtt tcgtcgactt ggcaccgatt gccgatcccg gcatggtgtc caccaccgcg 1530061 gcccatgctc taggtctgat cgatcggccg ggcagctcaa cattcgacac tcttagtcat 1530121 gccatcggca actgccacat gctaatggtg ttggacaact gtgagcacgt gttggatgcg 1530181 tgcgccgagc tggtcgttga gctgctgggt gcctgcccgg agttaagcat tttggcgacc 1530241 agccgcgagt cgatcggcgt gaccggcgag gtcacatggg tggtgccgtc gttgtctccg 1530301 gcgaacgaag caatccagtt gttcactgaa cgtgcgcgcc tagtccaacc caattttgag 1530361 atcgttgctg acaacttcgc cgccgtgagc gagatctgcc ggcggctaga cggtatgccc 1530421 ctggcaatcg agttggccgc ggcacgattg cggtcgttgt cgccaaacga gatcgccaac 1530481 agtttggatg accgattccg cctgctgacc ggtggtgctc gcagtacggt gcagcgccag 1530541 cagacattac gggcatctat ggattggtcg tacgcactgc tgactgacac cgaacggatc 1530601 ctgttccgcc gccttgcggt gtttgtgggc ggtttcgacc tcaccgcggc gagcgaagtc 1530661 gccgccgccg gcggcgacga cttcgtcgag cggtattcag tgcttgatca actgacgctg 1530721 cttgtcgaca agtcgctggt ggtagccgaa gaaagccgag gcagtacgcg ctatcggctg 1530781 ttggaaaccg tacgccagta tgcgctagaa aaactgaacg aatccgaaga aatcgacggg 1530841 gtgcgcgcta ggcaccggac ccactacgca accatggcgg cagggctgaa cgttcccgcc 1530901 tccaccgact atgaacaacg cctcctgcag gctgaagccg aaatcgataa tttgcgtgcc 1530961 gcattcacct ggagccgtgg aaacggcgat attgcagccg cattgcagct cgcatccgca 1531021 ttgcaaccgc tgtggtcgca ggggcgcatg cgcgaagggc tggcctggct cgaatccatc 1531081 ctcgagcggg aaggcgacaa tcatcttgtg ccggcggggg tttgggcgcg ggcgcttgcg 1531141 gagaaggtaa tactcaaggc ttggccggcc acgagcccga tgggcgcccc cgacatcgtc 1531201 gcgcaggctc accatgcctt ggcgctggca cgcgacgcag gcgactgcgc agtgttggct 1531261 cgagcgctcg tcgcatgtgg ctgcggcagt ggttgcgaca cggaagccgc tcaaccctac 1531321 ttcgccgagg cgatcgagct ggcgcgcgcc attaacgatg agtggacatt gagccaaatc 1531381 gattattggc aggtggtcgg gatcttcata tcgggtcagc caattccttt gcgagctgcg 1531441 gccgaacaag ctcgagagct cgccgacagc atcggaaacc ggttcgtctc acgtcaatgc 1531501 cgcctgtttg cctgcctggc gcagatatgg gaaggcgacg cgaacggagc attggcacta 1531561 tctcgcgacg ttaccgccga ggccgaggtg gcaaacgatg tcgttactaa ggtactcggt 1531621 ttgtatgtcg aagccatggc actgtcttac atcggcgaca gcgccgcccg gaccatcgct 1531681 ggtgcggctc tcgaagctgc caccgagtta ggcgggattt accaagatct gggttacgga 1531741 gcgataactc gcgcggcgtt ggccgcgggc gacgtagcgg ccattgaggc tagcgaagcg 1531801 agctgggatc ttcgcaatca acacaacgtg gtaacggcac accacgagct gatggcgcag 1531861 gcagccctgg ttcgcggcga tgtgaccacg gcaagacgtt tcgccgacga agctgtgctt 1531921 gcgagcaccg gatggcatct gatgatggcg ctgatagcac gggcgcgagt ggcgattgcg 1531981 caggacgagc tgggaaaggc acgcgatgac gcccacgccg cggtggcgtg cggcgtcggt 1532041 gtgcagacgt acctcgcgat gccggatgcc ctagaacttc tcgcaggtct ggccggtgag 1532101 gccggtaacc acggtcaagc agtgcgcctt ttcggcgcgg ccgcggccca gcggcagcgt 1532161 acgggggagg ttcgccacaa gatttgggac gccggctatg aggccgccac ggcggcgctt 1532221 cgtgatgcga tgggcgacga agatttcact gccgcctggg ctgagggtgc cgcggccccc 1532281 ttggacgagg cgatcgccta cgcacaacgc ggtcgcggcg aacgcaaacg cccaagcaac 1532341 ggctgggacg cgctgacccc ggccgagcac aaaatcgtaa agctcgtcac cgaaggactg 1532401 gtcaccaagg acatcgccgc gaggcttttc gtctcaccgc gtaccgtgca aacacacctc 1532461 acccacatct acaccaagct cgacgtcacc tcccgtgtcc aacttgtaca ggaggccgcg 1532521 caacactcga cctaggattg cgcggccagc gcaggcccgg agttcgaatc ggatgcaata 1532581 cgcaaccaat ctgggctctt ctgcgcgttg tcgctgatgt tcatggctct tcgcgccccc 1532641 atgcttgagc gcatgaacgg tttgcataca gatgacgcgc cggtcaattg gctcgagcgg 1532701 cgaggtggcc ggcttacgtc gaggcggagg gtgacgttgc tccatgctgg agtggaacac 1532761 ccgatgcggc tgtggggcgt ccaatccgag gcgataactg ccgcgatggt gcttagccgg 1532821 aaggtatcgg ccatcattgc cggacactgc ggtgtgcgcc tagttgatca gggcgtgggc 1532881 gatggcttcg tcgccgcgtt cgcccatgcc agcgatgccg tcgcatgtgc tctggagttg 1532941 caccaggctc cgttgtcccc gatcgtcctg cgcatcggga ttcacaccgg tgaggcgcag 1533001 ttggtcgacg agcgcatcta cgccggcgcc acaatgaacc tggctgcaga gctacgggat 1533061 ttagcccatg gtgggcagac cgtgatgtcg ggtgctaccg aggatgcggt actcggccgg 1533121 cttcccatgc gcgcttggct aattggcttg aggcccatgg aagggtcccc ggaaaggcat 1533181 aacttccccc agtcacaacg catagcacaa ttgtgccatc cgaaccttcg caacaccttt 1533241 ccgccgctgc gcatgcgcat cgccgatgcg agcggaattc cttatgtggg gcggattctg 1533301 gttaacgttc aggtagttcc ccactgggaa ggagggtgtg ccgcagcggg gatggtcctt 1533361 gctgggtgaa gcgccattga gggccagacg ataggttggc cagcgacgtc ctcaactcag 1533421 actctcggcg cgacctgacc ggcggttacg atcatctgct cggacattcg caagagagcg 1533481 tgctcgccca ctccctgcgg caaggtgtag gccagctcgc gcaacttaac cgcgcactcg 1533541 tagagcgtct ggcggaaatg cgtttcggtc atcggggtgc cttcgctcgt cggcgagatc 1533601 ggcagcggtg tcttgtgctc atacagatca atctcattca tcagagccat cattcgccgg 1533661 ctagcagctg caaatcatca gcacatccac gtatgtcgtc ggctgcccag cgcgcaatgt 1533721 gtggcacggc gagttgatgt tcaacctcgg cgtgtcctgc atactggatt tcttactgta 1533781 aagtcaccca aatgggtggt gcccgccggc tcaagctcga cgggagcatc cccaaccagc 1533841 tcgcccgggc ggccgacgcg gccgtcgcac ttgagcgcaa tggtttcgat gggggctgga 1533901 cagctgaagc cagccatgat ccctttctcc cgctgctact ggctgccgag cacacgtcgc 1533961 gacttgagct tggcaccaac atcgcggtag cgttcgcgcg caatccgatg attgtcgcca 1534021 acgtgggctg ggacctacag acgtactcga agggaagatt gatcctcggt ctgggaaccc 1534081 agatccggcc gcacatcgag aaacgattca gcatgccctg gggtcatccg gcacgtcgga 1534141 tgcgtgaatt cgtcgccgcg ctgcgtgcga tctggttggc ttggcaggac gggaccaagc 1534201 tttgcttcga gggtgagttc tacacccaca agatcatgac cccgatgttc acacccgagc 1534261 cgcagcccta tcccgttccg agagtcttca tcgccgctgt cggtgaagcg atgaccgaaa 1534321 tgtgcggcga agtcgccgac ggccacctcg gtcaccctat ggtctcgaaa cggtacctca 1534381 ccgaggtgtc ggtgccggcg ctgctacgtg gcctggcgcg atcgggtcgc gatcgcagtg 1534441 ccttcgaggt gtcgtgcgag gtgatggtgg ccactggcgc ggacgacgcc gaactggcgg 1534501 ccgcctgcac tgccacgcgc aagcaaatcg ccttctacgg atccacgccg gcttaccgca 1534561 aagtcctcga gcagcatggc tggggcgatc tgcacccgga gctgcaccgc ctctccaagc 1534621 tgggtgagtg ggaggccatg ggtgggctaa tcgacgacga gatgctcggt gctttcgcgg 1534681 tggtcggtcc ggtggacacg atcgccggtg cccttcgcaa tcgttgtgag ggcgtcgtcg 1534741 accgcgtctt gccgattttc atggccgcat ctcaggagtg tattaacgcc gcactgcagg 1534801 actttcgccg ttgagcgcgc catcggtgga tgaggccacc aagatcgctg cccgcataga 1534861 gggcccgcat tgcgtgcgga tcggcgttac ccggcggcgg gcacacgggg cattacgtac 1534921 gcccgcggcg gcatccgcaa cgcattgcta accccgccga acccgccgcc gctattggtc 1534981 agttgcccca gcggtagccc gcccagcatg tgtccggggg cggtttgggc ggcgctggtc 1535041 aggctggtca gcggcagcgc ccgcgccgcc ggggtgaccg cctggttggc cgcggcccat 1535101 gctggcggca ccgacaacga accgaccgag gccgcccgac ccaagttggc ggccacccca 1535161 gcgcccagac ctgaagaacc cagcgacgaa cccagctggc tgcccagcga gctcatcgcc 1535221 tggaccccgt tttgcgccgc ggtttccacg gcctgagccg ccgccggagc aaagcccttc 1535281 aacatcgagt gcaaggtgtt ggtcatcgac acacccgagt tggtcatcga cacgtggttg 1535341 ttgagcatcg acacgatgtt gctgatcggc gacagatgcg gcgagaccgt cttccacagg 1535401 ccacccagct tggaagaagg cgtggtgccc tgcgtgggct gggccagctg ttgcagcgcc 1535461 tggggcacat tgttcatcaa ctggttcgcc gcggcggtgt cggaggcctc ctcgaccgcg 1535521 gcggcctgct cgaggagccc acccgcgctg gtcatctccg gcgcctcctc gaacggcagc 1535581 aacgtcgccg tcgccgtcgc cgtcgccgcg gcgtagccaa acatcgcggc ggcgtcttgg 1535641 gcccacatct cgccgtattc ggcctcgttg accgcgatcg ccggggtgtt ttgccccaag 1535701 aggttggtcg ctatcagaat catcagttca gcacggttct cggcgatcac cggcgggggc 1535761 accgtcagcc catacgccgt ctcgtaggcc gccgcagcaa cccggacctg ggcggcggtc 1535821 agctcggcct gccccgcggt gacgctcatc cacgccacat acggcgaggc cgccgccacc 1535881 atcagacccg ccgacgaacc tatccacgac cccaccgtca gaccccagac caccgactga 1535941 aacgccgacg cggccgaaaa caggtcactc gccacgctgt cccacatctg agccgcggcc 1536001 accagcgagg ccgaacccgg gccggcgtac atcctcgcgg agttgatctc cggtggtaac 1536061 gccccgaagt ccaccacttc gataatcctt ccgctcggcc ataactagca ccaatgatgg 1536121 acagcaaaca acgtcggcaa caggtcaaat tctctcaagt agtcacaacc ccagatgcaa 1536181 agtgcaccag ccgccctgcc gcgaagctaa atccagctga acaatctgaa catcaggtaa 1536241 atacagtggc aactaatctt aaataacccg gccgattaga cagcggccga gatctgtttg 1536301 aggtcgggcc gtgattgtcc ggagaacggc cggtatctcg cacgagcagc cacccgcgcc 1536361 cccgtcagac ttggcgaccg cctacggcaa cctaaaccgg ggtgaacttg gtgatcagcc 1536421 aattgccgtc gaccttggct agggtcacca tcacgctgct ggccgccatc gacggattgg 1536481 ggctgtcctt actggtagtg ctctggtcga caaaaaccag aacgacggcc gaatccggat 1536541 gtagctccga cacggccgcg cgcaccacct tggcggtggt tttcagtgac ttctgtttgg 1536601 ccgccggagc cacgatctgc tgcgtgaact ggtcgtagta ggacaggaaa tcgccggcga 1536661 ggtgcgacct ggcggtagcg aagtcttggt cgagcgtgtc gggtgaatac gacaacagcg 1536721 cgattgtccc gtcagacgcc gcggcgccgg cagcacgggc ggcgccggag tccgtctgct 1536781 gatcgggtcg gtattgctca aggtatagcc atcccgtcgc gcccccagag atcaacatga 1536841 gcaggatgag aatcaccgga acgggtttca aggtaacctg cattcgccac aggtcacggt 1536901 gccgctgacc cttctgcgcg gtagattccg ttgcagagtc ggtgtcaaat gcctcggtcg 1536961 ccgaatcacc ggcttcgcct gcggctgagt cgatctcagc gacttcggtg gcgtcagtgg 1537021 tttcggtgtt gacgtcgcgt acgtcatcgg tcacggtacg aactcaactt tcgacatctt 1537081 gtactgtccc ccctcttcgg tcacggtcac tttgagccgc cacgcacgtg gttcgtcttt 1537141 cgccccagcg gaattggtga cccgtgaagt cgccgcgacg agcaccacgg cggaatgctc 1537201 gttcatggat tcgacggctg tcgcgttcac cgtgccttcg gtgaccactt tggactgttc 1537261 gacaaccttg gtgaaatcgg ctgcccgctg ctggaagtca tccctgaatt cgccggtgga 1537321 gctgtcgatc acacgcgcga cgtcttcttt ggccttgttg aagtccagcg agttcatgtt 1537381 gatgacacct tgcttggctc cggcggcgaa cgccgcggcg cgctgctggc gttcggtggc 1537441 ctcatggtgt tgccacacaa tgtatccgct gagcccggtg aagccgcaga tgatgacgac 1537501 tgcggccgcc atggcaatcg tggacagtct tggtaaccgc acccgcaacc gccgtcgcca 1537561 ggatgccgac cgtgcggcct cctggtctgc ggcctcatag tcgtcatagt cgtcatagtc 1537621 ttcggcgtct tcccagtctg catactcctc ggggacgttc tcgtcctcgg ctggggccat 1537681 cgccagcgcc tcacgcttca accgggcggc acgggcacgg gcccgcgccg cggcggccag 1537741 cgcttcggct tcggcggctt cggcttcggc ggccaacgcc atcgcgtcgg cttgcgatgt 1537801 ccccgcgtcc gacggtggtt cggttgtctc agccatcgtg gatacagccc gtcagtatca 1537861 ttctcgactg aactccggta tcgctcacag tattgcccat tgctaacatt cgaggcccca 1537921 gcgaactcct acgaggaccg atcaggcaga cgatcaccca cctgagtttt ccccggcagc 1537981 atgactggtg ttgactctct cgaaaacata ctctttactt cgaccctcca agcgtttctc 1538041 caaaagattc ccggttgtgc catgtcgtgt cgtgaacgcg tgcaggcgat ggtcgacgga 1538101 acgcccgtgc aggctcgttg agccgcctac tcctgggcga agatgtcctc ggtgtcggcg 1538161 ccgacaacgg gcagctggac cagcgacaag acatggtgcg ccgggctgcc gggtggggcc 1538221 accagcacgc actcggtccc ctgtttgcgt gcccggtcac aagcggcagc gagggcgccg 1538281 acgccggccg aaccaaggtg ggtgacggca ctgaggtcga tcgtcacggg tgctatgcca 1538341 gaacggcttt cgacggcgat ctggcggtcc aatgtggctg cggtggtcga atcgacgtcg 1538401 ccccggacaa cgatgcggcc ggattcaact agggagacga attcgctgtc gatggtttgt 1538461 tggaaagctg cccggcgaac catcgtgtcg gtgacaaacc gcgccggccg cgacaggcga 1538521 tgcgtaagtg tggcagtcgt tccgccggcg ccatgcatga tgcgcgcctc cgacactagg 1538581 gcctcggcca tcgccaggcc gcgcccacgg ccacgggcgc cgtcgcggtg gtccttccat 1538641 tggccccggt cgattaccga tgcccgcacg ttgccgtcgc cggccagcgc ggcctcgaca 1538701 acgatgccct tggagacgtc cgtggcgtat ccgtgttcga ccgcgttctc gacgaattcg 1538761 gagatcgcgt gcacgatatc ggcgatgtcg gagtggtcgg cgccgatctc tgccagccac 1538821 tcacgaagct gggctcgaac ggttcgtgcc gcgttgatcg tcgcatccag cgttatgtgc 1538881 agcggcggcg ttggcgcccg gcgttgcatc gcaagcaggg tcacatcgtc gttgtagccg 1538941 gtggaccgca gcagcaattc aagtgtgtcc gaacagagtc ggtcgatggg ccgtgccggg 1539001 gcgtcgagca caaagccgcc actgccgctg gcgatgctgg ccgctaggtc ggcaaattcg 1539061 gcggtgctgg cctcgagcgg ccgaccgggc cgctcgatca ggccgtcagt gtaaaagagg 1539121 atcgcgtcgc cgatgttgag cacttcactg cgcactggaa atccggttcc gctgccgagc 1539181 ggacccgcgc cggttggttc gacataccgc gcactcgcgt ccgcggtcac cagcagcggt 1539241 ggcgggtgtc cggctgtgca gtactggaat tcgcccgagg tgaagtcgag cgagccgaca 1539301 cacatggtgg ccgatttcga tccaggtacc tgtttatgga agcggtccac tgcctcaagc 1539361 gcctcgacga ccgtgtaccc cgtcgagatc tgcatgcgta acgccgtacg taattgcgac 1539421 atgaccgctg cggcctccac gccgtggccc acgacgtcgc caacgacgag caccaaccga 1539481 tccccgaggg ccagcgcgtc gaaccagtcg ccgccggccg cggtatcctc ggcggcgacc 1539541 aggtactcgg cggctatgtc ggcgccggga accacgggca ccgacgcggc cagcaacgcc 1539601 tgctgcataa cggtggctga atcgcgcaca ttgcgatagc gctcggacag ttcctccacg 1539661 cgcgcctcgg ccgcctgccg ggctcgcact cggctggtga cgtcgtccac aatgagctgc 1539721 acgccctcga tcgatccgtc cgcccggcgg cgcggtgtga cgacaaagtc gaagtatcgt 1539781 tcctcaactc cggaaccgtc gtaatcagtt tgtagtcgcc actccgatcc tgattgcggc 1539841 tcaccggttt gatagacccg gtccaacatt tcgtagatct gctgaccctc cagttcggga 1539901 tagacctccc gagcgggctg tcccacggtg tcaagcaatg gactgaagcc gcgataggcc 1539961 gcgttcactg cgacaaagcg atggtcaggc ccctcgaggc caaccagaat cgcagggatg 1540021 tgctcgaaaa tgcgtcgtac atcctcggcc gcaccgaccg ttttgtccca gtccatttcg 1540081 gccgccattt ggccgtccct cctacggacc gatgtagcaa acgggtcaac gtgcgcagac 1540141 caattcgcca ggcaacgcaa ccaggttatc aacgtgccct accagcttgc cggaaaagca 1540201 aaagtgcgtt tggggcaggc cgccacttat gtcgctgaca gcgcggactc cgtggtcggg 1540261 tgtacgggca gcacgtcgcc gtatccgcag gcgtggatga tgcgcgcgac agcacggtcg 1540321 cggcttacca ggcgcacgtc cacgccccgg cgtcgacacc gttcggcctt gtgagcaagg 1540381 acggcgactg cgcagcagcc catgaaatcg aggccgttga ggttgaccac gagtggttcc 1540441 ggcgcggtgg tggccgcggc cgccttcgtg accagatctt gccaagtgtg ctcattggcg 1540501 gcgtcgatct cgccacgcgc atggataatc acagccgagt cgtggtgctg gatggtcgcc 1540561 ttgagcgcgt tgctcaccgg agtagtgaat gaccctgcct gagtcgggtt catggtgcac 1540621 tcctcatcgg cggcacccga gcccccaatt ggattgccgg tcctgcgcgc cgcggaaaac 1540681 cgtcgtcttt gtatagcaag gccggcccgc tccgtctata gcgccgaagc cggcggccaa 1540741 tgagcttctc ggcgtcctcg gagccgaccc cattctccgc gcgggtggcc ccgcagatgc 1540801 gatccagcaa ccccgccgac tcgctccgga caggtggtgg tagccctcgt aggctcagca 1540861 atcgtggact tgcattcacg accaccgtgg tcgaacaacg cggtgcgtcg tcttggcgtg 1540921 gcactgcgcg acggagttga cccgccggtc gactgcccgt cgtacgccga ggtgatgctg 1540981 tggcatgcgg acttggccgc cgaagtccag gaccggatcg agggccggag ttggtctgcg 1541041 tcggagttat tggttacctc acgtgcgaag agccaagaca ccctgctagc aaagctgcgg 1541101 cgtcggcctt acctgcaact gaacaccatc caagacatcg caggtgtccg catcgatgcc 1541161 gacctcctgc tgggcgagca gacgagactt gctcgcgaga tcgccgacca cttcggtgct 1541221 gaccagcccg ctattcatga tctgcgtgac cacccgcacg ccggctaccg ggccgttcat 1541281 gtctggcttc ggttacctgc cggtcgtgtc gagatacaga ttcgcaccat tttgcagagc 1541341 ctgtgggcca acttctacga gcttctcgct gacgcgtacg gtcggggcat ccgctatgac 1541401 gagcggccgg agcagctagc ggccggcgtt gtcccggcac agcttcaaga gctggtaggg 1541461 gttatgcaag acgcttcagc ggatctggcg atgcatgaag ccgagtggca acactgtgca 1541521 gagatcgaat accccggcca gcgggcgatg gcgcttggcg aggcgagcaa gaacaaggcg 1541581 acggtgctcg caacgaccaa gtttaggctg gaaagggcca tcaatgaggc cgagtcggca 1541641 gggggaggtg ggtgaggtgg ctggctatgt cgtcgaatac aaccggcgca cccacgtgcg 1541701 tcgcatcacc gagttcgcca ccccgcaaga agcgatggag caccggttga agctggaagc 1541761 cgagcgcacc gacagcaata tcgagatcgt tgcgctcgtc agtaagtcgt tgggaaccct 1541821 gaagcaaacg cattcgcggt acttcactgg tgaagagctg aacgtcggaa acggcgcgcg 1541881 gtaggccctt gggtttccgc gagtgtgccg ggtccggtcg acatggggag gttcggtcaa 1541941 catgtctacc cagcactaga gcccgagcgc ccgataggtg cggcggacga attttggttg 1542001 cgcggtccgc agtttcgcca gggatgggtt acccgcgacg gccgcggcag cgtccacgga 1542061 cagatcggga caatgctgga tcagatacag cagtatcagg tcggcgctcg ggtcggcctg 1542121 ccaccatgtc ccgtacgcgc cgggccagct gaaggtcccg agcccgcccg gcccgaacag 1542181 cggcctggac ttcgccggat cggtcaccac cgataggttc agcccgaagc cgcggcccac 1542241 ccagaacggc gcccccagaa agctgtgccg tttctgctcg tcggtcagcc ggtcggtgcg 1542301 catcaggcgc accgattcag gtgacaacac ccggaccccg tcgaccgtcc cgtcgcccaa 1542361 cagcatccgc acgaaccgca ggtagtcatc ggcggtcgac cacaacccgc cgccggcgtt 1542421 acagaacgac ggcggcgtga cgtgtggcgg ccccatcacg tcgtgccgca accggtcttg 1542481 ttcgtcgagc cggtacatgg tcgcggcccg tcgctgcgcg tcggccgaca cgtagaagcc 1542541 ggtgtcggtc attcctgccg gacccagcac tcgctcgtcg atgacctggt acagcggtgc 1542601 gtcctcgatg cgggagacaa tgacacccaa gacgtcgatg gcgtggctgt aggtcacccg 1542661 gtcgccaggt tggtgcacga gcggaagggt tgccagcgct gccagccaaa cgtcgggacc 1542721 ctggccgaac ggcagtcgct gataggcccg cgaaattggc cccgacaccg agaaaccgta 1542781 agccaggccg ctggtgtgag tgagcaggtc ctcgatcaaa atggctcgtc gcgcgggatg 1542841 tgtgcgatcc agcgggccgg cggcatcgtc cagcacggcc accttgcaga gctccggtgc 1542901 ccaacgcgtg atcgggtcac gcagtgccag tttgccctcg tcgaccaggc tcatcgccgc 1542961 cgccaccgtg accggcttgg tcatcgacgc gatgcgaaac agcgtgtcgc gttgcatggg 1543021 cacgcccgcg tcgatatcgc gatagccgat ctcgttgact tgcaacaatt tttcgcgctg 1543081 ccagaccatg gttaccgcgc cggaaagcag gcccgcgtcg catacctcgc ggatggacgc 1543141 ctgattgccg tcgagattca cccggttcag gatactgtcc gagccagcgc ggctcggcgg 1543201 attactgatt gtgcgaacgt tttcccgcgc accggtcgcg tgttactgtc gcgctctccg 1543261 gcgaatgtga tctggggaac atgctgtgag cgcggcggca tgctagtgac gatggtgtcg 1543321 ctgctggtga accagggtgt gggtaggcag tcaccgagac ccgcaaccat ggacggggct 1543381 ggattcgagg ctccgtgcat gccgtacgac taggggtagc gcccagctgc tcaataccat 1543441 cggttggata acaaaggctg aacatgaatg gcttgatctc acaagcgtgc ggctcccacc 1543501 gaccccggcg cccctcgagc ctgggggctg tcgcgatcct gatcgcggcg acacttttcg 1543561 cgactgtcgt tgcggggtgc gggaaaaaac cgaccacggc gagctccccg agtcccgggt 1543621 cgccgtcgcc ggaagcccag cagatcctgc aagacagttc caaggcgacg aagggcctgc 1543681 attccgtcca cgtggtggtg acggtaaaca atctctcgac cctcccgttt gagagcgtcg 1543741 atgccgacgt gaccaaccaa ccgcagggca atggccaggc ggtgggcaac gccaaggtca 1543801 gaatgaagcc caacaccccg gtggtggcca ccgagttcct ggtcacgaac aagaccatgt 1543861 acacgaagcg gggcggcgac tatgtctcgg tgggtccggc ggagaagatc tatgacccgg 1543921 gcatcatcct ggacaaggac cgggggctgg gcgcggtcgt cgggcaagtg caaaacccga 1543981 caatccaggg acgtgacgcc atcgacggcc tggccaccgt caaggtgtcc gggaccatcg 1544041 acgccgcggt gatcgatccg atcgtgcctc agctaggtaa gggtgggggc aggctcccga 1544101 taaccttgtg gatcgtcgac accaacgcct caacgccggc acccgccgcg aacctggtgc 1544161 ggatggtcat tgacaaggac caaggcaacg tcgacatcac gctgtccaat tggggtgcgc 1544221 cggtcaccat cccgaacccg gcgggataac aggcgcgaac cggcccggtc cagccccatc 1544281 gctggtcgat ggcctggccg gtccggtact cgtccgcggg cggaggccgc cttcgaagaa 1544341 atcctttgag aattcgccaa ggccgtcgac ccagcatggg gtcagctcgc cagccgcgcc 1544401 ggctggcaac cgttcccgct cgagaaagac ctggaggaat accagtgaca aacgacctcc 1544461 cagacgtccg agagcgtgac ggcggtccac gtcccgctcc tcctgctggc gggccacgct 1544521 tgtcagacgt gtgggtttac aacgggcggg cgtacgacct gagtgagtgg atttccaagc 1544581 atcccggcgg cgccttcttc attgggcgga ccaagaaccg cgacatcacc gcaatcgtca 1544641 agtcctacca tcgtgatccg gcgattgtcg agcgaatcct gcagcggagg tacgcgttgg 1544701 gccgcgacgc aacccctagg gacatccacc ccaagcacaa tgcaccggca tttctgttca 1544761 aagacgactt caacagctgg cgggacaccc cgaagtatcg attcgacgac cccaacgatc 1544821 tgctgcaccg ggtcaaagcg cggctagccg agccagcgct ggccgcccgg atcaagcgca 1544881 tggacacact ctatcaacgc catcgttgca gtactggccg tgggttattt cgcggttcag 1544941 ggtgtgcggt tggtggaacc gagctggatg ccgctgtggg ccttcgtgat tgcgatggtt 1545001 ctgctgcgca gttcgttggc cgggttcggt cattacgcac tgcaccgcgc gcaacgaggc 1545061 ctcaaccggg ttttcaacaa tgccttcgat ctcaactatg tggccttgtc cttagtcacc 1545121 gccgacggac acaccctgct gcaccacccg tatacccaga gcgaggtgga catcaagaag 1545181 aacgtgttca cgatgatgat gcggctaccg tggttgtatc gcgttcccgt acatacgatt 1545241 cacaaatttg gccacatgct cagcggcatg gcgatccgga tcgtcgacgt cttcaggatc 1545301 acgcgcaagg taggtgtcga ggaatcctac ggaagctggc gtgccgcgct tccacacttc 1545361 cttggatcgg ccggggtgcg cttgcttctg gtgagtgaat tggtggtctt cgcgatcgcc 1545421 ggcgacttct ggccctgggc actgcaattc gtagcgacgc tgtgggttag taccttcttg 1545481 gtggtggcga gccatgagtt cgaggacgac acccagggcg gtgccgtcaa cggcgaggac 1545541 tggggcatag atcaactcga gcacgctaat gacctaacgg tgatcgggaa ccgctacgtc 1545601 gactgcttcc tgtcagccgg cctgagctcc caccgagtcc atcacgtgct gccgtttcag 1545661 cgcagcggct tcgcgaacat cgtcaccgag gacgttttgc gtgaggaagc agcgaagttc 1545721 ggtgtcgagt ggcttcccgc aaagggtttc atcaccgatc ggctgccgag gctgtgtcgg 1545781 aagtatctgt tgacgccgtc gcgccaagcc aaggagcgtc attggggttt cgtccgcgag 1545841 cactgctcgc cggcggcatt gaaagccagt gccagctacg tggttgcggg tttcgtcgga 1545901 atcgggtcgg tatgaacgtc tcagctgaga gcggtgcgcc gcgccgggcc ggccagaggc 1545961 atgaggttgg ccttgcccag ttgccgccgg ctccgcccac cacggtggcg gtgattgaag 1546021 ggcttgcgac gggcacgccg cgtcgggtag tcaaccagtc cgacgccgcc gatcgggtcg 1546081 ccgagctttt cctcgatccc ggtcagcggg aacggattcc gcgggtgtat caaaaatcgc 1546141 ggatcaccac gcgccggatg gcggtcgacc cgctcgacgc caaatttgat gtcttcaggc 1546201 gggaacctgc gacgatccgt gatcggatgc atctgttcta cgaacacgcg gttccgctgg 1546261 cggtggacgt gagcaagcgt gccctggccg gcctgccata ccgtgccgcc gagatcgggc 1546321 tgctggtgtt ggccaccagc accggattca tcgcgccggg cgtggacgtt gcgatcgtca 1546381 aagagctcgg gctctccccg tcgatatcac gtgtcgtggt caatttcatg ggatgtgccg 1546441 ccgcgatgaa tgccctgggc accgccacca actatgttcg tgcccacccg gccatgaagg 1546501 cgctggtggt gtgtatcgaa ttgtgctcgg tgaacgctgt ttttgccgac gacatcaacg 1546561 acgtcgtcat tcacagcttg tttggcgacg ggtgcgcggc gttggtgatc ggcgccagcc 1546621 aggttcagga gaagctcgag ccaggcaagg tggtagtccg cagtagtttc agtcagctgc 1546681 tcgacaacac cgaagacggt atcgtgcttg gcgtcaatca caacggcatc acctgcgagc 1546741 tgtcggagaa tctccccggc tacatcttca gcggggtcgc accggtggtg acagagatgt 1546801 tatgggacaa tggattacag atatccgata tcgatctctg ggcgatccat ccgggtggcc 1546861 ccaagatcat cgagcagtcg gtgcgctcgc tggggatctc cgcggagctg gcggcgcaga 1546921 gctgggacgt gctcgcccgc ttcggcaaca tgctcagcgt atcgcttatc tttgtgctag 1546981 agacgatggt gcagcaggcg gagtcggcca aagccatctc gacgggggtg gcgttcgcgt 1547041 tcgggccggg cgtcactgtc gaaggcatgc tgttcgacat catccgacgg tgaccgccat 1547101 gaattcagaa cacccgatga ccgaccgggt tgtgtatcga tcgttgatgg ccgacaacct 1547161 gcgatgggat gccctgcaat tgcgcgacgg cgacatcatt atctcggcgc cgtccaagag 1547221 cggcctgacc tggacacagc gcctggtgtc cctgctggtg ttcgacgggc ccgacttgcc 1547281 cggacccttg tcgacggtgt ccccgtggct cgaccagacc attcggccca tcgaggaagt 1547341 ggtcgctact ctcgatgccc agcagcaccg ccggttcatc aagacccaca cgccgttgga 1547401 cggcctggtg ctcgacgacc gcgtcagcta catctgcgta ggacgcgacc cgcgcgatgc 1547461 cgcggtgtca atgctgtacc aatcggccaa catgaacgaa gaccggatgc ggattctgca 1547521 cgaggccgta gtgccgtttc acgagcgaat cgcccccccc gtttgcggaa ctcggtcatg 1547581 cgcgcagccc gaccgaggag ttccgggatt ggatggaggg gccgaatcag cctccccctg 1547641 gcataggttt cacacatctg aaggggatcg gcactctggc caacatcctg caccagctag 1547701 gcacggtatg ggtccgccgt cacctaccca acgtggcctt gtttcattac gccgattacc 1547761 aggcggactt ggcgggcgag ctgctccggc tggcaagggt cctcggtatc gccgcgaccc 1547821 gcgatcgagc ccgggacctg gcgcagtacg ccacgctgga tgcgatgcgc tcccgcgcgt 1547881 cagaaatcgc tcctaacacc accgacggca tctggcacag tgacgagcgt ttcttccgcc 1547941 ggggcgggag tggcgactgg cagcagttct tcaccgaagc cgagcacctg cgctactacc 1548001 accgcatcaa ccagctggcg ccacctgatc tgctggcctg ggcacacgag ggccgccggg 1548061 gatacgaccc ggccaactga ggttcagtgc cgcattctct cctgtcagtt gctgcacttt 1548121 agacgctcaa tgcgctgcga caacattaaa tgtcagcagt cacacccagt gtgggggaaa 1548181 tttgcatatg cgatttagtt gtgtgtagct tgctttgctg tctgtacgac tgcaccgagg 1548241 ggtgagcgcg tgtcgcacga aagtctgttc gaagaaagcg aagcgcccta cgcggcgctg 1548301 tgcgtagttg ccaacttcac gacagacggc gagtgagcag gcgctcatca ccagggctac 1548361 gagcccagca caggggacgc ggtgaagcgc atgtcccacg aatccgtgtt ccaacagagt 1548421 gaagcgctct acacggcata tttttcgccc aacggcgaat gagcgagcgc cgatcggtgc 1548481 gttaggccgg gcgggcgacc gcccccgtcg cccctttaag tgcgcatgtg cgtagtccag 1548541 tcgagggtcg ggagctggcc cagtgcccca agatgcgatg cggctggcca cattctcatc 1548601 cgcaacgcta gttaccacaa gtcacaccat acccattttg gcagaaacta ttgcacatac 1548661 agataattgt cggtagcttg tcttgcggtg cagagaacgg aggagggaat cgcgtgcccc 1548721 acgaaatctt gtttgacgcg gacgaaaagg cattctcggc gttttgcatt atctcgttta 1548781 cgaccgacag cgagtgaagc tgcggtcatc gggggcgcca ctcccagaga ggagaggagg 1548841 tgaatcgcat gtcacaggaa accttgttcc aagaaagcca agcgctctac gccgcgtatt 1548901 tctttgcggc cgacggtgaa tgaccggtcg ccgattggcg cgattccccg cattcagggc 1548961 tggcgtagcg caagacgatg acgtggggtc gaccctgagt cagggctcga cgacaggtgt 1549021 gttgtcgggc ccgaattggt cgtactggcc aagccgtgta ttagggtctg cggacccgac 1549081 gacgatcgct caccggcacg gcacccaccg catcactagc ccggacgaga cctggctggc 1549141 cctgcagccc tttctcgcgc cagcaggcat taccagggtc gccgacgtga catggctgga 1549201 ttgtcttggc attccaacgg ttcaggcggt gcgcccagca tcgctgacgt tgtcggtcag 1549261 ccagggcaaa gccgccagct atcgggctgc ccaggtctcg gcggtgatgg agtccttgga 1549321 gggatggcac gccgagaacg tcactgccga cttgtggtct gcgaccgccc gggatctcga 1549381 ggcagacctg acttacgacc ccgcccaact tcgccaccgg ccgggcagcc tctaccacgc 1549441 cggcgtcaag ctcgattgga tggtcgcgac gacgttgctg accggtcgcc ggacctgggt 1549501 accgtggacg gcggtgctgg tgaacgtggc aacccgcgat tgctgggaac cgccgatgtt 1549561 cgagatggac accaccggac tggcctccgg caactgctac gacgaggcca ccttgcacgc 1549621 cttgtacgag gtgatggagc ggcatagcgt ggctgcagcg gtcgccggag agaccatgtt 1549681 cgaggtgcca actgacgatg tcgccggctc tgacagcgcc cacctggttg agatgatccg 1549741 tgacgccggg gacgatgtgg accttgcccg catcgatgtc tgggacggtt actactgttt 1549801 tgccgccgag ctcacctccg cgacgctgga ggtgaccttc ggcgggttcg ggttacacca 1549861 cgaccctaac gtggcgttat cgccggcgat caccgaagcc gcccagtcgc gcatcacggc 1549921 aatcagcgga gcccgcgagg acctcccgtc ggcgatctac caccggttcg gccgggtgca 1549981 tacatacgcg aaggcgcgaa agacgtcgtt gcggctgaac cgcgcgcggc cgacaccgtg 1550041 gcgggtgccc gatgtcgact cgctgcccga gttggtggcg tcggcggcga cggcggtggc 1550101 caaccgatcc ggcaccgagc cgctggcggt cgtgtgcgac ttcgccgatg cctgtgtccc 1550161 cgtggtgaag gtgctcgccc cgggcctcgt gctgtcgagc gcatcgccga tgcgcacacc 1550221 cctacaggag gctgaatgac ggcctgcggc aggattgtcg tcaccgctgg gcccacgatt 1550281 agcgccgcgg acatccgctc ggtggtgccg gatgccgagg tggcgccgcc gattgcgttt 1550341 ggccaggcgc tctcctatga cttgcggtcg ggtgacacgc tgctgattgt cgacggattg 1550401 ttctttcagc agccgtcggt tcgacataag gagcttttga cgttgatggc cgacggtgtc 1550461 cgagtcgtcg gatcgtcgag catgggcgcc ctgcgggccg ctgagctgca tccattcggc 1550521 atggagggct atggctgggt cttcgaaagc taccgagatg gggtactcga ggccgacgat 1550581 gaggtcggcg tggtgcacgg cgacgccgac gacggctacc cggtcttcgt cgacgcgctg 1550641 gtgaacatgc gccacaccct ggcgcgggcc gtcgcaactg gtgtggtgtg ctccgagctg 1550701 gccgagcgga tcatcgagac cgcgcgggcc acaccgttca ccatgcgcac ctgggcgcgg 1550761 ctgctgagtg aggtcggcgc cccggaccag cgcggcctcg ccgcacagtt gcggtcactg 1550821 cgggtcgatg tcaaacacgc cgatgcgctg ctggcgttgc ggcagctcgg ccagcgcccc 1550881 cgggtggagc cgcttcgtcc gggtccgccg cccaccgtgt ggtcgcggcg gtggcggcag 1550941 ccatgggcac cgcccacctc cgtcgccgca tcggccgacc acggcgagtc ttttgtcgac 1551001 gtcaccgact tggaggtctt gtcgtttttg agcgtgagct cggttgacta ctgggcctac 1551061 cggccagcac tgcaacaggt cgctgcctgg tactggacgt tgaaacaccc cgaacaatcc 1551121 ggaagcgtcg gtgagcgtgc cgcacgagcc gtcgccgagg tggcatcgga gggctacggg 1551181 cgcgccctgg aattcattgc ctatcgctac gcacttgcca ccggcatcat cgacgagacc 1551241 ggctttcccg aggcggtcgc agcgcattgg ctcaccaccg aagagcgcca cggcctgggc 1551301 aatgacccca tctcgatctc ggcgcgagtg atcacccgca cgttgttcgt cgtccggtta 1551361 ttgccggcga tcgaccattt ccttgacctg ctgcggaagg actcccgact gccccgatgg 1551421 cgtgccatgg cggcccacgc actctgcaag cgcgacgatc tggcccggca aaagccgcac 1551481 ctgaacctgg gccggcccga tccgacgcaa ttgaagcgcc tctttggggc ccgatggggg 1551541 acccaggtga accgcatcga gttggcccgg cgtggactga tgaccgagga cgccttctat 1551601 gctgccgcca ccccgttcgc cgtcgcggcc gtcgacgacc aactgccgcg catcgaggtc 1551661 ggcaccttag gacccgcgcc gctgagcgcg gacgttccag aacgccattt cgacttcggt 1551721 tccgtctaac tcgcggcgca cggtggcggg ctccagcgac tcgatatccc agccagcgcc 1551781 accgaggacg tcgcgcagcg tttgctcgga taccgtcgac cgcggccatt cctcatcggg 1551841 cggcatggcg ttggagaagc agctgagtag cagggtggcg cccggtcggg tggcccggtg 1551901 caccgaggcg gcgtagctgc gcttgccgtc gtcgtctagg cagtggaaca tcccgcagtc 1551961 gatcacggta tcgaacgcgc cggtgtagcc ggtcagcttg gtggcgtcac ccactgcgaa 1552021 cttgacatcg actccggcgt cgctggctcg ccgtttggcg gtggtcagcg cggtgggaga 1552081 gatgtccaac ccggtcacct ggtagccgtt cctggcgagg tagatcgcgt tgtcaccgag 1552141 cccgcacccg atgtcgagca cgtcgccgtg cacccagccg ccggtgtgcc agccgatgac 1552201 attgtccttg ggcgctttgg tgtcccacgg cggtgtcgtg atcggcggga ggccctcgcc 1552261 ggggctttcg ccacggtaga gcgcgtcgaa atctatacct ggcatgctgg ccagcttagg 1552321 cggcgtgtag gtgggtgagg gcgacaccga ttctggcttc cacctggcta acgtctatct 1552381 ccaacggccc gggcagtggt ggcgcggtgc agtgatagta catcccggtg ggcgtggtga 1552441 attcagctgt gtgacggccg gtttcgtccg tgtcggtgct gacgcgccag ccgggagctt 1552501 ccttgacgta gttacagcgt tcacacgatc cgaggccgtt ggtcgcggtg gtcgggccgc 1552561 ctcgatgatg cggctgggcg tggtcacggt ggcggatcgg ggcatcgcag tagggcatgc 1552621 gacagcgctg atcgcgcaac ccgatgaacg cggccagccc cttcgggaac cggcgtgccc 1552681 gcgattccat cgccaccaag gcccccgagc gcggatgacg gtagagccgg cgcagcgtgg 1552741 cccgtgaccg cgtatcggca accgcgtcgc gcaccaggtt gcgggccacg gccgccggga 1552801 tggggccata cccgtcgacc accgccgggg cgcggtcgcc agctaacagt gtctcgtcgg 1552861 agagcaccag gttgaccgct accggttggg ccgcctcggc gggttgtccg gtgacccgct 1552921 cgaccaacgt gtcggccatt acctggcccc gtgtccgatc gtcgaatgtc gtgtcggcgg 1552981 cccgcttgag cgccgcatag accgacacgc ctcgggccac cggaagcaac gccgtcaccc 1553041 aggtcatggt gtcgggggcc gggcggatcg tcaccgtgcg ttcggtctcg gccctggcgg 1553101 cccgctccac caccgcctgg gcatcgagcc ggtaggcaat cgcccgggcc gcggcggcga 1553161 tccgcgcatc acccatcccg tccaatgcgg acatgtcggc gcacagctcg gcgtcgagtg 1553221 cgcggcgatc ctcgacgtcc aggcaggccg actcccgcac gatcagcgtg gcccgccact 1553281 ccgatagccg cccgacctcg agcgcggcga gtgtgtgcgg catctcatac accaacgcct 1553341 tcgcgaaccc caggtggcgc ccgccgcgcg ccggcgaatc ccgtcgcgcc agcgctactt 1553401 cactggccac cccacgcccg cgccgccgtg ccggcacccc cgcatccgcc tcattgcagc 1553461 gacgcaactt gtccagcgcc gccgcagcac gtgcctgacc ggcggccgcg gccgatttga 1553521 cccgctccag ctcggcgatc cgcgcggtca ggctcgcctc atcgtcgcgc gaatccacgc 1553581 ccgcgaggct cactaaatcg aacatgtgtt cgagtatagc aggcctgggc caccacggcc 1553641 accgcaccgc gggcccgcag cgtgcgagtg ctacgctgcc gagcggtcga catcctttaa 1553701 cgatccgtcc agagaggtgg agaaggaggt caaggtttcc catgggtgct gcgggtgatg 1553761 ccgcaatcgg ccgggagtcc cgcgagttga tgtccgcggc cgacgtcggc cgcacgattt 1553821 cgcgcatcgc gcatcagatt atcgagaaga ccgcgttaga tgacccagtc ggacccgacg 1553881 cgccgcgggt ggtgctgctg ggaatcccga cccgtggcgt gacgctggcg aatcgcctgg 1553941 ccggcaatat caccgaatac agcggcatcc acgtcggcca tggcgcgctg gacatcaccc 1554001 tgtaccgcga cgatctgatg atcaagccgc cgcggccctt ggcgtcgacg tcgatcccgg 1554061 ccggtgggat cgatgacgcg ctggtgatcc tggtcgatga cgtgctctac tccgggcgct 1554121 cggtgcgttc cgccctggac gcgctgcgcg acgtgggccg gccgcgggcg gtgcaattgg 1554181 cggtgctggt cgacaggggt caccgggaac tgccgctgcg cgccgactat gtgggcaaga 1554241 acgttccgac ctcgcgcagc gagagcgtgc acgtgcggct gcgcgagcac gacggccgtg 1554301 acggcgtggt gatctcgcga tgaccccaag gcacctgctg accgccgccg acctcagccg 1554361 cgacgacgcc accgccatcc tcgacgacgc cgaccggttt gcgcaggcgc tggtcggtcg 1554421 cgacatcaag aagctgccga cgctgcgggg ccggaccgtc gtcacgatgt tctatgagaa 1554481 ctccacccgc acccgggtgt cgttcgaggt agcgggtaag tggatgagcg ccgacgtgat 1554541 caacgtcagc gctgccggat cttcggtagg caagggtgag tcgctgcggg ataccgcgct 1554601 gaccctgcgc gcggccgggg ctgacgcgct gatcatccgc catcccgcgt ccggcgccgc 1554661 ccatctgctg gcgcagtgga ccggcgccca caacgatggg ccggcggtga tcaacgccgg 1554721 tgacggcact catgaacacc ccacgcaggc gctgcttgat gcgctgacca tccgtcagcg 1554781 cctcggcggc atcgaaggcc ggcgcatcgt gatcgtcggc gacatcctgc acagccgggt 1554841 cgcccgctcc aacgtcatgc tgctggacac cctgggcgcc gaggtggtgc tggtggcgcc 1554901 acccacattg ctaccggtcg gggtgaccgg ctggccggcc accgtctccc acgacttcga 1554961 tgccgagctg cccgccgccg acgcggtatt gatgctgcgg gtacaggccg agcggatgaa 1555021 cggcggtttt ttcccgtccg tacgggagta ctcggtccgc tacgggctaa ccgagcggcg 1555081 ccaggcgatg cttcccggcc acgccgtggt gttgcacccg ggaccgatgg tgcgtggcat 1555141 ggagatcaca tcttcggtcg cggactcgtc gcaatcggct gtgctgcaac aggtttccaa 1555201 tggagtccag gtgcggatgg cggtgctgtt ccatgtgctg gtgggagcgc aggatgccgg 1555261 taaagagggt gcggcgtgag cgtgctgatt cgtggtgtgc ggccctacgg cgagggggag 1555321 cgggtcgacg tactcgtcga tgacggccag atcgcccaga taggaccgga tctggcgatc 1555381 cccgatacgg ccgatgtcat tgacgccacc ggacacgtgc tgctgcccgg gttcgtcgat 1555441 ctgcacaccc atctgcgcga gccgggccgc gagtatgccg aggacatcga aaccggttcg 1555501 gccgcggccg ctttgggcgg ctacaccgcg gtgttcgcga tggccaacac caaccccgtg 1555561 gccgacagcc cggtggtcac cgaccacgtc tggcaccgcg gccagcaggt cggcctggtc 1555621 gacgtgcacc ccgtcggcgc ggtcaccgtc gggctggccg gagccgagct gaccgagatg 1555681 ggcatgatga acgccggcgc cgcccaggtg cggatgttct ccgacgacgg ggtctgcgtg 1555741 catgacccgc tgatcatgcg ccgcgccctg gaatatgcca ccggtttggg cgtgctgatc 1555801 gcccagcacg ccgaggagcc ccggctgacg gtcggcgcct tcgcgcacga gggacccatg 1555861 gcggcgcggc tgggcctggc gggatggccg cgggccgccg aggaatcgat cgtcgcccgc 1555921 gacgccttgc tggcccgtga cgccggcgcc cgggtgcaca tctgtcacgc gtcggccgcg 1555981 ggcaccgtcg aaatcctgaa atgggctaag gaccagggta tttcgatcac cgccgaggtc 1556041 accccccacc acctgttgct cgacgatgcc agattggcca gctatgacgg cgtgaaccgg 1556101 gtcaacccgc cgctgcgcga agcttccgac gcggtcgccc tgcgacaggc gctggccgac 1556161 gggatcatcg actgtgtggc cacagatcac gccccgcatg ccgagcacga gaaatgcgtc 1556221 gaattcgccg cggcccggcc cggcatgctc gggttgcaga cggcattgtc ggtggtggtg 1556281 cagacaatgg tggcgcccgg cttgttgagt tggcgcgata tcgcgcgggt gatgagtgag 1556341 aacccggcgt gcatcgcacg cttgcccgat cagggccggc cactggaggt gggggagccg 1556401 gccaacctga cggtggtgga ccccgacgcc acctggacgg tcaccggcgc cgacctggcc 1556461 agccggtcgg ccaacacgcc gtttgagtcg atgagcctgc ccgccaccgt gaccgcgacc 1556521 ctgctgcgcg ggaaggtgac cgcgcgcgac gggaagatcc gggcatgaac tccggcacgc 1556581 tggcggggtc gctgatcttc gcggcggtgc tcgtcatgct gatcgcggtg ctcgctcggc 1556641 tgatgatgcg cggctggcgg cgccgttcgg agcggcaggc ggagctgctc ggcgacttgc 1556701 ccgacgtgcc cgagcacgtg agctcggcca cggtcaccac ccgcggcctg tacgtgggcg 1556761 ccacgctgtc gccggcctgg aacgagcggg tcaccgtcgg tgatctcggg tatcgcagca 1556821 aggcggtgct cacccggtat ccgtcgggca tcatggtgga acgcgcacgg gctcagccga 1556881 tttggattcc tacggagtcg atcgccgcca ttcgcatgga acgcggcgtc gccggcaagg 1556941 tggtggccgg catcgggata ctcgcgatcc gttggcgact gccgtccggc accgagatcg 1557001 atgtcgggtt tcgggcagac aaccgcgacg aataccagga gtggctggag gaacccgttt 1557061 gagcaaagcc gtattggtcc tcgaagacgg ccgggtgttc accggcaggc cgttcggcgc 1557121 gaccggacaa gcgctcgggg aggccgtgtt ttccaccggc atgtccggtt atcaggagac 1557181 gctgaccgat cccagctatc accgtcagat cgtggtggcc accgcgccgc agatcggcaa 1557241 caccggctgg aacggcgagg actccgaaag ccgaggggag cggatctggg tcgccggtta 1557301 cgcggtgcgc gacccgtcgc cgcgcgcgtc caactggcgc gccaccggca cgttggaaga 1557361 cgaactcatc cgccagcgca tcgtcgggat cgccggcatc gacacccggg gcgtggtgcg 1557421 ccatctgcgc agccgcgggt cgatgaaggc gggggtgttc tccgacgggg cgctggccga 1557481 gcctgccgac ttgatcgcgc gggtgcgagc acaacagtcg atgctgggcg ccgatctggc 1557541 cggcgaggtc agcaccgcgg agccgtatgt cgtcgaaccc gacgggccac cgggtgtttc 1557601 gaggttcacc gtggccgccc tagatcttgg tatcaagacc aacactccgc gtaacttcgc 1557661 ccggcgcggg attcgctgcc atgtgctgcc ggcatcgacc accttcgagc agatcgccga 1557721 actcaacccg catggcgtgt tcttgtccaa cggccccggc gacccggcca ccgccgatca 1557781 cgtcgtcgcg cttacccgcg aggtgctggg cgccggaatc ccgttgttcg gcatctgttt 1557841 cggcaaccag atcctgggcc gcgcgctggg cctgtcgacc tacaagatgg tgtttgggca 1557901 ccgcggcatc aacatcccgg tcgtcgacca cgccaccggt cgggtggcgg tgaccgcgca 1557961 aaaccatggc ttcgcccttc agggggaggc gggccaatcc ttcgccaccc cgttcggtcc 1558021 cgcggtggtc agccacacct gcgccaacga cggtgtggtc gaaggcgtca agctcgttga 1558081 cgggcgggcg ttttcggtgc aataccaccc ggaagccgcc gccggcccgc acgatgccga 1558141 gtacctgttc gaccagttcg tggagctgat ggcaggggag ggccgctagt gccccgtcgc 1558201 accgatctgc accacgtgct ggtcatcggc tccgggccga tcgtcatcgg ccaggcgtgc 1558261 gagttcgact actccgggac tcaggcgtgc cgggtgctgc gcgccgaggg cttgcaggtc 1558321 agcctggtga actctaatcc ggccaccatc atgaccgacc cggagttcgc cgaccacacc 1558381 tacgtagagc ccatcacccc ggcgttcgtg gagcgggtta tcgcccaaca ggccgagcgg 1558441 ggcaacaaga tcgacgccct gctggcgacc ctgggtgggc agaccgcgct gaacaccgcg 1558501 gtcgcgctgt acgagagcgg ggtgctggaa aagtacggcg tggaactcat cggcgccgat 1558561 ttcgacgcca tccagcgcgg cgaggaccgg cagcggttca aggacatcgt cgccaaggcc 1558621 ggtggcgaat ccgcccggag ccgagtgtgt ttcaccatgg ccgaagtgcg tgagacggtc 1558681 gccgagctcg gcctgccggt ggtggtgcgg ccgagcttca ccatgggcgg gctgggttcg 1558741 gggatagcgt actccaccga cgaggtcgac cggatggccg gcgccgggct ggcggcctcg 1558801 cccagcgcca acgtgctcat cgaggaatcg atttacggct ggaaggaatt cgaactcgag 1558861 ctgatgcgcg acggccacga caacgtggtg gtggtgtgct cgatcgaaaa cgtcgacccg 1558921 atgggtgtgc acaccggcga ctcggtcacc gtcgcgccgg cgatgacgtt gaccgaccgg 1558981 gaataccagc ggatgcgcga cctgggcatc gcgatcctgc gcgaggtggg tgtggacacc 1559041 ggcggctgca acatccagtt cgcggtcaac ccgcgcgacg gtcggctgat cgtcatcgag 1559101 atgaacccgc gggtgtcgcg ttccagtgcg ttggcgtcca aggcgaccgg ctttccgatc 1559161 gccaagatcg ccgccaaact ggccatcggt tacaccctcg acgagatcgt caacgacatc 1559221 acaggggaaa cgccggcctg tttcgaaccc accctggact acgtggtggt caaggcgccg 1559281 cggttcgcgt tcgagaagtt ccccggtgcc gatcccaccc tgaccaccac catgaaatct 1559341 gtcggtgagg caatgtcgtt gggccgcaac ttcgtcgagg cgctcggcaa ggtgatgcgc 1559401 tcgctggaga cgacccgcgc cgggttctgg acggcaccgg atcccgacgg cggcatcgag 1559461 gaagccctga cccggctgcg gaccccggcc gaaggccggc tctacgacat cgagctggcg 1559521 ttgcggctgg gtgcgacggt ggaacgggtg gccgaggcca gcggtgtcga cccgtggttc 1559581 atcgcgcaga tcaacgagct ggtcaatctg cgcaacgaac tcgtcgcggc acccgtgctg 1559641 aacgccgagc tgctgcggcg cgccaagcac agcggactat cggatcacca gatcgcgtcg 1559701 ctgagaccgg aattggccgg cgaggccggc gtgcggtcac tgcgcgtgcg cctgggcatc 1559761 cacccggtat acaagacggt ggacacctgc gcggcggagt tcgaagccca aaccccctac 1559821 cactacagca gctacgagct cgaccccgcc gccgaaacag aggtggcccc gcagaccgaa 1559881 aggcccaagg tgctgatcct cggttcgggg cccaatcgga tcggccaggg tatcgagttc 1559941 gactacagct gcgtacacgc ggcaaccacg ttgagccagg ctggctttga gaccgtgatg 1560001 gtcaactgca acccggagac ggtgtccacc gactacgaca ccgcggacag gttgtacttc 1560061 gagccgttga cgttcgagga cgtcttggag gtctaccacg ccgaaatgga atccggtagc 1560121 ggtggcccgg gagtggccgg cgtcatcgtg cagctcggcg gccagacccc gctcgggctg 1560181 gcgcaccggc tcgccgacgc cggggtcccg atcgtgggca ccccaccgga ggccatcgac 1560241 ctggccgagg atcgcggcgc gttcggcgac ctgctgagcg ccgccggact gccggcgcca 1560301 aagtacggca ccgcaaccac tttcgcccag gcccgccgga tcgccgagga gatcggctat 1560361 ccggtgctgg tgcggccgtc gtatgtgctc ggtggtcgcg gcatggagat cgtgtatgac 1560421 gaagaaacgt tgcagggcta catcacccgc gccactcagc tatcccccga acacccggtg 1560481 ctcgtcgacc gcttcctcga ggacgcggtc gagatcgacg tcgacgcgct gtgtgatggc 1560541 gccgaggtct atatcggcgg gatcatggag cacatcgagg aggccggcat ccactccggt 1560601 gactcggcct gtgcgctgcc accggtcacg ttgggccgca gcgacatcga gaaggtgcgt 1560661 aaggccactg aagccattgc gcatggcatc ggcgtggtgg ggctgctcaa cgtgcagtac 1560721 gcgctcaagg atgacgtgct ctacgtcctg gaagccaacc cgagagcgag ccgtaccgtt 1560781 ccgtttgtat ccaaggccac agcggtgcca ctcgccaagg catgcgcccg gatcatgttg 1560841 ggcgccacca ttgcccagct gcgcgccgaa ggcttgctgg cggtcaccgg ggatggcgcc 1560901 cacgcggcgc gaaacgcccc catcgcggtc aaggaggccg tgttgccgtt tcaccggttc 1560961 cggcgcgccg acggggccgc catcgactcg ctactcggcc cggagatgaa atcgaccggc 1561021 gaggtgatgg gcatcgaccg cgacttcggc agcgcgttcg ccaagagcca gaccgccgcc 1561081 tacgggtcgc tgccggccca gggcacagtg ttcgtgtcgg tggccaaccg ggacaagcgg 1561141 tcgctggtgt ttccggtcaa acgattggcc gacctgggtt ttcgcgtcct tgccaccgaa 1561201 ggcaccgcag agatgttgcg ccgcaacggt attccctgcg acgacgtccg caaacatttc 1561261 gagccggcgc agcccggccg ccccacaatg tcggcggtgg acgcgatccg agccggcgag 1561321 gtcaacatgg tgatcaacac tccctatggc aactccggtc cgcgcatcga cggctatgag 1561381 atccgttcgg cggcggtggc cggcaacatc ccgtgcatca ccacggtgca gggcgcatcc 1561441 gccgccgtgc aggggataga ggccgggatc cgcggcgaca tcggggtgcg ctccctgcag 1561501 gagctgcacc gggtgatcgg gggcgtcgag cggtgaccgg gttcggtctc cggttggccg 1561561 aggcaaaggc acgccgcggc ccgttgtgtc tgggcatcga tccgcatccc gagctgctgc 1561621 ggggctggga tctggcgacc acggccgacg ggctggccgc gttctgcgac atctgcgtac 1561681 gggccttcgc tgatttcgcg gtggtcaaac cgcaggtggc gttttttgag tcatacgggg 1561741 ctgccggatt cgcggtgctg gagcgcacca tcgcggaact gcgggccgca gacgtgctgg 1561801 tgttggccga cgccaagcgc ggcgacattg gggcgaccat gtcggcgtat gcgacggcct 1561861 gggtgggcga ctcgccgctg gccgccgacg ccgtgacggc ctcgccctat ttgggcttcg 1561921 gttcgctgcg gccgctgcta gaggtcgcgg ccgcccacgg ccgaggggtg ttcgtgctgg 1561981 cggccacctc caatcccgag ggtgcggcgg tgcagaatgc cgccgccgac ggccgcagcg 1562041 tggcccagtt ggtcgtggac caggtggggg cggccaacga ggcggcagga cccgggcccg 1562101 gatccatcgg cgtggtcgtc ggcgcaacgg cgccacaggc ccccgatctc agcgccttca 1562161 ccgggccggt gctggtgccc ggcgtggggg tgcagggcgg gcgcccggag gcgctgggcg 1562221 gtctgggcgg ggccgcatcg agccagctgt tgcccgcggt ggcgcgcgag gtcttgcggg 1562281 ccggccccgg cgtgcccgaa ttgcgcgccg cgggcgaacg gatgcgcgat gccgtcgcct 1562341 atctcgctgc cgtgtagcgg gtgccctgcc accgcgccgc taaatcccac cagcatgggg 1562401 tggtgagccc agcgctcgtg tgaccaaact caccgccctg ggccgtcgtc acgctgtgtt 1562461 aacctctcgt tcaaatgata ttcatattca atagtggcac taagtgtccg gttgaatccc 1562521 cgttgaaccc ccaacagatg gagtctgtgt cgtgacgttg cgagtcgttc ccgaaagcct 1562581 ggcaggcgcc agcgctgcca tcgaagcagt gaccgctcgc ctggccgccg cgcacgccgc 1562641 ggcggccccg tttatcgcgg cggtcatccc gcctgggtcc gactcggttt cggtgtgcaa 1562701 cgccgttgag ttcagcgttc acggtagtca gcatgtggca atggccgctc agggggttga 1562761 ggagctcggc cgctcggggg tcggggtggc cgaatcgggt gccagttatg ccgctaggga 1562821 tgcgctggcg gcggcgtcgt atctcagcgg tgggctatga ccgagccgtg gatagccttc 1562881 cctcccgagg tgcactcggc gatgctgaac tacggtgcgg gcgttgggcc gatgttgatc 1562941 tccgccacgc agaatgggga gctcagcgcc caatacgcag aagcggcatc cgaggtcgag 1563001 gaattgttgg gggtggtggc ctccgaggga tggcaggggc aagccgccga ggcgtttgtc 1563061 gccgcgtaca tgccgtttct ggcgtggctg atccaagcca gcgccgactg cgtggaaatg 1563121 gccgcccagc aacacgccgt catcgaggcc tacactgccg cggtagagct gatgcctact 1563181 caggtcgaac tggccgccaa ccaaatcaag ctcgcggtgt tggtagcgac caatttcttt 1563241 ggcatcaaca ccattcccat tgcgatcaat gaggccgagt acgtggagat gtgggttcgg 1563301 gccgccacca cgatggcgac ctattcaaca gtctccagat cggcgctctc cgcgatgccg 1563361 cacaccagcc ccccgccgct gatcctgaaa tccgatgaac tgctccccga caccggggag 1563421 gactccgatg aagacggcca caaccatggc ggtcacagtc atggcggtca cgccaggatg 1563481 atcgataact tctttgccga aatcctgcgt ggcgtcagcg cgggccgcat tgtttgggac 1563541 cccgtcaacg gcaccctcaa cggactcgac tacgacgatt acgtctaccc cggtcacgcg 1563601 atctggtggc tggctcgagg cctcgagttt tttcaggatg gtgaacaatt tggcgaactg 1563661 ttgttcacca atccgactgg ggcttttcag ttcctcctct acgtcgttgt ggtggatttg 1563721 ccgacgcaca tagcccagat cgctacctgg ctgggccagt acccgcagtt gctgtcggct 1563781 gccctcactg gcgtcatcgc ccacctggga gcaataactg gtttggcggg cctatccggc 1563841 ctgagcgcca ttccgtctgc tgcgataccc gccgttgtac cggagctgac acccgtcgcg 1563901 gccgcgccgc ctatgttggc ggtcgccggg gtgggccctg cagtcgccgc gccgggcatg 1563961 ctccccgcct cagcacccgc accggcggca gcggccggcg ccaccgcagc cggcccgacg 1564021 ccgccggcga ctggtttcgg aggcttcccg ccctacctgg tcggcggtgg cggcccagga 1564081 atagggttcg gctcgggaca gtcggcccac gccaaggccg cggcgtccga ttccgctgca 1564141 gccgagtcgg cggcccaggc ctcggcgcgt gcgcaggcgc gtgctgcacg gcggggccgc 1564201 tcggcggcga aggcacgtgg ccatcgtgac gaattcgtca cgatggacat gggtttcgac 1564261 gcggcagctc cggccccaga gcaccagccg ggtgcccggg cgtccgactg tggtgcggga 1564321 cctatcggat ttgctggcac ggtgcgcaaa gaggcggtcg tgaaagcggc ggggttgacc 1564381 acgctggccg gtgacgactt cggcggcggc ccaacgatgc cgatgatgcc cggcacctgg 1564441 acccatgatc agggcgtgtt cgacgagcat cgctgatagc tgactgggca gtggctggca 1564501 aacagctgag agagcactcg agagctatcg tcagggcaat gtccgatgat gctgagcacc 1564561 cgcgtttggg gcactagcag ccacgatgat ccttgttggg ttgcaccgcg gagatgtcgg 1564621 cgaaaattgg cagggttgcg ttgacgcaac catggcgcga cacgcgcgat aggtcgccca 1564681 accgcgagtg atccccggca ctgcgagttg cgacgccacc tgccgccacc agtcgtcggc 1564741 cgtcgtcgac cggttgagca ggtccggaaa gccgaaatcc attgttaggc aacactattc 1564801 atgttccatg ccagccatgc cggcacggac acggggctcc gtcgagaggc cttcgaggtc 1564861 gcccggcgga ccgctggccg gtggcacgtg ctactcccac gctgcacgtt tgtccccaaa 1564921 accagggggt cgggttagat ttcgtcagga agcctgagta cggtcgtctg cgctggccgg 1564981 cgtacccggc cgggacaaac aacgatcgat tgatatcgat gagagacgga ggaatcgtgg 1565041 cccttcccca gttgaccgac gagcagcgcg cggccgcgtt ggagaaggct gctgccgcac 1565101 gtcgagcgcg agcagagctc aaggatcggc tcaagcgtgg cggcaccaac ctcacccagg 1565161 tcctcaagga cgcggagagc gatgaagtct tgggcaaaat gaaggtgtct gcgctgcttg 1565221 aggccttgcc aaaggtgggc aaggtcaagg cgcaggagat catgaccgag ctggaaattg 1565281 cgcccacccg ccgccttcgt ggcctcggtg accgtcagcg caaggccctg ctggaaaagt 1565341 tcggctccgc ctaaccccgc cggccgacga tgcgggccgg aaggcctgtg gtgggcgtac 1565401 ccccgcatac gggggagagg cggcctgaca gggccagctc acaattcagg ccgaacgccc 1565461 cgtgggggga acccgcccag gagcgccagt gagcgtcggc gagggaccgg acaccaagcc 1565521 caccgcgcgt ggccaaccgg cggcagtggg acgtgtggtg gtgctgtccg gtccttccgc 1565581 ggtcggcaaa tccacggtgg ttcggtgtct gcgcgagcgg atcccgaatc tgcatttcag 1565641 tgtctcggcc acgacgcggg cgccacgccc gggcgaggtc gacggtgtcg actaccactt 1565701 catcgacccc acccgctttc agcagctcat cgaccagggt gagttgctgg aatgggcaga 1565761 aatccacggc ggcctgcacc ggtcgggcac tttggcccag ccggtgcggg cggccgcggc 1565821 gactggtgtg ccggtgctta tcgaggttga cctggccggg gccagggcga tcaagaagac 1565881 gatgcccgag gctgtcaccg tgtttctggc gccacctagc tggcaggatc ttcaggccag 1565941 actgattggc cgcggcaccg aaacagctga cgttatccaa cgccgcctgg acaccgcgcg 1566001 gatcgaattg gcagcgcagg gcgactttga caaggtcgtg gtgaacaggc gattagagtc 1566061 tgcgtgtgcg gaattggtat ccttgctggt gggaacggca ccgggctccc cgtgacccac 1566121 gtcgtgacta gtcagtattt agctttccaa gccgctctac gccgccagga gaaatttcac 1566181 gtgagtatct cgcagtccga cgcgtcgttg gccgccgtcc ccgccgtgga tcagttcgat 1566241 ccgtcgtcag gtgcatcagg tggctacgac accccgctgg gcatcaccaa tccgcccatc 1566301 gacgagttgc tggaccgcgt ctcgagcaaa tacgccctcg tgatctatgc ggcaaagcgt 1566361 gcccggcaga tcaacgacta ctacaaccag cttggcgagg gcatcctcga atatgtcggt 1566421 ccgctggttg agccggggtt gcaagagaag ccgttgtcca tcgcgttgcg cgagatccac 1566481 gccgatctgc tcgagcacac cgagggcgag tagcagggca ggcctgaggt ggtggaccat 1566541 aaacggatcc ccaagcaggt aatagtcggt gtctccgggg gcatcgccgc ctacaaggcg 1566601 tgcacggttg ttcgtcaact caccgaggcc agtcatcgcg tccgagtcat tcccaccgaa 1566661 tccgccctgc gcttcgtcgg tgccgcgacc ttcgaggcgc tctccggtga gccggtgtgc 1566721 accgacgttt tcgccgacgt tccggcggtc ccgcatgttc acctcggcca gcaggccgat 1566781 ctggtcgtag tggcgccggc caccgccgac ctgctggccc gcgcggcggc cggtcgagcc 1566841 gacgatctgc tgaccgcgac gctgctgacg gcgcggtgtc cggtgctgtt cgcgccggcg 1566901 atgcacaccg agatgtggtt gcatccggcc accgtcgaca acgtggccac gctgcgccgc 1566961 cgcggcgcgg tggtgctcga gcccgcgaca ggacggctta ccggcgccga cagcggggcc 1567021 ggccgactgc ccgaggcgga ggagatcacc accctcgccc agctgctgct ggagcggcac 1567081 gacgccctgc cctacgatct cgcggggcga aagctgctgg ttaccgccgg tggcacacgc 1567141 gagccgatcg atccggtgcg ctttatcggc aaccgcagct ccggcaagca gggctatgcg 1567201 gtggcgcggg tggccgccca gcgcggcgcc gacgttactt tgatcgctgg gcataccgca 1567261 gggctcgtcg atcccgccgg cgtcgaggtg gtgcacgtca gctcggccca gcaactcgcc 1567321 gacgcggtgt ccaagcacgc tccgaccgcc gacgtattgg tgatggcggc ggccgtcgcc 1567381 gacttccggc ccgcgcaggt tgccaccgcc aaaatcaaga aaggcgtcga aggcccaccg 1567441 accatcgagc tgctgcgcaa cgacgacgtg ctggccgggg tggtgcgggc ccgagcccat 1567501 ggacaactgc ccaacatgcg ggccattgtg ggcttcgcag ccgagaccgg cgacgccaat 1567561 ggcgacgtgc tctttcatgc ccgagctaaa ctgcgacgca aaggctgcga tctgttagtc 1567621 gtcaatgccg tcggcgaagg cagggccttt gaggtagaca gcaacgacgg ctggctactg 1567681 gcgtccgatg gtaccgagtc ggcattgcag cacggctcca agacactgat ggcgagccgt 1567741 atcgttgatg caatcgtcac gttcctggca ggctgtagca gctaacgggt ccggcggccg 1567801 gttctgtacg ggtcctggac aggtgctgga cgatcccttg ctcgattgga cgagctgaga 1567861 ttgatgcctg aggatataat tcggctaact atttatcgga aggatgacga tagtgagcga 1567921 aaagggtcgg ctgtttacca gtgagtcggt gacagaggga catcccgaca agatctgtga 1567981 cgccatcagc gactcggttc tggacgcgct tctagcggcg gacccgcgct cacgtgtcgc 1568041 ggtcgagacg ctggtgacca ccgggcaggt gcacgtggtg ggtgaggtgg ccacctcggc 1568101 taaggaggcg tttgccgaca tcaccaacac ggtccgcgca cggatcctcg agatcggcta 1568161 cgactcgtcg gacaagggtt tcgacggggc gacctgcggg gtgaacatcg gcatcggcgc 1568221 acagtcaccc gacatcgccc agggggtcga caccgcccac gaggcccggg tcgagggcgc 1568281 ggccgatccg ctggactccc agggcgccgg tgaccagggc ctgatgttcg gctacgcgat 1568341 caatgccacc ccggaactga tgccactgcc catcgcgctg gcccaccgac tgtcgcggcg 1568401 gctgaccgag gtccgcaaga acggggtgct gccctacctg cgtccggatg gcaagacgca 1568461 ggtcactatc gcctacgagg acaacgttcc ggtgcggctg gataccgtgg tcatctccac 1568521 ccagcacgcg gccgatatcg acctggagaa gacgcttgat cccgacatcc gggaaaaggt 1568581 gctcaacacc gtgctcgacg acctggccca cgaaaccctg gacgcgtcga cggtgcgggt 1568641 gctggtgaac ccgaccggca agttcgtgct cggcgggccg atgggcgatg ccgggctcac 1568701 cggccgcaag atcatcgtcg acacctacgg cggctgggcc cgccacggcg gcggcgcctt 1568761 ctccggcaag gatccgtcca aggtggaccg gtcggcggcg tacgcgatgc gctgggtggc 1568821 caagaatgtc gtcgccgccg ggttggctga acgggtcgag gtgcaggtgg cctacgccat 1568881 cggtaaagcg gcacccgtcg gcctgttcgt cgagacgttc ggtaccgaga cggaagaccc 1568941 ggtcaagatc gagaaggcca tcggcgaggt attcgacctg cgccccggtg ccatcatccg 1569001 cgacctgaac ctgttgcgcc cgatctatgc gccgaccgcc gcctacgggc acttcggccg 1569061 caccgacgtc gaattaccgt gggagcagct cgacaaggtc gacgacctca agcgcgccat 1569121 ctagcgtcga gggcgcgagc agacgcagaa tcgcacgcgg aaaggcttcc gcgtgcgatt 1569181 ctgcgtctgc tcggcgctag ctgctgatgc ggtagtcgcc gaggtcgaac cgccggctgc 1569241 gccagtaggc ttcgaccgtg gtggtcgggc gcaacgggac gtcaccgttc ttgtcgaagt 1569301 aatagctgtt ggccagccga caactgtcct gccagaagac ctggcggtgc cggcggcgca 1569361 tcacctccgc gaaatagcga gcgttggctt cttcggtcac ctcgatgcgg gtggcgccgg 1569421 tgcggcgggc tcgcttcagg caccggatga tgtggtgtgc ctgcgtctcg atgagcgcga 1569481 agtacgacga cccgacgtag ccgtacggtc cgaacacggt gaagaagttc gggtagccgg 1569541 gaacgctgac gccctcatag gcctgcagcc gatgctcgtc ccagaaccgg ctcaaggacg 1569601 caccgccagt tccggtgacg gcataggtcg ggatgctgtc ggtgtctagc accttgaagc 1569661 cggtcgccag caccagcaca tcgatctcgt ggctggcgcc gtcggtggtg gccaccgcag 1569721 tgggtgtgat cttgtcgatc ggctcggtga ccagccgcac gttgtcccgg ttgaacgtcg 1569781 acagataggt gttgtggaag ccgggccgct tgcaccccac cgcgtatcgg ggggtgagtt 1569841 gctcgcgcac caccggatcg tggacctgtt ggcgcaggta gcgccgtccc gctgactcca 1569901 tgtgcttggc caacggaaac accgcgaagt agtgcgccgc gatggggaac gttgcttcca 1569961 cgaaggcctg gctgagcagc cggtggacgg ctttgccgcc gggaatccgc atcgcccagc 1570021 ggacggctgt gggcagtgga acgtcgaatt tggggaaaca ccaaataggg gtgcgctgaa 1570081 aaacggtgag gtgggagaca attggcgcca tctcgggaat gacctgcacc gccgaggccc 1570141 cggtgccgat gatcccgacg cgcttgccgg tcaggtcctg ggtgtgatcc cagcgtgcgg 1570201 tgtgcatggt gacgccttca aacgagtcca ccccgtcgat gtcgggtagt ttgggcaccg 1570261 tcagaatgcc gcatgcgctg atcaggaacc tggctgtgat ttcgccgccc gggtccgttt 1570321 gcacccgcca caggctgtgc tcgtcatcga actcggcggc aagcaccttg gtgttcaacc 1570381 ggatccgcga ccggatgccg tatttgtcga cgcagtgttc ggcgtaggcc ttcagctcgt 1570441 gtccgggtgc ataggtgcgc gaccagtgcc ggctctgctc gaaagagaac tgataggaga 1570501 aggacggaat atccacggcg ataccgggat aggtgttcca gtgccaggtc ccgccgacac 1570561 cgtcgccggc ttcgaccacg aggtagtcgc tgaatcccgc ccggtcgagc ttgattgcgg 1570621 cgccgatccc ggagaacccg gcgccgacga tcagtgcgtg gtagtcgggc atcatcgcct 1570681 cctcccgatg acgtgtactc cgtgcttggg tcgcagggtc agcgtcgcct cgagttcgac 1570741 gtgatagcca ggggcgaggt caaaggtgaa gtgttgactc atgattgccg ccatcaaaac 1570801 catctccatc agggcgaagc tctgtccgat gcagatgcgt cggccgccac cgaacggcag 1570861 gtatgcgcag cgaggacggt ccgtggggca ccgcaaaaac cggccaggat cgaatctatc 1570921 cgggtcgggc caccagcgcg ggtcgtggtg aatgtggtga atcgggatga cgacggtggt 1570981 gccgcggcga attcggtgtc cgtcgatgat gtcatcatcg acggcctcgc gcgcgattat 1571041 ccacaccgac gagaagtagc gttgcgattc ctgcaggcac gcggtggtcc aggccagctt 1571101 gcccaggtcg tcggcggtcg ggcggcgcat gcccagcacg tcgtccagct cggtgagcat 1571161 gtggtcgcgg gcctgcgggt tcagcgccat cagataccag aaccaggaca tggcgttggc 1571221 ggtggtttcg tggccggcga gcatgaacgt cagagcttca tcgcgtactc gctggcgggg 1571281 ccagattccg ccgtcggcgc tcagcaacac gttgagcagg tccgcggagt tagtcggctc 1571341 ggccagtcgc cgatcgatca ccgagttgat ggcgcgatcc agggtcagcg tgatctcttg 1571401 catttcccgc aacggcggcg gcagatgaac acccgagtag atacaccaga tcagcgtgtc 1571461 gtaaaccgtc cgcggcatca gcccccacag ccccagccgc tccagctttt ccgcccgccg 1571521 caggccgcga gtcgcaagat cgtgcatgga ctgcaccaac ggcccgaagt cctggctgaa 1571581 cagggcgttg gcgactaccc gcaatgtcgt ctcgaccatg ctttggtgca tgtcgaactg 1571641 cgcgccgggc accagcgcgg cggtgacgtc ggcgattggg tcgatcatca gaccgacgag 1571701 tccgcgcagg tggcgccggg cgaaggtcga gtttaacgcg ccgcgatgtc ttgcccatga 1571761 gtcgccctcg tcggtgagca agttaagacc ggcggtggcc cggatcggtc cgtattcgtc 1571821 ggatttgaca tatttcaggc gggcctcgtg cagcacatgg tcgacgtagt cggggtgact 1571881 gatcgagaca aaacgtctgc cagcacaacg aaatcgggtg atgtcgctgc cgcgtagccg 1571941 gcccaggaag ccgtcgccgg cgtcgaatcc gatggtgatg gcttcccggg tcatcgtcca 1572001 ggtgctcatc cgcttggccg gtcccttcag gggccgctgg gtggtggcgg tggccatgac 1572061 ttcactgtat ggatgacgct gactggcccg aaatgagact atgggacaaa gtgttgtgag 1572121 tttaggacag cctcgtggga catctaccgc ctccggccga ggtgaggcat ccggtgtatg 1572181 cgacccgggt gctgtgtgag gtggccaacg agcgcggggt gccgaccgct gatgtgctgg 1572241 cgggcacggc gatcgagccg gccgacctcg acgatccgga cgcggtggtc ggtgcgcttg 1572301 acgagatcac cgcggtgcgc cggttgctgg cccgattgcc cgacgacgcc ggtatcggga 1572361 tcgacgtagg cagccggttc gcgctcaccc acttcgggtt gttcgggttt gccgtgatgt 1572421 catgtggcac ccttcgcgaa ctgcttacca tcgcgatgcg ctatttcgcg ttgaccacca 1572481 tgcacgtcga catcacgttg tttgaaaccg ccgacgattg cctggtcgaa ctggatgcca 1572541 gccacttgcc ggccgatgtc cgtggattct tcatcgagcg cgatattgcc ggaatcatcg 1572601 cgacgacaac gagtttcgcg cttccgttag ccgcgaagta tgcggatcaa gtatcggccg 1572661 aactggcggt tgacgcggaa ttgttgcgcc cgttgctcga gcttgtgccg gtgcacgacg 1572721 tcgcattcgg gcgcgcgcac aaccgggtgc acttcccgcg tgccatgttc gacgagccgt 1572781 tgccgcaggc cgaccgccat acgttggaaa tgtgtattgc acaatgcgac gtgctgatgc 1572841 aacgcaacga gcgacgccgt ggcatcacgg ccttggtgcg cagcaagctg tttcgcgatt 1572901 ccgggctttt cccaacgttt accgacgttg ctggcgaact tgacatgcat ccgcggacgc 1572961 tgcggcgtcg acttgccgag gaaggcactt cgtttcgggc cttgctgggc gaggcgcgct 1573021 ccaccgtggc cgtcgacctg ctacgcaacg tcgggctgac ggtgcagcag gtgtccaccc 1573081 ggctgggcta caccgaagtc tcgacgttct cgcatgcgtt caaacgctgg tatggcgttg 1573141 cgcccagcga atattcgcgc cgcgggtaga ccagcccttt tcagggtttc gcggcccgcg 1573201 tcggtttggt cgggttaggc ggggccgggc tggccgggcg gaccgggttg gccgggctgg 1573261 ccgaacaggg ttcccccggt cccgccgacg ccgccgcccc cgccgttgcc gggggtgcca 1573321 tcgttgccgg ccccaccgtt tccgccggcg ccgccgcccc cgccgttgcc gattaggacg 1573381 gcggccccac cgtttccgcc ggtcccgccg ttgccgccgg taccgtcctc gccggcggtg 1573441 ccgccctttc cgccggtccc gccggtcccg gcgtcgccga tcaggccggc ggcaccgcct 1573501 cgcccgccgg tcccgccggc gccgcccttg ccgaacacgc cgaagccgtc gccgcccttg 1573561 ccgccggtgc cgccggcacc gccgttgccg atgagcccgc cggtaccgcc ggcaccgccc 1573621 gcgcccccga agttattctc cccgactttg cctccaaccc cagtttgccc gccgtgcccg 1573681 ccggccccgc cgctgcccga caggccgcgg ccgtcgccgc cggtgccgcc ggccccgccg 1573741 ttgcctcctg agctgacgcc ggttccgccc tgcccgccgt gtccgccggc gccgccgtcg 1573801 ccgtgcagcc agccaccacc gccaccggcg ccgccgatac cccccgtgcc gccgtcgccg 1573861 gacttgccgc caccgacccc cccttgcccg ccggtgccgc cggaccctcc gctgccccac 1573921 aatccggcgg cgccgccggc accgccggca ccgccgacac caccggggtc gccggccacc 1573981 ccgaccccgc cattgccgcc ggccccgccg ttgccccaca gccatccgcc ggccccgccg 1574041 gcaccaccgg acgcgccggt gccacccagg ccgccggccc cgccgtggcc gatcagcccg 1574101 gccgccccgc ccgcaccgcc ggcctgaccg gtggcgccgg acccgccgtt gccgccgttg 1574161 ccgtacagga tcccgccggc cccgccggcc tgcccggtcc ccggcgcccc gtcggcgccg 1574221 tggccgatca gcgggcgccc cagcagcgcc atggtcggcg cgttgatcgc acccagcagc 1574281 tgctgctcga cgttggcggc ctcggcgctg gcatacgcgc ctgccgtgct cgtgagggtc 1574341 tgcacgatct gctggtgaaa cgccgccgcc tgagtgctca gcgcctgata ggtctgcgcg 1574401 tggccgctaa acagcgccgc cacggcggcg gacacctcat cggcaccggc ggccagcaca 1574461 cgcgtcgtgg cggccgcggc cgcagcattg gccgtgctga tcgccgagcc gatgcttgcc 1574521 agatccgtcg ccgccgcgcc cagcatttcc ggctgtgcaa acaaaaacga catgaccgtc 1574581 cccctgaatc ctgtgggtat gagcagactt gtcgtgatcg tgcagcataa gcgcaggtga 1574641 tataggccat cattggtaat gttatagaaa cgttataggt gattttgacc ttgtcaaatt 1574701 gttcgacaag gagtgcggtc ttattgcaac tttgtttatt aatgtcgcgc ggcccgcggc 1574761 ctgggacctc cgtcggacag cggcgacacg atgcaactat gggggccgca gcgaggtgtc 1574821 gtcggtgtca tgcccgcggt cggtgccccg gcaccgcaaa tggtggtttc agctgctcga 1574881 acatggggaa atgccacacg ttgagggttg ccaattgcag gccctggacg tcggcggtag 1574941 cagctatcag atagtcgcca agtccaatcc ggttgtggct gcgacgatat cggcgcatca 1575001 tgtcgccggc gcggcgtgcg attacctcgg ttgctggctg tacccgaaac gatgcaagca 1575061 ggcgccacac ctcgcgccgt tcggcggtcc gcattccgcc gatgagttcg gcggtggaca 1575121 ccacgctgat cgccagcggt ccgtccttgc gggcgctgac aagccaatcg cgagcagcaa 1575181 cgacaccccg caaatgcgcg atcagcacat cggagtcgac aaggatcatg aggtggcgcg 1575241 ccacacctgg gcaaggtgct gttcacgacc accggagcga cgcaccggcg gatccaggtg 1575301 gcgaagcgtg ccgaacgaat cgtttatagc ctgcaggtcc gatgcaaggt cgtccccagc 1575361 ggtggtgagg gctcggttca gcaggaggcg gatcagctcg gcgcgcgaaa caccttcttg 1575421 cgcggccaac ttgtcgaggc ttgccgtctg ctcctcgtcg aggtagatgt tggtccgctt 1575481 catacaccat atcatacatc acaatgtgcg gcccgggcgg caccgcggcg ggcggcgatt 1575541 cagccgaccg ggcatgccgc cgacgttatg cgtgcaacgc cctcttcagc gccgccaagc 1575601 cgcggccggt ggcttcggcc gcggcgggca ccaccagggc gaagttgacg tagccgtgca 1575661 ccatggtggg ctcgttgctt agctctacgg aaacccctgc ggccgtgagc aattcggcgt 1575721 agcaagcacc gtcgtcgcgc agcggatcat gctcggcggt gccgatgaag gcgggaggca 1575781 ggccggacag gtcagcgttt cccggggcca gtgtcgtggg cagcatcgtg tgatcactga 1575841 tgtccagccc cggcacatac caggccagga acgcgtcgat gacgtcacgg tccaggattg 1575901 gcgcatcggc attttcggtg aaagacggca gcgacaggtc ggccatggtc gtcgggtacc 1575961 acagcagctg gaacaccagc ggcggtccgc cgacatcccg ggccaactgc gccatgaccg 1576021 ccgagatgtt gccgcccgca gagtcaccgg ccacggcgat ccggctcggg tcaccgccca 1576081 gttcggcggc gttttcgccg acccagcgca atgccgccca gctgtcgtcg atcccggccg 1576141 ggtagggatg ttccggggca agccggtagt cgacggacac cacgatggcc tgcgcgccga 1576201 cggcgtgggc gcgggcgacg gggtcgtggg tgtccagacc gccgagcgac cagccgccac 1576261 cgtggtagta gacaaccacg ggcaggttgt cgcgaacgac cggcggccag tagacgcgga 1576321 ccggaatgtc ggtgagcccg tcgtagccaa cggtccgttc ctcgatccgt agctccggca 1576381 gcaactccgg gggtgtcttc agctggcgga gccgcgcgcg ggcgacttcg acaccgtcgg 1576441 ccgcggtgaa ggtcaccgga aaggtatcga gcagcatctt cagcacggga tcgatatcag 1576501 gccgggcgac ggtcggctct gtcatgggcc taccgtacga ccgccaggcc tatccgtgta 1576561 gcacaacccg tagcgccacc agcccacggt tggtggcctc ggtggcggcg ggcaccacac 1576621 cggcatagcc aacgtagccg tgcaccagcg tctgggcgtt gtgcacctcg acgggaacac 1576681 cggcggcggc cagcagctcg ccgtaccgaa tcccgtcgtc gcgcaaaggg tcgtagccgg 1576741 cgacagcgat gtaggccggc ggcaggtcgg ccaggttctc cgctcggccg ggcgccattg 1576801 gcgctggcgg gttgtgcaag tcgatttcgc ctgcgtacca acgggagaac gcggcaattg 1576861 ccttgacgtc gaggatcggt gcgtcggcat tctcggccaa cgacggcagc gattggtccc 1576921 acagagtgga gggataccac aacagctgaa acacaatggg cgggccgccc atatcgcggg 1576981 ctcgctgcgc gatcaccgcg gcgatggtgc cgccggcgga atctccggcg acggcgatgc 1577041 ggccgaggtc agcaccgacc tggcggccat gctcggcgac ccaccgcgtt gcggcccaag 1577101 catcttcgat ggcagcgggg taggggtgct caggcgccag ccggtagtcg acggacacga 1577161 caatcgcgtc agcgccgacg gcgtgctggc ggcaggtgcc atcgtgcgtg tcgaggtcgc 1577221 ccatgacgaa tccgccgcca tggaaataca gcacaacggg cgcctcggct tgatcgggac 1577281 acgttggcgg ccaatagatc cgggtcccga tcggccccgc cggtccatcg atcgcaaggt 1577341 caacgacccg cagctcgggg tgcaccggct ggcgcggtag atcgcgcaac cgctggcgca 1577401 cggcctcgat cccatcgtcg atcgatagcc gaaacggaac cgcatccagt accttcagca 1577461 ggatggggtc gatcgcgggt ttctcgtcgg cggtgttgtc caaactgggc ataccggtac 1577521 cgtacgcacc tcgcttgctg gccggcggct gggtggtcgc cggctgggcg ggcctcgcct 1577581 acggcgtgta cttgaccgtg atcgcattgc gcttgccacc gggcagcgag ttgaccgggc 1577641 acgcgatgtt gcagcccgcg ttcaaggcat cgatggcggt gctgctggcc gcggccgcgg 1577701 ttgcccatcc catcggccgc gagcggcggt ggttggtacc ggcgctgctg ttgtcggcca 1577761 ccggcgactg gttgttggcg atcccctggt ggacgtgggc gttcgtgttc ggcttggggg 1577821 cattcctgtt ggcgcacttg tgcttcattg gtgccctgct gccactggcg cggcaggcgg 1577881 ctccatcgcg tggccgggtc gctgccgtgg tggcgatgtg cgttgcgtcc gcggggctgc 1577941 tggtgtggtt ctggccgcac ctggggaagg acaacctgac catcccggtc acggtataca 1578001 tcgtcgcgct gtcggcgatg gtgtgcaccg cgttgctggc acggctgccg acgatttgga 1578061 ccgcggtcgg ggcggtgtgt ttcgccgcgt cggactcgat gatcggcatt ggccggttca 1578121 tcctcggcaa cgaggcgttg gcggtgccga tctggtggtc ctacgccgca gccgagatct 1578181 tgattacggc cgggttcttc ttcggccgcg aggttcctga taacgccgca gcacctacgg 1578241 atagctagcg gaccggttgt ctagcagcgg atctcgcggt caagcccgca cgcccgtcga 1578301 agtagagccg atcgcgcggg tgctgccgat gttgtcggtg ccgcacctgg accgcgactt 1578361 cgactacttg gtgcccgccg aacactccga cgatgcccag ccgggggtgc gggtacgggt 1578421 gcggtttcac ggtcggctgg tcgacgggtt tgtcctagag cgccgcagcg acagcgatca 1578481 ccacggcaag ctgggctggc tggatcgtgt ggtgtcgccc gaaccggtgc tcaccacgga 1578541 gatccgccgg ttggtcgatg cggtggcggc gcgctacgcc gggacccgcc aggacgtatt 1578601 gcggctcgca gtgcccgccc ggcacgcacg ggtggagcgg gaaatcacca cggccccggg 1578661 tcggccggtg gtagcgccgg tcgacccgtc gggttgggcg gcctacggtc gcggtcggca 1578721 attcctggcc gcgctggccg actcgcgcgc tgcgcgggcc gtttggcagg cgctaccggg 1578781 cgagctgtgg gcggaccgat tcgccgaggc tgccgcgcag accgtacgtg ccgggcgcac 1578841 ggtactggcg atcgtgcccg atcagcggga tctggacacc ctgtggcagg ccgcgacggc 1578901 cctcgtcgat gagcacagtg tggtagcact gtcggccggc ctgggcccgg aggcacgcta 1578961 tcggcgctgg ctggccgcgt tgcggggcag cgcgcggctg gtgattggca cccgcagcgc 1579021 ggtgttcgcg ccgttgagcg agctgggcct ggtcatggtc tgggccgacg ccgacgactc 1579081 cctggctgag ccgcgggcac cctatccgca cgcccgtgag gtggcgatgc tgcgggcgca 1579141 tcaggcgcgg tgcgcagcgc tgatcggcgg ctacgcccgc acggccgagg cccacgcgct 1579201 ggtgcgtagc ggctgggcgc acgacgtggt tgcaccccgg ccggaggtgc gtgcacgctc 1579261 tcctcgcgtg gttgccctcg acgacagcgg atacgacgac gcgcgagacc cggccgcccg 1579321 caccgcacgg ctaccgtcca tcgcgctgcg cgccgcgcgc tcagcgctgc agtccggggc 1579381 gccggtgctg gtgcaggtgc cgcggcgcgg gtacatcccc tcgctggcct gcgggcgctg 1579441 ccgggcgatc gctcgttgcc ggtcgtgcac gggtccgcta tcgctgcaag gcgccggctc 1579501 gcccggtgcg gtatgtcgct ggtgtggacg ggtggacccg acactgcgat gcgtgcgctg 1579561 tgggtcggac gtggtgcgtg ccgtggtggt gggggcccgg cgcactgccg aagagctcgg 1579621 ccgggcattc ccgggtacgg cggtgattac gtcggccggc gacaccctgg tgccccagct 1579681 cgacgccggc ccagccctgg tggtcgccac tccaggagcc gaaccccggg cgcccggcgg 1579741 gtatggggcg gcgctgctgc tggatagctg ggcgctgctg ggccgtcaag acttgcgcgc 1579801 ggccgaggac gcgctgtggc gctggatgac ggcggccgcc ctggttcggc cgcgcggggc 1579861 gggcggtgtg gtgaccgtgg tcgccgaatc gtccattccg acagtgcaat cgctgatccg 1579921 gtgggatccg gtcggtcacg cggaggccga actggcagcc cgaaccgaag tcggcctgcc 1579981 gccaagtgtg cacatcgctg ctcttgacgg ccctgccggc accgtgacgg cattgctgga 1580041 ggcggctcgg ctgcccgacc cggatcgcct ccaagccgat ctgctgggcc cggtggacct 1580101 gccacccggc gtccgtcgcc cggcgggcat ccccgccgat gcgccggtca tcaggatgtt 1580161 gctgcgggtg tgccgcgagc agggcctgga gttggcggcg agtctgcggc gcggcatcgg 1580221 tgtgctcagt gcgcggcaaa cccggcaaac ccgtagcctg gttcgggtac agattgaccc 1580281 gctgcatatc gggtaaacgg agtaaccgct agctcaacac ttccgggcgg tgaagataag 1580341 gtattcccac tgcatcacgc cgtcgcagag gtattcgcga caaagttcgg tgatttcggc 1580401 gtcgagtgtg gcgacgcact cggggctgtc ggcgatggag cggtaggcgt tgatcgccgg 1580461 gccgtagaaa ttcttgaaat agtcgcgaca ttcgtccggg caaccgaacc ggtccactgt 1580521 cagcgatcct cgccgggtac ggatgtcgga cacatggtcg cgaaacaggc cactcacgta 1580581 atcctcgctt ccccaccaca cctcgtgcgg cgctcccgcc ggcagcgtcg gccggtacgg 1580641 tctgatggtg gacagcaatt tgccgtagaa accctcgggg gtccagttca gggtgctgat 1580701 cttgccgccg cgccggcaga cccgggccag ttcgtcggcg gtgcgctgat gacgcggggc 1580761 gaacatcacc ccgatggtcg agagcaccgc atcgaattcg ccggcgctaa acgggagggc 1580821 ttctgcgttg gcttcccgcc agccgagctc cagtccggct gccgcagcac gcgcctgggc 1580881 gcggcgcagc agctcgggcg tcaggtcgct ggcagtgacg tgggcacctg ccatggctgc 1580941 cgggatcgat acgttgcccg agcccgcggc cacgtcaagc acgcgatcgc cgcggcgaat 1581001 accgctggtg gagactagga ttgggccaag cggggccaac agctcctcgg cgatggcggc 1581061 gtagtcgccc aatgcccaca tttgccgatg cgtggtcgcc ggcgcctggc gctcgctggt 1581121 gggtgtgtag acagtcatcg gaactcctgc gagacgtcgg gtgaggctgg taccgaattg 1581181 tgtcagcaga caacagtata cgttctaaat aatcaatgtc gacgatggtc agatgctaga 1581241 ctttcctgac ttacccgcac ggtgtacgac gaagttgacg ccggggacgg ccccgggaaa 1581301 ggggtaatga tgccaacgga atatccggcg acagccgagg aatccgtgga cgtgatcacc 1581361 gatgcattgc tgacggcgtc ccggttgctg gtagccatct cggcccattc aatcgctcag 1581421 gtcgatgaaa acatcaccat cccgcagttc cggaccctgg tgattttgtc taatcacggt 1581481 ccgattaacc tggctacgct ggcgacgttg ctgggtgtgc aaccgtcggc caccggccgc 1581541 atggtcgacc ggttggtcgg cgccgaactg atcgaccggt taccgcaccc cacctctcga 1581601 cgggagctgc tggcggcgct gaccaagcgt ggacgagatg tcgtccgtca ggtcaccgag 1581661 caccggcgca ccgagatcgc ccgcatcgtg gaacagatgg caccggcgga acgccatggg 1581721 ctggtgcgtg ccctgacggc gttcaccgag gcgggcggtg agcccgacgc acgctacgaa 1581781 atcgagtagc tagcggccga gcccgtgtcg ggccgtccgt tacgtgctgg gacgacccga 1581841 cacaggccgg attgcccgcc tcagcgcttt tcggcggtga gcagcaggta ctcccattcc 1581901 atgacaccgt ccgacaggta ttgcgctgcg agttcgacaa gctggcggtc gagctcggcg 1581961 gccagcaccg cgttgtcacc gatgtgcgcg taggcctcga tcgtcgggcc atagttgttc 1582021 ttgaagtagt cgtggacggc ctgggcggtg tcgaaccgct tcacttccaa caagccacgg 1582081 gccgtcttga ggccagtgac tccatcgccc agcagaccag tgacataggc ctcacgtccc 1582141 cacaacgccg acggcggcag atccgccgac acgctgggcc ggtatggcct aatggttgcc 1582201 agcatccggc cgaagaatcc ctcgcacgtc cagctgatca caccgatcgt cccgccaggc 1582261 cggcagacgc ggaccagctc gtcggccgcg gcctgatgat ccggtgcgaa catcacgccg 1582321 atcgctgaga tcaccgtgtc gaactcgtcg tcggcaaacg gcagggcttg cgcgttggct 1582381 tcctggtatt gcagggtcag cccctgttgg gcggccctgg cctgggaccg ctgcagcagc 1582441 tcgggcgtca ggtcggtgga aatgaccgtg gcacccgtct tggctgcggg cagcgaaata 1582501 ttgccagagc cagcggcgac gtcgagcacc cgaacacccg gcccgatgcc cgcggcggca 1582561 accaggatcg ggccgagtgg cgccatcacc tcttctgcca tcagggcgta gtcacccagg 1582621 gcccacatcg cccggtgtgt ggccgcaagc gtttggtcct cgcgagcagg tgtgtcgata 1582681 gtcatcaggt ctcctgagaa gtaagtgatg tggctgcgaa cttcgacatc gttgtcgcgg 1582741 gcacggcggg agcctgggca gtagcgcgcc ttgcgtaccc accggataca gtatgcatca 1582801 gaaatagtgt attcctctaa ctatcgcgcg tgtcggaatt gtggcccacg ccacgtcggc 1582861 ggcgcttctt agactgggcg cgtgcgcctt gtctttgccg gcacccccga acccgcgctg 1582921 gcctcgctgc gcaggctcat cgaatcgccc agtcacgacg tgatcgccgt gttgacccgt 1582981 ccggatgccg cctccggccg gcggggcaag ccgcagccgt caccggtggc ccgtgaggcg 1583041 gcagagcgcg gcattccggt gctgcggcca tcgcgaccga actcggcaga gttcgtcgcc 1583101 gaactgtcgg atctggcgcc agagtgctgc gccgtggttg cctacggagc cctgctcggc 1583161 ggtcccttgc tggccgtgcc gccgcatggc tgggtcaacc tgcacttctc gctgctgccg 1583221 gcctggcgtg gcgcggcgcc ggtgcaggcc gccatcgccg cgggagacac gatcaccgga 1583281 gccacgacgt tccagattga gccaagcctg gactcgggac cgatatacgg tgtcgtcacc 1583341 gaggtgatcc agccgaccga caccgcgggc gatctactta agcgactggc ggtatcgggg 1583401 gcagcgctgc tatcgaccac gctggatggc atcgccgatc agcggctgac gccgcggccg 1583461 caaccggcag acggggtcag cgtggcgccg aaaatcaccg tagcgaatgc ccgggtgcga 1583521 tgggacttgc cggcggcggt cgtggagcgg cggatccgcg ccgtcactcc caaccccggc 1583581 gcctggacgc tcatcggtga cttacgggtc aaacttggac cggtgcacct cgacgccgct 1583641 caccggccat cgaagccctt gccgcccggt ggaatccacg tggaacgcac gagcgtgtgg 1583701 atcggcaccg gctcggaacc ggtgcggctg ggccagattc agccgcccgg caagaaactc 1583761 atgaacgcgg ccgactgggc gcggggcgca cggctggacc tggccgcacg ggcaacatga 1583821 cccctagatc gcgtgggccg cgccgccggc cgctggaccc ggcgcgtcgt gcggccttcg 1583881 agacgctgcg ggcggttagt gcgcgcgacg cctacgcgaa cctggtgttg cccgcgctgc 1583941 tggcccaacg cggtatcggc ggtcgcgacg ccgcgttcgc caccgagctg acatacggca 1584001 cctgccgagc ccgcggcctg ctcgacgcgg tcatcggtgc ggccgccgag cgttcgccgc 1584061 aggcgatcga tccggtgctg ctagacctgt tgcggctcgg cacctaccaa ttgctgcgca 1584121 cgcgggtcga cgcacacgcc gcagtgtcga ccaccgtcga gcaggccgga atcgaattcg 1584181 attcggcgcg agcaggtttc gtcaacggtg tactacgaac gatcgccggc cgagacgagc 1584241 ggtcctgggt tggcgaactc gctcctgatg cgcagaacga tccgatcggg catgccgcgt 1584301 tcgtgcatgc gcatccccga tggatcgccc aggcctttgc tgacgcgttg ggcgcggcgg 1584361 tcggggagct cgaggcagtt ttggccagcg acgacgaacg gccagcggtg cacctggcgg 1584421 cacgccccgg ggtgctgacc gccggcgaac tggcccgcgc ggtgcgcgga accgtcggtc 1584481 ggtattcgcc gtttgcggtg tatctgccgc gcggtgaccc ggggcgactg gcgccggtgc 1584541 gcgacggcca agcgctggtc caggacgagg gcagccagtt agtcgcccga gcattgaccc 1584601 tggcgccagt cgacggcgat accggacggt ggctggacct gtgtgccgga ccgggcggca 1584661 agaccgcgct gttggccggg ctgggtttgc agtgcgcagc ccgggtgacc gcggtggaac 1584721 cctcgccaca ccgcgcggac ctggtagcac agaacacccg cgggctgccg gttgagctct 1584781 tgcgtgtcga cgggcggcac accgacctcg acccgggttt cgaccgggtg ctggtggatg 1584841 cgccctgcac cgggctgggc gcgttacgcc gtcggccgga ggcccgttgg cgtcgtcagc 1584901 cggcggacgt agcggcactg gccaagctac aacgcgagtt gttgagcgcc gccatcgcgc 1584961 tgactcggcc cggcggtgtc gtgctctatg ccacatgctc gccgcacctg gccgagactg 1585021 tgggtgctgt cgccgacgcg ctacgccgac atccggttca cgcgctcgat acccgcccac 1585081 tgttcgagcc ggtgctcgcg gggctggggg aggggcccca cgttcagctg tggccgcacc 1585141 ggcacggtac cgacgccatg ttcgccgcgg cgttgcgccg cctgacgtga ggttcgccgc 1585201 agcggctcag taatgtgtcg ctcatggccg gtagcacggg gggaccgctg atagcgccgt 1585261 cgatcctagc cgctgatttc gccagactcg cggacgaagc ggccgcggtc aacggcgccg 1585321 actggttgca tgtagacgtg atggacggtc acttcgtgcc aaacctgacc atcggcctgc 1585381 cggtggtgga gagcctgctg gcggtcaccg acatcccgat ggattgccat ctaatgatcg 1585441 acaacccgga ccggtgggct ccgccgtatg ccgaggcggg cgcctacaac gtcaccttcc 1585501 acgcggaggc caccgacaac ccggtcggcg tggcccgcga tatccgggcc gcgggggcca 1585561 aagccgggat cagcgtgaag ccggggaccc cgctggagcc atacctggac atcctgcccc 1585621 atttcgacac cctgctcgtc atgtcggtag agcctggctt cggtggccag cggttcattc 1585681 ccgaggtgct gagcaaggtg cgtgcggtgc gcaagatggt cgacgcgggc gagctgacga 1585741 tcctggtcga gatcgacggc ggcatcaacg acgacacgat tgagcaggct gccgaggccg 1585801 gcgtcgactg ctttgtcgcc ggatcggcgg tgtacggcgc cgatgacccg gccgcggcgg 1585861 ttgcggcact acggcgacag gccggtgccg cctcactcca cctgagccta tgaacgtgga 1585921 gcaggtcaag agcatcgacg aggctatggg tctcgccatc gagcactcct accaggtcaa 1585981 aggcacgact tatccaaacc ccccagtggg ggccgtcatt gtggatccca acggtcggat 1586041 cgtcggcgcc ggcggcaccg agccggccgg tggcgatcat gccgaggtgg tggcgctgcg 1586101 ccgggccggc ggattggcta ccggcgccat cgtggtggtc accatggaac cctgtaacca 1586161 ctacggcaag actccgccat gcgtgaacgc tctgatcgaa gccagggtgg ggacggtggt 1586221 ctacgccgtc gccgacccga acgggatcgc tgggggtggc gcgggccggc tgtcagcagc 1586281 gggcctacag gtgcggtccg gggtgttggc tgaacaggtg gcggccggac cgctgcggga 1586341 gtggctccac aagcaacgca ccggtctgcc gcatgtcacc tggaagtacg ccaccagcat 1586401 cgacggccgc agcgccgccg ccgacggctc cagccagtgg atctccagcg aggccgcacg 1586461 cctggatctg catcgccgcc gcgccatcgc cgacgcgatc ttggtcggca ccggcaccgt 1586521 cctcgccgac gacccggccc tgaccgcgcg gctggccgac ggctcgctgg cgccgcagca 1586581 gccgctgcgc gtggtggtgg gcaagcgcga cataccgccg gaagcacggg tcctcaacga 1586641 cgaggcacgc accatgatga tccgcaccca cgaacctatg gaggtgctca gggcgttgtc 1586701 ggatcgcacc gacgtgctgc tggaaggagg tcccaccctc gccggcgcct tcctacgagc 1586761 gggtgcgatc aaccggatcc tggcctacgt cgcaccgatc ctgttgggcg gtccggttac 1586821 cgcggtcgat gacgtcgggg tgtccaacat caccaacgcg ttgcgttggc agttcgacag 1586881 cgtcgaaaag gtcggaccgg atctgttgct gagcttggtg gctcgttaga gcggctccac 1586941 ttggggcgcc agggtcggtt gctcctggac ttccggttca tcggcatgtt ccttgcggcc 1587001 gctgatcaac agacctagca ccgcgccgaa cacgcacacg atcgcggtga tggtgaatat 1587061 ctcgccgtac atcagcgcga acgcctgctg gtaccgggct ccaattgcgg ccgcgcgctc 1587121 gagcaggctg gcgttgggcg ggatggccgc cgacaacccc gccaggatct ggttgaaccg 1587181 gtacaacccc caggcgctca gcgcggccac gccgatcaac atgccggtca tccgggcgac 1587241 caccaccgcc gccgaagcga tgccgtgctg ggccgacggg acaacccgta gggtggccga 1587301 cgatagcggc ccgatcacca gccccaaccc taaaccagcc accaccaggt cggtgtgcat 1587361 cgccggcacg gtgaacaatc cgaggatgtt gtgccgatcg gccaacaggt ccaccggcca 1587421 gtgggaaata agccagtaac cgtacgccgc aataagcagt ccggcaaagg ccaccgcacg 1587481 gtcaccggcc ctggtggcga tccacccgcc cgtcactgcc ccgatcggta gggcgataag 1587541 gaaccacagc agcattccgg ccgcctgagc ctggtccatc tgcagcacgc cctggccgaa 1587601 cagctcgaca tcaaccagcg tcaccatcag cgccgcgccg gcggcgacgg aggcacccag 1587661 cgcggacagg aacggccgga agtgcacacc ggccgggtcg atcagccggg tgcgagcgaa 1587721 acgttcccaa ccgaagaacg ccaccgcggc aacgagagcg ccgaccagca acggagcccc 1587781 gtagtccggc agtacgtgtt tgccgtcggg attggggttg tacagcccga tgacggcgag 1587841 gcccaacgcg agtgccagca gcagaccacc gaccaggtcg actcgctcgg gctccgtgct 1587901 gcggtcgtgt gagggcaggc tgaagtggat cattaccatg gcgatcgcgg tcaacgggac 1587961 gttgatccag aacacgtcac gccagtcgtg caatagccaa acgatgaaga ttccgtacaa 1588021 cgggcccaga acgctgccga gctcctgcgc ggcgccgata ccgccgagca cgccggcgcg 1588081 gttgcgctgc gaccacaaat cggcgcccag cgccagcgtg atcggcaata gcgcgccgct 1588141 ggcaacaccc tggatcgtgc ggcccgcgat cagcatgtgg aaatcgccaa aatgcccggc 1588201 cagcgcggtc actaccgagc cgatgatgaa cccggccagg ctgacctgca gcatcagctt 1588261 gcgcccgaat cggtcggaag cccggcccag caacggcatg gcggcgatgt agcccaggag 1588321 gtacatcgtg acgatccagg tgatccggtg gagttggttg atcggtatac caacgctgtt 1588381 catgatgtcg cgcatgatgg tgaccacgac ataggtgtcc agggcgccca gcagtactgc 1588441 caggctgccc gcgctaatcg cgactcgacg tcctgctcgc atgctgatca gctcaccggg 1588501 ggcttcgtga cctggacctt ctcgccccat ttcgacaagg tcatctggac ggaattgccc 1588561 gagccgcggt ccaactgggc ctgtgccagt tgatgatcgc cggtctcctg aatccagacg 1588621 gtcgccggca ccggctgcgt cgcgttgaac ggcggcgcta tctggttcac cgcctgtgcc 1588681 gataccttcc cgctgatgcg gatggtgttc tggccgttga tggtatcccg cccttcggct 1588741 tttgcgtcgg cgaaattcgc cagcacgttg gccaggccgg tatccggatt cagcacctgg 1588801 gcggggtcgt agatgtcggc ggcgggaccg aaatcgctcc actggttggg cgtcagggtg 1588861 gcgtacagga tcccgtcgaa caccacgaag tcggcatcga tatcagaccc acccagcgtg 1588921 agcttgacgt ttcccgtcgc ggcggtgggg ttggtggtga gatcgccgct cagcgtcttc 1588981 agagacagtc ccgggatctt gccgttgacc gtcagcacca tgtgcgcgct cttgagagcc 1589041 ttggtctgcg cggtggcctc ctcgaccagc ggcttcgcgt ccggaagtgg tccgccgctt 1589101 ggcttcgagc ccgacgagca gccggcaacg acagtggcgg cgatgctaac ggcggcgagg 1589161 acggcgatgc gacggcagtg gcgtctgggg gtccgcatac cctgcatcgt agagggtgtc 1589221 tgtgagttgg ccggtcggcg agtggggtgc gggtccgcgg gattgctgcc taacctggtg 1589281 cgatgttcac cggaattgtt gaggaacgcg gagaagtgac cgggcgtgag gccctggtcg 1589341 atgcggcgcg gctgaccatc cgcggtccga tggttaccgc cgacgccggc cacggcgact 1589401 cgatcgctgt caacggcgtg tgtctgacgg tcgtcgatgt attgcccgac ggccaattca 1589461 ccgccgacgt gatggccgag acactgaacc ggtccaacct gggtgagcta cggcccggca 1589521 gccgggtgaa cctggaacgc gccgcggcgc tgggcagccg gctcggcggg cacatcgtgc 1589581 agggacatgt ggacgccacc ggtgaaatcg tggctcgttg tccctccgag cactgggaag 1589641 tggtgcgcat cgagatgccg gcttcggtgg ctcgctatgt cgtcgaaaag ggctcgatca 1589701 ccgtcgacgg gatttctctg acggtctccg ggctcggcgc cgaacagcgg gactggtttg 1589761 aggtctcgct gatcccgacg acccgggagc tgaccacgct ggggtccgct gcggtgggaa 1589821 cccgggtgaa cctcgaagtc gacgtagtcg caaagtatgt tgagcggtta atgcggagcg 1589881 ccggctgaca tcgctcgccg agggagggag ccccatgtct tgcattccgg acgagatcga 1589941 tacgcccgac gtgctgatcg accgcgacat ccttgaccgc aacatcgggc gaatgagttc 1590001 cgccgtcgcc gcgaaaggga tcgcccggcg tccccacgtg aagacgcaca agctgcctga 1590061 gatcgcccat atgcaactcc gcgcgggcgc gcggcctgac ggtggccacc atcggggaag 1590121 tcgaggtatt cgtcgaccac ggcgccgacg acgtattcat cacctaccca ttgtggatcg 1590181 gcacacgcca agccgaccgg ctccgtcagc tggctgaccg cgctcgcatc gctgtcggtg 1590241 cgggcaccgc cgagggcgct tcgaacaccg gcgcacggct cgcagacgcc gctggcgcga 1590301 tcgatgttct catcgaaatc gacagtggcc atcaccgcag cggcgtccgt gccgaacaag 1590361 tgttggaggt cgcccacgcc gtcggtgagg ctgggcttca cctggtgggg gtgttcacct 1590421 tccccggtca cagttatgcg ccaggtaaac ccggcgaagc cggcgagcaa gagcggcgcg 1590481 ctctcaacga cgcggcgaac gcgctggtcg cggtgggctt cccgatcagc tgccgcagcg 1590541 gtgggtccac tcccaccgca ttgctcaccg ccgcggacgg ggcctccgag acgtcccggc 1590601 gtctatgtgc tcggtgacgc ccagcaactg gaactcgggc gctgcgcgcc ggcggacatc 1590661 gcgctgaccg ttgccgccac cgtagtgagc cgccaggact gcaggtccgg cttgcgccga 1590721 attgtccttg actgcggtag caagattctc ggcagcgatc gtccggcctg ggcgactggg 1590781 ttcggccgtc tgatcgacca cgccgatgcg cgcatcgcgg cgctgtcgga gcatcacgcc 1590841 accgttgtct ggcccgacga cgccccgctc ccgccggtgg gaacacgtct gcgggtgatt 1590901 cccaaccacg tgtgcctgac caccaacctc gtagatgatg tcgccgtggt gcgcgacgca 1590961 accctgattg atcgctggaa agtcgccgcc cgcggtaaga accattgatc ctgtcgcact 1591021 tggtcacggc aataccgcct ggctcaatgg ttcatactga atggaacacg tgggcttcgc 1591081 gtgcggccag gcctgacagc taggtagcaa agatgacgag gttggactcc gtcgagcggg 1591141 cggttgccga cattgcggcg ggtaaggccg tcatcgtcat cgacgacgaa gaccgggaga 1591201 acgagggtga cctgatcttc gccgccgaga aggcaacgcc ggagatggtg gccttcatgg 1591261 tccgctacac ctccggatac ctgtgcgttc cgctggacgg tgccatctgc gaccggctgg 1591321 gcctgttgcc catgtacgcg gtgaaccagg acaagcacgg gacggcatac accgtcacag 1591381 tcgatgcacg gaatggcatt ggaactggca tttcggcgtc cgatcgggct accaccatgc 1591441 ggttgctggc cgatccgacc agtgtggccg acgatttcac ccgccccggt cacgtggtcc 1591501 ccttgcgggc caaggatggt ggggttctgc gccggcccgg ccacaccgag gccgccgtgg 1591561 acctggcccg gatggccggg ctgcaacccg cgggggcgat ttgcgagatc gtcagccaaa 1591621 aagatgaggg ctcgatggcg cacaccgatg aattgcgggt gttcgccgat gagcacggtc 1591681 tggcgctgat caccattgct gacttgatcg aatggcggcg caagcacgag aagcacattg 1591741 agcgggtcgc cgaggcgcgg attccgactc gtcatgggga gtttcgcgcc atcggctaca 1591801 ccagcatcta cgaggacgtg gaacatgtcg cgctggtccg cggcgagatc gccgggccca 1591861 acgccgacgg tgacgacgtg ctggtccggg tgcattcgga gtgcttgacc ggcgatgtgt 1591921 ttgggtcacg ccgctgcgat tgcgggcctc agctggacgc cgcgctggcg atggtcgccc 1591981 gtgaggggcg cggcgtggtg ctgtacatgc gtggccacga gggccgcggc atcggcctga 1592041 tgcacaaact gcaggcctac caactgcagg acgccggtgc cgacaccgtt gacgccaatc 1592101 tcaagcttgg actacctgcc gacgcaaggg attacgggat cggcgcacag atcctggtcg 1592161 atcttggggt acgttcgatg aggctgctga ccaacaaccc ggccaagcgg gtgggactgg 1592221 atggatacgg attgcacatc atcgagcgcg tgccgctgcc ggtgcgggcc aacgcggaga 1592281 acatccgtta cctgatgacc aagcgtgaca aattggggca cgacttggct gggttggacg 1592341 attttcacga atccgtgcat ctgcccggag aattcggcgg tgccttgtga agggtggcgc 1592401 cggggtgccg gatctgccgt cgctggatgc gtctggtgtg cggctggcga ttgtcgccag 1592461 cagctggcac ggaaagatct gcgacgcgct gttggacggc gcccgcaagg tggccgccgg 1592521 gtgtggcctc gatgacccga ctgtggttcg ggtgctcggc gcgatcgaga ttccggtggt 1592581 ggcgcaggaa ttggcccgca atcatgatgc cgtcgtcgca cttggcgtcg tgatccgcgg 1592641 tcagacacca catttcgact acgtgtgcga tgcggtaacc cagggactga cccgggtatc 1592701 gctggattcc tcgacgccga tcgccaacgg cgtgctgacc accaacaccg aggagcaggc 1592761 gctggatcgg gcggggctac cgacgtcggc cgaggacaag ggcgcccagg cgactgtggc 1592821 agccctggcc accgcgttga ccctgcgcga gctgcgcgct cactcgtgac cgccgcaccg 1592881 aacgactggg acgtcgtgtt gcgtcctcac tggacgccgt tatttgccta cgctgcagcg 1592941 tttctgatcg cggtagcgca cgtcgcgggg ggcctgctgc tcaaggtcgg gtccagtggc 1593001 gtggtcttcc agaccgctga tcaggtggca atgggtgccc tggggctggt cctcgccggg 1593061 gcggtgctac tgttcgcgcg gccgcggctg cgggtgggtt ctgccgggct ttcggtgcgg 1593121 aatctgttgg gtgacaggat cgttgggtgg tctgaagtga tcggtgtgtc gtttcccggc 1593181 ggtagccggt gggcgcggat cgacctggcc gacgacgagt acatcccggt gatggcgatc 1593241 caagcagtgg ataaggaccg cgccgtggcc gccatggaca cggtgcgctc gttgctggct 1593301 cgataccggc ctgacctgtg cgcccgctga agcgacttcc cgtacgatcg cgaaatggca 1593361 tgtcttgggc gccctggctg taggggttgg gcgggggcga gcttggtcct tgtggtggtg 1593421 ttggccctgg ctgcttgcac cgagtcggta gcgggccgcg cgatgcgtgc taccgaccgg 1593481 tcgtccgggc tgcccacatc cgccaagccg gcgagggcgc gcgacctgct gctgcaggac 1593541 ggggatcgcg ctccgttcgg ccaggtaacc cagtctcgcg tcggcgacag ctacttcacc 1593601 agcgccgttc cacccgagtg ctcggcggcg ctgctgttca aaggttcccc gctgcggcct 1593661 gacggctcgt cggaccacgc cgaggcggct tataacgtca ccggtccgct gccgtacgca 1593721 gagtcggtcg atgtctacac gaatgtcctg aacgtccacg atgtggtctg gaacgggttc 1593781 cgcgacgtgt cccactgccg tggcgatgcc gtcggagtga gccgggccgg cagatcgacg 1593841 cccatgcgac tcaggtactt cgctacgctg tcagacggtg tcctggtatg gaccatgagc 1593901 aatccgcgct ggacgtgtga ttacggattg gctgtggtcc cgcacgcggt gctggtgtta 1593961 tcggcgtgtg gcttcaagcc cggattcccc atggcggaat gggcgtcgaa acggcgggcc 1594021 caactggaca gccaggttta acgccagccc ccatgctctt cgcgggcggg tttgaaccgg 1594081 ccaaacgggg tcaaagtcac ggcggcctgg gcatactcaa atgtgtccca cggcccacca 1594141 tcggatcccg acgacggccc actgtgaact gtgccgctcg tggtgcatta cccagaccac 1594201 gatgagagaa tggcggggaa atgggtgaat tacggttggt gggcggtgtg ctccgggtcc 1594261 ttgtcgtggt cggtgcggtg ttcgatgtgg cggtgctaaa cgccggtgcg gctagtgccg 1594321 acggcccggt ccagctgaag agccgattgg gcgatgtttg cctggacgcc ccgagtggga 1594381 gctggttcag cccgctggtg atcaacccct gcaatgggac cgactttcag cgctggaatc 1594441 tcaccgatga ccggcaggtc gagagcgtgg ccttccccgg ggaatgcgtg aatatcggaa 1594501 atgctttgtg ggcgcgcctg cagccctgtg tgaactggat cagccagcac tggactgtcc 1594561 agcccgacgg cctggtcaag agtgatcttg atgcctgcct cacggttctc ggcggtccgg 1594621 atcctgggac ctgggtgtcc acccgctggt gcgaccccaa tgcacccgac caacagtggg 1594681 atagcgtgcc gtaaccggcc tgcccggcga acccccgcct ttctgggcgc cgtcgaagcg 1594741 accactagcc tagatacgtg ccagatcccg caacgtatcg ccccgcgccc gggtccatcc 1594801 cggtcgagcc gggcgtgtac cgattccggg accagcatgg gcgagtcatc tacgtcggca 1594861 aggccaagag cctgcgtagc cggctgacgt cctattttgc cgacgtggcc agcctagcgc 1594921 cgcggacccg gcagctggtg accaccgcgg ccaaggtcga atggacggtc gtggggaccg 1594981 aggttgaggc actgcagctg gaatacacct ggatcaagga gttcgatccg cgattcaacg 1595041 tccgctaccg cgacgacaag tcctaccctg tgctggcggt caccctgggc gaggaatttc 1595101 cccggttgat ggtctatcgc ggtccgcggc gcaagggtgt gcgctatttc gggccgtact 1595161 cgcacgcgtg ggcaatccgg gaaacgctgg atctgctcac ccgggtgttt ccggcgcgaa 1595221 cttgctcggc gggggtgttt aagcggcaca ggcagatcga tcgtccatgc ctgctcggct 1595281 acatcgacaa atgttccgcg ccgtgtattg gcagggtcga tgcggcccag caccgccaga 1595341 tcgtggcaga cttctgcgac tttctgtccg gcaagaccga ccggttcgcc cgcgccttgg 1595401 aacagcaaat gaacgccgcg gccgagcaac tggacttcga acgagcggcg cggcttcgcg 1595461 acgacctgtc cgcactgaag cgtgccatgg aaaagcaggc cgtggtgctc ggggacggca 1595521 ccgacgccga cgtggtggca ttcgccgacg acgaactcga ggcggcggtg caagtgttcc 1595581 acgtgcgcgg cggacgggtc cgcggccagc gtggctggat tgtcgaaaag ccaggagagc 1595641 caggagattc cggaatccag ttggtcgagc aattcctgac acagttctac ggcgaccagg 1595701 cggcgttgga cgacgccgcc gacgaatccg ccaacccggt tccccgcgag gtgctggtgc 1595761 cctgtttgcc gtccaacgcc gaggagctgg ccagctggct gtccggcctg cgcggctcaa 1595821 gggtcgtgct gcgggtgccg cgccgcgggg acaagcgggc actggccgaa acggtgcacc 1595881 gaaacgcaga agatgcactg caacaacaca agctgaagcg ggccagcgat ttcaacgcca 1595941 gatccgctgc gctgcagagc attcaggact cgttgggcct ggcagacgca cccttgcgga 1596001 tcgagtgtgt cgacgtcagc catgtgcagg gcaccgacgt ggtcgggtca ctggtggtgt 1596061 tcgaagacgg cctgccgcgc aagtcggact accgccactt cgggatccgg gaagccgcag 1596121 gccaggggcg ctccgacgac gtggcctgta ttgccgaggt gacccggcgc cgcttcctgc 1596181 ggcacctgcg cgatcagagc gatccggatc ttctttctcc ggaaaggaag tcgcgtagat 1596241 tcgcctatcc gcccaatctg tacgtcgtcg acggcggcgc gccgcaagtc aacgcggcca 1596301 gtgcggtaat cgacgaactc ggtgttaccg acgtcgcggt gatcggcctg gccaagcggc 1596361 tggaagaggt atgggtgccg tcggagccgg acccgattat catgccgcgc aacagtgagg 1596421 gactctatct gctgcagcga gtgcgagacg aggcacaccg gttcgctatc acctaccatc 1596481 gcagcaagcg gtcgacgcgg atgactgcct cagcgctgga ctcggtgccg ggattggggg 1596541 agcatcgccg caaagcgctg gtcacccatt tcggatcgat cgctcgcctc aaggaggcca 1596601 ccgtcgacga aatcaccgct gttcccggta tcggcgtggc cacggccacg gccgtccacg 1596661 acgcactgcg acctgactca tcgggggccg cgcgatgatg aaccatgcta ggggcgtcga 1596721 gaatcgttcg gaaggcggcg gtatcgacgt cgtcttggta accgggctgt ccggggccgg 1596781 gcgcggcacg gcggctaaag tgctggaaga cctgggctgg tatgtggccg acaatctgcc 1596841 gccccagctg attacccgca tggtggactt cgggctggcc gccggatcac ggatcaccca 1596901 gctggcggtg gtaatggatg tgcgatcgcg cggattcacc ggcgacctcg attcggtccg 1596961 caacgagctg gccacgcgtg ccatcacccc gcgtgtggtg ttcatggagg cgtccgatga 1597021 cacgttggtg cgccgctacg aacagaatcg ccgcagtcat ccgctgcagg gtgagcagac 1597081 tctggccgag ggcattgccg cagagcgcag gatgctagca ccggttcgcg ccaccgccga 1597141 cctgatcatc gacacgtcga cactgtcggt ggggggctta agggatagca tcgagcgtgc 1597201 cttcggcggt gatggcggcg cgaccaccag cgtcaccgtt gaatccttcg ggttcaagta 1597261 cggcctgccg atggacgccg acatggtcat ggacgtgcgg ttcctgccga acccgcactg 1597321 ggtggacgag ttgcggccac tgaccggcca acatccggcc gtgcgcgact atgtgctgca 1597381 ccggccgggc gcggctgagt tcctcgagtc ctaccatcgg ttgctatccc tggttgtcga 1597441 cggctaccgt cgagagggga agcgctatat gacaatcgcc atcggctgta ccggtggtaa 1597501 gcatcgcagc gtcgcgatcg ctgaagcact gatgggactt ctgcgctccg atcagcaact 1597561 gtcggtgcgg gcgctgcacc gggatctggg tcgcgaatga ccgatggcat cgtcgcgctg 1597621 ggcggcggac acggcttgta tgcgacgctg tctgcggccc gccggttgac accctacgtt 1597681 accgccgtgg tgaccgtcgc cgatgacggt ggctcgtcgg gccggctgcg cagcgagctc 1597741 gatgtggtgc cgccgggcga tctgcgaatg gccttggcgg cgttggcatc cgatagcccg 1597801 cacggacgcc tgtgggcaac tattctgcag cacagattcg gcggcagtgg tgtgctggcc 1597861 ggacatccga tcggcaatct gatgctagcg ggcctgtccg aggtgctggc cgatccggtc 1597921 gcggctcttg acgaactcgg gcgcatcctc ggggtgaaag gcagggtgct gccgatgtgc 1597981 ccggtcgcgc ttcagatcga ggccgatgtc tccggtctgg aggccgaccc gcgcatgttc 1598041 cgcctgatcc gtggccaggt ggcgatcgcg accacgcccg gaaaggtgcg ccgggtgcgg 1598101 ctgctgccga ctgacccgcc ggcgacccgg caggctgtcg acgccatcat ggctgccgat 1598161 ctggtggtcc tggggcccgg gtcgtggttc accagcgtga taccccatgt gctggtgccg 1598221 ggtctggccg cagcgctgcg agcaacgtcg gcccgccgtg ccctggtgct caacctggtg 1598281 gctgaaccgg gagagacggc cggtttctcg gtggagcgtc atctgcacgt gctagcccaa 1598341 cacgcgcccg ggttcaccgt tcacgacatc atcatcgacg ccgaacgagt gccgagcgaa 1598401 cgggagcggg agcaactgcg ccgcacggcg acgatgctgc aggccgaggt ccacttcgcc 1598461 gatgtcgcca gacctggtac acctttacat gacccgggca agctggcggc ggtcctcgac 1598521 ggggtgtgtg cgcgcgacgt cggcgcgtcg gagcctccgg tggcggccac acaggagata 1598581 ccgatcgacg gtggacgacc gaggggtgac gacgcgtggc gatgacgacc gatgtcaaag 1598641 acgagctgag ccgactggtg gtgaagtccg tcagcgcgcg gcgcgcggag gtcacctctc 1598701 tgctgcgatt cgccggcggg ttgcacatcg tgggcggccg cgtggtggtc gaagccgagc 1598761 tggacctggg cagtatcgca cggcggctgc gtaaggagat cttcgagctc tacggctaca 1598821 cggcggtggt gcatgtgttg tcggccagcg ggattcgcaa gagcacccgc tacgtgctgc 1598881 gggtcgccaa cgacggcgag gcgttggcac gccaaaccgg actgcttgac atgcgcggtc 1598941 gtcccgtgcg gggtctgccg gcccaggtcg tcggcggcag catcgatgac gctgaagctg 1599001 cgtggcgagg agcatttttg gcgcacgggt cgctgactga gccgggacgc tcctcggcgt 1599061 tggaggtcag ttgcccgggc ccggaggccg cgctggcgct ggtgggtgcg gcacgccggc 1599121 ttggggtcgg cgccaaggct cgtgaggtgc gcggtgccga tcgcgtggtg gtgcgcgacg 1599181 gtgaggcgat cggcgcactg ctgacccgga tgggggccca agacacccgg ctggtctggg 1599241 aggagcggcg gctgcgtcgt gaggtgcgtg cgacggccaa ccggctcgcc aatttcgacg 1599301 acgccaatct gcgccgctcg gcgcgggccg cggttgccgc ggccgcccgg gtggagcgtg 1599361 ccttggagat cctcggcgat acggtgcccg agcacttggc ctcggccggc aaattgcgtg 1599421 tcgagcaccg gcaggcgtcg ctggaggagc tgggccggct tgccgatcct ccgatgacga 1599481 aagacgctgt agccggacgt attcggcgat tgttgtcgat ggcggatcgt aaggcgaagg 1599541 tggacggcat ccccgatacg gagtccgtag tgacgcccga tctgctggaa gacgcctagc 1599601 gggctgactt acttcggtgc cacgcacacc aattggctgc ttgccggggg tattgctggc 1599661 ccttcgattt cctcgggcgg ctgcagagag actgacgcgg aatcgcagcg ccctccggca 1599721 ccgaggctct tgatctcggt gacgacgaat cggctgaact cccggtttgc agaacgtgtt 1599781 ccaggcacaa gcgcggtggc tacccgcggt gaaggcagcg attcgtcgca cgccgacggc 1599841 gcgtacagca gcacggatgg cggcttgccg ggggtcgtca ccgccggata gcagtatccg 1599901 acccgcacca ggtacttcag gcagtaatac gcccgatggt tctgggtgat caattcgtag 1599961 tcgatccgca tgcaactcgg agcgttggca tggaatccgt cattgcggat ccgggcctcc 1600021 cggctatcgc aagcaacgac ctgcggacga gacggcgcca gcttggagtc gtaggtgaac 1600081 cggttcaggc acatccccaa gtccatggag aacaccaact tcagcgtcgc acgatcgtag 1600141 ccccgcggct cgttgtaacc cgaagcggaa gcggtctggc acgcactcag cagcagcgtg 1600201 agaatcccca gcaacactgg gaaaacgagc ttctcggctg gcggtcgccg gtacgacggg 1600261 aagctatacc gcctcgccga tgtttgggcc gaagcttgca cacattgacg ataacttggt 1600321 cgcgagaccg cagaagctgg cctcgacggc gcgccgggga ctacggtcat accatgaagc 1600381 ggctttcgag cgttgatgct gcgttttggt ccgcggaaac cgcaggctgg catatgcacg 1600441 tgggcgcact ggcgatctgc gatcccagcg acgcgcccga atacagcttt cagcggctcc 1600501 gcgagttgat catcgaacgg ctgccggaga tcccgcagtt gcggtggcgg gtcaccggcg 1600561 ccccgctcgg actggaccgg ccgtggttcg tcgaggacga ggaactcgac atcgactttc 1600621 acatccgccg catcggtgtt ccggctcccg gtgggcggcg cgaactcgag gagctcgtcg 1600681 gacggctgat gtcctacaaa ctggaccgtt cccggccgct gtgggaactg tgggtcatcg 1600741 agggcgtcga gggcggccgc atcgccacgc tgaccaagat gcatcacgcc atcgtcgacg 1600801 gtgtctccgg tgccgggctg ggcgaaatcc tgttggacat cacaccagaa ccacgaccac 1600861 cgcaacagga aacggtcggc ttcgtgggat tccagattcc gggcctggaa cgccgggcga 1600921 taggtgcgct gatcaacgtg ggcatcatga cgcccttccg catcgtcagg ctgctggagc 1600981 aaaccgtgcg tcaacagatc gcggcattgg gtgtggccgg caaaccggcg cgatacttcg 1601041 aagcgcccaa gacgcggttc aatgcgccgg tgtcgccgca ccggcgggtt accggcacac 1601101 gcgtcgagct ggctagggcc aaagcggtca aggacgcgtt cggcgtcaag ctcaacgacg 1601161 tcgtcttggc gctggtggcc ggggcggccc ggcaatacct acagaagcgt gacgagctgc 1601221 ccgccaagcc gttgatcgcg cagattccgg tctccacccg cagcgaggaa acgaaggccg 1601281 acgtcgggaa ccaggtcagc tcgatgaccg cgtcgctggc aacccatatc gaggatccgg 1601341 ccaagcgcct ggcggccatc cacgagagca ccctcagcgc caaggaaatg gctaaggcgc 1601401 tctccgcgca ccagatcatg gggctgaccg agaccacgcc accgggtctg ctgcagctgg 1601461 ccgcccgggc ctatacggcc agcgggctgt cacacaacct ggccccaatc aacctcgtcg 1601521 tctccaatgt ccccggtcca cccttcccgc tatatatggc cggcgcgcgg ctggattcgc 1601581 tggtgcccct ggggccgccg gtgatggacg tggcgctgaa catcacctgc ttctcctacc 1601641 aggattatct ggatttcggc ctggtgacca cacccgaggt ggccaacgac atcgacgaga 1601701 tggccgatgc catcgaaccg gcactggccg agctggagcg tgccgcggaa tagcaatagc 1601761 tggcctatag ctgactacgt ggccggcggg ttggtcgcgt acacccaaga caggaagcgg 1601821 gccacggcct cggcggtgtg atgcgcccgc ggggagccga agacgtcgaa ggcgtgttgg 1601881 gcgtggggca ggtccgcgta ggcgacgggc gacttcgaca ccgcccgcag ttcctcgacg 1601941 aacgcatggg cttcggccac ggggatcagg gagtcgtggc ggccgtgcag aacgaagaac 1602001 ggtggggcgt cggcccgcac atggtggatc ggtgaggcat cgacgaagat gtcgcggtgc 1602061 gtgctgaatt tccgtttcac cacgaacgtt tcgagcaacc cgacgaattc ccgacgcccc 1602121 ggcgcatcgg tcgtaaacca gtcgtaacgc ccgtataccg gaaccgctgc cgccaccgag 1602181 gtgtcgacct gttcgaaccc gggctgaaat cgcggatcgt tgggggtcaa cgccgccagg 1602241 gcgcacagat ggccgccggc cgaaccgccg ctgatggcaa cgaaattcgg atccccgccg 1602301 taggcggcga tgttttcctt gacccacgcc agcgcgcgct tcacgtcgac aatgtggtcg 1602361 ggccaggtgt ggcgcggcga cacccggtag ttcagcgaca cgcataccca gccgcgcgca 1602421 gccagatggc tcatcaacgg atacgcctgc gggcggcgcc accccagtac ccaggcgccg 1602481 ccgggcacct gtaccagcac cggtgccttg gcgtcgcgtg gcaggtcgcg gcggcgccag 1602541 atgtcggcca ggttggcccg cccgtatggg ccgtagcaca cgacgttcgt cgtctcgacg 1602601 tagcgccggc gtgccatggc ggtacgcagc gggagattgc gacctctgct acgcatcggt 1602661 tccgtgggca gggtagcgag ttccttagcg tagtcgggcc cgagctgttc ggtcaggccc 1602721 gcttcgagca ccggtccagg ggtggtggcg ccgcggtagc ggatcaccgc aaggatcacc 1602781 caggccgctg ccgttaaggc cagtgccgcc tttcctttca gcccaccgaa gtcgcctcgg 1602841 cggccgcggc gcagtgcgtc cagcacggag gcgcctaggt acactcctgg cacttccgac 1602901 gtcggccagc ccaaccaaaa cgccagaacc gtgctgtagc cgctaccgga cagtgggcgt 1602961 aatccgttgg cggcattgag caattccacc gctgcacgtg ttaacggtct cgggcgtgcc 1603021 atccgccgaa atcgcattag ctgccgaccc gtgattgcag ctcggtgcgc aggatcttgc 1603081 cggtaatgcc gcgtggcagc tcgtcgagga cggcgatgtc gcgcggtacc ttgtagttgg 1603141 ccaggttgtc tcggacatgc tgcttgaggg tttccggggt ggccgaaaca ccgggcttga 1603201 gcaccacgaa ggccgccagc cgctggccgt actgctggtc gtccacgccg atcaccgcgg 1603261 cctcggccac gtcggggtgg gtggccagcg tcttctccac ctcgatcggg tagatgttct 1603321 caccgccgga gacgatcatc tcgtcgtcgc gcccgacgac gaacagccgg ccgttctcgt 1603381 cgaggtagcc gacgtcgccc gatgacatga acccggcatg gaaatccttt gcggcgccag 1603441 atgtatagcc atcgaattgg ctgtcgttgc ggacgtagat ggtgccgacc tcgccggtgg 1603501 gcacctcggt gaactgctgg tccaggatcc ggatttcggt tccttcggcg ggccgacccg 1603561 cggtgtcggg tgcggtccgc aggtccgccg gtgtggcggt ggcgatcatc ccggcctcgg 1603621 tcgcgttgta gttgttgtag atcacgtcgc cgaattggtc catgaatgcg atcacgacat 1603681 cgggccgcat ccgagaaccc gacgcggcgg cgaaccgcaa cgaccggccg tcgtagcggt 1603741 ttcgaatctc ggccggcagg tccatgatgc gatcgaacat caccggcacc accaccagac 1603801 ccgtcgcgtg gtggcggtcg atcaggtcca gcgtcgcctc cgggtcgaac ctgcgtcgcg 1603861 tgacgatcgt gcaggccagc gaggaggcca gcaccagctg cgagaagccc caggcatgaa 1603921 acatcggcgc cacgatcacg gtgacctcct cggcccgcca cggcgtgcgg tccaagatcg 1603981 ccttcagtgt cccgatgcca ccgccagaat gcctggcgcc cttgggtgtt ccggtggttc 1604041 cggaggtcag caggatcact tttccgtggc tgccggtgtg ctcgggccgc cgtccggcgt 1604101 gcgcggctac aagtttctca acggtcaggt cgtggtcttc gtcggtccac gccacgatac 1604161 gggtggcctg cggtttttcc gccagcgcgc gatccaccgt cgcgctgaac tcttcgtcat 1604221 agacgacagt gtcgacgcct tcgcgggtaa ccacctcggc cagtgccgga ccggcgaagg 1604281 aggtgttgag caacaggatg tgcgcgccaa tccggttgac cgccaacagc gcatcgacga 1604341 agccgcgatg attgcggcac atgatgccga cgaccctggg gggtccggct ggcagggcct 1604401 gaagcgccgc ggccagcgcg ttgccgcgtt cgtcgagctg gcgccaggtc agcgtgccca 1604461 gttcgtcgat caggccgggg cggtccgggc agcgtcgggc cgcaccggcg aaccccgccg 1604521 taaaccccat gccttcgcgg cgcatggcgg cgacgatccg caggtagcgg tctggtcgca 1604581 gcggagcgat caaccctgcc cggcgcatgg tggcgatcaa gccgaatgct tgtctgatac 1604641 gcatggctta gcccagaatc gggaagcggc gcttggcggc gaggtcgttg agggcttgct 1604701 gcatcaccga ccgtacgtgc tcgtcgaccg cgtcgacatc agggtcctcg ccgaactgct 1604761 tggtgaggtt gatcgggtct aacacctgca tgacgatctt ggcgggcagc ggcagattgg 1604821 gcgggatcgc ggcggagaac ccgaacggaa agccgaacga gatcggcagg atgtcgctgc 1604881 ggagcagtcg cttgagccct agccgccggg cgagccaggt gccgcgggac aggtagagct 1604941 ggctttcctg gccaccgatg gacaccgccg gcacgatggg cacgccagct tcgacggcag 1605001 tgctgacgta tcccttgcgg ccgttgaagt cgatcacgtt ctccgcgaaa gtcggccggt 1605061 acgcgtcata gtcgccgccg ggaaaaacga ccaccacacc cccggaccgc aacgccttag 1605121 ccgcgttttc tcgggtggcg cgaatgtagc cggtgcgtcg gaacaagtcc ccggtcaggc 1605181 ccatgaacaa gatgtcgtgg ctgagcgtgt agaccggtcg gtcgtagccg aacttgtcgt 1605241 agaagtcgac gctgaagacc ggcacgtcca tcgggaacat gccaccggag tggttggcca 1605301 cgaccagtgc gccacccggc gggaaggagt ccaggccatg cacctgcgac cggtggtagg 1605361 tcttcaagac tggacgcagc acacttatca ggcgctgggt taggccaggg tcgaatttgc 1605421 cgatgtcgcc gatacctgca tcgtccccgt taccagggct atcggtttcg ctcaactgtt 1605481 ctccctcgag gcctccgagg cctcattgcc gcgtcgggtc tttagatggt agcgatgcac 1605541 ggtggatagg cacacgcggc aggtctgcta gcaaggacga gaggtggtcc agagtggctg 1605601 aagctggtgg cgggcccatt tcggtgatcg cccggcatat gcagttgatt cgcgatgact 1605661 tcatctccga gttgtttgac aagatgaagg cggagattcg ggggctggat tacgacgcgc 1605721 ggatggcgga cctgtggcgg gcgagcatca ccgagaattt cgtgacggcc gttcactatt 1605781 tggatcgcga tacgccgcag tccttggtgg aggctccagc ggccgcgctg gcatacgccc 1605841 gcgccgcggc gcagcgtgat attccgttgt ccgggttggt tcgggcgcac cggctcgggc 1605901 atgcgcgttt cttggaggtg gcgatgcagt acgtgtcgct gctggagccc gctgaccggg 1605961 tgtcgacgat catcgagctg gtgaatcgct ccgctcgcct cgttgacctg gtggccgacc 1606021 agttgattgt cgcctatgag cacgaacacg atcgctggct gagtcgccgc agcggtctgc 1606081 aacagcaatg ggtcagcgag ctgctcgccg ataccccggt cgacgttccg cgggccgagc 1606141 gcgcgttggg ctatcggttg gacggtgtgc atatcgccgc ggtggtatgg gtcgattcgg 1606201 cggtgcccat cggtgatgtg gtggcgcaat tcgaccaggt gcgctgcttg ctggccgggg 1606261 agctgggccc cgaactgggc cccgtggcga actcgctgat ggtgccgacc gatgagcgcg 1606321 aggcacggct gtggttttcg cccgcgccca cgcgggcctt cgccccgtcg cggattcgcg 1606381 cggcgttcga gtcggcggga atccgggcgc gtttggcgtg cggtcgggta ggggacgggc 1606441 tgcgtgggtt ccgggcgtcg ttgaaacagg ccgaacgagt gaaggcgttg gccctggccg 1606501 gtggcgcccg gcccggcggc cgggtcatgt tttatgacga tgtcgcgcca gtcgcgttgc 1606561 tggccgacga tctagaggaa ctgcggcggt tcgtcaccga tgtgctgggt gacctgagtg 1606621 ttgacgacga gcgcaatagc tggctacgcg agacgttacg ggagttcttg ctgcgtaacc 1606681 gcagctacgt cgccacggcc gacgcgatga tcctgcaccg caacaccatt caataccggg 1606741 tgatccaggc gatggaacta tgcggacaga atctcgacga tcccgatgcc gcgtttcggg 1606801 tgcagatggc gctggaggtc tgccgctgga tggcaccggc ggtgctccgc gccaaacaat 1606861 agtgtctcgg taaccgccgg tccgttcatg ccgtgcgcac aatcgtggtc gtgagcttcg 1606921 gtgtcggcgc atatggtctc cgacggattc ggcgcctaac gtttgcccac gtcaaacaac 1606981 ccgaccagaa agccagccgg gtccgccaga ggggggcgga cccggcgtat acccaattcg 1607041 cgtcgctcgg ttctagttgg gcgctatcat ccgttgccac ggggttggtc ggaaggtcgg 1607101 tatgtcgttc gttttcgcgg tgccagagat ggtggcggca accgcttccg atttggccag 1607161 cctcggagcg gcgctgagcg aggccaccgc ggcggcggct atccccacca cacaagtact 1607221 ggccgcggcc gccgatgagg tgtcggcggc catcgcggag ttgttcggtg cgcacggcca 1607281 agaatttcaa gcgctcagcg cccaggcatc ggcgtttcat gaccggttcg tgcgggccct 1607341 aagcgccgca gcgggctggt atgtcgacgc cgaggccgcc aacgccgcgc tggtggacac 1607401 cgcggccacc ggcgcgtcgg agttggggtc aggtgggcgc acggcgctga ttctgggctc 1607461 caccggaacc ccgcgaccgc ccttcgacta catgcagcag gtctacgacc gctacatcgc 1607521 accccactac ttgggctatg cgttttccgg cctgtacacg cccgcgcagt ttcagccgtg 1607581 gaccggcatc cccagcctga cctacgacca atcggtcgcc gaaggcgccg gctatcttca 1607641 caccgcgatc atgcagcaag tcgcggccgg caatgacgtt gtggtgttgg gtttctcgca 1607701 gggcgcgtcg gtcgccaccc tggaaatgcg ccatctggca agcctgccgg ccggcgtcgc 1607761 gccgagtccg gatcagctct cgttcgtatt gctgggcaac cccaacaacc caaacggggg 1607821 catcctcgcc cggtttccgg gtctgtacct gcagtcgctc ggcctgacgt tcaacggtgc 1607881 gaccccggac accgactacg cgaccaccat ttacacgacc caatacgacg gctttgccga 1607941 cttcccgaag tacccgctca acatcctggc ggacgtcaac gcgctgctgg gtatttacta 1608001 ttcgcacagc ttgtattacg ggctcacgcc cgagcaggtc gcttcgggta tcgtcctgcc 1608061 ggtgtcttcg ccggacacca acaccaccta tattctgctt cccaacgagg atctgccgct 1608121 gctgcagccg ctgcgcggta ttgtgcccga gccgctgctg gatctcatcg agccagacct 1608181 gcgcgcgatc atcgaattgg gttatgaccg aaccggatac gccgatgttc cgaccccggc 1608241 cgcactgttc ccggtgcaca tcgacccgat cgcagtcccg ccccagatag gcgctgcgat 1608301 cggtggtccg ctcaccgccc tggatggctt gctcgacacc gtgatcaacg atcaactcaa 1608361 tcccgtcgta acgtcgggca tctatcaggc cggtgctgag ctgtcggtgg ccgcggccgg 1608421 ctacggtgct cccgcaggcg tcaccaatgc catttttatt gggcagcaag tgttgccgat 1608481 tttggtggaa ggccccggtg ccttggtgac ggccgacacc cattacctgg tcgatgcgat 1608541 tcaggatttg gccgccggtg acctcagcgg gttcaaccaa aacctgcaac tcatcccggc 1608601 taccaacata gccctgctgg tcttcgcggc cggaattccc gctgtggcgg ccgtcgccat 1608661 ccttaccggt caggattttc cggtataggc ccccggcccc cgctgtaccg agctcggcca 1608721 gtgaagaaca accccaggcg ttgccagtcc gaatagattg tattcgtcag ccggcgcagg 1608781 acaggaagcg aggccgccat gggatttctg aagcccgatc ttcccgacgt cgatcacgac 1608841 acctggttga cccagccacg ccggacacga ttgcaggtcg tgacacggga ctgggtagaa 1608901 cacggtttcg gaacgccgta tgcggtgtac ctgctctatc tgaccaagat tgcggtgtac 1608961 gtcgccgccg gcgccgcgat catctcgctg acccccggac tgggcgggct gagccgcata 1609021 ggcgactggt ggacacagcc gatcgtgtac cagaaggtca tcgtcttcac gttgctgttc 1609081 gaggttttgg gttttggctg cggatccggc ccgctgaccg ggcggttttg gccacccatc 1609141 gggggcttcc tttattggtt gcggcccaac acaattcggc tgcctgcttg gccggataag 1609201 gtcccgttca cccaaggcga cacccgcacc gtcgtcgacg tcgccttgta tgccatcgtg 1609261 ttgatcggcg gggtgtgggc gctgttgtca cccggctcgc caggtccggg gggaacgccg 1609321 gtcaccgccg ccggcgacgt cggcctgatc aacccggtgc tggtagtgcc gacgatcgtc 1609381 gccctgggcg tcttggggct gcgtgacaag acgatctttc ttgccgcccg cggcgaacac 1609441 tactggctga agctattcgt gttctttttt cccttcaccg accagatcgc ggcgttcaag 1609501 atcatcatgc tgtgcttgtg gtggggggcg gcgacttcca aactcaacca ccatttcccc 1609561 tacgtcgtcg cggtgatgac cagcaacaac gccctgttgc gcagcagagt gttcaacccg 1609621 atcaagcacc tgctttaccg cgaccacgcc aacgatctgc ggccctcctg gctaccgaaa 1609681 ctcatggccc acgggggtgg caccacggcg gaattcctgg tgcccgggat tctggtgctc 1609741 gtcgccgacg gtcacccatg gcggtggttc ctcatcgggt tcatggtgct ctttcacctc 1609801 aacatcctgt ccaacctccc gatgggggtc ccgttggagt ggaacgtgtt cttcatcttc 1609861 tcgctgtgct atctattcgg ccactacggc gcgatcactg ccaccgacct tcggtcgccg 1609921 ttgctgctgg cgatcgtgat cgcggtggtt gccgtggtga tcatgggaaa cctgttgccc 1609981 gaaaagattt cgtttctgcc cgccatgcgc tactacgccg gcaactgggc caccagcatc 1610041 tggtgcttcc gaggtgatgc ggaagccacc atggaaacca gcgtcgtgaa aagctctgcg 1610101 ctggtggtca atcagctggc caagctctac gacggggcca cggccgaaat catgaccgac 1610161 caggtcgccg cattccgggc catgcacacc cacggcaggg cgctcaacgg cctgctgccc 1610221 cgcgctctcg atgacgaagc tcactaccgc atccgcgagg gcgaaatcgt ggccgggcca 1610281 ctggtcgggt ggaatttcgg cgagggccat ctgcacaacg agcagctggt ggccgccgtg 1610341 cagcggcggt gcaacttcgc cgacggcgat ctgcgggtga tcattctcga aggtcagccc 1610401 atccacgttc agaagcagtg gtatcgcatt gtcgacgcca agaccggttt gttcgaggcc 1610461 ggttacgtca cggtcgagga catgttgagc cgccagccat ggcccgagcc cggtgacgag 1610521 ttcccggttc acgtcacgac gcaacgcggc acgccgtcaa agccatgacg accgcggtcg 1610581 tcgtcggagc cgggcccaac ggcctggccg cggcgatcca cctggcccgt cacggtgtcg 1610641 acgtgcaggt gctggaggcg cgcgacacca tcggcggggg agcacgctcc ggtgagctga 1610701 cggtgcccgg ggtcatccac gaccactgtt cggcgtttca tccgctgggc gtcgggtcgc 1610761 cattctgggc ggcgatcgac ctgcaacgct acgggctgac gtggaagtgg ccggacgtcg 1610821 actgcgcaca cccactcgat gacggcaccg cgggcgtgct atatcggtcg atcgaagcca 1610881 ccgccgccgg cctgggtccc gacggcaagc ggtggcagcg cgccgtgggt gacctcgccg 1610941 ccggattcga tgagctggcc gaggatctgc tgcgcccggt gctcaacatg ccgcgtcacc 1611001 cgatccgcct ggcccgcttt ggtccgcgcg cggcgctgcc ggccaccgcc atggcgcgtc 1611061 ggtttcacac cgagcgggcg cgcgcgttgt tcggcggcgc cgcggcgcac gtctacacca 1611121 ggttggatcg gccgctgacc gcgtcgctgg ggttgatgat cctggccagc ggccatcgcc 1611181 acggttggcc ggtcgcccgg ggcggatccg ggtcgatcac gaaggcgctg gccgcggccc 1611241 tggacgcgta cggcggcacc gtcgccaccg gggtgaccgt caccagccgc cgcgacatcc 1611301 ccgacgccga catcgtgatg ctcgacctca gcccggccgc ggtgctcggg atctacggcg 1611361 atgtgatgcc cacccgcatc aaccggtcct atcggcgcta ccgcgccgga tcgtcggcct 1611421 tcaaggtcga cttcgccatc gagggcgacg ttgggtggac caaccccgat tgccggcgcg 1611481 cgggcaccgt ccacctgggc gggaccttcg cggaaatcgc agacaccgaa cgtcaacgcg 1611541 cccaaggcac gatggtgcag cgaccattcg tgctcgtcgg gcagcagtac ctcgccgacc 1611601 cgtcccgctc ggtcggcaac atcaacccca tctgggccta cgcgcacgtg ccgttcggct 1611661 acaccggcga cgccaccgcc gccgtcatcg accagatcga gcggttcgcc cccggattcc 1611721 gcgaccgcat cgtggcaacc gtcagcacct ccaccaccga actgcaaacg tacaaccgca 1611781 acttcatcgg cggagacatt atcggcggcg ccaacgaccg gctgcaggtc atcttccgcc 1611841 cgcgcgtggc cgtcgatccg tatgcgatcg gtgtgccggg tgtctatctg tgttcacagt 1611901 ccgcgccacc cggtgccggg atccacggat tgtgtggcta ccacgccgcc gaatcggcgc 1611961 tgaggtggct gcgcaagcga cgttgacgca ggtcatcgcc gagatcgacg ttagcgcgac 1612021 gtccactcgt gccgtagcca aaacgtgacg gaggtttgat cgaattgcta aggcgcgcct 1612081 gcacttccac tcttcaatgc acctctacca tcactggtgc aactgtgtcg ttgacaggga 1612141 attggagcca tgcgggcggt ttttgggtgt gctattgccg tcgtcgggat cgctgggagc 1612201 gtggttgcgg ggccggccga catacacctg gtggcggcga agcagtctta cgggttcgcc 1612261 gtcgcgtcgg tgctaccaac gcgcggccag gtggtgggcg tggcgcaccc cgtggtggtg 1612321 acgttcagtg cgccgataac taacccagcc aatcggcacg cggccgagcg cgccgttgaa 1612381 gtcaaatcga cgcccgcgat gaccggcaag ttcgaatggc tcgacaacga cgttgtgcag 1612441 tgggttcccg accgcttctg gccggcgcac agcacggtgg agctttcggt gggcagcctg 1612501 tcgagcgatt tcaagacggg tcccgccgtc gtcggggttg ccagcatctc ccagcacacg 1612561 ttcaccgtga gtatcgacgg agtcgaggag ggaccgccgc ctccgctgcc ggcgccgcac 1612621 caccgagtgc acttcggcga agatggggtg atgccggcat cgatgggtag accggaatac 1612681 ccgacgccgg tcggctccta cactgtcttg tccaaggaac gctcggtgat tatggattcg 1612741 agcagcgtcg gcatccccgt cgacgatccc gatggttacc ggctttcggt ggattatgcc 1612801 gtccgcatca ccagccgcgg cctctacgtg cattcagccc cgtgggccct tccagcactg 1612861 ggacttgaaa atgtcagcca cggctgcata agcctgagcc gcgaggacgc agagtggtat 1612921 tacaacgcgg tcgacattgg cgacccggtc attgtgcagg aatagcagct gatgcgggcg 1612981 tcgcccgcag agcgcgtcga cggcgcgtac gcgggtgcgg ggcctcacac ccagtccgtc 1613041 ctggaagagg accagcgtca gcgcgcacct gcgggcgcag aggccgaagg accgggcaga 1613101 accggctgac caggcaccgg tccgccagct ggcgccggat cggtcagcgc atccttgacc 1613161 ccggacatgc caatgatggg agcactgacc acaccatccc cgggagcacc agccaggacc 1613221 ggcccaagcg caatcagcgg agttccgacc ggtatcaccg gagccggaac ggcgggtacc 1613281 ggtaccggtg cgcccggtat cggtaccggt ccgccgggga ttggtaccgg tgcgcccggt 1613341 atcggtaccg gtgcgccagg gatcggtacc ggtgcgccag ggattggtac cggtgcgccg 1613401 atgggcaccg gtgcagctgc cggcactggc ccaggcgcga cgaacggaac accagccatg 1613461 tcagtaagtg cggcactgca cgctcccgtg gctgccggtc caccggcagc caccgggtcg 1613521 ccggcggcta ccggcgcgtc gcccgccatg ccctggatgc acgcgtagcc acccgtcatc 1613581 agcgggtcag ccgccgcgtc cgggcttaac gctatagcag ctgcaaacaa cccagcgccg 1613641 gcaattactt tgatgttgaa ccgattgacg atcgccatca gcgtcaactc tcctctattc 1613701 gcgcgcagat atttccgcaa tcaatttggt tcagcagaac cgcatagccg tatcgagttc 1613761 cttttcgacc accggctcaa ttgtcagcat cctatgggga acatgagccc cgccgcaccg 1613821 ggccgtttcc aaatggtgac gtcacaacgg tgtcacaagc cagcgcaatg tccgcggtag 1613881 ggacgcggcg gctgggatcg gtggggtgag cgcccggctt ctcaaagcga ggggagcccc 1613941 gggactctta ccggccgaag gcggcgggtg tcactgatct aggctgacgg ccagtggttg 1614001 tttagccaac aaggatgaca acaaataagc cgaggagaga caagtgacgg tccgagtagg 1614061 catcaacggg tttggtcgaa tcggacgcaa cttctaccgg gccttactgg cccaacagga 1614121 gcagggcacc gccgacgtgg aggtggtcgc cgccaacgac atcaccgaca acagcacgct 1614181 ggcgcatctg ctcaaattcg actcgattct gggccggctg ccttgcgatg tcggcctcga 1614241 aggcgacgac accatcgtcg tcggccgcgc gaaaatcaag gcgctcgcgg tccgggaggg 1614301 gccggcggca ttgccatggg gagacctcgg cgtcgacgtc gtcgtcgaat ccaccggcct 1614361 gttcaccaat gcggccaaag ccaaaggcca cctggacgcc ggcgccaaga aggtgatcat 1614421 ctctgcgccc gccaccgacg aggacatcac catcgtcctg ggagttaacg acgacaagta 1614481 tgacggcagc cagaacatca tctccaatgc gtcgtgcacc acgaactgcc ttgcgccgct 1614541 ggccaaagtg ctcgacgatg agttcggcat cgtcaagggc ctgatgacca ccatccacgc 1614601 ctacactcag gatcagaacc tgcaggacgg gccgcacaag gacctgcgtc gcgcccgcgc 1614661 cgccgcgctg aacatcgtgc cgacctccac cggcgcggcc aaggccatcg gcctggtgat 1614721 gccgcagcta aagggcaagc tcgacggtta tgcgctgcgg gtgccgatcc ccaccggctc 1614781 ggtcaccgac cttacggtcg acttatccac acgggccagt gtcgatgaga tcaacgcggc 1614841 gttcaaagcc gcggccgaag gcaggctcaa gggcattctg aagtactacg acgcgccgat 1614901 cgtctcgagc gacatcgtca ccgacccgca cagttcgatt ttcgactctg ggttgaccaa 1614961 agtcatcgac gaccaggcca aggtggtgtc gtggtacgac aacgagtggg gctactccaa 1615021 ccgcctggtt gatctggtca cgctggtcgg caagtcgctc tagccatgag cgttgcaaac 1615081 ctcaaggatc tactcgccga aggtgtttcg gggcgtggag tgctggtgcg ctccgatctc 1615141 aacgttccgc tcgacgagga cggcaccatt accgatgcgg gccgcatcat cgcgtcggcg 1615201 ccgacgttga aggcgttgct cgacgccgac gccaaggtgg tggttgccgc gcacttggga 1615261 cgtcccaagg acgggccgga cccgacactg tcgctggcgc cggtcgccgt ggcgctgggt 1615321 gagcaactcg gccggcacgt ccagctggct ggagacgttg tcggcgccga tgcgctggcc 1615381 cgcgccgagg ggctcaccgg cggcgacatc ctgctgctgg agaacatccg cttcgacaaa 1615441 cgcgaaacca gcaagaacga tgacgaccgg cgggcactgg ccaagcagct ggtcgaactg 1615501 gtcggaacgg gaggcgtttt cgtctccgac ggctttgggg tggtgcaccg caagcaagcc 1615561 tcggtctatg acatcgcaac cctgttgccg cactacgccg gcacgctggt cgccgacgag 1615621 atgcgggtac tggagcagtt gaccagctcg acccagcggc cctatgcggt agtgctcggc 1615681 ggatcaaagg tgtccgacaa gctgggtgtc atcgagtcgc tggcgaccaa ggcggacagc 1615741 attgtgattg gcggcggaat gtgcttcaca ttccttgctg cacagggatt ttcggttggc 1615801 acatcgctgc tggaagacga catgatcgaa gtctgtcgcg ggctgctgga aacctatcac 1615861 gacgtgttgc ggctgcccgt ggatctagtg gtcacggaga agttcgccgc cgactcgccg 1615921 ccccagacgg tcgacgtcgg cgctgtgccc aatggcttga tgggcctgga tatcgggccg 1615981 ggatcgatca aacggttcag cacgctgctg tccaacgccg ggaccatctt ctggaacggg 1616041 ccgatgggag tattcgagtt cccggcttat gcggccggca ccagaggcgt cgccgaggcg 1616101 atcgtcgccg ccaccggcaa aggggcgttt agtgtggtcg gcggcggtga ctccgcggcc 1616161 gcagtgcgcg cgatgaacat ccccgagggc gccttctcac acatatccac cggcggcggt 1616221 gcctcgctgg aataccttga gggcaagacg cttcccggca tcgaggtact gagccgtgag 1616281 cagccaaccg gaggagtttt gtgagccgca agccgctgat agccggcaac tggaagatga 1616341 acctcaacca ctacgaggcg atcgcgctgg tgcaaaagat cgcgttctcg ttgccggaca 1616401 agtattacga ccgggttgac gtcgcggtga tcccgccgtt taccgacctg cgcagcgtgc 1616461 aaaccctggt cgacggcgac aagctgcggt tgacctatgg tgcacaagac ttgtcaccac 1616521 atgactccgg tgcctatacg ggtgacgtca gcggcgcctt tctggccaag ttggggtgca 1616581 gttacgttgt cgtcgggcac tccgagcggc gcacctatca caacgaggat gacgcgctgg 1616641 tggccgccaa agccgccacc gcactcaagc atggcttgac cccaatcgtg tgtattggcg 1616701 agcacctcga cgtccgcgag gcgggaaatc atgtggccca caacatcgaa cagttgcgtg 1616761 gatcgctggc cgggctattg gccgagcaga tcggcagcgt cgtcatcgcc tacgaaccgg 1616821 tctgggcgat cggcaccggg cgggtggcca gcgccgccga cgcccaggag gtgtgtgcgg 1616881 cgatccgaaa agagttggcc tcgttggcct cgccgaggat tgccgatacg gtgcgggtgc 1616941 tctacggcgg ctcggtgaac gccaaaaacg tcggcgacat cgtggcccag gatgacgtcg 1617001 atggtggcct ggtcggcggg gcgtcgctgg acggggagca tttcgcgacg ctggccgcga 1617061 ttgcggccgg tggtccgttg ccgtagcgga tcgcgggcgt gctacacccg tagaccttcg 1617121 agtagggcca taaatgcgcg ttcgacctcg actctggtcc ggtctttgtc cgtcgcgtcc 1617181 gcgatctgca gcgcggattc ggttagcgcg gccagcagca gatgcgaaag tggtggcaac 1617241 ggtacgcgct gaatcacccc ggcggccatc ccgcgttcga gagccccgac cagcagacca 1617301 agccctagcg catgtcgatc cggcgccatt cgccccaccc gagcactgac gggccgtcaa 1617361 tcgcaatgac ctgcagcgca tccggtttgg tcgccgcgtc aaggaaggcg tggaagccga 1617421 cgaccagcag atccaggcgt cggtgacctt cgctatggcg gcttcgacgt cggcgaccag 1617481 gtcggcttcg acaacctcga gtaccgtctg gaacagatct ttcttgctgt cgaagtggta 1617541 gtccagggcg ccacgggtga ctcgggcacg ggtgacgatg tcttcgatcg agacgtcacc 1617601 atagtcgcgc cgcgcgaata ggtaacggcc agcgtcgacg agggctcgac gcgtcgcgtc 1617661 cgtgtggtcc gagcgcctgc tggccgtcat ttcgacgtca agcccggctt cgcatggttg 1617721 tcaaccagcc acgccaggcc gacggatgct tgactacctt gatcaacagt gggagcgagt 1617781 cgaaatagct cacgcgttct acggccttgt cgccacgcag caggaaccga tcgacgactg 1617841 gccactcgac gacctcgctg ccgagccgtg ctatcagccg gaactcgatg aacaccacgt 1617901 cgcctgcttg gctccaccgg tcaacttccc cgtgcaggtc aggcagcaaa cccagaatcc 1617961 gagtgaactc ccgctgggcc gcccccaggc cgtgcctcgg cggtgacagt ggccgtacca 1618021 ggactacgtc gggatgaagg tgatcggtca gtctatccgg cgacggcgcc ttccagaagt 1618081 cggcgaaccc ttcgacgaat gcgttggatg cgctcatctg catggccctt tcggtgtttg 1618141 ttcgctcgac agtcttactg cgtaagcctg ggggcgaatt cagcggacat cgttgcttat 1618201 cggtaggaag ctacggccgt cacagtggtc tcagcagcgg gggaatacac attttgcccg 1618261 ccccggcgcg acaactcggt tgaagtcatg cccggatcgg catgtttggc cacgaacgga 1618321 atcgcgacag cgccacggcg tcgagcctcg ccatgcacct agccggcgcc tttgaactcg 1618381 tgagcggacc gaagtggacc gcctgtcgct tcgaggcggg cacagtgcgt attccctcgc 1618441 aagggaagcg ccggtggcag gcgtgacagc cgcggtcagt gcacgcctca aagccgatga 1618501 ggcgcgacgg cctgggttct acgcggcagg cagcggtccg ctgccgcagg ttcgggggag 1618561 tacgctaccc gtcatggaat tggccctgca gatcacgctg atcgtcacga gcgtgctggt 1618621 ggtgttgtta gtactgctgc accgggccaa gggtggcggg ctatcgacac tgttcggcgg 1618681 tggtgtgcag tcaagcctgt ccggctcgac ggtggtggag aagaacctgg accggttgac 1618741 gctgttcgtt accggcatct ggctggtgtc catcatcggc gtggcgttgc tcatcaaata 1618801 ccgctagcgc tggtcggcta ccgccgaccg gaccggggga agcggtagct cattgccgat 1618861 tacgacttgg tgcagcgcag gattctgctg accatgaccg ggctggccag cgcgctcaga 1618921 aacagtggtt agtcggcctg accggtcacc cgtgctttcc ttgcgcgcca ttggcgccgc 1618981 cgatcccgtc gggcacaccg acgccgccag gtccgccggt gccgccgtcg ccgccaaagc 1619041 cgggattgcc gccaccttgg ctgggcccgc cgtcaccgcc gttgccgccg gcgccgccgt 1619101 taccgccggc gccggtgccg cctccgcctg ccccacccgc ggcgccgttg ccaccgttgc 1619161 cgccgttgcc ggcttggcct ttgccgtcga ggctttcgat atagccgccg gtgccgccgg 1619221 tgccgcctgc gccgccagcg ccgccggcgc cggcgctgct gccattgccg atggtcaatg 1619281 cgctggcgcc gccggtgcca ccgacgccgc cgttgccgcc ggtaccgcct ttgccgccga 1619341 tcattgagct gccgccgccg ttgccgccgg cgccgccgtc gccgccggcg ccgccggcgc 1619401 cggcgctgct gccgccgatg ccagctgtgc cgccagtacc accggcgccg ccggtgccgc 1619461 cgtcgccgcc gatgccgcca gcgcctagcg ccgtgccgcc gtcgccacct tggccagcgg 1619521 tgccgccgtt gccgccggcg ccgccattgc cgaacagccg gccaccagcc ccacccgcag 1619581 cgccgttgcc gccgtcgccg ccacgggcgc cgttggcacc gctgttagga ctgtcgccgg 1619641 caccgccggc gccgccgtcc ccgccggtcc caccggcgcc gccggtgccg aacatcccag 1619701 cagcaccacc ggcaccaccg ccaccgccgt taccgccagg gctggcgggg acggggggga 1619761 ggccgccgcc gccgtcggcg ccgctggcgc cagtaccgcc gttgccgccg gcgccgccgt 1619821 tgccgcttag ccagccaccg gctccaccgg cgccgccggc tccaccggcc gcgccggttc 1619881 cggccgccgg gctgtaaccg gcaccgccgg ccccgccgtt accgaacatt ccggcatccc 1619941 cgccgttgcc gccgttgggg tgggcggcgt cgccggctcc gccgttcccg ccgttgcccc 1620001 acagcaaccc gccggcccca ccgttttgac cgggcagccc gtcggcgccg ttgccgatca 1620061 gcggacgccc cagcagtgtc tgggtgggcg cgttgatggc cgcgagcacc tgttgttcga 1620121 gggcctgcaa gggggaggcg ttggcggcct cggcggcggc atacgagccc acgctcgcgg 1620181 ttaaggcctg cacgaactgt tgatgaaatc tggccatttg ggcactgagc gcctgatagt 1620241 cgcgggcgta gccagaaaac aacgacgcga tggccgccga cacctcatcg gccccggcgg 1620301 ccaggacacc agccgtcgtg ggtgccgcgg ctccgttggc cgcgctaagc gccgcaccga 1620361 tgctcgccac atccgctgcc gccgctgaca acattcccgg gactaccatc acgttcgaca 1620421 tcgctgcagt ctaaaacctg gtgccatcgt tgcgacgcaa aacaatcgac atgcttacca 1620481 tttctgagct caactagctg ctaggttgcc gcactagact gctgcaaatg caggtctata 1620541 cgtcggcaac gcactggggc gtgttcaccg ctcgggtgca cggcggcgac attgcggccg 1620601 tggccgcgct cgccagtgac accaacccgg ctccgcagct gcaaaacctg cccggcgcgg 1620661 tacgtcaccg cagccgcatc gccaaccccg ccgtacggcg cggatggctg cagcatggcc 1620721 cggggcccag ctcggctcgc ggcgccgaag agttcgtgga ggtcagctgg gacgagttga 1620781 tcgagctgct ggcttccgag ctgcgccgta ccgtcgaccg ctacggcaac gaggcgatct 1620841 atggcagctc ctacggctgg gccagcgccg gacggttcca ccacgcgcaa agccaggtgc 1620901 accggttcct caacatgctc ggcgggtaca ccgcatcccg gcacagctac agcgccggcg 1620961 cgtccgaagt gatcttcccg catatcgtcg gcgcggccct gttcgaagcc ctggccgaga 1621021 ccacgacctg ggatgtcatc gtcgaccaca ccgcgctgtt ggtggcgttc ggcggattgc 1621081 cggtgaagaa caccgcggtg atgcccggcg gtaccaccgc tcatccggac cgcgactacg 1621141 tcggccggta ccgggctcgc ggcggtcggc tggtgtcggt cagcccgcta cgtgacgaca 1621201 tcgccgcgat cgccggtccg ctcgacgatc gatgtcgctg gcttgcgccg gtgcctggca 1621261 ccgatgtggc gatcatgctc gggctggcat acgtgctggc caccgagtcg ctggccgatc 1621321 gcgcgttcct tggcaggtat tgcaccggct acgaacgctt cgagcgctac ctgctgggcc 1621381 tggatgatgg gattcccaag acacccgaat gggccgccgc gctgtccggg ctcgccgccg 1621441 gcgatctgcg agatctggcc cgccggatgg ccgagcaccg gactctgatc accaccagtc 1621501 tgtcgttaca gcggatagag cacggcgagc agaccgtgtg gatggccgcg accctagcgg 1621561 cgatgctggg ccagatcggg cttcccggag ggggtttcgg tcacggctac agcagcaacg 1621621 gcgtcggcaa cccgccgttg gcgtgcggcc tgccggcatt gccgcaaggc aacaatccgg 1621681 tgtcgacgtt cattccggtg gcggcgatca gtgagctgct gcagcggccc ggccagcggc 1621741 tggcctacaa cggccgattg ctggagctgc ccgacatcaa gtgcgtctac tgggccggtg 1621801 gaaatccgtt ccaccaccac cagaacctgc cgcggctgcg tcgtgcactg tctcgggtag 1621861 acacgatcgt ggtacacgaa cagtattgga ccgcgatggc caaacacgcc gacattgtgg 1621921 tgccaaccac caccagtttc gagcgcgacg acttcgccgc cagcaagacc aatcccacct 1621981 tgatcgcaat gcctgcgatg gtgccgccgt atgccaacgc ccgcgacgac taccacacgt 1622041 tctccgcgtt ggcccaccgg ctggggttcg gcaagcaatt caccgagggc cgcagcgcgc 1622101 gcgagtggct cgagcacatg tacgacaagt ggtcggccga gctggatttc ccggtgccgt 1622161 cattcgccga attctggcgg accggccggc tggaactacc gaccagaacc ggtttgacgt 1622221 ggcttgccga tttccgggcc gacccggcgg cccatccgtt ggggacaccc agcgggcgga 1622281 tcgagatctt ctcggacacg gtcgacgcgt ttgccttgcc ggactgtgcc gggcacccca 1622341 cctggtatga accgtccgaa tggctaggcg ggccgcgggc cgcgcgctac ccgctgcatc 1622401 tgatcgccaa ccagccgcgg acccgactgc acagccagct cgatcacggc ggcgccagca 1622461 tggcatcgaa aatccgtgga cgagaaccga tccggattca cccggatgac gccgcggccc 1622521 gtgagcttac tgacggcgac atcgtgcgcg tgttcaacga ccgcggcgcc tgcctggcgg 1622581 gtgtggtgat cgacgacggg ctacggccca aggtggtgca actgtccacc ggtgcgtggt 1622641 tcgatcccgc cgatccgcgc gacccggact cgatgtgtgt gcacggcaat cccaatgcgc 1622701 tgagcaacga ttccggcacg tcgtcactgg cccacggcag caccggccag catgtcttgg 1622761 tccagatcga gaggttcact ggcgaactgc cgccggtgcg cgcccacgag ccaccgcggc 1622821 tggcttagcg ccggacgtcg acttgttggg cgcgaaacgc cgcaatggac cgaacgactc 1622881 gacgtaagtg tgccctgctg gtgtcggctc gagtcgcagc acgggtgagc accacgtgcg 1622941 ccactagccc tgagcgaagt gtcgctgcaa ccgccggtgc cgatgaccga agagcgcgcg 1623001 caaccctgcc gcgatgagcg gcgcggcaaa cctgagtccg gcacgcgtct ggaacgtgat 1623061 gcggtcgcgg acaatcgttt tcgtgtcacc ctcgggcgtc acggtgcgtt cgtgctgcca 1623121 ttgccgcatg ctcagcatcg tcgaatcctc gcgaaaccgc cgtcccggct cgagctcggc 1623181 gatgctgagc cggtcatagt cgaatggcaa cacaccgaac agtcgcagcc aggcacgtcc 1623241 gatcggcgcg ccgatcggca ccgtgtcgac ggtcatccct ttcgcgccgc gaggcaccga 1623301 catcgtcatc caggggcgca actcatcgtt gatgccctcc ggggtgacga cccgttgcca 1623361 cacctgctcg gcaggtgcgg cgacgacgct ttgccgttca atgagcaccg gttcagcgta 1623421 tccgaccacg cggcgcggtg gggctacgtc tccctcgcct cggtggctgc ctaaaggccg 1623481 ttccgtcccg ggttgagttc tgcgatgcag aggtggcaga tcgtcaatgc gggcgagaat 1623541 ttgttccggc ctctgttgat gcgggtgaca tcggaaggtg tgggtaaagg gatcagcccg 1623601 agatcatgca atcactgtcc tgacaaccag attcagcacg gcctggtaat cgacaggatt 1623661 ctgggactat cagactccag catcacggtt ctcacccggg cccaggtcga ggcgatggtc 1623721 gcggcgctgc cgcgaagcta ctgattccgc gcagctgctc tgtcagggcc gctgactttt 1623781 ctctcggtca tcgtggtcgc aggcgccgca ctcggtgtct tcgggggggg gggggggggg 1623841 ggggggggga agcgcgacct cgaaggccac tgaaacgcct tacggagacg cgacgaacca 1623901 aatgccgacg aatacggcga ggccggtggc taccgggagc ctgccacaga ggatcgccca 1623961 acctgcccag atcgttgcct ggccgaggaa catcgggttc cgcgagaacg cgtagggacc 1624021 tccagctcct cgatgacccg cctcagttcg tcggctcgtg cacggtccgg tttcggagcc 1624081 ggtccaacac gccgcgaacc gcgtgctcgg tgaccgacag cggtgacatc accgtttcgc 1624141 cgaggctcac gatgtagtcg atccgatcga cgatggcttc caccggctcg accaacacga 1624201 tgagccgttt cgcgagatcg tccaggctgt gcagggtacc ttcgagatgg tccagaccgt 1624261 cctccaagcg ctccacggtg ctgttcagct gtgacagcga gctgttcagc tcggccatgg 1624321 tcttacccag accgtccagg acgtcttcga cctgctccac cgtcttgtcg gcgttcaatg 1624381 cggcctgggt gagggttttc attcgccgtc gcacgggcgc ggggcggccg cttctgtctg 1624441 ccatgacggt cattatgacc ctgacgcggt taactcggaa gcttggcggc ggcgtcgcgg 1624501 tccagcagcc agagcgtgtt ctgacgcccg acggccccgg ccgccggtac cgaaaccgga 1624561 tcggcgccgc cgatggccgc ggccacggcg tcggccttac ccggcccgga aaccagcagc 1624621 cacacctcgc gggaacgctg aatcgccggc agggtcaagg tgattcggcg tggcggcggt 1624681 ttcggcgagt cgtcgaccgc caccaccatg cgggtgctct cgaggacggc ggggctgtgc 1624741 gggaacagcg agttaatgtg gccctcgggc cccatgccca gcaggtggac gtcgaaattc 1624801 ggcgccgggt cacctggtgc ggcactggcg gccagcacct gttcgtaggc cagggccgcg 1624861 gcgtccagat cgccgccgaa gtcaccatca ctggcggcca tcgggtgcac ctggttcgat 1624921 ggaatgtcga cgtgattgag caacgcccgc cgggcctgct tgagattgcg ctcgtcatcg 1624981 tcttcgggaa cgtagcgttc gtcgccccag aacaggtgca ccttggacca ttcaatctgc 1625041 tgtgcttggg cgctgaggta gcgcagaagc gcaatcccgt tgccgccccc ggtcagcacg 1625101 atcagcgcct gccctctggc cgccaccgcg gccccgatgg cgccaaccaa gcgcttaccc 1625161 gcggccgcga ccagaatgtc gctatcgggg aagatctcga tgctactgct caccggtact 1625221 gcaccttctt gattccctcg agcgcggcgc agtagatttc gtcggggtcc agccggcgca 1625281 ggtcttcggc taggcactca ccggttaccc tgcgcgccaa aggaaccaga gcgtcgggct 1625341 tgcccgtccg ggtcagggtg gccgtgattc cctcctgggg acggcttagc acgatggtct 1625401 cgctgttgcg caccagctcg actttgagtt cgccgaccgc ccgtcgcacc ggaccttcga 1625461 tccggctggc tagccagccg gctaggacgt cgagcgccgg ttcggtcttc aagccggaca 1625521 ccagcgccga ctcgatcggc tcgtgtggcg gctggtcgac ggccgacgtg agcagcgcac 1625581 gccaataggt gatgcggctc caggccagat cggtgtcgcc ggcgccgtag ccggctagcc 1625641 ggctcttgat ggccgacagc gggtcgattg cgttggtggc gtcggtgatg cgccgaattg 1625701 ctaacttgcc caacgcatcc tgtgctggca ccgccggtgc gatgtcgggc caccacgcca 1625761 ccaccgggat gtcgggcagc aggaagggga taacgacgct gtcggcgtgg ccggccagtg 1625821 gcccggacag ccgcagcacc acaaactcgc cggcgccggc gtcagcgccg acccgcagtt 1625881 gtgcgtccag ccgcggtctg tcggcgtacg gatcgccccg catcgttacg atgatgcggc 1625941 tgggatgctc atggctggcg tcgttggccg cctcgatgga ctcttccagc atggcttcgc 1626001 tgtccggcgc aatgatgagc gtgagtaccc ggcccatcgc gacggcgccg atcttttcgc 1626061 gcagctcgtc gagcttcttg ttgaccgcgg tggtggtggt gtcgggcaag tcgacaatca 1626121 tctgcgccgc tcctcctcat cgcttcgctc tgcatcgtcg ccggcgcgga tcactatggc 1626181 cgccgccatt cccggccggt gcggcgcagc atctccaagg atgattccgg accccaggta 1626241 cctgcctcgt aggcgtcggg cgtcccgtgt gccgcccaat gttccaacgc tggatcgagg 1626301 atctcccacg ccagttcgac ctccgcgttg accggaaaca gcgagggctc gccgagcagg 1626361 acgtcgagga tgagccgctc gtaggcctcc ggtgaatctt cggcgaatgc cgagccgtag 1626421 gagaagtcca tgttgacgtc gcggacttcc atggcggtgc ccggcacctt ggagccgaac 1626481 cgcaatgtga caccttcgtc gggctgcacg cggatgacca tcgcgttggt gcccagctcg 1626541 tcggtcatgg tggcgtcgaa cggcagatgc ggcgcccgcc tgaagaccag agcgatctcg 1626601 gtcacccggc ggcccaatcg ttttcccgtt cgcagataga acggcacgcc ggcccaccgg 1626661 cgcgtatcga cttccagggt gatagcggcg aaggtttcgg tggtggagtc ctcggcgaac 1626721 ccctcctcgt cgagcagccc aaccaccttc tccccgcctt gccagccggc ggcgtactgg 1626781 ccgcggctgg tggtctggtc gagtggctcg gcaaggcggg tggccgagag caccttgatc 1626841 ttctcggcct gcaacgctgc cgggtggaag ctgaccggct cctccatcgc ggtcagcgcc 1626901 agcagctgca tgagatggtt ctggatgaca tcgcgggccg cgccgatgcc gtcgtaatag 1626961 cccgcgcgcc cgcccaggcc gatgtcttcg gccatggtga tctgtacgtg gtcgacgtag 1627021 tgcgcattcc agatcgggtc gaacagctgg ttggcgaacc gcagcgccag gatgttctgc 1627081 accgtctctt tgcccaggta gtggtcgatg cggaagaccg cttcctccgg gaagaccgcg 1627141 ttgaccgcct tgttcagctc gcgtgcgctg gccaggtcgt ggccgaacgg cttctctatc 1627201 acgactcggc tccaccggtc gccttgcggg cgggccaggc cggacttgtg cagctgctca 1627261 cacaccaccg ggaaggattt gggcgggatc gccaggtaga aggcgtggtt gccgccggtg 1627321 ccgcgctcgg cgtcgagctt ctccagcgtc tcggcgagtt gggcgaacgc gtcgtcgtcg 1627381 tcgaaagtgc ctggcacaaa acggaatccc tcggccagcc ggtcccagtt ctgttgccga 1627441 aacggtgttc ggcagtgctc ttggacggcg ttgtacacca cttgaccgaa atcctgggtg 1627501 ctccagtctc ggcgggcaaa ccccaccagc gagaatgtgg gcggcagcag gccgcggttg 1627561 gccaaatcgt agacggccgg catcaccttc ttgcgggcca ggtcgccggt gacgccgaaa 1627621 atcaccatgc cgcacgggcc ggcgattctg ggtaatcgct tgtcccgctt gtctcgtagc 1627681 gggttgcgcc acgacgccgc ggcgtgggcc ggtttcattg ggcagcggtg tcgagatgcg 1627741 cccgggtttc ctggagtagc tcgttccagg aggcctcgaa cttccgcacg ccttcctcct 1627801 cgaggacggc aaacacgtcg gtgaggtcga tgccgatcgc ccccagctgg tcgaacaccg 1627861 cctgggcatc ggatgcagtt ccggtgaccg tgtcgccttg gatcacgcca tgatcagcga 1627921 cggcgtcaat tgtcttttcc ggcatagtgt tcacggtgtg tggggcgacc aactcggtga 1627981 cgtagagggt gtccgagtaa tcggggttct tcacgccggt ggaagcccac aacgggcgct 1628041 ggacccgggc gccgtcgacc ttgagggacc gataacgatc gctgtcttcg aagacctccc 1628101 ggtaggcggc ataggccagg cgggcattgg cgacaccggc ctggccgcgc agttcgagcg 1628161 cttgccgcga gccgattctg tccagccgct tgtcgatttc ggtgtccacc cgggagacga 1628221 aaaacgatgc caccgaatgg atcttggaca ggctgtgtcc ggcttgccgg gccttttcca 1628281 tcccggtcag gtaggcgtcc atcacctcgc ggtaccgctg cacggagaag atcagcgtaa 1628341 cgttgaccga aatcccttcc gccagaacgg cactgatggc gggcagaccg gccttagtgg 1628401 ccgggatctt gatgaaaagg ttcggccggt cgacgatctt ccacagctcg attgcctgtt 1628461 ggatcgtttt ttcggtttcg tgtgccagcc gcgggtcgac ctcgatcgac acccggccgt 1628521 cgaccccgtc ggagtcctcc cactggggga ccagcacgtc gcacgcgctg cgcacgtcgt 1628581 cagtggtgac ggtgcggatg gtggcatcca cgtcggcgcc gcgcgcggcc agctcggcga 1628641 tctgggcgtc gtaggtgtgg ccctccgaca gcgccttctg aaagatcgac gggttggtgg 1628701 tcaccccgac gacgctcttg gtgtcgatca gctcctgcag attgcccgag cgcagccggt 1628761 cccgcgacag gtcatccagc cacaccgata cccccgcggc gctcaatgcg gccaggttgg 1628821 ggttctgagc ggtcatcggt aatcaccctt cctcagttat ccagcgctcg ttcggcggcg 1628881 gcggccacgg cctcggcagt gaagccgtac tcgcggaaca aggtcttgtg gtccgcggat 1628941 tcgccgtagt gctcgatcga gacgatctcg cccgtgtcgc caaccagctg gtgccagcat 1629001 tgcgcgacgc cggcttcgac ggccacccgc gccgacaccg tcgggggcag caccgcgtcg 1629061 cggtactcgt agggttgggc ctcgaaccac tccaggcacg gcatcgacac cacccgagcg 1629121 aggatgtcgt tgtccgccag caacgtctgc gccgcgaccg ccagctgcac ctccgagccg 1629181 gtggcgatga gaatgacgtc gggttcctcg cccggttgca gaccaccggc gtcactcagc 1629241 acgtaaccgc cgcgggcaac cccctcggcg tcggtgccgt ccagcaccgg cacaccctgg 1629301 cgggtcagga tcaacccgac cggcccgctg ccgttgcggc gggccaggat cgtgcgccag 1629361 gcgtaggctg tctcgttggc atctgccggg cgcaccaccg acagccgggg gatcgcgcgc 1629421 agcgccgaga ggtgctcgat cggttgatgg gtgggcccgt cttcgccgag gccgatcgag 1629481 tcgtgcgtcc agacgtagat ggtgtcgatg tccatcaacg ccgccagccg caccgccggg 1629541 cgcatgtagt cggagaactg caggaaggtg ccgccgtaag cccgggtggg tccgtgcagc 1629601 acgatgccgg acaggatggc acccatcgcg tgctcgcgaa caccgaagtg caaggtgcga 1629661 ccataccagt gcgcggtgta ctccttggtg gaaatcgagg gcgggccaaa ggagtcggcg 1629721 ccctttatcg ttgtgttgtt gctgcccgcc aggtcggccg aaccgcccca caactcgggc 1629781 agtttcggcc cgagcgcgga cagcaccgca cccgaggccg cacgggtggc cagcgccttg 1629841 gaccccggtt cccagtgggg caagtcggcg tcccagccgt cgggcaactt ctgcgcgagc 1629901 agccggtcca gcagcgcctt gcgctcgggt tcacgccgcg cccaggcatc gaattcgagc 1629961 tgccagcgtt cgtgggcctg tttgccgcgg gccaccagcc ctcgggtgtg ggtgaggacg 1630021 tcctcgcgga cctggaacgt cttgtccgga tcgaagccga cgatcttctt gactgcggcc 1630081 acctcgtcgt cgcccagcgc cgcgccgtgc gccttgccgg tgtccatcag gttcggcgcc 1630141 ggatagccga tgacggtgcg cagcgcgatg aacgagggcc ggtcggtgac cgcctgcgca 1630201 ttggcgatgg cctcctcgat gccgacgacg ttctcaccgc cctcaacctc ttgcacgtgc 1630261 cagccgtacg cgcggtagcg ggccgcggtg tcctcacaca gcgcgatgtt ggtgtcgtcc 1630321 tcgatcgaga tctggttgcg gtcgtagaac acgatgaggt tgcccagttg ctggaccgcg 1630381 gccagcgacg acgcctccga ggtcacccct tcttcgatgt caccgtcgga ggcgatgaca 1630441 tagatgtagt ggtcgaaggg gctggcgccc ggttcggcgt ccgggtcgaa caggccgcgc 1630501 tcgtagcgcg aggccatcgc catcccgacc gccgacgcca gtccctgccc cagcgggccg 1630561 gtggtgatct caacgccggg ggtgtggcgg aactccgggt gtccgggggt cttggatccc 1630621 caggtgcgca acgactcaat gtcggacagt tccaggccga agccgccgag gtagagctgg 1630681 atgtagaggg tcaggctgct gtgcccggcc gacaaaacga accgatcgcg gcccagccag 1630741 tgtgtgtcgc tgggatcgtg acgcattgtc cgctgaaaca gcgtgtaggc caacggagcc 1630801 aggctcatcg ccgttccagg atgaccgttg ccgacctttt ggacggcatc ggcggccaat 1630861 acccggatgg tgtcgacggc agccgaatcg atctcggtcc agtcgtcggg atggcgcggt 1630921 cgggtaagcg cggagatctc ttcgagtgtg gtcacaaatt cagtcctcga gtcagcaaga 1630981 tgatcagtcc tcaccctagt gcgggaatcc cggcgcttgc agtgccgcat atccgggtac 1631041 ccatccgggc cctgtgaaac gtaacccgcg cgctacccac gcttcgcatt cggtgccgat 1631101 atgccgaaaa atcaccgtca tcgaccctgc ggctctgctg ctggggctac gtcgaacacc 1631161 gtacgtcgca gaagtgtggt gcgggtcggg cggccggctt aatcgcggtg ataatcggtt 1631221 ggtcggcgat caccggcatc atcggttggc cggcgctggt gatgctgttc gccgggcctc 1631281 gcgtcggcga gccgggcaag ccggtgcgcc tgccgatccc atggcgggat gttggtgggt 1631341 accgcccgac cggaagaagc atcgcggcat gccggcgtgg cgagcctcgg ggtctacacg 1631401 aattcgccgc cgccgagccc gccgaagcca ccgccgccac cggcgccgcc ggcggtacct 1631461 gtggcgatgg accccgggct accgaggccg ccgagaccgc cgagaccaag gaggatgctg 1631521 aagccgccgc caccgccctg ccccccgtgg ccaccggtcc caccggtgcc tgttccaaag 1631581 ggccccgcgt cgccggtgcc gccggtgccc ccggagccac ccatcccgcc ccggccaccg 1631641 acgccggcaa aaccattgcc gccaaagccg cccgcacctc cgttgccacc catcccaggc 1631701 tgagagccgt tgtggccggt gccgccggtg ccgccagcgc cgccgttgcc gccagtgccg 1631761 cctctgccgc cggtgaggcc gccgttggca ccctggccgc cggtgccgcc ggtgccgccg 1631821 gtgccgccgg tgccccagtc gccgggggtg ccacctgggc cgctggaacc gccaagtcct 1631881 gcatcgcctc cgcgtcctgc atcgcctccg cggcccccgc cgccgccgtc accaggtgag 1631941 gtgacaaggt cgccactggc gccgttgcca ccgttgccgc cgttgccggg tgtcccgccg 1632001 gtcccaccgt tgccgccggc tccggtgagg ccttggccgc cgttgccgcc tctgccgccg 1632061 ttgccgcctc tgccgccgtc accgccatcg ccctcgttgg tgccgaggac gcccttggcg 1632121 ccggtgctgc cggcgccgcc agtcccgccg atgccaccgt tgccgccgtt ggcgccggtg 1632181 ccgccgttac cgccgttacc cccgtggccg ccggggccgc cgtttccgcc gctggcagcg 1632241 ccgtggccgc cgtggccgcc gttgccgccg tcgtgcagga tgctgccggc cggccccgcc 1632301 ttgcctgcgg tggagccggt gccgccgggg ccgccggcac cggcgttgcc ggcgttgccg 1632361 ccgtcgccgc ctcgcccgcc gccgccgccg gcgaaggccc ctgctccctg gccgttgccg 1632421 ccgttggccc cgtcaccggg agcaccgccg tcgccgccgg ccccaccggc accgcccgcg 1632481 ccgtcgctga ctacgccttg accgccgttg ccgccggccc cgccgttgcc gccggcgccg 1632541 ccgtgcccgc cggcaccgcc gggttgtccg ggcgcaccca cggccacgcc gttggcaccg 1632601 gcggcgccgt tgccgccgaa tccgccgagg ccgccgttgc cgccggcgcc accgttaccg 1632661 ccgttcaggc cggccccgcc ggccccgccg gcgccaccgt tgccgccggg gttaccgttt 1632721 ggcccgtttt caccagggtt ggtggcgttg gcactcatgc caccaaacgc gccgtcgccg 1632781 ccgcggccgc cgttgccgcc cgtgccggcg ctgccgccgt tgccgccatt gccgccgtcg 1632841 ccgccgttgc cgccgaccac ttgggagtgg ccgccgttgc cgccgtcgcc gccgtcgccg 1632901 ccgctggttg gagtgaagcc gtgggcgccc ttggcgcctg gggtagagcc ggcgccaccg 1632961 ctaccgccct gcccgccggc gccggggtta ccgccgttac cgccgtgacc gccgttacca 1633021 tcgccgaagg cgaagttgcc gttggcgccg ttgccgccgt caccggcgag cccgccggcc 1633081 ccccctttgc cgccggaccc gccgacaccc tggattccgt tctggccaaa gaggttcccc 1633141 gccaaaccgc cgggcccgcc ttggccgccg ttaccgcctt gcgcgccggg cccgccgtgg 1633201 ccgccgtcgc cgcccttggc gcccggcgtg gtggcgttgg cgccgttggc gccgttgccg 1633261 ccggccccac cgtcggcgcc gttgccgccg gccccaccgg tcccgccgtc gcccccgaag 1633321 tctccgcccc ggccgccggc cccgcccgcc ccgccagccc cgccgttctg gccgctcgtg 1633381 ccggattcgc ccgcggtggt gggcgaggaa ccggcgacac cggccatgcc gtccccgcct 1633441 ttgccgccgg ccccgccatt accaacaagc ccgccgttgc cgcccttgcc gccggccccg 1633501 ccggccccgc cggcgacggt ggcgttcgcg ccgttgccgc cggtgccgcc gttgccgccg 1633561 ctggtcgggg tggcgccgcg ggcaccgtct gcacccgcgg tggatccggc gccgccgatc 1633621 ccaccagcac caccgatgcc gcggctaccg ccgttgccgc cgttgccacc aactccatcg 1633681 ccgccgttat cgaacgtgcc cttggcaccg ttgccgccat caccgcccat gccgccggcg 1633741 ccgccgtttc cgccggcccc gccggcaccc atgctgccgt cctggtgggt ggctgcaagc 1633801 gccttaccgc cttgcccacc ggctccaccg ccaccgccgg ctccaccgtt gccgcccttg 1633861 ccgccgtcgg tgccatccgc gcctgccccc aggccgttaa ggccggtggc gccggtggcg 1633921 ccgttgccgc cgttgccgcc cttaccgccg gcgccgccag caccgccgtc gcctgcttgg 1633981 gctccgccgt cgccgccctt accgccagcg ccgccagctc cgccgccacc gccgttaggg 1634041 tcgccgccag aaggcggggc accgggggcg ccgttgccgc cggcacctcc ggcgccgccg 1634101 ttgccgacca gcccgccggc cccgccggcc ccgccgttac cgccggcttt gccgcccgat 1634161 gagaagtggg cgccgttgcc gccggccccg ccgttgccgc cgctggtggg gctggccccg 1634221 gccgcgccgt gggcaccgat cgtggagccg gctccgccgg tgcctccggc cccgccggcg 1634281 ccggggtcac cgccatggcc gccggccccg ccggcacctg cgttgacggc ctggttgccg 1634341 ttggcgccgg ctccgcggtc accgccgacg ccaccagcgc cgccgtcccc gccgtcaccg 1634401 ccggcgcctt ggccgcccag caggctgatc aggccgccgg ccccgccggg gccgccagcc 1634461 ccgccagccc cgcccatccc gccgttacca ccatcaccgc cgttatcccc agcgacaatc 1634521 aaggcacgag aaaatccggc cccgccggcc ccgccggtcc cgccgatacc gccgtccccg 1634581 ccggccccgc cggcgccggc cagccagccg ccccgccctc cggtgccgcc atcgccggcg 1634641 tcgccgccgg ccccgccgtt gctaccgtct gggaagatac cgcccttacc ggcggcgccg 1634701 gcgatacccg cagcgccgtg tccgccggca ccaccgtgcc cgcccacgcc caacagcccg 1634761 gccgcacccc cgacaccgcc gtgtccaccc acaccaccga tcgggccggg cccgccggca 1634821 cctccgtgcc cgccggcccc gtagagggtc ccgcccaggc caccggcacc accggtaccg 1634881 ccgaccccgc cgggcccgcc gggcccgccg ggcccgccgg gcccgccggg cccgccgggc 1634941 ccgccggttc cgccgacccc gaacagtccg gcgttgccgc cggccccgcc ggttgccccg 1635001 cccagcaggc tctgcccgcc ggccccgccg actccaccat tgcccagcag ccagccgccg 1635061 ctacccccgg ccccaccggc ggcgccggcc ccaccggccc caccggcccc gccggtgccg 1635121 aacaacccgg cggccccgcc ggccccgccg acttggccgg gcgcgcccga gccgccggcc 1635181 ccaccgttgc cccacaagat cccgccggcc ccgccggcct gcccggtgcc gggtgctcca 1635241 gccgccccat caccgatcaa cgggcgaccc agcaacgcct gggtgggcgc attgagggca 1635301 ttgagcacgt tgtgctccag cgtcgccaac ggtgcggcgt tggtcgcctc cgcgctgaca 1635361 tacgagccga ccgcggcgct taacgtctgc gcaaatcggt catgaaacgc tgccacctgc 1635421 gtgctgatcg cctgatactc ccgagcatgg ctgccaaaca gcgtcgcgat cgccgccgac 1635481 acctcatcgg cgcccgcggc cagcacgctg gtggttgacc ccgccgccgc gctgttggct 1635541 acaccgatcg atgacccgat gcgcgccaca tctaaggctg cggccgccac cgtctccggg 1635601 gccacgatca ccaacgacat cacagtccac ccgccacgcc cctgcccctt cggcaggtca 1635661 cactcctgcc agataagggt cgcgccgcca ccttgtccga ttccaggtca aaatccccat 1635721 aaccagcacg aatctgctgt gcacagtgca cattcgccct actatcggct cgtggcattg 1635781 cggctagcaa cggttggtct tcgggcccaa tccttagggc gtcacactga tcaatcccag 1635841 atagcgattt tcatcgggct ggtgtgaaaa ttgtcctgac cgcggttcgg gctggcgagc 1635901 ggtgccgata tgccggcgaa gtcgtgtgaa tcgaccctgc ggctctgctg ccacagttac 1635961 ccggtctacc atcgtgcgta gtagaagctg cgcgcggctg cgattcccga ggagttagtg 1636021 cgtgaacgtt cgcgggcgcg tcgcgccgcg ccgagtgact ggtagggcaa tgagcaccct 1636081 gctggcctac ctggcgttaa ccaagccgcg agtcatcgag ctgctgttgg tcaccgcgat 1636141 accggcgatg ctgctggccg accgcggcgc cattcatccg ctgctcatgc tcaacacgct 1636201 cgtcggcggg atgatggccg ccaccggcgc caacacgctc aactgcgtcg ccgacgccga 1636261 tatcgacaag gtgatgaagc gaaccgcgcg ccggcccttg gcgcgggaag cggtgccgac 1636321 ccgaaacgcg ttggcactcg ggttgacgtt gacggtgatc tcgttcttct ggctatggtg 1636381 cgccacgaac ctgctggcgg gggtgctggc cctggtcacc gtcgcgtttt atgtgttcgt 1636441 ctacacgctt tggctcaagc gacgcacgtc acagaacgtg gtgtggggtg gggcggccgg 1636501 ctgtatgccg gtgatgatcg gctggtcggc catcaccggc accatagcct ggccggcgct 1636561 ggcgatgttc gcgatcatct tcttctggac gccgccacac acctgggcat tggcgatgcg 1636621 ctacaagcag gactaccaag tggccggggt gccgatgctg ccggcggtgg cgaccgagcg 1636681 tcaggtcacc aagcagatct tgatctacac ctggctgacc gtggccgcga cgctggtgct 1636741 ggcgttggcg accagttggc tttacggcgc ggtggccctg gtggccggtg ggtggttcct 1636801 gacgatggcc caccagttgt atgccggggt gcgcgccggc gagccggtca ggccgctgcg 1636861 gctgtttctg cagtcgaaca actatctggc ggtggtgttc tgcgcactgg ccgtcgactc 1636921 ggtgatcgcg ctgcccacgc tgcactgatt gggggcccag ttccgctgcg gtgccggccc 1636981 tgctcggcca acgtagtcag atggttggat cgccaccggc gccaccggcg ccgcccgcgc 1637041 caccagcacc gccgctgcca tctgggtccg tcgagtcgcc gaggacgccg gcgccgccat 1637101 tgtcgccaaa taccgtgaga cctagcaggg tgccggcgcc gcccttgccg ccggccccgc 1637161 cggcgccgcc caatccaccg aagcccctcc cttcggtggg gtcgctgccg ccgtcgccgc 1637221 cgtcaccgcc cttgccgccg gccccgccgt cgccgccggc tccggcggtg ccgtcgccgc 1637281 cctggccgcc ggccccgccg tttccgccgc cgccgccatc gccgatgatg ttttccccgc 1637341 ccttgccgcc agccccagcg ttcccgccgg ctccgccact ggcgccggtg ccgccgggtg 1637401 caacggcgtt ggcgccgtta ccgccgttgc cgcctttgcc cccggtgtct gcaaagtcgg 1637461 gggtcgcacc ctgcgcggcg cgggtcacgc cgtcaccgct gagccccccg agcccgccag 1637521 cgccgctgaa gccaggattg ccgccgttgc cgccatggcc gccgttggca ccgggtgcga 1637581 cggcgttgcc gccggtcccg ccgaccccac cgttgccgcc tttaccaccg tcctggccac 1637641 gctcgcccgc ggtggtggca ttggcaccct cggcaccact accaccgagc ccgccgtctg 1637701 cgccgcggcc gccagtccca ccggccccgc cattgccggc gagagttccg ccgtcgccgc 1637761 cggcgccgcc ctggccgccg ttgccgccgc tattgccttt gccaccgact gcgcccgaat 1637821 cgctcgcgtt cgtccctgcg gcgccgttgg cgccgttgcc gccggccccg ccggcgccgc 1637881 cgttgccgac cagcccgcca tggccgccgg ccccgccggc cccgccgtta ccgccggctt 1637941 tgccgcccga tgagaagtgg gcgccgttgc cgccggcccc gccgttgccg ccgctggtgg 1638001 ggctggcccc ggccgcgccg tgggcaccga tcgtggagcc ggctccgccg gtgcctccgg 1638061 ccccgccggc gccggggtca ccgccatggc cgccggcccc gccggcacct gcgttgacgg 1638121 cctggttgcc gttggcgccg gctccgcggt caccgccgac gccaccagcg ccgccgtccc 1638181 cgccgtcacc gccggcgcct tggccgccca gcaggctgat caggccgccg gccccgccgg 1638241 ggccgccagc cccgccagcc ccgcccatcc cgccgttacc accatcaccg ccgttatccc 1638301 cagcgacaat caaggcacga gaaaatccgg ccccgccggc cccgccggtc ccgccgatac 1638361 cgccgtcccc gccggccccg ccggcgccgg ccagccagcc gccccgccct ccggtgccgc 1638421 catcgccggc gtcgccgccg gccccgccgt tgctaccgtc tgggaagata ccgcccttac 1638481 cggcggcgcc ggcgataccc gcagcgccgt gtccgccggc accaccgtgc ccgcccacgc 1638541 ccaacagccc ggccgcaccc ccgacaccgc cgtgtccacc cacaccaccg atcgggccgg 1638601 gcccgccggc acctccgtgc ccgccggccc cgtagagggt cccgcccagg ccaccggcac 1638661 caccggtacc gccgaccccg ccgggcccgc cgggcccgcc gggcccgccg gttccgccga 1638721 ccccgaacag tccggcgttg ccgccggccc cgccggttgc cccgcccagc aggctctgcc 1638781 cgccggcccc gccgactcca ccattgccca gcagccagcc gccgctaccc ccggccccac 1638841 cggcggcgcc ggccccaccg gccccaccgg ccccgccggt gccgaacaac ccggcggccc 1638901 cgccggcccc gccgacttgg ccgggcgcgc ccgagccgcc ggccccaccg ttgccccaca 1638961 agatcccgcc ggccccgccg gcctgcccgg tgccgggtgc tccagccgcc ccatcaccga 1639021 tcaacgggcg acccagcaac gcctgggtgg gcgcattgag ggcattgagc acgttgtgct 1639081 ccagcgtcgc caacggtgcg gcgttggtcg cctccgcgct gacatacgag ccgaccgcgg 1639141 cgcttaacgt ctgcgcaaat cggtcatgaa acgctgccac ctgcgtgctg atcgcctgat 1639201 actcccgagc atggctgcca aacagcgtcg cgatcgccgc cgacacctca tcggcgcccg 1639261 cggccagcac gctggtggtt gaccccgccg ccgcgctgtt ggctacaccg atcgatgacc 1639321 cgatgcgcgc cacatctaag gctgcggccg ccaccgtctc cggggccacg atcaccaacg 1639381 acatcacagt ccacccgcca cgcccctgcc ccttcggcag gtcacactcc tgccagataa 1639441 gggtcgcgcc gccaccttgt ccgattccag gtcaaaatcc ccataaccag cacgaatctg 1639501 ctgtgcacag tgcacattcg ccctactatc ggctcgtggc attgcgggaa acctcaccgc 1639561 gaatacatga gctgatccgc gaggcagcgc gaatcgccct caacccgacc caggaatggc 1639621 tcgacgaatt cgaccgtgcc attctggccg ccaacccatc catcgctgcc gaccccgccc 1639681 tggccaccgt tgtcaagcgt tccaatcggg cgcatctcat ccatttcgcg gccgccaacc 1639741 tgcgcaatcc cggcgccccg gtgcccgcga accttggtcc cgagccgctg cgcatggccc 1639801 gtgatctcgt gcgcgtcggt ttagatgcct tggccctcga catctaccgc atcggacaaa 1639861 acgtggcctg gcggcgctgg acggacatcg cgttcggact gacctccgac cccgacgagt 1639921 tgcacgaatt actggatgtg ccatttcgga cagccaacga gttcgtcgac accacccttg 1639981 cgggcatcac caccgagatg caattggaac gcgacaagct cacccgcgac gttcctgccg 1640041 aacgccgcaa aatcgtccag ctgctcatcg acggtgcccc catcagccgt gagcacgccg 1640101 aagcgcgatt gggctaccct ctcgaccgat cccacaccgc cgccgtcatc tggggtgacc 1640161 aggcccaggg cgaccacagc cacctggacc gagtcgccga cgcgttcggc catgccggcg 1640221 gatgcccgca cccgctggtc gtggtagccg gcgccgcgac tcgctgggtg tgggtaaaag 1640281 acgcccccgg gtttgacatc gacctgattc acgaggtgct ccatgacata cccgacgcgc 1640341 gtatcgccat cggggccacc gcgccgggaa tcgaggggtt ccggcgcagc caccgagacg 1640401 cactcaccac cgctcggatg attatccggc tggaatcacc gcaccgagtc gcctttttca 1640461 ccgacgtcga gatggtcgcg ttgctcaccg aaaacgccga gggtgccgac gacttcatcc 1640521 aacgcaccct cggaaacctc gagtcggcca gcccggctct gaaaacgacg ctgttgacct 1640581 tcatcaacca gcagtgcaac gcttctcggg ccgcgagact tctcttcacc caccgcaaca 1640641 ccttgatgaa ccgactcgag accgcgcaac gacttctgcc ccgccctctc gccgacacca 1640701 ccattcacgt cgccgtcgca ctcgaagccc agcagtggcg ggagaagcaa accagcgatc 1640761 ctccggcaaa gaaagagtcg aatggcacca agatgcgcta gcaagacagc gcagcacaga 1640821 ccgctacgct acggcagcag cacgaccgag ccgaccgtct tgcgagcctc caggtcctga 1640881 tgggcgcgca aggcgtcggc cagcgggtaa cgtccgccga ccgccacggt gatcgcttcg 1640941 ctgccgatcg cgtcgaacag ctcagcggcc cgccagctga actcctcgcc ggtgcgggtg 1641001 aagtggaaca gcgagggacg ggtgaggtac accgatccgg cggcattgag gcgctgcgga 1641061 tcgaccggtg gaaccggacc gctggcggcg ccgaacagtg ctaatgtccc gcggacagcc 1641121 aggctggcta ggctggcgtc gaaggtggtg gcgccgacac cgtcgtaaac ggcttgcaca 1641181 ccggtgccgc cggtcagttc gcgaacccgc ccggcgaact gccaggcatc ctccgggtag 1641241 tcgagaacca cgtccgcgcc ggcatccttg gacagcttgg ccttctccgc cgtcgaaacg 1641301 gtggtgatca cccgcacccc caggtgagtg gcccattgtg tcaggatcaa gccgacgccg 1641361 ccggcgccag catgcaccaa gacggtgtca ccacgcttca ccgggtacac cgacttcagt 1641421 aggtaatgcg ccgtcaggcc cttcagcagc gccgaagccg ctacctcaga cgtgacgtcg 1641481 tcggggacct tggcggtcag agatgctggc gctgtgcaga attcggcgta ggcgccgttg 1641541 gctgaggcgc tgaccacgcg gtcgccgacg ctgatggcgg tgtcggctgc ggtaactcct 1641601 gggccgacgg cctccaccgt gccgcatacc tcggagccga tgacgaacgg gagttcgcgc 1641661 ggatattgcc cggagcggaa gtaggtgtcg atgaagttga caccgatggc ctcggccttg 1641721 atcaggagct cgccgtggcc gggttgaggt tgcggctggt cgacgtggcg taagacgcct 1641781 ggcccgccgg tttcggtgac ttcgattgcg tgcatgtggc tatcatgccc gggcatgaag 1641841 cttgcccggc cggacgtctt ccatccgcgc gtcgttttgg cgggttggcc acagcagccc 1641901 gccggtgacg gcgacgatgc tgggctggtt gcggccctgc gccaccgcgg cttgcatgct 1641961 ggttggctgt cttgggacga tcccgaaata gtccacgcgg atctggtgat tttgcgggct 1642021 acccgcgatt accccgcgcg gctcgacgag tttttggcct ggactacccg cgtggccaat 1642081 ctgctgaact cgcggccggt ggtggcctgg aatgtcgagc gccgttacct acgtgacctg 1642141 atggatcggg gggtgccgac cgtgcccggc gaggtgtatg tgccgggaga gccggtccgg 1642201 ttgccacgca aaggccaggt cttcgtcggt ccgaccatcg gtaccgggac acggcgctgt 1642261 agtgcccggt tcgctgccga gttcgtcgcg caactgcacg cggccggcca ggcggtgctc 1642321 gttcagcccg gaggttccgg tgacgagacc gtgttggtct tccttggcgg tgagccgtcg 1642381 catgcgttta ccaagcaggc cgacacttgg cgccagaccg agcccgactt cgaaatctgg 1642441 gacgtgggtg cggccgccgt ggccggcgcg gccgcgcagg tgggtgttga cccaggtgag 1642501 ctgctctacg cgcgggccca catcacaggt ggaagccgag atccccggtt gctggaattg 1642561 caattggtgg acccgtcgct gggctggcag tggctggacc cagacatccg caatcttgcc 1642621 cagcgtgact tcgcgctatg cgtccagtca gcgttggagc ggctggggct gggcccgttc 1642681 tcccatcgac gcccatagcg cggcggtggc cgccgtaacc gccgcggcac cggccacgtg 1642741 aatggcgacc agggcggcgg gtaccccggt gaagtattgc gtggtaccga cggcggcttg 1642801 cgtggcaacc agggcgagca gcacggcgag tcgcaccaga atcgcccggg tggcacccac 1642861 ggccagcagc ccgaaaccca acccgatcag cagcgcaagg taggcaacca acagcgacga 1642921 atgcatatgc accaaggtgg tgatttcgac tttcagccgc ggcacggtcc ggctggggct 1642981 gcgatctccc gcgtgcgggc ctgccgccgt gactagcgtg cccgtcacca gcaccgcggc 1643041 caggttcagc gcgctgagcg ccgtgagcgc acgcaacggg ctgaccacca gttcgtggac 1643101 gactccgtca tcgggctggc cgatcttgac gtagagcagc accgccagcc acaccatcgt 1643161 catcgacgcc agcaggtgga tggccaccgt ccaccacagc agcccggtgc gtacggtgat 1643221 gccaccgatc atcgcctgca ccaccgtcga caccggcatc agccacgcgt aggccaggac 1643281 ttccgtgcgc cggcgcgccc gggtgacgac cagcacggcc agtgccgcgg ctatcaccac 1643341 cgcaaacgtg accatccggt tgccgaactc gaccgcctga tggacccgcg gcacctcggc 1643401 gaccaccacc ggggtgaagc tacccggaaa acactgcggc caggtcggac accccaggcc 1643461 tgaggcggta acccggacga ttgccccggt gacggcgatg ccgccctggg tgaggatgac 1643521 gattgcggcg atgacccgct ggacacgcag gctgggagac accgcccgat cgtaaggcac 1643581 caaaaactac acgctgtagt acgggcggac cggtgtcgaa actgcaacca cgcaccgatg 1643641 cgtcggcgtg tcttgtgcgt ggttgcagtg tcgcgaagcc gggcggccgg ttcaggtgaa 1643701 ccggaaccag cgcagtgcgg ccagtgcggc cagcgcgccc cacaccgcta ggacgacgat 1643761 cccgaaccag tccaccgaca cggtcatggc ctgcgacagc gcctcggtga gcgcgcccga 1643821 cggggtaacc cgagccaccc atttgaacgc cgtcgggatc acgttcgact ccaaggtcag 1643881 cgcaccgaaa ccggcgaata cgaaccacat caggttggcg acggcgagaa cgatctcggc 1643941 tcgcaaggtg ccgccgagta gcaggccgag cgccgcaaag cccgcggtac ccagcgcgat 1644001 gatcccggcg cccaatgtca gggccgtcag cgccggccgc cagccgagcg caaagccgat 1644061 ggcgcccaag atgatggcct gcaagaacac cacggcaacc actgccagcg acttgccggc 1644121 gatgatcccc caaaccggca gcggggtagc accgagtcgt ttgagggcgc cgtagcggcg 1644181 atcgaacgcg accgcgatgg cttgcccggt gaatgcggtg gagatcaccg caagcgccat 1644241 gatgaccgga acaaaggtgg cggcgcggtt gtggccgaac gagcccatcg gcagcaaagt 1644301 cagcccgacc agcagggtga tcgggatgaa catggtcaac agcagttgct cgccgttgcg 1644361 taacagcagc ttcaattcca ggctgaactg tgcggcaagc atcaggggga cggcgttggg 1644421 gcgggggtcc gggctgaagg tgcccgcggg aaaagcgggg cgattggttt gggtcactgc 1644481 cgcaacttcc tgccggtgag atccaggaac acgtcttcga ggctgcgttg ctcgacccgc 1644541 atgtcggtgg ctagcacgtc gatttgtgcg caccacgcgg tgaccgtcgc cagcacctgc 1644601 gggtcaaccg gaccttcgac caggtactcg cccggggtca gctcggtggc ctggtagccc 1644661 tcgggcagtg ccgaggccag cagcgacagg tcgagccgcg gcggcgcggt gaaccgcaac 1644721 tggtctttgg cgccgctgcg catcagttct gccggtgtgc ctgcggccac cgtcaccccg 1644781 tggtcgatga tcaccaaccg atcggcgagt tcctcggcct ccttgagatg atgcgtggtc 1644841 agcaccacgg tcacgccatc gcggcgcagc gcgtcgatca actcccacac cagtacccgg 1644901 gcatgggcat ccatgcccgc ggtgggctcg tcgaggaaca ccagttgggg acgcccgacc 1644961 agcgcgcagg ccagcgcgag tcgttgctgc tgcccgccgg agagccgtcg ataggtggtg 1645021 cgggcggcct cggtgagacc caaggtgtcc agtagccagt gcgggtccag cgggttggcg 1645081 gcgtaggacg cgaccagatc cagcatttcg ccggcgcgtg ccgccgggta gccgccgcca 1645141 ccctgcaaca tcacgccgat gcgtgcgcgc aggcgtgcgt tgtcggtgat cgggtccagt 1645201 ccaagtacct caatgctgcc ggcgtccggg cggacgaagc cctcgcacat ctcgacggtc 1645261 gtggtcttgc ccgcgccgtt ggggcccagc agcgccatca cttcggcgtc atgcacgtcg 1645321 agatcgaggt tggaaacggc ggttattgac ccgtatcgct tacatacccc gcgaagccgc 1645381 agtaccacct cgggggtgtc tggggcgcgg ttcacgagcg ccgctcctcc tcatcgcttc 1645441 gctctgcatc gtcgtcggcg cggttcacga gcgccgctcc tcctcatcgc ttcgctctgc 1645501 atcgtcgtcg gcgcggctca cgtggaatca gcgtaggcgt cgggcgctgc cgtcggccgg 1645561 cgggtcgcag gggtcttgct ggccgactcc gcggcggtga ccacttgctc ggctgcaagt 1645621 ggccgccatg gtaaccgggt gtaggtcagg gcaatcagga ggatcacgat gatggcgctg 1645681 gccgcggtcg catccacgat ctgaaacagc gcgaagcggt caccattggc ggtcggaccg 1645741 aaaattccga cgatgagggt ggccaggatt gccgctaccc gaaatcccgg gcgggttgcc 1645801 caggcggcca gcggaattat cgcccacagc aggtaccagg gctgcacgac gggaaacagc 1645861 agcacggtga cagctagcgc aacgcccagg ccgccgatcg ggtgcagccg gccgcggagc 1645921 acggccaata acagccagca caccatcacc gtgatgatca gcacgccgat ggcgcgggtg 1645981 agtgacaaca cggcggtggt gtgatcaccc aggcccagca ggatgccgac gtgcccggtg 1646041 cccagggcca gcagtgtcgg cggcgacatc cagctgcgca ccacattggc ggtgcccagc 1646101 gtgttgatcc agccgaatcc gagaccgctg gcccaaccca ggatggccat tatcgccagc 1646161 gttagactcg ccatcacagc ggcggcgagc agcagtgctc gcaagttgcc accccagcgg 1646221 tatgccagca ctgtcgtgac gaagcccatc gccagcagcg agggtagctt cacttgcgac 1646281 gacagcgtga tcaggatgga acccgccagc agcatggcca ggggccccca ttccggacgg 1646341 ggtttgactg cccggctcgc gcccgcacgc ggggatgccc ccagctcggg ccgccggctg 1646401 gcccgtattg tggcggggcc caaccgccag gtttcgggcg acgggcgtgg ggtattcgcc 1646461 atatcaaggc cgcgcagcgc gaattcgacg ccggtcagca tcagcccgag catcagcgct 1646521 tcgttgtgga tgccggcgac caaatgcatg atcagcagcg gattggccgc gcctagccac 1646581 agcgcgctga cctcggcgac gccacagcgc tgagctagcc gaggggtcgc ccacacgatc 1646641 agggtcacac cgatcaacac cacaagccgg tggcagagca cggcagcgac gatgttttcc 1646701 ccagtcagcg acgagattcc gcggccgatc cacaagaaca gcggaccata tggcgccggt 1646761 gtctcccgcc acaggctggg caccgacagg gtgaacacgt ggccgaggcc caagccggac 1646821 gccggaccca cccggtaagg gtcgagtccg tccctgccga tctcactttg ggctagatat 1646881 gagtagacat ccttgctgta catcggtggt gcgatcaata gcggcagcat ccagagcagc 1646941 agggtgcggt ccagttcgcc gcgcgacatc cgccgcctgc ccagcgtgaa ccggccgagc 1647001 atcagccagg ccagcgccat catgaccgcc ccggtcgtgg tcatggtcaa cgacaccgtt 1647061 tggattcgtg acggcagatt gagcagccgg accccgaagg tggggtcctg gacgacgggt 1647121 cgggccccgg cgcccagggc gccgatggcc atcaggacgg tgccggtggc cccaaacagg 1647181 cgggtgcgcg ccagcgcggt gagctcggta gtggtcagcg gtgcacccac cgcctgctcg 1647241 tcgccatgca ggctggcgat cgaccagctc agcgtatggt ggcgggctgc cattggtgca 1647301 gcctaacggc atgcccggga attgcttagg cgatctcaat gtgaccagca caaccctgcc 1647361 gcatagggca tccctggtag accgatcaac ggaattttgt cacactgatg ttgtgaaaat 1647421 cccggcggtc tctaccactg tccccgcggc agtctcggac ggtcacactc gtcgggccat 1647481 tgtgcgcttg ctgctggaat ccggatcgat caccgccggc gagatcggtg accggctggg 1647541 cctgtcggcc gccggtgtgc ggcgtcatct ggacgcgctg atcgaggcgg gtgacgcgga 1647601 agcgtcggcg gccgcgccgt ggcagcaggt gggacgcggg cggcccgcca agcgctaccg 1647661 gctgaccgcg gccggccggg ccaagctcga ccactcctat gacgacctgg cgtcggcggc 1647721 catgcggcag ctgcgggaga tcggcggcga ggaggcggtg cggacgtttg cccggcgccg 1647781 tatcgacgcc atcctggccg acgtcgcgcc ggccgacggt cccgacgacg ccgcgctcga 1647841 ggcggccgcc gagcggatcg caacggcgct cagcaaagcc ggctacgtcg ccaccaccac 1647901 gcgggtgggc gggccgattc acggtgtgca aatctgccag caccattgcc cggtatccca 1647961 tgtcgccgag gaattccccg aattgtgcga aaccgagcag caggccatgg ccgaggtgct 1648021 cggcacccac gtccagcggt tggcgaccat cgtcaacgga gactgcgcct gcaccaccca 1648081 cgtacccctg tcgccggcgc ccagcccgcg cccacccgcc accagcaccg aaggagtgtc 1648141 ccgatgacac tcaccccaga ggccagcaag agcgttgccc agcccccgac ccaggctccc 1648201 ctgacccagg aagaggcgat cgcgtcgctg ggccggtacg gctacggctg ggcggactcc 1648261 gacgtcgcgg gcgccaacgc gcagcgcggg ctttccgagg cggtggtccg cgacatctcc 1648321 gcgaagaaga acgagcccga ttggatgctg cagtcgcggc tgaaggcgct gcgcattttc 1648381 gaccgcaagc ccattccgaa gtggggctcc aacctcgatg gcatcgattt cgacaacatc 1648441 aagtacttcg tgcgctccac cgagaagcag gccgcgagct gggatgattt gccagaggac 1648501 atccgcaaca cctacgaccg gttgggaatc ccggaggccg agaagcagag attagtagct 1648561 ggagtagccg cacaatacga aagtgaagtt gtatatcacc agatcagaga ggatctggag 1648621 gctcaaggag tcatattttt agacactgat actggtttgc gagaacaccc ggatattttc 1648681 aaggaatatt tcggtacagt aatccctgcc ggcgataata agttttctgc attgaatact 1648741 gcagtttgga gtggtgggtc ctttatttac gtcccgcccg gtgttcacgt cgacattccg 1648801 ctgcaggcct acttccgaat caacaccgag aacatgggcc agttcgagcg gacgctgatc 1648861 atcgccgatg agggctctta cgtgcactac gtagagggct gcctgcccgc cggcgagctc 1648921 atcacgaccg ccgacggcga tttgcggccc atcgagtcga ttcgcgtcgg tgacttcgtc 1648981 accggccacg acgggcggcc acaccgcgtc accgctgtac aggtgcgtga cctcgatggc 1649041 gagctgttca ccttcacacc gatgtcgcct gccaacgcat tctctgtcac cgccgagcac 1649101 ccccttctcg ctattccccg cgacgaggtg cgtgttatgc ggaaggaacg caatgggtgg 1649161 aaggctgaag tcaacagcac caagctgcgt agcgccgagc cgcgatggat cgcggcgaag 1649221 gatgtggccg agggtgactt cctgatctac cccaagccga agccgatccc ccacaggacg 1649281 gttttgccgc tcgagtttgc gcgcctggcg ggctactacc tggcggaggg tcacgcgtgt 1649341 ctcaccaatg gctgtgagtc gctgatcttc tcgttccaca gcgatgagtt cgagtacgtc 1649401 gaggatgtgc gccaagcgtg caagtcgctg tacgagaagt cgggatcggt attgatcgag 1649461 gagcacaagc attcggcgcg cgtcaccgtg tacacgaagg cgggctatgc ggcgatgcgc 1649521 gacaacgtcg gcattggatc gtcgaataag aagctgtcgg atctgttgat gcgtcaagac 1649581 gagacgttct tgcgtgagct ggtcgacgcc tatgtgaatg gagacggcaa cgtcacgcgc 1649641 cgtaacgggg cggtgtggaa gcgggtacat acgacatcgc gcctctgggc gttccagttg 1649701 cagtccatcc tggcgcgtct gggtcactac gccactgttg aactgcgccg accgggcggc 1649761 cctggtgtga tcatgggccg caacgtcgtt cgcaaggaca tctaccaggt gcagtggacc 1649821 gagggcggcc gcggaccgaa gcaggcccgc gactgcggcg actactttgc ggtgccaatc 1649881 aagaagcgag cggtccgcga agcacatgag cccgtctaca acctcgatgt cgagaatccg 1649941 gacagctacc tcgcctacgg gttcgccgtg cacaactgca ccgcaccgat ctacaaatcg 1650001 gattcattgc actcagcggt ggtcgagatc atcgtgaaac cccatgcgcg cgtgcgttac 1650061 accaccatcc agaactggtc gaacaacgtc tacaacctgg tcaccaagcg ggcccgcgcc 1650121 gaagccgggg ccaccatgga gtggatcgac ggcaacatcg ggtccaaggt gaccatgaag 1650181 tacccggcgg tctggatgac cggcgagcac gccaagggcg aagtgctctc ggtggcgttc 1650241 gccggcgaag accagcacca ggacaccggc gccaagatgc tgcacctggc gccgaacacg 1650301 tcgagcaaca tcgtgtccaa gtcggtggcc cgcggcggcg gccgcacctc ctaccgtggc 1650361 ctggtgcagg tcaacaaggg ggcgcatggg tcgcggtcca gcgtgaaatg cgatgcgctg 1650421 ctggtggata cggtcagccg cagcgacacc tacccctacg tcgacatccg cgaggacgac 1650481 gtcaccatgg gccacgaggc caccgtgtcc aaggtcagcg agaaccagct gttctacctg 1650541 atgagccgcg ggctgaccga ggacgaggcg atggcgatgg tggtgcgcgg cttcgtcgag 1650601 ccgatcgcca aggagctgcc gatggagtac gcgctggagc tcaaccggct gatcgagctg 1650661 cagatggagg gcgcggtcgg atgacggctc cgggactgac agcagccgtc gaggggatcg 1650721 cacacaacaa gggcgagctg ttcgcctcct ttgacgtgga cgcgttcgag gttccgcacg 1650781 gccgcgacga gatctggcgg ttcaccccgt tgcggcggct gcgtggcctg cacgacggct 1650841 ccgcgcgggc caccggtagc gccacgatca cggtcagcga gcggccgggc gtatacaccc 1650901 agaccgtgcg ccgcggcgat ccacgactgg gcgagggcgg cgtacccacc gaccgcgttg 1650961 ccgcccaagc gttttcgtcg ttcaactccg cgactctggt caccgtcgag cgcgacaccc 1651021 aggtcgtcga gccggtaggc atcaccgtga ccgggccggg ggagggcgcg gtggcctatg 1651081 ggcacctgca ggtgcgtatc gaggagcttg gcgaggcggt cgtggtcatc gaccaccggg 1651141 gcggcggaac ctacgccgac aacgtcgagt tcgttgtcga cgacgccgct cggctgaccg 1651201 ccgtgtggat cgccgactgg gccgacgaca ccgttcacct cagcgcgcac catgctcgga 1651261 tcggcaagga cgcggtgctg cgccacgtca ccgtcatgtt gggcggcgac gtggtgcgaa 1651321 tgtcggcggg cgtgcggttc tgcggtgcgg gtggggacgc ggaactgctg gggctgtatt 1651381 tcgccgacga cggccagcac ctggagtcgc ggctgctggt ggaccacgcc caccccgact 1651441 gcaagtcgaa cgtgctgtat aagggtgcac tgcaaggtga tccggcgtcg tcgttgcccg 1651501 acgcacacac ggtctgggtg ggtgacgtgc tgatccgtgc gcaggccacc ggcaccgaca 1651561 ccttcgaggt gaaccggaac ctggtgctca ccgacggcgc gcgtgccgac tcggtgccca 1651621 acctggagat cgagaccggc gagatcgtcg gcgccggaca cgccagcgcc accggtcgct 1651681 tcgacgatga gcaattgttc tacctgcgtt cgcgcggtat tcccgaagca caggcccgcc 1651741 ggctggtggt ccgcggcttc ttcggtgaga tcatcgccaa gatcgcggtg cccgaggtac 1651801 gcgagcgcct gaccgcagcc atcgaacacg agctggaaat cacggaatca acggaaaaga 1651861 caacagtctc atgaccattt tggaaattaa ggacctgcac gtcagcgtgg agaaccccgc 1651921 ggaggcggac cacgagatcc cgatcctgcg cggcgtcgac ctcaccgtga aatccggtga 1651981 gacacatgcc ttgatgggac ccaacggctc gggcaagtcg acgctgtcct acgccatcgc 1652041 gggccatccc aaataccacg tgacgtcggg caccattacc ctcgacggcg cggacgtgct 1652101 ggcgatgagc atcgacgaac gtgcgcgggc cggcctgttt ctggccatgc aatatcccgt 1652161 cgaggtgccc ggtgtctcga tgtcgaactt cctgcgctcg gcggcaaccg ccattcgcgg 1652221 cgagccgccg aaactgcggc actgggtcaa agaggtcaag gccgcgatgg ccgcgctcga 1652281 catcgacccg gccttcgccg agcgcagcgt caacgagggt ttctccggtg gcgagaagaa 1652341 gcgccacgag atcctgcagc tagaactgct caagcccaag atcgccatcc tggacgagac 1652401 cgactccggc ctggacgtcg acgcgctgcg cgtggtcagc gagggggtga accgctacgc 1652461 cgaatcccag cacggcggca tcctgctgat cacgcactac acccgcatcc tgcgctacat 1652521 ccacccggaa tacgtgcacg tgttcgtcgg cggccgcatc gtcgagtccg gtggttcgga 1652581 gctcgccgac gaactcgacc agaacggcta cgtgcgtttc tcccccgcaa gcgggcggta 1652641 cccccaccaa cccgcgccaa ccggagcctg acatgacggc ctcggtgaac tcgctcgatc 1652701 tggcggcgat tcgcgccgat ttccccatcc tcaagcgcat catgcggggt ggaaacccgt 1652761 tggcgtattt ggactccggc gccacctcac aacgcccgct gcaggtcctc gacgccgagc 1652821 gcgagttcct gaccgcgtcc aacggcgcgg tccatcgtgg cgcgcaccag ctgatggagg 1652881 aggcgaccga cgcctacgag cagggccgcg cggacatcgc gttattcgtc ggcgccgaca 1652941 cggacgagct ggtgttcacc aaaaatgcca ccgaggcgct caacctggtg tcatatgtgc 1653001 tgggggacag ccgtttcgag cgtgccgtcg gccccggcga cgtgatcgtc accaccgagc 1653061 tggagcatca cgccaacctg atcccgtggc aggagctggc ccggcgcacc ggggccacat 1653121 tgcgctggta cggggtgact gacgacgggc gcatcgacct ggactcgctg tatctggacg 1653181 accgtgtcaa agtcgttgcg ttcacccatc attccaatgt gaccggggtg ctgacaccgg 1653241 tgagcgagct ggtctcccgc gcccaccagt cgggtgcgct gaccgtgctg gacgcctgcc 1653301 agtcggtgcc gcaccagccg gttgacctgc acgaactcgg cgtcgacttc gccgcgtttt 1653361 ccggacataa aatgctgggc cccaacggaa tcggtgtgct gtacggccgc cgtgagctgc 1653421 tagcgcagat gcccccattt ctcaccggcg gttcgatgat cgaaacggtg accatggaag 1653481 gcgccaccta cgcgccggcg ccgcaacggt tcgaggccgg taccccgatg acctcccagg 1653541 tggtcgggtt ggccgccgcg gcccgctatc tcggcgcgat cggcatggcc gcggtggagg 1653601 cccacgagcg ggagctggta gccgcggcca tcgaaggcct gtccggcatc gacggtgtgc 1653661 ggatccttgg cccgacgtcg atgcgggacc gagggtcgcc ggtggcgttc gtcgtcgagg 1653721 gcgtgcacgc gcacgacgtg ggtcaggtac tcgacgacgg cggcgtggcg gtgcgggtcg 1653781 ggcaccactg cgcgctgccg ctgcaccgca ggttcggtct ggccgccacc gcgcgggcgt 1653841 cgttcgcggt gtacaacacc gcagacgagg tggaccgctt ggtggccggc gtgcggcgat 1653901 cccggcattt ctttggaaga gcgtgacgtt gcgtctggag cagatctatc aggacgtgat 1653961 cctcgatcac tacaagcatc cgcagcatcg ggggctgcgg gagccgttcg gcgcccaggt 1654021 gtatcacgtg aacccgatct gcggcgacga ggtcacgctg cgggtcgcgt tgtccgagga 1654081 cggcaccagg gtcaccgacg tttcctatga cggacaaggc tgttcgatca gccaggccgc 1654141 gacctcggtg ctcaccgaac aggtaatcgg acaacgcgtg ccgcgggcgc tgaacatcgt 1654201 cgacgccttc accgaaatgg tgtcctcccg cgggaccgtg ccaggcgacg aggacgtctt 1654261 aggcgatggg gtcgcgttcg ccggggtggc caaatacccg gcccgggtga aatgcgcgct 1654321 gctcggatgg atggcgttca aagatgcgct ggcccaagcc agcgaagcct tcgaggaggt 1654381 tacagatgag cgaaaccagc gcaccggctg aggaattgct cgccgacgtc gaggaggcga 1654441 tgcgcgacgt cgtcgacccg gagctgggga tcaacgtcgt tgacctgggc ctggtctacg 1654501 gcttggacgt gcaagacggt gacgaaggga ccgtcgcgct gatcgacatg accctcacgt 1654561 cggcggcgtg cccgctgacc gatgtcatcg aggatcagtc gcgcagcgcg ctggtcggca 1654621 gtggcctggt cgacgacatc cgcatcaact gggtgtggaa cccgccgtgg ggcccggaca 1654681 agatcaccga agacggccgc gaacaattgc gggcgctcgg cttcaccgtc tgaaccggcg 1654741 cgtcgccgaa cgtgaactga gggcggagaa tccggcaaaa taccgccgtg agttcacgtt 1654801 cggcgggcgg tgcgagcgaa acccgcctca gaaggcgtct tcgggcacgc gcatgatgtc 1654861 gtcgtcgatg ttttcgatga cactgcgcac cccggtcagt ttcggcagca tgttcttcgc 1654921 aaagaacgcc gcgaccgcga tcttgccccg atagaacgct tcatcgttct gcgatggccc 1654981 gtcggccagt gcggcgtgtg cgaccccggc cagcacgagc agccgccagc cgatgagcaa 1655041 gtcgcccacg gcgagcaaat agcgcacgga tccgagcccc accttgtaga tgtcgctgga 1655101 gtgctgcgcg gcggacatca ggtacccggt cagcgcgccc gtcattgccg tgatgtcgtc 1655161 gagcgcggtg cgcagcagct cggcttgcgg ttttagcgac gggtcaatgt tctcgacggt 1655221 gtgggtgacc tgagccagca caaattgcaa agccttgccg tgatcgcgca cgatcttgcg 1655281 gaagaagaag tccagtgcct ggatcgccgt ggtgccctcg tagagggaat cgatcttggc 1655341 gtcacggatg tactgctcga ggggatagtc gaccagaaag cccgagccgc ccagcgtctg 1655401 cagcgactcg gtgaggattt cgtaggcgcg ttctgaaccc acgcccttga cgatgggcag 1655461 cagcagatcg tccacgcggt gcgccatgtc gtgatcggca cccgaaaccc gttgggccac 1655521 agcgtcgtcc tggtgagcag cggcatacag gtacagcgcc cgcaggcctt cggcataggc 1655581 cttttgggtc atcaggctgc gccgcacgtc ggggtggtgc atgattgtga cccgcggcgc 1655641 cgtcttatcc gtcatctggg tcagatccgc gccctgcacc cgctccttgg cgaaggcgag 1655701 tgcgttgaga tagcccgtcg acaatgtgcc ggcggactta actccgatgg tcatgcgagc 1655761 atgctcaatc accgtgaaca tctgcgcaat cccgttgtgc acgccgccga ccagatagcc 1655821 aacggcgggc acgtcggcac cgccgaacgt caattcgcat gtcggagagg actttaagcc 1655881 catcttgtgt tccaggccgg tcacgtagac gccgttgcgg gcgccgagct cgaacgtatc 1655941 ggggtcgaag aggtagttgg gaacgtagaa caggctcaac cccttggtgc ctgggccggc 1656001 gccctcaggt cgggccaaca ccaaatggaa gatgttctcc gcggtattgc cgacatcccc 1656061 accggagatg aaccgcttga cgccctcgat gtgccaggtg ccgtcgggtt gttcgaacgc 1656121 tttggttcga cccgcgccga catcggaacc ggcgtcgggc tcggtgagca ccatggtggc 1656181 ctgccagccg cgctgcacgc cctcggccgc ccacctgcgt tgctcatcat tgccctcgat 1656241 gtaaagggac tgggccagca ccgggcccag gttgaaaaag cacgccgacg ggttggcgca 1656301 gtagatcatt tcgttgacgg cccatgccag cggcggcggc gctggcatgc caccgatctc 1656361 ctcggccagg cccagccgcc accagccggc ctccttgatt gcctgcactg tcttggccaa 1656421 ctcgtcgggc acgctgatgg agtgggtgtt cgggtcgaag accggtgggt tgcggtcggc 1656481 gtagccgaag gattcggcga tcggaccctc ggccagccgc gccgcttcgg ccaagatggt 1656541 gcggaccgtg tcgacgtcca gatcgctgta gcgtccggtg cccaggaccg cgccgatatc 1656601 aaggacttcg agcaggttga actcgagatc gcggacattg gcgatgtagt gtcccaatgc 1656661 ggttcccttc aggtggctga tcggccctga tcgggcccag tctctccgag cgggaagaac 1656721 gtacgcaacc gtaacctgcg gtgggagggc ggaactgcgg cgactatgtt ccgttcgcgc 1656781 cgggcaggcc gagcagcagc ccgcccctgc cgccgagccc gggggcgccg gccccgccgc 1656841 cgtcgccgcc gtcaccgccg ttaccgatca gctgggcgtt gccaccgttg ccgccgttgc 1656901 cgcccaacgc gccgccatcg ccgccttccc cgccgttgcc gaacaacccg gcctggccgc 1656961 cggccccgcc gtgggcgctc gatgcccccc cggctccgct gccgccggcg ccgccgttgc 1657021 catagaagaa cccggcatcg ccgccacgcc cagcgctacc cgcggatagg gctgccccgc 1657081 cggcaccacc gtcgccgaac aggaaggccc tgccgccggc gccacctccg ccgaggaagc 1657141 tgctggcgcc agcaccgccg ttgccaaaaa acagcccgcc gttgcctcca gagccaccgg 1657201 ctcccatgcc gttggggctg atgccacccg cgccgccggc cccgaagagc acggcggagc 1657261 cgccgatgcc gccggcaccg ccgccaccgc cgctattgcc gccggccccg ccgttgccga 1657321 acagccaccc gccggtgccg ccggcgccgc cgttggcgcc cagggcgccc acgccgccgt 1657381 tgccgccgtg gcccagcagt ccggcggcgc cgccgttgcc gccggggccg gcagcgggcg 1657441 agaagccgtt gccgccattg ccgatcagga gtccgccggc ctggccgttg gggttcgccg 1657501 cggtcccatc ggcgccgttg ccgatcagcg gacggttcaa cagcgccagg gtgggcgcgt 1657561 tgatgacgtc gaatagcggc tgcaaggggc caaagttggt ggcctccgcg gcggcgtacg 1657621 cctgcgcgcc gccggacagg gcttgcacga actggctgtg aaacgccgcg gcttgggcgc 1657681 tgagcacctg ataggtctgg ccgtgcgcgc cgaacaacgc cgcgaccgcc gccgacacct 1657741 catcggcgcc tgcggccgcg acagcggtcg tctgggccgc cgcggccgag ttggccgcgc 1657801 taatcatcga accgaggcgc gcgagattcc ccgccgctcc cgacacgaac tccgtattcg 1657861 cgaccacgaa cgacatctgg cacctccgca atgaagagct agcgaccgac gtatcttatc 1657921 gcgatccagc ggccgcttca cccgtttcgg ggtaacgcac cccgccagaa tggttaatcc 1657981 gttagtggcc ccgcttgcct tgtgccagtg accaattcaa tcgcataccg caatgcaatc 1658041 gagatttttg gtcgttcctg cgtccctaca ctcggttcat cctgacgaat tcgcacccct 1658101 gtcgtgaggc cgccggaatg accttgaccg cttgtgaagt aactgccgcg gaggctcctt 1658161 tcgaccgcgt ttcaaagacc attccccacc cattgagctg gggagccgcg ctgtggtcgg 1658221 tagtctccgt gcgctgggcc accgtggcgc tgctgctgtt tctcgccgga ctagtggcgc 1658281 aactgaacgg tgctcccgag gccatgtggt ggacgcttta cctggcctgt tatctggccg 1658341 gcggctgggg ctcggcatgg gcgggcgcac aagcgttgcg gaacaaggca cttgatgtgg 1658401 atctgctgat gattgccgcg gcggtcggag cggtcgcgat tgggcagatc ttcgacggcg 1658461 cgctgctgat cgtgatcttc gccacgtccg gtgcgctgga tgacattgcc accagacaca 1658521 ccgcggaatc ggtcaaaggc ctgctggacc tcgcgccgga tcaggcggtg gtggtccagg 1658581 gcgacggcag cgaacgggtg gtggcggcca gcgagctggt ggtgggggac cgggtggtgg 1658641 tgcggccggg ggaccggata cccgcagacg gtgcggtgct gtcgggggct agcgacgtcg 1658701 accaacgctc gatcaccggt gaatcgatgc cggtggccaa ggcccgcggt gacgaggtgt 1658761 tcgccggcac cgtgaacgga tcgggtgtat tgcatctggt ggtcacccgt gacccgagcc 1658821 agaccgtggt agcccgcatc gtcgaactgg tcgccgacgc ttcggcgacg aaggccaaaa 1658881 cccaactgtt cattgagaaa atcgagcaac gctactccct gggcatggtc gcggccaccc 1658941 ttgccctcat cgttattccg ctgatgttcg gcgccgacct gcggccggtg ctgctgcgcg 1659001 ccatgacctt catgatcgtg gcatcgccat gcgcggtggt gctggccacc atgccgccgc 1659061 tgctttcggc gatcgccaac gcaggccgtc atggggtgct ggtcaaatcc gcggtggtcg 1659121 tcgaacgcct ggccgatacc agcatcgtcg ctttggacaa gaccggtacg ctgacccgtg 1659181 gcatcccgcg actggcttcc gtcgcaccgc tggaccccaa cgtggtcgat gcccggcgat 1659241 tgttgcaatt ggcagctgcc gcagaacaat ccagcgagca cccgcttggc cgggcgatcg 1659301 tcgcggaagc tcgtcggcgt ggtatcgcca taccgcccgc caaggacttc cgcgcggtcc 1659361 cgggctgcgg ggtccacgcc ctggtgggca acgatttcgt cgagatcgcc agcccgcaaa 1659421 gctaccgcgg tgcaccgcta gcagagctgg cgccgctcct ttctgccggc gccactgccg 1659481 ccatcgtctt gttggatgga gttgccatcg gtgtgctcgg gctcaccgat cagcttcgtc 1659541 cggatgccgt ggagtccgtc gcggcgatgg ctgcattgac cgccgcacca ccggtgctgc 1659601 tcacgggtga caacgggcga gcggcttggc gggtcgctcg gaacgccggg atcaccgatg 1659661 tgcgagccgc attgctgccc gagcagaagg ttgaagtcgt gcgcaacctg caggccggtg 1659721 gtcaccaggt gctgctcgtc ggcgacggcg tcaacgacgc tcccgccatg gccgccgccc 1659781 gcgccgctgt cgccatgggc gccggcgccg atctgaccct acagaccgca gacggggtga 1659841 ccatacggga cgaactgcac accatcccga cgatcatcgg gttggcacgg caggcgcgcc 1659901 gggtggtcac cgtcaacctg gccatcgcgg ccaccttcat cgccgtcctg gtgctgtggg 1659961 acctttttgg gcagctgccg ctgccactgg gtgtggtggg tcacgaaggg tccactgtgc 1660021 tggtggccct caacggcatg cggctattga ccaaccggtc gtggcgggcc gcggcttcgg 1660081 ctgcgcgtta ggctcgatgt cgcagaactg accagggctg cgttaggggt gcccgtgacc 1660141 actcgagacc tcacggcggc gtatttccaa cagaccatct ccgccaacag caacgtgctt 1660201 gtgtactttt gggcaccgct gtgcgccccg tgcgacctgt tcacaccgac ctacgaggcg 1660261 tcgtcgcgga aacactttga cgtcgtgcat ggcaaagtca acatcgaaac cgagaaagat 1660321 ctggcctcga tcgccggggt caagttgttg cccacgctga tggccttcaa gaaaggcaag 1660381 ctggtcttca aacaagccgg catcgccaat cccgcgatca tggacaatct ggtgcaacaa 1660441 ctccgggcat acaccttcaa gtccccggcc ggcgaaggta tcggccctgg aacaaagact 1660501 tcatcctgag gcgttgaggc aggcgtgact acccgagacc tcactgccgc acagttcaac 1660561 gaaaccatcc aaagcagcga catggtgctc gtcgattatt gggcctcctg gtgcggcccg 1660621 tgccgcgcgt tcgcgccgac ctttgccgag tcgtcggaaa aacaccccga cgtggtgcac 1660681 gccaaggtcg acaccgaagc cgaacgagag cttgcagcgg ccgctcagat ccgatccatc 1660741 cccacgatca tggccttcaa gaacggcaag ttgttgttca accaggccgg cgcgctgccg 1660801 ccggcagcat tggagagcct ggtgcagcag ctcaaggcct acgaggtgga ggccggcgaa 1660861 gccaccaccc agaacgggcg agcccaacaa gcctgaccgg gcgccaggcg cccggctgtg 1660921 ccccaccgct gcgcggcgca agtcgtcgcc gggtaccgtt caacggtgag tttggtcctc 1660981 gtcgaacacc cgcggcccga gatcgcgcag attaccctca accggccgga gcggatgaac 1661041 tccatggcat tcgatgtcat ggtgccgctc aaagaggcct tagcgcaggt cagctacgac 1661101 aactcggtgc gggtggtggt gctgaccggc gcgggtcgag ggttttcttc gggtgcggat 1661161 cacaagtcgg cgggggtggt gccgcacgtc gagaacttga ctcggcccac ctacgcgctg 1661221 cgttcgatgg agctcctcga tgacgtcatc ttaatgctgc gacggctgca ccagccggtg 1661281 atcgccgcgg tcaacggccc cgccatcggt ggtgggctgt gcctggcact ggctgcagac 1661341 attcgggtgg cctcgagtag cgcctacttc cgggccgccg gtatcaacaa cgggctgacc 1661401 gccagcgaat tggggctgag ctacctgttg cccagggcca ttggatcctc acgtgcgttc 1661461 gagatcatgt tgaccggtcg cgacgtcagc gccgaggaag ccgagaggat cgggctggta 1661521 tcccgtcagg tacccgatga acagctgcta gatgcctgct acgcgatcgc cgcacggatg 1661581 gcgggattct cgcggccggg aattgagttg accaaacgta cgctgtggag tggactggac 1661641 gccgccagtc tggaggcgca catgcaggcc gagggcttgg ggcagctctt cgtccggctg 1661701 ctcaccgcca acttcgaaga agcggttgcc gcacgggccg agcagcgggc gccggtgttc 1661761 accgatgaca cgtaacagcg cccaagacaa ccgacgacca gggagcttat gtgatcacag 1661821 ctacggacct cgaggtccgc gctggcgcgc gcatcctgct cgcacccgac ggccccgacc 1661881 tgcgtgtgca gcccggcgat cgtatcgggc tggtcggacg taacggtgcc ggcaagacca 1661941 ccacgctgcg cattctggcg ggggaggtcg aaccctatgc cgggtcggtt acccgtgccg 1662001 gcgaaatcgg ctacctgcca caggatccca aagttggcga tctcgacgtg ctggcccgtg 1662061 accgggtgct gtccgcccgc ggactggacg tcctgctcac tgatctggag aagcagcagg 1662121 cgttgatggc cgaggtcgcc gacgaggacg agcgtgaccg cgccatccgc cgttacggtc 1662181 agctcgagga gcgattcgtc gcgctgggcg gctatggcgc cgaaagcgaa gccggccgca 1662241 tctgcgccag cctaggcttg cccgagcggg tgctgaccca gcggctgcgt accctttccg 1662301 gaggtcagcg ccgccgggtg gaactagccc gcattttgtt cgccgcgtcc gagagtggcg 1662361 ctggaaattc caccaccttg ttgctcgacg agccgactaa ccacctcgac gctgattcgc 1662421 tgggctggct gcgggacttc ctgcgcttgc atacgggcgg gctggtggtc atcagccaca 1662481 acgtggacct ggtggccgat gtcgtcaata aagtgtggtt cctggatgcc gtgcgcggcc 1662541 aggtcgatgt ttacaacatg ggctggcagc gctacgtcga cgctcgggcc accgacgagc 1662601 aacgtcgcat ccgggaacgc gctaacgccg aacgcaaggc ggccgcgctg cgtgcacagg 1662661 ccgccaagtt gggcgccaag gccaccaaag ccgttgcggc ccagaacatg ttgcgccgcg 1662721 ccgatcggat gatggccgca ctcgacgagg agcgagtcgc cgacaaggtg gcccggatca 1662781 agttccccac cccggcggcg tgtggacgca caccgctggt ggccaacggt ctgggcaaga 1662841 cgtatggctc gctggaagtc ttcaccggtg tcgacttggc catcgaccgc ggctcgcggg 1662901 tggtcatact cggactcaac ggtgccggca agaccacgct gctgcgattg ctggccggtg 1662961 tcgagcagcc cgacaccgga gtgctggaac ccggatacgg tttacggatc ggctatttcg 1663021 cgcaagagca cgacacgctc gacaacgatg ccaccgtttg ggagaacgtc cggcacgcgg 1663081 caccggatgc cggcgaacag gacctgcgcg gcctgctggg tgcgttcatg ttcaccggtc 1663141 cgcagctcga gcagccggcc ggcacgctct ccggcggtga gaagacccgg ctcgcgctgg 1663201 ccggcttggt ggcctccacc gcgaatgtgc tgctgctcga tgaaccgacc aacaatctcg 1663261 atccggcctc gcgcgagcag gtgctcgacg cgctgcgcag ctaccgaggt gcggtggtgc 1663321 tggtgacgca tgatcccggg gcggccgcgg cgctcggtcc ccaacgggtg gtgctgttgc 1663381 ccgacggcac cgaggactac tggtccgacg agtatcgaga tctcatcgag ctggcctgac 1663441 ctagatgcgg ctgccgcgta acgatttcgg ccaaagcacc accggggcgg cggcgggttc 1663501 ttaggctagg tgcctgggat cgacggaggg taccgatgcg gaagtcaaag aagacgcgcg 1663561 atcagctgct gcgcgagttg cgcaacgcct acgagggcgg ggccagtatc cgcaacctgg 1663621 cggccaccac cggccggtcg tacggatcta ttcacagcat gctgcgcgag tcaggcacca 1663681 cgatgcgcgg ccgcggcggc cccaatcgcc gtccccggcc gcgttgatcc gccgattgtg 1663741 aatctgacga cgcgacagcg gcgtgtcgcg tcgtcagatt cacagtcagc gcatgtcaag 1663801 accgacgcac cgagttctcc accaggtcga ggacggcggc tagccgctgc gggtcttcgc 1663861 cggaggccag ccgcgccagc aatccgtcga gcaccaggtc caggtagcac cgcaaaacgt 1663921 cgctaggcac atcgtcacgc actcggttag cctgcttttg ccggcgcagc cgatcggtgg 1663981 tcgccgccgc caattccgcg gagcgctccg cccagccgcg gctgaagtca gggtcgttgc 1664041 gcagcttgcg tgcgatctcc aacctggtgg ccagccagtc gaactggtcg ggcgcggcaa 1664101 gcatgtcgcg catcacaccg atgaggcctt cgcgggatgc tacagccgcc attcgctcgg 1664161 tatcctcgcg cgccagcgcg aaaaacagcg cgtccttgtc gcggaagtgg tgaaagatcg 1664221 caccgcgcga catcccgatt gcctgttcca ggcgccggac cgtggccttg tcatagccgt 1664281 attcggcaaa gcaacggcgc gcaccgtcga ggatctgacg gcggcgagcc gccagatggt 1664341 cctcgctgac cttgggcacg ggcgctcggt cagcctgact tcagtatgtt gcgcagcacg 1664401 tactgcagga tgccgccgtt gcggtagtag tccgcctcac cgggggtgtc gatgcgcacc 1664461 acggcgtcga actcgatcgt ggcgccgtcg cccttggtgg cctggacgca caccgtcttg 1664521 ggtgtcttgc cgtcgttaag cacgtcgata ccggtgatgt cgaagacctc ggtaccgtcg 1664581 agtcccaacg acgacgctga ctttccttcg gggaactgca gcgggatcac gcccatgccg 1664641 atcaggttgg accggtggat ccgctcgaat gactcggcga tcaccgcccg cacgcccagt 1664701 agcaatgtgc ctttggccgc ccagtcccgt gacgaacccg acccgtactc tttgccgccg 1664761 aacacaacca gcggaatgtg ttgcgccgca tagttctgcg cggcgtcgta gatgaacgcc 1664821 tgcggaccgc ccggctgggt gaagtcgcgg gtataaccgc cggacacgtc gtctagcagt 1664881 tggttacgca gccggatgtt ggcgaaggtg ccacgaatca tcacctcgtg gttgccgcgg 1664941 cgagaaccga aggagttgta gtccttgcgg tcgacaccgt gttcgtcgag gtagcgcgcc 1665001 gcgggagttc cgggcttgat ggcgccggcg ggggagatgt ggtcggtggt caccgaatca 1665061 ccgagcagcg ccagcacccg ggcaccgctg atgttgccga ccggttcggg tttggctgtc 1665121 atcccctcga aatacggcgg cttgcgcacg taggtcgaat tcgggtccca ctcaaaggtg 1665181 ttgccgctcg gggttggcag gttgcgccag cggtcgtcgc ccttgaacac gtcggcgtag 1665241 ttgcgggtga acatctcctg gttgatcgcc gcggcgatgg tgtcggagac atcctgctgc 1665301 gatggccaga tatcgcggag aaaaacgttc ttaccgtctt tgtcttgacc gagcggctgg 1665361 gtttggaagt cgaagtccat ggtcccggcc agcgcgtagg cgatgaccag cggcggcgat 1665421 gccaggtagt tcatcttcac gtctgggttg atacggccct cgaagttccg gttgccggac 1665481 agtaccgcgg tcaccgaaag gtcgttgtcg ttaaccgctt ttgagatttc ctcgggcagc 1665541 ggcccggagt tgccgatgca ggtggtgcag ccgtagccga ccagatagaa gccgagcttc 1665601 tccagatacg gccacaggcc ggatctgtcg tagtagtcgt tgaccacttg cgagcccggg 1665661 gcaatcgtgg tcttcaccca cggcttcgag gtcagtccct tttcgacggc gttgcgggcc 1665721 agcagcgccg cgcccagcat tacttcgggg ttggaggtgt tggtgcagga cgtgatcgcg 1665781 gcaatcacca ccgcgccgtg gtcgagcacg aattcgccga gttcgtccga cttcacccgc 1665841 actgggttgc tcacccggcc atcggcatgc gcggcagccg agtgcacggt ttcgtcagtg 1665901 gcgacgtcgt cgttggcgaa cgtcagctgc cccgggtcgc tggccgggaa tgtctcctcg 1665961 actacctcgt ccagcttcga gtgcgggtcg tggggggaat ccggggaacc attgccgaca 1666021 tagtggtaaa tctgctcgcg gaatgttgat ttggcttgcg ccaacgcgat tcggtcctgt 1666081 ggacgctttg gtccggcgat cgacggcacc acgtcggata ggttgagttc gaggtattcc 1666141 gagaactccg gctcgtgctt gggatcgtgc cacatgccct gcgccttggc gtaggcctcg 1666201 accagtgcga cctgctccgg cgtgcgaccg gtaaaccgca gatacttgat ggtttcttcg 1666261 tcgatcggga aaatcgctgc ggtggaaccg aattcgggac tcatgttgcc cagggtggcg 1666321 cggttggcca gcggcacctc ggccacgccc tcgccgtaga actcgacgaa tttgccgacg 1666381 acgccgtgct ggcgcagcat ctcggtgacg gtcaacacca cgtcggtggc ggtgactccc 1666441 ggctggatct cgccggtcaa cctgaaaccc acgacccgcg ggatcagcat cgataccggc 1666501 tgacccagca tcgcggcctc cgcctcgatg ccgccgacac cccacccgag cacacccagg 1666561 ccgttgacca tggtggtgtg tgagtcggtg cccacgcagg tgtcggggta ggccactccg 1666621 tcgcgagtca tcaccacgct ggccaggtac tcgatattga cctggtgcac gatgccggtg 1666681 cccggcggca ccactttgaa gtcgtcgaaa gcgccttggc cccagcgcag gaattggtaa 1666741 cgctcaccgt tgcgctggta ttcgatttcg acgttgcgct cgaatgcgtc ggcgcggccg 1666801 aacaaatcgg cgatcaccga gtggtcgatc accaagtctg cgggcgccag cgggttgacc 1666861 ttgtccgggt tgccgcccag atcggcgatc gcctcgcgca tggtggccaa gtcgacgatg 1666921 cacggtacgc cggtgaagtc ctgcatcacc acccgggcgg gcgtgtactg gatctcgatg 1666981 ctgggctcgg ccttagggtc ccagttggcg atggcctcga tgtggtcctt ggtgatgttg 1667041 ctgccgtcct cgttgcgcaa caggttctcg gcgagcactt tgaggctgta ggggagtttc 1667101 gcggtattgg ggacggcgtc gagacgatag atctggtaac tcttttcgcc gaccttcagg 1667161 gtgtcgtggg ctccgaatga gttcacagat ttgctagtca catcaactcc cagggatttg 1667221 gttcgcccgc cgacgggccg tgtcgacggc gtggtgtcag cctagcagta cgcttgtcct 1667281 gctttgttgc cgtgtgggtg cgcgccgaag tgcgagcagc gcgtaacgtg ccagtagcac 1667341 gtcggcagga aggatgcgat gaccgggcca tattttcctc agacgatccc gttcctgccc 1667401 agctacattc cgcaagacgt cgacatgacc gcggtcaaag cggaggtcgc cgcactcggt 1667461 gtcagcgctc caccggcggc cacgccgggc ctgctcgagg tggtccagca cgctcgcgac 1667521 gagggcatcg atctcaagat cgtgctgctc gaccacaacc cgcccaatga cacaccgctg 1667581 cgtgacatcg cgaccgttgt cggggccgac tactcggatg ccaccgtctt ggtgctcagc 1667641 ccgaactatg tcggcagtta cagcacgcaa tacccccggg tcacgctcga ggccggggaa 1667701 gaccattcca agaccggcaa tccggtgcag tccgcgcaga actttgtcca tgagctgagc 1667761 acacccgagt ttccctggag cgcgctgacc attgttttgc tgatcggtgt gctggcagcg 1667821 gctgtgggtg ctcggttgat gcaactgcgc gggaggaggt cagcaacgtc gactgacgcc 1667881 gccccagggg cgggggacga tctcaatcaa ggcgtctagc cagccacatc tatctcttct 1667941 cgtgttgccg cgctaaccgg gcggttgttt gcggcaaacg cgcgaggtca ccgttgggtc 1668001 acattagtcg cacgtaccgg gggcagtttg tgacttacgt ttccatagcg tcagatgtga 1668061 cgtacggtgc aaatgatgct tgtggtgtcg ttggcgttga cctgcgctgt ccctccgagt 1668121 tgagccctag gagatctgag tcgaatgaga cggaatcgcc gtggctcgcc agcgcgaccg 1668181 gccgcacggt ttgtccgtcc ggcaattccg tcggctttga gtgtggccct gctggtatgc 1668241 acaccggggc tggctaccgc cgatccacag acggacacca tcgccgcgct gattgccgac 1668301 gtcgccaagg ccaaccagcg cctgcaagac ctgagcgacg aggttcaggc cgaacaggaa 1668361 agcgttaaca aggcgatggt cgacgtggaa accgctcggg acaacgctgc cgcggccgaa 1668421 gacgacctgg aggtcagcca gcgcgcggtt aaggacgcca acgcggcgat cgccgcggct 1668481 cagcaccggt tcgacacctt cgcggcggcc acctacatga acggtccctc ggtcagctac 1668541 ctcagcgcga gcagccccga cgagatcatt gccactgtga ccgccgccaa gacccttagc 1668601 gccagttccc aagcggtgat ggccaacctg cagcgggccc ggaccgagcg ggtgaacacg 1668661 gagtcggcgg cgcggctagc caagcagaag gctgataagg ccgccgccga cgcaaaggcc 1668721 agccaggatg ccgcggtggc ggcgctcacc gagacccggc ggaagttcga tgaacagcgc 1668781 gaggaggtcc aacgcctggc cgccgagcgc gatgcggctc aagcccgact gcaggcggcc 1668841 aggttggttg cctggtcctc ggagggtggt cagggtgcgc cgccgttccg gatgtgggat 1668901 cccggatcgg gccctgccgg tgggcgtgca tgggatggct tgtgggaccc cacgctgccc 1668961 atgatcccca gcgccaacat ccccggcgac ccgatcgcgg tagtgaacca ggtgttgggg 1669021 atctcggcaa cgtcagcgca ggtcaccgcc aatatggggc gcaagttcct ggagcagctg 1669081 ggcatcttgc agcccaccga taccggcatc accaacgctc cggcgggctc ggcccagggc 1669141 cggattccgc gagtttatgg gcgccaggct tctgaatacg tgatccgccg cggcatgtca 1669201 cagatcgggg tgccctattc ctggggcggc ggcaatgccg cgggcccgag caagggcatc 1669261 gactccgggg ccggcaccgt cggcttcgac tgctcaggcc tggtgttgta ctcgtttgct 1669321 ggggtgggca tcaagctgcc gcactactcg ggttcgcagt acaacctggg ccgcaagatc 1669381 ccgtcctcgc agatgcgccg cggcgacgtc atcttctacg gcccgaacgg tagccagcac 1669441 gtgacgatct acctcggcaa cggccagatg ctcgaggcgc ccgacgtcgg tttgaaggtg 1669501 cgggttgcgc ccgtgcgcac ggctggcatg accccgtatg tggtccgata catcgagtac 1669561 tagacgagga ttcatgcgcc acacgcgttt tcacccgatc aaactggcct ggatcaccgc 1669621 ggtggttgcc ggcctgatgg tcggtgtggc aacgcccgcc gatgccgaac ccggacaatg 1669681 ggatcccacg ctgccggcat tggtcagtgc gggggcgccc ggagatccgc tggcggtagc 1669741 caacgcgtcg ttgcaggcca ccgcccaggc cacccagacc acgctggatt tgggcaggca 1669801 gttcctcggt gggttgggaa tcaacctcgg cggccctgct gccagcgctc ccagcgccgc 1669861 cacaaccggc gcgagccgga ttccgcgggc caacgcccgt caggccgtcg aatatgtgat 1669921 tcgccgggcc gggtcgcaga tgggggtgcc ctattcgtgg ggtggtggct cgcttcaggg 1669981 ccccagcaag ggcgtggact cgggggccaa cactgtcggc ttcgactgct caggtctggt 1670041 gcggtatgcc ttcgccgggg tcggcgtgct gatcccgcgg ttctccggtg atcagtacaa 1670101 cgccggtcgc cacgttccgc ccgctgaggc caagcgcggc gacctgatct tttacggccc 1670161 aggcggcggc cagcacgtca ccctgtatct gggcaacggc caaatgctgg aggcatccgg 1670221 aagcgccggc aaagtcacgg tgagcccggt gcgaaaggcc ggaatgacgc cgttcgtgac 1670281 taggatcatc gaatactgag ccaggtgtga tttgccgggc accaccgcgg cgtcgacgga 1670341 atccaggagg cctggaatag ttgaacgcgg gcgcgtcgct gccccgcgac gttggtcatg 1670401 tcggcagtcg tgtccgattg agctgtggag gattttgatg acatcagcag gtgggttccc 1670461 cgcgggcgcc ggcggttacc agaccccggg tgggcattca gcttcgccag cccacgaggc 1670521 gccccccggt ggtgccgagg ggctggccgc cgaggtgcac acgctggagc gggccatctt 1670581 cgaggtcaag cggattatcg tcggccagga ccagctggtg gagcggatgc tcgtcggcct 1670641 gctgtccaag gggcatgtgc tgcttgaggg cgttcccggc gtggccaaga cgttggcggt 1670701 ggagaccttc gctcgggtgg tcggcgggac attttcgcgc atccagttca ccccggatct 1670761 ggtgcccacc gacatcatcg ggacgcgcat ctaccggcaa ggcagggagg aattcgacac 1670821 cgaactcgga ccggtggtgg ccaacttcct gctcgccgac gagatcaacc gggctccggc 1670881 gaaggtgcag tcggcgttgc tggaagtcat gcaggagcgc catgtgtcca tcggcggtag 1670941 gaccttcccg atgcccagcc cgttcctggt gatggcgacg cagaacccga tcgagcacga 1671001 gggcgtctac ccgctaccgg aggcgcaacg ggaccgcttc ctgttcaaga tcaacgtggg 1671061 ctacccgtcg cccgaagaag agcgcgaaat catctaccgt atgggtgtta ccccgccgca 1671121 ggccaagcag atcctgagca cgggcgacct gctgcggctg caggagatag cggccaacaa 1671181 cttcgtccac cacgcgctgg tcgactatgt cgttcgagtc gtcttcgcca cccgcaaacc 1671241 cgagcagttg gggatgaacg acgtgaagag ctgggtcgcg ttcggcgcat ccccgcgtgc 1671301 ttcgctgggc atcatcgccg ccgcacggtc cctggcgctg gtccggggcc gtgactatgt 1671361 catcccgcaa gacgtcatcg aggtcattcc tgatgtgctg cgacaccggc tcgtgctcac 1671421 ctatgacgcg ctcgccgacg aaatctcacc ggagatcgtc atcaaccgtg tgctgcagac 1671481 tgtggcgctg ccacaggtga atgccgttcc acagcaaggc cattcggtgc cgccggtgat 1671541 gcaggccgcg gccgcggcga gcggccggtg accgaatcca aagcgccggc ggtggtgcat 1671601 ccgccgtcga tgctgcgcgg ggacatcgac gacccgaagc tggcggcggc gctgcgcacc 1671661 ctcgagttga ccgtcaagca gaagctcgac ggtgtcttgc acggcgatca cctcggcctg 1671721 atacctgggc cgggttcgga gccaggggag tcgcgcctct accagcccgg tgacgatgtc 1671781 cgccggatgg actgggcggt caccgctcgc accactcacc cgcatgtccg gcagatgatc 1671841 gccgaccggg aactggaaac ctggctggtg gtcgacatgt cggccagcct ggattttggc 1671901 accgcctgct gcgagaaacg tgacctcgcg gtggcggcgg cggctgccat caccttcctc 1671961 aacagcggcg gcggcaaccg gctcggtgcg ctgatcgcca acggcgccgc gatgactcgg 1672021 gtgccggctc gcaccgggcg ccaacatcag cacacgatgt tgcgcaccat tgcgaccatg 1672081 ccgcaggccc ctgcgggggt ccgcggcgac ctggcggttg ccatcgatgc gctgcgccgg 1672141 cccgaacgtc gtcgcgggat ggcggtgatc atcagcgatt ttctgggccc gatcaactgg 1672201 atgcgtccgc tgcgggcgat cgcagcccgc catgaggtgc tggccatcga agtgctcgat 1672261 ccgcgcgatg tcgaattgcc ggacgtgggt gatgtggtgc tgcaggacgc cgaatccggg 1672321 gttgtgcgcg agttcagcat cgaccctgcg ctgcgcgacg acttcgctag ggcagctgcg 1672381 gcgcaccggg ccgacgtggc gcgcaccatc cgcggttgcg gggcaccctt gctatcgctt 1672441 cgcaccgacc gcgactggct tgccgatatc gtacgattcg tcgcctctcg ccggcgtggg 1672501 gcattggcgg gacaccagtg atgggtcagt tatgacattg ccgttgctgg ggccgatgac 1672561 gctatccggc ttcgcgcatt catggttctt cctattcctg tttgtcgtgg ccggactggt 1672621 cgcgctgtac atcctgatgc agctggcgcg ccagcggcga atgctgcggt tcgccaacat 1672681 ggagttgctg gagagcgtcg cacccaagcg gccatcccgc tggcggcatg tcccggcgat 1672741 cctgctggtg ttatcgctgc tgctgttcac catcgcgatg gccggtccga cgcatgacgt 1672801 ccggattccc cgtaaccgcg cggtggtgat gttggtgatc gacgtgtcgc agtcgatgcg 1672861 cgccaccgac gtcgagccca gccggatggt ggccgcgcag gaggctgcca agcagttcgc 1672921 cgacgagttg accccgggca tcaatctggg attgattgcc tacgcgggca cggcgacggt 1672981 cctggtgtcg ccgacgacca accgggaggc gaccaagaat gcgctggaca agttacagtt 1673041 cgccgaccgt accgccaccg gggaggcgat cttcaccgcg ctgcaggcca tcgccacggt 1673101 tggcgcggtg atcggtggcg gcgacacgcc gccgccggcg cgcatcgtgc tgttctccga 1673161 cggcaaggag acgatgccga ccaacccgga caaccccaag ggcgcctaca ccgccgcccg 1673221 caccgccaag gaccagggcg tgccgatttc gacgatctcg ttcggcaccc catacggctt 1673281 cgtcgagatc aacgaccagc gccaaccggt gcccgtcgac gacgaaacga tgaagaaggt 1673341 cgcccagctc tccggtggaa attcctacaa tgcggcgact ttggccgagc tgagggccgt 1673401 ttactcgtcg ctgcagcagc agatcggcta cgagaccatc aagggtgacg ccagcgtcgg 1673461 ctggttgcgg ttgggtgcgc tggcgctggc gttggcggcg ctagcggcgc tgctcatcaa 1673521 ccggcggttg ccgacttagc ttcccccgcg gccccggcag cccgcgagcg taacctggct 1673581 gcgatttccg gcgcggattt tcgcagtgcg gttacgctcg gaaagcgcgg gcctcgccca 1673641 cgcggcggat gatgtcagcg gggtggtcct cggcgacgac ccggaccacg atccacccgt 1673701 agcggtgctg gactttctcg tgccggagga tgtctttccg gtagtggtag cgactggtca 1673761 gatggtggtc gccgtcatac tcggccgcga ccttgatgtc ttgccagccc atatccaaat 1673821 gggcttccgc ccagccccat tcgttgcgca ccgcgatctg cgtctggggg cgcggaaagc 1673881 cggcgcggat caacaacaag cgcagccagg tttccttggg ggactgggca ccgccgtcga 1673941 cgaggtccag agcggctctt gcggccttca tgccacggcg gccccgatag cgctcgatca 1674001 gcggctcgac gtcggccacc ttcaaatcgg tggcctgtat cagggcgtcg acggccgcga 1674061 cggcggggtc caatggaaat cgactggtca ggtcgagcgc cgttcgctcc ggtgtggtca 1674121 cgcgcatgcc ctcgatgacg cagatctcgt cgggctcgat gcgctcttcc cagacttgca 1674181 gccccggggc acggcggcgg ttggtgtcga tgatcgcggc gggaagatcc gcgtcgatcc 1674241 acttggcgcc atggaaggca gaagccgagt agccggccag cacgccgcgg cggcgcgagc 1674301 gcagccacag cgcttttgca cgcaattgcg cggtcagttc cacaccctgc ggcacgtaca 1674361 cgtctttatg tagcgcgaca tacctgctgc gcaattcgta gggcgtcaat acacccgcag 1674421 ccagggcctc gctgcccaga aagggatccg tcatggtcga agtgtgctga gtcacaccga 1674481 caaacgtcac gagcgtaacc ccagtgcgaa agttcccgcc ggaaatcgca gccacgttac 1674541 gctcgtggac ataccgattt cggcccggcc gcggcgagac gataggttgt cggggtgact 1674601 gccacagcca ctgaaggggc caaaccccca ttcgtatccc gttcagtcct ggttaccgga 1674661 ggaaaccggg ggatcgggct ggcgatcgca cagcggctgg ctgccgacgg ccacaaggtg 1674721 gccgtcaccc accgtggatc cggagcgcca aaggggctgt ttggcgtcga atgtgacgtc 1674781 accgacagcg acgccgtcga tcgcgccttc acggcggtag aagagcacca gggtccggtc 1674841 gaggtgctgg tgtccaacgc cggcctatcc gcggacgcat tcctcatgcg gatgaccgag 1674901 gaaaagttcg agaaggtcat caacgccaac ctcaccgggg cgttccgggt ggctcaacgg 1674961 gcatcgcgca gcatgcagcg caacaaattc ggtcgaatga tattcatagg ttcggtctcc 1675021 ggcagctggg gcatcggcaa ccaggccaac tacgcagcct ccaaggccgg agtgattggc 1675081 atggcccgct cgatcgcccg cgagctgtcg aaggcaaacg tgaccgcgaa tgtggtggcc 1675141 ccgggctaca tcgacaccga tatgacccgc gcgctggatg agcggattca gcagggggcg 1675201 ctgcaattta tcccagcgaa gcgggtcggc acccccgccg aggtcgccgg ggtggtcagc 1675261 ttcctggctt ccgaggatgc gagctatatc tccggtgcgg tcatcccggt cgacggcggc 1675321 atgggtatgg gccactgaca caacacaagg acgcacatga caggactgct ggacggcaaa 1675381 cggattctgg ttagcggaat catcaccgac tcgtcgatcg cgtttcacat cgcacgggta 1675441 gcccaggagc agggcgccca gctggtgctc accgggttcg accggctgcg gctgattcag 1675501 cgcatcaccg accggctgcc ggcaaaggcc ccgctgctcg aactcgacgt gcaaaacgag 1675561 gagcacctgg ccagcttggc cggccgggtg accgaggcga tcggggcggg caacaagctc 1675621 gacggggtgg tgcattcgat tgggttcatg ccgcagaccg ggatgggcat caacccgttc 1675681 ttcgacgcgc cctacgcgga tgtgtccaag ggcatccaca tctcggcgta ttcgtatgct 1675741 tcgatggcca aggcgctgct gccgatcatg aaccccggag gttccatcgt cggcatggac 1675801 ttcgacccga gccgggcgat gccggcctac aactggatga cggtcgccaa gagcgcgttg 1675861 gagtcggtca acaggttcgt ggcgcgcgag gccggcaagt acggtgtgcg ttcgaatctc 1675921 gttgccgcag gccctatccg gacgctggcg atgagtgcga tcgtcggcgg tgcgctcggc 1675981 gaggaggccg gcgcccagat ccagctgctc gaggagggct gggatcagcg cgctccgatc 1676041 ggctggaaca tgaaggatgc gacgccggtc gccaagacgg tgtgcgcgct gctgtctgac 1676101 tggctgccgg cgaccacggg tgacatcatc tacgccgacg gcggcgcgca cacccaattg 1676161 ctctagaacg catgcaattt gatgccgtcc tgctgctgtc gttcggcgga ccggaagggc 1676221 ccgagcaggt gcggccgttc ctggagaacg ttacccgggg ccgcggtgtg cctgccgaac 1676281 ggttggacgc ggtggccgag cactacctgc atttcggtgg ggtatcaccg atcaatggca 1676341 ttaatcgcac actgatcgcg gagctggagg cgcagcaaga actgccggtg tacttcggta 1676401 accgcaactg ggagccgtat gtagaagatg ccgttacggc catgcgcgac aacggtgtcc 1676461 ggcgtgcagc ggtctttgcg acatctgcgt ggagcggtta ctcgagctgc acacagtacg 1676521 tggaggacat cgcgcgggcc cgccgcgcgg ccgggcgcga cgcgcctgaa ctggtaaaac 1676581 tgcggcccta cttcgaccat ccgctgttcg tcgagatgtt cgccgacgcc atcaccgcgg 1676641 ccgccgcaac cgtgcgcggt gatgcccggc tggtgttcac cgcgcattcg atcccgacgg 1676701 ccgccgaccg ccgctgtggc cccaacctct acagccgcca agtcgcctac gccacaaggc 1676761 tggtcgcggc cgctgccgga tactgcgact ttgacctggc ctggcagtcg agatcgggcc 1676821 cgccgcaggt gccctggctg gagccagacg ttaccgacca gctcaccggt ctggctgggg 1676881 ccggcatcaa cgcggtgatc gtgtgtccca ttggattcgt cgccgaccat atcgaggtgg 1676941 tgtgggatct cgaccacgag ttgcgattac aagccgaggc agcgggcatc gcgtacgccc 1677001 gggccagcac ccccaatgcc gacccgcggt tcgctcgact agccagaggt ttgatcgacg 1677061 aactccgtta cggccgtata cctgcgcggg tgagtggccc cgatccggtg ccgggctgtc 1677121 tgtccagcat caacggccag ccatgccgtc cgccgcactg cgtggctagc gtcagtccgg 1677181 ccaggccgag tgcaggatcg ccgtgaccgc ggacatccgg gccgagcgca ccacggcggt 1677241 caacggtctc aacgcatcgg tggcacgctg agcgtccgac aacgactgcg ttccgatcgg 1677301 caatcgactc agcccggcac tgaccgcgat gatcgcatcg acgtgcgcgg cattctcgag 1677361 cacccgcaat gcgcgcgatg gcgcgtggtc gggaacccgg tgttgccgtg acgattcgag 1677421 caactgctcg acgaggccac ggggattggc gacgtcgcta gatcccagtc cgatggtgct 1677481 caaggcttcg gcggccgagc gcaccgctga ccgcaacgcg tattcggcat cgccgagctc 1677541 gtagtgttcc aacactgggg ccccgggaag tgaatacacc atccaagaaa gtgcacacaa 1677601 ttcgggcgtg agtggctcgc tctgcgcggc ttcgtcgaca tcgccatagg agaactccgg 1677661 gaccaggcca acggcgctgc cgggatcctc cgggttggcg acgatcaccg cctcgccggc 1677721 ggcaagggcg tcgtgctcga actgtgttcc cgcagccagc ccgcgcacat cgcccggcac 1677781 cggcaacacc acattgatcg tcccccgcag tcggcgccgg cccaccgcgg cgcgcagtgt 1677841 ctgcaggagc gagaccgttc cagcatcgtg gacgtcgggc cacggcagcc cggtgtggcc 1677901 cgcagccacg gcatcataag ctgcgacgga ttgcgttggc gcccaaagtg ataatgcatc 1677961 caacacgtcg tcgggagcag ccttgccggc gagccaagcg ttagcccaga tcgacagcga 1678021 aacactggga caccacatga tcttgcagtg tagttgttcg acccggctga cgcggatcac 1678081 gcgtatccta agcgcatgcc cgtcgctttg atctggctta tcgcggcgtt ggtgctcgtc 1678141 ggcgcagagg cactgaccgg cgacatgttc ttgctgatgc tcggcggcgg tgcgctggcc 1678201 gcctcggtaa gcagctggct gctggcttgg ccgatgtggg ccgacggggc ggtgtttctc 1678261 ctcgtctcgg tgctgctgct ggtgttggtt cggccggcgg tgcggcgccg gctgacgcag 1678321 accaaaggtg tgcagctggg catcgaggcg ctggagggta agaaggcggt ggtgcttggt 1678381 cgggtggccc gcgacggggg tcaggtgaag ctggacggcc aggtgtggac ggcgcgcccg 1678441 ctcaacgacg gtgatgtgtt cgaacctggt gactcggtga ccgtggtgca aatcgacggc 1678501 gccacggcgg tggtcttcaa ggacgtgtag ggactcgaga aaggaattcc ggtgcaagga 1678561 gccgttgctg gtctggtgtt tctggccgtc ctggtgattt tcgccatcat cgtggtggcc 1678621 aagtcggtgg cgctgatccc gcaggcggag gccgcggtga tcgagcggct gggtcgctat 1678681 agtcgtacgg tcagtgggca gttgacgctg ttggtgccgt tcatcgaccg cgtccgggct 1678741 cgggtggacc tgcgcgagcg ggtggtgtcg tttccgccgc aaccggtgat caccgaggac 1678801 aacttgacgc tgaacatcga caccgtcgtc tacttccagg tgaccgttcc gcaggcggcg 1678861 gtgtacgaga tcagcaatta catcgtcggg gtcgaacagc tcaccaccac caccctgcgc 1678921 aacgttgtcg gcgggatgac gctggagcag acgttgacct cgcgtgacca gatcaacgcc 1678981 cagctgcgcg gcgttctcga tgaggcgacc ggccgctggg gtctgcgggt ggcgcgggtg 1679041 gagctgcgca gcatcgatcc gccgccgtcg attcaggcgt cgatggaaaa gcagatgaag 1679101 gccgaccggg agaagcgagc gatgattctg accgccgaag gtacccggga ggcggcgata 1679161 aaacaggccg aggggcaaaa gcaggcgcag atcctggccg ccgagggcgc caagcaggcc 1679221 gcgatcttgg ctgctgaggc cgatcggcag tctcggatgc tgcgcgctca gggtgagcgc 1679281 gccgcggcct acctgcaggc gcaagggcag gccaaggcca tcgagaagac gttcgccgcg 1679341 atcaaggctg gccggcccac cccggagatg ctggcctacc aatacctgca gacgctgccg 1679401 gagatggcgc gtggggacgc caacaaggta tgggtggtgc ccagcgactt caacgccgca 1679461 ctgcaggggt tcaccaggct gctgggcaag ccgggtgagg acggggtgtt ccggttcgag 1679521 ccgtccccgg tcgaagacca gcccaagcac gcggccgacg gtgacgacgc cgaggtcgcc 1679581 ggctggttct ccaccgatac cgacccgtcg atcgctcggg cggtggctac agccgaggcg 1679641 atagcccgca agccggtcga gggttcgctg gggacgcccc ccaggttgac tcaatagagt 1679701 ggtccgatga gtggtttgac ctcaccgaaa acctatgcgg tactggcagc tctgcaggcg 1679761 ggcgacgcgg tggcgtgcgc catcccgctg ccacctatcg ccaggttact cgacgacttg 1679821 gacgttccgg tcagcgttcg cccggtgctg ccggtggtca aggccgcctc tgcggtcggt 1679881 ttgttgtcgg tcacccgatt cccggccttg gcgcggctga cgacagcgat gttgacgttg 1679941 tacttcatcc tcgccgtggg ggcacatgtc cgggtgcgag atcgcgttgt taatgcgatt 1680001 ccggcggcgt cattcctgac gttgttcgcg ctgatgacgg caaaggggcc ggagcgcact 1680061 taagcatgga ggcgcaactc gacctatggc agtggtgtgt cggtcggtga ggtcgaggtg 1680121 ctcaaggtcg aaaacagccg ggtgcgcgcc gagcagctgg ccaaactgta cgaattgcgc 1680181 tcaagtcggg atcgggtcag ggtcgacgcc gcactagccg agctgagccg cgccgcggcc 1680241 gcccgcggtt gtgccggtac tagcgggctc ggcaacaacc tgatggcgcc ggggccgccc 1680301 cattccctcc tgggacggga tcgctgacgc cacaatcgac ctgctacgaa ggctggccga 1680361 gcggctgggg tacacactgg attggcgagc gatccgtgga gccgaacccg ttgccaccgc 1680421 cattctgcgt cggttagtct ctttttcgac cttggggcgc ggagggtcgt tatggtgtgt 1680481 cacagtgctt tgctgtcaaa ggcattggcg gtgccgacca agcgacactg ggcagtgcag 1680541 aaatcctcgt gaaatacgct caactcgctg acaaacgcgc tcgggtatat gtcctggtgt 1680601 cgacctggtt ggtcgtgtgg ggtatctggc atgtgtattt tgtcgaagct gtctttccga 1680661 atgccatcct gtggttgcat tattacgcgg ccagctatga attcgggttt gtacgtcgcg 1680721 ggctgggcgg tgaactgatt cgcatgttga ccggcgatca tttctttgcc ggcgcctata 1680781 ccgttctgtg gacgtctatc acggtgtggc tgatcgccct tgccgtcgtg gtgtggctta 1680841 tcctttccac gggcaaccgg tccgagcgca ggataatgct tgccctcctc gttccggtgc 1680901 taccctttgc cttttcttac gccatctata atccacatcc ggaactcttc gggatgaccg 1680961 cgttggtagc cttcagcatt tttctgacca gggcccacac ctctcgaacc cgggtgatcc 1681021 tcagtacgct gtacggactt acgatggccg tgctggcgct catacacgaa gcgattccac 1681081 tggaattcgc actcggcgcg gtgctggcga taatcgtgtt gtcgaagaat gcgacaggtg 1681141 cgacaaggcg aatctgtact gcgttggcca tcggtccggg gaccgtctca gtattgttgc 1681201 tcgctgtggt cgggcgtcgc gatatcgcgg accagttgtg tgcccatatc ccgcatggga 1681261 tggtcgaaaa tccgtgggcg gttgcaacga caccgcagcg agttctcgat tacatattcg 1681321 gtcgtgtcga gagccatgca gattaccacg attgggtgtg cgagcatgtg accccgtggt 1681381 ttaacctcga ctggattacc tctgcaaagc tggtggccgt ggttggcttc cgcgcactat 1681441 tcggtgcatt cctcctcggg ttgctgttct tcgttgccac gacatcgatg atccgctatg 1681501 tctccgccgt gccggtcaga accttctttg ccgaactgcg cggcaatctg gcgttgccgg 1681561 tgctggcatc ggcattgctg gttccgctgt tcatcaccgc tgtcgactgg actcgctggt 1681621 gggtgatgat cacactcgac gtggccattg tctacatctt gtacgcgatc gacagaccgg 1681681 agatcgagca accgccgtcg aggagaaacg tgcaggtctt cgtctgcgtt gtgttggtgc 1681741 tggcggtgat accgaccggg tccgccaaca acatcggcag atgaggcacc ccgcgggacc 1681801 acccgaaggc gggcatggtg acgtaggcca accgccgctg acatgcttgg gacggtgatg 1681861 ctgttgcagg cctattaggg gttgtcggat cgggagcctt gtgaccggtt ggcccttgat 1681921 ctgcgttggg aggccgcggc ggggttgacg gtgcacgcgc cgtcgttgca tcccacggtg 1681981 ttggtcggga tgcgtaaccg gctgcgggct tcggatccga cgtgttggtg gatgcaccat 1682041 tttcaaatgc cgtcgcgggt cgaaacttgg gtgccgtcga agaataaccc caccaaaggc 1682101 cctacatcag cgccgtccta cgttcgtgtg tcggacaatc cttagtgccg atgccggata 1682161 ttcgggcact aacggaaaag acgtcctccg cgtagaggct ccgttgttcg aggcccagtt 1682221 acaggggcaa ggtcagtggc cgtgacctct gcttcccgac acgagaatgc tggccgaccg 1682281 aacgtagcgc ggtgcgttga cggcatcgag ctgccacgcc aaatttgcac gcgctgatgc 1682341 gctgaccccg accgaaggtt tatcaaatga gagccggctc gcgcacaggg tcgtcgtaac 1682401 ccggcatgcg tcggtgctgc cgtcgataat tgcggatctc atagacgagc ccagtcagac 1682461 ctagcgcgcc cgtgcatacc gacaccagga tcagcagcgg gctaccgcta ccggcgaacg 1682521 cgtcgcccag gatgaccacc gccgcggtgc cggggagcaa gcctgccaag gtcgcccagg 1682581 cgaaggacag gatccgcacg cccgaggcgc cggcggcata gttgatcgcc gcgaacggga 1682641 cgacgggaat gagccgcagc gacaagatgg ccagccagcc tcgctcacgc agacgctcgt 1682701 ccagccggtt gatcgctcgg cggcgcacca gactgttcag ctgccagccg gtggcacgca 1682761 ccagcagcat cgcgattacc gcgctagcgg tgctgccgac caccgcgatg aatacgccca 1682821 ccacagagcc gaacaacagc ccggcggcca acgtgaacgc ggtgcggggg aatggcggca 1682881 ccgtgacgac ggtatgcacc agcaaaaatg ccagcgggaa ccacgcgccc agtgacttgg 1682941 cccagtcgcg caattccacc gcagtgggca ccggaaccag cagcgcgacc actaccagta 1683001 ctgtgattcc caccactgtt cccacgatgc gcggcagcga cgcctgacgc gcgaccgcgc 1683061 cgagcgaggt ggcgataccg tgtacggttt cggtggtgtt gcagatggcg ggagccgtca 1683121 cgtcttcgga gcgtacgggg tcaacatgaa taactcgttt ccccaggctg gcgtttcgtc 1683181 acactccggc cgcgattgcc gcacctgggc gtctatatgg gcgtcccgat caactagcct 1683241 tattagttaa gtgacaatcc cgaagcaagc ccaagcaaca tcgctaattg ctgggaaaac 1683301 aggagcagtc ggtgtccatt gatgtacccg agcgtgccga cctagaacag gttcgcgggc 1683361 gctggcgcaa cgcggttgcc ggtgtgctgt ccaagagcaa ccgtaccgac tcagcacaac 1683421 tcggcgatca ccccgagcgg ctgctggata cccagaccgc tgacgggttc gccatccggg 1683481 ccctctacac cgcgttcgac gagctcccgg agccgccgtt gccgggccag tggccctttg 1683541 tgcgcggcgg agacccgctg cgcgacgtgc attccggctg gaaggtcgcc gaggcgtttc 1683601 ccgccaacgg tgcgacggcc gacaccaacg cggcggtgct ggccgcgctc ggcgaggggg 1683661 tcagcgcgct gctgatccgg gtgggggagt cgggtgtggc gcctgaccgg ctcacggcgc 1683721 tgctgtccgg ggtgtatctg aacctggcgc cggtcatcct cgacgccggc gccgactacc 1683781 gcccggcctg cgacgtcatg ctggcgctgg tcgcccagct cgatcccggc cagcgcgaca 1683841 ccctgtcgat cgacctgggc gccgacccgc tgacggcgtc gctgcgcgat cgtcccgccc 1683901 cgccgatcga ggaggtcgtc gcggtcgcat cccgggcggc cggcgaacgt gggcttcgtg 1683961 cgatcaccgt cgacggaccg gccttccaca acctgggcgc gaccgcggcc accgaactcg 1684021 cggccaccgt cgcggccgcg gtggcctacc tgcgggtgct caccgaatcc gggctcgtgg 1684081 tgagtgacgc gctgcggcag atcagcttcc ggctcgccgc cgacgacgac cagttcatga 1684141 cgctggccaa gatgcgggct ctacgtcaac tgtgggcgcg ggtcgccgag gtcgtgggcg 1684201 acccgggtgg cggcgcggcc gtcgtgcacg cggagacgtc gctaccgatg atgacccagc 1684261 gtgatccgtg ggtgaacatg ctgcgctgca cgctggcggc cttcggcgcc ggtgtcggtg 1684321 gcgcggacac cgtgctggtg cacccgttcg acgtggcgat tcccggcggc tttcccggca 1684381 cggcggccgg ctttgcgcgc cggatcgctc gcaacaccca actgctgctt ttagaagagt 1684441 cgcatgtcgg cagggtgctc gatcccgccg gcgggtcgtg gttcgtcgaa gagctcaccg 1684501 accggctggc tcggcgcgcc tggcagcgtt tccaggccat cgaggcccgt ggcggcttcg 1684561 tcgaggccca cgacttcctg gccggccaga tcgccgagtg cgccgcccgc cgcgccgacg 1684621 acatcgccca tcggcgcctg gcgatcaccg gcgtcaacga atacccgaac ctgggcgaac 1684681 ccgcgctgcc gcccggtgat ccgacatcgc cggtgcgccg ctacgctgcc ggattcgaag 1684741 cattgcgcga tcgatccgat caccacctag cccgcactgg cgcacggccg cgggtgctgt 1684801 tgctgccgtt gggtccgctg gccgagcaca acatccggac gaccttcgcc accaacctgc 1684861 tggcgtccgg cggcatcgag gcgatcgacc cgggaacggt tgatgcgggc accgtcggga 1684921 atgccgttgc cgatgccggt tcgcccagcg ttgccgtgat ctgcggcacc gatgcgcgct 1684981 accgggacga ggttgccgac attgtgcaag cggcccgagc cgccggtgtt tcgagggtgt 1685041 acctcgcggg tcccgagaag gcgttgggag atgccgcaca ccggcccgac gagtttttga 1685101 ccgcgaaaat caatgtggtg caagccttgt cgaatctgct gacgcggttg ggggcctaga 1685161 tgacaaccaa gacacccgtg atcggcagct tcgccggcgt tccgctgcat agcgagcgtg 1685221 ccgcgcaatc gcccacagag gccgcggtgc acacgcatgt cgccgccgcc gcggcggcgc 1685281 acgggtacac gcccgaacag ttggtgtggc acacgccgga aggcattgac gtcacaccgg 1685341 tatacatcgc cgccgaccgg gccgccgccg aagccgaggg ctacccgctg cacagcttcc 1685401 cgggcgagcc cccctttgtg cgcggcccct atccgacgat gtatgtgaac cagccgtgga 1685461 ccatccgcca gtacgccggg ttttccaccg ccgcggattc caatgcgttt taccgacgca 1685521 acctggccgc cggccagaag gggctgtcgg tggccttcga tctggccacc caccgcggct 1685581 acgactccga ccatccccgc gtgcagggcg atgtcggaat ggccggtgtg gcaatcgatt 1685641 ccattctcga catgcgacag ctgttcgacg gcatcgacct gtcgaccgtg agcgtgtcga 1685701 tgacgatgaa cggtgcggtg ctgccgatcc tggcgctgta tgtggttgcc gccgaggagc 1685761 agggcgtggc gccggagcag ctggccggca ccatccagaa cgacatcctc aaagagttca 1685821 tggtccgcaa cacctacatc tatccgccga agccgtcgat gcggatcatc tccgacatct 1685881 tcgcctacac cagcgccaag atgcccaagt tcaactccat ctccatttcc ggctatcaca 1685941 tccaagaagc cggtgccacg gcggatttgg agctggccta caccctggcc gacggcgtcg 1686001 actacatcag ggcgggcctg aacgccggcc tggacatcga cagcttcgcg ccccggctat 1686061 cgttcttctg gggcatcggg atgaatttct ttatggaggt cgccaaactg cgggccggcc 1686121 ggttgctgtg gagcgagctg gtcgcacagt tcgcgcccaa gagcgccaaa tccctttcgc 1686181 tgcgtacaca ttcgcaaaca tcggggtggt cactgaccgc ccaggatgtg ttcaacaacg 1686241 tggcgcgcac atgcatcgag gcgatggccg ccacccaggg gcacacccag tcgctgcaca 1686301 ccaacgccct ggacgaggcg ctggcgctgc ccaccgattt ttcggcccgc atcgcgcgca 1686361 acacccagct ggtgttgcag caggagtcgg gcaccacgcg gccgatcgac ccgtgggggg 1686421 gctcctacta tgtggagtgg ctgacccatc ggctcgcgcg gcgagcccgg gcgcacatcg 1686481 ccgaggtcgc tgaacatggc ggcatggcgc aggccatcag cgacggcatc cccaagctgc 1686541 gcatcgagga ggcggccgcg cgcacccagg cccgcatcga ctccggtcag caaccggtgg 1686601 tcggggtgaa caaataccag gtgcccgagg accacgagat cgaggtgctc aaggtcgaaa 1686661 acagccgggt gcgcgccgag cagctggcca aactgcagcg gctgcgggca ggccgggacg 1686721 agccggcggt acgggccgcg ctggccgagc tgacccgcgc cgccgccgag caaggacgcg 1686781 ccggagcaga cgggctgggc aataatctgc tggccctggc catcgacgcc gcccgggccc 1686841 aggccaccgt gggcgagatc tccgaagcgc tggagaaggt gtacggacgg caccgggccg 1686901 agatccgtac catttccggg gtctaccgcg acgaagttgg aaaggccccc aacatcgcag 1686961 ccgcaaccga gctagtggag aagttcgccg aggccgacgg ccgccggccc aggattctga 1687021 tcgccaagat gggccaggac ggccacgacc gcgggcagaa ggtgatcgcg accgcgttcg 1687081 ccgacatcgg gttcgacgtc gacgtggggt cgctgttttc cacccccgag gaggtggcgc 1687141 gtcaggccgc cgacaacgac gtgcacgtga tcggggtgtc ctcgctggcc gccggccatc 1687201 tgacgctggt gccggcgctg cgcgacgcgt tggcgcaggt gggcaggccc gacatcatga 1687261 tcgtggtcgg tggtgtcatc ccgccgggcg acttcgacga gctgtacgcc gccggggcca 1687321 ccgccatttt cccgccgggg acggtgattg ccgacgcggc gattgacctg ctgcacaggc 1687381 tggccgagcg gctggggtac acgctggatt agcgagaggc ccgtggtgcc gtttctggtt 1687441 gcattatccg gtatcatctc gggcgtgcgt gatcattcga tgaccgtgcg gctcgaccag 1687501 caaactcgcc agcgcctgca agacattgtg aaaggcggat accggagcgc taatgcggcg 1687561 atcgtcgacg ccatcaacaa gcgctgggag gcgctacacg atgagcaact cgacgccgcc 1687621 tacgcggccg cgatccatga caatccggcg tacccgtacg agtctgaggc cgaacggagc 1687681 gccgcgcggg cccggcgcaa cgccaggcag cagcgctcgg cacagtgaac gcgccgttgc 1687741 gtggtcaggt ctatcgatgc gacctcggat acggggccaa accgtggctc atcgtctcca 1687801 acaacgcccg caaccgtcac accgccgacg tggtggctgt gcgcctgaca acaacgcgga 1687861 gaaccatacc gacctgggtc gccatgggcc ccagcgatcc attgaccgga tacgtcaacg 1687921 cggacaacat cgagaccctc ggcaaagacg agctcggtga ctacctcggt gaggtcacgc 1687981 cggcgacgat gaacaaaatc aacacggcgc tcgcgaccgc gctggggcta ccgtggccat 1688041 gatggccgca tcccacgacg acgacaccgt cgacgggttg gcgacggccg tgcgcggcgg 1688101 tgaccgtgcg gcgctgccac gggccatcac actggtcgag tcgacccgcc ccgaccatcg 1688161 tgagcaggcg caacagctgc tgctgcgatt gctgccggac tccgggaacg cccatcgcgt 1688221 cggcatcacc ggggtcccgg gggtgggcaa gtcgactgcc atcgaggcgc tgggcatgca 1688281 tctgatcgag cgcgggcatc gggtggcggt gctggcggtc gacccgtcgt cgacccgcac 1688341 gggtggatcg attcttggtg ataaaacccg gatggcgcgg ctggcggtgc acccgaacgc 1688401 ctacatccgg ccgtccccga cgtcgggaac gctgggtggg gtgacgaggg ccacccggga 1688461 aacggtggtg ctgttggagg cggccggttt tgatgtgatc ctgatcgaaa ccgtcggggt 1688521 gggccagtcc gaggtcgcgg tggccaacat ggtcgacacg ttcgtgttgc tgaccttggc 1688581 ccgcaccggt gatcagttgc agggcatcaa gaagggcgtg ctggagctcg ccgacatcgt 1688641 ggtggtgaac aaggccgacg gggagcacca caaagaggcc cggctggccg cccgggagct 1688701 gtcggcggcg atcagattga tctatcctcg cgaagcactg tggcgcccac cggtgctcac 1688761 catgagcgcg gtggagggca ggggactggc cgagctgtgg gacaccgtcg agcgtcatcg 1688821 ccaggtgctc accggggccg gcgaattcga cgcccgtcgg cgcgatcagc aggtcgactg 1688881 gacctggcag ctggttcgcg acgccgtcct ggatcgggtg tggtccaatc cgacggtgcg 1688941 caaggtccgc tccgagctcg agcgtcgggt ccgcgccggc gaactgaccc cggccctggc 1689001 ggctcagcaa atactggaga tagctaacct aacggatagg taaataaatc cgtgtttgcc 1689061 gatggtcgct gcgaaatcca cgtaagttcg accgtgtgat ggttgacacc ggagtcgatc 1689121 accgcgcggt ttcgtcccac gacggaccgg acgcgggccg gcgggtgttt ggtgcggcgg 1689181 acccacgctt tgcgtgcgtc gttcgagcct ttgccagcat gtttccgggg cgccggttcg 1689241 gtggcggagc gctggcggtg tatctcgacg ggcagccggt cgtcgacgtg tggaaggggt 1689301 gggctgatcg ggccggatgg gtgccgtggt cggcggattc cgcgccgatg gtgttctcgg 1689361 cgaccaaggg catgacggcc acggtcatcc accggctggc cgaccggggg ctgatcgact 1689421 acgaagctcc cgttgccgag tattggccgg cgttcggcgc caacggcaag gcaaccctga 1689481 cggttcgtga cgtgatgcga caccaggccg gcctgtccgg attgcgtggc gcgacgcagc 1689541 aagacttgct ggatcacgtc gtgatggaag agcggctggc ggcggcggtg cccgggcggc 1689601 tgctgggcaa atccgcctac cacgcgctga cgttcggttg gttgatgtcg ggcctggcca 1689661 gggccgtcac cggaaaggac atgcgcctgc tgttccgcga ggaacttgcc gagccgttgg 1689721 acaccgacgg cttgcacctg ggtcggccgc cggccgacgc gccgacgcgg gtcgccgaga 1689781 tcatcatgcc gcaagatatt gccgccaatg cggtgctgac ctgtgcgatg cgccggctcg 1689841 cccatcggtt ctccggcgga tttcgctcca tgtattttcc cggcgccatc gcggccgtgc 1689901 agggcgaggc gccgttgctg gacgccgaga tacccgcggc caacggggtg gcgacggcgc 1689961 gagcgctggc gcggatgtac ggcgcaatcg ccaacggcgg cgagatcgac ggcatacggt 1690021 tcttgtcgcg ggagctggtc acgggcctga cccgcaaccg acggcaagtt ctgccggatc 1690081 gaaatctatt ggtgccctta aattttcatc ttggctatca cggtatgccg atcggcaacg 1690141 tgatgccggg gtttggtcat gtgggcttgg gcggctcgat cggctggaca gacccggaga 1690201 ccggggtggc gttcgcgctg gtgcacaacc ggctgctgtc accgttggtg atgaccgatc 1690261 acgcaggctt tgtcggcatc taccacctga tccggcaggc cgccgcccag gcgcgcaagc 1690321 gtggttacca gccggtgacg ccattcgggg cgccgtactc ggagccggga gccgcggcgg 1690381 gctaatctgc ccgcctaatc ggcctgccgg cagcggcgct cggcgccacg gtgtcgcgat 1690441 gcttcccgga tgccgaccta gctcgcggtt ttggtcgcga tgacgatgtc ctggaagctt 1690501 aggtgtggtt cccggccact ccatgagccg tagtgcaatg gttcgtgcac ggcgaggccg 1690561 aacttgccat agacatccct gacgaaggtc tccggcaagc cgattgcttc ttcgggccgc 1690621 ttcttgtgga ttgtccgata acccggtccc tcatgctgga agttgtgcgc actctttcct 1690681 tccgcgatgt gggctaacga ctcgtcattg agcaagaagt acgtgcacag gcatcgtccg 1690741 ccgggcttca gcacgcggga gatctcgtcc agatagtgct ccacgtccgg cggaaacatg 1690801 tgggtgaaca ccgaggtaag aaacaccaca tcgaacgacg catccggata tggaaagcga 1690861 aagtctagtg actggtattt ccctttcggg ttgtacagcg agttgtagat gtcggagacc 1690921 tcgaactgga agttggggtg cgccgaggtg atgtgctcct ggcaccacgc gatggctttc 1690981 tgcgagatat cgaagccggc gtagcgtccc tcgctgttca gatagccggt gagcggcaac 1691041 gccatccgcc ccgagccgca gccgacgtcg agcaccgctt cgtccggctg cagcccacac 1691101 aggtcgacca gatacccgac gaattcagca ccgacttcct tgtaggcgcc gccgacgaat 1691161 tgtcgcaggg attttggagg cagcgcctcg gcggagccac cgtcggctga accgcgtttc 1691221 gagcgcgtca ggatgttctg gaaaagtcgc ttaatgatgc acctcagtta tcggccgcgc 1691281 ttgaaggttc aggaatcctc caggcggaag ccgactttca tagtcacctg gaagtgcgcg 1691341 accgctccgt cgaccaggtg gcctcgaatt gactgtactt cgaaccagtc cagcgcgcgc 1691401 atggtctgcg cagctcgggc cagaccgccc tggattgccg cgtcgacgcc gtcgggcgag 1691461 gtcccgacga tctcgatcac tcggtaggtg tgattgctcg tcgtgtcccc tcacattctt 1691521 ttacccgctc ttaccggcca gcggcacacc agaatagtcc ggtgccatcg ggggagccct 1691581 ctacggccgg tcactttgag cacttgccgc gcggcagctt cggccggatt ctctccgtcc 1691641 tcaatgccgc tgccgaccat catccacgtg agttgctcgt cgtcgggatt gcgaccttcg 1691701 accagaagcg cccggcggtg ggggtcgatg agcacgaccc gggtggtgcg gcgacgccgg 1691761 cggtggtcat caactacgag agccggtcgt cggctggcgg caccatcggc cattcaacaa 1691821 cgtcacaggt agcgtgctgt ttgtatcagc agccgaaacg cccagcgctc cggccgacca 1691881 aggcggcagc gacgaccgca gcgacaacct ggatcgaacg agtccaaaac cgccgcggac 1691941 gccactcggc cctcgtatga tcccgaggag ataccctacg gggtggattg gggatggatc 1692001 ggcgatgcgc ctctcgatcg taacgactat gtacatgtca gagccttacg tgctggagtt 1692061 ctacaggaga gcgcgcgcgg cggcggacaa aatcacgcct gacgtcgaga tcatcttcgt 1692121 ggatgacggc tcgccggacg cagcgctcca gcaggccgtc tcgctgctcg acagcgaccc 1692181 ctgtgttcgg gtaattcagc tttcgcgaaa tttcggccac cacaaagcga tgatgaccgg 1692241 cctggcgcac gccacggggg atctcgtctt tctgatcgac tcagacttgg aagaggaccc 1692301 ggctctccta gagccgttct atgaaaagct gatctcgacg ggcgccgacg tagtatttgg 1692361 ttgccacgcg cggcggcccg gcggttggtt gaggaatttc ggaccgaaaa tccattatcg 1692421 ggcgtccgcc ctgctgtgtg accccccgct tcatgaaaat actctcaccg tgcggctgat 1692481 gacagccgac tatgtacgca gcttggtcca gcaccaggag cgtgaacttt cgattgccgg 1692541 tctgtggcag attactggtt tttaccaggt gcccatgtcc gtaaacaagg catggaaagg 1692601 aacgaccaca tacacgttta ggcgtaaagt agcgacactg gtcgacaatg tcacttcatt 1692661 tagcaacaaa cctctagtct tcattttcta tcttggtgcg gccattttta ttatttcaag 1692721 ctcggccgcg ggctatctga tcatcgatcg aattttcttt cgcgctctgc aagcggggtg 1692781 ggcatccgtg atcgtatcca tctggatgct ggggggtgtg acgattttct gcatagggct 1692841 ggtcggaatt tatgtatcca aagtcttcat cgaaactaag cagcggccat acacaattat 1692901 ccgaagaatc tacggttcgg atttaacaac ccgggagcca tcctctctga agaccgcctt 1692961 cccggccgcg cacctgtcga acgggaaacg cgtcacatca gagccagagg gattggcaac 1693021 tggcaacagg tgaataagcg tagcatgatt cctgtaaagg ttgaaaacaa tacttcgctc 1693081 gatcaggtgc aagacgctct taattgcgtc gggtacgcgg ttgtagaaga tgtgcttgat 1693141 gaggcgtcac tggcagcgac ccgtgatcgc atgtatcgtg tacaggagcg gattcttacc 1693201 gagattggca aagagcggct ggcaagggcc ggtgagctcg gtgttcttcg actcatgatg 1693261 aagtatgacc ctcatttctt tacctttctt gaaatccccg aagtcctaag catcgttgat 1693321 cgtgtgctat ctgaaacggc catcttacat ctgcagaatg gctttatcct tccgtccttc 1693381 ccgcccttct ccacgccgga cgtttttcag aatgcgttcc accaagactt tcccagggtt 1693441 ctgtccggtt acattgcctc cgtcaatatt atgttcgcca tcgatccctt tacacgagac 1693501 accggcgcaa cgctcgtagt gccggggagc caccagcgca tagagaaacc ggaccatacc 1693561 tacctcgcgc gcaatgccgt tcccgttcaa tgcgcggcgg gctcgttgtt cgtttttgac 1693621 tctacgcttt ggcatgcggc tggccgaaac acctccggca aagaccgctt ggccataaat 1693681 catcagttta cgcgctcgtt tttcaagcag cagatcgact acgtccgcgc gctgggcgac 1693741 gccgtggttc tggagcagcc tgcgcgtact cagcaactgc tcggatggta cagtcgagtg 1693801 gttaccaatc tggacgagta ttaccagccg ccggacaagc gattgtatcg gaaggggcaa 1693861 ggctagtttt gcgagaattc cgttgcgcct atttgaaagc ccgacatgaa acgatcgctt 1693921 ttaagcgcat atgtctgttc tgcaaaaatg tctaattttt ccgataaagg ttggtgggaa 1693981 agctcgatgc gtgccgtgtt ttgtaggtgg ccggatgatc cacttagaca ggccgtggaa 1694041 gcagaatttg cgcgtcccga tggcgttgcg gtggcgtaat ggcctggcga aagctcggga 1694101 gaatttttgc tccgtcgggc gaactcgact ggtcgcgaag tcatgctgcg ctaccggttc 1694161 ctgaatggat cgagggtgat attttccgca tctatttcag cggccgcgat ggtcagaatc 1694221 gttccagtat cggtagcgtg atcgtcgatc tcgccgtggg cggcaagatt ctggacattc 1694281 cggcggagcc gattttgcgc cccggcgctc gaggaatgtt tgacgactgt ggggtgtcaa 1694341 tcggatcgat tgtgcgtgcc ggcgatacgc gacttttgta ctacacgggc tggaactcgc 1694401 tgtcaccgtg ccctggaaaa acaccatagg cgtggcgatt agcgaagcag gtgcaccatt 1694461 cgagcgatgg tctacttttc ccgtcgttgc gctggacgag cgtgatccat tctcgctttc 1694521 ttatccctgg gtcatccaag atggagggac ataccgtatg tggtatggct caaatctagg 1694581 ctggggagag ggcaccgacg agatacctca cgtgatcagg tatgcgcaat caagggacgg 1694641 tgtccactgg gaaaagcagg atcgcgtgca tatcgacaca agcggatccg acaatagcgc 1694701 ggcctgtagg ccgtgcgtcg tccgcgatgc gggagtatac agaatgtggt tttgcgctcg 1694761 cggtgcgaaa tatcggattt actgcgctac atcggaggat ggtttgactt ggcggcaact 1694821 cggcaaagat gagggcatcg acgtttcgcc agatagctgg gactcggata tgatcgagta 1694881 tccttgtgtg ttcgatcaca ggggacagcg ctttatgctt tattcgggcg atggctacgg 1694941 tcgcaccggg ttcggtttgg cggtgctgga gaactgatca gggctgacaa tagatgttta 1695001 gcggctgatg atgcgcttcc cgctcgaata ggctgagacc attattgccg cggtagcgat 1695061 gatttcccgg attatcgtcg tcgccgcgat cactcactgc tcgtcgaggc cctttaaggg 1695121 cttcattgta tccttcgcac tgcttatctt catgcgcgca acgtcaggat gcgcgtgagc 1695181 gcctcgacaa cgcggctctg atctacctcc tgaagtccaa cccacatcgg cagacggatt 1695241 aggcgggaag ccacgtcgtt ggtgacggtc aggttgccat tggtgcggcc gtagcgacgc 1695301 ccggccggcg aatcgtgaag cggcacgtaa tgaaagaccg cgcctatacc tttgctcgtc 1695361 agacgcgcca gcacctcctc ccgatcggcg ctgggcgcta gtaacacgta gtacatgtgg 1695421 gcgttgtgag agcagccctg tgggatgatc ggacggcgca ggagcccccg ctgttccaat 1695481 gattcgaagc tttcatgata ccggttccat aggtccaatc ggatacgcgt gatccgctcg 1695541 gcttcctcga actgagccca tagaaaggca gcgactaatt cgctgggcaa ataggaagac 1695601 cctttgtcct gccacgtata tttgtcgacc tcgttgcgaa ggaagcggct gcgattggtg 1695661 cccttttccc tgagaatctc tgcccggagc aggaagtctt ctgagttgac aagcagggcg 1695721 ccgccttcgc cggaaatcac attcttggtc tcgtgaaatg agagcgctcc caggtcgccg 1695781 atgctgccga gcgcccgccc acgatacgac gccatcgcgc cttgggccgc gtcttcgacc 1695841 accgccaggt tgtggtgcgt ggcgatcttc atgatcgcgt ccatctcgca ggccacgccg 1695901 gcatagtgaa cggggacgat ggccttggtt cgcggggtga tggcgtctac gatgcgagtt 1695961 tcatcaatgt tgagcgtgtc gggccgaata tcgacaaaga ctggcacacc accgcgcaac 1696021 acgaaggcgt tggcggtaga gacaaaggtg tatgacggca gtatgacttc gtccccctcc 1696081 tctatgtcca gaagcagcgc catcatttcc agcgcggcgg tgcatgaggg ggtgagtagt 1696141 gccttgcgac aaccggtctg ctgttcgagc catgcatggc tacgccgggt gaagggacca 1696201 tcgccggcca ggtggccgca agaatgcgct tcggcgatgt acgcgagctc ccggccggtc 1696261 atgtacggcc gattgaatgg aactttgtga tctgacactc gacgccaact tctcaaatca 1696321 tcgaacaggg cgctgaagtg ttcggtgatc ggggtcgaac atccaccaga attctccttg 1696381 tggccggcgg atccctagcc ttttcaggta tcccaacatg ccttcactat ttcttcatat 1696441 cttccgcaac tccgtgctgg gcaccggacg gcgctccgtc ttggttccta tatagacacc 1696501 atccgcgtca gcgtcgccaa ggagtagggc gcccgctccg accacacacc gtgaaccgat 1696561 ggtgatatgg tcgcgtagcg ttgcattgac gccaatgaaa gattgctcct ctattaccac 1696621 gccaccggat acgacgatat gagacgctag aaaacagtga tcgtgaatcg tcgagtgatg 1696681 gccgatatga ttgccgctcc acaatgtgac gttgttgcca atcgatacga atggctggat 1696741 agtgttgtct tcaagcagga agacattttc accgatccgc ccatcgttca agacggtagc 1696801 gtgggagctc acatagctgg cgagttcata gccgagagcc ttagcggcaa gatatttttc 1696861 cttccgcaca ccgttcagtt tggcgtaggc cagcgccacg aacatcgcgt gggactccgg 1696921 cggaaagcgt tgtgcgacct cgtcgaaggc cactaaaggc aggccgcaaa actcggacac 1696981 gcttgcatag tctcggtcga ctgtgaacgc gacgacctca tattccgaat cccttgtgaa 1697041 gtagtaatgt gcgagctgag cgatgtcgcc gctcccaaaa attaccaatg gtttggtcat 1697101 gacgccttcc taaccagaat tgtgaattca tacaagccgt agtcgtgcag aagcgcaaca 1697161 ctcttggagt ggcctacaac ggcgctctcc gcggcgcggg cgtaccggat atcttagctg 1697221 gtcaatagcc atttttcagc aatttctcag taacgctacg gggcgcgccg tgccgtagta 1697281 gcgtccccac tgatgtggac gatggtgctc cttttggggt tggggatggc gattgacccg 1697341 gcgcgtctgg gactcgcggt cgtcatgctg tcgcggcgtc ggcccatgct gaatctgttc 1697401 gccttctggg tgggcggcat ggtggcgggt gtcggcatcg cgctagccgt gctggtgttc 1697461 atgcgcgatg tcgccttggc ggccatacaa ggcgtggtgt ccgcggccaa cgagttcagg 1697521 gaagcggtcg ggatcctggc gggtgggcgt ctgcacatcg tcatcggtgt catcatgctg 1697581 ctgttggccg cgcgcatggt ggctcgcgcg cgggcgcagg taggggtacc ggtagggcca 1697641 gtgggggtag ccgacggtgg aatgtcggcc ctggcgctag cgcagcgccc cccgggtctt 1697701 gttgcgcggc tggaagtgcg tactcaacag atgctgcagg gcgacgttgt gtggccggcg 1697761 ttcgtggtgg gcgtcgcctc gtccgcaccg cccttcgaga gtgtggtggc gttgacggtc 1697821 atcatggcat cgggagccga gatcggcact cagttcggcg catttgtcgt gttcaccctc 1697881 ctggtgcttg cggtcatcga gattccgttg gtcgcctacc tggcgatacc gcagcaaacc 1697941 cagcaggtta tgctgcggtt tcaggattgg gtacggtcca atcgtcggca gatctccctc 1698001 accatcctga taggggtcgg gttcctcttt ttgtaccagg gcgtgactag tctctgagtc 1698061 gccatgtggt gcctggtgat gcatcaagcg tggtatcggt gaacccggcg aaaccgctta 1698121 tctcggtgtg catcccgatg tacaacaacg gcgccaccat cgagcgctgt ctgcgtagca 1698181 tcctcgaaca ggagggcgtc gagttcgaga tcgtggtcgt tgacgacgac tcgtccgacg 1698241 actgcgccgc gatcgccgca acgatgctgc gacccggaga ccgcttgctg cgaaatgagc 1698301 ctcgcctcgg cctcaaccga aaccacaaca aatgtctgga agtcgcgcgc ggcggactta 1698361 ttcagttcgt acatggtgat gatcggctgc tccccggagc cctgcagaca ctcagccgac 1698421 gttttgagga tcccagtgtc ggaatggctt tcgccccccg acgggtggag agcgacgaca 1698481 tcaagtggca acaacggtac ggcagggtcc atacccgttt ccgcaagctg cgcgaccgca 1698541 accacgggcc gtcgctggtc ttgcagatgg tattgcacgg cgcgaaggaa aattggatcg 1698601 gcgaaccgac cgccgtgatg tttcggcggc aattggcgct ggacgccggt ggttttcgca 1698661 ccgatatcta ccagctcgtc gatgtggact tctggcttcg gttgatgctg aggtcggcgg 1698721 tctgcttcgt tccgcacgag ctctcggtgc gccgtcacac ggcggcgacg gagaccacac 1698781 gggtgatggc gactcggcgc aacgtgctgg accgacagcg cattctcacc tggttgatcg 1698841 tggacccgtt gtcgcccaac agagttcgca gcgccgcggc gctgtggtgg atacccgcat 1698901 ggctggccat gatcgtggag gtggccgtgc tcggaccgca gcggcggacg cacttgaagg 1698961 ctttggcgcc ggccccattc cgcgagttcg cccacgcccg gcgtcaactg ccgctggctg 1699021 actagcagtc gcactctgcc tggccgtcgt cggagccaca gacaattcca acccatttgg 1699081 cctggcggcc aagatgacat ttttacaagg taaggctagc cttaagcgtc cgcgtatcca 1699141 ggacctcggg tctgttgcgt tgtggttgcc tcgcatgcga cggagtgctc tgcgccaacg 1699201 gcccaggtcg tccgagaagg ccagccttga cctgtacagc tgtggcgacc cgaacgttgc 1699261 acagcttggc gacgaatgcc gagttggtcg agtcggccga tctgaccgtc accgaggata 1699321 tttgctcgcg aatcgtgtcg ctgccagttc acgaccacat ggccattgcc gacgttgcgc 1699381 gggtcgttgc gccgttcggg gaagggttag cgcgcggtgg ttgacccgac agcgacggat 1699441 tcgcccaagg tgagtatcgt ctcgatctcc tacaaccaag aggagtacat tcgcgaggcc 1699501 ctggacggct tcgccgccca gaggaccgag ttccccgtcg aggtgatcat cgctgacgat 1699561 gcctccacgg acgccacccc gaggatcata ggagagtacg ccgcccgcta tccgcagctg 1699621 tttcggccga tcctgcggca gaccaacatc ggtgtccacg ccaatttcaa ggatgtgctg 1699681 tccgccgctc gtggcgagta cctcgcactg tgcgaaggcg acgattactg gaccgatccg 1699741 ctgaagctgt ccaagcaggt aaagtacctg gaccggcatc cggagacgac ggtgtgtttt 1699801 catcctgtgc gagtgatcta tgaggatggc gcaaaagact ccgagttccc gccgctcagc 1699861 tggcgccgcg acctgagcgt cgatgccctg ctcgcgcgga acttcatcca aaccaactcg 1699921 gtcgtgtacc gccgtcagcc gagctacgac gacatcccgg ccaacgtcat gccgatagat 1699981 tggtacttgc atgtgcggca tgcggtgggc ggcgagatcg ccatgttgcc cgagacgatg 1700041 gcggtctacc gtcgccacgc tcacggtatt tggcattccg cgtacactga ccgccgaaag 1700101 ttttgggaga cacgaggcca tgggatggcc gcgacgctcg aggcgatgct cgacctagtt 1700161 cacggccacc gcgagcgcga ggcgatcgtc ggtgaggtgt ccgcctgggt gcttcgcgag 1700221 atcggaaaga cacccggccg acagggtcgc gccctgcttc tgaagtccat cgcggaccat 1700281 ccgcggatga cgatgctgtc gctacaacac cggtgggcgc aaacgccctg gcggcggttc 1700341 aagcgccggc tgtccaccga gttatcgagc ttggcggcgc ttgcgtacgc cacccgacgg 1700401 cgcgcactcg aaggtcggga cggcggttat cgcgaaacca cttctccgcc gaccggtagg 1700461 ggacgtaacg tccgcggatc acatgcctag atcttgatag atcgcccgtc tggcctctat 1700521 ggatggagca tgcgggatcg gaccggttgc cgccgactcg acgaccgaaa gagccatcaa 1700581 atagccttgc ggcccatctt tgagatctgt caacccgccg gtcctgatgt cctccaggct 1700641 ctggtcggga tgagctagtg cggttcccga actcggcatc ttcgtcagtc ctggagagaa 1700701 acaacaccag cgaaggtagt gtgatgtccg tggtcgaatc ctctcttcct ggtgtgctgc 1700761 gtgaacgcgc cagttttcag cccaacgaca aagcgctcac ctttatcgat tacgagcggt 1700821 cctgggatgg tgttgaagaa actctgacgt ggtcgcagtt atatcggcga acgcttaacc 1700881 tcgccgcaca gctaagagaa catgggtcga ccggcgatcg ggcattaatt ctggcgccac 1700941 aaatcctcga ctatgtcgtt agctttattg cctcgctgca ggccggaatt gtcgcggttc 1701001 cgctttcgat tccccagggt ggtgcccacg acgagcgcac cgtttccgtg ttcgccgata 1701061 ccgcaccggc gatcgttctc acggcgtcct cggtcgtcga caatgtcgtc gaatacgtcc 1701121 agccgcagcc cggccaaaac gcaccggcgg tgatcgaagt cgatcggctg gatcttgatg 1701181 ctcggccgag ctccggttct cgttctgccg ctcacggcca tccggatatc ttgtacttgc 1701241 agtacacctc gggttccacg cgcacgccgg ccggtgtcat ggtctcgaat aagaatcttt 1701301 tcgccaattt cgaacaaatt atgaccagtt actacggcgt ctatggcaag gtcgccccgc 1701361 caggctccac cgtggtgtcg tggttgccgt tctatcacga catgggtttc gtcttgggac 1701421 tgatattgcc gattctggct ggcatccccg ccgtgctgac cagcccgatc ggtttcctgc 1701481 agcgcccggc tcgctggata cagatgttgg caagcaacac tcttgcgttt accgccgcgc 1701541 cgaacttcgc attcgatctg gcgtctcgta agaccaaaga cgaggacatg gagggcctcg 1701601 atctcggtgg cgtgcacggc atcctcaacg gcagcgaacg ggtgcagccg gtgacgctga 1701661 agcgcttcat cgaccggttc gccccgttca atcttgaccc caaggcgata cgtccgtcgt 1701721 acggaatggc agaggccacg gtatatgtgg ccacccgcaa ggcgggtcaa ccgccaaaga 1701781 tagtgcaatt cgatccccag aagctgccgg acggccaagc tgagcggacc gaaagcgacg 1701841 gcggcacacc gctggtcagc tacggcatcg tcgacaccca gctggtgcgc atcgtcgacc 1701901 cggacaccgg catcgagcgc cccgcgggaa cgatcggtga gatttgggtg cacggcgaca 1701961 acgtcgccat cggctattgg cagaaacccg aggcgaccga acgcaccttt agcgcaacga 1702021 tcgtcaatcc ctccgaaggc acacccgcag gaccatggct gcggacggga gattcgggtt 1702081 tcctctccga gggtgagctg ttcatcatgg ggcgcatcaa ggacctcttg atcgtgtacg 1702141 ggcgcaacca ctctcccgac gatatcgagg cgacgattca gacgatcagt ccgggccgct 1702201 gtgcggcgat cgctgtttcc gagcatggtg ctgagaagct ggttgccatt attgaactca 1702261 agaagaagga cgagtccgac gacgaggcgg cggaacgact gggtttcgtg aaacgcgaag 1702321 tgacctcggc aatctcgaag tcgcacgggt tgagcgtggc ggatcttgtg ctcgtctccc 1702381 cgggctcaat cccaatcacc accagcggca agatccggcg agcacagtgt gtggagctgt 1702441 accgtcagga cgagttcact cgcctggacg catagcaccc acaggcgagg ctcccgcaat 1702501 ggggcgcaat ggggatcgtc acaccagtag caccagcccc tggaggggca acaggggaaa 1702561 actgagttga gcgccaaccg tgcgcactga ggctcaggtg ctcagcttcg cgtcgggctt 1702621 tgaccccgcg tgaccgactg cgggttcgcc gatagacgtg tcatcccaac ggtcgtagct 1702681 cggtaggccg gcaagaccga acagcggcag ctagtggccg agtagatggt cgacgggttc 1702741 tttaccgatg tgggcgccgt cgccgttggg tttcgacacg ccatccacga catcgtaggc 1702801 cggcatggcg gcatagccga acagcggaag cgaatgtctg cgcaggtggt cgatcaggta 1702861 ctccccgttg ccttcggcta ggtcagcggg cttgccgttg ggaaccgact tggttgccgc 1702921 cttgcttgcg tgcccgttgg tgttgcggac gaccttggtg tggggcggct tgggcgccgg 1702981 gatcggggcc ttgcgtcggt ggcctttcac ccgccgcagc caccgatcgg ctttggtcgg 1703041 cggcgtggat gggtcgcgtc ccagctcgga cggccaccag tttgcccggc caatcatcgt 1703101 ggtcaaggcc ggcaccgtga ccgtgcggac caggaaggtg tccagcacga tcccgatgcc 1703161 gatggtgaaa ccggcctgag ccatcgtgtt gatgttcgcg cccaccagac cgaacatcga 1703221 cgcggcgaag atgagacccg ccgaggtgat aacaccaccg gtggagccca cggttcggat 1703281 gacgccgatg cgtataccgt gtggtgattc gtcgcggatg cgtgaaatga gcagcatgtt 1703341 gtagtcagcg ccgatggcaa ccaataatat gaaggacagt cccggcaggc tccaatgcat 1703401 ttcctggccc agtatcaatt ggaaaacgag agttcctatg cctagggccg acaagtaaga 1703461 aatcagcacc gagcctatca gatatatcgg agccacaagt gcgcgcagca gaatgacgag 1703521 aatcaagaat acgataacga tcgtcgcaat gacgatgaat ttcatatcgc tgttgtagta 1703581 gtcgcggata tcccgcagcg cagtcggaac ccccgccaga cctatcgtgg catcctcgag 1703641 ttcggtattc ggtcgcgcgg aatccgcaac acggaggata tcgttgacct gatccatcgc 1703701 ctcggtggtg gccggattca gcgcgctctg cacgaagtac cgcgccgcat gaccatcggc 1703761 cgacaggaaa atctgggcgc ccttcttgaa ctcgtccctc gaaaaaatct gcggtggaat 1703821 gttgaagccc gccattgacg gcttgtccgc atcccgcttg atccccaaca ggaagtcggc 1703881 ggcctcgttg agccctgagc ccatcttttt gacctgatcg accaattcct gcacgcctgc 1703941 cgccagcgct gcgctgccgt cggcgagagc gttggctcct tgctgcattt gagccaattt 1704001 ggtgggtagg ccgtcgaccg ctttgagggt gctgacgact tgcttcagtt gcccgtccag 1704061 tgtgctcacc gtccgggcga gtgtctggta ttcctgcgtc tgttgcaggg tgacggctag 1704121 cgctctgatg gacctgagca ggccgtcgtc ctgcgcctgg acaatcgccg ccaactgtgc 1704181 gcgcgacgtc cgacaggcgg gatcgctgtt acacaccggg ctggagttga gggcgttgac 1704241 catagggctg gcccaagtgg cgatttgttc ggcatcggtg acggtcccgc tcagattgtc 1704301 ccccagagcc cgcatgcgcc cgacatattg ggacgcattt tccagttgtc ggatggtctt 1704361 gtcgccgccc atcaggtcca tcatggcctg cagggtgttg actatcccgc tcgagctggc 1704421 cacggcccca ttgatttcgt tgcgtatttg ggcgagggcg tcggccaact ggtgcgcacc 1704481 gccggtcagc tggtccagct cgcctccgtg ctcttcgagc agggtggtcg cttcgtcgag 1704541 cttgccgccc acttcaccag cctgaaacga gaccttggtc tccttcagag gttccccgtt 1704601 cggtcgggtc aagccccgca ccatcacgat gttgggcaat tctgctatct cgcgggacat 1704661 catctcgatg tcggcaagcg cgccgggtgt ccgcaggtct cggggggatt tgatgaacag 1704721 caccatcgga gtcatcgcgt tcatcgggaa atggcggttc atcgcctcgt atcctttgac 1704781 gctttcgacg tgctgcggca ccgtcttgag atcgtcgtag ttgaatcgga tcagcagcgt 1704841 gcagccggcc agggcgacca gcacaatgag actgccgacc aggtggatgg tgggccgacg 1704901 cacgatgcga acacccgaac gccgccacat tcgactggtc aggtcgcgtc gcggcttgat 1704961 ccagccccgc cgtccggtga gtgtcaggat ggcgggcagc agggtgaccg cacccagcag 1705021 cgacaccgtg atggcaaccg caattgccgg gcccaccgcc gaaaacactt ccagtttggt 1705081 gaacaccatc gccagaaatg tgacggcgac ggtggccgcc gatgcggtga tcaccttgcc 1705141 gatggacatc aacgccttct tgaccgccat gtccgatttt tcgccgtggc gcacatagtc 1705201 gtgatagcga cttatcagaa agacggcgta atcggttccc gccccgatca tgaccgcgct 1705261 cataaagacg atcgcctgca tgttcacggc caggccgaac tcggcgagcc cggacaacgt 1705321 gccctgcgca gtgaccaccg acgctccgat ggtggccagc ggcaccagca tggtcaccag 1705381 gttccgatag acgaggatca ggatgatcag cacgctgacc gcggcgccga tctcgatgat 1705441 ccgcacatct ttctcgccga gctccgtcag gtcggcgacc gtggcgatcg gaccgctgag 1705501 gtggacggtc aggctggttc ccgcgactgt ttgcttgacg atcgcggcga cgcgtttgaa 1705561 cgccgcttgt gtctcaggcg acgcggcatc gcccgcgaac gtgatgggca ggttccaagc 1705621 cttgttgtcc ttgctggcca acagctcctt catttcgggg acggcgagaa aatcctgaac 1705681 cgatattttg tcctgcgtgt ccgcccgcag gttttcgatc agttttcggt agacggcctc 1705741 gtcggcgggt cccagcccgt tctcgttggt caagaggacc aaaaggaggg cggaggtctc 1705801 aattttttcc tggaaagccg cgctcatctc cttttgcagg accatcgatg gggccccggg 1705861 cggcagggga gcttgctcgc gctttgcggc ttgcgcctgc agcgttggga gcaacagcgt 1705921 cagcgcggcc gccaccgcga tccagcaccc aatgacgatc agcggccatc gcaccacgaa 1705981 gttgccgata cggtcgaaca gtcccccggc tttggcctcg tcatgccttg ccacccgata 1706041 accgtacaag cctggcaatc ggtggcgtgg ggaaatgacg ataaccgcat taaccgtgac 1706101 gttgccgtta ctttggcggc gtttgaccac tgcgggcgtc aaatacgcag atcaggggca 1706161 tttcgtggga tcggctggcg tgcccgcagc cgacgctggc gggcgggatg cggcgtccga 1706221 acagatagct cgctggactc aaacttgcac ggtcgtgctg gtttgcggtc acggtccggc 1706281 aaagtgggca tttcggtcct ggtgcacctc gcggtcgtgc gacactctcc ccgtggctct 1706341 taggtatcgc ctgcagtcca atccgttggt cggcaagctc acgaccaagt acttcttgcc 1706401 gcttggcact cgccaggtcg gcgatcacgt ggtgtttttc aacttcggct acgaggagga 1706461 tccgccgatg gcgttgccgc tgtcggagtc cgacgagccc aatcggtatt gcatccagct 1706521 ctaccaccag acggccagtc aggtggacct caccggcaag gaggtgctag aggtcagttg 1706581 tggcgccggt ggcggggcct cctacatcgc ccgcaaccta ggtccggcct cctacacggg 1706641 gctggacttg aatccggcca gcatcgacct ctgccgggca aagcaccggc tgcccggcct 1706701 gcagttcgtg cagggcgacg cgcagaacct gcctttcccc gacgaatcct tcgatgcggt 1706761 ggtcaatgtc gaagcctcgc accagtaccc cgactttcgc ggcttcttgg ccgaagtggc 1706821 gcgcgtgctt cgcccgggcg gacacttcct ctacaccgat tcccgtcgaa atcccgtcgt 1706881 cgccgaatgg gaggcggcgt tggccgatgc tccgctgcgc acgatttcgc agcgggacat 1706941 cggcgcgcag gccaagcgtg ggttggatgc gaacacggcg cgttcgcaag aggccatcgg 1707001 ccgccgcgca cccgtattgc tggccggctt gacccgctgt gcggtgcgtg tgctggactg 1707061 ggatctacgt cgcggcggcg ggttcagcta tcggatctac ttgttcgcca aggattgatt 1707121 cggcgagacc acacccatga aaaactcatg aaatttgtcg tggccagcta tgggactcgc 1707181 ggcgacatcg agccctgcgc agcggtcggc ctggagctgc agcggcgcgg ccatgatgtg 1707241 tgccttgccg tgccgcccaa cctgattggt ttcgtggaaa cggccgggct gtctgctgtc 1707301 gcatacggaa gcagggactc tcaggagcag ctcgacgagc agttcctgca caacgcgtgg 1707361 aaacttcaga accccatcaa gctgctgcgt gaagcgatgg cgcccgtcac cgagggctgg 1707421 gcggagctga gcgcgatgtt gacgccggtg gccgccgggg ccgacctgct gttgaccggt 1707481 cagatctacc aggaggtggt cgccaacgtc gccgagcacc acggcattcc gttggccgcg 1707541 ctgcattttt atccggtgcg agccaatggc gagatcgcct ttcccgcgcg gctgccggcg 1707601 ccactggtcc gctccaccat cacggccatc gactggctgt attggcgcat gacgaaaggt 1707661 gttgaggacg cgcagcggcg tgaactgggc ctgccgaagg cgtcaactcc cgcgccgcgg 1707721 cgaatggccg tacgcgggtc gctggagatc caagcctacg acgcgctttg cttcccgggg 1707781 ctggcagcgg aatggggcgg ccgacgcccg tttgtcggcg cgttgacgat ggaatcggcg 1707841 accgacgcgg acgacgaggt cgcttcatgg atcgctgccg atacaccgcc gatttatttc 1707901 ggctttggca gcatgccgat cggatccctg gccgaccggg tcgccatgat cagtgcggcc 1707961 tgcgcggagt tgggcgagcg cgcgttgatt tgctcgggac ccagcgatgc gaccggaatc 1708021 ccgcagttcg atcacgtgaa ggtggtgcgt gtggtcagcc acgcggcggt ctttcccacc 1708081 tgccgtgcgg tcgtccacca tggcggcgcg ggcaccaccg ccgccggtct tcgagccggt 1708141 atccccacct tgattctgtg ggtcacctcc gaccagccga tctgggctgc tcagatcaaa 1708201 cagctgaaag taggccgggg gagacgcttt tcaagcgcca ccaaagaatc gctgattgcc 1708261 gaccttcgaa cgatacttgc gccggactat gtcacccgag cgcgggagat cgcgtctcgg 1708321 atgaccaaac ccgccgccag cgtcacggcc accgccgatc tgctcgaaga tgcagcccgc 1708381 cgtgcgcgct aagcgagggt ggcgcttcgg cgaatggcct tcggcgcgag gatgatcgtt 1708441 gtacgctccg cttgtgtccc tgatgattac ggtgccggtg tttgggcagc acgaatacac 1708501 ccacgcactc gtggccgacc tggaacgtga gggcgccgac tatctcatcg tcgacaaccg 1708561 cggtgattat cctaggatcg gcaccgagcg agtgagcaca ccgggagaga acctaggctg 1708621 ggccgggggg agcgagctcg gtttccgact tgcgttcgcg gagggttact cccacgcaat 1708681 gacgctcaac aacgacaccc gggtctcgaa gggatttgtt gccgcgttgc tcgactcgcg 1708741 gctaccggcc gacgccggaa tggtcgggcc gatgtttgac gtgggttttc ccttcgcggt 1708801 agctgacgag aaaccagacg ccgaaagcta tgttccgcga gcgcgatacc ggaaggtgcc 1708861 cgcagtcgag ggaacggcgc tggtgatgtc gcgggattgc tgggatgcgg tcggcggcat 1708921 ggacctgtcc acgttcgggc gctacggatg ggggctcgac ctggatctcg cgttacgggc 1708981 tcgaaagtcc gggtatggcc tgtacacaac cgagatggcc tacatcaacc atttcgggcg 1709041 caagaccgcc aatacgcact tcggtgggca ccggtatcac tggggtgcaa gtgcggccat 1709101 gatccgggga ttgcgtcgaa cgcatggctg gcccgccgct atgggtatct tgcgggagat 1709161 ggggatggcc catcatcgta agtggcacaa gtcatttccg ctcacctgcc cggcgagctg 1709221 ctaggcgtgc tcccaggcgt ttggcgtgcc gtcgcctcca gcaggtccgc ggccgcggtg 1709281 acggcggctg tcggccgggt catccgtgtc gagatctcac gtgcccgcgc ggcgcattcc 1709341 ggcgccagga tcgatcgtag ctccttgagc aatgacccgc gggtgatgtt cgtaaagcgt 1709401 ttggcagagc cgactttgag tcgttggacg gcaccggccc agatcggttg atcggccacg 1709461 tcccagagaa tcagcgtggg cattcccgct cgcaggccgg cggcggtggt accggcgcca 1709521 ccgtggtgga cgaccgcgcg gcacttggga aggatggtcg aatagttgac caggccgaca 1709581 cgtttcacgt ggtcggcatg acgaatgcgg gtggagttgg ctgccggaga atagatcagg 1709641 gctcgctcgc cgagctgtgc gcagacatcg gagatcatgg cgagcgtttg gacgggcgtt 1709701 tggacgggcg tgctgccgaa gccgaagtag atgggtggtg ttccggcggc gatccacgac 1709761 tcgagttctt cgttgggttc gctgtgtaac tccatggtca gcgggccgac aaacgggcgg 1709821 cggtcgctcc attcggccgc cagtccgggg aaaaaaaccg ggtcgtaggc ttggatttcg 1709881 ggcgctccgc gttccgccag ccgacgcacc gccggcgccg gtgctggcgg taggcccagt 1709941 tcacgtcgtt gcgcgcgatc ggcatccttg ctgacgtacg catacagccg ccatgagacc 1710001 ttcatcgtcg cgcgcaccag agtcgccggc gtcggtatcg acgggatcgc gatttggccg 1710061 ttgacctgca tcggaaagtg atgcagtgcc gcagccggaa tgtcgtagta ctcggcgacg 1710121 ttggctgcca caccatgata tgtctggccc gtcatcacca ggtcggcgcc gtcggccaac 1710181 gtggtcaacg tcgtgcccat ctccgcccag ccttcgacga atagttcctt gacggcgcgg 1710241 gcgaggttga gcggattctg ggctctggtg aggttgcgga cgaatgccgc gaccgtgttg 1710301 atctgttcgt ccgagtccgg gccgtaggcg acgccggtca gacctgccga ctcgacgaac 1710361 tcgatcaggt tgggcggcac tgccatatga actgcgtggc ctcgccgccg cagctccacg 1710421 ccaaccgcgg cgcaaggttc gacatcaccg cgggttccgt ggaccgccaa gacaaacttc 1710481 atcagcgcct tcccgcgttc gacgtcaggc gggtgccggc gcgtccctgt cggccgccaa 1710541 cttgtcgcac atcagatccg ccaggccacg aacggtggtg ttgatttcgg tggcggaaat 1710601 gcggatcccg gtttcggctt ccacccgcgc acgcagttcc tggctgctca gtgagtccag 1710661 gccgtactcg ctgagcagcc ggtcggtgtc gatggtgcgg cgtaggatta ggccgacctg 1710721 cttggagagt agccgccgca gccggtctgg ccattcctcg cggggcaggt ccaccagctc 1710781 ggcaaggaat ttgcttgtgc ctgaacggtt ttgccccagg gattggaact tctccgcgaa 1710841 tgggctgtgc tgggcgaagg ctgtcagcca gggtgatccg atcaccgggg cgtagccgct 1710901 gtaggcgcgg ttgtggcgca gcagggtctc gaaggcgtag gcgccttcct cgggggcgat 1710961 ggcgtcgccg gtttgttcgg caaaggcgat cgcgcggccg atctggcccc aggcgcccca 1711021 ggcgatggag gtggctggta ggtcttgggc tcgccgccag tgggtgaagg tgtccagcca 1711081 gctgttggcc gcggcgtagg cgccctgacc cggcgagccc accagggcgg ccgctgagga 1711141 gaatgagcag aaccagtcca gcggctggtc cgcggtggcc cggtgcagtt gccaggcgcc 1711201 atatgccttg ggcgcccagt cgcgttcgat gagttcgtcg gtgatgttgg ccaaggtggc 1711261 gtcctcgacc accgcggccg cgtgcagcac gccgcgcagc ggcaaacccg tcgcggtggc 1711321 cgccgtgacc aaccggtcgg cggtgtccgg ctgggcgata tcgccgcact ccaccactac 1711381 gtcagacccg atcgcgcgga cgagttcgat ggtctccaac gccttttggc tgggctgtga 1711441 gcgcgagctg agcacgatgc ggccggcccc ggcgttggcc atcttctcgg ccaggaataa 1711501 gcccagccca cccaggccac cggtgatgat gtaggacccg tctgaacgga aaacccgagc 1711561 ctgttcgggg ggaagcacca cgctgctgcg cccggcgtgg gggacgtcga ggatgagctt 1711621 gccggtgtgc tcggccgcgc ccatcacccg gatcgcggtg gccgcctcgg ccagcgggta 1711681 atgggtgctc tgcggcatcg gcagcacacc ctcgacggtc aaccgataca ccgtgctcaa 1711741 cagttcgcgg accgcagccg gatggctcac cgacatcaac cccaggtcta gaccgtagaa 1711801 cgccagattg cgccggaatg gcaagagttc cagtcgggta ttggagtaga tgtcgcgttt 1711861 gccgatttcg atgaagcggc cgcccagggc cagtagtttg aggccggcca actgtgcggc 1711921 accggtcacg gagttgagca cgatgtccac gccgtagccg gcggtgtcgc ggcggatctg 1711981 ctcggcgaac tcgacgctgc gcgagtcata gacgtgttcg atgcccatgt cgcgcagcag 1712041 gtctcgacgc ttttcgttgc ctgcggtggc gtagatctgg gctccggccg cacgcgcgat 1712101 cgcgattgcg gcctggccca ctccgccggt ggcggagtgg atgagcacct tgtcgccggc 1712161 cttgatccgc gccaggtcct gcagcccgta ccacgcggtg gcgctggcgg tggtcactgc 1712221 cgcggcttgg gcgtcggtca gcccctcggg cagtctggtg gccaggcggg cgtcgcaggt 1712281 gacgaacgtg gcccagcagc cgttgggtga catgccgccg acccggtcac cgaccttgag 1712341 ttcgctgacc ccgggcccga ccgcgctcac caccccggcg aaatcggtgc ccagctgcgg 1712401 ctgtcgcccg tcgagggttt ggtagcggcc gaaggtgacc agcacgtcgg cgaagttgat 1712461 gctggacgcg gtgacggcga cctcgatctc tcccgggccc ggcgggaccc ggtcgagcgc 1712521 ggcgaactcc aaggtttgca ggtcaccggg agtacggatc tgtaggcgca tgccggcctc 1712581 ggcgtggtcg acgacggtgg tttgccgctc ctcggggcgc agcggggctg ggcacaaccg 1712641 ggcggtgtac cactggtcgt tgcgccaggc ggtctcatcc tcgccgctgg ccgccagcag 1712701 ctgacgcgcc accgactccg cgccggtctg ctcatccaca tcgacatagc tggccttcaa 1712761 atgcggatgc tcagcaccaa tcacccgcaa caacccccgc atcccaccct gctcaagatt 1712821 gggtcggtca ccagacaaca ccgcctgagc attgtgggtc agcacataca accgcggctc 1712881 ttgggccgtg atctctggaa tctcgcgggc gatacgcacc acatgtttga caagctcgcc 1712941 gccgcgcacg ggggattccg cgtcggggtc gccggtctgc ggcgcggtca acacgaatac 1713001 gccggtgaac ccgccggtgc cgagctggtc gcgcagccgc gcggcctggg ctgcgtggtc 1713061 ggcgcgctgc ggccaggaca tcgttgtgca ctgcgcgtcg tgcaccttca gcgcgtcggt 1713121 caactgtgcg gccaccaaat ccgtagcgtc acacgtgctg atcagcagcc aggcgccggg 1713181 ttcggcgtgg ctgttttcgg gcagctcacg ttcgtgccat tcgatgctca gcagccgctc 1713241 acccaaaacc cgggcacgtt cgctggcctg cgacgcgccg gtacccaact gcagcccacg 1713301 caccgccaac accaccgcgc cgtgctcgtc caacacgtcc aggtcggctt ccacgcccac 1713361 accgcacgcg gtcaccgtcg tgcagcagta ccgggcatga cgggccgacc cataggaccg 1713421 caaccgccgc acacccaacg gcagcaacaa accaccgtcg gccataccct ggacggcggg 1713481 atgagccgcc accgactgga agcacgcatc cagcagcacg ggatgcacgc cgtaagcttt 1713541 gacctgcgag cgaagcgggc ccggtaggtt gacctcggcc agcaccgtgt cgccggcccc 1713601 ttcggcgatg tacgcgtcaa ccagacccgc aaaagccggc cctaagcgat gaccacgctt 1713661 gtccagccat tgccgaacct cggcgccgtc caccttgtgg ggatggctgg ccagcagttc 1713721 ggcgatgttt ttctggggtg gctggtccgg ggcgtcgtcg gcttcccgga caacgtgcag 1713781 aaccgcggcg agttgccgtg tgtacctacc atcatggctg gtctctactg tgagtgggac 1713841 aacgccgggg gcttctaccg tggcggtgac gccgatgggg gtttcgtcgt cgagcagcaa 1713901 catctgctcg aatcggatgt cgcggacttc ggaggcttcg ccgaggacgg cgcgggctgc 1713961 ggccaacgcc atctcgcagt aggcggctcc cggaagggcg gccgcgccgt ggatttggtg 1714021 atcggccagc cagggttgtg tcacggtgcc gacctcgccc tgccagacgt ggcgttccgg 1714081 ctcctcgggc aggcgcacgt gggagcccag caatgggtgt acggcgacgg tattggcgtg 1714141 ggcgatgcga cgggtcgtgt cgtcgagcag cagacgacgg tggttccatg tgggcagtgg 1714201 tgcgttgatc agtcggccgg tggggtagag cacggcgaag tcgacggcgg cgccggcggc 1714261 gtagaggtcg ccggccagtg cgcgcagccc gtggggcagt ggttgttcgc ggcgcatgcc 1714321 ggccagcgca gccgcggaca tgtcgaggct gcgggcggtc tggtcgaccg cgtgggtcag 1714381 cagggggtgg ggggtcagct cggtgaagac ccggtagccg tcttcgaggg cggcttgcac 1714441 cgccgcggcg aagcgtacgg tgtggcgcag gttgtccacc cagtagtagg cgtcgcagta 1714501 gggctcctcg cgcgggtcga acgaggtcgc cgagtagtag gggatttccg gttgcagcgg 1714561 gctgatttcg gcgagcgctt cggccagttc gtcgaggatc gggtcgacct gcggggagtg 1714621 cgatgctacg tcgacggcca cctcacgggc cagcacgtcg cgttgctccc aggcggccac 1714681 caggtcgcgt accgtctggg tggccccgcc gatcacggtg gactgcgggg aggccaccac 1714741 cgcgaccacg gcgtcgttga cgccgcgcgc catcaactcc gaaagcactt gttgagcagg 1714801 cagttccacc gatgccatgg cgccggcgcc ggcgatacgg gtcatcagcg ccgaccgccg 1714861 gcagatgacg cgcactccgt cttcgaggca gagcgcgccg gcgaccaccg cggccgcgga 1714921 ctcgcccagc gagtggccga tgaccgcgcc gggcgctacg ccgtaggact tcattgtggc 1714981 cgccagcgcg acctgcatgg caaacagggt cggttgcacc cggtcgatgc cggtcacgac 1715041 ctcgggggcg gtcatggctt cggtcaccga gaagccggat tccgcggcga tcagtggttc 1715101 gatcgcggcg atggtggcgg cgaataccgg ttcggtggcc agcaggtcgg cgcccatgcc 1715161 cgcccattgc gagccttgcc cggagaacac ccagaccggt ccgcggtcgt cttggccgac 1715221 cgcgggtggg tagggggttt cgccggtggc gacttcccgc agcgcctcgg tcagctccgc 1715281 ggtggtggcg gccagtacgg cggtgcgcac cggccggtgt ccgcgccggc gggccagggt 1715341 gtaggccaga tccgccggcg ccagctcggg tccttgggcg tcgacccaat cggccagccg 1715401 cgcggcggtc tgccgcagcg cgtcctgcga gctggccgac agcgcgaaca gcagcgcgcc 1715461 gtcgataccg ggtgtggccg gggtgtcgcc tggtgcaccg gattcggggg ctggcaccgg 1715521 tgcctgctcg acaatggcgt gcacattggt gcccgtcatg ccatacgacg acaccgccgc 1715581 gcgccggggc gtttcttgat cggcgccggg ccacggcgta atctcttgcg gcacaaacag 1715641 gttggtttcg attgcggcaa gcttgtcagg cagggccgtg aagtgcagat tctgtgggac 1715701 cacgccgtgt tggagggcca ggaccgcctt catcagtccc agcgctccag cggccgactg 1715761 ggtgtggccg aaattggtct tcaccgatgc cagcgcgcag gggccgtcgt tgccgtatac 1715821 ctcggccagg ctggcgtatt cgatggggtc acccaccggg gtgcccgggc cgtgcgcctc 1715881 gaccatgccc accgtagccg ggtccacacc ggccacatcc aacgcctccc gatacgccgc 1715941 gacctgcgcg gaccgtgatg gtgtcgcgat attgacggtg tggccgtctt ggttggcggc 1716001 cgtgccacga attacggcca ggatccggtc cccatcggcc agcgcatccg gcaaccgctt 1716061 gagcgccaac atgacacaac cctcaccgga gacgaaaccg tccgcggaaa cgtcgaacgc 1716121 atgacagcgc ccggtcgcag acaacatgcc caacgccgag cccgaggcga accgccgcgg 1716181 ttcgagcatc acgtagacac cgccggctag cgcaatgtcg ctttcgccgt cgtgcaggct 1716241 acgacaagcc aggtggatag cggtgaggcc agacgagcat gcggtatcta ccgtgatcgc 1716301 gggaccctgc aagcccatgg cgtacgccac ccgcccggat gcgaagcagg cattggtgcc 1716361 cgtgttgccg tacggccctt cgaaagtctg gttgtcggcg tgtaccaata tgtagtcggt 1716421 atgaaccaac cccacgaaaa cccctgtccg cgaggccatc tggttcggtg ttaggccgcc 1716481 gtgctccatg gcttcccagg aggtttccag caacaagcgg tgctgcggat cgatcgctat 1716541 cgcttctttc tccccgatcc cgaagaactc gggatcaaag tcgccgacgt tatcgaggta 1716601 cgcgccccat ttgcagtcgg tgcgtccggg cacgccgggt tcggggtcgt agtactcgtc 1716661 gatgtcccag cggtcggcgg ggatctcggt gaccagatcg tcgccccgca gcaacgcctc 1716721 ccacaaccga tcgggtgagt cgatgccccc cggcagccgg caccccatac caatgacagc 1716781 taccggcgta acacgtgtcc tatccacggt ctttgttctc tccttaccca cggttcaagc 1716841 ttttgccagc ggcgtatcgt cgaacttcgg tccgggttga tagaaccgca gcaccaaacg 1716901 cacccaccga cccccacgct tcacgccaac cctttagttc attggcgtga acagcagcgt 1716961 agccggttgc cccgatatat gtggaaaaat cgttcggacg tacaaaaaaa gttcctgacg 1717021 ctggcgtcaa ctcgaaactg cctcggaagt catgattgat tcatcagtca atattaaagt 1717081 cgcagttcac aactataata cgccggtgca gcggacaatt gcggaagcgc cggacgcctc 1717141 gcggtccgat gtcgcctttc cctgcctcgt cgtcaatatc tgatggtgga cgaccgcccg 1717201 tgccggaccg gcttaggtag ccagccgggc ttcgcgccac gcaatttgcc tagtcgtgga 1717261 agacggattg ccgaagtgtc gaaggcaacc cgaactccga tgttcaggtt atgccaattg 1717321 gtgcccggaa atccccgaaa tcgaaaatgt tacgtgcagg tttcactgga cggatcaagg 1717381 ccgtcgtcgc tgaagctggg cggctggggc gacatcgcgc gatccgccct cggcgatgcg 1717441 cacgtacgcc gattgcatcg tctctggatg ccgcgcgatc gagccctgcg cgatcggact 1717501 actgggggac aacgcggtga cggtgctctc ctcgtgaaac ttgttgaccc acatgcacgc 1717561 ttgcgcgccg atccagccat cgccgaatac tctggcattc atccggtcca gttgtattgc 1717621 gatgaccgca gacagcagaa gcgcgccggc cggcatcgag gcacgacggg aacggaagcc 1717681 gccacctaga ggatccaacg agcatctatg cttttccctt cccacggccg cgcgtgaggc 1717741 atcctcgctg tgcagcaccg ccaggtcagg gatcaacgcg ccgactattt ctccgtcgat 1717801 gtggctggac tgcacctgct ccgtctctct ttgctgccac cagcgccagg ttggttgtgg 1717861 aagctgagtc accgtcgggc gaaaccgtca gcgttgacga agcgttagag gtagtgtgct 1717921 gccgtggtcg cgtcttcgat tcccaccgcg ctgcgcgagc gcgccagtgt gcaccccaat 1717981 ggtgcggcca tcacctacat cgattacgag caggactggg ccggtgttgc cgaaaccctg 1718041 acctggtctc agttgtatcg gcgaatgctc aatgtcgccg agccgctccg gcatgtgggg 1718101 gcgaccggtg atcgggcagt gatactggca ccgcagggaa tcgaatacgt cgttggattt 1718161 ctcggcgcgt tgcaggccgg acgtatcgcg gttccgctgc cggttccaca tgccggcgcc 1718221 cacgatgagc gtacgatttc ggtgctaagc gacacttcgc ccgctgtcat tctgacgacg 1718281 tcgggggccg ttgacgatgt cagagaatgc gctcagccac agccaggcca gtccgcacca 1718341 tcaatcgttg agcttgattt gctggactta gattctcggc agcgctcccg cagccctggc 1718401 gcgcgcccaa ccggcaggga tacgccggaa accgcgtatt tgcaatatac ttcgggatcc 1718461 acccgtacgc cggccggtgt catggtctcg aacaaaaatg tcttcgccaa tttcgagcag 1718521 atcgtggccg acttctttgc gcccgagggg ggcgtcgtcc cgccggacct cactgtggtg 1718581 tcttggctgc cgctgtacca cgacatgggt cttctattag gcgcgatcat gccgatcctg 1718641 gcgggtgtac ccaccgtgtt gacgagtccg gtggggttcc ttcagcggcc ggctcgatgg 1718701 atacaactgc tggcacgtaa cggtcgcacg atttcggcag gaccgaattt cgctttcgaa 1718761 ttggcggtgc gtaagacgtc agacgacgac atggacggac ttgacctcgc cggcgtgcac 1718821 accatcctca acggcagcga gcgagtacac ccggcgaccc tcaaacgatt tgctgaacgg 1718881 ttcggccgct ttaattttgc cgccgcggcg ctgcggcccg cgtatggcat ggcggaagca 1718941 acggtgtaca tagcgacccg taatgtgaac gaaccaccag aaatcgtcga cttcgaatcc 1719001 gagaaactgc ctgcgggcca agcgatccgg tgcccgagcg gaagcggcac accgctggtc 1719061 agctacggcg tcccacggtc acagctagtg cgcatcgttg atccagacac gtgtatcgag 1719121 tgtccgcagg gatcggtcgg tgagatctgg gtgcaaggtg gcaacgttgc gtccggctat 1719181 tggcacaaac ccgaggagag caagcgcacg tttggcgcca ggattgtcac cccttcggcg 1719241 ggcacacccg aagcgccttg gctgcgaacc ggggattcgg gtttcgtctc cggcggcgag 1719301 ctgttcatca tcggccgcat caaggacctc ttgattgtgt atgggcgcaa ccacgctccc 1719361 gacgacatcg aggcgaccat ccaggagata acctccggcc gctgtgcggc gatcgcggtc 1719421 cccgaccacg gcaccgaaaa gctggtcgcg attatcgaac tcaagaaacg gggagactcc 1719481 gacgaggatg tggcggaccg gctgcgcatc gtcaagcgtg acgtcgccgc ggcgatattt 1719541 gattcgcacg gtctgagcgt ggccgatctc gttctggtgt cgcccgggtc gattcccatc 1719601 accaccagcg gcaagatcag gcgggcacag tgcgtccagc tttaccgacg gcgtgagttc 1719661 acccggttag acgcttgact gcatcgttgg agcttgtttt ccattgtgct acaaccggtt 1719721 tgctgtctct gtggcccagt gttagtgggc cgctcggcat tgactgagca cgacacgatt 1719781 cctagtgtgc tggtatgtcg gacggcgcgg tggtacgggc attggtattg gaggcgccgc 1719841 gcaggctggt cgtgcgccag taccggctgc cgcgcatcgg cgatgatgac gcactagtgc 1719901 gagtagaggc ctgcgggctg tgcggcaccg atcacgagca atacacgggc gagctggccg 1719961 gtgggtttgc cttcgtacct ggccacgaga cggtcgggac gattgcggcc atcggtccgc 1720021 gggcggagca gcggtggggc gtgtcggccg gcgaccgagt agccgtcgag gtattccagt 1720081 cgtgtcggca gtgcgctaac tgtcgtggcg gcgagtaccg gcgttgtgta cggcatggcc 1720141 tcgctgacat gtacgggttc atcccggttg accgagagcc tggcctgtgg ggcggttacg 1720201 ccgaatatca gtacctggca ccggattcga tggtgttgcg ggtggccggt gacctcagcc 1720261 cggaagtggc caccttgttc aacccgctgg gggcgggaat acgttgggga gtaacgattc 1720321 ccgaaaccaa accgggcgac gtcgtggcgg tgctgggtcc aggaatccgg gggctgtgcg 1720381 ccgccgcggc ggcaaaaggg gccggtgccg ggttcgtgat ggtgaccggg ttgggacccc 1720441 gtgacgccga ccggttggcg ctggcggcac agttcggagc cgacctcgcc gtcgatgttg 1720501 cgatcgatga cccggtcgcc gccctgaccg aacagaccgg tgggctggca gacgtcgttg 1720561 tcgacgtgac cgccaaggcg ccagcggcat tcgcacaggc gatagcgcta gcccggcccg 1720621 ccgggaccgt tgttgtcgcc ggcacccggg gcgtgggcag cggggcaccg ggattttcgc 1720681 ccgacgtcgt tgtgttcaag gagctgcgtg tgcttggcgc cctcggcgta gacgccaccg 1720741 cctaccgggc cgcgcttgat ctgttggtgt ccggtcgata ccccttcgca agcctgcctc 1720801 gccgctgcgt gcggctcgaa ggcgccgagg atctgctggc taccatggcc ggtgaacgcg 1720861 acggtgtccc gcctatccac ggagtgctca caccatgaca acatcccgcg tgcccctgtt 1720921 gccggtcgac gaggccaaag ctgctgccga cgaagcgggc gtgcccgact acatggctga 1720981 gctcagcatc ttccaagtgt tgctgaatca tccgcgacta gcgcggacct tcaacgacct 1721041 gctcgccacc atgctgtggc acgggaccct ggactcacgg ttgcgtgagt tggtgatcat 1721101 gcggattggt tggctcaccg actgtgacta cgaatggacc caacactggc gggttgcttc 1721161 agggcttggc gtgtcggccg acgatctgct cggtgtacgg gattggcaag ggtacaacgg 1721221 gttcgggccc gctgagcagg ccgtcctggc ggccaccgat gacgtggtgc gcgagggcgc 1721281 ggtgagtgcg cagagctggt cggcttgcga gcgggaatta cattgcgaca aagtggttct 1721341 catcgaactc gttacggtga taagcgcatg gcgaatggtc gcttcgatcc tgcacagcct 1721401 cgaggtccca ctggaagacg gcgtttccag ctggccgccc gacggccttt cgccaaggtg 1721461 actgcgccga gcgtgtaacc atggcgagat tccgccggcg atttttccgc cctgagtgca 1721521 cgttcggcgc agaagcacta gacgatccgg taggtctgca cagcgtgagc gacgatgttc 1721581 ccgtcgggat cggttgcggt gatctcggta aaggtgagtt ccttgcggcg tcgggcagtg 1721641 cgcgcatgac agagcaagtc acaccgcttg gcggcgccgg tgtactggat gctcatcgcg 1721701 accgtggcgg cgcgggtgcc cctgtcgaag tcgtggttcg accaagcggc ggcggcaccg 1721761 gcggtgtcca tcaccgacgc gatcacccca ccgtgaaagt aggtgccgtc attggtgagg 1721821 tcggtgcgaa acgggagtcg gatcacgacg tcgtcgggtt cgtagcgttc gaacacgatg 1721881 ccgagcccgc cgatgaacgg cgtcctcggc atcagctcac gcaccgcctg gcgacgtttg 1721941 tgctgctctt gggcggtcaa cgggtcggac atggcaggta atctacccta ttagattgac 1722001 atatcaatca ataactctta gcgtcgtcgc aatgcggacc agagtcgccg agctgctcgg 1722061 tgctgagttt ccaatatgcg cgttcagcca ctgccgggat gtggtggcgg cggtgtccaa 1722121 tgcgggcggg ttcgggatcc tcggtgccgt cgcacatagc cccaaacggc tggagagcga 1722181 gctgacctgg atcgaggagc acacgggtgg caagccgtac ggagtcgacg tgctgctgcc 1722241 gcccaaatac atcggcgccg agcaaggcgg tatcgatgcc cagcaggccc gggagctcat 1722301 acccgaaggg catcgcacct tcgtcgacga cttgctggtt cgctatggca tccccgcggt 1722361 caccgaccgg cagcgttcgt cctcggccgg tgggctgcac atctcgccca agggttatca 1722421 gccgttgctg gatgtggcct tcgcccatga catccggttg atcgccagcg cgctcgggcc 1722481 gccgccaccg gatctcgtgg agcgcgccca caaccatgac gtgctggttg ccgccctagc 1722541 cggcacggcg cagcacgcgc ggcgacacgc ggctgcgggt gttgacctga tcgtcgcgca 1722601 gggcaccgag gccggaggcc acaccggcga ggtggcgacc atggttctgg ttcccgaagt 1722661 cgtcgatgcg gtgtcgccaa cgccggtgct ggccgcgggc gggatcgccc gtggccgcca 1722721 gatcgctgcg gcgttggccc tgggggcgga aggcgtctgg tgcgggtcgg tctggttgac 1722781 caccgaagaa gccgaaacgc ccccggtggt caaggacaag tttctggccg caacatcctc 1722841 ggacacggtg cggtcccggt cgctaaccgg caagccggcg cgcatgctgc gcacggcctg 1722901 gaccgacgaa tgggatcggc ctgacagccc cgacccgctt ggcatgccgc tgcagagcgc 1722961 gctggtcagc gacccgcagt tgcgcatcaa ccaggccgcc ggccagcccg gggccaaggc 1723021 tcgtgagctg gcgacctact tcgtcggaca ggtcgtcggc tcactcgacc gggtgcggtc 1723081 ggcccgctcg gtggtgcttg acatggtcga ggagttcatc gacaccgtcg ggcaactgca 1723141 ggggttggtg caaaggtgag ccgcgctagc gcgcggcggc gccgagcggt cagcgatgag 1723201 gacaagtcgc aacggcgcga cgagatcttg gccgcggcca aaatagtgtt tgctcacaag 1723261 ggttttcatg ccaccaccgt cgcagacatc gccaagcagg ccggcctggc gtacgggctg 1723321 atctactggt acttcgactc caaggacgac ttgttccacg ccttgatggc cggtgaagag 1723381 gaggcgctgc gcgcgcatgt cgcggccgaa ctggcccgcg ttggcgggtc taccgaggcg 1723441 ccgcttcggg ccctgttaca ggccgcggta caggccacgt tcgagttctt cgaaaccgac 1723501 aaggctaccg tcaaactact gttccgtgac gcttacgcgc ttgggggccg attcgaagag 1723561 catctcggcg gaatctacga gcggttcatc gacgacatcg aagccgtcgt tgttgccgct 1723621 caacggcgcg gtgaggttgt cgaggccccg tcccggatgg ccgcgtacac gttggcggcg 1723681 ctggtggggc agttggcaca ccgacggctg aataccgacg ataacgtcac cgccgcccag 1723741 gtagccgact tcgtggtgtc gctggtgcta gacgggctgc gtccgcgtgc actggcggtc 1723801 ggggcccgcg gtggtcgggc cgcccgaacc tgagcaaagg ctgccaaata catggtgaac 1723861 gcgtaaggat tcgcgacacc cgcccggatc acgttgaccg agacgggtag gtcgtgcatg 1723921 atcggtccgg taagcacctc gttaggtgag gcggctacac gaacataggc cactgacccc 1723981 gaacgtcgag agacgccccg ggtcaggaca gctcttcccg gcttaagggt tgagcccagg 1724041 tggcttccgg cttaccggac acgtcgtgtg gtgccgaagc tctgacgaga ggggtgcgga 1724101 tttccggcag ttgccggcat ctctgtactc ctgtgacgcg ctttatcgtg cggacaaccg 1724161 tacgtgtcgt ggccgtgagg aggtgaggga cgcatgagtt ccggtgacag tccggaccga 1724221 tatccgggct ctgtttcgtc ccgatccggt ttccggcgcg acgttttgcg ctgagtcgtc 1724281 aaaccaagat cagccttctt ggatcggaac cgctacggga cgggaccaac tcggttcagt 1724341 ccatatgtgc tcgttttgat ttccgtcctc gcttgcaact ccgtctagga ggcgatcatg 1724401 accgctgctc tgcacaatga cgtagtaacc gtagcttcgg cccccaagct gcgggtggtg 1724461 cgggatgtgc ccccggcccc cgcgtccaag aaggttgctc gccggctcga cgcgcagcct 1724521 ttcggcaccg gaggggaccc gctggtcgac ggggcagctc gtttgctgag cattccgctg 1724581 cgccacctct acgccgcgtt gtggcgcgtc gggctgctcg aggtccaggc ctagtccgat 1724641 gggcaggcag ccgaccttgc gccgcgatgt ggatttgcgg cgctgggcga caatccccgt 1724701 agaatcaggg gaacggcatc gatccggcga tcaccgggga gccttcggaa gaacggccgg 1724761 ttaggcccag tagaaccgaa cgggttggcc cgtcacagcc tcaagtcgag cggccgcgca 1724821 tcggcgtggc aagcggggtg gtaccgcggc gttcgcgcac cggcgtggcg tcgtccccga 1724881 gcctggattg caggcacgca gtgccgaacg gtgctggggc ctggggagac gacgcgcaaa 1724941 gtgaccgata acgcatatcc aaagctggcc ggcggggcac ccgacctccc ggcactcgaa 1725001 ctcgaggtcc tcgactactg gtcccgtgac gacaccttcc gggccagcat tgctcgccgc 1725061 gatggcgccc ccgagtatgt gttctatgac gggccgccgt ttgccaacgg tctgccgcat 1725121 tatgggcacc tgctcaccgg ctacgtcaaa gacatcgtgc cgcgatatcg cactatgcgc 1725181 ggttacaagg tggagcgtcg cttcggctgg gacactcacg ggctgcccgc cgaactcgaa 1725241 gtcgagcgcc agcttggcat cactgacaaa tcccagatcg aggccatggg tatcgccgcc 1725301 ttcaacgatg cctgccgcgc atccgtgttg cgctacaccg acgagtggca ggcgtatgta 1725361 actcggcaag ctcgctgggt cgacttcgac aacgattaca agacgctcga tctggcttac 1725421 atggagtcgg tgatttgggc cttcaaacag ttgtgggaca agggcctggc ctacgagggc 1725481 taccgggtgc tgccgtactg ctggcgcgac gaaactccgc tgtcgaatca cgaactgcgg 1725541 atggacgacg acgtctacca aagccgccaa gatcccgcgg taacggtggg cttcaaggtg 1725601 gtgggtggcc aaccagacaa cgggctagac ggtgcctact tgctggtgtg gacgacgact 1725661 ccgtggaccc tgccgtcgaa cctcgcagtt gcggtaagcc cggacatcac ctacgtacag 1725721 gtccaggcgg gcgatcgccg tttcgtactg gccgaggcac ggctggccgc ttacgcccgc 1725781 gaactcggtg aagagcccgt ggtgctcggc acctatcgcg gcgccgaact gctgggcacc 1725841 cgctacctgc cgccgtttgc ctatttcatg gactggccca acgcttttca ggtgctagca 1725901 ggcgactttg taacgaccga cgatggcacc ggcatcgtgc atatggcacc ggcctatggt 1725961 gaggacgaca tggtggtcgc ggaggcggtc ggtatcgcgc cggtgactcc ggtcgactcc 1726021 aagggacgct tcgacgtcac cgttgccgat taccaagggc agcatgtctt tgacgccaac 1726081 gcgcagatcg tccgggacct gaagacccaa agcggcccgg ctgcggtgaa tggcccagtg 1726141 ttgattcgtc acgaaaccta cgagcaccct tacccacact gctggcgatg ccgtaacccg 1726201 ctgatctacc ggtcggtgtc gtcgtggttc gtcagggtga cggacttccg agaccgcatg 1726261 gtggagctaa accagcagat cacgtggtat cccgaacacg tcaaggacgg ccagttcggc 1726321 aagtggctgc agggcgcccg cgattggtcg atctcccgga atcgctactg gggtaccccg 1726381 attccggtat ggaagtccga cgacccggcc tacccgcgca tcgatgtcta cggcagcctc 1726441 gacgagctgg agcgcgactt cggcgtacgc ccggccaatt tgcaccggcc ctacatcgac 1726501 gagctcaccc gtcccaaccc agacgatccg actggccgta gcacgatgcg acgcattccc 1726561 gatgtgctcg acgtgtggtt cgactcggga tccatgccgt atgcccaggt gcactacccg 1726621 ttcgagaacc tggattggtt ccagggacac taccccggcg acttcatcgt cgagtacatc 1726681 gggcagaccc gtggctggtt ttacacactg catgtgttgg cgaccgcgct ctttgaccgg 1726741 ccggcattca aaacctgtgt ggcgcatggg attgtccttg gtttcgatgg ccagaagatg 1726801 agcaagtcgc tgcgcaacta tccagacgta acagaggtgt tcgatcgcga cggctccgac 1726861 gccatgcggt ggttcctgat ggcatcgccg attctgcgcg gcggcaacct gatcgtcact 1726921 gagcaaggaa ttcgcgacgg tgtgcgacaa gtcctgctgc ccctgtggaa cacctacagc 1726981 ttcctggcgc tgtatgcacc gaaagtcggt acctggcgcg tcgattcggt gcacgtgctg 1727041 gatcgctata tcctggccaa gctggcggtg ctgcgcgacg acctcagcga gtcgatggaa 1727101 gtttacgata ttcccggtgc ctgtgaacat ttgcgtcagt tcactgaggc gttgactaat 1727161 tggtatgtgc gacggtcgcg ttcgcggttc tgggcagaag acgccgatgc catcgacacg 1727221 ctacacaccg tgttggaggt gaccacgagg ctggccgccc cgctgcttcc gctgatcacc 1727281 gagataatct ggcgtggtct gacacgcgag cgatcggtgc acctgacgga ctggccagcg 1727341 cccgacctgc tgccgtcgga tgccgacctg gtcgccgcga tggaccaggt ccgcgacgtg 1727401 tgctcggcgg catcctcgct gcgcaaggcc aagaagctac gggtgcgcct gccgctaccg 1727461 aaactcattg tggcagttga gaatccgcaa cttctgaggc cgttcgtcga cctcattggc 1727521 gacgagctta acgtgaagca ggtcgaactg accgatgcca tcgacaccta tggccgattc 1727581 gagctcacgg tcaacgcccg ggtagccgga ccacggctgg gcaaagatgt gcaggccgcc 1727641 atcaaggcgg tcaaggccgg cgacggcgtc ataaacccgg acggcacctt gttggcgggc 1727701 cccgcggtgc tgacggccga cgagtacaac tcccggctgg tggccgccga cccggagtcc 1727761 accgcggcgt tgcccgacgg cgccgggctg gtcgttctgg atggcaccgt cactgccgaa 1727821 ctcgaagccg agggctgggc caaagatcgc atccgcgaac tgcaagagct gcgtaagtcg 1727881 accgggctgg acgtttccga ccgcatccgg gtggtgatgt cggtgcctgc ggaacgcgaa 1727941 gactgggcgc gcacccatcg cgacctcatt gccggagaaa tcttggctac cgacttcgaa 1728001 ttcgccgacc tcgccgatgg tgtggccatc ggcgacggcg tgcgggtaag catcgaaaag 1728061 acctgaggtc gactgggcga cgagcgtaac gtcacggctg aaaatccgtg cccgacttcg 1728121 ccgtggcgtt acgctcgcgg cgcggggacc cgatctctag ggcgttgtcg cccagatcca 1728181 cgtcggccaa ggccgatggc agcggctgag gttgatcgcc atagcgaaaa ctagctcggt 1728241 agccccaaat agcatcacgg gtgtggagtc ccgctgggtg ctgcacctgg acatggatgc 1728301 gtttttcgcc tcggtcgaac agctcacccg gccgaccctg cgggggcggc cggtgctggt 1728361 tggcgggctg ggtgggcgag gtgtggtggc cggcgcgagc tatgaagcgc gggcctacgg 1728421 tgcccgatcg gccatgccga tgcatcaggc ccgcaggctg atcggggtga cggccgtggt 1728481 gttgccgcca cgcggggtgg tgtacgggat cgccagccgc cgggtattcg acaccgtgcg 1728541 cggcctggtg cccgtcgtcg aacagctttc tttcgatgaa gcgttcgccg aaccgcccca 1728601 actcgccggg gcagtggccg aggacgtcga gacgttctgc gaacggttgc ggcgacgggt 1728661 gcgcgacgag accggcctga ttgcctcggt cggagcgggc tcgggcaagc agatcgccaa 1728721 gattgcttct ggtctggcca aacccgacgg cattcgggta gtccggcacg ctgaagagca 1728781 agcgcttctc agcggattgc cggtacgacg gctgtggggc atcggcccgg tcgccgagga 1728841 aaagctgcat cggctcggca tcgagacgat cgggcagctg gccgcgctga gcgatgccga 1728901 ggcggccaac atcctaggcg cgacgattgg gcccgcgctg caccggctgg cccgtggcat 1728961 cgacgaccgc ccagtggtgg agcgcgccga agccaagcaa atcagcgccg agtccacgtt 1729021 cgccgtcgat ctgaccacca tggagcaatt gcacgaggcg atcgactcca tcgctgagca 1729081 cgcgcaccaa cgcctgctgc gcgacggccg cggcgcccgc accatcacgg tgaagctaaa 1729141 gaaatccgac atgagcacgc taacccgctc ggcgacgatg ccctacccga cgaccgacgc 1729201 cggcgcgctg tttacggtgg cccgccggct gctgccggat ccactgcaaa tcgggccaat 1729261 tcgtcttctg ggtgttgggt tttcgggttt gagcgacatt cgccaggagt cgttgtttgc 1729321 cgactcggac ttgacgcagg aaacggcggc agcgcattac gtcgaaacac cgggagcggt 1729381 cgtgccggcc gcgcacgacg ccacgatgtg gcgggtcggc gatgacgtcg cccaccctga 1729441 gcttgggcac ggctgggtgc agggagcggg ccacggcgtg gtcaccgtgc ggttcgaaac 1729501 gcgtggttca ggcccgggct cggcgcggac gttccccgtc gacaccggcg acatcagcaa 1729561 cgccagcccg cttgacagct tggactggcc ggactacatc ggccagctat cggtcgaggg 1729621 gtccgccggc gcctcagccc caacggtcga tgacgtcggc gaccggtgag ttggccgcca 1729681 gcgcggccat tagcagcacc cgggcctggg acggcggcag tcgcggtacc atcaccgcgc 1729741 cagcctccac caggtcgtgc ccgggaccat agcctgcgcc gacccgcgcg ccggcgaccc 1729801 gggtagacac cgcgatcacc accggatcgc tcccgtctcg acagtggcga cggactccct 1729861 cgatcacggc ggccccggca ttgcccgagc ccagcgcctc cagcaccacg gcgcgcgcgc 1729921 cggctgccac acaggcgtcc atcgccaccg cgtcacttcc cggatagacg gcgacgatgt 1729981 cgactcgtgg cgccacggca gcgcccagat cgccgagata gggccgcgtc ttggtgcgcg 1730041 tcagccgcac cccgcccgac gtgaagccaa gcgactcgcc ggcgaatccg cacaggtccg 1730101 ggttggccac cttgtgcagg cccaaaggct gtaacacccg gccgccgaaa ctcaccagca 1730161 ccccgaggtc gcgggcggct gggtcggcgg cgaccgcaag cgcgtcgcgc agattggccg 1730221 ggccatcggc gccgggggca tcggcgctga gcatggcccc ggtcaacacg accgggcggc 1730281 tacccgcata ggtgaggtcc agccacagag cggtctcttc gagcgtatcg gtgccgtgag 1730341 tgatgaccac cccatctgcg ccgccgcgga atgcctcctg cactgcagcg cctatccggt 1730401 cccaatcggc cggcgtcaac tttgagctgt ccagcgccat gaggtcgact acttcgatgt 1730461 cggagtccat gtcgagaccg gcgatcagcg tcgccccgca atgggttggc cgtagcaccc 1730521 catcggggcc ggcggtggtc gagattgtcc ctccagtagt gatgacggtg aggcgggcca 1730581 tgatgggatc attgcgcacg tggtttgctc ccatccggcc gcggggtctg ggcgggccat 1730641 atcggcccta ggggatgatg atggtgtgcc tgacgaacca acaggatcgg ctgatccgct 1730701 gacctcgacc gaggaagccg ggggggcggg ggaacctaac gctcccgcgc cgccgcgacg 1730761 gctgcgcatg ctgctgtcgg tcgctgtggt ggtgctcaca ctcgacattg tcaccaaggt 1730821 ggtagctgtc caactgttgc cgcccggcca gccggtgtcg attatcggcg acacggtgac 1730881 ctggactctg gtgcgtaatt ctggggcggc cttctcgatg gcgaccggat acacctgggt 1730941 tttgacgctg attgcgacgg gtgtcgtggt cggaattttc tggatggggc ggcggctggt 1731001 atcgccgtgg tgggcgctgg gtcttgggat gatcctgggc ggtgccatgg gcaacctggt 1731061 tgatcgcttc tttcgggcac cggggccgct gcgcgggcac gtcgtcgatt tcttgtcggt 1731121 cggctggtgg ccggtgttca atgtcgccga tccgtcggta gtcggtggcg ccatcctgct 1731181 ggtcatcctg tcgatctttg gctttgactt cgacaccgta ggtcggcgac acgccgacgg 1731241 ggacaccgta ggtcggcgca aagccgatgg ctgaccgctc aatgcccgtt ccggatggat 1731301 tggcgggaat gcgtgttgac accggactgg cccgcttgct gggactgtct cggaccgctg 1731361 cggctgccct cgccgaagag ggcgcggtcg agctgaatgg cgtgccggcc ggaaagtccg 1731421 atcggctcgt ctccggcgcc ttgctgcagg tgcggttgcc cgaggcgccc gcgccgctgc 1731481 agaacacccc catcgatatc gagggcatga cgattctgta ttccgacgac gacatcgttg 1731541 cggtcgacaa accggctgca gttgccgcgc atgcgtcggt cggctggacc ggaccgacgg 1731601 tgctcggcgg actcgccgcc gccgggtacc ggatcaccac atccggggtg cacgagcggc 1731661 agggcatcgt gcatcgcctc gacgtcggga cctccggggt gatggtagtg gcgatctccg 1731721 agcgggcgta caccgtgctg aagcgggcgt tcaaataccg cacggtggac aagcggtacc 1731781 acgcgctggt tcaaggacat ccagatccgt ccagcggaac gatcgacgcg ccgatcggtc 1731841 gtcatcgcgg ccatgaatgg aagttcgcga tcaccaagaa tggccggcac agccttacgc 1731901 actacgacac gctggaagcg ttcgtggcag ccagcctgct cgacgtgcat ctggaaactg 1731961 gccgcaccca ccagatccgg gtgcacttcg ccgcgttgca tcacccatgt tgcggcgacc 1732021 tcgtttacgg agctgatccc aagctagcga agaggctcgg gttggaccgt caatggctgc 1732081 acgcgcgttc actggcgttc gctcatccgg ccgacggccg gcgggtggag atcgtcagcc 1732141 cgtatccggc cgatctgcag cacgcgctaa agatattgcg tggcgagggt tgaccggcat 1732201 cacgaggtgc ggcagacgaa cgtggcgcca tggaaatcga ggcgcacctc gccctggtgt 1732261 tcccagtatt caatgccttg gcgaccatac cgggcgccac tgccggacag ttcgacgaac 1732321 acgatcactt ggtcgccttt ccagttgagc acggccgtct tcgggtcgaa ttggttgtag 1732381 aactgtgcgg tcaatgggcc gtcctgcgtg ggacaccggt aggtgaggac gggtggagtc 1732441 gcggtggcgg gatcggcgat tgccaattgg accagccggg tctggtaggc ctcctgcaca 1732501 caggtacgcg ggtcggtgtc ttgcgcacaa gcatcacgca gcatcgtcca gctgctttgt 1732561 gcggcttcca gcgccgccga gcgtcgatgg gccagcgcct gttgataggc ggtcgaaagc 1732621 cggtggtcca gactggtcaa ctgccggtcg tggcaaacca gttgctgcac tatggttgcc 1732681 ggtttggtgc agtcgagcga ctgcccggcg gtcggcgagg ttgtgttagc cggagggttt 1732741 gcggcgcagg cgctcagaac cagggcggtc accaggacgc cgatccatct catggaaacg 1732801 gactacccgg ctaccgacgc ggtgtccagc gcgacacgcc acagggctca gactggtgcc 1732861 gtggtgctct cgcccgatgt gacgtcgacc gccagcggcg cgatgacgcc gaggatttcc 1732921 gtgatcgttt cggagggcac gccggctgcg gtcagcgcgt cggccaagtg tccggcgacc 1732981 aggctgaagt ggtgcatggt aattccgcgc ccctgatgga cttgcttcat cggcgcaccg 1733041 gtatagggct cgggcccgcc aagcgcggcc gcgaaaaact ccacctgctt gcccttgagg 1733101 cggctcatgt tcgtaccgct gaagaaggcc gatagttggt catcggcaag cacacgaaca 1733161 tagaagtcct cgacgacgac ttcgatggcc tcatgcccgc cgatcttgtc gtagatgctg 1733221 atcggctcac gtttgcgcaa gcgtgacagt agtcccatcg tgccagggga ccatgccggc 1733281 gttgcctgcc ggttaggtcg cgatcacgct cggattatca gctgtaacaa gctgattgcc 1733341 gccaacgtcg cacagatccc gtcgcaaaca gatccttggt cgccgcaccg gccggtagtg 1733401 gactccattc atcagctcat gtgctagtag gttggcttca tgacccgtgt ccatcacccc 1733461 acgccgccat caggagctac ccacgatgaa tcttggtgac ttaacgaact tcgtcgagaa 1733521 gccgctcgcg gcggtgtcca acatcgtcaa caccccgaac tcggccgggc gatatcggcc 1733581 cttctacttg cgcaacttgc tcgatgcggt gcagggccgc aacctcaatg atgctgtcaa 1733641 gggcaaggtt gtcctcatca ctggtgggtc atcaggcatc ggtgcggcgg ccgcgaagaa 1733701 aattgccgag gccggcggca cggtggtgtt ggtcgcacgc accctggaaa acctcgagaa 1733761 cgtcgccaac gacatacggg cgatccgagg caacggtggg accgcccacg tctacccgtg 1733821 cgatctatcc gacatggatg cgattgccgt gatggccgac caggtgctcg gcgacctcgg 1733881 cggcgtcgac atcttgatca acaacgcggg ccggtcaatt cggcgctcgt tggagttgtc 1733941 ctatgaccgg atccacgatt accagcgaac gatgcagctc aactacctcg gcgcggtcca 1734001 gctgatcctg aagttcatcc ccggaatgcg agaacgccac ttcgggcata tcgtcaacgt 1734061 ttcctcagtc ggcgtgcaga cccgcgcgcc gcgcttcggc gcttacatcg ccagcaaggc 1734121 cgcgctggac agcctgtgtg atgcgttgca agccgagacc gtgcacgaca acgtccgatt 1734181 caccaccgtg cacatggcat tggtaaggac tccaatgatc agcccgacca cgatctacga 1734241 caagtttccc acgctgacgc cggatcaggc ggccggtgtg atcaccgatg ccatcgtgca 1734301 tcggccccgg cgagccagct caccgttcgg acagttcgcc gccgttgccg acgccgtcaa 1734361 ccccgcggtg atggaccggg tacgtaaccg tgccttcaac atgttcggcg actcgtccgc 1734421 agccaaggga agtgaatccc aaaccgacac atcagaactc gacaagcgaa gcgagacgtt 1734481 tgtgcgggcc acccgaggga tccattggtg acaccatgag ccttccgaaa ccgaacaatc 1734541 agaccaccgt tgtgatcacc ggcgcctcct ccggcatcgg tgtcgaattg gctcgtggct 1734601 tggccggccg cggcttccca ctgatgctag tggcgcggcg ccgcgagcgc ctcgacgaac 1734661 tggccgatca gctgcgccag gaacactgcg tcggggtgga ggtcttgccg ctcgaccttg 1734721 ccgatacgca agcgagggca cagctggctg atcgcttgcg tagtgatgcg attgccgggc 1734781 tgtgcaacag cgcaggtttc ggcaccagtg ggcgtttttg ggagttgccg ttcgcacgcg 1734841 aaagcgagga agtcgtcctc aatgctctgg cgttaatgga actcacccat gccgcactgc 1734901 caggcatggt caagcgcggc gccggtgcgg tgctcaacat cgcctcgatc gcgggtttcc 1734961 agccgattcc ctatatggcc gtgtattcgg ctaccaaagc ctttgtgctg acgttctctg 1735021 aagccgtgca ggaggagctg cacggaacgg gcgtgtcggt gactgccctg tgcccaggcc 1735081 cggtacccac cgagtgggcc gagatcgcca gcgccgagcg gttcagcatt cccctcgccc 1735141 aagtttcgcc gcacgacgtc gccgaagccg ccatcgccgg gatgctctcc ggtaagcgca 1735201 ccgtcgtgcc gggcatagtg ccaaagttcg tcagcaccag cggcagattc gctccgcgca 1735261 gcctgctgct gcccgcgatc cggatcggca accggctgcg cggcgggccc agccgctgat 1735321 gtgaggggcg ttccggcctg gtgccgaacg gagtgctggg cctgggcaat cccagccggc 1735381 tagccgcgtt gtatgggttg cagctggcgc acgagtcgca gtgctgccag atgcacaatt 1735441 tgccctctgc agcgcgacaa gtcactgttg cgtgtcgcga ggaggtgggc ataacgacca 1735501 tccttgccgg cagagacgaa tgcggcgtgt gtgacaagac agctgggttg gatggcgccg 1735561 ctccttagcg ggccatagcg cacggcccgc ttcgtcgccg gcgctagtct catgcgatgg 1735621 cctctgttga gctgtccgct gacgtcccca tcagcccgca ggacacgtgg gaccacgttt 1735681 cggagctgtc agagttgggg gagtggctcg tcatccatga ggggtggcgc agcgagttgc 1735741 ctgatcaact gggcgaaggc gtccagatcg tgggtgtcgc gcgggccatg ggcatgcgca 1735801 accgggttac gtggcgggtg accaagtggg acccgccaca tgaggtcgcg atgacaggat 1735861 ccgggaaggg tggaacaaag tacggagtca ccctcaccgt gcgacccaca aaaggcgggt 1735921 cggcgctggg gctgcgtctc gagctgggcg ggcgtgcgct gttcggcccg ctgggttcgg 1735981 cggcggctcg cgccgtcaag ggcgacgtcg agaagtcgct taagcagttc gccgagctat 1736041 acggctagcc gctagaagac acactttgcg acacgcccga acggtgtcgg tcctcggtca 1736101 tagactggcg tccctatgag cggttcatct gcggggtcct ccttcgtgca cctgcacaac 1736161 cacaccgagt attcgatgct ggacggtgcc gcgaagatca cgcccatgct cgccgaggtg 1736221 gagcggctgg ggatgcccgc ggtggggatg accgaccacg gaaacatgtt cggtgccagc 1736281 gagttctaca actccgcgac caaggccggg atcaagccga tcatcggcgt ggaggcatac 1736341 atcgcgccgg gctcgcggtt cgacacccgg cgcatcctgt ggggtgaccc cagccaaaag 1736401 gccgacgacg tctccggcag cggctcctac acgcacctga cgatgatggc cgagaacgcc 1736461 accggtctgc gcaacctgtt caagctgtcc tcgcatgctt ccttcgaggg ccagctgagc 1736521 aagtggtcgc gcatggacgc cgagctcatc gccgaacacg ccgagggcat catcatcacc 1736581 accggatgcc cgtcggggga ggtgcagacc cgcctgcggc tcggccagga tcgggaggcg 1736641 ctcgaagccg cggcgaagtg gcgggagatc gtcggaccgg acaactactt ccttgagctg 1736701 atggaccacg ggctgaccat cgaacgccgg gtccgtgacg gtctgctcga gatcggacgc 1736761 gcgctcaaca ttccgcctct tgccaccaat gactgccact acgtgacccg cgacgccgcc 1736821 cacaaccatg aggctttgtt gtgtgtgcag accggcaaga ccctctcgga tccgaatcgc 1736881 ttcaagttcg acggtgacgg ctactacctg aagtcggccg ccgagatgcg ccagatctgg 1736941 gacgacgaag tgccgggcgc gtgtgactcc accttgttga tcgccgaacg ggtgcagtcc 1737001 tacgccgacg tgtggacacc gcgcgaccgg atgcccgtgt ttccggtgcc cgatgggcat 1737061 gaccaggcgt cctggctgcg tcacgaggtg gacgccgggc ttcgccggcg atttccggcc 1737121 ggtccgccgg acgggtaccg cgagcgcgcc gcctacgaga tcgacgtcat ctgctccaaa 1737181 ggtttcccat cgtactttct gatcgtcgcc gacctgatca gctacgcgcg gtcggcgggc 1737241 ataagggtgg gtcccggccg cggctcggcc gccggctcgc tggtcgccta cgcgctgggc 1737301 atcaccgaca tcgacccgat tccacacggt ctgctgttcg agcggttcct caaccccgag 1737361 cgcacctcga tgcccgacat cgatatcgac ttcgacgacc ggcgccgcgg tgagatggtg 1737421 cgctacgcag ccgacaagtg gggccacgac cgggtcgcgc aggtcatcac cttcggcacc 1737481 atcaaaacca aagcggcgct gaaggattcg gcgcgaatcc actacgggca gcccgggttc 1737541 gccatcgccg accggatcac caaggcgttg ccgccggcga tcatggccaa agacatcccg 1737601 ctgtctggga tcaccgatcc cagccacgaa cggtacaagg aggccgccga ggtccgcggc 1737661 ctgatcgaaa ccgacccgga cgtacgcacc atctaccaga ccgcacgcgg gttggaaggc 1737721 ctgatccgca acgcgggtgt gcacgcctgc gcggtgatca tgagcagcga gccgctgact 1737781 gaggccatcc cgttgtggaa gcggccgcag gacggggcca tcatcaccgg ctgggattac 1737841 ccggcgtgcg aggccatcgg tctgctgaaa atggacttcc tgggcctgcg gaacctgacg 1737901 atcatcggcg acgcgatcga caacgtcagg gccaacaggg gtatcgacct cgacctggaa 1737961 tccgtgccgc tggacgacaa ggccacctat gagctgctgg gccgcggcga caccctgggc 1738021 gtgttccagc tcgacggcgg gcccatgcgc gacctgctgc gccgcatgca gccgaccggg 1738081 ttcgaagacg tcgtcgccgt tatcgcgctg taccggcccg gcccgatggg catgaacgca 1738141 cacaacgact atgccgaccg caagaacaac cggcaggcca tcaaacctat tcacccggaa 1738201 ctcgaagaac cgctgcgcga gatcctcgcc gagacctacg gcctcatcgt ctatcaagag 1738261 cagatcatgc gcatcgcgca gaaggtggcg agctactcgt tggcccgcgc cgacattcta 1738321 cgcaaggcca tgggcaagaa gaaacgcgag gtgctggaga aggagttcga gggcttctcc 1738381 gatggcatgc aggccaacgg gttctctccg gcggccatca aggcgctgtg ggacaccatc 1738441 ctgccgttcg ctgactacgc gttcaacaag tcacatgccg ccggctacgg catggtgtcc 1738501 tactggacgg cctacctcaa ggccaactat cccgccgagt acatggccgg tctgttgacg 1738561 tcggtcggcg acgataaaga caaggccgcg gtttatctgg ccgactgccg caagctcggc 1738621 atcaccgtgc tcccgcccga cgtcaacgaa tctggcttga acttcgcatc ggtcggccaa 1738681 gacatccgct acgggctggg cgcggtgcgc aacgttggcg ctaatgtcgt gggctcgttg 1738741 ctccaaaccc gcaacgacaa gggcaagttc accgactttt cggactacct gaacaagatc 1738801 gacatctcgg cgtgcaacaa gaaggtgacc gaatcgctga tcaaggcggg tgcgttcgac 1738861 tcgctggggc atgcccgcaa gggtcttttc ctggtgcaca gcgatgcggt ggactcggtg 1738921 ctgggcacca agaaggccga ggcactgggg cagttcgatc tcttcggcag caatgatgat 1738981 gggaccggca ccgcagatcc cgtgttcacc atcaaggtgc ccgatgatga gtgggaggac 1739041 aaacacaaac tcgccctaga gcgcgagatg ctgggactgt acgtctcggg gcatcccctc 1739101 aacggtgtgg cacacttgct ggctgcccag gtcgacaccg cgatcccagc gatcctcgac 1739161 ggcgatgtcc ccaacgatgc ccaagtgcgg gtgggcggca tcctggcgtc ggtgaaccgg 1739221 agggtcaaca aaaacggaat gccatgggct tcagcgcaat tggaggatct cacgggcggc 1739281 atcgaggtga tgttcttccc gcacacctac tccagctatg gtgccgacat cgtcgacgat 1739341 gcagtcgtgc tggtcaacgc caaggtggcg gtccgtgacg accgcatcgc attgatcgcc 1739401 aatgacctca cagtgcccga cttttccaac gccgaggtgg agcggccgct ggcggtcagc 1739461 ttgcccaccc ggcagtgcac ctttgacaag gtgagtgcgc tcaaacaggt gttggcgcgc 1739521 caccccggca cctcgcaggt gcatctgcgg ctcatcagcg gagaccggat caccacgctg 1739581 gcacttgatc agtcgttgcg ggtgacgccg tcgccggcgt tgatgggtga cctcaaggag 1739641 ctgctcggcc ctggatgtct ggggagttag cgaggcgacc gcccccagcg gtttccgcac 1739701 gatcgcccgt gagcgccgct aatggatcca gcccgacgcc cgactgtccc cgttgagata 1739761 ccccgagacc tcgtcgtcga agttggcgaa gcccgacatg aggctgccga agttgaagaa 1739821 gccagagatc gaggttcccg cattgacgaa acccgaaatg ttggccgtac cggtgatcgt 1739881 gggtaccgag tttctgaagc ccgacagatg gtcacccacg ttgatgatgc cggcgttgta 1739941 gttgcccacg ttggccaggc ccgagttacc ggtgacaaat tcggcgtgtt cgtcacgggc 1740001 gttgttatcg aagcccgagt tgaaggtgcc ggcgttagcg tagcccgagt tgttggtgcc 1740061 cgtgtgcagc cagccggagt tctgtaccgg ctggttgacc gagttgaaca atccggtgtt 1740121 gagatcgccc gagttgaaga agccggtgtt gatgttgccg gagttgaagt agccggtgtt 1740181 cacgttgccc gagttgaaat cgcccgtgtt catgctgccg gcattgaagc tacccgtgtt 1740241 ggtatggccg gagttaaaca gacccatgct gccggtgacc aggtttccgc ccgagtttcc 1740301 gattccggtg ttcgtggtgc ccgagttgaa ccagccggta ttggtggtgc cggagttgaa 1740361 ccagccggtg ctggtggtgg ccgagttgaa ccagcccgtg ctgagctggc cggagttgcc 1740421 gataccggta ctgagctcgc cggagttgcc gaagcccgag ctgcgctcgg ccgaactgcc 1740481 gaactccacg ctcagcgccc cgactccatt gcccgagttt cccatgccga tgttgttatt 1740541 gcccgagttg aaaaagccga tattaccgtt gcccgagttc ccgaagccga ggttcccgct 1740601 acctgaattg agggcgccga aacctatctg gttatcaccg gtgagcccga agccgatgtt 1740661 gttgttgccg ctgttcccaa acccgatgtt atgagagccc ttgttgccga aaccgatatt 1740721 tccactgcca aggttgccgc tgccaaagtt ggtgtcgccg gtgtttccgt taccaaagtt 1740781 gacgttaccg gtgtttccga acccgaaatt cgagttcccg gtgttgccgc cacccacgtt 1740841 gagatttccg gtgttaccgc cgccaaagtt ggtgtcgccg gtatttccac tacccaggtt 1740901 atagctgccg aggttgccgc cgcccaggtt atagctgccg atgtttccgc taccccagtt 1740961 gagcgtgccc gtgtttccac tgcccgggtt gagatccccg gtgttgccgc cacccaggtt 1741021 gtagctgccg atattgccgc tgcccaggtt gagatcaccg atattggcgt tgcccaagtt 1741081 gacgctgccc gtgttggcgc tacccgggtt gaagctgccc aggttgccga ggcccagatt 1741141 gccgctggcc agattgaagc caccgatggt gacgttgccc agatcgagaa acggaaaccc 1741201 cgcgatgatc accgcaccgc cgggcgtggc tgccgcgctc ggcgacggtg tgaacggcgt 1741261 cagcgacaag gcgaccgccg acgcctcgcc gtgataaccg agcatcgcgg cgacatcggc 1741321 ggcccacatc tgctcataga cggcctcgac ggccgcgatg gccggggcgt tttgccccaa 1741381 cagattcgag gccaccaagg agcgcaaccg acctcgattg gcgctcaccg cgcccggatg 1741441 caccgtcgcc gccagcgccg cctcgaacgc cgataccgcc gcctgggctt gacccgccgc 1741501 ctgctcagcc tgcgctgcgg ccgtggtcag ccagcgagca taggacgcgg ccgcgcctgt 1741561 catggccgct gacgccggac cttgccagga cccggtggcc aactgcgagg tcaccgccga 1741621 aaatgaggcc gccgccgaac ccaaatcgcc ggccagcccg gtccaggccg acgccgccgc 1741681 caacatcggt cccggcccgg caccagcgaa catcagcgcc gaattgatct ccggcggcaa 1741741 caccgaaaaa ttcatcacaa ccatcccgtc agccggccac acccaccggg cttcacggcg 1741801 ctgtctggcc ccaaccgcag cgaagcctac gaaaaagccg ggcgcttcgg acgggcgcag 1741861 gttaaatcca ggtaacgcgt gacgaatctc gcgacgagcc tccttgcggc catgggccgc 1741921 cacgggtctc ggtggtcgcg gcccccgtgc ttccgcgtcc ttcgattgtg gacgtacgct 1741981 caccgatgtg acctgggcca tactgatccg ctgtcaagga gaacggaaat gaccacaaca 1742041 gagcgcccga caaccatgtg cgaggcgttc cagcgcaccg ccgtcatgga cccggacgcc 1742101 gttgcgctac ggacccccgg cggtaaccag acaatgacat ggcgagacta cgcggcgcag 1742161 gtgcggcggg tcgctgccgg cctggcaggt ttgggagttc ggcgcggcga cacggtctcg 1742221 ctgatgatgg cgaaccggat cgagttctac ccgctcgacg tcggtgctca gcacgtcggc 1742281 gccacctcgt tttcggtgta caacaccctg cccgccgagc agctgaccta cgtgttcgac 1742341 aacgcgggga ccaaggtggt catctgcgag caacagtacg tcgatcgcgt tcgcgccagc 1742401 ggtgtgccca tcgaacacat cgtctgcgtc gatggcgcgc cccccggcac gctctcgctg 1742461 acggatttgt acgcggccgc ctccggcgac ttcttcgact tcgagtcgac gtggcgtgcc 1742521 gtacaacccg aggacattgt caccctcatc tacacgtccg gcacaacggg aaaccccaag 1742581 ggtgtggaga tgacccacgc caacctgctg ttcgagggat atgccatcga cgaggtgctc 1742641 ggaatccggt ttggcgatcg ggtgacgtcc ttcctgccat cggcgcacat cgccgatcgg 1742701 atgaccgggc tgtacctgca ggagatgttc ggcacccagg tcaccgcggt ggccgacgcg 1742761 cgcacgatcg cagccgcgct ccccgacgtg cggccaaccg tgtggggggc cgttccccgg 1742821 gtttgggaaa agcttaaggc cggaatcgaa ttcaccgtcg ctcgtgagac cgacgagatg 1742881 aagcggcagg cgttggcgtg ggcgatgtcg gtggctggca aacgcgccaa cgccctgctc 1742941 gcaggtgaat ctatgtcgga tcagctggtc gccgaatggg ccaaagccga cgagtcggtg 1743001 ttgtccaagt tgcgcgagcg gctgggcttc ggcgagctgc ggtgggccct gtccggagcg 1743061 gcgccgatcc ccaaggagac gctcgcgttc ttcgcaggta tcggcatccc aatcgccgag 1743121 atttggggaa tgtcggagct gagctgcgtt gccaccgcca gccatccccg cgacgggcgg 1743181 ctgggcaccg tcggaaaact acttcccggg ctgcagggca agatcgccga agacggtgag 1743241 tacctggtcc gcggtccgct ggtgatgaag ggttatcgca aagaaccggc caagaccgcg 1743301 gaggcgatcg actccgacgg ctggctacac accggagatg tcttcgatat cgactccgac 1743361 ggctatctgc gggtggtgga ccgcaagaag gagctgatca tcaatgcggc cggaaaaaac 1743421 atgtcgccgg ccaacatcga gaacaccatc ctggccgcgt gccccatggt cggggtgatg 1743481 atggcaatcg gtgacgggcg aacgtataac accgcgctgt tggtcttcga cgccgactct 1743541 ctcggtccgt atgcggccca gcgtggcctc gatgcctcgc ccgcggctct ggcggctgac 1743601 ccggaggtga tcgcgcgcat cgccgccggc gtggccgagg gcaacgccaa attatcgcgg 1743661 gtcgaacaga tcaagcggtt ccgcatattg cccaccctgt gggagcccgg cggggacgag 1743721 ataaccctga cgatgaaact caagcgccgt cgaatcgccg cgaaatattc cgcggagatc 1743781 gaggagctct acgccagcga gctgagaccg caggtttacg agcccgctgc cgtgccatcg 1743841 acacaaccgg catgacgggg gctagccagt gactgcacgg gaggtgggcc gcatcggact 1743901 gcgaaagttg ctgcagcgca tcggtattgt tgctgaatca atgacgccgc tagcgaccga 1743961 ccccgttgag gttacccaac tgctggatgc ccgatggtat gacgagcggc tgcgtgcgct 1744021 ggccgacgag ctcggacgcg atccggacag cgtgcgcgcc gaggcggcag gctatctgcg 1744081 ggagatggcc gcctcgctgg atgagcgggc cgtgcaggca tggcgcggct tcagtcgctg 1744141 gctcatgcgc gcctacgacg tactggtcga cgaggaccag atcacgcagc tgcgcaagct 1744201 tgatcgcaaa gccaccctgg cgttcgcgtt ctcgcatcgt tcgtacttgg atgggatgct 1744261 gctgcccgag gcgatcctgg ccaaccggct ctcgccggcg ctgaccttcg gcggggcgaa 1744321 cctgaacttc tttccgatgg gcgcttgggc caaacgtacc ggggctatct tcattcggcg 1744381 tcagacgaaa gatattcccg tctaccgctt cgtattacgt gcttacgccg cgcagctggt 1744441 gcaaaaccat gtcaacctca cctggtcgat cgaagggggt cggaccagaa cgggcaagct 1744501 acggccaccg gtgttcggga tcctgcgtta catcaccgat gcggtcgacg aaatcgacgg 1744561 tcccgaagtg tatttggtgc cgacctcgat cgtgtacgac cagctgcacg aggtggaagc 1744621 catgaccacc gaggcctatg gcgcggtgaa acgacccgaa gacctgcgct ttctggtccg 1744681 gttggcgcga cagcagggcg agcgactggg ccgcgcctat ctcgacttcg gcgaaccgct 1744741 gccgcttcgc aagcgcctgc aggagatgcg cgccgacaag tcgggcaccg gcagcgagat 1744801 cgaacggatc gcgttggatg tcgagcaccg gatcaaccgc gccacaccgg ttacccccac 1744861 cgcggtggtg agtctggccc tgctgggcgc ggaccgctcg ttgtccatca gcgaggtgtt 1744921 ggcgacggtt cgcccgttgg ccagctacat agctgcccgc aactgggcgg tggccggcgc 1744981 cgccgatctg acgaatcgct cgacgatccg gtggaccttg catcagatgg ttgcttccgg 1745041 cgtggtgagt gtctacgacg cgggcaccga ggcggtgtgg ggcatcggcg aggaccagca 1745101 cctggtggcg gcgttttacc gcaacaccgc gatccatatc ctggtcgatc gggccgtcgc 1745161 cgagttggcg ttgctggcgg ccgcagagac cacaacaaac ggctcggttt ccccggcgac 1745221 cgtgcgtgat gaggcgttga gccttcgcga cttgctgaag ttcgagttct tgttttctgg 1745281 ccgtgcccag tttgagaaag acctcgcaaa cgaggtactg ctgatcgggt cggtggtcga 1745341 cacctccaag cccgcggccg cagccgatgt gtggcgcctg ctggaatcgg ccgatgtgct 1745401 gctggcccac ctggtgctgc ggccgtttct cgatgcctac cacattgtcg ccgatcggct 1745461 ggccgcccat gaagacgact ctttcgacga ggaagggttt ctggccgagt gtctacaggt 1745521 cggcaagcag tgggagctgc agcgcaatat cgccagcgcc gagtccaggt cgatggagct 1745581 gttcaagacc gcactgcgcc tggctcgcca tcgcgagctg gtcgacggtg ccgatgcgac 1745641 ggacatcgcc aaacgccgac agcagttcgc cgacgagata gccacggcaa ccaggcgggt 1745701 aaacacaatc gcagaactgg cccgcaggca atgagcgaca aatgcggccg ccagggccgc 1745761 tgcgccgtcc agcgaacggg tcaaacggtg gacgcgccat ccccccgggc atagtctgaa 1745821 tgtgatctag gtcacgtgcc agcaccggag gaggcgggac tatggtcgcg accactacgc 1745881 acttcccgaa gcaaaaagcg ccctgcgggc acatggttga cggcgatcac cacatcgagc 1745941 gcgacgacga aggccttgcc tacgacgacc tcaagttttc ctgcggctgc cgcgaaatcc 1746001 ggcatttcta ccacgacgga tccatgcggg tacgcacgat tcgacacgac ggcaaggtgt 1746061 tgaaggacga gcacagcggc gatcacgaag cgtgaaccag cgcgatgacc gcccaacaca 1746121 acatcgtggt tatcggcggc ggtggtgcgg gtctgcgcgc cgcgattgcg atagccgaaa 1746181 ccaatccgca cctggatgtg gcgatcgttt ccaaggtgta cccgatgcgc agccacaccg 1746241 tctcggctga gggcggcgcc gcggcggtga ccggtgacga cgacagcctc gatgaacacg 1746301 cgcacgacac ggtatccggt ggcgactggc tgtgtgacca agatgcggtc gaggctttcg 1746361 tggccgaggc gcccaaagag ttggtgcagc tcgagcattg gggctgtccg tggagccgta 1746421 aaccagacgg gcgcgttgcc gttcgcccgt tcggcgggat gaagaagctg cgcacctggt 1746481 ttgccgccga caagacggga tttcacctcc tgcacacgtt gtttcaacgg ctgctcacct 1746541 attccgacgt catgcgctat gacgagtggt tcgctacgac gctgctggtc gacgacggca 1746601 gggtatgtgg tctggtcgct atcgagttgg cgaccgggcg catcgagacg atccttgccg 1746661 acgcggtgat tctgtgcacc ggcggatgcg ggcgggtatt tccattcacc accaacgcga 1746721 acatcaagac cggcgacggc atggcgctcg cattccgcgc gggcgcgccc ctaaaagaca 1746781 tggaattcgt ccaataccac cccaccggac tgccgttcac cgggatcttg atcaccgagg 1746841 ccgcacgagc tgaaggcggc tggctgctca acaaagacgg ctaccgctac ctccaggatt 1746901 acgacctcgg caagcccacg cccgagccca ggctgcgcag tatggagctc gggcccaggg 1746961 accgactgtc gcaggccttc gtacacgagc acaacaaagg aaggacggtc gacaccccgt 1747021 acggccccgt cgtctatcta gacctgcggc acctgggggc ggacctgatc gatgcaaagt 1747081 tgccgttcgt acgtgagctg tgccgcgact accagcacat cgaccccgtg gtcgaattgg 1747141 tcccggtacg accggtagtg cactacatga tgggtggcgt tcacaccgat atcaacggcg 1747201 ccacaacgct tcccgggcta tatgccgcag gtgaaacagc ctgcgtgagc attaatggcg 1747261 ccaaccgcct ggggtcgaac tcgctgcccg agctgctggt gttcggggct cgagcgggcc 1747321 gtgccgccgc ggattacgca gcgcgccacc aaaagtcgga ccgtggcccg tcgtcggcag 1747381 tgcgggctca ggcccgcacc gaggctctac ggctagagcg tgagctcagc cgccatggcc 1747441 agggaggcga acgaatcgcg gatattcggg cggacatgca ggccaccttg gaaagcgccg 1747501 cgggtattta tcgtgacgga cccaccctca ccaaagcggt cgaggagatt cgggtgctgc 1747561 aggaacgatt cgccacggcg ggcatcgacg atcacagccg cacattcaac accgagctga 1747621 ctgcgctgct cgagttgtcg gggatgctcg acgttgcact ggcgatcgtc gaatctggtt 1747681 tgcgccgaga agaatcccgt ggcgcacacc agcgaaccga ctttccgaac cgggacgacg 1747741 agcatttctt ggcgcacacc ttggttcata gagaaagcga cggaacgctg cgggtcggct 1747801 accttccggt cactatcact cgctggccac cgggcgaacg cgtgtatggg aggtaaggat 1747861 gatggatcga attgtcatgg aggtctcccg gtatcggccc gagatcgaat cggccccgac 1747921 atttcaggcc tacgaggttc ccctcacccg cgaatgggcg gtgttggacg gcctgaccta 1747981 catcaaggat cacctcgacg gaacactctc cttccgctgg tcgtgccgga tgggtatctg 1748041 cggcagtagt ggtatgacga tcaacggcga cccaaagctg gcgtgcgcga cattccttgc 1748101 cgattaccta cccgggccgg tgcgggtgga gccgatgcga aacttcccgg tgatccgcga 1748161 tctcgttgtc gacatcagtg acttcatggc caagctgccc agtgtgaagc cgtggctcgt 1748221 ccggcatgat gaaccgcccg tcgaagacgg cgaataccgg cagaccccgg ccgaactcga 1748281 tgcattcaag cagttcagca tgtgtatcaa ctgcatgttg tgctactcgg cgtgcccggt 1748341 gtacgcgctg gaccccgact tcctcggtcc ggcggcgatc gcgctggggc agcggtacaa 1748401 cctggactcg cgcgaccaag gtgcggcgga tcgcagggat gtcctggccg cggccgacgg 1748461 cgcttgggcg tgcaccctgg tgggcgaatg ttcgacggct tgtccgaaag gcgtcgatcc 1748521 tgccggcgcg atccagcgct acaagctgac cgcggccacg cacgcgctga agaagttgct 1748581 gttcccttgg gggggggggc ggatgagcgc ctatcgccag ccggtcgaaa gatactggtg 1748641 ggcgaggcgg cgttcttacc tgcgattcat gcttcgcgaa atcagttgca tcttcgtggc 1748701 ctggtttgtt ctctatctgg tgctggtatt gcgcgccgtt ggcgcgggcg ggaattccta 1748761 ccagcggttt ttggacttca gcgccaatcc ggttgtcgta gtgctgaacg tcgtcgcgtt 1748821 gagtttcctg ctgctgcatg ctgttacctg gttcggatcg gcaccgcgcg cgatggtgat 1748881 tcaggttcgc ggccgccggg tacccgctcg cgcggtcctt gctgggcact acgcggcatg 1748941 gctggtggtt tcggtgatcg ttgcctggat ggtgctgtca tgactccctc gacatcggat 1749001 gccaggtcgc gccgacgctc ggcggagccc ttcctgtggc tgctgttcag cgccgggggc 1749061 atggtcaccg ccctggttgc gcccgtcctg ctgttgctgt tcggactcgc gtttccgctc 1749121 gggtggctcg acgcgcccga ccacgggcac ctactggcga tggtgcgcaa cccgatcacc 1749181 aagcttgttg tgctggtcct ggtggtactg gccctgttcc atgcggcgca ccggttccgg 1749241 ttcgtgctcg accatgggct gcaactgggc cggttcgacc gagtgatcgc cctgtggtgt 1749301 tacggcatgg ccgtgttggg ctcggcgacg gcgggttgga tgttgctcac catgtaaagt 1749361 cgctggccgg gcgctttggc cgccggcacg gtacggtacg gacctgtacc accacaacgg 1749421 ttctatggta ggcgctgtga cccagatagc ggatcggcct acagacccct cgccctggtc 1749481 gccgcgagag accgagttac tggcggtgac actacggctg ctgcaggagc acggttatga 1749541 ccggctaaca gtggatgccg ttgcggcgag cgcccgcgcc agcaaggcaa cggtctaccg 1749601 gcgctggccg tcgaaagccg aattggtgct ggccgcgttc atcgagggca tccgccaggt 1749661 cgcggtcccg cccaataccg gcaacctgcg cgacgacttg ctgcgactgg gggagctgat 1749721 ctgtcgggag gtgggccaac acgccagcac catccgcgcg gtgctcgtcg aagtgtcgcg 1749781 caatcctgcc ctcaacgacg ttttgcagca tcagttcgtc gaccaccgta aggccctgat 1749841 ccagtacatc ttgcagcagg ccgtcgaccg cggtgagatc tccagcgcgg ccatcagcga 1749901 tgaactctgg gacctgctac ccggctacct catcttccgg tccatcatcc ccaaccggcc 1749961 gcccacccag gacacggtgc aagccctcgt cgacgacgtg atactcccca gcctcacccg 1750021 atccaccggt tgagtcagcg gtgcgaatgg ctgggcaccg ttgtggtgtc cggtcccgta 1750081 ccgtactgtt gaatccgcgg atccccgcct gaggtacggg gcgtggtcgc gccccgggca 1750141 atagcgtcgc cggttatcga aaggctaacg ggtgcagggg atttcagtga ctggcctggt 1750201 caaacgcggc tggatggtgc tggttgccgt ggcggtggtg gcggtcgcgg gattcagcgt 1750261 ctatcggttg cacggcatct tcggctcgca cgacaccacc tcgaccgccg gtggtgtcgc 1750321 gaacgacatc aagccgttca accccaaaca ggtaaccctc gaggtctttg gcgctcccgg 1750381 aaccgtggca acgatcaatt atctggacgt ggatgccaca cctcggcaag tcctggacac 1750441 gaccctgccg tggtcataca cgatcacgac gaccctgccc gcggtcttcg ccaatgttgt 1750501 cgcgcaaggc gacagcaatt ccatcggctg ccgcatcacc gtcaacggtg tagtcaagga 1750561 cgaaaggatc gtcaacgaag tgcgcgccta taccttctgc ctcgacaagt cctcatgagc 1750621 aaccaccacc gcccgcggcc ttggttgccg cacaccatcc gacggctttc gttgccgatc 1750681 ttgctgtttt gggtgggtgt ggccgccata accaatgccg ccgtgccgca attggaggtg 1750741 gtcggggagg cgcataacgt cgcacagagc tccccggatg acccgtcgct gcaggcgatg 1750801 aaacgcatcg gcaaggtgtt ccacgagttc gattccgaca gtgcggccat gatcgtcttg 1750861 gaaggcgata agccgctcgg caacgacgcc caccggttct acgacaccct gctccgcaac 1750921 ctttcaaacg acaccaaaca cgtcgagcac gttcaggact tctggggcga tccgctgacc 1750981 gcggccggct cgcaaagcac cgacggcaaa gccgcctacg ttcaggtcta tctcgccggc 1751041 aaccaaggcg aggcgttgtc aatcgagtcc gtcgacgcgg tgcgcgacat cgtcgcccat 1751101 acgccaccac cggccggggt caaggcctac gtcaccggcg cggccccgct catggccgat 1751161 cagtttcagg tgggcagcaa aggaaccgcg aaagttaccg ggataactct ggttgtgatc 1751221 gcggtgatgt tgctcttcgt ataccgttcc gtcgtcacca tggtcctggt gcttatcacg 1751281 gttcttattg agttggccgc ggcccgcggg atcgtcgctt ttctcggaaa cgccggggta 1751341 atcgggctgt cgacatactc gacgaatctg ctcacactat tggtaatcgc ggcgggcaca 1751401 gactacgcga tttttgtcct cggccgctat cacgaggcgc gctacgccgc acaggatcgg 1751461 gaaacggcct tctacacgat gtatcgcggg accgcccacg tcgtcttggg ctcgggtctg 1751521 accgttgccg gcgcggtgta ttgcctgagc tttacccggc taccctattt tcaaagcctg 1751581 ggtattcccg cctcgatagg ggtgatgatt gcgttggcag ccgcgctcag cctggcccca 1751641 tccgtgctca tcttgggcag tcgtttcggt tgtttcgaac ccaagcgcag gatgaggacc 1751701 aggggatggc ggcgcatcgg cacggccatc gtgcgttggc cgggacccat cctggcagtg 1751761 gcgtgcgcaa ttgcggtggt gggtctgctc gcgctgccgg gatacaaaac gagctacgac 1751821 gctcgctatt acatgcccgc caccgccccg gccaatattg gctacatggc cgcggagcga 1751881 cattttcccc aagcgcggct gaatcccgaa ctactgatga tcgagacgga tcacgatatg 1751941 cgcaatccgg ccgacatgct catcttggat aggatcgcca aggctgtctt ccatctgccc 1752001 ggcatagggc tggtgcaggc catgacccgg ccgctaggaa ccccgattga ccacagctcg 1752061 ataccgtttc agatcagcat gcaaagcgtc ggccagattc agaatctcaa gtatcagagg 1752121 gaccgagcag ccgacttgct gaagcaggcc gaagagctgg ggaagacgat cgaaatcttg 1752181 cagcgccaat atgccctaca gcaggaactc gcggccgcta ctcacgagca agccgaaagc 1752241 tttcaccaaa cgatcgccac ggtaaaggaa ctgcgagata ggatcgccaa tttcgacgat 1752301 ttcttcaggc cgattcgtag ttacttttac tgggaaaagc actgctacga tatcccgagc 1752361 tgctgggcgc tgagatccgt ctttgacacg atcgacggta tcgaccaact cggcgagcag 1752421 ctggccagcg tgaccgtaac cttggacaag ttggctgcga tccagcctca attggtggcg 1752481 ctgctaccag acgagatcgc cagccagcag atcaatcggg aactggcgct ggctaactac 1752541 gccaccatgt ccgggatcta tgcccagacg gcggccttga tcgaaaacgc tgccgccatg 1752601 ggacaagcct ttgacgccgc caagaacgac gactccttct atctgccgcc ggaggctttt 1752661 gacaacccag atttccagcg cggcctgaaa ttgttcctgt cggcagacgg taaggcggct 1752721 cggatgatca tctcccatga aggcgatccc gccacccccg aaggcatttc gcatatcgac 1752781 gcgatcaagc aggcggccca cgaggccgtg aagggcactc ccatggcggg tgctgggatc 1752841 tatctggccg gcacggccgc caccttcaag gacattcaag acggcgccac ctacgacctc 1752901 ctgatcgccg gaatagccgc gctgagcttg attttgctca tcatgatgat cattacccga 1752961 agcctggttg cggcgctggt gatcgtgggc acggtggcgc tgtcgttggg cgcttctttt 1753021 ggcctgtccg tgctggtgtg gcagcatctt ctcggtatcc agttgtactg gatcgtgctc 1753081 gcgctggccg tcatcctgct cctggccgtg ggatcggact ataacttgct gctgatttcc 1753141 cgattcaagg aggagatcgg tgcaggtttg aacaccggca tcatccgtgc gatggccggc 1753201 accggcgggg tggtgaccgc tgccggcctg gtgttcgccg ccactatgtc ttcgttcgtg 1753261 ttcagtgatt tgcgggtcct cggtcagatc gggaccacca ttggtcttgg gctgctgttc 1753321 gacacgctgg tggtgcgcgc gttcatgacc ccgtccatcg cggtgctgct cgggcgctgg 1753381 ttctggtggc cgcaacgagt gcgcccgcgc cctgccagca ggatgcttcg gccgtacggc 1753441 ccgcggcccg tggttcgtga attgctgctg cgcgagggca acgatgaccc gagaactcag 1753501 gtggctaccc accgttaagg tggtgggatg ccgctttcag gggaatatgc gccgagcccg 1753561 ctcgactggt cgcgcgagca agccgacacg tatatgaagt ccggcggaac cgagggcaca 1753621 cagctgcagg gaaagccggt catcctgctc accaccgtcg gggcgaagac cggcaaactc 1753681 cgtaagaccc cgctgatgcg cgtcgagcac gacggccagt acgcgatcgt cgcctcgctg 1753741 ggtggggcgc cgaaaaatcc ggtctggtac cacaacgtcg tgaagaaccc acgggtcgag 1753801 ctgcaggacg gcaccgtgac cggcgactac gacgcccgcg aggtgttcgg tgacgagaag 1753861 gccatctggt ggcagcgcgc cgtggcggtc tggccggact atgccagcta ccagaccaag 1753921 acggaccgcc agattccggt gttcgtgctg accccggtgc gcgcgggcgg ctagccattg 1753981 ggatagggcg gcgtggcacc attgaccggt gtccgccgaa ctgagccaga gcccgagcag 1754041 ctcgccgctg ttttcactat ctggggcaga catcgaccgt gccgccaagc ggatcgcacc 1754101 ggtagtcacg cccaccccgt tgcaacctag cgatcggttg tcggcgatca ctggcgccac 1754161 ggtctacctc aagcgcgaag acttgcagac ggtgcgctct tataagctac gcggagcgta 1754221 caacctgttg gtgcagttgt ccgatgagga actggccgcg ggcgtggtgt gttcttctgc 1754281 gggcaaccac gcgcagggct tcgcgtatgc gtgtcgctgt ctgggtgtgc acggccgggt 1754341 ctacgtacct gccaaaaccc ccaagcagaa gcgtgaccgg atccgctacc acggcgggga 1754401 gttcatcgac ctgatcgtgg gtgggtcgac ctatgatctg gctgcggcgg cggcccttga 1754461 ggacgtggaa cgcaccgggg ccacgctggt accgccgttt gacgacctgc gcaccatcgc 1754521 cggccagggc acgatagccg tcgaagtgct tggccagctc gaggacgagc cggacctggt 1754581 ggtggtcccg gtgggtggcg gcggctgcat cgcggggatc accacctacc tggccgagcg 1754641 gacgaccaac accgcggtgc tgggcgtcga gccggctggt gcggccgcca tgatggccgc 1754701 gctcgcggcg ggcgagccgg tgacgctgga ccatgtcgac cagttcgtcg acggcgccgc 1754761 ggtgaaccgg gcgggcacgc tgacctatgc cgcgctagcc gccgccggcg acatggtttc 1754821 gctcaccacc gtcgacgagg gtgcggtgtg cacggcgatg ctcgatctgt atcagaacga 1754881 gggcatcatc gccgaaccgg ccggtgccct gtcggtcgcc ggtctgttgg aagccgacat 1754941 cgagcccggg tccaccgtgg tgtgcctgat ttcgggcggc aacaacgacg tgtcccgtta 1755001 cggggaggtg ttggagcgct cgctggtcca cctgggcctc aagcactatt tcctggtcga 1755061 cttcccgcag gagcccggtg cgctgcgccg gtttctcgac gacgtgctcg gacccaacga 1755121 cgacatcacc ttgttcgagt acgtcaagcg caacaaccgg gagaccggtg aggcgctggt 1755181 gggtatcgag ctgggatcgg ccgcggatct agacggtctg ctggcccgga tgcgggcgac 1755241 cgacattcac gtcgaggcgt tggaaccggg gtcgccggct taccgctatc tgctgtagcg 1755301 aggcgtcggc gcgaccgtgc cgacaaacct cgcatgtgta tcgttggtgt atgtcgcgca 1755361 ccaacatcga catcgatgac gaacttgccg ccgaggtcat gcgcaggttc ggtctgacca 1755421 ccaagagggc ggcggtcgac cttgccctac gacggttggt cgggtcgccg ttgagccgtg 1755481 agtttctgct cgggctggaa ggcgtcggct gggaaggcga cctggatgac ttgcgaagcg 1755541 atcgcccaga ctgatctcga tgatcctcat cgacacatcg gcctgggtgg agtacttccg 1755601 tgccaccgga tcaatcgccg ctgtcgaagt acgccggctg ctgtccgaag aagcagcgcg 1755661 aatcgctatg tgtgagccca ttgcgatgga aatcttgagt ggcgcgctcg acgacaacac 1755721 ccacacgacg ctagagcggc tcgtgaatgg cttgccgtcg ttgaacgttg atgacgcgat 1755781 tgactttcgt gctgccgcgg gtatctatcg cgccgcccgg cgcgccggcg aaacggttcg 1755841 aagcatcaac gactgcctca tagcggcgct cgcgatccgc cacggtgcgc gtatcgtcca 1755901 ccgtgacgcc gactttgatg tgattgcccg gattaccaac ctgcaggccg catcgtttcg 1755961 gtgagcatgc cgccccagca tcaggccggc tccgcagccc gcagtatcgc aagcgaatac 1756021 gctgctagct cggtggaatt atcgccgata atcggcgact cccaggccag caccagctca 1756081 ccgctgaccg gcacgcaggt tggctcggca ccaaggttgc aggcgatcat cagctggccg 1756141 cggcgcatca caacccagcg ttgctgctcg tcgtagtcga ccataaggtg gtccagccag 1756201 gggtccgcaa ggtcggcctc gttgtgccgc aaagcgatca gatcgcgata aaaccggtgc 1756261 aacctggcgt gttcgccgga gccggcttcg gcccagttca gcttgcagcg ctggaatgtc 1756321 tgcgggtcct gcgggtccgg aatgtcgtcc gcggcccagc catgttcggc gaactcctcc 1756381 ttgcgtcctg ccacggtgct atgggccagt tccggttcgg gatgtgagca aaagaactga 1756441 aacgggctgg aggcccccca ctcttcgccc atgaaaagca ttgcggtata gggagatcca 1756501 agggtcaacg ccgccttgat cgcgagctgg ccaccggtca ggtattgcga tgggcggtcg 1756561 ccgagagcgc ggttgccgac ttggtcgtgg gtgcaggtgt aggcgagcag cctggtggcc 1756621 gggatcgcag aagtgtccaa tgcacgcccg tgccgacgac gccggaacga cgaatacgtg 1756681 ccggcgtgga agtagccgtt gcgcagcgtg tacgcgagag tggccagcga gccgaaatcc 1756741 gcatagtagc cttgccgctc accggatacc gcggtatgga tggcgtgatg gatgtcgtca 1756801 ttccattggg cggtgatccc gtagccgcca tggctgggcc gggtgatcag ccgcgggtcg 1756861 tttcggtcgg tttcggcgat cagcgacaac ggacggccca actggcctga cagccagcgg 1756921 gtcgcgttgg caagctcctc gaggacatgc acggcggtgg tgtccaccag tgcatgcacg 1756981 gcgtccaacc gcaagccgtc ggcgtggaag tcgcgcatcc atcgcagcgc gcagtcgatg 1757041 atatagtggc gaacctcgtc ggagtcggcg ccggcgatat tgatgccgtc cccccacggg 1757101 ttgctggccg acgacaggta cgggccgaat cgcggcaggt agttgcccga tgggccgaga 1757161 tggttgaaca ccgcgtcgat caacacgccc aaacgacggg tatggcatgc gtcgatgaac 1757221 cggaccagac cgtcggggcc gccgtagggt tcgtgcacgc tgtaccacag cacaccgtca 1757281 tatccccaac cgcgggttcc ggcaaaggaa ttgaccggca tcagctcgac gaagtcgatt 1757341 ccgagatcga ccaggtaatc cagcttttcg atggcggcgt cgaacgtgcc agccgtggtg 1757401 aacgtgccga tgtgcaactc gtagatcacc gcgccctcga ccgaccgccc cggccagcca 1757461 gtgtcggtcc gggcagcacc aaactggccg ggcggctccc accgctggga gcgtgcgtgc 1757521 accccgtcgg gttggcgggc cgatcgcggg tcgggtagca cggtggggtc gtcgtcgagt 1757581 aggtatccgt agcgggcgtc cgccggcgcc gccaccgtcg tgtgccacca gccgtcggct 1757641 gagcgggtca tcgcatgtac cgcaccgttc acgtcgagcc ggaccagcgc gggtttgggt 1757701 gcccatactc ggaattcagg cattgtcgcg caccagcagc accacaggca gatccgcgaa 1757761 cagctcgacg gccggcgtgt gcccactggc cgtgaatccg gtgagggcat ctgtccacga 1757821 cccgtcgggt aggggcagta cggtgtggtc ccagccggtt tgctgcaggc gcaccgtcca 1757881 gcgggtcacc gcgaccagga tgtcgtcacc gcggcggaac gcaacgacgt ggtcggcggc 1757941 cggcccggcg gcgaacaccg gatggtatgc gccgcccagg aagctctccg gatgggtgcg 1758001 ccgcagtcga agcgccgcgg ccaacacccg aatcttaggg tgctgcaagg ctttcagagc 1758061 gacacgccgg gtgccgtagt cgacgggacg gcggttgtcc gggtcgacca ggctgtcgtc 1758121 ccacagttcg ctgccctggt agacgtcggg tacgccaggc acggtcaacg cgagcagctt 1758181 agcggccagc gcgtcgcttt cggcatgcga gttgaggtgg gccacaagtc cggtcagctc 1758241 ggacgccagc ggtccgtcga gcaccagatc aagccagccg tgcacgtcgt cctcgaacgc 1758301 ccggttcggg ttgtgccacg aggtgtgcca tgccgcctcc cggatcgcct tctcggcgta 1758361 agtgtgcagc cggccgcgca gcgcggcgct gacctctcca ctcactggcc acactccgaa 1758421 gacgttctgc cacagaaact gtccagtcac ggcatcaggg gcgggcgcaa tggcttgggc 1758481 gtggccgatg aacttggccc acagccacgg cacttgggac agcacgccga tgcgggcacg 1758541 cacgtcctcg ccgcgtttgg tgtcgtgggt ggacagtgtc gtcatggacc gtggccacaa 1758601 ccgagcacgg gtggcggccc ggtgatgaaa ctccgcggcg cccacaccaa accgagtgct 1758661 ggcggtcgtg cacagcgggg ccggtgccgt ccccgctgcc gggggcgacg ggcagcgcca 1758721 ggtcgcccag ccgcagcagg tcgccgtcga ctctgaggtt ggcaacgtcg ctgtcggagc 1758781 ccaatagcgg caggatgatc cggccatcac ctagctccca gtcgatgtcg aagaactcgg 1758841 catacgccga ggaccggccg aacttcaaga catcccacca ccacgcgttc tgctcgggct 1758901 tgccgacgcc gacatggctg ggcacgatgt cgacgatcag gcccatgccc cgcgaccgcg 1758961 ccgccgcgga taaccgcgct aggccgtcag agccaccaag ctcgggtgac accgtcgtcg 1759021 gatcggtgac gtcatagccg tgggtcgacc cgccgaccgc cgtcaaaatg ggggacaggt 1759081 acagatgcga taccccgagg tcgtcgaggt agtccagcag gttctcggca tcggcgaagg 1759141 tgaacccgaa tccgttcgac cgaccgcgca tctgcacccg gtaagtggaa ataaccggaa 1759201 atgccatatt tcacaacgtc ttacgcagga ccagcagcga gcgcgcaggt accgaaaacg 1759261 tgtcagtggc ggttaccgtc aggtcgatgt caccgacggg atcgttggta tccagctctc 1759321 cggtccactg ctgcgcatag ccgtcatgcg gcatcacgaa ctccacgtcg tggtcatggg 1759381 cgttgaagca caacaggaat gaatcgtcga ctactcgctc accacgggcg tccggtgcgg 1759441 taatggcttc accgttgaga aacaccgcaa cacacctgtc gaagcctctg ccccaatcct 1759501 cgtgcgtcat ctcccgaccg ctcggtgtca accaggcgat atcgcggact tcgtcgccac 1759561 tgcggatcgg ttcaccctca aagaaccggc gtcggcgaaa caccttgtgg ttcttgcgca 1759621 aggtcgtcgc cttgcgtgcg aaagctagca gatcggcatt cttgtccacc aatgaccaat 1759681 ccatccaaga taattcggag tcctggcagt agacgttgtt gttgccgtat tgggtgcgcc 1759741 caatctcgtc gccgtgggcg atcatcggcg tgccctggct gaccataagc gtggcccaca 1759801 tgttgcgcat ctggcgggca cgcagcgcca agatgtcggg gtcatcggtg gggccctcga 1759861 caccgcagtt ccacgatcgg ttgtagcttt ccccgtcgcg gttgttctcg ccattggcct 1759921 cgttgtgctt gtcgttgtac gagaccaggt cgttgagtgt gaacccgtcg tgggcggtga 1759981 cgaaattgat actggcactg ggccggcggc cggttgcttc gtagaggtcc gacgacccgg 1760041 tcagccggga ggcgaattcg cctagggtgg ccggctcgcc tcgccagtag tcgcgcacgg 1760101 tgtcgcggta cttgccgttc cattccgtcc acagtcctgg gaagttgcca acctggtagc 1760161 caccttcgcc gacatcccat ggctcggcga tcagcttgac ctgactgacc accggatctt 1760221 gttgcaccag atcgaagaat gccgacagcc ggtcgacgtc gtgcagctcg cgggccagcg 1760281 tggacgccag gtcgaaccgg aacccgtcga cgtgcatttc gatcacccag tagcgcagcg 1760341 aatccatgat cagctgcagg gtgtgtgggt ggcgggcatt gaggctgttg ccggtaccgg 1760401 tgaagtcctt gtagaacctc aagtcgtggt ccatcagtcg gtagtaggcg gtgttgtcga 1760461 ttccgcgaaa gttgatcgtc ggacccaagt ggttgccttc agcggtgtgg ttgtagacga 1760521 cgtcgaggat gacctcgatg ccggcttcgt gcaggctgcg caccatggtt ttgaactcgg 1760581 ctaccgcgct gccggcttgc cgggtcgacg cgtattgatg gtgcggggcg aagaatccga 1760641 aggtgttgta accccagtag tttcgcaagc cgaggtccag cagccgggag tcgtgtagga 1760701 actggtgcac cggcatcaac tcaacggcgg tgacgttgag ctcgttgagg tggtcgatga 1760761 tcaccgggtg ggccaggccg gcgtaggtgc cccggagttc gggcgggata ctgggatggg 1760821 tctgtgtcat gcctttgaca tgcgcttcgt agattacggt ctcgtggtac ggggtgcgcg 1760881 gcgaccggtc gtatgcccag tcgaagaacg gattgatcac gacgctggtc atagtgtggc 1760941 ccagcgagtc gaccatcggg ggagtgctgt ccgggtcgac ggcgttgacg tcataggaat 1761001 acagcgcctg cccgaaggtg aaatcgccgt ggaacgactt cccatacggg tcgagcagca 1761061 gcttgctggg gtcacaccga tggccggccg ccgggtcgaa cggcccgtgc acacgaaacc 1761121 cgtagcgctg gccgggggtg atgttcggca gataggcatg ccagacgtac ccgtccacct 1761181 cgtcaagcgg gatccgcgac tcgacgccgt cctcgtcgat cagacatagc tcgaccttct 1761241 cggcgatctc ggagaacaac gaaaagttgg tcccggcgcc gtcgtaggtg gctccaagcg 1761301 gataggcgtt gcccggccac accgtgggta gagcgggccc ggtcccgtcg gactccccgg 1761361 cgttgttcga cgacatcaca cgaccttatc caggttctcc ggcgggtgta ggcgtcacca 1761421 ccagtcggtg ttcgccgcga tttgccgacc gagctcgctg gtcatcgtcc gcatgtaggt 1761481 gggggtcagg tgatgactgt cgcggtacac cagaacattt ccctcgaccg cgcggcaggt 1761541 gtcggtccgg catatcgcgt cggacatatc gagtggctta agcagcggga accgcgcaac 1761601 gaagtcgagg gttggattcc gatcgaccag caccttggac cgcgcgatcc cacacgactg 1761661 cggattgccg cctttggcca ggcagtccgc agggatgaac ggttggccgt ccttgaccag 1761721 ccaaggggta tcccgcatcg cgagaacggg aatgttgttg tcggcgaacg tttgccagat 1761781 cccgacatag gttgctggca tcacatcgcc gggtttgatg ttccacggtc gagtcgaggt 1761841 tgtgaaaacg tagtcggggt ggtcagcgac caacttggcc atcgccgctt gcacccactg 1761901 gtgacactgc ggatagggag cgttattgcc catgatcagc gggacttcct cggtggacaa 1761961 cgggcaaccc attttgaggt acgtcaccac cttgaagtgg tgcatgcgac ccagcagatc 1762021 cagtgcggtc agccagtgtt cggcgtgtga acccccggcc agtgcgatgg tccggggtgc 1762081 gtccacatcg ccgtaggtgc agttgatgat cgccgggttg acgaagtcgc tgatgcagcc 1762141 gtccttggtc gaggtcggca ggtcgtgacg gacttccagg acggttgggc gcatccgcag 1762201 cttgggcacc cggacgtggt cgatcagggc ccgcgccccg ggatagtcgc gggagctcaa 1762261 cccgctcaac tctttgccgg cggcgcgctg gacgatgacg tgctcacgcc acgtgaacga 1762321 ggtcgcggta agagcgacgc caagcagtgc caccacagat cccagcacga tcgttggccg 1762381 acgcagccgc agccgccagg gaatcggcgg gaccgccgcc ggcgatctca cgccggcggg 1762441 tgcccgatag cgtaatgggt cttcgacaag ccgggtggtc aggtatgcca gcaacccgga 1762501 taccagcagg actgccgcgc cttcgacaaa gttggcgtgc cggtgcccgg tgtaggagag 1762561 ccagaagatg agcagcggcc aatgccacag ataccaggaa taggccatcg cgcccagcgc 1762621 caccaacgga gcggtggcta gcaggcgatt gggcagtggc agccggtcgc gggtaccggg 1762681 atggccctgc cggttggctc cggcaaggat catcagcatc gtggctccga cgggtaccag 1762741 cgcccacggc cctggaaatt ccttgacacc gtcgatcagg gcgccgcacg acagtatcgc 1762801 cgccagcgcg gcggtggcca ccgcggtgcg cagccacatc ggccagcgca catggggcac 1762861 cacagcgccg accagtgctc ccgccaacaa ctcccaggcc cgcgcgaagg tgttgtagta 1762921 agcggtcgcc tggtaggcgt gatgcgcaac gatggcatag atgaatgagg ccaacgtcaa 1762981 cgtgctcaat aacaccacaa acatcgtccg caggtacggg gcccgcgggc cccgaaacag 1763041 tctgcgcagc aagtaggcgc acccggcaac aagcagcagg aaagcgagat agaactgacc 1763101 ctgcaccgac atagaccaga tgtgctgcaa ggggctcacc gcttcaccgg ctcgcagata 1763161 gttggagacc gtgctagcca gctcccaatt ctggtaatac cccaagctgg ccaggctctg 1763221 gttggcaaac gcttcccacc gcgtctgcgg ttgtattgcg atggtgagca gcgcgcagcc 1763281 ggcgaggacc acaaccagtg ccgggagcag ccggcggatg agtcggatca cttcggctat 1763341 aggcgagagt gacagatccg ggttgagggc ggcgcgaagt attttcccgc caaagaagaa 1763401 gccggacagc gccaggaaca cgtctactcc gccggaaacc cggccgaacc aaacgtggaa 1763461 cactgccacc agggcgatcg cgacaccgcg caatccgtcc aggtcgtgcc ggtaaaagcc 1763521 ggtcgtacgg gtccccatgg taaccggggg caaggccggc tccggggtca aggccggtgg 1763581 gcgaggcggc gacagggtca acatggttga cagttaattt acccaaacca gcctcctgct 1763641 tcgcgcgctg agcagcggga agcaggaggc gggtttggga ggcgagaaag caagcgggac 1763701 cgttagcgtg agcgcgcggt gccgaaggga ggcggctgga cgggcgcttg ctggacgggc 1763761 gcttgctgga ccggcgcctg ttggaccggc gcctgttgga cgggcgcctg ttggacgggc 1763821 gcttgctgga cgggcgcttg ctggaccggc gctggctgga ccggcgcctg ttggacgggc 1763881 gtcggctggg tcccgagaac ccggaccagg taaggcgtca tgccgttggt gcgcaccggc 1763941 gaaacctgga cgacgtcgcc cacctccagc atctggccct tcccgaggta taacgcgacg 1764001 ctttgcgtgc cttcggggcc gtagaagatc aggtcgccct tgcgcgcttg ctgcggcagg 1764061 accttttgcc caaccttgta catctggccg gaagaacgcg gcagctttag cccggcaccg 1764121 gcataggcgt actggatcaa accggaggcg tcgaacccga cggtgttgat gccggtaccg 1764181 gtgccgcgcg tggggccgct gatgccgccg ccggcccagg agaacggcac gccgcgctgc 1764241 gacagcccgc gcgcgatcac gacgtcggtg atctgttgat aatccaccgg ccgcgtggcc 1764301 gggtctgcgg ccgcaagacc gggcgcggcc accatcgggg cgagcatcat tgccagaccg 1764361 atcgcgaagg agccgctttt catgctgcgt ttcatggggt tgtaacctcc ttggcactct 1764421 cgggtggtgt gtgcctcagc acgtgacttc accgtctgcc attccagccg gaagtcactt 1764481 tattcacacc aatcactaca gacactttga caacagatgc cggccgcgtc catagctggc 1764541 cagatccacc agaagtcttt ttgccgtaac gtgaccggac ggtgactgcc gcgctcaatc 1764601 tttgatcggc agttgtgatt tcagtcacgc gcgattaatg ccaatagcgt tcgctgaatc 1764661 ccgctatcgc gtagcccgcg atggaggtga cggtgatgac ggcgatcgag atgatcggcc 1764721 agaaggacat gtaccagccc ttcaacatgg aaacgaaggg gccgatgacg gctgtggcga 1764781 tggccgcacc gatccctccc cacatcaccg ggtagatgta atagttgacg ccgaacggta 1764841 ccagcgggca agcatccggc gggcacacgt tgtcggtgaa ggcgaagagc cgtgatggcc 1764901 agctagtcat cgtgaccatg accagaaata ctgccaatat cactacggta cataccacgt 1764961 cccagggcgc tatccgcagc gtgagtaccc gtgggggtgt ccgctcgtcg ggctcgtcta 1765021 gagccgaccg ggattcggtg ccagcatcgg gctgagtgtc ttcaggctga ttcggcggtg 1765081 ccatgcatgc atgctccccg atggcagagg ttttggcgac cgttactggg atgggccgtg 1765141 gcgtggctgc attaccctcg atctccatgg ctgcggcgac tggcgggttg acgcccgagc 1765201 agatcatcgc ggtcgatggc gcccatctgt ggcaccctta cagctccatc ggcagggaag 1765261 ccgtgtcgcc ggtggtggcc gtcgccgccc acggagcgtg gttgacgctg attcgcgacg 1765321 gccagccgat cgaggtgctc gacgcgatga gctcctggtg gaccgcgatc cacgggcacg 1765381 gccaccccgc tctggaccag gcgttaacca cccagttgcg ggtgatgaac cacgtcatgt 1765441 tcggggggct gactcacgag ccggcggccc ggctggcgaa gctgctggtc gacatcaccc 1765501 cggcgggtct cgacacggtg ttcttcagcg actccggctc ggtgtcggtg gaagtcgcgg 1765561 ccaagatggc gctgcagtac tggcgcggcc gcggcctgcc cggcaagcga cggctcatga 1765621 cctggcgcgg cggctatcac ggcgacacct tcctggctat gagcatctgc gacccgcacg 1765681 gcggcatgca ctcgctgtgg accgacgtcc tggccgccca agtgttcgcg ccacaagtgc 1765741 cacgggacta cgatcccgcc tacagcgcgg cgttcgaggc gcagctggcg cagcacgccg 1765801 gcgagctggc cgcggtggtc gtggagccgg tcgtgcaggg tgcgggcggt atgcgttttc 1765861 acgacccgcg ctatctgcac gacctgcggg acatctgccg ccgttacgag gtgctgctga 1765921 tcttcgatga gatcgccacc ggcttcggcc gcaccggcgc gttgttcgcc gccgaccacg 1765981 ccggggtgag cccggacatc atgtgtgtcg gcaaggcgct caccggcggc tacctcagct 1766041 tggccgccac cttgtgcacc gccgacgtcg cgcacaccat cagcgccggt gcggccgggg 1766101 cgctgatgca cggccccacc ttcatggcca atccgctggc ctgtgcggtc tcggtggcca 1766161 gtgtggagct gctgctcggc caggactggc gcacgcgcat caccgaactg gccgccgggc 1766221 tgaccgccgg cctggatacc gcccgggcgc tgcccgccgt caccgatgtg cgggtgtgcg 1766281 gcgcgatcgg cgtcatcgaa tgcgaccgac cggtcgacct ggccgtcgcg actcccgcgg 1766341 cgctggatcg aggcgtgtgg ctgcgcccgt ttcgcaacct ggtctacgcc atgccgccct 1766401 atatctgcac accggccgag atcacgcaga tcacctcggc gatggtcgag gtcgcacggc 1766461 tcgtaggctc actgccatga aagccgccac gcaggcacgg atcgacgatt caccgttggc 1766521 ctggttggac gcggtgcagc ggcagcgcca cgaggccgga ctgcggcgct gcctgcggcc 1766581 gcgtcccgcg gtcgccaccg agctggactt ggcctccaac gactatctcg gtctgtcccg 1766641 acatcccgcc gtcatcgacg gcggcgtcca ggcgctgcgg atctggggcg ccggcgccac 1766701 cgggtcgcgc ctggttaccg gcgacaccaa gctgcaccag caattcgagg ccgagctcgc 1766761 cgagttcgtc ggcgctgccg cgggattgct gttctcctct ggctacacgg ccaacctggg 1766821 cgccgtggtc ggcctgtccg gcccgggttc cctgctggtg tccgacgccc gttcgcatgc 1766881 gtcgttggtg gatgcctgtc ggctgtcgcg ggcgcgggtt gtggtgacgc cgcaccgcga 1766941 cgtcgacgcc gtggacgccg cgctgcgatc gcgcgacgag cagcgcgccg tcgtcgtcac 1767001 cgactcggtg ttcagcgccg acggctcgct ggcgccggtt cgggagttgc ttgaggtctg 1767061 ccggcgtcat ggtgcgctgc ttctggtgga cgaggcgcac ggcctgggtg tgcgtggcgg 1767121 cggacgcggg ctgctctacg agttaggtct agcgggtgcg cccgacgtgg tgatgaccac 1767181 cacgctgtcc aaggcgctgg gcagccaggg tggtgtggtg ctcgggccga cgccggtgcg 1767241 ggcccatctg atcgatgctg cccggccgtt catcttcgac accggtctgg cgccggcggc 1767301 ggtgggtgcc gcacgggccg cgctgcgcgt cttgcaggcc gagccgtggc gaccgcaggc 1767361 ggtgctcaac cacgctggtg aacttgcgcg gatgtgcggt gtggctgcgg tgccggactc 1767421 ggcgatggtg tcggtgatcc tgggcgagcc ggagtcggca gtggccgccg cggcggcctg 1767481 cctggacgcc ggggtcaagg tgggctgctt ccggccgccg acggtgcccg cgggtacgtc 1767541 gcggctgcgg ctgaccgcgc gcgcatcgct gaacgccggc gagctcgagc tggcccggcg 1767601 ggtgctgacg gatgttctcg ccgtggcgcg ccgttgacga tcctggtcgt caccgggacc 1767661 ggcacggggg tcggcaagac ggtcgtctgc gcggcgctgg cgtcggccgc acgtcaggcc 1767721 ggcatcgacg tggcggtgtg caagcccgtt cagaccggca ccgcccgcgg tgacgacgac 1767781 ctcgccgagg tcggccggtt ggccggggtg acccagctgg ccggcttggc gcgatatccg 1767841 cagccgatgg ccccggccgc cgccgccgaa cacgccggga tggcgttgcc cgcccgcgat 1767901 cagatcgtgc ggctgatcgc agacctggac cgtcccgggc ggttgaccct cgtcgagggg 1767961 gcgggcgggc tgctggtcga actcgccgag ccgggcgtca cgctgcgcga tgtcgccgtc 1768021 gacgtggccg ccgcggcttt ggtggtggtc accgcggacc tgggcaccct caaccacacc 1768081 aagttgacgt tggaagcgct tgctgcacaa caggtttcat gtgcagggct ggtgatcggc 1768141 agctggccgg acccgcccgg gttggtggca gcctcgaatc ggtccgcgct ggcgcgcatt 1768201 gctacggtgc gggccgctct gcccgccggg gccgcgtcgc tggatgccgg ggacttcgcg 1768261 gcgatgagcg cggcggcgtt cgaccgcaac tgggttgccg ggctggtcgg ctgatggtgc 1768321 attcgatcga gctggtcttc gacagcgata ccgaggcggc gatccggcgc atctgggcgg 1768381 ggttggccgc cgccggcata cccagccagg cgccggccag ccgtccgcac gtgtcgctgg 1768441 cggtggccga acggatcgcc ccggaggtcg atgagccgct gggtgcggtt gcccgtcggc 1768501 tgccgctgga ctgcgtgatc ggcgcgccgg tgctgttcgg gcgggccaat gtcgtgttca 1768561 cccggctggt ggtgccgacc agcgagcttt tggccctgca tgccgaggtg caccggctct 1768621 gcggcccgca cctggcgccc gcgccgatgg ccaacagcct gcccggtcag tggaccgccc 1768681 atgtcaccct ggcccgacgg gtcggtggtc accaattggg gcgggcgctg cgcattgcgg 1768741 gacggccgtc gcggattgac ggtcggttcg ccggcttgcg ccgctgggac ggcaacacgc 1768801 gtgccgagta cctgctgggg tgaggcgggc ccaaaaagct tgatggcgaa ggggtttgat 1768861 cgcaacttcg tcttaatggc cagctcgcgg gttcgggcgg gtgctggcca ggtggcgagg 1768921 acgcacgtcg atgtggggat gtccaaagat cttcgcgggc ggcgattctc acggatcgtc 1768981 gtggttgtcc tcgtcgttgt ggcgtagcag cttctcgtgg tggtggaagg tgttggtgcg 1769041 gggttggccg tggactgctg aagaacattc cacgccagga gatcaaccat gaccaccaca 1769101 ccagcacgtt tcaaccactt ggtgacggta accgacctgg aaaccggtga ccgcgccgtc 1769161 tgcgaccgcg accaggtggc cgagacgatc cgggcgtggt tcccggacgc gcccttggag 1769221 gtgagggaag cgctcgttcg gctgcaggcc gcgttgaatc ggcacgagca caccggcgag 1769281 ctcgaagcgt tcctgcggat cagcgtcgag cacgccgacg ccgccggcgg cgacgagtgc 1769341 ggcccggcga tcctggccgg ccgctccggg ccggaacaag ccgccatcaa ccggcaactc 1769401 ggactcgccg gcgacgacga gcccgacggc gacgacaccc cgccgtggag ccggatgatc 1769461 gggcttggcg gcggaagccc agcggaagac gagcgctgac ggtgaacacc gcggcaacag 1769521 gacgctgggc ggtcccacgg gcggggcatg gatagcttcc ggcccatggg ccggaagcta 1769581 tctcggagaa acaaatggcg ccgctggccg ccggatcgcg gagctggagc ggccgaaagc 1769641 caagcagcgg cagcgcgagg ggcaggatca tggccgccag gctcgatatt ctggtttggg 1769701 gcccatgggc tacaaaccag aatcagagcg tcattcgacg aaaacagaca ctgctatcgg 1769761 cgcagccctc ggcatctccg ccggcaccta ccggcggctc aaacgaatcg acaacgcaac 1769821 ccacagcgac gacaaagaaa tccgccggtt cgcggagaaa caaatggcgc cgctggtcgc 1769881 cggatcgccg agctggaacg cccgaaagcc aaggagcgcc aacgcgaggg tggtcgcctc 1769941 ggtgcatcga tcaccaatgc cggctttggt cccatggaac caaagccgtc tcagcgccac 1770001 actgacaagg aggtaggcgc agccctcggc atctccgccg gcacctacaa gcggctcaaa 1770061 cgaatcgaca acgcaacccg cagcgacgac aaagaaatcc gcctgttcgc ggagaaacaa 1770121 atggcgccgc tggccgccgg atcgccgagc tggaacggcc gaaagccaag cagcggcaac 1770181 aggaaggcgg cgaccatggc cgccaggctc gatattctgg cttggggccc atgggcccaa 1770241 gccagaatcg gagcgtcgtt cgacgaaaac agacactgct atcggcgcag ccctcggcat 1770301 ctccgccggc acctaccggc ggctcaaacg aatcgacaac gcaacccgca gcgagttggc 1770361 ggcgtgggcg gcccggcacc cctaagcaga ggccgcccac gcctggccct atcctaccta 1770421 cgcggtagtc tccaccttca gaactcgaaa cgcgttgcgc accagcacat ctgatccgac 1770481 cctgaaccag gcgaagaatc cgcgctgccc ggtcggccgg cgattcggcc cgaacaggtg 1770541 aggcaccaac tccaccatgg acccaactct gtcgacgatg aggaattgct tccagtcgcc 1770601 aagcaccagt ggatgattcg tcgctgtcac cgccgaatca acggtgtcca tgtgggagac 1770661 ttccaggaca gacttcccgg ctagcatcgg cggactgtcg tgcagcgatg ggaatttcag 1770721 cgcgccattc gaagtttccg cctgccgcaa cgtgttgatg gtggacaagt tcgccgcgaa 1770781 cgcggcgctg gcctggaacc ttggcggcag cgccgactgc aacgcgtaaa catccgccgc 1770841 cacaatcgct tctgaccccg cgccgacgac cacctgatcg gaggtgccgg ttagcgcgct 1770901 gacgaacccg gtgggctcgc cgttgccgga gccgttgacg aacgccgcgg tctgcagttg 1770961 ctcaacgctg tccgcgagaa tcttgccgat ctcgccaacg aagctcgccg cgtcaccctc 1771021 cagctcgatg gagaacggaa tccagcagct tccacggtag ttcggcaccg ccggctgggc 1771081 caacgctggc gaatcgtcgg acacctcctg ggcttcggag taccaacgag cttcggcgcc 1771141 ttcggaagtc acgccccgcc aaatctcgga ggtcgtttgc accaccctcg ccacctgccg 1771201 aatcgggttc gtcgacccat cacccgacag caggatcgcc gggtccagcg ccgccgggat 1771261 cagaaacccg ccttgggtgt ccaccaggcc catcgctcgc tgctcggcgg ccaccgcggc 1771321 agcctcacgc cacgcggccg cttcccggtc ggtccaaacc gtgtgccccg caacaggatt 1771381 ggaaacccgc ttgacgaacg cgcccaaata gtcgcggctg ccggtggccg ccagccagcg 1771441 ctgcgcccac gaggtggact gcggcggccc ggtgcggcac aaggtttccg cggtctccgc 1771501 cgcccgcgac gacatcaggc cgtctcgcac acaagaatcc agtgtgcgaa acgcggtgtc 1771561 ccgcaacgag ttgcccggcg gcgcgtcgcc gtcgtcgccg ccggtgggag cgccgggcac 1771621 caccctcagc tcaccggccc ggtagcggcg cagctcctcc tcggcttcgc ggccgcggcg 1771681 gcgctgctcc gcccgcagtt cctcggcgtg gcgcgtcagc gcctgaaaac gctgcgccgc 1771741 ctcaccggtc aggtcgccgg cgacactgtc gaggagctgc ttcgccgcgt cacgggtttc 1771801 aggtaaagag aggtttttga tgtcgtcgaa ttcggtcata gattgttcac caatcgagta 1771861 gggacagcca ggcttcggct gtcgaacggg aaacgactgt aagcgattcc gcgcgcaccc 1771921 cggcgatttg tgcccccgaa taggccggaa cgccggttag ggaaacctct aacagcgccg 1771981 cttcgacgcg caccagcaca tccccttcgc gacggtcccg gatcggtcgg aaacccaccg 1772041 aaaacgagtc gacgacacca gcttttacgt tcgccaaagc ctcgtcgccg tccggggtgt 1772101 ccgcaatctc gaacgccccg aacaagccgt gaggctcctc ccgcaactca acggcccggc 1772161 ccaccgggta gcgggttcga gcgtcgtgag agaccagcag cttcaatttg tggccgcgct 1772221 cggcgatgga gcgccgaaaa gcgccaggag cgaacatttc ctggaactcg ccgtcgaagt 1772281 cgcggacggt ggtcgcctcg ttgtagggca cgatggtgcc gtgcacggtt cggccttcgc 1772341 cagaccgcag ctcggccatg cggaaaagga tgctactcaa aattcggcca ccacctagca 1772401 gacgcaagaa acgcgcggaa tcgcttgtgg cgcatggcgg ccgctatccg ggttccagcc 1772461 gccccgcggc gactgcccgg cgtcagcgga tgccgagatg ccaaactcga ttgtatcaca 1772521 cacaaaaggt catcaccggt ccggggcaaa cgggttgagc ccgtcgccgt cgtcgcccgg 1772581 cgccaccgcc agtcgctgct cggcggccgg ggtcaggcca aactcggagg ccaagcgcag 1772641 cagatgcatg cgcgccgtct ccgcaaccgt caccgccggg ttccggtgca cgacaccgga 1772701 tttcggtgag gtaattgtga ggccttcggc gcggacccgc tgaaccgccg cgacgtagac 1772761 ggaccaggtc tcgcagtacg cggacaggag cgcccgatcc tcaggtttga gcaggtcaag 1772821 ccgctccaaa gtcggtgcga cgcgccgcca ttcggccagc gcctcggcgt cgagccagtc 1772881 cggggcatcc ggtgcctgac ggataaactt cggcgactcg gggactttcc ggccgccgga 1772941 atcgcggccg ggggagcggc cctcaaccag tttgagccgg gccggtttcg gtggtcttgg 1773001 catcggtcct cccatcaatt tttagtctag gtaatgagcg tgcatgcgcg ccggcaccgt 1773061 ggcggtgtcc gggctgggcc tggtcacgat ggcgaccccg ccccctggtc gtcgtcctgc 1773121 tcgataggtc gggcgtctcg cagcgggtcg ttgccgggat acgacgcgtg gaaggcaagc 1773181 cagtcacgat cggcatggac aagaacattg cctccggctt cgaggtaggc gttctcggtg 1773241 tcgccgcgga ggataagtgt tccgtctttc gcgaaggatt cgagtgcttc gaccgccgca 1773301 tcgggtgagc aacccgtctg cgagcagaca acctggactg ccagctcgtg catcagttgt 1773361 cgttcgtcgt tggtcagggg ccggttgatc ggggtcactg gtcgacctct atggtgtcgt 1773421 cggtactgtc ggcgacaccc tcggcgatta ggaacgggca cggcttaccg acgtcgacgg 1773481 gacagttcgc gcgcttccat ttgttgtcgg cgacacccat gacgggttcg ccggtctgca 1773541 ggttcggcag gatgatgtac ggcacggcgt caccacagcg tttgcaggtc gccataccga 1773601 catcggcgtt gaaagccatg tcggcgattc gtcgccgcag ttcggcgtgg tcgggggttt 1773661 cagccatggc ttgtgtcctt tcaagcaggg ttggtaagtg cggttctggc ggcattgagc 1773721 tgctgttgca gtatcgggca tccggttggg gcgtcggggt gcagcacttt ggataaagcc 1773781 ctgtacacgg cgggtgtccg ctggggtccg accgcccgga acaacgcttt ggcccagtcg 1773841 gtgcactgct gttgcgccgg gtcggcgggt ccggtgacgg tgtggccgtg gtagcgcagc 1773901 tcggcggcca gcagtggggt ccagtcagcg tcgatgaacc agcagcgggt gtgcgcggac 1773961 caggagcggg cataggcggg gatcgtggac ttgatcaacg acacgatcgc agagtcgtag 1774021 gcgaatcgga cgctgtgccg accgccggat gccggggtga tcgcgacagc ggtcatgcgg 1774081 caccgccggt gtttgctggt ttggcggtac agtccgcccg gtggcggccg tgaacggcca 1774141 ggtaggtacc gcatcctggg cagaacggca cggcaggccg tgacgcatgt gacgtttgcg 1774201 cggtataacc gccatacgtg cgcgcgcgta ggcgaacctg gaaatgcgtc acatgcgtca 1774261 cgttaggtgt gctaatcatc gaaatcatcg gcccctctca ccgctattcc ggcccgccaa 1774321 cgaccatcac gggccttgtc agtgaccggg tatccgtggg tgtcgagcga ctggccgaac 1774381 gctttgcgcg agatttcggg tacgccttct tgcacccgcc acctttgcca cgcctcgaac 1774441 agatgcgtag tagtggcttt cagcaccggc gagctggtga cgcattcgtc gtcgatgaac 1774501 ctctttatcg tgtcggagtc ctcgcggtaa ttcgacgttg ccgcgagcac cgcgtccggc 1774561 tgggatagtc cgattcgctg atagtcgctc catccggcca ccgcccagga caggatgctg 1774621 tcggcctcca actgcaaccg tgcgtccagt tcccggtcct gctcgtcggc aggaatcact 1774681 acttcaaacg gcaccactcg aattcgccgc cagatggccg tatcatcgcc gggcactctc 1774741 ggtaggtggt tggtgatgag cagtggggta tgtgacggcg tgaattccac gaagtcttgc 1774801 cgcatctttc gggcgcggat ggtgtcgccg ccagtcagcc gttttatcgt tgattcggcc 1774861 agccggcgat ctttttcgct ctcggatacc gctacccatc gcacgccgcg gaggtccatt 1774921 tcgcctgttg ggtgagcgtt ttcccggtgc atgaaaaggt caggctcagc ggtgcaggca 1774981 taatcgccaa gggcatagcg aatcgccttg tcgaacacag attttccgtt ggcacctaca 1775041 ccgataagaa tcgccaggac atgttcgcgg acggtgccta gtaggccgac gccggccagg 1775101 cgttgcacga acccgcgcac accttcatcg ggcagaacgc gggtcaagaa cgcttgccag 1775161 agaggcgatt cggtgtcgga ctggtaggca ccgcggcata tctttgtgat gcggtcagcg 1775221 ggcgcgtggg gccgcaattt gagcgtgtgc aggtccagcg tcccattcgc gacgttgagc 1775281 aagtgcgggt cgctgtcgag gtcggctagc gtcgcggcga atggtaccag tgcggcggcc 1775341 aggtcgagca cgccggccac gccggacgcc gattcgcatt ttcggacgtc ggcgcgtaat 1775401 tccttgtcgt tgaggctgtc tgagagcgct tggcgcagct ctgccagcac tgcacgtttg 1775461 gcttcgccgc ggtcgtcggc tgcccagcgt ctgccgtccc aggagtgcca gccgatcccg 1775521 gccacgtgca gcagcttgtc ctggtaacgt tcggctagcc ggtaggcgat tcgggcttgg 1775581 ccgcgatgaa cttgcgtcgg tttgccaccg tcgtcgatga gcacgtgccc gtcccggtcg 1775641 atccaggggg cgtcgggata gtcggtgccg taggggatgt cggccatcac gccacccccg 1775701 cccgcgggat gtacacgccg cgccgtcgga cgatctcgcg ggcgatgccg ggccagtcgg 1775761 cggccgcaga tacgtcacgt gacgcctgcg ccatcgcctc ctggcacgtc tctaccctca 1775821 gagcccagtg ccgggctgcg tcgcagatcg cggcccattt gcgaggatcg gcgtcgtcga 1775881 gctgacgcca ggccggtgtg ccggccatcg gccacgaccc ggcagcatcc aggaccggcg 1775941 cgacatgctc gtgcaccgac caccacgaca cggcgcgtga cgcggtagga tcggcgctag 1776001 acggtgtggc gactgtcgcg ggtgcccggt cctccgtggc cgagcatcgt cgcgtcggcg 1776061 gcgacccgcc ggcgccggcg gtcatcgggc accgcctgac cgccgcacgg ggcgcagcag 1776121 ctcggccagc cgggtgcgct gctcgtcagt caggggcggc gctgcggcga gggtgcggat 1776181 gaggtagtcc gcgatgttcg cggcaacgag atcggttttc gcggcgatga actcgggatc 1776241 gtcggatgcg cgggaacgag acagtgcggc tacgcggccg cgatgatggt agatggtcga 1776301 cacgtgcgac tccttgggga caccaaaacc ccggagtcga agccggctac gtcggagtct 1776361 agcagctacc acgcgttggg gtggcgcgta gtttgttcgg cgtgtcgctt tcgcagagcg 1776421 tgcgccacag ccacatggcg acgaccaccg cgtccgactg caccgcaaaa cccggtgcgt 1776481 agtcggggtt gccggccagt ccggtcagta gccgcggaac gttctcgcac agtgtttgca 1776541 ggtcgccgtt gcgaacccgc acgggccgct gctcgcacgt gcgcacgccg gcggcgccgt 1776601 acctcaggca ggcgaattcc cagtgcagcc gcacccaccc gtcgcggcgc atttcaggtc 1776661 gtaggcgaga ctcgttgggc ggcaagccaa taacggctcg cggctggttg tcgtcgtcga 1776721 gtaatgttgc caccgctggc gcctgcggta accagccctg ggtctcgtcg acccaaatga 1776781 tcggcgcgac aagatcgcag cgctcgtgtt cgatgcggcc atcgcctttg cggccgtggt 1776841 cacagacgat cacgatgttg tggtgccggc tcatcgccaa ttcacctgca cccgttcggg 1776901 attgaatatc ctgccgctct tgccgaccgg ctggacaacg acttcagcga ggacgtcgag 1776961 gacggcgcgg aaccggtccg gcgacagctc ggctatcatc ccggcgactt gcggtgttcc 1777021 caacggtatc ccgtcgaaca ctcggagccg ttcctgatcc tgttggcggg cctgaagttt 1777081 cgttatcttg gcgttgacga tgtcggtgct gatcttcacc tggcgcgcgg tcagtagccc 1777141 ttcggcgcgt tcgacggcga gcctgtccag ctccccgtag agggtttcca gttccaggcg 1777201 gatggtttcg gcttcggcgg cgtcgtgaat ctcccggcgc aacaagtcaa cggcgtcggg 1777261 catggccagc cgctcggcca cgatgtgata caggatcggt tcgatgttgt cggccaggat 1777321 ggccaccccg tggcacgcct tgcacacgta gacgacctgg ccgtcggtgc ggtagctgcc 1777381 ggccaggtgg ttgccgcatt tgccgcagcc tgccagcccg gtcagcaggt ggcggcgcac 1777441 gcttttgcgg ccgggggcgc ggccgggggc gtccagcacg gcctgggcgg cccagaacgt 1777501 cgcctcgtcc accagcggcg accactgggc cttgccgaca atcgcgtcgc ggtccaccgg 1777561 gccgtagcgg gcacccttat atgcgcgtag tccggcgttg cggggtttgc gcaagaattt 1777621 cgacagcgtt gtagtcgtcc acgggcggcc ggtgatggtg aacgccccgg cgtcgttcca 1777681 ctggcggcac acgtcgccca gggacgcccc ggcgaggatg tcggcgtagg cctgtttgac 1777741 cagcggcgct gtccgggggt cgggttcggg accgttgggg ccgggcaggt agccgaaggc 1777801 tttcgaccag ttggggtggc cgcgttcagc tttctggcgg gcggcgcggc gctgtcgtgc 1777861 cttcttgtgc tcggtttcgt gagcggccac cgaccccttc aggcgggcga ctagccggcc 1777921 ctggggtgtc gccaggtcaa cgtcgccggc gacggtggcc agggccagcc gcttctcgtc 1777981 ggctaatgac atgaaggctt ccagctcgat gggacggcga tggagccggt ccaggtccca 1778041 ggccaccacg gcggcgatct tgccggcggt gatgtcggcc aacatctgct cgtaggcggg 1778101 gcggcgcttg ccggttgatg cgctgacgtc gttgtcgagg tactcgacgg gcacccattt 1778161 tcgctgcccg cacagcttta ggcagtcctc gcgttggcgg gccacgccga gctgttcgcc 1778221 ggagcggtct tctgagattc ggaggtagac agcagcacgc acaggtgtag tgtatctcac 1778281 aggtccacgg ttggccgtgg tcgaggtggg gtggtggtag ccattcggtg tggccgtggg 1778341 tgtttttgcg ggtggtccag cctttttcgg cgagtcggtt gtcggggtcg caggccaggg 1778401 tgaggtcggt gatgtcggtg cgtccggtgc tggtccagcc ggtgacgtgg tgggcttggc 1778461 tgtggtaggc cggtgcgtca cagccgggtt tggtgcagcc gcggtcgttg gcgaacagca 1778521 tgatccgctg ggccggggag gctaggcgtt tggtgtgata cagcgccagg ggtgtgccgt 1778581 ggtcgaagat cgcctggggg tacctcccgc ttgcggggga gtagtggtgg gcgtggctgg 1778641 tcatgcggat cacatcggcc atgggtagca gggtgccgcc gccggtgaag cccttgccgg 1778701 cgccggtttg caggtcggtc agggtggtgg tgaccacgat cgagacggga agaccgttgt 1778761 gttggcccag ttccccggag gcgatcagcg cgcgcagccc ggccagcagc ccgtcgtggt 1778821 tgcgttgggc ttggctgcgg gtgtcgcggt cgatggcggc cgcatcgggg gtggtgtcga 1778881 tgaccggggt gtggtcgtcg gggttggtcg cgccgggggc ggccagtttg gctagcacgg 1778941 cttcaaaggt ggcccgcgct tggggggtca ggtagccact tagccgtgac atgccgtcgt 1779001 attgctggtt gctcagggtg atgccgcgtt tgcgggcgcg ttcggtgtcg gtgaggtcgc 1779061 cgtcggggtg tagccagtcc atgacccgct gggcgtagcg ggccagctcg tcgggacgat 1779121 attgagcggc tttgccggcc aggtcggctt cggcggcctg gcgggtggac acatccaccg 1779181 cggcgggcag gtgggcgaaa aagggcgcga atcactttga tgtgcgcctc gccgatcagg 1779241 ccctggcgtt gggcggtggc ggtggcggtc aactgtgggg ctagcggttc gccggtgagt 1779301 gctcgacgag gtccgagatc ggcggcgtcg gcgatgcgta gggcggcgtc gggcttggtg 1779361 atgcgtaacc ggttggccag cgcgcagcac agcgtgccgc ccagttcttc ctcgctggct 1779421 tgggtgtcga gttggttgat caacgtgtgc ccgaccgccg gtagccggcg caccaagcat 1779481 tccagacgtt ccagagaccg cagccgttcc ggggtggtca acacctcaaa agacacctcg 1779541 tccaagcggt ccagctcggc atccagcgca tcaaagacct cgacaagctc ctcccggcta 1779601 ttcgctaaca tgttcgaatc ataacgtcgg gcactgacaa gaagtcgcgc cgacagctgc 1779661 tagaactggt gttagctaag tgaattcagt gactcgagag ccctcgcgag cttggccgcc 1779721 caccaggtcg gcggggatgc ctaccaggat tcgatcccgc caaccggcaa tctgaccaac 1779781 cgggcataac ccccgccggt gaaccgcagt ttagtgagcg gcttgaggtt gcgggatcga 1779841 cgattcggcg tctgggccgc tgtgtgggat gcctggcggg tcgagtgcga gtgctgatag 1779901 ctgggccgct gccaacgatc cgtgacctcc gcccacgtcg cgtttgtccc cgtgcgcacc 1779961 gctaccgtag cctgaacacc gtttcattca ggccgccgag caggcggcgg atgggttccg 1780021 cgcgtgcgga gatgacgaag gatgcagggg agtacctggt gacgcaagcg gcaacgcgac 1780081 cgacgaacga cgccggccag gatggcggga acaactcgga cattctggtg gttgcccgcc 1780141 aacaggtgct gcagcgcggt gagggcctga accaggacca ggtgctggcg gtgctgcagc 1780201 tacccgacga ccggctcgag gagctgttgg cgctggccca cgaggtgcgg atgcgctggt 1780261 gcggacccga ggtcgaggtc gaaggcatca tcagcctgaa aaccggtggc tgcccggagg 1780321 attgccattt ctgctcgcaa tcggggctgt tcgcctcccc ggtgcgcagc gcctggctgg 1780381 acatacccag cctggtcgag gcggccaaac agaccgccaa gtccggcgcc accgagttct 1780441 gcatcgtggc cgcggtgcgc ggacccgacg agcgattgat ggcccaggtc gcggccggca 1780501 tcgaggcgat tcgcaacgaa gtcgagatca acatcgcctg ctccctaggg atgctgaccg 1780561 ccgagcaagt ggaccaactg gcggcgaggg gggtgcatcg ctacaaccac aacctcgaaa 1780621 cggcgcgctc gttcttcgcc aacgtcgtca ccacccacac ctgggaagag cgctggcaga 1780681 cgctatcgat ggtgcgtgac gcgggcatgg aggtttgctg cggcggcatc ctcggcatgg 1780741 gggagacgct gcagcagcgc gcggaattcg ccgccgagct tgccgagctg ggccccgacg 1780801 aggtcccgct gaacttcctc aacccgcggc ccggtacccc gttcgccgac ctggaggtaa 1780861 tgccggtcgg tgacgcgctc aaggcggtgg ccgccttccg gttggcgtta ccgcgcacca 1780921 tgctgcggtt cgccggtggc cgcgagatca ccctgggtga cctcggcgcc aagcgaggca 1780981 tcctgggcgg catcaacgcc gtgatcgtcg gcaactacct gaccaccctc ggccggcccg 1781041 cggaagccga cctggaactg ctcgacgagc tacagatgcc gctgaaggca ctcaacgcca 1781101 gcctgtaaat ggtggaaatc gtggctggaa aacaacgcgc tccggtcgct gccggcgtgt 1781161 acaacgtgta caccggggaa ctggcggata cggccacgcc gacagcggct cggatgggtc 1781221 tggagccccc ccggttctgt gcgcagtgcg gtcgccggat ggtcgtccag gtccggcccg 1781281 acggctggtg ggcgcgctgt tctcgccacg ggcaggtgga ctcggccgac ttggcgacac 1781341 agcggtgacc gagccacccg gttttggcgg accgtccgag ccttccggtg caccgcggac 1781401 gtcgcggaca cgggcggtcc tgtttgtgat gctgggtctg tcggcgaccg gtgtgttggt 1781461 cggtggcctg tgggcgtgga tcgccccgcc aatccatgcc gtcgtggcca tcacacgcgc 1781521 gggtgagcgg gtgcacgagt atctgggcag cgaatcccag aacttcttca tcgcgccatt 1781581 tatgctgctg gggctcttga gtgtgctggc tgtcgtggca tcggcattga tgtggcagtg 1781641 gcgagagcac cgcggaccgc agatggttgc tgggctgtcg attgggctga cgaccgctgc 1781701 ggcgatcgcg gcgggagttg gcgcgctggt ggttcggttg cgctacggtg cgttggactt 1781761 tgacaccgtg ccactttccc gcggcgacca cgccctgacg tacgtcaccc aggccccgcc 1781821 ggtgtttttc gcccgccggc cgctgcagat cgccctcact ctcatgtggc cggctggcat 1781881 cgcgtcgctg gtatatgccc tgcttgcggc cgggacggcg cgggacgacc tgggcggcta 1781941 tccggctgtc gatccgtcgt cgaacgctcg tactgaagcc ctggaaaccc ctcaggcccc 1782001 ggtgtcctag gagagtcgca gccgcccgcc ggcatccgga gcggaccgtg tctccggtcg 1782061 ggtgtcagcg cttggattca agcggcagat cgtcgaactg gtttaagtct ggcgtgacga 1782121 ggttgtgtgc caggtccgag ttcgcgccgg tatgcgcaga gcgcattggc caggtcagag 1782181 cggacggcgg ctcaacttcc tgccggtgat caccttggcc gcgatcacgg ccagtctcgc 1782241 catgccggcg taggtcatcg ggttgaagat ggtcggccac gtggtccgga cgcggtggtc 1782301 ggtcagtggc ttgccggcga accggtcggt gagccagcga agcgtcattg gggccgacag 1782361 cgggtgcagg gacacatgtt cgctgaacag gtcgcggtgg taggtgacgt tggcgccgcc 1782421 ggctgtatag ctgtcagcga gcgcgtcgat gtcagagacg tcgatgaggt agtcatgcac 1782481 ggcctgcacg atcaataccg gcggggtggg caccgcgcta cccagcttgg tgtcgccgaa 1782541 gacatgggaa acctccggcg tcgacagaat gtcctcaagg ggttcgtcga ggaagtcacc 1782601 catgtccctg ccggccatcc ggatcactgc gtctaccgtt gtcatctccg tcagttgctc 1782661 cagcagctga cgtccttcgt cgttggcgtg ctccttgatc acccgggcca ggccggggta 1782721 gctgtgttgc agcgcggcca ccaccaacgc gggcagaccg gcaagaagag tgccattgag 1782781 ccggcggaac gtgtgaccaa ggtcaccgac gggtgatccc agcacggcgc cgacgatgtc 1782841 taggtccggt gcgtactcgc cgcatgcttc ggcggcccac gcgctggcca gcccgccgcc 1782901 ggagtagccc cacagcccga tcggcgttgc cggggacaac ccgacacgct cggaattcaa 1782961 ggcagcccgg attccgtcga ggactcggta accgggttca tacggcgacc cccacagccc 1783021 tttcggccct tcatggtcgg gtactgatac cgcccatcct tcggcaagtg cggcgctgat 1783081 catcaacagc tccatttggg tcagtgaccc cagggccttg gcccgtcgtc gcagggcata 1783141 tgacggaaaa cagcgcgacg acatggcatc gatcgcacac tggtacgaca gcaaggggca 1783201 ggtctgaccc ggggcaagct ccgctgggac gatcaccgtg gtcaccgtcg cctcggggtt 1783261 gccgtacatg ttcgtggtcc ggtacagcag ctgggtagcg gtgacgggct gcggaatcaa 1783321 gcccataaac gccagttcga catcgcgcga gcgcaacacc gttccgggca cggcatgctg 1783381 gtagccggca ggtgggaagt agaacggatc gtcggatggc agcagcgggc gcactttgcg 1783441 ctgcaattcc tcgtgcggtg gccggccgat ccattcggcg ccggtcgcgc ctgccaaatt 1783501 gccgggctct accattaggc tcccttcatg gccatccggc atcctcgcgc gtgatcggtc 1783561 cctgacgggg tagcagcgcg gtttgcctgt cgcagttcag cgccggcact caaggtcagc 1783621 gtcggcactc gaatggcgcc agcggctctt atccggctct taaagtctca tacaagttac 1783681 aggatccaag ggccgactcc gaggccagcg cggcgtggcg cctatcacag gttgggtacg 1783741 ccgagttccc ccatcgctgg tgcgaccaga ttcaaagctg gccgggaggc cgcagtgcgg 1783801 cgaactcgtc agtgactctt agctgcgagt cggtaaaccg gtacaacgcc gccgggcggc 1783861 caccgctgcg gccggactgc gcgatggttc cggtttgggt gatgactctg cgacgggcca 1783921 gtacccgctg caggttggtt gcgtcgacct ggtagcccag tgcggcgccg tagatgtcgc 1783981 gcagcgttga gagcgcgaat tcttttggag ccaaagcgaa tccgatgttt gtataggaca 1784041 tcttggcaat cagccgggtg cgggcatggg tcaccatcgg accgtgatcg aacgccattg 1784101 gcggcaagga actcaccggg tgccagcggg tgtctgctgg cagctcgggg gtggcggggg 1784161 agggcaccac ccccaggtag gtcgacgcga tcatccggat gcctggcagc cggtgtgggt 1784221 cggaaaacac cgcgagctgt tctagatggg ccaactctcg caggtcgact ttctcggcca 1784281 gttggcgccg aaccgagctg gtcatgtctt cgtcgttgcg tagccgtccg cccggcagcg 1784341 accacgcgcc gcgctgcggc tccttcgcac gttgccacag cagcacattg agctggggtt 1784401 ttgccgcacc gcggctcatg ccaactccgc gcacttgaaa gacgacggcc agcacttcgt 1784461 gggcggtgct accatgggcc atgttttcga ttataagtcg aaaacctgtt ggagcgcgga 1784521 aggggcggca atgactgtgc tgaatcgcac ggacacgctc gtggatgaac tgactgccga 1784581 catcaccaac acaccgctcg gctacggcgg ggttgacggt gacgaacggt gggccgccga 1784641 gattcgccgt ctggcgcatt tgcgcggggc caccgtcctg gcgcacaact accagctgcc 1784701 cgcgatccag gacgttgccg accacgtcgg ggattcgctg gcgctatcgc gggtggccgc 1784761 cgaggcaccg gaggacacca tcgtgttctg cggagtgcac ttcatggccg agaccgccaa 1784821 aattctcagc ccgcacaaaa ccgtgctgat cccggatcag cgggccggct gttcgctggc 1784881 cgattcgatc acccccgacg agctgcgcgc ctggaaggac gagcatcccg gcgccgtcgt 1784941 cgtttcctac gtcaacacca cggcggccgt caaggcgctc accgacatct gctgcacctc 1785001 gtcaaacgcc gtcgacgtgg tcgcatccat cgatcccgac cgcgaggtgt tgttctgtcc 1785061 ggaccaattc ctcggtgcac acgtgcgccg ggtgaccggc cgcaagaacc tgcatgtgtg 1785121 ggccggcgaa tgccacgtac acgccgggat caacggcgac gagctcgctg accaggcccg 1785181 cgcacatccc gatgccgaac tgttcgtgca tccggagtgt ggttgcgcaa cctcggcgct 1785241 atacctcgcc ggcgaaggag cattcccagc cgagcgggta aagatcttgt ccaccggcgg 1785301 catgctcgaa gcggcgcaca cgacgcgcgc ccgccaggtg ctggtcgcca ccgaggtcgg 1785361 catgttgcac cagcttcgcc gggcggcacc ggaagtcgac tttcgcgcgg tcaacgaccg 1785421 cgcctcatgc aagtacatga agatgatcac ccccgcggcc ctgttgcgct gcctggtaga 1785481 gggtgccgac gaagtccatg tcgatccggg aatcgccgcc agtgggcgtc gcagcgtgca 1785541 gcggatgatc gaaatcggcc atcccggcgg tggcgaatga tggccggtcc cgcttggcgg 1785601 gatgcggccg atgttgtcgt gatcggcacg ggcgttgccg ggctggcggc ggcattggcc 1785661 gccgatcgcg ccgggcgcag cgtcgtggtg ctcagcaagg ctgcccagac gcacgtgacc 1785721 gcgacacact acgcgcaagg cggtatcgcg gtggtgctgc cggacaacga cgactcggtc 1785781 gacgctcacg tcgcggacac cttggccgca ggcgcgggcc tatgcgatcc cgatgcggtg 1785841 tactcgatcg tcgccgacgg ctaccgagcg gttaccgatt tggtcggagc tggggcacgg 1785901 ttggatgaat cggtcccggg ccgttgggcg ttgacgcgcg aaggcgggca ctcgcggcga 1785961 cgcatcgtgc acgcgggtgg cgacgcgacc ggcgccgagg ttcagcgggc gctccaggat 1786021 gccgccggga tgctcgatat ccgcaccggc cacgtggcgt tgcgagtgct gcacgacggt 1786081 accgcggtga ccgggctatt agtggtcaga ccggacggat gcggcattat cagcgctccg 1786141 tcggtgatcc tggccaccgg cgggctcggg cacctgtaca gcgcgaccac caatccggcg 1786201 ggctccaccg gcgacggcat cgccctggga ttgtgggcgg gcgtcgcggt cagcgatctc 1786261 gagttcatcc agttccaccc cacgatgctt tttgccggac gcgccggggg tcggcggccg 1786321 ctgatcaccg aggccatccg cggcgagggt gcgatcttgg tggacaggca aggcaattcg 1786381 ataacggcag gcgtgcatcc gatgggtgat ttggcgccgc gcgacgtcgt cgccgccgcc 1786441 atcgacgcgc ggctgaaggc caccggcgat ccgtgcgtct acctcgacgc ccgcggcatc 1786501 gagggcttcg cgtcccggtt cccgacagtc acggcatcct gccgggctgc cggcattgac 1786561 cccgtccggc aaccgatccc ggttgttccc ggtgcgcact acagctgcgg cggcatagtg 1786621 accgatgtgt acggccagac cgagctgctc gggttgtacg ccgctggcga ggtggcccgc 1786681 accgggttgc acggcgccaa ccgcctggcc tccaacagct tgctagaggg tttggtggtg 1786741 ggcggccgcg ccggaaaggc cgccgccgcc cacgccgcgg cggccgggcg ttcgcgtgcg 1786801 acctcgtcag cgacctggcc cgaaccgatc agctacaccg cactggaccg cggcgacctg 1786861 caacgggcga tgagccggga cgcgtcgatg taccgcgccg ccgccgggct gcaccggctg 1786921 tgcgacagcc tatccggagc acaggttcgc gacgtggctt gtcgccgcga tttcgaggac 1786981 gtggcgctca cgctggtcgc gcagagcgtg accgccgccg ccttggcccg caccgaaagc 1787041 cgtggctgcc atcatcgcgc ggagtacccg tgcaccgtgc cggagcaggc acgcagcatc 1787101 gtggtccggg gagccgacga cgcaaatgcg gtgtgtgtcc aggcgctagt ggcggtgtgc 1787161 tgatggggtt atccgactgg gagctggctg cggctcgagc agcaatcgcg cgtgggctcg 1787221 acgaggacct ccggtacggc ccggatgtca ccacattggc gacggtgcct gccagtgcga 1787281 cgaccaccgc atcgctggtg acccgggagg ccggtgtggt tgccggattg gatgtcgcgc 1787341 tgctgacgct ggacgaagtc ctgggcacca acggttatcg ggtgctcgac cgcgtcgagg 1787401 acggcgcccg ggtgccgccg ggagaggcac ttatgacgct ggaagcccaa acgcgcggat 1787461 tgttgaccgc cgagcgcacc atgttgaacc tggtcggtca cctgtcggga atcgccaccg 1787521 cgacggccgc gtgggtcgat gctgtgcgcg ggaccaaagc gaaaatccgc gatacccgta 1787581 agacgctgcc cggcctgcgc gcgctgcaaa aatacgcggt gcgtaccggt ggcggcgtca 1787641 accatcggct ggggttgggt gatgccgcgc taatcaagga caaccacgtt gccgccgccg 1787701 gatccgtggt agacgcgcta cgtgcggtgc gaaatgctgc acccgatctg ccgtgcgagg 1787761 tggaagtgga ctcgcttgag cagctcgatg ccgtgctgcc ggaaaaaccc gagctgatcc 1787821 tgctggacaa ttttgcggtg tggcagacgc agaccgcggt gcagcgtcgg gactcgcgcg 1787881 cgcccaccgt catgctggag tcatccggtg ggctcagcct gcagacggcg gcgacctacg 1787941 ccgaaaccgg ggtggactac ctggcggtcg gggcgctcac acactcagtg cgcgtgctcg 1788001 acatcggctt ggatatgtag ccgggcggcc ccggcgccca ttaggcggcg ccggataggg 1788061 taggcgccgt ggcgcgaacg ttcgaagatc tcgtggccga agccgcatca gcatccgtcg 1788121 gcggctggga tttttcctgg ttggacggcc gcgcgaccga agaacgcccg tcatggggct 1788181 atcaacgaca actcagtcag cggctggcga acgcgacggc tgccttagat cttgagacag 1788241 gcggcggaga ggtgctagcc ggcgcgggca acttcccgcc caccatggtc gctaccgaag 1788301 cgtggccacc caacgcggct atggccacta ggcggctgca tccgctgggc gcggtcgtcg 1788361 tcatcaccgg cgataaaccg ccactgccct ttgccgatgc ggcgtttgac ctggtgacca 1788421 gccgccaccc cagcacccga tggtggaccg agattgcccg ggttctccgg gctggcggca 1788481 gttacttcgc ccaacacgtc ggaccggcca cgctgtggga cctgcgcgag catttcctcg 1788541 ggccgcgaga acacaacggg gccgatcagt acgcgcaggt tgtgcgcacc tgcatcaccg 1788601 acgccggcct cgagatcgtc gacctgcaga tggagcggtt gcgggtggaa ttcttcgacg 1788661 tcggtgccgt catctacttt ctgcgcaagg tgatctggtt tctgccggac ttcaccgtcg 1788721 agggctacca cgatcggctg cgtgcactgc atgagcgcat ccaggccgaa gggcccttcg 1788781 tcacctactc cacccgcgcg ctcatcgagg cccgcaaacc gtcctgacgt cggccggggc 1788841 cttaggctca ggcgatatcg ccgacgaaga ccccgatccg gcgcagctgc aagcgcgcca 1788901 tccgcggcag cccctgaccg tcggagccgg cgggcacaat ccgggggttg atgacgtgaa 1788961 caacgccgcg ggcgaagtgc acgtcggctt cgcccgcggc caacacgttc ttgacccaat 1789021 ccgtcttacc gtgcgcgagc gcgatcgcca gcacaccgtc cttgcggtag gcggtcacaa 1789081 tcgtttggta tggctttcca gacttgcgac cgcggtgctc gatcgtggcc gttccgggta 1789141 ggtagcgcgc tatcggtttg agcgcccggt tgatgtactt gacctgcaga cgctcgagcc 1789201 agagcgggaa caccatcgga acgcccgggg cgttattcgg gtgatccttt gcggacatgg 1789261 cggctcctct ttgccggtcc tttctactgc actgtaccgg tcagatatcg acttgagctg 1789321 ctctgggaga atggtctacg tgaccgcgcc gccgcccgtg cttacccgta tcgacttgcg 1789381 gggagccgag ttgacagctg ccgagctgcg ggccgctctg ccacgcggcg gcgccgatgt 1789441 ggaagccgtg ctgccgacgg tacggcccat tgtggcggcc gtcgccgagc gcggggccga 1789501 ggccgcgctg gacttcggcg catcgttcga cggtgtgcgg ccccatgcca tccgggtgcc 1789561 agacgcagcg ctggacgcgg cgctggccgg actggactgc gacgtctgcg aagcgttgca 1789621 ggtgatggtc gagcggaccc gcgccgtgca ctccgggcag cgtcgcaccg acgtcacaac 1789681 cacactgggc ccgggcgcga cggtcaccga gcggtgggtt ccggtcgagc gggtaggcct 1789741 gtacgtgccg gggggcaatg cggtgtaccc atccagcgtg gtgatgaacg tggtgcccgc 1789801 ccaagccgcg ggcgtcgact cgttggtggt agccagcccg ccgcaggcgc agtgggatgg 1789861 aatgccgcat ccgaccattc tggccgcggc ccggctgctg ggcgtcgatg aggtctgggc 1789921 ggtcggcggc gctcaggcgg tggcgttgct ggcttacggc ggcaccgaca ccgacggcgc 1789981 agcactgaca ccggtcgaca tgatcaccgg gcctggcaac atctatgtca cggccgccaa 1790041 gcgactgtgc cgttcgcggg tgggcatcga cgccgaagcg gggccaaccg agatcgctat 1790101 cctcgccgat cacaccgccg acccggtgca tgtggccgcc gacctgatta gccaggccga 1790161 acacgacgag ttggctgcca gcgtgctggt cactccgagt gaggacctgg ccgatgccac 1790221 cgacgccgaa ctggctggcc agctgcagac tacggtgcac cgcgaacggg tgacggccgc 1790281 gctgaccgga cgccagtcgg cgatcgtcct ggtcgacgac gtggacgccg ccgtcttggt 1790341 ggtgaacgct tacgccgctg agcatttgga gattcagacc gccgatgccc cgcaggttgc 1790401 cagccggatc cgctcggcgg gagccatttt cgtcggcccg tggtccccgg tgagcctcgg 1790461 cgactactgc gcgggatcca accatgtact gccgaccgcg ggctgcgccc ggcattccag 1790521 cggcctgtcg gtgcagacgt tcctgcgcgg catccacgtc gtggaataca cggaggcggc 1790581 cctcaaagac gtttccggac acgtgatcac gctcgccacg gccgaggact tgccggcgca 1790641 cggtgaggcg gtacggcgga ggttcgagcg atgaccaggt ccggacaccc ggttacattg 1790701 gacgacttgc cgctgcgcgc cgacttgcgt ggtaaagcac catacggtgc accgcaatta 1790761 gctgttccgg tacggctgaa caccaacgag aacccgcacc cgcctacccg ggcgctggtt 1790821 gacgacgtgg tgcgatcggt gcgggaagcg gccatcgact tgcaccgcta ccccgaccgc 1790881 gacgccgtgg ctctgcgtgc tgacttggcc ggctatctca ccgcgcagac cggaatccag 1790941 cttggtgtcg aaaacatatg ggctgccaac ggttccaatg agattctgca gcaactgtta 1791001 caggcgtttg gcggtccggg gcgtagcgcg atcggtttcg taccgtccta ttcgatgcac 1791061 ccgatcatct ccgacggcac ccacacggaa tggatcgagg cgtcccgcgc caatgacttc 1791121 ggtctcgacg tggacgtcgc cgtcgcggct gtggtcgatc gcaaacccga tgtggtgttc 1791181 attgctagcc ctaacaaccc gtccggacaa agtgtttcgt tacctgacct gtgtaagctg 1791241 ctggacgttg cgcccggaat tgcgatcgtc gacgaggcct acggcgagtt ctcctcgcag 1791301 cccagcgcgg tgtcgctggt cgaggagtat ccgagcaagc tcgtcgtcac gcgcaccatg 1791361 agcaaggcat tcgctttcgc cggcggcagg ctcggatacc tgatcgctac gcccgcggtg 1791421 atcgacgcaa tgctgctggt gcggttgccg tatcacctgt cgtcggtcac tcaagccgcg 1791481 gcccgggccg cgctgcggca ctccgacgac accttgagca gtgtcgccgc actgatcgcc 1791541 gaacgcgaac gcgtaacaac ctcattgaac gacatgggtt ttcgagtcat cccaagcgat 1791601 gccaacttcg tgttgttcgg cgagtttgcc gatgcgccgg ccgcctggcg gcgctatctg 1791661 gaggccggca ttttgatccg cgacgttggg attcccggct atctgcgggc caccaccggg 1791721 ctggctgagg agaacgatgc gttcctgcgg gcaagcgccc ggatcgccac cgacctggtc 1791781 cccgtcaccc gcagtcctgt aggagcgcca tgacaaccac ccagacagcc aaagctagcc 1791841 ggcgggcgcg tatcgaacgg cgtacccgcg aatccgatat cgtcatcgag ctcgaccttg 1791901 acggtaccgg gcaggtggcc gtcgacaccg gtgttccgtt ctacgaccac atgttgaccg 1791961 cgctgggcag tcacgccagc ttcgacctca ccgtgcgcgc cacaggtgat gtcgaaatcg 1792021 aagcccatca caccatcgag gacacggcaa tcgcgctggg caccgcgctc gggcaggccc 1792081 taggtgacaa gaggggcatc cgccggtttg gcgatgcctt catcccgatg gacgaaacac 1792141 tggcccacgc cgccgtcgac ttatccggcc gcccctattg cgtgcatacc ggagagccgg 1792201 atcacctgca gcacaccact attgccggca gttcagtgcc ctaccacacc gtcatcaacc 1792261 ggcacgtgtt cgaatcgttg gcggccaacg cccgcatcgc gctgcacgtc cgcgtgttgt 1792321 acgggcgcga cccgcaccat atcaccgaag ctcaatacaa ggccgtcgcg cgcgcgttgc 1792381 gtcaagcggt cgagccagat cctcgggtgt caggcgtgcc gtccaccaaa ggtgctctgt 1792441 gacagcaaaa tcggttgtag tccttgacta cggctcagga aacctgcggt cggcccaacg 1792501 tgcgctgcaa cgagtaggcg ccgaggtcga agtaaccgcc gataccgacg ccgcaatgac 1792561 cgctgacgga ctggtggtgc cgggcgtcgg tgctttcgcg gcgtgcatgg cgggcctgcg 1792621 caagatcagc ggagagcgaa tcatcgccga gcgggtggcc gccggccgcc cggtgctggg 1792681 ggtctgtgtc ggtatgcaga ttctgtttgc ttgcggggtc gaattcggtg tgcagacgcc 1792741 aggctgcggg cactggccgg gggcggtcat tcgacttgag gccccggtga ttccgcacat 1792801 gggctggaat gtcgtggatt ccgctgcggg cagcgcgctg ttcaaagggt tggacgtcga 1792861 cgcccggttt tatttcgtgc attcctatgc cgcgcagcga tgggaaggct cacccgacgc 1792921 gctgctgacc tgggccacat atcgggcgcc gttcctcgct gcggtggagg acggcgcatt 1792981 ggccgccacc cagtttcatc cggagaagag tggcgatgcc ggtgcagccg tactgagcaa 1793041 ctgggttgat ggactttaaa ggatactggt gatgccgctg atacttttgc ccgccgtcga 1793101 cgtggtcgag ggtcgtgccg tgcgcctcgt tcaagggaag gccggcagcc aaaccgagta 1793161 cggctcagcg gtggatgccg cgttgggctg gcaacgcgat ggcgccgagt ggatccattt 1793221 ggtggacctg gatgctgcgt tcggccgcgg ttccaaccac gaactgcttg ccgaggttgt 1793281 cggcaagctc gacgtacagg ttgagctatc cggcggtatt cgagacgacg agtcgctggc 1793341 cgcggcgctg gccaccggat gcgctcgggt caatgtgggc actgctgccc tggaaaaccc 1793401 gcagtggtgt gcccgggtga ttggcgagca cggcgaccag gtcgccgtcg gcttggacgt 1793461 ccagatcatc gacggcgagc atcggttgcg cggacgcggc tgggaaaccg acggcggcga 1793521 cctgtgggac gtgctagaac gcctagacag tgaaggatgt tcgcggttcg tcgtgaccga 1793581 tatcaccaag gacggcaccc tgggcggccc caatctggac ctgctggccg gtgttgccga 1793641 ccgcaccgac gccccggtga tcgcgtccgg aggtgtgtcc agcctcgatg acctgcgcgc 1793701 cattgcgact ctcacgcacc gcggcgtcga gggggccatc gtcggcaagg ccctctacgc 1793761 ccgtcggttc accttgccgc aagcgttggc cgcggttcgg gactagatcg gcgatgcact 1793821 tggattcgtt ggttgccccg ctggttgaac aggcgtcggc gatcctggat gccgcaacgg 1793881 cgctctttct cgtcggtcat cgcgccgatt cagcggtccg caagaagggt aacgacttcg 1793941 ccaccgaagt cgatctagcg atcgagcggc aggttgtcgc agcgctggtg gcggccaccg 1794001 gcatcgaggt gcacggcgag gaattcggcg gcccggcagt cgactcgcgg tgggtgtggg 1794061 tactggaccc catcgacggc acaatcaacc acgccgccgg atcgccgttg gctgcgatcc 1794121 tgttgggcct gctgcacgac ggagttccgg tggccggctt gacctggatg ccattcaccg 1794181 accaacgcta taccgccgtg gcgggtggtc cgctgatcaa gaacggtgta ccgcagccgc 1794241 cgctggctga cgccgaactg gccaacgtgc tcgtcggcgt cggcacattc agcgccgact 1794301 cacggggcca gttcccgggg cgatatcgac tggcggtgct ggaaaagctc agccgagtgt 1794361 catcgcggct gcgcatgcac ggatccaccg gcatcgatct cgtcttcgtc gctgacggga 1794421 tactcggtgg tgcaataagt ttcggaggtc acgtttggga ccatgccgct ggggtggcgt 1794481 tggtacgagc cgccggtggc gtggtcaccg acctggctgg gcaaccgtgg acccctgcat 1794541 cgcgttctgc cttggccggg ccactgcgcg tgcatgccca gatcctcgag attcttggca 1794601 gcatagggga accagaggac tactgagatg tatgccgacc gtgaccttcc gggggctggg 1794661 ggcctcgcgg tacgcgtgat cccgtgtctg gatgtcgacg atgggcgggt ggtcaaggga 1794721 gtcaacttcg agaacctccg cgacgccggt gatcccgtgg aactcgccgc cgtctatgac 1794781 gcggagggcg cggacgagtt gacctttctc gacgtgaccg cgtcgtcgtc cggaagagcc 1794841 accatgctgg aggtggtgcg ccgcaccgcc gagcaggtgt tcatcccgct gacggtgggc 1794901 ggtggggtac gcaccgtcgc cgacgtcgat tcgctgctac gggctggggc tgacaaagtc 1794961 gccgtcaaca cggccgccat cgcttgcctg gacttgctgg cggacatggc gaggcagttc 1795021 ggctcgcagt gcatcgtgtt gtccgtcgac gcgcgcacag ttccggtggg atcagccccg 1795081 acaccgtcgg gttgggaggt caccactcac ggcggtcgtc gtggcaccgg tatggacgcc 1795141 gtgcagtggg cggcccgtgg cgccgacctc ggtgtggggg agatcctgct caactcgatg 1795201 gacgccgacg gcaccaaagc cggattcgac ctggctttgc tgcgtgcggt ccgtgccgcg 1795261 gtcacggtgc cggtaatcgc cagcgggggc gccggtgctg tggagcactt cgcgccagcg 1795321 gttgccgcgg gggccgatgc agtgttggcg gccagcgtct ttcacttccg ggagctgacg 1795381 atcggtcagg tgaaggcggc cctggccgcg gaaggaatca ccgtgcgatg acactcgacc 1795441 caaagatcgc ggcgcggttg aagcgtaatg ccgacggact ggttaccgcc gtcgtccagg 1795501 agcggggcag cggtgacgtg ctgatggttg cctggatgaa cgacgaggcc ttggcccgta 1795561 ccctgcaaac ccgtgaggcc acttactatt cgcgatcccg tgccgaacaa tgggtcaagg 1795621 gcgcgacgtc cggccacacc cagcacgttc actcggtgcg cctggattgt gacggcgacg 1795681 ccgtattgtt gacggttgac caggtcggcg gtgcctgcca taccggcgat cacagttgct 1795741 tcgatgccgc ggtgttgtta gaacccgacg actaacccgc cgcggaaaga ctggggctag 1795801 cggctcgcgg cgcaacagat tgcagtggtc gcccgcgagg caagagtgcc catcgacacg 1795861 ccgccgagcg agcgcggaca taccaccttg ggatccatgc agatgtcaag gggggttgcc 1795921 cgtccgggcg atggcgtcga tgagaatggc ggtcgatgct gaaacgagtg ccctggaccg 1795981 ttgtgctgcc ttcgctggcc tttgtcgcgc tggtattgac ctggggaaag cagatcggcc 1796041 cggtggtggg cttgctagcg gcggtgctgt tagccggtgc tgtcctggcc gcggtcaacc 1796101 atgccgaggt ggtggcggcc cgggtgggtg agccattcgg ttcgctggtg ctcgcggtcg 1796161 cggtgacgac catcgaggtg gcgctgatcg ttgcgctcat ggtgtccggc ggggacgatg 1796221 cggcgacgct cgcccgcgac accgtgttcg ccgcggtgat gatcaccacc aacgggatcg 1796281 ccgggttgtc cctgctgctg ggttcgctgc gctatggcgt gacgttgttc aacccccacg 1796341 gcagcggcgc cgcgctggcc acggtcacca cactggcgac gctgagcctg gtgctgccca 1796401 cgttcaccac cagtcagtcg ggccccgagc tatcgcccgg ccagctcatc ttcgccggcg 1796461 ccgcgtcgct gggactctac gtgttgttcc tgttcaccca gactgtccgg catcgagact 1796521 tcttcctacc ggtggcgcaa aagggcgcgg tcgaggatga cagccacgcc gatccaccga 1796581 gcacccgcgc ggcgctgctg agccttggat tgctgctcgt cgctttggtt gcggtggtgg 1796641 gtctggccaa ggtggaatcg ccggtcatcg aggaggtcgt ctcggcggcc gggtttccgc 1796701 aatccttcgt cggcgtggtc atcgccacac tggtgctgtt gccggagaca cttgcggcgg 1796761 cccgcgcggc ccggcaaggc cgcctgcaga ccagcctcaa tctggcgtac ggttccgcga 1796821 tggcgagtat tggactcacc atcccgacca tcgcccttgc ttccctgtgg ctcagtggcc 1796881 cgctgcaact tggcctcggt gccattcagt tggtgctgct ggtgctcacg gttgtggtca 1796941 gcgtgctgac cgtggttccc ggtcgggcca cccgtctgca gggcgaggtg catctggtgt 1797001 tgctggctgc ttacctgttt cttgccgtcg tcccgtgatg aatccgtgcg caagcgatgg 1797061 ttttcgccgc cgctatccag atctgattgc ccgcagcgtc gctaacgctt tgtcggcgtg 1797121 ggcgtccatg ctgaattcgc tggagatcac gtcgagcacc ttacggtcgg tgtcgatgac 1797181 aaaggtcgtg cgtttgaccg gcatcaactt gcccaacaga ccgcgcttga ccccgaattg 1797241 ggcggcgacc gtgccttggg cgtccgaaag cagcgggtag tcgaaacgcc gcacctcggc 1797301 gaatttggcc tgctttcgaa cgggatcggt gctgatgccg acccggctgg ccctgacctc 1797361 ggcgaattct ttggccaagt cgcggaagtg gcaggcttct ttggtgcagc caggcgtcat 1797421 cgccgccgga tagaagaaca ggaccacggg tccgtcggat agcaggacgc taagcctgcg 1797481 aggagtcccg gtctgatcgg gcagttcgaa gtcggctacc gtgtcaccgg ttttcatagt 1797541 cgtcaggcta caaccgattg cccgactcct tgcgcgccgc ttcgcggctg ggggtgcccc 1797601 catgcgcgcc gtttgcgcgg cgtgcatcgt cgtcgggcta cgcccgggcc gatcggcgta 1797661 tctgggaaga tggttcggtg cacgccgacc tcgcagccac cacctcgcgt gaggatttcc 1797721 gcctcctggc ggccgagcac cgggtggttc cggtgactcg caaggtcttg gccgacagcg 1797781 agacgccgct gtcggcctac cgcaagctcg ccgccaatcg cccgggtacg ttcctgctgg 1797841 agtcggccga gaacggccgg tcgtggtcgc gatggtcgtt tatcggtgcg ggggcgccaa 1797901 cggcgttgac cgtgcgtgag gggcaagcgg tatggctggg tgccgtgccc aaggacgctc 1797961 ccactggcgg agacccgctg cgggcgctgc aggtgacctt ggagctgctg gctacggcgg 1798021 atcgtcagtc cgagccgggt cttccgccgc tgtcgggtgg catggtcggt ttcttcgcct 1798081 atgacatggt gcgacggctg gaacgattgc cggaacgggc cgtcgatgac ctctgcctgc 1798141 cggacatgct gctgttgctg gccaccgatg tggcggcggt cgatcaccac gagggcacca 1798201 tcacgttgat cgccaacgcc gtgaactgga acggcaccga cgagcgggtc gactgggcct 1798261 acgacgacgc ggtcgctcgg ctggacgtga tgaccgcagc gctcggccaa ccactaccgt 1798321 caaccgtggc caccttcagc cgacccgagc cgcgccaccg tgcgcaacgc accgtcgaag 1798381 aatatggtgc gatcgtcgaa tacttggtgg atcagattgc agccggtgaa gcgttccagg 1798441 tggtgccctc gcagcgcttc gagatggaca ccgatgtcga tcccatcgac gtgtaccgaa 1798501 ttctgcgggt aaccaaccca agtccctaca tgtatctact gcaggtgccg aatagtgatg 1798561 gtgcagtgga cttttcgatt gttggatcca gtccggaggc gctggtaacg gtccacgaag 1798621 gctgggcgac gacgcatccg atcgccggaa cccggtggcg cggaaggaca gacgacgagg 1798681 acgtgcttct ggaaaaagag ctgctggcgg acgacaaaga acgtgccgag catctgatgc 1798741 tggtcgacct cggccgaaac gacctgggtc gggtctgcac gccgggcact gttcgggtcg 1798801 aggattacag ccacatcgag cggtacagcc acgtgatgca cctggtgtcc acggtgaccg 1798861 ggaagctcgg cgaagggcgc accgcgctgg acgcggtgac cgcctgcttt ccggccggca 1798921 cgctgtcggg cgcgccgaag gtgcgggcga tggagctgat cgaagaggtg gagaagacac 1798981 gccgcggcct ttacggcggt gtcgtcggtt accttgactt cgccggcaac gctgacttcg 1799041 ccatcgccat ccgcaccgcg ctgatgcgta acggcacggc ttatgtccag gcaggcggtg 1799101 gtgtggtggc cgactccaac ggatcctacg aatacaacga ggcgaggaac aaggctcggg 1799161 ctgtgctcaa cgcgatcgct gccgccgaga cgctggccgc tccgggcgcg aaccgcagtg 1799221 gctgctaatg ccggcagtgt tcggcccaac cgccgggcca ggccgatgat cggcatcgcc 1799281 cagttgctgt tggtggttgc cgccggggcg ctgtggatgg ccgcacggct gccctgggtg 1799341 gtcatcgggt cattcgacga gctggggccg ccgaaggagg tgacgctgac cggtgcgtcg 1799401 tggtcgaccg ctttgctgcc gttagcgctg ctgatgctgg ccgcggcggt ggcggcgctc 1799461 gcggtgcgcg gctggccgct gcgggcgctg gcagtgttgc tggccgcggc cagcttcgcg 1799521 gtcggctacc tcggcatcag tctgtgggtg gtcccggatg tcgcggcccg cggagccgat 1799581 cttgcccatg tcccagtggt gacgctggtc ggaagcgccc ggcactattg gggcgcggtg 1799641 gcggcggtgt tggcggcagt gtgtgctttg ctcgctgccg tcttcttgat gagttcggcg 1799701 gcgattcgcg ggtcggctgg cgaggacatg gcgagatatg cggcgccccg cgcccgccgg 1799761 tcgattgccc ggcgccagca ctcgaatgcg gccggccggg cggctccgca agacgacggg 1799821 ccggatatgg ggccgcggat gtcggagcga atgatttggg aagctcttga cgagggccgt 1799881 gacccgaccg atcgggagca ggagtctgac accgaggggc ggtgacggac cgcgcgctga 1799941 cggtcgctac ccttcatgga cgtcgtcgaa attgacgagc gcgtgtgggt gacagtggga 1800001 agggaacggc aggcatgagt ccggcaaccg tgctcgactc catcctcgag ggagtccggg 1800061 ccgacgttgc cgcgcgtgaa gcctcggtga gcctgtcgga gatcaaggct gccgccgctg 1800121 cggcgccgcc gccgctcgac gtgatggccg ccctacgcga gcccggcatc ggcgtcatcg 1800181 ctgaggtcaa gcgcgctagt ccttcggcag gcgcattggc gaccatcgcc gacccggcaa 1800241 agctggccca ggcctaccag gatggcggtg cccggatcgt cagcgtggtg actgagcagc 1800301 ggcgttttca gggatcgctc gacgacctcg acgcggtgcg ggcctcggtt tcgattccgg 1800361 tgctgcgcaa ggactttgtg gtgcagccgt accagattca tgaggcgcgt gcgcacggcg 1800421 ccgacatgtt gttgctcatc gtcgccgcat tggagcagtc ggtgttggtg tcgatgttgg 1800481 accgcaccga atcgttgggt atgacagcac tcgtcgaggt ccataccgag caggaagccg 1800541 accgagcgct gaaggccggg gccaaggtga ttggcgttaa cgcccgcgac ctcatgacgc 1800601 tggacgtgga ccgggattgc ttcgcgcgaa tagctcctgg tttgccgagc agtgtgatca 1800661 ggattgctga atccggcgtg cgtggcaccg ctgacctgct ggcgtacgcc ggcgcgggcg 1800721 ctgacgcggt gttggtaggc gaaggtctgg tcaccagcgg cgacccacgt gccgcggttg 1800781 ccgatctggt taccgcgggc acccatccgt cctgtccgaa accggctcgc tagccgtcga 1800841 tgagccgctt gcatcttgag cctcggtgat gacagatcta tccaccccgg atcttccgcg 1800901 catgagtgct gccatcgccg aaccgaccag tcacgatcct gattccggcg gccatttcgg 1800961 cggccccagt ggttggggtg gccgctacgt tcccgaggcg ctgatggcgg tgatcgaaga 1801021 ggtcaccgcc gcctaccaaa aggagcgcgt cagccaggac tttctggacg acctagacag 1801081 gctgcaggcg aactatgcgg gccggccttc gccgctttac gaggcgaccc ggttgagcca 1801141 gcacgctggg tcggcgcgaa tctttctgaa gcgagaagac ctgaaccata ctggttctca 1801201 caagatcaac aacgtgctcg ggcaggcact gctggcgcgc aggatgggca agacccgggt 1801261 gatcgccgag accggtgccg gccagcacgg ggtcgccacg gccaccgcat gcgcattgct 1801321 cggcctggac tgtgtcatct acatgggggg catcgacacc gcccgtcagg cgctaaacgt 1801381 ggcccggatg cgattgctgg gtgccgaagt cgtcgcggtt cagacgggct cgaaaacgct 1801441 caaagacgcc atcaatgagg cgttccggga ttgggttgcc aacgccgaca acacctacta 1801501 ctgctttggt actgcggccg gaccgcatcc gtttccaacc atggtgcgcg atttccagcg 1801561 aatcatcggc atggaggcac gtgtgcagat ccagggtcag gccggtcggc tgcctgacgc 1801621 cgtcgtcgcg tgcgttggtg gcgggtccaa tgccattggt atttttcatg cgtttctcga 1801681 tgacccaggc gtacggctgg tcggattcga ggcagccggc gacggcgttg agaccggccg 1801741 gcatgccgcg acattcaccg ctggttcgcc cggggcattt cacggatcgt tctcgtactt 1801801 gctgcaagac gaggacggtc agaccattga atcccattca atttccgcgg gtctggatta 1801861 tccgggggtg ggcccggaac atgcgtggct caaggaggcc gggcgtgtcg attatcggcc 1801921 gatcaccgac tccgaggcga tggacgcgtt tggcctgctg tgtcgcatgg aaggcatcat 1801981 cccggctatt gaatccgcgc acgcggtggc cggcgccctc aagctaggtg ttgagttggg 1802041 aaggggcgcg gtgattgtgg tgaacctgtc gggacgtggc gacaaagatg tcgagacggc 1802101 cgcgaaatgg tttggcttgc tgggcaacga ctgatggtgg cggtggaaca gagcgaagca 1802161 agtaggctcg ggccggtttt cgattcctgc cgtgcaaaca accgcgcggc attgattggt 1802221 tacttgccga ccgggtaccc ggacgtgcca gcgtcggtgg ccgcgatgac agcgctagtt 1802281 gaatccggtt gcgacattat cgaagtcggg gttccgtatt cggacccggg catggacggc 1802341 cccaccatcg ccagggcaac cgaggcggcg ctccgtggcg gggtgcgagt ccgggatacg 1802401 ttagccgcgg tcgaggccat cagtatcgcc ggcgggcgtg cggtagtgat gacctactgg 1802461 aatccggtgc tgcgctatgg ggttgatgca ttcgcgcggg atctggcggc ggccggagga 1802521 ctcggcctga tcactcctga cctcattccc gacgaggcgc aacagtggct ggcggcatcc 1802581 gaagagcatc ggttggatcg cattttcttg gtcgcgccgt cctcgacacc ggagcggttg 1802641 gcggccaccg tcgaggcttc acgcgggttc gtctacgcgg cgtcgacgat gggggtgacc 1802701 ggggcgcggg atgcggtgtc gcaggcggca cccgaactgg tgggccgggt gaaggcggtg 1802761 tctgacatac cggtgggcgt cggtctgggt gtgcggtcgc gcgctcaagc cgcgcagatc 1802821 gcccaatacg ccgacggtgt catcgttggt tccgcattgg tgacggcgct aaccgagggg 1802881 ttgcctagat tgcgggcact gaccggagag ctcgctgccg gggtacgact agggatgtcc 1802941 gcatgatgcg gatgttgccc agctatatcc ccagcccacc gcgcggggtt tggtacctgg 1803001 gcccgctacc cgtccgcgcc tacgcagttt gcgtcatcac cggcatcatt gtcgcactgc 1803061 tgatcgggga tcgccggttg acagcccgcg gcggcgagcg cggcatgacc tacgacatcg 1803121 ccttgtgggc cgtgcctttc ggcctgattg gcggcaggct ctatcacctg gctaccgact 1803181 ggcggacata tttcggtgac ggtggtgccg ggctggccgc ggcactgcga atctgggatg 1803241 ggggcctggg catctggggt gcggtaaccc ttggtgtcat gggcgcgtgg attggctgcc 1803301 ggcgttgtgg aatcccgctg cccgtcttgc ttgatgcggt ggcgcctggt gtcgtgttgg 1803361 cgcaggctat cggtcggctc ggaaactact tcaatcaaga gctctacggc cgggaaacca 1803421 ctatgccgtg gggtttggag atcttctacc gccgggaccc ctccggattc gacgtcccga 1803481 attcgctgga cggcgtctcg acgggtcagg tggcgttcgt cgtgcagcca acgttcctct 1803541 acgaattgat ctggaatgtt ttggtattcg tcgcattgat ctacattgac cgccggttca 1803601 tcatcggcca cgggcgactg tttgggttct atgtcgcttt ctactgcgcc gggcgattct 1803661 gtgttgagct gctgcgtgac gatcccgcca cgcttattgc cggcatccgg atcaattcgt 1803721 tcacgtccac cttcgtgttt atcggggccg tggtgtacat catcttggcg ccgaaggggc 1803781 gcgaggctcc tggggccctg cgtggcagcg agtatgttgt tgatgaggcg ctggaacgtg 1803841 aaccggctga actcgccgcc gctgctgtgg cctccgctgc gagcgctgtg gggccggttg 1803901 gcccggggga accgaaccaa cccgacgatg tggcggaagc ggtgaaagcc gaagtcgccg 1803961 aggtcaccga tgaagtggcc gcggaatccg ttgtccaagt agcagaccgg gatggtgagt 1804021 caacccccgc tgtcgaggag acctccgaag ccgatatcga gcgggaacaa ccgggcgacc 1804081 tcgcgggcca ggcgccagcc gcgcaccagg tcgacgccga agctgcatcg gccgcgcccg 1804141 aggagccggc agcgttggct tcggaggcac acgacgaaac cgagcccgag gtgcccgaga 1804201 aggcggcgcc catccccgat ccggccaagc cggatgaatt ggcggtcgcc ggacctgggg 1804261 acgaccctgc tgagccggac ggcattcgac ggcaagacga tttcagctcg agacgccgcc 1804321 gttggtggcg gcttcgacgg cgtcgacaat gacgacccac gacggcactg cctggtcgcc 1804381 ggtgctggac tcaatagacc gccgatcggg cggccgttgc cgcagccgga acgatgcgcc 1804441 gacgaagttc ccggtcacaa aatggccacc ggctggaacg gtaatcagcc gaaccccgac 1804501 gcgcttacga gccagaccac taagcccagt aggctagcaa gcccggcagg ttccatattt 1804561 tttcgcaacc cggacgcgca cgcgacgccg gggcgctgcc tccgatgccc gaccgccaca 1804621 tgaatatctg tccgtaccgc tctttcgtca cgtccgcaac actggccttc gccgtcggcg 1804681 atggtcgctg tgcccagcta agcgcgacaa ctcggtttct gcaggtcaac gcccgcctcc 1804741 aatcccgcac agccgcgacc aactcgggaa caaaaccgcc ggtcaggcag ctgtcgctga 1804801 gagccgggca catcgggtgt cgcccggtgc agtgacacat gtgagagttg tggccgtgcg 1804861 atgtgcccga ccctcggtgc gcaccaattt gagccaactc aggaaatgaa tctctgagcg 1804921 gaggtgcacc ggttgcccgc ctcacaacga catgctgagg cgcacacggt cgctcgcagc 1804981 cgggcacaac gaacactcct gctctgccgc gccgatgttg ggaacgcatg ggcctacggc 1805041 cggcacgggt cgtgcgcccg gctcgatctg gcatgctgaa aggcgtgacc gatcccctgc 1805101 agcacggtgc cttcgagccg ggctggcaat ccgcaccacc cggatatcca ccgccttatc 1805161 cgcaatatcc ggggcctggc tcttactttg acccgttcgc gccatatggt cgccatccgg 1805221 tcaccggcca accattttcc gacaaatcga agactgttgc cggcctgttg cagttgcttg 1805281 gactgttcgg catcgccggg atcgggcgaa tctacctggg ccataccggc ctgggcatcg 1805341 cgcagctgct ggtgggctgg gtgacgtgcg gtttgggcgc cgtcatctgg ggcgtcattg 1805401 acgccctgct gatattgacc gacaaagtcg gcgacccttg gggtcgtccc ttgcgcgatg 1805461 gaagctagcg ggcgtcaacg tcgctacgcc gcggccggtt cggtcgtgct attggccggc 1805521 gcgcttggct acatcggact tgtcgacccg cacaactcga attcgctata tccaccgtgc 1805581 ctattcaagt tgcttacggg ctggaactgc cccgcgtgcg ggggtctgcg gatgatccac 1805641 gatctgctac acggtgagct ggcggccagc atcaacgaca atgtctttct gcttgtcggc 1805701 gtcccagtgc tggccagttg ggtcctgctg cgccgccgcc acggcgactt ggcgctcccg 1805761 ataccggtga tgattgctgt ggcggtcgcg gtgatcgcgt ggacggtgct gcgcaacctg 1805821 ccaggcttcc cgttagtgcc gacgatcagc ggatagccgc gcctacccgc ggtctggttg 1805881 gctgggctgc ccgcggtggt gttgaccggt gtgccgaccc ggcggtgccg gccctaccgc 1805941 cgtcgcgact atgctgagtc gtcgtgacga gacgcgggaa aatcgtctgc actctcgggc 1806001 cggccaccca gcgggacgac ctggtcagag cgctggtcga ggccggaatg gacgtcgccc 1806061 gaatgaactt cagccacggc gactacgacg atcacaaggt cgcctatgag cgggtccggg 1806121 tagcctccga cgccaccggg cgcgcggtcg gcgtgctcgc cgacctgcag ggcccgaaga 1806181 tcaggttggg acgcttcgcc tccggggcca cccactgggc cgaaggcgaa accgtccgga 1806241 tcaccgtggg cgcctgcgag ggcagccacg atcgggtgtc caccacctac aagcggctag 1806301 cccaggacgc ggtggccggt gaccgggtgc tggtcgacga cggcaaagtc gcattggtgg 1806361 tcgacgccgt cgagggcgac gacgtggtct gcaccgtcgt cgaaggcggc ccggtcagcg 1806421 acaacaaggg catctcgttg cccggaatga acgtgaccgc gccggccctg tcggagaagg 1806481 acatcgagga tctcacgttc gcgctgaacc tcggcgtcga catggtggcg ctttccttcg 1806541 tccgctcccc ggccgatgtc gaactggtcc acgaggtgat ggatcggatc gggcgacggg 1806601 tgccggtgat cgccaagctg gataagccgg aagccatcga caatctcgaa gcgatcgtgc 1806661 tggcgttcga cgccgtcatg gtcgctcggg gcgacctagg tgttgagctg ccgctcgaag 1806721 aggtcccgct ggtacagaag cgagccatcc agatggcccg ggagaacgcc aagccggtca 1806781 ttgtggcgac ccagatgctc gactcgatga tcgagaactc gcggccgacc cgagctgagg 1806841 cctccgacgt cgccaacgcg gtgctcgatg gcgccgacgc gctgatgctg tccggggaaa 1806901 cctcggtagg gaagtacccc cttgctgcgg tccggacaat gtcgcgcatc atctgcgcgg 1806961 tcgaggagaa ctccacggcc gcaccgccgt tgacacacat tccccggacc aagcgtgggg 1807021 tcatctcgta tgcggcccgt gacatcggcg aacgactcga cgccaaggcc ttggtggcct 1807081 tcactcagtc cggtgatacc gtgcggcgac tggcccgcct gcataccccg ctgccgctgc 1807141 tggccttcac cgcgtggccc gaggtgcgca gccaactggc gatgacctgg ggcaccgaga 1807201 cgttcatcgt gccgaagatg cagtccaccg atggcatgat ccgccaggtc gacaaatcgc 1807261 tgctcgaact cgcccgctac aagcgtggtg acttggtggt catcgtcgcg ggtgcgccgc 1807321 caggcacagt gggttcgacc aacctgatcc acgtgcaccg gatcggggaa gatgacgtct 1807381 agccgggtcg tgccggacgg taaacccatg tccgacttcg atgaactact ggcggtattg 1807441 gacctcaacg ccgtcgcaag cgacctgttc accggatccc accccagcaa aaacccgctc 1807501 cggacatttg gtggccagct catggcgcag tcattcgtcg cgagcagccg aacgctaacc 1807561 cgccaccacc taccgcccag cgcattctcg gtgcacttca tcaacggcgg tgacacggcc 1807621 aaggacatcg agttccaggt gatacgactg cgcgatgagc ggcgcttcgc caaccggcgc 1807681 gtcgatgcgg tacaggacgg cacgttgctg tcctcggcga tggtgtctta catggccggt 1807741 ggtcgcgggc tcgagcatgc gctggatccg ccgcaggtgg ccgagcctca tacccggccg 1807801 ccgatcggtg agctgttgcg cggttacgag gagaccgtcc cgcattttgt caacgcgctg 1807861 caaccgatcg aatggcgcta cgccaacgac ccggcctgga taatgcggga caagggcgat 1807921 cggcttgcct acaaccgggt ctgggtcaag gcactagggg agatgcccga cgacccggtg 1807981 ctgcacacgg cgacactgtt gtactcctcg gacaccaccg tgctggactc ggtcattacc 1808041 acccatggtc tgtcctgggg cttcgatcgc atctttgcgg cctctgccaa ccactcggtg 1808101 tggtttcacc ggcaggtcaa cttcgatgat tgggtgctct actcgacgtc gtcaccggtg 1808161 gccgccgatt cacgtgggtt gggttcgggg cacttttttg atcgctcggg gaagctcatc 1808221 gcaactgtgg tgcaggaagg tgtgttgaag tattttcccg ccacccctga cagtgcggca 1808281 ggacgctcgt aggattccgg gtcagcacgg ctgtgatcag gcgtaacgtt cctggtagcc 1808341 agatgaccga tggtggcagc ggccggcgag ccgctgaatt gccagcgagc gaacccggag 1808401 gtgactgtga agctgccgtc ggccgatgtg gtaccgaggc tccgtggtcg ccagcgtgta 1808461 gtcgtgcacg tcgattcccg cacggcccgc tgtgtcggcg cgctggcgct ggtgtgcgcg 1808521 gcctgctggc tgatcgcgct gctcgccggc gactaccggc acgcccagtg ggcggtcgcc 1808581 ggccggttgg gctggtcgct gacggtcctg gctgcggtgg cattcattgc tcgcggcatc 1808641 ttcctgggcc gcccggtcac ggccatgcat gcgaccgcgg ccggcctatt tttgctcgcc 1808701 ggactggctg cccacgtgtt ggtcgcagat ctgctcggtg agattctgat agccggttcg 1808761 ggatgggcac tgatgtggcc gacgtcggcg catccgcgac ccgaagatct gccccgcgtg 1808821 tgggcgttga tcaatgccac ccgcgcggac tcgcttgctc cgtttgccat gcaggcgggc 1808881 aagagccatc acttcagcgc ggccggcacc gcggctctgg cgtatcggac ccgtatcggc 1808941 tatgcggtgg tcagcggcga cccgatcggc gacgaggcgc aattccccca gctggtcgcc 1809001 gacttcgcgg ccatgtgtca catgcacggc tggcgaatcg tggtcgtggg ctgcagcgaa 1809061 cgacggctcg gcctgtggag cgaccccatg gtggtcggac aatcgttgcg gcccataccg 1809121 attggccggg atgtcgtcat cgacgtgtct aactttgaga tgaccgggcg taggtttcgc 1809181 aacctgcgtc aggcggtgaa acgcacccac aatttcggcg tcacgaccga gatcgtcgct 1809241 gaacagcaac tcgacgacca gcggcaggcg gagctggccg aggtgctggc ggcgtcacct 1809301 agcggcgccc gcaccgatcg cggcttttgc atgaacctgg acggcgtgct ggagggtcga 1809361 taccccggaa tacaactgat catcgcgcga gacgcatcgg gtcgggtgca gggtttccac 1809421 cggtacgcga ccgccggcgg cggcagcgac atgtctctgg atgtaccgtg gcggcgccgc 1809481 ggggccccga acgggatcga tgagcggctc agcgctgaca tgattgcggc cgccaaagat 1809541 gctggggtac aacggttgtc actggcattc gccgcgttcc ccgacctttt cggcgccaac 1809601 cagctcggcc gcctgcagcg tgtctgccgt gcgttgatcc atatcctcga tccgttgatc 1809661 gctctcgagt cgttataccg atacctgcgc aagttccacg cgctggatga gcggcgttac 1809721 gtgctgatat cgatgactca ggtctttgcg ctggcgttgg tgttgttgtc gctggagttc 1809781 gtcccgcggc ggcgacatct ctgatccgtc gctatggaca gctcggcgca ttgaatgtcg 1809841 ttgggcaggt ggtgggtggc taccaccacg gtccgcatag cgctcatgat cccggagttc 1809901 ggggccagca gatcgcgcag aaggtcggcg ttggcggcgt cgaggtgttc gacaggttcg 1809961 tcgagcaaca cgatccgagc cggggaaagc accgcccggg cgagcagcaa ccttctgcgc 1810021 tgacccgccg agaccgcttg cgcgccaccg atcaacaccg tcgacaaccc ctcgggcagg 1810081 ccggcgagcc agccgcacag gccgacccga tccagggcct cgatcagttc gtcatcgggg 1810141 cagtctcctc gggcggtcag caagttgtcc cgaacggtgg tagcaaagat atgcgcatct 1810201 tcagcgaaaa agctgacagc gctgcgtaat tcatcctcat cgaagtcgct caggttagtt 1810261 ccgtccagca acacccggcc gtgcaccggc ggcagcaagc cggccagcgt catcaacagc 1810321 gtcgtcttgc cggcgccgct cgcgccggtg acggccagcc gggcacccgg cggtaggtca 1810381 atcgtcaccc ggatcgactg cgcctcttgg tgaccgcaac acacgtcggc cgctagcacc 1810441 ccggtaccta ccggcagtcg cgccgacacc gtggattcgg tctcgcggac ccggtttgac 1810501 ccagtcaggt cgagcagacg agccgccgcg atgcgcgacc gtgtcaactg gacggcggcg 1810561 gcgggtagtg caacggtcgc ctcgaatgcg gacagcggca acaacatcag gatggccagt 1810621 gttgtgggcg cgaccgtggg ggccatgccg atcccggcca ccacggcgcc cagcaggctg 1810681 gccccgatcg ccgcggtcgg catggcctcg gcgatcgccc ccgttcgtgc ggcggcgtcg 1810741 agcgcatcgg cccaggcatg ttggcgccgt tgtgagtcgg cgatgacgtt gcgtagggca 1810801 ccggcgacac gaagctcggg ggcatgctca agggcgatca tcgccgacgt gtcgcgcatg 1810861 ccccgatgtt ggcgggcgat cgcttcctgc gctgcggcgg ttctgccggc aagccagggc 1810921 gcaacaacgc cggcaaccaa aaggcagacc gccagtacca cggcggctgg caccgaaacg 1810981 gccgcgacga ccgcggtcgc ggctactgcc agcaccgctg cgacggctat cggcaccaga 1811041 gcacgcacca gcatgttggc cagttcgtcg acgtccgcgc cgacgcgtgc tgccaggtcc 1811101 ccgctgtgca gcccgacggc ggccgccgcc ggtccgtggg ccagccggtg atagataagg 1811161 gtgcgggccc ggccggcggc ccgcaacgcg gtgtcgtggg tggccagtcg ctcgcagtag 1811221 tgcagcacgc cgcgcgaaat cgcgaacgcc cgcaccgcca cgaccgccac cgacaggtcc 1811281 aggacgggcg gcatctgcca ggcccgagtg atcagccagg ccgacacccc ggccagggcc 1811341 agcgcgctgc ccagcgacag cacgcccagc gcgacggccg ccaagatccg gggcaaccgg 1811401 ggacccaaca gcccagacgc ggccagcagg tcccgctggc ggcgactcac agcactcggt 1811461 cggttcatcg tcggaaacca tccgagttca cttcgacgac ccggtcaccg gccgcggcga 1811521 cctgctggcg atgggcgacg accagcaccg tcgcacccgc gcgggcacgc tcgacaatgg 1811581 cgcccaacac gtgttgttcg gtgcgggcgt ccaggtgcgc ggtgggctcg tcgagcagca 1811641 gcaccgcagc cggtgatccg agcgcgcggg ccaggcccag ccgttgccgc tgccccaggg 1811701 ataacccgac accaccgcgc cccagcacgg tatccagccc gcggggcaac tcgtctagta 1811761 cagcgtcgaa tccggctgct gcgcaggcac gctcgagatc atccacaggg cccagcagaa 1811821 ccaggttgtg gcggacggtt cctgggacca gcaccggccg ctgcggcagc cacgacagtt 1811881 gccgccacca ggcagccggt gccaggttgg tgacgtcgac tccggcgacc gtgattcgtc 1811941 ctgacgacgg tgcggtgagc ccggcgatcg cttgcagcgt agtgctcttg ccggcgccgt 1812001 ttcggccggt cagcaccgtc acccgaccgg gttcgatgtc tgcggtgaga tcatacggtg 1812061 cgcggccgtc gcggcctctg acactgagtc tctccaggcg aatcaccccg ccgcgcgcgg 1812121 tgaccgttcg tcggccgggt gttggtgagg gtgactcgcc gaggagggcg aatgccttgt 1812181 cggccgcggt tctgccgtca gctgcggcat gaaactggac cccaacgcga cgcagcggcc 1812241 agtacacctc cggcgccaat agcagcaccg tcaaaccggc cgtcaggctc atctccccga 1812301 agaccagccg tagcccgatg cccaccgcga ccagggccac gcccagcgtg gccagcaatt 1812361 cgagcaccag ggccgacaag aacgcgatcc gcagcgtcgc catcgccgac cgccggtggt 1812421 cagcagacag ttccgcgatg cgttgttccg ggccggaagc acggcccagc gcccgcaggg 1812481 tggggatgcc ggcaatcagg tctaacaacc gggcctggac ggcggtcatg gccgccagcg 1812541 cggccgccga ggggttagtg gtagccagcc cgatcagcac catgaagatc ggtatcaggg 1812601 gcagtgtgat caccacaatg gccattgact tcaagtcata gagcccgatc acggcgacgg 1812661 tggccggggt caggatcgcg gccagcagca acgtgggcaa atagccggtg aagtagggcc 1812721 gcaagccgtc caggccccgg gtaatcagca ccgcggcggc gtctcgctgc gcagccagtt 1812781 ggctgggtcg gcgggcggtt accgcggtca gcacctgacc ggacaggtcg gcgatcactg 1812841 cgctggcgcc gcgctgggcc aggcgcgctt gtagccactg aatcgacgca cgcaaccccc 1812901 acagcaccaa caggattgac agtggcccta gccaacgacg caggccagcc atcccagggt 1812961 tggcggggtc gatgacgccg gcgacgatgc ttgccaacac gatcgccgag ccgatggcgc 1813021 agccggagat cccgaccccg caggccaccg tgctgagtag atagcggcgc agcgccgccg 1813081 atgcctgcca cagccgcgga tccaggggcg cccgggttcc ccgggccttg gtgctcaggg 1813141 cgcgcgcctc gccagacccg tgggtggagg tatccgttca gctgagatcc gttgccggaa 1813201 aacccaatac gtccatgtct ggtacgccac cgtcagtgga gcgaagaacg cggtcaccca 1813261 cgtcatgatc ttgagggtgt acggggtcga cgacgcgtta tggatcgtta ggctccactg 1813321 cgggttcagg gttgagggca ccaggttcgg gtacagcgcg ccgaacagca gcaccaccac 1813381 agccgccacg actatcaacg tgcacatgaa cgcccagccg tcggacaccc gccgccacac 1813441 taagaccgtc gccgccgcct gcgcgcaccc cgcaactgcc agcaccagcc acgtccagtc 1813501 tttgccgtat gccagttgcg tccaaagtcc aaagcccgca accagtcccg ccacaggaag 1813561 cgaaagccat acggcgaatc ggtaggcatc gtcgcggatc ggcccggagg ttttcaaagc 1813621 gatgaacacc gcgccgtaga gcgagaacag tccggcggtc gccagaccgc ccagcagggt 1813681 gtaggcgttg agcacgtcgg gaatcgacag ggcaacatga ccgttcgcgt ctaccgggag 1813741 tccgcggacc agaatggcga acgccacacc ccacaacagg gcaggcagcc aggatcccgc 1813801 cgcgatcccg aagtctgccc cggtccgcca tttcgggtcg tcgatcttgc cgcgccattc 1813861 gatggcgacg gcgcgcagga tcataccgaa caggatcgcc agcagcggca gatacagcgc 1813921 ggagaacacg gtcgcgtacc agccgggaaa cgcggcgaat atggccgcgc cggcggtgat 1813981 cagccagact tcgttgccgt cccagaccgg tccgatggtg ttgagtgccg tgcgccggtg 1814041 ggtctccgga tcgcccatac cgacatgagc gaacggcgcc atcagcatgc ccacgccgaa 1814101 gtcgaaccct tctaggatga agaaaccgag gaacagcgct gcgatgacac cgaaccacaa 1814161 ttcttggagt accaccggct gctcctttcc ggggtcagtt ggcctcagta agcaaacgac 1814221 aatggtgcta cctcgtcgtc gcggggtgcc ccgtgcgcag ccggttccgc gtcgtgttcc 1814281 agggggcctt cgacgatgta acgcttgagc agccagcacc agatgaccgc aagtaccgcg 1814341 tagaccaagg tgaacatcag caaagacgtg gcgaccacgg tggcggagtg atccgagacg 1814401 cctgctttga cggtgagtcg aaccagctga tcaccggtcg ggttagggac gacgacccag 1814461 ggctggcgcc ccatctcggt gaacacccat ccggcgctgt tggccaggaa cggggcgggc 1814521 atggttagca gcgccagcca ggagaaccag cgttgattgg ggatctggcc gccacgggtg 1814581 agccagagcg caatcagtgc gaacagcacc gggatcgcca tcaacccgat catcatgcga 1814641 aatgaccagt aggtgacgaa gaggttgggc cggtagtcgt ttggtccgaa gcgctgctgg 1814701 tattcctgct gcagatcgcg gataccctgc aacgtcacac cgctgatccg gccctcggcg 1814761 aggaacggca acacataggg cacttcgatg acacgggtga ggctgtcgca gttgttttgc 1814821 cggccgaccg tcaggacaga gaagtttgga tctgtctggg tatcgcacaa cgattcggcc 1814881 gacgccatct tcatcggctg ctgctggaac atcagcttgc cttggtggtc gccggtgaac 1814941 aacaacccgg ccgtggcggc caacgcaacc caacacccca ggatggtcgc gggacgatac 1815001 atggcttggg tatctgagtc ggcgtgcgtg gtgctcgaac ggaccagcca ccaggcgctc 1815061 accgcggcga cgaaggtccc ggcggtcagc agcgcaccgc tgacagtgtg ggtaaacgcc 1815121 gcctgtgcgg tgttgttggt cagcagcacg acgatgctgc tcaactcggc acgcccggtg 1815181 gtcgggttgt agtgcgcgcc gaccggatgc tgcatgaagg agtttgccgc gatgatgaag 1815241 aacgcggaca cgttgaccgc gattgcgacg atccagatgc aggccagatg caccagccgg 1815301 ggcagcctgt tccagccgaa gatccacaac ccgatgaagg tggattcgaa gaagaaggcc 1815361 gccaggccct ccatggccag cggggcgccg aagacatcgc cgacgaatcg ggagtactcg 1815421 ctccagttca tgccgaactg aaattcctgc acgattccgg tcgccacgcc gatggcaaag 1815481 ttgatcagga acaatttgcc gaagaatttg gtgaggcgat accaggcggg gttatcggtg 1815541 acgacccaca gcgtttgcat gaccgcgatc agcggggcca ggccgatggt cagcggtacg 1815601 aaaatgaagt gatagacggt ggtgataccg aactgccacc gcgaaatgtc gacgacattc 1815661 atctgtcatc tccggagata ctacggggcc gactgatttg gctacgacga agtgtagtag 1815721 gcacgagtgg gcccgcgcta ctggcaatcg tgggtgcacc gcgattctgc ggtcagccga 1815781 gcgtctgcga agccttgcgg atggcgaacg acgacgcgat ctcgcatgcg ccgatcacca 1815841 cgagccagat gccgacgacc aacgccagta tccagatgga ctcgaacggc gatgccatca 1815901 ccacaatgcc ggcgatgagg ctgatcacgc cgacgaagat ggaccatccc cgtcccggca 1815961 gcatcggatc actaatcgcc gaaaccgtgg tggcgacgcc gcggaagatg aacccgatgc 1816021 cgatccagat ggccagcaac agaaccgcgt caccgaaatg gcgaaaggcc agcacagcca 1816081 ggatgagtga ggcggcaccg ctgatgaaca acaggatccg gccgcccgcc gaaacatgca 1816141 ggctgaacgc gaacgcaacc tgagcgacac cggtaatcag gaggtagaca ccgaacgcca 1816201 tggcagcaac gagaatggat attcctggcc aggccagcac caggacgccc aggatcagcg 1816261 acagaattcc cgatgccaga gtggacttcc agagatgcgg caacaacctt gggagagggc 1816321 tcacgacagg gcttggttcc atgggcgcag tgtgacacat gtagcggccc cgggatagcg 1816381 cttggcggtc agacccctgc cgtgcggggt tcggccccgc gcacctcgcc gggatctgcg 1816441 gctaccttgc ggccgatgag gtaccacgtg cgcattacgc ctttcccctt gacgtttatg 1816501 tggccgcgct cgcgcaacac gaagtcgtcc ttgagacgct cgtaaacctc gtctggcacc 1816561 tgaatttgcc ccaccgaatc ggtggattcc atccgcgacg cgacattgac cgcgtcgccc 1816621 cacacgtcgt agaagaaccg tcgagaaccc accacacccg ccaccaccgg gccggtggcc 1816681 aggcccaccc gcagcggcac cgggttgccg cgtggatcct tcaattgcgc tgcgacattg 1816741 gtcatgtcga gcgcaaagtc cgccagtgct tgcgtatggt caggccgggg ccgcggaacg 1816801 ccgctgacaa ccatgtagga gtccccgctg accttgattt tctccagccc gtgctggtcg 1816861 accagctcgt cgaaagcgct gtagaggcgg tccaggaacc ggaccaggtc cgccggcgcg 1816921 gtgctactgg cgcgttcggt gaacccgacg atgtcggcga acagcaccga ggcctcgtcg 1816981 tatttatcgg cgatgatgtt tcgctcgggt tctttaagcc gctcggcgat gctggccggc 1817041 aacatgttgg ccagcagtgc ttcggagcgg tcgtgctccg cctccatgac cgcctccgcg 1817101 cgcgcagtat cacgcagcgc gaaccacacc gttgcgaccg ctaccccgca ggcggagacg 1817161 gtcgtgagga cgaaacttac cgacatggcc cagggcggct gaagcccagt atcgggcggg 1817221 accaggaact ccagggcaat caccagaccg gcggcgaccg ccgctaggcc caccgctaac 1817281 gcggtgtgtt cgatgccgac cagcaacacc accaacgcgg cggctaccaa gaagaagaac 1817341 tgggcacccg cgtcggtgcc cacatcccag ccgatggcga agatcgccac ataggcggtg 1817401 ccgatgaacg taagcggtgc caccaatccc ccgaagcgat gtagcagggg cacgatcgcg 1817461 aaagtaaccg cggtgaagac gttgatcagg gcgatgtacc agcccccggc cccggtcgct 1817521 agttgcatta gcgcgaagct cccggttacc acgacagcga gccaggcggt gatggtaagt 1817581 acgcgctgcc gccgcgcgac gctttcggcg tagtgctgcg tgggagcgcg ggcctgagtg 1817641 cgcacggccg tgacacagtc tgggcgtcgt gtcgagccat ccgctgctat cggtggggcg 1817701 ccgcattttc ttgccgccac gaactaaagc ctaatcggtg agttagcgtt taccgactct 1817761 gtcggcgctt tccgagtgcg ttcgcttggt gccctcggtg ggattcgaac ccacactgga 1817821 cgggttttga gtccgtttcc tctgccagtt gggatacgag ggcttgatcc ggtctcctac 1817881 tctagaggag ccacgtcccg actcaccgcc ccccgaggtt cccgatcgcg cccgctcacg 1817941 acacaatgtc cgtcatgacc ggccccacca ccgacgccga tgccgctgtc ccacgtcggg 1818001 tcttgatcgc ggaagatgaa gcgctcatcc gcatggacct ggccgagatg ttgcgagagg 1818061 agggatatga aattgtcggc gaggccggcg acggccagga agccgtcgag ctggccgagc 1818121 tgcacaagcc cgacctggtg atcatggacg tgaagatgcc gcgtcgggac gggatcgacg 1818181 ccgcatccga aatcgccagc aaacgtattg ccccgatcgt ggtgctgacc gcgttcagcc 1818241 agcgtgatct ggtcgaacgt gcgcgtgatg ccggggcgat ggcatacctg gtaaagcctt 1818301 tcagcatcag cgacctgatt ccagcgattg aattggcggt cagccggttc agggagatca 1818361 ccgcgttgga aggcgaggtg gcgacgctat ctgaacggtt ggaaacccgc aagctggtgg 1818421 aacgagcaaa aggcctgctg cagaccaaac atgggatgac cgagccggac gctttcaagt 1818481 ggattcaacg tgccgccatg gatcggcgca ccaccatgaa gcgggtggcc gaagtcgtgc 1818541 tggaaaccct cggaacaccc aaagacacct gagggcgagc agacgcaaaa tcgcccattt 1818601 cgtacccgaa atgggcgatt ttgcgtctgc tcgcggaacc tagcgcgcga cgatcaccga 1818661 cgagccgtgc ccgaacaggc cctggttggc ggtgacgccg accttggcgt ccgccacctg 1818721 ccggccggtg gcctgaccgc gcagctgcca ggtcagctcg cagacctgcg cgatcgcctg 1818781 ggcgggaatc gcctcaccga aacacgccag cccgcccgac gggttgaccg ggaccctgcc 1818841 gccgagggtg gtcgcgccgc tgcgcagcag cgcctcggcc tcacccttgg ggcagagccc 1818901 caggtgttcg taccagtcga gttccaacgc ggtggacagg tcgtagacct cggccaggct 1818961 taagtcttct ggaccaatac cggcctccgc gtaggcagcg tcgaggatct gatccttgaa 1819021 cacccgctcc ggagccggca ccgcggcggt ggaatccgtt gcgatatccg gcaattcggg 1819081 caaatgttgc gggtatttcg gggtaacggt gctgatcgcg cgcaccgacg gcacgcccgc 1819141 caccgagcca aggtgcttct cggtgaaaga cttgctggcc acgatgagtg cggccgcacc 1819201 gtcggaggtg gcgcagatgt caagcagccg aagcggatcc gagaccaccg ggctagccag 1819261 cacgtcgtcg atcgagttct ctttgcggta gcgggcgttc gggttgtcta ggccgtgccg 1819321 ggagttcttg accttcactt gagcgaagtc ctcgactgtg gcgccgtaca ggtccatgcg 1819381 ccggcgcgcc agcagcgcga agtacaccgt gttcgtcgcc ccgatcagat ggaagcgctg 1819441 ccagtcgggg tcgcccttgc gctcgccgcc cacgggcgcg aaaaagccct tcggtgtggt 1819501 gtcggcgccg atcaccagcg ccacgtcaca gaaaccggcc aagatctgcg cgcgagcact 1819561 ctgcagcgct tgggaaccgc tggcacacgc ggcgtagctg gagctgaccg gcacaccggt 1819621 ccagccgagc ttctgggcga acgtggcacc ggcgacgaag cccggatacc cgttgcggat 1819681 ggtgtccgct ccggcgacca gctgcacgtg ccgccagtcc acgccggcgt cccgcaacgc 1819741 ggcgcgggcg gcgaccacgc catactcggt gaagtcatta ccccatttcc cccacgggtg 1819801 cataccggca cccaggatgt aaacgggttc cggcgcgctc atcctcatcg gcgccgctcc 1819861 tcagcatcgc tgcgctctgc atcgtcgccg gcgcgcgatg ggatccgcca cgcgtagacg 1819921 atgcgctgca caccgtcgtc gtcggcgaac agcggcatgg tcgtcagctc catctccatg 1819981 ccgaccttca gatcggcggc cagcgtgcca tcgaccactt tgcccagcac gatcagtccc 1820041 tcgtcggcca gttccaccgc ggccacggca aacggctcaa aggggtcggg tgccgggtac 1820101 ggcggtggcg gggcgtaccg gttttcggtg tagctccaaa gctttccgcg ggtcgacagt 1820161 ccgaccgact ctagtgtgtc gctgccgcaa gccggattcg gacaattgtc cgcccggggt 1820221 gggaagacgt acgtgccgca ctggggacac ttgccgccga gcagatgcgg gttgccggcc 1820281 ttatcggtgg tgaaccatcc atcgattgcc ggttcttcac gggtgacctc tggcaccggt 1820341 ccagcctacc gagcccgggc gtaaaactga aacgtgttgc agttctgctg gcacctgcgc 1820401 ccgcattcca cgtcagcgtc ggtgcataaa gtgtgagccg tggtgactac tgccagtgcc 1820461 cccagcgagg atcgagccaa gccgacgctg atgttgctgg atggcaattc gctggcgttt 1820521 cgggcgttct acgcactgcc cgcggagaac ttcaagaccc gcggcgggct gaccaccaac 1820581 gccgtctacg gcttcaccgc catgctgatc aacctgctgc gcgatgaagc cccgacgcac 1820641 atcgcggcgg ctttcgacgt gtcccggcag accttccgct tgcaacgcta cccggagtac 1820701 aaggccaacc gatcgtcgac ccccgacgag ttcgctggcc agatcgacat caccaaagaa 1820761 gtgctgggcg cactcggcat caccgtgctc tccgagccgg ggttcgaggc cgacgacctc 1820821 atcgccacgc tggccaccca ggccgagaac gagggctacc gggtgctggt ggtcaccggg 1820881 gatcgtgacg cactgcaact ggtcagtgac gatgtgacgg tgctctaccc ccgcaagggc 1820941 gtcagcgaac ttacgcgctt cacaccggag gccgtcgtcg aaaagtacgg gctcacccct 1821001 aggcagtacc cggacttcgc cgcgctgcgc ggcgacccca gcgataacct gcccggcata 1821061 cccggggtgg gggagaagac cgccgccaaa tggatcgccg agtacggctc gctgcggtca 1821121 ctggtggaca acgttgacgc cgtgcgcggc aaggtgggcg atgcgctgcg ggcgaacctg 1821181 gccagcgtgg tgcgcaaccg tgagctcacc gacctggttc gcgacgtgcc gctggcccag 1821241 accccggaca cgctgcggct gcagccctgg gatcgcgacc acattcaccg gctcttcgac 1821301 gacctggagt ttcgggtgtt gcgcgaccgg ttgttcgaca cgttggccgc ggccggggga 1821361 cccgaggtcg acgaggggtt cgacgtgcgc ggcggcgcgt tggcgcccgg cacggttagg 1821421 caatggttgg ccgagcacgc cggcgacggg cgccgagcgg gcctgacggt ggtgggtacc 1821481 catctgccgc acggtgggga cgctaccgct atggccgtcg ccgccgccga cggcgaaggc 1821541 gcttacctcg ataccgcgac gctgacgccc gacgacgacg ccgcgttggc ggcctggcta 1821601 gcggatccag ctaaacccaa agccttgcat gaggcaaagg cggccgttca tgacctggcg 1821661 ggtcgtggtt ggaccttgga gggcgtcacc tccgacaccg cactggcggc ctacctggtg 1821721 cggccggggc agcgcagctt caccctcgac gacctctcgc tgcgctatct gcgtcgcgag 1821781 ctgcgtgcgg aaacaccgca gcagcaacaa ctttcactgc tcgatgacga cgatacggac 1821841 gccgagacca ttcaaacgac gatcctgcgg gcgcgggcag tcatcgacct ggccgacgcg 1821901 ctggacgccg agttagcgcg tatcgactcc accgcgctgc tgggggagat ggagctgccg 1821961 gtccagcggg tgctggcgaa gatggaaagt gccggtatcg ccgtcgacct gcccatgttg 1822021 accgagctgc aaagccagtt tggcgaccag atccgcgacg ccgccgaggc cgcctacggc 1822081 gtgatcggca agcaaatcaa cctgggctca cccaagcagc tgcaggtcgt gctgttcgac 1822141 gaactgggca tgccgaagac caaacgcacc aagaccggct acaccacgga tgccgacgcg 1822201 ctgcagtcgt tgttcgacaa gaccgggcat ccgtttctgc aacatctgct cgcccaccgc 1822261 gacgtcaccc ggctcaaggt caccgtcgac gggttgctcc aagcggtggc cgccgacggc 1822321 cgcatccaca ccacgttcaa ccagacgatc gccgcgaccg gccggctctc ctcgaccgaa 1822381 cccaacctgc agaacatccc gatccgcacc gacgcgggcc ggcggatccg ggacgcgttc 1822441 gtggtcgggg acggctacgc cgagttgatg acggccgact acagccagat cgagatgcgg 1822501 atcatggcgc acctgtccgg ggacgagggc ctcatcgagg cgttcaacac cggggaggac 1822561 ctgcattcgt tcgtcgcgtc ccgggcgttc ggcgtgccca tcgacgaggt caccggcgag 1822621 ctgcggcgcc gggtcaaggc gatgtcctac gggctggctt acgggttgag cgcctacggc 1822681 ctgtcgcagc agttgaaaat ctccaccgag gaagccaacg agcagatgga cgcgtatttc 1822741 gcccgattcg gcggggtgcg cgactacctg cgcgccgtag tcgagcgggc ccgcaaggac 1822801 ggctacacct cgacggtgct gggccgtcgc cgctacctgc ccgagctgga cagcagcaac 1822861 cgtcaagtgc gggaggccgc cgagcgggcg gcgctgaacg cgccgatcca gggcagcgcg 1822921 gccgacatca tcaaggtggc catgatccag gtcgacaagg cgctcaacga ggcacagctg 1822981 gcgtcgcgca tgctgctgca ggtccacgac gagctgctgt tcgaaatcgc ccccggtgaa 1823041 cgcgagcggg tcgaggccct ggtgcgcgac aagatgggcg gcgcttaccc gctcgacgtc 1823101 ccgctggagg tgtcggtggg ctacggccgc agctgggacg cggcggcgca ctgagtgccg 1823161 agcgtgcatc tggggcggga attcggcgat ttttccgccc tgagttcacg ctcggcgcaa 1823221 tcgggaccga gtttgtccag cgtgtacccg tcgagtagcc tcgtcaggta ccaatctgtc 1823281 cctacgaccc aaccctgtcc ggagcaaccc aacaatatgc cgagtcccac cgtcacctcg 1823341 ccgcaagtag ccgtcaacga cataggctct agcgaggact ttctcgccgc aatagacaaa 1823401 acgatcaagt acttcaacga tggcgacatc gtcgaaggca ccatcgtcaa agtggaccgg 1823461 gacgaggtgc tcctcgacat cggctacaag accgaaggcg tgatccccgc ccgcgaactg 1823521 tccatcaagc acgacgtcga ccccaacgag gtcgtttccg tcggtgacga ggtcgaagcc 1823581 ctggtgctca ccaaggagga caaagagggc cggctcatcc tctccaagaa acgcgcgcag 1823641 tacgagcgtg cctggggcac catcgaggcg ctcaaggaga aggacgaggc cgtcaagggc 1823701 acggtcatcg aggtcgtcaa gggtggcctg atcctcgaca tcgggctgcg cggtttcctg 1823761 cccgcctcgc tggtggagat gcgccgggtg cgcgacctgc agccctacat cggcaaggag 1823821 atcgaggcca agatcatcga gctggacaag aaccgcaaca acgtggtgct gtcccgtcgc 1823881 gcctggctgg agcagaccca gtccgaggtg cgcagcgagt tcctgaataa cttgcaaaaa 1823941 ggcaccatcc gaaagggtgt cgtgtcctcg atcgtcaact tcggcgcgtt cgtcgatctc 1824001 ggcggtgtgg acggtctggt gcatgtctcc gagctatcgt ggaagcacat cgaccacccg 1824061 tccgaggtgg tccaggttgg tgacgaggtc accgtcgagg tgctcgacgt cgacatggac 1824121 cgtgagcggg tttcgttgtc actcaaggcg actcaggaag acccgtggcg gcacttcgcc 1824181 cgcactcacg cgatcgggca gatcgtgccg ggcaaggtca ccaagttggt tccgttcggt 1824241 gcattcgtcc gcgtcgagga gggtatcgag ggcctggtgc acatctccga gctggccgag 1824301 cgtcacgtcg aggtgcccga tcaggtggtt gccgtcggcg acgacgcgat ggtcaaggtc 1824361 atcgacatcg acctggagcg ccgtcggatc tcgttgtcgc tcaagcaagc caatgaggac 1824421 tacaccgagg agttcgaccc ggcgaagtac ggcatggccg acagttacga cgagcagggc 1824481 aactacatct tccccgaggg cttcgatgcc gaaaccaacg aatggcttga gggattcgaa 1824541 aagcagcgcg ccgaatggga agctcggtac gccgaggccg agcgccggca caagatgcac 1824601 accgcgcaga tggagaagtt cgccgccgcc gagacggctg gacgcggcgc ggacgatcag 1824661 tcgtcggcca gtagcgcacc gtcggaaaag accgcgggtg gatcactggc cagcgacgcc 1824721 cagctggcgg ccctgcggga aaaactcgcc ggcagcgctt gatcttgcag ctgatcgcgt 1824781 tcacgtaatg ctgcgcatcg ggctgaccgg cggcattggc gccgggaagt cgttgctgtc 1824841 cacgacgttc tcgcaatgcg gcggaatcgt tgtcgacggc gatgtgttgg cgcgtgaagt 1824901 ggtccagccg ggcaccgagg ggctggcctc gctggtcgac gcgttcggtc gcgacatcct 1824961 gcttgcagac ggagcgctgg accggcaggc gttggcggcc aaggcgtttc gagatgacga 1825021 gtcgcgcggt gtgctcaacg gaatcgtgca cccgctggtc gcccggcgcc gatccgagat 1825081 catcgcggcg gtttcggggg acgcggttgt ggtcgaagat attccactgc tggtggaatc 1825141 cgggatggcg ccattgtttc cgctggtggt ggtggtgcac gccgacgtcg agctacgggt 1825201 gcgacggctg gtcgagcaac gcggcatggc cgaagccgac gcccgggcta ggatcgctgc 1825261 gcaggccagc gaccagcagc gtcgtgccgt cgccgacgtc tggctggaca actcgggcag 1825321 cccagaggat ttggtgcggc gggcccgcga cgtctggaac acgcgcgtcc agcccttcgc 1825381 gcacaacctg gcccaacgtc agattgcgcg cgcgccggct aggttggtgc cggcggatcc 1825441 aagctggccg gatcaggcgc ggcgcatcgt caaccggcta aagatcgcgt gcgggcataa 1825501 ggccttgcga gttgaccaca ttgggtcaac cgccgtgtcg ggcttccccg attttctagc 1825561 caaggatgtc atcgacatcc aggtcaccgt cgaatcactt gacgtggccg acgagctggc 1825621 cgagcccttg ctggccgccg gctacccacg cctcgagcac atcacccagg acaccgaaaa 1825681 gaccgacgct cgcagcaccg tcggccgcta cgaccacacc gacagtgccg ctctgtggca 1825741 caagcgcgtg cacgcctcgg cggatcccgg tcggccgacc aacgtgcacc tgcgggtgca 1825801 cggctggccc aaccaacagt tcgccctgct gttcgtcgac tggctggcgg ccaatcccgg 1825861 cgcgagagaa gactatttga cggtcaagtg tgacgccgac aggcgcgccg acggtgagct 1825921 cgcgcgctac gtcaccgcca aggagccgtg gttcctggat gcctaccagc gggcatggga 1825981 gtgggcggat gcggtgcact ggcgtccctg aacgagggcc tgccgcactg ggcgatgacg 1826041 ccatcgatcg agcaggccgc ccagctgtca tccccggcca gcctcatctg aggcttccag 1826101 ctcgggggcg ccggcgcccg gggcggtggg cgcttctgct acccgagccg gcacgcgcgc 1826161 ttcatgagcc gctgcgccag gtcagctcca tccccttggt ggccagccag cgggtgaggt 1826221 catagccgtt gcgggccagg ccctcgacgg cgtcgactgc gtgccgcacc gcctgctcgg 1826281 cgaccgtcgg ggtcaacagg ccgtggcgga cggcgtcgag tagttcgtcg acgtcggcga 1826341 gctcggcccc gccgccggtg cggacttcga tgtcgaggta gtggtcttcg gaacgccata 1826401 cggaagggcc cggtgtgtat tcgccgacgt ccagatagta gtcgtgatcg cgtttgtggc 1826461 tgggattgaa gtgaaagaca gtggcgcgta ggcccaacga cggcaacagc cacgactcga 1826521 ggtagtggaa ttgggcacgg cccggggtgg gccgggccag gtagagcccc cacggatgca 1826581 ccgtgtactc atcgaccgcc cgcactatgc ccttcggatc ggtattggtg ttggcgatca 1826641 ggtcgaacgt ctcgtgcttg ggtgggtgaa tggctcaccc tatctggtcg cacgaggcgt 1826701 gccggtacat cgacacgccg gtactggtgg cattctgcgc acgctcgccg cacggtgtgt 1826761 ccgcgggtgg ctctaggctg gttggcgtgg ctttcgctac cgagcatccg gtggtcgcgc 1826821 attcggagta tcgcgcggtc gaggagattg tgcgcgccgg cggtcacttc gaggtggtca 1826881 gtccgcatgc tccggccggc gaccagccgg ccgcaatcga cgagctggag cggcggatca 1826941 acgcggggga gcgtgacgtg gtgttgctcg gcgccaccgg caccgggaag tcggcgacca 1827001 ccgcgtggct gatcgaacgc ctgcagcggc ccaccctggt gatggcgccc aacaagacgt 1827061 tggccgccca gctggcgaac gaactgcgag agatgttgcc gcacaacgcc gtcgagtact 1827121 tcgtctcgta ctacgactac taccagccgg aggcgtatat cgcgcagacc gacacttata 1827181 tcgaaaagga tagctccatc aacgacgacg tggagcggct gcggcactcc gcgacctcgg 1827241 cgctgctgtc gcgtcgtgac gtggtggtgg tggcttcggt gtcctgcatc tacggcctgg 1827301 gcacaccgca gtcctacctg gaccgctccg tcgagctgaa ggtgggcgag gaagtgccgc 1827361 gcgatgggct gctgcggctg ctggtcgacg tgcaatacac ccgaaacgac atgtccttta 1827421 ctcgcggctc gtttcgggtg cgcggcgaca ccgtcgagat catcccctcc tacgaagagc 1827481 tggcggttcg catcgagttc ttcggcgacg agatcgaggc gctgtactat ctgcacccgc 1827541 tgaccggcga ggttatccgc caggtcgact cgctgcggat ctttcccgct acccattacg 1827601 tcgccggtcc ggagcggatg gcgcatgccg tctcggccat cgaggaagaa ctcgccgagc 1827661 gactcgccga gcttgagagc cagggcaagc tgctggaggc gcagcggctg cggatgcgca 1827721 ccaactacga catcgaaatg atgcggcagg tcgggttctg ctcgggcatc gagaactact 1827781 cccgccacat cgacggtagg gggcccggca cgccgcccgc gaccctgctc gactatttcc 1827841 ccgaggattt cctgctcgtt atcgacgagt cacatgtcac cgtgccgcag atcggcggca 1827901 tgtacgaggg cgacatctcc cgcaagcgca acctggtgga gtacggtttc cggctgccgt 1827961 cggcgtgcga caaccgtccg ctgacctggg aggagttcgc tgaccggatc gggcagacgg 1828021 tgtatctgtc tgccaccccg gggccctacg agctcagcca gaccggcggc gagttcgtcg 1828081 agcaggtgat ccggccgacc ggtctggtgg acccgaaagt ggtagtcaag ccgaccaaag 1828141 ggcagatcga cgacctgatc ggcgagatcc gcacacgggc agacgccgac cagcgggtgc 1828201 tggtgacgac gctgaccaag aagatggccg aagacctcac cgactacctg ctggagatgg 1828261 gcattcgggt gcgctacctg cattcggagg tcgacacgtt gcgccgggtc gagttgttgc 1828321 gccagctgcg tctgggtgac tacgacgtgc tggtcggcat caacctgctc cgcgagggcc 1828381 tagacctgcc cgaggtgtcg ctggtggcga tcctcgacgc cgacaaagaa ggattcctgc 1828441 ggtcaagccg cagcctgatc cagaccatcg gacgcgccgc tcgcaacgtg tccggcgagg 1828501 tgcacatgta cgccgacaaa atcaccgact cgatgaggga agccatcgac gagaccgaac 1828561 gccggcgggc caagcagatc gcctacaacg aggccaacgg aatcgaccca cagccgctgc 1828621 gcaaaaagat cgccgacatc ctcgatcagg tctatcggga ggccgacgac accgccgtcg 1828681 tcgaggtcgg cggatccggg cgcaacgcat cccgcggccg gcgggctcag ggtgagcccg 1828741 gccgggcggt cagcgccggc gtgttcgagg gccgcgacac ctccgccatg ccgcgcgctg 1828801 agctggccga cctaatcaaa gacctcaccg cacagatgat ggcggccgcg cgcgacctgc 1828861 agttcgagct ggcggcccgg ttccgcgacg agatcgccga cctcaagcgg gagctgcggg 1828921 ggatggacgc ggccggcctg aagtgaccga aacagcgagc gagaccggca gctggcgtga 1828981 gctactgagc aggtatctgg gcacctccat agtgctggcc ggtggcgtcg cgctgtacgc 1829041 caccaacgag tttctgacaa tcagcctgct gccgagcaca atcgccgaca tcgggggtag 1829101 ccggctgtac gcctgggtga caaccctgta tctggtcggg tcggtggtgg cggcgaccac 1829161 cgtcaatacg atgttgctgc gcgtcggggc gcgctcgtcg tatctgatgg ggttggccgt 1829221 cttcggtctg gccagcctgg tatgtgcggc ggcgccgagc atgcagattc tggtggccgg 1829281 gcgtaccttg caaggaatag ccggtgggct gctggccggc ctaggctacg cgctgatcaa 1829341 ctcgaccttg cccaagtcgc tgtggacccg tggctcagca ctggtgtcgg cgatgtgggg 1829401 ggtcgcgacg ctgatcggac cggcgaccgg aggccttttc gcgcagctcg ggctgtggcg 1829461 atgggcgttc ggcgtgatga cgttgctgac cgcgttgatg gccatgttgg tgccggtcgc 1829521 gctcggtgcc gggcgggtcg gcccgggcgg cgagacgccg gtgggcagca cacacaaggt 1829581 gccggtgtgg tcgctattgc tgatgggggc cgccgcactg gcgatcagcg tcgccgcgct 1829641 tccgaactac ctcgtccaga cggccgggct gctagccgcc gccgcgctgc tggttgcggt 1829701 gtttgtggta gtcgactggc ggatacacgc agcggtgttg ccgcccagcg tatttggctc 1829761 cggaccgttg aaatggattt acctgaccat gtcggtgcag atgattgcgg caatggtcga 1829821 tacctacgtg ccgctgttcg gtcagcgact gggacacctg accccggtgg cagccgggtt 1829881 cttgggtgcc gcgctggcgg tgggctggac ggtcggtgag gtcgccagcg cctcgttgaa 1829941 cagtgcacga gttatcgggc atgtcgtggc agccgcaccg ctggtgatgg cgtcggggtt 1830001 ggcgctaggc gccgtcaccc agcgcgccga tgcgccggtg gggatcatcg cgctgtgggc 1830061 gctggcgctg ctgatcatcg ggaccggcat cgggatcgcc tggccgcatc taacggtgcg 1830121 cgctatggat tctgtcgccg acccggccga gagcagcgcg gcggccgcgg cgatcaatgt 1830181 cgtacagctg atctccggtg ctttcggcgc cgggctggcc ggtgtggtgg tcaacactgc 1830241 caagggcggc gaagtggcgg cggctcgtgg gctatacatg gcatttacgg tgctggccgc 1830301 cgctggtgtc atcgcctcct accaggccac gcaccgcgac cggcgcttac cgcgttgact 1830361 tgaccacctg cgagtagtgg aactgccagc gctcgacgat gcggaagccg aggtagctcg 1830421 ggaaccggta tacgggcgtg cgcccgaagc ctgttcccgg tgacaacatt tcgccgacct 1830481 gatgatcggg caacgacttg tcacgattgg ctatcgtcca cagcgtgggg cacttgtcga 1830541 tcttggccgt cgtaagccac acagcgacat ggccatccca caaagtgccg accttggggc 1830601 cgtaggtgcc gcgctcgacg tcaatcagcg accggaacgc cgccggccgg gtggccagca 1830661 gggcgcggat gggcccgggt cgccaacccg cggtgttgtc caccagcagg caatccccgg 1830721 gcttggcatg ggcgctgatg acatctgcca cctggctgta atcccagccc tctttcgcgt 1830781 acggcccccg ctgtgtgaag aagtagttcg gaaacgctgc ggcggcaagg agaaacacga 1830841 ccccggcgat gagccacggc ttgcgggcga tggtgacgac gcaaaccgcc aggatgacgg 1830901 ccgcggcggg ggcggtgagg atcaggtagc gcgggtagta gatcggttcg acggtcgccg 1830961 agtagatgag gacgacggcg gtgggcacga cgatccaggc tgcgctgacg agcacgagcc 1831021 ggtgggtatc gccaccgggt ccacgagctc cggccagatg cgccgcgatg ccggcagcga 1831081 cgatgaggcc cgcgaggatg gcgaacggaa cactgtgatc gaaatactgg cggtgtatga 1831141 cgtcgagaat gatgtttctg ttcaaccctg cgatccaccc gacctgccaa acctggccgt 1831201 gggcgaacag tatgaacggt gtcatggccc cgagcgcggc tgccgtgacg accgtccacc 1831261 agatcacggg agatttgcgt gatttcccgg acgccagcag cggcaccatc gtcgcatagg 1831321 ccggtaccaa cagggccagg ttgatactga ccaagatcga cagcatcaaa accagcgcgt 1831381 agagcagcca ccgccgctgg gtgttgcacc gcaccgcggc cacgagtaat acggtcagcc 1831441 agacggcggc tgctaccgac agcgcggagg agcgtgcttc gattccggcc cacgtcaccc 1831501 tgggcagaat cgcgaacacg gctcccgcac acaccgccgt ggtgcgtccc gaaaactgtt 1831561 tggcaaaaac caccacgccg gcggcggccg ctccaatggc caggcagctg ggaagccgcg 1831621 accataattc ggtgggcgga aatatggcga accagccatg catcaacagg tagtacaggc 1831681 cgtgcacggc gtcgatatgg cccagcagac tccatagctc tggcaatgtc cggctggctg 1831741 aagccgagat cgttgccccc tcgtcgaacc acaacgatgg cctgcttgcc caggcgccgc 1831801 tgatgaccgc ggccagcact gcaatcgcca gcgggtcgag cagccggccg cgcatccgcg 1831861 ccaccaactc gtcgacgtgt gctgccgcgg gctgctccag agtggaggcg gacatgatgc 1831921 gggtcacctt agggtccgcg cgatgatcct ggtcaccggc ggttcggcga ctgggcagcc 1831981 cggcgtgcgg cggtgcgccg ggacgactcg catgcatttc ccaaaaagcc ttgcacagca 1832041 acattttccg cgatcagcgt gcgtattgaa tcgtcgtgtc atcgccacca ttgtcggctg 1832101 gttcaccgcg atcgggcaaa tgagggttgc gccacgccgt tgcggtgtga ttaatctgac 1832161 ctatctatat ccggcaacgc gatactgtct ggggttggcg tagcaaccga cacctgggag 1832221 ggtaaatgag cgcctataag accgtggtgg taggaaccga cggttcggac tcgtcgatgc 1832281 gagcggtaga tcgcgctgcc cagatcgccg gcgcagacgc caagttgatc atcgcctcgg 1832341 catacctacc tcagcacgag gacgctcgcg ccgccgacat tctgaaggac gaaagctaca 1832401 aggtgacggg caccgccccg atctacgaga tcttgcacga cgccaaggaa cgagcgcaca 1832461 acgccggtgc gaaaaacgtc gaggaacggc cgatcgtcgg cgccccggtc gacgcgttgg 1832521 tgaacctggc cgatgaggag aaggcggacc tgctggtcgt cggcaatgtc ggtctgagca 1832581 cgatcgcggg tcggctgctc ggatcggtac cggccaatgt gtcacgccgg gccaaggtcg 1832641 acgtgctgat cgtgcacacc acctagcggc cgttaccagc cgcgcgcacg ccattcgctg 1832701 aggctggggc gttcggcacc cagctccgtg tcgtcaccgt ggccggggta gatgacggtg 1832761 gagtcggcgt acacgtcgaa aacccgggtg gtgacgtcgt cgagcagttg ggtgaagtcg 1832821 gcaggttgcc aggttttgcc gacaccgccg gggaacaagc agtcgccggt gaagagctgt 1832881 gtgacgcctc cggtcaccgg cccgccgagg gccagcgcga tcgatccggg tgtgtgtccg 1832941 cgcaagtgga tgacgtcgaa tgtcagctcg ccgatgcgca cgctgtcgcc gtgggtgagc 1833001 aaccggtccg gtttgaccgg cagcgggtcg gcgtcgatcg gatgggccgc ggtcggcgcc 1833061 ccggtggccg cggccaccgc ttgcagcgcc tgccagtggt cgaagtgctg gtgactggta 1833121 acgatcaggg ccagcttcgg cgcgtaccgc cggaccaggt cgatgaggac ctccgcgtca 1833181 ttggcggcgt cgatcagcag ggtttctccg gtcgctgaac acgtcaccag gtaggcgttg 1833241 ttgtccatcg ggcccaccga tgccttgagg atcgtggcgc cgggcaggaa gcgacgcgcc 1833301 gccttgccgc gttcgacgtg tccggtgtaa ttgtcgtcga ctgttgtcat atgcgccact 1833361 gctcctatgc cggctgcgcc ggcatcatcg tcgttggcgc gggtcatatg cgccgacgtt 1833421 acgacgttac cggtcccctg atggttgtcg gtacgggcac atagcatggg atacggcctt 1833481 tggccggcga gatgagtttc agtgaaaggg acagcgtggc tgaccgcctg atcgtcaagg 1833541 gtgcgcgcga acacaatctg cgcagcgtcg acctcgacct gccccgcgac gcgctgatcg 1833601 tcttcaccgg gttatccgga tcgggcaagt cctcgctcgc gttcgacacc atcttcgccg 1833661 aggggcagcg gcgttacgtg gagtcgctgt cggcctacgc ccgccaattt ctcgggcaga 1833721 tggacaagcc ggacgtcgac ttcatcgagg ggctgtctcc ggcggtgtcc atcgaccaga 1833781 agtcgaccaa ccgcaaccca cgatcgacgg tcgggaccat caccgaggtg tacgactacc 1833841 tgcggctgtt gtatgcgcgc gcgggcacgc cgcactgccc gacctgcggg gagcgagtcg 1833901 cgcgccaaac cccgcaacaa atcgtcgatc aggtgctggc catgccggag ggcactcggt 1833961 ttctggtgct ggccccggtg gtgcgtaccc gcaagggcga gttcgccgat ctgttcgata 1834021 agctcaacgc ccagggctac agccgggtgc gggtcgacgg tgtggtgcat ccgctgaccg 1834081 atccgccgaa gctgaaaaag caggaaaagc acgacatcga ggtggtggtg gaccgtctca 1834141 ccgtcaaggc cgccgccaag cggcggctca ccgattcggt ggaaaccgcg ctgaatttgg 1834201 ccgacgggat cgtggtgctc gaattcgtcg atcatgaact gggtgcaccg catcgcgagc 1834261 agcggttctc cgagaagctg gcctgcccca acgggcacgc gctggccgtc gacgacctgg 1834321 agccgcggtc gttctcgttc aactcgccct acggcgcctg ccccgaatgc agtggtctgg 1834381 gcatccgcaa ggaggtcgac ccggagctgg tggtgcccga tccggatcgc accctggcgc 1834441 agggtgcggt ggcgccgtgg tcgaacggcc acaccgcgga gtacttcacc cggatgatgg 1834501 ccggccttgg cgaggcgctc gggttcgacg tcgacacgcc ctggcgcaag ctgccggcca 1834561 aggcccgcaa ggcgattctg gaaggcgccg acgagcaggt gcacgtgcgc taccgcaacc 1834621 gctacggacg cacccggtcg tattacgccg atttcgaggg tgtgctggcg ttcctgcaac 1834681 gcaagatgtc ccaaaccgag tccgagcaga tgaaggagcg ctacgagggt ttcatgcggg 1834741 acgtgccctg cccggtgtgt gcgggcaccc ggctcaagcc cgagattctg gcggtgacgc 1834801 tggctgggga gtccaagggg gagcacggcg ccaagtccat cgccgaggtg tgtgagctgt 1834861 cgatcgccga ctgcgcggac ttcctgaacg cgctcacgct gggtccgcgc gagcaagcga 1834921 tcgccgggca ggtgctcaag gagatccggt cgcggctcgg gtttctgctc gacgtcgggc 1834981 tggagtacct gtcgctgtcc cgggcggcgg ccacgctgtc cggcggtgag gcacaacgta 1835041 tccggctggc cacccagatc ggctccggcc tggtgggtgt gctctacgtg ctcgacgagc 1835101 cgtccatcgg gctgcaccag cgcgacaacc gtcgtcttat cgaaaccctc acccggttac 1835161 gggatttggg gaacactttg atcgtcgtcg agcacgacga ggacaccatc gagcatgcgg 1835221 actggatcgt cgacatcggc ccgggggccg gtgagcacgg tggccgcatc gtgcacagcg 1835281 ggccctacga tgaactgcta cgcaacaagg attcgatcac cggcgcctac ctgtccggcc 1835341 gggaaagcat tgagataccg gcgattcggc gttccgtcga cccccgtcgt caactcaccg 1835401 tcgtcggcgc ccgcgagcac aacttgcgcg ggatcgatgt gtctttcccg ctgggtgtgc 1835461 tgacctcggt gaccggtgtc tcgggttcgg gcaagtcgac gttggtcaac gacatcctgg 1835521 ccgcggtgct ggccaaccgc ctcaacggcg cccggcaggt ccccggccgg cacacccggg 1835581 tcaccgggct ggactatctg gacaagctgg tgcgggtgga ccaatcgccg atcgggcgca 1835641 caccgcgatc caacccggcc acctacaccg gtgtgttcga caagatccgc accctgttcg 1835701 ccgccaccac cgaggccaag gtccgcggct atcaacccgg acgattctcg ttcaacgtca 1835761 agggcggtcg ctgcgaggcc tgcaccggcg acggcaccat caagatcgag atgaacttcc 1835821 tgcccgacgt gtacgtgccg tgcgaggtct gccagggggc ccggtacaac cgcgaaaccc 1835881 tcgaggtgca ctacaagggc aagaccgtct cggaagtgct ggacatgtcc atcgaggaag 1835941 cggcggagtt cttcgagccg atcgccggcg tccatcgcta tctacgcacc ctggtcgacg 1836001 tgggcctggg ctacgtgcgg ctcggccagc ccgcgcccac gctgtccggc ggtgaggccc 1836061 agcgggtcaa gctggcctcg gagctgcaga agcgctccac cgggcgcacc gtctacatcc 1836121 tcgacgagcc gacgacggga ctgcacttcg acgacatacg caagctgctc aacgtgatca 1836181 acggcctggt cgacaagggc aatacggtga tcgtcatcga acataacctg gacgtgatca 1836241 agacatcgga ttggatcatc gacctgggcc cggagggcgg tgccggcggc ggaaccgttg 1836301 tcgcccaagg cactccggag gacgttgccg cggtgccggc gagctacacc gggaagtttc 1836361 tcgctgaggt cgtcggcggc ggtgcctcgg ccgccacatc gcggtcgaac agacggcgca 1836421 acgtcagcgc ctgagctgga ctatcgccgc gcgtcaagtc tgtgctcacg gcggcgaact 1836481 gggtgcggtc tcactcatcg gtgtgcatcg actcacggat ctgagctagc cgttcggctg 1836541 ccgcgcgctg ccgctgcgcg tactgatctt cgagccggcg gccttgcggg ctctcggcgt 1836601 cgagttcggt tgcccccagg gccgttccgt atcgggtttc gatcttctcg cggacggatt 1836661 cgaaggtcgg taccccagcg ctgtcgtacc gcggatcgga ctcggaattc ggcgtcgtgg 1836721 cttccggtgg tgtcggttcg tcgggcatgc tctggcaatg ctcctatctg ccggtaccgg 1836781 cgatctgctg tgtcgtaccc ggcaacggga tcttaggcac tcccggagtg gccagttggc 1836841 cggccagcca tggcagcgcc gcagcgaaaa cccggtcggc gaaaggccag tcgtgcttgc 1836901 ccggttgtgg aaccacggcg cagtagatgc cgttggcgcg gccgagggcg cacagtgcat 1836961 tggcggcagc ggcctggttg cctgggttgg cggcggcatc gcgaccggcc agccgcatcg 1837021 tggtggtatc ggcgacagcg ttgtcgggcg agggtggacc cggcgaagag atcgcgaacc 1837081 aacccgacag tccggtgtag ctgccatgcc gggtgatcac cgtcgtcggg tcaaacgccg 1837141 accaggcgtc ttcgttgccg ccgaacaacc tgacgatggt ttgcgtcttg ttgccagcgt 1837201 tcgggtagaa atcaccggcg atgtcgacaa acgcgctaaa cagtgtcggg tgcatgacgg 1837261 tcagatccac cgcgcaggtc ccacccatcg accaacccac gatgccccag ctggtctgtt 1837321 cgggactgac gccgaatttc gagaccatgt agggcacaac atctttagtc aagtggtcgg 1837381 ccgcgttgcc acgccgtcca ttgacgcatt cggtgtcgtt gttgaacgcg ccgccggaat 1837441 ccacgaatac cacgacggga gcattgccgc tgtgggcggc cgcaaagtcg tcgagcgtct 1837501 tcaccgcgtt accggctcgc gcccaatcgg cgggtgtgtt gaattgaccg ccgatcatca 1837561 tcaccgtcgg cagctgcggc ggcggagggt tctcggaacg atgctctcgg tcgaaccagg 1837621 ccggcggcag gtacaccagt tcgccgcgat gcttgaagtg tgatgcgtcg gaagggatca 1837681 ccactggcaa caaggtgccg tgcgacggcc gcaccccact gtgcgccagt gcggcaacag 1837741 cggcctgatc ggcctggtcg ggcaacgggc cggaggtgag ctggttccac gcggtctgca 1837801 cggtcgggaa gtagccaacc cacaggttga gcgtcaaggt cgcgctgagc agacagaggg 1837861 gcacggccag cagcgacgcg ccgcggcgcc accaccgcgc gctgcgccag cccaggatca 1837921 acaccgtcgc cgccgcgccg gtcaacgcga cccagatcca cagcgtgctc ggcggccgtt 1837981 cgttggccag gccgttgccg gtgacatacc agcgcgtccc ccatgccagg gtggccccga 1838041 tagcggcggc cgtcggcagc caccgccgtt gccagtgacg tgatcgccac cctgccgcca 1838101 gcaccagcac gaccgcggtc acgacctgga cagcgagcgg cacccaaccg tgcatcagcg 1838161 atgtgtggcc tactgctaac ggctgcgtcg cggctggcgg cgtcgacgcg gtcaccagtt 1838221 cattctgagc catttcgggc ggtgatttgt tgggggtttc ctgtgatccg acggacgccc 1838281 accggctccg gctaatgcgg ttttgccaac ggaaagggca gtgtttcgcg aatgctgcgc 1838341 ccagtgatca gcatgaccac ccggtcaatg cccatgccca agccgccggt gggcggcatg 1838401 gcgtactcca tcgcttgcag gaagtcttcg tcgagttcca tcgcctcggg gtctccgccg 1838461 gcggccagca gggactgctc ctgcaggcgg cgccgttgct ccaccgggtc ggtcagctcg 1838521 ctgtaggcgg tgcccagctc gataccccac gccaccaggt cccaacgctc ggcgacaccg 1838581 cgcttgctgc gatgcggtcg ggtcaacggt gacaccgatg tcggaaagtc gatgtagaac 1838641 gtcggttgct cggtgcggca ctccaccagg tgctcgtata gctcgagcac gaccgcgccg 1838701 gcatcccatt gggtccgata ggggacaccg gcggcgtcgc acagcttgcg gagagtggtc 1838761 aagccggtat cggcgtcgat gcgttcaccg agtgcttccg agatcgcatc atgcaccgtc 1838821 cgcaccggcc atatcccgga gatgtcgacc ggttcgaggt ggtggcgggt gccgtcggaa 1838881 cccttgtccg tccggggccg catggcgatg ggcgccccgt tggcggcctg ggcggcgttc 1838941 tggatgagtt cgcggcagcc gtcaatccac tcaaggtagc cggcgtgtgc ttgataggcc 1839001 tccagtaggg tgaactccgg gttgtggctg aagtcgacgc cctcgttgcg aaaggcacgg 1839061 ccgagctcga atacccgttc cacgccgccg acgcacaggc gcttgaggta gagctctggt 1839121 gcgatgcgca ggaacagatc catggaatac gtgttgatgt gcgtgacgaa cggtcgggcg 1839181 gtggcgccgc cgtgcagctg ctgtaggatc ggcgtttcga cctcgacgaa tccctttgcg 1839241 aacagcgtct cgcgcacagc gcgcagcacg ctgctgcgag cggtgatcag cgcacgggac 1839301 tcagcgttga ccgccaggtc gaggtaacgg gtccggactc gggcttcggg atccagtagc 1839361 cccttccact tattcggcaa cggccgcaaa cacttaccga tcaggcgcca gccgctgacg 1839421 atcaacgatg gagttccggt cttgctggcg cccatgtgtc cggtcatctc caccagatca 1839481 cccagatcgg tcgccgcgtt gaagtcggcc gcgcagccct ggtccaggcg tgaattatcc 1839541 agcagcactt gcatttcgcc cgaccagtcg cgcagctggg cgaacaacac accaccgtag 1839601 ttacgtattc gcatgatgcg tccggacacc gacacgctag cctggtggtc tgcggccagc 1839661 gcctgtgcca ccgtgtgact gggcggccgg cccacgggaa aggcgtcaat gccgctgctc 1839721 cgcagcttct ctagcttgtc gaaccgaact cgcacctgct cgggtagccg ccgctcgacc 1839781 ccgtcgccat tggtgaggcc tacttgccgc agcccgctca cgtccggtgc cgagccgtcg 1839841 tgatgcaata ggccggtggc cgccaaccgc tcgggcactg ccggatgatg ccccgtgtgt 1839901 actcggttgc gccggctgaa cggcagcacg aggaacccct ctgcgatcac cgaggcgacg 1839961 cccactcggg gaatcactcg ggcgtcttcg tagcaggcgt agcgcggtac ccattcgggt 1840021 tggtacttca tgttggagcg gtagagcgtc tcgagctgcc accaccgtga gaagaagacc 1840081 agcagccccc gccacaaccg ggcaaccggg ccggcgccga gttgggcgcc ctgctcgaag 1840141 gccgcgcgaa acaccgcgaa gttcaacgaa atacgagtga taccaaggct ttcagcgtgc 1840201 aaggcgagtt cgctgaccat aagttcgata gtgccgttcg gggattgtgg agaacgacgc 1840261 atcaaatcca gggagacacc ggtggttccc cacggcacca gcgacagcat tgccagcacc 1840321 tggttgtgcg gatcaatcgc ctccaccagc aggcagtcgg agtccgcggg gtcgccgagg 1840381 cggcccagcg ccatcgagaa gccgcgctcg gtctcggtgt cgcgccagga atccgcccgt 1840441 gtgatggtct gcgccatctc gtcttcggca atgtcgcgat gccgccggat gcgcaccgtc 1840501 aaccccgccc gccgggcccg cgtcacggcc tggcgcaccc cgcgcatctc cgggccggac 1840561 aacttgaaat cggctggccg caggatggcc tcatcgccca gctcgagcgc ggttaggccc 1840621 gcttcgcgat atgtctgagc cccttgtgaa ctggcgccca tcacgccggg tgcccagccg 1840681 taggtctggc acagccgcag ccacgcgtcg acggcctgcg gccatgctct gtggtcgcct 1840741 accgggtcgc cgctggctag gcagacaccg acctcgacac ggtaggtgat acaggcgcgg 1840801 ccgctggatg cgaataccac cgacttgtcg cgacgggtgg cgaagtagcc cagtgagtcg 1840861 tccttcccat acaaatccaa taacccgcgg atagcggatt cgtcctctcc ggtcagcgca 1840921 ttgtcagcgc gctgagatag gaacaagacg atcgcagccc cgatcaacgc gaacgcgccg 1840981 aacaacccga agatcgcgtt gaggaagacg tgcggtctgc cggtgaacag atcgggatcg 1841041 gcgagggcga atccgaccac ccggttggcc gcgtaaccca accgctcgtc cggcgctagt 1841101 gatcccggaa acagttcgac cagaccccaa gacgccacga ttccgaccac cgcgccggca 1841161 agccacaccg cagccgcccg aaacagcgcg cccctgcgga ccttggccca gaactcccga 1841221 tagcccagca ccagaacgac gattgccaca acatgcacgg cgaatccgag attctccccg 1841281 aagctctcgg cggcggtgtt gccgcccgct gcgatctcgg cggcgttgac cacggcggcc 1841341 aggaccatat ttgccagcaa gaccaaccag gcaatgcgtt tgcgtgccgt taacgcggcg 1841401 gccagcaatg ccagcacgaa ggaccacgcg aagttggtgt cggggaagtt gaacagataa 1841461 tcgttgatga attcgcgcgg aaccttgatg atccaccgaa tcaacggcga cacactggcc 1841521 agtagtgaca gggtcgcgat cacgccgacg gtccagccgg ctgccgcggg aacccagtga 1841581 taccgggagt ttcccctggt ggccgagcga ggtttggtga gtgtcacaga ccgcgaggat 1841641 attcccaaaa gccgggaaat gcccggcgtt gcagcccttt gtagccccgc atcggtgtgc 1841701 tgagggcacc ggctgatgtc ggccgttgtc ttagatgacg tgtcatggct gttagactgg 1841761 acgccgcgac catcccggcg aaggccaggg acagttaagt ggagtcccac tcccaccgct 1841821 agccacgaga tcgtttcaca ccttctcaag gttcagcggt ccggtcacag gcatctcgga 1841881 tgcctgttct gcgtgcagcg tgggcggctt tggccgcgat cggtcggcat tgggccctgc 1841941 ttgtgcaggg cttttttgct gatggtttgg gtgtgttccc cacctgattc cggccgggtc 1842001 caacaagctg gtcgcgcctg gaacagcagc caacgaggga ggccccatca gcactgaaac 1842061 ccgcgtcaac gagcgcatcc gcgtacctga agtccgattg atcggcccag ggggggagca 1842121 ggtaggcatt gtgcgtatcg aagacgcact tcgcgtcgcc gcggacgcag atctcgacct 1842181 tgtcgaagtt gctcccaatg ccagaccgcc ggtctgcaag atcatggact acggcaagta 1842241 caagtacgag gccgcgcaga aggcgcgcga atcccgcaga aaccaacagc agaccgtcgt 1842301 caaagaacaa aagctgcgac caaagattga cgatcacgat tacgagacca aaaagggtca 1842361 cgtcgtccgc ttcttggagg cgggatcgaa ggtcaaggtc accattatgt tccgtggacg 1842421 tgagcagtcg cggccggagt tgggctatcg attgctgcag cggctgggtg cggacgtcgc 1842481 cgattacgga ttcatcgaga cgtccgccaa gcaggacgga cgcaacatga cgatggtgct 1842541 ggcaccgcac cgcggtgcga agacccgcgc tagggcccgc cacccgggtg aaccggccgg 1842601 cgggccgccg cccaagccca cggccggtga cagcaaagcc gcaccgaact agctcgccag 1842661 caagacacgc agaacctaga aattctagaa attgaggaaa catgcccaag gccaagaccc 1842721 acagcggggc ctcgaagcgg ttccggcgca ccggtaccgg caagatcgtc cggcagaagg 1842781 ccaaccgtcg gcacctgctc gagcacaagc cgagcacccg caccaggcgc ctggacggcc 1842841 gcaccgtggt ggcagccaac gacaccaaac gggtcacgtc gttgctgaac ggctgaccgt 1842901 accgccggcc ggctccggca cctgaccaat cacgtccgaa cgagagtagg aagatccatg 1842961 gcacgcgtaa agcgggcggt caacgcccac aagaagcggc gcagcatcct gaaggcatcg 1843021 cgaggctatc gcggccagcg atcgcggctt taccgcaaag ccaaagagca gcagctgcat 1843081 tcactgaact acgcctaccg tgaccgccgg gcgcgtaagg gcgagttccg caagttgtgg 1843141 atcgcacgga tcaacgcggc tgcgcgcctc aacgacatca cctacaaccg gcttatccag 1843201 gggctgaagg ccgccggcgt cgaggtggac cggaaaaacc tcgccgacat tgcgatcagc 1843261 gacccggcgg cgttcaccgc gctggtcgac gtcgcccggg cggcactgcc cgaagacgtc 1843321 aacgccccct ccggggaggc cgcctgatcc ggattccggc ctgaggcagg gctacgccgg 1843381 tgctcaccga acgctcggcc agggtggcca cggcggtcaa actgcatcgt cacgtaggcc 1843441 ggcgccgggc gggacgtttt ctcgccgaag gccccaacct ggtagcggcg gcgttggcgc 1843501 gcgggctggt acgggaggta ttcgtcaccg aagttgcggc gcggcggcac gagctcttgt 1843561 tggccgcgca cgaggcttcg gttcatctgg tgactgagcg ggccgcgaag gcgctctctg 1843621 atacggtcac gccggccggg ttggtggcgg tgtgcgatct gccggcgacc cgacttgagg 1843681 atgtattggc cggctcacct cagctgatcg cggtgaccgt cgagatccgc gagccgggca 1843741 acgcgggcac ggtaatccgc atcgccgacg ccatgggtgc cgcggcggtg atcctcgccg 1843801 ggcgcagcgt cgacccatac aacggcaagt gtctgcgcgc gtccaccggt agcatcttcg 1843861 cgatcccggt cgtcgtcgcg cccgatgtcg gtgccgccat cgccgacctg cgagcggccg 1843921 gactgcaggt gctggccacc gcagtggacg gcgagatggc tctcgacgat gccgatcggc 1843981 tgcttgccga gccgacggca tggctgttcg ggcccgaagc acacgggttg tcggccgaga 1844041 tcgcggcctt ggcggaccac cgcgtacaca tcccgatgtc gggaggggcg gagagcctca 1844101 acgtcgcggc cgcggccgcg atctgtctgt atgagagcgc tcgggcgttg ggccgccgct 1844161 gattgtccgg ccctacgcag cgcggctggg gccccgcgcc ggccgcacgc cggccagcga 1844221 aagtgtggaa tggaccagcg ccccggcgcg ttccatcagg gccttggcgg gatcgagagc 1844281 cgaccgggta aagcgatggg aacggggtgg gaagtagtgc atcggcagcc cctgccgatc 1844341 ccgtggtggc gccatgaagc cgggcgcttg caccaggggc gcgtcatggc gagtacggcc 1844401 tgccgctcgc cggtaggcgg ccagcgcgcg cgggtgcaac cggatttcgt cgggcaccgc 1844461 caaaaacgcc agttctacca ccttgccgaa cactcgcagc aacacctcgt cacccggtgt 1844521 ccagtgcatt cccgccttct cccgcacggc cgggtcgaac aggccggccg cgatccagcg 1844581 ctgaccggct attagcggct tgaacagctg atcccagatc ggcgttggca tgagtacaaa 1844641 cctcggtttg ggaatccgca tctggaggat gtccacggtc gcctgattga tctcgagctt 1844701 gtcgcggcac acccggtccc aatagtcctg aaagtcttcc cacgacttgg gcaccggtcg 1844761 catgctcatc ccatacatcc ggtaccagcg cacgtgctcc tcgaagagct ggtgtttttc 1844821 ggcctcggtc aagcctccgc agaagtattc ggcgaccttg atgacaagca tgaaaaacgt 1844881 cgcatgcgcc cagtagaacg tatctggatt cagcgcgtga tagcgacgcc cctcagcgtc 1844941 gactcccttg atggttcggt ggtagccctt gatctgctgg ccggtctggg ccgctcggtc 1845001 accgtcatag accacaccca tgatcgggta caccgagcgg gctacccgct gcaagggttc 1845061 gcggagcagg attgaatgct cctcgacacc ggcacctagc tcgggataca tattttggat 1845121 cgcgccgatc cacacaccca tcatcccggt gcgcaggtct ccgaaatatt tccaggtcag 1845181 cgaatcgggc ccgagcgggt cggcggatgt cctcgatgcg acagtcatga ctgcctccgt 1845241 gccaggttag tctgcgccca cgataggcat tgacaacgcg cgttgtccac gatttggtcc 1845301 gccgatatcg cgccgtgtca cccagtgcct cctccgggtg gcaacgagcg tggacgagga 1845361 ctgcagctgc atagcttggc ccgcggtgcg tgcgggggca gggagtccaa tgaaaaatgt 1845421 tgcttagaac gccagaaagt ttttaactag atcaggattg cttagctgta gactttattt 1845481 ctcaatgacc acgtaaggat tgctgcggcc agtacaacgt gtacaaggag tcgggctatg 1845541 tcgtttctca ccgtggcgcc ggacatggta acggcggccg ccgggaattt ggaaagcgtt 1845601 ggctcggcac tgaatgaggc cgctgcggcg gcggcgccag ccacggttgg gctggcggcc 1845661 ccggccgcgg atcgggtgtc ggcggtcgtc gcggcgatgt tgggggcata tgcccgggat 1845721 tttcaaggca tcagtgctca gatcgcgggt tttcataacc agttcgtggg cgcgttgcgg 1845781 ggcggtgcgg ccgcctacgc cagcgccgaa gccgccaacg tccagcagac cgtggtgaac 1845841 gccgtgaatg cgcccgccca ggcgctgttg gggcacccgt tgatcgggcc cgagacggtc 1845901 ggctccagcg ccgccgcggt ctccttcggc ttcggcccgt tgctcctcgc tggtagcgat 1845961 ccgctgctgg ccgtgccatt cagctatccg gccagtctgc ccaccccatt cggtccagta 1846021 acgatgacgc tcaacgggtc gtttgatccg cttacccaac aggttgtttt cgactcggga 1846081 tcactcaccg cgcccgctcc gttcgtgtac ggtcttggtg cggtaggtcc agctctcacc 1846141 accatgaccg cgctgcaaaa cagcggcaca gcattttccg gcgcggtgca aagcgggaac 1846201 ctgctagggg ccgcgggcgc gcttctgcaa gctcccggca acgcggtgac cggcttcctg 1846261 tttggccaaa cagcgatatc gcagtcgata ccggggccat cgaatctggg ctacgagtcg 1846321 gtgggtatca gcgttccggt cggggggctc ttggctccgc tgcagcccgt gacggtcacg 1846381 ttgacgccca catctggtat gccgactgcc attcaattga gtggtacgca gtttggcggc 1846441 cttcttcccg ccctactcaa cggtttctaa ccgtctgcgg acagccgccg caaaccgcgt 1846501 gatcagcgtg tttgatgcga cttgtgccac aaacaccgag gtcgtcattg ccgggctcag 1846561 cccgcaccac ctacccttgc cacgtggagg tcgggccgca ggattcggag tccggcgcgc 1846621 ccgacgagac ggcaaccgcc atggcgtcgc cagtacctcg acaacggtcc gcactacgct 1846681 ggctgcgcac cgtgaaccgc agccctggcc tggtgtcatt catccaccgg gcgcgccgcc 1846741 tgttgcctgg cgatccggaa ttcggcgacc cgttgtccac cgcgggtgag ggtggtccac 1846801 gtgccgcggc tcgagctgcc gatcggctgc tgcgggatcg cgatgcggcc tcgcgcgagg 1846861 tcggcctgag tgtgctgcag gtgtggcagg cgttgaccga ggccgtttcc cgccggccgg 1846921 caaacccgga ggtgacgttg gtgttcaccg acctggtcgg cttttccacg tggtcgttgc 1846981 acgctggtga cgatgccacc ctcacgctgc tgcggcaggt ggcccgggct gtcgaatccc 1847041 ccctcctgga cgccggcggg cacatcgtca aacggctggg cgacgggatc atggcggtgt 1847101 tccgcaatcc gaccgtcgcg ctgcgagccg tgctcgtcgc ccaagatgct gtgaagtcgc 1847161 ttgaagtgca aggctataca ccgcgaatgc ggatcggtat ccacaccggc cggccgcagc 1847221 ggctggccgc cgactggctc ggcgtcgacg tcaacatcgc cgcccgggtt atggaacgtg 1847281 ccaccaaagg gggcatcatg atctcgcaac cgaccctgga cctgatcccg caaagtgagt 1847341 tggacgcgct gggcgtcgtg gcccggcggg tgcgtaaacc cgtgtttgcc agcaagccca 1847401 ccggcattcc gcccgacttg gcgatctatc gcatcaagac tgttagcgag tcgacagctg 1847461 ccgataactt cgatgagatg agtcccgatg cacagtagaa cgcgatgatc taccgcgtcg 1847521 cctgcctgct ggcccggatc cggttcaccg tgggctacgt ggcggctctt gcatcggtca 1847581 gcaccaccat cctgatgcat ggtccgcagg tgcacgccca ggtgattcgg catgccagta 1847641 cgaacctgca caacctggcc catggacacc tgggaacgct gtggaacagc gccttcgtca 1847701 tcgacgaggg cccgctttat ttctggttac cctgcttggc gtgtctgctc gcggtcgcgg 1847761 agctgcagct gcgcagcttg cggctgaccg tggcgttcgt cgtcggtcat attggggcga 1847821 cactgttggt ggcggccgtg cttgccgggg cgatcgagat cggctggttg ccatggtcca 1847881 ttagccgggt cagcgatgtc gggatgagct acggtgccct cgcggcgctc ggggcgctga 1847941 ccgcggcaat ccctgggcgg tggcggccgg catggattgg ttggtgggta tcgctgggct 1848001 tggcgactgc gaccatcggc ggtggtttca ccgatgccgg ccacacggtt gcgttgctgt 1848061 tgggcatgtt agtgactgcc tgcttcaccc ggcccgcgcg ctggacactc gggcggtgtg 1848121 ccttgctggc ggtggcgtcg gggttctgct tggtgctgct agcccatagc tggtggagct 1848181 tggtgagtgg gtcggccttg ggtctactcg gggccctggg tgccgccggg tttgcgcgtt 1848241 ggaccagagc gcgcgccaca tcgctgccac ccggcgcgct ggcgattccg cagccggcgc 1848301 taagtcgctg agtcccgcac aacgcgtgcc gagccgggcc gaccgaatca cctatgattt 1848361 gcacttgcgt cacgccgtta gcgggcaagt cgggtacgtc catcagtcca gtttccgctc 1848421 cgcgacgatg cgggcggtcc gaatagcctc gtcagcaagg agagtggcgc cgcgtgggtg 1848481 atccccccct cgagtcgatt gtgtcgatgt tgtcgccgga ggcattgacc acggcggtcg 1848541 acgccgccca gcaggccatc gccctagcgg acaccctgga cgtcctggcg cgcgtcaaga 1848601 cggagcatct cggcgaccgc tcgccgttgg cgctggcgcg gcaggcgctg gccgtgctgc 1848661 ccaaagaaca gcgagccgag gccggtaagc gcgtcaacgc cgcccgcaat gccgctcagc 1848721 gcagctacga cgaacggctg gcgacgctgc gtgccgagcg cgacgcggcc gtgctggtgg 1848781 ccgaaggtat cgatgtcaca ttgccctcga ctcgggtgcc ggccggcgcc cggcacccga 1848841 tcatcatgtt ggccgaacac gtcgccgaca cgttcatcgc gatgggatgg gaactggccg 1848901 aggggcccga ggtggagacc gagcagttca acttcgacgc cctcaacttc cctgccgacc 1848961 accctgcgcg cggcgaacaa gataccttct acatcgcgcc ggaggattcg cggcagctgc 1849021 tgcgcaccca tacctcaccg gtgcagattc gcaccctgct agcgcgtgag ctgccggtct 1849081 acatcatctc gatcggtcgt acctttcgca ccgacgaact cgacgccacc cacacgccca 1849141 tcttccatca ggtggaaggc ctagcggtgg accgcggtct gtcgatggct cacctacgtg 1849201 gaacgctgga cgcttttgcg cgcgccgagt tcgggccgtc tgcgcggacc cggatccggc 1849261 cacacttctt ccccttcacc gaaccgtccg ccgaggtcga tgtgtggttt gccaacaaga 1849321 ttggcggcgc cgactgggtg gagtggggcg ggtgcggaat ggtgcatccg aacgtgttgc 1849381 gggccaccgg cattgatccc gatctctact ccggtttcgc gttcgggatg gggttggaac 1849441 gcaccctgca gtttcgcaac ggcattcctg acatgcgcga catggtcgaa ggcgacgtcc 1849501 gattctcgtt gccgttcggg gtgggtgcct gatgcggcta ccctacagct ggctgcgcga 1849561 ggtggttgcg gtcggcgctt cgggctggga cgttacccca ggcgaactcg agcagacgct 1849621 gttgcgcatc ggccacgagg tcgaagaggt catccccctt ggtccggtgg acggcccggt 1849681 gaccgtgggg cgggtggccg atatcgagga gctcaccggc tacaagaagc cgatccgggc 1849741 ctgcgcggta gatatcggcg atcggcagta tcgcgagatt atttgtggtg caaccaattt 1849801 cgcggttggt gatctggtgg tggtagcgct gcccggtgcc acgctgcccg gtggattcac 1849861 cattagcgcc cgcaaggcct acggtcgcaa ctccgacgga atgatctgct cggcagccga 1849921 actcaatttg ggcgcagacc attccgggat cctggtgttg ccccccggag ccgccgagcc 1849981 cggagctgac ggcgcgggcg tgctggggct cgacgacgtg gtcttccatc tggccatcac 1850041 cccagaccgc ggttactgca tgtcggtgcg cggcttggcc cgcgagctcg cgtgcgccta 1850101 cgacctggac ttcgtcgacc ccgccagcaa ctcgcgggtg ccgccgctac ccatcgaggg 1850161 gccagtctgg ccgctgacgg ttcagcccga gacgggggtg cgccggttcg cgctacgccc 1850221 ggtcatcggg atcgaccccg ccgcggtatc gccctggtgg ttgcagcgcc gactgctgct 1850281 ctgcggtatc cgcgcgacct gtccggcggt cgacgtgacc aattacgtga tgctcgaact 1850341 tggccacccc atgcacgccc acgaccgcaa ccggatcagc ggaaccctcg gagtgcggtt 1850401 cgcccggtcc ggcgagaccg ccgtgaccct cgacggtatc gagcgcaagc tcgataccgc 1850461 cgatgtcctg atcgtcgacg atgctgcgac agcggcgatc ggcggcgtga tgggggcggc 1850521 cagcaccgaa gtgcgggccg actccaccga tgtcctgttg gaggccgcga tatgggaccc 1850581 ggctgcggta tcgcgtaccc agcggcggct gcacctgcct agcgaggccg cccgtcgtta 1850641 cgagcggacg gtggacccgg ccatctccgt ggccgctttg gaccggtgcg caaggctgct 1850701 cgccgacatc gccggggggg aggtttctcc cacccttacc gactggcggg gtgacccgcc 1850761 gtgtgatgac tggtcaccgc cgccgatccg gatgggagtc gatgtgccgg accgcatcgc 1850821 cggggtggcc tatccgcagg gcactactgc caggcgcttg gcccagatcg gcgcggtggt 1850881 gacccacgac ggcgacacct tgaccgtgac cccgccgagt tggcgacctg atctgcggca 1850941 acccgcagac cttgtcgagg aggtgctgcg gcttgagggg ctggaagtta tcccgtcggt 1851001 gctgccaccg gcgcccgcgg gtcgtggact caccgctggg cagcagcgcc gtcgcacgat 1851061 cggcaggtcg ctggcgctgt cgggctatgt cgagattctg ccgactccat ttctgccggc 1851121 cggtgtgttc gatttgtggg ggctggaagc cgatgactca cggcgcatga ccacgcgggt 1851181 gctcaacccg ctggaggccg atcgtccgca actggcgacc acgctgctgc cggccctgct 1851241 ggaagccttg gtgcgcaacg tgtcccgagg gctggtcgac gtcgcgctgt tcgccatcgc 1851301 ccaggtggtc cagccgaccg agcagacgcg cggtgtcggg ttgatcccgg ttgaccggcg 1851361 gccgaccgat gatgagatcg ccatgctgga tgcctcgctg ccccggcaac cccagcacgt 1851421 cgcggcggtg ctggccggac tgcgcgagcc tcgaggcccc tggggcccgg gccgcccggt 1851481 agaggcggct gatgcgttcg aggcggtgcg aatcatcgcg cgcgccagcc gcgtggacgt 1851541 gaccctgcgg ccggcccaat atctgccgtg gcatccgggc cggtgcgcgc aggtgttcgt 1851601 cggggaaagc tcggttggtc acgccgggca gctgcatccc gccgtgatcg agcgctcggg 1851661 tctgccgaaa ggcacctgcg cggtggaact gaacctagat gcgattccgt gcagcgcgcc 1851721 gctgccggca cccagggtgt cgccgtatcc ggccgtgttc caagacgtca gcctggtggt 1851781 ggccgcggac atccccgctc aggcggtggc cgacgccgtg cgcgcggggg caggcgacct 1851841 gctggaggat attgcgttgt ttgacgtgtt caccggcccg cagattggtg agcaccgcaa 1851901 gtcgctgacc ttcgcgctgc ggtttcgtgc gccggatcgc accttaaccg aagacgacgc 1851961 cagcgccgcc cgcgatgccg ctgtgcaaag cgcagccgaa cgggtgggtg ccgtgctgcg 1852021 tggctgaacc gactcagcac gcgttcaacg aaaatttgac gacggcattt cagcgcgccg 1852081 cgtttatacc tcgccgccct gtccgggtag cggcgccgcc ctaaggggca attgcctgcg 1852141 ctagctgtgt gggagcgtag ttcaccaacg cgggaacgat gccgccggcg ggcgtacctt 1852201 cgagcgtaac ggtaaccggc ccgatgaccg gtattaccgc cgtggcctga aacggctgca 1852261 gaggcgcaag aatgccgccg acgggaacct cgaccgtcac cggaatcccc cctgtcgccg 1852321 atgttggcag ggccagcggc agcctggcct cgccattgag gaagccgttg gcgacgttgg 1852381 cgggagcacc gaccagggcc gccgctgccg cctgcaggtt tccggcctgc acggcgctga 1852441 cgaacgctgt cgtgctctcg gcgaatgcga ttgccgtcgt gatcggcgaa cccaccgcat 1852501 taagggtcat cgccagcggc aatccaaacg tcatcacccc ggtcaagttc gtggtatcga 1852561 tcgaaaaggc gatggttgtg tccgtgaccg tcatcaccac attggtgaag ttttgtgaca 1852621 tggcgccggg gatgctcagg atggggaaca ggtctcccac cggcccgagc agcaggatgt 1852681 tcgacaagtc actcgcgtca acaccgctga cgaagacctt caccaccgcc cctaacacgt 1852741 cggtcaccgc gccgctgacg tcgcctgccg cgagggcttg caaggccgat tgcaggctcg 1852801 gcggtatgcc agccagccca atagcgaagt ccctggtggc atctgtcagc gcggttaggg 1852861 tcagctggcc gtagccgaac tggttggcga ggtactgctg caggaacggc gccgggtcgg 1852921 caagccaggt attgccgatg ctcgccaggt tggcgaccgt gttggcgatg aggtcttcgt 1852981 atggcccgag gatgggaaca ctgctgctca ggctagggaa ggccagcgct gccgcgcccg 1853041 gtggaccgga gctgccactt tggccgaaca acaccccgcc ggtgccaccg gtgccgccat 1853101 tacccggggc accggccggg ctgccggcgc cgccggcgcc accggcgcca ccgtcaccac 1853161 cgttgccgat caaggtggcg ttgccgccgt gaccgccgtc gctaccggtg ccgccggcct 1853221 tggtaccgga accggcgccg cccgcgccgc cggttccgcc ggtcccgccg tcgccgaaca 1853281 cttggccggc gttgccgccg tttccgctgg caccgccctt accgccgata ccgttgccgc 1853341 cactgtgatc cccaccggta ccaccggcgc cgccctgccc gccattaccg aacgcgatgg 1853401 cgctgcctcc ggtgccgccg ataccgccgg tgccaccctc aagggcgcca tcggcggtgg 1853461 tgccaccgtt gccgccgttc ccaccggccc caccgttgcc ccatatcagc ccgccggcac 1853521 cgccgtgacc gccggcaccg ccggtaccgc cgggacttgc gaagagggag ccagagttag 1853581 ccccaccagt gccgccgttc ccaccggccc caccgctgcc gagaagcaac gcggtgccgc 1853641 cgctgccgcc ggcaccgccg acaccggagc taaatagcgc tgcagcccca ccggcgccac 1853701 cggccccacc gttgccgcca ttgccgatga agctactgcc cgcggcacca ccggcgccgc 1853761 cggcaccggc gttggcgagt atgttgatag cagccccgcc gatgccaccg gcccccccgt 1853821 tcccgccgtt gccgtagagc agcccgccga cgccgccggc cccgccggcc ccgccggctc 1853881 cgctggtagc gctggccaga tcgctgctcg tccccccctt gccgccgacg ccaccggtcc 1853941 caccgttacc gaacaagctg gcgttgccgc cagcaccccc ggcaccgccg acgccggagt 1854001 cgaacaatgg caccgtcgta tccccaccat tgccgccggc cccaccggca ccgccgttgc 1854061 cgtacagcag gccggcgttg ccgccggccc cgccagcgcc ggcgttcatg ccgacgccca 1854121 acaatgacgt ggcggcgccg ccgtcgccgc cggcaccgcc ggagccccac aggccgacgc 1854181 tgccgccggc cccgccggcc acgccgctac cggtgagacc gctggtgccg ccagcgccgc 1854241 cggcaccgcc attgccgacc agggtattcc cgcccgcacc cccggcggcg acggtgctcg 1854301 atccgccgtc cccgccgttg ccgaacagtg catttccacc tgcaccgcca gccttcgagg 1854361 tgctggaacc accgtccccg ccattgccga acaacccgcc gtccgcgccg gctagccccg 1854421 atccggcccc agcattgccg ccgttaccaa atatcgtccc ggcgtggccg gcggctccgc 1854481 cggaagcccc acttccgccg ttcccgccgt tgccgaacag cagggcgttg ccgccggccc 1854541 caccggccgc agcggcactg cccccgttgc cgccggcacc gccgttgcca tacaacagtc 1854601 cgccggtgcc gccggccccg ccagcgccgc ctgcacctcc aacaccgccg gcgccgccgg 1854661 cgccgccgtt gccgatcaat ccggcggccc cgccgttacc gccggccccg ccgtcgggtc 1854721 cgccggcccc gccgttgccg ccgttgccgt acaaaattcc accgggcccg ccgttgccgc 1854781 cggcatttga cccggtcccc gccactccat cggcgccgtt gccgatcagt gggcgcccca 1854841 gcagcgtctg cgtgggcgca ttcaccgcgt cgagcagggc ctgcatcgac gacacgctgg 1854901 cggcctcggc gccggtatag gccgccgcgc cgccgttcaa caagctcacg aactcggcgt 1854961 gaaacgtcgc cgcccgggcg ttgagcgctt gaaattgctg accgtaggcg ccgaatagtc 1855021 gcgagacagc cgccgacacc tcatcggcgc cggccgatgc cagcgcggtc gtgggggtcg 1855081 atgcggcggc agcggcttcg ctcagtgccg agcgaatacc agctaaattg gcggccgctg 1855141 ctgtgaccaa gtccggctcc acgagtaaga acgacatggc ggtccccctt cgactcggcg 1855201 cagctagtgg acatgtgtca cgggaaattc agcctagttg ggtcttatgt catgtgaggg 1855261 aaaacgcacg ttttcgcgga cgcaacttcg agtcccatcg gcgccgcccg gcggtgtgtc 1855321 aagtcccggc gcagtcaccg cggaatgagt ttgcaaactg ttgcataacg atgcaaaatc 1855381 ggcaggtggc caatgcgacg aaggtggcgg ttgccggtgc cagcggatat gccggtggtg 1855441 agattctccg cctgctgctc gggcatccgg cgtacgccga cggccggctg aggatcggtg 1855501 cgctgaccgc ggcgaccagc gccggcagca cgctcggcga acaccatccg cacctgacgc 1855561 cgctggccca tcgagtagtc gaacccaccg aagctgccgt gctcggtggc catgacgccg 1855621 tcttcttggc cttgccgcac gggcattcgg cggtgttggc gcagcaactg agccccgaga 1855681 cactgatcat cgactgcggg gcggactttc ggctcaccga cgccgccgtc tgggagcggt 1855741 tctacgggtc gtcgcacgcc ggtagctggc cgtatgggtt gcccgagctg ccgggcgcgc 1855801 gggaccaatt gcgcggcacc cgccgcatcg cggtgcccgg ctgctatccg accgcggcac 1855861 tgctggcgct ttttcccgcg ctggccgcag accttatcga gcccgcggtg accgtggtcg 1855921 ccgtgagcgg tacctcgggg gcgggtcgtg cggccaccac cgacttgctg ggcgcggagg 1855981 tcatcgggtc ggcgcgcgcc tacaacatcg ccggcgtcca ccggcacacc cccgagatcg 1856041 ctcaagggct acgcgcggtc accgaccgcg acgtctcggt ctcgtttacc ccggtgctga 1856101 tcccggcctc ccgtggcatc ctggccacct gcacggcacg cacccgatca cccctgtcgc 1856161 agctgcgggc agcctacgaa aaggcctacc atgcagagcc tttcatttat ctgatgccgg 1856221 aggggcagct gccgcgcacc ggcgcggtga tcggcagcaa cgcagcgcac atcgccgtcg 1856281 cggtggacga ggacgcgcag acgttcgtgg cgatcgccgc gatcgacaac ctggtcaagg 1856341 gcaccgccgg cgccgcggtg caatcgatga acctggcgct gggctggccg gagaccgacg 1856401 gcctttcggt tgtgggggtg gcgccgtgac cgacctggcc ggcaccaccc ggctgctgcg 1856461 cgctcagggc gtcaccgccc cggccggctt tcgggccgcc ggcgtcgccg ccgggatcaa 1856521 ggcctccggt gcgctggatc tggcgctggt gttcaacgag ggacccgact acgccgccgc 1856581 cggggtgttc acccgcaacc aggtcaaggc ggcgccggtg ctgtggaccc agcaagtgct 1856641 gaccaccggg cggctgcgcg cggtgatcct caactccggc ggcgccaatg cctgcaccgg 1856701 gccggccggc ttcgccgaca cccacgccac cgcggaggcg gtggccgcgg cgttgtcgga 1856761 ctggggaacc gagaccgggg ccatcgaggt cgccgtctgc tccaccgggc tgatcggcga 1856821 ccggctgccg atggacaagc tgctcgccgg cgtcgcccac gtggtgcacg agatgcatgg 1856881 cgggctggtc ggcggcgatg aagccgccca cgccatcatg accaccgaca acgtgcccaa 1856941 acaggttgcg ctgcaccatc acgacaactg gacggtcggc ggcatggcca aaggcgcggg 1857001 catgctggcg ccgtcgttgg ccaccatgct gtgcgtgctc accaccgacg cggccgccga 1857061 gccggccgca ctcgagcggg cgctgcgccg cgccgccgcg gccacgttcg accggctcga 1857121 catcgacggc agctgctcca ccaacgacac cgtgctgctg ctgtcgtccg gggccagtga 1857181 aatcccccct gcccaggccg atctcgacga ggccgtgcta cgggtctgcg acgatttgtg 1857241 cgcccagctg caggccgacg ccgaaggcgt caccaaacgc gtcaccgtga ccgtgaccgg 1857301 ggccgccacc gaagacgacg cgctggtcgc cgcccgccag atcgcccgcg acagcctggt 1857361 caagaccgcg ctgttcgggt ccgacccgaa ctggggacgg gtgctcgccg ccgtcgggat 1857421 ggcaccgatc accctcgacc cggatcgaat cagcgtgtcg ttcaacggtg ccgcggtgtg 1857481 tgtgcacggt gtcggcgctc ccggtgcgcg cgaggtggac ctgtcggacg cggacatcga 1857541 tatcaccgtc gacctcggcg tcggcgacgg gcaggcgagg atccgaacca ctgatctgtc 1857601 gcatgcctac gtcgaagaga actcggccta cagctcatga gccgcatcga agcactgccc 1857661 acccacatca aagcgcaggt gctggccgag gccctgccct ggctcaagca gttgcacggc 1857721 aaggtcgtcg tcgtcaaata cggcggcaac gcgatgaccg acgacacgct gcggcgcgcg 1857781 ttcgccgccg acatggcgtt tctgcgcaac tgcggcatcc atcccgtcgt ggtgcacggc 1857841 ggggggccgc agatcaccgc catgctgcgg cggctcggca tcgagggcga cttcaagggc 1857901 ggattccggg tcaccacacc cgaagtgctc gacgtggccc ggatggtgct gttcggtcag 1857961 gtgggccggg aactggtcaa cctgatcaac gcgcacggac cgtatgccgt cgggatcacc 1858021 ggcgaggacg cgcagctgtt caccgccgtg cggcgcagcg tcaccgtcga cggcgtggcc 1858081 accgacatcg gcctggtcgg cgacgtcgac caggtgaaca ccgcggcaat gctggatctg 1858141 gttgcggcgg gccggatccc ggtggtgtcc acgctggccc cggatgccga cggcgtggtg 1858201 cacaacatca acgccgacac cgccgccgcg gcggtcgccg aagccctggg cgccgaaaag 1858261 ctgttgatgc tcaccgatat cgacggcctg tacacccgct ggccggatcg cgactcgctg 1858321 gtcagcgaga tcgacaccgg cacactggcg caactgctgc cgacgctgga atcgggcatg 1858381 gtccccaagg tcgaagcgtg cctgcgggcg gtcatcggcg gggtgcccag cgcgcacatc 1858441 atcgatgggc gggtcacaca ctgcgtgttg gtggagttgt tcaccgacgc gggcaccggc 1858501 accaaggtgg tgcgcggatg accggcgctt cgaccacgac ggcgaccatg cggcagcggt 1858561 ggcaagccgt gatgatgaac aactacggca cccccccgat agcgctggcc agcggtgacg 1858621 gcgccgtggt caccgacgtg gacggcagaa cctatatcga cctgctcggc ggcatcgcgg 1858681 tcaacgtgct gggccatcgc caccccgcgg tcatcgaggc cgtcacccgg cagatgtcga 1858741 cgctggggca cacctccaac ctgtatgcca ccgaaccggg catcgcgctg gccgaggagc 1858801 tggtcgcgct gctgggggcc gaccagcgga cgcgagtgtt cttctgcaac tccggcgccg 1858861 aggccaacga ggcggcgttc aagctgtctc ggctcaccgg acgcacgaaa ctggtcgccg 1858921 cccacgacgc cttccacggc cgcaccatgg gctcgctggc gctcaccgga caaccggcca 1858981 agcaaacgcc gttcgcgccg ctgcccggcg acgtcacgca cgtcggctac ggcgacgtcg 1859041 acgcgttggc cgccgccgtc gatgaccaca ccgccgcggt gttcctggaa ccgatcatgg 1859101 gggagagcgg ggtcgtcgtc ccgcccgcgg gctaccttgc cgccgcccgc gacatcacgg 1859161 cgcggcgcgg cgcgctgctg gtgctcgacg aggtgcaaac cgggatgggc cgcaccggag 1859221 cgttcttcgc ccaccagcac gacggcatca ccccggacgt ggtgaccctg gccaagggtc 1859281 tgggcggcgg gctgccgatc ggtgcctgcc tggccgtcgg gccggccgcc gaactactga 1859341 ccccaggcct gcacggcagc accttcggcg gcaacccggt ctgcgccgcg gcggcgctgg 1859401 cggtgctacg ggtgctggcg agcgacggcc tggtccgccg cgccgaagtc ttgggcaaat 1859461 cgttgcggca cggcatcgaa gcgctcggcc acccgctcat cgaccacgtg cgcggacgcg 1859521 gactgctgtt gggcatcgcg ctgaccgccc cgcacgccaa ggacgccgag gccaccgccc 1859581 gcgacgccgg ttacctggtc aacgcggccg cacccgacgt catccggttg gcgccgccgc 1859641 tgatcatcgc cgaagcacag ctcgacggct ttgtcgccgc cttgccggca atcctggacc 1859701 gcgccgtggg ggccccgtga tcaggcattt cctgcgcgac gacgatctgt ccccggccga 1859761 acaggccgag gtgctcgagc tcgcggccga gctgaagaaa gacccggtta gccgtcgtcc 1859821 cctgcaaggg ccgcgcgggg tggcggtcat cttcgacaag aactccaccc gcacccggtt 1859881 ctccttcgag ctgggcatcg cgcagctggg cgggcatgcc gtcgtcgtcg acagcggcag 1859941 cacccagctg ggccgcgacg aaaccctgca ggacaccgca aaggtgttgt cccgctacgt 1860001 cgatgccatc gtctggcgaa ccttcggcca agagcggctg gacgccatgg cgtcggtcgc 1860061 gacggtgccc gtgatcaacg cgctctccga tgagttccat ccgtgtcagg tgttggccga 1860121 cctgcagacc atcgccgaac gcaagggggc gctgcgcggc ctgaggttgt cctacttcgg 1860181 cgacggcgcc aacaacatgg cccactcgct gctgctcggc ggggtcaccg cgggtatcca 1860241 cgtcaccgtc gcggctcccg agggcttcct gcccgacccg tcggtgcggg ccgcggccga 1860301 gcgccgcgcc caggataccg gcgcctcggt gactgtgacc gccgacgccc acgcggccgc 1860361 cgccggcgcc gacgttctgg tcaccgacac ctggacgtcg atgggccagg aaaacgacgg 1860421 gttggaccga gtgaagccgt ttcggccgtt tcagctcaac tcgcgacttc tggcgctggc 1860481 cgactcggat gccatcgtgt tgcattgcct gccggcccat cgcggcgacg agatcaccga 1860541 cgcggtgatg gacgggccgg ccagcgcggt gtgggacgag gccgaaaacc ggctgcacgc 1860601 gcagaaggcg ctgctggtgt ggctgctgga gcgctcatga gccgcgccaa ggccgcgccc 1860661 gttgcggggc ccgaggtcgc cgcaaaccgc gccggccgcc aggcgcgcat cgtggcgatc 1860721 ctgtcgtcgg cgcaggtgcg cagccaaaac gaactggcgg cgctgctggc cgccgagggc 1860781 atcgaggtca cccaagccac actgtcacgc gatctggaag agctcggcgc ggtgaaactg 1860841 cgcggcgcgg acggcggcac cggcatctac gtggtgcccg aggacggcag cccggtgcgc 1860901 ggcgtctcgg gcggtaccga ccggatggcg cggctgctcg gtgagctgct ggtgtcgacc 1860961 gacgacagcg gcaacctcgc ggtgttgcgc accccgccgg gcgcggcgca ctacctggcc 1861021 agcgccatcg accgcgcggc cctgccccag gtcgtcggca ccatcgccgg tgatgacacc 1861081 atcctggtgg tggcccgcga gccgacgacc ggcgcgcaac tggccggcat gttcgagaac 1861141 cttcggtaag gagagtcatg tcagagcgcg tcatcctggc ctattccggc ggtctggaca 1861201 cctcggtggc gatcagctgg ataggcaagg agaccggccg tgaggtggtg gcggtggcga 1861261 tcgacctcgg gcagggcggc gagcacatgg acgtcatacg gcagcgggcg ctggactgcg 1861321 gcgcggtgga ggctgtcgtc gtcgacgccc gcgacgagtt cgccgaaggc tactgcctgc 1861381 ccaccgtgct gaacaacgcg ctgtacatgg accgctaccc gctggtgtcg gcgatcagcc 1861441 ggccgctgat cgtcaaacac ctggtcgccg cggcgcgcga gcacggcggc ggcatcgtcg 1861501 cgcacggctg caccggcaag ggcaacgacc aggtccggtt cgaagtcggg ttcgcctcgc 1861561 tggcaccgga tttagaggtg ttggcgccgg tgcgcgacta cgcgtggacg cgggagaagg 1861621 cgatcgcgtt cgccgaggag aacgcgatcc cgatcaacgt caccaaacgt tcgccgttct 1861681 ccatcgacca gaacgtctgg ggccgcgcgg tggagaccgg cttcttagag cacctgtgga 1861741 atgccccaac caaggacatc tacgcctaca ccgaagaccc cacgatcaac tggggggtcc 1861801 ccgacgaggt gatcgtcggc ttcgaacgcg gcgtgccggt gtccgtcgac ggcaagccgg 1861861 tgtcgatgct ggcggcgatc gaggagctca accgccgcgc cggagcgcaa ggtgtcgggc 1861921 gcctcgacgt cgtggaggat cggctggtgg gcatcaagag ccgcgagatc tacgaggcgc 1861981 ccggcgcgat ggtgctgatc accgcgcaca ccgaactcga acacgtcacc ctggagcgtg 1862041 agctgggccg gttcaaacgc cagaccgacc agcgctgggc cgaactggtc tacgacgggc 1862101 tgtggtactc gccgctgaag gccgcgctgg aggctttcgt cgccaagacc caggagcacg 1862161 tgtccggcga ggtgcggctg gtgctacacg gcggccacat cgcggtcaac ggccggcgca 1862221 gcgcggaatc gttgtacgac ttcaacctgg ccacctacga cgagggcgac agcttcgacc 1862281 agtccgccgc ccgcggcttc gtctacgtgc acgggctgtc ctccaagctc gccgcccgcc 1862341 gggatctgcg gtgacggttc tcccgcgagc agacgcagaa tcgcaccgcc acgcccgtcg 1862401 gcgtgcgatt ctgcgtctgc tcgccacaga aaagtgagca ccaacgaggg gtcgctgtgg 1862461 ggcgggcggt tcgccggcgg cccgtccgac gcgctggccg cgctgagcaa gtccacccac 1862521 ttcgactggg tgctggcccc ctacgacctc accgcgtcgc gggcgcacac catggtgctg 1862581 tttcgggccg ggctgctcac cgaggagcaa cgcgacgggc tgctcgccgg cctggacagc 1862641 ctcgcccaag acgtcgccga cggcagcttc ggcccgctgg tcaccgacga ggacgtgcat 1862701 gccgcgctgg agcggggcct gatcgaccgg gtcggaccgg acctgggcgg ccggctgcgg 1862761 gccgggcgct cgcgcaacga ccaggtggcc gcgctgtttc ggatgtggct gcgcgacgcg 1862821 gtgcgccggg tcgccaccgg tgtgctcgac gtggtcggtg cgctggcaga gcaggccgcc 1862881 gcacacccga gcgccatcat gcccggcaaa acccacctgc agtccgccca gccgatcctg 1862941 ctggcacacc atctgctcgc gcacgcccac cccctgctgc gcgacctgga ccgcatcgtc 1863001 gacttcgaca aacgcgcggc ggtgtccccg tacggctcgg gcgccttggc cggctcgtcg 1863061 ctgggcctgg atcccgacgc gatcgccgcg gacctcggtt tctcggctgc cgcggacaac 1863121 tccgtcgacg cgaccgccgc ccgcgacttc gccgccgagg cggcgttcgt gttcgccatg 1863181 atcgccgtcg acctgtcccg gctggctgag gacatcatcg tctggagctc gacggaattc 1863241 ggctacgtca cgttgcatga ctcgtggtcc accggtagct cgatcatgcc gcagaagaag 1863301 aatccggaca tcgccgagct ggcccgcggc aagtccgggc ggctgatcgg aaacctggcc 1863361 gggctgctgg ccaccctgaa agcccagccc ctggcctaca accgcgacct gcaggaagac 1863421 aaggagccgg tgttcgattc ggtggcccag ctggagctgc tgctgccggc gatggccggg 1863481 ctggtggcca gcctgacctt caatgtccag cggatggcgg agctggcccc ggccggctat 1863541 acgttggcca ccgatctcgc cgaatggctt gtgcggcaag gtgttccgtt taggtccgcg 1863601 catgaggccg cgggtgcggc ggtgcgtgcg gccgaacagc gcggcgtggg gctgcaggaa 1863661 ctcaccgacg acgagctggc cgccatcagc cccgagctga ccccgcaagt ccgcgaggtg 1863721 ctgaccatcg aaggctcggt gtcggcccgc gattgccggg gtggcaccgc gccgggccgg 1863781 gttgccgagc aactgaacgc cattggtgaa gccgccgagc ggctgcgccg ccagctggtg 1863841 cgctgagggg gcctcgaaac tttgccggcc agttccaggc gggctaaact tcgggctcta 1863901 ggcgacccgg ttgaaccatt cggcctcgat gtgcgtgtca aaggggtggg accagtgagc 1863961 gtcatcgcag gtgtgttcgg cgcgttgccg ccgtatcgct attcacaacg cgagctcacc 1864021 gactcgtttg tcagcatccc ggatttcgag ggctacgaag acatcgttcg ccagctgcac 1864081 gccagcgcca aagtcaacag ccgccacctg gtcttgccgc tggagaaata cccgaagctg 1864141 accgacttcg gcgaggcgaa caagattttc atcgaaaaag ccgtggactt gggcgtgcaa 1864201 gccctggcgg gggcactcga cgagtccggt ctgcgacccg aggatctcga cgtgttgatc 1864261 accgccacgg tcaccggact ggcggtgccg tcgctggatg cccggatcgc cgggcggctg 1864321 gggctgcgcg ccgatgtccg gagggtgccg ctgttcgggc tgggctgcgt ggccggggcg 1864381 gccggggtcg cccggctgca cgactacctg cgcggggccc cggacggcgt tgccgcgttg 1864441 gtctcggtcg agctgtgttc actcacgtat ccgggataca agccgacgct gccgggcctt 1864501 gtcggcagtg cgttgtttgc tgacggcgcc gcggcggtgg tggccgcagg tgtgaagcgc 1864561 gcccaggaca tcggcgccga cgggccggac atcctggatt cgcgcagcca tctgtacccc 1864621 gactcgctgc gcaccatggg atacgacgtc ggctcggccg ggttcgagct cgtcctatca 1864681 cgggacttgg cggccgtggt cgagcagtat ctgggcaatg acgtcaccac cttcctggct 1864741 tcgcacggcc tgagcaccac cgacgtcggc gcctgggtca cccatcccgg gggacccaag 1864801 atcatcaacg ccatcaccga gaccctcgac ctgtcgccgc aggctctcga gctgacgtgg 1864861 cgctcgttgg gcgaaatcgg gaatctgtcg tcagcgtcgg tgctgcatgt gctgcgtgac 1864921 accatcgcca aaccgccccc cagcggaagt cccgggttga tgatcgccat gggcccaggc 1864981 ttctgttccg aactcgtgtt gctgcgctgg cactgatgct ggattccgcg agcgtaacgc 1865041 cactgcgcta ttcggatcgc aatctcgcag tgacgttacg ctcggcggac ctcgtgccat 1865101 gaacagcact cccgaagacc tcgtcaaggc cctgcgcaga tcgctcaagc aaaacgagcg 1865161 actgaagcga gagaaccggg atcttcttgc ccggaccacc gagccggtgg cggtggtggg 1865221 gatgggatgc cgctatccgg gtggggtgga ttcgccggag acgctgtggg agctggtggc 1865281 acacggccgt gacgcggttt cggagttccc ggcggatcgc ggctgggatg tggcggggtt 1865341 gtttgacccc gatcccgacg cggtaggcaa gtcgtatacc cggtgcggcg ggttcttgac 1865401 ggatgtcgcc ggttttgacg ccgagttttt cgggatcgca cccagcgagg cgcttgcgat 1865461 ggatccccag cagcggttgc tgttggaagt gtcgtgggaa gcgttggagc gggcgggcat 1865521 cgacccaatc acgttgcggg gttcgcagac gggcgtgttc gccggggtgt tccacggctc 1865581 gtatgggggc caaggccggg tgccgggtga cctggagcgc tacgggctgc gtggctcgac 1865641 gctgagcgtg gcctccgggc gggtggcgta tgtgttgggc ctgcagggcc cggcggtgtc 1865701 ggtggatacc gcgtgttcgt cgtcgttggt ggcactgcat ttggcggtgc agtcactgcg 1865761 cctcggcgaa tgcgacctgg cgctggtcgg tggggtcacc gtgatggcca ccccggcgat 1865821 gttcatcgag ttcagcaggc agcgggcgct gtccgccgat ggtcgttgta aggcctatgc 1865881 gggtgccgcc gatgggaccg cgtttgccga gggcgccggg gtgctcgtgc tggcgcggtt 1865941 ggctgacgcg cgccggttgg ggcatccggt gctggcgctg gtgcgcggat cggcggtcaa 1866001 tcaggacggc gcctccaacg ggctggccac gccgaatggg ccggcgcagc aacgggtgat 1866061 cactgcggcg ctggccagtg cgcggttagg tgtcgccgac gtggatgtgg tcgaggggca 1866121 cgggacgggc accacgttgg gggatcccat tgaggcgcag gcgattttgg cgacgtatgg 1866181 acagcggccg gccgatcggc cgttgtggct ggggtcgatc aaatcgaaca tcggtcatac 1866241 gtcggcggct gcgggggtcg ccggggtgat caagatggtg caggcgatgc gccacggcgt 1866301 gctgcccaag acgttgcacg tggatgtgcc gacgccgcat gtggattggt cggcgggggc 1866361 ggtgtcgttg ttgaccgagc cgcggccgtg gcacgtgccg ggccggccgc ggcgggccgg 1866421 tgtgtcgtcg ttcgggatca gcggcaccaa cgcacatgtg attctggaag aggcaccggc 1866481 agtggaaccg gttggcgcgg cccatggcaa cgacccggtg gcggtgccgt gggtgctgtc 1866541 ggcgaggtcg gcgcaagcgt tgaccaacca ggcgcgacgg ctgttggcct gggtgggcgc 1866601 cgatgagaac gtgcgcccgc tcgatgtggg gtggtcgctg gtcaacaccc ggtcgctgtt 1866661 tgatcatcgg gccgtggtcg tgggcgccga ccgcactcag ctgatggaag ggctgacggg 1866721 tctggcggcc ggcgtgcccg gcgccgacgt ggtggcgggc cgcgcccaga cggtgggcaa 1866781 gacggcattc gtgttcccgg gccagggcgc gcagtggctg ggcatgggag cccagttatg 1866841 tgctaccgca ccggtgttcg ccgaacatat ccatcgctgc gaacgggcgc tgcgtgagca 1866901 cgtggagtgg tcgctgctcg acgtgctgcg cggggcaccc ggcgcaccgg ggctggatcg 1866961 ggtggatgtg gtgcagccgg cgttgtgggc ggtgatggtg tcgctggccg aattgtggcg 1867021 gtcggtgggt gtggttcccg acgcggtcat cgggcattcg cagggggaga tcgcggcggc 1867081 atatgtggcg ggcgccctgt cgctttggga cgcggctgcg gtggtggcac tgcgcagccg 1867141 gttgctggtg cggttgggcg gtgccggcgg catggtctcg ttggcctgtg gccagccgca 1867201 ggccgagaag ttggcgtccc aatggggaga ccgactgaat atcgctgcag tcaatggtgt 1867261 ctcgtcggtc gtgctggccg gcgagacgga tgccgtgacg gagctgatgc agcgatgtga 1867321 ggccgaaggc attcgtgccc gcaggatcga cgtcgactac gcgtcacact cggcgcaggt 1867381 ggacgcgatc cgggaggagc tcatcgcggc gctgcgaggt atcgaacccc gtacttccac 1867441 ggtggcgttc ttctccactg tcaccggcga actcatggat accgccggtg tgaacgccga 1867501 gtactggtac cgaagcatcc gccagccggt gcagttcgaa cgcgccgtcc gcaacgcctt 1867561 cgacggcgga taccgggtgt tcgtcgaatc cagcccccat ccggtcctga tcgccggcat 1867621 cgaagagacg ttggtcgact gtgatcgcgg cgctacgggt gaaccgattg tcattccgac 1867681 gctgggtcgc gatgacggcg gggtgggccg gttttggctg tcggcggggc aggcccacgt 1867741 tgcgggcgtg ggtgttgact ggcgtgccgc gtttgccgac ctgggaggcc gccgggtgga 1867801 gttgccgacg tacgcgtttg cgcgccagcg gttctggcta gacggcctag gtgctgttgg 1867861 cggcgatctg ggtggtgtcg gcttggtggg cgccgagcat ggattgttgg ctgcagtggt 1867921 gcaacggccc gactcgggtg gggtggtgtt gacgggccgg atatcggtgg tcgctgcgcc 1867981 gtggctggcc gatcatgcgg tgggcccggt ggtgctgttc ccgggcacag ggtttgttga 1868041 gttggccttg cgggccggtg acgaggtggg ttgttcggtg ctgcaggagt tgacgttgca 1868101 ggcaccgttg gtgctgccgg cagatggggt gcgggtccag gtggtggtgg gcggcgtcga 1868161 gcagtcgggt actcggaatg tgtgggtgta ttcggctgcc ggccaggcgg attcgagtcc 1868221 gggatggacg ttgcacgcgc agggcgtgtt gggggttggc tcggtgcagc cggccgcgga 1868281 gctgtcggtg tggccgccgg ttggggcacg ggcgatggac gtcgccgacg ggtatcaggt 1868341 gttggcggcg cgggggtatg ggtatgggcc ggcgtttcgg ggtttgcagg ccttgtggcg 1868401 gcggggggcc gaggtgttcg ccgacgtcac tctccctgag ggtgtgccga tacgggggtt 1868461 tgggattcat ccggcggtgt tggatgcggc gttgcatgcg tggggaattg tcgagggtga 1868521 gcagcagacg atgttgccgt tctcgtggca gggggtgtgt ttgcacgcaa gcggggctgc 1868581 gcgggtccgt gtgcgactgg cgccggtggg ccggggggcg gtgccggtgg agttggccga 1868641 tccgcagggg ttgccggtgt tgtcggtgcg gcagttgatg gttcgtccgg tctcagcggc 1868701 cgcgttgtcg aggtcgaccg ccggcgaccg gggattgctg gagatgatct ggacaccggt 1868761 gccgttggag ggcggcgaca ttggcgacga cgccgtggtg tgggagctgc cgcctcacgc 1868821 cggcgcgcag gccggcgggg atgtgctggc agcggtgtac cggggtgtgc acgaggtgtt 1868881 ggaggtgttg cagtcgtggt tggctagcga tgcgaccggt ctgggtgtgg tggtgacgcg 1868941 tggggcggtg ggtccggttg atgacgatgt caccgatttg gcgggtgctg cggtgtgggg 1869001 gttggtgcgc tctgcccagg ctgaacatcc gggccgggtg gtgttggtgg ataccgatgg 1869061 gtcggtcgct gtcgaggatg cggttggttt cggcgcacgc tcgggtgagc cgcagctggt 1869121 ggttcgtcga ggccgggtat atgcggcacg gttggccccg gtagcggccg ggttgacttt 1869181 gccttcggcg tcggctgggg gctggcggtt ggttgccggt ggtgggggga ctttggcgga 1869241 tgtggtggtg gcgcccgttg ctccggtgga gctggcgacg gggcaggtgc gggtggccgt 1869301 gggtgcggtg ggggtcaatt tccgggatgt gttggtggcg ttggggatgt atcccggcgg 1869361 cggggaactg ggtgtcgacg gggcaggggt ggtcgttgaa gtcggcccgg gggtaaccgg 1869421 tttggccgtt ggtgaccggg tgatggggtt attggggctg gtgggttcgg aggcggtggt 1869481 ggatgcgcgg ttggtaacca tggtgccggc gggctggtcg ttggtggagg cagcggccgt 1869541 gccggtggcg tttctgacgg cgttttacgg gctgtcggtg ttggcggagg tcgcggcggg 1869601 gcagaaggtg ttggtgcatg ccggcaccgg cggggttggt atggcagcgg tgtcgttggc 1869661 gcggtattgg ggtgcagagg ttttcgtcac ggcgagtcgc gccaagtggg atacattgcg 1869721 ggcgatgggt tttgacgata tccatatctc cgactcgcga tcgttggagt tcgaggaggc 1869781 gtttctgcgg gccaccgagg gcagcggtgt ggacgtagtg ctgaactcgc tcgccggtga 1869841 gttcaccgat gcctcgctgc ggctactgcc cagcggtggc cgctttatcg agctgggtaa 1869901 aaccgatatt cgcgacgggc agacggtggc cgagcggcat cggggggtgc ggtatcgggc 1869961 gttcgatttg gtcgaagccg gcccagaccg cattgcggcg atgctttccg aggtagtggg 1870021 gttgctagcg gccggagtgt tggcgcggtt gccggtcaag acttttgatg cgcgatgcgc 1870081 cccggcggcc taccggtttg tcagtcaggc ccgtcatatc ggcaaggtcg tgttgaccat 1870141 ccccgatggt ccgggtgggc agtccgggtt ggcggggggc accgtggtgg tcactggggg 1870201 gaccggcatg gccggttcgg cggtggctac ccatttggtc cggcgacatg gggtggccaa 1870261 tctggttctg gtcagccgaa gcggtgagca ggccgacagg gcggcagaag tcgcggccct 1870321 gttgcgcgag ggcggggccc aggtggcggt ggtctcctgt gatgtggctg atcgtgatgc 1870381 gctggcggca ttgttggcgg gtctggatcc gcgctatccg cttaaagggg tgtttcatgc 1870441 cgctggggtg ttggacgatg ccgtgatcac gggcttgaca ccggatcggg tggatacggt 1870501 gttgcgggcc aaggtcgatg gggcctggaa tctgcacgag ctaaccgagg acatggattt 1870561 gtcggcgttt gtggtgtttt cgtcgatggc cgggattgtg ggcacaccgg ctcaggggaa 1870621 ttatgctgcg gcgaatgcgt ttttggacgg gttggtggcc tatcggcgct cgcgtgggct 1870681 ggccggattg tcggtggcgt ggggactgtg ggagcaggcc tcggcgatga cccggcacct 1870741 cggcgagcgg gatcgcgcca ggatgacgca ggccgggctc gctccgctaa ccaccgagca 1870801 ggcgctaggg ttcctggaca ctgcgctgca ggccgatcgc gcggtggtag tggcggcccg 1870861 gctggatcgt gccgcgctgg ccggcgctgg tgctgcgcta ccggcattat tcagccagtt 1870921 ggctgccggt ccgacccggc ggaggatcga cgccgccgat acggcggtgt cgatgtcggg 1870981 cttagtcagc cggctgcatg cgctcacgcc cgagcggcgg cagcgcgaac tcaccgattt 1871041 ggtgatcagc aatgccgcgg cggtgttggg tcgttccagc agtgtcgata tcaacgctca 1871101 caaagcattc caagatctcg ggttcgattc cttgaccgcc gtggagctgc gcaaccgact 1871161 caagaccgcc accgggctca cgttgtcgcc cacgctgatc ttcgactacc ccacgccggc 1871221 cacgctggcc gaacacctcg acagccggct agtcaccgcc agcggtagcg atcaacaaag 1871281 cctgtcagac cgtgttgacg acatcacccg cgagctagtt gtgctgcttg accaacccga 1871341 cttgagcgcc aacgtcaaag cgcacctgcg cacccgcctg caaaccatgt tgaccagcct 1871401 gaccactgaa gacgacgaca tcgccgccgc gaccgaaagc cagcttttcg ccatcctcga 1871461 cgaggaactc ggctcctaac cccccgcaag gaacaccaat gtcgggaacc accacgcatg 1871521 ttgactacct gaagcgtctc acggcagatc tgcggcgcac ccgcagacgc ctgtccgact 1871581 tggaagccaa gttgtccgag ccggttgcgg tggtcggaat gggatgccgt tatccaggtg 1871641 gggtggattc gccggagacg ttgtgggagc tggtggccca gggccgtgat gcggtatcgg 1871701 attttccggc ggatcgcggg tgggatgtgt acgggttgtt tgatcctgac ccggatgcat 1871761 gcgggaagat gtatacccgc cgcgggacgt ttctggagca tgcgggtgac ttcgacgccg 1871821 gattctttgg aatcggtcct agcgaggcgc tggcgatgga cccgcaacag cgcctgctat 1871881 tggaagtgtc gtgggaagcg ttggagcgta cgggaattga cccgaccaag ttgcggggtt 1871941 cggcaacggg tgtgttcgcc ggtgttatcc atgctggcta tgggggccag ctatccggcg 1872001 agctggaagg ctatgggtta acgggttcga cgctgagtgt ggcctccggg cgggtggcgt 1872061 atgtgctggg gttggagggt ccggcggtgt cggtggacac ggcgtgctcg tcgtcgttgg 1872121 tggcgctgca tttggcggtg cagtcgctgc ggtcggggga atgcgatttg gcgctggccg 1872181 gtggggtgac ggtgatggcc acccccgccg cattcgtcga gttcagccgg cagcgggcgc 1872241 tggcgcgcga cggtcggtgc aaggtatacg ccggtgccgc cgacgggacc gcgtggtcag 1872301 aaggcgccgg ggtgctggtg gtggagcggc tggtggatgc acggcggttg gggcatccgg 1872361 tgctggccct ggtgcgcgga tcggcggtca atcaggacgg cgcctccaac ggtttgacgg 1872421 cacccaatgg gccatcccag cagcgggtga ttcgggcggc gttggccagt gcgcgactgc 1872481 gcgcggttga ggtggatgtg gtcgaggggc acgggaccgg gaccatgctg ggggatccga 1872541 ttgaggcgca ggcgcttttg gcgacctacg gtcaggaccg cgttgagccc ctgtggttgg 1872601 ggtcgatcaa atcgaacatc ggtcatacat cggcggcggc gggggtggcc ggggtgatca 1872661 agatggtgca ggcgatgcgg catggggtga tgcccaagac attgcatgtg gatgttccta 1872721 cgccgcatgt ggattggtcg gtgggggcgg tgtcgttgtt gactcaaccg cgggcgtggt 1872781 cggttcacgg ccggccgcgg cgggccgggg tgtcgtcgtt cgggatcagc ggcaccaatg 1872841 cgcatgtgat tcttgagcag gcaccggtag ttgaaagtgt tgtgccagaa gttgcatccc 1872901 caacagcggc gtccgccgtg ccgtgggtgc tgtcggcccg gtcggagcag gcgttggccg 1872961 gtcaggcgca gcggctgctg gctttcgtcg cggccaaccc ggatttggat ccgatcgatg 1873021 tggggtggtc gttggtcaag acgcgggcga tgttcgagca tcgggcggtg gtcgtgggtg 1873081 ctgatcgcgg ggccctgctg gcggggttgg cggcgttggc cgctggtgag tcgggtgcgg 1873141 gcgtggcagt gggtcgagcg cggtcggtgg ggaagacggt gttcgtgttt cccgggcaag 1873201 gggcccaatg ggtaggcatg ggagcgcagt tatatgccga attacccctg ttcgccctgg 1873261 cttttgacgc ggtggccgaa gagctggatc ggcacctgcg gctgccgctg cgaaacgtgc 1873321 tctgggaagg tgacgaggcg ctgttgacta gcaccgagtt cgcccagccg gcgttattcg 1873381 caatcgaagt ggcgttggca acgttgttgc agcactgggg tatcagcccg gatttcctga 1873441 tcggacattc ggtgggcgag atcgcggcag cacatttggc cggggtgttg tcgttgaccg 1873501 atgcggcggg tttggtggct gcccgcggca ggttgatggc ggagttgccc gccggtgggg 1873561 tgatggtggt ggtggccgcc agcgaagaag aagtgctgcc agtgctggtc gacggggcga 1873621 atctcgcggc ggtcaacgcg ccgcactcgg tggtggtttc agggtgcgag gcagcggtca 1873681 gcgatattgc cgatcacttt gcccgcaggg gccgccgggt gcatcggcta gcggtatcac 1873741 atgcgtttca ttcgttgctg atggaaccga tgcttgccga gttcacgcgg atcgctgccg 1873801 gtatttcggt gtcgaaaccg cggattccgt tggtgtccaa tgtgaccggg cagatggccg 1873861 gcgcaggcta cggcgatgga cagtactggg tggagcatgc gcggcgcccc gtgcgatttg 1873921 tcgagggcgt ccagttgctg aatgcggttg gggccacaag gtttgttgag gtgggtcccg 1873981 gcggtggcct gacagcattg gtcgagcagt cgctgccttt aggcgaggcg ctatcggtgg 1874041 cgatgatgcg tagagagcac cccgaagtgt cgtcggtgct cggcgccgtg gcgacattgt 1874101 tcactgcggg tgcccaaatg gattggccgg cggtgtttgg cagtccgggt cgacggatcg 1874161 aattgccgac ctatgcgttt cagcggcagc ggtattggtt gccgcctacg tcggcgggtt 1874221 cggcagacat cagcggtgtt ggtctgctgg cagcccggca tggtttgttg ggtgcggttg 1874281 tggagcaacc ggattcggac gtggtagtac tgaccggccg gctatcggtg ggggagcagc 1874341 ggtggttggc cgatcacgtg atcgctggag tggtgttgct cgccggtgcg gctttcgtgg 1874401 aactggcgct gcgagccgcc gaccaggtgg attgtggggt ggtcgaggag ctgacggtgg 1874461 tgactccgtt ggttttgccg acggtgggcg gggtgcagct acaggtggtg gtgggtgtcg 1874521 gtgagatggg tcagcggcca gtgtcgatat attcacgcaa cgctgagtcg gattccgggt 1874581 gggtgttgca tgcccggggc gtattggggg caaaggcggt tgccccggca gcggatttgt 1874641 cggtgtggcc gccgctgggt gctgccccgg ttgatgtcga tggcgcctat cagcgattcg 1874701 ccgaactggg ctatgaatat ggccgggcgt ttcagggtct gacggccatg tggcggcggg 1874761 aatcggagct cttcgccgat gttgccgtcc ccgacgatgt cgatgtgacg ttgagtgggt 1874821 tcggaattca cccactggtg ctggatgcgg ccttgcatgc aatgggcatg gtgggcgagc 1874881 aggcagctac catgctgccc ttctcctggc aaggggtctc cctgcatgcc gcgggtgcgt 1874941 cccgggttcg ggcgcggatc gcgccggccg gtgatggcac ggtgtcggtg gagttggccg 1875001 atcaggcggg gttaccggtg ttgtcggtac aggcattggt catgcgttcg gtgtcgtctc 1875061 agctgttgtc ggcggccgtc gccgctgccg atgccgcagg tcgcgggttg ttggaagtgg 1875121 cgtggttgcc agtggaattg gcgcacaacg acatcagcgc cgacctcgtg gtctgggagt 1875181 tggagtcttt ccaggacggt gtgggtccgg tgtattcggc tacgcatcgg gtgttggtgg 1875241 cattgcagtc ctggctggcc caggagcggg ccggcggact ggtggtgctg acccaagggt 1875301 cggtcggcca ggatgccacg aacttggccg gcgccgcggt gtgggggttg gtgcggtcgg 1875361 ctcaagccga acatccgggt cgggtgatgt tggtcgattc ggacggctcg atggatgttg 1875421 gagatgtcat tggctgtggt gaagagcaat tgatgatccg gaacggcaca gcctatgccg 1875481 cccggctggc acagcttcga ccacagccga tcctgcagtt gcccgatacc aactcgggct 1875541 ggcggttggt cgccggcggc gcgggcaccc ttgaggattt gacgttggca tcatgccctg 1875601 caaaggaatt ggcacctgga caggttcgaa tagaggtgcg ggctttgggt gtcaatttcc 1875661 gggatgtgtt ggtggcgttg ggaatatatc ccggtgccgc ggagttgggg gccgaagggg 1875721 caggggtggt caccgaagtc ggtccaggcg tgaccggttt agcagttggt gatccggtga 1875781 tgggtctgtt gggggtggcg gggtcggaag cggtggtcga tgcgcggctg gtggtcaagc 1875841 tgccgaaccg gtggccgctg accgatgctg cgggtgtgcc ggtggtgttt ctgacggcct 1875901 actgcgcgtt acgcgtgctg gcgcaggtgc agccgggcga gtcggtgctg gtacacgccg 1875961 ctgcgggcgg ggtgggtatg gcggcagtgc aactggctcg gctgtgggga ttggaggttt 1876021 tcgctactgc cagtcgcggc aagtgggaca cgttgcacac aatgggatgt gacaacacgc 1876081 atgttgccga ttcacgcaca ctggcattcg aggagacgtt ttggctgacc accgagggtc 1876141 gcggcgtgga tgtggtgctc aactcgctgg ccggtgagtt caccgacgca tcgttgcggt 1876201 tactgccgcg aggcggtcgc ttcatcgaga tgggcaaaac cgagttcggg acgcccaggt 1876261 cgttgcccag gaccatcctg gggtggccta ccgggctttc gacttgatgg aggccggacc 1876321 gcagcggatt gcgcagatgc tggccgagtt agtcgagttg ttcaaaactg aagcgctgca 1876381 tcggcttcca gtcaagtcat gggatgtgcg gcacgctcgg gaggcgtatc ggttcttgag 1876441 ccaggcgcgc catgtcggca aagtggtgct gaccatgccg gacgcgtggg ccgcgggcac 1876501 ggtgctgatc accggtggca ctgggatggc aggttctgcg gtggcgcgtc atctggtgag 1876561 tcgatacggg gtgcggcagg tggtgttggc cagtcgtgct ggtgagcaca cggagagcgt 1876621 cgcagcattg gtggacgagc tcggctcggc cggcgcccga gtgcaggtgg tgtcttgcga 1876681 tgtggccgat cgtgatgcgg tggcgggttt ggtggcaagc caaccagatc tgactgcagt 1876741 gtttcatgcg gctggggttc ttgacgatgc ggtaatcacc ggattgacgc cggagcgggt 1876801 ggataaggta ttgcgggcca aggtcgatgg ggcctggaat ttgcatgagc tcacccggca 1876861 cctggatgtg tcagcgtttg tgttgttttc gtcgatggcc gggattgtgg gtgcgccggg 1876921 ccaggccaat tatgctgcag cgaacgcgtt tttggacggg ttggcggcct atcggcgatc 1876981 acgtggactg gccgcgttgt cggtggcgtg gggattgtgg gagcaggctt cggcgatgac 1877041 cgagcattta ggcgagcggg atcgggtccg gatgagtcgg gttggactgg cgccgttgcc 1877101 taccaaccag gcgatgggat tcctggatgc cgcgttgctg gcggatcggc ccgtggtggt 1877161 ggctgctcgg ctggatcgtg ccgcgctggc cggtgccgag ctgccggcac tatttagcca 1877221 gttggttgcc ggtccgatcc gacggatcat cgacggcgcc gatgaggtgt cggggtcggg 1877281 attggcgtcg cggctgcacg ggctgactcc cgagcagcgg caccgcgaac tcaccgagtt 1877341 agtatgtagc aacgccgcga tcgtgttggg gcattccggc actgagatcg acgcgcacaa 1877401 ggcattccag gatctcgggt ttgattcgct gacagcggtg gagctgcgca accggctcaa 1877461 gactgcgacc gggttgacct tgccaccgac cttgatcttt gactacccca cggccgccga 1877521 gttggccgaa cacctcgaca tccagctggc gaacgcccct gccgtcacgg tcgaccaacc 1877581 caacccgtcg actcgtttca acgaggtcac ccgcgaacta caagcattgc tcgaccaacc 1877641 caactggaac cccgacgaca aaacgcgcct gatcaagcga ttgcaagcga ttttgaccga 1877701 ttgcaccgct ccaccggcca gctccggccc gtctaccacc catgacgacg aggacatcac 1877761 caccgccact gaaagccagc tttttgccat cctcgacgac gaacttggac cttagcgcac 1877821 gtgcaaccga caggcatcgc aatcatcggg ctggcatgca ggtttcccac cgtcgtcagc 1877881 cccggcgacc tctgggacct gttgcgcgac gggcgagagg ctactggatc cattgacaac 1877941 gtcgccgatt tcgacgccga ctttttcaac ctatcccccc gcgaggcgag cgcgatggac 1878001 cccaggcaac gactggcgct cgaactcacc tgggaactgc tcgaagacgc tttcgtggtg 1878061 ccggaaacgc tgcgcggaca accgatcgcg gtctacctcg gagcgatgaa cgacgactac 1878121 gcagtactga cgctcgcggc ggaccgtgtt gaccatcacg cgttcgctgg cactagtcgg 1878181 gcaatcatcg caaaccgcgt gtcgtttgct ttcgggctgc gtggaccaag cgtgacgatc 1878241 gactccggtc agtcgtcatc cctggtagcg gtgcatctgg catgcgaaag cgtgcgaaca 1878301 ggcgaagcgc cgctggcgat tgccggtggt gttcacctca acttggcacg cgaaacagcc 1878361 atgctggaac aagaattcgg cgcggtatcg ccgtccggcc atacctacgc attcgatgaa 1878421 cgtgccgacg gctacgtacc aggcgacggc ggtggcctcg ttctgctgaa gccggtgcaa 1878481 gctgccctgg acgacggaga tcgaatccac gcgatcatcc gcggcagcgc ggtcggcaac 1878541 gccgggcaca gcgctaccgg gctgaccgtg ccgtcggtcg ccggccaggt ggacgtcatc 1878601 aggcgggcga tgtccggcgc gggggtggat tgccatcagg ttcactacgt cgaggcacac 1878661 gggaccggca ccaagatcgg cgacccgatc gaggcgcggg cgctgggtga gatcttcgcg 1878721 gcgcggcaac gtcgcccggt gagtgtgggg tcggtcaaga ccaatattgg tcataccggg 1878781 ggagccgctg gaatcgccgg attactcaag gcggtgttag cgattgaaaa tgccgtgatt 1878841 ccacccagcc tcaactacgt cggtgcccca attgatttgg atagccttgg gcttcgggtc 1878901 gacaccgcgt tgacgccgtg gccggtggcg gatgagccgc gacgggctgg ggtgtcgtcg 1878961 tttggcatgg gtgggacgaa cgcgcatgtg atcctggaac agggtccgac gcagtcgcca 1879021 gagatagtgg aatctgttgc cgcagcgggt agtaacgctc cggtggcggt gccgtgggtg 1879081 ttggctgcgc ggtcgccgca ggcgctaacc aaccaggcgg ggcggttgtt ggcgcacctg 1879141 actgccgacg acggcctgac cgcgctcgat gtggggtggt cgttggtgag tacccggtcg 1879201 gtgttcgacc atcgcgcggt ggtggtgggc gctgatcggg ggcgtctgat ggcggggttg 1879261 gcggggttgg ccgccggtga gccgggcgcg ggtgtggtgg tgggtcgtgc gcggtcggtg 1879321 ggcaagacgg tgtttgtgtt tcccggacag gggtcgcagt ggctggggat gggccggcag 1879381 ttgtacggcc ggtactcggt gtttgcccgg gcttttgacg aggtcgttgc ggtgttggat 1879441 gggcagctgc ggctgtctgt gcggcaggtg atgtggggcg ccgatgccgg gctattggaa 1879501 agcacagagt ttgctcagcc ggcgttgttt gtcgtccagg tggcattggc cgcgttgttg 1879561 caagactggg gtgtgctgcc cgatcttgtg atgggtcatt cggtgggtga gattgctgcg 1879621 gcgtatgtgg ccggggcgtt gtcgctggtg gatgccgcgc gggtggtggc ggcgcgcggc 1879681 cggttgatgc aggcgttgcc cgctggtggg gtcatggtgg ccgtagcggc cagcgaagac 1879741 gaagtggcac cgttgctcac cgagggcgtg tgcatcgctg cggtgaacgc gccggaatcg 1879801 gtggtgattt cgggtgagca ggctgccgtg ggtgtggtag tggatcgatt ggtggggttg 1879861 ggtcggcggg tgcggcggtt ggcagtgtcg catgcgtttc attcggtgtt gatggacccc 1879921 atggtcgagg agttctcgaa ggtgctggct gatgtctgcg tgcgggcgcc gcggattggg 1879981 ttggtctcga atgtgacagg tcagctggcc ggtgctgggt atgggtcgcc ggcgtattgg 1880041 gttgaacatg tgcgcaagcc ggtgcggttc ttcgacggtg tgggattggc tgaatccctc 1880101 ggggccaggg tgtttgtgga agtgggtccc ggtgccgggt tggaggcgtc ggtggcgctg 1880161 ctagccaggg atcggcctga ggtggagtcg gtgctggccg gggtggggcg actgttcgcc 1880221 gaaggggtgg cggttgattg gtcttcggtc tttgcgggtt tgggcggccg gcgggtggag 1880281 ttgccgacgt atggatttgc ccggcagcgg ttttggttag gtgacaatgg cgagttgtcg 1880341 gtggaccaga cgggcaaaga cgccggcgca attgcgcgat tgcaaagcct agccccaccg 1880401 gaactgcagc gccagctggt agagttggtg tgcttccatg cagcaatcgt tttgggtcgc 1880461 aagagcagcc atgacatcga ccccgaatgt gctttccaag acttgggatt tgattcaatg 1880521 agcggggtcg aactacgcaa tcgtctccag atggctatcg gtttgcccgg cttgtcgctg 1880581 ccgcgcactt tgatcttcga ctatcccact gcgagtgccc tcgccgaatg ccttggccag 1880641 ctcttaggcg gccaacacga atcatccgac gacgagagta tttggcagct gctgaaaaac 1880701 attcctatcc accagcttcg acgcaccggc ttgctggaca aattgctgct gctggccggc 1880761 cagcccgagg agtccttggc tggtcggacc gtcagcgacg aggttatcga ctcgttaagc 1880821 cccgaagctc ttatcgggct ggcgctcgat gaggacgaga acgatattcg atgacgaaat 1880881 ccgtcctggc aggctcaaat tatgctatcg gcataggtgc aaatacgaca ggcgttgaat 1880941 agcgatgttt ttgcgagatc gcgtaatgtg gcttaaactt tgggcttcga gggtggcaag 1881001 taacttaagt gggcaggggc atgagcgtca tcgcgggtgt gttcggtgcg ttgccgccgc 1881061 atcgctatag ccaaagtgag atcactgatt cgtttgtcga gtttcccggc cttaaggaac 1881121 acgaggagat cattcggcgt ttgcatgccg ccgccaaggt caacggtcga cacctggtgc 1881181 tgccgctgca gcaatacccg tcgctgaccg acttcggcga cgccaacgag atctttatcg 1881241 agaaggctgt cgaccttggc gtcgaggcct tgctgggcgc gctcgatgat gccaacctgc 1881301 gccccagcga catcgacatg atcgccaccg caaccgtcac cggcgtcgcg gtgccgtctt 1881361 tggatgcccg gatcgccggg cggcttggtc tgcgccccga cgtgcggcgg atgccgttgt 1881421 tcggtctggg ctgcgtggca ggggcggcgg gcgtggcccg cctgcgcgac tacctgcgtg 1881481 gcgcgcccga cgacgtcgcg gttctggtct cggttgagct ttgctcgctg acgtatcccg 1881541 cggtcaaacc aaccgtgtcg agtctggtcg ggaccgcact gttcggcgac ggagcagccg 1881601 cggtggtcgc cgtcggcgac cggcgcgccg agcaggttcg cgctggcgga ccggacatct 1881661 tggactcgcg cagcagcctg taccccgact cgctgcacat catgggttgg gatgtcggtt 1881721 cccatggcct gcggctgcgg ctttccccgg acctgacgaa cctgatcgaa cggtacctag 1881781 ccaatgacgt caccacgttt cttgatgccc atcggctgac caaagacgac atcggcgcct 1881841 gggtgagcca tcccggtggt cccaaggtca tcgacgccgt cgccacgagc ctcgcgctgc 1881901 ctcccgaggc gctcgagctg acctggcgct cgctgggcga gatcggcaac ctttcgtcgg 1881961 cctcgatact gcatattttg cgcgacacca tcgaaaagcg gccacccagc ggaagcgccg 1882021 ggctgatgct ggcgatgggt cctggtttct gcacggaact cgtcttactg cgctggcgct 1882081 gacttcctga tttcaacggt caatcccggc caggggcgca gcgcggcaaa gttggccgcc 1882141 cgaatgcggt gagtccgctg agcgggcaac tgcagcatgg ccctggcgac cagccgagcg 1882201 agaatcacgg tcatctcggt ggtggccatg acggctccga tgcatcggtg cagcccgccg 1882261 ctgaacggga tgaattcatg tggcgcgggt ttgcggtagt ccgctgcgtt gggatcccag 1882321 cgcagcggac ggaattcggt tggctcgggc cagatttctg ggagccggtg ggtgacgtag 1882381 gcgctgaaga tcaacaggcg tcccgcccgg atgcgatgcc cgtcgaacca gaggtcacgc 1882441 agcaccctgc gggccgagat cacgccgggc gagtacaggc gcagcgtctc gtgaacaact 1882501 ccgttgaggt aggtgagcgc gctcaggtca tcggcggcgg ggactctgcc acccagcacg 1882561 cgcgcgacct cgctggccgc actctcccag gtgccgggca cggtcagcag tgcgtagatc 1882621 gcccaggcca gcgcgccgct ggtggtctcg taccccgcgg tgatcagcga aacgatcgaa 1882681 tcgcgaatct cgttgtcgct taacgtagta ccctcttcag agcagccact aatcaacgtc 1882741 gtcaacatgt ggtcgtcggg tctgggtgcc gtgcgcgcgt cggcgatctg agcgtcgatg 1882801 aggtcgtcga tgcgtttgcg ggctgccatg gcccgtcgcc acccgggcga gttgacccgc 1882861 tgctgcagcc gcatcacctg aggcggccgt cgggttaggt ccagcagggg ctgcagttgc 1882921 tcaccgagaa aatcggaatg tacggcgagg cgctggccga acagactctc ggcggtactg 1882981 cgccggaccg ccgagcgcaa ctcttggtag atgtccagcc gctgtccggg ctgccaaccg 1883041 tcgatcaccg tgtcgatatt ggacaccatc gttgccacat agcgctggac gtgatggtgc 1883101 cgcagccccg gtgccaccac actgcggcgg cgccggtggt ccgcgccgtc gctgacgatc 1883161 agcgcggtcg gcccgtcgac gggaaccagg ctctcaaacg tttggctcca gctgaacgcg 1883221 tcggcattgg cgaacacgaa tctgttggcc tctgctccca ggagataagt gtagccatgc 1883281 ccaccgactc cggcgttgat cagcggaccg cgccatcgat acagcgccag cagcgcttcg 1883341 ccaagcgggt agcgcaccgt ccgatacgtc ctcattcgag catctccgaa agctccagcc 1883401 agcgattctc catcgccgcg acgtggtctt gcaggacacg tagttgctgg gtcagccggg 1883461 tgatgccgac gtggtcggac tggtcatgct cggccagttc ggtatgtttg gcggccaccc 1883521 ggtcggccag gcgggcgagt tgacggtcga ctgcggccaa ctctttttcg gtggcacgtc 1883581 gctgtgcgcc cgacatcgcc ggcggcgctg gccgctcggc cggtgctggg gcgctaacgc 1883641 gggcagccag ctgcaggtat tcgtcgatgc cgccgggcag gtgccgcaac cggtcatcga 1883701 gaatcgcgta ctgctggtca gtgacccgct cgagcagata ccggtcgtgt gagacgacga 1883761 tcaacgtacc cgcccacgag tcaagcaggt cttcggtcgc cgtcagcatc tcggtgtcca 1883821 cgtcgttggt gggctcgtcg aggagcagca cgttcggctc ggacaacagc gtcagcatga 1883881 gctgcaaccg ccgacgctga ccaccggaga ggtcgtcgac tcgcgcggac agctggtccc 1883941 ggcggaaccc gagacgctct agcagctggg tcggggtaac ctcgcggcct tcgacctgat 1884001 agccgccacg cagcctgcct agcacatcgg cgatccggtc gtcggcaaac ggtgccagat 1884061 cgtccccgtg ctgatcgagc actgccagcc ggacggtctt gccgcgcttg acacgtccga 1884121 caccgggctg gacggtgccg gcgatcaagc ccagcagggt cgacttgccg gcgccgttag 1884181 ccccgacgat gccgatacgt tcacccgggc cgatccgcca ttcgatgtcg cgcaacaccg 1884241 ggcggccccc agaaggctgg tacgagaccg acacgccgag caggtcgacg acgtcctttc 1884301 cgagccgagc ggccgccagc ttggccagct ccacggtgtt gcgcggtggc ggcacgtctg 1884361 cgatcagttg gttggcggcc tcgatccgga acttgggctt gcaggtccgc gccggtgcgc 1884421 cgcggcgcaa ccaagccagc tccttgcgca gcaggttctg ccgcttggct tcggccgcgg 1884481 cggtcagccg gtcccgctcg acgcgctgca gcacgtacgc cgcgtagccg ccttcgaaag 1884541 gttcgacgat tccgtcgtgc acttcccatg ttgtggtggc gacctcgtcg aggaaccagc 1884601 ggtcgtgggt gaccacgagt aggccgccgg tattgcgggc ccagcgccgc cgtaggtggt 1884661 cggcgagcca ggtgatgcct tggatgtcga ggtggttggt gggctcgtcg agagcgatca 1884721 cgtcccattc gccgaccagc aggctggcca gttgcacccg tcggcgctgg ccaccgctga 1884781 gggtgctgac cggggtgtcc caggcgatgt cggataccag gccggcgacc acgtcccgga 1884841 tacgcgggtt gcccgcccat tggtgttcgg gttggtcacc gatgagcgtc cagccgacgg 1884901 tgcggttggg gtcgagggtg tctgtttggc tgagcgcgtt cacccgcaat ccgctacgcc 1884961 gggtgacccg accggagtcc ggccgcagtt gaccggtgag caggcccagc agactggatt 1885021 tgccgtcgcc gtttcgcccg acgatgccga tgcgcgcccc gtcgttgacc ccgagcgtga 1885081 ctgcctcgaa caccacctga gtcggatagg ccaggtgcac ggcctcggct ccgagtaggt 1885141 gcgccatggg gccgacccta gcgtggcgac gatgcgggct gggatgggcc gctgaggagc 1885201 cgcgcggtcg agctctagcg tggcgacgat gcgggctggg atgggccgct gaggagccgc 1885261 gcggtcgagc tctagcgtgg cgacgatgcg ggctgggatg ggccgctgag gagccgcgcg 1885321 gtcgagctct agcgtggcga cgatgcgggc tgggatgggc cgctgaggag ccgcgcggtc 1885381 gagctctagc gtggcggccc agccgcagtg cagttgattg gcggcggggc ttgccgggtg 1885441 gggtggaggt cgttgtaggc gtcgattggg ctgggtgtga ttgaggtgtt tgaacatttc 1885501 gtgggtttgc acccggttga acggggtcaa tgtcacggcg accgggatac tcgaatggac 1885561 gtgccggggc cagccggcaa gctgctcgtg gcggctcggc aggggcgtcg tcggtagctt 1885621 tctccagcca gcccaactgc ggactgactg aatcagtctt gggccaccaa gtcactggaa 1885681 tatgcttggg cacaatacat cttgatgcca tgcagtggcc atggtcgtcg gcgtaccgat 1885741 tggagccggc cgttgctacc acattaatcg gcatcagtgc ctggtgggcg aatggcagcg 1885801 tgaagcaata cgccggtgat ctgactgatc gtgtcgccac gatgacagtt tgccggcgca 1885861 cgccggctcc gcgagtgcat tatcgacagt gacacgtttg gcaggaccca aggaggccga 1885921 gtccatgatt cgtgctgtgt ggaatggaac agtgctcgct gaggcgccgc gaaccgtacg 1885981 ggtggaaggc aaccactact ttccgcccga gtcgctgcac cgcgagcatc taatcgaaag 1886041 cccgaccacg tcgatatgcc catggaaggg tctggcccat tactacaacg tcgtcgtgga 1886101 cggcccctat ggtccggtta acccggacgc tgcctggtac taccgccggc ccagtccact 1886161 ggctcgccgg atcaaaaacc atgttgcgtt ctggcacggt gtgacggtcg aaggtgaatc 1886221 cgagagtcgg catggcttgg cgcgccgggt tgtggcgtgg ctcggcaaat agcggcgtga 1886281 tgccaacggt cggacccgcg gaccacgcgg cgggcctaga tcggcgcgcg acgcctgacc 1886341 agctgccgat atggcgtatc ggcatcatca gtgggctggt cggcatgctg tgctgtgtcg 1886401 ggccgaccat cctggcgttg gttgggatta ttagtgcggc aacggctttc gcgtgggcga 1886461 acgacctcta tgacaactac gcgtggtggt tccgcgtgag cgggctcgcg gtgcttgcca 1886521 ttctggtgtg gtgggcgcta cgacatcgaa accgatgtag cgtcaacgca atccgccggt 1886581 tacggtggcg gctgatggca gtgctggcaa tagcggttgg tacttacggt gtcttgtccg 1886641 ctgtgacgac gtggttcggt acgttcgtat agttgcagta ttagacgaac ggggtcgccg 1886701 gcgacgggtg cagcatgatt tcggagtcgt tggcgcatac tgtccggcca gcgtgccgca 1886761 gtagcaagct acaagccgcc gcggcagcaa gtacggcggc gacggtcagc aacgcgagat 1886821 ggtaagtgcc ggtggcgtct ttgaggtggc cggtggcgta gggaccggcg aagctcgcca 1886881 gactggccac ggcattgacc gtcgcgatgg ccacggcgac ccggggaccg gccagcgcgg 1886941 cggtgcaacg gctccagaaa gcgggcatcg cggcaaggat tccggcgacg gcgatggtca 1887001 gccaactcag cgtcactatc ggtgacatcg gactcaatgc cgcaccgagc gcggcgctgc 1887061 ccgcggccgt tgttggcagt gtgatatggc ccgcttgggc gcccgagcgg tcgatgctgc 1887121 ggtggctcca ggccaacatg gccagcgcgg cgacaccgta cggcagggcc gccaacgtgg 1887181 cagcggtcag cgtggcggtg ccgtgtgcca gcgacgcaac tagttggggc agaaagaact 1887241 gcaacgcata cagcgcgaaa tacaggcccc cgtagacgac agcgaaaagg acaagatccc 1887301 aaccggctcc actcgaccga ccggtcgggg caggggtgtc ctcggtcagc cgggccgaca 1887361 gctctgcacg ttcctcgggg gtgagccagc ttgcccgttg cgggttatcc ggcaacaggc 1887421 gccgaagaag cggcgccagc agcagtgcag gcaatgcctc gatcacaaac attgcccgcc 1887481 agccgggtag cccggccatg tgaacgtggc cgacgatcag cccagacagc ggcaggccga 1887541 ccgtgttggc gaccggaatg gccagcagaa aggtggctac ggcgcgggct cgctgcgcgc 1887601 acggaaacca caccgtcaga tacgcgatga cgccggggaa gaagccgccc tcggcgacgc 1887661 cgagggcgaa gcgcgccaga tacaaggtgt gcgcgctggt gaccaaggcc gtggccgccg 1887721 agcacacacc ccaagccagg acgaccgccg tgagcgttcg accggcaccg aagcgcgcca 1887781 acgccgcgtt ggcgggaacc tggaacagga cgtagccgag gaagaagacg ccggcggcgg 1887841 tgccgtatgc ggtggcgctc aggcgcaggt cggcgttcat cgccagggct gcgaccgaga 1887901 tgttggcccg atcaacgaag ttgatcacat acaacacgaa cagcaggggc aacagccggc 1887961 gcgcggcctt gcccagggca ttgtgcgtgg ggcttgccgc gattgtcgcc acctgcggct 1888021 ccttccgtgg gcctgtcgaa caattgcatc atgaaatgac cccaacccgg tctttgtagt 1888081 ccggcgtgtc actaacacga tcggttatgt cattgcagta aaacggattt ggcgttgcgc 1888141 cggatgtgtt tcgccgtcaa tctcggcgta ggggccggcg aagaacaggc tccggcccgc 1888201 ccgctgtggt ggggcgagca ggatgtcgcg gccgatcgac cacgcgatgt ggttggcctg 1888261 caggttcgcg aacaggccgt gggtgccgta cttcgtcgca caggaggcgt ccgccggtag 1888321 ccaacccagc ccagcgacga agaattccgc ccagcagtgg tagccgcaca cctcgcaatc 1888381 ctgcgcaccg ggctgcggta gctccaaggc ctgaccgagc acaaatcgtg cggggatgtc 1888441 gaccgatcgg cacagcgaga cgaacaatgc gtggatgtcg ttgcagttgc ccaccgagca 1888501 ggtcagggca tgctcggtgc tgcccaggaa agactgcttc gtcgcgtcgt agtccatggc 1888561 gccggtgacg tagtcgtaga tgcgacgggc ctgttcgagc gggttggtct cggggccgac 1888621 gacgtcttgg gccaacgtac gggtgcgctc atcgacatcg acatgtgctt cggggatcaa 1888681 ggcgcggctg aacaattgcg ccgtggccaa cgggcgggcc cgtgccggat ccggagcatg 1888741 cccgatcgcc cggcgttcca caacatagcg gatagaccaa ctcgccgccg tcgccaagcg 1888801 cagccggctg tacaacatca ggttcccgaa ctccggctca cgcgtgaggt catagggatc 1888861 ctcgctggtc acctcgacgt ccagaacgcg ttgaaacgcg ccgtcaccga tgaccgggca 1888921 ccacatctcg acggtgtggg caccttgggt ggaatcgatc gtgatgtgat cggtgatttc 1888981 gaacagcccg atcgtcgcat ccgcgtgtgc ggataccgcg gggtcggtga tcgtcatcgg 1889041 ttagctcctt ccgctgagac tggtttatgt tcgaacaacc ggcagatcgg ctgccagcca 1889101 ttcggagaac ccgccgtcga gtcggcgggc agaaaatccg ttggggcgca acagttctag 1889161 cgcgtcatag gcatacacgc agtaaggtcc tcggcagcag gcgacgatgt cgatgccgga 1889221 cgggagttca tcaagccgct cggccagttc gtcgagggga atgctcactg ccccgggcag 1889281 atgcccggcg gcgtattcca tggccggccg cacgtcgagg accagcaccg acccggcggc 1889341 cacccgagct tgcaactcgt ctcggctgat cggttccagg ctgtctctgt cggtgtagta 1889401 ctgccgcacc agggagccga ccgaggccag attgcgttcg gccacagcgc gcaccgcgcg 1889461 cactacgtcc cacacctgcg gatccgacag tgcgtaaatc acccgtttgc cgtcccggcg 1889521 gctggtcacc aggccggcgc gccgaagttg caacaagtgc tgggaggcat tggcaaacgt 1889581 caaccccgac gcacgagcca gcgcgtccac actgcgttca ccctgcacca gcagatccaa 1889641 cagctccaat cgatggccgc tggacagcgc ttgcccgacc agggcgaact gctcgaagat 1889701 cagcttcttt gcaccggaca tgccgccgct ccattcctcg attcagatgt tcgtatattc 1889761 aattgattgt ttgatcatgt cattccgaca cgctgctgcg gtttcgccgc cggggcgtcg 1889821 caccgttact cggtgccggc tacggcctca cccgcggccg cgggttcgcg accgggccct 1889881 gcgccgcgcc ctcggggtgg gcggaatgtc ctccgcggtc agcaccggtg cattcctgac 1889941 caccgtgtgc ctcgcgcacc tggtgctcgg cgcgcttatg ggtgtactag tgcacgaatt 1890001 cggcgccgac atgctgtcgt tgtggcccgt gggaccggcg ctgtgtcatt gagcccgggc 1890061 gcgtaatccg tgttggtcgg tgatctcgat gaccgcatac ccgacggtga tcaatcggtc 1890121 gcgctcgaac tccttgagaa tcttgttgat cgatgggcgc tgcgctccaa gcattgcggc 1890181 gagggtgcgt tgggcaagtt cgatacgggc atcgattgcc tcgtcgagca ggagctgcgc 1890241 aacctgcgcg ggcagcgggc ggccaagcat gcccattaac cgaatctgcg cagtcgacac 1890301 ccgttgcgcc acactcgaca gccaccgccg tgcgatggcc gggtgggtag ctagcagccg 1890361 ctcgaacgcc tgccggtcca ggaacaggca ggtcgcttgg gtcaaggcgc gccccgtgta 1890421 gaccatcggc atctccagta gcagcgggat gtcgccatcg acatcgccgg gatgaaggat 1890481 gttcaccacg gcgcggcgcc gcctggagcc gaccgcgagc tcaattaatc cgtgtcgcac 1890541 aatccacacc ccgtccgcgg tttgatcggc gtggaatacc actgccccgg gggcaaactc 1890601 cttggcttgt aacgtttcgg ccaatgccga cacatcgtca cggtgcagtg gcgccgagcc 1890661 tccgcgaccg acgcaccgcg caatccaggc tgcctgtcgg acctgggcct cggaaggcgg 1890721 ttggccccca gtcaccgcat gaacgagatg ccgcagcggg cgcaccgacc gatctgccat 1890781 ggcccctcct tgagagcagg cgatgccgtc atcgtgctgc caattgtcag cgcgcgtgga 1890841 ttgcgtgcgg gttggcttgc cctgaatggg aaattagtcg atcgaagaga acacgcaagc 1890901 ccgttctgcg ccccaggcac tctgtcagca cgctgacaaa ccgattcttg gcggagtttt 1890961 gccatcggta tggtattggg gtgcctactc gattggcccg cggtgcgacc gtgccgactc 1891021 gccgtctgca ggacatcaac gatcaaccgg tggacgtccc ggctgcgacc ggaaggacac 1891081 acctgcagtt tcggcggttc gcggcctgtc cgatctgcca cctgcacctg cgcagcttcg 1891141 ccaaccggca ccaagaggtt gcggacagtg gaatcaccga ggtggtgttt tttcattcgg 1891201 cggccgacgc gctgcgcgga taccagtcct tgctaccgtt cgccgtgatc gccgaccccg 1891261 accgagtgca gtaccgcgag ttcggcgtag agaaaagcct gggcgccatc actcatccgc 1891321 gggcattgtg ggctgccgtt cgggggtcgg cggcgatgtt gcatcgcaac gatccggaac 1891381 gggcgggcgt cggattcggt gacggcacaa cgcatctggg attgcccgcc gactttctcc 1891441 tggatgccga tggaactgtc gccgctgtgc actatgggcg tcatgccgac gaccaatggt 1891501 cggtggatca gctcatcgac atcaaccgct cgcttggagg taagggcact cagtgactca 1891561 ttcccgtctg attggcgcac ttaccgtagt cgcaattatc gtcactgcat gtggttcgca 1891621 gccgaaatcc cagcccgcag tggcacctac cggggacgcg gccgctgcca cccaggtgcc 1891681 ggcgggccaa accgttcccg cccagctgca gttcagcgcc aaaacccttg atgggcacga 1891741 ctttcacggg gaaagcctgc tgggtaagcc cgcggtgctg tggttctggg cgccctggtg 1891801 tccgacgtgc caaggcgaag cgccggtagt cggccaggtc gccgcgtcac acccggaagt 1891861 gacgttcgtc ggggtggccg gcctggatca agtacccgca atgcaggagt tcgtcaacaa 1891921 atacccggtg aaaacgttta cccagctggc tgataccgac gggtcggtct gggcgaattt 1891981 cggtgtcacc cagcagcctg cgtacgcgtt cgttgacccg cacggcaacg tcgacgtcgt 1892041 caggggtcgg atgtcgcagg acgaactgac gcggcgcgtc acggcgttaa ccagccgttg 1892101 atcgacgcca cgccggtcgg cttggcgttg gcccacgcag aaatgcctgg ccttcgcgac 1892161 gagttggggc ttgcgccgcg tgtgatactg ccctcatgac gatggctcgg gtgcgtcgcg 1892221 gcacggaact gttgttgtca cctcagtcgc cgccggccac cggcgggctg atcgtgttga 1892281 ccggtctgcg gctgttggct gggttgatct ggctctacaa cgtggtctgg aaggtgccgc 1892341 cggacttcgg tgagcgcggc cggcgggacc tgtatcactt cacgcatctg gcggttgaac 1892401 acccggtgtt cacaccgttc agctgggtga tcgagcatgc cgtgctgccg tacttcacgg 1892461 cattcggttg gggggtgttg ttcgcggagt ccgcgctggc ggtgctgctg ctgaccggga 1892521 cggccgtgcg gctggccgcg ttgatcggga tcgggcagtc ggtcgcgatc gggctgtcgg 1892581 tggccgagtc acccggggag tggccgtggg cgtacgcgat gctgctgggc atccacgtcg 1892641 tcttgctgtt cacctgctcg acccggtacg ccgccgtcga cgcggtgcgc gccgccgcca 1892701 cggggtcggc cgctcggacg gcggcgcagc ggctgctggc cggttgggga atcgtgcttg 1892761 ggctgatcgg acttgtcgcg gtatggcgtg gcctgggcga tgatcgaccc gcctatgtcg 1892821 ggatacgggc gttggagttc tccctcgggg aatacaacct gcgcggcgca ctggcgctga 1892881 tcgcgatcgc gctggcaatg ttggcggccg ccaaacgcgg ctggcgcacc gtcgcgttgg 1892941 tcgcggcggt ggtcgcggtg gccgccgcgg ccgccattta cctgcaagtc ggccggaccg 1893001 cggtgtggct cggcgggacg aacaccaccg cagcggtttt cgtgtgcgcg gcggtggtga 1893061 gtctggcaac cgaattccgg atcggacggg tggaaggggc gtgatggcca caccgggcgt 1893121 tgtgcaggaa gtcgtttccg tcgctgcaga acacgccgag cgggtcgaca ccgactgtgc 1893181 tttcccggcc gaggcggtcg acgccctccg caagaccggc ctgctgggtc tggtgctgcc 1893241 ccgcgagatc ggcggaatgg gttccggacc agtggaattc accgaggtgg tcgcccagct 1893301 gtcggctgca tgtggatcaa cggcgatgat ctatttgatg cacatggcgg ccgctgtcac 1893361 ggtagccgcg tcgcctccgc cgggtctgcc ggatctgttg gcggacatgg cttccggaaa 1893421 acaacttggc accttggcat tcagtgaacc gggttctcgt tcgcacttct gggcgcccgt 1893481 gtccacggcg agcgccgacg gtgacggcat cgcggtgcgg gccgacaaga gctgggtgac 1893541 ctcggcgggg ttcgccgacg tctatgtggt gtccgtcggt tcggccgacg gtgccgcggg 1893601 cgacgtcgac ctctacgcgg ttccggcgga cacaccgggc ctgcgggtag cgggcacctt 1893661 caccgggatg ggtctgcggg ggaatgcctc cgcgccaatg gccgtcgaca ttcgcatccc 1893721 ggattcgtat cgtctcgggg aggccggcgg cggattcggc atcatgatgc aaacggtact 1893781 gccctggttc aatctcggaa atgcggctgt ctcactgggt ttggcgaccg cagccaccgg 1893841 tgccgcggtc aagcacgtcg ggaccgcccg gttggaacac ctcggtggca gcctggccga 1893901 gctgcccacg atccgcgccc agatcgctcg gatgggcacc acgctggccg cgcaaaaggc 1893961 gtaccttgag gtcgccgcca acagtgtcag ctcgcccgac gacaccacct tgacccacgt 1894021 gctgggtgtg aaggcctcgg tcaacgacgc cgcgctgacc atcaccgaat cggccatgcg 1894081 ggtgtgcggc ggggccgcgt tctccaagca tctgcccatc gaacgcgcct tccgcgacgc 1894141 ccgggcgggg tcggtgatgg cgccaaccgc cgacgcgctc tacgacttct acggcagggc 1894201 cgtcaccggg ctgccgctgt tctaggaggc gatatgtcaa ccgaaccgct cgtcgtggga 1894261 gcagtcgcat acacacccaa cgtggtcccg atttgggaag gcatccgcgg ctacttccaa 1894321 gactccgaaa gcccggacac ccaaatggat ttcgtgctct actccaacta cgcgcggctg 1894381 gtcgattcgc tgatcgccgg ccacatcgac atcgcctgga acaccaacct ggcctacgtg 1894441 cggaccgtgc tgcaaaccgg cgggcggtgc acgccattgg cccagcgcga taccgacgtc 1894501 gactacacca ccgtgttcgt tgcacatgcc ggcagcgatc tgcacggcgc taaagacatt 1894561 gccggaaagc gccttgcgct cgggtccgcc gactctgcgc acgcggccat cttgccgctc 1894621 tattatctgc gccgggcggg catcgccgag tctgacctgc aggtgatccg cttcgacacc 1894681 gacatcggca agcacggcga caccggtcgc agcgaactcg acgcggtgga tgcggtgctc 1894741 gccggtgagg ccgacgtggc ggcgatcggc agctccacgt gggccgcgat gggcgccgcg 1894801 gagctgatgg gggagtcgtt gaccgaggtg tggcgcaccg acggctactg ccactgcatg 1894861 ttcaccgcgc tggatacgct gcccgccgaa agataccagc cgtggctcga ccggttgctg 1894921 gggatgagct gggatgactc cgagcatcga aagatcctcg aactcgaggg tttacgacgt 1894981 tgggtgcctc cgcacctgga cggctacaag ccgctgttcg aggccgtgca ggagcagggc 1895041 atcgacccgc gatggtgatc atagagctga tgcgccgggt ggtaggtctc gcacagggag 1895101 ctaccgccga ggtcgccgtc tatggcgacc gagatcgtga tctcgcggag cgatggtgcg 1895161 cgaacaccgg aaacaccctg gtgcgcgccg acgtggacca gaccggcgtc ggcaccctgg 1895221 tggtgcgccg cggccatccg cctgacccgg caagcgtgtt gggccccgac cggctacccg 1895281 gggtccggtt gtggctgtac accaacttcc actgcaacct gtgctgcgac tactgctgcg 1895341 tctcgtcgtc accaagcacc ccgcatcgcg aactgggggc ggagcggatc ggccgaatcg 1895401 tcggtgaagc ggcgcgctgg ggagtgcgcg aactgttcct caccggcggt gagccgttcc 1895461 tgctgcccga catcgacacg atcatcgcga cctgtgtgaa gcagttgccc accaccgtcc 1895521 tcaccaacgg catggtgttc aaagggcggg gtcggcgcgc gctggaatcc ctacctagag 1895581 ggctcgcctt gcagatcagc ctggactcgg ccaccccgga gctgcacgat gcgcaccgcg 1895641 gcgcggggac gtgggtcaag gcagtagctg gtatccggtt ggcgctctca cttggcttcc 1895701 gggtgcgggt ggccgcgacg gttgccagcc ccgcacctgg cgagctgacg gcgtttcacg 1895761 acttcctcga cgggcttggc atcgcacccg gggatcagct ggtccggccg atcgcgctgg 1895821 agggcgccgc gtcgcaaggg gtggcgctca cccgcgaatc gctggttccc gaggtgaccg 1895881 tcaccgccga cggcgtgtac tggcacccag tggccgccac cgacgagcgc gccctggtca 1895941 cccgtaccgt cgaacccttg accccggcgc tggacatggt aagccggcta ttcgccgaac 1896001 agtggacacg agccgccgaa gaggccgcgt tgttcccgtg tgcgtagtgc ccagtctgcc 1896061 ggccgcgaac ccaggattaa ttgctgatga caagtattgc cctactgcac tatagttctg 1896121 cttgcacttg aaaacaacga gccgtgatgc gggtcgtaag ggattccggt aaggaacaca 1896181 gtcaagttct tgcacgcgtc ggcggcagtg ttgcctcaac gcccaaactg caccaaactg 1896241 tttcgcccac ggcggggcgt gtctgagagg tatcgcgtga ccaccgccca taacggatcc 1896301 gctccgcgtt ttcaacgtac ccgctctggc tacgacccgg tcgcagtcaa tcattacatc 1896361 gccgaactcg tgctgcgtca gcaggcgcag cactgtgaga ttgaaacgct caaggcagaa 1896421 atagccagtc tgaaggacga aaacgctgcc ctgaaggaca cctcgccgtc agcacaggcg 1896481 gtgaccgatc ggatggcgaa aatgcttcga ctcgctgtcg acgaggtctt ccagatgcag 1896541 tcggaggcac gggccgaggc cgcaacatta gtttctgcgg ctagggatga ggcggaagcg 1896601 gtccgaacgc agaagcgaga aatgctggcg gatatgaacg cccggcaaag agcgctggag 1896661 tccgagcatg ccgacgtgat gcgccgcgct cgtgaagagg ctgaacagct tgtggcgcag 1896721 gcaaccgccg aggtggagcg gatgcgtgtc atcgatgcca gacgccgtga gaaagccgag 1896781 caggaacttg atgccgaaat catcaggctt cgcaccgatg cccaatttca gatcgacgat 1896841 cagctgcagg ccacacagca ggagtgtgag aagcggcttg gcgaagccaa aatcgaggcc 1896901 gatcgacggc tgcatgttgc cgacgagcag attgagcacg gcctcagcga ggctcggcga 1896961 acgttggaag agatcagcca gcggcgagtc ggcatcctcg aacaactagc gcgtattcac 1897021 gcacagctcg agaatattcc agcgctcctg gaatcggctc gacatagcga gacggagcca 1897081 ctgcagtcca taaacggcgc cgtcgctgag ctacgggcca tttagcgatc gcgtgcctga 1897141 gcgcgactca tctgtgacag ttccgtcacg gctgggtcag gtgccggtgt cctggcgacg 1897201 ccgactgcgc acagaccgaa acagcacggt gtggatgtgc catgatgtgc acgctgtcaa 1897261 ggccagtcgg gtgacgatgc gggccggtgt ggtccgagga ggagcccgac aatttaagct 1897321 agtcagggag ccctcaggag cggtggtgga tctcaatttt tcgatggtca cgcgaccaat 1897381 cgagcgcctg gtggccacgg cgcagaacgg tctggaagtc ctgcgactcg ggggcctgga 1897441 aaccggcagt gttccgtcgc cgtcccaaat cgttgagagc gtaccgatgt acaagctgcg 1897501 gcggtatttt ccgccggaca accgcccggg acagccaccg gtgggtccgc cggtgctgat 1897561 ggtgcacccg atgatgatgt cggcggacat gtgggacgtc acccgtgaag acggcgcggt 1897621 ggggatcctg cacgccagcg ggctagatcc ctgggtcatc gacttcggct cacccgacga 1897681 ggtcgagggc ggaatgcgcc gtaacctggc cgaccacatc gtcgccctca gcgaggcggt 1897741 cgataccgtc aaggacgcca ctggccacga tgtgcacttc gtcgggtatt cgcagggtgg 1897801 catgttctgc tatcaggccg cggcataccg gcgttcgaag gacatcgcca gcgtggtcgc 1897861 gttcggctcg ccggtggaca ccctggccgc gttgcccatg ggcatcccgg cgaacatggg 1897921 cgctgcggtc gccgatttca tggccgatca cgtcttcaat cgcttggata tcccaagctg 1897981 gatggcgcgc atgggttttc agatgatgga cccactcaaa accgcgaagg cccgggtgga 1898041 cttcgtgcgt cagttgcacg accgcgaggc actgctgccg cgggaacaac agcgccggtt 1898101 cctggaatcc gaaggatgga tcgcctggtc gggcccggcg atctcggaac tgctcaagca 1898161 gttcatcgcg cacaaccgaa tgatgacggg tggtttcgcc atcagcggcc agatggtgac 1898221 gcttaccgat atcacttgcc cgatactggc gttcgtcggt gaggtcgacg acatcggcca 1898281 gccggcgtcg gtacgcggca tccggcgggc cgcgcccaac tccgaggtct acgaatgtct 1898341 catccgggca gggcatttcg gtctcgtcgt gggatcccga gcggcacaac agagctggcc 1898401 gaccgtggcc gactgggtgc gctggatctc cggcgacggc accaaaccgg aaaacatcca 1898461 cctgatggcc gatcagccgg ccgaacacac cgatagcggt gtggctttca gctcccgggt 1898521 cgcgcacggc atcggggagg tctcggaggc tgcgttggcg ctggctcgcg gcgcggccga 1898581 cgcggtcgtt gcggccaaca gatcggtgcg cacgctggcg gtggagacgg tgcggacgct 1898641 gccgcgacta gcccggttgg gtcagctcaa cgaccacacc cggatctcgc tgggccgcat 1898701 catcgacgaa caggcacacg atgccccgaa gggtgaattc ctgttgttcg acgggcgcgt 1898761 gcacacctat gaggcggtaa accggcggat caacaatgtc gttcgtggcc tcatcgcggt 1898821 cggggtgcgg cagggtgacc gtgtcggcgt gctgatggag actcggccca gcgcgctggt 1898881 cgccatcgcc gcgctgtctc ggctgggagc ggttgccgtg gtgatgcggc cagacaccga 1898941 cctgtccgcg tcggtccggc tcgggagagt gaccgagatc ctgaccgacc ctaccaatct 1899001 ggatgctgcg cgccagttgc ccggacaggt gctggtgttg ggtggtggtg aatcgcgtga 1899061 tctggatctg ccggccgacg cacttgaaca gggccaagtc atcgacatgg aaaaaatcga 1899121 cccggacgcc gtcgagttgc cggcgtggta tcgaccgaat cccggattgg cgcgggatct 1899181 ggcgttcatc gcgttcagtt cggccgacgg cgacctggtg gccaagcaga tcaccaacta 1899241 ccgctgggcg gtgtcggcct tcgggaccgc ctcgacggcg gccctcggcc gcagagacac 1899301 ggtgtactgt ttgacgccgc tgcaccatga gtccgcactg ttggtcagcc tgggcggcgc 1899361 ggtcgtgggc ggaacccgta tcgcattgtc ccgcggcttg cgcccggacc ggttcgtggc 1899421 cgaggtacgc cagtacggcg tcaccgtcgt ctcctacaca tgggccatgc tgcgtgacgt 1899481 ggtcgacgat ccggcgttcg tgttgcacgg caaccatccg gtgcggttgt tcatcggctc 1899541 gggcatgccg accggattgt gggagcgggt cgtcgaagcg ttcgcaccgg cgcacgtcgt 1899601 cgagtttttc gccaccaccg acggacaggc ggtgctggcc aacgtggctg gcgccaagat 1899661 cggcagcaag ggccgtccgt tgcctggcgc cggacgtgtc gaacttgggg cctacgacgc 1899721 cgaacatgac ctgatcctgg agaacgaccg cggcttcgtg caggtcgccg gtgtcaacca 1899781 ggtcggggtg ctgctcgcac aatccagagg gccgatcgat ccgaccgcgt cggtcaaacg 1899841 cggtgtcttc gctcccgccg acacctggat atctaccgac tacctattct ggcgtgacga 1899901 cgatggggac tactggctgg cgggtggacg cggctcggtg gtgcgcactg cgcgcgggat 1899961 ggtttacacc gagccggtca ccaacgcgtt gggcctcatc accggtgtcg acctcgcggt 1900021 gacctacggt gtattggtgc gcggtcgcca cgtcgcggtg tcggcggtga cgttgctgcc 1900081 tggagcgacc atcacagccg ccgacttgac cgaagccgtg gcgagcatgc cggtggggct 1900141 gggacctgac atcgtgcacg tggtgccgca gctaacgctc agcggtactt accggccaac 1900201 ggtcagcgcg ttgcgggcca acgggattcc caaggcgggc cgtcaggcat ggtatttcaa 1900261 ctccggcggc aacgagtacc ggcggttgac gccggcggtc cgcaccgagt tgaccggcca 1900321 gcatcggcgc ggcaatgctt gacgaggcgc tgctcgccat cctggtgtgc ccggcggatc 1900381 gaggtccgct cgtcttggtc gaggacggcg acatccaggt gctctataac ccgcggctgc 1900441 ggcgcgccta ccgcatcgag gacggtatcc cggttctgct ggtcgacgag gcccgcgagg 1900501 tcgacgagga cgagcacgcc cgcctcatgg cgcgaggtcg tccggcagct ccccagtgag 1900561 gtagcgctgc aggttgggcg cgatggtttg cacgatctgt tcggccggca acgaagcaaa 1900621 cggttcgatt ctgacgatgt agcgcgccat gaccacaccc atcagttgcg acgcgacgaa 1900681 ctgggtacgg atcttgccgg ttcccggcgg gttgtcgacg cgggacccaa gctccacggt 1900741 gaccacttcc tcaaggaagg agcgcgccag gcccacgtcg gagcctgaga tcaaggatct 1900801 cagcgtcgcg atcaacccgg cacccagttc ggaatcccaa atcggcagca acaaggacgg 1900861 cagcttgtaa ccgagttcct cgacaggcgc ctcgcgaatc ggaccgatga tgaccatcgg 1900921 gtcgatcgga atgtggatcg cggcggcgaa aagctgctgt ttggtgccga agtagtgatg 1900981 cactagtgcg gcatcaacac cggccttggc ggccacggct cggatcgatg ttctgtcaat 1901041 gccgttgtgc gcaaagagtt ctcgggcact ggacaggatt cgctccctag tgtcagagct 1901101 gccggcgggt cgcccgggcc gtctgcggct gttgtccggc gccgccacgc tatgacgtcc 1901161 gtcgccgcag tgtcaccgcc gccagacaca gcgacgcgac cgcgaaactc agcacgacga 1901221 cgacgtcgcg caccgcgata ccggtcagct ccggatgcgc acccacctgt tgtagcgcct 1901281 cgagcgcgta gctggccggc atcacgttac tgatccactc cagccacgtc ggcatcagtg 1901341 cccgcgggac gatgatgccg gcgagcagca gctgcggcac catcaccagc gggatgaact 1901401 gtacggcctg aaattcggtg cgggcgaagg cactacacaa tagaccgagc ccgacaccca 1901461 agacggcgtt gacgatcgcg atcgcgaaca cccacaccgg gctgcccgcc gtgtcaaagc 1901521 caaggaacca gaacgccaca atgcaggcca gcgtggcctg cgccgccgcg gcgatcgaga 1901581 acgcggtccc gtagccggcg agcagatcaa gccggcgtag cggggtggtc aggatgcgct 1901641 ccagcgttcc cgaagccctt tcgcgttgca tggtgatcgc cgtgatcaca aacatcacaa 1901701 agagtgggaa caggcccagt agcaccaggc aagcggtgtt gaacccggat ggggtaccgg 1901761 ggcgatgcgg gaagttctcg aacatgaaat acatcagcgt gatgatcagg atgggtacca 1901821 gcaagatcat cgcgacactg cggtgatcag cggcaagctg ccggagaatc cgcgccgtag 1901881 tggccgtgta gttctgcagc gttagccggc cgcgggcacg gtggtggtgc gtcggacgat 1901941 ggacagaaac gcttcctcca gtgatgtgca tccggtttcc tttcgtagac ggtgcggcgt 1902001 tgtgtgggcc agcagctgcc cctggcgcag aagcaacaga tcgccgcagc ggtcggcctc 1902061 gtccattacg tggctggaca ccaacagcgt ggtgccacgc cgcgccagcg ccgtgaaccg 1902121 atcccataat tcgacgcgca ataccggatc caggccgatg gtcggctcgt cgagcactag 1902181 cagatcaggc cggccgacca gcgcacacgc cagcgagacc cgggcccgct ggccgccgga 1902241 caggttggca caacgggcgg tgcggtgatc gcgcaggtcc accgcttcga tcacctcatc 1902301 ggcggcttgc ctgtcgacgc cgcagagttc agcgaagtag cggatgttgt cgatcacccg 1902361 caggtcgttg taaatggtcg ggtcctgagg catgtatcca acccgatggc gtagttcggc 1902421 tgacccagcc ggttggccca gcacgctcac cgaacccgag gcaatgattt gggagccaac 1902481 gatgcagcga atcagtgttg tcttgcccga cccggacgga ccgagcaggc cggtgatcgt 1902541 gccgcaggcg acccggaccg aaacatcctg cagggcaagg cgtttaccac ggatgacgcg 1902601 cagctggtcg atgatgaccg cggggtcggc accgtcgcga agtaattcat cacttgatga 1902661 aatcatcatg tgatgaatat ccgccagtcg tgcgggtttg tcaagggccg gtgcacaatc 1902721 gtctctgatg aacgctgagg aactggcgat cgacccggtc gcggccgcgc atcggctgct 1902781 cggcgcaact attgccggac ggggtgtgcg tgcgatggtg gtcgaggtcg aggcgtatgg 1902841 cggggtgccc gacggtccct ggccggacgc cgcggcgcac tcttaccgcg gccgcaatgg 1902901 ccgcaacgac gtcatgttcg ggcccccggg gcggctttac acctaccgca gccatgggat 1902961 ccatgtctgt gccaacgtcg cgtgcgggcc cgatggcacg gctgccgctg tgctacttag 1903021 ggccgccgcc atcgaggacg gcgccgagct cgccacgtct cggcgcgggc agacggtgcg 1903081 cgctgtcgca ctggcgcgcg gcccgggaaa cctctgcgct gccctcggaa tcaccatggc 1903141 cgacaacggg attgacttgt ttgatccgtc cagtccggtg cggctgaggc tcaacgacac 1903201 gcaccgtgcc aggtcggggc cgcgcgttgg ggtcagtcaa gccgctgacc ggccgtggcg 1903261 attgtggctc acgggtcgac cggaggtgtc ggcctaccgg cgaagctcgc gggcaccggc 1903321 ccggggagcc agcgactaga gtcttgcggg atgtctggca tgatcctcga tgagctcagc 1903381 tggcgcgggt tgatcgcgca gtcgaccgac ctcgacacgt tggccgccga agcacagcgc 1903441 gggccgatga cggtgtacgc cggcttcgat cccaccgcgc ctagcctgca tgccggacat 1903501 ttggtgccgc tgctgacgtt gcggcgcttt cagcgcgccg gtcatcgccc catcgtgctg 1903561 gccggcgggg ccaccggcat gatcggtgat ccacgtgacg tcggcgagcg cagtctcaac 1903621 gaggccgaca ccgtcgccga atggaccgaa cggatccgtg ggcagctgga gcgcttcgtc 1903681 gacttcgacg actcaccaat gggcgcgatc gtcgagaaca acctggaatg gaccggctca 1903741 ctatcggcta tcgagtttct acgtgatatc ggcaagcact tctcggtcaa cgtgatgctg 1903801 gcccgcgaca ccatccggcg gcgtctggcg ggggagggga tctcttacac cgaattcagc 1903861 tacctgttgc tgcaggccaa cgactacgtc gaattgcacc ggcgccacgg ctgcacgctg 1903921 cagatcggtg gtgcagatca gtggggcaac atcattgccg gcgtccggtt ggtgcgccag 1903981 aagctcggtg ccaccgtgca tgcgcttacc gtccccttgg tgaccgctgc cgacggcacc 1904041 aagttcggca aatcaaccgg cggcgggagc ctgtggttgg atccccaaat gaccagcccc 1904101 tatgcctggt accagtactt cgtgaacacc gcggacgcgg atgtgatccg ctacctacgg 1904161 tggttcacct tcttgtcggc cgacgagttg gccgagctgg aacaggcgac agcgcaacgc 1904221 ccgcaacaac gggccgccca gcgccggctc gccagcgagc tcaccgtctt ggtgcatggc 1904281 gaggcggcga ccgcagccgt cgagcatgcc agccgggccc tcttcggtcg gggcgagttg 1904341 gcccgtctgg acgaggcgac actggctgct gcgttgcggg aaaccacggt cgccgaactc 1904401 aaaccgggca gtcccgacgg aatcgtcgac ttattggtgg ccagcggcct gtcggccagc 1904461 aagggcgcgg cgcggcgcac gatccacgag ggtggggtgt cggtcaacaa cattcgggtt 1904521 gataacgagg aatgggtgcc gcaaagttcg gacttcttgc acggccgctg gttagtgcta 1904581 cgtcgtggaa agcggagtat cgccggggtg gaacggattg gctgagccga gccaccacgt 1904641 cctcgacgtc ctcgggtccc aaggtgatat gcgacgtgag cggcccatgg aatatcgctg 1904701 ggcggtaggg gagggccagc gggggatctt atctcgaggg atggggtggg gatgcatcga 1904761 taagcccccc gctgaagcct ggggttcgac ggggatctca gacttggggg gattgggagg 1904821 tgatgagacc cccgtcgaag tctagtgcgt tgacctcact cggcggtgtc gccggcgtgg 1904881 aacaacggga tcgagtacgt ggtctcgctc tcactaaaca gctgtgcgtg tgacaacggg 1904941 tcatcatcct ttcatgtgac aggcgagcgg cgttgcgttg tagtcgattt ccacttcctg 1905001 acttatcttt ggcgggtttg gactccgctg gtatcccacg actagtcggt ggccggggga 1905061 aatgccgaat cccgcatccg gtggatcgtg aagtccacca atcgggggac gatcggcccg 1905121 cggtgccccc ctacccggtt aacgcgcaca cattccacac gaaacgcgtt agtgtgcaaa 1905181 cctttatccc actgtgctgt gaacgtgact cttgttggcc actgttgtcg aggtgcctta 1905241 aatgacgcaa gtgcgacaac aacgagaagc gggagatgac ggcacacaca cacacgacgg 1905301 gacacggacc tggcgaacgg gccggcaggc gacgacgttg ctcgcgttgc tggccggggt 1905361 gtttggtggt gccgcgagct gcgcggcgcc gatccaggcc gacatgatgg gtaacgcatt 1905421 cctgacagcg ttgaccaacg ccggcattgc ctatgaccaa ccggcgacca cggtggcgct 1905481 aggcagatcg gtttgtccga tggtggttgc gccgggcggg acgttcgaat cgatcacgtc 1905541 cagaatggct gagatcaatg gcatgtcgcg tgatatggcg agtacgttca ccattgtcgc 1905601 gattgggacg tattgcccgg cggtgattgc gccgctgatg cctaaccggt tacaggcctg 1905661 atagttacgg ggcgcagcaa cccccgtaac ctctaccgag tggtcgacga caggcaaggg 1905721 cgcaggggcg ggcgacgacc gcgctcggct gccgccgaca accgacctgc gttccgggat 1905781 gggcccgcga ttccgccggg tatccacgcc aggcaactgg cgcccgagat ccggcgcgaa 1905841 ctgagcacct tggaccgtgc cacggccgac gcggtggcat gtcacctggt agctgccggc 1905901 gagttgatcg acgacgaccc agaagccgct ctgcgccacg cgcgggcggc gcgggttcgg 1905961 gccagcagga tcgccgctgt gcgcgaagct gtcggaatcg ccgcctaccg ctgcggcgat 1906021 tgggcgcagg cgttggccga attgcgggca gcccgaagaa tggggagcaa gtcccccctg 1906081 cttgcgctga tcgcggattg cgaacgcggt ctgggccggc cgcagcgggc catcgaattg 1906141 gcgcgcgggt ccgaggcggt cgagctcagc ggtgacgccg ccgacgagtt gcgcatcgtc 1906201 gccgccggcg cgcgcgccga tctcgggcaa ctggagcagg cgttgacggt gttgtccacg 1906261 ccgcagctcg acccgggccg tacgggttcg accgcggcgc gcctgttcta cgcctacgct 1906321 gaaatactgc tggcgttggg ccgtggcgac gaggccctgc aatggttcct acggtccgcg 1906381 gcggcggaca tcgacggcgt caccgacgcc gaagatcggg tagacgagct aggcgcacga 1906441 gaacagaaat gaaaagcatt gcgcaggaac atgactgtct gctgattgac ctggacggga 1906501 cggtgttttg tggccgtcag cccaccggcg gcgcggtgca gtcgttgagt caggtgcgca 1906561 gccgcaagct gtttgtcacc aacaacgcgt cgcgtagcgc cgacgaggtg gcggcgcact 1906621 tgtgcgagct cggcttcacc gcaaccggtg aggacgtcgt caccagcgct cagagcgctg 1906681 cccacctgct ggccggccag ctggcgccgg gtgcgcgggt gctcatcgtc ggcaccgagg 1906741 cgttggccaa cgaagtcgcc gcggtcggat tgcgtccggt acgacgcttt gaggatcgac 1906801 ccgacgccgt cgtacagggc ctttcaatga ccaccggatg gtccgacctt gccgaagccg 1906861 cgctggccat ccgggcgggc gccctgtggg tggcggccaa cgtcgacccc accttgccca 1906921 ccgaacgggg cctgctgccc ggcaacgggt ccatggtggc tgcgctgcgc acggccaccg 1906981 gcatggaccc ccgagtggcg ggcaagcccg cgcccgcctt gatgaccgag gcggtggccc 1907041 ggggcgactt ccgggcggca ctggtggtcg gtgaccggct ggacaccgac atcgagggtg 1907101 ccaacgccgc ggggttgccc agcctgatgg tgctcaccgg ggtcaacagc gcctgggatg 1907161 cggtgtacgc cgaacccgtg cgccggccca cctacattgg ccacgacctg cgctcgttac 1907221 accaggacag caagctgctg gcggtggcac cgcagccggg ctggcagatc gacgtcggtg 1907281 gtggtgcggt aacggtctgc gcgaacggcg acgtcgacga tctggaattt atcgacgacg 1907341 ggctatccat cgttcgggct gtggccagcg cggtatggga ggcgcgggcc gccgatcttc 1907401 accagcggcc actgcgcatc gaggccggcg acgagcgggc ccgtgcggcc ttgcaacgct 1907461 ggtcgttgat gcgcagcgat catccggtga ctagcgtagg aacgcaatga ccatcgatcc 1907521 tgaccagatc cgtgccgaaa tcgacgccct acttgcttcg ctgcccgacc ccgccgacgc 1907581 cgagaacgga ccgtctctgg ccgaactcga aggcatcgca cgtcgtcttt ccgaggcgca 1907641 cgaggtgttg ttggccgccc tggagtcggc ggagaagggt tgagtgcggc gtggcacgac 1907701 gtgcccgcgt tgacgccgag ctggtccggc ggggcctggc gcgatcacgt caacaggccg 1907761 cggagttgat cggcgccggc aaggtgcgca tcgacgggct gccggcggtc aagccggcca 1907821 ccgccgtgtc cgacaccacc gcgctgaccg tggtgaccga cagtgaacgc gcctgggtat 1907881 cgcgcggagc gcacaaacta gtcggtgcgc tggaggcgtt cgcgatcgcg gtggcgggcc 1907941 ggcgctgtct ggacgcgggc gcatcgaccg gtgggttcac cgaagtactg ctggaccgtg 1908001 gtgccgccca cgtggtggcc gccgatgtcg gatacggcca gctggcgtgg tcgctgcgca 1908061 acgatcctcg ggtggtggtc ctcgagcgga ccaacgcacg tggcctcaca ccggaggcga 1908121 tcggcggtcg cgtcgacctg gtagtggccg acctgtcgtt catctcgttg gctaccgtgt 1908181 tgcccgcgct ggttggatgc gcttcgcgcg acgccgatat cgttccactg gtgaagccgc 1908241 agtttgaggt ggggaaaggt caggtcggcc ccggtggggt ggtccatgac ccgcagttgc 1908301 gtgcgcggtc ggtgctcgcg gtcgcgcggc gggcacagga gctgggctgg cacagcgtcg 1908361 gcgtcaaggc cagcccgctg ccgggcccat cgggcaatgt cgagtacttc ctgtggttgc 1908421 gcacgcagac cgaccgggca ttgtcggcca agggattgga ggatgcggtg caccgtgcga 1908481 ttagcgaggg cccgtagtga ccgctcatcg cagtgttctg ctggtcgtcc acaccgggcg 1908541 cgacgaagcc accgagaccg cacggcgcgt agaaaaagta ttgggcgaca ataaaattgc 1908601 gcttcgcgtg ctctcggccg aagcagtcga ccgagggtcg ttgcatctgg ctcccgacga 1908661 catgcgggcc atgggcgtcg agatcgaggt ggttgacgcg gaccagcacg cagccgacgg 1908721 ctgcgaactg gtgctggttt tgggcggcga tggcaccttt ttgcgggcag ccgagctggc 1908781 ccgcaacgcc agcattccgg tgttgggcgt caatctgggc cgcatcggct ttttggccga 1908841 ggccgaggcg gaggcaatcg acgcggtgct cgagcatgtt gtcgcacagg attaccgggt 1908901 ggaagaccgc ttgactctgg atgtcgtggt gcgccagggc gggcgcatcg tcaaccgggg 1908961 ttgggcgctc aacgaagtca gtctggaaaa gggcccgagg ctcggcgtgc ttggggtggt 1909021 cgtggaaatt gacggtcggc cggtgtcggc gtttggctgc gacggggtgt tggtgtccac 1909081 gccgaccgga tcaaccgcct atgcattctc ggcgggaggc ccggtgctgt ggcccgacct 1909141 cgaagcgatc ctggtggtcc ccaacaacgc tcacgcgctg tttggccggc cgatggtcac 1909201 cagccccgaa gccaccatcg ccatcgaaat agaggccgac gggcatgacg ccttggtgtt 1909261 ctgcgacggt cgccgcgaaa tgctgatacc ggccggcagc agactcgagg tcacccgctg 1909321 tgtcacgtcc gtcaaatggg cacggctgga cagtgcgcca ttcaccgacc ggctggtgcg 1909381 caagttccgg ttgccggtga ccggttggcg cggaaagtag cggcgcgccg aaggtgttga 1909441 ctgaattacg gatcgagtcg ctgggcgcca tcagcgttgc caccgctgag ttcgatcgcg 1909501 gctttaccgt gctgaccggg gagaccggca ccggcaagac catggtggtg accgggctgc 1909561 acctacttgg tggtgcccgg gccgatgcaa ctcgcgttcg gtccggtgct gaccgtgccg 1909621 ttgtcgaagg gcgttttact acaaccgatc tcgacgacgc gaccgtcgcg gggctgcagg 1909681 cggttctcga ctcgtcgggg gccgagcgcg acgaggacgg cagcgtgatc gcgttgcgct 1909741 cgatcagtcg cgatggaccg tcgcgcgcct acctcggcgg ccgcggtgta cccgccaaat 1909801 cgttgagcgg tttcacgaac gagctgctta ctctgcacgg gcagaacgac cagctgcggt 1909861 tgatgcgccc tgacgaacaa cgtggtgcac tggaccgctt tgcggccgct ggcgaagccg 1909921 tccagcgtta ccgcaagctg cgggatgcct ggctaacggc ccgacgcgac ctcgtcgacc 1909981 gtcgcaaccg ggcccgggaa ctagcgcaag aggccgatcg gctgaaattc gcgctcaacg 1910041 agatcgacac cgtcgacccg cagccggggg aggacgtggc gttggtcgcc gacatcgccc 1910101 ggctttccga actggacacc ctgcgggagg ccgcgactac tgcacgcgcg acgttgtgcg 1910161 ggacaccaga cgcggacgca ttcgaccgcg gcgccgtcga cagcctcggg cgggcacgtg 1910221 cggcactgca atcgagcgat gatgccgcgt tgcgggggtt ggccgaacag gtcggtgagg 1910281 cgttgacggt ggtcgtcgat gcggtcgccg agctcggcgc ctacctggac gagctgcccg 1910341 ccgacgccag cgcgctggac gccaagctgg cgcgccaagc ccagctgcga acgttaaccc 1910401 gcaagtacgc cgccgacatc gatggcgtgc tccggtgggc ggatgaggcg agggcaaggc 1910461 tggctcaact cgacgtctcc gaagaagggc tggcagcgct ggaacgccgt accggtgagc 1910521 tcgcccacga attaggccaa gccgcagttg atctcagcac gatccggcgg aaggcggcca 1910581 agcggctggc caaggaggtc agcgcggagc tgtccgccct ggcgatggcc gatgccgaat 1910641 tcaccatcgg tgtgaccaca gagctggccg accacggcga tcccgtcgcc ttggccctgg 1910701 cgtcgggcga attggcccgg gccggtgccg atggcgtcga tgcggtcgag ttcggtttcg 1910761 tcgcacaccg ggggatgaca gtgctgccgc tggccaagag cgcatccggc ggcgaactgt 1910821 cccgggtgat gttgtccctg gaggtggtgc tggctacttc gcgaaaacaa gcggctggca 1910881 ccacgatggt gttcgacgag atcgacgccg gcgtcggcgg ctgggctgcg gtacagatcg 1910941 ggcggcggct ggcgcggttg gctcgcaccc accaggtcat cgtggtcacc catctgccgc 1911001 aggtcgccgc ctatgccgat gtgcacttga tggtgcagcg caccgggcgc gacggtgcca 1911061 gcggtgtgcg gcgcctgacc agcgaggatc gggtggccga gctggcacgg atgctggccg 1911121 ggcttggtga ttccgacagt ggtcgcgcgc acgcgcggga gttactcgag accgcgcaga 1911181 acgacgagct cacctagcaa ggctgtgact gaagtgatgt catataactt gtgaggctaa 1911241 tgttacggcg cgcctccacg cacctgccca gcttcaccgc cagaatcccc ccatgaggat 1911301 gtcagcgctt ctgtcccgta acacctcccg gccgggcctg atcggcatcg cccgggtcga 1911361 ccggaatatc gaccgattgc tgcgtagggt ctgtcccggc gacattgtgg ttctcgacgt 1911421 cctggatctg gaccgcatca ccgccgatgc actggtggaa gcggagatcg ccgccgtggt 1911481 aaacgcatcg tcgtctgtct cgggccgcta tccgaacctc ggtccagagg tgttggtcac 1911541 caacggtgtc acgctgatcg acgagaccgg accggagatt ttcaaaaagg tcaaagacgg 1911601 tgccaaggtt cgcttgtatg aaggcggggt gtacgccggc gaccgccggc tgatccgcgg 1911661 taccgagcgt acggatcatg acatcgccga cctgatgcgg gaggccaaga gcgggttggt 1911721 cgcccacttg gaggcgttcg ccggcaacac aattgagttc atccgcagtg aaagcccgct 1911781 attgatcgac ggcatcggga ttcccgatgt cgacgtcgat ctgcggcgtc ggcacgtggt 1911841 gatcgtcgcc gacgaaccca gcggacccga tgacctgaag tccctcaagc cgttcatcaa 1911901 ggagtaccaa ccggtgctgg ttggtgtggg caccggcgcg gacgtgttgc gcaaggcggg 1911961 gtatcgcccg cagctcatcg tcggcgaccc tgaccaaatc agcaccgagg tgctcaagtg 1912021 cggtgcccag gtggtgttgc ccgccgacgc cgatggacac gcgccgggcc tggagcgaat 1912081 ccaggatctc ggtgtcggcg ccatgacatt cccggccgcg ggctcggcga cggatctggc 1912141 cttgttgctg gccgaccatc atggcgcggc gctactcgtc accgccggcc acgctgccaa 1912201 catcgagacg ttcttcgacc gcacgcgtgt gcaaagcaac ccttcgacct tcctcaccag 1912261 actccgggta ggggagaagt tggtggacgc caaggcggtg gccacgctct accgcaacca 1912321 catctcgggc ggcgccatcg cattgctggc actgaccatg ctgatcgcca tcatcgtggc 1912381 actgtgggta tcccgcaccg acggcgtggt cctgcattgg atcatcgact actggaaccg 1912441 attctcactt tgggtgcagc acttggtctc ctaggttttc ttggacggtg ggttcatgat 1912501 ctcgttgcgt caacatgcgg tctcactggc tgcggtcttc ctggcgctgg ccatgggcgt 1912561 agtgttgggt tccggctttt tctccgatac tttgctgtcc agcttgcgta gcgagaagcg 1912621 ggacctctac acgcagatcg accgactcac cgatcagcgg gatgcacttc gcgaaaagct 1912681 cagcgcggca gacaatttcg atatccaagt aggcagccga atagtgcacg acgcgctagt 1912741 cggcaagtcg gtggtcatct tccgcacccc ggatgcccac gacgacgata tcgctgcggt 1912801 gtcgaagatc gtgggacagg ccggcggtgc ggtcaccgca acggtctcat tgacccagga 1912861 gttcgtcgaa gccaactccg ccgagaaact gcgctcagtg gtgaactcgt ccattctgcc 1912921 ggccggtagc cagttgagca ccaaactcgt tgaccaaggt tcccaagccg gcgacctgct 1912981 cggcatcgcc ttgctgagca acgccgaccc ggcggcgccg actgtcgagc aggcgcagcg 1913041 ggacactgtg ctggcggcac tgcgcgaaac cggcttcatc acctatcagc cccgcgaccg 1913101 cattgggacg gcaaacgcca cggtggtggt caccggcgga gcgctctcta cagacgccgg 1913161 caaccagggg gtcagcgtgg ctcggttcgc cgcggcgctg gcgccgcgcg ggtctggcac 1913221 gctgcttgcc ggccgggacg gttcggcgaa ccgacccgcc gccgtcgccg tgacccgcgc 1913281 cgatgccgac atggcggccg aaatcagcac cgttgacgac atcgacgccg agcccggacg 1913341 aatcaccgtg atccttgccc tgcatgacct gatcaacgga ggccacgtgg ggcactacgg 1913401 caccggtcac ggggcgatgt cagtcacggt ttcccagtag gcccgcgtta gggcgtgttc 1913461 cccgcggtga ggcgccgtgg atgttagggt gggtttccgt gggtcggcag gcccagcaag 1913521 gccagagaaa tcttggcagc gtcaagaaca gccctgcccg tcttcacgga ggtcgctcag 1913581 tgcgaaagca cccgcaaacc gctaccaagc acctcttcgt cagcggcggc gttgcttcct 1913641 cgctcggcaa gggactgacc gccagcagcc taggacaatt gttgacggct cgtgggttac 1913701 acgtcacgat gcaaaagctc gacccgtacc tcaacgtcga cccgggtacc atgaacccgt 1913761 tccagcacgg cgaggtcttc gtgaccgagg acggtgccga aaccgatctc gacgtcggcc 1913821 actacgaacg gttcctcgat cgcaatttgc ccggctcagc gaatgtgact accgggcagg 1913881 tgtattcaac ggtgatcgcg aaggagcgcc gcggcgaata cctgggcgac accgtgcagg 1913941 tgatccccca tatcaccgac gagataaaac ggcgcatcct ggcgatggcc caaccggacg 1914001 ccgacggtaa ccgcccggac gtggtcatca ccgaaatcgg gggcactgtc ggcgatatcg 1914061 agtcacagcc cttcctggag gcagcgcggc aagtccggca ctatctcggc cgggaggacg 1914121 tgttttttct gcacgtgtcg ctggtgccct acctggcgcc gtcgggtgag ctcaaaacca 1914181 agccaacaca gcactcggtg gccgcactgc gcagcattgg gattaccccg gacgcgttga 1914241 tcctgcgctg cgaccgcgac gttcccgaag cgctgaaaaa caagattgcg ttgatgtgtg 1914301 acgtcgatat cgacggcgtt atctccaccc cggacgcgcc ctccatctac gacataccca 1914361 aggtattgca ccgcgaggag ctcgatgcgt tcgtggtgcg ccgactcaat ctgccgttcc 1914421 gcgacgtcga ttggaccgaa tgggacgacc tgctgcgccg ggttcacgaa ccacatgaga 1914481 cagtgcgaat tgctttggtg ggcaagtacg tcgaattatc cgacgcttac ctctcggttg 1914541 ccgaggcatt gcgtgccggc ggattcaagc accgggccaa ggtcgagatc tgttgggtgg 1914601 catccgacgg ttgtgaaacg accagtggtg ccgcggcggc gctcggcgat gtgcatgggg 1914661 tgctcattcc gggcggattc ggcatcaggg gcatcgaggg caagatcggc gccattgcat 1914721 acgcgcgggc gcgcgggttg ccggtgttgg ggctgtgcct cggtttgcag tgcattgtga 1914781 tcgaggccgc gcgatcggtc ggtctcacca acgccaattc ggccgaattt gatcccgaca 1914841 caccagatcc cgttatcgcc acgatgcccg atcaagaaga aatcgtggcc ggcgaggcgg 1914901 atctgggcgg taccatgcgt ctcgggtcct accccgccgt gttggagccg gattcggttg 1914961 ttgcccaggc ataccaaact acccaggtgt ccgagcggca tcgccaccgg tacgaggtca 1915021 acaacgcgta ccgagacaag atcgccgaaa gcggcctgag gttttccggg acgtcacctg 1915081 acggacactt ggtagagttc gtcgagtatc cgccggatcg gcatccgttc gttgtcggca 1915141 cccaggccca ccccgagttg aagagccgac ccacccggcc gcacccactg tttgtcgcat 1915201 tcgtcggggc agccatcgat tacaaggcgg gtgagttgct gcctgtcgag atccccgaga 1915261 tccccgagca cacacccaac ggtagctccc atcgggacgg cgtgggccag ccgctaccgg 1915321 aacctgcgtc tcgtggctga gcatgatttc gagacgatat cgtcggaaac cttgcatacg 1915381 ggagccattt tcgcattacg tcgggaccag gtgcggatgc ctggtggggg tattgtgacg 1915441 cgtgaggtcg tcgagcactt gggtgccgta gccattgtgg cgatggacga caacggcaac 1915501 atcccgatgg tttatcagta ccgccacacc tatggtcggc ggctttggga actgcccgcg 1915561 gggttgctcg acgtcgctgg ggagccacct catctcacgg ccgcccggga gctgcgggag 1915621 gaggtcgggc tgcaagccag cacctggcag gtgctggtcg atctggacac cgcgccgggc 1915681 ttcagcgacg aatcggtgcg ggtctatctg gccaccggac tgcgcgaggt gggccggccc 1915741 gaagcccatc acgaagaagc cgacatgacg atggggtggt atcccattgc cgaagcggct 1915801 cgccgggtgc tgcgtggcga aatcgtcaat tccattgcca ttgccggtgt tttggccgtg 1915861 cacgcggtga cgaccgggtt cgcccagcca cgcccactcg ataccgaatg gatcgacagg 1915921 ccaacggcgt tcgccacgcg gagagccgag cgatgaagac gctggcactg caattgcagg 1915981 gctacctcga ccatctgacg atcgaacgag gtgtcgcggc aaacacattg agctcctacc 1916041 gacgtgatct gcgccgctac tccaagcacc tggaagaacg agggattacc gatctggcca 1916101 aggtcggcga gcacgacgtc agcgagttcc tggtggcatt gcggcgcggg gatcctgatt 1916161 ccggcacggc ggcgttgtcc gcggtgtcgg cggcacgggc gctgatcgcg gtgcgcgggc 1916221 tgcatcgctt cgctgccgca gaagggctgg ccgaactgga cgtggcgcgc gccgtccggc 1916281 caccgacgcc gagccggcga ttgcctaaga gcctgacaat cgacgaggtg ctatcgctgc 1916341 tcgaaggtgc gggcggcgat aaaccgtccg acggcccgct gacgctgcga aaccgtgcgg 1916401 tgctggaact gctgtactcg accggggcgc ggatctccga ggccgtcggc cttgacctcg 1916461 acgacatcga cacccacgcc agatcggtgt tgttgcgcgg caagggtggt aagcagcggc 1916521 tggttccggt gggacgcccg gcagtgcacg cgctggacgc ctatctggtg cggggacggc 1916581 ccgacttagc gcggcggggc cgcggaacgg cggcgatctt tctcaacgcg cgcggcggcc 1916641 ggttgtcacg gcaaagcgcg tggcaggttc tgcaggacgc ggccgagcgt gccggcatca 1916701 ccgccggtgt ttcgccgcat atgttgaggc attcgttcgc cacgcatctg ctggagggtg 1916761 gcgccgatgt ccgggtggtg caggaattgc tggggcacgc ctcggtgacc acgacgcaga 1916821 tctataccct ggtcaccgtc catgcactgc gcgaggtgtg ggcgggagct cacccgcggg 1916881 cacgctaagc gatgaccgtc actagcggta gcggttgctg gtcacttggc tcgcccgcga 1916941 cacagaggtt gcgcctctcg ctcatggatc gtcttcgtcg ctgtcgtgca ggagtttttc 1917001 ggggtgaaag taactgttgg tgcggggttg tccatggtcg aggtgggctg ggggaagcca 1917061 ttcggtggtg ccgtctttgc gtttgcgggt gatccagccc ccggtggtgg ccagttggtg 1917121 gtgggggccg cagccctggg tgagttcgtt gatgtcggtt tcttggcatt gggcgaagtc 1917181 cgtcacatga tgcacctcgg tgagatagcc cggtacgtcg cagttgggga acgagcagcc 1917241 gcggtccttg gcgtagagga cgattcgctg tccgggtgag gctagccgct tggtgtgata 1917301 gagggccagc tcgcggccgt ggtcgaagat acgtaggtag tggttggcgt ggctggccag 1917361 ccggatcacg tcgctcatgg gcagcagggt gccgccgccg gtcagcgcgt ggccggcgcg 1917421 tgattgcagt tcggtcaggc tggtggacac gatgatggcc gcgggtagcc cgttgtgttg 1917481 gcccagctcc cccgagcaca gcagggcccg cagcgcggcc agcaggccgt cgtggtggcg 1917541 ttggccggcg ctgcgggtgt cggcctcgat cgcggcctgt gacggggtgc cggccaggca 1917601 gggggtgtca tcggcggggt tggccatgcc gggggcggcc agcttggcca acacggcgtc 1917661 gacggtggcg cgggcttcgg gggtcaggta gccgctgatc gccgacatgc cgtcggggcc 1917721 ttggttgccc aggatgatgc tgcggcggcg ggcgcggtcg gtgtcgttgt agttgccgtc 1917781 ggggttcaaa cagtcggcga gtttggtggc tagtttgtgt agttggtcgg ggcgaaaccg 1917841 gccgcctagg gtggccagct cggcttcggc tttctcccgg gtgggtaggt ccacatggtg 1917901 gggtagctgg tgcaggaagc agcggatgac ctgcacgtgg gcggggccga ggtggccggc 1917961 gcgttgggcg gcggcggtgg cggtcagcaa cgggggcagg ggttggccgg ttagcgtgcg 1918021 gcgtgggccc aggtcggcgg cttcatggat gcgtcgggat gcttcgccgc ggctgatgtg 1918081 tagccgttcg gccagggcga agggtagttt gccgcccagt tcggtttggt cggtttggtc 1918141 ggcgagtttg ttgatgaagg ggtgttcggc ggcgggtagg cgccgacgga tcttttcgca 1918201 gcgctgcagc attgccaggc attccgggat ggtcaggtcg tcaggggaga ccttcaggac 1918261 ccggttaagg gcggtgtcga ggttgtcgaa cgcggcgacg gcctcctccc ggctactcga 1918321 atacatgttc gaatactatc acggttagcc ggccgatgcc atgctgattg tgggttaatc 1918381 caatgtggtg cagttgaatt caggagcatc gccagccgcg aggccacgcc tattcggcga 1918441 gcataatggt cggctcggag acatccagca acatgagcga tgaagacatc acgtgcgatg 1918501 ggtggtcacg gtgggcagct ctgacgcgct gtttcgcgta gtcgacggcg tgcaggtagc 1918561 cccggccttg acacgttccg gcccgctcaa gcgagtagtc cgcggatgtc gtcgacggtg 1918621 ggtacggagc cgaaggcgtt gccgtcgtcg acgacgctgg cgaataggtt tgaggtccag 1918681 cccgaagccg cgggcttgag gctgatgagg aaaaccggcg cgttgcgctc gttgaactgc 1918741 tggatcacat tggggcgtca tcgaggtcga tcgacggata catcagggaa tgcatggccg 1918801 caccgtatcg actcggtctg acagccatcc gcagccacac cgcaaccgca cgcgatgacc 1918861 aatcgacgac taaccgtcga ctaacccagg tattcggact ccaataccaa gtcgggcacc 1918921 agggtctggt attcgaggtg cgtcttgtgc tcaatggtgt tccatgacat gccttgttgc 1918981 cggcgcatat atgcacggta cttcggcgcg ccgggcaccc tgacattgtc ggcaaccacg 1919041 atcgagcccg ggtgcaacca gccccggtct aggatgctct gcagatcggg caggtaagcc 1919101 ttcttgtcat ggtcgaggaa cacaaaatcg agtgtgccag ttgcgaatcc gtgctcggtt 1919161 agcgcgtcca gggtgcgccc accgtcgccg atggtgccga ccacgcacac caccctgtca 1919221 tcgacgccgg catgcgccca tattcgccgg gcgttgctgg cgttggcttc ggcgagttcg 1919281 acggagtaca ccctggcctc cggagcggcc cgggcgatcc gcagcgcgcc gtagccgagg 1919341 taggtgccca actccagcgc caatgccggg tcggcgcgcc gaaccgccgc gtcgagcagc 1919401 gtccctttct cgtcaccgac gttgatgagc atcgacttct cataggcgaa cttgtcgatg 1919461 gtggccagca cgtcgtcgat gttgccggcc ccggcgtggg cgaggacata gtcgacggcc 1919521 gccgcttcgc gtccatcacc gatctggccc gtcgtggtga tattgcggat cccggccgcc 1919581 atccgccaga ccgaccaccg caacggggca atgcgcgctt tgcgaatcat cgctcgctag 1919641 cttacgcaca gatttcgcgg acctgcgggc acctggttca cctgctgaca ctggctcgac 1919701 gacgaccgca cttcggagtt tgggccgcgc gtggattttc attgcaagcc tggccatacc 1919761 gcggccgagc tgctgacgaa ccccgacgac ctggcagtga aaaccaaagc tgcggcggct 1919821 ctgccggcgc tgggtgacga gccaacccac ggcgagcagc acgaaccata gcgggaacca 1919881 cgccaacgcg gttgcggttt cggtttcggt ggtaagtgtc cagatcacga acgcgaaaaa 1919941 caccagcacg gcccagcaca tcaccacgcc accgggcatc ttgtacaccg agtcggtgtg 1920001 acgctgtggg tgtcggcgac ggtagacgag gtagctgatg atgatcattg cccacacaaa 1920061 catgaacagc agggatgaga ccgtcgtgac gagtgtgaat gccccaatca ccgaccgacc 1920121 ggcatagagc agcgggatgg aggtcagcag tagcggagcc gtcagcagca gggcgggtgc 1920181 gggcacgccg ccgcgattga gttggtggaa agcggccgga gcgtggcctt cgtcggcgag 1920241 gccgaaaagc attcgcccgg tggagaagaa gccggagttc gctgacgagg ccgctgcggt 1920301 gaccacgacg aagttgacga ccgacgccgc agcggcaagt ccggctaggg agaacatcgt 1920361 cacaaacggg gactcgccac tggcgaactg ccgccacggc acgacggcca ggatcgccag 1920421 cagggcaccg atgtagaaca ccgcgacccg caacggcacg gcattgatcg cgcggggaag 1920481 ggtgcggcgc gggtccgctg tctcagccgc ggcggtgcca acgagctcca caccgatgta 1920541 tgcgaaaaac gcgatctgaa agccactgac cacgcccagg aaacccgttg ggaagaaccc 1920601 gttgtcgttc cacaggttct cgatggtcgc gtgcacacca tgaggggaga cgaagttggt 1920661 tgccaccagg atcgcgccga cggcgatgag gcacacgatg gcagcgacct tgatcaatgc 1920721 gaaccaaaac tccagctccc cgaagtggcg gacgctgaac aaattgacag cgagaatcag 1920781 ggcgaccgtg accagggccg ggacccagat tggcaagccg ggccaccaaa acctggcata 1920841 gccggtgatc gcgacgaggt ctgcgatccc ggtgaccacc catgcgaacc agtacgacca 1920901 ccccacgaaa aagcccgccg ccgggcccag gaggtcggcg gcgaagtcaa cgaacgactt 1920961 gtagttcagg ttcgacagca gcagctcgcc catcgcgcgc aacacaaaaa acacaaaaaa 1921021 cccaatgatc ccgtagacca ccatgaccgc cggaccggcg agcgagatcg ttcgcccaga 1921081 tcccatgaat aggccggtgc cgatcgcgcc tccaatcgcg atcaactgaa tatggcggtt 1921141 ggcaaggtcc cgacgcaggt gcggctgggt gtctgtcggg tcggcagccg cgatatcgtc 1921201 cggcatatat ggcgtcctca agttctgggg tagggaaggc ctcgcgttat ccggcaaacg 1921261 gcggccggga catcaccgta acccggaacc cgtagcgggg acccgcaccc cccgtaccgg 1921321 tgcccgaacc ggctagcggc atgccgccca acaggtttcc cgccgcaccg gcctccggtt 1921381 gctcgacgat atcgctgacc aggggtgcgg aggccgaacc cacggtcggg gctaggctcg 1921441 gactggcccc tgcccagttg ggcggcaccg ataacttgcc gatggtggcc gcgttgccta 1921501 gacccgcgga tacgggtccg gtaccgccaa ccgcggcgcc gacggccgcc ggcgccgcgg 1921561 cggcagcttc ggcggcctcg ggaccgatcc atcccagcgc ccgccacgat gtaataaggc 1921621 tgttgccaat accaatggcg aaatatggca aacccacggt gttgtaaaac agctgtgata 1921681 tcggcagata ccagttgatg aaccattcca gccaccccgg ggtcgcggcg gcggtcaacg 1921741 cggacgacag gggcgaggtg aggcccagca gcgtgttggg caagtgggcg atcagctccg 1921801 ctattgcgct ctgcgccgcg ccggctgagg tgccggcggc tttggcgact gcggacaact 1921861 gcgtcgccgc ggcggatggg ctggtggtgt tcggcggcgg ggcaaacggc gtcactttgg 1921921 tcgcggtcgc cgaggagccc gcgtaaccgt acatggccat ggcgtcttgg gcccacattt 1921981 cagcgtattg agcttcggtg gccgcgattg atgcggtgtt ttgaccgaac acgttatgcg 1922041 tgaccagcga cgtgagccgc gcgcgattgg ccgcgatcag cggcgggggc acaatggcgg 1922101 caaacgcggt ttcgtaagcg gccgccgccg cacgcgcctg actggctgcc tgctcagctt 1922161 ggatggcggt ggctcgcatc cacgccacat acggggcgac cgcttcgacc atcaacgtcg 1922221 acgccggacc cagccattct tcggtttgca gcgtcgtgat cacccgctcg tagccgacgg 1922281 cggccacact gagctcggcg gccagcccgt tccacgcgga cgctgcggca accatcggtg 1922341 ccgagcccgg gccgcaatac atgcgcccgg agttcacctc cggtggcaac gccccaaaat 1922401 ccatcgctat gaactcctta cctcgtcacg ggttttcggt gggctatccg acgttcggcc 1922461 ggtcagccat cacggtgagt cgtcttccat atcggcgtcc catatgggcg gcgcgactcc 1922521 tgcccggagt cggtgccccc cggagtagga ccgatgtttc agccgcctcg gcggcgctgc 1922581 gaataccggg aatcgatcgc gcgacggttt gcgcctgggg cgtggcgggc ggtgcggacc 1922641 agcccggcgg caccgacatc ggtccgatct tggcggccag agtcgcgctc gccgccaccg 1922701 gtccagctcc cagctgcgac cacgccgacc atccagcggc tccacccgcg ccggccgctg 1922761 cctcggccgc accggcttct gccaatgcgg tgctccacaa catcccgccg acgaactgca 1922821 gagcattcag cgttaaccca ccgctgtcgt aaatgaaccc ttcggcagtg gcgagggcgc 1922881 ccaggaacat catccagtat ttctgtatgt cgctccatgg aatcgccgcg gcccagctgt 1922941 gctgcccccg cagcgcggcc accgctgttg cgtggccgac gaggccggtc gcgttggtgg 1923001 tttgcggcgg tggtgcgaac ggagtcaaaa ccgtggcggg tgccgcggcg ctggcatagc 1923061 cgtacatcgc ggcggcgtct tgggcccaca tctcggcgta ttgggactcg gtggtggcga 1923121 tcgccggcgt gttttgcccg aaccagttgg tatcgacgag cgtcatcaac aaggtccggt 1923181 tggccgcgat cgccggcggg ggcaccgtca tggcgaaggc ggcttcaaag gccgctgcgg 1923241 ccgccctagc ctgcatcgcg gcctgttcgg ctagcgtcgc ggtggtactc agccagccga 1923301 caaagggcag gacggcggcc accatcgaat ccgatgccgg ccccgaccac caccgcatgt 1923361 ttgtcagctc cgagatcgcc gcaccgtagc cagtcgctgc cgacgacaac tctgcggcca 1923421 gcccgtccca ggccgccgcg gcagccatca gtggcccgga tcccggaccg ctatacatac 1923481 gacccgaatt gatctcggga ggtaacgccc caaagttgga cagggaatgc ccggcgatgc 1923541 cgtcagcaac ggcggtgacc ccaacaaggc agcaggcgac gctgcccggg gggacatgcc 1923601 cctggttgac cgggacatcg agggtcatcg aaaaccgcct cgttatgggt gggctggctc 1923661 gacaccgtcg tcgatacgat agctatgact agggcaacag tgacctagca cgttaatctc 1923721 cataagagat cttctgcaaa aaaggtttcg gccgtgtgac gcgcgtgtta ataccccata 1923781 ggggtataat cgttactgtt ggcaacgtct ggcgtcctgg ctcgggcgac acaccgtccc 1923841 gatacatgtc agcaaccggg tcgatcgtgg tgaatgcaca ggcgggcaag gcgaatgccg 1923901 atgcgacccc gacgaagtaa gagggtacgt aatcgataca ccatggggac atttgccctc 1923961 catggcctca cccatcgcct accgtcggcc tcgttgcaga cgacggctgc ccgccacccg 1924021 gatgtgacgc aattctcaat gcctgggcac taccgataac gccgacctgc cgcagctcgc 1924081 gcatgtggac gctgaaagcc cggaaggagc acaccggcat atccggcaag cccaccgcac 1924141 ggaccgatcg ccatggctct actcggtccg gagattctga gctacaagct agtgcgcggc 1924201 gtttttctcg attgccggat cgctgtggcg ctcagggcgt tacgtgaaag gttcggcagc 1924261 ggtgctgccc agcctggccg gtggcgaaca cggtcaacat ggtgaggccc tgcggcaccc 1924321 gaaatgcggt gagcagaacg acgtttggtg ccatcgcgga tagcagccag ccaagcttga 1924381 acgctgcgag cgagcccatg tagagcgttt ggtaccaaac cgatcggtgg gccaacttgc 1924441 catgggctca cagcggctat cgcgagcgtg tagccgatca tcgtccaggc gacggtggcc 1924501 tgagcggcag gggttgcctt attcatcctc ttgcggcatg gttgccgcag ggagtgccgg 1924561 taagtctggt cggcaacctg gcccgctgcg ggttgggttc ggattcgctc ggctagtaag 1924621 gtgctcgcct ggtgttacaa cgaatcgcta gagagctctt atcgggagtg gccgtcgcga 1924681 tcgttgcgct gccgctggcg atcgcgttcg gcattaccgc caccggaacg tcccaaggtg 1924741 cgctcatcgg gctctacggc gccatcttcg ccggattctt cgcggccgtg ttcggtggga 1924801 cacccggaca ggtgacgggc cccaccggcc ccatcaccgt cgtcgctacc gcaaccatcg 1924861 ccgaacacgg actcgagggt gccttcttcg cgtttatcct cgccggcgtc tttcagatcc 1924921 tgttcggggc gtgccggctc ggttcactca tccgctacgt gccccacccc gtgatctctg 1924981 gattcatggg gggaatcgcg atcctcatca tcatgaccca gctggatcag gtgcgcagca 1925041 gctccctgct cgtgttggta acggtcgtcc tgctgctggc tagcggccgg tttatcaaag 1925101 cgattccacc gagcctgctc gtcctggttc tggtcagctc ggtgctgccg ctcgcggcgc 1925161 catggctgcg cgacctgcgc gctgggccgg tctcgatcaa caggacggtc gactacatcg 1925221 gcgagatccc acaggccatg ccgtctttcg acttcccgca agtcgccaat tcgacgatgc 1925281 tgcaggtgct gctgtcggcg gtggccatcg cgctgttggg atccctcgat tcactgctga 1925341 cgtcgctggt catggacaac atcaggggca cccggcaccg gagcaacaaa gaactgatcg 1925401 gccaggggat tggaaatatc gccgccgggc tcttcggcgg gctggccggt gccggcgcga 1925461 ccgtccgatc ggtggtgaac gtcagaaatg gtggtcagac cgccctgtcg gcggccactc 1925521 acagtgtcgt tttgttcgtt ttcgttgccg ggcttggtgc cgtggtgcag tacatcccgc 1925581 tcgccgtgct gtcggggata ctgatattgg ttgccgtcgg catgttcgac tggcacgcca 1925641 tgcgcaaagc gcatgtgtca cccaggggcg acgtcatcgt catgttcacg acgatgatca 1925701 tcaccgtcgt cgtcgacctc accatcgcgg tgatggtcgg aatcgccctc tcgctgctgg 1925761 tccataggct ccgatcccgg caacgcaaag ccaaggtcac ccaggacgac accggcacct 1925821 atcgcatcga cggtccgttg tcgttcctgt ccgtcgacgg tgtatttggc tccctgcgcg 1925881 acggtcgtga ggacgtgtcg ctggacctcc agcacgtcac ctacctcgac acctctggtg 1925941 cccaggccct gctgtatttc atcgaccact ccgagaagga cggcgtcgcg gtaagcatca 1926001 agcggatccc cccacgcctc gaaagccaac tcaccgcact cgccgacaac gagcaacgtg 1926061 acaagctgag aaccgtcctc gaatccgcct gacgcattgg ctggttgatt tgcctgcggg 1926121 tctcccgggc caggcgtcgg tagccgttag actttcctgc gatgtccccc ctgacgcccg 1926181 tcaccacgag ccacgaccgg gtatgaccga ccaccccgac accggcaacg ggatcggcct 1926241 caccggacgg ccaccacggg caatccctga ccccacgccg cgcagctcgc acggcccggc 1926301 caaggtcatc gcgatgtgca accagaaggg tggcgtcggg aagacgacgt cgacgattaa 1926361 cctgggtgcc gcgctcggtg agtatggccg gcgggtgctg ctggtggata tggatccgca 1926421 aggagcgctg tccgcgggcc tgggcgtgcc gcactacgag ctggacaaga ccatccacaa 1926481 cgtgctggtg gagccccggg tgtcgatcga cgacgtgctg atccactccc gggtgaaaaa 1926541 catggatctg gtccccagca atatcgatct gtccgcggcg gagatccaac tggtcaacga 1926601 ggtgggtcgc gagcagacgt tggcccgggc gctgtacccg gtgctggacc gctacgacta 1926661 tgtgctgatc gactgccagc cgtcgctggg cctgctcacc gtcaacgggc tggcctgcac 1926721 ggacggcgtg ataattccga ccgagtgcga gttcttctcg ctgcgcggcc tggcattgct 1926781 caccgacacc gtcgataagg tgcgcgaccg gcttaatccg aagctggata tcagcggaat 1926841 cctgatcacc cgctacgatc cgcggaccgt caactcgcga gaggtcatgg cccgtgtcgt 1926901 ggaacggttc ggtgacttag tgtttgacac cgtgatcacc cgcacggttc gtttcccgga 1926961 gaccagcgtc gcaggcgaac ccattaccac ctgggcgccg aagtcggcgg gtgccctggc 1927021 ctaccgtgcg ctggctcgcg agttgatcga ccgatttggc atgtgaacgg ccttcagaac 1927081 agcctggcga acggtgggac ggcacccgag aacggctact cggctggttt tcgggtccgg 1927141 ctgaccaact tcgagggccc gttcgacctg ctgctgcagc tgatctttgc gcaccaactc 1927201 gacgtcaccg aagtggcgtt gcaccaggtc accgacgact tcatcgccta caccaaagcg 1927261 atcggcgctc ggctggaact agaggagacc acagcgttcc tggtgatcgc cgcaaccttg 1927321 ctcgatctca aagcagcccg gctcctgcca gccggacagg tcgacgacga ggaagacctc 1927381 gcgcttctgg aggtacgcga cctgctgttt gcccggctgc tgcaataccg ggcgtttaag 1927441 cacgtcgcag agatgttcgc cgaactggag gccaccgcgc tgcgcagcta tccacgggcg 1927501 gtgtcgttgg aggacgggtt cgtcggtctg cttcccgagg taatgctcgg cgttgacgct 1927561 caccggttcg ccgaaatcgc tgcgatcgca ttaaccccgc ggccagcccc gacggtggcc 1927621 accgagcacc tgcacgagtt gatggtctcg gttcccgagc aggccgaaca cttgctggcg 1927681 atgctgaaag cgcggggcag cggccagtgg gcgtcatttt cggagctggt cgccgactgc 1927741 acggcgccca tcgagatcgt ggggcgcttc ctggcgctgc tcgaactgta tcggacccgg 1927801 gcggtagcat tcgagcagtc agagccgctt ggcgcgctcc aggtttcgtg gaccggtgac 1927861 gatgcagagc gcagcgatga gaaggagcgg cgcttgtgac cgaacatatg cccgaacacg 1927921 atccgagcta tggcatcccg gatatcgctg agcccgcgga gctggatgcc gacgagctta 1927981 agcgtgtgct agaggcgctg ctgttggtga tcgacacccc agtgacagcc gacgcgttgg 1928041 ccgcggccac cgaacagccg gtctaccggg ttgcggcaaa gctacagttg atggccgacg 1928101 agctcaccgg gcgtgacagc ggcatcgacc tgcgccacac gagcgagggt tggcggatgt 1928161 acacccgcgc ccgattcgcg ccctatgtcg agaagctgtt gctggacggc gcgcgaacca 1928221 agctcacccg ggccgcgctg gagaccctgg ccgtggtggc ctaccgccag ccggtcacac 1928281 gagcgcgggt tagtgcggtg cgcggggtca acgtggacgc cgtgatgcgt acgctgttgg 1928341 cccgcggcct gatcaccgag gttggtaccg acgccgatac cggcgcggtg acgttcgcca 1928401 ccaccgagct cttcctggag cgcttgggat tgacgtcgct gtcggagctg cccgatatcg 1928461 caccgctgct tcccgacgtc gacacaattg acgacctgag cgaatccctg gacagtgagc 1928521 cacgtttcat caaactcacc ggtgagctgg cgtccgagca gacgctgtcg ttcgacgtgg 1928581 accgtgattg atggccgagc cggaagagtc ccgggagccc cggggcatcc gcctgcagaa 1928641 agtgttgtct caggctggaa tcgcgtcgag gcgagccgcc gagaagatga tcgtcgacgg 1928701 ccgcgtcgaa gtggacgggc acgtggtgac cgagttgggt actcgggtcg accctcaggt 1928761 cgcggtggtc cgtgtcgacg gggccagggt ggtgctcgac gactcgctgg tgtacttggc 1928821 gctgaataag ccgcgcggca tgcactcgac catgtccgac gatcgcggcc gcccgtgcat 1928881 cggcgacttg atcgaacgaa aggtccgggg caccaagaag ctttttcatg tcggacgcct 1928941 agacgcggac accgagggac tgatgctgct gaccaatgac ggcgagttgg cgcaccggtt 1929001 gatgcatccc tcccatgagg tgcccaagac gtatctggcg acggtgacgg ggtcggtgcc 1929061 gcgtgggctg ggccgaacgc tgcgagcggg aatcgaattg gacgacggac cggcgttcgt 1929121 cgacgatttc gcggtagtgg atgcgatccc cggcaagacg ttggtgcggg taacgctgca 1929181 tgagggacgc aatcgcattg tgcgccgact gctggcggcc gccggcttcc cggtggaggc 1929241 attggtgcgt accgatatcg gcgcggtgtc actgggaaag caacgcccgg gcagcgttcg 1929301 ggccttgcgg tcgaacgaga tcgggcaact gtaccaagcg gtgggcctgt gagtcgccta 1929361 agcgcagcgg tagtcgcgat cgacgggccg gccggcaccg gaaaatcctc ggtgtcaagg 1929421 cgattagcgc gcgagctggg cgcacgcttt ctggacaccg gggcaatgta tcggatcgtg 1929481 acgttggcgg tgctgcgtgc cggtgctgat ccgtccgata tcgctgccgt cgagacgatt 1929541 gcgtcgacgg tgcagatgtc gttaggctac gatcccgacg gagacagctg ttaccttgcc 1929601 ggagaagacg tttcggttga gatacgcggt gacgcggtca cccgtgcggt ctccgcggtg 1929661 tcgtcggtgc cggccgtacg cacccggctg gtcgagctgc agcgaacaat ggctgagggc 1929721 ccgggcagca tcgtcgtgga gggccgcgac atcggaaccg tggtgtttcc ggatgcgccg 1929781 gtgaaaatct tcttgaccgc ctcggccgaa acgcgggccc ggcggcgcaa cgcccaaaac 1929841 gtcgcggcgg gtttggccga cgactatgac ggggtattgg ccgatgtgcg ccggcgcgac 1929901 cacctcgatt ccacccgggc ggtgtcaccg ctgcaagccg ccggtgatgc cgtcatcgtg 1929961 gacaccagcg atatgaccga ggccgaggtg gtcgcccatc tgttggagct ggtcacgcgg 1930021 cgaagtgagg cagtgcggtg acccaggacg gcacgtgggt ggacgaaagc gattggcaac 1930081 tagacgattc ggagatcgcg gagtccggag cggcgcctgt ggtggcggta gtcggccggc 1930141 ccaatgtcgg caagtccacc ctggtcaacc ggatcctggg ccgccgcgag gcggtggtgc 1930201 aggatattcc cggcgtgacg cgtgaccggg tctgctacga cgcgctgtgg accggacgcc 1930261 ggttcgtcgt acaggacacc ggcggatggg agcccaatgc caagggcctg cagcggttgg 1930321 tggccgagca ggcctcggtg gccatgcgca ccgcggatgc ggtgatcctg gtggtcgacg 1930381 ccggtgtcgg tgccaccgcc gccgacgagg ccgcggcccg tatcctgttg cgatccggca 1930441 agccggtgtt cttggccgcc aacaaggtcg acagcgaaaa aggcgaatcc gacgccgcgg 1930501 cgttgtggtc gctgggcctg ggtgagccgc atgcgatcag cgcgatgcac ggtcgggggg 1930561 tggccgacct gctcgacggg gtgctcgccg cgctgcccga ggtgggggag tccgcgtcgg 1930621 cgagcggcgg tcctcgccgg gtggcgctgg tcggtaagcc gaacgtcggc aagagctccc 1930681 tgctgaacaa actcgcgggt gatcagcgat cggtggtcca tgaggcggcg ggcaccaccg 1930741 tcgacccggt ggattcgctg atcgagttgg gcggtgacgt ctggcggttc gtcgacaccg 1930801 cgggattgcg gcgcaaggtc ggccaggcca gtgggcatga gttctacgcc tcggtgcgca 1930861 cgcacgccgc catcgactcc gccgaagtgg ccatcgtcct gatcgacgcg tcgcagccgc 1930921 tcaccgaaca ggacttgcga gtgatatcga tggtcatcga ggccggacgg gcgctagtcc 1930981 tggcctacaa caagtgggac ctggtcgacg aggaccggcg cgagctgctt cagcgcgaga 1931041 tcgaccgaga gctggtgcag gtgcgctggg cgcaacgggt caacatctcc gccaagacgg 1931101 gccgggcggt gcacaagctg gtgccggcca tggaggatgc gctggcgtca tgggacacca 1931161 ggatcgcgac cggcccgctg aacacctggc tcacagaggt gacggcggcc acaccgccgc 1931221 cggtgcgcgg cggcaagcag ccacgcatct tgttcgcgac ccaggccacc gcgcggccac 1931281 cgacgttcgt gttgttcacc acgggttttt tggaggccgg ctatcggcgg ttcttggagc 1931341 ggcggctgcg tgagacgttc gggtttgacg gcagcccgat ccgggtcaac gtgcgggtgc 1931401 gagagaagcg ggccggcaag cgccgctgag cgcacctcga acgtgtgacc cgggtaaccg 1931461 gggatggaca gcgaggccgg ttctgctgtc ccataatgcg gctatgttca gctgcattac 1931521 gggatttagg tgttgacacc cgagcgctcg gcgcttacgc tttctcgtat aacgggtgat 1931581 aagtaccgta ttgcgggagt aggtggagga aatggcgctg gctcagcagg tgccgaacct 1931641 gggtctggcg cgcttcagcg tgcaggacaa gtcgatcctg atcaccggcg cgaccggttc 1931701 gttgggccga gttgccgccc gggcgctggc cgacgcggga gcgcggctga cactggccgg 1931761 cggcaactcg gccggtctgg ccgagctggt caacggcgcc ggcatcgacg acgccgccgt 1931821 cgtgacctgc cggccggaca gcctggccga tgcccagcag atggtcgagg cggcactggg 1931881 ccgatatggc cgtttggacg gagtgttggt ggcctcgggc agcaaccatg tggcgcccat 1931941 taccgagatg gccgtcgagg acttcgacgc tgtgatggac gcgaacgtgc ggggtgcctg 1932001 gctggtgtgt cgggcggccg gacgggtgct gctcgagcag ggtcagggcg gcagcgtggt 1932061 gctggtgtcg tccgttcgcg gcgggttggg caatgccgcc ggttacagcg cgtactgccc 1932121 gtcgaaggcg ggcaccgatc tgttggccaa gacattggcg gccgaatggg gcggtcacgg 1932181 cattcgggtg aacgcgctgg cgccgacggt gtttcggtcc gcggtgaccg agtggatgtt 1932241 caccgacgat ccgaagggcc gggccacccg ggaggcgatg ctcgcccgga tcccgttgcg 1932301 ccgcttcgcc gaaccggaag acttcgtcgg cgccctgatc tatctgctca gcgacgcctc 1932361 gagcttctac accggccagg tgatgtatct ggacggcggg tacaccgcat gctgacctcg 1932421 cacgggttct cccgtgccgc cgtcgtgggt gccgggctga tgggccggcg catcgccggc 1932481 gtgctggcct cggcgggcct ggatgtcgcc atcaccgaca ccaacgctga gattctccac 1932541 gccgcagcgg tggaggccgc ccgggtagcc ggtgctggcc gtggctcggt ggccgcggca 1932601 gccgacctag ccgcggcgat accagacgcc gacctggtga ttgaggccgt cgtcgaaaac 1932661 ctggccgtca agcaggaact cttcgaacgg ctggcgacac tcgcgcccga cgcggtgctg 1932721 gccaccaaca cctcggtgct gccgatcggc gctgtcaccg aacgggtcga ggacggcagc 1932781 cgagtgatcg ggacacactt ttggaacccg ccggatctta tcccggtggt cgaggtggtg 1932841 cccagcgcgc gcaccgcccc agatacggcg gatcgcgtcg tggcgctgct gacccaagtc 1932901 ggcaagctgc cggtgcgggt cgggcgcgac gtgccgggtt tcatcggcaa ccggctgcag 1932961 cacgcgctgt gacgcgaggc gatcgcgctg gtcgccgagg gtgtctgcga cccgaagacg 1933021 gtagatctcg tggtacgcaa caccattggg ctgcgactgg ccaccttggg gccgctggaa 1933081 aacgccgact acatcgggtt ggacctcacc ctggccatcc acgacgcggt gatcccgagc 1933141 ctcaaccacg acccgcaccc cagcccgctg ctgcgggaac tggtcgccgc cgggcaactc 1933201 ggggcgcgta ccggtcacgg ctttctggac tggcccgcag gagcccgcga ggccaccacc 1933261 gcccgacttg cccagcacat cgccgcgcaa ctccaagcca acgaaaaagg aagggggaca 1933321 tagccatgac gttcgcctgg cccctcggtg ccgccgaatc gacgttggag ttctacgacc 1933381 tgtcccaccc ctggggacac ggcgcgccgg cctggccgta cttcgaggac gtgcagatcg 1933441 aacgactcca cggcatggcc aagagtcgtg tgctgaccca aaagatcacc accgtcatgc 1933501 attccggcac ccacatcgac gcgccggcgc acgtggtgga aggaacaccg tttctggacg 1933561 agatcccgct gagcgccttc ttcggcaccg gcgtcgtcgt ctcgatcccg aagggcaaat 1933621 gggggatggt caccgccgag gatctgcaaa acgctacccc cgacatccgg cccggtgaca 1933681 tcgtcgtcgt caacaccggc tggcaccaca aatacgccga cagcgccgag tactacgcct 1933741 attccccggg cttcgacaag aaagcgggcg agtggtttgc ggccaaaggc gtcaaggcgg 1933801 tcggcaccga cacccaggcc ctggaccatc cgctggccac ggccatcgcc ccgcacggtc 1933861 ccgcggaggc acagggcggc ctattgccgt gggcggtacg cgaatacgag gcgcagaccg 1933921 gccgcaaggt gctcgacgac ttcccggact gggaaccgtg ccatcgggcg atcctgtcgc 1933981 agggcatcta cggctttgaa aacgtcggcg gtgacctgga caaggtcacc ggcaagcgcg 1934041 tcactttcgc ggcgttcccg tggcgctggg tgggtggcga cggctgcatc gtgcggctgg 1934101 tggcgatcgt cgaccccacc gggagctatc gcatcgagac cggaaaggcg gcctgatgaa 1934161 actgacacga gcgtcgcagg cccccaggta tgtggcgccg gcgcatcacg aggtgtccac 1934221 catgcggttg cagggccgcg aggcggggcg caccgagcga ttctgggtgg ggctgtcggt 1934281 ctatcggccc ggcgggacgg ccgagccggc gccgacccgg gaggagaccg tctacgtcgt 1934341 gctcgacggc gagctggtgg tcaccgtcga cggcgccgaa accgtgttgg gctggctcga 1934401 cagcgtgcac ctcgccaaag gcgaactgcg atcgatacac aaccgcacgg atcgtcaggc 1934461 gctgctgctg gtgaccgtcg cgcacccggt tgccgaggtg gcgtgatgag ctgcaccggc 1934521 gacgatgcag agcgaagcga tgctgaggag cggtgcgaat gagcatcgtc atcaccgtcg 1934581 cacccaccgg ccccatcgcc accaaggccg acaacccggc gttgccgacg agccccgagg 1934641 aaatcgcgac agccgtcgag caggcctacc atgccggtgc cgcggtggcc cacatccacc 1934701 tgcgcgacga aaacgaaagg cccacagcgg atccgaacat cgcgcgccgg gccatggacc 1934761 tcatcggcga gcggtgtccg atcctgatcc agctgtccac cggggtcggc ttgacggtgc 1934821 ccttcgagca gcgcgagcaa ctggtcgagt tgcgcccgcg gatggccacg ctgaatccgt 1934881 gctcgatgag cttcggcgcg ggcgaattcc gcaacccgcc gcaagcggtt cgtcggttgg 1934941 cggcacgcat gcgggaactg gacatcaaac cggaactgga aatctatgac accgggcatt 1935001 tggaggcgtg cctgcgactg tgggcggaag acctgctggc cgaacccttg cagttcagca 1935061 tcgtgctcgg ggttcggggc ggaatggccg ccaccgccga taatctgctc acgatggtgc 1935121 gccggctgcc cccggggcga tctggcaagt catcgcgatc ggtaaggcca acatggaact 1935181 gaccgccatg ggcctggcgc tgggcggcaa cgcccgagtc ggcttggagg acaccttgta 1935241 cctgcgcaag ggcgagctgg cgccgagcaa tctggcgctg gtatcgcgca cgatacgtct 1935301 cgccgaagcc ttggacctgc cgatcgcctc ggtcgaagaa gccgaggcgg cgctgcagct 1935361 gcccggcacg tcctgagagg agctcgcttg tgtccgccga agagcaggac acccgcagtg 1935421 gtggcatcca ggtgatcgcg cgggcggccg aactgctgcg ggtgctgcag gcgcaccccg 1935481 gcggtctcag ccaggccgag atcggcgagc gggtgggcat ggcccgctcg accgtgagcc 1935541 ggatcctcaa cgcgctggag gacgaggggc tggtggcctc gcgcggggcc cggggaccct 1935601 atcggctggg cccggagatc acgcggatgg ccaccacggt acggctgggt gtcgtcacgg 1935661 agatgcaccc gttcttgacg gagttgtcgc gcgagctgga cgagacggtg gacttgtcga 1935721 tcctggacgg ggatcgggcg gacgtcgtgg accaggtcgt gccgccgcag cggctgcggg 1935781 ccgtgagcgc ggtgggggag tcgtttccgc tgtactgctg cgccaacggc aaggcgctgc 1935841 tggccgcgtt gccgcctgag cggcaagccc gcgcgctgcc gagtcgactg gcgccgctga 1935901 cggcgaacac catcaccgac cgcgcggcgt tgcgggacga gctcaatcgc atccgggtgg 1935961 acggtgtcgc ctacgaccgt gaggagcaga ccgaaggcat ctgcgcggtg ggcgcggtgc 1936021 tacggggggt gtcggttgag ttggtggcgg tgagtgtgcc ggtgcccgcg cagcggttct 1936081 acggccgtga agccgagttg gccggtgctc tgctggcctg ggtttcgaag gtagacgcgt 1936141 ggttcaacgg cactgaggat cgcaaatgac agaagcgttg tgcgacaagc tcgttggggc 1936201 ctgggacctg gtgtcctacg tggagcgggc cgcggctttg gcgttgggat acctggccta 1936261 cggcggacgg tagttcgtcg acaaggcgta gggcgtggcc gggtttgcag gccggctgcg 1936321 gtaggctttc gacctgccgc cggtggtgtc gccggtggca ccgggctgtg gcgcagtttg 1936381 gtagcgcact tgactggggg tcaagtggtc gcaggttcaa atcctgtcag cccgacttac 1936441 gtttccgcag gtagaccgcc ctgctggcgg tcctcggctg ccgctgaggc agtaccgcca 1936501 aggggtatgt acagcaaccg gtacagcaac ccggtcaaat ccccagagca ccgctgagac 1936561 cttccactgc ggctcgcgcc gcttcgtcgc tggtatgacc gcgccaccgt gctggacacc 1936621 gcctaccgag accacctcga gcggttcgtt cgcaaaccac ccgagccacc cgcgctaccg 1936681 gccttcagcg cgatcaaccc accaccaaag gaggaccagc cgactcaatg aatccccgaa 1936741 aatcgtgtct cagaaatgtt gacaggttcc gcggtagatc aggcgacaag ctcgatctcc 1936801 gcattatggc catgggattg ggcaagtcgc ccgtcgcaag tgataagcgg tacgccgagg 1936861 ccctcggcga gggcgacgta ggctccatcg gccacggtat gagtggaccg aagttggtag 1936921 gcacgctggg tgaatggctt taacggccaa cgccgaacgg gcaggctaag gaagttgaca 1936981 accacgacga gtccttcatg atcgctgatc agctgacgca cgaccgcttg acgtatcgcc 1937041 ccgatcacct cgacatcgaa atgtgcaggg gcgtgcacgg tttcgccccg caagcgccgg 1937101 gcgaccgccg cacccgccgg cgtcgtgagc atgagctcga cggccgccga ggcgtccaac 1937161 acgatcactc agatcgagcc tcgtcaacaa gctcggctgc gctcgcgccg agatcccggc 1937221 gcggcagtgc cgctagacgg tcgagaacgt cgtcgagggc cggttcttcc gcgatctcgg 1937281 caagccgtgc taggaggaaa tcgctcaggc tcatccgttg cgccgctgcg cgggccttca 1937341 gctcgtggag aagctcgtcg ggaacgttgc ggatctgaac catggcggac atgttgtaag 1937401 catatcggac atgtgaaaca catgtccggt tgccggtgtg accggctggg gcgtgtaggc 1937461 gtcaacccac gccgtgcacg cggccatggg cgggtgcaga cttttgccat gcaaccatgt 1937521 gagctcaccg ccgtcgcgct gaccgcaacg cccccgcccg cgcctccgtc cctgcgccgg 1937581 gcaccggcgt cgacgtcacc gcggctggcg tgatcgtgcc cgcccgcgag cctgagcccc 1937641 agccgcgccg cgtgctgaac ggcctttcgg acgtacgcgc gttctttcac aacaacacca 1937701 tgccgctgta cttcatctcg ccgacgccgt tcaacctgct gggcatctat cgctggatcc 1937761 gaaacttctt ctacctgacc tactacgact ctttcgaggg cgaacattcg cgcgtgttcg 1937821 tgccccggcg gcgcgaccgc agggatttcg acggcatggg ggatgtgtgc aaccacctgc 1937881 tgcgtgatcc cgagacactc gagttcatca agaacagggg tcccggtggc aaggcctgtt 1937941 ttgtgatgct ggacgaagag acccaggcgc ttgcgcgcca ggcggggctc gaggtcatgc 1938001 accccccggc ggagctgcgt catcgcctgg aatccaagat cgtcatgacg cgcctggccg 1938061 acgaggcggg cgtacccagc gtgccgcacg tgatcgggcg ggtgagctcc tacgacgaat 1938121 tgtcggcgct cgcgcacggc gcagggctgg gagacgacct cgtcgtcgag gccgcctatg 1938181 gcaacgccgg cagcgcaacg ttctttgtgc gcggattgcg cgactgggac cagtgcgccg 1938241 gtggcatagt ggggcagccg gaaatcaagg tcatgaagcg catccgcaat gtcgaggtgt 1938301 gcatcgaggc caccgtgacc cgccacggca ccgtgatcgg cccggcgatg acgagcctgg 1938361 tcggttaccc ggagctgact ccgtaccggg gcgcctggtg cggcaacgat gtttggcgtg 1938421 gggcgctacc acccgcacag acccgcgccg cgcgagagat ggtggcaaag ctgggcgacg 1938481 tcttgagccg cgagggctac cgcggctact tcgaggtgga cctgttgcac gacctggacg 1938541 ccgacgagct ctacctcggc gaggtgaacc cgcgcctctc cggtgcaagc ccgatgacga 1938601 acctgaccac cgaggcctac gccgacatgc cactgttcct cttccacctg ctcgagtaca 1938661 tggacgtgga ctacgagctg gacatcgagg cgatcaactc gcgctgggag cggggctacg 1938721 gcgaggacga ggtctggggt cagctgatca tgtcggagac ctcgccggac ctcgagctct 1938781 tcaccgcgac cccacgcacc gggatgtggc gcctgaacca cgacgggcgt gtctcctttg 1938841 cccgccaggg caacgactgg gccacgatgc tcgacgagtc cgaggccttc tacatgcggg 1938901 tcgccgcacc gggcgaccta cgctgcgagg gcgcccaact cggtgtgttg gtcacccgcg 1938961 ggcacctgca gaccgacgac taccagctca ccgagcgcgg ccggcgctgg atcgacggcc 1939021 tcaaggcgca gttcgcctcg acgccgctga cgcccgccgc cccgatcgtc tcgcggctcg 1939081 tcgcacgggc gtgagcggcg gcgtcccggc cggtctcgca ctggacaact ggctgtcgtc 1939141 gccgtattcg cattgggcat tccagcacgt cgaagacttc atgccgacca cggtcatcgc 1939201 gcgcggcacc gagccggtcg tgacgttgcc cgcggacaat gcgccgatcg ccgacatcgg 1939261 cttgaccagc acggacggga tcgccaccac cgtgggcgcg gtgatggccg ccaccgctac 1939321 cgacgggtgg gcggtcgcgc atcgcggtgc gctggtggcc gagcagtacc tcgacggcct 1939381 gggaccccgg acccgccacc tgctgttctc ggtgagcaag tcgctggtgg cggctgtggt 1939441 cggcgcgctg cacggggccg gggcgatcga gcttgacgcg ccggtcacgg cgtacgtgcc 1939501 cgccttggcg gactgcggct acgccggtgc gacggtgcgc cacctgctgg acatgcgatc 1939561 gggtgtcgcc ttctcggaga actacgacga cccggccgcc gagattcacg tgcgcgagca 1939621 ggtgatcggg tgggcgccca agcgcggtcc ggacctgccc gccacgctgc gcgactacct 1939681 gctgaccttg cggcggaagt cggcgcacgg cggcccgttc gaatatcgct cgtgtgaaac 1939741 cgacgtcctc ggctggatct gcgaggccgc ggccggacag ccgatgcccg aactgatgtc 1939801 ggaactactg tggagccgca tcggggccca gtgcgatgcc accatcgccc tagacgtagc 1939861 cggcgcggcg ggcaccggaa tattcgacgg cggcatcagc gcctgtctga ccgacatgat 1939921 ccggttcggg tcgctgtacc tgcgcgacgg tgtctcgttg gccggccagc aagtggtgcc 1939981 cgcggcctgg atcgccgaca ccttcgacgg cggccccgac tcgcgtcagg cgttcgccgc 1940041 cagccccgac gacaacccga tgcccggcgg gatgtaccgc aaccaagtgt ggtttcccta 1940101 cccgggcagc aatgtcgcgt tgtgcgtggg catgtgcggc cagctgatct acgtcaaccg 1940161 cgccgcggag gtggtcgccg ccaagctgtc cacccagccg cactcccatg agccgcacat 1940221 gttagacacc ctgcgcgcat tcgatgcggt ggcacacgaa ttgtcaggaa tcagatcgag 1940281 ttcgaccaac gacccgcagc ggccttcccc gccagcccag gaggccagtc cggggtaacg 1940341 gcttgtgccc acgtaaccga gttccagggc gatgggctta ttagcggaaa tatgactcgt 1940401 cccaggtatc catacgacgc ttgcgtacct cggcgagctt gtggtcaagc gccgcctgct 1940461 cattttcgat ggcacgaccg gcgtttctca cggcgttgta gacggcatcg tccagcttgc 1940521 atagatcctt tgcggacacg tcggtcgata cgaaaaccga gaaccgaata cggtcgtcga 1940581 gcagcgaaat gtcgatcttt ggtgatggtt tgagttggcg ctggaagtgt ttggctagtg 1940641 cttggatcca atactttggc caatgcggca ccggtctcag atcgtagacg atgatggctt 1940701 gctcgccgtt gtagacgccg atgggcgctt ccgctcggct gaagcatggt cgccgcaggt 1940761 tccgcaggtc ctggagttcg ttctcttcat tacccaccat gagcctccgg catctggtct 1940821 acggacacca cggcttgccg catggctggg gcgaagggac tccaagccat ccaggatggg 1940881 aacgcgcgcc gcatcgccgg caggccgtcc agttcgatgc gctcggcgga gatctcggcg 1940941 gccagtgtgc tgcggccgct gtagacccga tacagatctc gaggatggcc gcgcaccgtg 1941001 aggtcgacag gtaggcatgg atcgtgcagg cacaccgaga tgtccccagg ttccaacacg 1941061 agccaggccc acagtggccg ctcgccgtgg tagcggaact ccaccaccac ccgccggccg 1941121 ggaagggcct cggtgttgac gcgccgggag atccacaacg tgagtagttc ggggtcgcat 1941181 tcggcgggag tggggtcggc catcaaccaa cgggagaccc agtcccccag ggtctgcagc 1941241 acggggcgta gctcctcgcc ggccaccgtg aaccgatagc ccccgcccgt gtgttcgggg 1941301 accgcttcga tgatgcggtc gtgctgaagt cggcgtagcc gctgggccag caccgagcgg 1941361 gagatgccgg gcaggccccg ctcgatttcg gtgaaccgca gcgggccgaa gagcagctcc 1941421 cgcacgatta gcagcgtcca gcggtccccc agcagctccg ccgcccgcgc taccgggcag 1941481 tactggccgt acggctgcac gacaccaggc taatcgccat ccctggctgc gtggttcgga 1941541 attcgaactt cccgcacccc ctgtgggagg cgtaacgctt ggtgctggag gtgagaggcg 1941601 atgaccgcga cgctgaccaa gacgctgggt tccctcgacg atttcagggg aacgctttgt 1941661 gtccccggtg atccggacta ccccagggtg cgggccatct ggaacgggca ggtggcccgc 1941721 gaaccggcct tgatcgccac gtgccacgac gcgtgcgatg tccgaacggt gctgcggcgc 1941781 ccggtggacg ccgggatggt gaccgcggta cgtggcggcg ggcacaacgt ggccggcacc 1941841 gcgctgtgcg acggcggcgt ggtgatcgac ctctcggcga tgcgggccgt ctcgctggat 1941901 ccagcgactg ggcgggtacg ggtgcagggt ggtgccacgc tcgccgattt ggaccacgcc 1941961 acggtcccgt tcgcccgggt ggcccccgcc gggatcgtca ccaccaccgg tgtcggcggg 1942021 ctgacgttgg gcggcggggt gggttggacg actcgacgtt tcggactgag ctgcgacaac 1942081 ctggtcgcgg tgcggctagt caccgccgcc ggcgactacc taagcgtcga cgacgagcgc 1942141 gacccggagc tgatgtgggg cctgcggggc gggggcggca atttcggcat tgtcactgaa 1942201 ttcgaattcg ccacccatcc gttcggtccg gtcgccgtgg ccggcttcgt cgtctaccgg 1942261 ctggatgacg ggcccgcggt gcttcgcggc taccggcagt tcgccgctgc ggcacccgag 1942321 gaggtgacca cgatcgtggt cttgcgccac gccccgccgg caccgtggat tcccgttgac 1942381 cagcgcggca agccggtggt catgatcggc gccgtccaca ccgggagcat ccagaccggg 1942441 atcgaagcgc tgcgaccggt caagtccctc gccagacccg tcgccgacac cgtgtggccg 1942501 accccgttcc tggcccacca ggcggtgctg gacgcctcca acccggccgg tcaccgctac 1942561 tactggaaat ccgactactt ggccgagctg aacgacgagg ccatcgactt gctagttgag 1942621 cagacggcgc agctgtcctc gccggacagc ctcatcggaa tcttccagct cggcggcgcc 1942681 gccgctcgcg gcggtgagcg ttcctgcttc ccgagccggc acgcgcgatt catggtcaac 1942741 tacgccaccc attggaccga ggcccgcgag gacgaccttc accgccaatg gacccgcgac 1942801 gcgatcgagg cgctggcccc gtacgggctg ggcaccgcgt atgtgaactt caccgccgac 1942861 gacgcaccga tgcacgtcga aacactttac agcacaacgg agttcagtcg tttggtgacc 1942921 ctcaagaacc gactcgaccc ggacaacgtg ttccgcaata accacaacat ccgcccctcg 1942981 gcatgagggg gcccaagttg accgtaggaa ggacgatcat ggacctctat tcaaacctcg 1943041 tcgaagccga acaacgcctg gtcgcgctgg tttcgtcgat agaagccgac agctactcct 1943101 cgccgacgcc gtgcgaccgc tgggacgtgc gggcgctgct cagccacgcg ctggcctcga 1943161 tcgacgcctt cgcggcggcc gtcgacggag cacccggacc ggacatggcg caggtgttca 1943221 gcggtgccga catcgtcggg gacgaccccc tcggtgcgac gcagcggatc acccggcggt 1943281 cgcaggcggc ctggtcgacc gtgcgcgatc tgaacgcgga gctgtcgacc ttcatcggcg 1943341 tgatgccggc ggggcaggct cttgcgatca tcaccttctc caccgtcgtc cacggttggg 1943401 acctagcggt ggccacgggc caggccggcg aactcccgga gcacctggcc gaagcggccc 1943461 aacaggtggc ggccgaactg gttcccgtcc tgcgtccgcg gggcctgttc gcacacgacg 1943521 tcgacctagc gggggaagcc acgcccactc agcggctcgt cgcccttacc ggacggaaac 1943581 cgcggtgagc tgcgtttggt tgtcgcgttc gatcattctg gcggcgtagg gctcatggat 1943641 ccgacgtagt aggtttcccg cccggttggg tatccgccgc cgtcggtggc gatatggacg 1943701 tggtcgtagt ggttgagggt ttctgagccg tagtccgccg tccagctcgg cgcgccgatg 1943761 cctgggtagt agccctgccg ccagatcaca tggagcactc cccatcgttt cgcattcgcc 1943821 aaggcaagtc cggcgacttg gttgccgagc tggataccct cgtcgctgtg atggttcggg 1943881 atcatcacgt cgatcgctaa cccgttggga tgccacttca agggatcctg cctatagcca 1943941 aagatgttgg tgatctgagg aaatagcaca gagacggcac gggctaccca gatcgtcttg 1944001 acctgcaacc cctcttccga cgcaacgcca gcaggtagcg cgaactggaa ttgctgggca 1944061 gcgacaggcg cgcttgccgc caacaagtcc gcttccgtgg gactggcgat gcgcggtgcg 1944121 ttggccggcg cagagtcggg gcctgtcggg attgccgccg gggtctcgcg gcagcacgta 1944181 tgctctgcgc cttgggcata gaggatggcg gcagagacga cgagcgaggc cgcgattgcc 1944241 aaccagcggc cccggccgtt ggccaacacg cctttgctca cgaacagcac tttagtgtgt 1944301 cgtgtgcgac gcgtgtggca acctttgcta tcgattggtt gcagacccgc gttgtgcgca 1944361 ccgggcaagc cgttcacgct catcgccaac ccgctgccgt cggcggtgaa atggaagagt 1944421 ggtcggtcag gcagccgctg atgaagatgg tgtcgtcgtc tgcagcgcgc acatcgagac 1944481 cgctgcgccg gaacagttct gccagcgaca cgccttcggg ttgccagccc ttggcggcga 1944541 ggtagtcgag gacgtggttg cgtgggccgg tgtagaccaa cgaagccaag tcgacgtcca 1944601 cgccatgaca gcggaacggg ttggatatgg tccgcgctcg ttcggcgctg aaatccgcta 1944661 taccggtaac aaattcggtg gctaccatgc tcccgggcgc gctgagtgcg gtgatgttgt 1944721 cgaacagcct gtcttgagtt tgcggcttaa gatatatcag taacccctcg gccagccacg 1944781 ccgtcggtgc tgccgagtca aacccggcgg cttgtaatgc cgtcggccag tctgcgcgca 1944841 agtcgatggg cacggcgcgc cggatggcgg agggttcggc gcccaggtcg gctaaggttg 1944901 tcgtcttgaa ctccatcact tttggttggt cgatctcgta taccaccgtc ctggtcggcc 1944961 acggcagccg gtaggcccgg gagtccaacc cggacgcgag gatagcgact tgccgaatcc 1945021 ccccagcggt ggcgttaagg agatagtcgt caaagtattt ggtgcggacc gcgtttccgt 1945081 acaccattgc ctgcgccacg gccggtgaaa cgtccgcgat cgtcgacatg tcgagttcac 1945141 cgtccatcat cttggtgaac aaatccagcc ccaccgcacg gaccaggggt tcggcgaacg 1945201 gatcgttgat caaaccgcgc ggatccttgg tggccagcgc acgcccgacc gcgacaatgg 1945261 tcgcggtgac gccgacgcta gacgtcagat cccagttgtc gtcgtcggtg cgggccacca 1945321 gcccacccta gtctgattgc ccggttcctc ctcgcgccgc aaacggcgcg catcgtcacc 1945381 gggcgtcgtc tgattgcccg gttcctcctc gcgccgcaaa cggcgcgcat cgtcaccggg 1945441 cgtcgtctga ttgcccggtt cctcctcgcg ccgcaaacca agccggctgg tgctgtgcta 1945501 ttggcgtcgg aacagacggc cgtgctggct acagaaccag gcgatgttgc cgtccggccc 1945561 gcgcacgaag ttggagcgac tgccggtggg cttgttgtcg ggtccaaggt cgagcccata 1945621 gtcgggccga tagaaggcga ggcccagatt ggcgctgttt tggccatccg ggttggcatc 1945681 gtcggtgctc atgcttccag caagctggcc gtccctggcc cggaagtcga tgaccgttgt 1945741 ctcgaggtcg ccattttggg cgacttgctt ggcgatgtac cggccctcgt agggcgccag 1945801 gtcgacggca ccaaggcgtt gcggcgtggc cggaagattg ctgagcccgg cgaatctctg 1945861 caatgcccag tcggatgcga aaaggtcgtt gatcatatga aatccgccat cagagttagt 1945921 gagcacggtc atggcgaagt ttcgatcggg caccatgacg aacccagagc gctgcccctt 1945981 ccaggtgccg ccgtgctcaa cgatggtcac attctccgcg gagggccgca gcatccaggt 1946041 cacgcccatc ccggtcagtt ccacccaaag tgttccgccc gccccagggt tagagcgcat 1946101 tgccttcagc gattgtcggc tcagaatctg ctcaccgtta ggcgccctgc cgtcgccgag 1946161 gtggaactgt gcgtaacgca gctgatctcg cgctgtggac atcaacccac cggtggggtt 1946221 gcagctgcgc gggaatgtcc aaaagtcagt aacggcaatc ggtttgccgt cgaccacgct 1946281 atgcgatgcg gccacattca gaccgattat ttggtcggaa aagtagcgag tgtgagcaag 1946341 ctgcagcggg tcaagcaaca gcctctgaac cgtagattcg taggttgttc cggcgacaag 1946401 ctcgatgatg cggcccgcaa ccacaagacc tgaattgttg tacgcgaacg cggttcccgg 1946461 aggggtgagc tgcggtaggc gtgtcatcgc cttgacatag agcgccaccg cgtcatcgcc 1946521 gcgcccaaag tcctgcccat tgcgaccatc ccagcctgcg gtatggttga gcagttggcg 1946581 aacggtaacc gtagcgctgg ctgattcgtc ggctaccgcg aagtcgggga tgtagcggcg 1946641 cacaggtgaa tccaggtcca ccttgcctcg ctcgaccagc cgcatcatca ccgtacctgt 1946701 gaaagtcttt gtggtggaac cgattctgaa gacagtgtcg ccgtcaacag gcatcggatg 1946761 gtcgacattg gtgaccccgt agcctttgac gtattcttgc ccgccggccc agacagcaac 1946821 cgcgacgccc ggaatcgcat aggccttcat gcccgcgttg atttttgcat cgagttcgtc 1946881 gaacgctgca ccagggtctg cgcagttgac agtttcaacc actgcagtgg cgatttcgtg 1946941 cggcagtcga tctagcgcac gcacgtattc ggtgacgacc gcgcgcccat ggcgcgtccc 1947001 gcaccgcgtg ccggtcggcg tcgcggaact caagatgatc ggcggacaca aggaccgcgg 1947061 cgacccggcc ggtggcggcc gatctgaaca gcttcgtggg gggatccgct tcgtcaacca 1947121 acgcggaaag catggctttg gccttccgcg gccgcgtcca catgagtgtc aatatagctg 1947181 gactaacatg aacatcgcga ggccggttct tcgtggtaac gtgccgggat cccaagggac 1947241 tgccggaagc gaatttggtt gcgccgcttg gggcgtcgcg agagattcgg caatcccctg 1947301 gctggaggat cccgttcagc cagggcgtag gcgctgcggc gtgcacggct tggccccaca 1947361 acccgtattg atgccacctg aacaagaaga acccggcatt cgtcgagaat gcctttggtc 1947421 accaatcgca ggccgatact ctgtgcccta gacacccgca tttcttcgaa agaggtgacg 1947481 atatgcctgc accctcggcc gaggttttcg atcgcttgcg taacctggcc gcgatcaagg 1947541 acgtcgccgc acgtccgacc aggacgatcg acgaggtctt caccggcaag ccgttgacta 1947601 cgattccggt cggcacggcc gcggacgtcg aagcggcatt cgccgaagct cgcgcggcgc 1947661 agaccgactg ggcgaagcgt cccgtcatcg agcgagctgc agtcatccgc cgctatcgcg 1947721 acctggtcat cgagaaccgc gagttcctca tggacctcct gcaagccgag gcgggcaagg 1947781 cccgatgggc ggcgcaagag gaaattgtcg atctgatcgc gaacgcgaat tattacgcac 1947841 gagtctgtgt ggacctgctg aagccccgta aggcacagcc gctgctgccc gggataggca 1947901 agaccacggt gtgctatcaa ccgaagggcg tggtgggggt gatctcgccg tggaactacc 1947961 ccatgacgct tacggtgtcg gactcggtgc ccgcgctggt ggccggtaac gcggtggtgc 1948021 tcaagccgga cagccagacg ccgtattgtg cgctcgcgtg tgccgagctg ctgtatcggg 1948081 cgggtctgcc gcgagcgctg tatgcgatcg tgcccggtcc gggctcggtg gtgggcaccg 1948141 ccatcaccga caactgcgac tacctgatgt tcaccggttc atcggcgacc ggcagccgcc 1948201 tcgccgagca cgccggccgc cggcttatcg gtttctcggc cgaacttggc ggcaagaacc 1948261 ccatgatcgt ggcgcggggt gccaacctcg acaaggtcgc caaggcggcc acccgtgcct 1948321 gcttctcgaa cgccggccag ctgtgcatct ccattgagcg gatctacgtc gaaaaggaca 1948381 tcgccgagga gttcacccgg aagttcggcg atgcggtgcg gaacatgaag ctcggcaccg 1948441 catacgactt ctcggtcgac atgggtagtt tgatctccga agcacagctg aaaaccgtgt 1948501 ccggtcacgt ggatgacgcg acggccaagg gcgccaaggt gattgcgggc ggcaaggctc 1948561 gacccgacat cgggccgctg ttctacgagc cgaccgtgct gaccaacgtc gcacccgaaa 1948621 tggaatgcgc ggccaacgag acgttcgggc cggtggtctc gatctacccg gtcgccgacg 1948681 tggacgaagc cgtcgaaaag gccaacgaca ccgactacgg gctcaacgcc agcgtctggg 1948741 ccggctccac cgcggagggc cagaggatcg ccgcccggct gcggtcgggg acggtgaacg 1948801 tcgacgaggg gtacgcgttc gcctggggca gcctcagcgc gccgatgggc gggatgggcc 1948861 tctcgggggt cggccgccgg cacggtccgg agggcttgct caagtacacc gaatcacaga 1948921 cgatcgcgac cgcccgcgtg ttcaatctcg atccgccctt cggcatcccg gccacagtct 1948981 ggcagaagtc actgttaccc atcgtgcgca ccgtgatgaa gcttcccggc cgcaggtgac 1949041 ggcgcggcct agcgccactt gatgccgcac ccgatcgacg gtcgttggtc ggggttgact 1949101 ggccgcccgg cgagcagggc gtcgaccgcg gcccggacgt cggcggccgt caccggtcgg 1949161 ccattgcccg ggcgggagtc gtcgagctga ccacggtaga caagtcggcg ctggccgtcg 1949221 aagacgaacg tgtcgggtgt gcaggccgcg gagaaggcgc gggcgacgtc ttgggtttcg 1949281 tcgtagagat acgggaacgt ccagccgtgg cggcgggcct cggcgaccat ctgatcgggc 1949341 ccgtcctgcg ggtaggtgac gacgtcgtta ctggagatac cgaccatcgg gacgccttga 1949401 tcggcgaggt cccggccgag cgtggccaat ccggcggcga cgtgttgcac gtacgggcag 1949461 tggttacaga tgaaggtgac gacgagggcg ggacccgtga gctcgtcgag gctgaccgtg 1949521 gcgccggtcg ccggctgggg cagtgtgaac gacggcgcgg gggtgccgag ggcgagcatg 1949581 ctggattcaa cggccatgcc gtccagagta cggtcgcggt ccagcttggc ggagccctgg 1949641 ttgccgctac cggacggttg tcaccgctgc gtgcagaaca ggctgtcgat gtcgtgttgc 1949701 caactggcgt tgcgaacgcg gatcagaatc gcccgagtga gcgccagcag ggcgcccgca 1949761 accgcggcga cgctcaacca gagtcccaag gcggccaggg ccgcatccgc aatggcacgg 1949821 gccggcggag ctggttcatc gaccagctga ccggcactgt cgacccaaat gccgacgcgg 1949881 tcaccggatt tggttcccgg cttcgcgttg acctcaccgc tgcgttctat tccgttcacg 1949941 acccatcggg caggcacggt gatcttcgtg cgcggcggcg ctgacgtggc ggtcgtgttg 1950001 ctgtcgatca ccccctcgtg atcgatcacg gtcgcggttg cgggatggcg ggtctgggcc 1950061 tggtgggcat agacgtggct gcgggaatca tggactgcgg tgccggccgc ggcggcgaac 1950121 gggatagtca gcagcgagac cgtgacggcc agcagcatga cgaccgcctc gagtcgatcc 1950181 gtcccacgca ccagcggatt gcggctgaac acccgcagta tcgtccggca cggcaagcgc 1950241 agcctaaacg tgatcatggt ggctccttca cgatcgcggg ttgtggcgat catcgctgtg 1950301 aattgctcgt ggctcctagg gtcgttcggc cttggggctg gggacgtcgg tcacgaatgg 1950361 ctgggcgccg tgcatatcgg gtgaaccggg cgtcgaacaa gcgaagtttt attgtcggat 1950421 aagggacttt cgccccttcc cgcctgctgt gtttggtggc agtattggtg ataccgggga 1950481 aacccggtga tctgcccgaa gtgctgggcg attgagcggg tatgtacacc cggtttgacc 1950541 taccgtccca agacggggct accgccttcg ggcagatcct catcctgcta ctgcggcgca 1950601 ccgcgtcagc tcgttgatcg acaggaagaa cagcgcgccg cgatggtcat cgctgcagcc 1950661 gtggtcagcg ggcagcgtag ccagcacggt cgtcatgacg tggatcgcgc cgtcgacggc 1950721 gcaaactcgt tgtgccggct tgccgaaact gaccagcgcg acctgaggtg ggtagatcac 1950781 cccgaagacc gcgtcaaccc cctggtcacc gacgttggtc atggtgatcg tgaggtccga 1950841 tagctccgag cccggggact tcggtcccca gagcccccga cttggatcag tggtcggata 1950901 tcgctcgatg acagcgatga gctctacaca actggccgag gccagaacac gaggttcgcc 1950961 cgcgtgctcg gaccatatct ggtcgttgtc accgccacgg cgctagcgca cgcgtcgtcg 1951021 cggacgtccc gcttgttacg ggcgattggt ggccaggcgg tcatggtgct gatggcattg 1951081 tcgggcggta tctcagctac atcggctggt ttcgacgaaa cgctcgaact tggtggtcga 1951141 acgaccgcgg cgggcggcag atgatggcat gggtgtcatc agcggccccg atggcgtgcg 1951201 atgaccggcc gctgcggccg atggtggcgg ctaggtggtg cagcatggca acgaaggtga 1951261 tcgtccacac cgccaacgcg acccaacctt cgaattcgcc gatgctttcg acgatgggca 1951321 gatgggcggc caggccgaga cggtatgcgc ccacgccgta catgccgagc gggaacacga 1951381 cgctccacaa cgttgcctcg tagcgcagcg ggacacggtg gacgacatgt ttccatatgc 1951441 tggcggcgac cagcggtggg atcagccacg gtccgaaggc ccagaacacc accgacgctc 1951501 ccgcaacgag tccgctggtg acgatagcca ttggtgcatc agccatttcg acgatgtggg 1951561 cgccggccag cacggtgata gccgtggcgc ccatcgccac ccaatagggc ggggtgagat 1951621 ccgcgggccg cagcgggtag agcagcaggc gggcgacgac caggctgccg acagcgacgt 1951681 acagaaacac gcctactgac caactaatcg cgtgccgagc acgtcggacg cagcgacaaa 1951741 ggtgaacatc ccaaatcccc ggcgcggatc agctaggtcg tcggcgaatt ctttgcggaa 1951801 gatgacgatt cgtgtcgtgc tcaccgcgat caagacagca taggcggtgc aggtcaccca 1951861 cagcaggacg acggaaaggg catacgtcca cccgcacagg cgggtgacga cagggccgaa 1951921 agtcgtgtct cgcatcgaaa tcgacgccag cgcggacttg ttcgacgagt agacgtgtcg 1951981 ctaacgtcga tctcgatggg cagtcctgtc cgctcgccga agacgcactc ccgtcaccac 1952041 ccgcgccgcc gcggccgcgt tagcaccagc tcctcgcggc tgcggtagat gatgtacggg 1952101 cggaacagat agccgatcgg ggcgctgaac gcgtgtacca gccgggtgaa cggccacaac 1952161 gcgaacaacg ccaacccgat cagcacatgg atctggtaat acagcggagc ctcggccatc 1952221 aggtccccgc gcggttgcag tacccacacc gagcggaacc acaccgacac cgtctcgcgg 1952281 tagttgtacg cctcgccgac aacgccggag cccaacgccg tcgcacccag tcccgcgacg 1952341 atcgccgcca ccagcacgag gtacatcacc ttgtcgttga cggtggtagc catgaacacc 1952401 ggcccgcggg tgcgccgccg gtagatcagc agggtaacgc cggccaaggt ggtgatgccg 1952461 gcgatcgacc ccagcacgac ggcctgcacg tgatatgcgc cctcgctcaa accggcggcc 1952521 tgagtccacg actgcgggat cacgagcccg ataccgtggc cgacgatgac caccaggatg 1952581 ccgaaatgaa acatcgggct ggcgatccgc agcagccgcg actcgtacag ctgggacgag 1952641 cgggtggtcc agccgaattt gtcatagcgg tagcgccacc aggagccgac cgcgacgatc 1952701 gtcatcgtca catacggcac gacggtccag aagagttcgc ccatcatgtc acccgtccgg 1952761 cataccgcgg ccaccgtgtg tgcgtatggc aatgcggcct cggtcagggc attgcacagc 1952821 gcggcgatgg gcacccggta cccgctcagc aaccgccgcc ccgcctcggg gtcgacggtc 1952881 gcggcgaatt cgagcaccac cggcaggaag tccggggtct cgccgcgcgg tggtgcgacg 1952941 tcggtgctgc ggtaggtctg ggcgaaggcc agcatctccc ggccgcggtt gcgggtgtcg 1953001 ccggcggtcc agtaggtcag gtacagggtg gcgcggcctc gcaggtcgaa ggtgtcgacg 1953061 tagcgggtcg ccgcggtcag cggatcggca cggcgcagct cagagaccgt gcgccccaac 1953121 agatccgcgg ccggaccgtc gatgtgggcc agcaattcct ctgcggtgcc gagttgccgt 1953181 gagttcgggt aggtcagcag caccgaggcg cattgccaca ccacgtccca ccaatctccg 1953241 gactccggca cgtcggtctg gtcgccgaac acctgcgggg aggccaccgg caggtcggcg 1953301 taccagtcgt agaacgacgt catcaccccg ccgattagct ccacgaaccg cgaccccgcg 1953361 gcgtggctca ccatggacat cgccgggatg ggggagaagc cggcaacccg gtccgggccg 1953421 tatgtggaga tggtgtgcac gtgggcggcg gcgatcatct cggtggcctc ggcccagctg 1953481 acccggacca gcccgccctt gccgcgggcg cgctggtagc ggcggcgccg ccgcgggtcg 1953541 gcctggatgt cggcccaggc cgccaccgga tcacccaaac gtgccttcgc ctcccgatac 1953601 atctcgacaa gcacgccgcg ggcgtacgga tggcgcaccc gcgtcggcga atacgtgtac 1953661 caggaaaacg ccgcgccgcg cgggcagccg cggggctcat actcgggccg gtccgggccc 1953721 accgacggat agtcggtctc ctgcgtctcc caggtgatga tgccgtcttt gacgtagatc 1953781 ttccaagaac acgacccggt gcaattcacc ccgtgtgtgg agcggaccac cttgtggtgg 1953841 ctccaccggt ctcgatagaa cacgtcgccg tcgcggccgc cgcggcgggt cacggtacgc 1953901 agatccgccg agatctcacc cgggatgaag aaccggccgc tgcgtgcaag cagctcctcg 1953961 atgcggctgc cggtccgtgg tgtcaccgtc acctggacgc ctcctcactc accggctccc 1954021 gcgcgtgcag cgcggtgtag gtacacgcga ccagcgcggt cgccaccagc agcagcaacc 1954081 cgaccgtgta gtcgttgtcg accgggtcgt aggtcgcgcc catcaccagc ggcgggaagt 1954141 aaccgcccaa tccgcctgcc gcggcgacga ttccggtgac cgagccgacc gatgcggccg 1954201 gggcgcggcg ggccacccac gcgaacacgc cgccggtgcc cacgccgagg cagaccgcca 1954261 gggtgatgaa ggtggccgcc gaccacacct ccggcggcgg ctgcaacgcc gcggcgaacg 1954321 ccagcagcgc ggtcccggcg agcgaggcca gcaccacgtg cctcggtgcg atccggtcgg 1954381 agagccaccc gcccaccggc cgggccagca ccgccgccag ggcgaacccg gcggtgcgag 1954441 cgcccgcgtc gaccgtggag aacccgtaga tcgtggtgat gtaggtgggc aggtagttgc 1954501 tgaacgccac gaacccgccg aacacgatcg cgtacagaaa cgacatctcc caggtcaccg 1954561 gcaaccgtgc cgcggccttg agcctgggca gcaccgggtc ggcgttgggc cgaaagtagg 1954621 gtgcatcacg aagcacgacc atggccacca cggcggtcga cgcgagcgcg gccgcgacga 1954681 tggcgtgggt ggtgaacagg ccgaaccacc gtacaaaccg cggggtgaag aacgccgaga 1954741 gcgcggtgcc gaccatgccc ataccgaaca cgccggtgga gaaaccgcgc cgcgccggct 1954801 ggtaccagtt gttggcgaac gggatgccga cggcgaagat cgtgccggca acgcccagga 1954861 agagcccgaa aaacaccagc aacgcgtagg agcccatggt tgccgcgacc ccgaccgcga 1954921 gcaccgggag gatcgacgcc agcgtcaccg cgatgagcat ggcgcgcccg ccgaagcggt 1954981 cggtgagcgg cccggtgacg atgcggccaa gggcacccac caggatcggg gtggcgacga 1955041 gcagcgacgc ctcggcgctg gacagtgaca tgtcacgcgc gtagctggtc gacagcgggc 1955101 cgatcaggtt ccacgcccag aagttgacca ccgagatcca ggtggccagc acgagattgg 1955161 ccgcttgccc tctcatcgac acgatccggg gtctcggact ccggcgaact ccgcgccccg 1955221 cccggacagc catgcgctag ccctggcttc gatggcgccg gctcagttag ggccggaagt 1955281 ccccaatgtg gcagaccttt cgcccctggc ggacgaatga ccccagtggc cgggacttca 1955341 ggccctatcg gagggctccg gcgcggtggt cggatttgtc tgtggaggtt acaccccaat 1955401 cgcaaggatg cattatgacc agcgagctga gcctggtcgc cactggaaag gggagcaaca 1955461 tcatgtgcgg cgaccagtcg gatcacgtgc tgcagcactg gaccgtcgac atatcgatcg 1955521 acgaacacga aggattgact cgggcgaagg cacggctgcg ttggcgggaa aaggaattgg 1955581 tgggtgttgg cctggcaagg ctcaatccgg ccgaccgcaa cgtccccgag atcggcgatg 1955641 aactctcggt cgcccgagcc ttgtccgact tggggaagcg aatgttgaag gtgtcgaccc 1955701 acgacatcga agctgttacc catcagccgg cgcgattgtt gtattgaggg tgccggcgcg 1955761 ttagcgccga cggaacgcct gcactgcggt aggcaatgtc ataaagatat ggtcttcgcc 1955821 aatcttatcg agaagactgg cggccctgag tgattcacgc aagtcttgtt tgacccgggc 1955881 catggcgaac actattcccc gacgcagcag ctcggtgcgg agttggtcga gcgcatccag 1955941 cgcagtcagg tcgacctcca cattggattc ggcgttgagt acgaaccact cgacttgccc 1956001 cggatcctga tcgaccacgg tcagtgctcg cctgcggaag tcttcggcat tggcgaagca 1956061 caacggcgcg tcatagcgat acaccaccag cccgggcacg cgcttggcct gcggatagtc 1956121 atcgatgtcg tgcatgccgg caatgcccgg cacgaacccg agaacgctgt catgcggatg 1956181 tgcgacccga cgaagcagtt cgaggatgga cagggcaacc gcggcgagga ctccatagaa 1956241 cactcctagg cctaacacgg ctgctgtggt ggctagtgcc agcatgagtt cgctgcgccg 1956301 aaaccgcgcc agtcgccgga attctgacaa gtcgatcaag cgtagcgcgg catataccac 1956361 caaagcgccc agagcggcga tcggaaacat ggccagcagc ccactcgcga aaaccatcac 1956421 gatgacaaca agccccaacg cgatcagcga gtacagctgg gtgcggccac cgacgacgtc 1956481 ggcgagggcg gtacggctgc tgctggaact caccggaaaa ccgtgtgtca gcccggcggc 1956541 gatgttgcag gccccgaccg cgcgcagctc ggcgttggca ttgacttcct gacctcgacg 1956601 agcggcgaag gcgcgtgcgg tcaacacacc gtcggtgaag gtaacaatcg cgatcccggc 1956661 agccggaatg atcagtgccc gcaagtcttc caccgaaacg ggcggcacac ccggcgtcgg 1956721 cagaccggaa ggtatccgac ccacaatcgc aatacctttg gcatccaagg acataacggc 1956781 cactagcatc gtggccgcaa gaaccgcgat gatcggtccg ggggcgcgcg gcgcccaccg 1956841 cgtgagcata gttagcagcg ctaggacaga catggctaac acaaaagtcg gccagtgaac 1956901 tcgcgtgacg ctagtcgcga aagagtgtac ttcgctgaag aattcgttgc cttcgaccga 1956961 ggtgccggtg atagtgccga gttggctgga gatcatgaca agcgcgatgc cggccatgta 1957021 tccgacgagc accggccgcg atagcaggct ggcgaggaaa cctagtcgcg ccgtgccagc 1957081 gagtaggcag ataaggccga ctagcaatcc gagggttgcc gccagaacgg catagcgtcg 1957141 aagatccccg gcggccatcg gagcgagcac ggccgccgtc atcaaggcgg tggcggattc 1957201 cgggccgatt gaaagctgcc gggacgatcc gagcagtgcg taaatggcaa gcggcgcgat 1957261 cgacgcccac agcccggctg ccggcggtag gcccgccacg gtcgcatacg ccatcgcttg 1957321 cgggatcaga taggcggcca cggtcaggcc ggcgaggaca tcgccgcgca gccaacgccg 1957381 ttggtattcg cggaactgca ccacccctgg tgcccagccg gccgatgtca tcgtgggaat 1957441 cattgtccga cggctggccg cttagctaga gtcggtctag aacccgccca atctttatag 1957501 aatcctgacc atggaattgg cggctcgaat gggcgagact ttgacacaag cggtcgtagt 1957561 tgcagtgcgg gagcaactgg cccgccggac cgggcgcacc agatccattt cgctacgcga 1957621 ggagttggcc gccattggcc ggcgctgcgc ggccttaccg gtgctcgaca cccgagccgc 1957681 ggacacgatt ctcggctacg acgagcgcgg gttgcccgcc tgatggtgat cgatacctct 1957741 gcgctggtcg cgatgctcaa cgatgaaccc gaggcgcaac ggttcgagat agccgtggca 1957801 gcagaccacg tttggctgat gtcgacggcg tcatatccgg agatggcgac cgtgatcgaa 1957861 acacgcttcg gggaaccggg gggacgtgaa cccaaggtca gcggccagcc tctcctctat 1957921 acgggtgacg atttcgcatg tatcgatatt cgcgcggttc tcgccggctg agccggctat 1957981 gagcgccctg ctggatgggg tgttggacgc ccacggcggg ctgcagcgat ggcgcgccgc 1958041 ggaaacggtt catgggccgg gtacgcacgg gagggctgtt gcttcgaacc cgggtgccgg 1958101 gcaaccgctt cgcggactac cgcatcacgg tgcatgtcca acaggcccgg acggtcttgg 1958161 atccgttccc gcgtgacggg taccgcggag tcttcgagag cgggcaggtg cggatcgaaa 1958221 gccacgatgg cgcggtcatc agctcgcgcg cgcacccgcg agcggcgttc ttcggacgct 1958281 cgggcctgcg ccggaacatc cggtgggacc cgctggactc ggtctatttc gccggttacg 1958341 cgatgtggaa ctacctcacc acgccgtacc tgttgacgcg cgaaggcgtg gcggtcgagg 1958401 agggagcgcc ctggcagcag gagggcgaga cctggcggcg cctgattgtg agcttcccgc 1958461 cggatatcga cacccactcg cctcgccaga ccttttacgt cgatgccagc ggtctcttgc 1958521 gccgccacga ctacgtcccg gaggtcgttg gccactgggc acgggcagct cattattgcg 1958581 ccgaccccgt ggatgtcgac gggtttgtat tcccgacttg ccggtgggtc cacccgatcg 1958641 gcccggggaa tcgctcactg cccttcccaa ctctggtatc gatcctgctg accgacatcc 1958701 gggtcgagac cgattaggtt tcgccggaag tcgccgcacc tcgcggttgc tgaaaccatt 1958761 agccttatgc ctgtcacacc accgcggttg gcggggtgag gagtcgggcg atggatggca 1958821 ccgcggaatc gcgggagggt acgcagttcg ggccgtatcg gttgcggcgg ttggtgggtc 1958881 gcggcggcat gggcgacgtc tatgaggccg aagacacggt gcgcgagcgg atcgtggcac 1958941 taaagctgat gtcggagacg ctctccagcg atccggactt ccgcacgcgt atgcagcgcg 1959001 aggcccgcac cgcggggcgc ctgcaggaac cgcacgtcgt gccgattcac gacttcggtg 1959061 agatcgacgg gcagctctac gtggacatgc gcctgatcaa cggcgtggat ctggccgcga 1959121 tgctgagacg ccaggggccg ctggccccac cgcgagcggt cgcgatcgtg cgccagatcg 1959181 gctcggcgct cgacgccgcg cacgctgccg gggcaacgca tcgcgacgtc aaaccggaga 1959241 acattctggt tagcgcggat gacttcgcct atcttgtcga tttcgggatc gccagcgcca 1959301 ccaccgacga aaagctgacc cagctcggca acacggtggg caccctctac tacatggcgc 1959361 cagagcggtt cagcgagtcg cacgcaactt accgcgccga catttatgcg ttgacctgcg 1959421 tgttgtatga gtgcttgacc ggatcaccgc cgtatcaggg agaccagctc agcgtgatgg 1959481 gcgcgcacat caaccaggcg atcccgcggc ccagcacggt acggccgggt attccggtcg 1959541 ccttcgatgc ggtgatcgcc cgtggcatgg ccaaaaatcc ggaggaccgc tatgtcacct 1959601 gcggtgatct gtcagcggcg gcgcacgcag ccctggccac cgcggatcag gatcgtgcca 1959661 ccgacatctt gcggcgcagc caggtggcca agctgccggt gccatcgact cacccggtgt 1959721 caccgggtac ccggtggccg cagccgacgc catgggctgg cggggcgccg ccatgggggc 1959781 caccgtcgtc tccgctgccc cggtcagccc gccagccctg gttgtgggtt ggtgttgccg 1959841 tcgccgtcgt ggtggcgctg gcgggcggcc tgggtatcgc gcttgcccat ccgtggcggt 1959901 catctggacc ccgcacgtcg gcaccgccgc caccgccgcc cgcagatgcg gtcgagctcc 1959961 gcgttctcaa cgacggtgtc tttgtgggta gctcggtggc gccgacaacg atcgacattt 1960021 tcaacgaacc catctgtcca ccctgcggca gtttcatcag gtcgtatgcg agcgatatcg 1960081 ataccgcggt ggccgacaag cagctggcgg tgcgctacca cctgctcaac ttcctcgacg 1960141 accagtcgca cagcaagaac tattcgacgc gagcggtggc cgcctcgtac tgtgtagcgg 1960201 ggcaaaacga cccgaaactc tacgccagct tctactccgc cctattcggc agcgactttc 1960261 agccgcaaga gaacgccgca tcggatcgca ccgatgccga actggcacat cttgctcaaa 1960321 cagtcggcgc cgagcccacg gcgatcagct gtatcaagtc aggagctgat ctgggcaccg 1960381 cccaaacgaa ggccacaaac gccagcgaga cgctggccgg cttcaatgcc agcggtacgc 1960441 cgttcgtgtg ggacggcagc atggtcgtga actatcagga tccgagctgg ctcgcgaggc 1960501 tgatcgggta gcgcgggtgg tgtggcctcg tcccggacaa ttccgcttgc tctcgcagca 1960561 tgtccgcagc ggtgcgcggt tgtgacggtg aattcacgat gctcgccgtt gatgtcggca 1960621 ggtaccaccg cggtgtggct tgcgtcgcgg acggtgcggt cagattcggc gatggtcccg 1960681 agggcggcag ctactatgcc aacgacaggc gcccacaaat atcctgcggt tgagttgcag 1960741 accgggtggg tcgttcaccg atccactgta gggccggtga ctcagaacgt ggccgttaat 1960801 tcgaaacccg gcccaggttg ccaacccgaa gattttgggc gccgaccaca ttccgcagtc 1960861 ccgaacaatt cacgcaccac aaacacccca cacagtcggt gcagcgcacg cagccgatac 1960921 aggccacgca ccgggtgcag gtgatgcatg ctaggcatgc cacacactgc cggacagcca 1960981 cgcacaatac ggtcagcaga ctgccgatta tcccgacgct gcccgccgtg gctgccgccc 1961041 cggctatcgc gacgctgccc gcggtcgcga ccgagccggc gactgcgacg ctgcccgcgg 1961101 tcgccaccga gccggcgact gcgacggcgc ccgtggtcgc ggccgatccc gcgacggcga 1961161 tgctgtcgat gctggcgatc gagcggttaa ttaccatgtg cggctttcgg tagccggcag 1961221 tcgtcggcca cgggccactg tgccggacat ggtccaagtt tggtcagtta gcccagttgt 1961281 gagcggcacc aaggggatac cggggcgatt acgccggcgg taacatcgcg cacgaattgt 1961341 tcccaggaca accagcggat cgcgtcgacc tcgtccgagt tcggccgggg ctgttggtca 1961401 acctggactc ggtagacggg gcagatctcg ttttccacgg tgccatcggc catagcggcc 1961461 cggtagcgga accccggcag gatcagatcg acccgatctg gggtcagtcc gagttcggca 1961521 gcgagccgcc gccgtatggc gccgggtagc gattcgccag gcagggggtg cccgcagcaa 1961581 ctgttggtcc ataccgccgg ccacgtcctc ttggtggcgg cccgccgcgt gatcaacagc 1961641 tgatcgtgca gatcggacac atagctggag aacgcgaggt gcaaaggggt gtcgccggtg 1961701 tgcacggtgg ccttgtcggc cacacctgtc gcgtcgccgc ggtcgttgag caaaaccacc 1961761 cgctcgatcg gtggagctgg ccggtagctg cgggtcatgc cagacctcct tacgcttgct 1961821 tgcgagggtc ggttcgcggc cccaacgctg gcaaactacc ggagagtcac ttgtcgcgtg 1961881 cggagttcca cgattctcgt cgagtgtcgc aagccctgcc ctcctggcgg gctacgatgc 1961941 cgccatgccg ctcgcggaag gttcgacgtt cgccggcttc accatcgtcc ggcagttggg 1962001 atccggcggg atgggcgagg tgtacctggc ccggcatccc agactgcccc gccaggacgc 1962061 gctcaaggta ctgcgggccg atgtgtcagc cgacggcgaa taccgggcac ggttcaaccg 1962121 cgaagccgat gccgcggcgt cgctgtggca tccacacatc gtcgccgtcc acgaccgcgg 1962181 cgagttcgac ggccagctct ggatcgacat ggacttcgtc gacggcaccg acaccgtatc 1962241 ccttctcagg gatcgttatc cgaacgggat gcccggcccc gaggtcaccg agatcatcac 1962301 tgcggtggcc gaagcgctcg actatgccca cgaacgtcgg ctgttgcacc gcgacgtcaa 1962361 acccgccaac atcctgatcg ccaatcctga ttcacctgat cgtcgaatca tgttggccga 1962421 cttcgggatc gccggctggg tcgatgatcc aagcggattg accgccacaa acatgactgt 1962481 gggcaccgtg tcatacgcgg ctccggaaca gcttatgggc aacgagctcg atggacgggc 1962541 cgaccaatac gcactagccg cgacggcgtt tcacttgctg accggctccc cgccctttca 1962601 gcacgccaac cccgccgtgg tgatcagcca gcatctcagc gcgtcacccc cggcgatcgg 1962661 cgatcgggtt cccgagctga caccgctgga cccggtcttc gccaaagcgc tggccaagca 1962721 acccaaggac cgttaccagc ggtgtgtcga cttcgcgcgc gcactcggcc atcgtctggg 1962781 cggcgcgggt gatcctgacg acacgcgggt gtcgcaaccg gtcgccgtgg ccgcgcccgc 1962841 gaaacgctcg ctgctgcgga ccgccgtcat cgtccccgcg gtgctggcga tgctgctggt 1962901 gatggccgtc gcggtcaccg tgcgggagtt ccagcgtgct gacgacgagc gtgcagcgca 1962961 gcctgcgcgg acgcggacca ccacatcggc cggcacgacc acttcggtag cccccgcgag 1963021 cacaacgcgc ccggccccca cgaccccgac cacgactggc gccgccgaca ccgcgactgc 1963081 atcgccgacc gctgcggttg tcgccatcgg cgccctctgc ttcccgctcg gcagcaccgg 1963141 caccaccaag accggggcga cggcctactg ctcgacgctg caaggcacca acaccaccat 1963201 ctggtcgctg accgaggaca ccgtggccag tccgactgtg accgccactg ctgacccgac 1963261 ggaggcgccg ctgcccatcg agcaggaatc gccgattcga gtgtgcatgc agcagaccgg 1963321 ccagacccga cgggaatgtc gcgaggagat tcgcagaagc aacggctggc cgtgatggtc 1963381 ggcttgcctg accgggtgca cccgccccgg cgtcggctgc ggtcccgata cagttggtgc 1963441 cgatgagcca accagccgcc ccgcccgtgt tgaccgtgcg gtatgaggga tcggagcgca 1963501 cgttcgccgc aggacacgat gtcgtcgtcg ggcgtgacct gcgcgcggat gtccgcgtcg 1963561 cacaccccct gatctcccgg gcacacctgc tgctgcgatt cgaccagggt cgctgggtcg 1963621 ccattgacaa tggcagcctc aatgggctct acctcaataa ccgtcgggtg ccagtcgtgg 1963681 acatctacga tgcccagcga gtccatatcg gaaaccccga cggtccggcg ctggacttcg 1963741 aagtgggccg ccaccggggt tcggccgggc gaccacccca gacgacgtcg atacgcctgc 1963801 ccaacctgtc cgcgggagcg tggcccaccg acggcccgcc gcagaccggc acgctcggct 1963861 ccggccagct acaacagctt ccaccggcca ccacccggat acccgccgct ccgccatcgg 1963921 gaccacagcc gcgatacccc accggtgggc aacagttgtg gccacccagc ggaccgcaac 1963981 gggcgccgca gatttaccgg ccacccacgg ccgcaccgcc gccggcgggt gcccgcggcg 1964041 gaactgaggc gggaaacctc gcgacatcga tgatgaagat cctgcggcca ggcaggttga 1964101 cgggggagtt gccgcccggt gccgtcagga tcggccgggc gaacgacaac gacatcgtca 1964161 ttcccgaggt gttggcctca cgtcaccacg ccaccctggt cccgacgcct ggcggcacgg 1964221 agattcggga caaccgcagc atcaatggca ccttcgtcaa cggcgcccgg gtcgacgcgg 1964281 cgctgctgca cgacggcgac gtcgtgacca tcggcaacat cgacctcgtc ttcgccgacg 1964341 gcaccctggc gcgccgtgaa gagaacctgc tggagacccg cgtcggcggc ctcgacgtgc 1964401 gcggggtgac ctggaccatc gatggcgaca agacactgct ggacggcatc tcgttgacgg 1964461 cgcgccccgg tatgctcacc gccgtcatcg gtccgtcggg cgctggcaag tcgacacttg 1964521 cccggttggt ggctgggtat acgcacccga cggatggcac ggtgacgttc gagggccaca 1964581 acgttcacgc cgaatatgcc tcgctgcgca gcaggatcgg catggtgcca caggacgacg 1964641 tggtgcacgg tcagctgacc gtgaaacacg cgctgatgta tgccgccgaa ctacggctgc 1964701 cgccggacac caccaaagat gaccgcaccc aggtagttgc ccgggtgctc gaagaactcg 1964761 agatgtccaa gcacatcgac accagggtcg acaagctgtc gggtggtcaa cgcaagcggg 1964821 cgtcggtggc gcttgagctg ttgaccgggc cgtcactgct gatcctcgac gagccgacat 1964881 ccggcctaga tcctgcgctg gaccggcagg tcatgaccat gctgcggcag ttggccgacg 1964941 ccggtcgggt ggtgctcgtg gttacccact cactgaccta cctggacgtc tgtgaccagg 1965001 ttctgctgtt ggcccccggc ggcaagaccg cgttctgtgg gccaccgact cagattggtc 1965061 cggtcatggg gaccacgaac tgggccgaca tcttcagcac cgtcgccgac gacccagacg 1965121 cggccaaagc ccgctacctg gcgcggacgg gtccgacccc accaccgcca ccggtcgagc 1965181 aacccgccga actgggcgat ccggcccata ccagcttgtt tcggcagttc tccacgatcg 1965241 cgcggcgaca gttgcgattg atcgtttccg accgaggtta cttcgtcttt ctggcgctgt 1965301 tgccgttcat catgggtgcg ctgtccatgt cggtaccggg cgacgtgggc ttcgggtttc 1965361 ccaacccgat gggtgacgcg cccaacgagc ccggccagat cctagtgttg ctgaatgtcg 1965421 gtgcggtctt catggggacc gcgctgacca ttcgtgacct catcggtgag cgagccatct 1965481 tccggcgcga acaggcagtc ggcctgtcca ctaccgccta cctgatcgcg aaggtctgtg 1965541 tctacaccgt gctcgcggtg gttcagtcgg cgattgtgac ggtgatcgtc ctggtcggca 1965601 agggcggtcc gactcagggt gccgtagcgt tgagcaagcc agatctggag ctgttcgttg 1965661 atgtcgcggt gacctgtgtc gcctcggcga tgctcggatt ggcgctgtcg gcgatcgcca 1965721 agtccaacga acagatcatg cccctgctgg tcgtggcggt catgtctcag ctggtgttct 1965781 ccggaggcat gattccggtc accggacgtg ttccccttga ccagatgtcc tgggtcacac 1965841 cggcgagatg gggtttcgcg gcgtcggccg ctacggtcga cctgatcaaa ttggtgcccg 1965901 gtccgctgac cccgaaggat tcgcattggc atcacaccgc cagcgcgtgg tggttcgaca 1965961 tggccatgct ggtagcgctc agcgttatct acgtcggctt tgtgcgctgg aagattcgcc 1966021 tcaaggcgtg ctaggcggca gttcactgcc caacccaggt ggaattaacg ggaatggctg 1966081 tctcactcac cggctcaaca ggtggccttg ggcgcgcgac gcgaccgcac ccgccgaccg 1966141 tgacgtgcga ctgattctga gctaacgcac gcagggggaa ctcgagcccg gtgaccagct 1966201 cgagcgcggc gccgggcggg tgagatcgac gtgtgggtcg ccaacgccgt gctgccagcc 1966261 tccggcaagc tcgacagcat caccgcggag ccggttggcc gcgcgctgcg gggacggcgc 1966321 gcttgacggc gaacgcgccc gagatcgccc tcctcggcgt cgccgaccag gtcgcggccg 1966381 gtcagattga caagcggtga agccggttgc cgggtggtgt ctgctccggc cgaccctggg 1966441 gccgtccatg gtggcatcct ggcctggtgg ggctactgat tcggctagcc gagttgctcg 1966501 ttgtgatgct gccgctcatc ggagtgctat atgtcggcat caaagcgctg tcgtccttca 1966561 cgcggcggct aggggaggcg tctggcgatc ttgcgtcgga tagccccgcg atgccacgcc 1966621 caaccactgt cgaaaacgac gcagcgcggt ggcgggcgat cactcgcgcg gtcgaggcgc 1966681 acgagcgaac ggatgcacgc tggttggaat acgagctcga cgccgccaag ctgctcgact 1966741 tcccggtcat gaccgacatg cgggacccgc tcacgacggc atttcacaag gccaagctac 1966801 aagccgactt tcacaagccg ttgcgggcgg aagatcttct cgacgacccg gacgccgcgg 1966861 gccactatct cgatgcggtt cgggactatg tgaccgcgtt cgacaccgcg gaggccgagg 1966921 cgatgcgcag acgcagaacc ggcttttccc gcgaggaaca gcagcggctg gcaagagcgc 1966981 aaagcctgct gcgggtggca tccgacgccg gcgcgacggc ccaggaacgc gagcgcgcat 1967041 atcgtttggc gcgcaccgaa ctcgacggac tcatcgtgtt gccggaccgt acgcgggccg 1967101 gcatcgagcg ggggatcgcc ggcgagctcg atgactaagg ctgacctttc ggcaccgcgt 1967161 cgccgttgct gtgccacgac cacgcataga gcgcccacat gacgatgggt agcaggatgt 1967221 cggtccacag cgggacgccg atgttgtatg ggttggtgtt gttctccacc acccagtagt 1967281 agatgtggcc ggccgcgtct ccgacgtact ggatggtgag caccacgatt gtcgccagcc 1967341 agaagtgccc gcggaagcgg tacgccatca ggccgaccac cccgattgcc aggtcgccca 1967401 ttgcgttctc ccattggaac ccgccgtcgc cgcgcgtata gccgatcaac tcggcggtcc 1967461 gctcgccgtc gaagacgtgg tatcccgcgc cgatgatcga taccacgccc acgatcagca 1967521 ccatccacca cagcatatgg atgtccgcgg ctgggcggtg ccggtgacgc cggctctgca 1967581 cgaacgcacc gattagcgcg acgattaccc cgacaatggt gaacattcca acacccttcc 1967641 ctagctttag ggtcccgtca tgctgtcgaa tctcattgac cgcacgcaac actagcggac 1967701 gggctggcgc tcaccgctgt tgcgggcgtc ccgagaacgc cggccgagta atgggggagc 1967761 ggacctttcc gtacttcata tcgcttttgc cggtccggac gcgtggtggt aagcgctgcc 1967821 tcgtggttcg cgcacccaca gggtgtccgc tttgccgacc gcggttccct cgtcgatcaa 1967881 ctggcgcttg agcaccttgt gtgtggcggt gctgggaagg tcggccgcga tgcggatgta 1967941 tcgtggccgg gctttagtgg ataggtcagg ctgggcgtcc agaaatgctt cgaacgcgtc 1968001 agggtcgaag gtgtcacctg ctcgcaagac caacgccgcc atcacctgat cgccgacgta 1968061 ttcgtccggg acggcataca ccgcgacacg gttaatagcc ttgtatcgta atagaattcg 1968121 ctcgattggt gccgctgtca ggttctcgcc gtctacccgc atccagtcgg cggtgcggcc 1968181 agcaaggtag atccagcctt cagagtcccg gtatgcgagg tctccagacc agtacatgcc 1968241 gtggcgcatg cgctcggcgt tggcttcggg gtcattgtag tagccggtga agaagcccga 1968301 ccccgtcgtg ttgaccaact cacctatggc ttcatcggcg ttggtgagtg ctccgtgagc 1968361 gtcgaaccgc gcgacggcgc actcggtgac ggtttcgccg ttgtacaccg cgaccccgtg 1968421 ggctccccgg ccgatcgagc ccggtggcgt gccgggttcg cggatcacga tgaccgcgtt 1968481 ctcggtcgag ccaaagccgt cctcgacctg gactccgaag cggcgtgaga attcctcgat 1968541 gtctttgtca ttggcctcgt tgccgaaagc cacccgcagc ggattgtcgg catcgtcgtc 1968601 gcgttcgggg gtggcaagga tataggcgag cggcttgccg acgtagttca tataagtggc 1968661 gtggtatcgg cggacgtcgt cgaggaagcc ggtcgccgaa aacgtcgccg gcgcgatcgc 1968721 ggcaccggag accaccgctg gcgcccatcc cgcgaccacc gcgttggagt gaaacagcgg 1968781 catggataca tagcaggtgt cctgttcggt gagcccgaag cgctcggtga ggctacgccc 1968841 ggcgaacgtg gccattaggt gtgacaccgg taccgctttg ggatttccgc tggtgccgga 1968901 cgtgaagatc atcatgaacg gatccatcgt gtcgacttct cgatagggga caaaggcgcc 1968961 gtcaccagcc accaattcag cccaccgcgg tgtcgaggta tcaaggatcc gcgcgcccgc 1969021 gaggtctaaa ccgtccaaca gcgctcggtg gtcggcatcg gtcaccacga tctggcaatc 1969081 ggctcgcctg acgtcagcgg ccagtgcatc gccacgtcgc gttgtgttca ggccacacag 1969141 cacatagccg cccaacccgg ccgcagccag ctgggccagc atctcgggcg tattccccag 1969201 cagagagccg atatgcgtcg gacgttgcgg atcggcgatt gtgatgaggg ccgccgcgcg 1969261 ggccgccgac tccgccaggt actgactcca agtccattgc agaccaccgt atttcacggc 1969321 aatcgttgga tcggatacgt gctggcgcaa gagcgattga atcgtgtcgg tcatgaattc 1969381 gctcccatgt cgagtcgcgg gctttggccg cgacgctgtc atccagcatg atcgccacga 1969441 tgccatcaat ggccaggagg tcgcgacatg acaacaagat caccacgccg gcagtggatt 1969501 gcctcacgat cgaacgtcta gattctcccg cgtccggcgc ccctcaggtc accccttatg 1969561 ctagggcgct aatgggcgag acaaccacgt gcgcgatcat cggcggcggc ccggccggga 1969621 tggttctggg cctgctgttg gcgcgggcag gtgtgcaggt caccctgttg gagaagcacg 1969681 gagacttcct gcgcgacttt cgtggcgaca cggtgcatcc gacgacgatg cggctactcg 1969741 acgagcttgg gctgtgggaa cgctttgcgg ctttgcccta cagcgaggtc cgcacggcca 1969801 cattgcattc gaatggtcgc gcggtgacct acatcgactt cgagcgactg catcagccct 1969861 acccctatgt cgcaatggtg ccgcaatggg acctgctgaa cctgctggcg gaggccgccc 1969921 aagcggaacc gagctttacg ctgcggatga aaaccgaggt gaccgggttg ctgcgggagg 1969981 gcggcaaagt tacgggggtg cgctatcaag gagccgaggg cccgggtgaa ttgcgggcgg 1970041 aattgaccgt ggcgtgcgac ggccgatggt cgatcgcccg gcacgaggct ggactgaagg 1970101 cgcgtgaatt cccggtgaac tttgacgtgt ggtggttcaa gctgccacgt gaaggtgacg 1970161 ccgagttctc gttcctgccg cgattctccc cgggcaaggg gctcggcgtg atcccacgcg 1970221 aaggttattt ccagatcgcc tacctcgggc ccaagggaac cgacgctcag ttgcgcgagc 1970281 gaggtatcga ggaattccgt cgggacgtca gcgaactgct gcccgaagcg acggcatcgg 1970341 tggcggcgct agcgtccacg gacgaggtca agcacctcaa cgtcaaggtg aatcggttgc 1970401 gtcgttggca cattgatggg ctgctgtgca tcggcgacgc ggcgcacgcg atgtcaccgg 1970461 tggcgggagt cggcatcaac ctagcggtcc aagatgcggt cgcggcagcg accatcttgg 1970521 ccgaaccgct gcgtgagcat cgagtcagca gccgccacct ggcagcggta cggcgtcgtc 1970581 gcgcatttcc caccgcggtg acccaagcgg tgcagcgggt gttgcaccga aggctgctcg 1970641 gcccgctgct gcagggccgg gaccccacgc cgccggcggc cctgcttggc ctggtcgaac 1970701 ggctgccatg gctctcggcg gtgcccgcct actttgtggg agttggagtc cggcctgagc 1970761 atgctccggc cttcgcacgt cgcgggcccg gcaaccgcaa aggcccttga gccgacatgc 1970821 gcgccgccgc gaatcggcgt cttgggtata gcccggatag cgccgttggc gctcatcaag 1970881 ccggtcagcg ggagcgtcgt ggtggcagca cgtgatgtgt cgcgggtggc gcgaccatgg 1970941 acgctggctg ctatgccgtc cacatggccc acacgttcgg tggggccacg ccggaagtgg 1971001 tttcggcgca agccaaatta cgcgatccag cggtcgatcg ggccatgacg gccgaactga 1971061 aatttccagg cgggcacacc ggcgggatcc gctgttcaat gcggtcgtcg gatctgttga 1971121 atgtgagcgc tcgagtggtc ggcgaccgtg gcgagttgcg cgtgctcaat ccggttgtgc 1971181 cccaactctt ccaccgattg ccgcccctcg catgcgtatc agctcgacgc tttcgctgcc 1971241 gcagtgctgc gcgggcaagc ggtcaagacg acgcccaagg acgcggtcga gaacatgagc 1971301 gcgatccacg cgatctatcg ggccgccggg ctcccatcgc gcaacccgag ctgaatatgg 1971361 tcgccgcgag cgggtccgcc gcctgacagg ccaatggcgt cggtcgctta cccgccaggg 1971421 ttaggacgtg gtgccttgga agaaacccgc caggttggtg ccgatattgg caaagccgga 1971481 aacgacgctg gctaccgaga acggcaggat gcccctgttg gcgaagcctg agacgccgct 1971541 gccaaggttg gaaaagcccg aggatagccc gccgaagttc tgatagcccg agccgcccaa 1971601 cagcccggcc gggttggtgt tgaaccaacc cgagaggccc gagccgttgt tgccgaagcc 1971661 cgagttgccg cccgcaccgg aattgaagaa gcccgacgaa ggcgcggtgc tcgagttgaa 1971721 gtagcccggc cccccgggga tcgcgaaggc cccgatcgtg gtgctgggca ggtggatgcc 1971781 gggaacggtg agcgggggcg tggtgaagcc ccccacgccg atcggctcga tggtgagcgg 1971841 tggggtggtg atgggtgggg tggtgatttg ggggagggtg aagccggtga ggttgatggg 1971901 gtcgatggtc agcggtgggg tggtgatggg tggggtggtg atttggggga gggtgaagcc 1971961 ggtgaggttg atggggtcga tggtcagcgg tggggtggtg atgggtgggg tggtgatttg 1972021 ggggagggtg aagccggtga ggttgatggg gtcgatggtc agcggtgggg tggtgatggg 1972081 tggggtggtg atttggggga gggtgaagcc ggtgaggttg atggggtcga tggtcagcgg 1972141 tggggtggtg atgggtgggg tggtgatttg cggcagggtg aacccgccga cgccgatcga 1972201 gttgatggtt agctccgggg tgatgatttc ctgggtggtg atctgcggca gagtgaagcc 1972261 gcccacgccg atcggaggga tcgcgaactc cggggtggtg atagctgggg tggtgatctg 1972321 cggcagggtg aagccatcga cgttgatagc ggggacatcg atcccgggta tgttgaaggc 1972381 gggcagaaag aatgaaccga tgacaatagg gccggtcaat gtgtatgggt gaaccaccaa 1972441 ttgtggtaag tcaaactcac cgaagatgag ggcgccattg gtgaaagtac taagcccgcc 1972501 gccgggcggc tgaagcgcag gcacattggt ctggaattgt agggtaaagg gtattccaaa 1972561 agccggtact gttatcctag gtgtgcttag gaaaacatcc cagcctatgg agggcaggcc 1972621 aaattggccc acgccaatct ggccgaccgt tatcggttga gtatgtatcg caggtagact 1972681 aaagccaccg attgtgatac ccgcgggtat cgtcagctgc ggaatagtta cttccggaat 1972741 ctgcaatggc ggcaaattaa aagcacccac cgtaatgggc gggaccgtca ccggcggaat 1972801 ggctacggaa ggaatactca gcggaggcaa ctgaaagccg cttacggtga tgttggctgg 1972861 tgtggtggcg gccgggatgt tcaacgacgg caacgtcaac ccgggcaggc tgaaggcgcc 1972921 gacggtgatg ttggctggtg tggtggcggc cgggatgttc aacgacggca acgtcaaccc 1972981 gggcaggctg aaggcgccga cggtgatgtt ggctggtgtg gtggcggccg ggatgttcaa 1973041 cgacggcaac gtcaacccgg gcaggctgaa ggcgccgacg gtgatgttgg ccggtgtggt 1973101 ggcggccggg atggtcagcg acggcagcgt tattgccggc agactgaagg cgggaaccga 1973161 tatccccggt atttgcagcg gcggcagagt cagatcaggt gtcgtaatac tgaactgcag 1973221 gctgccctgc cccacgcccc ggtagaagac gccattgttc atgtcacccg tgttgaacag 1973281 cccattattc atgtggccaa tattgaagac accagtgttg atatttccgg cgttgaggaa 1973341 acccgtgtta gcatttcccg tgttgaacgt gccggtgttg gacgaccccg gattgaagtc 1973401 gcccatgtta taactgccgg tgttcaggct gcctgtgttc gcgttgccga cgtccaacat 1973461 accggtgtta aacgagcccg cattgaagaa gcccgtgttc ccgtgtccag aattccagcc 1973521 accggtgttg aaattacccg agtttccgat gccaaagttt ccattgccgg agttgaagaa 1973581 gccgacgttt ccgctgcccg agttgaacaa tccgaaattc ccggtgcccg agttgagccc 1973641 gccgattccg atctggttgt tgccggtaag accgatgccg atgttgttgt tgccagtgtt 1973701 gccaaagccg aagttgccca agccggtgtt ggcaaacccg gtgttgagat tgccaaggtt 1973761 tcccacgccg acattgttgc tgccgaggtt cccgaagccg atgttgttat tacccaggct 1973821 tgctgagccg atattggagt taccgaaatt tccggacccg aaattgtagt tgccaaggtt 1973881 ggcgttgccg atgttggcaa ggccgttgtt ggcgttgccg acgttgccac cgccgacgtt 1973941 ggctatgccc aggttgatgg cggtgggtcc gcccgcaagc gccggtatgc ctgcggctgc 1974001 ggtcatggcg gccgcaggcg cgccgctggc caaccaagcc ggcaagccag ccaggttctg 1974061 cagcggttta ctgaacgggg acagcgccga ggcgatcgcc gatgccccgg catggtaggc 1974121 agacatcgcc gacacatcgg cagcccacat ttgctcgtac gtggcttcaa tggcagcgat 1974181 cgccggagcg ttctgtccaa acaggttcga catcaccagc gacaccaggt cggcacggtt 1974241 ggccgccacc agcatcggct gcaccaccgc cgtcttgacc gcttcaaact cggctatcat 1974301 cgccgcagcc tgagcggccg tctgctcggc ctggaccgcc gccgcggcaa gccacgccgc 1974361 atagggggct gccgctgccg ccatcgccga cgacgacgcg ccctgccacg ccccgcccac 1974421 gagtccggat gtcactgagc cgaaagaggc tgcggccgag gccaattcca tggccaaccc 1974481 gtcccaggcc gtcgcggccg ccgccatcgg ttccggccct gccccggcga atatcagcgc 1974541 tgaattgatc tccggcggca gtacagaaaa attcatcgtc cagccttccc tgcgtgcccc 1974601 gcgtgatcag cggtaaaccg tggccggtga gtggctcttg gcccacaagc tagacgctga 1974661 accgtcgtgg ccacataaat atcgcgcaca aatggccacg actcataggt ttcgtaaatt 1974721 tgatttacaa aaggcgctct cgggtcatgc ggaccgcaag cggcgtccga acgcaggggc 1974781 tatggcagca cggtgtgcat caacatcacg ttgtatgccg accacaaaga caggttaaag 1974841 tagacgtctt tgcccgtcga ccagggatgc atcatcggcg cgtagatgcc gccgggcatc 1974901 tgccatgacg acaccagcat ttgctctgcg ctccacggtc cttgcggagc cggcgcggtc 1974961 cttgccacca cgtcgttcat accgttggtg tagagcgcca ggtattgctt gaggtaggtg 1975021 ttgtattgga cggacatttc gcccaccggg cccggaataa cgggtgttgc cgcgtccggc 1975081 ttgtttggaa cccaggagtt cgagtcgccg ttccagtact ggtacttggt gaggtcgggc 1975141 acaaagcgct gcggaactcg tgccagatat gccgaaccgc ctcgcccggg cggggtcccg 1975201 aacgagtaga ggtaaccgtc gttggacttg aggtacgccc ccatctggaa gttctcattt 1975261 cccggaacga acctggcttt tccgccgctg tccggtccgg acgcgcggat ggtgcccggg 1975321 aagacccccc aggtctgacc attgtccttg gacaccgcga tgcccgagta gttcgtcgtc 1975381 cattccccat cacggcccca attcctgatg gacatgaagt tgacgtattg ggttttgccg 1975441 acggcgatgc ccgcggtcgg aatgatcccc gtctcgtcgc gcgcccattt gatgctgttg 1975501 atgagctgtt tggagaagcc cggttggcgt accggtgagc cggaatatct gttggaagcg 1975561 tcaccggatg tcacatgaac tccgttgccc aggtcgcggt cttggctgcg gaacagcgtg 1975621 ttgtatcgcc attgatggcc atcgacagcg cagtagccga atgtgtcgcc gaagatcatg 1975681 agcacctgac ggttggcggg atcgccgtta tcccaaggaa ttccgaggtc ggtcccggag 1975741 atgccgaagc gttccagggt cttgttgggg ctgtccggtc cggtcaccca ctcggcgagg 1975801 gatgtggtag ccccggcgag cgtggcacca ggatccggcg ccgccgccgg agcagggtcg 1975861 ggtgctgggg ctgggttcgg agttagctga gtggcattcg ggggttgtgg gcccgtggct 1975921 ggcggattgg gtgccggatt gggcccagga ttggccctgg ggactagcgc ttgctgttgt 1975981 agcggcgcgg catttctagc acccgggttg agcaatgcgg atatcagtgg acccagcttg 1976041 ggtagcggtg cacggtcgtt ggcaccgcga ggcttgcgtc cggtcggtat cggaccgtgt 1976101 ccaggtcgta ccgggccgag cgccgtggcc ccggggtcgg tcacaatggc gctcggtggc 1976161 ggcggagcgt tcgcggcgtc tccgctgcac ggcgccgcca tcgctggtgg cgccaggcct 1976221 attggaacca tgagtccaat agcggccgcc cacgccagcg ataccgacac gattcgagga 1976281 atcggcgaca tgtcacacct tcccgggctg gacgttgcaa ttgacgtccg cagttcgctg 1976341 atgtgacgat agtgatctct gggactcttg tgatcagtga tccactgata ggtatgcctc 1976401 cgtgaccgtg tcgcaaccca tctgttcatc tccgacctgc gctgctgcac tcggacttgg 1976461 taccggtaca ttcaaggccc atcggggccg cggataccac gaccaccggt gccgaacatc 1976521 gacgatccga tcaatttgcg tccactgtcg cccggacagg tcaacaaggt gtggctctgg 1976581 caatcgctac ccggtccctg gatcgggtcc gcacggaata ccgtgtacct gaccggattt 1976641 gagttcctcg agccttagca cggaccgctc ggaataccac gggtaggcgt ggtttcctgc 1976701 gtgggcatga tctgtggatc aggaacccga tacgggattc cacggtttat cgtgcccagc 1976761 gccgcgttgg gcacgcactg cggcaccgtt gatagcgcgt gcagcccggg ataatccagg 1976821 ttgggccatg atgagttggg cgggacagcg aagttgaacg ttgacgtcat gtcgccggtc 1976881 acactgcgcc gccaagccgt gaggttggga actggcaccc cgaaccgagt ttcgagcaat 1976941 ctcagctgtg aggtgtggtc aaacgtgtcg tgaaccatct gcgggccacg gctgtacggc 1977001 gaaatgacga agcagggaac gcgaaagccc aaaccgatcg gcccgcgtat tccgccggag 1977061 cccggcacct gatcgatgtc aggcaccgtg acatattcgc cgggagtccc ggccggcgcg 1977121 gtagcaggaa caacgtggtc gaaaaagccg ccgttttcgt cgtagctgac gatcagcgcc 1977181 gtcttttccc acaccgcagg attggcaagc aatattctta agatgttgac gattgcgaaa 1977241 gccccggccg cggctggaac cgcaggatgt tcggattcga gaacattggg aatcacccag 1977301 gagacccgcg gcagtctatt ggctaagacg tcggccgcga agctcgcggg atagcttggt 1977361 gccacgccaa agcggacaag atctgacctg ggatcggctg actgtttgaa agacgtcaca 1977421 agcgagccgt aagtaagaac cgaggagatg ggcccgagtg tcttgttgcg atacaccttc 1977481 cagctgacgc cggcatcgct aaggttctgc ggcatgatgc gccaaccgaa tctccgcacc 1977541 ggttggaaag tgggactctg cagctccggc ccaccatttt ggccgtcggg gtcgatggtg 1977601 gcgctcagcc aataaaggcg gttgggcagg gtgggaccca ataccgagca aaagtagcgg 1977661 tcgcagacgg tgaacgcgtc ggccaacaga tagtggatcg gaatgtcttg gcgcgtgtag 1977721 tagcccatca ccgtgggagt gtgggccgcc gagcgagtct tggcctgcgc tggtagccag 1977781 ttgtcgttga cgccaccatt ccacgactca tgcatcgcca cccagctatg gtcagggtcg 1977841 ttgacacacg cgccgtcgag gaacgggcct cgggtggtgt cgaagcggta gggcatcgta 1977901 acgccggtgg cgtcaagagc ctgcgtcatc gggttccaac ccttttgttg gaagagcggc 1977961 gataccgtgt tgaatccatc ggtgccggag agtgttccga agtagtgatc gaatgagcgg 1978021 ttctcctgca tgaagaacac aaagtgttcg atgtcggtca aatggccgga gcagggcccc 1978081 gcgccgtagg ccttttcaat caccggaccg gcgaaagaca tcaacgcgcc ggcgccgccc 1978141 gcagcgacct tagctaggaa ttctcgccgc gaaactccgc cgatgtggct ttggctcacc 1978201 gctgtgttct cctgtcgaac ctccagccgc atttcagctc aaggtagagg actccgacga 1978261 acgatcacga cgcgccaacc ggcgtgtcgc gcgcgaactt gcggatttcg gccgcaaact 1978321 tgaaccgcta tgcggtgtct tgggtgtcgc cgcaccgggt gcttacggcc atggcgccca 1978381 atgttgctag catgctggct gctgcccgtt gccaggtgaa tatttcggcg cggcggcgcg 1978441 cgcagcggcg ccggtggcgt tcgggccggc tgacgattgt gcggactgcg tgtgcgatgg 1978501 cctctgggcg gttgtcggcg caggcgccgc tgtctgcggt gatgatctcg gtcagcgccg 1978561 aggtgcggga caccacggcc ggtgtgccac acgccagcga ttcgagtgcg gctagcccaa 1978621 atgtctcgtg tggcccaggt gccaatgcga catcggccga tgccagcagg ccagcgacgg 1978681 catgccgatc cgagatgaaa ccggtgaagt cgatcggcaa cccggttgcc ttgcgttcca 1978741 gcctggcgcg cagcggaccc tcgccagcga tgaccagtcg agcgtcgacg ccggcgtcac 1978801 acaatgcggc gagtgcgtcg atgctgcggt cagcgtgctt ttccaccgac agccggccgc 1978861 agtggaccag caggatctgc gtcggggtgg cccagtgctg ccgaacccgg gcacagcgcc 1978921 gccgcgggtg gaaggtcttc aggtctacgc ccagtgggac ggtgacggta tttgtcgctc 1978981 cgatgcggtc gaattcttcg cgcgcgaacc cggtagtaca cacgacagtg tcgtagttgg 1979041 cggcggttcg cgcgttggcg aagtctgcga acttctgcgc ggctcgacgc ggaagcaatt 1979101 ggcccgcaaa gcgatcaaga cgctcgtggg agatcatcac cgtcgtaacg ccgtgttcgc 1979161 ggccccaccg gcccagtgac ctcagggtga gccggtcgga gacctccagg gtgtctggtc 1979221 gcagtgtttc caatacagtc cgcacggctc ctggcataac cgcgcgataa ccaccggtat 1979281 atggaatatg cttggcgggc aaggtaattc gaacaacacc cgtgcgtagg aggtgtcgtt 1979341 cggtgcgcgc ccccgggacg atcaaaacac ctcgtgtccg ctggcgcagt attccgcgcc 1979401 cagccggtcc accgcggtgc ggagtccgcc cgagcgaggt ccatagaagt tggcgacctg 1979461 aacaacacgc ataccgtgag cagaaccggc cgtcgtgtgc ggtcaacgac atagcatcga 1979521 cggttccctg aacggaccat gaactcctgc ggagcgggca cctgcctgcg cttcgcgcca 1979581 gccgacagac acaaccagaa cttgtgagcg cacaaggtca aacccgctac tggaagttcg 1979641 agcaacggcg gagcatggga gttgaccgat cgaggggaga acaggacaac gattgccatg 1979701 ctggagaccg ctggattatg gggcaagcgc gccgacatga ttgtgcgtgg atgcttgcct 1979761 tataacgctg agccaccgcc ggccgtgttg gctggcagcg acatcacccc gatcaatgcg 1979821 ttctacgtcc gcaatcacgg cccggtcccc gacatcgcgc cgcagcattg gcggctgacg 1979881 gtcggcgggc tggtggacaa cccgcttacc gtgacctatg aacggctgac caccgagttc 1979941 gaccaacact gtgtggtggc gacgctggcg tgcgccggca atcggcgtgc ggagctgtta 1980001 cgggtgcgac agatcccagg taaggaaccc tgggcgcacg gtgcgatctc gaccgctcag 1980061 tggtgcggtg tccgtctggc agacatcctg caggccgccg atgtgcatat cgacgagggg 1980121 ctacacgtgg ccttcgatgc tccggatgta gctgaggagg ctcgccccat ccagccctat 1980181 ggcagctcga tcccgctgag caaagcgctg tcgccggaag ttctgctggc ctggcagatg 1980241 aactccgaac cgctgccgcg tgcccacggt ggtccggttc gcgtggtggt acccggattc 1980301 atcggggccc gcagcgtcaa gtgggtcacc gccatcaccg tgcagcctgg tgcttcgcag 1980361 aattactttc aggctctgga ttaccgcatc cttccggcgg atgcggacgc cgacatcgtc 1980421 gggccgggcg aagggatttc gctttcgtcg ctggcgctca actgcgacat cctcgacccc 1980481 accgatggcg acgacgtacc ggcaggggcg ctgaccattc gtggctatgg gatggccggc 1980541 gatggccgca gtgtcgaacg agttgatgtc tctgtcgacg acgggctcac ctggcagcag 1980601 gccgacctac acgccgcgcc cagccagtgg tcatggcggc cgtggtcgct gacggtcgac 1980661 gtggagccgg ggccgttggg tatcaccgca cgtgcctggg acgataccgg ggcgctgcag 1980721 cccgaatcgg ctgtgtccct gtggaatccg cgcggatacg gcaacaacgc ttgggcccgc 1980781 gtcgcattgc gcgtgagtta gccgggtact cggtcatcaa ccggttgcgg ggccctccga 1980841 agaccactgg aaagcactgc cgatctgatg gttagggtgg ttgaattagc cgactcggtc 1980901 ggcggtgacg ccccgaggtc aggtgaggcg aaggtgatgc cagttgacgg aactagccgg 1980961 cgacacgata cccgaccggt ggctctgctg aggccgacgc ggtgaccgcc attggacgac 1981021 ttatccatcg ctacgcgata tggatcgtcg gcgtctgggc gctcgcggcc atcatcggga 1981081 ataactttgc cccgccactc gagcaggtca tcaccgccga ggatcagccg ttctcgccgg 1981141 ctggcaccgc cacttcgcgt gccgtggaac ggtcagcggc ggccttctcc caagcgcccg 1981201 gcgacaacat cggatatctc gtgctggagc gaaacggagt cctcaacgac caggaccggg 1981261 cttactacga tgcgctggtc gtggccctac gccgtgattc ccgccacgtc atcgaggtgg 1981321 tggactggtg gggaaccccg gccatcgcgg aggtcgcccg cagcgacgac catcacgcgg 1981381 tgacagctgc cctgcgcttt gggggcatgg tcggaacgtc gcaagccggg gagtcgataa 1981441 ccgccgcgcg cagcatcgtt acccaactgc atccccccga cggtttgcac gtattcgtca 1981501 ccggtcctgg cgccaccatc gtggacgagt tcgcggcaat cgacagacag acccagctca 1981561 ttacggcaac gacaatcgtg gtgttactga tcctcttgtt gatcgtctac cgatccgcga 1981621 tcaccgcgac ggtgccgttg ttgtcggtcg tcgtttccct agccgtggcc aagccgatcg 1981681 tttccgtcct tgtcgaccgc gatttcatcg ggatatccct gttttccctc ggacttagcg 1981741 ttgcggtggt tgtcggcgcg ggaaccggct tcgcgatgtt cctgatcggg cgttaccacg 1981801 aacgacgaag gcaacatatt gccccggcgg cggcgctggc agacgcgtac cgcggggtgg 1981861 cgccggcgat cgcgggtgcg acgttcatcg tggtcacatc gctgggcgct gtgggatggc 1981921 tgagcctggc acggatcggt atgttcgcaa caaccggaat cctttgctcg attggcgttc 1981981 tcgcagtggg cctggccgca ctgacgttga cgccagctct cgtcgcgctg gccagccgtg 1982041 ccaacctcct caaaccgcca caacacaagc gcatacagcg ccaatttcgg cgactcggca 1982101 cacatgtggc gcgctggccg gcgccgatat tggtagccag cggtgtgttc gtactcatca 1982161 tgatgatcgc gctccctagg gtgccgatcg gctgggacga agccgcggca accccgtccg 1982221 cggcggaatc caatcgcggt taccgggcgg ccgatcgcca ctttgccccg aaccaactgc 1982281 tgcccaccca ggtgatgatc gagaccgacc acgacatccg caatcccgcc ggtctgaccg 1982341 cgatcgaacg aatcactgcc gcgatcatgg ctattggcgg tgtgcgcatg gtgcagtcgg 1982401 cgagtcatcc caacggaatg gtgtccaagc aggctgcctt gacagcatcg gcggggaatc 1982461 tcggtgatca gctcgacgaa ttttccgatc agctcacatc caggcaggca acgttcacca 1982521 atctcgaagc tgcggtccgc gacgtggtgt cagccctcga tctggttcag gctggcatac 1982581 gacaggatgg ctatggactt ggccaggtca gtctggccgt ccggctgatg caacaggcga 1982641 taaccaaact tcagggcagc gccggtgacg tcttcgacat attcgacccg ttgcgtcgtt 1982701 tcgtcgcggc gatacccgag tgccgggcca accccgtgtg ttcggtcgcc caagaggtgg 1982761 tgcagtgggc aaacaccgtc accgagagct gtgcgaagct ggccgatgcg gcagggcagc 1982821 tcgcgcgggg gatcgctgat gtcgcctcgg cgacatcggg tgtgtccggg ctaccgaatg 1982881 ccctggacgg cattggaggt cagctggcgc aggtacgaga atcggccgca ggcgttcaag 1982941 agttacttaa caatgtcggc gcagcaccat tgcgagagct tcccgactat ttacgcgaac 1983001 ttgccgccgt ctcccagagt gcgccgggcg tggatctcta cgccgctcgg cgaattctga 1983061 ccgacccgaa tatgcgcgcg gtcttggact attttgtctc accaaacggc catgcaacgc 1983121 gtttactcgt ctacggcgac gggagcgagt ggggtgacga tggcgcccaa cgcgctcgcg 1983181 cgatcgtgac tgcggtggcc gaggaaaccg acgagggcac gctgcgaccc accgctgttg 1983241 agctgaccgg cgttggaccg gctacccgtg acctgcagga tctggtgggc agtgacctga 1983301 ccttgctggc ggtcatcaca ctggccgtta tcttcgcgat agccgcactg ctgctgcgca 1983361 gtccgcttgc cgggcttgtg gtcgtcggca caatcgcgac atcgtatatc tgtgcgcttg 1983421 gcgccagcgt agtgatttgg aaacacatac ttggcgataa cttgcactgg tcggtattgc 1983481 cgattgcgtt tgttttgctg atatcggtgg gttcggccta caacctgctc ttcgcgctgc 1983541 gcatccgcga agaaagtcct gccgggccac gaaccagtgt catccgagcg ttcgcggcga 1983601 ccggaatggt agtcacggcc gctggaatcg tgtttggcac aacgatgttc gcgctggccg 1983661 cgagtacctc gctgagcgtg gcacagatcg gcgttaccgt tggcatgggg ttattgctgg 1983721 acgcccttgt gatacgaggc tttgtcctgc cggccctgat ggttttgctg ggccgctggc 1983781 tgtggtggcc gcgccgatcg gttagcaacc ggcaggtacc cgagccgtcg ccggcctaaa 1983841 ttgaaccgat tcacgcgtgc atacgtatcc gagagtgtga cgagccgaag cgcagcagcc 1983901 ggctaggagc ttcgctgtcg gccgaaggct gggttgatgc ctgggccacc gcaacccgct 1983961 gcatttgcgc accggacagc accttagctg tgcaccaggg cgacgacccg tgcgcgaccg 1984021 gtcgccgaga tgggataccc gagcccgatc cctggccggt tcggccaggc acgccggacc 1984081 ctcctggtgt gggaatgtgt ccaaagggca ttattggtcc gtgagcgcat tcaacatctg 1984141 tcgattcgtt ggtgccccaa tggtcacgac gtgggcactg ctgttcgcgc ccgtacccgc 1984201 cgcgtctgcc gacccacccg acccgacggt atcggatggc gcgtgtcccg atgttgaggt 1984261 ggttttcgcc cgcgggaccg gcgagccacc cggcgtgggt gggatcggag aggacttcat 1984321 cgatgcgctg cgttccaaga ttggcgagaa gtctatgggc gtttatgggg tcgactaccc 1984381 ggcgaccacg gatttcccga cagcgatggc cggtatttac gacgcgggca cccatgtcga 1984441 acagacggcg gcgaactgtc cccaaagcaa gctggtgctc ggcggatttt cccaaggtgc 1984501 ggccgtgatg ggctttgtta ccgcggcggc gattccggat ggggcgccgt tggacgcgcc 1984561 caggccgatg ccgcccgaag tcgccgacca cgtggccgcc gtcacactct tcggaatgcc 1984621 ctcggttgcg ttcatgcact cgatcggcgc gccgccgatc gtcatcggtc cgctatatgc 1984681 agaaaagacc atccagctgt gcgccccggg cgaccccgtc tgttctagcg gaggcaattg 1984741 ggcggcgcat aacgggtacg ccgacgacgg catggtcgag caggccgcag tgtttgccgc 1984801 cggtcggctc ggttaaggca gtgtcagcca ctcgccactc agcccgacac cgatcggacg 1984861 tcgtgaccgg cgggaccgag aactgctcga tccgcaacaa cgccgcgacg tggattgtgt 1984921 cccatggtga gctgtgactt ggagtgcggg tggtgagctg aaggcccgtt gtcgaccgaa 1984981 acggggcgac gtccgcgact tcctgtacaa cctgatgctc tgggatttgg gctgcggatg 1985041 cgcggcgggg gttcgctctg gtgtcgtcgg tgttccgccg cgctacgtca agccgtgctg 1985101 cccatcccgg ccgagtacca gcccaccggc gccgccggca cccgccgtgc ccccggcttt 1985161 tccggcattg ccgccgttgc cgccgttgcc gatcaccacg gcgttgccgc cggctccacc 1985221 cttgccgctg gtggcgccat ccccgccggc gccaccgtca ccgccgttgc cgtacagccc 1985281 ggccttgccg ccggcgccgc cgttcccgcc ggcgccggta tcgctggcgc cgccggcgcc 1985341 gccggcgccg ccgaagccgc tgcgaaggcc ggtgatttgg ccggccccac cggtgccgcc 1985401 atcaccgcca gtgccattga ggctgtagcc cccgttgccg ccggccccgc cggagccgta 1985461 gaacaatccc gcgctgccgc cggcgccgcc agcaccggcc ttgcctgaca ggctggagcc 1985521 gccgctgccg ccggcaccgc ccgacgcatt gagggtgagc gagccagcat tgccgcctac 1985581 accgccaccc ccgccggcca tgcccccatg gccgccggcc ccgccagagc cgccggcacc 1985641 gtacagccca ccgggcccgc cggcaccgcc tgtccctccg gcccccgtgg agccgccgtt 1985701 cccacctggt ccgccggttc ccccgtgggc gtacagcccg ccggccccgc cggccccgcc 1985761 ggcgcccccg gcggtgctgc cggtcccgcc ggcgccgccg gcccccccgt tggcgaacaa 1985821 cccggcagcg ccgccagtcc cgccggcgcc gccagtggta acgcctgcgg tgaaagcgcc 1985881 gccgccacac ccgccgagcc cagccgcgcc gatgagcaag ccggcgttcc cgccggcccc 1985941 gccgacgccg ccggtggtgg tggcggcccc gccgacacca ccggtaccgc cggaaccgat 1986001 caagaaggcg gatccgccgg cgccaccggc cccgccggca cccgccgttc cgacgccgcc 1986061 ggccccgccg gcgccaccgg tgccaaacag gatcccgcct gccccaccgg cgccgcccgc 1986121 gctgccgttg gtgccggccg caccggcccc gccgttgccg ccgttgccga acaaccagcc 1986181 gccggcaccg ccatcgtccc cggttcccgg cgtcccactg tcgccgttac cgatcagcgg 1986241 gcgtccggtc aatgcctcgg tgggttcgtt gatgaaactg agaatgtcct gctgcaggtt 1986301 gtgccatggc gaggtgctct cgggagcgtt atatccgtcg gcgcccagca gcaacccgcc 1986361 gaagccgccg aagccggact tgccggcgag cgcgccgatg ccgccctcgc cgccgttgcc 1986421 gatcagcacg gcatttccac cggccccgcc gacaccaccg gtgccgccac tctcgccgcc 1986481 gttgccgccg ttgccgccgt tgccgatcaa cccgggcgcc ccacccgccc cacccgcccc 1986541 acctgcggcg gtgcccgcgg ggcccccaga gccgccagca ccgccggagc cgccggagcc 1986601 gctgagcatg ccggcgctgc cgccgacccc accctgcccg ccggcggcga agccgaaccc 1986661 gccggtgccg ccacccccgc cggagccgaa gagcatgccg gcgttaccgc cggctccgcc 1986721 ggcgccgccc ttaccaccac cgaagacagt gccgccagcc ccaccggtgc cgccggcgcc 1986781 accggcggca cccagggaaa gcgtcccggc gttaccaccg ttaccggcag cgccgccggt 1986841 ggtcagtcct gacccgcctg ccccgccgtc cccgccggcg ccgaacaaac cgccgccccc 1986901 accgtccccg ccggccccgc cggtgccgag cgttccgtga tccccgaatc cgcccgcccc 1986961 gcccatgccg ccggcaccaa acaacccgcc ggccccgccg gcgccgcccg ccccgcccgt 1987021 gtgaccctgc ccgccggcgc cgccgacacc gccggtggtg aacagcccac cggccccgcc 1987081 ggcgccgcca gccccaccgg cagtgctgaa gctgaacccg ccggcaccgc cggccccggc 1987141 ggcgccggcg agcataccgg cgttgccgcc ggttccgccg gtaccgccga tgccaccgac 1987201 aagagacgtc gcagccccgc cggcgccgcc ggcgccgccg gccccaaaca gcatggcgga 1987261 cccgccagcg ccaccggccc cgccgatccc gttgttggcg gtggcggttc cgccggcacc 1987321 gccggccccg ccgttgccga acagcccggc ggccccacca gggccaccag ccccgccgtt 1987381 ggcgcccttt gcaccggatc cgccggcgcc accgttgccg atcaaccagc cggcatcccc 1987441 tccgttggcc ccggtgccgg gagcaccgtt agccccgtta ccgatcagcg gacggccggt 1987501 agcggccagg acgggcgcgt tgatcgagtt gagcagcggc gtcacggcgg cggcctcggc 1987561 ggccgcatag gcgccccccc ggtggtcagc gcctgcacga accgaccatg aaacgccgcc 1987621 gcctcggcgc tcgccgcctg ataggcccgg ccgtgcgcgc cgaacaacgc agcgattgcc 1987681 gccgagatct catcggcacc ggcggccagc aggctcgtcg tgttggccgc cgcagccgcg 1987741 ttggccccag cgatcgtcga gccgagatcg gctagatccg tcgccgccgc cgcgatagtc 1987801 tccggcaccg cgatcacaaa cgacatctga aaacctccca cgaccgctga ccaccaggta 1987861 atgccgacga cccaggaagc ctcggcgccg ggtgaatcgg tgccaatcag cgtatgggcg 1987921 ggcaggcgac ccaaccggtg ttccagcccg actcataccc gctgtcaaat gacctgacaa 1987981 tcactcggtg gtcacacgct gcgtgcttca cattggtagc ttgggcacgt cggcaaccgt 1988041 cacagctgtc acacgggtcc ctgtggggtt ggtcggccac cggcgacaac gtttcctgcg 1988101 cgccttgatc tgtcgccgct gggcaggcat cgccgcgacg gccgtatcag gcttggtcgg 1988161 tgtgagccgc caaatcggta ttgacgaatt cgtcatcgaa ctcccggcca agaccactta 1988221 ggtctgatgg cctggttctc gtcctcaagc cgcgttagca ccacttcggg acgccacgcg 1988281 gttcagcccg ttctcctcga atagcagcct gccggtgcca ccggcgtctg ggcaccccag 1988341 actttcgcgc cgctgtcacc cgttgcgaag gcccccgcaa tggcacggtc accgacatgt 1988401 gatgccgagg ggctgcgccg gggctagatt cgcgtgcaat gcgtgcctaa actttttggc 1988461 ggggttgggg atttctgaac cgatcagtcc cgggtgggcg gctatggagc gactaagcgg 1988521 actcgatgct ttcttcctct atatggagac accgtcgcag ccgctgaacg tgtgctgcgt 1988581 cttggagttg gacacctcga cgatgccggg cggctacacg tacggccggt ttcatgccgc 1988641 gttggagaag tatgtcaagg cggcgcccga atttcggatg aagctcgccg ataccgagct 1988701 taacctggat caccccgtgt gggtggacga cgacaatttt cagatccggc accacctgcg 1988761 ccgggtcgct atgcccgcgc ccggagggcg tcgcgagctg gccgagatct gtgggtacat 1988821 cgccgggttg ccgctggacc gtgaccgccc gctgtgggag atgtgggtca tcgaaggcgg 1988881 tgcccgtagc gacaccgtgg cggtgatgct caaggtccac cacgccgtgg tcgacggtgt 1988941 cgccggtgcg aacctgctgt cccacctgtg cagcctgcag cccgatgcgc cggcaccgca 1989001 acctgtccgg ggcaccggtg gcggcaatgt gctgcagata gctgcgagtg ggctggtggg 1989061 gttcgcgtcg cggccagtgc ggctggcgac ggtggtaccg gcgacagtgc tcacattggt 1989121 gcgcacattg ctgcgtgccc gtgagggccg taccatggcc gccccgtttt cggccccacc 1989181 gactccgttc aacggccccc tcggtcggct gcgcaacatc gcgtatacac agctcgacat 1989241 gcgcgacgtc aagcgtgtca aggaccggtt tggggtgacc atcaacgatg tggtggtggc 1989301 gttgtgtgcc ggagcgctac ggcgcttcct actcgagcac ggcgtgctgc ccgaggcccc 1989361 gttggtggcc accgtgccgg tttcggtaca cgacaagtcg gaccgacccg ggcgcaacca 1989421 ggccacctgg atgttctgtc gggtaccgag ccagatcagc gaccccgccc agcgcatccg 1989481 caccatcgcc gccggaaaca ccgtcgctaa agaccacgcc gcggccatcg gccccaccct 1989541 gctgcacgac tggattcagt tcggcggctc gacgatgttc ggagcggcca tgcggatctt 1989601 gccgcacatt tcgataacgc atagccccgc ctacaatctg atcctgtcga atgtgcccgg 1989661 accccaggcc cagttgtact ttctgggttg ccgaatggac tcgatgtttc ccctcggccc 1989721 ctccttggca acgcgggcct caacatcacc gtcatgtccc tcaacgggga actgggtgtc 1989781 ggcattgtct cctgccccga cctgctgccg gacttgtggg gcgtggcaga cgggtttccc 1989841 gaggcgctca aagagctgct ggagtgcagt gatgaccagc cggaaggcag caaccaccag 1989901 gactcctgag tcgtacgttc agaaccggta gtcggtgccg gtgcccagaa cttcgatggc 1989961 tgcgttgatg ttcgggatca ctgtggcgcc gtatcggctg acgatctgcc caagcgcgcg 1990021 agcaaggtgc ggacccacgg cctcggcgat gagggcgtcc tcggcgatga cgatgccgtt 1990081 caccatgtgg gcagcgagca gccggccgtc gtgggcgacc tcgtaacgcc agtcaccgat 1990141 ctggattcgc tcgggattag accgaaaaaa gccacgtcgt gcgggggtat gaatcactcc 1990201 cggaagtccg gcgaacactt tgaccaccaa cgcgacaccg ccgggaccga cgagcgcggc 1990261 cgcgacggcg cggctcactc gctcggtatc gaaatcagac atcagctgtc catcggcagg 1990321 acgaatgacg gtgtgatcgt ttccccgctg ccggtgcggc gcacggcggt tcccgcggtg 1990381 tagaactcca ccgtgtgcac tccccacgcg tagttcgaga tggcgaagtg cacgcccacc 1990441 acgccggttg cgccgtctcg ctcggcctcg ctctgcatgc gtgacattgc cagctcacgc 1990501 gcttggtagt tgccttgcgt ccactgtggc atctccatgt tgcggccgat ctggcgaagc 1990561 gtttgcatga atccctgcac ggcgatgtgg aatacgcaat tgcccatcac gaacgccacc 1990621 ggcgcaaacc cggatcgcag cagcgtcacc atgtcctggc cggatagatg actggagaat 1990681 gcttggccgt tgggacgccg aaatgctccg ggcttggcgg tgtatcgcac tgcggtaccg 1990741 accgccatga actcaaggtg ttccccgccc tccccatggt ggcgccagtt gagccggaca 1990801 ccgacgatcc cgtccgcttt gagggcatcg gcttcggcct gcatgcgcgc catcgcattc 1990861 cagcgcgccc ggtatgtcgc ctcggtgagg acacccagtt cctgttgctg cctcatgccg 1990921 ctgaattgga agccgacgtg atagaccgag acacccatga ccagctcgat gggctcaaac 1990981 ccggccccat gcagcaatgc gaactcgttg atcgacaagt cggacgtgaa tgacttctca 1991041 gcgtgcgaca gccgttcgct ggctactgga tcgagcgagc ttgattgcat cgttgtgcgt 1991101 ccttcctgtg gtgtgtgtca gcgtacgacg cgcaaaccat gcagcgtctg ccatcagcgt 1991161 ccccagggca tcggcggcgt cttggcgccg gcaacgctgt tgtctggcag tcgcgccggg 1991221 gagtcgacgc taccggtcgg caccgcgccg gccgcgcatg agtgaggtgg cagcgcgtaa 1991281 cgcgccgcgt agtgcgtaga cggcagtcac cgccgccaac aggatcaaca ggacaaaggt 1991341 cggcaaccag ttggcccggc cgtgattgac attccaccag atgatcgtga acaacagcac 1991401 gcagacaagt agcacgatgt ctcgccaatt gccccggtag gacatcgccg ccttgcgcag 1991461 tgcgtgactc ttatcggctg catcgatcag gtcgtcaatg cgggcatcga ttgtgcgctg 1991521 taggttggcg cgccttttgg tggcatccgc cgggagccga tcgagcaggt ccatatcctg 1991581 cctgatcagc gctcgaaagt cggggccttt gaattgcccg gccgcaatgg caagcatggc 1991641 gccaccaaaa atgggcgcac tgttcagtgc gatttgtccc agccccggca catccactcc 1991701 ttcttcggct gatcatcaat gtcggcccgt tgaggatcgc cgaacttcgg ctggggatca 1991761 accgccacac cgcggacatc ccaaccaccg tccatcagtg cggcccgagg ggacaggcgg 1991821 cgttcttagc cgtcgggggc atcgggtctg tgaacagcgt gtgatggtga gtttcgtcgg 1991881 tggtggcgtg gtcgatgatc cagcccagtg ccgaacccga gccgagcagg caaatgcccg 1991941 gccccaattc gtgcgtagcg ccgccgcgga tcttgatggt ggcgtgcgca ggccgtcggg 1992001 tagcgggtgg gcgccaccag cgggtagcac gaacgaaatc cttgatgccc caaaccgttt 1992061 aagcgttact gcagggtaca ggtaccgagc gggacccgct gccgggccta gttgcttatc 1992121 ggtggtggtt gcggctggaa gggttcatac caccaccagt cggcgcgctc gccggtgggc 1992181 ccaggccacg gcgctaccgc cggcggcggc ttcgtcgacg cccgcgccaa cgatcccgcg 1992241 ctcaaaggtc ggcccgcgct gtcggcgacg gtgaggttgt ctgccggtcc ggtaatggtg 1992301 atcaggcccc gatggtgtgc ccggtggtga tacgggcaca ccagcaccag gttggccagc 1992361 tcggtggccc caccgtcctg ccaatgtcgg atgtggtggg cgtgcaaacc ccgggtggcc 1992421 ccacaaccgg gaaccacaca cgtgcggtcg cgatgctcaa gcgcccggcg caaccgacga 1992481 ttgatctgac gagtcgttcg accgcagcca atgacctgcc cgtcacgttc gaaccaggcc 1992541 tcaaaggtgg catcacagag cagatatcgg cgttcggact cgctgagcag cggacccagg 1992601 tgcaggccag cggcacgctc ctgcacgtct agatgcatca ccacggtggt gtgctgccca 1992661 tgtggccgac gagccacctc ggcgtcccag ccggcctcaa ccagacgcag aaacgcctca 1992721 acattgcccg gcaacggggg ccgctgatcc gacacaccgt cgctgttgtc gtgatcacgc 1992781 ttgtactcgg cgatcaacgc atccagatga gactgcaacg ccgcatcgaa cttcgccgcc 1992841 tccacgtgcg gaagcttgat tcgccaacaa ctgaactgct catcggcgct cctggtgatc 1992901 gagggccgcg gttccggccg aaaatccggt tcgggttcgg gtcgcggttc caacttgagc 1992961 gcggtccgca actgattcac cgtggcaacc ccggccaact gcgcataatg cgcatccgaa 1993021 ccctcacccg cccgccccgc gatcacccca acctgatcca acgacaaccg cccctcccgc 1993081 ataccccggg cgcagcgcgg aaactccggc aaccgccgcg ccaccgtggc gatcgtgtgg 1993141 gcgttgcccg acgagcagcc catcttccag gccaccaacc ccgccaccga ccgcgccccc 1993201 gtcacacccc acaacccgtc gcgatccagc tcagccacga tctccacaat gcgcccatca 1993261 atcgcattgc gctgaccggc caactccgcc aactcctcaa acaacacctc cacacgctcg 1993321 gcaggactga ctaccgctgc gccagacgtc gcggtcgagg acatgagttc atcatcgcag 1993381 cagggtctga caactccggc caacccgaat ccacgcccgg ggccgtgccg tcatcacccc 1993441 gcaaagagat gctcggctcc gccggtacgg gcaccccacg atccaacacc gcctgctcag 1993501 ccgccgacca ctcaacaacc acaaccgtca atgcagttaa cccggcccca ccacggcccc 1993561 aactacggcg ctcgatccag cgcgatccaa caacaccaaa accacacgat ccgcaccgca 1993621 ctcgcccccc gaaacggtcc tcacgatgcc cacgatggcc acctgaacta tcccaggctt 1993681 tgttcctagt cggtgcgagg gccggggttg gctggctcgc ggggtgtgag gtgccggtga 1993741 gggcggcctc gtactcggcc tggactccgg tagcctaggg cttcgtgcag gcattcctgg 1993801 ttgtaccagc cagccattcg gcggttgcca gtttgacgtc gtcgatgcac cgccagggtc 1993861 tgccgcggtt gatcaactcg gacttggagg cgacgttgac cgcgagggcg ttgtcataac 1993921 agtcgccacg agacccgacc gaaggggcga tcccgagctc agccagtcgg tcggtatagg 1993981 tcagcgatag ttactgcgat ccggggtcgg aatgatgcac caactcagaa agatctgaat 1994041 ttgattgcca aacagcatga ttgaatactt gtacgggcag atcttcggtg cgcatcgtcg 1994101 ccgagacggc ccgaacgacg atctttcggg tgcacacgtc ggtgacgaac gcggtgtagc 1994161 agaacccctg ccaggtccgc acgaacgtga tgtcggcgac ccacaaccgg ttgggcttac 1994221 tgccttgaat tgccggttta ccagatcagc cggccgtggt cggctacgtc ggtgacggtg 1994281 gtgaacacgg ccgttgcacg ccgcacagcc cggccttgcg catcaacggg cgggtttgtt 1994341 ctctgccgag gtgccaaccc ttgcgtttca tggcctggtg catcttgtta atcccgtaga 1994401 ccgagtagtt gtcgcggtgc gccgtgcgta ggcgaacttg agactatgac tgtgttttcc 1994461 ggccagtcgg atgcgccctg gcatggccgg cggtaggaga tccaatcgtg cattgttttc 1994521 gtgcagccat ccaatacccc cctgggtact atggcggtgc cacttcaacg agatagaggg 1994581 tgcatgtgat tggtgatcaa gacagcatcg ccgcggttct caacaggtta cgccgtgctc 1994641 agggacagct tgccggggtg atttcgatga tcgagcaggg ccgcgactgc cgggacgtgg 1994701 tcacccagct cgccgcggta tcgcgcgcac tcgaccgcgc cggattcaag atcgttgcgg 1994761 cagggttgaa ggaatgcgtg tccggggcca cggccagcgg cgcggcaccg ctgagtgcag 1994821 ctgagctaga aaagctgttc ctggcgctcg cttgaatggg cccgaagcca tcaataacca 1994881 aggccgccgt ccgtgtatac ccataggggt atattggacg ccatgtcgga ccagccacgt 1994941 catcaccagg tcctcgacga cctgctgccc caacaccgcg ctctacgtca ccagattccc 1995001 caggtgtacc agcgatttgt agccctgggc gacgccgcgc ttaccgacgg cgctctcagc 1995061 cgcaaggtca aggagcttgt ggcgctggcg atcgcggttg tgcaggggtg cgatggctgc 1995121 gtcgcatcac acgcccaagc cgcggtacgg gccggcgcta cagcgcaaga agccgctgag 1995181 gccatcgggg tcaccatctt gatgcacggt ggaccggcca ccatccacgg tgctcgtgcc 1995241 tacgcggcat tttgcgaatt cgctgacaca acgccgtcct agtcgtcgcg gccaccgagc 1995301 ggaccgcgct gacccgggct gaaacgttcc gaggcggact ggcgaaacgc atggtaggtc 1995361 acgcggaaat gcggggcgtg ttggcgcgat ggcgatagcc tttgccgagg gttcaatggt 1995421 gaccgggcgc ccgccgggtt tccatgaggc gggaggtccc tgatgtccta tctcgtcgtg 1995481 gtgccggagt tggtcgcagc ggcggcaaca gatttggcga acatcggttc gtcgattagt 1995541 gcagccaacg cggccgcggc ggcaccgacc acggcactgg tcgcagccgg cggcgacgag 1995601 gtatcggcgg ccatagccgc gttgttcgga gcgcatgctc gggcatatca agcgttgagt 1995661 gcccaggcgg cgatgtttca tgaacagttt gtccgggccc tcgccgccgg cggtaactcc 1995721 tacgccgtcg ctgaggcggc aaccgcgcaa tcggttcagc aagatctgct caacctgatc 1995781 aatgcgccca cccaggcgct gttggggcgt ccgctgatcg gcaacggcgc caacgggctg 1995841 ccgggtacgg gccagaacgg cggcgacggc gggattctgt acggcaacgg cggcaacggt 1995901 gggtccggcg gggtcaacca ggccggtggc aatggcggga atgctgggct gtggggcaat 1995961 ggcggatccg gcggagccgg cgggaacgcc accactgccg gccgcaacgg cttcaacggg 1996021 ggcgccgggg gaagcggcgg tttgctgtgg ggcaatggcg gtgccggcgg ggccggtggg 1996081 cacggcggtc cggctccgct cgtgggcggg gtgggcacca ccggtggcgc cggcgggaac 1996141 ggcggcggcg ccgggttgtt ctacggtttc ggcggcgccg gtgggaacgg cgggatgggc 1996201 ggggtggcac cgagcaccgg cccctcgatg ggcatcctcc cggccggcgg tgtcggcggg 1996261 cctggtggct ccggcggggc gagcgcgctt gccttcggct ccggcggcgt cggcggtgcc 1996321 ggtggcttgg gcgggccgac cgatggcacc gtccaggggg tgggcggctt cggcggtcag 1996381 ggcggcaacg gcgggcagag cggcttgttg tttggcaacg cgggagccgg cggggcaggc 1996441 gctgccggcg gagccggcac cggcgacacc gagagcttcg gcggccacgg cggggccggc 1996501 ggtgatggcg gcgctgttgg cttgatcggt aacggtgggg gcggcggtaa cggcggggcc 1996561 ggcggcaccg gatctcccgg cgctgtggtg ggtggtaacg gcggcgtcgg tggtctgggt 1996621 ggcgccggca gtcccggggg tctgttgtac ggcaccgggg gggccggcgg caatggcgga 1996681 ccgggtggtg acggtggtac tggcgcgacg gtgggctttg ccggctccgg cggtttcggc 1996741 ggtgcggggg gcatcgccca gctgtttggc acgggtggca tgggtggtag cggcggtggt 1996801 ataggcgctg gcaccacgac cgtggtgccg cccgacgtcg ccccggtggg tggcacaggc 1996861 ggcaatggcg gtcgcgccgg gctgctgttg ggtgtgggtg gcatgggcgg taatggcggt 1996921 gccaccagcg tcggcgggac gctctacgcc gccggtggaa acggcggcga cggcgggttg 1996981 gtgtggggca acggtggcac cggcgggagc ggtggcgccg gcggggcggg cagcgtcggc 1997041 aacggcggtg cgggtggcaa cgcggcactg ctgttcggca acggcggggc gggcggggcc 1997101 ggcggcgccg gcggcatcgg tgccggcgga gccggcggct tcggcgcggt tctgtttggc 1997161 aacggcgggg ctggcgggag cggtgccccc ggtggcatcg gcgccggtgg caatggcgga 1997221 aacgcgctgc tggtcggcaa cggcggcaac ggtggggcag gtaccggtgg ggctgctggc 1997281 ggtgccggtg gctcgggcgg gttgctattc ggccaaaatg ggatgcccgg gccgtgagcg 1997341 ccccaaccca ggccaacccc ctatgggcaa tctgcacatc aattggccag gtcgacagca 1997401 gaccgcacac atctacgaga ttggttcccg atccgtgggt ggggccggga aaagcggctg 1997461 taagagttgg ctaggttcag tagggtggcg gcgtgcatga ggtggctgct cgtgagcaac 1997521 gttcggacgg gccgatgagg ctggatgcgc agggccgact gcagcgttac gaggaggcgt 1997581 tcgctgacta cgatgcaccg tttgcgttcg tagatctcga cgcgatgtgg ggcaatgccg 1997641 atcaactgct tgcgcgcgcc ggcgacaagc cgatccgggt ggcgtcgaag tcgctgcgtt 1997701 gccgaccact gcaacgcgaa atccttgatg ccagtgagcg attcgacggg ctattgacgt 1997761 tcacgcttac cgagacgctg tggcttgccg gccaaggttt ctcgaacctg ttgttggcct 1997821 acccgccgac cgaccgggcg gcattgcgtg cgcttggcga gctgacggcc aaggacccgg 1997881 acggggcgcc gatcgtgatg gtggacagcg tggagcacct tgacctgatc gagcgcacga 1997941 ccgacaagcc ggtacggttg tgtctggatt tcgatgccgg ctattggcgc gccggcgggc 1998001 ggataaaaat tggttccaag cgctcgccgc tgcacacccc ggagcaggct cgcgcactcg 1998061 cggtggagat cgcgcggcgg ccggcgctaa cgttggcggc gttgatgtgc tacgaggccc 1998121 acattgcggg cctcggtgac aacgtcgccg gcaagcgggt ccacaacgcg atcatccgtc 1998181 ggatgcagcg catgtcgttc gaagagctgc gcgagcgtcg tgcccgggcc gtcgagctgg 1998241 tgcgcgaggt cgccgacatc aagatcgtca acgccggtgg caccggcgac ttgcagctgg 1998301 ttgcgcagga gccgttgatt accgaagcga ccgccggctc gggtttttac gcgccgacac 1998361 tgttcgactc gtattcgacg ttcacgctgc agcccgcggc gatgttcgcg ctgccggtat 1998421 gccgtcgtcc cggtgcaaag accgtgaccg cgctcggggg tggctattta gccagcgggg 1998481 tcggggcgaa ggaccgcatg ccgactccct acctgccggt cgggctgaag ctcaatgcgc 1998541 tggagggaac gggcgaagtt cagacaccgc tatccggtga tgcagcccga cggctgaagc 1998601 ttggcgacaa ggtctacttc cgccacacca aggccggtga gctgtgtgag cggttcgacc 1998661 atctgcatct ggtccgtggc gctgaagtag tcgacaccgt ccccacctac cggggtgaag 1998721 ggcgcacctt cctctaatgc tgaaatggac gaggcccacc cggctcaccc ggcagatgcg 1998781 gggcggcccg gtggcccaat tcaaggcgcg cgaagaggag ctgccatgac accgatcacc 1998841 gccctgccga ccgagttggc ggccatgcgc gaggtagtcg agacgctcgc acccattgag 1998901 cgtgccgcgg gcgagccggg tgagcacaag gcggccgagt ggatcgtcga gcgcctgcgc 1998961 acggcgggcg cgcaggacgc gcgcatcgag gaggagcagt acctcgacgg ctacccgagg 1999021 ctgcacctca agctgtcggt gatcggggtg gcggccggcg tcgcgggcct gctcagcaga 1999081 cgtttgcgca tccccgccgc gctggccggg gtgggtgcgg ggctggcaat cgccgacgat 1999141 tgcgccaacg ggccgcgcat tgtgcgcaaa cgaacggaga cgccccggac gacatggaac 1999201 gcggtagccg aggccggtga tcctgctggt cagctaacag ttgttgtgtg cgctcaccac 1999261 gacgccgcgc acagcggcaa gtttttcgag gctcatattg aggaggtaat ggtcgagctg 1999321 tttcccggga ttgtggagcg catcgacacg cagctgccga actggtgggg gccgatcctc 1999381 gcgcccgcac tcgccggtgt cggcgccctg cgcggcagcc ggccgatgat gatcgccgga 1999441 acggtgggta gcgccctggc cgccgctttg ttcgccgaca tcgcgcgcag tccggtcgtc 1999501 cccggtgcca acgacaatct ctccgcggtt gcgctgctgg tcgcgctggc cgagcggctg 1999561 cgcgagcggc cggtgaaggg cgtgcgagtg ttgctcgtgt ccctgggggc cgaggaaacg 1999621 ttgcagggcg ggatctacgg gttcctggcg cgacacaaac ccgagctgga ccgcgaccgc 1999681 acatacttcc tgaacttcga caccatcggc tcacccgagc tcatcatgct cgagggcgag 1999741 ggcccgacgg tcatggagga ctacttctat cggccattcc gggatctggt catccgggcg 1999801 gccgagcgcg ccgacgcgcc gctgcggcgc ggcatccggt cgcgcaacag taccgacgcg 1999861 gtgttgatga gccgcgccgg ctacccgacc gcgtgctttg tgtcgatcaa ccggcacaag 1999921 tcggtggcca attaccacct gatgtccgat acacctgaga atctctgcta tgagacggtg 1999981 tcccacgccg tcaccgtcgc cgaatccgtg atcagggagc tggcccgatg agcccgatat 2000041 ggagtaattg gcctggtgag caagtctgcg cgccgtcggc gatcgtacgg ccgacctcgg 2000101 aggctgagct ggccgacgtg atcgcgcagg cggcgaaaag aggcgagcgg gtacgcgcgg 2000161 ttggcagcgg gcattcgttt accgacatcg cctgcacgga cggggtcatg atcgacatga 2000221 ccggcctgca gcgggtcctc gacgtggacc agccgactgg cctggtgacg gtcgaggggg 2000281 gcgcaaagct acgtgcgctg ggaccccaat tggcgcaacg acggctcggc ctggagaacc 2000341 agggtgacgt ggatccccaa tccatcaccg gcgcgaccgc gaccgcgacg cacggaaccg 2000401 gggtgcgttt ccagaatctg tcggcgcgga tcgtttcgct gcggctggtc accgcgggcg 2000461 gggaagtgct cagtctgtcc gaaggtgacg attacctggc ggcacgggtt tccctcggcg 2000521 cgctaggagt gatctcacag gtcaccctgc agacggttcc gctattcacg ttgcatcgcc 2000581 atgatcagcg acgctcgctg gcgcagacgc tggagcgcct cgacgagttc gtggacggta 2000641 atgaccattt cgagtttttc gtattccctt acgcagataa ggcgttgacg cgcaccatgc 2000701 atcgcagtga cgagcagccc aaacccacgc ccgggtggca gcgcatggtc ggcgagaact 2000761 tcgagaacgg gggattgagc ctgatctgcc agaccggccg tcgttttcct agtgtggcgc 2000821 cgcgactgaa ccgcctgatg acgaacatga tgtcgtcctc caccgtgcaa gaccgcgcct 2000881 acaaggtctt tgcgacccaa cgcaaggtca ggttcaccga gatggagtac gcgatcccgc 2000941 gtgaaaacgg gcgcgaggcg ctccagcgtg tcatcgacct tgtgcgccgt cgcagcttgc 2001001 cgatcatgtt tccgattgag gtgcgattct ccgcccccga cgattccttc ctgtcgaccg 2001061 catatgggcg cgacacttgc tacatcgcgg ttcatcaata cgccggtatg gagttcgaaa 2001121 gctacttccg cgccgtcgag gagatcatgg acgactacgc cggtcggcca cactggggta 2001181 aacgtcacta tcagaccgcc gccacgcttc gtgagcgcta tccgcagtgg gatcggttcg 2001241 ccgcggttcg cgatcgcctc gatccggacc gggtgtttct caacgactac acccggcgcg 2001301 ttctcggtcc ctgacaacga atcaacgaac cctcgtggtg ttcggccgat atcgacacgg 2001361 tcacaaccgc gtaccgatat cagcggtggt atggcgtaac gggcacgatg cacaaatcat 2001421 ggcagcatgc gcgttgggag ccaccgtcgc gaaccaagcg tgcgcgttca cggattcgtc 2001481 cgcctgagtt ggcggatatc ggttgggttc aacaggaggt agccaaccca tgacggcgaa 2001541 tcgagggccc gctgcaatct cgagcggctc gaactctggc cgcgttctcg acaccgcccg 2001601 gggtatcctc atcgctcttc ggcggtgccc cgcagagacc gcgttcgacg agttgcacaa 2001661 cgccgctcaa cggcacagat tgccggtctt cgaaatagct tgggcactag tgcatttggc 2001721 ggtcgaggga agcacgccat gccggagctt cgtcgatgcc cagtcggcgg ctcggcggga 2001781 gtggggtcag ctttttgcgc atgcggcggc gtaatgccag cttggcggtg gtgtggggaa 2001841 gcaccgccgc cagctaaacg gatcggcttc gaatccagga gcccaatcag cgagtccagt 2001901 ccggcgagtc cgcggcggcg cgcaacgcgg cgattatgcg ctgctctttt tccagaaatc 2001961 gtgcggtggg cgccggcacc gagatcgcga tcacgttgtc gcccagggcg cgtcgtgcga 2002021 tcgcagccgc ggatatccct ggggtgtgct cgttgcggtc aaaagcgata ccggtgcgcc 2002081 ggatctcgac gatctcgcgc cgtagacctt cggccaccat gggatccaga cggcagagcg 2002141 cggcctcggc gtcggcgtcg tcgagagcag ccagcgccgc ttttccattc gcggttccgt 2002201 tcaacgggaa gcggagcccg acggctgaga ccgcacgcag ccggtaagac gattcgatct 2002261 ggtcgacaaa ccacattcgc tggccgcgca gtaccgacag gtcgaccgtt tcgccgtcgg 2002321 tcgcgcgggc aactcgctcg acggtcggcc ggaacgccgc ggctatgtgg gctccggtga 2002381 cacttccgaa tcccagcaaa cgctcgccca gtgcgaagcg gccgtgcgaa tcgacactaa 2002441 ccagccccac ctcgaccagg ccgaccagca agcgtcgagt cgtcgatttg gccagcccca 2002501 gccgctcgca gagatcgact aggcgcaggt gtcccggttc ggcagctatt tcgtccagcg 2002561 cggcgacggc gcgacggagc acctggatgc cttcgtcgcg attcgttgtc gactttcctt 2002621 ccgtaggcgg cacaactgca atatagtgaa ccgaaatacg gatcacaatg attcgaaata 2002681 cggaccagga gttttgctat gagggcgcta ccggccgggc ggcacttctt ccggggcagt 2002741 gacgggtacg aggcggctcg ccgcggcacc gtgtggcatc ggcgcgtacc ggatcgctac 2002801 cccgaggtga tcgttcaggc tgtcagtgct gacgacattg tcagcgccat ccgctacgcc 2002861 acggtcaatg gccataaggt gagcgtcgtg tccggtgggc acagttttgc cgccagccat 2002921 ctgcgcgatg gcgctgtgct gctcgacgtg agccggatag accacgcctc catcgacgcc 2002981 gataagggcc gcgcggtcgt cggtccaggg aagggcggca gcgtgctcat ggccgaactg 2003041 gaggcgcagg gcctgttctt cccgggtggc cactgcaggg gagtctgtct cggaggttat 2003101 ctgctgcagg gcggatacgg ctggaacagc cggatctacg gcccggcgtg cgagagcgtg 2003161 attggcctgg acgtcatcac cgccgacggc gcgcagatcc attgcgacgc agacaatcac 2003221 gccgatctgt actgggccgc ccgcggcgcc ggtccgggct tttttggcgt cgtcacctcg 2003281 ttttacctga agctgtatcc gaggccggcc acctgtggca ccagcgtcta tgtctaccca 2003341 ttcgaccttg ccgacgaggt ctttacctgg gcccgcgcgg tcagcgccga agtcgaccct 2003401 cgggtcgagc tgcaagccct tgcctcccgc ggtgaaccga gcatgggcat cgacgtcccc 2003461 gtcatctccc ttgcctcgcc cgctttcgct gactcgcccg aagaggccga acaggccctc 2003521 gccctgttcg gcacctgccc ggttgtcgag caggcactgg tcaaagtccc ttatatgcca 2003581 accgatttgc ctgcctggta tgacgtcgcg atgacccact acctgtcaga ccatcactac 2003641 gcggtggaca atatgtggac gtcggcgtcc gctgaggacc tgctgccggg tatccgctca 2003701 atcctggaca cgctgccccc gcatccggcg cacttcctct ggctgaactg gggtccatgc 2003761 cctccccgtc aagacatggc ctatagcatc gaagccgaca tctacttggc gctctacggc 2003821 tcctggaagg atccggccga cgaggcgaag tacgccgact gggcgcggtc ccacatggcc 2003881 gcgatgtcgc atctggcggt cggcatccag ctcgccgacg agaacctcgg tgcgcgtccg 2003941 gcgcgcttcg ccagcgacgc ggccatggcc aagctcgacc gggtgcgcgc cgaatacgac 2004001 cccgacggtt tgttcaacag ttggatggga agaatctgat ggccagcgat ctgtacctgg 2004061 gctaccgcaa cgacgacgcg gacacgccgt tcggcaagtt cttcaaaccc gagatggccc 2004121 cgctgccaca gcatgtcgtg gtggcgttgc agcatggccc ccaggccggg atggcgttgc 2004181 tcgccttcga cgacgccgcg agcatcgttg atgagggcta tcagcagacc gagaacggct 2004241 acgggattct cggcgacggc agcatgcagg tatccgtgcg caccgacatg cccggggtca 2004301 ctcccgcgat gtgggcatgg tggttcggct ggcacggcag cgacacccgc cgctacaagc 2004361 tgtggcaccc gcgggcccat ctatcggcgc ggtggaagga cggcgaccag gacagcgggg 2004421 ccggccgtcg gggcgcgcag cgttacgtcg gccgctggtc gatgatcagc gagtacatcg 2004481 gctcgacgaa actgggtgcc gcaatacaat tcgtcgagcc ggcggccatg ggtctgcccg 2004541 acgacagcga cgatacggtg tcgatctgtg cgcggttggg ctctgctgac gccccggtgg 2004601 atgcgggctg gttcgtccat caggtccgat cgacgccggg cgggtccgag atgcggtcac 2004661 ggttttggat gggcggaccg cacatcgcgg tgcgcaaggc acccgaggtc gcgtccaagg 2004721 cggtgcgtcc catcgcgtcg aagctaatcg gcgtctcgga atcgaccgcg cgtaatctgc 2004781 tggtgtactg cgcgcaggag atgaaccacc tggcggggtt cttggcggac ctgtgggaaa 2004841 gcttcggtga cgagtgaggt ttcagctttg ctcggcaaac gctggcgcca cgtatttttc 2004901 gaccagccgg cgttcggctt cgtcgttctc agctggccaa tacatcagtg agagcaccac 2004961 gcgtaccacc catttcgcgc cttgcggatc accgccggct atgccggtga gctcggtagc 2005021 aaagtccgca agcaacggtg actcggtgag ccaggccaat tcaccggcgc caccgtggat 2005081 cgagccgaac atgagcttgc ccagcgggtc ggatcggatt cgctgaagcg ataacaggat 2005141 cgccgcgacg actcgctccc gcccccgcag agtttcgaca tccgagcgca cgccgtcggc 2005201 gatccgggcc gcggcccggg tcagaacgac atcccggatc tgggccttgc cgccggcacg 2005261 gcggtagatg gtcgctcggg agcagtggac ctcgcgggct aatttgtcga tgtcgagtgc 2005321 gttgagcccg tagcgcgtaa tgaggtcggt tgcggcggcg tagatccgtt cggcagcgat 2005381 cgtgcggcgg ttgccgccca cgatccaatc gttacccggc actggtcagg cgcatttcca 2005441 tcgagaggcg aagagcgatt cttctcatag tgagacacaa accttactta ttctcatcgt 2005501 agttgcaggt ccgcctcccg cggtgagacg ttcgccgaaa ggctccccgg gcgcagttct 2005561 cgacttgcag cgacgcgttg accaggcggt atccgccgat cacgctgaac taatgacaat 2005621 tgccaaggat gccaacacgt tctttggtgc cgaatccgtg caggacccct acccgctgta 2005681 tgagcgcatg cgcgccgcag gctcggtcca ccggatcgct aactcggact tctatgccgt 2005741 gtgcggttgg gacgctgtca atgaggccat cggtcgtccg gaggacttct cctcgaattt 2005801 gaccgccacg atgacctata cggccgaggg caccgctaaa ccgttcgaga tggacccact 2005861 cggcggaccc acacacgtgt tggccaccgc cgacgatcct gcccacgccg tgcaccgcaa 2005921 gctcgtgctg cgtcacttgg cggccaagcg gatccgcgtt atggagcagt tcaccgtaca 2005981 ggctgccgac cggctgtggg tcgacggcat gcaggatggg tgcatcgaat ggatgggcgc 2006041 catggccaat cgcctaccga tgatggtcgt agctgagctc atcggcctgc ccgaccccga 2006101 catcgcccag ctggtgaagt ggggatacgc ggccactcag ctactcgaag ggttggtcga 2006161 aaacgatcag ctcgtcgccg cgggtgtggc gttgatggag ctcagcggtt acatcttcga 2006221 gcagtttgac cgtgccgcgg ccgatccgcg ggacaatctg ctcggtgagc ttgccaccgc 2006281 ctgcgcatcg ggggagctgg acactctcac cgcccaggtc atgatggtca ccttgttcgc 2006341 cgccggcggc gagtccacgg cggcgctgct gggcagcgcg gtatggatac tggcgacacg 2006401 tcccgatatc cagcaacagg tgcgcgcgaa ccccgagctg ctgggagcgt ttatcgaaga 2006461 gacgctgcgt tacgagccgc catttcgcgg ccagtaccgc cacgtgcgaa acgccaccac 2006521 cttggacggc acggaactgc ccgcggattc gcacctgctg ctgttgtggg gcgcggccaa 2006581 ccgcgatcca gcccagttcg aggcacccgg cgagttccgt cttgaccgtg caggaggcaa 2006641 aggccacatc agtttcggaa aaggggccca cttctgtgtc ggcgctgcac tggcacgctt 2006701 ggaggctcga atcgtcttgc gtctgctgct cgatcgcacc tcggtaattg aggcagccga 2006761 tgtcggcggg tggttgccca gtatcctggt gcgccgcatc gagcggctag agctagctgt 2006821 acaataggcg ctcgacgact cctattgcag cacaacggat atcagcaaca gcaggtgcca 2006881 accgcggcga tcggatgcgt gagaatagtg aaagtggttg tcgcggtcag gatttctgcg 2006941 atcaacccta cccgcatgac gccggcgggt ggcccccgcc ggccacgata aatgcttcga 2007001 ccgccgtggc ccgctcgtaa ccttcgacct ccacgcgcca ctcgtagatt cctggttcca 2007061 agggaattcc cgcaggaatg ttgagggtga gcggcatgcg aaccgaggtg ccgtggattg 2007121 cgccaggagc gcggcccgcc tcggcggcgg cttcaaagag gatccgctgc ggcccgtgtg 2007181 gtcccggcac gaccaccgga tcgccgtcgg cggtgagcaa ctggcatttc agctggtgct 2007241 gcttattggt ctcatcccag tcgatgtcaa ggaacagtac caaagcgaat gggggggtcg 2007301 gtctttggca ttgccgccag cccagcccga gcgcatggac cttcccggac tgggcatcag 2007361 cctgcgccgc gtccgacagg aacagactga ccctcatgtc gccgccgctg cgatcgaact 2007421 cccgggttcc gattccaccc ctgtccttcc ccaatgtgca actagccgaa ggtcggtcaa 2007481 taccgcaccc acttagactg actccatccc gacggcagga taatacgtgg cgaccggtag 2007541 atctatgttg tgctatctgg gcggtggcag ctggcgcgga ccgtcggggg aacgcagttc 2007601 atgcggaccc tcccgttggg tcagctcccc ggtcgaccca actgataggc tcgccaggtc 2007661 tcgcggaggc acctgcgcta acggcgggtg atcatcgttg gtagccagcc ggctacggcg 2007721 cgcagagtcc gacgatgcga tcgggttgct ttcggcaggg gcagccgggg agtgggattc 2007781 cagcgggggc tggtcgttgg ggtcggaacc ttcggcattg accgtcacct tgtgggtgcg 2007841 cttgaacgaa aacgcgatct cctcgacctc ttcgaacacc tgacgcgcgg tgcgcaacgg 2007901 cgcagtggtg ttgtcgagca ttcgtgcgac aaacggcggc accagtgggc gaatccaccg 2007961 agccgcagcc ttggtggcgt ctgggatcga ccggatcact ggggttgcac gcttctcgcg 2008021 tggttcaacc gcaccctccg gttggacctg tgccggcagg tttttcgcga taccggggcg 2008081 gtgatgggct gccccagcag gcaattgggc gaccgtccgg ctggcagctt cggtttcggc 2008141 ggcgatgggc agatacacct cgtcctccac tggctgcaag gttcgctccg acgaagcgcg 2008201 cactggacct tcgagcgctt cgacgactcg gcggcgttgc tgttcgcggt cgatttcggc 2008261 ctgggcctcg atggcgaggc gggtctgggt gagttggtgc tcggcccaca tgatttcggc 2008321 ggcccggcga acctccgccc gcttgatcgc gatcgcggtg tccgcctcca actcggcgcg 2008381 ctcccgttcc gctcgcgcgg ccgcgtggcg atcgtgcgtt gtgtcaccgc gccatagccg 2008441 gagaatcagc ggcaataggt acagcagcgc gaagaaagcg atcgccagca ttcgcgccgt 2008501 caaggcgccg gcgctggcca atgtcagatc gttcatggcg acccagcgcg agcccaaacc 2008561 acgacccgca tccgcgacca cggcctggcg cacctcggca agagcctgct cgtcgtgtgc 2008621 cattttggcg tccaaggcag gtgcttggtg atcacgagcc gccagcgcgt tgtccagctc 2008681 acgctgcgcg tcggcgagaa gctggttcgc cgttcgtgtt tcgggccctc ggccgggaac 2008741 gccggtgatc cgggtctgcg ggcaggccgg agttgggtgg tattcgcagc gtgcgacgac 2008801 cagcgcatcg tccagtcgtc cgcgcgcccg ctcgaccgcg ctgtccagcg cagtgcgcgc 2008861 attgcgggcc tgttgcaggg aggccgacgc ttgcacggcc gccggcgtcg cgtcggcgct 2008921 gtgcatagct tgttcatcga gacggcggtc gatggcaccg gaaaacatga ccagcgcagc 2008981 gagttcgccg acgacgaaac cgacggcgac cgcgacggac gcgcgtcccg taacgccggc 2009041 ccgaccgcga gctgggccac tggccgtacc gcgggtcacc gcgccgacca gcaggccgag 2009101 caccagggcg agcgaggcag ccccgatggg ggacgagatc ggcccctggg ccgcctcgct 2009161 caccgcgagg ctcgcgagga gtccggccag cgcggcgccc acggccacaa tcacgccggc 2009221 cacggcgtgc gtggaccgct cgtgacgctc gccgagttcg cgccagtgtc cgccgccgag 2009281 ccaggtaagc agcccctcga ttccggacac ggccgagcgc tgctcagcat attcgtgggc 2009341 gcacatgaga ctgaaaacac ctcctgctgg tcaagcctgg caggcccccg cccgacacac 2009401 cgaatcgaag cggccccttg tggtgttgtt cacaactgcg cgagagatga cgcagatcac 2009461 gtcgcggctg cccagccgaa tcctcagcga gttcaatgtc aaaattaccg cggcgcgagc 2009521 ggatcagcgg ccattatggc aggtgacgtg agacggtata cacctatgca aaatcacgac 2009581 tacgttacct acgaagagtt cggccgcaga ttcttcgagg tagcagttac cccggaccgc 2009641 gtcgccgccg cgtttgccga catcgcgggc agcgagttcg caatggaacc gatctcccag 2009701 ggccccggcg ggatcgccaa ggttagcgcg aacgtcaaga tccgagagcc ccgggtgacg 2009761 cgaaagctgg gtgacctgat cacgtttgtc atccatatcc cgctgtcgat cgatctcctt 2009821 cttgacctgc gcctcgacaa gcagcggttt atggtcgccg gcgacatcgc gctgcgcgcc 2009881 accgcacgcg ccgccgagcc gctgctactg attgtcgacg tcgccaaacc gcggccctct 2009941 gatatcacgg tcaacgtgtc gtcgaagtcg atccgcggtg aggtgttgcg catcctcgca 2010001 ggcgttgacg gtgagattcg gcgatttatc gcccagtacg tctctgccga gatcgactcg 2010061 cccaaatccc aagccgctca agtcatcaat gtggccgaac aattggactc tacctggagc 2010121 ggcccgtagc cagctctgga tgcagtctgg ctgccggcca ccgaaagctc accaacagct 2010181 catcggtgag gtcgtcgcag cgcgcaccgc ctcggccaag gtggctgccc tgcgatcggt 2010241 gaagatgtcc tcgagcagca tcggctgacc gtcggggccg gtgagcggca cccgccagtt 2010301 cgggtactcg tcggtggtgc caggctggtt ttgcgtccgg cggtcgccga ccgcatcggt 2010361 caacgccact gccaacagcc gcgagggcgt tcggcccagg tagcggtaga gagccaggac 2010421 ggcctcctcc gagtcgggct cggcaccgtc cgccagcagt ccgacccggc gcagctcggc 2010481 catccaggct gcccggtcgg cccgggcgga ttcgagttcc gcctccacgg ggttggttaa 2010541 caacccaagg gactcgcgca gccgtacctg gtcgccggcc aggtagccgg cggtcggcgg 2010601 cagatcatgg gtggtcaccg acgacaagca gtactcccgc cagcgttcgg ccggcaatgg 2010661 tgttccagcc ggcccgcaat ctcgatcctg ctcaaaccag agaattgagg tgcccagcag 2010721 gccccgcaat agtagatagt cgcgtaccca cggctcgacg gtgccgagat cctcaccgac 2010781 gacaaccgcc ccggcccggt gggcttccag ggcgacgatg ccgatcatcg cgtcgtggtc 2010841 gtagcgcaca taggtgcctt gggtgggcgg tgcgccgtcg gggatccacc acaaccggaa 2010901 cagcccgatg atgtggtcga tgcgtaccgc accggcgtgc cgcaacgcgg cctggatcag 2010961 cgcgcgaaac ggtcggtact cctgctcagc gagccggtcc ggccgccacg gtggctgcga 2011021 ccagtcctgg ccgagttggt tgaactcatc cggcggcgca cctgcggtca caccttgggc 2011081 cagcacgtcc tgcagagccc aggcgtcggc cccgttgggg tgcacgccaa cggcgaggtc 2011141 tgccatgatg cccagcgaca tgccggcccg gagcgcctgc gactgcgcac tggcgagctg 2011201 ctcgtccagc tgccactgca gccagcggtg gaaatcgacg gcatcggcgt gtttgtcgac 2011261 gaaatcggcg acacctgagg catcgggatg ccgcagcgat ttcggccatc gatgccaatc 2011321 atcgccgtac gtctcggcca gcgcgcacca ggtggcgaag tcgtcgaggg cgcggccctc 2011381 gcgggtacgg aaggcggcgt aggccagctc gcgacccgcc gaccgcggca cccggtgcac 2011441 gagcttgagt gctgcgcgtt tggccgccca ggcgctgtcg cggtcaatgg tgtcgagctg 2011501 gtcggcgtgc tgttgcacgt tggtgcgcaa ccgttgcacc cggccacgct tgggcagatc 2011561 gacgagttcc ggaatggcct ccacccgaag gtagagaggg ttgacgaagc gtcgcgatgt 2011621 cggcaggtag ggcgatggtt cgattggctt cgagcgccca gcgggcccgg gaagcgtagc 2011681 cgcatgcagg ggattgacca gcacatagcc ggcaccgtgc gcagacgccg accacagcgc 2011741 gagattcgcc aaatcggtga gatccccgat gccccatgac tgccgggacc gcacgctgta 2011801 gagctggacg gccaggcccc aggcacgacg gcctgccagc ttgtccggca gccccaacca 2011861 atccggcgtc acgacaacag cggcgctggc ctgcgagtcg cccgaacgca gattcacccg 2011921 gtggtagccg aggggcaggt cggcgggcaa cacgaagctg gcctcgccga tccagcgtcc 2011981 gtcaagatcg aatggcgggg tgaaattgtc gacctgcacc acctcggcac gtgtcgtgcc 2012041 gtcctcgagc tgcaaccaca cgtcggccgg agcgccatcg gtcacatgca ccctgaactg 2012101 cgtctgctct ccggcgcgca tgacgatggt cgccggcaat ggacgcgccc agtaggaccg 2012161 cagctgcgcg gccagggcgt cattgcgttg ctgttcggtc tgggcgggaa cgccgagggc 2012221 ggcaagagca gccaccaatg tagcctcgga gaccagcacc tgccggccag tccagtccgt 2012281 gtactcggtg gcaatgccga atcgtcgggc aagttcgacc agcgaaggcg cgagctcggt 2012341 catgtcgccc atcttgcgtc cggcacccgt gtgcgggcga gcgcaggaat ctgagccttc 2012401 cgtcagcaca gcacggttgg ctaccgaaca ccactacgtt gcaggtcaac gaggtagact 2012461 gcggagcgga cagttccaca ggcggactcg gtcattcgcc gctaccatgc ccagtgaaga 2012521 cacgacgaat ccttggggga tccgcgcagt ggcaaatacc caggtcaatg tccaggtgtt 2012581 ctgagcagac cggaaggtga tctagcgtgg ctgaagagag ccgcgggcag cgggggtcgg 2012641 ggtatggcct tgggttgtcc acgcggaccc aggtaaccgg ttatcagttc ctggcgcgtc 2012701 gaaccgcaat ggcgttgaca cgctggcatg tgcgtatgga gattgagccg ggtcggcggc 2012761 agacgttggc ggtggtggcg tcggtgtcgg cggcgttggt gatctgtctg ggggcgctgt 2012821 tgtggtcgtt catcagcccg tccggccagt tgaatgagtc gccgatcatc gcagaccgcg 2012881 attccggtgc gctctatgtc cgtgtcggtg acaggttgta cccggcgctg aatttggcat 2012941 cggcacggct gatcaccggg cggccggaca acccgcacct ggttcggtca agccagattg 2013001 ccaccatgcc gcgcggtccg ctggtgggta tcccgggtgc gccgtcatcg ttctcgccaa 2013061 agagtccacc cgcgtcgtct tggctggtct gcgacacggt agcgacctcg tcaagcatcg 2013121 ggtcgctgca aggcgtgacg gtgacggtca tcgacgggac cccggacctt accggtcacc 2013181 ggcagatttt gagtggatcg gacgcggtag tgctgcgcta cggcggagat gcgtgggtca 2013241 tccgggaggg gcgccggtca cgaatcgagc cgacgaatcg agcggtgttg ttgccgctgg 2013301 ggttgacgcc ggagcaggtt agccaggcgc gtccgatgag ccgggcattg ttcgacgctt 2013361 tgccggtcgg gcccgaactg ttggtgccgg aagtgccgaa tgcgggtggt cctgcgacgt 2013421 tcccgggcgc tcccggaccg atcgggacgg taatcgtcac accgcaaatc agtggaccac 2013481 aacagtattc gttggtcctg ggcgatggag tgcaaacgct cccgccgttg gtggcccaga 2013541 tcctgcagaa cgctggtagt gcgggcaaca ccaagccgtt gaccgtggaa ccctcaacgc 2013601 tggccaagat gccggtggtg aatcggttgg atctctctgc gtatccggac aatcccctgg 2013661 aagtggtgga cattcgcgag catccgtcga cctgttggtg gtgggagcgg acggccggtg 2013721 aaaaccgggc ccgtgtgcgg gtcgtgtccg ggcctaccat tccggtcgcg gcgaccgaga 2013781 tgaacaaggt ggtgtcgttg gtgaaggccg acacgagtgg ccgccaagcc gatcaggtct 2013841 acttcggccc cgaccatgcg aacttcgtgg ccgtcaccgg caacaacccg ggggcccaaa 2013901 cgtccgaatc gctatggtgg gtgaccgatg cgggcgcgcg gttcggggtg gaggacagca 2013961 aagaagcgcg tgacgcgttg gggttgaccc tgacgccgag cctggcgccg tgggtggcgc 2014021 tgcggctgct gccacagggc cccacgctgt cacgagcgga cgcgttggtg gagcacgaca 2014081 cgctcccaat ggacatgacc ccggcagagt tggtggtacc gaaatgaagc gtggttttgc 2014141 ccgcccgaca ccggaaaagc ctccggtcat caagcccgag aatattgtcc tatcgacacc 2014201 gctgagcatt ccgccgccgg agggcaagcc ctggtggctg attgtggttg gcgtcgtggt 2014261 ggtgggcctg ctgggcggca tggtcgccat ggttttcgcc agcggatcac acgtgttcgg 2014321 cggcatcggc tcgatcttcc cgctcttcat gatggtcggg atcatgatga tgatgttccg 2014381 cggcatgggc ggcggccaac agcaaatgag ccggccgaaa ttggacgcga tgcgcgctca 2014441 gttcatgttg atgctggaca tgctgcgcga gacggcccaa gagtcggccg acagcatgga 2014501 cgccaactat cggtggttcc acccggcgcc caatacgttg gcggccgccg tggggtcacc 2014561 ccggatgtgg gagcgcaagc ccgacggtaa ggacctgaac ttcggggttg tccgcgtcgg 2014621 cgtgggaatg acgcgtcccg aagtgacctg gggtgagccg cagaatatgc cgaccgacat 2014681 cgagctggag ccggtgacag gtaaggcgct gcaggaattc gggcgctacc aaagcgtcgt 2014741 gtacaacctg ccgaaaatgg tttcgctgct ggtcgaaccc tggtatgcgc tggtcgggga 2014801 acgcgagcag gttctgggtt tgatgcgggc gatcatctgc cagctggcgt tctcccacgg 2014861 gcctgaccat gtccagatga tcgttgtcag ttccgatcta gaccaatggg actgggtgaa 2014921 gtggctaccg catttcggtg actcgcggcg gcacgacgcg gcgggtaacg cgcggatggt 2014981 ctacacctcg gttcgtgagt ttgccgcaga gcaagccgaa ttattcgcgg gccgtggttc 2015041 tttcacgcct cgacacgcga gttcgtcggc gcagaccccg accccgcaca ccgtgatcat 2015101 cgccgacgtc gacgatccgc aatgggagta cgtgatcagc gccgagggtg tcgacggggt 2015161 gacgttcttc gacctgaccg gctcttcgat gtggactgac atcccggagc ggaagctgca 2015221 gttcgacaag accggcgtga tcgaggcgct gccccgcgac cgcgacacct ggatggtgat 2015281 cgacgacaag gcttggttct tcgctctcac cgaccaagtc agcatcgccg aggcagaaga 2015341 gttcgcgcag aagctggcgc agtggcggct ggctgaggcc tatgaagaga tcggccagcg 2015401 ggttgcccac attggtgccc gagacatctt gtcctactac gggattgacg atcctggcaa 2015461 catcgacttc gactcgctgt gggctagccg gaccgacacc atgggacggt cgcgattgcg 2015521 ggcgccgttc ggtaatcgct ccgacaacgg cgagctgctg ttcttggata tgaaatcgct 2015581 cgacgaaggc ggcgacggcc cgcacggggt catgtccggg acgaccggtt ccggtaagtc 2015641 gacgttggtg cgaaccgtga tcgaatcgct gatgctcagc catccgccgg aggagttgca 2015701 gttcgttttg gcagacctca aaggtggctc ggcggtcaag ccgttcgcgg gagtgccaca 2015761 cgtgtcgcgg atcatcaccg acctcgaaga agaccaggcg ctcatggagc gctttctgga 2015821 tgcgctgtgg ggcgagatcg cccgccgcaa agcaatatgc gacagcgccg gtgtcgacga 2015881 cgccaaagag tacaactcgg tgcgagccag gatgcgtgcg cgcggtcagg acatggcgcc 2015941 gctgccgatg ctcgtggtgg tcatcgacga gttctacgaa tggttccgca tcatgccgac 2016001 ggcggtcgac gtcctcgact cgatcggccg gcagggccgc gcctactgga ttcacctgat 2016061 gatggcgtct cagaccatcg agagccgagc cgaaaagctc atggagaaca tgggttaccg 2016121 cttggtgctg aaagcgcgta ccgcgggagc ggcgcaggcg gccggggtgc ccaacgcggt 2016181 gaatctgccc gcgcaggccg gtctgggcta cttccgcaag agcctcgagg acatcatccg 2016241 attccaggcg gaattcctgt ggcgggacta cttccaaccc ggcgtcagca tcgacggcga 2016301 ggaagcgcct gccttagtac acagcatcga ctacattcgc ccgcaattgt ttaccaactc 2016361 gttcacaccg ctggaagtta gcgtgggggg tcccgatatc gagccggtag ttgcccagcc 2016421 caacggtgag atgctcgagt cggacgacat tgaaggcggc gaggacgagg acgaagaggg 2016481 ggtgcgcacc ccgaaggttg ggacggtgat cattgatcag ctgcgcaaga tcaagttcga 2016541 gccgtaccgg ctctggcaac cgccactaac ccaacccgtc gccatcgacg acttggtcaa 2016601 ccggttcctc ggccgcccgt ggcacaagga gtacggttcg gcgtgcaatc tcgtgttccc 2016661 gatcgggata atcgatcgcc cctataagca tgaccagcca ccgtggacgg ttgacacctc 2016721 cgggcccggt gccaacgtgc taatcctggg cgccggcggt tcgggcaaga ccactgcgct 2016781 gcagacactc atctgctcag cggcactgac tcacaccccg cagcaggttc agttctactg 2016841 cctggcctac agcagcaccg cgttgaccac ggtctcccgc atcccccacg tgggcgaggt 2016901 tgccggtccc accgatccct acggtgtgcg ccggacggtg gccgagttgc tggcgctggt 2016961 gcgcgagcgc aaacgcagct tcctggaatg cggaatcgcg tcgatggaga tgttccggcg 2017021 ccgcaagttc ggcggagagg ccgggccggt acccgacgac ggcttcggtg acgtctacct 2017081 ggtgatcgat aactaccggg ccctggccga agaaaacgag gtgctgatcg agcaggtgaa 2017141 cgtgatcatc aaccagggcc cctcgttcgg ggtgcacgtg gtggtcactg ccgaccgcga 2017201 atcggagctg cggccgccgg tgcgcagcgg cttcggatcc cgtatcgagc tgcgcttggc 2017261 ggcggttgag gacgccaagc tggtgcgttc tcgattcgcc aaggacgttc cggtcaagcc 2017321 ggggcgcggc atggttgcgg tcaactacgt ccgcctggac agcgacccgc aggccggcct 2017381 gcacaccctg gtggctcgac cggcgttggg cagcacaccc gacaatgtct tcgagtgcga 2017441 cagcgtggtc gcggcggtga gccggctcac cagcgcccag gctccaccgg tgcgccggtt 2017501 gccggcgcgg ttcggcgtgg aacaggtgcg ggagctggcc tcgcgggaca cccgccaagg 2017561 cgttggcgct ggcggaatcg cctgggcgat atcggaattg gatctggcgc cggtttatct 2017621 gaatttcgcc gagaattcgc acctgatggt gactggtcga cgcgaatgtg gccgcaccac 2017681 cacgctggcc accatcatgt ccgaaatcgg gcggctctac gcgccgggcg ccagcagcgc 2017741 accgcctccc gcccccgggc ggccctctgc gcaggtatgg ctggtcgacc cgcgccgtca 2017801 gctgctgacc gcgctcggtt cggactatgt ggagcggttc gcctacaacc tcgacggggt 2017861 ggtggcgatg atgggtgaac ttgcggcggc gttggccggt cgtgagccgc caccgggcct 2017921 gtccgccgaa gagttgttgt cgcggtcgtg gtggagcggc ccagaaatct tcctgatcgt 2017981 cgacgacatc cagcagctgc cgccgggctt cgattcaccg ttgcacaagg ctgttccgtt 2018041 tgtgaacagg gccgccgatg tcggcttgca tgtgatcgtc acgcgcacct tcggtggttg 2018101 gtcgtcagcc ggcagcgacc cgatgttgcg ggccctgcat caggccaatg cgccactgct 2018161 ggtgatggac gccgatcccg acgagggctt cattcgcggc aagatgaagg gcggcccgct 2018221 gccccgcggt cgaggcctgt tgatggcaga agacaccggt gtgttcgtcc aagtggcagc 2018281 caccgaggtg cgtcggtagt tcggccaaac cgatcagctc cagcgtagcg gcaagttctt 2018341 aagcgcgaag gacttggacg ggaaccgtat ttcgggcgcg tagtccggcg cgagctcgaa 2018401 gtcgggaatt tgattcagcc actcgcccac cagcagggtg agctctaaac gggctagatg 2018461 cgaacccagg caacggtgtg gaccgccgcc aaatccccag tgccggtgca cctttccatc 2018521 catcaccaac tcgtcggtgg acatcgcgtc gctgccgtcg cggttgactg cggccatgca 2018581 taaccgcact ggtgaccccg caggcagtgt catgccgccg acggtgacgg gctcggtggt 2018641 aactcgcggc gccaccggcg ccgatggctc cagccggacg atctcttcga tgaaaaccct 2018701 gatctgcttg ggattgtcgc gcagcatggc gcgcagctgt ggtctgcggg cgagctcgag 2018761 cagcgaaaag cctaccgctg ccgtcacggt gtccagtccc gccagtatca ggaggtggct 2018821 caaacccaaa acctcgatct cgctcaacgg gtcctcgccg atctgcactt gcgacaagac 2018881 gtccggccct gggtttcgcc ggcgttcggc gaccatggcc gtgagatact cgagcagctc 2018941 gcgcgccgca gcgacatcgg cttcggtcgg gtgaggtcga tccgacatgg cgatgacggc 2019001 gtctttccag ccgatcagac ggtcacggtc ttcgagcggc aggccgtaca ggacgagaaa 2019061 caactgaaac ggaaacagat tcgcgagatc ggccatcgcc tcgcactcgc cccggcctgc 2019121 gatggcgtcg atcatagcga cagtgtgacg gcgcagcgac ggtagcgcct tgctcaaagc 2019181 ggccgggctg aagtatggct gcaggatcct gcggtatcgg gtgtgctcgg gcgggtcgaa 2019241 cgcgagcgga accaccggca gcggatttcc cggaggttgc agcgctttcc gcgacgagaa 2019301 aaccttcgga ttccgcagcg ccgcgagcac atcttcgcgg cgcgtcaggt agtaccagcc 2019361 gttcatgaac accacgggcc ccgcgtcgcg gagggtcttc cagccgacac cccggtcaac 2019421 ggccatcggt aacgtcgaat attcgagccg cggtagataa aacgagccgg cgtggtcctc 2019481 gccgggggtg gtcatgcgct caagtctttc gtgtctccgt tcttgtcgca ggtcgcagac 2019541 gtagccaagc ggtgccgacc tagccaatat cgcacgtggg cgtgcaccca ccattgtggt 2019601 gtcgagcgca tctgggggct cagcggctaa tcttcgaagc gaactgtccg gtccaagctg 2019661 gcgtgtgctt tgggcggtaa agggaggaaa tcccgtgaaa gtccgtctcg atccatcgag 2019721 atgcgtgggt catgcgcagt gctatgccgt cgatccggac ctgttcccga tcgacgactc 2019781 gggcaactcg atcctggcag agcacgaggt gcggcccgag gacatgcagc tgaccagaga 2019841 cggtgtggcc gcttgccccg aaatggcgct catcctcgag gaggacgacg cggactgacg 2019901 attccgggtc ataccacaaa attaacgctg gccaaacgat cgtttacgag gaatgaatat 2019961 ttggcgtcat cggcgctgga ggccggtatt gcaatctaat gtgttttcta tgcaacagtt 2020021 gcgcagcgac gccgttatcg actagcggtg ctatattcgg cgccttttcg atgccgagcg 2020081 cgcgtctcgt tggccacgtt tggtggcaat gctcatcagg gctcatccgg atcgccaacg 2020141 cgatcgtgtg tggagaggga ggactggttg gacttcgggg cgttaccgcc ggagatcaat 2020201 tcgggccgta tgtattgcgg tccggggtcg gggccgatgc tggctgcggc cgcggcctgg 2020261 gacggggtgg ccgtggagtt ggggttggct gcgaccggtt atgcgtcggt gatagccgag 2020321 ctgaccggtg cgccgtgggt gggtgcggcg tcgttgtcga tggtggcggc ggccacgccg 2020381 tatgtggcct ggctgagcca agccgcggcg cgggccgagc aggcggggat gcaggccgcg 2020441 gcggccgcgg cggcttatga ggccgctttt gtgatgacgg tgccgccgcc ggtgattacg 2020501 gcgaatcggg ttttggtgat gacgctgatt gcgaccaatt ttttcggtca gaactcggcg 2020561 gcgatcgcgg tcgctgaggc gcagtacgcc gaaatgtggg cgcaagacgc cgttgctatg 2020621 tatggctatg cggctgcgtc ggcgagcgcg tcgcggttga ttccgttcgc ggcgccgccg 2020681 aagaccacca actccgctgg ggtggtcgca caggtggctg cggtcgcggc gatgcctgga 2020741 ctgctgcaac gactttcgtc ggctgcatcg gtcagctggt cgaatcccaa tgattggtgg 2020801 ctcgtgcggt tgctgggctc gattaccccc acggaaagga cgacgatcgt tcgtttgctc 2020861 ggtcagtcgt acttcgcgac gggcatggcg cagttcttcg cctcgatcgc acagcagctg 2020921 accttcggcc cagggggcac aacggctggc tccggcggag cctggtaccc aacgccgcaa 2020981 ttcgccggcc tgggtgcaag ccgggcggtg tcggcgagtt tggcgcgggc caacaagatt 2021041 ggggctctgt cggttccgcc gagctgggtc aaaacgactg cactgaccga accgggcgcc 2021101 cacgcggtga gcgccaaccc taccgtcggt tcgtcacacg gaccgcatgg cctgctccgc 2021161 ggactgccgc tagggtcgcg gatcactcgg cgtagcggcg cctttgccca ccgatatggg 2021221 ttccgtcaca gtgtggttgc ccgcccgcca tcggccggat aacgccatga cctcagctcg 2021281 gcagaaatga caatgctccc aaaggcgtga gcacccgaag acaactaagc aggagatcgc 2021341 atgtcgtttg tgactaccca accagaagca ctggcggcgg cggccggcag tctgcaggga 2021401 atcggctccg cattgaacgc ccagaatgcg gctgcggcga ctcccacgac gggggtggtc 2021461 ccggcggccg ccgatgaagt gtcggcgctg acggcggctc agttcgcggc acacgcccag 2021521 atctatcagg ccgtcagcgc ccaggccgcg gcgattcacg agatgttcgt caacactcta 2021581 cagatgagct cagggtcgta tgctgctacc gaggccgcca acgcggccgc ggccggctaa 2021641 aggagtcact gcgatggatt ttggggcgtt gccgccggag gtcaattcgg tgcggatgta 2021701 tgccggtcct ggctcggcac caatggtcgc tgcggcgtcg gcctggaacg ggttggccgc 2021761 ggagctgagt tcggcggcca ccggttatga gacggtgatc actcagctca gcagtgaggg 2021821 gtggctaggt ccggcgtcag cggcgatggc cgaggcagtt gcgccgtatg tggcgtggat 2021881 gagtgccgct gcggcgcaag ccgagcaggc ggccacacag gccagggccg ccgcggccgc 2021941 ttttgaggcg gcgtttgccg cgacggtgcc tccgccgttg atcgcggcca accgggcttc 2022001 gttgatgcag ctgatctcga cgaatgtctt tggtcagaac acctcggcga tcgcggccgc 2022061 cgaagctcag tacggcgaga tgtgggccca agactccgcg gcgatgtatg cctacgcggg 2022121 cagttcggcg agcgcctcgg cggtcacgcc gtttagcacg ccgccgcaga ttgccaaccc 2022181 gaccgctcag ggtacgcagg ccgcggccgt ggccaccgcc gccggtaccg cccagtcgac 2022241 gctgacggag atgatcaccg ggctacccaa cgcgctgcaa agcctcacct cacctctgtt 2022301 gcagtcgtct aacggtccgc tgtcgtggct gtggcagatc ttgttcggca cgcccaattt 2022361 ccccacctca atttcggcac tgctgaccga cctgcagccc tacgcgagct tcttctataa 2022421 caccgagggc ctgccgtact tcagcatcgg catgggcaac aacttcattc aggcggccaa 2022481 gaccctggga ttgatcggct cggcggcacc ggctgcggtc gcggctgctg gggatgccgc 2022541 caagggcttg cctggactgg gcgggatgct cggtggcggg ccggtggcgg cgggtctggg 2022601 caatgcggct tcggttggca agctgtcggt gccgccggtg tggagtggac cgttgcccgg 2022661 gtcggtgact ccgggggctg ctccgctacc ggtgagtacg gtcagtgccg ccccggaggc 2022721 ggcgcccgga agcctgttgg gcggcctgcc gctagctggt gcgggcgggg ccggcgcggg 2022781 tccacgctac ggattccgtc ccaccgtcat ggctcgccca cccttcgccg gatagtcgct 2022841 gccgcaacgt attaacgcgc cggcctcggc tggtgtggtc cgctgcgggt ggcaattggt 2022901 cggcgccgag atctcggtgg gttatttgcg gtgggatttt ttcccgaagc cgggttcagc 2022961 accggatttc ctaacggtcc cgcgactcaa cggcaccgcg ccgtcagcaa gttccggtgg 2023021 tgttgatcgc ggtatccatg caggtggtga tggcgcggcg agactggtcg tgtgcgctga 2023081 agcacagggt acttggcggt tgtggctccc gggatgtagc tggccgccca acgtcccgca 2023141 gcgtcggggt cagcggcgga gcagcacggc gatttagcct cacaaccgag cagctagctc 2023201 gcgtttccca gcggctcaat ccccgtcgag ccattgaaag gcacctcaga tgtcgtttgc 2023261 gactccgcaa ccggagaaag ggttcggaat ggacttcggg gcgttaccgc cggagatcaa 2023321 ttcgggccgt atgtattgcg gtccggggtc ggggccgatg ctggctgcgg ccgcggcctg 2023381 ggacggggtg gccgtggagt tggggttggc tgcgaccggt tatgcgtcgg tgatagccga 2023441 gctgaccggt gcgccgtggg tgggtgcggc gtcgttgtcg atggtggcgg cggccacgcc 2023501 gtatgtggcc tggctgagcc aagccgcggc gcgggccgag caggcgggga tgcaggccgc 2023561 ggcggccgcg gcggcttatg aggccgcttt tgtgatgacg gtgccgccgc cggtgattac 2023621 ggcgaatcgg gttttggtga tgacgctgat tgcgaccaat tttttcggtc agaactcggc 2023681 ggcgatcgcg gtcgctgagg cgcagtacgc cgaaatgtgg gcgcaagacg ccgttgctat 2023741 gtatggctat gcggctgcgt cggcgagcgc gtcgcggttg attccgttcg cggcgccgcc 2023801 gaagaccacc aactccgctg gggtggtcgc acaggcggtt gcgtcggtca gctggtcgaa 2023861 tcccaatgat tggtggctcg tgcggttgct gggctcgatt acccccacgg aaaggacgac 2023921 gatcgttcgt ttgctcggtc agtcgtactt ggcgacgggc atggcgcggt ttcttacctc 2023981 gatcgcacag cagctgacct tcggcccagg gggcacaacg gctggctccg gcggagcctg 2024041 gtacccaacg ccgcaattcg ccggcctggg tgcaggcccg gcggtgtcgg cgagtttggc 2024101 gcgggcggag ccggtcggga ggttgtcggt gccgccaagt tgggccgtcg cggctccggc 2024161 cttcgcggag aagcctgagg cgggcacgcc gatgtccgtc atcggcgaag cgtccagctg 2024221 cggtcaggga ggcctgcttc gaggcatacc gctggcgaga gcggggcggc gtacgggcgc 2024281 cttcgctcac cgatacgggt tccgccacag cgtgattacc cggtctccgt cggcgggata 2024341 gctttcgatc cggtctgcgc ggccgccgga aatgctgcag atagcgatcg accgcgccgg 2024401 tcggtaaacg ccgcacacgg cactatcaat gcgcacggcg ggcgttgatg ccaaattgac 2024461 cgtcccgacg gggctttatc tgcggcaaga tttcatcccc agcccggtcg gtgggccgat 2024521 aaatacgctg gtcagcgcga ctcttccggc tgaattcgat gctctgggcg cccgctcgac 2024581 gccgagtatc tcgagtgggc cgcaaacccg gtcaaacgct gttactgtgg cgttaccaca 2024641 ggtgaatttg cggtgccaac tggtgaacac ttgcgaacgg gtggcatcga aatcaacttg 2024701 ttgcgttgca gtgatctact ctcttgcaga gagccgttgc tgggattaat tgggagagga 2024761 agacagcatg tcgttcgtga ccacacagcc ggaagccctg gcagctgcgg cggcgaacct 2024821 acagggtatt ggcacgacaa tgaacgccca gaacgcggcc gcggctgctc caaccaccgg 2024881 agtagtgcct gcagccgccg atgaagtatc agcgctgacc gcggctcagt ttgctgcgca 2024941 cgcgcagatg taccaaacgg tcagcgccca ggccgcggcc attcacgaaa tgttcgtgaa 2025001 cacgctggtg gccagttctg gctcatacgc ggccaccgag gcggccaacg cagccgctgc 2025061 cggctgaacg ggctcgcacg aacctgctga aggagagggg gaacatccgg agttctcggg 2025121 tcaggggttg cgccagcgcc cagccgattc agctatcggc gtccataaca gcagatgatc 2025181 taggcattca gtactaagga gacaggcaac atggcctcac gttttatgac ggatccgcat 2025241 gcgatgcggg acatggcggg ccgttttgag gtgcacgccc agacggtgga ggacgaggct 2025301 cgccggatgt gggcgtccgc gcaaaacatt tccggtgcgg gctggagtgg tcaggccgag 2025361 gcgacctcgc tagacaccat gacccagatg aatcaggcgt ttcgcaacat cgtgaacatg 2025421 ctgcacgggg tgcgtgacgg gctggttcgc gacgccaaca actacgaaca gcaagagcag 2025481 gcctcccagc agatcctgag cagctagcgc cgaaagccac agctgcgtac gctttctcac 2025541 attaggagaa caccaatatg acgattaatt accagttcgg ggacgtcgac gctcatggcg 2025601 ccatgatccg cgctcaggcg gcgtcgcttg aggcggagca tcaggccatc gttcgtgatg 2025661 tgttggccgc gggtgacttt tggggcggcg ccggttcggt ggcttgccag gagttcatta 2025721 cccagttggg ccgtaacttc caggtgatct acgagcaggc caacgcccac gggcagaagg 2025781 tgcaggctgc cggcaacaac atggcgcaaa ccgacagcgc cgtcggctcc agctgggcct 2025841 aaaactgaac ttcagtcgcg gcagcacacc aaccagccgg tgtgctgctg tgtcctgcag 2025901 ttaactagca ctcgaccgct gaggtagcga tggatcaaca gagtacccgc accgacatca 2025961 ccgtcaacgt cgacggcttc tggatgcttc aggcgctact ggatatccgc cacgttgcgc 2026021 ctgagttacg ttgccggcct tacgtctcca ccgattccaa tgactggcta aacgagcacc 2026081 cggggatggc ggtcatgcgc gagcagggca ttgtcgtcaa cgacgcggtc aacgaacagg 2026141 tcgctgcccg gatgaaggtg cttgccgcac ctgatcttga agtcgtcgcc ctgctgtcac 2026201 gcggcaagtt gctgtacggg gtcatagacg acgagaacca gccgccgggt tcgcgtgaca 2026261 tccctgacaa tgagttccgg gtggtgttgg cccggcgagg ccagcactgg gtgtcggcgg 2026321 tacgggttgg caatgacatc accgtcgatg acgtgacggt ctcggatagc gcctcgatcg 2026381 ccgcactggt aatggacggt ctggagtcga ttcaccacgc cgacccagcc gcgatcaacg 2026441 cggtcaacgt gccaatggag gagatgctag aggcaacgaa gtcgtggcag gaatcggggt 2026501 ttaacgtctt ctccggcgga gatctgcgcc gaatgggcat cagtgccgcg acggtggccg 2026561 cgctggggca ggcgttgtcg gatcccgcgg ccgaggtcgc agtgtatgcg cgacagtacc 2026621 gagacgacgc caagggcccc agcgcctcgg tgttgtcgct gaaagacggc tccggtggac 2026681 gcatcgcgct gtatcagcag gcgcgaacgg caggttccgg cgaggcgtgg ctggctatct 2026741 gcccggctac cccgcagttg gtgcaagtag gagtgaagac cgttttggat acactgccct 2026801 acggcgagtg gaaaacacac agcagagtat gacgccaggg cgtgaaaccc gaagtacaac 2026861 aacaaatttg agcatcagat acaacccaga tacgtacagg gcaaattgct ctagaatcga 2026921 ctgcaatact gcaaggcaag gtcaaccaca acgatttggt cgcgaggcaa ggcaaatgga 2026981 atcggagtta gtcgagccgc agctcccggt gggctaccgc gcctcggtgc ctacaccgac 2027041 ggagctcccc gcgccactga agccacggtg taacacgttt gccatggcag ggggtacagg 2027101 acgatgaccg cagtagctga cgcacctcag gctgacattg agggtgtggc atcgccccag 2027161 gctgtcgtcg tgggcgtcat ggccggcgaa ggcgtccaga tcggcgtcct gctggatgcc 2027221 aacgccccag tttcggtgat gaccgacccg ctgctgaaag tggttaatag tcggctcaga 2027281 gagctcggtg aggctccact ggaagccact ggacgcggcc gatgggcgct gtgtctggtg 2027341 gacggcgcgc cgttgcgtgc tacccagtcg ctgaccgaac aagacgtcta tgacggcgac 2027401 cggctgtgga ttcggttcat cgcagacacc gaacgtcgct cccaagtcat cgaacatatc 2027461 tccaccgcag tcgcctcgga tctcagcaag cggttcgcca ggatcgaccc gatcgttgct 2027521 gtgcaggtcg gggcgtcgat ggtggcgacc ggggttgttc ttgccaccgg ggtgctcggc 2027581 tggtggcgct ggcatcacaa cacctggttg accaccatct acaccgcggt gattggtgtg 2027641 ctggtgctgg cggtcgccat gttgctgttg atgcgtgcca agacggacgc ggatcgacgc 2027701 gtcgccgaca tcatgctgat gagcgcgatc atgcccgtga cggtggcggc ggcagcggcc 2027761 ccgcccggcc cggtgggctc cccgcaggcc gtgttgggct tcggagtgct gaccgtcgct 2027821 gcggccctgg ccctgcggtt caccggtcgc cgcctgggga tttacaccgc aatcgtcatc 2027881 atcgatgcgc tgacaatgct tgcagccttg gcgcggatgg tcgcggccac aagcgcggtg 2027941 acgctgttgt cgtccttgtt gttgatttgc gtagtggcct accacgcggc gccggcactg 2028001 tctcggcggc tggccggcat ccgactgccg gtgttcccgt ccgccaccag ccggtgggtc 2028061 ttcgaggctc ggcccgacct accgaccacc gtggtggtgt ccggtggcag cgcaccggtc 2028121 ttggaagggc cgtcatcggt gcgtgatgtg ctgctgcaag ctgagcgcgc tcggtcgttc 2028181 ttgagcggcc tgctaacggg acttggcgtg atggtggtgg tgtgcatgac atcgttgtgc 2028241 gacccgcaca ccgggcaacg ttggctgccg ctgatactgg ccggatttac ctcgggcttc 2028301 ctgctgttgc ggggccgctc ctacgtcgac cgttggcagt cgattaccct ggccggaact 2028361 gcggtgatca tcgctgctgc ggtgtgtgtg cggtacgcgc tggaattgtc ctcgccgttg 2028421 gctgtgtcca ttgtcgccgc gatcctggtg ctgctgccgg cggcgggcat ggcagctgct 2028481 gcacatgtgc cccacaccat ctacagtccg ctattccgca agtttgtgga atggattgaa 2028541 tacctctgcc tgatgccgat cttcccgctg gcgttgtggt tgatgaacgt ctatgcagcg 2028601 attcggtacc ggtagcagca ggtcgtggtg tggtcgcgcg ggtaccgcga ccattgccgc 2028661 agtcttgcta gcttcgggcg cgctgaccgg ccttccgcca gcgtatgcaa tttcgcctcc 2028721 gacgatcgat ccgggcgcgc tgccacccga cgggccgccc ggaccgctgg cgcccatgaa 2028781 gcagaacgcc tactgcaccg aggtcggggt cttgcccggc accgactttc agctgcagcc 2028841 aaaatatatg gagatgctga acctgaacga ggcttggcag ttcggccgcg gcgacggtgt 2028901 gaaggtcgct gtcatcgaca cgggtgtgac tccacatccc cggttgccgc gtctgatccc 2028961 tggcggcgac tacgtgatgg ccggtggcga cggtctgtcg gactgcgacg cccacggcac 2029021 cctggtggcg tcgatgatcg cggcggttcc ggcgaacggg gcggtaccgc tgccgtcggt 2029081 accgcgcagg ccggtcacca ttcccacgac cgaaacgccg ccgccgccac agacggtgac 2029141 cctttcaccg gtaccgccgc agaccgtgac cgtgattccg gctccacctc ccgaggaagg 2029201 agttccgccg ggcgcaccgg tgccaggacc ggagccgccg ccggctcctg gtccacagcc 2029261 gccggccgtg gaccgcggtg gcggcacggt gacagtaccc agctactccg ggggccgcaa 2029321 gatagccccg atcgacaacc cgcgtaatcc gcacccgagt gcgccatcgc cagcgctggg 2029381 accaccgccg gacgcgttca gtgggatcgc ccccggtgtc gagataatct ccatccgcca 2029441 gtcaagccag gccttcggcc ttaaggaccc ttacactggg gacgaagacc cgcagacggc 2029501 gcaaaagatc gacaacgtcg agacaatggc gcgcgcgatc gtgcatgctg ccaacatggg 2029561 tgcttcggtg atcaatatct ccgatgtgat gtgcatgagt gctcgtaatg tcatcgacca 2029621 gcgtgcactg ggtgccgcgg tgcactacgc cgcggtcgac aaggacgcgg tcatcgtggc 2029681 tgcagcgggc gacggcagca agaaggactg taagcagaac ccgatttttg atcccttgca 2029741 gcccgacgat ccacgcgctt ggaacgcggt caccacggtg gtgacaccct cgtggttcca 2029801 cgactacgtc ctgacggtcg gagcggttga cgccaacggt caaccgctca gcaaaatgag 2029861 tatcgcggga ccctgggtct ccatttcggc gccgggaacc gacgtcgtcg gactctcgcc 2029921 ccgtgacgac ggcctgatca atgcgattga cggcccggat aattcgttgc tggttccggc 2029981 tggcaccagt ttttccgccg cgatcgtgtc cggggtggct gcgctggtac gtgctaagtt 2030041 ccccgaattg tcggcgtacc aaatcatcaa tcggctgatt cataccgccc ggccacccgc 2030101 tcgcggcgtc gacaaccagg tcggctacgg tgtggtcgac ccagtggcag cactgacttg 2030161 ggatgtgccc aaaggcccgg ccgagccgcc caagcagctg tcagcgccgt tggtggtgcc 2030221 gcagccgccc gccccccgcg atatggtgcc gatatgggtg gccgccgggg gattggccgg 2030281 ggcactattg ataggcggtg cggtgttcgg taccgcgacc ttgatgcggc gatcacggaa 2030341 gcagcaatga aggctcagcg cagcttcggg ttggcgttgt cgtggccgcg ggtgaccgcg 2030401 gtgtttctgg tggatgtcct gatcttggcg gtggccagtc attgcccgga ttcctggcag 2030461 gccgatcatc atgtggcgtg gtgggtcggc gtcggcgtgg cggccgtagt gacgttactg 2030521 tcggtggtca gttaccacgg catcacggtg atttcgggtt tggcgacgtg ggtgcgggat 2030581 tggtcggcgg atccgggcac gacactgggt gcggggtgca ctccggcaat cgaccaccag 2030641 cgccgttttg ggcgtgacac ggtaggggtg cgtgagtata acggccggct ggtctcggtg 2030701 atcgaggtca cctgcggtga gagcggcccg tcgggtcggc attggcaccg gaaatcgccg 2030761 gtacccatgt tgccggtggt cgcggtcgcc gatggtttgc gccagttcga cattcacctc 2030821 gatggcatcg acatcgtgtc ggtgctggtg cggggcgggg ttgatgctgc taaagcttcg 2030881 gcctcgctgc aggagtggga gccgcagggc tggaaatccg aagaacgagc cggtgatcgc 2030941 actgtcgccg atcggcgccg cacctggttg gtgttacgga tgaatccgca gcgaaatgtg 2031001 gctgcggtgg cgtgtcgtga ctcgttggcg tcgacgctgg tggcagccac cgagcggttg 2031061 gtccaggatc tggatgggca aagttgtgcg gcccggccgg tgacggccga tgagctgacc 2031121 gaggtcgaca gcgccgtgtt ggctgacttg gaaccgacat ggagtcgccc cggttggcgt 2031181 cacctcaagc atttcaatgg ttatgcgacc agtttttggg ttacgccgtc agacatcacg 2031241 tcggagacct tggatgagct gtgtctgcca gatagccccg aagtcgggac gaccgtggtc 2031301 acggtgcgtc tgaccactcg ggtcgggtcg cccgcgctat cggcatgggt gcgttatcac 2031361 agcgacacgc gcctgcccaa ggaggtagcg gccggactca accggctcac cggtcgccag 2031421 ttggccgcgg tgcgtgccag cctgccggcc ccgacgcacc gtccactcct ggtcatcccc 2031481 agtcggaacc tgcgtgacca cgacgagctc gtgctgccgg tgggccagga actcgagcac 2031541 gcgacaagct cgtttgtggg gcaatgacac gcccgcaggc cgccgccgaa gatgcccgca 2031601 acgccatggt cgccggtctg ctggcatcgg ggatctccgt caatggactg cagcccagcc 2031661 ataacccgca ggtggccgcc caaatgttca ccacggcgac caggctggat cccaagatgt 2031721 gtgatgcctg gctggctcgg ctgctggccg gcgaccagag catcgaagtg ctcgccggcg 2031781 catgggctgc ggtgcggact ttcggctggg aaacccgccg cctcggcgtg acggatctgc 2031841 agttccgccc cgaggtgtcc gacgggctat tcctgcgact ggcgattacc agcgtagatt 2031901 cgctggcctg cgcttacgcg gcggtcctcg ccgaggccaa gcgttaccag gaggcggcag 2031961 agctgctcga cgccaccgat cctcgccatc cgttcgacgc cgagctggtg agttacgtgc 2032021 ggggcgtgct gtacttccgc accaaacgct ggcctgacgt tcttgcgcag ttccccgagg 2032081 caacgcagtg gcgtcacccc gagctaaagg ccgcgggggc ggcgatggcc accacggcgc 2032141 tggcgtcgct cggggtgttc gaagaggcct ttcggcgcgc tcaggaagca atcgaaggtg 2032201 accgggtgcc gggcgcggct aacatcgcct tgtacaccca aggcatgtgc ctgcggcacg 2032261 tcggccgtga ggaggaagct gtcgaactcc tgcgccgcgt gtattcgcgc gatgcgaagt 2032321 tcaccccggc ccgcgaggcg ctggataacc ccaactttcg gctgatcctc accgacccgg 2032381 aaacgattga ggcgcgcaca gatccgtggg atccggacag tgcgccaacc cgcgctcaga 2032441 ccgaggccgc ccgccatgcc gagatggccg cgaagtactt ggccgaaggg gatgccgagc 2032501 tcaacgcgat gcttggcatg gagcaggcca agaaggagat caagctcatc aagtcgacga 2032561 cgaaggtgaa tttagcgcgt gccaagatgg ggcttccggt cccggttacg tcgcgccaca 2032621 ccttgttgct cgggccgccc ggtaccggga agacttcggt cgcaagggct ttcaccaagc 2032681 agctgtgcgg gttgacagtg ctgcgcaagc cgctggtggt ggagaccagc cgcaccaagc 2032741 tgttgggccg gtacatggcc gacgccgaga agaacaccga ggagatgctc gaaggggcgt 2032801 tgggcggtgc ggtcttcttt gacgagatgc acactctgca tgagaagggc tactcccagg 2032861 gcgacccgta cggtaacgcg atcatcaaca cgctgctgtt gtacatggaa aatcaccgtg 2032921 acgagctggt ggtgtttggt gcgggttacg ccaaagcgat ggagaaaatg ctcgaggtga 2032981 atcagggtct gcgccggcgc ttttcgacgg tgatcgagtt cttcagctac accccgcagg 2033041 agctgatcgc actgacccag ctgatgggtc gggagaacga agacgtgatc actgaggaag 2033101 agtctcaagt gttgttgccg tcgtatacca agttctacat ggagcagagc tactccgagg 2033161 acggcgacct gatccgcggg atcgatctgt tgggcaatgc cggctttgtg cgcaacgtgg 2033221 tggagaaggc ccgcgaccac cgtagtttcc gtttggacga tgaggatctc gacgccgtac 2033281 tggccagcga tctcaccgaa ttcagcgagg atcagctgcg ccgattcaag gagttgactc 2033341 gcgaggacct ggccgaaggg ctgcgcgctg cggtcgcgga gaagaagacg aagtaggcac 2033401 tcttttcgtc ggtgtcactg gctactttga cctgaacagt cggcggtggg tgagtggtct 2033461 gtggttggcg aatgaggcgg ggcggggcgg agactggtcc agatggtgtc cgtgcacgcg 2033521 ggggagggtg tggtgttcag ccgctcaggg cggggtacgt gccgtctcaa tccgtgctgt 2033581 gtccaaattg tttacaatta acggtggtgc cacaccttaa attccaaatg taaatatatt 2033641 tgacgtcggt caaaaatccc acgtttggca caagtatcgg tggcgcgttg ccaagtcatt 2033701 aggcaatcga gcggactccc gggcatggaa atgcgtgtct ttcgtttgtg ggtgtccggt 2033761 atccagacag catcgcttgc gcctcgacta caggtttgct actaaaattc ctatgcgcca 2033821 tagtgattga gaagggccac gcccccttcg tgtgacgcac ggcgggcgac ggcggcgccg 2033881 tgcccggcat tggttgggtg tcaatgaggc ttcaaggata tctaccaaat ttcccagaaa 2033941 tatttcacgg aggccgcaat ggagctagca tttaatcggc gtacggtcag gccaatatat 2034001 cgaaacatga gaggaatgat cgatgagcgt caagagtaag aacggtcgtc tcgccgctcg 2034061 ggtactggtg gcactggcgg ccctgtttgc gatgatcgcg ctgacgggct cagcatgtct 2034121 ggcagagggt cccccgcttg gccgcaaccc tcagggggca ccggctccgg tgggtggcac 2034181 tgtgatcgtc gcgccgatgc acagcggcgt ctgaccgccc cgttcgggat ctgtacgcac 2034241 tttcatccga ctgcgcggtt gtttgttagc gcatcggatg aaagtgtgcc gtctcggctg 2034301 aggaaggacc gtcgcgatgc tgccgaattt cgcggtgctg ccccccgagg tcaattcggc 2034361 gagggtgttc gccggtgcgg ggtcggcgcc gatgttagcg gcagcggccg cctgggatga 2034421 tctagcctcc gagctgcatt gtgctgcaat gtcattcggg tcggttacgt cgggattggt 2034481 ggttgggtgg tggcagggat cggcgtcggc ggcgatggtg gacgcagccg cgtcgtacat 2034541 cgggtggctg agcacgtcgg ctgcccacgc cgagggcgcg gccggtctgg ctcgggccgc 2034601 ggtatcggtg ttcgaggagg cgctggccgc gacggtgcat ccggcgatgg ttgcggcaaa 2034661 tcgcgcccag gtggcgtcgc tggtagcgtc gaacttgttt gggcagaacg cgcctgcgat 2034721 cgccgcgctc gaatccttgt atgagtggat gtgggcccag gatgcagcgg ccatggcggg 2034781 ttattacgtt ggggcttcgg cggtggccac acagttggca tcgtggctgc aacggctaca 2034841 gagcatcccc ggcgccgcca gtcttgatgc ccgtctgccg agctcggccg aggcaccgat 2034901 gggagtcgtc cgcgcggtca acagcgcgat cgccgccaat gcggctgcgg cacaaaccgt 2034961 tggcctggtc atgggaggca gcggcacgcc aataccgtcg gccagatatg tcgagctcgc 2035021 gaacgcgctg tacatgagtg gcagcgtccc gggtgttatc gcgcaggcgc tcgtcacgcc 2035081 ccaagggctc tacccggtgg tcgtgatcaa gaacctcact ttcgattcct cggtggcgca 2035141 gggtgccgtc attctcgaaa gtgcgattcg gcagcaaatt gccgccggca acaacgtcac 2035201 cgtcttcggc tactcgcaga gcgccacgat ctcgtcacta gtgatggcca atcttgcggc 2035261 ttcggccgac ccgccgtctc cagacgagct ttccttcacg ctgatcggca atcccaacaa 2035321 ccccaatggc ggggttgcca ccaggttccc ggggatctcc tttccaagct tgggcgtgac 2035381 ggccaccggg gccactccgc acaatctgta cccgaccaag atctacacca tcgaatacga 2035441 cggcgtcgcc gactttccgc ggtacccgct caactttgtg tcgaccctca acgccattgc 2035501 cggcacctac tacgtgcact ccaactactt catcctgacg ccggaacaaa ttgacgcagc 2035561 ggttccgctg accaatacgg tcggtcccac gatgacccag tactacatca ttcgcacgga 2035621 gaacctgccg ctgctagagc cactgcgatc ggtgccgatc gtggggaacc cactggcgaa 2035681 cctggttcaa ccaaacttga aggtgattgt taacctgggc tacggcgacc cggcctatgg 2035741 ttattcgacc tcgccgccca atgttgcgac tccgttcggg ttgttcccag aggtcagccc 2035801 ggtcgtcatc gccgacgctc tcgccgccgg gacccagcag ggaatcggcg atttcgccta 2035861 cgacgtcagc cacctcgaac tgccgttgcc ggcagacggg tcgacgatgc caagcaccgc 2035921 accgggctcg ggtacgccgg tccccccgct ctcgatcgac agcctgatag acgacctgca 2035981 ggtggctaac cgcaacctcg ccaacacgat ttcgaaggtg gccgcgacga gctacgcgac 2036041 ggtgctccca accgccgaca tcgccaatgc ggcgttgacg atcgtgccgt cgtacaacat 2036101 ccaccttttt ttggagggca tccagcaagc gctcaagggc gacccgatgg gactcgtcaa 2036161 cgcggtcgga tacccactcg cggccgacgt ggcactgttc acggccgcag gcggtcttca 2036221 gctcttgatc atcatcagcg cgggccgaac gattgccaat gacatctcgg ccattgtccc 2036281 ctgatcgtgt tttgcgtgaa ctttaaagcg ttgtgctgag gtatgttccg ctcgcgtgtg 2036341 gggcggcccg cgcgaccacc tatgcatgag cgccaatggt cgagacaact acctgcgcgg 2036401 tcatcgggcg gccacccaga gggcatggtt ctcgggctgc tactggctcg cgtgcttcca 2036461 tcgagcgtga atacatgccg ccaaatcggc agtcggcgcc gctggcgtgc cgctagctga 2036521 tcacaaagcg ccgataccga tgcggctggc catagcaatg ccaatgttgg cgaatagatc 2036581 tcacgcgcgg cccaagccaa cagcgaggtg atggtgatca ttctttacgt tgcgattacc 2036641 tcgccggaac gtgacacgag caatactcgc caaccatgat cgccagatat ttggaacggg 2036701 tttgggtcca gcggccgcca aaaaccgact cgccgccgtc cctgacaact cagcggcgag 2036761 aggtgaacac gggtgatttg tcactacggg ccgctgcggt tcctgcgctg ccagggggcc 2036821 gcgagtgcga ttccggcgag ccacgcgatt agggattaag cgaaatggat ttcgggttgt 2036881 taccgccgga gatcaactca ggcaggatgt atacggggcc ggggccgggg cccatgctgg 2036941 ccgccgcgac agcctgggac gggctggctg ttgagctgca cgcaacagcg gctggctacg 2037001 cctcggagct atcggctttg accggggcat ggagcggtcc ttcgtcgacg tccatggcat 2037061 ctgcagccgc accctatgtg gcatggatga gcgccaccgc agtgcatgcc gagctggcgg 2037121 gcgcgcaagc caggttggcg atagctgcct atgaagctgc gttcgctgcc accgtgcctc 2037181 cgccggtgat cgccgctaat cgtgcccaac tgatggtgtt gatcgcgacg aacatcttcg 2037241 ggcagaacac gccggcgatc atgatgactg aggcccaata catggaaatg tgggcgcagg 2037301 atgccgccgc gatgtacggg tacgccggct cgtcagcgac cgcctcgcga atgacagcgt 2037361 tcactgagcc gccgcaaacc actaaccatg gtcagttggg ggcccagtcc tccgccgtcg 2037421 cacaaaccgc cgccaccgcg gccggcggca acctgcaatc ggcattcccg cagctgctct 2037481 ccgcggttcc ccgcgccctg caaggcctgg cattgccgac cgcatcacag tcggcatcgg 2037541 cgacgccgca gtgggttacc gacctgggga acctgtccac cttcctgggc ggggcggtca 2037601 ccggcccgta cacctttccc ggggtattgc ctccctccgg ggtgccatac ctgttaggca 2037661 ttcagagcgt cttggtaacc caaaacgggc agggggtaag cgccttgctt ggcaagatcg 2037721 gggggaaacc aatcaccgga gcgttggctc cgctggccga atttgctttg catacaccaa 2037781 ttttgggttc ggagggcttg ggtggtggat cggtttccgc gggtattggc cgggcaggct 2037841 tggtcggaaa gctatcggtg cctcagggct ggacggtggc cgccccggag atcccatcgc 2037901 cggcggcggc gttgcaggcg acgcgcctgg ccgccgcgcc gattgcggcc accgacggcg 2037961 cgggtgcgtt gctcggtggc atggcgctgt cgggcttggc tggccgcgct gccgccggtt 2038021 ctaccggcca ccccatcggc agcgccgcag cacccgccgt cggtgccgct gccgctgccg 2038081 tcgaggacct ggccaccgaa gccaacatct tcgtgatacc ggccatggac gactagcgcc 2038141 atgtcacggg agagaaggtt gtcgacactt ttgcgaccag cgccggttcg gtatgtggcc 2038201 accggggctg ccaatggggt tacggcccgt taaggaggga tgcggtaatg gatttcgggg 2038261 tgttaccacc ggagatcaat tccgggcgca tgtatgccgg tcccgggtcg ggtccgatgc 2038321 tggccgcggc agcggcctgg gacgggctgg ccaccgaatt acagtccacg gcggccgact 2038381 atggctcggt gatctcggtt ctgaccggcg tgtggtcggg acagtcgtcg gggaccatgg 2038441 cggctgcggc cgcaccgtat gtggcgtgga tgtcggccac ggcggcgctc gctcgggaag 2038501 cggccgccca ggccagcgcg gcagcggcgg cctacgaggc agcgtttgca gccacggtgc 2038561 cgccgccggt cgtcgcggcc aaccgcgccg agctggcggt gttggcggcg accaacattt 2038621 tcggtcagaa caccggtgcg atcgcggccg ccgaagcccg ctatgcggaa atgtgggcgc 2038681 aagacgcagc cgcgatgtat ggctatgccg gctcgtcgtc ggtggcgacc caggtgacgc 2038741 catttgctgc accgccgccg accaccaacg cggccggact ggccacccaa ggcgttgcgg 2038801 ttgcccaggc tgtcggcgcg tcggccggca acgcgcgctc actggtgtcc gaggtgctgg 2038861 aattcctggc aacggccggg acgaactaca acaagacggt ggccagcctg atgaacgcgg 2038921 tcaccggggt gccgtacgca tcttcggtgt ataacagcat gctcgggctt ggcttcgctg 2038981 agtcaaaaat ggtcctgccg gctaacgaca ccgtaatatc gaccatcttc ggcatggtgc 2039041 agttccagaa gttcttcaat ccggtgacgc ccttcaatcc cgatttgatc ccgaaatctg 2039101 ctctaggggc cgggcttggc ctgcggtctg cgatctcgag tggtctgggc tcgaccgcgc 2039161 cagcgatatc ggcgggtgcg agccaggccg gctcggtcgg ggggatgtcg gtgccgccga 2039221 gctgggcagc ggccaccccg gcgatccgga cggttgccgc tgtgttctcg agcaccggac 2039281 ttcaggctgt cccggcggcc gcaattagcg agggcagtct gctcagccag atggccctgg 2039341 cgagtgtggc cggaggggcc cttggcggcg ccgctgcacg cgccactggt ggtttcctcg 2039401 gcggaggccg agtcaccgcg gtcaagaaat ctctcaagga cagcgactca ccggacaagc 2039461 tgcggcgggt ggtcgcgcac atgatggaga agcccgaatc ggtgcagcac tggcacaccg 2039521 acgaggacgg gctcgatgat ctactcgcgg aattgaagaa gaaaccgggc atccacgccg 2039581 tgcacatggc cggcggcaac aaggctgaaa ttgcaccgac gatatcagaa tcgggctagg 2039641 gcagggttag ggcgtgtctt ccaattgata ggccccgagg cagacacgag tcgccagacc 2039701 gcaccattgc ttgagttggt tgatgccctt gagatcggaa cccgaatccc acagcaggag 2039761 aattagtttc gtccccagac cggcggctac ggctgcccgt tctgcccagg caaaccgatc 2039821 aatccgccct tgccgccttg gcccccgggt gcgggtggga caccatctcc gccgtcgcca 2039881 ccggtaccga tcagcagggc ggcgttacca ccgtcaccgc cggcaccgcc agtgcccgca 2039941 ctgccgccgg ttccgccggc cccaccggta ccgccacttc ccccgggtcc gccgttgccg 2040001 atccccaggc cgcttgcccc gccttggcca ccatcgccgc cgttgccgcc gtcgttgccc 2040061 gacccgccga cgccgccggc cccgccgatg ccgccggccc cgccgctacc gaacagtagg 2040121 cctccgctgc cgcccgcgcc accgtcgctg ccgtcacctc cggcttcctg aatgatgttg 2040181 cctactccgg tcccaccggt cgccccattc cctccggccc cgccgttgcc gatcagaatg 2040241 gcggcgccac cctggccggc gctgccttcg aagccggtac ctccgccgct gccggcgtta 2040301 ccaccgttgc cgccattgcc gtatagcacc ccaccttggc cgccgtcacc ccctgatgca 2040361 ccaaattgaa tgctgaggct gccggcccca acgtcaccac cgttgccacc gtcaccgccg 2040421 tgaccaaaca acccaaaacc ctggacacta atcggtccga agttaacggc acctacccag 2040481 ccgccaccgc cagcagagcc gccaaagccc gcaaatccgt tggtgcctgc gtcaccgccc 2040541 tgacccccgt tgccgccgga cgcgccgctg ccgaacaacc acccgccgtt gccgccgtcg 2040601 ccgcctgagc caccgacccc gccgctcccg ccgaagatgg tagtacccag cgcagacccg 2040661 gccgctccat tcccgccgtt gccaccggcc ccgccgttgc cgactaggcc cgcgtcaccg 2040721 ccgttaccgg ctatcccggc cgaaccgccg gaagcgccgt tcatgacgcc cgtttgagtg 2040781 gagtcggtgc cgccgccggc gccgctgtcc ccaccttgcc ccccgttgcc gatcagccac 2040841 ccgccctggg cgccgttgcc cccgttgcct cctaatccgc cgctgccggc cgaatctccg 2040901 gctgtgtcat tagtgccttg tcctcccatg ccaccgaccg agccgttgcc gccgttgccg 2040961 aacaacagtc caccgcgacc acccgaaccc ccgtttccgc cggcgccccc atcgaagcct 2041021 gccgacgcac ccccgggcgc tgtggcgttg cccccgtttc cgccggcccc accatgtcca 2041081 ccgtgaccat agatccatcc accggcgccg ccgttgccgc cggacccgcc cgctttcccg 2041141 ggttgaccgg ctgcgccgtt ggcacccgcc ccgccctggc cgccattgcc gccgctaccc 2041201 catagcccag ccgacccgcc gttgccgcca ttttggcccg tcccgccgga cccgccgttg 2041261 ccaccgttgc cgtatagcca tccgccatcg ccaccgtttt gcccggttcc tgcgaccccg 2041321 ttggcaccgt tgccgataag cggacgcccg gtcagcgtct gcacgggtga attgatggca 2041381 tcgagggcgg tttggcccac gatttgcaat ggtgacgagt tggctgcctc ggcgctggcg 2041441 tactaggccg cccccgcgct catgagctgg acgaactgct catggaatgc gaccgcgtgg 2041501 gcactgagct cctggtatgc ctgcccgtgc gttgcgaaca gcgccgcaat cgcagccgac 2041561 acgtcgtcag cgcccgcagg cagtaacgcc gttatcggga ccaacgcttc ggcgttggcc 2041621 cggctaatcg ccgaaccaat agttgctagg tcctttgccg ctgcatcgac aaacgccggc 2041681 gccacgatca tctgcgacgt ccacacctcc tggccgttgt cgccgcatgg ggaatccata 2041741 cgaccgccaa aggaattttg gaaccgacgc caacgttaca gttttgcgga cccgctatgg 2041801 ggtgcattca ccagattcac tggcaacgat gtgaaccccg tgtcacccca agcggggtca 2041861 atccactgat tactctctag cccaaactat ttcgcgctga cgctggtttt agtgatctgg 2041921 tgggggcaat agacatgcgc ggagatcgca gcgaacttgc aacaaccgtc catcgaaaac 2041981 ccgggattgc gggtccgcag ctcgttgacg acctgaagac ccgattcgcc gctttcgact 2042041 aacgcgcata cggccttgcc cgatgctatg gcttgatccg ggtggctgta ggtaatgcct 2042101 gcccgctcta gcgaggcaag aaagaccgcg tcgtcaccgc tgggccccgc gtgggccgga 2042161 accgccaagc cgatcatcaa cggaatgctg agtagcgttg acacaactct catagacaac 2042221 gattctcccg gaattgcgct tctcttgcgg tgcaaccggt taccgcgtca ttccaatacg 2042281 ttacggctgc gctaacttcc cgtctcaggg tgttcgggtt gcgctggacc tgaaggtcgt 2042341 ctgctgaccg gcgttgtctg ctcgctggct aacagccgat cttgatagcc tccggggcat 2042401 cggatgagtc aagccgttgg gttgacgcgc gtcgctacga gtgtcacgat tacccttgca 2042461 agcacctcgc taggtgaggc gtctgcgcgg atataggcca ctgacctcga acgtcgaaag 2042521 acgcccaggg tcaggacagc tcttcccggc ttaagggttg agcccaagtg gcttccggct 2042581 ggaccggccg gatacgccgt gtggtgccaa agctctgacg agaggggtgc cgagttcggt 2042641 ggtctgctgg gctgtcatcc ctttgtgctg tgcatcggca tccccgtgtg ccccggccgt 2042701 gaggaggtga gagcgaaatg agtcccggcg atagtccgta tccgagatcg acgaccgttt 2042761 cgttccgatc cgaccccggc gccgttttcg cactctgaat cggccttccg gttcgaaatc 2042821 cgttatttcg caagctcgtt gcttcgcggc cttgtgtgag tgacgttcac gggaagtagc 2042881 cacgacagaa gcggtcatag gcctccgggt tcggtcgtct gtcaggagaa gacccatggc 2042941 gtttgttctt gtctgtccag atgcgctggc catcgcggcc ggtcagttgc gccatgttgg 2043001 atcggtgata gccgcgcgga atgcggtcgc ggcaccggca actgccgaat tggccccggc 2043061 ggccgctgac gaagtatcag ctttgactgc aacacaattc aacttccatg ccgccatgta 2043121 ccaagcggtc ggcgcccagg cgatcgccat gaatgaggcg ttcgtcgcga tgttgggcgc 2043181 cagcgcggat tcttacgcgg ctaccgaagc cgccaacatc attgctgtga gctaacgagg 2043241 agatcaacga tgactgccgc acttgacttc gccacgctac cgcccgaaat caactcggcg 2043301 cgtatgtatt ccggcgcggg ctcggccccg atgctggccg cagcgtcagc ctggcacggc 2043361 ttgtccgcag aactgcgcgc cagcgcactg tcatacagct cggtgctttc gacgctgacc 2043421 ggtgaagaat ggcacggtcc ggcgtcggca tcgatgacag ccgcggccgc cccctacgtg 2043481 gcctggatga gcgtcaccgc cgtccgggcc gagcaggccg gggcacaggc ggaggctgcc 2043541 gctgcagcgt acgaagccgc gttcgcagca acggtgcccc cgccggtcat cgaggccaac 2043601 cgcgcccagc tcatggcgct gatcgccacc aatgtgctag gccaaaacgc ccccgcgatc 2043661 gcggccaccg aggcccagta cgccgaaatg tggtcccagg acgcgatggc catgtacggc 2043721 tacgccggcg cctcggcagc cgctacccag ctgaccccgt tcaccgagcc ggtgcagact 2043781 accaacgcgt ccggcctggc ggcccagtcg gctgcgattg cccacgccac cggcgcctcg 2043841 gctggtgctc agcaaacgac gctgtcgcag ctgatcgccg ccataccgtc tgtactgcaa 2043901 ggactttcgt catcgactgc agccacgtcc gcgtcggggc cgtccggatt gctgggcatt 2043961 ctcgggtctg gatcttcctg gctcgacaaa ctctgggcgt tactggaccc caactccaat 2044021 ttctggaaca cgatagcttc gtccggactg ttcttgccga gtaacacgat tgcgcccttt 2044081 ttgggtctac tcggcggcgt ggcagctgcg gatgcggccg gggatgtgtt gggagaggcc 2044141 accagtggcg ggctcggtgg cgcgctggtg gcgccgcttg gctcagcggg cgggctaggc 2044201 ggcactgtcg cggccggcct gggcaacgcg gccaccgtcg gaaccttgtc ggtgccgccg 2044261 agctggacgg cggccgcacc actagccagc cccttgggct ccgcgttggg aggcacaccg 2044321 atggtggcac cgcccccagc agtggcggcc ggcatgcctg gaatgccttt cggcaccatg 2044381 ggcggtcaag gcttcgggcg tgccgtgccc cagtatggct tccgccccaa cttcgtcgca 2044441 cgaccgcccg ccgccgggtg atcccgtagg gggtgggttc cctggaaagc gccagggtca 2044501 cgatggcgca gccgaatagc cgacagtgct tttctctgcg aataccggag ttggtcgcgc 2044561 gaaatcattt ccgtttagcg cgttcaccag cgcaggcggg ccaggctcaa taagcggaaa 2044621 tttctcgggc gaagcacccg tgcagcagcg caaatagatg ggatcggcag gacgtagaca 2044681 ttgggatatc tggtgaagtt cataagagct tgaccagttg gtgggcagaa ctacgcgagc 2044741 gtgattagca tggcggccat cgaggggacc ggaggtcagg gatgttggat ttcggggcgc 2044801 taccaccgga gattaattcg gggcgaatgt acgcgggtcc gggatccgga ccgttgctgg 2044861 ccgccgcagc ggcctgggat gcgctagccg ccgagttgta ctccgcggcg gcgtcctatg 2044921 gctcgacgat tgagggcctc accgtagcac cgtggatggg tccctcctcg atcacgatgg 2044981 ccgccgcggt cgctccatat gtggcgtgga ttagcgtcac cgccggccag gccgaacagg 2045041 caggggccca ggccaagatc gctgcgggcg tttatgagac ggcatttgcg gcaacggtgc 2045101 cgccaccggt aatcgaggcc aaccgcgctt tgttaatgtc gctggtcgcc acgaacatct 2045161 tcgggcagaa cacaccggcg atcgcggcca ccgaggccca ctacgcggag atgtgggcgc 2045221 aagatgcggc cgcgatgtat ggctatgccg gctcgtcggc cactgcgtcg cagttggcgc 2045281 cgttcagcga gccgccgcaa acgaccaatc cgtcggcaac ggccgctcaa tcagccgtcg 2045341 tcgcccaggc cgccggcgcc gcggccagct ctgacatcac agcgcagctg tcccagttga 2045401 tcagcctgct acccagcacc ttgcaaagcc tggcgacaac agcgaccgcg acgtcggcca 2045461 gcgctggttg ggacaccgtc ctgcaaagca tcaccactat cttggcgaac ctcactgggc 2045521 cgtacagcat catcgggctg ggcgctatac ctggcggctg gtggctgacg ttcggccaga 2045581 tcctcggcct agcccaaaac gccccaggtg tggccgccct actgggcccg aaagccgccg 2045641 ccggcgcgtt gtcgccattg gcgccgctac ggggcgggta tatcgcagat atcacgcctc 2045701 tcggtggtgg ggccacaggg ggcatcgccc gtgcgatcta cgtcgggtcg ctctcggtcc 2045761 cgcagggctg ggccgaggcc gcaccggtga tgagggcggt cgcatcggta ttgccgggca 2045821 ccggcgccgc ccccgccctg gccgccgagg caccaggtgc cttgttcggc gagatggccc 2045881 tgtcgagtct ggccggacgc gcgctggcag gaaccgcggt gcgctctggt gccggagctg 2045941 ctcgcgtcgc aggcggttcc gtcaccgaag acgtcgccag cacgaccacc atcatcgtca 2046001 tacccgcgga ctgacaggac tttcgagatg gcacttgaac tgggtgttag cccccaccgg 2046061 agaggagaga aggacggtgt catcgccact gtggccggtg gctggcggcc agccagttag 2046121 cggccggttg aggaaaggtg tggcaatgga tttcggattg cagccaccgg agatcacctc 2046181 cggggagatg tacctaggtc cgggcgccgg tccgatgttg gctgcggcag tggcctggga 2046241 tgggttggcg gccgaattgc agtccatggc ggcctcctac gcctcgatcg tcgagggcat 2046301 ggcgagtgag tcatggttgg gtccgtcgtc ggccggtatg gccgctgcgg ccgcaccata 2046361 tgtgacctgg atgtcgggta cctcggcaca ggccaaggcg gccgctgacc aggccagagc 2046421 cgcggtggtc gcctacgaaa ccgcgttcgc ggcggtggtg ccaccgccgc agattgcggc 2046481 caaccgcagc cagctcatat cgctggtggc gaccaacatt ttcggacaaa acaccgccgc 2046541 gatcgcagcc accgaagccg aatacggcga aatgtgggcc caggacacca tggcgatgtt 2046601 cggctatgcc agctcctcgg cgaccgcctc gcggctgacc ccgttcactg caccgccgca 2046661 gaccaccaac ccgtccggac ttgccggcca ggcggccgca acggggtaag cgaccgccct 2046721 agcgagcggc accaatgcgg tgacaaccgc gctttcgagt gcagcggcgc agtttccgtt 2046781 cgacatcatc ccgaccctgc tgcagggcct ggccacactc agcacccaat acacccaact 2046841 catgggccaa ctcattaacg ccatcttcgg gccgacgggc gcaacgacct atcagaactt 2046901 gtttgtcacc gcagccaacg tcaccaagtt cagcacgtgg gccaacgacg ccatgagcgc 2046961 gcccaacctg ggaatgacgg agttcaaggt gttctggcaa cccccgccgg cgcccgagat 2047021 ccccaaatcg tcgttgggtg ccggacttgg cctgcggtca gggcttagcg cgggcctggc 2047081 ccacgccgca tcggcgggtc tgggtcaggc gaacctggtg ggagacctgt cggtaccgcc 2047141 cagttgggcc tcagctaccc cggcggtcag gctagttgcc aacacattgc cggccaccag 2047201 cctggctgcg gcccccgcga cacagatccc agcaaacctg ctcggtcaga tggctctggg 2047261 gagcatgacc ggaggtgccc tcggtgccgc cgcccccgcc atctacacgg gcagtggcgc 2047321 ccgggcccgc gccaatgggg gaacgcccag cgctgagccg gtcaagctgg aggctgtcat 2047381 cgcgcagcta caaaagcaac cggacgcagt gcgacactgg aatgtcgata aggccgatct 2047441 tgatggcctg ctggatcgat tgtcgaaaca gcccggcatc cacgcggtac acgtgtcgaa 2047501 cggcgacaaa cccaaggttg ccttgcccga tactcagttg ggttcacact caacgtgatt 2047561 cgaaatccac actgatactg gaggtgatta ccggctgaag caaagcgcat tggaaatcca 2047621 ggcttagacc attgccatgt ggccgtgaga ttcgtcacgt cttgacatcc gcgtccggcg 2047681 ggtcaccttc gaccgcggtc aatgtcattg gtaggtaagg gctttgctgt actgatggcc 2047741 gaattttgac tcgaaaagta tgtcgggccc tcgcagcaga tctgccgcag gacgcgatgc 2047801 aattacaacg cacgatggga caatgcagac ctatgagaat gctagtagcg ctcctgctga 2047861 gcgccgccac catgatcggc ctagccgcac ccgggaaagc cgatccaaca ggcgacgatg 2047921 ccgccttcct tgccgcgttg gaccaggccg gcatcaccta cgctgaccca ggccacgcca 2047981 taacggccgc caaggcgatg tgtgggctgt gtgctaacgg cgtaacaggt ctacagctgg 2048041 tcgcggacct gcgggactac aatcccgggc tgaccatgga cagcgcggcc aagttcgctg 2048101 ccatcgcatc aggcgcgtac tgccccgaac acctggaaca tcacccgagt tagcggggcg 2048161 catttcctga tcaccgcggt ggtgcgcggt ggtgtggtgc gtccgagggg gttgcgatgc 2048221 acccggttcg cctaggctca aactgctgtt aacctgcgcg tggttggctg ccgtggccgt 2048281 cttgcgatcg ggaaggactc ggcgtcatgc aaacgctgac tgtcgccgat ttcgctctcc 2048341 ggctggccgt cggagtgggt tgcggggcca ttatcgggct cgagcgccag tggcgggcgc 2048401 ggatggctgg gttgcgcacc aacgctctgg tggcgaccgg tgctaccttg ttcgtgctgt 2048461 acgcggtcgc caccgaggac agcagcccca cccgagtggc gtcctacgtg gtttctggaa 2048521 ttggattcct gggcggcggg gtcatcctgc gggaggggtt caacgtccgc ggtctgaaca 2048581 cggctgccac gctttggtgc tcggccgcgg tcggagtgct ggccgcctcc gggcatctgg 2048641 tgttcaccct gattggcacc ggaaccatcg tcgctgtcca tctcctgggg cgcccacttg 2048701 gccggctggt cgaccgcgac aacgccgtcg aagacgaagg gctgcagccc taccaggtac 2048761 gggtgatttg tcggcccaaa gcagagacct atgtacgtgc ccatatcgtg cagcgcacca 2048821 gcagcaacga catcacgctg cggggtatac gcacggggcc ggccggagac gacaacatca 2048881 cgttgacggc ccacctattg atggttggcc ataccccggc caagctagag cggttggtgg 2048941 cggaactgtc gctgcagccg ggcgtttacg ctgtgcactg gtatgccggt gagcacgcgc 2049001 aggccgaatg acccacgaca ctaggggcgg ggctgtactc gcggcgcggc cgcagccagc 2049061 aagtctgccc gactgccgtt cagcggcggg tagatccgcc gggtattgat tgactgcttg 2049121 gtggtcttgg ccggtgcgcc ctgcgatacc actttgcgtt cccatccctc agtgtacacc 2049181 gcgcccgccg atcctagatc gagaaccgtg acataccaag ggatccgaag agccagcaac 2049241 ggttggtcga acagatcgtt gatgacgttg cagccggcat agcggcccat cgggcgccca 2049301 tgctgacacg acatgaccga caggtgctcg tcatccatcc gggccgcggc cacatcgcca 2049361 gcagcaaaca tcgcaggcac cccgatcacc cgcaggtagt cgtcgacttg caggcgtccc 2049421 agccgatcac gggctaccgg cagctgctcg gtcaggcggc tggcccgcat gccggcgcac 2049481 cacaccacgg tggccgctgc cagccgttcc cccgatgaca gcgttacacc gcccgggctg 2049541 acggcggcaa cgctcacgcc ggttctggtc tcgacgccgt tgtccaacag cgcctgttcg 2049601 atcaccggcc gcgccgataa acccatatcg gagccgacga aggggttgtg gtcgatgagt 2049661 accacgcggg gggtgacacc atcaccacgg gcgaacaacg cgtgcagtcg gcccggcaac 2049721 tcgcaggccg tctcgatacc ggtcagcccg gcaccgacga ccacgacggt tgccgccgcc 2049781 gatgtcagcg gcccgccggc cagtccttgc agatgctgct gtagcctgac cgcgccgtcg 2049841 tacgtgtcga catcaaaacc gaactctgcc agtcctggca acgcgggttt gaccacgtga 2049901 ctgcccgacg cgaggaccag tcggtcatag ctatatgagg caccggtcga cgtggtgacg 2049961 cggcggccgt cggcgtcgat cgcggtcacc tcggcggtga catgcgcaac gccggcaggg 2050021 ccgagcacgt cgccgagcgg gatgcggcag gcgctcagat cagcctcata gttgcgaacc 2050081 cggatatcat gaaacggttt gttgctcacc accatgacgt cgaccgtgcc cgctgggacg 2050141 gcgagctcgt cgagtcgtcg ggccgcaccg agcgccgccc acaggcccgc gaacccggag 2050201 ccgatcacca ccacccgggt caacggctaa acacctgacg actctggggt atcgccgccg 2050261 ccgcgtggcg accgggcagg aacatccaca cgtgccaacc tccttcgagc ccgggccatc 2050321 cgataacccc gttagccgtc gcgagcttac agaaggtgca ggcatcggga ttgagtgcat 2050381 catgggatac cggtgaatac cgtcagccgg ggcagccagg gtaggggaca ccccccgctc 2050441 gggctgccag cggagtatcg agcggatcgc catcggcgta gcagataccg ggtcagagca 2050501 gcgtacgctg gcacattcgg cttcggctcg ctggttagcg attgttagtt gcacgcccag 2050561 ttgacgatcc gcccgccttc gagtcggttc acggcgtcgt cttctgccgc gcggcgcgtg 2050621 agtccggttc cgccttggta tttcgagccg ttgtaggcga ccgcgccgca cctggtgaag 2050681 cgactaacca ctttgcaagt cttgtcaccg cacttttcta gtgcgacttg ctctgctcgc 2050741 gccggtgtgc gctggtgcca cgctttgccc gacgcgccgc tgggggcata ggcaatcgcc 2050801 ccgtaatgga taatcggagg gataggcaac ccggcaattt ccgacatcat gacttccgac 2050861 atcgaaccgt tggcgagatg ggcgtccacc gtcggaacca gcaggatgcc cagcccgaga 2050921 gcagccccta ggccggcggc tgccatcgcg gttcggcgtc ggaggtttgt gatcatgtcc 2050981 tgcccccttt ctgcggtcgg taatccagcg gtttgaaagg gttgagccga cttacgcgca 2051041 gtggatgcgt cgaagggtca atgaggctgg gtactgagac ggccacggtt ggaagcccgg 2051101 cgccctggcc gatgatcgat caggtcatcg ctgtatggag gctgcccacc cacggtgctc 2051161 ggttcggtcc gggattctgg cgcttgtgtg tcatgtgccc aagtgtgcga taaatatacc 2051221 tgacccgggt agggcataaa gtctctaaca gcaccgaccg gatagggaac aacggccttc 2051281 gggcaagcgg cttcactgtc aagtcgtcac ctgtcacgca tgcgagtcgt agcctgtctg 2051341 atgtggatgc cgtcgccgga ttcttctcag cgctgcccga ggaaatgcgg gacccggtac 2051401 tgttcgccat tccatgtttt ctattgctgc tgattctcga atggacggcg gcccgcaagc 2051461 tggaaagcat cgagaccgct gctaccgggc agccacggcc cgcctcgggc gcttacctca 2051521 cccgcgactc ggtggccagc atctcgatgg ggctggtttc gatagccacc accgccggct 2051581 ggaagtccct tgccctgctc ggttatgccg caatctatgc ctaccttgcc ccctggcagc 2051641 tgtccgccca ccggtggtac acctgggtga tcgcgatcgt tggtgtcgat ctgctgtact 2051701 actcctatca ccgcatcgcc caccgagttc ggctgatctg ggctacccac caggcgcatc 2051761 actccagcga atacttcaac ttcgccaccg cgctgcgcca gaagtggaac aacagcggcg 2051821 agattctcat gtgggttccg ctgccactga tggggcttcc cccttggatg gtgttctgca 2051881 gttggtcgct gaacttgatc taccagttct gggtgcacac cgagcggatc gacaggctgc 2051941 cgcggtggtt cgaattcgtc ttcaataccc cgtcgcacca ccgggtccac cacggaatgg 2052001 acccggtgta tctggacaag aactatggcg gcatcctcat catctgggac cgcctgttcg 2052061 gtagctttca gccggagcta ttccgaccgc attatggcct gaccaagcgg gtcgacacgt 2052121 tcaacatctg gaagctgcag acccgcgagt acgtggcgat cgtgcgtgac tggcggtcgg 2052181 caacacgtct gcgggatcgg ctgggctacg tcttcggacc gccgggctgg gaaccgcgca 2052241 ccatcgataa atccaatgcc gccgcctccc tggtcacgtc tcggtaacgt cgcgacccga 2052301 cattgcgaaa gtattaccgt cgggttttgg tacgccttag ccgtaaccgg cggcgggcga 2052361 tgcgcttggc cccgacggat gggagttcaa ggtggtccgc ctggtaccac gcgcattcgc 2052421 agcgacggtc gccctattgg cggccgggtt ttcgccggcg accgccagtg ccgatccggt 2052481 cttggtgttc cccggcatgg aaatccgtca ggacaaccac gtctgcaccc tgggctacgt 2052541 cgacccagct ctgaaaatcg cgtttaccgc ggggcattgt cggggcgggg gagcggtcac 2052601 cagccgggac tacaaggtta tcggccatct cagggccttc cgggacaaca cacccagcgg 2052661 ctccaccgtg gccacgcacg agttgatcgc cgactacgag gcgattgtgc tggctgacga 2052721 cgtcacggca agcaacattt tgccgagcgg gcgtgcactg gaatccagac cgggtgtggt 2052781 tcttcacccg ggccaagcgg tctgccattt cggcgtcagc acaggcgaaa cctgtgggac 2052841 cgtcgaaagc gtcaacaacg gctggttcac catgtcccac ggcgtgctca gtgagaaggg 2052901 ggattcgggg ggcccggtct acctggcccc cgatggcggc cccgcgcaga tcgtcgggat 2052961 cttcaacagc gtctggggcg gctttcccgc ggcggtgtcc tggcggtcga cgtccgagca 2053021 ggttcacgcg gatctcggcg tgacgcccct tgcttagcaa gcaccccgtt agcggccacc 2053081 aggttgatcg ccgtgtgttt gctagagcgg tgatctcggt tgtgtcagac ttgccgcgtg 2053141 ggcaagcgcc gggatgcgag ggaacagatc gaggcgaaaa ttgtcgaact cggccgtcgc 2053201 cagctgctgg atcacggcgc ggccgggttg tcgcttcggg caattgcccg caacctgggc 2053261 atggtgtcct cggccgtata ccgctatgtg tccagtcgtg atgagctgtt gactttgctg 2053321 ctcgtcgacg cctactccga cctggccgat accgtggacc gagcccgcga cgacaccgtc 2053381 gccgactcgt ggagtgacga cgtcatcgca atcgctcgag cggtgcgcgg ttgggcagtc 2053441 actaaccccg cccgctgggc cttgctatac ggtagcccgg ttcctggtta tcacgcgccg 2053501 cctgaccgta ccgcgggcgt cgccacccgc gtggtcggag cgttcttcga cgcgatcgcc 2053561 gcgggaatcg ccaccggaga catcaggtta accgatgacg ttgcgccgca gccgatgtca 2053621 tcggacttcg aaaagatccg gcaggagttc ggctttcccg gcgacgatcg tgtcgtcaca 2053681 aagtgctttc tgctctgggc gggcgtggtg ggcgcgatca gcctggaggt attcggtcag 2053741 tacggggccg acatgctaac cgatccagga gtggttttcg atgcccagac acggctgctg 2053801 gtggccgtgc tggccgagca ttgaagctgc tgcaatcggc gtgtccagcc ggaattagaa 2053861 cgtgttcact caaggctacc agtgctgaca cttgcggtgg tggcaaatgc aatctgagcc 2053921 ctttctggcc tctggcaagc tgggctgtcc tgcgagacgc tcatccttct cgttctgtcg 2053981 ctgatacaga tcgcaggggt tacccccgga cctagaagcc gccgaaacgg ctctcaccgg 2054041 cttgttaggc gtccggaagc ggattcggat gcgcgatgtc cgctttgcgc acgacacctg 2054101 tagcagtctg ggcaagcccg cgatgtcgtc gcgagtatct cgttgagcta tctcggagag 2054161 atgcccttcg agttagtatc gtcggttcgt gtagagaata tctatagtga cttttgcggg 2054221 actgtgggcc gggtctacac caggggctcg aagccgcatt ggccgaagca agcggaggtg 2054281 caagtgccga catgagcggc gccaatgagc cgcgccggcg acgatgcagt gggggtaccg 2054341 cccgcttgcg ggggacgaag cgatgacgag gagcggcgcc aatgagccgc gccggcgacg 2054401 atgcagtggg ggtaccgccc gcttgcgggg gacgaagcga tgacgaggag cggcgccaat 2054461 gagcaccgac atacccgcca ccgttagtgc ggagaccgtg acgtcctggt cggatgacgt 2054521 cgatgtaacg gtgattggtt tcggcatcgc cggcggttgc gcggcggtca gcgcggccgc 2054581 cgccggcgcc cgggtactgg tgctcgaacg tgccgccgcg gcgggcggca ccaccgcgct 2054641 tgccgggggg cacttctacc tggggggcgg aaccacggtg cagctggcga ccggtcatcc 2054701 cgattcaccc gaggagatgt acaagtacct ggtcgcggtc tcccgagagc ccgatcacga 2054761 caagattcgc gcctattgcg acggcagcgt cgagcatttc aactggttgg agggcctggg 2054821 ttttcagttc gagcgtagtt actttcccgg caaggctgtg attcaaccca acaccgaggg 2054881 cttgatgttc accggaaatg agaaggtgtg gccattcctg gagttggcgg tgccggcacc 2054941 gcgcgggcac aaggtacccg tgccgggcga caccggcggt gccgccatgg tgatcgacct 2055001 gctgctcaag cgagccgcaa gcctggggat acagatccgc tacgagacgg gcgccaccga 2055061 gctcatcgtg gacgggaccg gcaaggtaac cggggtgatg tggaagcggt tctccgaaac 2055121 cggtgcaatc aaagcgaagt cggtaatcat cgcggccggc ggattcgtga tgaacccgga 2055181 catggtggcc aaatacactc cgaaactggc cgagaagccg ttcgtgctgg gcaacaccta 2055241 cgacgacggg ttgggcatcc ggctgggtgt atcagccggc ggcgccaccc aacacatgga 2055301 ccagatgttc atcacggctc cgccgtaccc gccgtcgatc ttgctcaccg gcatcatcgt 2055361 caacaaactc ggacagcggt tcgtcgccga ggactcctac cattccagga ccgctgggtt 2055421 catcatggaa cagccagaca gcgcggcgta tttgatcgtc gacgaagccc acctggagca 2055481 ccccaagatg ccgctagtcc cgttgatcga cggctgggaa acggttgtgg aaatggaagc 2055541 cgcgcttggc attccaccgg gcaacctggc ggcgacgctg gaccgctaca acgcctacgc 2055601 cgcgcgcggc gcagatcccg atttccacaa gcagccggaa ttccttgcag cacaagacaa 2055661 cgggccgtgg ggggcgttcg acatgtcgct gggcaaggcg atgtatgccg gattcactct 2055721 gggcgggctg gccacgtcgg tggacggtca agtactgcgc gacgacggcg cggtggtggc 2055781 cggcctgtac gcggtcgggg catgcgcgtc caatatcgcc caggacggca agggatatgc 2055841 cagcgggacc cagctgggtg aggggtcgtt tttcgggcgt cgcgccggag cgcatgcggc 2055901 agcccgagcg cagggcatgt aagcctcctc gcgccgcgac tgggaatcct gcgacgcgac 2055961 acgccgacaa ggcgtcgtga gattcacagt cgcagcgcgg cttcaggtaa gacgccggga 2056021 gcgcggtagc cggcctcccg gctacggtaa cccgttcatc ccgttcttac ccaacagccc 2056081 gccggcaccg ccggtgcccg cgctgccgtt aggtgtgcca ctcccggcgt tgccgccgtt 2056141 gccgccgttg ccgaccagga tggcaccgcc gccagcgccg ccgtcaccgc ccttggcacc 2056201 ggtgccgttt cctccggcgc cgccgtcacc gccgtcgccg atcagcccgg ctttgccgcc 2056261 gagcccaccg gcgcccccgg caccgccgaa gccgaatccg ccggcgccgc cggcgccgcc 2056321 ggcaccaaac agcaggcccg cagtgccgcc gtttccgccg gcgccgccca ccccggtagc 2056381 gccaccgccg agtgcgccgg cgccgccggc cccgccggcg cctaccagca ggccggcgtt 2056441 gccgcccgcc ccgccggcac cgccggtagt ggacccgacc ccacccgcgc cgccggcacc 2056501 gccgtcgccc cagagcaggg cggacccgcc ggaccccccg gcaccgccgt tcccgaccaa 2056561 tccgattccg ccggcgccgc cggccccacc gacgccgaac agcccaccgg ccccgccggc 2056621 accaccgggc ccgccggggg cggtgcccag gaatgccaca ccgtcaccgc caacaccgcc 2056681 caccccgccg gcgccgaaca ggagcccgcc attgccgccg gccccgccgg caccgccggt 2056741 gacattagtg ccggtgccgc cggccccgcc ggcaccgccc acgccgaaga acaacccgcc 2056801 gtctccgccg gccccgccgt caccggcgtc ggccgcgagt ccgccgacgc tgccggcccc 2056861 gccggcgccg aacagcagcc cgccattgcc gccggccccg ccggccccac caataccgcc 2056921 caccccacca ccggcgcgtc cgccggcgcc gccggccccg ccggcgccgt agagcagccc 2056981 gccggccccg ccggccccgc cgaaccctgc ggtgccggac gctacgttcc ccccggcgcc 2057041 gccggccccg ccgttgccga acaggccagc ggctccgccg ttgcccccgg gcatgccggc 2057101 cgcgccggag ccgccggccc cgccgttgcc gatcaagatt ccgccgtcgc cgccgtttgc 2057161 cccggtcccc ggggccccgt tggctccgtt accgatcagt gggcgcccca acagcgccag 2057221 ggcgggggcg ttgatcacgt cgagcacacc ctctagcggg gccgcgctgg cggcctcggc 2057281 ggccgcatac gagcccgccc cggcggtgag cgcccgcacg aactgctcgt gaaacagcgc 2057341 cgcctgggcg ctcagcgcct gataggcctg ggcgtgtccg gagaacaatg ccgccatcgc 2057401 cgccgacacc tcatcggcgg cggcggccaa caccgtcgtg gtcgggaccg cggcggccgc 2057461 gttggcggtg ccgatcgtcg acccgatacc cgccaaatcg gtcgccaccg ccgctagcgc 2057521 ctccgggatc gtgaccacaa atgacatctg gcgcctcgtc aacaccctgt ggccccggcg 2057581 cggggccgct accgatcgcc tggtcactcc ccagagatcg acggattcag cgtatcgcga 2057641 tcacggaagc ggccacgccg atttgggaag ctcgtcccgg cttacacttc ggcgggcgcc 2057701 gcctcgactg gggccagccg ccattggccg ccaccgagta gttcgagctg gttttcgtgc 2057761 agccgctcga gggcggggcg atggctgacg ctgaccacga tgcagtccgg cagctcgctg 2057821 cgcagcaatt ggtagagcgc aaactccagc ccggtgtcca gcgccgaggt actttcgtcg 2057881 aggaagaccg ccttgggttt ggtgagcagg atgcgagcaa aggcaacacg ttgctgctca 2057941 ccgggggaga gcaccttggc ccagtcgcgt tcctcgtcca gccggtcaca cagtggggcc 2058001 agcgccacct tggtcagcgt gtcccgcagg gtggcgtcgg ggatggcggc cgcagagttg 2058061 gggtagcaca ccacgtcacg cagcgtcccc agcggcacat acggcaactg cgacaagaac 2058121 atcgtctcgt tctcgccgcc cggccggtgc agggtccccg atgcgtaggg ccacagttcc 2058181 gccagactgc gcagcagcgt ggtcttgccg gccccagaac gcccggtgat caccagcgag 2058241 cctccgcggt ccagccgcac atcgagcggg tcgatcaacc gatcgccggc aggcgtacgc 2058301 acctcgatgt cgttgagctc gacggactcg tcgtcgctcg gtcgggtcag gaccgcgggc 2058361 agggcgcggc ctttctcgtt ggcgtcgacc agcccatgca atcggatgat tgctgcgcgg 2058421 aaggacgcaa acgcgtcgta gttgttgcgg aagaacgaca acgagtcgtg aatgttgccg 2058481 aaggaagtcg ccgtctgccc gacatcgccg aagtcgatct gcccggcgaa taatcgaggc 2058541 gcctggatga cccacggcaa cggaacaatt gtctggctca ccgacagatt ccatccattg 2058601 aatgcgatgc tgcgccgaac gtagcgacgg taattgtcga tcaccggcgt gaaccgccgc 2058661 tgtagctggg taccttccac ccgctcgccg cggtagaaac ccaccgcctc ggcggcgtcg 2058721 cgtagccgaa ccagcgcgta acggaaagcg gcattgagct tttcattgcg gaagctgagc 2058781 cagatcaggg gccgcccgat gatgaacgag atgaccgtgg ccacgaacac atagaccagc 2058841 acggtccaga acattgcgcg cgggatggac acgccgaaga tattcagggt gcccgagaga 2058901 ttccacagga tcgctgtgaa agaaatcacc gaaatgatcg actgcacggc cccgaaaagc 2058961 agcgtgctgg ccgtcccgtt ggagggagca ttcggagtgc cgcctgcccc ggcggtgaag 2059021 atatcgacgt cttgctgaat gcgctggtcg gggttgtcga tcgtttcgtc gatgaacagg 2059081 tctcggtagt aggccctgcc gtcgagccag tcttgtgtga ggtggtgggt tagccagacc 2059141 ctccaggcga tgatgaagcg ctgcgtcaag tagatgtcgg ccatgacccg ggtcacgtgc 2059201 agcacggcca tcacgctgaa aaccccgatc gacatccaaa atcctcgcgc gcctgagcgt 2059261 ttgaccgtgc catcgccaga ggcgatgccc tcgaaggcct tctgcaaggc cgtgtacatg 2059321 tcgttgcctt ggtagctgaa tagcacattc aggcgcactg ccagcactac cgaaagcaac 2059381 aacacgccga gcatcagcca cacgcgaacg ctgttggggc caacgaagta tgcgcgggtg 2059441 atccgccaga actgccggcc ccagggcgtc aaatacctga gcagaaccaa tatcgcgagc 2059501 acacagatgg cactgatcgt ccaggctttg ccgacccaat acacggaatc cgggaatgct 2059561 ctagaccaat cgatggacgg cttaaacaat ttcgggccca aggtcgacgt ctcctcacaa 2059621 acagaaatcc ttcgggcgaa ggtacccgaa ggttgtcgat aggctgccga tatgagcacc 2059681 gacaccgccc cggcccagac catgcatgct ggccggctta tcgcgcgccg acttaaagcc 2059741 agtggtatcg acacggtctt cacgttgtcg ggcggccacc tgttttccat ctacgacggc 2059801 tgccgtgagg agggcatccg cctgatcgac acccgccacg aacaaaccgc cgcctttgcc 2059861 gccgaaggct ggtcgaaggt gaccagggtg ccgggcgtgg ccgcgctcac cgcggggcca 2059921 gggatcacca acgggatgag cgcgatggcg gcggcccagc agaaccagtc accactggtg 2059981 gtgctcggcg gccgggcgcc ggcgctgcgc tggggtatgg gctccctgca ggagatcgat 2060041 cacgtgccgt ttgtggcgcc ggtggcccgc ttcgccgcta cagcgcagtc agccgagaac 2060101 gcgggcctgc tggtcgatca ggcgttgcag gcggcggtga gtgcgccgtc gggtgtggca 2060161 ttcgtcgact tcccgatgga tcacgcgttc tccatgtcct cagacaatgg ccgccccggc 2060221 gcgctcaccg agctaccggc cggtcccacc ccagccggcg acgccctgga ccgggcggcg 2060281 ggcctgcttt cgacggccca gcgtccggtc atcatggcag gtaccaacgt ctggtggggc 2060341 catgcggagg cggcattgct gcgtcttgtc gaggaacggc acattccggt gctgatgaac 2060401 gggatggcgc gcggcgtggt gcccgccgat caccggttgg ccttctcacg ggcgcggtca 2060461 aaagcgctgg gggaggctga tgtcgcgctg atcgtcggtg tgccgatgga tttccgtctg 2060521 ggcttcggtg gggtattcgg gtcgacaacg cagctcatcg tggcagaccg cgtcgaaccc 2060581 gcacgcgaac atccgcgacc agtcgcggcg gggctctatg gggatctgac cgccaccctt 2060641 tcggcgctgg ccggatctgg cggcaccgac caccagggct ggatcgagga gctcgcgacg 2060701 gccgagacca tggcgcgtga tctcgagaag gccgagctgg tcgatgaccg gatcccattg 2060761 catccgatgc gggtgtacgc cgagctggcc gcgctgctgg agcgggatgc tctagtcgtt 2060821 atcgatgcgg gcgatttcgg gtcgtacgcc ggccggatga tcgacagcta tctgccaggc 2060881 tgttggctgg acagcggtcc gtttggctgc ctggggtcgg gtcccggcta cgccctggct 2060941 gccaaactgg cgcggccgca gcgccaggtc gtgctcttgc agggcgacgg cgcgttcggg 2061001 ttcagcggca tggaatggga cacgctggtt cggcacaacg tggcggtcgt gtcagtgatc 2061061 ggcaacaacg gcatctgggg tttggagaag cacccgatgg aagcgttgta cggctattcg 2061121 gtggtggccg aactgcgccc gggaacccgc tacgacgagg tggtgcgcgc actgggcggc 2061181 cacggcgagc tggtgtcggt gcccgctgaa cttcggccgg cgctggaacg ggcctttgcc 2061241 agtggcctgc ccgctgtggt caacgtgctc accgacccaa gcgtggctta tccacgccga 2061301 tccaacctgg cttgacgtcc agccgggccg tgaacgtgca cggttgtcca cgaattgcgg 2061361 cctgtcggtg tacagacacg caccctcgcg gccggccggc attcgcgtac cgttggtttg 2061421 tgcccaagac cacccgcgct caacccggcc ggctgagcag ccgattctgg cgattgctcg 2061481 gcgccagcac cgaaaagaac cggagccgct ccctggcgga tgtaaccgct tcggcagaat 2061541 acgacaagga agctgccgat ctgtccgacg agaagctgcg taaggcggca ggcctgctca 2061601 acctcgacga cctcgcggag tccgccgata tcccgcagtt tctcgcgatt gcccgggaag 2061661 ccgccgagcg gaggaccggg ctgcgaccat ttgatgtgca gttgcttggc gcgttgcgca 2061721 tgctcgccgg agacgtgatc gagatggcca ccggtgaggg caaaaccctt gccggggcga 2061781 tcgcggccgc cggttatgcg ctggccggcc ggcacgtgca cgtcgtgacg attaacgatt 2061841 acctggcccg ccgcgatgcg gagtggatgg gcccgctgct ggacgcgatg ggcctgacgg 2061901 tcggctggat caccgcggac tcgacccctg acgagcgccg gaccgcatat gaccgtgatg 2061961 tcacctatgc ctcggtcaac gagattggct tcgatgtact gcgcgatcag ttggtgactg 2062021 atgtcaatga cctggtatcg cccaatccag acgtggctct catcgacgaa gccgactccg 2062081 tgctggtcga cgaggcgctg gtgcccctgg tgctggccgg aaccacacat cgtgagacgc 2062141 cgcggctgga gatcatccgg ctggtcgctg agcttgttgg cgacaaggac gccgacgagt 2062201 actttgccac cgattccgat aaccgcaatg tccacttgac cgagcacggg gcacgcaaag 2062261 tcgagaaagc gctcggtggc atcgacctgt actccgagga gcacgtcggc accacactga 2062321 ctgaggtcaa tgtcgcgctg cacgcgcatg tgctcctgca acgcgacgtg cactacatcg 2062381 tccgcgacga cgcggtgcac ctgatcaacg cgtcgcgtgg ccgtatcgcg caactgcagc 2062441 gctggccgga cgggttgcaa gctgcggtcg aggccaagga aggtatcgag accacggaaa 2062501 ctggggaagt gctcgacacc atcacggtgc aggccctgat caaccggtat gcgactgtgt 2062561 gcggaatgac gggaaccgcg ctggccgccg gtgagcagct acggcagttc taccagctcg 2062621 gtgtctcacc gataccaccg aacaagccaa acatccgcga ggacgaggcc gaccgggtct 2062681 acatcaccac tgcagccaag aacgacggga tcgtcgagca catcaccgag gtgcaccaga 2062741 gggggcagcc tgtgctggtc ggtacccgcg acgtggccga atccgaggaa ctgcacgaac 2062801 gcctggtgcg ccgcggtgtg cccgccgtgg tgctcaacgc gaagaacgac gccgaggagg 2062861 cccgggtcat cgccgaggcc ggcaaatacg gcgcggtcac ggtgtcaact caaatggccg 2062921 ggcgcggcac cgacatcagg ctcggcgggt ccgacgaagc tgaccacgac agggtcgcgg 2062981 aattgggcgg cctgcacgtg gtcggcactg gccgtcacca caccgagcgg ctagacaacc 2063041 agctgcgcgg tcgggccggg cgccagggag atcccgggtc gtcggtgttt ttctcaagct 2063101 gggaagacga tgtcgttgcg gccaacctcg accacaacaa gctgccgatg gcaaccgacg 2063161 aaaatggccg gattgtcagc ccgaggacgg gtagtctgct cgaccatgcc cagcgcgttg 2063221 ccgagggccg gttattggat gtgcacgcca acacgtggcg ctacaaccag ctgatcgccc 2063281 agcagcgcgc catcatcgtc gaacggcgta acacgttgtt gcgcaccgta accgcgcgtg 2063341 aggaactcgc cgaactggcg cctaagcggt acgaggagct gtccgacaaa gtatccgagg 2063401 aacgcctcga gacgatttgt cggcagatca tgctgtatca cctcgaccgt ggctgggccg 2063461 atcacctggc gtatctggcc gacatccggg agagcatcca tctacgcgcg ctgggccggc 2063521 agaacccact cgacgagttt caccggatgg ctgtggacgc gttcgcgtcg ctggccgccg 2063581 acgccatcga ggcggctcaa cagacgttcg aaaccgcgaa cgtccttgac cacgagccgg 2063641 ggctggacct gtccaaactg gcccggccga cgtcgacatg gacctacatg gtcaatgaca 2063701 acccactgtc cgatgacacg ctttctgccc tcagtctgcc cggggtgttc cgctgagctg 2063761 cccagcgtaa gcgccgagcg taacgccact gcgaaatttc gggcagaaaa tcgcagtggc 2063821 gttacgctcg cggctagggg tgcccccaca gcccgccgtt tcggcgcgca tcgtcgccag 2063881 gctagatccg attgcccggc tcctcagccc gccgtttcgg cgcgcatcgt cgccaggcta 2063941 aggtcacggc tcatggagcc ggtgctcacg cagaatcggg tgctgactgt ccccaacatg 2064001 ttgagcgtta ttcgcctcgc gctcatccca gcattcgtct acgtcgtgct cagcgcgcac 2064061 gccaatggct ggggggtagc gatcctggtg ttcagtggcg tttcggactg ggctgatggc 2064121 aagattgcac ggctactaaa ccagtcatcg cggctgggcg cgctgctgga cccggccgtt 2064181 gatcgcctct acatggtcac tgttcctatc gtgtttggcc tgagcggcat cgtgccgtgg 2064241 tggtttgtcc ttacgttgct gacccgcgat gcgctgctgg ctgggacgct gccgctgcta 2064301 tggagccgtg gactgtcagc gctaccggtg acctacgtcg gtaaggcagc gactttcggc 2064361 ttcatggttg gctttccgac cattctgttg gggcaatgcg atccattgtg gagccatgtg 2064421 ctgctggcct gtggttgggc attcttgatc tggggtatgt atgcctactt gtgggccttc 2064481 gtgctgtatg cagtgcagat gacgatggtg gtgcggcaga tgcctaagct caagggcagg 2064541 gctcatcggc cggcggccca gaacgctggt gaacgtggct gagtctgacc ggctgctcgg 2064601 cggctacgac cccaacgccg gctacagcgc ccacgcaggg gcgcagccac aacgcatccc 2064661 ggttccgtcg ttgctgcgcg cgctgctatc agagcatctg gatgctggat acgcggcggt 2064721 tgccgccgag cgcgagcgtg ctgcggcacc acggtgttgg caagcccgcg ccgtcagctg 2064781 gatgtggcag gcattggccg cgaccctagt cgccgccgtg ttcgctgccg cggtagcgca 2064841 ggcgcgctcg gtggcacccg gcgtgcgcgc cgcccaacag ttgctcgttg cgagtgtgcg 2064901 atcaacccag gccgccgcga ccacgttggc tcaacggcgc agcacactct cggcgaaagt 2064961 cgacgacgtg cggcggatcg tactcgcaga cgacgccgag ggacagcggc tgctggcccg 2065021 tctcgacgtg cttagcctgg ccgcggccag cgcaccggtt gtcgggcctg gtctgacggt 2065081 gaccgtgacc gatcccggtg cgagccctaa tctttccgac gtgtccaagc agcgggtcag 2065141 cggtagccag caaatcatcc tcgaccgcga tttgcagctc gtcgtcaact cactgtggga 2065201 aagtggcgcc gaggccatct cgatcgatgg cgtccggatc gggccgaacg tcacgatccg 2065261 gcaagccggc ggagcaatct tggtcgacaa taatcccacg agtagtccct acaccatctt 2065321 ggcggtcggg ccgccacatg ccatgcagga cgtcttcgat cgcagcgccg ggctgtaccg 2065381 cctgcggctg ctggagacct cctacggtgt cggcgtcagt gtgaacgtcg gcgacggtct 2065441 ggcattgcct gccggtgcga cccgggatgt caagttcgcc aaacagattg ggccctagtg 2065501 agagaagtcc tggtgaatag gaaaccatgg ggagcgatac ggcctggagt ccggcgcgca 2065561 tgatcgggat cgcggcgctc gccgttggaa tcgtgctggg tttggttttc catcccggcg 2065621 tgccagaggt catccagccg tatctgccga tcgcggtggt cgccgcgctc gacgcggtgt 2065681 tcggtggctt gcgcgcctat ctcgagcgga tctttgaccc gaaggtcttc gtggtttcgt 2065741 tcgtgttcaa cgttttggtg gctgccctaa tcgtctatgt cggtgaccaa ctgggcgtcg 2065801 gcacacagtt gtccaccgcg atcatcgtcg tgctgggcat ccgcatcttc ggcaacaccg 2065861 cggccttgcg gcggcggttg ttcggagcgt gacggagatg agatcaccgt gagtgagaat 2065921 cgcccagaac ccgtggcagc cgagacttcc gccgccacaa ctgcgcgtca ctcccaagcc 2065981 gacgcgggcg ctcacgacgc cgtgcgacgt ggtcgtcacg aactaccagc cgaccatccg 2066041 cgctccaagg tcggaccgct gcggcggaca agattgaccg aaatactgcg gggtggtcgc 2066101 tcgcgtctgg tgttcgggac gcttgcgatc ttgttgtgct tggttctggg ggttgccata 2066161 gtcactcagg tccgtcagac cgactccggt gattcattgg aaacagcccg tcctgcagac 2066221 ctattggtgt tgttggattc gttgcggcaa cgcgaggcca cgttgaacgc cgaagtgatc 2066281 gaccttcaga acacgctgaa cgcgttgcag gcatccggca acaccgatca ggcagcgtta 2066341 gaaagcgccc aggctagatt ggccgcgttg tccatcctgg tcggcgccgt gggtgccacc 2066401 gggccgggcg tcatgataac gatcgacgat ccgggacccg gagtagcgcc tgaggtgatg 2066461 atcgacgtga tcaacgaact gcgtgccgct ggagccgagg cgatccagat caacgatgca 2066521 caccggtcgg tgcgggtcgg ggttgacacc tgggttgtcg gtgtgcccgg ctcactgaca 2066581 gtcgacacca aggtcctgtc cccgccgtat tcgattctgg cgattggtga tcctccaacg 2066641 ctggccgcgg cgatgaacat tcctggtggt gcacaggacg gtgtcaaacg cgtcggcggg 2066701 cggatggttg tgcagcaggc cgaccgtgtg gacgtgaccg ccttgcggca accaaaacag 2066761 caccaatacg ctcagcccgt caagtgaact agcccaactc cgagccgacc agaataggat 2066821 taccgtgagc gatatcccgt ccgatctgca ctacaccgcc gaacacgagt ggattcgccg 2066881 cagtggcgac gacaccgtcc gggtggggat caccgactat gcacagtcgg cgcttggcga 2066941 cgtcgttttc gttcagctac ccgttatcgg caccgcggtc accgccggcg agaccttcgg 2067001 cgaagtggaa tcgacgaaat ctgtgtcgga tctctatgcg cccatttcgg gtaaggtgtc 2067061 tgcggtcaac agcgatctgg acggcactcc gcaattggtg aattccgacc cctacggagc 2067121 cggctggctg ctggacatcc aggtcgacag ctcggatgtc gctgccctgg agtcagcttt 2067181 gacgacactg ctcgacgctg aggcctaccg cggcacactg accgagtgac gattgctaag 2067241 gtccctgcca gcgtcacgtg ggaggtcgcg ggtctgcacg gatccgggcc gggcagggca 2067301 atcgagcctg ggatccgctg gggtgcgcac atcgcggacc cgtgcgcggt acggtcgaga 2067361 cagcggcacg agaaagtagt aagggcgata ataggcggta aagagtagcg ggaagccggc 2067421 cgaacgactc ggtcagacaa cgccacagcg gccagtgagg agcagcgggt gacggacatg 2067481 aacccggata ttgagaagga ccagacctcc gatgaagtca cggtagagac gacctccgtc 2067541 ttccgcgcag acttcctcag cgagctggac gctcctgcgc aagcgggtac ggagagcgcg 2067601 gtctccgggg tggaagggct cccgccgggc tcggcgttgc tggtagtcaa acgaggcccc 2067661 aacgccgggt cccggttcct actcgaccaa gccatcacgt cggctggtcg gcatcccgac 2067721 agcgacatat ttctcgacga cgtgaccgtg agccgtcgcc atgctgaatt ccggttggaa 2067781 aacaacgaat tcaatgtcgt cgatgtcggg agtctcaacg gcacctacgt caaccgcgag 2067841 cccgtggatt cggcggtgct ggcgaacggc gacgaggtcc agatcggcaa gttccggttg 2067901 gtgttcttga ccggacccaa gcaaggcgag gatgacggga gtaccggggg cccgtgagcg 2067961 cacccgatag ccccgcgctg gccgggatgt cgatcggggc ggtcctcgac ctgctacgac 2068021 cggattttcc tgatgtcacc atctccaaga ttcgattctt ggaggctgag ggtctggtga 2068081 cgccccggcg ggcctcatcg gggtatcggc ggttcaccgc atacgactgc gcacggctgc 2068141 gattcattct cactgcccag agggaccatt acctgccgct gaaggtgatc agggcccagc 2068201 tggacgccca gcccgacggt gagttgccac cattcggatc tccttacgtt ctaccgcgat 2068261 tggtgcccgt agccggcgac agtgctggcg gcgtcgggtc ggacaccgcg tccgtgtcgc 2068321 tcacgggtat ccggctcagt cgggaagacc tcctggaacg atcggaagtg gccgacgagc 2068381 tactgacggc cctgctcaaa gccggtgtga tcaccaccgg gccgggcggc ttcttcgacg 2068441 aacacgccgt cgtgatcctg caatgcgcac gagcgctggc cgaatacggc gtcgagccgc 2068501 ggcatctacg cgccttccgc tccgcggccg accggcagtc cgacctgatt gcccagattg 2068561 ccggcccgct cgtcaaggcc ggcaaggccg gtgcccgcga ccgggccgac gacttggccc 2068621 gtgaggtggc cgcgcttgct ataactttgc acacgtcgct gatcaagtct gcggttcgcg 2068681 acgttcttca ccgctgagga ctagacttcg ttcgacagct tggtgttcga cgtcacggta 2068741 gagacgtggc gcccaccgcg tcgtcgcacc gagcgtgagt cggacaccgg ttgcatgtgc 2068801 ggagggcaga cgcagatggg tgaagttcgt gttgtcggca ttcgcgtcga gcagccgcag 2068861 aaccagccgg tgctgttatt gcgcgaggcc aacggtgatc gatacctgcc gatctggatc 2068921 ggccagtcgg aggctgccgc tatcgcgctg gagcagcaag gcgtcgagcc gccacgtccg 2068981 ctgacccatg atctgatcag ggatctcatt gctgcgctgg ggcattcgct caaagaggtg 2069041 cgcattgtag acctgcagga aggaactttc tacgctgatc tgatcttcga ctgcaatatc 2069101 aaggtgtccg cccgtccctc ggactcggtg gcaatcgcat tgcgagtggg tgttccgatc 2069161 tacgtcgagg aggccgtact agcccaggcc ggtctgctga ttcccgacga aagtgacgag 2069221 gaggccacca ccgctgttcg cgaggacgag gtggagaaat tcaaagagtt tctcgacagt 2069281 gtgtcacctg acgatttcaa ggccacctag cgcggcgacg atgcgcgccg ggacggcggg 2069341 ctgaggaggc gcgcgataag gccgagcgcg gcgacgatgc gcgccgcgac ggcgagcatc 2069401 cattatttgc cggccagcaa cgtcacggct gcgtctcatc tctggctgca attgtcgaca 2069461 cgcctagcgg ttagtgccta atgcgcccgg cgaccgcgat actttgatca cgacttgata 2069521 gttaaccggg agcatcgcgc ccatcgaaca gcgtatgctc tctaacactc gggccctcag 2069581 taatggctgt cgggggagcc agtgacgcag ctagtgacaa gagcgcgatc ggcgagagga 2069641 agcaccttgg gcgagcagcc acgtcaagac cagctcgact ttgctgacca cacgggcact 2069701 gctggtgatg gtaacgacgg cgccgctgcg gccagcggac ccgtgcagcc cggcctgttc 2069761 cccgacgatt ccgttcctga cgagttggta ggttatcgcg gaccgagcgc ctgccagatc 2069821 gctgggatca cctaccgcca gctcgactac tgggcgcgca catcgttggt tgtgccgtcg 2069881 atccgtagtg cggcaggatc cggcagccag cggctgtact cgttcaagga catcttggtt 2069941 ctcaagatcg tcaaacggtt gctcgacacc ggtatctcgc tgcacaacat ccgggttgca 2070001 gttgaccatc tgcgccagcg tggcgtccag gatctggcca acatcacctt gttctccgat 2070061 gggaccaccg tgtacgagtg cacgtcggcc gaggaggtcg tcgacctcct gcagggcggc 2070121 cagggtgtgt tcggcatcgc cgtctcgggc gcgatgcggg agctgacggg tgttatcgcc 2070181 gacttccacg gtgagcgcgc cgacggcggg gagtcgattg ctgcccccga agatgaactg 2070241 gcctcccgac gcaagcatcg cgaccgcaag atcggctagc cgagagttcc cccgcgaaca 2070301 gacacagaat cgcacgcggc aggctcctcg gatgcgattg tgtgtctgct cggcagtaga 2070361 ctggacaacg catcgctcta gtgcgggaga gttctgtggc tgccagctac ggacgccgaa 2070421 ggagcaatac ctctccgtca acctctcagg cacccggacc gcgcgagact acgatgcctc 2070481 tggaaagcgg tggcgacccc tggcggtcct cacccgccga tggggaaagg cgattcacct 2070541 gacggtggac agagtcgccg aatctctcag gcgcctggcg tgcaggtgaa gacagaggga 2070601 gagggccgct agtcctctgc tttgtcagga gttcaccgtg tccgaccatt cgacgttcgc 2070661 agaccggcac atcggtctgg acagccaggc cgtcgcgacc atgctcgccg tgatcggggt 2070721 ggattcgctc gatgacctgg cagtcaaggc ggtcccggcg ggcatcctag acacactcac 2070781 cgacaccgga gccgcaccgg gtttggacag tctgccaccg gctgccagcg aagccgaggc 2070841 gctggccgag ctgcgagcgc tggccgacgc taacaccgtc gccgtgtcga tgatcgggca 2070901 aggctactac gacacacaca cccccccggt gctgttgcgc aacatcatcg agaacccggc 2070961 ctggtatacc gcctacacgc cgtaccagcc cgagattagt cagggtcggc tggaagcctt 2071021 gctgaacttc cagaccctgg tcaccgatct gaccggcctc gagatcgcga acgcgtcgat 2071081 gctcgacgag ggcaccgcgg cggccgaggc catgactttg atgcaccgcg cggcccgcgg 2071141 gccggtgaag agggtggtcg tggacgccga cgtgttcacc cagaccgcgg cggtgctggc 2071201 cacccgcgcc aagccgctgg gtatcgagat cgtcacggcc gacctgcgcg ccggtctgcc 2071261 cgacggcgaa tttttcggcg ccatcgccca gctgcccggg gccagcggcc ggatcaccga 2071321 ctggtctgcc ctggtgcaac aggcccacga ccgtggcgca ctggtggccg tcggcgccga 2071381 cttgttggcg ctgacgctga tcgcgccgcc cggagagatc ggcgctgacg tcgcctttgg 2071441 caccacacaa cggttcggag tgccgatggg gtttggcggc ccgcatgccg ggtaccttgc 2071501 ggtgcacgcc aagcatgcgc gtcagctgcc cggccggctg gtcggtgtgt ccgtcgacag 2071561 tgacggcacg ccggcctatc ggttggcgct gcagactcgc gagcaacaca tccgccgcga 2071621 caaggccacc agcaacatct gcaccgcaca agtgctgttg gcggtgcttg ccgcgatgta 2071681 cgcgagctac cacggcgcgg gcgggctgac cgccatcgca cgccgggtgc atgcccacgc 2071741 cgaggctatc gccggtgcac tgggcgatgc gttggtgcac gacaagtact tcgacacggt 2071801 gttggcccgg gtgcccggtc gtgccgacga ggtgctggcc agggccaagg ccaacggcat 2071861 caacctgtgg cgtgtcgacg ccgaccatgt gtcggtagcc tgcgacgaag ccaccactga 2071921 cacccacgtg gcggtcgttc tggacgcgtt cggtgtagcg gccgccgcac ccgcccatgc 2071981 ggacatcgca acgcgcacat cggagttcct gacgcatcca gcgttcacgc aataccgcac 2072041 cgagacgtcg atgatgcggt acttgcgtgc gctggcggat aaggatattg ccctcgaccg 2072101 cagcatgatt ccgctcggct cgtgcacgat gaaactcaac gccgccgccg agatggagtc 2072161 gattacctgg cctgaattcg ggcgtcagca tccatttgcc ccggcatctg ataccgctgg 2072221 gctgcgtcaa cttgttgccg acctacagag ttggctggtg ctgatcaccg gttatgacgc 2072281 ggtgtcgctg caacctaacg cgggctcgca aggcgagtat gcgggcctat tggcgatcca 2072341 cgagtaccac gccagccggg gtgaaccgca tcgcgacatc tgcctgatcc cgtccagcgc 2072401 gcacggcacc aatgccgcgt cagccgcctt ggccggcatg cgcgtggtgg tggtggactg 2072461 ccacgacaac ggcgacgtcg acctcgatga cctgcgcgct aaggtcgggg agcatgccga 2072521 gcggttgtcg gcgctaatga tcacctaccc gtccactcac ggcgtgtacg aacacgacat 2072581 cgccgagatc tgcgctgccg tgcacgacgc gggcggccag gtatacgtcg acggagccaa 2072641 cctcaacgcc ctggtcggcc tggcccggcc gggcaagttc ggcggtgacg tcagtcacct 2072701 caacctacac aagacattct gcattccgca cggcggcggt ggcccaggcg tcggcccggt 2072761 ggcggtgcgg gcgcacctgg caccgtttct gccaggtcac cccttcgccc ccgagctgcc 2072821 caagggctat ccggtgtcgt cggcaccata tgggtcggct tcgattcttc cgatcacctg 2072881 ggcatacatc cggatgatgg gggctgaggg actgcgggcg gcatcgctga cagcgatcac 2072941 gtcggctaac tacattgcgc gccgccttga cgagtattac ccggtgctgt acaccggcga 2073001 gaacggcatg gtcgcccacg agtgcatcct ggacttgcgc ggtatcacta agttgaccgg 2073061 tatcaccgtc gacgatgttg caaaacggct ggcagactat ggttttcacg caccaacgat 2073121 gagttttccg gtggccggta cgctcatggt ggagcccacc gagagcgaga gcctggccga 2073181 agtggacgcc ttctgcgagg ccatgatcgg catccgcgcc gagatcgaca aagtcggggc 2073241 cggggagtgg cctgtcgacg acaatccgct gcgcggcgca ccgcacaccg cgcagtgcct 2073301 gctggcgtct gattgggacc acccgtatac gcgggaacag gccgcctacc cgctcggcac 2073361 cgcattccga cccaaggttt ggcccgcggt acgtcgcatc gacggcgcct acggggatcg 2073421 caacctggtc tgctcatgcc cgccggtaga ggcttttgcc taaacgctcg tcgaccggcc 2073481 cccggtcgag ctcgaggccc gggtgctact gggtgggtag ctgacgtgtc ggctgctatg 2073541 ggtcgttgtc ggggttgcgg agtttttcgg ggtggcggca ggtgttggtg cggggttgac 2073601 cgtggtcgga ggtggggtgg ggagctattc ggtgtcgcca cccgcgctcc aacaatgcca 2073661 gctgttgcgg ggtgctcagc gacaaaggtt cagccgaagc gctcaatgat cgcggcggcg 2073721 atccggtcgg gggcgtcctc ctggatgaag tgtttggcgt tgggcagctc caccaggacg 2073781 tggtcgggaa atgtcgcact cagtctgggg ataatcgttt tcggcctgaa tgcgacatcc 2073841 ttcatccccc aaatcaacag ggtgggcttg gtgcccagcg tggctggcac ctcccgggcg 2073901 agccgtgcca gcaggggacg ggcggccagg atctgtttgg gcatctcggc tacgcctcgg 2073961 cgtgccgcgg cgttgggctg caccgcccgg tagtgcgcca tcaccgcgct actcggccgg 2074021 tgctcggttc ccgcgggtat caagcgctcg acaaagaagt tgcgccgtaa gatcgcgtac 2074081 tgcactggcg ggctggacat caccctgctg aaggccttca tcgccagcgt gtccgccggc 2074141 cagaaccacg tgttgcccaa cacgacgccg cggacccggt cggcacgctc gacagcgacc 2074201 gccatgctga tcgggccacc ccagtcctga cccatgctca ggtagcggtc caggcccagg 2074261 tgatcgacga attcgccgat cacccgcgcg tgctcgtcga tctggtaccc gaatcccgag 2074321 ggacgctccg ataacccgaa acccagataa tccggagcca cacaacggaa acggtcccgc 2074381 agtgcgacga tgatgtcccg atacaggaaa ctccacgtcg ggttgccgtg acacaacagg 2074441 atcggcggac ccgtgccctc gtcgacgtag tggatgcgtc cacgcgagct gtcgaaccag 2074501 cgcgactcga acgggtacag ctgcggatcc ggcgtgaaat cgatgctcat taccctcctc 2074561 cgatcgcgct catgatggta tgcccgaagg gtgacatcac cgagtgtccg ggagtggcgt 2074621 gacggtggcc gctggctgcc gacggctgtc ggaaaggtgt tcgtccggtc ggggccgggc 2074681 gacacgccaa caatgctcct gctgcatggc tatccgtcca gttcgttcga cttccgggcg 2074741 gtgattccac acctgaccgg ccaggcttgg gtaacgatgg attttctggg ctttggcttg 2074801 tccgacaagc cgcgcccgca ccggtacagc ctgctggagc aggcccacct ggtggaaacg 2074861 gtggtcgccc acaccgtgac cggcgcggtc gtcgtgctgg cccacgacat gggcacgtcg 2074921 gtgaccaccg agctgctagc ccgtgatttg gacggccggt tgccgttcga tctccgacgt 2074981 gcggtgctga gcaacggcag tgtgatcttg gagcgggcca gcctgcgtcc gatccagaaa 2075041 gtactgcgca gcccgcttgg tccggtcgct gcccggctgg tcagccgcgg tggcttcaca 2075101 cgagggtttg gccggatctt ctccccagcg cacccgctgt cggcgcagga ggcccaagcc 2075161 cagtgggagt tgctgtgcta caacgacggc aaccggatcc cgcacctgct gatcagctac 2075221 ctcgacgagc ggatacggca cgcgcagcgc tggcatggcg cggtccgcga ttggcccaaa 2075281 ccgcttgggt tcgtgtgggg actcgacgat ccggtggcaa caaccaacgt gctcaatgga 2075341 ctacgggaat tgcgccccag cgccgccgtc gtggaactgc cagggttggg ccactacccg 2075401 caggccgagg ctcccaaagc atatgccgag gccgcgctat cgctgctcgt cgactagccg 2075461 gctacggctg tatcacgggc agatcgatgc gagaggcatg catccggcta cggtagacgc 2075521 gcacggtcgg tgcgcaaccg ggaaggatgg cgaagtggct tgcgtccgcg ccggcgatgg 2075581 cgatgcggat gcggtgcccc ggttggaaca gatacgacgt cggcagcagg tcgaatgtca 2075641 gccgggcaat ctcgcccggg actaggggcc acgcgtcccc gctcgcgaac gttcggtagg 2075701 ggaccacctg gcggtacggc ggcggcccgt cgctgagccg gcggtggatg gcgcgtagct 2075761 ggccctcggt gatgtaggcg acacggccgc gcggatcgac gtcttccaga tagacgaaga 2075821 aggtgccgtc gctcgacgtc gacgtgataa acagcgtgac caccacatga ccggtcacct 2075881 ccaggggatg gtcgagcggt gcggaggtat aggtcagcag cttggcatcc tgggccttgc 2075941 ggtccgggta gcaaacgtgt ccaccgatgc ccacttgcga gcgccagcgt gagcgctcgc 2076001 ccgttccggc cgtctgatcc accacgtatt cgtctgcacc gctgtcgcaa tcgggtgcgt 2076061 ccgggcgcag ctgtcggtct gcggacaggt agtagctctg cgtggtggcg ggcggcggcc 2076121 aggtgtcggc cgacttccag cggttctcga ccatggtgaa gtagtgcacc ggcggctcgg 2076181 agccgatgcc cgtatcggcc cccttgacgt gatggtcgat gaacctcaac agctcgccgt 2076241 cgtgatcgaa gtcgggtctg ctgagcccgc gcagtgggtc gacgcgccag ccgccggtgt 2076301 ggttccatgg accgaggatc aagtggctgc ccggggtgga gacggtcaga aaacgtttga 2076361 ttgcggcatg cgcatacccg ccgtcgaacc agccgctgta gctgtagatg gccgctcccg 2076421 acgcctgcac gtcacgccaa taattgtgcg ggctgatcag gttgatgctg cccgactcga 2076481 tcggtgtacc gatcggctcg agccgggcgt caggttggcc acgataggga tccgaggcgg 2076541 atacgtcgtc ccggaacgtc aatgaccccg cgatctggtg aacgtcgtag ttgccgcgat 2076601 gcgcggcgat ggccccgtcc cgcagcgagc gatcacggtc ctcctgcacc ggctgcatgc 2076661 cggtcaccgg gagcttcgcc caccacccga ccacttcgtg cagggcgttg cggtcgagcg 2076721 cctcgttgta gcgtccccag gtgtcggtga accaggcggc gtggatgccg ccggggaacg 2076781 cgatgtcggt gtagacgtcg aacagcgaga agcacggggc gatcacccgc accgcgggat 2076841 gctggttgac cagcagtaac tcggccgacg tgccgtcgta cgaatttccc agcgcagcga 2076901 ccgttccgtt gcaccaaggc tggcgcacga tccagtcgac gatctcggcg ccgtcccgga 2076961 tctcgtcgga ggaccattcg cacacgcggg cgccgaacga cgcgcccgat ccgcgcacat 2077021 ccacatcgac ccaggcgtag ccgctggcga cgaaacgtct ccgacgacgc ttatctgcgg 2077081 cgatgtgctg gaggggcttg cccccgagca acatccgcaa cggccagcgc aactgcagcg 2077141 accggtagta gcgggtctga tgcaggatcg cgggcagcct tgcggcactc gtcaggcccg 2077201 cgggcaggta gaggtcgatg gcgatgcgca ccccgtcgcg catcgtcaca tagcacgagg 2077261 agtagcgcat cccacgatat ctcgggtagg cggatcgttg gtccggcgcg gagtaccagg 2077321 ccgcatccga gccgccgcgt ctggtcatcg ggtagccagg cgatcagctc aagaagatgt 2077381 tgaccgcggt tgccaggtcg ggggatgccg atgtctccag gttttggtag ctgccgccgc 2077441 tgagctgtgc gacggcttcc caggttgccc gatcgggatc agcaccgaag tcgatgatgt 2077501 tgaccgcgat cggcttggcc gggtctgcgc tcttgcggat gaaatcctgc aggcccggcc 2077561 cgtcgagggt ttggtccgta tgcggccccg cggtaataac cagcacagaa ttagcctggc 2077621 caacatggta attggctagc atctcctgat agatcaagcg cagagtggtg aacgacaccg 2077681 cgccaccgcc cgaggagtat tgcttgccca acgcggccgt caaggccgcg gggcggggct 2077741 ggccgttgac cgggtcggcc aatggcccgg ccggcacctc tgttcggccc tcgcggccgt 2077801 cgaatgtcca cagtccgacg accgaactgg gcggcatcgc cttgatccgg ttctcaagcg 2077861 ccgcaacgac attgctaagc cggctattgc cgccttcatc attgggcatc gattggtcga 2077921 gcatgatggt cgcggccact ccggccgacg cggtgaccat ggtgtccgcc agggtcgcgc 2077981 gcatggagtc gtcacccacc gacaaagtcg aaggcagcgc tgggaaactg gtgacggggc 2078041 tgctcggcgg tttgacgtcg ctgactcgga aaccagctct ggccagtttg gccagttgct 2078101 cgggcttgtg caaatacctg gcaaacgcgc tggccgccga cgtttgctcc tgcgatagcc 2078161 atgcaccact gagcagcacc gtcggatagt cagcgaccgc agccggcccc ggcggcagcc 2078221 aggaacccaa ggtgttctcg gcatctgaaa gtgactggcc gcgctggaac aactgttgtt 2078281 cggtggtgac caccgcgtgc acgggtgccg tggcgacatc gccgggcttg agcagcgtgt 2078341 ccatcgccgc ggtcaaggag tcgtcggcga gcttaggtcg tgcgcccatc agggtgcgca 2078401 ccgcgccgat acccgctgtt gctggcgcgc cagcaggtgc tgacgcggca gccaccgcct 2078461 cgccggccaa atacgcggca tcgccgttgc cactgctcgg cattgccagc cgcagtgatc 2078521 cccaggcagg caagtccaag ccggacaacg agttcggatt ggtttgcagg ccgggcaacg 2078581 ccgcccagtt ctggttggcg agggcctgct gcaattcggg ccgcacggcg agcaacaccg 2078641 gcgatatcac cagtgagcgg ctatcgctaa tggcttggct gcccgcggcc ccggtaagcc 2078701 gcgccgccga gatggagcta ctcggaatcc acaatcccgg ctggccgccc agttcggtcg 2078761 gccatttgcc gatgaaacca ttgatgacgg catcggagcc ggccgaggtg acagccactg 2078821 ccacacaacg gtcgccgacc gggcccgccg acgcgttgta gctgtcggct gactccttta 2078881 cctgatcggc gattgatggg tcggctataa cagcgacggt gtccttgccg cccacgcagc 2078941 gggcggcagc cgtatgcgag cggttggaca acgcgtcacc gaagaagcac cacaagatca 2079001 ccccggccac cattaccacc actgcgacaa gggccacgat cacgccgata ctgactcccc 2079061 gccgcccgtc cgcgctacgg tgcccggcct gccagtcgcc gggcccccga tgtccgaagc 2079121 gaaacagcgg cggcggggcg gccgctatgg gctcggcacc cgttggctcc cagtcggggc 2079181 ggggtggaat gtcagggtag tcttctgagc cgctagccga gtagccgccg acggcggagt 2079241 agtggccctc gctggataac gggccgtcat cgggctgatc tacaccggga tagtcgtagc 2079301 tacccgatat gtcctcccag tgctgttgtt ccgccgcatg cccgtcggac aggtcgtcaa 2079361 cggaatcctc ggggtcgggc ttgctgtgcc tacccatacc ggcgtctgcg tcctctccgt 2079421 cgaaggccgg cgcctgtcaa gcacgagcta cgcaccggct ctgcccgatg gggccggctc 2079481 tctcccgcaa gcgggcggtg cccccacagc ggcccgctag cgggccgcat cgtcaccggc 2079541 cctgtccgat ggggccggct tctcagcggc ccgggcctta aactcccgac gacgtcggtg 2079601 caggatcggc tcggtgtagc cgttgggctg ctgggccccg gacaagatca gctcctgcgc 2079661 ggccaggaag gcgatactgt cgtcgaagtt gggtgccatc ggtcggtatg ccacgtcgcc 2079721 cgcgttttgt cgatcgacca acggcgccat ccgctccaag ctggcccgca catccgcgct 2079781 ggtgatcaca ccgtggcgca gccagttggc caacaattgg ctggagattc gcagcgtggc 2079841 ccggtcctcc atgagcgcga cgtcgtggat gtcgggcacc ttcgagcagc cgacaccttg 2079901 atcaacccag cgaaccacgt agccgaggat ggattgacag ttgttgtcga cctcttcgcg 2079961 gatctcgtcg ggagcccagg ccaattcctt ggccagcgga atggtcagca attgttcgat 2080021 ggtggcgcga cgcttccccg ccagtccttg ttgcaccgcg gcgacgtcga cctggtggta 2080081 gtgcagcgca tgcagggtgg ccgcagtggg agagggaacc caggcggtgc tggccccggc 2080141 gcgcggctgg gcgatttttg tctcgaccat gtcggccatc agctcggtca ttgtccacat 2080201 gcccttgccg acctgggctc ggccgctgaa cccggcggcc aggccggcat cgacgttgtg 2080261 gtcctcgtag gccaagatcc acggctggct cttcatggtg cccttgcgca ccatcgggcc 2080321 ggcctccatc gaggtgtgga tttcatcgcc ggtgcggtcc aggaacccgg tgttgatgaa 2080381 caccacgcgg tccgcggcag ctttgatgca cgccttgagg ttgaccgtgg tccggcgttc 2080441 ctcgtccatg atgccgatct tcatggtgtt ttgcggcaac cccagcacat cttcaacccg 2080501 gctgaacagt tcgcaggtaa acgccacctc ggccggaccg tgcatcttcg gcttgacgat 2080561 gtagatggag ccggtgcggc tgttgatcag cggcccgttg acgtcgctgg cctttagccc 2080621 gtggatggcg atcaggccgg tgaatagggc atccatgatg ccttcgaaca cctcgctgcc 2080681 gtcagtgtcg acgatggcgt cattcgtcat caagtgaccg acgttgcgga cgaacatgag 2080741 gctgcgtcca ggcagcgtga actggccacc gccgggtgcg gtgtagttcc ggtccctatt 2080801 gagcacccgc aggaaagcgg tgccgtcctt gtctaccgct gctgccaggt cgcccttgtt 2080861 caggccgagc cagttccgat aacccagcac cttgtcggcg gcgtccacgg cggccaccga 2080921 gtcctcgaag tccatgatcg tggtgatcgc ggattccagg atcacgtcct tgacgccggc 2080981 ccggtcggtg gtgccgacct gcgactccgg atcgatcagg atctcgatgt gcaaaccgtg 2081041 attgattagc agcaccgatg tcggcgactc ggctgcgccg gtgtagccgg cgaactggcc 2081101 ggggttggcc aggccggtgg acttatccgg caaggcaacc acgagctggc catcctgcac 2081161 tgtgaaaccg gtggcgtcgc caaaggaacc cgacgacagc ggaacactgt cgtcgaggaa 2081221 cttgcgggca tacgcgatca ccttgtcgcc acgaaccttg ttgtacgtgg ggcctttttc 2081281 ggcgccgtcg gtctcgggga tgacatcggt gccatacaag gcgtcgtaga gggagcccca 2081341 gcgagcgttg gccgcgttca gagcaaaccg cgcgttgagc accggcacca ccagctgggg 2081401 gccggcggtc gtggtgatct cagcgtcgac accggacgtg gtgatggtga agtcatcagg 2081461 ttcgggaagc aggtagccga tctcggtgag gaactggcgg taggcatcca tgtcgatggg 2081521 ctcgatcacc cgacgccggt gccacttgtc gatctgcgcc tgcagctcgt cgcgggcgtt 2081581 caacagagct tggttctgcg gggtcaggtc ggcgacgacc ttgtcgacgc ccgcccagaa 2081641 gctgtccggg tcgatatcgg tgccaggcag ggcttcattg ttcacgaagt cgtagagcac 2081701 ccgagcgatg cgcaagttgc ccaccgacac gcgatctgtc attgcttcct cccttactgg 2081761 caattgctca gcctaccggc cgacaagacg actactacat ccggcgaccc gcaaccgcag 2081821 gtcacgtcaa gctctgtcag cacctcggca cccggcatgc tcgctggctg gcaacgcgac 2081881 gcagtggccg cagcgatcat acgggtgggg cggtctgcct actacaatcc cgttggatcc 2081941 gttctggccg gacagcatcc cgccgggagc ggctccggcc acgtcggtgc cgctcattgc 2082001 ggcggtgtga ttccgaatca ggccagacgc ttgatccccg gataggagtc gaacccacgg 2082061 tcgaagctca tcagccgggt aatgtcgtgg tgagccatga cggcgatgtg tagtgcatcc 2082121 ctggccgaca acgtttgata gcgcaacagg gcatccctcg cgtgttcgac atcggtgcgc 2082181 tcgatcggca gcacttcgtc gaccacgccg ataattgcat cgaaagccgg ctgaatcgcc 2082241 tcacggcgtt tgattgccac ataccggtgg catatctcct gcagcacctc ggcgtcggtg 2082301 actaggcgtt caccgcccga cagcgccgac tccagcagac gttgcgcgtc cagcttatgc 2082361 gggtgcgagg cacccaccag atacatggga atgttggagt caacgaggat caccgtgatc 2082421 cttcgcgctc cgcaccgcgt ccgcgttcga tttcctcgag catctgctcg acgtcggctg 2082481 tcgggaactc atggcgtgcg gcggcacgga cagatcgcag cttcatgtct agatcgccgc 2082541 gcggttctcg ctcccgcgcc tcccgcagcg tccggcggac ccactcggac actgtcgtgc 2082601 ggtgccggcg tgcaatctct cggagttctt cccactcgtc ggggtccagc agaacctgca 2082661 ggcgcttact catagcatga gtgtatacag ctcatacggg tgtatgaatc cagctcgcct 2082721 gcgcgcggga gctatccccc gggggacccg ttctggccgg ccagcgttcc gcccgtaccg 2082781 ccgctgccgc ccggcccgcc agagtcgccg ggcccaccgt cgccgccgtc gccgatcaac 2082841 tgggcgtagc cgccgttacc gccggtgccg ggggtgccgt ccccgctgac gccggcggcg 2082901 ccgccattgc caccgttgcc gatcaacccg gccgtgccgc cgttaccccc ggtgccgccg 2082961 gcgccggtga cgggtaccgc gactgaggga atctgggttg gcccaccgga gccgccggcg 2083021 ccaccgttgc cgaccagcag cgcgccgttg cctccgttcc cgccactgcc accgccgggg 2083081 gcgaagacgc cggtaccggc ggacccggcg cctccggcgc caccgttgcc gatcagcccg 2083141 acggcgttgc cgccgtcgcc gccgtggcct cggagaaagc cgtcggtttg cacgctgttg 2083201 ccgccgtcgc cgccgttgcc gatcagcgtc ccgccggtgc cgccgtcgcc gccgttgcca 2083261 tcgaagaagc tgaacccgcc gttaccgccg tcgccgaaca tcccggcgtt accgccggta 2083321 ccaccgtcgc tgaaaccttg cagggtgctg cctccggaac cgccgtcgcc gtacagccac 2083381 ccgccgttgc cgccgttgcc gccgttgccg atgccggtgg gggcggcacc gacggagccg 2083441 ccgtcgcccc cgttgccgat cagccgggcg tcaccgcccg ctccgccgtc ggcggctccc 2083501 gatgggacat tgccgccgtt gccgccgttg ccgtacagca gtccgccggt gccgccggtg 2083561 ccgccggccc cgcccgcgaa gccggccttg ccgagcccgc cggcaccacc ggcaaggccg 2083621 gtcgccccgg gcccgccgat gccaccgttg ccgccgttgc cgatcaaccc tgcatcccca 2083681 ccggcgccgc cgggctggcc gatcccgccg ttgccgccgt tgccgccgtt gccgtacagc 2083741 aacccgccgg gcccgccggg ctgtcccggc gcgccattgg cgccatcacc gatcaacggg 2083801 cggccgaaca acgcctgaaa cggcccgttg agcacgtcca gcgcggcggc gtccgccgcc 2083861 tcggcaccgg cataggcccc cgcgccggcg gtcatggcat gcacaaactg ctcgtgaaac 2083921 agcgccgcct gcgcactcaa tgcctgatag gcctgggcgt gcgcgccgaa caaatccgca 2083981 acagccgccg acacctcatc ggcgcccgcg gccagcaccc ccatcgtggg cacggccgca 2084041 gccgcattcg ccgcgccgat cgccgacccg atgccggcca aatccgacgc cgccgccacc 2084101 accacttctg gggccgccac cacaaacgac atgacgcgct cctcacggga ccgggtgcgc 2084161 agtcccagcg gttacagcgt attgacgtcc cgccaccacg tccggcgttc gggccaactg 2084221 atccgaaacg attgtcagcg gcagcagccc ccgattacgc tcggtgtccc gtcagacacc 2084281 gatccctgcg tcagtcaacg atgcgtcccg tcgcgcatgg tgccaaccag gtcctccacc 2084341 acgtcctcca gcgccaccat ccccacgaca gaaccgttgt cggcggttac caaggccaga 2084401 tggctgttga tgcgccgcat ccgcgacagg gcgtcggcca gcggcaacga ttggggaacc 2084461 cgcggcagcg ggcgcacaac ggccagatcg atcacggttt gcggattgtc accgagggtc 2084521 agcacgtcct tgatgtgcag atatccgatg aaccttccac cgcgatccac caccggaaag 2084581 cgggagtagc cggtttgcgc caaggcctgt tcgaccccgc cgatggtggg cccggaccct 2084641 accgccgaca cctgcactgc ccgaatgttg accagcggca ccgcgacatc ggcaaccagg 2084701 cgagttcgaa tccgaagggc tcgggttagc cgcgtgtgct cctcgtgatc cagcaggcct 2084761 tcggatagcg attcggcgat catctcggac agttccgcag tggagacggc gatgtcgagt 2084821 tcatccttcg gctgcacccc aaccagccgc agtatcgcgt tggcgcagtt gttgtagaac 2084881 gcgatgaacg gccgggcgag gcgcacgtag accaggtacg gcgggaccag caacatcgct 2084941 gttcgctccg gaccagccaa agcgatgttc ttcggcacca tctcaccgag caggacatgc 2085001 agcgccacca cgatcgccaa cgccaccgcc aacgacaagg tgtgcagcag cgccggcggt 2085061 acaccgctca gcccgaacga cagctgtagc agcttgacga ctgccggttc gccgacccgg 2085121 ccaagcagga tcgaggacac cgtaaccccc agctgtgcgc cggtcagcat cgccgggagc 2085181 tgttcgcccg cccggatcac ggtgacggca gtggccttgc cctgctcggc cagcgcttcg 2085241 aggcggtcac gacgcgccga gatcaacgcg aattccgcgc ccacgaagaa cgcgttggcg 2085301 ccgatcagca aaagcgccag caacaccgcg gacagcacat ccatcagcgg ccccgccccg 2085361 acccggggtc ggcatggccg cccattttga tcaactccaa cgagtcgatc cggcgcccgt 2085421 ccatctggat cacggtggct aaccaccgca tcgagtcgtc gggaagtccg tcctggtcca 2085481 aggcagtcag ctcgaccgtt tcgccggcca ccgggatgtg gccgagctct cgaagcacca 2085541 acccgccgat cgtctcgtac ggaccgtcgg gggctcgata gccggtggcg ctggccacct 2085601 cgtcgatgcg tagcagaccc gagacccgcc atccgttgcc ggctgccacc acatccggtg 2085661 tcgcatcgtc gtgttcgtcg cggacgtcgc ccacgatctc ttcgatcaag tcctccaggg 2085721 ttaccatgcc cgcggtgccg ccgtactcgt ccacaaccat ggcggtctgt agcgcactgg 2085781 cgcggacctg cgccatcacc gcatcgccgt cgagcgtcga gggcaccacc gcgaccggct 2085841 cggcgaccgt cgttagcagc gtgtgcgcgc gatcgccggg cggaacctcg aacacctgct 2085901 tgacgtgcac gatgccgacg gtcgcatcga gatctccctc gaccaccggg aagcgcgaga 2085961 atcccgatgc ggccgcggcc gcaaccaggt cggcgatggt gtcatcggtc tgcagcgcca 2086021 cgatcttcga ccgtggcgtc atcagctcct cggccgtcag ggcgccgaac tgcagcgagc 2086081 ggcgcatcag ccacgccgtg gcgtcatcga gtgcgccgct gcgcgcggaa ctacgcacca 2086141 acgacaccag ctcctgcggt gtgcgagctg agcgcagctc ctcggccggc tcgatgccaa 2086201 gtcgacgcac gatccagttc gccgctccgt tcgtgagacg gatggccggg gtgagcagca 2086261 gtgagaacag cacctggccg gccacgactg agcgcgcggt gcgcagcggg cgcgccaccg 2086321 cgagatactt ggggaccagc tcgccgaaga ccatcgacag cgatgtcacg atcaccaggg 2086381 caaaaaacgt gataagaccg tcggccaccc gatcagacat tccgactgcg accagcccag 2086441 gatgcggtag ctcggccacc agcggttcgg tcaggtagcc ggtagccaag gtggtgatcg 2086501 agatacccaa ctgagcaccc gaaagctgga acgacagccg gtggtgtgcg cgctggatga 2086561 agcggtcccg actggtgccg ccgcgggcgt tggcctccac ggtgctgcgg tccagcgcgg 2086621 tcagcgagaa ttcggccgcg acgaacaccc ccgtgcctgc ggtgagcgcc aagatcgcca 2086681 ggatggtggc gacggtatcg gtgaggttca cgggcggctc ggtcgtcgcg ctatatcggg 2086741 ccgagccagt accggccgct cgcctggaaa accgacggtg tggacgggtg cccgcggcac 2086801 gtcatccctt tcgctcgcaa ccgcgcagcg cgatactgcg ggtttgaagt acacatcgta 2086861 gcgagatagc tcgtggcgcc agcttcacca gccggcgggc agcggatggc cctcggcaaa 2086921 gcccgctccc gactgcacgc ccaccacggc ccgctcatgc agctccgcca ggttcgaggc 2086981 acccacatag gtgcaggtgc tgcgcacgcc agaagtgatg tggtcaatta ggtcctccac 2087041 acctccgcgg tcggggtcaa ggcccatccg cgacgtcgag atgccttcct cgaacaacgc 2087101 cttacgagct cggtcgaacg ggttgtccgc gccggtccgg gccaccaccg cccgcttgga 2087161 tgccatgccg tagctctcct tgtacggctg atcgtcgcgg tcacgcatca ggtctccggg 2087221 ggattcgtag gtgccggcga accacgatcc gatcatcacg ttcgaggcgc cggcggccag 2087281 cgccagagcc acgtcgcgtg gatgccggat cccaccgtcg gcccagatat gaccaccgag 2087341 ctgccttgcc gcagaagcgc attcgagcac agcggagaac tgcgggcggc cgacaccggt 2087401 catcattcgg gtggtgcaca tggcgccggg gccgacaccg accttgacga cgttcgcccc 2087461 ggctttcagc agatcccggg tgccctccgc cgacaccacg tttcccgccg ccagcggcaa 2087521 acccaagtcc agtgccgaga ccgccttgat cgcgtccaag gtcttgacct ggtgtccgtg 2087581 tgcggtgtcg atgaccagca cgtcgacgcc ggcttcggcg agcgctcggg ccttagcgcc 2087641 cacgtcgccg ttgatgccga cggccgcgcc gatccgcagc cggcccgcgc tatcggtggc 2087701 cggggtgtag ataccggcgc ggatagcccc ggtgcggctt agcactcccg ccaacgtgcc 2087761 gtcggcgtcg gtcagcaccg caacgtcgac cggggcgtgc tccagcaggt cgaagatctt 2087821 gcgtggctcg gttcccgctg gagcggtcac atagtccgtc acggcgatat cgcgcacccg 2087881 ggtgaagcga tccacgccca ggcaggacga ttcgcgcacc aatccgatcg ggcgaccctc 2087941 gaggatgacc accgcgacgc catgtgcgcg cttgtggatg agcgccatgg cgtcggacac 2088001 cgaatcgtcg ggtgccagcg tcactggggt gtcgagcacc aggtcccggc ttttgacgaa 2088061 cgccaccgtc tgctttaccg ccgggatcgg cagatcctgc ggcaggatta cgatgccacc 2088121 gcggcgggcg accgtctcgg ccatccgccg cccggctacc gcggtcatat tggcgaccac 2088181 taccggaatg gtggtgcccg agccgtcggc ggtggacaaa tcgacgtcga agcgcgacgc 2088241 gacctcggat cggttcggaa cgatgaacac gtcgttgtat gtcaggtcgt acccgggtgg 2088301 gtgcccgtct agaaatctca tcacttaccc ctttaccccc ctttagttct agcccgctac 2088361 accggtactt cggtgcggtc tgaactccat agtgtgtgga acttgcctgg ttcgtcgatc 2088421 cggccgtagg tgtgtgcgcc gaagaagtcg cgctgggcct gggtgagtgc agcgggcagc 2088481 cgcgcggtgc gcagcgcgtc gtaatacgac agggccgacg agaatcccgg ggtcgggata 2088541 cccagttggg ccgccgtcga caccacacgc cgccaactgt cgatcgccga ttcgacggcg 2088601 cgcggaaata cggggccaca atcagactgg ccaggttcgg gctggcgtca aaggcttcct 2088661 tgatgtggtt gaggaacttc gcccggatga tgcagccgcc acgccagatg gtggccaggt 2088721 cgcccggcgt gatgtcccag ccgaattcgg cgctgccggc ctggatctgg ttgaagccct 2088781 gagcgtaggc cacgatcttg gaggcgtaca acgcctggcg gacgtcttcg gtgaacgtgg 2088841 cggggtcggc gggctgctcg ccgagcttgc ccgaagccag accgctggcg gccgagcgtt 2088901 gccccacgga tcccgagaga gcgcgggcaa acaccgcttc ggcgatgccg gtcaccggca 2088961 cacccaggtc cagcgcggac ttgacggtcc aacggccggt gcctttctgc tcggcccggt 2089021 ccacgatgac gtcgacgagc ggtttgccgg tcttggcatc ggtctgccgc agcacctcgg 2089081 cggtgatctc gaccaggtag ctgtccagat cgccattgtt ccactcggtg aacacatcgg 2089141 cgatcgccgg cgcggtcaga cctagcccgt cgcgcatcag ctggtaggcc tcaccgatga 2089201 gctgcatgtc ggagtactcg atgccgttgt ggaccatctt gacgaagtgc ccggagccgt 2089261 ccgggccaat gtgggtgcag cacggcacgc cgtcgacatg cgcggagatc tcctcgagca 2089321 gcggacccag cgattggtat gactcggcgg gtccgccggg catgatcgac ggcccgttca 2089381 acgcgccctc ttcgccgccg gagatcccgg ccccgacgaa gtgcaagccc cgctcacgca 2089441 tcgctttctc gcggcgcatg gtgtcggtgt acaacgcatt gccgccgtcg atgatgatgt 2089501 cgccgggttc catggcgtca gcaagttcgt tgatgacagc gtcagtggcc tctccggcct 2089561 tgaccatgat cagcacccga cgcggttttt ccagtgcggc aagaaattcg gggatcgttt 2089621 cactgcgcac gaacttgccg tctgagctgt gctccttaag cagcgcgtcg gtcttggcga 2089681 ccgaccgatt gtgcactgcc acggtgtagc cgtgccgggc gaagtttcgg gcgatgttgg 2089741 aacccatcac ggccaggcca gtgacgccga tctgcgcgat gccggctggc gattccgacg 2089801 aactcatgtc ctgcctttca gttgggcccg gcttcgctag gcgatgaaca gccgctgcag 2089861 ctgcgtgagc cacggtacgg ccagcgcgac ggtgggcacc accaggacgg cggccgcagc 2089921 tagatatgcg gccgcggaca gaaccgcgct atttccacgc cccgacagcc ggcgcacgcg 2089981 gagcaccgtg ctgggacctc cgacggccaa cgcacccgac ggcgcccgcc cggacgcaca 2090041 ggcgaccaat gcccgagcca ggggagtgcg cccggcggcg cgcaccgcgg cgtcatcggc 2090101 caggagctcg acgagtagct gcaccgcccc cagcgcattg gcgctgcgga ccaaccgcgg 2090161 gaaagccgcg tgcaccgcgg taaacgcctc caggacaaga tcgtggcggg cgcgtagatg 2090221 agcccgctca tgggtaagga tcgccgcgac ctcggcgtcg gcgagcgcgg tcagtgtgcc 2090281 ttcgctgacc acaacccggc tacgcacacc gggcagacag taggcaaggg gctgcgcgac 2090341 gtccaagacc cgaaggtcgc gggcccgcgc gcacggctgg gcaagcgcgc cattgtgtcc 2090401 gaccccgacg agatcgacca ccatgcggtg gtgtgcccgt cgtcgtcgcg tggcggtggc 2090461 gacgcgcacc acggcgaccg ccagccgggc accgaccagc acagtcaacg caaagacggt 2090521 gatgtaggcc gcccacagcg gccagccgag gcggccggcc gcgccgacga agctggtcgt 2090581 agggcgtccg tcgggaccgg gcatgagcag cctgctagcg atcgcgattc cggcgctgaa 2090641 cgacgacagc accgcggcca gggcaatcgc ctgccacagc accatggcgg cgcgcggtgc 2090701 gcgcagtggc cacgttgccc gggctagcag ggctggggtc gggccagcca gcagcaccgc 2090761 gaggatggtg aaggccagcg cggacacgcc gttagtctcc ctcaggtctc cgttgccgcg 2090821 ccagccggtg gccgattgcc atgaccggct tccaattcgg cgagcgcacg tcgtagcgca 2090881 tccgcctcgt cggcaccgac tcgctcgacg aagtgcacca gcgcggcttg cctgctgccg 2090941 gagtactcgg cctgagccaa tgcatcgacc atcagcccgg cgaccaattc gtcgcggccg 2091001 tgcacgggag cgtagcggtg ggctcgatcg tcgcggatct gcagcacgag gttcttcttt 2091061 gccaaccgtt gcagcacggt catcaccgtc gtgtaggcaa ggtcgcggcg cgccgacaac 2091121 gcttcgtgga cttggcgaac ggtttggggt tccgtcctgg accacaaatg gtccatgacc 2091181 gcgcgttcca aatcccccaa ccgtgtcagc ttggccattg ttcgttcatc tcctgcgggt 2091241 tgaaaccagc gtactccggc ttactactcg ctgtcgtatc caaaccggcg ggcggccgta 2091301 ccgggcctat gcacccggct cgcaaacatt acacgctaac gcttgctaaa ttagggcagc 2091361 cttgcctatc attacttcgt cgagccacaa cgaccgcggc cgagtcctga gggctgcagt 2091421 gacccccggt cgactcgatc ggcgagcccc gtgccttggt gcacggggct cgcccgttgg 2091481 tgtagacaca aggacgtgca gccatcgccg gactcacccg ctccgctgaa tgtcaccgtg 2091541 ccgttcgaca gcgagttggg tttgcaattc accgaactgg gtcccgacgg ggcccgagcg 2091601 cagctcgacg tccggcccaa gttgttgcag ctgacgggcg tcgtgcacgg cggtgtctac 2091661 tgcgcgatga tcgagagcat cgccagcatg gcagcctttg cctggctcaa ttcgcacggc 2091721 gaaggcggga gtgtggtcgg cgttaataat aatacggatt tcctgcgctc catcagctca 2091781 gggatggtgt atggcaccgc cgaaccgctg catcggggtc ggcggcaaca gctgtggctg 2091841 gtcaccatca ccgacgacac cgaccgggtg gtcgcccgcg gccaagtgcg gctgcagaac 2091901 ctcgaggcgc ggccttaacc cgctcgaaac cgttgaacct gccgcggcgt ggcaggatcg 2091961 cagagcatgc gcctgacgcc gcacgaacag gagcgtttgc tgttgtccta cgccgccgag 2092021 ttggcccgcc ggcgtcgggc ccgcggcctg cgcctcaatc atccggaagc catcgcggtg 2092081 atcgccgacc acatcctgga aggcgcgcgt gacggccgca ccgtcgcaga gttgatggca 2092141 tccgggcgtg aggtgctcgg ccgtgacgat gtgatggagg gagtgccgga gatgctcgcc 2092201 gaggtacagg tggaggcgac gtttccggac ggcaccaagt tggtcaccgt gcatcagccg 2092261 atcgcatgat tcccggagaa atcttttacg gcagtggtga tatcgagatg aacgccgcgg 2092321 cactctcccg cctgcagatg cggatcatca acgccggcga tcgtccggtg caggtcggta 2092381 gccacgtcca tctcccgcag gccaatcggg cgctgtcatt cgaccgtgcg acggcccacg 2092441 gctaccgtct ggacatcccg gcggcgacag cggtgcgctt cgagccgggc attccccaaa 2092501 tcgtcgggtt ggttccgttg ggcggacggc gcgaggtacc cggtctgacg ctaaatccgc 2092561 ccggacggtt ggaccgctga tggcgcgact gtcaagggag cgctacgcac agctgtacgg 2092621 acctaccacc ggcgaccgga tacggctggc cgacaccaac ctgctggttg aggtcaccga 2092681 agaccggtgt gggggaccgg gactggccgg tgacgaggcg gtgttcggcg gcggcaaggt 2092741 gctgcgcgag tccatgggcc agggccgtgc gagccgggcc gacggtgccc ccgacaccgt 2092801 gatcaccggt gcggtgatca tcgactactg gggaatcatc aaggccgaca tcgggattcg 2092861 cgatggccgc atcgtcggga tcggaaaggc cggcaatccc gacatcatga caggtgtgca 2092921 tcgggatctc gtcgtcgggc cgtccaccga aatcatcagc ggcaaccgtc gaatcgtcac 2092981 cgcaggcacc gtcgactgtc acgtgcactt gatctgtccg cagatcatcg tcgaagcctt 2093041 ggccgcgggc accaccacga tcatcggcgg tggcaccgga cccgccgagg gcaccaaggc 2093101 caccacagtc actcccggcg agtggcacct ggcccggatg ctggagtcac tggacggttg 2093161 gccggtgaac ttcgcgctgc tcggcaaggg aaacaccgtg aatcccgacg cactgtggga 2093221 acagttgcgc ggtggcgcat cgggtttcaa actccacgaa gactggggat cgaccccggc 2093281 ggccatcgac acctgcttgg cggtcgccga cgtggccggg gtgcaggttg cgctgcactc 2093341 cgacactctc aatgagaccg gattcgtcga ggacaccatc ggcgcgatcg ccggacgttc 2093401 gattcacgcc taccacaccg agggcgccgg cggcgggcac gcaccggaca tcattaccgt 2093461 cgcggcgcaa ccgaatgtac tgcccagctc gaccaatccg acccgcccgc atacggtgaa 2093521 cacccttgac gagcatctcg acatgctgat ggtgtgccac cacctcaacc cccggatccc 2093581 ggaggacctc gcgtttgccg aaagccggat ccgaccgtcc accattgcgg cagaagatgt 2093641 gttgcacgat atgggggcaa tctcgatgat tggcagcgat tcccaggcga tgggccgtgt 2093701 cggcgaggtg gtgctgcgca cctggcagac cgcgcacgtg atgaaagccc gccgcggggc 2093761 actggaaggt gacccgtctg gtagccaagc cgccgacaac aaccgggtcc gccgctacat 2093821 cgccaaatac accatctgcc cggccatcgc acacggcatg gatcacctga tcggttcggt 2093881 ggaggtggga aagttggccg acctggtgtt gtgggagccg gcgtttttcg gggttcgccc 2093941 gcacgtcgtg ctcaaaggtg gggcgatcgc ctgggcagcg atgggcgatg cgaacgcgtc 2094001 aatcccgacc ccgcaaccgg tgctcccgcg accgatgttc ggcgcggccg cggcaaccgc 2094061 ggcggcgacc tcggtgcact tcgtcgcgcc gcaatccatc gacgcgcgcc tggcggaccg 2094121 gctcgcggtc aatcggggac tagcgccggt ggccgacgtg cgcgcagtgg gcaagaccga 2094181 cctgccgctc aatgatgccc taccgagcat cgaggtcgat cccgacacct tcaccgtgcg 2094241 aatcgacggc caggtgtggc aaccgcagcc ggccgccgaa ctacctatga cacaacggta 2094301 tttcctgttc taatgacctc gctggccgtg ctgctcaccc tcgccgactc gcggctgccc 2094361 acgggtgcgc acgtgcactc gggcggcatc gaagaagcca tcgccgccgg cttggtgacc 2094421 ggcctggcca ccctggaagc gttcctgaaa cggcgggtcc gcacccacgg cctgctgacg 2094481 gcgtccatcg cggccgcggt gcaccggggc gagctggccg tcgacgacgc cgaccgggaa 2094541 accgacgcgc gcacaccggc tcccgcggcc agacacgcct cacgcagcca gggccgcggg 2094601 ctgatcaggc tggcacggcg ggtgtggccc gattccggct gggaggaact gggcccgagg 2094661 ccgcatctgg cggttgtggc cggacgggtc ggcgcgctga gcgggctggc gcccgagcac 2094721 aacgccttgc acctcgtcta catcacaatg accggctcgg ccatcgccgc ccagcgactg 2094781 ctggcgctag atcccgccga agtgaccgtg gtgaccttcc agctgtccga actgtgcgag 2094841 cagatcgcgc aggaggccac agccggactg gcagacttgt ctgatccgct gctggacacg 2094901 ctcgcccagc ggcatgacga gcgcgtgcgt cccctgttcg tttcctgaaa ggtaaggcat 2094961 ggcaacgcat tcccatcccc actcgcacac cgtgcccgct cggccaaggc gggtccgcaa 2095021 accgggcgag ccactgcgca tcggcgtcgg cggcccggtc ggctccggca agaccgcact 2095081 ggtggcggcg ctgtgccggc aattgcgggg agagctgtcg ctggcggtgc tgaccaacga 2095141 catctacacc accgaagacg ccgacttctt gcgcacacat gcggtgctgc cagacgaccg 2095201 gatcgcggcc gtgcagaccg gcggctgccc gcacaccgcg atccgcgacg acatcaccgc 2095261 caacctggat gcgatcgacg agttgatggc cgcccacgac gcgttggacc tgatcctggt 2095321 cgaatccggc ggcgataacc tcacggccac cttctcttcg gggctggtgg atgcgcagat 2095381 cttcgtcatt gacgttgccg gcggcgacaa ggtgccgcgc aagggcgggc cgggggtgac 2095441 ctattcggat ttgttggtag tcaacaagac tgacctggct gcattggtgg gcgccgacct 2095501 ggcggtgatg gcccgcgatg cggacgcggt gcgcgacggc cgcccgacgg tgctgcaatc 2095561 gttgaccgag gacccagctg ccagcgatgt cgtggcctgg gttcgtagtc aactggccgc 2095621 cgatggagtc tagtgttctg gtggtcgcgt cgccgaatcg gttgccgcgc atcgactgtc 2095681 ggggcggtgt ccaggcacgc cgaaccgcgc ccgacacggt gcacctggtg tcggcggccg 2095741 cgaccccgct gggcggtgac accatgagaa tccgggtgat cgtggaacgg ggtgcccagc 2095801 tacggctgcg tagtgccgcc gcgacggtgg ccttgcccgg cgtggatacc ctgacgtcgc 2095861 atgctcactg ggagatcgac gtgaccggca ccctggatgt ggacctggag ccgacggtcg 2095921 tcgccgcctc agcccggcat ctgtcgcatg ccaccttgcg cctgcacgac gacggtcggg 2095981 tccgcttgcg cgagcgcgtg cagattggca gatgcaatga gcgcgaagga ttttggtcgt 2096041 catcgctgca ggccgatcgg catggtcgtc ccctgctgcg gcaccgggtg gaactgggtg 2096101 ccgggtcttt ggccgacgac gtcattgcgg cgccgcgcgc cactatcagc gagctgcgct 2096161 atccggcgac ggcattcacc gacgccatcg acgcacggtc gaccgttttg gcgttggcgg 2096221 gtggcggaac actgagtacc tggcaggctg accggttgcc tggctaacgc tagctggcca 2096281 ccttagcgct tgccgctgag ccctgcgcct cggcggccag ctcggccagc tgttcgagcc 2096341 gcgttcgcgc aaatgcctgc tggtcggtga tggtcagctg gccgcggcga gtactgagga 2096401 aagtcaccgt ccacgacagc agagtggtga tcttggtctt gaacccgatc aggtacgcca 2096461 ggtgcagcac cagccaaatc agccaggcga taaagccgct gaactcaacg ggaccgatct 2096521 tggccaccgc cgaaaacctc gaaaccgtgg ccatcgatcc cttgtcgaag tactggaatg 2096581 gctcacgctc cgccgggttg gcgccggcca gttcggcctt gatcgtgctg gcgacgtatt 2096641 tcgccccctg gatggcgccc tgcgccacac ccggcacacc ctccacagcg gccatatcgc 2096701 ccaccacgaa cacgttcggg tacccgggaa tggacaggtc gggcagcact tggacccggc 2096761 cggcccggtc gagctcaacc cgtgattgct cggcaaggtc cctgcccaac caactggccg 2096821 aaaccccggc cgaccagacc ttgcaggccg actcgatgcg ccggacggtg ccgtcggagt 2096881 ccttgacggt gatgccgttg cggtcgacgt cggtgaccat cgcacccagc tggatttcca 2096941 cgcccagctt ctgcaaccgg gcagccgccc gctgaccgag ctttgcgccc atcggtggca 2097001 gcaccgccgg ggcggcgtca agcagaatca cccgcgcctt ggtcgagtcg atgtgccgga 2097061 atgcgccctt caacgtgtgc tcggccagct cggcgatctg tccggccatt tcaacaccgg 2097121 tggggccagc cccgacaacg gtgaatgtca gtagcttggc ccgccgttcc ggatcgctgg 2097181 accgttcggc ttgctcgaaa gcgctcaata tgcggccacg caactccaac gcgtcgtcga 2097241 tggacttcat gccgggtgcg aattcggcga aatggtcgtt gccgaaataa gactggccag 2097301 cacccgcggc gacgatcagg ctgtcgtagg gggtttggta ggtgtgaccg agcaattccg 2097361 agacgacgca ctgcccggcc aggtcgatgt gggtgacgtt gcccaacagt acctggacat 2097421 tgcgctgctt acgcagcacg acccgggtcg gcggagcgat ttctccctcg gagataatcc 2097481 cggtggccac ttggtacagc agcggctgga acaggtgatg ggtggtgcgc gcgatcagct 2097541 tgatgtcaac gtcggcccgc ttgagcttct ttgccgcgtt tagcccgccg aacccagatc 2097601 cgatgatcac aactcgatgc ctacgaggtg gttgcgctgt gggttcttgc tggggactca 2097661 tgttccgctg ctcctgacgg ggtcacctcg atgagcgagt tcagttagct actacggtag 2097721 tcaacccgac cgctgcaggc ccagttgagg acatgtgtca tcagccacac cacagcgtgc 2097781 ctgcgtcacc ggccccggtg gctacacacc cagcagcggg cgcagcgctt cagcggcggt 2097841 ggtgatgacc ccgggcagat agccgtgcgg agccaagttg atgattaatc cgtcgacacc 2097901 ggcatcgagc accttggcct gaatttggtc ggcgatctgt gccgggctgc ccaccaccac 2097961 gcgaccgctc atctccgcgg gaatcgcatc tggcgagagt gtctcgtcga tcatcaccgt 2098021 caacagcagg ctggtctgaa gcgtcgaccg gtcccggccg gcctcgtcgc accgcgcggc 2098081 cagcgcccgc atcttgcgcg gcagctcgtc gaccgccgcc acgatgttga gatggtcggc 2098141 aaagcgggcg gcgatcgcga atgtcttttt ctcaccaccg ccgccgatca agattgggat 2098201 gcggtcgcga taccgcggct cggccatcgc cgattcggtg gtgtaccaat cgccgaaaaa 2098261 cgttgggcgc tcacccttga ccattggctc gaggatctgt agcgcctctt cgagccggtt 2098321 gaaccggtca ctgaaagtgc cgaactcgaa gccgagctgg cggtgttcca gctcaaacca 2098381 accggctcca atgccgagga tcgctcgacc ggcgctaacc acgtcgagcg tggtgatgat 2098441 ctttgccagc agggtcgggc tgcggtaggt attgccggtc accaacgcgc ccagttgcag 2098501 ccgctcggtc gccgtggcca gcgcaccaag ggccgtgtag gcctccagca tcggctggtc 2098561 gggcgtcccc aacatgggca gttggtagaa gtggtccatc acaaacaggg agtcgtaacc 2098621 agccgcttcg gcctcacgcg cttgagcgat gacggacggg aaaagcttct ccacccctgt 2098681 gccgtaggag aagttgggga tctgtagacc cagccgaata gtcacactac ctaccgtagc 2098741 gatcggccgg tgaagcgaaa ggttcagccg aagtgagcca gcgcgccgtg gctgacgtgc 2098801 agcgtctggc cggtgatgtg gcgagccgca ggggtggtaa ggaacagcgc cagccgcgca 2098861 atctcggccg cgacgggcgc gggtgtgtgc gaaagccctt cgtaaccggt ctgcacgctg 2098921 cggccgcaag cgactgtatt gatggtgatc ccgcgcgtgc cgaaaacggc ggcctggccc 2098981 gcgatccaat tcgagagggc cgctttgatc gcggactcgg cgccaccggc aggcgggttc 2099041 tctgccacca cgctgacaat cgagccgccg gagcgcaggt gatcgcccac ggattgcacc 2099101 gtcagcacca ccgagagcac cgtcgcgtcg agcgcattgc gccaggcgtt ggccgtgtcg 2099161 gacaccgagt aggcgcgcgg gtcaccggca tcccaggacg gcgctggcac gttgacgatg 2099221 gtgtccaggt gacgggggaa cagtccccgt gcctcggtga ggctggtcgg gtcggtggtg 2099281 tcgcacacaa cggcgtccac gtcgagttcc ttcgcggcga cctcgaggtc gccgcggcgg 2099341 gcacccacca gggtgacctt gtggccgtcg ttgcgaaagc cttcagccat tgtgcgcccg 2099401 agatcggtat ccccgccggt gaccagcacc tccactgcca tgacctcctc gtgttcaacg 2099461 ctgaacccag accctggacc gttgcctgga atcgcatcgt gatggcgtaa gctccggtag 2099521 atgttactgg acagtagcta ttcggggaaa ctccgcaccg ccacgacgcg cagacgatct 2099581 tggtaaccat taggtttggc cagtgcgttg gatcggactg tcaactggcc tagtgtcagc 2099641 gatgctggtc gcgggcctgg tggcatgtgg atcgaattca cccgcatcgt cgccagccgg 2099701 gccgacgcag ggtgcccggt cgatcgtggt gttcgcggct gcctcgctgc agtctgcgtt 2099761 cactcagatc ggtgagcagt tcaaagccgg caacccaggg gttaacgtca acttcgcttt 2099821 cgctggttct tctgagttgg ccacccagct gacccagggc gcgaccgccg acgtctttgc 2099881 atctgcggac accgcgcaaa tggacagtgt ggccaaggcg gggttgctgg ccggtcatcc 2099941 gacaaacttc gccaccaaca cgatggtcat cgttgccgcc gcaggcaatc ccaagaagat 2100001 ccgatctttt gccgacctca cgcggccggg gctcaacgtg gtggtctgcc agccgtcggt 2100061 gccatgcgga tcggcgaccc ggcgcatcga agatgcaacc gggattcatc tcaacccggt 2100121 cagtgaggaa cttagcgtga ccgacgttct gaacaaggtc atcaccgggc aagccgatgc 2100181 cgggctggtc tatgtcagtg acgcgctcag cgttgccacc aaagtgacgt gtgtcagatt 2100241 tcccgaagcc gcgggtgtgg tcaatgtcta cgccatcgcg gtgctaaagc ggacctccca 2100301 gcccgctctg gcccggcagt tcgtggccat ggtgaccgct gcggcaggtc ggcggatcct 2100361 ggatcagtcg ggtttcgcca agccctgacg atgcacccgc ctacggatct gcctcgttgg 2100421 gtatatctcc cggcgatcgc ggggatcgtg ttcgtggcaa tgccgctggt cgcgatcgcc 2100481 atccgggtcg attggccgcg tttctgggcg ctgatcacta ctccgtcttc tcaaacggcc 2100541 ctgctgttga gcgtgaagac cgccgcggcc agcacggtgc tgtgcgtact gctgggcgtc 2100601 ccgatggcgc tggtgctggc ccgcagccgc ggacgactgg tgcggtcgtt acgaccgctg 2100661 atcctgttac cgctggtgct gccgccggta gtcgggggta tcgcgttgct ctacgcgttc 2100721 ggccggctcg gcctgatcgg gcgctacctg gaggcggccg gcatcagcat cgcattcagt 2100781 accgcggctg tggtgctggc gcagaccttt gtctcgctgc cgtatctggt gatttcccta 2100841 gagggtgcag cccgcaccgc cggagccgac tacgaggtgg tggcggcgac acttggggcg 2100901 cggcccggca ctgtctggtg gcgcgtgacc ctgccgttgc tgctcccggg cgtggtgtcc 2100961 ggatcagtac tggcgtttgc ccgctcgctc ggagagtttg gcgcgaccct aacctttgcc 2101021 ggttcccggc aaggggtcac ccgtaccctt ccgctggaga tttacctgca gcgggtgacc 2101081 gatccggacg cggcggtggc attgtcactg ctgctcgttg tggtagcggc actggtggtg 2101141 ctgggtgtgg gtgctcgtac gccgatcggg accgatacca ggtagccggt catgagcaag 2101201 ctgcagctgc gcgcggtcgt cgccgaccgg cgtttggacg tcgaattctc ggtgtccgcg 2101261 ggcgaggtgc ttgcagtgct cgggcccaac ggtgcgggca agtccaccgc cctgcatgtt 2101321 atcgcggggc tgcttcgccc cgacgcgggc ttggtacgtt tgggggaccg ggtgttgacc 2101381 gacaccgagg ccggggtgaa tgtggcgacc cacgaccgtc gagtcgggct gctgttgcaa 2101441 gacccgttgt tgtttccaca cctgagcgtg gccaaaaacg tggccttcgg accacaatgc 2101501 cgtcgcggga tgtttgggtc cgggcgcgct aggacaaggg cgtcggcact gcgatggctg 2101561 cgcgaggtga acgccgagca gttcgccgac cgtaagcctc gtcagctatc cgggggccaa 2101621 gcccagcgcg tcgccatcgc gcgagcgttg gcggccgaac cggatgtgtt gctgctcgac 2101681 gagccgctga ccggactcga tgtggccgcg gccgcgggta tccgttcggt gttgcgtagt 2101741 gtcgtcgcga ggagcggttg cgcggtagtc ctgacgaccc atgacctgct ggacgtgttc 2101801 acgctggccg accgggtatt ggtgctcgag tccggcacga tcgccgagat cggcccggtt 2101861 gccgatgtgc ttaccgcacc tcgcagtcgt ttcggagccc gtatcgccgg agtcaacctg 2101921 gtcaatggga ccattggtcc ggacggctcg ctgcgcaccc agtccggcgc ccactggtac 2101981 ggcaccccgg tccaggattt gcctactggg catgaggcaa tcgcggtgtt cccgccgacg 2102041 gcggtggcgg tgtatccgga accgccgcac ggaagcccgc gcaatatcgt cgggctgacg 2102101 gtggcggagg tggatacccg cggacccatg gtcctggtgc gcgggcatga tcagcctggt 2102161 ggcgcgcctg gccttgccgc atgcatcacc gtcgatgccg ccaccgaact gcgtgtggcg 2102221 cccggatcgc gcgtgtggtt cagcgtcaag gcgcaggaag tggccctgca cccggcaccc 2102281 caccaacacg ccagttcatg agccgacccg cgccgtcctt gcgtcgcgcc gttaacacgg 2102341 taggttcttc gccatgcatc aggtggaccc caacttgaca cgtcgcaagg gacgattggc 2102401 ggcactggct atcgcggcga tggccagcgc cagcctggtg accgttgcgg tgcccgcgac 2102461 cgccaacgcc gatccggagc cagcgccccc ggtacccaca acggccgcct cgccgccgtc 2102521 gaccgctgca gcgccacccg caccggcgac acctgttgcc cccccaccac cggccgccgc 2102581 caacacgccg aatgcccagc cgggcgatcc caacgcagca cctccgccgg ccgacccgaa 2102641 cgcaccgccg ccacctgtca ttgccccaaa cgcaccccaa cctgtccgga tcgacaaccc 2102701 ggttggagga ttcagcttcg cgctgcctgc tggctgggtg gagtctgacg ccgcccacct 2102761 cgactacggt tcagcactcc tcagcaaaac caccggggac ccgccatttc ccggacagcc 2102821 gccgccggtg gccaatgaca cccgtatcgt gctcggccgg ctagaccaaa agctttacgc 2102881 cagcgccgaa gccaccgact ccaaggccgc ggcccggttg ggctcggaca tgggtgagtt 2102941 ctatatgccc tacccgggca cccggatcaa ccaggaaacc gtctcgctcg acgccaacgg 2103001 ggtgtctgga agcgcgtcgt attacgaagt caagttcagc gatccgagta agccgaacgg 2103061 ccagatctgg acgggcgtaa tcggctcgcc cgcggcgaac gcaccggacg ccgggccccc 2103121 tcagcgctgg tttgtggtat ggctcgggac cgccaacaac ccggtggaca agggcgcggc 2103181 caaggcgctg gccgaatcga tccggccttt ggtcgccccg ccgccggcgc cggcaccggc 2103241 tcctgcagag cccgctccgg cgccggcgcc ggccggggaa gtcgctccta ccccgacgac 2103301 accgacaccg cagcggacct taccggcctg accggatccg gccgcacccc aagtgatacc 2103361 cctgggcggg gtgtcagcgc ggccgggcgc tcttgagccg gcgcagcggc gtccatggag 2103421 cgccgccggc caacgcggcg ttcttggcgc cggcgcgaac gttgttcagg tgccaaccgg 2103481 tggtgggtcg tggttggcga cttgtaccgc ttccggttct ccataggtcg cgccggggac 2103541 gggcagcggg tcgtgtgcgc gtctttcagt gcaccgtgcg aaacgccgac accgttgaac 2103601 tccacctgaa agcaccgctg aacagcagaa aagcgcccac gaaaacaccg tggggcgcca 2103661 cacacgtttg atcacgccac aacccaccga caccgtcact accctcaaat cgttacgcag 2103721 aagcggtata ccgatatcac ggccctgtgc tgggctaagc cagcgtctgc aaggagaacc 2103781 gcatggacat cacggcaaca accgaatttt ccgccatgaa cctcgacggc aagacgggta 2103841 taggttggct cggctacatc gtcatcggcg gtatcgccgg ctggctcgcc agcaagatcg 2103901 ttaagggggg cggctcgggc atcctgatga acgttgtgat cggcgtcgtc ggggcattcg 2103961 gcgccggctt ggtccttaac gcgctgggcg tcgacgtcaa ccatggcggg tactggttca 2104021 ccttcttcgt cgccctgggc ggggctgtcg tcctgctgtg gatcgtcggc atggtgcgca 2104081 agacctagcg ctaaactgtt gtcggccatg caaattgagt gtgactgcgg cggccggcga 2104141 cgggtagcgg catgatggag tgatggtctc accggcgacc acggcgacga tgagtgcgtg 2104201 gcaggtgcgt cggcccggcc cgatggacac cggcccgctc gaacgagtga ccacccgggt 2104261 gccgcgcccg gcgccatcgg agttgctggt ggccgtgcac gcatgcgggg tgtgccgcac 2104321 cgatctccac gtgaccgaag gtgacctgcc cgtgcaccgc gaacgggtga ttcccggcca 2104381 cgaggtagtg ggagaggtca ttgaggtggg ctcagcggtg ggcgcggctg ccggtggcga 2104441 attcgaccga ggagaccggg tgggtatcgc ctggctgcgt cacacttgcg gggtctgcaa 2104501 gtactgccgg cgcggcagcg agaacctctg cccgcaatcc cgctacaccg gctgggacgc 2104561 cgacggggga tacgccgaat tcacgacggt tcctgcggct ttcgcgcacc atctgccgag 2104621 cggctatagc gacagcgagc tggcgccgtt gttgtgcgcc ggcatcatcg gatatcgatc 2104681 gctgctgcgc accgagctac cacccggtgg ccggctgggt ctctacggat tcggcggcag 2104741 tgcccacatc accgcccagg tcgcgttggc gcaaggcgcc gaaatacatg tgatgacacg 2104801 cggggcccgc gcgcgcaagc tggcgctgca acttggcgct gcatcggctc aggacgccgc 2104861 cgaccggcca cccgtgccgc tggacgccgc gatcttattc gccccggtcg gggatctggt 2104921 gctgcccgcg ctggaagcgc tggaccgtgg cggcatcttg gcgatcgccg ggatccacct 2104981 gaccgatatt ccggacctga actaccagca gcacttgttc caggagcgtc agatccggtc 2105041 ggtcacgtcg aacacccgcg ccgatgcgcg cgcgttcttc gacttcgccg cccagcatca 2105101 catcgaggtc accacgccgg agtacccgct tggccaagcc gatcgtgcgc tgggcgacct 2105161 gagcgccggc cgcatcgccg gtgccgccgt gctgctgatc tgaccgagct caggtcgaca 2105221 ggtgccagac cagggcagcg gccagggcac ccatcccgtt cagcgaccaa tgcagtgcga 2105281 tcggtgcgat caggctgccg ctgcgccgtc gcagccagct gaacacgaat ccggccactc 2105341 cggtggccaa caccgccagc atgacaccgg ccaccagccc gatgatcccg ccaccgaaca 2105401 gtcgagtgaa gccgacattg ctgctcgtga gccccagcga cgtcgcaata tgccacagac 2105461 cgaacagcac cgaacccgcc accgcgacac cccggaatcc ccaagcccga ttcagcgccc 2105521 catgcaacac accgcggaag gccagctctt cggggatgac ggtttgcagc gggatcatga 2105581 ccatcgaggc gatcaccgcg ccggagatcg tcgcgtagtg atggttcatg aacatcggcc 2105641 gggttatcgg cagcaggaca cctaccgaga tcaccgccac caccagggca acggccgcta 2105701 gcgcatagac gagcccggat ttccagtgtt ggcggctcag tccgagttca gcccagccca 2105761 ggcctctact ccgcaccaag atcaccagtc cgaccgcggc ggccgggacg gtggcgatgc 2105821 tcgcccacgg tgtggtgaaa tgcgcgatca ggttcgtcag taccagcacc aggacgacga 2105881 cggcgatgtc gacatatatc cggaaccggt gcatcaccga gaggtgcgac accagtggac 2105941 ctggatgaac ggctgcgcaa gcagtcaagt ggtcagacat cgtcagcaga gtctaccggc 2106001 ggagggctcg gtgtccgctc tcgcgcgtag gccttgagct cggctgcgag cgcgtctgcc 2106061 gccaacagct ggggaagcag ctccgattca gaggttcggg cgcgaaacac gagcccgacg 2106121 gtcacgttgt gctcagggcg gtaatcgacg gtgattgtgt cgccggcgcg cactgttccg 2106181 ggagcgatca cccgtaggta ggcgcctggt ttggcggccc gggtgaaggt cttgatccaa 2106241 taacgcaaat ccaggaaggc cgcgaaggtc cggcacggga tccggggcgc cgagacttcc 2106301 aacaccaatc cgtcggagcc gatgcgccag cgttcaccaa tccgcgcgta cgtcacgtcg 2106361 acgcccgagg tggtcagatt ctcgccgaac attccgttgt gaagggtgcg gtgaagctgg 2106421 gtttcccacg cgtcgaggtc ttctcgcgca tacgcataga cggcctgatc atcaccgcca 2106481 tggagcttcg ggttgccgac ggtgtcgcca accaggccgc tgccgacacc cgcatgcatc 2106541 gacccgggtg cccgcaccat gaccgcctca gatgccgcca ctttgtcgat tccggtcaac 2106601 ttcgactgcg cgcgcggatc agggttcgcc cgaacacgag ccaggttgac cgacaacaca 2106661 tgcgccaccc gcacagggta gctctgacgc gcgttggtcc acgccagccg gcgcggcgca 2106721 acggtcactc ctcgccgcga gcccgagcct cgtaggtcct gcgcttctcc atgtcgacat 2106781 cgtcggtgaa gacatgctcg ccgccgagga gtcggttcaa gccctcggaa acctggcgcg 2106841 gcatgaaccg ctgtgccacg atcatcgagc cagccgcttt cgtgacccgc acccgcggtt 2106901 tgggatgaac aatcagcccg acgatcgcgt cggcgatatc ggccggctcg gcgttcttga 2106961 atcctttgat cccaccggtg cccgcaatga gctcggtgtt gacaaacgac ggcaacacca 2107021 tcgagaactt cacgccggcc gaacggtatt caagcctggc cgaatcggtg aacgcgacca 2107081 ccgcgtgctt gctggcacag taagtggcca cgcctacggc gtagatttcc ccggcaagcg 2107141 aggcgacatt gataacgtgt ccccgcccgc gcgggaccat ccgctgcgcc gccagcttgc 2107201 tacccaagat caccccgtag acgttgatgt ccaggattcg gcgggttacc gggtctggtt 2107261 cgtcgacaat ccgccccacg ggcatgatgc cggcgttgtt gaccagcacg tcgatcgggc 2107321 cgagttggcg ctcgacggcg tcgaggaatc ccgaaaacga atccgggtcg gtgacatcga 2107381 gtttgccgta catgtcgagg tcgagatcgg cacccgactc tttcgccatc gcctcatcga 2107441 tgtcgccgat agcgaccttg gctcccaagt tgtgcagcgc ggccgctgtg gccaatccga 2107501 tcccccgggc gccgccggtg atggcgatta ctttgtcctg gaccttgtcc cggatcttga 2107561 cgccgatgga tgtcctgcct ggcactgtcg tcccttcgct cggcgggcct tagccgccgt 2107621 ccaatgcggt cgcgcccgtg tagtcacggt agccgcgaac gccgatgaaa cagctacggt 2107681 gtgcacgtgc ccgaacgatt gctcgatgcc gtgcgtgtgc tcgacttgtc cgacggctgt 2107741 tctgctggag gcaccgatat ggtgacacga ctgctcgccg acctgggcgc agacgttctc 2107801 aaggtgcaac cccccggcgg cagcccagga cgccacgtgc ggcccacgct ggccggcacc 2107861 agcatcgggt tcgccatgca caacgcgaac aaacgcagcg cagtgctcaa cccgctcgac 2107921 gagagcgacc gtcggcggtt cttggacctc gccgccagcg ccgacatcgt cgtcgactgt 2107981 ggtcttccgg gacaggccgc cgcgtacggg gcatcgtgtg ccgagttggc cgatcgctac 2108041 cgacacctgg tggcgctgtc gatcaccgac tttggcgctg ccggtccgcg gtcgtcatgg 2108101 cgcgcgaccg atccggtgct gtacgcgatg agtggtgctc tctcgcggtc gggccctacc 2108161 gccggcacgc cggtactgcc gccggacggt atcgcttcgg caaccgcagc ggtgcaggca 2108221 gcctgggccg tactggtcgc ctatttcaac cgattacgtt gtggtactgg ggattacatc 2108281 gacttctccc ggtttgacgc cgtcgttatg gcgttggatc cccccttcgg ggcgcacggg 2108341 caggtcgcag ccggcatccg cagcaccggg cgatggcggg gacggcccaa gaaccaggac 2108401 gcttacccga tttatccgtg ccgggacggc tacgtacggt tctgcgtgat ggcgccgcgg 2108461 cagtggcgcg ggctgcgccg ctggttgggg gagcccgaag attttcagga ccccaagtac 2108521 gacgtgatcg gcgcacgttt ggccgcatgg ccgcagatca gcgtgttggt cgcgaagttg 2108581 tgcgccgaga agaccatgaa ggagttggtg gcagccggcc aagcgctcgg ggttcccatt 2108641 accgcggtgc tgacaccgtc gagaatcctg gcctccgaac acttccaggc ggtgggtgcg 2108701 atcaccgatg ccgagctcgt tccgggggtg cgcaccgggg tgcctaccgg atacttcgtt 2108761 gtcgacggga agcgcgccgg tttccgtact ccggcccccg ccgcggggca ggacgaaccg 2108821 cgctggctcg cggatccagc gccggtgccc ccaccctcag gccgggtcgg cggctatcca 2108881 ttcgaaggtc tgcggattct tgatctgggc atcatcgtgg ccggcggcga gctcagccgg 2108941 ctgttcggcg acttgggcgc cgaggtcatc aaggtcgaaa gtgccgacca ccccgacggg 2109001 ttgcggcaga cccgagtcgg ggatgcgatg agtgaatcat tcgcgtggac ccatcgcaat 2109061 cacctcgcgc tgggcctgga cctgcgcaac agcgagggca aagcgatctt cggtcgcctg 2109121 gtcgctgaat ccgacgcggt gttcgccaac ttcaaaccgg gaacccttac ctcacttggg 2109181 ttttcctacg atgtactgca cgccttcaac ccccggatcg tgctcgccgg gagtagtgca 2109241 ttcgggaacc gagggccgtg gagcacccgg atgggctacg ggccactggt gcgcgccgcc 2109301 accggggtca cccgtgtttg gacatccgat gaggcgcagc cggacaactc tcggcatccc 2109361 ttctacgacg cgacgacgat cttccccgac cacgttgtcg ggcgggtcgg tgccctgctc 2109421 gcgctggcgg ccctgatcca ccgcgatcga actggcggcg gagcccacgt ccacatctcc 2109481 caggccgaag tcgtcgtcaa tcagctagac accatgttcg ttgccgaggc cgcccgagcg 2109541 accgacgttg ccgagatcca cccggacacc agtgtgcatg cggtctaccc ttgtgctggc 2109601 gacgacgaat ggtgcgtcat ctcaatccgc tccgacgatg aatggcgtcg cgcgacatct 2109661 gttttcggcc agcctgaatt ggcgaacgac ccacgcttcg gggcaagccg gtcacgcgtg 2109721 gccaaccgtt cggagttggt ggccgcagtg tcggcctgga ccagcacccg taccccggtg 2109781 caagcggccg gcgcgctgca ggcggccgga gttgcggccg gcccgatgaa tcgcccgtcg 2109841 gatatcctcg aggatcccca gctgatcgag cgaaacctgt tccgcgacat ggtgcatccg 2109901 ctgatcgccc gtccgctgcc cgccgagacg ggtccggctc cgtttcgtca cattccgcag 2109961 gcaccccaac gcccggcgcc gctgcccgga caggacagcg ttcagatctg ccgcaagctg 2110021 ctcggcatga ccgcggacga gaccgaacgc ctaatcaacg agcgcgtaat gttcgggccg 2110081 gccgtcactg cctaagtggt ctcgccggtg tcgttcgtcg acggtcggct gattgccctt 2110141 ccggctccga gatcgacgtt ttgcccgcct gttcgtgctt tatctgcgaa gccccgatct 2110201 gggcgcatcg gggtgacgca ttcgggcagc taaagctttt cgacccgcaa gccggcggtg 2110261 cccctcctcg ttccgctgcc cggtctgctc gatcggttcg gggtcgccgc gctaggccca 2110321 attgcccggc tcctcctcgg gccgttccac gacccgcatc gtcgccgggc taggttcaag 2110381 ccatgccggt agaccccagg acgccagtgc tgatcggcta tggacaggtc aaccaccgag 2110441 gcgacatcga cgccgagaag cagtccatcg aacccgtcga cctgatggcc gccgcggccc 2110501 ggaaagccgc ggattcgacg gtgctcgagg cggtggattc gatccgtgtg gtgcacatgc 2110561 tgtcggcgca ttaccggaat cccgggcagc tcctcggcga acgaatcaag gcgaggacct 2110621 tcaccaccgg ttacagcggg gtgggcggca acatgccgca atccctggtc aaccgggcat 2110681 gcctggacat ccagcgcggg cgggccggcg tggtgctgct ggctggcgcc gaaacctggc 2110741 gcacccgaac gggcctgcgc gccaagggca gcaaactgga gtggactgtg caggacgaat 2110801 ccgttccgct gccggacatg gccggcgacg acgttccgat ggccggtgcg gctgagctgc 2110861 ggatcaacct ggaccggccg gcctacgtgt acccgatatt cgagcaggcg ctgcgcatcg 2110921 cctacggcga gtcgatcgag aaccaccgaa agcggatcgg cgagctgtgg gcgcggttca 2110981 gtgccgtagc tgctgacaac ccgcacgcgt ggatccgcaa cccggttacg gctgacgaga 2111041 tctggcagcc cggcccacag aaccggatgg tcagctggcc ctacaccaag cttatgaact 2111101 ccaacaacat ggttgaccag ggtgccgcgc tgctgctgac gtcggtcgaa cgtgcgacac 2111161 gtctgcgaat accggccgaa cgctgggttt atccacaggc tggcaccgac gcccacgaca 2111221 caccggccgt cgccgaccgc caccgactgc atcggtcgac ggccattcgg atcgccggtg 2111281 cccgggcgct ggaactggct gggctggggc tcgatgacat cgaatacgtc gacctgtatt 2111341 cgtgctttcc ctccgctgtc caagtcgccg caatcgaact cggcctggac accgacgatc 2111401 ctgcccgccc gctgaccgtc accgggggcc tgaccttcgc cggcgggccg tggagcaatt 2111461 acgtcacgca ctccatcgcc accatggctg aactgctggc ggccaatccc gggcgccgag 2111521 gtctgatcac cgccaacggc ggttacctga ccaaacacag tttcggggtc tacggcaccg 2111581 agccgccgtc ggaattccgc tgggaggaca tgcaacccgc ggtcgatagg gagcccaccg 2111641 gagatgggtt ggtcgagtgg gaaggcatcg gcaccgtcga agcgtggacc acaccagtca 2111701 accgggacgg acaacccgag aaggcgttcc tggcggtgcg cacgcccgac gggtcgcgca 2111761 gcttggccgt gatcaccgat cccgcatcgg tgcaagcaac ggtgcgcgag gacatcgccg 2111821 gcgtcaaggt tgccgtcgcc cccgacggca ccgcgaccct gcgatagccg gcgggcagca 2111881 cgagtcacgt tccagaagca atggtcgcgc aagcgacact gacgtgccta ttgtcatgag 2111941 gagacgttgg gggaggtgag gccgggtgca gatcctggtt accgacgcca cgggtgccgt 2112001 cgggcggtcg gtcactcggc agttgatcgc tgccggacac acggtgagcg gtatagccca 2112061 gcacccgcac gatgctctgg acccccgcgt cgactatgtt tgcgcgtcgt tgcgcaaccc 2112121 agtgctgcaa gagttagccg gcgaagccga cgcggtgatc catctcgccc cggtcgacac 2112181 cagcgccccg ggcggtgttg gcatcaccgg actggcacat gtggccaacg cggccgcccg 2112241 cgccggtgcc cggctgctgt tcgtttctca ggccgctggg cgacccgaac tatatcggca 2112301 ggctgagacg ctggtgtcca ccggttgggc acccagcttg gtcatccgta ttgcgccacc 2112361 ggtcggccgc caactcgatt ggatggtgtg ccggacagtg gccacgctgc tgcggagcaa 2112421 agtctcggca cggccgatac gagtgctaca tctcgacgac ttggtccgct tcctggtttt 2112481 ggcgctgaat accgaccgca acggtgtcgt tgacctggcc acccctgaca ccaccaatgt 2112541 ggtcaccgcg tggcggctgc tccgatccgt ggacccgcac ttgcgaacac gtcgggtccg 2112601 cagctgggag caattgattc ccgaggtgga tatcgctgcc gtgcaggagg attggaactt 2112661 cgagttcggc tggcaagcga ccgaagcaat tgtcgacacc gggcggggcc tcgtcggccg 2112721 cagactgcac ccggcaggcg cgaccaacgg atcgggtcaa ctagcactgc cggtggaggc 2112781 gcccccgcgg tctgtgcctt cccacgggga acccttgggc agcgcggctc cagaagggtt 2112841 ggagggagag ttcgacgacc gtatcgacga gcggttcccg gtcttcagct cggccagtct 2112901 cgccgaagcg ctgccgggtc cgctgacccc gatgacgctg gatgtccagt tgagtggact 2112961 gcgcgcggcc ggtcgggcga tgggtcgggt actggcgctt ggcggtgtcg ttgccgatga 2113021 gtgggagaga agagccatcg cggtgttcgg tcaccgcccg tatatcggag tgtcggccaa 2113081 tattgtggcc gccgcccaac tgccggggtg ggacgcgcag gccgtagccc ggcgggcact 2113141 gggcgagcaa ccgcaggtca ctgagctgct tccgtttggt cgaccgcaac ttgcgggcgg 2113201 accgctcggc tcggtcgcga aggtggtcgt gacggcgcgg tcgctggccc tgctgcgcca 2113261 tctccggagc gacacacacc actatgttgc cgccgcagat gccgagcacc tcgctgccgg 2113321 gcagcttgcc tcgctaccgg acgccggctt ggaggtccgg attcggctgt tgcgtgatcg 2113381 catccaccaa ggctggattc ttacggtgct gtgggtgatc gacacgggcg tcacagcggc 2113441 gacgttagag cacacccgcg caggctccgc ggtgtccgga gggggcatga tcatggaaag 2113501 tggcagaatc ggcgccgaga ttgctccgct ggctgcggtg ctgcgcgccg acccgccgct 2113561 gtgcgcgctg gccaacgacg gcaacctcgc cagcatccgc gcgctgtctg ctcccgccgc 2113621 cgccgcagtt gacgcggtca ttgcccggat agggcaccgc gggttaggcg aagccgagct 2113681 ggctaacctg acgtttgccg acgatccggc gctactgctg aagacagccg ccgaaatcgc 2113741 cgcgcggccc gccgggccag ctcacccagc gacgttgatc cagcgactgg ctgccggcac 2113801 gcgcagtgcc cgggagctgg cgcacgacac caccatccga ttcacccatg agctccggat 2113861 gacattgcgg gagttgggat ctcgacgagt cgcggcggat gtgatagacg tcgttgacga 2113921 cgtgttctac ctgacctgcg acgaactgat taccacgccg gccgacgctc ggctgcgaat 2113981 caaacgtcgg cgcgccgaac gagaacgcct gcaggcacag cgcccgccag acgttatcga 2114041 tcatgcctgg gtacccgtgg agtagcggtc aacacacgtc aattcgtcgt caggtccgcc 2114101 aacggccact gcggatcaac cagcctgtca acgtcgaccg ggttcccgga ccggatcagg 2114161 cccttgacgt cgtccaccac gtcccagacg ttgacattca tcccggctag cacccggctg 2114221 tcgccgtcga gccagaagga gaggaactcg cggccggcaa cgttgccacg gaacaccacc 2114281 cgatcacagc tgggggcgtg gccgacgtac tccatgccga ggtcgtattg atcggtgaac 2114341 aaatagggca gttcagcgta ttcgcccggc cggcccagca tgccggcagc cgccaccgcg 2114401 ggttgtttga gcgcgttggc ccagtgttcg gtacggacgc gggtacccaa tagcgggtgt 2114461 tcagcggcgg caatgtcgcc gactgcgtag atgtcgggat cgctggtgcg cagcgatgca 2114521 tcaaccaaca caccgccctc gcccatcgcc agcccggcct gttgggcgag ttctacgttg 2114581 ggcttcgcgc ccacagcgac tagcacggcg tcggcggcaa ccgtcgaccc gtcacgcatc 2114641 ttgagcccgg tcgccttgcc gtcggctgca gtgatctctt cgagctgggt ctgcaaccgt 2114701 aagtccaccc cttgatctcg atgtaggtcg gcaaacactt tgccaaccgc ttccccgagc 2114761 gcggccagca gcggttgtat ggcggtctcg acgacggtga cgtcgacgcc acgttgacgc 2114821 gcactggcgg ccacttccag gcctatccag ccggcaccca ccactgcgag ggaagacccc 2114881 tgcaccagaa cggagttcaa tgccacggcg tcgttgtagc tgcgcaggta gtggacgccg 2114941 gcggcatcgg atccaggtat tggtgggcgc cgtggggccg atcccgtggc caacagcagc 2115001 ttgtcgtagc gcaccgcagc gccgtcggga agctctaccg tgtgtgcgga ccgatccaat 2115061 gacgacaccc gcacgccgag ccgcacatcc acgtcatggt cgcggtacca atcggaggtc 2115121 tggatggtga agtcgctcag cgactttttg ccggccagaa actccttgga aagcggcggc 2115181 cggtcgtagg gcaggtgctc ttcgtcgccg aacaagataa tccgaccgcc gaagtcgctg 2115241 cggcgcaacg cctctacggc tttagccccg gcaagtcccc cgccaacaat gacgaacgtg 2115301 gttgagctgg ccataattgc tgctccgtcc tgttgtgtgc ggtgccgctt gacagcctac 2115361 gagccggtcg cgtacctggg tcaaccggtc acctgtaggc gcagctcgtc gtctaacgcc 2115421 actcgcacta acgcagcagc gagcagcgca ttggagctgg gtgccaccga cgccagcttc 2115481 ttcgggtcag tgggcaagcc gagctgcttc gccgcggcgg tggctcgatc gtcgaaatac 2115541 ggtcgtaccc agatccagac gtcttggacc tcgcgtaaga aaatgtcggc accggtgtcg 2115601 ccgattccgt tgaaagtctt gagcatacgt ttggcggccg aaacgtcggg tcgtgtgcgc 2115661 tgggcgagtt cccgcaaatc accggagtac tcgtcgcgaa cccggtgagc gatagcggtg 2115721 agccgggtgg ctgagctctc gtcataccgc acgtagtggg cacggccaaa cgcactgatc 2115781 atcgtttgtc gctctgctga cagcacagct ttgggtgtcc gcaggcccga gcagaacaat 2115841 tcccgggcgg cacgtgctgc cgtggcggca ccgatcggct tgctggccag catgcacagc 2115901 accagcagct gaaacagcgg catcggtttg tccctgatcc ggattcccgc ctccgccgcg 2115961 taagtggtgc cggcgagttt aagcagtcgt cgtgccagtg gctccggctt gatcacaagc 2116021 aaccgcatac ccgcaatgcg tggcggcaaa ccgcgactat tgctcgggca agcgcgctcc 2116081 ggcggcctaa gccccggttc cggccaaccc ctgtcagtcc aaatccaccc ggatggtcag 2116141 caagtcggtg cccatcgcgc gtacgccggc actgttcagc cggggtaggc cgcgcagccg 2116201 ctgcctcgga tcgtcgtcgg gtagcaggta ggcggtccca ctgcgccatc ggccgccgat 2116261 gcggacccgc acggcggggt tggccttgat gttgtagacg taatcggaat gctcgccgtg 2116321 ctcggacacc atccagaact ggttgtctac gacgcgcccg cccaccgcgg tacgccgcgg 2116381 ctgtcccgtt ttgcggccga tggtttcgag catggtcatc ggcagttgcc ggccgattgg 2116441 attgaccacg aaccgttgca cgcgatggac gaattcccgc ttgagattca tagctgcatt 2116501 caacgctacc gatctggccg cggcctcacg ttggtgcccc gatagggccg agccgccgca 2116561 gttgtgtcac gtgccgaggt gacagctcct caaggcaggt cacgcccagt agccgcatgg 2116621 tccggatcac acctgtctga aggatctcga tcgcgcggtt gacgcccgcc tcaccaccgg 2116681 ccatcagccc gtaaaggtag gcccgcccga tcagcgtgca ccgtgccccc aacgcgatcg 2116741 ccgcgacgat atcggcgccc gacatgatgc cggtgtccac caggatttcg gtgtgtttgc 2116801 ccagttcgcg tgccacgtgg ggcaacaggt ggaagggtac cggggctcgg tcaagctggc 2116861 ggccgccgtg attggacaac acgatgccgt cgacgccgcg gtccaccacg gcgcgggcgt 2116921 cgtcgagtgt ttggatccct ttgacaacga gcttgcccgg ccactgcgac ttgatccagg 2116981 ccaaatcgtc gaaggtgagg ctggggtcga acacggtgtt caagtactcg ccgacggtgc 2117041 caggccagcg atccagtgaa gcgaaggcca gcggttcggt ggtcaacaag tcgaaccacc 2117101 accgcgggtg tcccatcgcg tcgagaacgg ttcgcagcgt cagcgccggc gggatggaca 2117161 tcccgttgcg gacatcgcgt agccgggcac cggcgaccgg gacgtcgacc gtgaccagca 2117221 tggtgtcaaa tcccgcggcg gcggcgcgcc gcaccaatgc catcgagcgg tctcgatcac 2117281 gccacatata cagctggaac catttgcggc cctgcggcac agcgatgacg aggtcttcga 2117341 tggcacaggt ggccagggtg gatagcgaaa acgggatccc agccgcggcc gccgcccgcg 2117401 cgccggcgat ctcgccctcg gtgtgcatca agcgggtgaa cccggttggc gcgatcccga 2117461 atggcaagac ggtgggctga ccgaggacgt tccagccggc gcacacggtg gtgacgtcac 2117521 gcaggattgt cgggtgaaac tcgatgtcgc ggaacccttg tcgagcacgc gcgatggaca 2117581 gttcgtcctc ggcagccccg tcggcgtagt cgaacgccgc cctaggggta cgccgtttgg 2117641 caatgcgtcg caggtcctgg atggtcagcg cggcgcccag gcggcgcttg gaggtgtcga 2117701 actgcggcct gttgaactgg agcaggggtg ccagatcgcg cactctgggc actcgccggt 2117761 tgaccgccat ccgtttatct aaccagtgtg atatgaagtc agcaagcgac ccgttcgacc 2117821 tgaagcgttt cgtgtacgcg caggctccgg tctaccgcag cgtcgtcgag gagctgcgcg 2117881 ccggacgaaa gcgcgggcat tggatgtggt tcgtcttccc acaactccgc gggctaggta 2117941 gtagcccact ggcagtgcgc tacggcatct cctcgctcga ggaagcccag gcctatctgc 2118001 agcatgacct gctcgggccc cgcttgcatg agtgcaccgg gttggtcaac caggtgcaag 2118061 gccgctcaat cgaggaaatc ttcggcccgc ccgacaacct caagctgtgc tcgtcgatga 2118121 ccctgttcgc ccgtgccacc gacgccaacc aggactttgt cgcgctgctc gccaagtatt 2118181 acggcggcgg agaggaccgg cggacggtgg cattactggc ggtcacatag accgcgcgat 2118241 ccaccggggc gtcgacgcct gacagcggat gtaggttcgg gctcatggag aaggtgatcg 2118301 ccgtgctcat gcggcccgag ccagacgacg actggtgtgc ccgccaacga gctcaagtcg 2118361 ccgacgccct gctgggactg ggcgttgctg ggctgtcgat caatgtccgg gacagtaccg 2118421 tgcgcgactc actgatgacc ctgacaacgc tgtacccacc ggtcgcagcg gtggtcagcc 2118481 tgtggaccca gcagtgctat ggcgagcagg tagcagccgc cctcaggcta ctggctcagg 2118541 agtgtgatga actcggcgca tacctggtga ccgagtcggt tccgctgacc ttcccatcgc 2118601 tcgtcgagtc cggttctcgt acaccgggtc tggccaacat cgcgctcctg cgccggcccg 2118661 atggcctgga ccaggcgacc tggctgaccc gctggcagcg cgaccacacg caagtggcta 2118721 tcgaggcaca ggcgacattc ggctacaccc agaactgggt ggtacgagcc ctcaccccag 2118781 aggcaccggg aatcgcgggc attgtcgaag agttgtttcc cgtggcggcg acaaccgatc 2118841 tgaaagcctt cttcggagcc gccgacgaca acgatctgcg gaatcggata agccggatgg 2118901 tcgcgagcac atctgcattc ggtgccaacc agaacatcga caccgtgcca accagccgct 2118961 acgtgttcag aacaccgttc aaggattgag gaacgtgaga tgacaacact caacgaagcc 2119021 gcggcactgg cggcggcaga acgtgggctt gcggtggttt ccaccgttcg tgccgacggc 2119081 accgtgcagg cgtcgctggt caacgttgga ctgttgccgc atcctgtcag cggcgaacca 2119141 tctctgggat tcaccaccta tggcaaggtc aaactcggca accttagggc gcgcccacaa 2119201 ctggccgtca cgttccgcaa cggttggcag tgggcgaccg tcgaaggccg agcacaactt 2119261 gtcggccccg acgatccgcg gccgtggctg gtcgacggcg agcgattgcg gctgctactc 2119321 cgcgaggtct tcactgcggc gggtggcacg cacgacgact gggacgagta cgaccgggtg 2119381 atggcgcagg agcagcgcgc cgtggtgctg atcacgccca cccgcatcta cagcaacggc 2119441 tgagggactc agcaaacggc gtcgctcgtg cgacctgcgg ggtcgagttg ggttgggttg 2119501 agtcgggcgg ctgcgatgat agctcgcagt gtgcgccggc agcgtccgca gtcgccgcca 2119561 gccccgcaca cagcggccac ttctttggag gtcgacgcac ctcgcgccac ggcgtcacac 2119621 acggtttggt tggtgacgcc gacgcacaag cacacgtaca tcagcaaacc cccagcagat 2119681 gctgcgtcgg cgaacgatca agccgcatat tagtggagtc tagcctaagc tgattagtgg 2119741 agtctaacct aacaatgacc cgcggcttgg actttgcgcc ggcgagacgc gccgacgccg 2119801 caacaaaccc tgcgccgacc cgtactcgct gcactagatt gagacgcggc acgcaaacgt 2119861 gctgttatca gcccaagacg agcccgacac cggtgcgctc cagccctgcc cacctggcgc 2119921 ggttcgccac gacagcctta tatcccatag gagtggtcat gcaaggtgat cccgatgttc 2119981 tgcgcctgct caacgaacaa ttgaccagcg agctcaccgc tatcaaccaa tactttctgc 2120041 actccaagat gcaggacaac tggggtttta ccgagctggc ggcccacacc cgcgcggagt 2120101 cgttcgacga aatgcggcac gccgaggaaa tcaccgatcg catcttgttg ctggatggtt 2120161 tgccgaacta ccagcgcatc ggttcgttgc gtatcggcca gacgctccgc gagcaatttg 2120221 aggccgatct ggcgatcgaa tacgacgtgt tgaatcgtct caagccagga atcgtcatgt 2120281 gccgggagaa acaggacacc accagcgccg tactgctgga gaaaatcgtt gccgacgagg 2120341 aagaacacat cgactacttg gaaacgcagc tggagctgat ggacaagcta ggagaggagc 2120401 tttactcggc gcagtgcgtc tctcgcccac cgacctgatg cccgcttgag gattctccga 2120461 taccactccg ggcgccgctg ataagctcta gcatcgactc gaacagcgat gggagggcgg 2120521 atatggcggg ccccacagca ccgaccactg cccccaccgc aatccgagcc ggtggcccgc 2120581 tgctcagtcc ggtgcgacgc aacattattt tcaccgcact tgtgttcggg gtgctggtcg 2120641 ctgcgaccgg ccaaaccatc gttgtgcccg cattgccgac gatcgtcgcc gagctgggca 2120701 gcaccgttga ccagtcgtgg gcggtcacca gctatctgct ggggggaact gtcgtggttg 2120761 tggtggctgg caagctcggt gatctgctcg gccgcaacag ggtgctgcta ggctccgtcg 2120821 tggtcttcgt cgttggctct gtgctgtgcg ggttatcgca gacgatgacc atgctggcga 2120881 tctctcgcgc actgcagggc gtcggtgccg gtgcgatttc cgtcaccgcc tacgcgctgg 2120941 ccgctgaggt ggtcccactg cgggaccgtg gccgctacca gggcgtctta ggtgcggtgt 2121001 tcggtgtcaa cacggtcacc ggtccgctgc tggggggctg gctcaccgac tatctgagct 2121061 ggcggtgggc gttttggatc aacgtgccgg tttcgatcgc ggtgctgaca gtggcggcaa 2121121 ccgccgtccc tgcgttggcc cgaccgccca aaccggtcat cgactacctt gggatcctgg 2121181 tcatcgctgt ggccacgacc gctttgatca tggccacaag ttggggcgga accacctacg 2121241 cctggggctc agcgaccatt gtcgggctgt tgatcggggc cgcagtggcg ctgggtttct 2121301 tcgtgtggct ggagggccgc gcccgctgcg gccatcctgc cgcccaggct gtttggcagc 2121361 ccagtatttg ccgtgtgctg cgtcctgtcc ttcgtggtcg gattcgcgat gctgggtgca 2121421 ctgaccttcg taccgatcta tctggggtac gtggacggcg cgtcggcgac cgcgtcaggt 2121481 ctgcgcacgt tgccgatggt gatcggcctg ctgatcgcct cgaccgggac gggtgtcctg 2121541 gtcggccgga cgggccgcta caagatcttc ccggtcgcgg ggatggcgct gatggcggtt 2121601 gcgttcctgc tgatgtcgca gatggacgag tggacgccac cgctgctgca atcgctgtac 2121661 ctggtcgtcc taggtgccgg catcggattg tccatgcagg tgctcgttct catcgtgcag 2121721 aacacgtcgt ctttcgaaga cctcggcgtc gcaacatcgg gtgtgacctt cttccgggtg 2121781 gtcggcgcct cgtttggtac cgcaacattc ggtgcgttgt tcgtaaactt cctggaccga 2121841 agactcggtt ccgcgctgac gtcgggcgcc gtgcctgtcc cggcagtgcc atctccggct 2121901 gtcttgcatc agctgcccca gagcatggcc gccccgatcg tgcgggcata tgccgagtcg 2121961 ctcacccagg tgttcctttg cgcggtctcg gtcacggtgg tcggtttcat cctggcgctg 2122021 ttgctgcgag aggtaccgct caccgacatc cacgatgacg ccgacgacct cggcgacggg 2122081 ttcggtgtgc ccagagccga atcgccggag gatgtgttgg aaatcgcggt tcggcgtatg 2122141 ctgccgaacg gggtgcgact gcgcgatatt gcgacacaac ccggttgcgg actcggcgtc 2122201 gccgagctgt gggcccttct gcggatctat caataccagc ggctgttcga ggcagtacgg 2122261 ctgaccgata tcggtagaca cctgcacgtg ccctatcagg tctttgaacc cgtcttcgac 2122321 cgtctggtcc agaccggcta cgcggcacgc gacggcgaca tcttgacgct aaccccgtcc 2122381 gggcaccgtc aggtcgactc cctcgcagtt ttgatccgtc agtggctgct cgaccacttg 2122441 gccgtggcgc ccggcttgaa gcgacagcca gaccaccaat tcgaagccgc tctgcagcac 2122501 gtcaccgacg cggtgctcgt tcaacgagac tggtatgaag atctgggcga cctgtcggaa 2122561 tcacgccaac tcgcggctac aacgtagcga tgcttgccgc gcgtagccgc gcgagctgat 2122621 ccgcgctgca gaatgactgc catgacagcc acaccgcttg ccgcggccgc gatcgcccaa 2122681 ttggaggcag agggcgtcga caccgtcatc ggcaccgtcg tgaaccccgc cggactcacc 2122741 caggccaaga ccgtgccgat acgccggacc aacacattcg ccaatcctgg cctcggcgcc 2122801 agtccggtgt ggcatacctt ctgtatcgac caatgcagta ttgcattcac cgcagacatc 2122861 agtgtggtcg gcgatcaacg tctccgcatc gatctgtccg ccttgcgcat catcggcgac 2122921 gggttggcgt gggcgcccgc cgggttcttc gagcaggacg gcacaccggt ccccgcctgc 2122981 agccgaggaa cgctgagccg gatcgaggcc gcgcttgctg atgccggcat cgacgcggta 2123041 atcggccacg aagtcgaatt cctcttggtc gacgcggacg gccagcggct gccttcgacg 2123101 ctgtgggcgc agtacggtgt cgccggggtg ctcgagcacg aggcgttcgt ccgcgatgtc 2123161 aacgccgcgg caacggcagc aggcatcgct atcgagcagt tccatcccga atacggtgcc 2123221 aaccaattcg agatctcgtt agcgccgcag ccgccggtcg cggccgccga tcagctggtg 2123281 ctgacccgcc tcatcatcgg ccgtaccgcc cgccggcacg ggttacgcgt gagcctatcg 2123341 ccagcgccct tcgccggaag tatcggatcc ggtgcccacc aacacttctc gctgactatg 2123401 tcggaaggga tgctgttctc cggtgggact ggagcagctg gcatgacctc ggccggggag 2123461 gccacggtgg caggagtgct tcgcggactg ccggacgccc aaggcatcct gtgcggatcg 2123521 atcgtgtccg gtctgcgaat gcgacccggt aactgggccg gaatctatgc atgctggggt 2123581 accgaaaacc gggaagcggc ggtgcgattc gtcaagggcg gggctggcag cgcgtacggc 2123641 gggaacgtgg aggtgaaggt cgtcgacccg tcggccaacc cgtatctcgc gtcggcggcg 2123701 atcctcggac tggcactcga cggcatgaag accaaggcgg tgttgccgtc ggaaacgacc 2123761 gtagacccga cacagctgtc tgacgtggat cgtgaccgtg ccggcattct gcgacttgct 2123821 gccgatcagg cggatgcaat tgctgtactg gatagttcga aactgcttcg gtgcatcctt 2123881 ggcgatcccg tggtagatgc agtggtcgcg gtacgccagt tagagcatga gcgctacggt 2123941 gacctcgatc ctgcgcagct ggccgacaag ttccggatgg cttggagtgt gtaacgatgg 2124001 ccgactccgc cggttcggac ctgacgcggc acacggccga agtgccgttg atcgatcagc 2124061 acgtccacgg atgctggctg accgagggga accggcggcg gttcgagaac gcgctcaatg 2124121 aggccaacac cgaacccctg gcagacttcg actcgggatt cgactcacaa ctcgggttcg 2124181 ccgtgcgcaa ccactgcgct cccatccttg gattgcctag gcacgttgat ccgcagactt 2124241 attgggatcg ccgcagtcaa ttcagtgaag ctgaattggc tcgcagattt ctgcaggccg 2124301 ccggggtaac cgactggctg gtggagaccg gaatcggcta cgacgtgtcc ggaatggcaa 2124361 gcgtcgccgg cctcggcgaa ctgtcgggca gccacgctca cgaggtggtt cgtcttgaac 2124421 aggtggccga acaggccgtg caggcatccg gcgactacgc ctcggcgttc aacgagatac 2124481 tgcgccggcg cgcagccaca gcggtggcaa ccaagtccat cctggcctat cgaggtggat 2124541 tcgacggtga tctgaccgag ccacccgcgg cgcaggtcgc cgaggccgcc aagcgctggc 2124601 gcgaccgtgg cggtgtccga ttacaggatc gggttctgct gcgcttcggg ttgcatcagg 2124661 cgttgcgcct gggcaagccg ctgcagttcc acgtcggatt tggcgaccgg gacgctgatc 2124721 tgcacaaggc caatccgctg tatctgctcg acttcctgcg gcagtccggc aataccccaa 2124781 tcgtgttgct gcactgctat ccctacgaac gagaagccgg ttatctggca caagccttca 2124841 acaacgtcta tcttgacggc gggttgagtg tgcactacct gggggcccgg tcgccggcct 2124901 tcatcggccg actactggag cttgccccct tccgcaagat cgtgtactcg tcggacggat 2124961 tcggccccgc ggaactgcac tttctcggtg caacgttgtg gcgcagtgga attcagcgtg 2125021 ttctgcgtgg ctttgtcgag cgcgacgact ggtgcgagac cgatgccctg cgggtggtcg 2125081 acctaattgc ccatggcact gccgcacgca tctatcgcct tggcgatcgg tagctttcag 2125141 gtggcgcagg tgtggccccg tcacgggcta accatggacc gtgccggacc cagtgtcacc 2125201 ggcagcgtcg accaaccgcg cagcacccgc gtgtcacgcc gacttccggc acccgcggcc 2125261 cgcacatcgg ggaagcggtc gaagaacgtt ctcagcccga cctcgccttc ggcgcgggcc 2125321 agggcggccc ccaggcagaa gtggcggccg gtagagaacg caagatgtcg tccggcattg 2125381 gggcgttcga tgtcaaagcg gtgcggatcc gggaacacag cgggatcgcg gttggcggct 2125441 gctaggtaga tcaccacgac ttcgccgcgt ttgattcgca caccagccac ctcgacgtca 2125501 cggcaagcca cccgggcggt gagctgaacc ggcgaatcca gccgcaggat ttcttcaacc 2125561 gtattcggcc acagctccgg atgttggcgc agtgtggcca gatgttcggg ggtatccaac 2125621 aacatgcgaa tcccgttgcc taacaggttc actgtggttt cgaatccggc gaccaaaacc 2125681 agtccggcga tcgcccgaag ttcggtctcg tcgagctgtg tctcgttgtc cccgctttcg 2125741 gcgatctgga tcaactgact catcaggtcg tcacccggag cgtgccgcaa ctgctgcaga 2125801 tgcccttcca gccagcagtc gaatcctcgt atcccctgct gcacacgcag gtactgccgc 2125861 cacggaatcc cgatgtctag actcggcgct gccaactcac caaattccag gacgcgcggc 2125921 ctgtcatgct cgggcacgcc caaaatttcg ctgatgacca cgatcggcag ttgcgagcaa 2125981 tagcgtccta cgacgtccac aatcccgggc tgctcagcga accgatccaa gagattgatc 2126041 gcggtctgtt cgaccagatc gcgtagcgcg ctgaccgccc gtgaggtgaa caccgccgac 2126101 accgttttgc ggtagcgagt gtgatcgggc ggctcgacgg ccagcagcga aggttctcgc 2126161 agggggtgaa gttgatcgcc gcgggtccgc cgctccagcc agcgcagcgg tggtggcaga 2126221 ttctcgccga aggagacgac gcggaagtcg tccgatcgca gcaggtcatg ggcgagccga 2126281 tggtcgacgg tcaggtagtt ggcgcggttg cgcaccaggg cgccgtggga ccggacttcg 2126341 tcgtaaaagg gcaccggatc ggtggcgacg gccggatccg cgatcagccg ggcctgcaag 2126401 tcgccacgcc gaatcccgat tgccgcaatg ccgcggatca ccccgtgcat cgccaaccag 2126461 tgcagcttgt ccttcaccgc gcctccgtcg atcgagtggc ttttcttcaa gactagaacc 2126521 cgcaattcaa cattcggcga ggatgttgaa gtctgttgac accaccgtgt tgggtttttt 2126581 gctgctgatg ccgtaggcac tgccggcaac tgtgtatgtg ttgcgggcgc ggtcggcgcg 2126641 ggcgttgccc accccgccgt gccagaagct gccgttgaac ccgtcgacgt tacggatctt 2126701 cacccactgc gggattaccc tgtcaccgct gagcaatacc acggcttgga cggtgctgtc 2126761 gtggttgcgg atgtcgatgg tccggtagga gtgctcctgg ctgcatgttg cgggacgtgt 2126821 cgtgtgagtg acgccgtcaa tagtcaggcg tgcggctttt cggggtaccg tctgagcttg 2126881 cccgcacgcg gagagaccag ccgcgacgac cattgcaact ccggtcactg tgaccaaccg 2126941 attgcacacc agccacctcc attcgggcct gagcattgtg ctcgggacat tacttccgtt 2127001 ttggctccaa cgtggccagg gacttggcaa tgtgacgtcg gacgaactcc ggactgacgc 2127061 ccttgagccg atcaatccag cgaatgcttc ggggcacata ccaatgcaac cgtgtggggt 2127121 gctggtaggc ccgccaggct gcctcggcga cgctggacga gggcatcagc cggaacatgc 2127181 ccttcttggg cgcggcagcg cggatctgct ccgcggagat cgtgtagggg ccctcgtcgg 2127241 aatgctggcg cgtcgaggtg aggatagcgg tgtcgatcag accgggcagc acgtcggcga 2127301 cgcgaacccc atgacgctgc cactcaacgc tcaacgcctc ggtcaacccc ttgacggcgt 2127361 gtttggtcgc cgagtagacc gcgatacgcg gcatgccata ggtgcccgag gacgacgacg 2127421 tcgagaacat cagacttccc ggtgctttct tgaggtaagg cagtgcggcg taggcgccag 2127481 tgagcaccgc cttgaagttc acgtcgacga cgcgcacggc ggcctcgtac ggcacgtcct 2127541 cgaaccaacc gccttcgccg atgccggcgt tgttccacat catgtcgaga ccgccgccga 2127601 cattgccggc gcagaaatca gcgagcgcac cctcaagggc cgccttgtcc gtaacgtcga 2127661 cggcgcgggc ccacagccgt tcggcaccaa gctgtacgcg cagggcagcc agcccatcct 2127721 cattgcggtc tatcgcacct actcgccagc cgttggcgtg gaaaagcgtt gcaccctcgc 2127781 ggcccattcc actgccggcg ccggtgatga atatcgcttt catgcggaat ccggaatagc 2127841 cgaaccgccc tcagcctgct tcaaccagat ctttgatgcg ctgcaacgtc ttggtcatgt 2127901 ctcggatgtt gcggcgctga cgcagccagc ccccgaacac ccggtagtac acggtggtca 2127961 acacggacgg ggggagccga aacgactcag tgacctcggt gccgtcggcg gtgggcgtca 2128021 aacgataatg ccaattgttc accggtctgt cgccgagcag cacagcaaac ccgaactcac 2128081 ggcccggttc gcatgccgtc acctcgcata ccgtccagta gaccggcccg atcccgttgc 2128141 gccggacatg cccgcggaat cgagcgccaa gcgcggggcc ggtggcaccg tcaagccact 2128201 cggcctcgaa ggtttccggc gagaaccggc cggtattgcg gacatccgcg atcaatgtcc 2128261 agatcttgtc cggcggcgct gccatgtgaa ctgtggccga accttccatg acctgatcca 2128321 aacacatacg tcgacctggt catagaccgc acacgccgcc aaccgtcagc gcggaatact 2128381 tgcctgaatg cctgcccaaa tgatctcgtt gatgatttgc ttgatgccct gcgcgggttt 2128441 cgaccacagt gcgatcggaa ggccagaggc ggcgccgcac gtcggccacg cgtccaatcc 2128501 ctgttcggcg agaacccgat tggcaactgc gatttgttgt tcccgagagg cagctgctgg 2128561 gttgccgaca ccgccgaatg cggcccaggt ggccggcttg aactgcagtc cgccgtattt 2128621 gccgtttccg gtgttggccg cccagttgcc cccggattcg cactgcgcga cggcgtccca 2128681 gttcgggctg ggaccggcgt gggcaacggc ggtggagagc gacatggatg ccgtgacgag 2128741 tcctgcggcc atggcggact tgatgagcgg cttggcgatt cttgtcatgc tcgacatatc 2128801 gccggaagtg gccgaagcgt taccgattag agagagtggt gagatcgggt gtctattgca 2128861 ccgcgaccgg ccgtggtcgg ccggcaaagg atgcacaacc ggattgatca ggccggcggt 2128921 agggcctggc aatacgactg tgttgctgtc gtcagggccc gttgatagag gctatcgagg 2128981 tggcgggacc gcactatgtc gcgtttggcg cggtcgagtt gggcggcgca ggacggcgcg 2129041 gacagcaaac tccagtgact ccaaatctgc gacagcatcc gattattcag ggagtcgatc 2129101 gccgatcgcg atgccgatag atccggcggc tccgggggcg cgctggccgg gttgagcttc 2129161 cagtccgaga accggctgta ctcgattgcc tcggtggcgc gaatctggtc gtcgaagacg 2129221 cgggtgacgt agtcggggtc gatgtgctgc gagcgggcat cttcgcccaa ctttgcgagt 2129281 tgctgttcga ctcggccgga atcctcaatg ggcagctgag cacgccactt gaaggctgcc 2129341 accgggtcgg cgacctccaa ccgctcagcg gcggcgtcga ccaactcggc taactggctg 2129401 gtgccgtcgg ctcgcgccag cggggggcct agtggtgcaa tcagcgacaa caggatgccg 2129461 atcgagacgg cggtcgcgag gtatatctca cgtggacggg taagcaaccc ttcggttgat 2129521 cccgtcagcc ggcgcctaac gaactctgca ggtcaccctt catggcgttg agctgagcgc 2129581 cccagtactc ccagctgtgc gtgccgttgg gcgggaagtt gaacacggcg ttgtgcccgc 2129641 ccgcggcgtt gtacgcatcc tggaacttca ggttgctgct acgaacgaag ttctccaaga 2129701 actcggcggg tatgttggca ccgcccaact cgttcggggt gccgttcccg caataaaccc 2129761 atagccgggt gttgtttgcg accagcttgg ggatctgctg cgtagggtcg ttgcgctccc 2129821 atgccgggtc actcgaggga ccccacatgt ctgcggcctt gtaaccgccg gcgtcaccca 2129881 tcgcgaggcc gatcaggcta ggccccatcc cctgagaggg gtccagcagg gccgacagcg 2129941 agccggcgta gatgaactgc tgggggtggt aggcggccaa gatcattgcc gacgagccgg 2130001 ccatcgacaa gccgattgca gcgctgccgg tgggcttcac ggccctgttg gcggacaacc 2130061 attgcggcag ctcgctggtc aggaaggttt cccacttgta agtctggcag ccagccttac 2130121 cgcaggccgg gctgtaccag tcgctgtaga agctggactg cccgccgacc ggcatgacta 2130181 tcgacagtcc cgactggtag taccactcga acgccggggt gttgatatcc cagccgttgt 2130241 agtcgtcttg ggcgcgcagg ccgtcgagca gataaaccgc aggtgagttg ttcccaccgc 2130301 tctggaactg aaccttgatg tcgcggccca tcgacggcga cggcacctgc aggtactcga 2130361 ccggcagccc cggccgggag aacgcgcccg cggttgccgc tccgccggca agccccacca 2130421 ggcccggaag gactacagcc gctgccgtgc cgatcatcaa tcggcgtccc caagctcgaa 2130481 tctttcggct cacgtctgtc atacctgtgc ccctttgtcc tgtatgtcgt cgtgtgctcg 2130541 ggccagaaca taccgtgtgt ggaggccaaa tgtcgattcg ggcgcaaagt cgtctcattt 2130601 ccgtatcggt taccgccgcg gacagagcaa gtgtgcttag ggggctcaca aacggtatgg 2130661 cggtatggat ctatcgcgga tttctcagaa tcgcggcccg gggctaccgg ctgtgctccc 2130721 ccagggaggc cgaacttgcg ttcaccgcgt aggctcgctc gaagcaagcc gacgaagacc 2130781 acgctatccc ggtctgttcc ggcgtccgcg taacaccgca ctggggtttg tggcgtgcga 2130841 tggtgcgggc tgagggcatc ggaggttccg ggaacgattg aggtgcgaga atttggacac 2130901 ggtacttggg ctctcgataa cgcctaccac cctggggtgg gtcctcgctg aaggacacgg 2130961 cgcagacggc gccatcttgg accgcaacga attggagcta catagcggtc gtaacgcgca 2131021 ggccatacat accgcagagc agctggcggc ggaagttctg ctcgcccatg aagtggccgc 2131081 tgcaggcgat catcggttgc gcgtcatcgg agtgacctgg aacgccgaag cttcggctca 2131141 ggcggcgctg ctggtagagt cgctgaccgg tgcaggtttc gacaatgtgg tgccggttcg 2131201 gcggctacgt gccatcgaga cactggcgca ggctatcgca cccgttatcg gctacgagca 2131261 aatcgcggta tgcgttcttg agcatgagtc ggcgaccgtc gtcatggtcg acacccacga 2131321 cggaaagacg cagatcgccg tcaagcatgt gtgccgcgga ttatcaggac tgacctcctg 2131381 gctgaccggc atgtttggtc gcgatgcctg gcgcccggcc ggcgtggtcg tggtcggctc 2131441 ggatagcgag gtcagcgaat tctcgtggca gctcgaaagg gtcctgccgg tgccggtctt 2131501 tgcgcaaacg atggcgcagg ttacggtcgc gcggggtgcg gccctggcgg cggcccagag 2131561 caccgagttc accgatgcgc agctagtggc cgacagcgtc agccaaccaa cggtcgcgcc 2131621 caggcgatcc cggcactacg ccggggcggc ggcagcgttg gccgccgcgg ccgtgacctt 2131681 cgtggcttcg ctgtccctag cggtgggcat ccagctggct ccgcacaacg ataccgggac 2131741 ggcgaagcac ggagcgcaca agccgacgcc acgtatcgca aaggccgtgg cgccggcggt 2131801 gccgcctccg ccgacggtca cgccaccagt ccctgctcgg gcaccccggc cggctgcgca 2131861 gcacgaacca cccgctcgcg tcacctccgg cgaagcgctc acggagccga acccgcctga 2131921 ggagcaaccg aatgcttctg cgccgcaaca ggatcggaat gacagccagc cgatcactcg 2131981 agtgctagag cacatacccg gcgcttacgg tgactcggca cccccagctg agtagtcgga 2132041 ggccgccgta gccggttgcg aaacctgttc gcgcggaccc atgtcgaggc gaagcggtgg 2132101 gtactcgtcg cgcatcagcg tggtgtatgc gcggacccgc aacgcccacc ggtttactcc 2132161 gatcaccagg ttgtatagcc ctatggggta ccggccggtg aacaggagcg ccaccacggc 2132221 gacgagcagc aggatcacca gtagcgaagg ccacattatg ccgacgcggt cgtgggggtc 2132281 gatcaggaag actcgccaac cgctggagag gaagaccgcc aggatcaggt agtgggggat 2132341 agcaagtagc caccacttaa tcagcaccag gccgcggctc aaccgctccg gatagtcaac 2132401 ctccaagtca gccggatact ccgcctttgt ctgcaggctg aagggcgggt accggtcggt 2132461 tcccagcgcc gacagcgcat agaaggcaac ccgccagcgc caccgcatga cgccgacatt 2132521 gaagtcgaac agcgtccggg gatatctacc cgtgaacagg atggcaaaga acgcgatcac 2132581 ggtgaccacc acggcggcaa cgtgcaagaa gaacaagaca atgtagtgcg ggatggccaa 2132641 aaaccacttg actagccact gccaacgtga caacgcagga tcgaggtcac cccggacccg 2132701 gactggatag gcgtcaggtt gcatgatcga cggctccttt acatgcgcgt cggctcgatc 2132761 cacgagccag gcccattgtc tctcatctgc cgcgcatggg cgaagccatc gtcgtgcgct 2132821 acggacaccg gatcgacgtg cagtaatagc cttgggctgt aggcagcttt ccgggcgatg 2132881 acggcggcac tggtaatcca ttgtcggcca acaatttact gagaggggtc ggtacagatt 2132941 gccagccgtg gctatccagg tacgtggcaa cgtcgttgcg ctcgccgtcg ctgtatagca 2133001 ggttgggtga tttctatgtc gaacccgtgg acgctccagc gctgcgaagc ggtgtcaagg 2133061 aggtcggatg ccggtccgcc atcgtcatct accgactccc cggcgcgctg agcgcggtga 2133121 tgttgtccag caagcgatcc tgagcgtcgg gcggcaggaa tgccagcagg ccctcggcga 2133181 gccaggcact gggttggttg gggtcaaagc ccgcttggcg caacggggtc ggccagtcac 2133241 gtctgaggtc gacgggaacc acgcgaggtc agctgttgca gtggcatcca gatcagcaag 2133301 tgtcgtcatc ggtatgcgcg cgcgtcaaga cctgaggcca ggatgacagc ctgccgaatc 2133361 cccgcagacg tcgcgtcacg gaaaaacgca tcgaagtacc gcgtccgggc ggccaacagg 2133421 tcggccaatc gctgcagtcc ccacgtgccg tcggggtcgt cgacgtcggt cgccttgatg 2133481 tttccggcgg cccagcgagt gaagaagtca atccccacgg ctctcaccag tggttcggcg 2133541 aacggatcat cgatcagcgg gttatcggcc ctggtggcca ccgcgcgggc ggcagccacc 2133601 atcgtcgccg ttgctccgac gctagttgcc aggtcccacg cgtcgttgtt ggtgcgcggc 2133661 atcgggatcc tttcggctcg gccagcgata tacagccttc gaagtccacc gcttgtggga 2133721 tcaatcgtcc tttgcccgaa ccgcggtgaa tgccacgctc acttcgtcgg cgacccgtat 2133781 tgaacccatc aacagcgagt agggtttgac gccgtagttg gactggcgaa ccgtggtgtc 2133841 ggcagagatg cgccacgcag caccaagatc ctctgtgtgc aagtcgatga cgtgttctcg 2133901 cgactttccc cggatgtgca gtttcccggt caggcggtac ccattcccgg tctgggcaat 2133961 ggcttccgtg gtaaagcgaa tatgggggaa gcggctggcg ttgagcgttt tcagcgcgtt 2134021 cgcccgcacc agagctttct caggctcgga cagccccttc acgccaccct caccgcgcat 2134081 cacctcgaag gaatccacct cagccacaag ctcgccggcg acgggatcgg tgccggacca 2134141 gttcaccagg gcctgccacc gtgtcatcgc gatggtcagg cgatgaccca agcgcgcggc 2134201 tctgccaacg actccggtgc gaagtaccag ctcgccgtcg gaagcatcaa gagtccacac 2134261 cgcgtcgctc acgccacgac tgtattcaga cgacctgcct gcccgcccct cccgccgcgt 2134321 cttgtgggcc acgacacaat cgttatgctt ggtgaggctc gccggtgccg ttggaggggt 2134381 gcaacatgat tcgcgaactg gtcaccaccg ctgcgatcac gggtgccgcg atcggtgggg 2134441 cgccagtcgc gggcgcagac ccgcagcgtt atgacggcga tgtgccgggg atgaactatg 2134501 acgcttcgct gggcgcccca tgctccagct gggagcgctt catttttgga cgaggcccct 2134561 ccggtcaggc cgaagcctgt cattttccgc ctcctaacca gttcccgccg gccgaaaccg 2134621 gctactgggt gatctcctac ccgctatacg gcgtccagca ggtcggtgcg ccgtgtccga 2134681 agccgcaggc ggccgcgcag tctccggatg ggttgccgat gctgtgtctg ggagcccgtg 2134741 gatggcagcc gggatggttt accggggccg ggttcttccc tccggagcca taaccggtgg 2134801 gcgtttctca tgatcatgtg cgaaggccgg cccaccgaat caccgatccc acggtggctg 2134861 cgcttcgtgc ttacgtctga ccgtgccggc tcggcatggt atatcggggc aggcttcttc 2134921 ttcgcgccag tgctggcggt gctttcgcca tggccgacca tcaccgcggt gctgtggtgg 2134981 atcatcggac tggcgggact atggctcgga ctgctcggaa tcgcgatggc agtcggactg 2135041 gcccgggtgt tgcgttccgg cgccgaaata ccggaagcct actggcgcac gctggtcgac 2135101 taccgatccg ccaacgaata ggagactccg atgagcttca atcccaaaga tgcggtcgac 2135161 gctgtccggg acattgcggc caatgccgtc gagaaggcct cggacatcgt ggaaaacgcc 2135221 ggccacatca tccgcggcga catcgctggc ggggccagcg gcatcgtcaa ggactccatc 2135281 gacatcgcca cccacgcggt cgacagaacg aaagaagtgt tcaccggcaa gacggacgac 2135341 gaaggttagt cgagactagt cggcgcgcgc ttgtcgtccg ttgtcaaacg gacgcggcag 2135401 cattgagtgc gtccaaccgg gcggtcgcct cgaggtactc ctgcacccag cgttcgataa 2135461 cggtagccgt cttttccacc ttggtgaact gcccaacaac ctgccccacc gggttgaacg 2135521 cgacgtcgac ggtctcgttc gggtatttat gtgtggcttt gacggccatg ccggagacca 2135581 tgtattgcaa cggcataccg agcggcttcg ggctctccgg ttgctcccag gcctcagtcc 2135641 agtcgttgcg cagcatccgg gccggcttac ccgtgaagga acgactgcgc acggtgtcgc 2135701 ggctggtcgc cttgacgtat gcggcctgtt gaaccgcggt gtttgcggct tcctcgacca 2135761 tcagccactg cgaaccggtc catgcccctt gggtccccag cgccaacgct gcagcgatct 2135821 gctgaccgct gccgatgcca cccgccgcca acaccggaac cggcgctacc tccttgacga 2135881 cctgaggcca caacacaatg gagcccacct cgccacagtg cccgccggcc tcgccgccct 2135941 gggcgatgat gatgtcgacg cccgcatcgg cgtgcttgcg ggcctgcgag ggtgagccgc 2136001 acaatgcggc caccttgcga cccgagtcgt ggatgtgctt gatcatgtcc gctggggggg 2136061 tgccaagcgc gttggcgacc atcgtcatct tggggtgctt cagcgccgcg tcgacctgtg 2136121 gggtggccgt cgcctcggtc caaccgagca gctgcagact gtcctcgtcg gcgtcctcga 2136181 ccgggacacc atgatcggcg aggatcttgc gggcgaagtc cagatgctcc tgcgggacca 2136241 tcgaccgcag cgtcttggcg agctcatccg ccgacagctg ggagtccatg ccctcgtact 2136301 tgttcgggat cacgatgtcg accccgtagg ggtggtcgcc gatgtgttca tcgatccagt 2136361 tgagctcgat ctccagctgc tccggcgtga acccaactgc tccgagcaca ccaaaaccac 2136421 cagctttgct gacggcgacc accacatcgc ggcagtgagt gaaggcaaaa ataggaaact 2136481 cgataccgag ctcgtcgcaa atggcagtgt gcatgcctgc tcctggaatg ctagcggacg 2136541 caaatagaac tgaaacgtgt tctagtttag tacccgtctt ggtaaggtgg ccaacagccc 2136601 aggttccggt cgggtttcgg cgcgcacccc ggcgaagctg acgaggcggt ctaaggtcac 2136661 cttcacccgc gcatggccgg ccagcaacaa cgacggctgt cccaccgagc agaagtactg 2136721 ggcgatggtg tgcaccgcgg tcggctacca ccgcgacgac cccgccgcag aactgctgtt 2136781 acgcaacgaa ggcttggcag ctgcagtcca aactggccac ctacgtctac ccgccacaga 2136841 aactagtcgc caaggtccgt gcgggcgcca aagtgtccga caaccacgac caggcgacca 2136901 ctctgttcca ccacgcgatc gatcacccaa ccgtgaccgt gcagcagacc tactccctga 2136961 tcaaccctca atcggccccg gggcgatgga ccttgatccg ctggggcccc gccggtagcc 2137021 tagtgctgcg aattacgcta tgccgagtct cggaattgcc ggcccgccgt tcaccacgtt 2137081 caaacgcccg agaccggtgc caggcaggta cgcgaacctc atgggtctca attcgttctg 2137141 ccacaaagaa agtgagtaag ccagcatgcg tgcggtagtc atcgacgggg ccggcagcgt 2137201 cagagtcaac acccagcccg acccggcact gcccgggcct gacggagtgg ttgtcgccgt 2137261 gaccgccgcc ggcatctgcg gatccgatct gcatttctac gaaggcgaat atccgttcac 2137321 cgagccggtg gccctcggtc acgaggcggt aggcaccatc gtcgaggccg ggccacaggt 2137381 gcgcaccgtc ggagttggcg acctggtcat ggtgtcttca gtggccggct gcggcgtctg 2137441 cccgggatgc gaacccatga tccagtcatg tgcttctccg gcccgatgat cttcggcgcc 2137501 ggcgtgcttg gcggcgcaca ggccgatctg ctggcggtgc cggccgccga tttccaggtg 2137561 ctcaagatcc ccgaaggtat caccaccgag caggcactgc tgctcacgga caacctcgcc 2137621 accggttggg cggcagccca acgagccgat atttcattcg gctccgccgt ggcggtcatc 2137681 ggcctgggag ccgtcggcct ctgcgcgctg cgcagcgcct tcatacacgg tgccgcaacg 2137741 gttttcgctg tcgaccgagt aaagggacgc ttgcaacgcg cggccacctg gggtgctacg 2137801 ccgataccgt caccggcggc cgagacgatt ctggccgcga cgcggggtcg cggcgcagac 2137861 tcggtgattg acgccgtcgg caccgacgcc tcgatgagcg acgcgctcaa tgcggtgcgc 2137921 cctggcggca ccgtctcggt tgtcggcgtg cacgatcttc agccgtttcc cctgcccgca 2137981 ctgacgtgcc tgttgcgaag catcacgctg cgaatgacca tggcaccggt acaacgaacc 2138041 tggccggaac tgatcccgtt gctgcagtcg ggccgactcg atgtcgatgg catcttcact 2138101 accaccctgc cgttggacga agcggccaag ggctatgcaa ccgcgagggc gcgctcgggt 2138161 gaggagctta aggttctgct tacgccctga cagccgtgat gtactgggag cgcatgaaac 2138221 tgtcgatctt tacgtccacg tccggcggtg tcagtccgta gccgacctgc agctcgaggg 2138281 tgctgcggac ggggtcgacg gcccatccat gctcaactag ccactccacg ggatcggtct 2138341 tgtcgtcgta ggtgagcgcg gagaaattca cgtcaccaga catattgacc cccgggtgtg 2138401 cggtttccag cgcggcgagc tgctcgtgat ccaaccggga ccctaaggcg cccaaggcaa 2138461 ctcggctgcc aggcgcacac aactcatcga tccgggcgaa cagagcatat tgcgcatcgc 2138521 cggtcaggta gggcagtagt ccctcgaccg accaggcgct gggtcgttgc ggatcgaacc 2138581 cggccgctgt cagcggcgtg ggccagtccg tacgcagatc tgctggcacc gccacccggt 2138641 gagctttggg tacagcaccc cgctcactta gcacccgtgc tttgaattcc aggaccttcg 2138701 gcacatcgat ctcgaaaacc gttgtcccgg gctgccagtc aaggcgataa gcgcggcagt 2138761 ccagaccggc ggcgacgatc accgcctgtc gtatgccagc ctcatcagcg cagttgaaga 2138821 agtcgtcgaa aaaccgggtt tgcacgccgt agagccgagg gaaagcggtg ccgtcctccg 2138881 acgttctcgg gtttgctaac agaccctcca gatacgggtc ggccgaagcg gtgatgaaat 2138941 gcttcgcgta ttcgtcttgg accagcggtt tagggcccgt ggtgtgcagt gcacgccaac 2139001 ccgcaaccag tagcgcggtg tagcccacgt tgctgacaat gtcccagtgg tcgtcatcgg 2139061 aacgaagcga gccatactca ggtgtagtca tctcatcagc cttccagcat tacggtcacc 2139121 ggaccgtcgt tgaccagttc gacctgcatg tgggcaccga acacgccggc ttccacgtgc 2139181 gctcccaact ggcgcagcgc tgccgcgaac gctgctatca ggggctgcgc caccgcacct 2139241 ggcgccgcgg cgttccagga cggtcgccga cccttcgcgg tatctgcgta gagggtgaac 2139301 tggctgatta ccaggatcgg tgcgtgcatg tcggaggcgg atttctcgtc ggcgagaacc 2139361 cgcaaattcc agagcttttc ggcgagacgg cgcgccttgt cgagatcgtc gccgtgggtg 2139421 acaccgacga acgcgaccag gccctgcccg tccggccgga tagcgccgac cacccgacca 2139481 tcgaccctca ccgcagccga tgagacccgt tgcaccagaa cccgcacgag cctcgatgct 2139541 gccaggccgg ctatgcagtc gctggggctg ggtaggctca ttgtgtgtct gtgctggtcg 2139601 cgttttccgt caccccgctg ggcgtggggg agggggtcgg cgagatcgtc accgaagcga 2139661 ttcgcgtggt ccgtgattcc ggcctgccga accagacaga tgccatgttc accgtgatcg 2139721 aaggcgatac ctgggcggaa gtgatggccg tcgtgcagcg cgcggtggag gccgtggccg 2139781 ctcgggcacc gcgagtcagc gcggtgatca aggtggactg gcgtcccggg gtcaccgacg 2139841 cgatgaccca gaaggtcgct accgtcgagc ggtatcttct ccggcctgaa tagcagcgct 2139901 aaacgcccgc tcggccgcat ccccatggac cgcaaatacc accctttgca gcgaccccgg 2139961 ccggtgccga cggacggcgc cgaccatcag ccgcgcagcg tcgtcgagcg gaaagccgcc 2140021 cacgcccgtg ccgaaagcca ccagcgccag cgagcggcaa ccgagctcgt cggctttccg 2140081 cagggtagca gcggtggctg cggtgatgat ctcgcccgag gtcggacctc ctagctccat 2140141 cgtcgccgcg tggatcacgt agcgcgccgg catgtcaccg gccgtggtct cgaccgcttc 2140201 cccaagccca atcggcgcct tctcggtgga ctcgcgctgc agctcggggc cgccggcgcg 2140261 ggcgatggcc gcagcgacac caccggcatg ccgcagtcgg gtgttcgccg cattggtgat 2140321 ggcgtcgagc tcgagcttgg tcacgtcggc ctgatgtacc tccaactcga tcatcgacac 2140381 attgtccccc ctgcaagtac tcggcggccg cggtgatgca ccccttgttg tgttggaccg 2140441 tcgccaccat cgcccacaca atcgaaccct cgcccggcgc cactgcggcc catcgagact 2140501 ccggcccatc gagcccagat tcggggtagc ttgaggtgaa cgaggacaat caagcggctg 2140561 gcaaggacac agaccgatgg ctcgtgatcc agttgtaccg agggtgcgaa cagcatgagt 2140621 ggcgacgacg ccgggccggg cgaggtcagc catgcccgcg gcgtcggtgg gccgggcgga 2140681 gccggaggcg ccggtggccg gggtggtgcc ggcggtcgcg gcggggcggg cggtagaggc 2140741 ggcgacggcg ggcaaggcgg tcgcggcggg gcgggcggta gaggcggaga tggcggcata 2140801 ggcggggcag cgggccccgg cggtcaaccc ggccagggcg gggtgggcgg cgcacccggc 2140861 cccggtggaa cccccggcga accaggtcag cccggcaaac caggacaacc ggggcaaccc 2140921 ggcagcccgg gacattagcg cgtgcgggtg gcgtcgtcgc gcatgagcac gcatagccgc 2140981 catctgcccg gtacgccctt gagttcctgc tcaccacgct cggcgaaccg gtgccgtgat 2141041 ccggcgacga tgtctcgcac ggtcgaggac accagcacct cactgggtcc ggccagcgcg 2141101 cagacgcgcg caccgatatg cacggccacg ccggcgacgt cggtaccgtg cgaggcatcg 2141161 cgcacctcga cctcgcccgc atgaataccg atccggacct caatacccag cgcggcgacc 2141221 gcgtcgacga tgtcgtccgc gcacgcgatc gcggcactcg gactggtgaa cgtcgcgacg 2141281 aaaccgtcac cggccgtgtt cacttcgcga ccgccgaacc gctggatttc gtggcacacg 2141341 atggtgtcgt ggttgtccaa caggtcgcgc catcggtcgt cgccgagcgc ggcggcgtgc 2141401 tgggtcgagc cgacgatgtc ggtaaacatg atggtggcaa gcatgcgctc ggcgtcagcg 2141461 ccgccgcgca cgccggtgat gaattcctcg atttcatcga gcatcggccc ggtgtcgcca 2141521 acccagtaca gggtatcggt gccgggtagt tcgaccaagc gggatccagc gatgtgctcg 2141581 gcgaggtagc gaccatgtcc caccgggatg tacgtcgatc cgacacggtg caagatcagt 2141641 gttggagcct cgatgtgtcc caagacatct cgtacgtcgg cctcggctat gacctttgaa 2141701 acggcacggg ccatgctcgg cggtccggca cggttgccgg cgagatccca ccaggctcga 2141761 aacacgtcat ctccggccac ggtaggagcc acgatgctca gcacgtcgaa gccctgctcg 2141821 acggcatccg gttccagcgc caccgtcagg aacgggtcag ctcgacgaac ctgggcgcct 2141881 accgggtagt cgggcgccca tagtgggcgc gccgagccgt tgacgacgat caggctgcgc 2141941 acccgctcgg ggtagtcggc ggcgagaaca agtccgttca tggcgtggaa actgggcgcg 2142001 aaaattgtcg cctgctcgca tccgaccgcg tccatcaccg cgatcgcgtc ctgggcccag 2142061 aacttcggcc ccagcgtggt tatcgcggcg agccgtgacg acaggccgac cccacgatgg 2142121 tcgaggcgga tcaccctgct gaatgacgca agacggcgat ggaaacggta cagcgatggc 2142181 tcgtcgtcga tcgagtcgat cggcacgaac ggccccggca acaccagcag atccgtcgga 2142241 ccgtcaccca gcacctggta ggcgatatcc atgtcgccgc attttgcgta gcgggtcctg 2142301 tgaatgtggg gagcctgcgc cacggtccta cgttagttca tgcgtaggct catggcggtg 2142361 agcgcacgtg cgggcatcgt gatcaccgga accgaggtcc tgaccgggcg ggtccaagac 2142421 cgcaacggcc cctggatcgc cgatcggctc ctggagctcg gggtcgagtt ggcacacatc 2142481 acgatctgcg gcgaccgtcc cgccgacatc gaggcacagc tgcgattcat ggctgagcag 2142541 ggtgtggacc tgatcgtcac cagcggcggc ctggggccga ccgccgacga tatgaccgtc 2142601 gaggtggtgg cgcgctattg cgggcgcgag ctggtgctgg acgacgagct ggagaacagg 2142661 atcgccaaca tcctcaagaa gctgatgggg cgaaatcccg ctattgaacc cgccaacttc 2142721 gactccatac gcgccgccaa ccgcaaacag gccatgattc cggccggatc gcaagtgatc 2142781 gatccggtgg gcaccgcccc cggtctggtt gtgccgggac ggccagcggt gatggtgctt 2142841 cccgggccac cgcgcgagct gcagccgata tggagcaagg ccatccagac ggctccggta 2142901 caggatgcga ttgccggccg gacgacctac cgacaggaga ccatccggat cttcggcctg 2142961 ccggagtctt ctctggccga cacactgcgt gacgccgagg cagccatccc gggttttgac 2143021 ttagtcgaga tcaccacctg cctgcggcgc ggcgagattg aaatggtcac tcgctttgaa 2143081 ccgaacgccg cgcaagtgta cacgcaattg gcacggttat tgcgcgaccg gcacggccac 2143141 caggtctatt cggaagacgg tgcgtccgtg gacgagctgg tcgcaaaatt gctaactggc 2143201 cgccggatag cgaccgccga atcctgcacc gcagggttgc tggcggcacg gctcaccgac 2143261 cggcccgggt cgtccaagta cgtggcgggc gcagtggtgg cctactctaa cgaggcgaag 2143321 gcacagcttc tcggtgtgga tccggcgctg atcgaggccc acggggcggt ttccgagccg 2143381 gtcgcccagg caatggcagc gggggcgctg caaggcttcg gcgccgacac cgccaccgcg 2143441 atcaccggaa ttgcgggtcc gagtggggga acgccggaaa agcctgtggg aacagtgtgc 2143501 ttcaccgtcc tgctggacga tggccgaaca accacccgaa ccgtgcggct gcccgggaac 2143561 cggtcagaca ttagggagcg ctcgacgact gtggcgatgc acctgctgcg gcgcaccctg 2143621 agcggtatcc cgggctcacc ctagcgacgg cgaaatcgac agcagcgcga caaagttcga 2143681 cgagaagaca ccgcgctaat gtcgatttcg atgacgaaca agaaaagcag tttccgtagt 2143741 accaaagcgg attccggtgg catccttgcc aatcgccgtc agcaccgcta cgaccaatag 2143801 cacgggcacg atcgtcgcgg ccaaggcgaa ggggtagcca tgggattcgg ccagacgctc 2143861 ttgaatagga aggttgaacg ccgccagcag attaccgagc tggtaggtta cgccggggta 2143921 gacgccccgg atagcgtctg gcgacatctc ggtcagatgc gcggggatca caccccaggc 2143981 accctgtacg aagacttgca tcaaaaacga acccaggcac aacatcgccg cagtgcgcga 2144041 gtaagcgaac agcggcacga tcggcagtcc cagcgccgca cagaaaacga tggtgtaacg 2144101 gcggctgaac cgctgggaca acgtgccgaa cgccagaccg ccgatgatgg cgccgatgtt 2144161 gtagatcacc actatccacc tggcggtcag gctggacaaa ccggcaccat gatcggtagt 2144221 cgcggtcagg aaggtcgggt agacatcctg ggtgccgtgg ctcatccagt tgaaggcggt 2144281 catcaacagc actaggtaga caaaccggcg cacaattgcg gggttaccca ggacatcgcg 2144341 gattcgggtc ttggtgagcc gcatgcggtc ctgcgcggct tcccagactt cggattcctt 2144401 tacccggtac cggatgatca agctgatcag agccgggatg atgcttaggc cgaacaacca 2144461 ccgccacgac agccctagcc agttcatcac caccagcgct gccacactgg ccagcagata 2144521 gccgaacgcg tagccctcct gcagcagccc ggagaagacg ccacgccgct cggctggaac 2144581 cttctccatg gacagcgcgg cacccagccc ccactctccg cccatgccaa tgccgtagag 2144641 cagtcgcagg atcaccagca cggtgaagtt gggtgcgaat gcgcacagaa atccgatcac 2144701 cgaatagaac gacacgtcga ccatcagcgg gacccgccgg cccacccggt cggcccatag 2144761 cccgaacagc aacgcaccca cggggcgcat ggccagggtg gcggtggtga gaaacgcgac 2144821 gtcggtcttg gtgtggtgga aggtcgttgc gatgtcggca tagaccagca ccacgagaaa 2144881 gtaatcgaac gcatccatcg tccaacccaa gaaagatgcc ataaaagcgt ttcgctggtc 2144941 gccggtcaac cgcggtgctg ccacgtctgc atcgtggcgt accgggcgcg gcaccgcgag 2145001 tccggggaca tggcgaacag cggcggctcg catgtccgtg gcaggatcgg gcaatggtgc 2145061 cttttctgat gcgcgccgca gtgaccggat tcgcattatg ggtggtgact cttttcgtcc 2145121 cgggcatgcg gtttgcgggc ggcgacacaa cgctgcagcg ggtcgccatc atcttcgtcg 2145181 tcgcggtgat cttcggtctg gtcaacgcgt tcatcaagcc catcgtgcag atcttgtcga 2145241 tcccgttgta catcctgact ctcggtcttt tccatgtagt cgttaacgcg tcgatgctgt 2145301 ggcttaccgc gtggatcact gagcacacca cccactgggg actgcagatc gaccacttct 2145361 ggtggaccgc gatctgggcg gcgatcttgt tgtcgatcgt cagctggatc ctgtcgctgt 2145421 tggctcgtga ctttcgacgt gtcactcgcg cacactagag ccacaaattt tggtgggggg 2145481 acatcctagg ttttcggggc atgttccact tatgcttact cacactgctt gccaacctcg 2145541 tccaagacag gcaccctgtc ttcggcgtga tgacgctgac ctcccgccct ccaatacgcc 2145601 ggacggcagc acctaacagc acacgacgac gggactgcaa atgatgcgca ctgtcgcgat 2145661 tggaccaggt gccggtcctt cgagcacacg gccgagttcg caacccagtg acctgcatag 2145721 cggcctacgc gcggttaccg agtgcaccgg ctcagcggtg gtcgttcatg tgggcggcga 2145781 catcgacgcc agtaacgagg tcgcttggca gcgtctggtg agcaagagcg ccgctatcgc 2145841 catcgcgccg ggtccgttcg tcatcgatat tcgggacctc gacttcatgg gatcatgtgc 2145901 atacgctgtg ttggcccagg agtcggtgcg gtgtcgccgg cgcggggtga atatgcggtt 2145961 ggtgagtaac cagccgatcg tggcccgcac cattgccgcg tgcggactgc ggcgactaat 2146021 tccgctgtat gcaatggtcg agaccgcact ggcgccgcct cccagcgcgc attgaccgac 2146081 ccattaaccg accggtgcca cccaacccgc catggtgtcg ggttaaccgc cgccgacaag 2146141 attgaccacc tcccgcgcac aaccccatga cagggtcacg ccgtcacctc cgtggccata 2146201 gttgtggatg cacagcgctc gcccgatcgg ttcagcttcc acccgcacgg acggccgatc 2146261 aggacgcagc ccggtaatcg tctcaatcac tgccgcctcg gcaagccgtg gttgtatgcg 2146321 gcgacaccgt tgcaggatcc gctcggttat ctccggctct ggggtggtgt cccacctgcc 2146381 agggatactg atgccgccgc agactacacg ctgcgggtgg gcaaagtagc agatccattc 2146441 cgagccgccg gtgcgctcga taaacagttg ctctagacct ggattggtga ggacgacgtg 2146501 ctggccgaac cgcggccaga ccgtggcgtc gccggccagt tcccgagcgc ccagaccagc 2146561 acagttgatc actatgggcg ccgcctcagc ggcctcggcc agcgaccgta gcgggcgcgt 2146621 ttcgatttca cagccagtcg ccgccaatcg ctgggtcaga cagtcgaggt actggggcat 2146681 atcgatcatc ggcaaggtgg catgaaaccc agcacggaag cccccgggca cgtcggccgg 2146741 gtcagccggc cgcacgtcgg ggatcagctc caacccgggc ggcatcgcac cggtctcgat 2146801 acgatcgccg acactcagcg ccggcgtcat gcgcacgccg gtggcgggat ccttggccaa 2146861 gtcgcgaaac acgtgcaatg actgttcgat ccacccgcgt accttggcaa cgggttcctt 2146921 cggccgcggc ccccagaccg cacccgccac cgccgatgtc gtttgctgcg gcaatgcggc 2146981 cgcccatacc cgcaccggcc accccgcctc ggccaggcat atggccgacg tcagtccgct 2147041 gacgccggcc ccaatcacga tgacctgttg ctcacctatt gccacagcag gaccgtagcc 2147101 gaagccagcg tcagttaggg ctgaggcact cgccctccag tcggtccgag taagccgttg 2147161 aggatgccga gctgattttg tagttgggcc cccgcttcag gtccaggaac tccggcaggg 2147221 gcagcgcctt cgctgcccgt gttctgccag ggttggcagc cgtgcgtctt gaacgccttg 2147281 tcggtcggct caatcgtcac tacctgtggt ttcttgctga gtgcgttatc gatgagcgcg 2147341 ccatcggggt tacccatccg cttccaatag caggtgccgt cgccgacggg tcccgcggag 2147401 ctgtacgtgc cgggagcgat gtcaatcccc accgcatagg tgccgtcgct atcaattgcc 2147461 gtcttcggtg tcggtgccgg ctccggatcg gcgccggcga ggcccacgga tccggcccag 2147521 cctgcgagga tcaggccggc gacggcaaag gctgcagcag gagatggggc tggcttcaag 2147581 cgcatcacac aatagcctac tggggcctac cggtatccgg aactcactcg gcctggaagc 2147641 aatcactcgt tctcccgccg ccgatgggct tgttcgatcc ccatatgcgc ctgcgagcgc 2147701 acggacggcg cgccaccgac gcagtgtccg gcaatgatgc ggtaaatcgc ggacggcgcc 2147761 aacgcttcca ccgagtcaca gccttgtccg ccagcacacc gcccagaccg catgtatcgg 2147821 aggatgtccg gaagccgttg gccacctccg tgtcgagcaa ccaccgctgt cactgcattg 2147881 ctgtcactaa atcgttgtcc ggcaacacgt ttagagcgct cgcgtcaggc tgacctcctg 2147941 gtggctcgca tcccgagcac cggctgggta ccgcgacctt cgtcgaagtc cgccgcccac 2148001 ggccagcgac cacgccggtc ggcccacacc aactgcaagg ccgtcacctt gtcgccaaag 2148061 atggcgatcg cacaatacaa atgcgcgtcc ggatgtgtaa cctggaccgt ttcgacaaga 2148121 gggccggctg ggagggtggt ctgcataccg ggagtcagca agtcaccgac cagagccctg 2148181 cgagcggcga tgttcaacaa ccgctgccca cgtcgtggcg agaggccagt caccaccagt 2148241 tcgggcaagc cgcgccgggt tagaccaacc gtgtaggcaa atggccgtcg ctcgcactcc 2148301 acgtgctgta ccgcccagcc atgcatgagc attatcccgt acacctcgtc gaggtactcc 2148361 tcggcggtgg cttccgggtg atcgcacatc cagcacattt cggcgccctt tctcctcatc 2148421 cccgtctcgt catccccgtc tcgtcgtgcc tgcgaccacc atgcacgcgg ggtctgacaa 2148481 atcgcgccgg gcaaacacca gcaccccgcg agccggtcag ctcgcggggt gctgcggcgg 2148541 gttgtggttg atcggcgggc agggccgatc aacccgaatc agcgcacgtc gaacctgtcg 2148601 aggttcatca ccttgtccca ggcagcgacg aagtcctgca cgaacttcgg ctgcgcgtca 2148661 tcggcgccat agacctcgac aagcgcccgc aactccgagt tggacccgaa gaccaggtcc 2148721 acgcggctgc cggtccactt caccttgcca ctgccatcct tgccctggta ggtcccgtca 2148781 tctgctggcg agggctccca ggtgataccc atgtcgagca ggttcacgaa gaagtcgttg 2148841 gtcagtgact cggaggcctc ggtgaacacg cccagcggta agcgcttgta gtttgcgccg 2148901 aggacgcgca ggccacctac cagcaccgtc atctcagggg cactgagcgt aagcaggttc 2148961 gccttgtcga gcagcatgta ctcggccggc aacgggttgc cctttccgag gtagtttcgg 2149021 aagccatctg ccttgggctc cagcacggca aaggattcca cgtcggtttg ttcctgcgac 2149081 gcatccgtgc ggcccggggt gaagggcacc gtgatgttgt ggccagccgc ctttgctgct 2149141 ttctctatgg cggcacagcc accgagcacg acgaggtcgg cgaaggacac tttgatgttc 2149201 cccggcgccg cggagttgaa tgactcctgg atctcttcca gggtgcgaat gaccttgcgc 2149261 agatccccgt cggggtcgtt gacctcccac ccgacttgtg gctgcaggcg gatgcgacca 2149321 ccgttggcgc cgccgcgctt gtcgctacca cggaacgacg acgccgccgc ccatgcggtc 2149381 gaaactagct gtgagacagt caatcccgat gccaggatct ggctcttaag gctggcaatc 2149441 tcggcttcgc cgacgaggtc gtggctgacc gcagggaccg gatcctgcca cagcagggtc 2149501 tgcttgggga ccagcggccc aaggtatctc gcaacgggac ccatgtctcg gtggatcagc 2149561 ttgtaccagg ccttggcgaa ctcgtcggcc aattcctcgg ggtgttccag ccagcgacgc 2149621 gtgatccgct catagatcgg atccacccgc agcgagaggt cagtggccag catcgtcggg 2149681 gagcgccctg gcccgccgaa cgggtccggg atggtgccgg caccggcgcc gtccttggcg 2149741 gtgtattgcc aagcgccagc agggctcttc gtcagctccc actcgtagcc gtacaggatc 2149801 tcgaggaaac tgttgtccca tttcgtcggg gtgttcgtcc atacgacctc gatgccgctg 2149861 gtgatcgcgt ccttaccggt tccggtgcca tacgagctct tccagcccaa gcccatctgc 2149921 tccagcggag cagcctcggg ttcggggccg accagatcgg ccgggccggc gccatgggtc 2149981 ttaccgaaag tgtgaccgcc gacgatcagc gccgctgttt cgacgtcgtt catggccatg 2150041 cgccgaaacg tctcgcgaat gtcgaccgcc gcggccatgg ggtccgggtt gccgttcggc 2150101 ccctccgggt tcacgtagat cagccccatc tgcaccgcgg ccagcgggtt ctccagatcc 2150161 cgcttaccgc tgtaacgctc atcgccgagc caagtggctt ccttgcccca atagacctca 2150221 tcgggctccc actggtcgac ccggccgaag ccgaacccga acgtcttgaa gcccatcgat 2150281 tccagcgcgc agttgccggc gaaaacaatc aggtccgccc atgagagctt cttgccgtac 2150341 ttcttcttga ccggccacag cagccggcgc gccttgtcca agctggcgtt gtcgggccag 2150401 ctgttaagcg gcgcgaaccg ctgcatgccg cccccggcgc cgccgcggcc gtcgtggatg 2150461 cggtaggtgc cggcagcgtg ccacgccatc cggataaaca gcggcccgta gtggccgtag 2150521 tcggcgggcc accacggctg cgaggtggtc atcacttcct cgatgtcccg cgtcagggcg 2150581 tcaacgtcga tggtcgcgac ctccgcggca tagtcgaacg ccgcacccat cgggtcagcg 2150641 acggccgggt tttggtgcag taccttcaga ttgagccggt tgggccacca gtcctggttt 2150701 ccgccgccct cgactgggta tttcatatga cccacgacgg gacagccgtt gctagcggct 2150761 ccggtggtgg tttctgtaat gggtgggtgt tgctcgggca cagcattcct tccaggagtt 2150821 ggtgttatcg ggctgtgatc acggatgtga tcgcgaagtg tcggatatcg aacaatcagg 2150881 acatagaccc cagtagatga cctccgcctc gtccaacagg aagccgttat ggtccgaggc 2150941 cgtcagacag ggtgcctcgc caacagcaca gtcgacatcg gcgataaccc cgcaagaccg 2151001 gcagacgatg tgatggtggt tgtcgccgac cctggactcg tagcgcgcga cggagcccga 2151061 gggttggatc tttcgcacca agcccgcggc ggtcagggca tgcagcacgt cgtacacggc 2151121 ttgccgggat acgtcgggca gcgcaaaacg cacggcaccg aaaatcgttt ccgtgtcggc 2151181 gtgtggatgc gcattcactg cttccaggac ggcgacgcgc ggtcgggtca cgcgcaggtc 2151241 ggccgtccgg agctgttcgg cgtagtccgg tatagaggac acactagaca atatgactcc 2151301 cttttctgga atcagtcaag actttggcta gcgtgacagg cgtctgctag gacccgatcg 2151361 ccccggggcc gctggatcgt gggatggcgg gtggatcagc cttcgtatgt tccgatgagc 2151421 cgggcctgca tggtggcggc ctgcgcgatc acccgcgccg cttgtgtccc agccagtccc 2151481 gcgagtggag gcacggcagg aaggtggtag agggtaaacc ggtagtggtg tgtcccggtg 2151541 cccgccggcg ggcagggtcc ggtgtatgcg ggctgaccgc tggagttcgg caggctgatt 2151601 ccgccaccgg gtgtctcacc atcggcggtg ctgccagcac caggggcgat cccgatcacg 2151661 atccaatgga cgtaaggttc gcgaggtgcg tccggatcat cgacaacgag tgcgccgcca 2151721 aacggcgccg accaggtcaa cggaggcgcg atattggctc ctttgcaggt gtactgttcc 2151781 gggatcggcg caccgtcggc gaatgccgga ctgctgattg tcagtacatc gccggtaggc 2151841 gtttcgggca tactccgacc gagcgctgct gctttcggcg ccagcggcgc cgcctttcga 2151901 ctgtcaccgt tgccaccgta ggcaactagc gccacgggga gcgccagccc caagatggcc 2151961 agtgcgaacc ggtgaaatgc gtgcgccact gtcgattcca tattgatcat tgtcgccagg 2152021 cgcaattgga gaagccaggg tttcgaccac ctcgccaggg atgccgcggc gtcagccttc 2152081 gaatgtgccg acgagccggg cctgtccgct ggcggcctgt gctatcgcct gtgccgcttg 2152141 gactcccgtg gctcccggtg gcagctggag cgcgacagga aggtggtaga gggtaaaccg 2152201 gtagtggtgt gtcccggtgc ccgccggcgg gcatggaccg aagtatcctt gccgaccacc 2152261 agaattcggc acgctgtgcc caccagcagg agtctgacca tccgccgtgc tgccagagcc 2152321 aggggcgatt ccggtcacga tccagtgcac gtacagtccg ccgaccgcgt cggggtcatc 2152381 gacgacgagt gccagttcgg ctgcgcccgc gggcgacgac cacgtcaacg gtggcgccac 2152441 gttggccccc ttgcagctga attgcaccgg gatcggggcg ccgtcggcga acatgggact 2152501 ggcgatcgtc agtggctcgg cggccggcgc cggcgttgtt gcgtcgacgg tcgtcgcttt 2152561 cggcacgtat ggcggtgtct ctcgactgtc accgcccccg cccccgcagc cacccagcgc 2152621 cactacgagc gccagccccg cggtggctaa tggggttcgg tgaagtgtgc tcgtcattgg 2152681 agattccata gcacattgtt actaactggg attcgagagt acagctgttt tgcggccgcg 2152741 cttaccagac agccgggccc cgggccaccc atcgcctcac ggtaccagca ccaccttgcc 2152801 gacgttctcc cgtgcggcca gaatccgatg tgcttcagga gcttcggcga acggcacgat 2152861 tgcatgaacg atcggcagga tcgttccgtc gttgagcgcc ttggtcagcg gcgcgatcca 2152921 gggttcaagg gtgcggcgat cgtcccacaa ccgcagcatg ttaagaccga tcacggtttt 2152981 cgactcctcg agttgtttca tcaggttaaa gccgcgcagc attgacaacg cgtggggcgc 2153041 caccctgcgc atcgatcgtt tctcgccgtg ctgcatattc gaaatcccgt agccaaccag 2153101 ccttccaccc gggcgcagca gagtgtagga ccgccgcagc gaggtgccgc cgagcgcgtc 2153161 aagcacgacg tcatacgggc ccaatccctg ccaccagccg tcccggcggt agtcgatcgc 2153221 gcggtccaca ccgaactcgg ccagcttctg atgtttttgg ggtgatgcgg tgccgtgcac 2153281 ttcggccttg gctgctttcg cgaattggac cgccgcgatg ccgactccac cggccgcggc 2153341 gtgaatcagc acccgctcac cggcgcgcaa cgatccgtag ccgtgcagcg ccgcccaggc 2153401 ggtcgcgtaa ttcaccggga ccgcggcacc ctgttcgaag ctcagcgcat cggggagcac 2153461 aaccgagtcg gtggccgcaa cgttgacgat ctcgcagtag ccaccaaatc gtgtaccggc 2153521 caggactcgt tcgccgaccc ggttcgggtc gaccccatca ccgacagcct cgaccgtccc 2153581 agcgacttcg tatccgacca ccgccggaag tttcggcgcg tctgggtaca ggccgacgcg 2153641 ggcgagatgg tcagcgaagt tcacccctgc tgcgcggacg gcgacccgca gctggcccgg 2153701 gcccggtggc ggcgggtccg gtcgctgccg cacctgcaag accgatgggt cgccatgttt 2153761 ggtgatgacc actgctcgca taatgttctc cttgtcaggc ttgacgggtc gcacccgcga 2153821 acacccctct gtgatagcac gagttatcag gaggttcggc ggggcgttac ctttgcggtt 2153881 gtgcacttcg actgggagcg cctgaccgac agcgtgcatc gctgccggct gccgttctgt 2153941 gacgtcaccg ttgggctggt ccggggccgc accggaatac tgctcgtcga caccgggacc 2154001 accctcggcg aagcaacagc aatcgcggcc gacgtcaagc agatcgctgg ttgccaggta 2154061 acgcatgttg tgttgacaca caagcatttc gaccatgtgc tgggttcctc ggtgttcgac 2154121 caagcggagg tgttctgcgc tcccgaggtc gtcgaatacc tacggtcggc taccgaccgg 2154181 ctccgcgaag atgccctgag ctacggcgcg gacacagctg aggttgaccg cgcgatcgcg 2154241 gccctgaaac cacctcagca cgggatctac gatgcagccg tcgatctcgg ggaccgcacc 2154301 gtcaccatca ctcaccccgg cagcggccac accacagcag atctcgtcgt ggtggcgccg 2154361 gccaccggcc atgcagacgg cccaacggtg gtcttcacgg gtgatcttgt cgaggagtca 2154421 gccgatcctg atatcgacgc cgattccgac ctggcggcct ggccggcaac gcttgatcgg 2154481 gtacttgcga tcggcggccc tgacgccagc tacgtcccgg ggcacgggaa ggtcgtcgat 2154541 gcgcagtttg tccgtcgcca gcgcgcctgg ttgcgaacac gtgcgagccg ccagcctcgt 2154601 gaaacgccag ctactttgcc gtgcaagcgg tgacgagcgc atccgggtcg gtaacgctga 2154661 cccacaattc gcgcaccgtc atcgacttct tccacatctt tgcctgttcg ggcggatcga 2154721 tcgtcagtgc caccaggccc ttacgtgacc cgttgaccag ccagcggccg aatccaaagt 2154781 gcaccccggc tgcgtagacc cttgcgttgg tcgcctctgc cttcgtgatc gacgtcaacg 2154841 ggatgtcggc ggcaaatgcc catcccatct tgacgtgcag gctccccgcc ccaacccata 2154901 gctcgctgtt cttggggccg agcccgagcg gcaccgcaag cgggagaaac caacggtcaa 2154961 agcgcaactg ggtcggcacc aagatgaccc taccggtgct agtgcggctc agtaccatgt 2155021 aggagttagt ctcgaaccgc cccagtggcg ttgcggaatt tgcgagccgt catcggtcag 2155081 tgatctaggt cgcccgtccg gggatacact cggtccgtca ggtgaatcgg ggctgcagag 2155141 gagcgcaagg ccatggccat cgccgaaacg gacaccgagg tccacacacc gttcgagcag 2155201 gactttgaga aagacgtagc cgccactcag cgatacttcg acagctcgcg ctttgctggg 2155261 atcattcggc tctacaccgc ccgccaagtc gtggaacagc gcggcacgat ccccgtcgac 2155321 cacatcgtgg cgcgagaggc ggcgggcgcc ttctacgagc gtctgcgcga actctttgca 2155381 gcccgcaaga gcatcacgac gtttggcccc tactcgccgg ggcaggcggt gagcatgaag 2155441 cggatgggta tcgaggcgat ctacctcggt ggttgggcta cctcagctaa gggctccagc 2155501 accgaagatc cggggcccga cctcgccagc tacccgctga gccaggtgcc tgacgatgcc 2155561 gcggtgctgg tgcgcgcctt gctcaccgcg gaccgcaacc aacactatct acgcctgcag 2155621 atgagcgagc gacagcgtgc ggcgacaccg gcttacgact tccgcccgtt tatcatcgcc 2155681 gacgccgaca ccggccacgg cggcgatccg cacgtacgca acctgatccg ccgcttcgtc 2155741 gaggtcggtg tgccgggcta ccacatcgag gaccaacgac ccggcaccaa gaagtgcggc 2155801 caccagggcg gcaaggtcct ggtgccgtcc gacgaacaga tcaagcggct caacgccgcc 2155861 cgcttccagc tcgacatcat gcgggtgccc ggcatcatcg tcgcacgcac cgacgcggag 2155921 gcggccaacc tgatcgacag tcgcgccgac gagcgtgacc agccgttcct tctcggcgcg 2155981 accaagctcg acgtaccgtc ctacaagtcc tgtttcctgg caatggtgcg gcgtttttac 2156041 gaactgggcg tcaaggagct caatggtcat cttctctatg cgcttggcga cagcgagtac 2156101 gcggcggccg gcggttggct tgagcgccaa ggcattttcg gcttggtctc cgacgcggtc 2156161 aacgcgtggc gggaggacgg ccagcagtcg atcgacggca ttttcgacca ggtcgagtcg 2156221 cggttcgtgg cggcctggga ggacgacgcg ggcctgatga cctacggaga ggccgtggcg 2156281 gacgtgctcg aattcggtca gagcgagggc gaacccattg gcatggctcc cgaggagtgg 2156341 cgggcgttcg ccgcgcgtgc atcgctgcat gccgcccggg caaaggccaa ggagctgggc 2156401 gccgatccgc catgggactg cgagctggcc aagaccccgg agggctacta ccagatccgc 2156461 ggcggcatac cgtatgcgat cgccaaatcg ctggccgcgg caccgtttgc cgacattctt 2156521 tggatggaga ccaagaccgc cgatctcgcc gacgctcgac agttcgccga ggcgatccat 2156581 gccgagttcc ccgaccagat gctggcgtac aacctctcac catcgttcaa ctgggacacc 2156641 accggcatga ccgacgagga gatgcggcgc ttccccgagg agctcggcaa aatgggcttc 2156701 gtcttcaact tcatcaccta tggcgggcac cagatcgacg gtgtcgcggc cgaggaattc 2156761 gccaccgcgc tgcgccagga cggcatgctg gcgctggctc ggttgcagcg caagatgcgc 2156821 ttggtcgaat ctccctatcg cacaccgcaa acgctagtcg gcgggccgcg cagtgacgcc 2156881 gcattggctg cctcctccgg acgcacggcg accacgaagg caatgggcaa gggctccacc 2156941 cagcaccagc acttggtgca aactgaggtg ccgcgcaagc tgctagagga atggctggcc 2157001 atgtggagcg gtcactacca gctcaaagac aaactgcgcg tacagcttcg gccgcagcgg 2157061 gccggctcgg aggtgctcga gctcggcatc cacggcgaaa gcgatgacaa gctcgccaac 2157121 gtgatattcc aaccgatcca agatcgccgc ggccgcacca tcctgttggt acgcgaccag 2157181 aacacgttcg gtgcggaact acgccaaaag cggctgatga ccctgatcca cctctggctc 2157241 gtccaccgct tcaaggcgca ggcggtgcac tacgtcacgc ccaccgacga caacctctac 2157301 cagacctcga agatgaagtc gcatggaatc ttcaccgagg tcaaccagga ggtgggcgag 2157361 atcatcgtcg ccgaggtgaa ccacccgcgc atcgccgaac tgctgacgcc cgatcgggtg 2157421 gcgctgcgga agttgatcac gaaggaggcg tagccagcgc tgccaactgt cttgggggcc 2157481 aaccgggtgt gcgtcgaggt ggcgcacatc gcgaaacgcg aaggatgctg tcagacggcg 2157541 tctgcggtgg cctgtcgaag atccagcgca ccggcgttca cctgcgtcgg cccgcggtcg 2157601 cgactaccat cgccgccccc gtttacggcc cggcacccgg tgagaagaag cccaggagca 2157661 tttggccgat gttgttgacg cccgagttaa acgcagcggt gaggtgacca acggtgctcg 2157721 tgttgttgaa gcccgagacg gtgttgccta agttcgccac gcccgacgcc agctgcccga 2157781 cgttgtagat tcccgagact ccgccttgca gcgcgttcgg cacctggttc cagaggcccg 2157841 aaatgccggg cccgacgttg ccgaagccgg atgcgcttcc atcgccactg ttgaagaagc 2157901 ccgaagacgg ggtggtggtg gagtttccga agcccggggc gctcgtgatg ttgatcggtc 2157961 ccaagccgcc gttggcggtc aagttcaggg gggatccggg aatggtgaag ccggggatcg 2158021 taaccgggct cgtgcccccg ctcaacggaa cattcaaccc aaacggatta atcgcgaagc 2158081 cggggatcgt aaccgggctc gtgcccccgc tcaacggaac attcaaccca aacggattaa 2158141 tcgcgaaacc agggatcgtg acagcgttgg tgcccccgct caacggaaca ttcaaatcaa 2158201 acggattaat cgcgaaacca gggatcgtaa ccgggctcgt gcccccgctc aacggaacat 2158261 ccaacccaaa cggattaatc gcgaaaccag ggatcgtgac agcgttggtg cccccgctca 2158321 acggaacatc caacccaaac ggattaatcg cgaaaccagg gatcgtgaca gcgttggtgc 2158381 ccccgctcaa cggaacatcc aacccaaacg gattaatcgc gaaaccaggg atcgtaaccg 2158441 ggctcgtgcc cccgctcaac ggaacatcca acccaaacgg attaatcgcg aaaccaggga 2158501 tcgtgacagc gttggtgccc ccgctcaacg gaacatccaa cccaaacgga ttaatcgcga 2158561 aaccagggat cgtaaccggg ctcgtgcccc cgctcaacgg aacattcaac ccaaacggat 2158621 taatcgcgaa accagggatc gtgacagcgt tggtagcacc gctcagcgga atattcaaac 2158681 cgaacggatt aacactgaat ccctggatgc cagactccag ggtgccgccg gccagcgtga 2158741 cgcctaatac gaatgtgcta agcgggatgg ggccgatgta gcccgtgaag ataccagcga 2158801 cgttaaacgg aagttcgttg agagtgatgt tgaccggtat cctgatgtta atcgtaaggg 2158861 ggatgcggga aatagggacg ccgggaacgg tgatcggacc gacaccaccc agcgcgttca 2158921 ggctcaacgg aataccagga atagtaatat ccggcaccac aatcggaccg acaccaccca 2158981 gcgcgttcag gctcaacgga ataccaggaa tagtaatatc cggcaccaca atcggaccga 2159041 caccacccag cgcgttcagg ctcaacggaa taccaggaat agtaatatcc ggcaccacaa 2159101 tcggaccgac accacccagc gcgttcaggc tcaacggaat accaggaata gtaatatccg 2159161 gcaccacaat cggaccgatg ccaccattca cttcgacgct cagtgggatg gcgggaatgc 2159221 tgagtgtgtc tgagtagcca atcagaccct ggtaatcgcc cctccacagt atgccgttgc 2159281 tgtagctgcc cgagatcagg gcgccggtgt taaggtcgcc aatgtttccc cagccggtgt 2159341 tgaggtcgcc gaggtttagg taccccgtgt tggcgttgcc cgggttgagg tcgcccgtgt 2159401 tggtgtcgcc ggcgttgtag ctgcctgtgt tgtagcttcc tgcgttgccg attccagtgt 2159461 tgacgttgcc ggtgttgaac aggcccgtgt tggcgttgcc cacgttaccc aggccggtgt 2159521 tgtagttgcc ggagttgccg atgccgacgt ttccgttgcc tgagttgaag aagccgatgt 2159581 tgccgttgcc ggagttgaag aagccgatgt tgccgctgcc ggagttcagc gccccgaatc 2159641 cgacctgatt gtcgccggtg agcccgatac caatatttcc agtgcccgtg ttgccgaagc 2159701 cgatgttgcc gttaccgatg ttcgcgaagc cgtagttgtt gccgccgatg ttcccaaagc 2159761 caatgttgtg cagggcctcc gtcaaccccg gacccgtgtt tgcaaaccca aggttgttgc 2159821 tgccgacgtt tccaaaaccg aagttgttgc ttccgatgtt tccgaaaccg aaacttccgt 2159881 tgccgatgtt tccgctaccg aagttgtagc taccgacgtt tccgctaccc acgttgtagt 2159941 cgccgaggtt tgcgttgccc aagttgagtg tgccgtcgtt ggcgaagccg aagttgaata 2160001 acgtcccacc tgcggcgttg cgcatgaagc cggcgagttg gctgtcggtg ttaccgacgc 2160061 cggagtgaaa ggccgatgtc gctaggccca gcgtgctggt gttgtagagg cctgagactg 2160121 tgttgccgaa gttcaagatt cccgatgtca gtggcccgac gttaaggaat ccggagttgc 2160181 cgagattccc agcaatgttc cagaagccag atccgcccga accgacgttc ccgaaacccg 2160241 atgtgccgcc cgtaccgctg ttgaagaagc ccgatgacgg ggtggtggtc gagtttccga 2160301 agcctggggt gcccgcgatt tcgatcggga tgttgatcgg cccgaggctg ccggacacgt 2160361 cgatgcccaa cgggattgcg gggatcgtga ttggcggggt agtgaggggg ccgatggcgc 2160421 cgcccacatc aatacccaac gggattgccg gaagtgagta gccatccggg aacaccgtaa 2160481 acgggcctaa ccctccgccc acatcaatac ccaacgggat tgccggaagt gagtagccat 2160541 ccgggaacac cgtaaacggg cctaaccctc cgcccacatc aatacccaac gggattgccg 2160601 gaagtgagta gccatccggg aacaccgtaa acgggcctaa ccctccgccc acatcaatac 2160661 ccaacgggat tgccggaagt gagtagccat ccgggaacac cgtaaacggg cctaaccctc 2160721 cgcccacatc aatacccaac gggattgccg gaagtgagta gccatccggg aacaccgtaa 2160781 acgggcctaa ccctccgccc acatcaatac ccaacgggat tgccggaagt gagtagccat 2160841 ccgggaacac cgtaaacggg cctaaccctc cgcccacatc aatacccaac gggattgccg 2160901 gaagtgagta gccatccggg aacaccgtaa acgggcctaa ccctccgccc acatcaatac 2160961 ccaacgggat tgccggaagt gagtagccat ccgggaacac cgtaaacggg cctaaccctc 2161021 cacccacatc aatacccaac ggaatagccg gcaaactata accacccgat aagaaggtga 2161081 tgggaccgat ttgaccactc actgtcacgt aatctggagg gaatccgggg aaaaatggcg 2161141 gaatcgcggg aatctcagga gtgcctagct gtatcgatat gctacccggg cctatgctgc 2161201 caacggtggg atttacgccg aataagccga tcgcaagcgg agacgcgggg atcgaaatcg 2161261 atcccacgtt aatgacctgg aacgccgata gctctaggcc aatagaattt agagtgatag 2161321 gcggaacatt gattggcccc accaacgccc ccgaactact cacacccaaa ccgatggcgg 2161381 gaacagtaat aggcggaaca ttgattggcc ccaccaacgc ccccgaacta ctcacaccca 2161441 aaccgatggc gggaacagta ataggcggaa cattgatcgg ccccaccaac gcccccgaac 2161501 tactcacacc caaaccgatg gcgggaacag taataggcgg aacattgatt ggccccacca 2161561 acgcccccga actactcaca cccaaaccga tggcgggaac agtaataggc ggaacattga 2161621 ttggccccac caacgccccc gaactactca cacccaaacc gatggcggga acagtaatag 2161681 gcggaacatt gattggcccc accaacgccc ccgaactact cacacccaaa ccgatggcgg 2161741 gaacagtaat aggcggaaca ttgatcggcc ccaccaacgc ccccgaacta ctcacaccca 2161801 aaccgatggc gggaacagta ataggcggaa cattgatcgg ccccaccaac gcccccgaac 2161861 tactcacacc caaaccgatg gcgggaacag taataggcgg aacattgatc ggccccacca 2161921 acgcccccga actactcaca cccaaaccga tggcgggaac agtaataggc ggaacattga 2161981 tcggccccac caacgctccg gaactgttaa tgcccaggcc gatttcggga atggtgatgg 2162041 acgggatggt gatggggccg acggagccga ggccgttgag gtctaggcca gcagcgggaa 2162101 tggtcagtgt gccggagaag ccgatcaagc cctggtagtc gcctcgccag aagaagccgt 2162161 tgctgtagtt gccagagttg aatccaccgg tgttgacgtt gccggtgttt cccacgccgg 2162221 tgttgaggtt gccggggttg aagaagcctg tgttggagct gcccgtgttg aagtcgcccg 2162281 tgttgaagct gcctagattg aagctgcccg tgttgtagtt gcccgtgttg ccgatgccag 2162341 tgttggcgat gccggcgttg aagaagcccg tgttggcttg gcccgtgttg ccgatacccg 2162401 tgttgtagct ggtacctgag ttcccgatgc cgaagtttcc ggtgcccgta ttgccgatgc 2162461 cgatgttgcc ggtgcctgag ttgaacaagc cgatattgcc ggtccccgag ttccagccgc 2162521 cgaacccctg ctggttgtcg ccggtgaggc cgaagccgat gtttccgttg cccgtgttgc 2162581 cgaagccgat gtttccgttg ccggtgttgc ccaagccaac gttgttgctg ccgacatttc 2162641 caaggccgaa gttgttgccg ccgggattta ggctgcccaa gttcaaaatg ccaaggttag 2162701 cggcgcccat ctgtccgaag cccgagtttg ccaggcctaa gctaagattt gccagcacac 2162761 ccttggaact ggtgatcgcc gcggtgacga cggccgccgg agcggccgcc aactgggcgg 2162821 gcaggtctgt cagattctgc ggcggcgcag tgaacggcgt cagggccgac gccaccgccg 2162881 atgccccggc atggtaggcc gacatcaccg acacatcgat ggcccacatc tgttcgtatg 2162941 cggcctcaat cgcagcgatc gccggagcat tctgcccgaa gaagtttgag aacaccaatg 2163001 acaccaggtc ggagcggttg gccgccacca gcgccggttg caccatcgcc gcccggacag 2163061 cctcgaactc ggccacgagg gccgcggctt gggttgccga ccgctgggcc tgggccgctg 2163121 ccgcggcaag ccatcctagg tacggcgcta ccgcagcggc catcgccgcc gatgacggtc 2163181 cgagccacga ttcgccgacc aggcccgacg tcacggagtt gaaagaggcc gctgccgagg 2163241 ccaattccat cgccagttgg tcccaggcga ccgcggccgc cgacatgggt tctgatcccg 2163301 ccccgccgaa tatgagggcc gagttgatct ctggtggcaa tgttgaaaaa ttcatggccc 2163361 cgactttccc tgggtgcacc gaattcatgg cggctcacca acccgcggtc ggcgagcgcc 2163421 gtgtcgctcg acgctactcg gcgatcttcg cggccgtatg catatcaccc gaatagggcc 2163481 atgattcata gatctcgtca aactgattta cggcgggcgc tttttagccg ctctaggaat 2163541 cgacgccaaa cccaacgaac gagcctcagc caaggccgaa atcgattaat tccccgatga 2163601 tttcatcgtt gtggaggtcg tcgcaggcgt cgttgatctg atcgtggcga ttacggctgg 2163661 tgatcctctc cgcggggcgg ggtccgcacg gattatggcg tggtgctctg gaagaacagg 2163721 cccgacaggt tgttgccgat gttggccaaa ccggagacca agctggtcac ggcaaacggc 2163781 agggtgccgg tgttggcgaa gcccgatatg ccgctgccaa ggttggagaa gccggagata 2163841 agaccaccgt agttctggta gcccgagccg gctagcagcc caacaggact tgtgttgaac 2163901 caacccgaca gtcccgagcc gctgttgccg aagcccgagt tcccaccgat tccggcgttg 2163961 aaaaagcccg acgagggcgt tgcgctcgag ttgaagtatc ccggccccgc tgggattgcg 2164021 aatccgccca tggtggtgct cggcaggtgg atgcgggcga tggtgagtgc gggtgtggtg 2164081 aaggccgcca agcccaccgg ctggatggtg aactctggcg tggtgatctc cgggatattg 2164141 acctggggga gggtgaaacc gctaagtccg atcgggtcga tggcgaacgg tggagtcgtt 2164201 atctcgggcg tcatgatctg aggaagcgtg aaaccaccca gcgctatcgg atcgatcgtg 2164261 aacgccgggg tggtaatcgc cgggatgctg agctgcggca gcgtaaaccc acccagcgtg 2164321 atcgggtcga tggtcaactc cggggtcgtg aactgttgag taatgatatc cggcaggctc 2164381 aatgcaccga caccaatcgg actgatcgtc aacgccggag tggtgaattc ttgggtgctg 2164441 atctctggca gggtgaaccc gtcgaccgag atccccccga gggaccacgg ttggatgacg 2164501 acgttgggga gggtgaaggg ggtgacgttg attgcgccga tcgagaagcc gacgccgttg 2164561 atttgacctc cacccacggt aatggtccca gtattaataa aggcaggagg tgtattagcg 2164621 aagccgccaa tctgcgggaa taccccgggc atattggttt gcaaggcagt gatgttgttc 2164681 ggaatgaaca ccaccaaatt agttatcgta atgccgttaa ggctaaaggt gggaagattg 2164741 atgacaccag aatttgcttg cgtggctatg ccgggagtgc taaagccgcc tatacttatt 2164801 tggggtgtac ttattaacgg ggtgtgtatc gtgggtagcg taaatccgcc gacagtggtt 2164861 gccagccgga atcgtgatcg gcggaaccgt caccgacgga atactcagcg tcggcagatt 2164921 gaacgcacct agcgctgtgc cagccggaat cgtgatcggc ggaaccgtca ccgacggaat 2164981 actcaactga ggcaagttaa acgcacctac cgtgatgttg gctggtgtcg ttgtagctgg 2165041 aatcgtcaac gacggcaccg tcaaccccgg caaatcaaac gcacccaccg tgatgttagc 2165101 tggcgtcatc gccgctggaa tcgtcaacga cggcaccgtc aaccccggca aatcaaacgc 2165161 acccacggta acgttggccg gcgtcgtcac cgccggaatc gtcaacgacg gcaaggttat 2165221 cgcgggcagg ctgaacgcgg gaaccgagat tccgggtatt tccagagacg gaagcgtcaa 2165281 atcagggctg gtgatggcga actgcaggct gccttggccc acaccacggt aaaagacacc 2165341 attgttcatg tcgcccgtgt tgaacaagcc gttattcatg tcgcctatgt tgaaggcgcc 2165401 ggtgttgatg cttcctgtgt tgaaccagcc ggtgttggcg ccgcccgtgt tgaaggtacc 2165461 cgtgttcgac gggcccgggt tgaaggcacc gaagttgtag tggccgacgt tgaagctgcc 2165521 ggtgttcgcg ttcccaacgt cgaacatgcc cgtattgaaa gagcccgcat tcaggaatcc 2165581 ggtgttgccg tgtccggggt tgaacaggcc agtgctgaag ttaccggagt tcccgatgcc 2165641 aaagttgcca ttgccggagt tgaagaaacc gatattggcg ctacccgcgt tgaataatcc 2165701 gacgttaccg ttgccggaat tgagtccgcc aatgccgatt tggttgtttc cagtcaggcc 2165761 gttgccgata ttgttgttgc cggtgttcgc aattccgaag ttgccgatgc ctgcattggc 2165821 gaatcccgtg ttcaaattgc ccaggtttgc aagtccgaag ttgttggcgc ccgcgtttcc 2165881 tatgccgatg ttgttgccac ctaggttggc cgagccgata ttgaagctac cgaagttgcc 2165941 ggaccccaga ttgttgttgc caaggttggc gttgccgatg ttgccgagcc cattgttggc 2166001 attgccgagg ttgccaccac cgacgttggc caagccgagg ctagcggcga tcgcccggcc 2166061 ggcaaaagtc ggcatgccca cggccgtggt gagcgcggtc accacggccg cgggcccggc 2166121 cgccagaccc gccggaagcc gcagcgggag ggcgaatgcc ggtagcgcca cagcgaccgc 2166181 cgacgccccg gaatggtagg ccgccatcgc cgatacatcc agagcccaca tctcctcgta 2166241 tgcggcttcg gcggccgcga tcgcgggagc gttttgacca aaaaggttcg atatcaccag 2166301 cgatatgagg ccggaacggt tggcggccac cagcgccggt tgtaccatcg ccagccgcac 2166361 agcctcgaac tcggccacca tcacctgggc ctgggtggcc gcctgctcgg cctgggtcgc 2166421 cgccgcggcc agccaccccg catacggggc tgccgccgct gccatcgcca ccgatgaccg 2166481 accctgccac gacccgccga ccagcccggc tgtcaccgag ccgaaagaga cagcagccga 2166541 ggctaattcg gttgccagcc cgtcccaggc cgacgccgcc gccagcatcg gtccggagcc 2166601 cgccccggcg aagatcaagg ccgagttgat ctccggcggc aacactgagt aatgcatcgc 2166661 tccccacctt ccggggtgag cctggtgctg atgaaaggtc acacgcccgt cgtcgctgac 2166721 tcgttcgtag cgcatgagag tacgcggaga tcttgaattg tgtatccgag caaatgaaac 2166781 cgttatctat ttgttataga catatcgggc acggatgcaa agttctttta cacgctatgc 2166841 gtaatcacga tccgtgcccg tctgatgtaa accaccgacg taggcgcact gatataaatg 2166901 catttattac caaggtgatt gggtgaaata attaccccgg aaaactgtgc tcaataggaa 2166961 cgattattag tttgaatcac tgccataatc caccctatgt gcaacccgga tgaattccga 2167021 tcgcgtgctt attcctgcca aacattcggg ctttagccct ggcccaccac gcgggcacca 2167081 atccgacgct gcccctacag cgaaatcacc ggcgcaccgc ctcccgctcg gccgccttca 2167141 ccagttgacc cgcgaagaac ctgaccgcgc cacccagcgc cgcccgcatc accggccccg 2167201 tcccacgaac cttttcggta aacgagccac tccagcggag atcggtaccg cccgacgcat 2167261 ttggtgtaag gaccacctcg ccgaagtagt cctggacggg tgtcctcgcg ccaaccagct 2167321 tgtagacgtg gcgacggtcc tgctcatact cgacggtctc ttcctgcacg aacaccggcc 2167381 acatgcctag tttgcggatg gccccgatgc cgccgggcgc gggatcaccg cgtcgcgccc 2167441 aactcgattg agcaacgatg ggcttggccc aggtcgccca gttgccaccg tctgtcacga 2167501 gccgaaacaa ggttgcagcc ggcgcgctgc tggtcttggt gacctcgaac gaaaatttcc 2167561 gacccgacat gcgcgactcc cgaaacgaca actgaagcgg cccgatatgg tgctgccgcg 2167621 taccctaccg cgcagccgtc cgtgccggcc gtagtggacc agccaaggtg ttcccgcgct 2167681 ggccgcagca ggcgcataat cacgaggtgt cccgcgcaga taccgtctca gtgccccgtg 2167741 cgcccaccca ggctgaggtc gccgcagtgc tgcgcatcat gacgccgctg cgcaaggtga 2167801 ttaaaccaaa ggtctatggg atcgaaaatg tgccgaccga acgcgcattg ctggttggca 2167861 accacaacac gcttggcttg gtcgacgcgc cattgctggc cgccgagctc tgggagcggg 2167921 ggagaatcgt ccggtccctt ggcgaccacg cccatttcaa gattccgggg tggcgcgacg 2167981 cgctgacacg aacaggggtc gtcgaaggca ccagagagat cacctcggag ttgatgcgac 2168041 gcggcgagct cgtcatcgtc tttcccggcg gcgcccgtga ggtcaacaag cgcaagaacg 2168101 agcgctacaa gctggtgtgg aaaaatcggc tggggttcgc gcgcttggca attcagcacg 2168161 gctatccgat tgtgccgttc gcttcggtgg gtgctgaaca cggcatcgac atcgtgctcg 2168221 acaacgaatc cccactgctg gcaccggtcc agttcctcgc cgagaagctg ctcggcacca 2168281 aagacggtcc ggcgctggtc cgtggtgtcg gactgacacc ggtaccgcgc cccgaacggc 2168341 agtattactg gttcggcgag ccaatcgaca ccacagagtt tatggggcag caagccgacg 2168401 ataacgccgc acgcagggtg cgcgagcgtg ccgccgccgc tatcgaacac ggcatcgagc 2168461 tgatgctggc cgagcgcgca gccgatccaa atcgatccct ggtcggacgg ctcttgcgct 2168521 cggacgccta aggcgcccct gaggcgttcc cggggcctga ttcagaagtc agaagaccga 2168581 gtcgacttga tcggggattg gggtgccgtc gttgcgcaat accggttgtt tcgatccgtc 2168641 ggggttgatg aatgcctccc cgcatacgta aggagcgtgc tggggcagcg ggtcgataaa 2168701 catcgggttg atcgcccact taccgcccct ggtgaacagg ccgtcgtagg cccggcacat 2168761 gaggtcgtcc tggttgcggt tgatcacgag tgacaccacg gtggcgtcgc cgacgaaggt 2168821 ggcgtcgctg ttcgagtcgc cggcggcgag gacttgacga cgatccgccg cgagctgatt 2168881 gaaggcttgc gggccagtca ccccgaagat gacctgattg gcccaacacc gtttgccatc 2168941 aaggtaggtc atgactgaat cgtcgccgtc gcggacgcct ccgcaaccga cgaggtgagc 2169001 ggtgagtttc ccggactggt cggcgacgct gcggactccg acgacatgct gatcgtctag 2169061 acctacctcg cccgcccaca ccttgacgat cggttcgggt gacgctgaca ccacccaggt 2169121 gtcgataccg tgtgcctgca gagtaccgat gaggtctttc atttgtggat agacgcggat 2169181 gtaaccatcg acctgctgtg ttccgacctg ctgggtggcg ccgacatcgg cggcaaggtt 2169241 ctgtttcttg gcctggtctg cgaatccggc gagctcctca gcggtgtagc ccgccgacag 2169301 tgcgttgctc cacgcgtacg gacccgccaa ccggcgcacg ttgttaccca cgaaagccgg 2169361 ctgtcccgtg gtggtttcgc cgtcgagaag ggaaaggatc tcgttcgcgc acaacgcatt 2169421 gctgccggtc ggcagcggct tgccggcagg tacaaccttg ccgcatgcca cgctcagcgc 2169481 gttcgccgcc gcgtcggtca ggtatcggct ggcggcatgc caatcctggt tggctggctg 2169541 cagcaccagg ctgtgctgca gcatgtagta gttcgtggcg tagccgatgt cgttcttgac 2169601 gacggtgttg tcccagtcaa agatggcgac cttgcgcgca gaaccgtccg cggtgccggt 2169661 gcacctgctg ttggcatcga tcgccgactg caggaattca cgaactccgt ggtgccactt 2169721 cagaaacgcg tcgagctgac gacagccgga cgctggggtc gggggttggt gggccgagca 2169781 gccgatgacg ccaccgagca cggttgccat tgccaacagc gacggtatga gtcgcaccat 2169841 gtaagccctt cgtcagccct tggtcgtgcc agcatgcgcc ggatggaagg gggatgggaa 2169901 ctgaatggtt gcctgctgaa ctgaacgctg agcaaattcg atgccgacga aacattatgg 2169961 gtttgtttct cgacggcaac ccgtgcgcga ttcgacagtc accgcgatgc tgccgacgcc 2170021 ggcccgcgct cccgggcgat ccgcgtgagc agcgtaatct cgtgcgcacg gatttgcggc 2170081 ccggactagc gcgaaagata ctgttgaaca gatggattcg actgtaacgg cctcgatccg 2170141 acgcatgctg ggactgctcg ccgccacatt gctgctcggc ggctgcaccg gccagcacac 2170201 gacacgcaca gcggcgagca ccacatacac gccccacatc aaggccagca gtcaggacgt 2170261 actggacggc gccatcaatg ccgacgagcc aggttgttcg gccgcggtag gagtcgaggg 2170321 gaaagttatc tggtcaggcg ttcgcggcat tgcggatctg gcatccggcg ccaagatcac 2170381 cacggacacc gtgttcgaca tcgcgtcggt gtccaagcag ttcaccgcca ccgcgatcct 2170441 gctgctcgtc gaagccggaa agctaacact cgacgacccg atatcccaat acgtacccga 2170501 gctacccgac tgggcccaaa ccgtcaccgt cgagcagctc atgcatcaaa ccagcggcat 2170561 ccctgattac gtcgcattgc tggcagccag ggggtatcag gtcagcgacc gcaccatcga 2170621 ggccgaagcc aggcaggcgt tagcggccgc ccccgagctg caattcaagc ctggcaccag 2170681 gttcgattac tccaactcca actacttgct gctcggcgag attgtccacc gcgcatcggg 2170741 acaaccgctg cctgagttcc tcagcgccga gatctttcaa ccgcttggtc tggccatggt 2170801 ggtggatccg gtcgggaagg ttcccaacaa agccgtgtca tatgagaagg gcactggtgg 2170861 aaaccggtcc gagtaccggg tgggcaatcc ggcctgggag cagatcggcg acggtggcat 2170921 ccagaccacg cctagccaac tggcccggtg ggcggacaac taccggacag gaagcgtcgg 2170981 cggcctgaaa ctgctcgaag cacaacttgc cggtgcggtg gaaaccgaac ccggtggcgg 2171041 cgaccgctac ggcgccggaa tcgtgtcgcg cgccgacgga acactcgacc acgcgggcgc 2171101 ctgggccgga ttcgtcacgg cattccacat cagcagtgac cgacggactt cggtggccat 2171161 cagctgcaac accgacaagc cggacccggt ggccatggcc gatgcgctgg ggcgcctttg 2171221 gatgtagcgg ggctaccgcg gttggccgcc ggtacccagg ctgcaatcat tcacggtatg 2171281 gcgcaaccac cgtcactcct cacaactgac aatggcctac ccttcggcgt gcaaggtgcc 2171341 tgcgactccc gtttcaccgg agtcatccgt gcctttgctg ggctgtaccc cggccgcaag 2171401 ttcgggggtg gggcactgtc ggtttatatc gacgatcgcc aggtcgtcga tgtctggacg 2171461 gggtggtccg atcggcaggg caaagtaccc tggacggccg ataccggggc aatggtgttc 2171521 tccgcgacca aagggttggc cgcaacggtg attcaccgtt tggtcgatcg cggccttttg 2171581 tcctacgacg cgccggtcgc ggagtactgg cccgagttcg gagctaacgg caagtctgag 2171641 gtcaccgtca gcgatgtgtt gcgacatcgg tccggactgg cgcacctcaa gggggtggac 2171701 aaggacgagg tcatggacca cctcctgatg gagcagaagt tggcggctgc gccgctaaac 2171761 cgccagcacg ggaagttggc ttaccatgcg gtgacttacg gatggctgct gtccggcttg 2171821 gctcgtgcag tgaccggcaa aggcatgcgt gaactgttcc gcgaagaact cgctcgcccg 2171881 ctgaacaccg atggtattca tctcggccgg ccaccggccg actcgcctac caaggcggca 2171941 cagacacttc tgccccaagc caaggtcccc accccactgc tcgatttcat cgcaccaaag 2172001 gttgcggggc tgtcgttctc cgggctgctc ggcgccgtct acttcccggg catcctgtcg 2172061 ttgctgcaag acgatatgcc gttcctcgac ggtgaggttc cggcggtcaa cggcgttgtg 2172121 accgcgcgcg ccctggccaa gacgtatggg gcgttggcca atgacggtgt gatcgacggc 2172181 acccgactgc tgtcgtcgca ggcggtacgt ggattgacgg ggaagtccga gctatggccg 2172241 gaccttaatc tcggtcttcc ttttacctac caccagggtt accaatcgtc tccggtgcct 2172301 gggctgctgg aggggtacgg ccacatcggg ctcggtggca cgatcggatg ggccgacccg 2172361 gagaccggca gcgcattcgg atatgtgcat aaccgcttgc tgacgctact gttgttcgat 2172421 attggctcgt tcgcagggct ggctgcgctg ctgaacagcg ccgtcgtggc agcacgtcgc 2172481 gatgaccccc tggaagtgcc gcatttcggt gcgccctata gcgaaccgcg tcatgagcag 2172541 gcggcctcgg gtgcataact gctcccgtta tgccgcgagc gcgagcccga cgggctagaa 2172601 ctcgtaaacg agtagccaga cgagagcgac ggccgccaag aacagaccaa ccaggatagc 2172661 cgcgcgggta accagtacct ggcgatggaa ccactctcgc agctgggtga atcgccagtc 2172721 ggtccaggcg taggcgcgca cagcccactg cgcctcgacc gcgagcagtc gaaacgcgac 2172781 cagcagggcc gggatgccga gttcggggag cagcacgatc atcggcaggg atacgacgaa 2172841 tagcccgcca ccgaccacag cgagtgtcgc gcgaatcagt agcggcctgg cccgtacccg 2172901 ctgtcggtat gcgagcactc gggcgagcgc ggcgtcgcgg gtggaagtcg ggttgatgac 2172961 gtcggccggg tccatgactg ctcctagtgt gcctgcctcg acgcctagcg gacggctgtg 2173021 tcgggggtgg tttggttcgg actctagtgg agcccggttg cgcactcggg tccgaccaat 2173081 gcggggccgc gcctcatacg cacgataagc gtgggtgcat agactgcggt tatgaatgac 2173141 ggctcccggc aggaactcag ggttcgtagc ggcctactac aaatcgagga ctgcctggat 2173201 gctgacggcg gcatcgcatt gccggcaggc accacgctga tctcgctcat cgagcgcaac 2173261 atcaagtatg tcggcgacct cgtggcgtat cgctacctgg accacgcccg ttcggccgcc 2173321 ggatgcgccc tggaagtgac ctggacgcaa ttcggtatgc gattagcggc cataggtgca 2173381 cacgtgcaac ggttcgcagg ccccggcgac cgcgttgcga tcctcgcacc acagggcatc 2173441 gactatgttt gcgggttcta cgctgcaatc aaggcaggca ccgtcgcggt gccgttgttc 2173501 gcacccgaac tgccgggtca cgccgagcgt cttgatacgg cacttcgcga ttcggagcca 2173561 gcggtcatac tcacgacggc ggcggcgaaa aacgccgttg aaggttttct gaacaacgtt 2173621 ccgcgcctgc gaaagccgac agtcctcgtc atcgatcaaa tacccgaccg cgagggggag 2173681 ctgttcgtcc cggtcgagct ggacatcgac gccgtatccc acctgcagta cacctcgggc 2173741 tcgacgcgac ccccggtcgg tgtcgagatc acccaccgcg cggtcggcac caacctggtg 2173801 caaatgatcc tgtcgatcga cctgctcaac cgaaacaccc acggcgtcag ttggttaccg 2173861 ctgtaccacg acatgggcct atccatgatc ggctttccgg cggtctatgg cggacactcc 2173921 accctgatgt cgcccacggc gtttgtccgc aggccactgc gatggatcca ggcgttgtcc 2173981 gaggggtcgc ggaccggacg cgtggtcacc gcggcgccaa acttcgccta cgagtgggcc 2174041 gcacagcgtg gactacccgc gcaaggcgac gacgtcgacc tcagcaatgt cgtgctgatc 2174101 atcggttccg aaccagtcag catcgatgcg gtgaccacgt tcaacaaagc gttcgcgccc 2174161 tatggtttac cgcgtacagc gttcaaaccc tcgtacggca tagccgaggc gaccctgctc 2174221 gtcgcgacca tcgaccatgc cgctgagccg acggttgttt atcttgaccc agagcagttg 2174281 ggcgccggac acgcgacgcg cgtcgcgccg gatgcgccca acgccgtcgt gcacgtgtcg 2174341 tgtggccatg tggcccgcag cctgtgggcc gtgatcgtcg acccggatac cggccccgag 2174401 gcgggcgccg aactgcccga cggtgagatc ggtgaggttt ggttacaagg cgacaacgtt 2174461 gctcgggggt attggggacg gccggaagaa acgcggatga cgttcggtgc ccgcttgcaa 2174521 tcaccgctcg ccgaaggcag ccacgccgac gggtccgcga tcgacgacac ctggctgcgc 2174581 accggagacc tcggcgtgta cctcgacggt gagctctaca tcaccggtcg aatcgcggat 2174641 ctgctgacca tcgacggccg caaccactat ccgcaggaca tcgaggccac ggccgccgag 2174701 gcctcgccga tggtgcggcg cggatacata accgctttca cggtgccggc cagcgacggg 2174761 gacgaccgca atcagcgact ggtgatcatc gccgaacgtg cggcaggcac cagtcgcagc 2174821 gacccgcggc cggcgctcga cgcgattcgc gcagcggttt gcaaccgcca cgggttatcc 2174881 gttgcggacc tgagtttcct gccggccggc gccattccac gcaccaccag cgggaagctg 2174941 gctcgccagg cctgccgcgc ccaatacctc agcggtcgcc tgggcgtgca ttagctacga 2175001 tctacggctc ccaaatcagc agatcctcca tgccgttgtt catcgcgacg atggttggcg 2175061 atgggccggt gacatcgaag tagattttgc cggtcgattg ttcgccttgg gggatagtgg 2175121 ctccgctaat ggtgtcgggg cccgcggctt gccacagcac ccggtagttg atgccgtcgg 2175181 cggtgcgggc attgaactgc gagaccgcgg gcgtgacgct gccgcgaatc gcattgaccg 2175241 tggcagtggc ctcccagacc tggccggcca ccggatagcc ggggatgact gccgtgctgg 2175301 atttgagatc actgaccttc cagccgagca cgacttggcc aacggtgtcg gtcatcgtta 2175361 gctcactgcc aagttttccg gtgatgggat aggcagccaa cgcgaccggt gccgcaaagg 2175421 tcgcgatggc cgccatggcc acgaccgcta ctgccgtctt gatcattgtg gtgagcttca 2175481 ttggtcccta cctccactac ttgttggggc gattacctgg ttcgaacctc gccgacgtca 2175541 ttaccttaag ccgcaaatga cccgctgcta actccagatt cgataggaac cgtggggcag 2175601 acgatgccgt tcacatccgt agccggcgca ccgacgacgg gcgtggccat gaatgcttga 2175661 tggccgagtc gtaggcgacc agcgcaaggg agccaaaccg catgtcagga tggtgtggtg 2175721 accgccatac ccggcccgtc gggcgccgaa cccggtgaga gccgcgcgct cgcgggttac 2175781 ccggtgacgc cgccggcgct gccccgcccg gtgatcttcg accagcgctg gactgacctg 2175841 accttcatcc actggccggt gctgccggag agcgtggcag gcagctaccc gcccgggact 2175901 cgccccgatg tcttcgccga tgggatgact tacgtgggtc tggtcccgtt tcgcatgagc 2175961 agcaccaaac tcggcaccgc actgccgatc ccgtatgtcg gcaccttccc ggagaccaat 2176021 gtccggttgt actccattga taacgccggc cggcacgggg tgcttttccg gtcgctggaa 2176081 acagctcgac tgactgtcgt accgctcacg cggataggac tcggcatccc gtacgcctgg 2176141 tcgaggatgc ggatgatgcg ctctggtaag cacattacgt atcacagtgt ccgccgctgg 2176201 ccacggcgcg gactgcgcag cctattgacg atcaccatcg gtgacctggt tgagccgacg 2176261 ccgctggaag tctggcttac cgcacggtgg ggtgcgcata cccgcaaggc tggccggact 2176321 tggtgggtgc cgaacgagca taagccgtgg ccgttgcggg ccgcggagat cgccgagttg 2176381 aacgacgagt tgatcgacgc aagtggcgtg caacccactg gcgatcggtt gcgcgccctg 2176441 ttttcaccgg gtgtgcatgc ccgattcggc cgtccgtgtg tcgttcagtg acgtttaggg 2176501 gcaggtgtat ccaccatcaa tcacgatgtc ggaaccggtc atatagctgg aagcctcgct 2176561 agccagatac aggtagaggc cagcgagttc ttcgggccgg cccaaccggc ccaacggaat 2176621 cttgggctcc catagcggct ggtattccgt gtacggttcg acgagctcgg tcaggatata 2176681 gcccggactg acactgttca cccggatttt atgcggcgcc aactccacgg ccatggcttt 2176741 ggttagatga atgaccgccg ccttggaggc gcagtagtgg gaaacctgct gcgggacgtt 2176801 gatgatgtgg cctgacatgg aagcagtgtt gatgatgacc ccgccttggc cttgtttgac 2176861 catcgccttg gcagcggcct gcgcggtaag gaagacgcct gtcacattgg tgttttggag 2176921 gcgctggaac tcttccagcg gcatgtccag catcggagtg accgtgatga tgccggcgtt 2176981 gcagaccgcg atgtcgatcc cacccagctc cgcggtcacc tgatccaaca tgctggtcac 2177041 ctgctggtgc tggctcacat cgcagcagac gggcacgacc ttgccacctg atgtgccaat 2177101 ctcatccgcc aacttctcta aggcatccaa atgccgtgcg gcgatcgcca cttgagcccc 2177161 ggcttcgacg tatgccaggg caactctctt gccgatgccg gtggatgccc cggttatcag 2177221 cgccctcttg ccgtgcaagt cgaacaggtc caacacgctc attcgtgatc ccctttcgcg 2177281 cgacgcaggg ccgatacctg atggaatcac atgccgaaat gcgttcgatg aactgccgca 2177341 atggcttcca gtggtccgct cacttcgacc cgcgctacgg ctcggcgtcc aaagacgtac 2177401 agcagcaact cgccgggcgg tccggtcagg cgagccgtcg gctcgcctga ccggaccctc 2177461 acccgcttac cggttccaac ccactcgatc tcaagcccgc aaccgtgcag ccgccgactc 2177521 aggaagtggc tgccgcgccg aacatttcgc catagggcag catccatttc gggcgtgagg 2177581 cttcggggcc ctcgtccgct ggcgcggcga acgtcctcgt gatggacaaa gaattcgttg 2177641 aggttcgcca aggtacgaac ccatccgatg cggaagaacc ccatcggtgg accggaccga 2177701 atccgagcga cgagccacgt gaagtcttta ctctgagcca atctcgctct acggcgttcg 2177761 gcaaaccgct ggaagggacc cggtagaacg atgcaaaggc cagcaacgag atcgcgttca 2177821 cgcagcacga tgtgagcggc caggtcgtga gcagtccagc cctcgatcag tgtagcaacc 2177881 gcaggaccga gctcctcaag gagatcacag agctccaagc gttcttgcgc gtccaacggg 2177941 acatcagcca cgccgcggga gtctacgggc gacgtgcctg cgcgccaacg ggctgccgct 2178001 tgcgccgtcg cgactgcaca gcagccagcg cccgctccca ggcgagcagc gttgcggccg 2178061 tcagattggc cggtttggcg ctgtccttgg acagcagcgc ggtcgcggcg gctttggtgg 2178121 tcggcgacgc cttcgacatg tgaccggagt cgaacggcgg ctgcgggtcg tactcgatcg 2178181 ccagctgaat cgccttggcc cgggcctccc cgcccagctg tccggccagc cagagggcga 2178241 gatcgagccc ggcggacacg cccgcgctcg tgacaatgtt gtcctggtgc acaatccgct 2178301 cgtcggcgac cgggatagcg ccgaatgcct tgagcgcggg aagcgtcagc caatgcgagg 2178361 tcgcgcgccg gccccggagc cacacgaacc gcacctgggc gtgcggcagg tttcgcagca 2178421 cctcgtacgg gccgaccacg tccagcgcgg taacgccggg gtaggccacg aatgcgattt 2178481 gcgtcatcgg tgttctccct agtgtcaggc gaaggctttg cggtattggt cgggtgatat 2178541 cccgacgcgg cgaatgaagc tgcggcgcat ggtttccgcg gtcccgaagc cgcatcgggc 2178601 ggcaattgcc accacggtgt cgtgggtctc ctccaactgg cggcgcgcag cctcggtgcg 2178661 gatgcgttcg acgtaccggc cgggcgcctc gccgacctcg tcgctgaaca cccgagtgaa 2178721 atgacgcggg ctcatggccg cacgttgagc cagttcgccg atgcggtgcg cgcccccggc 2178781 tcggcctcga tggcctcctg cacccggcgg atcgaggtcc gtttggcgcg tggcatccac 2178841 accggagccg cgaactgggt ctgcccaccg ggtcggcgca gatacaggac gagccagcgg 2178901 gcaaccgtct gggcaatctc ggtgccgtgg tcgtcttcga ccagtgccag cgcgaggtcg 2178961 atgccggcgg tgactccagc cgcggtccac accttctgcg aactgcgcat gaagatcggg 2179021 tcggcatcga cccgaacggc cggaaattcg cgggcgaaat gttcggcaaa ggcccagtgc 2179081 gtcgtcgctc ggtgtccgtc ccaacaaccc cgcttcggcc gcaagaaacg cgcccgtgca 2179141 cacggtgacg acgcggcggg cggtgccgga gacggctttg acccagtcga tgagggccgg 2179201 ttcggaccgt gcggcatcga ctccggcgcc accgggcagg atcacggtgt cgacggggtc 2179261 gccggggaat cccacgataa ccactcttcg cgccatgaat gccagtgttg gccaggcgct 2179321 ggcctggcgt ccacgccaca caccgcacag attaggacac gccggcggcg cagccctgcc 2179381 cgaaagaccg tgcaccggtc ttggcagact gtgcccatgg cacagataac cctgcgagga 2179441 aacgcgatca ataccgtcgg tgagctacct gctgtcggat ccccggcccc ggccttcacc 2179501 ctgaccgggg gcgatctggg ggtgatcagc agcgaccagt tccggggtaa gtccgtgttg 2179561 ctgaacatct ttccatccgt ggacacaccg gtgtgcgcga cgagtgtgcg aaccttcgac 2179621 gagcgtgcgg cggcaagtgg cgctaccgtg ctgtgtgtct cgaaggatct gccgttcgcc 2179681 cagaagcgct tctgcggcgc cgagggcacc gaaaacgtca tgcccgcgtc ggcattccgg 2179741 gacagcttcg gcgaggatta cggcgtgacc atcgccgacg ggccgatggc cgggctgctc 2179801 gcccgcgcaa tcgtggtgat cggcgcggac ggcaacgtcg cctacacgga attggtgccg 2179861 gaaatcgcgc aagaacccaa ctacgaagcg gcgctggccg cgctgggcgc ctaggctttc 2179921 acaagccccg cgcgttcggc gagcagcgca cgatttcgag cgctgctccc gaaaagcgcc 2179981 tcggtggtct tggcccggcg gtaatacagg tgcaggtcgt gctcccacgt gaaggcgatg 2180041 gcaccgtgga tctgaagagc ggagccggcg cataacacaa aggtttccgc ggtctgcgcc 2180101 ttcgccagcg gcgcgaccgt ctggagttcg tcaccgttgg ccgcgctcat cgcggcgaac 2180161 atcaccgtcg cccgggtggc gtcgatctcg atcatcatgt cggcgcaggc gtgcttgacc 2180221 gcctggaagg aaccgatcgg tcgatcgaat tgcgttcgcc gcccggcgta ttgcaccgcc 2180281 aggtcgaggc aggcctcggc gccgcccagc atctcggcgg ccaacagcac ccgggccacg 2180341 tcgagcaccc gctccatatc gtcgggcgtc ccggcggtca gcggctcggc gggggacccc 2180401 gccagccgga gcgtggcgac cggacgggtg atgtcaaacg agggcaacgg tgtgacggtc 2180461 accccggggg cgtcggcggc cacgacgtgc agaacgatcg acccgtcggc caccgcgggc 2180521 accacgaaca ggtctgcgac gtgaccgtgc agcaccgggg tgcactcgcc ggtgagtgcg 2180581 ggccgaccgt cgcgccgaac ggcccgaacg gtggtagccg acgcgacgtc gtggccactg 2180641 acggcgatcg ttccgatccg cgcgccggta agcagaccgg cgagcaggcg cttgcgctgc 2180701 tcgtcgtcgc ccatgcgcag aatcgcttcg atcgcaaaca ccgtggccgc aaagggaatt 2180761 ggggtgagcg cccggccgag ttcggcaaac gcgatcgcgg tctcgactaa ggtggcaccc 2180821 aatccgccgt gctccggcgg gacgtgcagc gcgggtaatt cgagctcggt gcaaagccgt 2180881 tgccacagcc tgcggtcgga tccgtccgcg gcagccatct cccgcacggg cgcgccccgg 2180941 ccaaggaagc cgcgcagcga ggcgcggaaa tcgtcttgtt cggtgctgta tcggaagtcc 2181001 acgtcagcag agcacttcgg gccgcggctc cttggggagg ccgagcagcc gctcgccgat 2181061 cacgttgcgc tggatctgcg agctgccggc atagatcgtc gcggcccgtg cgtagagcag 2181121 ctcatccatc cagcaggccg gggagtttgg cgtacccgcc tccgggacca gccgcgcacc 2181181 gccgttgccg ggcccccgcg ggcccagcgc ctcgagcccc aggatttcga cggcgagatc 2181241 ggtgtaccgg cggaaatatt cgctccagat gaccttcgtg atcgcggctt ccgcgccggg 2181301 cggccgtccg gtcagggcca gggtgaggtc acggtagccc cgataccgca tgatctgaac 2181361 ccgggcatag caccacgcca agccgtctcg tacccgtgga tcggtgtgta atccgcggtc 2181421 acgggccagc tcgcacagcc gctgcaggtc ccgctcaaaa tcgatggcgg cggtggcgat 2181481 gtgcgatccg cgttcgaagc cgagcagcgt catggcggtc gaccagccgt cgccgacccg 2181541 gccgacgaca ttgccggcgc tggtgcgggc atcggtcagg aagacctcgc tgaacgagga 2181601 gtgcccggcc gcgttgacga tcggccggac cacgacgccg ggctggtcca tgggcaccag 2181661 cagaaacgac aggccccggt gtttcgcagc gctgggatcg gtccgcgcca gcaggaagat 2181721 ccagtttgcg gtggtgccgg ccgacgtcca gattttgtgg ccgttgatca cccattcgtc 2181781 accgtcgagc acccccctgg tgcgcaccga ggccaggtcg gagccggcct ccggctcgga 2181841 gaagccctgg caccaccgat gctcgccgct gaggatgcgc ggcaggaaat gccgcttctg 2181901 cgcctcggaa cccagggcga tcagggtgtt gcccagcagg tcgattccga gcaggtcgtt 2181961 ttccgcgcgt tcgggcgcgc cggcgcgggc gaattcctcg gcgagcacca cttgttccat 2182021 cggggacagg ccaccacccc cgtattccgt cggccaggac accgcgacca ggccagcgcc 2182081 ggccagggcc cgccgccagt gccgggcgaa ctcttcccgc tcgtggggcg gcagcgcccc 2182141 gggtccgggc cacccgggcg gcaggtgctc ggccacaaac tcccggatcc ggtcgcggaa 2182201 cgcttccgct tcgggtgggt agctgacgtc cactgcgcgc cccggcctca gggccgctgc 2182261 ttgatcgcgg gccggatctg cggtgcggcg cgccagtcct ccaggccgta ctcgaccgtt 2182321 ccgtaggaca gcttgccgcc ggtgacttcg ccccagtgcg cgtgattgag ctggtggatc 2182381 ttgaagcaac cgtccagcgc ggcggaaaac cccatggcat cgacggtttg gttcaccgat 2182441 tccttgatca gcagtgccgc catcgtcggc accttcgcga tccgacgcgc gaattcgatt 2182501 gtgctggtcg cgagttcgtc agcgggaaac accttgctga ccatccccag cgcgtgggcc 2182561 tcgtcggcgc ctatgcagtc gccggtgagc agcagttcct tggtcttgcg cggcccgaac 2182621 tcccacggat gtccgaagta ctcgaccccg cacatgccca gccgggtgcc gaccacatcg 2182681 gcgaacacgg tgtcctcgct ggcgacgatc agatcgcagc accaggccag catcaacccc 2182741 gccgacagca cggccccgtg cacctgggcg atggtgatct tgcgcaggtt gcgccaccgc 2182801 ttggtgtttt cgaagtagta gtgccactcc tggcggttgc gtgactcgac cccgccgaag 2182861 gtcgccccgt tgcaccggta gctggggtgc tggtccggcc cgggcgagcg ttcccggata 2182921 tcgtcagcgg atccgaggtc gtgaccggcg gagaaggcgg ggccggcggc ccgcaggatc 2182981 accacccgga cggtgtcgtc cgcctcggca agttcgaagg cggcgcccag ctcgaccagc 2183041 atgccgcggg tctgggcgtt gcgttgtttc gggcggtcca gggtgatcgc ggcgatgcgc 2183101 ccatcgtcga tggtttcgta gcggatgtat tcgaactccc ggggccgtcg ggagcgttcc 2183161 ccgtccgacc ggcgatcgac cggaccgacc ctgccgacga acatgtccgc tccttactgg 2183221 acgtgaacgg ctgacctgtg cgaggttacc cgtcccttag ccaacatgtc catagccaat 2183281 acgcacatga gagtgatcga tatagacaaa ttcccatgca aagaagcact tgtgtacaac 2183341 gaagtatctt ggtagtactg tgatatacgc aaagggcgcc accgcagcgc gccgggcatc 2183401 cgaccggtac aaccaggaag ggttgacgat ggagatcgga atattcctca tgccggccca 2183461 tccaccggag cgcaccctct acgacgccac ccggtgggat ctggacgtca tcgagctggc 2183521 cgatcaactc ggctacgtgg aggcctgggt cggcgaacac ttcaccgtgc cgtgggagcc 2183581 gatctgcgcc cccgatctgc tgttggcgca ggcgctgctg cgcacccaac agatcaagct 2183641 cgccccgggt gcgcacttgt tgccctacca tcatccggtc gagttggccc accgggtggc 2183701 ctatttcgac cacctcgccc agggtcggtt catgctcggc gtgggcgcca gcggcatccc 2183761 gggtgactgg gcgctgtatg acgtggacgg caagaacggc gagcatcgcg aaatgacccg 2183821 ggaagcgctg gagatcatgc tgcgcatctg gaccgaggac gagccctggg agcatcgcgg 2183881 aaagtactgg aacgccaacg gaatcgcgcc gatgttcgag ggtctgatga ggcgccacat 2183941 caagccgtac cagaagcccc acccgcccat cggcgtcacc gggttcagcg ccggctcgga 2184001 gaccctcaag ctcgccggcg aacggggtta catccccatg agtctggacc tcaacaccga 2184061 atacgtcgcc acccactggg acgcggtgga ggaaggcgcg ctgcgcagcg ggcgaacccc 2184121 ggatcgccgc gattggcggc tggtgcggga ggtgctggtg gccgagaccg atgagcaggc 2184181 gttccggtat gccgtggacg gcacgatggg acgcgccatg cgtgagtatg tgctgccgac 2184241 gtttcggatg ttcggcatga ccaagttcta caaacacaat ccgtcggtgc ccgacgacga 2184301 ggtgacaccg gagtatctcg ccgagaacac cttcgtggtc ggctcggtgc agaccgtggt 2184361 cgacaagctc gaggccacct acgaccaggt cggcgggttc ggccacctgc tgatcctcgg 2184421 gttcgactac agcgataacc cgggcccgtg gaaggagtcg ttgcggctgc tggcccacga 2184481 ggtcatgccc agactcaacg cccgcctcgc caccaagccc gccaccgcgg tggtgtagcc 2184541 atggcggttc gtcaggtcac cgtcggctat tcggacggca cgcacaagac gatgccggtg 2184601 cggtgcgacc agacggtcct ggatgccgcc gaggaacacg gcgtggccat cgtcaacgaa 2184661 tgccaaagcg ggatatgtgg cacctgtgtg gccacctgca ccgccggccg ctaccagatg 2184721 ggacgcaccg agggactgtc cgatgtcgag cgggcggcgc gaaagatcct cacctgccag 2184781 acgtttgtta cctccgattg ccggatcgag ctgcagtatc cggtcgacga caacgccgcc 2184841 ctgctggtca ccggtgacgg tgtggtgacc gcggtcgagt tggtgtcgcc cagcaccgcc 2184901 atcctgcggg tggacacctc tggcatggcc ggcgcgctga gataccgggc cggccagttc 2184961 gcccaattgc aggttcccgg taccaacgta tggcgcaact actcctacgc ccatccggcc 2185021 gacggccgcg gtgagtgcga gttcatcatc aggttgctgc cggacggcgt gatgtcgaat 2185081 tatcttcgcg accgcgccca gcccggtgac catatcgcgc tgcgctgcag caagggcagc 2185141 ttttatctgc gcccgatcgt gcgaccggtg atcctggtcg ccggaggaac cggcctgtca 2185201 gcgatcctgg cgatggccca gagcctggat gccgatgtcg ctcacccggt ctacctgctc 2185261 tacggggtcg agcgcaccga agacctgtgc aagctcgacg aactcaccga gctgcgccgc 2185321 cgcgttggcc gcctggaggt gcacgtcgtc gtcgctcgcc cggaccccga ctgggatggg 2185381 cgcaccgggc tggtcaccga cctgctcgac gagcggatgc tggcgagcgg tgacgccgac 2185441 gtgtatctgt gcggtccggt cgccatggtc gacgcagccc gaacctggct ggaccacaat 2185501 ggctttcacc gtgtcgggtt gtactacgag aagttcgtgg ccagcggggc ggcgcgccgc 2185561 cgcaccccgg ctcggctgga ttacgcgggc gtggacattg ccgaggtgtg ccgccgcggc 2185621 cgcggcaccg cggtggtcat cggcggcagc atcgcgggca tcgcggcggc gaaaatgctc 2185681 agcgagacct tcgatcgcgt catcgtgctg gagaaggacg gcccgcaccg tcgccgcgag 2185741 ggcaggccgg gcgcggcaca gggttggcac ctgcaccacc tgctgaccgc cgggcagatc 2185801 gagctggagc gcatcttccc tggcatcgtc gacgacatgg tgcgcgaggg agcgttcaag 2185861 gtcgacatgg ccgcgcagta ccgtatccgg ctgggcggca cctggaagaa gcccggcact 2185921 agtgacatcg agatcgtctg cgcgggaagg ccgctgctcg aatggtgtgt gcgccgccgg 2185981 ctcgacgacg aaccgcgcat cgacttccgc tacgaatcgg aggtggccga tctcgccttc 2186041 gaccgcgcca acaatgccat cgtcggcgtc gccgtggaca atggcgacgc cgacggaggc 2186101 gacggtttgc aggtggtgcc cgccgagttc gtcgtggacg cgtcgggcaa gaacacccgc 2186161 gtgccggagt tcttggagcg tctcggtgtt ggcgctcccg aggccgagca ggacatcatc 2186221 aactgcttct actccacgat gcagcaccgg gttccgccgg agcggcggtg gcaggacaag 2186281 gtgatggtga tctgctatgc gtaccgccct ttcgaggata cctacgccgc gcagtactac 2186341 accgacagct cccgcaccat cctgtccacc tcactggtgg cctacaactg ctattcgccg 2186401 ccgcgtaccg cccgagaatt ccgcgcgttc gccgacctga tgccgtcccc ggtcatcggg 2186461 gagaacatcg acgggctgga gccggcatcg cccatctaca atttccgcta tcccaacatg 2186521 ctgcggctgc gctacgagaa gaagcgcaac ctgccgcggg ctttgctggc ggtgggcgat 2186581 gcctacacca gcgccgaccc ggtgtcgggt ctgggtatga gcctggcgct caaggaagtt 2186641 cgggagatgc aggcgctgct ggctaaatac ggcgccggtc accgggatct gccgcgccgg 2186701 tactaccggg cgatcgccaa gatggccgac acggcctggt tcgtgatccg cgagcagaac 2186761 ctgcgcttcg actggatgaa ggacgtcgac aagaagcgcc cgttctattt cggtgtgctg 2186821 acctggtaca tggaccgcgt gctggagctg gtgcatgacg atctcgacgc gtaccgggaa 2186881 ttcttggccg tcgtccatct ggtcaagccg ccgtcggcgc tgatgcgacc caggatcgcc 2186941 agccgcgtcc tcggcaaatg ggcacgaacc cgattgtcgg gccagaagac gttgattgcc 2187001 cgcaactacg aaaatcatcc gataccagcc gaacccgcgg accaacttgt aaacgcttag 2187061 gagagcccaa cgtgtcgcag gtccatcgaa tcctgaactg ccggggcacc cgcatccatg 2187121 ccgtggcgga cagcccaccc gaccaacagg gaccgttggt ggtgttgctg cacgggtttc 2187181 cggagtcctg gtactcgtgg cggcatcaga ttcccgcgct tgccggcgcg ggctaccgcg 2187241 tggtggccat cgaccagcgc gggtatggcc gctcgtcgaa ataccgggtg caaaaggcct 2187301 accgcatcaa ggaattggtt ggcgacgtcg tgggcgtcct cgactcctat ggtgcggagc 2187361 aggctttcgt ggtgggccac gactggggtg cgccggtcgc ctggaccttc gcctggctgc 2187421 accccgaccg atgcgccggc gtggtgggaa tcagcgttcc gtttgccggt cgcggcgtga 2187481 tcggcctgcc gggcagcccg ttcggcgagc gccgtcccag cgactaccac ctggagctgg 2187541 ccgggcccgg aagggtctgg tatcaggact atttcgccgt gcaggacggc atcatcaccg 2187601 agatcgagga agacttgcgg ggctggctgc tcgggttgac ctacaccgtt tccggtgagg 2187661 ggatgatggc ggcgaccaag gcggccgtcg acgcgggcgt cgacctggag tccatggacc 2187721 cgatcgacgt gatccgtgcc ggaccgctgt gtatggccga aggcgcgcgg ctcaaggacg 2187781 cgttcgtcta cccggagacc atgccggcct ggttcaccga ggccgatctc gatttctaca 2187841 ctggcgaatt cgaacgttcc gggttcggcg ggccgctgag cttctaccac aacatcgaca 2187901 acgactggca cgacctggcc gaccagcaag gcaagccgct caccccgccg gctctgttca 2187961 tcggcggcca gtatgacgtc ggcaccatct ggggcgcgca ggccatcgag cgtgcgcacg 2188021 aagtcatgcc gaactaccgc ggcacccaca tgatcgccga cgtcggacac tggatccagc 2188081 aggaagcgcc cgaagagacc aaccggctgt tgctcgactt cctaggcggg ctgcggccgt 2188141 gagctgcacc ttcgacatgg tcccggagac cgtcgatcat ctcgacgagg tcgggctgcg 2188201 gcgggtcttc ggctgctttc cgtgcggcgt gatcgccgtc tgcgcgatgg tcgacgacca 2188261 gccggtcggc atggcggcca gctcgttcac gtcggtttca gttgacccgc cgctggtatc 2188321 gatctgtgtg cagaactgtt cgacgacgtg gccgaagttg cgcgaccgcc cacggctcgg 2188381 tgtgagcgtg ctcgccgagg ggcacgacgc ggcctgtatg agcctgtcgc gcaaggaagg 2188441 taaccggttc gccggggtgt tctggagcga attgtccagc gggggtgtgg tgatcgccgg 2188501 ggccggcgcc tggctggatt gccgcccgta cgcggagatc ccggcggggg atcacctgat 2188561 cgccctgctg gagatctgcg cggtgcgcgc cgatcccgag acaccgccgc tggtgtttca 2188621 cggtagccgg ttccgccggt ggagtctcga tgaagacgac cgatgtgcgg gtacgtcgtg 2188681 cgatcacggc gatggcgggc ggtcacgccg tggtcctgac cggcgacccc aatggcgatg 2188741 gctatctcgt cttcgccgcc caggccgcga cgccgcggct ggttgccttt gcggtccggc 2188801 acacctcggg ttatttgcgc gtcgcgctgc cgggcgccga atgcgagcga ctgcacctgc 2188861 cgcccatgtg tgaccgagac accacgcatt gcgtgtcggt cgacgttcgc ggcaccggca 2188921 ccggaatctc ggcgagcgat cgcgcctgga ccatcgcggc actggcttcg gccacctccg 2188981 tcgccgccga tttccaacgt ccgggccatg tggtgcccgt gcaggcgcaa gccgacggtg 2189041 tgctgggtcg gcggggaccc gccgaggcgg ccgtcgacct ggcccgcctg gcggaacggc 2189101 ggccggccgc cgcgctctgc gagatcgtct cgcccgataa tcccgtccag atggcgcacc 2189161 acgccgagtc ggtcgaattc gccgtcgaac acggactggc catggtctcg atcggggagc 2189221 tggtggcgta tcgccggcgg atcgagcccc aggtggtccg gtttacggca gcgacgctgc 2189281 ccacctgggc cggcgcctcg cgtgtcatcg gctttcgtga cgtttacgac ctcggcgagc 2189341 atttggcggt catcgtgggt gcggtcggtg ccggggtgcc cgtgccgctg cacgtccaca 2189401 tcgagtgcct gacgggcgac gtgttcggct cgacggcgtg ccgctgcggc gaggaactca 2189461 acggcgcgct ggcgaggatg tcggctcagg gcagcggcgt ggtcttgtat ctgcgtccgc 2189521 ccggacccgc gcaagcgtgc ggcttgttcg cccggggcga tgcggcgacc gatgtcatgc 2189581 cggagaccgt gacatggatc ctgcgcgatc ttggggtgta tgcgatccga ctttccgatg 2189641 atgtgccagg atttgggctt gtcatgttcg gggcgatccg agaagccagc acgttggcgg 2189701 ccgcaggttg aaccatccag acctggccgg caaggtcgcg atcgttactg gggcgggcgc 2189761 cggaatcggt ctggcggttg cccggcgact cgccgacgag ggctgccatg tgctgtgcgc 2189821 ggacatcgat ggtgatgccg cggatgccgc ggccaccaaa atcggttgtg gcgcagcggc 2189881 ctgccgggtt gacgtcagcg acgaacaaca gatcatcgcc atggtcgacg cctgtgttgc 2189941 cgcgttcggc ggggtggaca agttggtcgc caacgccggt gtcgttcatc tggcttcgct 2190001 catcgacacc accgtcgagg acttcgatcg ggtcatcgcg atcaatctcc gcggcgcctg 2190061 gctgtgcacc aagcatgcgg caccgcggat gatcgagcgc ggcgggggag ccattgtcaa 2190121 cctgtcgtcg ttagcgggcc aggtagcggt gggcggcacc ggcgcatacg gcatgtcgaa 2190181 ggccggcatc atccagctca gccgcatcac cgccgccgaa ctgcgctcgt cgggcatccg 2190241 ctccaacacg ctgctgcccg cattcgtcga caccccgatg cagcagaccg ccatggcaat 2190301 gttcgacggg gccctgggcg cggggggtgc gcgctcgatg attgcccggc tgcagggccg 2190361 catggccgca ccggaggaga tggccggcat cgtggtgttc ctgctgtccg acgatgcgtc 2190421 gatgatcacc ggcaccaccc agatcgccga cggcgggacg attgccgcgc tgtggtgatc 2190481 ccctcgggtc aggcggtttc gaaagatcac gcgagacatt gcctgcgacg gcatgctaca 2190541 tatgtgattc cggtgtattc gggcctctgc gcattgcttt cgatcacaat gagcttggcc 2190601 gcgagccgtc ttgttcgttg agccacgggg ccgttcgaat gcgttcgtca gaactccggc 2190661 tcggattctc gctagtttgc tgacgtgtca tcgagagcaa tcgacggcga cctcgagggc 2190721 cgtgcagatg gcgcgcatcc ggatgtcggc gaggcggcca agccgattca ccaataccgc 2190781 gaccgagaca ctttcgactg agtccaaatt caccgcggaa cggcgcggga tcgggtcgga 2190841 accgggttca agaacaacct cactggctag ccctcggatg gtcgtggtgc agggcgcgac 2190901 aagtgcgcgt cgcagccgag ggatcgcggc atcgcgcgac agcacgacga ctggtcgccg 2190961 accgatctca gccatctcac accaccacac ctctccgcgc gccggaagtg cggtcacgag 2191021 tctccagccg cccgccgcca cgacgctaga tcgccccact cgtcgggctc atcgaccggg 2191081 tgcttgtcgt aggccgcata gctggcatcc acctcggccg atcgatgacg agccagtaat 2191141 gccgcaaggg cctcatcgat gagggctgcg tcagtgattc ctgcccgcat gtcgcgcgca 2191201 cttgtcaaga gtgcggcgtc gacagtagtg ctcagccgta tgcgattcat gccactacta 2191261 tgccacactc cggggcgtgg atccgcctga tcggacgcaa cgtgctcgat acgggcgaaa 2191321 cattggtcgc tggacgaatt gatgaggtct accgcgcagc gcaacgtcac ctgcaaccgg 2191381 gccgtcttca cggtgcgggt tccgtgtcga tgaacgacgc tgcggcacaa cactttttgt 2191441 acttgtgccc cgagccgcac cagcactgtt ggttgcggcc cggtggccag gccatcacat 2191501 cgtggtcacc gtgtgctgtc aggtacgcgg catactcggc gcgggcctcc ggcgagtccg 2191561 gctcctgacc ctgttcggcg caccaggcag cgaagggtgc cacgcggatc gcggcgaccg 2191621 ccagtcctgg gaaaccagcc tcggcgaatt cgaccagctt ttgctgcatc ctccggcagt 2191681 acagcgggtg cgccaccggc ccgtccggac cggtcaccag gtcgctgccg gcgaagtctg 2191741 gccacaggtc gagcgcccgc tcgtagtcac cggcaggcag ccacgccaat gacaccgcgg 2191801 tgatcggttc ggcggattcc gccgcgggtg tctcatcgac gggaggcacc cggctggctc 2191861 cgttgtcact catggtccaa catcctgccg catcaccacc gcacgcggca tatgatgctc 2191921 gcagtcgcgg tggtgcggcc ttatcgccat gagcgaaatc ttctgtatca ctgatcattc 2191981 cgagcctatg acggcccggt tcttgtcagt ggtgcttcgt agaatccgag gcatgaggtc 2192041 ggacacgcgc gaggagatct ccgcggcgtt ggatgcctac cacgcctcgt tgtcgcgggt 2192101 gctcgatctc aagtgcgatg cgttgaccac cccggaattg ctggcctgtt tgcagcgact 2192161 cgaggtcgaa cggcgccgcc agggcgccgc cgagcacgcc ttgatcaacc aactcgctgg 2192221 gcaagcctgc gaggaagagc tcggcgggac gctgcgcacg gcgttggcca accggctaca 2192281 catcactccc ggtgaggcca gccgccgcat cgccgaagcc gaagacctcg gtgagcgccg 2192341 cgccctgacc ggtgaaccgc tgccagcgca gttgaccgcg accgcggccg ctcaacgtga 2192401 gggcaagatc ggccgagaac acattaagga gatccaggcc ttcttcaagg agttgtccgc 2192461 cgcggtggat ctgggtatcc gcgaggccgc cgaggcccag ctggccgaac tggccaccag 2192521 tcggcgtccc gatcacctgc atggcctggc cacgcagctg atggactggc tgcaccccga 2192581 cggcaacttt tccgaccagg agcgtgcccg caagcgcggc atcacgatgg gtaagcagga 2192641 atttgacggg atgtcacgta tcagcggtct gctgaccccg gagttgcggg ccaccatcga 2192701 ggcggtgttg gccaaactgg ccgcaccggg ggcgtgcaac cccgatgacc agaccccgct 2192761 cgtggatgac acaccggatg cggacgcggt gcgccgcgac acccgcagcc aagcccaacg 2192821 acaccatgac ggtttactgg ccgggctgcg cgggttgttg gcctccggtg agctagggca 2192881 gcatcggggg ttgccggtga ccgtcgtggt gagcaccacg cttaaagagc tggaagccgc 2192941 caccggcaag ggggtaaccg gtggtggttc gcgggtgccg atgtcggacc ttatccggat 2193001 ggcgagcaac gcgcaccact atctggcatt gtttgacggc gctaagccgt tggcgttgta 2193061 tcacaccaag cggttagctt ccccggcgca gcgaatcatg ttgtacgcca aggatcgtgg 2193121 ctgctccagg ccgggttgcg acgccccggc ctaccacagt gaggtccacc acgtaacgcc 2193181 gtggacaacc acccaccgta ccgacatcaa cgacctcaca ctggcctgcg gccccgacaa 2193241 tcgccttgtc gaaaaaggct ggaaaacccg caagaacgcc aaaggcgaca ctgaatggct 2193301 accgccggcc cacttggacc atggccaacc acgcatcaat cgataccacc accccgagaa 2193361 aatcctgtgc gaacccgacg acgacgaacc acattgacac ccaatgaccg tggcattgcc 2193421 ggtcacgtcg caaccaagta ctgcgaccgt agccgcgctc aaggctcggg gtagacgagc 2193481 gcggagagag gcacgttgcc gagctgcctg ccgacgacga gtatcccaat atcgtgctca 2193541 cccatagcgt ttcagcgggc aaccaacgat tgccggccag cgaatctcgg tggcggtagc 2193601 cagcatgaag gacgcagatg acctcgccga ctacgggctg agcatagagc aggtgcgtgc 2193661 agccgtcgac tcgcatgtgg acgtggacca ttctgtctca gcgctgtgac cgtacggtag 2193721 agttcgccat cgtggctgac gatgacgtca ccggtcagga tggctccggc gacggcaccg 2193781 atccgcgcac catgctgggc cggtttgcca accagcacaa cgaatgggtg cgcctgagcg 2193841 tgcgccacgt gctcgatgcg ggcgaagcat tggatgccgg acagattgat taggtctacc 2193901 gccactttcg gcaggaaaag gcactggaca cacgccaccg agccggccgt accaccgttg 2193961 acactcggca tcagcaaccc ggaaacagcc gaacccctga tcatctggcc gacctcgccc 2194021 ctggccgcac cgcgaccatc gggctgcggg attccagctg cctgcgcgtg gaccgctaca 2194081 acgaccaggc gtccgggcga gcgctcatcg agatccggtt gtgcaacgaa cgtgccacgc 2194141 cgatgccaat cccgatcggg ctgtggatgt ttcagaccaa gctccacgtc aacgccggcg 2194201 gcgctgacgt gttcctgccg gtctgcgacg tgctggagca agacctcgcc gagcgcgacg 2194261 aggaggtacg ccagctgaac ctgcagtacc gcaaccggtt ggagtatgcg atcgggcgga 2194321 cttgctcggc ggcctggtcg gtgaacggct cgcggcgccc gtcggcagtg tggaccacct 2194381 ggctgccggt cgccgaaaca ccccacaccc gggcccggtc ggtggagaac gcgctgttgt 2194441 ccatggacag tcgcggaggg gttacgtagc ggactggcgt cgttcgtcgc gggatatgga 2194501 agctggtttc agggtcaggc ggctgtcgcg gccgagctgc ccgagcacct gcacccgacc 2194561 gccgacgaga ggctggctca tgttgcggcc gaaaaggaag cgctgcgctg cttccagttc 2194621 atgaaccagg tgatgcgcga tcaccgtaaa agcttgtcag aggtgcagtg aacactgttt 2194681 ccatgaccaa gagcaacggg cactgttgag acacagcgcg tcgccaacgg gcgctgcctg 2194741 tggccgaaca tcgtaaatca agcatattcg tcaacagata tcatcaatgt cggcgccgga 2194801 ctattcaaat catcgatata ctggtggcct ggtccttcgc catcgatcaa tggcgatagc 2194861 ttatcgagga tttctaccaa cttcgtgtca tcgaagcgcc atacaacggt ttgcgatccc 2194921 agttccatat ccgcagttcc gctttctcga actatccgtt gctgtacacc atctatgtcg 2194981 aaagttgcct gaccactctc atgggccgat cgcacggcgt actggaaaat gcgaagccca 2195041 tcccggtctg cggccgccag aaccacgtca ccgaagtagt tatccggctt gatacggaaa 2195101 acggtcattc tggtccaatc acttgtgagt cggaaggtcc ccgatgggaa tattctgcca 2195161 cctggcggtc ggcgaatcgt gggggttgta atcccaatgc ggatagcggt aattgtctcc 2195221 cggaaaatat cgccactcgc cgccgttctc gtcccagagg ctttcgccgc cgcgagcctg 2195281 ttggatagga cgactgggcg gtccaaccgt taggttgctc tcggcgggcg ggctgacacc 2195341 gggcggaggt aagccttcgt tcggttgtgg tccagcgggg tcgggagcag gagggggttc 2195401 gccttcgacc ggcacgccca attcgcctaa ccgggctctg attgcggctt gtctttcaag 2195461 gagagaaccc ttgtccgcga tgcacgcgtc gtaggctgcc tgctcgttgg gcagaacgaa 2195521 ggtgcgtccg catcgggcgt tgtaccgagc gatgtcagcg ttgacggcgt cccaggctgc 2195581 gcgtgcctgt acggctgtca tgtctttcgg gtcgcccggc atcggtgagg gtggatcttg 2195641 tttccagctg cggtcgaccg cgtggatttg cggtttctcg ttgtggggca ccggtgtcgg 2195701 tagcgacggt gcgattgggg gttcgtggaa gccgacggtg ttaaggggtg cggtagcggt 2195761 ggctattttg gcggccactt cgtgttcgac tccgatgagt tgtgtcgcgc gttggcggat 2195821 atccccggcc aatgcttgtg cttgggcctg gcgagctgct tgttcggcga aggtgcggct 2195881 ggttcgggtg tcggtgaccg agaggtcctc ttcgacgttg aagcccgcgt tgtgggcatc 2195941 ttgaacggca tagatgaccc tgcgctgggc tgcgccgatg gttccggcgc cttcacgggc 2196001 aagcccactc gcttggcgca aatgctcggc tatgccactg actatctgta ggtcagcgcc 2196061 ggttcgctgt cgcagccatc accgcctgcg ccttcccacg cgatgaagtg ggatcggtta 2196121 cgcatctcta ggaacacgtc ttcccactga tcggcgacct tcgtccagta gtaggccgcc 2196181 tcgatgagat gttcggtgtc ccaggcgtgg atatgcgaca gggtcggcag caattacacc 2196241 agcctcgttt gcggcacggc cgccatctcg gccgctgcgg ccgcctcgtt gttggcatac 2196301 tccgccgcgg ccgcctcgac cgcactggcc gtggcgtgtg tccgggcggt aaacgccgcc 2196361 acggcaagac caaccgctgc gtgggcaccg cccacagccg ccgtggtggg ttggaacggc 2196421 tgccccagcg gaggtggtgc aaggacgctg agttcagtgc ttcgcccgct ccattggctg 2196481 gccgtagccg ctacctgttg gatattgacc cgcagctcac cggctttcat cctcggaaag 2196541 tttaatagcg agctacaggg tggcaactca tcgcaggtcg agccaactac tgccgggccg 2196601 ggtgaccgca gctcgtgctg aggcagcacc gaggctggct gactcaagca gtctcggcgt 2196661 atgccagcct gatcgcgaac acgggagtca accggggcaa ccgccgtccg ccggacaacc 2196721 tcgatccgat atcaattaag cgatatcgtc atctccgatg gagcagatcg tgatccgcaa 2196781 ccttcccgag gggaccaagg cggcactacg ggtccgtgct gcacgtcatc accactccgt 2196841 cgaagcggaa gcccgcgcga tcctcaccgc gggattgttg ggcgaagaag tccccatgcc 2196901 ggtactgctg gccgccgaca gtggccatga catcgacttc gagcccgaac gtctcggcct 2196961 gatcgcccgc accccgcaac tgtgacctac gtcctggaca ccaacgtggt gtccgctttg 2197021 cgcgtgccgg gacgccaccc cgccgtggcg gcgtgggcgg actcggtgca agtcgccgaa 2197081 cagttcgttg tggcgataac gctggccgag attgagcgag gcgtgatcgc caaggaacgc 2197141 accgacccga cccagagtga gcacctacgg cgctggttcg acgacaaggt gctgcgcata 2197201 ttcgtgttcg cccgccgggg cacaaacctc atcatgcagc ccctagctgg gcatataggt 2197261 tacagcctat attctggtat aagctggttt tagacgaaaa ggaccccacc tcggggtctg 2197321 atggccaggg gcagggtcgc gtgcattggg gatgcaggtt gcgactgtac acccggcgtg 2197381 ttccgcgcga cagcgggtgg gatgccggtg ctggtggtca tcgagtctgg gacaggaggt 2197441 gatcagatgg ctcgtaaagc tacgtccccg ggtaagccgg ctccgacgtc gggacagtat 2197501 cgcccggttg gcggtggcaa cgaggtgacc gttccgaagg gacaccgtct gcctccctcg 2197561 cccaagcccg gtcagaagtg ggtgaacgtc gatccgacga agaacaagag cggccgcggc 2197621 tgagcttgtg ccgtcgggat gggtgtcgca ccgtctcggc gggtcgccca agtgcataag 2197681 tgctttgtcg ctgccctccg gtaccgtcgg agccccgtcc aagccggaca acgacgccac 2197741 tcgaggcagg acaagaccaa ctgtgccgcc ccctgatcca gccgccatgg gtacctggaa 2197801 gttcttccgg gcatctgtgg atggccggcc ggtattcaag aaggagttcg acaagcttcc 2197861 tgatcaggcc cgggccgcgc tgatcgtgct aatgcagcgg tatctcgtcg gcgacctcgc 2197921 cgcagggagc atcaaaccga ttcgtggcga cattctggag ttgcgatggc atgaggcgaa 2197981 caaccacttc cgggtactgt tcttccgctg gggccagcat cccgtagcgc tgacagcgtt 2198041 ctacaagaac cagcagaaga ctcccaagac gaagatcgag acggccctgg accggcagaa 2198101 aatctggaaa agagccttcg gcgacacccc accgatctga acaacgccca accactgtta 2198161 cgaggctagg agagcacaac catgagcatt gacttccctt tgggtgacga cctcgccggc 2198221 tatattgccg aggcgattgc ggctgatccc agcttcaaag gcactctcga agacgccgag 2198281 gaggcacgca ggctggtcga tgcgctgatt gcgctgcgca agcactgcca gctgagccag 2198341 gttgaggttg ctaagcgtat gggggtgcgc cagcccaccg tgagcggttt cgagaaggaa 2198401 cccagcgacc ccaaactgtc tacgctgcaa cgttatgccc gtgcattgga cgcccggctg 2198461 cggctggtgc tcgaagttcc cacgcttcgc gaagtgccta cgtggcatcg gctctcctct 2198521 tatcggggct ccgcacggga ccaccaggtc cgggtgggtg cagacaagga aatcctgatg 2198581 cagacgaact gggcccgcca catttcggtt cggcaggttg aggtggcatg actgaccgaa 2198641 ccgacgccga cgaccttgac ctgcaacgcg ttggcgcgcg gctggcagcc cgcgcacaga 2198701 tccgcgatat ccggctgctg cgcactcagg ccgctgtcca tcgtgcgccc aagcctgcgc 2198761 agggcctgac ctacgacctc gagttcgaac ccgctgtgga tgccgatccg gccactatct 2198821 cagcatttgt ggtgcggatt tcttgccacc tgcgcattca aaaccaggcg gcagacgacg 2198881 acgtcaagga aggcgatacc aaagacgaga cacaggacgt agccaccgct gatttcgagt 2198941 tcgcggcact gttcgactac cacttgcaag aaggtgaaga cgaccccacc gaagaagaac 2199001 ttacggcata cgccgccacg accgggcggt tcgcgcttta tccgtacatc cgcgaatacg 2199061 tctacgacct caccggccgt ctcgcactgc caccgttgac ccttgagata ttgtctcggc 2199121 cgatgccggt ttctcccggc gcccaatggc cggcaacgag aggaacgccc tgaccaaacg 2199181 agggtgaatc aagctgcccg acgaccatgg tttccacacc taccgccaga tgcagcgctg 2199241 gactgtcagc ccagcggcac gggtcgagat cctgggccgc tactggtgga gaatccgccg 2199301 ccgtgccacc gaaggggcga aggcgaaatc caaaggcaag gcccgccgcg gctctcagtt 2199361 caaggttctc gaacacgggt gatgcggttc gagcccggga aggtggagcg ttagccgcag 2199421 gggagggaat cttggcgggt cggccgacaa gaggttgaac ttgactgcgg gacagcagtt 2199481 tacggctctt gtcgccacgc ctacagcgga ttcgcatacc gccggggttc attgacaacc 2199541 ggcgggggtt cgttccgccg tgtttccgag gtaggtatcg gcgggggtgt atgtcggtag 2199601 gcctcgggaa tgtccgacag gcgcgatggg agatcttcgc gttgatcacc gcgccaatgg 2199661 atggtgtcgg gatcatcccc cggctgacgg gaaatgcggc cggccattct tcctcaagat 2199721 cgagtcagag gttccggtcg acgtccatcc gttggtgcag gactcgcacg acgtcgatgg 2199781 tgccttcgcc agtcacccga tagaacaacg tgtgtgaccc ggccgagagc ttgcgatagc 2199841 cggggcgaat ctcgtcgcac gctcgtccga tccgcgggtt tgccgcagca cggtcgatag 2199901 cgtgttgaag ttcgcgcagg tactgctcgg cctgatcgac accccaacgg tcataggtgc 2199961 agtcccagat ctcttccaga tgtgcctgcg cggcaggcga gagaaggtat cggctactca 2200021 ccggccacgc gaggcgtcag cccgcttacg accgaggaat ccgtcgaagt cgaacggtgt 2200081 cgagctgccg ctgcgttcgc cggcctcgag agcctcacga agcgcgcgca gctgggtttc 2200141 acggtcctcg agcagtcgca acgcggagcg gatgacttca ctggccgacc ggtagcggcc 2200201 cgcggcgatc tcgccgtcga tgaaggcgct gtagtgctcg tcgaggacga aggacgtgtt 2200261 cttacccacg aacgcacaat accaattgtt ggtagtaggt gttagcccct gggacacccc 2200321 aagccccagc ggcagaatct cctggggatc ggcatggccg caccaggcgc ggcgcgccca 2200381 gacatgtcag agggtgaggc gacactggat gatcgacacc accgaagcgg catatcggct 2200441 gacgtatcag ccggacggca cgtcgatcac cgtccgggag aacctggtcg acatcctggc 2200501 gcgtgagctg ctcggcccga tccgcggccc gcaggaggtg ttgccgttca gcccgcgctc 2200561 gcaatacctg gtcgggcacc tcgccccggt aaagctgacc ggcgccgcgc tcatcgacga 2200621 caacgcggtc caggcccgtg ccaacgccga ggcgctcgcc gagggcggtg gcgtgccggc 2200681 ctacgcggcc gacgaaacga cgccgacacc gacgacgacg cccaagaccg cgcacccaag 2200741 cagggcctga tgatcccggc atcaatgggt ttacggtttc aggtgccacc cgatctggtg 2200801 tcgttcacca tcaccgcgtc atggataacc tacgagaccg tcgagagcgg gaggtgacca 2200861 aggccggccg tacgatagcc agcgcgatag cagtgatctc gtcccggctt catcgcgctt 2200921 gtccgggtgc gacgaccgcc aacgacaggg cctcggcggc ttccttaagg cggttgtcgt 2200981 aggtaaccag cgcggtcaat ggtgcaacgg atccggcggt ttgagcagtg gctaggtgta 2201041 tcgcgtcgag cgagcgcagt gctgggttgg ggtaggccgc cgcggtggag cgtatgaccg 2201101 cgtcgatttc gaaacggtcc agcctggcta gcacggaggg caccgccggt agcccttctg 2201161 gggagactgc gcggatggct ctggatagct caacttcggt caaagccgat gtgatccacc 2201221 gtagttcggt gcggtcatcg agccaatcag ctaaagcgtc agattcgacc tcgatccgaa 2201281 ttagcttgac cagcgccgag gtttccaggt agatcacgcg ctagtaccgc tcctcggcgc 2201341 gcatgcgctc caacagcgtt cccgagtcga gaccgccgcg catcggaatt gtgggccgag 2201401 gcgccgggcc atgcactctc gccggttgca cactgccggt gctgatcagt gagtcgagag 2201461 ggccggcaga agccgggatt attcgggcga taaccttgcc gcgctcagtc aggttgatct 2201521 cttcaccgcg cttgacgcgg gccaggacct tggacgtctc ctggttgagc gttcgtatgg 2201581 acacctcatt cacaccgata atgtactacc tatttgttct acatgctatg cgcgcaagag 2201641 gttacctgcc ccgctggtca ggatcgccag cgccaggcca ctgatctcgt cggcgactcc 2201701 ggcgtagcgc gtgagatgcc aggtgcgagc gacgtcttcg atgaagctaa tcgccgccgc 2201761 gaccagcagt cgcccctggg cgacactggt cgcgggtacc agcttgccga tgaggtcgat 2201821 ccacacggcc tcgcggtcgc cctgatttcg caggtagccg tcgcgtactt cgacagaggc 2201881 gtgcgacagt tcggtgaccg acactgccac cagatccgga gcgtccaagc tgatccgaac 2201941 gtgcccttgg acaaggccgc gcaaccgttg tgccgcttgc tgattcgctc gtagcgctcg 2202001 gatgcactcc aggcagcgcc actcgtcgag gcggcggatg agcgcgtcca ggatggcctg 2202061 tttggaagaa aacgaacggt acagccccgg gcccgcgatg ccggctccct tgccgatttc 2202121 gctggtgttg acggccggat agccctgcgc acggaacagc cgcgcgcccg cggccagcag 2202181 ggtctcgtag cgggagaaca gcacgtcggc ctcgtcgcgt gcggcatcac cggccggcag 2202241 tggcggcaat tcgcagacgg gaggcgtcct tgccgcggcc atacacgcct ggtagagaag 2202301 ctttttcagt tcctcgcccg gcaggcttag gctgtgccgg cccaggctgg tcaaagtgct 2202361 ggacaccgcc cacgcccgca actccgaatg ctgtggactc agatcgggca cctccagcag 2202421 cacgctgtca cgcatgccgg cgacgatcgc gttgatgcgg cgccggaccg ccgtgcggtc 2202481 gtcctcgttg aggtagcggg cctcgcgctg ccacagcacc gtcaacgccc gagaggcgac 2202541 cgccgcggcg atcaggtctt ccagatcggc gttcaacggc cgcggcgtcg gctccgtctc 2202601 gccctcggtg agacgacgcg cgctctggta ctgatcctgg ccggttcgga tcgcttcggc 2202661 gagcaacgcc tgcttgttgt cgtagtggcg atacaacgcg cgcgcggtca ccccggccgc 2202721 ctcggcaatg tcctccaatt tgaccgaatg gaagccacgt tcgatgaaca gtccaacggc 2202781 ctgatccaaa atctgcttct tccggtcctt tgggcggcgc ctaacgggtt gggcgacgga 2202841 tgccatcggc tcgaaccccc ttcttgcgca ccggaatcac aaatcctgct agcagcatcg 2202901 cctcagcttc accccgctca ttcttcacct cgaatgcgcc ggtcaccggg tgcgacactt 2202961 accggccgtc gttcatggtg acgtttcgag gctgtgctgc tgccaagacc ccaggaagtc 2203021 tcggacgaga gactcgctag cctccgtggt atcgggcatc cctatcaccc ctgctcgatc 2203081 ctcaatatcg gactaacaaa atacatcatc gcgcctgtat acgcgattac attgcaattt 2203141 atccttatca cccttcttag agtgcatatc agtaatagac atatcgcgct cctcgcgccc 2203201 caggaggcgg tcgacgaatt cgccgtgcgc aacgacatga gccgtcgctg agcctgaaaa 2203261 cctgcagaca aagcgcgagt gggggctggc aaaactacag gctcgttagc agcaagttgc 2203321 ttcgacgacc atggtggcaa cctcgccggt cgcgaaggct ctggtcggcg ggcccgaatc 2203381 gaggcggtca ggatgcggca tccgatcacc gcccgtcggg cgcgctgttg atgcctgatc 2203441 gtggtgcctc gccagcgtga ctcgagccaa cggcttgacc ggtgatgcgc ctgtcggccg 2203501 ccaaggcagc agagcacatc gccccgcgct ataggatact agcaagatac atcatagcca 2203561 atatatgcca gtttgcattg ctatttaccg atcagttgtc caagcaatcg cgtattggct 2203621 atggacatca gcggtctctg ccgcgtacgc tcaccaatgt caccgatcgt cgacctgtcc 2203681 ggggggccag cgtgcgccac ctcacccaac ggcccagcat cgaatccagc tggtgcgccg 2203741 cgccatggta atcgtggccg acaaggcggc cggtcgggtc gctgatccgg tcttgcggcc 2203801 ggtgggcgcg ctgggcgatt tcttcgcgat gacgctcgac acgtccgtgt gcatgttcaa 2203861 gccgcctttc gcgtggcgtg aatacctact tcagtgctgg ttcgtggcgc gggtgtcgac 2203921 gctgcctggg gtgttgatga cgatcccatg ggcggtgatc tcggggtttc tcttcaacgt 2203981 cttgctgacc gacatcggtg ccgcggactt ttccggcacc ggctgtgcga tcttcaccgt 2204041 gaaccaaagc cgtgggacgg gcagcctcga acgcggccga ttcattgggc cgcaagatca 2204101 ccgagtggcg gcagccctcg aagtgacggc ccctctgcta cgtagctaag cacgcgcgac 2204161 cggcgggctg gggagcccgg tcagcggtct catagcattg cgaacacggg acgtcgagag 2204221 gggaagagct gccatgggtg aggcgaacat ccgcgagcag gcgatcgcca cgatgccacg 2204281 gggtggcccc gacgcgtctt ggctggatcg tcgattccag accgacgcac tggagtacct 2204341 cgaccgcgac gatgtgcccg atgaggtcaa acagaagatc atcggggtgc tcgaccgggt 2204401 gggcaccctg accaacctgc acgagaagta cgcccggata gccctgaaac ttgtttctga 2204461 cattcccaac ccgcgaatcc tggaacttgg tgcgggccat ggcaagctct cagcgaaaat 2204521 cctcgagcta cacccgacag cgacggtgac gatcagcgat ctagatccca cctcggtggc 2204581 caacatcgcc gcgggagagc tgggaacaca tccgcgagca cgcacccaag tgatcgacgc 2204641 caccgcaatc gacggccacg gccacagcta tgacctggcg gtcttcgcgc tggcatttca 2204701 ccacctgccg cctacggtcg cctgcaaagc gatcgccgag gccacccggg tggggaagcg 2204761 ctttctgatc atcgacctca aacggcagaa accgctgtcg ttcacgctct cttcggtgct 2204821 gctactgccg ctccacctac tgctgctgcc atggtcgtcg atgcgctcga gcatgcacga 2204881 cggctttatc agcgcactac gtgcctacag tccctcggcg ttgcagacgc ttgcccgcgc 2204941 cgccgatccg ggaatgcagg ttgaaatctt gcccgcaccg accaggctat tcccgccatc 2205001 gctcgccgtt gtgttctccc gttcgagctc agcgccaacg gaatctagcg agtgctcggc 2205061 cgatcgccaa cccggcgaat gattcggtag tagtgcagat aagccatcgc cggtaccacg 2205121 atgaacgtga tcacgatcaa agcaatcgag aagtagttcg gaccaccccg cactagaaag 2205181 atgcagcggt agtcgtagga cactgccagc ccaaccgaga ccacgatcgc aacaagcggt 2205241 aacaccttgt cggtgaacgc atttcgccgc acagcagcat gttctactgc ctgagacctc 2205301 gccaatgcga tgagagcgat cggcacgatg atgaactgga cgaatcgggc gatcaccgcc 2205361 aggccggtca ggtgcaggtt gtcgaaccgc agcgccaacg ggaatgcgag cgccaacgac 2205421 gccgtaattg cgaaggagac catcggcacg tcgtattggt tcttgcgtga caagcgtgtc 2205481 ggcagaaccc cgctgtccgc taacgcggtc caaagccgcg gtgcaccgaa cgaggccgcg 2205541 acattgatgc cgaacatcga tatcagggct ccgacgacga tgatcgttcg gaaggtagcg 2205601 tttccgatgg ccgcggccag tttcacggtg tcgcccgacg cggcgatctt gttcgatccg 2205661 agcagcatcg ctaccgttag ggtgagcaag tagatcgcgc caaccgagaa gatcgcgatc 2205721 ggtatagctc tcggcaggtt ccggtccggc gcgtccattt cttcggcggc gttcgcgatc 2205781 gattcgaaac cggtgaatgc gtacaacgcg acaatcgtgg ccagcgccat actcgagaac 2205841 gtgcccttgc caatttcggc gacgccaagc aacgagtacg gggtcgcgct gtatgccgac 2205901 cacgccgttg cgtagttgtt cacgtgctgg gtggtgatga tccacagccc gccgacaatg 2205961 aatgccgaga gcgcgaatgc cttgcctacc gttgacgttc cgttggccca cttgatcgcc 2206021 cggttgccga agaggttgat ggccaacagc acgccgataa agccgagaaa cgtcagcgtc 2206081 ttcacactga acagttgctc ggcgtcggcc caggccttgt cggggaaggc cactcgcaac 2206141 agcgtcgaga cgaaaaaaga agccaacacc ccccaagcga tggacgcggt aatggcgtgg 2206201 gtgacaccga catagatgcc gatccggcgc ccaaatgcgg ccgttgtgta ggcgtaggag 2206261 gcaccgtttg ttctgacgta ccttgccgcc gtcgcgaaga cgatcgccac gacacccgcg 2206321 aaaatgccag ctaaaacata ggccatcggc gcgaagggtc ctgcgagccc gatcacctca 2206381 cctggagtta ggaagatacc ggcgccgatt atcgagttga tcccgagcat gacgacgctg 2206441 cagaaaccca gcttgtggat cgcatatcct ctcgtccgcg ggccgaccac cgcaccaagg 2206501 ctgtctagca gggaatcctc taacgcacca tagattctct agcgacgatt cttgagctcc 2206561 cggcctgtcg atgccggcgc tgcaggtgag tcaccgcagt gggcgcaccg aacactcact 2206621 tccgccgccc caaatccgcg cagtgaccac cgcgcggtcc tcgcgagtct aggccagcat 2206681 cgagtcgatc gcggaacgtg ggaccaatac ctgggttggg ccggctgctt cgggcagcaa 2206741 ctcccccggg ttgaagaaga aaatcacccc gtcgttcgtg actgcgaagt tctgataatt 2206801 caccgggtcc aagccggcat tcggcgctat cgatacctgt tgtccggtct gcttgctcag 2206861 ttcaccttgc acaatgggga agacgactgg cagcggatcg gtgtcagcct gccacagcgt 2206921 gtcataggtg attggcttgc gataggcctg gtcccaatcg aaggccttgt acgtggtcgt 2206981 tgggtgcgtg ccgccggcgt tctggtagac cttgagcacc acggcctgcg taccacgcgg 2207041 cggtatcgcg gactggtatg tggccgaggt gatattcaat tcgtaggggg cttcgcgtgg 2207101 agtggacgat gtggccgcgc tgaggaactt gtcgcgcgtc tgggcgatgt aattttccag 2207161 cgacttctgg tcggggtagt aactgggcag gctgatgttg atgttgtagg ccgggtcgga 2207221 catttgaatc tggcacgcct ggccggtatc ggtgcctttc aactcctcgc agtaggtctt 2207281 gggcgcggcc gtggccacac ccgaacaaca gagcaaaacg acagccgtga ccagcatgaa 2207341 gatcttgatg cgcacgtcga aattcctccg ggagtagttt gcagcaccgc cggccgcagg 2207401 cgggagattg gattgccgcg atatctgagt cgacgacaaa catagggcat cgcgctgctg 2207461 acgacgatgc ctgaccagac tcaagctagc agatcgatcg ggcccggtgt cgcgtggtgc 2207521 tcgacgcccc cgacgcgctg ggcggttaga agtcccagtc ggtgtcggtg gtgggttggt 2207581 gggtgcccat tacgtatgag cttccggagc cggagaaaaa gtcgtggttc tcccctgcac 2207641 cggggtcgag agctgcgcgc acggccgggt tcacctggca ggtgtcacga tcgaatgcag 2207701 gctggtatcc caggttggct agcgccttgt tggcgttgta acgcatgtag ggcaaaacgt 2207761 cgtcggtcca gcccaactcg tcgtacaagt cgtgcgcata gtcgatctcg ttcgcgtaga 2207821 gcgtgtgcag cagctcgcag gtgtattcgc ggtggtcggc ccgctcggcg tcggtcaggt 2207881 cggccaaacc tcgttgacat ttgtagccga tgtagtagcc gtggacggct tcatctcgga 2207941 tgatcagccg gatcagatcg gcggtgttgg tgagcttacc ccgcgacgac cagtacatgg 2208001 gcaggtagaa gccggagtag aacaggaagg actccagcat taccgacgat gctttgcgct 2208061 tgagcgcgtc gtcaccgcgg tagtagtcga cgatgatctg cgcttttcgc tgcaggtaag 2208121 ggttctgttc cgaccagtcg aaggcatcgt cgatctgctt ggtcgagcac agggtcgaga 2208181 agatcgagct gtagctcttg gcgtgcactg actccatgaa cgccatgttg gtcaggaccg 2208241 cctcttcgtg gggggtgacc gcgtcgtcga tcatggccac tgctcccacc gtcgcctgcg 2208301 cggtgtcgag cagggtcaag ccggtgaaca cccggatcgt cgtctgctgc tcggtggaac 2208361 tcaacgtttg ccaagatgcc aggtcgttgg agagcggaat cttttccggc aaccaaaagt 2208421 taccggtcaa acgttcccag acctgcaaat ctttagcatc gagcaaccgg ttccaattga 2208481 ttgcgtgcac ccgctcaacg cgcttgccgg tcatcgaggg ccgtcctgcc ttgccatggt 2208541 catgccgctg ttggccggtg cgtacgctcc tgtgggcgtc aagtccggca gtcggtcctt 2208601 gggcatttcg gccgtcctcc ttgtcattga cggtctttca tggcgtgcac cagcactgta 2208661 gcttagtgat ttcggctacc catattttat tcttcgtgtc gctgaactca ttacaaacag 2208721 cgatcaccgc gcatacggtt acgcgacgcc tggccagtag ccgacgacgc cgcggaactc 2208781 aaggtcggtt tgcgggaagt cgttgccgac ggccagcagt ggttggtggc ccagctgggc 2208841 ggtcgcgtac gtcatacagt ctccgaagtt gagagccgcg cggtggcgcc ccttgccgta 2208901 tcgcagaaag gctcgttgcg tggcagcggc atgctcggcg gtgaaagatg acacgctcaa 2208961 gccgatttcg ctgcgaagtc gttcgaagat cgtgcgcgca acggggccgt gacgggcggt 2209021 caagacaatc aggcattcgg cgacggtggg tgcagacatg acggggctat gggcgccggc 2209081 cagggcggcc gcgaccaggg tggcgtgcgg ccgctcgcct tgaaccaggg ccaccacggc 2209141 gcttgtgtcc acgatcattg cggtgctcag actccggttg cggggtcgta gccgaggatt 2209201 tgttcgcgct cgagcttggt gatgggggag cggtcggcaa gcaggggcca gatttcggta 2209261 cgcaagatgt cgagaagttg tgcctcacgg tcgccggcgc gcgactccaa aaacgccagc 2209321 tgggcagaca gggcatgccg gatggcggca gtcttgctgg tgtgcagccg gtcagcgagt 2209381 tcggcggcta gtcggtctac ctcagggtct ttgatattca gcgccacagg tagatggtac 2209441 cagcaaatag ccactatcta cctaacgcgt gctgtgccgt gcggtagcta ctgaaaatcc 2209501 gagatgtcaa aggcagcgtc tggatacgct gtatgcgcgc agggatggtg atcgaggcgg 2209561 aggggcggcg tgtcatttct ggtcgtggtt cccgagttct tgacgtccgc ggcagcggat 2209621 gtggagaaca taggttccac actgcgcgcg gcgaatgccg cggctgccgc ctcgaccacc 2209681 gcgcttgcgg ccgctggcgc tgatgaggta tcggcggcgg tggcagcgct gtttgccagg 2209741 ttcggtcagg aatatcaagc ggtcagcgcg caggcgagcg ctttccatca acagttcgtg 2209801 cagacgctga actcggcgtc aggatcgtat gcggccgcgg aggccaccat cgcgtcacag 2209861 ttgcagaccg cgcagcacga tctgctgggc gcggtcaatg caccaaccga aacgttgttg 2209921 gggcgtccgc taatcggcga cggagcaccc gggacggcaa cgagtccgaa tggcggggcg 2209981 ggtgggctgc tgtacggcaa cggcggcaac ggttattccg cgacggcgtc gggggtcggc 2210041 ggcggggccg gcggttccgc ggggttgatc ggcaatggcg gcgccggggg agccggcgga 2210101 cccaacgccc ccgggggagc cggcggcaac ggtggctggc tgctcggcaa cggcgggatc 2210161 ggcgggcccg ggggcgcgtc gagcatcccc ggcatgagtg gtggagccgg cggaaccggc 2210221 ggtgccacag gacttttggg ctggggagcg aacggcggag ccggcggcct cggtgatgga 2210281 gtcggtgtcg atcgtggcac gggcggcgcc ggaggccgcg gcggcctgtt gtatggcgga 2210341 tacggcgtca gtgggccagg cggcgacggc agaaccgtcc cgctggagat aattcatgtc 2210401 acagagccga cggtacatgc caacgtcaac ggcggaccga cgtcaaccat tctggtcgac 2210461 accggatccg ctggtcttgt tgtctcgcct gaggatgtcg ggggaatcct gggagtgctt 2210521 cacatgggcc tcccaaccgg attgagcatc agcggttaca gcggggggct gtactacatc 2210581 ttcgccacgt ataccacgac ggtggacttc gggaatggca tcgtcaccgc gccgaccgcc 2210641 gttaatgtcg tcctcttgtc catcccaacg tcccccttcg ccatttcgac ctacttcagc 2210701 gccttgctgg ccgatccgac aacaactccg ttcgaagcct atttcggtgc cgtcggcgtg 2210761 gacggcgttc tgggagttgg gcccaatgcg gtgggaccag gccccagcat tccgacgatg 2210821 gcgttaccgg gtgacctcaa ccagggagtg ctcatcgacg cacccgcagg tgagctcgtg 2210881 ttcggtccca acccgctacc tgcgcccaac gtcgaggtcg tcggatcgcc gatcaccacc 2210941 ctgtacgtaa agatcgatgg tgggactccc atacccgtcc cctcgatcat cgattccggt 2211001 ggggtaacgg gaaccatccc gtcatatgtc atcggatccg gaaccctgcc ggcgaacaca 2211061 aacattgagg tctacaccag ccccggcggt gatcggctct acgcgttcaa cacaaacgat 2211121 taccgcccga ccgtcatttc atccggcctg atgaataccg ggttcttgcc cttcagattc 2211181 cagccggtgt acatcgacta cagccccagc ggtataggga caacagtctt tgatcatccg 2211241 gcgtgatcga gcctgttcgc cgcgaatgtc gccgcctggc ttgtcatccc cgactgaaca 2211301 tacgaaacat gcgccataat attgccgcct ccggtgcata ttggatcgtc gggagcacac 2211361 aagtttatgg tcttagagct atacagcgga ccgattgtcg gcaacgaccc gccgccccac 2211421 aacatgctgg agaaaccact ggatggctcg ccgaaaaggg cgacagcggc gacatgatct 2211481 gccaccgcgg gcggcatcgc cgaggtggac aaatcgatga ccgtcgcacc ctgcgaatag 2211541 ccaccaagca caatcctggt gttcgggcag ctggcgacgg tgcgctggat gtgggcgctc 2211601 gcatcatcgg aaccgtttga cgcgctcgcg cggtagtcgt cgcttgctgg gtagttcacc 2211661 gcgtagaccc caatcgaccg cccgccaact tgcgaggtaa gcgagtcgac gaacgcctca 2211721 ccgacgtcgc caagaccaga agcctgatgc gtgccgcgag cgaaaacgac cgcgatgtcc 2211781 gaacacggat ccgcatgcgc ggcacgaccg ccggcgggtg cgctcaccag cgccaaggtc 2211841 gtcgcaacca cgacaccaac gatgcgaaca aggctgcgtg gagtcatctg cacatgctga 2211901 catactgccg gcgaccgagg tggcggtggg ccgctgagac atgacgtgcc tcacgtcgtc 2211961 ggcgcccacg cagccccagg tcagaacggt agccttaggc gatgaccgac tctgtggtcg 2212021 tccgcgtcaa gcccggcagt cacaaaggac ccctggtcga ggtcggtccc aacggtgagc 2212081 tgattatcta cgtccgcgag ccggcgattg atggcaaggc caacgatgcg gtcacccggc 2212141 tgctcgcagc tcaccttcaa ttgccaaaga gccgagtcaa attggtgtcc ggagcgacgt 2212201 cgcggttcaa gcgtttccgt ctgagtcgtt aagttcaacc tgtttgagga agcgggtcca 2212261 gcaaggccgg gacatcgaga ccaagccgcg ctgacacaac aacatgctgg cgtcggtcaa 2212321 cccggtcggc ggcggcgttg ctggccccgg tacagaccgc ttgccgccgc cctcaccgtg 2212381 tcggtaattc gcgcgatgat cggactgtcc agtttccagc attgccaata gagagggacg 2212441 tcgaggtgta tgtcgcagac ccgtacgaac gatccatcgg caagcggaga tgctgccagc 2212501 ttctcgggga acatgcccca tcccagcccg gcgcgcgctg cggcggtgaa gccctctgtg 2212561 gtcgggacaa agtgcgtcgg tctggtgatg gcgcgacgaa aggccttacg caccaacatg 2212621 tcctgcagcc catcgtcacg attccacgcc agtgacggag ctttagccgc cgcggcggca 2212681 gtgaacccgt cggatagatg gcgctggacg aatggcctgc tggccactgg taggtagcgc 2212741 atttcaccca gcgggtgcac ccggcagccc ggcaccgggt tccgctcggt ggtcaccgcg 2212801 cccatcgcca caccctcccg tagcagccgc gcggaatggt cctggtcctc gatccgaacg 2212861 tcgagcagga cgtcgccgag accgtcgaac acggccgaaa accatgtcgc catggaatcg 2212921 gcgtttaccg caatggtgat ccgcgtgcgt ttcagcgacg cgttgccacc catttcagcg 2212981 agcgcctcgg actcgagcaa cgctgtttgc gcggccaacc gcaacagcgg gatacctgcg 2213041 gtcgtcgccc gacatggctt ttccctgacc accagcacct ggccgacctg ctgctccaac 2213101 gacttgatgc gctgactgac agccgagggg gtgacatgta ggcgctccgc ggccgcatcg 2213161 aagctgccca gttcgaccac ggcagccaat gcggccagct gtggaccgtc aagctgcgga 2213221 tccaccatct caggtgtaga ccatctgcgg agcgtcgcac tgcacattaa taatgctaat 2213281 gtaaatgaag aattattagc tatactgacc catacaaact gcctagtgtc gattgcgtga 2213341 actcaccact ggtcgtcggc ttcctggcct gcttcacgct gatcgccgcg attggcgcgc 2213401 agaacgcatt cgtgctgcgg cagggaatcc agcgtgagca cgtgctgccg gtggtggcgc 2213461 tgtgcacggt gtccgacatc gtgctgatcg ccgccggtat cgcggggttc ggcgcattga 2213521 tcggcgcaca tccgcgtgcg ctcaatgtcg tcaagtttgg cggcgccgcc ttcctaatcg 2213581 gctacgggct acttgcggcc cggcgggcgt ggcgacctgt tgcgctgatc ccatctggcg 2213641 ccacgccggt tcgcttagcc gaggtcctgg tgacctgtgc ggcattcacg ttcctcaacc 2213701 cacacgtcta cctcgacacc gtcgtgttgc taggcgcgct ggccaacgag cacagcgacc 2213761 agcgctggct gttcggcctc ggcgcggtca cagccagtgc ggtatggttc gccaccctcg 2213821 ggttcggagc cggccggttg cgcgggctgt tcaccaaccc cggctcgtgg agaatcctcg 2213881 acggcctgat cgcggtcatg atggttgcgc tgggaatctc gctgaccgtg acctagtaca 2213941 gcacgtgtgc acacgcgggt tggaccacgt gatcgtcgat gggcacatac cgttcggcag 2214001 gagggcgcgc ggtcagtctg cacaactcag tcaccagctg acacgccgac ggcggcctcg 2214061 cccgggcgtg tcggcgccac cagtgcacat tcggcgtgac gcggccctac ggatcgtgtt 2214121 ggagctgtag cccgttgata ccggtcgcga acggtgaacg gcgctaatcg ggggagtggg 2214181 gtcgaggctg tctggccttc cccgtccgca agttcgcgtt cggccgggcc gatatctggt 2214241 tcagggtggg tcgaggccaa atttcatcac ggttgcggtt gagcaaagtt gctgtagctt 2214301 gctcgcgagg agacggccga tatcgcctca ttggcattag tgttggctgt catggccgga 2214361 ctgaacattt acgtgaggcg ctggcggaca gcgcttcacg caaccgtgtc ggcattgata 2214421 gttgccatcc tcggactcgc catcaccccg gtcgctagtg cggcgacggc cagggcgacg 2214481 ttgtcggtga catcgacgtg gcagaccggt ttcatcgccc gcttcaccat cacaaactcg 2214541 agcacggcgc cgctaaccga ttggaagctt gaattcgact tgccggcagg agaatccgtc 2214601 ttgcacacat ggaatagcac cgttgcacga tctggcacgc actacgttct cagcccagcg 2214661 aattggaatc gcatcattgc ccccggtggt tcagccacgg gcggcctaag aggcgggctg 2214721 accggttctt actcgccgcc gtcgagttgt ctgctcaacg ggcaatatcc ttgcacctag 2214781 acgcgactgc gcactgaggc tcgccgactg cgacaatgcg gctactgcca ggtgggtcta 2214841 gtgggtcgtc acggccaacg tcatctcgga gttgatgcgg acggcgccag agccctgggg 2214901 ctggtgatga ccagaaggtt gcctgaaccg agaaattgga ttgatcgcag tgccggtggc 2214961 gggctacggt cgggcgcgtg ggcatctacg cagtgacggt acgtcgtgtc cgccctcgga 2215021 cggtcgcgac gggcatgggg ctggcaccgg ctccatgacg aatgggcagc gcgggtagtc 2215081 agcgcggccg cagtgcggcc cggtgagctc gtgtttgaca tcggcgccgg cgaaggggca 2215141 ctgacggcgc atctagtgcg agcgggggcg cgggtggtcg ccgtggagtt gcacccgcga 2215201 cgagtcggtg tcctccgcga gcgattccct ggcattaccg tggtgcacgc ggacgccgcc 2215261 tcgatccggt tgcccggccg gccgttccgg gttgtggcga acccgccgta cgggatttcg 2215321 tcccgcctgc tgcggacgct gctggcaccc aacagcgggc ttgtcgcggc cgatctcgtg 2215381 ctgcagcgag ccctcgtatg taaattcgct tctcgcaacg cgcgaaggtt caccctgacc 2215441 gtcggcctca tgctgccacg gcgcgcgttc ctgccaccgc cgcatgtgga ttccgcggtg 2215501 ctcgtcgtcc gccgccggaa gtgcggtgac tggcaggggc ggtaaacccg cggccgccag 2215561 taggtgtacc acctttgcta gaagtggcac acttcgttct atgtcgacca ctcgtccgcg 2215621 ctaccaaata accgaaaccc cggaggtagc tcaggcattg gaccgggccg cccagcgatg 2215681 gcctggcgag ccccgttcca aattattgcg gcgcctgatc atcgatgctc gacgatccgc 2215741 gttccgcggg tagcgtcgtt gcgccgtacg acgatggcga gctgctgcgt ctcgccgaac 2215801 tacgcgctag cagcgggcta aaactacctg attgctgcgt gccggatgtg gcaattcatc 2215861 accaggcaag cctcgcaacc tttgacgaca cgctcgctgc cgcagcacgc acaaggagcg 2215921 tgcccgctag cacaaacggc gcagctaacc caatacgacc agcttcactt gacataatgt 2215981 cgcttatcgg cttataagtg atgcgagttg ctccttacga tgaccatggc acagcggcat 2216041 ccttctctgc gccaagctgg ccagctacgt ggctcgaagt tcttggtaaa gagcaggcgt 2216101 cagatcgacg ctttgtcgca gttgtagttg gcccggccga gttcgctgtt catacgcggt 2216161 gacaacgagg ccgacaccgc ccgccgccgg cacgaggaca ccttgcatgt gcaagaacca 2216221 ggccgcatgt ccgaccgcct ggcaccctga ccagtcgtcg ccatagatgt cgtcgttctc 2216281 gagccccacg gcttcccgag cttgcggggt tgtcagatcg aggacggcca ggtccgtgac 2216341 gtcgatcgtg tgtagtcggt aggccgcctc gagcatcttc tctgcggtcg ttgaagccgc 2216401 ttgcgccgcc cgttccacct caaccatgca ggcttgggcg gaatcagcaa gatagatcgc 2216461 cggaaagagc agcggcggat tccacctgcc tccgaatctg cgcgcgccct caccggacaa 2216521 ggcgtcacgg tgcgcgccgg tataccggta gcacgtttcc gaccactcaa ttgttccgcg 2216581 tgcgtcgata cgctggacga gcccttcatc gagggcatcg ctcacacgaa cactccctcc 2216641 gccatcgcgt cgatgagcgc caacacgcgt tggtactcgc cgtctcgcac gaggtcggca 2216701 ggcttgcggt gttccagtaa ccgattcggc gaaaacatcc acacgttcgc ctggtcacgc 2216761 ggcagcactt ccgcgagggc gtcggcgaca taggccagct cgataagtcg ttgcttgttg 2216821 aggcgttggg gaaccacctg acctgcggtc catcgcgcca cggaacgcgg cgaggcatcg 2216881 acgatgtcac cgacttcctc gtaggtcaat cccaagcgct cgatcgcacc cgacacggtc 2216941 gaggcgagca catttactcc catgggcagc ctgtcttcct tttgtctatt gatttgtcat 2217001 gtattatgac acgaaccgag gcgtcgatgc gagaggaact tcacgacgat gggcattcag 2217061 tttcggctcg ggccgggtga tcacaaaccg gtcgaggact tcctgtcccg cgaccacgcc 2217121 ggcaccactg cgatcacgct ggacaccaac gccactcgtc accagcacga cgctgccgca 2217181 gccgcagtcg acgcaggcct agatgtctac tgggagccag cagccgagcg cctcgccgcg 2217241 cacccggctt cgggctcgac aagttccctc tgtgaaacgg gcagccctac gacacggatg 2217301 ccctgacgcg cgacgcggcg gcacgcgccg aactcgtcgg caggactctc gacaaacacc 2217361 cgtcgatcgt cacgcacgtc acggccccac acttctacct caccaacgag cgcaccgcac 2217421 gcctcaacat cgaccttgcc gagcgcacgc gcttggccgt cggctaggcg gaccgcatgc 2217481 gaacggcctt gaccccgagc cacgcccgta atgaatgcaa ccttgccctc aagcctgccc 2217541 acaacaccac ctccggcgag tagttccccc ggcggggggg cttacaccaa gcaggaacgt 2217601 caccgtgacg aattgtcgcg tggcgcagtg tcaaaggtcc agtacgcgac gaagtcctcg 2217661 gtcaacctcg tgcatcaagc tcgctggcac ctccccaact cggtcggtga ggtcagtctt 2217721 gttgagcgtg acaatcgccg tgacgttgac gaccgagtca cgtggcagtc gcgttgtggt 2217781 cgcgggcaag aacacgttgc cgggcattgc cgccagcgcc gtattggacg tgatcaccgc 2217841 tgcgatcaca gtggcaaggc gacttgcgtt gtacggatct gactggatta cgagcaccgg 2217901 gcggcgcttc gccggctgac tgcctgatgg cggcccgagg tcagcccagt agatctcggc 2217961 acgactaatc accactcatc gtccatggtt tctagcacgc ggtatgcgtt ggccacggcg 2218021 agggcctccg cttcgtcggt gccatggatg ctctctagag ccctgtcgat ctggcccgtg 2218081 agcaattggg cgtccagctc gtgcaggtag cgctgcgcag ccttcgtgaa gaactcggac 2218141 cgactcatgc cgagctcact cgcacgccgc gatacccgat cgaacgtctc atccggcaga 2218201 gaaatagctg tcttcataca gatagtataa ccgggtataa cttccagaag acggcggctg 2218261 tttcgtcaca gtgacgctat tgctggtcca aacacactcc acgattccgc gcgtcgctac 2218321 cccgggatag tccgatcagg tgtcttgggt ggcccggcaa gtggtttgat gcgtccggcc 2218381 cgcacgccgt tggcgatgac gatgacctcg gtgaactcgt gcacaagcac gaccgcggcc 2218441 agtccgagga tcccgaacaa cgccagcggc atcagcacgg tgatgatact tagggacaat 2218501 ccgacgtttt gcaccatgat ctgccgcgag cgccgggcat ggtctagggc ttggggcaga 2218561 tgccgcaggt cttggcccat cagggcgacg tcggcggttt cgatggcgac gtcggttccc 2218621 atggcgccca tcgcgattcc caggtcggcg gcggccaggg ccggagcgtc gttgactccg 2218681 tcgccgacca tcgcggtggg ttgccgagcc cgcagctgtg cgaccagatg agccttgtcc 2218741 tcgggccgca attcggcatg tacctgctcg atgccggctt gggctgccag ggcggcagcg 2218801 gtggcatggt tgtcgccggt gagcatcgtc acctggtagc cgccggtgcg cagcccggcc 2218861 accacctcgg cggcttccgg gcgtagttcg tcgcgcacgg cgatggcacc aagcagctgc 2218921 tggtcgcgtt cgacgagaac cgctgtggcg ccggcttgtt gcatgcacgc cacatgatct 2218981 gcgagctcgg cggcatcgag ccagccgggt cgccccagtc gcaccacccg cccgtcgagg 2219041 cggcctatca gcccggcgcc cgggacggct tgcacgtcgc tggcggcggt cgtcgcttgg 2219101 gtcgcggcaa gcacggccac agccagggga tgttcgctgc gggcttccag ggcggctgcc 2219161 accgccaaca cttcctcgcg ggtagcgccg tttgtggtgg cgacgtcgat gacgacgggc 2219221 cggttggcgg ttaacgtacc ggttttgtcc agggctaccg cgcggatggt gcccagggtt 2219281 tccagcgcgg cgccgccctt gatgagcacg ccgagtctgg aggcggcgcc gatggacgcg 2219341 accacggtga ccggaacggc gatggccagc gcgcacgggg cggcggcgac taataccacg 2219401 agcgcgcgtt cgatccagac cagcggatta cccaagacgc tgccggtccc ggcgatcagc 2219461 gccgcggcga tcatgatgct gggcaccaac ggtcgcgcga tacagtcggc tagccgctga 2219521 ctagcacctt ttcggacctg ttcggcctcc acgatgtgca cgatgcgcgc cagcgagttg 2219581 ttggccgcgg tagcggtgac ccccacctgc agcacgccca agccgttgat cgacccggcg 2219641 aacacttcgt caccgggtcc aacctcgacc ggcaccgatt cgccggtgat cgcggagaca 2219701 tccagggcgg tgcgcccggc acgaatgatg ccgtcggtgg ccaggcgttc gcccggttta 2219761 acgatcatct ggtcaccgac gtgcaattcg gttgaggcca cgatggtttc ggtgccctcc 2219821 cgcagaactg tggcctgatc cggcaccagc gacagcaggg cgcgcaggcc acggcgagtg 2219881 cgcgccgtcg cgtattcctc caagccttcg ctgatcgaga acagaaacgc cagcgtagcg 2219941 gcctcaccca gctcgccaag tgcgacagcg cccagcgcgg cgatggtcat cagggtgcct 2220001 acgccgacgc ggccttcggc cagtcgtttg aggctggagg gcacgaatgt cgaggcccca 2220061 accgccagcg caagggcctt cagtcccagt acgaccggcc acagcggata agcccatgcg 2220121 gcaactagcg acgcggtcag caacactccg gagaatgcgg ctcgccgcag tttggcgact 2220181 tgccagagct gctccggctc gcggtcctcg ttgtcctcgc cgtcgcagca ggcatcgctc 2220241 gtctcccccg atggctgcgc ggccacgtca cgccgaactc ctgatagtgt tcgcgtgctc 2220301 cagtcgatga ttttctgcac taccccggcc ttgcggttac tggccgagcg cgaagcatac 2220361 gcgggcaccg ccgccgcagg gacggtctcg gcatcgatga ttgccgacag gatggcagcg 2220421 gtgtcgcaga ttgcgcgtga ataccagatc acaatggatg ccgtccgcgg ataggcatgc 2220481 acggcctgca caccggccac cttgccgacg gtgtcctcga tcgcaacggc ccgtcccgcg 2220541 tcgaactgaa acccggtggc ctgcacacgc atccgcccgg ctgcatcgga tacaacggtc 2220601 agctggacct cggcgtcaac tacagtcgtc actcgtcgac cctggcgcca gcgggcaggg 2220661 gcgcctcctc accgatgcgc ccgcgagcct cggcaacgac gtcggcgact gtcagccggg 2220721 ccgactcggc cgccgcctcc gcgcgccggg ttccgcgcag gccccactcc atcacggtca 2220781 ccgacgcccg gcgaatgggc gccgtaccca gcgctttgcg cagcgtttcg taggcgctca 2220841 ccccgaccag tccggtgagc accgccccgg ccgccttaac caatagctca tgcgtaacca 2220901 cggtcagttc tcctttgctt tgtcctgtaa ccacaagtcg tgtcgtctgc tgctcagcta 2220961 cctgtcatct cgaccgcctc cccggacgcg gcgcgctcgg cgacacaggg ttggtcggta 2221021 tccaccgcga gaacgacctg gaccaactcg cccaaggctc gcgccaggtg actgtcggcc 2221081 agcgcatacc gaacctgccg gccctcatag gttgcgacta ccagcccgca gccccgcaaa 2221141 cacgacagat ggttggacac attcgatcgg gtcaacccga ggtgcgcagc tagctggccg 2221201 ggatagcaaa cgccatccag caacgccacc agaatccggc accgcgtcgg atcagccaga 2221261 gcccggccga gtcgagccag ggccgattcc cgcatctcac acgtcagcat agatcaaata 2221321 gtacaccata tactggtata acagcaagag ctgaattgta catccatagc agatatgatc 2221381 ggcgcgcgtc acaagcttcc ggccgcagag ccgccaactc acgatatcgt taaccgatat 2221441 cccgagccga tagctggcgg gctcgggtgg tggccagcgg cgctgcgacg aaaggtgtga 2221501 ccgtcatgaa acagacacca ccggcggccg tcggccgtcg tcacctgctc gagatctcag 2221561 catccgcagc cggtgtgatc gcgctttcgg cgtgtagtgg gtcgccgccc gagcccggca 2221621 aaggccggcc cgacacaacc ccggaacagg aagtcccggt caccgcgccc gaggacttga 2221681 tgcgcgaaca cggagtgctc aaacgcatcc tgctgatcta tcgcgagggg atccgccgcc 2221741 tccaagccga tgatcagagt cccgctccag cactgaacga aagcgcgcag atcattcgac 2221801 gcttcatcga ggactaccac ggacagctgg aagagcaata cgtcttcccc aagctggaac 2221861 aagccggcaa gctcacggac atcacctcgg tcttgcgcac ccagcatcag cgcggccggg 2221921 tgctcacgga ccgggtactc gccgccacca ctgcagcggc tgcattcgat cagcctgcgc 2221981 gagacaccct ggcccaagac atggcagcgt acatccgaat gtttgagccg catgaggcgc 2222041 gcgaggacac ggtcgttttc ccggcgttgc gcgacgtgat gtccgctgtc gagtttcgcg 2222101 acatggccga gacctttgaa gacgaggagc accggcgctt tggcgaggcc ggttttcaat 2222161 cggtggtcga caaggtcgcc gatatcgaaa aaagccttgg catctacgac ctgagccagt 2222221 tcacccccag ctaaagacac taatgccctt gggttaggga ccatcgcctc ctgacgcgat 2222281 cgcgacagct ggctaacgtc ggtagtacac ccatgcagag gggacgccaa tgtcagccca 2222341 acaaacgaac ctcggaatcg tggtcggtgt ggatggttca ccctgctcgc atacggcagt 2222401 cgaatgggcc gcgcgcgatg cgcagatgcg caacgttgcg ctccgcgtgg tgcaggtcgt 2222461 gcccccggta ataaccgccc cggaagggtg ggcatttgag tattcgcggt ttcaagaagc 2222521 ccaaaagcgc gaaatcgtcg aacactcgta cctggtcgcc caagcgcacc aaatcgtcga 2222581 acaggcccac aaggtcgccc tcgaggcatc ctcctcaggt cgcgccgcgc aaatcaccgg 2222641 cgaagtgctg cacggccaga tagtgcccac gctggccaac atctccaggc aggtcgcgat 2222701 ggtcgtgctg ggctaccgag gtcagggcgc cgtagccggc gccttgctgg gatcggtcag 2222761 ctcaagcctg gttcgccacg ctcatggccc tgtcgccgta atacccgagg agccgcgacc 2222821 ggcgcgcccg ccgcacgcgc cggttgtggt gggcatcgac ggctcgccca cctcgggatt 2222881 ggcggccgag atcgccttcg acgaggcatc gcgccgcggc gtggacttgg tggcgctgca 2222941 cgcgtggagc gacatgggcc ccctcgactt tcctaggctc aattgggcgc cgatcgaatg 2223001 gagaaacctc gaagacgagc aggagaaaat gctcgcccgg cgtctgagcg gatggcaaga 2223061 ccggtatccc gatgtcgtcg tgcacaaagt cgtggtgtgc gatcgaccgg caccccgcct 2223121 gctcgaattg gcacaaaccg ctcagcttgt ggtggttggc agccacggcc gcggggggtt 2223181 ccccggcatg catctcggct cagtcagcag agcggtggtc aattccggtc aggctccggt 2223241 tatcgtcgcc cgaatccccc aagatccggc agtgccggcc tgagggcctg tgcgatctgc 2223301 tcgggtggtg cccacccgcg cggaaagccc cgtccgaacc gtgattgggc aacgtcgggc 2223361 cgggccagca gcgctggacc gtaggtccct gcagtggatg acttacggcc ctgatccaca 2223421 ccggcgaccg ttaggcaggg ttgagccaac cgtcggttga gcgtctggct gcgaggtgag 2223481 gtgattgtcg gcgtcagtgt ctgccacgac ggctcatcat ggcttgccag cacatgaagt 2223541 ggtgctgctg ctggagagcg atccatatca cgggctgtcc gacggcgagg ccgcccaacg 2223601 actagaacgc ttcgggccca acaccttggc ggtggtaacg cgcgctagct tgctggcccg 2223661 catcctgcgg cagtttcatc acccgctgat ctacgttctg ctcgttgccg ggacgatcac 2223721 cgccggtctt aaggaattcg ttgacgccgc agtgatcttc ggtgtggtgg tgatcaatgc 2223781 gatcgtgggt ttcattcaag aatccaaggc agaggccgca ctgcagggcc tgcgctccat 2223841 ggtgcacacc cacgccaagg tggtgcgcga gggtcacgag cacacaatgc catccgaaga 2223901 gctggttccc ggtgaccttg tgctgttagc ggccggtgac aaggttcccg ccgatttgcg 2223961 gctggtgcga cagaccggat tgagcgtgaa cgagtcagca cttaccggcg agtcgacgcc 2224021 ggttcacaag gacgaggtgg cgttgccgga gggcacaccg gtcgctgatc gtcgcaatat 2224081 cgcgtattcc ggcacattgg taaccgcggg ccatggcgcc gggatcgtcg tcgcgaccgg 2224141 cgccgaaacc gaactcggtg agattcatcg gctcgttggg gccgccgagg ttgtcgccac 2224201 accgctgacc gcgaagctgg cgtggttcag caagtttctg accatcgcca tcctgggtct 2224261 ggcagcgctc acgttcggcg tgggtttgct gcgccggcaa gatgccgtcg aaacgttcac 2224321 cgctgcgatc gcgctggcgg tcggggcaat tcccgaaggt ctgcccaccg ccgtgaccat 2224381 caccttggcc atcggcatgg cccggatggc caagcgccgc gcggtcattc gacgtctacc 2224441 cgcggtggaa acgctgggca gcaccacggt catctgcgcc gacaagaccg gaacgctgac 2224501 cgagaatcag atgacggtcc agtcgatctg gacaccccac ggtgagatcc gggcgaccgg 2224561 aacgggctat gcacccgacg tcctcctgtg cgacaccgac gacgcgccgg ttccggtgaa 2224621 tgccaatgcg gcccttcgct ggtcgctgct ggccggtgcc tgcagcaacg acgccgcact 2224681 ggttcgcgac ggcacacgct ggcagatcgt cggcgatccc accgagggcg cgatgctcgt 2224741 cgtggccgcc aaggccggct tcaacccgga gcggctggcg acaactctgc cgcaagtggc 2224801 agccataccg ttcagttccg agcggcaata catggccacc ctgcatcgcg acgggacgga 2224861 tcatgtggtg ctggccaagg gtgctgtgga gcgcatgctc gacctgtgcg gcaccgagat 2224921 gggcgccgac ggcgcattgc ggccgctgga ccgcgccacc gtgttgcgtg ccaccgaaat 2224981 gttgacttcc cgggggttgc gggtgctggc aaccgggatg ggtgccggcg ccggcactcc 2225041 cgacgacttc gacgaaaacg tgataccggg ttcgctggcg ctgaccggcc tgcaagcgat 2225101 gagcgatcca ccacgagcgg ccgcggcatc ggcggtggcg gcctgccaca gtgccggcat 2225161 tgcggtaaaa atgattaccg gtgaccacgc gggcaccgcc acggcgatcg caaccgaggt 2225221 ggggttgctc gacaacactg aaccggcggc aggctcggtc ctgacgggtg ccgagctggc 2225281 cgcgctgagc gcagaccagt acccggaggc cgtggataca gccagcgtgt ttgccagggt 2225341 ctctcccgag cagaagctgc ggttggtgca agcattgcag gccagggggc acgtcgtcgc 2225401 gatgaccggc gacggcgtca acgacgcccc ggccttgcgt caggccaaca ttggcgtcgc 2225461 gatgggccgc ggtggcaccg aggtcgccaa ggatgccgcc gacatggtgt tgaccgacga 2225521 cgacttcgcc accatcgaag ccgcggtcga ggaaggccgc ggcgtattcg acaatctgac 2225581 caagttcatc acctggacgc tgcccaccaa cctcggtgag ggcctagtga tcttggccgc 2225641 catcgctgtt ggcgtcgcct tgccgattct gcccacccaa attctgtgga tcaacatgac 2225701 cacagcgatc gcgctcggac tcatgctcgc gttcgagccc aaggaggccg gaatcatgac 2225761 ccggccaccg cgcgaccccg accaaccgct gctgaccggc tggcttgtca ggcggactct 2225821 tctggtttcc accttgctcg tcgccagcgc gtggtggctg tttgcatggg agctcgacaa 2225881 tggcgcgggc ctgcatgagg cgcgcacggc ggcgctgaac ctgttcgtcg tcgtcgaggc 2225941 gttctatctg ttcagctgcc ggtcgctgac ccgatcggcc tggcggctcg gcatgttcgc 2226001 caaccgctgg atcatcctcg gcgtcagtgc gcaggccatc gcgcaattcg cgatcacata 2226061 tctacccgcg atgaatatgg tgttcgacac cgcgccaatc gatatcgggg tgtgggtgcg 2226121 catattcgct gtcgcgaccg caatcacgat tgtggtggcc accgacacgc tgctgccgag 2226181 aatacgggcg caaccgccat gatgccccgt ccgtgagtac ggtgtgcgtg cggtcgatcc 2226241 ggccagagtt accaggtcgg aactagccag ttacgttgta ctcgtgcggt tctcgtagtc 2226301 aaccaagcgt gcctgcagtt cggcgtacgg tacggaccgt ggcagctgct ctccgtcgcg 2226361 cacggcccga gccgcgtggg ccgctgcata caaccccgcg ctgtagggca ctgaaccggt 2226421 tgacacccgg gccaccccga gctcaccaag gtcggcgatc gtcaagccgg gcacgggcaa 2226481 cgtgttaacc gggcacggaa tgttgcgagt gagctcagca agttcgtcgg gatcgttggc 2226541 cagtgggaca aagacgccgt cggcgccggc atcgacgtag cgaagtgcgc gctggatcgt 2226601 gctggtggta tcggcgtgct ggcgcaacca ataggtgtcg acgcgggcgt tgacgaacac 2226661 ctcggggtta cgttgtttga tcgcaacgat tttagcggct gccagggcgg ggtcgatgag 2226721 cttttcggcg ctactgtcct cgatattgat tccggctgtc gacagttgtg cgacgtagtc 2226781 agcaatggcg tcgggttcgt cgctgtatcc gtcctcgatg tcgacgctga cgtagcattg 2226841 cagcggtgcc agggcggccg ccagtgcgat gttggcgccg cgagtggcgc ggtgcccgtc 2226901 cgggtgcccg ccgctggacg agaccccgaa actggttgtg ccgatagccg tgaagccctc 2226961 cgcgaggtag gccagggccg acggcacatc ccaggcgttg ggcaacacga acggaacacc 2227021 ttggtgatga agatcgtgga aactcattcc ctacctccct gctggcggat gggcctgatt 2227081 gtatgtgtga cccgcgtcag cagggtcagt cggtgagacc cgtcgccgct ggccgattca 2227141 actaggttgc ggacggatga ccacttcgtt gggtatcacc agaatcagtc tgtcgtgctc 2227201 gacgagtgat gatgcggcgc acaccgtatg ccgccacacc gacaccgagc accgcggccc 2227261 cggcggccac cgaggagagt ggcagcgcga acgccaggac tacgcagccg atcagtccca 2227321 ccagcggaat caggcggcgg ggccggccct cgtcgagccc cagagtcaag gcggaggcgt 2227381 tggcgatcgc gtagtagacc agcacaccga aggacgaaaa gccgatcgca ccacggatat 2227441 ccgctgtcgc cgccagcgcc gccaccaccg cgccaaccac cagttcggca cgaaagggca 2227501 ccttgaacct agggtgcacg gcggccagcc agcgcggtag gtgccggtcg cgtgccatcg 2227561 ccaaggtggt gcgggagacc ccgagaatca aggccagtag cgagcccaat gcggccaccg 2227621 cggcccctat ctgcacgacg ggaatcagcc agttcacccc cgcgaccctc atggcctccg 2227681 acaacggggc ggcggcccgc gcgagccgct gcggacccaa cacagcgatc acggccacgg 2227741 cgaccagggc atacaccgcc agggtgatgc ccagcgccag cgggatggcg cgtgggatcg 2227801 tgcgggccgg gtcgcggacc tcctccccca gcgtggcgat gcgggcatag ccggcgaacg 2227861 cgaaaaacag caggccggcc gcctgcagca tcccccagac gtgtgcatct acaccgatat 2227921 cgagtcgcgc cgggtccgca gcgccggagc cataggcggc gaccacgact gcggtcaaga 2227981 ccaccaacac cacggcgacg atcgaccggg tgagccaggc ggacttctgt atcccggcgt 2228041 agttcaccgc ggtcagtgcc accaccacgg cgacggccac cgcgtgcgct tgcgcgggcc 2228101 acacatagaa gccgaccgtc aacgccatcg ccgcacacga tgccgtcttg ccgaccacaa 2228161 agccccagcc cgccaggtat ccccagaagt cgcccagccg catccggcca tacacatagg 2228221 tgccccccga ggccgggtag cgcgcggcca gccgcgccga cgagatcgca ttgcagtagg 2228281 ccaccaccgc ggccactgcc aacccgagca acaacccaga accggccgcg tacgcggccg 2228341 gggccagggc ggcaaagatt ccggcaccga tcatggaccc aagcccgatc accaccgcat 2228401 ccaagagccc cagccgtcgc cgcagctcat ctggaatatc gcgtgggtct agcgggcgtc 2228461 tcatgcctcg ataaggctac ggcatccgat atcggtatac gatatctacc cggaatttga 2228521 cgcccgagac ccgcatgcgt ccagggtttg tgggtttggg gtttggtcag tggccggtct 2228581 acgttgttcg ctggcctaaa ctccacctga cgccgcggca gcgaaagcgt gtcttgcatc 2228641 ggcgacgatt gctcaccgat cgcccgattt cgttgtcaca aattccaatc cgcacaggag 2228701 ggcccatgaa cgacccgtgg cccaggccaa cgcaagggcc ggcgaaaacc atcgaaaccg 2228761 actacctggt gataggtgcc ggagcgatgg gaatggcatt cacggatacc ctcatcaccg 2228821 agtccggtgc gcgcgtcgtc atgatcgacc gcgcatgtca acctggtgga cattggacca 2228881 ccgcctaccc gttcgtgcgg ctacaccagc catcggccta ttacggcgtc aactcaaggg 2228941 cactaggcaa caacaccatt gacctcgtcg gttggaacca gggactgaac gaactggcac 2229001 cagtcggcga gatatgcgcc tacttcgatg ctgtattgca gcagcaactg ctccccaccg 2229061 ggcgggttga ctacttcccg atgagcgaat acctgggcga cggccggttc cggacactgg 2229121 caggcaccga atacgtcgtc accgtcaatc ggcgcatcgt cgatgccacc tacctgcgtg 2229181 ccgtcgtacc gtcgatgcgg ccggcgccgt actcggttgc acccggcgtc gactgcgtcg 2229241 ctccaaacga actgcccaaa ctcggcaccc gggatcgcta cgtggtcgtc ggtgccggca 2229301 agaccggcat ggacgtctgc ctatggttgc tccgaaacga cgtctgccct gacaagctga 2229361 cctggatcat gccgcgtgat tcctggctga tcgaccgagc gacgctgcag cccgggccca 2229421 cattcgtcag gcagttcagg gaaagctacg gtgcgactct cgaggccatc ggggccgcga 2229481 cctcgaccga cgatctgttc gaccgactag agaccgccgg aaccctgctg cgcatcgacc 2229541 cctcggtgcg tccgagcatg tatcgctgcg ccactgtgtc gcacctcgaa ctcgagcagc 2229601 tgcgccgtat ccgcgacatc gtcaggatgg gccacgtcca acgcatcgag cccaccacga 2229661 tagtgctcga cggcggatcg gttcccgcca cacccacggc cctctatatt gactgcaccg 2229721 ccgatggagc accacaacgt ccagccaagc cggttttcga cgcagaccac ctaaccctgc 2229781 aagccgtgcg cggatgccaa caggtgttca gcgccgcgtt tatcgcgcac gtcgaattcg 2229841 cctacgagga cgacgcggtg aaaaacgaac tctgtacccc gattccacac ccggactgcg 2229901 atctggactg gatgcgtctg atgcactccg atctaggcaa ctttcagcgc tggttaaacg 2229961 accccgatct gacggactgg ctgagctcgg cgcggttgaa cttgctcgcc gacctgctgc 2230021 cgccgttgtc tcacaagccg cgggtgcgcg agcgggtggt gtcgatgttc caaaagaggt 2230081 tgggcaccgc cggcgaccag ctagcgaagc tgctcgacgc cgccaccgca acaaccgaac 2230141 aacgctaagg atcggccgtg caccataacc gcgatgtcga cttggcgctt gtcgagcgac 2230201 ccagctcggg atacgtctac acaacgggtt ggcgactggc cacaacggac atcgacgagc 2230261 accaacaact gcgcctcgac ggtgtcgcgc gctatatcca agaggtcggt gccgagcatc 2230321 tcgccgatgc ccaattggca gaggtccatc cccattggat tgtcctgcgc acggtcatcg 2230381 atgtcatcaa cccgattgag ctacccagcg acatcacctt tcaccggtgg tgcgcagcgc 2230441 tttccaccag gtggtgcagc atgcgtgtgc agctgcaagg atccgccggc ggccacatcg 2230501 aaaccgaagg gttctggatc tgcgtgaaca aagacaccct gacgccgtcc cgtctcaccg 2230561 atgactgcat cgcacgtttc ggcagcacca ccgaaaacca ccggctcaag tggcgcccat 2230621 ggctcaccgg gccgaacatc gatggtaccg agacaccatt tcccttgcgt cgcacggata 2230681 ttgacccgtt cgagcatgtc aacaacacca tctactggca cggtgtgcac gaaatactct 2230741 gccagatacc caccctgacg gcaccctacc gcgccgtgct cgagtaccgc agccccatca 2230801 agtccggcga accgctgacc attcgttacg agcagcacga cgacgtcgtg cgcatgcact 2230861 tcgtcgtcgg cgacgacgtg cgcgcggcag cgctgctgcg caggctataa ccgtctggac 2230921 gaatcggcgg tatgccgacc accatgaacc aaggtccgca acgcatcgaa gcacgaggag 2230981 aatccatgtc tggacggttg atcggaaagg tcgcacttgt cagcggcggg gcgcgcggta 2231041 tgggtgcatc ccatgtgcgg gcgatggtgg ccgaaggcgc aaaggttgtg ttcggcgaca 2231101 tcctcgacga ggagggcaag gcggtggccg ccgaactggc cgatgcggcc cgctacgtcc 2231161 atctcgacgt tacccaaccc gcgcaatgga cggctgcggt ggacaccgcg gtcaccgcat 2231221 tcggtggcct gcacgtgctg gtcaacaacg ccggcattct caacatcggg acgatcgagg 2231281 actacgccct caccgaatgg cagcgcatcc tcgatgtcaa cctgaccgga gtcttcctgg 2231341 gcatccgcgc tgtcgtcaag ccaatgaaag aggctggtcg cggctccatc atcaacattt 2231401 cgtcgatcga ggggctggcc ggcacggttg cttgtcatgg ctataccgcc accaagttcg 2231461 ccgtgcgggg gctgaccaag tccaccgctc tcgagttggg gcccagcgga attcgagtca 2231521 actcgattca ccctgggttg gtcaagacgc cgatgactga ctgggtcccc gaagacatct 2231581 tccagaccgc gctgggccgc gcggccgaac ccgtggaagt gtccaacctc gtcgtctacc 2231641 tggccagcga tgagtcgagc tattccaccg gcgcggaatt tgtggtcgac ggcgggaccg 2231701 tagctggcct ggcacacaac gacttcggtg ccgtcgaggt gtcctcgcag ccggaatggg 2231761 tgacgtaaac gccgattggc aggcaatgcc cgaccggtct ggcgatgacg atcgcgtccg 2231821 cgctcaaccg caatcggata cccagccggc ctgtcccgca cccggcccaa ggaacggcgt 2231881 cgtggtggct attccgactc gagtgggtga tcatccttag gctcgtgcgc ttggtcgacc 2231941 gccgagatag caacgaagcc ggcgccggct tggataccgt catgggcggc ttcgatgtcg 2232001 taccgggcga gtcccggcgg ttggtgcagc gtgcagcggc gggcgatgac ccggaatccc 2232061 gagtctgcga gcagttgttc gagttcggcc gcggtgtaga agcgggcgtc gcggtagcct 2232121 ggctgtccgc gggccgcgcg cagagcgtac aggtcggccc acggtgtccc gcgaggcaag 2232181 aacccgataa caaggccgcc gccgtcggcg agcagacgcc gcgtttcccg gaatatggcg 2232241 gccgggtcgg tgacgaaaca gagcgtgaat gccatgagga ccgccccgaa gtgccggctg 2232301 acgaaaggga ccgcctcgcc gacggcattg gcgaccagga cgccgcgccg gcgtgcgaac 2232361 atcagcgcat cacgggatgg atcgagtccg aaccgcacgc cgagcaggtc ggcgaaacgt 2232421 cctgtaccga caccgatttc caagcgtggc tgggcaaaga cctcgatgag cggccgcaac 2232481 gcggcgacct cggtcgccag gatcggccgc ccggtgggtg agtcatacca ggcgtcgtag 2232541 gccgccgcgt cgcgcccggc ggccgacgat gccggcatcc gggtgtcagg cgtcaccgcg 2232601 agctgattcc agcaacaatc ggcgttcggc ggccgcgacc gaccccgggg tagcagcaat 2232661 cgcgcccgaa tggaccgaca ctgaggtgat tcccatccgg accagatgct cggcgaaagt 2232721 cgggttgccc gagagcgctt gaccacacag cgacgatgtg ctgctgacag tgatggttgg 2232781 catcggtttt cctttcggcg ttctcagatc gcgctgcgcc agatgtggta ggcctgtccc 2232841 acggagcgct cacgcggccc cgccgtgtcg atccggtgcc cggtgtccca gtccgcttgc 2232901 cgggcggcca aggccgccgc gatctcggcg gtggcgtcgg agttgccccc ggctctagca 2232961 acgattctgt cggccatcac gtcaaccgtc gccgaacacc tgaattcgac aatcgccgag 2233021 tgcgtgtccg ccgcgagacg ccgggcgcag gcgcgcatct gcggatcacc ccaggtaccg 2233081 tcgaggatca ctgagtgccc actacccaag agcaggcggg ctttgcgcag cgcctcctgg 2233141 tagaccgcca caacgttggc acgactgtag agcccggagt ccaaaacgcc gggctccccg 2233201 gtgattactc cgcaatcgcg tagccgccgg cgcacatcgt cggttgagat cacctgcgcc 2233261 cccaccagtt cggcgacccc gcgggccagg gtcgacttgc cggtgcccgg attgccaccg 2233321 accagcgcca accggaccgt agcgtgctgt aggtgttggg tggcgatgat caggtggcgc 2233381 acggcgtccg cagcggcctc cggtttgccc tgggagaatc gcacgcactc gactttcgcg 2233441 cgcaccaccg cgcgataagc aatgtagaag tcgcgcagcg acgccggggc ggtatcaccc 2233501 gaacgcaccg catagccggc caggaagtag tccccaagat ctttgcggcc caagaactcc 2233561 agatccatgg ccaaaaaggc ggcgtcgtcg atgcggtcga ggtagcgaag ctcgtcttcg 2233621 aactccaagc aatccagcag cgccggttcg ccatccacca agaagatgtc atcggccagt 2233681 agatccgcgt ggccgtctac aatacaacct tctttgatcc ggccggcgaa caaaacctcg 2233741 cgcccggaaa cgaattcgtc gaccatgtgt tcaatccgcc gaatcacatc cccggagacc 2233801 actttgtccg cgtggtggcg aagttcggcc aggttttcgt gccaacgccg cgccaccgca 2233861 ccgacctcgc cttgagtatc gatgcaccgg ttacgctgtg cgcgctggtg aaaccgggcc 2233921 aacacctcag cgatcgcgtc cagggcaccc tcgaccggca ggccggcggt caccatcgac 2233981 gccagccgct gcttgtcgcg gtaacgccgc atgacgacga ccggttcggc gtgcccgccg 2234041 cttggatcgc tgagatgggc aatgcccaag tagctctgcg cggccagccg actattcaac 2234101 tcgaattccc ggatacaggc gcgctcacgc tgttccgccg tgcggaagtc gcagaaatcc 2234161 gtcaccacag gctttttcgc cttgaacgcc cggtcgccgg ccaacacaac cactgcggtg 2234221 tgggtttcgc gcacatcgat gaaaggctca tctgtcacag gatgggcgtc acacgtgccg 2234281 tcgttggtcg gtgagtccat ggcggtagcc aagccaagta gtcacgactg ccgtgccacg 2234341 atcactggca cccgcgcggc gtgtaagacc gcgttactga ccgaccccag aagcatgccg 2234401 gtcaagccac ctcggccatg actgccaacg acgacaagct gggcggacgc cgacttttgc 2234461 accagcttcc gcgccgggcg atcgcaaacg acaacccggc tcaccggcac atcgggatag 2234521 cgttcttgcc aacctgccaa gcgttcggcg agactaagct ccgcttcctg ctgtacagcc 2234581 gagaagtcca aacccggaag ttccaccact tcgacgtcac tccacgcgtg cacggcgatc 2234641 agttcgacgc cgcggcgcga cgcctcgtca aatgccaccg ccgtcgcaag ctccgaaacc 2234701 ggcgaaccgt cgattcccac cagcacggga gcgtgctgcg gatcagggat caccgcatca 2234761 tcgctgtgga tgaccgcgac cgggcacccg gcgcgtcgca ccaggctcga gctgaccgaa 2234821 ccgagcaagc ctcgggccag cgctccccgg cccgagctgc ccaacaccac catctctgcc 2234881 tcgttggaga tttcaaccat ggtaggtacc ggcgtggaaa atacgagctc gctctttacg 2234941 ctgagctttc gatccgctcc aaccgcctct ttggcgagct tgacggcgtt ggcgacgatc 2235001 tggcgaccct cgtcctcctg ccaaaccccc caggtctccg gatacggcat cggcggccac 2235061 gtcgctacat cggcgttcac cacgtggacc acggtcagcg gaatgttcct catcgccgca 2235121 tcggtggcac cccaacaggc ggcggcatcc gattcgagcg aaccatctac cccgacgaca 2235181 actccgtgct gcttgcgggg tttagacatc tcattctccc ttcgcctcga gcaacgctat 2235241 gaaccgggac agtcaccggt catgaggctt tagtccccaa tcggacggcc aaccgaccat 2235301 gattggattc gacgcccgaa tccaggcgtg cgctgtggca tcgtcgtcaa tgtgaccgga 2235361 ccgccgccca ccatcgaccg gcgctaccac gacgctgtca tcgtcggcct cgacaacgtg 2235421 gtcgacaagg ccacgcgagt gcacgccgcg gcatggacga agttcttgga tgactacctc 2235481 acccgacgac cccagcggac cggcgaagac cattgccccc tcacccacga cgactaccgc 2235541 cgcttcttgg ccggcaaacc cgacagtgta gccgacttct tggccgcccg cggaatcagg 2235601 ctgccgccgg gctccccgac tgatctcacc gacgacaccg tgtacgggct gcaaaacctc 2235661 gagcgccaga cattcctgca actgttgaac accggtgtcc ccgagggcaa gtcgattgcc 2235721 tcgttcgcac gtcggctgca ggttgccggt gtccgcgtgg ccgcccacac ctcccaccgt 2235781 aactacgggc acacgctgga tgccaccggc ctggcagaag tgtttgccgt ctttgtcgac 2235841 ggcgccgtca ccgccgagct cgggctaccg gccgagccta acccggccgg cctgatcgag 2235901 acggcgaagc ggctgggagc aaaccccggt cgctgtgtgg tcatcgacag ctgccagacc 2235961 ggtctgcgcg ccggccggaa cggcggattc gcgctggtga ttgccgtcga cgcgcacggc 2236021 gatgccgaga acctgctgtc cagcggagcc gacgccgtgg tcgcagacct ggccgctgtc 2236081 acggtgggaa gcggcgacgc cgccatctcc acgattcccg acgccctgca ggtctacagc 2236141 caattgaaaa gactactgac cggccgacga ccagcggtgt ttctcgattt cgacggcacg 2236201 ttatccgata tcgtcgagcg ccccgaagcg gcaacgctcg tcgacggcgc agcagaagcg 2236261 ttgcgagcgc tggcggccca gtgtccggtg gcggtgataa gcggacgcga cctggccgac 2236321 gttcgcaacc gggtcaaagt cgacgggctg tggctggccg gcagccacgg cttcgaatta 2236381 gtggcgccag acggcagcca tcaccaaaac gccgccgcca ctgccgctat cgacggattg 2236441 gccgaggcgg cagcgcaatt ggccgacgca ctccgcgaaa tcgccggagc agtagtggaa 2236501 cacaaacgct tcgcagtcgc agtgcactat cgcaacgttg ccgacgacag cgtcgacaac 2236561 ctgattgcgg cggtgcgccg actcggacac gcagcagggc tgcgtgtcac caccggccgc 2236621 aaagtcgtcg agcttcgccc ggatatagcc tgggacaagg gcaaagcact cgattggatc 2236681 ggtgagcggc tcggcccggc cgaagtcggc cccgacctac ggttgccgat ctacatcggc 2236741 gacgacctta ccgacgaaga tgcctttgat gccgtgcgtt tcaccggtgt cgggattgtg 2236801 gtgcgccaca acgaacacgg tgatcgacgg tctgccgcta cctttcgtct cgaatgtcct 2236861 tacaccgttt gccaattcct ctcccagctg gcttgcgatc tgcaggaggc agtgcagcac 2236921 gacgatccgt ggactctggt cttccacggc tacgaccccg gccaggagcg gctgcgtgaa 2236981 gcgctgtgcg cggtgggcaa cggctacctg ggttcgcggg gctgcgcacc cgaatcagcg 2237041 gaaagcgagg cacattaccc gggcacctat gtggccgggg tgtacaacca gctcactgac 2237101 cacatcgaag ggtgcaccgt tgacaacgaa agcctggtca acctccccaa ctggttgtcg 2237161 ctgaccttcc gtatcgacgg cggagcatgg ttcaacgtcg atacggtcga gttgttgtcc 2237221 taccggcaga cgttcgacct acgccgtgcc acgttgaccc gcagcttgcg attccgagac 2237281 gccggcggac gagtgaccac gatgacccag gagcggttcg cgtccatgaa ccggcccaac 2237341 ctggtcgcac tgcaaactcg gattgaatcc gaaaattggt cgggcacagt tgatttccgg 2237401 tcactagtcg acggaggtgt gcataacacc ctggtggacc gctatcggca actatccagc 2237461 caacacctta ccaccgccga gatagaagtc ctggcggact cggtgttgtt gcgcacccag 2237521 acgtcgcaat cgggtatcgc gatcgcggtc gccgctcgca gtaccctgtg gcgcgatggc 2237581 caacgggtcg acgcgcaata tcgggtcgcc agggacacca accgcggcgg ccatgacatc 2237641 caggtcaccc tgtcagcggg gcaatcggtc acgctggaaa aggtcgcgac gatcttcacg 2237701 agccgggacg ccgcgacatt gacagcggca ataagcgcac agcgctgtct aggtgaggcc 2237761 ggtcgctatg ccgagctctg tcaacagcac gtccgcgcgt gggcacggct gtgggaacga 2237821 tgcgccatcg atttgaccgg caacaccgag gaattgcggc tcgtgcgact gcacctactg 2237881 cacctgctac agaccatttc gccgcatacc gctgagctcg acgccggggt cccagcgcgc 2237941 gggctgaacg gagaggccta ccgcgggcat gtcttctggg atgcgctgtt cgtcgctccg 2238001 gtgctcagcc tgcggatgcc gaaggtggcg cgatcgctgc tggactatcg gtaccgacga 2238061 ctacccgcgg cccgccgagc ggcgcaccgg gcgggccacc ttggcgcgat gtatccctgg 2238121 cagtcgggca gcgacggaag cgaagtgagt cagcagctgc acctcaatcc acggtccggg 2238181 cggtggactc ccgatcccag tgatcgtgcc catcacgtcg gtctagcggt tgcctacaac 2238241 gcgtggcact actaccaagt gaccggtgac cgccagtatc tcgtcgactg cggggcagag 2238301 ctgctggttg agatcgcacg cttctgggta ggcctggcca agttggatga cagtcgcggc 2238361 cgctacctga tccggggagt aatcggtccc gacgaattcc attcggggta tcccggcaac 2238421 gagtacgacg gaatagacaa caatgcgtac accaacgtga tggcggtatg ggtgatcctg 2238481 cgggcaatgg aggcgctgga cctgctaccg ctgaccgatc gccgccatct gatcgaaaag 2238541 ctcgggctga caacgcagga gcgcgaccaa tgggacgacg tgagccgacg catgttcgtt 2238601 ccattccacg acggcgtgat cagccagttc gagggctatt cggaactggc ggaactggat 2238661 tgggatcact atcggcaccg atacggaaac atccaacgac tcgaccggat cctggaagcc 2238721 gagggcgaca gcgtgaacaa ctaccaggcg tccaagcaag ccgacgcgct gatgctgctc 2238781 tacctgctgt cttccgacga gctgatcggc ctgttggccc ggcttggcta ccgcttcgcg 2238841 cccacacaaa tcccaggcac cgtggattac tatcttgccc gcacctcgga tggatctacc 2238901 ctgagcgctg tcgtgcatgc gtgggttctc gcccgcgcca accggagcaa tgccatggag 2238961 tacttccgtc aggtcctgcg ctccgatatc gccgacgtcc agggcggcac aacccaggaa 2239021 ggaattcacc tggcggccat ggctggcagc atcgacctgc cgcagcgttg ctattccgga 2239081 ttggaactgc gcgacgaccg gctggtgttg agcccgcaat ggccggaagc acttggacca 2239141 cttgagtttc cgtttgtgta ccgccgccac cagctgagcc tgcgaatcag tggccgaagc 2239201 gccacattga ccgcagaaag tggagacgcc gagccaattg aggtcgaatg ccgtggccac 2239261 gtgcagcggc tacggtgcgg gcacaccatc gaagtcggtt gcagcaggtg accaatgtcg 2239321 cacatggtgg gtcgacgatc tctcctggaa aggacggccg gccgcggtct cccttattgc 2239381 gttgggtgtt gtgtgctcgt cgcctgcgac taagggcact ccaccgggat agccgcgacc 2239441 agaggcgtgt cgactccgat cgggcccacc gctgcggcac cacccggcga acccagcgga 2239501 gccactcggc ccggcaggac ttggtggaaa aaggcggcgt tgtcccccag atgctggtgt 2239561 tgatcgtcgg gtagatcgcc ttcccagtag atcgcctcga cgcggcaggc cggtttgcac 2239621 gcaccacaat ccacgcactc gtcggggttg atgtagagca ttcgggcgcc ctcatagata 2239681 cagtcgaccg gacactcctg cacacaggac ttgtccatca catccacgca ctcactaccg 2239741 atcacatagg tcacaaacgg caagctaccg gcccgatgcc gaggatcgcg cctatccaaa 2239801 gacccctacc ggaaaggacc aaaggcctta ttcgtcaagt tcgtcactgg cacgtcgacg 2239861 cggggtgcaa gaaaaccggg gcggttcacc cgaccgccag cgggattcac gctcccccag 2239921 gccataaact tacgatagcc cgtcatttca agagcgcgag aagttcatcg acactcccgg 2239981 tggtcaagat ctgatccgcg ggaaccgcaa cgaccgtgtc gctcaaggga aagcggtgtt 2240041 cgccagggta gacgattgcc aacctcgcca gttggaggtc gacaagagcc gagcgcatcg 2240101 accgggaaat cgacggtgta gacgtccgct tgatctcgaa tccataggga cggccagata 2240161 attcgacata gagatcgagt tcggcgtctt gctgggtgcg ccagtaatac agcggattcg 2240221 gggcgagcag ggccgcaagc tgctcgagca cgaacccctc ccagctcgcg ccgagcttcg 2240281 gattgcgttc gagggcaagc cgatcgtcga taccgagcaa cctgtgcaac aaaccggtgt 2240341 cccggatgta gatcttgggt gatcggcgtt gtcgctttcc gatgttggcg aaccagggcg 2240401 tcagctgacg gacgacgagt gcatcggtga gcgcatcgag gtatcgccgc gccgtcgtct 2240461 gagcaacgtc gagtgagcgg gcaagttctg cgccgctgaa gagctggcca tggtagtggg 2240521 cgagcatcgt ccacgcgcgc cgcatcgtcg cggccggaat gcgcacacca agctgggcga 2240581 gatcgcgctc cagaaacgtg gtgatgtagc cgtcgcgcca cgccgcggag tcctcgttgg 2240641 agcgtgccgt gaacgagggc ggtagacccc cacgcaacca gaggcgatcg gcggccgagg 2240701 atccgacgtc gcggaccgtc aggccggaca actccaccaa ctcgacgcgt ccggccaaac 2240761 tttcggacgc cagcccgaca agatcgggtg aggcgctacc caggataaga aaccgggccg 2240821 gcatgacagg cctgtcgacg agcacgcgta ggaccggaaa cagatccgga atccgttgcg 2240881 cctcgtcgat cgtgatcaac ccgctaaggc cggataaagc caacatcggg tcggcaagcc 2240941 gtgtcgcgtc gacgggattt tcggcgtcaa acgtacattc gggtgcggac ttgcccacca 2241001 gccggctaag ggtggtcttg ccggcttgac gaggtccggt aagcaacacc accggcgctc 2241061 ggtgtagcgc gcgtcgcaac cgcgcggcgg cgtcgcggcg ttcgatcaac atgcatgaaa 2241121 ttctagcggt aggcgctgat atttcatggt tagccgcccc cgggagactc ggtggtgggt 2241181 cccacacgcc tagaaagtcg ccggcgataa cgaccggcca ggtcagcggg gttggccgca 2241241 gcccgataag gctctcgatc tcgtccatca ggcatgctcc acatcgcctg caccagggca 2241301 aagctgcacc ggtcgtgcga gccggttagc aaatagcacg ttcatacaca taaatgtgta 2241361 tagtggtgtt gtgtcacgga ccaacatcga gatcgacgac gaactcgtgg ccgccgcaca 2241421 gcggatgtac cgactcgatt ccaagcgaag tgccgtcgac ctcgcgctgc gccggctcgt 2241481 gggtgaaccg ttgggccgcg atgaggcttt ggcgctgcag ggcagcggtt tcgacttcag 2241541 cgacgatgag atcgaatcgt tctcggatac ggaccgcaag ctcgccgacg agtcgtagat 2241601 gatcgtcgac acctcggtct ggatcgcata tctctccacg tcagagtcgt tggccagtcg 2241661 ctggctagcc gatcgcattg ccgctgactc gacggtgatc gtgcccgagg tggtgatgat 2241721 ggagctgctg atcggtaaga ccgatgagga caccgccgca ctgcgccgac ggctcctgca 2241781 gcgattcgct atcgaaccgc tggccccggt ccgcgacgcg gaagatgccg ccgccattca 2241841 ccggcgctgt cgtcgcggcg gcgacaccgt acgcagcctg atcgattgcc aggtggccgc 2241901 gatggcgttg cggatcgggg tcgccgtggc gcatcgtgat cgcgactacg aggcgatccg 2241961 cacacattgc ggactacgca ccgagccgtt gttctgactg cggacacccg gacgatttcg 2242021 tgtctcacat ctgacccgtg gccgtcgtcg tccgccgccg ggtacatcga catagtggac 2242081 cagggaacat cgccagcgca tgagtgagcg cggataccac ccggtccggg gacgcgttgg 2242141 cgctggccga agccgaccgg cccagcgatg acatcgactt caaggacgtt cggctttcag 2242201 cgcgacgatc atccgcctca ggctgtcgcg ggtcgcttgc agcgcggccg ggtctacggc 2242261 gtcagcaagt cggttgttga cgacgatggc gcgttcggtg attgcctgga gaacgcggcg 2242321 accattctcg gttagcacca gcagtggtga tgtgcggtgg tcggggttgt gtctgagctc 2242381 ggccaagccg caaacgacca gatcgttggc cactcgctgc accccctgac gggtaacacc 2242441 aaggcggcga gcggcttggg gcacggtcag cgctcgatcg gagaccacgc tcagcagctg 2242501 ccatcgcgcc tgcgtgtgcc cctctctggc agcgaccacc tcacctgagc gccgtagcag 2242561 gccagcgagc tcgaatacgt ctgctaccag ccgagcgatc tcatcggaca tcccgcctcc 2242621 aactttgaca atatattgtc atcatggttc gatgctgtca aaatcgaaac ggtcctgtcg 2242681 tcgtcgtgaa acccttcgca tcggagaaaa gatgagcgct ccaattacga atcttcaagc 2242741 cgcacagcgt gatgccatca tgaaccgacc agcggtcaac ggcttccccc atctggccga 2242801 gacgctgcgc cgcgccggtg tccgaaccaa tacctggtgg ctaccggcga tgcaaagcct 2242861 gtacgagact gattacggtc cagtccttga ccaaggcgtg cccctgatcg acggcgtggc 2242921 cgaggtcccg gcattcgacc gcacggccct cgtcactgcg ctgcgcgccg atcaggcggg 2242981 tcagacgtct ttccgagagt tcgccgcggc agcctggcga gccggtgtgc tccgctacgt 2243041 cgtggacctc gagaaccgca cctgcaccta cttcggcctg catgatcaga cgtatatgga 2243101 gcactacgcg gcagtggagc cttccggtgg tgcccctacg agttgagctg cgcccgtcgc 2243161 agcgacattc cagcagaccg cgacgtcagt cttgggcggc ctgactatcg cgatgatccg 2243221 tcgcccgctc atcaacccgg ttcgtggtca agacttttca ccggggcgac gtttcctggg 2243281 gctagtaagg cggttgccga tcttcgtgaa gcggcggtgt ccgagaccca cgacaccaag 2243341 gacgtgttag ccgctttggc cgcgcgcaag tccccggtgc gacctttctg atgcgatcga 2243401 cgatgtaggt gggatctcgt gctcttcgca ccagtcgttg ggatcctggg cgattccgga 2243461 cgctttgtcg gtggtgacgc ggtcgatgat ccagcctagc gccgaacccg agccgagcag 2243521 gcaacgcccg gccccaagtg gtgcgcaccg ccgccgtgga tcttgatggg agcacgcgaa 2243581 gctcactggt gcaccatcct tgtgtcggtg accttggatg gattgccgat gcacccaagg 2243641 cgccgctggg ttatcgccct gctcgctcga cagccgtgat gtccacgatg agttctgcgg 2243701 agtccggcgg tagccccgga cgcgccgacc gtcgacagga ctgagcgccg acgagcgccg 2243761 aacagtgagc ggcccaaacc actaccctgc ccgacgagcc gcggaacggc gtcacgggtg 2243821 gaatcgattg ggcgcgagat gatcacgcgg tgtcgatcgt cgatgcgcgt gggcgcgagg 2243881 ttcgccgcgc cacgatcgag cacaacgccg ccggactgcg cgagctgctc gagctgctga 2243941 gccgggccgg tgcccgcgag gtcgccatcg aacgcccgga cggcccggtc gtggataccc 2244001 tgctcgaggc cgggatcacg gtggtggtga tcagccccaa ccagctgaag aatctgcgcg 2244061 gtcgttacgg ctcggctggc aacaaggacg accggttcga cgcgttcgtg ctcgccgaca 2244121 cgttgcgcac cgaccggtcc cggctgcgcc ccctgctgcc cgacaccccg gccacggcca 2244181 ccctgcgccg gacctgccgc ccccgcaaag acctcgtcgc ccaccgggtt gcgttggcca 2244241 atcagctgcg cgcgcacctg cgcgtcgtct ttccgggtgt ggtcgggttg ttcgctgacc 2244301 ttgactcgcc gatcagcctc gcgtttttga cgtttttgcc ccgtttcgac tgccaggacc 2244361 gcgcggactg gctgtcggtc aagcgcctgg ccggctggct ggccgccgct ggctactgcg 2244421 gccgtgctcc acgaccggct caccggtgcc ccgcgcggcg ccaccggtga cgagggtgcc 2244481 gccaacgccc acatcacccg ggccatggtc gccgcgctca ccagcgtcgc gacccagatc 2244541 aagacgctcg acgcgcagat cgccgagcag ctctccttgc acgccgacgc gcatatcttc 2244601 acctccctgc cccgctccgg caccgtccgc gccgcccggc tgctcgccga gatcggggac 2244661 tgccgagccc gtttccccac gcccgaatcg ttggcctgcc tggctggcgt cgccccctcc 2244721 acccgtcagt ccggcaaagt caaacacgtc ggattccgtt gggccgcaga caaacaactc 2244781 cgcgacgccg tctgcgactt cgccggtgac agccgccgag ccaacctctg ggccgccgac 2244841 cgctacaacc gcgccatcgc ccgaggacac gaccaccccc acgccgtgcg catcctggcc 2244901 cgcgcctggc tctacgccat ctggcactgc tggcaagacg gcgccgccta ccaccctgcc 2244961 aaccatcgcg ccctccaggc actgctcaac caagatcaag accgggcggc ttgacacacg 2245021 gctactcatc ggcctagcgg gtgggcgcca ccagcgggta gcacgaacga aatccttgat 2245081 gccccaaacc gtttaagcgt tactgcaggg tacaggtacc gagcgggacc cgctgccggg 2245141 cctagttgct tatcggtggt ggttgcggct ggaagggttc ataccaccac cagtcggcgc 2245201 gctcgccggt gggcccaggc cacggcgcta ccgccggcgg cggcttcgtc gacgcccgcg 2245261 ccaacgatcc cgcgctcaaa ggtcggcccg cgctgtcggc gacggtgagg ttgtctgccg 2245321 gtccggtaat ggtgatcagg ccccgatggt gtgcccggtg gtgatacggg cacaccagca 2245381 ccaggttggc cagctcggtg gccccaccgt cctgccaatg tcggatgtgg tgggcgtgca 2245441 aaccccgggt ggccccacaa ccgggaacca cacacgtgcg gtcgcgatgc tcaagcgccc 2245501 ggcgcaaccg acgattgatc tgacgagtcg ttcgaccgca gccaatgacc tgcccgtcac 2245561 gttcgaacca ggcctcaaag gtggcatcac agagcagata tcggcgttcg gactcgctga 2245621 gcagcggacc caggtgcagg ccagcggcac gctcctgcac gtctagatgc atcaccacgg 2245681 tggtgtgctg cccatgtggc cgacgagcca cctcggcgtc ccagccggcc tcaaccagac 2245741 gcagaaacgc ctcaacattg cccggcaacg ggggccgctg atccgacaca ccgtcgctgt 2245801 tgtcgtgatc acgcttgtac tcggcgatca acgcatccag atgagactgc aacgccgcat 2245861 cgaacttcgc cgcctccacg tgcggaagct tgattcgcca acaactgaac tgctcatcgg 2245921 cgctcctggt gatcgagggc cgcggttccg gccgaaaatc cggttcgggt tcgggtcgcg 2245981 gttccaactt gagcgcggtc cgcaactgat tcaccgtggc aaccccggcc aactgcgcat 2246041 aatgcgcatc cgaaccctca cccgcccgcc ccgcgatcac cccaacctga tccaacgaca 2246101 accgcccctc ccgcataccc cgggcgcagc gcggaaactc cggcaaccgc cgcgccaccg 2246161 tggcgatcgt gtgggcgttg cccgacgagc agcccatctt ccaggccacc aaccccgcca 2246221 ccaaccgcgc ccccgtcaca ccccacaacc cgtcgcgatc cagctcagcc acgatctcca 2246281 caatgcgccc atcaatcgca ttgcgctgac cggccaactc cgccaactcc tcaaacaaca 2246341 cctccacacg ctcggcagga ctgactaccg ctgcgccaga cgtcgcggtc gaggacatga 2246401 gttcatcatc gcagcagggt ctgacaactc cggccaaccc gaatccacgc ccggggccgt 2246461 gccgtcatca ccccgcaaag agatgctcgg ctccggctcc gcccccgccg gggccaaggg 2246521 cacacgagac aacgaaatca gcgaacccac catggaaacg ctcaacggcg tgggccgcga 2246581 agccggcgaa atgctgggag cagctggtgg acatcgcata gataggcccc agacccagcc 2246641 agcacggctc caaccgtcga cgcgcctagc tgcaaaatcg catgcttgtc agcggatacc 2246701 ggtatatttt ccggtatgtt ttcagagcct tatccgaccg atggcgaagt catgacggaa 2246761 ctcggcgaca agttccttgc tgctcttgtt ggcaccatca gggatacgcg cttcgacatc 2246821 gccgacatgc ggaactggcg gccgggatgg tttccgacca tgcatagccg gtgtctgtcc 2246881 aacctcatcc acgacagaat ctgggcacac ctggtcaccc tcatcgcgag caatccaggc 2246941 accagcatca aggacaaggg tgccacccgc gagattgtgg ttggcgcaca cctgcggttg 2247001 cgaatcaaac gccaccacgc aggtgacgag atcagcacct acccgacccg aaccgccatc 2247061 gaattctggc aacagggcag ccagcccgcc ttcccggggc tggaagaggt tcgcattgcg 2247121 gtgggctatc ggtgggaccc tgatacccgc gagatcggag cccccctgct gtcgcttcgc 2247181 gacgggaaag atcacgtcat ctgggtagtc gaactcgacg agcctgcggc cggcgtgaag 2247241 atcacctgga ccccgatcga gccgacacta ccgtccatcg acttcggtga cttgggtgaa 2247301 gactctggag catcggggga acgatgaacg gcctgggaga cgtgctcgcg gtcgcccgga 2247361 aggctcgtgg actcacccag atcgaattgg ccgagctggt gggactcacc cagccggcga 2247421 tcaaccggta cgaatcaggc gaccgtgacc ccgaccaaca catcgtggcc aagctggccg 2247481 aaatcctcgg tgtgaccgac gatctgctca tacacgggaa caggtttcga ggtgcgctcg 2247541 cagtcgatgc gcatatgcgc cgccacaaga ccacgaaggc gtcggcctgg cgtcagctgg 2247601 aggcccggtt gaacctgttg cgcgtgcacg cgtcattcct cttcgaggaa gtggctatca 2247661 atagcgagca acatgtgccc gcgttcgacc cggagttcac cgccgccgag gacgccgccc 2247721 ggttagtccg tgcccagtgg cgcatgccga tgggcccggt cgtcaacctg acccggtgga 2247781 tggaggccgc gggctgcctg gtgttcgaag aggacttcgc cacccagcgc atcgacgggt 2247841 tgtcgcagtg ggtcgacgac taccccgtca tgctgatcaa cgccaacgca gcacccgacc 2247901 gaaaacgctt gacccttgcc cacgaactcg gccacctcgt gctgcattcc accaacccca 2247961 cggagaacat ggagaccgaa gccaccgcct tcgccgccga gtttctcatg cccgagagcg 2248021 agattcggcc cgagctgcgt cggctcgatc tcggcaagtt gctcgaactg aaacgggaat 2248081 ggggcgtctc gatgcaagcc ctcctggagc gggcatatcg catgggcctg gtatcggccg 2248141 aggctcgcac caagctctac aaggcgatga acgcgcgcgg ctggaaaacc aaagagccag 2248201 gcatcgagtc catcgtgcga gaaaaaccga gcctacccgc ccacatcggc atgacactcc 2248261 gaagccgcgg attcaccgac cagcaagccg ccgccatcgc cggatacgcc aatcctgcgg 2248321 acaatccatt ccgccccgaa ggtggccgcc tccatgcgat ttgacttccg attgacgctg 2248381 ggttgtcatg ccgacggcgc caggtgcggt cacacaaggc ggccggaaca ggcatcgatt 2248441 cttggcgacg ccgttgctgt accgatagcg actgccccgt atcgatccca gggaacgtga 2248501 ccatggtcgt agggatgact tgacagtttc aacggggtgc gaccaccgtt gcgctcagaa 2248561 ggcatacgtt ggtggaacac gtcggaaagc tgggaggtga atctgatggc tggcgaccaa 2248621 gagctggaac tgcggttcga cgttcctctt tacacgcttg ccgaggcatc gcggtacctg 2248681 gtggttcccc gcgccaccct ggctacgtgg gctgacggct acgagcgtcg gccggccaac 2248741 gcaccggcgg tccaggggca accgatcatc acggctcttc cccacccgac cggcagtcac 2248801 gctcggctcc cattcgtcgg aatcgccgag gcgtatgtgt tgaacgcctt ccgccgagcg 2248861 ggcgtcccta tgcagcggat ccggccatcc ctcgactggc taatcaagaa tgtcgggcca 2248921 cacgcgcttg cgtcccagga tttgtgcacg ggcggtgccg aggtgctctg gcggttcgct 2248981 gaacggtccg gggagggcag tcctgatgat ctggtggtca gggggctgat tgtcccgcga 2249041 tccgggcagt acgtcttcaa ggagatcgtc gagcactacc tgcaacaaat cagctttgcc 2249101 gacgacaacc tggcttcgat gattaggttg ccgcagtacg gcgatgccaa cgtcgtcctc 2249161 gatccacgcc gcggctatgg gcaaccggtg ttcgacggaa gcggcgtccg ggtagctgac 2249221 gtgctcggcc cattgcgcgc cggcgcgacg ttccaggctg tcgccgacga ctacggtgtg 2249281 accccggacc agcttcgaga cgcgctcgac gccattgcag cctgatcgga atctcctcgc 2249341 cgacctcgat cacatctttg tcgaccggag tttgggcgct gtgcaagtcc cgcaactcct 2249401 tcgggatgcc ggattccggc tgacaacgat gcgggagcac tacggcgaga cgcaggctca 2249461 gagtgtcagc gaccacaagt ggatcgcaat gaccgccgag tgcggctgga ttggatttca 2249521 caaggatgcc aatatccggc gcaacgccgt cgagcgacgg acggtgctcg acacgggagc 2249581 ccggctattc tgtgtgccgc gggccgacat cctggcagag caagtcgcgg cacggtatat 2249641 tgcgtccctt gcggcgattg cccgtgccgc acgatttccg ggaccattca tctacacggt 2249701 tcacccgagc aagatcgttc gcgtgctcta gtcgttcatc gctccgttaa ccgccggcga 2249761 ggccgtcgac gatcttcatg gtctcgacgc tgacggtggt caccttcttg atgaggtcga 2249821 cgatgtaggt gggatcgtcg tgttcgtcgc accagtcgtt ggggtcgttg acgatgcccg 2249881 acgctttgtc ggtggtgacg cggtagcgct cgatgatcca gccgagcgcc gagcgggagc 2249941 gagcaggtag cgctcggcct cgtcgggaat gccggcgatg gtgacgcggg agtagaacga 2250001 tcgccaagtg gtcggtcttg gctgcccact tcatccccgg cgccaccggc aggtctcgcg 2250061 gtcatctcga ccaacggagg gccgtcggtg gttcgtatcc ggccaagaac ggcgagaacg 2250121 gtttgtgcct ctatgccagg gtgaatgtct catctcccag gcggacggtg atatccagtt 2250181 ctccgccaag agcggacacg tatttgcgca gtgtgttgac ctgtgcggag ccgatgtcgc 2250241 cgttctcgat gctggatacc cggctctgcc ggatgtgcgc cagcgcagcc acctggacct 2250301 gggtgagtga ctgagccgcg cgcagctccc ggagccggaa tgcccgcact tcatcgcgca 2250361 ttcgtgcctt gtgccggtcc accgcctccc ggttaacggg acgtacggcg tccatgtccc 2250421 gtagtgtcat cgccatcgtg ccacttaccc tttcttgcgc ttgcgcctct ttggcttcgt 2250481 gtcctcgaac tgtgcgagat gttcggcaaa catctcatcg gccgctttga tcttctcgtc 2250541 gtaccactgg gtccaccgcc cggccttgtt accggcggcc agcatgatcg cctgccgcgc 2250601 cgggtcgaag gcgaacagaa tgcggacctc ggaccgccct tgtgatcctg gacgcagctc 2250661 cttcatgttc ttgtggcgcg acccacgcac cgtgtccacc agaggacagc caagtgcggg 2250721 gccctcttcc tcgagaacct cgatagctgc gaacaccaat tcgtaggtct ctcggtccaa 2250781 gccgttgagc caggcggaga tgcgctccac atccgccgtc caccccacag agtcgcagag 2250841 tagcgcgata cgcgatatca cacaagggtg atattcctcc gggtaagagc agcgggcgac 2250901 ggggctaccg tcgaggaaat gccggcaggc gaggacggac tctgcgcacc cgggccgttg 2250961 aaacagtagc ctgtgccagg ccgagaattc atccccacgt atgaggcagt acagtgcgcc 2251021 gccgtgcgcg ttctcccatg gaacgttcac gggctcccgt ggatgacagg cgtttcatga 2251081 acgccagcgc cgccgcaacc cgaccgaaag cggttgaccc caaggagagc tggaagtcga 2251141 ggccaccacc ttcgccgcgg agttgctcat gcccgagagc gagactcgtc ccgaaatacg 2251201 ccggctcgat ttcggcaagt tgctcgaact gaagcgggaa tgggcgtcga cccgctcgac 2251261 cagccccagc cgggtgacca gccccagccg ggtgaccagc ctatgcaccg cggcgatccc 2251321 accgaagccg gtggcatcga tgttggcgcc gacctcgtag cgcaccgcgc ccgaacccag 2251381 catcggcctg ggctgcgccg cccagcgtcc agcccgcgcg tgccgcgccg ccaccctgcg 2251441 ccctcggcgt gtgatgtttc gccgactctg ttcatgggtt atcttcttca ccacaaaggc 2251501 ctttcctgct gggctgtgtt gaggtcgcaa acccagccag ggtaaggcct ttggcctctc 2251561 ctacccggcc gacacgctta ctgaaggcct agtctaggca ggccattcaa tctgcggaat 2251621 cgaaaaattc ggttccagcc tgctcgtttc ctttccgaca gcgatctgac gttgcgtaac 2251681 gtcatttgta cggactcttt tagcggcatt gatttcagat gctaacgccg tctgtgctgt 2251741 agcgccgatt ggccgaaact gtaaatttgt atgattattt aaatctttga cgaacacgcg 2251801 ccacaaacgt actatctctt tggcaaagtc caccggcatc tcattcaacg gttttgtttg 2251861 cgcgtggtcg tcatatgttg gtaactgtgt aaccggccgc ctatcttgcg cgtgcatcat 2251921 atgactatga atcggccttc tccagtgaaa ttgatacaag atcgatccga taagcggtac 2251981 cttgtacaca gtgcaattgt agtaattcgc gttttgtcct acgcttgtat tctgcgtgaa 2252041 gaattcaaac agcgtccggg cgtgcagcac agtcgaatcg agtgcgtaga cctgcatgct 2252101 atagccgtca atcccaagat tcatacaatg ttgcgcgtgc cattctgttg cagcacgcaa 2252161 taaccatgcc agctctttgg tcaagtgctc ctctagatag tgctctagct gagcgacgct 2252221 tgtcatcgct ggcgttcgtt gaaccatcct aaaaggatag cgttgaacca tcctaaaagg 2252281 atagcggttt catggcgctg cctaactaat ttggaagctg cgcagcgccg ttgggccccg 2252341 gccagcatct gctgcgcgcc ctcgcgcaca atctcatcga tcaccgacga caccgggccc 2252401 gccaacacga ccggcacggc agaatcacga ttgccgacct cagggactac attgcgcatg 2252461 ggcgtacctt cccgaaccag cgcgccaacg ccggtccttg atcagacaaa tggacttcag 2252521 atcatcctcg gggaggtgcg cccaatcacg ccgcttcgcc gaggctcatc cacaggttct 2252581 gatcattgct caagcaagaa ggcgcggccg tattgacagt atcgcggaag actttcgttt 2252641 ccgggcgttt tgtcgcatcc gaatgtcaac aaccggtcga ttcagcttgc ggaggaggtt 2252701 ggggacactt gagaacacgt ccaccgactt tctgcatgtc cggattgcta gggtccgctt 2252761 cggcaacgaa gtctgcccac gcccaaccac caccggccat gtgcggtcca agctgcttgt 2252821 aattgaaccc ggcgtcgtcg aagtaacaga agttgctgtt catagtgttg agggaagggg 2252881 cgtccaggtt cgacaggacc gcaggcatag ttcgctcacg cggaatcgct accgccatgg 2252941 cgtgcctgcc aacccctggg ttgtcctctg aaattcgcct ctgtagtgcc accatcagcc 2253001 tggccgcacg agttatcccg tcttgctgct tcagcattcg cggaacacgc cgaagcaaga 2253061 tgttcgcata cattttctca gtcaacgcag ctcccgtgat acggaagctc gccgagtgcc 2253121 ccggcgctcg gcggattcgg tagcacttga actcgttttg gttggccggg atcggcgctt 2253181 ccggatcgaa gttcgaaatc tcggcgacga gaggaatcct gcggcggtcg aagccggcga 2253241 tgataatccc gagccgtttg tcctcccaac ccttcccggt cggtaattgg ccgatctgcc 2253301 cggaggccca gtaccgaagc gcatcgactc cgtcttcgaa cgaggcgtag tcgcataggg 2253361 tctcggcgag ccactccgat gttgacttct tctgggcggg atcgatgcga gcaagccctg 2253421 tgaacccgac cgtgaagctt gtattccagc acaccagttt ggtgtagtcg tcatcgaaca 2253481 ctgacccgtc cggtctcgac agtcgccgat cggcaacctg gatgaccagg tcgtcggtcg 2253541 taaccgtctg aatcaacgtc atgatgctgc taccgatgcg ccaagttggt ccgtattgca 2253601 caaggtgtgc agacggagag gatcagattt gtcagagttc gagaatgacc acggtggtcc 2253661 gcgctgcttc cgccgcgggg cctcggtgac gggcgtcgcc ctcgggcggg gcgccgccgg 2253721 gcaaggttaa gcggtagttg gcctgtcacg ttgaatccga acccgctgat gcaagtgcca 2253781 caatgctgtc cacgatcttc atggtctcga cgctgacggt ggtcaccttc ttgatcaggt 2253841 cgacgatgta ggtagggttc gcgtgctcgt cgcaccagtc gttggggtcg ttgacgattc 2253901 cggacgcttt gtcggtggtg acgcggtagc ggtcgatgat ccagcccagc gcggagcggg 2253961 agccgagcag gtatcgctcg gcctcgtcgg gaatgccggc gatggtgaca cgcgagttgt 2254021 agatgatcgt tgagtggtct tgcttggact tccatttcat cttttcgacg cgccaggtct 2254081 cgcggtcctc cggatctgcg cccggtttga gttgcacatc aaggggatac ggcttgaccg 2254141 actcgtagcc gacatgtaag tcggctagtt tccggccggc gctggcgagc tggtcgaagc 2254201 gttcgcgggt ctccggtgtt gggatgtgcg ggagcatctt cttgaggtca gcggcgtatt 2254261 ttgtgcggta ggcggggtca tgcagcaggc cgtagacgta gtagaagatg tcgtctttgg 2254321 tgacttggtc gccgatcgtg tcgcggtaga gcttgaggat gacgccggtg atgttgtcga 2254381 cgcggcggta gccgtggtcg tctacttcgg cgttggtggt ggactcgaaa tcgagttcgc 2254441 cgtcacgtgg ttcggtcttc tcgtaggtcc agcgcgggaa gaattgaccg ttgcttgagc 2254501 cccagaatgc gagatcgggg atagcgttta gcatcagaca cgagaagggc ttgtctgagc 2254561 ccatgccaac cacgtagtaa ccgacattcc cgtgctccgg cgtcggaaac atcgacggaa 2254621 gctggtaggt acagttgttg agctgctggt tggggtcgag gtaggcgtgc tctttcgtaa 2254681 atggtcggta cgtgccgagc cgcattcccg cgggagcgaa ttcgatgcga atgccttgtg 2254741 ccacttgccg cttgttgatg cggtcccagc tgaacttggc cgagtccacg gtaatgaggg 2254801 cgtcaaccgg cggggtcttg gcgtcccttc cgcggatctc gttgatccgg tcgacctccg 2254861 agttgtagaa gtcgatcgtg cgtccgatgt tggcctcgag cgcaccacgt gaaaagttgt 2254921 aacaccacgc atcccggctg gtcttcaagc ccgcggaata gttcgcgaag acacgtgtca 2254981 cgtcaagagc agccttcttg tcgccgataa ccggccacgc gctgaacgcg tcgtcgcgtt 2255041 ggttgaccca gtcaccgtgc aagttgggtg tgactgtctg ccattccacc gtgtcgaggt 2255101 agccgtcgcc gacgatccgc aacttctcct cgcgactcag gtaatcgccg atgtcgcggt 2255161 aaaggacatc gcatggcccg ctgtgcttcg gatccttgat gccaaggaag atcgccaccg 2255221 tgttgcgact ccccccgcca aagaccttgc cgccttcctg gcgtgagagt tccccagctg 2255281 tgcgctggtt cccccgcagg ttgtacacat ataccgccgc gtagtcgtcg gcgagcgaca 2255341 accgcatgcc gtctgccgtg ttgccgtcta tgtacccacc attggagacg aatccgacaa 2255401 caccgttgtc accaatgcgg tcggtcgccc accggaacgc gcgaatatac gagtcgtaca 2255461 ggctgttctt cagctgcgcc gtcgaccgct tcgcgtacgt ctgctcaatc cgcccgtcca 2255521 acgtcggata cttcacgttg gcgttcaggt cgttcgcgct gctctgcccc accgagtacg 2255581 gcggattccc gatgatcacg ctgatcggcg tcgccagctg tcgcaagatc cgagcgttgt 2255641 tgtacgggaa catgatcgcg tccatcgagt ccccggcttc ggaaatctgg aacgtgtcgg 2255701 ccagcgccat cccggggaac ggctcatagg cgtcggcgtc ggcggtcttg cccgccaaag 2255761 catggtaggt cgactcgatg ttcaccgcgg cgatgtagta cgccagcagc atgatctcgt 2255821 tggcgtgcag ctcttgcgag tactttcggg tgaggtcggc ggccgtgatc aggtcggact 2255881 gcagcagccg ggtaatgaat gtgcccgtcc cggcgaagcc gtccagaata tgcacgccct 2255941 cgtcggtcag cccgcgcccg aaatgcttgc gcgacacgaa atcagccgcc cgcacaatga 2256001 agtccacgac ctcgaccggc gtgtacacga tccccagcgc ctcggcctgc ttcttgaagc 2256061 cgatgcggaa gaacttctcg tacagctcgg cgatcacctg ctgcttgccc tcggcgctgg 2256121 tgacctcgcc ggcgcgccgt cgcaccgatt cgtaaaagcc ttccaaccga gcggtttcgg 2256181 cctccaggcc ggcacccccg acggtgtcga ccatcttctg catggcccgc gacaccgggt 2256241 tgtgcgacgc gaagtcatgc ccggcgaaca gcgcgtcgaa caccggcttg gtgatcaggt 2256301 gctgcgagag catgctgatc gcgtcatcgg gggtgatcga gtcattgagg ttatcgcgca 2256361 gcccggccag gaactgctcg aacgccgccg ccgccgtagc gtcggcgccg ccgagcaggg 2256421 cgtggatacg ggtggtcagc gtcgcggcga tgtcggcgac atcggcggcc cactgctccc 2256481 aataggtccg ggtgccaacc ttgtcgacga tgcgcgcgta gatcgcttcc tgccactgcg 2256541 acaacgagaa catcgccaac tgctccgcga cggcgggtcc cgcctcgtcg gaggtcggcc 2256601 cgatgtgacc gcccaacagc ttgtcgctgc cttcaccggt cttcgtcggc ttcacgttca 2256661 gcgcaatgct gttcaccatc gcgtcgaagc gctcgtcgtg cgaccgcaac gcgttgagga 2256721 cctgccacac caccttgaac cgtttgttgt cggccaacgc ggcagacggc tcgacaccct 2256781 cgggcaccgc caccggcaag atgacgtacc cgtagtcctt gccgggcgac ttgcgcatca 2256841 cccgaccgac cgactgcacc acgtcgacga tggaattgcg cggattcagg aacagcaccg 2256901 cgtccagcgc gggcacgtcg accccttcgg agaggcagcg ggcgttggac aggatgcggc 2256961 attcatcctc ggcgaccacg cctttgagcc aggccagctg ttcgttgcgg accagcgcgt 2257021 tgaacgtccc gtccacgtgg cgcaccgaac acgccaggcc cgggccgtcg tcaaccaatt 2257081 cgcggtatgc ctcaaccact ttcgggaaca gctcggcaac ctgcttggac gtcttgatgt 2257141 ccttggcgaa cgccaccgcc cgacgcatcg gcggctcacc ggcgacaatg ccggtaccgg 2257201 accgcttggc caggccattc cagcagccga cgatcttgga ggcgtcgtcg agcatcagct 2257261 cgccggaaac cccggagagt tcctgctgca accggggcgc gatcacgccc tgatcgacgg 2257321 tgagcaccat caccttgtag tcggtgagca gcccgcgctc caccgcctcg ccgaacgaca 2257381 gccggtgaaa ctccggcccg aacgtcagct cgtcgtccat cgacaccaac tcggcggagt 2257441 gctggtcggc cctgtccttg atgctctcgg tgaaaatcct tggcgtggcg gtcatataca 2257501 gccgccgggc cgccttcaga tactgaccgt cgtgcacccg cacgacgttc gactcatcgt 2257561 cccccgccag cgtcacgccg gtggtgcggt gggcctcgtc gcacatcacc aagtcgaact 2257621 cgtcgacccc cagccgttgg gccttggcca ccgtgggcag cgactggtag gtgcaaaaca 2257681 ccacggtcag gccctgggcg cacctgcggt gcgccatttc gtgcagcaat acccgcgcgt 2257741 cggtggtgac cgggatcggc acatcgtgga cgtggtagtc ctcggccgag cgcgacacct 2257801 tggtgtccga gcacaccgcg aacgcccgca catccagctc actctgtgcg gtccactccc 2257861 gcagcgtctg gctcaacagc gaaatcgagg gcaccagcaa cagaatccgc gcgctgccgc 2257921 cgttgtcggc ggcgatgcgc tcggcgatct tgagcgcggt gaacgtcttg ccggtgccgc 2257981 aggccatgat cagcttgccg cgatcgttgc ccaccgcgaa cccgcggaac accgcgtcga 2258041 tcgcctgctg ctggtgcggc cgcagctcgt ggcgtttggc cggggtcagg ttcacctgca 2258101 ggtcgccggc cggccaggcg atgtcccagt cgatcggcga ttcggcgatc tcggccatgc 2258161 cgatgcgctg caccgggacc aactgatcgg ccagcgcgtc ctcggcattg cggccccacc 2258221 gatccgtcgt ggagatgatc acccggttgg tgaagcccgt cttgcccgac gcggtgaaaa 2258281 acgagtcgat gtcccccttg gccagtgtgt gcgtcggctc gtagaacttg cactggatcg 2258341 cggtgtagtt gccggtgtca cgttcgcggg cgaccaggtc gattccggtg tcggtcctgc 2258401 cccgccgctc cggccagtcg atccaccgcc acaccgcgtc gtactgctgg gccatcgtcg 2258461 ggtccagctc gaaatagcgc accatcaact gctcgaactt ggtcccgcgc tccgcgttcg 2258521 acggagcctt ccggaacgcc tcgatgacgt cgtgcaccga ccccatagtt caatgaccat 2258581 actggcggca accgacacgt ggcgggatcc ctcgcgttcg atccaaccca accagctcgg 2258641 ccaaccgcat cgcgggccgg catcttcgcc gtcctaactc gggaaatagc ggttgtcact 2258701 atctgagcgc agctatctca tttgcggaga actagccctg atcaattcct gcctcggtta 2258761 cgtgtgtcat gatcagccgg ccagttcgag gttgaggtga ccttcacata gtgaagcctc 2258821 ccgggtttcg tgcgcacctt ctttcgaggg aaggacgcac gctgagctgc gagttcgtcg 2258881 ccgagcatcg agcccggttc gaggtcgctg cgatctgtcg cgtgctgtgt gggcagggct 2258941 gcagatcacc cggagaacct tctacgcctg ggcagcgtcg gccgccgtct aggcgtgccc 2259001 tgcgggagat gacggtcacc gagcccctgg ccggttacga cgggcccgat accgatggcc 2259061 gccgtaagcc cgagtcactc tacggtgcgg ccacgatctg ggatcgacga gccatgttca 2259121 gccggatagg cgtggatgag ggcggtggtc agcttgggaa cggtgtgggt gagttcgtgt 2259181 tcggcgtcgt gggcgatgcg gtgagcttgc gcgaggtcca gggcggggtc gacgtcgagt 2259241 tcggcatcgg cgtgcaagcg gtgtccgatc cagcgcatcc gcacgctgcg taccgcctgc 2259301 acgccgggcc gggccgccag ggcttgttcg gcggcatcga ccatcgctgg gtcgacgccg 2259361 tcgagcaggc ggcggaacac atctcgcgcg gcagttcgta gcacggccag aatcgccgcc 2259421 gtgatgagca ggccgacgat ggggtcggcc agtgggaacc caagtgcgac accgccggcc 2259481 gagcacagca cggccagcga ggtgaatccg tcggttcgag cgtgtagtcc gtcggcgatc 2259541 agggcggccg agccgatgcg gtgcccaacc ctgatgcggt agagggcaac ccactcgttg 2259601 ccgatgaatc cgaccagccc ggccagggcg acccagccga catgctcgat ctgctgcggg 2259661 tggatcaggc gggcgatggc ttcgtaaccg gcgatgatgg ccgacatcgt gatcatcgcg 2259721 accacgaacg acccggccag gtcctcgacg cgaccgaatc cgtaggtata tcggcgagtg 2259781 gcgggcttgg cgcccaacgc gaacgcgatc cacaacggca ccgcggtcaa cgcatcagcg 2259841 aagttgtgga tggtgtcggc ggccagcgca accgaccccg acatcaccac gatcacaatc 2259901 tggatgagcg cggtcaaccc gagaaccaac aagctgatct tgaccgtacg gatccctgcc 2259961 gcagtggatt ccagggtgtc gtcgacgctg tcggcggcgt cgtgggagtg cggcgcgaag 2260021 atctccttga tcatcgccgg cacacctcgt gaatgagcgt ggtcgtgggt catcgggcgc 2260081 aggccctttg tgacagcagg ccagatcggc cgcgttcgac caccaagcaa gctcttttat 2260141 ctgcgttcat acgcagataa tagcggatgc tctcgccggt tccagtacta gctgggacgg 2260201 acgacgatca ccgggattct caccgaatgg gctaccgcgg aactcaccga acccaacagc 2260261 atgccggaaa accccccgcg cccatggctg ccgaccacca ccagctgagc ttgctcagaa 2260321 tgctcgagca gccaccgagc gggcttgtcg cacaccagcg atcggtgcac gcggacatcc 2260381 ggatactgct cttgccagcc ggcgaggcgt tcagcgagga cctcagcctc tctcttctcg 2260441 cgctctcgcc aatccatccc cagaaccgga aacatcccca gatcggtcca ggcgtgcaac 2260501 gccaccaggt ccacccttcg gcgggaggct tcgtcgaagg ctagggccgt tgccgcctca 2260561 gaggctggcg atccgtcgat gcccaccaac accggtgcat cggagtcggg agtcgcgcca 2260621 ttaccggaat gaatgatggc cactggacac cgcgcatggt ggagcaacgc ggtgctgatc 2260681 gagccgagca gcagtcgacc caatgcgccc atcccctggc tgccgacgac catcaaccaa 2260741 gcctgttggg atgcatcgat aagcgtcggc acaacattgg aaaagaccaa ctcggtatgc 2260801 acctgcggcg gtttggactc acccaagctg ttggtgagcg cctcgcgggc ctgctcaatg 2260861 acctgctgtg cgttgtcctt ttgccactca gtcatattcg cgtacagctg gcccaccggc 2260921 cagccgacaa ccacaggggc aacaatgtgc agcagggtga tgggcagctg gcgcatgacg 2260981 gcctcacggg cggcccaggc taccgccgcg ttggattgcg ctgatccgtc gacgccaacg 2261041 agtattccgt atttcgctgt cgcagcagac atttcacgct ccttgcggtc ggaacacagt 2261101 ccatcaatcc atcagcgcag cggtgcagac caccgcagca aggtgcctcc ggtcggcatg 2261161 ttctcgactg tgaattcgcc gcccgcgtcg tcggcacgct ggcggagatt gcgcaggccg 2261221 ctttcggtga tgtcgccgga gatgccgaca ccgtcgtcga cgacctcgac ccgcacatca 2261281 tcctcgacgc tgacgttgat ggccaggctg gtcgcgttcg cgtgccggac agcgttgcta 2261341 accgcctccc gcagaaccgc ttcggcgtgg ttggccagga cggtgtcgac aacggacagc 2261401 gggcccgtgt actggaccgt ggtgtgcagc gcggggatcg cgagttggtc gatgaccttg 2261461 tccagtcggt ggcgcagacc cgtcgcccgg gagggcccgg cgtgtaggtc gaagatcgca 2261521 gatcgaatct cctgaatgat ttcctggaga tcgtcgatgc tgctgtagat ggattcccgg 2261581 acggcgggga cacgtgctcg cggagcggca ccctgcaggg tgagcccgac tgcgaagagc 2261641 cgctggatga cgtggtcatg cagatcacgt gcgatccggt cgcgatcggt caggatctcc 2261701 acttctcgca tctgtcgctg cgcggtcgcc agccgccagg cgagcgcagc ctggtcagcg 2261761 aaggcggcca tcatatcgag ctgtttgtcg ctgaacggct gttcatcggc actgcgaagt 2261821 gcgaccagca caccggcaac agtgtcggcg gcacgcagcg gcagcaccag ggcgggcccg 2261881 ggctccaccg ggccgtcgac cgcgaggtca agccggtcga accggcgggg cgtacggtcg 2261941 tgaaagactc ccccgatcga cgttccgctg acggcaaccg tcatttgctt gaccgccggg 2262001 gagatctctc cggccacctc tacgatgacc aggtcgtcga cctcgcaagc cggcgcttcg 2262061 tcgtcgagcg gcaccgccac caaggtggct gccccagcca tcaacgtcaa cgcttcctcg 2262121 gcgatgagcc gaaacaccat ggccgggtcc gcaccggcca gcatctgcgt tccgatgtcg 2262181 cgggttgcct cgatccacgc ttcccgggtc cgtgattcct cgaagagacg ggcattgtca 2262241 acggcaatcc cggccgcggc ggccagcgcc tgcaccagca cctcgtcgtc atcgctgaac 2262301 ggctggccat ctgccttctc ggtcaagtaa agattgccga acacctcgtc gcggatgcgc 2262361 actggaaccc cgaggaaggt ccgcatcggc ggatggtgca gcggaaatcc aaccgatgcg 2262421 ggatgccgcg agatatcgtc cagccggatc ggctttggct cctcgatcag cgcgccgaga 2262481 acacctcgcc cctccggcaa tgagccgatg aggtgccggg tctcttcgtc gatcccctcg 2262541 tagacgaatt cgaccaatct atggtcgtaa ccgcgcaccc cgagcgcccc gtagcgggca 2262601 tccaccaact cggcggcggt atgcacaatg gcgcgcaggg tggcgtcgag cttgagtccc 2262661 gatgtgatcg ccaagatggc gtcgatcaga ccatccagcc ggtcgcggcc ttcgacgatc 2262721 tgttcaatcc ggtcttggac ttccagcagc agctctcgca accgaagctg cgacagtgtc 2262781 tcgcgcactg gcgggctgcc agggttaacg ttcgccctgt cagggtgtgt cacatagcta 2262841 tgttgacacc ggagctgcgc tcaaccaact ggtctggcta cccagcggca cagtcacaga 2262901 tactgctgac cgacgaccag cagggtgcag ccggcctcct gcaacacggc gttgcccggc 2262961 gctcccacaa gttgctccac atgctcctgg tcgctcgcgc tgagcaccac catgtgtacc 2263021 gatcgaccca gcccagccag ataatccagc agctcgccgt gcactgccgc cgattgcacc 2263081 cgcacatcgg gataccgtgg ttgccaacgg gcaagccagc ggtccaggct ggcacggacg 2263141 tcgtccccgg tatcgcccac tccggattgc cggcaggtga ccacccgaac cggcgagtcg 2263201 cgcagccgtg cttcggccat caccgccccc agcaaaacac cgatatcgga cgacccgtcc 2263261 gcctcgacga cgatccatgc ggcgtcgcgt ccgatgggga cccggtgggg tcgcacgatc 2263321 gccactgggc actgcgccga taacgccagg gccgctgcgg tagatcccac ccgctccggt 2263381 cggaagtggt gcacgccgat agcgccaacg cacaccaggg cagcagccgc cgaagcgcgg 2263441 atcaacgagg tgaccggccg ctcctgggtg atctccacct cgaccttgac cggccggtcc 2263501 gccgcctcga ccgctgtgaa cgcgtagcgc accgcgttct cggcggcggc gagtttgcga 2263561 gccgccgcgc cgtgtgcggc gtacccggga tcgtcgggtt cgatcgcgta cagcagacgc 2263621 agcgggatgt cacggctggc tgcctcgtcg accgcccaca gtgcggcttg cacggccggc 2263681 ttcgagccat caataccgac gacgatcgat gggggtttgt gtgattggtt catggcgagg 2263741 cttccgggtt aacgatcggg tgccaaacgt attgatcctg cccgacttcg gtgggttcgg 2263801 ccgccagctc gaagaacctc tccacatcgt cgcgattgca ggccgcggtg cctggcgtca 2263861 gcagcatggc tgcacctgcc gcgtttccca agcgaacgga cttgatgagc gaccagccac 2263921 ggctgaggcc cacggtaatc gcggccacca tcgcgtcgcc ggcgccgaca ccgctaaccg 2263981 cggtcatcgg aatcgacgaa aatcgatggc tcgcatgtcg tgtggccaat agcgcgccct 2264041 gagatccaag cgagaccacc acgacctcgg cgcgcccacg gtcaatgagt tcgtgtgcgg 2264101 cggccagttg ttcgggctcg gtcagcagtt cggatccgac gcactcgcgc agttcccgca 2264161 cgctcgcctt gagaagaaac accccggacg aaatgtgctg caacccgcca ccagatgtat 2264221 ccaggatcag cggagtgctc gatcggcggc agatgtcggc aacccgctga tagtagtcgg 2264281 cagccacacc tggcggcagg ctgccactgg ccaccacaaa ggcggccgaa gccgccgcac 2264341 cgcgcagttc gtcgaggcat tgctcctgct ccgcgacggt cagcgacggc cccggaagca 2264401 cgaaacgata ctgcttggcg gtcctggact cgttgaccgt gaagctctcc cgcgtcgagg 2264461 ccgcgatcgg aatgacgcga aatggcactc ccgcatcacc gagcagcgcc atcagcaggc 2264521 tcccggtcga cccgccggcc gggaacagtg ctgtcgagca accgccgagg acatgcacaa 2264581 tgcgggcgac attgataccg ccgccgccgg gatcgtagcg aggtgcgcca caacgcattt 2264641 tctcggtcgg gcgcaccacg tcgacgctcg tcgtgatgtc caaggcgggg ttcatggtca 2264701 aagtgatgat tcgcggcttg ccttcgtccc acgccgctgg ctccgtcatc gtcgtggact 2264761 ctgcgctaca gaccggtcgg gtaggtttcc gggttctcgc cggcgatcca ccggctcgtc 2264821 acctcgagag gttccagggc acgggtctga tcgatgtgga tcatggcgtc gaactggtcg 2264881 gcgggccgca cgtgcaagta gtgactttgc cgttccgttg ccggtagata aacgacgccg 2264941 atggcacgtc ccaaccggac aacgtccagc ggggcttcgg cgtcgcggct tagccgcgct 2265001 gacaccagga aactgtctgc agtctggtgg aagagctcct cgacactgcg tgcagtgccg 2265061 gccgaaccgc tttgcgttgg gcgataccac cccattcgct ggccgcggtg acggtgcccg 2265121 tgtacgtgct gaatccgatg ctgcgcgact cgtcaccgta tcgctcacgg actatctggc 2265181 cgagggtgag ctgcccgtcg gcccacacct cggtagcgcg tgcgtcaccc acgtgggagt 2265241 tatgagccca caccactatt cgcgccggcg gcgcatcgag gtgtcggtcc aaatgcgtca 2265301 gcaaactgcc aagggtctgc gccatgtgct ggtcgcgcag gttccacgag gtaacgcgtc 2265361 cactgaacat ggcccggtaa tacacctctg cgtcgcgcac cgtctgcgcg ttttgctggg 2265421 cgtagaacag ttcgtcctcg gcaagcagcc cgtcttggcg cgcatacgcc agggcattgc 2265481 gctgaacgtc gaccagttgc tcgacggctt cacgttcgca cgacggaccg gcgccgaatg 2265541 cggccgcgaa tccgtacgcc tgaccgtcat cggcgcaggc atggtcgaag cacgcatacc 2265601 gggcccgcgc ccgtgccgcc gcacgcgggt cgaccttgtc gagatagctg atcacctctt 2265661 ggatcgaccg atgcaggctg taaagatcca gaccgtagaa gccggcttgc cgcagcgcgc 2265721 ccgactcgta gcgctggttg cgtgtgcgca gccattccac aaaatctcgg accacggtgt 2265781 tgcgccacat ccaggcggga aaccgctcga atccgctaag cgcctcgtca gcgttggtgt 2265841 cctcgccgag gccgcgaacg taccgattga cccggtaggc gtcgggccag tccgcctcgg 2265901 cggctaccgc accaaagccc ttctcctcga tcagccactg tgtcatggcg gcccgggcct 2265961 ggtagaactc gtgtgtgccg tgcgagcttt cgccgatcaa cacgattcgt gcatcgccga 2266021 ccagctccgc caacacctcg tgcgtcggaa cacccccggg ggcgtcgatc gcgactctgc 2266081 gcagaacatc ggccgccgtt gacgccgcgg gccggcgcag cgacggccca gcggtcgggg 2266141 tggccaggag ccggcggacc tcctcgtcgg tgacctgccg gaagtcccaa aacgactcac 2266201 cgacggccag gaacggggtc ggcatggtcg cgcacacaac gtcgtcgacg aggccggcga 2266261 actcccggca cgtggactcc ggcgccgccg gcacggcaat cacgatctgc gctggttgcg 2266321 catcgcgcaa tgcctgtacc gccgcgaaca tgcttgcgcc ggtggccaaa ccgtcatcga 2266381 cgacaatgac cgtcttgccg gtgatatcgg tgggcgggcg ctcgccgcgg taggcggact 2266441 cgcgccgaag cagttcccga ccctcacgtt cggcgatgtc gcgcagttgc tgcggtgtga 2266501 tccgcaggcc ccgcacgacg tcgtcattga ccacgacgcg gccgccgctg gccagtgcac 2266561 caacggcgaa ctcgtcatgc cccggggcac caagtttgcg cacgacgaag gcgtctagcg 2266621 gggcatgcag tgccgcggca acctcccatg cgaccgggag gccaccccgg gccaagccga 2266681 gcacaatcac gtccggctgg tcccgatagg cggcgagtaa ttccgccagc acccggccgg 2266741 cctcgcggcg gtcacggaac acgcgccgcg gcgagcgccg ggtgacatca gccgctgcgg 2266801 tcatcagcac ggacccagtg gtcagttggt ggaccggatc tgaatgtgct tttcggttgg 2266861 cttcccttcc gaaaccgcca ccgacacagt aagaatgccc ttgtcgtagg tggccttaat 2266921 gtcgtcctcg tcagcaccta ccggcagcga caccgtgcga acgaaggaac cgtacgcgaa 2266981 ttccgagcga ccgtcgaagt ccttctgctc ggtgcgctcg gccttgatgg tcagctgacc 2267041 atcgcggacc ataatgtcga cgtccttgtc ggggtcgacc ccgggaagct ccgcgcgtac 2267101 ctcgtagcgc ccctctttca tctcgtcttc cagccgcatc aaccgggtgt cgaaggtggg 2267161 ccggagtccg gcgaatgacg ggaaggccgc gaacagctca gaaaactcgg ggaagaggga 2267221 ccgcgggtgg cgctgaacgg gaagggtggt ggccatttga tgcctcctaa tcgatggaaa 2267281 cggatgcctt tgatccgacc agcccatcgt ggccagggct agggacagaa gtccccgaag 2267341 cgcgggccat ttgtccgcgc ccgtcggtga tccacttggg gaccattgac cctgttgtct 2267401 gccaaccgcc gttcagaaag atcggggtga tatcgaacag cggaggttga tcatgccgga 2267461 caccatggtg accaccgatg tcatcaagag cgcggtgcag ttggcctgcc gcgcaccgtc 2267521 gctccacaac agccagccct ggcgctggat agccgaggac cacacggttg cgctgttcct 2267581 cgacaaggat cgggtgcttt acgcgaccga ccactccggc cgggaagcgc tgctggggtg 2267641 cggcgccgta ctcgaccact ttcgggtggc gatggcggcc gcgggtacca ccgccaatgt 2267701 ggaacggttt cccaacccca acgatccttt gcatctggcg tcaattgact tcagcccggc 2267761 cgatttcgtc accgagggcc accgtctaag ggcggatgcg atcctactgc gccgtaccga 2267821 ccggctgcct ttcgccgagc cgccggattg ggacttggtg gagtcgcagt tgcgcacgac 2267881 cgtcaccgcc gacacggtgc gcatcgacgt catcgccgac gatatgcgtc ccgaactggc 2267941 ggcggcgtcc aaactcaccg aatcgctgcg gctctacgat tcgtcgtatc atgccgaact 2268001 cttttggtgg acaggggctt ttgagacttc tgagggcata ccgcacagtt cattggtatc 2268061 ggcggccgaa agtgaccggg tcaccttcgg acgcgacttc ccggtcgtcg ccaacaccga 2268121 taggcgcccg gagtttggcc acgaccgctc taaggtcctg gtgctctcca cctacgacaa 2268181 cgaacgcgcc agcctactgc gctgcggcga gatgctttcc gccgtattgc ttgacgccac 2268241 catggctggg cttgccacct gcacgctgac ccacatcacc gaactgcacg ccagccgaga 2268301 cctggtcgca gcgctgattg ggcagcccgc aactccgcaa gccttggttc gcgtcggtct 2268361 ggccccggag atggaagagc cgccaccggc aacgcctcgg cgactaatcg atgaagtgtt 2268421 tcacgttcgg gctaaggatc accggtagcg ggcgccgccg ggaccgcgtc taagcaccgc 2268481 agctgaatcg ggcggatgat gtgtcgatga gcggatccgg cgatggcgac ggtgtcgcgc 2268541 ggttgggcag acatcttccg cggctattcg tccccggccg gctgagtgac gaagtcgatc 2268601 agttcttcca cccggccgat caacgccggc tctaggtcgg tccagtcgcg tacttgcgaa 2268661 cggatgcgcc gccacgccgc ggcgatgtcg gcctggtcgg cgtgcggcca gccgagcgca 2268721 tcgcacacgc cgtgcttcca ctcgatgtgc cgcggcaccc tcggccaggc ggccagccca 2268781 actcgttgtg gcttgaccgc ctgccaaatg tcgacgtagg ggtgcccgac gaccaaagtg 2268841 tcggagccgc ccggtccgcg gcgcaccacc tcggcgatgc gcgcctcttt cgaccccgcg 2268901 accaaatggt caacgagaac gccgagccga cgccgcgggc cgggccggaa cttggcgacg 2268961 atctccacca ggtcgtcgac gccaccgaga tgttcgacga cgacaccttc gattcgcagg 2269021 tccgctcccc ataccgccgc gatgagttca gcgtcgtgtc ggccctcgac atagatccgg 2269081 ctggcccggg ccacccgggc acgcgcgccc ggcaccgcga ccgagccgga tgccgttcgc 2269141 ctcgggccgg cagccgctgc gcaccgcggc gcggtgagga tcaccggcag gccgtcgagt 2269201 agatacccgg ggcccagcgg aaacccgcgg gtcttcccgt agcggtcttc caagtcgatg 2269261 cggccatatt cgactcggac caccgcaccg acgtagccgg tctcggcgtc ttcgacgacc 2269321 atgccgagct cgaccgggtg ctcaaccgag cggggccggc gccgcccgcc tgcggcaagc 2269381 acgtcggttc catagcgatc cagcacgccg caatactagg gagcctctct gccggtcatc 2269441 gccgcgacgc gccgcatggg ttctcggaaa atgcttgtac cagtcgactt tccggcgggc 2269501 caacgtcgcc aaccgatact cggctccaac gccatgggtg acgggatgcc cggatcacgt 2269561 gtcacaccac ccgcgcaccc ttgcggaaga atatccgtaa gtctaaactt acggttcgtg 2269621 tccacttaca gatcaccgga tcgcgcttgg caggcgctgg cggacggcac tcgccgggcc 2269681 atcgtggagc ggctggcgca cggcccgctg gccgtcggcg agttggcccg cgacctgccc 2269741 gtcagccgac ccgcggtgtc acagcacctc aaagtgctca agaccgccag gctggtgtgc 2269801 gaccgccccg cgggaacacg ccgcgtctac cagctcgacc cgacaggcct tgcggcattg 2269861 cgcaccgacc tcgaccggtt ctggacacgc gccctgactg gctacgcgca gctcatcgac 2269921 tccgaaggag acgacacatg acacgcccgc gaaccgatgc catccaccac cacgttgtcg 2269981 tcaacgcccc gatcgagcgt gcgttcgccg tgttcaccac gcggttcggc gacttcaagc 2270041 ctcgcgagca caatctgctt gctatcccga tcaccgagac ggtattcgaa tgccatgcgg 2270101 gaggccatat ctacgatcgc ggtgttgacg gaagcgtgtg caaatgggcg cgcgtgctgg 2270161 tctatgaacc gcccagccgg gtgctattca cgtgggatat cggcccgact tggcggccgg 2270221 aaaccgatct ggccaagacc agtgaggtcg aagtccgctt caccgcgcag tccgccgaga 2270281 cgacacgcgt cgacctcgaa catcgccatc tcgaccgaca cggtccgggc tgggagtcgg 2270341 tcgccgacgg cgttgacagc gaggccggat ggccgttata cctacgccgc tataccgacc 2270401 tgctctgcat ccaggtgcag ccatgatcgc ggcagacgac gataccgaga agtccatgat 2270461 ggacatggcc cgcgccgagc gggccgaact agcggcgttt ctgactaccc tcacactgca 2270521 gcaatgggaa acacccagcc tgtgcgccgg gtggagcgtc aaagaagttg tcgcacatat 2270581 gatcagctac gaagatctcg gcgttttcgg gttgctcaag cgctttgcca aaggccggat 2270641 cgtccgggcc aatgaggtgg gtgtcgacga attcgctggg ctcagcccac aggagttggt 2270701 tgactatgtc ggccggcatc tccaaccgcg tgggctgaca gcgggtttcg gcggaatgat 2270761 cgccctcgtc gatggcatga tccaccacca ggatatccgc cgcccgctcg gtcagccccg 2270821 caccatcccc gcgcagcgac ttgaccgcgt gttgcggctg atgccgaaga accccaggct 2270881 gcgagctcgg ccacgcatca aagggctgcg actgcgagcc accgacctcg actggacaat 2270941 cggcaccggg cccgaagtaa ccgggcccgg cgaagccttg ctcatggcaa tggccggcag 2271001 gccagcggcg gtcagcgacc tctccggccc cggaaagccc acgctagccg gacgactcgg 2271061 ttaacgacag ctacagcgac ggcgtgaacg ggccgccgca gtcagccaga taatcggcgt 2271121 aattccagtt cgccaagaac ttttgacccg cctgaaatcc gcgttggtaa agagcctcgc 2271181 gttgttcggc ggtgatgtcg aagtcgatcg gactcacgtc gtgggcgggc acgaagatgg 2271241 tgcgccgaac ggtacacgga tcgtcgatgt aggcgttgtc ctgattgctc accagtgttt 2271301 cgatcgccgc gatgcccaac gacactggcc cttggaccgg ccgggtaggt ggaatgcccg 2271361 gacgcgctga caacctgatc ccgaacgtgg gccatcgcgg ttcagcgtcg gttcggtcga 2271421 acagcgccac cggaaagttc gacagcaagc caccgtcgac ccaggtagcg ccgcgcaccc 2271481 gaacaggctc gaacacaaac gggatcgccg atgaggcgtg caccgcacgc gccaccgaga 2271541 agtcgtccgg gtggatgccg taggagtcca ggtcccacgg gatgcgaacg agtcggcgac 2271601 gggataggtc gctggcggtg accaccagcg accaggcgaa ctgttcgggt gcctcgccgg 2271661 tgcgcaagtc gccaaaggtg tgcacgccta ggtcagcgag caaaccgccg agcagctgtt 2271721 ccagataggc cccgcggtaa acgccgtccg acaacagcag agaaagtccc ccgccgatca 2271781 acggcacgtg tcctatcaga ttgcggtcga ggaacttcgg gtagtcgatg ctgcgcatca 2271841 tctcggcaag ccgcgtcacc ggctcaccgg ccgtttgtag ggccgcgacc agcgacgcga 2271901 cgatcgcacc cgcgctgctg cccgccaccc tgggaaatcg gtaaccggca tcggccagcg 2271961 cgtccaccgc tccaaccaac cctatccccc ggaccccgcc gccttcacac accaggtcga 2272021 cgcgtgctgt gctcaccagc gccacgttag cccggaatcc gacgcccgtc gacggcgaag 2272081 aagtgcaggt gtcccggtgt gggacatagc cgcacgcgac taccccgctc gggcgggccg 2272141 cggccgtcca ctcgagcgac gattgactgg tccatttcgc agccgcccga cacgattcgg 2272201 ccatacaagt aggcgtccgc tccaagttct tcgaccatgt cgacgtccat ctcgatgccg 2272261 gcgccgccca gctccaaatg ttcggggcga acaccgataa tgacctcggc tgccgtaccg 2272321 acgaccgcac gcggcagcag gatctgccaa tcacccagtg acaccgtgga atcggcgatg 2272381 gaaagcctga acaggttcat cgccggggaa ccgatgaacc ccgcgacgaa cacgttgccc 2272441 gggttgcggt agagctctcg aggcgaagca cactgttgca gcacaccgtc agacagcacc 2272501 gcgacgcggt cacccatcgt catggcctcg acctggtcgt gagtgacata cacggtggtc 2272561 gtacccagtt gccgttgtaa cgcggcgatc tgattgcggg tttgcccgcg aagtttggcg 2272621 tcaagattgg acagcggttc gtccatcagg aatacctgtg ggcgccgcac gatcgcacga 2272681 cccatcgcca cccgttgccg ttggccgccg gagagatctt tcggcttgcg atccagataa 2272741 gattgcagat caagcaattt cgctgcggca agcacccgct cgcggatctc ggccttgccg 2272801 atcttggcga ccttcaacgc gaagcccatg ttctgcgcca ccgtcatgtg cgggtagagg 2272861 gcgtagttct ggaacaccat ggcgacatca cgatccttgg gatcgacctc ggtgacgtcg 2272921 cgctcgccga tccggatacg cccacagtcc agcgtctcca agccagccac catccgtaac 2272981 gacgtcgtct tgccacatcc ggacggcccc accaggacaa cgaactcgcc atcgccgacg 2273041 atcaggtcga gccgatccag ggccggtcgg tccgtgccgg gatagcgccg ggttgcctgc 2273101 tcaaaactca ccgaagccat ggttacccgc cgagcccagt caccgcgata ccacggacaa 2273161 aggaacgttg tgcgaccgca taaaggatga ccaacggcac cagcatcagc atcgacgccg 2273221 ccatcagcac cggccaccgg gcgacgtatt cgccccgcaa tcggaccagg ccaagggtga 2273281 gcgtcgccag gctgtttcgc tggatcatca gcagcggcca cagaaagtcg ttccacacgt 2273341 tgacccaggt gagcacaccc agcaccagca ccgcgggacg tgaatgcggc agcagaatcc 2273401 gccagtagat ctgccacggc gagcaaccgt cgagaatcgc ggcttcctcg agatcggtcg 2273461 gcagcgtgcg gaagaactgc cgcatcaggt aggtaccgaa cgcgctaccg aacaatcccg 2273521 gcacgatcat cgcccacggc gtatccaccc accccacgat ccgcatgaga atgaactgtg 2273581 ggatgacggt caccgtcaac ggcaccatca aagtgctcaa gtacaagacg aacaacgtat 2273641 cgcggccccg gaactgcagt cgcgcgaagg cataaccggc caacgagcag aagaagacct 2273701 gcccggcggt gacacatccg gcatacagca cggtgttgaa gaacatccgc cagaacggca 2273761 tcaacgcgaa cacctcgcgg tagttggacc attgcggatg cgacgggaac agcgtcggct 2273821 cggtcacctc gccgtccgcc ttcagggagc ccgacagcgc ccagatgata gggaacagcg 2273881 cgcaccaagc gatcccgatc agtcccgcgt acagggcaag cccacgaatg aagtggcggt 2273941 ggactattcg atcagcccag cccacgggac gcctcccagg agcgccggtg cgtaattcgc 2274001 aactgcagca cggtcaacac cagcaagatg gcgaacatca cccacgccaa cgcggacgca 2274061 tagccgaatt ccaggaacga aaacgcgtgc tggaacagca tgatgcccaa aacataggta 2274121 gccgtctcgg gaccaccgtt ggcaccggta aggacgtaga caaggtcaaa cgcctggaac 2274181 gcgtggatga tcgatatgac aaccacgaat gacaatgccc cccggatcag cggtaccgtg 2274241 atggacacga actggcgaat ctcgccggca ccatcgatcc tggccgcctc gtacacagtc 2274301 tccggaaccc cctgcatcgc ggccagcagg acgaccgtgg cgaagggcac actgcgccag 2274361 acgctgacca ggcaaagcga gaccatggcc catcggggtt cgattagcca tgggatgggg 2274421 ccgattccca gccagccgag catgatgttg agtaggccat tgtcggtgtt gaagacgaac 2274481 tgccagacga ccgccatcac caccgaggaa atcgccaacg gcaagaagac gaccgtccga 2274541 aagaggctga tgcctttgat tttccggttt agaaaggcgg cgacgacgag gctgacgata 2274601 acggtcggta ccacggtgcc gacggtgtaa accgcggtgt tgaccacggc gatgagaaac 2274661 agcggatcag aagtgaagag gtttctgaaa ttgtccaacc tcacgaacgt cgcatgcgta 2274721 aacaagtccc acttctgaaa gctcatgtac agcgagaatc ccagcggaaa cagcatgaac 2274781 accacaacgg cagccaagtt cggcgcgacg aacatacgcc ccgcccacgc gcgtcgcccc 2274841 ctgcgccgtg tcatggattg cgcagcactt catcgacggc ctgtgatagc ccggtcagcg 2274901 aggtcgccgg ccgggatcca cgcagcacgg gtccgaagta gcggtccatc agggcggcga 2274961 tcttctccca ggccggggtc accggcaagc cttccgaata ggccggcccc tcgctgagca 2275021 cggcaagatt gcctaccctg cggtgggcgt tggcgaatcc gtgcgagttg atcgccgatc 2275081 tcagcaccgg cacgaacagg cgggattcgc cgatcaatgc ctgccccacc gggccggtcg 2275141 cgaactttac gaattcccac gcctggtcct tgcgtcgact ggtcgccgca atggccagcc 2275201 cggtgacacc gatatctgaa caggcggctc gtccgcgcgg accgatgggc agtggggcga 2275261 cgtcgaagtc cagaccgtcg gcacggtcga acgtctgata tcgccagtgc ccggccaacg 2275321 cgatcccggc cttgcccaca gaaaacaggt ccgccgtcga catcgactgc tgctcagcag 2275381 cgctgggggc caccttgtgc ttgttggtca ggtcggcgta gaactgcacc gcttcgagga 2275441 acccgtcgtg gtcgaaattg aggtgggtgg gattcatccg cggaaccgac cacggtacac 2275501 cgttattcat ggcgaacaac ccggcagcgt agaacgagac ccacgcgttg acgaagcccc 2275561 attgcctgtc ccgtcccgac cggccctgct tggtaagcgc ctgggcggca tccaggaatt 2275621 cggcgaagct ccatggccgt tcccagctac cgggcggcgg tggcacgccg gcgtcgtcga 2275681 atagctgttt gttgtagaac aagaagttgc cggaccattg ctccggaaag gcgtactggc 2275741 ctccgttgaa cgtgaaagtc tcatacaggg ccccgatgct gtccgatttc agctccgcgg 2275801 cgaaaacctg gtcgcgcgcc aatagcgtgt tcaggtcaag caacaccccc cggtcggcca 2275861 gttcggcata ggtcagttcc catgccatca gcacatccgg acacttgcca cccgcgcaaa 2275921 acgttgcgag ctgctgcatg acgccgggtc cggacaacag ggcccgtacc ttgatatcgg 2275981 gatagcgccg ctggaattcg ttgacgacgc gcatccgggg acggagctcg tccggattgg 2276041 ctgcaaaaaa gaaagtcaac gcgtcatcgt catcggcagc acacccagcg gcccagggag 2276101 ccagcgaggc cgcagtaagc gcgcccgcac cccgtaacag actgcgccgc tcgaacggct 2276161 tattgaccat cgtgctcccg attttgggtc ctgtggtaca acgaccgtca ggctgggaag 2276221 taccgaatcc gattgatccg gttgccgcgc cacggcacgt cggcgaacat gatgccgcgc 2276281 cggtggtccg acgcgagcga caccgcgacg gtggatccgg caccggtcac cttggtccag 2276341 ctagccccgc gcagctgctc gaacagctcg acaatgtcga gtagctcatc ctcgcctaaa 2276401 gtcattgtgg cagtgcgtga taacgcgtga tacgcagcgg acttgtctgc ccgggacgcg 2276461 gcgttgagga acgtttccac cagcttcttg tgccgccggc ccgcccggcg aaagccggtc 2276521 aggaatcctg cggtgccgcc caacccttga ttgcctagca gcgctcgcga cagttgcagg 2276581 gcgggtcttg tggcccccga tcccgtgcgc agaaactgca gcatcatcgc cggcaactcc 2276641 cagtacgccc gcagtgcggc aatctgccac tcgccggtaa ccggtcgtag gtcatagcgt 2276701 aggaaggcgg gaatgaacac cgtcacagcc gagtccatcg cgacctcgag ttcgagatcg 2276761 cgcagcacca ccgtgccgga gacgatatcc agatcgcgat ggaacgtgat atcccgcggc 2276821 ccgatgaagg tgtcgtagaa gcggccgatg gcctcatgcc ccacctgcgg ctgcgaaccc 2276881 accgggtctt cgacccgcgc gtcaccggtg aacaacccga cccagccggc gcggtcgtgc 2276941 gcggcggccg cttgcggcga gcgctccacc gccgccaaca gttcatcccg gttcggcggt 2277001 gccatcagga gctgcaaacc aactcgacgc tggcggtgcg catctcctcc agcgcggcga 2277061 cggtggtatc ggccgacaca cccgctgtca ggtccaccag caccctggtg gccaagccat 2277121 tgcgtaccgc gtcctcggcc gtctggcgca cacaatgatc ggtggcaata ccgaccacat 2277181 cgacctcatc gacgccgcgt tgccgcagcc aattcagcag tggcgtgccg ttctcgtcga 2277241 ctccttcgaa gccgctgtac gctccggtgt aggcaccctt gtagaacacc gcctcgattg 2277301 ccgacgtgtc cagactggga tggaagtccg cgccgggagt accgctgacg caatgcggtg 2277361 gccacgacga ggaatagtcc ggtgtgccgg agaagtcgtc acccgggtcg atgtggaagt 2277421 ccttggttgc cacgacgtga tggtagtccg ccgcttcggc caggtagtcg ctgatggcgc 2277481 gggccagcgc ggcgccaccg gttaccgcca gcgagccacc ctcgcagaag tcgttctgca 2277541 cgtcgacgat gatcaacgcc cgcatacgtc caccatacgt tcgggcgact gcccgggcag 2277601 tttgcctacc gacgcggcag ccacagatat agggtccatg acgccgcgac gatcgcgaac 2277661 atgaccagct gagcggcggc cacccaaccg gcgggataga tcacgccggt gatgtagtga 2277721 gcgacaaatc cgtccggtga cagaggtgtc atcgcggcct tggtgcgagc ccagcgctcc 2277781 acccaggtca gcgggcagtc gacccgctta gcggcgatgc cgatccccca tatcaccgcc 2277841 ggaacatgca gccacatcgt gcgtcgccac cgcagggcaa ggaaaccgcc ggcaaggacg 2277901 taagcgatga aagcgaagtg cattaccacc gttgatacaa cgacggtttc gtacatctct 2277961 cgggttgcct ttccaggtcg cggcgctccg gccactgaca gaaaaggttc aattcgccag 2278021 cgaaaacccg tcccatgcga tccggcggtg ctgatgcgga tcgaactcga tgcggcacct 2278081 gcggtcgaaa accaggacgg cacggtcgtc ttgggtgtaa gccggccagt cgtcgcccgg 2278141 aacaccaatt tggctgaaac aacgccagcg gcgttgcacc tcgttgctga cccgaagggc 2278201 ggcacggcgg tcggcggcgg cggtcagcaa tgcgccaaat ctggtgcgat agatgtcgaa 2278261 gacggcaaac agttcggtgg catgggtggc gccgaaaccc gaccagcgca gcgtccgtgg 2278321 cgcgtagtca tatcggtata ggtaggtggg cgcattggcg ccgtgagcct cggcgatctg 2278381 ccaggccgcc gagctaaagg cgaagtcacc accgagctgg atgcacgccg agggcgcagg 2278441 gtaattcggg taggcggcgg taatgcgttc acgatcggcc ggtttcatgc ccgacagtag 2278501 ctcttcaacc atcggttcgt tggtcggcag catccccaga aagcgggtga acaaccgacc 2278561 ctcttcggcg ttggttccca cgatcagcgg aaccgcgtgc acccggccgg accgcatcgc 2278621 ctcgacgggg tccatgggca ggtagtcgtc gccgaacacc ggaccaatcg ggaaggcgcc 2278681 cagccttttc cgcattccct ggcgaatcag gtggtgttgg gcttccacca gctgcgcggg 2278741 ggacgcctgc atcaacgcat tggcggcatc ctgggtacgc gcgccgatca gattggcaaa 2278801 gcgtgccgcg aactcggcgg ccacctcgcg cgaacgcacc atgcccgccg ctgggctttc 2278861 cgagatcgcc ctggcgaata ggcctttggc ggctggcacc gccaacagtg tggcggtgat 2278921 atgcgcgccc gcgctttcgc cgaaaatggt gacattgcct gggtcaccgc cgaactccgc 2278981 gatgttgtcg tggacccaac gcaacgccaa caccaggtcg cgcaggtaca cgttgctgtc 2279041 gagggtgatc tgcggtgtcg acaaggacga caggtcaaga caccccaacg cgcccagccg 2279101 gtagttgacc gacacgtaca cgcagccgcg gcgtgccaac gctgcgccgt cgtatatcgg 2279161 ggttgccgag ctgcccagga tgtagccccc accgtggatg aacaccatta ccggcagcgg 2279221 ctgggtggct ggctcttcgg gtgtgacgac gttgagggtg agacagtcct cgctgcgggt 2279281 ctggtacctg ccgatgccca tcacggtgta gcggcgctgc tgaggagcac agttggcaaa 2279341 cgtgtggcag tgccgtacgc ccggccaggg ctgcgctggc tgcggcgccc ggaatcgcag 2279401 cgagcccacc ggcgccctgg cgtaagggat tgatcgccaa cggtgcacac cgtcgcgcgt 2279461 gaagccttca acgatgccgg tggccgtgcg ggcgcgcacg gtgcgctcgt gcatagaccc 2279521 gacggtagcc gactccaggg ccacgcggca tgcgcagtgc aggaatgggg gcggggcggc 2279581 tagcctgtcg ggatgcggat cgccgcgctg gtcgcagtgt cgttgctgat tgcggggtgc 2279641 ccgcgcgagg tcggcggtga tgtagggcag tcgcagacca tcgccccgcc ggcgcccgcc 2279701 ccgtcggcgg cgccgtcaac accaccggcc gcaggagcgc cgatcaccac tatcgtgtct 2279761 tggattgagg cgggtcaccc ggttgatccc gccgcctatc acgtcgccac ccgcgacggc 2279821 gtcaccaccc agcttggcga cgacgtcgcg ttcagcgctt cgtcgggcac ggtggcctgt 2279881 atgacggatg ccaggcacac tagcggcacc ctggcctgcc tggtccgact cgcgaaccca 2279941 ccaccccggc ccgagacggc ctacggcgaa tggaagggcg gctgggtcga ctttgacggc 2280001 atccacctgc aggtcgggtc cgcccgcgcc gacccgggcc cgttcgtcta cggcaatgga 2280061 cccgagctgg ccaacgggga cacgctgtcg atcggggact accgctgccg ctcctatcaa 2280121 gcgggcctgt tctgcgtgaa ctacgcccat cagtccgcgg tccggttcgc cagcgccggg 2280181 atcgagccgt tcggctgcct gaagccggcg ccgccacccg acggcgtggg cgttgcgttc 2280241 ggctgctgag gtgcacccgt cacaagctga cacgacgaac taggttcagc gactgagatc 2280301 gcttcccgga agcgccggcc catcttcgga cgccagctca accacatgaa tttccccggt 2280361 agccccgtcg acctccacca atgctcctgg tggcagaaac cgggtagctc cctgggcgtc 2280421 gaccacgcaa gggaatccga actcgcgggc gaccaccgcg gcatgtgaca tcgggccgcc 2280481 gagctcggtc accacggcgg cggcgtagca gaaggccgcg gtgtatccga cgtcggtgac 2280541 ctcggcgacc agaatctcgc cgggctgcaa atcgtcgatg gtctccggac gcacgatccg 2280601 cacccggccg cgcacccgtc cgccgcagac gccgactccg cgtagagtgt ccccggctgc 2280661 cagcgccgcc gccgacgaag gcgacggttc ccagcttccg ctgaacaccg tgggcggaac 2280721 gatgccggca agcctgcgct gttcggcacg gcgccgagcc accagccccg acacgtctgc 2280781 cggcagcgca tcgatttcat cgaccaagag gtagaacaca tcgtccgggg tgtcgaagac 2280841 gccggcctcg gtcagccggc gcccgtactc ccgcagcaga gcacgcagca cccagatggc 2280901 acgcaccatc ctgtcgcggc ggacctcgcg gtcgcggagc tggcgggccg ccagcaacgc 2280961 aacgggcttg gcccgcaacg gaatcaccgg cgtcggcggt tgcggcgctg gcaccgcacg 2281021 tagcgtcttg gctaccatcc gcaccagcaa ctcggggttg tcggcatagc tggtggcggc 2281081 catctcgact tccgccggac cgcggtgccc gatcagcgtc agctcggcca gcaccgcgga 2281141 atggaactcc ggcgcctcga cagctagctt gtccagacgc tcccccggct cggccagcaa 2281201 ccgaatcacg accggatccc gccgtgccgc ggccaccagc cgctgcaccg cctccaccga 2281261 tcgcgcgctg accaactccg gcccggccgc cggtgcggtg tcccgcccgc acaatcctcg 2281321 caacaacacg ttgaacgccg cacacagcat gaacgacccc gaggccagca cccagccgtg 2281381 cacgacgtgg tcacgtgcca acaagatcag gctcaacaac cggcggtcgt cgtgggtagc 2281441 gaggttatcg aaggcgagac gctccaggcg atcgacgtcg gcgacatagg catcggtgtc 2281501 gcggggtgag ccggcggaca ggcccaccag gttgacgccg aacaccccga tattgcgtag 2281561 cgtacgtaac cacctgcggg cacggctgga ttccgatggc ggtcgctgcg cgccaaagat 2281621 gggcagcgaa gccatgctgg gtccgaagaa cccgctgttg ctgacgatcg tcgccggctt 2281681 ggcgaagggg acggttgctg ccatgaaatg cgccgacgtg atggccccgt acagccggtg 2281741 ggcgaacacc gcgacggtcc gcatggcgat ttcgcgctgg atcaccccgc tgggccgcag 2281801 ccgctcggcg atgcccaccc cgccggcacg caggccccgc acagtcaccg atgccgacga 2281861 cggcgagaac gggccgggca gcgcctccga gaggttggtg gccagatagg tcgggaagcg 2281921 cgggtcgatc ggcgtgtcga actcgccgtt ggccccttct gggccggcca atctgggtgc 2281981 gacaccgtcg tctgccgggg agtcgaccgc cggaaggtcc tggatgttag ccagccgcca 2282041 aggcagggaa aatgttcgct ttcccagacc gatccggcca cgcaccgcca gggtgaagtc 2282101 ttcgagacac tcctcggcgt tccaggccgg ctggaatccc cagcggtcac gcaggagcgt 2282161 gacatccatc aatggcgcgc tgtgcaggag ttcgagttcg gcgaacgagg tgacacgtcg 2282221 tagcactggg gagccaatag gcaccatggg ccgcccgagc gcggccgcaa tgcgccgaaa 2282281 cgtcaactcg ccaggggcgg cgagattaac agggccgctg tcgattaccg tgtccagtag 2282341 cgcgcgaacc aacagccgct gcgcgtcgtc ggagtggacg acttgtacga cgcgatcagc 2282401 atacccggcg ggtaacaccg gcagagcaaa cagccgctgc acccagttgt cgacatttcg 2282461 accgaaaatg agcgcgcagc gcacggcgac ccattccagg ccgcagtcgg ccagcatctg 2282521 ctcgacgcgg ggttggtgac cgctggacgt gaaaacgatg cgcccggttc cggtctcggc 2282581 catcgccttg aggacattgg cggtgccgtc gatattgatg tggtcgtttc ggccacgcac 2282641 ccacgcacaa tgcgcgacca catccgcacc tgtcatagca ctttcgacgg cggtggcatc 2282701 ccggatatcg gccgcaatga aatccgctga gctcggccag ctgtccggtc gatgacgtgc 2282761 gattccgacg acctcgtgac cctgactcag caatctggcg gtcaggccgc ggccgagaac 2282821 tccgctggcc ccggtgacgg cgattctcac ggtcctactc gtcgtcgttc cgaaacgccg 2282881 cgttgaccag gtcgtcgagg tccatgtccg cgatctcttg ttctgcggtc ggcgccaacg 2282941 ccgggtcctg gccgctggtt tcggtttcat ttgccagcgc gagcaacaga tccagcactc 2283001 ccgcctgccg taagcgcttg accggaatgg acgccacaat gcgttgtagt tcggcttccc 2283061 cggccgccac ggctgaagtg tcttgcggtg atgagccgag cagttctcga cgcatatagc 2283121 cggccagcgc cgcggagttg gggtagtcga agatgagcgt gggtgaaagt gaaaggccgg 2283181 tggcggattt gagccggtta cgcatttcga ccgcggtcaa cgagtcaaaa cccaggtcct 2283241 ggaatgccct atccgggtcg atggcttcgg ggctggcgct acccagcacg gtggcgatgt 2283301 gcgagcgcac caggtccagc aggacggcgt gttgctcgtc ttcgggcagc ccgtgcaggc 2283361 gatgcgcgag cgccgatttc gatttcgccg cggccaacga gtcatcgacc tggcgcctgg 2283421 tcggcgcgtt gatcagatcg acgaacatcg gcggcaacgt gccgccatcg aacttgacct 2283481 tcaacgccgc aaagtcgatg tgggcgggca gcatgaatgg ctcgtcgacg atcattgcgg 2283541 tgtcgaacaa ttgcagggcg tcagcagacg acatcgccac gatgccgtcg cgggcgaagc 2283601 gtttgaagtc caccgtcgcc aggccgctgg tcatggcgct ggcctgatcc cacagacccc 2283661 agcccaggga gatggccggc agcccatggg cccgccggtg ggcggccagc gcatccaaaa 2283721 acgaattggc ggccgcatag ttggcctggc ccgacgatcc gaccagcccg gccatcgacg 2283781 aaaacatgac aaacgccgac acatccaggt cgcgagtcaa ctcgtgcagg tgccacgccg 2283841 cgtccacctt ggaccgcaac accacatcca cccgatccgg tgtcagtgac atcaccaccg 2283901 cgtcgtcgag tgcgccggcg gtgtggatca cgcccgacaa tggatgctga accggaatat 2283961 cggcgatcac cttggccaac gccgctcgat ccgccgcgtc acaggccacc acctgtacct 2284021 gcgcaccggc ggcggccaac tcggccacca gctccgcagc cccgggagca tccgggccgc 2284081 gccggctcac caacaccaga ttgcgcaccc catgacgagc caccacgtga cgggccaccg 2284141 ccgaacccgc catcccggtg ccaccggtga tcaacaccgt gcccgccgcc cacgagccgg 2284201 gcatcagcat gacgaccttg ccggtgtggc gcgcctggct cagataacgc aacgccgcag 2284261 gcgcgcgccg cacgtcaaaa gtggtgaccg gcaacggccg cagcacccca tcgccgaaca 2284321 gcgtggcgag ctcggcaagg atctgcgcaa tgcggtccgg tcccgcttcg aataggtcga 2284381 aggcgcggta gcgcacgccc gggtactgct gggcgatcac gccggggtcg cggatgtcgg 2284441 tcttgcccat ctccaagaac accccacccg gtgccaccag acgcagcgac gcatccacga 2284501 attcaccggc cagcgagtcc aacaccacgt cgaaccctcg accgccagtg gccgcgcgga 2284561 acttgtcctc gaactctagg ctacgtgaat cggatatgtg gtcgtcgtca aagcccatgg 2284621 cgcgcaaggt gtcccactta cccttgctcg cggtcgcgaa cacctccaac cccagatgcc 2284681 gagccagctg caccgccgcc atgcccaccc cgccggtgcc ggcatggatc aacacgcgct 2284741 ggcccggttg tacgtcggcc aaatccacca gcgcgtagtg ggcggtggcg aacaccaccg 2284801 aggtggtggc ggcggccgtg tgcgaccacc ccgccggcac cttgaccagc agccgctggt 2284861 cggtgctggc gacggttccg gtgccctcgg ggaacaggcc cattacccgg tctccgaccg 2284921 cgaaagatcc cttgttcaag ctggtttcga taacgacgcc gcaggcctca acgcccatga 2284981 ccgcgtccgg atcgggatac agacccagcg cgatcatgac gtcgcggaag ttggcggcaa 2285041 tcgcggacac cgcaactcga acctgcccgg ggcccagcgg cgcgtcggca tcgggaatca 2285101 gctccagccg cagattctcg aaggtgccgg cggtgctcat cgccaaccgc cacggccggt 2285161 cactcggagg aaccaacagc ccgcccaccg cgcggctacc gtgcacccgc gccgtataaa 2285221 cctccccgcg ccgccacaac acctgcggct cgcctgtcgt cactaccgcc gccagggccg 2285281 aatcgtcgag cggcgcatcg gaatcgacca gcacgatccg gcccggatgc tcggtctgcg 2285341 ccgaccgcac caatccccat acggcggcac ccgccaaatc ggtgacatct tcgcccggca 2285401 atgccaccgc accgcgggtc atcaccacca aaacccctgc cccatcacgg gttagccacg 2285461 actgcaacac atcaagcacc gaactcgtgg cggcatacac gcccgccact acgtcaccgg 2285521 ccagaggcac cgactcaaac accaccgccg ccgagtcctc cgttgtcccc caggcgcaca 2285581 ccggtagcgg ctccaccgcg gccgatggct gcggcgacca ggtgacctcg aatagccggt 2285641 ccggacccga gctcgacacc gccgcccgca attgctgatc ggtcaccggt cgggccagca 2285701 tggaagcgac tgacaacacc ggcaatccca acccatcggc cagctcgatc gacaccgccg 2285761 acggacccac tggcgcgatg cgggcccgca ccgccgacgc ccccgctgca tgcaacgaga 2285821 ccccctgcca ggagaacggg accaacaccg aaccttggcc acgctcggcg ctttccgcgc 2285881 tcaacaccac cgcgtgcaag gccgcatcca gcagcaccgg atgcaccccg aagccggtga 2285941 ccgagacccc ggcatcggcg ggcaacgcca cctccgcgaa cacctcatca ccccggcgcc 2286001 acatcgcggt cagtccccga aacgccggcc cgtagccgta tccgcgctcg gccagctgct 2286061 gatagccgtc cgccacctca accgggacgg cgcccgccgg cggccacatc gctagatccg 2286121 cggtcggttc cgccgacccg gcgcgcagcg cgccctcggc gtgcaacacc cagccggtac 2286181 cgacgtcacc acgcgaatac accgacaccc cgcgcacgcc ggactcgtcg ggaccattga 2286241 cgaccacctg aaccgccacc gaaccggatg cgggcaacac caacggcgcg gccagcgtta 2286301 attcgtcgac aacgccacaa cccacttcgt cgccggcgcg gatcgccaac tccacaaatc 2286361 ccgctcccgg gaagatcgtc acgccggcaa cggagtggtc ggccaaccag ccctgcacgc 2286421 tgggcgacag ccgacccgtc aacaccaccc cgcccgaggc cggcagatcg atcaccgcgc 2286481 ccaagagcgc gtgctcactg gccgccaacc ccaagctggc cgcgtccgcc gcgacaccat 2286541 caccggacag ccaaaaccgc cgccgttgga aggcatacgt cggcaactcg acaaactgcg 2286601 cctcgcctac cacagcgcgc caatccaggt ccataccggt gacaaaccct tgcgcgacgg 2286661 cgttggtcaa cgtcgccggc tcggggcgat ccttgcgcag cgcagacatc gttgtcaccg 2286721 caacgtcggg caacgactct tcgatcgacg caacaaggcc accgctgggc ccgacttcga 2286781 ggaatcggct gcctccggcc gcctgcgcga agcgcacact gtcggcgaac cgcacggctt 2286841 gccggatgtg acgtcgccag taggccgctg atccgaaatc gtcgcccgcc aactgcccgg 2286901 tcacgttgga gatgactccg atggtgggcc ggccgatggc gattccggca gcgacggctg 2286961 cgaattcgtc gatcatcgga tccatcaacg gcgagtggaa cgcgtgggaa accgccagct 2287021 ggtggactcg tcgtccgtcg gcgcgcagct ggtcggccac cgcggccacg gcgttttgtg 2287081 cacccgaaat caccagtgac gctggaccgt tgaccgcagc gatgtcaacc tcagcgctca 2287141 gcagcggccg cacctcttcc tcggcggctt gcacggcgac catcgcccca ccggccggca 2287201 acgcctgcat gagccggccg cgggcagcca ccaacaccgc agcgttctcc aacgacagga 2287261 caccggcgac atgtgccgca gacaactcac cgatcgagtg gcccatgaca aaatccggtc 2287321 gtacacccca ggatcccagc aaccggaaca gggcaacttc caccgcgaac agcgcgggct 2287381 gcgcgaattc cgtgctgttc agtaggtttt cgtcgtgacc ccacatcact tcgcgcagtg 2287441 ggcgcagcag atgccggtca agttcgccca ctacggtgtt gaacgcctcg gcgaacaccg 2287501 ggtatccggc gtgcaatccc attcccatgc ccagccattg ggagccttgg ccggggaaga 2287561 cgaacaccgt cttacccgcc gcagtcgccg tgccccgaac aaccgagccg cccaactggt 2287621 cacccgccag ctcatcgagc ccggccaaca accgatcacg gtccccgcca accaccaccg 2287681 cccgatgctc aaaaaccgaa cgacccgcca acgaccaccc cacatcggca acatcgaggc 2287741 catcatcgcc acgcacgtac gcggccaacc gagccgcctg cccccgcaac gccgactccg 2287801 acttcgccga caccacccac ggcaccaccg gccccgccca accagcctcc cgccgcggca 2287861 ccaccggcac cgcctcgata atcacatgcg cattagtgcc actaatccca aacgacgaca 2287921 cccccgcacg acgcgtccga gcaccagcag gccacacccg cggcgcggtc aacaactcca 2287981 ccgcccccgc cgaccaatcc acatgcgggc taggcacatc cacgtgcaac gtcgccggca 2288041 acagctcatg gcgcatcgcc aacaccatct tgatcacccc ggccaccccc gccgcggcct 2288101 gcgtatgacc catattcgac ttcaccgacc ccaaccacaa aggttctccc ggctcccccc 2288161 gatcttgccc ataagtggcc aacaacgcct gagcctcaat cggatccccc aacgtggtcc 2288221 cggtcccatg cccctccacc acatccacct cggccgcgct caacccggca ttggccaacg 2288281 ccgcccgcac cacccgctgc tgcgaaggac cattaggcgc ggtcaaccca ttcgacgccc 2288341 catcctgatt aaccgccgac ccgaccacca ccgccaacac cggatgaccc aaccgccgcg 2288401 catccgaaag ccgctgcagc accaacatcc caccgccctc ggagaatccg gtgccgtcgg 2288461 ccgccgcggc gaatgccttg cagcgcccgt ccggggataa tccgcgccag cggctgaatt 2288521 ccacgaagat gtcgggtgtg gcgttgacgg tgacgccgcc ggccagcgcc agatcgcact 2288581 cccccgaccg cagcgatccc accgccatat gcaacgccac caacgacgac gaacacgccg 2288641 tatccaccga caccgccgga ccctccaacc ccagcacata ggccacccga cccgaggcga 2288701 cgctggacaa ttggccggtc agccggaagc cttctaccgg ctcggcggcg aacatgccgt 2288761 agccttgcgt cattaccccg gcgaataccc cggtggcgct gccgcgcaat ccggtcggat 2288821 cgataccggc ccgctccaac gcctcccagg acaactccag caacatccgt tgctgggggt 2288881 ccatcgcgag ggcctcgctc ggccccaccc cgaagaaggc ggggtcgaag tcgccgaccc 2288941 cgtccacaaa gccgccggtg cgggtgtagc acgcacccgc ggcgtcgggg tcggggttgt 2289001 atagcccggc caggtcccac ccgcggtccg ccgggaattc ggagagcacg tcgcggccct 2289061 ggatcagcat gtcccacatg tcgtccgggg aattcacccc gccgggatag cggcacgcca 2289121 tgcccacgat cgcgatcgga tcctcgctcg tggtgcgtac cgcgggtgtg tgcttgattt 2289181 cctgtgggag gccggcaagt tcggtgcgga tataggaggc cagccgattg ggtgtcgggt 2289241 agtcgaagat gagcgtgggt gaaagtgaaa ggccggtggc ggatttgagc cggttacgca 2289301 tttcgaccgc ggtcaacgag tcaaaaccca ggtcctggaa cgccttgtcg gggtcgatgg 2289361 cttctggcgt gatgttgccc agcacggtgg cgatgtgcaa acgcaccagg cccagcaaga 2289421 cggcgtgctg ttcggcttcg ggcagcccgt gcaggcgatg cgcgagcgcc gatttcgact 2289481 ttgcggcggc cacggagtcg tcgacctgac ggcgggtcgg cgcgctggct aggtcggaga 2289541 acatgggcgg caccgccacc gcatgggctc gcagtgcggt gaggtcaatg cgggcgggcg 2289601 ccaggaatgg ctcgtcgacg atcattgcgg tgtcgaacag ttccagcgcc tcagcggtgg 2289661 acagcgccag caccccttca cgacccagcc gggccaggtc tgcggcgtcc aggccgctgg 2289721 tcatggcgct ggcctgatcc cacagacccc agcccaggga gatggccggc agcccatggg 2289781 cccgccggtg ggcggccagc gcatccaaaa acgaattggc ggccgcatag ttggcctggc 2289841 ccgacgatcc gaccagcccg gccatcgacg aaaacatgac aaacgccgac acatccaggt 2289901 cgcgagtcaa ctcgtgcagg tgccacgccg cgtccacctt ggaccgcaac accacatcca 2289961 cccgatccgg tgtcagtgac atcaccaccg cgtcgtcgag tgcgccggcg gtgtggatca 2290021 cgcccgacaa tggatgctga accggaatat cggcgatcac cttggccaac gccgctcgat 2290081 ccgccgcgtc acaggccacc acctgtacct gcgcaccggc ggcggccaac tcggccacca 2290141 gctccgcagc cccgggagca tccgggccgc gccggctcac caacaccaga ttgcgcaccc 2290201 catgacgagc caccacgtga cgggccaccg ccgaacccgc catcccggtg ccaccggtga 2290261 tcaacaccgt gcccgccgcc cacgagccgg gcatcagcat gacgaccttg ccggtgtggc 2290321 gcgcctggct cagataacgc aacgccgcag gcgcgcgccg cacgtcaaaa gtggtgaccg 2290381 gcaacggccg cagcacccca tcgccgaaca gcgtggcgag ctcggcaagg atctgcgcaa 2290441 tgcggtccgg tcccgcttcg aataggtcga aggcgcggta gcgcacgccc gggtactgct 2290501 gggcgatcac gccggggtcg cggatgtcgg tcttgcccat ctccaagaac accccacccg 2290561 gtgccaccag acgcagcgac gcatccacga attcaccggc cagcgagtcc aacaccacgt 2290621 cgaaccctcg accgccagtg gccgcgcgga acttgtcctc gaactctagg ctacgtgaat 2290681 cggatatgtg gtcgtcgtca aagcccatgg cgcgcaaggt gtcccactta cccttgctcg 2290741 cggtcgcgaa cacctccaac cccagatgcc gagccagctg caccgccgcc atgcccaccc 2290801 cgccggtgcc ggcatggatc aacacgcgct ggcccggttg tacgtcggcc aaatgtatga 2290861 atgcgtagta cgcggtggtg aagacagccg agatggcggc ggcttcggcg taggaccagt 2290921 cggcgggcat cggcagcagc agccggacgt cgccggccac cagggtgccg ctgccgtcgg 2290981 ggaagaatcc gaacaccgaa tcaccgaccg agaattcggt gacaccgggg ccgacctcga 2291041 cgaccacgcc cgcgccttcg ccgccgagca gcgcgtcgtg ggtgaacatg cctagggtga 2291101 tcatgatgtc gcggaagttc gcggcgatgg cgcgcatggc cacccggacc tggccgggcc 2291161 ccaacggtgc gtcggcgttg ggaaccggct cgagccgcag attttcgaag gtgcccgcgc 2291221 tgcccagacc caaccgccat ggcccatcgc ccggcggcac caagatggca tccgccgcgc 2291281 ggctgccgcg cacgcgcgcg gtgtacacct gtccgccccg cagcactacc tgcggctcgc 2291341 cagtcgccaa cgccatcgcg atcgccgcgt cgtcggtggc cgcatcggaa tcgaccagca 2291401 cgatccggcc cggatgctcg gtctgcgccg accgcaccag cccccacacg gccgcgcccg 2291461 ccagatcggc gacgtcttcg cggggcagcg ccatcgcgcc ccgggtcgcc accaccagca 2291521 ccccggattc atggtcggtc agccacgact gcactgcggc cagagcctgg tggctgcgca 2291581 cgtagctgcc ggctaccgga tcttggtcag ccgcaaccga ttcaaagatc tggtaggcgg 2291641 gggtaggccc cggggacgtg gccgccgacg cgggcgacca gatcacttcg aacagccggt 2291701 cgggacccga gcccgacacc gccgccagca gctgccgctc ggtcaccggg cgggccacca 2291761 tcgaggccac cgacaatacc ggcagaccca gcccgtccgc caactccacc gacaccgccg 2291821 acggccccgc cggcgcgatc cgggcccgca ccgccgaggc ccccgtggca tgcaacgaca 2291881 cgccctgcca agcgaacggc aatgcgagtt cgtccgggtc gccggcgatc acgaccgcat 2291941 gcaagacggc gtccaacaaa gccggatgca caccgaaccc accgactccc ccggccgcct 2292001 ccggcagcct cacctcggcg aatatttcct cgccgcgggc ccacatcgcg gtcagcccgc 2292061 gaaacgccgg tccgtaccgg tagccgcgtg tcgccaaccg ctcatagcca tcggccacgt 2292121 ccaccgtcac ggcacctgcc ggtggccaca ccgataggtc cgcgcctggt tcaaccgacc 2292181 cgggccgcag gataccctcg gcatgcaaaa gccagcccgc ttgcgcgtca gctcgggaaa 2292241 atatcgacac accacgggaa ttcgaatccc ggccagcgtc gactaccacc tgcaccgcaa 2292301 cggagccggt ggcgggcaac agcaggggtg cggccagcgt cagctcgtca agcaccgagc 2292361 agccgacttc gtcgccggcg cggatcgcca gctccacgaa tccggtgccc gggaacagca 2292421 ccacgtctga aacggcgtgg tcggccaacc acggctgcac gttgggcgac aaccgacccg 2292481 tcaacaccac cccgccggag gcgggcaggt cgaccaccgc gcccagcaac gggtgttcgc 2292541 tcgcacccaa ccccaaaccg gatacgtcgg cgcctgagcc ctcggccgag agccaaaacc 2292601 ggcgcttgtc aaaggcatac gtcggcagct ccacatagcc cgctccgtcc agcgtgcccc 2292661 gccagttcac agccaccccc gccacaaacg cggacgccgc cgagagcagg aatcggtgca 2292721 gcccaccatc tccacgcccc agcgtgggga cgacaatggc ctcgctgtca ccgtcggtgc 2292781 acgcggcgaa tgtttcctcg acaccggtaa tcaacgccgg atgcgggctg gattcgatga 2292841 acgtgcggta gccctgctcg caggcgttgc gcaccgcctg gtcgaatagc acggtctggc 2292901 ggacgttgcg gtaccagtag tcggcgtcca aaccagctgt atccaaacga tttccggtca 2292961 ccgtagagaa gaagacggta cgcgtggatc gcggttcgat gccggacaga gcttcggcga 2293021 gtgggccacg gatcgcctcg acctccaccg aatgcgaggc atagtccacc tcgatccggc 2293081 gggtccgcag ttccttggtg gagcacaccg cgatcagctc ctccagcgcg cccacttcgc 2293141 ccgacaccac caccgccgag gggccgttga cgacggcgat gctgacccga tcgccgaagg 2293201 gcgccaacaa atcccgcgcc tggtcggcac cgcacgcgat ggacaccatg ccgcccgggc 2293261 cggccagtcc ggccagcaac ttgctgcgca gcgtgaccac ccgtgcggcg tcgcgcagcg 2293321 acagcgcgcc ggcaacgtag gcggcagcga tctcgccttg cgaatgaccg atcaccgcat 2293381 ccggatgcac tgcgaccgac ttccacagct cggccagtga caccatcacc gcgaacagca 2293441 cgggctgcac cacatccacg cgatccagtc ccggtgcacc gggggcgcca cgcagcacgt 2293501 ccaccagcga ccagtcgaca aattccgcga acgcctcggc acacgcgtcg atctgctgcg 2293561 cgaatgccgg tgcggtatcg agcagttcga ttcccatgcc cagccattgg gagccttggc 2293621 cggggaagac gaacaccgtc ttacccgccg cagtcgccgt gccccgaaca accgagccgc 2293681 ccaactggtc acccgccagc tcatcgagcc cggccaacaa ccgatcacgg tccccgccaa 2293741 ccaccaccgc ccgatgctca aaaaccgaac gacccgccaa cgaccacccc acatcggcaa 2293801 catcgaggcc atcatcgcca cgcacgtacg cggccaaccg agccgcctgc ccccgcaacg 2293861 ccgactccga cttcgccgac accacccacg gcaccaccgg ccccgcccaa ccagcctccc 2293921 gccgcggcac caccggcacc gcctcgataa tcacatgcgc attagtgcca ctaatcccaa 2293981 acgacgacac ccccgcacga cgcgtccgag caccagcagg ccacacccgc ggcgcggtca 2294041 acaactccac cgcccccgcc gaccaatcca catgcgggct aggcacatcc acgtgcaacg 2294101 tcgccggcaa cagctcatgg cgcatcgcca acaccatctt gatcaccccg gccacccccg 2294161 ccgcggcctg cgtatgaccc atattcgact tcaccgaccc caaccacaaa ggttctcccg 2294221 gctccccccg atcttgccca taagtggcca acaacgcctg agcctcaatc ggatccccca 2294281 acgtggtccc ggtcccatgc ccctccacca catccacctc ggccgcgctc aacccggcat 2294341 tggccaacgc cgcccgcacc acccgctgct gcgaaggacc attaggcgcg gtcaacccat 2294401 tcgacgcccc atcctgatta accgccgacc cgaccaccac cgccaacacc ggatgaccca 2294461 accgccgcgc atccgaaagc cgctgcagca ccaacatccc accgccctcg gaccagccga 2294521 ccccatcagc ccgcccggcg taaggcttgc accggccgtc gggtgccagc ccacgatgcc 2294581 tgctgaattc cacgaagacc gtcggtgtgg cgttgacggt gacgccgcca gccagcgcca 2294641 gatcgcactc ccccgaccgc agcgatccca ccgccatatg caacgccacc aacgacgacg 2294701 aacacgccgt atccaccgac accgccggac cctccaaccc cagcacatag gccacccgac 2294761 ccgaggcgac gctggaggtc atcccggtca gccggtagcc ctcgatctcc tcggccaaca 2294821 ttccgtagcc gccgacgatg agcccggcga ataccccggt ggcgctgccg cgcaatccgg 2294881 tcggatcgat accggcccgc tccaacgcct cccaggacaa ctccagcaac atccgatgct 2294941 gtggatccat cgctaacgcc tcgctgggcg aaataccgaa gaacgcggga tcgaaatccg 2295001 cgacgccatc cacgaagccc ccagtgcgcg cgtacgactt atggcgcacg tcgggatccg 2295061 ggtcgaacaa cccggccaga tcccacccac ggtcggtggg aaattctgac atcacgtccc 2295121 tggcgtcggc caccatctgc cacagccctt ccggggaatc gacgcccccc gggaagcgac 2295181 acgacatgcc cacgatcgcg atcggctcgc tcgagcgctc cagcaacgca cggttggtgc 2295241 gcttcaggcg ttccacctgg accagcgctt tgcgcagcgc ttcggtcgca tgctggagtt 2295301 gatcaaccat tactaacctc gcctaactct cgctaatatt ggccgtcgcc gaccgccgga 2295361 tgcggctccc gccgagtcac cgaagttgct gcacaaaacg acgccgtcgt acggcgctct 2295421 ggcgcaagtt cgctggtgag tattgccaac tccggcagga tttcaaagcg tccaatactc 2295481 cctgggcacc agtgcgcccg tgcaaagcct gccgtccatg gcgcgactgt acccgcccgc 2295541 ccgtcaacgc cggatgggcg catgtcaatg cggtgctagc ggtggtcttc acaacacagc 2295601 cgcacgaatg cagcgactag gcgccggctc ggcgccaccc atcggcagcc ctggcggccc 2295661 ggatcagctc gtcgcacaga tcgcgcagtt cggtcgccgc ggctccttcg tcgagcgcgg 2295721 tgacgacatc ctcggcggcg catcgcacct ggtaaacacg atccgacaga tcggccgcgt 2295781 cgtcggctga caacacgacc gcatcggcgg gcagcgccct cacctcaccc cgggtcagca 2295841 tggcgcgctg ctcgtaagcc cgctgccggc aagactgccg gcaataccgg cggcgacggc 2295901 ccatgccgac gtcggtcacg tcacggccac accacccgca cggctgcgga cgggcacgac 2295961 gagtcatgcc tgcagacatt agtccgcccg ggtgtccgat cccggtatca ttgatggtcg 2296021 cgccgcgcgc gtcgcgtgcc gggaactacg cagacggccg cagcgtttgc caaccggagc 2296081 cagtcgccag tacgcaacct accagcagag cccagggctc acaggaccta aaggagtagc 2296141 gcccatggct gatcgtgtcc tgaggggcag tcgcctcgga gccgtgagct atgagaccga 2296201 ccgcaaccac gacctggcgc cgcgccagat cgcgcggtac cgcaccgaca acggcgagga 2296261 gttcgaagtc ccgttcgccg atgacgccga gatccccggc acctggttgt gccgcaacgg 2296321 catggaaggc accctgatcg agggcgacct gcccgagccg aagaaggtta agccgccccg 2296381 gacgcactgg gacatgctgc tggagcgccg ttccatcgaa gaactcgaag agttacttaa 2296441 ggagcgcctc gagctcattc ggtcacgtcg gcgcggctga cccgggaacc ccctgctccc 2296501 ggccgggcaa tgtccggtcg tgcgcgtgcg tggtccgagc gcgaaaggcg tccctcgatg 2296561 ccccagcggg cgactttgac cagcgcctca cgaatgttgg acccgctcat cttggacaca 2296621 ccgagctcgc gctcggtaaa ggtaatcggc acctcggtga cgacgaaccc gttgctcacc 2296681 gtgcgccagg tgagatcgat ctggaagcag tagcccttgg agtccacgcc gtccaggtca 2296741 atcgcttcga gtgcttcgcg gcggtacgcg cggtagccag cggtgatgtc gtggatcccg 2296801 attccgagcg ccaggcgcga ataggtgtta gcggttttgg acaggactag ccgccgccaa 2296861 ggccagtttc gtaccgtccc ccccgcgaca tagcgcgaac caatcgcaag atcggcacca 2296921 gcgtcgacgg cgtccagcag gcgctgcagc tgttcgggcg cgtggctgcc gtcggcatcc 2296981 atctcgacca gcaccgaata ctcccggctc aacccccagg cgaaacctgc caggtacgcc 2297041 gcgcccaaac cgttcttggc ggtgcggtgc atcacgtggg tgcggccggg atcggcctgc 2297101 gccagctcgt cggcgagctg gccggtgccg tcggggctgc tgtcgtcgac gaccagcacg 2297161 tgcacggcgg ggcatgcttg cgtcagccgc cggtggatca ccggaaggtt ctcccgctcg 2297221 ttgaacgtag gaatgatcac caggacgcgc tggctgggac ggttacccgg ggctgggggc 2297281 gccggctggc cggtggtcat gtaactcctc gatgttgctc tgtgtcgtcc gaaaccggat 2297341 gagtgtcggc cgccctgctc gggctgaatg agttcgtcgt cggattcact cagggccggc 2297401 ggaccggagg cctcagatct gcccgggggc gcatcggaat cgtcattttc gccctttggc 2297461 tccgagcgcc tcggacgcgg gaaccaccca ttctgccgca tggcgacgag aacgaccgct 2297521 gcggctgccc cgacgagaat ccattgcagg attggacccc atcgagttgc cggtgtcagc 2297581 ctcgtcttga ggcgcacctg gctgtccagg tatgcgggct ggaaaaagtc ggtccggatc 2297641 agctcacccc cgtctggtgc tatcaccgca ctgatcccag tggtaccggc aaccaccacg 2297701 tatctgtcgt gctcgacggc ccgtaccttg gcgaatgcca gctgctgttc gctcattgtc 2297761 ttgttgaagg tggcgttgtt gctgggcacg gtcaacagct gcgcgccgcc cagaatcgac 2297821 ttccgcgggg cgcggtcgaa gatcacctcc cagcaggtag ccaccccgac cgggacccca 2297881 gcgatgcgca ccacaccggt gccgttgccg ggcacgaagt ggccggcgcg gtcggcgtag 2297941 ccggagaggt gccgaaacag ccacggcatg ggcaggtact cgccgaaggg ctgcacgatt 2298001 gccttgtcgt ggcggtcggc cggcccggtg ccgggattcc agacaatggc cgtattggtc 2298061 cactccggat tttcacgagg acggcccgga acatccatca gggtgccgat caggatcggc 2298121 gcgccgatcg cttcggccgc tgcggagatc cgttgaccgg cgtcggggtt gacgaacggg 2298181 tcgatgtccg acgagttctc cggccagatg acgaactggg gttgctgcgc cagccccgca 2298241 tgcacgtcgg cggccagccg caacgtctcc tcaacgtggt tgtctagcac cgcccgacgt 2298301 tgcgcattga agtcgagacc gagccggggc acattgccct ggaccaccgc gacggtgacc 2298361 gtgggttcgc cgcccgatcc gctacccgca tgccgcacct gcggccagac gacgatggcg 2298421 gcgaacaaga ccaggcatat gcacgcggcc ggcagcacca ccgccggcgg cgcatccccc 2298481 tgaccaccgg ttcgccacca cttctcgatt tccagcgcga tcgcggtcaa gccgcatccg 2298541 accagcgcta cccccgttga cagcagcgcc acaccgccga gctggaccaa cggcaacagc 2298601 gggccttcgg cttgaccgaa ggcgaccgac ccccacggaa atccaccgaa cggaaggatc 2298661 gacttcaacc actcctgcgc cgcccacccc accgcgaacc agatcggcca acccggcaac 2298721 aggcgtacca cgacggcgaa cagaccgaag atgccgggga acagcgcgca cgtcgtcgcc 2298781 agtgccaacc agggcccggg gcccaccagc tcgccgatcc acggcaacaa cgagacgtag 2298841 aacaccaggc cgaatagcag gccgtagccc agcccaccca ccggtgtcgt cgcgcggtgg 2298901 gtcagcaccc aggccagcaa tgcgagcgca accaccgccg cccaccagca gttgcgcggc 2298961 gggaagctgg catacaacag cagaccggcc acgatgctga ccaccaggcg cgtcagccgc 2299021 gtccgcaccg cggtccgtgt ggtgggcagc tgcgctgcca cccaggcgcc aagcttcacc 2299081 aggcgccggc gggccgcggc gccgagccag gcagccgcgc tcggcgcgtc ggggccttcc 2299141 gccggctcgg ccgacagttc gatctctgga tcggcggggc tctccgggcc ggcctcggcg 2299201 acctcagcgg gccgcgcctt ccggccgaac cattccctag ccatagatga ccgcacctcg 2299261 atgcacggtt tggcggcaac gcggcaaggc gtcggtcggg cccagccgcg gcaatgcggg 2299321 tacccgggag cgcgggtcgg tagaccagcg ctggactgcg tcgcgcggtg cgtcgacgtc 2299381 aaagtccccg gcgtcccata tcgcgtagga cgcgggcgcg ccgggcacca gggtgccgat 2299441 ccggccgtct cgaacaccac cggcccgcca gccgccgcgg gtcgcggcag caaacgccgc 2299501 ccgcgccgat accccgctgc ccggcgtgcg gtgattgacc gccgcgcgca cgctggccca 2299561 gggatcaaag cccgtgacgg gcgcgtcgga gccaagcgcg aggggcacgc cttgggatgc 2299621 taacagcgcc agcgggttga gttcgctgcc tcgctgggcg cccaggcggc gagcgtacat 2299681 gccgtcgcca ccgccccaca gctcatcgaa gttgggctgc acactggcga tgacccccca 2299741 agcgcccagc ttcgcggcct ggtccgcggt gaccatctcc acatgctcga ggcggtggcc 2299801 gcagcgggcg acggcaacca cgccgagatc tgccaccacc cgttcgaagg cggcgactgc 2299861 ggccgacacc gcagcgtcgc cgatgacgtg gaagccggcg gtcacttcgg ccttggtgca 2299921 tgctcgtacg tgcgcttcga tgccgtctac gtcaaggtgg caggtgccga tgcagtcggg 2299981 ggcgtccgcg tagggctcgt gcagccaggc ggtgcgcgac ccgagcgccc cgtcgacgaa 2300041 caaatcaccg gccagccctc gagccccggt ctcggtcacc aggtcacggg cctgggccgg 2300101 cgtggccacg gcctcacccc agtacccgat cacctcgact ccgtgctcga gtgcacgcag 2300161 ccgcaaccag tcgtcgagcc cgccgatttc cggaccggcg cattcgtgca cggcgacgac 2300221 gccggccgcg gctatggcct gcagcgccac ggcccgggcg tcggcaagct ggacgtcggt 2300281 caagaggtag cgtgcggcgg cccgggctag gtggtgggca tcaccggtca gcggccgctg 2300341 ggccgtgtaa ccggttgccg ccgccagctc ggggaccagc cgccgcagtc cggaggagac 2300401 caacgcggag tgcgagtcga tcctggccag gtaggcggga cagtcaccga gaaccgcgtc 2300461 taggtcggcg gtgctgggcg cagcattctc cggccaggcc gactcatccc aaccgtgacc 2300521 ccacagcggc tgacccggat ggtcggccgc atagtcggcg accatccgta ggcactgcgc 2300581 gcgtgaggtc gcgggccgca agtccagccc gctgagcatc agaccggtcg cggtcaggtg 2300641 gatgtggctg tccacgaacc ccggcgccac gaatcggccg tcgagatcct gcacgtcagc 2300701 gtctgggaac tggtcgcggc cgacgtcgtc gctgcccaac caggcgacga catcgccgcg 2300761 caccgccatc gcggtggctt cggggtgggt ggggctgtac acccggccgt tgaccaggag 2300821 tttgacggga atctggctca caccgctaat tcgaccccgg cgatggaggt tctgcggcta 2300881 cccgaggggg ctgaagggtc aacggctcga catctatgac gtcgatgacc tcgccatcaa 2300941 taaagtccgg gtcggtgccg ctctcgccga aggccccggc catgttggcc gccgcatcgg 2301001 ccgtcagtgg cacgttccgc aggaaaccgc gcacggcgat cgcggtcagc ccgggtcgag 2301061 cgagcgcccg gatcggcggc accagcagca acagccccat cgtcgtggtg accagaccag 2301121 gaacaagcac caagaccgag gcaacggtga ccagcgcgcc gtcactcagt gcgcttcgtg 2301181 gttccgccaa gccggatcgc aaccacagga gccgtcggcc gagctgccag ccaccgagcg 2301241 gcgccagcag accgaacccg aggacgaacg tcgccagcaa caccagcaaa gtccagccaa 2301301 acccgatcgt cgccgccagc gcgaaaacca ccgcgagctc gacgacggcg tagctgagca 2301361 gcagccgcga caccacgtga cgccaacgtc tgcgggctag gcccgagttc ctcgggggcg 2301421 gacatcgagg ctgcagttag atgacgctat gacaacgata gagatcgacg ctcccgccgg 2301481 acccattgat gcgctgctgg gccttccccc cggccagggc ccgtggccgg gtgtggtggt 2301541 ggtgcacgac gcggtcgggt atgtccccga caataagttg atttccgagc gtatcgcccg 2301601 ggcaggctat gtggtgctca ccccgaacat gtacgcccga ggcggccgcg cccgatgtat 2301661 cacccgagtc tttcgcgagc tgttaacgaa gcggggccgc gcgctcgatg acatcctggc 2301721 cgcccgcgat cacctgctgg ccatgccaga atgctccggt cgggttggca ttgtgggctt 2301781 ttgcatgggc ggtcagtttg cgcttgtctt gtcgcccaga ggttttggcg ccaccgcgcc 2301841 cttttacggc actccactgc cgcgccacct cagcgagacg ctaaacgggg catgcccgat 2301901 cgtcgccagc ttcggcaccc gcgacccgct gggtatcggc gcagccaatc gactacgtaa 2301961 agtgaccgcg gccaaaaaca tccccgccga tatcaagtcc tacccgggcg ccgggcacag 2302021 cttcgcgaac aaactgcccg gtcagccgct ggtgcgcatc gcgggattcg gctacaacga 2302081 ggccgcgacc gaagacgcgt ggcgtcgggt ctttgagttc ttcggccagc acttgcgcgc 2302141 cggctcgcct ggtgagcctt aggtacgact tcgactcccc gcggatgccg atgaccttgt 2302201 cccgtcggag ggcggcgggg ctgtcatgtc cgcgtgcacc ccgaaggcga gatgaacatg 2302261 attgtcatca tgaagtagtg ggccacagct gcgggtgtca gctggcgaaa aatgcgcgcg 2302321 gcgccctctt cgttgcctga cgtgtgcggc gcgccgacat gggtttggcg agcatggcct 2302381 cggtaagttc cccggcttgc cggatgcggg tcatgggcac agtgcagcgc gtcgctgcct 2302441 gtcctggccc gggtagggca gcagcgccat ctcgcgggcg ttcttgatcg cctgggcgac 2302501 ttggcgttgc tgctggactg tcaggccggt cactccccgg gagcgaatct tgcctcggtc 2302561 agagatgaac acccgcaatg ttgcggtgtc tttgtaatcg acgctctcga cgccgaggct 2302621 atcgagcagg tttttcttcg ccttcgtcgg gccctttcgc gcggatttgg cggccatcta 2302681 ccagctggcc ttccggacac cgggcaggtg tccgtcgtgg gccagttggc ggacccgcac 2302741 acgggagagc ccgaatttgc ggagatgtcc gcgcggccgg ccgtcgatgg cgtcgcggtt 2302801 gcgtaaccgc acgggactgg cgtcgcgggg ctggcgggca agggctcgct gggcggtact 2302861 gcgctgttcg ggggcgctcg atggggatcg gatgatgtct ttgagcgcgg tgcgacgcga 2302921 tgcgtaacgg gcgacggtgg ccgcccgccg ctgattcttg acgatcttgg acttcttggc 2302981 cacgtcagcg ttcctcgcga aagtccacgt gacgccgcag gatcgggtcg tatttgcgca 2303041 agatgagacg gtcggggtca ttacggcggt tcttgcgggt ggtgtaggtg tagccggtgc 2303101 ccgccgtgga acgcagcttc acaatcggcc ggatgtcggt gcgcgccatc agatccgctg 2303161 cccctggcga cgcaggcggg ccacgaccgc ttcgataccg tcgcggtcga tgacctttat 2303221 acccttcgtg gacacccgca gccgaatgcg acggccctcg gagggcaggt aatacgttcg 2303281 ttgctgaatg ttgggcgacc atcgccgacg gcttcggcga tgggagtgcg agacggtgtt 2303341 tccaaatccc ggcttgcggc cggtgacttg gcagtgggcg gacaaggggc acccttcctt 2303401 cgaagctcgg cttattgaaa atcattttcg acaacagcta ggtggcactg taccgtcgac 2303461 gtcgcaataa tgaaaactgt tatcgataag gaggacggtg gccaccccgg tgatccttgt 2303521 caccggacac gagggcaccg ccgccgtgac cgctgacctg ctgggcctgc tcaccgatca 2303581 cggcactgcg acacttcggt cagtggcacc aggatccgtg cggcgagccg atccccgccc 2303641 acggtgtcac cgccgagaac aacgacgacg acaccgggca tccatgaaat ccgccatcca 2303701 tcccgaccac cacccccgtc gtcttccacg gtgcccggtc ctccgccgcg accaagttgt 2303761 actggaaatg attgtcatta cgatggtcgg gcggccgagc gggccgggcg aaaggaaatg 2303821 ggatgtgtgg ggcagcgtgg cacgcgcggt caccggcggg catgtacccg tcaaatccat 2303881 cctcaccggc gcccatgccg acccgcattc gtaccaggcc agccccgcgg acgccgccgc 2303941 gatcgtcgac gcggagctgg tgatttacaa cggcggcggg tacgacccgt gggtcgacca 2304001 ggtgttggcc ggccatcctg gtgtccaggc ggtcgatgcc tactcgctgc tcggcgccgt 2304061 gggcgacgac gacgcgccca acgaacacgt cttctacgac cccaatgtcg ccaaggcggt 2304121 cgcggcaacg atcgccgacc ggttggcgga cctcgacccg tccaattccg ggaactatcg 2304181 agcgaacgcc gccgagttca gccgcggcgc cgacgcaatc gcaatttccg aacacgcgat 2304241 cgccaccacc tatcccgacg ccgcggtcat cgcgaccgaa cccgtcgtgc actacctgct 2304301 ggcggcagcc ggcctgaaaa atcgaacccc ggctaccttc atcgcggcca acgaaaacgg 2304361 caacgacccc accccggccg atatggcggc cgtgctcgac atgatcgccg gccgtgaggt 2304421 cgcggcgttg ctggttaacc cgcagacacc taccgcggcg accgacgaac tgcaggtggc 2304481 cgcccggcgg gcaggagtgc caatcaccga gttgaccgag accttgccca gcggaaccga 2304541 ccgggaccag ttttgcgctg ctgaccggcc agatcgtcgg ggtcggtcac tccgggctga 2304601 ccatgctgac cgtggtttgt ctgctcgtgg tcaccgtgtt ggcgatctgc taccgaccgc 2304661 tcttgtttgc caccgtcgat ccggaggtcg cggccgcccg cggcgtgcca gtgcgcgccc 2304721 tgggaattgt gttcgccgca ctgatgggcg tggtagccgc ccaggctgtc cagatcgtcg 2304781 gggcactcct cgtgatgtct ttgctgatca cccccgccgc ggcggccgcc cgggtcgtgg 2304841 ttgccccggt cgccgcgatc gcgacctcgg tggtcttcgc cgaggtttcc gccgtcggcg 2304901 gcatcctgct gtcgctggcg cctggagtcc cggtgtcggt gttcgtggcc accatctcgt 2304961 ttgtgatcta cctgatttgc tggttgctcc ggcggcgccg ctaactagcc ggtctcgctt 2305021 tcggccactt tgagctctag gccaatgttg ttccgcatgc cgccgcgcag cttactgacg 2305081 aaggtgaaca gcttgccctg gatgccgtag cgcttgacga tcgcgtcgta gacggcgccc 2305141 gtttgggatt cgtcgaggat ggccgcggtg gcttcgacgg cctcgctggt cggccggccg 2305201 cgcaaggtgc aggtcgccag cgtcacccgc ggcgtgttgc ggatccgctt gaccttccac 2305261 gatttcttct cggtgatgac cagcagtcga tccccgcggt cggtgtccaa ggcggcccag 2305321 atgggaaccg gcttgggccg gccgtccttg gtgaaggtgg tcagcagcag gtactgcgcc 2305381 tcggcaaggt cagaaaaggt aggggtcacg ggtgccaacc taccgcgcga gcagacgcag 2305441 aatcgcactg cgcggggtcc cgcgcatgcg attctgcgtc tgctcgccgt actcaggctt 2305501 ccaggtcgcc ctcggtttcc agcagcacct ggcgcaaccc gtccagggtt tccggtgccg 2305561 gctgtgccca caggccgcga ccggccgctt ccaacagccg ttcggccatg ccgtgcagcg 2305621 cccacgggtt ggactcggtc atgaacgtgc ggttctgcgc gtccaggacg taacgctgcg 2305681 tgagctgctc gtacatccag tccgccatca ccccggcggt ggcgtcataa ccgaacagat 2305741 agtcgacggt ggccgccatc tcgaatgcgc ccttgtagcc gtgccggcgc atcgcggcca 2305801 tccacctcgg attgaccacg cgggcgcgaa acacccgcgt ggtctcctcc gacagcgtgc 2305861 gggtgcggat cgcgtcgggt cgggtgttgt cgccgatata ggcggccggt gcttggcccg 2305921 tgagcgcccg cacggtggcc accatgccgc cgtgatactg gaagtagtcg tcggagtcgg 2305981 cgatgtcgtg ttcacgggtg tcggtattct tggcggccac cgcaatacgc cggtactggc 2306041 ggttcatgtc gtcgatcgcc tcgcggccat ccaggtcgcg cccgtaggcg aatccgcccc 2306101 aggcggtgta cacctgggcg aggtcggcgt cgtcgcgcca gctgcggctg tcgatcagct 2306161 gcagcagccc ggcgccgtag gttcccggtt tggatccgaa aatccttgtg gtggctcgcc 2306221 gttgatctcc gtggtgggcc agatccgctt gggcgtgcgc gcgcacgtag ttgtcctcgg 2306281 cggcctcgtc gaggtcggcg accaaccgca ccgcgtcatc gagcatggtc accacatgcg 2306341 ggaaggcatc acggaaaaag ccggagatcc gtaccgtcac gtcgatgcgc gggcggccca 2306401 gctcggccgg ctgcatgggc gccaggtcga tgacccgccg cgaggcgtcg tcccataccg 2306461 gccgaacccc cagcagcgca agcacttcgg cgatgtcgtc gccggccgtg cgcatcgccg 2306521 aggtgcccca caccgacagc cccaccgacc gcggccaccg cccatgctca tcgcggtagc 2306581 gcgccagcag cgaatcggcc agtgccacac cggcttccca cgccagccgg gacggcaccg 2306641 ccttgggatc cacggagtag aagttgcgcc cggtgggtag cacgttgacc aggccgcgca 2306701 gcggcgaccc cgacggcccg gccgggatga accggccgtc caaagctctt agcacctgct 2306761 cgatttcggt tgcggtgcca gccaaccggg gtatcacttc ggtggcggcg aaccgcagca 2306821 ccgcggcggc gtcggcgttg ccggtgagtc ggtcggcggc ggaggggtcc cagccggtgg 2306881 cctgcagggc cgcgaccagt tcgcgggctt tcgcctcggt ctggtcgact gtcgcgcgtt 2306941 cgtcggtgcc atcctcggcc aggccgagtg cctgccgcag gccggggatc gcgtgcgcgc 2307001 cgccgaacag ctggcgggcc cgcaagatgg ccagcaccag gtcgagttct tgctcccccg 2307061 ttgggttttg cccgaggatg tgcagcccgt cgcggatctg gacgtccttg atctcgcaca 2307121 gccagccgtc gacgtgtagc agcatgtcgt cgaacgagtc ctcttccggg cgttcggtca 2307181 gtcccaggtc gtggtccatc ttggcggcgc ggatcagcgt ccagatctgc tggcggatgg 2307241 cgggcagctt gccgggatcc agcgcggcga cgctggcatg ctcgtcgagc aactgttcca 2307301 aacgcgcgat gtcgccgtag gtttcggcgc gggccatcgg aggaatcaaa tggtcgacta 2307361 gcaccgcgtg cgcgcgccgc ttggcctggg tgccctcgcc ggggtcgtta accagaaacg 2307421 ggtagatcag cggcagatcg cccagcgcgg cgtcgggtcc gcaggacgcc gacatgccca 2307481 gcgtctttcc cggcaaccat tccaggttgc cgtgcttgcc caaatgcacc acggcgtgcg 2307541 ccccgaaacc gttcgagaat ccggtatcga gccagcggta ggcggccagg tagtggtggc 2307601 tgggcggcag gtccgggtcg tggtagatcg ccaccgggtt ctccccgaag ccgcgcggcg 2307661 gctgaaccat gagcaccagg ttgcccgctc gcagtgcggc gatgacgatc tcgccgtccg 2307721 ggtcgtggct acggtcgacg aacagctcac cgggtggcgg gccccagtac gctgttacca 2307781 cgtctgtcag ttcggcgggc agggtggcga accagtcccg atactccttg gccgacaccc 2307841 ggatggggtt gccggccagc tggccttcgg tgagccagtc ggggtcgtgt ccgccgcatt 2307901 cgatcaacgc gtgaatcagc gcgtcgccgt cgtttgattc gacacccggc agatcaccca 2307961 cccgatatcc gcgctgccgc atcgcttgca gcaaggccac cgcgctggcc ggggtgtcca 2308021 ggcccaccgc gttgccgatg cgggcgtgtt tggtcgggta ggccgagaag accagggcca 2308081 cccgcttgtc ggcgggggcg acctggcgca gccgtgcgtg ccggaccgcc aggcccgcga 2308141 cccgggcgca gcgctccggg tcggccacat aggagatcag cccgtcgtcg tcaatctcct 2308201 tgaacgagaa cggaaccgtg atgatgcggc cgtcgaactc gggcaccgcc acctggctgg 2308261 ccacgtccag cggcgacagg ccgtcgtcgt tggcgcacca ctgatcccgc gggctagtca 2308321 aacacaggcc ttgcaggatc gggatgtcca gcgccgccag gtgctcaacg ttccagctgt 2308381 catcgtcgcc gccggccgag gcggcggccg gcttgactcc cccggcggcc agcacggtga 2308441 ccaccatggc gtcggcgccg ccgagccttt ccagcagccg cggctcggcg gtgcgcagcg 2308501 acgcgcagta gagcggcagc gggcgtccgc cggcgtcttc gatcgcccgg cacagcgcct 2308561 cgacgtagcc ggtgttgccg gccaggtgct gggcacggta gtagagcacc gcgatcgtcg 2308621 ggccggtctt gccggcgtcc ggacgctcca gcacccccca ggtcggggtg gcgaccggcg 2308681 gcgtgaaccc gaagccggtc atcagcacgg tgtcgcacag gaaggcgtgc aactcgcgca 2308741 ggttgtcgac gccgccgtgg gccaggtaga tgtgggcctg cagcgcggtg ccggccgcga 2308801 ccgtggagcg gtcggtcaac tcggcatcgg cggcctgctc tccgctgacc agtgcggccg 2308861 gtaccccgcc ggcgatcacc gtgtcgattc cgctctgcca ggcgcggtag ccgccgagaa 2308921 tccggatcac cacgatcgac gcttcggcca gcaggtcggt cagttccagg tcagacagcc 2308981 gcgagggatt cgcccaccgg tagttcttgc cgctggaccg ggcgctaatc aggtcggtgt 2309041 cggacgtcga caacagcaga acggtcggtt ccggcaccaa ttcttcttac cggagcagga 2309101 ctcgagcggt ggcgtcgggc ccgcgagctt tgtagccacg cctagactac aaacatgtct 2309161 acatccacga cgattagggt ttcaacccag actcgggatc gtctggccgc ccaagcccgc 2309221 gaacggggaa tctcgatgtc ggctctgctc accgaactgg ccgcccaggc cgagcgccag 2309281 gcaatcttcc gcgccgaacg cgaggcctcg cacgccgaga cgaccaccca ggcagtccgc 2309341 gacgaggacc gcgagtggga gggcacggta ggcgacggcc ttggctgagc cacggcgagg 2309401 agacctttgg ctggtcagcc tcggcgccgc tcgcgcgggt gagcccggca agcatcggcc 2309461 cgcggtggtc gtttccgtgg acgagctact caccggaatc gacgacgaac tcgttgtcgt 2309521 cgtgccggtg tcaagctcgc gctcccgcac cccactccgg ccacctgtcg cgccctcaga 2309581 aggtgtagct gccgatagcg tcgcggtgtg ccgcggcgtc cgcgcggtcg ctcgtgcccg 2309641 actcgtggag cgactcggcg ccctcaaacc cgccacgatg cgcgcaatcg aaaacgccct 2309701 gaccctgatc ctcggcctcc cgacgggacc tgagcgcggc gaggcggcga cccattctcc 2309761 cgtacggtgg acgggtggcc gggacccgtg acgcggacgc ctgccccggt gcgttgcggc 2309821 cgcaccaggc cgccgacggg gcgctggcgc ggatccggct gcccggcggg atgatcaccg 2309881 cggcacaact ggcgacgctg gccagcgtcg ccagcgactt cggctccgcg acactggaac 2309941 tgaccgcgcg cggcaatgtc cagttgcgcg ggatccgcga cgtggcagcg gtcgcggacg 2310001 cggtcgccaa agccgggctg ctgccgtcgg caacacacga gcgggtgcgc aatatcgtcg 2310061 cctcgccgct gtccggccgg gccggcgggc tagccgacgt gcgggcatgg gtcggtgagc 2310121 tcgacgcggc gatccgcgcc gagccccggc tggcggaact gggcggccgg ttctggttcg 2310181 gtctcgacga cggccgcgcc gacgtgtccg gcctgggtgc cgacgtcggc gtgcaggtgt 2310241 tccccgacgg tccccgactg ctgttgaccg gacgtgacac cggcgtgcgg gtggccgatg 2310301 tcgccgagac cctgatcgag gtcgcgttgc gtttcgtcaa gatccgcgaa accgcctggc 2310361 gagtaacgga attagccgat atcggcgagc tgcagtccgg tgtcgagctg ggcccatccg 2310421 ttcggcccgt caccaaaacg cccgtcggct ggatacccca ggatgacagc cgggtaacgc 2310481 tgggcgccgc ggtgccgctg ggggtcttgc ccgcccgggt cgcggaatgc ctggccgcga 2310541 tcgaggcccc gctggtgatc acgccgtggc gatcggtgct gatctgcgac ctcgacgacg 2310601 cgacggccga cgccgcgctg cgggtgctgg cgccgctggg cctggtgttc gacgagaact 2310661 ccccctggct gaacatcagc gcctgcaccg gcagccccgg ctgcgcgcac tcggccgccg 2310721 acgtacgggc cgacgccgcg cggtcactga acgtggagtc agccgggcat cggcatttcg 2310781 tcggctgcga gcgggcctgc ggcagcccac cggccggcga ggtgctggtc gccaccggcg 2310841 gtggataccg gcgattgcgg ccgtagggtg agcgagtgct cgactaccta cgcgacgccg 2310901 cggaaatcta ccggcggtca ttcgcggtta tccgcgccga ggccgatctg gcgcgcttcc 2310961 ccgccgacgt cgcgcgggtg gtggttcggt tgattcacac ctgcgggcag gtcgacgtcg 2311021 ccgagcatgt ggcctacacc gacgacgtcg tcgcgcgggc gggtgccgcg ctggccgccg 2311081 gtgccccggt gctgtgcgat tcgtcgatgg tggccgccgg gatcaccacc tcgcggctgc 2311141 ccgccgacaa ccagatcgtc tcgctggtcg ccgatccacg cgccaccgag ctggccgccc 2311201 gtcgccagac cacccgatcg gcggccgggg tcgagctgtg tgccgagcgg ctgcccggcg 2311261 cggtgctggc cataggcaac gcgcccaccg cgctgtttcg gctgctcgaa ctggtcgacg 2311321 aaggggcacc cccaccggcg gccgtgctgg gcggaccggt gggtttcgtc ggatcggcac 2311381 aggccaaaga ggagctcatc gagcggcccc gcgggatgtc ctacctggtg gtgcgcggtc 2311441 gccgcggcgg cagcgcgatg gccgccgccg ccgtcaatgc gatagccagc gaccgcgaat 2311501 gagcgctcgg ggcacgctgt ggggagtcgg gctggggccc ggcgatccgg agttggtgac 2311561 cgtcaaggcc gcccgggtga ttggcgaggc cgatgtggtg gcctatcaca gcgccccaca 2311621 cggtcacagc atcgcccgcg gcatcgccga accgtatctg cggcccggtc agctcgagga 2311681 gcacctggtc tacccggtga ccaccgaggc cacgaatcat cccggcggct acgccggtgc 2311741 gctcgaagac ttctacgccg acgcgaccga gcgcatcgcc acgcacctgg acgccgggcg 2311801 caacgtggcg ctgctcgccg aaggcgaccc gttgttctac agctcctaca tgcatctgca 2311861 cacccggctg acgcggcggt tcaacgccgt catcgtgccc ggtgtgacgt cggtgagcgc 2311921 cgcgtcggcg gccgtggcca caccgctggt ggccggcgac caggtgttgt cggtgctgcc 2311981 gggcacgctg ccggtcggcg agctgacccg ccggctggcc gacgccgacg cggccgtggt 2312041 ggtcaagctg ggccgttcgt atcacaatgt gcgggaggcg ctttcggcgt ccggcctact 2312101 cggcgacgcg ttctacgtgg agcgggccag caccgccggc caacgggtat tgccggccgc 2312161 cgacgtcgac gagaccagcg tgccgtactt ctcgctggcc atgttgccgg gcgggcggcg 2312221 tcgtgcgttg ctgaccggca ccgtcgcagt ggtgggcctg gggcccggcg acagcgactg 2312281 gatgacaccg cagagccggc gtgagctggc cgccgcgacg gatctgatcg gctatcgcgg 2312341 ctacctggac cgggtcgaag tccgcgacgg ccagcggcgc catcccagcg acaacaccga 2312401 cgaacccgcc cgggcgcggc tggcctgctc gctggccgat cagggccggg cggtggcggt 2312461 ggtgtcctcc ggcgacccag gggtattcgc gatggccacc gccgttttgg aggaagccga 2312521 gcagtggccg ggggtgcggg tccgggtgat tccggcgatg accgccgccc aggccgtcgc 2312581 cagccgggtc ggcgcgccgc tgggacatga ctacgcggtg atctcgttgt ccgaccggct 2312641 caaaccctgg gacgtgatcg ccgcgcgcct gaccgccgcg gccgccgccg acctggtgct 2312701 ggccatctac aacccggctt cggtgacccg cacctggcag gtcggcgcga tgcgcgagct 2312761 gctgctggcc catcgcgacc ctggcatacc ggtggtgatc ggccgcaacg tctccggacc 2312821 ggtttccgga ccgaatgagg acgttcgggt ggtgaagttg gccgacctga accccgccga 2312881 aatcgacatg cgctgcctat tgatcgtggg gtcctcgcag acccggtggt attcggtgga 2312941 ttcgcaggac cgggtgttca ccccgcgccg ctatcccgag gcgggcagag ctaccgcgac 2313001 aaagtcgagc cgccacagcg actgaaagag cttgcggccg aattcctcaa ggtcggccag 2313061 gctgcctccg gaaggctcgc cagttcgcgc cacgcacccg gcaatctccc gaatcgtgcg 2313121 gcgaccgtca acctgctgca gaaaggccaa ctgggcgggg ctgggcgcca tgcgccaacc 2313181 cggccaaaac atatcggtgc cggagacacc gcaacgcgtg cgcatcagcg gtacgtaatc 2313241 gagcgcggca accgtcgaaa aatcgatcgt gtactgctcc ttgggtcggt cacgacggca 2313301 cgccataaag agatgggtag cgttcaaggt ctccagacgt tccatcacgg accaggcctt 2313361 gacctcgggt aacgtgttca cggccgcata aaactcgctg ttcgggacga aaaaatcgtg 2313421 cgggtaatac ggcgccttgt ggaaccatcc ctgaaatacc agtccggcgg acgtgaccag 2313481 atcgacgcat tcctcgacgg tgtaactgcg ttggcgacca tgcaagaacg tatcgacgag 2313541 ggcgctatcg gaaagtaaat cccgagcttt cgtgagatag tttcggagcg gatgatacgt 2313601 cggtagtaac gagattgctt ccttcgccaa tttgatcgat gcatcgtcct gccctaatcc 2313661 aagatcacga aagaccgaac cgagcagttc gactccgatc cgaccgtact tcccgtagag 2313721 catcgccgcc acgacgccat cccggcgcag gcagtgggcg agttctttca tgcccgcccg 2313781 cggatctgcc aggtgatgta aaacgccggt cgataccacg aggtcgaagt cgcgtcccag 2313841 cgtcgccagc tcttcgatcg gaagcagatg caactccaga ttcgccagcc cgtgcttgtc 2313901 tttcagatat tgctgatggt ccagtgccgg tcgactgata tcgatcgcca ctactttcgc 2313961 cgcacgattg gtgaatgcga aaatcgccgc ctggttggtt ccgcaaccgg cgatcagaat 2314021 atccagatcg ggccggtatt cgcggtccgg ccataatatc cggtgggagt gcaccgggtc 2314081 gaaccattcc caattcgctg tggtccacgc ctcaagatcg gcgatcgggt gcgggtacaa 2314141 ccaccggtgg tactgccggg acacaatgtc ggcgcgcgga tgatcgtcgg tcacttcggt 2314201 cccacgagcc tatgcaagca caccggcaac gcacgtcgcc gcctcggcga gcagcgcctc 2314261 acggggctcg gcgtcatacc cgccgccggc acgatcggac atgacggcca ccacgtaggg 2314321 cacgccggtc ggtgaccaca cgaccgcgat gtcgtttgct cgtccgtagt caccggtccc 2314381 ggtcttgtcg atcaccttcc aatcggcggg aaagcccgct cggatccgct tggctccggt 2314441 ggtgttgcgc gccatccaat cggtgagcag tgcccgcttg tcgggcggca acgcgttgcc 2314501 gagaacaagc tgctgcaaca ccagggcgat ggcgtgcggt gttgtggtat cccgttcgtc 2314561 cccgggcgga tcgcggttca actccggttc ctcggcgtcc aaccggctca cggtgtcacc 2314621 caagctgcgg aggtagccgg taaatgccgc ggtgccgccc ccgggaccgc caagatcggc 2314681 cagcaacagg ttggcggcgg tgccgtcgct atagcgtatc gccgcatcgc aaagctgccc 2314741 gatcgtcatc ccggtctgaa cgtgttgttg ggccaccggg gagatcgacc gaatgtcgtc 2314801 actggtgtag gtgatcagtt tgtccagatg cgtgagcggg ttttggtgca gcaccgccgc 2314861 cacgagcggc gccttgaacg tggagcagaa tgcgaaccgc tcatcggcgc ggtattcgat 2314921 cgcggcggtg gtgccggtgg cgggcacata caccccaagc cgggcatcgt atctgcgctc 2314981 cagctcggcg aagcgatccg ccagatccgc tccggccggc aaggttgtcg atgccggacg 2315041 ggccccgctc gcatgccgtg cacaccccgt cacggaaacc agcattgcca tcgctaccag 2315101 cagttcgcga cgaccgaatc ctctgttgcg catgccgtag tatcacacgc gcgcagatgg 2315161 caggcgccaa agcgcattcg acgccgcgct cccccggctg ctcggcggcg ggatctacga 2315221 cgaccggtcg tagactgacc ggacctgccg ggctatggtt tatgcccatg accgcgacgg 2315281 caagcgacga cgaggccgtt accgcactcg ccttgtcggc ggccaagggg aacgggcggg 2315341 cccttgaggc gtttatcaaa gccacccagc aagacgtgtg gcggttcgtc gcctatctgt 2315401 ccgacgtggg cagtgcggac gatctcaccc aagagacatt cctacgagcg atcggcgcca 2315461 tcccgcggtt ttccgcacgc tccagcgccc gaacttggtt gctggccatc gcgcgccatg 2315521 tcgtcgccga tcacatccgc cacgtccgat cccggccccg caccacccgc ggcgcgcgtc 2315581 ccgaacatct catagacggc gaccgccatg cccgcggatt cgaagacctc gtcgaggtaa 2315641 ccacgatgat cgccgaccta accaccgacc aacgggaagc gctgctgctg acccagctgc 2315701 tcgggctgtc ctatgcggac gccgcggcgg tgtgcggctg cccggtgggc accatccgat 2315761 cccgtgtcgc tcgagcgcgc gatgcgctgc ttgccgacgc ggagcccgac gacctcaccg 2315821 gctaggcaga ccggccaccc acatggcggc ccggtggaca gaatcgaccg ccgctacccc 2315881 agccggcagc agcgggcgcg ctatcatgac caccgaaata cccagcgcag cagcggcatc 2315941 cagcttcgct cgggtcatct tgccaccgct gttcttggtg accaatgcgt cgatgcgctg 2316001 ctcacgcagc agtgcgaact catcgtggta accatatggc ccgcgagata gcaccagttt 2316061 gtgccgccgc ggcagggcgg tgccatcggg cgcggtaacc acgcggatca aaaaccacgc 2316121 gtcgctgttg gcgaaggccg caatacccga gcgtccggtg gtcaggaaca ctcgcgaata 2316181 accttgttca gcaacaacgt ctgcagcctc gatgtccgat accgcgatga tggcggtacc 2316241 gggatcccac ggcgggcgag ccagtaccag gtacgggagc ccgagctcac cgcacacctg 2316301 cgcggcgtgc gcggtgatgg ttaccgcgaa ggggtgggtg gcgtcgacga cggcatcgat 2316361 gcgctcctct cgcagccaac cgcgcagccc ctcgacaccg ccgaacccgc cgatgcgcac 2316421 cggaccgatc ggcagggcag ggttgggtac ccggccggcc agcgagctga cgatctcaac 2316481 gtgtgggttg caactctttc gccagcgcac ggccctcggc ggtgccgccg agcaacaaca 2316541 cccgggtcac tgtgcatacc gaccgtgccg tgccaccgaa tataggtagc tgtcggtaaa 2316601 gccctcagcg gtcagcacgt cgccaacaac gatcacggcg gtcctggtga tcttggcatc 2316661 gtgcatccgc gcggcgatat cggccaacgt gccgcgtagc gtccgctgtt gcggccaact 2316721 cgcgaaagcc accaccgcaa ccggcgtttc gggtcggtaa ccaccgtcta gcagtcgcgg 2316781 aacgatggcg tcgatctggg ctgcggccag gtgcaagacc agagtggcgc gggatcgggc 2316841 gagcgcggcc aggtcctcac cgggcggcat gggtgtggac agcgtcgcca cccgggtgag 2316901 cgtcaccgtc tgcgccacac ccggcacggt gagttcgcgc tttagcgccg ctgcggctgc 2316961 ggcaaaagcc ggtacgcccg gcacgatttc gtagccgatg cccagcgcgt cgagttcgcg 2317021 gcactgttcg gccagcgcgc tgtacagcga cgggtcgccg gaatgcagcc gggcaacgtc 2317081 gcggccgtcg gcgtcggcgt cggcaagttt gcgcacgatt tgttcgaggg tcagcggacc 2317141 ggtgtcgaca atcgtcgcgc cgggcggaca ctgcgccaac aggtcgtcgg gcatgatcga 2317201 acccgcatac aggcacaccg ggcatcgttg caggagccgt tggccgcgga cggtgattag 2317261 gtcggcggcg ccggggcccg ctccgatgaa atagaccgtc atcgcttggt caccgaccac 2317321 tgggtgaccg gcagctgtgg gcgccaaccg gtgaagccgc ccagcggttc gccgagatag 2317381 tgctggaatc gtcgtagctc gccaccgagg cgcgaatatg catgcgccag agcggcttcc 2317441 gattcgacgg tgacagcgtt ggcgaccaag ttcccgcctg cgggcaggct gtccaggcag 2317501 gcctcaagca ggcctggctg ggttacacca ccgccaagaa aaatcaccga cggccgtgcg 2317561 gcgtcgtcga acgcatcggg cgcgtcgccg cgcacgtcga cgctcacccc gaaggccgcg 2317621 gcattgaacc caatgttgcg gcggcgccgt tcgtcgcgct cgaacgccac cgcggtgcag 2317681 cccggccagc tccgacacca ctggaccgcg atggcgcctg agcccgcgcc gacgtcccat 2317741 aaccgctgcc cgggccttgg cgccagcgca gccagggtca gcacgcggat tgggtgtttg 2317801 gtgatctgcc cgtcgtgcgc gaatgcctcg tcgggtgccc acgacgtgcg ctcgtcgggc 2317861 aggtagcgca cggcgatcac gttgagctca tcgacatcga ggggtgggtc gcaggcccat 2317921 gcccgggccg taccgtcgcg gcggcgttcg gccgggccgc caagctgttc gagcacgctg 2317981 aacttggagt caccgcgacc gtgctcggtc agcagcaccg ccagcgcctg cggggtggac 2318041 cgatcgccgg acagcacgat ggcccggccg ccgcggcgca ccgcggtgtg tggttgcgcg 2318101 gtgaccaggc tgatcacctc ggtgtcatac acgttccagc ccatccgggc gcacgccaac 2318161 gtcaccgcgg acacgtgcgg caaacacggt cacgttgtcg tggccgaaca gccggatcag 2318221 ggtggagccg ataccatgca acaacgggtc gccgctggca accacgtgta ggtcagcccc 2318281 atccggtgac aggccttgca ccgcgggcag catcggcgtc ggccactccc agcgctcggc 2318341 ggtgacggta tcgtcgagca gggcaagttg ccgtttcgag ccgtaaatta ctgtggccca 2318401 cgccgggccg ggttgggacg gtagataagg cttgttccgt ccgcacgccg caacacttgg 2318461 tcgagggtag ccaccaccga ctcatacgcc gacgcgttct tcagctggtc ctccaggtag 2318521 agcaggatga cctcctcggt atgcccgggt gcgttcaacc agttggcgat ctgcggcagc 2318581 actgtggcca gcagaggttc gacggtgcag cctaggttcg cgttcttcgg tcccagcccg 2318641 tgacacacgg tgacgccggg ggcgccgtgg ccctcgaggc ggggcaagta gtgcaggtct 2318701 agctcgagcg cgcggacgtc gatgtcgagc tgttgggcca acgacagctg ctggtttgag 2318761 tctgcgtgcg agaccgtgaa cgaatcgctg aggctgttga acgagttgtg cgtgccgagc 2318821 cactgagttt cccgcagcgg caccgggtct tgcaacgcat cctggaaccg cgcggtgcga 2318881 tgcacccaag actgtaggta ggcatcacgc gcggcctggg tcacccggtg cgcgagcgga 2318941 agcacgcacc gcgcatcggg cacaccgacg cggcgacact ccgcagcgac cgcgtcggcg 2319001 aacttgccga gcgccacgca ggggatcgca accgggctta ttacgtcaca ggatgcggtg 2319061 ggcgagggcg gagcgggcac ctggtaggca tcggcggcca ccggtgccgc ggttatcaac 2319121 accacggcca aggcgcccat gagggccgcg ctctgcagcc atcgggcgcg gggcatgcgc 2319181 tactttggca cgtcgataca ccgcttacca ggggtgttgt cgaagtgttg cgtggtctcg 2319241 tcgaagccgt cacgtaactc caagccgccc cgcgtcgatg acgagacact agggctgcga 2319301 ccgccagggc cgtgtagacg ttgctctaca aggtcaccgg tcctggtcag aacttatccg 2319361 acggctcctg cgcattttcc cgtacacaac cgcggggatg aggaccagca acccgagact 2319421 ccagatatcc caccacagtg accccttcac ggcattggcg attgcactga tggccagcag 2319481 aacccaggcg acacccataa aggcgaaata gcacccgccc ctgagccgtc ccgctggccg 2319541 aggccacagg gagcctgcga caccgccgat gaggcagaca accacgacgg caacgctgaa 2319601 gacgacaacg ggagtcgcgc tacttggtgg cacagttgac caccgccgct cccgatccgc 2319661 caacccccag taaggcggcg ccaagtagtg cccagtcggc agggccggcc ggaattgcta 2319721 gggcggtccc aactacgcca gccgatgatc ctgcgaaacc cgcaacagcc gccgtccact 2319781 ccgcgctgtt acatggccgg tgcagagctt gaaacgccag ccggcgcgcc tccaccgcga 2319841 tggggtcatc cggcggccta gcggccagct cgttgaccat gttgtccacc caccgtttgg 2319901 tcgcgtctga tacgggtgcc tctgtcccgg ccggtggcag catcattgaa gtgatcggat 2319961 cggagtacgg tccgtcggct ccaccggacg ggtgtggcgc tcctggcggt ggaggtgttg 2320021 ggccgtcttg tttgaagaag tcgactagct gaaccgcgct gtgggcaaca accggggcgc 2320081 ccgcgaagcg gacatttccc acatcaccgg tggtcgcggc gagctgaccg gacacctcgt 2320141 tctccgccgc aaccagttgc ccgacacgta agcggatgtc accggccaac gcctgagcct 2320201 gagctagtcg agcggcctgc actgcagccg gctgcgtcgt tttggtgtcg gtgaccgata 2320261 ggtcttcacc gacgttgaaa ccggcgtcct gggcgtcctc tacggcatac ataactcttc 2320321 gttgtgccgc gtcgatagtg ccggcgccgt tgcgcgcgat cgtggctgct ctccgcagct 2320381 ggtcggctat gccactgacc gttgagaagt cagctcgggt tcgttgtcgc agcccgtcgc 2320441 ccccggcgcc attccaggcg atggcatggg cctggtttcg catctgcaga aacacgtctt 2320501 cccaccgatc cgcggtttcg gtccagtagc cggccgcatc gataaggtgc tcggtgctcc 2320561 atgcccggat ttgggacagg gtggccagca tctaaaccac cgtcacctgc gtcaccgcgg 2320621 ccatctcgct cgccgcagtt gcctcttgat gggcgtaagc tgcggccgcg gcagccaccc 2320681 cggtagccgt agcctgcgtc cgggtggcga attccgctgc cgcgcagcag attgctgcgt 2320741 tgataccact taccgccacc gtcgtggctt ggaatggttg gccggattct ggcggcgttg 2320801 ccgaggcggc gaactgggcg ccaaggccct gcgattgact ggccgcaacc tcaagctgac 2320861 caagtacaac ctgtagttca ttcgacccca cccgcgggag tctaaatcga gaccacgcag 2320921 agggctattc acgccgattc aaagccgtcg aagaaacgac accacccgcg ggccgatgag 2320981 acaggaacga tcacacaggt gcttgcgaag atccgtcacc acgtatgcgg gcgaacggtg 2321041 tgttcggcct gttggcggcc gccgcgtgcg gtgttcccat ccccgttatc gacaaccgcg 2321101 ccgaggagat gacgggccgg cacgccacaa cggcaacgag tttcagcatc acggaccagt 2321161 cgtgcgcatc atgaggactg ccgcgccgct gcgctcaccg cggtcgtcaa agcattggat 2321221 ccaatgacgc catcgcggtg gcgcccggtg aggtgcgtga ccgtggtctc cggttatacc 2321281 ttcgagccga ccgcagggtg acttgatcgt caaatccacg acagtagcct tacaccaagt 2321341 ccgaagggag tagcggtgtt tgtcgatgtt ggacttttgc attcgggggc aaacgagtct 2321401 cactacgccg gtgagcacgc ccatggtggt gctgatcagc tgtcgcgggg acccctgctg 2321461 tcggggatgt tcggtacatt tcctgtcgcc cagacttttc acgacgcggt cggcgcggcc 2321521 cacgcacagc agatgcgaaa cctgcacgct caccggcagg cgttgatcac ggtgggcgag 2321581 aaagcgcgcc atgccgcgac ggggttcacc gacatggacg acggcaacgc cgctgagttg 2321641 aaagctgtgg tatgcagctg cgccacataa acatccgggc gctgatcgcc gaggccggcg 2321701 gcgatccctg ggcgatcgag cacagcctgc acgcgggtcg gccggcccag attgccgagc 2321761 tggcggaggc gtttcacgcg gcgggtcgat gcaccgccga ggccaacgcg gccttcgagg 2321821 aagcccgtcg ccgcttcgaa gcgtcctgga atcgagaaaa cggcgagcac ccgatcaacg 2321881 actccgccga agtgcagcgc gtgaccgcgg cgctgggtgt gcagtctttg caattgccca 2321941 agatcggtgt cgatttggag aacattgcgg ccgacctcgc cgaggcgcaa cgggctgcgg 2322001 ccgggcggat tgcgacgctc gaaagtcaac tgcagcggat cgacgatcag cttgaccaag 2322061 cgctggaact cgagcacgac ccccgactgg ccgcggccga aagatccgaa cttgatgcgc 2322121 tgatcacctg ccttgagcaa gatgccatcg acgacacggc gtcagcactg ggccagctgc 2322181 aatcgatacg cgccggatac tcggatcacc tgcagcaatc gctggccatg ttgcgtgccg 2322241 atggctacga cggggcgggg ctgcagggat tggacgcacc gcaatcgccg gtgaaactcg 2322301 aagagccgat tcagattccg ccaccaggca ccggggcacc agaggtgcat cggtggtgga 2322361 cgtcgctgac gtctgaggaa cggcagcgtc tgatcgccga gcacccggaa cagatcggca 2322421 atctcaacgg cgttccggtc agcgcgcgca gcgatgccaa catcgcggtg atgacgcggg 2322481 acctgaatcg ggtacgtgac atcgccactc ggtaccgcac gtcggttgac gacgtcctgg 2322541 gtgatccggc gaaatacggt ctgtccgccg gcgatatcac ccgctaccgc aacgccgatg 2322601 agaccaagaa aggcctcgac cataacgccc gtaatgatcc ccggaacccc tccccggtat 2322661 acctgttcgc ctacgatcca atggcattcg gcggtaaggg acgagccgcg atcgctatcg 2322721 gcaaccccga caccgcaaaa cacaccgccg tgattgtgcc cggcaccagc agcagcgtga 2322781 aaggcggctg gttgcatgac aatcacgacg acgcgctgaa cctctttaac caggccaagg 2322841 ccgccgaccc gaataatccg accgcggtga tcgcctggat gggatatgac gccccgaacg 2322901 acttcaccga cccgcgtatc gccactccga tgctggcccg aatcggtggt gcggcactgg 2322961 ccgaggacgt caacggtttg tgggtaacgc atctcggcgt cggccagaat gtcaccgtgt 2323021 tgggccactc gtacggctcg accaccgtgg ccgacgcgtt cgccttgggc ggcatgcatg 2323081 ccaacgatgc ggtgctactg ggctgcccgg gaaccgacct ggcccacagc gccgcgagct 2323141 ttcacctgga cggaggccgg gtgtatgtgg gtgcggcctc tacggatccg atcagcatgc 2323201 tcgggcagct cgacagcctc agccagtatg tgaaccgtgg caaccttgcg ggtcagctgc 2323261 aaggtttagc cgtcggcctg ggcaccgacc ccgccggcga cggattcggt tcggtgaggt 2323321 ttcgcgctga ggtgcccaac tctgatggca tcaaccccca cgaccactcc tattactacc 2323381 accggggcag cgaggcgttg cgcagcatgg ccgacatcgc ctccggtcac ggcgacgcgc 2323441 tagcatccga tggcatgctg gcccaaccac gtcaccaacc cggcgtcgag atcgacattc 2323501 caggtcttgg gtcggtggaa attgacatac cgggcacgcc ggccagcatt gacccagagt 2323561 ggagccgccc tccgggatct atcaccgacg accatgtttt cgatgcccca ctccaccgct 2323621 gatcgacggc ttcggctgac gcggcaggct ttgctcgccg cggccgtggc gccgttgcta 2323681 gcaggatgtg cgctggtgat gcacaaaccc cattccgcgg gttcgtctaa tccctgggat 2323741 gattccgcgc acccgctcac cgacgatcag gccatggccc aagtcgtcga gccagccaaa 2323801 cagatcgtcg ccgccgccga cctgcaggct gtcagagcgg gattctcgtt cacctcgtgt 2323861 aacgaccaag gcgatccgcc ttatcagggc accgtcagga tggcctttct gttgcagggc 2323921 gatcacgacg cgtactttca gcacgtccgt gccgccatgc tgtcgcacgg ctggatcgac 2323981 ggccccccac cgggacagta cttccacggc ataaccctgc acaagaacgg agtgaccgcg 2324041 aacatgagct tagcgttgga ccacagttac ggagagatga tccttgatgg tgagtgccgc 2324101 aatacgaccg accaccacca tgacgacgag accaccaaca tcaccaacca actcgttcag 2324161 ccatgaaggc gtcgggtgcc ttcactgttc ccacatcgat gtcagtgatc accaacccgt 2324221 gtggcacgtg gcgaccggcg accggcgagc ccgcatcgca ccaggtatcg aggaactcgg 2324281 acccaccctg gtcgaaacgg tacgccgccg cgacgcactg ccccgcatcg cccaagccgt 2324341 agtagtggcc gccacccgca actacggcgt ccccgacaac gaaaccgacc tactgcggtc 2324401 gcccaggcca aggtggccac caaacgctgc tggcatgcag gtggagtgca cagacacggc 2324461 agctgcaata gccttacgcg ggtgaccaac accccccccc ccacccacca caggacaatg 2324521 gacaccaacc caccccccag cgccgccgcg ttcacgcaat tggccgttgg cggcggtggc 2324581 cagcgtcgcg attgccgcgg ttgtgctggg tgccgcagct ttaatcgtgg cactgacgcg 2324641 cccgacgaac agcggtccag ccaccgccgc tggaacgacc gccgagccga catacaccgc 2324701 agcagaaacc gccgccgcgc accaaaagtt atgcgaggtg tacaaactgg cagcgcgggc 2324761 ggtccaaatc gcgacaaacg gcgacaaccc ggcgttcgca aacattgcca cagtcaatgg 2324821 tgcggtgatg cttcagcaga cactgaatac gaccccggcg ctcgtgcccg gcgagcgcac 2324881 cgatgcactt gcactagcag aagcatatgg ccaagctaca gcctttgcga tggagcaaga 2324941 ccatccagcg tggcagtcag cagccaatga tgtcaatgcc aaggatgcgc gcatgaaggc 2325001 catctgcggt ggcgggtgat ctgccacccg gtcggtggtc ggcgctcttg gtgggtgcgt 2325061 ggtggccggc gcggcccgat gcgccgatgg ccggggtgac gtattggcgt aaggcggccc 2325121 agctcaagcg caacgaggcc aacgacctgc gcaacgagcg atccctgtta gcggtaaacc 2325181 aagggcgcac cgccgacgat ttgttggagc gatattggcg cggcgaacag cgactagcca 2325241 ccatcgcgca tcagtgcgag gtcaaaagcg accaaagcga gcaagtcgcg gatgcggtga 2325301 actatttgcg ggatcggctg accgagatcg cacaatccgg caatcagcaa atcaaccaaa 2325361 tcctggccgg caaagggccg atagaggcca aagttgccgc ggtgaacgcc gtcatcgagc 2325421 agtcgaatgc catggccgac catgtgggag caaccgcgat gtccaacatt atcgacgcga 2325481 cgcaacgagt gttcgacgag accatcggtg gtgacgccca cacctggttg cgtgaccacg 2325541 gtgtaagcct cgacgctccc gcgcggccac gcccagtgac cgctgaagac atgacttcta 2325601 tgacggcgaa ctcgcctgca ggatccccat tcggtgctgc tccgtctgcg cccagtcatt 2325661 cgacgacaac cagcggcccg ccgacagctc caacaccaac atcaccattc ggcactgctc 2325721 ccatggtgct aagttcatct tcaacaagta gcggcccgcc gacagctcca acaccaacat 2325781 caccattcgg cactgctccc atgccgcccg gcccaccccc accgggtacc gtctcaccac 2325841 ccctaccccc cagcgccccc gccgttggtg ttggtggccc gtcagtaccg gccgctggca 2325901 tgccaccagc agcggcggcg gcaacagcgc cgttatcccc acagtcgttg ggccagtcgt 2325961 tcaccaccgg gatgacgacg ggcacgccgg ccgcggccgg tgcacaggcg ctgtcggcag 2326021 gggcgctgca cgcggcaacc gaacccctgc cgccaccggc gccacccccg acgacaccca 2326081 cggtcaccac accgacagtc gcgaccgcca ccacggccgg gattccccac atccccgaca 2326141 gcgcgccgac ccccagcccg gcaccgatcg cgccaccaac caccgacaac gccagcgcca 2326201 tgacacccat cgcgcccatg gtcgctaatg gcccgccagc atccccggcc cccccggccg 2326261 ccgcccccgc ggggccactg cccgcctacg gcgccgacct gcgcccaccg gtaaccacac 2326321 cccctgccac gccacccacc ccaaccggac ccatctccgg tgccgcggtc acaccctcct 2326381 cacccgcagc aggcggctca ctaatgtcac ccgtcgtcaa caaatccacc gcaccagcca 2326441 ccacccaggc ccaacccagc aacccaacac caccgctagc cagcgccacc gcggccgcca 2326501 ccaccggcgc cgcagccgga gacacctccc gccgagccgc cgaacaacaa cgcctacgcc 2326561 gcatcctcga caccgtcgcc cgccaagaac ccggattatc gtgggctgcc gggctacgcg 2326621 acaacggcca aaccaccctg ctggtcaccg acctcgccag cggctggatc cccccacaca 2326681 ttcgcctacc cgcccacatc accctgctcg aaccggcccc ccgacgccgc cacgccaccg 2326741 tcaccgacct actgggcacc accaccgtag ccgcggcaca ccacccccac ggctacctca 2326801 gccaacccga ccccgacaca cccgcactca ccggcgaccg cacagcacgc atcgcaccca 2326861 caatcgacga actcggaccc accctggtcg aaacggtacg ccgccacgac acactgcccc 2326921 gcatcgccca agccgtagta gtggccgcca cccgcaacta cggcgtcccc gacaacgaaa 2326981 ccgacctctt acaccacaaa accaccgaga tccaccaagc cgtactgacc acctacccca 2327041 accacgacat cgccacggtg gtcgattgga tgctgttggc ggcgatcaac gcactgatcg 2327101 caggcgacca gtcgggggcg aactatcacc ttgcctgggc gatcgccgcg atatcaacga 2327161 ggagatccag atgacgtcaa tcgaatcgca tcccgaacaa tattgggcgg cggccggcag 2327221 gccagggccg gtgccgctgg cgctgggacc cgttcatccc ggtggaccga cgctgatcga 2327281 cctgctgatg gcgctgtttg gcttgtccac gaacgccgat ctgggaggca cgaacgccga 2327341 catcgaggga gatgacaccg atcggcgggc acatgcggcc gatgccgcgc gcaagttctc 2327401 ggcgaacgag gccaatgcgg cggagcagat gcagggggtg ggcgcgcagg gaatggcgca 2327461 gatggcgtca ggcatcggcg gagcgctcag cggcgcgctc ggcggcgtca tggggccgct 2327521 gacccagctc ccgcaacagg cgatgcaagc cgggcagggc gccatgcagc cgctgatgag 2327581 tgcaatgcaa caggcccaag gcgctgacgg actggcggcc gtggacgggg cgcggctgct 2327641 ggacagcatc gggggcgagc ccggtcttgg cagcggtgca ggtggcggtg acgtcggggg 2327701 cgggggcgct ggcggcacta cccccaccgg ctatctgggt ccaccacccg taccgacgtc 2327761 gtcaccgccg acgactcccg cgggggcacc gaccaagtcg gcgacgatgc ccccgcccgg 2327821 cggcgcttca cctgcctcag cgcacatggg tgcggccggg atgccgatgg tgccgccggg 2327881 cgcgatgggc gcccggggcg aagggagcgg ccaagaaaag ccggtcgaaa agcgcgtgac 2327941 cgcgcctgcg gtccccaatg gccagccggt caagggccgc ctgacggtgc ccccgagcgc 2328001 accgaccacg aaacccaccg acggcaagcc cgtagttcgc aggcgcatcc tgctgcccga 2328061 gcacaaggac ttcggacgca tagctcccga cgagaagacc gatgccggtg agtgacgatt 2328121 cgtcgtcggc gttcgatctg atttgcgccg agatcgaacg ccagttgcgc ggcggcgagc 2328181 tgctcatgga tgccgcagca gcatccgaat tactactcac cgtgcggtat cagctcgata 2328241 cccagccgcg gccacttgtc atcgtgcatg gaccgctgtt tcaggccgtc aaagcggccc 2328301 gcgcacaggt gtacggacgc ctgatacagc tgcgacacgc gcgctgtgag gtgctcgatg 2328361 agcgatggca gctacggccg acgggtcagc gcgatgtgcg cgcactgctg atcgatgtgc 2328421 tgaacgtgtt gttggcggcc attaccgccg caggcgtgga acgggcatac gcgtgcgcgg 2328481 agcggcgggc gatggccgcc gcggttgtcg ccaagaatta ccgggacgcg ttgggtgtcg 2328541 agctgcagtg caattccgta tgccgagccg ccgccgaggc gatccacgcg ctggcgcacc 2328601 gcacaggggc taccgaggat gccgactgcc tcccgccggt tgatgtgata cacgccgacg 2328661 ttactcgccg catgcatggc gaggtggcga ccgacgttgt cgcggccggc gaactggtga 2328721 tagcggcgcg acacttgctg gaccccatgc ccaggggcga gctcagttac ggcccactcc 2328781 acgagggggg aaatgcggcc cgtaaatcgg tctatcgacg cctggttcag ctatggcaag 2328841 cgcgccgggc tgttaccgac ggtgacgtcg acctgcgcga cgctcgcacg ctgctgaccg 2328901 atctggacag cattttgcgt gagatgcgca cggccgcaac cattcaacag gcgtacacac 2328961 gagcggaacg gcgggcgatg gcggcggcgg tcgtcgccaa gattcgcggc gacgcaatgg 2329021 gcctcgacgc ccagcgcgac gcggtacatc gcgcggccgc cgatgcgctc cacgcgttgc 2329081 aatcggttgg catacaccaa taggcgaccc tttggcagtt gagggtgtag aggagatcgg 2329141 cgcgtcgttg ccggggcggg agtcgacgcc ttccgatgat ggaggttccc tacacccatc 2329201 aggaagacct cgacgcgtcc atcgccgccg gtggtgcggg cttggcctgt gctgacacat 2329261 gaccgctttc cgccgccttg attgttgacc ggcactgggt ttgggggcgg ccgcgtcact 2329321 gtaggtgagt atgggacgtg agcgacatgt gcgacgtggt gtcgttcgtt ggcgccgccg 2329381 agcgtgttct gagggcgaga tttcggccga gcccggaatc tggcccccca gttcacgctc 2329441 ggcggtgcgg ttggtctctg gggatcagcg cggagacgct gcgccggtgg gcaggtcaag 2329501 ccgaggtcga tagcggtgtg gtggccggcg tgtccgccag cagaagtggg agcgtaaaga 2329561 ccagcgagct tgagcaaacc atcgaaatac tcaaggtcgc aacgagtttc ttcgcgcgga 2329621 agtgcgaccc gcgacaccgc tgatctgtgc gttcggcgac aagcacaagc acacctacgg 2329681 ggtcacaccg atctgtcggg cactggccgt gcacggcgtg cagatcgcct cgcgcaccta 2329741 tttcgcggat cgcgcggcag cgccttcgaa acgcgcactg tgggacacca caatcaccga 2329801 aatcctggcc ggctactacg aacccgacgc cgagggcaaa cgcccaccgg aatgcctgta 2329861 cggcagcctg aagatgtggg cgcacctgca gcgccagggc ttccggtggc cctctgccac 2329921 ggtgaagacg atcatgcggg ccaacggttg gcgcggagtg cccctcgcag cgcacatcac 2329981 acaccaccga accagacccg gccgcggccc aggccctaga cctggcgggt cggcaatggc 2330041 gggctttagc aacgaacctg ctggaagcgg ccgacttcac ctacgcgccg atgacgtgga 2330101 gttccggcta caccgcgttc gtggtcgacg cctacgccgg tgtgatcgcg ggctgggaat 2330161 gctcgctgac caaagacgca gcgttcgtcg aacgcgcatt acgccacggc cttccagact 2330221 cacctaggtc acccgtttgg cggagctatt catcatcgcg acgccggaag tcagtatact 2330281 gcaatatatt tcggcaagac accgatgcta gccgggctgc ggccgtcgat aggcattgtt 2330341 ggcgacgccc tcgacaacgc cttatgtgaa accacgacag ggccccacag gaccgaatgc 2330401 agccacggca gcccgtttcg tagcgggccg atccgcaccc tggctgacct ggaagacatc 2330461 gcctcggcgt gggtggagca cacctgtcac acacaacaag gtgtgcgaat acccgggagg 2330521 cttcaacctg cgtagtgggc ggaagcgttt cacgacgcga tcggcttagc gtatgcgcgg 2330581 gccgatacca cgggtgcacg cgatcacctg gaactggtga gttggctatc gtggtttggt 2330641 gattacttgc gcttgggggc ttgccgacgg ttgcgccggg cgcaagtggg gtgcggtttt 2330701 gcggttgatg gatggtagct ggtggcccac gagttgagtg cgggttcggt ttttgccggg 2330761 taccggatag agcggatgct aggtgccggc ggaatgggca ccgtatatct ggcgcgtaat 2330821 cccgatctgc cgcgtagcga agccttgaaa gtccttgctg cggagttgtc gcgtgacctc 2330881 gattttcggg cacggtttgt ccgcgaagcc gatgtggccg cggggttgga tcatcccaac 2330941 atcgtggcgg ttcatcagcg cggccagttc gagggtcggc tatggattgc gatgcagttc 2331001 gtcgatggcg ggaacgctga ggatgcgctg cgggcggcga ccatgaccac agcgcgggcg 2331061 gtgtacgtga tcggcgaggt cgccaaggcg ctcgactatg cgcaccaaca aggcgtgata 2331121 catcgcgata tcaagccggc gaacttcttg ttgtcgcgag ccgctggcgg cgatgaacga 2331181 gtgctgctaa gcgattttgg gatcgcgcgt gcgctcggcg acacgggact gacgtccacc 2331241 ggttcggtgc tggccacgtt ggcctatgct gcgccggaag ttcttgcagg gcaaggtttt 2331301 gatggccggg ccgatttgta ttcgttgggg tgtgccctat ttcggctact aaccggtgag 2331361 gcgccgtttg ccgccggtgc tggagcggcg gtggcagtgg tggcgggtca tctgcaccaa 2331421 ccgccgccga cggtcagcga tcgcgtgcca gggctgtcgg cggcgatgga tgcggtgatc 2331481 gccactgcga tggccaagga tcccatgcgt cggttcacct cagcgggtga attcgcacat 2331541 gccgccgccg cagccctgta cgggggagcc accgacggat gggtgccgcc gagccccgcg 2331601 ccgcacgtca tatcgcaagg cgccgtgcca ggttcgccgt ggtggcagca tccggtcggg 2331661 tcagtgaccg cgttggccac gccgcccggt cacggttggc cgccaggcct gccgccgctg 2331721 ccgagacgac cgcgccgcta ccgtcggggc gtggcggcgg tggcggccgt gatggtggtg 2331781 gccgccgcgg ccgtcaccgc ggtgaccatg acatcgcacc aaccgcggac cgcgacgccg 2331841 ccaagcgctg cagccctttc tcccacctcg tccagcacaa caccaccgca accaccgatc 2331901 gtgacaaggt cgcgcctacc cgggttgttg ccgccccttg atgacgtcaa aaacttcgtg 2331961 ggcatccaga acctggtcgc ccatgagcca atgcttcaac cccagactcc caacgggtca 2332021 atcaaccccg cggagtgctg gccggcggtt gggggtggcg ttcctagcgc ctacgacctg 2332081 gggaccgtca tcggctttta cgggttgaca atcgacgagc cgcccaccgg gactgcccca 2332141 aatcaagtgg ggcaactgat cgtggccttt cgcgacgcgg ccacagccca aaggcatttg 2332201 gccgatttgg cgtcgatctg gcgccgatgc gggggtcgaa ccgtaacact cttccgtagt 2332261 gagtggcgaa ggcccgttga actgtcgacg agcgttcccg aagtcgttga tggcatcacc 2332321 accatggtgt tgacggcgca gggaccggtg ctacgagtcc gcgaagacca tgcgatcgcc 2332381 gcgaagaata atgtgcttgt cgatgtcgac atcatgacgc ccgacaccag ccgcggccag 2332441 caggcggtca tcggcatcac caactacatc ctcgccaaga tacccggctg agcgcgacac 2332501 cattggccta ggacaccggc accacgatca actcgtgcgg gcagttgttg acagacacag 2332561 caccgtcctc ggtcacgatc acgatgtcct cgatgcgggc gccccaccgg cccgggaaat 2332621 agattcccgg ctcgatggaa aacgccatgc cgggaaccaa caccaggtca ttgccggcga 2332681 cgatataggg ctcctcgtgc acgcacagcc cgatgccgtg cccggtgcgg tgcacaaaat 2332741 actccgcgag cccggcctcg gcgagcacgt cacgcgcggc ggcgtccacc tgctccgctg 2332801 tcacccctgg gcggatggcc tcgaacgccg cccgctgggc tcgctgcaac atcgaatatg 2332861 actgcgctac atcagaatca ggctcgccga tgctgtaggt tcgggtggag tcggagtggt 2332921 atccaggccc atacgtgccg ccgatgtcga cgacaacgat gtcaccctcc cgcaattcgc 2332981 ggtccgaata tccgtgatgc gggtcggcgc cgtgcggccc ggaacccacg atgacgaacg 2333041 ctacctccga atgcccttcg gcgacaattg cttcggcgat gtcggcggct acgtcggctt 2333101 ccgttcggcc cgggaccaga aactccggca ctcgggcatg cactcgatcg atcgccgcgc 2333161 cggccttacg cagcgcgtcg atctcggttt cctccttgac catccgcagc ctgcgcagca 2333221 cgtcggtggc caataccggc agcacaccca gtgcgtcggc cagcggcaac atgtgcaacg 2333281 ccggcatgga atcggtgacc gcggtcgcta ccggagctcc gcccaacacg gcactcacca 2333341 acccgtaggg gtcgtcaccg tcgacccaat cgcacacgcg cagacccaat tccgctgcgg 2333401 cggattgctt gagggcggcg agctccagcc gcggcagcac aaccgccggc gcaccggcgg 2333461 ccggcaacac caacgcggtg agccgctcga acgtctccgc tcgcgacccg atgaggtaac 2333521 acaggtcgta gccgggagtt atcaccagac ccgccagacc ggcgtccgcc gtcgcggccg 2333581 ccgctaaagc cagccgccgt gcataaacct cggcgtcgaa tcggcgagaa cccatgtcag 2333641 ccaggttaac cgcgcgttcg cgagcgctgg caagatagcc cgcatgcccg cacccgatcc 2333701 gatgcgtggc gacccgccgc acccggctcc gccgcgcttg cgatcgccac tggacccaac 2333761 aagtggcgac ccgctgcacc cggctccgcc gcgcttgcga tcgccactgg acccaacaag 2333821 tggcgacccg ctgcacccgg ctccgccgcg cttgcgatcg ccactggtgc tactggacgg 2333881 cgccagcatg tggttccgct cgttcttcgg tgtgccatca tcgatcaccg ctccggatgg 2333941 ccggccggtc aacgccgtac gcggcttcat cgactccatg gcggtggtga tcatacagca 2334001 gcggccaaac cggctggcgg tctgcctcga cttggattgg cgcccgcagt tccgggtgga 2334061 cctgatcccg tcatacaagg cacaccgggt ggctgagcct gagcccaacg gccagcccga 2334121 cgtcgaggag gtgcccgacg agctgacccc gcaggtcgac atgatcatgg agttactgga 2334181 cgcgttcggg atcgcgatgg caggcgcccc gggattcgaa gccgacgacg tgctgggcac 2334241 gctggcaacc cgggagcgcc gcgacccggt aatcgtggtc agcggagacc gcgacctgct 2334301 gcaagtggtc gccgacgatc cggtcccggt ccgggtgctc tacctgggcc gcggccttgc 2334361 caaggccacc ttgttcggac cggccgaggt cgccgagcgc tacgggttgc cggcacatcg 2334421 cgccggcgcg gcctacgccg aactcgcgct gctgcgtggc gatccgtccg acggcctacc 2334481 cggcgtgcca ggcgtcggcg agaagaccgc cgctacccta ctggcccgac acggctcgct 2334541 agatcagatc atggcggccg ccgacgaccg caagaccacg atggccaagg gcctacgtac 2334601 caaactgctt gccgcgtcgg cctacatcaa ggccgccgac cgggtggtgc gggtcgccac 2334661 cgacgcaccg gtcacgctgt cgacacccac cgacaggttg ccgctggtcg cagctgaccc 2334721 ggagcgcacc gccgagctgg cgacccgatt cggggttgaa tcctcgatcg cgcgactaca 2334781 aaaagcgctc gacacgctgc ccggatgacg attactgtgg ccggccgacc tcgtaggtgc 2334841 ccttgttgtc ctggaaggtc acggtcacgc gctttgaggt gccgtcgatg ctcaccgtgc 2334901 attcgaaggt ggcgcccttt ttgaccgtgg ggtctgaacc gttgttgcac ttgacgtctt 2334961 tgacgttctt ggcgccgtac cccgtggtct catcggtgag aacctgctgc acaccggcct 2335021 gcgccttaat gacgtccagc ttggtggtga cgaagaatcc gggtgcccag aagccgagta 2335081 ttagaaccgc gccgatgaac agcacggcca tcacggcgat cacgccgccg atcaccgcaa 2335141 ccgaacgctt cgacccctga cccgactggc catacgggcc gtattgcccg gggtactgac 2335201 cgggcggtgc gtactggccg ggctggccgt actgtcccgg ctggccatat tggcccggct 2335261 gctggtattg gccgtactga ccgggcacgc cgagctgggt gggctgtgca ccgaactgtt 2335321 cgggctgcgc atagccgggt gtgggctgcg ggtactgctg cgggtacgcc gggtcagccg 2335381 gctgttggta ctgcggtgtg tacgccgggg cctgccacgt cgcctcctgg gtcggctgct 2335441 gctgccaggg atatcccgcg gccacggtgg ggtccgagga atggtcggcg ccctggccgg 2335501 gcggctgcca cggctgcctt gggtccgatc cctgcggtcc gctcatcgct tctcctcagt 2335561 ctgtgttaac cgtaactctg gcccagccta cccggcgtca accgcgacga cgccgcgccg 2335621 aatgtcaccg atagcgcgct ttgcggtagc ccgcagttcg gggttgggcg cagcgttacg 2335681 aacttggtcc agcagatcga gcacctgacg gcaccaacgc acgaaatccc ctgccaataa 2335741 cggtgatccg ctgccgttca cgtcggcagc ggccaatgcc gccgctagat caccggttcg 2335801 cgaccagcgg tagatgactc tgacaaagcc atcgtcgggt tcgcgactcg gggtgatgcg 2335861 gtgtgcctgc tcgtcggcgc gcaatgtcgt ggacagcctt gatgtctgag tcagagcctg 2335921 ccgtaaccgc ggtgtgggca catcggctcc gaacggggcg ccctggccgt caccaccgcg 2335981 cgtctcgtag accaccgccg acaccacccc cgccaattcg gccggcttta aaccctccca 2336041 cgcacctgta cgtaggcact cggccaccaa caggtcgctc tcgctgtaaa tccgcgccag 2336101 cagccggccg tcgtcggtga ccacgggatc agtggccggg ccatcgatga actcccgttc 2336161 ggtgagcagc ccgacgaatc ggtcgaacgt gcgggccaac gagttggtgg cggcggcgac 2336221 cttcctctct aattgcgcgt tgtcgcgttc gatgcgtaag taacgctcgg cctggcggat 2336281 ctggtcctcg agcccgggcg aggtatgcac cggatgacgg cgcaattgtt cgcgcgacga 2336341 ctccagctcc ggatcgtgaa acccgccggc ctcgctgacg cgccgggcgg ctggaataac 2336401 cagacccgcg gctgccgatc gcagcgccga ggccaggtcg cgccggaccc gcggctggcg 2336461 gtgctccacc cgcttgggca gcgtcatcga ccccaccggc gtcgtgcccg agtagtcggc 2336521 cgaggagatc cgtcccgccc atcggtgttc ggttagcacc agcggacgcg ggtcgtcgcg 2336581 gtcgcgggct gattccagga cgacggccag accaccgcgg cggccgtggg tgatggtgat 2336641 gatgtcaccg cggcgcagcg cggccagcgc atcggtggcc gcctgccgtc gctgtaaccg 2336701 cgacgcgcgg gcctgcgcac gttccagctc ggacacccgc gcgcgcaatc gagcgtattc 2336761 gaggatgggc gcatcagatc cgcccagttc ggctgcgatc tcgccgagta tcctgttgcc 2336821 ccgctcaatt ccgcggacca gtccgaccac ggatcggtcg gcctgatatt gggcgaacga 2336881 ctgctcgagc agtcggtgcg cctgttgcgg acccatccgg tgcaccaggt tgatcgtcat 2336941 gttgtacgac ggggcaaacg agctgcgcag cggaaaggtg cgggtggagg ccaggcccgc 2337001 cacctcggac ggttcaattt ccgggtgcca gatcaccacc gcgtgaccct cgacgtcgat 2337061 accgcgccgg ccggcgcgac cggtcagttg ggtgtactcc cccggcgtca gcggcatgtg 2337121 ctgctcaccg ttgaacttca ccagccgctc cagcaccacc gtgcgggccg gcatgttgat 2337181 accgagcgcc agagtctcgg tggcgaatac agccttgacc aaaccggcgg tgaacagctc 2337241 ctccaccgtg tgccggaagg ccggcaacat gcccgcgtgg tgggcggcca gaccgcgcag 2337301 taacccttcc cgccattcgt agtagccgag taccgccagg tcggagtcgg ccaggtcacc 2337361 gcagcggtgg tcgatcacct cggcgatccg tgcgcgctcc tcttcgctgg tcaaccgcag 2337421 cggtgaccgc aggcattggg tgaccgcggc gtcacaaccg gcccgggaga acacgaaggt 2337481 gatcgccggc aacagccctt cagcgtcgag tttggcgatc acctcgggtc ggccgggtgg 2337541 ccggtagaag ccgggccggc ccgagcctcg gcgccgaggc tgccaatcgg ccatccggtc 2337601 ggcctcacgg cgatgcgcga tgtggcgcag caactcgcgg ttgacttggg gctgcccttc 2337661 ggcttcgccg atccggtaat cgaacaggtc gaacatgcgc ttgcccacca agacgtgttg 2337721 ccacaacggc accggccgat gctcgtcgac caccaccgtg gtgtcgcccc gcaccatctg 2337781 gatccaaccg ccgaactcct cggcgttgct caccgtcgcc gacaggctga ccacccgcac 2337841 gtcgtcgggc agttgcagga tcacctcctc ccacaccgga ccccgcatcc ggtcggcgag 2337901 gaaatgcacc tcatccatca ccacatagga aagcccctgc agcgcaggcg aatccgcgta 2337961 gagcatgttg cgcagcactt cggtggtcat caccaccacc ggcgcgttgc cgttgaccga 2338021 caggtcaccg gtcagcagcc cgatctggtc acggccgtag cgtgctgtga gatcggtgtg 2338081 cttttggttg ctcagggctt tcagcggcgt ggtgtagaaa catttactgc cggccgccag 2338141 cgccaggtgc acggcgaact cgccgaccac cgtcttgcca gcgccggtcg gcgcgcacac 2338201 cagcacaccg tggccgcgtt ccagcgcgct gcaagcccgc tgctgaaagt cgtcgagcga 2338261 gaacggtagt tccgcggtga accggtccag ctcggccagc tcagtcacgt cgccgccgcc 2338321 tcgccagttg accgcgcccg ctcgcggcta gcgggcctac gtgacgtcgt catgagatcc 2338381 gatgaccgat ggcgccggca ccggcgaggg cgggtcgatg accgaagctt cgtcgtcggg 2338441 aatcgcggct tcgcgcttgg cttttcgctt gtcatgcacg cgggcgatct gaatggcgag 2338501 ctctagcagc acggtcaacg ccgcaccgag cgcggtcatc gagaacggat cggatccggg 2338561 cgtgaagatc gccgcgaaga cgaacatcgc aaagatcaac ccgcgccgcc aagacttgag 2338621 ccgctcatag gtcagcaggc ccgccaggtt cagcatcacg atcagcaggg ggaattcgaa 2338681 gctgaccccg aacaccacca gcaggttgag cagaaagcca aagtagcggt cgccagacag 2338741 cgcggtcacc tgcacgtcgc tgccgacggt caacaaaaag cccaacgcct tggacaacac 2338801 caggtaggcc agtacggcac cggcgacgaa cagcaccgct gctgggatca cgaaggccac 2338861 cgcgaagcgg cgctccctct ggtagagacc aggcgtgatg aacgcccaca gctggtagaa 2338921 ccacaccggg caagccagca caatgccggc ggccatcccg accttgagcc gcaacatgaa 2338981 ctggtcgaac ggcgcggtgg ccaacaaacg gcactctccg tcggcgctga tatccgcccg 2339041 ggccgactgc ggcagggcac agtagggatg ccgcagccac tctccgaggc tgtccaaccc 2339101 gaaaatcgaa tgcgaatacc agacgaaccc gaagattgtg gtgaccaaga tcgcggccag 2339161 ggagatcagc aacctggtgc gtaactcggt caggtggtcg accagcgaca tcgtcgcgtc 2339221 aggattgacg cggctgcgcc tgttacgtgg gttgagccgt ttgagaagac cggcggcgcg 2339281 cactgaagcg acgcccgagc taagccggcc gagcctcggt gctgtcttga ccagacgccg 2339341 ctgaggggtc gacacgctgc gattgcaccg gcgtgggggt ctcgatagac gcttccgctt 2339401 tgttctcgtt ctgcagttca cggacctcgg acttaaagat tcgcaatgac ttgcccaacg 2339461 agcgcgccgc atcggggagc ttcttggcac cgaacaacac gatcaccacg acagcgagga 2339521 tcgcccaatg ccacggactc agactgccca ctttgattac ctccagacgt tgacccgatg 2339581 ctaccgcagc ggccgcggca cccggagatt tcgcgccgtc acggcggcgc agctgcctgg 2339641 tatgcatcca gcgcggccgt cgcggcgtcg cgaacccgct gagcgagcga ctccggcgcc 2339701 agaacgcgca cgtccgaacc gaagcccagc aataggcgcg tcatccaatc ctcagaggcg 2339761 taggtcatga ccacctcaca ggagccgtcc ggcagctgtc gtagctcccg aatcgggtag 2339821 tactccagca tccacgaggc cgacggtgcc acccgcaacg tcgccgacgg cagcgatagg 2339881 tcaccgtcga acagcgacgt gtccggtggc gcctgccgtg ccgattccgg cggaaccgcg 2339941 ggctcgccca actcggcggc atcgacaatc cggtcgaaac ggaacaggcg aaccccttcg 2340001 gcctcacgcg accaggcctc caaatagctg tgcccgccga tcaacagcac ccggatggga 2340061 tccacgatcc gagtggtgag ggtgtcatgc gacgcggcgt aatagtcgat ggtcagcgcc 2340121 cgactgttcc gcaccgcggc ccgtacggcc gcggcggccg ggctttctgt gggtgcctgt 2340181 tcggcaacgg cggccaccgc gccggccgcg gcggcgatct tggcgatggc gctgcgcgcc 2340241 gcctgcgggt caaccacgcc gggaatgtcc gctagcgccc gcaacgccac cagcagcccg 2340301 gtggcctccg gcgatgtgag ctttaacggc cggtcgatgc ccgccgagaa cgtcacctcg 2340361 atggtgtcac cgcagaattc gaagtcgatg aggtcacccg gggaatagcc cggaaggccg 2340421 cacatccaca gctggttgag gtcctcctcc agctgcttgg cggtgacacc cagctcggcg 2340481 gcggcctcgg cgcgggtgat ccgggggttg gcctggaagt acggcaccat gttgagcagc 2340541 cgcaccagcc gggtggacag ggcgctcatg ccagtgctcc ggcttgcgcg cgtagtcggg 2340601 ccagcacatc gtcgcgcaga gacccgggct gcagcacgat tgcgtcggcc ccatagccgg 2340661 tgatctcacg cgccagccgg tcgctggatc gaatctcaag ctcgatcacc tcgccatcgc 2340721 gaccaccaag ttgtcgcggc ccggcggacc gcccggcacg tcgcaacgcg gtggcccgac 2340781 cctcggctac ccataccgtg gcttgctcac cggtcggcac ctccgtcacc ttctgcgcca 2340841 cgatgctgcg taggtccaca ccggcaggca cggtggttgc gccggccggc ccgattggcg 2340901 tcacctgcgc tccgatccgg gacagccgga agacgcgggt tgcatcccgg tcgcggtcgt 2340961 ggccgaccag ataccagcgg cccttctcgg taaccacacc ccacggctcg acggtccgaa 2341021 cggtgtacgg ctctgcgcgc gacgatcgat gagagaactg caccacctgc ccggaatcga 2341081 tggccgacaa caagattccg agaacgtcct cagagccgcg cagtcccgaa acggccgccg 2341141 ccgacgcgat ggccaccggt gccccggtat ccaagggatc gacgtccacc ccggcggccc 2341201 gcagcttcag caacgcgccc tgggtcgcgg tgatcaactc cggtgactcc cacagctggg 2341261 tggcgacggc taccgcggcc gcctcatccg gggtcagctc gacaggcgac agggcgtagg 2341321 cgtcgcggtt gatgcgatag ccctcggtgg gctccaacgc cgagaccctg ccgacctcga 2341381 gcggaatgcc gaggtcacgc agctcgttct tgtcgcgctc gaacatccgg gagaacgcct 2341441 caacgctggg gctgtccgaa tagcctgcca cgctggacct gatcttctcc gcagtgatgt 2341501 agccacgagt ggacagcaag gctatgacga gattgaccag ccgttcgact ttcgaggtcg 2341561 ccattggtgg tgctacatgc tcgcgatcag ccgcttaacc cgctcatcga ccgcccggaa 2341621 cgggtctttg cacagcacgg tgcgctgcgc ctggtcgttg agtttgagat gtacccagtc 2341681 gacggtgaaa tcacgtcccg cctcctgcgc ggcgctgatg aactcaccgc gcagccgggc 2341741 ccgggtggtc tgtgggggct gatcgacggc ctccgcgatt tcttcgtcgg tggtgacgcg 2341801 cgcggccaac cctttgcgct gcaggagatc aaagatcccg cgtccgcgct tgatgtcgtg 2341861 gtaggccaga tccagctgag cgatcttcgg gtgggacaac tccatgtcat agcggtcctg 2341921 ataacgctga aacagcttgc gtttgatcac ccagtcgatt tcggtgtcga ccttggcgaa 2341981 atcctggctt tcgacggcat cgagttggcg gccccacagg tcgacgacct gctcgatctg 2342041 cgcgttgggc tcccgagtct gcaagtgctc gactgcgcgg gtgtagtact cccgctggat 2342101 gtccagcgcg ctggcctgac ggcctccggc caaccgcacc ggccggcgac cggtgacatc 2342161 atggctaacc tcacggatgg cgcggatcgg gttatccagg gaaaaatcac ggaaggcgac 2342221 tccactttcg atcatttcca gcacgagcgc cgcggtgccc accttgagca tggtggtggt 2342281 ctcggacatg ttggagtcgc cgacgatgac gtgcagccgc cggtacttct cggcgtcggc 2342341 atgtggctcg tcgcgggtgt tgataatggg gcgggatcgg gtcgtggcgc tagagacgcc 2342401 ctcccaaatg tgttcggcgc gttggcttaa gcagtaggtg gcggccttgg gggtctgcag 2342461 caccttgccg gccccgcaga tcagctggcg ggtgaccagg aagggcagca gcacgtcgga 2342521 gatccgggag aactcaccgg cccgcacgat caggtagttt tcgtggcagc cgtaggagtt 2342581 gcccgccgaa tcggtgttgt tcttgaacag gtagatgtcg ccgccgatgc cctcgtcggc 2342641 cagccgctgc tcggcgtcaa cgagcaggtc ttccagcacc cattcaccgg cccggtcatg 2342701 ggtgaccagc tgcaccaggc tgtcgcattc ggcggtggcg tactcgggat gactgcccac 2342761 gtcgagatac aggcgcgcac cgttacgcag gaagacgttg gagctgcggc cccaggacac 2342821 cacacggcga aacaggtagc gggccacctc gtccggggac agccgacggt gaccgtgaaa 2342881 tgtgcaggtg acaccgaact cggtttcgat gcccatgatt cgacgctgca cgtatttgag 2342941 ggtactggtt gttggttggc ggcggcgcga tagccacgcc cgttacccgt ccgggccgga 2343001 cgggccgggg actccgaaca gcagcccgcc ggtgccgccg ctgccgccgg gccccgcggc 2343061 cccgtccgga gtaccgggtc cgccggcggc gccagcccca ccggcgccac cgtcgccgaa 2343121 caagatggcg gtgccgccgt gcccgccgac accgcccggc ccgccgggcg aggtggtgtt 2343181 catgccgggc cccccttggc cggcggcccc gccggcgcca ccgttgccgt accacacgcc 2343241 gccgttgcca ccgctgccgc cggggttgcc cgcgcccccg acgccaccgc tgctgaccga 2343301 gccaggcgcg ccgctcccgc cgctaccgcc ggcaccaccg ttgccgatga gccgcgcgct 2343361 gccgccgttg ccggcgttgg tggagccaat aaatggcagc cccccattac cgccgtcgcc 2343421 gccgccgccg ccatcgccgt acagccaccc gccgacccca ccgtcgccgc cgcgactacc 2343481 tacctggaac aggcgcgcac cgagccctgg gtcgccgccg ttgccgccgc cgccgccgac 2343541 accaatcagc cccgcgtcgc cgccccggcc accgctgccg cccaacccgg cgaaaccgtc 2343601 gctggagacg ccggtacctg catcgccacc gttgccgccg gaaccggccg cccctccatt 2343661 gccgtacagc agcccgccgg cgccaccgaa ctggccgaag ccgccactgc cgccactgcc 2343721 gccggccttg ccgctcccgc cgtgcccacc gtccccaccg tttccgccgg caccgccgtg 2343781 accgatcagc cccgcccgtc cacccaaacc gccatcgcct cccccgccgc cagcaccgag 2343841 gtctcccaca ccgttgtcac cggtaccgcc gactcccccg gcacccccgt tgcccagcag 2343901 caatccgccc aggccgccat tgcctcctgc accgccgggc gcaccgttgc cgcccctgcc 2343961 gccgttgccg atcagccccg ctgacccgcc ggctcccccg gcaaccccgg ggctcgtgct 2344021 gtcgccaccg ttgccgccgt tgccccacaa gatgccgccg tccccgccgg gctgtcccac 2344081 cggaccactg gccccgtcgg cgccgtcgcc gaccagcgga cgccccagca gcgtctgggt 2344141 gggcgcgttc accgcgttca gcaggttctg ctgcgcgttg gcaatctcgg cgctcgcata 2344201 tgaacctcca ccggcgttaa gcagttggac gaaccggtca tgaaacgccg ccgcctgggc 2344261 gctgaccgct tgatagctct gggcctgggc gccaaatagc gccgctatgc ccgccgacac 2344321 ctcatcggcg gcgggcgcca acgcccccgt cgtcgggacc gccgccgccg cgttcgctgc 2344381 cctgatggtc gaacggatgg ccgctaaatc cgtggccgca gccagcaatg cctccgggct 2344441 cgcaatcaca aacgacattg cgcacctccc accaacccgc gataacccgg ctgcgccgga 2344501 accgtcgatg cgtatggcag gaatatcgta ttgcgatccc ccaccctcag tcggggtgtt 2344561 cgccagattc gtcgcagctc agcgctgcgc cggcgccagc attggcgatg gctggtggtt 2344621 aacgcgagtg gtcgaaggtg atggccgggg cactgttcga accgtcgttc gccgcagcgc 2344681 acccagcggg gcttctcaga cgacccgtga cgcgaaccgt cgtgctgtcg gtggccgcta 2344741 ctagtatcgc acacatgttc gagatatcgc tgccggaccc gacggagctg tgccgatccg 2344801 atgatggcgc gctggtggcc gcgatcgagg actgcgctcg tgtggaggcg gctgcgagcg 2344861 cccggcggtt gtcggcgatc gccgagctga ccggccggcg caccggcgcg gaccagcggg 2344921 ccgactgggc gtgtgacttc tgggactgcg cggccgcgga ggtggctgcg gcgttgacta 2344981 tcagccacgg caaagcctcc ggacaaatgc atctgagcct tgccctgaac cggctgcccc 2345041 aggtggcggc gttgtttttg gccgggcatc ttggtgcgcg gcttttctcg atcatcgcct 2345101 ggcggaccta cctcgttcgc gacccgcacg cactgagtct gctcgatgcc gccctggccg 2345161 aacacgccgg cgcgtggggg ccgctgtcgg cccccaaact ggaaaaggcc atcgactcct 2345221 ggatcgatcg ctacgatccc ggggcgctgc ggcgcagccg tatctcggcc cgcacccgcg 2345281 acctatgcat cggtgatccc gatgaggacg ccggcaccgc cgcgctgtgg ggccggctgt 2345341 atgccaccga cgccgcgatg ctggatcgcc ggctcaccga gatggcccac ggcgtgtgcg 2345401 aggatgaccc gcgcaccctg gcccagcgcc gcgccgacgc gctgggcgcg ctggccgccg 2345461 gcgccgacca cctggcgtgc ggctgcggca agcccgactg cccctccggt gccggcaacg 2345521 acgagcgggc cgccggtgtg gtcatccacg tcgtcgccga cgcctcagca cttgacgcac 2345581 aacccgaccc acacctatcc ggcgacgaac ccccttcgcg gcccctcacc ccggagacga 2345641 ccctgttcga ggcgttgaca cccgaccccg aacccgatcc ccccgccacc cacgcgccgg 2345701 ccgagctgat caccaccggc ggcggtgtgg tgcccgcgcc gctgctggcc gaactcatcc 2345761 ggggtggggc caccatcagc caagtgcgcc atcccggcga tctcgcagca gagccgcact 2345821 accggccgtc ggccaagctg gctgaattcg tccggatgcg ggatttgacg tgccggtttc 2345881 ccgggtgtga cgtgcccgcc gagttttgtg atatcgacca ttcggcgccc tggccgttgg 2345941 ggccgacgca tccatcaaat ctgaagtgcg cgtgtagaaa acaccacctt ttgaaaactt 2346001 tctggacggg ctggcgggat gtgcagttac ccgatggcac ggtcatctgg accgcgccca 2346061 acggccacac ctacactacc catcccggca gccgcatctt ctttcccacc tggcacacca 2346121 ccaccgccga actaccccaa acatcaacgg cagcagtcaa cgtcgacgca cgcggcctga 2346181 tgatgccgcg acggcgccgg acccgagccg ccgagctggc ccaccgcatc aacgccgaac 2346241 gcgccctcaa cgacgcgtac atggccgaac gcaacaagcc accatcgttc tgatgggcgg 2346301 ctattcccac ctcatgtcaa acaccccttc tggatgtcac gccccttctg gacaccaccg 2346361 acgagttctc gtgtcgccgc acctatccaa gaagaccaac cgctacgatc ggtcgatgtc 2346421 gcggcgccgc agtcgacgca ggagaaccgc gaaacgtgcc ggccgctccg tcgacaagag 2346481 agaaggactg catgctggtt ttgcacggct tctggtccaa ctccggcggg atgcggctgt 2346541 gggcggagga ctccgatctg ctggtgaaga gcccgagtca ggcgctgcgc tccgcgcggc 2346601 cacacccgtt cgcggcgccc gctgacctga tcgccggcat acatccgggc aaacccgcaa 2346661 ccgccgtttt gctgttgccg tcgttgcgat cggcgccgct ggactcgccg gagctgatcc 2346721 ggctcgcccc gcgcccggcc gcgcgaaccg atccgatgct gttggcgtgg acggtaccgg 2346781 tggtggacct ggaccccacc gcggcgttgg ccgccttcga ccagcccgcc cccgacgtcc 2346841 gctacggcgc gtccgtcgac tacctggccg agctggccgt tttcgcgcgc gagttggtcg 2346901 agcgtggtcg cgtgctgccc cagctgcgcc gcgacaccca cggcgcggcc gcctgctggc 2346961 gtccggtgtt gcagggacgc gacgtggtcg cgatgacctc gctggtctcg gcgatgccgc 2347021 cggtctgccg cgccgaagtt ggtgggcacg acccgcacga actggcaacc tcggctctgg 2347081 acgcgatggt cgacgccgcc gtgcgcgcgg cgctgtcacc gatggacctg ctgcccccgc 2347141 gacggggtcg ctccaaacgg catcgggccg tggaggcttg gctgaccgcg ttgacctgcc 2347201 cggacggccg gttcgacgcg gagcccgacg aactcgacgc gctggccgag gcgttgcggc 2347261 catgggacga cgtcggtatc ggcaccgtcg gcccggcgcg ggcgacgttt cggctgtccg 2347321 aagtcgagac cgaaaacgag gagacgcccg cgggctcgtt gtggaggctg gagttcttat 2347381 tgcagtcgac gcaggacccc agcctgctgg tccccgccga gcaggcatgg aacgacgacg 2347441 gcagcctgcg ccgctggctg gaccggccgc aggagctgct gctgaccgaa ctgggccggg 2347501 cctctcggat tttccccgag ctcgtcccgg cgctgcgcac cgcgtgcccg tccgggcttg 2347561 agctcgacgc cgacggcgcc taccgattcc tgtcgggtac ggccgcggtg ctcgacgagg 2347621 ctgggtttgg cgtgctgctg ccgtcctggt gggaccgccg ccgcaagctg ggcttggtcc 2347681 tgtccgcata taccccggtc gacggcgtgg tgggcaaggc cagcaagttc ggccgcgagc 2347741 agctcgtcga gttccgctgg gagctggccg tgggcgacga tccgctcagc gaggaggaga 2347801 tcgcggcgct gaccgaaacc aagtccccgc tgatccggct gcgtggccag tgggtggcgc 2347861 tcgataccga acagctgcgc cgcgggctgg agtttttgga gcgtaagcca accggccgca 2347921 agaccaccgc cgagatcctc gcgctggccg ccagccaccc cgacgacgtg gacaccccgc 2347981 tcgaggtcac cgccgtacgc gccgacggct ggctcgggga cctgctcgcc ggggccgccg 2348041 cggcgtcgct gcagccgttg gacccgcccg acggattcac cgcgacgctg cgtccctacc 2348101 agcagcgcgg tctggcgtgg ctggcgtttt tgtcctcgct cggtttgggc agctgcctgg 2348161 ccgacgacat gggcctgggc aagacggtgc agctattggc cctggaaacc ttggaatccg 2348221 ttcagcgcca ccaggatcgc ggcgtcggac ccacactgct actgtgcccg atgtcgttgg 2348281 tgggcaactg gcagcaggaa gcggccaggt ttgcacccaa cctgcgggtg tacgcccacc 2348341 acgggggcgc ccggctgcac ggcgaggcgt tgcgcgacca cctcgagcgc accgacctgg 2348401 tcgtgagcac ctataccacc gccacccgcg acatcgacga gctgtcggaa tacgaatgga 2348461 accgggtggt gctggacgag gcccaggcgg tgaagaacag cctgtcccgg gcggccaagg 2348521 cggtgcgacg gctacgcgcg gcgcaccggg tcgcgctgac cgggacaccg atggagaacc 2348581 ggctcgccga gctgtggtcg atcatggact tcctcaaccc gggcctgctc ggatcctccg 2348641 aacgcttccg cacccgctac gcgatcccga tcgagcggca cgggcacacc gaaccggccg 2348701 aacggctgcg cgcatcgacg cggccctaca tcctgcgccg gctcaagacc gacccggcga 2348761 tcatcgacga tctgccggag aagatcgaga tcaagcagta ctgccaactc accaccgagc 2348821 aggcgtcgct gtatcaggcc gtcgtcgccg acatgatgga aaagatcgaa aacaccgaag 2348881 ggatcgagcg gcgcggcaac gtgctggccg cgatggccaa gctcaaacag gtgtgcaacc 2348941 accccgccca gctgctgcac gatcgctccc cggtcggtcg gcggtccggg aaggtgatcc 2349001 ggctcgagga gatcctggaa gagatcctgg ccgagggcga ccgggtgctg tgttttaccc 2349061 agttcaccga gttcgccgag ctgctggtgc cgcacctggc cgcacgcttc ggccgtgccg 2349121 cccgagacat tgcctacctg cacggtggca ccccgaggaa gcggcgtgac gagatggtgg 2349181 cccggttcca gtccggtgac ggcccgccca tttttctgct gtcgttgaag gcgggcggta 2349241 ccgggctgaa cctcaccgcc gccaatcatg ttgtgcacct ggaccgctgg tggaacccgg 2349301 cggtcgagaa ccaggcgacg gaccgggcgt ttcggatcgg gcagcggcgc acggtgcagg 2349361 tccgcaagtt catctgcacc ggcaccctcg aggagaagat cgacgaaatg atcgaggaga 2349421 aaaaggcgct ggccgacttg gtggtcaccg acggcgaagg ctggctgacc gaactgtcca 2349481 cccgcgatct gcgcgaggtg ttcgcgctgt ccgaaggcgc cgtcggtgag tagcacctgg 2349541 tatccaccac cgtcccggcc ccgtccggtc gagggtggga tcaaggcgcg cagcacccgc 2349601 ggcgcgatcg cgcagacctg gtggtcggag cggttcattg cggtgctgga ggacatcggc 2349661 ctgggtaacc ggctgcagcg tggccgcagc tatgcgcgca aggggcaggt gatctcgctg 2349721 caggtggatg ccggcttggt caccgcgctg gtgcagggca gccgggcccg gccgtaccgg 2349781 atccgcatcg ggattccggc gttcggcaag tcgcaatggg cgcacgtcga gcgaaccctg 2349841 gccgaaaacg cttggtacgc agcaaaattg ctgtccggcg aaatgcccga agacatcgag 2349901 gacgtcttcg ccggcctggg cctgtcgcta ttccccggca ccgcccgaga gctatcactg 2349961 gactgctcct gccccgacta cgcggtccca tgcaagcacc tggccgccac cttctacttg 2350021 ctggccgagt ccttcgacga ggatccgttc gccatcctgg cgtggcgtgg ccgcgagcgg 2350081 gaggatctgc tggccaacct ggccgctgcc cgcgccgacg gagcggcacc ggccgccgac 2350141 cacgccgaac aagtggccca gccgctcacc gactgcctag accgctatta cgcccggcag 2350201 gccgacatca atgtccccag cccgccggca accccatcga cggcattgct cgaccagctg 2350261 cccgacaccg gactcagcgc ccgcggacgg ccgctgaccg agctcctgcg acccgcctat 2350321 cacgccctga cgcaccatca caacagcgcg ggcggctgat cccagcgcac cccttcgaat 2350381 cggccgaagt cactgtcgta ggacacgatg ctggcgcgat gctcgacggc aagcgcggcc 2350441 agatgcgcgt cgttgaccag gttggcaccg gttcccacgt acgtcagcat tctcgccagg 2350501 atatcggcgt gccggacggt cggattcacc aagacggcgc tgggtgcggc tagccaatcc 2350561 gcgacctggg tgatggccgc ctcccgcgga agcggacggg ggaacaaccc caccttggtc 2350621 gccaatcgca cgaacgccaa caacggcacc caggcgaacc cgacgcggtc ggcgcccgac 2350681 agcgcaccgt caagccagcg cagcgacggc ttgtggtgct cacttgtggt gttcacggcg 2350741 tagagcaaga cgttcgcgtc gacgatcttc atcaaccgct atgacccgcg gcgttgacgg 2350801 cgcacaagct cttcgtcctc gaggtcggcc gcaagctgca aggcccggtc gaggttgacc 2350861 gcagggacgc ccaagtctgc cgtgcgggtg ctgaagtgac tcggcgcagg tcgcccggag 2350921 gcgccgtcgc gaatcgcgtc gttgagggcc ttcttgaagg acacttgccg ctcggccatc 2350981 cggcgcctta ccaactgctc gacgtcgtca tccaatgtga cagtcgtccg cattttgata 2351041 gcatagcatc aagattgtcg acagcatctc gtcaatcggc gcgcgggccc gtcactaatc 2351101 cggcgattcg ccgtcggact gggagtcttt ggcgcccgtg gaaccccttt gtgtcccttg 2351161 gcatctttgc gatccagttc ccgcagccgt tttcccaacg cggcacccgc gatgcgccga 2351221 aacgcgcgcc gtagccggtc ggcgtcgagc acggccacct ccagggtggc caggggtggg 2351281 ttgagcacca cggtcgtgaa cgtcattcgc ggtagcccga ctcggcgacc tcgagcagtc 2351341 gacacgcctt ctgcacggga agtccttctg cggccatcgt tgctatggcc gcttactgcc 2351401 ttctagtccg tgcggctctc gcaacagctc acgggacctt tttgaggatc gccacttcag 2351461 gtcttcaact cgcggatgcc ctcattggca acgtttgcgc ctgccttggg gcggccggca 2351521 gccaccaagt cgagcacttt gcggcggaac tactcggggt aacacttcgg cacggacacg 2351581 gctcgttcga cggacgtcgt gaccagaagt cgagcaaacc gactccactc tagctagtga 2351641 tacaagcttt tttgtagccg cgcgattggg ccgcccgtaa ggaatgcgtc atgagcgact 2351701 tcgcatcacg ggcgaccaat cattaatttg tcaaaccctt tgacatgcac tacttgtcca 2351761 cattttgtac acgaaatacc taacacacta tggtgcacat cacgcacttc cacgttccgt 2351821 attcggtgta cgattttgtc acgcaactaa gcgttcaaga gggagtacta tgactcatcc 2351881 aaaagtaaaa gatgacatag aaatagaaga gtcgtggttc cggtgcgggt agctcccgat 2351941 ggcttgactg tggtaagcac cagtggcgtg ttccccgtgg ttgagaccag gaagttttaa 2352001 agtcctacag cccgcggtat tccgcagagg acattgtgtg catttcgcac cttcgggtgg 2352061 gagaaatcgg gatgatctca ccaccggcca ccggtgggcg cactttgtac ccttcgattc 2352121 cgttattcgg cggatttaag cagttcgcac cattaccaag cagccaatga ggaagagcgc 2352181 aggtgactag gtcgcttgat ctttccctgt gcagtagctc gggttctttg agtttcgagg 2352241 aggagaaacc acatgtcctt tgtgaatgta gacccatttg ggatgttggc ggcagctgcg 2352301 acactggagt cccttggttc ccacatggcg gtaagcaatg ccgcggtggc ctcggtgacc 2352361 accaaggttc ctcccccggc cgccgactac gtatcaaaaa agttatcgct gttctttagt 2352421 agccacgggc agcagtacca ggtgcaagcc gctcggggca cggcctttca tcgaaaattg 2352481 gtccggaccc tggcgaatgg cgcgcttgcg tatgaggaag tcgagatcgc caacaacgaa 2352541 ggtttctaac gtgtcgccag ttacgcacga gtggctacca gcgagtacaa gggagtaacg 2352601 aattatgccc aatttctggg cgttgccgcc cgagatcaac tccacccgga tatatctcgg 2352661 cccgggttct ggcccgatac tggccgccgc ccagggatgg aacgctctgg ccagtgagct 2352721 ggaaaagacg aaggtggggt tgcagtcagc gctcgacacg ttgctggagt cgtatagggg 2352781 tcagtcgtcg caggctttga tacagcagac cttgccgtat gtgcagtggc tgaccacgac 2352841 cgccgagcac gcccataaga ccgcgatcca gctcacggca gcggcgaacg cctacgagca 2352901 ggctagagcg gcgatggtgc cgccggcgat ggtgcgcgcg aaccgcgtgc agaccacagt 2352961 gttgaaggca atcaactggt tcgggcaatt ctccaccagg atcgccgaca aggaggccga 2353021 ctacgaacag atgtggttcc aagacgcgct agtgatggag aactattggg aagccgtgca 2353081 agaggcgata cagtcgacgt cgcattttga ggatccaccg gagatggccg acgactacga 2353141 cgaggcctgg atgctcaaca ccgtgttcga ctatcacaac gagaacgcaa aagaggaggt 2353201 catccatctc gtgcccgacg tgaacaagga gagggggccc atcgaactcg taaccaaggt 2353261 agacaaagag gggaccatca gactcgtcta cgatggggag cccacgtttt catacaagga 2353321 acatcctaag ttttgattcg ggaacatcct aagaaacggg gggcgtcgcc gttggagacg 2353381 tcgcaacgtg tccgcagtcc caagggcaac agtgaagggc ccacggtgcg atccccaaca 2353441 cccggctaga gtgcgcatat attttcccgc ctcggctcaa ggcgtgcacc cccatcaccg 2353501 ctaaccatgc tgtgtatcaa cagatttcat tgtcccggcc gtcgcgcgac cgaccaatag 2353561 ggtgagttcc atgtgcgata tcgcctaaca gccggctccc gtactcccgt ggccgatgtg 2353621 attattgatt acgtggatca ccatgtgggt gatcgcggtc gacagctttg gtaccgagca 2353681 catcgccaca acgcgcggta cgaatctagt acacaaatcc gcaccagccg ccatgcgact 2353741 tcgcaggtca tagccccgca gagtcgccga acctgccgca gtgacaaaag tcaggacggc 2353801 cggcgacgcg tcgagccggg gttaggcgca gttaacgtcg cagcggggtc ccagacacgc 2353861 gtcggacttt cggactcagc ccgacgattc gccgtcagac tgcgggcttt cctggtctac 2353921 cagcaacgct tgcagggcgg agccggtgat gcgccggaac gcgcgccgtg gccggttggc 2353981 atcgagaacg gccacctcta agctggccac gccaagggtg ggttgatcac cacccgaggt 2354041 gtcggcactg ccggcccgca atgcagcgac cgcgataccc agggcgtcgg tcaggctggc 2354101 gttctcggca tacgactctt tgagcgcgtt ggcgatcggc tccgtggtgc cgcccatcac 2354161 cacgaaatgc ggctcgtcgg cgatcgaccc gtcgtaggta atacgataca actcaggggg 2354221 tttcgtctcg ccgtaatgcg ccacctcggc cacacacaac tcaacctcgt agggcttggc 2354281 ctgttcggtg aagatggtgc ctagagtctg cgcgtagaca ttggccaact gccgacccgt 2354341 gacgtcacga cggtcatagg cgtaaccgcg ggtgtcggcg aactggatcc cgccgcggcg 2354401 caaattgtcg aactcgttga acttgcccgc agccgcaaaa cccacccgat cgtagagctc 2354461 actgatcttc tgcagcgacc gcgacggatt ctccgcgacg aacagcacac caccggcata 2354521 ggccagcgcc accacgcttt tggcccgcgc aatgccctta cgcgccaact cgctgcgctc 2354581 gcgcatcgcc tgctcaggcg agatgaaata cggaaaactc acttctcacc gccatcggag 2354641 ccgaaagtat ccgcacccga acggctttcg atgatcgcgc gggccaattc ggcaatccgg 2354701 ctctccggca cgtcaaccgc cccgtcggcg tcgatgatca ccgccgtcgg aaagatgccc 2354761 cgcaccaggt ccggaccgcc ggtggcggag tcgtcgtcgg cggcgtcgta gagcgcctcg 2354821 accgccaccc gcagccccga atcaccgtcg gtaacctgcg aatacaactt cttcatcgac 2354881 gacttcgcga acagcgaacc cgagcccacc gcctgatagc cctcttcctc gatgttccaa 2354941 ccgccggcgg cgtcgaacga aacgatacga cccgcgctct gcgggtcaga cgcatgaatg 2355001 tcgtagcccg ccagcaacgg caacgccagc agaccctgca tcgcggccgc cagattgcca 2355061 cgcaccataa tcgccagccg gttgattttg ccggcaaacg tcagcggcac accctcgagc 2355121 ttctcgtagt gctcaagttc cacggcatac agccgggcaa actcaaccgc gaccgcagcc 2355181 gtgccagcga tgccggtagc ggtgtagtca tcggtgatat acaccttgcg cacatcacgc 2355241 ccagaaatca tgttgccctg cgtcgaacgc cggtcacccg ccatgacaac accgccgggg 2355301 tatttcagcg cgacaatggt ggtgccgtgc ggcagttgcg catcgccgcc tgcgagtggc 2355361 gcaccgccgc tgatgcttgc cggcagcaac tccggcgcct ggcggcgcag gaagtcagtg 2355421 aaagaagata ggtctacagc gggtgttcca gagagtgaat taatggacag gcgatcgggc 2355481 aacggccagg tcactgtccg cccttttgga cgtatgcgcg gacgaagtcc tcggcgttct 2355541 cctcgaggac gtcgtcgatt tcgtcgagca gatcgtcggt ctcctcggtc agcttttcgc 2355601 gacgctcctg gcccgcggcg gtgctgccgg cgatgtcgtc atcatcgccg ccgccaccgc 2355661 cacgcttggt ctgctcttgc gccatcgccg cctcctgctt cctcatggcc tttcaaaagg 2355721 ccgcgggtgc gcgtcacacg cccgctgtct ttctctaccc taccggtcaa caccaacgtt 2355781 tcccggccta accaggctta gcgaggctca gcggtcagtt gctctaccag ctccacggca 2355841 ctgtccaccg aatccagcaa cgcaccaaca tgcgccttac taccccgcaa cggctccagc 2355901 gtcgggatgc gaaccagcga gtcgccgccc aggtcgaaga tcaccgagtc ccagctagcc 2355961 gcggcgatat cagccccgaa ccggcgcagg cattcgccgc ggaaatacgc gcgggtgtcg 2356021 gtcggcgggt tctccaccgc actcagcacc tggtgttcgg tgactaaacg cttcatcgag 2356081 ccgcgcgcga ccagccggtt gtacaggccc ttgtccagcc ggacatcgga gtactgcagg 2356141 tcgacgaggt gcagccgggg cgccgaccag ctcaggttct cccgctgccg gaaaccgtcg 2356201 agcagccgca gtttggccgg ccagtccagc agctccgcgc aatccatcgg gtcacgctcg 2356261 agctgatcca gcacgtgtgc ccaggtttcc acgatgtcgg ccgcccgcgg gtccgggtcg 2356321 cggctatcca ccaacttagc cactcggtcc aggtagatcc gttgcagcgc aagaccggtc 2356381 agttcccggc cgtcggccag cgcaacggtc gctcgcagcg acggatcgcg ggagattgcg 2356441 tgcaccgcat gtaccgggcg ggccagcgcc aggtcggtca gatctattgc gtgggctggt 2356501 ccttcttcga tcaggtcgag caccagcgcc gtggtaccca acttcagata ggtcgacgtc 2356561 tcggcaaggt tggcgtcgcc gatgatgacg tgcagccggc ggtacctgtc ggcgtcggcg 2356621 tgcggttcgt cgcgggtgtt gatgatgccg cgcttgagcg ttgtttccag ccctacctcg 2356681 acctcgatgt agtccgaacg ctgggatagc tggaagccgg gctcatcacc cgagggcccg 2356741 atgccgaccc ggcccgagcc ggtcaccacc tgccgggata ccagaaaggg ggtcagcccg 2356801 gtgatgatcg ccgagaacgg tgtctgccgc gacatcaggt agttctcgtg cgacccgtag 2356861 gaggctccct tgccgtcgac gttgttcttg tacagctgca gtttcgcggc cccgggcacg 2356921 ctggcgacat ggcgggcagc ggcctccatc acgcgttcgc ccgccttgtc ccagatcact 2356981 gcgtccagcg ggtcggtgca ttcgggcgcg gagtattccg ggtgcgcgtg gtcgacatac 2357041 agccgcgccc cgttggtcag gatcatgttg gccgcgccga cctcgtcggc gtcgaccacc 2357101 ggcggcggcc cggccgagcg actcaaatcg aagccccggg cgtcgcgcag cggcgattcc 2357161 acctcgtagt cccaacgggt gcgtttggca cgctgaatgc cggcggcggc ggcgtatgcc 2357221 agcaccgcct gcgtcgaggt gaggatcggg ttggcggtcg ggtccgacgg cgaggaaatg 2357281 ccgtactcga cctccgttcc gataatccgc tgcatgccgt agagcctagg cccgccgacg 2357341 atgcgggccg cgcagcgggc cgctgaggag gcgggcatca agcaacgccc gccgacccag 2357401 aacatcggag cgggccgcgc aggaggtgga caatcaagca gggcccggcg ctagggtagg 2357461 ccggcatgag cctttccgtc cgtcgccccc cggcggcccg agcagcggcc attgtggagg 2357521 ctgaaagctg gttcttgaag cgtggtctgc cctcggtgct gaccatgcgg ggccggtgcc 2357581 gtcggctgtg gccgcggtcg gctccgatgt tggccgcctg ggcggtggtc gagggctgcc 2357641 tcatggccgt cttcttcgtc accgacggcg gcgaagtctt catcagcgcg acgccgacga 2357701 cagcgcaatg ggtgatcctg gcgctgctcg cggttgctct tccgctggcc tccctcgtcg 2357761 gctggttggt gtcgcagata tcaagcgggc gtggccaagc ggcggtggcg accatggcgg 2357821 tggccttcgc ggccgcatcc gacgtcatcg aatccggccc gatccagctg ttgcggaccg 2357881 ccgtcgtggt gggcctggtg ctgctgcaga ccggctgcgg cgtcgggtcg gtgcttggct 2357941 gggcggtgcg gatgacgctg gagcaccttg cgacggtcgg cacgctggcg gtccgggccc 2358001 tgccgatcgt gctactgacg gcattggtgt tcttcaacac ctatgtctgg ctgatggccg 2358061 ccaacatcaa cggcgagcgg ctgccgctgg cgatggtttt tctgctcgcc atcgccgggg 2358121 cgttcgtcgt gtccaagacg gtggaacggg tgcgtccgct gcttcgctca acgacggtga 2358181 tgccccaagg cagccaaagc ctggccggca cacccttcgc gaccatgggc gacccctctc 2358241 ccggcttccc cctcacccgg gccgaacgcc tcaacgtggt cttcctgctg gcggccttgc 2358301 aactcgtcga gatcctggta gtggcgtcgg tcggcgccgc gatatacctc gttctgggca 2358361 tgatcattct cactccgccg ctgcttcggg aatggacgca ctacgattcg atgaccacga 2358421 cggtgctcgg catgacgttc ccggcgccgg attcgctcat ccgtatgtgt cttttcctgg 2358481 gcgcgctgac gttcatgtac atcagcgccc gcgcggtcga cgacgccgag taccgcgcga 2358541 tgttcctcga ccctctgatc gacgacctgc acaccgcgct gctcgcgcgc aaccgctatc 2358601 gcaacaacgt ggtgaccgcg ccgtgcgccg gtgttgacgc cggtcacgtc gatgactagg 2358661 ttcaccctga tgtcggctcc cgaacgggta accggcttgt ccgggcaacg ttacggggaa 2358721 gtccttctcg taacacccgg ggaggccggt ccacaggcca ccgtttacaa cagcttcccg 2358781 cttaacgatt gtccggccga gctgtggtcc gcgctcgatc cgcaagccct agccaccgaa 2358841 cacaaagcgg ccaccgccct gctcaacggt ccgcgctatt ggttgatgaa cgccatcgag 2358901 aaggcgcccc agggcccgcc ggtgacgaag accttcggcg ggatcgagat gctccagcag 2358961 gccacggtgc tgctgtcatc gatgaaccct gccccataca ccgtcagcca ggtcagccgc 2359021 aacacggtct ttgtgttcaa cgccggcgaa gaggtctacg aactgcagga ccccaaggga 2359081 cagcgctggg tgatgcagac gtggagtcaa gtggtggacc ccaacctgtc ccgagccgac 2359141 ctgcccaagc tgggtgaacg gctcaacctg ccagccgggt ggtcctatca tacccgcgtg 2359201 cttaccagcg agttgcgggt cgacactacc aaccgggagg cccgcgtcct gcaagacgac 2359261 ctcaccaaca gctactcgct ggtgaccgcc tgagccctac aggtactggc cgaggttgga 2359321 ctcggtatca atagccctgc tggccgacga actctttccg gtgaccaggg tgcggatgta 2359381 gacgatccgc tcccccttct tgcccgagat ccgcgcccag tcatcggggt tggtggtgtt 2359441 gggcaaatcc tcgttctcgg cgaactcgtc gacgatcgaa tcgagcagat gctgtatacg 2359501 cagtcccggt tggccggtct ccagcaccga tttgatggcg ttcttcttgg ctcggtcgac 2359561 gacgttctgg atcatcgccc cggagttgaa gtccttgaag tacatgactt ccttgtcgcc 2359621 gttggcatag gtgacctcca ggaaccggtt gtcgtcgatc tcggcataca tccggtcgac 2359681 aaccttctcg atcatcgcct tgatgcaggc cgaacggtca ccgtcgaact cggcgagatc 2359741 gtcggcgtgc accggcaaga actcggtcag gtacttcgag tagatgtcct gcgccgcttc 2359801 ggcatcaggc cgctcgatct tgatcttcac gtcgaggcgc ccgggccgca ggatggcagg 2359861 gtcgatcatg tcctctcggt tggaggcgcc gatcacgatg acattctcga gtccctccac 2359921 cccgtcgatc tcgctgagca gctgcgggac caccgtggtc tcgacgtccg aggaaacgcc 2359981 ggtgccacgg gtgcgaaaga tcgagtccat ctcgtcgaaa aacacgatca ccggagtgcc 2360041 ttccgacgcc ttctcgcggg cccgttggaa gatcagccgg atgtggcgtt ccgtttcccc 2360101 gacgaatttg ttcagcagct cggggccctt gatgttgagg aagtacgact tcgcctcgtg 2360161 ggcatcgtcg ccgcggacct cggccatttt cttggccaac gagttggcca cagccttggc 2360221 gatcaacgtc ttaccacagc cgggtgggcc atagagcaac acacccttgg gcgggcgcag 2360281 cgagtactcc cggtacaact ccttgtgcag gaacggcagc tccacggcgt cgcggatctg 2360341 ctcgatctgg cggctcagac cgccgatgtc ggcgtagctg acgtccggca cctcttccag 2360401 caccaggtct tctacctcgg ctttggggat gcgttcgaag gcatagccgg ctttggtgtc 2360461 gaccagcagc gagtcgccgg ggcgcagctt gcgcggccgg gtgtcatcgt tgagggcctc 2360521 agggaggccg tctggcaggt cctcggcgat caggggatca gccagccaaa caacgcgttc 2360581 ctcgtcggcg tggccgacga ccagagcccg atgaccgtcg gccaggatct cgcgcaaggt 2360641 ggatatctcg ccgaccgcct cgaatgtgcc ggcctccacg acggtcaggg cctcgttgag 2360701 ccggaccgtc tgccccttct tcagcgatgc agcgtcaata ttcggtgagc acgtcaggcg 2360761 catcttgcga cccgatgtga acacatcgac cgtgtcgtcg tcgtgcgtgg ccagcaggac 2360821 gccgtagcca ctgggcggct gccccagccg gtcaacttcc tcgcgcagcg ccagcagttg 2360881 ttgacgggct tctttaagag tttccattaa tttggaattg cgggcagcaa gtgagtcgat 2360941 acgggcttcg agttgatgta tatcgcgggc agagcgcgtc ggggcatgtg atccgacggc 2361001 gttctcaagt tgctcgcgca ggaccgcagc ctcgcgccgc agctgttcta attcggcggc 2361061 atcaccactg gacagcgggc tatcccgggg gatgccgaat gcctcagaac gctctgactc 2361121 acccatgttg cgctcctttc ccacgccagg aatcgcgcgg cggatactcc aacgctaccg 2361181 gcgatcggcg cttcatgttg gcagtcgaat gccgatggaa agtaacaact tgtatcgctg 2361241 gtaatctcgg ccccgaatcc ggccgatcaa acggaccttg agaggagcac tgtgaccgcg 2361301 aaatccctag ccacaggcgt agtgggcgac gcggcgatca gtgcggcggc cgccgccgag 2361361 acttctgctg cattcgcaag cggccggtag ccgagcgtgt cgctggatgc gccgaaacat 2361421 ccgtgtaacc ctgggcgccg ccaccatcgt ggcggcgtta gggctctccg ggtgttcaca 2361481 ccctgagttc aagcgttcgt cgccgcctgc cccgtcactg ccgcccgtca cgtcgagccc 2361541 gctcgaggcc gcgccgatca cgcccctgcc cgcacccgaa gccctgatcg atgtgctgtc 2361601 ccggctcgcc gacccggccg tgccgggcac caacaaggtg cagctcatcg agggcgcgac 2361661 ccccgaaaac gccgctgccc tggacaggtt caccaccgca ctgcgtgacg ggagctactt 2361721 gcccatgacc ttcgcggcca acgacatcgc atggtcggac aacaagccgt ccgacgtgat 2361781 ggccaccgtc gtcgtcacca ctgcccatcc ggacaaccgc gagttcacgt ttcccatgga 2361841 attcgtgtcc ttcaagggcg gctggcaatt gtctaggcag accgcggaaa tgctgctggc 2361901 catgggtaac tcaccggatt cgactccgtc ggctaccagc ccggcgccgg ccccatcacc 2361961 gactccccct ggctgagctc ccgatgtgga ttggctggct ggaattcgac gtgctgctgg 2362021 gcgacgtgcg ctcactcaag cagaagcggt cggtgacccg ccccctggtc gccgagttgc 2362081 agcgcaaatt cagcgtgtcg gccgccgaga ccggttcgca tgatctgtac cggcgggcgg 2362141 gcatcggtgt ggccgtggtg tccggtgacc gcagccacgc cgtcgatgtc ctcgacaacg 2362201 ccgaacgtct ggtagccgca catccggagt tcgagttgct gtccgtgcgc cggggcctgc 2362261 accgcactga cgactaagtg gactggctcc cagctgtgtc tcccgctacc cgtcgcgtcc 2362321 ctcgcgctta cgacctagcg gcgccggagc cacagccccc ggcgccaacc ggcgcgttgc 2362381 taccaggaac gcggtatgcc cgcgcatcga atgctgcggc cgaaccgcca accctacgac 2362441 gttccagccc cgctgcagcg tctcccaggc tctcggttcg gtccagcact gcttggcccg 2362501 cagtgcctcc acgatcctcg acagctgagt gacggtggcc acgtagacca tcagcactcc 2362561 gccggcgacc agcagccgcg ataccgcgtc gagcacctcc cacggcgcca gcatgtcgag 2362621 cacggcccga tcaacggatc cgtcgggcag ttcggagtcg gcgaggtcgc tgacgaccag 2362681 tcgccagttg tccggcggct ggccgtagca gccgctcaca ttgcgccggg cgtgttcggc 2362741 atgatcggcg cgctgttcgt aggagatcac ctgtccggcc ggcccaaccg cccgcagcaa 2362801 agacaaggtc agagcaccgg atccggctcc tgcctccagc acccgcgcgc cgggaaatat 2362861 gtcgccctca tgcacgatct gggccgcatc tttgggatag atcacctgcg ggccgcgcgg 2362921 catcgacatg acgtagtcga ccagcagcgg gcgcagcacc aggaacaggg cgccgttgct 2362981 ggatttgacc acgctgcctt gctccaaccc gatcaccgcg tcgtgggcga tcgagccacg 2363041 atgagtgtgg aattcggcac cgggagtcag cgacatggtg tagcggcgcc ccttagcgtc 2363101 ggtgagctga acacgttcgc cgatgctgaa tgggccggtt gctgacacgc cgtctagcgt 2363161 gccagccgac tcgccgcgat cggtgctcgg ggttgtcggc gcccaaccct aagctgcgga 2363221 catggccgac cagccggacc cgcccacacc acggccggcg ttatcaccgt cacgggcgac 2363281 ggacttcaag caatgcccgc tgctataccg gtttcgcgcg atcgaccggc tacccgaggc 2363341 gacgtcggcg gcgcagttac ggggttcggt ggtgcacgcc gcgcttgagc agctctatgg 2363401 gctacccgcg gggctgcgca gcccggatac tgcgaggtca ctggtgcagc gcgcttggga 2363461 ccagatggtc gccgcggagc ccgaactggc cggcgaactg gaccccggac aaccaaccca 2363521 gctgctggag gacgcccgcg cgttggtgtc cggctactac cggctggaag acccgactcg 2363581 gttcgacccg caatgctgcg aacagcgggt ggaggtcgaa ctggccgacg gaactctgtt 2363641 gcgcggctac atcgaccgca ttgacgtcgc cgccaccggc gagctgcggg tggtcgacta 2363701 caagactggc aaggcgccgc cggcggcgcg ggcgttggcg gagtttaagg cgatgtttca 2363761 gatgaagttc tacgcggtgg cgctatttcg gtcgcgcggc gtgccgccca cccggctgcg 2363821 gctcatctat ctggccgacg gccagctgct cgactattca ccggaccgcg acgagctatt 2363881 gcgtttcgaa aagacgttga tggcgatttg gcgtgctatc caatccgcag gcgagacagg 2363941 cgatttccgc cccaacccat cgcggctctg cgattggtgc ccgcatcaac agcgctgccc 2364001 ggccttcggc ggaacaccac cgccctatcc agggtggccc accgagccgg cggcataaac 2364061 gatcgcgtcg aagtgcggtg tcatagggcc gccgcggcgg cgacgatggc aaacccgccc 2364121 aacaccgcga ccgaatcctc gagcagcgcg atcggcaggt cgtggccgcc acgggcagcc 2364181 accagcctcg tacgtgcctg atagccgccc atggtgccga gcacggcgcc gataacccca 2364241 gcgccaagcc cgccccaccg gtagccccac gcggtgccga tgaccgcgcc ggcgaacgcg 2364301 cccaaaatga tccggacagc gaacaccggc gtcacggtac gcggcggtgt tttgggacgt 2364361 ttgtcgttaa cgagttcggc gaccgcaaga acgctgacga tcaccacggt cacgaaattg 2364421 cccatccagg atgcccaggt tccatgcagg ttgatccagc cgagaaaggc ggcccaggag 2364481 accacggccg gggccgtcag ggaacgcaac ccggcgacga caccgataag cagcgccagc 2364541 agcagaacaa ggacatgcgt cacagcgatc cctcctgaga cagacgttat gggcaatcag 2364601 gccccagcgg acgctaacac agcgtgggcc ccgccacagg atcagaatcg gcagaacctg 2364661 atgtccgacg ccagaatcgc tttggccccg atcgcagcga gctcatccat gatgccgttg 2364721 acgtcccggc gcggcaccag ggcgcggatt gccacccagt ccgggtcggc cagcggggcg 2364781 atggtcggtg actccagccc cggcgtgatc gccgtggcct tcttcaacgc cgagcgcggg 2364841 caatcgtagt cgagcatcag atactgctgg ccgaagacca ccccctgcac ccgagcgacc 2364901 agttgatcgc gcgcctcggt ctggtcttgg ccgtccgtac cggcccgctc gatgagcacc 2364961 gcctccgaat cgcacagcgg ctcaccaaag gccaccaggt cgtgctggct cagcgtgcga 2365021 cccgacccca ccacatcggc gatggcatcg gccaccccga gctgcaccga gatctccacg 2365081 gcaccatcaa gtctgatgac cgttgcttcg attcccttgg tggccagatc tttccggacc 2365141 agattcgggt aggcggtggc gatccgcatc ccggctaggt cggcagtcgt ccagttccgc 2365201 ccggcgggag cggcatagcg gaagctggac gacccgaagc ccagcgccag gcgttcccga 2365261 acctgtgcac cggaatcgca caccaggtcg cgtccggtga tcccgaagtc gagctctccc 2365321 gaaccgacat atatggcaat gtctttgggc cgcaagaaga agaactcgac gttgttgacc 2365381 ggatcgatga cggtcaagtc tttggaatcg gtgcggcggc ggtagccggc ctccgcgagg 2365441 atctcggtgg ccggctcgct cagcgcaccc ttgttgggaa ccgcgacccg cagcatgctc 2365501 acagctttcg atagacgtcg tcgagggaca gtccacggga gatcatcagc acctgcgtcc 2365561 agtacagcaa ctggctgatc tcctccgcca gtgcgtcgtt ggattcgtgc tcggcagcca 2365621 gccacacctc gccggcctcc tcgagaagct tcttacccag agcatgaacc ccgccgtcca 2365681 atgccgccac cgtggtgctg tcggccggcc gggtgcgggc acgatcgccg agttcggcga 2365741 acagatcctc gaaggtcttc acggccagcg attgttgcac gtgtcagcca gccaagtcac 2365801 ggtggtttga cgccacacgt tcgccaccgc cgcgccgcgc attagggcat cctaatatag 2365861 gttaggctac cctagttatt cctgtggtcg aaggaggcag ccgaacgtga ccttcccgat 2365921 gtggttcgca gttccgccgg aagtgccgtc agcatggctg tccaccggca tgggccccgg 2365981 tccgctgctg gccgcgcaat acaccgaaat tgcaacggaa ctcgcaagcg tgctcgctgc 2366041 ggtgcaggca agctcgtggc aggggcccag cgccgaccgg ttcgtcgtcg cccatcaacc 2366101 gttccggtat tggctaaccc acgctgccac ggtggccacc gcagcagccg ccgcgcacga 2366161 aacggccgcc gccgggtata cgtccgcatt ggggggcatg cctacgctag ccgagttggc 2366221 ggccaaccat gccatgcacg gcgctctggt gaccaccaac ttcttcggtg tcaacaccat 2366281 cccgatcgcc ctcaacgagg ccgactacct gcgcatgtgg atccaggccg ccaccgtcat 2366341 gagccactat caagccgtcg cgcacgaaag cgtggcggcg acccccagca cgccgccggc 2366401 gccgcagata gtgaccagcg cggccagctc ggcggctagc atcagcttcc ccgacccgac 2366461 caaattgatc ctgcagctac tcaaggattt cctggagctg ctgcgctatc tggctgttga 2366521 gctgctgccg gggccgctcg gcgacctcat cgcccaggtg ttggactggt tcatctcgtt 2366581 cgtgtccggt ccagtcttca cgtttctcgc ctacctggtg ctggacccac tgatctattt 2366641 cggaccgttc gccccgctga cgagtccggt cctgttgcct gccgggctga ccgggcttgc 2366701 cgggctcggt gcggtatcgg ggccggccgg accaatggtc gaacgtgtgc actccgatgg 2366761 tcccagccgg caaagctggc ctgcggccac cggagtcacc ctggtgggta ccaacccggc 2366821 tgccctggtt accacgcccg cacccgctcc gaccacgtcc gcggcaccga cggcaccgtc 2366881 gactcccgga tccagtgccg cccaaggcct ttacgcggtc ggtggtcccg acggggaagg 2366941 gttcaacccg atcgccaaga cgacagcact cgccggtgtt accaccgatg ccgccgcacc 2367001 tgccgccaaa ctgcccggcg accaagctca gagcagcgcc agcaaagcaa caagactgcg 2367061 gcgacgtctc cggcaacacc gcttcgagtt tctggccgac gacggccgcc tgaccatgcc 2367121 aaacacaccg gagatggcag acgtcgccgc cggcaaccgt ggattggatg cgctggggtt 2367181 cgccggcacg atcccaaaat cggcgcccgg atcagcgacc gggcttactc acctaggcgg 2367241 cggattcgcc gacgtcctgt cgcagccgat gcttccgcac acgtgggacg ggtcagatta 2367301 aacgttgaag tacttggctt ccggatggtg caggacgaac gcgtcggtcg actgttcggg 2367361 atgcagctgt aattcctcgg ataacgtcac accgatgcgt tcgggctcca gcagcgccat 2367421 catcttggcg cggtcctcca gatccgggca tgcgccgtag ccgaaggcaa agcgagcacc 2367481 gcggtagccg agcttgaaat agtcttcttt cgcctccgga tcctcggccg ccatcgcccg 2367541 atccccggag aacttgagct cctcacggat ccgccggtgc cagtactcgg ccagcgcctc 2367601 ggtgagctgc acgccgatac cgtgcacctc caggtagtcg cggtaggcgt tggacgcgaa 2367661 cagctcgttg gcgaaatccg cgatcggctg acccatggtc accagctgga acggcagcac 2367721 gtcaacctcg ccacgctcgg cggccagctc ccgcgagcgg atgaaatcgg caatgcacaa 2367781 aaaccgaccg cgctgctggc gcgggaagtg aaaccggtag cgcaccgggg cgtcgggctt 2367841 gggctcggtg agcaccacga tgtcgttgcc ctcggacacc gccgggaaat agccgtacac 2367901 cacggcggcg tgcgccaaga tgccgtcggt ggacagccgg tccaaccagt accgcagccg 2367961 cggccggccc tcggtctcga cgagatcttc gtaggacgga ccctcaccgc cgcgctggcc 2368021 gcgtaaaccc cactggccca aaaacaatgc gcgctcatcg agcagaccgg tgtagtcggc 2368081 caccgccagg cccttgacga tccgcgaacc ccagaacggc ggcgccggga cctcgatgtc 2368141 ggccgcgaca tcggagcgtt cgggcacctc gactggttct tcggcggctt tgcgctgtgc 2368201 ggcaatgcgt ttggatcgct ggtggcgggc cttacgttcg gcttctttct cacgcgcctt 2368261 aatggcttcc gggctgtttt cgtcgggcgc ctcgccgcgc ttggcgctca tgatggtgtc 2368321 catcaacttc aggccctcga aagcgtctcg cgcgtaatgc acttcgccct ggtagatctc 2368381 ggccaggtcg ttttcgacat agctgcgcgt caacgccgcg ccgccgagca gcaccgggaa 2368441 cttttcggcg actccccggg tgttcatctc ctcgaggttt tccttcatca ccacggtcga 2368501 cttcaccagc aggcccgaca tgccgaccac gtcggcgctc ttgtcctcgg cgacttcgag 2368561 gatggtggcg attggctgct tgatgccgat gttgaccact tcgtagccgt tgttgctcaa 2368621 gacgatgtcg accaggttct tgccgatgtc gtgcacgtcg cccttgacgg tggccagcac 2368681 gatgcgtccc ttgcccgaat cgtcgtccga gcgctccatg tgcggttcca gatacgcgac 2368741 ggcggctttc attacctccg ccgactgcag cacgaacggc agctgcatct ggccggagcc 2368801 gaagagctcg ccgaccgtct tcatgccggc cagcagatgt tcgttgatga tctgaagcgg 2368861 cggcttttgc gtcatcgcct cgtcgagatc ggcgtccagg ccgttgcgct cgccgtcgac 2368921 gatgcgttgg gccagccgtt cgaacagcgg cagcccagct agttcagcca gtcggtcctc 2368981 tttcgaggag gccgccgaca cgccttcgaa cagccgcatc agctcctgca gcggatcgta 2369041 gtcctcgcgg cggcggtcgt agaccagatc cagggcgacg ttgcgttgct cctcgggaat 2369101 ccggttcatc ggcaggatct tcgacgcgtg cacgatcgcc gaatccagcc ccgcttcttg 2369161 gcattcgtgc aggaacaccg agttgagcac ctggcgcgct gcgggattga gaccaaacga 2369221 gatgttggac agaccaagtg tggtctgcac atccgggtgg cgctttttca gttcgcggat 2369281 cgcctcgatg gtctcgatgc cgtcgcggcg ggactcctcc tgaccggtgg cgatggtgaa 2369341 cgtcaaggtg tcgatgagga tggatgattc gtcgacgccc cagttgccgg tgatgtcgtt 2369401 gatcagccgc tcggcgatct cgaccttctt ctgcgcggtg cgggcctggc cctcttcgtc 2369461 gatggtcagc gcgaccaccg ccgcgccgtg ctcggcgacc agcgccatgg tcttggcaaa 2369521 gcgcgattcc gggccgtcgc cgtcctcgta gttcaccgag ttgatcgcgc accggccacc 2369581 cagatgctcc aaacccgcct gcagcaccgc ggtttcggtg gagtccagca tgatcggcag 2369641 cgtcgaggac gtggccagcc ggctggccag cgccttcatg tcggccacac cgtcgcggcc 2369701 cacgtagtcc acacacaggt ccagcaggtg ggcgccgtcg cgggtctggt ccttggcgat 2369761 gtccaggcac ttctggtagt cctcggcgat catcgcctca cgaaaaccct tggagccgtt 2369821 ggcgttcgtt cgctccccga tcaccagaac cgaggcgtcc tgggcgaacg ggattgcggt 2369881 gtacagcgac gacaccgacg gctcgtagct gacctgtcgc tcgggacgct tgatgttcgc 2369941 aaccgcggca gccacttcgc ggatatgggc cggggtggtg ccgcagcagc caccgaccag 2370001 cgagagcccg aactcggcga tgaagccggc cagcgcctcg gccaattcgt cgggcagcaa 2370061 cggatattcg gcgcccttgg cgcccagcac cggcaacccg gcgttgggca tcaccgacac 2370121 cgggatgcgg gcgtgccggg acaggtggcg caggtgctcg ctcatctcgg ccggacccgt 2370181 cgcgcagttc aagccgatca tgtccacacc gagcggctcg acagcggtca acgccgcccc 2370241 gatctcgctg cccagcagca tggtgccggt ggtctcgacg gtgacgtggg caaacaccgg 2370301 aatgtgccgc ccggcccgcg tcatcgcccg ccgcgacccc aacaccgccg ccttcagctg 2370361 cagtaggtcc tggcaggttt ccaccaggat ggcgtcggct ccgccgtcca gcatgcccag 2370421 cgcggcctcg gtgtaggcgt cgcggatcac cgcgtattcg gtgtggccca gagtcggcag 2370481 cttggtgccc ggccccatcg accccagcac gtagcgcttg cggtcgggac tgcccagctc 2370541 gtcggccacc cggcgtgcga tcgcggtgcc cttctgtgat agatcgcgga tcctgtcggc 2370601 gatgtcgtag tcgccgaggt tggacaggtt gcagccaaac gtgttcgtct cgacggcgtc 2370661 ggcgcccgct tcgaaatagt tgcggtgaat ggtttccagc acgtcagggc gggtttcgtt 2370721 gaggatctcg ttgcagccct ccaggccgcg gaagtcgtcg agcgtgaggt ccgcggcctg 2370781 tagttgggtt cccattgcac cgtcgccgac catcactcgc tgcgacaaga cgtcgagcag 2370841 atcggtgtcg tagaggtgct tgtcggccgc agtcacatgg caaggatagt cggcctatga 2370901 aatttcctca gtcgttgaca gcgctctgcc aggtaccgcg acgtcgcatc ggtcacagct 2370961 gccacaagag tctcagctga ggcaggcaca caacgtgccc acctcagcgc gacaaagcgt 2371021 ggccatcgct actagccggg ccgcctcaga cgacgtgcac ggttcgcatc gtcgcccggg 2371081 tggacgccgt aggctgacca ggtgacccca tcggagggca acgcaccgct gcccgaactg 2371141 cacaacaccg tcgtcgtggc tgcgttcgag ggctggaacg acgccggcga cgcggccagc 2371201 gatgccgtgg cacacctggc ggccagctgg caagcactgc cgattgtcga gatcgatgac 2371261 gaggcctact acgactacca ggtcaatcgg ccggtcatcc gccaagtcga tggggttacc 2371321 cgggaactgc agtggccggc catgcggatc tcgcactgcc gcccacccgg cagcgaccgc 2371381 gacgtggtgt tgatgtgcgg ggtggagccg aatatgcgct ggcgcacgtt ttgcgacgag 2371441 ttgctggcgg tcatcgacaa actcaacgtg gacaccgtgg tgatcctggg ggcgctgctg 2371501 gccgacaccc cacacacccg gccggtgccg gtctcgggcg cggcctactc cgcggcgtcg 2371561 gcgcggcagt tcggccttca agaaacacgc tacgagggcc ccaccggcat cgccggcgtc 2371621 ttccaatctg cctgtgtggg ggccggcatc ccggcggtga cgttttgggc ggcggtgccg 2371681 cactatgtgt cgcacccacc gaacccgaag gcgacgattg cgttgctgcg ccgggtcgag 2371741 gacgtgctcg acgtcgaggt gccgttggcg gacctgcccg cacaggccga agcgtgggag 2371801 cgcgagatca ccgagacgat cgccgaagat cacgagctgg ccgagtacgt gcagacgctg 2371861 gaacagcacg gcgacgccgc ggtggacatg aacgaggctc tcggcaacat cgacggcgac 2371921 gcgctggccg ccgagttcga gcgctatctg cgccggcgcc gcccggggtt cgggcgctag 2371981 agggaggttg cgctgcggcg gacgacggtg tcagccgggc ggcccaggat cgccggaatc 2372041 accctgagtg cccggagcgc cggctttgcc gggattcgtg cctgtcgacg taccaccgcc 2372101 ggcaccagcc tcgccgcgcg caccgccgcc gccgccggcc ccgccctcgc cgccgctgcc 2372161 catggcgccg tgggcgcccg agtggccact gagccagccg cccgccccgc ccgcgccacc 2372221 ggcaccgcgg gcacctccgg cgccgccggt gccaccatcg ccaccatcgc cgccgtcacc 2372281 gccgcggcca ccgaaggcgt taccaccccc aaaggcgctg gcaacgccgc cgccaccggt 2372341 cccgccggcc cctcccccac cgcccacacc gccggcaccg ccaatgccgc cggcaccgcc 2372401 agccccgccg tcaccgatca gcagcccgcc ggccccgcca gccccgcccg cgccgccggc 2372461 accacccatg ccgccagagg ctccggtatt cccgttgccg ccgaagccac cgcggccacc 2372521 ttgggcaaag cccccagtga actcgtcggc gaacccacct ttcccgccgt cgccaccgag 2372581 gccgccggga gcgccggcgc ctccgatgcc gccattgcca ccggcgccgc cggccccgcc 2372641 gttaccgatc aatcccgcac tgttgccggc cccacggtcc tgaccggcgg cacccgcccc 2372701 accgtgcccg ccattgccat acagcaatcc acccggccca ccgggctggc ccggcccgcc 2372761 gttagcgccg tcgccgatca acggccggcc aacaatgcct gggtgggcgc attgacggca 2372821 ttgagcactt cgtgtgccaa cgtggcgttg gcggtctcgg cggccgcgta ccacccgcca 2372881 cccgaggtta gggccgcgac aaactgcgcg agatacgcgg ccgcctgcgc gctgatctgt 2372941 tggtattcct gcgcattcgc gccgaacagc gccgcgatag ccatcgacac ctcgtcggcg 2373001 gcggtcggcc gccaacgccg tcgtcgggcc tgccacgagc gcattggccg cgctgatcgc 2373061 cgaaccgatc cccgccacat ctgcggcggc cgcggtcaga aagaaggttg cgcgattacg 2373121 aacgacatgt agtctccaac cgtttacggc cgcccggcaa ggacctaacg aaccgttaag 2373181 taggcggcga cagcgcgaac gctaccgtga ccgcactcgc gcgaccccac actaggaagc 2373241 agcactaatg attttcttat cttctccgca gcatcgacgg cgccagccga cgttgcggtg 2373301 tgtgcgggta cgattccggt ggagttgccg ccacccctag agtgggcgag gatcgcaaga 2373361 gcaaattccg cgccggtagg acaacgatag gaccgccatt acgaagccgc ccgagactcc 2373421 tgaattgagc gcggcctcac agcgtgtcga cgccttcggc gaagaggccg gctatcacaa 2373481 aggcctcaag ccccgacaac tgcagatgat cgggatcggc ggcgcgattg ggaccggcct 2373541 gttcctcggc gccagcggcc ggcttgccaa ggccggacct gggttgttct tggtgtacgg 2373601 cgtgtgcggg gtttttgtct tcctgatcct gcgggcgctg ggtgagctgg tgctgcaccg 2373661 tccgtcgtca ggctcgtttg tgtcgtatgc acgtgaattt ttcggcgaga aggccgctta 2373721 cgcggtgggc tggatgtact tcctgcactg ggcgatgacg tcgatcgtgg acaccaccgc 2373781 gatcgccacc tacttgcagc gttggacgat cttcacggtg gtcccgcaat ggattcttgc 2373841 cctgatcgcc ttgacggtgg tgttgtcgat gaacctgatt tcggtcgaat ggttcggcga 2373901 gctggagttt tgggccgcgc tgatcaaggt tctcgcgctg atggcgttcc tagtggtggg 2373961 aaccgttttt ttggccgggc gataccccgt cgacggccac agcaccggat tgagcttgtg 2374021 gaacaaccat ggcgggctgt tcccgacaag ctggctgccg ctgctgatcg ttacctcggg 2374081 agtggtgttc gcgtactcag cagtcgaatt ggtagggacg gcggccgggg agaccgccga 2374141 gccggagaag atcatgccgc gggcgatcaa ttcggtggtc gctcgcatcg cgatctttta 2374201 tgtcgggtcg gtggccctgc tagcgctgtt gctgccgtat accgcctaca aggccggcga 2374261 gagcccgttc gtcacgttct tttccaaaat cggtttccac ggtgccggtg acttgatgaa 2374321 catcgttgtg cttaccgccg cgctgtcgag cctgaacgcg gggctgtatt cgaccggccg 2374381 cgtcatgcat tcgatcgcga tgagcggcag cgccccaagg ttcaccgcgc gaatgtcgaa 2374441 aagcggtgtg ccctacggcg ggatcgtgtt gaccgcggtc atcaccctgt tcggtgtcgc 2374501 gctgaacgcc ttcaagcccg gtgaagcctt cgagattgtg ctcaacatgt ccgcgctggg 2374561 catcatcgcg ggttgggcca ccatcgtgct gtgtcagctt cgacttcaca agctggccaa 2374621 cgccgggatc atgcagcggc cgcggttccg catgcccttc tccccctaca gcggctacct 2374681 caccttgctc ttcttgcttg tcgtgctggt tacgatggcg tccgacaaac cgatcggcac 2374741 ctggacggtg gcgacactga ttattgtcat tccggccctg accgcaggct ggtacctggt 2374801 acgcaagcgt gtcatggccg tcgcccgcga aaggctgggt cataccgggc catttccggc 2374861 ggtcgccaac ccgcccgtga ggtcaagaga ctgatgcttc gaagaggtga atcgatcatc 2374921 cgcaaccgtt acgccagtaa gccaccactg tacggaatgg caatggtctt cttggccatg 2374981 gccgtcgtcg ccgtgaccgc gtactttcgc atgggctggt ggtcgatcat cggttacgcc 2375041 gccgctgcca ttatcggagt gatcgggttc gcactcgcct tccgcgacct gtcctgaatc 2375101 gagcgcggca gaacctctag gaattctcga gtgattcggt gtaggcgctg gcaaagcggc 2375161 cgagcgcggc gacctcggca tccatctggg gcatcagctt ggcaacggtg ttgcggatgg 2375221 gacgttggcc tacccgggtg gacaacaacg gcttgagcca acggaacagg gccacccagc 2375281 ccgggcagta cacgcgatct tttcggccct caatgccgtt gacgaatgcg gccgcacact 2375341 tgttgaccga cgtggtcttg ttcaacggcc aagggaggcg cgccagcaat tcggcgaacg 2375401 caggcaggtc ggccttggta tcgcgaacca acgcggtgtc gatccacgac atgtgcgccg 2375461 agccgacgct gacgcccagg tgtgcgacct cgagtcgcaa cgcgttggcg aagtgctcgt 2375521 tacccgcctt cgacatgttg tagggcgcca tcccgggcgg cgccgcgaac gcggcaagcg 2375581 acgagacgat caatacgtaa ccgcggcggt cgatcagcgc gggcaacgtc gcccgcaccg 2375641 tgtggaagtt acccagcaaa ttgacgtcca acacccgccg gaacgcctgc gggtcgacct 2375701 tcagcacgga gccgtagctg gcgatgccgg cgttggccac gacgacgtcg atgccgccga 2375761 atcgttcgac ggccgtctcg gctgcggcct gcatggcggg caggtcgcgc acgtcggcta 2375821 ccacggtgag taggcggtcg tcgccgccga gttcggcgcc catcaccgcc agctctgatt 2375881 tgctcaggtc ggtcagcacc agtttggcgc ccttgttgtg cagccgacgg gcgacctcag 2375941 ccccgattcc ccgggcagca ccggtaatga agacgacctt gccttgcagc gatgtcatgg 2376001 ccgaaaacgt accgccgcgc cggctacagg tccaccccga gcagggcatc gatcgccgtc 2376061 gccaccaact tcggcgcccc ggcatcgtgg ccgccgtact ccaccgcatc ggtgacccaa 2376121 ccatccagtg cggcaatcgc tttgggcgta tcgagatcgt cggccaggta gcggcgcacc 2376181 cgagcgacaa cgtcaactgc ggccggaccg gcgggaagtg cggttgcggt gcgccaacgg 2376241 tgcagccggg cggtcgcctc gtcaagcacc tgctggctcc agaaccgatc ggctcggtag 2376301 tgtccggcga gcaaacccag ccgaaccgcc gatggctcaa cgtcctgcgc acgcagcgcc 2376361 gacaccagca cgaggttgcc gcggctcttt gacatcttgt gcccgtccca gccgatcatc 2376421 ccggcatgca cgtagtgccg cgcgaatcgc cgttcgccgc tgacacattc ggcgtgcgca 2376481 gcggtgaact cgtggtgcgg aaagatcaga tcgctaccac cgccctggat gtcgaggccg 2376541 cttccgatac gactgagcgc gatggctgcg cactcgacat gccagcctgg ccggccaggc 2376601 ccgaacgggg acggccagct gggctcaccg ggccgcgcgg cccgccacaa caacgcgtcg 2376661 agttcgtcgc tcttgccggg gcgccgcgga tcgccgccac gttcctcgca cagccgcagc 2376721 atggtgtcac ggtcataccc tgactcgtag ccgaactgca gggtggcgtc agcgcggaag 2376781 tagatgtcct ggtactctcc catttcccgg tctatgacat aggccgcccc gcacgccagc 2376841 attttttcga tgagctcgac catttcagca atcgcttcgg tggcccccac gtagtcttgc 2376901 ggtggtagca cccgcagcgc cgccatgtcc tcacagaaca gggcgacctc ggcttgggca 2376961 aggtcacgcc agtcgacacc gtcgcgatcc gcgcgctcaa atagtggatc gtcgatgtcg 2377021 gtgatgttct ggacatagtg caattcatga ccgagatcca gccacagccg atggatcagg 2377081 tcgaacgtca cataggtggc agcatggccc agatgcgtgg cgtcgtaggg cgtgatcccg 2377141 cagacgtaca tggtggcctt agatccgggc gccaccggac ggacctgccg gtcggcgctg 2377201 tcgtacagcc gtagctgcgg gcctcgtccc ggcaacaccg gaaccggtgg gcaataccac 2377261 gactgcatgt cctcgactct aaacggcccg gtgactccag cctttctgag cagcccgcgc 2377321 gccgatcagc gccacgcgtc ggcgatggca ccgagcagga tcggcgccac ctcggctcga 2377381 cacatcagca gatccggcag gtaggggtcc agttggttgt atcgcagcgg cgagccatcg 2377441 agtcgtgacg cgtgcatgcc ggcggccaac atcaccccag ccggcgccgc ggaatcccac 2377501 tcccattggc ctccggcgtg caggtaggcg tcgacgtagc cgtcaatgac ggccatcgct 2377561 ttggcgcccg ccgaaccgat cgacaccggt tggatcgcca gcgtctggcg gatgcggtgc 2377621 aggactgccg gtggccgggt ggcgctgacg gcaatccgca aggtgccagg aacgccggcc 2377681 ggcgcggcgc cggaagtcac cgtatcggtg cggtacacca cgttgccacg ggccggcaac 2377741 gccaccgcgg cgtcggtgat ctcgggctgg ccattggagg aacgccgcca cagcgcaatg 2377801 tgtaccgccc agtcgtcgcg acccggtgtg gagaactcgc gggtgccatc caacgggtca 2377861 ataatccaca cccgatcgga tttcagccgg gccagatcgt cgtgggcctc ctcactgagc 2377921 actgcgtcac ccggccgttc ggcctgcagc cgtcgcaaca gcagcgagtt agcctggcgg 2377981 tcaccggctt ccccgagcgt ccatggctga tcgaaaccga tctccgcacg cacctggagc 2378041 aacagctttc ccgcgtccgc cgccaggtcg gcggccagct cggcgtcagt caggtcgtcc 2378101 gtcagatcag gtgcggcagg gctcaccact tcagtatcgc cgagctgaac gcaggcattt 2378161 gacaatgggg cttcacatca tgatgctcta taattcgcat cttgatgcac aatagtggga 2378221 tgcgaaccac ggtcagtctc gccgacgacg ttgccgctgc cgtgcagcgc ttgcggaagg 2378281 aacgctcgat cgggctgagc gaagccgtca acgagttgat ccgcgccggg ctcacgaaac 2378341 gacaggtcgc aaatcggttc cagcagcaga cgtacgacat gggcgaggga atcgactact 2378401 ccaacatcgg cgacgcgatc gaaacactgg acggcccggc aagcggctaa tgctcattga 2378461 cgcaaacctc ctgctctatg ccgtcgacga gcgtgccgcg cggcaccgcg ccgcggttgg 2378521 ctggctttcg gaacaactca acggctcccg tcgggtcggc ttgccgtggc agagcctggc 2378581 cgccttcctg cggatcggga ctcatccacg tgcgttcccg cgaccactca cacctgccgc 2378641 ggcattcgac atcgtcgacc taaaacgccg gccagggaat gggacgatgc ccattcggcc 2378701 cgggcatcac cggttggtcc agcagcgatt gcgcgcgtcg gcgcagcgcg ccgatttcgg 2378761 ctgcggcgat tcgcccggcc aacgcctcgg caagcgggcc gccaagagca tcggcgagcc 2378821 cggcaaccgc ctgcagaatt tggtcgtcaa tcggcttgcc ggcccaaccc cacagcacgg 2378881 tgcgcagctt gttctcgacg tgcagacaca atccatggtc gaccccgtag acctggccgt 2378941 cgatgccgca caggatgtga ccgcccttgc ggtcggcgtt gttgataagc acgtcgaaca 2379001 ccgccatccg gcgcaaccgg atgtcgtccg cgtgcatcaa aacgacctcg tcaccggcgt 2379061 agtcgtaggc ccgcagcacc ggcagatagc ccggccgcgg ccggtgggcg ggaaacaggt 2379121 cgaccaggtc gggcccgggc agagggtcgg agtcgaccgc gtcgccgggt tgctgcaccc 2379181 agagctgtag catgcctatg cccgccggac cgtctcggat gatggtgtgc ggcaccaggt 2379241 tccagcccaa ctgtgtcgac accagatagg cgctgagttc gcggccggcc agcgttccgt 2379301 cggggaaatc ccacaacggc cgctcgcccg agaccggctt gtagacgcaa tgcaggctgc 2379361 gcagacccag cgtggactca cacaaaaagg tggcgttgct cgccgagcgg atccgcccga 2379421 ggactgtcag ctcgccgtcg gccaacaccg catgctcgtc atcccgcagg gtcatcgcca 2379481 gacccgagca gcacatcgcg ccggtaaccg ttggtgcgcg cacagatgtg tccctcggga 2379541 tccagcggtt catcgcagag cgggcacggc gggcgtcccg cagagatgac gcggtaggac 2379601 cgagtagcga actgtcgggc ggactccggc gtcagaaata cccgcaccgc gtcgggccct 2379661 tcctcggtgt cgtcgagcac cacggaagcg tcgaactccg cgtcggtgac ggccagcagt 2379721 tcgaccacca cgctctgcgc ctccgaatcc cagcccagcc ccatcgtccc gacccgaaac 2379781 tcggcatcca ccggcatgat cagggggctg aggtcgtcga tctcagtggg ttccgggggc 2379841 accggggtgc cgaaccggcg gttaacctcg aacagcagcg ctccgatgcg ctcggcgagc 2379901 accgcaacct gctgcttctc caggaccacc gacaccaccc gggagtcgtg caccgcctgt 2379961 aggtagaacg tgcggtttcc gggctggcca acagtcccgg cgacgaaacg gtcgggtgtg 2380021 cggaatacgt gaattgcgcg ggccatggca cctccaaaat accgcgcaga cgccgttgcc 2380081 gcgttcttcg tcgacggtca ccccacacgc tagtcggtgg aaccgccgat caccgcgtcg 2380141 ccgggtggca ccgccgcgtt cggctccggt gacgcaccct gggcggaagc cgcggcctgc 2380201 aaagccgggg ccagccgggc gccggtgtgg ttgacgtgca gcacgaacgg gcgcagctgg 2380261 gtgtagcgga cgacactcac cgaacccggg tcggcggtga ttcgctgaaa gctgtccaga 2380321 tgcataccga acgcgtctgc gatcaccgcc ttgatgacat cgccatgggt gcaggccagc 2380381 cacagcacgt cgtggccgtg ctgatcggcc agccgccggt cgtgttcgcg gacggctgcc 2380441 acagcgcgag tctgcacctg cgccaaaccc tcaccgccgg gaaacaccgc cgcgctgggg 2380501 tgggcctgga ctacccgcca caacggctcg tcgaccaggt caccgatttt tctgccagtc 2380561 cattcgccgt agtcgacttc ggagaaccgg tcatcgatga gcggctccag gcacagcgcc 2380621 tcggccagcg gttcgacggt gcgttgacac cgcagcattg gagaagacgc gaccgcccgg 2380681 atcggcaggt caccaattcg atcgatcaac ccggtggcct gctcgcgccc cttctcgtcg 2380741 aggtcgacgc cggaccggcc ggccagcacg cccgcggtgt tcgaggtgga acgggcatgg 2380801 cgtagcaaga tgacggtcat gtcgcggcta ccgtcccggt agccagcagc acgagcatgc 2380861 ccgtcccgac gagcacccgg tagccgacga accagtacat gttgtgtcgc accagaaacc 2380921 gcagcagcca ggccaccgcg gtcagaccga ggacgaacgc gatcagggtg gccaccagca 2380981 actgcgggcc agtagcgctc atgccctcgg ttaccgggtg gaatgcgtcg ggcaacgaga 2381041 acaacccgga ggcgaacacc gctggaatgg ccagcaggaa tccgaatcgg gcggccagtt 2381101 cacggtcgag tccgagaaac agtccagcgc tgatggtcga cccggacctg gataccccgg 2381161 ggaccagcgc cagggtttgg gcaataccaa ccaccacggc atcccgccag gtcaaccgct 2381221 caatgtgacg actctggcgc cccacgtatt cggcgagtgc gatcaccccg gaaaacacca 2381281 ccagcgcggt caccacgacc cacaggttgc ggacgcccga ccggatgtcg tctttgaaga 2381341 acaggcccag aatgcagatc gggattgtgc cgatgatgac ataccagccc agccgataat 2381401 cggtgtttcg atgtgccttc acgaccaggc cgtgcaccca agcgctcagg atgcgcacaa 2381461 tatcgcgcgc aaagtagatc actacggcgg cctcggtgcc caactggctc acggcggtga 2381521 acgaggcacc ggcgtcgccg ctgaagaaga tccgcgacac gatcgccaga tgtcccgagg 2381581 acgacaccgg caggaactcg gtcaaaccct gggccgcggc caacacgatg acttgccacc 2381641 aagacatcgc cggagccgcg gtcacgacga cgacggtacc cggttaccgg cggccggtag 2381701 acggatgcgc ctaacgcgac accggcgccg cggatgccac cgagtcgcgc acggccgcag 2381761 caaggctgcg ttcgtcggtc aggtcaatgt cgaccaggcc acggaccgcc atcgcgacca 2381821 catcctcagc ctgcggcacc ggaccggcca cgcgcggtcg atagatttcg acgacgagcg 2381881 agcgatgctc gatgtggaag gagaaactgc gtccatcgcc gacttgccca tacccgctgg 2381941 cgaaaattcc cgtcgatata tcttcaacag caaactcttc gcgcttgtct gcgagatgcc 2382001 ggtcagcggt gacggtcatg cccagagaat acctctggag taccatttcc cgtgggcgac 2382061 atgacgagat tgaaagcaac ttgccagatt cggattcgtg agaggttgac ttcatgtttc 2382121 gcatccgaag gctgaccgtt gctaacaggg aataaaccag cagttcaacg ccgcttcatc 2382181 ggcctgttga tgttgtcagt cctggtcgca ggctgttctt cgaacccgct ggctaacttc 2382241 gcacccgggt atccgcccac catcgaaccc gcccaaccgg cggtgtcacc gcctacttcg 2382301 caagacccgg ccggtgcagt gcgaccactg agcggccacc cccgggcggc actattcgac 2382361 aacggcaccc gccaattggt ggctctgcgc ccgggcgccg attcggcggc acccgccagc 2382421 atcatggtct tcgatgacgt gcacgttgca ccgcgcgtca tttttctgcc gggcccggca 2382481 gccgcgttga ccagcgacga ccacggcacg gccttccttg ccgcccgcgg cggctacttc 2382541 gtggccgacc tgtcctccgg tcacaccgca cgagtgaatg tcgctgacgc agcgcacacc 2382601 gatttcaccg cgatcgcccg ccgctccgac ggcaagctgg tgctgggcag cgcagatggc 2382661 gccgtctaca cgcttgccaa gaaccccgca gttgacccgg cgtccggcgc cgccaccgta 2382721 gccagccgga ccaagatctt cgcgcgcgtg gatgcccttg taacacaagg gaatacaacc 2382781 gttgttctgg atcgtggcca gacctcggtg accacgatcg gcgccgacgg tcatgcccag 2382841 caggcactgc gcgccggcca aggtgcgacg accatggccg ccgatccgct gggccgggtg 2382901 ctgatcgccg acacccgtgg tggccaacta ctggtgtacg gcgtcgaccc gctgatcttg 2382961 cgccaggcct acccggtgcg gcaggctccg tacgggctgg ccggatcccg cgaattggcg 2383021 tgggtgtccc aaaccgcgtc caacaccgtc attggttacg atctgaccac cggaataccc 2383081 gtagagaagg tgcgttaccc aaccgtgcaa caacccaact cgttggcctt cgacgaaacg 2383141 tcggacacct tgtacgtggt gtcgggatcc ggtgccgggg tccaggtcat cgaacacgcg 2383201 gcgggcaccc gatgagcagc cgacccgcgg cgcggcggac ctggttgcct accggctggg 2383261 attccgagat gtccgacgag tacgagtggg cgccattgcg cctaccgcca gaagtgacca 2383321 gggtcagcgc gtccacccgg ctgtccatcg aggccgaata ccgcggctgg gagctagcac 2383381 gggtacgcct ctataccgac ggcagcaggc gggtattgtt gcgccgcaag aaatctcgct 2383441 gggcagacgc agaggcgaac cgccggccag accagccgca gctgtggctc tgaaggccgg 2383501 ggccagcccg cgcgcagacc gctatcggat gtatcccctg gtgcgtcggc tgttgttcct 2383561 gatcccaccc gagcacgcgc acaagttggt tttcgccgtg ctgcgcggcg tggccgccgt 2383621 ggcgccagtg cgccggctct tgcgccgact gctgggcccg acggatccgg tgctggccag 2383681 cacggtgttc ggggtgcgct tcccggcacc gctcgggctg gccgcggggt tcgacaagga 2383741 cggcaccgca ctatccagtt ggggtgcgat ggggttcggc tacgccgaga tcggcaccgt 2383801 caccgctcat ccgcagcccg gcaacccggc cccccgcctg ttccggctgg ccgacgaccg 2383861 cgccctgctg aaccggatgg ggttcaacaa tcacggtgcc cgggcactgg cgatccgact 2383921 cgcgcggcac cgacccgaga tcccgatcgg ggtgaatatc ggcaagacca agaaaacgcc 2383981 ggccggcgac gcggtcaacg actaccgggc cagcgcccgg atggtcggcc cgctggcgtc 2384041 gtatctggtg gtcaacgtca gctctccgaa cacaccgggg ttacgcgatc tgcaggcggt 2384101 cgaatcgctg cggcccatcc tgtctgccgt ccgcgccgag acttcgacgc cggtgctggt 2384161 gaagatcgcg ccggacttgt ccgattccga cctcgacgac atcgcggacc tggccgtcga 2384221 gctagacctg gccggcatcg tggcaaccaa caccacggtg tcacgcgacg gcctgaccac 2384281 accgggggtc gaccggttgg gtcccggcgg catctcgggg ccaccgctgg ctcagcgcgc 2384341 ggtccaggtg ctgcgtcggc tctatgaccg ggtcggtgat cgattggcgc tgatcagcgt 2384401 gggcgggatc gagacggccg acgacgcgtg ggagcgcatc acagcgggcg catcgctgct 2384461 acagggctat accggcttca tctacggcgg ggaacggtgg gccaaggaca tccatgaagg 2384521 cattgcccgc aggctgcatg acggcgggtt cggctcgctg cacgaagcgg tcggctcggc 2384581 aagacgtcgg caacccagct aaagcgctaa cgctgctcgt aggtgccgaa gatgaccgct 2384641 cgtgcaatcg cgtgctggaa caggttgaat cccagatatg caggactcgc gtcctcgggg 2384701 aggtcgagct tttcgacctt caccgcgtgt accgcgacgt agtagcgatg caccccatga 2384761 ccgggaggcg gcgccgcacc cacataccgg cgcataccgg cgtcgttgac caatgtcagt 2384821 gccccgcccg gcagttcgcg gccatcgccg acaccctcgg gcaactcggt gacgttggca 2384881 ggcaggttgg ccaccgccca gtgccagaac ccggacaggg tgggggcatc agggtcgtag 2384941 acggttaccg cgaagctgcg ggtctcgctg ggaaatcccg accacctcag ctgcggactg 2385001 gcatccgccc cgcccgcacc catgatcccg ctgacctggg gtgtagccag cggctgccca 2385061 tcggtgatcg aggttgacgt caggctgaag gacggcagct tgggcagcgc ggcatacggg 2385121 tcgggtgaag ttgtcatggt cagtcctctc gtgtgatcga cgttgcgact agcctcgttt 2385181 tcgactagca gtgtgtcagc aagtgcgtta gcacctcggt gccgaaccgc aacccatcga 2385241 tgggtacccg ctcgtcgacg ccgtggaaca acgaggtgaa atccaagtcc ggcggcaagc 2385301 gcagcgggct gaagccaaag caccgaatac ccaagcgcgc gaacgccttc gcgtccgttc 2385361 caccggacag catgtacggc accgtgcgac cgtctgggtc gaccgccaac accgcggcgt 2385421 tcatggcggc gaccagatca ccgtcgaagg tggtctcata tgatggcaga tcgctgaccc 2385481 actcccgggt cacgtcgggt ccgatcagcg cgtcgacttc ggcctcgaac gccgcccggc 2385541 gacccggaag cacgcggcag tccacaactg cctccgcggt cgccgggacg acgttggcct 2385601 tgtatccggc cttgagcatc gtagggttcg cggtgtcatg tagcactgcc ttcaacatgc 2385661 gggccatcgg gccaagcttg tcgatcgtcc cggccaggtc cggcgagtca aggtcgaagg 2385721 ccagtccggt ctcctctccg actacggcca agaactgggc gacggtgtca gtgcagacca 2385781 gcggaaactg gtggcgccct aggcgagcga ccgcctcaca aacggcggtg accgcgttct 2385841 ggtcgtgcac catcgagccg tgcccagccc ggccgcgtgc cgtcagccgc atccactgga 2385901 tgcccttctc ggcggtttca atcaggtaca ggcgacgttc gccaccatcg tgccggggca 2385961 cggttagcga gaaaccgccg acttcaccga ttgcctcggt gatgccgtcg aacagatcgg 2386021 gcctattgtc gaccagccag tgcgacccgt acttgccgcc gtgctcctcg tcggcaacga 2386081 acgcgaacac cagatcccgt ggcggcacga tagcggcctg acgaaggtgg cgggcaacca 2386141 caatcatcat gcccaccatg tccttcatgt cgaccgcgcc acgaccccag acgtagccgt 2386201 cttcgatggc gccggaaaac gggtgcacac tccattcggc cggttcagcc ggcaccacat 2386261 cgagatgccc gtggatcagc agcgcgccgc gagaactatc ggcgcccgcc agccgggcga 2386321 acacgttgcc gcggccgggc gcaccggatt caacgtattc aggttggtag ccgacttcgg 2386381 cgagctgctc ggcgacccag cgtgcgcact cggcctcacc cttggtggtc ccgggttcgc 2386441 cactgttggt ggtatcgaac cggattagcc tgctgacgac ctgggcgaca tcatcgctgt 2386501 ggtcgcttga agccccggtc tcatctgtca cagtcacctt tcctaccact cgtaaccctg 2386561 gcgagccgat cgcccctggc gcgccgggcc cgcgtcgtcg ccgagctgga tttgcttacg 2386621 tgggctgatt gcctggctcc tcctcacccc gttacccggg gcgcatcgtc gccgagctcg 2386681 atttgattgc ccggctcctc ctcaccccgt tacccggggc gcatcgtcgc cgagctcgat 2386741 ttgattgccc ggctcctcct caccccgtta cccggggcgc atcgtcgccg agctcgattt 2386801 gattgcccgg ctcctcctca ccccgttacc cggggcgcat cgtcgccgag ctaggttggg 2386861 ccggtgcggg gcaatccgat agccttagct gccagccccg gtggttggtt ggtccgagtg 2386921 gcggaatggc agacgcgcta gcttgaggtg ctagtgccct actaatgggc gtgggggttc 2386981 aagtcccccc tcggacacaa cttcttagct ctatagatca aaaccaagcc ttgacctcgt 2387041 caaggactaa cgttatgagt ttgctcatac caacgatggt catctcgttg atgtcctaat 2387101 acctaagaac tcaccgatca ctcgaaggtg cggccagaga tctcagcctc gaccgcgttc 2387161 gggttctcca ttccgtgccg aacagccagt atgtcgatag cctcgtcggt cgtccgatag 2387221 gcaacgtagt acctgaaggg ccggaggtag atgtgtcgat agtgcttgaa taacggcgca 2387281 aacgcgttcg gagcctgcgg aatccgcttc gtcacggcat cgacaaacaa gttgtaaagc 2387341 cgatcgatct gatctggcgc cgcgtccgcg tagtaggaaa acgcctcgaa taggtcgtct 2387401 tcaaccccgt tatggacgcg cagcctgcgc gtcatccgag ccgggcgcgg atccgcttgt 2387461 cgaagtcatc aatggtggac caatgagcat cgtcggtgtc attggcccgc gcttcgatga 2387521 gcgcctggtt ggcctcgctg atatgcatgc cctcggctag gtttccgttg atgtgctcga 2387581 cgagctcaat ctgctcatca cgcgacagtg cgtcgacgct cgccagcaat gcccggttga 2387641 ccaccactca acgataccca aggcagccaa cgccggcagc gcagcattcg gaggccaggc 2387701 tgaacttcaa gctggcaggt gtcatccgct cagttgaaag acctcaaccc gggtcgcagg 2387761 tggccaagtc ccccctcgga caccacatgt gacgggtcga agacgaggca cgccgcggac 2387821 gacttcgagg gtaagcagcc gtatcccggg gaggcctgcc atgaccactt ctgggcctat 2387881 ggcagctagc aaacgtcatg aatggaagcg ccaccatatg ccggcgatcc gacgttcgag 2387941 cgtttgcggc gctcgtttca gccagccgat ctactgccgg aactgcaagc ggcaggagtg 2388001 cattacacaa tcgctgtcga ggcggcggac gatccggccg agaatgagtc tctgttggcc 2388061 actgcgcgcc accatgattg gatagcgcgc gtgatcggtt gggtcccact cgccgatccg 2388121 gatgaggtta ccgagagctc gacgcacggg cggcaccgcc cggacgcctc ctggcgacga 2388181 gatctgcggt gccccggcct gctgccgccc gggtgccacc agccagtctt ggtcgtaggc 2388241 ttggtaggtc agcagccgga aatgcgaccg atgaatccac caagtggttt tctccggcgg 2388301 acgccgaccc gcaggtttcg cgaccgccgc gatgctggtc gcgtattggc cgacgaactt 2388361 gcgtcctatc gcggcaggga ccggttgctc gtcctcggcc ttgcccgcgg tggcgtcccc 2388421 gtcggctggg aagtcgcgtc ggcgctaggc gccgaattgg atgtatttct ggttcgcaag 2388481 ctcggcgtgc cgcagtggcg cgagctggcg atgggcgcgt tggccagtgg gggcggggtc 2388541 gtgatgaacg acgacgtggt ttccagcttg cgcatcaccg accagcaggt gcgtgcggcg 2388601 atcgacagcg agacggcaga gctccagcgg cgcgagctgg cgtatcgcgg cggacgccct 2388661 gtcgtcgatc cgcgcgccag gatcgtgatc ctggttgacg acggcatcgc caccggcgcg 2388721 agcatgctgg cggcggtgcg caccatccgt gccaccggac cggagtcgat cgtcgtcgcg 2388781 gtcccggtcg gtccggccac agcctgccgc gagctcgcgg cggaagccga cgacgtggtg 2388841 tgcgcaacca tgccggcagc gtttgaggcc gtcggccagg tctataacga ctttcatcag 2388901 gtcaccgacg acgaggtccg cgagctgctc gcgacgccaa ccacaggcgc agcgacctaa 2388961 cgagaggatt ctcgtgaggt gactgggatg gtcaggatgc gtggtcgagg gtctagatcc 2389021 ggagctgggc gacaaaccac ccgataacct cccacgacgc ccctaccgag gtcggtgtcg 2389081 ctgacgaccc tacttggcgc tgtcgtcgct tcggtccgcc gataccgccg actcctcggt 2389141 cgcttcgctg gcctcctccg atggctcctc actgccgatg acaccggcgt cgacggcttg 2389201 gctctcctcg ggggcttcct ccgggtagtc gacgtcggct tcctccgcga cacccgtttc 2389261 cccagcccca tcagcttcgt ccgcgccacc ttgctggcgt tctcgcaacg catcgacgat 2389321 cagcaacgcc acacccagca cgctggcccc gatgcatacc caggccacta gctggttgct 2389381 ggtgaccacc gcgaacacca aggccaggag cccaatcagg gccaagacca gcgcaatgat 2389441 cagcatcggt catcctccaa ccggctagca gcgactgccc aacctaccag gatctggctg 2389501 ccgacctcga aaactggcgc gtgtccggca cgcctggtgg ctagtttttg ccccggttga 2389561 attgatcgaa gccaccggca tccgcattgg aatcgaccgg cgccgccgat ccacgctggc 2389621 cgagttcctc cagctgcgat tccaggtagg tcttgagcct ggtgcggtac tcacgttcga 2389681 aggtacgcag ctgctcgagg cggccttcaa gcaccgcgcg ctgctggttg atggttccca 2389741 tgatctcgga gtgcttgcgt tccgcatcgg cctgtaaggc atcggccttc tcctgcgcct 2389801 ggcgcaactg ggcctcggat cgggattggg catcggccag catggcatcg gcacgctggc 2389861 gggcctcggc gaccgtggcg tcggcggtgt gtcgggcctc accgaggatc tgctccgcat 2389921 tggcacgggc atcggccagc atcttgtccg actcggcttt ggcggtgttt gtaagccggt 2389981 cggcggtgtc ttgggccaga ctcagcactc gcgccgcctt cagggcctgt tcctcgttca 2390041 tccccgccga gaccgccgcc ggcgccggct tgcccggttc gggctcatac gccgggattg 2390101 cctgggtggc ctgcggcgta acgccggcac cgccgcccgc ggcgagctct tgatccagct 2390161 cgttgatcct ctgacgcaga tcggagttct cttcgatcag gcgggtcagc tcgttttcca 2390221 ccaggtcgag gaaggcgtcg acctcatctt cgttgtaccc acgtttgccg ataggcggct 2390281 tactgaacgc cacattgtgg acgtcggcag gtgtaagcgg cattgtttgt cccctcgagt 2390341 tcctggacgg tcaaacgatc tggaagtgta gaacggagtg gtagccgtgg tgcaactacc 2390401 gtccatcctg tcacaccaga ctcggcggtt gccgattgga ctaagtaaat aaggaccaat 2390461 ttcaaactct aagaccaaat aaatcacaat ccttagattt gaaatcgtgc gcgccaaact 2390521 tgtccccaaa tcgtggccga accgtctctc aatcctcgtc atgcacccgg ccgtgtgacc 2390581 gcgccgcggc tcaggccgca gcaccaaacg ccagttgcat accgatgaac gcaaccagca 2390641 gcagcaccat gatcgacagg tcgaaccgga ccgcgccgat cgtgagttgc gggatcagcc 2390701 ggcgcagcac cttcaccggc ggatcagtga tcgacatgat gatctccaag atcaccacgg 2390761 tgacaccggt gggacgccag tcacggctga acgagcggat gaactcaacg acgacccgag 2390821 cgatcagcag cagccagaag atgaacagcg cgaacccaag gatctgaaaa aacaccacca 2390881 acgagagccc cgaccttact gaggaggatg aagaaatgtt gcgtcgccac cgatgcgggc 2390941 ggcaacgcca gcctaccgac tcgggtggcg tgcccacatc tcatggcggg ccacgccccg 2391001 cccagcgtgg atgcccaatg ggtctacagg cgaccgtcgc gtctattggt aggcgtagaa 2391061 cccggtttcg gcgatcctgc ggcgctcctc gggggacaca tcgacgtctg caggcgagag 2391121 caggaacacc ttggtcgcga ccttgtcgaa cgagccgcgc agcgcgaagg ccaggccggc 2391181 cgcgaaatcg accagccgct tggcatcggc gttgtccatc gacaccagat ccatgatgac 2391241 cgggctgccg tcgcggaacc gctcaccgat ggtgcgagcc tcgctgtagt ccttgggccg 2391301 cagcgtggtg atcttcgaga gcggatggcc atcctcgaac atcatcgcca tccggcgggg 2391361 gtccatcgct agcgcgccgc gggtggagtt gcgcagccac gatccgaagc gcggccgtgt 2391421 catctccgcg cggtcgaact cccggggccg gaaacgtggt tcgtccgcgt acccgccgcg 2391481 atatcccggt ggtggatagt cggccggctc accgcgcagg tcaccgcgtg aatcgctgcg 2391541 cgcgtcgtcg tagtcgcgcc catcgtagcg gccgtagtcg tcgtcgaatc ggggccgcgc 2391601 atacccgcgc gagggagcgc ggtcgtcgta gtactcgtcg tcgtaatcct ccatgggagc 2391661 cataccgaag taggccttga ccttgtgcag tgtgctcatt gcgtgacccc ttctagccct 2391721 gggagatctg ttgtctgtga tgaaggtgtg actacagtga ctattcacgg tgaccgtaac 2391781 cgccgcggac ccaatagcgc ggtaccgaca cgcacacagg tcgaaccatg tttgacggcg 2391841 acttcaaggt cgttggacat gcccgccgac agaccgatcg cgtgcgggaa catcgcacgc 2391901 acccggttgt gctccgattg cagccggtca aaggcctcgt ccgggtccca atccagcggc 2391961 ggaatgccca tcaacccgac cagttcgagg ccctctgact cctgcacctg cgcgcaaatc 2392021 cggtctacgg cgccgggcgt cgtgctgtcg acgccgcccc gggatccgtc accgtcgagg 2392081 ctgacctgga cgtaaacccg cagccgctcg ccacgacggt gttcggccag cgccgcaaca 2392141 accgcccgat ccagcgcggt caccaaccgc gagctgtcca ccgagtgagc ggtgtgcgcc 2392201 cagcgagcca gcgacccggc tttgttgcgt tgaatccggc ccaccatgtg ccagtgcaca 2392261 ccccccgagt gacccaactc ggcagccgcc aacaaccgat taagttcggc catcttggct 2392321 gaagcttcct gttcgcgcga ttcgccaacg gaccgacaac ccaatcgaaa caaaatcgca 2392381 acatcggttg ctggaaagaa tttggtaatc ggtagaagtt caatttcgcc gacattgcga 2392441 cccgccgcct ccgcggccgc cgcaagtcgc gatcgcattg ccgccaacgc atgcgtcaat 2392501 tccgattcgc ggtctggata cgccgaaaga tccgccgcca tcgcggtcat tccatccaca 2392561 ccaacgacgc gaaccgtccg gtgggcgcat cgcggcggtg gctgaacaac gtcggatcgg 2392621 ccaccgtgca gcggggatcg acgtcgatag actcaacacc caaatcgcgg agctggcaag 2392681 cgattccggc gcgcaggtcg actccgggag tgccggcagc ggtggtggtg cggctgcccg 2392741 gcaacgccgc ctcgacctca tcggccatcg ctgcgggcac ttcgtagttg cgaccactga 2392801 ccgcgggacc caacagtgcc gagatgtcgc ggacctgggc acccaggctc aacatcacct 2392861 ccagcgcgcg aaccaccaca ccgcgctgcg cgcctgcccg accggcatga accgcggcgg 2392921 cgataccggc ccgtgcgtcg gccatcagca ccggcacgca gtcggcggtc acaaccgcca 2392981 gcgccaatcg gggtgtagcg gtcaccaatc cgtcggtgtc atcgagtgcc gtattgcgcg 2393041 gctggtcgac cagctcgacc cgatccccgt gcacctggtt catccacacc actcggttgc 2393101 cgggcagtcc gatggctgcg gccagccgag cgcggtttgc cgccaccgcg gccgggtcgt 2393161 caccaacgtg gtcgccgagg ttgaaggtgt cgaacggtgg ggccgacaca ccacctgccc 2393221 gggtggtggt gacccgacgg atgcgaacac tcacgttccc agtatcgccg cgggcgatgt 2393281 gccgcgtact ggcgagcaag ccgatgctct cagcggcgca tgaagggcgg cacgtcgaca 2393341 tcgtcgtcat caccgccgat gctcagggtt gcgccgttgg tgtgcaacgg cacgctgacg 2393401 gcgtcgaccg gctcgaacaa ggtcgaggtg agcttgcctg ccttggctga ctcgatccgg 2393461 tgggcgccgc cggtctcgcc catcaccggc ttgcggccgg gaccgctgac gtcgaagccg 2393521 gccgcgatca cggtcacccg cacctcgtca ccgagcgaat cgtcgatgac ggtgccgaag 2393581 atgatgttgg catcggggtg agcggcgtct tgtaccaacg aggccgcctc gttgatctcg 2393641 aacaagccca agtcgctgcc gccggcgatc gacatcagca cgccttgcgc gccctccatc 2393701 gaggcttcca gcaacggcga gttgatggcg atctcggccg ctttgagcga ccggccttcg 2393761 ccccgggccg agccgatgcc catcagtgcg gtgccggcac cggacatgat gcccttgacg 2393821 tcggcgaagt cgacgttgat tagacccggg gtggtaatca ggtcggtgat gccctgcacg 2393881 ccgttgagca gcacctcgtc ggcgctacgg aaagcatcca tcagcgatac cgcggcatct 2393941 cccatctgca gcaaccggtc gttgggaatc acgatgaggg tgtcgcaact ctcccgcagc 2394001 gccgcgatgc cattttcggc ctgattgctg cgtcgcttgc cctcgaacga gaacggccgg 2394061 gtgaccacac cgacggtcaa cgcgcccagc ttgcgggcga tgctggcgac gacgggtgcc 2394121 cccccggtgc cggttccgcc cccctcgccg gcggtgacaa acaccatgtc ggcaccgcgc 2394181 agcagctctt cgatctcgtc cttggcgtcc tcggcggcct tacggccgac ctccggatcg 2394241 gcgccggcgc ccagcccgcg ggtggagtcg cggccgacgt cgagtttgac gtcggcatcg 2394301 ctcatcaaca acgcctgggc gtcggtgttg atcgcgatga attccacgcc tttgaggccc 2394361 tgctcgatca ttcggttgac ggcgttgaca ccgccaccac cgatacccac gaccttgatg 2394421 acggccaggt agttgtgcgg gggggtcatc gttcggcttc ctccctggtg gggctcggtt 2394481 cttcggtgtg tctgctggca aactctcaac ctcaaccata ggcttagagt tatgtcaagt 2394541 agttgctcgt agtcagaacc gtatggctac gacggttgct aaccgtgcag gcgcgccgat 2394601 acgcggcggg cattttttcg gctatttcac ggtcggcagg tcggggctgg acacgtcgta 2394661 cgttctgcct ggctgggtca acagcgccgc cagcttttcg gccttctctt cgcagcggtc 2394721 ggtggttccc cagatcacca cgcggccatc ggccaacgtc agggtgatcg aggccaccga 2394781 cggggccgcg atccgcccca cctggcttgc aacttcagga tgcagcgcgg tcaacacctg 2394841 cagcgccgcc ttggtcgtcg gatcgctagg accgggattg tccacatcga aataaggcaa 2394901 cgccggcggt ggcggatcgg tcgcgaagtc gacgccgtcg cggtcaaaaa ggtgcgggcc 2394961 gtccgaaaaa tccttgacca ccaccgggac ccgctcgacg atggtgatcc gcaaggccga 2395021 cgggtactgc cgctgcaccc gcgcactggc cacccgccgg atcgtggcca ctcggtcagc 2395081 aacctgttgg gtgtcgatct gcagcaacgg cgttgccggc cgcactctgg cggcgtcgag 2395141 aacctcctcg cggctcaccg ccccgatccc gatgatcacg atctcgcggg ccgacatcgc 2395201 cggcgtgaag tacagcgcga gcccaagccc gatcccgacg acggccagca cgaccgtcgc 2395261 gagcagcgcc ttcagccctc gaacaacacc tcgggcggcc ggtttggcgg ggttctgctc 2395321 actgacgatc tgcccgcggg ctcgccgttt ggccgcgcgg cgagcctgct cgatcgcggt 2395381 agctcgagcc tgcgcggcgc gacgttcggc acgttcgcgg cgggcgcgcc gacgcggccc 2395441 ttcgaattct gggtgctcgg ccggttcgtc cttcgattcg gtggccaacg gctccgtaac 2395501 cgcctcctcg tcggcggcgt cgtcggccac gcgctcgatc tgtgggtcct cgttgtgttc 2395561 cgtcatccca gcacccccgg acggccgggg gcgcttcggt tggcccggac ccgaagggcg 2395621 gtcaggattt ccgggcccag caaggtcacg tctccggcac ccatcgtgac gatgacgtcg 2395681 cccggactag cggcggcggc cacttgctgt gcgaccgccg aaaaatccgg gacgtagcgc 2395741 atcggcacag tgacgtgctc agcgacgctg gctccgctga caccggccag cggttgttca 2395801 cgagctccgt agacgtcgag tacgaacacc tcgtcagcgg cattcagcgc acgcccaaac 2395861 tcagcagcga atgcctttgt ccgcgaatac aaatggggtt gaaacacaac catgcagcgg 2395921 ccaccgtcgc cctgttcgag caccatgcgc gccgccgcca gtgtcgcgct gatctccgtc 2395981 gggtggtggg cgtagtcatc gaacacgcgc accgacgcct ttccgacgcc gcaggtccca 2396041 accagttcga atcgtcgccg cactccttcg aagccggcca gcccgtcgag cacctcgtcg 2396101 gccggggcgc cgatctgcac cgcggccagc agcgctccca gcgcgttgag cgccatgtgt 2396161 cgcccgggca ccgacagccg catcacgcgg ggaccctgtg ctgtggctag ttctgaggcc 2396221 aaccggatat gtgcgaccgc gccgaccccc tgttgctgcc acgagaccaa cgtggctgcc 2396281 atggtctcac ccggcaccga cccgtatcgc agcactcgaa ttcccagctc agtcgcgcgc 2396341 tgagccagcg cggcccctcc ggggtcgtca gtgcacacca ccagcgcacc cccggggaca 2396401 atgcgctcca cgaaggagtc gaacaccgca acatacgcct cgacgctgcc gtagaagtcc 2396461 aggtgatcgg actcgatgtt ggtgatcacc gcgacgtggg gtgtgtactg caacagcgag 2396521 ccatcgcttt cgtcggcttc ggcgacgaaa cagtcgccac tgccgtgatg ggcgttggta 2396581 ccggcctccc ccagctcacc gccgaccgca aaggacgggt caagcccgca gtgctgcagg 2396641 gcgacgatca gcatggacgt cgtcgttgtc ttgccgtgcg tgccggtgac catcaatgtg 2396701 gtgcgcccgg ccatcaactt ggccagcacg gccggccgca gcaccacggg aatgccgcgg 2396761 cgcctcgctt cgacgagctc ggggttggtt ttggggatgg cggcatgggt agtgacgacc 2396821 gccgtggcgc caccgggcaa caggtccagc gacgacgcgt cgtgtccgat ccggatcaac 2396881 gcgccccgcg cccgcagcgc atgcacaccg cgcgactcct tggcgtctga cccggagacc 2396941 agcccgccgc ggtccagcag gattcgggcg atgcccgaca tgccagctcc gccgatgccg 2397001 accatgtgca cccgccgcag atcgggcggc aactgctcgg tgctcacgtc gttgtcctgg 2397061 caccggcccc ggtggcgacg gccagcgcgg cccgggccac ctggcccgcg gcatcgcgat 2397121 gtcccaccct ggctgcggcc gcggtcatcg cggccagccg cgcggggtcg gtgagcagcc 2397181 cggcaacctg gcgggccacc aactcggggg tcagggcggc gtcggcgacc accatgccgc 2397241 cgccggcatt gactaccggc aacgcattca gccgctgttc accgttgccg atcggcagcg 2397301 gcacgtagat ggccggcaga ccgacggcgg atacttcggc gaccgtcatc gccccggccc 2397361 ggcagatcac cagatcggcg gcggcgtagg ccagctccat ccggtccaaa tagggcaccg 2397421 ccacgtacgg tgggtcacct tgagcccgac ggcgcaactc cagcacgttc tggggtccat 2397481 gggcatgcag cacgcaaaca ccggcggcgg ccaggtcggc ggcggcgccg gacaccgccc 2397541 ggttgagcga gaccgcgccc tgcgaacccc cgaacaccag cagcacccgc gcgtcgtcgg 2397601 ggaagccgaa gtgtgcccgc gcctcggctc gcagcaccgc gcggtccagc gcggcgatcg 2397661 acgcacggac cgggacccca accacctcgg cgcgccgcag cccggaatcc ggcaccgcgg 2397721 agagcacccg gtccgcggta tgggcgccga cccggttggc cagtcccgcc ctggcgttgg 2397781 cttcgtggat caccaccggg atccggcgcc ggcgccgggg cggcaaaggc aggccgcgag 2397841 cggctaggta agccggtagc gcgacgtacc caccgaaacc gacgacgacg tcggcgtcga 2397901 catcgtcgag cacgtcccgg gcctcccgga cggcgcgcca cacccgcgac ggcagccggg 2397961 ccaggtcgcc gccgggcttg cgcggcatcg gcaccgccgt gatcagctcc aggtggtagc 2398021 cgcgctgggg caccagcctg gtctctagtc cacggggggt gcccaacgcg gtaatccgga 2398081 cgcgcggatc caacgcgacc aaggcgtcgg cgacggccat ggcgggctcg acgtgcccgg 2398141 cggtcccgcc gccggcgaga acgaccgaca aggaatcagc agacggcgag gaaccacaag 2398201 acggcgaggc ggcatcggcg ggccggggcg ccgttgcccc gcgcccgccg gccggctggc 2398261 tgaccgtgtc cttcacccgt aacgctgacc ttccaatgcc cgaacgcgcc gtgtgcgacg 2398321 ctggcccgcg taccgctggc cagctccatg atgcactgat cgacgaaccg gcggatcggc 2398381 cgtgcggggc gagccgggtc gcgggggcag gcccatctgc cgggcaggct gtccgggcgc 2398441 cgtgcggggg gtcttccgcg cgggctgcgt ttgggccggt tgcgggttgg cgcgcttgcg 2398501 gtcacgaaac gcctcgagac gagggggcag atacggctcg ggcagcggca gccgcagcaa 2398561 ccggttcacc ttgtcgtcgc gcccagcccg cagcgcggcc accgcctccg gttcgtggcg 2398621 agccgcgttg gcgatgatgc ctatcagcga aagtgttgcg gccgtggagg ttccaccggc 2398681 ggagatgagc ggcagctgca ggccggtgac gggcagcagc ccgatcacat agccgatgtt 2398741 gatgaacgcc tgtcccagca cccacagtgt cgtggtggcg gtcagcagcc gcaggaacgg 2398801 gtcggcggac cggctagcga tgcgcatgcc ggtgtaggcg aacaatccga atagccccag 2398861 cagtccgagc gcgccgacga gacccagctc ttcgccgatg atggcgaaaa tgaagtcgtt 2398921 gtgggcgttg ggcaagtagt tccacttggc cacgccttgg cccagaccgt cgccgaaaat 2398981 gccaccttga gccagcgcga actttgcctg tcgggcctgg tagccggagt cttgcggatc 2399041 gttttcgggg ttgagccacg accgcacccg gtcggatcgg tagcccgcgg acaccgccag 2399101 gatggcggcc gagacgacga ccgccgccag tgagctgagg aagacgcgca gcggcagccc 2399161 cgcataccac agcaggccca acaagatgat gcccatcgac acggtctgtc cgaggtcggg 2399221 ctgggccacg atcagcgcca gcgcaacgac ggcggccggc accagtggaa tcagcatctc 2399281 gcgcagtgaa gcccgttcca tgcgccgggc ggccagcaga tgcgctcccc agatggcgaa 2399341 cgccatctta gccagctcag agggctgcat cgagaagccc gcgaccacga accagccgcg 2399401 cgagccgttg gcctccttgc cgatccccgg caccagcacc agcaccagca tcacgatggt 2399461 gatcgcgaaa ccggagaagg cgatgcgccg catgaaccgc accgacatcc gcagacagac 2399521 atagccgccg ataagaccca caagcgtcca caagacctgc ttgccgaaga tcacccaagc 2399581 cgatccgtcg tcgtcgtagg accgcaccgc cgatgccgac agcaccatga tcagtccaag 2399641 ggtggtcagc aatgcggcaa cggcgatgat gaggtgaaac gaggtcatcg gacggcccag 2399701 ccaggcaccg aaacgggtgc ggggcctcgc cgaacccggg ttagaggctt cttccgggcc 2399761 cgtccgctgc ccctcgaccg gctcggcccc tcgagtctgg gagccgtcgg tgtcgctggt 2399821 gccccgacgc agcaaccggg ttagcacgct gcccccgcct accggatcac cgcgcggacc 2399881 gcggtcgcga atgcctcgcc ccggtcggca taaccggtga actggtcgaa tgaggcgccg 2399941 gccggtgcca gcagcacggt gtcaccgggt tgggccatcc gccgggccgc ggccaccgca 2400001 gcggtcatca cggcagcgcc aacggtctca ccggctttgt catcttttgc cacatctaga 2400061 acacaagcaa caggaacctc aacagtcgca ggcataccag tatcctcgcc tgccacaacc 2400121 tgaacgactg ggacatcggg cgcgtgtcgt gataacgcct cggcaaccgc tgcgcgatcc 2400181 cggccgatca gcaccgcacc gaccagccgc gacgccatcg ccgcaacctc ggcgtgaagc 2400241 gacgcgccct tgagcaggcc accggcgatc catacaaccc tcgggtatgc aagcaccgaa 2400301 gcccgcgcgg cgtgcgggtt ggtggccttg gagtcgtcca cgtaggtgat gccgtcggca 2400361 acggccacca cctcggcgcg gtgtcggccc actcgaaacg acgtgaccgc gtcggcgatc 2400421 gcaccggcgg gcaccccgac cgagcgggcc agcgccgccg cggccagggc gtcaagcacg 2400481 ccgaccggac ctggcaccgg tatcgacgcg accggcagca gcgtcaagtc gtcggagaag 2400541 gcgcgatcga ccaggtgggc gtcgcgcacg cccagttccc ccgcggccgg ctcgccgagc 2400601 cggaagccga cccgcacctg cgccggtgag ccgtccagca gtgcggccgc tcggctgtca 2400661 tccagcccgg ccaccgctac cccgccggtc agcacccggg ccttggccgc ggtgtattcg 2400721 gccatcgtgg catgccagtc caggtggtct tcggcaatgt tgagcaccgc gccggcctcg 2400781 ggccgcagcg acggcgccca gtgcagctgg aaactggaca actccacggc cagcagctcg 2400841 gccggctcgt ccagcacatc cagcaccgca ctgccgatat tgccgcacag cacggcgcgg 2400901 cggccaccgg cgatcagcat ggcgtgcagc atcgacgtcg tggtggtctt gccgttggtg 2400961 ccggtcacca ccagccagct gcgcggcggt ccgtagcagc ccgctgcgtc tagccgccag 2401021 gctaactcca cgtcacccca gatcggcacc cccgccgccg cggccgcggc cagtagcggg 2401081 gttgcgggcg agaagccggg actggcgacc accagcgcat acccggttat ctgctgcacc 2401141 gcgtccgagg aactaacggt cggcagccca cgttcggcgt gcggtcgcag catgaccgga 2401201 tcgtcgtcgc acaccgtcgg cgtcgcacca aaccgagtca gcaccgcggc caccgcctga 2401261 ccggtcaccc ggccaccggc taccaacacg ggcgcacccg gccccagagg gtcaagcacg 2401321 tcaggcaccg accgcggcaa gccactcacc gtagaacaag gccacgccca gaccgcaggt 2401381 gatcgcggtg agcagccaga accggatgat gaccgtggtt tcagcccaac cgaccaactc 2401441 gaaatggtgg tggaagggcg ccatccgaaa catccggcgc ccggtggtcc ggaaggtcag 2401501 gatttgcaac accaccgagg tgatctcggc gacgaacagc gcacccagca ccaccgcaag 2401561 gatctcggtg cggctggtca ccgacaaccc cgcgatgacg ccgcccaacg ccagcgaccc 2401621 agtgtcaccc atgaagatct tggcgggcgc ggcgttccac cacaaaaaac cgatgcaggc 2401681 gccagcggtt gcggccgcga tgagcgccag gtccagcggg tcgcgcacgt tgtagcagcc 2401741 caggcccggc gccgtcacgc acgcgttgcg gtactgccag aaggtgatca gcacgtaggc 2401801 ggcggtgacc atcgccatgg tgccggcggc cagcccgtcc aggccatcgg tgaagttgac 2401861 cgcgttcgac caggcgctga cgatgaccac gcagaacaac acgaacagca ccggcgccaa 2401921 tgtgacggtg gcgatctcac gcacgtagga cagatccgcg ctgcccggtg tcaggccggc 2401981 agcattccgg aactgcagca ccagcacgcc aaacagcacg gcggaggtga tctgcccgac 2402041 ggtcttggcc gtcttgttca acccgagatt gcgcgacctg cggatcttga tcagatcgtc 2402101 gatgaacccg acgccgccca aagcggtggc taggcccagc accaacagac ccgatgcgcc 2402161 gatgccttca ccgtcaaacg ccaggcccgc taggtgggcg cccaggtagc ccgcccagat 2402221 gccggccaga atcgccaccc cgcccatcga cggcgtaccg cgcttggtgt ggtggctggg 2402281 cgggccatcc tcacggatct ggtggccgaa gccctgctta gtgaacaacc ggatcagcac 2402341 cggggtcagc aagatggaca ccgtcaccgc tacggcaacg gcgataagga tctgcctcat 2402401 gggcgcacac tcccgcatgt gtcgtctgcg accaatgcat cggccaccgc acccagcccg 2402461 gccgcgttcg aggccttgac caagaccaca tccccgggtc gcagctcggc gcgcagtagt 2402521 gccagggcgg cgtcaccgtc ggccacattg acggccgtgc gatccgcacc gtgatcagca 2402581 gtggcttccc ccgagcccca cgccccctcc aggaccgctc cgtggtgcat ggcgctgatc 2402641 gacctcccgg ttcccacgac aacgagtcga gacacatcta agcgcaccgc gagccggccg 2402701 atgcgatcgt gctcggctat cgcgtcctca cccagctcgg ccatctcacc cagcaccgcc 2402761 cagctgcggc gggtggcctc gggttggtgc gcgatccagg ccagcgcctg cagcccggcc 2402821 cgcatggagt cggggttggc gttgtaggcg tcgtcgatca ccgtcacccc gtcgccgcgg 2402881 gtggtcacct gcatccgatg ccgcgacacc ggcggcgccg cggtcagcgc ggccgcgacc 2402941 tgttcaacgc tggccccaca ctccagcgcg accgccgcgg cgcacagcgc gttagtgacc 2403001 tggtggtcgc cgcagacccc gagtcggacc tcggcttggg catcgtgggc atgcagcgta 2403061 aagcgcggcc tggccaattc gtccagcgac accggccccg cccaaacgtc accggtgttg 2403121 tcccggctga cccgcaccac ccgggccgcg gtcagcttgg ccatcgccgc caccgcgggg 2403181 tcatcagcgt tgaggacgac cgctccggaa tgcggaacag cctgcggcag ttcggctttg 2403241 gtctgtgcga tgacctcgcg ggagccgaac tcacccaaat gtgcggtgcc gacgttgagc 2403301 acgactccga tcgacggggg cgcgatctcg gcgagcgcgg cgatgttgcc gtgatggcgt 2403361 gccgccatct ccaaaatcag gtagtcggtg cgccgcgtcg cgcgcagcac cgtccacggg 2403421 tgacccagct cgttgttgaa cgatccgggc ggggccacca cctcccccag cggggccagc 2403481 acggcggcca tcaggtcctt ggtcgacgtc ttgcccgacg agccggtgat cccgatgatg 2403541 gtgagcccgc cggccaccaa ctgcgcggcc accgcggtgg ccagcttggc cagcgcggcc 2403601 agcaccgccg cccccgaccc gtcgttgtcg tgctcgagga cgccggccaa tacgttcggc 2403661 gcggccactg gcggaaccac gatggccggc acccccaccg ggcgggcggc cagcacgacg 2403721 gcggcgcccg cggctaccgc cgacgcggca tggtcgtggc cgtcggcgcg cgcccccggc 2403781 agggcgagga acagcccgcc cgggccgatg gcgcgcgagt cgaactcgac ggtcccggtg 2403841 acgcggcggt gcgcggcgtc ttgcggggag atatcggcca ctgcgccccc gacgatctcg 2403901 gcgatctgcg cgacggtcag ctcgatcatg cgcgccgctc gagggcctct agcgcggcag 2403961 ccagctccac ccggtcgtcg aacgggcgga cccgcccgcc gccgcgttgc ccggtctcgt 2404021 ggcctttgcc ggcgatgagc accacgtcgc cggggcgcgc ccaggcgacc gcgtgccgga 2404081 tcgcgtcccg ccggtctgcg atctcgacga cctgggcatc accgccgact tcggccgccc 2404141 cagccaggat ttcgcggcgg atcgccgtgg gatcttcgtc acgcgggttg tcgtcggtga 2404201 cgaccaccaa gtcggccagc tgcgcggcta tccggcccat cggggcccgc ttgcccgggt 2404261 cacgatcgcc gccggcgccg aacaccaccg ccagccggcg gtccgggtgc gccaaggtgg 2404321 tcagcaccga ccgcagcgct tccggtttgt gcgcgtagtc gaccagcgcg agaaagccct 2404381 ggccgcggtc gatctgctcg agccgccccg ggacccggat ctcacgcagg cccggcaccg 2404441 cctgttccgg ggagaccccg acggtgtcca gaatcgccag ggcgaccagg caattggcga 2404501 cgttgtagcg gcccggtagc cggattccga tgtgatgccc tacgccggcg gggtcgatgg 2404561 cggtgaattg ttgcccgccc gcgtccgtgg gcgccacatc cgtggcgcgc cagtgtgcgg 2404621 gccggtcggc ggcgctgacg gtgatcgcgt cggcggcccg cgccgccatc gcgcgcccgg 2404681 cgtcgtcgtc gatgcacacc acggcggtgc gggcgcgcag tgccgagtcc ggatcgaaca 2404741 atgacgcctt ggcctcgaag tagtcggcca tgctggggtg gaaatccagg tggtcacggg 2404801 agagattggt gaaggcgccg acggcgaacc gggtgccgtc cacccggccc agcgccagcg 2404861 cgtggctgga cacctccatg accacggtgt ccaccccgcg ttcgaccatc gccgccagca 2404921 tcgcctgcag cgtgggggcc tccggggtgg tcagcgcgct gggaaggtcg gcgccgccga 2404981 cgcggatgcc gatggtgccg atcagcccgg cgacgcgtcc ggcagcccgt aacccggcct 2405041 cgaccagata ggtggtggtg gtcttgccgg acgttccggt gatcccgata accgtcaacc 2405101 gctcggacgg atgcccgtac acggtggcgg ccaagccgcc gagcacgccg cggggtgcgg 2405161 ggtgcaccaa cacgggcacg gccgctcgtc cggcgatctc ggcgaccccg gcggggtcgg 2405221 tgagcaccgc gacggcgccg cgtgcgatcg cgtcgccgac gtggcgggcc ccgtgggtgg 2405281 tcgagccggt cagggcggcg aacaggtcac cgggtgacac gtcctgggcg cgcagcgtga 2405341 ccccggtgac cgtccggtcc tcggtgacgg cacgctgagc tggaccctcg gccagggccg 2405401 cgccgacctg atcggccagt gcggccaacc gaacgcccac gacggcgttg gggcgcaagc 2405461 cagtgggcgc agcctccacc tgtgtcgcca cctccgttcg ccgccgcgag atccctcggg 2405521 ccagcgatga caccctaccg acagggcgcg cacactcacc cagtcgggtt ttgccgcgac 2405581 acctggccct cggcggcggc gccgatccag gtgccgatgc gccgcgcggc ggtgaaggcg 2405641 gcccaggcca gggcgccaac cagcgccgca tcggtgtcga gcagggatcg ggccgcggcg 2405701 acgtcgtcgt cggtcacctg atgcggggcc aggccggtca gcagggcaag acgggtgggc 2405761 gcgtgcaggt cggcgggcag ctcggcggtg tgctcgttcg tccagcgact gctcatcggc 2405821 attggctcgc cgtgccacga ccccacgacc cgcctgacca cctgacgagt cggtggcggc 2405881 aggtgcggcg cggtgtccag gtggtggctg agcgcggcga acgcggttgc tatgggctcg 2405941 gacggtgttg cccatgccag atcgtcgggc agcgttcgcg gctcgagccg gcgggtggag 2406001 cggcccggcc gatgctccgc gcgcaccttg cgggcgaaca ccagtccacc ggcgcggcgc 2406061 atgagctgtt gggcgcgcgg gccccccggc aggaaggttt cgtccagcag caccaggacc 2406121 aggcgtgcga tgaagtggaa ttgcaccgcg gtgcccaggt attcggcggc gacatccggg 2406181 ccgaacggtg ccggcggtcc cgccggtgtc ccggttcctg ccgcccacgc cacatacggc 2406241 gcgttcgggt caccggcggc aggtgctgtg ccggccaaga tcgccgcggc ggtgtcggtt 2406301 tggcctgccg cgtacagcat ggtggtgtgt gcgtcgacgc accaggggca gcgcaggctg 2406361 gccgcgacgg cggcggcgac ggcttccttg cggccacgcg gcacctggcc caccagcagt 2406421 gtctcgcgca acgtcgccca gccggcggtg agcagtccct cgtccgggga cagcatggcg 2406481 agcggctcgg gcagccggcc gaactcgcgg cgggcctcgg catagacctc ggcgaccgcg 2406541 ccgccggctc ggcggggcgc gacgggctca atatggttga caaatttcat gattcgactc 2406601 cctcctgggt ggtgccgact ctggccaggg ccgcgtcgat cgccgttcgc gcccgctctc 2406661 cggcgccgtc gtcgtcgagc agcagcagcg cccagttggc ctccatcgcg taggcgtgca 2406721 gctcgaacgc gagttggcgc gcttcgatat ccgcccggat ctcgccccgg cgttgcgccg 2406781 tttcgacgtc ggccgtgatg gcggcgattc cggcccgccc ggtcgcggcg atgcggtcgc 2406841 gcaccgggcc aggctgtgag tccacgtcgg cggccgcggc cgcgaaaaag cagccgccgg 2406901 ggaacacgtc gcgttccagg tatccgaccc acgcatgcat gagggcgcgc acccggtcca 2406961 ccccgggcgg cgctgccatc gcgggagcca cgacctcggc ttcgaacacg ctcacggcgg 2407021 cctcgacggt cgccagctgc agctgctcct tggcgccgaa atgccggaac aggcccgact 2407081 tgctcatgcc cagccgcccg gcaagctcgc cgatggacag ccccgagagc cccttcaccg 2407141 aggcgatatc catcgcggcg cgcaggatct gcgcccgggt ttggcggccg acgtcggcgc 2407201 taggcatggc ttttgacctc ccggtcgtct ccggcgaacg catccaccag cggggccgac 2407261 cggtccaggg cggccagcac ctggtcgcgg cccgccgagg gcagcgcgag cgccacctcc 2407321 gcgacaccgg cccggcggta ctcgtgcagg gtcgccgggt cgccggccga cgagtacaca 2407381 cagacctggg cggtcgccgg atctcgcccg gcacgctcga acgcggcgtg cagcatcggc 2407441 aacgcgccca ggagctcgcc gtacccctcg atcggctgcc aaccgtcgcc gtggcgggcg 2407501 atcacctcga acgcccgcgc actgggccgg cacccgaaca gcaccggcgg cgccacggcc 2407561 ggtttcggcc acgcccacga cggcggcacc gacgcgtgcg tgccctcgta gtggaccggc 2407621 tctgcggccc atagcgcccg catggcggcg agcttgtcca ccgtcaccgc gatccggtcg 2407681 gcgaacggca cgccgtggtc ggcgagctcc tccacgttcc acccgaaacc cacccccagc 2407741 acgaaccgct cgccggacat ggcgcacagc gaggcgatct gtttggccag caggatcgga 2407801 tcatgcaccg ccaccaggca ggccccggtg cccacgcgca gccgcgtcgt gaccgccgcg 2407861 gcggcggcca gcgccaccac cgggtcatag cagcggcgat accagtccgg cagctctcca 2407921 ccgggccacg gcgtgctcct gctgatcggc acgtgcgtct tctccggcac atacaggccc 2407981 gcgaagccgc gctcctcggc ccacaccgcg accaactgcg ggggtggggt caggtcggtg 2408041 acgaactgca tgagcgagac gagcatcggc ggcggctttc attaagcacg aacgttcgtg 2408101 tttaacgatg gtccgcctgg ggcgtgctgt caatgccgga ttgcgtgacc gctcgctcgg 2408161 ggcccgggtc agccgtcggc gccgtttgct ccaggggtgc cgaacagcac accacggctg 2408221 ccgccggcac caccacttcc gacgggtatg ccgaagccgc cggccccgcc gttgccgccg 2408281 ttgccgatca ccacggcgtt gccgccgttg cccccgttac caccgtcgcc tttaacgcct 2408341 ggggggccgt cgccgccttc cccgccgttg ccgccgcgcc cgccgtcacc gatcagcctg 2408401 gcgttgccgc cggcgccgcc gtcaccggca ttacccggcg tgggagcctg tccgccgttg 2408461 ccgccgcgcc cgccgacgcc gccgttgccg tagaacagcc cgccgttgcc gccggccgcg 2408521 ccggcgccgc cggaccccgc atcggcaagg gtggagctgt tgtcgttgcc cccgttgccg 2408581 ccggcaccgc cgttgccgcc gttgccgatc agcccgacgt gcccaccggc cccaccggcg 2408641 ccgccggccc caccgctgtt gccgtggcca ccgttgccgc cgtgcccccc gtcgccgcca 2408701 ttaccgatca gggcggcccc gccggcacca cccgccccac cgctggcgcc ggtgttgctg 2408761 agcccgccgt taccgccgtc accgccggcg ccaccgttgc cgatcagccc ggcggcgccg 2408821 ccggcaccgc cgtttccgcc gtgcaccccg gtcaccccgt tgcttccgtt cccaccagcc 2408881 ccgccggccc cgccgtcgcc gtacagcagc ccgccgcgcc ctccgtcacc accggcgccg 2408941 ccggcccccg gtgccccgga tgcgctgccg ccggccccgc cggccccgcc atggccccac 2409001 agcccggcgg ccccgccggc cccgccggcg gggctgacgc ccccggcggt gccgagcccg 2409061 gccgcacccc cgggcccccc gttgccgtac agccatccgc cattgccgcc cgcacctccc 2409121 gctccggtgg ccgcacccgc gcctccggcc ccgccgttgc cgatcagccc cgccgatccg 2409181 ccgttgccgc cgttgggatt ggcggtgtca ccggccccgc cgttaccacc gttgccgtac 2409241 aacaagccgc cgggcccacc gttttgcccg ggacccccgt cggcgccgtt gccgatcaac 2409301 gggcgcccca gcaacgtttg ggtgggcgcg ttgatcgcgt tgagcagggt ctgctgcacg 2409361 ttggcggcct cggcgctggc atacgagccc gcgccggcgt tcagggcccg cacgaactgg 2409421 ttgtgatacg ccgccagctg agcgctcaac gtctgatagc cctgggcgtg cgacccgaac 2409481 aacgccgaga tcgccaccga cacctcgtcg gcgccggccg ccaggattcc catcgtgggg 2409541 cctgccgccg ccgcactggc ggcgctgatc gacgacccga tgttcgccaa atccgtggcg 2409601 gccgctgcca tcacctccgg cgccgcaatc acaaacgaca tcccgcacct ccgaccagct 2409661 cagcacgact tcacgaatcc cagacctgcg acaccgtcgg cagggctttc gatcctataa 2409721 caatctggaa acaggatgtc gcactttcct taaaagcgct tccgccaacc cgatcgtcag 2409781 cgcgcacatg ttgcgcaaaa gttgttggag ccgaaacgaa ccggcgcgcg ccgttaccgg 2409841 cgccgccgcc ctaggtggcc tgcaagacca aaggaggccc gggatcgggt gacagcggga 2409901 cgttttcgcg ctgcatcagc cagcccgcga tgttgtggaa cagcggggcg gccgagtgcc 2409961 caggcgcgcc gtcggagttg cgcgccgggt tgtccaacat gatgccgatc acgtagcggg 2410021 gattgtcggc agtggcgatt ccggcgaagg tgatccaata cacgtcgtcg aagtagcagc 2410081 cgcagccagg gttgatctgc tgcgcggtac cggtcttgcc ggccatctga tagccgggca 2410141 ccccggccgt cggcccggta ccctgctggt agcccatcgg atcgcgttgc accacggcac 2410201 gcagcatctg gcgcacggtc tgggcggtct gcgccgacac cacgcgaatg tcgtcggggc 2410261 gcggttcttc ggtccggctg ccgtcgggtg cgacggtggc cttgataatg cgtgggggta 2410321 cccgcactcc atcgttggcg atggcctggt acatgccggt catctgcagc aaagtcatcg 2410381 aaagaccttg gccaatagga agattagcga acgtactgcc cgaccactgg tcgattggcg 2410441 gcaccagtcc ggcgctctca ccgggcaggc ccacgccggt gcgctgtccc aacccgaact 2410501 tgcggagcat atcgtaatag cgttccggtc cgacacgttg ggaaagcatc agcgtgccga 2410561 cgttggagga ctttccgaac acccccgtgg tggtataggg catcacgccg tgctcccaag 2410621 cgtcatgcac ggtaacaccg cccatctgga tcgagccagg cacctgtagc acctcgtcgg 2410681 ggctgctcaa cccgtgctcg atgaccgcgg acgcggcgac gatcttgttc accgagcccg 2410741 gctcgaaggg cgacgacacc gccgggttgc ccaactgctt gtcgccctgg cgcccgatgt 2410801 cttgcgacgg gtcgaaggtg ttgtcgttgg ccatcgcgag cacctcgccg gtcttggcgt 2410861 ccaggacgac ggccgagacg ttgtgagccc ccgataggtt cttggcctgc tgcacctgct 2410921 gctgcacgta gaactggatg tcgttgtcga gggtgagcac gacggtggaa ccgtggaccg 2410981 ccttgtgccg attccggtag ctgccgggga tgacgacgcc gtctgaccca cggtcgtagg 2411041 tgaccgatcc gtcggttccg gccagcaccg catccaggga gtcctccaga cccagcagcc 2411101 catgaccatc ccagtcgatg ccaccgacga cgtttgccgc cagcgaccca cccgggtact 2411161 gacgcagatc ctgtctttcc gcaccgacct cgggatactt cgcgcagatc gcgctggcga 2411221 cagccgggtc gaccgcacgc gccaagtaga cgaaggtctc gtcgctttgc agcttcttca 2411281 gcacggccgc ggcatctggc ttgttgttca gcttgccggc gacctcctgg gcgatatcgc 2411341 gcaggcgctg ctgcgggtcg ggtgcagccg acgtcttctt cctggcctct tccaattgcc 2411401 gccgaatccg cttcggctgg aacgtcaggg cacgcgcctc gatggtgaac gcgagccggt 2411461 cattgttgcg gtcgacgatg ctgccgcgag ccgctggctg gacgtcggtg accttgagtt 2411521 ggccggccgc ctgcgcacgc aggcccgcgg catgtgatac ctgcagaaag aacaattgtg 2411581 ttgccgcgac caacatcaac accaagatga ccgcgtttcc ggtccgatgc cgaaagacga 2411641 acgacgcacc gcgcgtcccg acgtccacca cctgccgggt gcgcctcgca cgagtcgagc 2411701 gacccgcggg tgcgacgtct gaccgtgtcg cagggcggga tttcgtggct tcctgggctt 2411761 gccgggcttt ctgcgttttg ccgggccgtt tgcgttgccc aacctcctgg gctcccggtg 2411821 gccggcgcaa accgcgcgcc ggtcgcgtcg actgcgactg actggcccgc ctgggggcgg 2411881 cgcggctcac ctgggagccc ccggcgccgt tggcaccggc gccgtgacgg gaccgaactg 2411941 ttcgccgttg gccggcactg gagcgggtgg tgccaccatg ggttgcgacc cacccgacag 2412001 cccgggcgtc gcagccaccg gtgctggtcc agggagcccg gccgggggcg ccgcacccac 2412061 ctggagcggc accggatttt ccgctggtgc cggggatggc actgcgccga gcggaggagc 2412121 cggcatcgga cccggcgccc caggtatcgg caccggaccg ggcagctgcg ggccggcctg 2412181 ggtgggcagg tgggttgcgc cgcccagcgt cgctgtgccg tctggggtac gcaccagcac 2412241 ctccgggcca gaccgggcgg gcggagcggg atcatcgggg cctggtgtca cccggaccgg 2412301 cacctcgagg ggcaccgccg cgggtttcgg gggcggcggc ggatcttcgg gcaacttcgt 2412361 gttcagcggc ggcggtggaa ctccgtcagc cggcttgggt gtaccgacca ccacccaatt 2412421 gccgtccgga tcctgagcca ggtgggcggt atccctcgtc gggatcatgc cctggcgacg 2412481 agccgcctcg gccagcgccg gcgccgacgc agcctcgcgt acgtcgcgtt ccagcgcttc 2412541 cttgtgctgc tgcagcatcc gggtccgctc ccgggcgttg ctcagctggt aggacctctc 2412601 ggcggcatcg gtggacaacc acagtgtgag gcctagtccg acgccgagcg aaccgataac 2412661 cagcaccaca aacggaacct tgtttgccaa cgtgcgcggc cgcaggtcga tcgacgtgag 2412721 ccgggcggcg agacgctcca tcggcgtagg acggaccagc ttgggcgcct tggcttttcg 2412781 ggccttggcc cgcgccttgg cctggctggt gttctttgcg ggggccggcc ggtcgaacgg 2412841 gctgagcatc gggctggttt gcggtccagg gcgcgacacc cgggcctgcc ggccgggtgc 2412901 cgaggtcttg ccggcacggc tccggatgcg gcgcgacggc gccgagttcg tagtcgttcg 2412961 cctcgtcgcc gcggcaggac tgtcggctct cctgcgacga tcgctgctgc ggcttttcgg 2413021 tgcctcacgc ttggccctca tgaatcaccc ttctcggttg cccattgctg cgattgcgcc 2413081 cggtgctcga ctcgttgcag ggcccgcaac cgcactggag tactgcgggg attgcgttcg 2413141 atctcagcca cactcgctcg ttcggcgccg tgcgttaacg aacggaatcg cggctcatgg 2413201 ccgggaagtt cgaccggaag tcccgcaggg gtggccgacg cgactgcctc ggcgaacacc 2413261 cgtttgacga tcctgtcctc tagcgactgg taggccagca ccgcgatgcg cccaccgata 2413321 gcgagggcat ccagcgcggc aggaacggcc gtgcgcagcg attccagctc atcgttgacc 2413381 gcgatgcgca gcgcctggaa tgttcgcttg gctggatgcc cgccgacacg ccgggccgga 2413441 gctggaatcg cctggtacag cagggcaacc agttcggcgg tcgaggtgaa cggggttttt 2413501 gcgcgtcggc ggacgatacc ggcagcgatg cgccgagcaa accgctcctc tccgtagcga 2413561 cgcaggatgt cggctagtgc cgcctcgtcg taagtgttga caatgtcagc tgcggtcaac 2413621 ggcgtcgtcg ggtccatccg catgtccaat ggcgcgtccg tggcgtaggc gaagccccgc 2413681 tcggcgcggt cgagctgcat ggatgagacg ccgagatcga acaggattcc gtcgactgat 2413741 cccactgcgg cataaccgga ttcagccagc gctgcgccca gacagtcata gcgggtgtgc 2413801 accagggtaa gtcggtcagc gaatcgcacc agccgagacc gcgcgacgtc cagagcggtt 2413861 gggtcacggt cgagcccgat caggcgcaga cccggcaatc cctccaaaaa ccgctccgca 2413921 tgcccgcccg cgccgatggt cgcgtcgaga aggaccgcct gcgagccgtc tggatagtag 2413981 cgggttagtg cgggggtaag cagttcgaag caacgttgcg ccaataccgg cacatgaccg 2414041 aaaccggttg gccccgaacc tggatcagcc accgtgatac ctccccaggt ctggcaagcc 2414101 gtacttcggg acgcggctat tccaggcgcc gcccctgcac cgaggtccct gtccgaagac 2414161 acgaacctgg cgttggggaa gtacgccagg gtcgcttcgg gcagagacca cggtgcacgg 2414221 gtttgcacct cagaagatgt caccgagtgc ttcatcgctg gccgcggaga agttctcttc 2414281 atggatttgt tggtagttct gccaggcttg cgcatcccag atctcgagat agtcgaccgc 2414341 gccgatcacc acacagtcct tggaaaggct tgcgtagcgg cggtggtcgg ccgacaaggt 2414401 gatccggcct tgactgtcgg gatgctgttc gtcggtaccg gcggcgagat tacgtaggaa 2414461 cgctctcgcc tcggggttgc ttcgtggcgc cttgctggcc cggcgcgcca gctgctcgaa 2414521 cgccgcccgc gggtaaacgg ccaggctgtg atcttggctc ttggtgacca tcaacccccc 2414581 tgccaacgcg tcgcgaaact tggccggcag cgtcagccgc cccttgtcgt cgagtttggg 2414641 cgtgtaggtg ccgagaaaca tggggcacct ccctgccaaa tccatctcac ccaaacacct 2414701 cagccaccat accccacaat cccccacttt gccccataac tggggtatca aagcggcgtt 2414761 ttgccgtctc tgtaccactg aagcgcgcgg ctagcccggc tacgacctca gaaaaccgca 2414821 tgtcgccggg caaatgggtg gcaagtgggg ccaagtgggg cacaactggg gctcaaaccg 2414881 gactcaatat cgccgacagc cggtgacgac ccggctgggg cttcccgaga ctgcgattcc 2414941 caaacgatga cgcccaaaca aaaagcggga ccgccgatgg ctgccccgct gccgctggtt 2415001 gcgttcggct tactcgtcga agcggcgccg gaaccgatct tccatacggc tggtgaatga 2415061 gcccccggcc cccttggtac gacgctggcg cgaagcccca gcagccgatc cgccacgatc 2415121 catcctgccg gacaaccgag gaccggtgat ggcatacacc acaccaccga acatcacgac 2415181 aaaaccgaaa acgctgagta tcgggaaact tccgatcatg gtctctttga acgccacgcc 2415241 ggaaaccaac atccccagac cgatgatgaa caacgccgcg ccctgcaggc gccgccgcgc 2415301 ggtcggtgcg cggaagcccc cgccacggac actcgatgcg aacttgggat cttcggcgta 2415361 gagagcgctc tcgatctggt caagcatccg ctgctcatga tcggagagtg gcatgcgtcc 2415421 ctccttgccg acagactgtc acgtaatacc gataacacgc ggatgcccat tgcgcgggca 2415481 actaactcag atgatacgag gtcaatctgc gccgtaccac tggttcgcgg gcgattctat 2415541 cccggcggcg ccgcagcgac gagctgagcg gaaacggcca tacgctacaa gccccgtcca 2415601 gcgcgggcgg cctcatcggc ttgtccgata ctggtgcgca agcacgcatc ggttcatcac 2415661 atgaggagga caccgcgcgt tggcgatatt cctcatcgat ctgccgccca gcgatatgga 2415721 gcgccgcctc ggtgatgccc tgacggtgta tgtcgacgcg atgcgctacc ccaggggcac 2415781 cgagactttg cgcgccccaa tgtggctgga gcacatccgg cggcgcggct ggcaggcggt 2415841 cgcggccgtc gaggtaacgg cagccgaaca ggccgaggcc gccgacacca cggcgctgcc 2415901 gtcggccgcc gaactgagca acgcgccaat gctcggagtg gcgtacggct atcccggggc 2415961 gcccggccag tggtggcaac agcaggtggt actgggcttg caacgcagcg gctttccgcg 2416021 cctagcgatc gcccgactga tgaccagcta cttcgagttg actgaattgc acatccttcc 2416081 ccgcgctcaa ggccgtggcc tcggggaggc gttggcccgc cgactgctag ccggtcgcga 2416141 cgaggacaac gtcctgctct ccacaccgga gaccaacggt gaggacaatc gggcgtggcg 2416201 gttgtaccgc cggttgggct tcaccgacat catccgcggc taccacttcg ccggtgaccc 2416261 ccgagcattc gccatcctgg gtcgcacgct accgctctaa cccgcgcccg acagcttgcc 2416321 gacgcggcat gcccggtctg gcacgatgac ctggtgcgcg ctagctatgc cccaccgtca 2416381 tcccaaggat cgcgagtggc aaggacccga cggcgcggca tgctggccat cgcgatgttg 2416441 ctgatgctgg tgcctctggc taccggatgc ctgcgggtcc gagcctcgat caccatctcg 2416501 ccggatgacc tggtgtccgg ggagatcatc gccgcggcca agccgaaaaa cagcaaagac 2416561 accggccctg cgctcgatgg cgatgtgccg ttcagccaga aggttgcggt ctcgaactac 2416621 gacagcgacg gctacgtggg gtcgcaagca gtgttttccg atttgacctt tgccgagctg 2416681 ccccagttgg ccaatatgaa ctccgacgcc gccggagtga acctgtcact gcgccgaaac 2416741 ggcaacatcg tgatcctgga aggccgagcg gatctgacat cggtatccga tcccgacgcc 2416801 gacgtcgagt tgaccgtcgc cttccccgca gcagtgactt ccaccaacgg cgaccgcatc 2416861 gagcccgagg tagtgcagtg gaagctcaag ccgggcgtgg tgagcacgat gagcgcacag 2416921 gctcgttata ccgatcccaa cacccggtcg ttcaccggag ccggcatctg gctgggcatc 2416981 gccgcgttcg cggccgccgg tgtggtggcc gtgctggcgt ggatcgaccg ggaccgctcc 2417041 ccacggttga ccgcttcggg cgacccgcca accagctagt ccggcttgcc cggctcggca 2417101 ggtgaccagt aggcaagcat ttccgcgaag gtctcgaaag ccgcggccga aacgccatac 2417161 gtcgcctcga gatggatgct tagcggaaaa cccagatcgg cgacgccgtc tagcacacgc 2417221 ttgtacaagt cgaccatgag ccggcgccgc cgtgcgggtt cactgccggc caacttctgc 2417281 acgaacgcct gctcgtcggc caccgcggcg tttcccgggt cctggatcag ccagttgatc 2417341 aggccgatgc gggtctcgac cttcgggaca aagccgaacg acagcagaat ctcgggtcgg 2417401 tgttcggtgg tcctggcgaa ctcgcgcagg aagcccacga tcgcgtcgga atacaacagc 2417461 tgggtcatgc cgtaggttgc gccccgactg cacttgaaat tgagccggcc ctgctcgccg 2417521 tctcgggtgg ggatcacgat cacaccacgg ttggccacca gctggcgata cagcgacagg 2417581 gcatccgtcg gcgcgactcc ggagccctcg ccgtcctgca tcgtgcgcgg tacaccgacg 2417641 aatacgatgc cctccatgcc ggcatcggac agatcgacca gccgccggtg caacgatggc 2417701 tcgtccatga acgcggttac ctgcgtacac aggccatgga ctcccgccaa ctccggtttg 2417761 atgatcgacc agaaatcgag tacatccagc ttcggctgca tcgggatggg cctatcgtca 2417821 tcctcggcga tcatccccgg catcattacg tgccgtatcc ggccgtcaag cccggatgca 2417881 gccgagtact gcaccacctt gcgagcatct tcgattgccc gctccttgcc accctcgagg 2417941 ttcggtggca ccagctccag cgcgatcgtg ttgagggtca cacggctcct cttcgtcaaa 2418001 cgagtacttc catggccgcc aatggggcca ccggtgggcc gcgccgcgtc gcgcaaatcg 2418061 ccatcctggg ccgggccgga ccagccaacc caagggcgct gaagacagca taaacacgaa 2418121 atagtcagtt agtcgaagca acttgtgtgg tttccgcgag cccacccgcc gaatcatcga 2418181 tagcggccac tcgcgccggc gcggaataca ctgtcgggcc ataggcacgc caaatgagaa 2418241 aggggcgccg cgctgagcct gaatgcgccg gcagcaccgg cagcggtcca gttggccggc 2418301 gccatcaccg accagctgcg gaggtatttg cacggccgcc gccgtgcggc cgcccacatg 2418361 ggcagtgact acgacggcct gatcgccgac ctggaggatt tcgttctcgg cgggggcaag 2418421 cgcctacgac cgctcttcgc ctattggggc tggcacgccg ttgccagtcg ggaacccgat 2418481 cctgatgtgc tgctgctgtt ttccgcgctg gaactgctgc acgcctgggc gctggtccac 2418541 gacgacctga tcgaccgttc cgccacccgc cggggccgcc cgaccgccca gctgcgctac 2418601 gcggcgctgc accgcgatcg ggactggcgg gggtcaccgg accagttcgg catgtcggcg 2418661 gccatcctgc tcggcgacct cgcacaggtc tgggctgacg acatcgtctc gaaggtctgc 2418721 cagtccgccc tggcacccga tgcccagcgg cgagtgcatc gggtgtgggc cgatatccgc 2418781 aacgaggtgc tgggcgggca atacctcgac atcgtcgcag aggccagtgc cgccgagtcg 2418841 atcgagtcgg cgatgaacgt cgcgacgctc aagaccgcct gctacacggt atcgcgaccg 2418901 ctacagcttg ggacggccgc cgcggccgac agatccgacg tagcggccat cttcgagcat 2418961 ttcggagcgg acctcggcgt agcgtttcag ttgcgcgacg acgtgcttgg cgtgtttggc 2419021 gacccagccg tgacgggcaa gccgtccggt gacgacctaa agtcgggcaa gcgtaccgtg 2419081 ctggtagccg aagcggtgga attggcggac aggtcagacc ccttggcggc caaactatta 2419141 cggacctcga ttggcacccg attgactgat gcgcaggtac gtgaactgcg cacggtcatc 2419201 gaggcagtgg gcgcgcgcgc cgccgcggag agccgcatcg ccgcgctcac ccagcgagca 2419261 ctggccaccc tggcgtccgc acccatcaac gcaacagcca aggccgggct gtccgaactg 2419321 gccatgatgg ctgcgaaccg gtccgcctaa ccgatgacta ctccgagcca tgctccagcg 2419381 gttgatttgg ctacagcgaa agatgctgtt gtccaacacc tttcgcgact tttcgagttc 2419441 actaccggtc cgcagggcgg accggcgcgg ctgggcttcg ccggcgcggt gctgatcacc 2419501 gcaggcgggc tgggagccgg cagcgtccgc caacatgacc cgctgctgga gtcgattcac 2419561 atgtcctggc tgcgcttcgg ccacggactc gtgctgtcgt cgattctgtt gtggacaggt 2419621 gtgggtgtga tgctgcttgc gtggctgggt ctaggccgac gggtcctcgc cggcgaagcc 2419681 accgagttca ccatgcgggc aaccaccgtt atctggctgg cgccgctact gctgtcggtg 2419741 cccgtcttca gccgggacac ttactcgtat ctggcccaag gggcgcttct gcgcgacggt 2419801 ctggatcctt acgctgttgg cccggtcggt aatcccaatg cgctgctgga cgacgtaagc 2419861 ccgatctgga cgatcaccac cgcgccctac ggtcctgcgt tcattctggt tgcgaagttc 2419921 gtcacggtaa tcgtcggcaa caatgtcgtc gccggaacca tgctgttgcg tttgtgcatg 2419981 ctgcccgggc tggcgttgct ggtctgggcc actccacgct tggccagcca tctcggcacc 2420041 cacggcccga ccgcgctgtg gatctgcgtg ctgaacccac tggtcctcat ccatctgatg 2420101 ggcggggtgc acaacgagat gctgatggtg ggtctgatga ccgccggtat cgcgttgacc 2420161 gtccagggcc gtaatgtcgc ggggatcatc ctgatcaccg ttgcgatcgc ggtgaaggcc 2420221 accgccggaa tcgcgttgcc cttcttggtc tgggtttggc tgcgtcatct gcgtgagcga 2420281 cgggggtacc ggccggtcca ggcgttcctg gcagccgccg cgatatcgct gctgatcttc 2420341 gtcgcggtgt tcgcggtgct gtctgcggta gccggcgttg gcctagggtg gctgaccgcg 2420401 ctggccggct cggtgaaaat catcaactgg ctgacggtgc ccaccggggc ggccaacgtg 2420461 atccacgcgc tgggcagagg gctcttcacg gtcgacttct acaccttgct gcggatcacc 2420521 cggctgatcg gaatcgtgat catcgcggtg tcgctgccgc tgttgtggtg gcggttccgg 2420581 cgcgacgacc gggccgcgct gaccggggtc gcatggtcga tgctgatcgt ggtgctgttc 2420641 gtacccgccg ccctgccgtg gtactactcc tggccgctgg cggtcgctgc cccgttggcc 2420701 cagtcacgac gggcgatcgc ggccatcgcc gggctctcga cttgggtgat ggtgatcttc 2420761 aaacccgacg gatcgcacgg gatgtattcg tggctgcact tctggatcgc caccgcctgc 2420821 gcactgactg cgtggtatgt cctgtatcgg tcaccggacc ggcgcggagt gcaggctgca 2420881 accccggtgg tcaatacgcc atagcctggg cccggcgcac cacctcgcga gcctggtggg 2420941 catgcaatgc atcgacggga cgggcgttgc tgacggcgtc acgcgagccg tcgcgggtga 2421001 tggtcagcga cggatcgggg gtaaacagcc agcgcataat ctcggtgtcg cgatagcccc 2421061 cgtcgtgcag gatggtcaac agccccggca ggctcttgac cacctgaccg gagttggtga 2421121 agaagacctg agggatcacc acgccaccag cgcgccgcac ggccaccaga tgaccttccc 2421181 gcagctgctg ggccaccttg ctgaccggaa cgccgagcag ctcggcgacc cggggcaggt 2421241 cgtacgtcgg ttcgtcaggg tccaaaacgt catcgccagc gagaatgctg cccacccgcg 2421301 caagtgtaga gcctggtgcg cggccaggca tgcgcgttag gcttccgttc tgcatccaat 2421361 cgcggcggcc acctacgatg accccgtggt cgaagctggc acgagggacc cgttggagag 2421421 cgcgctgctg gacagccgct atctggtcca ggccaagatc gccagcggcg gcacctcgac 2421481 ggtctaccgg ggcctggatg tccgactcga ccggcccgtc gcgctgaaag tgatggatgc 2421541 tcgctacgcg ggcgatgaac agtttctgac ccgctttcga ctggaggccc gtgcggttgc 2421601 ccggctaaat aaccgcgcgc tggtcgcggt ctacgaccag ggcaaagacg gcaggcaccc 2421661 gtttctggtg atggagctca tcgagggcgg taccctgcgc gagctgctga tagaacgtgg 2421721 tcccatgccg ccacatgccg ttgtggcggt gctgcgccca gtgcttggcg ggctggctgc 2421781 cgcccatcga gccggtctgg tgcatcgcga tgtcaagccc gagaacatct tgatctccga 2421841 cgacggcgac gtcaaactcg ccgatttcgg gttggtccgc gcggtcgccg ccgcttcaat 2421901 cacgtctacc ggcgtcatcc tgggtaccgc ggcctacctg tcccctgagc aggtccgtga 2421961 tggaaacgcc gatcctcgaa gcgacgtcta ctctgtcggc gttctggtct acgagctgct 2422021 aacggggcac acaccgttca ccggcgactc ggccttgtcg attgcctacc aacggcttga 2422081 tgctgacgtg ccgcgtgcca gtgctgtaat cgacggtgta ccgccacaat tcgatgagtt 2422141 ggtggcatgt gcaactgccc gcaaccctgc cgaccgatac gccgatgcga tcgcgatggg 2422201 cgccgatctg gaggcgatcg ccgaggagct ggccctgcct gaattccggg taccggcgcc 2422261 gcgcaactcc gctcaacacc ggtcggccgc gttgtaccgc agccggatta cccagcaagg 2422321 gcagctgggt gccaaaccgg ttcaccaccc tactcgccag ctgactcgcc aacccggcga 2422381 ctgctccgag ccggcttcag ggtcggagcc cgaacacgag ccgatcaccg gccaattcgc 2422441 cggcatcgca atcgaggaat tcatctgggc gcgacagcac gcccgtcgaa tggtgcttgt 2422501 ctgggtgtcg gtggtgctgg cgatcaccgg gctagtggcg tccgcggcat ggacgatcgg 2422561 gagcaacctg agcggcctgc tctaaggcag gcgagcagtc gcaaaagccc ccatttcggc 2422621 acgaaaatgg gggctggtac gtgaattaag gtgaccacgg caagcgtgac ccgccggcga 2422681 ctgcagcgaa gccgggtctg ttggtgacag tgtgtatgtc ggggtttcag gcggcaggtt 2422741 cgagggtgac ccccaatcct tgggcttcga gtttggcgac gaggcgacgt cgttctttgt 2422801 cgggatccat gcgggtggtg aagtagtcgg cgccgagatc ctggtaaggc cggccggtgg 2422861 ccagcacgtg ccaaatgatg acgatcagct tgtgggcgac ggcgatgatc gccttcttgt 2422921 tggcagcggg actgcggaag ccaccgaact tgcggacctg gcggcggtag tactcgcgca 2422981 ggtagccatc ggtgcgcacg gcggcccacg cgcactcgac caggaccggc tgcaggtgct 2423041 ggttgccttt gcggcgggca ccgtgatggc gtttgccggc cgattcgtgg ttgcccgggc 2423101 acagccgcac ccacgaggcc agatgctcag ccgaggggaa ccaggccgcc gggtcggcgc 2423161 cgatttcaga gatgaccgtc gccgaggcac ccaccccgat ccccgggatc gatgcaatca 2423221 gctcgcgtcg ggcacaaaag ggatgcatca gctgctcgat ctgctcgtcg agagcaccga 2423281 tcatcgcatc gagctgatcc agatgagcca ggtgcaacct acacatcagg gcatggtgat 2423341 catcgaagcg cccttccagc gcccgctgca gatcggggat cttcgagcgc ataaggtgcg 2423401 cttgatcgtc ttgcgcccgt caggaccctc gcccggcacc cacaccgcca ccgcgatgat 2423461 gtcctggccc acatcaacaa aggcgcaccg ctcgtacaga atatgcatcc caccagcccc 2423521 tttccggctc agcgtcgcaa ccaacaacgc gcgctgcgaa gggagccccc aaacatgaac 2423581 taaagagact ggtactcgcg ctcgtagcag caaccgggac acacccgaaa gtgggggggc 2423641 tccaacgtca gtctcttgca cggccacaca cagccaagcc cctacgacgt cgacaccgca 2423701 acgcacgcac cgattctcat tcaccatgag cgggcgcacc agcgcccatc atgttctttt 2423761 acgactgctc gccgagctag tcccgcagca tctccgcgac caggaacgcc aactccagcg 2423821 actgctgggt gttcagccgc ggatcacatg ccgtctcata gcggccggcc aagtccgtct 2423881 ccgaaatgtc ttgcgcgcca ccaagacatt cggtgacgtt ctcgccggta atctcgacat 2423941 ggatgccgcc cggatgggtt ccgagggcac gatgcacctc gaaaaaaccc tgcacttcat 2424001 cgacaatgcg atcgaagtga cgggtcttga accccgtgga cgactcgtgg gtgttgccgt 2424061 gcatcgggtc gcattgccag atcacctgat gcccggtggc ctggaccttc tccacgatcg 2424121 gtggcaacag atcgcggacc ttgtggttgc ccatcctgct caccaacgtc agccggcccg 2424181 gcttattgtg cgggtcgagc cgctcgacgt actccacggc cagttccggg gtcatgttgg 2424241 ggcccaactt gaccccgacc ggattagcaa tcacctgggc aaacgcgatg tgcgcgccat 2424301 cgatttgtcg ggtccgctcg ccgatccaca cggtgtgtgc ggacaggtca aacagttgtg 2424361 gttcaccgtc ttcaccgtcg gacaacctca acatggcgcg ctcgtagtcg agcaccaaag 2424421 cttcatggct ggcatagatt tcggcggtct gtagattgcg gtcggccacc ccacaggcac 2424481 tcatgaaccg cagcccacga tcgatctcgg tggccagcgc ctcatagcgc gcgccggccg 2424541 gcgaggtccg gacgaattcc cggttccagt cgtgaaccag atgcagcgac gccaggcccg 2424601 acgaagtcag cgcacgcacc aagttcatcg ccgcactggc gttagcgtaa gcccggacca 2424661 gccgcgacgg gtcgtgctcg cgcgccgcgg cgtccggggc gaagccgttg atcatgtcgc 2424721 cgcggtaaga ccgcagaccc agcgcgtcaa tgtcggctga ccgaggcttc gcgtactgac 2424781 cggcgatgcg ggccaccttc accactggca tgctggcgcc gtaggtcagc accacggcca 2424841 tctgcaacaa ggcacggaca ttgccccgaa tatggggttc ggtgttgtcc atgaatgtct 2424901 cagcgcagtc gccgccctgc agcaggaaag cctcaccctt tgccacctgg gccagctgct 2424961 cttgcagccg gacgatctcg gacggcaccg tcacgggtgg cacgctctcc aacaccgtgc 2425021 gcatcgccaa cgcctggtcg gccggccagg tgggttgctg ggccgccggc ttggccagcg 2425081 cggcgtccag tcgtgttcgc aggtcagtcg gcagcggcgg aagcgacggg agctggtcga 2425141 tcggtatgtc gacggtccag ttcatcggtc catggtaacc ggggatttcc tgacggctgc 2425201 tcagggcgag gttcgctcgg aggtcctcgc cggcgggatc tgactgtccg tctcctcagc 2425261 gggccgcgcc gcggcccgca tcgtctgtgg acgtgatgag acgaaaccgg cgcagctgat 2425321 ctcgggcatc gaccagcgcg tcgtggacgt cgcgtggccg cggcggcatc cgggggcatc 2425381 cccggtcctc ccacaactgc cgcagttccc gggtgaaacg gggcactgtg ggtggcaagg 2425441 cagtcatcgg gccccacaat tgacacagcg ctacatggtc gtaggccccc acccaggccc 2425501 acaactcgat cgaatccgtg ccgtcgatgc ggaggaattc ttccaggtca agacgaatct 2425561 gctggcgcga gcgccacagt tgcgaggcgg gcggcggcag cttgggcagc acatgggtgc 2425621 gcacccagct gccggcccgc tcgggatcga attccgtgga tactgcgtag tattcgcggc 2425681 cgtcttctgc gaccaccccg atcgagatca actcgatggt gtgcccatcc tcgatgaatt 2425741 cggtgtcgta gaagtaccgc accgccgcag cctaatccga ccagaccgag ccgctgatca 2425801 gaatgggcgc ggttctctcc ggcggtggtg cggggcgcac gtcctggtcg agctgggcgt 2425861 cgaccgcgcg ctcgtcgggc atccgtggcg tcccggcgat cacgtattgc agccacagct 2425921 tgatccgcac caccgggcgg cgccatgtcc gttcccgctg tagcgcacgg cgcatctttt 2425981 ccgggtggcg ggtgtagcgc caccgggccc acggagcgtg cggacgcgac agccggactg 2426041 cgccgacgac caacagcacg acaacgaaca tgccaagcaa cccggtccaa accttgccct 2426101 tgagcagcac caccaccgcc aacggcaacg tcaagaccaa tcctgcgatc agggtggttt 2426161 gcagcaccac ccagttggcg ccttgccgaa ccgtcaggaa gaagatcagc gggtgtaggc 2426221 ccatgatcaa cagccccgcg acggccactg cggcaaagac ggcgtctacc gacgtgcgtc 2426281 cgtcttcctc ccagtagaca tcggacagat gcaggatcag tgcgtactcg tcgagcacca 2426341 aggcggcccc gactccgaaa atgctcgccg ctatggtgaa ttcgggttct cgaccgtcga 2426401 ctgacaaggt gaccagcgtc agcccggaga tcatcaccag caccacccca aacgcgacgt 2426461 ggtggatgtg caccgacccg atgtggacat ttcgcggctg ccaccacctg gccggccgac 2426521 cgtccgcggc gcgacggtgg ataaaccgta caaaactacg cgtgacgagg aaggtcagga 2426581 caaaggcgac caagcagcac aacaacggca gccggccacg gtcgacgatg tcgtgctgca 2426641 gccagtggaa cacctccaaa aagctacgcc caccttgact gcatatgcag gcgccgtaca 2426701 gcgccaccat gcgcgcctac gcgaaactac taggctgttc tgcgacatga gtgcatggcg 2426761 ggcgcccgag gtgggcagtc gactcgggcg gagggtgttg tggtgcctgc tgtggctgct 2426821 ggccggcgtg gcgttgggct acgtggcctg gcggttgttc ggccacacgc cgtatcgcat 2426881 cgatatcgac atctatcaga tgggcgctcg agcttggctg gacgggcgtc cgctgtatgg 2426941 cggtggtgtg ttgttccaca cacccatcgg gctgaacctc ccgttcacct atcctccact 2427001 ggcggccgtc ctgttcagcc cattcgcctg gttgcagatg ccggctgcca gcgtcgcgat 2427061 cacggtgcta accctggtgc tgctgatcgc gtcgacggcg atcgtgctga ccggcctcga 2427121 cgcatggcca acctcccgac tggtacccgc gccggctcgg ttacgccggt tgtggttggc 2427181 cgtgctcatc gtggctccgg caacgatttg gctggagccg atcagctcga acttcgcttt 2427241 cggtcagatc aatgtggtgc tgatgaccct ggtgatcgtc gactgcttcc cacgccgaac 2427301 gccatggcca cgcgggctga tgttggggct ggggatagcc ctcaaactca cccccgcggt 2427361 gtttctcctc tacttcctgc tacgtcggga cggtcgggcc gcgctgacgg cgctggcgtc 2427421 gttcgcggtc gccacgctgc tcggtttcgt cctggcgtgg cgcgactcct gggagtactg 2427481 gacgcatacc cttcaccaca cggaccggat cggcgctgcc gccttgaaca cagaccagaa 2427541 catcgcgggc gcactcgcgc ggttgacgat tggcgatgac gaacgcttcg cactgtgggt 2427601 ggccggatcc ctgctcgtgt tggcagcgac catatgggcg atgcggcgag tgttgcgggc 2427661 cggcgagccg accctggctg tgatctgcgt cgccctgttc gggttggtag tttcgccggt 2427721 ctcgtggtca caccattggg tgtggatgct gccggccgtg ctggtgattg ggctactggg 2427781 ttggcgtcgc cgcaacgtcg cgttggccat gctcagcctg gccggggtgg tgctgatgag 2427841 gtggacaccg atcgacctgc ttccccaaca ccgggagacg actgcggtct ggtggcgtca 2427901 actcgcgggg atgtcctacg tgtggtgggc gctggcggtc atcgtcgttg ccggactcac 2427961 cgttaccgcc aggatgacgc cgcagcgctc gcttacgcgc ggactgaccc cggcgccgac 2428021 ggccagctga ctagccagcg gctgtctcgg ggattcgtgc ggcgtccgtt gaattgggat 2428081 ttgcaccggc accgcccgcg ttgcggccgt ctttgacact ggcggcatag atgtcgacgt 2428141 actcctgacc cgagagcccc atcagctcat agatcacttc gtcggtaacg gcccgctcga 2428201 tgaaatggtt accggccaac ccctcgaacc gggagaagtc catcggcttg ccaaaccgaa 2428261 cggtgaccct gccgaacctc agcatcttcc tgcccggcgg gttgacgacg ttggtaccga 2428321 tcatcgccac cggaatcacc ggaaccccgg tgtgcaatgc caaccgggct aggccggtct 2428381 tgcctttgta gagccgcccg tccggcgagc gagtgccttc tggatacatg cccagcagct 2428441 tgccctgacc cagcaacacc actgccgtct gcagtgcgcc ctgcgcggag tcggcattgg 2428501 tgcggtcgat gggaacctgg ccggagacgc tgtagaacca gcggttgatc cagcctttca 2428561 gtccggtgcc ggtgaagtat tccgatttcg ccaggaacca gatacgacgg cgaactacca 2428621 acggaaggta gaagctatcc gccaccgcaa gatggttact ggcgaggatg gccggacccg 2428681 aactcgggat gtattccagt ccttcaactt tcgggcgacc aagcaacgta aagagcggac 2428741 ccatgaaaat gtacttgaac aggtagtacc acatggccct ccctctcgcc cacaccggat 2428801 ggtgtctgcg ccaactgtac ccatccgcga tggctgcgac tacctgcgcg ggcagcggct 2428861 cactcctcga tggtaaccgg gatggcctgg tagtgggatt tcatgctgcc ctcgccgttg 2428921 gtattctcgc cgcctgacgc accggtctgg ccgccgcccg gcgggccttc gggtggcggt 2428981 ttcgccgacc ggtcaatgtc gtcgactatg gcgcggatca cctccagcag agccagactg 2429041 tgatccgcga tgaccgtcag cagcggatgc tgctcgccgg ttaccaacgc tgccaacgcg 2429101 cacaacggac accacacttg ctggcacttg ccggtcccgg gacctccacc cgaagccatc 2429161 gccgccgcca cccgcaccgc ggggtcgatc ccatcgagga ttgcctgcgc cagcttgcgc 2429221 agctcgggac gaacgtcggt atgggccccg ctcacgtcgg ccacacctcc ggatttggtc 2429281 ggaaacgaac tgttaactca ccaccccgca gatgcgcgtc cagcaccgtg cacctccgca 2429341 atacggacgc caaccgaact cggcgccgca tccccccagc actgacgatc aagtcgtcgt 2429401 cggcccggcc caacgtcagc gtcccgggat cgagctgggg caacgctagc cgcagtcggt 2429461 atatcgacgc caatcccgac ccggattcca ggtccacaat aggctgtagc gggcctggcg 2429521 gcgcgcttcc ttggcgacga cgggcactat cgagcaaccc gcccagggcc ttggggccga 2429581 tcggttcgcc ggccaagtgc ggcaccagca ccagtgccac gtcaccgatg gtggcatcga 2429641 ggtcgtcgag gacggcacgc tgctcaccga tgcgttcggc ataccagtgg aaagccggat 2429701 ggtcgggcag actgcgatac tcatagttct cgtcttgcac cagaagctga ttgacgagca 2429761 gctcttcgac ccggaccccc atgagcgcca acgaccccag cgtccggacc gcctcagcgg 2429821 cgaccacccg ctccggagtc agcaccaggt gggcactgac cagggcaccg tcggtcagca 2429881 atgtgcttag ccgctcgacg ctggcgcgga tgcgctccag cagttccgcc agcacggctg 2429941 acctgccgtc gtcggcgccg atggacaacc tgcgatgccg cggccaggca cgttcgacgt 2430001 acagcccgaa ggtggcgggt agcgtcaaca tccgcaaggc gtccgccgtc gaggcgcagt 2430061 cgaccacaat ccgatcccat cgtcgggctg ccgcaagctc gccgacggcg tgcagcccca 2430121 gcacctcctg gatcccgggc agcgcgcaga gttcttcggg cgcaatgctg ctcaactcgg 2430181 agcccggaaa tctgcggtcc agggtctcga ccacgtgcaa ccaccggccc tcgagcaggg 2430241 ccagggtatc cagcgccagc gcgtcgagaa atccgccccc ggcttcgggg tcgtaggcga 2430301 gcacgcgaac aggatcgccc tgaccggtag gcgggaccgc gatgcccagc acgtcgccca 2430361 gcgagtgcgc ctggtcggtg gataccacca acactcgctg gccggccccg gcatcacata 2430421 ccgcggtggc ggacgccaga gtggactttc ctaccccgcc cttgccgaca aagagactga 2430481 tccgggcctg agccggcgta ccggaatcac tcagccctcg actcgtttct tcagatcctt 2430541 caacgcgccg tctatcaacc tgcgttccgc cttacgcttg agcatcccga tcatggggac 2430601 agcaaggtcg acggcaagct cgtaggtgac ctcagtgcca gaacccttgg gcgccaagcg 2430661 atacgtgcct tcgagggact ttagcagcga gctggattcg agagtccagc taagcgattg 2430721 gcggtcttcc ggccactcgt aggacatgat caaggtgtct ttgaagatgg ctgcgtccat 2430781 caacattcgc gctcgtttcg ggtagccctc gtcgtcggcc tctaggatct cgacttcctt 2430841 atactccgaa atccattgcg ggtaggcttc tatgtcggcg atcgccttca tcacctcgcc 2430901 tggatccgcg tcgatgtaaa tcgtctgtgt cgtcttgtcc gccacctggc tacttccctt 2430961 tccccgcaag cgggtcggcc ccggtcatct gcgggagctc ccgatctccc ggggagaaac 2431021 ggtactccct cgtgccaacc ttgacccggt taagttaccg gagaaacccc gatggggcgt 2431081 gaccgttcta gcactgtctt gacctcgaag gccatttttt tgcccgcgac ccgtcggtgg 2431141 tgcgtcattc tggccaggtt catccgggcc agctgccagg ctgctacccc ggtcggttcg 2431201 gcgtgcagga aatagtgcag tagcacccca tccatcgacg gttccagcca gatctccatg 2431261 gtgccggtca gggcgccggt aaccgtccac ctgattccct tgtcggcacg atcctcggtg 2431321 acctgtagcc gtaggtcagg ccaccaccga cgccagctgc atcgatccgc gaccgcggct 2431381 gaaacccgcg cggcgtcggc tgcgacatag gtctcgtcag cgatctggat gctgttcatc 2431441 gcctcagctt cacatacccg aggccgtggg caagccggac cccgaagggc accaaccaac 2431501 ggacacgcga tatcggtcta ttccgcaccg gcatcaaccc ctctaggctt gacgacagca 2431561 aaccggaccc ggaagacggc aacaggtcaa gtgaggtgtt gatcgtgcgt gagattagcg 2431621 tccccgcccc attcactgtc ggcgagcacg acaacgtcgc ggccatggtg ttcgagcatg 2431681 aacgtgacga tcccgactac gtcatctatc aacgcctgat cgacggcgtc tggaccgatg 2431741 tcacgtgtgc ggaggcagcc aaccagattc gtgccgcggc tctcggtttg atttcactgg 2431801 gggtgcaggc cggcgatcgg gtagtcatct tctctgccac ccgctacgag tgggcgatcc 2431861 tcgatttcgc gattcggctg tgggtgcggt caccgtaccg atctacgaga cctcgtcagc 2431921 ggagcaggtg cgctgggttt tacaagactc cgaagcggtg gtgttgttcg ccgaaaccga 2431981 ctcacacgcg acaatggtcg ccgaactctc cggcagcgtg cccgccctgc gggaggtact 2432041 gcagatcgcc ggttcgggtc ccaacgcgct cgatcggctc acggaggcgg gcgcctcggt 2432101 cgacccggcc gagctaaccg cccgcctcgc cgcactacgg tcgacggacc cggcgacgct 2432161 tatctacacc tcgggcacca ccggacgacc caagggctgc cagttgaccc aatccaacct 2432221 ggttcacgag attaagggcg ccagggcata tcacccgacg ctgctgcgca agggtgagcg 2432281 gctgctggtt ttcctgccgc tagctcatgt gctggcgcgc gcgatcagta tggccgcctt 2432341 ccactccaaa gtcaccgtgg gattcaccag cgacatcaag aatctgctgc cgatgttggc 2432401 ggtgttcaag ccgacggtgg tggtgtcggt gccgagggtg ttcgagaagg tgtacaacac 2432461 cgccgagcag aacgccgcca acgccggcaa agggcgaatc ttcgcgatcg ccgcgcagac 2432521 cgcggtcgac tggagcgaag cttgcgaccg cggcggaccg gggctgctac tgcgcgccaa 2432581 gcacgcggtg ttcgaccggc tggtctaccg caagctgcgt gcggcactgg gtggcaactg 2432641 ccgcgccgcc gtctccggcg gcgcgccgct gggtgcgcgg cttggtcact tctatcgcgg 2432701 cgccggtctc accatctacg agggatacgg cctgagcgag accagtgggg gcgtcgccat 2432761 cagccagttc aatgatctaa agatcggaac tgtcggaaag ccggtgcccg gcaacagtct 2432821 acgcatcgcc gacgatggcg agctgctggt gcgcggtggc gtggtattca gcggctactg 2432881 gcgcaacgag caggctacca ccgaggcatt caccgacggc tggttcaaga ccggtgatct 2432941 cggtgcggtg gacgaagacg ggttcttgac gatcaccggc cgcaagaaag aaattatcgt 2433001 caccgcgggc ggtaaaaatg tcgcccccgc tgtgctggaa gaccagctgc gggcccaccc 2433061 actgatcagc caggcggtgg tggttgggga cgccaagccc ttcatcggcg cgttaatcac 2433121 catcgaccct gaggcattcg agggctggaa gcaacgcaac agcaagacag ctggcgcgtc 2433181 ggtgggcgat ttggccaccg accccgatct gattgccgag atcgacgcgg ccgtcaaaca 2433241 ggccaatctt gcggtgtcac atgccgagtc gatccgcaag ttccgaatac tgcccgtcga 2433301 cttcaccgag gacaccggcg agctgacccc gacaatgaag gtcaaacgca aggtggtggc 2433361 cgagaagttc gcttccgata tcgaggcgat ctacaacaag gaatagccga ctgtgcccgg 2433421 ctcctccccg gcccgctcaa cgggccgcat cgtcgccgcg cagaaaatct gctagcttgg 2433481 cggccagcgt gtcccaacgc cactgcgccg tgacccattc tcggccggcg gcgcccatcg 2433541 cgacggcccg atcccgatcg atcagcaact cggccacggc gtcggccacc cggtccaccg 2433601 acctaccgtc gaccactagc ccagtcttgt tgtgctgcac cgtttccggc gctccgccag 2433661 aattgccggc gattaccggc acgccggcgg cggaggcttc gaggaacacg atgcccaagc 2433721 cctcgacgtc catcccggcg ccgcgggtgc ggcatggcat ggcgaacacg tcggccagtg 2433781 cgtggtgggc gggaagttcg tcggttgcca cgccgccggt gaacgtcacg tggtcggcca 2433841 ccccacagtc gtgagccagc ttgcgcaacg tctctagata tggaccgccg ccgacaatca 2433901 ccaacgcggc tccatcaacg cgacgccgga tcgacgggag cgccgtgacc agggtgtcct 2433961 ggcctttgcg cggcaccaac cgcgacagac acactaccgt gggccgctcg cctagccgat 2434021 agcgcttccg caactcggcg cgtgcggccg gatcggggcg gaaccggtcg gtgtccactc 2434081 ccggcggtag gtattccaac gaagccgcgg gcccgaacgc agaagcaaac cgggaccgcg 2434141 tgtagctgct gacgaaagtc accacgtcgg tgccgtcgcc gatgcggcgt agcaccgatc 2434201 gagcgaccgg aagcatcgac cagcccactt cgtggccgtg cgtgctggcc aacacccggc 2434261 tagctccagc cagccgggca cgcggggcca gcagggccag cggtgcggcc gcaccgaacc 2434321 agacggtttc gatgtcgtgc tcggcgatca gccggcgcat ccggacatcg accgttggac 2434381 ccggcagcat caccgtgctg ggatggcgca ccacccggta accggcagca cgggctgcgt 2434441 cgtcgaaggc gtcggcgcct ttccactgcg gtgcatacac tgtcatcgca tgcgctcggg 2434501 agccgaccag ccgaccgacg aactccccca gataggactg gatgcccccg cgtcggggtg 2434561 gaaagtcgtt agttaccaac aggacccggc tcacctgggt caggctagcg ggtccacctt 2434621 gcgtgagcag acgcaaagtc gcccaaaatc gccggtttcc gggtgatttt gcgtctgctc 2434681 gcggcggaag ctagcccatt agccaccgct gccagcgcgc gagcaggcca gcggcgtcaa 2434741 tgccgaggac gtcgtgggcg gcggtagcca ggtcgaagtg tccgacacca caagtggcca 2434801 gataaagctc acgcagcttg gcggtaccgt aggcggccgc gacgaaccga gcgaaccacc 2434861 acgcgcggtc atatgccagc gagcgctgtg gccccggagt gtccaggtcg gtgtccgacg 2434921 gtaacgacag cgccacagac accgcatccg cgggcggcgg ggtcttgggc ctggcaacga 2434981 aatcggccac cccctcggcc agccatcgag gtgcatccag ggccgtgtcg gcccgggccg 2435041 catagtgaaa aagctcgtgg cccaacacta ttcgtagcgc cgctgggctc atgtgtgccg 2435101 cgcccggcgc gaacacaatc cgttggccga ccaccgtgcg acgagcagga tcgacacggt 2435161 cgaccaccgt gatcgcggcg atgtccgccc attgcgacgc caaacccccg cctgcggcgg 2435221 catgaaactg ctcgtcgcta ccggcggcaa ccacaaagat gtcgtgcgac caatcggtgc 2435281 cccagaatgc caccacctcg tcgaccgcgg cgtcgatgcc cgccgcgatg cgcgacagca 2435341 agcggtcggt ggccgcgcca ccaaggctga gcagccgcac cgtgcggtcg tcggcgaccc 2435401 gcagcgcgac aaacccatcg gctggcgcga ccacctgtgc cggtgctgca ggtccatcgc 2435461 gcaccggatt accagacagg gccgccgcgc caatcagctc cgcaacaaac agacaggcca 2435521 gcaagatccg agggcaaagc cgccgcgcga cgagccggtc agtaacggcg ggcgtcgtag 2435581 atcggccccg acgagtccat cggcaccacc cgcaccggaa caccgtaggt ggaggaatga 2435641 accataagac catcaccgat gtagatgcct gcgtgtgacg cgtcggaata gaaggtcaac 2435701 acgtcgccgg gctgcagatc cgacaacgcg accggctgac caccgtgagc cagcgcctgg 2435761 ctggagtgcg gtaacgcgat accagcctgc tggaacgccc acatcaccaa gcctgagcag 2435821 tcgaacccgc cgggcgcggc accaccccac gcgtagggcg cgccgacctg cgtcaacgcc 2435881 gcttggacaa cggccgtacg gtcgccgcca gcgccgtcgg gctgcacgaa aggcaatccg 2435941 ggcatcccac caggcggcgg cgccacgcca ggcgccgggc cgtcgccagg cggcgcaccg 2436001 ggcggcaacg ccgcaggtgg ggccccgggg gcgatcgcag caaccgccgg gaccggtcct 2436061 ggatcagcga gggccgtgcg ctcctccggc gtcaacgcga cgtattgcga cttgacgacg 2436121 gcaatctgca cctgcagctg gctctgtttg tgctgcagat tcgctcgtac cgcggcagct 2436181 tgctcggccg cggacctggc atcggccgcc gatttggctg cagcctgctc ggccttgacg 2436241 gcctgttctc cagcggcctt gaaacgggcc atctgcgtgg acatttgatg cgccatcacc 2436301 cgctgtaccg atagccgatc gatcaacagt tgcggggact ccgccgtcag gatcgcatcc 2436361 atgccgtggg tacgaccacc catgtaggta gcggccgcga ccttgttcac cgccgtctga 2436421 aaagtcgcca agcgtgctct cgcagcatcc aaggccgttc tgttgtccgc aagcttctgg 2436481 tcggcggccc gctgggcagc gagcttttcg ttgagatcca gctgcgcact gtgcagcgcc 2436541 tcggtggtct gctcggcctg ccgggataac tcgttgagct tggccagcgc gtcgtcggcc 2436601 ggatcagcca gcacattcgc ggccaggacg ccggaggaga cggtgaagct cgcaaagaaa 2436661 cctatggcgg accgcatgat tacacgcgcg atcaaccacc tctggtcgag cctcaaaatt 2436721 tgcttcctta aacgggccat cgacggatga cgtcgagctg gtttaggtct caaacaggtt 2436781 acgaaacgat ctcggaattg tccaaaaggg gaagttaaga aaatggatag atttctacca 2436841 tttcgctgtg gacgatcgta cttctgctat agggctccag gggcatcgac acgcaacgac 2436901 cttacgcgac accggatccg cgctggcggc ggaccggcac caggcgcaac cgaggggcca 2436961 atccgacatc ggcgagcact tccaacgcag cacgctcgtc atgcgacagg ctttccggta 2437021 ccccgatgag cacgctgacc acacagtcgc ggcatccaga tccgcgcgcc gcgcaatcgt 2437081 cacagtcgat taccaccggc gcccccggcc cgggggctgt gccgccgcta gtgtctggtc 2437141 cgctgcgtgc catggggtcg ttcctctcgg cttggctcat gaggtcgtcc gaacgctaat 2437201 cgcgagcacc gacatccgtt gccgccgcgt gcgcgctcgg cgtagggagc gtttgcgtgt 2437261 cagtgcaggg gcctaacgtc gcggccatgg gtgcaaccgg tgggactcag ctgagtttcg 2437321 ccgacctggc acacgcccag ggggcagcct ggaccccagc cgacgagatg tccctgcgcg 2437381 agaccacctt cgtcgtggtc gacctggaaa ccacaggtgg gcgcacgacg ggtaacgacg 2437441 caacaccgcc ggacgcgatc accgaaatcg gggcggtcaa ggtatgcggc ggcgcggtgc 2437501 tcggtgaatt cgccaccctg gtaaacccgc aacacagcat tccgccccag atcgtgcggc 2437561 tcaccggtat cactacggcg atggtgggta atgccccgac gatcgacgcc gtcctgccga 2437621 tgttcttcga gttcgccggc gactcggtgc tcgtggccca caacgctggg ttcgatatcg 2437681 gattcctgcg cgccgccgcg aggcggtgcg atatcacctg gccccaacca caggtgttgt 2437741 gcacgatgcg gctggcccgg cgggtgctga gccgagacga agcccctagc gtgcgtctgg 2437801 ccgcgctagc gcggctgttc gccgtcgcca gcaaccccac ccaccgcgcc ctcgacgacg 2437861 ctcgcgccac cgtcgacgtg ctgcacgcac tcatcgagcg agtgggcaac cagggcgtgc 2437921 acacctatgc cgagctgcgc tcgtatctgc ccaacgtgac ccaggcgcag cgctgcaaac 2437981 gggtactggc ggaaacactg ccgcaccggc cgggggtgta cctgttccgc ggaccgtcgg 2438041 gcgaggtgct ctatgtcggc accgcggcgg acttgcgccg ccgggtaagc cagtacttca 2438101 acggcaccga ccgccgcaag cggatgacgg agatggtcat gctggccagc tcgatcgatc 2438161 atgtcgaatg cgcgcacccc ctggaggccg gtgtccgtga gctgcggatg ctgtcgacgc 2438221 atgccccgcc gtataaccgc aggtcgaagt tcccataccg gtggtggtgg gtggcgctca 2438281 ccgatgaagc atttccacgc ctgtcggtca tccgggcccc gcgacacgac cgcgtcgtcg 2438341 gcccgttccg atcccgctcc aaggccgccg agacgacggc gctgctggca cgctgcacgg 2438401 gactgcgaac ctgcaccact cggctgacac gttccgcccg gcacggaccc gcctgccccg 2438461 agctggaagt gtcggcctgc ccggccgccc gcgacgtcac ggccgcgcaa tacgccgagg 2438521 cggtactgcg cgcggcggcc ttgatcggcg gattggacaa cgccgcgctg gccgcggccg 2438581 ttcaacaggt cactgagctc gccgagcgcc gtcgctatga gagcgctgcc cgactgcgtg 2438641 accacctcgc caccgccatc gaggcgttgt ggcatggcca acgattgcga gcactggccg 2438701 cgctgcccga gttgatcgcc gccaagccgg acggccccag ggagggcggc taccaactgg 2438761 ccgtcattcg ccacggccaa ctcgccgctg ccggcagggc accgcgcggg gttcctccga 2438821 tgcctgtggt cgacgccatc cgccgcggcg ctcaggcgat cctgcctacg ccggcaccgc 2438881 tcggcggggc actggtggag gagatcgcgc tcatcgcccg ctggctggcc gagccgggag 2438941 tgcgcatcgt cggggtctcg aacgacgccg cagggttggc ctccccagtg cgctcggccg 2439001 gcccgtgggc agcgtgggcg gcaacggcgc gctcggccca gttggccggc gagcagctca 2439061 gcagaggttg gcagtcagat ctgccgaccg aaccgcaccc atcgcgcgag caactgttcg 2439121 gccgcaccgg tgtcgattgc cgcactggcc cgccgcaacc cctcctccca ggccggcagc 2439181 cattcagcac ggctggataa tccggcgtgg gcgacgatcg caccggcggc gttgagcacc 2439241 acagcgtccc ggaccgggcc cctggcaccg cccaacaccg cgcgcaccgc ggccgcgttg 2439301 gcttgcgcat cgcctccagc cagctggtca agctgggcgc gcgcaaaccc gaatccggcg 2439361 ggatcaaacg tcaacttatc cacgctgccc gccgcaacgc gccagatcgt gctcgtggtg 2439421 gtggtggtca actcgtccag cccatcgtcg ccgtgtacca ccagcacact ggaccggcgc 2439481 gcagcaaaca ccccggccat cacttcggcg aggtcggcga acgcgcatcc gatcagtcca 2439541 gcccggggcc gggccggatt ggtcagcggc ccgagaagat tgaacacggt gggcacaccg 2439601 atctcgcggc gtaccgcggc cgcgtgccgg taggagggat ggaaccgcgg cgcgaagcag 2439661 aacccgatcc caacctccgc gaggctgcgc gcgaccaggt cgggtcccag gtcgatgcgc 2439721 acccccagcg cctccagcgt gtcggcgcca ccggacaacg aggacgccgc tcggttgccg 2439781 tgcttgacca ccggcacacc cgcagccgcc accacaatcg ccgccatggt ggataggttc 2439841 accgtgttga ctccgtcgcc accggtgccg acgacgtcga cggcgtcgtc ggggaccgta 2439901 tcggcgggca acggatgcgc gtggctgagc atgacgccag cgagctcacc gacttcgtcg 2439961 gcggtcggag ccttcatcgt catcgccacc gcgaaggcgg cgatctgcgc cggccgcgca 2440021 ttgccggtca tgatctggtc catggcccag gcagcctggc cccgcgccag atcgcggttg 2440081 tcggtcaacc gccccaaaat ctgcggccag gacggcaccg atgcggcttc tgctttcggc 2440141 gagcccccgc gagatccccc cgaagaaccc tcagctgaca gcgccacgcg ctgatggtcc 2440201 catgaggatc aaccaacccc aaccgcgccc tgaacacgtc gacgacttgc gctaaccaaa 2440261 cggccgggcg acacgcggaa ctgacttacc gaaatttccg acccgggtag agttcgacaa 2440321 ctacaaagcg tcatacttgc ggatgtgacg agtgctgttg ggacctcggg tactgccatc 2440381 acatcgcgcg tgcattcgct gaatcggccc aacatggtca gtgtcggcac catagtgtgg 2440441 ctatccagtg aattaatgtt ctttgctggg ctgttcgcgt tctatttctc ggcacgagct 2440501 caggccggcg ggaattggcc gccgccaccg acagaactga atctgtacca ggccgtcccg 2440561 gtcacgctgg tcctgattgc ctcgtcgttc acctgccaga tgggcgtgtt cgcggccgaa 2440621 cgcggcgaca tcttcgggct gcgccgctgg tatgtgatca cattcctgat gggcctgttc 2440681 ttcgttctgg gccaggccta cgagtatcgc aacctgatgt cgcacgggac gagcatcccc 2440741 agcagcgcat acggcagcgt gttctatctg gccaccggat tccatggact gcacgtcacc 2440801 ggcggcctca tcgccttcat cttcctgctg gtacgcactg ggatgagcaa atttactccg 2440861 gcgcaggcca cagccagcat cgtcgtctct tactactggc atttcgtcga catcgtgtgg 2440921 atcgcgctat tcaccgtgat ctatttcatc cgatgagccg gcgtccgacg aacatcccac 2440981 gaacaggagt gctcggttga cgaaactggg gttcacccga tccggtggca gtaagagtgg 2441041 tcgcacgcga cggcgcctgc gccgccgatt gtccggcgga gtgttgctgc tgatagcgct 2441101 gaccatcgcc ggtggattgg cagctgtgct gacccctacc ccacaggtgg ccgtcgccga 2441161 cgaatcctcc tcggcgttgc tgcgcaccgg caaacaactt ttcgacacct cgtgtgtgtc 2441221 ctgccatggc gccaacctgc agggcgtgcc cgaccacggg ccgagtctga tcggggtcgg 2441281 cgaggccgcc gtctacttcc aggtgtcgac cggccggatg ccggccatgc gcggcgaggc 2441341 acaggcgccg cgcaaagatc cgatcttcga cgaagcacag atcgacgcga tcggcgccta 2441401 cgtgcaagcc aatggcggtg ggccgacggt ggtacgtaac cccgatggca gcattgcaac 2441461 gcagtcgcta cgtggcaacg acctgggccg cggcggcgac ttgttccggc tcaactgcgc 2441521 ctcgtgtcac aacttcaccg gcaagggcgg agcattgtcg tccggcaaat acgcacccga 2441581 ccttgcgccc gccaatgaac agcaaatcct caccgcgatg ctgacgggtc cacagaacat 2441641 gccgaagttc tccaaccgcc agctctcctt cgaagcgaaa aaggacatca ttgcctacgt 2441701 gaaggtcgcc accgaggcgc ggcagcccgg tggttaccta ctcggcggat tcggacccgc 2441761 acccgaaggc atggccatgt ggatcatcgg aatggtcgcc gcgatcgggc tggcactgtg 2441821 gattggggcg cgatcatgag ccgcgccgac gacgatgcag tgggggtacc acccacttgc 2441881 gggggacgaa gcgatgagga ggagcggcgc atagtgcccg gacctaaccc gcaagacggg 2441941 gccaaagacg gggctaaggc aaccgccgtc ccccgtgaac cggacgaagc cgcgctggcc 2442001 gcgatgtcca accaggagct gctcgcattg ggcggcaagc tggatggtgt ccggatcgcc 2442061 tacaaagagc cccgctggcc ggtcgagggc accaaagccg agaagcgcgc cgagcgttca 2442121 gtggcggtgt ggcttttgct aggtggcgtg ttcggactgg cgctgttgct gatcttcctg 2442181 ttctggccgt gggagttcaa ggcggcggat ggcgaaagcg acttcatcta ctcgctgact 2442241 accccgctct acggcctgac tttcggattg tccatcctgt cgatcgccat cggcgccgtg 2442301 ttgtatcaga aaaggtttat tcccgaagag atttcaatcc aggaacgtca cgatggcgct 2442361 tcgcgggaga tcgaccgcaa gacggtggtg gcgaacctga ccgacgcgtt cgagggctcg 2442421 acgatccgac ggcgcaagct gatcgggctg tccttcggcg tgggcatggg tgcgttcggg 2442481 ctaggcacct tggtcgcgtt tgctggtggc ctcatcaaga acccctggaa gccggttgtc 2442541 cccaccgccg agggcaaaaa ggcggtgctc tggacgtcgg gttggacccc ccgctaccag 2442601 ggcgagacga tctatctggc gcgcgccacc ggcacggagg acggaccacc gttcatcaaa 2442661 atgcgcccgg aggatatcga cgccggtgga atggagaccg tttttccctg gcgggagtcc 2442721 gacggcgacg gcaccaccgt cgaatcacac cataagctgc aggaaatcgc gatgggtatc 2442781 cgtaacccgg tgatgctcat ccggatcaaa cccagtgacc tgggccgcgt ggtcaagcgc 2442841 aagggccagg agagtttcaa cttcggcgaa ttcttcgcgt tcaccaaggt ctgctctcat 2442901 ttgggttgcc cgtcatcgct gtacgagcag cagagctacc gaatcctgtg cccttgtcac 2442961 cagtcgcagt tcgacgcatt gcatttcgct aagccgatct tcggtccagc ggcccgcgcc 2443021 ttggcgcaac tgccgatcac gatcgacacg gacgggtatc tggtcgccaa cggtgacttt 2443081 gtcgagcccg tcggaccagc attctgggag cgaacaacaa catgagtccg aaactgagtc 2443141 cgccgaacat tggtgaggtc ctggcccgcc aagccgaaga catcgacacc cggtatcacc 2443201 cctcggcggc gctgcgtcgt cagctcaaca aggtcttccc gacccactgg tcgttcttgc 2443261 tcggcgagat cgctctgtac agcttcgtgg tcctgctgat caccggcgtg tatttgacgc 2443321 tgtttttcga tccgtccatg gtcgacgtca cctacaacgg tgtctatcaa ccgctgcggg 2443381 gcgtcgagat gtcgcgtgcc taccagtccg cgctggacat ttccttcgag gtgcgcggtg 2443441 gcctgttcgt gcgccagatc catcactggg ccgctttgat gttcgcggcg gcaatcatgg 2443501 tgcacctggc acgcatcttt ttcaccggag cgttccggcg gccccgcgag accaactggg 2443561 tgatcggttc gctgttgttg atcctggcga tgttcgaggg ctatttcggc tactcactgc 2443621 ctgacgacct gctgtcggga ctcggtctgc gcgcggcact ctcgtcgatc acgctgggta 2443681 tgccggtaat cgggacctgg ctgcactggg cgctgtttgg cggtgacttc cccggcacca 2443741 tcttgatccc caggctctac gccctgcaca ttttactgtt gccggggatc atcttggcgc 2443801 tgatcgggct gcatctggcg ttggtgtggt tccagaagca cacccagttc cccggcccgg 2443861 gccgcaccga gcacaacgtc gtcggcgtgc gggtgatgcc ggtgttcgcg ttcaagtccg 2443921 gcgcattttt cgcggctatc gtcggtgttc tgggcctgat gggcggcctg ctgcagatca 2443981 acccgatctg gaatctgggg ccctacaagc catcacaggt gtcggcgggc tcgcagccag 2444041 acttctacat gatgtggacc gagggtctgg cccggatctg gccgccgtgg gagttctact 2444101 tctggcatca caccattccc gccccggtct gggtcgccgt gatcatgggc ctggttttcg 2444161 tcctgctacc cgcctaccca ttcctggaga agcggtttac cggcgactac gcgcatcaca 2444221 acctgttgca gcggccacgg gacgttccgg tgcgcaccgc gatcggcgcc atggcgatcg 2444281 ccttctatat ggtgctcact ctcgcggcga tgaacgacat catcgcgttg aagttccata 2444341 tttcgctgaa tgcaaccacg tggattggcc gcatcggcat ggtgattctg ccgccgttcg 2444401 tctacttcat cacatatcgg tggtgtatcg gattgcagcg cagcgatcgg tcggtgctcg 2444461 agcacggcgt cgagaccggc atcatcaagc ggctgcccca tggcgcctac atcgagctgc 2444521 atcagcccct cggcccggtc gacgagcatg gccacccgat accgcttcag tatcagggag 2444581 cgccgctgcc caagcgaatg aacaagctgg gctcggccgg atcgccgggt agtggcagtt 2444641 ttctgttcgc cgactccgcg gcagaggatg cggcgctgcg cgaggcaggg cacgccgccg 2444701 aacaacgtgc ccttgccgca ctgcgcgaac accaggacag catcatgggt tcgccagacg 2444761 gcgagcacta gcccggcgac gacccgggtc ggcacgaccc gggaaggaac cgggcaaatc 2444821 aagcacagcc cggcgacgac ccgggtcggc acgacccggg aaggaaccgg gcaaatcaag 2444881 cacagcccgg cgacgacccg ggtcggcacg acccgggaag gaaccgggca aatcaagcac 2444941 agcccggcga cgacccgggt cggcacgacc cgggaaggaa ccgggcaaat caagcacagc 2445001 ccggcgacga cccgggtcgg cacgacccgg gaaggaaccg ggcaaatcaa gcacagcccg 2445061 gctaactgga ctggggcgcc accacccggc gcagctgccg agagtatagc cactcgatca 2445121 ccggcatgcc cgcggtgacc accccggcca acccgtagct gatccaagat ggcccgtcgt 2445181 gaccgaccgc catgaggtag gtcgccgccg ccacggcaat caacgcaatg ccaatcgcac 2445241 tggtcaacac gactgtcccg cgcaaccaga tccggtccac cgcctcactg gaccactcgg 2445301 cggccacctc gaatgcatcc gcgtgctgta cgggtgccga ctcggccaca gcgcgtttcg 2445361 ccggatgccc ggatccgatc gatcgcccgc cccgcacgga tgcacccgtc ggcctcgtcg 2445421 cgggctcggc ctcagccatg cggcgagctc gcaacagcac cggtatcgcg cccacgatga 2445481 ccagtgcgga gaccacaatt acggcgtaca gcacccacgt ggtgtgcggg tttccggcca 2445541 tctcgtggaa gcccctaccc aggtccatca gggcgacagc ggcggccacc gacacgccgg 2445601 tgaacaccag ccacaccgcg gcacatgccc caaccaggat gcgatcgatg acgtccggcg 2445661 agattacatc cggcccacgc cggtatgcgg aatatctgct caccatcagc agctcgtttg 2445721 cggtccatcg ttggagttcg atgagagcac cgttccgtcg ctcgtggtga tcgagcagtt 2445781 gagtttgctg acccggaaaa ggctggaggc ctccaccgag ccaacgtcgg attgcgagat 2445841 cggggtgacc gtcatggacc acgggatgta cacattgtgc tgtgtccgtc ggcgcccggc 2445901 ggcatcgacg taagtcaccg agataatgtc acccggcgcc ttggtaccgg tcaccgaata 2445961 ggtgacttgc cgcggaccgg tcggcgtggt ggtcgtgggc ggcggtgccg ccgccgttgt 2446021 ggtggtcgcc ggcggcggcg ccgtggttgt cgccggtggg ggcggtggtg gcggcgtcac 2446081 agtgaccgtc tgtgtctccg tcgctgtcgg gatctcggtg gtgggcggtg gggctggtgg 2446141 cggcggtggc ggcgccggct tggtggtcgt gatttcgtcc tgcacgggcg gtgcagagga 2446201 cgtagtgtcg ccggtggcga gtttgctggt atgtggtcgc gtgacgagca acgacaccga 2446261 aaccacgagc gcaacggcgg caattatggc ggcgacaccg accacccacg gccagcgcgg 2446321 agcggccagt tcgtcgtcca ggtcggacga ctcctcatag tcgtcgtagt catagagcct 2446381 gagatcggct ggcacatacg ggccgccggt gacgtgctcg gattccggag cagagtatgc 2446441 ccgagaatat gcgtcggtct cgcccgtctg gtcactgggc agtttgtcgc cgcccccggc 2446501 gacgggcggc aagtggttgc cggaagcccg ttcgtcgccc gtgtcgctga cgggttccga 2446561 ttcgggttcg tcaggttccc gtcccggggg attcggcccg ctcatgtttg cctaccctgt 2446621 ccaactgcct caccaacacg cgtggctttc cgcctgcatc cttgcccgcg cgctcggcgc 2446681 attcttcatt ggtgccacgg aaaccctacc caaccgggca ggaccgagaa gtctgggcaa 2446741 ccgtgctact ggtcaactga tgccctgatt gtgaccttcc cggcgccgga tcagtgcttc 2446801 tcaggaccga cgtaatattc gaagaccaat ccggccgccg aggcgaggat gaatgccaca 2446861 ccggcggcga tcagccacgg gagccacaac gcgatgccga ccgctgccac cgagccggac 2446921 aacgcgacca tgatcggcca ccagctatgc ggactgaaga atccaagttc tcctgcgccg 2446981 tcgctgattt cagcgccttc gtagtcctcg ggccgggaat ctaaccggcg ggccacaaac 2447041 cggaagaagg tggcgacgat caacgccatg ccgccggtaa gcgccagcgc agtggtgcca 2447101 gcccactcga caccaccggt ggcgaacatc gaggtcaaca cgccgtacag caccgccgtc 2447161 accacgaaga acgcggcgac aaactcaaac agtcgggctt cgatatgcat gagcgtccta 2447221 acctacgggc tgcggggcca attcaccgcg gcgagtatca aacgggtggg tggtcaccgc 2447281 aaggggcggc tggttgatcg cccgcagggc ctcggcgttt gtcttcccgt cgatgcgttg 2447341 ctgcaggtag gccttgaaat cgttgggggt cacgacgcgg acctcgaagt tcatcatcga 2447401 gtgatacgtg ccacacatct cggcgcagtg gcccacgaat gctccggtct tggtgatttc 2447461 ttcgatctgg aagacgttga ccgagttgtt tgccaccggg ttaggcatca cgtcacgctt 2447521 gaacaagaac tccggcaccc agaatgcgtg tatcacatcg gctgaggcca tttggaattc 2447581 gatacgcttg ccggacggca gcaccagcac cggaatttcg gtgctggtgc ccaacgtctc 2447641 gaccttgtcg aaattcaggt aggtccggtc ctcggtgttg agcccgcgca ccggcccgac 2447701 cagctcttcg ccgtacttgt ccttgccctc tggcttggaa accatggcgc gcttgcgctc 2447761 cggatcggca ccatcatagg tcagtgtgcc gtctttgaag ttcacccttt gatagccaaa 2447821 cttccaattc cactggaaag acgtgatatc aatcacgacc tcgggatcct tggctatctg 2447881 cagcatcttc tcctgcacca cgacggtgaa ataaaacagc accgagatga tgaggaacgg 2447941 tatgacggtg agaaccagct ctagcggcat gttgtagccg aactggcggg gcaactcagt 2448001 gtcggtgttc ttcttccggt gaaataccgc ggaccagaag atgagacccc acacgattac 2448061 cccaaccgcc agggaggcga tcaccgcccc gatccacagt tctcgattga ggtgtgcctc 2448121 cggggtaatg ccctccggcc aaccgatgcc cagggcttcc gaccagctgc atccactgac 2448181 ggtgacggcc aatgccccca gcattgctgc gagcgccagc tgtcgaagac cacgggcagg 2448241 ccctccggag ccgcgctgag gcctgcactg cgacaagcgt tgcaaacgac ctggcccgcg 2448301 aggtgtcact gttggcgcct cctgtatcac aagctgggcc gactgggata gcaccggctg 2448361 cggcgagaac catcggctaa ctcagacatc gaatactacg cagcgtagac cacgccgccc 2448421 gcgcgggcga cgatgcgggc cgaaacggcc cgctgaggag ccgcgccatc agccccgcgg 2448481 gcgactgcct ggtcgtcgcg acccgccgga cgaggcatcc acaagagtcg ccaagtgggg 2448541 catactgggg cgccgtgtgt ggactgctgg ccttcgtcgc ggccccggcc ggtgctgcgg 2448601 ggcccgaagg tgccgacgct gccagcgcca tcgcccgcgc atcgcatttg atgcgccacc 2448661 gcgggcccga tgaatcgggc acctggcacg ccgtcgatgg cgcctccgga ggcgtcgtgt 2448721 tcgggttcaa ccgactgtcc atcatcgaca tcgcgcactc gcatcagccg ctgcggtggg 2448781 ggccgccgga ggctccggac cgctacgtgc tggtgttcaa cggcgagatc tacaactact 2448841 tggagctgcg tgacgagctg cgcacccagc acggcgctgt gttcgccacc gacggcgacg 2448901 gtgaggcgat cctcgccggc tatcaccact ggggcaccga ggtgctgcag cggttgcgcg 2448961 gcatgttcgc attcgcgctg tgggacaccg tcacccgcga attgttctgc gcgcgagatc 2449021 cgttcggcat caagccgttg tttatcgcca ccggagccgg cggcacggcg gtggccagtg 2449081 agaagaaatg cctgctggac ctcgtcgagt tggtggggtt cgacaccgag atcgaccatc 2449141 gggcgttgca gcactacacc gtcctgcagt acgtgccgga acccgagaca ctgcaccgtg 2449201 gggtacgtcg gctggaatca ggctgcttcg cccggatccg tgccgaccag ctcgcgccgg 2449261 tgatcacccg ttatttcgtg ccgcgatttg cggccagtcc gatcaccaac gacaacgacc 2449321 aggcccgcta tgacgagatc acggcagtgc ttgaggactc ggtggccaag catatgcgcg 2449381 ccgatgtcac cgtcggcgcg tttctgtccg ggggtatcga ctccacggcc atcgcggcgc 2449441 tggccatccg gcacaatccg cggctgatca ccttcaccac cggtttcgag cgcgagggct 2449501 tctccgagat cgacgtcgcg gtggcttcgg cagaggccat cggtgcccgt cacatcgcca 2449561 aggtggtcag cgccgacgag ttcgtcgccg ccctgcccga gatcgtctgg tacctcgacg 2449621 agccggtcgc tgacccagcg ctggtaccgt tgttcttcgt cgcccgcgag gcccgaaagc 2449681 acgtcaaagt ggtgttgtcg ggcgaaggcg ccgacgaact gttcggcggc tacacaatct 2449741 atcgagaacc gctgtcgttg aggccgtttg actacctgcc caagccactg cgccggtcga 2449801 tgggaaaagt ttccaagcca ctgccggagg gcatgcgcgg caagagtctg ctgcaccgcg 2449861 gatcgctgac actcgaagag cgctactacg gcaatgcccg cagtttctcc ggcgcgcagc 2449921 tgcgcgaagt actgcccggg ttccggccgg actggaccca cacagatgtc acggcgccgg 2449981 tctacgccga atcggccggc tgggatccgg tggcgcgaat gcagcacatc gacctgttca 2450041 cctggctgcg cggcgacatt ctggtcaagg ccgacaagat aacgatggcc aactccctgg 2450101 agctgcgggt gccgttcctg gacccggagg ttttcgcggt ggcctcccgg ttgccggcgg 2450161 gcgccaagat cacccgtacc accaccaagt acgcgctgcg gcgcgcgctg gagcctattg 2450221 tgcccgcaca cgtgctgcac cggcccaagc tcgggttccc ggtcccgatc cggcattggc 2450281 tgcgtgccgg cgagctgctg gagtgggcgt atgcgacggt gggctcgtcg caggccggtc 2450341 acttggttga catcgccgcc gtgtatcgca tgctcgacga gcaccggtgc ggcagcagcg 2450401 accacagccg ccggctgtgg accatgctga tctttatgct gtggcacgcg atcttcgtcg 2450461 agcacagcgt ggtgccccag atcagcgagc cgcagtaccc cgtccagttg taaccgcccc 2450521 ttcgcgagca gacgcggaat cgcatcggcg gggcccacac ggtgcgattc cgcgtctgct 2450581 cggcggtgcc gcggctaggc caagccgcgg ctaggccagc acggcgacga tctcggcggc 2450641 cgcgtgctcg ccgtaagcac cagccagcct gctggccgcg gcctcgtagt cccactgcca 2450701 ctcctgagtt ccggtcgact ccagcaccag cacggcaacc agcgagccca gctgcgccga 2450761 acgctccagg cctagtccgg cactgcggcc agtcaggaaa ccggcgcgga acgcgtcgcc 2450821 gacgccggtg gggtcggtct ggctggtttc ggggaccacg ccgacgtgga tggtggtgcc 2450881 gtcaggttct accaaatcga cacccttagg acccaatgtg gtcacccgca ggtcgatctg 2450941 cgccatcaca tcggcctctg accagccggt cttggacagc agcagatccc attcgtagtc 2451001 gttggtgaac aagtaagcag caccgttgac gagcctgcga atttcctcac ccgacagcct 2451061 cgccagctgc tgagacggat cggcggcgaa ggccagcccc agcttgcgac actcctcggt 2451121 gtgcaagaac atcgcctcgg ggtcgttggc gccgatgatc accaactccg gcttgccgat 2451181 ggccgacacc acgtcggcaa gcttgatgtt acgtgcctcc gacatagccc cggggtagaa 2451241 cgatgcgatc tgggccatgt cgacatcggt ggtacaggta aaccgcgccg tgtgcgcggt 2451301 ctcggagatc agaacgtggt cgcagttgac accgcgggct ttcagccagt cgcgataatc 2451361 ggcgaagtcg gcgcctgccg ccccaactag cgcgacctcg ccacctagca caccgatggc 2451421 gaaggccatg tttccggcca cgccgccgcg gtgcatcacc aagtcatcga ctaggaagct 2451481 aagcgacacc ttgtgcaggt gttcgggcag tagctgctcg gaaaatcggc ctggaaaccg 2451541 catcaaatgg tcggtcgcaa tcgaaccggt taccgcgatc gtcacaaaat ctccgtcctt 2451601 cgttcctaag gttgcctagt ctttcaacat tatcggcgcc gcggcccgcc ccgtcgcgtt 2451661 gagagctgac ggcagctgtt gcgctagcct gcctagggag ctcacctgat tgccgatgct 2451721 gccggctgac gcgacgggcg gttgtcgccc tagcagctgg tcccgtccac caccctagga 2451781 gaaccacaat gcccggtccc cactcgccga accccggtgt cggcaccaac ggaccggcgc 2451841 cgtaccccga gccctcatcc cacgaacccc aagccctgga ctacccccac gacctcggcg 2451901 ccgccgaacc ggccttcgcc ccgggaccgg cagacgacgc ggcgctgccg cccgccgcat 2451961 atcccggcgt gccgccgcag gtgtcctacc cgaagcgacg gcacaagcgg ctgctgatcg 2452021 gcattgtggt agccctcgcg ctggtgtcgg ctatgacggc ggcgatcata tacggggttc 2452081 gcaccaacgg agccaacacg gcaggcacat tctcggaggg accggccaaa accgcgattc 2452141 agggatacct caacgcgctg gagaaccgcg atgtggacac catcgttcgc aatgcgctgt 2452201 gcggtatcca cgacggcgtg cgcgacaagc gctccgatca ggccttggcc aagctgagca 2452261 gcgacgcgtt ccgcaagcag ttctcccagg tcgaagtgac ctcgatcgac aaaatcgtgt 2452321 actggtcgca atatcaggcc caggtgctgt tcaccatgca ggtgacacct gccgccggcg 2452381 gcccgccacg cggtcaggtg caaggcatcg ctcagttgct tttccagcgc ggtcaggtct 2452441 tggtgtgctc gtacgtgttg cgcaccgcgg ggtcgtacta gcgttttatc agttgaacga 2452501 atccccgcac gcgcaggagc cggtggcgtt gggattgtcg atggtgaagc cttgcttctc 2452561 aatagtgtcg acgaaatcga tcgacgcgcc ttccacatac ggcgcgctca tccggtccac 2452621 gatcaacctg acaccaccga actccgcggt ttggtcacca tccagcgtcc ggtcgtcgaa 2452681 gaaaaggtta tagcgcaatc cagcgcaccc ccccggctga accgcgatcc gcagcgccag 2452741 atcgtcccgt ccctcctggt ccaacagcga cttcgccttg gcggcggccg cttcggtcag 2452801 gatcacgccg tgggtcttgg cgctcggctc gttctgcacc gtcatgactt ctcctagatg 2452861 tctcatcgtt gggtgggccc cgcccactag cgtttcagcc tgcggaatcc agtctggggt 2452921 ctgcttgggg aaaatcccac ttcctcaacg gtaccctgaa ggaccgctat tcccgagtcg 2452981 cgccgctacc tgagacgcca agcccatgag ctgattggcc gcatcggcca gcgccaaccg 2453041 caccgaaccg gcgtactcag cgatggacaa tgcggccata atgcccgccg accgcaacgc 2453101 ggacttgtcc agcgacacct ggccggccaa cactatcacc ggaattgcga gcgggcgggc 2453161 cgcagccgcg atcgcaccaa ccaccttccc gtgcagggat tgctcgtcga atcggccctc 2453221 accggtgacg atcagctccg catcggcaag gtcgtcggca aaatgcgtgt gctctgcgat 2453281 gattgccgca cccgactggt accggccgcc aaccgcgagc agcccagccc cgataccacc 2453341 ggcggcgccc gcgcccggct cggcgctcac cccgcgcccg gcggccgcgt ccagttcaat 2453401 cgcccatgcc gccagacggc cttccaacac tgcgacggtg gccatgtccg cgcccttctg 2453461 cggcgcgaac accctggccg tgccccatgg tcccagcaat gggtattcga catccgaggc 2453521 ggcgatcacc tcgacgtcgg ccaactgtcg gcgggccgcg tccaggccgc caagctcggc 2453581 aatcatcccc ttccccccgt cggtacatgc gctgcccccc aaccccacca cgatccgagc 2453641 cgccccggcc cgcagtgccg cggcgatgag ctggccgacg cccttgctgt gggccgccag 2453701 cgcggtctcg ggcgtgggcg ggccgccaag caaccccaga ccacaagcct gcgcgcactc 2453761 caaatacgcg gttgccgagc ccggatcgaa cacccacgcc gcgttcacga cggtgttcag 2453821 tggcccgcaa acacgcagcc ggcgggtctc tcctagccgg ctgcccagca cctcaacaaa 2453881 acccggaccg ccatcggatt ggggggcgac gatgaacgaa tcgcctggtc gcgaccgcgt 2453941 ccagccggtc gcaatggccg cggcggcctc caccgcagac aggctgtcgc cgtagcagtc 2454001 cggtgccacc aacacccgca tggcgggcag ctggagtcgg ccgggcccca agctaccggt 2454061 cgcgtcatcc gaggcctgcg agcctttcat cactggccag agtaggtctg cgcacccaca 2454121 cgcgtaccta aacgcacgca aattccaacc gggccccgcc gcgaagtagc ctggcgactg 2454181 tgaagctgct gggccaccgg aagagccatg gacaccaaag ggccgacgca tcacccgatg 2454241 ccgggtcgaa agatggttgc cggcctgatt ccggacgcac gtccgggtcg gacacatcgc 2454301 gcgggtcgca aaccaccggc cccaagggcc ggcccacgcc caagcgcaac caatcccgtc 2454361 gccacaccaa gaagggcccg gtcgcaccgg caccaatgac tgcggcccag gcacgggccc 2454421 ggcgcaagtc gcttgccggc cccaaactta gccgcgagga acggagagcc gaaaaggccg 2454481 caaaccgggc ccggatgacg gaacgccggg aacgcatgat ggccggcgaa gaggcctacc 2454541 tgctcccgcg cgaccggggc ccggtacgcc gctacgtgcg cgatgtggtg gactcccggc 2454601 gcaacctgct cgggctgttc atgccctcgg cgttgaccct gctgttcgtc atgtttgccg 2454661 tgccgcaggt gcagttttac ttgtctccgg cgatgttgat actgctggcc ttgatgacga 2454721 tcgacgcgat catcttgggt cgcaaagttg gccggctggt tgacacgaag ttcccgtcta 2454781 acaccgaaag ccggtggagg ctgggtcttt acgccgccgg ccgagcttcc cagatacgcc 2454841 ggttgcgggc gccccgaccc caagtcgagc gcggcggcga tgttggctaa cggacgccgg 2454901 aagtcatctc acccggtgta caccctagtg ctcagcgggc ggaccgaacc gatcaagccg 2454961 gcgaaaggat gatcggcttc gcgccggtgt cgacgcccga tgcggctgcc gaagcagccg 2455021 cccgcgcccg acaagacagc ttgaccaagc cgcggggagc gctgggcagt ctcgaggacc 2455081 tgtctgtctg ggtcgcgtcg tgccagcagc gctgtccgcc gcggcaattc gagcgcgccc 2455141 gggtggtggt gttcgccggt gaccatggtg tggcccggtc cggggtgtcg gcgtacccgc 2455201 cggaagtcac cgcccagatg gtcgccaaca tcgacgctgg cggggcggcg atcaacgcgc 2455261 tggccgatgt cgcgggcgcg accgtgcggg tcgcggacct ggccgtggac gcggacccgc 2455321 tgtctgagcg catcggcgcg cacaaggtgc gccgcggcag cggcaatatc gccaccgagg 2455381 acgcgttgac caacgacgag accgccgccg cgatcacagc cggccagcag atcgccgacg 2455441 aagaggttga tgccggcgcc gacttgctca tagccggcga tatgggaatc ggaaacacta 2455501 ccgcggccgc ggttcttgtg gcggcgctga ccgatgccga gccggtcgcg gtggtcgggt 2455561 tcgggaccgg tatcgacgac gccggttggg cgcgtaagac ggccgcggtg cgcgacgccc 2455621 tgtttcgggt gcgcccagtg ttgcccgacc cggtcgggtt gctgcgctgc gccggcggcg 2455681 ctgacttggc cgcgatagct ggcttctgcg cgcaggccgc ggtccgacgc accccgctgc 2455741 tgcttgacgg ggtggcggtg acagccgccg ccctggtcgc tgagcgtctt gcgcccggcg 2455801 ctcaccggtg gtggcaggcg ggtcatcgat ccagcgaacc gggccacggg ctggcgctgg 2455861 cagccctcgg gctggacccg atcgtggacc ttcacatgcg gctgggcgag ggaaccggcg 2455921 ccgcggtggc gttgatggtg ttgcgcgccg cggtcgcggc gctgtcgtcg atggcgacct 2455981 tcaccgaggc cggcgtgtcc acccggtccg tcgacggtgt cgaccggacc gcacccccgg 2456041 cagtctcacc gtgatgcgtt cgctggcaac agctttcgca ttcgcaacgg tgatacccac 2456101 accgggctca gcgaccaccc cgatgggccg tggcccgatg accgcgctgc cggtggtggg 2456161 cgcggcgctg ggtgcactgg cggcggcgat cgcatgggct ggcgcgcaag tgttcggccc 2456221 gtccagcccg ctgtccggca tgctcacggt ggcggtactg ctggtcgtca ctcgaggcct 2456281 gcacatcgat ggcgttgccg ataccgctga cggactgggc tgctatgggc cgccgcagcg 2456341 tgcgcttgcg gtgatgcgcg acgggtcgac cggaccgttc ggggtggcgg ccgtggtctt 2456401 ggtcatcgcc ttgcagggcc tggccttcgc gaccctcacc acggtcggga tcgctgggat 2456461 cacgctggcg gtcttatccg gccgggtcac cgccgtactg gtctgtcgcc ggtcggtgcc 2456521 ggcagcccac ggcagcaccc tgggctcgcg ggtcgccggt acgcaacccg cgccggtggt 2456581 ggcggcctgg ctcgccgtcc tgctcgccgt ttcggtgccg gccggtcccc ggccttggca 2456641 aggaccgata gcggttctgg tagcggtgac ggccggcgcg gccctggcgg cgcattgcgt 2456701 gcaccggttc ggcggtgtca ccggtgacgt gctgggcagc gcgatcgagc tgagcacgac 2456761 ggtcagcgcc gtgacgcttg cgggcttggc ccggctttag caggcggcga gcgggacgct 2456821 gcagtagact catgtccgcc gtcccttcca acacagggct cccctccgtg tccccagatt 2456881 aggggacatg aaattcaacc gacggtgtcc gattggcgga tcgttttggc cgcgcggcat 2456941 atatagcgtc gttaatcatg cccgcatcac gactggtcag acaagtgtct gcgccacgga 2457001 acctgttcgg gcggctggtt gcccaggggg gcttctacac ggccgggctg cagttgggca 2457061 gcggtgcggt ggtactgccg gtcatctgcg cacatcaggg cctcacctgg gcggctgggc 2457121 tgttgtatcc ggcgttctgc attggcgcca ttctgggaaa ttcgctgtcg ccgctgattc 2457181 tgcagcgcgc cggccagctc cggcacctgc tgatggcggc gatatcggcg acggcggcgg 2457241 cgctggttgt gtgcaacgct gcggtcccct ggactggcgt tggcgtcgcc gcggtttttt 2457301 tggcgaccac gggggccggt ggtgtcgtca ccggagtctc cagcgtcgcc tacaccgaca 2457361 tgatctccag catgttgccc gcggtacggc ggggcgagct actgctcacc caaggtgccg 2457421 cggggtcggt gctggccacc ggcgtcacat tggtgattgt gccgatgctg gcccatggca 2457481 acgagatggc gcgctatcac gatctgctgt ggctgggcgc cgcaggtctg gtttgctccg 2457541 gcatcgcggc gctgttcgtc ggcccgatgc ggtctgtgtc cgtcacaacc gccacccgaa 2457601 tgccactgcg ggaaatctat tggatgggct tcgcgatcgc ccgctcccag ccgtggtttc 2457661 gccggtatat gacgacttac ctgctgttcg ttccgatcag cctgggcacc acgttcttca 2457721 gcctgcgcgc cgcccagtcc aacggcagtc tgcacgtgct ggtgatcctt tccagcattg 2457781 gattggtcgt cggttcgatg ctgtggcgac agataaaccg cctgttcggg gtgcgtggcc 2457841 tgctgctggg cagcgcactg ctcaacgccg ctgctgcgct gctgtgcatg gtggccgagt 2457901 cgtgtgggca gtgggttcac gcctgggcgt acggcacggc gttcctgctg gctacggtgg 2457961 ccgctcaaac ggtggtcgcc gcatcgatat cgtggatcag cgtcctcgcg cccgagcggt 2458021 accgcgccac cctgatctgc gttgggtcga ccttggccgc cgtcgaagcc accgtgctgg 2458081 gagttgcgct cggcggaatt gcccaaaagc atgccaccat ctggccggtt gtcgtcgtgc 2458141 tgacactggc cgtaatcgcc gcggtggcga gtctgcgcgc accgacacga atcggggtga 2458201 cggcggacac gagcccgcaa gcagcgacct tgcaagccta ccgcccggcc actcctaacc 2458261 ccatccatag cgatgaacgt tcgacgccgc ccgaccatct ctcagtccgc cgcgggcagt 2458321 tacgacacgt atgggacagt cgccggcccg cgccacccct gaaccggcca agctgtcgcc 2458381 gcgcggcccg ccgtccagcg cccggcaaac ccgctgccgc actaccccag ccgcgccatc 2458441 cagccgtggg tgtccgcgaa ggtgccccgc tggatgccgg tcagcgtatc gcgtagtgcc 2458501 atggtcacct cacccggctg accgtcggcg attctgaact cgctggcacc gtgccgcacc 2458561 cgcgcgaccg gggtgatgac agcggcggtg ccgcacgcaa acacctcggt gatctcgccg 2458621 gcggcggctt tcttctgcca ctcgtcgata tcaatcctgc gttcctcgac cgcaaatccg 2458681 gcatcaatag ccaactgcaa caacgaatcc cgtgtgatcc cgggcagcag ggaaccggac 2458741 agctccgggg tgaccagccg cgccgatccg ccgctgccga gcacgaagaa gatgttcatg 2458801 ccacccatct cttcgatata gcggcgttcc acagcgtcca gccacaccac ctggtcgcat 2458861 ccgttctcgg cggcttcggc ctgcgccagc aacgaggcgg cgtagttgcc gccgaacttg 2458921 gccgcaccgg tgccgcccgg acaggcccgt acatactccg tcgaaaccca gacgctgaca 2458981 ggggcgatgc cgcccttgaa gtacgcaccg gccggcgagg cgatcaacag gtaacggtat 2459041 tgggtggcag gccgcacgcc cagtcccggc tcggtggcga agatgaacgg ccgcagatac 2459101 agcgcctcct caccgccggc accgggcacc caagctttgt cgacagcgat tagctggcgc 2459161 agggattcga tgaacaccgc gtcgggcagt tcgggaatcg ccaaccgccg cgccgacgaa 2459221 cgcaacctgg cggcgttggc gtcggcgcga aacgacacga tggacccgtc ggcccagcgg 2459281 taggctttga gcccttcgaa cacctcctgc gcatagtgca gcacgatcgc cgagggatcc 2459341 agctcgatcg ggccataagg gattacccgc gcgttgtgcc aaccacggcc ctcggcatag 2459401 tcgatcgaca ccatatggtc ggtgtggtat ttgccgaaac ccggcgcccg cagcatcgat 2459461 tcacgctgcg cgtcggtggc cggattgacc gcacgtaaca ccgtgaattg aagggagccg 2459521 ctggtcatgg gccgattcta tccgtgggcg aacggttatt gacggcccgg aggccactcc 2459581 gctgccacca agtggtgact cagcgcgttt tcacggcaac gaacggcgga cacaccactt 2459641 gacattcgac agcacggccg cggacgtcga cattgatttg ctggccgtct tcgatgccgg 2459701 catcactgtc gatcagcgcc agcccgatgc cgacctgcaa cgtgggagaa aacgttcccg 2459761 acgtggtgac cccaaccgtc tcatccccga caagcacagc cagcccgggg cgcagcacac 2459821 cgcgaccgac catgcgcagc ccccgcagca gccgccgcgg cccggccgct ttctcggcca 2459881 acaacgccgc acgaccaaag aaggcgtcct tccgccagcc gaccgcccag ccgcatcggg 2459941 cctgcagcgg cgagatgtcc agcgaaagct cgtgcccgtg cagcggatag cccatttcag 2460001 tgcgcagtgt gtcgcgagca ccgaggccgg cgggctcgcc gcccgcggct gataccgccg 2460061 ccaacagtgc gtcgaacacc acacccgccg actcccatgg cggcagcagt tcgtaaccgt 2460121 gctcaccggt gtagccggtg cgacagacac gcaccggcac ccccgagtac gaagcgtcgg 2460181 cgtagcccat gtagtccatc tcggttggca gccccaacgc ggtgagcacg tcggtcgaac 2460241 acggcccctg tacggccagc accgcgtagg accgatgcag attggtgatg ctcagaccgc 2460301 ccggtgcggc agcttgtagc gcgccgacca ccgcggcggt attggcggcg ttgggcacca 2460361 gaaagatctc gtcgtcgctg acgtagtagg cgatcaggtc gtcgatcaca ccgccggatt 2460421 cggtgcagca caaggtgtat tgcgccttgc cgggcccgat acgacccagg tcgttggtga 2460481 gcgcggagtt gacgaactgc gccgcacccg gtccacggac cagtgccttg cccaggtggc 2460541 tgacgtcgaa aaggccgacg gcggtgcggg tggcgttgtg ctcgctgacg gttccggcat 2460601 acgagaccgg catcagccag ccgccgaact cggcgaaact cgcacccagc tcgcgatggc 2460661 ggtcttccag cggtccgtgt atcagctctg gcacatcgct cacggcgtcc caccctaatg 2460721 ggcgtccctg ctggcacact taggcaggtg tacgattcct tggacttcga cgccctcgag 2460781 gccgccggaa ttgccaaccc acgcgagcgg gccggcttgc tcacctacct ggatgagctt 2460841 ggcttcacgg tcgaagagat ggtgcaagcc gaacgccgcg gccggttgtt cgggctggcc 2460901 ggtgacgtcc tgctatggtc cgggcccccg atctacaccc tggcgaccgc ggctgacgaa 2460961 ctggggttgt cagccgacga cgtcgcacgc gcgtggagtt tgctcggcct caccgtcgcg 2461021 ggtcccgacg ttcccacgct gagccaggcc gacgtcgacg ccctggcgac ctgggtcgca 2461081 ctgaaggcgc tggtgggtga ggacggcgca ttcggcctgc tgcgagtgct cggcactgcc 2461141 atggcccgac tcgccgaggc cgagtcgacc atgatccgcg ccgggtcacc gaacatccaa 2461201 atgacgcaca cccacgacga acttgccacg gcacgggcct atcgcgcggc tgcggagttc 2461261 gtcccccgga tcggtgcgct gatcgacacc gtccaccgtc accacctggc cagcgcacga 2461321 acctactttg aaggcgtcat tggcgacacg tcggcaagcg tgacgtgcgg tatcggcttt 2461381 gcggatctgt ccagcttcac cgcgttgacc caggcgctca cccccgcgca gttgcaggac 2461441 ctgctcaccg aattcgacgc cgccgtcacc gacgtggtgc atgccgacgg tggccggttg 2461501 gtgaagttca tcggcgacgc cgtgatgtgg gtgagctcgt cgcccgaacg actggtgcgg 2461561 gcggcggtgg atctcgtcga tcatccgggt gcgcgcgcgg ccgaactgca ggtccgtgcc 2461621 ggtcttgcct atggcacggt gctggccctt aacggtgact acttcggcaa cccggtcaac 2461681 ctggctgcgc gcctggtggc ggccgcagcg ccagggcaga tcctggccgc agcgcaactc 2461741 cgcgacatgt tgccagactg gcctgccctc gcccatggcc cattgacgct caaggggttt 2461801 gacgccccgg tgatggcctt cgaactgcac gacaaccctc gtgcgaggga tgctgacacg 2461861 ccaagccccg ccgccagtga ttagggtggt tgcccgtgac caccgaaccg ggttacctat 2461921 ccccctccgt cgccgtcgcg acctcgatgc cgaaacgtgg tgtcggcgct gcggtgttga 2461981 tcgtgccggt cgtctcgacc ggcgaagagg atcggcccgg cgcggtcgtt gcctcggccg 2462041 agcccttcct gcgcgccgac acggttgccg aaatcgaggc gggcctgcga gcgctggacg 2462101 ccaccggcgc cagtgaccag gtgcaccggc tggcggtgcc gtcgttgccg gtgggcagcg 2462161 tcctgacggt cggcctgggc aaaccgcggc gcgaatggcc ggccgatacc atccgctgcg 2462221 ccgccggcgt ggccgcgcgt gcgctcaaca gttcggaggc agtgatcacc acgctagccg 2462281 aattacctgg cgacggcatc tgctcggcca ccgtcgaggg gctgatcctg ggcagctacc 2462341 gattcagcgc cttccgcagc gacaagaccg cgcccaaaga cgccggactc cgcaaaatca 2462401 ccgtgctctg ctgtgcaaag gacgccaaga agcgcgcgtt gcacggtgcg gccgtcgcga 2462461 ccgcggtggc caccgcccgg gacttggtca acactccccc aagccacctg tttcccgccg 2462521 agttggctaa gcgcgcaaag actttgagcg aatctgtcgg cctcgacgtg gaagttatcg 2462581 acgaaaaggc gctgaagaag gccggctatg gcggggtgat tggtgtcggc cagggctcgt 2462641 cgcggccgcc gcgactggtg cggttgattc atcggggatc gcggctggcc aagaaccccc 2462701 aaaaggccaa gaaggtggcc ttggttggca aggggatcac cttcgatacc ggcggcatct 2462761 cgatcaagcc ggcagcgtcg atgcaccaca tgacctcgga catgggcgga gcggccgcgg 2462821 tgatcgccac tgtcacgctg gctgcccggc tgcgactgcc gattgacgtg atcgccacgg 2462881 tgccgatggc cgagaacatg ccgtcggcga cggcgcagcg cccgggcgac gtgctgaccc 2462941 aatacggtgg gaccaccgtc gaggtgctca acaccgacgc ggagggccgg ttgatcctgg 2463001 ccgacgccat cgtccgggca tgtgaggaca agccggacta tctgatcgag acatccacgt 2463061 tgaccggtgc gcaaacggtg gcgctgggga cgcgcatacc gggtgtgatg ggcagcgacg 2463121 agttccgcga ccgggtcgcc gcgatctcgc agcgggtggg cgagaacggc tggccgatgc 2463181 cgctgcccga tgacctcaag gatgacttga aatccacggt ggccgacctg gccaatgtga 2463241 gtggccagcg tttcgcaggc atgctggtgg ccggggtttt cctgcgtgag ttcgtcgccg 2463301 aatcggtgga ttgggcgcac atcgacgtgg ccggcccggc ctacaacacc ggcagcgcct 2463361 ggggttacac gcccaagggc gccaccggtg tgcccacccg caccatgttc gcggtgctcg 2463421 aggacatcgc gaagaacggg taggcggccg cccggaccca aagcacttca cgagtagcgg 2463481 ttagatcacc cgcagccgcg cggtactgcg cagcgcctgc ggcagcaccc gggagatgcc 2463541 gtatagcgca taggcttccg gcgcgaccgg tctgatcggc ttcttcttct tgaccgcgga 2463601 cacgatcgcg tcggctacct tgtccggccc gtagctgcgc agcgcaaaca tcttgtcgat 2463661 ctgcccccgc cggccgtcga tcttctcctc gtcggttccg ggcgcgtgga aaccggtggt 2463721 agcgacgatg ttggtgtcaa tgacaccggg gcagatggtg gtcagtccga caccggcggc 2463781 atcgagttcg gcccgcaaac agtcggagaa catgtaggtc gccgctttgg aggtgcagta 2463841 cgcgctgagc gactgcagcg gcgcataggc ggccatcgac gacacgttga cgatgtgccc 2463901 gccagtcccc cgctcgacca gacgctgccc aaaagcgcgg caaccgttca ccacgccgcc 2463961 caggttgacg gccagcaccc ggtcgaactg ctcagccggg gtgtccagga accgacccgc 2464021 ctggccgatg ccggcgttgt tgacgacaat gtcggggacc ccgtgttcgg cgctgacccg 2464081 ctcggcgaat gcctcgaccg cctcggcgtc ggacacgtcg agcacatagg ggtacgcgat 2464141 gccaccacgt gcggcgatct cggcggcggt gtccttgacg gtggcctcgt cgatgtcgct 2464201 gataacgatc tctgcaccct cacgagcaaa ggcgagcgcg gtctcgcggc cgattccgct 2464261 gcccgccccg gtaaccgaca ccagcgtgtc accgaagtac ccgcggggcc gtccgacctg 2464321 ggcgcgtaac agcgcgcggc tcggctgctt gccgtcggcc aggtcggcga agtcgtgcac 2464381 ggcggccgcc atcacctgcg ggtgcgacat cggcgaaaag tgaccagctt tgatgtcacg 2464441 ccgccagagc cgcggcaccc agcgcgccgt ctggtcgtat ccgtagggcc gcacgtaggg 2464501 gtcctgggaa ttgacgatca gctgcaccgg cacatcaact atcggaatgg cccggccgcg 2464561 gcggctgctg gaaaacgacc gaaagtagtt tgcggggtaa gtcttgaccg agtgggcggc 2464621 atcacgggcc agcgtctccg agtgatgaat ctggtcgacg ggaatgtcgc cgaccatgtt 2464681 gcggcggacg gccgcactcg acagcgcaac ccgaagcagc agcggtgcga ccaccggtac 2464741 cgagaacaag gccatgtagc tcaaccgcag tgtctggctg atcgcccgta gaaaggttcg 2464801 cggacgccaa ggccgccgca gaccgccata aacgtagttg accaggtggt cttgactggg 2464861 gccggacacc gacgtgaacg aggcgacccg atcactggct ccgggccggc gcaggtactc 2464921 ccacaccccc accgaacccc agtcatgggc cagcacgtgc accggctcac cggggctcag 2464981 ctcgccgatg acggcgtcga aatcgtcggc gaaatgggcc atggtgtagg ccgaaatggg 2465041 tttgggcacc gatgagcgac cgacaccacg gttgtcgtag cgaacgatcc ggaaccgttc 2465101 ggccagcagc ggaacgacac cgtcccacag cacgtgcgag tccggaaagc catgcaccag 2465161 cacgacggtc gggccgtcgg gattgccttc gtggtagacc gcgatgcgaa cgccatccgg 2465221 gctgtcgacc agacgggaca tctgttgtgt tgccggcatc gcacctccgc ccaccgggac 2465281 ttgctgttgc aaccagtcgc ccaaaccgta gcaaggacgg ccgactgcac cgatgtcccc 2465341 gccgaggtgt cggcaacggc cgccggggcc accaactcgc cgcgccctgg atgtgtgtcg 2465401 ctccgggcgc agtgacagga taggtttcga catccacctg ggttccgcac ccggtgcgcg 2465461 accgtgtgat aggccagagg tggacctgcg ccgaccgacg atcgatcgag gagtcaacag 2465521 aaatggcctt ctccgtccag atgccggcac tcggtgagag cgtcaccgag gggacggtta 2465581 cccgctggct caaacaggaa ggcgacacgg tcgaactcga cgagcccctc gtggaggtgt 2465641 cgaccgacaa ggtcgacacc gaaatcccct cgccggccgc gggtgtgctg accaagatca 2465701 tcgcccagga ggatgacacg gtcgaggtcg gcggcgagct cgctgtcatt ggcgacgcca 2465761 aggatgccgg cgaggccgcg gccccggcac ccgagaaagt ccctgcggcc caacccgagt 2465821 ccaagccggc acccgaacca ccaccggtcc aaccgacgtc cggagcgcct gctggtggcg 2465881 atgccaagcc ggtgctgatg cccgagctcg gcgaatcggt gaccgagggg accgtcattc 2465941 gttggctgaa gaagatcggg gattcggttc aggttgacga gccactcgtg gaggtgtcca 2466001 ccgacaaggt ggacaccgag atcccgtccc cggtggctgg ggtcttggtc agtatcagcg 2466061 ccgacgagga cgccacggtg cccgtcggcg gcgagttggc ccggatcggt gtcgctgccg 2466121 acatcggcgc cgcgcccgcc cccaagcccg cacccaagcc cgtccccgag ccagcgccga 2466181 cgccgaaggc cgaacccgca ccatcgccgc cggcggccca gccagccggt gcggccgagg 2466241 gcgcaccgta cgtgacgccg ctggtgcgaa agctggcgtc ggaaaacaac atcgacctcg 2466301 ccggggtgac cggcaccgga gtgggtggtc gcatccgcaa acaggatgtg ctggccgcgg 2466361 ctgagcaaaa gaagcgggcg aaagcaccgg cgccggccgc ccaggccgcc gccgcgccgg 2466421 ccccgaaagc gccgcctgcc cctgcgccgg cgttggcaca tctacggggc accacccaga 2466481 aggccagccg gattcgtcag atcaccgcca acaagacccg cgaatctttg caggcaacgg 2466541 cacagctgac acaaacccat gaggtcgaca tgaccaagat cgtggggcta cgggcccggg 2466601 ccaaggcggc gttcgccgag cgtgagggcg tgaacctgac cttcctgccg ttcttcgcca 2466661 aggccgtgat cgatgccctc aagattcacc cgaacatcaa cgctagctac aacgaggaca 2466721 ccaaggagat cacctactac gacgccgagc acctaggatt cgctgtcgac accgagcagg 2466781 gcctgctctc cccggtcatc cacgacgccg gcgatctgtc actggccggt ctggcgcggg 2466841 cgatcgccga tatcgcggcc cgtgcccggt cgggcaacct gaaacccgac gagttgtccg 2466901 gcggcacctt caccatcacc aacatcggta gccagggcgc gttgttcgac accccgatcc 2466961 tggttccgcc gcaggccgcc atgctgggca ccggggcgat cgtcaagcgg ccgcgggtgg 2467021 tcgtcgatgc cagcggcaac gagtcgatcg gggtgcgctc ggtctgctac ctcccgttga 2467081 cctatgacca tcggctcatc gacggcgccg acgccggacg tttcctcacc acgatcaagc 2467141 accgcctcga agagggagcg ttcgaggccg atttaggact gtgatggcca acgccgttgt 2467201 cgcgatcgcg ggttcgtctg gcttgatcgg ctctgccctg accgcggcgc tgcgcgcggc 2467261 cgaccacacg gtgctgcgga tcgtgcgccg ggcacctgcg aattccgaag aactgcactg 2467321 gaatcccgaa agcggcgaat tcgatccgca cgcgctcacc gatgtcgacg ccgtggtcaa 2467381 cctctgcggc gtcaacatcg cccagcgtcg gtggtcgggg gctttcaaac agagcctgcg 2467441 cgacagccgg atcacaccca ccgaggtgct atccgccgca gtcgccgacg ccggcgtcgc 2467501 taccttgatc aacgccagcg cggtgggcta ctacggaaac accaaggacc gggtggtcga 2467561 cgaaaacgac tcggcgggaa caggttttct ggcccagctg tgcgttgact gggaaaccgc 2467621 cacgcggccg gcgcagcaga gcggtgcccg cgtggtgctg gcccggaccg gagtggtgct 2467681 gtctccggcg gggggcatgc tgcgacgcat gcggccactg ttttcggtgg gcctgggcgc 2467741 gcggctgggc agcggccggc aatatatgtc atggatcagc ctggaggacg aggtgcgggc 2467801 gctgcagttc gctatcgcgc agcccaacct gtccggcccg gtgaacttga ccgggccggc 2467861 ccccgttacc aacgccgaat tcaccaccgc gtttggccgc gccgtcaacc gccctacccc 2467921 gctgatgttg cctagcgtcg cggtacgcgc ggcgtttggt gagttcgccg acgaggggtt 2467981 gctcattggt cagcgcgcca tcccctccgc gctggagcga gccggatttc agttccacca 2468041 caacaccatt ggcgaggcgc tcggctacgc caccacccgg cccggctagg cttgaccccg 2468101 tctgcccagc cgtgcgctgg cggccgagta gcctagctat cgtgacgggt tctatccggt 2468161 cgaagctgtc cgcgatcgac gtccgccagc tggggaccgt cgactaccgg accgcgtggc 2468221 agctacagcg agagctagcc gacgcccggg tcgccggcgg cgccgacacg ctgctgctgt 2468281 tggaacaccc cgcggtctac accgccggac ggcgtaccga gacacacgag cgacccattg 2468341 acggcactcc ggtcgtcggc accgaccgcg gcggcaagat cacctggcac ggtccggggc 2468401 aattggtcgg ctacccgatc atcgggctgg ccgaacccct cgacgtggtc aattacgttc 2468461 ggcgccttga agaatcgctg atccaagtct gcgccgatct gggcctgcac gccggccgcg 2468521 tcgacggccg gtccggggtc tggctgcccg gcaggccggc gcgcaaggtc gcggccatcg 2468581 gtgtccgggt gtcgcgggcg acgacactgc acgggtttgc gctcaactgc gattgtgatt 2468641 tggctgcctt caccgccatc gtgccatgcg gaatcagtga cgccgcagtg acatcgctgt 2468701 ccgccgaact cggccgtacg gtcaccgtcg acgaggtccg cgcgacggtc gccgccgctg 2468761 tctgcgccgc tctggacggc gtcctaccgg tcggtgaccg cgtgccctca cacgccgtac 2468821 catcgccgtt atgagtgtcg ctgccgaggg ccggcgcctg ttacgcctgg aggtgcgcaa 2468881 cgcgcagacc ccaatcgagc gcaaaccgcc gtggatcaag acacgagccc gcatcgggcc 2468941 ggagtacacc gagctgaaga acctggtccg ccgcgagggg ctgcacacgg tctgcgagga 2469001 ggccggctgc cccaacatct tcgaatgctg ggaggaccga gaagccacct tcctgatcgg 2469061 cggtgaccag tgcacccgcc gatgcgattt ctgccagatc gacaccggaa agcccgccga 2469121 gctggaccgc gacgagccac gccgagtcgc cgacagcgtg cgcacgatgg gcctgcgcta 2469181 tgccaccgtc accggcgtgg ctcgcgacga cctgcctgac ggcggggcct ggctgtacgc 2469241 cgcgaccgtg cgcgccatca aggaactcaa tccgtcgacc ggcgtcgaac tgctgattcc 2469301 cgacttcaac ggcgaaccaa cccggctggc cgaggtcttc gagtccggcc cggaagtcct 2469361 ggcacacaat gtcgaaaccg tgccccgtat cttcaagcgg atccggccgg cgttcacgta 2469421 ccggcgcagc ctgggtgtgc ttaccgctgc gcgcgacgcc ggcctggtca ccaagagcaa 2469481 cctcatcctc ggcctgggcg aaacctccga cgaggtgcgc accgccctgg gcgatctgcg 2469541 cgacgccggc tgcgacatcg ttaccatcac ccaatacctg cggccgtcgg cgcgccacca 2469601 tccggtcgag cgctgggtga agcccgagga gttcgtccag ttcgcgcgat tcgccgaagg 2469661 gctgggcttc gccggggtat tggcgggacc cctggttagg tcgtcatatc gggcgggccg 2469721 gctctacgaa caggcacgta actcacgggc cttggcatcc cgctagccag cgtttacgta 2469781 ttctggacga ttatggcgaa accccgaaat gccgctgaaa gcaaggccgc caaagctcag 2469841 gcaaacgctg ctcgtaaggc tgccgcccgc cagcgccgcg ctcagctgtg gcaagcgttc 2469901 accctgcagc gcaaggagga taagcgcctg ctgccgtaca tgattggtgc tttcttgctg 2469961 atcgtgggcg catcggtggg ggtcggggtg tgggctggcg ggttcaccat gctcacgatg 2470021 atcccgctgg gggtgctgct gggtgcactg gtggcgttcg tcatcttcgg ccggcgagcc 2470081 cagcgaacgg tttaccgcaa agccgaaggc caaaccggcg cagccgcctg ggcgctggac 2470141 aacctgcggg gcaagtggcg ggtgacgccc ggggtggccg ccaccggcaa cctcgacgcc 2470201 gtgcaccggg tgatcggccg gcccggtgtc atcttcgtcg gcgagggatc agcggcccgc 2470261 gtcaaaccac tgctggctca ggagaaaaag cgcaccgcgc gactggtcgg ggacgtgccg 2470321 atctacgaca ttatcgtcgg caacggcgat ggcgaggttc cgctggccaa gttggagcgc 2470381 cacctcaccc gccttccggc caacatcacg gtcaagcaga tggacacggt ggagtcgcga 2470441 ctggcggcgc tgggttcgcg tgccggtgcg ggcgtcatgc ccaagggacc gctacccacc 2470501 acggccaaga tgcgcagcgt ccagcgcacg gtccgccgta agtaacgcgg ctcagcgtcg 2470561 caccaccgcc gtagcagtga gccgatcgtg cagcccacgc ccgtccgagt cggtgaacag 2470621 cggcggaacc accagcccga tcagcaggcc acgcaccacc agacggccga tccccaccgg 2470681 ccgccggcca cccactgcca ccacgaccag acccagcatc aactgcccgg gtgtgaatcc 2470741 gaacaagcgg accgccgcca ccccgagcag cagccaaatc accaggacaa ccgtcgacag 2470801 catcggggtc gaccaaacac cgaattccac gcccagcaac gccagaccgt aggcgatcag 2470861 ccagtcgatc agcagagccg ccagccggcg ccccatcgga gccagcgaac ccggtccggt 2470921 gtccggcaag cccagcgtct tgccgggata gtcgggcggc gatttcgccg tcatcgggca 2470981 gacccgataa ccaggttccc gttcggcatg ccaccggtta cgatcttgcc gaccatggcc 2471041 ccacaatagg gccggggaga cccggcgtca gtggtgggcg gcacggtcag taacgtctgc 2471101 gcaacacggg gttgactgac gggcaatatc ggctccatag cgtcggccgc ggatacagta 2471161 aaggagcatt ctgtgacgga aaagacgccc gacgacgtct tcaaacttgc caaggacgag 2471221 aaggtcgaat atgtcgacgt ccggttctgt gacctgcctg gcatcatgca gcacttcacg 2471281 attccggctt cggcctttga caagagcgtg tttgacgacg gcttggcctt tgacggctcg 2471341 tcgattcgcg ggttccagtc gatccacgaa tccgacatgt tgcttcttcc cgatcccgag 2471401 acggcgcgca tcgacccgtt ccgcgcggcc aagacgctga atatcaactt ctttgtgcac 2471461 gacccgttca ccctggagcc gtactcccgc gacccgcgca acatcgcccg caaggccgag 2471521 aactacctga tcagcactgg catcgccgac accgcatact tcggcgccga ggccgagttc 2471581 tacattttcg attcggtgag cttcgactcg cgcgccaacg gctccttcta cgaggtggac 2471641 gccatctcgg ggtggtggaa caccggcgcg gcgaccgagg ccgacggcag tcccaaccgg 2471701 ggctacaagg tccgccacaa gggcgggtat ttcccagtgg cccccaacga ccaatacgtc 2471761 gacctgcgcg acaagatgct gaccaacctg atcaactccg gcttcatcct ggagaagggc 2471821 caccacgagg tgggcagcgg cggacaggcc gagatcaact accagttcaa ttcgctgctg 2471881 cacgccgccg acgacatgca gttgtacaag tacatcatca agaacaccgc ctggcagaac 2471941 ggcaaaacgg tcacgttcat gcccaagccg ctgttcggcg acaacgggtc cggcatgcac 2472001 tgtcatcagt cgctgtggaa ggacggggcc ccgctgatgt acgacgagac gggttatgcc 2472061 ggtctgtcgg acacggcccg tcattacatc ggcggcctgt tacaccacgc gccgtcgctg 2472121 ctggccttca ccaacccgac ggtgaactcc tacaagcggc tggttcccgg ttacgaggcc 2472181 ccgatcaacc tggtctatag ccagcgcaac cggtcggcat gcgtgcgcat cccgatcacc 2472241 ggcagcaacc cgaaggccaa gcggctggag ttccgaagcc ccgactcgtc gggcaacccg 2472301 tatctggcgt tctcggccat gctgatggca ggcctggacg gtatcaagaa caagatcgag 2472361 ccgcaggcgc ccgtcgacaa ggatctctac gagctgccgc cggaagaggc cgcgagtatc 2472421 ccgcagactc cgacccagct gtcagatgtg atcgaccgtc tcgaggccga ccacgaatac 2472481 ctcaccgaag gaggggtgtt cacaaacgac ctgatcgaga cgtggatcag tttcaagcgc 2472541 gaaaacgaga tcgagccggt caacatccgg ccgcatccct acgaattcgc gctgtactac 2472601 gacgtttaag gactcttcgc agtccgggtg tagagggagc ggcgtgtcgt tgccagggcg 2472661 ggcgtcgagg tttttcgatg ggtgacggtg gccggcaacg gcgcgccgac caccgctgcg 2472721 aagagcccgt ttaagaacgt tcaaggacgt ttcagccggg tgccacaacc cgcttggcaa 2472781 tcatctcccg accgccgagc gggttgtctt tcacatgcgc cgaaactcaa gccacgtcgt 2472841 cgcccaggcg tgtcgtcgcg gccggttcag gttgagtgtc ggggattcgt cgtgcgggcg 2472901 ggcgtccacg ctgaccaacg gggcagtcaa ctcccgaaca ctttgcgcac taccgccttt 2472961 gcccgccgcg tcacccgtag gtagttgtcc aggaattccc caccgtcgtc gtttcgccag 2473021 ccggccgcga ccgcgaccgc attgagctgg cgcccgggtc ccggcagctg gtcggtgggc 2473081 ttgccgcgca ccaacaccag cgcgttgcgg gcccgggtgg cggtcagcca ggcctgacgg 2473141 agcagctcca cgtcggctgc gggaaccaga tcggcggccg cgatgacatc cagggattgc 2473201 agcgtcgagg tgttgtgcag ggcgggaacc tggtgcgcat gctgtagctg cagcaactgc 2473261 acggtccatt cgatgtcggc cagtccgccg cggcccagtt tggtgtgtgt gttggggtcg 2473321 gcaccgcgcg gcaaccgctc ggactcgata cgggccttga tgcggcgaat ctcgcgcacc 2473381 gagtcagcgg acacaccgtc gggcggatac cgcgttttgt cgaccatccg taggaatcgc 2473441 tgacccaact cggcatcgcc ggcaaccgcg tgtgcgcgta gcagggcctg gatctcccat 2473501 ggctgtgccc actgctcgta gtatgcggcg taggacccca gggtgcggac cagcggaccg 2473561 ttgcggccct cgggtcgcaa attggcgtcg agctccagcg gcggatcgac gctgggtgtc 2473621 cccagcagcg cccgaacccg ctcggcgatc gatgtcgacc atttcaccgc ccgtgcatcg 2473681 tcgacgccgg tggccggctc acagacgaac atcacgtcgg catccgaccc gtagcccaac 2473741 tcggcaccac ccagccgacc catgccgatg accgcgatgg ccgccggggc gcgatcgtcg 2473801 tcgggaaggc tggcccggat catgacgtcc agcgcggcct gcagcaccgc cacccacacc 2473861 gacgtcaacg cccggcacac ctcggtgacc tcgagcaggc cgagcaggtc cgccgaaccg 2473921 atgcgggcca gctctcgacg acgcagcgtg cgcgcgccgg cgatggcccg ctccgggtcg 2473981 gggtagcggc tcgccgaggc gatcagcgcc cgagccacgg cggcgggctc ggtctcgagc 2474041 agcttcgggc ccgcaggccc gtcctcgtac tgctggatga cccgcggcgc gcgcatcaac 2474101 agatccggca catacgccga ggtacccaag acatgcatga gccgcttggc caccgcgggc 2474161 ttgtcccgca gcgtggccag gtaccagctt tcggtggcca gcgcctcact gagccgccgg 2474221 taggccagca gtccgccgtc gggatcgggg gcatacgaca tccagtccag cagcctgggc 2474281 agcagcaccg actgcacccg tccgcgccgg ccgctttgat tgaccaacgc cgacatgtgt 2474341 ttcaacgcgg tctgcggtcc ctcgtagccc agcgcggcca gccggcgccc cgcggcctcc 2474401 aacgtcatgc cgtgggcgat ctccaacccg gtcgggccga tcgattccag cagcggttga 2474461 tagaagagtt tggtgtgtaa cttcgacacc cgcacgttct gcttcttgag ttcctcccgc 2474521 agcaccccgg ccgcatcgtt tcggccatcg ggccggatgt gggccgcgcg cgccagccag 2474581 cgcactgcct cctcgtcttc gggatcggga agcaggtggg tgcgcttgag ccgctgcaac 2474641 tgcagtcggt gctcgagcag cctgaggaac tcatacgacg cggtcatgtt cgccgcgtcc 2474701 tcacgcccga tgtagccgcc ttcgcccaac gccgccaatg cgtccaccgt ggacgccacc 2474761 cgtaacgact cgtcgctacg ggcatgaacc agctgcagta gctgtacggc gaactccacg 2474821 tcgcgcaatc cgccgctgcc gagtttgagc tcgcggccgc ggacatcggc gggcaccagc 2474881 tgctccaccc gccgccgcat ggcctgcacc tcgaccacaa agtcttcgcg ctcgcaggct 2474941 cgccacacca tcggcatcaa ggcggtcagg taacgctcgc caagttccgc gtcgccaacg 2475001 actggccgtg ctttcagcaa cgcctgaaac tcccaggtct tggcccagcg ctggtagtag 2475061 gcgatgtgcg actcgagcgt acggaccagc tccccgttgc gcccctccgg acgcagggcg 2475121 gcgtccacct cgaaaaaggc cgccgaggcc acccgcatca tctcgctggc cacgcgcgcg 2475181 ttgcgcgggt cggagcgctc ggcaacgaat atgacatcga cgtcgctgac gtagttcagt 2475241 tcgcgcgcac cgcacttgcc catcgcgatg accgccaggc gcggtggcgg gtgctcgccg 2475301 cacacgctcg cctcggccac gcgcagcgcc gccgccagag cggcgtccgc ggcgtccgcc 2475361 aggcgtgcgg ccaccacggt gaatggcagc accggttcgt cctcgaccgt cgcggccagg 2475421 tcgagagcgg ccagcattag cacgtagtcg cggtactggg ttcgcaatcg gtgcacgagc 2475481 gagcccggca taccctccga ttcctcgacg cactcgacga acgaccgctg cagctggtca 2475541 tgggacggca gtgtgacctt gccccgcagc aatttccagg actgcggatg ggcgaccagg 2475601 tgatcgccca acgccagcga cgagcccagc accgagaaca gccgcccgcg cagactgcgt 2475661 tcgcgcagca gagccgcgtt gagctcgtcc catccggtgt ctggattctc cgacagccgg 2475721 atcaaggcgc gcagcgcggc atcggcgtcc ggagcgcgtg acagcgacca cagcaggtcg 2475781 acgtgcgcct gatcctcgtg ccgatcccac cccagctgag ccagacgctc accagcaggg 2475841 gggtcaacta atccgagccg gccaacgctg ggcaacttcg gccgctgcgt ggcgagtttg 2475901 gtcacgacca cgacggtagc gcaaagcgcg tcggcgtcgg atcaaccggt agatctgggc 2475961 tacagcgaca ggtaggtgcg cagctcgtat ggcgtgacgt ggctgcggta gttcgcccac 2476021 tccgtgcgct tgttgcgcaa gaaaaagtca aaaacgtgct cccccaaggc ctccgcgacg 2476081 agttcggagg cctccatggc gcgcagcgca ctatccaaac tggacggcaa ttctcggtac 2476141 cccatcgctc ggcgttcctc gggtgtgagg tcccatacgt tgtcctcggc ctgcgggccc 2476201 agcacgtaac ccttctctac accccgcaat cccgcggcca gcagcacggc gaatgtcaga 2476261 tagggattgc acgccgaatc agggctgcgt acttcgaccc gccgcgacga ggtcttgtgc 2476321 ggcgtgtaca tcggcacccg cactagggcg gatcggttgg cggcccccca cgacgcggcc 2476381 gtgggcgctt cgccgccctg caccagccgc ttgtaagagt tgacccactg atttgtgacc 2476441 gcgctgatct cgcaagcgtg ctccaggatc ccggcgatga acgatttacc cacttccgac 2476501 agctgcagcg gatcatcagc gctgtggaac gcgttgacat caccctcgaa caggctcatg 2476561 tgggtgtgca tcgccgagcc cgggtgctgg ccgaatggct tgggcatgaa cgacgcccgg 2476621 gcgccctctt ccagcgcgac ttctttgatg acgtagcgga aggtcatcac gttgtcagcc 2476681 atcgacagag cgtcggcaaa ccgcaggtcg atctcctgct ggccgggtgc gccttcgtga 2476741 tggctgaact ccaccgagat gcccatgaat tccagggcat cgatcgcgtg gcggcgaaag 2476801 ttcaaggcgg agtcgtgcac cgcttggtcg aaatagccgg cgttgtcgac cgggacgggc 2476861 accgacccgt cctcgggtcc gggcttgagc aggaagaact cgatttcggg atgcacgtag 2476921 caggagaagc cgagttcgcc ggccttcgtc agctgccgcc gcaacacgtg ccgcgggtcc 2476981 gcccacgacg gcgagccgtc cggcatggtg atgtcgcaaa acatccgcgc tgagtggtgg 2477041 tggccggaac tggtggccca gggcagcacc tggaaggtcg acgggtccgg gtgcgccacc 2477101 gtatcggatt ccgagacccg cgcaaagccc tcgatcgagg atccgtcgaa gccgatgcct 2477161 tcctcgaagg cgccctcgag ttcggctggg gcgatggcga ccgacttgag gaaaccgagc 2477221 acgtctgtga accacagccg gacgaagcgg atgtcgcgtt cttccagggt acgaagaacg 2477281 aattccttct gtcggtccat acctcgaaca gtatgcactg tctgttaaaa ccgtgttacc 2477341 gatgcccggc cagaagcgtt gcggggcggc ccgcaagggg agtgcgcggt gagttcaggg 2477401 cgcgcaccgc agactcgtcg gcggcaaggt cccgtcgaga aaatagtgca tcaccgcaga 2477461 gtccacacac tggttgccat cgaacaccgc agtgtgttgg gtgccgtcga aggtgatcag 2477521 cggtgcgccc agctggcggg ccaggtctac cccggactga tacggagtgg ccgggtcgtg 2477581 ggtggtggac accacgacga ccttgccagc cccggccggc gccgcggggt gcggcgtcga 2477641 cgttgccggc accggccaca gcgcgcacag atcgcggggg gcggatccgg tgaactgccc 2477701 gtagctaagg aacggggcga cctgacggat ccgttggtcg gcggccaccc aggccgctgg 2477761 atcggccggt gtgggcgcat cgacgcaccg gaccgcgttg aacgcgtcct ggtcgttgct 2477821 gtagtgcccg tctgcatccc ggccgtcata gtcgtcggca agcaccagca agtcgccggc 2477881 gtcgctgccg cgctgcagcc ccagcagacc actggtcagg tacttccagc gctgagggct 2477941 gtacagcgcg ttgatggtgc ccgtcgtcgc gtcggcgtag ctcaggccac gtggatccga 2478001 cgtcttaccc ggcttctgca ccagcgggtc aaccagggcg tggtagcggt tgacccactg 2478061 ggccgagtcg gtgcccagag ggcaggccgg cgagcgggcg cagtcggcgg cgtagtcatt 2478121 gaaagcggtc tgaaatcccg ccatttggct gatgctttcc tcgattgggc taacggctgg 2478181 atcgatagcg ccgtcgagga ccatcgcccg cacatgagta ccgaaccgtt ccaggtaagc 2478241 ggtgcccaac tcggtgccgt agctgtatcc gaggtagttg atctgatcgt cacctaacgc 2478301 ttggcgaacc atgtccatgt cccgtgcgac ggacgcggta ccgatattgg ccaagaagct 2478361 gaagcccatc cggtcaacac agtcctgggc caactgccgg tagacctgtt cgacgtgggt 2478421 gacaccggcc ggactgtagt cggccatcgg atcgcgccgg tacgcgtcga actcggcgtc 2478481 ggtgcgacac cgcaacgcag gggtcgagtg gccgacccct ctcgggtcga agcccaccag 2478541 gtcgaagtgg cggagaatgt cggtgtcggc gatcgcgggt gccatagcgg cgaccatgtc 2478601 gaccgccgac gccccgggtc ccccaggatt gaccagcagt gctccgaatc gctgtcccgt 2478661 cgcggggacg cggatcaccg ccaacttcgc ttgtgtccca ccgggttggt cgtagtcgac 2478721 ggggacggac accgtcgcgc agcgtgcagt gcgaatttcg ctggtgtcgg cgatgaactc 2478781 gcggcagctg ttccaactct gttgcggcgc cacgaccggc gcacccgggg tttggccggc 2478841 gccgggttct tcagtcgcgc cggccaacgg gggcgctgct aggggcagtc cgccgagcag 2478901 caacccgaag gacagcagcg ccgagctcaa cggtctgcgg cgccacatgg ccgccatcgt 2478961 ctcaccggcg aatacctgtg acggcgcgaa atgatcacac cttcgtctct tcgccccgct 2479021 agcacttggc gccgctgggc ggcgtggtgc cgccgatcaa atacgccgtc acgtactcgt 2479081 caatgcagct gtcgccctgg aataccaccg tgtgctgggt tccgtcgaag gtcagcaacg 2479141 aaccgcgaag ctggttcgcc aggtcgaccc cggccttgta cggcgtcgcc gggtcatggg 2479201 tggtggatac caccaccgtc ggcactaggc cgggcgccga gacggcatgg ggctgacttg 2479261 tgggtggcac cggccagaac gcgcaggtgc ccagcggcgc atcaccggtg aacttcccgt 2479321 agctcatgaa cggtgcgatc tcccgggcgc ggcggtcttc gtcgatgacc ttgtcgcgat 2479381 cggtaaccgg gggctgatcg acgcaattga tcgccacccg cgcgtcaccg gaattgttgt 2479441 agcggccgtg cgagtcccga cgcatgtaca tgtcggccag agccagcagg gtgtctccgc 2479501 gattgtcgac cagctccgac agcccgtcgg tcaagtgttg ccacagattc ggtgagtaca 2479561 gcgccataat ggtgcccacg atggcgtcgc tataactcag cccgcgcgga tccttcgtgc 2479621 gcgccggcct gctgatcctc gggttgtccg ggtcgaccaa cggatcgacc aggctgtggt 2479681 agacctcgac ggctttggcc gggtcggcgc ccagcgggca gcccgcgttc ttggcgcagt 2479741 cggcggcata gttgttgaac gcgtcctgga agcccttggc ctggcgcagc tccgcctcga 2479801 tgggatcggc attggggtcg acggcaccgt cgagaatcat tgcccgcacc cgctgcggaa 2479861 attcctcggc atacgcggag ccgatccggg tgccgtacga gtagcccagg taggtcagct 2479921 tgtcgtcgcc caacgccgcg cgaatggcat ccaggtcctt ggcgacgttg accgtcccga 2479981 catgggccag aaagttcttg cccatcttgt ccacacagcg accgacgaat tgcttggtct 2480041 cgttctcgat gtgcgccaca ccctcccggc tgtagtcaac ctgcggctcg gcccgcagcc 2480101 ggtcgttgtc ggcatcggag ttgcaccaga tcgccggccg ggacgacgcc accccgcggg 2480161 ggtcgaaccc aaccaggtcg aacctttcgt gcacccgctt cggcaatgtc tggaagacgc 2480221 ccaaggcggc ctcgataccg gattcgccgg gtccaccggg atttatgacc agcgaaccga 2480281 tcttgtctcc cgtcgccgga aagcgaatca gcgccagcgc cgccacgtca ccatcggggc 2480341 ggtcgtagtc gaccggtaca gcgagcttgc cgcataacgc gccgccgggg atctttactt 2480401 gcgggtttga cgaccggcac ggtgtccact ccaccggctg gcccagcttc ggctccgcca 2480461 tacgagcgcg tcccccgacc acgcggatgc agcccacaag aaccaacgcc acggcggcga 2480521 gcgcggccca gatcaacagc atgcgcgcga tcttgtcgcg gcgagacagc ctcatgccca 2480581 caatgctgcc agagcagacc cgagatcctg gccagcggcc accgtcggcc gactaaccgg 2480641 ccgctgccag cagtcctgcc atcgccgatg gcgaactcgt cggccatccc ccatacgtcc 2480701 ggtaacagat ccgggcaaga caccgacccg tcgaccggat ccggcacggg cgcgtcggcc 2480761 tcggcggtgc acaactgcga catcaggttg gcgctggcac cccgtccacg ccggcatggt 2480821 gcaccttggc catcgcccga gggcgatccc cgatgccgtc caccccttcg acgaacccat 2480881 ctcccacggc ggtcgccggc agcgacgcga tgtggccgca gatctccgag agttcggccc 2480941 gcccgcccgg cgacggcaac ccgatgccgt gcaagtgacg atcgatgtga ggttcaaggt 2481001 tcagcgcact gctggcaagc tttttccgaa accgcggcct cgccttgatc tggagtcaga 2481061 acgcgtcacg cagccggtca aaggcgtaac ccatgctcga gcaaacatgc atgggctgag 2481121 tggacgtttc cagacacagc aactggcgtc caggccactg agccgctgca tgcgcgatgg 2481181 tatgccgatg ggggccccgg gcgcgtctga ggggaagaag tggcagactg tcagggtccg 2481241 acgaacccgg ggaccctaac gggccacgag gatcgacccg accaccatta gggacagtga 2481301 tgtctgagca gactatctat ggggccaata cccccggagg ctccgggccg cggaccaaga 2481361 tccgcaccca ccacctacag agatggaagg ccgacggcca caagtgggcc atgctgacgg 2481421 cctacgacta ttcgacggcc cggatcttcg acgaggccgg catcccggtg ctgctggtcg 2481481 gtgattcggc ggccaacgtc gtgtacggct acgacaccac cgtgccgatc tccatcgacg 2481541 agctgatccc gctggtccgt ggcgtggtgc ggggtgcccc gcacgcactg gtcgtcgccg 2481601 acctgccgtt cggcagctac gaggcggggc ccaccgccgc gttggccgcc gccacccggt 2481661 tcctcaagga cggcggcgca catgcggtca agctcgaggg cggtgagcgg gtggccgagc 2481721 aaatcgcctg tctgaccgcg gcgggcatcc cggtgatggc acacatcggc ttcaccccgc 2481781 aaagcgtcaa caccttgggc ggcttccggg tgcagggccg cggcgacgcc gccgaacaaa 2481841 ccatcgccga cgcgatcgcc gtcgccgaag ccggagcgtt tgccgtcgtg atggagatgg 2481901 tgcccgccga gttggccacc cagatcaccg gcaagcttac cattccgacg gtcgggatcg 2481961 gcgctgggcc caactgcgac ggccaggtcc tggtatggca ggacatggcc gggttcagcg 2482021 gcgccaagac cgcccgcttc gtcaaacggt atgccgatgt cggtggtgaa ctacgccgtg 2482081 ctgcaatgca atacgcccaa gaggtggccg gcggggtatt ccccgctgac gaacacagtt 2482141 tctgaccaag ccgaatcagc ccgatgcgcg ggcattgcgg tggcgccctg gatgccgtcg 2482201 acgccggatt gccggcgcgg acgcgccagc gggacccatc ggcgtcgcgt tcgccggttg 2482261 agcccggggt gagcccagac attcgatgtg cccaacacca tccgccacag cccaattgat 2482321 gtggcactct atgcatgcct atccccgacc aaccaccacc gcggcgacgc atcatgaccg 2482381 gaggcgaaga tgccagtaga ggcgcccaga ccagcgcgcc atctggaggt cgagcgcaag 2482441 ttcgacgtga tcgagtcgac ggtgtcgccg tcgttcgagg acatcgccgc ggtggttcgc 2482501 gtcgagcagt cgccgaccca gcagctcgac gcggtgtact tcgacacacc gtcgcacgac 2482561 ctggcgcgca accagatcac cttgcggcgc cgcaccggcg gcgccgacgc cggctggcat 2482621 ctgaagctgc cggccggacc cgacaagcgc accgagatgc gagcaccgct gtccgcatca 2482681 ggcgacgctg tgccggccga gttgttggat gtggtgctgg cgatcgtccg cgaccagccg 2482741 gttcagccgg tcgcgcggat cagcactcac cgcgaaagcc agatcctgta cggcgccggg 2482801 ggcgacgcgc tggcggaatt ctgcaacgac gacgtcaccg catggtcggc cggggcattc 2482861 cacgccgctg gtgcagcgga caacggccct gccgaacagc agtggcgcga atgggaactg 2482921 gaactggtca ccacggatgg gaccgccgat accaagctac tggaccggct agccaaccgg 2482981 ctgctcgatg ccggtgccgc acctgccggc cacggctcca aactggcgcg ggtgctcggt 2483041 gcgacctctc ccggtgagct gcccaacggc ccgcagccgc cggcggatcc agtacaccgc 2483101 gcggtgtccg agcaagtcga gcagctgctg ctgtgggatc gggccgtgcg ggccgacgcc 2483161 tatgacgccg tgcaccagat gcgagtgacg acccgcaaga tccgcagctt gctgacggat 2483221 tcccaggagt cgtttggcct gaaggaaagt gcgtgggtca tcgatgaact gcgtgagctg 2483281 gccaatgtcc tgggcgtagc ccgggacgcc gaggtactcg gtgaccgcta ccagcgcgaa 2483341 ctggacgcgc tggcgccgga gctggtacgc ggccgggtgc gcgagcgcct ggtagacggg 2483401 gcgcggcggc gataccagac cgggctgcgg cgatcactga tcgcattgcg gtcgcagcgg 2483461 tacttccgtc tgctcgacgc tctagacgcg cttgtgtccg aacgcgccca tgccacttct 2483521 ggggaggaat cggcaccggt aaccatcgat gcggcctacc ggcgagtccg caaagccgca 2483581 aaagccgcaa agaccgccgg cgaccaggcg ggcgaccacc accgcgacga ggcattgcac 2483641 ctgatccgca agcgcgcgaa gcgattacgc tacaccgcgg cggctactgg ggcggacaat 2483701 gtgtcacaag aagccaaggt catccagacg ttgctaggcg atcatcaaga cagcgtggtc 2483761 agccgggaac atctgatcca gcaggccata gccgcgaaca ccgccggcga ggacaccttc 2483821 acctacggtc tgctctacca acaggaagcc gacttggccg agcgctgccg ggagcagctt 2483881 gaagccgcgc tgcgcaaact cgacaaggcg gtccgcaaag cacgggattg agcccgccag 2483941 gggcggacga gttggcctgt aagccggatt ctgttccgcg ccgccacagc caagctaacg 2484001 gcggcacggc ggcgaccatc catctggaca caccgttacc gggtgcctcg agcggcctac 2484061 ccgcaggctc gggcgagcaa ccctcaagcg cctgcgcggc cgcactttcg gtgcggcctt 2484121 cttggccttg cttcgggtgg ggtttgccta gccaccccgg tcacccggaa tgctggtgcg 2484181 ctcttaccgc accgtttcac ccttgccacc acgaggatgg cggtctgttt tctgtggcac 2484241 tttcccgcga gtcacctcgg attgccgtta gcaatcaccc tgctctgtga agtccggact 2484301 ttcctcgact cgacgctgaa cctcgtgaat ccacacaagc cctacgcgag ccgcggccgc 2484361 ccagccaact catccgcgac gaccacgcta ccccgctggg cggtgtcgcg gccagtgtga 2484421 ccgctggacg acacggctag tcggacagcc gatccggcgg gcagtcctta tcgtggactg 2484481 gtgacacggt gggacaaacg cgtcgactcc ggcgactggg acgccatcgc tgccgaggtc 2484541 agcgagtacg gtggcgcact gctacctcgg ctgatcaccc ccggcgaggc cgcccggctg 2484601 cgcaagctgt acgccgacga cggcctgttt cgctcgacgg tcgatatggc atccaagcgg 2484661 tacggcgccg ggcagtatcg atatttccat gccccctatc ccgagccgat cgagcgtctc 2484721 aagcaggcgc tgtatcccaa actgctgccg atagcgcgca actggtgggc caaactgggc 2484781 cgggaggcgc cctggccaga cagccttgat gactggttgg cgagctgtca tgccgccggc 2484841 caaacccgat ccacagcgct gatgttgaag tacggcacca acgactggaa cgccctacac 2484901 caggatctct acggcgagtt ggtgtttccg ctgcaggtgg tgatcaacct gagcgatccg 2484961 gaaaccgact acaccggcgg cgagttcctg cttgtcgaac agcggcctcg cgcccaatcc 2485021 cggggtaccg caatgcaact tccgcaggga catggttatg tgctcacgac ccgtgatcgg 2485081 ccggtgcgga ctagccgtgg ctggtcggca tctccagtgc gccatgggct ttcgactatt 2485141 cgttccggcg aacgctatgc catggggctg atctttcacg acgcagcctg attgcacgcc 2485201 atctatagat agcctgtctg attcaccaat cgcaccgacg atgccccatc ggcgtagaac 2485261 tcggcgatgc tcagcgatgc cagatcaaga tgcaaccgat ataggacgcc cgacccggca 2485321 tccaacgcca gccgcaacaa cattttgatc ggcgtgacat gtgacaccac cagcaccgtc 2485381 gcgccttcgt agccaacgat gatccgatca cgtccccgcc gaacccgccg cagcacgtcg 2485441 tcgaagcttt ccccacccgg gggcgtgatg ctggtgtcct gcagccagcg acggtgcagc 2485501 tcgggatcgc gttctgcggc ctccgcgaac gtcagcccct cccaggcgcc gaagtcggtc 2485561 tcgaccaggt cgtcatcgac gaccacgtcc agggccaggg ctctggcggc ggtcaccgcg 2485621 gtgtcgtaag cccgctgtag cggcgaggag accaccgcag cgatcccgcc gcgccgcgcc 2485681 agatacccgg ccgccgcacc aacctggcgc caccccacct cgttcaaccc cgggttgccg 2485741 cgccccgaat agcggcgttg ctccgacagc tccgtctgcc cgtggcgcaa caaaagtagt 2485801 cgggtgggtg taccgcgggc gccggtccag ccgggagatg tcggtgactc ggtcgcaacg 2485861 attttggcag gatccgcatc cgccgcagcc gattgcgcgg cggcgtccat cgcgtcattg 2485921 gccaaccggt ctgcatacgt gttccgggca cgcggaaccc actcgtagtt gatcctgcga 2485981 aactgggacg ccaacgcctg agcctggaca tagagcttca gcagatccgg gtgcttgacc 2486041 ttccaccgcc cggacatctg ctccaccacc agcttggagt ccatcagcac cgcggcctcg 2486101 gtggcaccta gtttcacggc gtcgtccaaa ccggctatca ggccgcggta ttcggcgacg 2486161 ttgttcgtcg cccggccgat cgcctgcttg gactcggcca gcacggtgga gtgatcggcg 2486221 gtccacacca ccgcgccgta tccggccggt ccgggattgc cccgcgatcc gccgtcggct 2486281 tcgatgacaa ctttcactcc tcaaatcctt cgagctgcaa caagatcgct ccgcattccg 2486341 ggcagcgcac cacttcatcc tcggcggccg ccgagatctg ggccagctcg ccgcggccga 2486401 tctcgatccg gcaggcacca catcgatgac cttgcaaccg cccggcccct ggcccgcctc 2486461 cggcccgctg tctttcgtag agccccgcaa gctcgggatc aagtgtcgcc gtcagcatgt 2486521 cgcgttgcga tgaatgttgg tgccgggctt ggtcgatttc ggcaagtgcc tcgtccaaag 2486581 cctgctgggc ggcggccagg tcggcccgca acgcttggag cgcccgcgac tcggcggtct 2486641 gttgagcctg cagctcctcg cggcgttcca gcacctccag cagggcatct tccaaactgg 2486701 cttgacggcg ttgcaagctg tcgagctcgt gctgcagatc agccaattgc ttggcgtccg 2486761 ttgcacccga agtgagcaac gaccggtccc ggtcgccacg cttacgcacc gcatcgatct 2486821 ccgactcaaa acgcgacacc tggccgtcca agtcctccgc cgcgattcgc agggccgcca 2486881 tcctgtcgtt ggcggcgttg tgctcggcct gcacctgctg gtaagccgcc cgctgcggca 2486941 gatgggtagc ccgatgcgcg atccgggtca gctcagcatc cagcttcgcc aattccagta 2487001 gcgaccgttg ctgtgccact ccggctttca tgcctgatct ctcccagttt cgtgatcgag 2487061 gttccacggg tcggtgcaga tggtgcacac acgcaccggc agcgacgcgc cgaaatgaga 2487121 ccgcaacact tcggcggcct ggccgcacca cgggaattcg cttgcccaat gcgcgacgtc 2487181 gatcagggcc acttgcgaag ctcggcaatg ctcgtcggct ggatgatgtc gcagatcggc 2487241 cgtaacgtac gcttgcacgt ccgcggcggc cacggtggca agcaacgagt ccccggcgcc 2487301 gccgcagacc gcgacccgcg acaccagcag gtcgggatcc ccggcggcgc gcacaccggt 2487361 cgcagtcggc ggcaacgcgg cctccagacg ggcaacaaag gtgcgcagcg gttcgggttt 2487421 tggcagtctg ccaatccggc ctaacccgct gccgaccggc ggtggtacca gcgcgaagat 2487481 gtcgaatgcc ggctcctcgt aagggtgcgc ggcgcgcatc gccgccaaca cctcggcgcg 2487541 cgctcgtgcg ggtgcgacga cctcgacccg gtcctcggcc acccgttcga cggtaccgac 2487601 gctgcctatg gcgggcgacg ccccgtcgtg cgccaggaac tgcccggtac ccgcgacact 2487661 ccagctgcag tgcgagtagt cgccgatatg gccggcaccg gcctcaaaga ccgctgcccg 2487721 caccgcctct gagttctcgc gcggcacata gatgacccac ttgtcgagat cggccgctcc 2487781 gggcaccggg tcgagaacgg cgtcgacggt cagaccaaca gcgtgtgcca gcgcgtcgga 2487841 cacacccggc gacgccgagt cggcgttggt gtgcgcggta aacaacgagc gaccggtccg 2487901 gatcaggcgg tgcaccagca caccctttgg cgtgttggcc gcgaccgtat cgaccccacg 2487961 cagtaacaac gggtggtgca ccaatagcag tccggcctgg ggaacctggt ccaccaccgc 2488021 cggcgtcgcg tccaccgcaa cggtcaccga atccaccacg tcgtcggggt cgccgcacac 2488081 cagacccacc gaatcccacg actgggcaag ccgcggcggg taggcctggt ccagcacgtc 2488141 gatgacatcg gccagccgca cactcatcgg cgtcctccac gctttgccca ctcggcgatc 2488201 gccgccacca gcacgggcca ctccgggcgc accgccgccc gcaggtaccg cgcgtccagg 2488261 ccgacgaagg tgtcaccgcg gcgcaccgca attcctttgc tctgcaaata gtttcgtaat 2488321 ccgtcagcat cggcgatgtt gaacagtacg aaaggggccg caccatcgac cacctcggca 2488381 cccaccgatc tcagtccggc caccatctcc gcgcgcagcg ccgtcaaccg caccgcatcg 2488441 gctgcggcag cggcgaccgc ccggggggcg cagcaagcag cgatggccgt cagttgcaat 2488501 gttcccaacg gccagtgcgc tcgctgcacg gtcaaccgag ccagcacgtc tggcgagccg 2488561 agcgcgtagc ccacccgcaa tccggccagc gaccacgttt tcgtcaagct acggagcacc 2488621 agcacatcgg gcagcgagtc atcggccaac gattgcggct cgccgggaac ccaatcagcg 2488681 aacgcctcgt cgaccaccag gatgcgtccc ggccggcgta actcgagcag ctgctcgcgg 2488741 aggtgcagca ccgaggtggg gttggtcgga ttacccacga cgacaaggtc ggcgtcgtca 2488801 ggcacgtgcg cggtgtccag cacgaacggc ggctttagga caacatggtg cgccgtgatt 2488861 ccggcagcgc tcaaggctat ggccggctcg gtgaacgcgg gcacgacgat tgctgcccgc 2488921 accggactta ggttgtgcag caatgcgaat ccctccgccg ccccgacgag cgggagcact 2488981 tcgtcacggg ttctgccatg acgttcagcg accgcgtctt gcgcccggtg cacatcgtcg 2489041 gtgctcggat agcgggccag ctccggcagc agcgcggcga gctgccggac caaccattcc 2489101 gggggccggt catggcggac gttgacggcg aagtccagca cgccgggcgc gacatcctga 2489161 tcaccgtggt agcgcgccgc ggcaagcggg ctagtgtcta gactcgccac agcgtcaaac 2489221 agtagtgggc cggtgtgcgg gccaagaatc cagagcaccg ccgacgcgtt gtctacgcgg 2489281 cgacaaccgc gacatcacag gcagctaaca gggcgtcggc ggtgatgatc gtcaggccaa 2489341 gcagctgtgc ctgggcgatg agcacacggt cgaatggatg tcgatggtga tccggaagct 2489401 ctgcggtgcg cagtgtgtgc gtggtcaact gacagcggcg acgtgccgca gcggcgcatt 2489461 cgatcgggca cgtaagaggc cgatggctcg ggcggcggga gcttgccgag gcggtagttg 2489521 atcgcgatct cccaggcact ggcggccgac aagagaatgc tgttgcggac gtcctgaaca 2489581 atcgcccgtg tttcgttgac ggcatccgca gccaaacgtg ggtgtcgatg aggtagcgct 2489641 tcaccggtga aagcgttcga gcacgtcgtc tgacaacgga gcgtccaaat cgtcgggcac 2489701 gcggtacacg ccatggtcaa tgcctaaccg ccgagtctca tgaggatgca gcggcacaag 2489761 ctttgctacc ggctcgccgc ggcgggcaat ctcaacctct gcccgccgta gacgagccgc 2489821 agcagctcgg acaggcgtgt cttcgcctcg tgaacgccga cccgcttcgc aggcgcccag 2489881 actttcgcgt cgaccacctg ctcaccaaac ttcgcgatca tcgcctgata ccacagcgcc 2489941 aacgggtagc ggtttgtcca accgcttcgt caacgacaat gggatcgtga ccgacacgac 2490001 cgcgagcggg accaattgcc cgcctcctcc acgcgccgcc gcacggcgcg catcgtcgcc 2490061 gggtgaatcg ccgcagctgg tgatcttcga tctggacggc acgctgaccg actcggcgcg 2490121 cggaatcgta tccagcttcc gacacgcgct caaccacatc ggtgccccag tacccgaagg 2490181 cgacctggcc actcacatcg tcggcccgcc catgcatgag acgctgcgcg ccatggggct 2490241 cggcgaatcc gccgaggagg cgatcgtagc ctaccgggcc gactacagcg cccgcggttg 2490301 ggcgatgaac agcttgttcg acgggatcgg gccgctgctg gccgacctgc gcaccgccgg 2490361 tgtccggctg gccgtcgcca cctccaaggc agagccgacc gcacggcgaa tcctgcgcca 2490421 cttcggaatt gagcagcact tcgaggtcat cgcgggcgcg agcaccgatg gctcgcgagg 2490481 cagcaaggtc gacgtgctgg cccacgcgct cgcgcagctg cggccgctac ccgagcggtt 2490541 ggtgatggtc ggcgaccgca gccacgacgt cgacggggcg gccgcgcacg gcatcgacac 2490601 ggtggtggtc ggctggggct acgggcgcgc cgactttatc gacaagacct ccaccaccgt 2490661 cgtgacgcat gccgccacga ttgacgagct gagggaggcg ctaggtgtct gatccgctgc 2490721 acgtcacatt cgtttgtacg ggcaacatct gccggtcgcc aatggccgag aagatgttcg 2490781 cccaacagct tcgccaccgt ggcctgggtg acgcggtgcg agtgaccagt gcgggcaccg 2490841 ggaactggca tgtaggcagt tgcgccgacg agcgggcggc cggggtgttg cgagcccacg 2490901 gctaccctac cgaccaccgg gccgcacaag tcggcaccga acacctggcg gcagacctgt 2490961 tggtggcctt ggaccgcaac cacgctcggc tgttgcggca gctcggcgtc gaagccgccc 2491021 gggtacggat gctgcggtca ttcgacccac gctcgggaac ccatgcgctc gatgtcgagg 2491081 atccctacta tggcgatcac tccgacttcg aggaggtctt cgccgtcatc gaatccgccc 2491141 tgcccggcct gcacgactgg gtcgacgaac gtctcgcgcg gaacggaccg agttgatgcc 2491201 ccgcctagcg ttcctgctgc ggcccggctg gctggcgttg gccctggtcg tggtcgcgtt 2491261 cacctacctg tgctttacgg tgctcgcgcc gtggcagctg ggcaagaatg ccaaaacgtc 2491321 acgagagaac cagcagatca ggtattccct cgacaccccg ccggttccgc tgaaaaccct 2491381 tctaccacag caggattcgt cggcgccgga cgcgcagtgg cgccgggtga cggcaaccgg 2491441 acagtacctt ccggacgtgc aggtgctggc ccgactgcgc gtggtggagg gggaccaggc 2491501 gtttgaggtg ttggccccat tcgtggtcga cggcggacca accgtcctgg tcgaccgtgg 2491561 atacgtgcgg ccccaggtgg gctcgcacgt accaccgatc ccccgcctgc cggtgcagac 2491621 ggtgaccatc accgcgcggc tgcgtgactc cgaaccgagc gtggcgggca aagacccatt 2491681 cgtcagagac ggcttccagc aggtgtattc gatcaatacc ggacaggtcg ccgcgctgac 2491741 cggagtccag ctggctgggt cctatctgca gttgatcgaa gaccaacccg gcgggctcgg 2491801 cgtgctcggc gttccgcatc tagatcccgg gccgttcctg tcctatggca tccaatggat 2491861 ctcgttcggc attctggcac cgatcggctt gggctatttc gcctacgccg agatccgggc 2491921 gcgccgccgg gaaaaagcgg ggtcgccacc accggacaag ccaatgacgg tcgagcagaa 2491981 actcgctgac cgctacggcc gccggcggta aaccaacatc acggccaata ccgcagcccc 2492041 cgcctggacc acccgcgaca gcaccacggc gcggcgcaga tcggccacct tgggcgaccg 2492101 gccgtcgccc aaggtgggcc ggatctgcaa ctcatggtgg taccgggtgg gcccacccag 2492161 ccgcacgtca agcgccccag caaacgccgc ctcgacgaca ccggcgttgg ggctgggatg 2492221 gcgggcggcg tcgcgccgcc aggcccgtac cgcaccgcgg ggcgacccac cgaccaccgg 2492281 cgcgcagatc accaccagca ccgccgtcgc ccgtgcgcca acatagttgg cccagtcatc 2492341 caatcgtgct gcagcccaac cgaatcggag ataacgcggc gagcggtagc cgatcatcga 2492401 gtccagggtg ttgatggcac gatatcccag caccgcaggc acgccgctcg aagccgccca 2492461 cagcagcggc accacctggg cgtcggcggt gttttcggcc accgactcca gcgcggcacg 2492521 cgtcaggccc gggccgccca gctgggccgg gtcacgcccg cacagcgacg gcagcagccg 2492581 tcgcgccgcc tcgacatcgt cgcgctccaa caggtccgat atctggcggc cggtgcgcgc 2492641 cagcgaagtt ccgcccagcg ctgcccaggt ggccgtcgcg gtggccgcca cgggccagca 2492701 cctgccgggt agccgctgca gtgccgcgcc gagcaagccc accgcgccga ccagcaggcc 2492761 gacgtgtacc gcaccggcga cccggccgtc acggtaggtg atctgctcca gcttggcggc 2492821 cgcccgaccg aacagggcca ccggatgacc tcgtttgggg tcgccgaaca cgacgtcgag 2492881 caggcagccg atcagcacgc cgacggccct ggtctgccag atcgatgcaa acactccggc 2492941 agcgtcgcac acgtggtcta cgctcagcta tttatgacct catacggcag ctatccacga 2493001 tgaagcggcc agctacccgg gttgccgacc tgttgaaccc ggcggcaatg ttgttgccgg 2493061 cagcgaatgt catcatgcag ctggcagtgc cgggtgtcgg gtatggcgtg ctggaaagcc 2493121 cggtggacag cggcaacgtc tacaagcatc cgttcaagcg ggcccggacc accggcacct 2493181 acctggcggt ggcgaccatc ggaacggaat ccgaccgagc gctgatccgg ggtgccgtgg 2493241 acgtcgcgca ccggcaggtt cggtcgacgg cctcgagccc ggtgtcctat aacgccttcg 2493301 acccgaagtt gcagctgtgg gtggcggcgt gtctgtaccg ctacttcgtg gaccagcacg 2493361 agtttctgta cggcccactc gaagatgcca ccgccgacgc cgtctaccaa gacgccaaac 2493421 ggttagggac cacgctgcag gtgccggagg ggatgtggcc gccggaccgg gtcgcgttcg 2493481 acgagtactg gaagcgctcg cttgatgggc tgcagatcga cgcgccggtg cgcgagcatc 2493541 ttcgcggggt ggcctcggta gcgtttctcc cgtggccgtt gcgcgcggtg gccgggccgt 2493601 tcaacctgtt tgcgacgacg ggattcttgg caccggagtt ccgcgcgatg atgcagctgg 2493661 agtggtcaca ggcccagcag cgtcgcttcg agtggttact ttccgtgcta cggttagccg 2493721 accggctgat tccgcatcgg gcctggatct tcgtttacca gctttacttg tgggacatgc 2493781 ggtttcgcgc ccgacacggc cgccgaatcg tctgatagag cccggccgag tgtgagcctg 2493841 acagcccgac accggcggcg tgtgtcgcgt cgccaggttc acgctcggcg atctagagcc 2493901 gccgaaaacc tacttctggg ttgcctcccg aatcaacgtg ctgatctgct cgagcagctc 2493961 acgcatatcg gcgcgcatcg catccaccgc ggcatacagg tcggccttgg tcgccggcag 2494021 ctggtccgac gtcattggcc gcaccggcgg tgctgtctgt cgcgccgcgc tgtcgctttg 2494081 aaacccaggt cgctcaccca cgaccacgac actgccatat ccggcgcccc gccgacaacg 2494141 aagcacagct agccggtggg cgcggacggg atcgaaccgc cgaccgctgg tgtgtaaaac 2494201 cagagctcta ccgctgagct acgcgcccat gaccgccgca ggctacacgc cttgcggcca 2494261 agcacccaaa accttaggcc gtaagcgccg ccagagcgtc ggtccacagc cgctgatcgc 2494321 gaacttcacc cggctgcttc atctcggcga accgaatgat ccctgaccga tcgaccacaa 2494381 aggtgccccg gttagcgatg ccggcctgct cgttgaagac gccgtaggcc tgactgaccg 2494441 cgccgtgtgg ccagaagtcc gacaacagcg gaaacgtgaa tccgctctgc gtcgcccaga 2494501 tcttgtgagt gggtggcggg cccaccgaaa tcgctagcgc ggcgctgtcg tcgttctcaa 2494561 actcgggcag gtgatcacgc aactggtcca gctcgccctg gcagatgccc gtgaacgcca 2494621 acggaaagaa caccaacagc acgttctttg caccccggta gccgcgcagg gtgacaagct 2494681 gctgattctg gtcgcgcaac gtgaagtcag gggcggtggc tccgacgttc agcatcagcg 2494741 cttgccagcc cgcgatttcg gctgtaccaa tctgctggcg ctccagttgc ccagattgac 2494801 cgacgaggtc ggcatcagcc cagctgtggg cgccgcctcg gcaatctcgg cgggcaatac 2494861 atggccgggc tggccggtct tgggcgtcac cacccaaatc acaccgtcct cggcgagcgg 2494921 gccgatcgca tccatcaggg tgtccaccaa atcgccgtcg ccatcacgcc accacaacag 2494981 gacgacatcg atgacctcgt cggtgtcttc atcgagcaac tctcccccgc acgcttcttc 2495041 gatggccgcg cggatgtcgt cgtcggtgtc ttcgtcccag ccccattcct ggataagttg 2495101 gtctcgttgg atgcccaatt tgcgggcgta gttcgaggcg tgatccgccg cgaccaccgt 2495161 ggaacctcct tcagtctccg cgggccatgt gcacaccgtc gcgatgggca ttatcgtcgc 2495221 acagccagaa ccggtccacc cgcccgcctc agaaggcggc cacgcacatt ttcaatgcct 2495281 ttgtcttggt gtcgttgagc cgatcaaccc gccggttgaa ttccgctgtc gacgcgtgcg 2495341 caccgatggc atttgccacc gcgcgggccg cgtcgacata tgcgttgagc gcatccccca 2495401 gttgcgcgga cagcgcggcg ctcagactgc ctgagaccgt cgaggcactg ttgttgagcg 2495461 cgtcgatggc cggaccttcg gtcggcccgg tgttgcggcc ctgattgaac gcggccacgt 2495521 aggcgttcac cttgtcgatg gcgtccttgc tggtggccgc cagcgcgtca cacgaggtgc 2495581 gaatcgcctt ggtcgtcagc gattgttggc gctgcgactc ccggatgctc gacgtcgccg 2495641 ccgaagccga caccgacgcg gacaccgacg agcggtaggc cggtgcgacg ttggtgtcgg 2495701 gcatggccgt accgtcggtg acagtggtac atccgacgat ccccatcagc agcagcgcga 2495761 tgcagccgag cgccagggcg cctcgcctgg ggagctcccc cccgtgcctg cgaggcacgg 2495821 cgcgccatcc gatgagcacg gcatgtgagg ttacctggtc gcagcgcgac cgcgctggcc 2495881 gtggtgtgtc gcgcatccgc agaaccgagc ggagtgcggc tatccgccgc cgacgccggt 2495941 gcggcacgat agggggacga ccatctaaac agcacgcaag cggaagcccg ccacctacag 2496001 gagtagtgcg ttgaccaccg atttcgcccg ccacgatctg gcccaaaact caaacagcgc 2496061 aagcgaaccc gaccgagttc gggtgatccg cgagggtgtg gcgtcgtatt tgcccgacat 2496121 tgatcccgag gagacctcgg agtggctgga gtcctttgac acgctgctgc aacgctgcgg 2496181 cccgtcgcgg gcccgctacc tgatgttgcg gctgctagag cgggccggcg agcagcgggt 2496241 ggccatcccg gcattgacgt ctaccgacta tgtcaacacc atcccgaccg agctggagcc 2496301 gtggttcccc ggcgacgaag acgtcgaacg tcgttatcga gcgtggatca gatggaatgc 2496361 ggccatcatg gtgcaccgtg cgcaacgacc gggtgtgggc gtgggtggcc atatctcgac 2496421 ctacgcgtcg tccgcggcgc tctatgaggt cggtttcaac cacttcttcc gcggcaagtc 2496481 gcacccgggc ggcggcgatc aggtgttcat ccagggccac gcttccccgg gaatctacgc 2496541 gcgcgccttc ctcgaagggc ggttgaccgc cgagcaactc gacggattcc gccaggaaca 2496601 cagccatgtc ggcggcgggt tgccgtccta tccgcacccg cggctcatgc ccgacttctg 2496661 ggaattcccc accgtgtcga tgggtttggg cccgctcaac gccatctacc aggcacggtt 2496721 caaccactat ctgcatgacc gcggtatcaa agacacctcc gatcaacacg tgtggtgttt 2496781 tttgggcgac ggcgagatgg acgaacccga gagccgtggg ctggcccacg tcggcgcgct 2496841 ggaaggcttg gacaacttga ccttcgtgat caactgcaat ctgcagcgac tcgacggccc 2496901 ggtgcgcggc aacggcaaga tcatccagga gctggagtcg ttcttccgcg gtgccggctg 2496961 gaacgtcatc aaggtggtgt ggggccgcga atgggatgcc ctgctgcacg ccgaccgcga 2497021 ccgtgcgctg gtgaatttaa tgaatacaac acccgatggc gattaccaga cctataaggc 2497081 caacgacggc ggctacgtgc gtgaccactt cttcggccgc gacccacgca ccaaggcgct 2497141 ggtggagaac atgagcgacc aggatatctg gaacctcaaa cggggcggcc acgattaccg 2497201 caaggtttac gccgcctacc gcgccgccgt cgaccacaag ggacagccga cggtgatcct 2497261 ggccaagacc atcaaaggct acgcgctggg caagcatttc gaaggacgca atgccaccca 2497321 ccagatgaaa aaactgaccc tggaagacct taaggagttt cgtgacacgc agcggattcc 2497381 ggtcagcgac gcccagcttg aagagaatcc gtacctgccg ccctactacc accccggcct 2497441 caacgccccg gagattcgtt acatgctcga ccggcgccgg gccctcgggg gctttgttcc 2497501 cgagcgcagg accaagtcca aagcgctgac cctgccgggt cgcgacatct acgcgccgct 2497561 gaaaaagggc tctgggcacc aggaggtggc caccaccatg gcgacggtgc gcacgttcaa 2497621 agaagtgttg cgcgacaagc agatcgggcc gcggatagtc ccgatcattc ccgacgaggc 2497681 ccgcaccttc gggatggact cctggttccc gtcgctaaag atctataacc gcaatggcca 2497741 gctgtatacc gcggttgacg ccgacctgat gctggcctac aaggagagcg aagtcgggca 2497801 gatcctgcac gagggcatca acgaagccgg gtcggtgggc tcgttcatcg cggccggcac 2497861 ctcgtatgcg acgcacaacg aaccgatgat ccccatttac atcttctact cgatgttcgg 2497921 cttccagcgc accggcgata gcttctgggc cgcggccgac cagatggctc gagggttcgt 2497981 gctcggggcc accgccgggc gcaccaccct gaccggtgag ggcctgcaac acgccgacgg 2498041 tcactcgttg ctgctggccg ccaccaaccc ggcggtggtt gcctacgacc cggccttcgc 2498101 ctacgaaatc gcctacatcg tggaaagcgg actggccagg atgtgcgggg agaacccgga 2498161 gaacatcttc ttctacatca ccgtctacaa cgagccgtac gtgcagccgc cggagccgga 2498221 gaacttcgat cccgagggcg tgctgcgggg tatctaccgc tatcacgcgg ccaccgagca 2498281 acgcaccaac aaggcgcaga tcctggcctc cggggtagcg atgcccgcgg cgctgcgggc 2498341 agcacagatg ctggccgccg agtgggatgt cgccgccgac gtgtggtcgg tgaccagttg 2498401 gggcgagcta aaccgcgacg gggtggccat cgagaccgag aagctccgcc accccgatcg 2498461 gccggcgggc gtgccctacg tgacgagagc gctggagaat gctcggggcc cggtgatcgc 2498521 ggtgtcggac tggatgcgcg cggtccccga gcagatccga ccgtgggtgc cgggcacata 2498581 cctcacgttg ggcaccgacg ggttcggctt ttccgacact cggcccgccg ctcgccgcta 2498641 cttcaacacc gacgccgaat cccaggtggt cgcggttttg gaggcgttgg cgggcgacgg 2498701 cgagatcgac ccatcggtgc cggtcgcggc cgcccgccag taccggatcg acgacgtggc 2498761 ggctgcgccc gagcagacca cggatcccgg tcccggggcc taacgccggc gagccgaccg 2498821 cctttggccg aatcttccag aaatctggcg tagcttttag gagtgaacga caatcagttg 2498881 gctccagttg cccgcccgag gtcgccgctc gaactgctgg acactgtgcc cgattcgctg 2498941 ctgcggcggt tgaagcagta ctcgggccgg ctggccaccg aggcagtttc ggccatgcaa 2499001 gaacggttgc cgttcttcgc cgacctagaa gcgtcccagc gcgccagcgt ggcgctggtg 2499061 gtgcagacgg ccgtggtcaa cttcgtcgaa tggatgcacg acccgcacag tgacgtcggc 2499121 tataccgcgc aggcattcga gctggtgccc caggatctga cgcgacggat cgcgctgcgc 2499181 cagaccgtgg acatggttcg ggtcaccatg gagttcttcg aagaagtcgt gcccctgctc 2499241 gcccgttccg aagagcagtt gaccgccctc acggtgggca ttttgaaata cagccgcgac 2499301 ctggcattca ccgccgccac ggcctacgcc gatgcggccg aggcacgagg cacctgggac 2499361 agccggatgg aggccagcgt ggtggacgcg gtggtacgcg gcgacaccgg tcccgagctg 2499421 ctgtcccggg cggccgcgct gaattgggac accaccgcgc cggcgaccgt actggtggga 2499481 actccggcgc ccggtccaaa tggctccaac agcgacggcg acagcgagcg ggccagccag 2499541 gatgtccgcg acaccgcggc tcgccacggc cgcgctgcgc tgaccgacgt gcacggcacc 2499601 tggctggtgg cgatcgtctc cggccagctg tcgccaaccg agaagttcct caaagacctg 2499661 ctggcagcat tcgccgacgc cccggtggtc atcggcccca cggcgcccat gctgaccgcg 2499721 gcgcaccgca gcgctagcga ggcgatctcc gggatgaacg ccgtcgccgg ctggcgcgga 2499781 gcgccgcggc ccgtgctggc tagggaactt ttgcccgaac gcgccctgat gggcgacgcc 2499841 tcggcgatcg tggccctgca taccgacgtg atgcggcccc tagccgatgc cggaccgacg 2499901 ctcatcgaga cgctagacgc atatctggat tgtggcggcg cgattgaagc ttgtgccaga 2499961 aagttgttcg ttcatccaaa cacagtgcgg taccggctca agcggatcac cgacttcacc 2500021 gggcgcgatc ccacccagcc acgcgatgcc tatgtccttc gggtggcggc caccgtgggt 2500081 caactcaact atccgacgcc gcactgaagc atcgacagca atgccctgtc atagattccc 2500141 tcgccggtca gagggggtcc agcaggggcc ccggaaagat accaggggcg ccgtcggacg 2500201 gaaagtgatc cagacaacag gtcgcgggac gatctcaaaa acatagctta caggcccgtt 2500261 ttgttggtta tatacaaaaa cctaagacga ggttcataat ctgttacacc gcgcaaaacc 2500321 gtcttcacag tgttctctta gacacgtgat tgcgttgctc gcacccggac agggttcgca 2500381 aaccgaggga atgttgtcgc cgtggcttca gctgcccggc gcagcggacc agatcgcggc 2500441 gtggtcgaaa gccgctgatc tagatcttgc ccggctgggc accaccgcct cgaccgagga 2500501 gatcaccgac accgcggtcg cccagccatt gatcgtcgcc gcgactctgc tggcccacca 2500561 ggaactggcg cgccgatgcg tgctcgccgg caaggacgtc atcgtggccg gccactccgt 2500621 cggcgaaatc gcggcctacg caatcgccgg tgtgatagcc gccgacgacg ccgtcgcgct 2500681 ggccgccacc cgcggcgccg agatggccaa ggcctgcgcc accgagccga ccggcatgtc 2500741 tgcggtgctc ggcggcgacg agaccgaggt gctgagtcgc ctcgagcagc tcgacttggt 2500801 cccggcaaac cgcaacgccg ccggccagat cgtcgctgcc ggccggctga ccgcgttgga 2500861 gaagctcgcc gaagacccgc cggccaaggc gcgggtgcgt gcactgggtg tcgccggagc 2500921 gttccacacc gagttcatgg cgcccgcact tgacggcttt gcggcggccg cggccaacat 2500981 cgcaaccgcc gaccccaccg ccacgctgct gtccaaccgc gacgggaagc cggtgacatc 2501041 cgcggccgcg gcgatggaca ccctggtctc ccagctcacc caaccggtgc gatgggacct 2501101 gtgcaccgcg acgctgcgcg aacacacagt cacggcgatc gtggagttcc cccccgcggg 2501161 cacgcttagc ggtatcgcca aacgcgaact tcggggggtt ccggcacgcg ccgtcaagtc 2501221 acccgcagac ctggacgagc tggcaaacct ataaccgcgg actcggccag aacaaccaca 2501281 tacccgtcag ttcgatttgt acacaacata ttacgaaggg aagcatgctg tgcctgtcac 2501341 tcaggaagaa atcattgccg gtatcgccga gatcatcgaa gaggtaaccg gtatcgagcc 2501401 gtccgagatc accccggaga agtcgttcgt cgacgacctg gacatcgact cgctgtcgat 2501461 ggtcgagatc gccgtgcaga ccgaggacaa gtacggcgtc aagatccccg acgaggacct 2501521 cgccggtctg cgtaccgtcg gtgacgttgt cgcctacatc cagaagctcg aggaagaaaa 2501581 cccggaggcg gctcaggcgt tgcgcgcgaa gattgagtcg gagaaccccg atgccgttgc 2501641 caacgttcag gcgaggcttg aggccgagtc caagtgagtc agccttccac tgctaatggc 2501701 ggtttcccca gcgttgtggt gaccgccgtc acagcgacga cgtcgatctc gccggacatc 2501761 gagagcacgt ggaagggtct gttggccggc gagagcggca tccacgcact cgaagacgag 2501821 ttcgtcacca agtgggatct agcggtcaag atcggcggtc acctcaagga tccggtcgac 2501881 agccacatgg gccgactcga catgcgacgc atgtcgtacg tccagcggat gggcaagttg 2501941 ctgggcggac agctatggga gtccgccggc agcccggagg tcgatccaga ccggttcgcc 2502001 gttgttgtcg gcaccggtct aggtggagcc gagaggattg tcgagagcta cgacctgatg 2502061 aatgcgggcg gcccccggaa ggtgtccccg ctggccgttc agatgatcat gcccaacggt 2502121 gccgcggcgg tgatcggtct gcagcttggg gcccgcgccg gggtgatgac cccggtgtcg 2502181 gcctgttcgt cgggctcgga agcgatcgcc cacgcgtggc gtcagatcgt gatgggcgac 2502241 gccgacgtcg ccgtctgcgg cggtgtcgaa ggacccatcg aggcgctgcc catcgcggcg 2502301 ttctccatga tgcgggccat gtcgacccgc aacgacgagc ctgagcgggc ctcccggccg 2502361 ttcgacaagg accgcgacgg ctttgtgttc ggcgaggccg gtgcgctgat gctcatcgag 2502421 acggaggagc acgccaaagc ccgtggcgcc aagccgttgg cccgattgct gggtgccggt 2502481 atcacctcgg acgcctttca tatggtggcg cccgcggccg atggtgttcg tgccggtagg 2502541 gcgatgactc gctcgctgga gctggccggg ttgtcgccgg cggacatcga ccacgtcaac 2502601 gcgcacggca cggcgacgcc tatcggcgac gccgcggagg ccaacgccat ccgcgtcgcc 2502661 ggttgtgatc aggccgcggt gtacgcgccg aagtctgcgc tgggccactc gatcggcgcg 2502721 gtcggtgcgc tcgagtcggt gctcacggtg ctgacgctgc gcgacggcgt catcccgccg 2502781 accctgaact acgagacacc cgatcccgag atcgaccttg acgtcgtcgc cggcgaaccg 2502841 cgctatggcg attaccgcta cgcagtcaac aactcgttcg ggttcggcgg ccacaatgtg 2502901 gcgcttgcct tcgggcgtta ctgaagcacg acatcgcggg tcgcgaggcc cgaggtgggg 2502961 gtccccccgc ttgcgggggc gagtcggacc gatatggaag gaacgttcgc aagaccaatg 2503021 acggagctgg ttaccgggaa agcctttccc tacgtagtcg tcaccggcat cgccatgacg 2503081 accgcgctcg cgaccgacgc ggagactacg tggaagttgt tgctggaccg ccaaagcggg 2503141 atccgtacgc tcgatgaccc attcgtcgag gagttcgacc tgccagttcg catcggcgga 2503201 catctgcttg aggaattcga ccaccagctg acgcggatcg aactgcgccg gatgggatac 2503261 ctgcagcgga tgtccaccgt gctgagccgg cgcctgtggg aaaatgccgg ctcacccgag 2503321 gtggacacca atcgattgat ggtgtccatc ggcaccggcc tgggttcggc cgaggaactg 2503381 gtcttcagtt acgacgatat gcgcgctcgc ggaatgaagg cggtctcgcc gctgaccgtg 2503441 cagaagtaca tgcccaacgg ggccgccgcg gcggtcgggt tggaacggca cgccaaggcc 2503501 ggggtgatga cgccggtatc ggcgtgcgca tccggcgccg aggccatcgc ccgtgcgtgg 2503561 cagcagattg tgctgggaga ggccgatgcc gccatctgcg gcggcgtgga gaccaggatc 2503621 gaagcggtgc ccatcgccgg gttcgctcag atgcgcatcg tgatgtccac caacaacgac 2503681 gaccccgccg gtgcatgccg cccattcgac agggaccgcg acggctttgt gttcggcgag 2503741 ggcggcgccc ttctgttgat cgagaccgag gagcacgcca aggcacgtgg cgccaacatc 2503801 ctggcccgga tcatgggcgc cagcatcacc tccgatggct tccacatggt ggccccggac 2503861 cccaacgggg aacgcgccgg gcatgcgatt acgcgggcga ttcagctggc gggcctcgcc 2503921 cccggcgaca tcgaccacgt caatgcgcac gccaccggca cccaggtcgg cgacctggcc 2503981 gaaggcaggg ccatcaacaa cgccttgggc ggcaaccgac cggcggtgta cgcccccaag 2504041 tctgccctcg gccactcggt gggcgcggtc ggcgcggtcg aatcgatctt gacggtgctc 2504101 gcgttgcgcg atcaggtgat cccgccgaca ctgaatctgg taaacctcga tcccgagatc 2504161 gatttggacg tggtggcggg tgaaccgcga ccgggcaatt accggtatgc gatcaataac 2504221 tcgttcggat tcggcggcca caacgtggca atcgccttcg gacggtacta aaccccagcg 2504281 ttacgcgaca ggagacctgc gatgacaatc atggcccccg aggcggttgg cgagtcgctc 2504341 gacccccgcg atccgctgtt gcggctgagc aacttcttcg acgacggcag cgtggaattg 2504401 ctgcacgagc gtgaccgctc cggagtgctg gccgcggcgg gcaccgtcaa cggtgtgcgc 2504461 accatcgcgt tctgcaccga cggcaccgtg atgggcggcg ccatgggcgt cgaggggtgc 2504521 acgcacatcg tcaacgccta cgacactgcc atcgaagacc agagtcccat cgtgggcatc 2504581 tggcattcgg gtggtgcccg gctggctgaa ggtgtgcggg cgctgcacgc ggtaggccag 2504641 gtgttcgaag ccatgatccg cgcgtccggc tacatcccgc agatctcggt ggtcgtcggt 2504701 ttcgccgccg gcggcgccgc ctacggaccg gcgttgaccg acgtcgtcgt catggcgccg 2504761 gaaagccggg tgttcgtcac cgggcccgac gtggtgcgca gcgtcaccgg cgaggacgtc 2504821 gacatggcct cgctcggtgg gccggagacc caccacaaga agtccggggt gtgccacatc 2504881 gtcgccgacg acgaactcga cgcctacgac cgtgggcgcc ggttggtcgg attgttctgc 2504941 cagcaggggc atttcgatcg cagcaaggcc gaggccggtg acaccgacat ccacgcgctg 2505001 ctgccggaat cctcgcgacg tgcctacgac gtgcgtccga tcgtgacggc gatcctcgat 2505061 gcggacacac cgttcgacga gttccaggcc aattgggcgc cgtcgatggt ggtcgggctg 2505121 ggtcggctgt cgggtcgcac ggtgggtgta ctggccaaca acccgctacg cctgggcggc 2505181 tgcctgaact ccgaaagcgc agagaaggca gcgcgtttcg tgcggctgtg cgacgcgttc 2505241 gggattccgc tggtggtggt ggtcgatgtg ccgggctatc tgcccggtgt cgaccaggag 2505301 tggggtggcg tggtgcgccg tggcgccaag ttgctgcacg cgttcggcga gtgcaccgtt 2505361 ccgcgggtca cgctggtcac ccgaaagacc tacggcgggg catacattgc gatgaactcc 2505421 cggtcgttga acgcgaccaa ggtgttcgcc tggccggacg ccgaggtcgc ggtgatgggc 2505481 gctaaggcgg ccgtcggcat cctgcacaag aagaagttgg ccgccgctcc ggagcacgaa 2505541 cgcgaagcgc tgcacgacca gttggccgcc gagcatgagc gcatcgccgg cggggtcgac 2505601 agtgcgctgg acatcggtgt ggtcgacgag aagatcgacc cggcgcatac tcgcagcaag 2505661 ctcaccgagg cgctggcgca ggctccggca cggcgcggcc gccacaagaa catcccgctg 2505721 tagttctgac cgcgagccgc tcctcgcatg ctcgaacggt gcctaccgac gcgctaacaa 2505781 ttctcgagaa ggccggcggg ttcgccacca ccgcgcaatt gctcacggtc atgacccgcc 2505841 aacagctcga cgtccaagtg aaaaacggcg gcctcgttcg cgtttggtac ggggtctacg 2505901 cggcacaaga gccggacctg ttgggccgct tggcggctct cgatgtgttc atgggggggc 2505961 acgccgtcgc gtgtctgggc accgccgccg cgttgtatgg attcgacacg gaaaacaccg 2506021 tcgctatcca tatgctcgat cccggagtaa ggatgcggcc cacggtcggt ctgatggtcc 2506081 accaacgcgt cggtgcccgg ctccaacggg tgtcaggtcg tctcgcgacc gcgcccgcat 2506141 ggactgccgt ggaggtcgca cgacagttgc gccgcccgcg ggcgctggcc accctcgaag 2506201 ccgcactacg gtcaatgcgc tgcgctcgca gtgaaattga aaacgccgtt gctgagcagc 2506261 gaggccgccg aggcatcgtc gcggcgcgcg aactcttacc cttcgccgac ggacgcgcgg 2506321 aatcggccat ggagagcgag gctcggctcg tcatgatcga ccacgggctg ccgttgcccg 2506381 aacttcaata cccgatacac ggccacggtg gtgaaatgtg gcgagtcgac ttcgcctggc 2506441 ccgacatgcg tctcgcggcc gaatacgaaa gcatcgagtg gcacgcggga ccggcggaga 2506501 tgctgcgcga caagacacgc tgggccaagc tccaagagct cgggtggacg attgtcccga 2506561 ttgtcgtcga cgatgtcaga cgcgaacccg gccgcctggc ggcccgcatc gcccgccacc 2506621 tcgaccgcgc gcgtatggcc ggctgaccgc tggtgagcag acgcagagtc gcactgcgcc 2506681 ggccggcgca gtgcgactct gcgtctgctc gcgctcaacg gctgaggaac tccttagcca 2506741 cggcgactac gcgctcgcga tcccgtggca ccagaccgat ccgggtccgg cggtcgagga 2506801 tatcgtccac atccagcgcc ccctcatggg tcaccgcgta ttcgaactcc gcccgggtca 2506861 cgtcgatgcc gtcggcgacc ggctcggtgg gccgctcaca tgtggcggcg gcagcgacgt 2506921 tggccgcctc ggccccgtac cgcgccacca gcgactcggg caatccggcg cccgatccgg 2506981 gggccggccc agggttcgcc ggtgcgccga tcagcggcag gttgcgagtg cggcacttcg 2507041 cggctcgcag gtgtcgcagc gtgatggcgc gattcagcac atcctctgcc atgtagcggt 2507101 attccgtcag cttgccgccg accacactga tcacgcccga cggcgattca aaaacagcgt 2507161 ggtcacgcga aacgtcggcg gtgcggccct ggacaccagc accgccggtg tcgattagcg 2507221 gccgcaatcc cgcataggca ccgatgacat ccttggtgcc gaccgccgtc cccaatgcgg 2507281 tgttcaccgt atccagcagg aacgtgatct cttccgaaga cggttgtggc acatcgggaa 2507341 tcgggccggg tgcgtcttcg tcggtcagcc cgagatagat ccggcccagc tgctcgggca 2507401 tggcgaacac gaagcggttc agctcaccgg ggatcggaat ggtcagcgcg gcagtcggat 2507461 tggcaaacga cttcgcgtcg aagaccagat gtgtgccgcg gctggggcgt agcctcaggg 2507521 acgggtcgat ctcacccgcc cacacgcccg ccgcgttgat gacggcacgc gccgacagcg 2507581 cgaacgactg ccgggtgcgc cggtcggtca actccaccga agtgccggtg acattcgacg 2507641 cgcccacgta agtgaggatg cgggcgccgt gctgggccgc ggtgcgcgcg acggccatga 2507701 ccagccgggc gtcgtcgatc aattgcccgt cgtacgcgag cagaccaccg tcgaggccgt 2507761 cccgccgaac ggtgggagca atctccacca cccgtgacgc cgggattcgg cgcgatcggg 2507821 gcaacgtcgc cgccggcgta cccgctagca cccgcaaagc gtcgccggcc aggaaaccgg 2507881 cacgcaccaa cgcccgcttg gtgtgaccca tcgacggcaa caacgggacc agttgcggca 2507941 tggcatgcac gagatgagga gcgttgcgtg tcatcaggat tccgcgttcg acggcgctgc 2508001 gccgggcgat gcccacgttg ccgctggcca gatagcgcag accgccgtgc accaacttcg 2508061 agctccagcg gctggtgccg aacgccagat catgcttttc caccaaggcc accgtcagac 2508121 cgcgggtggc agcatctaag gcaatgccaa caccggtaat gccgccgcct atcacgatga 2508181 cgtcgagtgc gccaccgtcg gccagtgcgg tcaggtcggc ggagcgacgc gccgcgttga 2508241 gtgcagccga gtggggcatc agcacaaata tccgttcagt gcgtgggtaa gttcggtggc 2508301 cagcgcggcg gaatcgagga tcgaatcgac gatgtccgcg gactggatgg tcgactgggc 2508361 gatcagcaac accatggtcg ccagtcgacg agcgtcgccg gagcgcacac tgcccgaccg 2508421 ctgcgccact gtcagccggg cggccaaccc ctcgatcagg acctgctggc tggtgccgag 2508481 gcgctcggtg atgtacaccc tggccagctc cgagtgcatg accgacatga tcagatcgtc 2508541 accccgcaac cggtcggcca ccgcgacaat ctgctttacc aacgcttccc ggtcgtcccc 2508601 gtcgaggggc acctcccgca gcacggcggc gatatggctg gtcagcatgg acgccatgat 2508661 cgaccgggtg tccggccagc gacggtatac ggtcgggcgg ctcacgcccg cgcgccgggc 2508721 gatctcggca agtgtcaccc ggtccacgcc gtaatcgacg acgcagctcg ccgctgcccg 2508781 caggatacga ccaccggtat ccgcgcggtc attactcatt gacagcatgt gtaatactgt 2508841 aacgcgtgac tcaccgcgag gaactccttc caccgatgaa atgggacgcg tggggagatc 2508901 ccgccgcggc caagccactt tctgatggcg tccggtcgtt gctgaagcag gttgtgggcc 2508961 tagcggactc ggagcagccc gaactcgacc ccgcgcaggt gcagctgcgc ccgtccgccc 2509021 tgtcgggggc agaccacgat gcgctggcgc gcatcgtcgg caccgagtat ttccgcaccg 2509081 ccgatcgcga ccggctgctg cacgccggcg gcaagtccac cccagacctg ctgcggcgca 2509141 aagacaccgg tgtccaggat gcgcccgacg cggtgttgct gcccggcggc cccaacgggg 2509201 aggacgccgt cgccgacatc ttgcactact gctccgacca cggcattgcc gtggtcccgt 2509261 ttggtggcgg caccagcgtc gttggtgggc ttgaccccgt tcgcaacgac tttcgcgcgg 2509321 tgatctccct ggatatgcgg cgcttcgacc ggctgcaccg gatcgatgag gtgtccggcg 2509381 aggccgaact ggaggccggt gtcaccgggc cggaagccga acgtctgctc ggcgaacatg 2509441 gcttctcgct cgggcacttc ccgcagagct tcgagttcgc caccatcggg gggttcgcgg 2509501 ccacccgctc gtcaggccag gactcggctg gctatggccg gttcaacgac atgattcttg 2509561 ggctgcgcat gatcactccg gtgggggtgc tggatctggg tcgagtgccg gcgtcggcgg 2509621 ccggcccgga cctgcgccag ctggcgatcg gctccgaagg cgtcttcggc gtcatcaccc 2509681 gggtgcggct gcgggtgcac cggattccgg aatcgacgcg ttacgaggcg tggtcgtttc 2509741 ccgatttcgc gaccggggtt gcggcgctgc gcaccatcac ccaaaccggc accggcccca 2509801 ccgtcgttcg gctctctgac gaggccgaaa ccggcgtcaa cctcgccacc accgaggcga 2509861 tcggggaaac ccaaatcacc ggcggctgtt tggggatcac cgtgttcgag ggcacccagg 2509921 aacacaccga gagcaggcac gccgagacgc gcgcgttgct ggcggcccga ggcggcacct 2509981 cgttgggcga aggaccggcg cgggcctggg aacgcggcag gttcgccgcg ccgtatctgc 2510041 gtgactccct gttggccgcg ggagcgctct gcgagaccct cgagaccgcc acggtgtggt 2510101 ccaacacccc cgtgctgaag gccgccgtga ccgaagcgct caccacctcg ctggccgcat 2510161 cgggtacacc ggcgctggtg atgtgccacg tgtcgcacgt gtatcccacc ggcgcgtcgt 2510221 tgtacttcac cgttgtcgcc gggcagcgag gcgatccgat cgagcagtgg ctggccgcca 2510281 agaaggcggc gtcggatgcg atcatggcca ccggaggaac gatcacgcac caccatgcgg 2510341 ttggttccga ccaccgcccc tggatgcgcg cggaggtggg tgatctgggc gtgacattgt 2510401 tgcgcacgat caaggcgacg ctggatccgg ccggaattct caaccccggc aagctgattc 2510461 catgagcgcc gggcagctgc gccggcatga gatcggcaag gtcaccgcgc tgaccaatcc 2510521 cctgtcaggc catggcgccg ccgtaaaggc tgcacacggc gcgatcgccc ggctgaagca 2510581 tcggggggtg gacgtcgtcg agatcgtcgg cggggacgcc cacgacgcac gccatctgct 2510641 cgccgcggca gtcgcaaaag gcactgacgc ggtgatggtg accggcggtg acggagtcgt 2510701 ctccaacgcg ctacaggtct tggcgggcac cgacattccg ttaggaatca ttccggccgg 2510761 cactggtaac gaccacgcac gcgaattcgg gcttcccaca aagaatccca aggcagccgc 2510821 agatatcgtt gttgacggct ggacggaaac cattgacctg ggccggattc aagacgacaa 2510881 cggtatcgaa aagtggttcg gtaccgtggc ggctaccgga ttcgactccc tggtcaacga 2510941 tcgcgccaac cgaatgcgct ggccacacgg gcggatgcgc tattacatcg cgatgctcgc 2511001 cgaactgtcg cggctgcggc cgttgccgtt ccggctggtg ctcgacggca ccgaagagat 2511061 cgtcgccgac ctcacacttg ccgacttcgg caatacccgc agctacggcg gcggattatt 2511121 gatctgcccc aacgccgacc actcggacgg cctgctcgac atcaccatgg cccagtcgga 2511181 ttcccgtacc aagttgctcc gcctgttccc caccattttc aaaggcgccc atgtcgagct 2511241 tgacgaggtg agcaccacac gagccaagac agtccacgtc gagtgccccg gtatcaacgt 2511301 ctatgccgac ggcgacttcg cctgcccgtt accagccgag atctccgcgg tgccggccgc 2511361 ccttcaggtt cttcgccccc gccacggata agcgggtggt aacgactcgg tcgtaaagcg 2511421 cgacatcctt ccaaacccgc tgtacgggag gaacagatgt ccggacaccg caagaaggca 2511481 atgctcgcct tggcggctgc gtcgctggca gcgacgctgg ccccgaacgc agtcgcggcc 2511541 gcagaaccgt cgtggaacgg gcagtacctc gtgacgttgt ctgccaacgc gaaaaccggc 2511601 accagcatgg cggccaaccg gccagagtat ccacacaaag cgaactacac gttcagctcg 2511661 cgctgcgcgt ccgatgtctg cattgccacc gtggtcgacg ctccgccacc aaaaaacgag 2511721 ttcatcccgc ggccaatcga atacacctgg aatgggactc aatgggtacg ggagatcagc 2511781 tggcaatggg actgcctgct acccgacggc acaatcgaat atgccccagc caaatcgatc 2511841 acggcctaca cgcccggtca gtacggaatc ctcaccggcg tctttcatac cgatatcgcc 2511901 agcggcacgt gtaaaggcaa tgtcgacatg ccagtgtcgg ccaaaccgat cgttggctga 2511961 cgttgccagc cctgccgagc atgggcggca catcacgcaa acgcatggac gaccagcaca 2512021 gccccgaatg cggcgataac ggcgttgccg gcaaggactg tcatccgacg gacgcgggcg 2512081 gtcgcccggg acctgagaaa cgctcccgcc gagacaagca gcaactgcca gagcaacgac 2512141 gcgagcgcta ccccgaccac aacagcgatc gcggtcgttg cgcgcaacgc gcgcgccagc 2512201 gtcacggcta ctgcggtgaa gtacacgaac gtggccgggt tgatcaccgt taggccgaag 2512261 atcaacgcaa accgaacaca gcccagctgt ttttgtgggg ccggaaccgg ctccggcgat 2512321 ggacgcaacc cgtgcccgat tcccatcgca gcgatgacca gcagcacgat cgcaccgacg 2512381 atttcgggcc aaaccctcaa cacgttgatc gtcggtgccg caactgtctc caaatcgcgg 2512441 tagcgcaata cgctacgtcg acaagggcga ccgccgcggc ggccggtatt ccacgacgcc 2512501 agccgcgctc aacacctgct tgccgaggaa agcgttcccg gtggcggcat cgaggcgttt 2512561 gtcgatgtgc gacttcgacc ggcggaggac agcgacgact ttctggcatg gtcgagcacg 2512621 gacaccacga tcgacgatgc cgtccacgtc accggaccct acgactacct gctacacatt 2512681 cgggtctgcg acacagcgga cctggaccgc ctgttacgca ggctcaagac ctccgcggaa 2512741 gctgcgcaaa cccaaacgcg cattgcgctc aagtcccggc gttgacaccg cgccagcagg 2512801 cgccaccaaa cccttagcca actccccgac tcagccaagt cacctcgccg gcgtcgccgc 2512861 cgtcacgata cacctcgagc gcctggtccc aggccgttcc cagcaccgaa tccagttcgg 2512921 cggccagtgt gtccgcacct tgggccatca tcgcccgcag ccgcatctcc cccaccatga 2512981 tgtcgccgtt ggcgctcatc gccccgctcc acagacccag ttgcggggtg tgactgaatc 2513041 gctggccgtc gactccaggg ctagggtctt cggtgacctc gaaccgcagc accgaccagg 2513101 aacgcaaggc gttggctagt cgcgccccgg tgcccaccgg cccgacccaa ttggtgaccg 2513161 cacgcaactg cggcggcagg gccggttgcg gcgtccagac caggttcgcc ttggcctgta 2513221 gggtcgacga caacgcccac tcgacatgcg ggcacaccgc cgcgggcgag ccgtggatgt 2513281 acaccacacc ggacgtcacg tcggcgaatt ggttcgacgc tcgcatctgc tgctccttcg 2513341 gttccacgag ggacgtcttc cccaacgacc tggtgaaccc gacaagcagg atgcctgctg 2513401 tgaaatttcg aatttttgtg tcgtgcgttt ctattgtgcc ttgtgatacc cgtgttgcgc 2513461 tagtgtgcgg ttctgcctag gtgtactcgg ctagaaccgc gtcggaaatc gcgggccaca 2513521 agtccaacgc ccagtcgccg aaatcgcggg ccgtgaggac caccagagcc aggtccgcct 2513581 tgggatccac ccagatgaaa ccgcctgatt ggccgaaatg gccgaatgtc cgcgtcgagt 2513641 tgcactcgcc ggtccagtgg ggcgatttcg aattcctgat ctcaaagccc agcccccagt 2513701 cattgggccg ctgcacaccg tacccgggca gtacaccgtc caggccggga aactgcaccg 2513761 tggtcgcgtc ggcatgcatc tgcgccgaga ccgtcgatgg ccgcagcaga tcacccgcga 2513821 acaccgccaa gtccgcgacc gtcgaggtcg ccccgaaccc ggcggcagcg gggcccccgt 2513881 ccagccgggt ggtcaccatg cccaggggtt cgcacaccgc ctcggtcagg tagcgcccga 2513941 actcgatccc cgactcccgc tgcacgctct cggccagcac ggtgaaaccg tagttcgaat 2514001 acatccggcg ggtgccgggg cgggccagcg cctgatcgga atgcatcgcc aaccccgatg 2514061 tgtgcgccag caggtgacgg accgtggagc cgggcgggcc tgccggggtg tcgagattca 2514121 ccaccccctc ctcaacggcg acctgtgcgg ctcgggccac cagcggcttg gtgaccgacg 2514181 ccagcgcgaa cacccgcgcg gtatcgccgt gggtggctag cacccctgcg ggtccgatca 2514241 ccgcggcggc cgcagccggg accggccagc caccaagcac ttcgagagcg gtcatcgact 2514301 ccggcgcgtc acttccgggc gatgtagtag ttgttcaaca cgtccgactc gatctcggcc 2514361 accgtcacgt cggtaaaacc ggcgtcggcg agcatcgagg tggccaactg cctgccccac 2514421 accgtcccca acccggcccc gtcaagcgcc agcgacaccg tcatgcagtg cattagcgag 2514481 gtcgtgtaca ggtaggtgct cagcggaacg ccgacattgt cttccagttg actcgatgcc 2514541 ttgatgtcga ccatcagcag cacaccaccg ggtcgcagcg cacgatagat gttctgcagg 2514601 acgcgcgccg gctgcgcctg gtcgtgaatc gcgtcgaaca cggtgatcac gtcgtaggcc 2514661 cccaccttgt ccagctctgc caggtcatgg cgctcgaagg tcgcgtttgc caggcccaac 2514721 cgagccgcct cctcggtccc cgccgcaacg gcctcgtcgg aaaagtcgat gccggtgaat 2514781 cggctcgcgc cgaacgcctg cgccatcagc ttgaccgcgc gaccactgcc gcaaccgaaa 2514841 tcggccacgt cggctccgga ccgcaagcgg tccggaaggc cgtcgaccag cgggagcacc 2514901 acgtcgatca aggcggcatc gaacaccatg ccgctcatct cggccatcag cttgtggaag 2514961 cgcgggtatt cgctgtaggg cacaccgccg ccttcccgga agcagcgaat gaccttttgt 2515021 tcgacctcgc cgagcagcga aacgaactgt gctatcacgg cgaggttgtc cggcccggcc 2515081 gcacgggtca gcatgccggc gcggtgggca ggcagcgagt aggtcgagct ccccgcgtcg 2515141 tattcgacga tctgcccggt ggtcatgccg cctagccact cccgaacgta gcgctcttcc 2515201 aaccccgcag cctcggcgat ctccatgctg gtggctggcg gaagtccggc catggtgtcc 2515261 agcagcccgg tctggtgtcc aacgctcacc aggatcgcca aaccggcgct gtcgatggcc 2515321 gcaacaaaac ggttgccgaa ttcttcggtg gtctcgagtg ctccgctcat ctgcgccgct 2515381 cctcctcatc gcttcgctct gcatcgtcac cggcgcgact catctgcgcc gctcctgctc 2515441 atcgcttcgc tctgcatcgt caccggcgcg actcatctgc gccgctcctg ctcatcgctt 2515501 cgctctgcat cgtcaccggc gcgactcatc tgcgccgctc ctcctcatcg cttcgctctg 2515561 catcgtcacc ggcgcgcatg gtcagcgacg ctacaccgta ggttggacac catgagtcag 2515621 acggtgcgcg gtgtgatcgc acgacaaaag ggcgaacccg ttgagctggt gaacattgtc 2515681 gtcccggatc ccggacccgg cgaggccgtg gtcgacgtca ccgcctgcgg ggtatgccat 2515741 accgacctga cctaccgcga gggcggcatc aacgacgaat acccttttct gctcggacac 2515801 gaggccgcgg gcatcatcga ggccgtcggg ccgggtgtaa ccgcagtcga gcccggcgac 2515861 ttcgtgatcc tgaactggcg tgccgtgtgc ggccagtgcc gggcctgcaa acgcggacgg 2515921 ccccgctact gcttcgacac ctttaacgcc gaacagaaga tgacgctgac cgacggcacc 2515981 gagctcactg cggcgttggg catcggggcc tttgccgata agacgctggt gcactctggc 2516041 cagtgcacga aggtcgatcc ggctgccgat cccgcggtgg ccggcctgct gggttgcggg 2516101 gtcatggccg gcctgggcgc cgcgatcaac accggcgggg taacccgcga cgacaccgtc 2516161 gcggtgatcg gctgcggcgg cgttggcgat gccgcgatcg ccggtgccgc gctggtcggc 2516221 gccaaacgga tcatcgcggt cgacaccgat gacacgaagc ttgactgggc ccgcaccttc 2516281 ggcgccaccc acaccgtcaa cgcccgcgaa gtcgacgtcg tccaggccat cggcggcctc 2516341 acggatggat tcggcgcgga cgtggtgatc gacgccgtcg gccgaccgga aacctaccag 2516401 caggccttct acgcccgcga tctcgccgga accgttgtgc tggtgggtgt tccgacgccc 2516461 gacatgcgcc tggacatgcc gctggtcgac ttcttctctc acggcggtgc gctgaagtcg 2516521 tcgtggtacg gcgattgcct gcccgaaagc gacttcccca cgctgatcga cctttacctg 2516581 cagggccggc tgccgctgca gcggttcgtt tccgaacgca tcgggctcga agacgtcgag 2516641 gaggcgttcc acaagatgca tggcggcaag gtattgcgtt cggtggtgat gttgtgatgg 2516701 ccgccatcga gcgcgtcatc acccacggca ccttcgaact cgatggcggc agttgggaag 2516761 tcgacaacaa catctggctg gtcggcgacg actccgaggt ggtggttttc gacgccgccc 2516821 accacgcggc tcctatcatc gacgccgtcg gcggccgcaa ggtggttgcg gtgatctgca 2516881 cgcacggcca caacgaccac gtgacggtgg cccccgaact gggcacggcg cttgacgcac 2516941 cggtgctgat gcatcccggc gacgccgtgc tgtggcgaat gactcacccg gacaaaagct 2517001 ttcgcgccgt ttcagacggt gatgcggtgc gggttggcgg gacggagttg cgtgcgctgc 2517061 acaccccggg gcactcccct ggatcggtgt gctggtatgc gccagagctg ggtcccggaa 2517121 caggcaccgt gttcagcgga gacacgctgt tcgctggcgg gccgggtgca accggccgct 2517181 cgtattccga cttccccacg atcctgcggt cgatatccgg acggctcggc gcattaccgg 2517241 gcgacaccgt cgtgcacacc ggccacggcg acagcaccac catcggcgac gagatcgtcc 2517301 actacgagga atgggtggcc cgtgggcatt gatcccgcgg gcgcgcgcag aatgccggtc 2517361 gtagcggcgt gtcggtgtac aagcaccgcg cggtccatga gccgagcgct acttatccgc 2517421 gcaatctgac actcgagcca agctgcggcg cagaaacacc gcaaagccgg cacccatgac 2517481 cacaaatgcc gtcactggca cccagtcacc caaccgaagg tagagcgtga cattcgatgc 2517541 caacggaacg ttcaccacga tggcaccgtt gaattccgcc gagcaccagg ccagccgacg 2517601 gccccgggta tcaaaggccg agctgtcgcc cgacaagctg gcgtgcaccg ctgggatgcc 2517661 ggcttcgacg gcgcgcaccg cgggctgggc ggccaactgc ggctgcgccc aactcccttg 2517721 gaacgtcgag gtggaactct gatacaccag cagcgccgcc ccgagccgcg cggcgtgccg 2517781 ggtcagatcg gagaaggtca tctcgtagct gatcaacggg gcgatatgca aggagttcac 2517841 cgccaacacc accggcccgg cgccgcgctg ccgatccttt gcggcggcct tgctgtagcg 2517901 ggtgatccag ccgaaaagcg ggcgcagcgg cacatattcg ccaaacggaa ccaaccgggt 2517961 cttccggtag ctgcccacag cttcgtgcgc gccgacaagc accgccgact tgtagattcc 2518021 cccgtccggt gccggggcgt cgacgttgac caacaaatcc gcgcccaccc gctgtgacag 2518081 ctcggccagg cgagccagga cgtcaggatg gcgggtgagg tcttgtccga cgctgctttc 2518141 cccccagacc accaagtccg gccgctggtc cgcaacggcc gcggtgaact cttcaccggc 2518201 cgccagtcga gccgccgcat cggctatgtc gccggcctgt accagcgcca cgcgcaccgt 2518261 cggaccgccg accggcaccg agcccagcag gtaggaagcc gggccgagtc ccgcacaccc 2518321 aatcacgcat cccagcgcga ccagccggcc gcccgttgcc cggcacacga gcacgctcgc 2518381 gatggcggta ttggtcgcaa ccagaagaaa acttgtcagc cacaccccac ccagcgacgc 2518441 cgacgctagc gtcacgggct ggctccattg cgatgcaccc agcaacgccc acggaccgcc 2518501 cagcgattgc caggaccgca ccgcttcggc tgccacccac gcgctgggca ccacgaccag 2518561 ggcggcaccg acgcggcatg tggtcaccgg taccgacaac agccggtgcg ccaaccaccc 2518621 ggccggcagc cacagcacac ccaggccggc ggccaacagc accagcatcg gaccagcact 2518681 ggtcaccagc cagtactggg ttgccagcac aaatccgccc atacccgtcc acgcccgcag 2518741 cgcgccctcc cacgacgtcg gcgcggcccg caccactaac agcagtggga ccaagccgaa 2518801 ccaggccagc caccaccaag acggcgcggg aaaggccagc gcgggtaacc cgccgaacac 2518861 caacgctgcc gcacaaccaa tgaccggttg tcgccgggct cccgcgcgca acgccatgcc 2518921 gatcagcatg ccggccacat tcgcctgcgt cgaggaaaag agcagactaa gaccggcagt 2518981 ccccgccaga aagggagtga tttgcatggc caaggatctg gtcgccacgg tgcccgatct 2519041 ttccgggaag ctggcaatca tcaccggcgc caacagcggt ctaggcttcg ggctggcccg 2519101 gcggctgtcg gcggctggcg ccgacgtaat catggcgatc cgcaatcgcg ccaagggcga 2519161 ggcggcggtc gaggaaatcc ggaccgcggt tccggatgcg aagctgacca tcaaggccct 2519221 cgacctgtca tcgttggcgt ccgtcgccgc gttgggggaa cagctcatgg ctgacgggcg 2519281 gccgatcgac ctgctgatca acaacgccgg cgtcatgacc ccaccggaac gcgttaccac 2519341 tgccgacggc ttcgaattgc agttcggcag caaccatctc ggacacttcg cgctaaccgc 2519401 acacctgctg ccgctgttgc gcgcggcaca gcgcgcgagg gtcgtctcgt tgagcagctt 2519461 ggcggcccgc cgcggccgca tccacttcga cgacctacag ttcgagaggt cgtacgcccc 2519521 gatgacggcc tatggccagt cgaagctggc ggtcttgatg ttcgcccgcg agctggaccg 2519581 ccgcagccgc gcggccggct ggggcatcat ctccaatgcc gcgcatcctg gcttgaccaa 2519641 gaccaacctg cagatcgcgg gaccgtccca tggccgcgac aagccggcgc tgatggaacg 2519701 cttgtacaag acgtcctggc gtttcgcacc gttcctctgg caggagatcg aagaggggat 2519761 cttgcccgcg ctgtatgcag ccgccacccc gcaagccgac ggtggcgcgt tctatggccc 2519821 ccgcggccgc tacgaggtcg ccggcggtgg tgtgcgagag gccaaggttc ccgcagccgc 2519881 ccgcaacgac gccgatagca agcgactttg ggaggtctcc gagcagctca ccggtgtcag 2519941 ctacccgaaa tcgcgctgaa ctgcccgatc ccgggaacct gaggtattcc gggggggagc 2520001 tgcggaatct ccggaatcgg tgggatcggc gggatcggtg gagggctggg ggacgtggtc 2520061 gccggcggct gcgtggtcgc cggcggctgc gtggtcgccg gcggcgcgga agcgggggtc 2520121 gtcggtgccg gagtgatgac atcggtggtc accgccggtt gcgtattcgt cgttgtcggc 2520181 ggaggcggca acggctgctg cagcggcggc gcgggcccac cggtggctgg cgcctgtacg 2520241 ggaggtgcgg gctcggtagt ggggccatcg gatgccggcg ctggggacgg gggcggtgcg 2520301 gcggtcgtgg tcacacccgg cctctgcggg gtccccggcg ccgtcggctg gtcgccggtg 2520361 gacaacccga tcgccacggc ggcacccacc agcaacaccg ccaccgtcgt gccggtgatg 2520421 atcacggccg gcaggcgata ccacgggatt ggcggggact tgggctcggg ctccgcatgg 2520481 gcatcgtggt cgaagctcag cgacgggcgg gccgctgtgt agccaggggc cggcccgatg 2520541 tgggagtcct cgtcggcctc cgaccaggcc aaagccggct gcaggaccga cgccggcgca 2520601 tcggccggcg ccgtcgccgt cgccgaggtg accgcggtca gcaccgttgc gctggtgtcg 2520661 ccgggtctgc gtgccgccca caacgcgccg ccgaaagcgg ccgtcaattg cggacgaggc 2520721 gtcctgacca ccggcacgca gaaacgtccg gacagcgtcg tggtgactgc cgggatattt 2520781 gcaccaccac ccaccgaaac gatcgctacc agctcggccg tgcgaattcc gctgcgggcc 2520841 agggtttgtt ccaaggccct gcccacgctg tccagcgagt cacggattgt gtcctcgagc 2520901 tcgttgcggg tcaaccggat atccccgccc aacgcgtcgg tcagcgtggt caccgtgctt 2520961 gacgaaagcc gttccttggc tttgcgacat tcgatccgca gcttagtcag tgagccgatc 2521021 gccgaggtgc cggctggatc gaacgcgccc gtgcccggta gttcggacat gacgtagctc 2521081 aacagcgact gatcgatcag atcgccggag aaagcctgat ggcgcaccgt cgcggccacc 2521141 ggccgatact cgtctgcggc gtcgacgagc gtgatgccgg tcccgctgcc accgaagtcg 2521201 cataccgcga cgatcccacg ggccggtatg cccgggtcgg cccgtatcgc gtacagcgct 2521261 gccgcggcgt cagggagcag tgacagtggc tgggccgtac tcgaagtccc gtgcgaccat 2521321 tccgaggccc gacgcagcgc gctatccaac gctgctaccg cagccggccc ccagtgggcg 2521381 ggataggtca ccgtgacact tccgggaaga gcacgaccgc cggtagcggt gtaggccagc 2521441 gccagcagtg cgtcagccac tagcgcctcg ctgcggtaca ccgagccgtc ggcagccacg 2521501 atgccgaccg aatctcccac ccggtctacg aagtcggtga tcaccaggcc tggctcgtcc 2521561 agcctcgggt tctccgatgg cacaccgacc tcgggcgggc gctgtcgata cagcgtcagc 2521621 acgggtttac gtgtgatgga gtgatcggca gccacagccg ctaggttggt gacaccgatc 2521681 gacaagccta atgccggtct cgcccctgtt gccatatggc ccaatccccg tgtccggcgg 2521741 ctcgtcgcaa ccgcctacct cgaattttcc gtcataccta tagccaatgt gggcgccggt 2521801 gatctggata gcgacattgc cgcaacgccc ggttggtcag caaatggtgc ccatgctggc 2521861 gaccaacggg acctccggcg cggtaaggca gccgggctcc agtaatccca gcggctaggc 2521921 caaggcctcg atgtcgtcgg tggcgacgat gcctaccggc ttggagccgt gttgagaaat 2521981 gagttcggcc gtcggcagca acctccccac tcagcaatcc cagcttcacc ctaaacctgg 2522041 cgttcgtacg ccacctagca tctggtgggt gcgaacggtg atgtcgcgtt gagccgcatc 2522101 ggcgccaccc gtccggcatt gagcgcgtgg cgattcgtca cagtgttcgg ggtggtcggc 2522161 ctgctcgccg acgtcgtgta tgaaggggcc cgttcgatca ccggcccgct gctggcttcg 2522221 ttgggagcga ccggactggt ggtcggagtc gtcaccggcg tcggtgaggc cgccgccttg 2522281 ggcttgcggc tggtgtcggg gccattggcc gatcgaagcc gacggttttg ggcctggacc 2522341 atcgccggct acaccctgac ggtggtaacg gttccgctgc tcggcatcgc gggcgccctg 2522401 tgggtggcgt gcgcgttggt catcgccgag cgagtcggga aagctgtgcg cggccccgcc 2522461 aaagacaccc tgctgtcgca cgcggccagt gtgaccggcc gaggccgcgg tttcgccgtg 2522521 cacgaggcgc tggaccaggt cggtgcgatg atcggccctc tcaccgttgc cgggatgctc 2522581 gcgatcaccg ggaatgccta tgcgcccgcg ctcggcgtgc tgaccctgcc cggcggtgcc 2522641 gcccttgctc tgttgctgtg gctgcagcgt cgggtgcccc gcccggagtc ctacgaggac 2522701 tgtccggttg tcctcggtaa tccttcggcg ccgcgaccct gggcgctgcc ggcgcagttc 2522761 tggctgtact gcgggttcac cgcgatcacc atgctggggt ttggcacgtt cgggttgctg 2522821 tcgtttcaca tggtcagcca cggcgtgctg gccgccgcca tggtcccggt ggtctatgcg 2522881 gccgcaatgg ccgcagatgc gctgacggcc ttggcctcag gcttcagcta tgacagatat 2522941 ggcgcgaaaa cccttgccgt tctgccgatt ctgtcgattc tggtagtgct attcgccttc 2523001 acggacaacg tcacaatggt ggtcattggc acgttggtgt ggggcgcagc ggtcggaata 2523061 caagagtcca cgctgcgcgg cgtggtggcc gacctggtcg ccagcccacg gcgggccagc 2523121 gcctacggcg tgttcgccgc agggctgggc gctgcgaccg ccgggggcgg cgccctcatc 2523181 ggctggctgt acgacatctc catcggcacg ctcgttgtgg tggtgatcgc acttgaactg 2523241 atggccctgg tgatgatgtt cgcgatccga ctaccccgcg tagcaccgag ctaaagaagc 2523301 gatcaggcgg cccaacggaa cagcaggttg gtatgcgaca acatgcttga ccggcacgcc 2523361 aacaagcacg actgccaccg atccaggtaa gtggcggcca aggacggtca accggtctag 2523421 gctcgccagt attacccctt caagggcgaa gggggcagga ggatctcgat gggcctcaac 2523481 acggcgatcg cgactcgggt gaatggcacg ccgccgccgg aggtgccgat cgccgatatt 2523541 gaactgggtt ccctggattt ctgggcactc gatgacgacg ttcgcgatgg cgccttcgcc 2523601 accttgcgcc gcgaggcgcc gatctcgttc tggcccacga tcgagctgcc cgggtttgtc 2523661 gcgggcaatg ggcattgggc gctcaccaag tacgacgatg tcttctacgc cagccgtcat 2523721 ccggacattt tcagttcgta ccccaacatc acgatcaacg accagacacc agagttagcc 2523781 gaatacttcg gctcgatgat cgtgctcgac gatccgcgcc atcagcggct gcgctcgatt 2523841 gtcagccgag ccttcacccc gaaggtggta gcccgcatcg aagcagccgt gcgtgaccgg 2523901 gcccatcggt tggtctcatc gatgatcgcc aataatcccg accggcaggc cgatctggtc 2523961 agcgaactcg caggtccact gccgctgcag attatctgtg acatgatggg gattcccaag 2524021 gcggaccatc agcgcatttt tcactggacc aacgtcattc tcggcttcgg cgatcccgat 2524081 ctggccaccg atttcgacga gttcatgcag gtttcggcgg acatcggcgc ctacgccacc 2524141 gcgctggccg aagaccgccg ggtcaaccac cacgacgatc tgaccagcag cctggtcgaa 2524201 gccgaggtcg acggcgagcg gctgtcgtcg agggagatcg cgtcgttctt catcctgctg 2524261 gtggtggccg gcaacgagac gacgcgcaac gcgatcactc acggcgtgct ggcactgtcc 2524321 cgctatcccg agcaacggga caggtggtgg tctgacttcg acggcctggc gcccaccgcg 2524381 gtcgaggaga tcgtgcggtg ggcctccccg gtggtctaca tgcgccgcac cctgacccaa 2524441 gacattgagt tgcgcggcac caagatggcc gccggtgaca aggtctccct gtggtattgc 2524501 tcggccaacc gggacgagtc aaagttcgcc gatccctgga cattcgacct agcacgcaac 2524561 cccaatccgc atctcggttt cggtggcggt ggcgcccatt tctgcctggg cgccaaccta 2524621 gcgcgtcggg agatcagggt cgcgttcgac gaactacgca ggcagatgcc cgacgtcgtc 2524681 gcgaccgagg agcccgcacg gctgttgtcg cagttcattc acggaatcaa gacgctgcca 2524741 gttacgtggt cctgaaaggc cgaacgtggc tcggcgggta tatggtgcgc cattcccggt 2524801 ggctgtggga tttgcactac acaggaagcg ttgtcgccca cccactggcg gaccggtagg 2524861 caccgatcgg tgccggcctg ttttgggtag cggatcaagc gcacaaacga ctcgcggtgg 2524921 ccgaacagga tgatgttggc gggacgcccc gtctggcatg accgctgccg acgcgttcga 2524981 gtgcggtcga gagccaaagg cggcttgatc agccgccaac cgcaggccga agacgtgccg 2525041 gctcaggtgt gtgacgatcg tagccgtagc ggtcgatgat ctcgccccag tgctcatcga 2525101 caatcgcacg ctgctcgacg gtcagttgat agctgttggt tttgtagtcc gcatggtcag 2525161 ctaggtattg ccgcagacgc ggcaggtaac actcgaagtc gcccagtccc aggtgctggt 2525221 atagccggcg cagctgtccc tcgggatcac cgatcaaatc ctcataacgc aattcgtaaa 2525281 agcgtgtggg gtcaacgagt tctcggcctt cgtccaactt tcggtatagg tcgacgtagg 2525341 tcgacacgac cttgtcgtcc aacccgtcga acgtcggttg ttgcaagcca tgtatgcggt 2525401 acagcgcctt atgaagatgg atggttgatg gatagaccac atagggatct cggacgatgt 2525461 ggatgaactt cgcttgcggg aatacctcca gcagcacctt gattcgaaaa ctatgcgttg 2525521 gattcttgag gatcaccgtc ttgcgacggc ggaagtacac ctgctgaacg aaccggaaca 2525581 gggtccgttt ccagatttct agttctcgcg gtgccacctg ctctagatcc aggtactcct 2525641 catactgggg cggccggttc gggaatgcga tggtcagata cggcgacggc aggccctgca 2525701 tacaccacac gaactcgtct tcctgcgggt gatgcaagct caaatccatg ttgtccattg 2525761 cccgatgctt cgataccagg aattccacat atggcgcaaa ccactcggtc agtagaaaat 2525821 ggtgtggcgc aaggcattcg tagccggtgg gaccggtgtg gcgatcatcg acgaccaaca 2525881 gttcatgcag caaggtggtg ccggtacgcc aatgcccaac aatgaagatt ggcggatcgg 2525941 cgatcaccgt ttcggccact cgcctaccga aaacgatctt ctgccacaac cccagacagg 2526001 aattgaccat gctgagaaac gtatagagga ccgcgaagtg ccagcggctg tgatgcacgg 2526061 cgaagcggtt acggatcaaa agccgcatcc aggccgagaa gttgcagccg acccacagcg 2526121 gtgcggccca ctcgcgccac cgggaaagtc gagacgacga acggagagcc ttcatggtgc 2526181 gacgcggggg gtaacggcga cccgtaaccg ggtcaagccg cgaaggttgg cgtttgtcgt 2526241 ccacgtcggc ggctcgacca cctctattcg gtcgatattg gcgacgatct cgcgcaagat 2526301 cgcctgaccc tccatgcgcg ccagctgggt ccccggacac aggtggatgc cggagccgaa 2526361 cgcgagatgc ccgaccgggt tgcggtcggc gcgaaagaca tccgggtctt cgtactggcg 2526421 cgggtcacgg ttggctgcac cccatgccag cagcaccagt gagcctgccg ggatgaccgc 2526481 ttgaccgacc gaatagtcga cgcgcgttgt gcggcagatg ttttggattg gcgatataaa 2526541 gcggaggtgc tcctcgatcg ccgacgggat caggtctggt tgctgcgcaa ggagtgtcag 2526601 ctgatctgga tagtcggcca gcgtcagaaa caatgtgcta atcatatgag cagtgctctc 2526661 atagcccgca accagcagca acaccgcgaa gaagaacaat tcgtcatcgc tgagtcgacc 2526721 ttgctcggca tgggtggcaa gcttcccgag aacagtgcat tccctaagca gcccgttgtc 2526781 acgccgatga gtgaagagtg cacgcaatcg ccggaatccg gcaaagccct gcacaagcga 2526841 aatcaacccg gaggctgaca aggcaacgtc ggtgatccgt accgcctggt tggacaaacg 2526901 gcagaaggcg gcctcgtccg gtccatctac gccgagcaca ctggtgatag cgcgcatcgg 2526961 catcggtgcg gccacggtgg agacgacgtc cgcgggcgtc tgggtcagta acccgccgac 2527021 cagttctcgg gcaagctggt cgaccatcgg gcgccacgtc tccaacgcgc cacgcgccat 2527081 acctggtgcc agttgcttgc gcatccgggt gtgcgccggc ggatcggacg tcggcagaaa 2527141 cggcagccac ccccgtgaga aggtgacccc acgggcgctg gacaacgtgt cgtggttacg 2527201 cgcagcctcg cggacgtcgg cgtatcggct caaaatgtag acgtcgcgct tggggttgta 2527261 ctgcacccgc tcgccggcca acagctctcg ataatgcggg taaggatcag cggcaatcgc 2527321 gggatcgaac gggtcaaagt cggtgagctg cataaatttc cggcaatgcc ggccggtcaa 2527381 cctggaccga gccttcccgg cgaccctcag cgcaagtgct ttcgcgaccg cgggcccgta 2527441 ggttcgcaca gtttgcgcgt cgcgccacat gctggtggct accgggatgc caccagatga 2527501 cgcgcgccgg cgcgtgggaa cgcccagagc cgtggtcgcg tcctgcgcgg tcagaccaac 2527561 gtcgggcgtg cccgctaacg ggcacccggc cagccgcact cggtccggcg cgggctcggg 2527621 aggggactgt gtcgcggtca tgaccctccg aactcagaga ggcgtagaac agtcacaggg 2527681 taacggcggg catcgcaata attgcgcagt ttcgcaaagc gtttcgcaac gcaataagat 2527741 ggttacccgg agttcggaca ggcgaatctg cccagcgcaa ggctggtgat agcgccgacc 2527801 aacggcgccg tgatcggtaa ccgtttccga ccggccgata ccggcccggc caccatagcg 2527861 gaggtcaacc ccacctgttg gcggaacgcc caaaactggg ccgactgtgt aggcatgcgt 2527921 cgcacttgat tggtcgccga cccggcaatt cgctagccgc gctaagggtc gcgcatcgtt 2527981 ggccacaaca ggcgcgactt gcgcgaatgt gctttctcgc cggcatcgcg atgcctaact 2528041 ttatgttttc gaggagactg cgatgcggct tccaggccgt catgtgttat acgccctgtc 2528101 ggcggtcacc atgctggcgg cctgctccag caacggtgct cgtggcggca ttgcgtcgac 2528161 gaacatgaat ccgacaaacc cacccgcaac tgcggagacc gctaccgtct caccgacacc 2528221 ggctccgcag agcgcgcgaa ccgagacctg gattaacctt caagtcggcg actgcctggc 2528281 cgacctgccg ccggcggatc tgagccggat aaccgtcacg attgtcgatt gcgcgacagc 2528341 gcattcggcc gaggtatacc tgcgtgctcc ggtggccgtc gatgccgccg tcgtttccat 2528401 ggccaatcgt gattgtgctg ccggatttgc gccctacaca ggccaatccg tcgacaccag 2528461 cccatactcg gtggcgtatc tcatcgactc gcatcaggat agaaccgggg ccgatctcac 2528521 cccgagcacc gtcatctgtt tgctgcagcc cgccaacggt cagttgctca ccgggtcggc 2528581 ccgtcgctga ccggacgacc cgttgttcgg gtgcgtggca cacgacacca accggtatcg 2528641 tctgttgccg tgacttctcc gattgctccg aataccaaaa gcgacggttc tcgctgatga 2528701 ctaccccacc cgacaaggcg cggcgccggt ttcttcgcga cgcctacaag aacgctgagc 2528761 gcgtcgcacg aaccgctttg ctcacaatcg accaggacca gcttgagcag ctgctcgact 2528821 acgtcgacga gagactcggc gaacagcctt gtgaccacac cgcccggcat gcgcaacgat 2528881 gggcccaatc acaccgcatc gaatgggaga cgctggccga gggcctacaa gagtttggtg 2528941 gctactgcga ttgtgagatc gtaatgaatg tcgaacctga ggcgatcttc ggctagtcct 2529001 ctgccggcga tgttctcata acgacatggc aagccacgcg cttgactaaa ctcagccgac 2529061 gtcaaaccgc ctgtccccga tatgccctgc gaggttgcct cgtggctgat gactcaaacg 2529121 acaccgcgac cgatgtcgaa cccgactacc ggttcaccct tgccaacgag cggaccttcc 2529181 tggcctggca gcgcaccgct ctaggcctgc tggccgcggc ggtcgccctg gtgcagctcg 2529241 tcccggaact gacgatcccc ggcgcacgcc aggtgctcgg tgtggtgctc gcgattttgg 2529301 caatcctcac cagcggaatg ggtctgctgc gctggcagca ggcggatcgc gccatgcgcc 2529361 ggcacctgcc attgccccgt caccccacac cgggctacct cgcggtgggg ctgtgcgtgg 2529421 tcggggtcgt cgcgctcgca ttggtggtag ccaaggcgat caccgggtga accgtcactc 2529481 gacggcagcg agcgatcgcg ggctgcaggc cgaacggacg acgctggcct ggacccggac 2529541 ggcctttgcg ttgctggtca acggcgtgtt gctgacgctc aaggacacgc aaggcgccga 2529601 cgggccggct gggctgatcc cggccggcct agctggtgct gcggcctcgt gctgctatgt 2529661 gatcgctcta caacgccaac gagcactttc gcaccgcccg ctaccggcac gaatcactcc 2529721 ccgcggccag gtccacatcc tcgcgacagc ggtgctggtg cttatggtcg tcaccgcctt 2529781 tgctcaactg ctctagcgcg gcgaacagac gcaaaagccc ccgcacgcac ggagtgtcgg 2529841 gggcttttgc gtctactcgc caaatgcgat cgtggccgat ggcggcgcgg accttcctgt 2529901 aaattgccgg aattcacgat tttgtgcggc tagaccaacg ccgggagcca gcgtgcctgc 2529961 gaggatagga gcgcctcggc cgatccgccg gcgcagccgt tcggtcacaa cggatctgac 2530021 ctgctcagcc tgcaagtcaa ccacaagacc ggtccaggct gatacgcaaa acatgtgagt 2530081 gtacccgccg ccacagcggc agcagctgga tccccctttt ggtggacacg agatccaccc 2530141 aataggctgg gccgatcggg cgatagacat tgtcagttcg tgccggcacc ctgatcactg 2530201 acctcaacac cgagcgtcga ccccgtccct atggtccaag gaaaacaatg tcatacgtgg 2530261 ctgccgaacc aggcgtgctg atctcgccga cggacgactt gcagagcccc cggtcagccc 2530321 cggcagcgca tgacgaaaat gcggacggca taacaggcgg gaccagagac gactctgctc 2530381 ccaactcacg gtttcagcta ggcaggcgca ttccggaagc caccgcccag gaagggtttc 2530441 tggttcggcc attcacccaa caatgtcaga tcatccacac cgaaggagat catgctgtta 2530501 tcggggtatc cccggggaac agttacttct cccgccagcg cctacgggat ctcgggcttt 2530561 ggggtctcac gaattttgat cgtgtggact tcgtctacac cgatgtccat gtcgccgaga 2530621 gttacgaagc gctaggcgat tccgcaatcg aagcccggcg caaggcggtc aaaaacatcc 2530681 gcggcgtccg cgccaagatc accaccacgg tgaacgaact cgatccggcc ggggcccggc 2530741 tgtgcgttcg tccgatgtcg gagttccagt ccaacgaggc ataccgggag ctgcatgcgg 2530801 acctgctcac gcgcctgaaa gacgacgagg acttgcgcgc cgtctgccag gacctagtgc 2530861 ggcgcttcct gtccacgaaa gtgggtccgc ggcagggggc gacggctact caagagcagg 2530921 tgtgcatgga ctacatttgc gccgaggccc cgctattcct cgacacacct gcgattctcg 2530981 gagtgccgtc gtcgttgaat tgctaccacc aatcactgcc cctcgccgaa atgctctacg 2531041 cccgaggatc gggactacgg gcatcgcgca atcaaggcca cgccattgtt acccctgatg 2531101 ggagccccgc cgaatgaccg cgaccgttct gctcgaggtc ccgttctctg cacgtgggga 2531161 tcggattcct gacgccgtcg cagaattacg aacccgcgag cctatccgca aggtacggac 2531221 cattaccggc gccgaagcct ggctcgtctc ctcgtatgca ctgtgcacac aggtgctcga 2531281 ggatcggcgt ttttccatga aggaaaccgc cgctgccggc gccccccgcc tgaacgcgct 2531341 gactgttcca cccgaagtgg tcaacaacat gggaaacatc gccgacgcgg gactgcgcaa 2531401 ggcggtgatg aaagcgatca cacccaaggc acccgggttg gagcaattcc tacgagacac 2531461 cgcgaactcg ctgctggaca acctgattac cgagggcgca ccagccgatc tgcgcaatga 2531521 cttcgccgac ccgctggcca ctgccctgca ctgcaaggtt ctgggcatcc cgcaagaaga 2531581 cggcccgaag ctgttccgta gcttgagtat cgctttcatg agttcggccg acccgatccc 2531641 cgccgcgaag atcaactggg atcgcgacat cgaatacatg gccggaattc tggaaaaccc 2531701 aaacatcacg accggcctca tgggtgagct cagccgcctc cggaaagatc ccgcctactc 2531761 gcacgtctcc gacgaactat tcgcgaccat cggcgtcact ttcttcggtg ccggcgtcat 2531821 ctcaaccggc agcttcctca ccaccgcgct gatatcgctg atacaacgcc cgcaacttcg 2531881 gaacttgttg cacgagaagc cggaactgat cccggccggt gtagaggaac tgctgcggat 2531941 caatctctcc ttcgccgacg ggttaccgcg cctggccacc gccgacatcc aggtcggcga 2532001 cgtgctggtc cgcaaggggg agctggtgct ggtgctgctc gagggcgcca acttcgatcc 2532061 cgagcacttc cctaacccgg gcagcatcga actcgaccgg cccaacccca cctcgcacct 2532121 cgcgttcggc cgcggccaac acttctgtcc tggatcagct ctcggtcgcc gccacgcaca 2532181 gatcggcatc gaagcgctgt tgaaaaagat gcccggcgtc gacctggctg tgcccatcga 2532241 ccaattggtc tggcgcaccc gattccaaag acgcatcccc gaacgccttc cggtgctctg 2532301 gtaggcttcc ggaaactcac ccgagccatc accgcaagat ttggcaagcg ttgggacaga 2532361 acaatttcga ccttgcaccg gccgaaggcg ctgccttcta ccgaataaaa gtacgggcct 2532421 cccccaaact ccgaaatcgt cagtaccgca cgcaattcaa atgaaccgca ccctgacagc 2532481 gagcgacgtt aatgacgcca ttgttgggcc gccagcggcg agtccacaag taccgcatcg 2532541 agtccgattt tgtgagccag gcggtagtcg tcgacagttt tcaccgcgaa acccatgacc 2532601 ttcatgccgg actgcgatct gaaacagtcg accgaggcct cgtcccacaa ctcggcattc 2532661 accgcggaga taccggaccc caacgtgaat tcttcggtga cggtgacatc gcgatgcaac 2532721 tcgaatccgg cccacttccc aggatccggc tgcggatcac agtgatggtt caatgccatg 2532781 ttgaaaaggc gctggcgggt cacgtcacga ctttcggcga cctgcagtcc ctcctgccgc 2532841 gaggctgcag ccgtgatgtc agcgttggtg gaatatacga tcgaccgccc ggcagcacca 2532901 gtcctggtca acacctgcgc gaccgctgag accagcggct gtggcggagt ctgcttgggg 2532961 tctagaaaca gagtcatatc gggcggagtc gcgccaatgg cttgctccag tgtcggtatc 2533021 ggggtcgccc gttgccggta gggatggccc tcgacgcccg gcgtggtgaa attccatccc 2533081 gcgttgagct gctggagttg ctgaaccgtc ttcgaattca ccgggccggc gccgtcggtc 2533141 aacgttgcca gatcggacgg acgatacagc accggcacgc catcgctgct gacctggacg 2533201 gtcagccaca tgccatccac accagctgcg actgcgttgg taatcgccag aacggtgttc 2533261 tcgggaaaat cgcgcgtacc cgcgcgatgc gcgacaatca tcgggtcgtc agtctggccc 2533321 agcggcaaag catccgccac accgcaagtc cctcccaagg cgatcaccag cgccaccgcg 2533381 cccaacatag ccgtcttcac catcggtccc cttcaggctt tccccaccgt agaaacgtgc 2533441 gcaatgcgcg gcgcacagta tcgaaccgta ccgctgagag ccaaccacga tgatttgccc 2533501 gcaccggcag cgataaagta agtcgcggtc gggcacgcag cgcagcgttg gaaagtgagg 2533561 cctccgatga gtgaaatgac agctcggttt tccgaaatcg tcgggaacgc caatttgctg 2533621 accggcgacg caatccccga ggactacgca cacgacgaag agttgacggg gccgccgcag 2533681 aagccagcct atgccgccaa gccggccacc cccgaagagg ttgcccaact gctgaaggcc 2533741 gcctctgaaa acggtgtgcc ggtgacggcc cgcgggtccg ggtgcggctt gtcgggggcc 2533801 gcacgaccag tcgagggtgg gctgctgatc tcgttcgacc ggatgaacaa ggtcctcgag 2533861 gtcgacaccg ccaaccaagt cgccgtcgtg cagcccgggg tggcgttgac cgacctggac 2533921 gccgctaccg ccgataccgg gctgcggtac acggtttacc cgggcgagct gtcctccagc 2533981 gtcggcggga atgtcggaac caacgccggc gggatgcgcg cggtcaagta cggagtggcc 2534041 cgccataacg tgctcgggtt gcaagcggta ttgcccaccg gcgagatcat ccgaaccggc 2534101 ggcaggatgg ccaaggtgtc caccggctac gacctcaccc agctgatcat cggctcggag 2534161 ggcaccctgg ccttggtcac cgaggtgatc gtcaagctgc atccgcggct cgaccacaac 2534221 gccagcgtgc tcgccccgtt cgccgacttc gaccaagtca tggcggcggt gcccaagatc 2534281 ctcgccagcg gcctggcacc tgacatcctg gagtacattg acaacacttc gatggccgca 2534341 ctcatctcca ctcagaacct ggagctaggt attccggacc agatccgcga cagctgcgaa 2534401 gcttatctcc ttgtggcgct tgagaaccgc atcgccgacc gactgttcga ggacattcag 2534461 acggtgggtg aaatgctcat ggaattggga gcggtggacg cctacgtgct cgaaggaggc 2534521 tcggcgcgca agctgatcga ggcccgcgag aaggcattct gggcggcaaa agcactcggc 2534581 gccgacgaca tcatcgacac cgtcgtccca cgcgcgtcga tgccaaaatt cctgagcacc 2534641 gcgcgcggtc tggcggcggc agcggacggt gccgcggtcg gttgcgggca cgccggcgac 2534701 ggcaacgtac acatggccat cgcgtgcaag gatccggaga aaaagaagaa gctcatgacc 2534761 gacatctttg ctctcgcaat ggaattgggt ggcgcgatct ctggcgaaca cggcgtcggc 2534821 cgggccaaaa ccggctattt cctcgagctg gaagacccgg tcaagatcag cctcatgcgc 2534881 cgtatcaagc agagcttcga tccggcgggc atcctcaacc caggcgttgt cttcggagac 2534941 acctgagcac ggacaagagc cggccggacc aaggccggtc atcggccggc caacaggcct 2535001 gcaagtctcg agcgcaacat cttcgtggac agctcggtcc gccggtcgtc aaagccgatt 2535061 tccccgcatc tgtccggtca gtccgatgca gcgtcggtca ccgttattca tccggcgttt 2535121 acccgttgct agccgccatg acgtagcctg ctgacgctcg atcgccaaca caagccgaca 2535181 tgagcgacaa tgccaaacac cacagggatg ggcatttggt ggctagcgga cttcaggatc 2535241 gcgcagcgcg cacaccgcaa cacgagggct tcctcgggcc ggaccgacca tggcacctgt 2535301 cgttcagtct gctgctggcg ggttctttcg tgctgttctc gtggtgggca ttcgactacg 2535361 cagggtccgg cgcgaacaaa gtcatcctgg tgctcgccac cgtcgtcggc atgttcatgg 2535421 ccttcaacgt cggcggcaat gatgtcgcca actcgtttgg caccagcgtc ggcgcgggca 2535481 cgttgaccat gaaacaggcg cttctggtcg cggcgatctt cgaggtcagc ggcgcggtga 2535541 tcgccggcgg cgacgtcacc gagaccatcc gcagcggcat cgttgatctg tccggggtgt 2535601 ccgtcgaccc acgcgacttc atgaacatca tgctgtcggc gctatcggca gccgcgctct 2535661 ggctgctgtt tgctaaccgt atggggtacc cggtgtcgac cacacactcg atcatcggcg 2535721 gcatcgtcgg cgcggcgatc gcgctgggga tggtgagcgg ccagggcggt gccgcactca 2535781 ggatggtcca gtgggatcaa atcggccaga tcgtggtgtc ctgggtgctg tcgccggtgt 2535841 tgggcggctt ggtgtcgtac ctgctctacg gcgtcatcaa acggcacatc ctgctgtaca 2535901 acgaacaggc cgaacgacgg ctaacagaaa ttaagaaaga gcgcatcgca caccgcgagc 2535961 gccacaaggc ggcgttcgac cggctcaccg agatccagca gatcgcctat accggcgccc 2536021 tggcgcgcga cgccgtcgcg gcaaaccgca aggactttga tcccgacgaa ctggaatccg 2536081 attactaccg cgagctacac gaaatcgacg ccaagacatc gtcggtcgac gcgttccggg 2536141 ccctgcagaa ctgggttccg ctggtcgccg ccgccggatc catgatcatt gtcgcgatgc 2536201 tgctgttcaa ggggttcaag cacatgcact tgggccttac cacgatgaat aactacttca 2536261 tcatcgcgat ggtcggtgca gcggtgtgga tggccacctt tattttcgcc aagacacttc 2536321 ggggcgaatc actttcacgg tcaacgtttt tgatgttcag ctggatgcag gtctttacgg 2536381 cctcgggctt cgccttcagc cacggcagca atgacattgc caacgccatc gggccgttcg 2536441 cggcaatcct ggatgtgctg cgcacgggcg ccattgaagg caacgcagcg gtgcctgccg 2536501 cggccatggt aacgttcggc gtcgcgttgt gcgcggggtt gtggttcatt ggacgacggg 2536561 tgatcgccac cgttggacac aacctcacca cgatgcaccc ggcatcgggg tttgctgccg 2536621 aattgtcggc cgccggggtg gtcatgggag ccacggtcct gggtcttccg gtttccagca 2536681 cgcacattct tatcggcgcc gtcctcggcg tcggcatcgt aaaccggtcc accaactggg 2536741 gactgatgaa accgatcgtg ctagcgtggg tcatcacgct gccttcggcg gcgatcctcg 2536801 cctcggtcgg tcttgtcgcg ctacgcgcga ttttctgacg acgccgggtc catcaacccc 2536861 agcgcaacct ccgcgagcag tcgctaaagc ccccgacacg ccgtgcgtgc gggggcttat 2536921 gcgactgctc gccggacgga ggtcctacgt gctgcgggaa gtgatgtggc tgagcaggtc 2536981 tcgtatcgca cccgccggcg gggtgcgccc accgacccag atggctcgaa gctggcgccg 2537041 caggttcaac gcggggatgt cgaccgcgag taatcgaccg aacgccaggt catcggctat 2537101 cgctagccgg ctcatcgcag ccggtccagc gccggccaag accgcggccc gcacggccgc 2537161 agccgatgat aattccagca ccggtggcgc ttgctgcatg tcctccccga gcgtgtcacg 2537221 taacgccgcg gtgagtgaat cgcggatgcc agagttcggt tcgcgagtca ccaaaggcgt 2537281 ctgagcgagc tcccgggcgc tcactactcg tgaccgtcgg gcccacttgt gacccggcgg 2537341 cacgacgacg accagttcgt cgcgtgcaac cacaacgctg cctaatcccg tgggaggaca 2537401 ggggttttcg atgaatccaa gatctgcgat gccgtcacga acggctgcga tcgcatgctc 2537461 gctattggtg gcggtcagga ttacctcagg gacagtacca ccgcggcgca tgtcggcggc 2537521 ccgcaaggac agcatccaat gcggcatcag ctgttcggct atcgtctggc tggccaccac 2537581 tctgatgcgc tggcggcctt cggtgcgcag cgagccgagg ccggcatcga tctcgtcggc 2537641 gacttcgagc aagcgggccg cccattcggc gacgacgatg ccggcaggcg tgagttggga 2537701 gccacgtgtc gtccggatgg ccaatcgcac cccgatctgg gcctccatcg atgcgagccg 2537761 ccttgacaca gcttgttgag tcaacccgag ttcgcgtgcg gcgccgccaa gactgccggc 2537821 ctcagcgatg gccagaaaga tttcgaagca ggtgagtccg ggcatacgag agctgagcgg 2537881 catgcctgat caaatcacaa ccaatgtttg ttcccaacaa cattcagacc cctagtgacg 2537941 acggcccatg ctcgaaaaat gcccccacgc gagcgtcgac tgcggtgcct cgaaaatcgg 2538001 catcaccgac aacgaccccg cgaccgccac caaccgcagg ctggcgagca caattcgcaa 2538061 gccgccgatc gagcacgcgg ccgggccctt agggtccaca tcacgcgctg gccaccgttc 2538121 gtacggcggg gtggcctcgt aaggtaacca catgggcgct cctcgactca tccacgtcat 2538181 ccggcaaatc ggggccttgg tggtagcggc agtgaccgcc gccgccacga tcaacgcata 2538241 taggccgctg gcgcgcaacg gattcgcatc gctgtggtcg tggtttattg gcctggtggt 2538301 taccgagttt ccgttaccga cgctggcgag ccagctcggc gggctggtgt tgacagccca 2538361 acgcctgacc cggccagtgc gggcggtctc ctggctggta gcggccttct cggcgctggg 2538421 gctgctgaac ctcagtcgcg caggccgtca ggccgatgcc cagctcaccg ccgcattaga 2538481 cagcggcctg gggcccgatc gccgcaccgc ctcggccggt ctgtggcgcc gcccagccgg 2538541 cggtggtacc gccaagaccc ccgggccgct gcgcatgctg cggatctacc gcgattacgc 2538601 acacgatggc gacatcagct acggcgaata cggcagggcc aaccacctcg atatctggcg 2538661 acgtcccgat ctagatctga ccggaacagc gcccgtgctg tttcagatcc ccggcggtgc 2538721 atggaccacc ggaaacaaac gcggacaggc gcatccactg atgagccacc tcgccgagct 2538781 aggctggatc tgcgtggcga tcaactaccg acacagcccg cgcaacacct ggccggatca 2538841 catcatcgac gtcaagcgcg ccctggcgtg ggtcaaggcg cacatcagcg aatacggcgg 2538901 cgatccggac ttcatcgcca tcaccggtgg ttcggccggc ggccacctgt cgtcactggc 2538961 cgcgctaacg ccgaatgacc cacgattcca accgggattc gaagaggcgg acacccgggt 2539021 gcaggcagcc gtgccgttct acggcgtcta tgacttcact cgtctgcagg acgcgatgca 2539081 cccgatgatg ctgccgctgc tggagcgaat ggtggtcaaa caaccgcgca cggcgaacat 2539141 gcagtcctac ctcgacgcct caccggtcac ccacatttcc gccgacgctc ccccattctt 2539201 tgtgctacac ggccgcaacg actcgctggt tcccgtacag caggcgcgtg gcttcgtcga 2539261 tcagctgcgg caagtcagca agcagccggt ggtatacgcc gaattgccct ttacccagca 2539321 cgctttcgac ctgctcggct cggcacgtgc ggcacacacg gcgatcgccg tggagcaatt 2539381 cctggccgag gtctacgcaa cgcaacacgc gggcagtgag ccgggccccg cggttgcgat 2539441 cccatagctt ttggggttga ggtcgctagg gttggccttg tgaagctgct cagcccgctg 2539501 gatcagatgt tcgcgcgcat ggaggcgccg cgcacgccaa tgcacatcgg cgcgtttgcg 2539561 gtcttcgacc tgcctaaggg agcaccgcgc aggttcatcc gcgacctgta cgaggcgatc 2539621 tcacaactgg cgttcctgcc cttcccgttc gacagcgtga tcgccggcgg cgcgtcgatg 2539681 gcgtactgga ggcaggtgca gcccgatccg agctaccacg tccgcttgtc cgccctacct 2539741 tatccgggga ccggccgcga tctcggcgcg ttggtcgagc ggctgcattc gaccccactt 2539801 gacatggcca agccgctatg ggagttgcac ctcatcgagg ggctaaccgg ccgtcagttc 2539861 gccatgtact tcaaggccca ccactgcgcg gtcgacggat tgggtggggt gaacctgatc 2539921 aagagctggc tcaccaccga tcccgaggca cccccaggct cgggcaagcc cgagccgttc 2539981 ggcgatgact acgacttggc cagcgtgttg gccgccgcca cgacgaagcg ggcggtcgag 2540041 ggcgtttccg cggtcagcga actggccgga aggctatcca gcatggtgct gggcgccaac 2540101 agctcggtgc gggcggccct caccaccccg cgtaccccgt ttaacacccg cgtcaaccgg 2540161 catcgacggc tagcggtgca agtgctgaaa ctgccgcgcc tcaaggcagt ggcccacgcc 2540221 accgactgca ccgtcaacga cgtgatcctg gcgtctgtcg gcggggcttg ccgacgctac 2540281 ctgcaggagc tgggcgacct gccgacgaac accctgaccg cctcggtgcc ggtcggcttc 2540341 gagcgcgacg cagacacggt caacgccgcc tcgggtttcg tcgcgccgct gggcacctcg 2540401 atcgaagacc cggttgcgcg gctgaccaca atctcggcgt cgaccacccg cggcaaggcc 2540461 gaactgctgg cgatgtcacc aaatgccttg cagcactact ccgtattcgg cttgctgccg 2540521 atcgcggtgg ggcagaagac cggcgcactc ggggtgattc caccgctgtt caacttcacc 2540581 gtctccaatg tggtgctctc gaaggacccg ttgtatcttt cgggcgccaa gctggatgtg 2540641 attgttccga tgtcgttcct gtgtgacggc tatggcctca acgtgacgct ggtcggctac 2540701 acggacaagg tcgtcctcgg ctttctgggc tgccgtgaca ccttgccgca tctgcagcgg 2540761 ctagcgcagt acaccggcgc ggcattcgag gaactcgaga ccgccgcctt gccatagcga 2540821 ccaaacgacg acaacgctcc gcccatcgcc ggcagtaccc gccaatcacc acggtgtagc 2540881 cgctcaggag cggcccgcca gccggtcgat atcaacgatc tccccgcgat tgatgctcac 2540941 ccaatcgcgc ccatcgaggt aggggcgtag ctgctgggca atgagttcaa catcggctgg 2541001 tgacttcggt cgctgaagtt cgtagacgtg cggcagcccc gccatgcccg tgacgacgct 2541061 ccacaggttc agggcggccg gtccagcagg cgggtccacc agcaccggcc caaaaaggca 2541121 ttgtccgtcc aaaaagagcg tcggcacgcc gtatccgccc gcggcgacaa cccgttagtg 2541181 gtcggcgcgg acgtcgtcgt gggtcgtcgg atcatccagc gccgcgtcca aaatcgccgc 2541241 attgacgccg acgtcgcaca gtaggcgtcg cgccaccgcg ggatcatgcg gtttgccgcc 2541301 cagggtgtgc agctcatgac cgatcgctgc ataccaccga tcaagcaacg acatgttcgt 2541361 tcgacgcagc agcgcaccga tccgcatcaa cgaccagcca taggaccagt ctcgctccca 2541421 cgggtgcttc ttgcccgcta ccaggttgat ctcctcgagg ctgaaaaacc gccagttgat 2541481 cgtgattccc aattgcgcgc gcacatcacg gatccacacc gaggtctgat aggcgaacgg 2541541 gcacaaaggg tcaaagtgga aatccacggt ggtcatcaga cctgagtcct ccagctgatc 2541601 gagtcgacac ctcgatgaca ttgtgccgtg cgccacgttg tcagcggact gagtcgaccc 2541661 aacatctcgg ggtgttcgcc agggtgccga aacaggtcaa cgcggcggta tgaatggtcg 2541721 acgcaccata ggcgaggatg ggctggtgtt cgggctcgtc gttatcgttg cgctggtcgc 2541781 cgccgtggtc gtggggaccg tcctgggcca ccgctatcgc gtgggccctc cagtgttgct 2541841 catcctgtcc ggttccctgc tgggtctgat tccccgtttc ggtgacgttc agatcgatgg 2541901 cgaggtggtg ctgctgctgt tcctgccggc gatcctttat tgggagagca tgaacaccag 2541961 ctttcgcgag atccgctgga acctgcgcgt catcgtcatg ttcagtatcg ggctggtgat 2542021 tgccaccgcg gtcgcggtgt cgtggacggc acgagcgctg ggcatggagt cccacgccgc 2542081 ggctgtcctc ggtgccgtgc tctcccccac cgatgccgcg gcggtggccg gcctggcgaa 2542141 acggttgccg cgccgggcgc tgacagtgct acgcggcgag agcctcatca acgacgggac 2542201 cgcgctcgtg ctgttcgccg tcaccgtggc ggtcgcggaa ggtgccgctg ggatcggccc 2542261 ggccgcgctg gtcggccggt tcgtcgtctc ctatctcggc ggaatcatgg ccgggctgct 2542321 ggtcggcggc ctggtgacat tgctacgccg cagaatcgac gcaccattgg aggagggagc 2542381 cctgagcttg ctgacgccgt tcgcagcgtt cttgctcgct caatctctga agtgcagcgg 2542441 tgtggttgcg gtgctggttt cggccctggt cctcacctac gttggtccga cggtgatacg 2542501 cgctcgttcc cgcctgcagg cgcatgcgtt ttgggacatc gccacgttcc tgatcaacgg 2542561 ctcgttgtgg gtgtttgtcg gcgtccagat cccgggcgcg atagaccaca tcgccggcga 2542621 ggacggggga ctaccacggg ccacagtcct ggccctggcg gtgacgggtg tcgttatcgc 2542681 cacccggatc gcctgggtac aggcaaccac ggtcctgggt cacaccgtgg accgggtcct 2542741 gaagaagccc acccgccacg tcggcttccg tcagcgttgc gtcacaagct gggccggttt 2542801 ccgcggcgcg gtatcgcttg ccgcagcgct ggcggtgccg atgaccacca atagcggcgc 2542861 tccattccca gaccgcaacc tgatcatctt cgtcgtctcg gtcgtcattc tggtcaccgt 2542921 gctggtccaa gggacttcct tgcccaccgt cgttcggtgg gcgaggatgc ccgaagacgt 2542981 cgcgcacgcc aacgaattgc agctggcccg cacccgtagc gcccaagccg ccctcgacgc 2543041 tttgccgacg gtcgccgacg aactcggggt cgcccccgat ctcgtcaaac acctggaaaa 2543101 ggaatacgaa gaacgcgcgg tgctcgtcat ggccgatggc gccgactccg cgaccagcga 2543161 tctggccgag cgcaacgatc tggtccggcg cgtgcgtcta ggcgtgctgc aacaccagcg 2543221 gcaggccgtc accacgttgc gcaaccaaaa cctcatcgac gacatcgtgc tgcgcgagct 2543281 gcaggcggcg atggatctag aggaagtgca actcttggac cccgccgacg ccgagtgagc 2543341 cggcgccgcc cgctgatcga accagcaacg gttcaggttt tggccattgc tttcacagac 2543401 tcattcagcg tttcattgca ctggccgcag cgcgagcagg gctgccgcac agcgatcttg 2543461 gcgcctatgc gaaggtggtg cgatggtgat gtggacgggc gaaagttact gccaccggca 2543521 cgccgcactg gcacccaaca gaggaggatc aggcccgccg cacccagggt ctacacgacc 2543581 ggcgacatcc tgcgtgatcg gaagggcata gcgccatggc aggaacaacg cgaaccgggc 2543641 tgggcgccgt tcggttggct gcacgagccc tcgggcgcaa ggtgcccaaa agccgacggg 2543701 cagtcagtct aagtgtcttg ataggtgcgg tgatagcagc tcttgccggg gcgctgattg 2543761 cggtaaccgt accggcgcgg ccgaatcgcc ctgaggccga ccgtgaagca ctgtggaaaa 2543821 tcgtgcacga ccgttgcgaa ttcggctatc ggcgtaccgg tgcgtacgct ccctgcacat 2543881 tcgtggatga acagtctgga acggcgttgt acaaagcgga ttttgatccg taccagttcc 2543941 ttttgatccc gcttgctcgt atcaccggaa tcgaggatcc cgccctacgg gagtcagcgg 2544001 gtcgcaatta cctctacgac gcttgggccg cacggttcct cgttaccgcg cgcctgaaca 2544061 actcacttcc agagtcagac gtagtcctca ccatcaaccc gaagaacgcg cgcactcagg 2544121 atcagctgca catccacata tcgtgttcgt caccaacaac atcggcagcc ctgaggaacg 2544181 tggatacctc agagtacgtt ggctggaagc agctccccat cgacctcggt ggtcgcaggt 2544241 ttcaaggatt ggcggttgac acgaaggcgt tcgaatccag gaacctgttc cgggacatct 2544301 acctgaaggt aaccgctgac ggcaagaaaa tggaaaatgc atcgattgcg gttgccaacg 2544361 tagcgcagga ccaattcctg ctgctcttgg cagagggaac tgaggaccag cccgttgcag 2544421 ccgagactct ccaagaccac gactgctcca tcaccaagtc ctgatagcac gatgccagcg 2544481 ggccacacga cagggcgcag tgtgcgaacc tgaccccgcc acggcgggcc gttgatggca 2544541 ttttgctagt gtcggagcgg caatccgcct atatttctcc tcgcctacca gtgagggagc 2544601 cgggcttgac tgatccgcgc cacaccgttc gaatcgctgt cggagctacc gcgctcggcg 2544661 tgtcggcact cggggcaact ctgccggcct gctccgcaca cagcgggccg ggttctcccc 2544721 cagtgcgccg tcagctcccg cggccgcgac cgtcatggta gagggacata cgcacacaat 2544781 ttccggagcg gtcgagtgcc gcacctcgcc agcggtaagg acggcgacgc cgtcggagtc 2544841 ggggactcaa actacacggg ttaacgcaca cgacgattcg gcctcggtga cactgtccct 2544901 gtccgactcc acgcccccag acgtcaatgg ttttggtatc tcccttaaaa tcggaagcgt 2544961 cgactaccag atgccctacc agccggttca gtccccaact caggtcgaag cgaccaggca 2545021 gggcaagagt tacacactga ccgggacggg tcacgcggtg atcccgggcc aaaccggcat 2545081 gcgtgagctg ccgttcgggg tacatgtaac ctgtccgtaa ctacactgat tgcgcgacaa 2545141 gggaattagc cgcgttggca ggcaacacgg aggtgaccgg tgcaagcccg tggtcaggtc 2545201 ctgatcaccg ccgcggaact ggctggcatg atccaggccg gcgatccggt gtcgatcctg 2545261 gatgcgctgg cggcttgatg aacctgacgg gcatgcggcc tacctacagg gtcacctgcc 2545321 gggagcggta tttgtgtcac tcgaggacga actgagcgat catacgatcg ccggccgggg 2545381 ccggcacccg ctgccgtcgg gggctagtct gcaagccacc gtccgccgat gcggaatccg 2545441 acacgatgtg ccggtcgtgg tctacgacga ctggaatcga gccggttccg cgcgagcgtg 2545501 gtgggtgtta actgcggctg ggatcgcgaa tgtacgcatt ctagacggcg gcttgcccgc 2545561 gtggcggtcc gcaggcggca gcatcgagac cggccaggtc agcccgcagc tcgggaatgt 2545621 gactgtgctg cacgatgatt tgtatgccgg acagcggcta accctaacgg cgcagcaagc 2545681 cggtgcgggt ggtgtgacgc tgctcgatgc gcgcgtaccg gaacgtttcc gcggcgatgt 2545741 cgagcccgtg gatgcggttg ccggtcacat ccccggcgcc atcaacgttc ccagcggtag 2545801 tgtcctggcc gacgacggca cgttccttgg caatggcgcc cttaacgcac tgctgtccga 2545861 ccacggcatc gatcacggtg gccgcgtggg tgtctactgc ggctcgggtg tcagcgcagc 2545921 tgtcatcgtc gcggcactgg cagtgatcgg ccaggatgcg gcgctgtttc cagggtcatg 2545981 gtcggagtgg agttcggatc cgacccgtcc cgtcggccgt ggcactgcat agtcagacgc 2546041 cggcccagtt ctgcaggaag gcttcggtga cccgggcggc gttgttggcc gcaatctgct 2546101 tgtaaacgaa gaactggacg gggaagcccg gcagatgcag cgggtcgccg ggcccgtcgg 2546161 acataccgcg aattcccagg aacgggacgc cgtgtgcatc ggcgaccgcc tgcgcggctg 2546221 ccgtctcctg gtcaaccgcg tcgaagccgg ggttcaccgt cgatacgatg ttcaggttgc 2546281 tgatcagagc gttcttcagc cagggtcccg ccgcctggaa aaagttaccg gtatagccaa 2546341 gtgagcgatc gggtgcacta cagggttggc agccaaaaac gctgccgccg ttcgggatgc 2546401 aaggaaaagc ctggccgttg ttcttgtcgg agctagaccc gtcaccgccg acgaacagtt 2546461 gcggctggcg ccccaggtgg ttcaaccgga cgaccggaac gttcctgcac agacagacag 2546521 gattgccgag cgtgttgatg ttgtccagta caacagaaag cgtctgggca gtagccagca 2546581 tgccgggatc gaccccacgg aatgttgccc cgttgtccag ggtccaccgt gctggtattg 2546641 ccacgtcccc aatgctggtg cggccggcac caccggcgac gcccgagaac atcacggcgg 2546701 caatggcaat ggaagaagca caggtaaagc gtgcgaaggc ggtctcggtg gtgttggtag 2546761 cgttcactag gccgatgccg gtcatcgcca caatcacctt cttgccgctg atcgagccca 2546821 ggtagtagcg acgacggtcg gcgaccacca ccgggttggc gtccagcgcg gtgtgcgcca 2546881 gcaccgcgtc ggcctcagcc ggaaacgccg acaagaccag cgtgcgctgt tcgcacggga 2546941 tcacatttgc cacgtatccg ggatcggccg ccgccacgcc acagcccagc gacaacgcgg 2547001 ccgccaccaa aagacagtgc cgcaaaggcg cgcccacaat cccttatccc caaaaatcgt 2547061 gatttgacat ggatgccgga actctctgtc atttagccgt ggccgatttg gggcttggcc 2547121 ctgattttcg cgcaccatcg gcgacggacg aatatttgtt atcgtttttt tcgtctagcg 2547181 attcctcggc gttatttcat cgcggcggaa cgagccgccc tatgaccaac tgtgcaagcg 2547241 tgattggtcg atagccccgg tcgggctatg ttccccggtg tggctagacc agttgaccgg 2547301 tgcgggacgc ggatacggct agtctgccgg agtgatacct aacccactcg aggagctaac 2547361 gctcgagcaa ctgcgaagcc aacgcacgag catgaagtgg cgtgcgcacc cagccgacgt 2547421 cttgccgttg tgggtcgcgg agatggacgt gaagcttccg ccgacggtgg ccgatgccct 2547481 ccgtagagct atcgacgacg gcgacaccgg atatccctat ggaacggagt atgccgaagc 2547541 cgtccgcgaa ttcgcttgcc aacgttggca atggcacgac ctggaagtga gccgcacggc 2547601 catcgttccc gacgtcatgc tcggcatcgt cgaagtgctg cgtctgatca ccgaccgcgg 2547661 tgaccctgtg atcgtcaact ccccggtata tgcgccgttc tacgctttcg tgtcgcatga 2547721 cggccgccga gtgatcccag cgccgctgcg gggagacggc cggatcgatt tggacgcgct 2547781 gcaggaagcg ttctcgagcg cgcgtgcttc aagcggctcg agcggcaacg tcgcctacct 2547841 cctgtgcaat ccgcacaacc cgacggggtc ggtgcacacc gccgacgaac tgcgcggcat 2547901 cgcggaacgc gcccaacggt tcggtgtccg ggtggtgtcc gacgagattc atgcccctct 2547961 tatcccgtcc ggggcacggt ttacgcccta tctgagcgtc cccggtgcgg aaaacgcatt 2548021 cgcactaatg tcggcttcca aggcgtggaa tctcggcgga ctcaaggcag ccctggccat 2548081 tgccggtcgc gaggcggcgg ccgacctcgc tcggatgccc gaggaggtcg gtcacggccc 2548141 cagccacctg ggtgtcatcg cgcacaccgc ggcgttcagg actggtggca actggctcga 2548201 cgcgctgctg cgcggtctgg accacaatcg aacgttgcta ggcgctctgg tcgacgagca 2548261 tcttcccggg gtgcaatacc gatggccgca gggtacttac ctggcgtggc tggattgccg 2548321 agaactcggc ttcgatgacg cggctagcga cgagatgacc gaaggcctgg cggtggtgtc 2548381 agatctgtcc gggccagccc gctggttcct cgaccacgcg cgggttgcgc tcagttctgg 2548441 tcacgtcttc gggattggcg gtgccgggca tgtgcgcatc aacttcgcga cctcccgagc 2548501 cattctcatc gaggcggtat cgcggatgag ccggtcacta ctcgagcgcc ggtagcgcgt 2548561 ccagagaacc gctagcgcca acacgatcac ctcgggtgac ggtcttgtcc gctcggcggc 2548621 ccttcagtgc ccagccaatg cggccgaccc cgcggcggcc gcattcggta gacaaaggaa 2548681 gtctgacacc gtaggcgcct cgttgatcgc gttttcgccg agaaacgtga aggccgtttg 2548741 cccgcccgtg cggatcagct acgatcaagg cggccacatg gaccagtcgg ccaaccatgc 2548801 gtgtctgccc accccgctgg cgagcacaac agggcgcggg caagatcatg agatgcctgt 2548861 cgaagagacc tccacccccc agaagctgcc ccaatttcgt tatcaccccg atcccgtcgg 2548921 caccggctcg atagtcgccg acgaggtgag ctgcgtgagc tgcgagcaac gtcggcccta 2548981 cacctacacc ggcccggtgt atgcggagga ggagcttaac gaggccatct gtccttggtg 2549041 tatcgcagat ggcagtgcgg cgagtcgctt cgatgccacg ttcaccgacg ccatgtgggc 2549101 ggtgcccgac gacgttccag aggacgtgac cgaggaagtg ctgtgccgaa cacccgggtt 2549161 cacgggctgg ctgcaggagg aatggttgca tcactgcggg gacgccgccg ccttccttgg 2549221 cccggtgggc gccagcgagg tggccgacct ccctgacgcc ctggatgcgc tgcgcaatga 2549281 gtaccgcggc tacgactggc ccgccgacaa aatcgaggaa ttcatcctga cgctcgatcg 2549341 aaacgggctg gcgaccgcct acctcttcag gtgcctgagc tgcggcgtcc acttggccta 2549401 cgccgatttc gcttaacctc ggcggcgact gagtcgacgc gagcgcggat atcggacgct 2549461 tttgcacaac aatggttccg acgtggcaca gctcagagag gagcagatca tggatgtcct 2549521 acgcacccca gactcccggt tcgaacacct ggtgggctac ccgtttgcac cgcactatgt 2549581 cgatgtgacg gccggcgaca cccagccgtt gcgaatgcac tacgtcgacg agggcccggg 2549641 cgacggtccg ccgatcgtct tgctgcacgg cgagcccacc tggagttatc tgtaccgaac 2549701 catgattccg ccgctctccg ccgccgggca ccgtgtgctc gcgcccgacc tgatcggctt 2549761 cggccgctcc gacaagccga ctcgcatcga ggactacacc tacctgcggc acgtcgagtg 2549821 ggtgacgtcc tggttcgaga atctcgacct gcacgacgtt acgctcttcg tgcaggactg 2549881 ggggtcattg atcggtctgc gcatcgctgc cgagcacggt gaccggatcg cgcggctggt 2549941 ggtcgccaac gggtttctcc ccgccgcgca ggggcgcacc ccactcccct tctacgtgtg 2550001 gcgggcgttt gcgcgctatt ctccggtgct tcccgctggc cgtctggtga acttcggcac 2550061 cgtccacagg gttcccgccg gggtccgagc cggctacgat gcacctttcc ccgacaaaac 2550121 gtatcaagcc ggcgcccggg cgttcccacg gttggtgccg acctcacccg acgatccggc 2550181 ggtaccggcc aaccgcgcgg catgggaagc cctgggccgg tgggacaaac cgttccttgc 2550241 catcttcggt tatcgcgacc cgatactcgg gcaagcggac ggtccgctga tcaagcacat 2550301 tcccggcgcg gcgggtcagc cgcacgcccg catcaaggcc agccacttca tccaggagga 2550361 cagcggaacc gaactcgccg aacgcatgct ctcctggcag caggcaacgt aaccgcgacg 2550421 gctgcggacg aaggatcggc agaatggcga tggagatggc gatgatgggc ctgctcggca 2550481 ccgtggtggg tgcctcggcc atgggcatcg gggggattgc gaagtcgatc gcggaagcgt 2550541 atgtcccggg ggtcgcggct gccaaggacc gtaggcagca gatgaacgtc gatctgcaag 2550601 cacggcgcta cgaggcggtg cgagtgtggc ggtctgggtt gtgcagtgcc agcaacgcct 2550661 accggcaatg ggaggccggg tctcgggaca cccatgcgcc caacgtcgtc ggcgacgagt 2550721 ggttcgaagg tttgcggccg cacctgccca ccactgggga ggcagcgaag ttccgtaccg 2550781 cttacgaagt ccgttgcgat aacccaactc tcatggtgct ttcgcttgag attggccgta 2550841 tcgagaagga atggatggtg gaggcgagcg gccggacacc aaagcaccgg ggatgactgc 2550901 gaagactcgc ggttggtagc gcacccggct ggtgcggcgc cgacaagctg cccacattcg 2550961 gtgacactga atttctgcag caaaagcgcg agtgaccaac ggtctgcgaa attaccggct 2551021 cggggtcggc tacaccgtcg agcgacgcgg tcgccgccgc gccgagcccc tcggtacggt 2551081 ggcagacatg aaatatctgg acgtcgacgg aatcggacag gtcagccgga tcgggttggg 2551141 cacttggcag ttcggctcgc gtgaatgggg atatggggac cggtacgcca ccggcgccgc 2551201 ccgcgacatt gtcaaacgcg cacgcgcctt gggggtcacg ctgttcgata ccgccgagat 2551261 ctacggcctg ggcaaaagcg agcgtattct cggggaggcc ctcggcgacg accgcaccga 2551321 ggtggtggtg gctagcaagg tcttcccggt cgcgccgttt ccggcggtga tcaagaaccg 2551381 cgagcgcgcc agtgcgcggc ggctgcagct gaaccgtatc ccgctgtatc agatccacca 2551441 gcccaacccg gtggtccccg attcggtgat catgccgggg atgcgtgacc tgctggacag 2551501 cggcgacatt ggcgcggccg gtgtctccaa ctactcactg gcgcgatggc ggaaggccga 2551561 cgccgcgctt gggcgcccag tcgtcagcaa ccaggtacat ttctcgctcg cccaccctga 2551621 tgcgctcgaa gatctggtgc cgttcgccga gctcgagaac cgcatcgtga tcgcctacag 2551681 cccgctggcg caaggactat tgggtggcaa gtacggactc gagaatcgtc ccggtggcgt 2551741 gcgcgcgttg aacccgctgt tcggcaccga gaacctgcgc cggatagagc cgctgctggc 2551801 tacgttgcgc gccatcgccg tcgacgtcga cgccaagccc gcccaggtgg cactggcctg 2551861 gctgattagc ctgccggggg tggtcgccat tcccggagcg tccagtgtcg agcaactcga 2551921 gttcaacgtc gcggccgctg acatcgagct cagcgcgcaa tcccgcgacg cgctcaccga 2551981 cgccgcccgg gcgtttcgcc cggtttccac cggccgcttc ctcaccgaca tggtgcgtga 2552041 gaaggtcagc cgtcgttgag ctcgctacaa ggtacgcgcg agacgttcgg ccagcagctc 2552101 ggcgaacctc gccggatcct cgagtgcgcc gccttcggcg agaagcgctg tgccgtaaag 2552161 taattccgcg gtttcggcca atgatttctc ggcatcgtct gcgcggtcct ggtgggcttg 2552221 gcgcaggccg gtcaccaacg gatggctcgg gttgagctca agtatccgct tgccgaccgg 2552281 aacctcctgg ccggaagccc ggtagatgcg cgcgagcgcg ggtgtcatcc cgaaggcatc 2552341 ggtgatcaga caggccggtg actcggtcag gcgggtggac agccgcacct ccttgacgtg 2552401 atcgctcaac gtctcctgca accaggtcag caggtcggca aattccttct gccgctcctc 2552461 gcgctcggcc tcgctggtgt cctcttcgga actcaagtcc acctcgccct tggcaaccga 2552521 ctgcagcggt ttgccgtcga actccggcac cattcccacc cagacctcgt cgaccgggtc 2552581 ggtgagcagc agcacttcgt accccttggc cttaaacgcc tccaggtgcg gtgacttcag 2552641 cagttgttgg cgcgtctcgc cggtggcgta gaagatctgt tgctgaccgt ccttcatgcg 2552701 ctcgacgtat tcggccagcg tggtgggttc ctcctcgctg tacgtggaga caaacgaaga 2552761 aataccgagc agggtctccc ggttatcgat gtctgacagc agtccctctt tgaggaccct 2552821 gccgaactgt gtccagaacg tgcggtagtc ctccggccgg ctggactgca cgtccttgat 2552881 cgtggacagc accttcttgg tcagccgccg gcggatggcc ttgatctgcc ggtcctgctg 2552941 caggatttcg cgagaaacgt tgagcgacat gtcctgcgcg tcgaccacac ccttgacaaa 2553001 acgcaagtac tcgggcatga gctggtcgca gtcgcccatg atgaacaccc gcttgacgta 2553061 gagctggata ccgacgtggg cgtcccggtc gaacagatcg aacggggcat gagacgggat 2553121 gaacagcagg gcctggtact cgaaggtgcc ctcggccttc atcgcgatga tctcgagcgg 2553181 gtcgtcccag gcgtgcgcga cgtgtttgta gaactccttg tactcctgct cagacacctc 2553241 ttctttgggc ctcgcccaca gcgccttcat cgagttgagg gtttcggttt cgatggtgac 2553301 ggtctcctcg ccgccttccc ccccttcttc ctgggaggct ggggtgcggc gctcgacgtc 2553361 catccggatg ggccaggcga tgaagtcgga gtatttcttg accaggttac ggatcttcca 2553421 ttccgaggtg tagtcgtgca ggtcgtcctc ggcgtcttcc ggcttgaggt gcagggtgac 2553481 cgacgtgccc tggggggcat cctcgacgga ctcgatggtg taggtgccct caccgctgga 2553541 ctcccatctg gtggccgcgc tctcgccagc cttgcgggta agcagttgga ccttgtcggc 2553601 caccatgaac gacgagtaga agccgatgcc gaactgaccg atcagttcct cggaggcggc 2553661 cgcgttcttg gcctcacgca gctgtgcgcg cagctcggcg gtgcccgact tggccagcgt 2553721 gccaatcaga tccaccacct cctcgcgcgc catcccgatg ccgttgtcac gaacggtaag 2553781 agtccttgca gctttgtctg cgtcgatctc gatgtgcaga tcggaggtgt cgacctccag 2553841 gtccttgttc cgcagcgcct caatccgcag cttgtctagc gcatcggagg cattcgagat 2553901 caactcccgc agaaacgcgt ccttattgga gtagaccgag tggaccatca aatccagcag 2553961 ttgccgggcc tccgcctgaa actccaactg ctcgacatgg gcgttcatga gattccttcc 2554021 gacgacatag cgactcgaat ttagcgagct gcgatccggc gccgagctgg gggtggcctg 2554081 gctaggccgt atcgcgagca agctgataga ggtcgggatc gtgtgcgcag acgatgagta 2554141 gatccgggtc gtggcgtcga tggagttcga cgattcgggc ctggttgtcg cgcagttggt 2554201 tgcggttata cgacaacagc ttttcctcgg cccgcatcac gaagggcacc cggaaccggc 2554261 catcgagggt gccgcgatga tagaaggcgt cgccgcagtg caaaacccag cggtgaccgg 2554321 catcgacagc taccgcggcg tgcccgcggg tgtgaccggg catcggcacc agaacgacac 2554381 cggtgccgat ggaatcgagg ggtttggccg atgcgaatcc gcgccagggt tccccgtcgg 2554441 gaccgtgctc caccagcttc gggccatggg cccactgtcc gcgtcgatat cgcagtcgct 2554501 cgcggagcga aggggcgtgg atggcaccgc gggcttcggc ggcggtgacg tggaggtgag 2554561 cctcggggaa gtcggcgatc ccgccgatgt ggtcgaagtc gaagtgggtg agcacaatgt 2554621 gtcgaacgtc ggacgtgcgg tagccgagct gttcgatctg gcgggccgcg gtttcggcct 2554681 gcaagaatgc cggccgcagg acatgacgga atagacctac ccggccgggg tcaaggcagt 2554741 cctggatacc gaagccggtg tccaccagca ccaatccatc gtcggtctcg acgagcagaa 2554801 cgtggcataa cagagcgatg ccaaatgcat tcatggtgcc gcagttgagg tggtggacct 2554861 tcaccggcgg tcccttcgct tcgggggcga cacctaacat actggtcgtc aacctaccgc 2554921 gacaccgctg ggactttgtg ccattgccgg ccactcgggg ccgctgcggc ctggaaaaat 2554981 tggtcgggca cgggcggccg cgggtcgcta ccatcccact gtgaatgatt tactgacccg 2555041 ccgactgctc accatgggcg cggccgccgc aatgctggcc gcggtgcttc tgcttactcc 2555101 catcaccgtt cccgccggct accccggtgc cgttgcaccg gccactgcag cctgccccga 2555161 cgccgaagtg gtgttcgccc gcggccgctt cgaaccgccc gggattggca cggtcggcaa 2555221 cgcattcgtc agcgcgctgc gctcgaaggt caacaagaat gtcggggtct acgcggtgaa 2555281 ataccccgcc gacaatcaga tcgatgtggg cgccaacgac atgagcgccc acattcagag 2555341 catggccaac agctgtccga atacccgcct ggtgcccggc ggttactcgc tgggcgcggc 2555401 cgtcaccgac gtggtactcg cggtgcccac ccagatgtgg ggcttcacca atcccctgcc 2555461 tcccggcagt gatgagcaca tcgccgcggt cgcgctgttc ggcaatggca gtcagtgggt 2555521 cggccccatc accaacttca gccccgccta caacgatcgg accatcgagt tgtgtcacgg 2555581 cgacgacccc gtctgccacc ctgccgaccc caacacctgg gaggccaact ggccccagca 2555641 cctcgccggg gcctatgtct cgtcgggcat ggtcaaccag gcggctgact tcgttgccgg 2555701 aaagctgcaa tagccaccta gcccgtgcgc gagtctttgc ttcacgcttt cgctaaccga 2555761 caaacgcgcg cacgatggag gggtccgtgg tcatatcaag acaagaaggg agtaggcgat 2555821 gcacgcaaaa gtcggcgact acctcgtggt gaagggcaca accacggaac ggcatgatca 2555881 acatgctgag atcatcgagg tgcgctccgc agacggctcg ccgccatacg tggtgcgttg 2555941 gctggtaaac gggcacgaga caacggtgta ccccgggtcg gacgcggtcg tcgtcaccgc 2556001 caccgagcac gcggaggccg aaaagcgcgc tgccgcgcgg gccgggcacg cggcgacata 2556061 gccggtgaaa agctctgctg gcgatgtggg gcctacaggt ctcacgtgtc gagccgcagc 2556121 acacgtgtgg cgttacgcca tagccagtcc tccaggactt cccgggggac cggcagcgca 2556181 cgcatctcgt cgcacaattg caggtaaggg cggttgatga gaaatccgcc ggtaccgtaa 2556241 acgatcttgt tgcggattgt tgtctgccca aaccgcatca gcggctccca tccagcgccc 2556301 ggtgaagcga agtacttggg acggtgcgcg gccaattcca ggtagacgtt cgggtgtttc 2556361 caggcgatca ggcatgcctg cagcacccac gggtagccgc cgtggctcat caggatcgtt 2556421 aactcaggga agcggcaggc aacgtcgtcg atgtggcggg gatggccgag atcgctgagc 2556481 cgtgtccgag tccaatcggc ggaggtgtgg atggaaacgg gcacaccaag ctcgacgcat 2556541 ttggcgtagc aagggaagta ggcggggtcg gatgcgggcc gtccaatcat gaacggacgc 2556601 aagctcaacc cgcggaaacc gtgctcgacc acccagcgct cgaactcgtc gactgccgag 2556661 tcgccggcca ggatgtcggc accggcgaag ggtaggaacc gatctggata gcgggccgcg 2556721 acggcggcca ccgaggcatt gtggacaaag gtgacaccac acgtggaccg ttcatcgaat 2556781 cccgtgatca gactgcgggt aatcccggcg tcgtccaggg agtccagtat ttggtcgtct 2556841 gtcctgcgta gcgactccgc gtaggcaccg aactgctcgg cgctgatcgt cgtcttggtg 2556901 aagacctcga aatacgacag cagctcgacg ggaaatcctt cccgaagatc gtcaatgacc 2556961 tcggcggacg gaacgaacgg tgcccacata tcgatgaccg gcacccgcgg ttcgggcgcg 2557021 gtcatggggt gctccgcggg ccaaccggac cgtgcaggaa gtcatcgaat ccggcatcgc 2557081 gctccacggc gaatgcctgc tcgaacgtcg tggcggggcg gccggtgcac gtcgcctcct 2557141 tgacatcgac tcgcttcccg gtgaggtcgc acaggacaca acggtccagc gcgccgtcgt 2557201 cggcttcctc ggtagcaatg tcgtggctca tcgctcctcc gttgactgtg tcgaccagct 2557261 gagcatgcgc tcttatgcga ttacgccaag tcaactgacc ccgccgacgc ttcgcatacc 2557321 tagtgtcggc cagggccacc tggcccgccc ggacctcccg gcccgcctgg tccgcccgga 2557381 cccccgggtc cgcctggtcg gtttacggcg gggagccaga acacgcattg attcaagtcg 2557441 gggctccaca cccagccggg cgcgcaatca tcgtcggccc ggctgttcgc cggctcgaca 2557501 agtccggtca ccgccaacac cgtcaacacc gcagtgaacg cgcaaagcgc gcagcgtcgt 2557561 agaaaatgcc tcatcgcaga cctcacggtt tgtcgtccgg cgctggacct aggttatcgc 2557621 cacgaccgcc gcggcggcag cacacgtggc gactcaccgc ggccgtagaa ccggttgagc 2557681 agcaagccac tgcgcgttgg taagagcgga tccaagcgcc ggcaacggat ggtcggcgag 2557741 ggcgctgatc gggcaacgat gcccaggcca ggcggcccca gcgaacgccg caccggctgg 2557801 aggaagatag ccccatgacc caaacgctgc gccttaccgc gctggacgag atgttcatca 2557861 ccgatgacat tgacatcgtt ccttcggtgc agatcgaggc gcgggtgtcc ggtcgtttcg 2557921 acctcgaccg gcttgccgct gccctgcgcg ccgccgtcgc caagcacgcc ctggctcggg 2557981 cgcggcttgg ccgcgccagc ctaaccgcac ggacgctgta ttgggaggta cccgaccgcg 2558041 cggatcacct cgccgtggag atcaccgatg aacccgtcgg tgaagttcgc agtcgctttt 2558101 atgcgcgggc tcccgaactg caccgaagcc cggtctttgc cgtcgcggtg gtacgcgaga 2558161 ccgtgggcga ccgcctcctg ctcaacttcc accacgcggc cttcgacggc atgggcgggc 2558221 tgcgtctgtt gctctcactg gcccgggcct atgcggacga gcctgacgag gtcggtggcc 2558281 ctccgatcga ggaagcccgc aaccttaaag gcgtcgccgg ctcccgcgac ctgttcgacg 2558341 tcctgatccg cgcccgcggc ctggcaaaac cggccatcga ccggaagcgg accacccggg 2558401 tcgccccgga tggcggctcg cccgacgggc cgcgcttcgt gttcgcccca ctcaccatcg 2558461 agagcgacga gatggcaacc gcggttgctc gtcgacccga gggggcgacg gtgaacgacc 2558521 tggcgatggc cgcgctggcg ttgacgatcc tgcagtggaa ccgcacacac gatgtcccag 2558581 ccgccgattc cgtgtcggtg aacatgccgg tgaacttccg gccgaccgcg tggtcgaccg 2558641 aggtcatctc gaactttgcc agctacctgg cgatcgtgct gcgggtcgac gaggtgaccg 2558701 atctcgagaa ggcgaccgcc atcgtcgccg ggatcaccgg accattgaag caatccggcg 2558761 ccgccgggtg ggtcgtggat ctgctcgaag ggggaaaggt gttgccggcg atgctcaagc 2558821 gccaacttca gctgcttctc cccttggtcg aagatcggtt cgtcgaaagc gtctgtctgt 2558881 ccaacctggg ccgcgtcgac gtccccgctt tcgggggcga ggccggggac accactgagg 2558941 tgtggttcag tccgacggcg gccatgagcg tcatgccgat cggggttggc ctcgtcggct 2559001 tcggaggaac gctgcgcgcc atgttccgcg gcgacgggcg aaccatcggc ggcgaggcgc 2559061 tgggccgctt cgccgcactg tatcgcgaca cactgctgac ctgagggccc ggcatgaccg 2559121 acaacgagtg cccggccgac agccgacggc gccatgtcct gcggctcgcc ctgttcgccg 2559181 ggattttgct ggggctgttc tacctggttg cggtggcacg agtcatccac gtcgacgggg 2559241 tccgtagcgc ggtcgtggtg gcgacgggtc cgatcgcacc cctggcgtac gttgtggtgt 2559301 cggccgcact cggcgcgttg ttcgtcccgg gcccgatcct cgccgccggc agcggggtgc 2559361 tgttcgggcc gctactagac acctttgtga ccctgccagc tttctcggcc ggcgcgcagg 2559421 ccggaatgac gcccaggcgc tgctgggtgt cgatcgcgcc catcgcctcg atgcacagat 2559481 cgaacggcgc ggattgtggg cggtggtcgg tcagcgcttc gtccccggca tctcggatgc 2559541 gctggcctcg tacaccttcg gggcgttcgg agttccgttg tggcagatgg tcgttgggtc 2559601 gttcatcggg tcggcgccac gggtgttcgt ctacaccgcg ctgggcgcgt cgatcaccaa 2559661 cctgtcgtcg ccgctggttt actcggcgat cgcggtgtgg tgcgtgaccg ccatcatcgg 2559721 ggcgttcgcc gcgcggcgtt ggtaccggaa gtggcgtgcg cgcccgcgcc ggcggtgcgg 2559781 cctggctcag ctcacgaccg gtagtcagca acgccacacg agtcaccgga caccggcggg 2559841 cgtcgtcatg cccggttcac tgtccgagca ccgccgtctc cgtcaagaag cgccggatcg 2559901 catcgagcat cacccgccca tcgagtagtt ccgggtcgtt gtgacccaca ccgggaacca 2559961 ccacgtatcg cttaggctcg gcggccgctg cgaccagcca ctcactaagc gtagcgggga 2560021 cgatgtcgtc gctgccgccc gcgatgacca gcaccggcgc gtgtacagag gcgatgcgct 2560081 cgatcgacgg gtagtggtcc agcagcaacc ggcgcagcgg cagccacggg tagtgcaccg 2560141 cgccgacctc ggccagcgac gtgaacggag atctcagcac gagtgccgcc ggcggccgtt 2560201 gcacggccag cccgaccgcc accgccgcgc cgagggattc gccgaaatag gcaatgcgcg 2560261 cggggtcgac gtcggactgg ccggacagcc actcctgcgc ggcccgagcg tcggcggcca 2560321 ggccctgctc agacggccga cccgggttac cgccgtagcc gcgatagtca aacagcaaca 2560381 ccgacaggcc caggccatgc agcgcgacag ccagctccgc acgcatcgac cggtcgccgg 2560441 cgttgccatt gcacaccagc accgcgggcc cactaccgcc cgaagtatgc gggaagtacc 2560501 agccacccaa gcgcattcca tcttgtgttt cgaccacgac atcgcggccg gcgggcaaaa 2560561 cggaggaagc cgatggcacc ggacccgcag acgggaagta gattagccga cgctgctgcg 2560621 accagatgaa cgtaatcacg cccgatgcca ccagcgcgac gatagcgacc accggcaacg 2560681 cgcgacacct ctttagcgac atctagcccc gcaccggtgc gacgcatcga aagcggggtc 2560741 cccgcgacca gtggattacc gaaaccaccg ttccaaacag aaaatcgaca cgaaattcaa 2560801 cgacgcggcg ggccggcgat ggccacgaga cacccacaac cagcaaccgc cccaatcatc 2560861 acgccaacca gctcagtaca ccgccgtggc gcgaacacgt gcctgaccgg tgtgtgctga 2560921 acgagtacga cccgtcccta caaattgcgg tggcgccggg tggcgccccc gaacctggcg 2560981 gcacttgccg gggagcaggt atgcactgac cgtccacgtt ctcgtagtag ccgctaggac 2561041 aggcaaacac cgaagtcggc gtcgacggag aaatggccgg gacgaagccg aaaccaactg 2561101 ccgccgcaac aacgccgacg gcaaaccgcc tccgagcaga cactgctagc cttcgatcat 2561161 cacgcttacg actccgcgtc ccagcaaagc gtaccgagta catcgccagc cgggaaggga 2561221 tatggtcccg cgactagcgg atcagcagag tgcgcagttc cagtgctctg gcaaaccaac 2561281 acgtattgct cgccgatcca acatattcgt tgaaccttga gaaaggcttg cggcgcatcg 2561341 cccagcccag cgccactgcc accacgggag gagaaatcca accgtcacca cgacaccacg 2561401 gatagcgaag atcaacaaat gccacccacg ttcgggcgca ccaaggaagc caccgtcgcg 2561461 atacttacct atcgttgcat ccgttctggc gatatttttc aactcgcatt catgcgcccc 2561521 ctccgcaaga gccgggagcg gctaatggtg gcaccgggct accatcgtca ataacacacg 2561581 acaaggtaag cgtcgtacca acaaacggcg ctggtacccg cacttgatgc caatagctgc 2561641 cgtctggata tctgattccg tcacaatatc cccacccggt aatcccacca aagccaccgc 2561701 cagggcaata tcccatcgca ctattcggca tgtgcggatc ttgtcccggc ggtggcgcgg 2561761 gttcggcgtt ggcagaaacc atagaaaatt caactgccat agtcaatgta ccgattgcga 2561821 tagcaatact atttttatac attttctcaa cacctgaatt cattcgtgtg gggaatgcag 2561881 ccttttggcc cccacatgcc cggtgtccca tcgctggcgg gccagtaggg acttcttcca 2561941 cggccggaag atcattgcgc gttggttgtg cgagcgggcg gctgacggct tcgcataatg 2562001 gcgtggacgg gctgtcatcg ttgtccctca gcgctacaac aagtcaggga aactcttcac 2562061 aggcggtgcc gtcgtcgccg tggtcgaggc caagacggta acccggctca ccccatagag 2562121 cggggccacc cccgcgtccc gccttgcagt tctggtagta ccggaaccac gcgggtatcg 2562181 gcgttggggc tgcatgagcc acaggtggcg ccacatcgcc gaccgcgatc acagctgcga 2562241 ggaccggtgg acgctgcatg atgagcccta cgtgtagtac cagacggctt tggttgtgac 2562301 tggctggtca gtcgcgtaaa ccgtggacct ggctactgct gaaagtacca tgacgcgggg 2562361 caacgaaaca gcagcaacgt cgacagacag cggaactgtc ggctaccgcc gataacgttg 2562421 tgtcatgcgt gcggacatgt ccgtcacctc gatgctcgac cgagaggtct acgtatacgc 2562481 cgaggtcgat aagctgatcg gcctccccgc cggcaccgcg aagcggtgga tcaacggcta 2562541 cgagcgtggc gtcaaagatc acccgccgat cctccgcgtc acgccgggag ctacgccgtg 2562601 ggttacgtgg ggcgagttcg tcgagactcg catgcttgct gaataccgcg accgccggaa 2562661 agtgccaata gtgcggcagc gcgcagcgat tgaagaactg cgtgcgcggt tcaatctccg 2562721 atacccgctg gcacatctgc ggccgttctt gtcaacgcac gagcgggatc tgacgatggg 2562781 cggcgaggag attggtctgc cggatgcgga agtgacgatc cgtactgggc aagcgttgct 2562841 tggtgatgcc cggtggctcg ccagcatcgc gacacccggt cgggatgagg ttggcgaagc 2562901 cgtgatcgtc gaactgcccg tcgacaaggc ctttcccgaa atcgtcatca acccaagccg 2562961 atatagcggg cagcccacgt tcgttgggcg tcgtgtgtcg ccggtgacga tcgcccaaat 2563021 ggtagacggc ggtgaggaac gcgaggacct ggccgccgac tacggtctca gcctgaagca 2563081 gattcaagac gcaatcgact acaccaagaa gtacaggctg gcccgactgg tggcggcata 2563141 aggcccggcg atgctcgaag tcgacaaagt cacccatgtt gtcgatgaaa acctgcttcg 2563201 gcttggtgtg gccttgtcgc cgtcagaaaa gacacggccc ggtttggccg cccgcccgtc 2563261 gacgacctgc taccgcaagg catcctcgac accgactgga tccccatcgt cgggggtcgg 2563321 gtgggtggtc atcagcaacg acaggcatct ccggacgcgg ccagtggagg ccgagctggc 2563381 ggtcgcccac aagctcaaag tcgtgcactt gcatggccgt gtgggcggac tagtccgcgt 2563441 gggcacagct gacgcggctg gctgcgcggt ggccggccat tgagcaccaa tatgagaagg 2563501 caccggaagg gccttggtgg ttgtcggtgc ggaggagcag gaccgccgta atggagttcg 2563561 cgcccggcgc cgtcgacacc atagcgtcgg acaacatggc tgcccaaaat gtccacgata 2563621 cggctgtgaa gacctcgagg tgatggccga aaggtgacca cctcgcagtg gtaggacgac 2563681 agcgacccga tcgaaggcaa tgccgccgca tcgagcgcga ctttgggcat gacaggattt 2563741 cgagtaagcg catcaacgtg tccgaaatgt ggggcgggcg gggctcgaac ccgcgaccaa 2563801 cggattatga gtccgcggct ctaaccaact gagctaccgc cccttgtgct aactagctgc 2563861 agatatgttc tccaccgcga ctgaatcagg gtcggaatac cgcagtgatg ccgcagcact 2563921 cttgatggcc tgcacaagca aacctgccac gcccgccagg tcgtcgctga gcaggtggcc 2563981 atggcggtcc aaggtcatgg ccgctgtggc gtgtccgaga agcctctgca cgactttgac 2564041 attagcgccc gcactgatcg ccagcgacgc cgtggtgtgc ctcagcccgt gcgggaccag 2564101 gtcggcaatg ccaaccgcct tgcatccctt gtcgaaggct ctgcggtact cctcgatagg 2564161 taggtgcccg ccgcggtagc ttgggaacac gagggcattg ggctcggttg gcagttcatc 2564221 acgcaggcgc tccgataccg gctcggggac aggcacgtga cgcacccggt tggtcgtcgt 2564281 ctcgacaatc ccggcgccgg tcacacagat gagcgaatcg tcaactcccg gtccccacgt 2564341 tcttgcgacg cagggccgct gcctcgccga agcgcagtcc gcagtagccg agaaccaggg 2564401 tcagcgtcag taccgcaaca atccgtggta cgaatccgtt actcacccac tcccccgcgc 2564461 tcggtcttgg cagcttccgc ctctacccac gcatccaggt cggcgatatc gcaaaacgtg 2564521 tgtcggccaa ggcgatagct gcgcggactg gtgcccagta acgccagtac ctcagcgtcg 2564581 actcaggtag gccgccgagg tattcggctg cggccttggt gcccagccga accacagatg 2564641 tcgccgtcat cgctctactt cctgtcgtcg ctcaacgcgc ttatgtccca atccctttgg 2564701 cagtcccagg gccgaccgca aaatcctttc cattgaccgc acagtaacca ttagcccgat 2564761 ggcatctaac aaccgaagaa cgccgagaag tcgacaccaa gatcccgatg tttgccgtag 2564821 caggaacggc ggtcacactc gggctgattc gagccagtcg tacatgtcgc gccgcgtcca 2564881 gcgccgatcc cgcccgccga ggcgcacata atggggtccc ggtgcgtcaa ttccgcgctc 2564941 tcgctttgca gcccagccgt gcagggtcga cactggcaca ccggtgatct cccgcaccta 2565001 gatggtggta agcatctccg cgtggctttc gttgtcttcc atcatgtgct ttggccacca 2565061 gtagcgacga catcaccata aatcgacacc ctccgttgaa ttgcgccgta aatcgccacg 2565121 acgaaagccg acggtctccg ctgcgccggg gcctactcgc caacggccta agagagaggc 2565181 aagctggggc attattcgaa cgttacaaaa gccagttcga ttcattcgga tatatcgaga 2565241 aggtgcggta tcggggctca gggtatcgag tcgaagacgt ttatgcccga gcggacagtg 2565301 gacctagcgc cggtgctgag cttcctgtcg gcccatgagc ggcggcgcgg ccgcacgctg 2565361 gcccccagct acgcgctggt gggcgccacg agcacgaccg cgtcgagctg ccgcgcgagg 2565421 ttcatcaggc gctaaggcag gtggtggctg cgctgcacgc cggcaaggcg gtgaccatcg 2565481 cgccgcagag catgacgctg accacccagc aggccgccga ccttctcggg gtgagtcgtc 2565541 cgaccgtggt gcgtctgatc aagagcggcg agctggccgc cgagcgcatc gggaatcgcc 2565601 accggctcgt gctcgacgac gtgttggcct accgggaggc ccgccggcag cgccagtacg 2565661 acgcgcttgc cgagagcgca atggacatcg acgccgacga ggatcccgag gtgatttgcg 2565721 agcagttgcg tgaggcgcgg cgtgttgtcg ccgcgcgccg tagaactgag cggcggcgcg 2565781 cctgagacca tcgctgcatg ctcgacacgt cgctgctgtg gtcaagccgg cagcgcgact 2565841 ttctgttgtc gttggcgacg tcgccgcgaa ctacgacggg cgggtggtgg tggcgccgac 2565901 aggccaggcc gtcgacgtcg cggtacgtga aggcgccggc gatgtcggct acagcgtcga 2565961 gcgagagaat cttccggccg acgatccggt gcgcaacggc aaccgctggc gggtcatcgc 2566021 ggtcgacacc gaacaccacc ggatcgccgc ccgccgcctg ggcgacggcg cacgcgccgc 2566081 cttcagcggc gactacctgc acgagcacat cacccacgga tatgccatca ccgtccacgc 2566141 cagccagggc accacggctc actccaccca cgctgtgctg ggcgacaaca ccagccgagc 2566201 aacgctgtac gtggcaatga cgccggcacg cgagtcgaac accgcttacc tatgcgagcg 2566261 aacggcgggc gaaggcgcgc gagtggatct cgccggatgg gacctttggg tgagtgggaa 2566321 agctgaggca atgagtgacg agaaatccgc atcgccagtt tggtgccgtg tcggagctcg 2566381 gtgcgatcat cggggaaagc gttcctgctg gtgagggcag aattgttgtg cacgtcgtgc 2566441 gctataccgt ggtgacgact cgccgaagca tggactaagg aggtagctgc gatgatgaag 2566501 gagatcgagc tccatctggt tgacgctgcc gcccccagcg gcgagattgc gatcaaggac 2566561 ctagccgccc tcgcgactgc tctgcaggaa ttgacgactc gaatcagccg cgacccaatc 2566621 aacacgcccg ggcctggtcg cacaaaacag tttatggaag agctctcgca actggccagc 2566681 gcccccgggc cagacatcga cggcgggatc gacctaactg acgatgaatt ccaggcgttt 2566741 cttcaggcgg cgcgttcgtg aatcaagtag cggcgacggt ggtcgacacc gacgtcttca 2566801 gcctgatcta agacaccgac tcgcgtgacc tcggctgccg cgcccagacg ccgtggcact 2566861 tctgccgttc ggtcgccgcc tggctggccg gagtcatcac agcacgctcc aatagcgcct 2566921 catggaatca gccggcgccc tcgaatcgag ccttacccgc ccgaaacgac acgcctcgac 2566981 ggtacctggc gcgctgacct ggccctacat cagtcaacgt atacgaacca cagcgtcgcg 2567041 gagctgccag accgccgtca accgaacacc gtctgaccgt caagcccaat gcgataccgt 2567101 tcggtgccct gctgcaccct gggcgcatca gcacccaacg acactgcaac cttgttgctg 2567161 gcgttgcgca tgatgtcaaa ggtcagctcg acggcctcgt cgtccgagaa ccgggagcgc 2567221 acctcgacgg caacgtcgac ggcgaggtgc gcaggggtcc aaattaacgc atctgcatac 2567281 ctcagggcgg cttttgcgcg aacgtcgagc aagaccgagg tatcgaaacg ctcgatctcg 2567341 ccatacaacg tctccgaacc gcccgcatca agcgcggaga cctcccgcaa cgacttgcac 2567401 acccggcaat tgtgctgcgc agctccacgc agccgcacca gctcagaggt gaccgggtcc 2567461 agtgcccgca tccgggccac cgccggcaga aatccgttga acaccgcagc ggacagatcg 2567521 gtgttgtgat cccaggagat cggcccggtt acccagccca gatactcctt gccgacgccc 2567581 aatgcttcca acccggcgcg cacccgcggc acaaagtcgg cgatgtacat cgccacaacc 2567641 gcaccgaaag cgtcttcccc caaatgcgtc cacagcaggg atcgctgctc gccggtgatc 2567701 gctgagacat cgacgctgaa ctgctcggcg aactcggcaa cgacggcctc ggccggcgac 2567761 tccggctcgt tcaccgcaac ctcacacggc aacgacggta gcgacagcgc ccgcgcgcac 2567821 acctgcctca ccagccccgc aatccggccg tcgcccggcg atagcgccac caaccgacac 2567881 agatcgtcac gaaccgaaac cggggccggc acccttcaca cgctactgcg cctggctcac 2567941 cgaggacatg tggaagtcgg gaatccgcag cggcggcatc gcggtacggg taacccaatc 2568001 tgaccattcg cgcggcagtg tcggctcgct gacacctgct tcggtggccc gtcgcagcag 2568061 gtccagtggg ctttcgttaa accggaagtt gttgaccgcc gcgctgacct cgccgtcttc 2568121 gaccaggtag acgccgtcgc gggtcagccc ggtgagcagc agcgtggtcg ggtcgacctc 2568181 gcggatgtac cacagcgtgg tcagcaacag tccgcgctcg gtgcccgcga tcatgtcggc 2568241 gagatcggcc gacccgccgg tcatgatcaa gttgtcggcg gcgaccgcaa ctggggcgtc 2568301 gaatttggcg gcagtggccc gtggatacgc cagcgcattg atcacaccgc tgcggatcca 2568361 gtccacctgg ctgatttcca tgccgttgtc gaacaccgat tgcgtctccg aggagttgct 2568421 caccgccaca aacggcgtac acgccagacc cggcgcagcc ggatcggtga acaacgtcag 2568481 cggcagctcg gtcaaccgct ctcccacccg ggttccaccg ccaggagccg agaaagcggt 2568541 tcggccctcc tgcgcgccgc gcccggccat cgaccaaccc aggtagatca tcatgtcggc 2568601 caccgtcgac ggaggcatga tggtctggta gcgcccggcc ggcagctcga cggtgcgttg 2568661 cgcccaccgc agccgcgtcg acagccgctc gagcatcaga tcgatgggca cctcgacgaa 2568721 atcgggtgtg ccgatcccca cccaagcgct ggcgtcgccg cgtttggcgt tgatctcgat 2568781 cgccccggtg ggctgggtgt agcggcggcg cagacccgtc gacgatgcca gaaacgtcgt 2568841 ggacacactg cggtgcgcgt agccgtacaa gcggtcggcc ccgcggaagc ccctgctcag 2568901 tgagccggcg ataccggtga aaacccctgc cccggtgccc ggaaccgggg catcccagtc 2568961 gtcgggctct ccggtatcgg caagcagcgg cgcggcatca ccggcctccg gcgcggagcg 2569021 ggccgcgtcc tgggaggaca ccaccagacc gggcagcacc gacgggtcca cttcggcgga 2569081 gaccacggag ccgacgaagg cgctatctcc ccgtcggacg atcgaaatca cggtgacgtt 2569141 tcggctgtgg gaaacgccgt tggtggtcat cgaattgccc gcccaacgca gtgtcgcctc 2569201 gaccttttcg gtgaccagca ccatggtctc gtccgcccgg ccagacctgg ccgcttcctt 2569261 taaaacgatg ttgacggcgt gctgcggctc gatcatcgac caccttcagt acgagtattg 2569321 agcacattga cgccccggaa caacgccgac ggacagccat ggctgaccgc ggcaacctgg 2569381 ccgggctggg ccttgccgca gttgatggct ccgcccattc gccaggtcga cggcccgccc 2569441 acggcttcca tggcattcca gaaatcggtg gtgctcgatt gataggcgac atcacgcagc 2569501 tgcccgtaca gctggccacc tcggatgcgg aagaaacgct ggccggtgaa ctgaaagttg 2569561 tagcgctgca tgtcgatcga ccatgacttg tcgccgacaa tatagatccc gtcgtcgacc 2569621 cggccgatca ggtccgcggt gctgaggtct tcgatgcccg gctgcagcga tatgttggcc 2569681 atccgctgga tcggcacgtg atgtggcgag tcggcatacg agcagccgtt ggaacgtggc 2569741 tcccccaacc gtggggcgaa cgcccggtcg agctggtaac caacgaacac cccgtcacgc 2569801 actagatccc agctctgcgc ggccactccc tcgtcgtcgt aaccgacggt ggccaagccg 2569861 aattcggcgg tacggtcggc ggtcacgttc atcaccggcg agccgtagcg cagggtgccg 2569921 agtttgtctg gggtggcaaa cgatgtcccg gcataggcag cctcgtagcc gatggcacgg 2569981 tcgtattcgg ttgcgtggcc gatggattcg tgaatagtca gccataggtt agtggggtcg 2570041 atcaccaggt cggtgggccc cggcatcacg ctaggcgctc ggaccttctc ggccaacagc 2570101 gatggcagct gcgcgagctc gtcggtccag ttccagatct cgtcgccggc caccacttcc 2570161 cagccccggg cggtcggcgg agccaacgtc cgcatcgatt cgaagttgcc cgccgcggaa 2570221 tcaacagcaa ccgcatccag gcacggcagc agccgcaccc gctgttgggt aatcgatgac 2570281 ccgaaggtgt cggcgtagaa ggtctgctcc ttgacggcgt tcaagctggc cgatacgtgg 2570341 tcgatgccgt cggcgtccag taaccgcccg gagtagtcgc gcagcacggc gatcttctcg 2570401 gaggcgggaa cgccgaacgg atcgatccgg tagttcgaga cccactccgc gtcggtgtat 2570461 acgggctcgg gcgccaatct gacccgctcg gtgttcagcg ccgccagcac ggtagccacg 2570521 tgtaccgcat ggcgagcggt cgcggccgcg acgtcgggtg ccaactcagc atgggaggcg 2570581 aatccccacg tgcccgcgac gattacccgg acggccaggc cgagctcacg gctgatcacc 2570641 gcggtctcca gctcaccgtc acgcagttgg atgatctcgg tgctaatgcg gtgaacccgc 2570701 aggtcggcgt ggctggcccc ggccgtggcg gccgccgaca atgcggcgtc ggccaactgc 2570761 tggcgcggca ggtccaggaa gtcttcatcg atcccccggt tcggtgtcac gactccaccg 2570821 taacgaccag ctttaataca cccatgcgcg acgcgccacg tcggaggacg gcactggcat 2570881 atgccctgct ggcgcccagc ctggtgggcg tggtcgcctt cttgttgctg cccatcctgg 2570941 tggtggtatg gctgagcctg caccggtggg acttgctggg cccactgcgc tacgtcggcc 2571001 tgaccaactg gcggtcggtg ctgaccgatt ccggcttcgc agactcattg gtggtcaccg 2571061 ccgtcttcgt ggcgatcgtg gtcccggcgc agacagtact gggactgctg gccgcgtccc 2571121 tgctggcccg gcgactgccg ggcaccggcc tgttccgcac gctgtacgtg ctgccctgga 2571181 tctgtgcacc gctggcgatc gcggtgatgt ggcgctggat tctggcgccc accgacggcg 2571241 cgatcagcac tgtgctcgga caccgcatcg aatggctcac cgatccaggc ctcgcgcttc 2571301 ctgtggtttc ggccgtcgtg gtgtggacca acgtcggata tgtctcgttg ttcttcctag 2571361 ccggattaat ggcgattccg caggacattc acaacgccgc acgcaccgac ggcgccagtg 2571421 cctggcagcg cttctggcgc atcaccctgc ccatgttgcg gcccaccatg ttcttcgtcc 2571481 tggttaccgg aatcatcagc gccgcacagg ttttcgacac cgtctacgcg ctgactggcg 2571541 gtgggccgca gggcagcacc gacctggtgg cccaccgcat ctacgccgag gcgtttgggg 2571601 ccgcggcaat cgggcgggca tcggtgatgg cggtggtgct gttcgtcatc ctggtcggtg 2571661 ccaccgtggt gcagcatctg tatttccggc ggcggatcag ctatgagctc acctagtcgc 2571721 gtctccaaca ctgcggtcta cgcggtgctg acgatcggcg cggtaatcac gctgtccccc 2571781 ttcttgcttg gcctgttgac ctcgttcact tccgcacacc agttcgcgac gggtactccg 2571841 ctgcagttgc cgcgaccgcc cacgctggcc aactacgccg atatcgccga tgccggattt 2571901 cgccgcgcgg cggtggtgac cgcgttgatg acggcggtga tcctgctggg ccagctgaca 2571961 ttttcggtgc tggccgccta cgcgttcgcg cggttgcaat ttcggggacg tgatgcgttg 2572021 ttctgggtct acgtcgcaac cttgatggtg ccggggacgg tgaccgtggt gccgctgtat 2572081 ctgatgatgg cccagctagg cctgcgcaac acgttctggg cgttggtgct cccgtttatg 2572141 ttcggttcgc cgtacgcgat tttcctgcta cgcgagcact ttcgcctcat cccagatgac 2572201 ttgatcaatg ccgcgcgcct cgacggtgcc aacactttgg acgtgatcgt gcatgtggtg 2572261 atcccaagca gccggccggt cctggccgcc ttggcgatga tcaccgtggt ctcgcagtgg 2572321 aacaacttca tgtggccgtt ggtgatcacc agcggccaca aatggcgtgt cctaacggtg 2572381 gcgacggctg acctgcagtc gcggttcaac gaccagtgga cgctggtgat ggcggcgacc 2572441 acggtggcaa tcgtgccgct gattgcgctc ttcgtgacct tccagcggca catcgtcgca 2572501 tcgattgtgg tctcggggct caagtgaccc ggccccgcca gtccacgctg gtcgccaccg 2572561 cccttgtgct ggtggcgatc ctgctgggtg tgacggcggt gctattgggg ctctccgccg 2572621 aaccgcgtgg cggaaagatc gtcgtaacgg tgcgactctg ggacgagccg attgctgcgg 2572681 cgtatcgaca gtcgtttgcg gcattcaccc gcagccatcc cgatatcgag gtgcgcacca 2572741 atctggtggc ctattcgacc tacttcgaaa ccctgcgcac cgacgtggct ggcggcagcg 2572801 cggacgacat cttctggcta tccaacgcct acttcgccgc ctacgctgac agtggccggc 2572861 taatgaagat tcagaccgat gccgccgact gggagccggc ggtggttgac cagttcactc 2572921 ggtccggcgt cttgtggggt gtgccgcaac tgacggacgc cggaattgcc gtgttctaca 2572981 acgccgatct gctggctgcc gccggtgtcg accccacgca ggtggacaac ttgcgatgga 2573041 gtcgcggcga tgacgacacc ttgcgcccga tgctggctag gctcaccgtc gacgccgatg 2573101 gacgcaccgc caacacgcca ggattcgatg ctcggcgggt ccgccagtgg ggatacaacg 2573161 ccgccaacga tcctcaggcc atctacctta actacatcgg ctcggccggc ggtgtgttcc 2573221 agcgcgacgg caagttcgcg ttcgataacc ccggcgccat cgaagccttc cgctatctgg 2573281 tcggcctgat caacgacgac cacgtcgcac cgccggcctc ggacaccaac gacaacggcg 2573341 atttctcccg taaccagttt ctggctggca agatggcgct attccagtcc ggcacctaca 2573401 gtttggcgcc ggtagcccgt gacgccctct tccactgggg tgtggcgatg cttcccgccg 2573461 gccccgcagg ccgggtaagc gtcaccaatg gtattgctgc agctggtaat tcggcgtcca 2573521 aacatccgga tgcggtgcgt caggtgctgg cctggatggg cagcacggag ggcaactcct 2573581 acgtgggccg ccacggtgcg gccatccccg cggtgttgtc tgcgcaaccg gtctacttcg 2573641 actactggtc tgctaggggc gtcgatgtca cgccgttctt cgcggtgttg aacggtccgc 2573701 gcattgcggc ccccggcggc gccggcttcg ccgccggaca gcaggccctc gaaccctact 2573761 tcgacgaaat gttcctcggc cgtggcgatg tcacgacaac cctgaggcag gcacaggcgg 2573821 cggccaatgc tgccacacag cgctagttgc gatctagccc ggtagtacta gcacggggac 2573881 cgggctgtag cgaatgatct tgccactcca ggagccgagg aatactctcg cgacatcacc 2573941 gaacggcgag gtgcccaagg ccaggatctc cccgtcctgc cagtccgcag cgtccagcgc 2574001 ctgcgcccag ccgttcccgg tgaccacttg cagcacaacg tcttcactca cgacgccgtt 2574061 aattcttagt ttttccaaca gttctcgcgc ttgcgccgcc catgcctcca gaaccgaagc 2574121 ctcggcatgc agccccactt cgggcggata catggtccgg ccgcggaccg cgaatgtgat 2574181 cacccgcatc ggcacgccat accggctggc caggtggccg catcgcctca ccacgtcgac 2574241 cgaacccgac gtcgcggagt agccgcagct gagccgtgtc aaccggtcgg tgtagcaacg 2574301 gtagcggcgg ggggtgatcg ccaccggtac cggcgacgaa tgcagcagcc ggtcggcggt 2574361 cgagccgatc aacacccgcg cgcgccgccc gctgggaaac gaccccagca ccagcacctc 2574421 ggcttcgagt tcctcgacga cgtcgagcag accagccgac accgatcggt gtgcgcggtg 2574481 gtggtagctg acctcgatcc cgtcggccag tctgcgcagg tagcgctggg cctctcgcgc 2574541 ggaggcggca gccagctgct cagaccagag ctcgtactcg gcgtcgacgc gggcgagcga 2574601 cggtgtcggc cagtgcctgc gcacgatggt ggccactgtg agcgacgtct tgtgcatccg 2574661 cgcgacgcgg acggctagat gtaatgcgga cggaccgacc ttgccagcca aatacccgac 2574721 gacgatggtc acggcacttc ctcgttgagc gcactgtggt gccgacccca catcaggtaa 2574781 aagatcactg ccaccgccac ccatccgctg aacgccagcc aggtgtacca gtgcaagctg 2574841 gccaggatat acccgcaggc cagcaccgaa agaacaggcg tcacagggta accgggtacc 2574901 ttgaaccctc ggggtaagtc gggctcgcgc acccgtagaa cgatcacacc cacagccacc 2574961 acgctgaacg cggtgagcgt gccgatggac accatgtccg ccaagctatc cagcggtatg 2575021 aaggcggcca gcgtcgatgc gaagatcgcg acgatcaccg tgttgtgcac cggcgtcatg 2575081 gtgcgcggat tcaccttcgc gaaccgcgcc ggcagcagcc cgtcgcgccc catcgcgaac 2575141 aggatccggg tctggccgta catggtgacc agcgtgacgg tgaaaatcga gaccaccgca 2575201 ccggcggcca gaatcgtgct ggcccattcg ccatgcgtga cgttgtccaa gatgatggcc 2575261 agcccggcgg tttcctgctc tgcgaagtcc tgccacggtt gggtgcccag cgcggccagt 2575321 gcgaccagca cgtagacacc ggtgacgacc accagcgctg cgatcagcgc acgcggcatg 2575381 gtcttctgcg ggtccttcac ctcgtcgccg gcggtcgaca ccgcgtcaag gccgatgtat 2575441 gagaagaaga tcgtgcccgc cgcggagccg atgccggcga cgccgaatgg gacgaaatcc 2575501 ttgaggtggt cggcgctgta cgcgctgaac gcgatgatca tgaacatgcc cagcacgccg 2575561 agcttgatca gcaccatgat cgcgttgacc ctcgccgact cgctggcccc tcgaatcaac 2575621 agcagcgcgc atagcccgat caggatgacg gcgggcaggt tcacccaacc gggatgggtg 2575681 tcccacggcg ccgccgacaa tacgtgcggc atctgaaatc cgaacagatt actcagcagc 2575741 ttgttcacgt agccactcca gccgaccgcg accgctgcgg tggctacccc gtattccagc 2575801 agtaggcagg ccgccaccac catcgcgacc gcctcgccca gcgtcgtgta cgcgtaggag 2575861 tacgccgacc cggaaatcgg cacggcggaa gccagttccg cgtagcagat agccgcgagc 2575921 ccagcggcga tgccggcgat gatgaacgaa acaatcacgc ccgggccggc ctctggaact 2575981 gcctgggcaa gcacgaaaaa gatgccggta cctatcgtcg cgccaacccc gaacatggtc 2576041 agctggaagg tgccgaaact ccgcttgagg ttccccgatg ccccggatgc gaccggggcg 2576101 ccgctcaccg ggcggcgccg cagcatcagt tctcgaaggc tcatcgacgt tgtcggcaat 2576161 tatgaacccg cctcccatag cgcgtcggcg aaccggcgaa ccgcgcagtc gatctcctgc 2576221 gcggtgatca ctaacggcgg cgcgaaccgc agggcggcgc cgtaggtgtc ttttaacagc 2576281 acaccgcgat cggccaaccg catgctcatg tctgtgccaa tggcaagcgc ccgttcgatg 2576341 tcgacgtcag cccaccatcc gaggccgcgc agggccaccg caccatcgcc gatcaggtcc 2576401 gccaggcgct gatgcagatg cgcacccaat ttagcggagc gagcttgaca ttctccccag 2576461 acgaccatgg aaaccacggg ggtaccgatc gcggcggcca acggattgcc gccgaacgtc 2576521 gacccgtgtt cgccgggatg caccacgccg aagatttcgc ggtccgcgac catcgccgac 2576581 aacggaaccg caccgccacc aagtgtcttg ccgagcaggt aaatgtctgg cagcacaccc 2576641 ccgtggtcgc aggcgaacgg gtaacccgta caggccagcc ccgattggat ttcgtcggcg 2576701 atcatcagca cgttgtgctc gacgcagccg gcaggtagtc gtcggccggg acgatgatgc 2576761 ccgcctggcc gggaatcggc tcgagcaggt cagcgacggt gttgtcgtcg attgtctgcg 2576821 ccggtgccgc agcatcgcca aacggtaccg agcggagtcc cggggtagaa ggttcgacgc 2576881 cgctgcccgc agccgggtcc gacgagaagc tgacgacact gctggtgtgg ccatgaaagt 2576941 tgttgtttgc caaaatgata tcgtgccggc ccgcggggag gccgttgacg tcggctcccc 2577001 acttgcgggc gaccctaaga ccgctctcca ccgcttcagc atcagagttc attggcaaca 2577061 ccacgtcttt gccgcacagc tgggcaagcg cggcgcccaa cggcccgagt cggtcggcat 2577121 gcaaggcccg attcagcagg gtgacggtgt cgacttgggc atgagccgtg gcggtgctcg 2577181 cggggttgcg atggccaagg ttgaccgccg agtacgcagc cagccagtcc aggtagcgca 2577241 ggccgtcgat atcggcgatc cacgcaccct cagcgctggc cgccaccaca ggcagcggcg 2577301 aataattgtg cgctgcatgc ctttcgacca gtgccatagt ggcctgagtg gcatccgcga 2577361 gatttgtcat gggtgtatct ccagcgtgca gcacttgacg gaaccgccgc ccttgagcag 2577421 ctcggacaga tcgacaccga ccggctcgaa gccggctgcg cgtaactgcg ccgcaaaacc 2577481 catggccgcg accggaagca ctacgttcag accgtcagag acggcgttga gtccgaacac 2577541 gaacgcgtcg gcactgccga ccacaatcgc gtcggggaac agcgccgaca actgttcctg 2577601 cgctgccgta ctgaacgccg gcgggtagta ggcgatcgtg tggtcgtcga gcacggccag 2577661 cgcggtgtcc aggtgataga accgtgggtc gaccaactcg agggagacca ccggcagacc 2577721 aagcaccgcg gcgatttcgg cgtgtgcgcg ctggtctgtg cgaaagccgt agcccgccaa 2577781 caccctttcg ccaaccatca gcaggtcgcc ctgtccctcg ttgacgtggc gggtggtcac 2577841 cgggcgatat ccgaccgagg acatccagct ggcataggct ctagactcac cagctcgttc 2577901 ggggaaccgg aaccgggcga ccacggcgat gtcgtgcgtg atgaacccac cgttggcggt 2577961 gtacaccatg tccggtaacc cggaaatggg ctcgatcaga tccacgctgt ggcctagccg 2578021 aagataggtc tggtggaggt gctcccactg tgcttgcgcg acttggacgt cgactggcgc 2578081 ggtgacgtcc atccaggggt tgatcgcgta tgcgacggca aagaaggccg gcggggtcat 2578141 tgcataccgc cgcgtccggg gggtgcggcg tgcaggtgac cctagacggg cagcagcgac 2578201 gtaggaatcc gtcataaacc aacgatattt ggctctgatt tcacaatcaa acgatggtcg 2578261 ttgcgtattt tccattgata cattgcgtta acctcgaatc tgtggtgatt cgttgcgtgc 2578321 ttagaacgga ggagggccga tggaccgcct ggatgacacc gacgaacgca tcctcgccga 2578381 gctggccgag catgcacggg ccaccttcgc cgagatcggt cacaaggtga gtttgtccgc 2578441 tccggcggtg aagcgccgcg tcgaccggat gctcgagagc ggcgtcatca agggcttcac 2578501 cacggtggtc gaccgcaacg cgctcggctg gaacaccgag gcttacgtgc agatcttctg 2578561 ccacggcagg attgcgcctg atcagctgcg tgccgcctgg gtgaatatcc ccgaggtggt 2578621 cagcgcggca acggtgactg gcacgtccga cgcgatcctg cacgtgctcg ctcatgacat 2578681 gcggcatctg gaggccgccc tcgagcgcat ccggtccagc gctgacgtcg aacgcagcga 2578741 aagcaccgtc gtgctgtcaa acctcatcga ccgcatgccg ccctagtgtt ccgcgccaat 2578801 gctagaaaag gcctgctgag ctacgtagac gcagcatgag caggtcctcg cgccgccaac 2578861 ccgcgaaacg gcgcgtctgt acaccgacac gccgttaggg cgcgcgccca cgcccagcta 2578921 tcgcccaagc tcaccatcgc gttgggcggc ggcggtggcg gccaacatcg gggctatagc 2578981 ggctggccgg tccgcgcgcc gcccgcgccg ccacctagga gtgcaatatc aggctctcta 2579041 tcgccaccgc tgtcccgctg gccatggcag tgatcgcaag cgtcacccag tcggcaagtt 2579101 tgggtcgccc aggatgcgct gacagctggc cggtgccgcc acgggcggtg atcgcgtcgc 2579161 ccatctcgtc ggcacggcgc agggtcaccg tgatggcagc ggcaagcagg tcgatcagct 2579221 cgcgcgcatg ccgctggcgc cgagccttgc ggcttggcgg catccgcttg ggccgcagcc 2579281 ggcgcgcggc gtagagcacc tggaattcgt cgatcaacat cgggaaggcg cgcagcgcga 2579341 gcgccaacgc caccgcccat tcgtcgaccg ggatccgcaa cacccgaaac ggccgaccca 2579401 aagtggctac cgcagggctg atttcggcaa cattggtggt ccaggacacc atcgccccca 2579461 gcgccaggag cacaaccgac agcgcggtga tccgcaggaa gtgcagtgcg ccgcccaatc 2579521 cgagctgcac tccgcccacg gcgaccactg gagtgccacc ggctagcgca gcggtcagaa 2579581 agccgatcgc gaggacgatc cacagccagc gaggtaccga cggcagcgcg ccgcgcggaa 2579641 tgtgcgcgat tcgggccgcg gccagcacca aagccgccat catcccgatc gtcacccatc 2579701 ccgggtagaa cgtcagcaac accgaaatgc cgaaaaccac caataatttg gtgccggccc 2579761 acaggtcgtg gatgaccgag ctacccggca ccggaatcaa cagcacaatc ggacgtgacg 2579821 ggcgacgagt cccgttgcgt gccggggccg aagttgtggt catgacattc cccccgcctc 2579881 cgacgccgcc gccgattcca gcacaccgtc gcgcagatgc agggtacgcg ggcaaagctc 2579941 ctccatcccc gcgaagtcgt gcgaaactac gaccaccgtc aggccgcgcg cccgacgcaa 2580001 gtcttccagc agccgcagca ggccgcgctg gctggccgcg tccaaccccg ccaacggctc 2580061 atcgaggatc aacgcccggg gtgcacgcgc aagcagcccg gccagcacca cccgacgcat 2580121 ctggcccccg ctgagctggt cgattcgtcg cgcgcccagc gcggggtcca acccaacgac 2580181 agtcagcgcc gcagccaccc ggtcctgctc gctagccgaa aaacctgctg cggaagcaac 2580241 ttccaggtct acacggctgc gcatcagctg cagccgggcc gcctgaaaag acaacgccac 2580301 cgcgccgacc tgctcgtggg tgggccgacc gtcaagtagg caggctccgg tcgtggggat 2580361 cgtcagcccg gccatgatcc acgccagcgt cgacttcccc gagccgttgc cgccgtggat 2580421 cagcaccccg tctccctgct caacaacgaa gttgatatcg cgcaacgcgg tctttgccca 2580481 cggggtgccg ctagcgtatt cgtggccgac gcccaccagt tcgagcgccg gcgcgtgctg 2580541 gggctgatcc accccgatga ccggggccgg catcgcggcg gtgtggacca tatcggtgtt 2580601 atccggcgaa tcgctcaggc tgagcgtgcg gtcggcggaa tcggcttcgt tgtcgtagtg 2580661 cgtgatgtgc accaaggcgg tccggtgccg ctgcgtcaga cccgacagca cggccagcaa 2580721 agcgtccctg ccctgctggt caaccatggt ggtgacctcg tcggcgatga gcatcgccgg 2580781 ctcccgggcc agcgctgccg ccagcgccag gcgctgcagc tcaccaccgg acaggcttcc 2580841 ggtgtcgcgt tcggcaagcg cttccaagcc gacctcgctc agcaaccggc caacgtcagc 2580901 ggtggtaccc agcggcagcc cccacaccac gtcgtcggca acccgggtgc ccaggacctg 2580961 gctttccgga tgctgcaaga cgacagcggt gccgcccagc tttcccaaac ccaccgtgcc 2581021 cggacgatcc acggtgcccg acgtcggtgc ccggccggcc agtatcagca tcaaggtggt 2581081 cttccctgat ccgttggccc cgatgatcgc taggtgctcg ccggcccgga cgtcgaggct 2581141 gacctcccgc agcgcatctt ggccggcgcg ggggtaacgg aagcggacct tgtccaaccg 2581201 caccggcacc ggcccgatca gagcgtccac gtcatctcct ggcggtgggt caagtttgtg 2581261 tacatcgggg attccgcgca tccgctccag caggcgcgac aacgcccacc acccaatcag 2581321 cgacacgatc atgatcccaa tgttgaaata gcccagcagc acccacggcc agtactgcag 2581381 tccctcggcg aaataccgct tgacgtcggc ggctgccccc tgcatgtgca tccgggccaa 2581441 ggtggcggcg ataccgtcca cgtttgcggt catgaccttg aaaatcagat gccgcagtcg 2581501 gaccatggcg gccaacatcc cgaccatcgc cgcgccgaac acgaatccgc cgatcagcga 2581561 cgagacgacc accgtcgggg tgccccggcc cctgcgtttg acgattccgg tcagcccacc 2581621 gatgtaggca ctgtggacca cccccatgaa accgcccagc cccgcgatca ggaaggcgat 2581681 catcccggcc gcaaccgtcg cggccgccag cacgcggaga cggtagcggt aggccagcag 2581741 gccggtgggc acggtgccca acagcgccag accggccgcg aacggaacga cgacggagat 2581801 gatcgcggtc accgcgcaca gcgccgccat caccgacgcc tgcgccaatt cactcggccg 2581861 cagcggcccg ccccgatgtt gcgcggggca agggccgagc ggggtcactt caccgattct 2581921 gccaggctca ggcccgcaca cggcgcagca catcgattag cctcgcatag caaagctatg 2581981 caacgatggg gggatgagtc cctcccccgc cgccgccaac cgcagcgagg tcggcgggcc 2582041 actaccgggc ctgggagcgg atctgttggc agtggtcgcg cggctcaacc gcctagccac 2582101 gcagcgcatc cagatgccac tgcccgcggc tcaagccaga ctgctggcca ccatcgaagc 2582161 ccagggggaa gcccggatcg gcgacttggc cgccgtcgat cactgctcgc aaccaacgat 2582221 gaccacgcag gtacgacgac tcgaggacgc tggactggtt acccgaaccg ccgacccggg 2582281 agacgcccgg gcggtccgca tccgcatcac gccggaaggc atccgcacgt tgaccgcggt 2582341 gcgggcagac cgcgcggctg cgatcgagcc tcagctggcc ctgctcccac cggcggaccg 2582401 ccgggtgttg gcggatgcgg tagacgtgtt gcgccggctg ctcgaccatg ccgccaccac 2582461 gccgggccgg gcgacgcggc aataggcatc gagatgtcga acgccgcgcc gttggcggtg 2582521 tgggtcggat cgatgcgccc gaaaacgcaa agggaatcgc ttggcggctc ctgctgctgg 2582581 agttgtccgg accatcccga ctactccgaa aggccaatgc gagccggctg attgacggcg 2582641 aacgccaact tggcccgaaa agaccggcat ttcactacta tcaatgtgcc tcgatcgtcg 2582701 ttggataaca accgtagtga gtcgagagga accagtatgc agttcctgag cgtgattcca 2582761 gagcaggtcg agtccgcggc tcaagatttg gcgggcattc gctcagcgct gagcgcgtct 2582821 tacgcggccg cagcgggacc cacaacagcg gtggtttccg ctgccgagga cgaggtgtcg 2582881 accgcgattg cgtcgatatt cggcgcctac ggtcgacagt gccaggttct cagcgcccag 2582941 gcctccgcgt ttcatgacga gttcgtcaac ctgttgaaaa ctggcgcgac tgcataccgc 2583001 aacaccgaat tcgccaacgc ccaaagcaac gtgctgaatg cagtgaacgc accggcccga 2583061 tcgctgttgg ggcacccgag cgcggctgag agcgtgcaga actcggcccc aacgctaggc 2583121 ggtggccaca gcaccgtgac cgctgggctt gccgcacagg ccggtcgtgc cgtcgcgacg 2583181 gtcgaacaac aggctgcggc tgcggttgcc ccgttgccaa gcgccggcgc cggactggct 2583241 caggttgtca acggcgtcgt gaccgccgga cagggttccg ccgccaaact tgccaccgcg 2583301 ctgcagagcg ccgcgccctg gctggccaag agcggcggcg agttcatcgt ggctgggcag 2583361 agcgcgctga ccggtgttgc tttgctgcaa cctgccgtgg tcggcgttgt tcaggcgggc 2583421 ggtacgttct tgaccgccgg aacgagcgct gctaccggac tgggtctgct cacacttgct 2583481 ggtgttgagt tcagtcaagg cgttggcaac cttgcgctgg cttcagggac cgccgcgacc 2583541 ggacttggtc tgctgggcag tgccggtgtg caactgttca gtcctgcctt tttactggct 2583601 gtgcccaccg cgttgggtgg agttggctcg ctcgcgatcg cagtagttca gcttgtgcaa 2583661 ggcgtccaac acctgtcgtt ggttgtgccg aacgttgttg ccgggatcgc tgcactgcag 2583721 accgccggtg cccagtttgc ccagggtgtt aaccacacga tgctggccgc tcagctcggt 2583781 gcccctggga tagctgtctt acagaccgcc ggtggccatt ttgctcaagg cattggccac 2583841 ctgacgacgg ctggcaatgc cgctgtcacg gtgctgatct cctagccggg cggtcgagct 2583901 tcatcccgga gccgctacgt tacgccgaga tgctgcaccc ggagaatcgg tccgattgag 2583961 ttctgggacc gataagttcg gctggcgtcg atgccggctg ccgcaccaag gccgcctgca 2584021 acatccccat gtcggtgacc gttcggcggt cgtacacttt ccaagtcaga acggcggcag 2584081 cggcgtagca catcatgaag atccagaacg catcggtccc actgccggtg ctgaggtagg 2584141 actcacgcag cgccatattg attccgaccc cgccgagcgc gccgaaggcc gccacaaacc 2584201 cgatgactac tcctgagatg atgcgtgacc agtcgcggcg ttcggcttca ctgagatcca 2584261 gggagcggct gcacgcctca aaaatcgtcg gaatcatctt gtacacagac ccgttgccca 2584321 acccggatag gacgaacaac gcgacgaagc agacgaagta gccgaccatg gtagcgcccc 2584381 gatgctggcc gacatgtcgg ccttcgaggg tgctggcact gatcagcagc ccagcggcga 2584441 gcgtcatcgc cacaaagact ataagggtca agcggcttcc accgactcga tcggccagcc 2584501 ggccaccgta aatccgggcc accgccgcca gcaacggccc gacaaacgcc aactcgacgg 2584561 catgcagcgt cgcgcgcgcc gggctttgtc cgcacgccag gaagttggtc tgcaacacct 2584621 ggccaaacac gaaggagaag ccgatgaatg agccgaaagt gccgaggtag agcagcgaga 2584681 gcaaccacgt gtcgcgggtc gacagaaccg cggaaacgat cggccgaagc cggttcacct 2584741 gcacccggtg ctgttcgaca ttgttcatga acagcgacac tccgattacc gcgattgcca 2584801 ccagaaccac atacagtgcg cagaccaggt aaggcttccg ctcaccgaca gtggcgattg 2584861 ccaacaaccc aactagctgg atcgccggca ccccgagatt gcctacccca ccggcaattc 2584921 cgagcgccga acccttgagc cgatgtggat agaaagcatt ggcgttgctc attgacgacg 2584981 cgaagttgcc gccgcctaag ccggtcaggg ccgcacacac cagatacggc cacagcggta 2585041 gccccggatg ggtcaacaac accgttgtgc caatggccgg aattagcaac acgattgccg 2585101 aaaaagtcgc ccagttgcga ccgccaaaga tcgcgctggc caacgcgtag ggcatccgca 2585161 ggaacgcgcc aaacagcgtc gcgatggtgc cgagcagaaa cttgtcactg gttgaaaagc 2585221 cgtagacgtc ctggggcatc agcaactcca gcaccggcca gagcgtccac accgagtaac 2585281 ccaggtgaac cgtcacgacc gaccaaagca gattgcgtcg ggcaatgccc ttgttgcctg 2585341 cctcccacgc tcctagatcc tcgggatccc aatgcgtgat gtgacgtgag ccacccaggc 2585401 gcctgagcga aggggccgcg ggactgcgcg gcgactcctc gcgttgcagc agcgtgtgct 2585461 gttccatcac cctccttgtt cccaccctgg tgcgaatgcg ggccggccta ccagggtgcc 2585521 agccttgcgt gtacgaagtt gtttcctggc agcctgaaac tcctgtagaa ctcctgtaaa 2585581 agtgctgaag gcaatacaca attgggctcg cccttgagcc gagaagacct aaaccctaca 2585641 tgtaaagctg cgctgttgtc ctcgcagcaa gaaaacagcg aaagctattg tgctcgagta 2585701 ctactgatgg gggatcgagc cgagcgcctc gagcttgcca tctgatccga tgtggaatcg 2585761 caccgtgccg atgccggtgg gacagcactc ctggtcgctg ccgatctgcc attggtactg 2585821 aaccgtcacg gtgtcatcgc ctgcaggcaa tacggtgatg tagggcttcg gattccgagt 2585881 cggcgagccc agcgggatgt tgcggtcgaa gaacaacagc tgttggggag tcgactggga 2585941 ggcaattgtc gggatgattt gcacccaatg caagcggcag ttgcgggtat gtcctcgggt 2586001 gatttcgacc catttggagc ccggtaccac gatcgggacc gcagcgatgg cctgccgtac 2586061 cgtgtcagcg gtcggcccgt cggaatcctt gcaggtgttc ggtggtgacg gtcgtgttgt 2586121 cggcggcttc caggcgcaac cggaggcgcc cagccccaca atcagtgcca acagtgccaa 2586181 cagtgccaac agtgccagaa tcgggacggc gctacgctga cgacgcacgt cacgagctta 2586241 gcgaaaactg ggaatttccc ctacgtttca tcaacgcctc aggtgtcgat cctaaagcgc 2586301 gggtgccgcc ggtattcttg ccccaaatcg gtcggttgac acccgatgcg gtcggcgaag 2586361 ccatcggcat cgcggccgac gacatcccga tggcggcacg ctggatcggc agccgaccat 2586421 gctcgctcat cggccagccc aacacgatgg gcgacgaaat gggttacctg ggaccaggtc 2586481 tagcgggtca gcggtgcgtt gatcgattgg tcatgggcgc cagtcgatcc acctgctccc 2586541 gattgccggt catcgcgtcc gtcgacgaac ggctgtcggt gctcaaacca gttcggccgc 2586601 gcctgcattc aatctcattc atctttaagg gccgccccgg ggaggtgtac ctgacggtca 2586661 ccggttacaa ctttcgcggt gtgccgtagt tcggggtgtg ctcgacctgc ctcgccgagc 2586721 gcccccgaca atcgggtcgc catctatgaa aggacatcta gcaacattcg gccacccagc 2586781 gcttccgaca taccgaggat catggttgag tcgggaaccg ggatccccct accggcttcc 2586841 tgctggagcc ggacgagatc gaggcgatgc atgccgaagg cttcctcgcc gcactggatc 2586901 tggcactctt ctgcggccag ggcagcgctg tacgttcgcg gcaaacgccg acccgatggc 2586961 caagggcgtc gatcgtgcgc tctgcgaaat cgtggccgaa cgccggcaac tggacctgga 2587021 cctggccaaa gcccaagtcc ggtcggcgct cgccaaccag cgttaccatc gcgacgtcca 2587081 ttaaacccag cacggtcacg aacggaggtt gtgatgagcg acgcccgcgt gccacggatc 2587141 ccggccgcgt tgtccgcacc aagtctcaac cgtggagtcg gcttcaccca cgcgcagcgg 2587201 cggcggctgg ggctgaccgg ccggcttccg tcggccgtgc tcacgctcga ccaacaggcc 2587261 gaacgcgtat ggcatcagtt gcagagcttg gccaccgatc tgggccgcaa cctgcttctc 2587321 gaacagctgc actaccgcca cgaggtgctg tacttcaagg tgctggccga ccatttgccc 2587381 gaactgatgc cggtggtgta cacgcccacc gttggcgagg caatccaacg cttctccgac 2587441 gaataccgcg ggcaacgcgg actgtttctg agcatcgacg aacccgacga aatcgaggaa 2587501 gccttcaaca cgttggggct ggggcccgag gacgtcgacc tgatcgtgtg caccgatgcc 2587561 gaggcgatcc tgggtatcgg tgactggggt gtgggtggca tccagatcgc tgtgggcaaa 2587621 ttggccctct acaccgccgg cggcggcgtc gatccgcgcc gctgcctcgc ggtgtctctg 2587681 gatgtcggca ccgacaatga gcagctgctg gccgatccgt tctatctggg caatcgccac 2587741 gcccggcggc gcggtcggga atacgacgag ttcgtcagtc gctatatcga aacggctcaa 2587801 cggttatttc cgcgtgccat tctgcatttc gaggacttcg ggccggcgaa cgcgcggaag 2587861 atcctagaca catacggcac ggattactgc gtgttcaacg atgacatgca aggaaccggc 2587921 gcggtggtct tggccgccgt atacagcggt ctgaaggtta ccggtatccc gctgcgcgat 2587981 cagacaatag tcgtcttcgg cgcaggcacc gcagggatgg ggatcgccga tcagatccgg 2588041 gacgcgatgg tggcagacgg tgccacgctc gagcaggcgg tgtcccagat ctggccgctc 2588101 gacaggccgg gcctgttgtt cgacgacatg gatgacctgc gcgacttcca agtgccgtac 2588161 gcgaaaaacc gccaccagct cggtgtggcc gtcggggatc gggtcgggct gagcgacgcg 2588221 atcaagatcg catcgcccac tatcctgctc ggctgctcaa cggtctacgg agcgttcacc 2588281 aaagaggtgg tcgaggcgat gacggcgtcc tgcaaacacc cgatgatctt tccgctgtcc 2588341 aacccgacgt cgcgcatgga agccatcccc gccgacgtgc tggcgtggtc gaatggcagg 2588401 gcgctgcttg ccaccggcag cccagtcgcc ccagtggaat tcgacgaaac cacctacgtc 2588461 atcggtcagg ccaacaacgt gttggcgttt cccggcatcg gactgggcgt cattgtcgct 2588521 ggtgcccggt tgataaccag gcgcatgctg catgcagcag cgaaggccat tgcgcaccag 2588581 gccaatccga caaatcccgg agactcgctg ttgccggatg tccaaaatct gcgggccatc 2588641 tcgacaacgg tcgccgaagc tgtctatcgg gccgccgtcc aagacggggt ggcttccagg 2588701 acgcacgacg acgtcaggca ggccatagtc gacaccatgt ggctcccggc atatgactaa 2588761 ccgcgcactc gacggtcatc gctgtaggca gcctctcgct tagcgtcgct gcccgcggtt 2588821 tgcacgtcac gcggaaacca tcgccagccg gcgagaaaca cgacagccag tgttgcagtg 2588881 gcgacgagca acgccacccg aatgccttcg atgaaatcct cctccgcaat cgcgacgggg 2588941 tcgcgatgct caatgtgccg ccgggggacg attccgccca catgcgctcg cggattggca 2589001 ctgtcgataa tgatctcggc aaggacgtgg cgctggaccg ggtcgggcac cgcgcgctcc 2589061 agatggggct cgagtgtggc cgaaagccag gcggcaagga cggagcccaa aaccgcgaac 2589121 ccgatcgtcg agccgatcgc ccgctgagca ctcatgatgc cggacgccat gcccgcacgc 2589181 tcggcgggga ccgcggtcat ggcgacggtc gtgatcggcg tcaggcacaa cgcgacgccg 2589241 ctcccgcaca agcccagccc gaccaggacc agggccgagc tccggtgctc gctgaagatg 2589301 agcatgagca gacccagcat caacatgcac agccccgcca ggatgggaac gcgtgctccg 2589361 atccggccaa ccaggtgccc aacaagtggc gacacgatgg ccacggccgc actgaacgga 2589421 aggatcatca ggccggtcac gctcggggta tagccgcgca cgttctgcag gaactgggtg 2589481 gtgagcagca gcatcccata gacggcgaag aacaccgtgc agatggtcgc gatggccagg 2589541 gcgtatgagg tgtcgcggaa cagggtcaga tccatcatcg gattcgatga tctgcgctca 2589601 agccagacga acagggcgca gccgacggcg gctgtccaga gcatcacgat ggtctggaca 2589661 gacgtccagc cgatctgggg gccttcgatg accgcataca ccagggcacc cacggcaacg 2589721 atgaacagca gctgcccgga cagatcgaag cggcgtgccc gctcgttaca cgactcctcg 2589781 acgtagcaca aagtcaggaa gaggacgagt gcgcccatgg gcaggttgac atagaagatg 2589841 ctgcgccacc cccactggtc caccagcaga ccgcccagtg tcgggcccgt cgtcgtaccg 2589901 atgctcgcga tggcggtcca gatcccgatg gcgcgcgcct tctccttcgc ctccggaaag 2589961 gccgcgctga ccagggcgag cgaggttacg ctgacggccg ccgcacctag gccctgcgcg 2590021 ccccgcgcgg tggtgagcac cgcgattgag ggcgccaacc cgcaggcgat agatcccagc 2590081 gtgaacaacg aaacacctat caagtaccag cggcgccgac cgtagaggtc ggcaagcgtc 2590141 gccgccgaca tgatgaagac cgccattccg aggctgtagg acgccaccac ccactgcagg 2590201 ccgtcctccc ccaccgcgaa actgcgctgg atgtcgggca gcgccacgtt cacgatcagt 2590261 gcgtcgagaa agatcatgaa caggcccagg ccagtggcga tgagcgtgag gagctgcgtg 2590321 cggttcatgc gggccccgat ctacatggat ttcggtggcg atctgtgacc agacactagg 2590381 ctgcgccagc gacggcgtca gccgcttcgg tcgattcgag ccgaatggtc gacggctgcg 2590441 gaaccgaccg caaaactggg gcaaaaggtt caccgcgggt gtaagccagc taggcgaacc 2590501 gatcccgctg gcccatggcc tatagtgggc ccatgcaaca ggccatacag ctgcgcttta 2590561 tcctcccgcg ccgcctcgcc gtgggctgtt gttgttgttg attcctggcg tccacagcaa 2590621 tcctcgcgct cttgcccgca aacgggtgga aatcggtgtt cgcccgcggc gtacagccgc 2590681 cgcgcactca cgagtcgttc agaaagatca acagccatga ccgtgcccac ggatgcagcc 2590741 atcgacttcg acgtcagctg ggaggccaac tgggcctgga ccgacactgt tgggcgtagc 2590801 agatgagcat cgccgaggac atcacccaac tcatcgggcg cacaccgctg gtccgactgc 2590861 gccgagtcac cgacggcgcc gttgccgaca tcgtcgccaa gctggaattc ttcaacccgg 2590921 ccaacagcgt aaaagaccgt atcggggttg ccatgctcca agcggccgag caggcaggtt 2590981 tgatcaagcc ggacacgatc attctcgaac ccacgagcgg taacaccggc atcgccctgg 2591041 ccatggtttg cgcggcacgc ggctaccggt gcgtgctgac catgcccgag acgatgagtc 2591101 tggagcgccg gatgttgctg cgcgcatacg gtgctgaact catcctcact ccgggtgcgg 2591161 acggcatgtc aggtgccatc gccaaggctg aggagctggc caagaccgat caacgctact 2591221 tcgtgcccca gcaattcgag aacccggcga acccggccat ccatcgcgtc acgaccgccg 2591281 aggaggtctg gcgtgacacc gacggcaagg tcgacatcgt cgtcgcggga gtcggcaccg 2591341 gtggcaccat caccggcgtc gcgcaggtca tcaaggaacg caagccgtcg gcccggttcg 2591401 tggccgtaga gccggccgcg tcgccggtcc tttctggtgg ccagaaggga ccgcacccga 2591461 tccagggcat cggcgccggg ttcgtcccgc cggtactcga ccaggaccta gtcgacgaga 2591521 tcattaccgt cggtaacgaa gacgcgctca acgtggcgcg ccggctggcc cgggaagagg 2591581 gcttgctggt cggcatctcc tcgggcgccg ccacagtggc cgctcttcag gtggcccgcc 2591641 ggccagagaa cgccgggaag ctaatcgtcg tagtgctccc cgacttcggc gaacgatatc 2591701 tgagcacacc gttgttcgcc gacgtggctg actaagccat gctgacggcc atgcggggcg 2591761 acatccgagc agcccgggag cgggatccgg cggcccctac cgcgctggaa gtcatcttct 2591821 gctacccggg cgtgcacgcc gtgtggggcc accgcctcgc ccactggctg tggcagcgtg 2591881 gcgccaggct gctcgcgcgg gcagctgccg aattcactcg catcctgacc ggtgtagata 2591941 tccaccccgg tgccgtcatc ggtgctcgcg tgttcatcga ccacgcgacc ggcgtggtga 2592001 tcggagaaac cgcggaggtc ggcgacgacg tcacgatcta tcacggcgtc actctcggcg 2592061 gcagtggcat ggttggcggg aaacgccatc ccaccgtcgg tgaccgcgtg atcatcggcg 2592121 ccggggccaa ggtcctcggt ccgatcaaga tcggcgagga cagccggatc ggcgccaatg 2592181 ccgtcgtggt caagcccgtc ccgccgagcg cggtggtggt cggggtgccc gggcaggtca 2592241 tcggccaaag ccagcccagt cccggcggcc cgtttgattg gaggctgccc gatctcgtgg 2592301 gagccagcct cgattcgctg ctcaccaggg tggccaggct ggaggccctc ggcggcggcc 2592361 cgcaagcagc aggagtcatc cggccacccg aagccgggat atggcacggc gaggacttct 2592421 cgatctgagg caatacccgg ccgccgacaa tgccttcttc ggcgccgccc accgacgcgc 2592481 atcatcggct gctagccccc gcaccgggtt ccgtcctcgc cgaattcacc tcgggccgga 2592541 ggttgagctg cttgggcttc ggcagccgaa accggggcga tacaaacgtg ggttgcggat 2592601 acgaccgctt tgcgacgcgg tttgtccaac gcaggcttgg aaaacttctc caagcacgag 2592661 cgagattact gattcgaatt ggctcttgac agcaccggcg aagaggtgta gagatgcgaa 2592721 tcactatgtg gacagcaatc tttggaaagc tcttgctgtc aaatccgtca cgaacctatg 2592781 cttagcgata ccttgcgcca aacatgcagt cgcttgaccg ttgagatcgc tgaggtatcg 2592841 gccatggatg tccctcacga gcagccagcc ctctcttcga gcaaatcgaa tcgctttact 2592901 tcgcaaaggc aaacaactgg tgtgggaacc accactgttg aacggctcga accgcggtta 2592961 tctcccgcgt cccgccacat cactgaggct aaagctttcg gcaccgagtg ccacgtaagt 2593021 tcctttaccc gtgagcagga tcccgacagg gcggtccgtg tggagcagat ccacggtgaa 2593081 gcgtatgtcg ccgccggcca tgtgtacgaa tctgcgctcg atgaattggg ccggctggac 2593141 aattccaacg ccgagttcat cctcgacaag gcacgcggta gcacccgaga aaccgaggtc 2593201 atatacctgc atgcggttcc cgcggagccc ctctccggca gccaaggcga aggaggcctg 2593261 cgaatagtcg gcatttccgc tgtggggtca attgacgacc tcagtgcatt taaggccgcc 2593321 aaaccgtcga tgggcctggc gcatcaacgc aagctttatg acgcgatcga agacctgggt 2593381 cacggcgggg tcaaggagat tgcggcatta tcggttacgg ccgatgcccc tcccacggtg 2593441 tcgtattcgc tcatccggga ggttttgcgc ttgtaccacc gaaccggcga aaaattgata 2593501 atcacatttg ccatgccagc atacgccaag atggtgatga attttggtcg atttgcgatg 2593561 cctcaagtgg gcgaaccgtt ctatgcgcat agaaataatg accctaggac atcgaatgat 2593621 ctcttgctgg ttccctcaat agtcgagcca tcgaattttc tcgagaatat ttcccgcggg 2593681 gtcgtgacag cggatgacgg cccgaccgcg agaaggcgat tcgccaccct atgctatatg 2593741 accgacggcc ttgatgacta tttcatgccg ttgactcggc aggtccttag cgaaggaatc 2593801 caagacatct gagttctgga agcggtaatg ggcggtcggg cgtgcgcaac tccggcaaca 2593861 aacagcttgg agcttttacg cgaagcggga ttcactatcc gaaccagacc gctcggcagg 2593921 ggcatagcaa taagcttcaa ccgattgacg cattgtgcga actgacggcg cccgcgcatg 2593981 gccaatccgg aagaccatca ttggccagtg gccgggcgct aacaggttcc agccccccac 2594041 cagtgccgct cgaacatgcg gtgcaaccca ttcgcaggcc ggcagggaaa gcaccgcgga 2594101 agccgcaaag ggctgcagtt ccgcgcccaa tagtgtcgtc cgcaaccaga tgcgctcgaa 2594161 aaccgcgccg gcagtcagcg cacccgacgc gaggtcgaga gacgtcgtca gcgcgcccac 2594221 atggggtgcc aatcggcacg gcaggtaggc cgcgcgcaac ccgagcgcgt ggtgcatgcc 2594281 cacggtccgc aggaggcgca gcaccctcca atgccgaagc ccacgaaaca tcgggcgcat 2594341 ccacgcttca acctcaagag acccgggcgg caacccatcg tcgctgctcg cggtccagcc 2594401 aatgtcgaag cggacggccg aaaagagttc ttcgtgtagt tcacgagatc gaaagcgctc 2594461 agtttcggcc aatctgacca accgaaggat ctgtttcctg gtctctggcg agtcaaacca 2594521 atgcagttgg atcccgtcaa tgccggtggc ctccgccgag agtgcaccca attcgccctg 2594581 cgaaagtggc ggtccacgaa agcgaacacg ccggttggtg cgccttcgct cgatcgcgac 2594641 ctcgattgga tccaccctgg tttgtggcag gcgatccacg tcgatctccg ccaccagccc 2594701 cggattcccg ctatccggaa accagcacac cttcgtttca aaaccaaggc gtccagcgcg 2594761 cagcttcacg ttttcgacag ccgcaccgat cgcgaccaaa ctcatgatgc ggcggtgctc 2594821 gggggcggac ctccaagtct gatcgcccca caaccgcacc cgcctaccgg catgttcgag 2594881 ctggacttcg cgccggttgt ccgcggatgg cgccagcgcc gccgcctcga cgagcgacag 2594941 gaattcagca ggatccagac ccgtcatacc cgggccccag cggccggcac gcattgccgt 2595001 ggcagagtgg tcgcgccgac gaacagtgcg gaagcgatat gtctatccca tgttcgctca 2595061 aacagcggtg cgctggcagt atctgagtac accattctag gtgcagctcc caactagtag 2595121 ctcggttccg tcctgtgata ccgcagtccc ggtattaccc ccgccgatcg tcgatttatg 2595181 tagcgggcca gcaatcgccg cttgactctc tgtagagggt ggcgatttcc gcaccgtaaa 2595241 cgcttccgaa catagatgct gcggtaagca tcgaactgat gaaagtaggg agcagcgtaa 2595301 acgcgcccat gtccgagcag aatcttcaac acctcggccg ccaccacacc ggaagccaga 2595361 tgacaggcga ggccaaccga tggacccgtg cgattttcga tgtcgacgta ggacagatct 2595421 atggagcgcc gatgcgtcgc ggatggtgct attccagcta taaatgcgac gaacttatcc 2595481 accgtgttca tcgcatcaga cagatcgaaa taccgatcga acgtcatacc cttaggatcg 2595541 aaaacgaccc aggccgtact gaacccgagc gggccagcgc ctagcgcgta gattccccgc 2595601 tgctgtgctt cacgatagag caggcgacgc aaatcgattt cgaacgcgtc gatgccgtcc 2595661 accaaaacat ctgctccctc tagaaaggta gctgcattct ctttcccaat aggttcgcag 2595721 aaagcacgga tttctgcttc agggttaata tcatgaacga tattgcgcat gacctctgcc 2595781 ttggcctggc cgttggtcga gcgcatagcg ccgtactgcc gattcgagtt gcgtatttcg 2595841 aagacgtccg ggtctgcaat ggtgaacttt cctattccca tccttgcgag ggcgaccatg 2595901 tcaattcccc caaccccacc catcccagcg attgcaacgc gactattccg aagccgttgt 2595961 tgttcggttg ggctaatcaa tccaaggttg cgacagaaag cttcgtcata agaccatggt 2596021 gcgctttctt tcacccgtcc agagtcgggg gcatccgcac cggctcgcat cgcatcatcc 2596081 tcccacgacg ggccgctcat cagcttgggc catttcaatg tacttgatac cccgcgctgc 2596141 gggtaggcca ctgcgacgat tcaaacacgg tgtcacacgg tgaatagtgt cgagatgggc 2596201 tctgatcaac cgtcgcaaac ccggtttcgc atcgatagcg gaatcgcacc gggttgcatg 2596261 gaggctgctg accttggaaa acaagatgta ttcattacga caaaacaagc gccgcggaaa 2596321 ctttgcacgc tcgagcattc cgccgcggct cacgcacatc ctggccgcct tcccgcaacc 2596381 gtcccccgga attactgatc aaaccctggg tttaccaact tccgggcatg gggcgaaggt 2596441 cgacagccag aacatggccg tgcgtgatat gggcattcac gggacggagc cgctaaggag 2596501 accggtacga ttcaatctcc atatgagcgg tgcggcggct gttgtcaggt acgttgaaca 2596561 ccggtgggcg atcgggtgcc ggcaggttgg tcttctcctg tgatgcgagc gcgcctccgc 2596621 gccaaccacc gcgtgcgaag caggtgctga tgccacagtg ctgatgtcac aaggaaccgc 2596681 gagggggtcc cggaccctac atggtgccgg gcgaagtcca catgagtgat acgccgtcag 2596741 gcccgcaccc aatcatcccg cggacgattc gcctggccgc gattcccatc ttgctgtgtt 2596801 ggctgggatt taccgttttc gtcagcgtcg tcgttcctcc gttggaggcg atcggtgaaa 2596861 cccgggccgt ggcagttgcc cccgacgatg cgcaatcgat gcgtgcgatg cgacgtgccg 2596921 gaaaggtgtt caacgaattc gattccaata gcatcgcgat ggtcgtcctg gaaagcgatc 2596981 aaccactagg cgagaaggcc cataggtatt acgaccacct ggtcgatacg ctcgtactgg 2597041 accagagcca tatccagcac attcaagact tttggcgtga tcccctgacg gcggcgggtg 2597101 cggtcagcgc agatggtaag gcggcgtacg ttcaacttta cctcgccggc aacatgggtg 2597161 aagcactcgc aaacgaatcc gttgaagccg tccggaaaat tgtggcgaat agtacaccgc 2597221 cggaaggcat cagaacctat gtcaccggac cggcggcctt gtttgccgac caaatcgccg 2597281 ccggtgaccg aagcatgaag ctgatcaccg gattaacgtt cgcggtaatc accgtgttgc 2597341 tgctgctcgt ctatcgctcg atcgccacca cgctgctgat tcttcccatg gtgtttattg 2597401 gactcggcgc gacgcgtggc accattgcct ttcttggata ccacggaatg gtcggccttt 2597461 cgacttttgt ggtcaatatc ctcacggcac ttgccattgc tgccggtaca gactacgcga 2597521 tcttcctggt cggccgctat caagaagccc gccatatcgg ccagaatcgc gaagcctctt 2597581 tctacacgat gtacaggggc accgctaacg tcattctcgg atcgggactg accatcgccg 2597641 gcgcaacata ttgtctgagt ttcgcccggc tgacgctgtt tcacaccatg gggcctccgt 2597701 tggcaatagg catgctggtt tcggtcgcgg ccgcgctgac cctggcgccc gccatcattg 2597761 ccatcgccgg ccgcttcggc ttgctcgacc ccaagcgaag actgaagacc aggggctggc 2597821 gtcgtgtggg taccgcagtc gtgcgctggc ccgggccaat tctggccacg tcggtcgcgc 2597881 ttgccctggt gggattgctc gcactaccgg gctaccggcc cggctataac gatcgctact 2597941 acctgcgcgc tggcacgcct gtcaaccgcg ggtatgcggc cgccgaccgg cactttggcc 2598001 cagcccggat gaaccccgag atgctgctgg tcgagagcga tcaagacatg cgaaatccgg 2598061 ccgggatgct cgtcatcgac aagatcgcca aggaggtcct gcacgtgtcc ggggtcgagc 2598121 gggtgcaagc gatcacccgg ccgcaggggg tgccccttga gcatgcgtcg attccctttc 2598181 agatcagcat gatgggtgcc acccagacga tgagcctgcc ctacatgcgc gaacgcatgg 2598241 ccgatatgtt gaccatgagc gacgaaatgc tggttgcgat caattccatg gaacagatgc 2598301 tcgacttggt gcagcagctc aacgacgtta cccatgagat ggcagccacg acgcgcgaga 2598361 tcaaagctac taccagcgaa ctgcgagatc accttgcgga catcgacgat ttcgtcaggc 2598421 cgttgcgtag ctatttctac tgggagcacc attgcttcga cattccgttg tgctcggcga 2598481 cgcgatcact gtttgacacc ctagacggcg tcgacacgct gactgaccaa ttgcgggccc 2598541 ttaccgacga catgaataag atggaggcgc tcacaccgca atttctcgca ctgctgccgc 2598601 caatgatcac gaccatgaag accatgcgga ccatgatgtt gaccatgcga tcaacaataa 2598661 gtggcgtaca agatcaaatg gccgatatgc aagaccatgc gactgcgatg gggcaggcct 2598721 tcgacaccgc aaaaagcggc gattcattct atcttcctcc ggaagccttc gataatgcag 2598781 aattccagca aggcatgaag ttgtttttgt cgccgaatgg taaggcggtg cgcttcgtaa 2598841 tttcccacga gagcgatcca gcaagtactg aaggtatcga tcgcatcgaa gcgataaggg 2598901 ccgcgaccaa agatgccatc aaggcgacac cattgcaagg cgctaaaatc tatatcggtg 2598961 gcacggctgc gacctaccaa gacattcgag acggtaccaa gtacgatatc ctcatcgttg 2599021 gtatagccgc ggtatgcctg gtatttattg tcatgctcat gattacccag agcctgattg 2599081 cgtcactcgt cattgttggc acggtacttc tgtcattggg tactgcgttc ggactgtccg 2599141 tgctcatctg gcagcacttt gtcggtctcc aggtgcattg aacgatcgtc gcgatgtctg 2599201 tcatcgtctt gctggccgtc ggttctgact acaacctcct tttggtgtcc cggttcaagg 2599261 aggaggtcgg cgctggatta aagaccggga tcatccgggc gatggccggc accggcgcag 2599321 ttgtcacgtc ggccggtctg gtattcgcgt tcaccatggc gtccatggcc gtcagcgaac 2599381 tccgcgttat cggacaggtc ggcaccacca tcgggctcgg tctacttttc gataccctgg 2599441 tggtccgatc gttcatgacg ccatccatcg cagcgctgct aggtcgctgg ttctggtggc 2599501 cgaacatgat ccactcgaga cccaccgtcc cggaggcgca cacacgccag ggcgctcgcc 2599561 gaattcagcc gcatctgcac cggggttgat atgcacttcg gtgccgtgat cggcgcccgg 2599621 ggtgttcgtc gaccatgcga ccggcaacgc ggccttgcgc acaggcgcga tcgctcattc 2599681 gtgcccgggc ggtcgaagac caagagcgcg cagcagttgg tcgcggtccc acggccggcc 2599741 gctgccacta gcattgtcgc cggatgctgt cagcagccca tttcgagctc gaagcccgga 2599801 caacttcttt agcgtgtggc gcaacccccg aagactcgtc aacggaagaa gcagtagctg 2599861 ctcatcgcgc ccgccatcag cccggcgcgc cgagttgccc gggtcggccg ggttgccctg 2599921 gtgtgccgcg ttgccggggt tggtcgtcgc gtgcatcgcc tgcgccttgg tcgccggcgt 2599981 cgagccggat tcggctgcca ccgcagacgt ggtagccggc gacaccgcag gtgccgtggc 2600041 taccgcaggt gccacctgtt cacccactcc gccgatagca ccggaatggc cttcaccgcc 2600101 gagcccccag tgcccgccaa cccacggagc cccaccgaag ccgggcatca acccgccctc 2600161 agcagaagcg cccgcagcgc cggtgctcag accgccgtca ccgctggcca tgccgactcc 2600221 gccgtccccg cgaacagacc cgacgatgtc cctgccagtc ccggcaccac cgactgcgtc 2600281 tgcgctgccg gccccagtcc cattgccggc tgcgctaccg accccagcac cactgccacc 2600341 cgggtccgcg acaccggcgg cacctgttgc gccgctgccg tgcgaagcag aaacgccgct 2600401 gccgtgcgcg gcgccagcca ctccggggtt accgtcggtg ctgccgagct ggccggcccc 2600461 atgctgctcg ccgacctgac cgttgccgat cggtccgccg tcccagccct ttccgccagc 2600521 ctggccgaca ccgccaaacc cgccggcacc gcgagattgc ccggtgcccg tccccccagt 2600581 gcgatcttgg gcgagttcgc tgaccacgcc ctgcgctttc tgcagcggca aggcgttggc 2600641 ggcctcagcc atggcatacg ccgccgcgcc ctcttgcagg atctgtacaa accggtcgtg 2600701 aaacagcgcc gcttgagcgc taatggcctg ataggcctgc gcccgcgcgc caaacaacgc 2600761 cgcgatgcca gccgacacgt catcgccgcc ggcagccagc actcctgccg tcggggcggc 2600821 cgcagcggcg ttggccgcgc gcatagtgga gccgatcgcg gctagctccc cggccgacgc 2600881 cgccagaacg ttcggggccg cggtaacgtg cgacataagc gagcacctgc ccgtgttgcc 2600941 aactcgctgt gaccggatcg ctggtcgacc cgcgttgtca ccgcgaatcc tatcgcgatc 2601001 gaccaggaac atcccagcat tcaggcatgc ctactgcgcc tcacactgaa gtgtcgaggt 2601061 cggcggagtc ccggcatcat caggcgagtg gcatgcactc accaaccgcg gccagctcgg 2601121 caccagcttg gtgtcggcgc acagagctgt tcgggcccat acgtcgacgt agccgaaccc 2601181 gccccgactc tcgtcggaca cgttctgctg tttggcgtgg ccgaacgatc agatctcgtc 2601241 gcgccgaacg tgtattgccg ggccggtgga agagtctgcc gggagaaaaa ggaaaagccc 2601301 tgcagagact ggtgtgacac gccttgcgca gccacgcggt cggaaaaccg aaccttagct 2601361 catcagaacc caacacaaga ggcgggacaa gccgagttca agccgaacgc cctgctcccc 2601421 cgggaggact cgaacctcca acccttcggt taacagccga acgctctgcc aattgagcta 2601481 caggggaccg cctggtccgt gcgaacgctg gcgcagtcgc gggacgactc tagcgtactg 2601541 gtgtgacggc gcccaactag ggagattcct taccgatggg agcaggctga tggcagcagg 2601601 cacgatgcca gtaggtggtc ggcagcacgt tttcgagaag ctggccagca tcctgggctt 2601661 ggtcgccgcg ccgctcatgc tccttggatt gagtgcctgc ggccgcagcg ccggcaagac 2601721 cagcgaaccg acctgcccca cggagccgat cgatgcggcc gacagctcga caacaccgga 2601781 cccctcgtgt gtggtgcggg ccactgagat caacggcaac gggtcgcgca tccagacctg 2601841 gaccggcagc tatgatgcgg ccgcaaccca gtccggtggt gtgtgtggtg gcacctgcaa 2601901 cttccacgcc acagtgcggt tcacggtcga cgaaggccag atctcgggca gcgtcgatca 2601961 ggtctatcaa gcggcgatgg ttgctatcgc aacacgcccc acttcgccat ctctggcacc 2602021 atgacgatga cgcggtgacc atcgcgtgat ccaagacgta cctgacgggc aataagccga 2602081 taccaaagcc gagcccgcat cacgccgaaa caaccgcgga gtatctgctc ggcgtcgtga 2602141 attgggtgac caagtggaac ctcgattgcg tcgaaggctg caaataggac atcgggtacc 2602201 gcataaccgg atcgggcgcg cgtagccagg cgtgtaaggc aggatggatg caaccgcacc 2602261 gttagtcgga gggaccgcat tgatcgggta tgtcgccgtg ttgggactgg gttacgtgct 2602321 gggcgcaaaa gccgggcgcc gccgctacga gcagatcgcg agcacctatc gcgcactcac 2602381 cggcagcccc gtggccaggt cgatgatcga aggcgggcgt cgcaagatcg ccaatcggat 2602441 ctcacccgat gctgggtttg tgaccctggc cgagatcgac aaccagaccg ccgttgtcca 2602501 gcgcggggtc gagcggcagc cgaaaaccgc gcgctgaccc tcacgcggtg agatcgtcgc 2602561 cgctggcctg ctctaacagg ctgcgccgat aggcctccat ggcgaccagg tcgccgaaca 2602621 gcgcgtggta ttcgtcgcct tgctcgatcg gcgacatgcg ctgcagtttg gacttcacct 2602681 cggcgatctg ccgccccaac caaacctcct gcagacgggc cagcacgccg gcgatatagc 2602741 gcggcagctt gtcgtcgtcg acctgaatcg cctccacccc cagctcgctg atcaaagccg 2602801 aggtcacggt tgatgtcgtc tgctggcgca ccatatccag ccactgcgca ccgctaaggc 2602861 cagccgaggt accgcccgcc gtgtcgatgg ccgcgcgcac agccgcgtac tcggggtgcg 2602921 tgaagccttc gacggtcagc gcgtcgaaca ccgggccggc caacgccggg tactgcaacg 2602981 ccgatttgag tgcctcacgc tgtggccaca gggtcgggtc acgcggatca ggtcggactg 2603041 cgagttcggt cggggggccg gcggtgggcc gctgcgctgc ccgggcgatg gtcgtcgatc 2603101 ccagtctgcc cagcctgggg tgcttggttc gtttggcctc accccgcacc cgaccgatga 2603161 cctgtgcgac gtcggcccac ccgacccagc cggcgagctg acgggcgtat tcgtcacgca 2603221 gcgtggggtc tttgatctgg cccaccatcg gtacgcaacg gcgcagcgcg gccaccctgc 2603281 cctcggcgct atccaggtcc atctcggcaa tcgcggcgcg aatcgcgaac tcgaacaatg 2603341 gggttcgtcg tgccacgagg tcgcgcaggg cagcgtcgcc gcacttcagt cgtaggtcgc 2603401 aggggtccat gccgtcggga gccaccgcga cgaaagactg accagccagc ttctgctcac 2603461 cgtcgaaggc cttgagcgcg gcggcgcggc cggcctcgtc gccgtcgaaa acgtagatca 2603521 gctcgccgcg gaagaagctg tcgtccatca tcagtctgcg cagcatcgcc aggtgctcgc 2603581 cgccgaatgc ggtcccgcac gacgccaccg cggtggtgac cccggccaga tgcatggcca 2603641 tgacatcggt gtagccctcg acgacgacgg cctgatgtcc cttggcgatg tcgcgtttgg 2603701 ccaagtcgat gccgaacatc accgatgact tcttgtacag caatgtctcg ggcgtgttga 2603761 cgtacttggc ctccatcgcg tcgtcgtcga acagtcgccg ggcaccgaac ccgaccacct 2603821 cgccggccga ggtgcggatg ggccacagca gccgacggtg aaaccggtcc atcgggccgt 2603881 gccggccctg ccgggacagt cccgcggcct ccagttcctc gaactcaaaa cccttgcgct 2603941 gcagatgttt tgtcaatgag tcccagcccg acggggcgaa cccacagccg aatttacgag 2604001 cggccgccgc gtcgaagctg cgttcggtca ggtactggcg agccggtgcc gcctcgtcgg 2604061 actgcagcgc ctgcgcatag aacgctgccg cggccgcgtt ggcggccagc agcctgctgc 2604121 gactgccgcg gtcgcgctgc acgctggtgg ccgcaccggt gtagctgatc gtgtggccga 2604181 tccggtcggc aagcaactca accgcctcga cgaagctgac gtgctcgatc ttctggatga 2604241 acgcatacac gtcgccgccc tcgccgcagc cgaagcagtg gaagtggccg tggttgggcc 2604301 gcacgtgaaa ggacggggac ttctcgttgt gaaacgggca cagccccttc agcgaatcgg 2604361 caccggcacg cctgagctgg acatagtcgc cgacgacatc ctcgatacgg gccccctcgc 2604421 ggattgccgc gatatcgcga tcggagatcc ggccggacat cggctcagtc taaagcgttc 2604481 ctgctgacgc caagctgatc ggcatcgatg cgttccaacc gaccctcggt ataggaggcg 2604541 atctgatcaa cgacgacccg caaccgggca gcgtcgtcgg cggcggtatt gaacgcagcg 2604601 gcataaaccg ggtcgagcgt ctgcggcgcc cccgagtaca gcctgtgcgc cacccggtga 2604661 atacgttcgc gctgccgtgc ctgggtttcc agatgccgag ggtcggacat gatgaactgc 2604721 agcgcgagga ttttcagtac cgcgacctcg gcacgtacca gatcgggcac ctgcaggtcg 2604781 gcccggaagc gcaccaacgg tcccggaccg gccgcggccc gggtggtcgc gatcgcggcc 2604841 gatgcaaagc ggcccaccag ctcgctggtc aaccgcttga gcgcgaccga tgccgacaag 2604901 gtggcgtcat acttgccgac ggcggccacc acgggcagcc gcgacagccg ccgcgcggcc 2604961 gccatcaact cgtcggcgct cacccgggag aactcgcgct cccctaacct ggccagcgcg 2605021 gcagcgtcct cttcggcggc cagcacacgc aggtcgatgc gttcggagac aacgccgtcc 2605081 tcgacgtcgt gaaccgagta ggcgacgtcg tcggcccagt ccatcacctg cgcttccagg 2605141 cacgcccgct ccgggggcgc gccttgccga acccataccg ccgattcgcg gtcgtcgtcg 2605201 tagaagccga acttcctccg ctggctgcca agcccgtcac cacgcatcca cggatacttg 2605261 gtgaccgcgt ccagggacgc gcgagttagg ttcagccccg cactaagtcc ttgtgcgtca 2605321 actactttgg gctcaaggct ggtcaagata cggaagttct gcgcgttgcc ctcgaaaccg 2605381 ccgtggctgg ctgcgacttc atcaagcgcc cgctcaccgt tgtgtccata cggcgggtgc 2605441 ccgatgtcat gggctagacc ggccaattcg accagatcaa ggtcgcagcc cagcccgatc 2605501 gccattcccc gtccgatctg agccacttcc agcgagtggg tcagccgggt acgcggcgta 2605561 tccccttccc ggggtccgac cacctgggtc ttgtcggcta gccggcgcag tgcggcgctg 2605621 tgcagcaccc gggcccggtc ccgggcgaag tcggagcggt actgaccctc agtgcccggc 2605681 agaccggcag tctttggcgc ttcggctacc cgccgctggc ggtcgaagtc gtcgtagggg 2605741 tcgtgctcac tcgcgctcac cgacccacag tctgccaggg tggtcgccgc acgcccgtat 2605801 ccgccggcac agcgtctaaa ttgacggtat gcgtctcgtt cgcctgctcg gcatggtcct 2605861 gactatcctc gccgccgggc tgctgctggg gccgcccgct ggcgcgcaac cacctttccg 2605921 gctgtcgaac tacgtgaccg acaacgcggg cgtgctgact agctccggtc gcaccgcggt 2605981 gacggcggcc gtcgaccggc tctatgccga tcgccgcatc cgactgtggg tggtctacgt 2606041 cgagaacttc tccggtcaga gtgcgctcaa ctgggcgcag cgcacgacgc ggactagcga 2606101 gctgggtaac tatgacgcgc ttctggccgt ggccaccacc ggtcgcgaat atgcctttct 2606161 agtgccatcc gcgatgccgg gtgtcagcga ggggcaggtc gacaacgtgc ggcgctatca 2606221 gatcgaaccg gcgctgcacg acggcgacta cagcggcgcg gccgttgcgg cggcgaacgg 2606281 actcaaccgg tcacccagtt cgtcgagtcg agtggtgttg ttggtcacgg tcggcatcat 2606341 cgtcatcgtc gtcgcggtcc tgctggtggt gatgcgccac cgcaaccggc ggcgccgcgc 2606401 cgacgagctg gccgcggcac gccgcgtcga ccctaccaac gtaatggcac tggccgccgt 2606461 gccgcttcag gccctcgatg acctctcccg gtcgatggtg gtagacgtcg acaacgccgt 2606521 gcgcaccagc accaacgagc tcgcgctggc catcgaggag ttcggcgaac ggcgaaccgc 2606581 accgtttacc caagcggtga acaacgccaa agcggctctg tcccaggcgt tcaccgtacg 2606641 ccaacaactt gatgacaaca cgcccgagac gccggcgcag cgacgtgagc tactcacccg 2606701 agtgatcgtg tcggcggcgc acgccgaccg tgaactcgcg tcgcaaaccg aggccttcga 2606761 gaagctacgc gatttggtga tcaacgcccc ggcccggctt gatctgctca cccagcagta 2606821 cgtcgaactg accacccgga tcggcccgac tcagcaacgc ctggccgagc tgcataccga 2606881 attcgacgct gcggcgatga cgtcgatcgc cggcaatgtc accaccgcca ccgagcggct 2606941 ggcgttcgcc gaccgtaaca tcagcgcggc tcgggatctg gccgaccagg cagtgagcgg 2607001 acggcaagcc ggactggtgg atgcggtgcg tgccgccgag tcggcactcg ggcaagcccg 2607061 ggcgctgctc gacgcggtgg acagcgccgc caccgacatc cggcacgccg tcgcgtcgct 2607121 gccggcggtc gtggccgaca tccagacggg catcaagcga gccaaccaac acctacagca 2607181 ggcgcaacaa ccccaaaccg ggcgcaccgg tgacctgatc gcagcccgcg atgcggcggc 2607241 cagggccctc gatcgcgcgc gcggagccgc cgatccgttg accgcatttg accagttgac 2607301 caaggtcgac gctgacctcg accggctgct cgccaccctg gccgaagaac aggcaaccgc 2607361 cgatcggctc aaccgctcac ttgagcaggc gctgtttacc gcggagtcgc gggtgcgcgc 2607421 cgtctcggag tacatcgaca cccgccgcgg cagcatcggg ccggaggccc ggacccggct 2607481 ggccgaggcg aaacggcagc tggaagccgc acatgaccgg aaatcgagca acccgaccga 2607541 agcgatcgcc tacgctaacg cggcatcgac gctggccgca catgcgcagt cgctggccaa 2607601 tgccgacgtg caatccgccc agcgcgcata cacccgtcgt gggggcaaca acgccggcgc 2607661 gatcctcggt ggcatcatca tcggcgacct gcttagcgga ggcaccagag gcgggttggg 2607721 tggatggatc cccacgtcgt tcggcggttc gtcgaacgcg ccgggaagtt cacccgacgg 2607781 cgggttcttg ggcggcggcg ggcggttcta agccacgcgc cagcgcacgg ggatacccgt 2607841 acgctggcgc gtgtggccgt cgacctaggc ttcttcctag ggttcgtcga ccctgtcagg 2607901 cccagctgga gccgacggcg ctgtcggttt gcgccatgtt gttgccggca gcctgcacct 2607961 tctgcccgtg ggcgttggcc tgctcgtaga tcacctggaa gttacggccc agctgggtga 2608021 tgaactcctg gcaagccacc gaaccggcgc cgccccaaaa gtcacccgcg gccaacaaaa 2608081 ctcgagttta tgtttccggc gttaagactg ccgaagttgt agttgccagc gtcgaaaaag 2608141 cctgtgttgc cggcgccggc gttagcgagg ccagtgtttg tgctgccgcc gttccaaaat 2608201 ccggtgttga cgttgcccgc gttcccgaat ccagtgttcg cagtaccgga gttcccgaag 2608261 cctacgttgc cggtgccgga gttgaacaaa ccgacgtttc cggtgccgga gttcccgaaa 2608321 ccgatgtttc cgctgcccga gttcagtccg ccgatgccga tctgaccatc gccggtgagc 2608381 ccgataccga tgttgttgtt gcccgtgttt ccgaaaccga aattcccgct gccggtgttt 2608441 ccgaagccga tgttgttact gccggtattg ccgctaccaa agttgaagtt gccgttgttt 2608501 ccgttaccga agttcgtgtc gccgatgttg ccgctgccca cgttggtgct gccgatgttg 2608561 ccgctgccca cgttggtgct gccgatgttg ccgctaccca ggttttggct accgaagttt 2608621 ccgctgccca ggttgaggtt gccaacgtta ccgatgccca ggttgatgtc accttggtta 2608681 ccgctgccca ggttgaagct gccgatattg ccgccgccta ggttggtgtt gccgatattg 2608741 ccgctgccga cgttcgtgtc accggtattt ccgctgccca aattgaggga cccgatgttg 2608801 ccaaagccga agttgatgcc ctgcagaggc ccaggcacgg tggagccgac tgcttgggcg 2608861 gtcaccgctt gcgcgctgga ctgtgcacta tctagcaagc ccaacaagcc cggcaccgcc 2608921 tgctgccatg gcgccaacgc cgccgccgcc gatgccccac cgtgataacc caccatcgcc 2608981 gccacatcgg cagcccacat ctgctcatag gtggcctcag cagcggcaat cgccggcgca 2609041 ttctgcccaa acacattcga cagcaccaac tgcacaaacg cactgcgatt agccgccacc 2609101 accaccggat ccaccatggc cgcccgcgcc gcctcaaacg cgccggccac cgccttagcc 2609161 tgaaccgcag cccccccagc ccgcgccgcc gcagcagcca accaccccgc atacggcgcc 2609221 gccgccacca ccatcgccgc cgccgcaccc tgccacgcct gacccgaccc acccgccaga 2609281 cccgaggtca ccaacccaaa cgactccgcc gccaacccca actcagccgc caacccatcc 2609341 caggccgccg ccgccgccaa catcggcccc gaccccgcac caaaaaacat ccgccccgaa 2609401 ttaatctccg gcggcaacac cgaaaaattc accactaccg cccctcctct aacaaatcat 2609461 tctcaaccgc acccccgcgc gttaccccaa acgacacgcg gacacccgtc accgagacgt 2609521 cctacgttgt ctgggcgcca aaccggctcg atccccgact tggctcacga ttcgcggctc 2609581 agcattaata gagcccgttg acctgtgagt ttgcttggtg acgggtcgaa aattgtgcac 2609641 ttgatgcact caggagtacc tggacgcccg gacggccaac cggggcgccg ccgaaccacg 2609701 gtggcgcgcc agatgactca attgacccga gtgctgctcc cgctgtccgt accgctcttt 2609761 cgtcacgtcc gcaacactgg ccctcgccgt cggcgatggt cgctgtgccc accttagcgc 2609821 gacaactcgg tttctgcagg tcaacgcccg cctccaatcc cgcacagcca cgaccaactc 2609881 gggaacaaaa ccgccggtca ggcagctgtc gctgagagcc gggcacatcg ggtgtcgccc 2609941 ggtacagtga cacatgtgac cgttgcgacc gtgcgatgtg cccgacgctc gatgcgcacc 2610001 aattcgaacc aactcaggtc ttacgttgcc tggacgccga actagctcga tccagcgccg 2610061 acccgcaccc cactaccggc atctgaaggt gagccagaga cgcgtcgacc aggaagaacc 2610121 gtggccgcac gggtcacccg ggcacaccca accgggccgt ggcaagtgcc gactacctga 2610181 agaatcccga aagtcctaca cccgcattga aagcaccgga gttctggcta cccgaattta 2610241 ccgcacccga actgtcgtca cccgagttgg agataccggc gaggttgtta cccgagtttg 2610301 caattcctgc attgaaagag ccaatgtttg caaacccgga gttgaagcca ggaagcatgg 2610361 ctgggccggc gttggagaag cccgaattac cgttgcctgt gttgaagaac ccggagttgc 2610421 cggtgcccga ggggtcactg ttcccccaac ccgagttgcc ggtaccgagg ttcccgaagc 2610481 ccgagttggc acctgcttgg gtgagcgcgc tgccaaaacc ggtgttgacg ttgccgccat 2610541 tgaaaccacc ggtgttgata tctccaccgt taaagaaacc ggtgttgacg ttacctgcgt 2610601 tcgcgaaacc ggtgttcgag tcacccgcat tgaagaagcc ggtgtttcca gcacccgaat 2610661 ttccgaagcc ggtgttctga aagcccacgt tgaagctgcc cgagtttgag ttcccgccgt 2610721 cgaagatacc gacgtttccg ttgccggcac tcccgaagcc cgtgtttaaa ttgcctgcgt 2610781 tccagaaacc ggtgttgata tttccggcgt tcccgaaacc cgtgttgccg tcacccgagt 2610841 tgaagaagcc gatgtttcca tcacccgagt tgaagaagcc gatgttgttg ttgccggagt 2610901 ttccgaagcc gatgtttcca gtgcctgagt tcagtccgcc gatgccgatc tgaccatcac 2610961 cggtgagccc gataccgata ttgttgttgc ccgtgtttcc gaaaccgaaa ttcccgctgc 2611021 cggtgtttcc gaacccaaag ttgagggtgc cattattccc gccgccaaag ttgaagtcgc 2611081 cggtgttccc gccgccgaaa ttgacatcac cgttatttcc gttggcgagg ttgagcgtgc 2611141 cgaagtttcc gctgcccaca ttgaggctgc cgatatttcc gctgccgaag tttccgctgc 2611201 cgaagtttcc gctgccaggg ttgtagtcac ccgtgtttcc gctgccggca ttgccggtac 2611261 cggtgtttcc gctgccccag ttcaggctgc cgtagttccc gctgcccagg ttggtgccgc 2611321 cgacattgcc actgcccacg ttggtaccgc cgatgttgcc gctgcccagg ttgaggctgc 2611381 cgatgttgcc gacacccaaa ttcaaggtca gctcggcgag gccttgtgca gcgccttgtg 2611441 cagcggccgc cggtgcgtta gccgcaccgc ctagcaagcc cgacaagccc ggcaccgcct 2611501 gctgccatgg cgccaacgcc gccgccgccg atgccccacc gtgataaccc accatcgccg 2611561 ccacatcggc agcccacatc tgctcatagg tggcctcagc agcggcaatc gccggcgcat 2611621 tctgcccaaa cacattcgac agcaccaact gcacaaacgc actgcgatta gccgccacca 2611681 ccaccggatc caccatggcc gcccgcgccg cctcaaacgc gccggccacc gccttagcct 2611741 gaaccgcagc ccccccagcc cgcgccgccg cagcagccaa ccaccccgca tacggcgccg 2611801 ccgccaccac catcgccgcc gccgccgcac cctgccacgc ctgacccgac ccacccgcca 2611861 gacccgaggt caccaaccca aacgactccg ccgccaaccc caactcagcc gccaacccat 2611921 cccaggccgc cgccgccgcc aacatcggcc ccgaccccgc accaaaaaac atccgccccg 2611981 aattaatctc cggcggcaac accgaaaaat tcaccactac cgcccctcct ctaacaaatc 2612041 attctcaacc gcacccccgc gcgttacccc aaacgacacg cggacacccg tcaccacggc 2612101 gccgcccacc cagcggccac cacagctcac cgggtcgtgc ccggaccggg gctgctagct 2612161 gcccttgagc cgcaccgcga gatagtcggc cacgctgctc atcgcaaccc ggtcctgcgt 2612221 catggcgtca cgctcccgca cggtgacggc attgtcctgc agcgagtcga agtcaaccgt 2612281 cacacagaac ggggtaccga cctcgtcctg gcgccggtaa cgccgcccga tagcgccggc 2612341 atcatcgaaa tcgatgttcc agcatttccg taattcggcg cccaggtccc gggccttcgg 2612401 gctcaggtcc gcgtgccggg acagcggcaa caccgccgcc ttgaccggcg ccagccgcgg 2612461 gtccaatcgc agcaccgtgc gcttatccat cccacccttg gtattcgggg cctcgtcctc 2612521 ggtgtacgcg tcgatcaaaa acgccatgaa tgaccgggtc aagccagctg ccggctcgat 2612581 gacgtacggc gtgtaccgaa catcgttgat ctggtcgtag aaagacaggt cgacgccgga 2612641 atgccgcgca tgcgtcgata ggtcaaaatc ggttcggttg gccacacctt ccagttcacc 2612701 ccatggattg cccatgaagc cgaacttgta ctcgatgtcg acggtgcggt cggagtaatg 2612761 tgacaacttg tctttggggt gctcccacaa ccgcaggttc tcccgacgaa tacccaggtc 2612821 gatataccac tgcagccggt tgtcgatcca gtactgatgc cattccttgg cagtcgccgg 2612881 ctcgacgaag aactccatct ccatctgctc gaactcgcgg gtccggaaga tgaagttgcc 2612941 cggagtgatc tcgttgcgaa agctcttgcc gatctgtccg ataccgaatg gcggcttctt 2613001 acgagcagtt gtcaccacgt tggcaaagtt cacgaagatg ccctgcgcgg tttccgggcg 2613061 cagatagtgc agcccctcct cggtctcgat gggtccgagg taggtcttga gcatcatgtt 2613121 gaactcgcgt ggctgcgtcc actggccggg ttcgccggtt tccgggtcgc gaatgtcggc 2613181 caacccgtta ggcggcggat gcccgtgttt ggcttcgtag gcctcgatga gatggtcggc 2613241 ccggtagcgc ttatgtgtga tcagcgactc gaccagcggg tcatgaaaga catcgacgtg 2613301 accggaagcc acccacacct cacgcggcag gatgatcgac gaatcgattc cgacaacgtc 2613361 gtcgcggcca gtcaccaccg atcgccacca ctggcgcttg atgttctctt tgagctcaac 2613421 ccctagcgga ccatagtccc acgccgactt tgtgccgccg tagatctcgc ccgacggata 2613481 gacgaagcct cgccgtttgg ctaggttgac cacggtgtcg atgacgggcg ccacggggtg 2613541 gtgcactccc ttcgagggat cgggcagacg cgcgcagccc gacacgacta cgcgcaaaac 2613601 atcagtcatg gtagcgatcg ggacctgggt ctcctattgc ctttgacatg catcatcatg 2613661 catgtgacag tggaggtcag tggcaggtcc ttcctaatac ggcacttctc gaggtgaaga 2613721 ctccaatatg gtgacgtccc cctcaacgcc gaccgccgcc cacgaagatg tgggtgccga 2613781 cgaagtaggc ggtcaccagc atcccgcgga taggttcgcc gaatgcccca cgttccccgc 2613841 accaccgccg cgggagatcc tagacgctgc cggcgagctg ctgcgtgcgc tggccgcacc 2613901 ggtgcggatc gccatcgtgc tgcaattgcg tgaatctcaa cgctgcgtgc acgaactggt 2613961 cgacgcactg cacgtgcccc agccgttggt cagccaacat ctgaagatcc tcaaggcggc 2614021 gggcgtggtc accggggagc gatcgggccg agaagtgctg taccgacttg ctgaccacca 2614081 cctcgcgcac attgtgctcg acgccgtcgc gcacgccggt gaggacgcaa tatgagtgca 2614141 gccggtgtcc gctctacccg ccagcgggca gccatctcga cactgttaga gacgctcgac 2614201 gactttcgtt cggcccagga actgcacgac gaactgcgcc ggcgcggcga gaacatcggt 2614261 ctgaccaccg tctaccgcac actgcagtcg atggcatcct ccggactggt ggacacactg 2614321 cgcaccgaca ccggtgaatc ggtctaccgc agatgctcgg agcaccatca ccaccatctg 2614381 gtgtgccgca gctgcggttc caccatcgaa gtaggtgacc acgaggtgga ggcgtgggcg 2614441 gcggaggtgg ccaccaaaca tggattctct gacgtcagcc acaccatcga gatcttcggc 2614501 acctgctcag actgccggag ctaggacacc accgaggtcg agcgacccca cacgccgaac 2614561 gtgcaaccat ggcggctccg cccggcgtgt cgccgccacc agggcacgtt cggcgcacag 2614621 cgagcacact cctagccaac gagcgcgctg cggatcgtgg cgcccgtctc cagcaccaaa 2614681 aggatcaacg tgcgcaacgc gtcgtcggtc aaaccggtgc cgggaaagtt gtatcgcagc 2614741 atcacgtccg cggtgttgga cgcgggccga ccagagcttc gccgcgcagc cttctcgctg 2614801 accttttccc gcaggctcac cgagccgaag ttgatgtcgc gtgcctgctt ggccacttgc 2614861 tcggtgagcc tcttggtcaa cggcagatcc cacgccagga tctgggtaag ggacaccagc 2614921 tcgaggtcct cagcgatgct cactacccgc aacgaggcaa acgtaccgtc atggcgaacc 2614981 gtcagcgcgc cgtcgggttc ctcctcggca ggaaggacat cgcgcagtat cgatgccagc 2615041 cggtccggta gggatggcac taggcgctcc cgaaccgccg agtgcgcgac gcgtattcct 2615101 cgcaggccgc ccacaagtcg cggcggtcat agtcgggcca gagcttgtcc tggaatatgt 2615161 attcagcgta ggccgcctgc cacagcatga agttgctgga gcgctgctca cccgaggtcc 2615221 gcaggaagag gtcaacgtcg ggaatgtcgg gtcgctgcag gtggcgggcg atcgtggatt 2615281 cggtgatccg ctccgggttg agcctgcccg cggcgacctc acgagcgatt tcgcgggtgg 2615341 cttcggtgat ttcggtgcgt ccgccgtagt tgacgcaata gttgatggtg atgacgtcgt 2615401 tgcttttggt catctcctcc gcgaccgcca actcattgat gacgctacgc cacagccgtg 2615461 gtcgtgaacc cacccaccgg atccggaccc ctagcttctt tagggtgtct cggcgccgtc 2615521 gcaccacgtc gcggttgaag cccatcagga agcggacttc ctcgggcgaa cgcttccagt 2615581 tctccgtgga aaaggcgtag aggctgagcc acttgatccc aagttcgata gcaccgcaag 2615641 cgatgtcgat caccaccgcc tcgcccatct tgtgaccttc ggtgcgggcc agcccacgtt 2615701 gggtggccca gcggccattg ccgtccatga caatggcgac atggttgggc agccggtcgg 2615761 ccggtattcg tggcgcggcc gctttcgaag tgtgctgcgg tggccggcag gggcctccgt 2615821 agggcgctgc cggcaactcc gggaagacga cgggccacgt cgacgtatca ggaaaggtcg 2615881 ggtagtcgtc gggggccgga ggcagctgcg ggaagttgct ggacgtccgc ttccgtgcat 2615941 ccctagccac cggctatatc ctgcccgatc agcgcggcgc gacgttcggc aaccgatcga 2616001 tcggcctggt agaaccgctc caccagcggc aacgttttca gctgccgttc cagatgccat 2616061 tgcaggtgtg cggccaccaa cccgctgaca tggctgcggg ccgattgcgg cgccgcctcg 2616121 gcggcctccc aatcgccgtc gtacagcgcg gacatcaggt ctacgacgcc cagcggcggt 2616181 gtggtcgagc cggccggacg gcagtgtgcg cagacactgc ccccggtcgc gatgtgaaac 2616241 gcccgatgcg gaccaggcgt ggcgcagcgg gcgcactcgg tcaacgctgg tgcccagccg 2616301 gcgatgccca tggcgcgcag cagataggcg tccaacaaca ggtcccgagg ccgctgtcca 2616361 tcggccaccg cccgcagcgc gcccaccgtg agccggtgca gagccggagc gggcgcccgc 2616421 tcctcaccgg ccaggcgttc ggcggtttcc agtatcgcgc atccgcaggt gtagcggccg 2616481 taatcggcga cgatgtcggt ggcgaacgcg tcgacagaga caacctgggt gacgatgtcg 2616541 aggttgcggc cagggtgcag ttgcacctcg atatgcgcga acggctccag gcgcgcgccg 2616601 aatttgctgc gggtgcgtcg aacacctttg gccaccgcgc ggaccaaccc gtgatcgcgg 2616661 gtcagcaggg tgacgatccg gtcggcttcg ccgagcttgt gctggcgcag cacaacagcc 2616721 cggtcccgat acagccgcat cacaatagtt ttgcaccccg ccacgacatc gcgggtatcc 2616781 gcgccgatag tctcgtaccc cgtggttggc gcttctgggt cggatgctgg agccatttcc 2616841 ggctctggca accagcgcct gcccaccctg accgacctgc tctaccagct ggccacccgc 2616901 gcagtgacgt ccgaagagtt ggtgcgacgt tccctgcgcg cgatcgatgt gagccagccc 2616961 acattgaacg ccttccgggt agtgctcacc gaatccgcgc tggccgacgc ggcggccgcc 2617021 gataagcggc gggcggccgg cgacacggcg ccgctgctgg gcattccgat cgcggtcaag 2617081 gacgacgtcg acgttgctgg agtgccaacc gccttcggca cccagggcta tgtcgcgcct 2617141 gctaccgacg actgtgaggt cgtccggcgc ctcaaggcgg ccggagcggt gatcgtcggc 2617201 aagacgaata cttgtgaatt gggccagtgg ccgttcacca gcggacccgg gttcggacac 2617261 acccgcaacc cctggtcgcg ccggcacacg ccgggtggat cctcgggcgg tagcgcggcg 2617321 gcggtggccg ccggcctggt taccgccgct atcggctccg acggcgccgg cagcatccgc 2617381 atccccgcag catggacaca cctagtgggc atcaagccac aacgcggtcg gatctccacc 2617441 tggccgctgc cggaggcgtt caacggcgtc acggtcaacg gcgtactggc ccgcactgtg 2617501 gaggatgcgg cgctggtgct ggacgccgcg tccggcaacg tcgagggcga ccgccaccag 2617561 ccacccccgg tgacggtgtc cgatttcgtc ggcatcgccc ctggaccgct gaagattgcc 2617621 ttgtcaaccc acttcccgta caccggcttt cgggccaagt tgcatcctga gatcttggcc 2617681 gcgacccaga gggtgggcga ccagctcgag ctgctcggcc atacggtggt gaaaggcaat 2617741 ccggactacg gcctacggtt gtcgtggaac tttcttgccc ggtccaccgc gggcctctgg 2617801 gaatgggcgg agcggctagg cgacgaggtg accctggatc gtcgcaccgt atccaacctg 2617861 cgcatggggc acgtgctgtc gcaggcgatt ctgcgcagcg cgcgccgcca cgaagccgcc 2617921 gaccagcgtc gggtcggctc gatcttcgac atcgtcgacg tggtgctggc accgaccaca 2617981 gcacaaccac cgccaatggc gcgcgcgttt gaccggttgg gcagcttcgg caccgatcgc 2618041 gccatcatcg ccgcgtgccc gtcgacctgg ccgtggaacc tgctgggctg gccgtcgatc 2618101 aatgtgccgg cggggttcac ctccgacggt ttgccgatcg gtgtgcaact gatgggaccg 2618161 gccaacagcg agggcatgct gatctcgctg gccgccgagt tggaagccgt cagtggctgg 2618221 gcgaccaagc agccgcaggt gtggtggacg agctaaaacc ccagtcggcc aagctgtttg 2618281 gggtcgcgct gccagttctt ggcgaccttg acccgcaagt cgagatagac cttggtgccc 2618341 agcaggtttt cgatctggct acgggccgcg gtacccacct cccgcagccg ggcaccaccc 2618401 ttgccgatga cgatgccctt ctgactatct cgctcgacgt acagcgcggc gtgtacgtcg 2618461 atcaggtcgt cacgcccctc acgtggactg acctcgtcaa tcaccaccgc cagcgaatgg 2618521 ggcagctcat cgcgcacgcc ctgaagggcc gcctcgcgga tgagctcggc catcagaacc 2618581 tcctcgggtt cgtcagtcaa ctcaccgtcg gggtaatacg cggggccggc cggcaatgcc 2618641 gcggccagta cgtcgatcaa caggtctacc cggtcgccgg tcatcgccga aaccgggaca 2618701 atctcggccg cattcgtgac gagttcgctg accgctacca gctgggcgac cactttttct 2618761 ttcggcacct tgtcaatctt ggtgacgatg accaccagtg tcgtattggc agggccggtc 2618821 gaacgaagct gctcgacaat ccaccggtct cccggaccga tcgcctcgtc ggcggggatg 2618881 catagcccga tgacgtcgac cgccgcgtag gtttcgcgga ccaagtcgtt gagccgcttg 2618941 cccagcagag tgcgcggccg gtgcagaccg ggagtgtcga cgaggatgat ctggaagtcg 2619001 tcgctatgca cgatcccacg aatagcgtgc ctggtggtct gcgggcgcgt cgacgtgatt 2619061 gccactttcg ccccgaccag cgcattggtc agcgtggact tgccggtgtt cggccggccg 2619121 accaaacaca caaagccaga atggaattcg gtcatgccgg tttcctcgcc gaacgtgaac 2619181 acagggagac ttttcccgct tttttccgcc gtgaatgcac gttcggcgtc atagcgggtt 2619241 acctgcccga tcggtgacga tgatcgcagc ggtcggggcg agttcgcgga cggcggcaat 2619301 gcccggatcg tcaacggacc cggccaccaa gacggcggcc tgaagaccgg tcgccccact 2619361 ggacacggcc gcggccaccg ccgcctgcag accggtcagc tcgagcgccg acagggccac 2619421 cggcgccgcc gcgtacgtgc ggccgtcgac atcgcggacc gccgcgccgg caccggcctc 2619481 ggcacgtgcc atcgccgccc gtgccaacac aaccagcttt gcgtcctcgg catctagctg 2619541 ctcagccagg gtgatcggcc tcctcatcat cggcgccgtc gggttcggcc ggactcagca 2619601 acacggtgcc gattcgtacc cgtccccgat gatcggtgcc accctcggca tgcagccgca 2619661 ggccatgcga tatcacctca gcgccgggca gcggcacccg gcccagttct agggccagca 2619721 gcccgcccac cgtgtcgacg tcaaggtcgt cgtcgaactc cacgccgtac agctcgccga 2619781 cgtcttcgat gggcaggcgc gccgataccc ggaaacgctt gtcgcccaag tcttccaccg 2619841 gcgccgtctc ggcctggtcg tactcgtcgg caatctcgcc gacgatctcc tccagcacgt 2619901 cttcgatgct gacgaggccg gctatcgcgc cgtactcgtc gaccagcagg gccatgtggt 2619961 tacggtcgcg ctgcatttcc cgcagcaatg cgtccagcgg cttggagtcc ggcacgaaca 2620021 cagctggccg catcacccgc gcgacggtcg tttcgcggcc gccgttcgtc gagcagaacg 2620081 tctgctcgac aaggtctttc aggtacacca cgccgacgat gtcgtcgacg ttctcgccga 2620141 tcaccgggat tcgggaatgt ccgctgcgta ccgccagggt cattgcttga ccggctgtct 2620201 tgtcgctttc gatccagatc atctcggtgc gcggcaccat cacctcgcgg gctggggtgt 2620261 cacccagctc gaagaccgac tcgatcatcc ggcgctcgtc ggcagcaacc acgccccgct 2620321 gctgggctag gtcgacaact tcgcgcagct cgatctcgga tgcaaacggc ccgttgcgaa 2620381 agccgcgccc gggggtcagt gcgttgccca gcaacaccag caagcggctg atcggcatca 2620441 acaaccacga gatcagccgc agcggaaggg ccgtggccaa cgagatggaa tatgcgttct 2620501 ggcgcccaag ggtgcgtggc cccactccca cgacgacaaa gctggccaaa accatgatgc 2620561 ccgcggcaag atacaacccc cacaccatgc tgaagtggta tcggatgaaa accaccagca 2620621 gcgcggtcgc ggtgatctca cagctggtcc gcagcaacac gaccaggttg acgtaccgcg 2620681 gccggtcggc catcacctta cgcagcgacc ccgcgcccgg ccgctggtcg cgtactagct 2620741 catccacccg ggccggagac acggtgctga tggcggcgtc aatcgcggcg aacaacccac 2620801 ccaaaccgat caatacgatc gagccgagca gctggtagta cccggtcaaa ggtcaaaata 2620861 ccttgacttg tcgagcaacc ggcggtcctt ctcgtcctgc cggtcgtgct ggtaggcctc 2620921 aacctggtcg gctacccact cttcaagcaa ccggtcctgc agggcgaaca tctctttttc 2620981 ctcgtctggc tcggcgtggt catagccgag caggtgaagc acaccgtgga tggtcagcag 2621041 ggccaattcg tggcccaggc tgtggccggc cgcagccgcc tgctcagcgg cgaattccgg 2621101 gcacagcacg atatcgccca gcatggacgg tcccggttcg ggggcgtcgg ggcgaccacc 2621161 cggctcgagc tcgtccatcg ggaagctcat cacgtcggtc ggcccgggaa gatccatcca 2621221 gcgcatgtgt aggtcggcca tcgccgcggt gtccagcagc agcatcgaca attcggcgca 2621281 cggattgacg tccatcttgg cgatgacaaa ccgtgcgaca ctgactagtt ccgcttccga 2621341 gacgtcgatg cctgactcgt tggctacctc gatgctcata agatgctcac gcacccatca 2621401 tcggcgaccg cgggcgccgg acgcccgccg agccgcccga ttcagccccg acccgggctc 2621461 ctcgtaccgc gcataagcgt ccacgatctc cgagaccaga cggtggcgta ccacatccac 2621521 gctggtcagc tccgcgatat ggatgtcgtc gatgtcttcg aggatgtcga ccgccgcccg 2621581 cagacccgac cgggcgccgc ccggcaggtc gatctgggtg acatctccgg tgaccacgac 2621641 cttggatccg aagcccaggc gggtgaggaa catcttcatc tgctcggccg tggtgttctg 2621701 cgcctcgtcc aggacgatga acgcgtcatt cagggttcta ccccgcatgt acgccagcgg 2621761 tgccacctcg atgactccag cggacatcag cttcgggatc agctcggggt ccatcatgtc 2621821 gtacagcgcg tcatagagcg gtcgtaggta cggatcgatc ttttcgctca gcgtgcccgg 2621881 cagaaatcca aggcgttcac cggcttccac cgcgggtcgg gtcaagatta tgcgggtcac 2621941 ctgcttggtc tgcagcgcgt ggaccgcttt ggccatcgca agataggtct ttccggtgcc 2622001 ggccgggccg attccgaaga cgatggtgtt ggcgtcgatc gcgtccacgt agcgtttctg 2622061 gttgagcgtc ttgggccgga tcgtcttccc ccgacgcgac aaaatgtcta gagtgagcac 2622121 ttcggccggt gactcgttgc ctgtgccgac cagcatggca acgctgtggc gcactacctc 2622181 tggggtcagc gactggccgc tggccacgat cgcaatcagt tcggagatca cccgttcggc 2622241 tagcgcgaca tccgccggct caccgcagag ggtcaccgcg ttgccgcgca cgtgcaggtc 2622301 ggcactcagc gtgcgttcga gggcacgcag attttcgtcg gccgaaccga gtaagcccac 2622361 gacgaggtca ggcggaacgt cgatgctgct gcgaacttga gcgtcggctt gccgggctcc 2622421 agccgcgtca gcagcgcggg tctcgcggga cgtcacctgg cttctgatgc ctgctttctg 2622481 gcctatcgac tggaacctgt cgaactgacg agtgttgaag tttcattcta acgccggtca 2622541 gggacggcgt cggagcacaa cgcacaacgc cgagcccgtg cgcgctcacc tttatccgcg 2622601 atgaggcctg tctgtgtccg cccgttcgat gccgacgaac ggcagccact ctcgggcctg 2622661 ccagctgtgc ctgccggtgc gcggcaacat cccgaccgtg cccatgccgg tccgccaagc 2622721 cgacgatcac cgctcaagct gggccagccg tgagcgtcgg cgccccaatg attcgggtgg 2622781 cgggctagta atcccttcga cgggggtttc cacggggtcg ctggtctgac tgccgcgcca 2622841 ttggagggcg ctgatggcca cggcgacctg gatgatcgtg tggtcgaggg gtcggggtag 2622901 gagtcgttgg gcggtttcga gtcggcgtag cagggtgttg cggtgggtgt gtagtacgtg 2622961 cgcggcgcgg gaggcgttgc attgttcgtt gatgtaggtc aatacggtgg tgagcagttg 2623021 agggctggcc gattcgaggt ccccgagggt gctggtgatg aaatcggctg cgctgtccgg 2623081 gttttcggtg agtaccgcga tcatgtggat gtcggcgaag aaagccaggc gctgttggga 2623141 tcgaagccgg gccagcatgc gttgggtggc cagggcgtcg cggtggctgc gccgaaaccc 2623201 gtcgattcct cgtgcggtgg tcccgaccgc gatgcgggca tgtggtgcgt ggtcgagcac 2623261 ctggtggatt cggtcggtgt cgagggttgc cgcgtcgctg acccataccc agcgggtggc 2623321 cgcgctggcg accgcgatca ggggctgtgg gcatcccagt gcgcggccga acgcgcgtgc 2623381 ggtgtggtcg aggtggtttt ggttgtcgtc gggatcgtca taccagatga tggcggcggt 2623441 gtgggatcgg tctagggggt agcccagttt ggcttcggcg ctttggcggc tgatgggggc 2623501 gccgtcgagg atcagttcga cgatgcggcg gtgttcggcg tggacgtcgc gggtcagttc 2623561 gtcgtattcg agctgcattt gtgcggccag gccggccagg gtggcgtcga tgaattcgga 2623621 ggccgagcga aacggcaggg tgagcagttc gtgcagttct tgggggtcgg tggtgagtcc 2623681 gaacgcgatt tcggtccatc gttgccaggc gacgttttgt ccgacgcggt agacgtccag 2623741 cgctgaggcg tctagtccgc ggcgcaccag gtcgcgggcc atgcgtagcg ggtcggggcc 2623801 gaggtttgcc ggtacgggtt ggccgggttt gcgcaggttg gcggtggcga agtggatcag 2623861 gtgggagcgg ttggcgcggc tgaccactgt tgctagggcg gggtcggcgg cgatggatgg 2623921 gtgggcggcg agggtggcgc ggtcgagttc gtcgagccat tccggggtgg ggtgcagggc 2623981 gacttttgct gcttggcgga tgagttcacg tccgcgcggt gtgggtttgg gcaataccac 2624041 gagatgagac tagttgccta ggtgcgttgt gcaccacgtt ctggggaatg ttggtgaggt 2624101 ttactccttc agccgtggtg gacgtttagc cggtgtggcg cgttcgggat tattgggatg 2624161 aacggttacc caccgcggcg gcagcgggcc gtgcgcctgc cgagtcgtcg acatttagcg 2624221 ttcaggaggt ctcgatgtcg ttggtcagcg tggccccgga gttggtggtg acggcggtac 2624281 cggatgtggc gcgcatcggg tcgtcgatcg gtgcgcccga caccgcggcg gcggcgagac 2624341 cgaccaccag cgtgctggcc gccggcgccg atgaggtgtc ggcggacgtc gtggcgctct 2624401 ttggctgggt cgcccgttga tggtgatggg gccgctgggg cgcccgagac cgggcaacgg 2624461 cggggccggc ggctcgggtg cgcccggcca agccggcgag tgggattctg acgaccggct 2624521 accggcgtgt cacgtcgcag tattcacagt cgctcgctga tgcatcccaa cgagatgtga 2624581 gcacaccgac agcacccaat gccaccgcgg ccgcggtcga tgtccgcagc accgtcgggc 2624641 ccagccggac cgcgacggcg ccggcatcgg tcagcgcggc aagctcgtcc ggtgcgatcc 2624701 caccctcggg tccaaccacg agcatcaacg aaccagcttg cgccgcagcg atatccacaa 2624761 tccgctcggt cgcctcctcg tgcaggacca gcaccgccgc gccggcggcc acctcttctc 2624821 ggacacgctg tacaagcatt ggcgtcgaca acacgccgtc gaccggcggg atgcgcgccc 2624881 gacgagattg ccgggccgcc gagcggacca ccgctcgcca ccgacgcaaa cccttgtcga 2624941 cacgcgcccc gtcccagttc gccacgcagc gcgccgcctg ccatgccagg aacgcgtcgg 2625001 ctccggcttc ggtggccagc tcgattgcca attcggagcg ttcggatttg ggcagcgcct 2625061 gcaccacggt caccggtggc cgcacgggcg ggacgctcca gcgcctaagc acccgggccc 2625121 gcagcccgcc acgtccggcc tgctccacca cacagcgggc caggcgaccg acaccgtcac 2625181 caagcaccaa ctgctcgccg ggacggatcc gccgcacggt ggcggcgtga aatccttcgt 2625241 cgccgtctac gaccgccacc gcaccggtgt cgggcagtgt gtcgacgtaa aacagcatcg 2625301 ccaccatgtg cgggccgtga ttagcgcccg gtgaaggtct cgcgcaaccg gctgaacagt 2625361 ccgccggcgg cggcgtgggt cgaacggacc tcggccacct cgcggtcgcg gcgacccttc 2625421 agctcgcgca gcagttcgat gtcctggtga tccagccggg tcgggaccac cacctccacg 2625481 tgaacgtgca ggtcgccacg cgtgttggaa cgcaggtgcg gcattcctcg accgcgcagc 2625541 gtgatcaccg aacctggctg cgtgccgggt ggaatggtga tctcgctcag gccgtccagg 2625601 atggcgtcca ccgtgaccgt aacacccagc gccgcgtcga ccatgggcac cgaaaccgtg 2625661 caatgcagat ggtcaccttc gcggacaaag acgtcgtgcg cctgctcatg gacctcgacg 2625721 tagaggtcac ccgccggccc tcccccgggc ccgacctcgc cctgagcggc gagccgaact 2625781 cgcatcccgt cgccgacacc ggccgggatc ttgacgctga tctcccgacg ggcccggatc 2625841 cggccatcgc ccatgcattg ctggcacggg tcggggataa ccaccccgac gccgcggcag 2625901 gtgggacacg gccgcgacgt caacatctga cccaacagcg atcgctgcac ggtctgcacc 2625961 tccccgcggc caccgcaggt gtcgcagggt atcggaaccg aatcgccgtt ggtgcccttg 2626021 ccctggcacc ggtcgcacaa caccgcggta tcgacggtga cctgcttggt gacacctgtt 2626081 gcgcactctt cgagatccag ccgcattcgt agcagcgagt ccgaacccgg ccggacccgg 2626141 ccgatcggcc ctcgggacgc cgcgccccca ccgaaacccc cgccaaagaa cgcctcgaac 2626201 acgtcgccga ggccgccgaa gccaccgaac ccattgccgc ccgcagcggc gctctccagc 2626261 ggatccccgc ccaggtcgac gatgcgacgt ttgtccgggt cactgagcac ctcgtaggcg 2626321 acgctgattt ctttgaattt cgcctgcgca gcctcgtccg ggttgacgtc gggatgcagc 2626381 tcgcgcgcca gcttgcggta ggcgcgtttg atgtccgcgt cgctggcgtt cttgctcacg 2626441 ccgagcagcc cgtaataatc gcgtgccacg cttgattctc ctatgccgcg tctttatgcc 2626501 gcttctcaag cggctatcca caaaccctgc agcaggtgcg cgttcatcga gcacccagga 2626561 cgtcgccgat ataaagagca accgcagcca cgctggcgat agttcccgga tagtccatcc 2626621 gggtggggcc caccacaccc ataccgccgt agacggtatg ggcggtaccg taggccgtcg 2626681 acaccatcga ggtgcccacc atctgctcag acgccgtctc atgacctatg cgaaccgtca 2626741 ccttgccggc ttcctgctga gccgccagca gccgcaacac caccacctgc tcctcaagtg 2626801 cttccaatat tgaccgcagt gaaccaccga agtccgcagc gttgcgggtt aggttggcgg 2626861 taccgcccag caaaaggcgt tcctcggtgt gctccactag cgactccagc aatacggtcg 2626921 ccgcgcggcc cacggcgtcg cccaatccgc cggcgccgcc cagctggctg gcgaggtcgg 2626981 cgaccgccac cgaagccgct gaaagcttct tgccttccag cgcctggccg agtatttcac 2627041 gcagctgggc tagctggtga tcgtcgatga catcgccgag ttcgacgatg cgctgatcaa 2627101 cccggccgga gtcggtgatg accaccatca gcagccgggc cggtgtcagc gcgatcacct 2627161 ccaagtggcg aacggtcgac gttgacaacg tcgggtactg cacgacggcc acctggcggg 2627221 tcagctgcgc cagcaatcgc acggcacggc gcagcacgtc gtcgagatcg acaccggatt 2627281 caaggaagct ctggatcgcc cggcgctcgg ccgacgatag gggtttgacg tcctcgagcc 2627341 ggtcgacgaa ctcgcggtag cccttctccg tgggcacgcg tccggaactg gtgtgtggct 2627401 gagtgatata gccttcggct tccagcaccg ccatgtcatt gcggactgtg gccgacgaga 2627461 ctcccaggtt atggcgttcc accagggatt tggagccgat cggttcctgg gttgcaacga 2627521 agtcggcgac gatggcacgc agcacctcaa agcgacgctc gtcggcgctt cccatcgact 2627581 gctcacctca cttcttacgc tgcctgaccg gcttcatttt acgttctcgg cggccactga 2627641 ccgtcatcta gcaggcgtct gccgatggtc agggggcgtg tgccccgact aacgtgtcca 2627701 gcatgatctc gtcgggcttc agccactcgg tagggaagat ccgccaatga tcttcaaagg 2627761 ggtgcgggaa ggcaagccgt atcccgagca tggactgtcc tatagggact ggtctcagat 2627821 accgccgcaa cagatccggc tcgacgagtt ggtcaccacg actacggtgc tcgcgctgga 2627881 ccgcctgctg tcagaggact ccacgtttta cggtgacctt ttcccccacg cggtgaagtg 2627941 gcgaggcacc acctatctcg aggacggctt gcaccgggcg gtgcgtgcgg ccctgcgcaa 2628001 ccgcaccgtg ctacacgcgc gagtgttcga catggacgcg tcaccaggcg ggcggcgtag 2628061 ctgaacagcg ggctgaagcc ggcccgccaa tcagttccct gcggcctgca gcaactccat 2628121 cgccgatgcg cgtgacagca tccagccgcc ttgattcacg aacgtgacgt tctgcgtgac 2628181 cggcgacgag agcttcggac ccgagacgga aacgtcggcg gtggccgaac cggcggccgc 2628241 cggctggatg ttcgtcacgc tgaacgacag cggcagatcc ccgtgctcgg cggccttctt 2628301 cagcttgtgg tcggcgatgc gcgcctcggt gcccccgatg ccgccctcga ccagactgcc 2628361 cttgttcgca aacgacacgt tgggatcggc gaggctgttg agcaggctgg tcaactgggc 2628421 ggcggtcggg acgtcagggg cggatgccgg gtccaacggc agtggcgcgc cgaagacgac 2628481 cggctgcatc tggtatacga ccgggccgcc agccatgatc gaagtcacac cggccgcagc 2628541 ggcgccgatt gcagccgcgg cggtcagacc tgcggcgatc gatttcacca tcttcatggt 2628601 tgtgttcttc cgttcgtttg cccgtcgatt gtgcgtttgg ttcaaactac cggtgcacgc 2628661 gccgggcaag tctgtgtcgt agctgtgagc gagcggtcag tcctcggcca tggcgtcacg 2628721 caggctcttc ggccgcagat cggtccagtt cttttccacg tagtccaggc aggcggcacg 2628781 gctggcttcg ccgtgcacca cgcgccagcc ggccgggata tcggcgaaca ccggccacag 2628841 gctgtgctgg tcttcgtcgt tgaccagcac gaagaatgcg ccgttgtcgt catcgaaagg 2628901 attggtgctc accgcttctc cttgtgtcgt tgtgtttggt cggcgtaaag atgccggcgc 2628961 cgagcacccg gtccgacaac aagccgaggc agctcaggtt ggggaacccg ggcccctggg 2629021 tgagtccgga cagggtgggc aggaacagct tgggcgtgac gtcggtgact gccaagtcgt 2629081 agccgatcgc ttcctgcagg cggtcggcgg tcagcggtcc acccagtccc agctcgagca 2629141 ggtcgagggt gtgctgactg aacagtgagg tgaaccacag cggatcggcg cccgagccgt 2629201 cgatgacgag atcgaatccg tgcacggtct cgaagttctc gctgccccgg ttggtgctca 2629261 gcgtcaaccg gatctgcccc tgacggccca ccgcgtgggc gacccggcca cgcagatgat 2629321 ggatgcggtc atcggccagc agcgcttcct gcacggtcgc cgagaacact cctcggtcgg 2629381 tgcgggccag cgcgtcgcgc cgttcgtcga acgtcaaggc cgcccagtcg gtcggatcgg 2629441 aaaacagtga gttctcgaag aatccctcgc cgcgggtgaa cagggttacc tgcggggaga 2629501 tgacggtgat ggttgagacc cgatgccgga acagctcgtt gagcatcgat gcggccgtct 2629561 ctccgccacc gatcaccgcg acccgctcgg cgttgatccg gtcgtggccg gcggcacggt 2629621 cccagaactg tgcgattgag agcacgcgcg ggtttccggg cagtagcgac ttttcagcct 2629681 ggccgggccc ggtgatcatc aacgcgtcgg cctgcacggt ggtctcgtgg gtgcacaacg 2629741 cccagcggtc accggtgacg gcgagccgtt cgacctcgcc gtggatcacc ttgaggccaa 2629801 tgtgatcggc cacccaggct aggtactgac tccacctgcg atgggtgggc gccgggcggc 2629861 cccggtcgat ccattccgcg aacgacgcgg tggcgatcag atacgactgc cagctgtagc 2629921 gggtcatccg ctcgtccaat tctgcgttgc gccgtggcac cagcgccgac cggtagggaa 2629981 aaccgacatc cttttctggg ctggtgccca gccggtgggc tccgtcggtc cagccaccgc 2630041 tggcctgcca gttggccccg accccgatgc gttcgacggc gatcacgtcg ggcacgtcga 2630101 cccccatgtc acgcagcacg gatgccttgg ccgcgaccgc caccgccttg gctccagcgc 2630161 ccaggaccgc gagcgtcgga ttcatgctgt tatctccgcc agcgcgccct gccacagtga 2630221 ttgcagcgtg gcgacgtcgt cggcggacag gatgtcgggc agcgtgcgcc accgcgtggc 2630281 tagcacggga gcgtcggcgg gcccgaggag cgccgccagc accgtcagtt cgtggcgcac 2630341 cggctgttcg ggttcaggca gttgccccac atcagccagt agtgcgcggt cgaccgccag 2630401 atctcccacc ccgacgtgca ggctacccag atagttcagc agcagctggg gttcgcggtg 2630461 ggcgcgtagt cgctccgcgg tatcggcgcg caggtaccgc agcaggccgt aatcgatgcc 2630521 gctgccgggt atccgcgcga agtcggtcgc gccgtcgcag tggatgcgca gcggatagat 2630581 cgcgctgagc agcccgaccg tgtcgctggt gtcggcagtc ttatcgacgt ggacgtccgc 2630641 gcggccatgc gtctccaacg ccaacagcgg tgctggtgtt tgttgaccgc gttgccggcg 2630701 ccaggcggtc accatccgcg cagcggcggt agccagcaga tcggtcatcg accgtcccgt 2630761 cgaaagcagc cgcgcggtca gatcggcgtc ggagatcgac atggtgatcg ctagctcacc 2630821 aacccggtcg gtctgcggcg ccaccctgcg ggcacccaac ggcggatcgg cgccctcgag 2630881 ttcggcgacc cagaaatcaa cgctatccag cgccttagcc cgctgcgcca gcagccgcga 2630941 ccactgccgg tagctggtgt tctcgcgcgc tgggctgggc gcgcgcccgg ccgccagcgc 2631001 gtgcaggccg gcgtcgagtt cacccagcac aatccgccag gaggctgggt ccatcgccag 2631061 cacatgggcg gtcagcacca gaacaccggg cccgtcgggt tcgcgcagcc acaccgccga 2631121 gagcagtcgg ccggcctggg ggtcgagact cgccagcacg ccaagagtct gctcggccac 2631181 cgcggtgacc agttcaccgc tgacccaaac ctcgctgaga atgtccgttt tcggttgtgc 2631241 gacaagggcc atcgcatccc ggtcgaaccg gcaccgcaac acctcgtgtc cgtcgacgac 2631301 cgcggccaac acggcatcca ggcgttcgcg ggtgatccgg tcgggcaacc tgatgacctc 2631361 ggtttgtgcc agccggcgcg ggtcgccgta ctcgtagagc caatgagtgt tgggtagcac 2631421 cgggatcggc tcgccggcat cgttggccgg tgcctgccat gcggcatcgg agtcaatggc 2631481 cgccgcgagt tcacggatgg tgtcgcactc caccatcagc ctggcccgca acgcaatccc 2631541 acgacggcgc gcggcctgca ccaccgacag cgccacgatg ctgtctagac ccatctgcaa 2631601 aaagcccgcg gtgacatcga cgttcgaggt ttccatgaca tcggcgaacg cctcggccag 2631661 caccagctcg gtcggtgtct gcggcggagt tgccggtcct tcggtgacat tgattgccgc 2631721 caaagcgttt tcgtcgatct tgccgtgtgg agtcagcggt aactcgtcga ggacgacgat 2631781 atggtgcggg actagataac gcggcaaccg ctctagcagc atcgcccgca attcggccac 2631841 cggtggcggt tgtggtccgc ctgccacata cgccgtcagc cgggggccac tggcatggcc 2631901 gcgggccgtc acatggcaac cgtgcaccgc atggtggccg ttgagcaccg cggcaatctc 2631961 acccggctcg acgcggaaac cgcggatctt cacctggtca tcgctgcgcc cgaggaactc 2632021 cagcccaccg tcgggcaggc ggcgcaccac atctccggtg cggtacattc ggctaccgcg 2632081 cccgtttggc tcagcgacaa agcgcgccgc agtctcggcc gggcggccga ggtaaccgcg 2632141 ggtcaactgg gcgcccgcca gatacagctc gccggcgacg ccatcgggca ccggccgcag 2632201 ccaggagtcc atgacgtagg cgcgggtggt gcaggtcgga cgtccgatga ccggtcgcgc 2632261 atgctcagca acggcggcga ccacggcttc gaccgtggtc tcggtaggcc cgtagcagtt 2632321 gaaggccgtc atggccgtgc gcgcgcagtt ctgctggatc atccgccacg tcgcggcgcc 2632381 caaggcttcg ccgccgagcg caagcaccgc caacggcgcc cggtcgagca gtccagcgtt 2632441 gtgcagctgg gcgaacatcg acggcgtggt gtcaatcatg tccagaccga atcggtcgat 2632501 cgcttcgacc agcgcccctg cgtcccgctg acgatggtcg tcgacaatgt gcaccgcgtg 2632561 gccgtcaagc agtgcgacca acggctgcca cgccgcgtcg aaggtgaacg accaggcatg 2632621 cgcgattcgc agcgggcgcc cgagccgctg ggccgccggc cgcaacacgc gctcgatgtg 2632681 gtcgtcggcg taggccgaca gcgcccgatg ggtgccgatg acacctttcg gggtaccggt 2632741 ggtgccggag gtgaaaatca cgtaggccgc ctggtccacc ggcaccgtga tggcacggtc 2632801 gtcctcgagt atgtcagcgc caaccgaagc ggcgaacacg ccctcatcga tgaccaccgg 2632861 agccgatgtc tggcgcaaga tctcggcgac acgctcaccg ggcatcgccg ggtccagcgg 2632921 cacgatcatg ccacccgcct tgaggaccgc cagcatggcg gccacgtagc gcggaccacg 2632981 ggacagcgcg acggccaccg gggtctcgcg actcacgtcc gcgcggcgca gcccagtggc 2633041 cagccggtcg gccaatgcat ccagctcccg gtacgtcagc tgaccatccg cccaactgac 2633101 cgccaccgag tcaggctgtg ccgcagcgat ttcggcgaac cgggtatgca ccgcgggtgc 2633161 cgacgtcgtc acatccggca ggccgggtgc ggtcggatcg tgctcgccgt ccagcagaat 2633221 gtcgacgtcg cgcagcggcc gatcccaccg gctgaccaag cgctgtaaca cagccagcac 2633281 ccgcctgccg aggctttcgg gcgccatcgt gcccagcgca ccgtcgagca cctccactag 2633341 cagcgtgagc tcaccggtgc tgcggtgcgc ggcgacggtc accggaaagt gcgacaaact 2633401 ctctagcgcc accggacgga acgtcacccc gtttgcgacg aactccgcgg tgcccaccac 2633461 ctcgccgggc gggaagttct catacaccag tagggtgtcg aacatctcac cgataccggc 2633521 gatggcacga aactcgttga aaccgagata gctgtggtcg cgcaacatgg cgaattgacg 2633581 ttgtaggaca gcgcattgcc cgccgacggt agcgcgggcg tccaggcgga cccgcagtgg 2633641 caccgtattg atgaacaggc cgatcatcgt ttccacgccg gacagttcgc tgggcctgcc 2633701 ggacaccgtc acaccgaacg tcacatcgcc acgaccggtg aatgctgaaa gcgtggtagc 2633761 ccaagccatt tgaacaagtg tgctgatcgt gacgccacgg gtgcgggcgg catcggccag 2633821 ctccgcggtg gcttcacggt caaggcgcac ttcggtgcgt cccggaatac ccggctgcac 2633881 aggagtgtcg gcgagtgccg gcgataacag agtcgggccg tccaggccat tgaggtggtc 2633941 cgcccacatt gcgcggctag ccgtctgatc gcggccggcc agccagccga tgtagtcgcg 2634001 atacggccgc ggcgctgccg gcaacgcggc gacgtgacca ccagcccgat acaaggcgag 2634061 cagctcggag acgaacagcg gcaacgacca tccgtcgatg acgatgtggt gcgcgacgat 2634121 gaccagatgc caacattcgt ccggtagttc gatgagcagg aaccggatga gtggtccgcg 2634181 gccgacgtcg aagcggcgcc ggcgctcttc ggctgccagc gccccgacct cactggggtg 2634241 ggcgcgcacg tgacgccaaa gcacctcggc actggatggt attacctgca cgggccggct 2634301 caggttcccg tgtaggaagc tcgcccgcag gttggggtgc cgggtcagca tcgcggcagc 2634361 gcagtcgcga agcaaggcga tgtcgagcgg gccggccgcg tcggccgcca tcgcgatcac 2634421 atacgggtcg gcctctgcgg cctcagagcc ggactccgcg gcgaccagtg tcgccctaga 2634481 aaacagtccc tgttgcaatg ggctgagcgc catcacatcg tcgatggcgc cccgcgcgtc 2634541 ggctcgcgtc acggccactg gtcccatgac gcggtcaggg ccgacagttc gtctggggaa 2634601 agccctgatg tgctcatcgg cgcgtgatgc ttgtcgtccg gctcggcctc gacgtgaggc 2634661 ttggcgtcga ccgcggcggc gagctcacac aaaacgggat gctcgaaaac catccgcgcg 2634721 gtcagcggta tcccgccatc tcgagcccgg gcagccacct gagtcgcgag gatgctgtcg 2634781 ccgccgaggt tgaagaagtc gtcgtagcgt ccgacctccc ccacctcgag cacgtcggcg 2634841 aggatggcag ccagcgcgcg ctcggtttcg gtgtcggcgg gctcggccgg caccggtgcc 2634901 gcctgcgccg ttggcagccg ttcgatttcg gccagcagct ccagttggcc gtcggccttc 2634961 cacactccgc gctcgccgtt gcggtagagc cgagaacccg gttgcgcggc gaacggatcg 2635021 gcaacgaatc gggtcgcggt ctccgatggc cgggccaacc gggcaccgac ggcgggaccg 2635081 ccaccgtaat aaacgtcacc caccacgcct accggaacgg gcttaagtgc gtcgtcaagc 2635141 aggtacaccc gggccgttcc ggcgcccgca ttggaccggt ccaaaatgcg ccgtctggct 2635201 tgcgcgctga ccatctcgac ctcgcgcagc ggttggtccg gacggtcggc gaacgcctcg 2635261 acgacacgga ctagccagtc ggcgaagcgt tgtgcggtgg cgcgctcata caactcggtg 2635321 cggtagatga cgtggccgcg gtactcgtcg ccgcaggcga agaagttgac cgatagatcg 2635381 gcttgcgcgg catcgaatgt cggctccagc acgcgcaacg tggtgtcacc gtcgggcccg 2635441 gtgtcgatga cgtggtcttg cggcatttgt tcgcgaacgt gcacaacaat gtcgaacaac 2635501 ggattgcggg acagcgaccg ctgggggttg accgcctcca ccacctggtc gaacggcagg 2635561 tcctgatgtg catacgctgc cagcgccatc tgcctggtgc gctgcagcac ctcgcgcagc 2635621 gtggggttcc cgcgcaggtc gttgcgcaac accacgatgt tgatgaagaa cccgatgagc 2635681 tggtccaggt tggcctcgct gcgaccggcc accggggcgc cgatggggac gtctaccccg 2635741 ccgccggcct tgtgtaacac caccgcgacg gcggcctgta gcagcatgaa ctcggtgaca 2635801 ccgaggtctc ggctcacggc agccaatttg tcgcggatcg cggcgccgag acgaaattcg 2635861 accgcgtcac cggcaccgct gagcagggcc gggcgcggga agtccgggcg cagaccggtt 2635921 tcgcctgcca ggccccccag ctggcggatc cagtagtcgc gttgcggacc gacgatgccc 2635981 gcaccgtcgt cgagtagcgc cgactgccac acgctgtagt cggcgtactg caccggcagc 2636041 ggtgcccacg acggccgttg tccggtgctg cgggcccggt atgcggtcag cagatcggtg 2636101 aacaacaccc cagccgacca gtggtcgccg gcgatgtgat gcaccaccag cgacaacacg 2636161 gtctgctccg gcgtgctcag cagcgccgcc cggatcggcc agtcggtttc caggtcgaaa 2636221 acgtaacctc gctcgttgtt cagttcggct cgcagccacg cggcgtcgga cccggcggcg 2636281 caccgcaccg gcacctcggc gggcggctgg atgatctggt gtggcacgcc gccgatctcg 2636341 cggtagacgg tgcgcaggat ctcgtggcgt gccaccacat cggtgatggc cgccgcgaac 2636401 gcgttggtgt cgcagggccc atgcaatgcc gcggcgaagg gaatgttgtt gacggcgttg 2636461 ggcccgtcga agcgatagtt gaaccagcta cgcatttgag acgacgacaa tcgcactggc 2636521 ccgtcatgat ccacccgggt cagccgcggc ctcgccgaat ccgaatccaa cgtatcgatg 2636581 tgtccggcca acgcggtcac cgtggcgaat tcgaagatct cccgcacacc gacatcgacg 2636641 ccgaacgcgt tgcgcacggc cgcaacgagt ttggttgcca gcagcgagtg accgccgagg 2636701 tcgaagaacg agtcgtcagc acccactcgg tcgcggccga gcagctcacc gaacagttgg 2636761 gcaaggcgcc gctcggtggc ggtctgcggc gcgcggaact cggtgtccga cgcgatctgc 2636821 ggttccggca gcgcggcgcg gtcgattttg ccatgcgcgg tgatcggaat ctcatccagc 2636881 acaacatagg ccgcgggcag catatattca ggcagtgccg cggccacccg ggcgcggatg 2636941 cggtcgagat cgacgccgac atcggcgggt ccgtcgccgc ccgcggcggg tgtcacgtag 2637001 cccaccagac tcttgcccag ccgcggcagg tcgctaacca ccacaacggc ctgcccgacc 2637061 gtagggtcga ccgcgatggc cgctgctacg tcaccgagtt cgattcggaa tccgcgaatc 2637121 ttgacctgct cgtcggcacg gcccacgaac tcgatgtcac cgtcagcatt gcggcgcgcc 2637181 agatccccgg accggtacat gcgggaaccg ggattaaacg ggtcggcaac gaatcgctcc 2637241 gcggtcagcc cggcgcggcg atggtatccg tatgcgacat gcgtccctcc aatatagatc 2637301 tcgccgatca caccggtcgg caccggctgc aacgaatcgt cgagcaggtg catggtggtg 2637361 ttgatcttgg gccggccgat gggcacgatg cgggtgccct gtgggcccac cactttaaac 2637421 cggctggcgt tgatcacggt ttcggttgga ccgtagaagt tgtgcagcag cgcatcgaat 2637481 gtcgcgtgga acttgtcggc cacctcaccg ggtagcggct ccccgccgat gggtacccgc 2637541 tgcaacgtcc gccactggct cacacccggc agcgacagga acagcccgag tagggacggc 2637601 acgaaatgca ttgccgtgat gccctcgtcg cgcaacaggg cggtgagata tccaatgtcg 2637661 gtgagtcccc cggggcgtgg tatcaccatc cgcgcgccac aggccagcgt gccgaagatc 2637721 tcggcgatcg agacgtcgaa gctgggtgag gcgacctgca gtagccggtc ggtgtcgtcg 2637781 acgtcgtatt cgcccttgaa ccagacgaag tactcggcga cggggcggtg tggcaccgcg 2637841 acacctttgg gcaatccggt ggtaccggac gtgtagatga gataggccgt gttgtctggc 2637901 cgtagcggcc ggattcgatc ggcgtcggtg gggtcgtcgc tgcggtatcc ggccagctca 2637961 cgtactggcg tgcgcagcac cagtttcgcg tcgcagtcgg cgaggatgaa atccagccgg 2638021 tcttgcgggt agctgggatc cacgggcaca tacaccgccc cggacttgac cacccccaag 2638081 gccgtgacga tcaggtccgg cgatttgtca agaagtaccg cgacccggtc ttcgctgcct 2638141 atcccctgct cgatcagcca gtgccccaac cggttcgacg cctcattgag gtcgtggtag 2638201 gtgaagtgtt ggccctcata caccacggcg gtggcgtcgg gagtccgcgt ggtctgctcg 2638261 ttcaccaggt ccacgagggt tttgacaggg gtatcgaacc gctcgccgcg cgacacctcg 2638321 cgcagcctgg cggcgtcgcg ctcatccatc agcgccagcc ccgacaacgt gttgtcgggg 2638381 gcggccagcg cattgtcgag cagcacaccg aagtgtcgaa gcatctgctt ggccagggcg 2638441 ggttccagga tctccaccag gtgttcggcc tcgaccagca cacccgcgcg gtcgaattcg 2638501 accatgaagc ccaacggcag ctgcgtgatg ttgctgcgca ggtcgtagcg ctcgcactcg 2638561 atgcctggcg ggttgaatcc gccgccgtcg ggctcccgga aaccgaagct gacccgggtc 2638621 atgcgctcgg caccgtgccg gcgatcgggg ttcagttccc ttaccacgcg gtcgaggttg 2638681 atccgttggt gtgcgaacgc cccgctggcg atgtcgcggg tggcggtcag caactcccgg 2638741 aaactcatcg ccgattgcgg tcgcagccgc atcgctaccg tgttgccgaa atagccgatg 2638801 gcatcttcgg ttccggcgcc acggttgagc accggagccg ccacgaggaa gtcgtcactg 2638861 tgggtgtagc gatgcaccag ggcaccgaac gcggccagca gcaccatgta gggagtgcaa 2638921 ccggtgttct tcgccatcgt ggccacccgc gcagcggtgt cggcgggcag ccgcaacgtg 2638981 gcgcgcgcgg cacgccaact ggtcggcaca cacgttccgg ctgggccggg aagttccagc 2639041 ggctctggcg gatcggccat gatcgcgcgc caatagttga ggtcggcctc ggtagtgtcg 2639101 ggtccggatg cggccgacgg acggtgttct ggccccagat cggcccctag gtcagctcgc 2639161 gagtacgcct gggtgagatc ggtgaagaac acccgccacg aaccatcatc ccaggcgatg 2639221 tggtgggcca ccaacagcag cacgtgttcg tcggcagccg tgcgcaccac cgtgattcgc 2639281 aatggcgcgt cgcgggaaag ctcgaaggga gcgcagaatt cgcgctgagc caacacctcc 2639341 aggcgcagcc gctgggcgcg ttgggacagg tccgtcaggt cgtattgtgt ccagccgggg 2639401 cgaagatccg cgtgcacggt cggctgggcg actccgtcgt cgccgacagg gtaggtggta 2639461 cgcagtatcc gatggcgacg ggcgacggcg ttgactgcgt cgcgcaacct ggccagatcg 2639521 atgtcaccgg tgatgcggta ggacacacag atgttgagta acgcaccgct ggggtcggcc 2639581 atctgcacga accacatccg ggcctggccg tcggagagcc gatcgtcagt gtgcgggcca 2639641 atgtcctgcg cagccgagga caggccgcgg tcggcgagcc tgcgacgcag cagctccaat 2639701 cgggcctcgt cgaggcgggc gccgatgtcg gcggtattag tcacgcgaaa tgtccacttt 2639761 ctgtgcggtg tgtgagcgct cgtcggcatc ttcgagtttc gcgacaagtc catcaccggt 2639821 gatgtcgccc atgagcgtgg ccagcgacac cgtcgcgccg attgatcgtt tgagtcggtt 2639881 acgcaagtcc agtgccagca tggaatcgac accgagatcg aacagcgatt cctgcaggtt 2639941 cacctcgccg gcctgcggga tcccgagcac ggccgccaat tgggtgcgca ccgcgtccac 2640001 gatcgtcagg ttggggtcgg ttggaccctc gtaccgttcg aattgcctgc tgtccaacaa 2640061 catctgcaac cgggccgcgt cggcggcgaa cactagcggg tcgacagtga attcgtgcag 2640121 gctcgcctcg atcgcctgct ggggcgccat ctggcggagt ccagaccgct cgacgcgggc 2640181 gatcgtaacc gcatccgcga ttccccgagc tggttcgccg gccttggggg cctgccatag 2640241 gccccatttc accgccacgc agtgcctgcc ctgggcgcgc agctgggcgg ccatcacgtc 2640301 gagcagccgg ttggccgccg agtacgcgac caccccgtgt ccaccccaca cccccatcac 2640361 cgaggaacac agcagggttc gcacatccgg gcgcagcggc cacagctcga tcatctgggc 2640421 caggccgagc accttggccg cgaagttgtc aacgacggcg gccgacgtca cccccggtgc 2640481 ggtaccagag atcacgctgc ctgccgcgtg cacgatcaac gaggcgccga cgccaccgta 2640541 ttcggctgca atcgctgaca actgggtggg atcggtgata tcgcacggcg gcgacacgat 2640601 cacggtgcca tgttgctttc tgagcatggc caccgtcgcc tgatccgcgg cgcgccggct 2640661 gagcagcacg atgcgccgtg cgccatgctc ggcgagatac cgcgcgtagt gcatcccgat 2640721 ggcacccgcg ccaccggtga cgacgacatc gtcgagcacg ccggagtcca acgaccagtt 2640781 cgggacggcc ggggcatcgg cgagggttcg ctcgaacagc gtgtacccgt ttaccgagcc 2640841 gcgtagcgcg gtctcaccga agccccgcag taccgccgtt atgaccgaga cgccgaggac 2640901 cgggtccaag tcccacgacg gcaagtccag gtggctgaaa gtctgttcgg gatgctcgaa 2640961 tccgatgctt cgatgcatcg cggccagcgc ggcctggccg gccgacggca ccgcgtccgc 2641021 tgcgtcgacc tgctcggcgc cgacggtgac cagacatacc gattggcaac gggcaccgat 2641081 atgcatcgga tagtccagca aaccggcccc gacgaggtcg gcgagtgcac cggcggcccg 2641141 gacggcgtcg gtgtgttcga agtcgggcgc gatcaccagg atcaactcgg cgtcccgcgc 2641201 agcactcagc tcggtatcgg ggtgcgaatc aattgctgcg cacagtgttt gagccagcgc 2641261 gcggtgagca ccgagatcga gcactgcgag gtgacggtgc cgcccagcga ccggtgtcga 2641321 cggcaccatc cgttcccacc gctcaaccgc aatggtcagt ccggacaccg gcggcagcgg 2641381 ttcggggtgc gcccacatcg gcaccgcacg catcggcgcg ttcgggaacc cggacagatc 2641441 gacgtcgccg tcgagtgggt caccgcccag gtcaccccac gggtagccag ggtcagcgac 2641501 cgccgcgcta acaatattcg ccgacaacgc atcaacaaac cgctcgccac gacgtgccga 2641561 cccgaccagc acagcgggac cgtccggcag gttggcggcg ccctcacagt tctgaccgat 2641621 cgcaaacaac agcgcgggat gggccgatat ctcgatgaac gcccgtgctc cacagcggat 2641681 tgccgattcg acagcgcggt cgaaacgcac cgtatggcgc aggtttgcgt accagtagtc 2641741 gccgaaagtg gtgcctggcg ccaccacgtc gccggtggtt ccgccgatga attgcactgg 2641801 cgcttccata aattcggagt caggcagctg ctcgcataat tcatcgcgga gcgattcgag 2641861 cacgctggta tgcaccggga agcccacggt gatcccgcgg gcgaagtgac cgctggaccg 2641921 gactgtgtca acgatggccg ctaccgcttg gcgctcaccg gacacggcga cggtcgagga 2641981 ggcattgacc acagacagtt ccagccagcc gccggtggtc gcgatcagcg cgctcgcgtc 2642041 ctgttcaccg atgcccagcg ccgccaccgc atagcgacca ggcaagcggc ccaccacgtt 2642101 ggcgcgggcc gccaccacgg ccacagcatc cgacaaggtg atacttcctg cgagataggc 2642161 cgccgctact tcgccgaggc tatgaccgac tgttagatcg ggcagcacac cgcaggaacg 2642221 ccatacctcc gccagcgcaa cggcatggac gaactgcgcg ccttcgatct cgatctcgca 2642281 gaacgcttgc cgctcatcgg ttccgggcgg ggcgatcagg tatggcagcg gcgagtcgac 2642341 accagcggcc gcaaatgcgg cggcgcacgt gtcggtcgcg gtccgatagg tcggcagctc 2642401 gcggtaggcg acggcgccca tgcccggcca atgaccaccc tggccgggaa agacgaacgc 2642461 ctggcgcggg gccgagccca acgacgaccg cgcgatgagc ggatgctcgc gtccggcggc 2642521 cagcgcgcgc aagccctcgg cgagttccag ccggtcggcg gcccgaagca ccgcccgatg 2642581 ccgacggacc cgtcgggtct tgcgcagctg ccgagccact tcggtcacgg tcgtagccgg 2642641 aaagcgctcg aggtagtcgg cgatggcccg agcgtccggc ccgatcagtt cctcggcatg 2642701 ggcgctgagc aaaaccgcaa cccgcccatc gggcagctgt ttgggggcca tcacacctcc 2642761 ccacactcgg ggccacgctc gggcgcggaa acggtgtccg gcatcgaaac gatcacgtgg 2642821 ctattggtac cgctcatccc gaacgcggac accgccgcgg tgcgccatcc gtcaacggcc 2642881 cgccacggcg tgagtttgtc ggccagccgc agaccctgtt tctcccaatc gatttcgcgg 2642941 ctgggctcgt cgacgtgcag tgtcggcggg atcgcggcgt gctgggcggc cagaatgacc 2643001 ttcacaaggc ccagcccgcc cgccgccgcc tgagcatgcc cgatgtttga cttgaccgat 2643061 cccaacagcg gcccgcgtcc ggccggggcg gtgccgtagc tggctgccag tgaccgcaat 2643121 tcggtgcgat cgccgagccg ggtcgcggtg ccgtgccctt cgaccatccc gacatcggcg 2643181 ggcacaactg ctgcctgcgc gatggcgcgc cggagcagtc gcgtttgcgc gtcgccgctg 2643241 ggcgcggtca gcccgtcgct aagtccatcg gagttcaggc aactggcacg cacctcggcg 2643301 aggacacgac gccggtcagc ggttgcccgc gaccggcgct gcaggaggaa catggcggcg 2643361 ccctctgccc aggcggttcc gctggcgtgc gcgctgtagg gccggcagtg gccgtcgtcg 2643421 gatagcgcgt gctgcttgga gaactcgacg aaatagccgg gcgtacccat cacgcacacg 2643481 ccgccggcga gtgccaggtc gcagtcgccg gcccggatag cttgaaccgc ggtgtgaaag 2643541 gccgccagcg ccgacgaaca cgaggtatcg acggtcagcg ccggcccggc caggtcaagg 2643601 gtgtaggcga tgcgcccgga gatgacaccc agcgacgtcc cggtgatcag atggccactg 2643661 tggtgggaga attcggtcaa agcgggaccg tattcgagcg ccgaggcacc gacataacag 2643721 cccacatcgt gaccggccag gtcatcggga ttgatcccgc tgttctccag ggtgcgccat 2643781 gctactcgca gccccacccg ctgctgcggg tccatcgccg tcgcctcgcg cggtgagatg 2643841 cggaagaact caggatcgaa tgtagttgcg ctggaaagga atccgccaag gttgtggatc 2643901 ggtttgaatc cgtttcgacg cgacccgtcg aacagctcgc gaagtgccca acctcgatcg 2643961 gtggggaacg gtccgagtcc ctcgcgctgt tcggagagca gtgtccagta gtcgtcggcg 2644021 gtttcgacac caccgggtgc ctcgatggcc agcccgacga tgacgaccgg gtcgttatcg 2644081 gacatcggca ctcaccatcc gggccacggc gtcgaggtga tcgttgagat agaagtgacc 2644141 accatcaaag tgcgacagcg tgaagcgacc ggaggtgtga gtctcccaac tggtcaacat 2644201 ctcccggctg atgcggtggt cgcggttgcc gccgaccgcg tggatgttgg cgcggatgcg 2644261 cacgtcgggt ggacatgaat agccgctgag ggcccgatag tcggccttga ccgccggcac 2644321 cagcagttca acgaattcct cgtcctcgag cagcacggga tcggtgccgc caagatccac 2644381 catgtcggcc aggacgtcac ggtcggcggt cggcaacggt ccggacgcgg ccaccgtcga 2644441 cggagcctga ccggaggaag cccacagtgc acgtaccggc acgccattgc gctcggcgag 2644501 gcgagcgaac tcgaaggcca ctatcgcacc catgcaatgg ccgaacagcg tcagcggagc 2644561 cgtcaggtgc cagtcgcccg cctcgaacag ctcgagcgcc agcgcctcga tgctgtctgc 2644621 cgccgggtgg ctgcgccggt cagcccgctg cgggtactgc accacgaacg tgtcaacgtc 2644681 gttggccact aacgattttg ccaaccaccg gtaagccgcg gcagcgccgc cggcgtgtgg 2644741 aaacaccagc accgcgccgg gcttgtcagt accggtgaac cgcttcaccc acggtttgaa 2644801 ggctggctgt gcgggctgct cgatcggatc gagcgccgcc atcacgtcgg cacttgtcat 2644861 attcgcgatt tctaagtaca cctcggcgac cagttcgagt cggtcggcgt tggcttcccg 2644921 accggtgagc aactgggcca acgcggcaat ggtcctggcg gcaaacatgt cggcgaccat 2644981 caggctcggc gaatccagcc accgccggat accggcgacg acctgggtcg caagcacgga 2645041 atcgccgccc agggcaaaga agtcgtcgtg cacgcccacg gcatcgttgg cacggcccag 2645101 gatgtccgcg acgatgcggc gcagtgcccg ctgaagcacc gttcgcggcg ccgcataggg 2645161 tgccgatctg tcgccagacc gctcgacctc ggcggcaagc agggcgccaa cctccgcgcg 2645221 gtcgatcttg ccgctgtcgg taaaggggat gcggtctagc agcgtgacgt ggcgcggaat 2645281 catgtgcgcg ggcaccagat cggcgagctg ctgtcgaatc gactccgcgg tcacgccggc 2645341 atcgtcgacg cagaccgccg cggccagcac atcggacccg ccaggaagca cggtggccgc 2645401 cgccgcgtgc acaccgggca agcgctgcag cgcggcttcg atctcgccga gttcgacgcg 2645461 gtacccgctg atcttgacgc ggtgatcggc acggccgacg aactccaggg tgccgtcgtg 2645521 ccagtagcgg gccagatcac cggtgcgata ccaggtgcgg ccgtcatgct cgacgaagcg 2645581 ctccgcggtc agctcgggac ggccacggta accccgggcg attccgcgac cggacaccca 2645641 caactcaccg gccacccaat cggggcagtc gtcgccgctg tcggccacta cccggcaggc 2645701 gttgttggga aacgggacgc cgtatggcac cgaggcccag tccggtggca gattggccgc 2645761 gtcctggacc tcgaaaatgg ttgcgtggac cgcggtttcg gtggctccac ccaaccccgc 2645821 gaaccgtgcg ctcggcgctt gcacctgcag gcggcgggcc aggtcgggac gcacccagtc 2645881 gccgccgacg gccaccgctc gcagcgacga cagccggccc ccgccgactt cgagcagcat 2645941 gtccaaccag cccggcatga aattcaacgc cgtgacctcg taagtgtcga taagccgggc 2646001 ccaggcgtcg ggatcgcggc gctgcgcttc gtcgaccacc acgatcgctc cgccggagcg 2646061 cagggcggcg aagatgtcca gcaccgacat gtcgcactcc agcgtcgcca gggcaagcca 2646121 gcgatctgcg gcgcctagct cgaagtgccg gatgaaggtc tccacggtgt tcatcgcggc 2646181 gtcgtgcgcc acctcgacac ccttgggttc cccggttgag cccgaggtga acaacacata 2646241 ggcgagcgcg gtgggatcgc taggcccggg cacgaattct gccggcgcgg cggcaagcac 2646301 gtcagccagc aacagcgtcg ggaccggcac ccgcacttgg catggcgggc cgcaaacgag 2646361 cgctaagttg accgaaccgg tcgccaggat gcgctccgcg cggtcgcggg gctggtcgac 2646421 gccgatcggc agatagaccc cgccggcggc caaaatcccc agcacagccg ccacttgttc 2646481 gcccgttttc ggacccagca ccgcgacggt gtcgccgact cgtaggcccg cagcacgcag 2646541 cgccgcggcc accgccgatg cctggtcgcg cagttgggcg tagctcaagt cgccggaact 2646601 ggcgaacacc gccggcgcgt cgggctgctg ttgggcctgg cggaaaaacc cgtcgtgcag 2646661 cgcctcggtg ctgggggcgg cggtgcgacc gttcagcgcc gcgcgcaccg cgcgttgcgc 2646721 ggcgggtagc gcggacgggc tcggcgcatc ccaggcgtcg tccccggcgg ccaaccggag 2646781 caattcgtcg acctggtggg tgaacatggc gtcgatgacg ccgggtgcaa agaccccctc 2646841 gcggacatcc cagttcacca gcacaccgcc gtcgaactcg gtgacctggg cgtcgagcag 2646901 cacctggggc ccctgcgaaa tgatccatcc gggtgtgccg aattgctcgg tgacgtccgg 2646961 gcagaaaagg tcgccgagcc ccagcgcgct ggtgaatacc accggtgcca gcacctgggt 2647021 gccacggtgg cggctgaggt cacgcagcac agacagcccg gggtatgcac tgtggcctgc 2647081 ggcgctgcgc agggcttcct gcaccgcctg cgcccgcgcc gccgccgtgc gcgcaccggt 2647141 cagatcgacg tcgagcaaca gcgaggaggt gaagtcaccg accagcaggt cgacgtctgg 2647201 atgcagggcc tggcgactga acaacggcag gttcagcagg aaccgcgacg acgctgacca 2647261 acgcgccagc acgttggcaa aggccgcggc cagcgtcatc gccggggtga tgccgcgggc 2647321 ccgggctcgg gcgaacaacg cgtcgcgggt ctgcgggtct agccagtgcc agcgccgggt 2647381 gctgcggcgc cggtcgcgtt cgccgccggc ccgggtaggc agcgcgggcg gatccggcag 2647441 ctgcgggatg cgctgcgccc accagtcccg gtcggcgtcg cgaaccggtt ggggcagcgt 2647501 ctcctccgcc tcgatagcct gccggtattc ctggtaggtg tagcccagtg ccggcggttc 2647561 acggccgtca tagagggccg ccaggtcggc cagcaagatg cggtagctca tcgcgtcagc 2647621 ggcctgcatg tccaggtcga catgtaggcg ggtgcgctcc cccggtaata acgtcaacgc 2647681 aagttcgaat accgcaccgt cgagctgctg gtgcgatttg gcgtcgcgga tccccgccaa 2647741 ccgctgatcg acgacatccg gggccacgtg acgcaggtcg gcaacactga tgggaaagtc 2647801 gcgagatccc gccgccggcg ggatgcgctg ggtgccgtcg ggcaagaact gcacccgcag 2647861 catcgggtgc cgcagcgcca accgggtggc cgccgcgcgg agcctgtccg gatcgacccg 2647921 ggcaccatcg aactcgacgt agaggtgccc agctaccccg ccgagctgtt ggtggtcgtg 2647981 gcggccgacc cacatcgcgt gctgcatcgg cgccagcggg aaaggctcgc cttcctggga 2648041 taacccggca tcccctggtg cggcaactgc cgtgggcgcg acgccggtgc cggcggacac 2648101 cagttgggac caggcctcga ttgtgggtgt ggcggccagt gtggcgaagt cgacggcgat 2648161 gcccttccgg cgccagcgcc ccaccagcga catcatccgg atcgagtcca ggccctgacc 2648221 aacgaggttg gcgccggggt gcagagcatc ggcgcggaca ccgagcaact ctgcgacctc 2648281 ggcgcgaatg atctccgagc acgccgtagc atgcaccaca aaccctcccc tgttagcaca 2648341 ggctgcccta attttagtgg ttaccctatc ttcgaaccac gcacctgcgc taccagcccc 2648401 cctgttaagg agcccacatg ccaccgaagg cggcagatgg ccgccgaccc agtcccgacg 2648461 gcggactggg tggctttgta ccgttccccg cggatcgggc cgcgtcgtac cgggcggccg 2648521 gctattggtc ggggcgaacc ctggacaccg tgctctccga tgccgcgcgg cgctggcctg 2648581 accgcctcgc ggtggccgac gccggtgatc gtcccggcca cggcggcctc agttacgccg 2648641 aactcgacca gcgggccgac cgggccgccg cggcgctgca cggcctgggc atcacgccag 2648701 gcgaccgggt actgctccag ctgccaaacg gctgccagtt cgcggttgcc ctgttcgcgt 2648761 tattgcgggc gggagcgatc ccagtgatgt gcctgcccgg tcaccgcgcc gccgaattgg 2648821 gccacttcgc cgccgtcagc gcggccaccg ggctggtggt cgccgatgtg gccagcgggt 2648881 tcgactatcg gccgatggcg cgcgaacttg ttgccgatca ccccaccctg cgccatgtca 2648941 tcgtcgatgg cgatccggga ccgttcgtgt cgtgggcgca gctgtgcgcc caggccggca 2649001 ccggttcgcc ggcaccgccg gccgatcccg gatcgccagc gctgctgctg gtctccggcg 2649061 gcaccactgg catgcccaaa ctcattccac gcacccacga cgactacgtg ttcaacgcga 2649121 cggccagcgc cgcactctgt cggcttagcg ccgacgacgt ctatctggtg gtgctggccg 2649181 ccggccacaa tttcccgctg gcctgcccgg gcctgctcgg cgcgatgacc gtcggggcca 2649241 ccgccgtgtt cgcccccgat cccagcccgg aggccgcctt cgccgccatc gagcgccacg 2649301 gtgtcaccgt caccgcgttg gttccggcac tggccaaact gtgggcccaa tcctgtgagt 2649361 gggagccggt gacaccgaag tcactgcggt tgttgcaggt tggcgggtcc aagctagaac 2649421 ccgaggacgc tcgccgggta cgcaccgcgc tcaccccggg cctgcagcag gtgttcggca 2649481 tggcggaggg gctgctgaac ttcacccgca tcggcgaccc acccgaagtg gtggagcaca 2649541 cccaggggcg gccactatgc ccggccgacg aactgcgcat cgtcaacgcc gatggtgagc 2649601 cggtggggcc cggggaggaa ggcgaactct tggtgcgcgg gccctacacg ctgaacggct 2649661 attttgctgc cgaacgcgac aacgagcgct gcttcgatcc ggacggcttc taccgcagcg 2649721 gcgacctggt ccgccgccgc gacgacggca atctggtggt caccgggcgc gtcaaggatg 2649781 tcatctgccg tgcgggagaa accatcgccg ccagcgacct cgaagaacag ctgctgagcc 2649841 atccggcgat cttctcggcc gcggcggtgg gactacctga ccagtatctg ggggaaaaaa 2649901 tctgcgctgc agtcgttttc gctggagctc cgattacgct tgcggagttg aacggctacc 2649961 ttgaccggcg tggtgtggcc gcgcatacgc gacccgacca gctggtcgcg atgccggcgc 2650021 tgcccacaac gccgatcggg aagatcgaca aacgagcgat cgtccgccag ctcggcatcg 2650081 cgacgggtcc cgtgacgacc cagcgctgcc attgactgac gtcaacaagt tgaattgact 2650141 gcgttgcatg accgacggtg ttccggcccg cgggtcactt cgatcacgcg gcgcggtagc 2650201 ggtgagctcg atggtgttgc ggcccatcac cggggcgatt ccgccagacg ggccgtgggg 2650261 gatatgggcc tcgcgccgga tcatcgccgg actcatgggc acgttcgggc cctcgctcgc 2650321 gggcacccga gtggaacaag tcaactccgt tctgccggac ggacgccggg tcgtcggcga 2650381 atgggtgtat ggaccgcaca acaacgcgat caatgccgga cccggtggcg gcgccatcta 2650441 ttacgtacac ggcagcggtt acacgatgtg ttcgccccga acccaccggc ggctgacatc 2650501 ctggctgtcg tcattgaccg ggctaccggt attcagtgtc gattaccgac tggcgccgcg 2650561 ctaccgtttc ccgaccgcgg ccaccgacgt gcgggcagcc tgggattggt tagcgcacgt 2650621 atgcggctta gccgcggagc acatggtgat cgccgcggat tccgcgggtg gccatctgac 2650681 cgtcgacatg ctgctgcaac ccgaggtcgc cgcccgacct ccggcggcgg tggtgttgtt 2650741 ttcgccgctg atcgacctca ccttccggct gggcgccagt cgtgagctgc agcgccccga 2650801 tcctgtcgtg cgcgctgacc gtgcggcccg gtcggttgcg ctgtactaca ccggagtcga 2650861 tcccgcccac caccggctgg cgctcgatgt tgccggcggg ccaccgctgc caccgacgct 2650921 gatccaggtg ggtggagccg agatactcga ggccgatgcg agacaactcg atgccgacat 2650981 ccgcgctgcc ggcggcatat gcgagttgca agtgtggcct gatcagatgc atgtgttcca 2651041 ggccctgccg cggatgacgc ccgaagcggc caaagccatg acctatgttg cccagttcat 2651101 ccgcagtaca acagcacgtg gagacctctg aacgttactg gcgtgcaacc agataaggcg 2651161 tcaatgtgga tagcttttcg caagtctcct cgaattcgcg ctctggctcc gattcttcga 2651221 tgatgccggc gccggcccgc agccaagtcc gcccgccgac ctggtatgcc gcccgcagcg 2651281 tcagcgcggc gtctagcccg ccatccgccg aaagcatcac caccgcaccg gaatacagcc 2651341 cacgtgggca ctcatcgagg cgaaagatgg cctcaacgcc agctgctttc gggattccgg 2651401 atgcagtgac agcaggaaaa agggcttcca gggcggccat ccggtcgctc gatggatcca 2651461 accgtgctct gatggtggag ccgaggtgct gcacactgcc gcgctcgcgc accgtcatga 2651521 aatcgatgac cgcagcactc cctggttcgg cgatgtcggt aatctcctca agcgaagagc 2651581 gcactgaaat ggcgtgctcg acaatttctt tggagtttga ttccaggtca tcacgagcca 2651641 gtcggtcaat ggcgggacca cggcccaagg cgcgggtacc ggccaacggc tcggtgatca 2651701 ccactccgtc ggcgcgcacc gccgtgacga gttcggggct gtaacccaga gcacggattc 2651761 cgcccaactg caacaaaaac gacctcaccg gggtgttgtg ccgacgcccc agccggtagg 2651821 tcaacggaaa gtcgatcgcg aaaggcactt cgacacaacg ggacagaatc accttgtggt 2651881 agcggccggc agcgatttca tcgacggcta ccgccacccg acggcggaag ccggatggat 2651941 cgtcggagac gtcgacggag cgggactgcg gcacctctcg caccccggtg gcgagtaatc 2652001 ggtcgatggc ctcgcggtgg cgaatcccag catcgaacag gcgaatctcc ttttcgctca 2652061 ccatgatccg ggttcggggc gaaaacaccc gggccagtgg ggtgtgcggc gccagccgct 2652121 gctgcaaccc atagcggtgc acgccgaatt cgaaggcgac ccagccaaaa gcttgatcgg 2652181 tttccagcaa cagccgatcg acggcttcgc ccagggccgc tcccgggcga cccgaccatt 2652241 gctgtcgccg cgtaacgcca tcacggatga cgcgcagttc gtcgctgtct agctccacca 2652301 tcgcctgcac accggcggcc aggacccatt ggccgtcgca ctcgtagagc aggtaatcct 2652361 cgtcgacgga ctcggtaacc accgccgcca gctccgctgc caggtcggcg gggttgacac 2652421 cggcgggcat cgggatggac gacgacgcgg tgctgacggc gcctgtcgcg acgctgagct 2652481 cggacacagc tagtaaatgt agcctaacct acttaatggg tcgcagcccc ccggggtcgt 2652541 cgcatgtcca acgtgctcga ctggaagaaa atgctcgtcg ggagcaaatg gcaccagccg 2652601 gggcggcgac aggacccacc cacggccgga cggtccgcgg actgcgtttc gcagcgtaat 2652661 catttccgca ggcagaggcg gtcgcggccg gtgctcgccg gttaccatgc ccgccaactc 2652721 acgcacacga aatcgtgaaa cctttgccaa ccgtttactg gctagctaca aagcaaggtt 2652781 ttgccttcgc cggaattctc ctaacatcac tcactaacca cgtagaccat ccggtcgacg 2652841 acgtagtcgc ggtacgcgtg gctcgccaag ctcggtatgt ccgctgggtc tgcccaggca 2652901 tcgcccgatc gttagccagt caacagagag gacccgacga tgttcgtaat ccggctcgcc 2652961 gacggcgaag aagtccacgg cgagtgcgac gagctgacga ttaacccagc aaccggcgtc 2653021 ctcacggtct gccgggtcga cgggttcgag gaaaccacca cgcactactc gccgtcggcg 2653081 tggcggtcgg tgacacaccg caagcggggg gtcggcgtta gaccatccct ggtctcaacg 2653141 gctcaataag cccgagccac actttctaga ttcgacttga tattcctggt cgctcccctg 2653201 acgctgggtg cttcctggat cgccgcacca ggtatgggag gcgccaatgc tgcatgagtt 2653261 ctgggtgaac ttcactcaca acctgttcaa gccgctgctg ctgttcttct atttcgggtt 2653321 cttgatcccg atcttcaagg tgcgattcga gttcccctat gtgctctacc agggcctaac 2653381 cctgtatctg ctgctggcca tcggttggca cggcggcgaa gaactcgcca agatcaagcc 2653441 gtccaacgtc ggcgccatcg ttgggttcat ggtggttggc ttcgccttga acttcgtgat 2653501 cggcaccttg gcatacttcc tgctgagcaa gctgaccgcc atgcgccggg tcgacagggc 2653561 gacggtcgcc ggctattacg ggtcggactc ggcagggaca tttgccacct gtgtagcagt 2653621 cctgaccagc gtcggcatgg ccttcgacgc ctacatgccg gtcatgttgg ccgtcatgga 2653681 gatccccggc tgcctggtgg cgctgtatct ggtggcgcgg ctgcggcacc gagggatgaa 2653741 cgaggcgggg tacatggccg acgagcccgg ctacaccaca gcggcgatga tcggagcggg 2653801 gcccggcacg cccgcccggc ccgctcacag cgacagcctc acggcccaag ccgagcgcgg 2653861 catcgaagaa gagttggagc tctcgctgga aaagcgcgag catccaaatt gggatgaaga 2653921 cggcgtcaaa gacagcggca cgaatgcgtc gatcttctca cgcgagttgc tgcaggaagt 2653981 tttcctcaac ccggggctcg ttctcctctt cggcggcatc gtcatcggcc tgatcagtgg 2654041 actgcaggga cagaaggtcc tacacgacga cgacaacttc tttgtggcgg cattccaggg 2654101 cgtactttgc ctgttcctgt tggagatggg catgacggcg tcgcgtaagt tgaaggatct 2654161 ggcgtcggcg ggcagtgggt tcgttttctt cggcctgctg gcaccgaatc tgtttgcgac 2654221 gcttgggatc atcgtggccc acggctacgc atacgtcact aacaacgact tcgcgccggg 2654281 cacatatgtg ctgttcgcgg tgctctgcgg cgcggcgtcc tatatcgccg tcccggccgt 2654341 gcaacggctt gcgatccccg aggccagtcc gaccttgccg ctggccgcgt cgctgggttt 2654401 gacgttctcc tacaacgtca cgatcgggat cccgctgtac atcgagatcg cccgcatcgt 2654461 cgggcaatgg ttccctgcca ccggggcttc gatcggttag cccagcagag tgcgcaccac 2654521 cgcgtcggcc agcaatcgcc cccggccggt gaggaccagt cggtcgccgt ggtagtccag 2654581 caatccgtcg gccaacaccg cctcggcacg ttcccgttcg gcagccccta gccgggcgag 2654641 cggtagcccc tggcgcagcc ggaccttcag caacacgtct tcggtgtgca aagcgtcggc 2654701 gcccagctgc tcgaagcccg ctaccggcaa cgtcgccccg gccagtatct cggcgtaagt 2654761 gttggggtgc ttgacattcc accagcgtgt cacgccaatg tagccgtgcg cgcccggacc 2654821 tgcgccccac cactggccac cgtcccaata acccaggttg tgccggcact cgccgcccgg 2654881 tcgacaccaa ttggacacct cgtaccaggc aaacccggcc gccgacagcc gagcatcgac 2654941 caactcgtag cgatgcgcca gcacgtcgtc atcgggcgcg gccagctcac cacgccgaac 2655001 ccggcgagcc agtgccgtgc cgtgctcgac gaccaaggca tacgcggaca catgatccac 2655061 accggcctgc accgtggcgt ccactgagcg caccaggtcg tcgtcggact cccccggggt 2655121 tccatagatc aggtcgaggt tgacgtgtgt gaagccctcc gctatcgcct cggtggccgc 2655181 ggccgccgcc cggcccggcg agtgcacccg gtccaaggtt gccagcaccc tcggggccac 2655241 cgactgcatg ccgagcgaca cccgcgtgta accggccgcg cggatcgtgg cgaagaactc 2655301 cggccacgtc gactcggggt tggcctcggt gctgacttcg gcgtcgggcg ccagcacaaa 2655361 gtggtcccgc accatgtcca gcaacgtggc caggcgctcc cccccgagca gcgatggcgt 2655421 cccgccaccc acatacacgg tatgcaccgt cggtgcgtcc agcttggcgg ccgccagttc 2655481 gagctccgcc cgcagcgcca gcagccaacg gtccgggctg acgccaccca gctgggccgg 2655541 ggtgtaggta ttgaagtcgc agtacccgca acgggtcagg cagaacggga cgtgcaggta 2655601 gaccccgaac ggttgtccgg gcatgggcgc caggccgggc agctcaactg gtgcctgccg 2655661 aaataccatg ccaaatcatc gcatagcgcg taccagctag ggtggccagc aatgtaacgc 2655721 aggcacacct caatcgtccc tgctccccga acaacctcca gtctcggccg cgaggaacgt 2655781 caggatgtgg gtgagcgagc ccagcggtgc gtctccctga ctacaagaac tacatttcgg 2655841 ccacgcaccc gggccttggg ttttcataat gttgtctgcg acctcgatct gttgctgggg 2655901 actcgcggcc gccggcgacc cgacaccacc gttggaatcc cacgtcgcct ggctgatctg 2655961 cagaccaccg tataacccgt taccggtgtt ggccgcccaa ttgccgccgg attcgcattg 2656021 cgcgatggcg tcccaatcga tgtcgtcggc tttcgagctg atggtggaca gacccaacaa 2656081 cgcgacaaac atggtcgcga caacggcggt ttcgatgaac accgtgcata cgatcctggc 2656141 gcacctgtca cgtggtcggc cagcacccgc agtagtaagc aaacccggtg tcatagcagc 2656201 tccaccttgc tggccagcca gcggccgttc accttgtcca tgatcacctt gatccgactg 2656261 cggtcgatct gcggcgttgg gctgttccgg ttgctgaccg actggtcgat gaacatcagg 2656321 accactacct tgttcgtggt ggctgatttg accgacgccg ccacgacggt cccgtgggtg 2656381 gccacccgat tgtcggccag cagttggcga aggtgcgcac tggatttgcc gtacttatct 2656441 ttgaactcgc cggtcgaacc ctcgagaatg tccctcatgt tgtggtcgat ccgctcacag 2656501 tccatggtgg ccagcttgac gacatagctg cgtgcggcct gcagtgcctg gccggcggcg 2656561 acgtctgtct gatgcttctc aaagagcacc catccgcacc atccagaccc ggccaacgac 2656621 acaaccaccg cgacggcgcc aacccagcca gtcaccgatc tggttaaccg accgcggccg 2656681 ggagtttcgg ccggttcgcc ggtgccgcct ggctcacttg caccgtggcc ccggccgaag 2656741 atggccattc tgcgcacgat tgacctcgat cactatccgc taagacaact atctcagtag 2656801 tcatatttgg tcacatctgt cactcctgtc aacgtcaggt gcgcgtctcc cagcggattc 2656861 ccgggtcggc ctatccatcc atccaggctt gttgcgtagt tttgatcatc gtgaaaagaa 2656921 atttgaccag gtcgcgcagc tgcacgccat ccatggcaga atgtcaccgt gaccgccgcc 2656981 aagaacccgc gccccgatct gcgaatcgcg ctggtggctc ggcggcacat cgacctcaag 2657041 cgggtctgca gctgtggctg tcggccttga cgccgtaaac ccagcccacc tgtatctgca 2657101 gccggcgacc ggatctgccc ctcccggaac aagcggcgtt tagcgcgtcc taggtcggcg 2657161 atgtccgcga aggagaaccc ccaaatgacc actgcacgtc ccgccaaggc tcgaaatgag 2657221 ggccagtggg cgctgggaca tcgcgagcca ctcaacgcca acgaagagct gaagaaggcc 2657281 ggcaacccgc tcgacgtgcg ggagcgcatc gaaaacatct acgccaaaca gggtttcgac 2657341 agcatcgaca agaccgacct gcgagggcgc tttcgctggt ggggcctgta cacccagcgt 2657401 gagcagggct acgacggcac ctggaccggt gacgacaaca tcgacaagct cgaggccaaa 2657461 tacttcatga tgcgggtgcg ttgcgacggc ggcgcgctct cggctgccgc gctgcgcacg 2657521 ctgggccaga tctcgacgga gttcgcgcgc gataccgccg atatctccga ccggcagaac 2657581 gtgcaatacc actggatcga agtggaaaac gtccctgaaa tctggcgacg gttagacgat 2657641 gtcggactgc agaccaccga ggcgtgcggt gactgcccgc gggtagtgct gggctcgccg 2657701 ttggccggcg agtcgctcga cgaagtgctc gacccgacct gggcgatcga ggagatcgtg 2657761 cgtcgctaca tcggcaagcc cgacttcgcc gacttgccgc gcaagtacaa gaccgccatc 2657821 tctggcctgc aggacgtcgc gcacgagatc aacgacgtcg ccttcatcgg cgtcaaccat 2657881 cccgagcacg gaccaggcct ggatctgtgg gtgggcggtg gactgtcgac caacccgatg 2657941 ctggcccagc gggtcggcgc ctgggttcca ctgggcgaag tgcccgaggt gtgggcggcg 2658001 gtcacctcgg tgtttcgcga ctacggctac cggcgactgc gcgccaaggc ccggctgaaa 2658061 tttctgatca aagactgggg catagcgaag ttccgcgaag tgctcgaaac cgagtacctc 2658121 aagcgtccgc tgatcgacgg tccggccccc gaaccggtca agcatccgat cgaccacgtc 2658181 ggggtgcaac gactcaagaa cgggctcaac gccgtcggag tcgcccccat cgccgggcgg 2658241 gtatcgggca ccatcctcac ggcggtcgcc gacctgatgg cgcgggccgg ttccgaccgg 2658301 atccggttca ccccctacca gaagctggtc atcctcgaca ttccggacgc cttgctcgac 2658361 gacttgatcg ccggtctgga cgcgctgggg ctgcagtcgc gcccgtcgca ttggcgccgg 2658421 aacttgatgg cgtgcagcgg gattgagttc tgcaagttgt cattcgccga aacccgggtt 2658481 cgagcacagc atttggtgcc cgagctggaa cgccggcttg aggacatcaa ctcgcagctc 2658541 gacgtaccga tcaccgtcaa catcaacggc tgcccgaact catgtgcgcg aattcaaatc 2658601 gccgacatcg gattcaaggg acagatgatc gacgacggac acggcggctc cgtcgaaggc 2658661 ttccaggtgc atctgggcgg acacctcggc ctggatgccg gattcggccg caaactgcgc 2658721 cagcacaagg tcaccagtga cgaactcggc gactacatcg accgggtggt gcgcaacttc 2658781 gtcaaacacc gcagcgaagg tgaacgcttc gcgcagtggg tcatccgggc cgaggaggac 2658841 gacctgcgat gagcggcgag acaaccaggc tgaccgaacc gcaactacgt gagctggccg 2658901 cgcgcggagc tgccgaactc gacggcgcca ccgccaccga catgttgcgc tggaccgacg 2658961 aaaccttcgg cgacatcggc ggcgccggcg gcggcgtgag cggacatcgc gggtggacaa 2659021 cgtgcaacta cgtagttgct tccaacatgg ctgatgcggt gctggtggat ctggccgcca 2659081 aggtgcgacc gggcgtaccg gtcatctttc ttgataccgg ctaccacttc gtcgaaacaa 2659141 tcggcaccag agatgcgatc gagtccgtct atgacgtccg ggtgctcaat gtcactccgg 2659201 agcacacagt ggccgagcag gacgaactgc tgggcaagga cttgttcgcc cgcaaccccc 2659261 atgaatgctg ccggttgcgc aaggtcgttc ccctgggcaa gacgctgcgt ggctactccg 2659321 cgtgggtgac cgggctacgg cgggtcgatg caccgacccg ggccaatgcc ccgctggtca 2659381 gcttcgatga gacgttcaaa ctagtgaagg tcaacccgct ggcggcgtgg accgaccaag 2659441 atgtgcagga atacattgcc gacaacgacg tgctggttaa tccgcttgtg cgggaaggct 2659501 atccgtcgat cggttgcgct ccgtgcacag ccaaacccgc cgaaggcgcc gacccgcgca 2659561 gcggacgctg gcaggggctg gccaagaccg aatgcgggtt gcacgcctcg tgaccgcgcc 2659621 ggcgacgatg cagagcgcag cgatgctgag gagcggcgcc atcgaagcac cgccggcgac 2659681 gatgcagagc gcagcgatgc ggtgggggca cctcccgctt gcggaggaga gcggcaccat 2659741 cgcgcctcag ctcgtcctca ccgcacacgg cagcaaagat ccgcgatcgg ccgccaacgc 2659801 acgggctatc gcgggccggc tggcgcgcat gcggcccggg ctcgacgtgc gggtcgcgtt 2659861 ctgtgagctc aactcgccca acctggtcga cgtgctcaac cgctgtcgag gagcagctgt 2659921 ggtcaccccg ctgctgctgg ccgatgccta ccatgctcgc gtcgacatcc ctgcccagat 2659981 cgccagctgc cgcgttggtc accgggtacg ccaggccagt gtgctgggtg aggacattcg 2660041 gctggtgtca gcgctgcatg agcgcctcac cgagctgggg gtttcgccgt tcgaccacac 2660101 actgggggtg gtcgtgctcg cgatcggctc atcgcatccc gcggccaatg cgcgcacctc 2660161 gacggtggcg tcaaggctgg cggaggggac ccagtgggcc gcggtgacga ccgctttcat 2660221 cacccgaccg gaggcttcgc tggccgatgc caccgatcgg ttgcgacgcc acggtgcccg 2660281 tcggatggtc atcgcgccat ggctgctcgc ccctgggata ctgtctgacc gggtacgcgg 2660341 atacgcacgg gaagccggca tcgcgatggc acaaccgctg ggtgcacacc cgatggtggc 2660401 cgcgaccatg tgggatcgct accgacaagc cgtggccggt cggatcgcgg cctaggtctt 2660461 ctcgaaggtc tgctggaacg gatgtcctct ggtgagtgtt tggttgcgag cgggcgcctt 2660521 ggtggctgca gtgatgctgt cgctgagcgg atgtggcggc ttccacgcgg gtgcgccaag 2660581 cacggccggt ccgtgcgaga tcgtccccaa tggcacgccg gcgcccaaga cacccccggc 2660641 taccgtgcct tcgtcgcgca acctcgcgac caaccccgag atcgccaccg gctaccgccg 2660701 ggacatgacc gtggtgcgga ccgcccacta tgcggcagcc accgccaatc cgctggccac 2660761 tcaggtggcc tgccgagtat tgcgcgacgg tggtaccgcc gccgatgccg tcgtggccgc 2660821 ccaggcggtg ctggggttgg tcgaaccgca atcctccggg atcggcggcg gcggatatct 2660881 ggtgtacttc gacgcccgca cgggctcagt gcaggcctac gacggccgtg aggtggcccc 2660941 agcggccgcc accgagaact accttcgctg ggtcagcgac gtcgaccgca gcgcgcccag 2661001 gcccaacgcc cgagcctcgg gacggtcgat cggagtaccg ggcatcctgc gaatgctgga 2661061 gatggtgcac aacgagcacg ggcgcacacc ctggcgcgac ctcttcggcc ccgcggtaac 2661121 gctggccgat ggcggttttg acatcagcgc caggatgggc gcggccatct ccgacgctgc 2661181 gccgcaactg cgagacgacc cggaggcccg caagtatttc ctcaatcccg acggcagccc 2661241 gaaacccgcg ggaacccggc tgacgaaccc cgcgtactca aaaaccctgt ccgccatcgc 2661301 ctccgccggc gccaacgcct tctattccgg cgacattgcc cacgacatcg tggcggcggc 2661361 gagcgacaca tcgaatggcc gcacgccggg cctgttgacc attgaggacc tggcgggtta 2661421 cctcgccaag agacgccaac cgttgtgcac gacctatcgc ggccgggaga tctgcggcat 2661481 gccatcgtcg ggtggcgtcg ccgtggccgc aaccttgggc atcctcgagc acttcccgat 2661541 gagcgactac gcgcccagca aggtcgacct caacggcggt cgcccgaccg tgatgggggt 2661601 tcacctgata gcggaggccg aacggctggc ctatgccgac cgcgaccaat atatcgctga 2661661 cgtcgatttt gtccagctgc ccggcggctc gctcaccacg ctggttgacc cgggctactt 2661721 ggcagcacgc gccgcgctaa tctcgccgca acacagcatg ggcagcgcca gaccggggga 2661781 cttcggcgca ccgacggccg tcgccccgcc agtgcctgag catggcacca gccacctcag 2661841 cgtcgtcgat tcgtacggca atgcggccac gttgacgacg acggtggaat cttcgttcgg 2661901 ctcctaccac ctggtggacg gattcatcct caacaaccag ctgagcgatt tcagcgccga 2661961 gccacacgct actgacggat caccggtggc taaccgggtc gagcctggga agcgaccgcg 2662021 cagttcgatg gcaccgacgt tggtgttcga tcactcgtcg gcggggcgcg gtgcgctgta 2662081 cgcggtgctc ggttctccgg gcggctccat gatcatccag ttcgtcgtga aaacacttgt 2662141 ggcgatgctg gattggggtc tgaatccgca gcaggcggtt tccctggtcg atttcggcgc 2662201 cgcgaactcg ccgcacacta acctcggcgg tgagaatccc gagatcaaca cttccgacga 2662261 tggtgatcat gacccgctgg tgcaaggcct gcgcgcgctg gggcatcgag ttaatcttgc 2662321 cgagcaatcc agtgggctct cggcgatcac ccgcagcgag gcgggttggg ccggcggcgc 2662381 cgacccacgc cgcgaaggcg cggtcatggg cgacgatgcc tgagccgttc gccggcgggc 2662441 ggccaaacga acgcggacca cttcgagccg ataattttgc cggccctctc gggctttgtc 2662501 tgcggtttta ccggctcggt gcattcgcgc gctagccgat agggtctatc gccatgtccg 2662561 gtgccacggt gggtgcgcgc gaaatcacca tccgcggagt cgtcctgggc gcattgatta 2662621 ccttggtgtt caccgcggcc aacgtgtacc tggggctaag ggttggattg acattcgcca 2662681 cttctcatac cggccgcggt gatctcgatg ggcgtgctgc ggttgttcgc caaccactca 2662741 gtggtggaga acaatattgt tcagacgatc gcgtcggcgg ccggcacgct gtcgtcgatc 2662801 atcttcgtgt taccggcact gctcatgatc ggctggtgga gcgggtttcc gtactggaca 2662861 acggcggcgg tgtgtgcact gggcgggatc cttggcgtca tgtactcaat tccgttgcgc 2662921 cgcgcactcg tcaccggatc agacctgccg tacccagaag gcgttgccgg agccgaggtt 2662981 ctcaagatcg gtgactccgc acgggagatg gagcacaacc gtaggggaat tggggtaatc 2663041 gccctgggcg cggcagcggc ggcgggatat gcactgctgg catccctgcg ggtgatcaac 2663101 aactcactgt cggccacctt ccgagtaggt tccggtgcga cgatgatcgg tgccagcttg 2663161 tcgctggcgt tgatcggcgt cggtcatctt gttggcgtca ccgtcggtgt cgcaatgatc 2663221 gtcggattgg ctatcgcctt tggggtaatg ctgccaatac ggacagccgg ccaactgccg 2663281 ccggacgggg actacgccgt cgccgtcgcc agaattttct cgacggacgt gcggttcatc 2663341 ggggcgggcg ccattgcggt ggcggccgcc tggacgttct tgaagatcct ggggccgatt 2663401 ctgcgtggca tcgccgacgc cgcggtctca gctcgaaccc gacgccgagg gcaagcggtt 2663461 ggccagaccg agcgcgacat cccgatccac atcgtggcca tggtggttct tctctcgctg 2663521 atcccaatcg gatggctgct cgcggacttt accgacggga caccgctcga tgaccgcagg 2663581 cccggcgcca tcgccgccgg ggtactgctc gtcttggtca tcgggttgat ggtcgctgcg 2663641 gtctgcggtt acatggccgg gttgatcggc tcgtcgaaca gcccgatctc gggcgtgggc 2663701 attctggtgg tggtgctggc cggtctgctg atcaagactg cgtatggtcc ggccaccggc 2663761 tcgcagattc cggccctggt ggcctacacc gtgtttaccg ctgcattggt cttcggcgtg 2663821 gcgactattt ccaacgacaa tctgcaggac ctcaaaaccg gccaactcgt cggcgctacc 2663881 ccatggaagc agcaggttgc actgatcatc ggcgtgctcg tcgggtcggt ggtgatggcg 2663941 ccgatcctgc agctgatgca ggctggattc gggttccagg gggcgccggg cgcaacggcc 2664001 aacgcattgg ccgccccgca agccgcgctc atgtccgcgc tggccaaggg agtatttggt 2664061 ggctcgctga actggtcgct ggtcggtgta ggggccttga ccggcgtgat agcggtcgcg 2664121 ctcgacgaga cactggccaa gacgacaacc aaccttcggc tgccgccact agcggtgggt 2664181 atgggtatgt acctgccggc cgcactgacg ctgatgatcc cgatcggcgc attcctcggg 2664241 cggatctatg actcctgggc gcggtggtct ggggatgacg acgagcgcaa gaaacggttg 2664301 ggcgtcatgc tcgcgacggg cctgattgtg ggcgaaagcc tatacggggt gctctttgcc 2664361 gtcatcgtcg cgacaactgg caaagaggag ccgctggcca tggtcggcga cggattcagg 2664421 tttgcctccc agccgctggg agccatcgtc tttgccggcc tcctcgcttg gctctaccag 2664481 cgcacccggg tcacagcgtc gtaccggctg gcagcgccgg ccggcagctc caagccactg 2664541 cccgatttgc ctgggtaacc gcattgcgcc cgaggggtcc ggcttttcac agcaacttca 2664601 cggttgacat ccaccttggc tcgcagctct gcgaggcagc ctgaggtgac aaagccggcg 2664661 gcccgacaca tgcagccgag ttggctggct cggaaggggg acagagttga ccatgacagc 2664721 gagtgtggcc aaggtgacag ctgcacgccc ggagccaagc gcggcgtggg ctgaagcccg 2664781 gcggcgggta cgccaacgcc gcgaggacat gctgcgccat cctgcatttc tgtccaagca 2664841 gctccctgcc gaaccagcag acgacgacgg cgtcgcggcc gtctacgaca tcgcgattgc 2664901 gcgtcggcgc cgacctgctt gagcgggtcc cggcgggtca acgtcggcgg ctgccgggta 2664961 aaccggcaat cgacgaccgg gccttggcgg gcgcgtcgcg ttctgccagc tgaactcgcc 2665021 gagcctggtc gatgtgcctg ggctggtgcc cgcgatgccc ttggacgcgc tccggccggc 2665081 gagacagccg acgagtggct tgggcgaatg cgccacgatg cgtcggccag aggcgggtaa 2665141 cgagaaggtg gcggtgatct gggaaagcct ggatgtcgtt ccccccgagt cgctatagtc 2665201 aactgcgccg atgggtcaat gctggccagg cgatgctctg gtcgacatgg cttagcaatc 2665261 ctgacatttt ggaggtgccg gatgtcgttc ctgattgctt cgccggaggc gctagcggcg 2665321 acagccacat atttgacagg tatcggttcg gcaatcaacg cggcgaacgc ggtcgcggcc 2665381 gccccgacaa cagagatcct ggcggcgggg accgacgagg tgtccaccgc catctcagcg 2665441 ctgttcggcg ctcatgccca ggcatatcag gcgctcagcg cccacgtggc ggcatttcac 2665501 gaccagttcg tgcatacctt gaccgccggt gccggctcat acatggccgc cgaggccgcc 2665561 gcctcgcccc tgcaggcttt gcagctggag ctgctcaacg ccatcaatgc acccaccctg 2665621 gcgctgttgg gacgcccgtt gatcggcgac ggcaccgatg cggcgccggg gagcgggggg 2665681 gccggcgggg ccggcggcat cttgatcggc aacggcggga ccggcggcgc cagcgactta 2665741 gccgggaccg gccgcggcgg ggtcggcggg gcgggcggcg ccggcgggct cttcggcatc 2665801 ggcggcgccg gcgggggctg cgggtccgcg gtggcgatcg ggggtgacgg cggggccggt 2665861 ggcgccggcg gcgtgttcag cggcggcggc gccggcgggg ccggcgacgc catcgggggt 2665921 agcggcggcg cgggcggcac cggtgggctg ttgggtggtg gcggcggcgc gggcggcgcc 2665981 ggcggcgccg gcggcaatgg cgggggcgcc agcaacagcg caagtatcgg gggtgacggt 2666041 gggtccggcg gcgcgggcgg catgctctac ggtgccggcg gcgtcggcgg caacggcggg 2666101 gccgcggtcg ctatcggggg tgacggcggg gccggcggca gggccggagc gatcggcaac 2666161 ggcggtgacg gcggcaacgg cgggacttcc aacacccccg gcggtagcgg cggcgacggc 2666221 ggcaatggcg ggaacgccgg actgatcggc agcggcggta acggcggcaa cgccgagatt 2666281 gtcatctccg gcggtagcgt cgccggcacc ggtggcaacg gcgggttgct gttgggcttc 2666341 aacggcacga acgggctgcc gtagcgggcg agcccgccgg cctctggatc acgtcgatgt 2666401 gactttgacc cgttccacgc cggcatcgtc gacgcccgat acgccaccgg caatcggcgg 2666461 cacccgggtg gcacgcacgt agacggtgtc accctcgcgt agggccagcg cctcggcatc 2666521 gccgcgggtg atctgggcgg tgaaggcccc gccggtggcc gcgctggtca actccacgcg 2666581 gacctcgaag cccagcacca ccacccgatc cacaacagcc cgtagcacac cggtggaccc 2666641 ggcggtgccg tcagcggcgg ccacggccat attgggagtc cggccgaccc ggatgtcgtg 2666701 cgggcgcacc agggagccgt tcaacgtgga aaccgctccc aagaaggaca tcacgaaggc 2666761 gttcgccggg gcgtcgtaaa cgtcggtcgg ggatccgacc tgctcgatac ggcccttgtg 2666821 gagtacggcg atgcggtcgg ccacatccag cgcttcggcc tgatcgtggg tgaccagcac 2666881 cgtggtgaca tgcacctcgt cgtgcaggcg gcgcagccag gcacgcagct cttcgcgcac 2666941 cttggcatcg agtgcgccga acggctcgtc gagcagcagc acctccggat cgaccgccag 2667001 cgccctggcc agcgccatcc gctgtcgttg cccaccggag agctgattgg ggtagcggct 2667061 ctgaaatccg ctcaggccca ccacctgcag cagattgtcg accttggcct tgatctcggc 2667121 cttggggcgc ttacggatct tcaacccgaa cgccacgttg tcacggacag tcaggtgttt 2667181 gaacgccgcg tagtgctgga agacgaatcc gatgccacgc cgctgtggcg gcacccgggt 2667241 gacgtcgcgg ccgttgatcg tgatggttcc ggtgtccggt tggtcgaggc cggctatggt 2667301 gcgcaacagc gtcgacttgc ccgaaccgct ggggcccaac aatgcggtca gcgaaccggt 2667361 cggtacgacg aaatccacgt ggtcaagtgc gacgaagtcg ccgtagcgtt tggtggcgtc 2667421 ggccacgacg atggcgtagg tcattttcac cgtctccttc tcagccctcg ctgaccgctc 2667481 gtgcccggtg ggcgtctagc accatctgga cgatcagcac caccacggaa accgccatca 2667541 gcagcgtcga cagcgcgtag gcaccgtact cggccccacg gtggtagcgg tcggagacca 2667601 agagggtcag tgtttgcgat gtccctggaa ggttcgacga gacgatgatg accgccccat 2667661 attcgccgag ggttcgagcg acggtcaata cgatgccgta cgtcaggccc caccggatgg 2667721 agggcagcgt gattcgccag aatgtctgcc accaaccgga acccagcgtc gccgccgcct 2667781 gctcctggtc ggtgcccaat tcgtgcaata cgggttccac ttcgcgcacc acgaatggac 2667841 aggtgacgaa catgctggca agcacgattc ccggcagccc gaagatgatc ttgaagccaa 2667901 ggtcctgctc gacgaagccc agggcgccgg ccgatcccca cagcaagatc aacgagacgc 2667961 ccacgatgac gggtgaaacc gcaaaaggca gatcgataat cgcctgcaag acgcccttgc 2668021 cgcggaaccg gttgcgggcc agcaccaatg ccgtcgtgac tccaaagatc acgttcagcg 2668081 gtaccacgat agccaccacc agtagcgaca ggttcagcgc tgatatcgcc gccggggtac 2668141 tgatccaggc gtagaactgg ccaaagcccg gttcgaaggt ccgccacagg atcagcgcta 2668201 ccggaacgat caacagcaca aagacgtacc ccagcgcgac cgatcggacg aggtagcgag 2668261 ccgccggcaa ggaggtcatg cggccatctc ctcacgtttg gccgcacgcg cgccgacgac 2668321 acgtaggatg agcagcacaa tgaacgaaat cgagagcaat acaaccgata tcgcggccgc 2668381 tccggtgcgg tcgtcgttct cgatcagggt gcgaatccat tgcgaggaca cctcggtctt 2668441 gcccggcacg gccccgccga tcagaaccac cgaaccgaac tcgccgatag cgcgcgaaaa 2668501 cgccaggccc gcaccggata acaatgccgg cgtcagcgac ggcaacacca ccgaagtgaa 2668561 gattttggca ccattagcgc ccagcgacgc cgccgcctcc tcggtctcgc gatcgatttc 2668621 cagcagcacc ggctgcacgg cgcgcaccac gaacggcaat gtgacgaacg ccaacgccac 2668681 cccaacaccg gtcgcggtgt gttgaaaatg aagccccacc gggctgttgt tcccgtacag 2668741 tgccaacatc accaggctgg cgacgatggt gggcaacgca aacggcagat cgataatcgc 2668801 atcgacgatc cgcttgccag cgaagtcgtc acgcaccagc acccaggcga tcagcaagcc 2668861 gaacaccagg ttgatgaccg tgactgcggt cgaaatcgtc agcgttaccc ggaacgactc 2668921 catcgcggca tgcgacgaga ccgccagcca gaaggcccgc caaccaccgc ccgcggcctg 2668981 ccagacgatg gcggccagcg gcaacagcac gatcaccgaa agccacacca ctgccatacc 2669041 gacccgaacg gaaggggggc ccgcggggcc ggaaaggcgc gcgcggaact gcggcgcgcg 2669101 gcgttcgccg accaacgatt ccgtcatccg gtggcccgca gataaatctt ggtgatgctg 2669161 ccggtcgcct tgtcgaacag ctgaggatcc acgctgcccc agccaccgag gtcggcgatc 2669221 gtccacagtt tcgccggcac cggaaacagg tcggcaaaat cggcggcgac cgccggatcg 2669281 accggccgga aaccggcctg cgcccataac ttctgcgcct gcacggtgta ctggaagttt 2669341 ctgaatgcgg tcgccgctcc aaggtgtgtg ctggtcgcca ctacggccaa cggattttcg 2669401 atcttgaacg tctgcggcgg ggtgacgtgc tgcaccggtt tgcccgcccg ctcggtggcg 2669461 atggcttcgt tctcgtagct gatcaacacg tcaccgctgc cctggacaaa aacatcggtg 2669521 gcttcccgcc ccgacccggg gcgcaatttg acgtgttcat tcaccaatgt attgacaaag 2669581 tcgatccccg cttggttatt ccggccaccg tcacttttcg cggcgtaggg ggctagcaga 2669641 ttccacttgg cagaacccga actcagcgga ctgggcgtga tgacctcaat acccgggcgc 2669701 aacaggtcat cccaatctct gatgttcttc gggttacccg cgcggaccac aaacgtcacc 2669761 accgacccga acgggatgcc cttggtggca tcggcgtccc agtccttgtc aaccttgccg 2669821 gccttgacca ggcgagcgat gtccggttcg accgagaagt tcaccaggtc ggccggttta 2669881 ccgtcggcaa caccgcgcga ctggtcggcc gacgcgccat atgaggtaat cacctggact 2669941 ccccggccct gttcggaagc gttgaacgcg ggaatcaccg cactccagcc gggttccggg 2670001 acggcgtagg cgaccagggt gatgctcgta tgcgcacggt ccggtcccgc acggccgacc 2670061 acgtcgctgg gaccgccatg acaccccacg ccgataccgg cgatcaatgc gcacaccacc 2670121 ccggcaggga taatgtgccg ccagcgggat gcgctagcga tgcagctcgc ttcagaaagc 2670181 gtcaaggaga gcattggcga ccttccggtg cgggactttg gacaacgttc ccgtagcggc 2670241 ggaaaggcga tcgctgaaca ttgcaggact cacgaactcc acatcagacc gcgcacgggt 2670301 ggggagtcag cgacaacagt gcaggttggc cgcagcgccg caaacgagcg cgccgacata 2670361 gcgccccgaa aaacccgatg ctgcgtgcac gtggcgaagc ctaacagaat tcggctggcc 2670421 gaccagttgg cgcgcagctc aatgggtgag aagccaggtc acgatcacca gcgcaaccag 2670481 cgtgaccaga accaacgtga cgtgcgacct cggcatccgg gctacctggg cgcctgatcg 2670541 gggcggcggg cgcggcgaat caactgaatg acccggccga gcagggcatc cagcaatgcc 2670601 gcggtgaaat aggccaaagc cagcacgacc ggcatgactg ccaacgcctg cagcgcgaac 2670661 gggagtccgg acagccacag ctcgacgccg tcccaccaac tcaggaaccc gttcatcggg 2670721 cccacactat agcgccggca ggcaaaaccc caggtgtgtc gcgattacgg tgaccgccga 2670781 cgccaaaccg cgacacggca cacggctgct aggcccacct gagcacgcac ccaactacgc 2670841 cgggcgccgg gcgtgaagtg gacgccgagc aagtcgacag atgatgatgt cggcatggtc 2670901 ctgcacgctc aaccccccga ccaatcgacc gaaacagccc gcgaggctaa agcgttggcc 2670961 ggggcaacgg acggggcaac ggccacatcc gcggatctgc acgcacccat ggctctatcg 2671021 tccagttcgc cactgcgcaa cccgtttccg ccgatcgccg actacgcgtt cttgtccgat 2671081 tgggaaacga cgtgcctgat ttcgccggcg ggttcggtgg agtggctgtg tgtgccacgg 2671141 ccggactccc ccagtgtgtt cggcgcgatc ctggaccgca gcgccggcca ttttcgtctg 2671201 ggcccctacg gtgtttcggt gccttcggcg cgacgctacc ttccgggcag cctgatcatg 2671261 gagaccacct ggcagaccca taccggctgg ctgatcgtgc gagacgcgct ggtgatgggt 2671321 aaatggcacg atatcgaacg gcgatcgcgg acccaccgcc gcaccccgat ggactgggac 2671381 gccgagcaca tcctgttgcg cacggtgcgc tgcgtcagcg gcaccgttga actgatgatg 2671441 agctgcgagc cggcgttcga ctatcaccgc ttgggcgcca cctgggaata ctcggccgag 2671501 gcttacggcg aggccatagc ccgcgccaac acggagcccg acgcgcaccc gacgctgcgg 2671561 ctgaccacca acctgcggat cgggctggag ggccgggaag cacgcgcacg cacccggatg 2671621 aaggagggtg acgacgtgtt cgtcgcgctg agctggacca aacacccgcc gccgcagacc 2671681 tacgacgagg ccgccgacaa gatgtggcaa accaccgagt gctggcggca gtggatcaac 2671741 atcggcaact tccccgacca cccatggcgg gcgtacctgc agcgcagcgc gctaaccctg 2671801 aaggggttga cctactcccc caccggggcg ctgctcgcgg cgagcaccac gtcgctgccg 2671861 gaaaccccgc gaggcgaacg caactgggac taccgctatg cctggattcg cgactcgacc 2671921 ttcgcgctgt gggggctcta caccctggga ttggaccggg aagccgacga cttctttgcg 2671981 ttcatcgccg acgtgtccgg cgccaacaac aacgaacgcc atccgctgca ggtgatgtac 2672041 ggggtgggcg gtgaacgcag cctggtcgaa gcggagctgc accatttgtc cggctacgat 2672101 catgcccgcc cggtgcgcat cggcaacggc gcctacaacc agcgccaaca cgacatctgg 2672161 ggttcgatcc tggactcgtt ttacctgcac gcaaagtccc gcgagcaagt cccggagaac 2672221 ctatggccgg tgctgaagcg gcaggtggaa gaggccatca agcattggcg tgagcccgac 2672281 cggggaatct gggaggtgcg cggcgagccg caacacttca cgtcgtcgaa ggtgatgtgc 2672341 tgggtcgcct tggaccgggg ggccaaactg gccgagcgtc agggcgagaa aagctacgcc 2672401 cagcagtggc gggccatcgc cgacgagatc aaggccgaca ttctggaaca cggggtggac 2672461 tcgcgcggcg tgttcaccca gcgctacggc gatgaggcgt tggacgcctc actgctgctg 2672521 gtggtgctga cccgattcct gccgccggac gacccgcggg tgcgcaacac cgtgctggcc 2672581 atcgccgacg agctgaccga ggacggcctg gtgttgaggt accgggtgca tgagaccgac 2672641 gacgggcttt ccggcgagga aggcacgttc accatctgct cgttttggct ggtatcggcg 2672701 ctggtcgaga tcggtgaggt gggccgcgcc aagcggctgt gcgagcggct gttgtccttc 2672761 gccagcccgc tgctgctcta cgcggaggag attgagccgc ggagcgggcg tcacctgggc 2672821 aacttcccgc aggcgttcac ccacctggca ctgatcaacg ccgtggtcca cgtgattcgc 2672881 gccgaggagg aagccgacag ctcggggatg tttcagcccg ccaacgcccc catgtaggac 2672941 ttccgatgcc gagcagacgc aaaatcgccc aaattcgggc cgaaatgggc gattttgcgt 2673001 ctgctcggca agcgtcaact caattcgctg atcctgtcca tcatcgcgtg tgcgatatcg 2673061 acggcgctgg tgctgatgtc ggccgacccc tgatccgacg ggtgggtgat gccaaagaag 2673121 gtgaccgcga cctcgaccac gcaattgccc cgtacgccga cggcacgggc ctgagggacg 2673181 gacgccagta tggagtgcgt gccgcgtcgc agcgagaccg ttgccgcgac aactgaatcc 2673241 gcaacccgga cgtcggtgat ggagcgttga ccgaacgcgc tggcgggcac cgtcagcgtt 2673301 gtgccatcac attccttcca ctgcgcagaa aacctcgcga acagatcatc ggcggctgcc 2673361 gcggaaggca gggcgacgac accctcatcg aggtcatcca ccttcaccga ggaaccgtcg 2673421 tgtcgccacg acacccgggc gacgcttttg acctcgacgg accggtaaac gttccgctgc 2673481 gtcaggtaac cgacgcccac gcagtcagcg ggccgagccg atacatcact gtctcccaaa 2673541 ctgtcgctgc ccccgaacac cggcgggaaa ggtggaaggg cctgaaacgg ctggttgagg 2673601 agcgttgaca gcgcagcgcc gtcgagcggt acccgctgga tcagtgaacc catcagcgga 2673661 cgcggcactg cgttcggcgc cagacctgct ttcccggtcg tcgttgtggt gcacccggca 2673721 gcgaggaaca cggcaaacag cggaaccacc cagcgccagc ggtttgtcac ttcttgcctt 2673781 tgtccccggc ggcatcggtg gacaatgccg cgacgaaagc ctcctgtggc acctcgacgc 2673841 gcccgatggt cttcatccgc ttcttgcctt ccttctgctt ctccagcagc ttgcgtttgc 2673901 gcgtgatgtc gccgccgtag cacttggaca acacgtcctt gcggatcgcg cggatgtttt 2673961 cgcgggcaat gattttcgat ccgatggcgg cctgcaccgg cacctcgaac tgctggcgcg 2674021 ggatcagctc cttgagtttg gtggtcatct tgttgccgta ggcatacgcc gtgtccttgt 2674081 gcacgatcgc gctgaacgca tccaccgcct cgccctgcag caggatgtcg accttgacca 2674141 gcgcggcctc ctgttcgccg gcctcctcgt agtcgaggct ggcatagccg cgggtgcgcg 2674201 atttcagtgc gtcgaagaag tcgaagatga tctcgccgag cggcatggtg tagcgcagtt 2674261 ccacccgctc gggggagaga tagtccatgc cgcccaactc gccgcggcgc gactggcaca 2674321 gctccatgat ggtgccgatg aactcgctgg gcgcgatgat ggtggtcttg acgacgggct 2674381 cgtagaccgt gcggatcttg ccctccggcc agtccgacgg attggtcacc cggatttcgg 2674441 tgccgtcgtc tttgtgcacc cgatacacca cattgggtga ggtcgagatc aggtccaggc 2674501 cgaactcgcg ctcaaggcgc tcacgggtga tctccatgtg cagcaggccc aagaaaccgc 2674561 accggaaccc aaaacccagc gccaccgagg tttccggctc ataggtcaag gccgcgtcgt 2674621 tgagctgcag cttgtccagg gcgtcgcgca ggttcgggta gtccgaaccg tcgaccggat 2674681 acaaccccga gtagaccatc ggtttgggct cacggtagcc ggtcaacgct tcggcggcag 2674741 ccccgcgggc ccgggagagg ctggtcacgg tgtcgcccac cttggactgg cggacgtcct 2674801 tgacgccggt gatcaggtaa cccacctcgc cgacaccgag gccctcacac ggtttcggct 2674861 cgggtgagac gatgccgacc tcaagcagct cgtgggtggc gccggtggac atcatcatga 2674921 tgcgctcacg ggggctgatc ttgccgtcga cgacgcggac gtaggtcacc actccgcggt 2674981 agatgtcgta aacggagtcg aaaatcattg cgcgggtagg tgcctcggcg tcgccctgag 2675041 ggggcggcac ctgtcggacc acctcgtcga gcaggtcgga cacgccttcg ccggttttgc 2675101 cggacacccg caacacctcg gccggctcgc agccgatgat gtgtgccatc tcggcggcgt 2675161 aacggtccgg gtcggccgcg ggcaggtcga tcttgttgag caccgggatg atgtgcaggt 2675221 cgcggtccaa cgccaggtag aggttcgcca gcgtctgcgc ctcgatgcct tgcgcggcat 2675281 cgaccaacag caccgcaccc tcgcaagcct ccagcgcacg cgagacttcg taggtgaagt 2675341 cgacatggcc cggggtgtcg atcagatgca gcacgtagtc ggtcttgtcg acccgccagg 2675401 gtagccgcac attctgggcc ttgatggtga tgccgcgttc ccgctcgatg tccatccgat 2675461 ccaagtactg ggcccgcata gagcgttcgt cgaccactcc ggtgagctgc agcatccggt 2675521 cggccaacgt tgacttgccg tggtcgatgt gggcgatgat gcaaaagttc ctaatctgcg 2675581 ccggcgcagt gaaggttttg tcggcgaaac tgctgatggg aatctcctgg agcgggggtt 2675641 gacgggtatc cagggtatcc gcgtcgggca gctgcgaccc aatcgcgctc ggtcgatcgc 2675701 gtctatgctg cgagcatggc gtccgcacgg aagtcacagt ggaaaacgtt gcagcgcttc 2675761 gcggagaacc tggtgttcac tgaggctcct aagctggtgc gtcacctgca aaacacgcag 2675821 gaaacgcttc gcacaatccg gcaagccgtc aagatcaccg cgaacatcat gaccaccgcc 2675881 gtgccgtcgc caccggccga aattgccgcg ggccggccgg tgaccagcac cagctgtccc 2675941 accgcagcgc gagcccgcag acttgtctac gccccggacc tcgatggccg ggccgatccc 2676001 ggcgagatcg tgtggacttg ggtggcctac gagcaggacc ccacccgcgg caaagaccga 2676061 cccgtgctcg tcgtgggccg agaccgcagc gttctgttgg ggttgctggt gtccagccag 2676121 gagcgccatg ctgccgaccg ggactgggtg ggaatcggtt ctggcgcttg ggactacgag 2676181 ggccgagaaa gctgggtacg gctggaccgg gtgctcgacg tacccgagga gagtatccgc 2676241 cgcgaaggcg cgattctgga acgcgaggtc ttcgacgtgg tagccgcccg gctgcgtgcc 2676301 gactacgctt ggcgctaaac cgggccgggc ggccagcgca atcggctggg caacgagccc 2676361 cgatcaggcc ccaatcagcc ccgcctggcg acgacgcggg ccgcccagcg gcccgctgag 2676421 gagccgggca gtcagccccg cccggcgacg atgcgggccg cccagcggcc cgctgaggag 2676481 ccgggcaatc agccctgagt gatgtaggac tgaagctgct gctgctcggc ctcgagttct 2676541 cccatgcgcg atttcaccac gtcaccgatg ctaacgatgc cgatcagttt cttcccgtcg 2676601 agcaccggca cgtggcggac ccggttttcg gtcatcagca cactgatctt gtcgaccgtg 2676661 tcggattttg tacaggtggc gacggtggtc gacataatct tggcgaccgg gcgagacagc 2676721 acgctggcac catacgtgtg tagctggcgc accacgtcgc gttccgacac gataccgacc 2676781 acgccttcgg cgccgaccac taccatggcg ccgatgttct gctcagcgag gccagcgagc 2676841 agctccccga ccgtggcgtc ggggttgatc gtcaccaccg ccgccccctt gttccgcaag 2676901 acgtccgcga tgcgcatcaa ggcctcccgc cggtggtgag ctggttcaca ccaggctacg 2676961 gcgaactcgg gcggcgggaa agccgatacc ggaatatgcg gcatctagca cccgaacccg 2677021 caggtgcccg gcggtcggta gctgcgtagc ccgggcagga attcggccgc cgacaacgcc 2677081 catgtcggcc gcatcctcga ggctaaaact cgttggccat cagccgaatc ggtcgatcgg 2677141 ggccgctgga tccatcgagc ttgtcaggat agggccatgc ttgagatcac gttgctcgga 2677201 actgggagcc ccattcccga cccggaccgt gccggaccat ccactctggt gcgggccggc 2677261 gcgcaggcgt tcctggtgga ctgcggtcgc ggcgtgctgc aacgcgcggc ggccgtcggt 2677321 gtgggcgccg caggattgtc ggcggtgctg ctcacccatt tacacagcga ccatatcgcc 2677381 gagctcggcg acgtgcttat caccagttgg gtcaccaact tcgctgctga tcccgcgccc 2677441 ttgccgatca tcggaccgcc gggcaccgcc gaagtggtgg aggcgacgtt gaaggcattc 2677501 ggtcacgaca tcggctatcg gatcgcccac cacgccgatc tgacgacacc accaccgatc 2677561 gaggtgcacg aatacaccgc aggcccagct tgggatcgcg acggcgtgac aatccgggtg 2677621 gcccctaccg atcatcggcc ggtcacgccg acgatcggat tccggatcga atccgacggt 2677681 gcttcggtgg tgctcgccgg tgacaccgtt ccttgtgaca gcctcgacca gctggccgcc 2677741 ggagcggatg cgttggtaca cacggtgatc cgcaaagaca tcgtcacgca gatcccgcag 2677801 caacgggtca aggacatctg cgattaccac tcgtcggtgc aggaagccgc cgcaaccgcg 2677861 aaccgcgcag gggtgggaac cctggtcatg acgcactatg tgccggctat cgggcccgga 2677921 caagaagaac agtggcgggc gctggccgcg accgagttca gcgggcggat cgaggtcggc 2677981 aacgacctac accgagtcga ggtgcacccg cggcgctagc acgccagcta tgaccaacca 2678041 gccccgacac cagggcgatc gataaggcaa gaagtagatc gcccgaacca gcgccgggtc 2678101 cgtgcggcgc tagcacgcca gctatgacca accagccccg acaccagggc gatcgataag 2678161 gcaagaagta gatcgcccga accagcgccg ggtccgtgct gaccctcggg cgccacacgg 2678221 tcttgcccag caaaccggtc agcccggacg ctcccgcccg ccacggtgcc gccggccaac 2678281 gccgatcgtc gaaccccacc cggtcactga aagctgccgc aggcggttgg ctgatgcaac 2678341 accgcggtgg caatacgtgc agcgcgaccg gctcatcgcg gatctacggc gcaaccgcgg 2678401 tgatcggcgt cacgccgcgg gtgcgacccc cacgggaccc cggttcccac tgctgtttgg 2678461 cggtgaatcg ctgacaccgt ggacggcgcc cagccgcggc tgttcgcggt ggtgcagccg 2678521 acccgatttc acggaaacac aggctgtcat cagcgaggga aactattcgc cgtgcaaagc 2678581 atttccatgg cgccacaccg atagccggct tgtgctgatc gcacgtcccg atatcttatg 2678641 cagtcgcggt ccggaggcaa tgcgggccaa agccgccgat ttggacttgg ctgcggcggc 2678701 aaagacggtc ggagtgcagc ccgccgccga tcaggtggcg gcggcaattg ccgcaatatt 2678761 gctgtcacac gcccagatct accaggacat cagcacacag atggcggcat tccacgacca 2678821 gctcgtagag aaccgcacgg cagatagcac gtcgtacgcc agcgccgagg ccaacgccca 2678881 gcagagcctg ctcaatgcga tggatgcacc gagctggcaa cagcgccgag aaaccgtcgg 2678941 cgaggtgggg ctcccagcgg acccagcggg atccggcacg gcgacggcgg cagtggcggc 2679001 ggcgacgacg gcgcgggcag gaagccgttc ggccgcccag gcaaccgtgg cgcctatcgg 2679061 cgggctgaaa ctccgccgcg aatctgcgct aagccagccg ggtgatctcc accaccacgt 2679121 cgaggtcggt gacgccctcc ccagagtaga tccctttcag cggggaaacg tcggtgtagt 2679181 cgcggcctac acccacactg atgtattgct cggtgatctc attgtcattg gtggggtcgt 2679241 agtgccacca tccaccggtc caggcctgaa cccaggcatg gctgcgcccg tctaccgtct 2679301 ttcccaccac ggcatcacgc ttagggtgta gatacccaga cacgtaccga cagggaattc 2679361 ccatgctgcg caacaccatc agcgacaagt gcacgaagtc ctggcagacg cccttgcctt 2679421 gttccagcgc atcgagcccg gacgagtgca cactggtggt gcccggaatg tagtccagct 2679481 cgctgcgcgc ccaccgggcg gcggcgacta cggcctcgct gggctcatgg catttcctga 2679541 tccgcctgcc gacggcatca acgcgggcgc ttgccggggt gtgcggggtt gggcggagca 2679601 cttcgtcgaa cctgtcgatc acggccgtcg attgcaggtc ggcccaggtt gccttggcgg 2679661 ccaacggctc cgggcgctcg gtctccacca ccgacgagga cgtcaccgtc agttcggtgt 2679721 gcggcgcatg caagtcaaac gccgtcacgg cagtacccca ataatcgata tagcggtagg 2679781 agcgggtggc cgggatggtt tcgactcggt tgaggacgag gttctgccgc gaactcgacc 2679841 gaggggtcag ccgggcttcg ttgtatgagg ccgtcaccgg cgactggtag acatatccgg 2679901 tggtgtgcac cacccgggtt cgccacatca ggattcctct tggcttccga cgagttggcc 2679961 acgctggcct gcatccgacc acgcaaccca gggagctgcg tgaaagtact gcagcgccaa 2680021 tgcatctccg acatcacgac aggtcgtctg caagcccgcc aggcggctct ccaaggtctc 2680081 gagcaggacg ccgggttgca cgaattccag ctcgctgcgt gcttgcccta acaaccgctg 2680141 tgcttcggtg gtcgccccga tccggctgtg cggattgtgc atcaactcgg cgagattgtg 2680201 ttcggccagc ttcaacgagt gaaagaccga gcgcgggaaa agccggtcga gcatcatgaa 2680261 ctccaccacc cggcccgcgt ccagcacacc gcggtaggtg cgcaggtacg tgtcgtgcgc 2680321 acccgccgag cgcagcagcg tcacccaggc cggcgacgat gcgctatccc ccacccgtga 2680381 cagcaacagc cgcaccgtca tgtcgacccg ctcaatcgcg cgcccaagca acatgaagcg 2680441 atatccgtcg tcacgcaaaa gcgtcgaatc ggccaggccg gcaaacatcg ccgcacggcc 2680501 ctcgatgaac gacagaaact cgtgcggccc aaggcgtttg gcagcgcgtt cgcgttcagg 2680561 cagggcgtta taggtggtgt tgagacactc ccacgtctcg ctggaggtga cttcccgcgc 2680621 cgattttgcg ttttcccgtg ccgccgagat cgcgtcgaca atggaagaac caccctggct 2680681 attggtgctg aaagccacca ggtccgtcaa ggaccagaca tccagctcgt ggtcgggcgg 2680741 ctcgatgccc agcacccgca gcagcagccg ggaggcctgg tcgggatcga cactggaatc 2680801 ctcgagcaat tgatgcaccg cgacgtcgag aatgcgcgcg gtgtcgtcgg cgcgctcgac 2680861 gtagcgaccg atccaataca gtgcttcggc gttgcgggcg agcatcagtg gaacgcctgc 2680921 tgttgttgtt gctgctgctg ttgcggttgt tggtcgtgcg gttcataccc ggacgcgtcc 2680981 accgttgggt cgcacagcgg ctgcggcagc gaacgcacaa tctgtgcagc gcccaactcg 2681041 cgggcggccg ccgaagcgcg cggggccagc acccaggtgt ccttggagcc gccgccttgg 2681101 ctggagttga ccacccggga accctcaacc aacgccactc gggtcagccc gcccggcagc 2681161 acccatacct cgttaccgtc gttgaccgcg aacggccgca agtccacgta gcggggcgcc 2681221 agcgtgcctt cgatccgggt cggcacggtc gacagttcca tcatcggctg cgcgatccag 2681281 ctgcggggat cgtcgcggat cttttggcta acggccgcca attcggcctg agaggcttcc 2681341 gggccgaaca cgatgccgta accaccggat ccctcgaccg gcttgaggac caattcgcgg 2681401 atccggtcca acacctcttc gcgttcgtca tccagccagc atcggagggt ttccacgttc 2681461 gccagcagcg gcttttcgtg gaggtagtac tcgatcatgg tcggcacgta cgtgtagacg 2681521 agtttgtcgt caccgactcc gttgccgatc gcactggaca gcacgacgtt gccggcccgg 2681581 gcagcgttga ccaatccggc caccccgagc accgaatcgg cacggaactg cagcggatcc 2681641 aggaaggcgt catcaatgcg ccgatagatg acgtcgacct ggcgctcccc ctcggtggtg 2681701 cgcatgtata cctggttgtc tcgacagaac aggtcgcggc cctcgaccaa ttcgacaccc 2681761 atctgccggg ccagcaatga atgctcgaaa tacgccgagt tgtagacccc aggggtcaga 2681821 accacgaccg tggggtcggc ctcgttggtg gccgccgagt tgcgcagcgc gcgcagcagg 2681881 tgcgaagcgt agtcatcgac cgcccgcacc cgatgggtgg cgaacaggtt cggaaagacc 2681941 cgcgccatgg tgcgccggtt ctccatcaca tacgacaccc ccgacggcga gcgcaggttg 2682001 tcctcgagaa cccgaaagtc gccgcggtgg tcgcggatca ggtcgatgcc ggcgacgtgg 2682061 attcgcacac cgttgggtgg cacgatcccg actgcctgac ggtgaaagtg ctcacaggag 2682121 gtcaccaacc ggcgcgggat gacaccgtcg cgcagaatct cctgatcacc atagatgtcg 2682181 tcgaggtagc actcgagggc cttgacccgc tgggtgatgc cacgttccag tcgggtccac 2682241 tcgggggccg aaatgacccg tggcaccagg tcgagcggga acggccgctc ctggcccgac 2682301 agcgaaaacg tgatgccctg gtcgatgaac gcacgcccca gcgcatcagc gcgggccttg 2682361 agttcggacg cgtccgacgg cgccagctcg gcgtagatac ctttgtaggg gccgcggaca 2682421 atgccctggg catcgaacat ttcgtcgaag gccatcgcat agacgtccga cgtgttgtag 2682481 ccgccgaaga tgcgttcgcc gcgtgtgggc gaccgccgcc gggtctcgtt gagttggttt 2682541 ggcagactca cgcgtctcat gctgcctcaa attcgacatt ccggcagacc acagattccg 2682601 cttttgggcg aaaacgtaac cgactgataa cctgggcagc cgaatcacac cgacaaaggg 2682661 aacttgcacg tggccaacat caagtcgcag cagaagcgca accgcaccaa cgagcgcgcc 2682721 cggctgcgca acaaggcggt gaagtcctcg cttcgtaccg ctgtccgtgc cttccgcgaa 2682781 gctgcccatg caggcgacaa ggcaaaggcc gcggaactgc tggcgtcgac caaccgcaag 2682841 ctggacaagg cggccagcaa gggcgtgatc cacaaaaacc aggccgccaa caagaagtcg 2682901 gcactggccc aggcgctcaa caagctctga cagccacctg ccgactcatc ggccgcggtc 2682961 ggccaccaac tcggcgacct gccggaccgc ggattccagc gcgtagtccg catccgcgac 2683021 ggcgcccttg acgttagcat tgagttcggc caccaacctc atcgcggtcg ccaccgtgtc 2683081 acgcgaccac cgccgagcct gcttctgggc tttctgcacc cgccagggcg gcatccccag 2683141 ttgtgcggcc aggcggtacg ggtcgccgga ctgcggcccg acccggccga tggtgtgcac 2683201 ggcttcggcg agcgcatcgg ccaacaccac tagcggctca ccgcgcatca tcgcccaccg 2683261 caacgcttcg gcagctcccg ccacgtcgcc ggctaccgcc ttgtcggcga tgtcgaagcc 2683321 cctcacctcg gctttgccgc tgtgatagcg ccgtacagcg gcggcgtcga cggctcctcc 2683381 ggtatcggcg accagctgtg aacaggccga ggcgagttcg cgcacgtcgg agccgacggc 2683441 gtccagcagg gcggtcacgg tctcgtcgtc gaccttgacc cgcagcgacg cgaactcgct 2683501 acggatgaag tcggcgcgct cactgacctt ggtgatccgc gcgcacggat gaacctgcgc 2683561 acccatcgac cgcagctggt tggccagcga tttggcgcgc ccgccacccg agtggaccac 2683621 taccagcacg gtgccggccg gaagatcggc ggcggccgac tcgattaccg cggcagcgtc 2683681 cttgcccgcc tccgcagcgg cccccagcac aacgatccgc tcctcggcga acagtgacgg 2683741 gctcagcagt tcggcgagct cataggcacc gacgtcaccc gcgcgcattc ggctcaccgg 2683801 gacgtcggct gtacctgccc gctgccgagc cgagcgcaac acgtcggcca ccgccctttc 2683861 gaccagcagt tcttcgtctc ccaggaccag gtgcaacggc ttagcctcgc tcaccccacg 2683921 atggtgtcac gaagggccga ccagcccgga cagcgaccag gcaagcagac atatgacggc 2683981 caccgccatc gttttgcaca tggccgcgcg aaaccagcgc caagcgccac tgcgcaaccg 2684041 tgaacacggt ggcgccaccg accagcagta cgccgggcag acctgcggcc accggaacgg 2684101 tcgccgcggg cacacccgac gcccaatgcg ccacgcgcaa cacccaccac acttcgggcc 2684161 cggtgaaccg gatcagcacc tgcgcgccgg ccggccacgg cacgaccagc acggccgcaa 2684221 cgctgcccag cacggtgatc ggcgcgatca cggccgccac cgccagattg gccaccacgg 2684281 ccaccagact gacccggccg gagatggcgg ccaccagtgg cgccgtcacc agctgcgcgg 2684341 ccgccgcgac tgcgagggca tcggccagca ccttcggaca tccgcggtcg accaagcggc 2684401 gtgaccaaac cggcgcgatg acgaccagtg cacccgtggc cgccacggac agcgcgaagc 2684461 cgatgtccac agcaagatgg ggagcggcag ccagcaaaac cagcacgcta cccgacaaag 2684521 ctggaatcgc ctgccgccgg cgcgcagaca gcatccccac gagggcaatg gcgcccatca 2684581 cagctgcccg caacacgctg gccgtcggct gcaccaggat gacgaatgcc accaacgcga 2684641 cggccgcgca caccacggcc gcacgcggtc cgatcaaccg tgccgaaacc agcgccgccg 2684701 cacacacgat cgtgacattg gcccccgaga ccgccgtcaa gtgcgtcagg cccgccgcac 2684761 ggaactcgcg gctggttaag gcggtgaccg tcgaggtatc gccgagaacc agggccggca 2684821 acatcgtggc ctggtcagcg ggcagcacct cacgaaccgc ggccgcgaat cgatggcgga 2684881 cgatgtgagc ggcgcggtgt accgggccgg cacggcccac ggtcggccga ccggtcgcat 2684941 tgaacaccgc gaccgtcagg tcgtgacgcg ccgggcgact gatacgcgcg cggaactgga 2685001 cgggctgtcc gaccatcagc tcgccgaagt ccagcgctcg cgcgaaaacc actacccggc 2685061 cggatgtctc gtcatcccgc agccgttgaa ccgtcgcccg gaacatcaac cggccccgcc 2685121 ccagcgacac tgggctctcg ctgggggtga ccgtgaccag cgcggaggtg ccaaatgcca 2685181 cggtgattgg gtggcgatcg accgcctcgg agcgcaacgc gaccgcaagc ccgtaccccg 2685241 cgcccaccat accgaccgcg accaggccgg cgctgatcga acccagtcgc ggagcgtgcc 2685301 acgaccggcg cgccacacac caccacagtg cgccgccgcc gagggccacc acgacgcagc 2685361 acaaggcaca cacgttgccg atcggccaca cgatcccggc cgccgtcaca atccagctga 2685421 ccagcgccgc cgggaccagg cgtacgtcca aacgggacgc gccgaagccc atatggcgca 2685481 ccggtatcag acacggacca gattgcgcag cttgtccagc cgcgccggac cgatgccgtc 2685541 gacgtcggca agctggtcga cgctggtgaa cctaccattg cgctgccgcc acgccacaat 2685601 cgctgcggcg gtgaccggcc cgatgccggg cagggcgtcc agctgctcca cggtcgcagt 2685661 gttgaggtcg agcacctcag ctgtcttagg agctgtctta gggcctgtcg tggctgtgcc 2685721 cgaggtaccc gccggtcccg gcgtccccgc accgaccgag ctgcccagca ccctcggctg 2685781 tcccgagggc ggagctagcc cgaccacgat ctgctcaccg tcaccaagct gccgagccat 2685841 gttcagtccg acggtgtccg cgccgtctac cgctccgccg gcggcctgta gcgcatcggc 2685901 gatccgcgcg cccggcgcca gggtgacgag tcctggggtg tgcaccaggc caaccacgct 2685961 gaccaccacc ggcaggccgg aacggtccgg cgagcccggg cttgccgacg acctagggtt 2686021 cgtcggcgaa accggctcta ccggaggaag tttggctgac attaccggct cagtccggtc 2686081 gcggatcaag gtgaataccg tcaccagcac cgcgagggcg gcgatcaccg ccaatgcgac 2686141 ggcgccggca cggcccggat ctgcgcgtat cctgtccgcc caaccttgcc cacgggaagt 2686201 gtcgggaagc cagcgcggca gcagcgagtt cggatcgtcg cgtggctcgt cgtggtctgg 2686261 accgtcgtcc gttggatcgt gtggctccgg gtctaagtgt gcagatgcgg cgtgcgagtc 2686321 gatatccggg acggcaccga gccgcctttg cagtcgctcg gcgggcagtt ctgttcgcat 2686381 gggccgaccg tagctgcggg gaccgccaga accggcgcgc acgacggcgt cgcgctgccc 2686441 tgctgtggat caatccgagg ctgtggacaa gccgctttgg cgatggatca agatgggaca 2686501 aaccgcgcca acatccccga acaaccagca ccgggctgcg acgtccatcc ggactcggct 2686561 caccgcgatc gagagtgtac tcggcaacgc gatccgcgag tgctgagccg cgcggccgat 2686621 ccccatccat ggcgtgtggc gaccgaagcc ggcgcgcagc acgcgggtcg ctgaccacgc 2686681 cgaaaagccc gtcagcctag cgccggccta gcacggcctt cagaactcga acgcggtctg 2686741 gacgggaaca tcactggcaa acgccgcgtc gagtcgacga agcagctggg aatctttggt 2686801 gcgcaaccgg ttagcggcgg ctaacgtcga agcgcggtgc gctccaaggt aaaggctgcc 2686861 cagtacgtcc cgatccattt cgatctcggc tgccgcatcg gtcggggtac accgcgcacg 2686921 gccgtcaccg atcttgagcg cgaaccggcc gccatcggat acctcgagga ccgtggaaaa 2686981 ctcgccaact tcgtgagcgt aaccacgcgc ctcgagtgcg gccggtacgt tcatgatgcg 2687041 caaccacagg ccgtcctggc gccaggtagt gcgggccagt cgggtatcgg tgagcaggtg 2687101 gggtaacggg tcctgtggat gggtgatgat gctgattcgc tccatggagt cgaggccaat 2687161 cagggcccgc cacaacgcac aatgcgcatc tgcggttacc gccctgagtt cgctgacgcg 2687221 cgctagcttg agatcggtgc gatccacccg gtacagcgcg tacccgtcgg gatgcagtaa 2687281 cgcgaacgat tcacggtctc caccgggcgc ggctttgcat tctgccagca gctcgtccca 2687341 gagcacctgc gggcgtagca gcccgcccgg cacctgctgg cgccatcgct cgtagatcgc 2687401 ctcaaactcg ccgcgatgct cggtgggtct gaccaaccgg acgctgctgc cacctaggcc 2687461 gccgcccggt gcgtcggcgt gaaagcgcgc gaagcgtcgg tcgaccgtca gctcatgcaa 2687521 ggtggtagcg ggcccgtagc cgaaccggcc gtagatgccg ccctcgctag catgcagtgc 2687581 cgcgaccgga tagccggaat cggctatgcg gcggtgcagt tcggcgcaca tcgcgcgcag 2687641 caagccgcgc cggcgatgcg tcggcgccac cgcgacgaaa ctgagaccgg cggtcgggag 2687701 caccacttca ccaggcaccg ccaaccgcag atccatgtac agcgccatcc cgaccacctc 2687761 agaacccggg ccggcaccat cgcggaccac caccgctccg tcggtgggca ccagggtccg 2687821 ccaggcggtc gctgattcag ggccgatgaa atcggtgaaa ctggccgcgg ccagtaggaa 2687881 catccccggc cagtcgtcct cggtcgggct acacagggtc acagtcacag aatccgactg 2687941 tggcatatgc cgcggccacg tgcacgtgaa tattacgacg acagtgtctg gcaaaggatc 2688001 acgcgatgcg ggtagccccg ccagcgtgac gccactgcga gaatcagcga cgaatttcgc 2688061 cgtgacgtta cgctggcggc gacgctccca cgtcgacgca taccccgacg gctccggcac 2688121 cgacgtgcag agcaagtacc ggtcccatgg cggtcaccat ggccggctca cacgccggca 2688181 gccgctccgc cagcgccgcc gccacgtcgt tcgcagctgc cgggtcggcg acgtgatgca 2688241 ccgcgagagc ggcggggcgg tcgccgacaa gctggcaaac ccggtcgatc atcaccgccg 2688301 tcgcgttgct cacagtgcga acccgttgga ccagaacaag ttttccgtcg tcgactgaca 2688361 gcagcggctt gagcgccagc gcggtgccca accatgcctt ggccccactg atgcgcccgc 2688421 tgcggcgcag attgtccaac cgcgctacag cgacgaacgc gtgaatccgg cttaccgccg 2688481 cagccgctgc gcgcgcgacc gtatccagct catcgcctgc ggcggctgcc cgcccggccg 2688541 ccagtgccgc gaaaccgacg cccatcgcgg ccgacctcga gtcgatcacc ctaacggcgg 2688601 gacctagttc cgccgcggtc agctcggcgg ctcgaaaggt acccgacagc gccgacgaaa 2688661 tgtgcaccgc cactaccccg tcgccgccac tgtccgccaa cgcccgttgg taggcggcgg 2688721 acagctcaac cggggtcgcc ccagcggtgg tggcgtggcg cttgtggatg tcatcgggga 2688781 tttcgtccac accgtcgcgc aggtcgaggc cgtcaagcaa gatatgcagc gggacctggc 2688841 ggatcgacca ctgttcgcgc aggtcggccg gcagtcgaca cgacgtatcg gtcaccacca 2688901 caacggtcac cggcgccgct ctcccccgca agcgggaggt gcccccacct catcgcttcg 2688961 ctctgcatcg tcgccggcgc ggggcatgtc tcagccgcgc gatttctcgt tcggcacccc 2689021 ggcttcggcc agtgccttga gcatcagttc ggcgaccgcc tggtgggctt caaaattcca 2689081 gtgaatgcca tcacgattac catatccact caatatctgt tctgcgacag cggctttgag 2689141 atcaactaga ggaatgtcat ggtgctgtgc ccattccgtg atcgccgcca ccgtgcctgc 2689201 gcggccgtga tgggccttgc cgtaggtctc ggcgatatgc accgagggca gcgatgcgat 2689261 gatcggtatg cccggacgat tgaaatcaat tgcaccacgg gtcttttcaa ggtactcagc 2689321 ggtcaggtgc ggcggcaacg ccgcacgggc cactggcgac agtcgcggtt gaacccaggc 2689381 gtagccgtcg cggacccacc gccgcagcca agacggacgt acatagcgga tgagctcacg 2689441 cagcgccgtc ggtaataccg acggcagcga atccattccg ccggtcgcga agatcaccgc 2689501 tccggccctg ggtaacgccg cccaagcgcg cggatcctgg gttgccgccc accagacatc 2689561 ccgacaggtc cagccgatgc ggccaatcag ctctaaatcc caatctagtt gggaagcaac 2689621 aatattgggc cagatacggg ggtcatcggc aggcaggccg ccggtgggcc cgtagtaggc 2689681 cagcgagtca gcgaagacca acaatgcggg cctgcgcccg cgcctagagg acatcgctgg 2689741 agacctgcgc cgaagcattc cacacatcaa ggcgccaccg gatgctctcg aagtcggagc 2689801 ccggggccca atggccactc agctgagtcc aactggcatt gcccatgccg cccaaagccg 2689861 gccagttggc caccggcaac ttcagcagcg ccgccgacaa cacggcgatc agacccccat 2689921 gggctaccag caccaccggg cgatccggct cgtcagcgcc accccattcc ggttcgctgg 2689981 caaccaactc ggcaaccaac ggccgacttc gggcagccac gtcaaccctg ctttccccgc 2690041 cgtgcggcgc ccaggtcgca tcctcgcgcc aggccaaccg ggcgcccggg gcatcagcgt 2690101 cgatctgagc gtgggttaag ccctgccaat cgccaaggtg agtttcccgc aatcgggtgt 2690161 cgacccggac cacaaggccg gtgcgctcgc ccagcttgac cgccgtgtca tatgcgcggc 2690221 gcaggtccga cgatacgatc agtagcggct gccgcttgcc cagcacctcg gcggccgcga 2690281 ccgcttgggt gcggccaagt tcgctcaact cagtgtccag ctggccctgc atccggctac 2690341 cgacgttgta gtccgtttgt ccatgccgca gcatcaccag tcgccgcgct ctcattgcgc 2690401 acccgctgag ttcgccgata aatcaaccgg caccaccggg cagtcacccc acaaccggtc 2690461 cagggcgtag aaattgcggt cgtcctgatg ctggatgtgc accacgatgt cccggtaatc 2690521 caacagcgtc cagcgaccct cgcgggcacc ctcacggcgg gccggccggt aacccgcctg 2690581 tcgcattttc tcctcgacct catcgacgat ggcgttgacc tgccgctcgt tggagcccga 2690641 agcaatgacg aagcagtcgg tgatgaccag ctgcccggag acatcgatga ccacgacgtc 2690701 atcggcgagc ttggcggcgg ccgcgccggc ggccaccctc gccatgtcga tggcttcccg 2690761 gttggcggtc ataggccatt cccagcggcc aggctggtcg ttgaacgcgc gcccgcgtcg 2690821 caggcgccac agtagagccg gcacttggag acatactgca cgacgctgtc gggcatcagg 2690881 taccacagcg gccgggactg ctcggcgcgc tgacggcagt cggtcgacga aatggccagc 2690941 gccgggatct cgaccagagt caacgcatcc ttggccagct gacccagcag gctagtgatg 2691001 tgttcgttgc gcaactcgta gccgggccgg ctgaccccca cgaaccgcgc caattcgaac 2691061 agctcctccc agccctgcca ggacattatg gaagctagcg catcggcgcc ggtggtgaag 2691121 tacagctcag agtccgggtg caaagcatgc agatcggcca gcgtgtcctt ggtgtaggtg 2691181 ggtccgccgc ggtcgatgtc gacccggctc acagagaatc ggggattgga ggcggtggcg 2691241 atcaccgtca ttaggtagcg gtgctcggcg gcggagacct gtcgaccctt ttgccagggt 2691301 tgcccgctgg gcacgaatac cacttcgtcg agatcgaaca ggtcggccac ctcgctggcg 2691361 gcaaccaggt ggccgtagtg gatggggtcg aacgtcccac ccatgactcc caatcgacgc 2691421 ccatgcacga ttggccagct tactggagta tcttgccgca gttccgttcg cggcaactgc 2691481 cagccagcct aagcgagcag ccattgataa ggcagcacga ttggttattc ctaagccttt 2691541 gcgtgatcat cttggtctcg ttccgggtga ggtcgaggtc gtcgccgacg gggcgggact 2691601 gggtgtcgcg accctcgccg gtgactccct cggcgagcgg catggcctac cgatgatacc 2691661 cgcgggcagt gcggcgcgat gccggccagc gttagcaccg tgctcgtgga cacgagcgtc 2691721 gcggtcgcac cggtggtcgc cgatcacgac caccacgaag atacctttca agcgctacgt 2691781 ggccgcaccc tcggtctggc cgggcacgcg gcttttgaac gcaggacgct ggcgaccgtg 2691841 gcgaagctgc ttgcacacac attcccggcg accaggttcc tcggcgctgg ggcggcgatg 2691901 tcgctgctac ccgaactcgc accggccgaa atcgccggcg gagccgtcta ggatgcgctg 2691961 atcggtacgg ctgccaacga gcatcggctc cccctggcaa cccgcgaccg gcaggcgctg 2692021 aaggtctacc gcgcgctcgc aatggaagcc gagctgctgg cctgagcgtc gcggttgcgc 2692081 ggccaatcac acccgcgccg ctgccaggcc aacggctgcc cagctgcccg gtccctcacg 2692141 tttttcaccc gatgtacccg caacgatcta ctcggtcgtg tagaaggggt ctgtggataa 2692201 tttgccgatc gaatcagccg agtcgacgcg gttggcgaag gcggcgatga cccgacggtt 2692261 ttacacccgc tcggtggtga aaggcgagat cacgctgccg gccgtgccga gcatgatcga 2692321 cgagtacgtg acaatgtgcg ccggcctttt tgcgggtgtg ggcagaaagt tttccgacga 2692381 agaacttgct catcttcgcg cggtgctcca gggtcagctg gcagaggcgt acgcggcctc 2692441 ccagcgttcg accatcgtca tctcatacaa cgcccccatg ggcccgacct tgcactacca 2692501 agtccgagcc caatggcgga cggtggcgca ggaatacgag aactggatcg ccacccgtga 2692561 gccgccgctc ttcggtaccg aaccagacgc acgtgtgtgg gcgctggcca acgaagcagc 2692621 cgatcctacg acgcatcggg tgctcgaaat tggcgccgga accgggcgta acgccctggc 2692681 gttggcacgg cgcggacacc cggtcgacgt ggtggagatg accccgaagt tcgccgacat 2692741 cattcgctcc gacgccgaac gagattccct cgacgtgcgc gtcatcatgc gtgacgtctt 2692801 ctcgaccatg gacgacttga ggcaggacta tcagctgatg gtgctctccg aggtggtgcc 2692861 ggacttccgg acgacgcagc agctgcgcaa tctgttcgaa ctcgctgccc agtgccttgc 2692921 tcccggtgcc cgcttggtgt tcaacgcctt cctggcgaac ggagattacg cacccgacca 2692981 agccgcgcgt gagttcgggc agcagatgta taccgggatg tgcacgcggg ccgagatgtc 2693041 tgctgcagcg gccggccttc ctctcgaact cgtcgccgac gactcggtat acgactacga 2693101 gaaaacgcac ctgccaccgg gcgcctggcc gcccaccagt tggtacgccg actggatccg 2693161 tggcctcgac gtgttcacca ccaacgttga gagctgcccg atcgagatgc gctggttggt 2693221 gttccagagg aggcggtgag cagtcgcaaa agcccccgaa accggtcgga tttgggggct 2693281 ggtacgtgaa ttagggtgac cacggcaagc gtgacccgcc ggcgactgca gcgaagccgg 2693341 gtctgttggt gacagtgtgt atgtcggggt ttcaggcggc aggttcgagg gtgaccccca 2693401 atccttgggc ttcgagtttg gcgacgaggc gacgtcgttc tttgtcggga tccatgcggg 2693461 tggtgaagta gtcggcgccg agatcctggt aaggccggcc ggtggccagc acgtgccaaa 2693521 tgatgacgat cagcttgtgg gcgacggcga tgatcgcctt cttgttggca gcgggactgc 2693581 ggaagccacc gaacttgcgg acctggcgac ggtagtactc gcgcaggtag ccatcggtgc 2693641 gcacggcggc ccacgcgcac tcgaccagga ccggctgcag gtgctggttg cctgtgcggc 2693701 gggcaccgtg atggcgtttg ccggccgatt cgtggttgcc cgggcacagc cgcacccacg 2693761 aggccagatg ctcagccgag gggaaccagg ccgccgggtc ggcgccgatt tcagagatga 2693821 ccgtcgccga ggcacccacc ccgatccccg ggatcgatgc aatcagctcg cgtcgggcac 2693881 aaaagggatg catcagctgc tcgatctgct cgtcgagagc accgatcatc gcatcgagct 2693941 gatccagatg agccaggtgc aacctacaca tcagggcatg gtgatcatcg aagcgccctt 2694001 ccagcgcccg ctgcagatcg gggatcttcg agcgcatact gccgcgcgcc agatcagcca 2694061 gcaccgccgg gcggcgttca ccgtcgatga gcgcctccac catcgcccgc accgacttgg 2694121 gggtgaccga ggacgccacg ctgtcggcct tgatcccggc gtcttgaagc attgcccagg 2694181 cgctgcagct tcgaggtgcg atgctcgacc agcttgcggc ggtagcggat cacgtcgcgg 2694241 gcggccttga tgtcggcggg cggaatcaac caaccccgca gcagaccgca ttccagcagg 2694301 tgcaccaacc actcggcatc caagaggtcg gttttgcggc ccggccgttc ttcacgtgcc 2694361 cggcattgca caccagcagc tcactcgccg tgggccaaca acgcgtgata agcgggcgca 2694421 ccagcgccca tcatgttctt ttacgactgc ccgcccggcc tacaccggca gtagctggtc 2694481 gatcaccgtg gccagctgct tggcggatcg gcattcgtgc atcgtgatca cctcttggta 2694541 gcgcggcacc gccgagtcac cgctgcccca cagatgcttg ggctccgggt tgagccagtg 2694601 cgcgtgccgg ctggcggtca ccatgtcggc cagcacgtcg gtggccgggt tgcggtagtt 2694661 ggtgcgcccg tcaccaagca ccagcagcga gctgcgcggc gacagcacat ttgggaagcc 2694721 ctgcatgaac gagacgaacg cgttgccgta gtcggaatgg ccgtcgcggg catacacacc 2694781 agcctcccgg gtgatccgct ggatcgctat ggccaggtcc gattccggcc cgaacatatg 2694841 ggtcacctcg tcggtggagt cgatgaaggc gaagacgcga acccgggaga actgttggcg 2694901 cagcgcgtgt accagcagca gcgtgaagtg gctgaagccc gcgaccgagc ccgacacgtc 2694961 gcacaacacg acgagttccg ggcgcgccgg gcggggtttg tgcaacacca ggtcgatcgg 2695021 cacgccgccg gtggacatcg acttgcgcag cgtcttgcgc agatcgatcg atcccgcgcg 2695081 ggcgcggcgc cgccgggcgg ccaaccgggt cgccagggtg cgggccaacg gggccaccac 2695141 ccggcgcatc tggcgcagct gctcacccga ggcacgcaga aactcgacgt tctcggaaag 2695201 ctgtggaatt ccgtacatct ggacgtgctc gcggccgagt tgctcggctg tgcgccgctt 2695261 ggtctcggcg tcgaccattc tgcgcagctg cgcgatcttt tgtgcggcaa gcgctttggc 2695321 aatctgttcc tgggtggctg tgggctcatc gccgtaggga gcaagcaggc ccgccagtag 2695381 cttgccctcc agttcgtcca gcgccatggc cttgagtgcc tgatacgacg agaacgacgg 2695441 accgcggctg gaactgtact tgccataggc ctcaacgatc cgcgcgatca tctccaccaa 2695501 ccgctcgtcc ttgccggcca ggtcttggtt gttggccagc agatccagca gcagctgccg 2695561 catagcctcg acatcatcgg gcggcaaacc cccggagcct gccgactcgt cttccgtggt 2695621 gatgaccgcc cgagccccca gtgccgcggg aaaccacagg tcgaacatgg cgtcataggt 2695681 atcgcggtgg tcaggccggc gcagcaccgc acaagcaatg ccctcccgca acacctcacg 2695741 atcacccagc ccgagggtgg ccatcacccg gccggcatcc accgtctctg acgggcccac 2695801 cgaaatcccg ctgccacgca gcgcttccac aaagcccacc aagtgtccgg gcagcccatg 2695861 cggggcgagt ggccgggcag cacgaatacg acgggcggcc actagttcaa cctgagctct 2695921 ccggtggccc gttgctggtc ggattggtgc ttgagaacca cgccgagcgt ggcggcaacg 2695981 accgcatcgt cgatggtgtc cagtcccagt gccaagacgg tgcgacccca gtcgatggtc 2696041 tcggcgatcg atggcacctt cttaagctgc atgccgcgca gcacgccgat gatgcgcacc 2696101 aactcctcgg cgaagtgctc gggcagctcg ggaactcggg ataacaggat gcgacgctcc 2696161 agctcggggg tcgggaagtc gatgtgcaag tacaggcagc gacgcttgag cgcctcggac 2696221 agctcacggg tggcgttgga ggtcagcagc acgaacggcg cccgggtggc ggtcagggtg 2696281 cccagttcgg ggacggtcac cgcgaagtcg gacagcacct ccagcagcag gccctcgatc 2696341 tcgatgtcgg ccttgtcggt ttcatcgatc agcagcacgg tgggctcggt gcgccggata 2696401 gcggtcagca gcggacgctg cagcaggaac tcttcgctga gcacatcggt tttggtggcc 2696461 tcccaatctc ccgagccggc ctggatacgc aggatctgct tagcgtggtt ccactcatac 2696521 agggcgcgag cctcgtcgac gccctcgtag cactgcagcc ggaccagacc ggatccagtg 2696581 gcctgcgcca cggcgcgcgc cagctcggtc ttgccgaccc cggcggggcc ttccaccagc 2696641 agcggcttgc cgagccggtc ggcgagaaag accgccgtcg cggtggcagt gtcgggcagg 2696701 tagccggtct cggccagccg ccgcgagacg tcggcgatgt cggcgaacag cggcgtgggc 2696761 cgggcgggca cggtcacgat cgggtctcct ctagccaacg gcgtcaggcc ggacgggtgt 2696821 ggccggctcc ccatgcgatc cacttggtcg acgtcaattc cggtagtccc atcggtccgc 2696881 gggcatgcag tttctgggtg gagatgccga tctcggcgcc gaagccgaat tgctcgccgt 2696941 cggtgaacgc cgttgatgcg ttcaccatca ccgcggccgc atcgatctgt tcggtaaagc 2697001 gttgggccgc atcaagattg gtggtcacaa tcgcttctgt gtgcccggtg ccgtattcgt 2697061 tgatatgggc gatggcagcg tcgacaccgt cgaccaccgc caccgcgatg tccagcgaca 2697121 ggtattcgcg gcgcaggtcg gcctcgtccg ggtcgagatg tacggtgaca ccggcgtgct 2697181 gcagggcggc cagcaatcga ggcaacgccg tttcggcgat cgctgcgtcg accagcagcg 2697241 tctcggcggc gttgcagacg ctgggccgcc gcgtcttgga gttcagcaag atacgctcgg 2697301 ccacgtccag gtcggccgct tggtgcacgt agacatggca gttcccgacg ccggtctcga 2697361 tggtgggcac ctgggcatcg cgtacgaccg cctcgatcag gcccgctccc ccgcgtggaa 2697421 tcaccacatc gaccaggccg cgggcctgaa tcaggtgagt gacggtggcg cggtcggcag 2697481 ccgacagcag ctggaccgcg tcggccggca gctccaggcc gaccagcgcg gtgcgtaaca 2697541 ccgccaccag ggcctcgttg gactttgcgg ccgacgagct gccgcgcagc aatgcagcgt 2697601 tacccgactt gagtgtcagc ccgaaggcat ccacggtgac attggggcgg ccctcgtaga 2697661 tcatgccgac cacgcccagg gggacgcgct gctggcgcag ctgcagcccg ttgggcaggg 2697721 tatagccacg cagcacttca ccgaccggat cgcgcagtcc cgcgacttgc cgcaacccgg 2697781 cggcgatacc gtcgactcgt tgcgggttca aggacaaccg gtccagcatg gcggccgggg 2697841 tgtccgcctc gcgcgccgcg ttcaggtctt cggcgttggc cgccaggatc tggtcgcggt 2697901 gagccagtag ctcgtcggca gccgcgtgca gcgcgcggtc tttgacagtc gtcggcagcg 2697961 atgccagccg gcgggcggcc acccgggcgc ggcgtgcggc gtcgtgcacc tcttgacgca 2698021 agtcgagctg cgacggtgct ggcacggtca ttgccccagg gtaacgggct tgcgctggcc 2698081 aggtaagacg acccgctccg gacgggccgc gcagcgatcc ggctgggtgg ttgctatgcg 2698141 atcaggcgta cttgacggtc gcccctgatc agcttgccga taatcccggc aagacgctgg 2698201 taggacttct cgcggccgcc gaaagagcta aacaccaaac cgattcgtcg cgccgggcag 2698261 gggcgacgaa tcgggcgagt tccagccggc ttcgcgtggt ctcgacggcg gccgcggtct 2698321 gcggaatcag tgtcaccccc agcccgccgg tcacgcactg cacgacggtg gccagctaca 2698381 ccgcccgggt gttgggcagc atcgagcgtc tggtcgcgta ggcagtgccc ctcatgcagt 2698441 cacaacaaag tcagctctga cagcgcggtc agcggcaccc gctgcttgcc ggaaagacat 2698501 gccctggggg tgcaccgaga ccggcttccg accaccgctc gccgcaacgt cgactggctc 2698561 atatcgagaa tgcttgcggc actgctgaac cactgctttg ccgccaccgc ggcgaacgcg 2698621 cgaagcccgg ccacggccgg ctagcacctc ttggcggcga tgccgataaa tatggtgtga 2698681 tatatcacct ttgcctgaca gcgacttcac ggcacgatgg aatgtcgcaa ccaaatgcat 2698741 tgtccgcttt gatgatgagg agagtcatgc cactgctaac cattggcgat caattccccg 2698801 cctaccagct caccgctctc atcggcggtg acctgtccaa ggtcgacgcc aagcagcccg 2698861 gcgactactt caccactatc accagtgacg aacacccagg caagtggcgg gtggtgttct 2698921 tttggccgaa agacttcacg ttcgtgtgcc ctaccgagat cgcggcgttc agcaagctca 2698981 atgacgagtt cgaggaccgc gacgcccaga tcctgggggt ttcgattgac agcgaattcg 2699041 cgcatttcca gtggcgtgca cagcacaacg acctcaaaac gttacccttc ccgatgctct 2699101 ccgacatcaa gcgcgaactc agccaagccg caggtgtcct caacgccgac ggtgtggccg 2699161 accgcgtgac ctttatcgtc gaccccaaca acgagatcca gttcgtctcg gccaccgccg 2699221 gttcggtggg acgcaacgtc gatgaggtac tgcgagtgct cgacgccctc cagtccgacg 2699281 agctgtgcgc atgcaactgg cgcaagggcg acccgacgct agacgctggc gaactcctca 2699341 aggcttcggc ctaaccggga tctggttggc cgggaatcaa tgagtataga aaagctcaag 2699401 gccgcgctcc ccgagtacgc caaagacatc aagctgaacc tgagctcaat cacccgcagc 2699461 agcgtgctcg accaggaaca actatgggga accctgctgg ccagcgccgc agcgacacga 2699521 aatccgcagg tattagctga cattggcgct gaagcgaccg accatctgtc ggctgcagcc 2699581 cgccacgcag ccctcggagc cgcggccatc atgggcatga ataacgtgtt ctaccgtggc 2699641 cgcggcttcc ttgaaggccg gtacgacgac ctgcgccccg gactgcggat gaacatcatc 2699701 gccaatccgg gcataccgaa agccaacttc gagctctggt ccttcgcagt gtccgcgatc 2699761 aacgggtgct cgcattgcct cgtcgcccac gagcacacgc tgcgtacggt aggtgtggac 2699821 cgagaggcga tctttgaagc gctgaaagcc gcagcaatcg tttcaggcgt tgcacaagcg 2699881 ctggccacaa tcgaggcact aagcccaagc taagtgtctg tacgcgatga cgccgtgctg 2699941 ggtgacaccg gtgcgaccaa cacggtactg tgggcgatcg gcggcggcgc cttccacgga 2700001 gtcaacttcg acaacgcatc cgacacccga agcctgtagt ccctcatcac ctctccgtcc 2700061 tcgtcccaga agtcgtcata ttcttggtcg aggtcggcga tctgcgcagt gaattgccca 2700121 agcgcgttgt tgtcgatcag aatctgcctc tcagcacggt tgttgtagat ctgcgccagg 2700181 ggcaccatat cgtgatgtgc ccattcatag gcccgcacga tctcgtggat ctgcctctcg 2700241 acctcagaca gctgcacaca gaggtcggtc agccacctga caaacggctt ggctgcctcc 2700301 atcaactgca tcaccactgg acccgcccag gcgtccatca gagacagcag cgttcggttg 2700361 aacgacctct gcacggccgt catttccaca tccaacgacc tccacgccct ggcggcagcc 2700421 aacatcgagt caggaccggg gccggcatat atgttggcgg agttgacctc cggtgggtac 2700481 gcttcgaaat gcatctgttt gttcgttgtt ccgtcggctc gtgacgactg tagtgactcg 2700541 ttaactaaag gtcttgatgt tgtcggcctc ggcggtcgca tacttatcgg cgccagtggt 2700601 caaggcgtgc gcaaactcct caagaaccac cgccgccgca gcgatggtct gccgatactt 2700661 ccttgcgtac tcgaccagga acgtcgccgc cttctccgac accagatccg cagccggagg 2700721 ccgcacagcg gttgtcatcg gggccacctg agcatcgctt tggatcgcac gatcgcgaat 2700781 ccgtcgtacc tccgtggccg ccacggtcaa cgcctcggga tttgtgatca caaaagacat 2700841 gccgactcca ctccgcgatt aacgaacccc cggcactgca ccgggctgat caaccaccgg 2700901 ttgtttgcgc tgcgactgcc cacgttgaga aaacgcaacg acttcactgg cataattatc 2700961 caacagacga agggatcaat tccgggtaag ccgctgcaaa tcaagccgac tcaaggcaca 2701021 caacgccacc cgacgatacc ccccatgctg acgttcaaag cagtagggat gtagcttacc 2701081 atgccccaat ggccgtcgga ggccagccac cagcgcacat gcctgcacgg tgcacctaat 2701141 cggtgccggc ttccagcggc cggcagttga cccgcgatga cagacacgcc caggctcgtt 2701201 gaggccgatc gcgttctcac acagcgcatt acggctcgtg acagatgagg gagtagagcg 2701261 ggtcggcaag ggaaacccat catggcgccg ggctccgccc cgggttgggc cagtgcgatg 2701321 gctgcgacgt cgtggaaaag cgcggtgggg tgctcgggca tgacggatgg gcactcatca 2701381 catcctcctg ctccacggca gtgttcggct cgcaccgtca ttgtgctgat ggatcagaac 2701441 gtccgctcgc acgccggaca ccgcgccgcg caccgaggta gccgacgcta gccaccagga 2701501 acgggaccaa ataattgatc accatccgta cccacgtgcc gatcgtcgcg gcgccctcgg 2701561 caagggtggc accctgattc accgcacata gcacggtacc cacgatcagc gccgtcggag 2701621 ctgcggtgcg cagggtgtgg ccgcgcagaa acagaccgat cgcttggcca accgtgtccc 2701681 accgctcgtc agcgtcgcgc aggcccacga tgctcgccgg ccgccgctgg ggattggtgc 2701741 aagtcgcggc gaattgcctg ttgcgccttc ctttgccgct cgtcgataac ccgacccagc 2701801 tcctgcagca gcatcggctt gttcatcacg acctgctcaa ggtgctcccg gccgatctgc 2701861 agcgcggtca cctcttccag cgctaccgca ccggcggggt cgggttgccg agtcagcgca 2701921 gtcaagccca gaaacgtgcc ctttttgagg gtggcaatcg caaccacgga tccgtcatcg 2701981 gtcgtaaccg tcagccgcac gctgccggcg atcacgaagg tgatacccat gggaaccaca 2702041 cccgcgtgct gcacgatctc atcagtgccg tagcgcacca gcctcgcgta ccgagccagc 2702101 gactgctgat cgctagagct cagtcgcagc tcgggcccca ccaccgtgcg cagggcggac 2702161 tccacacgtt cggccgtcga gaactcgtcg tcggcctcgt cgaggtgtag cccttcccga 2702221 cgcgcggcgt accagaccca gcgcagaaac gttgcctgcg ttgggccttc gtcggccggt 2702281 gatgtcagcc tgaccgtcgt tcgatactcg gcggctcctc gggcaattgt ggcgggcacg 2702341 accccgggct taacatgcgg tagcgcgctg gcagccctgt tcagcatggc gcataccttg 2702401 tccgggggat cggacgtgga aaatgtggtc gtgatcgagc attcgtgcgc cccggccggc 2702461 cggctgagat tggtaaacgc ggtggtggcc aacatcgagt tgggcatgat ctgcagtccg 2702521 ctgccggtgt cgatatggac agcccgccag ttcacctcga cgactcgtcc gcgggctgtg 2702581 ggtgtttcca accaatcatc gatccgaaag ggctgttcga acagcatgaa caagcccgac 2702641 acgatctggc cgacggagtt ctgcagcatc aggccgatga cgactgacgt cacacctaac 2702701 gcggcgaaca gtccaccgac ccgcaccccc cagatgtagg acaggatcac cgccaaacct 2702761 atgccgatca gcgcgaagcg cgcgacatcg acgaagatgg cgggtagccg cttgcgccag 2702821 ctctgttggg gcgcaccctg aaacagggtg gcattcagta acgacagcag cagcaccagc 2702881 accagaaatc cgaacgctgt cgtgagcacc cgcacggtgg ggtcctcggc cgggacttca 2702941 gatgccttga caagcagcag caaaaccgcg cccaggggta gcaggtagtt tcgcagcaga 2703001 cttgcctgcc tggccagatg gctgttccgt cggacgagta tgttgtgcag ttcggtgaga 2703061 acgattagcc cggccggcaa tccgatcgca atgccaacgg cccagtagaa ccatgtcgag 2703121 tcgagcaggt tcatgatcgc tccgacaatc ggtagatcgg ctcttctaac cctccgacag 2703181 aaatcgtgcc cgcagccgtg aactgccaca cgtctcgcat cgcctcatac acctgcgagg 2703241 tgacatagat gccgggctgt ggtgaaccgc tgtgcatttg gtaggccaga ctcactgccg 2703301 cgccccacat gtcgtagacg acacttgatc tgccgaccag cccgctaatg acgtccccgg 2703361 tgttgatacc gactcgcagg tgcagatcgt taccggtttg gcaattgaac cgatcgacga 2703421 tgcgccgcat ctctagggcg aagtcgacgg ttcggggaat attgtccagc cgtggcgtgg 2703481 ttaccccgca accggcgaga tagccattgt gcagcgtgcg aatgcgttcg acaccaaggt 2703541 gttcggcggc cgaatcgaac tggcggacca gctcgtcgac aattttgacc agttcgttac 2703601 ccgacaggcc gctggaaatc tcgtcgacac ccaggatgtc ggcaaacagg acggtgacat 2703661 cttggtgctc ctgcgcaatg gtctgctccc caaggcggta ccgctcgaca actggctcgg 2703721 gcatcatcga tagcaataac cggtcgtttt ccttgcgttg ctcgttgagc agctcctctt 2703781 tggtttgcag attccgactc atctcgttga aagcggctgt aagatcaccg atttcgtcgc 2703841 gtgactttac cggaatgttg acttcgtagt cgcctgcgct gatcttctgg gtgccaacct 2703901 cgagccgccg gattggccgc accatcgcat gggcgatcag catcgacgcc acacagatga 2703961 cgacaatgat gccaactgta accagcacaa gcgccctgct gaacgacgcg acggccgcga 2704021 acgcctcaga atcgttccgc gttgccagga tcgaccagtg cagatcggag tccggcacat 2704081 tcagcggcgc gtaggcctcc agttccctgc tacccgtgta gtcggtggag gtgacggttc 2704141 cggtctgtcc gcgttgggcg gcgcgcagtc cttcggtcgc aacaggctgc agcagcgtcg 2704201 tcccaccgaa ctggatcgct ctgttgacca catcaagtga cgtgcctgct gccacaacct 2704261 gtttccggta ttcctccggg tcttgcagga agagccgaga atcggaccgc atcagactgt 2704321 ccggaccggc gagataggtt tccgtcccac tacccatgcc agccgcttgc cattgcctgt 2704381 cggcggtcat gatcttattg atcttgtcga tcggcaacgg cagcgccaaa acgccctgag 2704441 ttttgccgcc cgcttcgacc ggtgccacca accacgcggt cggcacgccg agttgaggct 2704501 gatacggctt gaagtcggta atccaggtaa agtcgacggc gttggcgccc aacgctttaa 2704561 ggtaggcgtc acgcagattg gattcgcgat acggcccggt cagaatgttg gtaccgaggt 2704621 cggggtcctt gctcagggta tagacgatat tgccccgggt gtccagcaat accgcgtcgt 2704681 cgtaatcgaa ccgggtgacg atttcccgga aatagctgtt gaattgcgcg ttggcggccg 2704741 accatgcact gccgtcgccg gcatcgtcca gccgcatcgc atcttggtcc gacgtgaatg 2704801 gtgcagtgta gtacgcctga agataccttt gggccggaga agtcggcagc agcgcggtga 2704861 tgtcgagttt atcgccggtc gtgcgttcga cgggtgtgat gaattcgttg ttgtagtagt 2704921 tgacgatcgc ctgttgttgg gcggggctga tcgtggcgtc agccagctgg tcaaagccgg 2704981 ccgtgaaccg cacgacggca tcgacaaccg tgagtccacg ttcgtaaatg accagcgaat 2705041 tcgtcaggtc agaaaatagt gtctcaactg cccgcttctg cgactcgcgc aactgggtca 2705101 accgctcgta ggcggctgct cttagcgaag tgcgaccaga ttgatagaca atggccgcaa 2705161 tcgccgcgac ggacacgata ctcgtcaaca gcagcagcac catgagcttg gactggatgc 2705221 tggcccggaa acgcggccga cgccggagca cattcttatg gcgcttctta gccggggtgg 2705281 attcactctc ggctaccgag tccagtgcct cacccgacgt caaccggctt cccctctgct 2705341 gtcggtcgca ggctgctcta cgacgcgccg attacgcagc agactaacgt gcccagccca 2705401 cgatcatcgc gtttgccgaa aagccaccag cgaccaatcg aggaatgcgc cgcgtacggg 2705461 tcctcgtcat cgtcctcgtc gcgcacgcgc ttcttgaggg gtggtagcga ttggttctgg 2705521 gtggtccgta atcgaggtga gccaggatag ctcggccgta tcgcaatgtt cagcgagtcg 2705581 atggatcaac acatgccccg gcaccggcaa ccgctgcgga gtggtcaacg catcaaacga 2705641 cgacgcactc agaccatcga ccgacagcat cgagccggtc cagcatcccg accacccgtc 2705701 ccgatccgca agcatgttcg aatactggga cgcgccaccg acaggacacc cactgatacc 2705761 cttgggacaa aagtgacaca agtgatttca gccaacagca agcatggcaa acgccagtga 2705821 gactaacgtc ggccccatgg cgccccgggt gtgcgtggta ggcagcgtga acatggacct 2705881 gacgttcgtg gtggacgcgc ttccgcgccc cggcgagacg gtgcttgcgg cgtcgttgac 2705941 ccgaacgcca ggcgggaagg gcgccaacca ggcggtggcc gcagcgcgcg caggcgcgca 2706001 ggtacagttc tccggtgcat tcggcgacga tccagccgcc gcccagctgc gggcccacct 2706061 gcgcgccaac gccgttggac tggacaggac cgtcacggtg cccggaccga gcgggacggc 2706121 gattatcgtg gtcgatgcca gcgccgagaa caccgtgctg gtggcgccgg gtgccaatgc 2706181 acatctgact ccggtaccct cggccgtcgc caactgcgat gtactgttga cccagttgga 2706241 gattcctgtt gcaaccgcgc tggcagccgc gcgggcagcc cagtcggccg atgcggttgt 2706301 catggtcaac gcctccccag ccggccagga tcgaagctcc ttgcaggact tggccgctat 2706361 cgccgacgtg gtgatcgcca acgagcatga ggcaaacgac tggccgtcgc caccaacaca 2706421 tttcgtgatc accctgggtg tgcgcggtgc ccggtacgtc ggcgcggacg gggtgttcga 2706481 ggtacccgcc ccaacggtaa cgccagtgga taccgccggc gccggcgacg tatttgccgg 2706541 ggtccttgct gcgaattggc cgcgcaaccc aggttcgccg gccgagcgac tgcgcgcatt 2706601 gcggcgggcc tgcgctgcgg gtgtgctggc aactttggtg tccggtgccg gcgactgcgc 2706661 accggccgcc gccgcgatcg atgcggccct gcgagccaac cgccacaacg gttcatgacc 2706721 actgctacgc accgaaggag acccgctgat gcgaacgacc accgcggccg acttagcgct 2706781 ggcactcttc gcggttttca gtgtggtcgg attcggctga cgcagttggc tgcagcaccg 2706841 acgcaccgga tccaccggct ttcgcggcgt cagcggccgg gtcggttcgc tggagtggat 2706901 taccgggacg tgctttgtca tcgccctgat cgtgacggtg gtcgctgcgg tgctgcagcg 2706961 gaccaacgtt gtccaaccgc tgaatactct gcgcatggtc tggattcagg ttgccggcat 2707021 aatcccggcg acggccggga tcgcggccac ggtttacgcc cagcttgcga tgggcgattc 2707081 gtggcggatc ggggtggacg agcaggagaa caccactctg gtgcgcaccg gcccgtttaa 2707141 atgggtgcgt caccccatct acacggccat gatggcgttt ggcctcgggc tgttgctggt 2707201 gactccgaat ctcgttgccc tcgccgggtt tatcctgctc gttgccacgc tcgaggtgca 2707261 tgtccgccgc gtcgaagaac cctacctgtt gcggacgcac agtgccgtct accgcggcta 2707321 caccgccagc gtcggccggt tcgtcccggg tgtggggttg atccgctagc ccttgggcac 2707381 ctcacggtcg atctgatcga gccagattcg cgctgacata tccgacgggg cccgccaatc 2707441 cccacgcggc gacaacgcgc ccccgtggga caccttgggg ccgttgggca atgccgaacg 2707501 cttgaactgg ctaaacgaat aaaaccgctg gacgaaaatc tgcagccaat gccggatttc 2707561 ggccaatgaa taggacgggc gttcgctctt tgggaagccg ggcggccagt tgccccgctc 2707621 cgcatcgttc cacgcatgcc aggccaaaaa cgcaatcttc gacgggcgaa atccgtagcg 2707681 cagtacctga aaaagcgaaa agtcctgtag ggcgaaaggt ccgaccttgg cctcgctgct 2707741 ctgcagctcc tcctcgccgg tcggaatgag ttcgggggtg atctcggtgt cgagcaccga 2707801 ctgcaatacc tcacccacct tctcaccgaa ctcacccgcc gaaatgaccc accggatcag 2707861 gtgctggatc agcgtcttgg gcacaccggc gttgacgttg tagtgcgaca tctggtcgcc 2707921 gacaccgtat gtcgaccaac ccagtgccag ctccgacagg tccccggtgc ccagtacgat 2707981 tcccccgcgc tggttggcga tacggaaaag atagtcggtg cgcaacccgg cctggacgtt 2708041 ctcgaaggtg acgtcgtaca ctttttcgcc aaccgaatac ggatggccga ttgtgtgcag 2708101 catcaaccga gcggtgtcgc cgatatcgat ttcggagaag gtaaccccca gcgcacgtgc 2708161 cagcttgatc gcgttgttct tagtgtgctc cccggtggcg aatccgggca acgcaaacgc 2708221 cagaatgtcg ctgcgcggcc ggccctcgcg gtccatggca tgggtcgcga cgatcagcgc 2708281 gtgcgtcgag tccaatcccc cggacacacc gataacgacc ttcggatagt ccagcgcccg 2708341 caaccgttgc tcgagtccag acacctggat gttgtaggcc tcgtagcaat cctgttgcaa 2708401 tcgttgcgga tcggccggaa cgaacgggaa ccgctcgacc tcgcgcagca gtccgatgtc 2708461 gcctgccggt gggtcgagtg cgaagtcgat gcgccggaac gattccgtta actcccggtg 2708521 gtgacgccgg ttgtcgtcga acgtgcccat ccgcagccgc tccgaccgaa gcaactcggt 2708581 gtcaacgtcg gcgacactgc ggcgcactcc tttggggaaa cgttcggact ccgcgagcag 2708641 tgcgccattc tcccagatca tcgtctgacc gtcccaggcc aggtccgtcg ttgactcccc 2708701 ctcccccgcg gcggcataga cataggcagc cagacaccgc gccgacgccg agcgcgcaag 2708761 cagccggcgg tcctcggcac ggccgatggt gatcgggctg ccggacagat tcgccagcac 2708821 cgtcgcgccc gccagggccg cctcggcgct gggcggcatc ggcacaaaca tgtcctcgca 2708881 gatctccaca tgcaacacaa agccgggtag atctgacgcg gcgaacaaca ggtccgtgcc 2708941 gaaggccacg tcggcgccac cgatgcggat cgtgccccgc tccccgtctc cgggcgccat 2709001 ctggcgccgc tcgtagaact cgcgataggt gggtagatac gacttgggca ccacgccgag 2709061 cacggcgccg cggtgaatga cgaccgcggt gttgtagatg cggtgtcgat gccgcagcgg 2709121 agccccgacc accagtacag gtaacaggtc ggcggattcg gtcaccaggt cgagcagcgc 2709181 gtcctcgacg gcatcgagca gagagtcctg cagtagtacg tcctcgatgg agtagcccga 2709241 cagcgtcagc tcaggaaaga ccgccaacgc tgcgccatcg tcgtggcacg cacgggccat 2709301 gtccaatacc gacgcggcgt tggccgccgg gtcaccgatg gtggtgtggt gagtgcaggc 2709361 ggcaacgcgc acgaacccgt gctggtaggc ggagtaaaag ttcatcgtcc tttcattgtc 2709421 gcccagcgac gtcagaacgc ccgaatcacc cgccgagtat ccacgctcga caccgtggaa 2709481 tcccccgcgc tgctggcaga tggcggcatt gaccggcgtg gggatgctac cgactgggcc 2709541 gctgccgacc ctgggccctg attggccgcc gagcagtccc atgacgatcc actagttcac 2709601 ctcggatacc cgctcggccg caatgcgcag ctagcggcca tgttgatcga aatcatttgg 2709661 ggtacaccgc atctcggagc aatatggtag ctaaacttgc ttagcttgct tcgccgacac 2709721 cgcgaccaga tcgtcggcgt gcaccaccgg gcggcgcagc tcgccgggta gctcagaggt 2709781 ggaccggccc accatggtgg ccagctcgga cgcgtcgtag gcaaccaccc cgcgggctac 2709841 catggccgcg tcgggtgcac gcagttcgac cacatcgccg ccgcaaaacc ggccggacac 2709901 cgcggtgata cccgccgcca gcagtgaccg gcgttgtcgc accacagcgc gcaccgcacc 2709961 ggcgtcgaga gtcagtgcgc cggttgcttc ggcggcataa cgcacccaga accgccgggc 2710021 cgacagacgc gcgggccggg ccgcaaacac cgtgcccacc gacgcgtcgg cgagcgcggt 2710081 cgcggcgtcg gccgcggggg ccagcagtac cggcaccccg gcgtcggcgg ccaacagcgc 2710141 cgccgacacc ttggacgcca tgccgccagt acccaggtgg ctactgcggc cggcgaccac 2710201 accgtccaga tccgccggcc cggacacctc cggaatgaac gtcgcgtccg cggttttgcg 2710261 cgggtcgcag tcgtagaggc cgtcgatgtc cgacagcagc accaaagcgt cggcgccgac 2710321 caggtgcgcc accagtgcag acagccgatc gttgtcaccg aaccggatct cgttggtggc 2710381 cacggtgtcg ttctcgttga caatcgccac cgcgtgcaac gcgcgcagcc gatccagcgt 2710441 gcgttgggcg ttggtgtgct gcacccgcat cgaaatgtcg tgcgcggtca gcagcacctg 2710501 gcccaccgtg cggccgtagc gggcgaacgc cgcgctccac gagttcacca gcgcgacctg 2710561 cccgacgctg gccgccgcct gcttggtcgc cagatctttg ggacgacggg acagcccgag 2710621 cggctcgatg ccggcggcga tggcgcccga agacacgatg acgacgtcgg aacccgcctt 2710681 catccgccgc tcgaccgcct cggccagtcc ggccagccgg ccggcatcga acatcccgga 2710741 cggtgtggta agcgccgtgg tcccgacctt cacgacaagg ccgcgcgcgg tccggattgc 2710801 gtcccgatgc ggacttctca tcagccatcc ccgtgttcgc gacgccgact ccgagcggcc 2710861 tttcgctcgg ccgcgcccac ccgcttgttg ctgtccagcc gcggatcggt gccccggccg 2710921 gacatcgcga ccggctcacc cgcaggcgtt tgcggctccc aatcgaacgt catctcgccg 2710981 atggtcaccg cgcatcctga ccgcgcaccc agcctcagca attcctcctc gacacccagg 2711041 cgcgccagcc ggtcggcgag atagccgacg gcctcgtcgt tgtcgaagtt ggtctggtca 2711101 atccaacgct cgggccgggc accgctgacg acaaagccac catgcccgtc gggttcgacg 2711161 gtaaaaccgc tgtcgtccac cggaatcgga cgaatcaccg gccgccgtgg caccgccacc 2711221 ggccgcgcag cgttgtagtc cgagatcatc tgcgacagcc caaagatcaa cggctgcagg 2711281 ttttcccggg ttgcggtcga cacgcagaac accggccagc cgcgctgggc gatgtcgtca 2711341 cggacgaact ccgcgagctc gcgggcctcc ggcacatcga ttttgttgag gaccaccgca 2711401 cgcggccgtg cggcgagatc gcccagagcc gcgtcccctt gcagcgtggg cgtgtagcac 2711461 gcgagttccg tttccagcgc gtcgatgtcc gagatggggt cgcggcccgg ctcggcggta 2711521 gcgcaatcca ccacatgcac cagtacagcg cagcgctcga tgtgccgcag aaagtccagc 2711581 cccagaccac ggccccggga tgcgcccggg atcaaccccg gcacgtcggc gacggtgaac 2711641 gcgtgctcgc cagccgagac cacaccgagg ttgggcacca gggtggtgaa cgggtagtcg 2711701 gcgatcttcg gcttggccgc cgaaatcgcc gacaccagcg aggattttcc ggccgacgga 2711761 aacccgacca ggccgacgtc ggcgacggtc ttgagttcca aggtgaggtc tcgggactgt 2711821 cccttttcgc cgaggagtgc gaaaccgggg gccttacgca cgcgggaagc cagcgcggcg 2711881 ttgcccaaac cgccacggcc tccggcggcg gcttcaaagc gggtgcccgc gccgaccagg 2711941 tcggccagta gccggccgtt ctcgtccaat accacggtgc cttcgggaac tttcacttcc 2712001 aaatccgcgc cggcggcccc gtcgcggtta ttgcccatcc cgtgcttgcc cgaagccgcg 2712061 gtgagatgcg ggcggaaatg gaagtcgagc agggtgtgca cttgcggatc gacgacgaag 2712121 acgatgctgc cgccccggcc gccatttccg ccatcggggc cgcccagcgg cttgaatttc 2712181 tcgcgatgga ccgaagcgca gccgttaccg cccgaacccg ctctggtgtg gatgacgacc 2712241 cgatcgacaa accgaggcac cgagctcccc ttcatctgcg gagtgtgcag ctactgcggg 2712301 ttttgcccct cgtgaatctt cgcagtgggc gcacacgcgc gacgctcagg cagtggtcga 2712361 accgacgatg ctcaccgtct tacgtccgcg tttgatgccg aactcgaccg ccccggccgt 2712421 cttggcgaac aaggtgtcat cgccgccacg cccgacgttg acgccgggat ggaatttggt 2712481 accgcgctgg cggaccagga tctcgccggc cttgacgacc tggccgccgt accgcttaac 2712541 ccccagccgc tgggcggcgg aatcgcgacc gttgcgcgag ctggaagccc ccttcttgtg 2712601 tgccatgtct gtcgcctccg ttatgcgatg ccggtgacct tcaggaccgt cagctgctga 2712661 cggtgtccct gccgtttgtg gtagccagtc ttgttcttga acttgtggat acggatcttg 2712721 gggcccttgg tgtgcccgag cacctcaccg gtcaccgcga ccttggccag tgccttcgca 2712781 tcggtggtga cggtggcgcc gtcgacaacc agagccaccg gcagggacac cttctccccc 2712841 tgctcggatt ccagcttttc gaccttgacc acatctccga cagcgacttt gtactgcttg 2712901 ccgccggtct tgacgattgc gtaggtcgcc atcattgctc ctgcctcttc atacttccgc 2712961 tgcatgcgtt gcgcttcgcg cgcgggccag cggcgggacg cgtgctgggt cttgggcggg 2713021 cacctacaac ggaccccgca tcgtctccag ccgtcagcct ggcgacaact ggtcaagggt 2713081 acgtgacctg caactacggg gtcaaaccag cggggcctca gcgagatcga cgccagcaca 2713141 cgaaagtgcg ccggtagcgt cgatctcgac gctaccggcg cactccgggg cccgggtggt 2713201 gacgtcatcc gggttggacc gctgatggct gcggctaaca tcgtgccgaa tcgcgtccga 2713261 tgtcgacctg gaggaaccgc cgatgaccgc ccccttggat cgtgcgccgg tcacggattt 2713321 gccggctaac aacaaaggcc gagaccgcac ccactggctg tatctcgcgg tcattttcgc 2713381 agtgatagcc ggtgtgatcg tggggctgac ggcgccgtcg accggaaaaa gcctcacggt 2713441 gctcgggacg gtgttcgtca acctgatcaa gatgatgatc gcaccggtca tcttctgcac 2713501 gatcgtgctc gggatcggct cggtgcgcaa agccgcggcc gtgggcaagg tcggcgggct 2713561 ggctttggcc tactttctaa cgatgtcatc ggtggcgctc gggatcgggt tgatcgtcgg 2713621 caacctactc agtccgggta gggatctgca ccttaggcct ggtgcggtcg gaagcggcgc 2713681 agcattggcc ggccaggctg cggagtcaca cggaatcgct gggttcatcc agcagatcat 2713741 tccgaggtcg ctcccctcag cccttactga aggcaacgtg ctgcaggtgt tactcgtcgc 2713801 gctgctggtc ggtttcgcgg tccaaggcct gggccccgca ggcgagtcca tcctgcgtgc 2713861 cgtcgagaac ctgcaaaagc tggtgttcaa ggtgctcgtg atggtactgt ggctggctcc 2713921 gatcggcgcg ttcggtgcga tcgccaatat cgtcgccacg actggcttca acgccgtcac 2713981 caacctgctg ctgctgatgg ccggcttcta cctgacgtgc gtggtgttcg ttttcggcgt 2714041 cctgggagtg ctactgcgca tcgtgtcggg tttgtcgatc tttcggctgc tgcgctatct 2714101 agcccgcgag tacttgctga tcttcgcaac atcgtcgtcg gaggtggtgc tgcccagact 2714161 gatcaccaag atgaaacact tgggcgtgca atccagcacg gtcggcgtgg tggtgccgac 2714221 cggctactcg ttcaatcttg acggcaccgc tatctatctg accatggcgt cgctgttcat 2714281 cgccgacgcg atgggacatc gcttgacatg gggcgagcag atcgcgctgc ttgcgttcat 2714341 gatcatcgcg tccaagggcg ctgccggggt cagcggtgcg ggccttgcga cgctggccgg 2714401 cggcctgcag gctcatcgcc ccgagctgct ggacggtgtc gggctgattg tggggatcga 2714461 ccggttcatg tcggaagccc gttcgctcac gaacttctcc ggcaacgccg tcgcaaccat 2714521 cctggttgcc tcgtggacaa agaccattga cctgtccaaa gccgacgagg tgttgcgcgg 2714581 tcgtgatccc ttcgacgaat cgaccatggt cgatccccac gatgaggagc cacccgccgc 2714641 cacaccccac gggggcggcg tcccgacgaa ccctgcgctg tgcgatttcg agcaggtcag 2714701 tctaggcgga ttggtgggcc ggccggccgg cccgcaacgc gccgacgtgg acgggtaggg 2714761 gccagctccg tgacaccggg gacgtcgact tcgcccgggg aaccgtccaa gccggctgca 2714821 tcctcctcgt cgacgtcggc atcggcggcg tcttcgtcgg agtcttcgtc gtcggaatcg 2714881 gagtcttcaa cgtcgagatc ctcgtcgagg tcctcgtcgt cgaggtcctc gaggtcctcg 2714941 tcggcgtcga gctcgtcctc gtcttcgtcg gtgtcctcgg tgtcctcgaa gtccgcttgg 2715001 gcggtgtcgt cgaggtcagt gggcggttga tcgccggcct gctcggcgag ttcggcagcg 2715061 ggctccccgg attcctcgtc accgcgacca gccagcgagg acaagcccgc tgccatcgcc 2715121 ttgaacatgg gatgctcacc gggagcgtgc acggggacct tggcgaccat gctcctatca 2715181 ctggactctt cggaccggct ctttttcgat cgcttgcccc gccgagcacc gggctcagac 2715241 tttcgcccag tcgccgcggc cgaatcgacc gggtcggcgt gcagcaggat cccgcggcca 2715301 ctgcagttcg gacacgatgt ggagaacgct tcgatcagtc cggttcccaa ccgcttgcga 2715361 gtcaactgca ccagccccag cgacgtcacc tcggacacct ggtggcgggt gcgatcgcgg 2715421 gccagcgact cggtcaaccg gcgcaacacc aagtcgcggt tggactccag caccatgtcg 2715481 atgaagtcga tgaccacgat gccgccgata tcgcgcagcc gcagctggcg cacgatctcc 2715541 tcggccgctt ccagattgtt cttggtgacc gtctgctcga ggttgccccc ggctccggtg 2715601 aatttaccgg tgttgacgtc aatgaccgtc atggcttcgg tccggtcgat caccagcgtc 2715661 ccgcccgacg gcaaccacac cttgcggtcc atcgctttgg ccagctgctc gtcaatgcgg 2715721 tgcaccgtga agacgtccgg cgcggactgg ccatccggcc cgtcagcgga ctcgtacttg 2715781 gtcaacttcg aaaccaattc gggagcaaca gaattcacgt attcattgat cgtgttccaa 2715841 gcctcgtcgc cggaaacgat gaggccgacg aagtcctcgt tgaacaggtc acggataacc 2715901 ttgaccagca cgtccggttc ttcgtacagc gccaccgcag cgcccgcggc cttctccttg 2715961 gtctcttgtg ccttggcctc gatctgctcc cagcgttccc gtagccgagc gacgtctgcg 2716021 cgaatgtcgt cctctttgac gccctcagac gcggtacgga tgatgacccc agcgtcagac 2716081 ggcaccacct cgcgcaggat ctccttgagc cgctgacgtt cagtgtcggg cagcttgcgg 2716141 ctgatcccgg tcgacgacgc gcccggcaca taaaccagaa atcgaccggc cagcgacacc 2716201 tgcgtggtca gccgcgcgcc cttatgccct accgggtcct tgctgacctg caccacgaca 2716261 tagtcgccgg gtttgagggc ctgctcgatc ttgcgatcgg ccccgcccaa ccccgctgca 2716321 tcccaattga cttcaccggc gtagagcact ccattgcgac cgcgcccgat gtcgacgaac 2716381 gccgcctcca tcgacggcag cacgttctgc acaattccca ggtagatgtt gcccaccagg 2716441 gaagccgagg ccgcagacgt cacgaaatgc tccacgacga taccgtcttc gagcaccgca 2716501 atctgggtgt accgcgtgcc cggcagcggt ggctcggtgc ggacccggtc gcgcaccacc 2716561 atcacccgct cgaccgcctc acggcgagcc agaaactcgg cctcactcaa caccggtggg 2716621 cggcgccggc cggcgtcgcg cccgtcgcgg cggcgttgcc gcttggcttc caggcgggtc 2716681 gagccgtcga tgcccttgat ctcagtggag ccagagccgc catcctgcga gttgccggcc 2716741 ttgtcacccg cgcggggcac gcgttcgtgt acgacagtgt tgggcggatc gtcaggcaac 2716801 gggccctcta acgcagcgtc gttgtcgtca ccagaagccg acttacgccg tcgccggcgg 2716861 cgccggcgac gattgccggc ctccagcgaa ccgttttcgt cctcgccgtt gtcaccggct 2716921 tcggtatctt cggaatctcg atcgtcgccg tcgtcggttt cggcggcgtc cgcgctggta 2716981 aattgttggg cccggggctc ggattgctgg tcaaccggat caccgtcgga tccaccctgc 2717041 tccccgcgtc cgcgaccgcg accccgacgg ccgcgacgtc gccgccggtt cgccggccgg 2717101 tctagctgcc cttcgtcgtc agcgtcggaa tcgtcggcga cgtagtcggg gccatcgtcg 2717161 acgtcctcgt cgtccgctaa cggctcggga atcggctggg gcgcgacgaa cagcggcata 2717221 tagtgcggcc gctccacgtc ggcattccga gtctcctggg tctctagcat cagccgggac 2717281 tcgggttcct cggacgcttc gggcgcatgg accgaggccg ccagcacgcc ggcagtctcg 2717341 agatgagtgg ccagcagatc gcgcacccgg accgcatcga cgcgatccac cgtggaatgt 2717401 gcgctgcgga cccgtccgtc gagcgcggtg agcgcatcca gcacccgcct gctggtggtt 2717461 cccagcgttc gtgccagcga atggactctt aggcggtccg gcagttcctc atgctggctc 2717521 ggttctggtg gatctgaagg tggggcaccg tctatcacgt attctcctca agcccccggg 2717581 cgcgtcttga tcgacgcggc cacgcgaggg cttcgctatc tgcccgggtc acttgtctcc 2717641 cgagcttgtg atggtcttgt cccgagcagc tcatgacgaa cccactcggc accgtgctga 2717701 atgacggccc gacatgccgc gccgcatcga aggatggcga tggtcgcggt tgcctaagtc 2717761 ttcattcggg cgtccgacac cgcttcggcg acgttcaccc gtcatcagta tcccacatca 2717821 ctgggccgag tcaccttccc ttgagctggg gtgctgccca agccgcccgg gatcggcgca 2717881 cccggagcta ggcgccggga aaccagagcg cgatttcgcg ctgcgcggat tcggccgaat 2717941 cagacccgtg caccaggttg aactgcgtct ctagagcgaa gtcgccccgg attgtgccgg 2718001 gcgccgccgc ctgcaccggg tcggtgccgc cggcgagttg gcgaaccgcc gcgatggctc 2718061 gggttccctc cacgatcgcc gctaccaccg gacccgacgt gatgaactcc agcaacgatc 2718121 caaagaatgg tttgccttca tgttcggcgt agtgctggct ggccaactcc gcgctgacgg 2718181 tcctgagctg cagcgcagcg atggtgaggc ctttgcgctc gatgcggctg atgatctcgc 2718241 cgatcagctg cctttcgatg ccatccggct tgatcagtac cagagtccgt tcggtcacgg 2718301 tgcccaacac tagatgccgc aagatgtatg cccaaaccgg tcattgcgac acccggtaat 2718361 cccgacgccg ccgcacctcg gcacgcaaat acgcgatcag gacccacaac gcggcgaaca 2718421 gcacgccaat gaaacccaca cccgggtaca cggcgaagcc ggcaaccagc accggttgtg 2718481 cgcccaggtt cacccagatt gcccagggtc tgcgctgcag cccggtcagc agtatcaaca 2718541 gcacggccag accgaccaaa tagcccagcg aggccggacg cagcccaccg ccgaccgcgt 2718601 ccactaccgg tattgccagc agcaccacga tcgcctcgag gatcagcgtc gccgccatca 2718661 ccgcgctgaa tcccttccac gggtcagccg gctcacgcga ccggtcggtc attgcggatc 2718721 acgaccgaac aaggtccgag ccgcccctgc ggtgacaacc gagccggtga tgacgatccc 2718781 ggttctcgag aatgcgtccc cggccacatc cgggtcggcg gcggcgtcgt cgaccagtga 2718841 ggtggcaacg tcgatagcat cgcgcaggtt ctcggcggtg cgcacccggt cgggtccgaa 2718901 ccgctcgccg gccgccagcg ccagggcctc gacatccagc gcccgcggcg acccgttgtg 2718961 ggtcacgacg acggaatcga acaccggctc cagtgcggcc aggatgccgt ccacgtcctt 2719021 gtcgcccagc acgctgagca ccccgaccag aaatcggaag tcgaactcat gcgccagcgt 2719081 ttgtgccaga gcactcgccc cggccggatt gtgcgcggcg tcgatgaaca ccgtgggtgc 2719141 gctgcgcatg cgctccaacc ggccgggact ggtgacggcg gcaaagccgg cccggacggc 2719201 gtcgccgtcg agctgacgct gcgcaccggc accgaaaaag gcctcgacgg aagcgagggc 2719261 gagcaccgcg ttgtgcgcct ggtgttcacc gtgcagcggc aagtagatgt cggagtaaac 2719321 cccgccgagg ccctgcagtt gcagtacctg accgccgacc gcgatctgtc gccgtagcac 2719381 cgcgaattcg gaatcctccc gggccaccga cgcgtcggcg cgcaccgatt cggccagcag 2719441 cacctccatg accttcggga cctgacgccc gatgaccgcg acggtgtccg gcgaaccgtc 2719501 gggggcccga gtgatgatgc ccgccttctc cccggcgatc ccggcgatat cggcaccgag 2719561 atagtcgacg tgatcaatgc tgatcggggt gatgacggcg accggtgcgt tgatcacgtt 2719621 ggtggcgtcc caacgtccgc ccatgcccac ctcgaccact gccacgtcga cgggcgcgtc 2719681 cgcaaaggcc gcgaacgcca tcgcggtgag cacctcgaac ttgctcatcg ccgggccacc 2719741 cttacccgca gaagcctgcg actgctggtc gatcagcgcc accaacggct cgatctcccg 2719801 gtaggtcgcc acatactgcg ccgggctgat cggcttgccg tcgatcgaaa tgcgttccac 2719861 cggtgactgc aggtgtgggc tggtggttcg gccggtgcgc cggtgcagcg cggtgaccag 2719921 cgcgtcgacc atgcgcgcca ccgaggtctt gccgttggtg cccgcgatat ggatcgacgg 2719981 atagctgcgt tggggcgagc ccagcaggtc catcaacgcg ctgatccggg tcaggctcgg 2720041 atcgatgcgg gtctccggcc agcgttggtc gagtagatgc tcaacctgca gcagggacgc 2720101 gatctcgtcc ggagtgggca cgacgccggt ggccgatccc gagtcaggcg ggccggaatt 2720161 cgtcgaattc attgcagcgc agccaaccgg gtggtgatgc gctcggtttc ctgctgcgcc 2720221 acgcgctggc ggtcccggat cttggcaatg acggcgtcgg gcgctttggc cagaaagtcc 2720281 gcgttggcca acttggcggc ggtcgacgcc agctcctttt gggcgccggc caactccttt 2720341 tccaggcggc gacgctcggc ggccacgtcg atggtgcccg aggtgtcgag ctcgacgacg 2720401 acggtgcggt tcatctcggg gccgagccga acctccaacg agaccgacgg ctcaaaatcc 2720461 gggcccggct cggtgagcca cgccagcgag gtcacggcgg ccacctggtt gctcagatcc 2720521 gagtcccgca caccgtgcat tcgggccgga accttctgcc ggtcggccag accttgatcg 2720581 ctgcggaacc gccgcacttc ggtcaccaac ttctgcatat cgttaatccg ttgcgcggca 2720641 acaaggtcca cgctaatccc ggaaggctcc ggccagtcgg cgctgaccag cgattccctg 2720701 ccggtcagcg ccagccatag cgcctcggtg aggaagggaa tcaccgggtg cagcaggcgc 2720761 agcagcgtgt ccagcccggc ggccagcacg gcggtggtgt gtgtgagtcc ctgggcaagc 2720821 tgcgttttgg ccagttcgag gtaccagtcg cagaattcgt cccaggcgaa gtgatacagg 2720881 gactcacaag cgcggctgaa ctcgtatccg tcgaaggccg aatcaacttc ggcccgaacc 2720941 tcttccaacc ttccgagaat ccagcggtcg gcgtcggtca gctcgttcgg cgatggcagg 2721001 ggtgctggcg cggcgccatt gagcagtgcg taccgagtgg cgttgaacag cttggtcccg 2721061 aaattgcgcg acgcccgcac ggcatcctcg ctcaccgcca agtcaccacc gggactggcc 2721121 ccgcgggcca gcgtgaaccg cagcgcatcg gccccgaaca tttccaccca atccagcggg 2721181 tcgatgacgt tgcccttgga cttgctcatc ttgcggccag actcgtcgcg gatcagccca 2721241 tgcagaaaca cgtcggtgaa cggcacctgc gggccccggc ggccgtcgag ggtgatggcg 2721301 gcgtcgtcgc cgacgaaggt gccgaacatc atcattctgg ccacccaaaa gaacaagatg 2721361 tcatagccgg taaccagaac gcttgtcgga tagaactttt ccagctccgc cgtcttgtcc 2721421 ggccaaccca gcgtggaaaa cggccacagc gccgacgaaa accaggtatc cagcacgtca 2721481 ggatcctgtt cccagccctg cgggggtgtt tcgtccgggc cgacgcacac ctgttcgccg 2721541 tcgggtccgt accagatcgg gatccgatgc ccccaccaga gctgtcgcga gatgcaccag 2721601 tcgtgcatgt cgtcgaccca ggagaaccag cggggttcca tgctggccgg gtgaatcacg 2721661 gtgtccccgt tgcgcaccgc atccccggcc gctttggcca gcgattccac ccggacccac 2721721 cactgcaggg atagccgcgg ctcgatcggc tcgccgctgc gttcggagtg tccgacgctg 2721781 tgcaggtagg gtcgcttttc ttcgaccacg cggccctggg ccgcgagcgc ttggcgcacc 2721841 gcgacccgtg cctcgaagcg gtccatgccg tcgaatcgcg ttccggtgtc gacgatccgg 2721901 cccttggtgt ccaggatcga gggcatcggc agctggtggc gcaccccgat ttcgaagtcg 2721961 ttggggtcgt gggcgggtgt gactttgacc gcgccggtgc cgaattcagg gtccacgtgc 2722021 tcgtcggcga caatggccag ctcccggtcg acgaatgggt gcgccaggct ggtgccgacc 2722081 aggtgacggt agcgctcgtc atcgggatgg acggcgatcg cggtatcgcc cagcatcgtc 2722141 tcgacccggg tggtggcgac cacgatgtgg ggttgcgagt cgtcaagcga gccgtaccta 2722201 aacgacacca gctcgccttc gacgtcgcgg tagttgacct cgaggtcgga gatcgcggtc 2722261 tgcagcaccg gcgaccagtt gaccagccgc tcggcccgat agatcagccc ggcgtcataa 2722321 agccgcttga agatcgtgcg caccgcccgc gacagacctt cgtccatggt gaaccggtcg 2722381 cggctccagt ccaccccgtc accgagtcgg cgcatctggc cgccgatggc accgccagac 2722441 tctcgcttcc aatcccacac cttgtccacg aacagctcgc ggccgaggtc ttctttagtc 2722501 ttgccgtcga ccgccagctg ctgctcgacc acgctctggg tggcgatccc ggcatggtcg 2722561 gtgcccggct gccagagcac ctcatagccc tgcatccgct tgcgccgcgt caaggcgtcc 2722621 atcatggtgt gttccagcgc gtggcccatg tgcaggctgc cggtcacgtt cggcggcggc 2722681 agcacgatcg aataggccgg cttggtgctg gtcgggtccg cggtgaagta gccagcgtcc 2722741 agccacttct gatagatggc gctctccatc gcggccggat cccacgactt gggcagcata 2722801 tcggcggcag ggtgagggct ggcggtcacc gatcaattct aggaaccgct tcacaccggc 2722861 atgaaagcgc ccgaaaccgc ccggattcag ctagccagtc gcgtggtctg cagcgacaca 2722921 ccggcggccg gcaaacgctc cagcagggcg tcacccattg cggccgcggg ggttaacaca 2722981 ccacgcatgt cggacagctt gtcgcgatcc agtgccagcg ccagaccaca ctcccccaac 2723041 aacaccgacg tcgccttgta gccggggtca ccatcttggg ccatgcgcgc caggtaccgg 2723101 gctccggtgg ttgtggtggt gtaggtctcg atgcggtagt agccgcgctc gcgagccgcc 2723161 gcactggggc cggtgccggg tttggggacg acacgcttta ccagtccccg cggcagcagg 2723221 cggatgtagc ggctggccaa gccgaacatc gcgttgccga caccgccgcc gacaaccgat 2723281 accaccggcg ccagcaccgt ggaccctacg ctcatggttt cgctgtagcg gaaccgccgg 2723341 ccgtaggccc agtccaggag cgcgttgctg cggcgcacga tccgggtgtt ggtgggcgcc 2723401 atgatgaatc ccgcggtcca cacaccggcc agttccggcg cgagccgacg gccacgacgc 2723461 gacggcaggt caggctgtgg gcccagttcg ggttcggcgc cgcggtctgg gctcagcatg 2723521 taggggtcgg atagctggcg gcgcgcatcg ggatcgttag aagcggtgct caacacctcc 2723581 agcatcgatg cgatggtgcc gccggagaac ccgcctttga aggaacgcac cacgcagttg 2723641 gtgtcggtca gctcgccggc gccgtcttct cgtgccgcgt ggtatagggc gtacacgctc 2723701 agatcagatg ggacggagtc gaatccgcag gcgtgcacga tgcgtgcacc ggtgtcggcg 2723761 gcctgcttgt ggtacaagtc gatgctgttg cgcatgaaca tcggctcgcc ggtcaggtcg 2723821 gcgtagtcgg tgccggcggc agcgcatgcg gccaccagcg gcagcccgta gcgagtgtag 2723881 ggcccaacgg tggtgaccac gacctgggcg cgggcggcca tggcttgcag cgtcgacggc 2723941 aacgacgcgt cggcggtcag gatcggccag gtctgcgcgg attcgcccag ggcttcgcga 2724001 acggcgagca cccgttgcgt cgacctgccg gccagcgcga tccgggcatc tcccccggcc 2724061 cgggccaggt attcggcggt cagcttgccg acgaagccgg tcgccccgta caacacgatg 2724121 tcgaattcac gcggcgtagc ggtcacgggt ttgacgctac tccggggtgc gcgagcagac 2724181 gcaaaagctc ccaaatccga ccggatttgg gagcttttgc gtcttttcgc ggtggtcagc 2724241 cgcggcggcc gcagaccggc caggcgcgga taccctgcga acgcagcacg ttctcagcca 2724301 cccggatctg ctcctcccgg ctcgcgttgg ccgcggaccc cgagccaccg ttggcacgcc 2724361 aggtgccggc ggtgaactgc aggccgccgt agtaaccgtt accggtgttg atcgaccagt 2724421 ttccaccgga ctcgcactgc gcgatcgcgt cccagttcac gctgtaggcc acgggcacgg 2724481 gaggcgcttc ctccgcaggc ggggacagga agtccggggc cagcggcggg gggaggttgg 2724541 gatcaaagcc cgcgtcctcc ggagccggcg gagtatcgac gggtgcagcg tccggggccg 2724601 gcggcaggtt cgggtcaaag cccacggcat ccgggccggc tgcggcgttt gggtccaagc 2724661 ccgcgtcgtc ggcattggcg ataccggctg gtgacgtggt caccaacgtc ccggcaatcg 2724721 cggcggcgat gagcgtcgta cgggcgttct tcaacgttgt tcctttcgcg gtgcgcgcgc 2724781 gccaaagcca acccacgggc atgggttagc tgccaggtgc attcgagggt gctgcgtggg 2724841 acgtgccgtc tcggtccggc acggcagcgg agcgcttgat ctgcccggcg cggctgctag 2724901 ccgccgcctg cgggtcggcc aaccgattca gccgtccccg gctccgctcg cacgcgggtc 2724961 cgtagattca attgttgaga tttcttgctg cccgtctgcc gggccaaggg gaccgtacga 2725021 tgacgatttg gattcgtcat ctccggcaaa ccgagatatc agttcaatca caagccgatc 2725081 acggcgcggt ggcacaattg ttgttgcagg tcagaagtgc ggttttggct cagctgtatc 2725141 tttgcgaccg cggcgctatc gtgagccgaa tcacgcaaat attgtgaccc cggacacgga 2725201 tttgtcacca tcgtggccct ggtccgggat ctgatccaca cgccgtggtg acctgcgcca 2725261 caacgacttg cacaccccga cgtccaccac acctcgaatc agctagactg ctcccaataa 2725321 tccgccctaa tactaagtgc cgcactgtga ttcataggta acctggggca ccaccaaata 2725381 gcagtctgcc gtaacagccg gatcctctac cgtcagcaga ctcaaatgtc ctccacccca 2725441 acgcaatacg tgatcaaccg cgcaccagag acgccaactg tagtcaaggc agtactagaa 2725501 gcggcggcca tggccaatgt taataacgtc ttcattgaaa acaagacgag aatatctcga 2725561 aaggccacca gaaaattaat atggaatagt attagcgtcc gggctgcatc ggtgcgtgca 2725621 agcctgcggc cgaattgacg ttggtcagcg gtcgggaatc cgccatcacg atccgcagtg 2725681 catccgaagc gtcgaccagg gcgctcatct ttcgctcgcc ggcaccgacc aacgtgtcga 2725741 cgcggtcggc cagatccgtc cgatacaccg cggccaggta gtggttacgg ccatcccagg 2725801 gcaacaccac ttcggcatcg gtctgcaccg cgcggcgcgc gagatcctcg atcaattcca 2725861 ctgtcagata aggcatgtcg accgcacaga caaacgcgag ccggacaccg gcctccgcag 2725921 ccgcacgcaa cccgcgaccg gtcgccggca gcggccccag ccccggcagc tcatcacgca 2725981 gaacggggac cggcagcgtg ggcaacggtt gtcccggagc ggccatcacg aaaaccggcg 2726041 cgcagcgctg gccgagaatg ccgaccatat gctccaccag cgtggtggtt cccccgggga 2726101 ggggcagggt ggctttgtcg cgacccattc ggcgggattc acctcccgcg agaacaaccc 2726161 cggccagcgg cactgtgtcg ggcgcgagct cagccacgtc agtcgacggt ccaagtgtcg 2726221 cgcccgtgca acagtgactg cagtgcagcg gtgccggacg gggcggcgtt tcgggccgcg 2726281 actacctgcg agcgggccgc atcatcgtag gttggccggc tgatgtgccg aaagattccc 2726341 agcacggtgt ggtccaggtt ctgatcggac agccgggaca gcgcgaaggc gtaggccggg 2726401 tcgtcgacct gcgcatcgtg cacaatgatc tcgtcgatgg ccacatcggc cgtcttggcc 2726461 acttcgaggc cgaatccgga cttgaccacg cagtattcgc cgttggcccc gaagacgatc 2726521 ggctcgccgt ggcggacctt gatgacccgc tcctcggcgc cctccttgcg cagcgcatcg 2726581 aacgagccgt cgttgaagat cgggcagtcc tgcaggattt cgaccagggc agcaccgcga 2726641 tgctgggccg cggcacgcag cacttcggtc agcccgttac ggtctgagtc cagcgcgcgg 2726701 ccaacgaacg tcgcctctgc ccccagcgcc aacgacaccg gattgaacgg gtgatccagc 2726761 gagcccatcg gtgtcgactt ggtgaccttg ccgacctccg atgtcggcga atactgtcct 2726821 ttggtcagcc catagatccg gttgttgaac agcagaatcg tcacgttgat gttgcggcgc 2726881 agcgcgtgga tcaggtggtt accgccgatc gacaaggcgt caccgtcgcc ggtgaccacc 2726941 cataccgaca gatcctcgcg agccagcgcc agaccggtcg ctatcgcggg cgcgcggccg 2727001 tgaatcgaat gaaagccgta ggtttccagg taatagggga accggctgga gcatccgata 2727061 ccgctgatga acacgatgtt ctcacgccgc agccccagtt cgggcaggaa gtttcggatg 2727121 gtgttgagga tgacgtagtc gccgcagccc gggcaccagc gcacctcctg gtcactggtg 2727181 aaatccttgc ccttctgcgg ctgatccgtg gtgggcaccc cagcgttctt ggtcaagctc 2727241 ggagtcaggc cgagctcggt gcccgccaaa tcaccggtca cgccggtcat gagctgtgct 2727301 tcatcgccgg agcgggtcat ccgtttgctc ccgctcccgc cgtggccgcc gacaatctgg 2727361 cgaccaacgt cttgtcttgc tcaagctcgg ccaatctccc ggcaagtgcg gcccggataa 2727421 agcgcccaat ctcgtcggcc aggaacgaga cacccttaac cttggtgacc gattgcacgt 2727481 cgaccaggta cttaccgcgc agcacctggg ccagctggcc caagttcaac tccggagcca 2727541 ccaccttggg gtaacgccgc agcacctcac ccaaattggc cgggaacggg ttgagatagc 2727601 gcagatgggc gtgcgctacc ttggtgcctc ggcgacgcgc gcgccggcac gcttcaccga 2727661 ttgggccgta ggagctgccc cacccgatca acaacagctc ggcgtccccg gtcggatcat 2727721 cgacttccag atcgggaaca tggataccgt cgatcttggc ttggcgcaac cggaccatga 2727781 ggtcatgatt agtcggctcg taggagatgt cgcccgagcc attggcagct tccagcccgc 2727841 cgatgcggtg ttccagaccg ggggtgcccg gaatggcgaa ctggcgggca agggtttccc 2727901 ggtcacgggc ataaggctgg aagggctcgc cgggtttggc gaaggtgtgc ttaatgggcg 2727961 gtagcgcatt gacatccggg attcgccatg gctccgagcc gttggcgatg gcgccgtcgg 2728021 acaacaagat caccggggtg tggtaggaca ccgcgatgcg caccgcctca agggcggttt 2728081 caaagcagtc ggcaggagag cgcggcgcca gcaccgccac cggtgactcg ccattgcggc 2728141 cgtagagcgc ctgcagcaag tcggcctgct cggtcttggt gggtagaccg gtcgacggcc 2728201 cgccccgctg cacgtctatg accagcaacg gcagttcggt catcacaccc agtcccagcg 2728261 cttcggactt cagcgaaatt cccggtcccg atgtgctggt gactcccaac gcaccaccgt 2728321 aggcggcacc cagcgcagcg cagatgccgc cgatctcgtc ttcggcctgg aaggtgacga 2728381 cattgaagtt cttgtgcttg gacagttcgt gcaggatgtc cgacgccgga gtaatcggat 2728441 aactgccgag cacgaccgga aggccggcga gctgaccggc caccacgatc ccgtaggcca 2728501 gcgcggtatt gcccgagatc tgccggtact cgccgggcgg caaagtcgcg ggcggtatct 2728561 cataggtcgt gccgaaggcc tcggtggttt cgccgtagtt ccagccggcc ttgagggcca 2728621 acacgttggc ctcggcgatt tcgggcttgc gggcgaactt ctccctgatg aaggcctcgc 2728681 tgtgctcgag ctcgcgcccg tacatccacg acagcagacc cagcgcaaac atatttttgg 2728741 cgcgctggcc atccttcttg gacgcgccga tcgcctcgac ggcacccagg gtcagtgtgg 2728801 tcatggcgac ggtgtgcacc acatagtcgg acagctcgcc ggactccagc gggtttgtca 2728861 cgtagcccac tttcgtcagg ttgcgcttgg tgaactcgtc agagttcacg atcaccattc 2728921 cgccaagcgg taggtcgccg atattggcct tcaacgctgc cgggttcatg gcgacgagca 2728981 cgtcgggacg gtcaccggcg gtcaggatgt cgtaatcggc tatctgaatc tgaaaagacg 2729041 acactccggg caacgtgccc gccggtgccc ggatctctgc ggggtagttc ggctgggtcg 2729101 ccagatcgtt gccgaaaagc gctgcctccg aggtgaatcg gtcgccggtt agctgcatgc 2729161 cgtcgccgga gtctccagcg aaccggatca ccacattttc caagcgttgc cgatcaggcg 2729221 cggcatgaaa tgccgcgtca tgagactctg gcccggcccc gctgccgttc ggatccacgt 2729281 ctccgccttc catgtgttat cggacaggca ctccgcgctg cagcttcagg ttacgcgtcg 2729341 tcggagcgac acccccgcgc cgcacggctt gtgtcactgg cggtagcgat tatgacattt 2729401 catttcgggt gtaaggcggt ctccgatgcc atatatgcgg ccggtaaccg accaaaaggc 2729461 gaagtcagcg agggctggcg gtagcgacga cacagaactg tggtattggt cacttccccc 2729521 cgagggttgg ccgcgaccgc acccggacat ccgaatccac ggtttccggc atcgcgacca 2729581 ggtacaggag gaagccggcc ccggcgagcg cacccagcga catgaatgcc gcgtcatagc 2729641 ccgcgacgac cacgatccag ccggcaacaa gattagacag cgcggcacca atgcccgttg 2729701 ccgtggttac cgccccgagg ctgatattga aatgtcccgt tccgtgtgtg acgtcctgta 2729761 cgacaagggg aaacaacgcc ccgaaaatgc cggctccgat accgtcgagc aactgcacgc 2729821 ccaccagcca gtaggagtta tccgacaacg tgtagaggaa cccgcgagcg gtcaagacag 2729881 cgaaccccac caaaaagatc ggctttcgcc cccacgcgtc ggccctggtc ccgaccacat 2729941 acgccaccgg caccatcacg acctgcgccg cgacgatgca cgacgacatc agcgccgttc 2730001 cttcgtctcg attgtgcaac gccaacagct cgccgaccag cggcagcatc gccgcgttgg 2730061 cgaagtggaa cgcgacaacc gccgccccga agatcaccag ttcgcggttg tgcgccaaca 2730121 cggtgaaccg cgacggctgc ggatgcggct cgccgggcgc atggtccata ccacgcgcta 2730181 aatcgtggtc gaccgcgtcc ggcgggatcc gcagtgtcgc cagcacgctg atcaacgcca 2730241 tgccggccag cacccagaac accaccaccg gcccgaagaa gtacgccagc gcgccggtcg 2730301 ccccagccgc cgacgcgtta ccggcgtggt tgaacgcttc gttacgccca atccgtctgg 2730361 cgaaaaactg aggaccgaca gcacccaacg tgatcgccgc caacgccgga gcgaaaaccg 2730421 agctggcgat cccggtgacg gcctgcagca ccgagatgga atacaagccc gcaaacagcg 2730481 gcatcgccac tgcggcggcg gtgaccagca ccgcgccggc gacgaccagc gcccgcttgg 2730541 ccgtggtccg gtccaccagg gcgccaatcg gcgtctgggc cacgatggcc gcaatgccgc 2730601 cgaccgccat gacgaacccg atcgaggctt gatcccaatc gtggatcaac aggaggtata 2730661 tcgacagata ggggcccaga ccgtcgcgaa catcagccaa cgagaaattc agcaggtcca 2730721 gcgcacgcgc cacccgtggc ggcactgcca caacggtgcc cgacatgcag tcgtcgcggg 2730781 gctacgcgct cttgtcgcgg cgctccgaac gggacggctt gcgtggcacg attgtcggca 2730841 acacgttgtc ctgcacggtc tccttggtga ccaccacttt ggcgacatcg tcgcggctcg 2730901 ggatgtcgta catcaccggc agcaggactt cttccatgat cgcccgcagg ccgcgggcac 2730961 cggtgccgcg atggatcgcc tggtcggcga tcgcttccag cgcatcgtcg gtgaactcca 2731021 actccacgcc atccatctcg aacagccgga tgtactgctt gaccaaagcg ttcttcggct 2731081 cggacaggat cttgaccaac gactctttgt ccaggttggt gaccgaggcg accaccggca 2731141 ggcggccgat gaattccggg atcaggccga acttgatcag atcctccggc atcacgtcgg 2731201 caaagtggtc ggtggtgtcg atctcggcct tggaacgaac ctcggcgcca aagccgaggc 2731261 cccgcttgcc gacgcgctcg taaatgatct tctccagccc ggcgaacgct cccgcgacga 2731321 tgaacagcac gttggtggtg tcgatctgga tgaactcttg atgcgggtgc ttacggcccc 2731381 cctgcggcgg aaccgacgcc tgagtgccct ccaggatttt cagcaaggcc tgctgaacgc 2731441 cctcaccgga gacgtcgcga gtaatcgacg ggttctcact cttgcgggcg atcttgtcga 2731501 cctcgtcgat gtagatgatg ccggtctcgg cgcgtttgac gtcgtagtcg gcggcctgaa 2731561 taagtttgag caagatgttc tcgacgtcct cgccgacgta accggcctcg gtcagcgcgg 2731621 tggcgtcggc gatggcaaac ggcacgttaa gcatcttggc cagcgtctgg gccaggtagg 2731681 tcttgccaca accggtgggt ccgagcatca agatgttcga cttggtcaac tcaacgggct 2731741 cacatcggga gtcacggccc ttctccccgg cctggatccg cttgtagtgg ttgtacaccg 2731801 ccacggccag cgtgcgtttg gcggtatctt gcccgatgac gtagccctcg aggaactccc 2731861 ggatctcggc cggcttgggc agctcgtcga gtttcacatc gtcggcgtcg gcgagttcct 2731921 cttcgatgat ctcgttacac aggtcgatgc actcatcgca gatgtacacg ccggggccgg 2731981 caatgagctt cttgacctgt ttttggctct tcccgcagaa cgagcacttc agcaggtcac 2732041 caccgtctcc tatgcgcgcc ataatgctga tggcctactt cctgatcgcc gttcgtgttg 2732101 ccgtgccccg tgtatgcccc gacgctaccc gcttgctccg gccccccgcg accgttagca 2732161 ccgaatagcg tcctagagat ttcagggtgt tcacgcctct cgtctgaatg aaacatatag 2732221 cccgactgcg ccccactcgc cgagacgcgc gatccgtgtc tctggcgtgt cgcggtcgta 2732281 accccaccga ggcccgcgcg tcgcggaccc ggcagcggcc cgaccgccag ctgaccaccc 2732341 tacagtggcg ttgtggaatt ggtcagcgat tccgtgctga tcagcgatgg cggcctggcc 2732401 accgagcttg aggcgcgcgg tcacgacctg tccgacccgt tgtggtcggc gcggctgctg 2732461 gtggacgctc cgcacgcgat caccgcggtg cataccgcgt actttcgcgc tggggcccag 2732521 attgccacga ctgccagcta ccaggcctcg ttcgagggct tcgcggcgcg cggcataggt 2732581 catgacgacg ccaccgtgct gctgcgccgc agcgtcgaac tcgcccaggc tgcgcgcgac 2732641 gaggtcggcg ttggcggtct atcggtcgca gcctcggtcg ggccatacgg cgccgcgctg 2732701 gctgacggat ccgaataccg tggatgctac ggcctgtccg tcgcagcctt gatgaagtgg 2732761 catctgccac ggctcgaggt gctagtcgat gccggcgctg acatgctcgc cctggaaacc 2732821 atccccgata tcgacgaagc cgaagcgctg gtcaacctgg tgcggcggtt ggctacgccg 2732881 gcctggctca gctacacgat caacgggacg cggactcgcg ccgggcaacc gctcaccgac 2732941 gcgtttgcgg tggccgcagg agttcccgag atcgtcgccg tcggcgtcaa ctgctgcgca 2733001 cccgacgacg tgttgccggc catcgctttc gccgtcgccc acacaggcaa accggtgatc 2733061 gtgtacccga acagcggtga gggttgggat ggtcggcgcc gcgcctgggt aggtccgcgg 2733121 cggttttccg gatcttccgg gcagcttgcg cgggaatggg ttgcggcggg cgcgcgcatc 2733181 gtgggcggat gctgccgagt acggccgatc gatattgccg aaatcgggcg agcgctgacc 2733241 accgcgccgc cccgaggctg aaagcgaaaa ttgcctctac tgcctcatcg aggcgttacc 2733301 tagggttagt tcttgtgacc gcgaagcccg gctacgcatg agtaagaacc gcattatggg 2733361 caaccaaccg gagaagtcag atgtgactgc ggcacccgac accgtggagg gcgattccca 2733421 cactgcaatg acaccgcgcc agcggctgac cgtgttggca acggggctgg gcatcttcat 2733481 ggtgttcgtg gacgtcaaca tcgtcaatgt cgcattgccc agcatccaaa aggtgtttca 2733541 cacgggcgaa caaggtctgc agtgggcggt cgccgggtac agcctgggca tagcggccgt 2733601 gctgatgagt tgcgccctgc tgggcgatcg ctacggtcgc aggcgcagtt ttgtgttcgg 2733661 ggtcacgctc ttcgtcgtga gctctattgt ctgtgtgcta ccggtcagcc tggcagtttt 2733721 cacggtcgca cgagtgatcc aaggtttagg agcggcgttc atctcagtgc tctcgctggc 2733781 cttgctaagc cactcctttc ccaatccccg aatgaaagca cgggcgatat ccaactggat 2733841 ggccataggc atggtcggtg cggcatctgc ccccgcgctg ggcgggctca tggtcgacgg 2733901 cctcggttgg cgcagcgtgt tcctggtgaa cgttccgctc ggtgccatcg tgtggctgct 2733961 gacgctagtc ggtgtcgacg agtcacagga tcccgagccc actcaactcg actgggtggg 2734021 acagctgacg cttatcccgg ccgtcgccct gatcgcatac accatcatcg aggctccccg 2734081 gttcgaccgg cagtccgccg ggttcgtggc ggcgttgctg ttagcggctg gggtactgct 2734141 gtggctgttt gttcgacacg aacaccgcgc cgctttcccg ttggtcgatc tcaaactgtt 2734201 cgccgagccg ttgtaccgat cggtgctgat cgtctacttc gtggtgatgt cctgcttttt 2734261 cgggactctg atggtgatca cccagcactt ccaaaatgtg cgcgacctat cgccgctgca 2734321 cgcgggtttg atgatgttgc cggtccccgc gggattcggg gtggcgagtc tgctggcggg 2734381 tagggcggtc aacaaatggg gtcctcagct cccggtgctg acgtgcctgg cggccatgtt 2734441 catcgggttg gcgattttcg cgatctcgat ggaccacgcg catccagtgg cccttgttgg 2734501 cctgacgatc tttggcgcgg gagccggcgg ctgcgccaca ccgctgttgc atcttggaat 2734561 gaccaaggtc gatgatggcc gtgccggcat ggccgccggg atgctcaatc tgcagcggtc 2734621 gctgggcggc attttcggcg tcgccttcct gggcaccatt gtcgcggcct ggttgggtgc 2734681 cgcgctgccg aacaccatgg ccgacgaaat tcccgatccc atcgctcgcg cgatcgttgt 2734741 cgacgtcatc gtggacagcg cgaatccgca tgcccacgcg gcatttatcg ggccaggaca 2734801 ccggataact gcggcgcagg aggatgagat cgtactggcc gccgacgcgg tcttcgtgag 2734861 cggaatcaag ctcgcgttgg gcggcgccgc cgtattgctg accggcgcgt tcgtccttgg 2734921 ttggacgcgc ttcccccgga cccccgccag ctaagtggtc tcgctcggtg cgcccccaca 2734981 gtccctgcgc cgagatcgac gttagcgtca cgccttatgg tgattttccg ctctggcgtg 2735041 gatctcggcg catgtcgggt ggcgaccacc aagccacgcc acggccgcac cacccgccca 2735101 tggctcaggc ggtttgcgcg gagagcttcc ggtactcgag caccgtgtcg atgatgccgt 2735161 agtccttagc ctcttccgcg gtcaagatct tgtcccggtc agtgtctttg cggatcactc 2735221 cggcgtcctt gccggtgtgg cgggccagcg tggtttccat cagggtgcgc atccgctcga 2735281 tctcggcggc ctggatctcc agatcggaga actgtccctg gatcacgccc gacaacgacg 2735341 gctgatggat caacacccgc gcattcggca gcgccatgcg cttgcccggt gttccggcgg 2735401 ccagcagcac cgcagccgcc gaggcggcct ggcccagaca caccgtctgg atatcggccc 2735461 gcacgtattg catggtgtcg tagatcgcca tcagcgaggt gaacccaccg cccggcgagt 2735521 tgatgtacat ggtgatatcg cggtcgggat ccaacgactc caacaccagc aactgtgcca 2735581 tgatgtcgtt cgccgacgcg tcgtcgacct ggacgccgag gaagatgatg cgttcctcga 2735641 acagcttgtt gtatggattg gactccttga ccccgaagct ggagtgctcg atgaacgacg 2735701 gcaggatgta gcgcgcctgg ggctggatct gagaattttg ggaattcact gtgcttctcc 2735761 attgacgtgg gcgcgggtga tgatgtgatc gacgaaaccg tattccaggg cttcggcggc 2735821 ggtgaaccag cggtcgcgat cggaatccgc ctcaatgcgc tcgatcggct ggccggtgaa 2735881 ttcggcgttg agccggaaca tttctttctt gatcacggcg aactgctcgg cctggatggc 2735941 gatatcggcc gcgctgccgg tcaccccgcc caacggctgg tgcatcagga tgcgagcatg 2736001 cggcagcgcg tagcgcttgc ccttggtacc tgccgccagc aggaactcgc ccatcgaggc 2736061 ggccatgccc atcgcgtagg tggcgatgtc acagggcgcc agcaccatgg tgtcgtagat 2736121 cgccatgccg gcgctgatcg atccacccgg cgaattgatg tagaggctga tgtccttgct 2736181 ggcgtcttcg gcggccagca gcagaatctg agcgcataac cggttggcga tctcgtcgtt 2736241 cacctccgag cccaggaaga tgatgcgctc ggagagcaag cgctcgtaga ccgaatccgt 2736301 gaggctaaga ccctgcgagt tcgaacgcat gtcagtcact tggctcacag tggggcacct 2736361 gctttcctcg agttcttcta tgctccgaca ctaaccaacc aggctggctg tttcgcggtc 2736421 acgcaccccc tgaaaccggc gcgttcgctt acagcgtcat acggtcacgt tgtcgcttcg 2736481 tcggacgccg cccgcgcggc accctcgtct gccggttcgg cctcctcagc ctcaccggcc 2736541 gacacacgct tgccgaagaa ctcactggta tcgatcgtgt ttccgtcact gtcggtgacc 2736601 gtcgccgcct ccactgcggc cctgatcgcc agctcgcgcc gcacgtcagc gaacatggtc 2736661 ggcagctggt tgcgctcttg gaggtagccg aacagctgct gcggctcgat gccgtattgc 2736721 cgagacgtcg tcaccagtcg ttcggtcaga tcatcctggc caacttggac ctgcagctca 2736781 tcggccaggg cgtctagcaa cagctgcctc ttgacgtcct tttctgaggc ggtgcgcgcc 2736841 tcggcatcga acgccgcgcg tgacgagcct tgctcgacga gcaactcatt gaaccgggct 2736901 tcgtcgtgat taagaccgct gagcgcgctg tgcagcacgc tgtcgaattg ggcctgcaca 2736961 tacgactccg gcaacggcac gtcgacctgt tcgagtagcg catcgatggt ggcgtttcga 2737021 atctgctcgg cctgctgggc gcgcttggcc tggcgcacct ggtcgctgag gctggcccgc 2737081 aattcgtcga tgctgtcgaa ctcgctggct aactgcgcga attcgtcgtc gggctctggt 2737141 agttcgcgct ccttaaccga cctgaccgtg acggtaacct gagcttcctg cccggcgtgc 2737201 tcgccggctg ccagcttggc ggtgaagacc cgggactcgt cggcggacag accaacaacc 2737261 gcgtcgtcga gacctgcgat gagccggccg gagccgacct cgtgggagag tccctcagcg 2737321 gctgcgttcg gtatgtcctc tccgtcgacc gtggcagaca agtcgatcga gacgacgtcg 2737381 ccgacggcca ccggccggtc caccgcggtc agggtgccga accgggtacg taacgactgc 2737441 agttcggcgt cgacgtcgtc ctcaccgatt tcgatcggat ccaccgagac cgtcagcgcg 2737501 ctcaggtccg ggagactgat cttcgggcgg atgtcgacct cggcggtgaa ttgcaggtcc 2737561 tggccgtact ccttcttggt cacctcgatg ttgggccggc cgagcggttg gacatccgac 2737621 tcggccaccg cctgtccgta ccggctgggc agcgcatcgt tgacgatttg atccagcatg 2737681 gcctcccggc cgatgcgggc ttcgagtagt ttggccggcg ccttcccggg ccggaagccg 2737741 ggcagccgca cctgtttggc cagctctttg taggcccgct ggaaatccgg ctcaagctcg 2737801 gcgaatggca cctccacgtt gatacgaacc cgggtggggc tcaactgctc gacggtgctc 2737861 ttcacgggtg tgctccttgg tagtcgataa cggcggtcgg ctggtcgggg tgacaggatt 2737921 tgaacctgcg gccttccgct cccaaagcgg atgcgctacc aagctgcgct acaccccgcg 2737981 ctgacctcgc gatcctacgg cccggcgaca ccggcaccgc aatgacctct tgagacctca 2738041 cgggaaggtc tcaaaacgac tccgattaga tttgatgtct gtcaccacgt acagtcgcgc 2738101 tcgactaaat acatgcgggc gtagctcaat ggtagagccc tagtcttcca aactagcgac 2738161 gcgggttcga ttcccgtcgc ccgctcgggc catgcgtttg ttcggcagaa aggcgccatg 2738221 cgcgacccat gaatcagcct gacatcaagg gctcgtgcgc gtcggagttc accaaggtac 2738281 gcgacgcgtt cgagcgcaac tttgtgctgc gcaacgaggt cggcgcggcc gtcgcggtgt 2738341 gggtcgacgg ggatcttgtc gtcaacctgt ggggcggctc cgccgacgcc ggcggtaccc 2738401 ggccctggca gcacgacacg ctggccaccg tgctgtccgg taccaaggca ctaacggcca 2738461 cgtgtgtgca tcagctcgtc gatcgcggtg agcttgacct gcatgcgccg gtggcacgct 2738521 actggcccga gttcggacag gcgggtaagc aggccatcac gctggcgatg gtgatgagcc 2738581 accgctccgg ggcgatcggg ccgcgcggac ggctgggctg ggagcaggtc gccgattggg 2738641 attttgtctg cgagcaactg gccgccgccg aaccgtggtg gcagccgggt gccgcgcagg 2738701 gctaccacat gaccaccttc ggtttcatcc tcggcgaagt gttccgccgc gtcacaggcc 2738761 gtacggtcgg tcaatacctg cgtaccgaga tcgctgagcc gctgggtgcg gacgtccaca 2738821 ttggcttgca tcccggcgaa cagctccgct gcgccgatct agttgataag ccgcacatcc 2738881 gccaattgct ggccgacgtc caagcccccg gctaccccac cagcctaaac gaacatccca 2738941 aggctgcatt gtcggtgtcg atgggcttcg cccccgacga cgaactcggc tccaacgacc 2739001 tgcagctgtg gcgtcagatc gaattccccg gcaccaacgg ccaggtgtct gcgctggggc 2739061 tggcgacgtt ctacaacggg cttgcccagg agaagctgct cagccgcgag cacatggagc 2739121 tggtccgggt ctcacagggc ggcttcgaca ccgatctggt gctcggcccg agggtcgccg 2739181 accatggctg gggtctgggc tacatgctca accagcgcgg cgtcaatgga cccaacccac 2739241 ggattttcgg gcatggtggc ctcggcggct cgtttgggtt cgtcgacctc gagcaccgga 2739301 tcggctacgc ctacgtgatg aaccgcttcg acgccaccaa ggccaacgcg gatccgcgca 2739361 gcgtcgtcct gtccaacgag gtctacgccg cgctcggggt aaaccgttcc tagacggcta 2739421 gccaccaggc ggtcaggtct gacagaccgg gcaccagaaa acattgcggc cctcgagcag 2739481 tgccgtgcgg atcactcccc cacacacccg acacggctcg ccggctcggc ggtacacata 2739541 ggtgcggggc cggtcgggca gatatgacgg cagaccatgg tcatgttcgg ggcgcaccac 2739601 gatgatcttg ccgcggcgca agcccacctt catcaacgac accagatcgt tccaggccgc 2739661 gtcgaattcc ggctcaccga tcccgcggcc gggccgctgt gggtcgatcc ggtgccgaaa 2739721 aagcaactca ttacggtaga cgttgccaac accggcgatc accgtttggt ccatcaagag 2739781 cgcgcctatg ggcctgcgag acttggtgat ccgagaccat gccgacgacg ggttggcgtc 2739841 gctacgcaac gggtcgggtc ccagcctggc aaccacgtcc gcaacctcgc cgtcgtcgat 2739901 cgactcacac accgtcgggc cgcgcaagtc ggtgccgaat tctgccccga ccatccgcat 2739961 ccgcacctgc cccgcgggtt cgggtagcca cccatctgtg gggcgtgccc attcggtgaa 2740021 ggtgccatag agcccgagat gcacgtgcac cacggggccg ccgacgtagt gatggaacag 2740081 gtgtttgccc caggcactgg cccgccgcaa cacccgaccg ttgagcgcgg aagccgaatc 2740141 ggcgaaccgg ccctgggggc tggacaccga gaccggcgca ccggcgaacc ggcgctggtg 2740201 cagccgggcc agccgatgca gcgtatgccc ctcaggcacg ggagtcaggc cggagcgccg 2740261 ggcaccggcg gcgcttcgtg ggtccgttcg tactcggcga gaatgtcgat acgccgttgg 2740321 tggcgttgcg ctttcgacca cggcgtggtg acgaaggcgt cgactatcgc cagtgcctcg 2740381 gccaccgtgt gcatgcggcc gccgatgccg atcaactggg cgttgttgtg ctcgcgagcc 2740441 agcgccgcgg tctgcacact ccaggccagc gcgcagcgag cgccgggcac cttgttggcg 2740501 gcgatctgct ccccgttgcc cgatccgccc agcacgatgc ccaggctgcc cggatcggcg 2740561 acagtgcgcg tcgctgcggc aatgcagaat gccgggtagt cgtcgtcggc gtcgtagcgc 2740621 aacgcgccgc agtcgatcgg ctcgtggccg gtttgcttca ggtgctcgat gatccgctgc 2740681 ttgagctcat atccggcgtg gtcggccccc aggtagacgc gcatgcccga cattgtgccc 2740741 gacacactgc cgggcgccgg cgcgggcgcc cgccgatagt gaattcggcg acaagaaccc 2740801 gggcgtgttc cggcgccgaa ttcactatcg gcggctagtc gaactgaggc ggctcggtgc 2740861 gggtccgctt gagctcaaaa aagtgcgggt aggaagcgaa ggtaaccgag gcatcccaga 2740921 gcttgccggc ttcctcgccg cgcggaatct tcgagagcac cggcccgaag aacgccacac 2740981 cattgacatg gatcgtcggc gtaccgacgt cctcgcccac cgcgtccatc ccggcgtggt 2741041 ggcttttgcg cagggcgttg tcgtaagcgt cgctggtagc ggccttggcc aactccgcgg 2741101 gcagaccggc gtccgccagc gactgggtga tgacctcgtc gagttcgtgg ttgccctggt 2741161 tgtgaatccg gttgcccatc gcggtgtaca gcgggtccag gactttcgcc ccatgggcct 2741221 gctcggcggc gatcgccacc cgtaccggtc cccatgccct cgccatgcct tcgcggtatt 2741281 gctcgggcag gtcgtcacgg ttttcgttga gtattgccag gctcatgacg tggaagttca 2741341 cctcgatgtc gcggaccttt gccacctcga ggatccagcg cgacgtgatc cagcaccacg 2741401 ggcacagcgg atcgaaccag aaatcggcga cagacttctg gggggccttc tcgagcatgg 2741461 cgcggtcctc tcgttggagt cagcagcggt gagtacaccg cccagcacaa ccacggccgc 2741521 cccgcacctg ttcccgccga cccggttaag ttggacgccg tggcccttcc aaacctcacg 2741581 cgggaccaag ccgtcgaacg cgccgccctg ataaccgtgg acagctacca gatcattctc 2741641 gatgtgaccg acggtaacgg cgctcccggc gaacgcacct tccggtcgac caccaccgtg 2741701 gtgttcgacg cactccccgg cgccgacacg gtcatcgaca tctccgccca caccgtgcgc 2741761 cgcgccagcc tcaacgacca agacctggac gtctcgggat atgacgaggc ggccgggatc 2741821 ccgttgcgcg gactggccca gcgcaacgtc gtcgtcgtcg acgccgactg ccactactcc 2741881 aataccggcg agggcctgca tcggtttgtc gatccggtgg acggcgagac ctacctgtac 2741941 tcgcaattcg aaaccgccga cgccaagcgc atgttcgcct gcttcgacca acccgacctc 2742001 aaggccacgt ttgacgtgcg ggtgaccgcg cccgcgcact ggaaggtgat ctccaacggc 2742061 gcgccgctgg ccgcggcaaa cggcgtacac accttcgcca ctaccccgcg gatgagcacc 2742121 tatctggtgg ccttgatcgc cggaccatac gcggcctgga cggacactta catcgacgac 2742181 cacggggaaa tcccactcgg catctattgc cgggcctcgc ttgccgaata catggacgcc 2742241 gagcggctgt tcacccaaac caagcaggga ttcggcttct accacaagca ctttggcctg 2742301 ccatacgcgt tcggcaagta cgaccagctc ttcgtccccg aattcaacgc cggcgcaatg 2742361 gaaaacgccg gcgcggtgac cttcttggag gactacgtct tccgcagcaa ggtcacccgg 2742421 gcatcctatg agcggcgcgc ggagaccgtg ctgcacgaga tggcccacat gtggttcggc 2742481 gacctggtca ccatgacctg gtgggacgat ctgtggctga acgagtcctt cgccaccttc 2742541 gcctcggtgc tgtgccaaag cgaggccacc gaattcaccg aggcttggac gacgtttgcg 2742601 accgtggaga agtcttgggc gtatcgccaa gaccagctgc cgtcgacgca cccgatcgcc 2742661 gccgacatcc ccgacctggc cgctgtcgag gtgaacttcg acgggatcac ctacgccaag 2742721 ggcgcctcgg tgctcaaaca gctcgttgcc tacgtcgggc tggagcgctt tctggccggc 2742781 ctgcgtgact acttccgcac gcacgctttt ggcaatgcca gctttgacga tctgctggcc 2742841 gcgttggaaa aggcctcggg ccgcgacctg tcgaattggg gcgagcagtg gctgaagacg 2742901 accgggctca acaccctgcg accagatttc gaggttgatg ccgagggcag gttcacccgg 2742961 ttcgcggtga cacagagcgg tgcggcaccc ggcgcaggtg agaccagggt gcatcggttg 2743021 gcggtgggca tctacgacga tgatggttcc aagagttccg gcaagctggt ccgggtgcac 2743081 cgcgaggaac tcgatgtctc cggtccgatc acgaacgtcc ctgcgctggt tggcgtttcg 2743141 cgcgggaaac tgattctggt caacgacgac gacctgacct actgttcgct gcggctggac 2743201 gagcggtcgc tacagaccgc gctagaccgc atcgccgaca tcgccgagcc gctgccgcgc 2743261 acgctggtgt ggtcggccgc ctgggaaatg acccgtgaag ccgaactgcg tgcccgcgac 2743321 ttcgtgtcac tggtgtccgg cggcgtgcac gcagaaacgg aggtcggggt cgcgcagcgg 2743381 ctgctgctac aggcgcagac agcgttgggt tgctatgccg agcccggctg ggcccgggag 2743441 cggggatggc cgcagttcgc cgaccggctg ctggagttgg cgcgcgaagc cgagcctggg 2743501 tcggatcatc agctggccta tatcaactcg ctgtgttcgt cggtgttgtc cccccggcat 2743561 gtgcagaccc taggggcgtt gctcgagggt gagcccgccg catgtggatt ggcaggctta 2743621 gccgtcgaca ccgacctgcg ctggcggatc gtaaccgcgc tggccaccgc gggcgccatc 2743681 gacgccgacg ggccggagac accgagaatc gacgccgagg tgcagcgcga cccgactgcc 2743741 gccggaaagc ggcatgccgc ccaggcccgc gcggcgcggc cacagttcgt cgtcaaggac 2743801 gaggcattca ccacggtggt cgaggacgac accctggcca acgccactgg ccgcgcgatg 2743861 atcgccggca ttgccgcacc cggacaaggc gagctgctca agccgttcgc gcgacgctac 2743921 tttcaggcga tccccggagt atgggcacgg cgatccagcg aagtcgcgca atcggtggtg 2743981 attggcctgt atccgcactg ggacatcagc gagcagggca tcaccgccgc cgaggagttc 2744041 ctcagcgacc ccgaggttcc gcccgcattg cgccggctgg tgctcgaggg ccaggccgcg 2744101 gtgcagcgat cgttgcgggc ccgcaacttc gacgctgacg gctagccctc accgcgaggg 2744161 cgcgtgtctg tacaacgaca cgccgcatcg ggcgtacatt cgggcgtgct cgccgggtca 2744221 gcccggcgcg atccccgcgc tgagcacgcg gatcgcgctg atcagcccat ctaccagctc 2744281 accctgctcg aacgctgagg aagcggcggc aaccccgagc ggagccgccg actcggcacc 2744341 gcggccgcgg acttgcgagc cgtagaccac ttcgatggcg cactggttgg gcgagaccgc 2744401 gagcagcaca gcattgtccg gcgtgggcac cttgcccaag atctcgcggg cccgcgcggc 2744461 ggtgtcacga cccaagtcgc cgaggtagat ggcgaacctc acctgacacg cccgcgagct 2744521 gtaggtcagc gcgtcgtcca gggcgacgag atctgcgatg gggaacgggt agtgcacgga 2744581 cagttccccg ggctcggtga cccccgagat ccgtccgctg gtggtcagca cccaacccgg 2744641 cggcagctcg gcgtgctcaa tcgtcgcaac gtcaccacgt gccactggcc ccacctccaa 2744701 ccgtgaactc cgatgcgtca tgcccgtgtc cgccgtgcgc gctgccaacg acctcgtcgg 2744761 tggcggccca caggatgggc gggtgtgtcc aaggctccga cagtttgtag gttgccgggt 2744821 gaggtccctt gcgcgaccag atcagcacag acagcacaac caccagcaac aacgggatac 2744881 cgacaaagaa gaggtggatc tccatagcac tcacgacgca aaccgtatcc caccgggttt 2744941 tcaggccgca ccctcaccga ggtatcgcgc ccaggacggg tccagctcct tgaccgccga 2745001 cagcagtcgc cagtgcggtc ccgtgggcgg cagcggcgcc cggcggagtg cccagccaag 2745061 ctcggtcaac agcctgtcac ccttgcggtg gttacacggc gagcagcacg caacgcagtt 2745121 ctcccaggag tgggcaccgc cccggctgcg gggtaccacg tggtcgacgg tgtcggcctt 2745181 gccgccgcag taggcacaac agaaccggtc ccgatgcatg agcgcggccc gggtcatcgg 2745241 aacccgggca cggtagggaa cccggacata ggagcgcaac tggatcaccg acgggaccag 2745301 gatcgatctg gtcgacgagt ggatgaccgg cccggacggg tcttcgtgca ccacgtcggc 2745361 cttgccacag atcaccatga caatcgcccg ccgcatcgac aacgcggtaa gcggctcgta 2745421 ggtggagttc aggagcagca cccgccggcg gttccagatc gatgcgctct cgtgacggtt 2745481 cggtggatgg gtctcgacgc ctgacgcgag tcggtgggag tggacaccgt gcaggcatga 2745541 agcgggcccg gttacgcctg ccgcgacacc ggaactgcgg tggccgcggc gcttcttgcc 2745601 gtgcgccata ggtcctccgc cgaacagtcc accatgattc gcggctaatc gcacgccaaa 2745661 tgccacgtcc acaccgtgtc gctccggtga acaaaccggg ggctggctgg tcggccacga 2745721 caaatagacc acaatggagg ggatggatca gatgccgaag tctttctacg acgcggtcgg 2745781 cggcgccaaa accttcgacg cgatcgtgtc gcgtttctat gcgcaggtcg ccgaggacga 2745841 agtactgcgg cgggtgtacc ccgaagatga cttagccggc gccgaggaac gattgcggat 2745901 gttcctcgag cagtactggg gcggcccacg aacctactcg gagcagcgcg gccacccccg 2745961 attgcggatg cggcatgccc cgtttcggat ctcgctcatc gaacgcgacg cctggctgcg 2746021 gtgcatgcat acggctgtgg cctccatcga ctcagaaacg ctcgatgacg agcaccgtcg 2746081 agagttgctg gattatctgg agatggccgc tcactcgctg gtcaactccc cgttttgatg 2746141 gaccaacacc agcgaccgga tccaatgggc cccggctctc ctcgcgccag cgctcgtcga 2746201 ccggagccag atccgatggg cgagccgtgg tggtcgcgag ccgtgttcta ccaggtctat 2746261 ccccgatcgt tcgccgacag caacggcgac ggggtgggcg acctggacgg gttggcgagc 2746321 cggcttgacc acctgcaaca gctcggtgtc gacgcgatct ggatcaaccc ggtcaccgtc 2746381 tcgccgatgg cagaccacgg atacgacgtc gccgatcccc gcgacatcga cccactcttc 2746441 ggcgggatgc cggcgttcga acggttggtc gctgcggcac accggcaggg catcaaagtc 2746501 accatggacg tggtgcccaa ccacaccagt tcggcgcacc catggtttca ggccgcgctg 2746561 gctgacctcc cgggtagccc ggcgcgggat cgctatttct ttcgcgacgg gcggggcccc 2746621 gacgggtcgc tgccgccgaa caactgggag tcggtgttcg gcgggccggc ctggacccga 2746681 gtgcgcgaac cggacggcaa cccgggccag tggtacctgc accttttcga caccgaacag 2746741 ccggacctga actcggacaa cccggaaatc cttgacgact tcgagaaaac actgcgcttc 2746801 tggctggacc gcggcgtgga tggcttccgc atcgacgtgg cgcacggcat ggccaagccc 2746861 ccgggcctgc cggactcacc ggacctgggc atcgaggtgc tgcaccaccg cgatgacgac 2746921 ccgcgcttca accacccgaa tgtgcacgcg attcaccgcg acatccgcac ggtgatcgac 2746981 gagtaccccg gagcggtaac cgtcggcgag gtgtgggtac acgacaacgc ccgctgggcg 2747041 gagtatctgc ggcccgacga actgcatctc ggcttcaatt tccggctggc gcgaaccgag 2747101 ttcgacgccg ccgagatccg cgacgcggtg gcgaactccc tggccgccgc ggcgctgcag 2747161 aacgcgaccc caacctggac gctggccaat cacgatgtgg gacgggaggt tagccgctac 2747221 ggcggcggcg agatcgggct gcgccgggcc aaggcgatgg cggtggtgat gctcgccctg 2747281 ccgggcgtgg tcttcctcta caacggccag gaactgggtt tgcccgacgt ggacctgccc 2747341 gacgaggtgc tgcaggatcc gacgtgggaa cgctcgggac gcaccgaacg cggtcgcgat 2747401 ggctgccggg tgccgattcc ctggtcgggc aacattcccc cgttcgggtt ctcgacgtgt 2747461 ccagacacct ggttgccgat gccgccggaa tgggcggcgc tgaccgccga aaaacaacgc 2747521 gctgatgccg gctcgacctt gtcgtttttt cgacttgcac tcagattacg tagggaacga 2747581 aatgaattcg acggcgacgt cgactggctg gccgcgcccg acgatgcgct gatattccgg 2747641 cgtcacggcg ggggtttggt gtgcgcgctc aacgccgctg agcgtccgct ggcgctgccg 2747701 gcaggtgaac ccatcctggc cagcgcaccg ttgaccgacg ccacgttgcc acccaatgcc 2747761 gcggcctggc tggtgtagcg gcattccgag ctatgcttgc ccgacatata agcgcatacg 2747821 catcctaggc gggcaccgtc taggtatgat gatgcggatc gccgtgcggc tacccgggga 2747881 agtcatcacc ttcgtcgata gcgaggtcag ccaaatccgc atacccagcc ggcgcgccgc 2747941 agtggtgttg cgtgcctcga acgcgagcga cgccgcgatt cttaccgcca ccgaacccaa 2748001 tcaccacctc gacgcactcg ccggacaggc cgcaaagcta gcaccaacat cgattgatgc 2748061 ggctcatcca gctcgcccag ctagacgaga cccgtgcctt tacccgcgaa ctggccaggc 2748121 cttacctcgc accgggtaac cgtggcaccc acctcgagca gcgtagccag cgaactgctc 2748181 atgccctggc cgagcgctgc cgctagcggt gtggtcggct ggcgcaccac cgcgaccgcc 2748241 agtcagcgat accatcggcc gatgtcggat actccgttcg ccgagcccta tcccgagcag 2748301 cggcccccct ggggtgtccc gccaccaggt tgggacggat cgtcgcggcc agcgccctcg 2748361 acgactcctc gatcgcccgg gcggtggtct ctagtggcgg ccctagccct tgcggtcgtc 2748421 tcattaggcg tgggcatcgt cggatggttt catcggcaac cgcacgacaa gccatcaccg 2748481 gccccatccg cgccgacgtt caccagccaa cagatttccg acgcgaaaga aaacgtctgc 2748541 gccgcacacc ggatcgtgcg ccaggcggcc gtgctgaata ccaatcaggc caacccggta 2748601 cccggagacc cgaccggcga tttggcggtg gcagccaacg cccgcctggc gctgtatagc 2748661 ggcggcgact acctgctgag gcgtctcacc gccgagccag cgactcctgc cgagttgcgc 2748721 gatgccgtcc gctcgctcgc caacgctcta caagagcttg cagtgaacta tctcgctgga 2748781 gctcccgatt ccgtggtaac tcccctgcgg ctggcgctgg aaagggacac cagagccgtg 2748841 gatccgctat gcgtgtgacg gcgatccgga aatgaaccat cctcgcccat cagcgcagca 2748901 ccagcgccgc gtggccgcga tgccgataaa ccgagccgaa gcgggcgtcc agacgcaacc 2748961 acgccggcga tatccggacc cggatcagct cgtcggcgct gatcgtttct gcggactggg 2749021 gaaggaaacc cattgcggtc aaagcgaaca cacagcgcat gggtagccca acgacgacat 2749081 cggccgagct gacctggatg acctcctgat cgagcagcga taccggcgga ccagccgaac 2749141 tcccgtgctc cttggccagc cgcgcgccac ggtgcgccag gtccaacatc acccgggccg 2749201 gtacgtcgtc gagataggtg aagccggact ccggcggcaa cccaccccgc cacgcggagt 2749261 ccatcgagta accgggatcg acatagcccg aggcatccgt tgtggccaga ccgtgcgcga 2749321 gtgaccgtgc ggccaccgac agatcgtcgg gtcgcacctt gccggccacc acccgactgg 2749381 ccagcacgtc gaagcccgtt gctacccaag ccgatagcaa tccggtagac cgcgcgcgaa 2749441 tacggataac ggcgacatcg tcgagccgaa gcgcgtgatc cacgaacgtg gccagatccg 2749501 cgcggtgagc cgggtcaggg agccacaacc cacgctcaac caccccgcct atccccgaaa 2749561 ccaccgttgc aggtactcgc gatggtgtgg cgatagtcga accaaccgct gttcctcgat 2749621 atggaacgcg gccagctgcg actcggcgat gaccgcaggc ctcgagtctg gctccgcgtt 2749681 gaccgaccgc acctcgtacc cgagcgtgaa gtcgaccgcc cgcagccgct tggtccagat 2749741 cgtcacctgt agcggcgagt cggacaaccg cagttgaccc ttgtaggtca cccggacatc 2749801 ggcgatcagc agcccggtgg acgtgatgtc ggctccgaaa gcatccttaa gaaacgggac 2749861 ccgtgcctct tcgagaatcg tgaccatggt ggcgtggttg acgtgctgat acatgtcgat 2749921 gtcagaccag cgcaccccca ccggcgtgac gaacccgacg ctcaccccga gattcctcgc 2749981 ccgcttgtcc gtgtcatgcg gcggatctgc cgcgcggcga ccgacaacgt cgccagatcc 2750041 ttctggccgc tggcacggat gtcgtcgagt gtccgacgtg cccgcgccac ccgggaggcg 2750101 ctgaggtgtt cccactcggc gatcttttgc tcgctactct cgccgggttc ccccacggcc 2750161 agcacgtcga aacacaacga ccgtagcgca ccgtaaatat cgtcgcgaat cgccaagcgc 2750221 gccaacgaat gccagcggtc gtgtcggggc agctgggata ccgcggtcag caggccatcg 2750281 gtgcccagcc ggtccatcag ggcgaaatag gtgtcagcga cctcggcggc gtcgatgtcg 2750341 gcgatgtcgg cgatgtcgat gatgtcgagc aggctgtacc ggtacaggcc ggtcgagaca 2750401 cggtaggcca agtcttcagg cacaccctgc gatgcgaatt ccgcagctgt cttttcgacg 2750461 atggccttgt catcaccacg caaccactcc gacatgcgcg gtgtcagtgc cttgaccatg 2750521 gccgcgaatc ggttgatctc ggcgccgacg gccaagggct gcggacggta gttgagcagc 2750581 cagcgtccgg cacggtcgat cagccgacgg gtgtccagcg tcaacctgtc tgacagcgcg 2750641 attggcaggt tcgccgcacg gatccggcgc caaatgtgac cgacaccgaa gatggcatcg 2750701 gtggcgacat aggtgcgcac ggcatcgatc ggcgtgacac caacgtcttc ggcgatccgg 2750761 aacgcatagg tgatgccggc ggtatccacc agatcgttga tcagcatggt ggtgacgatc 2750821 tcgcggcgca gctggtggga acggatctcc ggggtgaacc gttcgcgcag cgccgtcggg 2750881 aaataacgag gcaacctgga agcgaagaca tcctgatccg gtagttcggt ggctagcacc 2750941 tcctctttga gccccagctt gacgtgcgcc atcagcgtgg cgagttcggg cgaggtgagc 2751001 ccgatgccgg cctcggagcg ccgggcaatc tccttctccg acggcagcgc ttccaattcg 2751061 cggttgaccc cgcgctcagc caccaaatac ttgatctgca ttgcgtgcac cggcagcagg 2751121 ctggccgcgt tggcgcgact ggtgcccatc aagtcgttct gatcttcgtt gtcggcgagc 2751181 accagttgcg ctacctcgtc ggtcattgac tcgagcagct gtgtgcgttc gtcggctttg 2751241 accgtgccgg cgctcaccag cgagtcgatc aggatcttga tgttgacctc gtggtccgag 2751301 cagtccacgc cggcggagtt gtccagcgcg tcggtgttga tccggccgcc ggacagatcg 2751361 aattcgacac ggcccaacgc cgtcactccg agattgccac cttcgccaat gaccttggcg 2751421 cgcacttgat tcgcgttgac tcgcaccgga tcgttggcgc gatcgccgac atcagcatcc 2751481 gactctgact cggccttgat gtaagtgccg atgccgccgt tgaacagcag gtccaccggc 2751541 gcccgcagaa tcgcccgaat aaggttgggc ggggccatct cggcggcccc cccgtcaact 2751601 gagccgtcga tgccgaggac ggcgcggacc tgcgcgctga gcgggatggc tttctgttcg 2751661 cggctgtaca ccccgccgcc ctcgctgatc agagacctgt catagtcgcc ccagctggac 2751721 cggggcaact cgaacatccg ccggcgttcg gcccacgaca ccgcggcatc ggggttgggg 2751781 tcgaggaaga tgtggcggtg gtcgaaggcg gcgatcagcc ggatgtgctt gctcagcaac 2751841 atgccgttgc cgaatacgtc gccgctcatg tcgccgattc ccacgacggt gaaatcctgg 2751901 gtctgggtgt cgatcccgat ctctcggaaa tgccgtttta cggcctccca ggccccccgg 2751961 gcggtgatgc ccatggcctt gtggtcgtag cccaccgatc cgcccgaggc gaacgcgtcg 2752021 cccagccaga acccatagga cttggcgaca tcgttggcga tatcggaaaa ggtggcagta 2752081 cctttgtcgg cggccactac caagtaggcg tcgtcgccgt cacgtcgcac cacctcgggc 2752141 ggggggttga cgcttgcggt cgcatgatcg acgttgtcgg tgacatcgag caacccggag 2752201 atgaacagct gatagcaggc gaccccttcg gcgcgggtgg cgtcgcggtc ggcggcgggg 2752261 tcgccggtgg gcagcggggg acgcttgacc acgaacccgc ccttggcccc gaccggcacg 2752321 atgacggcgt tcttcaccgc ttgcgccttg accaatccga gaatctcggt tcggaaatcg 2752381 tcacggcggt ccgaccagcg caacccgcca cgcgcaactg ggccgaacct cagatgcacg 2752441 ccttcgacgc ggggcgaata cacaaaaatc tcgtaccggg gacgcggcag cggaagttcg 2752501 tcgatcaact gggcattgag tttcagcgcc aatacatcac ggcagcgggc cgaaccctgg 2752561 cgtgtcacaa agtaattggt gcgcaacgtg gcctgaacca acgacgcgaa ggcgcgcagg 2752621 atccggtcgg tgtccaggct caccagcgcg tcgatgtccg cggcgacagc ggcagcggcc 2752681 gcttgggcat cgcgattgct cgccgacccc gacggcaccg gaacgaaaag cgcttcgaac 2752741 agatcgacca aagaccgaac ggtagcaggg tgctcgttga gcaccgattc aatgtaggac 2752801 tggctgtacg ggaagcccgc ctggcgcagg tacttcgcgt aggcacggag cagcacgacc 2752861 tgctgccaag tcagcccggc acgcatcacc agctcgttga atcggtcgat ttcgacccgg 2752921 ccgtgccaga tcgcggtcac cgcctcggcg aatcggtgcg cggtcgcggc ccgctcggca 2752981 accgtcgggg ccaacgggat cgtgggatgc ggcgagatct tgaactgata gatccagacc 2753041 ggcagaccgt ccggccgggt gacggagaac ggtcgctctt cgagcaccac gactcccatg 2753101 ctttgcagca tcggcagcag ctggctcagc gaagcggtgc gcccaccgag gaaccaggtc 2753161 aactgggcga caccctgctc gtcgcgttcg gaaaacacca gcttgaccga atcgtcggtc 2753221 agctccgtga tgaccgcaat gtcgccaatg gcatcggccg gggtgacggc ctgtttgtag 2753281 gcctcggaga aggcggcagc gtaatgcata gcgtcggcct gtccgacgga gccagccgcc 2753341 gccgcgccga tcaaacggtc ggcccaggtt cgcgcggctt cggtcagcag accctggatc 2753401 cggatccggt tggcttcgga aacgtccacc ggcggggcgg ccgccccttc tcctgccaca 2753461 cccacttcgg gtagccgcac catgaaatgc atgagtgccc aaggtgattc actgacccga 2753521 gcggtgaact ccagtcgtgt tcccccgaac tcgcggacaa ggatgtcctc gaattgcatg 2753581 cgcacggcgg tggtgtagcg atctcggggc atgtagacca ggcacgacac gaagtactgc 2753641 aaccgatccg cgcgcaggaa caacaacgcc tgccgttgcg atcccaagtc caccacggcc 2753701 ctggccatgg tcagcaggcg ctgcgcgctc agggtgaaca gctccggtcg cgggacggtc 2753761 tggatgacgt cgagcagcaa ttggcctggg tggctgggat cgctttcggc catcgccagc 2753821 gcctcgcgga cccggcgcga gatcgtcggg atctccagca cgtccgcatt catggccgcg 2753881 acgctgaaga gcccgacgaa gcggtgctcg accacgctgc cgtcgacgta ttcgcggacc 2753941 gcgatggcat agggataggc gccgtaacgc aggtagctgc cgacccgcgc ttgggccaac 2754001 accagcagtt tgtcgtcgtc ggtcagccgg ggacgcgaac cggtgcggcc ccgcaggacg 2754061 cccataccgc ttgacccctc gccgtagacc atcccgtcag ccacccggca ccgttggtag 2754121 cccagcagca ggaagttccc gtcacccagc caacgcaaca gttccccgac gtcttgtcgg 2754181 tcgggcgcgg aaaatcggcc gccggcattg gattcgactt ctcccgccag ctcgctcagg 2754241 gtggcgatca gcgctgtggc gtcggtggcc acccgctgga cgtcggccag caccttgggc 2754301 agcaaccgct ccacctcggc gaggcctttg tgatcaacgg cgggcgagag cgctacgtgc 2754361 atccaggcct cacccaggtg cggcgacgtg ccctcggcct tcggttcgat gcgcagcagc 2754421 tctcccgtgg ggctgcggtg cacgtcgaac accggggtca gaatcgccgc gtaggcgatt 2754481 ccaagccggt gcagcagcac cgtaacggaa tccatcagca tgccgccgtg ctcggcgacc 2754541 acctgcagcg ccggaccgaa ccccgcggga tcgtccgccc gatagacggc gacacagctt 2754601 tcaccggccg cgcggtgccg gccaagccga taatgtgcgc ccagcatggc gggcgtcagc 2754661 agggaggctg gaagccaact ggcctcggcg gccttggtgg cttccgacga gtcgtcgcgc 2754721 ggtcctcgat agctgtcgat gtaggccttc gagatccagt caggaatgtc cgcactcgcg 2754781 gtgaacgtgg tccacgcctc aacatcctgc ttagccccgg gatcgatcgt catgccgatt 2754841 gctcccaact cacgacgggt accgctcgat tcaattttcc cgctcctggg tgcggcgttc 2754901 cggacgcatc gtcacggggc gtgggcgaag ctaacattag ccgcgcgtca gcttgcggtg 2754961 ggtgacccta tgcggtcgag cggcgtcgac accgagccgt tccaccttgt tctcctcgta 2755021 ggcgccgaag ttgccctcga accagaacca cttcgcctcg ttgtcgtcgt caccctccca 2755081 cgccaggatg tgcgtgcacg tgcggtcaag aaaccagcga tcgtgcgaaa tcaccacggc 2755141 gcagccgggg aagttcagca gagcattctc cagcgaaccc agagtctcga catccaggtc 2755201 gttcgtcggt tcgtcgagca gaatcaggtt gccgccctgt ttgagcgtca acgcaaggtt 2755261 gagcctgttg cgctccccgc cggatagcac accggccggt ttttgctggt ccggtccctt 2755321 aaacccgaat gccgacacgt aggcccgtga cggcacttcg gtttgaccga cctggatata 2755381 gtccagaccg tccgagacaa cctcccagac ggtcttccgc ggatcgatgc cagcacgggc 2755441 ctggtccacg taactcagct tgacggtctc gccgaccttg acgctgccgc tgtccggcgt 2755501 ctcgagcccg acgatggttt tgaacagtgt ggtcttgcct accccgttgg gcccaatgac 2755561 gccgacgatg ccattgcggg gcaagctgaa cgacaggtcc ttgatcaggg cgcgcccgtc 2755621 gtagccctta tcgaggtggt cgacctcaac caccacgttg cctaggcggg gcccgaccgg 2755681 gatctgaatc tcctcgaagt cgagcttgcg ggtcttctcc gcctcggctg ccatctcctc 2755741 gtagcgctgc aggcgcgcct tgcttttggc ctggcgcgcc ttggccccgg accggaccca 2755801 agccaactcc tcggtcaacc gcttttgcag cttcgcgtcc ttgcggcctt gcaccgcgag 2755861 ccgctcggct tttttctcca gataggtcga gtagttgccc tcataggggt aggcgcggcc 2755921 acgatcgagc tccaggatcc attccgcgac gttgtccagg aagtaacggt cgtgggtgac 2755981 cgccaggatc gcaccggggt agctggccag atgctgttcg agccactgca cactttccgc 2756041 gtctaggtgg ttggtcggct cgtcgagcaa caacaggtcg ggtttggaca acagcagttt 2756101 gcacagcgcc acccggcgac gctcgccacc ggataggttg gttaccggct cgtcggccgg 2756161 cggacagcgc agcgcatcca tggcctgctc gagctgcgcg tcgaggtccc acgcgtcggc 2756221 gtggtccagt tcctcttgca gccgacccat ctcttccatc agctcgtcgg tgtagtcggt 2756281 ggccatcaat tcggcgacct cgttgaagcg gtcgagcttg atcttgatgt cccccatgcc 2756341 ctcttccaca ttgccgcgaa cggtcttgtc ctcgttcagc ggcggttcct gttgcaggat 2756401 gcccacggtg gcgccggtgg ccaggaaggc atcgccgttg ttcggcttgt ccaaaccggc 2756461 catgatccgc aagacgctcg acttaccggc cccgttgggg ccgacgacac cgatcttggc 2756521 gcccggatag aaactcaacg tcacgtcgtc gaggatcacc ttatcgccgt gcgccttgcg 2756581 gaccttcttc atcgtgtaga tgaactcagc catgccgcgg tgttgccttt ctggtccttc 2756641 gggttacctc gcgaaccatc ctaggcaccg ccggggcagc atcgaggcga cccctaagcc 2756701 gatatgggca gggggttgtg gccagtgatg gcgtcgtcga ccacgacatc ggaaaccgag 2756761 tcggctgccg acgctggggc gtcggcggca ccggccgccc cggtccccgt ggcggccggg 2756821 agatcaccgg cgcttggacc ggtgtaggcc ggcttttcga tgcgcacgat cacgcgcgac 2756881 aaatccggcc ctaccgacgt cgcccgcatc tccagcgacg agcgacgaat gccgtcccgg 2756941 tcctcatatt cactggtgta cacgtgtccc accacaatca ccggtgcgcc cttgcccaat 2757001 gctgcgccca ccccggtgac cagccttccc cagcaattga cggtgataaa cagcgagttg 2757061 ccgggctccc aaccgccgtc gctggtgcgc cggcgcgaat tgctggccac ccggaacttg 2757121 acgacctctt gatcaccgac tttgcggcgc tgcaaatcgt tgacgatgtg accgaccacg 2757181 gtcagtggcg tttcgaacat ttgctcattc ctttcctagt tgcgttggca cagttgcgtt 2757241 ggcaccgggt gattccgcga actgcccacg catatgccga gtgctattca cctcggccac 2757301 accgacattt gccgggatga gaccgtcgcc gcgcgcaatc ctgtgaatga agcggtaact 2757361 gtggattaac caattaattg gccgcttggc ctgcaaacct gggaaccaga ccgaaacctc 2757421 gctcagtatt cacaaaacgg tccaatgggg cagggtgacg gcgataacat cccaatgacc 2757481 gtgattcttc gaaccatggc gacgtacggg ccacgacaac ctgccatcga aggggcgacg 2757541 acaatgaaga caaggaaccc acggacgctg ctaacctggc tgctcggcgc gatagttact 2757601 gggttgtacg tggttttcgc tacgggctgc caattgcaag cgcccgcgcc tcccactccg 2757661 gaaataggtt ggtcgggccc gcaggctcca ctgccggcgc cggatgcggc gccaacgcac 2757721 ctcggcgtct agccgatcgc ggcggacaag tcgcccggca cccagggcga gcagggcttc 2757781 acgacagcta gtgagctata gacgacttcg tgttagcgcc gctggcgggg acgttggcgc 2757841 tgatggggat cgagttcctc agctgcccgt ggacaagaac cgcaccccgc agcgagtcgg 2757901 catggacact cgcgacgccg ccacgagtct ggcggtgtac gcccattgcg cgcgtcacgc 2757961 gcccactgac ccagttcact ggggtgccgt tcgccgtgct cgcggcggcg ctcacggcgc 2758021 tgcatctgac ggcatggcgc accgcattcg gtttttctga gcgctgggaa aatggccagc 2758081 cgtctggctc atggcgtcta cgcaacgcca cgcccccaac acgttcttag attcggtcgc 2758141 gtccttgacg cgctttgaac tcgcgggcga cgaactggtt gcgcgcgatc tgctcgacat 2758201 agtcgaaatc ccgcagaatg tttcgtaact cccgccggaa ggcgacccta cgttcggcga 2758261 ggtcggccgc cggcgctatc agctcctgat cgacggcgac ctggcgtgca gtggcgaaca 2758321 gcagcgtcga taccggttcg ctgctgcgga cccggccctg tgccacaaac tgacggccga 2758381 ggccgagcgc cagctccgtc aactcctcag gaccgatgtc aggcggagca tcgcgcaaca 2758441 cgtcggcaac gatctcatag gcttcgaaga agacccgcaa catcgcgtcc gacatcagcg 2758501 gccgtttggc atacagcatc gcgtcgatct cattgccccc gacgccaaga tgatcctccc 2758561 agtcttggtg ccaggccatc tcttgggcga tgttggcccg aaacgccgtg gaatccgcga 2758621 aatagaagtc gaacttcagc agatcccgca accgcatcgc ctgggcccag aacgcggcga 2758681 cgcggtcacc ttcggcgtgc ttggcatggg ccagcgcgag ctcgacgatc gaggtctcca 2758741 aaaacgcatg gatcaccgag ctccggtaga acgccgcggc gtgctcgtcg tcaggcgcta 2758801 tgtaccatac cggctcccgg ccactgtcga cccgagtgac cgggtggccg ttggacaacg 2758861 cgtccgccgc cgcacggacg ccttcgcgcg agcgcagtcg caatgcgctt gtcgaaaccg 2758921 gcgattgttt gcgttccaga tagtccagtg agtcctgcaa cgtgtggtgc agctggtcga 2758981 gcgtcaacgc ggtgccgcgg gtggtgagca gcagtgcgga caccaaaccc gtcgcggtca 2759041 ccggcgtcgc ctgcaaaatc ctccaggcca cctcgaacga catcttctgc aacgcaagcc 2759101 gtttcgcggc cggatcctgg gtcagctcgc cgtgcggtgc gccgaggtac tggcgcatcg 2759161 agaccgcttc ggggaagcga acgtagatct tgccgaagtt gcgttccccc tgcgccttga 2759221 tgaagttgta gagccagcgc aaaccttcgg gcgtcttctc cgcgccacgc gcgtaggcgg 2759281 cgtattcggt gatctcgtgc agctgatcga agcaaatcga aaccccctgc agcaggatgt 2759341 cgtcactgcg gccgtccagg taagcatcgg ccacgtagct catcaaaccg agcttgggcg 2759401 gcaacatctt tccggtgcgc gaccgggtgc cttcgatgga ccagctcagg ttgaaccgct 2759461 tctcgaccac gtagcccacg tactccttga gcacgtactt atacagtggg tcgttgccga 2759521 tattgcgccg gatgaagatc atccccgagc gccgcatgag gggtcccatg agaccgaacg 2759581 acaggttgat gccgccgaac atgtgcaccg gcggtaaccg gttgtcctgc atggccaccg 2759641 gtaccaccac gccgtcgatg taggaccggt gcgagaacag caggaccgcc ggatgagcct 2759701 ccagtgcggc gcgcatcgcc gcgacctgat actcgtcgta gtcgaattcc ggatcgaagc 2759761 cgcggctagc cagcctgccg aggacggaaa ccaggtctac cgacacctgg ctccatccgg 2759821 tggagagttc gtcgagcatc ttcccggcat cttcgaccgt ggcgcccgga atccggtcca 2759881 ggccggcacg aaatcgtgcg gacgccaaca tctccggctt caccagccgg ggagatttgt 2759941 attgcggtcc aaggatccga tattcggcgc gcgccagcgc caacagcgct cggcggctga 2760001 cgaactgggc gaaatcgcgc ttgtgctctg ccaccgtggt atcgcgccac tgctggcgca 2760061 gttcggacac cttggccgac tcgccggcca ccacccgcgc gcgcctggga tcggtacgca 2760121 ggatgcgacg ctgctgacgc tggctgggat ggtagggatc ccgacccggg agcagtgcgg 2760181 ccaccttgcc cgcccggctg cgatcggcgg gaggcagcca gatcacccga accggcacga 2760241 tagaacggtc ctcgccagat tgcgggctgg atgcgaagcc gggctcgagc tgctcgacca 2760301 gtgccgtcag cgccgccggc ggagcgttgc gcggtggcag cttcaatatg tcgaacttcg 2760361 agtccggatg gcgtgcacgc tgctggccca gccagcccat gatcagctcc atctcgaccg 2760421 gcgtcgccgt ggaagccagc accagtgtgt cctcggcagt aagcaccgcg ctggcatcgg 2760481 ccgccggttt ggtcacgacc gtcctttggc gctagagctt ggcgatgcgg aggcctcacc 2760541 atccttgcca gcgatcttag attcgctggg tttggccttc ggcgatgcct tctttgtagc 2760601 cgccttcgtc gcggccgcgc ctttattggc ggcgcttttg gcgggagcct tcttagccgg 2760661 cacccttttc gcggtggcct tggcgacctg agccctggct tttctggcgg ccttctgctc 2760721 ggcgtacaga tcgaccgcgg gcaacccatc gaccggccag tccgccagcg tgtccagata 2760781 cagctggcgc acctcggcga tacgatccgg cagggcgtcc agggtccagt catcgaccgg 2760841 aatcggcgga aacaccgcga cgtcgaccgt gcccggattg atcgtggtgg agttgcgcga 2760901 ggcgacgatc tccgcattgc ggatcacgat cggcacgatc gggatcttcg cggccatggc 2760961 gatacggaag ggccccttct tgaatgaccc gacttcggtg gtatccaacc gggtaccttc 2761021 gggagcgatc acgatcgata gtccattgcg ggcgcgctcc tcaaccgtgt gcagtgtctc 2761081 caccgcggcg accggatcat cacggtcgat gaacacaccg tccagcaact tccccagcgt 2761141 gcccatgatc gggtcgctcg ccagttcctt cttgcccacc ccaacccagt tgtcgcgcac 2761201 cagcgcaccg gcaatgaccg ggtcaacctg gttgcggtgg ttgaagataa agacggcggg 2761261 ccgctgggcg gtcagattct cttttccgat cacattcagg tgcacgccgc tggtcgccag 2761321 cagcagctga gagaaggtgg aggtaaagaa attcacgccg cggcgccggc taccggtcag 2761381 cacaccgatc cctaccgcgc cggccgcgac cgggacgatg gtgctcagac cggcaagtgt 2761441 ccgcaactgc cgccggatgc ccacaccgcc gcgactgttg aacttcaaga tcggccagcc 2761501 ccgtcgcttg gcgaccgcgg ccatctttcc ttccggattg gtcggtcgcg gattgcccac 2761561 cagatacatc agggcgacgt cctcgtcacc gtcggcatag aagtaactgt ctttgagatc 2761621 gatgtcgtgc tcggccgcaa agcgttgcac cgcagtggct ttgcccggac cccacaaaat 2761681 tggcttcagc acacccccgg tgagtatccc gtcctcgttg gtctcgaact tgttggtgag 2761741 catgttgttg atccccagaa aacgtgcgac tgggccaact tggatggtca gcgccgacga 2761801 gctgaggacc acggtgtggc cgcgggccac gtgagcccgg accagttccc gcatttccgg 2761861 gtagatccgg gactcgatcc gctgggcgaa tagccgctcg ccgatttctt ccaggtcggt 2761921 caagagccgc ccggccagcg ccgcggcggc ctttccgata aggtcttcga actcgattcg 2761981 cccgagcgtg tgattcaggc cggcctgaac cataccgagc agctcgccca cgcccatatc 2762041 gcggcgccgc agcctctcct gggtgaggat gacggccgtg aagccggcga ccagcgtgcc 2762101 gtccaggtcg aaaaacgcac cgaccttcgg gccggcagga ctggccagaa tctcggctac 2762161 cgaaccgggt aggcgcaaat ccggcgccga cttccgcgtc gcccgctctt ccccctgctc 2762221 gtcagcggcg ctcatgagcc cgacaccgat cgaggcactg aaccggctcc ttgagtatcg 2762281 aacgacgccg gcagcacacg cggtgccgga tcaccggcca gcgcaaggat ttcgtcgaaa 2762341 cccgcctgca ggcattgagc gaacaactcg tcgtttcgca ccgacgccct gtcgtagcgc 2762401 accgtgacgg tgcaccaccc gccccgggaa attagcacta ccatcatcgc cacaccgggc 2762461 aacggtccaa taccgtactg ccgcagtatc ttcgcgccgg caaggtaggt atcccctggg 2762521 tagaccggaa cattgctggc ttgcacatcg gaaccgatca ccgaaccggt gatcccctcc 2762581 agcacggccg tcggcaagac actcagcacc ggtgcaatgg aaccgatgat gttcatcgcg 2762641 ggctcgtcgc gacgctgggt catctgcgcc cggatcttct tcatccgggc caccggatcg 2762701 atagtgccca ccggcgccgc caggttgaca ccggtgaact ggttgccgcc ggccgcatcg 2762761 ccctcggccc gcaggttgac cggcaccgcc atcggcagcg tgctgatcgg cacgcccagg 2762821 gcctcgtggt agcggcgcag cgcgccacac agacccgcaa ggtaggcgtc gttgatcgac 2762881 ccgccgccgg cctttgcggc cttgtgcagg tcggcgagcc ggatgtcgat ggcctcggta 2762941 cgggtggtca ggctgcgccg gcgcagtagg ggtgagggtt cagcagctcg gttcagcacc 2763001 cggatgcccg acctggcgta gcccaagatc cccgacacgg tggacaccgg ttccagaaca 2763061 gcccgcccgg ccatcgatac cgccccggac agcgcgtcca ggacaccgcc gacgacagca 2763121 attggcaggt ggttgatgcc ccggcgcatc aggtcattgg gggacagatc ctccggaatg 2763181 ggttgcggcg gcgtcgacct aggtggtgga tcgcgctcga ggtcatagat ctgcgcgaac 2763241 atctccacgc cgccgacacc gtcggtgacc gcatggctga cgtgcagcag catcgccgct 2763301 ctgccgtcag ccataccctc caccagggtg gccgtccaca gcgggcgcga tatgtccagc 2763361 ggcgactgca gaatcacctc ggcgagatcg agcacttcgc gcaacgtggc gggtccggac 2763421 acacgcaccc gacgcacatg gaagtccaga ttgaagtccg gatccaccac ccagcgcggg 2763481 gccgcggtcg gcaaggtcgg caccaccacc ttctgccgca gccgcaacac ccgtcgcgag 2763541 gcgttttcga atcgggtccg gaagcgatcc cagtccggcg tgccgtccag cagttccagc 2763601 gccatgatcc ccgaacgagt ccgcggattt gcctcgcccc gatgcatcaa atagtcgacc 2763661 ggcccaagct cgtcggacaa cctgggggac tcgccggact cagccatggc cacgaccccg 2763721 cgcgggttgg gcaactcgac gcacaaactc tgtcaccgcc gatcagacct cctgcttcaa 2763781 acccgccacc gccacgcacc acagtgccaa cacaacgcta gtcgcgatga cgcggtggtg 2763841 aaagccgatg cgggccatga tccacccgca gcaccgatgc cgcggccacg accgacgaaa 2763901 cctcgtgttg ggcagccgag ttggaacggc caagctcagc tggccggagg tgacgacagc 2763961 gccagcgaac ccttgcgagc acccatccgt cgcccgtaga tcacacccaa gaagtccgag 2764021 accgcttcgg cgaccatccg cgatcggacc gtggcggcga ggtcgaacgc gtggtgggcg 2764081 ttggggagct cagcgtagga caccgtcgcg gcacccgcgt cgcgcagcgc cgcgctgaag 2764141 gcgcgagatt gcgcgctcgg caccatcgga tccttctcac cgtgcaacac gaagaacggc 2764201 ggagcctcgc tgtggacgta cgaaatcggc gacgccgcct tgaacagccc cgggttgtcg 2764261 acgtagcggc tacgcatcac gaagtgctcc aggaacggca tcatcatttc gtgcatattc 2764321 tcggcgttgg tgaggtcgta gacgccgtag tagggcgccg cggcttgtac cgccgtgtcg 2764381 gcgctttcga agcccggctg cagcgccgga tcattcgccg aaagcgcggc caacgcggcc 2764441 aggtgcgcac cggcggaccc gccggtgatc gtgatgaaat ccggatcgcc gccatagtcg 2764501 gcgatgttct cgcgaaccca cgcaatcgcc ctcttcacgt ccacaatgtg cgccggccac 2764561 gtgcaccgtg ggctcttgct gtagttgatc gacacacaga tccagccgag ttccaccatc 2764621 cggctcatca acgggtaagc ctgagggcgt ttgccgttga tggtccacgc cccgcccggg 2764681 acctggatga ggaccggagc ccggcggccg ggcgctaaat cgggacgccg ccagatgtcg 2764741 agtagattct cgcggccgcc gggcccgtac gggatgtcgg aggtctgggc cgcatagcgg 2764801 cgatggggtc cgggaatgtg cggtaggttc agcagcccgc tgcgccgggc agcctctgac 2764861 tgttcgccgg tcggatgcca cactaggtca cggaaatccg ggccgaaagc gtccacgagc 2764921 gccgcgtgca ggatttgatc cgcccgctgc gccgcccagc tggtgccaaa ccggccgatc 2764981 gagcgtggtg atatgcggga cagcgcgtgg ccggtgacga cgcgggccgg aaactccgcg 2765041 gacaaccatc ctgcgaccca accgatggcg cacggtgatc cacgcagcag cagggcgccg 2765101 gcccggcagg tgtctctggc gtcggccgcc agctgcgctc cctggcgcaa tgcctcggcg 2765161 ccggcccgcg agcaccgcga agtcacgctg gcgatgtgca taacaaagcc cacccctcga 2765221 cgtcaggcac acgcatcgtt gcggtaaacg gctggttgcc agccggtttt gtacgtgtgt 2765281 cgaggatcac acaataacca ataattgacg tggcggtaga cctttcgcgc gtgtggcgtc 2765341 tggaaaaatt cctcgacggc caccgttaga taaactgacc tgcgcatcgc ctccgtagct 2765401 caggtggata gagcaagggc cttctaatcc ctaggtcgca cgttcgagtc gtgccggggg 2765461 cactgtggaa atagcaggtc agcatggtgg cgtggcttga caccgcctcg ttatgggtcg 2765521 acgcccagag tcgccttcaa actcaaacca cggaggtgcc cgatggccca atacgacccg 2765581 gtcttgctca gcgtcgacaa gcacgttgcg ctcatcacgg tcaacgaccc ggaccgacgg 2765641 aacgccgtca ccgacgagat gtcggcgcag ttgcgtgcgg cgatccaacg cgccgaaggc 2765701 gaccccgacg tacacgccgt agtcgtgacc ggggcgggca aggccttctg cgccggggcc 2765761 gacctgagtg cgctgggcgc cggggtcggc gatccagccg agccgagatt gttacggctc 2765821 tacgacggtt tcatggccgt cagtagttgt aatctgccca ccatcgccgc ggtcaacggc 2765881 gcggctgtgg gcgccggact caatctggcg ttggccgccg atgtgcgcat cgccggaccg 2765941 gccgcattgt tcgacgcccg cttccaaaag ctgggactgc atccaggtgg cggcgcaacc 2766001 tggatgctgc agcgagcggt gggtccgcag gtcgcccgtg cggccttatt gttcggcatg 2766061 tgcttcgacg ccgaatccgc tgtgcggcac ggcttggcgc taatggttgc cgacgatccc 2766121 gtcaccgcgg cgctggagct ggccgccggg cccgcagccg ccccgcgcga ggtcgtgctg 2766181 gcgagcaaag ccaccatgcg cgccacagcc agccccggat cgctggacct tgagcaacac 2766241 gaactcgcca aacgcttaga acttgggccg caggcgaaat cggtccagtc gcccgagttc 2766301 gccgctcgct tggctgccgc tcaacacagg tagcgcctac cagcctcgag ggtttccatg 2766361 gcgtgcccca gtccgaagct gctgctgctt gactccgcgc gctgggcccg agcgcgcgct 2766421 gttgtacggc ccaaacggcg tgtcggtgta cagtcgcgcg ctcgcggctt cagtccggcc 2766481 cccgactccg gcaggcccga cggcgcccag cgctagccgg gcgcgccggc catgccttcg 2766541 gtgccggaaa cgccagggga cccggggccg ttggtgaggc cccccgcgcc tgcctcaccg 2766601 ccgctaccgc ccgcgccacc ggcaccgcct gcgccgcccg cgccaccgat accgtcagcg 2766661 ccgctgactc ctgcggcacc gctgaggaac cctccggacc cacccgcacc gccggcaata 2766721 ccgccagcgc caccgttacc gccgtttgcg ccgttgcccc cgttgccgcc tgtcccgccg 2766781 gccccgccga tggagttctc atcgccaaaa gtactggcgt tgccaccgga gccgccgttg 2766841 ccgccgtcac cgccagcccc gccgactcca ccggccccac cgactccgcc gctgccaccg 2766901 ttgccgccgt tgccgatcaa catgccgctg gcgccaccct tgccacccac gccaccggct 2766961 ccgcccaccc cgccgacacc aagcgagctg ccgccggagc caccatcacc acctacgcca 2767021 ccgaccgccc agacaccagc gaccgggtct tcgtgaaacg tcgcggtgcc accaccgccg 2767081 ccgttaccgc caaccccacc ggcaacgccg gcgccgccat ccccgccggc cccggcgttg 2767141 ccgccgttgc cgccgttgcc gaacaacaac ccgccggcgc cgccgttgcc gcccgcgccg 2767201 ccggtcccgc cggcgccgcc gacgccaagg ccgctgccgc ccttgccgcc atcaccaccc 2767261 ttgccgccga ccacatcggg ttctgcctcg gggtctgggc tgtcaaacct cgcgatgcca 2767321 gcgttgccgc cgcttccccc gggccccccg tggcgccgtc accaccgata ccacccgcgc 2767381 caccggcgcc accgttgccg ccatcaccga atagcaaccc gccggcgcca ccattgccgc 2767441 cagctccccc tgcgccaccg tcggcgccgg aggcggcact ggcagccccg ttaccaccga 2767501 aaccgccgct accaccggta gaggtggcag tgacgatgtg tacgaaagcg ccgcctccgg 2767561 cgccgccgct accaccccca ctgccggcgg ctacaccgtc ggacccgttg ccaccatcac 2767621 cgccaaaggc gctcgcaatg tcgccctgcg cgactccgcc gtcgccgccg ttgccgccgc 2767681 cgccaccggc agcggcggta ccgccgtcac caccggcacc gccggtggcc ttgcccgagc 2767741 ctgccgtcgc ggtggcaccg tcgccgccgg tgccaccggt cggcgtgccg gcagtgccat 2767801 ggccgcccgt gccgccgtcg ccgccggttt gatcaccgat gccggacaca tctgccgggc 2767861 tgtccccggt gctggccgcg gggccgggcg tgggattgac cccgtttgcc ccggcgaggc 2767921 cggcgccgcc ggtaccaccg gcgccgccat ggccgaacag cccggcgttg ccgccgttac 2767981 cgcccgcacc cccgatgcct gcggccacgc tggtgccgcc gacaccgccg ttgccgccgt 2768041 tgccccacaa ccaccccccg ttcccaccgg caccgccggc cgcgccggta ccaccggccc 2768101 cgccgttgcc gccgttgccg atcaacccgg ccgcgcctcc gctgccgccg gtttgaccga 2768161 acccgccagc cgcgccgttg ccaccgttgg ccaaacagca acccgccggc cgcgccaggc 2768221 tgcccgggtg ccgtcccgtc ggcgccgttt ccgatcaacg ggcgccccaa aagcgcctcg 2768281 gtgggcgcat tcaccgcacc cagcagactc cgctcaacag cggcctcagt gctggcatac 2768341 cgacccgcgg ccgcagtcaa cgcctgcaca aactgctcgt gaaacgctgc cacctgtacg 2768401 ctgagcgcct gatactgccg agcatgggcc ccgaacaacc ccgcaatcgc cgccgacact 2768461 tcatcggcag ccgcagccac cacttccgtc gtcggcatcg ccgcggccgc attagccgcg 2768521 ctcacctgcg aaccaatact cgctaaatcc aaagccgcag ttgccagcag ctgcggcgtc 2768581 gcgatcacca acgacacctc gcacctcccg ataccccata tcgccgcacc gtgtccccag 2768641 cggccacgtg acctttggtc gctggctggc ggccctgact atggccgcga cggccctcgt 2768701 tctgattcgc cccggcgcgc agcttgctgc gcgagttgaa gacgggagga caggccgagc 2768761 ttggtgtaga cgtgggtcaa gtgggaatgc acggtccgcg gcgagatgaa taggcggacg 2768821 ccgatctcct tgttgctgag tccctcaccg accagtagag ccacctcaag ctctgtcggt 2768881 gtcaacgcgc cccagccact tgtcgggcgt ttccgtgcac cgcggcctcg ttgcgcgtac 2768941 gcgatcgcct catcgatcga taacgcagtt ccttcggccc aggcatcgtc gaactcgctg 2769001 tcacccatgg attttcgaag ggtggctagc gacgagttac agcccgcctg gtagatcccg 2769061 aagcggaccg ctcccatgcg cccccgggcc gcgtcggccg cgccgaacag ccgcaccgct 2769121 tcccggttgc tgccggcatc cgccatcacc gaggcgaggc actcgagaat gtcggggacc 2769181 cataggtatg ccccaatgga cgcggccacg ccgagggcgt cgtgggcatc gcgctcggcc 2769241 cggtggcgat ccccttgggc gatctcgatg cggcaacggg tagtcagggc gcgggcgcgg 2769301 tgcacgccac gagtgatcga cgctgcgccg tcggccaatc ggtgcgccgc gttcagatca 2769361 cctcgcgcac acgatatttg agccgaactg gtggggtcgt tgatgatcgc cgccgcgctg 2769421 gcaccaaaga atcgcgttgc cgattcgcgg gcgtgttcgg cggccgcgac gtcaccggcg 2769481 gccagggtcg cgaagaccag cgcggagcag gccgagcccg acagcaccgg gctgagtcca 2769541 acggcggtgt cgatgctggc ttgggcggcg gcggccgcct cggtgtcgcc gcggtgcgct 2769601 aacgcgtgcg ccaagcaagc ctggcccgcg cagctgctaa ccatgtcgtg cgcggcgtcg 2769661 gactcgccga tcacctcgcg cgacaggccg accgctgcct cgaggttgcc ctgccagaga 2769721 ttcgccgcgg ccagcgccca gcgacatgaa cgtgaaagga atgcatcacc aatctcgtcg 2769781 gcgaggcttc gtgcctcctc gcccgccgcg cgggtcgcgc ccgggtcacc ctcgccggcg 2769841 aacccgacat aggcctgcca ggccagaacc tcggccaacc gccacttgtc gcccaccgcc 2769901 cgggccaggc cgacggcctc ggccagccac ggtcgcgcca gatccgcgtt gtaggcggcg 2769961 acacccccgc acgcggtcag cgcccgcgcc agcagggccg gatcctcgat gtcgcgcgct 2770021 atagccagcg ccttctgggc atcatctagg cggtcggtga tgccggccac ggcatctatc 2770081 agggcccggt cggccagtgc ccgcgcatac aacccagggt cggcccccgc cggatgtgca 2770141 tcgtggtcgg ccagggcggc ggcgaaccag gccagcccct cttgcaggcg gccccgggca 2770201 cgccacaacg gctgcagaca tgatgccaac agcaacgcgt ggccggtatc gccattctcg 2770261 cggctgaacg cgaaagcggc ccgtaggttg tcgatctcga gctcggcctg gttgagccgg 2770321 cgttcatggc cggccaccga gggggcgtca agcccggcgg caacggccgc gtagtggtcg 2770381 cggtgtcgcg cacgcacggc atcggcatcg ccggattcac gcagcttctc caacgcatac 2770441 tggcgcaccg tctctagcag gcggtagcgc gttcggccgt cgctgtcgtc ggtcaccacc 2770501 agagacttgt ctgccagcag gctgagcaga tcgaccacct cgtagcgctg aacgtcaccg 2770561 ccggcggctg ccgcttgggc accgtcgaga tcaaacccgc tcgggaaaac cgccagtcgc 2770621 cgaaacagca cctgctccgg tccggtcagc agcgcatgtg accagtcgac ggaagcccgc 2770681 atcgtctgct ggcggcgcac cgcaatacgc gatccaccgg tcagcaggcg gaaccggtca 2770741 tgcaagctgt cgacgatttc ggtcagcgcc agggcacgca cccgcgacgc tgcaagttcg 2770801 atcgccagcg gaatgccgtc gagtcggtgg cagatctcgg tcaccagggc gaggttgtcg 2770861 gcagtgatct cgagttcggg ccgcgcctca cgagcgcggt cggtgaacaa ctcgatcgcc 2770921 tcgccgtgcc ccagcggggg aacccgccaa atctgctcac cggccaccgc gatcggttcc 2770981 cggctggtcg ccaataccct cagcgctggg cacgccccga gcaacgcgac gatcagagcc 2771041 gcgcacccgt cgagcaagtg ctcgcagttg tccagcacta ccagcatgcg ccggtcgccg 2771101 atacgccgca caatggtgtc caccgtcgag cggcccggct gatccggcaa ccccaaaacc 2771161 cgcgccgccg cgatcggcac cagcgccggg tcggtgatcg gcgccaggtt gacataccaa 2771221 accccgtccg gataaccgtc ggcaacggcg ctcgcgacct gtgtcgccag gcgtgtcttt 2771281 ccgaccccgc cgacaccggt aagggtgacc caccgtttga cgtccagcag cccacggact 2771341 tgcgccactt cgtcgacgcg ccccaccagc cgagtgagct gggccggaag acagtgcgca 2771401 ccaacgactt tccgggtccg cagcggcggg aacgcgttgt gcagatcagg gtgacacagc 2771461 tgcaccaccc gttccggtcg gggcaggtcg tccagccggt aggtaccgag gtcgttcagc 2771521 cacgcgtcct tgggcagcag gtcagcaacc agatcgctgg tagttcccga caacacggtc 2771581 tggcccccgt gggccagctc gcgcagccgg gcggtgcggt cgatggtcgg ccctacgcag 2771641 ttgccctcgt cgggtgacga cacctccccg gtgtgcatgc cgatgcgcag ccggatcggt 2771701 gccagcggcg cccgctgcaa gcccagggcg cacgccacgg cgtcggatgc gcgggcgaac 2771761 gccaccaaga agctgtcgcc ttcgccctgt tcgaccgggc aaaccccgcg gtgctcgcga 2771821 accaattcgg tcagcgttcg gtccagtttg gcgatcgccg tcgtgtcaag ctgagacccc 2771881 ggcaggtggg tcgcgccctc gatatcggcc agcagcaacg tcaccgtgcc cgtcggtaca 2771941 agctcgctca caccatctgc gctccagtcc acaggtacca cgtcgacgcc ggggtgaatc 2772001 ttgctcatgc tagccagcat cgagccagcg cgtagcgcat tacatcggca cctgcgccta 2772061 gattgctcga aatctcttgg ccgccggtcc atgtgttcta cgcgctttag tcgatgcatt 2772121 cggcgaccgg cgtgccatcg cggcggacct acagtgcccg tgctgtccgc tggcaattgt 2772181 gagtccccca gtgctggcag catcgcccgc aagaaccgac acgaccgcat cgtgggcggt 2772241 gccgtcgaag tcgccggctg accgatcggc ggagtcaccg gcccgatggg gtttccgaag 2772301 gctagggaat gatgacgatg gggcggccgc ctcggccgcc ttcgccgtaa cccccaacca 2772361 tgcggaaaac gagcctagcg tcgcccggcc gcgcagagcg agccatcgcg gtggcgccaa 2772421 cgacaggaag cgatccggat tctctgacca tggtgggtgt tctggctacg tgacgttaac 2772481 ggagatggag gggccgcctt cgccgccttc accgccggaa ccgccggagc cagggtcgcc 2772541 cctcccgttg ccggagccac ccgactcgcc cgacgagccg acgccgccgg aggtcaagcc 2772601 accggcaccg cgtccgccgt cacctccgcg cccgccgtcc ccgccgtcac cgccgccgat 2772661 gctgcgaggc ggaggggcgc cgaagccgcc ggagccgccg gtcccgccgt cgcctccgtc 2772721 accaccgggg gcgccaccgt ctcgcccggc cccacccaag ccgccgttgc cgccgttgcc 2772781 acccggcccg ccgtcgcctg catcagcaaa gctgccgttg ccgtccccac cgtgaccgcc 2772841 gttcccgccg tcgcctccgt caccgccggg ggcgccgaag ccggccttgc cgtgcgcgcc 2772901 acttgtggaa ccgaaaccgc cttgtccgcc ggggcggccc cacccgccgt cgccgccgtc 2772961 acctccgtcg ccgccaggct ctccgtcaaa atccgcgaga taggtaaagc cgtcaccgcc 2773021 caagccacca ttaccagcgt ccccgcccga cccgccgtca ccgccgtccc cgccaacgcc 2773081 tcgattgccg acctcgccgg cgggtgccga cccgccggcc ccgccgtttc cgccggcgcc 2773141 gccccacccg ccgtagccac cgtcgccgcc gtcgccgtcg cggcccgtcg tttcgttaat 2773201 gtcaaagccg tcaacgccgt taccgccgac cccaccagcc ccgcctaggc ctccggcccc 2773261 gccgtcacca ccgtcgccgg tctgagttcc gccggcgcca ccggccccgc cgtcgcctcc 2773321 cgccccaccg ctgccgccgt cgaagccgtc gaagcccttt aggtcggagt cgggcgacca 2773381 acccgcgcca ccggccgcgc cgttgcctcc ctggccgccg gttccgccgc cgccgttcat 2773441 cccggcgtcg ccgcccgccc cgccgtgtcc accaaccccg ccgccgccgc cggggctgcc 2773501 gccccggcca gccccacctt ggccgccggc tccgccgttc ccgccgtcgc ccagaaatgc 2773561 tccgccggcg ccaccagccc caccggcgcc accagcccca ccgttgccgc cagcaacggt 2773621 gagccctccg agggcaccgt gcgcgccgtc gccacccttg ccgccgtcac cgccgtcacc 2773681 gatgtcgccg gcgtcaccgc ccttgcctcc agccccaccg gccccgccat caccgccgag 2773741 agcttcggca gcggtgccgt cggccccatc accaccggct ccgccgtccc cgaatagccc 2773801 ggcgttgccg ccgtcaccgc cctggccgcc gtcgccgccg gccgcggcgg ccttggcacc 2773861 gttgccgccg acgccgccgt cgccgccggt cagtggcccg tgtttgctgg cgtccacgcc 2773921 gttggccgcg gaggtgccgt tgccgctgtc accccccaga ccgccgcgac cgcctgcgcc 2773981 ggggtcaccg ccgttaccgc ccgctccgcc ggcgccgccg acggtgatac caatgccgcc 2774041 gttgccgccg gccccgccaa cgccgccggc gccgccgagt ccgccgtcgc caccgacccc 2774101 accggtgccg tgactgccga ccgtccccga aggtgcggcc ccgccgaccc caccgtcccc 2774161 gccatgtcca ccgaccccgc cggcaccgcc atcgccgcca ccaccgccgg ccccaccggt 2774221 gccgccgata ctgtcgatac cgttggcgcc cctggccccg gccccaccgc tagcgcccac 2774281 accgccgttg ccgccggccc cgccgttgcc gccggcaccg ccgtcacccg acaccgaccc 2774341 accggcgcca ccggcaccac cggcaccgcc ggcctgcccc gcgtcgccct gacccccgtt 2774401 gcctcccggt tggccgaggg cgagggcatc tgaaccaggc gcgcccgaat tggccccgtt 2774461 ggcgccggcc gcgccatcgc caccattgcc gccggcgcca ccatcgccga cccggccggc 2774521 attgccgccg tcgcctccgt tgcccccggc gccgcccgcg acgctggctt gcgcaccgtt 2774581 gccaccgtta ccaccgttgc cgccgctgcc gggcccgtgg tcgctggcgt ccacaccgct 2774641 ggccgcgcgg gtgccgttgc cgctgtcgcc gcccaagccg ccgaggcctc ccgcgccggg 2774701 gtcaccgccg tcaccgccgt ccccgccatc actgccatga ccgccgtcac cgccgttgcc 2774761 gccggctccg ccgagcccgc cgtcaccgcc aacgccgccg acaccgtggc tgccgacctg 2774821 acccgcgggt gcggccccgc cggcgccgcc atcaccgccg ggcccgccgt caccgccaac 2774881 gccgccgaca ccgccgtcgc cgcctttccc gccggcgcct ccaacggcct cagcgctgtc 2774941 ggcgccggag gcgcccttgc cgccgccgcc accactagct ccggcaccac ccgcaccgcc 2775001 ggccccgccc ttgccaccgg gtccaccgtc gcccgacacc gacccaccgg cgccaccggc 2775061 tccgccggca ccgcccgcgc cgccggcctg ccccgcatcg cctcgaccgc cgttgccgcc 2775121 actacctaac gccgaactcc cggcaccacc gtcaccgccg gcaccgcccg cgccgccacc 2775181 gccaacgccg ccttgaccgc cgttggcctc gcttccttcg cccggttgcc ccgagagggt 2775241 gccgtcggcg ccgtcctcac ctacgttcgc gccattcgca ccggccgcgc cgctaccacc 2775301 gtcgccgccg gccccgccgt caccgaccaa cccgccgtta ccaccggcac cgcccttgcc 2775361 gccgttcgcg ccagccacgg tggcgtcggc gccgttaccg cccttgccgc cgttgccacc 2775421 actgtgcacg gtagcgccgg tcgcgccggt cacaccctcg gtggcgccgg cgccgccggc 2775481 gccacccttc ccaccggcgc ctgggtcacc accatcgccg ccggccccac cgtcaccatc 2775541 cttgaaagcc atgtcgccgc ggccaccgct gcccccgtta ccgggcgccc caccagcccc 2775601 gccggcacca ccgtccccgc caacaccctg gctgcccgcc cgacccgcag gtgcgtcacc 2775661 gccagcccca ccggccccac cgtcaccgcc gcgaccgccg gctccgccat caccaccgtt 2775721 gccgccgtca gatacgagca cagcattgaa accgtgagct ccgttaccac cggccccgcc 2775781 ggccccaccg ttgccgccgg caccgccggc cccgccatcg ccggcgtggg ccccacccgc 2775841 gccgccggcc ccaccggccc cgccgttacc tccatcctca ccgggggtac cggatgaacc 2775901 caggaagatc gccgtcatat cggcatagcc ggcacccgcg gctccgtcac cgccatgacc 2775961 gccggcccca ccgtcaccga ccaacccgcc gttgccaccg gcaccgccgt taccgccatg 2776021 accgccggcg accgatgcgt gggcgccatt gccacctttg ccgccgttgc cgccgctgac 2776081 cggcccgtcg ttgccggccg ccagaccatt ggcccccgcg cgagcaccgg cgccgcccga 2776141 cgcccccccc cccccccggc tccgccagcc ccacccaggc ccgggtcgcc accgtgaccg 2776201 ccggccccgc cgtcaccggc ggccagccaa ctgccaccgt tgccgccggc accgccgtca 2776261 ccaggcgctc cacccagccc cccacccccg ccagccccgc cgtctccggc ccggccgaca 2776321 taccccaata gtccagccga cccgccagca cccccggcgc cgcccacgcc gccgtttccg 2776381 ccggcaccgc cattgccgcc ggcgccgccg tcaccgccgg ccgccgagat accggccggc 2776441 ccatttattc cggtagcccc ggcaccgccg gcaccgccgg ccgcaccggc accaccggcc 2776501 ccgccgacac cgccaacgcc accggcgccg ccgttaccga gaagccaccc tccccgacca 2776561 ccgttgccgc ccaccccacc ggcaccgcca tcgcccccgt ccgaccctgc caaaccgtca 2776621 ccgccggcac cgtcgtccga cccggcaaca ccagccgccc catcctgacc gggcgtagca 2776681 ccgttggccc cggccgcacc tacaccaccc acgccgccag cgcccccatg accaaaccac 2776741 cccgcgttac cgcccgcgcc gccggcgcca ccaccagccc caccggcacc accggcgccg 2776801 ccgttgccga acaggcccgc gttgccaccc gccccgccgg cgccaccacc agccccaccc 2776861 ataccgccga cgccgccccc acccgccagc catccacccg tcccgccggt acctccgggt 2776921 gcgcccgccc cgcccgcgcc accggcgccg ccgatcccaa ataacccggc cgccccgccg 2776981 gcgccgccca cctgcccgac ggcaccggca gcgccgttgc cgccattgcc gaacaacaat 2777041 ccaccggccc caccgggctg cccaggcgct gtcccgtgcg caccatcgcc gatcagcggg 2777101 cgtcccagta gcgtctgggt gggcgcattc acggctccga gcacgactcg catcgccgcg 2777161 gcgttggcga tctcggtggc cgtgtaccac ctcgcggccg cggtcagcgt gtgcacgaac 2777221 tggtcgtgaa acgccgcggc ctgcgcactt agcgcctgat actcctgagc gtgcgcgctg 2777281 aacaacgccg cgatgcccgc cgacacctcg tcggcgccgg cggccaccac ttccgtcgtc 2777341 ggcatcgccg cgaccgcact agccgcgctc acctgcgaac caatacgcgc cagatcaaaa 2777401 gccgcagttg ccatcatctc cggcgtcgcg atcacatatg acatctcgca cctacccaat 2777461 agcccgaccg tcgccgcgcc gctcccgctg cgactagtga ccccttggtc tcttgagcca 2777521 gcgaccccaa ctaccgccgc gacaggcctt gttctgattt gccgcgacga cctcccaggt 2777581 gggtcgaacc cactctgtcg gccagcagca atgccaccga acccgccgcc accggattgc 2777641 ccacgtcgtc tttgctgact ttgctgcagt ccagaggtgc cacgtcggca cccgggtcaa 2777701 tctcgcgcat gccagccagc atcgagccag cgggcaccgc aatacatcag cacgtgtgac 2777761 tagattgctc cgaattctgt cgaacgcggg tccgcgtgat ctgcgcgttt cggtcgatgc 2777821 tttcggcagc ccggcctccg atcattaacg aacccgagac gaggagagcg ccatggtcga 2777881 cacgagcgcg cccgccagcc ggctggacac cgatccgcgc cgcgctcatg tgagtcttag 2777941 taagcacccc taccagattg gagttttcgg gtccggaaca attggtccga gagtctacga 2778001 actggcctat caagtcggtg ccgagatcgc aaagcaaggc cacattctca tcagtggcgg 2778061 gatgactggc acaatggaag cctcctcacg gggtgcgtcg gacgccgacg gccttgtcgt 2778121 cggcgtcctg ccgggcgaca agtttaccga tggcaatgcc tattccacga taaagattct 2778181 gagcggtatg cagtttgctc gtaactacat aacaggtttg agctgccacg gagcaattgt 2778241 cgtcggcggc tcgagcggcg cctatgaaga agcccgtcgt gtctgggaag gccgtggccc 2778301 cgtggtggtt ctagcgaaca gcggatcgcc aacgggtgcg tctgcgcaaa tgctgtccat 2778361 gcaggaaatc tttggggtcg cctttccgga ggacaaaccc aagccctggc gagtcttttc 2778421 ggcggcaacc cccgccgaat cggtgtcgct tgtcattggc ctgatccgga aaggatatgc 2778481 ccaacatgag ccgtaggata attaacgagt tcggagtaca gatctacggg gccacgatag 2778541 gtgacacctg ggccgggctg gtcagggcgg tgcttgacct tgggtctcag tgttttgacg 2778601 aagaccgaga gcgtatagcg ctgtccaacg tccgcatcaa gtcttcggtg cagaattatc 2778661 ccgatctcac tattgaagaa cattgcaaca gcgaccaact aaaggccatg ctagatttca 2778721 tgttcaacac cgataccatg gaggatatcg atgtggtcaa gagcttcagt cgtggcgcaa 2778781 aaagctacca tcgccggata aaagaaggac gaatgattga gttcgtaatt gagcgactga 2778841 gtctaattcc ggaaagcaag aaagcagtgg tcgtgttccc gacttacgag gattacgcgg 2778901 cggtcatgcg taatcatcga gacgattact tgccttgcct tgtttcgata cagttccgct 2778961 tgttgccaga cggcaaagat tacgtcttcc acacgacgtt ctattcgcgg tccatggacg 2779021 cctggcaaaa aggtcacggc aatcttttgt ctatcgccaa gctatcggat tgggtgcgag 2779081 agaacgtcag tgcgcgcatt gggcgcaaga tcatgcttgg cccgcttgat ggcatgattt 2779141 gtgatgttca tatctacaag gagacgtatg cagaggcttg caagcgtttg gccaacctcg 2779201 accttaggcg aacacaattt gacgcggtgc ggaattagtg aggacgctaa gcctccccag 2779261 ctgatgcgtt gatgcgctag catcagggct gtgcgaacga cacttgacct tgacgacgat 2779321 gtgatcgccg cggcacgtga acttgcctcc agccagcgcc gctcgctcgg ctcggtgatt 2779381 tccgaactcg cacgccgtgg tctcatgccc ggacgcgtcg aggctgacga cgggctgccg 2779441 gtgatccgcg ttccagccgg gaccccgccg atcacaccgg agatggtccg tcgcgcgctc 2779501 gatgaggact gacgcgggtg gcgctgctcg acgtcaacgc attggtcgcg ctggcgtggg 2779561 actcacacat ccaccacgcc cggatccgcg agtggtttac cgccaacgcc acgctcggct 2779621 gggcgacttg cccgctcacc gaagccggct tcgtgcgggt gtcgacgaac ccaaaagtac 2779681 ttcccagcgc gatcgggatc gcagacgctc gacgggtcct cgtggcacta cgcgccgtgg 2779741 gaggccaccg cttcctggct gacgacgtat cgctcgtcga tgacgatgtt ccgttgatcg 2779801 tcggttatcg ccaggtgacc gacgcccatc tgctgacact cgcccgccgg cgcggcgtcc 2779861 gcctggtcac cttcgacgcc ggtgtcttca ccctcgccca acaacgcccc aagacgccag 2779921 tggagctgct gaccatcctc taaccaaagc tgccagcccg cccggctaca gatccaacag 2779981 cgcggtctcc ggcgactcga tcagatcccg cagctcacac atgaactggg ccacctgagc 2780041 accatcgaca acgcggtggt cgaacacaca agtcaacgtc atcgtcggcc gtgcgacaac 2780101 ctcgccgccg acgaccaccg ggcgcggctt gatcgccccc agacccagga tcgccgcttc 2780161 gggatggttg atcaccggca cgccgtcgtc gactcccagc gccccgaagt tcgacaccgt 2780221 gaacgtcgaa ccgcgcagct ccgcgggtgt gagagtgcct tcacgtgcgc cggtgattaa 2780281 ttccgctacg cgggaggcaa gttcgcgggt gttcttgtcc tgggcgtcgg tcaccaccgg 2780341 caccagcaat ccacgctcag tggccgcgcc gaaccccaga tgcacaccgc gatgcacgtg 2780401 tacttgcggg ccttcgcccg agtcgaccca cgtcgagttg agaattacgt tgtgtttcaa 2780461 tgcaataacc agcagccgca gcgtcagcgc gaacggtgta atctcgggcg ccgccgaaac 2780521 gaaccagtcg cgcagccgca gcagttcggc gcaaattacc tcaacgctgg cctttgcggt 2780581 cggaatctcc ttgtgggaca acgtcatttt ttcggccatc cgcgcgtgca cgccgtggac 2780641 cggccgcacg tccggcccgg ctccgacgcc gcctcgagca gcggccagca catcggcccg 2780701 ggtgatcaca ccgccggcgc ccgacccacg ctgcaatgcg gccaggtcga ccgccaactc 2780761 tttggccagc ttgcgcacta ccggtgccgc cagcggccgg cttgtccgtc tactggtttc 2780821 gatcgcggcg tcggcaccgt agccgaccaa cgtggggacc gctccttcac cgttaggctg 2780881 cgcaactgcc gtgggcccgg tgtcgatccg aactagctcc gcgcccactt tgagcacatc 2780941 gccttcggcg ccgcctaact cgacgatccg gccggcatac gggctgggga tttcgacctc 2781001 ggccttggcg gtctccaccg aacacagcgt ctggttgatc tccacatcgt cgccgacggc 2781061 gacgctccaa cacgtcaccg tcacttcctg cagtccctcg ccgaggtcgg gcaccgggaa 2781121 agacctgatg ctgtcctcac cgctcatggc tgacgcagca cacgttcgac gcagtccaac 2781181 agccggtcgg ggccgggtaa ccacaatttt tccaaccgcg caggcgggta gggtgtgtca 2781241 aaaccgcagg cacgcaacac cggagcctcc aattggtaga acatctcttc ctggatgcgc 2781301 gcggccagac cggcaccata gccgaggctg cgcggccctt cgtgcatcac cacgcaacgc 2781361 ccggtgcgct ggatcgacgc agcaatggtg tcgaagtcca gcggcgccaa cgaccgcaga 2781421 tcgataacct ccagactcca atcatgttgc tgctctgcag tatccgcgct agacagggcg 2781481 gtgctcacca ggtttccgta cgttaccacg gtcacatcgg tgccggaccg gcgcaccatc 2781541 gcgtgcccga tcggcggttc cggccggcta gtgtcgacca tcccgcggct gtggtagcgg 2781601 cgtttgggct ccagatacat cacggggtcc gggcaggcga tagcgtgccg cagcagccag 2781661 taagcgtcac cgggtgtcga cggcaccacc accttgaggc ccgcggtgtg cacccagtag 2781721 gactccgtgg agtccgaatg atgttcggcc gcaccgatac cgccaaacga ggggatccgg 2781781 acggtcaccg gcatgtccac ctcaccgcgg gtgcgagtcc ggtacttggc cagatggctc 2781841 accacttggt cgaaagccgg ataggaaaag ccgtcgaact ggatttctgg caccggcaca 2781901 aagccacgta gtgccaaccc gacggctatt ccgatgatcg cggactccgc cagtggcgtg 2781961 tcgaagcacc ggtctgcacc gaacgtatcg gccagtccct cggtcacccg aaacacccca 2782021 ccctcgaccg cgacatcctc gccaaacacc aatacccgct cgtcggcggc catcgcgtcg 2782081 tacagggcgc ggttgatcgc ctggaccatg gtcaacgact gcgtgatgtc gctcaccgct 2782141 accgcaagcg tctcatccgg cctggccgga cggtctgcga tttgagtcat gcccgcctcc 2782201 tcagtcagtc cgcgccagtt cggcacgcag ctgttcgcgc tgcgcctgca acccgggtgt 2782261 gatttcggcg tacaccgtgg tgaacacctc atcgacgtcg aagtcaggcg catcaaagac 2782321 cgcgtcgcgt agctcggacc gcacgtgttt tgcccgagcc gtcacctgtt cctcgaggcg 2782381 ttgcgaccac aggccctgat cttgtaagta agtgcgatag cgcggaatcg ggtccagcgt 2782441 cgcccagcgg tccacctcct cctggctgcg gtaccgggtt ggatcatcgg cggtggtgtg 2782501 cggaccaaga cggtaagtga ccgcctcgat cagcgttgga ccgtcgccgg cccgagcccg 2782561 agcggcagct tcggccatca ccgcatagca tgccagcacg tcgttgccgt ccacccggat 2782621 gcctggcatc ccgtagccaa tcgccttgtg cgcgatagat ggtgcggcgg tctgcctgga 2782681 taccggcatc gagattgccc actggttgtt ctgcacgtag aacacgcacg gtgtggtgaa 2782741 caccgccgcg aaattgagcg cctcatgtac gtcgccctcg ctggtggcgc cgtcgcccag 2782801 aaaggccacc gtcacggagt cctcgtccag gcgttgcgcg gccatcgccg cgcccaccgc 2782861 gtgcaaggtc tgggtgccga tgggaaccga catcggtgca cagcacttcg tggtgaattg 2782921 cagcccgccg tgccaggttc cacgccacgc gaccccaaca tgtccaggcg ggatgccacg 2782981 cactaggtag acgcccaatt ctcggtattg ggggaacaac cagtcggttt tgcgtaggca 2783041 agccgccgca cccacctgcg cggcttcctg cccgcgacag ggcgtgtaca acgccagctc 2783101 cccctggcgc tgcagattga cgaattcggt atccagctcg cgggtgacca ccatcatctc 2783161 gtagagccaa cgcagcgttt cctcaggaag gtcacggtgg tagcggcgtt cggccgtcgg 2783221 cgtaccgtcc gggccgacga gttgcaccgg ctcaagatcg acagacatca acatcccaga 2783281 tggcctccga gaaccctccc ccataccgtc tcctcagctc gcgatcacaa cgcggttacg 2783341 cgtcagaaga tgccgtgcgt tccatcctta gcgtcggcgc tggtggtgcg gcgctcacag 2783401 cacatcccgc ctgggaaacc gctgcgagcc gaagttgatc gccggccgca ccaaagcttc 2783461 cgctcgccgc aacactggcg cttcctccat tatgccccca aatgtgaaga gtccggaccg 2783521 atcgcgaacg catcgcaacc gtgtcgcggg ggatctgcgc cctcattcgg aggtggcttc 2783581 cccggctcgc cgcaacatcg tttccgcgtg cgtgagcact ggagagtcga ccatctggcc 2783641 ttcgaacgcg aacgccccac gctcgcttcg cgacgcggcc aaaacccgcc gagcccaggc 2783701 cagcttctcg tggctgggtc gataggcctt gcgcaccacc gggatctgac tcgggtgaat 2783761 gcacacggtc acgtcaaagc ccaccgccgc ggcgtctctg gcctcttcct gcaagccctc 2783821 gacatcgagg atatccagat gtacggcatc gagcgcgaga cggccgaacg cggacgcggc 2783881 gagcaggatg gtcgagcgga catgtcgggc cacgtcacga taggcaccgt cggcccgccg 2783941 gctcgagctg ccgccaaggg tggcgatcaa gtcttcggca ccccacatca ttcccacggt 2784001 gggatcggcc gcggcgattt cggcggcgca cacggcaccg cgcgcggtct ccaccagcgc 2784061 gatgacatca cgcggcgcaa gctcgatgac ttgggccgcc gattcggcct tgggcagcat 2784121 caccgtggta taggcggtgc ctgcgagggc ctccagatcg cgggcctgat cagcagtacc 2784181 gcccgcattg atacgcacca ccgtgcgttc cgggtccagc ggggtgtccc gcaacgcatt 2784241 gcgcgcggca ggcttctgcg cctcggccac gccgtcctcg aggtcgagaa tcaccacgtc 2784301 ggccgcggcg gcagccttcg caaagcgttc cggacgatcg gcagggcaga acagccaccc 2784361 cggaccggcg gcacgcaggt tcattgcgcc tccttaatgg actgcttttg gaccagcgtc 2784421 gtgcgcaccg cgcgggccac cacctcaccg tgctggttgc gggcgatgtg ctcgagtgtg 2784481 acgatgccct cgccgggccg gcttttcgac tcacgtttac cggtacagac ggtctctgca 2784541 taaagcgtgt cgccgtggaa gaccggtttg ggaaacgaca cctcggagaa gccgaggttg 2784601 gccacgatgg tgcccaacgt caactgcgca accgacagac cgaccatcgt cgagagagtg 2784661 aacatcgagt tcaccagccg ctcgccccga aaacccggct gctgcccagc ccacgccgcg 2784721 tcgaggtgca gtgactgggt gttcatcgtc agcgtggtga acaacacgtt gtcggcctcg 2784781 gtgaccgtgc ggccgggccg gtgcaggtat gtggtgccga tctggaactc ttcaaaccac 2784841 aagcctcgtt gaagaatcct tctgccgact gtggatcctg cgacgcgaca cgccgatacg 2784901 gcgtcgtcag attcacggtc gccggcgtgc tttgtcactg cagtcccaac gatcgcgcga 2784961 taagcatcag ctgcacttcc gtggtgccct caccaatctc gagcaccttg ctgtcgcggt 2785021 aatgacgcgc caccggatat tcgttcataa agccgtatcc gccgtgtatc tgggtggcat 2785081 cgcgggagtt gtccatcgcc gcctccgagg agatcatctt cgcgatcgcc gcctccttct 2785141 tgaagggctt gcccgccaac atctttgcgg cggcatcata gtacgctgtg cgggcaacat 2785201 gggcgcgtgc ctccatccgc gcgatcttga agccgatcgc ctgataagcg ccgatcggct 2785261 ggccaaacga ctgacgctgg ttggcgtact tgacgctctc gtcaacacag ccctgcgccg 2785321 cgccggtggc cagcgctgca atcgcaatcc ggccctcgtc caggatggac aagaagttgg 2785381 catagccgct cccccgggct cccagcaggt tctccctcgg gacccgcgca tcggcaaatg 2785441 tcagtgggtg ggtgtccgag gcgttccagc cgaccttgtt atagaccggt tccacggtga 2785501 atcccggtgt gccgctgggc acgatgatcg tcgaaatctc tttcttggca tccgcagcgg 2785561 ttccggtggt cccggtaacc gcagtgacgg tgaccagcga tgtgatgtcg gtgcccgagt 2785621 tggtgataaa ttgcttggag ccgttgatga tccactcgtc accttcgaga cgcgccgtgg 2785681 tgcgggtgct gcccgcgtcc gatcccgctc ccggctcggt gagaccaaaa ccggcgagcg 2785741 cacggccaga cgtcaagtcg ggcaaccact tctgtttctg ctcctcggta ccgaaccggt 2785801 agatcggcat cgcacccagg cccaccgcgg cctccagcgt gatcgctacc gattggtcaa 2785861 ccttgcccag ctcctcaagt accagcgaca gcgcgaagta gtcgccgccc atgccgccgt 2785921 actcctccgg aaacggcagc ccgaacaggc ccatctctcc catcttggcg acaatttcgt 2785981 atgggaagct gtgttccgca tcgtgtttgg ccgataccgg cgcgaccacg gtgcgcgcaa 2786041 aatcggccac cgtatcccga agatcttggt attccttggg taatatcccc ccagaaatcg 2786101 ttgtagtcgt tgtggtcatg atcctagtcc ttgatcctcg ccagtacctg ttcgactttc 2786161 acctgatcgc caacggacac caacacctgt acccgtcccg aaaccggcgc ctccagcgag 2786221 tgctccatct tcatcgcttc caccaccacc accacatcac ccgcagagat ctgggagccg 2786281 gactcgacct gcacggcgat cacgctgcca ggcatagggc tgacgacctc cgccggccgc 2786341 gcacccacgg cgcggtgaat cttgtgctcc tcggcctcgc gcaggtgcca agtcccgcgc 2786401 tcgtcggcga tccacaggtg ccggtcagcc tctgcccacc gataatcccg gcgcagcccg 2786461 cttatcgtca cgctcatctg ttctcgggtg acctgcacgc tcgcacaatc gatctcacca 2786521 tcgccaacct gaacctgcgc cgactcgggt ggcccccaca ccgaaacggt ctcgctgcgc 2786581 agcggggtgc gcatggcggt gcggaccggt gccatatggc ccccgccgcg ccatccggac 2786641 ggcgcggccc acaggtcgcc ctgtgcgcgc cgggccaggg cccactggcg gtagaggccg 2786701 ccggcagcta gcacgtcgtc aggcgccggc cgcgcagtga aatcggccga tcgctcgtcc 2786761 agtacagcgg tgtccaaatc cccgacccgc acccgctcgt cggcgagcag aaagcgaagg 2786821 aactcgacat tggtctgcac tcccagcacc gcagtccgcg ccagcgcctg gtccagccga 2786881 tccagcgctt cctcgcgatc ggccccgtgc gcaatcacct tggtgagcaa cgggtcgtaa 2786941 tcactgccga ccaccgtgcc gcctagcagt gacgaatcca cccgcacccc ggggccggcg 2787001 ggttcgaaca ccgccagcac ccggccgccg gtgggcagga attcccgcgc gggatcctcc 2787061 gcatacaccc gagcctcgat cgcgtgccca cgcagctcga tgtcgttttg ggcgaagccc 2787121 aacttttcgc ccgcacccac ccgcaactgc cactcgacca ggtccaatcc agtaatcgcc 2787181 tcggtgaccg ggtgttccac ctgcagccgg gtattcatct ccatgaaaaa gaactcgtcg 2787241 gggcgctgcg cggagacgat gaactccacc gtgccggcgc cgacgtagtc cacgcagcgg 2787301 gcggtgttgc aggccgcgac cccgatgcgc tcgcgggtct gcgggtcaag cagtggcgac 2787361 ggcgcctcct cgataacctt ctggtggcgc cgctggaggc tgcactcacg ctcacccaga 2787421 tgcaccacgt tgccgtgagc gtcggcaagc acctgcactt cgatgtgcct gggccgcaac 2787481 acaaaccgct ccaggaatag cgtatcgtcc ccgaacgaag acatggcttc gcgccgggca 2787541 ctcaccagcg cctcaggcag ccgcgccgga tcttgcacta accgcatccc tttgccgccg 2787601 ccgccggccg acggtttgat cagcaccgga tagcccacct cagcggcagc ggtgaccagc 2787661 gcgtcgtccg tcagcccggc gcgcgccaca ccgggcacca ccggaacatc gaaagcggcg 2787721 accgcgttct tggcggcgat cttgtcgccc atcacctcga tcgcgcgcgc cggcggaccc 2787781 aggaacacca cccgggcgcg ttcacacgcc gcagcgaaat cggcattctc ggcaagaaac 2787841 ccgtagcccg gatggatcgc ctgggctccg gtgcgcgccg cagcatcgag caccttgccg 2787901 atatcgaggt agctttcgcg tgctggggcg ggccccagcc gcaccgcagc gtccgcctcc 2787961 aagacgtggc gggcatcgac gtcggggtcg ctgtagaccg cgaccgaccg gatgcctagc 2788021 cggcgcagcg tccgaatcac ccgaaccgcg atctcaccgc ggttggccac tagtacggtg 2788081 tcaaacatcg cctcacatcc ggaagacgcc gtagccaacc tgatccagcg gagcgtgggc 2788141 acacaacgaa agggcaagcc caacaaccgt tctggtgtcc gcagggtcta tgataccgtc 2788201 atcccacagc cgggcagttg aatagtaggg gttaccctgg tcttcgtact gcgctcggat 2788261 gggcgccttg aacgcttcct cctcgtcggg tgaccagggt gtgccggccg cggacagctg 2788321 ctcgccgcgc acggtcgcca acacggacgc ggcctgctca ccgcccatca ccgagatccg 2788381 cgcgttcggc cacatccaca ggaaccgggg cgagtacgcg cgtccgcaca tcgaatagtt 2788441 acccgcacca taggatccgc cgatcaccac ggtcaacttg ggcacccgcg cgcaggccac 2788501 cgcggtgacc atcttggcgc catgcttggc gattccgccg gcctcgtagt cgcggccgac 2788561 catgaagccg gcgatgttct gcaggaacag cagcggaatc ttgcgtttgt cgcacagctc 2788621 gatgaaatgc gctcccttga gcgcggattc gcttaacaac acgccgttgt tggcgacgat 2788681 cccgaccggg tggccgtgga cgcgtgcaaa cgcagtcacc agagtcttgc cgtatttagc 2788741 cttgaactcg ctgaattcgc tgccgtcaac aatccgcacg acgacctcat gaacgtcgta 2788801 agggacccgg ggatccgggg gcaccacatc gtagagctcg gcctgcgggt acttgggctc 2788861 gaccgaacgg cgcacatccc attgggcggg ttcgcacggg ccgaaggtgt ccgcgatcgc 2788921 gcgcacgatc cgcagcgcgt cctcgtcgtc gtcagccaga tggtcggtga caccggacgt 2788981 gcgcgagtgc aagtcgccac cgccaagttc ctcggccgag acgatctcgc cggtggccgc 2789041 cttcaccagt ggcggaccgc cgaggaagat cgtgccctgc tcacggacga tgacggcctc 2789101 gtcactcatc gccggcacat aagcgccacc cgccgtgcag gagccgagaa ccgccgccac 2789161 ctgcggaatg cccttggcgc tcatcgtcgc ctggttgtag aagatccgcc cgaaatgctc 2789221 gcggtcggga aacacctcgt cttggcgggg caggaaggcg ccgccggagt cgaccagata 2789281 gatgcacggc agcatattct gcagcgcgac ctcctgggcg cgcaggtgct tcttgaccgt 2789341 catcgggtag taggtaccgc ccttgaccgt cgcgtcgttg gcgacgatca cgcactggcg 2789401 tccggatacc cggccgatcc cggtgatgat tcccgcgccc ggggattcgt cgctgtacat 2789461 gccgccagcg gccagcggag ccagctcgag gaaagggctg cccgggtcga gcaggcggtc 2789521 cacccgttcg cggggcaaca gcttgccgcg gctgacgtgg cgtttccggg cgcgttcgtt 2789581 gccgcccagg gcggcggcgg cgagcttatt gttcaattcc gccaccagcc ggcggtgctc 2789641 gtcggcgaac gagggggcta ttgctatcga cggggtggtc actgggtcgc caggtcccga 2789701 agcacaaggg gcggttgagt cttcgcgact acctcgtcga ccgacacgcc aggagcggtc 2789761 tggaccaggt gcaggccgtc agcgcagaca tcgatgaccg cgagttcagt gacaatgcgg 2789821 tcgacgcagc ccacaccggt caacggcaat gtgcaccgct ctaggatctt ggggctaccg 2789881 tccttggcgg tgtgctccat catcacgatc accttgcgag cgccgtgtac cagatccatc 2789941 gcgccgccca tgcccttgac catcttgccg gggatcatcc agttggctag gtcaccggtg 2790001 accgaaacct gcatcgcgcc aagcactgcg acatcaaggt ggccgccgcg gatgattccg 2790061 aacgaagtcg acgagctgaa gaatgcggca cccggcagcg tggtgaccgt ctccttgccc 2790121 gcgttgatca aatcggcatc cacgtcctcc cgccgcgggt aggggccgac gccgaggatg 2790181 ccgttctccg agtgcaggac gacatggacg ccgtcgggaa tgtggttggg aatcagggtg 2790241 ggcatgccga tgccaaggtt gacatactga ccgtcttcga actccgcggc cacccgtgcg 2790301 gccatctcgt ctcggctcca gcccggggcg ctcattgccg caccgtctcc ctctcgatct 2790361 tcttggcggg gttgggcaca tgaaccaccc ggtgcacaaa cacgcccggg gtgtgtacgg 2790421 tggcagggtc gatctcaccc ggctcgacca agtgctcgac ctcggcgatc gtgatcctgc 2790481 ctgcggatgc gcactccggg ttgaagttgg ccgcggcgtg gcggtacatc aggttgccgt 2790541 gccggtcccc ctgccaggca tgcaccagtg cgaagtcggt ccggatcccc cgctcgagga 2790601 cataggtgac accatcgaac tcccgagtct ccttggccgg cgacaccacc gccaccccgc 2790661 ccgaggcgtc gtagcgccac ggcaacccgc cgtcggcgac ctgggtaccg acccctgccg 2790721 gtgtatagaa ggccggtatg cccatccctc cggcccgcaa ccgctcggcc agcgtgccct 2790781 gcggggtcag ttccacctcg agctcgcccg cgaggaactg gcgggcgaac tccttgttct 2790841 cccccacgta ggaggagact gtccggcgaa ttcgcttgtg ttgcaacaat agtcccagac 2790901 caacaccgtc gattccgcag ttgttcgaga ctgtttccag gtcggtgaca ccgctatcca 2790961 ccaacgctgc gatcagtgct tcggggatgc cgcaaagccc gaatccacca accgcaagcg 2791021 acgacccgtt ggctatgtct gcgaccgcct ccgcggcggt ggccaccacc ttgtccatac 2791081 cgcagagcct cctagcattt cagttaatta tcattaactg aggtgagaat accattgccc 2791141 ccgcggtgcg tctagggacc tcactgttgg ccgcggaggt attcgagcgc ctgttgtcgc 2791201 atctccactt tgcgtacttt gccggtgacg gtcatcggga actcgtcgac gatccacagg 2791261 taccgcggga tcttgaatcg cgcgatgcgg cccatgcagt actcgcgcag ccgctcgatg 2791321 gtcagttccg gcgcgtcgtt tctcagcttg accaccgcca tgagctcttc gccgtatttg 2791381 gcgtcgggca ccccgatgac gtgaccgtcg acaatatcgg gatgcgtgtg gaggagttcc 2791441 tcgatctccc gcggcgagat gttctcgccg ccccggacga cgaggtcttt gatccggccg 2791501 gcgatccgca cgtacccgga cgggtccatc tcagccagat ctccggtgtg catccagccg 2791561 tcggcgtcga tcacctccgc agtcttctgc gggtcattcc agtacccggc catcaccgaa 2791621 tagcctcgcg tgcagaactc gccgaccacc ccgcgcggga ccgtctcgcc cgtggccgga 2791681 tccaccacct tgatctcaag gtgtggaccc acccgaccga ccgtgccgac ccgtcgatcc 2791741 accgagtcgt cggcgcgcgt ctgcgtggaa accggtgacg tttcggtcat tccatagcag 2791801 atcgagaccc cgggcatatg catgcgtgag atcaccttgc gcatcacctc gaccgggcac 2791861 gcggcgccgg ccataatccc ggtgcgcaga ctgcccagtt cgtagtcggt gaagtccggc 2791921 aggcccagct cggcgatgaa catcgtcggc acgccgtaca agctggtgca tcgctcgtcc 2791981 tgcaccgcgc gcagcgtggc cgcagggtca aagcccggcg ccgggatcac catggccgcc 2792041 ccgtgactgg tggccgccag atttcccatt accatgccga agcagtggta gaagggcacc 2792101 gggatgcaaa tccgatcttg tgcggtgtac ccgagcagct cgcccaccag gtagccgttg 2792161 ttgaggatat tgcggtggct tagcgtgaca cccttcgggt atgccgttgt gccggaggtg 2792221 tattggatgt ttaccggatc actgccgtct agcctcgccg cggtctgctg cagcgcaggc 2792281 agatcgggct cggcacccgc cagcgcgtcc cagcgatcgc tttccagcaa aatcacgtcg 2792341 gccagatcgg ggcatcgcgg cccaacctcg gccagcatcg cggcatagtc cgcatccttg 2792401 aaactcgcta cggcaatcac catcgcgaca ccggactgcc taagcgcata ctccacttcg 2792461 cggacccgat aggcggggtt tatggtcact aggatcgcgc cgatctcagc ggtcgcgtac 2792521 tggacgagca cccactccca ccggttcggt gcccagatgc cgacccgatc gcccgggccg 2792581 atccccgccc gcaccagccc cgtcgccagc cggtgcacgt cagtcagcag ttcgctgtaa 2792641 ttgaaccgtc gccgggccac catgtccacg agtgcttccc gatgtccgta cctggcagcg 2792701 gtcgctgcga ggttggcgcc gatggtcgac tcgagcaatg atggcgcact cggaccgcga 2792761 tcataggaaa gccgattggg gtctacgact tccgcggctg ccacggttcc tccgcctggt 2792821 gcctaccgca tgtctgactc gcgttaacat cgaatagctc gtgctacgtt agtgacgatt 2792881 aaccgaagtg tccagcatga gtcgtgtacg gagaccgtcg tgacagcgtc cgccccggac 2792941 ggtcggcccg gccagcccga ggccacaaat cgtcgcagtc agctgaagtc cgaccgacga 2793001 ttccaactct tggcagccgc cgaacgattg tttgccgaac gaggattcct ggcggtgcga 2793061 ctggaggaca tcggcgccgc cgcgggcgtc agcggtccgg ccatctaccg acacttcccc 2793121 aacaaagagt cgctgctggt ggaattgctg gtcggcgtca gtgcgcgact tcttgccggc 2793181 gcacgcgatg tgacgacccg cagcgctaac ttggccgcgg cactggatgg cctcatcgag 2793241 tttcaccttg acttcgcact cggcgaagca gacctcatcc ggatccagga ccgggaccta 2793301 gcgcacctgc cggccgtcgc tgagcggcag gtgcgtaagg cccagcgaca gtacgtggag 2793361 gtctgggtcg gggtgctgcg cgagctgaac ccaggcctgg ccgaagccga cgcccggctg 2793421 atggcccacg ccgtgttcgg actgctgaac tccaccccgc atagcatgaa agcggccgac 2793481 agcaagccgg cacggacggt gcgtgcacgc gccgtcctac gggcgatgac ggtcgccgcg 2793541 ctatcggccg cggatcgttg tctatagctc gccaggctgc gatgtcgccg ggtacatcag 2793601 cgcacccgca cccagcgcgg gtaccctgca tgccatgagg tggacatgaa cgatccacgt 2793661 cgcccccagc ggtttggtcc ccctctatcc gggtacgggc cgaccggacc gcaggttccc 2793721 cccaatccgc cgaccgccga cccggcttac gccgaccagt cgccgtatgc atccacgtac 2793781 ggcggttacg tttccccgcc gtggtctcca ggagggcccc cgccaaggcc tccccagtgg 2793841 cccccaggcc cccacgaggc cagtccgacc caacagctgc cgcagtactg gcaatacgac 2793901 cagcccccac cgggcggatt tccccccgac gggctgactc ccccgccacc gcaagggccg 2793961 agaacgccgc gctggttgtg gttcgccgcc ggctcagccg tgctgctcgt cgtcgcgttg 2794021 gtcatcgcac tggttatcgc caacggctcg gtcaaaaagc aaaccgcgat cgagccgtta 2794081 ccccccatgc ccgggcctag cccgacacgt ccgaccacga ccacaccgac cccaccctca 2794141 cccagcgccg caccggcacc gacaactacg accggtacgc ctagtgagac ggtcgccggc 2794201 gcgatgcaaa ccgttgtcta cgacgtcacg ggggaaggcc gggcaatcag catcacgtac 2794261 atggatagcg gcaacgtcat acagaccgag ttcaacgtcg ccctgccgtg gcggaaagag 2794321 gtcagcctgt caaagtcgtc cttgcatccc gctagcgtca cgatcgtcaa catcggccac 2794381 aacgtcacct gctcggtcac cgtggccggg gttcaggtac gccagcgcac cggggcgggg 2794441 ttgaccatct gcgacgctcc cagctaggag gattgcgccg tcgtcagcgc accgccgtgc 2794501 cgcgacacct gtacccgcag catgagcagc aggccggttg tcaacacgag gcacacgccg 2794561 ccgagcccgg cacggaccgt gtggaacacg tcgacgaaga ccgaaaacaa ccacggcccc 2794621 agaaacgaca ccgcccggcc ggtcatcgtg tagagcccaa aggccacacc ctccttgccg 2794681 tgctgcgcca tatgcagcag cagagcgcgt gccgacgact gcgccggccc gatgaacaca 2794741 cacaacagca gcccgcacgc ccagaacgcc gttgggcccg acaacgtcag caacgtgagc 2794801 gccgcggcga tgatggcggc cagtgatccg acgatgaccg gtttggaccc gatccggtgg 2794861 tcgacgaacc cacccagcac ggcccccacc gcagccacca cgcttgcggc cgcaccaaag 2794921 atcaggacat cggcctgggt gagcccgtat gcgttgacgc caagtaccgc gccgaaggcg 2794981 aaaatggccg ccagcccgtc gcggaatatc gcgctggcca ccaggaagta gaccaagttg 2795041 cggtcgcgcc gccactccgc gctgatctcc gtccacagct tgcggtagcc gcccagcagg 2795101 ccggtcgaag gatgagacgc cgcaccggaa tcgggtagtc ggtgcgcgac caacaacaat 2795161 ggcaggccca gcaacgccaa ccaggccgcc gcaaccagca tcgccattcg cacgttgagt 2795221 ccgttcgcga cgggtagctg cagcaggccg cgctgcgaac cgctacctga catgaaaccc 2795281 agatagatca ccagcaagag cgcgacgctg ccgacatagc ccgacgccca accgaagccg 2795341 gagatccggc ccgccgtgct gggtgtggac agttggcgca gcatcgcgtt gtacggaacg 2795401 ctggacaaat cgctggacgc cgcggtggcc gcgagcaaaa ccagcccggc ccacaggtag 2795461 cgggggtcgt cgcggatcag gaacattgcg caggtcagcg cgaccgcggt gccggtcagc 2795521 acagacagtg ccacccgacg gcggtgcgga gactccaccc acacgccgac gacgggcgcc 2795581 agcaccccga tggtcaaccc ggcgaccgcc cccgcacgac ccaaccaact cgccggtgag 2795641 gtgccgcccg gcagaccctg acccacggcg ctggtcaggt agacggagaa cacaaaggtt 2795701 gtcacgatcg cgttcagacc ggtggaaccg caatcccaca tggcccacgc caccacccgg 2795761 aagtgcagga gggtgcccgc gcgcgacccc gggttattca tgtccggcac tttattgctt 2795821 ttggcagcga cccgctgcgc ccggctccgc cgcgctcgcg atcgctacgt gtctacgatt 2795881 ggcgcatgcc gatacccgcg cccagccccg acgcacgtgc cgttgtcacc ggggcttcgc 2795941 agaacatcgg cgcggcgctg gccaccgaac tggccgcacg cgggcaccac ctgatcgtca 2796001 ccgcacgacg cgaggacgtg ttgaccgagt tggctgcccg gctggccgac aagtaccgcg 2796061 tcacggtcga cgtgcgaccg gccgatctgg ccgatccgca agaacgatcg aaactggccg 2796121 acgagctggc tgcccggccc atctcgatcc tgtgcgccaa cgcgggtacc gcgacattcg 2796181 gcccgatcgc atcgctcgat cttgccggcg aaaagacgca ggtgcagttg aatgccgtgg 2796241 cggtgcacga ccttacgttg gcggtgttgc cgggcatgat cgagcgcaag gccggcggca 2796301 tcttgatttc tggttcggcg gccggcaatt caccgattcc ctacaacgcc acctatgccg 2796361 cgaccaaggc cttcgtgaac accttcagcg aatctctgcg cggtgagcta cgcggctccg 2796421 gcgtgcacgt cacggtgctg gccccgggcc cggttcgcac cgagctaccg gatgcctccg 2796481 aagcgtcact ggtcgagaag ctggtgccgg acttcctgtg gatctcgacg gagcacaccg 2796541 cccgggtatc gctgaatgcc ttggagcgca acaagatgcg cgtcgttccg ggtctgacgt 2796601 caaaggcgat gtcggtggcc agccaatacg ctccgcgcgc catcgtggcg ccaatcgtgg 2796661 gtgcctttta caaaaggctt gggggcagct aggcatcact tccggcggcg gcgcccggtg 2796721 ccgaagatgc tgcgggtgat ctcgcgtgcg gtggtgttga ggacgctctt gacggtcgga 2796781 ttcttgagta tctcctccca caccgcgggg ccctgcggct ccaccggagc gggcatcggc 2796841 ggaacttcaa aatcgtccgg ccagggcagc ggatcgtact gcccccttgg ggctggggcc 2796901 tcctgggccg gggcctcttg cgccggcgcg agtttggcgc tcagtatctc gtgggctgac 2796961 gggcggtcga tggtctggcc atatacggcc tgcaacgagc ttgcctgggc cgcggcgcca 2797021 atcgcttcgg ctccgatcgc ggccatcagc gaccgtggcg ctcgcatcct ggtccaggcg 2797081 accggcgtcg gtgcgccctt ctccgatagc acggtgacga cggcctcgcc ggtgcccagc 2797141 gacgtcagcg cggactccaa gtcgtagaca tcggttttcg ggtaggtgcg cacggtcttg 2797201 cgcagcgcct tgtggtcgtc gggggtaaac gcgcgcagcg cgtgctgaat tcgggctccc 2797261 agctgggaga ggacatcgtt gggtagatcc gtgggcagct gggtgcagaa gaacacccca 2797321 acacccttgg aacggatcag cttcacggtc tgctcgacct gctcgagaaa ggccttcgag 2797381 gcatcggtga acaacaggtg cgcctcgtcg aaaaagaaca ccagtttggg cttgtccagg 2797441 tcacccacct cgggcaggaa ggtaaacagg tccgccagca cccacatcag aaaagtggag 2797501 aacatcgccg ggcgcaacgc ctggctcccg aactccagca acgagatgat gccccgaccc 2797561 tggctgtcga cgcgcagcag gtcctcgggc ctcagttcgg gctcaccgaa gaatgtgtcg 2797621 gcaccttcgg cttccaggtt gaccaaagcc cgcaggatga ccccggccgt cgtgggcgac 2797681 accgccccaa gggatttcag ctctaccttg ccctcatcac tggtcagatg ggtaatgacc 2797741 gcccgcagat ccttcaggtc cagcagcgga agtcctcgtt ggtcggccca gtgaaagatc 2797801 aggcccagtg tagattcctg ggtagcgttg agccccaaca cctttgccag cagaatcggg 2797861 ccgaagctgg agatggtcgc acgcaccgga accccgacgc cactggcacc cagcgacagg 2797921 aactccaccg ggaaggccgt cggcacccag tcgtcaccgg tgtctttcgc acgggcggcc 2797981 gtcttgtcgg cggcctcccc cgggcgggcc agaccggaca aatcgccctt cacgtcggcc 2798041 atcagcactg ccacccccgc cgcactgagc tgttcggcga tcagctgcag cgtcttggtc 2798101 ttgccggttc cggtggcccc ggcgaccaga ccgtgccggt tgacggtggc cagcggaatg 2798161 cgaatctgcg cgctcgggtc gggttcgccg tcgacgacga cggtgcccaa ctgcagggcc 2798221 tggccttcga cggtgtaacc cgccgcgatc cgctgcgcgg gcccgccagg tccaccggcc 2798281 gccgattcgg tgcccatagc tggatcacac tacttgcccg ggggagacag ccgcgacggc 2798341 tcgcatgcgc ctacgctgag cgctgtgcaa gacgaactgg tgtggatcga ctgcgagatg 2798401 accgggctcg atctgggttc ggacaagctg atcgagatag ccgccctggt caccgatgcc 2798461 gatctgaaca ttctcggcga cggggtggac gtggtgatgc acgccgacga cgccgcgctg 2798521 tcgggcatga tcgacgtggt cgccgagatg cactcgcggt cggggctgat cgacgaggtg 2798581 aaggcatcca cggtcgacct agcgaccgcc gaggccatgg tgctcgacta catcaacgag 2798641 cacgtcaagc agcccaagac cgccccactg gccggcaact cgatcgccac cgaccgcgcg 2798701 ttcatcgccc gcgacatgcc cacgctggac tcgtttctgc actaccgaat gatcgacgtc 2798761 agctcgatca aggaactgtg ccggcgctgg tatccgcgga tctacttcgg ccagccgccc 2798821 aaggggctga cgcaccgggc gctggccgac atccacgaat ccatccgcga actgcggttc 2798881 taccgccgca ccgcgttcgt gccccagccc ggcccttcta ccagcgaaat cgcggccgtc 2798941 gtcgccgagc tttccgacgg ggcgggcgcg caggaagaaa cagattcggc cgaggcgccc 2799001 cagagcggtt aatatcgacg tcgccgctca ttagcccccg cgggggcggc cggcggccat 2799061 ggtgagtgta gttcagttgg tagagcacca ggttgtgatc ctgggtgtcg cgggttcgag 2799121 tcccgtcact caccccaaca gggcggcagg gtgtttatgg ccctgggccc tttgctgtcc 2799181 ccgccgaggg cttgcacctg caaccttcgt gtctatgatc tggtcccgtg gcgaattcga 2799241 ccactcgccg cgactgcacc tggccgcccg ctccaacacc cgccggtcaa actgccatcg 2799301 gacagcatgt tccccgtcgc cagggccttg gcaggtgtcg gtttgcccgg tctatttgcc 2799361 tgccgcgcaa ctatcgcacc tccggcgtgg cttgttcgga ctcactcggt gtttcgtgcc 2799421 atggttgatg tgcaggacgt ttgagacccc aaccagctag accaggatga gcgcttctgc 2799481 gtcagccgac aaggtcgtat gcgagtgctg cgagctctgt gttcctaaac agctcgcgtc 2799541 agcgattcgc aacccatacg gactcgtccg tgggtggcgc tgtcgcatct gtaacgagca 2799601 ccaaggccag ccggtcaaga tggcgcaaga ccacgaagag gaggtccgca tccgttgggg 2799661 cgagacggtg gacgaactcc acgctgcgct ggaccgcgcc gggccaaggc cagggacgtg 2799721 gtgtacgagt gaaggttcct cgcgtgatcc ttcgggtggc agtctaggtg gtcagtgctg 2799781 gggtgttggt ggtttgctgc ttggcgggtt cttcggtgct ggtcagtgct gctcgggctc 2799841 gggtgaggac ctcgaggccc aggtagcgcc gtccttcgat ccattcgtcg tgttgttcgg 2799901 cgaggacggc tccgacgagg cggatgatcg aggcgcggtc ggggaagatg cccacgacgt 2799961 cggttcggcg tcgtacctct cggttgaggc gttcctgggg gttgttggac cagatttggc 2800021 gccagatctg cttggggaag gcggtgaacg ccagcaggtc ggtgcgggcg gtgtcgaggt 2800081 gctcggccac cgcggggagt ttgtcggtca gagcgtcgag tacccgatca tattgggcaa 2800141 caactgattc ggcgtcgggc tggtcgtaga tggagtgcag cagggtgcgc acccacggcc 2800201 aggagggctt cggggtggct gccatcagat tggctgcgta gtgggttctg cagcgctgcc 2800261 aggccgctgc gggcagggtg gcgccgatcg cggccaccag gccggcgtgg gcgtcgctgg 2800321 tgaccagcgc gaccccggac aggccgcggg cgaccaggtc gcggaagaac gccagccagc 2800381 cggccccgtc ctcggcggag gtgacctgga tgcccaggat ctctcggtag ccctcggcgt 2800441 tgacgccggt ggcgatcaag gtgtgcactc cgacgacgcg gcctgcctcg cgcaccttga 2800501 gcaccagggc gtcggcggcg aggaaggtat acgggccggc atcgagcggg cgggtccgaa 2800561 acgcctctac ggcttcgtcg agctctttgg ccatgatcga cacttgcgac ttggaaagct 2800621 ttgtcacacc aagtgtttcg accaggcgct ccatccggcg agtggatact cccagcaggt 2800681 agcaggtcgc caccacgctg gtcagtgcgc gttcagctcg cttgcggcgc tgcagcagcc 2800741 agtccgggaa atagctgccc tggcgcagct tggggatcgc gacgtcgatg gttgcggcac 2800801 gggtgtcgaa atcacggtgg cggtagccgt tgcgctgatt ggaccgctca tcgctgcgtt 2800861 cgcggtagcc cgccccgcac agggcgtcgg cttcagcccc catcaaggcg gcgatgaacg 2800921 tcgagagcag cccgcgcagc agatccgggc tcgcctgtgc gagttggtca gccagaagct 2800981 gctcggtgtc gataagatga gaagaggtca ttgcgtcatt tccttcgatt gacttttgct 2801041 ggtcgtttcg aaggatcacg cgatgaccgc ccactactgg gctacgacac gcccaccggc 2801101 cttacctgcc cgtacaccac acccctggac gtaactccgc gccgatgact acaaggcaaa 2801161 gatgctggct gcgtttaggt ctcacgatgc cgtgttaaga gagttcgaaa agctcggccg 2801221 ctatcatcag tcaaccgggc acggctgcct ctgcggcaaa cgaaactgtg caacgctgtc 2801281 catcatcgat agcaaccaga tatatggcca cattgaccga atgaatcgcc gcgacgagct 2801341 tggctaagcc acaacagaga gaaacaaggt ggacgacatc gcagcattca agctcgacag 2801401 cctgccggac ataaccttca cggtcacgcg ggccataagt tcgggtgggg aaaatccggc 2801461 ggggtttctc aatttcgcgg cgcgccgaga gcaaccggag atcctgggtg gtggaggccg 2801521 tcctggaccg gtgggcccgg aagcggtcga tactccacgt attcgcggcg ggaaggtgcc 2801581 gttcgtcttc cggacgctac cgggttacac cttctacgcc agccaaatcg agccgagagt 2801641 gggcgacccg gaagggccca cactcctggc tggattcggc aatatccctg agacttcgca 2801701 gcggtcgccg ggatggatcc gcatcacctg caaggggcca gacgacgatg aggagctgga 2801761 attctttgga ttcgccgggc cagagtccta accaggcgat gaacgaagga tcggcgacgg 2801821 ctacgaacct ggataggcaa gaatggcgca ccgaagcgtc actcgacgtc ggccggccgg 2801881 agaacgcacc acgaaacgaa acacttgtga ggaccaagat tctccgatct tcgggtagca 2801941 cccgagagca tgtcgttagg cctgtcggca tgggcgccgg caaggtcctc cagccgggtg 2802001 atgggcgtcg cagagtacag acgtggctgc tgtccgcagg ctgaagcgga tgaagtgaca 2802061 gcccagcggc gcggccagaa gctctcagaa agtccatccc tgcgcctcga tatagcccat 2802121 cagagttagc cacggcacgc caagagcgtc gcagacatcg gggatgcgcg gcttttcgat 2802181 attgccgctc gcagtctcct gggtaaccac cgtggcgttg ttcaccatcg cgagcgcgat 2802241 gacgaacggg tcggcggcgc ttcgcctgcc accctgccgg accatgttcg ggtgcaaccg 2802301 caagatgtgc cgcgccgcct gctggatctg ttcatccaga ggacagaaca agccagtttg 2802361 cccgtccgcc caccgcttcg cgtcatcatc acgcctggcg agttcgcgct gaacctcatc 2802421 gaccgacctg atctgaccgg cgctgatcgc atcctcaacc cggccccaca gactgcgaaa 2802481 caccgctggc cgaaacagat cacgccgtcc gttcaggatg gcgctggtat cgaaggaata 2802541 gagcacagcg gttagaccac gctccgcagt tcggctgact cagccaactt cggaatctgg 2802601 ctgaccttgg cgtcgaggta gatcgcagcg gtgttgctgt cgatgacgcg gcggcggtgg 2802661 gcgtcggtca ccgcccgcac gtagcccttg ccgaggtctc ggacggtatt gcggtaccag 2802721 ttgccgcccc cagccgatcg agcccgttcg gcctcgtcct cgtgagccgc gatgaactcg 2802781 gcgcggcgct gtcggtagac ctcgaccggc acgattccaa gcgtgcttag ccgccgcagg 2802841 aacgcctcgg cactcacgcc aaaatgcgcc gcgaccggcc gcagcgattc gtaatcccac 2802901 gaagacggag tctcgctgcg aacgatgacc tccggccgcg ctcgcaccac gtcggcaggc 2802961 atcagcacag cggcggcgat cgcgttgcat cgagcctcca gcgatcggtc ctgggtgctc 2803021 ggatgagcat cggcgatcac gtcacacaag ccctcggtgt gcagcaccac gtgcacgaac 2803081 tcatgcagca gcgagaacag gcgagggcgg gggtggtcgc tgccattgag cacgatcacc 2803141 ggcaattcgt cgaaatacag acacataccg cgcatctcgt cgatagcgac cttgccgccg 2803201 cgggtcgcga gcaccagaac gccggacgtt tcgatggccg acacccaggc gttcagatgc 2803261 tcgtaagggt caaccgaggc cacggggata ggcaacgggc tgacctcgat caaggccttg 2803321 cggattcgtg ccgcgatatc cgcgtcggcc tcgtcgccgg ataggggcaa acgccaggcg 2803381 cccggtatct cccggtcctc ggcgtcggcc agctctagcg cgaagtcgcg ttgcgtgtgt 2803441 gcgcgacgga actcctcgtg aagccccggc gtccattgac ccgacgcggc accgtccaat 2803501 cgtcggaagt cgcgtaaggt gtcaaacccc tcgggcggct cggacaggaa gaacaccgcc 2803561 agcgagcgct tgtagacctc ggcggccttg cgcagctgcg cgatggttgg cacaacctcg 2803621 cccacctccc aagccgcgac gcgatcatca ggcaggccga gtttgcgggc cgcggctacc 2803681 tcggtcaggc cacacgactc gcgagcccaa cggagcaccg agctctccac cgaagcggga 2803741 atcgaccgca tggcaatgat gatgcaccac cccacccaca ttggatggcc gatacccacg 2803801 cttggttccc gaccagccga ttaaccgctc ccccgcaacc tggcgagacg gtactcgccg 2803861 cgttcggcgt ctgggacggt gtgccgtgag accggctgcg gtgtaacgcc ttacgaacta 2803921 gtgagcaggg tgcaacggga cggccgccca ctcgtcctgt ccagcccaac ggacgtatag 2803981 ctgatttgga aggggatggc cccaagccgc tatcaagacc atgttgagcc cctctccggg 2804041 gcgaatcacc gtcttcttcg ggacatttcg agtgatcgca tcgattcgcg agaggtcaat 2804101 ctcgacatct tctgcgatat cgtcgccgat gttgcgcaac acaaagcgga ttttgtctgg 2804161 gttctcgaca cgccaccgga cgttaggtgc cttaccggac ctgccgaccg ccggtcccac 2804221 cgcccacgag tacgcgaact tggcagtccc ggtatgcggc cgaccgggct ttcgctccca 2804281 ggtctccgca aacctgcgca ccgctgccgc atcccacacc gcgcctccac gcaaatctgc 2804341 caacggagcg ggaaaccctg ctgtcgacct caattggtgc accctctgac gcgaaacccc 2804401 caactcatcc gcgatctcag ccgcagacat caactcgggc gttgtgaacg cctcagcgcg 2804461 cagacgatgc tctggctcgc taatgatctg cacagcaatg ggactcttgg cttgaactac 2804521 cggcataacc tcgccagcca tcttggcgag cgcgtcgaac acactccaat cgccgggcgc 2804581 atagaccgtg acgtcaatgc cgtgtcctgg gacccgagat accagtgcgt cgaagccctc 2804641 gagctgcgtc tcccaggcgt ccatggtctc catcgaaggg tcagcatcaa acgtgaaggt 2804701 gacgacccag tcggctgtca ctgtgcgcct tccttcctgt gctgtgcccg ccgttccttc 2804761 ttgctcggcg gtggccacgt caggcccgct ttcttcaacg cgcccaatag gtctcgcatc 2804821 cggcggtact cgttgctagg tgttgccgga aaccgagcaa tatagacgcc ctgggggttg 2804881 tagaagcggg tgtagccgct ggcgtcatcc tcaaccgtcc attgttgcga ttgcgcccac 2804941 ttcgcgatct tgatgattgc gctgttcaca tcgtctcccc aacacttgcg atgtgtcaag 2805001 agtaatggca agacgcgaca tcgtaaaggt tttagccgga ctcattcgaa tatttgagcg 2805061 atgtagccag tgagtgggtg ctccgatgat cacggcttcg cgcgagctcg ccccggctgg 2805121 cctcatgatc gccgaccggc tgggcttcac ccggtctcag tggttctccc agtcgcgaag 2805181 gaacacccga gtgtcgtcat ggtccgcgcg gttgggcact gcggccatcg gatgtcatcg 2805241 tcgtacaacg aaccatgcgg tcgttgcagg gcgtgtatca ggcgctgttg gttgtctggc 2805301 gttcctcgcg gcgcgcttac gccttggcgt taccggcgcg ccactggtcc cacggaatgt 2805361 tccaatcgcc gagcccgtcg atccccggca gggtgccacc cacggtattg accacctcga 2805421 cgatgtcgcc gcgcttgaca tggtcgtaga accactgcgc gttgctcggg ctgacgttca 2805481 ggcagccatg gctggtgttg gtgtggccct gagcccccac cgaccacggc gctgagtgca 2805541 cgaagacacc gctgtaggag atctgggtgg cccagtcgac atcggtgcga tatccgttgg 2805601 gcgagttgac gggtacgccg taggtggacg agtccatgat gatgtgcttg taccgcgagc 2805661 cgacgatgta tatgccgttg gccgtcgggg tgctgtcctt gcccatcgac gtcggcatgg 2805721 actttacgac ctcgccattc acccgcacgg tcagtatctt ggtgttgtcg tcggcggtcg 2805781 cgatcacctc gtcgccgatg gtgaagtgcg tctgcacgtt gtcctcgccg aacattccct 2805841 cgcccaagtc gacgccgtag gtgttgaccg ccacatcaac ggccgtacct ggcttccaga 2805901 aatgctctgg gcgccaacgc acttcacggt tattcagcca gtagaacgcg ccctccacgg 2805961 gcgggttggt ggtgatcttg atggccttct cggccgcgcc ccggtcagcg atgttctcgt 2806021 cgaatcggat cgccaccggc tcgccgacac ccacgacctc cccatcaccg ggcatgacgt 2806081 agggcatggt caggtgcgcg ggggaactgg tctggaaggt cagctggcgg gtcgccgcgc 2806141 cacccagtcc aagcgccgtc gcgttcagcg tgtagcgcct gttgtagccg agctgctcag 2806201 tggtcgacca gcgcagtccg tcggggctga gtcgaccggc caccggcctg ccgttgtcgt 2806261 tgaccatggt gacggccgcc agcacaccgt cggcggcggt caccgacacc ggtgcatcca 2806321 cggtgacgcc gacggcgccg tcggtgaccg acgcggtgag cttgggcacc agcagatcgg 2806381 cgaacggcgt gcccttgtcc gcgatgacct tgatcggtgc gggtccgcgg ccgctgccgc 2806441 atgcgacggc accgatcatc acggcggtca tcatcagcgc ggttaaccag gctctccgaa 2806501 ccctggtcct acccgcctga gctgcaatcc ccacctttgg catgccttcc ctcacctccc 2806561 ccactgcgtc gtgaccgagc tagactcggc tgtagtctag gttctgactg gccgccacgc 2806621 tgcgatgctg ataccaagtt cagtgtgaga tttcacgcga gagcgcaagg cctgttaatg 2806681 tgccttggct aggtaatcga ggcgccgtta gctcagttgg tagagcagct gactcttaat 2806741 cagcgggtcc ggggttcgaa accctgacgg cgcacaggtc aacgcgttat ttcggatgca 2806801 ccagccgcag ctgtcccgtt gggcgacgat ttccgtattc ggaaggtgca cgccggttac 2806861 cggatttggg cagcggatcg gatcggagcc acggggatag ctcgacgaga cagccgggga 2806921 agccgcagaa aattgggttg taggcgcgtg caatagctac gctgcatgtg gacagcgggg 2806981 aagaggttag ttgtgtcgcg tctgatcgtg gctccggact ggctggcgtc agcagcggcg 2807041 gaggtgcaaa gcatcggctc ggcgctgagc gcggcgaacg ccgcggccgc ggcccccacc 2807101 accctattgg tggccgccgc cgaagacgag gtatccgcag cggccgcagc gctattcgcc 2807161 aactacggcc gggagtatca gacgctgagt gtgcggttcg cctcgcttga tcagcagttc 2807221 gcgcaagcac tgaactcggc ggcagcgtcg tatcagacgg ccgaagccac gggtgcgtcg 2807281 ctcgtgcaga ccgcgacaca aggtgtactg ggtgtgatca atgcgcccac cgagttcatg 2807341 ttcggacgct cgctgatcgg cgacggagct gacggcacgg ctgccagccc catcggcgag 2807401 cccggcggaa tcctgtacgg cgacggcgga aacggctact cccagaccac gcccggagct 2807461 gtcggcggag ccggcgggtc ggccggattt atcggtaacg gtggcgccgc cggcgccggc 2807521 gggcccggcg ccggcggcgg gactggaggc ctcggcggct ggttatgggg caacaacggc 2807581 gccgctggca ccggcgaccc agttaacgtt gccgtccccc tgcgcgtgga aaacaacttt 2807641 ccgctggtga acctcttggt caaccgcggg ccaactgtcc ccatactgct ggacacggga 2807701 tcctcgagtc tcgtcatccc attctggaaa atcgggtggc agaacctggg cttgcccacc 2807761 gggttcgatg tcgttcacta cggcaatggc gtgagcatcg tctacgccga cgtgcccacg 2807821 acggtcgatt tcggtggcgg cgccgctacc acaccgacct ccgtccatgt cggtatcctg 2807881 ccgtacccgc gaaaccttga cagcctggtc ctcatcgctt ccggcggcgc tttcggaccc 2807941 aacggaaacg gcatactggg catcgggccg aatgtggggt cgtatgccgt cagcgggccc 2808001 ggcaacgttg tcacgaccga tttgccgggc caactcaacg aaggcaccct catcgacatt 2808061 cccggcggct acatgcagtt cggccccaac acgggcactc caatcacctc cgtgaccggg 2808121 gcaccgatca ccgtgctgaa cgttcagatc ggcggctacg accccaacgg gggctactgg 2808181 tcactcccct cgattttcga ttcgggcggc aaccacggaa cgcttccggc ggtgattctc 2808241 ggcacgggcc agacaaccgg ttacgccccg ccgggcacgg ttatctcaat ctcaatacat 2808301 gacaaccaga cgctgctgta tcagtacacg acaaccgcga gcaacagccc agtggtcacg 2808361 gcagaccccc gactcaacac cggtctaacc ccgttcctgc tgggaccggt atatatctcg 2808421 aacaacccta gcggtgtcgg gacggtggtg ttcaattacc cgccaccgta gctttccgcc 2808481 gggtccagaa ccgccgcgcc ataagggcgt cacgttcgtc cagaacctcg gctaagtgcg 2808541 gagtgcgcaa tcatggtgca ctgcaatggg tttcccatcg gtaactccgg gttggtcagc 2808601 gattcctgat cttgtggatg accacgacga cgaccacaga cccgatcccg accagtgaca 2808661 cggtcacgat gggcttcctg aggaaggcga tcacccgagt ttttgcgtcg tcggcgaggc 2808721 ggcgggggtt ggcgcgctcg gcgagggaat cgatggtcgc cgccagttgg tcgcgggttt 2808781 ggtcgatctc ctgcttgatg gtattgggat cgcggtccac cacgtgctgt cctccaagtt 2808841 ctccagtcgc ccactgccgg cctgcgtcgc ccgccgaact accctagatc agtgaccaaa 2808901 accacgcgtc tgacccccgg agacaaagcc cctgccttca ccctgcccga tgccgacggc 2808961 aacaacgtgt cgctggccga ctaccgagga cgccgcgtca tcgtgtactt ctacccggcg 2809021 gcctcgacac cgggatgcac caagcaggct tgtgattttc gcgacaatct gggcgatttc 2809081 accactgccg gcctcaacgt cgtcggtatc tcccccgaca agccggagaa gctcgctacg 2809141 ttccgcgatg cccagggcct gacgtttccg ctgctgtctg atcccgaccg cgaggtgttg 2809201 acggcctggg gtgcctacgg ggagaagcag atgtatggca agacggtgca ggggatgatc 2809261 cggtccacct tcgtcgtcga tgaagacgga aagatcgtcg tcgcgcagta caacgtcaag 2809321 gccaccggcc acgtcgctaa gcttcggcgc gacctgtcgg tatagccgcg agcttggcca 2809381 gcagcagcgc ttcggcggtc gccgcgcgtt ccagcacacc cagatgcagg ctttcattga 2809441 cactgtgcgc ctgcgttccg gggtcttcta ccccggtgac aaggatggtc gcctgcggga 2809501 acgcggcggc gaactcggcg atgaacggga tcgacccgcc cattcccata tcgatcggat 2809561 cggcacccca cgcctgccga aacgccgacc gcgccgcatc atagacaggg ccgctcgcct 2809621 cgatggcgta gggctgtccg acctcgccgc gcgtgacagt gacctgggcg ccccaggggg 2809681 cgtgccgccg cagatgggcc tccaccgcgt ccaggtgcgc cgtggcatcg cctccaggcg 2809741 ccacccgaat actgatcttg gcccgggccc gcgggatcag cgtattggac gctgccgcaa 2809801 cggatgtggt gtcgatgccg attacggtga tcgccggctt cgcccagagc cgctgcggca 2809861 ccgagcccgt gccgatttcc gatactccgt ccagtagacc cgactcagcg cgtacccgtc 2809921 cagccgggta atccacacgc gccgcggtgc tttcgtgcat gcccgccacg gccacgttgc 2809981 cgtcgtcgtc gtgcaggctg gccaacagcc gcactagcac ggtcagcgcg tcgggaacga 2810041 cgccgcccca caacccggag tgcagcccgt ggtcgagggt ggcgacctcg acgacgcagt 2810101 cggccattcc gcgtagcgac accgtcaaag ccgggatgtc ggtgctccaa ttgtccgagt 2810161 cggcgatgac gatcacgtcg gctgccagcg cgtcacggtg ggcggcgagc aaccggccca 2810221 gtgacggcga cccggattct tcttcaccct cgacaaagac cgtgacgccc accggcggtc 2810281 tgccgccgtg tgcccagaat gcggccacat gcgtggcgat acctgccttg tcatcggcgg 2810341 tgccccgccc gtagagccgc ccaccacgct cggtcggctc gaacggcggc gacacccatt 2810401 gcccgcggtc accctcgggc tggacgtcgt ggtgggcata gagcagcacc gtcggcgccc 2810461 ccggcggcgc cgggtaccgc gcgatcaccg ccggcgcacc gcgctcgctg acaatccgca 2810521 cgtcgtcaaa accggcctgc gacaacaggt ctgccaccgc acgcgcgctg cggtgaacct 2810581 cgtcgcgccg atctgggtcg gcccacaccg attcgatgcg gaccagctcc tcgagatcac 2810641 accgcaccga cggcaacacc tcacggacgc gctcaaccag ctcgcgagca gacgcagagt 2810701 cgcatgaaaa tccggatttc gatgcgattc tgcgtctgct cgcgctcacg gggcctccag 2810761 gatggcgacc gcggccgcgg tatccccttc gtgggtcagc gacacatgga tcgtcacgtc 2810821 ggccaaatac tcagcgatgg ccccggtcag cctgacccgc ggcctgcccc acatatcggt 2810881 gaccacctcg atatcgcggt ggatgtcctc cggcaacacc ggccgctgcg cgaaccgcga 2810941 tccggaccag gccttgatca ccgcctcctt cgcggcccag cgggccgcca ggtgccgggc 2811001 cgccgacgaa ctcttgtccg aggcgtcccg gcgctcaccc ggggtgaagg tctcggcgaa 2811061 caccgttccg ggctggtcga cctgctcggc gaaatcggga atggagacca ggtcgatccc 2811121 cacaccgacg atgcccatgg gcggccacgt taatcgatgg cccagtccgg cgacgatgcg 2811181 gtccgcgttg ggggcacctc ccgcttgcgg gggacggacc gaagagatgc cgggcagtca 2811241 ggccaaggag cacgcggcga gcgtgtatcc atggcggcga cacgccgaac accgtcgccc 2811301 tgagcgcacg ttcggcgccc aacggcaggg tcagccgata tacgcctcgc cgtcacccag 2811361 ccgggccgcc ggattcagca gcatcgacgc ctcctgcggc cgctcgggcg cgtggtggtc 2811421 gaagcgacgg tcaccgggcc gctggtacat cggcgcacca ccggcaatcg ccgaggccag 2811481 ccggcgctga ccggccagca ggcgggcgtc ggcacgccgc tggtagtccg cgcgctgtgc 2811541 gggatccagc gaggcgatga acgcctgcgg atgcaccaac gcgaccaggc ccgacacatg 2811601 gccgaacccg aggctggtca gcatgccggc cttgagtggg aacttgccgc cgagccgcaa 2811661 cgtgtcacgc acccacacga aatgcgcgga gccggccagc tcgtcgtcga cgcagtcgag 2811721 gctgcggttg ggtgggatca ccccatcccg caatatctgg cagagcccca tcatctggaa 2811781 gaccgccgcg ccgcccttgg cgtggccggt caggctcttc tgcgacacca cgaacagcgg 2811841 ggcgccctcg gaacggccca gggcgtcggc gagccgttca tgcaactcgg tctcgttggg 2811901 atcgttggcc agcgtcgagg tgtcgtgctt ggagatgacc gccacgtcgt cggcggccac 2811961 gcccagcttg gccagcgccc gcgccagcgg tgaatccttg ccgccgcggc ccgcccccag 2812021 cgcgcccagg cccggggccg ggatcgaggt gtgcacgccg tcgccgaacg actgcgcgaa 2812081 cgccaccacc gccagcaccg gcagccccat ccgcagcgcc aggtccccgc gggccaacag 2812141 gatcgtcccg ccgccttggg cttcgacgaa gcccagacgg cggcggtcgt tgggccggga 2812201 aaacttcgag tcgtggatgc cgcggccgcg catcatggac gtgtcggcgg tggcggccat 2812261 gtcaccgaat ccgatgatgc cctccagcgt caggtcatcc aggccgccgg ccaccaccag 2812321 ttgagccttg cccaaccgga tcttgtcgac accttcctcg accgacaccg cggcggtggc 2812381 gcacgcggct accgggtgga tcatcgcacc gtagctaccg acgtaggact gaaccacgtg 2812441 cgcggcaatg atattcggca agacttcctg gaagatgtcg ttcggcttgt tgcggcccaa 2812501 cagattgccg tggtacatcg tctgcatcga cgtgccgccg cccatgccgg tgccctgggt 2812561 gttggccacc aaactcgggt gcacgtaacg catcacctcg gccgggctga aaccggacga 2812621 caggaacgcg tcgacggtcg ccaccatgtt ccataccgcc aaccggtcga tggaaccggc 2812681 catgtctgcg ctgatgcccc acaccgtcgg gtcgaacccg gtcgggatct ggccgccgac 2812741 gacgcgggac agcttggtct ttcgcggcac ccggatctcg gtgccggcct tgcggatgac 2812801 ctgccagtcg gtggagtcgg gcaccggccg gatgaccgtg tgctcgggat cgaactcgac 2812861 gaaggcgcgc gcatcggcct ccgaggacac cacgaacgcg aagtccttct ccaggaacac 2812921 cgacaccagc agcggcgagg cgtggtcggg gtcgatcgcg ccgtcatcaa cgaattcgcg 2812981 aatgccgacg cgctgcacca cggcgtcgtg gtagcgctgc accaactcgg attcgtcgac 2813041 catttcgccg gattcggtgt cgtaccaacc gggttgcggg tcgtcctccc agcggatcaa 2813101 cccagtggtc caggccagct ccagcacgcc ggccgccgac agctcgtttt cgacctccat 2813161 ctcgaaccgg gtgcgtgacg agccgtacgg gccgatttcg gcgccgccga cgatcaccac 2813221 caggtcggcc gggtcgacat cgaggtcgtc ccattgcggc ggcggtgcgg gggtgaaacc 2813281 ccggggcggc gacggcagcg cggcgatggc gccaggggcc tcggcgtcct cgtcgacggc 2813341 cgccgctgcc gacatctgct cgcgcgcctt ggccgccagc tcggccatgt cgaggttggc 2813401 ctcggccagg cccccggtca ggtcggcctt gatcggcgaa cgcgccgcag ccaccttgga 2813461 ttccgcatca cacaggtcga gcagcagcgc cgccatctcg tcggtcgagt aggtggtgac 2813521 cccggcctct tcgacggcgg ccacgatggc atcgttgtgg cccatcagcc cggtgccgcg 2813581 ggtccagccg atgagcgcgt gcgccaggct gacccgtgcc gcccaggacg actcggcgtg 2813641 ccagcggctc accacggcat ccagcgcgga cttggcttcg ccgtaggcgc cgtcgccgcc 2813701 gaacatgcca cggttgggcg agccgggcag caccacgtgc agccgcgacg cgatgtcgcg 2813761 ttcggcgccg atcgtcgaca ggccgccgat cagccgttgc acggcccaca gcagcacttt 2813821 catctccatc tcggcgcgcg aaccggcctc cgacaggtcc ccgaccacgc gtggcgccgc 2813881 gaacgggaac agcagcgtcg gggtctgcgc gtctttgatg tgaatcgact gcggcccaag 2813941 gctttcggtc tgttcggtgc cgatccattc gaccagggcg tcgacgtcgg agtaggacgc 2814001 catgttcgcc gcgaccagcc acagcgccgc gccgtaacgg gcgtggtcgc gatacagcgt 2814061 gcggtagaac gccagccgct cctcgtcgag cttggaggtg gtcgcgatga cggtggctcc 2814121 gccgtcgagc agccgagcca ccaccgacgc ggcgatcgaa cccttcgaag cgccggtcac 2814181 cacggcaact tcgccgccgt agcggccggg ttcggggttc tcggcgccgg cggcgatgcg 2814241 gccgtacagc gatgcatgga tctgccggcc cgcggccagc gacttacctt gccaccaggt 2814301 agcctgggtc gccacgacgt ggccggcacc ctcgaagcgc tccgccaggc gcggccagtc 2814361 ggcgtcgatg tcgccctcgt cggtcagcca cagcttcacc aggtcctcgc gggcgctggc 2814421 ccagcggtcg tcgaatacga cggccttctt ggggtcgaac accggtgcca ccaaccgcgg 2814481 ccagtccgct cccagttcgg cggtgaccaa gtcgatcagc tcggaatcgg gggcggccgg 2814541 caaggcgttg acggggtcgt ccagtcccag ctgccccagc accaggcggg ccgcggaggc 2814601 cagcacgccc tcacggccgg tgatttggtc ggtgaactcg ctgagcgcgg ccgcgtcgat 2814661 ggtggcgccg ccaccactac cggccgacgg cagcgctacc gaaacgccct ggcgcgcggc 2814721 caccgatgcg accgccgcgt cgatgacctt gtcgacggag gcggcatcgg ccagcgcgcc 2814781 ctcgtgcagg tggcccatgg cgccgccgcg aacgctgctg ccctcgcggg tgcccagcgc 2814841 gacctcgacg gtgacatgct tggcccagcc ctcaccgagc tcccaggtct tcttcacccg 2814901 ctcggcgatg gcgccgggcc gcttgcccga cggtccgagg acggtgcgaa gctggtcgtt 2814961 gatggcgtcg gaaagcactg ggccgtaagg cttgtaggtg cgcgccagtt tggtcacctg 2815021 tgagcgcaga ccggccaggt ccgattcggc ggcgccgtca atggcaccga ggttcagctc 2815081 ggagcccagg tccaccagca gctggttgcg ccgcgacgac gcaccgtcgg tgatggactc 2815141 gatggagtcg agttcttcga tctggtcgat gcgcatcttg gccgagagcg cgatcagcgc 2815201 cagcgtggca tcggcggcgt cgaaaaccag atcgtcggga cgcgggcccg ccgacgaagc 2815261 ggccggcgcg acgggggcgg cttccgagac gacgtccggc gcgggcgatt ccgcgaccgg 2815321 ctcgtcttcc tccggctccg gctccgggtc ggtgtcggtg gcgaacagca ccgcggcatc 2815381 acgctcggcg ttgagcactt ccactgtgct gtgggcgtat tcgggcagtt tgagggtgtt 2815441 ggtggcaaga cccgccaccg tcggtgagct cttcacaccg atctcgacga atcgctccac 2815501 acccagcccg ccggcggcct cctcgatgaa cagcagatcc tgcgtctcga tccagcgcac 2815561 cgggctggcg aattgccatg ccagcagctc gatgaacacc gtgcgcgcca tctcgcgcgg 2815621 acgctcgcga agccaggtgt cgtagtcggc gaggatctcg tcgagcggct cggcgggcac 2815681 caaatcccgg atttcctgga tgaagtcgcg gtccagggtg aacaaccgcg gcaccaggtt 2815741 gggaatgtag cgcccgatga tcaggtcggg gtccgcgtcg cgcggcatga cccggtccag 2815801 cgagcgccgg aattcggcca ccccgacccg cagcactcgc gagtggaacg gaacatcgat 2815861 gccgggcacc aaaatgaacg accgtcggcc gccggtgagc tcgcggcgcc gctccacctc 2815921 ggcctcgagc gcctcgaggc cgcgtaccgt gcccgcgatc gcgtattgcg agccacgcag 2815981 gttgaaattc acgatctcca ggaattcacc ggtgctctcc gcgatcccgg cgacgaacgc 2816041 gggcacgtcg gcgtcgtcga ggtcgatctg ggacggccgg atggccgcca gccgatagtt 2816101 ggagcggccg agctcgtcgc gcggaacgat gtcgtgcatc ttcgacccgc ggtgaaacac 2816161 catctccagc aaggcttcca gttggtagat gccggtcacg caggccagcg cggtgtactc 2816221 gccgaccgag tggccgcacg cgatggcgcc ttcgacgaag gctccctgtt cacgcatctc 2816281 ggcgacctgc gcggccgcca ccgtcgccat cgcgacctgg gtgaactgcg tcaggtagag 2816341 caccccgtcg gggtggtggt agtgcacacc gctggcgatg atgctggtcg ggttgtcgcg 2816401 gaccacgtgc agtaccgaga agcccagggt gtcgcgggtg aacttgtccg cggtgtccca 2816461 caccttgcgg gccgccttgg agcgggcgcg cacctccatg cccatgccct tgtgttggat 2816521 gccctggccg gggaatgcgt agaccgtctt gggtgcggcc agtcgcgcgg aggccgacat 2816581 cactagatcc gacccgacgc gcgcggccac gtccacaatc tctgcgccct ggtcgattcc 2816641 gacgcgctcg acgcggaagt ccacctcgtc gccggggcgc accatgccca aaaaccgcgc 2816701 ggtccagccg accagccggg ccggtggccg ggcctgcccg tcggtggcgg tcaccgcgtg 2816761 ttgcgccgcg gccgacagcc acatgccgtg cacgatcggc gactccaggc cggcaagcag 2816821 cgcggcggcc cggtcggtgt gaatggggtt gtggtcgccg gacaccaccg cgaacgggcg 2816881 catgtcgacc ggcgcggtga tcgtgacgtc gcggcggcga cggcgcgggg tgtcggtggc 2816941 gttcgccgac accgcgccac cggctcgcgc cgggtcggcg agctcggcgg aaccggtgcg 2817001 acccaggatc gcgaatcgct cctcgagagt ggcgatcacg gcgccatcgg cgccggtaac 2817061 gacgaccgag accggcacga cgcggcccat gtccgtatcg gttgcgttgg cagccgttgc 2817121 ggtgacggtc aattgggccg ggaccgtggg cagctgaccg accacgcggg cggcgtggtc 2817181 cagatgcacc aggctcagca ggccttccac caccggctca ccggtgtcgg tgaccgccga 2817241 tccgatggcc gcgaaaaccg ctggccaaca agggccgacg agcgcgtcgg gcacgttggt 2817301 gaggctgggt gccagcggct caccgaacgt ggcggtgacg ccggtgtggt cggcaacacg 2817361 ctcggggtgc cagtccaccg tcaaagtggc cgtcccgttg gccaccgcag gcaagaactc 2817421 cgggctgtcg acaccggcgg cgatcgccag caccgtgcgc atggcgctgg tggcgtcctc 2817481 ggtggcgatc accggggtgc cgccatcgac ggtgttggcc ggcaacgtga atcggatgtc 2817541 gacccaggtg cccgagacgg gcacgctcaa ggcgacgtcg tcgccgtgcg tctgcagccg 2817601 ggcgccggtg gatgagtgtg tggcgcgcgg gttttcgggt ccatcgtgca cctgccattc 2817661 ggccgggtcg gcgatccgat gcaccgggtt ggtcacggtg cgaccggccc agcgcacatc 2817721 gggtgcgtcg aggacgacag ccaacggtcc ggccacgtcg gcgcggccca gccggcgcga 2817781 cgcgacatcc ttcggctcga caccggcgcc gagcacttca tcgattgcgg cttgctcgaa 2817841 acggtccagc aactcaccga cgggttcatc catccgggtg atgccggcta ccgacgcggt 2817901 gcccggaatg atgcacaccg catcggcgtc gtagcgggcg tcgtgggcct gccacagcga 2817961 gtcgctgcgc caccagcgcc gcacgtcctg gtcgatcacc ggcacgaagt tgaccggctt 2818021 gcccagcgtc ttgcacaacg tcacgaaaaa gggcacatcc gcgggatgca actgcacggt 2818081 ctcggcgtcg gggtagcgcg ccagcagggc ggcgatcgcc tgctgcggat tgtccagcag 2818141 gccagcatcg gtgaatagcg tctggatcgg gccgaaatcc tgtgggtgca accgggcttc 2818201 ggcacgctgc agcatctgct cgaagcggtc ccgccaggtg tcggccagcc acgggctgcc 2818261 caccgaggcg gtgtcggcgg tcgagttgcc ttccccgatg gccagttcga cgtagcgccg 2818321 cagccactgc aggtaggtca tgtcggcgac gtcgccgaag tagggcttgg cggtcttggc 2818381 catcgccgcg atgatctcgt cgcgacgctc cgcgaccgcc tccgcgtcac cggccacctc 2818441 gtcgagcagc cgcccgcacc gggatgcgct gttgtcgatc tcgtggatgt cggcaccgag 2818501 ctgactgcgg ctggaggcca tgccgccctg cgcttttccg gcgctgatcc attggtcggt 2818561 gccctgagtg tcgacgagca tccgcttgac cgatggcgac gtggtggatt ccttggtggc 2818621 catcgccgcg gtgccgacca ggatgccgtc gatcggcatc aatgggaagc cgtaggcctg 2818681 cgcccagcgc ccggacaaat attccgcagc ccttctcggg gtgccaatgc cgccgccgac 2818741 gcacaccgtg atgttggcgc gtgagcgcaa ctccgagtag gtagccagca gcaggtcgtc 2818801 gagatcctcc caggaatggt gcccgccggc gcgcccgccc tcgacgtgca tgatcaccgg 2818861 cttggtgggc acctcggtgg cgatgcgaat caccgagcgg atctgctcga tggtcccggg 2818921 tttgaacacg acgtggctga tgccgatgtc gcccagttcg tcgatcagct cgacggcctc 2818981 gtcgaggtct gggatgccgg cgctgatcac cacgccgtcg atcgcggcgc cggactggcg 2819041 ggccttctgc accaaccgct tgccgcccac ctgaagcttc cacaggtagg gatcgaggaa 2819101 cagcgcgttg aactgatagg tgcggcccgg ctcgagcagg ccggccattt gttcgatgcg 2819161 gttaccgaag atctcttcgg tgacctgccc gccgccggcc agctcggccc agtgcccggc 2819221 gttggccgcc gcggcgacga tcttggcgtc cacggtggtc ggggtcatgc ccgcgagcag 2819281 gatcggcgag cggccggtca gccgggtgaa cttcgtcgag agcttgaccc tgccgtcggg 2819341 gaggcgaacc acggtcggtg cgtagctcga ccaggcccgg gcaacctcgg gggtggcgcc 2819401 gacggtgaac aggttgcgct ggccaccgcg ggtagccgcc ggcacgatgc cgatgcccag 2819461 gccgcggatc accggtgcgg tcagtcgggt caggatgtcg cccggcccca ggtcgaggat 2819521 ccagcgggcg ccggccgcgt ggacacgggt gatctcgtcg acccagtcga cctttctgat 2819581 caagatggca tcggccagct cccgagccaa ggcgacatcg aggcccgcct tctcggccca 2819641 gcccgcgacg atgtcgatcc cgtcggatag ccgcggggtg tgaaagccca cctccacctg 2819701 caccggctcg aagaccggcg agaagacgtc gccgccgcgg accttgttct tgcggtcggc 2819761 ttcttccttc tcggagatct ggcggcaata aagctcgaaa cgcgacagct gctcgggggt 2819821 gccggtgatg acgacggcac gccggccgtt gcggatggac aacaccggtg gcagcaccgt 2819881 gcgcacgtcc tgggcgaact cgtcgagcaa ccggccgatg cgctcggggt cggcgttggt 2819941 gaccgatacc atcggcgggc gatcgcccag gacggaaatt ccgcgccggc gggccaccag 2820001 cgttccggcg gcaccgatca actgggccaa ggcaaacagc tcgacgtcgc gtgccccacc 2820061 agccttgagg gcttccaccg ccagcacacc ttgcgaatgc cccgccatgg cgaccggcgg 2820121 ggtggccacg aggtccatgc cttgacgggc cagcgcccgg gtcgccgcga tctgggtaag 2820181 caacacgccg ggcaccgaca cggcggccga cgtcaggtgc ttgtcggacg gaaccgggtc 2820241 ctcggccgcc agtgcgcgta cccattgcag cggctcgaaa ccgatcgggc gcaccacaat 2820301 cagctcgtcg gtgaccggat cgagcaacag ctctgcctca ccgaccaacg tcgccaactc 2820361 ggtttctatc ccggtggccg acaccagctc ttcgagggtt tccagccagg cgctgccctg 2820421 gccaccgaat gcgacagcgt agggctcacc agccatgagg cgatcgacca gagcgtgggt 2820481 ggtatgcggg ctgtccccgc cgcgatcagc ggacacccgg tcgtgctcgt ggatcgtcac 2820541 ggtctatgtc tccctatgtg catcggtacg tgtcagttcg tacagcggcc caggctgccg 2820601 tgcggggcat ccccgactcc gcaccgactc ccagccgaaa tcctctgacc ggtgtgttgt 2820661 cggtgggccg gcccgtgggt cgagcagcgc gacgggctgc atcggcctta taagagtctc 2820721 ataaggatcg gtccaccttg tttacacaga tcggttactg gcgagttcta cgtacgggta 2820781 accgtgtcgt gggtaacgcc gggttcgacg gccggcgcgt atgtgttgac caaacgtcct 2820841 gcgtgcaggt ggttacggtg gagtagctat aactgcgctg atcaaggcag ttttgttatc 2820901 aaatcgttat gctgggaatt cgctctacgc cgggcgcgtg ccgacgcgcc gacccaaagg 2820961 ccgcgccatt ggcggcgttg gcccggggtt ggcaatgccg tgcagcgggc gaacgagtgt 2821021 ttgctgtagt gcagcggggg ccaggctcgg ggcggcaggc taagcccact gcccgaattg 2821081 gggcttcagg atttggttga cgtccacccc gaccccacca accttgcgct tatcgatctc 2821141 cacctggtgc agatgcgcgg ccgggtgggt gtatcccttg ggcgagcccc agttgtgctg 2821201 ccagaagtac gagcccaagc catcgttgac ggcccagtcg atggttttgg agttggcgta 2821261 cacgccggtc cgctggtgtc cgatcaccga ctcccaggac cgcagatatg gcacgatctg 2821321 gttcttgtac tgctcatatg atgggttgtc gtcgatcgag gcgtagatcg gggcgctcgt 2821381 cgggccgccg gcagcggcat gcagctccga cccccgtctg gcgtgctgca cgccggcgct 2821441 ggcaccgccc agccagtcgg cagtgctccc cttgccgtat tgataacagg acacgatctt 2821501 gagcccattg ccgctcaggt cacgggcctc gctgagctgg atcggcttgc caagcatcca 2821561 ggcgccgcca ggccgccgat cggacacgta ccggattgcc cccaccgcgc cggcagccct 2821621 gatctggctg gcggggatga caccggcggc gtagtccaac agggtgccca gcgaaccggc 2821681 cgatgccggc gcggcgcgca acgacgacgc aacgacgcca agacccagca cgcccggagt 2821741 cgccgccgcg aatttgagca catcacgccg agagaccgac atatgccaca gggtacgaca 2821801 aaaacaacaa ctgtcacact ggtttcagtg gtcacggatg catcacactg gcagaacaca 2821861 tgcatgcggc cataccgaca ccggtgcggt ctcgggcagg ccgcctctcc ctgcgaccac 2821921 tactacggtg tgatcgccta cgctcccaac ggcgcaatgg gcaaaatcgt cgcgccaccg 2821981 cactcgaggc caggcggata tcgacgcata agaactttgc ggcgtcttag ctgcaaagtg 2822041 ctcagcaact tcaccaacta ccacggggga gtccgacgat cgcgcccgct ggcagaacct 2822101 ggacgtgcaa ccagttgagt agttcccaca ctgcgcgccg agcgtgggct ggctgcgccg 2822161 aatgtgcact ggtggcggcg acacgcccgg gcgacgccgc cgtggttgca cgttcggcgt 2822221 aggcagcccc gtgcgcttgc cgggcaggtg tcctcaaagg tccaactaga cacacatatc 2822281 agacactagt atgtacatat gaccgtaaag aggaccacga ttgagctgga cgaagatctt 2822341 gtgcgggcag cccaggccgt caccggggaa acattgcgag cgacggtcga gcgcgcgctg 2822401 cagcagctgg tggccgcggc tgccgagcag gccgccgcgc gccggcggcg gatcgtcgac 2822461 catctcgcgc acgccggcac tcacgtggac gcagacgtac tgctctccga gcaggcgtgg 2822521 cgatgaccac ctggattctg gacaagagtg cccacgtgcg actcgtggcc ggcgccacgc 2822581 cgccagccgg catcgacctc accgacctcg ccatctgcga tatcggcgaa cttgaatggc 2822641 tgtattcagc acggtcagct accgactacg acagccaaca aacgtcactg cgcgcctatc 2822701 aaatccttcg cgcacccagc gacatctttg accgggttcg ccaccttcag cgcgacctag 2822761 cccaccaccg tgggatgtgg catcgaacgc cgcttccgga cctattcatc gccgaaaccg 2822821 cgcttcatca ccgggccggc gtgttgcacc acgaccgtga ctacaaacga attgccgtcg 2822881 tacggcctgg gtttcaagca tgcgaactct ctcgcgggcg ctagcttcgc ccgaatccgt 2822941 gagcggaggc gataatcctt acaggccatc aaaaaagtcc tcgtcgagcc gtaagagttc 2823001 gacggtctgc accgcctgga caccgactcg ataccgcacg agcagctcgg ccagccgagc 2823061 gccgtcgatg agttcgatcc gggcgttgat ccgctcagct tcctggcggg caccgcggga 2823121 aaacgatgac gtggtgatgt agacgccccg gtcgccctgc ttgcccagga gggcgccggc 2823181 gaactcgtgg atcttcggcc ggccaatcgt ttggtcgacg gcgtatcgct tggcctgcac 2823241 gtagatgcgg tccagcccga gcgggtcctg gctgatgatt ccgtcgatgc cagcgtcacc 2823301 ggaggcactc gtccgttcca ccgcgccggc tcgcccgtaa cccatcgcct ccaaaagtct 2823361 gataaccaga tcttcaaacc cggtgggcga caacgtgagt gccttcttca ggatctcccc 2823421 ctcgacggct gcccggttct ccgcaagcgc agcgtcgatg agatcctcgg gtgagacctg 2823481 cacatcgtcc ccggacggtc gcttggcggt cgcgtcgact ggctgcttgg ctttggttcg 2823541 ctcacgaaaa gcgatgtacg acgggaactc ccgcagcaca gccatgtcga cgcgctcggg 2823601 atgcgccttc aggacttgac ggcccgtgtc cgtgacctgg acgtggcccc gcgtgggacg 2823661 gtcgagcaat ccggcctgcg acatgtgagt gagagaccag tgcaccctgt cgtacatggt 2823721 cctttgccga ccgctgggca acatctgcgc ccgctcgtcg tcggacagac cgaactcgtc 2823781 ggacatcgcc gcgatgacgt ccttggccga cttcgcttgt ccatcggcaa gatacgcgag 2823841 aatcggccgc atcaacgtct gggcatcagg gatcgtcatg gggagccatt atccagctgg 2823901 cttgtcagcc ctccgaaccg gccaagttgg gtaagtccat ccggggctcc gtgttctgac 2823961 aggcccgctg caggcgtcgc atcttcctca tctgccccac gtgtacccgg tcccgccgac 2824021 ctaaaaggtc ggcatatccc tgccatgccg ggacgcgtga ggcgggtgag acacaaggga 2824081 acgtgcacct cgcgcaccgg gtcgccagca gccgcgacac gccgtcgtcc agtgccacac 2824141 cgaatgcggt gtcgggctcg gcgtcaaacg ctgccgatcg gccttgcctc gtcaggccgc 2824201 cgacagcacc gccctgggct cacggtccgc ggctccgccg ggatccgacc ggcggcggct 2824261 caaccccctc gatcgtcttg agccggtcga cagaccgatc gaaagacggc caccggatcg 2824321 tcccggcagg ggcgaggaag tccggcgtcc gagcaagcac cgagcgattg ccctcaacgc 2824381 ggaagacaac ccgatcaccc gattgcaggc cgagcgcgtc gcgcaccgct ttcggaaccg 2824441 tcacctgccc cttcgacgtg acgatgggtt cgtcggagtg cctgcttcac cgttgccgta 2824501 cgccgcccgt accctcacac tctgtggagc tgctcgtcgc cgccaacccc gctgaagact 2824561 cgcgcctgcc ctacctgatc cggctgccgg tgggcgcggg actggtcttc gccacctcag 2824621 acgtgtggcc gcgcaccaag gcgctgtatt gccatcgcct cgacatcgcc gactggcccg 2824681 ccgaccccgt cgtcgtcgac cgggtcgagc tacgcagctg cagccgccgg ggcgcggcca 2824741 tcgacgtcgt cgccgcccgc gcgcgggaga accgatcgca actggtgcac accatggcgc 2824801 gcggccgcca ggtggtgttc tggcagagcc ccaaaacgcg caaacagtcg cggccgggcg 2824861 tgcgcacccc caccgcccgc gccgccggca tccccgagct gcacatcgtc gtcgacgccc 2824921 acgaacgcta cccctacacc tttgccgaca aacccgcgaa gacgacgcgg gaagccctgc 2824981 cctgcggcga ctacggcctg aaagtggccg gccaactcgt ggcggccgtc gagcgtaaag 2825041 cgttggcgga ccttacttct ggcgtgctga acggcaacct gaaataccaa ctgaccgaac 2825101 tggccgcgct gccacgggcc gccgtggtgg tcgaggaccg ctactcggag atcttcgcgc 2825161 actccttcgc ccgcccgacg gcgatcgccg atgggctggc cgaattgcag atcggctttc 2825221 ccaacgtgcc gatcgtgttc tgccaaaccc gcaagctcgc ccaggaatac acctaccgct 2825281 atctagccgc cgccctcacc tggttcgtcg acgatgccga cgccaccacg gttttcgagc 2825341 cggctgccgc cgagcccgag cccagcagcg ccgagctgcg cgcgtgggcc aaaagcgtcg 2825401 gcctgccggt gtccgaccgg gggcgcctgc gcccgcagat cctgcaggcc tggcgagccg 2825461 cccatccccg gtgactacaa cacctcgacg aggcctgcgg atgctgaatc ggccagtgcg 2825521 gcatcgaatg tgaccaaccg gcccccgtag cgcgcggcca aggcgatgag atggcagtcg 2825581 gtgacccgac ggtggttgga caccgcatcg cgatcgccgg cgctcccaac gatcagtggc 2825641 acatcgtcag gccaaaacgt gtgcccggca agagaagtca tcgccgccaa ctgagcgatc 2825701 gcgatagccg gcgtggtcga cacctgcatc acactgcgat tgcttgaaat tcggacatac 2825761 cctgcctcgg tgatcggcgt ggtggcccac ccattcgagg agaactgcgt gaaccatcgc 2825821 tgcgcggccg catggtgaac gtgattcggc cagcccagcg cgatcagcac attgacatcg 2825881 agcagtgccg tcacacgtcg tcctcgagcg cgcggacgac atcctcggaa gtcaccgtcg 2825941 gcgcatccgg cggaacatca aaaaccggaa atccgtcaac ctcgacaatc ccaaccggac 2826001 ggagcgacct acgcgccaac tcagaaatta ccgcgccgac tgacttgccc tccgaccgcg 2826061 cgatgctacg agcatcttct agaacatcat catcaatctg caacgtggtg cgcatagcat 2826121 catgttacgg ggcttgggcc agctttcacg cgtcttcggc gaccccctgc agcacactgt 2826181 cgccgttgac ggtgccattc aaagccgaag cgtcccgcgg tacctcgaag gccggcagcg 2826241 cggcacctac cgtggcgacg gcgttgcgcg cggcctccat ccgggccaat gccgcctggg 2826301 tgaacaccga caaccccagg tcggggttgt atccgtggat ctctttgacg tcgagctggg 2826361 caagaaagta gatgatctcc ttggaaacca gttgacccgg caccagtacc gggaagccgg 2826421 gcgggtaggg caccacgaac gtggtggata ccagagtctt gccctcggcc agccggcgcc 2826481 cggccaagcc gatctgcacg tactcacggt cggcctcttc gtagccggcg tagaaagccg 2826541 accgcatgtc accgaaagag ctggcgtcgt cggggcggaa ggcaaggtcg aactcgctga 2826601 aatctggtag atgcggcaga tcctgcgtga tctcctcgac gtggcgtcgg tgtagagcaa 2826661 ggtcggcccc gctggccgcc ttctggctgc ggtccagatc gatcgccacc cgacgcaaca 2826721 catcgagcag atagtgcacg ctcgaccagg tgacgccgat cgtgaagatc agcaacacgc 2826781 tgttgataga cgttttgttg atctggatgc cgaatcgctc catcaggatc ttctcgcgga 2826841 agtcgtaccc gttcatcccg gtcgccccga taaacagggt gagccgcgtc ggatcgagca 2826901 cgaattgatc ggaccgccag gcttcgttcc aatcggccag agccccctgc ctgacctgac 2826961 ggtacgagct gaccgtcgag gaccgaaagg catcgggaac caggtcggac tcgtcaagga 2827021 tgcggaacca cttgctgatc agccggtctt tgcggacgcg atggcggaac accagcgcca 2827081 tgttgtaaac atggcggacc agctcgaacc cttcgatgtc aacctgtcgg cgcgccaagt 2827141 ccaacgaggc gagaagttgc tggttgggcg aggtcgaggt gtgggtcaag aatgcctcac 2827201 cgaacgcgtc ccgggtgagc gctttgaaat cctggtcgcg cacgtggatc atcgatgcct 2827261 gccgtagcgc ggacagcgac ttgtgagtcg aatgcgtcgc atacactcgg acccgagcgc 2827321 ggttggggtc tggcaacagc cggtgatcaa cccactcgga gcggtccact ccgtccatcg 2827381 acgcacacca attccggtat tcctcagcgt attccgcagt ggacaacatc tgctcgagtc 2827441 gctcggcagc aatcatcgcg gtccgctgcc gggcccaggg caccgccgtc gcaaacgcat 2827501 accacgcctc gtcccacaaa aagcagatgt ccggtttgat cgctagcacc tcctccatca 2827561 cccggcgcgg gttgtacacc acgccgtcaa acgtgcagtt ggtgagcaac agcatgcgca 2827621 cccggtgcag ctgtccggcg gcctcgaggt ccagcagcgc ctgcttgatg gtgcgcaacg 2827681 gcacggcacc ataaatcgcg tactgcggca gcggatatgc gtcgaggtac atcgggtacg 2827741 cgccggcaag taccaggccg tagtggtgcg acttgtggca attgcggtcg atgagcacga 2827801 tgtcgccggg gcgggtcagg gcctgcacga cgatcttgtt ggcggtcgat gttccgttgg 2827861 tgacgaagta ggtctggttg gcgttccagg tcaccgcggc tttgtccatc gccgtcttga 2827921 tgttgccatg cgggtccagc agcgagtcca gtccaccaga ggttgtcgag gtctcggcca 2827981 tgaagatgtt gcggccgtag aactcgccca tgtcgtgcag tgacttggag ttgaagatgc 2828041 tggcgccgcg cgcgacggga agggcatgaa attggccgac cggcgccgcc gcataggccc 2828101 gcagcgcatc gaaaaacggt gtggcataac ggtttcgtaa acccgcgagc accgtgctgt 2828161 gcaggtcggt gacgtcgttg agccggtaga aggtgcggtc gtagacgtcg ggctcgtcct 2828221 gggtctcggc ggcgatcgac tcgtcggtga gcagatagag gtcgatgtgg ggccgcaact 2828281 cacggatcca ctcggcgcat tccacccagt cgtgggtctc gtttgccacc gcttcgtcgc 2828341 catcggtgcc cagcagcgtg gtcatcagcg gcacccggtc gcgggaccgc agcggcaggt 2828401 cgtgacggat gatcgccgcc tgaatctcgc cattcagcgc caccgcggtg atggcatctt 2828461 cgatgctggc caccacgagc aactcgaact gcacctcgtc ggccggattg cgcaactgcc 2828521 gcaggcactc ggccaagctg tccggagccg tcgccgggga gtcgtcggcg agcagcacgg 2828581 tgtagaactg ctgctgtttg gcctgcgcta ccagctcctg ctccgccagt gacgcggagg 2828641 tgtcgaacag cgctgtgcgg tcgccgtatt cggacagcag tcgtacggcc aacgacactt 2828701 cctcggtaag ccgcaccgtg gaatgactat ccagatgagc gcggaaagtc gccagattct 2828761 gtgcccccgg atacagccag taccgctcat aggcgccgat gcggtccatc agccgcttcg 2828821 cccgagccac gtcgtgtgtg gtatcgagcc cggcgaggtc gacctccgcc aggtgacgac 2828881 acgcgtcatc gagcaggttc caggtgtcca ggcgggtgta ggacgggttg gccaccgcgg 2828941 ccagcgcgga gacatgcagc cgtcgcgggc ggacgctgtt tgggttcatg tcgtcacctg 2829001 ttctctggtg cgggtagcgc cgtagagtgc aaccaggcaa ttatcgcgcg caggaccggg 2829061 tcagtcagct aagtcgtcgc tgtccgcgat ccgccgatta gcccgattcc cggagttgtc 2829121 cacccagcgc agcaccggca gctgcgaaag ctcccggcgg cgtgccggca gatcggtgac 2829181 gtcacccagc gagcgcacca cggcctgcgt caggcccatg atggcggcca tcaccggtag 2829241 cagcggctgc aacggctttg ccgccgtgcg aacagtgctg atgcggcgtg ccaagatcag 2829301 gcgttcgatc gcgtgataca gccaggcgcc gacacccagc gcgtagatcg acattgccga 2829361 ggcgacaatg gtcagcccgt aggcggcgca caccacggca ccgagcacca ccgtcgcagc 2829421 caggacggcc gcgaccacga cgcgcagctc gagccgggtc atcacgcccc cccacgcacc 2829481 gcttgagcgg ccgcacgcag ctgcggggtc accagcatga cctggcccag caccccgttg 2829541 acaaagcccg gcgagtcgtc ggtcgacagc tccttggcca gctggacggc ctcgtcgacg 2829601 accaccggct ccggcacatc cgccgcgtgg agcagctccc ataccgagac gcgcagaatg 2829661 gcgcgatcca cggcgggcaa ccggtccagc gtccagcccc gcagatgcgc ggtgatcagg 2829721 tcgtcgatgt gggcggcgtg ttcactgacc cctcgagcca ccgcggccgt gtacggatgt 2829781 agccgggcaa tgtcgggctt cgcttcggcc agcgcggcac gggtgtcgac cacctcggcc 2829841 gcgctgatgc cgcggacctc ggcctcgaac agcaggtcca ccgcgcgctt acgggcctga 2829901 tgtcgtccgc gaaccggctt tctgtccgac atcgtcaggc gttgacccgg cccaggtagc 2829961 taccgtcgcg cgaatccacc tttagtttgt ctccggtatt gatgaacagc ggcacgttga 2830021 tctgggctcc ggtctgaagg gtggccggct tggtgcccgc gctggaccgg tcgccctgca 2830081 agccgggctc ggtgtgagtg acctcgagct cgacggtcac cggcagctcg atgtatagcg 2830141 gcacgccgtt gtggaacgcc acctgcaccg gcatgccctc cagcaggaac cgtgccgcgt 2830201 ccccgaccag ggcctccggc agcgggtgct gctcgtagtc ttggctgtcc atgaacacga 2830261 agtccgagcc gtcgcggtaa aggtaggtgg tatcgcgccg gtcgacggtg gcggtgtcca 2830321 ccttcacccc ggcgttgaac gtcttgtcga cgaccttgcc cgagagcacg ttcttcaact 2830381 tggtgcgcac gaacgccgga cccttgcccg gtttgacgtg ctggaactcg gtgattgtcc 2830441 acagctggcc gtcgattacc aggaccagcc cgttcttgaa gtcagcagtg gtcgccacgt 2830501 gggtctccta cagaatggcc agttctttgg ggaaccgggt caacaattcc ggggtctgcc 2830561 cggcggtttc aggcattttc ggcgtcccgc cagccactac caatgtgtcc tcgatgcgga 2830621 caccgccgcg gccgggtaaa tagacaccgg gctccacggt caccacggag cccgccagta 2830681 gtgtaccggc ggatgtgacc ccgatgcccg gcgcttcatg tatctgcagg ccaacaccgt 2830741 gtcccagtcc gtgaccgaag tgctcgccgt agccggcgtc ggcgatcagc tggcgcgctg 2830801 cagcgtccac cccccgcagc tcggcacccg gcagcaacgc ctgccgaccg gcctgttgcg 2830861 cctcggccac cagctgatag atctctagct gccagtcggc ggccttgccc aacacgaagg 2830921 tgcgggtcat atcggagtgg tacccggcga ccagggcgcc gaagtcgatc ttcacgaaat 2830981 cgccgacctg cagcaccgcg tcggtcggcc ggtggtgcgg gatcgccgaa ttggccccgg 2831041 cagccacgat cgtctcgaat gacaccgcgt cagcgccatg atcgagcatc agggcctcca 2831101 gctcgcggct cacctgccgt tcggttcggc ccggccgcag gccgccgcgg gccaccaagt 2831161 cggtcagcgc ggcatcggct gcttcgcagg ctagtcgcag cagcgccagc tcgccggcgt 2831221 ctttaacctc gcgcagtgac tccacagttc cggatgcccg caccaactcg gtgttcttgc 2831281 cctccagcgc gcccgccaag gcgtccaggc cgtccaccgt gaccacgtgg ctctcgaagc 2831341 ccagctttcc cacgccggcc tcgccggccc ggccggccag gtagcgcccg accgcgcgct 2831401 cgatagccac ttcgaggtcg ggcgcttgcg aggcggcctg agtgcggtac cggccgtcgg 2831461 tggccaacac ggcatcgcgc tcatcggcga acaccagcaa tgcgccgttg gacccgctga 2831521 agcctgatag atatcgcacg tttatcaggt cgctgatcag catcgcatcc aacccggagg 2831581 cagcgatttg tgctttcagc ttgtctcgac gctgggaatg tgtcacgacc cttgacggta 2831641 ctcgctacgc tgaatgccca tgactaactg gatgctgcgc gggttggcgt tcgccgccgc 2831701 gatggtggtt ctccgcctgt tccagggggc attgatcaac gcgtggcaga tgctgtccgg 2831761 gctgatcagc ctggtgctac tgctgctctt cgcgatcgga ggggtggtgt ggggtgtgat 2831821 ggacgggcgc gccgacgcca aggcgagccc tgaccccgac cgccgccaag acctggccat 2831881 gacctggctg ttggccggcc tggtagccgg cgcgctcagc ggcgcggtgg cctggctcat 2831941 ttcgctgttc tacaaagcga tctacaccgg gggcccaatc aacgagctga ccacgttcgc 2832001 ggccttcacc gcgctcatcg tctttctggt cgggatcgtc ggggtagccg tgggccggtg 2832061 gctggtggac cggcagctgg cgaaggcacc ggtgcgacac cacgggcttg ccgctgaaca 2832121 cgagcgggcc gccgacaccg atgtattctc cgccgttcgc gccgacgaca gtccgaccgg 2832181 ggagatgcag gtcgcgcagc ctgaggcaca aaccgcggcc gtcgccacgg tcgaacgtga 2832241 ggcacccacc gaggtgatcc gcaccaccga aagcgataca cccaccgagg ttatccgcac 2832301 cgacaccgag gcggaccaga ccaagcccgg cgacgagccc aagaaggatt aaccctcacg 2832361 tcccgacatg ctcagctagg taccgcaggg ccagcaggta gccctggatg ccgagcccga 2832421 cgatcacccc ggtcgcgatg gggctgaggt aggagtggcg gcggaactcc tcacgcgcat 2832481 gcacgttgga gatatgcacc tcgatcagcg gagcgctcag ctccgcgcag gcatcgcgca 2832541 gtgccaccga cgtgtgcgtc agaccgccgg cgttgaggat cacgggttcg gccgcatcgg 2832601 cggcctgatg aatccagtcc agcagctggg cttcgctatc actttgccgc acaacggctt 2832661 tgagtccgag ctcggcggcc tcacgctcga tcagagcgac cagctcgtcg tgggtggtgc 2832721 cgccatagac ggcgggctcg cgccggccca accggcccag gttggggccg ttgatcacgt 2832781 tcacgatcag ttcgctcatg gggcgcaaac tccggcgtag gcggttacca gcagaccggg 2832841 gtccggtccc accattcggc ccggcttggc caatccgtcg agcaccacga accgcaacac 2832901 acccgcccga gtcttcttgt cgccggccat gatttccagc agctggggca gcgcgtccgg 2832961 gtcgtagctg accggcaatc ccaacgagga caggatggtg cggtggcgct gcgcggtcgc 2833021 gtcgtcgagc cgcccggcaa gcctggccag ctcggccgcg aacaccagcc ccaccgacac 2833081 ggcggcgccg tggcgccacc ggtagcgttc ccggcgctcg atcgcgtggc ctaatgtgtg 2833141 gccgtagttg aggatttcgc gcagctcgga ttccttttcg tcggcggcga ccacctcggc 2833201 cttgacggtg atcgcgcgcc ggatcagctc gggcagcacg tcgccggccg ggtcgagtgc 2833261 ggcctgcggg tcagcttcga tgagatccag gatcaccggg tcggcgatga agccggcctt 2833321 gaccacttcg gccatgccgc agatcatttc gtcgcgtggc aaggtttgca gcgtcgccag 2833381 gtccaccagg accgccaacg gctgatgaaa cgccccgacc aggttcttgc cggcgtcggt 2833441 gttgatgccg gtcttgccgc cgacggccgc atcgaccatg cccagcagtg tggtgggcag 2833501 gtgcacaatc gagacgccgc gcagccaggt ggccgccgcg aacccggcga cgtcggtggc 2833561 ggccccgccg ccgaggctga ccagggcgtc tttgcggccg attccgatgc ggcccaacac 2833621 ctcccagatg aatcccacga cgggcaggtc cttgccggcc tcggcgtcgg ggatctcgat 2833681 gcggtgcgcg tcgacgccct tgccggccaa gcgctttcgg atctcttccg cggtctcggc 2833741 tagtccgggc tgatgcacga cggcgacctt gtgccggtcg gccagcaggt cttccagctc 2833801 gtcgagcagg ccggtaccga tgaccaccgg gtatggcgga tcgacggcca cctgcacggt 2833861 cacgggtgcg ccgatatcgg tcatgtggcc gcctcgctgg ggctgggaac ctgcagccgc 2833921 gacaggatat ggcggaccac cgccccgggg ttgcggcgat tggtgtccac tcgcatggtc 2833981 gcgacgcgcc ggtacagcgg tgcccgcttg gccatcagcg cgcggtattt ttcggcgcgg 2834041 tcggggccgg ccagcagtgg gcgcacggtg ttgccgccgg tgcggcgcac gccctcggcg 2834101 gcgctgatct ccaggtagac gacggtgtgg ccggccagcg ccgcgcgcac accggggctg 2834161 gtcaccgcgc cgccgccgag cgacagcaca ccgtcgtggt cggccagtgc cgcgcgcacc 2834221 acgtcctcct cgatacgtcg gaactcctgc tccccgtcgg tggcgaagat gtcggcgatg 2834281 ctgcgtccgg tccgctgctc gatcgcgacg tcggtgtcga gcaggccgac cccgagcgcc 2834341 ttggccagcc ggcgcccgat ggtggacttg ccggagcccg gcaggccgac gagaaccgct 2834401 ttgggtgcca tctgttaacc ggagacccgc gcggccggtg cttcgcggtc ggcgacgctg 2834461 cgctggtagg cggcgatgtt gcgctgggtt tcggccagcg aatccccgcc gaatttttcc 2834521 agcgccgccc gggccagcac caacgccacc atggtctcca ccacgacccc ggccgccggc 2834581 accgcgcaca catccgagcg ctgatggatg gcgacggcct catcgccggt cgccaggtcg 2834641 acggtggcca gcgcgcgcgg caccgtggag atcggcttca tcgccgcacg cacccgcagc 2834701 ggctgcccgt tggtcatccc gccttccagc cccccggccc ggttggtgga gcggacgacg 2834761 ccgtcgggcc cggggtacat ctcgtcgtgg gcgcggctgc cgcggcggcg cgcggtctgg 2834821 aatccgtcgc cgatctccac gcccttgatc gcctggatgc ccatgacggc ggcggccagc 2834881 tggctgtcga gccgatggtc gccgctggtg aacgacccca gccccaccgg caggcccagc 2834941 gcgaccgcct ccaccacgcc gccgagggtg tcgccgtctt tcttggccgc ctcgatttgg 2835001 gcgatcatgt ccgcctcggc ggccttgtcg taggcgcgta ccgggctggc gtcgatggcg 2835061 ggtaggtcct cggcccgcgg cggcggaccc tcgtagggtg ccgacgcgcc gatcgagatg 2835121 acgtgggaga gcacctcgac acccagcgcc tgcctcagga atgcccgtgc gaccgtgccc 2835181 gccgcgaccc gggcggcggt ctcgcgggcg ctggcccgct ccagcaccgg ccgcgcgtcg 2835241 tcgaagccgt atttgagcat gcccgcgtag tcggcgtggc ccggccgcgg ccgggtgagc 2835301 ggggcgttgc gtgcgacgtc ggccagctcg gcggggtcga ccgggtcggc ggccatcacg 2835361 gtctcccatt tgggccattc ggtgttgccg atctcgatgg cgatgggccc gcccagggtg 2835421 ctgccgtggc gtatcccgga cagcacggtc accgcgtcgc gctcgaacgt catccgtgcg 2835481 ccgcggccgt agcccagccg gcgtcgggcc agctggtcgg cgatgtcggc cgaggtgacg 2835541 tgcacgccgg cgaccatgcc ttcgaccacg gccaccaagg cgcggccgtg tgactccccc 2835601 gcggtgatcc agcgcaacac ctgaccatct tcccatgcgc cgccggcggc caccgcacgt 2835661 caacgcaccc actccgtgcg atcgcggtga tgtgcggccc cccggatgcc ccgctagcat 2835721 ccctggcgtg gaagtggctg gcggcacccg ggcccggctg cgggtcacag ccgatggttt 2835781 gcaggcgctg gccgggcggt gcgcgaccct ggccggcgaa ttgtcggccg cggtcgcgcc 2835841 gtcgggggcg gtgttgtcgt ggcaggccaa cgcggtcgcg gtgaacgccg cgcatgcccg 2835901 cgcgggtgcg gccgccgcgg ctgtgagcgc ccgaatgcgg gccaccgccg ccgcgctggg 2835961 gcaggccgcc cgccggtacg cgggccagga caccgcagcg gcggccgccc tgggggcggt 2836021 acgcccgtgg gggacccact gatggctacg tcggggctgc cgccgctgtc ggcggtgcag 2836081 tcgacgagct ttgcgcatct gagcgaggcc gccgcccact ggcggcggct ggccacgcgg 2836141 tgggagcgcg ccttagccga ggtgcgcgat tcgatgcgcc gacccggcgg caccgactgg 2836201 gagggccagg ccgcggcccg cgcccactac cggtcgaccg tcgacgtggt gacgatcggt 2836261 cgcgcggtgg accggctgca tgacgccgcc gccgtcgccg gccgggggaa gaccagctgg 2836321 aggccaaccg gcgggcggtg ctggacgctg tcagcgacgc ccgccgggac gggtttgccg 2836381 tcggtgagga ttacacggtc accgaccgct ccacgggtgg ctcacgccag cagcgggcgg 2836441 cgcgtctggg ccaagcccag gggcacgccg actttatccg gcatcgggtg ggcgcgctgc 2836501 tggccaccga ccgcgatatc gcgacccggg tcagcgccgc cacccaaggc ctcgatgagc 2836561 tggcgttcga agacgtgccc ggggtcgaca ccccggccga ggatggggtg caggcggtgg 2836621 atttccgcca ggccccgcca ccgggagccc ccgggggcat gtcctccggc gacatcgacg 2836681 cgatcgacgc ggccaatcgc gccctgctgc aagacatgct ggcggagtac agccggctgc 2836741 ccgacgggca ggtgaaaacc gaccggctgg ccgacatcgc ggccatccaa gaggcgctga 2836801 gggtgcccga ctcgcatttg atctatgtgg ccaggccgga cgaccccgcc gacatgatcc 2836861 cggcggtcac cgcggtcggc gatccgttca ccgccgatca cgtgtcggtg acggtccccg 2836921 gggtgtcggg aaccacccgt cagaccatcg ccaccatgac ccaagaagcc cgtgggctac 2836981 gagaagaagc gagagtgatc gcccacagcg tgggtgaaag tgagaatgtg gcgaccatag 2837041 cgtgggtggg gtatcagccg ccgccggtgc tcgcgtcgtg gaacaccgtc gatgacgatc 2837101 tcgcgcaggc cggcgctccg aagttggagg cgtttttgcg ggatctgcag gcgggatcgc 2837161 acaatccggg tcacacgacg gcgttgttcg ggcattccta cgggtcgttg ctgtcgggga 2837221 tcgcgttgaa ggatggcgcc agttcactgg tcgacaatgc ggtgctgtat ggctcgccgg 2837281 ggtttgacgc gacctcaccg gccaagctgg gcatgaacga ccacaacttc ttcgtgatga 2837341 ccacacccga tgaccccatc cggtatccgg cgcgcctggc acccctgcac gggtggggat 2837401 cagacggcgc cgacaccatc ggcactgtag gccgccaagg cacccctgca cgggtgggga 2837461 tcagacccca acgagatcat cgccggatcc ccggaccgct accgcttcac ccatctgcag 2837521 accgacgcgg gatccactcc gctgggtgat cacaagaccg ccgccagcgg gcactcgcaa 2837581 tacggccaag acccgctgca acggatgacc ggctacaacc tggcgaccat cctgctcaac 2837641 cggcccgatc tggcggtgcg cgaaagccca cagcagtgat cgcaccacaa ccgatttccc 2837701 gaacgctccc gcggtggcag cgcatcgtcg cgctgaccat gatcggcata tcaaccgccc 2837761 tgataggtgg ctgcaccatg gatcacaacc ctgacacatc acggcgcctg accggcgagc 2837821 agaagatcca gctcatcgac agcatgcgca acaagggctc ctacgaggcc gcccgggagc 2837881 gcctaaccgc caccgcccgg atcatcgccg accgcgtcag tgcggccatc ccgggccaaa 2837941 cctggaaatt cgacgacgat cccaacatac aacagtctga ccgaaacgga gcactgtgcg 2838001 acaagctcac cgcggatatc gcgcggcggc cgatcgccaa cagcgtaatg ttcggcgcca 2838061 cgttctcggc cgaggacttc aagattgccg ccaatatcgt gcgggaggaa gccgccaagt 2838121 acggtgcgac caccgagtcg tcgctattta acgaatcggc caagcgcgac tacgacgtgc 2838181 agggcaacgg ctacgaattc cgactcctgc aaatcaaatt cgccacactt aacatcaccg 2838241 gcgattgttt tctgttgcag aaggtgctcg acctgccggc cggacaactc cccccggaac 2838301 cacccatctg gccaacgacc tcgacgccac attgatcgca ccacaaccga ttccccgaac 2838361 gctcccacgg tggcagcgca tcgtcgcgct gaccatgatc ggcatatcaa ccgcgctgat 2838421 aggtggctgc acaatggatc aaagccctga cacatcacgg cgcctgaccg acgagcagaa 2838481 gatccagctc atcgacagca tgcgcaacaa aggctcctac gaggccgccc gggaacgcct 2838541 caccgccacc gcccggatca tcgccgaccg cgtcagtgcg gccatcccgg gccaaacctg 2838601 gaaattcacc gaagatcccg ccgggcgaaa ggccgatcgg gaaggtttgt cgtgcaagga 2838661 actcaccggc gatatcgccc ggcggccgat cgccgacgca gttatctttg gtactgcgtt 2838721 ctcggcggag gacttcaagg ttgtcaccaa tatcgtgcgg gaggaagccg ccaagtacgg 2838781 tgcgaccacc gagtcgtcgc tatttaacga atcggccaag cgcgactacg acgtgcaggg 2838841 caacggctac gaattccgac tcctgcaaat caaattcgcc acacttaaca tcaccggcga 2838901 ttgttttctg ttgcagaagg tgctcgacct gccggccgga caactccccc cggaaccacc 2838961 catctggcca acgacctcga cgccacattg atcgcaccac aaccgattcc ccgaacgctc 2839021 ccacggtggc agcgcatcgt cgcgctgacc atgatcggca tatcaaccgc cctgataggt 2839081 ggctgcacaa tgggccaaaa ccccgacaaa tcaccgcacc tgaccggcga gcagaagatc 2839141 cagctcatcg acagcatgcg ccacaaaggc tcctacgagg ccgcccggga acgcctcacc 2839201 gccaccgccc agatcatcgc cgaccgcgtc agtgcggcca tcccgggcca aacctggaaa 2839261 ttcaacgacg actcctacgg ccaagacttc tatagaaatg gatcgttgtg taaggaactc 2839321 agtgccgata tcgcccggcg gccgatggcc aaaccggttg acttcggtag cacattctcg 2839381 gcggaagact tcaagattgc cgccaatatc gtgcgagagg aagccgccaa gtacggtgtg 2839441 accaccgagt cgtcgctgtt taacgaatcg gccaaacgcg actacgacgt gcagggcaac 2839501 ggctacgaat tcaacctggg ccaaatcaaa ttcgccacac ttaacatcac cggcgactgt 2839561 tttctgttgc agaaggtgct cgacctgccg gccggacaac tcccccccga accacccatt 2839621 tggccgacga cctcgacgcc aaccccgtga gcaccaccat cgttgctggc gtgatccagg 2839681 gtcacctgcc ggtgatcctg cccacgcgca ggcgggctcg cgatctcggg cacacgacgg 2839741 cgttatttcg ggcgcaaacg ctccaatgca tatatctcag tatcgaatac ctatatgttt 2839801 gctccatgtc tcggcgtaca acgatcgaca tcgatgacat actgctggcc cgcgcgcaag 2839861 cggcgctcgg taccaccggg ctgaaggaca gggtcgatgc cgctttgcga gccgcggtgc 2839921 gctagtcggc gcgcactcgg ctcgccgcgc gaatcgcctc gggtgccggc atcgatcggt 2839981 ccgaggcgct gcttgcccag acgcgtcccg cgcggtgatg gtgttctgcg tcgacaccag 2840041 cgcgtggcat cacgcggcgc ggccggaagt tgcgcgccga tggttggcgg ccttgtccgc 2840101 ggaccagatc ggcatctgcg accacgtgcg gttggagatc ctgtactcgg cgaactccgc 2840161 taccgactac gacgcgctcg ccgacgaact cgacggcttg gcccgtatac cagtcggtgc 2840221 cgaaaccttt acgcgcgcat gccaagtcca gcgtgagctt gcccacgtcg ccggtctgca 2840281 tcaccgcagc gtgaagatcg ccgatcttgt catcgccgcg gcggccgaac tttcaggcac 2840341 catcgtgtgg cattacgacg agaactatga ccgggtcgcc gccatcaccg gccaacctac 2840401 ggagtggatc gtgccgcgcg ggacccttta accgctgata ggcgccatca ctggatgtat 2840461 ggtgatgtca tgcggactca ggtgaccctg ggcaaagagg agcttgagct gctcgatcgt 2840521 gccgccaagg cgagtggcgc atcgcggtcc gaactcatcc gacgcgcaat tcaccgtgcc 2840581 tacgggactg gatccaagca ggaacggctc gccgcgctcg accacagccg tggctcgtgg 2840641 cgaggacggg acttcaccgg caccgagtat gtcgacgcca ttcggggcga cctcaacgaa 2840701 cgacttgctc ggctcggtct ggcgtgaagc tgatcgacac caccatcgcg gtcgaccacc 2840761 ttcgcggcga acccgcggca gccgtgctgc tcgccgaact gataaacaac ggtgaggaga 2840821 tcgcggccag cgagctggtc cgattcgaac tcctcgccgg tgtgcgggaa agcgaactcg 2840881 cggcgctcga ggccttcttc tcggcagtgg tgtggaccct ggtgaccgag gacattgccc 2840941 ggatcggcgg acgactcgcc cgtcgatacc ggtccagcca ccgcggtatc gacgacgtgg 2841001 actacctgat cgctgcgacc gccattgtgg tcgacgccga cctgctcacc accaatgtgc 2841061 gccacttccc gatgttcccg gatctgcagc cgccgtactg agcactccct ggggcatcag 2841121 ccttggtcgg cgatgagttg ttcgatgagc tcgacgatgc gctgttggcc ggcggcggcc 2841181 ccgtccagct tgcctcgcat ctcggtgaat ccgtcgtcga ctcgactaaa acgttcttct 2841241 acgtgactga aacgttcggt catctcttcc cgcagggcgg tgaaatcttc tcgcagggcg 2841301 ttgaagctac cgattgtagc tcgccggaag tcgcggaact cgccaacgaa ctccgtgaca 2841361 tcgcgatcgg ccgcgccggc tagcacgcga gcggcggcgg catcctgttc gctggcccgc 2841421 acgcggtcag ccagctcacg cacttgggat tccagcgcgg tgacccgttg ttcgaggttc 2841481 tcgggcagca cgagcgaatc ctaccgcgat tcaacgcaac gcagccctgt cccgggcgga 2841541 caccggcatt gggtgcacgt cggataagca gggctgagcg gggctcggct ctactcgggt 2841601 cttacctcga caaatccggc cgcgctgaag tcaccatcga aggcatacgc attttggatg 2841661 cctttctttc gcatcaccgc gaagctcgtg gcatcgacga acgagtactc tcgctcgtcg 2841721 tggcgtacaa gccattccca tgcctgctct tccaggtcgg ctgttacgtg ctcgacgcga 2841781 acgacggtgc tcaagcggat tgcagcggcg gcaaccgccg cgcggtgacc gcagcgccgg 2841841 ttgagcagcg tccaggtctc gcccaggaca tggttggagg tcatcaccac gggcggtttg 2841901 ctggcccaca acctcttcgc ggtgccgtgc cgagcgtcgc cggcgttgcc aagtgcagcc 2841961 cagaaggacg tgtcgacgaa gatcattcgt gctttccgta aaccacgtcg tcgacggacg 2842021 cggacaagtc ggcttccccc acgaacgatc cgacgaaggc atcgaccgga tctgggcccg 2842081 gctgccggag gtgctcagcg acgtactccc ggatcagcgc cgccttcgac gtccgccgcc 2842141 gtcgcgcttc aacagcaagc gctcggtcaa cgtcttcgtc gatgtagatc tgcagccttt 2842201 tcacatggca aatatacgcc actagcataa tgctgtatac atcggtagct gagaatcgga 2842261 tgcttgccgc tggctgccga gtttgttgaa ctcgccgccg tggtgacctg gatgaagtgt 2842321 gcccgccgaa actgccgccg cccgactagg cgactggcca aagcgatgac agtgctgact 2842381 tctgtacagg ggcgaagcga gtgtccgccc ctttacgcgc gtcgtaatca gccgctgagt 2842441 tcgccatggt tcccatgcaa gatcgctcac gttcgagccc ggcgcgcggt gaccgacatg 2842501 aaactcccgt tacgagcaag catgcggcaa cgccgcctcg acggcgacgg tccaagctgt 2842561 cctctgaacg aatcaggttg cgctaagcca agattcgttg tcaaacgacc tctggtctac 2842621 actgatatcg cgccaatctc agcccagcag cgccaacccc accgccccca ggctggccac 2842681 acacatcgac ggcccgtgcg gcagggtgcg gacaccccat ggcgtcacca tcacgccgca 2842741 caccgcggtc agcagcggcg cggccagcgc cgccagaaac cacacctcga ccccgaagca 2842801 gccggtcagc ccgcccagac cgatcgccag cttgacgtca ccggcgccca tcgcggcggg 2842861 caaagccagg tgcaccagca ggtacacccc ggccaaggcg gccgccccgg ccagcgccgg 2842921 cacaccgcgg ccggcaaggc ccgcgaagag caggatcacc cccgccccgg gcagggtgag 2842981 ccagttgggt agccggcgct gccggacgtc gcaaacgcac aacactccca tccaggccaa 2843041 caccgccgcc gccagcatgc tggggcacgc tagtccaacg cggccagcgc gcaagtcatc 2843101 gcttcgcggg gggcgggtag cccggtgaac tgctccacct gcgcgaacgc ctgatgcagc 2843161 aacatctgca gcccgctgat cacccgcccg cccgccgatc cgaccgcggc ggccagcggt 2843221 gtgggccacg gatcgtagat ggcgtccaac agcaccggga tcgcggccaa ggtgccggca 2843281 taccccgcgg ccacctccgc tggaatggtg ctgaccagca cttccgcggc ggccaccgca 2843341 tcggccaacc caccgctgtc gaacgcgcag aaccgggtcg ccacgccgac ccgtgtgccc 2843401 aggtccacca gccgggccgc cttgtccgag ttgcgcgcca ccacggtgat gtcggtgacc 2843461 ccgagttcgg ccagccccac cacggccgcc ggtgcggtcc ccccggaccc cagcaccagc 2843521 gcgtgtccag cagccgcccc caacgccccg gccaccccgt cgatgtcggt gttgtcggcc 2843581 cgccagccat gcggcgtccg aaccagggtg ttggccgaac cgacaaggtc cgcgcgtgcg 2843641 gtgcgctcgt cggcgaaccg cagggcggcg aacttgcccg gcatggtcac cgaaacaccg 2843701 acccactccg gtccgaaacc accgaccacg acgggcaact cggccgcacc gcattcgatg 2843761 cgctcatagg tccagtcgtg cagccccaac gcccggtagg cggccaggtg cagctgcggg 2843821 gagcgggaat gcgcgatcgg cgaaccaagc acgccggctt tttgggacct tcgctcatcg 2843881 cgcgctgtcg aggacaccgt tgtgtttggc cagctcgatg ttcgccagat gctgctgata 2843941 gtccctggtg aacagcgtcg tgccctggga atcgatggtg acgaagtaca gccagtcgcc 2844001 aggtactgga tgctcggcgg cgcgcagcgc gtcgacgccg ggcgaacaga tcgcggtggc 2844061 cggcagcccc tgggccatgt aggtgttcca cggtgtgcgc tgggcacggt cggtgtcgct 2844121 ggtggccacc tcacggcgat ccagcggata gttcacggtc gagtcgaact ccaacgtgcg 2844181 gtgttcgtgc agccggttgt agatgacccg ggccaccttc gggaaatcct gggtgttggc 2844241 ttcctgctgc accagcgagg ccaccacgag aatgtcatag ggcgacaggc ccagcgactt 2844301 tgcggtgtct accaacccgg atttcatgta ctccacggcg ccggcgctga tcaaggtcgc 2844361 caagatggtt tcagccgatg ccgacgggtc gatgttgaag gtccccggtg cgatcagccc 2844421 ctcgatccgg cgatggtcag tgcccagctc catcaccggc ccaaccgccc agcgcggcac 2844481 tgacagcatc gtcggcgtgc tcctgctcgc cgccgcgcgg aggtcggcca ccgagacgca 2844541 gcgttgggta ccgtcgagat ccacacaggt ggcacgggag atcagcgcga atatgccagg 2844601 attcaccacg ttggtcttca tgtcggtggt gtcgtcgagc tgacgccctt ccggtatgac 2844661 caacttcccc acccggttgt gcggatcggt aagccgcgcg acagcggaag ccgccgaaat 2844721 ctcggttcgc atccgataga acccgggttg gatcgaggaa atcgcggtgt tgccgtgcgc 2844781 ggcatcgacg aatgctcgga cggtggccac tacaccgtgt ttgagcagcg tctccccgac 2844841 cgccgtggtc gagtcaccgg ccctgatctg aatcacgatg tctcgcttgc cgggaccggt 2844901 gtagtcgtta ccgaagccca acatggtctg ccacaacttg gcgccgacga cgacggccac 2844961 caccaccacc acgacgagca ggctcagggc aaatccgccg gcgacgcgcc gtcgccggcg 2845021 gatttgttgg gcgtgtcggc gctgagcgcg gctgactcgg gtcctgcggt gccggttcgg 2845081 tcttaccgac accggctggg cgcggtggcg gtggccaccg tcaggcatcg gagccttctt 2845141 gagtcccggc catcgccgcg agacgttcat ccagccagct ctgcagtatt gccactgcgg 2845201 ccgcttggtc gatcaccgca cgctgctcgg aggcccgcac ccccgcctgc cgcaaagatc 2845261 gttgagcact gaccgtggtg agccgctcgt cggccagccg caccggcgta ggagaaacac 2845321 ggcgtgccag cgcctcggcc agttcgattg cgtcttgggc cgagcggccg atgcggtcgg 2845381 ccagcgtgcg cgggagcccg acgatcacct cgaccgcctc caactcggcg gccagcgcag 2845441 ccagcctgcg caggtgcttg ccggaacgat cgcggcgcac cgtttccacc ggggtggcca 2845501 agatcgcgtc cgggtcgctg caagccacgc cgatacgcgc ggcgcccacg tcgataccga 2845561 ggcgtcgtcc ccgtccaggg tcgtgcgctg gatcgccggg ccggtcgggc gggcggtgct 2845621 gtgctgggac cactcaaccg acccgcgcta tcacggcgat ctcggagcgg accgcgtcga 2845681 gcgcggcgtc gataccggtc ggattctttc ccgagccctg cgccaggtcc gccttaccgc 2845741 caccgcggcc ttcgaccgcc accgcaagtt gtttgaccag gtcgttggca cggattccga 2845801 ggtcctgggc agcgggattg gccgcgaccg catacggcac agtttggctt tcgccctcgg 2845861 caatcagcgc caccaccgcc ggctcgctac ccagcttgcc gcggatgtcg ccgatcaacg 2845921 accgcaggtc tgccgcggtc atcccgccgg acattcgctg cgccaccaaa cggacgttac 2845981 cgatccgctg agccccggcg gcggcattgg tggcggctgc ccgggcgctg gccatccgga 2846041 cacgttcgag ttccttctcg gcggcccgca ggcgctccac tagattggcc acccgggccg 2846101 gtacctcttc ggacggcacc ttcagtgacg aggccaaccc ggccatcaac gcacgctcct 2846161 tggccaggtg acgaaacgaa tccaacccca cgtaggcctc cacccggcgc accccggagc 2846221 cgatcgacga ctcgcccagg atcgtcacgg gaccgatctg cgccgtgttg ctcacatggg 2846281 tgccgccaca tagctccagc gagaacggtc cacccatctc caccacccgc acttcgtcgg 2846341 ggtagctctc gccgaacagc gcgatggcac ccatcgcctt ggccttgtcg agctgttcgg 2846401 tgaacgtgcg cacctcgaag tccgcttgca cggcctcgtt ggtgacctct tcgacctggg 2846461 tgcgctggtc gtcggtcaac ggaccctgcc agttaaagtc gaagcgcaaa tatcccggcc 2846521 ggttcagcga tcccgcctga accgcgttgg gccccagcac ttgtcgcagc gcggcatgca 2846581 ccatgtgggt gcccgagtgg ccctgcgtgg caccccggcg ccacccggga tccaccgccg 2846641 cgattacggt gtcaccctcg acgaattccc cggattccac gttgactcgg tgcacccaaa 2846701 gcgttttggc gatcttctgc acgtcggtaa ccgcggcccg ggcagcttcg ctggaaccgg 2846761 ttccgctgat ggtgccctca tcggcgatct gcccacccga ttcggcgtag agcggggtgc 2846821 gatctaagac aagttcgaca cgctgccctt ccccggctcc gccggctaca ccgtgcgcca 2846881 ccaccggaac ccgcttaccg tcgacgaaga tgcccagaat ccgcgcctgg gaacgcaact 2846941 cgtcgaatcc ggtgaactcg gtggcgccgg cgtcaaccag ctcgcggtag gcgctcaggt 2847001 cagcatgcgc gtgtttgcgc gcggcggcgt cggccttggc acggcggcgc tgctcggcca 2847061 tcagctcacg gaacccgatt tcgtctacct gcagaccggt ttcggccgcc atctccagcg 2847121 tgagctcgat cgggaacccg taggtgtcat gcaacgtgaa agcgtccgat ccggacagca 2847181 cggtggctcc ggatttcttg gtggagctag ccacctcctc gaacagcctg gaacccgacg 2847241 ccagcgtgcg gttgaacgcc gtctcctcgg cgaccgcgat ccggctgatc cgctcgaagt 2847301 cggcgacgag ttcgggatat gacgggccca tcgcgttgcg caccgtggcc atcaggtcgc 2847361 caacgatcgc agcgtcgatg cccagcagct tggcggagcg gatcacccga cgcagcagcc 2847421 ggcgcagcac ataaccgcga ccgtcgttgc cggggctgac gccgtcaccg atcaggatcg 2847481 cggcggtgcg gctgtggtct gcgatgatgc ggtaccgcac gtcgtcttcg tggttgccga 2847541 cgtcgtaggc acgcgcggcg accctggcca cggtatcgat gaccggcctg agcaggtcgg 2847601 tctcgtagac gttgtgcacg tcttgcagca ccagcgcgat ccgctcgacg cccatgccgg 2847661 tgtcgatgtt gttgcggggc agcggcccga ggatctggta gtcctccttg gtggttccct 2847721 ctccgcgctc gttctgcatg aacaccaggt tccagacctc gaggtagcgg tcttcgctga 2847781 cgatgggacc gcctgcggga ccgaattcgg gtccgcggtc gtaatagatc tccgatgacg 2847841 gcccgcacgg tccgggaatg cccatcgacc agtagttgtc ggccatgccg cggcgctgga 2847901 ttcgctccgc cggcagcccg gcaacctcct gccatagccg gacagcttcg tcgtcgtcga 2847961 aatagactgt cgtccagatt ctttccgggt ccaggccgta gccgccggcg gcgaggctgt 2848021 tggtcagcag tgcccaggcc agttcaatgg ccccgcgttt gaaatagtcg ccgaagctga 2848081 aattgccggc catctgaaaa aacgtgttgt gccgggtggt tatgcccacc tcgtcgatat 2848141 cgggggtacg gatgcacttc tggatgctgg tggccgtcgg gtacggcggc gtgcgctgtc 2848201 ccaagaagaa aggcacgaac tggaccatcc cggcgttgac gaacaacagg ttggggtcgt 2848261 cgaggatcac cgaggcgctg ggcacctcgg tgtggcccgc cttcacgaaa tgatcgagga 2848321 accgcttcct gatctcgtgt gtctgcactc tacgttcttc cttgatccgt ggttaagtcc 2848381 attaccagcc tattcgccgg attatgagaa ggctgtccga cggcccaatt cggcccgctc 2848441 agccttccac aaagctcaat cgcaccgacc gccgcggatt gtcctggttg aggtcgacca 2848501 gaacgatgct ttgccaggtg cccagcaggg gctggccccc cgagaccggc accgtcaccg 2848561 acggcgcaac aaaagccggt aacaagtggt cggcgccgtg accgtaggac ccgtgcgcgt 2848621 gccggtagcg gtcgtcgcgc ggcaacaacc gcaccagcgt gtccaccaga tcctcgtcgg 2848681 aaccggcgcc ggtctcgata atcgcaacgc cggccgtagc gtgcgggacg aacacgttgc 2848741 acaggccatc atcatgggcg gtgcagaagg cgcgcacggc gtcggtgaga tcgacaatgc 2848801 ggcgacgcgc ggtgtccaca tccagcacat cggtatccac ccgtcccagc ctacggtggg 2848861 ggcgcgccaa cctgccaatc cattgacgtc ggattgccca ttgccccggc cggcccgtcg 2848921 gaggaaggta atgattgacc ggtggcgcca ccggggcgct gccccgaaca atgaaagagg 2848981 ggtggatcgt gtacgcgcgc tctaccacta ttcaggcgca atccgagtgc atcgacaccg 2849041 gaattgcgca cgttcgcgat gtggttatgc ccgcactgca ggggatggat gggtgcatcg 2849101 gcgtatccct tttggtcgac cggcaatccg gcaggtgcat cgccaccagt gcctgggaga 2849161 ccgcggaagc catgcatgca agccgggaac aggtaacgcc gatccgcgat cggtgcgcgg 2849221 agatgttcgg cggcacgccg gccgtcgagg agtgggagat cgcggcgatg catcgcgacc 2849281 accgctcggc cgagggggcg tgtgtgcggg cgacctgggt caaggtgccg gcggaccaag 2849341 tagatcaagg catcgagtac tacaagtcgt ccgtcctgcc ccaaatcgaa ggcctcgacg 2849401 gattctgcag cgccagcctg ttggtcgacc gcacctccgg gcgcgcggtg tcttccgcga 2849461 ccttcgacag ctttgacgcc atggagcgca accgggacca gtcgaatgcg ctcaaggcca 2849521 catcgctgcg tgaggcgggc ggcgaggaac tcgatgaatg cgagttcgag ctggcgctag 2849581 cgcacctacg ggtacccgag ctggtctgat caacccgccg gcggcagtac cggcccgagc 2849641 ccgacgctgg gccggcactg ctgtcgtgcg tcgagcggcg ctcgcggtag gcattgccag 2849701 gctcagccgg ttggaggaag gtatttggtg ggaccggtgg cgccaccggg gcgctgcccc 2849761 gacacgggag ggggtcgatc gtgtacgcac gctcaaccac cattgaggcg caacctctgt 2849821 cggtcgacat tggaatcgcg catgttcgtg acgtcgtcat gcccgctttg caggagatcg 2849881 acgggtgtgt cggggtgtcg ctgttggtcg accggcaatc cggccggtgc atcgccacca 2849941 gcgcctggga gaccttggag gcgatgcgcg ccagcgtcga gcgggtggca cccatccgcg 2850001 accgcgccgc gctgatgttc gccggtagtg cccgggtcga ggaatgggac atcgccctgt 2850061 tgcaccgcga ccacccgtcg catgaggggg catgcgtgcg cgccacctgg ctcaaagtgg 2850121 tgccagacca gctcggtcgg tccctggagt tctaccgcac gtccgtactt cccgagctgg 2850181 agagtctgga cgggttctgc agcgccagcc tgatggtcga ccaccccgct tgccggcgtg 2850241 cggtgtcgtg ctcgacgttc gacagcatgg acgcgatggc ccgcaaccgc gaccgggcga 2850301 gcgagctgcg cagcaggcgc gtccgggaat tgggagccga ggtcctcgac gtcgccgaat 2850361 tcgaactggc gatcgcacat ctacgggtac ccgagctggt ctgagcggac ctgcttcccg 2850421 cagagcgcag cggtcacccc cgtttcttgc ggatgattgc ccgcaggcgg tccaggcggc 2850481 cggcgatctc gcgttcgccg ccgcgaccag tgggccggta gtagtccacg tccaccaact 2850541 cgtcgggcgg gtattgctgg gccacaacgc catccgggtc gtcatgggaa tatttgtagc 2850601 cctgtgcatt gcccagcgcc gccgccccgg agtaatgccc gtcacgcaga tgagccggca 2850661 ccagaccggc cttgccggcc ttgatgtcgt tcatcgccgc ggccaacgcc gtggtgacgg 2850721 cgtttgactt cggtgcggtg gccaggtgga tggtggcgtg cgccagcgtc agctgggctt 2850781 cgggcatgcc gatcagcgcc accgtctgtg cggcggcgac cgccacctgc agcgcgctcg 2850841 ggtcggccat gccgatgtcc tcgctggcca gaatcatcag ccggcgggcg atgaaccgcg 2850901 ggtcctcccc ggcgaccagc atgcgggcca aatagtgcag cgcggcatcg acgtcggaac 2850961 cgcgcaccga tttgatgaag gcgctgacga cgtcgtagtg ctggtcgccg tcacggtcgt 2851021 agcgcaccgc ggctttgtcc accgaccgct cgatggtttg cacgctgacc agctcgccgg 2851081 ccgcctgggc tgcctcggcc gctacttcca gcgcggtcag ggcgcgccgg gcgtcgccgg 2851141 ccgcgagttg caccagcagg tcgacggcct caggcgctac cgcgactgcc ctgcccaggc 2851201 cgcgggggtc atcgatcgcg cgttgtacta ccgcgcgggt gtcctcggcc gtcagcggcc 2851261 gcagctgcag gatcagcgac cgcgacagca gcggtgccac caccgaaaac gacgggttct 2851321 cggtggtcgc cgccaccaac agcaccaccc ggtgttccac cgccgacagc agggcgtctt 2851381 gttgggtctt ggaaaatcgg tgcacctcgt cgatgaacag cacggtctgc tcgccgtgaa 2851441 gcagcgcttt tcgcgaattc tcgatgaccg cccgcacttc cttgacgccg gccgacaatg 2851501 ccgacagggc ctcgaaccgg cggccggtgg cctgcgagat caacgccgcc agcgttgtct 2851561 tgccgctgcc cgggggaccg tagaggatca ccgacgccac ccccgagccc tcgaccagcc 2851621 ggcgcaacgg cgaaccgggc gccagcaagt ggtcctggcc gaccacttcg tccagcgacg 2851681 ccggacgcat ccgcaccgcc agcggtgccc cggccgaagc gcccaggtca tggccggacg 2851741 tcatcggtac gccgggcacg tcaaacagac cgtcggacac ggcttcaggc ataccacgcc 2851801 cacctgacga cgcgaacgtt cgccgaagac gccacacgaa taatccgcgc gccttcggca 2851861 aatatttgct aagttccggt ttgcttagcg tcgcgcgggt accgataaaa gcgaactacg 2851921 aagcgattgg gacagcgatg agccagccgc cagaacatcc aggcaatccg gccgaccccc 2851981 agggcggcaa tcagggcgct ggaagctacc cgccgcccgg ctacggagcg cctcccccgc 2852041 caccaggcta cggcccaccc ccggggacct acctgcctcc cggctacaac gcacccccgc 2852101 cgccccccgg ctatggccca ccgccgggcc cgccgcctcc cggttacccg acgcatctgc 2852161 aatcgtcggg ttttagcgtg ggcgacgcga tcagttggtc atggaatagg ttcacgcaga 2852221 acgccgtaac gctcgtcgtc ccggtgctcg cctacgctgt ggcgttggcc gcggtcatcg 2852281 gcgcgacggc cgggctcgtt gtcgccctat cggaccgtgc tactaccgca tacaccaaca 2852341 cctccggcgt ctctagcgaa tccgtggaca tcacgatgac cccggccgcg ggcatagtca 2852401 tgttcctcgg ctacatcgct ctattcgccc tggtgctcta catgcacgcc ggaattctga 2852461 ccggctgcct tgacattgcc gacggaaagc cggtgaccat cgcgacgttc tttaggccgc 2852521 gcaatctggg cctggtgctg gtcaccggac tgctgatcgt cgccctcacc ttcattggtg 2852581 gcctgctctg tgtcattccc ggcctgatct ttggcttcgt cgcccagttc gccgtcgctt 2852641 ttgccgtcga ccgttccact tcgccgatcg actcggtaaa ggccagcatc gagacggtcg 2852701 ggtccaacat cggtggcagt gtgctgtcgt ggctcgctca gctcacggcg gtgctcgtcg 2852761 gcgaactgct gtgctttgtc ggcatgctga tcggcattcc ggtcgccgcg ctcatccacg 2852821 tctacaccta ccggaagctg tcgggtggcc aagtcgttga ggcagtccgg ccagcgcccc 2852881 cggtcggctg gccgcccggc ccccagctcg catagtcggc acccgccgac gccggctggc 2852941 cgtcttggcc cgctggattt gtcacgcgct cacccgaatt ggcatccggg gcctggaacg 2853001 cgttagggca gtggctttcc cacaggttga cgtaaatgac ctccaagata ggtatcgaac 2853061 caaggttgcg gccgatgtgt acgtagttcg agagttcgct gatctgatca ctcgcgtggt 2853121 cgatgcagtc gacggaaccg gcagccgcca cccaagggtg cgcaggtggt tagcaaatcg 2853181 ccgacgaaca cgacgccacc gcgtcatgcg ccatcgccga ccccgccttg gtggctgaga 2853241 gccgctcgcc ggcgttaagc tgcccaacat catgggcatt caacgcgccg ttctcctcat 2853301 tgccgacatc ggcggataca caaattacat gcactggaac cgcaagcacc tggcccacgc 2853361 gcagtggacg gtggcacagt tgctggagtc cgtcatcgac gctgccaagg gcatgaagtt 2853421 ggcgaagctg gagggcgacg ccgcgttttt ttgggcacca gggggcaaca ccagtgtcct 2853481 ggtatgcgac cggcccccgc agatgcgcca gaggttccgc acgcggcgcg agcagatcaa 2853541 aaaagaccat ccctgcgact gtaagagttg cgagcagcgg gacaacctgt cgatcaaatt 2853601 cgtcgcccat gagggcgaag tggccgaaca aaaggtgaag cgcaacgtcg aactcgctgg 2853661 cgttgatgtc atcctggtgc accgcatgct gaaaaatgag gtgccagtgt cggaatatct 2853721 attcatgacc gacgtcgtag cgcagtgcct cgacgagtcg gtgcgaaaac tagcgacgcc 2853781 gctgacacat gacttcgagg gcatcggaga aacgtcgaca cactacaccg acctcgccac 2853841 gtccgacatg ccgccggcgg tgccagacca cagcttcttc ggcctgctgt gggcggatgt 2853901 gaagttcgaa tggcacgcgt taccgtacct gttaggtttc aagaaggcct gtgcaggttt 2853961 ccgcagcctg ggccgcggcg ccaccgaaga gcccgccgaa atgggctaat cgggttcgct 2854021 tggctcgatc gccgatgatc tcgaccgcca cgaccgaccc cctcacctcg gtcgaacctc 2854081 ggcgaaccaa cgcggcaacg ccagcccatg atcatttgat tgggtccacg gaagcaggta 2854141 gcttccgtcg catgcttttt gcggctttgc gtgatgtcca atggcgaaaa cgacgccttg 2854201 tcatcgcaat cgtcagcacc ggcctagttt tcgcgatgac gctcgttctg accggacttg 2854261 tgaacgggtt tcgggtcgag gccgagcgaa ccgtcgattc catgggtgtc gacgcattcg 2854321 tggtcaaggc cggcgcggca ggaccgttcc tgggttcgac accattcgcc caaatcgacc 2854381 tgccccaggt tgctcgtgcg cctggcgtct tggctgccgc cccactagcg actgcgccgt 2854441 cgacgatccg gcagggcacg tcagcgcgaa acgtcaccgc gttcggggca ccagagcacg 2854501 gacccggcat gccgcgggtc tcggacggtc gggcgccatc gacgccggac gaggtcgcgg 2854561 tgtcgagcac gctgggccga aacctcggcg acgatctgca agtgggtgcg cgcactttgc 2854621 ggatcgtcgg catcgtgccc gagtcaaccg cgctggcaaa gattcccaac atcttcctga 2854681 ccaccgaagg cctacagcag ttggcataca acggacagcc gacaatcagt tcgatcggga 2854741 tcgacgggat gccccgacag ctcccggacg gctatcagac cgtcaatcga gcggatgctg 2854801 tcagcgatct gatgcgcccg ttgaaggtcg cggtggatgc gatcacggtt gtggcggtct 2854861 tgctgtggat cgttgcggcg ttgatcgtcg gctcggtggt ctacctctct gcgttggagc 2854921 ggctgcgtga ctttgcggtg ttcaaggcga tcggcgtgcc gacgcgctcg attctggccg 2854981 ggctggcgct gcaggcggtc gtcgtcgcgc tgctcgcggc ggtggttggc ggcatccttt 2855041 cgctgctgtt ggcgccgttg ttcccgatga ctgtcgtggt acccctgagt gccttcgtgg 2855101 cgctaccggc gatcgcgact gtgatcggtc tgctggccag cgtcgcagga ctgcggcgcg 2855161 tggtggcgat cgatccggca ctagcgttcg gaggtcccta gccatgggcg gcctaaccat 2855221 ttccgacctg gtcgtcgagt attccagcgg cgggtacgcc gtgcggccga tcgacgggtt 2855281 aagcctcgac gtggcgccgg ggtcgctggt gatcttgctt gggcccagcg gctgcgggaa 2855341 gacgaccctc ttgtcctgcc tcggcggcat cctgcgcccg aagtccggct caatcaagtt 2855401 tgacgatgtc gacatcacga cgctggaggg cgccgcgctg gcgaagtatc ggcgtgacaa 2855461 ggtagggatc gtcttccagg cgttcaacct ggtctcgagc cttaccgccc tggagaacgt 2855521 gatggtcccg ctgcgcgcgg ccggcgtgtc acgagcggcc gcgcgtaagc gtgccgagga 2855581 cctgctgatc cgagtcaatc tcggcgaacg aatgaaacac cgcccgggtg acatgagcgg 2855641 cggccagcag caacgcgtcg cggtcgcccg cgcgatcgcg ctggacccgc aattgatcct 2855701 tgccgacgaa ccgaccgcgc acctggactt catccaggtg gaggaggtgc tgcggctgat 2855761 ccgctcgcta gcgcagggcg accgtgtggt ggtggtcgcg acccacgaca gccggatgct 2855821 gccgctggcc gatcgcgtcc ttgagctgat gccggcgcag gtgtcgccga atcagccacc 2855881 cgaaacggtg cacgtgaaag ccggcgaggt gctgttcgag cagtccacaa tgggcgatct 2855941 gatctacgtg gtgtccgagg gcgagttcga gattgtgcgc gaattggccg acggcggtga 2856001 ggaattggtc aaaaccgccg cgcctgggga ctacttcggt gaaatcggcg tgctgtttca 2856061 cctgccacgc tcggcaacgg tacgggctcg cagcgacgcg acagccgtcg gttatacggc 2856121 gcaggcgttt cgggagcggc tgggtgtgac gcgggtggcc gacctgattg agcaccgcga 2856181 gcttgccagc gaatagttcg gcaccaagtc gcgatccctg agggttgcga tgggcgcggc 2856241 gccgccgctg aatcgaccgc cccccactga gccgccgtgg aatactcgat gaatcctgcg 2856301 ggcgtgtccg cactgcgtgt ggctatggag ttggggaaca tgttgcttgg gataagaacg 2856361 tgaatgaggg accgctcttc acaatgtcag gcactgccgt gagaagtccg ctactcgatc 2856421 gggtgtatgt gagcagtcct ggcatgggcc gagatgccaa gagccgcatc tcatgaccac 2856481 cgcgcgacga cggcccaagc ggcgtggtac cgatgcgcga accgcgctgc gcaacgttcc 2856541 gatactcgcc gatatcgacg acgaacagct cgaacgactc gcaaccaccg tagaacgccg 2856601 ccacgtgccc gctaaccagt ggctctttca tgccggagaa ccagcggact ccatctatat 2856661 cgtcgactcg gggcggttcg tcgctgttgc cccagaggga cacgtatttg ctgagatggc 2856721 atccggcgac tcgatcggag acctgggggt gatcgccggg gctgcccgct cagcgggagt 2856781 gcgagctctg cgagacggcg tggtgtggag gatcgccgcg gagacgttta ccgacatgct 2856841 cgaggcaacc ccgctactgc aatcggcgat gctgcgagcg atggcgagaa tgctacgcca 2856901 gtcacgaccc gccaagacgg ctcggcgtcc gcgggtcatc ggcgtggtat cgaacgggga 2856961 caccgccgcg gccccgatgg tcgacgcgat cgctacttca ctggactcgc acggtcgaac 2857021 tgccgtgatt gcgccgcccg tcgaaaccac ctccgccgtt caggagtacg acgagctcgt 2857081 cgaggcgttc agcgaaaccc tcgatcgcgc ggagcgaagc aacgattggg tcttggtggt 2857141 cgccgaccga ggcgccggcg acctgtggcg gcactacgtt agcgcgcaaa gcgaccgact 2857201 cgtggtcctg gtggatcaac ggtatccgcc ggatgcggtc gattcgcttg ctacccaacg 2857261 gccagtgcac ctgatcacat gtctggcaga accggatcca agttggtggg atcggttggc 2857321 gccggtttcg catcatccgg ccaactccga cggcttcggt gcccttgctc gcagaatcgc 2857381 cggccgatcg ctcggcctgg tgatggccgg tggcggagcc cggggactgg cgcatttcgg 2857441 tgtttaccaa gagctcaccg aagccggcgt cgtcatcgat cggtttggcg gaacaagttc 2857501 gggtgcaatc gcttccgcag cgttcgcgct ggggatggac gccggggatg cgatcgccgc 2857561 ggcgcgagag ttcatcgcag gaagcgaccc actcggcgac tacacgatcc caatatccgc 2857621 cctcacgcga ggtggacgcg tcgatcgtct ggtgcaggga ttcttcggca acacgttgat 2857681 cgaacatctg cccagagggt tcttctccgt ctccgccgac atgatcaccg gcgatcagat 2857741 catccatcgg cggggatccg tctcgggcgc cgtgcgcgca tcgatctcga tccccggtct 2857801 catcccgcca gtgcacaatg gcgagcagct gctcgtcgac ggtgggctgt tgaacaatct 2857861 gccggccaac gtgatgtgcg ccgataccga tggcgaagtc atctgcgtcg acctccgccg 2857921 aacgttcgtg ccgtcgaagg gctttggcct gctgccgcca atcgttacgc cgcccgggct 2857981 cctccggcgg cttttgaccg gcacggataa cgcgctacca ccgctgcaag agacgttgct 2858041 gcgcgccttc gaccttgccg cctccaccgc aaacctgcgc gagcttcctc gcgttgcggc 2858101 catcatcgag cccgacgtgt cgaagatcgg agtgttgaac ttcaagcaga ttgatgccgc 2858161 cctagaggct gggcggatgg cagcccgtgc ggctttgcaa gcacagccgg acctggtgcg 2858221 ctgaacccga ccaagtgccg ctacggccca ctcaggtgtc cagcaccggg cgtacgcgct 2858281 gcgccgggcg gtccggtgtg atctcatcag cagctatgag catcaaagtt gcgctggagc 2858341 accgcaccag ctacaccttt gaccggctgg tgcgggtgta tccgcacatc gtgcggctac 2858401 gcccggcgcc gcactcccgc acctccatcg aggcctactc gctgcgcatc gagcccgccg 2858461 accacttcat caactggcag caggacgcgc tgggcaactt tctggcgcgg ctggtctttc 2858521 cgaatcccat gcgccaactg cgtattaccg tcgggcttat cgccgacctc aaggtgatca 2858581 accccttcga cttctttatc gaggactggg ccgagatatg gccctgcgca gggatggcct 2858641 accccaaggc gctcgccgat gacctgaggc cgtacttgcg gccggtcgac gaagacggcg 2858701 acggttcggg ccccggcgag ctcacgcagg cctgggtgcg caacttcacg gtgcccgatg 2858761 gcacccgcac catcgacttc ttggtcgcac tcaaccgcgc gatcaacgcc gacgtcggct 2858821 actgcgtgcg catggagccc ggagttcaga caccggattt cacgctgcgc accggcgtcg 2858881 gctcgtgccg ggactcggcg tggctgctgg tctcgatcct gcgtcagttc gggctggccg 2858941 cccggttcgt gtccggctac ctggttcagc tggcatccga catcgaagcg ctcgacgggc 2859001 cgtcggggcc cgccgccgac ttcaccgacc tgcacgcgtg gtccgaggca tacatcccgg 2859061 gtgccggctg gatcgggctg gacccgacgt cggggctgtt ggccggcgag ggccacattc 2859121 cgctggcggc tacgccccac cccgccagcg cggcacccat cagcggcggc accgacgtgt 2859181 gcgacaccgt gctggagttc tccaacaccg tcacccgcgt acacgaagac ccacgtgtca 2859241 cgttgcccta caccgacgag tcctggaaga ccatctgtga ggtgggccag cgcgtcgatg 2859301 agcggctggc cgccgccgac gtccggctga ccgtcggcgg cgaaccgacg ttcgtgtcgg 2859361 tggataacca ggtcgccgaa gagtggcgga cggcggccga cggcccacac aaacgcgaac 2859421 gggcatccga cctggccgcc cgcttgaagg cggtgtgggc cccgcaggga ctcatccacc 2859481 gcggtcaggg caggtggtat cccggagagc cgttgccgcg ctggcagatt gcgctgtatt 2859541 ggcgcaccga cgggcggccg ctgtggacca acgacgcgct gttggccgac ccctggggcg 2859601 ccccgcccgc cgaccccgtc gacgacgacg cggcctaccg ggtgctcgcc gggatcgccg 2859661 acggcttggg gctgccgatc tcgcaggtgc ggcccgccta cgaagacccg ttgagccggc 2859721 tggctgcggc cgtgcgaatg ccagccggcg acccggtgga atccggtgac gacctcggct 2859781 gcgacaccaa ccccgacacc cccaccggcc gcgccgcgct gctggcgcct cgatgaggcc 2859841 atcacctctc cggctgcgta cgtgctgccg ctgcaccgcc gcgacgacgg gcaaggctgg 2859901 gccagcgcga actggcggct gcgccgcggt cgcatcgtgt tgctcgaagg ggattcgccg 2859961 gcgggcctgc ggctgccgct ggattcgatc agctggcgcc caccccgggc atcgtttgac 2860021 gccgacccgg tagctgtgcg atccacattg ccggcggagc cccacaccga ccgggccgta 2860081 gtggaggatc ccgagacggc tccgaccacc gcgttggtcg ccgaggtccg gggtgggctg 2860141 gtgcacatct tcttgccgcc caccgacgcg ctcgagcact tcatcgacct tgtcgcccga 2860201 gtcgaggccg cggcgacgac ggccaactgc ccggtggtga tcgagggcta cggcccaccc 2860261 ccggacccgc ggctgacgtc caccacaatc acccccgacc ccggcgtcat cgaggtcaac 2860321 atcgcgccca ccgcctcttt tgcagaacaa cggcaacagc tggaaaccct gtatcaacaa 2860381 gcgcgcctgg cccgactcac caccgaagcg ttcgacgtcg acggcacgca cggcggcacc 2860441 ggcggcggca accacatcac gcttggcggc gtcacacccg cggactcacc gctgctgcgc 2860501 cggcccgacc tgctggtttc actgctgacc tactggcagc gacacccgtc gttgtcctac 2860561 ttgttcgccg ggcgtttcgt cggcaccacg tcacaggcgc cccgggttga cgagggccgc 2860621 gccgaggcgc tctacgaact cgagatcgcg ttcgccgaga tcctccggct gtcgccgtcg 2860681 tccgggggcg gccggcccca accgtgggtg accgaccgcg cgctgcggca cctgctcacc 2860741 gacatcaccg gcaacaccca tcgcgccgaa ttctgcatcg acaagctcta cagccccgac 2860801 agcgcccggg gcaggctcgg cctgctggag ctccgcgggt tcgagatgcc gccgcacctg 2860861 cacatggcga tggtgcagtc gctgctggtg cgctcgctgg tggcgtggtt ctgggaccaa 2860921 ccgctgcgcg ccccgctgat ccgccacggc gccaacttgc acggtcgata tctattgccg 2860981 cacttcttga ttcatgacat cgccgacgtc gcagccgacc tgcgcgcgca cggcatcgcg 2861041 ttcgagacta gctggctgga cccgttcacc gagttccgct tcccgcgcat cggcaccgcc 2861101 gtattcgacg gcattgagat cgagctgcgc ggggccatcg agccatggca cacccttggc 2861161 gaggaggcca ccgcggcagg caccgcgcgc tatgtcgact cgtcggtcga gcgcatccag 2861221 gtccgcatca tcggcgccga ccggcaccgc tacgtggtga cctgtaacgg ctacccgatg 2861281 ccgttgctgg ctaccgacaa ccccgacatc cacgtgggtg gtgtgcggtt caaagcgtgg 2861341 cagccgccca gcgcgctaca cccgaccatc acggtcgacg gcccgttgcg gttcgagctc 2861401 atcgacatcg ccaccgctac ctcgtgcggc ggctgtacct accatgtcgc ccatccgggc 2861461 ggccgcgcct acgacgagcc cccggtcaac gccgtggagg cggaggcccg ccgcgcccgg 2861521 cgcttcgagg cgaccggctt caccccgggc aagctcgacc tgtccgacat ccgggagaaa 2861581 caggccagga tatccaccga tatcggcgcg ccgggcatcc tcgacctacg acgcgtgcgt 2861641 accgtgcaac agtaatggca ccctcagctt ctgccgctac caacggctac gacgtcgacc 2861701 gcctgctggc cggataccgc accgcgcgtg cccaggaaac actgttcgac ctgcgggacg 2861761 gcccgggagc cggctatgac gaattcgtcg acgacgacgg caacgtgcga ccgacctgga 2861821 ccgagctcgc cgacgcggtc gccgaacgtg gcaaggcggg gctggaccgg ctgcgctcgg 2861881 tggtgcacag cctgatcgac cacgacggca tcacctacac cgcaatcgat gcacaccggg 2861941 acgcgctgac cggcgaccat gatctggaac cggggccgtg gcgcctggac ccgctgccgc 2862001 tggtgatttc cgcggccgat tgggaagtgc tggaggccgg cttggtgcag cgatcgcgct 2862061 tgcttgatgc catcctcgcg gacttgtacg ggccccgcag catgctcacc gagggtgtcc 2862121 tgccgccaga gatgctgttc gctcatcccg gctacgtgcg tgccgctaac gggatccaga 2862181 tgcctgggcg ccaccaactt ttcatgcacg cctgtgatct cagccggttg cccgacggga 2862241 cttttcaggt caacgccgac tggacgcagg cgccctcggg ctccggctat gcgatggccg 2862301 atcgacgtgt cgtcgcgcac gccgttcccg atctgtacga ggaactggcg ccgcgaccca 2862361 ccacaccgtt cgcccaggcg ctccggctgg cactgattga cgcggcaccc gatgtcgccc 2862421 aagaccccgt cgtggtggtg ctcagcccgg gcatctattc agaaaccgct ttcgaccagg 2862481 cgtatctcgc aacgctgctg ggtttcccgc tagtggaaag cgcggacttg gtggtgcgcg 2862541 acggcaagct gtggatgcgt tcgctgggca cgctgaaacg cgttgacgtc gttcttcgcc 2862601 gcgtcgatgc ccactacgcg gatccactgg atctacgcgc cgattccagg ctcggtgtcg 2862661 tcggtttggt ggaagcgcag caccgcggaa cagtgaccgt cgtcaacacg ctgggcagcg 2862721 gcatcctgga gaacccaggc ctgttgcgct tcctgccgca gctatccgag cgcctgctcg 2862781 acgaaagccc gctgctgcac accgctccgg tctactgggg cggcatcgcc agcgaacgct 2862841 cacacctact ggccaatgtc tcgtcgctgc tgatcaaaag cactgtcagc ggggaaactc 2862901 ttgtcggacc gacactttcg tctgcacaac tggccgatct ggcagtgcgt atcgaggcga 2862961 tgccgtggca gtgggtgggc caggagctgc cgcagttctc gtcggcgccc accaaccatg 2863021 ccggggtgtt gtcgtccgcc ggggtaggca tgcgactgtt caccgttgcc cagcgcagtg 2863081 gttacgcgcc gatgatcggc ggcctcggct atgtactggc gcccggtcct gccgcatata 2863141 cgctgaaaac cgttgcagca aaagatatct gggtgcgccc aacggagcgt gcgcatgccg 2863201 aggtgataac ggtgccggtg ttggcgccgc cggccaaaac cggagcgggc acctgggcgg 2863261 tcagctctcc gcgcgtgctg tccgatctgt tctggatggg ccgctacggc gagcgcgcgg 2863321 agaacatggc ccggctgctg atcgtcaccc gcgagcgcta ccacgttttc cggcaccagc 2863381 aggacaccga tgaaagcgag tgcgtgccgg tgctgatggc cgcgctgggc aagatcaccg 2863441 gatatgacac cgcaactggc gccggcagcg cttacgaccg ggccgacatg atcgcggtcg 2863501 ccccgtcgac actgtggtct ttgaccgtgg atccggaccg gccgggttcc cttgttcagt 2863561 cggtggaggg gctggcactt gccgcccggg cggtgcgcga ccagctgtcc aacgacacct 2863621 ggatggtgct ggccaatgtg gaacgcgcgg tggagcacaa gtccgacccg ccgcagtcgc 2863681 tggcagaggc ggacgccgtg cttgcgtcgg ctcaggcgga gacgctagcc ggcatgctga 2863741 cgttgtccgg ggtggccggc gagtcgatgg tgcacgacgt gggctggacg atgatggaca 2863801 tcggcaagcg tatcgaacgc ggcctgtggc tgaccgcgtt gctacaagcc acgttgagca 2863861 ccgtgcgcca ccccgccgcc gagcaagcca tcatcgaggc aaccctggtg gcgtgtgaat 2863921 cgtcggttat ctatcggcgc cgcaccgtag gcaagttcag tgtcgccgct gtgaccgagc 2863981 tgatgttgtt cgacgcccag aacccgcgct cgctggtgta tcagctggaa cggctgcgcg 2864041 ccgacctgaa agacctgcct ggctcgtcgg gatcgtctcg tccggaacgg atggtggacg 2864101 agatgaacac ccgcctgcgc cgctcacacc cagaagagtt ggaagaggtc tccgccgacg 2864161 ggctgcgcgc cgagttggcg gaactgctgg ccgggataca tgcctcgctg cgtgacgtgg 2864221 ccgacgtcct caccgccact cagttggcgt tgcccggcgg catgcaaccg ctgtggggtc 2864281 cagaccaacg gcgggtgatg ccggcctaaa cggtgcgacg gctgtgagcc ggctcgaaat 2864341 ccggggccac ctcgtcgacg acggtgtgga tgaaccgcat cttctccagc acagcggccg 2864401 gcagcacaaa ggggtatagg tcgtcgtggc ccatcgagcg attgaccatg ttcagcgacc 2864461 acgacagcgg cagccacttg tcgatgatgg tattaaaagc gctggggccc aacgccggcc 2864521 ggtcgaaggt tgccgacgcc ggtgccaggc cgcaccaggc cgcggtgtcc agggcgtcgc 2864581 ggatatgcag gtaatgagcg aacgtctcgg cccaatcctc actcgcgtgc atggtcgcat 2864641 acgacgagac aaagctgtcc tgccaacctt ccggcgggcc gccacggtaa tgccgatcca 2864701 acgcctggga gtagtcagcg tccgggtctc cgaacaactc gttgaaccgg gacagatagt 2864761 cgcttgacga ggcgatgagt cgatagaagt agtagtgccc gatctcgtgg cggaagtgcc 2864821 caagcagggt ccgatacggc tcgtccatct cgacccgcag ctgctcccga tgcacatcgt 2864881 cgccttcggc gagatccagt gtgatgactc cgttctggtg tccggtggtc acgttctcgt 2864941 gcgcgctgga caatagccgg aaggccaacc catggtcagg atcctggtcg cggccgacga 2865001 tcggcagctt cagctcgtgt agctcggcga tcagccgccg cttggcacct tcggctcggg 2865061 cgaactccgc cagcccggcg gtgttggtat cgctgggccg ctcgatggtc agcacacaag 2865121 aactgcaaag tccgccgagc tgatcactgg gcaccagcca attgcattgc gcgaggtgga 2865181 gattggcgca gagttggaca tcggcgtcgt cggcgatgac cagcagcgcc atccgcccaa 2865241 gagaaaaccc cagcgcgctg ccgcacgaca ggcaggcgga gttctcgaat gccaggcgct 2865301 gcccgcaatt tggacagtgg aagtcacgca tgcagcgcat caccttcgaa gggcacgaca 2865361 tcgacagaaa cgtcgatcac actgttctcg gagttggtgt agatgatgcc gcgtagcggc 2865421 ggcacgtctg cgtagtcgcg gccgcggccc acgacgatgt agcgctggtc gaccaactgg 2865481 tcattggtgg gatccagccc cagccactcg aaccgcccgg gctgctgcgg agtccacacc 2865541 gaggcccagg catgcgtcgc gtcgatgccg atcatccgat cctttccggg cggcgggtcg 2865601 gtggccaggt agcccgacac ataacaggcc gccaaaccgt tggcccgtag gcaggcgatc 2865661 gccagcctgg cgaaatcttg gcatacccct tcgcgggcca gcagcacctc gttgactcct 2865721 gtggaaatcg tcgtggaacc cgagcggtag gtgaagtcgg tgtagatccg cgacgcgaga 2865781 tcgcgcaata cctcgaccag ggggcgtttg ggcaggaagc taggagccgc gtactcacgc 2865841 accgcatcgg tgatctccgg cgggttcaag tccagggtga actcggtggc tagcgatccg 2865901 ggcagcccgg cgggccgggc cgcctcccac ggttgcagcg ccggcccgct ggtgtaaagc 2865961 ccgggcggcg gcggggacac gtcgacgatg gaatcgctgg tgatcgtcaa ggtgcggtgc 2866021 ggttcggtga cgtggaaata ggagctgatg ttgccgtacc cgtcgcgact ggtggaccgg 2866081 tcggcggggg ccgggtcgat ggtcagccgg tgtgcgacac aacgctgccg cagcgaattc 2866141 cgcggcgtga gaaacccgcg gccataggag ctggtcacca cgtcggagta gcggtattcg 2866201 gtgcggtgtg ttactcgata gcggtgagtg cccgacaacg gcaacgacaa cgagctatct 2866261 gctgacaaaa agctacctcc tggctgatca catcacacgc cggcggctcg tccggcgcga 2866321 tcgtcgcgca atgtggcgcc aagcgcacca tagccggagc acaattaaag cgtggctacc 2866381 tgggacgacg tcgcccgtat cgtgggtggg ctgccgctga ccgcggagca ggcaccgcac 2866441 gactggcgtg ttggccgcaa gctgctggcc tgggaacggc cgctgcgcaa gtccgaccgc 2866501 gaagccctga ccagggccgg atcggagcca ccgtccggcg acatcgtcgg tgtccgagtg 2866561 tcggacgagg gggtgaagtt cgccttgatt gccgacgagc cgggcgtgta cttcaccacc 2866621 ccgcatttcg acggctatcc agcggtgctg gtcaggctgg ccgagatcga ggttcgcgac 2866681 ctcgaggagt tgatcaccga ggcctggctg atgcaggcgc cgaagcagct ggtgcaggcg 2866741 tttctcgcca attcaggctg acatgcccga cgggcccggg cgttcgatta cccgttgtag 2866801 atcggtgaca cacgcttgga cgatatcggc gcgcaccact tcgttgctgc cacaagcagc 2866861 cgattgcagt gtcgacgcgg ttgcgcgggc ggcggccgcg tgctcgttcg ctgccgtcgg 2866921 atccgcgtcg gccaggccgg ttcccgcggc gaggtcggtg agcacggcgt gcacgggcgt 2866981 tggcagctta tcgccaccag gcccggcaat ggtgcgagcc agatgcaaca ccgaactgac 2867041 cagcagggcc aggtagacgg cctgttgatc gagatcgcgg acagtgctgc gcacccccca 2867101 tcggcggggc gctcgccgcg ccaccatggc agcgttggcg cgcacctcga tgagcccgtt 2867161 cagctgctga tgcagtcgat cagcggctgc catcggccag tcgggcgggg cgctggtggg 2867221 atcgctcacc gtgttcacca gctcggcgag gatgtcgcgc acagcggcca acacgtcggc 2867281 gcgcgcactg cacagcatga ccaccgggtc gggcgggaag agcagaatgc tgaacacgat 2867341 agccagccca ccaccgacca gcgcgtcgaa gaggcgttcg aaaaccacac tgccgttgga 2867401 cgcgaagacc aagaccagca ccgcggagac ggcggcctgg ttgatgaaca ttaagccttg 2867461 cgcgaccaac ccgcgtgcgc acagcaccgc gaccgacaac gcgatgaaca ccaccacacc 2867521 catggcgatc ggtccggaac caagcagagc atgcacgcca gcacccagca cgatccccag 2867581 cgccaccccg acgatcatct gttgggcacg tcgtgcgcgc agcacgttgg tcgccgacat 2867641 gcacaccaca gccgaaatcg gcgcgaagaa cgcctgcgga tggttgaaca cgtcatgggt 2867701 gagataccac gcgaggccgg cgacgaccga tgtctgggtg atcggccaca gcacggtgcg 2867761 caaccgttgg gcgaccgcac ggccgccgca ggccgtcctg actagcagcg aagcgctcat 2867821 gaacgcctat ttattcacac tcgggtgcga cgtcgtaacc gcaaagatct ggtcatgccg 2867881 ctggacccgc ttgggctggg catctattcc ggactcctta cgttgctgag cggtaatggg 2867941 cgccggcgcg tcggtgagcg gatcgacgcc gccgccggtc ttcgggaacg cgatcacctc 2868001 acggatcgag tccatcccgg ccagcagcgc ggtggtccgg tcccacccga acgcgattcc 2868061 gccgtgcggc ggtgcgccaa acatgaacgc ctccaacagg aatccgaact tttcctccac 2868121 ctcggccttg tccaggccca tcaccgcgaa cacccgttcc tggatatcac ggcggtggat 2868181 acgcaccgag ccgccaccga tctcgtggcc gttgcagacg atgtcgtacg cgtcggccag 2868241 cacgctgccg gtatcggatt cgatgcggtc ctcccattcc ggtttcggcg cggtgaaggc 2868301 atggtgcacc gcggtccagg cccccgagcc gaccgcgacc tcaccggcgg cggtcgcttc 2868361 gtcggccggc tcgaacagcg gcgggtcaac gacccagacg aatgcccacg catcggggtc 2868421 aatcaggccc agccggttgg cgatctcgac gcgggccgcg cccagcagtg cccgcgacga 2868481 tttgaccgga ccggccgaga agaagatgca atcgccgggt ttggccccga catggtcggc 2868541 cagtccggtg cgctcggcct cggtcaggtt tttggccacc ggaccgccca gcgtgccgtc 2868601 ttcggcgacc agcacgtagg ccagtccgcg gtggccgcgc tgcttggccc agtcctgcca 2868661 gccgtccagc gtgcgccgcg gctgcgacgc cccgccaggc atcaccaccg cgcccacata 2868721 cggtgcctgg aagacacgaa atgtggtgtc ggagaagaaa tccgtgcatt cgacgagctc 2868781 cagcccgaac cgcaggtcgg gtttgtccgt accgaatcgg cgcatcgctt cggcatagcc 2868841 gatccgcggg atgggcgtcg gaatccggta gcctatcagc gcccacagct cggtcagaac 2868901 ttcctcggag atcgcgatga tgtcctcggc gtcgacgaag ctcatctcca tatcgagctg 2868961 ggtgaattcg ggctggcggt cggcgcggaa gtcctcgtcg cggtagcagc gggcgatctg 2869021 gtagtagcgt tccatccccg ccaccatcag cagctgcttg aacagctgcg ggctctgcgg 2869081 tagggcgtaa aacgaaccgg ggtgcagtcg ggccggcacc aggaagtcgc gcgctccctc 2869141 cggggtcgag cgggtgatcg tcggcgtctc gatctcgacg aagtcgtgac gcgccagcac 2869201 cgcgcgcgca gcggcattca cccgggaacg cagtcgaatc gccgcagcgg ggtcgtcgcg 2869261 gcgcagatcg aggtagcggt acttcagtcg caactcctca cccgccggtt cgtccagctg 2869321 aaacggcagc ggcgcacatt cgcccagcac ggtcaacgac gtggcgttga cctcgatctc 2869381 gccggtggcg atctccgggt tggcgttgcc ttccgggcgg atctcgacga cgccggccac 2869441 cgatacgcag aattccgcac gcagccggtg agcctgcgcc agcacctcag tgtcctgggg 2869501 gtcgcggaac accacctgtg cgatgcccga agcgtcccgc agatcgatga agatcacgcc 2869561 gccgtggtcg cggcggcgag ccacccagcc ggccaatgtc acctgctgcc cggcgtcgcc 2869621 ttcccgtagc aaacccgcgg cgtggctgcg cagcacaaac actccccttc aaccggatta 2869681 accgactgct cagtctagag ctgcccgcgg cgcacatcgg tcacgcaggg taatttcggc 2869741 tcatctcaac aaacattgca acaggcattg ccctagtcgg acccggtgcc gtcggaacga 2869801 cggtcgccgc gctgttgcac aaggccgggt attcgccgct gttgtgcggc cacactccgc 2869861 gcgccgggat cgagctccgg cgagacggcg cagaccccat cgtggtgccc ggtccggtgc 2869921 acaccagtcc tcgggaggtt gccggcccgg tcgatgtgct gatcctggcg gtcaaggcca 2869981 ctcagaacga cgccgcacgt ccctggctga cccgcctgtg cgacgagcgc accgtggtgg 2870041 ccgtgctgca aaacggtgtc gaacaggtcg agcaggtcca gccgcattgt ccgtcctcgg 2870101 ccgtggttcc cgcgatcgtg tggtgttcgg ccgagaccca gccgcaaggg tgggtgcgct 2870161 tgcgcggtga agccgcactg gtcgttccca ccgggcccgc ggccgagcag ttcgccgggc 2870221 tgctgcgcgg tgccggcgcc acggtggact gcgaccccga cttcaccacg gcggcctggc 2870281 gcaaactact ggtcaacgcg ctggcgggat ttatggtgct gtccggacgg cggtcggcaa 2870341 tgttccgccg cgacgacgtc gcggcattgt cgcgccgcta tgtcgccgaa tgcctggcgg 2870401 tggcgcgcgc tgagggtgcc cgactcgatg acgacgtcgt cgacgaagtg gtccgcctcg 2870461 tccggtcggc cccgcaggac atgggcacct cgatgctggc cgaccgggca gcccaccggc 2870521 cactggaatg ggatttgcgc aatggggtga tcgtccgcaa ggcccgcgcc cacggcctgg 2870581 ccaccccgat cagcgacgtg ctggtgccgc tgctggcggc tgctagcgac ggtcccggat 2870641 agcaatgtag ctaatgtcta gatcatgtac ccctgcgagc gggtaggcct gagcttcacc 2870701 gagaccgcgc cttacctctt ccgcaacacc gtcgacctgg ccatcacgcc cgagcaactc 2870761 ttcgaagtgc tcgccgaccc gcaggcctgg ccacgctggg caacggtgat cacaaaggtg 2870821 acctggacca gtcccgaacc gttcggcgcc ggcaccaccc gcatcgtcga gatgcgcggg 2870881 ggtatcgtcg gcgacgaaga gttcatttcg tgggagcctt tcacccgcat ggcatttcgg 2870941 ttcaacgaat gctccaccag agccgtcggc gcgttcgccg aagactatcg ggtgcaggcc 2871001 atccccggtg gttgccggct gacctggacc atggcgcaga aactcgccgg cccggcgcgg 2871061 ccggcgctgt tcgtcttccg gcccctgctg aacctggcgc tgcgccggtt tctaaggaat 2871121 ctgcgcaggt ataccgacgc tcggttcgcc gctgcgcagc agagttaggc tggatcggcc 2871181 gatttcggga gcgtgcgatg accttcaacg agggtgtgca aatcgatacc agcaccacgt 2871241 cgacctcggg tagcggtggc gggcggcgct tggccatcgg gggcggcctc ggtgggctac 2871301 tggtggtggt ggtcgcaatg ctgctcggcg tcgatcccgg tggcgtgctg agccaacaac 2871361 ctctcgacac ccgcgaccac gtagcacccg gtttcgacct gagccagtgc agaaccgggg 2871421 ccgatgccaa caggttcgtg cagtgccggg tggtggccac cggtaactcc gtggacgcgg 2871481 tatggaaacc gctgttgccc ggctacaccc gcccacacat gcggctgttc agcggccagg 2871541 taggcaccgg atgcggaccg gccagcagcg aggtcgggcc gttctactgc ccagtggaca 2871601 aaacggccta cttcgacacc gacttcttcc aggtgctggt cacccaattc ggttccagtg 2871661 gcggcccatt cgcggaagag tatgtggtgg cccatgaata cggccatcac gtgcagaacc 2871721 tgctgggggt gctcggccgc gctcagcagg gtgcgcaagg tgctgcgggc agtggcgtgc 2871781 gcacggagtt gcaggcggac tgctacgccg gggtgtgggc atactacgcg tccaccgtca 2871841 agcaggagag caccggtgtg ccttacctgg agccgttgag cgacaaggac atccaagacg 2871901 ccctcgcggc cgcggcagcg gtgggcgacg accgtatcca acagcagacg accggacgca 2871961 ccaaccccga gacctggacg catggctcgg ccgcgcaacg gcagaagtgg ttcactgtcg 2872021 gataccagac tggcgacccc aacatctgcg acaccttttc cgccgcggac ctggggtagg 2872081 cgaattacca gggacgagtc gagcactgca cgccgctgcc gccgtcctgc gacaccacca 2872141 cctggccgtc tacaacaatc tcgcagtgga actccggatt gacccgcagg ccgccgctgg 2872201 cggtgacgat cgcccactgg ctcgggtttg ccagcgtggc ggtatagacc agcggctgac 2872261 cgccagcgat cggagtgtgc aaggtaatca tgtacttcga tgaatcggca ttgaaagccg 2872321 ccatgctggg cggatcggcg ctcatgtacc gaatgttggc catcaggtcg ctggtggtcg 2872381 tgacggtgta ggtcacctga tgcccgaccg gatccgcgcg ggcaatcgcc gggatgaccc 2872441 cgctgagcgc ggctccggca aacgtcacca gcgcgacggc gcttggcact gtgcgcacgg 2872501 acgtcatatc taaaacgcta ccggatgcgt taccgacgcc ggccggcact gcatgcgatg 2872561 accgtcgccc gccatccggg caagccgaat tgcgtgagcc gcaccgccat tagcagccga 2872621 aagctgtcgt tggcctcggg cttcgcgctc tggaggcgat cgctggtgtg agcgtctacg 2872681 cagttcagaa agcctttccg agcaacgcgc cgaggtaact tcagatttcg gcagccggtt 2872741 tacccgcagg taaaccaggg cgggtatgaa acgtgagtgg gcgccgatct gaagcagccg 2872801 caggatgccg attcaccccc gaaaggggtt agccgccgta ggttcctgac gacgggcgcg 2872861 gcagcggttg ttgggacagg tgtcggcgcg ggcgggaccg cgctgctgtc gtcacacccc 2872921 cggggtcctg ccgtctggta tcaacgtggt cggagcggcg cgcctccggt gggtggtctg 2872981 cacctgcagt tcggccggaa tgccagcacc gaaatggtgg tgtcctagca taccacggac 2873041 accgtcggca atccgcgagt catgctgggc acgccaacct ctggcttcgg cagcgtcgtg 2873101 gtggccgaga cccggtcgta ccgggatgcg aagtccaata ccgaggtgcg cgtcaaccac 2873161 gctcacctga ccaacctgac acccgatacc gactacgtct acgccgcggt gcacgacggt 2873221 acaactccgg agctcgggac cgcacggacc gcaccgtcgg gtcgaaaacc gctacgcttc 2873281 accagcttcg gtgatcagtc cactcccgcg ttgggcagac tggccgacgg gaggtacgtc 2873341 agcgacaaca tcggatcccc cttcgccggt gacatcacga ttgcgatcga gcgtattgcc 2873401 ccgttgttca acctgatcaa cggtgacctg tgttacgcca acctggcaca agaccgaatt 2873461 cgcacctggt cggactggtt tgacaacaac acccgctcgg cgcgctaccg gccgtggatg 2873521 ccggcagcgg gcaatcacga gaacgaagtc ggtaacgggc caatcggtta tgacgcctat 2873581 cagacctact ttgcggtacc cgactcggga tccagcccgc aactgcgcgg gctatggtac 2873641 tcgttcaccg ccggctcggt gcgggtgatc agcctgcaca acgatgatgt gtgctaccag 2873701 gacggtggca actcctacgt acgcggctat tcgggcggcg aacaacggcg ctggctgcaa 2873761 gccgaactcg ccaacgctcg gcgcgactcg gaaatcgact gggtggtcgt ctgcatgcat 2873821 cagaccgcga tctccaccgc cgacgacaac aacggtgccg acctcggaat ccggcaggaa 2873881 tggctaccgc tgttcgacca gtaccaggtc gacctggtgg tgtgcggcca cgaacaccac 2873941 tacgagcggt cacatccgct gcgcggggcc ctgggcaccg atacccgaac accgataccc 2874001 gtcgacaccc gcagcgacct catcgactca acccggggaa ccgtgcacct ggtaatcggt 2874061 gggggcggca cgtcgaagcc gaccaacgcg ctgctcttcc cgcagcctcg gtgccaggtg 2874121 ataaccggcg tcggggattt tgatcccgcg atccggcgta agccgtccat attcgtgctc 2874181 gaggatgcgc cgtggtcggc gttccgcgac cgcgataatc cttacggctt cgtggccttc 2874241 gacgtcgacc cgggtcaacc cggcggcact acctcgatca aggcgacgta ttacgcggtg 2874301 actgggccgt tcgggggact caccgtcatc gaccaattca ccttgaccaa gccgcgcggc 2874361 ggatagctca gaacagggtc gcctgaacgg gtaccagtgc cgcttcggtc tccggcggcg 2874421 ccgggcgatg atcacccgcc aaccgatact ttgcgatcag cggtgccacc cgttcccgca 2874481 gcatctcgcg gtagctcggc ggtagatatg gcccgcgccg gtacagttcg cggtaccggc 2874541 tgaccagttc gggatgcgcg cgggccagcc agcacatgaa ccagccgcgc gtcgaacccc 2874601 gcagatgcag gccaaagacc gttacaccgg tggcgcctgc ggccgcgatc tggcccaaca 2874661 gttggtcaag gtgctcgccg gagtcggtga gttgtggcag caccggcgcg accatcacgt 2874721 gacagtccaa gccggcggcg cgaattgcgg taatgagcgc cagccgcgcc tgcggtgttg 2874781 gcgtacccga ctcgacatcc cggtgcagct ccgggtcgcc aacggccagc gacaccgcca 2874841 ccgacaccgg cacttgttgg gcggcctcgg cgatcaacgg caagtcccgt cgcagcaggg 2874901 tgcccttggt caggatcgac agcggcgtac cggatgccgc cagcgcgccg atgatgcccg 2874961 gcatcagggc gtagcggccc tccgcgcgct ggtaggggtc ggtgttggtg cccaacgcga 2875021 cggtctcgcg ccgccaggac ggccggcgca actcgtgacg cagcacagcg gcgacgttgg 2875081 tcttgaccac cacctgggtg tcgaagtcgg tgcccggatt gaagtccagg tactcgtggg 2875141 tggggcgggc gaaacaatag cgacaagcat gcgagcagcc gcggtagccg ttgacggtgt 2875201 agcgaaacgg caacgcggcc gcgttgggca ccttgttcag cgctgatttg cacaacacct 2875261 cgtggaaggt gatgccgtcg aattgtggcg cgcgaacgct gcggaccagg ccgatccgct 2875321 gcaaccccgg cagcgccccg tcgtcaacgg gcatcccgtt caccgcgacg gcttgccggg 2875381 cccaacgcat accattattc gaacaaccgt tctatacttt gtcaacgctg gccgctaccg 2875441 agcgccgcac aggatgtgat atgccatctc tgcccgcaca gacaggagcc aggccttatg 2875501 acagcattcg gcgtcgagcc ctacgggcag ccgaagtacc tagaaatcgc cgggaagcgc 2875561 atggcgtata tcgacgaagg caagggtgac gccatcgtct ttcagcacgg caaccccacg 2875621 tcgtcttact tgtggcgcaa catcatgccg cacttggaag ggctgggccg gctggtggcc 2875681 tgcgatctga tcgggatggg cgcgtcggac aagctcagcc catcgggacc cgaccgctat 2875741 agctatggcg agcaacgaga ctttttgttc gcgctctggg atacgctcga cctcggcgac 2875801 cacgtggtac tggtgctgca cgactggggc tcggcgctcg gcttcgactg ggctaaccag 2875861 catcgcgacc gagtgcaggg gatcgcgttc atggaagcga tcgtcacccc gatgacgtgg 2875921 gcggactggc cgccggccgt gcggggtgtg ttccagggtt tccgatcgcc tcaaggcgag 2875981 ccaatggcgt tggagcacaa catctttgtc gaacgggtgc tgcccggggc gatcctgcga 2876041 cagctcagcg acgaggaaat gaaccactat cggcggccat tcgtgaacgg cggcgaggac 2876101 cgtcgcccca cgttgtcgtg gccacgaaac cttccaatcg acggtgagcc cgccgaggtc 2876161 gtcgcgttgg tcaacgagta ccggagctgg ctcgaggaaa ccgacatgcc gaaactgttc 2876221 atcaacgccg agcccggcgc gatcatcacc ggccgcatcc gtgactatgt caggagctgg 2876281 cccaaccaga ccgaaatcac agtgcccggc gtgcatttcg ttcaggagga cagcccagag 2876341 gaaatcggtg cggccatagc acagttcgtc cggcagctcc ggtcggcggc cggcgtctga 2876401 ccgcaaccgg gcctcatgct aggccaccgg cgaccgacgg acttcccgcg cgagccgctc 2876461 caaaagcctc agccgctcgg ggtggtcggc tcgtcaaacg acagccctat cagccgagac 2876521 accacgttgt gcagcgcgtc aaacacctcc aggatctctt ctcggctact cgaaacccat 2876581 gtttgaaacg tatgacgccc accgacaaga atggccgcct tgaggccctg cggccacggt 2876641 ggcgcaagtg atttcggtga ctccggctgg aagcggcgac tacccagcca gccgcgaaat 2876701 tacttcggcc acaaccgaat ccatcgagac cgaaacttgc tcacccgtcg tcaagtcctt 2876761 cactgcgacc gtcccggcct cgatgtcgcg gtcgcccgct accaacgcaa cacgggcgcc 2876821 ggaacgagcg gccgcgcgca tcgcgccttt gagcccgcga tcaccatagg caaggtcaac 2876881 ccgcaccccg gccgcgcgca gtcgtccagc cagcaccgcc agcctgagct tggccgcctc 2876941 gccaagcggc acgccgaaca cgtcgcaccg ggcgctgtcc cccgccgtct tgccctcggc 2877001 ccgcagcgcc agcacggtcc ggtccacgcc cagcccgaac ccgatgcccg acaagtcctg 2877061 cccgccaagc tggtgcatca ggccgtcgta gcgccccccg ccgccgatcc ccgattgcgc 2877121 accaagcccg tcatggacga actcgaaggc ggtcttggtg tagtagtcca ggccgcgcac 2877181 catgcgcggg ttgatgacat agggcactcc aagcgcgtcc agatgggcga gcacggtgtc 2877241 gaaatgctgc ttggcgacat cagacagatg atccagcaac accggcgccg acgccgtcat 2877301 cgcacgcaat tcgggtcgct tgtcgtcgag cacccgcagc ggattgatcc ctgcgcgcct 2877361 gcgggtgtcc tcgtcgagat cgagtccaaa caagaactcc tgcaacagtt cccggtactg 2877421 cggacggcaa ctctcgtctc ccagggaggt gatttccagc cggaacccgt cgagacccaa 2877481 cgagcggaac ccggcgtcgg caatggcgat cacctcggcg tccaacgccg ggtcgtcgac 2877541 gccgatcgcc tccaccccga cttgctgtaa ctggcgatac cggccggcct gcggacgctc 2877601 gtagcggaaa aacgggcccg cataacacaa cttcaccggc agcgcgccgc gatccagccc 2877661 gtgttcgatc accgcacgca ccaccccggc ggtgccctcg ggccgcagcg tcaccgagcg 2877721 gtcgccacgg tcggcgaacg tatacatctc cttggacacc acgtcggtgg attcacccac 2877781 gccccgggcg aacagggcgg tgtcctcgaa gatgggcagc tcgatgtggc tatagccggc 2877841 ttgacgggcc gccgcgagca gcccgtcgcg caccgcgacg aactgcgccg agtcgggcgg 2877901 gacgtagtcc ggtaccccct tgggggccga aaatgacgag aattccgtca ccggctcaag 2877961 ccctcaagga acggattgaa gcgccgctcg gccccaatgg tggtggagtt gccgtgcccg 2878021 ggcagcacca ccgtgctgtc gtcgagcacc aggagtttgt cgacgatgga gcgcaacagg 2878081 tcgcggccgc tgccgccggc caagtcggtg cggcctatcg cacgctcgaa cagggtgtca 2878141 ccggtgaaca cgatgtcctt gtcgttgttg gtcgcctgca ggacccggaa gaccaccgac 2878201 ccgcgggtgt gacccggtgt gtgatcgatg ttgaccgaga tgccgccgag gtcgatcttg 2878261 tcgccgtctc ggtccagctc cacaacctgt ttaggctcac gaaagaacgc acccgcaacc 2878321 agctgcgcta tccgcgggcc caggccgtag atggggtcgg tcagcatgaa ccggtcggcg 2878381 ggatgcacat aggtggggca gccgaaggtg tctgagacct tctgcgcgga ccagatgtga 2878441 tcgatgtgtc cgtgggtgag cagcaccgcg gcaggggtca gccggttctt gtcgaggatg 2878501 cgacgcagcg tgcccatcgc accctggccc ggatcgacga tgacggcgtc ggttccgggc 2878561 cgctcggcca gcacataaca gttacacgcc agcaaccccg caggaaatcc ggtgatcaac 2878621 acggttccca gtttcccatc cccggcgtcc ggggacgagg cgggccgcga acatgggcca 2878681 cttgacaccg gtcgcggcgc cccgattagc ctgtgctttc gtgccgacca atgctcagcg 2878741 acgtgccaca gccaaacgca aactcgaacg acaactagag cgccgcgcca agcaagccaa 2878801 acgccgtcgc atcttgacta tcgtcggtgg ctcactcgca gcggtggccg tgatcgtcgc 2878861 ggtagtcttc acggtggtgg tcaacaagga cgaccaccag agcaccacgt cagcaacccc 2878921 caccgactcg gcctcgacca gccccccgca ggccgcgacc gctcccccgc tgccgccgtt 2878981 caagccgtcg gccaacctcg gcgccaactg ccagtacccg ccgtcgccgg acaaggccgt 2879041 caaaccggtc aagttgcccc ggaccggcaa ggtacccacc gacccggccc aggtcagcgt 2879101 gagcatggtg accaaccagg gcaacatcgg tctaatgctg gccaacaacg aatcgccgtg 2879161 tacggtcaat agtttcgtca gcctcgcgca gcagggtttc ttcaagggca ccacttgtca 2879221 ccggctgacc acctcaccaa tgttggcggt tctgcaatgc ggcgacccta agggcgacgg 2879281 cacgggcggt ccgggctacc agttcgccaa cgaatacccc accgaccaat actcggcgaa 2879341 cgaccccaag ttgaacgagc ccgtcatcta tccgcgcggg acactggcca tggccaacgc 2879401 cggccctaat accaacagca gccagttctt catggtctac cgggactcaa agctgccacc 2879461 ccaatacacc gtgttcggca cgatccaggc cgacggactg accaccctgg acaagatcgc 2879521 caaggccggc gtcgccggtg gcggcgaaga cggcaagccc gccaccgaag tcaccatcac 2879581 gtcggtgctg ctggattagc ccgacgctcg ccgagcagac acagaatcgc acgaaatcag 2879641 cccgcccaat gcgattctgc gtctgctcgg cggagaaaag cgcgctacgc ggccgaggtc 2879701 acccggtaga cgtcgtagac accttcgacg ttgcggacgg cgttgagcag gtgcccgagg 2879761 tgcttggggt cacccatctc gaaggtgaat cgactgatcg ccacccggtc ccccgaagtg 2879821 gtgaccgacg cggacaggat attgaccttc tcgtcggcca gtgcgcgcgt cacatccgac 2879881 agcagccggt gccggtcgag tgcctcgacc tggattgcca ccagaaacac cgacgacggc 2879941 gacggcgccc atagcacctc gatgatgcgc tcggcctgct gctgcagcga tgcggcgttg 2880001 gtgcagtcgg tgcggtgcac actgaccccg ccgccacggg tgacgaaccc cataatcaca 2880061 tcgcccggaa ccggcgtgca gcacttggcc agcttggtca gcacgcccgg ggcgccgggg 2880121 acggagaccc cgacatcgtc ggtgctgcgt gggcgccgcg gcatggtcgc cggcgtggac 2880181 cgctcggcga gttcctcttc cgcctggtcg ataccgccga gctcggccaa caaccgctgc 2880241 acgacgtgtt tcgccgacac gtgcccctca ccgatggcgg tatagagtgc tgacacgtcc 2880301 gcgtagtgca gctcgcgggc caccgccgcc atggactcac cattgaccaa gcgctgcaac 2880361 ggaagtccac cgcggcgcac ctcgcgggcc atcgcatcct taccggtctc caacgcctcc 2880421 tcacgccgct ccttggcgaa ccactggcgg atcttcgtct ttgcgcgcgg cgacaccacg 2880481 aactgctgcc agtcccgcga cggcccggcg ttcggcgcct tggacgtgaa aacctcgaca 2880541 acttctccgt tttccagctt gcgttccagc gctaccaacc ggccgttcac tcgggcgccg 2880601 atgcagcggt ggcccacctc tgtgtgcacc gcgtaagcga agtccaccgg cgtcgaaccg 2880661 gttggcagcg tgatcacgtc gcccttgggg gtaaacacga aaatctcttg caccgcaagg 2880721 tcgtagcgca atgattccaa gaactcaccg gggtcggccg cctcacgttg ccagtcgagc 2880781 agctgacgca tccaggccat gtcgtcgatc tccgcggcgg catgcggatg aagaacaccg 2880841 ttgcggccct tggcttcttt gtagcgccaa tgcgcggcga tgccgtattc ggcggtgcgg 2880901 tgcatgtcgc gggtacggat ctgcacttcc agcggcttgc cctcaggccc gaccacagtg 2880961 gtgtgcagtg actggtacac accgtatctg ggctgggcga tgtagtcctt gaaccgaccc 2881021 gccatcggct gccatagcga atgcactacg ccgacagccg cgtagcagtc ccggatttcg 2881081 tcgcacagga tgcgcacacc gaccaggtcg tggatgtcgt cgaagtcgcg gcccttaacg 2881141 atcatcttct ggtagatcga ccaatagtgc ttggggcggc cctccaccgt cgccttgatc 2881201 ttcgacgcgg tcagcgtgtt gacgatttcg gcacgcacct tggccaggta ggtgtcccgg 2881261 gacggcgcgc gaccggcgac cagccggacg atctcctcgt acttcttggg atgcaggatc 2881321 gcgaaggaca ggtcctccaa ctcccacttg acgctggcca tgcccagccg atgcgccagg 2881381 ggtgcaatga cttccaacgt ctcacgggcc ttgcgggcct gcttctccgg cggcaagaag 2881441 cgcatggtgc gcatgttgtg taaccggtca gccaccttta tcaccagcac ccgcggatcg 2881501 cgggccatcg cggtgatcat cttgcgaata gtctcgcctt cggcggcgct gcccaacacc 2881561 acccgatcca gcttggtcac cccgtcgacg agatggccca cctcttcgcc gaattcctcg 2881621 gtcaacgcct ccagggtgta accggtgtcc tcgacggtgt cgtgcagcag cgcggccacc 2881681 aaagtggtgg tgtccatgcc caactcggcc agaatgttgg caacggccaa cgggtgggtg 2881741 atgtagggat caccggactg ccgcaactgg ctggcatgcc tttggtcagc gacctcgtag 2881801 gctcgctgca agatcgacag gtcggccttg ggatagatct cccggtgcac cgccaccaac 2881861 ggctcgagca ccggattggt ggtgctgcgc tgggcggtca tccgccgggc caatcgggcc 2881921 cgcacccgac gcgacgcgct gatgctggtc ttaagagtct cgaccggcga ctcgggcgtc 2881981 tcgagagcgg gctcgagagc cgcagaagcc tccgtgggcg gtgcaaccgc ttgcgccgtg 2882041 agctggtcct cggccacgtt cgtcacctcc gacctagagg atatccctca caggcggctc 2882101 aggctgtgca ccggcagcgg tgcgagcgcc gcgcgaccgc tcaaccccgc aagttccacc 2882161 actacggccg ccccggccac gttggcgcca ccgcgctcaa gcaggcgtcg cgtcgcgccg 2882221 atggtgccgc cggttgctaa cacgtcgtca atgatcacga cacggcggcc cgcaacctcg 2882281 atgccctcag cgagaatctc cagagtggcg gcgccgtact ccctgtagta ctcctcgctg 2882341 agcaccggcc ggggcagctt gccgcccttg cgaacggcca gcacacccac ttcgagccgg 2882401 gtggcgaccg cggctgccac cagaaacccg cgggcgtcga cgccggccac caggtcagct 2882461 ccggacgccc gatcggccag cgcttcggtt accgcggcca atcctcttcg gtcggcgaat 2882521 agcggggtga ggtccttgaa ctcgacgccg ggaaccggaa agtcggccac atcccgggtc 2882581 agcgacgcaa ccacgtcggc cacagatatg gctgagctcc ggcgggactc accgagcgcc 2882641 aatacccgcc cgtcgtcgac ccaacgctgc cggcggcgct tcccccgtgc ctttaaggag 2882701 agccccgtcg cgatcacgtt caacacgtag tcaccagccc atgtaccgcc atggcacaca 2882761 tcctctccca gacagcccgg agcacctgcg acactacgct ccgataggtc cgcttctcgt 2882821 cgtggaattc tgtcaattac ctgcagatgg cactggccat cgtcaccgcg ccagcgccca 2882881 gcgatccatg ttccaccctg ccccccatcg cgtcggattc ctgctcaccg catacatttt 2882941 cgtcgacatc aacaacgtgc gctgctgccg gtacaacggc aaggttggca tctcatccca 2883001 gagcaccggc gcggcctcgg caagcaacct ggcccgctcg gcggggtcgg ccgacaccgc 2883061 gagcgcgctg atgatgccgt cgatctgagc gtttgcgtac cccgatagat tgtttccgtt 2883121 gccgctgtgc aagtcatagg catccatcgc agacgatccg ctcgatccgc tgccggtggc 2883181 cccaccggtg ctcgccaaca atacgtcaat ctttccgtcc cgcagcgctt gcggtccggg 2883241 tgtgtccacc gtcacatccg aaacggtgat cccggccggg gcgcaggcgt cggcaatggt 2883301 tccgatggtg gccgccaacc gagcgttggg cctgccgtag ccgatccgca cggtcagcgg 2883361 cgtaccaccc agcgcgtcgc gagcggcggc ggggtccacc cggccgaact gacgtgcttc 2883421 ggcggcgccg tcggcatcgg tgagggcatc gtcggtcgcc ggggacagcc gcgagttggc 2883481 aatcggaacc ccggcatccc gagcgatcgc gtcccggggt acacacaacg cgagcgcgcg 2883541 gcgggtgcgg ctttgcgcga gtgaaccttg tggtgcgaag atcagctgct cgatcccggc 2883601 cgacgggtag tcggtgcgct ggtagctgtc gggggttacc agggatcccg atgaaccggc 2883661 cgcgacgtcg accacgtcga cgctgcggtt gttgacccgg tcttggatat cggctccctg 2883721 cggccagacg gtgatccgct tcgtgatcgc cttggtgccc caccaacgat cattggcgac 2883781 gagcaccacg gcgccatcgt ccaggacgga ttcgatcttg tacggtcccg acgaggggaa 2883841 gcggctgcgg acttcgtcgt ggctgcggcc cggcttgagg tcccacgtgg aattccacag 2883901 tcgcgcaatc tgttccaccg ctgacacgtt gttgcttagc aacgccgcgg taacatcgat 2883961 gtgcagctgg tcggcgatca cgtgcgacgg catcagcgac gtcgcggtga acagctggga 2884021 gtggtcaacg acactgcgat ccgggatgaa cgacacccgg gcctttttct gccccgccgt 2884081 gcactcgatg ttggcgatgt cgacatagcc ggcctgcgta gcagcgtcga agccgggaaa 2884141 gcggccggat tgggccgccc aggccaatac caggtcgtca caggtcaccg gcctgccgtc 2884201 ggaatagacg gcgtcgtcgg agatctggta gtcgaggatc aacggcgacc cctccaccac 2884261 cgagaccgtt ccgaagtcgc ggtcagccac cacttggccg tcggggccgt gatagccaaa 2884321 cccggtgaga gtccgggcga atgcctgcgc cccggccgac gcggcaccga tgacagtatt 2884381 ggtgttgtag gtgaccagcg cgccgtcgac cacgtagtcg atctgagccg cggcgctgcc 2884441 cgaacacgcg gtcagcgtgg ttgcggcgac caacgtcgcg gtaccaacga ctcgcaggcc 2884501 ggcgatgcgc gtatgacgcc ggcggcgggg ggccaccgcg cctaccgccg accggcgttc 2884561 cgcttgccgg gcggacgcct ggtaccgacg ggacgcactg ggcgcgcccc cggcgccggc 2884621 ttgctggagc cctgggccgc ccgcggggcg gattggctgc tggcctgcgt gatccccacc 2884681 agcgactgtt catcagcggc tgccggctgc tcgccgccat ccgtgctggc gtcctctgat 2884741 cccgccggcg agccggagtt acgccgtttg agcacccgac gggtgtggtt gcgcaccaac 2884801 tccgtgcgct cacggagggt aaccaacagc ggcgtggcga agaagattga cgagtaggtg 2884861 ccgatgatga tgccgatcag ctgcaccagc gccaggtctt tgagagtgcc gacgcccagc 2884921 agccagaccg ccaccaccat cagcgtcaac accggcaaca cgccgatcag gctggtgttg 2884981 atcgaccgca tgaacgtctg gttgatcgcc aggttggcct gctcggcgaa ggtgcgccgg 2885041 gtggtgtgct ggaagccatg ggtgttctcc tcgaccttgt cgaacacgat gacggtgtca 2885101 tagagcgaga acccgagaat ggtcagcagg ccgatgaccg tggccggggt gacttcgaaa 2885161 cccaccaggg aatacacgcc ggcggtgacg gtcaggtcga agagcatggc cgttatcgcc 2885221 gagatggtca tgtagcgctc gtagcgcacg gtaatgtaga gggcgaccag caccagaaac 2885281 accaccagcg cgatcaccgc cttcttggtg atctgaccgc cccaggtctc cgacaccgcc 2885341 gagtcgctga tggcctgctt gctgggctga ccgtcggttc ccttgggccc gaaggcctcg 2885401 aatagggcgt cccgcagctt ggccgtctgg tcgctggtca gcgtctccga acgaatctgc 2885461 accgtcgccg aagcaccggc cccgacgatc accaccgact ggggctcact gccgagggcc 2885521 cggtagtaga cgtcttcgac ctgcgcgact tgggtgctgc cacgcgggaa cgacaccgtg 2885581 gtaccgcctt tgaaatcgat gccgaaggtg aacccacgaa agacgatgct ggcgatggcc 2885641 accgcgacga tcgcaccgct cacgccaaac cacaaccggc ggcgtcccac tacctcaaac 2885701 gccccggtgc cggtgtacag gcgcgaaagg aagctatggt gccccagctt cgaggcggtg 2885761 tctgtggtgc tgtcgccgtc ggtccgcgcc acagcactct cggtggcctc ggtgagttcg 2885821 accgccgacg tggcttcgtc gtcgcggccg gtctttgctt tcgacgccat cggctatccc 2885881 cgtcccgtcc gagccatggc ccggcgttcg cgtgcgacct gctgcaccgc tcccaggccg 2885941 ttgtatgccg gcttggccag cagcgacgat ttggacgcca gatacaccaa cggccacgtc 2886001 accaagaaca ccacgacgag gtccaggatc gtggtgaggc ccagggtgaa cgcgaacccc 2886061 ttcacctgac cgatcgccag aaagtacagc acggcagcgg ccaggaaagt gacggcgctg 2886121 cccgacacga tcgtcttgcg ggcacgcgcc caaccgcgcg gcactgccga ccggaacgaa 2886181 cggccttcgc ggatctcgtc tttgatgcgt tcgaagaaca ccacgaacga gtcggcggtg 2886241 gtcccgatac cgatgatcag gcccgcaata ccagccagat ctagggtgta gttgatatat 2886301 cggcccaaga gcaccaggat cgcaaaaacc attgagccag aagccactag cgacaaggcc 2886361 gtgagcagtc ccagcactcg gtagtagagc agcgaataca ccagcaccaa cagcaggccg 2886421 atcgcacccg cgatcatgcc cgcgcgcagc gatgacaacc ccaaggtcgc cgaaaccgtt 2886481 tgggcttccg acggttcgaa ggacagcggc agcgacccgt acttgaggac gttggcgagc 2886541 tggcgtgcgg tcgccgcggt gaatggcgga tccccaccgc tgatctgggt tcggccgccg 2886601 gggatcgctt cctggatctg cggtgcactg acaacctgcg agtccagggt gaacgccgtc 2886661 tgggtgccga tatgggcggc ggtgtagtcg gcccagatgt tggccgccgg acccttgaac 2886721 tgcaggtcga cgacgtagcc gatgccgcgc tggtccatac ccgaggtggc gttttggatc 2886781 tggtcgccgc tgatgatcga cggcgccagc aggtacgcgg tcttgtggtc ggtcgagcag 2886841 gtcaccaacg gcagtttcgg gtcgtcgttg ccggccaaaa tgtcgtcgct ctcgcagcgg 2886901 gtcgcctgga attgcagtgc aaccatctgc atgtattggt tggtgctctg ccgcagcttc 2886961 ttctcctggg cgatgcgctc ggcgagatcc ttgcgcggat ccgtggccgg cgcctcagcg 2887021 ggcggcgccg gcggcgggct ggccggtgag gtcgggttgg gcgatggcgc cgggtcctgc 2887081 ggatagggcc gcggttgggc cccaggttgc ggtgaagccg gcgcccccga ttgggctggc 2887141 ggcggtgcgg cgggttgacc gggcggctgc ggttcggcgc tgggtgccgg ctgcggttct 2887201 tcggctgcgg gctgcgccgg catcgagttg agcaccggcc ggatgtacag ccgagcggtc 2887261 tgtccgaggt tgcgtgcctc gctgccgtcg ttgccgggca ccgtgatgac caggttgtca 2887321 ccgtcgacga ccacctccga cccggacact cccagcccgt tgacccgcgc gctgatgatt 2887381 tgctgcgcct gtgccagcgc ttcccggctc ggggccgagc cgtccggtgt gcgcgcggtc 2887441 agcgtgaccc tggtgccgcc ctgcaggtca atgccgagtt tgggggcggt gtgcttgtcc 2887501 ccggtgaaaa acaccagcaa atagatgccg atcagcatca ccaggaacac cgacaggtaa 2887561 cgggcagggt gcaccggcgc cgaagacgat gccacgttcc ttgtatctcc tcgagaatca 2887621 gttttctacc cccgacagag cctacgtgtc gcgccggggc gcgtcgcgca agcggctcgt 2887681 cggttccggt cggccggttg ccggtcagga atcgttggtc acccggcgct cgccggccac 2887741 gtcgtcaaca tccttgtcaa ggtcctcgtt gagctcctcg tcgatgtcgt cgtccggcag 2887801 aattcggtca cgaatcgcca acttcatcca cgtggtgacc accccgggcg cgatctcgag 2887861 gtcgatggtg tcgtcggcaa tggcgacgat ggtggcttcc agcccagaag tcgtgtgtac 2887921 ccgctccccg ggctgcaacg agtcgtgcag atcgatggtg gcttgcatgg cccgtcgctg 2887981 gcggcgcgac gcgaagtaca tgaacccacc catgatgagc aggaacggca agaacaaaac 2888041 gaaactctcc atcaacccgt ctttcgtatt ggtattgcga tcacggtgcc aggcctaccc 2888101 gcgggccgcg cacctggtaa cagtccagtg tgcccgtcca gtctggcagg ccggaaacat 2888161 cggtcagcag ataggcttta ccagcgatgt gaaccggcga gccgggtgag gaggatctgt 2888221 ggccagcctg cagcagagtc ggcgcctggt caccgaaatc cccggtcccg catcgcaggc 2888281 actgactcac cgccgggcgg cggcggtgtc cagcggtgtt ggggtcaccc tgccggtgtt 2888341 cgtagcccgc gccggcggcg gcatcgtgga agacgtggac ggtaaccggc tcatcgacct 2888401 gggttcgggc atcgcagtga cgacgatcgg caactcgtcg ccacgcgtgg tggatgcggt 2888461 gcgcacgcag gtggccgaat ttacccacac ctgcttcatg gtgacgccat acgaggggta 2888521 cgtggccgtc gccgagcaac tcaaccggat taccccaggt tcgggcccca agcgctcggt 2888581 gttgttcaat tccggcgccg aggcagtcga gaacgccgtc aagatcgcac gctcctacac 2888641 cggcaagccc gcggtggtgg cgttcgacca cgcctaccac ggtcgcacca acctaacgat 2888701 ggcgctgacc gccaagtcga tgccctacaa gagcggcttc ggtccgttcg cgccggagat 2888761 ctaccgagcg ccattgtctt acccctatcg ggacggcctc ctcgataagc aactggctac 2888821 caatggtgag ctagccgcgg cccgagccat cggcgtcatc gacaagcagg taggcgcgaa 2888881 caacctggcc gccctcgtca tcgaaccgat ccagggcgaa ggcggtttca tcgttccggc 2888941 cgaagggttc ctacctgccc tcctcgattg gtgccgcaag aaccatgtgg tgttcatcgc 2889001 cgacgaggtg caaaccggct ttgcccgtac cggggcgatg ttcgcctgcg agcacgaggg 2889061 ccccgacggt ctagagcccg acctgatctg cacggccaaa ggcatcgccg atggattgcc 2889121 gctgtcggcg gtcaccggcc gcgccgagat catgaacgcc ccgcacgtgg gcggcctggg 2889181 cggcacgttc ggcggcaacc cggtggcctg tgcggccgcg ctggccacca tcgcaaccat 2889241 cgaaagcgac gggctgatcg agcgggcccg ccagatcgaa cgcctggtga ccgaccggtt 2889301 gacgacgctg caggccgtcg acgaccggat cggcgacgtg cgtggtcgcg gcgccatgat 2889361 cgccgtagag ctggtcaaat ccggaaccac cgagcccgac gccgggctga ccgagcggct 2889421 ggcgaccgcg gcccacgccg ccggcgtcat cattttgacc tgcggcatgt tcggcaacat 2889481 catccggcta ctgccgccgc tgaccatcgg cgacgagctg ctgagtgagg ggctggacat 2889541 cgtgtgcgcg atcttggccg acctctgacg gcctgccggc cccgactgcg tcatcccgtg 2889601 ccgcatctca cagccgatca gcagcaggct tgcattgtgt aatatattta ctttagctaa 2889661 cgttctattg gtcgggcgca gcgccgcgcc gtcgatttcc caccctttcc ggcacgccga 2889721 ggtgaccgca tgtcgatcaa cgatcagcga ctgacacgcc gcgtcgagga cctatacgcc 2889781 agcgacgccc agttcgccgc cgccagtccc aacgaggcga tcacccaggc gatcgaccag 2889841 cccggggtcg cgcttccaca gctcatccgt atggtcatgg agggctacgc cgatcggccg 2889901 gcactcggcc agcgtgcgct ccgcttcgtc accgaccccg acagcggccg caccatggtc 2889961 gagctactgc cgcggttcga gaccatcacc taccgcgaac tgtgggcccg cgccggcaca 2890021 ttggccaccg cgttgagcgc tgagcccgcg atccggccgg gcgaccgggt ttgcgtgctg 2890081 ggcttcaaca gcgtcgacta cacaaccatc gacatcgcgc tgatccggtt gggcgccgtg 2890141 tcggttccac tgcagaccag tgcgccggtc accgggttgc gcccgatcgt caccgagacc 2890201 gagccgacga tgatcgccac cagcatcgac aatcttggcg acgccgtcga agtgctggcc 2890261 ggtcacgccc cggcccggct ggtcgtattc gattaccacg gcaaggttga cacccaccgc 2890321 gaggccgtcg aagccgcccg agctcggttg gccggctcgg tgaccatcga cacacttgcc 2890381 gaactgatcg aacgcggcag ggcgctgccg gccacaccca ttgccgacag cgccgacgac 2890441 gcgctggcgc tgctgattta cacctcgggt agtaccggcg cacccaaagg cgccatgtat 2890501 cgcgagagcc aggtgatgag cttctggcgc aagtcgagtg gctggttcga gccgagcggt 2890561 tacccctcga tcacgctgaa cttcatgccg atgagccacg tcgggggccg tcaggtgctc 2890621 tacgggacgc tttccaacgg cggtaccgcc tactacgtcg ccaagagcga cctgtcgacg 2890681 ctgttcgagg acctcgccct ggtgcggccc acagaattgt gcttcgtgcc gcgcatctgg 2890741 gacatggtgt tcgcagagtt ccacagcgag gtcgaccgcc gcttggtgga cggcgccgat 2890801 cgagcggcgc tggaagcgca ggtgaaggcc gagctgcggg agaacgtgct cggcggacgg 2890861 tttgtcatgg cgctgaccgg ttccgcgccg atctccgctg agatgacggc gtgggtcgag 2890921 tccctgctgg ccgacgtgca tttggtggag ggttacggct ccaccgaggc cgggatggtc 2890981 ctgaacgacg gcatggtgcg gcgccccgcg gtgatcgact acaagctggt cgacgtgccc 2891041 gagctgggct acttcggcac cgatcagccc tacccccggg gcgagctgct ggtcaagacg 2891101 caaaccatgt tccccggcta ctaccagcgc ccggatgtca ccgccgaggt gttcgacccc 2891161 gacggcttct accggaccgg ggacatcatg gccaaagtag gccccgacca gttcgtctac 2891221 ctcgaccgcc gcaacaacgt gctaaagctc tcccagggcg agttcatcgc cgtgtcgaag 2891281 ctcgaggcgg tgttcggcga cagcccgctg gtccgacaga tcttcatcta cggcaacagt 2891341 gcccgggcct acccgctggc ggtggttgtc ccgtccgggg acgcgctttc tcgccatggc 2891401 atcgagaatc tcaagcccgt gatcagcgag tccctgcagg aggtagcgag ggcggccggc 2891461 ctgcaatcct acgagattcc acgcgacttc atcatcgaaa ccacgccgtt caccctggag 2891521 aacggcctac tcaccggcat ccgcaagctg gcacgcccgc agttgaagaa gttctatggc 2891581 gaacgtctcg agcggctcta taccgagctg gccgatagcc aatccaacga gctgcgcgag 2891641 ctgcggcaaa gcggtcccga tgcgccggtg cttccgacgc tgtgccgtgc cgcggctgcg 2891701 ttgctgggct ctaccgctgc ggatgtgcgg ccggacgcgc acttcgccga cctgggtggt 2891761 gactcgctct cggcgctgtc gttggccaac ctgctgcacg agatcttcgg cgtcgacgtg 2891821 ccggtgggtg tcattgtcag cccggcaagc gacctgcggg ccctggccga ccacatcgaa 2891881 gcagcgcgca ccggcgtcag gcgacccagc ttcgcctcga tacacggtcg ctccgcgacg 2891941 gaagtgcacg ccagcgacct cacgctggac aagttcatcg acgctgccac cctggccgca 2892001 gccccgaacc tgccggcacc gagcgcccaa gtgcgcaccg tactgctgac cggcgccacc 2892061 ggctttttgg gtcgctacct ggcgctggaa tggctcgacc gcatggacct ggtcaacggc 2892121 aagctgatct gcctggtccg cgccagatcc gacgaggaag cacaagcccg gctggacgcg 2892181 acgttcgata gcggcgaccc gtatttggtg cggcactacc gcgaattggg cgccggccgc 2892241 ctcgaggtgc tcgccggcga caagggcgag gccgacctgg gcctggaccg ggtcacctgg 2892301 cagcggctag ccgacacggt ggacctgatc gtggaccccg cggccctggt caaccacgtg 2892361 ctgccgtata gccagctgtt cggcccaaac gcggcgggca ccgccgagtt gcttcggctg 2892421 gcgctgaccg gcaagcgcaa gccatacatc tacacctcga cgatcgccgt gggcgagcag 2892481 atcccgccgg aggcgttcac cgaggacgcc gacatccggg ccatcagccc gacccgcagg 2892541 atcgacgaca gctacgccaa cggctacgcg aacagcaagt gggccggcga ggtgctgctg 2892601 cgcgaagctc acgagcagtg cggcctgccg gtgacggtct tccgctgcga catgatcctg 2892661 gccgacacca gctataccgg tcagctcaac ctgccggaca tgttcacccg gctgatgctg 2892721 agcctggccg ctaccggcat cgcacccggt tcgttctatg agctggatgc gcacggcaat 2892781 cggcaacgcg cccactatga cggcttgccg gtcgaattcg tcgcagaagc catttgcacc 2892841 cttgggacac atagcccgga ccgttttgtc acctaccacg tgatgaaccc ctacgacgac 2892901 ggcatcgggc tggacgagtt cgtcgactgg ctcaactccc caactagcgg gtccggttgc 2892961 acgatccagc ggatcgccga ctacggcgag tggctgcagc ggttcgagac ttcgctgcgt 2893021 gccttgccgg atcgccagcg ccacacctcg ctgctgccct tgctgcacaa ctaccgagag 2893081 cctgcaaagc cgatatgcgg gtcaatcgcg cccaccgacc agttccgcgc tgccgtccaa 2893141 gaagcgaaaa tcggtccgga caaagacatt ccgcacctca cggcggcgat catcgcgaag 2893201 tacatcagca acctgcgact gctcgggctg ctgtgatcgg gcctggccgc cgcggcgccg 2893261 ggtaaccaag cagcccgtta cgcccagttc gcctatgaga aggcagtaag aagcgcgaaa 2893321 aatggcagac cccgacggag gccctctgaa agagtcttga tcatcagggc gcgtgacatg 2893381 tgtcacatga cgggttggga gggtggctga tgtcgtttgt cacggcagct ccagagatgc 2893441 tggcgacggc ggcgcagaat gtcgcgaata tcggcacatc gctgagtgcg gcaaacgcga 2893501 cggcagcggc gtccacgacc tcggtgctgg cggccggagc cgacgaggta tcgcaggcta 2893561 tcgcaaggct gttcagtgat tacgccacgc actatcagtc gctgaacgct caagccgcgg 2893621 catttcatca cagcttcgtg caaacgttga acgccgccgg tggcgcctat tcgagcgccg 2893681 aggcggccaa cgcttcggcg caggcgttgg aacagaatct gttggccgtg atcaatgcgc 2893741 ccgcccaggc gttgttcggg cgtcccctga tcggcaatgg cgcgaatgga acagcggcca 2893801 gccccaacgg cggtgatggt gggattttgt acggcaacgg cggcaacggc ttctcccaaa 2893861 cgaccgccgg ggtggccggc ggcgccggtg gttccgcggg cctgatcggc aacggcggca 2893921 atggtggcgc cggtggggcc ggtgctgccg gcggggccgg cggcgccggc ggatggctgc 2893981 tcggcaacgg tggcgccggc ggtcccggcg gcccaacgga cgttcctgcc ggcacaggtg 2894041 gagccggcgg ggccggcggc gacgccccat tgatcggctg gggcggcaac ggcgggcccg 2894101 gcggtttcgc tgcttttgga aacggtgggg ccggcggcaa cggcggcgcc agcggttcgc 2894161 tctttggcgt cggcggcgcc ggcggcgtcg gcggatcgag cgaagacgtc ggcggcaccg 2894221 gcggggccgg cggcgctggc cgcggtctat tccttggcct gggcggtgat ggcggcgccg 2894281 gcggcaccag caacaacaac ggcggtgacg gtggcgccgg cggcaccgcg ggaggtcgat 2894341 tgttcagcct gggcggtgac ggtggcaacg gtggtgccgg taccgcaatc ggatccaacg 2894401 ccggtgacgg tggcgccggc ggtgacagca gcgccctgat cggctacgcc cagggcggct 2894461 ccggcggcct cggcggcttc ggcgaaagta ccggcggcga cggcggcctg ggcggcgccg 2894521 gcgctgtgct catcggcacg ggcgtcggcg gtttcggcgg cctcggtggc ggctccaacg 2894581 gcaccggggg cgcgggcggc gcgggcggca cgggcgccac gctgatcggc ctgggcgccg 2894641 gcggcggcgg cagcatcggc gggttcgccg tcaacgtggg caacggcgtc ggcggtctgg 2894701 gcggccaggg cggccagggc gccgcgctga tcggcctggg cgccggcggt gccggcggtg 2894761 ccggcggcgc cacagtcgtt ggacttggtg gcaatgacgg tgacggcggt gacggtggcg 2894821 gcctgtttag tatcggcgtc ggcggggacg gcggcaacgc cggcaacggc gccatgcctg 2894881 ccaatggcgg caacggcggc aacgccgggg tcattgccaa cggctccttt gccccgtcgt 2894941 tcgtcggctt cggcggcaac ggcggcaacg gcgtcaatgg cggcaccggc ggcaccggcg 2895001 gcagcggcgg gatccttttt ggcgccaacg gcgcgaacgg accgtcgtag cgggtcctcc 2895061 agcgcactac tcgaacaacc ccggttgact cgctccgacc ggtggcgtca tgcccaggtg 2895121 cgtccaggcc agggcggtgg ccacccggcc gcgcggggtg cgcgcgacca tacccgcgcg 2895181 caccagaaat ggttcgcaca cctcctcgac cgtggcggcc tcctccccga ccgccaccgc 2895241 cagcgtcgac acacccaccg gaccaccgcc gaagctgcgg gtcagcgccg agagcaccgc 2895301 tcggtccagc cggtccagac ccagctcgtc gacgtcgtag acctccagtg cggccttggc 2895361 gacgtcgcgg gtgatgacgc cgtcggcgcg cacctcggcg aagtcacgca cccggcgcaa 2895421 caaccggttg gcgatccgcg gcgttccccg agaacggcgg gcgatttcgg cgccggcgtc 2895481 ggcgcccagc tcgataccca gaattccggc ggagcgggcc agcacccgct ccagctcggc 2895541 gggctcgtag aaatccatgt gcgcggtgaa gccgaaccgg tcgcgcagcg ggccggtcaa 2895601 cgcgcccgac cgggtagtcg ccccgaccag ggtgaacggc gcgacctcca gcggaatcga 2895661 cgtggcccca ggacctttgc cgaccaccac atcgacgcgg aagtcttcca tcgccagata 2895721 cagcatctcc tcggcgggcc gggcgatgcg gtggatctcg tcgataaaca acacgtcgtg 2895781 ctcgaccagg ttggacagca tcgccgccag gtcaccggcg cgttccaacg ccggccccga 2895841 cgtcacccgc agcgaggacc ccagctcggc ggcgatgatc atcgccaacg acgtcttgcc 2895901 caagcccggc ggaccggaca gcagaatgtg atccggtgtg ccgccgcggt ttttggctcc 2895961 ctcgatgacc agctgcagct gttcgcggac ccggggctgg ccgatgaatt cgcgtaacga 2896021 gcgcggccgc aggctgacgt cgatgtcgcc ctctccgacg gtgagtgcgg gcgaaacgtc 2896081 gcggtcggac cgctcggtca tcgggccttc cccagcaacg acaaggcaga ccgcagcgcg 2896141 ctggatgtcg tcgcgtcatg gttggcggcc agcaccgtat cggtggcctc ctcggcctgt 2896201 ttggccgcaa agcccaggcc gaccagagcc tcgaccacgg gactgcgcac cgcgtggccg 2896261 ttggtcgaga gtgcgccgcc ggtggctgcc accccaacct tgtcgcgtag ttccaacacc 2896321 atgcgttcgg cgccccgctt gccgatccca ggcacccggg tcagggcggc gacgttgccg 2896381 tcggccagca cctgccgtag cgccggagcg tcgtgcacgg ccagtgccgc catcgccagc 2896441 cggggcccaa cgccggagac cgacagcagc gtcaggaata ggtcgcgggt ttccccgtcg 2896501 ggaaacccgt acagcgtcat cgagtcctcg cgcacaatca tcgcggtgat cagccgggcc 2896561 tcggtgcctt gccgcaacgt cgccagcgtc gccggtgtcg cgttcactcg gtagcccaca 2896621 ccggcggcct cgatcaccac atggtcaagc gccacctcga gcacctcacc gcggaccgag 2896681 gcgatcatcg ggcggccttc agcttggcta ggtacgcatg acgctgctgc gctgctcgtg 2896741 cttccgccct cgacgtggcc tcagccatcc gggcgatcgt cggcgcccgc caacagtgac 2896801 agatcgccag cgccaaagcg tcggccgcgt cggccggtgt cggtttagct tgcagcgcaa 2896861 ggattttggt gaccatcgcg gtgacctgag ccttgtctgc ggaaccgttg ccagtgaccg 2896921 ccgccttgac ctcgctgggg gtatggaaat gcacgtcgac accacgtttg gccgccgcca 2896981 gggcgatcac gccgccggcc tgcgcggtgc ccatcaccgt ggtcacgttg agctgagaga 2897041 acacccgttc gatagccacc acctccggat gatgggtgtc cagccagtgc tcgacggcat 2897101 cgctgatggc caacaggcgc tgcgccaagg ccgcatccga cggtgtgcgc accacgtcga 2897161 catccagcgc ggtgagctgc cgaccacgcc cactctcgat aagcgacagc ccgcatcggg 2897221 tcaacccggg atcgacaccc atcacccgca ccgcacgctc cctcagccat ttccgaacaa 2897281 tcgttcgata cgctagcgga tcgtcccgac atcccgcgca ggacacgcct atggaacgtg 2897341 cgatggtaaa tttcctacca tgcgaacaac catcgatgtc gcaggacgtc tggtgattcc 2897401 caagcggatt cgcgagcgcc ttggcttgcg cgggaacgac caggtggaga tcaccgagcg 2897461 cgatgggcgc atcgagattg agccggcccc gaccggtgtc gaactcgttc gggaaggctc 2897521 ggttctcgtc gcacggccag aacgtcccct gcccccgttg accgacgaaa tcgttcggga 2897581 aacgctcgat cgcacacggc ggtgatcgca ccagacacca gcgtgctggt tgccggattc 2897641 gcgacctggc acgaagggca cgaggccgcc gtgcgcgcgc tcaaccgtgg cgtccatctg 2897701 atcgcgcacg cggctgtgga aacctattcg gtcttgaccc ggctaccacc gccgcatcgt 2897761 attgcccctg ttgccgtcca cgcctacttg gcggacatca cctccagcaa ctacctggca 2897821 ctggatgccc gctcatatcg cggcttgacc gaccacctcg ccgagcacga tgtcaccggt 2897881 ggcgcaacct acgatgccct ggtcggcttc acggcgaaag ctgccggcgc aaagctgctg 2897941 actcgcgacc tgcgcgcggt cgaaacgtac gagcgattgc gggtcgaggt tgagctggtg 2898001 acctgagaaa ccgttgccgt tgagtgtgtt tgagttgcac gctcaccgac acccggatgg 2898061 tgcaccagtg agctggggtg accgcggccg agacctgccg ggttcccggc cggacaactc 2898121 gcccgttgtg acccccggtc ccgcgaaagc tgttacgtta aacggcgcca tcgatatgcg 2898181 accgatcgac caaccgcggc gcagcggtac gagagggtat gcgtgggaaa tctgctggtc 2898241 gtgattgccg tggcgctgtt catcgccgcc atcgtcgttc tcgtcgtggc catccggcgg 2898301 cccaaaacac cagccacgcc gggcgggcgc cgggatccgc tggccttcga cgcaatgccg 2898361 caattcggcc cccgccaact cggacccggc gcaattgtca gccacggtgg catcgactat 2898421 gtggtccgcg gatcagtcac ctttcgcgag ggtcccttcg tgtggtggga acacttgctg 2898481 gaaggcggcg acacgccaac ctggctgagc gtgcaagagg acgacgggcg tctcgagctt 2898541 gcgatgtggg tgaaacgcac cgatctgggc ttgcagcccg gtggccagca cgtgatcgac 2898601 ggcgtgacgt ttcaggagac cgagcgcggt cacgccggat ataccaccga gggcacgacg 2898661 ggcctgccgg ccggcggtga gatggactac gtcgactgcg ccagtgccgg tcagggggcc 2898721 gacgagtcca tgctgctgtc attcgagcgc tgggcaccgg acatgggatg ggagatagcg 2898781 accggcaagt ccgtactggc cggcgagctc accgtctacc ccgcgccccc agtctcggca 2898841 tagggccgaa tcggtgccac ttcatcagct cgccatagcg ccggtggacg tatcaggggc 2898901 attgcttgga ctcgtgctga acgcacccgc gccgcggcca ctggccaccc accgactggc 2898961 ccacaccgac ggcagcgcac tgcagctcgg cgtcctcggc gcgtcgcatg tcgtcaccgt 2899021 cgagggacgc ttctgcgagg aagtctcctg cgtggcccgc agccggggcg gcgatctgcc 2899081 cgagtccacc cacgcacccg gctaccacct ccaatcccat accgagacgc acgacgaggc 2899141 ggcgtttcgg cgactcgcac gccacctgcg tgaacgctgc acgcgggcaa ccgggtggct 2899201 gggcggtgtg tttcccggtg atgacgccgc gctgaccgca ctcgccgccg aacccgatgg 2899261 aaccgggtgg cgttggcgga cttggcatct gtacccgagc gcgtccggcg ggacggtggt 2899321 ccacacgacg agccgatggc gtccatgagc cgcaaccgcc tgttcctggt tgccggcatc 2899381 ttggcggttg ccgccgccgt gtccttgatc tctggaatca cgctgctgaa cagggacgtt 2899441 ggctcgtata tcgcctcgca ctatcgccaa gaatcccgtg acgtgaacgg aacgcgatac 2899501 ctgtgcaccg gatcgcccaa acaggtggcc accacgctcg tcaagtacca gaccccggcg 2899561 gcgcgcgcgt cgcataccga caccgagtac ctgcgttacc gcaacaacat cgtgacggtc 2899621 ggacccgacg gcacctatcc gtgcatcatc cgcgtcgaaa acctcagcgc cggatataac 2899681 cacggcgcat atgtcttcct gggccctgga ttcacccctg ggtccccgtc gggcggttcg 2899741 gggggcagcc cgggcggtcc tggcggcagc aagtaaggcg atgacgcaaa ggagagagtc 2899801 atgtatcagg ccggagtcga tttcgggacc atcagcctta ccccgatcct gcatggggtg 2899861 gtggccaccg tcttgtactt cctagtgggc gccgccgtgc tagtcgcagg ctttctgatg 2899921 gtcaacctgt tgaccccggg cgatctgcgt cgcctagtgt tcatcgaccg ccgccccaac 2899981 gccgtggttc tggccgccac aatgtatgtg gcgctggcca tcgtcaccat cgccgccatc 2900041 tacgccagct ccaatcagct ggcccagggc ctgatcggcg tggcggtgta cggaatcgtc 2900101 ggtgtcgcgc tgcagggggt ggcactggtg atcctcgaga tcgcggtgcc ggggcgattc 2900161 cgtgagcaca tcgacgcacc tgcgctgcat ccggcggtgt tcgctaccgc cgtcatgctg 2900221 ctggcggtag cgggggtaat cgccgccgcg ttgtcatgac gtccacccgg caggcgggcg 2900281 aagccaccga agcttcggta cggtggcggg ccgtgctgct ggccgcggtc gcggcgtgcg 2900341 cggcctgcgg tctcgtttac gagctcgcgc tgctgacact ggcggcgagc ctgaacggcg 2900401 gcgggatcgt ggccacctcc ctgatcgtcg cgggctacat agccgcgctg ggagcaggcg 2900461 ccttgctgat caagccgcta cttgcacacg cggccatcgc gttcatcgcc gtggaggcgg 2900521 tgctaggcat catcggcgga ttgtccgcgg cggcgctgta tgcggcgttc gcgttcctgg 2900581 acgagctcga cgggtcgacg ctggttcttg cggtgggcac cgccctgatc ggcgggctgg 2900641 tcggcgccga ggtgccgctg ctgatgacgc tgttgcagcg cggccgcgtg gcaggggccg 2900701 ccgatgccgg acgcaccctg gccaacctca acgcggccga ctatctgggc gcgttggtcg 2900761 gcgggctggc ctggccattc ctgctgctgc cgcagttagg gatgatccgc ggtgcggcgg 2900821 tcaccggcat cgtcaatctg gcggccgccg gggttgtgtc gatcttcctg ctgcgccacg 2900881 tcgtgtccgg ccggcaactg gtgaccgcct tatgcgcgct cgccgcggcg ctcgggctga 2900941 tcgccacact gctggtgcat tcccacgaca ttgagaccac cggccgccaa cagctctacg 2901001 ccgacccgat catcgcctac cgacacagcg cctaccagga aatcgtggtc acccgccgcg 2901061 gcgatgacct gcgcctctac ctggacggag gtttgcagtt ctgcacccgc gacgaatacc 2901121 gctacaccga aagcctggtc tacccggcag tctccgatgg cgcgcgttcg gtgctggtgc 2901181 tcggtggcgg cgacggactg gcagcccgcg aactgctgcg ccaacccggc atcgagcaga 2901241 tcgtgcaggt ggaactcgac cccgcggtca tcgaactggc gcgcaccacc ctgcgcgacg 2901301 tcaacgccgg ttcgctggac aacccgcgcg tacacgtcgt gatcgacgac gccatgagct 2901361 ggctacgcgg cgccgcggtc cccccggctg gcttcgacgc agtgatcgtc gaccttcgcg 2901421 accccgatac tcccgtgctg ggtcggctgt attccaccga gttctacgca ctcgccgccc 2901481 gcgcgctcgc gcccggcggg ctcatggtcg tgcaggcagg cagcccgtat tcgaccccga 2901541 ctgcgttctg gcgcatcatc tccacgatcc ggtccgccgg gtatgccgtc acgccctacc 2901601 acgtgcacgt gcccaccttc ggcgactggg gattcgccct ggcacgcctt acagacatcg 2901661 cgcccacccc cgctgtgccg agcactgccc ctgcactgcg cttcctggac caacaggtgc 2901721 tcgaggccgc gaccgtgttt tccggcgaca tccggccccg cacgttggac ccgtcgaccc 2901781 tggacaatcc gcacattgtt gaggacatgc ggcacggctg ggactagcgc acccatctag 2901841 ggcggccagg gtttgcacaa cgcagcacgg gttccgaacg gaaccggggc ccgctcgtag 2901901 cccggccata aaagcataaa aacagtatgc tgggtaaatg aagaccacgc tcgacctgcc 2901961 tgatgaactg atgcgcgcta tcaaggtccg cgcggcgcag cagggccgca agatgaaaga 2902021 tgtcgtgacc gaactgctca gatccggtct gtcccagacg cacagcgggg ctccaatccc 2902081 aacgccgcgg cgcgtgcagc ttcccctggt gcattgcggt ggcgcggcta cccgcgaaca 2902141 agaaatgacg ccggagcgtg ttgccgcggc cttgctcgac caggaggccc agtggtggtc 2902201 cggacacgac gatgctgctc tgtgacacca acatctggct ggcgttggcg ctttccggac 2902261 acgtgcacca cagggcctcg cgcgcatggc tagacaccat caacgcgccc ggagtcatcc 2902321 acttttgccg cgcaacccaa cagtcgctcc ttcggctgtt gacgaatcgg acggtgctgg 2902381 gcgcgtatgg cagcccacca ctgaccaacc gcgaagcgtg ggcggcctat gccgcgttcc 2902441 tggatgacga ccgcatcgtg ctggccggcg ccgaacctga tggtttggag gcccagtgga 2902501 gagccttcgc cgttcgccag tcgccggcgc ccaaggtttg gatggatgcc tacctagctg 2902561 ctttcgcact taccggtgga ttcgagttgg tgacgactga caccgccttc acccagtacg 2902621 gcggaatcga gctgcggctc ctggccaagt gacagcgcaa gccccgcagt gctcactcgt 2902681 cgtcgagggc ggccagcacc tcgtcggaca cgtcgacgtt ggtccacacg ttctgcacgt 2902741 cgtcactgtc ttctagcgcg tcgacgagct tgaacacttt ccgtgcgccg tccaggtcca 2902801 cgggcacgct gaccgagggt tgaaagctgg cctcggccga ttcgtaatcg atgccggcat 2902861 cttgcaaagc gctacgaacc gcgaccagtt ccgcgggctc ggagatgacc tcgaaactgt 2902921 cgcccaggtc gttgacgtcc tcggcaccgg cttccagaac agccgccagc acatcgtctt 2902981 cggtcaagcc gttcttttcc agggtcacca cgcctttgcg ggagaacagg taggacaccg 2903041 accccggatc ggccatggtg ccaccattgc gcgtcatcgc cacccgcacc tcgctggcgg 2903101 cgcgattgcg gttgtcggtc agacactcga tcagcaccgc caccccgttg ggcgcgtagc 2903161 cctcgtacat gatggtctgc cagtcggcgc cgccggcctc ctcgccggcg ccgcgcttgc 2903221 gggcccgttc gatgttctcg ttgggaaccg agctcttctt cgccttctga atcgcgtcgt 2903281 agagcgtggg gttgccggcc ggatcaccgc caccgacacg cgccgccacc tcgatgttct 2903341 tgatcagccg ggcgaacatc ttgccgcggc gggcgtcgac gacggccttc ttgtgcttgg 2903401 tggtggccca cttggaatgg ccgctcatcg cagtgattta cctcttctgt tgctcgttcg 2903461 ccagacgagt ctacgtgggg gttgtgggcg gcgagccaac cggcacgagc agacacaaaa 2903521 gctccaaatt tcggcctgaa acgggtgctt ttgcgactgc tcacgccgcg gaggtgacga 2903581 tgtcgacgaa caactgatga atgcggcgat cgccggtcat ctccggatga aacgcggtgg 2903641 caagcaccgc accctggcgc accgcgacga tgtgccccgc cgcgcgggcc agcacctgca 2903701 caccgtcacc gactcgctca acccatggcg cccggatgaa caccgcgcgc accggatcgt 2903761 ctagaccagc gaactcgata tcgccttcaa acgagtcaac ctgacttcca aaagcattgc 2903821 gccgcaccgt catattcatc gcacgcaggg gcagcgcctg gcggcctgcc gcaccggcgt 2903881 ccaggatctc gctggccaac agaatcatgc ccgcgcacga accataggcc ggaagcccat 2903941 cggcgagccg ggcccgcagc ggtcccagca ggtcgaggtc gagcagcagg tggctcatcg 2904001 tggtggattc cccgcccggg atgaccagcg cgtccaccgc gtcaagttcg tcgcggcgcc 2904061 gcaccgtcat cggctcggcc ccgcattcgc gcagcgcagc caggtgctcc cgggtgtcgc 2904121 cctgcagcgc cagcaccccg acccgtggaa cgctcacagc ccgctcactg ccccaccgac 2904181 cggtgaccgc gccggtggcg ggtcagcccc tcctgcatga ccgccgcgac catctccccg 2904241 gaccgtgtga aaatctcgcc ccgagtcagc gcacgaccgc cgctggccga cggcgacgac 2904301 tggtcgtaca gcaaccactc gtcggcgcgg aagggtcgca tgaaccacat cgcatggtcc 2904361 agcgatgcca cctgcagctg gtcgcgcaca tcgaggtggt tgacttgtgc cgatcccagc 2904421 agcgtgaggt cgctcatgta ggcgagtgca cagatgtgca acaccgggtc gtcgggcaac 2904481 gggtcacggt ggcgaagcca cacctgctgc tgggaagcct tgcccggcaa aagccgcagg 2904541 cgctcccggg gcacgatgca cacgtcccac tcgtcgaact gccggaaccc ggcatcatcg 2904601 aaaaccttga tcgagttcaa ccccggcagg ccgtcgggcg gcggcgccgc tggcataacg 2904661 tcttggtggg taatgccctc ctgttcggtc tggaacgacg ccgccatgct gaatatggtt 2904721 tccccgtgct ggactgcgtt gacccgcctg gtgcagagcg atccaccgtc gcggatgcgt 2904781 tcgaccagaa aaaccgtgcg ctccttggca tctccaggcc gaagaaaata gccgtgcagc 2904841 gagtgcacca tgtaccgcgg gtcgacggtg cgcaccgccg acaccagcga ctggccggct 2904901 acatgaccac cgaaagtgcg ttgcaggaag cccgattcgg ggctgaacac gcttcctcgg 2904961 tagatgttga cctcaagttg ctcaagatca aggatctctt cgatcgacac gcgatgaccg 2905021 tctgctcgtc gcgggttctc accagccgcg ctgggcgagc cgatgaccga cagcgatctc 2905081 gtccacgttg atgcccacca tcgcctcgcc cagcccgcgc gacaccttgg ccagcacatc 2905141 gggatcgtcg aagaacgtgg tggccttgac gatcgcggcg gcgcggtgct caggggcgcc 2905201 ggacttgaaa ataccggaac ccacgaagac gccctcggcg ccaagctgca tcatcatcgc 2905261 cgcgtcggcg ggcgtggcga tacccccggc ggtgaacagt gtgaccggca acttgcccgc 2905321 ccgagctacc tcggcaacga gttcataggg cgcttgcaat tcttttgccg cgacaaacaa 2905381 ttcgtcctcc gacatcgacg tcaaccggcg gatctcacca ccgatggccc gcatgtgtgt 2905441 ggtcgcgttg gagacgtctc cggtcccggc ctcgcccttg gaccggatca tggccgctcc 2905501 ctcgctgatg cgcctcaacg cctcaccgag attggtcgcc ccacacacga aaggcaccgt 2905561 gaagttccac ttgtcgatat ggtgggcgta gtcagcgggc gtcagcacct cggactcgtc 2905621 gatgtagtcg acgcccaacg tctgcaggat ctgcgcctcg acaaagtggc cgatgcgcac 2905681 tttagccatc accgggatgg tgaccgcggc gatgatgccc tcgatcatgt cggggtcact 2905741 catccgcgac accccgccct gggcgcggat atcggcgggc accctttcca acgccattac 2905801 cgcaaccgca ccggcgccct cggcgatgcg ggcctgctcc ggggtgacaa cgtccatgat 2905861 gacgccgccc ttgagcatct cggccatgcc gcgcttgacc cgcgccgtac cggtcgctgg 2905921 gttacctgca ggatccatgg tgcctcctct tgtccccact acgatacgac cgctaccgcg 2905981 ccggtctgct agccactcag gggcgtggcc aggacgccga ttggtaaatt acgaatccct 2906041 cagccgtgca gcaccggagg ccggaatgga cgatgacgcc caaatggtcg cgatcgataa 2906101 agaccaattg gcaaggatgc gtggcgaata cggcccggag aaggatggct gcggagatct 2906161 ggacttcgac tggctcgacg acggctggct cacgctgctg cggcgctggt tgaacgatgc 2906221 acaacgcgcc ggagtgagtg aaccgaacgc gatggtgctc gccaccgttg ccgacggaaa 2906281 accggtgacc cgttcggtac tttgcaaaat cctggacgag tccggtgtcg cgttctttac 2906341 cagctacacc tccgccaaag gcgagcagct cgccgtgaca ccatacgcat cggcaacctt 2906401 tccctggtac cagctaggtc gccaggcaca cgtacagggc ccagtcagca aggtcagcac 2906461 cgaggagata ttcacgtatt ggtccatgcg cccccggggc gcgcagctgg gtgcgtgggc 2906521 ctcgcagcag tcgcgcccgg tcggttctcg cgcccagctc gataaccagc tcgccgaggt 2906581 gacgcgtcgc ttcgccgacc aggaccagat cccggtgccc ccaggatggg gcggctaccg 2906641 catcgctccg gaaatcgtgg aattctggca gggccgggag aaccgcatgc acaaccgaat 2906701 ccgcgtcgcc aatggccggc tggaacggtt gcaaccctga tcgtcgagtc tggccacctc 2906761 gcgggcgaag tttgacggaa cctcgcagat cttgccggac atgccataga gtctttgacc 2906821 ggaatgcccg ctgacccgtg acgacgcggt caccggggat acccgccgcg gtggtggcca 2906881 accgataacg gccaaccgag aaagtacaca gcgatgaatt tcgccgtttt gccgccggag 2906941 gtgaattcgg cgcgcatatt cgccggtgcg ggcctgggcc caatgctggc ggcggcgtcg 2907001 gcctgggacg ggttggccga ggagttgcat gccgcggcgg gctcgttcgc gtcggtgacc 2907061 accgggttgg cgggcgacgc gtggcatggt ccggcgtcgc tggcgatgac ccgcgcggcc 2907121 agcccgtatg tggggtggtt gaacacggcg gcgggtcagg ccgcgcaggc ggccggccag 2907181 gcgcggctag cggcgagcgc gttcgaggcg acgctggcgg ccaccgtgtc tccagcgatg 2907241 gtcgcggcca accggacacg gctggcgtcg ctggtggcag ccaacttgct gggccagaac 2907301 gccccggcga tcgcggccgc ggaggctgaa tacgagcaga tatgggccca ggacgtggcc 2907361 gcgatgttcg gctatcactc cgccgcgtcg gcggtggcca cgcagctggc gcctattcaa 2907421 gagggtttgc agcagcagct gcaaaacgtg ctggcccagt tggctagcgg gaacctgggc 2907481 agcggaaatg tgggcgtcgg caacatcggc aacgacaaca ttggcaacgc aaacatcggc 2907541 ttcggaaatc gaggcgacgc caacatcggc atcgggaata tcggcgacag aaacctcggc 2907601 attgggaaca ccggcaattg gaatatcggc atcggcatca ccggcaacgg acaaatcggc 2907661 ttcggcaagc ctgccaaccc cgacgtcttg gtggtgggca acggcggccc gggagtaacc 2907721 gcgttggtca tgggcggcac cgacagccta ctgtcgctgc ccaacatccc cttactcgag 2907781 tacgctgcgc ggttcatcac ccccgtgcat cccggataca ccgctacgtt cctggaaacg 2907841 ccatcgcagt ttttcccatt caccgggctg aatagcctga cctatgacgt ctccgtggcc 2907901 cagggcgtaa cgaatctgca caccgcgatc atggcgcaac tcgcggcggg aaacgaagtc 2907961 gtcgtcttcg gcacctccca aagcgccacg atagccacct tcgaaatgcg ctatctgcaa 2908021 tccctgccag cacacctgcg tccgggtctc gacgaattgt cctttacgtt gaccggcaat 2908081 cccaaccggc ccgacggtgg cattcttacg cgttttggct tctccatacc gcagttgggt 2908141 ttcacattgt ccggcgcgac gcccgccgac gcctacccca ccgtcgatta cgcgttccag 2908201 tacgacggcg tcaacgactt ccccaaatac ccgctgaatg tcttcgcgac cgccaacgcg 2908261 atcgcgggca tccttttcct gcactccggg ttgattgcgt tgccgcccga tcttgcctcg 2908321 ggcgtggttc aaccggtgtc ctcaccggac gtcctgacca cctacatcct gctgcccagc 2908381 caagatctgc cgctgctggt cccgctgcgt gctatccccc tgctgggaaa cccgcttgcc 2908441 gacctcatcc agccggactt gcgggtgctc gtcgagttgg gttatgaccg caccgcccac 2908501 caggacgtgc ccagcccgtt cggactgttt ccggacgtcg attgggccga ggtggccgcg 2908561 gacctgcagc aaggcgccgt gcaaggcgtc aacgacgccc tgtccggact ggggctgccg 2908621 ccgccgtggc agccggcgct accccgactt ttctaagcgg tccacaaacc gtgcacgtca 2908681 gcggatgggc tgaggaacgc cggcatcgcg cgcggctccg ttgtccagcg cgacgtccac 2908741 cagccggttg gctgccggca acagctcgcc tagttgcaac gggtacaccc gctcgcccgc 2908801 cgccaccagc tgcgcgatgt cgttcgcgtc acaccagcgg gcatcgcgaa tatagcggcg 2908861 ttccaactcg gttcgcccct gcacagcagg ctcgaaccga cgcgtccggt gcaccaggta 2908921 gaactcctcg ctgtcgatca gcgacccgtt gaactcgaag acctcgtcgc gtcgccagat 2908981 aggtccgatc atgtcggccg gggccacccg cagaccggtt tcttcggcca gctcccgggc 2909041 ggcggcctgg gccagccgct cacccggtcg cacttggccc ccgacggtga accaccactt 2909101 cggcgccgcg ccgtcccgaa acgccgggtt cgccggatcc gatccgcaca gcaacaacac 2909161 ggcaccgctg tcatccaata gcaccacccg cgccgaggtg cggcgaccgg acgcaccctg 2909221 atcgccgtgc accaatgcgt ggggtcgctc gacgatctcg aaataggttg gcagcacagc 2909281 ggttccacca agccgcagca atcgcaccag ccgtcgttcc cccagagcga gggtgtcgcg 2909341 aacggcgtcg ttgtggaagc ggcgggccag caggacgcgg gcttccgcgt cggctaactc 2909401 ggcgatcagg gccgcgggca gcgacgcggg gttgaccatc gccaacgcgg ccgaaagctc 2909461 gttctccgca ttctcgcgcg catgccgggg cgcgccctcc gcggcgtcgg ctaaggcggc 2909521 cagccgactg ccctgggggg caccgccgta cgcgtcgatc gccaccgcac gtgccaccac 2909581 cgctcgtcgc gcgagcgcgc tgtccagcga ctgccacgac aagtcatagc gcacgttcaa 2909641 ccggttcaac cggttggccg tctgatatcc ccaggcgccg aacgcaacca gcacaacgag 2909701 cagcactgcg ccggccagga ccagccacgt catcagctgg ccacctgaac cttggcgccc 2909761 gacccggcga ccgtctcgta cactcgcatg atctggctgg ccaccaccga ccagtcatac 2909821 cggcggacgg ccgcgttgcc ggccgccaca tagcgctccc gcaggacatc gttctccagc 2909881 accgcaatca gtccatcggc caacgcggcg gcctgcaagt ctggcgggtc caccggcacc 2909941 aggtgcccga cctcaccgtc gcgcagcaca cgccggaagg cgtcgaggtc gctggccacc 2910001 accgcagtgc cggcggccat cgcttcgacc agcacaatgc cgaaactctc accgccggtg 2910061 ttgggcgcac agtagacgtc ggcgctgcgc atcgccgaag cttttccggc gtcgtccacc 2910121 tgacccagaa agcgcaggtg cgccgccaaa cggcccgcct ggccgcgcaa ctggtcggcg 2910181 tcgccgtggc cgacgatcag tagctggaca tccggaaacc gctgcaccac cttcggcagc 2910241 gcgtcgagca aaacggccat gcccttgcgg ggctcgtcgt agcgacccag gaacaacacc 2910301 gttttaccct ggcgcgggta cccgtccagc cgcgctgccg aggcgaagga atcaacgtcc 2910361 accccattgg ggatctccac cgcatcggat cccaacgcct ccatctgcca gcgccgggct 2910421 aggtcggaca ccgcgatccg gccgacgatc ttctcgtgca tgggccgcag aatgccctgg 2910481 aacaccgtca gcgtcagcga cttggtggtc gaggtgtgaa atgtcgccac aatcgggccc 2910541 tcggcaatgt tcagggccag catcgacagg ctcggcgcat tcggctcgtg tagatgcagt 2910601 acgtcgaaat caccatgcgc aagccacttt ttgaccttgc ggtgggtcgc cggaccgaac 2910661 cgcagccggg ccaccgagcc gttgtaggga atcggaaccg ccctaccacc ggagacaaag 2910721 taatcaggca gtgcggcatg cggggaggcc ggcgcgagca cactgaccaa gtggccgcgg 2910781 gtgcgcatca cctcggcaag ctgtagcaca tgcgactgca ccccgcccgg gacgtcgaac 2910841 gagtacggac aaatcatgcc gatccgcatc aggctttcct catctggacc tcagttgcgc 2910901 ccgccgcgat tcggataagt cggccagcca ctggggctgc agcatgtgcc aatccgcggg 2910961 atgggcggca atgttctgcg cgaagcggtc ggccagcgcc tgtgtgatgg cagcgacgtc 2911021 accgctggtg caatccagcg ccggatacac ctggaaaccc cagccgcggc cctcgaacca 2911081 gcaatgtgtg ggcagcaatg ccgcaccggt ctcgaccgcc agcttcgccg gccccaccgg 2911141 catccgggtg ggctcgccga agaagtcgac ctcaacaccg gtgcgggtga gatcgcgctc 2911201 ggccatcagg cagaccactc ggttgttcct cagccgctca cagagcacct cgaacggcgg 2911261 ccgttcgccg ccggacagcg gcagcacctc aaatcccagg ctttcgcggt agtcgataaa 2911321 gcgctggtac agcgattcgg gttttaggcg ctcggcgacg gtggtgaagg tgccgtgccg 2911381 ctgcaccagc cacatcccgg ccatatccca gttgccgctg tgcggcaacg ccagcacggc 2911441 accgaggccc gcggccagcg ccgcgtccag gtgatccagt ccaccgatca cgcggtcgag 2911501 ctggcgggcc agcttgcggt ggttcatcgt cggcagccgg aacacctcac gccagtagcg 2911561 cccgtaggac tccagcgagg cgcacatcag cgggtccggc accgcggctg gcggcacacc 2911621 caggacgcgg gccaggttct tgcgcagctg ctcgggcccg ccgtggcggg caaagtagcg 2911681 cgctccggtg tcgaatgcgt tgcgtacggc gaactctggc agcgcccgta cggccatcca 2911741 gccggccgca tacgcccagt cggtcgcggt gcgcgtcacg gaactgcgcg gatctttggg 2911801 cagcttcaag cccttaaggc cggcaatcac cggtcgccct ttccaggaat cgccatccga 2911861 tcgatggctc cgggtgaagt ccagaccgtg tgcaaccgct gcacgcaggt gatcacgctg 2911921 gcgacggcca gcagccacat ccccaccgac aacgccggcg gccagggcac aaacgggaag 2911981 tccgacaccc cggcgccggt cagcacgatg atcaaccgtt ccggccgttc gatgaagccg 2912041 ccgtcgccgc gcagcccgct ggcctccgcc cgggccttga tgtaagagat cacctgcgag 2912101 gtgaccagac agatcaaggt cgcgatcacc agcggtcggt cgcgcatgtg aaacgctatc 2912161 caccacagca gaccgcagaa caccgcgccg tcactgatgc ggtcacaggt ggcgtccagc 2912221 accgcgccga agcgagtgcc gcccccgcgc tcccgggcca tcgccccgtc cagcatgtcg 2912281 aacaacacga agaaccacac cacacacgca cccgcgaaca gcttgcccat cgggaacagc 2912341 gtcagcgctc ccgccaccga cgcggtggtg cccaggatgg tgacgacgtc cggcgtgagg 2912401 ccgacccgca gcagtcccct ggcgatcggg gtggtaatcc gggcgaacgc cgcccgggac 2912461 aggaagggca gcttgctcat ggttgccgag cccactcggt ggcaagcagc cgacgggtgt 2912521 cgcgcagcag ctgcggaatc accttggagc ccccgatgat ggtgatgaaa ttcgcatcgc 2912581 caccccaccg tggcaccaca tgcacgtgca ggtgctcggc cagcgacccg cccgccgatg 2912641 tccctaggtt caggccgaca ttgaagccgt gcggacgcga cacgttcttg atcacgcgaa 2912701 tcgccttctg ggtgaacgcc atcaactcgg cgctctccaa atcggtgaga tcctcgagtt 2912761 cggatacccg acgatagggc accaccatca agtgcccggg gttgtacggg tacaggttga 2912821 gcacggcgta gaccagcttg ccacgagcga ccaccagacc ctcttcgtcg gacagctgcg 2912881 ggatctcggt gaacggctgc gcagggctgg ccgaggaatt ggggtcacgc ttcactggcg 2912941 cttcggccag gtagttcatc cggtaggggg tccataaccg ctgcagctgg tcgcgctggc 2913001 cgacaccccg atcgaagatg gtgtggtcct cggtggcccg atccgtgcgg tcctcgtcac 2913061 tcacgaccgg ccactttcac cagttccgct gtaggaaccg cattttcgcg gtcagcgatc 2913121 caggcgacaa tggccgccac cgcatcgtca cgggccacac cgttgatttg ggtgcggtca 2913181 ccgaaccgga aactcaccgc gccggcggcg acgtcacgat cacccgccaa caccatgaac 2913241 ggcaccttgt ggttggtgtg gtgcacgatc ttcttggcca tccgatcgtc gctggcgtcc 2913301 acctcggccc gcaccccgtg cgacttcagt tgcgtggcaa cctcttccag ataggcgacg 2913361 tgctcatcgg cgaccgggat gccgaccacc tgcacgggcg ccaaccaggc cgggaacgcc 2913421 cccgcgtagt gctcggtgag aatgccgaag aaccgctcga tcgacccaaa tagcgcgcgg 2913481 tggatcatca ccgggcggtg gcgggttccg tcggcggcgg tgtactccag gccgaaacgt 2913541 tccggaaagt tgaagtccag ctggatggtc gacatctgcc aggtgcggcc cagcgcgtct 2913601 ttgacctgca ctgaaatctt gggcccgtag aacgccgcgc cgcctggatc gggcaccagc 2913661 tccagcccgg attcggcgcc cacctcggcc agcacggtgg tggcttcctc ccagacctcc 2913721 tcggcgccga cgaacttctc cgggtccttg gtggacagtt cgaggtagaa gtcggtgagg 2913781 ccgtagtcgg cgagcaggtc gagcacaaac cgcagcagcg accgcagctc gtcgcgcatc 2913841 tggtcgcggg tgcagaagat gtgcgcgtcg tccatggtca gcccacgcac ccgggtcaac 2913901 ccgtgcacca caccggactt ctcgtagcga tacaccgtgc cgaactcgaa gagccgcaac 2913961 ggcagttccc gataggatcg cccgcgcgcg cggaagatca ggcagtgcat cgggcagttc 2914021 atcggcttga ggtagtagtc ctggccgggt ttgcgcagcg agccgtcggc gttgtactcc 2914081 gcgtcgatgt gcatcggggg gaacatgccg tcggcgtacc agtccagatg tcccgaggtg 2914141 tggaacaact gggccttggt gatgtgcggg ctgttgacga actggtagcc cgcctcggtg 2914201 tgcttgcgcc gcgagtagtc ctccagttcg cgacgcacga tgccgccctt ggggtggaaa 2914261 accgctaggc cggaaccgat ttcgtcgggg aagctgaaca ggtccagctc gacacccagc 2914321 ttgcggtggt cgcggcgctg cgcctcttcg atgaactcca ggtgcctgtc gagcgcctcc 2914381 tgggattccc acgcggtgcc gtagatccgt tgcaggctgg cgtttttctg atcgccccgc 2914441 cagtaggcgg ccgagctgcg ggtgagcttg aacgccggga tgtgtttggt ggtcgggatg 2914501 tgcggtccgc ggcacaggtc gccccagacg cgctcgcggg tgcgggggtt gaggttgtcg 2914561 taggcggtga gctcgtcacc gccgacctcc atgatctcgg cgtcacccga tttgtcgtcg 2914621 acgagttcca gcttgtaggg ctcgttggcc agctcggcgc gggcctgttc ggtggattcg 2914681 tagacccgcc ggtcgaacag ctggccttcc ttgacgatct ggcgcatccg cttttccagc 2914741 gccgccaagt cctcgggcgt gaacggctcg ggcacgtcga agtcgtagta gaagccgtcg 2914801 gtgatgggtg gtccgatgcc gagcttggcc tgcggaaaca gctcttggac ggcttgggcc 2914861 aacacgtgcg cggtcgaatg gcggatcacg ctgcgaccgt cgtcggtgtt ggcggccacc 2914921 ggcgtgatat cggtgtcgac gtcgggcacc cagctcaggt cgcgcaggtt gccgtcggcg 2914981 tcgcgcacga cgacgatcgc atcgggcgta ccgcgccgcg gtaaacccgc ttcgccgacg 2915041 gcggtggccg cggtggtccc ggcaggaacc cgaattcggg cttgcgacgg gtcgccgcca 2915101 tcgactcccg gggcgggttg tgcgggggcg ctcatcgggt cggtctccaa ggcttggacg 2915161 tgtcgaaacg atcgcgacca tgctatcggg gcgcacgtcg acgaccgtaa gccgagtgac 2915221 cggatgggtt ttcgatcacc ggtgtgggcg atcggtaccg ggcaggtgac cgggtgctct 2915281 acggcggctc gatgagccca aaggatgttg acgacctggc tacccagcag gacgtcgacg 2915341 acggacagtc gatagagcgt cgctggacgg ggagcggtca gcgacgctgg cggcggtcgc 2915401 cgccgacggg ccactaccgt agcaactcgc aaatccaggt ctggatttcc ggcgccggcc 2915461 ggctccgtta gccgtcggct ccgttggtgc cggccaggcc gggtggcccc aacagtgatg 2915521 caccgccgct gccgccgggc ccgccgaagc ctggagcgcc attgaagagg ctcccggcgc 2915581 cgccgtttcc gccgtttccg ccgtttccac cgttgccgat cagtccgacg ctgccgccgt 2915641 tcccgccttt cccgccgtca ccgccggacc cgccagcagt gccggcgctc gctccgttcc 2915701 cgccggcccc ggcgctccca ccggcccccg cgttgccgat gagggcattg ccgccgtttc 2915761 caccgctgcc gccgctaccg ccattccctc cgaaggccgt aacagacccg ggcgacccgg 2915821 cggcaccgcc gtttccgccg gccccgccgt taccgtagag tagcccgccg gcgccgccgt 2915881 taccgccttg cccgccaaac ccaacgcccc cgaaatcggc ggacacatca ccaccggctc 2915941 cgccggctcc gccattgccg ccgttgccga tcagtccggt ggttccggcg tgaccaccgt 2916001 tacccccgaa tccggacgct ggaccgttct gagaaatgcc tgccccgttt ccggcggccc 2916061 cgccgtcccc gctgttgccg atcagtaggc cgccgttccc gccgttcccg ccgttgccgc 2916121 cgctagcccc cgcggagggc tcgccgccgc cggtgccccc ggccccgccg gtcccgccgg 2916181 cgccgatcag cccggcgttg ccgccgttcc caccatgccc gccgatagcg aggttggtgc 2916241 cggccccacc gatcccgccg ttcccgccgg ccccgccgtt gccgaacagc catccaccgg 2916301 cgccgccggc tccgccgttc gcgccggcct caaagggtag gccctggccg ccagctccgc 2916361 cggccccacc gttgccgatc aacccggccg caccgccggc cccgccggcc tgcccgggtg 2916421 cccccgaccc gctgttgccg ccgttgcccc acagccaccc gccgttaccg ccggcttgcc 2916481 cggtcccgtc gatcccgttc gcgccgtcgc cgatcaatgg gcgcccggtc agcgactgaa 2916541 cgggtgcgtt gatcgcatcg agcacgttct gcagcggtgt tgcgctggcc gcttcggcga 2916601 ccgcgtaggt gctgccagct tggcttaagg ccagcacgaa ccgttcctga taggccgcga 2916661 cctgcgcgct gatcgcttga tagtgctggc cgtggctgcc gaacagcgcc gcgatcgccg 2916721 ttgacacctc gtcttgggcg gcgaccaaca cctgggtggt cgccgccgcc gcggtgttgg 2916781 cggtgttgat cgccgagccg atccgcgccg catcggccgc ggctgtggac actaactgtg 2916841 gggccacgtt gacaaacgac atcgaaatcc tcctgaccgc cacgatgttg agatgcgggc 2916901 ggcccaccgc ctgttacccc tgcggtgggt aaccgtttat tcggacgatc cctgccgttc 2916961 cacgcctggg cgcaggcaca aaccgcacca acattggtgg aacgtggtgc acactgcacc 2917021 tggggttctg ccctcatcgt gtgtcagcag gcgaaacccg cgcggacgag aactcctgcg 2917081 ttaagcagca caaatcgccc tacaccccag tgaatctccg gacgccacta cgacagcgcg 2917141 caacggtcgc ctcatcgact gtgtgcacgc gcgcttcgcg atgcgctgcc gtggcaagct 2917201 ggccaggtgg acctcaatgc gctggccgat ctgccgctga cctatccgga ggtgggcgcg 2917261 acagcgaccg gacgactgcc cgcgggctac aaccaccttg acgtgtcgac gcagatcggc 2917321 accggccgcc agcgttttga gcaggccgcc gacgccgtca tgcattgggg catgcagcgc 2917381 aacgccggcc tgcgggtgcg ggccagctcc gaaaccgcca tcgtgtccgc ggtggtgttg 2917441 gtgggaatcg ctttcctgcg tgcgccgtgc cgagtggtgt atgtcatcga cgaacccgac 2917501 gtgcgcggat tcggttacgg cactttgccg ggccatccgg tgtccggcga ggaacggttc 2917561 gcggttcgct gcgacccgat gacctccgtg gtgtttgccg aggtgttgtc gttctcccgt 2917621 ccggcgacct gggcgagcaa agccgccggg ccgctgggcg cggtgaccca gcgcttcatc 2917681 gcccagcgct acctgcgcgc ggtgtgaggc gccggcgccc tggttaaggc cgcccgatgc 2917741 ctccgctgtg cacgccctgc gccagccggg cgagcgcgat ggcgccaacc agcaaaccga 2917801 agtcgcgcag cgcgatgtcg tagaaaccgg gtccggtgac caggttgaga atgatcccgg 2917861 ccagccaggc cgcgactacc caggcgccga tgcgcggtgc gaccgcaacc aatacgccgg 2917921 ccacaatctc gattgccccg accaagtaca tgcattggtc ggcggtgccg ggcacgagat 2917981 cgttgatcca gccggccaga tacatgttcc agtgctgcgg atgggtcagc agattgaaga 2918041 acttgtccag cccgaacagg atgggcgcga ccgtgaacag cgtgcgaagc aatacgtatg 2918101 cagagtatgc cggatccttc agctggtctg cgagagcagg gctggtcgtt ggtctgatgc 2918161 tcatagctgc ctcccgactt ctaacagaca acaatttgaa cgctagatcc tatagactgt 2918221 atcgtcaagt gttttgtctg ttagagatgg cttgctgaag tggacggccg agcttccttc 2918281 gaacgcgacg tcgccgggat cggggcactc gtggatccgg tgcgtcgcca gctctaccaa 2918341 ttcgtgtgct cacaatcgat gccggtgagc cgagaccagg cggccgacgc cgtcggcatc 2918401 ccgcgccacc aggcgaaatt ccatttggac cggctcactg ccgaaggcct gctggatacc 2918461 gagtacgcgc gcctgaccgg ccggtccggc cccggcgccg ggcggaccgc caagctgtat 2918521 cgccgggccg gccgcgacat cgccctcagc cttccacagc gggagtacga gcttgctggg 2918581 cggctgatgg ccgcagccat cgtgctgtcg gccaccaccg gggagccgac cgtggaagtg 2918641 ctcaaccgga tcgcccatga ctacggccaa gccatgggcg ccgccgccac cacccggccg 2918701 cccgcagacc ccgcggcggc gctggagctg acgctggatg tgctgcgcaa gtacggttat 2918761 gaaccccgcc gcccggctgg ccctggcgac gatgaggtcg agctggtgaa ctgcccgttc 2918821 cacgcactgg cccgggagca gaccgagctg gcctgcaata tgaaccacgc cttgatcaca 2918881 ggcgtggccg acgcgctggc accgcacagc ccggccgttc ggttggcacc cggaccggcc 2918941 cggtgttgtg tagtactcaa gcgatgttcg gctcacgacc ccgagtgagc atcgggcagg 2919001 gatttcagca cggtcagcat gatcaccgaa tcctcgacgg cgtgcagcgc atgccgtgtc 2919061 ggcggaatcg cgacgtagtc gccggccctg ccgttccacg cgtcctcacc ggcggtaagg 2919121 cacacatggc cctgcagcac ttgcagcgtc gcctcgcccg ggctgtcatg ctcggacagg 2919181 tcgtggccgg caagcaatgc cagcaccgtc tgccgaagct cgtgggtgtg accaccgtgg 2919241 atggtgtggg cagcccgtcc gctgtgtgtc tgttgcgcct cggccagctt ttcggcggcc 2919301 aggctggtca gcgaaatgga ttccatcggc gcgtcctttc agccgttcag tagcagtatc 2919361 cccgcgacga gcaacgcaac caccttgact atctccaaac cgacatagat gtggtgaccg 2919421 cgggagcggg gagcctgcag cccggccaat acctgattgg accgtcgagt caatcgagga 2919481 cgcaccgcaa tcaactggac ggccaacgca gccaacgcga ccgaaaacgc cgcggcgatc 2919541 cgcgccggcg tcgagccgac caccacgatc gcgaggatga caagggcgaa accgacctca 2919601 acggtattga gcgcacggaa gaccaaccgg ccgatgccga gcccgatctg cagcgtcact 2919661 cctgccgccc ggaacttcag cggagcttcc agaaacgaga tcgccaccac cattcccagc 2919721 cagacgaacg cgacggcgac ctcgatcgcc ggtccggcgc tcaccgaatg gctccttcca 2919781 gcggcgtgaa gtgggccagg cacaggtcgg gttcgacgaa cgcgtcgagc cggtcgacgg 2919841 tgactggcgc ccatgtttgc aaggctcccc gcatgatccc taagtggacg gggcagacga 2919901 caccggcttg agtttcggcg agttccagaa acggacagtg ccgcagaccg acctgttgcc 2919961 tgccgttgga tgcccggcgc tcgggagcga agccaaggtc gtcaagcacc gcgaccaagt 2920021 ggtcgatcgt ctcctcggtg tcggcaccgg ccggcggcgc ttcgagctgg cgcccccacg 2920081 cccggcccgc ggacaacgcc atggcccgcg aatcccgttc ggcggcaagg ccactggcga 2920141 ggatctcggc aagcagccgg taacgccgcg tcccagtgct atccgtccgc cggaccgccc 2920201 gaaacatcag cggcgggcgc cccggtcggc cgcggccggg ctcgacccgc tccacctggc 2920261 catcagcgac caggttatcg aggtggaagc ggacggtgtt gggatgcacg cccaacttgc 2920321 cggcgatcgc ggcgatgctc atcggaaccc gcgacgcaca caatgcccgc agcaccgcac 2920381 gacggcgccc caccggctct tgcagtgacc tgatgatgac actcaccccc ataaggctcg 2920441 tcggctgcgc ctgagcaatg cagtaagttt acacaaacgg acttgtaaaa acctgcggag 2920501 gtggggtcta tggccaacaa acgtggcaat gccgggcagc ctctgccctt gtcggatcga 2920561 gacgacgacc acatgcaggg gcactggctg ctggcccggc tgggcaagcg ggtgctgcgt 2920621 cccggcggcg tcgaactcac ccggacactg ctggcccgcg ccgaggtgac cgacgccgac 2920681 gtgctcgagc tggcaccggg cctgggccgc accgcagccg aaatcttggc ccgcaacccg 2920741 cggtcgtacg tgggggcgga gagcgatccc aacgcggcca acctggtccg acacgttctc 2920801 gccggccgcg gcgacgtccg ggtcaccgac gcggccgata ccggattatc cgacgccagc 2920861 gccgatgtcg tcatcggcga ggcgatgctg accatgcaag gcaacgcggc taaacacacg 2920921 atcgtcgccg aggcggcgcg ggtgctgagg ccgggtggcc gctacgcgat tcacgaacta 2920981 gcgctggtgc cggacgacgt cgcagagcag gtccgcaccg acctgcggca gtcgctggcc 2921041 cgcgcgctca aggtcaatgc gcgtccgctg accgttgcgg aatggtcgca cctcttagcg 2921101 ggccatggac tggtcgtcga acacgttgtc accgcttcca tggcgttgtt acaaccgcga 2921161 cgggtgatcg ctgacgaagg cctcctgggt gcgctgcggt tcgccggaaa cctgctcatc 2921221 catcgtgccg cgcgtcggcg agtcctgttg atgcgccaca cattccgcag gcatcgtgaa 2921281 cgcttgacag ccgtcgccat tgtcgcgcac aaaccgcacg tcgattcgtg atccattgag 2921341 gacctaagcc cgttgggcta gtgacaaacg cctcctgagc aaaaccctcc tcccccgtta 2921401 ccgtcgtgcg gtagggacaa gccacatcgg ccgagcgggc gatcagccaa cgacaggagg 2921461 accgcgatgt catcgggcaa ttcatctctg ggaattatcg tcgggatcga cgattcaccg 2921521 gccgcacagg ttgcggtgcg gtgggcagct cgggatgcgg agttgcgaaa aatccctctg 2921581 acgctcgtgc acgcggtgtc gccggaagta gccacctggc tggaggtgcc actgccgccg 2921641 ggcgtgctgc gatggcagca ggatcacggg cgccacctga tcgacgacgc actcaaggtg 2921701 gttgaacagg cttcgctgcg cgctggtccc cccacggtcc acagtgaaat cgttccggcg 2921761 gcagccgttc ccacattggt cgacatgtcc aaagacgcag tgctgatggt cgtgggttgt 2921821 ctcggaagtg ggcggtggcc gggccggctg ctcggttcgg tcagttccgg cctgctccgc 2921881 cacgcgcact gtccggtcgt gatcatccac gacgaagatt cggtgatgcc gcatccccag 2921941 caagcgccgg tgctagttgg cgttgacggc tcgtcggcct ccgagctggc gaccgcaatc 2922001 gcattcgacg aagcgtcgcg gcgaaacgtg gacctggtgg cgctgcacgc atggagcgac 2922061 gtcgatgtgt cggagtggcc cggaatcgat tggccggcaa ctcagtcgat ggccgagcag 2922121 gtgctggccg agcggttggc gggttggcag gagcggtatc ccaacgtagc cataacccgc 2922181 gtggtggtgc gcgatcagcc ggcccgccag ctcgtccaac gctccgagga agcccagctg 2922241 gtcgtggtcg gcagccgggg ccgcggcggc tacgccggaa tgctggtggg gtcggtaggc 2922301 gaaaccgttg ctcagctggc gcggacgccg gtcatcgtgg cacgcgagtc gctgacttag 2922361 gttcagcggc gaacgacaag caccgaacac tcggcgtgac ggaacaccgg atgtccggat 2922421 ggcccgacca gccgcgctag ctgaccggcc tcaccaccgc cgatcactgc cagctgtacg 2922481 cgctcgtcgt ggtcggccag gaaccgggca atacccgtgt gagtggtgat cgggtagacg 2922541 cgcacatcgg gatgacggtg gtgccaatcc tgcacgcgac gttcgaattc gccgtccgga 2922601 atctcccgga gctcctccgg tcgcccgccg agtgccagta tgggcgcttg ccgcaacttc 2922661 gcttcccggg cagcgtattc cagcacggcc tcgttatccg gtgcgtcggt catgcgcacc 2922721 acgatccagt tgatgtcaga cgctggctgg tccacttttg agcgcatgac ggcgaccggg 2922781 caatgcgcct tttcggccag ctcggttgcc gtcgaaccca agatcgagct ggcgtagcgc 2922841 ccgattccca cggagccgac gcagatcatc tcggcgtcgc gcgatgcctc cacaagcacc 2922901 gggccggctg gcccgcgggg gatgtcggtt tcgatcttga cgagcttgcc cgcggcctca 2922961 acagcggact gcgcttcccg aagcgatctt tcagcatgcg caaggtcgcg gtcgtagtcg 2923021 tccggggacg gatgtgtcgg cttgatcact gagaccagtc gcagcggcac cgctcggctg 2923081 atggcctcgt caacccccca caatgcggcc gtaatcgccg cgtgcgaacc atcgatacca 2923141 acaatgattg ttttcatcgt cggctctcct ctcccagaca tttcccgatg ctcgatcacc 2923201 ccgcatcgga aaacctgtcc gcatcttggg gactcgtggt aaaggtcggt tccggctggg 2923261 ccaaccggta gacgtcaatc agccgcgcga catcgctggg agtgacgatg ccgaccaccg 2923321 cgctcccttc ggtgaccagc gcacggctgc gcgggccgag cggtgccatc cgctctagga 2923381 gcgcggtcag cggctcttgt ggtcgggcgg tcggcacgct gtgcagcggc agcgcaatgt 2923441 cacctacgct ggtagtgctg cgccggctag gcgcaacatc gcgcagctgc cgcaatgcca 2923501 ccaggcccgt gatcgatccg tcccgatcgg caaccggata tgccgagtgc cgttcaccaa 2923561 gcacgtaacg ctggatgaaa tcctcgacat tgatccatcc gggagccgta tgcggttggg 2923621 cggtcatcgc atcggccaca cgcaccccgg caaacagctg ctgggtcgaa atccgggtct 2923681 cctcctcgcg agcggcagcg aagataaacc agccaatgaa ggctaaccag accccaccga 2923741 cgaggccacc agccacaaac tcggccaatc ccaacgcgat caagaccagc gcaaccaccc 2923801 gtccggcccg cgccgcaccg atcccggcgc gcacactatc gccgtggcgg cgccacagat 2923861 aggcccggac caaccgccca ccgtccaacg gcgcgccagg cagcagattg aacagcccca 2923921 gcagcaggtt gacagtagcc aaccaccaag caacgctgat cacgatggcc ggggtccgca 2923981 cgccggcgag cgtgatggcc aacgcaccga atgtcgccga cagcgccagg ctggtagccg 2924041 gacccgcgaa cgcgatccgg aaagcggctt tgggcgtctt tgcctcgccg ccaagcgcgg 2924101 tcaccccgcc gaacagccac aacgtcacgc tctcaacgga taccccggcg cgacgagcga 2924161 cgacggcgtg cgcgagctca tgagccaaca gcgacgccag caacatgacc gcgccacctg 2924221 cgccgagaag ccaatagacc acggccgggt agcctccgac ggtacccggc aacatggtcg 2924281 ccagactcca ggtgaacaac cacaggatca ccaacacgct ccagtggacg ttcaccacaa 2924341 acccggcgat ccgcccaagc gggatcgcat cacgcattgg gtacctccga tgctggcgga 2924401 taaagccttt cgtgccggcg gatgatccga ggtcgctagc tggcgagggc catgggcgag 2924461 cagattgcct tgacgaactg cacaatggcg tgctcgggca ggtgtcgggc gatgtcggct 2924521 tcggtgacga ttccgaccaa gcggtgctct gagatgaccg gaacacggcg gacctgatgt 2924581 tcttccatga cgttgagcat ctcctggatg cttgcgttcg catcgacgta gtagatgctg 2924641 tcccgggcca actcgccagc cgtggcggta ttcgggtcta ggcccgcagc caggcctttg 2924701 atcacaatgt cgcggtcggt gagcatgccg tgcagccggt cgtcgtcccc gcagatcggc 2924761 aacgcgccga tgtcatgctc acgcatgtat tgagcggcag cggttagcgt ctcgtgttcg 2924821 ccaacacagg tcacacctgc gttcatgatg tcgcgtgcgg tggtcatcgg gatcctcctc 2924881 gagtcggggt gctattgctg atctgctgcc gaaggtacga ccacgtcgta gcgaacacta 2924941 gggtcgtttg acccgtgggc cgcgggtcga tggacccgta ctggcgcgcg ttgaggcagc 2925001 tggcttgcct ggcttgtcct cgccgtaggc cacctcaaag tcgaaggttg tcaattgatt 2925061 tcaccagccg gatatagcgc tatgggcggc cgcaggaccg atagtgatgc cgatcggccc 2925121 cgatcggggt aaccggcaat ggaacaactg acaaccatga aggctcgttt cgacggaagc 2925181 tgaagacgcc gacaggcaca tgagcctcgc gacggggcca atccgttggc tttgcgaccg 2925241 tggtcgtagg tcctggcgga gccgggttgc cacatccgtc acaagctgac acgccgaacg 2925301 tgcaaccagg gcggcatcgc ctgggtgtgt ctccgccacc agtgcacatt cggcgcagcc 2925361 agcccacgct cggcgcggag ttaggcggaa cggtcgcgct gtgtccgtgg cgcgtccaac 2925421 aggcccgact gctccagcgc agcctggaca aaccgtcgta ccggccgcga ctggaagaag 2925481 ccagtgtgac cgcctggata ccacacgatt tcgggtttgc cccagtgctc ccagaggcga 2925541 gtcacctgtt cgcgtggatg cacgagtcgg tcggcaatgc ccgcgtagat aaagcggccc 2925601 ggcatgggca ccagtggcgt aagtgagagc ggcgagatca ttcggccgat cggttcggcc 2925661 atcttgacgg tgtggcggcg ggggtctttg tgccgaagac cgcagtggcg gcccaacaac 2925721 tcgatcagat cagccactgg gacaccgaga atcgcgcagg cgagaccttc ttcgaggctg 2925781 gcgaccaatg acgcgatgta gccgcccagc gagagaccgt tcaacccgat cagcgactcc 2925841 tcctcctgcg atcgtatcca ggacaacagc cgccggatat cccacaccgc ttgagccgtc 2925901 ccatgcacat cgtcgagaac atcttctccg ggaaaaacgg cgcccttcgg cagaccttgc 2925961 ccgcggggac catgcatcgg aagaaccggc atgacaatgt tcaggccgag ttcgtcatgc 2926021 agcttccagg cgcggaacac cgcgagatcc aacggggccc tgcccatctc ggtgccgtgt 2926081 acacaaacca gccagggacg cggctctggg tgccgcagta acagggcgta ctcgcgattg 2926141 ttcgcagtgt atgagagcca ccgttggctg cccggttcac ccggatgcgg cgtaaaccca 2926201 ctgtcgaaga agatgcgata aaaggagcgt ctgcggtcct tgacctttcg gaccgcgacc 2926261 tcggtgagcg gtgggggctg ggcaaaaaat ccgctaggct tctccagcca tccgcgattc 2926321 ccatagaact ccagtccagc ggccacttct tggctgatgc gctcgaacac tcgatgattg 2926381 ctgaccggac gtcgtgcctt gaggcccagc aggacgattt cgtctcgaaa ggcttgcgcc 2926441 gctaaggcaa tagtgggccg tgcgatcggc agtttatcgg gctgttgacc cagatagtcg 2926501 cgccacgatt gagcgacgta cagaccggtg tgcatgaacg gtcccatggc gccgctcaag 2926561 accggtggac tcaggcgaaa agccgagcgt tcgtgggtgc cgtcgctcgc agaacttgcc 2926621 atggcagcaa agctaaccgc gtgcggaacg acgcgttagg gacttacgtc ccgccggaag 2926681 tcacctgtgt ggtggtggcc actgtcgaga ccggcggccc gttgtggtgg cccaagtgcc 2926741 ctaaggtgat caggtgccgc agcccggcca gcacgccgtc agagtttcac ggggcttggt 2926801 cgcggccgat ggcgtcctca tcgtggggtc gatgaccgag gtggacgcgg cgcgaccggg 2926861 cacatcgacg gtccccgggg ctttgtgggc cagtgaagtg acgaaagacc ccagtggaca 2926921 cggacttcgg catgtccacg caacgaccga ggcactccgg tattcgggct gttggcccct 2926981 acgcatgggc cggccgatgt ggtcggatag gcaggtgggg ggtgcaccag gaggcgatga 2927041 tgaatctagc gatatggcac ccgcgcaagg tgcaatccgc caccatctat caggtgaccg 2927101 atcgcttgca cgacgggcgc acagcacggg tgcgtggtga cgagatcact agcaccgtgt 2927161 ccggttggtt gtcggagttg ggcacccaaa gcccgttggc cgatgagctt gcgcgtgcgg 2927221 tgcggatcgg cgactggccc gctgcgtacg caatcggtga gcacctgtcc gttgagattg 2927281 ccgttgcggt ctaagcacca cctaacggtg tcgtcccgaa gggacgattg ccgatccggt 2927341 ggatgacttt ggtccctatg ccttcccgct ggaccgcaca acgatcgaag gtgccacgac 2927401 gcatagaaga catggccatg ccacaccctg atagcattgc agcaagctac atgtactgct 2927461 ctaccaggat ccttatgggc aacagtgggt ttgagttatg aaacccgtgg gcacataccc 2927521 ttccgcgtcg tactggtcag tctcgacagc gaagagatca ccggttgatc caccaagcat 2927581 gcattggcgg gcatctgcat aaacggtgac gtatcagcac aaaacagcgg agagaacaac 2927641 atgcgatcag aacgtctccg gtggctggta gccgcagaag gtccgttcgc ctcggtgtat 2927701 ttcgacgact cgcacgacac tcttgatgcc gtcgagcgcc gggaagcgac gtggcgcgat 2927761 gtccggaagc atctcgaaag ccgcgacgcg aagcaggagc tcatcgacag cctcgaagag 2927821 gcggtgcggg attctcgacc ggccgtcggc cagcgtggcc gcgcgctgat cgcgaccggc 2927881 gagcaagtac tggtcaacga gcatctgatc ggcccaccac cggctacggt gattcggctg 2927941 tcggattatc cgtacgtcgt gccattgata gaccttgaga tgcggcgacc gacgtatgta 2928001 tttgccgcgg ttgatcacac cggcgccgac gtcaagctgt atcagggggc caccatcagt 2928061 tccacgaaaa tcgatggggt cggctacccg gtgcacaagc cggtcaccgc cggctggaac 2928121 ggctacggcg acttccagca caccaccgaa gaagccatcc gaatgaactg ccgcgcggtc 2928181 gccgaccatc tcacccgact ggtagacgct gccgaccccg aggtggtgtt cgtgtccggc 2928241 gaggtgcggt cacgcacaga cctgctttcc acattgccgc agcgggtggc ggtccgggtg 2928301 tcgcagctgc atgccggacc gcgcaaaagc gccttagacg aggaagagat ctgggacctg 2928361 acatccgcgg agttcacccg gcggcggtac gccgaaatca ccaatgtcgc acaacaattt 2928421 gaggcggaga tcggacgcgg atcggggctg gcggcccaag ggttggcgga ggtgtgtgcg 2928481 gctctgcgtg acggcgacgt cgacacgctg atcgtcggag agctaggcga ggccaccgtg 2928541 gtcaccggta aagcgcgtac tacggtcgcg cgggatgccg acatgttgtc cgaactcggc 2928601 gaaccggtag atcgcgtggc aagggccgat gaggcgttgc cattcgccgc gatcgcggta 2928661 ggtgccgcat tggtccgtga cgacaaccgg atcgcgccac tagatggggt gggcgcattg 2928721 ctgcgttatg ccgccaccaa ccgactcggc agccatagat cctaggatgc tgcaccgcga 2928781 cgatcacatc aatccgccgc ggccccgcgg gttggatgtt ccttgcgccc gcctacgagc 2928841 gacaaatccc ctgcgcgcct tggcgcgttg cgttcaggcg ggcaagccgg gcaccagttc 2928901 agggcatcgg tccgtgccgc atacggcgga cttgcgaatc gaagcctggg caccgacccg 2928961 tgacggctgt atccggcagg cggtgctggg taccgtcgag agcttcctcg acctggaatc 2929021 cgcgcacgcg gtccataccc ggctgcgccg gctgaccgcg gatcgcgacg acgatctact 2929081 ggtcgcggtg ctcgaggagg tcatttattt gctggacacc gtcggtgaaa cgcctgtcga 2929141 tctcaggctg cgcgacgttg acgggggtgt cgacgtcaca ttcgcaacga ccgatgcgag 2929201 tacgctagtt caggtgggtg ccgtgccgaa ggcggtgtca ctcaacgaac ttcggttctc 2929261 gcagggtcgc cacggctggc gatgtgcggt aacgctcgat gtgtgaattg agacctgatt 2929321 catgaaaatc gtcgaggaga ccccataccg gttccggatc gaacaagagg gcgcgatgcg 2929381 ggtgcccggg atcgtgttcg cgtccaggtc gttgctgcct cgtgacgaag gcgacatggc 2929441 cctgatgcaa gtggtcaacg tggctacgct gccggggatt gtccgggcct cgtatgcgat 2929501 gcccgatgtg cactggggat atggtttccc aatcggcggc gtggccgcaa ccgacgtcga 2929561 caatgatgga gtcgtttccc caggcggtgt cggcttcgat atttcgtgcg gcgtaagact 2929621 cttggtcggc gaagggctgg accgcgagga gctgcaacca cggttgccgg cggtcatgga 2929681 ccggcttgat cgcgcgatac cgcgcggagt gggcacggcg ggtgtgtggc gactacccga 2929741 ccggaacacg ctgcaggagg tgctcaccgg tggtgcccgg tttgcggtgg aacaggggca 2929801 tggcgtcgcg ctagacctcg agcggtgcga agacggcggt gtgatgacag gagcggacgc 2929861 ggccaaaatc agtgaccggg ccctccaacg cgggcttggg cagatcggca gccttggctc 2929921 gggcaaccac ttcctggaag tccaggccgt ggaccgcgtc tacgatccgg ttgcggccgc 2929981 gccgatgggt ctggcggaag ggaccgtctg cgtgatgatc cacaccggct cacggggcct 2930041 gggccatcag atctgcacgg atcacgtccg ccagatggaa caagccatgg gccgatacgg 2930101 aatcgcggtg cccgatcgcc aattggcttg tgtgccggtg cactcccccg atgggcaggc 2930161 ctatctcgcc gcgatggcgg cggcggccaa ctacggacgc gccaaccgcc aactgctgac 2930221 cgaggcgacg cgtcgtgtgt tcgctgatgc aaccggaaca cctctggacc tgctctacga 2930281 cgtgtcgcac aacctggcca agatcgagac gcatccgatc gacggtcagc tgcgctcggt 2930341 gtgcgtgcac cgcaagggcg ccacccgctc gctgccgccg caccatcacg agctgccggc 2930401 cgaactggca gcggtcggcc aacccgtgct gatacccggg acgatgggta cggcgtcata 2930461 tgtgcttgcc ggggtcaccg gcaacccggc gttcttttcc accgcgcatg gtgctgggcg 2930521 ggtactgagc cgtcaccagg ccgcccgcca caccagcggt gaagcgatac gcgccagcct 2930581 cgcaaaacgt ggcatcatcg tccgcggtac ctctcgtagg ggtatcgccg aggaaaagcc 2930641 ggaggcctac aaagacgtcg acgaggtcat cgaagccagc catcagagtg gcctcgcgcg 2930701 caaagtggct cgccttgttc ccttgggctg tgtcaaagga tgaatcaacg gcgaacattc 2930761 cagccgtcgc gaccgccttc ttcagtggtg cagacccgtg accggctgat gggtactggc 2930821 ttcgatatcc gacgacgtca aagcgaatag ctgattcgcc aaatccgaca aggcccgggc 2930881 gatcgcaagt tcgtcgccga tctgggccac cggctcatcg gccggatcga gtcgcgccaa 2930941 accaacaccc accatctgcc tgcctgccca ggacagccgc gccttcgccc gggtgcgctc 2931001 gtcgtgttcc tcaatcagca catcaatttg gcaggttttt ccaacgtgct cgctgtctgt 2931061 catcgcggcc tccctgtcgg atttgcgctt acgcccgccg atctgccccg ctagctgaac 2931121 gcggtatcta tccaatcacc acaatcggtc gtggagtagg ccagaattct tttcgcccga 2931181 cccgggcccg cctagcgctg acaaccgcta gatggccttc aggaggtctg ctttgccctt 2931241 ggtacggagt gtgtacagag gtgagccgcg caactgctca atgcgagccg ccatcttgtc 2931301 accgagctcc tcgagctcgg catcggtgat gtgcaccggt gtaggagcgg ggatcatgtc 2931361 gcgttcctct acgtcggcgt gcgcctccaa cacggtccgg aacacgttcc actcttcttc 2931421 atacccgggc gcgcgctgcg gagtgcgcag cagcgtcgcg agctgatcaa ccacctgacg 2931481 gtgctcggcg tgggtacccg tgattggttt gccggccgcg gaaagggcag ggtagtacag 2931541 gtcatcctcg atgcggaagt gaatgtccag ctcgatgagc atctcgtcga aaaggacatg 2931601 gcgctcttcg ctattcaccg gcgcctcgcc gactttgcgg cccagtcctt taagcacggt 2931661 gtggtggcgc tttaatacgt cgtaggcatt cacttcgttg ctctattccg tattcgggat 2931721 caacgagaca accgtaacct cgcgccgcgg cccattaatg tgaggtagct gtgaatcagc 2931781 acaaagaagc ctgtgcagta gcgcgacgct cggcgtaccg gcacgagtcc gacggcccgc 2931841 atgtccatgc ggccgccggc accagcgccg aggcccccgc aggataccgg gatctgcagc 2931901 tcctcgtgcg gaaacagttg ccgcagttcg ggttcgggca gttcggcgag cgtgatgttc 2931961 gcgcctaacg gcaacaacta tccgtcggcg ccctgggtgc cgggcgggcc catattgccc 2932021 tggatgccgg agctgccacc cggtgaccca cccgcgccgc cggcgccccc gttgccgccc 2932081 agcgcgaaat cgccgccctg accgccggtc gcgccggtcc cgccgttgcc gccgttgccg 2932141 ccctggccgc cgaggccacc ttgcccaccc gtgccgcctg cgccggtgcc gccggcagct 2932201 cctgcccacc cgatcagccc gccggctccg ccgctgccgc cggtggtccc gccggcgccg 2932261 ccggtaccgc cagtgccacc agcgcccccc acgccgcctg taccgccgcc accgccaatt 2932321 gtcgctcccc cgccggtggt ggtacccgcg ccgccggcgc caccgttgcc gccggcacca 2932381 ccgatgccgc cgatgccacc ggtgccgccg acaccgccgg cacccccgcc accaccaagc 2932441 ccgatgagcg acccagccgc cccgccgttg ccgccgacac cgccgctgcc acccataccg 2932501 ccggtaccgc cgacaccgcc gaggcccccc agaccgccgg tgccgccttc cgcggtaccc 2932561 gcaccgtcgg tgagaccctc tccgcccgcg ccgccgacac cgcccgcgaa gccggcggcg 2932621 ccaccaccac cggtgccgcc cgtcccgccg gccccaccgg cgccgccgtt gccgattaac 2932681 atcccgccac gtccaccgtt tccaccggca ccaccggtgc cgccgttacc gcccgccgcg 2932741 cctagtgccc cgttaccgtc accgccgatt ccgccgtcac cgccgaaagc gtcacctaca 2932801 cccgtgttgt gcccctgccc ccccttgccg ccagcaccac ccacgccacc gtcgacccct 2932861 ccggtggcac cgtcaccccc ctcacccccg gtagccacgc cgccggcgct gccgtcggtc 2932921 tcacctatgc cgccagcgcc gccagcgccg ccggcaccgc catcggtacc ggcagtaccc 2932981 ccggctccac ccttaccgcc ggtgccgtcg ttgccgtcga gcgactcccc cagcccgccc 2933041 tgcccgccga cgccgccagc ctcgccgacg ccaccggcgg ggccgggacc cccgttcccg 2933101 ccagtttgat tcccgttgcc gctgttgtcg gtaccgttcg caccggtgtt ggggttcgca 2933161 atcgagccgg ggttgacccc gtttgtcccg gccagaccgg tgccaccctg cccgccggca 2933221 ccaccggacc cgaaccagtt ggcattaccg ccgttgccgc ccgcgccggg catcccgccc 2933281 aggacacccg ccacggccgg cccaccctgt ccgccggcac cgccatcgcc caacaacatc 2933341 ccgccggcac cgccattacc accggccgcc ccggccccac ccagaccccc aacaccgcca 2933401 ttgccgatca atagcggacc cgcaccgccg tcaccaccgg gcgcaccgtc cccaccaacg 2933461 ccgccaccgc ccccggtgcc gaagtagctg gccgcccctc cgacgccgcc ggcggcgcca 2933521 aggccaccgg ccccgccgaa tccaccattg ccgaacacgc cgccaacgcc accgctcccg 2933581 cccacgccgc cggtggtgcc cacccccgcc gcggccccgc cagcaccgcc gaagcccccg 2933641 ctgccgatca accccgtcgc cccaccgaca ccgccagtac cgccgaccaa agtggcccct 2933701 gcagccccac cagccccacc ggtcccgcca ttacccagca accatccacc gcgaccaccg 2933761 acacccccgg cagcaccgga cccgaccagc ccgtccccac ctttaccgcc agtcccgccg 2933821 ttaccgatca accccgcatc cccgccagca ccacctggct gacccggcgc acccgatccg 2933881 ccattcccgc cgttgccaag caacagcccg cccggcccac cgggagcccc cgtcccgtcg 2933941 gccccgttag cgccattgcc gatcaacggg cgccccaaca acgcctgggc gggggcattt 2934001 accacgccca acaaatcctg caacggcgca gcactggtgg cctcggcaac tacgtatgag 2934061 cgcgcgccgt tcgtaaggga ctgcacgaac tgggcgtgaa acgccgacag ctgcgcacca 2934121 aaagcctgat agctctgcgc gtacgacccg aacaaggccg caactgccgc cgaaacctca 2934181 tccgcggcag ccgccaccac ccccgtggtc ggcaatgccg ccgccgcatt cgccgcgttg 2934241 atcgtcgacc caatgttggc cagatccgaa gccgccatcg tcaatgcttc cggcaccgca 2934301 atcacaaatg acatctgcga cctcctggac cggacaaccc gcatggtcgc cgcggatcat 2934361 cgagcactcg gcagcaacaa atcctatccc gcctcgcaga cggcggaggc catttggccg 2934421 ccggcgcgta ctcttcgcta cgaccgccag agcccttggt tagcgaccgg attcgaccgc 2934481 cgcatgagcc aaactgttac cggtgtgggt gtgcagaact gcgcagttag caaacgccga 2934541 tgcagcgcgg tggaccacag cagccgcaca ccgtaccggc gctgagtgat aaacccgacc 2934601 cgggcccggc ggatgcgata tcgtcttgcg gctatggcgg gtatgccaga gggcaaactc 2934661 atcctcctca acggcggatc cagcgcggga aagacgtcgc tcgccttggc gtttcaggat 2934721 cttgccgccg agtgttggat gcacattggg atagatctgt tctggtttgc gctgccgcca 2934781 gagcagcttg accttgcgcg ggtgcggccc gagtactaca catgggacag cgcggtcgag 2934841 gccgacgggc tggagtggtt caccgtgcac ccgggcccca tcttggacct ggccatgcat 2934901 tcccgctacc gcgccatcag ggcatacctg gacaacggaa tgaacgtcat cgccgacgac 2934961 gtgatctgga cacgtgagtg gctggtagac gctctgcggg tttttgaggg ctgccgagtc 2935021 tggatggtcg gggtccacgt atccgacgag gagggtgccc gccgggaatt agaacgcggc 2935081 gatcgccacc ccgggtggaa ccgaggcagt gcgcgcgctg cccacgccga cgccgagtac 2935141 gacttcgagc tggataccac cgcgaccccg gtccacgagc tggccaggga gctgcatgag 2935201 agctatcaag cctgcccgta ccccatggct ttcaaccggt tacgcaaacg cttcctatct 2935261 tgaaatggag ccaaaagtcg tgcgcaactg gaactttcac tcctggcaaa cgctggggcg 2935321 acccgtcacc gcgcgcttgg gttcgggtcg aatcgtcggc cgcgcgggtc gtgcggaaca 2935381 ttgcacccga cgcggcggaa tcggagttga gaagtacatg gcgggacgca cccggcaccg 2935441 gtcaggcatt ctttacccat ggatgtggag gccctgctgc agtcgatccc gccgctcatg 2935501 gtctacctgg tggtcggcgc ggtggtaggg atcgagagcc tgggcatccc ccttcccggc 2935561 gagatcgtgc tggtcagtgc cgcggtgttg tcgtcgcacc ccgagctggc cgtcaacccg 2935621 atcggcgtcg gcggcgctgc ggtgatcggc gccgtggtcg gcgattcgat cggctactcg 2935681 atcggccgcc gcttcggctt accgctattc gaccggctgg gccggaggtt cccaaaacac 2935741 ttcggccccg gtcatgtcgc gcttgctgaa cggttgttca accgatgggg agtccgagcc 2935801 gtgttcctcg gtcgcttcat cgcgctgctg cggatattcg ccggaccgct cgctggcgcc 2935861 ctgaagatgc cctacccgcg cttcctggcc gccaacgtca caggcggcat ctgctgggcc 2935921 ggcggcacca ctgcactggt ctacttcgcc gggatggccg cccagcactg gttggaacgg 2935981 ttctcctgga tcgcgctggt catcgcggtc atcgccggca ttacggccgc gatcttgctg 2936041 cgcgaacgca cttcgcgcgc gatcgccgaa ctcgaggccg agcactgccg caaagccggt 2936101 accaccgcgg cgtgaccgac cggcttgaat ccggtaccca cgctcacagg agctgcaatc 2936161 tagacagatc tccagtcatg tcataaaaat gagatctgaa attacttgac aagcttgtct 2936221 tcggacagtg cggggcatcc gccgcggtgg ctgtacgccg tcgattagga gcgcaccatg 2936281 ggcctgatca ctacagaacc acgctctagt ccccacccgc tcagcccacg gctcgtccac 2936341 gagctaggcg acccacacag cacgctgcgg gcaaccactg acggcagcgg ggcagcgttg 2936401 ttgatccacg cgggcggcga gatcgatggc cgcaacgagc atctctggcg tcaattggtc 2936461 accgaggccg ccgccggcgt cacggcgccc ggaccgctca tcgtcgacgt caccgggctc 2936521 gatttcatgg gctgctgcgc tttcgccgca ctggccgacg aggcacaacg atgtcggtgc 2936581 cgcggcatcg acctgcgtct ggtgagccac cagccgatcg tcgcccggat cgccgaagcg 2936641 ggtgggctga gccgagtgct gcccatctac ccgaccgtcg atactgcgct cggcaagggc 2936701 acggccggtc cagcccgttg ctgatcccgg ccgtaagagc accgagccga ccgccggtgg 2936761 ccccaccgct agggccgatc gcaccgccgc gcgacgatgt tcgcgtcagg cgcgcatgcg 2936821 gtatcgcttg ccttgcaagg taatccactt cggacatcca cgatgcaggt cgcgatcaag 2936881 tcgggcgcgc cgcagcagtc agtggccgcg aggggcgtac atgatcacgg ctaccccggc 2936941 catgcagcca agggcaccga tgacatccca ccggtcgggc cggaacccgt ccagggccat 2937001 gccccaggcg agcgaaccgg cgacaaacac accaccgtag gcggccaaga cccgaccgaa 2937061 atgggcgtcc ggctgcaatg tggcgaagaa cccatagacc ccaagcgcaa taactccgag 2937121 tcccgcccaa agccaacccc gttgctcgcg gacgccctgc cataccagcc acgcgccacc 2937181 gatctccgca accgccgcca ggacgaatag caggattgac cgcaccacca tggttgcgag 2937241 cctacgagat ccgctgccct gccgcccccc aaccaatcgc gcaccccaaa tgcttcccgt 2937301 cacccgcgct cagccagaca ccggtgttgg ctacaactat ggttcccgga tcaggcgcag 2937361 cagttcgggt tgagcacggt acacagcgct tgcagggctt caggatgtac ccgatggaag 2937421 acgtgcatgc cccggcgatc ggaaatgacc aggccggcct tgcgcagctg ggccaagtgg 2937481 tggctgacgg tgccatcgct gaggctgagc gccgccgcta gttggccgct gacctgctcg 2937541 ccggccggcg agctgaacag gtaggacatg atcttgactc gtgccgggtc ggccagggcc 2937601 ttcagccgca gcgccaccgc caaggcgtcg ccgtcgctca tcggccccgc cgccaccggg 2937661 gcgcagcaca cgggagcgga gatgtcaatc accggcagcg acttgggcat aggcccaccc 2937721 tgccagatac cttgacatat atcaaagaga tgttgcacac tgggttcggc gccattttga 2937781 tataagtcaa acaactggga ggtgtctacc aatgtcccgc gttcagctag ccctcaacgt 2937841 cgacgacctg gaggccgcaa tcacgttcta ctccaggctg ttcaacgccg agcccgccaa 2937901 acgcaagccc ggatacgcca acttcgcgat cgccgatccg ccgcttaagt tggtgctgct 2937961 ggagaacccc ggcaccggcg gtaccctcaa ccatctcggt gtggaagtcg gctcgagcaa 2938021 caccgtgcat gccgaaatcg cccggttgac cgaagccgga ctggtcaccg agaaggagat 2938081 cggcaccacg tgttgctttg ccacccagga caaggtgtgg gtgaccggcc cgggtgggga 2938141 acgctgggag gtttataccg tgctggccga ctccgagacc ttcggcagcg gtcctcggca 2938201 caacgacacc agcgacggcg aagcaagcat gtgctgcgac ggccaagtcg ccgttggcgc 2938261 aagcggctaa ctgtaggcct gaccccgggg tgcgtctcca agccgcggag cccaccccgg 2938321 gccactcaat gccccctaac ccgcgtagcg ccgttcaccg cgtggccgct tgcggacctg 2938381 attcgatatt tgtcaatatt gatgtatgtc gaatctgcat ccgttaccag aggtggcgag 2938441 ctgcgtagtc gcgccgctgg tgcgcgaacc gctgaatcct ccggccgcgg ccgaaatggc 2938501 ggcccggttc aaagccctgg ccgatccggt gcgattgcag ctgctgagct cggttgccag 2938561 tcgcgccggc ggcgaggcct gcgtctgcga catttccgcg ggagtcgagg tgagccagcc 2938621 cacgatttcg catcatctca aggtgctgcg cgacgcgggt ttgctgacct cgcggcgtcg 2938681 ggcctcgtgg gtgtactacg ccgtggtccc cgaggcgctg accgtgttgt cgaacctgct 2938741 cagcgtgcat gccgatgccg cacccgccct gggggcaccg gcatgacgga gacggtcacc 2938801 cgcaccgccg ccccggcggt ggtgggcaaa ctctcgacgc tggaccgctt cttgccggtg 2938861 tggatcgggt cggcaatggc cgccgggcta ctactgggcc ggtggattcc cggcctgcac 2938921 accgccctag aaggggttca gctcgacggg atttcgctgc cgatcgcgct aggcctgctg 2938981 atcatgatgt atccggtgct ggccaaggtg cgctacgacc gcctcgacac cgtcaccggt 2939041 gaccgcaagc tgctactcag ctcgctgctg ctgaactggg tactgggccc ggcgttgatg 2939101 ttcgcgctgg cttggctgct actggcggat ctgcccgagt accgcaccgg gctgatcatc 2939161 gtgggcctgg ctcgctgcat cgccatggtg atcatctgga acgacctggc ctgcggggat 2939221 cgcgaagccg ccgccgtgct cgtcgcgttg aactcgatct ttcaggtggc catgttcgcc 2939281 gcgctcggct ggttctacct gtcggtgcta ccgggttggc tgggcctcga gcagaccacc 2939341 atcgccacat ccccgtggca gatcgccaag tcggtgctga tcttcctcgg catcccgctg 2939401 ctggccggct acctgtcgcg gcggatcggc gaaaagacca agggccgcaa ctggtatgaa 2939461 tcccgcttcc tgcccaaggt gggaccgtgg gcgctctacg gtttgctgtt caccatcgtg 2939521 attctctttg cgctgcaagg agatcagatc accggccgac cgctggacgt cgcacgcatt 2939581 gcgctgccgc tgctggccta cttcgccatc atgtgggtag gcggctacct actgggggcg 2939641 gcgctgcggc tagggtatcg gcgcaccacc acgctggcgt tcaccgccgc gagcaacaac 2939701 ttcgagctgg ccatcgcggt ggccatcgcc acctacggcg ccacctccgg gcaagccctg 2939761 gccggagtcg tcgggcccct gatcgaggta cccgtcctgg tggggttggt ctatgtgtcc 2939821 ctggcgctgc gcaaccgcct cgccggtccc aacgcgaccc acgatgccga caaacccagc 2939881 gtcctattcg tctgtgtgca caacgccgga cgttcccaga tggccgccgg gctattgacc 2939941 cacttggccg gtgaccgcat cgaagtccgt tcggccggaa ccgagcccgc cggtcaggtc 2940001 aatccgacgg ctgtggccgc gatggccgaa atgggcatcg atatcaccgc caatgccccc 2940061 acattgctca ccggcgggca ggtccagtcc agcgacgtcg tcatcacgat gggctgcggc 2940121 gatgcctgcc cttacttccc gggtgcctcc taccgcaact ggaaactacc cgatcccgcc 2940181 ggccagcccc tcgacgttgt gcgcatgatc cgcgacgaca tcgcagaccg cgtccaagcc 2940241 ctgatcgccg agctgctggc caccgccaag accagatagc gtgtgccacg ctcggtgctg 2940301 cgccgatacg tgaggtcccg gctgggatcg gattttccgc ctgtacggcg gctaggcacc 2940361 agcggatcgc atttgtactg gttagagact tgccgagtgg ccgcattagc ctgcgtggag 2940421 cgcttggtca aaaagctcgg ccctgttcgg ccctatgggt tcctgttgat ctgccctgtt 2940481 cgtagtctcg acaaagcggc tgcccgagat cgcgtgcgac gatatcggga gcggctgcgg 2940541 caacgaggtc tgcggccgat acagatctgg gttcccgatg tgaacgcacc cgaatttgtc 2940601 ggcgaagcac accgtccgtc ggcgctcgtc gcggcccgcg aatacgagga cgacgatcaa 2940661 gccttcgtcg atgcggtatc ggtcgactgg gacgacgcca cctgacgtgc ggcgcggcga 2940721 catccacacc gcggcggcgc gtggtgccta caccggcaag ccacgccggt cgcggtcatc 2940781 cagaatgacc ggttcgattc gacggcctcg gttaccgtcg tgccgtttac cacgcgtgat 2940841 gtccaggcat ccctgatgcg aatcccggcc ccagcgtcca acaccaccgg gctgaccgag 2940901 accagtcgcc tgacggtcga caaggtgaca acatcccccg caccagcctg acgcggcagg 2940961 ttggtcggtt atcggccaaa aacatggtca ggctcgaccg tgcattgctg gttttcctgg 2941021 ccggctgaca attgcgccac ctggtcatca gaactgatcg ggcggggaaa cgaaacgggg 2941081 ctcccagcgg aggtcatgag ttggcgcgcc ggtttcgccg cgatctctcc gaacttgacc 2941141 gctaaacctc ggggcagaag tcatgaacaa gcccgttagg aggcgtttga ggccgtaaat 2941201 gttgatgagg gcggggaaag tgtcgtcatg gccgtcgcgc tgaattcacc acgcccccac 2941261 gacggagctc gtgggcaccc agcattcact gcttaccact acgatctcgc tcacgaggtt 2941321 cgagcagcca ctgtcgcctg ccgccaacga ataatgctcc ctgacctagt ggtcccggct 2941381 gggatcgaac cagcgacctt ccgcgtgtga agcggacgct ctcccactga gccacgggac 2941441 cggcgccgag gagatgaacg aggtcgaaga ttagcacgtg caagacatcg tcagcagcag 2941501 tctacgtgcg cttcacatag gggctgcgat agcctagagc cgcaacgtac caagagattt 2941561 gtgtgggccc gctcacctcg actatcgtcg tgcttcgcac cgggcgacga tctcgttcgt 2941621 tgcgcgcgga tgtagcgcag ttggtagcgc atcaccttgc caaggtgagg gtcgcgggtt 2941681 cgaatcccgt catccgctcg aaggtgctag tggcatcaaa tcccagcggt ggagtggccg 2941741 agtggtgagg caacggcctg caaagccgtg cacacgggtt cgattcccgt ctccacctcc 2941801 aggttcaacc cccagcgcga ttagctcagc gggagagcgc ttccctgaca cggaagaggt 2941861 cactggttca atcccagtat cgcgcaccac gattgacctg cggtttcatc cacaaaatct 2941921 gggctgcgtg aactaaatgt gaactgactc ggtgcaacca ccgaaaggtt cctctgttcc 2941981 gtgcccacgc cgacaccgac ggtgacccca ccagatgcgc ctgccgcccg ctggctagcc 2942041 tggcctgttg ctgcaagcgc ctggtcgacg cccgctatca cgctgttgtc gcgtccaccg 2942101 aactcaccga ggcacgccgc acccgcgcaa ccgagctgac ggagctgatc accaccgcgc 2942161 tcgccttctg cgaacggctg caaacggtcg ttgagggtga ccggcgggct gaggtgaccc 2942221 gatgagcggc ggctggctcg ccgagcacct cggcctgtcc acaaaccggc tccggcacga 2942281 actcgcagac cggctcgacg cgcactacgg gccacccgca cagaacaggg agctcgcgcg 2942341 gccgagcctg cggattatca acgagggcac tgatggatga cctgacgcgg ctccggcgcg 2942401 agcttctgga ccgattcgac gtgcgggact tcacagactg gcctccagca tcgctgcgag 2942461 ccctcatcgc gacctacgac ccctggatcg acatgacggc cagcccgcca cagcctgtat 2942521 cgcccggagg gcctcgactc cgactcgtgc gattaaccac caacccatcc gcgagagcag 2942581 cccctatcgg aaacggtggg gactcttctg tttgcgctgg tgagaaacag tgccgcccac 2942641 cgtagcggcc tgcgcgtggc aattgaccga cctgacccga gtagccgcca gtgggctgta 2942701 agccattctt tacggcagcc tgttgtaaag gtaacgttta cacgtggagg tgagggctag 2942761 cgcccgcaag cacggcatca acgacgacgc catgctccac gcataccgca acgcgctgcg 2942821 ctacgtcgaa ctggaatacc acggcgaagt tcaactgctg gtgatcggcc ccgaccaaac 2942881 cgggcgcctt ttagagctgg tcatcccagc agacgaacca ccccgaatta tccacgccaa 2942941 cgtactacgc ccgaagttct acgactacct gaggtgatga gataagagtg aagcacaaga 2943001 ccgacattga cgagtggctc gacacgatcg agcccaaccc ggccgacgcc cacgatgcca 2943061 gccacctgcg gcgcatcatc gccgcgaaag aagcggtcca aacagccgaa tctgagttgc 2943121 gggccgcagt gaatgctgcc cgcgccgccg gcgacacctg ggcagccatc ggcgtcgccc 2943181 tcggcatcac ccgccaggcc gcgttccaac ggttcgggcc acacagcaca gcgagcccct 2943241 aaaccggcgc gcctccgcgg tggagttgac gacgaccaga cagggccgaa gcggagtcac 2943301 agcgtctggc cgacacacgt ggcgtcgtgt ttgctaggca tgggttttgt gtttgctgtc 2943361 ccccacaacc ccagacccgt acaaatcccc agacccctac acacagcgac acggcgaccc 2943421 gccgtctcct gagtgtgttt gctaaaattt cgtttgttct ggtcgatcac ttattgtgtt 2943481 tgccggtttt ggcgatgggc ttgattcctc tgacagcaac accagttggc cccttcctgg 2943541 ccaggacgtg atagaccacg ctggtgggtc atgcgcaccg gagcacccga tgatcgtcgt 2943601 ccgtacggcc gaggcggccg agcaggccct gactgagggc cagctggtct gcccccgccg 2943661 cggatgtggc gacaccttgc ggcggtggcg atatggacgg cgccggcatg tgcgcagcct 2943721 cggctcgcag gtgatcgatg tgcggcccca gcgggtgcgt tgccgcagat gcgaaagcac 2943781 ccatgtgctc ctgccagcgg cgctacagcc acgcctaggg cgcggcggcg gcggccagtt 2943841 acgtccaggg gtgtggtgta cgggcaggta aggccggtgg gcgtgtcgta gcccagtagt 2943901 gggcggtcat cgcgtgatcc ttcgaaacga ccagcaaaag tcaatcgaag gaaatgacgc 2943961 aatgacctct tctcatctta tcgacaccga gcagcttctg gctgaccaac tcgcacaggc 2944021 gagcccggat ctgctgcgcg ggctgctctc gacgttcatc gccgccttga tgggggctga 2944081 agccgacgcc ctgtgcgggg cgggctaccg cgaacgcagc gatgagcggt ccaatcagcg 2944141 caacggctac cgccaccgtg atttcgacac ccgtgccgca accatcgacg tcgcgatccc 2944201 caagctgcgc cagggcagct atttcccgga ctggctgctg cagcgccgca agcgagctga 2944261 acgcgcactg accagcgtgg tggcgacctg ctacctgctg ggagtatcca ctcgccggat 2944321 ggagcgcctg gtcgaaacac ttggtgtgac aaagctttcc aagtcgcaag tgtcgatcat 2944381 ggccaaagag ctcgacgaag ccgtagaggc gtttcggacc cgcccgctcg atgccggccc 2944441 gtataccttc ctcgccgccg acgccctggt gctcaaggtg cgcgaggcag gccgcgtcgt 2944501 cggagtgcac accttgatcg ccaccggcgt caacgccgag ggctaccgag agatcctggg 2944561 catccaggtc acctccgccg aggacggggc cggctggctg gcgttcttcc gcgacctggt 2944621 cgcccgcggc ctgtccgggg tcgcgctggt caccagcgac gcccacgccg gcctggtggc 2944681 cgcgatcggc gccaccctgc ccgcagcggc ctggcagcgc tgcagaaccc actacgcagc 2944741 caatcacggt cgacacaatg cataacgtca acctactgtt gacgtcatgc cggagcccac 2944801 acccaccgcc taccccgtcc gcctcgacga gctcatcaac gccatcaaac gggtgcacag 2944861 cgacgtgttg gaccaactca gcgacgccgt cctggccgcc gagcatctcg gcgaaatcgc 2944921 cgatcactta atcggccact tcgtcgatca ggcccgccgc tcgggcgcct cctggtccga 2944981 tatcggcaag agcatgggcg tcaccaaaca ggccgcgcaa aagcggttcg tcccccgagc 2945041 cgaagccacc acactggatt caaaccaggg cttcaggcgt ttcacgccgc gggcccgcaa 2945101 cgccgtggtc gcggcccaaa acgccgcgca cggagccgcc agcagcgaga tcacccccga 2945161 tcacctgttg ttgggagtgc tcactgaccc ggccgcactg gccacggcgt tgcttcagca 2945221 gcaggagatc gacatcgcaa ccctgcgtac ggcggtcacg ctccccccgg cagtcaccga 2945281 gccgcctcag ccgatcccgt tcagcggccc ggcgcgcaag gtcctcgagc tcaccttccg 2945341 cgaggcgctt cggctgggcc acaactacat cgggaccgaa cacctgctgc tggcactgct 2945401 agaactcgag gacggggatg ggccgttgca tcgatccggc gtcgacaaga gccgcgccga 2945461 ggccgacctg atcaccacgc tcgcatcgct caccggcgcc aacgctgccg gcgcaaccga 2945521 tgccggcgca accgatgccg gctgaggcga gcgacccctc cccttcgcgg cgccgcgtgt 2945581 gcaatcatgc gaaggtcccc caccgggagc cgaggaggca cagatgcgcc gctggctgat 2945641 cgtcctcgct acgctgctcg tcgccgccgc gggcgttgcg gccgccaacg acgtgccccg 2945701 tgcgtgggcc ggcgacgcgc cgatcggcca catcggcgac acgctgcgtg tggacaccgg 2945761 cacctacgtc gccgacgtca ccgtcagcag cgtcgtaccg gtcgatccgc cgccgggatt 2945821 tgcctatacc cgcagcggcg tcccggtcaa aagcttcccc gacagctcag tgacccgcgc 2945881 cgacgtgacg gtccgcgcgg tccgggtgcc caactccttc atcttggcca ccaatttcag 2945941 cttcaccgga gtaacgccgt ttgccgacgc gtacaagccg cggccgtgcg acgcatccga 2946001 ttggctcgac gccgcgttgg gcaacgcgcc acagggctcg atcgttcgcg gcggggtgta 2946061 ctgggacgcc taccgcgacc cggtgtcggt tgtcgtgctg ctggacaaga aaaccggcca 2946121 gcacctcgca cagtggaacc tttgacctgc gcctcgagat cgccacggcc gacgtgaccg 2946181 acgccgacga gttggccgcc gtcgccgcac gcaccttccc gctggcgtgc ccaccagcgg 2946241 tcgccccgga gcacatcgcg tcgttcgtcg acgccaacct gtcgtcggcc cggttcgccg 2946301 agtatctgac cgatccgcgg cgcgccatcc tcaccgcccg ccatgacggc cgaattgtcg 2946361 gttacgccat gctcattcgc ggtgacgacc gggacgtgga gctgtccaag ctgtacctgc 2946421 tgccgggtta tcatggcacc ggagccgctg cggcattgat gcacaaggtg ctggctaccg 2946481 ccgccgactg gggcgcgctc cgggtgtggc tgggtgtcaa ccagaaaaac caacgcgcac 2946541 aacgcttcta cgcgaagact ggtttcaaga tcaacggcac caggacgttt cgactgggag 2946601 cccaccacga gaatgactac gtcatggttc gcgagcttgt atgacccccg ccgtcagggc 2946661 cagcaggcga gatgtggccc gcaggtactt ctttcggtat ccaccggcca gcatttcctc 2946721 gctgaagatg gtgtccagct tagcgccgga cgccaccacc ggaatgccgg cgtcatagag 2946781 ccgatcaacg agcgccacca accgcagcgc aacgttctgg tcgtcgatgc cgtgcacgcc 2946841 ggtcagaaac accgcggtca caccttcgat cagggtcaga tatcgcgacg gatgcatggt 2946901 ggccaggtgc gcgcacagcg cgtcgaagtc gtcaagggtc gccccctcaa cacgtgcggc 2946961 acgcgcggcc acctcctcgt cggacagcgg cgccggtgcc ggcggcagat cacggtgtcg 2947021 gtagtccgga ccctcgatcc tcaccgtggt gaaaatgctt gccagggtgt tgatctcgcg 2947081 tagaaagtcc tgggcggcga agcggccctc gccgagctgt tcgggcagtg tgttggaggt 2947141 ggcggccacc gaaacccccc gctcgaccag agccgaaagc agccgggaga tcagcgtggt 2947201 gttgcccgga tcgtccagct cgaactcgtc gatacataaa gcggtgtaat tggccaacag 2947261 atcgatacag tcggcgaagc cgaacacacc ggccagctgg gtcagctcac cgaacgtcgc 2947321 gaatgccttt ggacatgtcg gcgcgtccgg gccggttcca ggcagctggt agtaggcaga 2947381 ggccagcagg tgcgtcttgc ctaccccgaa cccaccgtcc aggtacagcc ccacaccggg 2947441 caacacgtcg cgcttgccga accatttctt gcggcctgca cgccgctcga cggcctgccg 2947501 gcaaaagtcc tggcacgcca cgacggcggc cgcctgggtg ggttcaaccg ggtcaggtcg 2947561 atacgtcgcg aagctcacct cggcgaacgt cggaggcggc cgcagttggg cgatcagccg 2947621 caccggagac acggtcggat gcctgtccac caggtggtcc accgaaccgc aagcttcgga 2947681 ggcagacccg tgcatggtgg cactgtagcg acgtgctgca atcaaggtca tgcccgactc 2947741 tggtcagctc ggagccgctg acaccccgct aaggctgctc agctcggtgc attacctcac 2947801 cgacggcgaa ctcccccagc tttacgacta tccggatgac ggcacctggt tgcgggcgaa 2947861 cttcatcagc agcttggacg gcggcgctac cgtcgatggc accagcgggg cgatggccgg 2947921 gcccggcgac cgattcgtct tcaacctgtt gcgtgaactt gccgacgtca tcgtggtcgg 2947981 cgtgggcacc gtgcgcattg agggctactc cggcgtccgg atgggtgtcg tccagcgcca 2948041 gcaccggcag gcccgaggcc aaagcgaagt tccgcaactg gcaatcgtca ccaggtccgg 2948101 tcgccttgac cgtgacatgg cggtattcac ccggaccgag atggcaccgt tggtgctcac 2948161 caccacggcg gtcgccgatg acacgcgcca gcggctcgcg ggcctcgccg aggtgatcgc 2948221 gtgctccggc gacgatccgg gcacggtcga tgaggcagtg ctcgtgtccc agctcgcggc 2948281 tcgcggtctg cgccggatcc ttaccgaagg cgggccgacg ttgctcggga cattcgtcga 2948341 gcgtgacgtg ctcgacgagc tgtgtctgac gatcgccccc tacgtcgtcg gcggcctggc 2948401 gcgccgcata gtgacgggac ccgggcaggt gctgacccgg atgcgctgtg cccatgtcct 2948461 caccgacgac tccggctacc tgtacacccg ctacgtcaag acctgaaaca gctggacgtg 2948521 aatgcccgcc tcctcaccga cccactacgc ggcccgcatc gtcgccgggt gaatggctac 2948581 tgtggtcggc atgagtcggc ccatgacgtc aaccgcgatg ttggtcgcgc tgacctgctc 2948641 ggcgacagtg ctggccgcat gcgtcccggc gttcggcgcc gacccgcggt tcgcgaccta 2948701 ctcgggcgca ggaccgcaag gcgcagccac cacgacacca ccgccggctg gcccaccacc 2948761 gctcgccgca cccaagaacg acttgtcgtg gcacgactgc acgtcacggg tgtactcgaa 2948821 tgctgggatc ccagcagcgc ccggcgtcaa gctggaatgc gcaagctatg acaccgacct 2948881 cgacccgctc gtcggcgggt ccacagcggt aagcatcggc gtagtgcgcg cgcgctccaa 2948941 ccagaccccg agcgacgcag gacccctggt gttcaccacc ggctccgacc taccctcgtc 2949001 gacgcagttg ccggtctggc tggcacacgc gggcatcgat gtgctccgca gccaccccat 2949061 tgtcgccgtc gaccgccgcg gcatgggcat gtcgagccca atcgactgcc gcgatcactt 2949121 tgaccgcgac gagatgcgtg atcaggcgca attccaggct ggcgacgatc cggtggccaa 2949181 cctttccgac atctccaaca ccgccaccac cgactgcacc gacgccatcg cgccaggcga 2949241 gtccgcctac gacaacaccc acgccgcctc ggatatcgag cgcttacgca aactctggga 2949301 cgtccctgcc ctcgccttcg tcggcattgg caacggcacc caagtggcgc tggcctacgc 2949361 agcatcgcgt cccgacaacg tcgccagact gatcctcgac tccccaatcg cgttgggggt 2949421 ctctgccgaa gccgccgccg agcaacaggt ccagggccaa caggcggcgc tggacgcatt 2949481 cgctgcgcaa tgtgtcgcgg tgaactgcgc gctgggctcc gatccgaaag gcgcggtcag 2949541 cgcgctgctg tcggccgccc ggtccggtga tgggcccggc ggcgcgtcgg tggcggctgt 2949601 cgccaacgcc gtcgccaccg cgttgggctt ccccgacagt ggccgggtcg atagcaccac 2949661 gaaattggcc gacgcgctgg ccgcggcccg ctccggggac atgaacttgc tgtccgccct 2949721 gatcaaccgc gccgatacca cccgggatac ggacggtcag ttcatcagct cgtgcagcga 2949781 tgcggtcaac cgcccgacac cggaccgggt gcgcgagctg gtggtggctt gggggaagct 2949841 ctacccgcag ttcggcgccg tcgcggcgct caacctggtg aaatgcgtgc actggcccag 2949901 cagttcgccg ccgcagccac cgaaagacct caaggtcgac gtgctgttgc tcggtgtgca 2949961 aaacgacccg atcgtgggca acgaaggggt cgccgcgacc gccgccacgg ccatcaacgc 2950021 caacgccgcc agcaagcggg tgatgtggca aggtattggc cacggcgcca gcatctactc 2950081 gtcctgcgcg gtgccgccac tcgtcgccta cctggacact ggcaagctgc ctgacaccga 2950141 cacctattgc cccgcctgat attcggggcg ggcgggacgc ggtgtacggt gcgctggtga 2950201 cggcagctga ctccatccga accggcctag gcgcatcctt gttggccgga ttccgtccgc 2950261 gcaccggcgc cccgagcacc gcgacgatcc tgcggtcggc gctctggccg gccgccgtcc 2950321 tgtcggtgct gcaccgcagc atcgtattga cgaccaacgg caacatcacc gacgatttca 2950381 agccggtcta ccgcgcggtg ctgaacttcc ggcgcggatg ggacatctat aacgagcact 2950441 tcgactacgt cgacccgcac tacctgtatc cccccggtgg caccctgctg atggcgccgt 2950501 tcggctacct gcccttcgcc ccgtcgcgct atctgtttat ctcgatcaac accgcggcca 2950561 tcctggtcgc cgcctacctg ctgctgcgga tgttcaactt cacgctgacc tcggtggccg 2950621 cacccgccct gattctggcc atgtttgcta ccgagaccgt gaccaacacg ctggtgttca 2950681 ccaacatcaa cggctgcatc ctgctgttgg aggtgctctt tctgagatgg ctgttggacg 2950741 gccgagccag tcgtcagtgg tgcggcggcc tggcgatcgg gctgaccctg gttctcaaac 2950801 ccctgctcgg tccgctgttg ttgctgccgc tgctgaaccg ccagtggcgg gctctggtgg 2950861 ccgccgtcgt cgttcccgtc gtcgtcaacg tggccgcgct gccgctggtc agtgacccga 2950921 tgagcttctt cacccgcacg ctgccctaca tcttgggcac ccgggactac ttcaacagct 2950981 cgatcttggg caacggcgtc tacttcgggc tgcccacctg gctgatcctg ttcctgcgga 2951041 tcctgttcac cgcgatcacc ttcggcgcat tgtggctgtt gtaccgctac taccgcaccg 2951101 gtgacccgct gttttggttc accacctcgt cgggtgtgct gctgctgtgg tcgtggctgg 2951161 tgatgtcgct ggcccagggc tactactcga tgatgctgtt cccgttcctg atgaccgttg 2951221 tgctgcccaa ctcggtgatc cgcaactggc cggcgtggct gggagtctac ggcttcatga 2951281 cgttggatcg ctggctgctg ttcaactgga tgagatgggg ccgcgcgctg gaatacctca 2951341 agatcaccta cggttggtcg ttgctgttga tcgtgacgtt taccgtgctc tatttccgct 2951401 atctggacgc caaggcggac aaccggctgg acggcggtat cgatccagcc tggctgacgc 2951461 ccgagcggga gggccagcgg tgatcgcaag cgcggcgagc cgggcgcagc gggtcaccgc 2951521 catcgggact agcggtgatc gcaagcgcgg cgagccgggc gcagcgggtc accgccatcg 2951581 ggactagcgg tgatcgcaag cgcggcgagc cgggcgcagc gggtcaccgc catcgggact 2951641 agcgtggacc catgacgcgc ccaaagctag aactgtccga cgacgagtgg cgtcagaagc 2951701 tcaccccgca ggaattccat gtgctacgtc gcgccgggac cgagcggccc ttcaccggtg 2951761 aatacaccga caccacaaca gcgggcatct accagtgccg ggcctgtggc gccgaattgt 2951821 tccgcagcac cgagaaattc gagtcgcatt gcggctggcc gtcgttcttc gacccgaaaa 2951881 gctccgatgc ggtgaccctg cgccctgacc actcgttggg gatgacgcgt accgaggtgc 2951941 tgtgcgcgaa ctgcgacagc cacctgggcc acgtgttcgc cggcgagggg tatcccacgc 2952001 caaccgacaa gcgctattgc atcaactcca tttcgctgcg cctggtcccc ggtagcgtgt 2952061 agcgccgaga ttgacgtttt gcagacgccc tctcgcactt tcactgcaaa acgtcagtct 2952121 cggtgaaagt cagtccaccc gggtggcgtg cacttcccag aacggggcat gtacgcggcc 2952181 gcccaccagc cacggcttaa tcgcccggaa ccgctccagc acgcaccgca cttgatcggc 2952241 catatcggga ttgcgcgcgg ccatcagctc tagggcctcg acgctcaagt tgacctggta 2952301 ggtggtcgtc cccaggtagg tgatctccca accgccgacc ggaagcacct ggcgaaaatc 2952361 gtcctcggac aacgaccgcg gcatgctgaa cccgttgacg ttgtgctcgc cgaattcgaa 2952421 catgtacagc cgtgcacccg gcttgctggc ccggcgcagc gcccgcacat agcacctttg 2952481 cagctcgggc gcggtgctga aggtgtggta gaaggcgcaa tcgacgacgg tgtcgaaccg 2952541 gccgtccagc ccgtcgagcg tggtggcgtc gccgacctgg aagttcaccg acacccccgc 2952601 cttacgcgcg ttgtcccgag cccgctcgat ggccgcgacg gacccgtcga tcccggtggc 2952661 cgcatatccc ttggcggcgt agtagatcgc gtggtgcccg ggcccggtgc ccgggtcgag 2952721 cacctcacct cggatcgcgc ccaacgcaac cagctgttga accaccggct ggggaccccc 2952781 gatgtcccat ggcgtggcgg ccggcaaccc gtgggcgacc cgatcatcgc gatacatctc 2952841 ctcgaaccgg gtgggatcgg caggatcgaa ctgggccgtc atggcagcga gtgcaccaac 2952901 tgctccaccg gcactcgcgg accggtgaaa aacggtgtct ccgcgcgggt atggcggcgc 2952961 gcgtcggtgg cgcgcagctc acgcattagg tcgacgatgc ggtccagctc gggcgcctcg 2953021 aacgccagga tccattcgta gtcgcccagc gcgaacgccg gcaccgtgtt ggcccggacg 2953081 tccttgtatc cgcgggcggc catgccgtgt tcggcgagca tgcggcgacg ttcctcgtcg 2953141 ggcagcaagt accactcgta agaccgcaca aacggataga cgcagatgta ggcgccgggc 2953201 tcctcgccgg ccagaaacgc cgggatatga cttttgttga actccgccgg ccggtgcagg 2953261 cccacaccgc tccacaccgg cgtgcatgcc cgccccagcg tggtggtgcg ccggaagtcg 2953321 gcgtaggtgg cctgcagggc ctcgacacgt tcggcgtggg tccagaccat gaaatcggcg 2953381 tcggcccgca ggcccgcgac gtcgtagagg ccgcgcacca caaccccgcg ctcttcctgc 2953441 tgtttgaaaa acgtggacgc gtcgtcgatg atcgcgtcac gctggtcacc gagcgcaccg 2953501 ggactcaccg agaacactga gaacatcagg tagcgcagcg tcgcattcaa cgcgtcatag 2953561 tcaagacggg ccatggcatc tatcgtgcca cctgcgcatc taaggcctcg atgacgctgg 2953621 tgacggcccg gcccgccgcg ccgacgcagg ccggcacgcc gatcccgtcg aggtagctgc 2953681 ccgcaacggc cagcgtcggt ggcaggccgg cgcgcagctc ggcgaccaca tcggcatggc 2953741 cgggaccgta ctgcggcatc gcctcgatcc agcgccggac ccgaacgtcg accgggtcga 2953801 cggccacacc gaacaccgtg accaagtcgt ccgctgccca ggccaggagt tggtcgtcgg 2953861 aggccgtcag ggccggttcg tcgccgaacc gaccgaacga cagccgcaac agcgcgacgt 2953921 cgccgcgctg accccatttg cgcgacgaca atgtgatcgc cttggcatgc ggtgactcgt 2953981 cgccggccac cagcacgccg gaacagtgcg gaaacgcggt gccgccgggc accgccagcg 2954041 ccaccaccgc cgacgacgcg ctcacgatct gccgggcggc ggcatgtgtg cgcggcgcga 2954101 tgccatcgac gaggcgcgcc aaccgcggcg ccggaaccgc caggatgacg gcgtcggcct 2954161 gccagcggcc gccggtttcg tcgcgcagca cccagccgcg ttcgagctgg accaccctgg 2954221 cccgcaccca gtgcacccgg ctgcgccgga cgagcccgtc gagcagcacc tgatacccgc 2954281 cgtccagcgc gccgaacacc ggcccgccgc ttcccggcgg cagcgcctgc cggaccgcgt 2954341 cggtcacact ggtcgccccg cgatccaggg ccgcggccac gctcggggcg gccgcgcgca 2954401 gcccgatcgt cgccgccgag cccgcgtata ccccgcttaa cagcgggtcc accgaccggg 2954461 ccacgacttg gtcgccgaac cggtcagcca ccaagtcggc caccgcggga tcgctgccca 2954521 cctgccaggt gaacggacga gcggcttcgg cgtcgatccg cgccagggtt gcgtcgtcga 2954581 ccagccccgc catggagccc gccgacgacg ggatcccgac gaccgtctgc ggcggcagcg 2954641 ggtgcaagcg ctgctggctg tagatgagcg gccgcgcgcc ggtgctggcg agttggcggt 2954701 ccgacaggcc cagctcggcc aaaagcgccg gcatctcggg cctacgcagc acgaacgcct 2954761 ccgcgccgag gtccattggc tgtccgccga tatgctcggt gcgcaatacc ccgccgagcc 2954821 tatcggccgg ttcgaacaag gtgatggtcg cgtcatcgcc gacagcctgc cgcagccggt 2954881 acgccgaggt caatcccgaa atcccgcctc ctacaacaca atacgagcgg ggagtcatag 2954941 cgagtgtacg agcgagacca ggtcggccag caccgcggga tcgctttctg gcagcacccc 2955001 gtggccgagg ttgaagatat ggccggccgc accggcgtcg acggcgcggc gtccgtcgtc 2955061 gacaacggca cgtgcggcac gttccaccgc cggccagccc gccaggacca ccgccggatc 2955121 gaggttgccc tgcaacgccg tgccgggcac cacccgggcg gcggcgtcgg tcagcggggt 2955181 ccgccagtcc acgccgacga cggcccctcg gcctggccgc tccccggctg tcacggcctc 2955241 cgacatcgcg cccagcaatt cggcggtccc aaccccgaag tgcgtcatcg gcacgccatg 2955301 ctcgcccagc gcagcgaaca cccgggcgct gtgcggcaac acgtactggc ggtagtcgat 2955361 cggcgagagc gccccggccc aggagtcgaa tacctggatg gcgtccaccc ccgcgtcgat 2955421 ttggccgacc agaaacgcga tggtgaggtc ggtcagcttg gccatcagcg cgtgccagct 2955481 cgccggctcg gccaacatca tcgccttgac gtgggcgtga tggcggctcg gtccgccctc 2955541 cacgaggtag gaggccagcg tgaacggcgc gccggcgaaa ccgatcagcg gcacgtcgcc 2955601 aagctcagcg accaacaacg aagccgccac caataccggt tgaatcgctt gtggatcaag 2955661 tggtttcatg gcggcgacat cggcggcggt gcgcaccggg tccgcgatca ccggcccaac 2955721 gtcggcgacg atgtccaaat ccacgccggc cgcccgtagc ggcaccacga tgtcggagaa 2955781 caggatggcc gcgtcgacgt cgtagcggcg tatcggctgc agggtaatct cacaggccac 2955841 gtccggttcg aaacaggccg ccagcatgct gtaccgctcg cgcagcgccc ggtattcggg 2955901 caacgagcgc ccggcctgcc gcatgaacca caccggcacc cggctgggct tgcggccggt 2955961 gacggcggcc agatacggcg actgcggaag gtcgcgacgg gtactcatcg aactcaatgc 2956021 tgccacgacc gccaccccgc acctgcgtaa catcgaccca atgccagtta cctacgacga 2956081 cttccccagc ctgcgctgcg aaatccacga ccaacctggt cacgaaggcg tgctggagct 2956141 ggtgctggac tcccccgggc tgaactcggt cgggccgcac atgcaccgcg accttgccga 2956201 catctggccg gtgatcgatc gcgacccggc cgtgcgcgtg gtcttggtcc gcggtgaagg 2956261 caaggccttt tcctccggcg gcagtttcga cctgatcgcc gaaaccatcg gcgactacca 2956321 gggccggctg cgcatcatgc gcgaggcccg cgacctggtg ctcaacctgg tcaacttcga 2956381 caagccggtg gtgtcggcga ttcggggccc ggccgtcggt gcgggtctgg ttgtcgcgct 2956441 gctcgccgac atttcggtgg cgggccgcgc cgcgaagatc atcgatgggc acaccaaact 2956501 cggggtcgcc gcgggggatc acgcggcgat ctgctggccc ctgctggtcg gcatggccaa 2956561 ggccaagtac tacctgctga cctgcgagcc gctgtccggg gaggaggccg aacgcatcgg 2956621 tctggtctcc atctgcgtcg acgacgacga tgtgctcccc accgcaacac gcctggcgga 2956681 gcggctcgcc gctggcgcgc aaaacgccat ccgctggacc aaacgcagcc tcaatcactg 2956741 gtatcgcatg ttcggtcccg ccttcgaaac gtcgctcggg ctggagttca tcgggttcgg 2956801 tggtcccgac gtccgggaag gcctggccgc gcaccgcgaa aagcgccccg cgcggttcgg 2956861 cgccgacccc gatcccggcg ccggcagctg agcacagttc ggcgcgcctg tgcacacgtg 2956921 tcggcggata ggtctaccgt cgaaatctgt gacctccgcc ggcgacgatg cagagcgcag 2956981 cgatgaggag gagcggcgct tgacctccgc cggcgacgat gcagagcgca gcgatgagga 2957041 ggagcggcgc ttgacctccg ccggcgacga tgcagagcgc agcgatgagg aggagcggcg 2957101 cttgacctcc gccggcgacg atgcagagcg cagcgatgag gaggagcggc gcttgacctc 2957161 cgccggcgac gatgcagagc gcagcgatga ggaggagcgg cgcttgacct ccgccgagcc 2957221 ggccctattc cgcgaggcag tagcggcgat gaacgctgtc accgtgcggc cggaaatcga 2957281 actcggccct atccgaccgc cgcagcggct agctccgtac agctatgcgc tgggagccga 2957341 gatcaagcat cccgaactcg acgtcattcc ggagcgttcc gagggcgacg ccttcggccg 2957401 gctgatcatg ctgtatgacc cggacggctc cgatgcatgg gacggcacta ttcgcctggt 2957461 cgcctatgtc caggccgacc tggactcgag tgaagccgtc gaccccctgc tgcccgaggt 2957521 ggcatggagt tggctggtgg acgcgctgac agcgcgcacc gaccaggtga gggccctggg 2957581 cggcactgtc accgccacca catcggtgcg atacggcgac atctccgggc cgccgcgcgc 2957641 tcaccagctg gagctacggg cgtcatggac ggcgaccacc cccgatctgg gcgcccatgt 2957701 ccaggcgttc tgcgacgtcc tggagcacgc ggccggcctg ccgccagccg gggtcaccga 2957761 cctgggctcg cggtcacgcg cctgacatgt gccccgagcc gtctcacgcg ggagctgctg 2957821 agtccgaagg cacggaatcg gaacccaccc ccttgctccg gcccgccggt gggataccgg 2957881 atctgtgtgt gaccgtcggt gaaatcgccg ctgccgcaga actactggac cgcgggcgcg 2957941 gaccgttcgc ggtagacgcc gagcgggcgt cgggtttccg ctactccggc cgcgcctacc 2958001 tgattcagat ccggcgggcc gaggccggca ccgtactgat cgacccggtc agccacggcg 2958061 gtgacccgtt gaccgtgctg gcgccggtcg ccgaggtgct cagcaccaac gagtggatcc 2958121 tgcactccgc cgatcaggat ctgccctgtc tcgccgaggt cggtatgcga ccgccagcgc 2958181 tatacgacac cgagcttgcc gggcgcctgg ccgggttcga tcgagtgaac ctggcggcca 2958241 tggtcgagcg gttacttgga ctgggattga ccaagggcca cggcgcggcc gactggtcca 2958301 agcgcccgct accctcggcc tggctgaact acgcggcgtt ggacgtggaa ctgctcatcg 2958361 aactacgcgc ggtgatctcg cgggtgctgg ccgagcaagg caaaaccgat tgggctgcgc 2958421 aggaattcga gcacctgcgg tcgttcgaat caaggccacc cccagcggcc gcccggcagg 2958481 accgctggcg acgaacctcg ggtatccaca aagtgcatga ccggcggggg ctggccgcgg 2958541 tccgcgaatt gtggacagcg cgtgaccgaa tcgcccagcg ccgcgacatc gcgccccgcc 2958601 ggatcttgcc ggactcggcc attatcgatg ccgccatcgc cgacccaaag tcagtcgacg 2958661 accttgtcgc gttaccggtg ttcggcggac gcaaccaacg tcgcagcgcg gctgtgtggt 2958721 gggcggcact ggcagccgca cgcgaaagcc cagatccgcc ggagatcgcc gaaccggcaa 2958781 acgggccgcc gccgccgggg cggtgggtca gacggaaacc ggcagccgcc gcacggctgg 2958841 atgcggcgcg cgcggcgctg acggaggtgt cgcaacgggt gcgggtaccg accgagaacc 2958901 tggtctcacc tgatctggtg cgacggctgt gttgggaatg ggaggacatc tcgcagagtt 2958961 ctccagaccc gattgccgct gtcgaggcgt acctgcgcac cggccaggca cgggcctggc 2959021 agctcgaact agtggtcccc atcctgaccg cggcgttgac aggggctccg gacgccggcg 2959081 cccagggcga tgatggctct tagtcgagat gttctggaat cgcgtcggac gcacacaccc 2959141 cggtacccag cgcggcgacc cagccggtga tccgccgggc cacgtcctgg tcggtaagcc 2959201 ccagatcggc cagcacctcg cttcgagacg cgtgctcgta gaactcctgc ggcaacccga 2959261 catcgcggca gggcacgtcg atctccgcgc gccgcagcgc ggccgacacc gctgaccccg 2959321 ccccaccgtt gaccccgttg tcctctagcg tgacgagcag cttgtgctgc accgccagtt 2959381 cgcgcacacc gtcagacacc ggcaacaccc agcgcgggtc gatcaccgtc acaccgatcc 2959441 cctggttgtg cagccgcttg gccaccgcca acgccatcgg tgcgaacgcg ccgatggcca 2959501 ccaacaggac gtcgtggttc aaaccatcgg cgggcgccgc cagcacatcc acgcctccac 2959561 gccgctccaa agccgaaata tcttctccca catcaccttt ggggaaccgt aacgccgtcg 2959621 ggccgtcgtc gacgtcgagc gcctcgccga gttcttcacg caaccgggtg gcgtctctgg 2959681 gcgctgccac ccggatgccg ggcacgatac ccagcatcga caagtcccac attccgttgt 2959741 ggctggcgcc gtcgctaccg gtgatcccgg cacggtccag caccatggtg accggcagct 2959801 tgtgcagcgc cacatccatc atgatctggt cgaacgcccg gttcaggaac gtcgagtaga 2959861 tcgccaccac ggggtgcagc ccacccatcg ccaacccggc cgccgacgtc atcgcgtgtt 2959921 gctcggcgat cccgacgtcg aacaatcgat ccgggaagcg ctgcccgaac gcggtcagcc 2959981 cggtggggcc cggcatggcc gcggtaatgg ccacgatgtc acggcgtttc tgggcgtagc 2960041 cgataagtgc atcagagaag gtcgccgtcc agcctgggcc ggccaccttg gtggcttgtc 2960101 cggtggccgg atcgatcggg accgtggaat gcatctgctc ggcctggtcg gcctcggccg 2960161 gcgggtagcc catgcccttg cgggtgacga cgtgcacgat caccggtgca ccgaagcgcc 2960221 gcgcgctgcg cagcgcgacc tccaccgccc gctcgtcatg gccgtcgacc gggccgacgt 2960281 acttcaaccc gaggtcggtg aacagcaact gcggcgacag cgagtccttg atgccggcct 2960341 tgacgctgtg caggaatcga aaccacagac cgccgacaag cggcaccgcg cgcaccaggt 2960401 cgcggcccgt ctccagcgcc tgctcgtagg ccggctgcag ccgcagcgtg gccagatggt 2960461 cggcgacgcc cccgattgtg ggcgcgtagc tgcgcccatt gtcgttgacc acgataatca 2960521 ccggccggcg ggatgcggcg atattgttca gcgcctccca gcacataccg ccggtgagcg 2960581 caccgtcacc gaccaccgcg accacatgcc ggttgcggtg tccggtcaac tcgaacgcct 2960641 tggccaaccc gtccgcgtac gacagcgccg cgctggcgtg gctcgactcc acccagtcgt 2960701 gctcgctctc ggcacgagac ggataccccg acaacccgcc cttcttacgc agggttgcga 2960761 agtcctggct gcgtccggtc aacatcttgt ggacgtaggc ctggtgaccg gtgtcgaaga 2960821 tgatcggatc gtgcggcgag tcgaataccc ggtgcagcgc caaggtgagt tccaccactc 2960881 ccaggttcgg ccccagatgc ccccccgtgg cggcaacctt gtggatcagg aactcacgga 2960941 tctcggcggc cagctcccga agctgcgcct gggaaaggtg ctgcagatca gcgggcccgc 2961001 ggatctgttg cagcatttcg ctagtgtacg cagcaacccc cccattggcc cagcatgcgg 2961061 ccgccgatca aaagggccga accactttga tagcgtcggt ggccggcgcg ccgggaagcc 2961121 tggtcggcga ctcattgtca tccaactccg gagttcgata tgaaggtaaa catcgaccca 2961181 accgcgccca cctttgcgac gtatcgtcgg gatatgcgtg ccgagcaaat ggcggaggac 2961241 tatcccgtcg taagcatcga ttccgacgcg ctggatgctg cccgcatgct cgcagagcat 2961301 cgtctgcctg gactattggt caccgccgga gcgggcaaac agtatgcggt actccctgcc 2961361 tcacaggtcg tgcgcttcat cgtgccccgc tatgtgcaag acgatccctc actggccggt 2961421 gtgctcaacg aatcgacggc cgaccggtgc gccgagagat tgagcggcaa aaaggtccgc 2961481 gacgtgttgc ctgaccacct ggtcgaggtt cccccggcta acgccgacga caccatcatc 2961541 gaggtggccg cggtgatggc acggctgcgc agcccattgc tcgcggtggt caaagacggc 2961601 tcgctgctcg gggtggtcac cgcatcgcgc ctgcttgctg cggcactgaa gacttgacct 2961661 cgtgagcgtc gtcgcggtca ccatcttcgt ggcggcctac gttctgattg ccagcgatcg 2961721 cgtcaacaag acgatggtgg cgctgaccgg cgcggcggcc gtggtcgtcc taccagtgat 2961781 cacatcccac gacatcttct attcccacga caccggaatc gactgggacg tcattttctt 2961841 gttggtgggc atgatgatca tcgtcggagt gctgcggcag acgggggtgt tcgaatacac 2961901 cgcgatctgg gccgccaagc gcgcccgcgg ctcgccgcta cgcatcatga tcctgctggt 2961961 attggtgagc gcgttggcgt cagccttgct ggataacgtc accacggtgt tgttgatcgc 2962021 gccggtcacg ctattggtgt gcgaccggtt aaacatcaac acgacgtcgt tcctgatggc 2962081 cgaagtcttc gcctccaaca ttggtggcgc cgcgacgttg gtgggtgacc cgccgaacat 2962141 catcgtggcc agccgggcgg gattgacgtt caacgacttc atgctgcact tgacaccgct 2962201 ggtagtcatt gtgctgatcg ccctcatcgc tgtgctgccc cgcctgttcg gctcgatcac 2962261 ggtcgaagcc gatcgaattg ccgatgtcat ggcgctcgac gagggtgaag ccatccgcga 2962321 ccgcggactg ctggtcaaat gtggcgccgt gctggtgctg gtgttcgcgg ccttcgtcgc 2962381 ccatccggtg ctgcacatcc agccttctct agtggcgctg ctgggcgctg ggatgctgat 2962441 cgtggtctcg ggtctgacgc gatccgagta tctatccagc gtcgagtggg acacgctgct 2962501 gtttttcgcc gggctgttca ttatggtcgg agcgctggtc aagaccggtg tcgtcaacga 2962561 tctcgcgcgg gcagcgaccc agctgaccgg cggcaatatt gtggccaccg cgttcctaat 2962621 cctcggcgtc tccgccccga tctcgggaat tatcgacaac attccctacg tcgccacgat 2962681 gacgcccctc gtcgcggagc tggtcgcggt catggggggt caacccagca ccgacacccc 2962741 ctggtgggcg ctggccctgg gtgccgactt cggcggcaac ctgaccgcaa tcggcgccag 2962801 cgcgaacgtc gtcatgctcg gaatcgcccg gcgcgcagga gctcccatct cgttctggga 2962861 gttcacccgc aaaggggcgg tggtcacggc cgtctcgatc gcgctcgcgg cgatctacct 2962921 gtggttgcgg tacttcgtgt tgttgcactg accatctgta ttgccgacag acctgtagca 2962981 ccagacgacg ccgcgatgag cggcctacga gaagattcgg aggatggccg atgagcatca 2963041 tcgccatcac ggtgttcgta gccggctatg cacttatcgc aagcgaccga gtcagcaaga 2963101 cccgggtggc actgacgtgc gcggcgatca tggtcggcgc cgggatcgtc ggatcggacg 2963161 acgtgttcta ctcgcacgaa gccggaatcg attgggacgt catctttctg ctcttgggca 2963221 tgatgatcat cgtcagcgtg cttcggcaca ccggcgtctt cgaatacgtc gcgatttggg 2963281 ccgtcaaacg cgcaaacgcc gcgccgttgc gcatcatgat cctgctggtg ctggtgaccg 2963341 cgctggggtc ggccctgctg gacaacgtca ccacggtgtt gttgatcgcg ccggtgacgc 2963401 tactggtatg tgatcgactg ggggtcaatt ccacgccgtt tttggtggcc gaagtcttcg 2963461 cgtccaatgt cggcggcgcg gccacgctgg tcggcgaccc gccgaacatc atcatcgcca 2963521 gccgggcggg actgacgttc aacgacttcc tgatccacat ggccccggcc gtgctcgtcg 2963581 tcatgatcgc cctgatcggt ctgctgccct ggctgctggg ctccgtcact gccgagcccg 2963641 accgagttgc cgacgtgctg tcgctcaacg agcgcgaagc catccacgat cgcgggctgc 2963701 tcatcaagtg cggtgtcgtc ttggtgctgg tgtttgcggc cttcatcgct catccggtgc 2963761 tgcacatcca gccgtctctg gtggcgctgc tgggcgccgg tgtgctcgta cggttctcgg 2963821 ggctggagcg atccgactac ctgtccagcg tcgagtggga caccctgctg ttcttcgccg 2963881 ggctgttcgt catggtgggg gccctggtga agaccggtgt cgtcgagcaa ctggcgcggg 2963941 cagcaaccga gctgaccggc ggcaacgagt tactcacagt cggtttgatt ctcggcatct 2964001 cggcaccggt gtccggcatc atcgacaaca tcccctacgt cgccacgatg acgcccatcg 2964061 tgaccgaact ggtcgccgcg atgccgggcc acgtccaccc cgacacgttc tggtgggcac 2964121 tggcgctaag cgccgacttc ggcggcaacc tgaccgccgt gggagccagc gccaatgtcg 2964181 tcatgctcgg aatcgcccgg cgctcgtgca ctcccatctc gttctggaag ttcacccgca 2964241 agggcgcggt ggtgaccgcg gtctcgctcg tgttgtcggc ggtctacctg tggctgcggt 2964301 acttcgtgtt cggctaagcg ccaacgctca cgcgtgctta gcgcgaaagc gccgaaacag 2964361 cacccagacg atggccaggt tgtagacggc acccccgacg agatacggcc accaggtgcc 2964421 gtgatcgctg gcgacccaaa atgccttggc agcccagtac ggtggtaaga cgccgaacgc 2964481 gaggttccag ttggaactga tgaaccacgg caggcagggc agcccggcga tgagcatgcc 2964541 cagcgcacgg accatcgcca ggccctgaat cttgttgttc gccaccgcaa gaatcagcag 2964601 cagcgtgacc accgccgaca ggccggccac cagtccgatg ggaatcagtg aagacaccag 2964661 gcccggttcg aggatcccgc tgcacgacat cgtcgcgacg acgtagatgg tggtcaccac 2964721 catcacggtg gccgcacgat agccgaaaaa gaccgacagc ggcaccgggg ttactcgcag 2964781 cgccgtcatc gtgcccgcgt ctacgtcgtc cagcaccaag aacgcggcca gcgcaccggc 2964841 gacgatgatg ctggtcaaca acaggaacgc ggtgaggatc agtgggtagt atccgaccag 2964901 gtcgaatcca taacgccgcg ccagcatctc ggtgaacagc ggcgtgagca gcgcgactcc 2964961 ggtggtccag atgaccggtg cgatgacgag catgaccagc agcggatcgc ggtaggtgcc 2965021 tcgaatgtcg ttgcggccga acgcggccaa cgcccgtggg cccgcaaggc tcgatatcgc 2965081 tctcacagca cacccgatct ttgcacgaca taacggccga atagcgcctt ggccgcccgg 2965141 cacaatcccg ccgcacacac gattgggtag accaccgcat acccgacctg ccagggcgcc 2965201 aagctcacct gatcgaacgc cgcgccgagc aagagcagcg gcccctgggt ggggatgagg 2965261 taaagcaccg ggttgggcca caggccggag tagtgcacca ccggcggcgc cagcatgatc 2965321 gcgagcggga tgaccgccgc caggaaccaa tcggtcaccg aggcgaacgg caacgaggaa 2965381 ctgaagccga ccagcagcat cagcagtgtg cccagcacga tgccggccac cagcggcagc 2965441 aggtggtaac caagcccgtg aacgatggtg gccacgacaa ccgcaacgaa cagcgagatc 2965501 gccagcagca cagttagttt ggcagccagg tactcccaga accgcagcgg cgtcgagacg 2965561 atcgcgccga tcgtgcgctc ctgcttctcg aagaacacgg tcccgccgac gaagaagaac 2965621 ccgatgatcg cgatatcacc caccaggaca tagggttcgg cgaccgggcg caggctgacc 2965681 ggcatcggca gcagcactgc cagccaaatc agtccggaga aaacggcggc atgcaagaac 2965741 ttctgccgca cctgtagcgt cagctcgagc cgcagcgcag gcaccaaccg ggtcatgtca 2965801 gctgcctgcc ggtgacctcg acgaagacat cgtcgaggct ggcctcgcgg ctatgaatgg 2965861 tctcgacgtg gtggtttcgc agcacggagt ggaacgccgg gtcgtcggca aggccgtcca 2965921 tgccgaactc ggcggtctcg agtcccccgc cgtcgccccg gtattccacc cgcacccgcc 2965981 gccggctgcg agcgatcttc agttcggtgg gactgtccag tgcgacgatc ctgccgtcga 2966041 cgacgaacgc cacccggtcg cacagctcgt cggcggtggc catgtcgtgc gtggtgagaa 2966101 agatcgtgcg gccgcgcgcc ttcaggtcca cgatgatgtc cttgatcttg cgggcgttca 2966161 ccgggtccag cccggaggtg ggctcgtcga ggaacagcag ctccgggtcg ttgatcagcg 2966221 acctggcgaa ggtcagccgc atctgcatgc ccttggagta cttgcccact agggtgtggg 2966281 cgtcatcggc caggccgacg gcggccagca gctgcatcgg gtcggccgtc gcgccggcgt 2966341 acagcgaggc gaagaagcgc aggttctcat acccggtgag cttttggtag tggttgggca 2966401 gctcgaagga gaccccgatg cgctcgtagt aatcgggtcc ccactcggcc ggctctttgt 2966461 cccacaccgt ggcctggccg ccgtggtcgc gcagcagccc gatgagaagc ttctgggtgg 2966521 tggacttgcc cgcgccgctg ggacctagaa gcccgaagat ttcgccgcgg ccgacggtga 2966581 actccatgcc acgcaccgcc ggctcggccg cctttgggta gcggaaggtg agcccgcgca 2966641 cgcggatcac ctcggttccc acacgcgccg atgccacagc acggttgagc gccgtcatga 2966701 ttggctccgt tccctttcgg gcgagcgcgg tgcgccggct catccaagta accagaaagt 2966761 caccgcgcca atgctgatac ctggttccga ccagtcttcc cggagcgcca acccaagact 2966821 actagctgcg ctgctgtata cggagcaacc cacgacgacc acgggcgagc tggtcgagca 2966881 gctgcatgac ctctacacct ttcgggtcaa cagcgcaacg cactcgacgt agtgagtcag 2966941 cgggaacgcg tcgaacacct tgatcttctc cacggcgtaa ccgtgaccac ggtagaggcc 2967001 gatatcgcgc gcgaaagacg ccgcttcgca accgatatgt atcaaccgtg gcacccccgc 2967061 accggccagc aagtcgacaa cctcgcgccc agcgcctgat cgcggtggat ccagcaccgc 2967121 cagatccgcg ccggcgggtt gcactgccaa cacccgccgc accgaaccgg tgacgacctc 2967181 cacctggggc aaatcgacca gcgcggcacg tgcggccccg gatgccaggc gcgaagtgtc 2967241 gacggtcaac acccgtccgg actccccgac cgcctcaccc agcaccgcag cgaaaacccc 2967301 cgcaccaccg tagagatccc aggcggtcat gccgggggcg ggctgagccc agtcagcgat 2967361 cagatcgctg tagaccgccg ccgcgtcgcg atgcgcctgc caaaaggccg ttaccggcac 2967421 ccgccagctg cgccggtgca cacgctggtg ggcgtggtag gcgccctcca ccacgttggt 2967481 cacggttcgg gtcctattcc gagggccctg ccgcacggaa cagaccacat ggcgctcgcc 2967541 gtcgtcgtcc agagccacgt aaagctgggc ttccggcggc cagtcagccg ctaccaggcc 2967601 gtctagcatg ccgacaggca actgcccgca gtccaggtcg gttaccagct cgccactgtg 2967661 gtagcggtga aaacctggac gacggtctgc gcccacgtcg agccggactc gaatacgcca 2967721 acccgtgggg ccggcatccg acagcggttg cgcctcgccc tgccagctgt gccgcccgag 2967781 ccgttccagc tggttagcca caacttgcgc cttaagtgtg cgggccgcct ccggagcagc 2967841 aaacgccaga tcgcaacacc cggcgccgtc ggccccggcg atcgaacaca gcgacccgat 2967901 ccggtcgggc gacgggtcga tcacctcgaa agcctctgcg tgccaataag agccacgttg 2967961 cgcggtcacc cgcgcccgca ctcgttcacc gggcaacgca tagcggacga aaaccacccg 2968021 gccctcgtgg tgcgccacgc agctaccgcc gttcgcgggc gctccggtga ccaacgtcag 2968081 attcactgca tcgtcgccgg cgcgggtcac tggcgccgct cctccccatc gctttgctct 2968141 gcatcgtcgc cggcgcgggt cactggcgcc gctcctcccc atcgctttgc tctgcatcgt 2968201 cgccggcgcg ggtcactggc gccgctcctc cccatcgctt tgctctgcat cgtcgccggc 2968261 gcgggtcaat cgaagatgcc ccgtcgcgtg tcaccgggag ccgcgtgcgg ctgtaacgtc 2968321 ttgatccgct ccgacgacgt cagttgccaa ggcaccgaag tcaccatcac gccgggcatg 2968381 aacagcaacc ggcccttgag ccgcagcgca ctctggttgt gcagcagctg ttcccaccag 2968441 cgccccacga catactccgg aatgaatacc gtcaccacgg tccgtggcga ttccttgctg 2968501 acccgcttga cgtaatcgag caccggccgg gtgatctcac ggtacggcga ggcgatgacc 2968561 ttgagtggca cgctcacatc gctgtcctgc cactggcgca ccagctcgcg ggtttccgca 2968621 tcgtcgacgt tgaccgtcac ggcttccaac acgtcgggcc gggtcgctcg tgcgtaggtc 2968681 aacgcgcgca acgtcggcag gtgcagcttc gacaccagca cgacggcgtg attgcggctg 2968741 ggcaacgtta tctcggcttc ctcggcctgt tccgccaact cccggttgac ggcgtcatag 2968801 tgcctgtgga tgagcttcat catcatgaag aaccctccca tggcgacgat cgcgatccat 2968861 gctccggcaa ggaatttcgt taccagcacg atgagcagga cggtaccggt ggacacgaag 2968921 ccgaccgtgt taaccgcgcg ggagcgcagc atcgcgcgac gggcgcgcgg atcggtctcg 2968981 gcgctcagca accgggtcca gtgccggacc atgccgacct gactcatggt gaacgagatg 2969041 aacacaccga cgatgtacag ctggatcagc gcggtcaact cggcacgaaa cgcgaccacc 2969101 gccccgatcg ccgccgccgc caggaacagg attccgttgg agaacgccag ccggtcccca 2969161 cgggtgtgca actggcgcgg cagatagctg tgctgcgcca gcaccgagcc cagcaccggg 2969221 aagccgttga aggcggtgtt agcggccaac accaggatca gcgctgtcac cgcggcgatc 2969281 agcaagaacc ccaggtaaaa gcccccgaac acggcctgcg ccagttgtgc gaccagcgtc 2969341 ttttgctgat aacccggcgg ggcgcccgtc agctgggtgt ccggatcgtc gacgacctgg 2969401 accccggtct ctacggccag cacgatcatg cccataaaca tgctcaccgc aatgatgccc 2969461 agcatcagca gcgtggttgc cgcgttacgc gacttgggct tttgaaacgc cggcaccccg 2969521 ttgctgatcg cctcgacacc cgtcagcgcc gcacaccccg acgaaaacga gcgcgccacc 2969581 aagaacacca gcgcgaaacc gacgatctgg ccgtgctctg cgtgcatttc aaaagccgcg 2969641 gactcggccc gaaccggatt gcccagcacg aaaatccgga acaaccccca cacgagcatg 2969701 gtgccgattc cggcgatgaa cgcataggtc gggatcgcga acgccaaccc ggattcccga 2969761 accccacgca agttcatcgc catgatcagc acgatcgcgc cgacggcaaa caacaccttg 2969821 tgctcgtaca cgaacgggct cacagagccg atgttggacg ccgccgacga tatcgaaaca 2969881 gcaacggtga gaacgtaatc caccatcagg gcgctggcaa ccacgagacc gccggtagca 2969941 cccaggttgg tggtgacaac ctcgtagtcg cccccaccgg aggggtaagc gtgcacgttc 2970001 tgccggtaac tagacaccac cacgagcaga accgcggcga ccgccaggcc gatcaacggc 2970061 gccatcgaat aggccgccag gccggccacc gagagcacca gaaatatctc ctcgggggcg 2970121 taggctatcg acgacatcgc atccgaggcg aacaccggca aggcgatccg cttgggcaac 2970181 aaggtgtgac tgagccggtc actgcgaaac ggccggccga tcagcaaccg acgcgccgcg 2970241 gttgaaagtt tggacacgag agccaagggt aggcctatcc gagcgtggcg gtagcgttcc 2970301 ctagacgaga atgttcgccg acgtaaatcg gctggccacc gcgggttgcc gatcgcgtac 2970361 ggcgcaccgg acacagccga gaggacctct aatgcgggtg gttgtgatgg ggtgcggccg 2970421 ggtcggggct tcggtggccg acggactgtc ccggataggc catgaagtcg cgatcatcga 2970481 ccgtgacagc gccgccttca atcggctcag cccgcagttt gccggcgagc gggtgttggg 2970541 tcagggcttc gaccgagatg tgctgctgcg tgcgggcatc cagggggccg acgcattcgc 2970601 cgcggtgtcc tccggcgaca actccaacat catctcggcg cggttggccc gggaaacctt 2970661 cggtgtgccg cgcgtcgtcg cgcggatcta tgatgccaag cgcgccgagg tctatgagcg 2970721 actcggcatc cccaccattg ccaccgttcc ctggaccacc gatcggctgc tcaacgcgct 2970781 aatgcaggac accgaaaccg ccaagtggcg cgatcctacc ggtaccgtcg cggtcgccga 2970841 ggtcgtctta cacgaagact gggtgggcca ccgggcgacc gatcttgagc aggccaccgg 2970901 cgctcggatt gcgtttctga tccgattcgg aaccggtgta ttgccggaac cgaagacggt 2970961 cctacaggcc ggcgataagg tctatatcgc tgcgatatcc ggccgggccg cagaggcagc 2971021 ggccatcgca gccttgccac ccagtgagga cttcgagtcg ggggctcgac gatgaaagta 2971081 gctgtcgccg gagcgggtgc ggtgggccgc tcggtcaccc gcgaactcgt ggaaaacgga 2971141 cacgacatca ccctgatcga gcgcaacccc gaccacctcg acgccgccgc catcccggag 2971201 gcgcattggc ggcttggcga tgcctgcgaa ctgagcctgc tggagtcgat tcacctcgaa 2971261 gagttcgacg tggtcgtcgc cgccaccggg gacgacaagg tcaacgtggt gctcagcctg 2971321 ctagccaaga ccgaattcgc ggtgccgcgg gtggtggccc gggtcaacga tccccgcaac 2971381 gagtggctgt tcaacgacgc ctggggggtc gacgtcgcgg tgtccacacc ccgcatgctg 2971441 gcgtcgctga tcgaagaggc cgtcacggtc ggcgacttgg tgcggctgat ggagttccgc 2971501 acgggtcagg ccaatctggt agagatcacc ctgcccgaca acacgccgtg gggcggcaaa 2971561 ccggtgcgca aacttcagct gccgcgggat gccgcgctgg tgacaatcct gcgcgggcca 2971621 cgagtcatcg tgccggaggc cgacgagccg ctggaaggcg gcgacgagtt gctcttcgtc 2971681 gcagtcaccg aagccgagga ggagctgagc aggctgctgc tgccgtccat gtaaccggcg 2971741 ggctctactc gcggccggcg tcggcgtcga attcagctgc acctccgacg gccgcggcgt 2971801 cgtgagaggc caagatggcg cgctgggctg ccttgattgc cgcgtaggtg gccagcgcgg 2971861 cgagggcggt cagcggccaa cccatcccga tcctggccac tcccagccaa cccgtcttat 2971921 cggcgtcgta gaggtgcctt tggacgatga accgggcagc aaaaaccagc gtccaaccca 2971981 gggtggcgac gtcaaacgca aagacagcgc gggacacgtc gcgccaggcg cgatcgcgcc 2972041 cgctgagcca gctccacaag tagccgacta tcggccgccg gatcaggatc gacagtgtga 2972101 agaccaccgc ccacagcaac gacatccaga tgcccagcag gaagtacccc ttggactgtc 2972161 ccaccaggta cgcgatcagc gcgcacacgg ctaccccgca gaatccggca accaccggcc 2972221 gcgcagattc ccggcgcaaa agccgccaca gcaggatcaa ccccgccatg ctcagggcga 2972281 acccaatcgc gggcagcaag ccggcggcgc tggaagcaac cacaaaagtc accaccggta 2972341 atgacgaata gaccaggccg ctcactccgc cggcctgcgc caacaggcgc tgggcgctag 2972401 tgcggttagc gttcacgaga caccggcaat tccgactgcc ggatagtcac cgctgaattt 2972461 cgtaatgcgg gttgtagata gccttcgtcc cattttccag cttgcccagc cggccgcgca 2972521 cccgcagggt acggcccgtg tcgatgccgg gtatccggcg ttgacccaac cacaccagcg 2972581 tgacggtgtc gctgccgtcg aacaattcgg cgctaacacc acccgagcaa cccttgccat 2972641 tggtttccac gctacgcagg gtgccaacca ccgtgacctc ctggccgcgc tggcagtcga 2972701 tcgcacgctg tgcgccggca ttgagcacct cgtcggataa ctcttcgacg tcgcgttgct 2972761 ccaggtcctc cgtcaaccga cgggtgagcc tgcgcagata accctgggcc cccatggcct 2972821 ctcctgacac gtcacctacg ttatggaagt ttcgtgcaac tgccggcgta ttccacctat 2972881 gccaacggcc accgtagacc tgttggttcc cgggcgccac cgttggcctt ggagcacccc 2972941 aaagtggcgg gcactatcaa gggatggctg tcgatttgga tggggtcaca accgtgttgt 2973001 tgccgggaac cggatcggac aacgactacg tccggcgagc attttccgcc cccctgcgac 2973061 gcgccggggc ggtgctggtg acgcccgttc cgcatcctgg tcgcttgatc gacggctatc 2973121 gcgccgccct ggacgacgcc gcgcgcgacg ggccggttgt cgtcggcggc gtctcgctcg 2973181 gagccgcagt ggcggcggcg tgggcgctgg aacatcccga tcgcgcggtc gccgtcctgg 2973241 ccgccttgcc ggcctggacc ggggaacctg aattagcacc tgccgcgcag gcagcgcggt 2973301 atacggcagc gcggctgcgc tgcgacggtc tggcggcgac gaccacacgc atgcgtgcat 2973361 ctagccccgt ctggttggcc gaggagctga cccgatcgtg gcgagttcag tggcccgagc 2973421 tgcccgatgc tatggaggag gcggcggcct atgtcgcccc aagccgcgcc gagctggccc 2973481 ggctggtcgc gccgctggcc gtggccgcgg cggtcgatga tccgatccac ccgctgcagg 2973541 tcgctgccga ctgggtgtcc gtagctccgc atgcggcgct acggacggtg acgctggacg 2973601 agatcggcgc ggacgccgcc gcgctgggct ctgcctgcct ggccgctctc gccgaggtct 2973661 cgggcgcttg atcgcctgtt tgtccgacgg cggagtgcgc gtaccgtttg ggtcgccgag 2973721 cctgtaattt tgcaggcccc cactcgcact ttgcctgcag agttacagcc tcagcgaaca 2973781 gcgcgctcgt actgttgagg tcgtcgagct agtcccgatc gcccgactcc tcctcacgcc 2973841 gctacgcggc gcgctcgtac tgttggggtc gtcgggctag ccgcccgtcg tgctgcgcaa 2973901 ctgctgcatc gccgatcctt gagcgccgcg gcgtgcaacc cccgcggcgg cttggcgttg 2973961 cgtgtcggcc tgtgccgccg cagcctccct tagttgcgcc gccatcggct cgggcaggtg 2974021 caccggtaac ggcgtccgta ccggcagcgg ggtatcgccc cggcggacca cggtgtccgc 2974081 caacgcctca cgggcctcct cggtcagcgc atcgactgtc tcttgtgggc cgttgacgac 2974141 gcatcggatc atccagcggt agccgttgac cccgatgaag cgcaccacac cggcggcgat 2974201 gccgatcact tcgcgacccc acgggccatc cttgatcgaa actttggccg agtccttgcg 2974261 cagcgagtcg gcgagttcgc cggccacctc acgccagagc ccgccggtct taggtgccgc 2974321 gtaggccgca atgctgtagc gaccgttggg tgtgatgacc cacaccgcgc tgggaacacc 2974381 gctctcggtc agctcgacct gtacctgacc cgcggccggc atcggaatca gcaccgagcc 2974441 caagtccagc cgggccagca ccgccaccga agggtcatcg aagtcgtcga tgtcgaatgg 2974501 gccctgaagc tcctcctggt cttcgacgcc tgacgcggcg gcagcgctgg ccaccacggt 2974561 gtcctccggt cggacgtgct cgtcggccgg ctggacgggg gcgtgtccgg ccttgcgttt 2974621 gccaccgtct ttgcctgtgc gtctaccgaa tgccatggcg agcgccgctc tcccccgtaa 2974681 gcgggtggta cccccacctc atcgcgccct cctttgcatc gtcgccgggg tcacaaactc 2974741 gcatgtccgc cggaggaacc gtggccaccg tcgccgcggg atgtcgaggc cagcccggcc 2974801 tcgtcgaacg acgagacctc gaccagctcg accaactcaa cccgttgcac tagcaactgg 2974861 gcgattcggt caccgcgatg taccacgatg ggcgcggctg ggtccaagtt gatcagggcc 2974921 accttgatct ccccacgata acccgcgtcg atggtgcccg gactgttgac gatcgaaagc 2974981 cccacccgcg tggccaaccc ggagcgcgga tggaccagcc cgaccatgcc gaacgggacg 2975041 gcgaccgcaa cacccgtccg taccagggcg cggcgcccag gtgccagctc gacgtcttcg 2975101 gcgctgtaga gatcaacgcc ggcgtcgccg tcgtgagcgc ggctgggcag cgggagcccg 2975161 gggtcgaggc ggacgatcgc cagagtggtc gacacggggc cacagactac ccttgaccgc 2975221 gtgtctggga cgcgcctcgc gccgcacagc gtgcgatacc gcgagcgatt gtgggtgccc 2975281 tggtggtggt ggccattggc tttcgcgcta gcggcgctta tcgcgtttga agtaaacctg 2975341 ggcgttgcgg ccctacccga ctgggtaccg ttcgcaacgc ttttcacagt cgcagccggg 2975401 acgctgctat ggctcggacg tgtcgaaatt cgggtcaccg ccggctcagc ggatggagcc 2975461 ggagtgaagc tatgggccgg accagcgcat ctgccggtag ccgtgatcgc ccgatcagcc 2975521 gaaatcccgg ccacggctaa atctgcggcg ctgggccgac aactcgatcc ggcagcttac 2975581 gtcctgcatc gggcctgggt ggggcccatg gttctggttg tcctcgacga ccccaacgat 2975641 cccacgccgt actggttggt gagctgccgc cacccggagc gggtgttgtc ggcgctgcgc 2975701 agctgaccta tcaggcggcg cagtcggtgc agatcatcac gccgttcttc tcgctggcca 2975761 acctgctgcg gtgttgcacc aaaaagcaac tcgagcaggt gaattcgtca gcttgcttgg 2975821 gtacgacgcg caccgacagt tcttcgccgg acaggtcggc gccaggcagt tcgaaggact 2975881 cggcggattc ggattcgtcc acgtcgacca cggccgacgc cgcctcgttc cgtcgtgctt 2975941 tgagctcttc aagcgagtcc tccgagacat catcggtctc ggtacgccgc ggagcgtcat 2976001 agtcggtagg cattcctatc ccctcacatg cctcataact tcaagcaacg ctttgtacca 2976061 gcgtcgaacg cgtccaccaa acgattcgtg cccgtatcgt ggcctattca agtgtgattt 2976121 acatcacata ttcatattgc accttgtacg cggccctaaa cggtgccttt ttgggtgcga 2976181 actacaccca atggtccgcc tcctcaccgc gccgtgccgg cacgcgtcgt cagcggatta 2976241 aagtgcacgt gtggtcgcac aaatcaccga gggtaccgct ttcgacaagc acggacggcc 2976301 ctttcggcga cgcaaccccc gacccgctat cgtcgtggtg gccttcctcg tggtggtgac 2976361 ttgcgtgatg tggactcttg cactgacgcg gcccccagat gtccgcgagg ccgcagtctg 2976421 caacccgcct ccgcagccgg cggggtcagc accgaccaac cttggtgaac aggtgtcgcg 2976481 gacggacatg accgatgtcg cacccgccaa actgagcgac accaaagtcc acgtcctcaa 2976541 cgccagcggc cggggcggcc aagccgccga tatcgctggc gcactgcaag atctgggctt 2976601 cgcccagccg accgccgcca acgacccgat ctatgccggc acccggctgg actgccaagg 2976661 ccagatccgc ttcggtacgg cggggcaagc caccgctgcc gcactatggc tggtagcgcc 2976721 gtgcaccgag ctgtatcacg acagccgcgc cgacgattcc gtcgaccttg cgctcggcac 2976781 cgacttcacc acgctggcac acaacgacga catcgacgcc gtgcttgcca acctgcgccc 2976841 cggcgccacc gagccctcag atcccgcgct gctggccaag atccacgcca acagctgctg 2976901 atcggccggc tcagtccggg atcggctcta ggccgttgaa tcgctgtagc gccgccaaca 2976961 gctcgtcggc gattccgggc gcggcagcca ccaccaccaa ccccgcgccg cccgctctcg 2977021 gcgttgacaa caacacgcgg gcccccgcct cagcggcgat caacgcacct gccgcacagt 2977081 cccacacctg caccccgtgc tcgtagtagg cgtccagccg acccgccgct accatgcaca 2977141 agtccagcgc cgcagaaccg atccgacgca cgtcgcggac caacggcaca acatgagcca 2977201 gcaattctgc ctgcttctcg cggcaccgaa ccgagtaccc gaagccggta cccagcaacg 2977261 ccatcgacaa ctcgtcgaca ccggtgcacc gcaacacatg tctcccccgc tcatcggtga 2977321 gatgtgcgcc gaggcccgtc gccgccgaat acaccgtgcg agcggcgacg tcggcgaccg 2977381 cgcccgccac cgtgatgccg ccaacctgtg ccccaatcga caccgcgtac gccgggatgc 2977441 cgtagacgaa attcaccgtg ccgtcgatgg ggtcgagcac ccaagtgacc cggtcggagg 2977501 gtgtagccgt cacgtcggcg ggaccaccac cttcctcccc gagaatcggg tcaccgggcc 2977561 gaagttgagc caaccgatca cgcaagagcc gctccgtgtc ggtgtcgacc acggtcaccg 2977621 gatcggtcgg gctgctcttc gcgcgcaccg cgccgtcgcc gtcgcccgcc ctggagatgc 2977681 cgaaaacctc ggcccgacga ccgcgaacga aggccgccgc ctcggcagca aggttttcgg 2977741 ccacagagcg cagccgcgcg ggttcgttgt caggtcgtgt caccggccta tcgcatcaca 2977801 gtcgccaccc gcatggtggc gtggactcca gcggccataa cgccctcgca actgccgggc 2977861 cgcagtttaa ggtgagggtc atccacgtct cgccgaggag attcgatgac cagcaccggc 2977921 cccgagacgt ccgaaacacc gggtgccacg acacagcgtc atggcttcgg catcgacgtc 2977981 ggcggcagcg gcatcaaggg cggaatcgtc gacttggaca ccggccagct gatcggcgac 2978041 cggatcaagc tgctgacccc gcaaccggcc actccgttgg cggtcgccaa aaccatcgcc 2978101 gaggtcgtca acggtttcgg ctggcggggt ccgctggggg tgacctatcc cggcgtcgtc 2978161 actcacggcg tcgtccggac cgcggctaac gtggacaagt cctggatagg gaccaacgca 2978221 cgcgacacta tcggcgccga gctgggcggt cagcaggtca ccatcctcaa cgacgctgat 2978281 gccgccgggc tggccgagac acgctacggg gccggcaaga acaaccctgg cttagtggta 2978341 ctgctcacat tcggaaccgg gatcgggtcc gcggtcatcc acaacgggac gttgataccc 2978401 aacaccgagt tcggacatct tgaggtcggc ggcaaggaag cggaggaaag ggccgcctcc 2978461 tcggtaaagg aaaagaacga ctggacctat ccaaagtggg ccaagcaggt gacacgcgtg 2978521 ctcatcgcca tcgagaacgc gatctggcct gacctgttca tcgccggcgg cggcatcagc 2978581 cgcaaggccg acaaatgggt gccgctactg gaaaaccgca caccagtagt gcccgcggcc 2978641 ctgcagaaca ccgccggaat tgtcggtgcg gccatggcct ctgtcgcaga tacgacgcac 2978701 tgaaacttgc ccgctcgggc tgtactcgtg cgcagtaaag ttacaatggt cagcggcggc 2978761 cgcccgaccg atagcgcgcg agtattcacg ctgatatcaa cgccgacatt cgacatagca 2978821 gacactttcg gttacgcacg cccagaccca accggaagtg agtaacgacc gaaggggtgt 2978881 atgtggcagc gaccaaagca agcacggcga ccgatgagcc ggtaaaacgc accgccacca 2978941 agtcgcccgc ggcttccgcg tccggggcca agaccggcgc caagcgaaca gcggcgaagt 2979001 ccgctagtgg ctccccaccc gcgaagcggg ctaccaagcc cgcggcccgg tccgtcaagc 2979061 ccgcctcggc accccaggac actacgacca gcaccatccc gaaaaggaag acccgcgccg 2979121 cggccaaatc cgccgccgcg aaggcaccgt cggcccgcgg ccacgcgacc aagccacggg 2979181 cgcccaagga tgcccagcac gaagccgcaa cggatcccga ggacgccctg gactccgtcg 2979241 aggagctcga cgctgaacca gacctcgacg tcgagcccgg cgaggacctc gaccttgacg 2979301 ccgccgacct caacctcgat gacctcgagg acgacgtggc gccggacgcc gacgacgacc 2979361 tcgactcggg cgacgacgaa gaccacgaag acctcgaagc tgaggcggcc gtcgcgcccg 2979421 gccagaccgc cgatgacgac gaggagatcg ctgaacccac cgaaaaggac aaggcctccg 2979481 gtgatttcgt ctgggatgaa gacgagtcgg aggccctgcg tcaagcacgc aaggacgccg 2979541 aactcaccgc atccgccgac tcggttcgcg cctacctcaa acagatcggc aaggtagcgc 2979601 tgctcaacgc cgaggaagag gtcgagctag ccaagcggat cgaggctggc ctgtacgcca 2979661 cgcagctgat gaccgagctt agcgagcgcg gcgaaaagct gcctgccgcc cagcgccgcg 2979721 acatgatgtg gatctgccgc gacggcgatc gcgcgaaaaa ccatctgctg gaagccaacc 2979781 tgcgcctggt ggtttcgcta gccaagcgct acaccggccg gggcatggcg tttctcgacc 2979841 tgatccagga aggcaacctg gggctgatcc gcgcggtgga gaagttcgac tacaccaagg 2979901 ggtacaagtt ctccacctac gctacgtggt ggattcgcca ggccatcacc cgcgccatgg 2979961 ccgaccaggc ccgcaccatc cgcatcccgg tgcacatggt cgaggtgatc aacaagctgg 2980021 gccgcattca acgcgagctg ctgcaggacc tgggccgcga gcccacgccc gaggagctgg 2980081 ccaaagagat ggacatcacc ccggagaagg tgctggaaat ccagcaatac gcccgcgagc 2980141 cgatctcgtt ggaccagacc atcggcgacg agggcgacag ccagcttggc gatttcatcg 2980201 aagacagcga ggcggtggtg gccgtcgacg cggtgtcctt cactttgctg caggatcaac 2980261 tgcagtcggt gctggacacg ctctccgagc gtgaggcggg cgtggtgcgg ctacgcttcg 2980321 gccttaccga cggccagccg cgcacccttg acgagatcgg ccaggtctac ggcgtgaccc 2980381 gggaacgcat ccgccagatc gaatccaaga ctatgtcgaa gttgcgccat ccgagccgct 2980441 cacaggtcct gcgcgactac ctggactgag agcgcccgcc gaggcgacca acgtagcggg 2980501 cccccatgtc agctagccgc accatggtct cgtccggatc ggagttcgaa tcagccgtcg 2980561 gctactcgcg cgcggtacgc atcgggccac tcgtggtggt ggccggaacg accggcagcg 2980621 gcgatgatat cgccgctcag acgcgagacg ctctgcgccg catcgagatt gcgctcggac 2980681 aggccggcgc aactctggcc gacgtggtcc gtacccgcat ctatgtgacc gatatttccc 2980741 gctggcgcga ggtcggcgaa gtgcatgcac aggctttcgg caagatccgt ccggtgacga 2980801 gcatggtcga ggttaccgcg ctgattgcgc ccggcctgct ggtagagatc gaggccgacg 2980861 cctacgtagg gtcggcggtt gcagaccgaa attcgggagc cggcccgaag gacccgtcac 2980921 cagccggtgg gtaggcggcg gccccaatca cagcgcgcac cggcagtggg ccgtagagat 2980981 gcgggaaaag catcgaccgc ggatcagtag gcacgcccgg ctcccaacgc acgggtgagt 2981041 cgagcgccgc cgggtcgatg tacagcagca ccaggtcagc acggccacgg taaaggcggt 2981101 tggcgggcag gtgaacctgc tcgagtgtcg acaggtggat ataccccgtc ttgtcggact 2981161 cgggatagat cccaccgcgt tctcgggcat gcgaccactc ctgcaccccg cataggtgca 2981221 ccagcatggc aggatcgggc gtcattctca ccaccctgcc cgattggcgg gggcgaaagt 2981281 cgtgagaaat gacacacccg acagcggccg gggaacacgg cgagaacccc gaacgtctga 2981341 gaaggtgaag atacccgaga acggagagcc atgaacgcaa ctctgaccag tcctgagctg 2981401 actagagcag accgctgcga ccgctgtggc gctgcagctc gggtgcgcgc caagctgccc 2981461 tccggagccg agcttctttt ctgccagcat cacgccaacg agcacgaggc gaaactgacc 2981521 gagatgtccg ccgtgctgga ggtcagcggg agcgaataga ccgaactcac ccgtccacaa 2981581 tgccggtagc gcgcgcagtt ttcggtaatg ctggactggt atgagcgacc aggtccccaa 2981641 gccacaccgc caccacatct ggcgaatcac ccgtaggact ttgtccaaaa gctgggacga 2981701 ctcgatcttc tcggagtcag cgcaagcggc tttttggtcg gccttgtctt tgccgccgct 2981761 actgctggga atgctgggca gtctggccta cgttgctccg ctattcggcc cggacacctt 2981821 gcccgcgatt gaaaagagcg cgctttcgac ggcccacagc tttttctccc ccagtgtggt 2981881 caacgagatc atcgagccca ccatcggcga tatcaccaac aacgcccgcg gtgaggtggc 2981941 gtcgctgggc ttcttgatct cgctgtgggc aggatcgtcg gcaatctcgg cgttcgtcga 2982001 tgcagtggtg gaagcgcacg accagacacc gctacgccac ccggtccggc aacgcttctt 2982061 tgcgctcttc ctctacgtgg tgatgttggt gttcctagta gcgaccgcac cggtaatggt 2982121 ggtgggtcca cgcaaggtaa gcgagcacat cccggagagc ttggccaacc tgctgcgcta 2982181 cggctactac cccgcgctta ttctcggtct aaccgtcggg gtcatcctgc tataccgggt 2982241 ggcactaccg gtacccctgc cgacgcatcg gctggtccta ggcgcggtgc ttgcgatagc 2982301 ggtcttcctg atcgccacct tgggcttgcg ggtctacctc gcgtggatca cccgcactgg 2982361 ctacacctac ggagcgctgg ccacgccgat cgcgtttctg ttattcgcct tctttggcgg 2982421 ctttgcgatc atgctcggcg ctgaactcaa cgccgccgtc caggaggaat ggccggcgcc 2982481 ggcgacgcat gcccaccgac tgggcaattg gctaaaggcc cgcatcggcg tcggcacgac 2982541 gacgtattct tcgacagccc agcacagcgc cgtcgctgcc gagccgccga gctagtcagc 2982601 ccttcttgag ggtgtcgtaa atccgcttgc aatcgggaca gaccggcgag cccggcttgg 2982661 gcgcgcgggt aacgggaaac acctcgccac acaacgccac cacgtggcta cccatgaccg 2982721 cgctctcagc gatcttgtct ttcttgacgt agtggaagta tttcggtgtg tcgctgccgg 2982781 tcccgtcgtc gacgcgttcg tcggcgtcgg tacgttcaat cgtctgggtc tgcatacctg 2982841 acattgtgcc cttggcagga aagctctcga agccggagtg cactgcatgt gggacagtag 2982901 agtaatgaag cacggcttga ggctgggttt caatggccag ttcgacgact tcgacgactt 2982961 cgacgataag ggccggccgg tactgattag tgccgccgct ccctcgtatg aggtggagca 2983021 tcgcacacgg gtgcgtaagt acctgaccct gatggcattc cgggtccccg cgctcattct 2983081 ggccgccatc gcctacggcg cctggcacaa cggactgatc tcgctactga tcgtggcagc 2983141 ctcggtgccg ttgccatgga tggccgttct gatcgctaac gaccgaccgc cgcgccgcgc 2983201 cgacgaaccc cgccgcttcg acgtcgcccg ccggcgcatc ccgctgttcc cgaccgccga 2983261 acggcccgca ctcgagccgc ggcgacagcc ggcagagcgg tcagccccgc ggggattcgc 2983321 cgaccacggt tagccgtctg ttggccggcg ttccgggttg tcggccactg gccacacttc 2983381 tcaggacttt ctcaggtctt cggcagattc ctgcacgtca cagggcgtca gatcactgct 2983441 gggtgggaac tcaaagtccg gctttgtcgt taaaccccat gacagtgcaa gccgatcggg 2983501 aggtcgctat ggccgatgca cccacaaggg ccaccacaag ccgggttgac agcgatctgg 2983561 atgctcaaag ccccgcggca gacctcgtgc gcgtctatct gaacggcatc ggcaagacgg 2983621 cgttgctcaa cgccgccggt gaagtcgaac tggccaagcg catagaagcc gggttgtatg 2983681 ccgagcatct gctggaaacc cggaagcgcc tcggcgagaa ccgaaaacgc gacctggcgg 2983741 ccgtggtgcg tgatggcgag gcggcgcgcc gccacctgct ggaagcaaac ctgcggctgg 2983801 tggtatcgct ggccaagcgc tacacgggtc ggggcatgcc gttgctggac ctcatccagg 2983861 agggcaacct gggtctgatc cgagcgatgg agaagttcga ctacacaaag ggattcaagt 2983921 tctcaacgta tgccacgtgg tggatccgcc aggccatcac ccgcggaatg gccgaccaga 2983981 gccgcaccat ccgcctgccc gtacacctgg ttgagcaggt caacaagctg gcgcggatca 2984041 agcgggagat gcaccagcat ctgggtcgcg aagccaccga tgaggagctc gccgccgaat 2984101 ccggcattcc aatcgacaag atcaacgacc tgctggaaca cagtcgcgac ccggttagtc 2984161 tggatatgcc ggtcggctcc gaggaggagg cccctttggg cgatttcatc gaggacgccg 2984221 aagccatgtc cgcggagaac gcggtcatcg ccgaactgtt acacaccgac atccgcagcg 2984281 tgctggccac tctcgacgag cgtgagcacc aggtgatccg gctgcgcttc ggcctggatg 2984341 acggccaacc acgcaccctg gatcaaatcg gcaaactatt cgggctgtcc cgtgagcggg 2984401 ttcgtcagat cgagcgcgac gtgatgagta agctgcggca cggtgagcgg gcggatcggc 2984461 tgcggtcgta cgccagctga agctggacat cctgagccag gtagcagacg gtatgcccgc 2984521 cgcgccagcg gcgggcatac cgctgcggtg gggcggcggg caaccatttt cgcagctggc 2984581 caagtagact cagctgcaat ggagggtgct gaatgaacga gttggttgat accaccgaga 2984641 tgtacctgcg gaccatctac gacctcgagg aagagggcgt gacgccactg cgtgcccgga 2984701 tcgccgagcg gctcgaccag agcgggccga cggtcagcca gaccgtgtcc cggatggagc 2984761 gcgatgggct acttcgggtg gctggcgatc gccacctgga gctcaccgaa aagggccgcg 2984821 cgctggccat cgccgtgatg cgcaagcacc gcctcgccga acggctcctc gtcgatgtca 2984881 tcgggttgcc gtgggaagaa gttcacgccg aggcatgccg gtgggagcac gtgatgagcg 2984941 aggacgtcga gcgacggctg gtcaaggtgc tcaacaaccc gaccacgtcc ccgttcggca 2985001 acccgatccc gggcctggtg gaacttggcg tgggcccgga accgggcgcc gacgacgcca 2985061 acctggtccg gttgaccgag ttgccggccg gctcgccggt cgcagtcgtc gtccgccagc 2985121 ttaccgagca cgttcagggc gacatcgacc tgatcacgcg gctaaaagac gccggcgtgg 2985181 tgcccaacgc acgagtaacc gtcgaaacca ccccaggcgg cggcgtgacc atcgtcatcc 2985241 cgggccatga gaacgtcacc ctgccacacg agatggccca cgcggtcaag gtcgagaaag 2985301 tctgagctaa cccgcaccta ccctgcgcgt tgaccgaacg cacgtcgagg cggcagtcgt 2985361 attccgagtt gttcagcccg ttggtagccg gtgaccgcga tgtcacggat gtgctcaggt 2985421 cgcagaccag actgcagtgc cgtgtccagc atgcccgcca tccgatggcc cggctcacag 2985481 cacagcgcag cctgcagcga aacaccggcc agcggcccgt caccgcgggc ataggcgctg 2985541 aacgcgagca acaccagggc ctccacccgc cacggttcgg gcagcacccg cgccagtaac 2985601 gcccacaatg actcggccgc accagcattc tcgccgacgg caagggcata cagcatgtcg 2985661 cggacccgcg cgtcgcccag tgcgcaaccc agccgcgcca gctccgtgtc ggacaaggac 2985721 tgaccgtctg cgacccgggc cgcggcggcc agcgcatttt ccacatcctg gcggctgcag 2985781 ccgaccgaat cagcacggtg tgcgatctct cggtcagccg cttggtgtcc tagcgcaacg 2985841 gcaagctcgg cggagcgcac agggtcgtcc acggcgatga cggcctgcag gtcggagcgc 2985901 cgcgggtaga gctgcctgcc gtccagcacc gccgccatcg ccaacggcga cgccgacgga 2985961 tcgtcgataa cgccgctgca gccgcagccg tccacacaat gccagcgccc gccagcggct 2986021 acccggtcta ccacgtgcgc tgcccatagc acgatgtcgc gctgcgacaa cgccgccgcg 2986081 agcgccgcgc acagctgccg gtactcctca ttgcatcgcg gacactgggc tccgttcgcg 2986141 tcaacgatca ccgcgatcgc ggccgccggg ttcgccgcgg cgacaagttc tgcgagatgg 2986201 ccaacccgat cggcgagttc atcacagagg tcggcgcgca tcaccgaccc tagttccccc 2986261 gctgccaacg acaccagaac cagcgatttt tccggcacga agccgaggat ggccggtagc 2986321 gcggcgatca gtgttgcagg gcggttgagt tcaaattgtc ctcgatactt cgtcatgaat 2986381 gccacgctga ctaccggcac cgtcagccgg tgcccacgtc acgcgatcga gctgccttcc 2986441 tgtggacgaa ggcgtaactg tgcgttctac tgtcatttca tggggtcgat gcgtgaatac 2986501 gacatcgtgg tgatcgggtc aggcccgggc ggacagaaag ccgccatcgc ctcggcgaag 2986561 ctgggcaagt ccgtggccat cgtcgaacgc ggccgaatgc tcggcggcgt ctgcgtcaac 2986621 acaggcacga tcccatccaa aacgttgcgt gaggctgtgc tctacctcac cggcatgaac 2986681 caacgcgagc tgtacggcgc aagctaccgc gtgaaggacc ggatcacccc ggccgacctg 2986741 ttggcgcgga cccagcacgt gatcggcaag gaagtcgacg tggtgcgcaa ccagctgatg 2986801 cgtaaccgcg tcgatctgat cgtgggccat ggccggttca tcgacccgca caccatcctc 2986861 gtggaggacc aggcccgcag ggaaaagacc accgtcaccg gcgactacat catcatcgcc 2986921 actggcacca ggccggcacg gccatccgga gtcgaatttg acgaagaacg ggtgctcgac 2986981 tccgacggga tcctcgatct caaatcgctg ccatcctcga tggtcgtggt cggtgccggc 2987041 gtgatcggca tcgaatacgc ctccatgttc gctgcgttgg gcaccaaagt caccgtcgtg 2987101 gagaagcggg acaacatgct ggacttctgc gaccccgagg tcgtcgaggc gctgaaattc 2987161 cacctgcgcg acctggcggt gacattccgg ttcggcgagg aagtgaccgc ggtcgatgtc 2987221 ggctctgcgg gcaccgtgac caccctggcc agcggcaaac agattccagc cgagaccgta 2987281 atgtactcgg cgggacgtca gggacaaacc gaccacctcg acctgcacaa cgccggactc 2987341 gaggtgcagg gccgcgggcg gatcttcgta gacgaccgtt tccagaccaa ggtagaccac 2987401 atctacgccg tcggcgacgt cattggcttc cccgccttgg ccgcgacgtc gatggagcag 2987461 gggcggctgg ccgcctacca cgccttcggc gaaccaaccg acggaatcac cgaacttcag 2987521 ccgatcggta tttattcgat tcccgaggtg tcctacgtcg gcgccaccga ggtggaactg 2987581 accaagagct ccatcccata cgaggtggga gtggcccgct accgggagct ggcccgcggc 2987641 caaatcgccg gcgactccta cggcatgctc aagctgctgg tttccaccga ggatctcaag 2987701 ctgctcggcg tgcatatctt cggcaccagc gccaccgaga tggtgcacat cgggcaggcc 2987761 gtgatgggat gcgggggcag cgtcgagtac ctggtcgacg cggtgttcaa ctacccgacc 2987821 ttctcggagg cctacaagaa cgccgcactg gacgtgatga acaagatgcg cgcactcaac 2987881 cagttccgcc gctgagggtg ccgagcggat gtgaatccgt ctcggcgccc aagtaggctt 2987941 gccagcaaat tcgccgccgc ccacgaacgg tcggcgtcga acgtggcccc gcgcttttgg 2988001 cgttgtgcag cacagcggca gccagggttg gctgttcaat cattgctgtc cgctgatttg 2988061 agggacactg gttacggcac ctcggcgaca accccgagag gaggcaacac ccatggctcg 2988121 cgatcaaggc gcagacgaag cgcgagaata tgagccgggg caacccggca tgtacgagct 2988181 tgagttcccg gcgcctcagc tgtcgtcgtc cgacggccgt ggtccggtgt tggtgcacgc 2988241 tttggaaggt ttctccgacg ccggccatgc gatccggctg gccgccgccc acctcaaggc 2988301 ggccctggac acagagctgg tcgcgtcctt cgcgatcgat gaactactgg actaccgctc 2988361 gcggcggcca ttaatgactt tcaagaccga tcatttcacc cactccgatg atcctgagct 2988421 aagcctgtat gcgctgcgcg acagcatcgg caccccattt ctgctgctgg cgggtttgga 2988481 gccggacctg aagtgggagc ggttcatcac cgccgtccga ttgctggccg agcgcctggg 2988541 tgtacggcag accatcggcc tgggcaccgt cccgatggcc gttccgcaca cacgaccgat 2988601 cacgatgacc gctcattcca acaaccggga gctgatctcc gattttcaac cgtggatctc 2988661 cgaaatccag gtcccgggta gcgcttccaa cctactggaa taccggatgg cccagcacgg 2988721 tcatgaggtc gtcgggttca ccgtgcacgt cccgcactat ctcacgcaga ccgactatcc 2988781 cgcggccgcc caagcgctgc tcgaacaagt ggccaagacc ggttctctgc agctgccgct 2988841 ggccgcgcta gccgaagcag ccgcagaggt ccaggccaag atcgacgagc aggtccaggc 2988901 aagcgccgaa gtggctcaag tggtggcggc ccttgagcgc cagtacgatg ccttcatcga 2988961 cgctcaggag aacaggtcgt tgctaacgcg cgacgaagat ctaccgagcg gcgacgagct 2989021 cggtgccgag tttgagcggt tcctggctca gcaggccgag aagaagtccg acgacgaccc 2989081 gacctaacgc cgcgaaagcg gcccacaaaa cggccccagt cggcccgaca acaagattgg 2989141 cgaggatgac cgagcggaag cgaaatcttc ggccagtgcg cgacgtggca ccgcctacgc 2989201 tgcagttccg caccgtccac ggttatcggc gggcattccg gatcgccggt tccgggccgg 2989261 cgattctgct tatccacggg ataggtgaca attccaccac ctggaatggg gtgcacgcca 2989321 agctcgccca acgattcacc gtcatcgctc cggatctact gggccacggg caatccgaca 2989381 agccgcgtgc cgactattcg gttgcggctt acgccaacgg catgcgggac ctcctcagcg 2989441 tgctcgacat cgagcgggtg accatcgtgg gccattcgct cggcggcggg gtagcaatgc 2989501 aattcgccta ccagttccct cagctagtcg accgactgat cctggtcagc gcgggcggtg 2989561 tcaccaagga cgtcaacatc gtcttccggt tggcctcgtt gcccatgggc agcgaggcta 2989621 tggccttgct acggttgccg ctggtgctgc cggcagtgca aatcgccggg cggatcgtgg 2989681 gtaaggccat cggtaccacc agcttggggc acgacctgcc caatgtgctg cgcattttgg 2989741 acgacctgcc agagccgacg gcttctgcgg cgttcggccg caccctgcgg gcagtggtgg 2989801 actggcgggg gcagatggtc accatgctgg accgatgcta tttgaccgaa gccatcccgg 2989861 tacagatcat ctggggcaca aaggatgtcg tgctgccagt ccgtcacgct cacatggcgc 2989921 atgccgccat gccgggctcg caattggaga ttttcgaggg ctcgggacat ttcccgtttc 2989981 acgacgaccc tgcgcgcttc atcgacatcg tcgaacgctt catggacacc actgagcccg 2990041 ccgaatacga ccaggccgcg ctgcgcgcgt tgcttcgccg gggtggcggc gaagcaaccg 2990101 tcaccggctc ggcagacacc cgtgttgcag tactgaacgc catcgggtcc aacgaacgca 2990161 gcgctacctg atcaccaccg ggtctgttag ggctcttccc caggtcgtac agtcgggcca 2990221 tggccattga ggtttcggtg ttgcgggttt tcaccgattc agacgggaat ttcggtaatc 2990281 cgctgggggt gatcaacgcc agcaaggtcg aacaccgcga caggcagcag ctggcagccc 2990341 aatcgggcta cagcgaaacc atattcgtcg atcttcccag ccccggctca accaccgcac 2990401 acgccaccat ccatactccc cgcaccgaaa ttccgttcgc cggacacccg accgtgggag 2990461 cgtcctggtg gctgcgcgag agggggacgc caattaacac gctgcaggtg ccggccggca 2990521 tcgtccaggt gagctaccac ggtgatctca ccgccatcag cgcccgctcg gaatgggcac 2990581 ccgagttcgc catccacgac ctggattcac ttgatgcgct tgccgccgcc gaccccgccg 2990641 actttccgga cgacatcgcg cactacctct ggacctggac cgaccgctcc gctggctcgc 2990701 tgcgcgcccg catgtttgcc gccaacttgg gcgtcaccga agacgaagcg accggtgccg 2990761 cggccatccg gattaccgat tacctcagcc gtgacctcac catcacccag ggcaaaggat 2990821 cgttgatcca caccacctgg agtcccgagg gctgggttcg ggtagccggc cgagttgtca 2990881 gcgacggtgt ggcacaactc gactgacgta gagctcagcg ctgccgatgc aacacggcgg 2990941 caaggtgatc ctgcaggggt tgcccgaccg cgcgcatctg caacgagtac gaaagctcgt 2991001 cgccgtcgat gcggtaggaa cggtcaaggg cggtcacctc ttttgcggtc ggggccaatc 2991061 cgatcgaccc atccgcgcgt gtggacaatt cgagttcgat gacgtcaccg gtcaccgaat 2991121 aggttccaac ctcaatttcg gtgatgccgc ttggatgggc gagaacgagt tcaacgcagc 2991181 ccggtcggca aacgcggaga taccccgtct cggaatgcag cggcttcccg tcagctaccg 2991241 ccctggtctg ctgtgtgtac gtcagaaacg gtttacccac atgggcgaat acgacttcct 2991301 cgaggtattc gaacggccgg atggtggggt acttgcccgc accgcgaccc gcccaactcc 2991361 ccaggagggg tgacagcgcc tgcagggcag gggccagatc tcgggtcatc gcccgcttgc 2991421 gggggacagg catgcgggaa gcctagcgcc gcgagatcgg tcagctgtgg gctgataggt 2991481 tgcggtgcgc gcgaagcgcc tcaatctcgc gcgcgaaatc gtccgcggaa gaaaacgacc 2991541 ggtagaccga cgcgaaccgt aggtaggcca cctcgtcaag ctcgcgcaac gggcccagga 2991601 tagccaggcc gacatcgtga ctcggaatct ccggcgaccc cgcggcacgc accgaatcct 2991661 cgacttgctg agccagcagg ttcaacgcat cgtcgtcgac ctggcgtccc tggcacgccc 2991721 ggcgcacacc gctgatcacc ttttccctgc tgaagggttc ggtaacgcca ctgcgcttga 2991781 ctacggccag caccgcggtc tctacggtgg tgaatcgtcg tccacattcg gggcacgacc 2991841 tccggcgccg gatcgcctgg ccttcatcgg tttcccggga atcgatcacc cgcgaatcgg 2991901 gatgccggca gaacgggcaa tgcatggccg ctcctttgcc gtcttgacat ccgggtatca 2991961 cagacgactc cgagcgtacc tgtgtgctcc cgcgggtagc cactgcagtc acgactgatg 2992021 cgcatattgc gtcgcggtca cccagtaacg ttgacacaga acggttttcg cggacaccgg 2992081 gatggcctca gccaaccgga gcgatcagcg tctgacccac ggccaatgcc ggtgtctgca 2992141 ggccgttgag ttcacggatg cggtcggcaa cctggcgggt cggagcgttc ggcgccaccc 2992201 ggaccgccac gtcatgcagg gactcccccg tttccacccg taccacggca agcctgtcgg 2992261 gcacccgacc ggtcgaatcg gccgacccgt cggccgaacc gccggtgatc atctgcccga 2992321 actgcgccac caaaccaagc cagagagtaa tcgccgcggc aagcagagcc agccccaccg 2992381 tcgtggccgg cgggacgggc ctgctgccat gcccagtcct cgacatcccg accccggtgc 2992441 ggtggtagcg cagcggcgca ccccccggcc tcgatcggcc gggcctgcgc gattgcgccg 2992501 gctcagcgcg gcgccagcga ggtccatcga gcgggccccg cagattgagc ggatcggggg 2992561 tatgcggtgg ccggaccggt gtcatgttcg ctcctccaac tcagacggta atcgctcgcg 2992621 tgttcgacac tgtagtcact catgtgttcg atatccgaac atttgatcga agcgtgtcgc 2992681 acgcgcaaaa cggtagacca caccaccgac acgtttcggt tggagccgga cttccggcgc 2992741 gaaggcccag ccactcctcg tgccctcccg cgaccggaac acgcctgtcg aacacatgtt 2992801 tgattcttgg tgcgaatgcg actacattca ttgccatgaa cgacagcaac gacacctcgg 2992861 ttgccggcgg agccgctggt gcggacagcc gggtgctgtc cgcagattcg gcgctgaccg 2992921 agcggcaacg cactattctc gacgtcatcc gcgcgtcggt cactagccgc ggatatccgc 2992981 cgagcatccg ggaaatcggc gacgccgttg gtctgacgtc gacgtcttcg gtggcgcacc 2993041 agctgcgcac cctggagcgc aagggctacc tacgccgtga cccgaaccgc ccccgcgccg 2993101 tcaatgtgcg cggtgccgac gacgccgccc taccgccggt gaccgaagtg gccggctcgg 2993161 acgccttacc ggaacccacc tttgcccctg tcctgggacg tatcgcggcc ggcggcccga 2993221 tccttgccga ggaagccgtt gaagacgtct tcccgctgcc gcgtgagctg gttggcgagg 2993281 gcaccctgtt cctgctcaag gtgatcggtg actcgatggt cgaagccgcg atctgcgacg 2993341 gtgactgggt ggtggtgcga cagcagaacg tcgccgacaa cggcgacatc gttgcggcca 2993401 tgatcgacgg tgaggccacc gtcaagacgt tcaaacgcgc cggcggtcag gtgtggttga 2993461 tgccgcacaa cccggccttc gatcccatcc cgggcaacga cgcgacggtg ctgggcaagg 2993521 tcgtcacggt gatccgcaag gtctgatgct gatccgcgtg caggctgtca atccgcccta 2993581 atgaagccgt tgacttgtgc cacttcttca ctggcgaacc agagttcggc cagcgtgtcg 2993641 tggtatagcg cactgccggg tgggtaatac aggccgaagc tcacactggc tttgacggga 2993701 tagccgttcg gcatctggta tgggtcctcg agcggcagat gaattgccgg gcgcacgccc 2993761 gcactcggcg gttccggcgg ttctgcggcc gcgtgccgcc cgcctctgtc ttcagctgcg 2993821 gatacagccg ccgccggcaa cccagtgtca gcaagatcgg cagcgtggac atcgggcggc 2993881 accgcttcag gcgccactgc ttccggaaca aaggcttgcg gaacgaaagt ctccggaacc 2993941 actcgctcag gaacaatgag atcgggaccg acttcggaaa ggtcggcctg cgacacaacg 2994001 ggtgtcggcg tggtgtccac cgcatcggtg tcttcctcct cgaccccgac gttgctgggg 2994061 ccggacagca agtcgctgcc gtagccctcc tcacccggaa gatgctcggc atcaccgaca 2994121 gccgccccgg cgccgcgcgg ccagctgacc cgcggtgtgg acccggcgtc cggcgccaca 2994181 ggttccggcg ggaactggtc gccgaaaccg aaatgctctg aaccgaagtc ctcgtcgggc 2994241 ggccagtctc catcagcggc ggtaccgtac tcgacatccc cagcgcggtc atcgtcgtaa 2994301 gcagcggcgt cgtacccacg tcgacgccga cgcaatccga acaccaccaa cgccaccatc 2994361 acgaccagga gcacccccag ggcggcggcc cctaaccacc accaatgcca ggtgaacttc 2994421 ttgccgggtg ggggcatcgc cgaagtactt ggctggttct gccctgacac ctgcagaccg 2994481 gacagcaagg gcgctaggtt agcgggatcc gtagtgaatg tgttttttgc acggttccac 2994541 gaaatcatgc cgccggtgaa tttctgtgag acaacatcgc cgtcgacggt ctggtcgccg 2994601 accggggcgc cgagcttgcc gttggggccg cgcagcttgt cccacgcggc caccatggct 2994661 ccgcgcacga cgaacgcgcc gtggtccgga gtccagaaaa tcaccggctt gtcggccgcg 2994721 gagaacctga cgatccggct ggagggccca aaaccaccat cagtttcgtt ggcgatgggg 2994781 aaacccaagt cgctgctgac cggtccgccc agcgactcgt acttcgccag gatttcaccc 2994841 tcgacggcgt ttgcaccggt tgccgggctg aagaagacct tgccaccgac gaagtcctgg 2994901 gcgataccgt ctccgccgat cgggtactgc ccacccttct tggcgcccag cggacctgcg 2994961 gcaccacctg ctgcgcgcca ggccatgttg atcgccgcgg aaggatcgat cgctacctgc 2995021 aaacccttca gctgctcggc cagcaccgcc ggaacggtgg tgaactcctt ggttgcccgg 2995081 ttccaggaga cttcaccacc gctgaacttc tgggcggtga cctcgccgtc gtaggtttca 2995141 tccccgaccg gggcacccag cacgccaccc gagctgccga gcttgtccca cgcggcattc 2995201 agcgcgccgc gcacgacgaa cgcaccgtgt tcaggcgtcc agaaaatcac cgggttgtcg 2995261 gccgcggaga acgtgctcac gcgactgtcg ggtccggcaa ggccgggcac ctcgttgatg 2995321 gtcgggaatc ccagatcgct gtcggctgca ccgcccagcg actcgtattt gtccaggagc 2995381 gggccgtaga ggtatttggc accggtggcc ggggtgaaaa acatcttgcc gccggcgaag 2995441 tccagggcga acccgtcgcc tatcgggtaa acgtcacctt tccggacacc aagtgttgaa 2995501 gtgtcaccac ccgccttctc ccacgcggcc atcatggcgt cctcggcatc gcccatcggc 2995561 gaagccgcca ccgtgggcgc cagcaacacg gcggtcaccg ccgtggccgc caagccgagc 2995621 agcgtacgcc cgatcagcgt gctcaattga cctctctgcc cgttcaccaa gcctcccagc 2995681 cgatgccctg cctagcccgc cagccggtgg atctcccacc gtgggccggt ccccgctgcg 2995741 gtccgtattg tccccgggct cgcataacat tgctccagcg aacgacgatt gcgaagtcca 2995801 atcgcaaata ttacgaaaac ggatacccag ccgatgtcaa attgatgccg gggcacgctg 2995861 ctgtggtgag caaccgggct gcagcccggg ccgggtttgc gttaccgtgc cggaaacgac 2995921 aaccggactg atgcggtgag aggaatcccg gctgacatgg gtgcttccgg cctggtctgg 2995981 accctcacca tcgtcctgat cgccggcttg atgttggtcg actacgtcct ccacgtacgc 2996041 aagacccatg taccgacgtt acgtcaggcc gtcatccagt cggcgacctt cgtggggata 2996101 gcgatcctgt tcggcatcgc agtggtggtg ttcggcggct cagagctggc ggtcgaatat 2996161 ttcgcctgct acctgaccga cgaagccctg tcggtcgaca acctgttcgt atttctggtc 2996221 atcatcagca gcttcggggt gcctcgtctc gcgcaacaaa aggtgctgtt gttcggtatc 2996281 gcgtttgcgc tcgtcacgcg caccggattc atcttcgtcg gcgccgcgct catcgagaac 2996341 ttcaactcgg ccttttacct gttcggcctg gtcctactgg tcatggcggg caacctcgcc 2996401 agacccaccg ggctagaaag ccgcgacgcc gaaacgctca agaggtccgt cattatccgg 2996461 ctagccgacc gcttcttgcg gacctcacag gactacaacg gagaccggtt gttcacggtc 2996521 tcgaacaaca agcgaatgat gaccccgttg ttgctggtca tgatcgccgt gggtggcact 2996581 gacatactat ttgcgttcga ttcgattcca gcacttttcg gcctgaccca aaacgtctat 2996641 ctggtgttcg ccgccaccgc gttctcgctg ttgggcctgc gccagctgta cttcttgatc 2996701 gacggcctgc tggatcggct agtctatctg tcttacgggt tggccgtgat tcttggcttc 2996761 atcggcgtca aactgatgct ggaagcattg cacgacaaca agattccgtt catcaacggc 2996821 ggcaagccgg tcccgaccgt ggaggtgagc accacccagt cgttgacggt gatcatcatc 2996881 gtcctgctga tcacgaccgc ggcgtcgttc tggtcggcgc gcggacgggc gcagaacgcc 2996941 atggcgaggg cccggcggta tgcaaccgca tacctcgacc tgcactatga gaccgagtcg 2997001 gccgaacgcg acaagatctt taccgcactg ctggccgctg aacgccagat caacactctc 2997061 ccaacgaaat accgcatgca gcccggacag gacgacgacc tgatgacgct gctgtgcagg 2997121 gcccatgccg cgcgcgacgc gcacatgtga gcccgcgcta gctgagggct agctgcgcct 2997181 aaacacccaa gccacgaccg atgatctctt tcatgatctc ggtcgtgcca ccgtaaatcg 2997241 tctgtacccg cgaatcgaga taggcccggg cgactgggta ttcgcgcatg tagccgtacc 2997301 caccgtgcag ctgcagacag cggtcgttca gatacacctg cttctcggtg gcataccact 2997361 tggccatggc ggcctgctct gccgtcaact tccccgccag gtgcagctta atgaattcgt 2997421 cgaccatgat gcgcaccaca gtggcctcgg ttgccagctc ggccagcaag aatcggctgt 2997481 tctggaagct accgatcgac ctgccgaacg ccttgcgctc cttggcgtac tgcagtgtct 2997541 gctccagcac ggattccatc cccgcggccg ccatgatggc gatcgagatc cgttcttgcg 2997601 gcaggttctg catcaagtag atgaacccca tcccctcctg gccgagcagg ttttcggctg 2997661 gaaccgccac gtcggtgaag gacagctcgg cggtgtcctg ggcgtccaac ccgatcttgt 2997721 ccagctggcg gccgcgttcg aatccagcca tgccgcgttc gacgaccaac aaactgaacc 2997781 cttgcgcacc cttttcggga tccgtctgcg ccaccacgat cactaggtct gaattgatcc 2997841 cgttggtgat gaacgtcttt gacccgttta gcacgtaatg atcaccgtgt ttgacggcac 2997901 gggtggtgat accttgcagg tcactaccgg ttccgggctc ggtcatcgcg atcgcggtgg 2997961 tcaattcccc ggtgcagaag ttgggaaacc agcgccgctt ctgctcttcg gtggccagcg 2998021 ccagcaagta cggcgccacg atgtcgttgt gcaggccaaa accgatcccg ctgtaccgtc 2998081 cggcgcaggt ttcctcggtg atgaccgtgt tgtaccggaa gtccgcgtta cccccaccgc 2998141 catactcctc gggcaccgcc atgcccagaa atccctgctt gccggcctcc agccacacgc 2998201 cgcggtcgac gatcttggtc ttttcccatt catcgtgata ggccgcgacg tggcgatcga 2998261 ggaacgcccg gtaagactcg cgaaacaact catgttcggg ttcgaaaagt gtgcgctggt 2998321 acttggtggc actgcccatg gatgccctcc ggggaagaaa attctggtgc ccaacaatac 2998381 caaccgggcg gttggtcggc aggtagccgg ggcgcgccag ccgccgcgag cgtaacgcca 2998441 cggcgagctt gcgtgcaccg aattcgccgt ggcgttacgc tcgcggcgca aactcgcgca 2998501 aggtggcagc cagcgcctcc gggacacggg ccttgatccg ggtgccctcg ggcttgtgct 2998561 ccgcctgctg tatccgccca tcggcgtgca cacgggccac caggtcgccg cggtcgtacg 2998621 ggatcaccac gtcgacggcg gtgtcggcgg gcacaaccag ctcggccatc cgccgtcgga 2998681 gcgcatcgat accgtcgccg gtgcgggcgg aaacgaacac cgcgccgggc agcccgtgcc 2998741 gcagcttggc cagcatcagg tcgctagcga cgtcaacctt gttcactacc agcagctcgg 2998801 gcggcggatc gccgtcatgg tcggcgatca cctcggagat cacctgacgg accgcgtcga 2998861 tctgggctag cgggtggccg tcggatccgt ccacgacgtg gaccaataga tcggcgtgca 2998921 cgacctcctc cagcgtggag cgaaacgcct cgaccaactg ggtgggcagg tgccgcacaa 2998981 agccgacggt gtcggtgagc acgactggcc taccgtcacc gaactccgcg cgcctggtgg 2999041 tgggttccag ggtggcaaac agcgcgtcct gtaccagcac cccggccccg gtcagcgcgt 2999101 tgagcaggct ggacttaccc gcgttggtgt agccgacaat cgcgatcgac ggcacgtcac 2999161 tgtgccggcg acggctgcgc tgggtgtcgc ggacctgttt catggccctg atgtcgcgcc 2999221 gtaacttggc catccgctcg cggatgcggc gtcggtcagt ctcgatcttg gtctcaccgg 2999281 gaccgcgcag acccaccccg ccaccactgc caccggcgcg accgcccgcc tgccgtgaca 2999341 tcgactcacc ccagccgcgc agccgcggca gcatgtactc catctgagcc agcgacacct 2999401 gggctttgcc ctccctgctg gtggcatgct gggcaaagat gtcgaggatc agcgcggtgc 2999461 ggtcaataac cttaacctgc acagcctttt ccaaggcggt caactgcgcc ggcgacagtt 2999521 cgccgtcgca gatgacggtg tcggcgccgg tcgccacgat cacttcgcgg agttcggccg 2999581 ctttgcccga gccgatgtag gtcgacgggt cgggcttgtc gcgacgctgg atgagtcctt 2999641 cgagcacctg ggagccggcg gtttcggcca atgccgccag ctcggccagg cttgcccggt 2999701 tgtcagccgc gctgccctcg gtccacactc ccaccaacac cacccgctcc aggcgcagct 2999761 ggcggtactc cacctcggag acgtcggcaa gctcggtcga caacccggca acccggcgca 2999821 gcgccgatct gtcctcgagt gcgagctcac cgaggctcgg tgtgaagtcc gaaaggcccg 2999881 tctggggcgg atctggatat gtcatagcca gtacccgatg gtggcacgtg gcagctggcc 2999941 gcgcatctga atttgccggc ataagccgct gcctgggatc accccatcgc gttccaccaa 3000001 tcgtcagcga gatctccgcg ggccaccaac actgacggcc cacgcaggaa gctggtggca 3000061 tcggtgacgg taaccacgac ctctccgccc ggcacgtgca cggtgagcgt tccggtcggc 3000121 gagcccaccg ccgccaacgc ggcgaccgcg gccgcaaccg tcccggtgcc acacgagcgg 3000181 gtttccccca cgccgcgttc gtgaacccgc atccagaccg ccccgtcgac cggcgcggtg 3000241 agtacctcga cattgacccc gtcggggaac tgcgcaccat cgaaactcac cggcgcaccc 3000301 acgtccaatg ccgccaggcc gtcgacggtc agctgggaat ccacgcacgc cagatgcggg 3000361 ttacccacat cgacggccag gccgtgaaac cgcctgccac caacaacagc ctcccctgcg 3000421 cccaatctgt tggccttgcc catgtcgacg gagacgtcgg cgtaggccgc ctcgacgtgg 3000481 tggcaggtga ctggtcgcgg tccggccagt gaccctacga cgaactcgtc gcgaacctcc 3000541 aggccactgg cacgcaagta gtgcgcgaac actcgcacac cgttgccgca catctgggct 3000601 gccgacccgt cggcgttgcg gtaatccatg taccagtcgg tcacgcggac accctcgggc 3000661 aggctgtcca gcactcctac cgcctgggcg gctccggcgg tcgtaacccg caacaccccg 3000721 tcggcgccca gccccttccg ccggtcgcac aatgccgcca cccgggcagc ggtgagcacc 3000781 aactcggcgt cgacgtcagg cagcaacacg aagtcgttct gggtaccgtg gcccttcgcg 3000841 aagatcatct gcgccactcc tcaatcacca gatcaggtta cgtgccgcca taaccgcacg 3000901 gcgtcgtcga ccagtcgtgc gcggtccggt gagctggcca caccggcgtc gagccagtgc 3000961 acccggtggt ctcggcgaaa ccaggaccgc tgccgtcgca cgtagcggcg ggtgcccagg 3001021 tacgtctgct cccgcgcggc gcgcatcatg tcagctccag caccggcgtc cagagcggct 3001081 attacctgcg cgtagcccag cgcgcgtgac gcggtgaccc cctcgcgcag accattgcgg 3001141 agcagagtgc gtacctcttc aaccaggccc tgatcaaaca tcaggtcggt gcgacgggcc 3001201 aaccgctcgt cgagaatcgt tgtctgacag tccaacccga cgataaccgt gtcccaccgc 3001261 ggcgcaccga tgcgtggcgc ggacgcggca aatggctgcc cggtgagttc gaccacctcg 3001321 agcgcccgca ccgtgcgccg ggcatctgtg ggcaggattg ccgcggctgc agccgggtct 3001381 cggcgggcta actcggcgtg cagccgatcc accccgacct cggccagacg ccgctcccat 3001441 ctcgcgcgta ctgaaggatc ggttgcggga aacgaccagt cgtcgagcag ggattggaca 3001501 tacagcatcg agccgcccac cacgaccggc accgctcccc gggctgcgat cgcctcgatg 3001561 tccgccgcgg cggcccgctg gtagcgcgcc acggtcgcgg tttcggtgac atccaggaca 3001621 tcgagttgat gatgcgggat gccacggcgc tcgctgacgg gcagcttcgc cgtcccgatg 3001681 tccatgccgc gatacagctg catcgcgtcg gcgttcacga tctccacgct caccctggcg 3001741 ccgagccgcg cggcgacgtc gagcgccaac tgggacttgc cggcgcccgt cggtccgata 3001801 atcgccaacg gtctcacggc tgccagacac cggcgaaata ccccacgccg tgtggcgctc 3001861 ctcggtagaa ctccttggcc gatcgcggtc ctggctcggc caggccggcc agcacctgaa 3001921 atgccacccg cccaagcacc tgtgcgggca gccgggtcaa gacggcgagg tcgccgctgg 3001981 ccaacgcgtc gtcgagagcc cgctgcatac cggcgccgtc ggggtcatag ccgccgggag 3002041 cgcggggcgt cagggtgttc aggccgtcgg cgacgactag taccccgatc ggatcgggct 3002101 cccggtcgat gtcggctcgc agttgcctgc cacgtgccac cgcggcatcg gaaccgtggt 3002161 cgctggcata gacgtggacc tgtgccctgg cctcaggccg ggcctggccc cgtacccagg 3002221 cggtaagtag cgcacacagg ggtaattcca ccggaacggc aactccgtca ccgtcctgcg 3002281 gcgcgagccc gactcgcacg tcggcgccga agcccgcaaa ggtgccgacg tcggtggggc 3002341 gcacgacgtc gtcggcgcgc ccggttccga cagcaatcca gcttttcggc aacaaggagg 3002401 ccgccgcgat caccgcggcc cccaaatcgg ccagctcggc agcggcggct ccggccagtt 3002461 cgggaaccaa caccggcgcg gacggaacga tcccgatggc gctcaacaca acacaaagct 3002521 aacgccttgg cgggcggatt cggcctcgtg gagcaacggc tgcaaagaga ccgtgctgag 3002581 aggccgacac tgtcccgcgc tcatcggcca gagacggtca acgaaccgca agctggcccg 3002641 cggcccccaa ttcgccggcg gaaaccgtca tcatcgcgac ctcgtcacgg gccaaggcta 3002701 cggtcgcgac aaccaccacc actaccactg ccaccagcgc gaccagggcc acccggccgg 3002761 tgtgcagcac ctcatcgagc acggtgatcc ccagcaccga agcgatcacc ggcctcgcca 3002821 cggtgatcgt cggcaacgag gcggttagcg cgcccacccg caacgacgac tgctgaagca 3002881 tcagcccgat cggtagaacc aggatccagg catacaactc gggggtccgg atcagtgtcg 3002941 cgaacccctc gccgagctcc gtcacgaccc ctttggtcag cacggtgaat accgccaacg 3003001 ttgccgacga cgccaccgcc agcagcaccg cggacagcga acccgaggca atccgtgcac 3003061 caaccacaca aagcaccacc gccggaacaa ccacgacagc aaccaccgcc caggtcgaga 3003121 agggggcccg agtagtgccg gccgccgggt tgcccgacat gacgatgacg gccaccgcgc 3003181 cggccagcaa taccgcccac atccactccc tgggagtaca gcggtgatga gtcaaccgag 3003241 catcgatcag cagcgcgaac aacagtgcgg tggcctgcag cgactgcacc aacaccaccg 3003301 aacccatcgt cagcgcaatg gcctgcaggg tgaaactggc gactgcggcc aggctgccca 3003361 gccaccacag agcgtgacgc aaagagaggt ggaacaacgt caaatggccg acatattctt 3003421 cagcggtgac ctgtcgcgcg gaccgctgaa gtgtcacata cccgatcccg gccagcaacg 3003481 cggcgcccag cgccagaatg gtcgcgaatt cgacgctggc cataggtgac ctcccaccga 3003541 cattcggccc ggaagctgac tgatacatct cgatttaagc agttgttcaa tgatgatgaa 3003601 ctggcgccag acaaatatca caacaaaacg ttgcgcgcag acgcgtgctt cgtcatcggc 3003661 ttcggaattc tgcggcatat tcgctgcgcc gggattgatg aggaattgcc atcatggtgg 3003721 ctcggcccca agtgcggtcg gcgggtcggc cgtgcaattg accgttgctt acggccctca 3003781 gcgcttccac gggaggtgcg cgagtaacag ctcggttcgg ccccttacta ccggcggcag 3003841 ctggaccccg acctcgatca gctctacgga tggcggaaaa gcacagggcc atgacaccca 3003901 cgatcggcaa atatcgcgac gaacagtctg tcaagcggcc tcaatacttg cttcaatact 3003961 gttggaaacg gtggcaggac cgggtgaggg catcgggccg accacatcgg tgccgcttcg 3004021 cgcggcagat gcgcggcata cgcgcgaagg gttgcaaggt aggtgacaag cgcatgacgg 3004081 ccgacgagcc ccgcagcgac gattcgtccg ggtcggcccc ccaaccggct gccacgccgg 3004141 tgccccgccc gggaccgcgt cccggccccc ggccggtgcc gcgacccacc tcctacccgg 3004201 tgggtgcgca ccctcccagc gacccgcacc gtttcggccg tatcgacgac gacggcacgg 3004261 tgtggctggt cagtgcgagc ggcgagcgta tcgtcggctc ctggcaggcc ggcgatcccg 3004321 aagccgcgtt tgcccatttc ggcaggcgat tcgatgacct gagcaccgaa atcatgctga 3004381 tggacgagcg gttggcgtcc ggcaccggcg acgcacgcaa gatcaaagcc catgcgatcg 3004441 cgctggccga aacgttgccg acggcatgcg tgctgggcga tgtcgacgcg ctggcagacc 3004501 ggttgacaag cattcgtgat cgcgcggagg tcatcgctgc cgccgaccgc tccagacgcg 3004561 aggaacatcg agccgcccag accgcccgta aagaggcgct ggccgccgaa gccgaggagc 3004621 tggccgccaa cgcgacacaa tggaaggtcg ccggtgaccg gctgcgggca atcctcgatg 3004681 aatggaagac gattagcggt gtggaccgca aggtcgatga cgcgctgtgg aagcgctact 3004741 cgacggcccg cgatacgttc aaccggcggc gagggtccca cttcgccgaa ttggaccgtg 3004801 agcgatccgg cgtccggcaa agcaaggaac ggctttgtga acgggccgag gagttgtccg 3004861 agtcgacgga ctggaccgcc accagcgcgg agttccgcaa gctgctcgcc gactggaaag 3004921 cggcgggacg cgcgagcaag gatgtggacg acgccctgtg gcgtcgcttc aaggccgcgc 3004981 aggactcctt cttcacggct cgcaatgccg ccaccgccga gaaggaggcc gagttgcgag 3005041 ccaatgccga cgccaaggag gcgctgctgg ccgaagcgga gcggctcgac acgacaaacc 3005101 acgaggccgc tcgagcagcg ctgcggtcga tcgccgagaa gtgggacgcg atcggcaagg 3005161 tgtcgcggga gcgggccgcg gagctggagc ggcgactacg cgcggtcgag aaaaaggtgc 3005221 gagaagccgg cgaagcggat tggtccgacc cgcaggcgcg ggcccgcgcc gagcagttcc 3005281 gcgcccgggc cgagcagttt gaacaccagg ccgagaaggc agcagcggcc ggtcgcacca 3005341 aggaagccga cgaggcgaag gcgaacgccg aacaatggcg gcagtgggcc gaggcagccg 3005401 ccgacgcgtt gacccgacgc ccctaacggt cggtgccgcg gtcgggcgtt gtcccggcct 3005461 cggagtccgt ttgcacgtgg tccagcagcg tcttgcactg ttgttgcgcg acgacgcggc 3005521 gacgccgctc ctcggctgcg agctgcacga tggtgcgcga ccacaccacc tgggcccagt 3005581 ggaatgtcaa cacgatcgcg gtgatccagg cgacgatgag cccgataccg gggccgggat 3005641 gaccggcggc gaccgtctga cgcgaccata cggccagcag cccggtaccg ctggccatcg 3005701 ccgaacccgc cagcgccacc caagccagcg cccaccgccg ggtgagcaac gccagcatcg 3005761 agaagccaac gccgaacacc agcgccaacc aggcgaatac ccgcgagggc agcgcgacgg 3005821 cggccctgcc ggcgccgtgg ctgctgaaca acacatccca gccgcgcacg cttccggtat 3005881 gcggcaggat aaacgacccc aacagcacga ataccaggat tgcgacaacc aaagccctcg 3005941 cgcctggctc gatttcgcgc gcaacgcggc gttctgccgc ctcgatctca gcgcggaggg 3006001 cgtcgagatc cccggcgtcg tgttcgtggc tcatcatctg catcctccgg gcttggccgc 3006061 gctgaccggc agcccgaccc caggcatgcc caggccgacg gcgcgccccg gctgcccggc 3006121 ggtgtgcgcg tcgccggcgc gggtgcggcg gtgggtcagg acgccggcgt cggcgatgag 3006181 gtggtgcggc gccgcttcgg tgaccttcgt ggtgatgacg tcgccgggac gcacgcgcgg 3006241 ctggccggcg gtgaagtgca ccaggcgccc gtcgcgcgcc cgcccgctca tgcgcgccgt 3006301 gacggtgtcc ttgcgccctt ccccggtggc caccagcacc tcgacggcct gcccgaccag 3006361 ggcgcggttg gcttccagcg agatttgctc ctgcagcgcg atcaggcgtt catagcgttc 3006421 ctgcacaacg gctttcggca gctgtccgtc gagttgcgcg gccggtgtcc cgggccgctt 3006481 ggagtattgg aaggtaaatg cggccgcgaa gcgggcccgg cgcaccacgt cgagcgtggc 3006541 cgcgaagtcc tcttcggtct ccccggggaa accgacgatc agatcggtgg taatcgcggc 3006601 atgcgggatg gccgcccgca cgcgctcgat gatgccgagg tagcgctcgg cacgatagga 3006661 ccgccgcatc gcgcgcagga tccggtcgga tccggactgt agcggcatgt gcagcgcggg 3006721 gcagacgttg cgcgtctgcg ccatcgcctc gatgacgtcg tcggtgaatt cggccgggtg 3006781 tggggaggtg aaccggaccc gctccagccc gtcgatgtct ccgcaggccc gcagcaactc 3006841 ggcgaaagct ccccgattac ggggcaatgc ggggtcggcg aacgagacgc cgtaggcgtt 3006901 gacgttttgg ccgagcaggg tgacttcgag cacaccgtcg ttcaccaagg accgcacctc 3006961 ggccaggatg tctgccgggc tgcggtcgac ctccctaccc cgcagcgacg ggacgatgca 3007021 gaacgtgcag ctgttgttgc agcccaccga gatggaaacc cacgcggcat aggcagattc 3007081 gcgggagctg ggcagcgacg acgggaactg ttgcagcgcc tcggcgattt cgacctgggc 3007141 gaccttgttg tgccgggcgc gctccagcag cgtgggcaaa gacccgatgt tgtgggtgcc 3007201 gaagacaacg tctacccacg gcgccctgcg cagcacggcg tcgcggtctt tttgcgccag 3007261 gcagccaccg accgcgattt gcatgtcggg attggcgcgc ttgcgcgggg ccagatggct 3007321 gaggttgccg tacagcctgt tgtcggcgtt ctcgcggacg gcgcaggtgt tgaacaccac 3007381 gacgtcggcc tcggaaccgt cggtcgccct ccggtagccg gccgcttcca gcagacccgc 3007441 cagccgctcg gagtcgtgga cgttcatctg acagccgtag gtgcggacct gataggtgcg 3007501 cgctggcgct cgccgcacgg gcggcccggc gccctcgccg gtcacccccg cggcggcatc 3007561 gtgcgccacc atcgaagtca cggggccatg gtacggcggc tgggcggctc gcggcccagc 3007621 ggatggtgtc gcctcgtcgc agcatcgggc tagcggggac gcgctcgaca cggtggccga 3007681 tcacggcttc gctgcacacc ggctcgaaga agtcggccac gcgcatgagg tagtcgcgtc 3007741 gtcaccgaca ctatggctcg cttgcctcta aagcatcgct tatgccacag accagacttg 3007801 tcggagccgc tgtctagcat cggggaccgg gtgctcggcg cggacaaacg tcatgaaggg 3007861 aatcgataat gtcggatcgc tcagcgatcg aatggacggg ggcaacctgg aacccggtca 3007921 ccggatgcga ccgtgtatcg ccgggatgtg accactgcta cgcaatgacg ttagcgaagc 3007981 ggctaaaggc gatgggctcc gacaagtatc aaaccgatgg tgaccccaga acctccggtc 3008041 cgggatttgg cgtcaccatc catccccgca gtcttgacga gccgttccgg tggcgaagcc 3008101 cccgcacagt gttcgtgaac tcgatggcgg acctatttca cgccagggtg gcgctctggt 3008161 tcattaggga agtgttcgag gtgatgcgag ccacaccaca gcacacttac cagatcttga 3008221 ccaagcgcag cctgcgactg cgtcgcctcg ctcacaagct ggagtggccc tcgaacgttt 3008281 ggatgggggt gtcggtggaa aatgtcgacg ccttccgccg tatcgaggac ctacgacagg 3008341 tgcccgcagc agtaaggttc ctctcctgcg agccattact cgggcccctg gacggaataa 3008401 atctaggttc gattgattgg gttatcgccg gaggcgaatc tggtccaaat ttccgcccga 3008461 tcgatccaca atgggttcgc catattcgcg atacctgtac tgccgctgat gtcccattct 3008521 tcttcaagca atggggcggt agaacaccaa aggcatttgg acgtgaactc gacggacgtt 3008581 gttgggatga aatgccgctt attgagatta gaaacccgga tcctcggacc accagccgcg 3008641 tgcacgcgga tcccatgttg gcgacggcgc ccacagaatc tgcccagcgt tcgaatcctg 3008701 gacagctagt tcgccaacgc tgaataatcc catctcgcca cggtcctcgg actccttctg 3008761 ctgtttcgcg ctcttggctt gccgcatcat ctctggctcc ttttgagccg cccggttgta 3008821 caggtggcac atgatcgcat ctccggccca gtgatcggtc gcgaacacca tgtcaaagat 3008881 tgtgacctta ttgtgcatct gcatgggaat acgatgcgaa tacttgtatc ccagctcata 3008941 ctccagcttg acgcgcatga gattaaccat ctcggcacgg taggcaggcg cagttagatg 3009001 gtggcgccat cgcgctgcct gtatccgctt ccaatccgcg tctccgtaca tgcgggtgac 3009061 ctgctcgata aacagttccg cgttcgtgcc cttcacgccc cgcgcgatca tggtgggtga 3009121 catcaacatc catagttcgg tcttgaggtt acgagggttc tggcgaaagg cggcgacctt 3009181 attgatcgtt tcccaatgga cttcagcggc ctgttggtcg atgaaagcga aggtgggcgc 3009241 ccaccgccaa gggcctagtt cggcaagtgt ttcatcgatt gttacgttgg aatcgccggc 3009301 cacaacgcgg tacctaccgt caccgggaaa gcgggtccga agggcgacgt ccaattcaga 3009361 ggcaagcggg ttaagctcgc aaaaccggag ccgcgtgaaa ggtggatcgg ctttcatagc 3009421 gataagagaa gagccatcaa atttctctcc catgtcgcgg tctatgttct cgggctggcc 3009481 cgccatcaag tcgaggtaaa ttcgttcacg agaagtctga ctagccctgt tgaaggccgg 3009541 gaggtacccg gcaagtatct ccagtttgtt tcgcgtccaa tatgaccatt ctctagccat 3009601 cgaatccctt tagacgcgtc ggcgctcccg ctcggcggcc agctcggcga taaccacctc 3009661 gcacgccaag gtctggccgt acccacggcg cgccaacatc gccaccagcc tgcggctcac 3009721 ccgcgcttcg tcggtgccgt cgtcgatcag cacctcccgc cgcagcctgg cccgtaccag 3009781 cttttccgcc cgcccccgtt cggcaccggc gtcgatgccc ccgagcaccg tggtgatcac 3009841 gtcgtcgtcg acgcccttgg cgtgcagctc ggcagccaac gcgcgcttgc tctttgctgc 3009901 gttcgcccgc ctggactgaa cccattgttc ggcgaagtcg gtgtcatcca ccaggccaac 3009961 ggcggccagc cgatccaata cccggttgcc gatgtcttcg gggtagccgc gcttggccag 3010021 ctggccggct aactcggcgc gggtgcggga tcgcgcggtg agcaggcgca ggcacagtgc 3010081 ccgcgcctgc tcttcgcgct cagaagtcga cgggggcggg caggacaccg tcatttgagg 3010141 gatcatcggt caccacggca ccaatgccaa gcttttcctt gatcttcttc tcgatctcgt 3010201 cagccacgtc ggcgttctcc accaagaagt tgcgggcatt ctccttgccc tggccgagct 3010261 gctcgccctc gtaggtgaac caggcacccg acttgcggat gaggccctga tccacaccca 3010321 tgtcgatcag cgagccctcc ctgctgattc ccttgccgta gaggatgtcg aactcggcct 3010381 gcttgaaggg gggcgaacag ttgtgcacga caaccccttc ggcgacgagg gtgtgcagtt 3010441 cctcgacctc gaggtcgaac gttcgtgccc gccgcgttgg cagcacttct cggatcacgg 3010501 aatagcggag ttcttccgcc agcatgtcgt gcaggaattt gtcatccagg gcatccgcga 3010561 gcgcctgcac gcgatcccga cgaaggcggc tggcacctaa gacctgcttc attccaccgc 3010621 gggggtcccc ggaagctaca ccgatcatgg ccgcggcctc ctgcgcggtc acgccgcgct 3010681 cgtccagata attcagcacg gcatcggtca tctctgcagc cagatatgtc gcttgcgatc 3010741 cacgacgccg cccctgcgtg gcttctggaa tcgcctggat aagcgcggca ccgcgcggcc 3010801 cccacatggg aactgactcc gcgaatgccg tgacgttatc catacccgag atccggacct 3010861 cgaacacttg acgtttgctc tggatccgtc gaccgttgac gatgctcggc cgcttctggg 3010921 tcggatcgta atctcgaacg gtgctcccga caccgaaccg cagcagcagc caatgaatct 3010981 gatgcgcgag ttgttcagag gtcgtcgtgt aaccgacccg aagtgccccg gtctgttccc 3011041 ggctcaccca cccgtcgctt tcgaacaggc cgaagagcag attgccgaca atgtcggccg 3011101 cgatgtccgg ctcgaagaac caattcggaa tcgtcttctc ccacgcgagc ttgccgtaga 3011161 taccggcctg ctgacaaagg tctgccacac cgttgcgctc accgggtcga tgagcgatcg 3011221 cgagtgagat acgcccctgc ggatgggccg cgcaaccgag cgtcgcagcg attcgcgtca 3011281 cgtcgtcaat gagcgcccgc tgaacattga tgaagttgat cggagtcttg ccccccaccc 3011341 aaccatccct gccatctccg atcaggtagc caagcagccg ggcatgatcc gccggaatcg 3011401 gcgcactgtc accgaatcca tcgaagcgtc gcggttgcgc caccctgtct cccttgcgga 3011461 gttccccggc ggcacgccag ccgtactctg tcagcacctt gtgatcgggt gtcgcccaca 3011521 cgatggcgcc accggcgatc cgcaacccga tcacatcccg cgttccctgg tcgaaccagg 3011581 acaccacggg ccgcgcatgc agcgttccgt ccttggcagc agccacgaca tgaataggct 3011641 tgcgcccatc gacaacatcc tcgatgcgat gcgttgtacc ggtgaccgga tcgaagatcc 3011701 gagtgccctc tgcgaggcac ttgttcttga cgaccttgac ccgggtgcgg ttgccgaccg 3011761 cgttggtacc gtccttgagc gtctcgactc gccgcacgtc catgcgcacc gacgcgtaga 3011821 acttcaacgc ctttccgccc gttgtcgtct cgggcgaccc gaacatcact ccgatcttgt 3011881 cgcggagctg gttgatgaag atcgccgtgg tgcccgaatt attcagcgcg ccggtcattt 3011941 tccgcagcgc ctggctcatc agccgggcct gcagcccgac gtggctgtcg cccatctcgc 3012001 cttcgagctc cgcgcgcggc accagcgccg ccaccgagtc gatcaccacg atgtcaagcg 3012061 cacccgagcg gatcagcatg tcggcgatct cgagtgcctg ttccccggtg tccggctggc 3012121 tgaccagcag cgaatcggtg tcgacaccga gcttcttggc atagtccgga tccagcgcgt 3012181 gctcggcgtc gatgaacgcc gcaacaccac cggcggcctg agcgttggcc accgcgtgca 3012241 gcgccacggt ggtcttaccc gacgactccg ggccgtatat ctctatcacc cggccacgcg 3012301 gcaggccgcc aatgcccagg gccacgtcta gtgcgatgga tccggtcgga atgaccgaaa 3012361 tcggctgacg cgcctcgtcg ccgaggcgca tcaccgaacc tttgccgtaa ctcttctcga 3012421 tctgggccac tgccagctcg agcgcctttt cccgatcggg ggtctgcgtc atggtgcctc 3012481 tcctgtggtc ggtgttcgat tgaccggtat cggtcggttg gccgtgacac tagagacagc 3012541 cactgacaag tcggctgctc cgaatgatca ccacagtagc cgaacacctg ttcgattcaa 3012601 gtgtgacacg ccgcgtgtgg caacatcgcg tccgcgctcg tcggcgcgtc gaacgccctg 3012661 gcggcggtgc ggcccgactt gcgtgcgcgg ctggtccgga tcaccgacga tctgctcaac 3012721 accgctagcc tggccggatc cggcgtgctc accggcccgg atctgacctt tcggcgtcgc 3012781 agctgctgcc tgttctaccg ggtacccgcc ggaggcaagt gcggcgattg cccgctttga 3012841 cgaatgtgca acctcaccac cgatcgtggg gaacgtcgaa gtcggcgcac aatgcccgcc 3012901 agacgtcgcg gggctcgaca ccgtcctcga tggcctgggc ggcgctacgg ccgtcgaagc 3012961 cggtcagcac gtgatcgagc agcaccgacg agccataagc cgccccgaaa tgcagggcta 3013021 cccgctcgtg gaactccgtc agccgcacgc cagccaacat acccagcgcg ctaccccgcc 3013081 aacgcaagcg cgtcgtggca gacccgcacc ggatcggccg caccggcgac gctggcggcc 3013141 gcccgccgcg cggcctcccg gaacctcggc gacgacaaca cctcgttgac cgccgccacc 3013201 agcgcgtcgg cggtcaacgg ccggatcagc accgcgctac cctgccggac tacccggttg 3013261 gcgatctccc actgatcccc gccaccggga accaccacca tgggcacccc ggccagcagc 3013321 gtcttggcca ccatcccatg accaccgccg cagatcacca gatcggcccg cgtgagcagc 3013381 tcggcctggc tgcccagccc ggccaccgcc cagggcggca ccgtcaggtc ggctccgctc 3013441 aaacgcgaca ccaccaggcg cgatcccgac ggcaccgtct cacccggcgt cagagactgc 3013501 aacgcgacct ccgtcaatcc ggcggtcccg gtcaacgcgg tggacggcgc cacgaccacc 3013561 accggcccgg tgccggcggg gatggccagc acccgatcgg tcggctcgaa atgcagcggg 3013621 cccaccacga cggcctcggc cggccagtcc gggcggggaa cctcgagcgc gggcagcgtg 3013681 gcgatcagcc ggcgcagcgg cccgggatcg cgggccggca atccgatctc gacccgaacg 3013741 gcggcacgct ggcgcagccc ggcacgccag gaccgccccg tcagcgctcg catggtggca 3013801 tcgcgcagcc ggccgcggat accggtgcct gcagccagtc cgctgccgat cggcggcagt 3013861 cccttcgacg gcaggtacag cggatgcggg ttgagttcca cccacgggat ccctagcagt 3013921 tcggctgcca tgccgccgca cgccgtgatg acgtcggaca ccaccagctc cggttccaga 3013981 gcccgcagcc gcggcacgtt gagcacggcc atctgcgccg ctcgccgatg gatcctggcc 3014041 ccggcgtcga gatcgcggtc ggtggccgcc agcccgtcca gctcgacggc gtcaatgcca 3014101 gcggcgcggg cggcttccag ccattccacc ccggtgaaca gggtgggcgt gtcagcggct 3014161 gcgcggaaac gctggcacag cgcgatcgcc ggaaacgagt gcccgggatc cggcccggcg 3014221 accacggcga cgcgcatcgg ccctaccctg ccacagcgcc acagccgtag gctgacagcc 3014281 atggccgagc tgaccgaaac atcgccggaa acccccgaaa ccaccgaggc cattcgtgcc 3014341 gtcgaggcgt tcctcaacgc cctgcagaac gaagacttcg acaccgtcga cgccgcactg 3014401 ggcgacgacc tggtctatga gaacgtcggg ttttccagga tccgcggtgg ccgccgcacg 3014461 gcaacgctgc ttcgccgcat gcagggccgc gtcggcttcg aggtgaagat ccaccgcatc 3014521 ggcgccgacg gcgccgcggt gctcaccgaa cgcaccgacg cgctaatcat cggaccgctg 3014581 cgggtgcagt tctgggtctg cggcgtattc gaggtggacg atgggcggat caccctgtgg 3014641 cgggactact tcgatgtcta cgacatgttc aagggcctct tgcgaggcct ggtggcgctg 3014701 gtggtgccat cgctgaaggc aacgctgtag gccgaccttc cggatcaagc ccaacgcgct 3014761 gtagaacatc gggtagcgct acagccagcc ggctgcccgg gcttatcgct actctgcgcg 3014821 gcgggccagc aaagatgcga agtgtgggcg aaaccgcaaa tgcatcgcct cggccgctat 3014881 acgatcccca tgcacagtct tgagggtgag ctggcgattt tgggccgaca cgacgggctg 3014941 tggcgtgttt ggaggtctca gatgtcattt gtgatcgcgg caccggagtt ttaacggcgg 3015001 cagcaatgga cttggcgagc atcggctcga cagtgagcgc ggccagtgcc gccgcatcag 3015061 cccccacggt cgcgatcctg gccgcgggcg ccgatgaggt gtcgatagcc gtcgcggcgc 3015121 tgttcgggat gcatggccag gcatatcagg ccctcagcgt gcaggcatcg gcgtttcatc 3015181 agcaatttgt gcaggccttg accgcgggcg cgtactcgta tgcctccgct gaagccgccg 3015241 ccgtgacacc gcttcagcaa ctagtcgatg tgataaatgc gcccttcaga agcgcgctcg 3015301 gccgccccct gatcggcaac ggcgccaacg gtaaaccggg gaccggacaa gacggcgggg 3015361 ccggcgggct cttgtacggc agcggcggta acgggggatc agggctggcc ggctccggcc 3015421 agaagggcgg taacggagga gctgccggat tgtttggcaa cggcggggcc ggcggtgccg 3015481 gcgcgtccaa ccaagccggc aacggcggcg ccggcggaaa cggcggcgcc ggtgggctga 3015541 tctggggcac cgcggggacc ggtggcaacg gcgggttcac cacctttctt gatgccgctg 3015601 ggggtgccgg cggggccggc ggcgccggtg ggctgttcgg cgcgggcggg gccggcggcg 3015661 taggcggcgc cgccctcggc ggcggcgccc aggccgccgg tggcaacggc ggtgcgggcg 3015721 gggtcggtgg gctgttcggc gccggcggtg ccggcggcgc cggcggcttc ggcgacaccg 3015781 gtgggaccgg cggcgacggc ggcagcggcg ggctgtttgg cgtcggcggg gccggcgggc 3015841 acggtggctt cggcagtgct gccggcggcg acggcggcgc gggcggcgcc ggcggcacgg 3015901 tcttcggctc gggcggggcc ggcggtgcag gcggagtcgc cactgtcgct ggccacggtg 3015961 gtcacggcgg taatgccggc ctgctatacg gcaccggtgg ggccggcgga gccggcgggt 3016021 tcggagggtt cggcggcgac ggcggcgacg acggtatcgg cgggttggtc ggttctggcg 3016081 gcgccggcgg cagcggcggc accggtaccc taagtggtgg tcgcggcggg gccggcggta 3016141 acgccggcac gttctacggt tccggcggcg ccggcggcgc cggcggggag agcgacaacg 3016201 gcgacggcgg aaacggcggc gtgggcggca aggccgggtt ggtcggcgag ggcggcaacg 3016261 gcggcgacgg cggtgccacg atagcaggaa agggtggtag cggcggtaac ggcggcaacg 3016321 cctggctgac gggccagggc ggcaacggcg gcaacgccgc atttggcaaa gccgggactg 3016381 gcagcgtcgg cgtcggtggc gccggcgggc tgctggaggg ccagaacggc gagaacggat 3016441 tgctgcctag ctgagccagc ttagccgcag cttggcctca gccaccgggc gtgcggcggc 3016501 ccatcgaccg aggcacgtcg aaatcggtgc acaacgccca ccacgcgacg cggggctcaa 3016561 ccccgtcctc gatcgcgcag atggcattgc gcccaccgaa accggtcagc acatggtcca 3016621 accagcacca aaagcccggt acgccgcgcc gaatcgcggg ctgaccaact cgtggaactc 3016681 cgtccgccgt atgccgccaa ccgcgtgcgt cagcgccttg tggcacaccc gcaccggatc 3016741 ggctacctaa cccgcaccgg atcggctacc taacccgcgc cggccgctgc cttggcgcgg 3016801 cacccggtcg gccatccacc gcagatcccc gccaccgaac gcaccgaaac gccgaccttc 3016861 ggcccgcttc gtatccggtt ctgggcctgc ggcattttcg aggtacagcg ggcacgctat 3016921 ggcattacca cttcggcgtc caaggcgctg gtgcgcggtc tgaccgcgtc ggcgttctcg 3016981 tcgccgcggg ctaccctgta gcgaatgagc gacaacgcaa tccgcccgcg gcccaacccg 3017041 tggcagtaca tccgctattg ctacggggcg cggctgccgg actcgatgcg agactgggtg 3017101 cgcaacgatc tggccggcaa gggtgcggcc atccggatga tgatccgcgt cgcggttccg 3017161 gcggtgctgg tgctggcccc gttctggctg atcccgacgt cgctggacgt ccacttgagc 3017221 atgacgttgc cgattctcat cccgttcgtg tatttctcgc atgcgctgaa caaggtatgg 3017281 cgccggcaca tgctgcgcgt gcacaatctt gaccccgagc tcgtcgacga gcacgcccgc 3017341 caacgcgacg cccacattca ccgggcgtat atcgaacgct acgggccacg gccggacccg 3017401 aacgactaac gccggggcaa tccgccgagc tcgtcaaacg cctgcgccca agcgaccagg 3017461 cgatcggtgg cgccggccaa ctcctcgcgg tagcgctgtt gtccgggccc agccccaccc 3017521 gcgccgttcg ccgaggaaac caattgcgct gcggcggtga ccatttcgtt gtactgacgg 3017581 acgccggtgc tcagctgcgc ggtaaacgcg ttgatggtcg gcaccagata cgaccgcgac 3017641 gccgccgagc attgcacggc ccgctccatc gagaccacct cggccgcggt cgccaccatc 3017701 gccgccgaag tctggttggc cgcggccgtt aggtcgcgga tctcgtccgc cggcaacatg 3017761 gcgccccgct ccatgacacc caacagcgag aagaacccgc gttcggaggc gcccagcgcc 3017821 gacatcgcgg gtcgcgcggc cgagcccggt ggtggcagcc ggcgcacact tgcgggccgc 3017881 cgcaccggca gtggctccga gcgcagccag cggtagcgaa gcagcaatag cgtcgccgga 3017941 atggcctgcg tgaccgcaat cgtgccggta atcaccagca gcgacgtaaa ccagccccag 3018001 gccgccaaca gcgccgtcac caacccccag agcagacagc ctgcggtgaa taccagaccc 3018061 cagcgcaatg cacggcggcg gcggcgcagc agccgggcac gcggatcgat ggcgacgctg 3018121 atcttttgtg ctaccaggtc ggccaaatca ccggcggtat ccacgccgcg ctgcagcaac 3018181 gaacgccacg gccggcgctg acccgctttc actgccatgc cgaaccgtct gcccaactac 3018241 tgaccgtagg gctgctcggc aatagccccg ccagaagtct cggtggccgg tctgggggta 3018301 gccgtggtcc cgccggccgg caacgcttca ccgcgcatcg atgcgcggat ctgttccaac 3018361 cgtgaatgac cggccatctg gatcccggcc tgctccacct cgagcatccg gccctgcacc 3018421 gaactctcgg caagttcagc cgaaccgatc gcgttggcgt agcgacgctc gatcttgtcg 3018481 cgcacctcgt cgaggctcgg cgtgttgcct ggcgcggcga gctcactcat cgaccgcaac 3018541 gatgcgctga cctgctcctg catcttcgcc tgctcgagct ggctgagcag ctgggttcgc 3018601 tcggcgatct tctgctgcag caccatcgca tttcgttcga cggccttctt ggcctgagct 3018661 gcggcgctaa gcgcctggtc atgcagcgtc ttgaggtctt cgacgctctg ctcggcggtc 3018721 accagctggg ctgcgaacgc ctcggcggcg ttgttgtatt cggtggcctt ggcagcgtct 3018781 ccggcggcgg tggcctggtc ggccagcgtc agggcttggc gcacattgac ctgaagcttt 3018841 tcgatgtccg ccagctgtcg gttgagtcgc atctccaatt gacgctggtt accgatcact 3018901 tgcgccgcct gttgagtcag cgcttggtgg gtgcgctgtg cttcctcaat ggcctgttga 3018961 atctgcacct tggggtcggc atgctcgtcg atcttcgagc tgaacagcgc catgaggtac 3019021 ttccaggctt taacgaacgg attggccatc agttagctcc gccttcgctt cttgtgtgcg 3019081 ccagatggtc tcagcgccct gtcgctcaat ttatcgggtc agcgcgcatt gccccaccca 3019141 tggcgcgcat cttgtcgacc cggaccgacc ggcgaccctt aggccaccgc cagcgacacc 3019201 accggcgcaa tgacgacctt ggtgctggcg tcaatggtgg cgccggttgc tctgccagcc 3019261 ggggtggcgc gggcaaggcg ctcttgacgc gccatccgct cgcccgcatc gatgagcacc 3019321 accgacaacg ggagctgcag agccgtacaa atcgcactga gcagctcgct ggaaggctcc 3019381 ttgcgaccgc gctcgatctc cgacagatac ccgaggctca cccgcgccga atcggacacc 3019441 tcgcgcagcg tccgaccctg cgacatccgc gctccgcgca gcacgtcacc aacgacctca 3019501 cgcaccaaag ccgccatcaa aaactccttg tccacctcgc aatcgtcatc aggtgaacgc 3019561 cgccggcggt ggggttggtt cccgcaatca gctggcggtc tggcggatcc ccccgatgtc 3019621 ccgcagagcc ctggccacgt aatcgacacc agtgatcacg gtgagcagga tcgcggcggc 3019681 catcactacc accgccgcaa cgtgcagcgg acccgaaagt ggcaacacga ataagccaat 3019741 tgccaccgcc tggacaaagg tcttcagctt gccgccccag ctcgcgggaa tgacaccgcg 3019801 cctaataacc gccaacctca aaacggtcac tccgagttcg cgggtcagga ttagcaccgt 3019861 gacccaccac ggcaagtcgc cgagcatcga caatccgatc agcgccgagc cgatcagagt 3019921 cttgtccgcg atcggatcga caaacgcacc gaattcggtt gccatcccgt aattgcgagc 3019981 cagcaggccg tcgaatcgat cggtaatgca ggcggttgca aatatcgccc acgccactac 3020041 gcgggccgcg gagtggtggc cgccgccata gaacaaggcc agcaggaaga ccgggaccat 3020101 caccagccgc aacagcgtca ggatattggc gaggttggca atgcgggcgc ggcctgctat 3020161 ctgacccgtt tcaggctgcg ccgacacggc aacagaataa cgggttgacc tgctcatgcg 3020221 acccttgatg tcgatactgt ttcacacgtg accgagcgtc cacgggattg ccggccggtg 3020281 gtccggcgcg cgcgaacctc cgatgtgccc gcgatcaaac aactcgtcga cacctatgcc 3020341 ggaaagatct tgctggaaaa gaatctcgtg acactctatg aagcggttca ggaattctgg 3020401 gtggccgagc acccggacct ctatggcaaa gtcgtcggtt gcggtgcgtt gcacgtgttg 3020461 tggtcggatc tcggcgaaat ccgcaccgtc gctgtcgacc cggccatgac cggccacggt 3020521 atcggccacg caatcgtcga tcggctactg caggtcgccc gcgatctgca gctgcagcgc 3020581 gtgttcgtgt tgacctttga gaccgagttc ttcgcccggc acggattcac cgagatcgag 3020641 ggcaccccgg tcaccgccga ggtgttcgac gagatgtgcc gctcctatga catcggggtc 3020701 gccgaattcc tggacctgag ctacgtcaag cccaacatcc tcggcaactc ccggatgctg 3020761 ctggtgctgt agcccggcga gcagacgcaa aatcgcctca tttcggcacg aaatgggcga 3020821 ttttgcgtct gctcggcggg ctactcgccg ccgtcacccc ggatcgcggc cagtgtgccc 3020881 gccaactcgt cgggcttgac cagcacctca cgggccttcg agccttcgct gggcccgacg 3020941 atgccgcggg tctccatcag gtccatcaaa cggcccgctt tggcgaagcc gacccgcagc 3021001 ttgcgctgca gcatcgacgt cgacccgaac tggctggaca ccaccagttc cacggcctgc 3021061 aggaagacgt ccatgtcgtc gccgatgtcg gggtcgacgt cggtgcgctc cgcggtgggt 3021121 ttagccgtgg tgacgccctc ggtgtattcg ggttcggcct gttccttgca ggcggtgacg 3021181 acggcgtgga tctcttcgtc ggagacgtaa gcgccctgca gccggagggg tttgctcgca 3021241 cccatcggca agaacaggcc gtcgcccatg ccgatcagct tttccgcgcc cgcctggtcc 3021301 aggatcaccc ggctgtcggt cagcgacgag gtggcaaacg ccagccgcga cggcacgttg 3021361 gtcttgatca gcccggtgac cacgtccacc gacgggcgct gggtggccag caccaggtgg 3021421 atgccggcgg cgcgggcttt ctgggtgatc cgcacgatgg cgtcctcgac gtcacgcggc 3021481 gcggtcatca tgaggtcggc caactcgtcg acgatggcca ccacgtaggg gtagggccga 3021541 tactcgcgct ggctgcccag cggcgcggtg atggccccgg atcgcacctt gtcgttgaag 3021601 tcgtcgatgt ggcgcacccg ggaggcctgc atgtcctggt agcgctgctc catctcgtcg 3021661 accagccagg ccagcgcggc cgcggccttc ttcggctggg tgatgatcgg cgtgatcaga 3021721 tgcggaatgc cttcatacgg cgtcagttcc accatcttcg ggtcgatcag gatcatcctg 3021781 acctcttccg gggtggcccg ggtcaacagc gacaccagca tggagttgac gaagctggac 3021841 tttcccgagc ccgtcgagcc ggccaccagc aggtgcggca tcttggccag gttggccgag 3021901 atgaagtcgc cttcgatgtc cttgcccagc ccgatcacca acggatgatg gtcgcgacgg 3021961 gtctctcgtg cggtgagcac gtcggccaac cgcaccattt cccggtcggt gttgggtacc 3022021 tcgatgccga cggcggactt gccggggatc ggtgccagca tgcgcacgct ctcggtagcc 3022081 accgcgtagg cgatgttgcg ctgcagcgcg gtgatcttct cgaccttgac gccgggcccc 3022141 agttcgacct cgtagcgggt gacggtgggc ccgcgggtgc agcccgtgac ggccgcgtcg 3022201 accttgaact gggtcagcac ctcaccgatg gcgccggcca tgtgggtgtt ggccgcactg 3022261 cgtttcttgg gcggatcacc ggatatcagc aggtccagcg acggcagcgt gtagggaccc 3022321 tcgacgatcc ggtccagcac ttgggtatct ttgcggcggc cgcgtcttcc ggagccccga 3022381 ccggcggagg cttccggtat cgtcgcagtg tcatcctgcg gaacctcggc cgacggccag 3022441 gccggtggcc cgtcgtcgga gcacaggggc acctcgtcgt agtaaccgtc ggagaagtcc 3022501 tggcgggcga cttcgacggt gtccgcgtcg tcaccatcga agtccgcgaa gtcctcgaag 3022561 tcgtcggcgt attcccgtgg caacagccgg gtgccgaaca tggcgcgcat ggcatctggc 3022621 acctctcgga tcgtgatccc ggccagcagg agcaatccga acagcgcgcc gatgaataac 3022681 agcggcgcgg cgatccaggc ggtcaacccg tccgagagcg gcccgccgat cgcgaaaccg 3022741 atgaaccccg cggcgcgcaa acgcgactcc ggggcctcgg gtgagcccgc ccacaggtgg 3022801 cacaagccga gaaacgacaa gccgatcagg ctggcgccga ggatcagccg cggccgcgaa 3022861 tcggggttgg gcgacgtacg catcagcacc acggccacgg cggcggcaac cagcgggagc 3022921 atgaccactg ccgacccgat gaacgtccgc aacaaggcgt cgacccacgc gccgagcggc 3022981 cgggcggcgt cgaaccacga gctcgcggcg actaccacgg caaggccgag cagcaccagc 3023041 gcgattccgt cgcggcgatg cccgggctcg atgtcgcggg ctcgcccgat cgaccgcgcc 3023101 gcgccgccgg tgcccttggc cgccatcatc cagacggcac gcatggcccg gccgcaggcg 3023161 agtccggtag acaccagcag cgaccgatgg tgccgtcggg agggtctgcc gacccctttg 3023221 acgggcctcg acctctttct gggcacggcc gatcgcgcac ttcgggacgc gccccgcgaa 3023281 gtggcctttg acctgctcgt tcgagtgccg gagcgggcaa cggtcttgct agacataacg 3023341 gcaagcctag tcgctatcac accatctaca ccatccgcca cactggtaac ggcgatctgc 3023401 tcgcctcgtt gccagggtct cctgagtagg gtgacaagtg atcgtgccgc gtcacgccgc 3023461 ccgacgcgcg gagttccagg aggccccagc atgcccgtcg tcgtcgtcgc cacgctgacc 3023521 gccaagcctg aatcggtcga caccgtccgc gacatcctca cccgcgcggt cgatgacgtg 3023581 caccgcgaac ccggctgcca gttgtacgcg ctccacgaaa ccggcgagac cttcatcttc 3023641 gttgagcaat gggccgatgc cgaggcgctc aaggcccata gcggcgcccc cgcggttgcc 3023701 accatgttta ccgcggccgg cgagcacctg gtcggggcgc cggacatcaa actgctgcag 3023761 ccggttcccg ccggcgaccc gagcaaaggg cagctgcgcc ggtgatcgac cggccactcg 3023821 aaggcaaggt cgccttcatc accggcgccg cgcgcggctt gggccgcgca cacgcggttc 3023881 gactggcagc cgacggcgcg aacatcatcg cggttgacat ctgcgagcag atcgccagcg 3023941 tgccttatcc gttgagcacc gccgacgacc tggcggccac cgtcgagctc gtcgaggacg 3024001 ccggcggcgg gatcgtggcc agacagggcg acgttcgcga tcgcgcatca ctgtcggtcg 3024061 cattgcaggc gggccttgac gagttcggcc ggctcgacat cgtggtggcc aatgccggta 3024121 tcgcgatgat gcaggccggc gacgacggct ggcgcgacgt tatcgacgtc aacctcaccg 3024181 gcgtcttcca caccgtacag gtggcgatcc cgaccctgat cgagcagggc accggtgggt 3024241 cgatcgtgtt gatcagctcg gccgcgggac tggtcggcat cggcagcagt gatcccggat 3024301 cgcttggcta cgcggccgcc aagcacggcg tcgtcggcct gatgagggcg tacgcgaacc 3024361 atctggcacc gcaaaacatt cgggttaact cggtacatcc ttgcggggtc gatacgccga 3024421 tgatcaacaa tgagttcttc cagcagtggc taaccactgc tgacatggac gcgccgcaca 3024481 acctgggtaa cgcgctgccc gtcgagctgg tgcagccaac cgacatcgcc aacgcggtgg 3024541 catggctggc gtccgaggag gcgcgctatg tcaccggcgt caccttgccg gtcgacgcgg 3024601 gctttgtgaa caagaggtag ctgatggctc gaaatcccgc tgcgcagacc gccttcggcc 3024661 cgatggtgtt ggcggccgtg gagcaaaacg aaccacctgg ccgccgcctg gtggacgacg 3024721 acctcgcgga cttgttcttg cccagaccat tgcgatggct ggccggtgca acccggtcgg 3024781 cggtgttgcg tcgtttactc attagcgcct cggagtggtc cggccgcggg ttatgggcca 3024841 atctggcctg ccgtaaacgc ttcatcggag acaaactcga cgaagcgctc ggcgacatcg 3024901 acgcggttgt catcctcgga gccggattgg acacccgtgc ctaccggttg acgcgacgag 3024961 tgcggatgcc ggtattcgag gtcgacctgc cggtcaacat cgcccgcaag gccaagacgg 3025021 tccgacgggt gctcggtgaa ctgccgctgt cggttcgctt ggttgcattg gatttcgagc 3025081 atgacgacct gctcaccgct ctggccgagc acggctaccg taccgagtac cgggtgttct 3025141 tcgtctgcga aggtgtgacc caatacctca ccgagcgggc cgtccggcgg accttggagg 3025201 gcctacgcgc ggccgcaccg ggcagtcgaa tggtattcac ctacgtccgc cgggacttca 3025261 ttgacggcac caaccgttac ggtacccgga cgctatacca cacggttcgc cagcgacgtc 3025321 aactgtggca cttcggctta gatcccgagg aagtagccgg gtttctcgcc gactacggtt 3025381 ggcggctgac cgagcaggcc gggccggagg agcttgtcca gcgctacgtc gagcccaccg 3025441 gccgcaacct caacgcatca caaatcgagt ggtctgccta cgccgagaag agtgagccgg 3025501 ttacacctcg atgaccgtcg gcacaatcat cggctggcgg cgataggttt cccccaccca 3025561 cttgccgacc gtgcggcgca ccccttgagc gatccggatc ggatcggtga cgttggcggc 3025621 caccaacgat tccagctctg cctccacctt gcgcacggcg ggttcgagcg ccttgggatc 3025681 ttcggagaaa ccccgcgagt gtagatgtgg cgcagccaac ggctggccgg tgccacgtct 3025741 gaccacgacg gtcaccgcga caaagcccga cgacaaaatg agccgctcgc ccagggtgat 3025801 atcgccgacg tcgccggcga tcaagccgtc gacgaacatc ttgcccaccg gcaccgcacc 3025861 ggagatactg gctttgccgg caaccaggtc gacgctgaca ccgttctcgg ccaacagaat 3025921 tgactcttgc ggtacgccgg tactggcggc cagcttggca ttggcgcgca gcatccgcca 3025981 ggttccgtgc accggcatca cgttgcgcgg ccgcaccccg ttgtagagga acagcagctc 3026041 accggcgtac gcgtggccgg aaacatgcac ccttgcttgg gcgttggtga cgactctggc 3026101 gccgatcttg gacagtgcat cgatgactcc gaagaccgcc tcctcgttgc cggggatcag 3026161 cgacgacgac aacacgatga gatcaccagc agtcaacgtg atgctgcgat gctccccacg 3026221 cgacattcgc gacaacgccg acatcggctc gccttgggtg ccggtggtga tcaacacaac 3026281 ttggtcgggc gccatcgttt cggcggcggc gatgtcgatg agatcggaat cagccactcg 3026341 taggaagccc agttgccttg cgacgcgcat gttgcgcacc atcgatcggc cgacgaacga 3026401 cactcgccgg cccaatgcca ctgcggcatc gatgatctgc tgtacccgat ccacgttgga 3026461 ggcgaaacac gcaactatca cccgtccgtc ggcaccccgg atgagccggt gcagcgttgg 3026521 gcccacttcg ctttccgatg gcccgacacc ggggatctcg gcgttcgtcg agtcgcacag 3026581 caacaggtcc acgccggtgt cgccgagccg cgacatgccc ggtagatcgg tgggacggcc 3026641 gtccggtggc aattggtcga acttgatgtc gccggtgtgc aggatggttc ccgcgccggt 3026701 atacaccgcg atggccaacg cgtccggagt ggaatggttg acggcgaagt actcgcactc 3026761 aaacacgccg tgccgggtgc tctggccctc gcggacctcg acgaacaccg gtgttatgcg 3026821 gtactcacga catttctctg caaccagagc caaggtgaac ttcgagccga cgaccgggat 3026881 gtcgggtcgc agcttgagca gaaacggaat cgccccgatg tggtcctcgt gcccgtgggt 3026941 caacaccagc gcctcgatgt cgtcaagccg gtcttcgaca tggcgcatgt ccggcaggat 3027001 cagatcgaca ccgggctcgt cgtggccagg aaacaacaca ccgcagtcga taatcaacag 3027061 tcggcccagg tgttcgaaaa ccgtcatgtt gcggccgatt tcgttgatgc cgcccagcgc 3027121 ggtgacccgc aacccgccgg aggtcagggg acctggcggg ggaaggtcta catccacttc 3027181 tgggccaccc tttggctcac ctttagatca ccgaagcacc gaggccgcgc gcatgtcggc 3027241 ggccaacgcg tcgatctgct ccggtgtcgc ggccacctgg ggcagccggg gatcaccgac 3027301 gtcgatgccc tgcagccgca agcccgcctt ggacaacgtc accccaccca ggcggctcat 3027361 cgcgttgcac agcggggcga ccgcaatgtt gatcttgcgg gcggtggcga tatccccaga 3027421 accgaaggcg gacaacaact ctcgaagctg cccggctgcc aggtgggcaa tcacgctgat 3027481 gaagcccgtg gcgcccatgg ccagccaggg caggttgagc gcgtcgtcgc cggaatagta 3027541 ggccagtccg gtgtcggcca tgatttgggc gccgctgtgc aggtcggctt tggcgtcctt 3027601 gactccgacg atgttcggat gcgacgccaa cgcgcggatc gtgtcgggct cgatcggcac 3027661 cgccgaccgc cccgggatgt catagagcag catcggcagc tcggtcgcgt cggcgacggc 3027721 ggtgaaatgg gcttgcagcc cccgctgcgg cggcttggaa tagtagggcg tgaccaccag 3027781 cagcccgtgc gcaccctcgg ccgcacaagc cttggccagc cggatgctgt gcgcggtgtc 3027841 ataggtgccg gcaccggcga taacacgggc ccggtccccc accgcttcca agacggcccg 3027901 cagcagctcg attttctccc cgtcggtggt ggtcggcgac tcgccggtgg tgcccgagac 3027961 caccagaccg tcgcacccct gatcgaccag gtggttggcc agccgcgccg cggtggcggt 3028021 gtccagggag ccatcgccgc taaacggtgt caccatcgcg gtcagcaggg ttcctaggcg 3028081 cgctgcgacg tcgaatccga cggtggtcac ggctcccaag gttacctggc gctttatccc 3028141 ggccgcgagc gcgcgtgttt gtccagcgac acgccgcctc aggcttcggt cgccaacggg 3028201 ctggtcgcca cctcggtgcc gtcggccagg gtggtcacct cgaagtcggc gaacaccgcg 3028261 ggggccacgg cggcgagctg gcgcaggcat tcgatggcca gtcgccggat ttccacgtcg 3028321 gcgtgctcgc tggcccgcat tgcgatgaag tgccgccagg cccggtagtt gccggtcacc 3028381 acgatgcggg tttcggtggc gttgggcagc accgcgcggg cggcttggcg ggcctgcttg 3028441 cggcgcagga tcgcgttggg ttggtcggcg aacttggctt ccagcttggc cagcagctcg 3028501 ctgtaggtgg cgcgggcggc gtcggcggcc tcggtcagga tgtggcgcag gtcggcgtcg 3028561 tcctccatgc cgggcggcac gacgacccgc gagtccttct cgggtacgta gcgctgggag 3028621 agctgcgagt aggagaaatg ccggtggcgg atcagctcgt gggtgcacga tcgcgagatc 3028681 ccggtgatgt agaacgacac gctggcatgc tctagcaccg agaaatgtcc gacgtcgatg 3028741 atgtgccgga ggtagccggc gttggtggcg gtcttgggat tgggcttgga ccagctctga 3028801 tagcaggccc ggccggcgaa ctcgaccagc gcgggtccgc cgtcggcgtc ggtggtccag 3028861 ggcacgtcgg gtggggccaa gaagtcggtc ttggcgatca gttgcacgcg cagcggcgcg 3028921 gtctcggcca cggcgctcac cttagcgccg gccgcaacta gacgaactcg gtgtggcagg 3028981 tcagcccggg ctcccggcgc agacgcgggt ccgcggtcag cagggggatg tcgaggtgac 3029041 tggccagtgc cacgtagagg gcgtcgtaaa acgtgaagtt gtgccgcagg gtccacgccc 3029101 gtcgagcgtc cgcgttggcg ctggctccca gagttgaacc ccaccaaatc tgttgcctga 3029161 agaagccgat ctacctaacg gggatcgttg cccttgaagt cgcgaacaaa taggcaagtg 3029221 tccagcggcc agatcggacc cgcaacgaaa gttgcggtac caatcgccgc accgctcctg 3029281 ccgatggcta caccgggacc atcgtacgca gctgtgtcat gcataccggt caccccgaat 3029341 gacccgataa caggtaccgt tccagatccc cgcgacgccg caggaagatc atgtcctcgc 3029401 tgcagctcaa ggacttcccc gaaccgcaag gttttccatc catcactcat ctaagccgcc 3029461 ccagttgctc ccgcacgacc ctttccagcc gcgccgactc atcgaacgcc tccagcaacg 3029521 ccttcgacaa ccgggccatc ttctcgtcga tcggctctcc gtcgtcctcg accgcgggcg 3029581 tacccacata ccgccccggc gtgagcgcat agtcggtcgc cttgatctcc gccaacgtcg 3029641 ccgacttaca gaaccccgga acatcctcgt acataatccc tttgacggca gccgacttcg 3029701 acccgcgcca cgcgtggaag gtatccccga tgcggacgat ctcctcgttg gtcagcgccc 3029761 gctcggcccg gtccactagg tcgcccagtt cacgagcgtc gatgaacagc acctgcccgc 3029821 accggtcgat agacccttgc ttacctgccg ccttgtcttt ggcgaaaaac cacaggcaca 3029881 ccgggattcc ggtgctgcgg aacagctggg tgggtaacgc gaccatgcag gaaaccaaat 3029941 ccgcctccac gatctgcgcg cgaatatccc cctcgccgtt ggagttcgac gacatcgacc 3030001 cgttggccat caccacgccc gcccgacctc ccggcgccaa cttgtacagg atgtgctgaa 3030061 tccatgcgta gttggcgtta ttggcgggcg gaacaccgaa gcgccagcgt gggtcttcct 3030121 cgttgcgggc ccagtctttg atgttgaacg gcggattggc catcacgtag tccatctgca 3030181 cgtccgggtg ctggtcgcgg gcgaaggtat cactccatcg ggcgccgagc cccttgttgt 3030241 cgatgccgtg gatggcgagg ttcatcttcg ccatccgcca ggtctcctca atgctttcct 3030301 ggccatagat cgagacatcc ttcggatcgc cgtcgtgttc gtagatgaac ttctcggtct 3030361 gcacaaacat gcctccggaa ccgcagcacg ggtcatacac ccgcccactc gacggctcca 3030421 gcacctccac gatcaccttg accacgctgg gcggggtaaa gaactcgcca ccccgcttcc 3030481 cttccgcgcg agcgaaattg ccgaggaagt attcgtagac ctcacccatc agatcccggg 3030541 cgcggtgctc gccctgccgg ctgaagcgcg cactgttaaa taggtcgatc agctcaccga 3030601 gccggcgctg gtcgatgttg tccttgttat acagcctcgg cagcgtccca ccgagtgttg 3030661 gattggcctt cattaccgcg tccatcgcct cgtcgatcag ctgaccgatg ttcttcgccg 3030721 gctcaccacc aacggctggc ttgccttttg tgttctctgc caagaacttc cagcgcgcac 3030781 tcaccggcac cacgaatacg ccgtaaccct ggtactgctc gggatcgtcg atcaggtctt 3030841 ctatctgaga ctcctccatt ccttcggccg ccaactcggc acggattgcc tcgcgccgtt 3030901 cgtcatacgc gtcggacacg tacttaagga acaccaggcc gaggatcacg tccttgtatt 3030961 ggctggccga cagcgacccg cgcagcttgt cggcggcctt ccagagcgtg tctttgagct 3031021 ccttcatcgt cgacggcgcc tgcggcgcct gcttcttcct gggcggcatt cccgtttcct 3031081 tcctatcgat gcgccgcggc gatgccgggc gtggtgggcc agctcctcga caacacgaag 3031141 gtcgcatcgg gcgaatcacg ctgtccctgg ggccaccacc cattccacgg gttgccgtgt 3031201 gatggcggcg atgcgttcga agtcttggtc gtagtgcatg acgggtatgc cgtgatgctc 3031261 ggcgaccgcc gcaatgatca agtccgggat cttgaccgag cggtgaaatc ccttgtcggt 3031321 caatgcttct tggatctccc atgcacgaac ccacacggtg tcgggggtgt tgacgtattc 3031381 gagcgcgtca cgccggtagg tgcccagtgt tcgatggtcc tcgcgggaac gcgccgagac 3031441 tccgaactcg agatcggtaa tgccgcaccg ggccagtaga ccgcgttcca tcaacggttc 3031501 caagcgatgt cggaccgcgg gcaagtgcgc gcggtaagcc gctgatttgt cgagcaaata 3031561 gcgcgtggtc atgccgtgtt ctctgggtgg ccgtctcgcc acattgcgtt gaccagagct 3031621 tcgtcctggg ttccggtggc gttctcggcc atccggttca tgagcgagcg cgcggcactg 3031681 gctcgcaacg cggcccgcag cgcggcatgc acggtgtctt tctttgtcgt ggtacccagt 3031741 tccttggcgg cccgagcgag caggtcgtca tcgatgtcga tcatggtgcg cgtcacaccc 3031801 ggagagcata ctactaatgc atatccgcga tgcatataac ggatgtatct caggcggggc 3031861 tcaggtgcac gcgggccgga tatcggtatg cgtgaagtca tcgccacgaa acagcagcgg 3031921 ctccccggtg acctgggcca gggcgtagct gtaggtgtcg ccgaggttga gacgggccgg 3031981 atggccgctg ccgcggccgt agtcgcgata cgcctgcgcg gccacgcggg cttggtcggc 3032041 gtcgacggct tcgacctgga ttccgtagtc gtccagcaaa cggtccacca atcgagagat 3032101 ctccggccgg tcccgccgct gcatgatcgc gcacagttcg acgtagttgg gcgcggacat 3032161 tcgggagttc ggtgaccgct ccagcgcctc cttgagcacc tgcgcgcccg attccccgct 3032221 cacgatggcg acgatggccg acgtatcgac gatcaccggg gcagaccgct gtcatcgtag 3032281 aggtcgacct cgtgtcgccg aatcaggcgc ttgtcgtcgt cgctgagcag cttgtcgagg 3032341 tcgcgcaggg tctgttcggc ggcggcgcgc cgggcctccg cgcgtgccct gtcctcgcgg 3032401 tccaactccg agaggcggcg cgcgacggcg tcctcgacag cagccgtctg gttggtgccg 3032461 gtgcgtgcgg ccagttcccg caccagcgcc acggtgcgct ggctcttgat attgaggctc 3032521 atggtagaag gctaccggcc agcgggtaga ccatctatcc cggacatcaa cagcggaagc 3032581 agcgcatcgc ggcaggatgc caggcgtgcg gattcaatcc gccgctcgtt gcacagcgca 3032641 cccaggttcg cgattgcggc cgcgtgtccg ggagtcaacc ggcgcacatc gcgcacccaa 3032701 acccgcaaca gctgggtcgg ttggattcgt tgccggcttc ccgtcatgcc cccgactaac 3032761 tgccgcagtt ctgccaggac atcgggttgt cgcagcgccg cccacagggc cgaagtgtcg 3032821 acgccgactg gccgcagcac gacgaactcc gtactcgcca gcgccatttc cgacgggagg 3032881 ctggtgatgt tccagattcg cgggattctt ggattcagtt tcgggaacaa cacacacggc 3032941 tgcgacacga cgagctttgc gctcctgatc gttcgcccac cgacgcgact gggctgggcg 3033001 ccgccgtcga atgccgcgaa actgtaatgg gcgacggtgc tatcgaagtg ctgcgcatca 3033061 agacatgcgg ttgacctgct cgccaggctc gacaacggca cgtatgcaga gagccgcccg 3033121 acgatcgcaa gcatcaacgc ctcggcggct tcgatgacac ggtcgttggc ggcgatcttg 3033181 tcgtcgaagg cgcctaggat ctcgccgatt cgagggcggt cgggcgcggc gacggccgat 3033241 accgaaacgt tccgcagaac accctgactc agcaggggct gtcccgatcc ggcccgatat 3033301 cggttgagcc cgaaacccag tagcgcgtaa taccaatatc gggtttcctc gggcttcttg 3033361 gcccgacacg ccagcgcgtt gtcggtcacc cacacgtcgg aatcgcaata gcgcaggcta 3033421 ccgcagtacg agccgacgcg gccgacgacg atcagcgggc cacgcgcgtt gtgttgggcg 3033481 gaatatccga taaccccgtt tgcaccatag acgggatagc ggccgccggg ctcgctcgct 3033541 ggcgacgtat ggccagacgt atggccattc gagaagtcga gatggtcccc tagccttacc 3033601 ttttcgactt tctcgacgcg gctcatccgt tagtccgctt ggtggccgcg cacagttccc 3033661 cagccagatc accccgggtg gacacggcga tcccccccaa tcccagccac gacgccatcg 3033721 acgccagctc gccggccaac ccctcggcaa ccgtgaccgg cggtatgtcg agttcgccaa 3033781 gcacgcccgc cacgggcaag ctgtccgcgg cgcggtcggt tttgcggtcc acccgcgcaa 3033841 ccgggtgtcc gtcaaggggc cgcacgtaat acaagtgccg gcgtttggcc gccactgccg 3033901 ccgaatccag ctaccgcact tggattcgcg agatcagccg cttgcggtcg gcaccggcga 3033961 tcgggccggc cgacccgggt tcggcgacgc ctcctacgac gacggccgcc cggcggtccc 3034021 acgcagcagt cgcggcgctc atgagcgcgc gccgcgacga tgcagtgggg gtaccacccg 3034081 cttgcggggg acgaagcgat gaggagaagc ggcgctcatg agcggtggta gctgtacaac 3034141 cggtaccgca acccggaccg gctgaagcgc cactcccccg tctcgccccg ccatgtctcg 3034201 tccagcacgg gggccagcgc gtcaccggct tcgcgcggca ggccgatgtc gacctcggta 3034261 acctcacatc tggtcgcgta cggcagcgcc agcgcataga cttgtccgcc tccgatcacc 3034321 cacgtctccg ggctggtcag cgcctcctcg agtgaaccga caacctcagc cccgctggcc 3034381 ataaagtcag cttggcggct cagtacgaca tttcgccggc cgggcagcgg ccggacttta 3034441 gccggcagcg aatcccatgt gcgccggccc atcacgatcg tgtgccccat ggtgatctcc 3034501 cggaaatgcg cctggtcctc gggcaagcgc caggggatgt cgccgccgcg gccgatgaca 3034561 cccgatgtcg cttgagccca gatcagcccc accatcgtca cacgcgtcac tccttgattc 3034621 cggcttgaag gctgtccgag ccgacttcat tgtcgtcggc gcgcctcata ccgcgactgg 3034681 agctttgatc gccggatgcg gatcgtagtt cttcacaacg atgtcttcat aggtgtactc 3034741 gaagattgaa tcccggtcgg ctagaagtag tttcggatat ggccgcggct cgcggctgag 3034801 ctgcagccgt acttgctcga cgtgattgtc gtagatgtgg cagtcgccac cggtccagat 3034861 gaactcgccg accgacaagc cggcctgggc ggccatcatg tgggtgagca acgcatagct 3034921 ggcgatgttg aacggcacac ccagaaacag gtcggcgctg cgttggtaga gctgacagct 3034981 cagccggcca tcggcgacgt agaactggaa gaacgcatga cagggcggca gcgccatccg 3035041 ctcgatttcg ccgacgttcc aggccgacac gatgatgcgc cgggaatcgg gatcggtgcg 3035101 cagcaaatcc agcgccgcgc tgatctggtc gatgtgctca ccggatggag ccggccacga 3035161 tcgccattgt acaccgtaga tcggcccgag ttcgcctgta tcacttgccc attcgtccca 3035221 gatggtgact ccgtgctcgt gcagccaacc gatattggaa tcgccgcgca aaaaccacag 3035281 cagctcgtag gctaccgatt tgaaatggac tttcttggta gtgagcagcg ggaaaccggc 3035341 cgacaaatca tagcgcatct gctggccgaa caggctgcgg gttccggtgc cggtgcggtc 3035401 ggatttgggc gtacccgttt cgagcacgaa gcgcagcagg tcctcgtatg gcgtcacgat 3035461 tgacacgcgg tcagcctagc ggcgatcgca agcgcggcga agccgccgca gcgactcgcc 3035521 gccaaacaaa cccagcgggc gatcgcaagc gcggcgaagc cgggcacagc gagtcgacgg 3035581 gaatacaccc agatccgcgc cacaggagta caacggaggc catgccgaaa accaccgaca 3035641 ccgccgctac tcctgacggc acctgcgccg tgcgtctgtt cactcccgat ggtccgggcc 3035701 gctggcccgg tgtggtgatg tttcctgacg ccggcggcgt tcgggacacc ttcgaccgga 3035761 tggccgccaa gctagccgga ttcggttacg tggttctgct tcccgacgtg tactaccgcg 3035821 aaggcgactg ggctccattc gatatgaaga ccgcgttcgg cgatccgcaa gaacgcgcac 3035881 ggatcatgtt tatgattggc accctaacgc ccgaccgggt aacccgtgat gccgatgcgc 3035941 ttctcaacta cctggccagc cgcccggagg tgatcgggga ccgcttcggt gtctgcggct 3036001 actgcatggg cgggcgaatg tcggtggtgg tggccggccg cctgccggat cgtgtcgccg 3036061 ccgcggcagc tttccacccc ggcggtttgg tggccaacag cccggacagc ccgcacttgc 3036121 tggccgaccg gatcagcgcc accgtctaca tcggcggcgc ggagaacgac ccgtcgttca 3036181 ccgccgacca cgccgagaaa ctcgacaaag cgttcagcgc ggccggcgtg ccgcaccgca 3036241 tcgagtgcta cccggccgcc cacgggttcg cggtcccgga caatccgtct tatgacgccg 3036301 cagccgacga acgccattgg gcagcaatga cagagacctt cggcgcagcg ctcaactagc 3036361 cccgccaagc agacgcagaa tcgcattaat cgcgcccggt ttgtgcgatt ctgcgtctgc 3036421 ttggcagcac ctcaggcgcc gcgacgtcga tcccgatgat gattcagccg acgccggtcc 3036481 gcggtgcgcc ccgcgagcta cgcgtcgagt tgcgtccgcg gcagtgcgtg gacgcacttt 3036541 ccacggggca aaggcgcccc tacaccggcg cggtcaatgc tcagtgctgg gtgcggcccg 3036601 gaatcccagc gcgttgccga gcagtagacc gccgtcgatg atcatggttt cgccggtgat 3036661 ccagcttgcg gcatccgaaa ccaggaacgc gaccgcgctc gctatgtcgg ccggctcccc 3036721 gattcgtccg agcgcaatgg tcgccgccaa cggatcctcg tggtccttcc acagcgcctc 3036781 ggcaagcctg gtgcgaacca ccccgggaca gatcgcattc acccggatgc gcggtgaaag 3036841 ctccagcgcc agctgcttgg tgacgtggat cagcgcggct ttggtcgcgt tgtacatgcc 3036901 catggccggg gactggtgca tcccgccgat ggaggcggtg ttgaccaccg cgccgccgtg 3036961 ctcgcccatc cacgccgtca cgacgagcga ggtccacatc agcggtgccc acaggttgac 3037021 gtcgaagatc ttggcgaagc gggcgtggtc ctgctcgagc agcggaccgt aagccgggtt 3037081 ggttccggcg ttgttgatca ggatgtcaac gctgccgaag cgctcgaggg tgaggtccac 3037141 acaacgccgg gcggcatcct cgtcgaccgc gtgtgcacca acgcccaggg cgcggtcgcc 3037201 gacctgtgca gcagcctcgt cggcagcttc ctgcctgcgt gcggtgagca ccacatgggc 3037261 gccggcagct gccagctgtt gggcgatggc aagcccgatg cctcgcgatg cgccagtaat 3037321 tatggcggtg cggccggtca gatccagtga ggtcatttgg cttgccttcg gttgctgtgg 3037381 tggccggact ccgccggcgg ggagcgtcgg tagcgccccc gcaccgtatg cgacaagaat 3037441 gctagcgaaa tcaaacccca cgaaaccacc ggtagtggtg gtgctatcgc gattgccgta 3037501 gcctgcacaa cctcacgcca gacttgagcc actgcgacca tctgcggcgt gtcgcgtgcg 3037561 tggtttaagt gtcgcgaacg gcgaggcctt acagcctcat gattccgaat gattccgaac 3037621 ggtatccggc ttgaacgtgc cccagctgtg gcggattctg acatttctcg gccagcccgg 3037681 ccacgggcac cctcgtaacc aaccatttcg ccgctagcga gcccggcggg ggcggctgcg 3037741 acgccatggc tccggcggct tgattgacgg tccgggcggc gtcggttgcg gccccaccgt 3037801 cggttgccgc accggccacg cctggcgggt cgctgtgcgg gacatagccg gccggcccgg 3037861 tcgatgggcc acaggccaat cagacgacga cctgtttggg catgacgatg ggcttgaacc 3037921 cgtaccgagg cccggcatag gcaccggctc ctttggccgc cccagaaatc cccgccatac 3037981 ccggcattgc ggcgactggc ccggcttcct cggggaccgc ccagcccgag ccctccagtg 3038041 ccgtggtacc agacgtcatg gccggagcag cggtcgacca accggccggg accgacaggc 3038101 caccgaccga ggacgcctcg cccagactcg ccgtcagcga ggccccacca acgcccactg 3038161 gcgtcaccgt gtgcgccaaa ccagctgctg ccgcggcagc tggaacggca tcggcggctg 3038221 cggtcacggt tgccgggttt agggcagcaa aggcgtgtcc gaggaatacc gcgttgcgga 3038281 tggtggccat gacgaaccag gcggtggtgt tgaccgcgcc gttgatcgcg ttctgaacaa 3038341 acgtgatacc gagcagctcc tcaatgtcct gaatgattcc gcctaatccc gccgcgtcgg 3038401 ccgccgatgt caggggcgaa gcgaacccca ttaccgcgtt cggcaggttg ctgatcagcg 3038461 atcccagccc cacctgttgg accgtgctgg cggcagcggc atggctgacc gcggcggcct 3038521 gaccggccag cccggccatg ttggcggtct gggagggcgt gatcaacggg ttcaacctcc 3038581 ccgccgccgc cgaggaggcc gcgtaaccgt acatcgccag tgcgtcttga gcccacattt 3038641 cgccgtagtg agcctcggtc gccatgatcg ccggtgtgtt ttgacccagg acgttggtcg 3038701 ccaccagtgc cgcgagcaga gccctgttgg cagcgacctc cgccggcgga accgtcatgg 3038761 cgaacgccgc ctcaaaggcg gccgccgacg ccatggcctg tgcagccgca tgggccgccg 3038821 attcagcggt gtaggtcaac caggccaaat aaggctgggc agcaacgacc atcgacatcg 3038881 acgccggacc cagccactgt tcggtagtca actgcatgat caccgactcg acggacgatg 3038941 ctgtagtgct caactcgacg gccaggccgt tccacgtcgc cccggcggcc atcaggggtg 3039001 ctgcgccggc accggcgtac attcgtgtgg agttgatttc cgggggtaaa gctccaaaat 3039061 ccattttccc tatccctcta ttgatctcta ttgatcgaaa ttcgctactt ctcaagtgcg 3039121 ggcaaccgcg tcgaggccgc cccctatacc gccggcttgg gcatgacgat gggtttggcg 3039181 ccgtagcgtg gcgcaccgaa gcccgcgctg ctgcgcgtcg ccgaggccaa ccccggcatc 3039241 ccgggaatga ccgtccccgc ggcaccatgc ggtgccgcgg tggtccagcc agcgccctgc 3039301 agtgtgctgg tgctcgatac caggttggcc tgtcccgccc agctgggcgg caccgacaat 3039361 gcgccgattg acgacgcccg actaaggccg gcggctagcg gagccgcacc cagaccggcc 3039421 gcgatcggcg cctcgccgac ggccgcctcc gccgccccca gctccgatag gcccgcgccc 3039481 tccaagccct cctcgagggc ggcttcctcg gcagccggaa gaagaccacc gctggccagc 3039541 cctagcaagt ccgacgcggc ggaggcccag ttcccagccc caatgttgaa gatattggca 3039601 atatctgaaa tccaggaggg caccttcccg ggcgtggaac ccaagatgct cgcgataccc 3039661 gacaacggcg aagcggccgc ggatgagttg gcggcctcgg tggccgcata ggtgccagcg 3039721 ctgaccccca gggtcttcac aaacaggtcg tataccgcag ctgcttcagc actgacctgc 3039781 tggtagagag tgccgtacgc ggtgaacaac gacgcctgta gcactgatat ctcatcagcg 3039841 gcggcgggaa tcacgcccgt ggtggtcggg gcggccgcgg ccgcgttctg ggcgaccatc 3039901 gccgagccga tggtctcgag cttgccggcc gcagccgcca actcttcagg ctgtgtcgtc 3039961 aggaatgaca tcgattgctc ctcatatgac taagccagca gggctagaaa cctgtgaatt 3040021 atctgatcag tccctgccga atagctgatc aggtcctgtg tttagataag gctaacgatc 3040081 cacacctccg caagcccgat caaaaggcgc aagcgcagaa ttcatttacg gcttatttac 3040141 gccggcaccg gcagtcttaa cacgatcctt ttgagcgtgg cacctgaccg ctcgccgcag 3040201 cagcgaaatg aaacacgcgc cgcgggaggg ttagcgcaat gtggccgcgg cggcgcgctg 3040261 gtcggccgcg tgcgcttgtc tcggtgtctc cagatcagaa gaggccgtgc ttgggcataa 3040321 caatcggctt gactccgtac cgtggtccgg agtcggcacc aacactgttg gcggctacga 3040381 ccattccagg ggcaggcggc atcactgcga tcgggccgtc ctcctcggga actgcccagc 3040441 ctgtgccatc caaggccgcg ccggctgccg tcgccggcgc tgcagtagac cagcttgccg 3040501 gcaccgacag gcgaccaacc acggacgcat tgcccaaatc ggcggtcagc gctgttccgc 3040561 cgacgcccgc tggggcaacc gcgtgtgcca ccgcggctgc cgcgccgcca cctggagcgg 3040621 ctccgccaac ggttcccata gcatcggcaa gaagcgtcat attgccaatg gcggccgtgg 3040681 caaagtctgc cacgccaccc aggccgtgaa acgcggattc tacgaacagc ggaacatcga 3040741 gattgaggaa ctgcctgacc gcctccaacc cggtatcggc cgcggacatc accggggagg 3040801 cgaagctcag gacagcgtca gcgacgtcgc tgatcaggtg gctcagaccc acctggcgcg 3040861 cggaagcgga tgcgccggct tggccaacag cggcggcttg gtgtgcgagc ccggccggat 3040921 tggtgatgtg cgacggcctg gtcagcgggt tcagtcttgc ggcgaccgcg gatgcggccg 3040981 catagccgta catggccgaa gcgtcttggg cccacatttc gccatagcgt gcctcggtag 3041041 ccgcgatggc cgacacgttt tgcccaagga tgttggtcgc tgtcagttca gccaacaggg 3041101 ctctgttggc aaccacctcg gccgggggca ctgtcagcgc aaacgccgtt tcaaaggcgg 3041161 ccgcagacgc catggcctgt gccgccgcga gcgccgagga ttcagcggtg caggtcaacc 3041221 agaccaaata gggctgcacc gcggcggcca tcgacaacga tgcgggaccc atccagtgct 3041281 cggtgctcag ccgcgtgatg accgacccga cggaggacgc agctgtgctc acctcgacag 3041341 ctatgccgtt ccacgcagcc gcagccgcca gcaggtctgc cgcgcccgcg ccgccataca 3041401 tgcgcgcaga attgacctcc ggaggtagag ctccaaaatc cactgaggcg ttccgtttct 3041461 ggtcgagtgc agtggtggcc ggtgctccgt ctgaggcagc cattattcca tcaaggtcag 3041521 cgccagcgta ggcaccacgc tcgccacggc gtcgatggcg cccaaatcat cccattaact 3041581 gcgcagcgac ggttgctccg aggttccagc acgcctcgat atcggccttg ctcggcttgc 3041641 ccatcaccac tacagtctca gcggcttgca cccaacccag gccggttgtg atggcgtcga 3041701 cggctcgctc ggctccctcg gtgccctcgt tgccgtgaat gtacgcgccg aacgaacgcc 3041761 cacgggtggt gtccaggcag gggtaatagc agacatcgaa ggcatgcttg agagcaccac 3041821 tgatgtaccc cagattggct ggggtaccca gcagatagcc gtcagcctcc agcatctcga 3041881 tcggcgaaac cgtcagggcg ggtcgtctca ccacctcgac gccctcaatc tcgggatcgg 3041941 tcgcgccgga caccaccgcc tcaaacatct cctgcatgtg cggagacggc gtgtggtgca 3042001 cgatcagcaa gcgccgcacc gcaggaccct gtcactaaaa gtggggtaat cgaccaaagc 3042061 gtgcagaagc gctccggaca ggtagcccaa ggccggcaac gtggtcatct ggcccccggc 3042121 ctagcgcgcc cctctagctg tagggccgtc ttcatcgctt cccgcgcgcg gcgccgatcc 3042181 cccgcgtagt cgtaggcgcg cgccagtcgg taccagcggc gccagtcgtc ggcgtcgtct 3042241 tcgagctcgg tgcgcacggc agcgaacaac gcatcggccg cgtctcgctg aatgcggcca 3042301 gaagcccggc ggggcagcgc gctggcgtcg atgtccagtc cgtcttcggc gatcagacgg 3042361 gccagccgct gatacgcgaa tccggcccgc agcgtggcaa tcatggccca cagcccaatg 3042421 accggcagga tcagcagcgc cagccccagc ccggcagccg cggcgcggcc cgaaccgatc 3042481 attgcgacgg cgacacgccc gagcataacc aggtacgcca ccatcgccac gcacatgaac 3042541 gcgattatca actggacata cagggtgcgc ctggtcatca cagtgtcggt cagtgcagat 3042601 cgagtagggg ctcaagacct acggtgagac cagggcgttc ggcgatgcgg cgcaccgcca 3042661 acagcacacc gggcacaaac gatgtgcgat cgaggctatc gtggcggatg gtcagaatct 3042721 ccccctcggt cccgaacagc acctcctggt gggcgaccag tccggccagc cgcaccgcgt 3042781 gcaccggtat gccgtcgacg tcggcaccac gcgcgcccgg caggctggta ctggtggcat 3042841 cgggattggg cggcaagcct tttcgggcct cggcgatcag cttcgcggta cgcgcggccg 3042901 tgcctgacgg cgcttcagcc ttgtgcggat gatgcagctc aatgacctcg gccgagtcga 3042961 aaaaccgtgc ggcctgcttg gcgaaatgca tggacagcac cgctccgatc gcgaagtttg 3043021 gcgctatcaa caccgatgtg ttgggttttg cgacgagcca cgattcgact tgttgaaacc 3043081 gctcggcggt gaaccccgtg gtaccgacca cggcgtgaat tccgttgtcg atgaggaact 3043141 ccagattgcc catcaccacg tccgggtggg tgaagtcgat gacgacctcg gtgttaccgt 3043201 ccgttagcag gctcagcgga tcgccggcat ccagctcggc ggatagggtc aggtcgtcgg 3043261 cggccgccac cgcccgcacc atcgtcgctc cgaccttgcc tttggctcca aggacgccta 3043321 cccgcatggc cttcacccta gaccgggccg tcctcgaggc caacgaccgc ggctgcacca 3043381 aacccggcgt gcgccgtgag gcgcttgttg atcgagtgga ggtgaaagac ctgcacggta 3043441 gttctgtcgc agctgtctga accaccccat cggcagattc cgtgaagagc cagatacggt 3043501 gaaagtcgca cgtccggttc gaagggcggc cacgggaaac ggacccgcag caacgcgggc 3043561 accgcaccca tggtcgaccc aactgccacg cacccggtga ccggtgcgaa gtccaccata 3043621 tcgaccagtg ggcaaccggc acatcccacc acaggttggt cggaaacggc tggtgcacaa 3043681 cgaagctccc caacggccaa accgcaggga tcccgccacc ccacctcgac cgcggtgccc 3043741 acaccaaaca actacgcgct gaccgccgcg actgcgccca cgacctatct aggctttaat 3043801 gatccgaggc gtcagcagcg aaggtgctca tgtgaaaccc agcaatatca ggattcgtgc 3043861 agccaaaccg atcgatttcc cgaaggtggc ggcgatgcac tatccggttt ggcgacaatc 3043921 ctggaccgga atcctcgacc cgtacctact cgacatgatc ggttcgccga agctgtgggt 3043981 cgaggagtct tacccgcaaa gcctgaaacg cggcggctgg agtatgtgga tcgccgagtc 3044041 tggcggtcag ccaataggta tgacgatgtt cgggcccgac attgctcatc ctgatcgcat 3044101 tcaaatcgac gctttgtatg tagccgagaa cagtcaacgt cacggcattg gcgggcgcct 3044161 cctcaacagg gccctgcact cacatccgtc agccgacatg attttgtggt gcgccgagaa 3044221 gaacagcaag gcacgcggct tctacgagaa gaaggacttt cacattgacg gccgcacttt 3044281 cacgtggaaa ccactgtcag gtgtgaacgt gccccatgtg ggctaccggc tttatcgatc 3044341 cgccccgccc gggtaagcat caggcgtcga taaccacccg accgctcacg gcccgcgaca 3044401 cacagaccag catctcgtta tcgccttcga tgatgcggcc gcggcggtcg acctgcccgg 3044461 caaggactct caccttgcag gtcccgcaga agccctgctg gcaggagtat gccgtcgtcg 3044521 ggtcccagtc gagcatgacg tccagcgccg accggttcgc cggaactcgg agcactcgcc 3044581 tcgaccgtgc gagctccagc tcgaacggaa ctccgtcgac aaccggcggc gggctgaatc 3044641 gctcgtaatg cagcggcgcg tcggcgtgtt gattgcgggc cacgcgcacc gcttctaaca 3044701 tcccgggcgg cccgcacacg taaacggccg tcgtcggccc tgcgccggcc aacagttcat 3044761 cgacagacgc aaaacgaccg tgctcgtcgt cggcccacac cgtgacccgg ccgggtgcca 3044821 ccgccactac ctcgtccagg aacggcatgt actcccgacc gcgaccggca tagattgcgc 3044881 gccagtcgat tccgcgctgt tcggcggccc ggatcatcgg caggatgggc gtcaccccga 3044941 taccgccgat cacgaaaagc acgtcacgct cggccagacc gagatggaag gcgttgcggg 3045001 gaccttcgaa ctcgcacgtg tcacctacgt cgaaggcctc gtgcatctcg atcgaaccgc 3045061 cgccgccgtc cgcgattctg cgaatggcga tccggtagtc cgtacgccgt ccgggcacac 3045121 cgcacaacga gtactgtcgg cgccgccccg agggcagctg cacgtcgatg tgcccaccgg 3045181 gcgaccaggc cgggagcaat ccgccaccgg ggtcagccaa cgtcaacgcc accacgtcgg 3045241 gagcgaccag ctcgcgcttg gtaaccaccg cgggattcgt gcgccgcacc ggctgcaccc 3045301 gcgacggttc ccaccgcgag gccgcgccca atcctcccaa taacgctcgt acaccccaca 3045361 gcgctgtgaa gaagcggtcc cggctgcggc gaccgtaaag gtcggcgggc ctactggccc 3045421 agctggtctc tggcacggtg cgctccgcca ttcctacgga tcgtcaccga tcagtgcgac 3045481 gctcgcgcgg cgggcgagac ggccaggtag tccacggccg cccccagccc gcccagctgg 3045541 gacgggtgaa aacccggctt gtagtagtgc cccacgaccc gaagcagccg cggcagcccg 3045601 ggcaccaaac cacggcgtgc ggccttgaaa tagtcccgcc agcgcggctt tgtccccggt 3045661 ggcaggtacg gatccaccga atacatgaac cgcactccgc gaatccacag cagcaacatc 3045721 accggggtaa cggtcagctg ggcacgcacc tgccgccagt aaccggcgcg caagtgcttc 3045781 atggtgtcga aggccacggc tttgtgctcg acttcttctg caccgtgcca ccgcagcatg 3045841 tccagcatca cggggtctgc accgacggca tcgagctgcg gggaattcag gatccactcg 3045901 cccatgacgg cggtgtagtg ctcaattgcc gcgatgaacg aaacctgctc tagcaaccag 3045961 ctgtactgtc gtcgcgggct ccgccgagga ctctccccca gcagcttttc gaacagccac 3046021 ctgatctggt tggtaaacgc tgtcacgtcg acaccctggg catcgaagtg gtcaaccacg 3046081 ccggagtgcg cctgggaatg catcgcctcc tgaccgatga atccttgcac gtccagcctc 3046141 agttgatcgt ccttgatcag cggcagcgtc ttcttgaaga ccctgacgaa gaactcctcg 3046201 ccggccggca gcagcatatg cagaacgttg agaacgtggg tggccatcgg ctcgttgggc 3046261 acatagtgaa atggcaggtt tgtccagtcg aattcgacat ctcgcggctc gaggacgaga 3046321 cgttcgtggt cggcggcgcg cgactctgac gagtgcggac ccgtcgcccg gtcatcgacg 3046381 ctgaccattg ctgccccctc agaaaacgta gccacggcgt ttacataaat gcccgacatg 3046441 tcgccccagt agacatcacg tgttggcaag tatagttgcg cgtacccgag gggtgaagaa 3046501 cctgctcgcc agcctggcgc cgaatgcacc tcgacgttca ccgcgcctcg gcagccgacg 3046561 atgtcggctt caccgtgtcg aattcgtcgc ctcgccctcg tcggcctact cgcaactggt 3046621 ggcatcagta atgccattgc gcagcaacgc acttgctacg gcacgcgact cgccatcagt 3046681 gtcccgccag cactttcgct acccggcctc agctcggtcc ttgagacgct gcagggtgcg 3046741 tcgaatgtgc tcggtgttta cgctcgcccg atccttgacg ccggtggcca tccgggcaac 3046801 tgcgcggaac cagctgggcc gccggtccca ggtgctctcc gtaacccgac agccgtgttc 3046861 ggtagcgacg atgccatatt gccagcgtga aatcggaata atgccggacc gtacatcgaa 3046921 agcgaaaacc cgaccgggat cggcgtcggt aacggtgcac gtcgtggtcc agcgccgtcc 3046981 accgttttcg ttgcgaccga caaacaccgc tcccttgcga acatcgtcgc ctttgcgcaa 3047041 ctgcatcgcc accacttcct cggccagcga ggccagtgtc ggcagatcag tgatcagccc 3047101 gtataccagg tcgggattgg cgtcgatctc aacggtgacc gtcacagaag gcccatcagg 3047161 gtctggcatc ccgcgatcat agcccgctgg gcgggccgct ctagatgggc gccgccccgc 3047221 gcagatgctc gaagatcagg gacgtctggg tacctgcgac gtcggcgtcg gcattgaggt 3047281 tttcgaccac gaacgaacgc aggtcctcgg tgtcgcgagc ggcgacgtgc aagatgaaat 3047341 cgtcggcgcc ctgccgtttg cggcggatct gctggatgaa gctgcggatt ttcccgcgag 3047401 cggacgactg caagttgacc gagatcatcg cctgcaacgg caaacccacc gcgaccgggt 3047461 cgatgtcggt gtagaacccc cggatcacgc cgaggtccac caaccgccga acccggccgt 3047521 gacacgtcga cggcgctatc ccgacagtgt ccgctaacgc gttgttgggc attctggcat 3047581 cgccatgcag caagctcagg attctgcggt ccacctcatc aagttcagcg ggtcgaacat 3047641 ccttcgacga ggcagcccgg cgagtcttgt gttccgttga attatcacgt atatggcctc 3047701 gaaaaagaat tatcatcagc aatcttgcag attaatcgaa ctttcttcac actgaagcgt 3047761 acagtatcga gaggggtaat catgcgcgtc ggtattccga ccgagaccaa aaacaacgaa 3047821 ttccgggtgg ccatcacccc ggccggcgtc gcggaactaa cccgtcgtgg ccatgaggtg 3047881 ctcatccagg caggtgccgg agagggctcg gctatcaccg acgcggattt caaggcggca 3047941 ggcgcgcaac tggtcggcac cgccgaccag gtgtgggccg acgctgattt attgctcaag 3048001 gtcaaagaac cgatagcggc ggaatacggc cgcctgcgac acgggcgatc ttgttcacgt 3048061 tcttgcattt ggccgcgtca cgtgcttgca ccgatgcgtt gttggattcc ggcaccacgt 3048121 caattgccta cgagaccgtc cagaccgccg acggcgcact acccctgctt gccccgatga 3048181 gcgaagtcgc cggtcgactc gccgcccagg ttggcgctta ccacctgatg cgaacccaag 3048241 ggggccgcgg tgtgctgatg ggcggggtgc ccggcgtcga accggccgac gtcgtggtga 3048301 tcggcgccgg caccgccggc tacaacgcag cccgcatcgc caacggcatg ggcgcgaccg 3048361 ttacggttct agacatcaac atcgacaaac ttcggcaact cgacgccgag ttctgcggcc 3048421 ggatccacac tcgctactca tcggcctacg agctcgaggg tgccgtcaaa cgtgccgacc 3048481 tggtgattgg ggccgtcctg gtgccaggcg ccaaggcacc caaattagtc tcgaattcac 3048541 ttgtcgcgca tatgaaacca ggtgcggtac tggtggatat agccatcgac cagggcggct 3048601 gtttcgaagg ctcacgaccg accacctacg accacccgac gttcgccgtg cacgacacgc 3048661 tgttttactg cgtggcgaac atgcccgcct cggtgccgaa gacgtcgacc tacgcgctga 3048721 ccaacgcgac gatgccgtat gtgctcgagc ttgccgacca tggctggcgg gcggcgtgcc 3048781 ggtcgaatcc ggcactagcc aaaggtcttt cgacgcacga aggggcgtta ctgtccgaac 3048841 gggtggccac cgacctgggg gtgccgttca ccgagcccgc cagcgtgctg gcctgactct 3048901 cggccgctcg ttacgccgag cacacgtcgg gagtaaggga agcgatgatg tcggccgcgg 3048961 gtcccggccg ggtcttccgg tgcgccgatc ccgcccaaag gtttgttccg tgcgggtcgt 3049021 ccgcctgcac cgccgccgcc cgtatcggct tcgtcatctg gtggacctcc ggataaccca 3049081 gcggcgccac gtggtcgagc aggcgagtga agttgttggc cagaccgcgc gcatacctac 3049141 ccgagaacgc ccgagtgacc agggtggcat cgaactctgg attcttcagc gcggcacggt 3049201 gtgcggcatt ggtaccggct tcgtcggcca gcagcaatgc ggtaccaacc tgcgcggcga 3049261 tcgctccgcg gcgcagcacg gcggccacgt cctcagccgt gcccaggcca ccggctgcaa 3049321 ccagcggcac atcatgggcg ctgccaatcc gatcgaggag ttggtgcagc gactccgtac 3049381 cgggttccat gtccggcgcg aacgttccgc ggtgcccgcc ggcagccggg ccctggacca 3049441 ccaggctgtc cgcgcccgcg gcaatggcca caccggcctc gtagaccgac gtcacggtga 3049501 tcgagaccaa cagtcccagc gcgctcaacc gctgcacgac atccggcggc ggcgcgccga 3049561 aggtgaacga caccacctcc ggacggacat cggctaccac ctcgagtttg cgcacccagt 3049621 cgtcgtcgtc accatagacg ggctggccca cctcggtgtg gtagtactcg gcgacctctt 3049681 cgagctcgtc cgcgtaatac tccagctgcg cccagtcggc gacgctgggt tggggcacaa 3049741 acagattggc tccgatagga ccggtagtgg cggcgcgcgc agcggcgata tcgtcggcga 3049801 gccggtccgc gctcagatag ccgccggcga cgaaaccaag cccgccagcg ttggacaccg 3049861 ccgcggccaa cgccggggtg ctcgggccgc cggccatcgg ggcgccgacg atcggcaccg 3049921 cgatgtccca gaagcccaac accatcgggc taattcgccg acggcgagcg ccggcacggc 3049981 gcgagtgagg aagcggacat ttgagctacc ctaccatcgc tcgaagttgt tgcggcagtg 3050041 atcgtttcga tccgtgtggg ccaagaacgg cagcaccgta gcgcctgctc agcaggtggc 3050101 gggccaccgc gttgacctcc tccacggtga cctgctcgat ttgccgcaag gtgtgttcga 3050161 tgctgcggtg cttgccgtag ttcaactcgc tgcggccgag ccggctcatc cgggagctgg 3050221 aatcctccag ccctagcacc agcccacccc gcagcgatcc cttggcgatg ccgcattccg 3050281 cctcggtgat gccgtcgcgt gccacgcttt ccagcacatc ggcggtcacc cgcatcacgt 3050341 cggcgaagcg ttcgggcagg caggccgcgt acaccgaaag cgcgccgctg tcggcgaaga 3050401 gatccagcgc ggagtagacc gagtaggcca gcccgcgggt ctcgcggacc tcctggaaca 3050461 gccgggaact caagccaccg cccagcgcgg tgtgcagcac cgacagtgcc caacgatgct 3050521 cccagccgcg cccgggtgtg cggatgccca gcgacacatg cgtctgttcg gcgtcgcggc 3050581 taaccagtgt caaccggggg ctgccgttga cccggccggt acccttgcgc ggcgcaactg 3050641 gccgtctccc ccggaccaac cgggacccga agtgctcgcg gaccaacgca accagcccgt 3050701 cgtgatccac attgccggcg gccgcgacga ccatccgctc cggggtatag cgccgcaggt 3050761 gaaacgattg cagttgagcc cgcgtcatca ccgacacgga ttgcgcgctg ccgatcaccg 3050821 ggcgaccgac cgggtggtcg ccgaacaacg ccgccaggaa catgtccgcc aaggcgtcct 3050881 cggggtcgtc gtcgcgcatc gcgatctcct cgaggacgac gtcacgttcc acctcgacat 3050941 cgtcggcggc acagcggccg ttgagcacca catcggcgac caggtcgacg gccaacggca 3051001 agtcgctgcc gagcacgtgg gcgtagtagc aggtgtgctc cttggcggtg aatgcgttca 3051061 gttccccgcc caccgcgtcc atcgcctgcg caatgtccac ggcagagcgg gtgggcgtcg 3051121 acttgaacag caaatgctca aggaagtgcg ccgccccggc caccgtggcg ccttcgtcgc 3051181 gcgatccgac gccgacccac accccgaccg acgcggagtg caccgcgggc aggaattcgg 3051241 tgaccactcg cagcccgccc ggcagggtgg tgcgccgcgg cgccagcgcc gccgcggggt 3051301 cagctggtga ccgtcgcggc atcggtagcg gcggcggtgc tgtcctcgtc ggcgaccagg 3051361 atcagggaga tcttgccccg tttgtcgatg tcggcgatct ccacccgcag cttgtcaccg 3051421 acattgacaa cgtcctcgac cttcgcgatg cgcttgccct tgccgagttt ggaaatgtgc 3051481 accagaccgt cgcggccagg cagcaacgat acaaaggcac cgaaatcggt ggtcttgacc 3051541 acggttccga ggaaccgttc gcccaccgtc ggcagctgcg ggttggcgat ggcgttgatc 3051601 ttgtcgatcg cggcctgtgc cgatggcccg tcggtggcgc cgacgaacac ggtgccgtcg 3051661 tcttcgatgg agatctgcgc gccggtctcc tcggtgatgg cgttgatgac cttgcccttg 3051721 ggtccgatga cctccccgat cttgtccacc ggaaccttga tggtggtcac ccgcggggcg 3051781 tagggactca tttcgtcggg tctatcgatg gcctcagcca tcacctccaa gatcgtgagg 3051841 cgggcgtcct tggcctgctc gagtgctccg gcaagcacct gcgaagggat cccgtcgagc 3051901 ttggtgtcca gctgcagcgc ggtgacgaag tccttggtcc cggcgacctt gaagtccatg 3051961 tcaccgaacg cgtcttcggc gccgaggatg tcggtgaggg tgacgaagcg acgctccaca 3052021 acgccgtcga ccgccccttc tacttgaatg tcgtcggaga ccaggcccat cgcgatgccg 3052081 gccaccggcg ccttgagcgg caccccggcg ttgagcagcg ccagcgtcga cgcgcacacc 3052141 gaccccatcg aggtcgaccc gttggagccc agagcctccg acacctggcg aatggcatac 3052201 gggaattcct cgacgctcgg caacaccggc accagggccc gctcggccag tgcgccgtgc 3052261 ccgatctcac gccgcttggg cgaaccgacc cgaccggtct cgccggtgga gaacggcggg 3052321 aagttgtagt ggtgcatgta ccgcttcgat gtctccggcc ccaacgagtc gatctgctgg 3052381 gccatcttga tcatgtcgag tgtggtcaca cccaggatct gggtttcgcc gcgttcgaac 3052441 agcgcgctgc cgtgcgcgcg cggaaccacg gccacctcgg ccgacaatgc gcgaatgtcg 3052501 gtgatgccgc ggccgtcgat acggaaatgg tcggtgagga tgcgctgccg aaccagcttt 3052561 ttggtcaggg cacgcaacgc ggcgccgacc tccttttcgc gaccctcgta ggtgtcggcg 3052621 agccgctgca caacctgggt cttgatttcg tcgatgcgct ggtcgcgctc ggctttaccg 3052681 ccgatggtca acgcggcggc caactcgtcg gtggccaccg aggacaccga gtagtacacg 3052741 tcttcgccgt agtcagggaa caccgggaag tcgacggtcg gtttgcccga ctttccagcg 3052801 gcatcggcaa gctcctgctg cgcggtgcac agcgcggcga taaacggctt ggccgcctcc 3052861 aggcccgcgg ccaccacgct ttccgtcggc gcttgggcac caccttcgac gagctcgacg 3052921 acgttttcgg tggcctcggc ttcgaccatc atgatggcaa catcaccctc gacgatccgg 3052981 ccggccacga ccatgtcgaa cacggcgcgc tcgatctggt cgacggtggg gaagccgacc 3053041 caggtgccgt cgatgagcgc cacccgcaca ccgccgatgg gcccggagaa cggcagaccg 3053101 cccagctggg tggacgccga cgccgcgttg atcgccaata cgtcgtagag atcgcccgga 3053161 tccaggctga gaatcgtcac cacgatttgg atctcgttgc gcagcccgtc gacaaacgac 3053221 gggcgcagcg ggcggtcgat gagccggcag gtcaggatcg cgtcggtgga gggtcggccc 3053281 tcgcgacgga agaacgaacc ggggatgcgg ccggccgcat acatgcgctc ctcgacgtcg 3053341 accgtgaggg ggaagaagtc gaagtgttct ttggggttct tgctggcggt ggtcgccgac 3053401 agcagcatgt tgtcgtcgtc gaggtaggcg accaccgcgc cggcggcctg caaggccaat 3053461 cggccggtct cgaagcggat ggtccgggtg ccaaagctcc cgttgtcgat ggtggcggtc 3053521 gtctcgaaca cgccttcgtc aatttcagcg gcagacatga cgtccgtgcg gcctctctgg 3053581 attattgagc tgtttcgcgt cgtcacgcgc aatccagcgg gttcgccgaa ccccgagagc 3053641 ttcccagggg aaaaggtctg aatgcggcta cggccatcga tcgaagcggc cgacctgccc 3053701 cagatccgga gagcccggca gccactaccg aggaccgccc gatacaggcc gggggtgctc 3053761 ccttggatat gcatagtgac tcgctggaac ggcacacgcg gttctgcgcg taccgcacca 3053821 tttgctgggc cgaaccggcc cagaacgttc tcactctaca cgggcgaccg gcggcatttg 3053881 cgtagaactc gctttgccga gctaccccgc ctcagctccg cgggccgccg gtgacatcct 3053941 cgacgcacac cgcgaaccgt cgctgagtgt agacgtagcc cacgccgctg gcgcactggt 3054001 cgacgctgac gggagagtcg aggtctttca ggatctgggt ggcccgctgc cggtgcggca 3054061 ccgaggcgtc gtcgcagtcc acccggaacg ggtcggtgtt gtgggtaggg tcgacgctca 3054121 tacaaccgcc aatcacccaa tcgatgtcca ggcaaatggt gttggttgag ccgttgaacg 3054181 cattgcgcat cgaataggtg gagtcgacgt ccgccgggca ttccgcgtgg tcctcctgca 3054241 cgacggcaac gaccttgaag ttggacgccg ggctcccgca ctccgcctta gtggcctgcg 3054301 gccggtcggg cgtgccggcg agtttgacgc agtcccccac cttgagttcg gcgacgttgg 3054361 tcgctgacga acaccccgtc gccacgacga acaaggccgt ggtcgcggcc gcgagccagg 3054421 cgcgcatcga cgccgcgggt cagcgacgca ggcccagccg ctcgatgagt gaacgataac 3054481 gctccacatc gatctgggaa atgtacttga tcagccggcg ccgccggccc accagcaaca 3054541 gcagtcctcg ccgcgaatga tggtcgtgct tgtgcacctt gagatgctcg gtgaggtcgg 3054601 cgatgcgttt ggtcagcaac gcgatctgtg cttccgggga tccggtatcg gtctcatgca 3054661 ggccgtagga gcgcagaatc tccttttttt gctcggctgt cagcgccacg aaatgtctcc 3054721 atcaatgggt tcgcgatcat ggatatcagg gcacggccac cgcgaaccgc agcacgcacc 3054781 gatgtcgttg gacagtctag cagcgggttg accgccaaac acaaacgccg caggtgccag 3054841 ccgggggtca cgaccgcaag aaccgtcaac ccgtagacaa caggtcacgt gcccgctcgg 3054901 tatcggcacc catcgcagcg accagctggc gcaccgattc gaacttcttc tggccgcgga 3054961 tacgcccgac gaagtccaag gccacatgtt gaccgtagag gtcagcggtg gtgtccagca 3055021 cgaacgcttc gacggtgcgg gtgcgtccgg agaaggtggg attggtcccg accgacaccg 3055081 cggcctggta gcgctcaccc gggacgaccg tgccggtcac cggcccatgc ccgagcaccg 3055141 tgaaccaagc ggcgtacacg ccgtcggccg gaatcgccga atacatcggc ggcgccacgt 3055201 tcgcggtggg aaagcccagc tccgcgcccc gcccctcacc gcgtaccaca accccctcca 3055261 cgcggtgcgg tcggcccaga gcttccatgg ccgccaccat gtcgccggcg tccacgcagg 3055321 accggatgta ggtggaggag aacgtcacgg tctcgttgct gtggtgctcg gacaccaacg 3055381 acatcgattc caccgcgaac ccgaaccgct cgccagcccg acgcagcgtg tcgacattgc 3055441 cggcggcctt tttgccgaag gtgaagttct cgccgacgac gacctccacc acatgtaggt 3055501 gctcgacgag cagctcatgg atgaagcgat ccggcgtgag cttcatgaaa tcggtggtga 3055561 acggcatcac caggaacact tcgatgccca agtcttgaac gagctccgcg cgtcgggtca 3055621 gggtggtcag ctgcgccggg tgactgcctg gatagaccac ctccatcggg tgcgggtcga 3055681 acgtcatcag cacggccggt acaccgcgag cgcggccggc cttgaccgcg tgcgcgatca 3055741 gttcggcgtg cccgcggtgc acgccgtcaa ataccccgat ggtgagcacg catctgcccc 3055801 aatccgtcgg gatctcgtcc tggccacgcc agcgctgcac gatcgcaagc ctacggcgca 3055861 cggtggtcgg ccaggcgcca gattcaccgg tgggctctgg ccagcggccg atccgggaac 3055921 accatgcacg cggccgccgg acacctggcg cagcacgtcc gggaacgccg gcggcaccgt 3055981 ggccggtccc agagaattgg cgcgcgcata ccgaacgatt ggtctcaagc tttacgccga 3056041 ccattgatca ggtgatcagg gagtgggtct gatgagtacg tttagagaat gccgcagcat 3056101 gttcgatgcc gcggtgaaga gctaccagtc cggagacctg gccaatgccc gagcggcctt 3056161 tggccgcctc acagtcgaaa acccggacat gtccgatggc tggttggggc ttctggcctg 3056221 cggcgaccat catcttgata ccttggccgg tgcccatcaa cactccgaag cactgtacag 3056281 cgaaacccgc cgcgtcggcc tcacggacgg cgaattgtcc gccgtggtca tggccccgat 3056341 gtatctgggg ttgcgggtgt ggtcgcgcgc cacgatcggg ctcgcgtacg ccagcgctct 3056401 aatcatcgcc gaccgccacg atgaagcggc agcaacgctg gacgacccgg tcatcacgga 3056461 ggacaccggc gccgcccaat accgccagtt cgtcatggcg acgctgttcc acaaaactcg 3056521 ctcctggtcc aaccttttga aggtcaccga aatttctccg ccgagcgggg ccaccgatgt 3056581 ccgtgacgag gtggctgacg cggtggccgc gctggcctcg accgctgcgg cgagtctggg 3056641 ccaattccag ttcgcgttgg agctcgctga gcaagtctcg acaaccaatc cgcgggtgac 3056701 tgccgatgtg accctcacta gggcgtggtg cctgcgcgaa ctgggtgacg acgacgccgc 3056761 cagagtggca cttagcgcca cgaccaccgg tgatgccccc aggacaaaca ccaccgcgga 3056821 acaggctggt agcccccaac cgaagtttcg acatccttac gacgacggcc gggatctcct 3056881 ggtggctcgc cgccgcccgc cggccgggga cggttggcgc aaagcggtaa ccaaaatgac 3056941 tttcgggcgg gtgaatcccg aaccgagcgc caagcgcgag caaaccgacg agctgattca 3057001 gcgtatctgc gctccactgg ccgatgtcca taagttggcg ttcgtctctg ccaagggcgg 3057061 cgtaggtaag accacgatga cggtgctggt gggcaacgcc gtcgcccggc tgcgcggcga 3057121 tcgggtgatg gctgtggacg tcgatgccga cctgggcgac ctgtcagcaa ggttcagtga 3057181 gcgcggtggc ccgcagacca acatcgagca tttcgtgtca tcgcagcaca ccaagcgcta 3057241 cgcggacgtg cgtgtgcaca cggtgatgaa caaagaccgg ctggaaatgc ttggtgccca 3057301 gaatgatccg cgatcgacat acaagtttgg cccggaggac tatggggccg ccatgcagat 3057361 cctggaaacc cactgcaacg tcatactgct tgattgcggc acaccggtca acgggccatt 3057421 gttcagcaat atcctcaacg acgtcactgg tctggttgtg gtggcatccg aagacgtgcg 3057481 cggtgtcgag ggagcgttgg tcactctgga ctggctgggg gcgcatggct ttggccggtt 3057541 gcttcagcac actgtggttg ttctcaacgc aatccagaaa acccggtcac ttgtggattg 3057601 cggggccgcc gaaaaccagt tcaggaagcg cgttccggat ttctttcgga ttccctacga 3057661 cccgcatctg gccacgggtt tggcggtcga tttcagctct ctcaagcgaa ggacacgcaa 3057721 cgccgtgctg gatttggccg gcggcctggc acagcactat ccggctagcc gagtacggcc 3057781 ccgtggcgag gacagttgga aaacctggat cgaaacgatg cgtcaggtcg gatgacggtt 3057841 tggtcgagac cgagttggcg gccatttccc cgactgcgca ccgagcgcgc cgtcacgccg 3057901 gtatctagac tctctggttg tgagggctga cgaggagcct ggcgatctta gcgcggttgc 3057961 gcaggactat ctgaaggtca tctggaccgc ccaggagtgg tcgcaggaca aggtcagcac 3058021 caagatgctg gccgagagga tcggggtgtc ggccagcacg gcctcggagt ccattcgcaa 3058081 gctcgccgag cagggcttgg tcgaccacga gaagtacggc gcggtgacgt tgaccgattc 3058141 ggggcgacga gccgcgctgg caatggtgcg ccggcaccgg ctactggaga cattcctggt 3058201 caacgagctc ggctaccgct gggacgaggt gcacgacgag gccgaggtgc tcgagcacgc 3058261 ggtctcggat cgcttgatgg cccgcatcga cgccaagctg gggttcccgc agcgcgatcc 3058321 gcacggtgcc ccgatcccgg gcgccgacgg gcaagtgccc acgccaccgg ctcgtcagct 3058381 gtgggcgtgc cgcgacggcg acacagggac ggtggcccgt atctccgatg ccgacccgca 3058441 gatgctgcga tactttgcca gcatcgggat cagcctggac tcgcggctgc gggtgctggc 3058501 tcggcgcgag ttcgccggca tgatctcggt ggcaatcgac tcggccgacg gcgccaccgt 3058561 cgacttgggg agcccggccg cccaggcaat ctgggtggtg agctgacggc tttggcccgc 3058621 gagcgtaacg tggctgcgat tttcggcacg gattttcgca gtccggttac gctcgcgaag 3058681 ccggttcgcc cagcaggccc ttggcgatgt gggttacctg gacctcgttg ctgccggcgt 3058741 agatcatcag cgacttggca tcgcgagcca gctgctccac ccgatattcg gccatgtagc 3058801 cgttgccgcc gaacagctgg acggcctcca tcgcgacatc ggtggcggcc tccgaggaat 3058861 acagcttgat cgccgaggcc tcggccagcg tcagctgttt gccggctttg agccgctcga 3058921 tggcctgaaa taccatgttc tgcacgttga tccgcgcaac ttccattttc gccaacttca 3058981 actggatcag ttggaactgc ccgatgttac ggccccacag cgtgcgggtc tttgcgtaat 3059041 ccacacacag ccggtggcat tcgttgatga tacccaacga catgagcgcc acgccgaggc 3059101 gttcgacggc gaaattggcg cgggcgctgt cgcggccgtc cccctcggcg caaagcaggc 3059161 gatccggggt cagccgcacg ttgtcgaaga acaactcgcc ggtcggcgaa gacatcatgc 3059221 ccatcttctt gaacggcttg ccctgcgtca ggcccggcat gccggcatcg agcacaaaga 3059281 ccagcaccgg gcggttacgc caatctgagg cgggctcacc gtcggcgagc ttggcgtaga 3059341 ccaccaggac atcagcgtac ggcccgttgg tgatgaaggt cttgtgcccg ttgaggatgt 3059401 agtcttcacc gtcgcgggtc acgtgagtct tcatgccgcc gaacgcatcc gagccggagt 3059461 ctggctcggt aatggcccag gccgcgatct tttccagcgt caccagcgtg ggcacccagc 3059521 gctcctgttg ggccagggtg ccgcggctca tgatcgtcgc cgcgcccaac ccgaggctga 3059581 cggccaccgt gctcagcaat ccgatgctga ccccggccag ttcggacacc agcaccgcga 3059641 ccatcgaagc ctggtcagcc agcccgaaac tgcctgagct gtcccgcttt tcccgcttag 3059701 cccgctcccc atccagcatc tggttgaccg actcggcaag cagcacgtcc agaccgaact 3059761 ggctgaacag cttgcgcgcg atcggatacg gcgacagttc accggtttcc aatgcgtctt 3059821 ggtgcgggcg gatctccttg tcgatgaact ggcgaacggc gtcgcgcacc attagatcgg 3059881 tgtcggacca ctcgaacatc gcgtgctccc tccgatcgcg tggctcaacg ttcggcccgt 3059941 tggtatgcgg tgaccacggc ggcgccgccc agcccgatgt tgtgttgcag cgcggcggtc 3060001 acgttgtcga cctggcgcgc ctcggcggtg ccgcgcagct gccaggtcag ctccgcgcac 3060061 tgcgccaacc ccgtcgcacc cagcggatgg cccttggaga tcagcccacc ggatgggttg 3060121 acgacccagc gtccgccgta ggtggtctgg ttgtcgtcga tcagctcggg cgcctcgccc 3060181 ggcccgcaca ggccgagcgc ctcgtagagc agtagctcgt tggctgagaa gcagtcgtgc 3060241 agctcgatca ctccgaagtc cttcgggccg agtccggatt gctggtaaac ccgttgtgcc 3060301 gcttgcacag tcatgtcgta gccgatgata ttgcgggcac tgccatcaaa ggtggaagcg 3060361 aagtcggtgg tcatcgcctg cccgacgatt tccacagccc gcccggcaag gttgtggttg 3060421 gccaggtaat cctcactggc cagcaccacc gccgccgacc cgtcggaggt gggagagcac 3060481 tgcaatttgg tcagcgggtc ggaaatcatc tttgaggcca agatgtcgtc cagggtgtat 3060541 tcgtcctgaa actgtgcata cgggttgttg accgagtgct tgtggttctt gtagccgatc 3060601 ttcgcgaaat gctccgcggt ggtgccgtat ttcttcatgt gttcgcggcc ggccgccccg 3060661 aacatccacg gcgccaccgg aaagccgaac tcgtcgatct cggctaacgc cttgacgtgc 3060721 ctgcccagcg gcgactcccg gtcgtcggcg ccaccgccca gcgctccggg ctgcatcttc 3060781 tcgaagccca gcgccaacac gcaatcggcc agtccgccgc ggatggcctg cgcgccgagg 3060841 tagagcgccg tggatccggt cgagcagttg ttgttgacgt tgacgatggg gatacccgtc 3060901 atgccgagtt cgtagagcgc ccgctgaccc gacgtcgatt ctccgtagac gtagccgacg 3060961 tagccctgtt caacttcgcg gtagtcgatg ccggcgtcgc gcagcgcttt ggtgcccgac 3061021 tccctggcca tgtccgggta gtcccagcct tcgcgtcgcc cgggcttttc gaacttcgtc 3061081 atgcccacgc caatgacgta aaccttgttc gacgaccctt ggttaggcat cgttgccgtt 3061141 gcaagtgagt gatctttagt ggtcacgcga cttgcacccc gtctcggggt tgttcggcag 3061201 ccttgcggct gcttcccttc cgcgcttcac ggccaccagc ccggccaggc cgggtcttac 3061261 ggtcggctcc acgcttgacg gcggccccaa ctgggccgac gacgctactg gtgtcctcgt 3061321 agcgtgcgag gttgatcgct gcgcagtcat cacgctgatg cgatgccgag cacgaatcgc 3061381 attgccagtg ctcggcccat ccgatctctt ggacatgccc gcagacgtgg caggttttcg 3061441 acgatgggaa ccagcggtca gcgaccacta gttgtgaccc gtaccagcct gtcttgtagg 3061501 acaggtggcg gcgcggggtg cccagggccg cgtcggagag tccgcgccgg cgagcgcggg 3061561 cacccgagag gccctgttgc cgcagcatcc ctgccgcgtc caggccttcg acaacgatgc 3061621 ggccgtgggt cttagccaaa tgcgttgtca gacagtgcag gtggtgggtg cgaacatcgt 3061681 tgacccggcg gtgcagccgg gatatttcgg tggtgcgctc acggtagcga cgtgagcctt 3061741 tcgtgcagcg cgaccgtgcc cggcagacat gccgtagctc gttgagtgcc gcgtcgagtg 3061801 gccgtggatt cggcactcgt tcgagcaccg cgccgtcggc ggtggcgacc gtggccaggc 3061861 ggcgcacccc gacatcaacg ccgacccgtg aaccggggtc ggtcaccttc ggttgctgcg 3061921 ggcgctgcac gaggacccgc acactcgcat cgatccgggt cccgttacgg cgcaccgtga 3061981 tcgcgagcac ccgcgaccgg cctttggcga tgagccgctc aacccggcgc gtgttctcgt 3062041 gggtgcggac ggtcccgatg accggcagcg tgaggtggcg ccggtcgggc tcgacgcgca 3062101 tcgctccggt cgtgaacgtc acccggtctg ggtcgcgtcc cttcttctta aaccggggaa 3062161 agcccatcct tttgccatca cgtttgcctg atcgcgagtt ctgccagttc cagtacgcgt 3062221 cgaccgcgcc gtcaataccg tcggcgtagg cctccttcga gcactccggc caccacacaa 3062281 caccggtctc gatgttgacg cacacgtcgt tcttgacggt gttccagcgc ttccgcaaca 3062341 cccgcagcga cggctttgcc gtctggatcc cggtcgcctg ccaggcgtcg atatcggctt 3062401 tcagggtggc gacggtccag ttgtaggcct tgcggcgggc accgaaatgc cgtgccaacg 3062461 cgcgggcctg ctcggcggtc ggatcgagcg tgaaccggaa agcctgaacc atccagccct 3062521 caggaatctc gaatttcgcc atcaggcagc ctccgactct tcggcggcgg ccgccaatgc 3062581 gcgcttggcc cggttctgcg cagcgcgctt gccgtacagc cgggcgcaca tcgaggtcaa 3062641 gatctcggtc atgtcccgta ccaggtcgtc atcaacctcg gccgagtcga ccacgaccag 3062701 ctcgcggcct tgggcggcca gcgccgcttc gacgtactca gagccgaacc ggcagaaccg 3062761 gtctcggtgt tccaccacga tccgcttcac cgatgggtca cgcagcagcg caagaaactt 3062821 tcggcggtgc ccgttcagcg ccgaaccgac ctcggtcacg accttgtcga ccgcgatctg 3062881 ctcggccgtg gcccaggcgg tcacccgcgc cacctgccga tccaggtccg gcttctgatc 3062941 cgctgacgac actcgcgcat acacggccgt ccgcgcccgg cgggatctat cggccggctg 3063001 gtcgtccacg agaatcagcc gcccggcctt ccgcgccggc accggcaaca accccgcatg 3063061 aaaccagcga tacacagtca cccgcgcaac accgttgcgc tcagcccaca ccgccagatt 3063121 catactgttg ttcctacagc acgccactga caactaccga ccactcagac cgcaacagct 3063181 gacagcccct tccgaattga acagcggccc atcgccgtgc gacgtaggcc gtgtagccca 3063241 gtgtgccacc gttgccgtcc cggaccgcat ccccctacat tgaggccagg ctccaaccga 3063301 atcgcccggc tcctcctcac cccgctaccc ggggtgcatc gtcgccgggc ggagcaccgc 3063361 caccgacctg gtccgcgaac cctcgtcacg cagcagcgcg ataacccggc cgtcggcgtc 3063421 acaggccgcg tacacgccgt cgataccgac cgccggcagg gaccggccgt tggcggccgc 3063481 gctggcctcc gcggcggtca ggtcgcggcg cgcaaacatc agcaggcagg cctcatcgag 3063541 gctcaggctc agcgcggggc gctccgcgag atcgtcgagc gatctcgcct ggtccagctc 3063601 gaagcggccg acgcgggtgc gccgcaacgc cgtcacatgg cctcccaccc caagcgcgtc 3063661 gccgaggtcg cgtgccaacg cgcggatgta ggttcccgag gagcagtcga tctccacatc 3063721 gatatcgatg agctggtcgc gccggcgtgc ggccagcagc tcgaaccggt cgatgcggat 3063781 cggccgggct tccaattgca cggagcgccc ctggcgggcc aaccgatagg cgcgtcggcc 3063841 accgaccttg atcgcgctga ccgacgacgg cacctgccgg atctcaccgc gcagccgctc 3063901 catcgcggcg tcgatcgcct cgatggtcag gtgcttagcc ggaaccgact gcagcacttg 3063961 accttcggcg tcctcggtgg aagtggtctg acccaagcgg atggtggcgg catacgactt 3064021 gggggccgcc gtcagcagac cgaggatctt ggtggcgcgt tcgatgccga tcaccaacac 3064081 cccggtggcc atcgggtcca gggtgcccgc gtggccgacc cgccgggtgg cgaagatgcg 3064141 gcggcaccgc cccaccacgt catggctggt cattcccgcg ggcttgtcga taaccacgat 3064201 tccggggccg gttgcgctca tagcacgatc gcggtcagca ccagtccgcg ctcaaccgac 3064261 cagcgtcccc gcagcgttgt cagcggcgga cccgacaggg tggacccgtc gatgaggata 3064321 cgggagacga agcgacccgt ccagccggtg ctatcggttt cgaacgtgat gtgcgcgtcc 3064381 tcgaaaccca gccacctctt ggtcagcgga aaccacgcct tgtacgttgc ttccttggcg 3064441 cagaacagga ttcgatccca atgcaacgcc gctggcatgg tgcggggcat gtcggcgcgc 3064501 tcggccggca ggctgatcgc atccagcaca ccattgggca acacgtcgtg cggttcggcg 3064561 tcgatgccca cggaacgcac cgcatccctg cgtccgacaa ccgcgccgcg gtaaccggcg 3064621 cagtgggtga ggctaccgac cacgccgtcg ggccagcacg gttcgccctt gtcgcccttg 3064681 aggatcggcg ccggcggcac accgagctgg tccagcgcga tgcgggcgca gtgacgcacg 3064741 gtgatgaatt cgttgcgccg cttggcaacc gatcgtgcga tcaacggcgc ctcctcgggc 3064801 agcggggtga gaccgggtgg gtcggagtac aactcggcat acgccaaatc ctcgaacacg 3064861 gtcgccggca acaccgacgc caccagcgtg cctaccgtca tcgagactgc cgttgccgca 3064921 atcgttcccg gaactgggcg gcctgggttc gcatctcggg cgtgatcacg aagtgaccgc 3064981 cgaagtcgtt gaggtagccg ggcgcgtatt ggggatccgg cagcacctgc cgcagccagg 3065041 agtagggctt gcgccggcgc cactcccgcg ggtaacccac cgacacctcc tcgaaccgca 3065101 caccgtcata ccaggtggtg cggggaatgt gtaagtgtcc gtagaccgag cacacggcgt 3065161 tgtagcgggt gtgccagtcg gcggtcttgg tggttccgca ccacagcgag aattccgggt 3065221 agaacagcgc gtcgcagggc tgtcgcagca gcggaaagtg gttgaccagc acggtcggtt 3065281 gcatccagtc gagctgttcg agacgggccc gggtggccgc gacccgctcg tggcaccagg 3065341 cgtcgcgggt ggggtacggc tcgggtgaga gcaggaactc gtcggtggcc acgacgttgc 3065401 gttccttcgc gatggccaca ccttcggcct tgctgtttgc cccctccggc aaaaagctgt 3065461 agtcgtagag cagaaacatc ggcacgatgg tggccgggcc gcctcgttcg gtccataccg 3065521 ggaacggatg ctcgggtgtg acgacgccca tctcgtcgca catgttgacc agatagtcat 3065581 agcgtgcgcg gccgaagatc tgcatcgggt cgcggttggt ggtccacagc tcgtggttgc 3065641 ccggcaccca gatcaccttc gcgaaccgcc gccgcagcag gtccagcgac cagcggatct 3065701 cgtcggtgcg ttcggcgacg tcgccggcga cgatcagcca gtcgtccggc gaggacgggt 3065761 acagcgattc ggcgacgggt ttgttgccga ggtgaccggt gtgcaggtcg gagatcgccc 3065821 acagcgtcgg ctcggcgccg acggtctcct gccccgatcc tttccaggtc acgacttacc 3065881 accctaacga cccggcgaag tgggaacgaa atccagccag ttcgaccaac cgctacggcg 3065941 tgagcagacg caaaagcccc catttcgggc ccgaaatggg ggcttttgcg tctgctcggc 3066001 caacctagcc caactgctac ggggtcggcg agggttttgg ggtgtcggtc cggctgatcc 3066061 ggcagtccga ctgtgcggtg aggaccgccg ccttctgggt cgagaatttg aactcgacgc 3066121 cgccatgacc ggcgaagtac acgtcgtggt cggccggctt gttcttcatc accccaaagc 3066181 cggtggcacc gaactgttcg acaccgtcct tgacgatctg cacggcctgc agccacttgt 3066241 cgtcggggat cggtccactg aaaacgatca tgaaatacgc cgctttggcc ctggtccatt 3066301 catactcacc accgcagccg gtccaggtgt ccatatccgt tcgccacgtc aggccgggca 3066361 ccagggccgt gatggcgttg gccagctggg tcaccgccgc ccggtactgg tccttggcgt 3066421 cctccagcgg gggcttggcg cgcaacgggt tctccagctc ggcgaccttc tccgggctca 3066481 acggcccctc ctcgccggcc cgcgtcccgt ggccactggg tccgcaccca gtcgccatca 3066541 cacacaccag agccagcagc cacgccgtcg gccaccgcat caacgtcccc ctctcagtgc 3066601 tgggccgggc gctgccggca tgccgccacc cagaattggc ggaagcagcg gcgggcccac 3066661 cgtgttgtcg ggcagcccgg cggcgatcgc cgccaggtta tagccggaca tccgcagctg 3066721 cggctggccg gcggcatcga ggaaggaccg cgggtagtcc ccgtgggcat acactccgtc 3066781 acgccagatc ccgcccggat caaaacccgc ctgtgacgac agctccgtga acccgggggt 3066841 cagatagggg tccaggcccc atccgtgcag cggcgccaac ggcgccacca gattggtgat 3066901 gaggtcgtgg ggggcctgca tgacataagc gtgcccgtga tcgagcccga gctgcgccgg 3066961 gctgtacagc tccaagccgg gtgagccgta aaacacgacg tcgttgaccg gatgggcgct 3067021 ctgggcatcg aggtcctgca acgccagcga cgccgtcagc gacccatacg agtgccccaa 3067081 cacggtcagg tggccactgg ggttattggc gcgcacctgc tgcaaatacc gcgacagatc 3067141 ggccgcgccc gcgtgtgcct gcccatcggt catggtctgc cacagatcgc ccgcactgcc 3067201 ggtgtcgagt gggttcgggg gcgggtggta gcccatccag gcgatggtgg caaccgatgc 3067261 gggcttgccg gcagcattga gttgccggat tacctccgac cgcaggtcgc gggcttcggt 3067321 caccatgccg ggcagggcgc cccgggtggt ggacccgacg ccgggaaccg tcaccgacac 3067381 attggcggcg gtgtcgggat taccgacggc cacggccgcc agcacctgct gatttgggtc 3067441 ctcgggaatc tgcagctggg tcaggtaggt ctcgggtgct cggctcaacg cctcgtcgac 3067501 ggcatcgagc tcacccagcc ggcccctggc ggcgctcagc tcgtcggtaa gcgctgccag 3067561 tcggcccacc gcgtcaccgt cgaggatgcc gttgtggtag tcacgggcgg cccgcacact 3067621 cagttggtca tactccgcct gtaaccgctc gaggtgggcc tgcaggcggg cgcgctcctc 3067681 gcgcagccgc tcctcgttgg catcgctggc gagctgggtc ggggtcagcc cctcgggacc 3067741 gaccggcggg ccggaatcgg ccgggatggg cgcgtcaccg tcggccatat tgaccgctga 3067801 ggccagctcc tcgtcgacgg cattggcctc ggccataatc gcatccagct ccgcctgcag 3067861 ctccgtttgc ttggccagcg tccgcgccca ctgcgcctcg gtggatcgca gcccggggat 3067921 cggcaccacc cggttgatca gcgcatcgat cgtcagctcg gcggccgcgg cggcatggcg 3067981 tagtgcggcc agctcggact gaaccttcac aatcccgtcg gcggccctgt cggccgcccg 3068041 agcaaccgcc aacgcctcgt tgccgtgggc gtcgaggtct cggcgaatgc ccgcgttgtg 3068101 gtgtgccgcc gcctcagcgg tcttgccacc cgagttcgca aaaatcgaca gcgcggccaa 3068161 ctgacgcgac gcctcgaacg tcacctccgc tcgggcactg gccgcgtgaa acacctcccg 3068221 gaccgcttgc gcgttccacc gatcgatatc ggccacggtc agtggcacga atcacacccc 3068281 acgcggacca gctacgacgt cggcggaaac acccacctgg gcgagcgcct gcgcccgctc 3068341 cgcctccgcc gccgcatgct ggatagcggc ctcctgcagc ccgaatgcgt gatcaccgat 3068401 cctggtcagc agcgccctcg acgcgtccaa ccagtcgtcc atcttggcgt tgagcgccat 3068461 cgccgaggcg ccctgccagc cgaactgggc ggcctgcatc cgatagtccg acgacaaatg 3068521 tccgacggcc agaccctcac cctgcgtggt cacctgcgcc gccgagtgca tccactgctc 3068581 cggactgatc tgaaacaccc gttgcttcct tgcgtccatc gaagtgcatc acattatgcg 3068641 tcagcgggaa ctaccgcaga attcaccgca tcaaaggtgg cccgggttag aacaagttct 3068701 cgtttgactg tgacgacgcg gagccgactt gtacactccc ggcaagggac cgccgagggc 3068761 agggggtgtc gtgttcacca gggtgcggct gatcggaggg ctcggtgcgc tgacggcagc 3068821 ggtggtggtg gtggtgggca cggtgggctg gcagggcatc cccccagcgc cgaccggcgg 3068881 cgacgcggtc cagctgcgat cgaccgcggc gcccatgtcc accacgatga agagcccgat 3068941 cgtggcgacc accgacccca gcccgtttga cccgtgccga gacatcccgt tcgacgtcat 3069001 ccagcggctc ggattggcct acacgccacc ggaagccgag gaggggctgc gctgccactt 3069061 cgacgcgggt aactatcaga tggccgtcga gccgatcatc tggcgcacct acgcccagac 3069121 cctgcccccc gacgcgatcg agaccacgat cgccggccac cgcgccgcgc agtactgggt 3069181 gcggaagccg acgtatcaca acagcttctg gtactcctct tgcatggtga ccttcaagac 3069241 cagctacggg gtgatccagc agtcgctgtt ctactcgacc gtctactccg agcccgacgt 3069301 ggactgcccg tcgaccaacc tgcagcgggc aaacgacctc gtcccctact acaggtttta 3069361 ggtccctacc ctgggcgtcg tgagtaccac ctccgctcgg cccgagcggc ccaagctgcg 3069421 cgccctgacc ggacgagtcg gtgggcaggc cctgggcgga ctgttgggtc tgccccgcgc 3069481 aaccacccgc tacaccgtcg gtcacgtccg agtcccgatg cgcgacggcg tccagctggt 3069541 ggccgaccac tacgcacccg ccacgtcgca gcccgtcggc accctgctgg tgcgtgggcc 3069601 atacgggcgc cggtttccgt tttcgctggt gtttgccagg atttacgccg cccgcggtta 3069661 tcacgtcgtg ctgcagagcg tgcgcgggac gttcgggtcc ggtggcgtgt tcgagcccat 3069721 ggtcaacgag gccgccgacg gcgccgatac ggtggcgtgg ctgcgtgaac agccctggtt 3069781 caccggccgg ttcggcacca tcggcctgcc ctatctgggt ttcacccagt gggcgttgct 3069841 gcacgatccg cccccggagc tggccgcggc cgtgatcacg gtggggccgc acgacttccg 3069901 ggcctcggtg tggggcaccg gatcgtttac ggtcaacgac ttcctgggct ggagcgatct 3069961 ggtttcccac caggaagacc ccggtcgcat ccgggccgga atccgccagc tcaccgcgcc 3070021 gcgacgggtg gcgcggacgg ccgccacgtt gccgctgggt gagtcggccc ggacgctgct 3070081 cggcacgggt gcgccgtggt tcgaatcctg ggtggaacac accgaccgcg acgatccgtt 3070141 ctgggaccga ctgcggtttc ccgccgcgtt ggaccgcgtc caggtcccgg tgctgctcgt 3070201 cggcggctgg caggacatct tcctgcggca gacgctgcag cagtaccggc acctgcgcga 3070261 ccggggtgtg cacgtcgcgc tgacggtcgg tccctggaca cacacccaga tgctcaccaa 3070321 ggggctggcc accggcgctc gggaatcgtt ggactggttg gacgcccacc tcggccgggc 3070381 gccggcgctg cgccccagcc cggtgcgggt cttcgtcacc ggccagggct ggcggcacct 3070441 gccggactgg cctccggcga ccaccgagcg ggcgtggtac ctgcagcccg gtggccgcct 3070501 gggtgagagc gctccggctt ccggcacgcc accggcgacg tttcgctacc accccgccga 3070561 cccgacaccg accaccggtg gtccgctact gtcatccaac ggcggttacc gcgacgacag 3070621 ccggctggcc acgcgcgccg acgtgctgtg cttcaccggg gcgcccctca cccacgacct 3070681 ctgcgtgcac ggaaaccccg tcgtcgagct ggtgcacagc tcggacaacc cctacgtcga 3070741 cgtgttcgtt cgggtcagcg aggtggacgc gaagggccgg tcccgcaatg tcagcgacgg 3070801 ctaccggcgc cttggtgacg cgccggagct ggtccgcgtc gagctggacg ccatcgccca 3070861 ccgattccgc gccgactccc gcatccgggt gctgatcgcc ggtagttggt ttccccgcta 3070921 tgcgcgaaac ctcggcaccc cggaaccgat actcaccgga cggcagctca agccggctac 3070981 ccacgcggtg catttcgggc gctcccggct gctgctgccc gtcggctaac ggctggtggt 3071041 gcggcggacc cgggcggcga cccggccgat aacccgagcc cgtccagcgg cgcgtgccca 3071101 gtggtctccc cgacgcggaa cctgcgagag ctacgaccat aagtcgagat gcagtttcaa 3071161 agcctcatcg agctgggcaa gttcggcggc tgaaactcgg ccgattggcc ggagcaaccg 3071221 ctcggtagca atcgatctga tttgctcggc ctgcgccttg cagtcgacct ggagaccagt 3071281 agtggtggcc gacaacaaca cctgaaacgg atagaccttg gcgatgttgc tcgtcaccgg 3071341 cacgacggtg atgacgccgc gcccaagacg cgtggcggtc gcgttggccc ggtcgttgct 3071401 gacgacgacg gcggggcgct ggttgttcgc ttcgctacct cgagcggggt cgagatcgac 3071461 ctgccaaatc tcaccgcggc gcatcaccga ctccgtcgcc gacggtctgc tcccacgcgt 3071521 ccgtgtcgcc ggctgccgac cattcttgcc atgcgttggc atagtcatct tcgagcgtgg 3071581 ggtagcgaag cacgcggatc gcatgctgca ggccggcgga gcgggatggt aatcccgctc 3071641 gtttcacata tgcgtccagg atcgcgacgt cgtcatcgga caggctcacg ctcaacttca 3071701 caacctaaga tgctaccagg gtcgtaccta ggtagtaata ggttcagcgg ctggtcgcgc 3071761 gccagtcgcg cagcacttcc tcgacgtgct cacccacccg gtgccgcgcc gtttcccggt 3071821 cgacacccga catcagcagt tcgtcgaacg aagtgtcgat atgccgcacc gacgccgcga 3071881 cggccagcct caccgcttcg ggatcgagcg cccgtccggc cgcgctacgt ccgatcctgc 3071941 cgctgccgcg ggtggccgcg tggcgggcga tcgcctcggc ccggccggcc gggcagttcg 3072001 ggaacagcgt gcgaatcgcg gcgccgaatt cggcttgcag acgcaggtcc tcgttggccc 3072061 gtcgcgcctc gtcgcgctcc cggcggcggg cgcgcacctc cgcatcggcg aggcactcgt 3072121 tttcggcgcg ctccagcgcc tccgcctcga ccaggatgcc ctgacgctcg tatcgcttac 3072181 gcgcccggct ccaccgcacc accaccgccg aaagccggct cgcccgcttg gcccggcggg 3072241 taagcgcggc gtccccggac ggcaagaaga ccagatggcc aaggtccgcg cagtccaggc 3072301 acaacggccc cgcgtcctca aggaacatca ggtcaccgct gccgccacac gacgcgcatg 3072361 accagtcgtt gaccggcatg atcacgacca aatcggggcg ccggctctgc cgcgcgaccg 3072421 cacgctccga gagctccggc gacacccaat gcgtgcgata cgcgcgctcg atggcgtcct 3072481 cgccggtgac gctgaaccgc agccgacggc ggtcccgagt gcgagcgacg taatcggtct 3072541 ccgacgggtt gagcccccgg tcgcgggccc agcgccgcaa cgcggccatc acggcggtga 3072601 tcttgctgag gttggcctgt acgacttgct ccagcgagtc gacgcggccc tgccgccact 3072661 ggtcgacatg cgagggcgcc agccagccca ggccgagcag cacatcgatc gcgctgacga 3072721 accgctgtcg ggccagcgcc gcctgcgccg cccgggccac ccgctgctcc agaggttgac 3072781 gtgccatgac ctgcccgagc ctagtcggac tgcgcaccga ggccgcggaa ctgagttact 3072841 ccgaccagcc ggacgcgctc ggagtggcga tgcgcgaacg tcgggaacaa cagaacctcg 3072901 ttcggccgcc acggagaaac gcttctcgcc gcatcaacac cgatcagacg tcgacgaagt 3072961 acgtctacat tacgtacatg cccgagactc tgactggtcg cctcaacttc cgcctgtctc 3073021 ctgaacagga gcaggccctt cgccacgccg ccgcgctcac cggccagagc ctgtcggggt 3073081 tcgtattgtc cgccgcggtc gaccacgccc acgatctctt ggcccgggcc aaccggatcg 3073141 agctgtccga ggccgctttc cgccgcttcg tcgccgcgct cgacgagccc gacgaggcgg 3073201 ctcccgaatt ggtgcgcctc gccagacgga agagccgcat tcccccccat tgagcacccc 3073261 cgcgctcggc cccgtcgagc tgttggaccc ggaccggcac gacacggcgc gcttctccag 3073321 cgatgttgag gttctcgacc actggctgcg ccgagtcgcg cccgtcgcgg ctgccgccgg 3073381 cacggccgct acgtgggtgc tctgtcgagg ccggcgggta gttgggttct acgcgctcgc 3073441 catggggagc atcgagcgga tccgggtgcc atcgcggccg ggccggggcc aacccgaccc 3073501 gacccgatcc cagtgctcgt cctcgctcgc ctggcgctcg accggcagga gcaaggcacc 3073561 ggtctcggtg gcgatcttct cctcgatgcc ctcatccgat ccgtggccgg tgcccggcac 3073621 tacggcgccc gcgccctggt cgtcgacgcc atcgacgacc gcgccgccga gttctacggt 3073681 caccacggct tcttgcccct cgagggtcga cgcctctacc ggcggatcag cgacatcgcg 3073741 cgggcgctgg gagtatgaag cgctatcgtc gcttggcgac gtgctgccga tcgatcgcct 3073801 cgaatggcct cgttgttgtt gtcgtcggtg atggggaggg acaacggcaa gattttggat 3073861 ccggtggtgg ccaccacggg gatgggtcgc tcgacggcgc ggcagatgtt gaccggcccg 3073921 aggttgccgg gcccggccga gcaggtcgac gggcgtagcc ttcggcctcg gggcttcagc 3073981 gacgaagcca gggcgctgct ggagcacgtg tgggccttga tgggcatgcc gtgcggcaag 3074041 tacctggtgg tcatgcatga cctgtggttg ccgctgttga ccgctgccgg tgatcttgac 3074101 aagccgctcg tcaccgaggc gtcggtggcc gagttgaagg cgacagccct accaggggcg 3074161 aatcgcatgc cgcactgggc cgcagggaca ctccctgatg gctttccagc ccgggcggtg 3074221 aggacgcgca cgtgaaaacc aacccccggt acggcccggc gttctactca gtgatgacgg 3074281 tgttgttcct ggcgctgttc gtgctaaatg tgtgcaccca cggctcgacg ctgggcctga 3074341 tcagtaccgg aggcctcgcc gtgttgatgg gctacatcgg ctaccggggc tggtccggca 3074401 agcgccatat caaccggcaa tagcgatcat cgaccggttc cggcacacct gaccagcgcc 3074461 gtcgtcggcc gccaacccca cggctcgtgt gccagccgac ggtcaccgtg tcgcggcggc 3074521 gggacacgag gaaactgccc accagccaca cctacttcgc gctcactttt aagtgaggca 3074581 cttcggcatc gaaggcggat aagaccaaga tcctggatcg ggtggtgtcc accaccggga 3074641 tgggtcgttc gacggcccgg cggatgctga ccggcccggg gctgccggag ccggccgagc 3074701 aggtcgacgg gcgcaggctg cgggcgcggg gcttcagtga cgacgccagg gcgcttttag 3074761 agcacgtgtg ggccttgatg ggcatgccgt gcggcaagta cctggtggtg atgctcgagc 3074821 tgtggctgcc gcttgtggcc gccgccggtg atcttgacaa gccgttcgcc accgaagcgg 3074881 cggtggcgga gttgaaggcg atgagcgcgg ccaccgtgga ccgctacctc aaacccgccc 3074941 gcgagcggat gcgcatcaaa ggcatctcga caaccaaacc ctcaccattg ctgcgtaatt 3075001 cgatcaccat ccacacctgt tcggatgagg cgcccaaggt cccgggggtg atcgaggccg 3075061 acactgtggc gcactgcggc ccgagtctaa tcggcgagtt cgcccgcacc ctgacgatga 3075121 ctgatctggt gaccggctgg accgagaacg cctcgatccg caacaacgcg gccaagtgga 3075181 tcctcgaggg catcaaggag tgccagcagc ggttcccatt cccgatgacg gttttcgatt 3075241 cggactgcgg gggcgagttc atcaatcacg acgtcgccgg ctggctgcag gcccgcgaca 3075301 tcgcccagac tcgctcgcgg ccgtaccaga agaacgacca ggcccatgtc gagtccaaga 3075361 acaatcatgt ggtgcgcaaa cacgcgttct actggcgcta tgacaccggc gaagagctgg 3075421 agctgctcaa ccggctatgg ccgttggtgt cgctgcggtg caacttcttc accccgacca 3075481 aaaagcccgt cggctacacc agcaccgtca acggtcgccg caagcgcatc tatgacaagc 3075541 cggccacccc atggcagcgc ctgcaggcat cgggcgtcct tgatgcacag caactctcga 3075601 ccgtggccgc ccgaatcgaa ggcttcaacc cggccgatct gacccgccag atcaacgcga 3075661 tccaaatgca gctgctcgac ctggccaaga ccaagaccga ggccctggcc accgcccgcc 3075721 acatcgacct gcaatcattg caaccgtcaa tcaaccgatt ggccaaggcg aagtaatgca 3075781 agccccccac gcgctcacta tgcgtgaggc accagccacg cttcgcgctc acttctacgt 3075841 gaggcacctc ggatgctgtt gcgaatcctg ttgggccgcc ccagtttaaa gtggatgagc 3075901 ttggtagagg cgcttacgtg tacgttggga aagacgcaac agtggtccta aacaaagatg 3075961 gccaagtggt aaccgcctgg gcgaacagcc gggctggatg gagaaatccg tgagcaacgt 3076021 tctcgatgct atttcaacgg agcaccgtcc cgtgatcgag caagaattag agaatcgtaa 3076081 tcccgctctc ttcgacgagc ttcggcgcac agagaagcca accaacgaac agagcgacgc 3076141 tgttatcgac gtgctttccg acgccttgat gaagaccttt ggacctgatt gggttccgaa 3076201 tgattatggg ttgaaaatcg aacgagcaat tgacgcatac ttagagacgt ggccgatata 3076261 ccgataatcg cttgacacca actattgcca gcaccaggcg cctaccgtgc atcgggagcg 3076321 cggccgggct ggtattcgcg tgggactgaa ggagcttagg caggaacgca catgacgtac 3076381 gcagccaggg acgatacgac gctccccaaa ctgctcgcac agatgcggtg ggtggtgctg 3076441 gtggacaagc gtcagctcgc ggtgctgctg ctagagaacg agggaccggt cgcttccgcg 3076501 acggacccgt tggatacgcg cggtgatagc gactatgaaa accagccggt cgacgcagtg 3076561 gagcggctat gtcggcgttt ggctgaccag gcggtgcgtc agtggggttt tatgcagggc 3076621 ctcaagcaga agctcggacc aggtgtcgac gtgcggatga agctggtgga gtggaaccga 3076681 tgagctttaa tggctcttcc ggaatcagag tgcatggatc agctgagcca ggttgccgca 3076741 gtgcagtagt gaacggattc ggtagtgggt gaggtttctg aatccgaggg cgttacggca 3076801 tagggcttcc agtcgtccgt tgatggcttc ggtgggcccg ttggacgcgt ggtggtcgaa 3076861 gtaggccagc acatcgtggc ggcagcgcca cagggtgcgg cctagtttgg ccagttcctc 3076921 tagtacgaca gggacaccgg ttccggcgct gacggtgagc agcgcggggg cttgccgtac 3076981 ccgggtttgt tgtttgtagt gccggggcgg ctcggtgatc aggtcattgg taggcgacgg 3077041 ccctcccgtc gtctcttgcc ggagtgctac gggagggccg cctgtgtgcg cttggaggcg 3077101 cagtggtcac cgtagaagca gatgtcgatc aagtcgagcg tcggctggcg gccggtgagc 3077161 tgagctgccc gtcttgcggg ggtgtgctgg cgggctgggg ccgggctcgg tcgcggcagt 3077221 tacgcggccc ggctggtccg gtggagttgt gcccgcgtcg gtcgcggtgc accgggtgcg 3077281 gggtgacgca tgtgttgttg ccggtgagcg cgttgctgcg ccgcgccgac acggcggcgg 3077341 tgatcgtgtc ggcgctggcg gcgaaggcca ccagccgggt cgggttccgc cggatcgcca 3077401 cggatgtggc tcgcccggcg gagacggtgc ggggctggct gcgccggttt gccgagcgtg 3077461 tcgaggcggt gcggtcggtg ttcacggtgt ggctgtgcgc ggtcgatgcc gatccggtga 3077521 tgccggatgc aggtggcggc gggttcgtcg atgcggtggt ggcgatcggc gcgctcgcag 3077581 ctgccatcgg gcgccggttt tcgctgccca cggtgtcgct ggctgagacc gcggtagcgg 3077641 tgtcaggtgg gcggttgttg gcgccgggct ggcccggcga gtgggtgcaa cacgagtcga 3077701 ccctgccgta gccgtcgatc gggccgtaaa cctgtgcgct gtcgtgtgtt ttgacagaca 3077761 gcaaatggaa aggagcggcc ggtggcggtc ggcgatgacg aggagaaggt gcgcgcggag 3077821 cgcgcgaggg cgatcgggtt gtttcgctac cagttgattt gggaggccgc cgatgcggcg 3077881 cattccacca agcagcgggg aaagatggtg cgcgagttgg cctcacgcga gcacaccgat 3077941 ccgttcgggc ggcgggtgcg catcagccgc caaaccatcg accgctggat ccggggctgg 3078001 cgggccggcg ggttcgacgc gctggtgccc aacccacgcc agtgcacacc gcgtaccccg 3078061 gccgaggtgc tggagctggc ggtggcgctg cggcgggaaa acccgcagcg cacggcggcg 3078121 gcaatccggc ggatcctgcg tacccagttg ggctgggcgc ccgatgaacg caccctgcaa 3078181 cgcaacttcc accggctcgg gctcaccggc gccaccaccg ggtcggcgcc ggcggtgttc 3078241 ggccggttcg aagccgagca cccgaacgcc ctgtggaccg gggatgtgtt gcacggcata 3078301 cggattgatc tccgcaagac ctatctgttc gcgttcttag acgaccattc ccggttggtg 3078361 cccggctacc gggggccatg ccgaggacac ggtgcggctg gccgccgcac tgcgcccggc 3078421 gctggcctcc cgcggcgtgc ccaacgcggt gtatgtcgat aacggctcgc cctatgtgga 3078481 tgcgtggttg ttgcgggcat gcgcgaaact cggtgtgcgc cttgttcatt ccacgccagg 3078541 tcggccgcaa ggcaggggca agatagagag gttcttccgc accgtgcgcg agcagttcct 3078601 ggtcgagatc accggcgaac ccgacgtcgt cggccgacat tacgtcgctg atctggccga 3078661 gttgaatcgg ctgtttacgg cctgggtcga aacggtttat caccgcagcg tgcattccga 3078721 aaccgggcag accccgctgg cccgctggtc agccggcggc cccatcccgc tgcccgcccc 3078781 cgagacgctc accgaggcct tcctgtggga ggagcaccgc cgcgtgacca agaccgccac 3078841 cgtctcgctg cacggcaacc gctacgagat cgacccggcg ctggtcggcc ggaaagtgga 3078901 gttggtgttc gacccgttcg atttgacccg catcgaggtg cggctggccg gcgcgccgat 3078961 ggggcgggcc attccgtatc acatcgggcg ccattcacac ccgaaagcca aacccgaaac 3079021 ccccaccgca ccgcccaaac ccagcggcat cgactacgcg cagttaatcg agaccgcgca 3079081 cgcagccgaa ctcgcccgcg gcgtcaacta caccgccctc accggggctg ccgatcagat 3079141 ccccggccag ctcgacctgc tcaccggcca ggaggcccaa ccgaaatgat gcacaaactg 3079201 atctcgtatt acggtttttc gcgcatgcca ttcggccgcg atctggcacc gggcatgctg 3079261 catcgccaca gcgcgcacaa cgaagcggtc gcccgcatcg gctggtgcat cgccgaccgc 3079321 cgcatcggcg tcatcaccgg cgaagtcggc gccggcaaga ccgtcgccgt gcgcgccgca 3079381 ctagcgagcc tggatcgcag ccgccacacc gtcatctacc tgcccgaccc caccgtcggc 3079441 gtccagggca tccaccaccg catcgtcgcc tcgctcggcg gacaacccct cacccaccac 3079501 gccaccctgg ccccacaggc cgccgacgcg ctagccgccg aacaagccga gcgcggacgc 3079561 acccccgtcg tggtcgtcga ggaagcgcac ctgctcggct atgaccaact ggaggcgttg 3079621 cggctcttga caaatcacga cctcgactcg tcaagcccgt tcgcctgcct gctcatcggc 3079681 caacccaccc tgcggcggcg gatgaaactc ggcgtgctcg ccgcgcttga ccagcgcatc 3079741 ggactccgat atgccatgcc gcccatgacc gacaccaaca ccggcagcta cctacgccac 3079801 cacctcaagc tagccggacg cgacgatgcc ctgttctccg acgacgccat cgggttgatc 3079861 caccagacca gccggggcta cccccgcgcg gtcaacaacc tcgccctgca agccctcgtc 3079921 gccgccttcg ccgccgacaa ggccatcgtc gacgaatcca ccacccgcac cgccatcgcc 3079981 gaagtcacgg cagactgaac accacaccga caccccgaac accaccgacc ccgccggaca 3080041 tctcccggcg gggtcatttc atgaccaaac gtcctcaccg tcaacgccgc catcatgctc 3080101 atcctgaatg ccggtcaaca gacgcggtgg cgacccagtc gtcgtagttt ccgtcccctc 3080161 tcggggtttt gggtctgacg acacttgcgc gcacaacgca tccgccatcc acggggcgtt 3080221 tccgtcccct ctcggggttt tgggtctgac gacctgaaag ggggactgtg gacgagttcg 3080281 cgctcaaaat gtttccgtcc cctctcgggg ttttgggtct gacgacgcga gaataagatt 3080341 gtcaggctgg tgctgcgttt ccgtgtttcc gtcccctctc ggggttttgg gtctgacgac 3080401 ttcgggagca tgccgcagct gcggatgtgg tgctggattt cgagtttacg tcccctctcg 3080461 gggttttggg tctgacgaca cgttggaagc gtttcgagcg tacggacttt ccggatgcag 3080521 gtttccgtcc cctctcgggg ttttgggtct gacgacttga ctcagacccc agcgccaact 3080581 tgcccaccat ccgcgtttcc gtcccctctc ggggttttgg gtctgacgac aggcgcttcg 3080641 acggtgtggg cgaggtgact tcgcaaaggt ttccgtcccc tctcggggtt ttgggtctga 3080701 cgactgcgtc aagtgcggca ccgccgtcat gtcggtgtcg agtttccgtc ccctctcggg 3080761 gttttgggtc tgacgacttg aacacgccga tacctatttg gtcgggagtg ataaagtttc 3080821 cgtcccctct cggggttttg ggtctgacga ctgacagggt gcggtggtcg ctgatcggct 3080881 ccccgagttt ccgtcccctc tcggggtttt gggtctgacg accggacttg atcgacgcga 3080941 acctgtctga cgcgaacctg tttccgtccc ctctcggggt tttgggtctg acgacggctg 3081001 gaaaagggcg cggggcaacc gcatcgtcaa gagtttccgt cccctctcgg ggttttgggt 3081061 ctgacgacgc gttgtggtcg tgtcgtggag cctgtatttc gctggtttcc gtcccctctc 3081121 ggggttttgg gtctgacgac cattagttgg tgttgtgatc gctaaacgcc ggggcagttt 3081181 ccgtcccctc tcggggtttt gggtctgacg acctatccgc gggaagagat cacgaatccg 3081241 gcgtcgaagg gtttccgtcc cctctcgggg ttttgggtct gacgacatgc tgagctgagg 3081301 cgccggatga tggtggtgct gaaggtttcc gtcccctctc ggggttttgg gtctgacgac 3081361 tgacagggtg cggtggtcgc tgatcggctc cccgagtttc cgtcccctct cggggtgaac 3081421 cgccccggtg agtccggaga ctctctgatc tgagacctca gccggcggct ggtctctggc 3081481 gttgagcgta gtaggcagcc tcgagttcga ccggcgggac gtcgccgcag tactggtaga 3081541 ggcggcgatg gttgaaccag tcgacccagc gcgcggtggc caactcgaca tcctcgatgg 3081601 accgccaggg cttgccgggt ttgatcagct cggtcttgta taggccgttg atcgtctcgg 3081661 ctagtgcatt gtcataggag cttccgaccg ctccgaccga cggttggatg cctgcctcgg 3081721 cgagccgctc gctgaaccgg atcgatgtgt actgagatcc cctatccgta tggtggataa 3081781 cgtctttcag gtcgagtacg ccttcttgtt ggcgggtcca gatggcttgc tcgatcgcgt 3081841 cgaggaccat ggaggtggcc atcgtggaag cgacccgcca gcccaggatc ctgcgagcgt 3081901 aggcgtcggt gacaaaggcc acgtaggcga accctgccca ggtcgacaca taggtgaggt 3081961 ctgctaccca cagccggtta ggtgctggtg gtccgaagcg gcgctggacg agatcggcgg 3082021 gacgggctgt ggccggatca gcgatcgtgg tcctgcgggc tttgccgcgg gtggtcccgg 3082081 acaggccgag tttggtcatc agccgttcga cggtgcatct ggccacctcg atgccctcac 3082141 ggttcagggt tagccacact ttgcgggcac cgtaaacacc gtagttggcg gcgtggacgc 3082201 ggctgatgtg ctccttgagt tcgccatcgc gcagctcgcg gcggctgggc tcccggttga 3082261 tgtggtcgta gtaggtcgat ggggcgatcg gcacacccag ctcggtcagc tgtgtgcaga 3082321 tcgactcgac accccaccgc aaaccatcgg ggccctcgcg gtggccctga tgatcggcga 3082381 tgaaccgggt aattagcgtg ctggccggtc gagctcggcc gcgaagaaag ccgacgcggt 3082441 ctttaaaatc gcgttcgccc ttcgcaattc ggcgttgtcc cgccgcaagc gcttcagctc 3082501 agcggattct tcggtcgtgg tcccgggccg tgcgccggca tcgacctgcg cctggcgcac 3082561 ccacttacgc accgtctccg cgcagccaac accaagtaga cgggcgacct cactgatcgc 3082621 tgcccactcc gaatcgtgct gaccgcggat ctctgcgacc atccgcaccg cccgctcacg 3082681 cagctccggc gggtacctcc tcgatgaacc acctgacatg accccatcct ttccaagaac 3082741 tggagtctcc ggacatgccg gggcggttca gggttttggg tctgacgact cgcggcgagc 3082801 acgtctcacc cagcaggcgg tgaggttggg tttccgtccc ctctcggggt tttgggtctg 3082861 acgacacgga cgagctggac cgcatcagcg atgctgagct gagggtttcc gtcccctctc 3082921 ggggttttgg gtctgacgac ttgtctcaat cgtgccgtct gcggtgacac gctccaagtt 3082981 tccgtcccct ctcggggttt tgggtctgac gactcgacga ttgggacatc gacatcgacg 3083041 cgatcgtcga cgagtttccg tcctctctcg gggttttggg tctgacgacg tgatcttctc 3083101 tcctggcgag gtcaaggaaa tcgatgtgga gtttccgtcc cctctcgggg ttttgggtct 3083161 gacgaccacc aggatcagcg ccaagccagt tagcgcaatc cagtttccgt cccctctcgg 3083221 ggttttgggt ctgacgacct cccggaccat ctgcagctcg cccgggtcca tgcggtttcc 3083281 gtcccctctc ggggttttgg gtctgacgac cggagtcatc cgcgcgggcc ggcgcgattg 3083341 ttgccgggtt tccgtcccct ctcggggttt tgggtctgat ccgcgaaatt cactgcgcgt 3083401 tattcaaggt ttccgtcccc tctcggggtt ttgggtctga cgacccgagc cgaccatccg 3083461 catcacaccg aaagggttgg cgcaagtttc cgtcccctct cggggttttg ggtctgacga 3083521 cacgtgggga gagggaatgg caatgatggt cgacgaagtt tccgtcccct ctcggggttt 3083581 tgggtctgac gactcaaaaa cggggacggc atgctgcatg ccctaacgtc gtgtttccgt 3083641 cccctctcgg ggttttgggt ctgacgacca gcgcagacgg cagccccgag tactcgctct 3083701 cctcaggttt ccgtcccctc tcggggtttt gggtctgacg accagcgcag acggcagccc 3083761 cgagtactcg ctctcctcag gtttccgtcc cctctcgggg ttttgggtct gacgacctaa 3083821 gcccgctaat cccgcacaag tggtcagaaa agtttccgtc ccctctcggg gttttgggtc 3083881 tgacgacctg atgattggtc ggcgtatgac gtgctactga ggtgttgttt ccgtcccctc 3083941 tcggggtttt gggtctgacg acacccctga gccacggcat gtgcacggct ccgtgttcaa 3084001 gtttccgtcc cctctcgggg ttttgggtct gacgactcga tgaatgatgc gcgccagcgg 3084061 taacgcgtgg gatgtgtttc cgtcccctct cggggttttg ggtctgacga ctctgaaaat 3084121 tcgtggtcaa cgaattgtcg tcgaaatgtt tccgtcccct ctcggggttt tgggtctgac 3084181 gacacgggcc gtggggcact tacggcgacg gcgcacatgt ttccgtcccc tctcggggtt 3084241 ttgggtctga cgacccccta gccacgcctg ccgtgccatg ccgcgccgcg agtttccgtc 3084301 ccctctcggg gttttgggtc tgacgacttg gtcaaaagct gtcgcccaag catgaggcaa 3084361 aaagtttccg tcccctctcg gggttttggg tctgacgaca cgactagggg agcgtgatcc 3084421 agagccggcg accctctatg gtttccgtcc cctctcgggg ttttgggtct gacgacgtgc 3084481 aagaattccg ggttgcagtg caacacggtt ttaagtttcc gtcccctctc ggggttttgg 3084541 gtctgacgac tctatggaca attcgtccag cgtgtggtaa caatgcctgc tgatgatgtc 3084601 aaaagaacac aaactcctct gcgctgacaa gccgtcccct tccgtagaac gtaactgccg 3084661 caacacctct tatcttatag atccggatgt tgtcgcagtc gatggcgaag cggtcgatac 3084721 gtgcaactag tttcgcgagc tggcccttcg tcagcatcgc ttcgaatgcg gactcttgga 3084781 cgcgatagcc aaacccggcc aggatcttcg caagtgaagc ccgccgccgg ttgtcgctga 3084841 tgtcgtatat tacgaggacg aacatcttgc ctatagtgcc gctggactcg tccactttga 3084901 gcgggagatt gaagtactcc tcacggctgc gagtgggcat ttaggctccg gatggctcgg 3084961 aggtgatatc gatatcgacg agccgcgacg ggtgcccggc ttcgataaca cgcacgaggc 3085021 tttgcagttg caagtcgagg gcgtactgaa aggtgtatcg gtgaggatcg cctttgatgt 3085081 aggtggcggt tcgtgcgatt cgattaccaa aggcgcgcgc gatggatcgt gtggcttccc 3085141 gtgtcgcgaa gacggccccc gtgtcggagt tcttgctgaa agcccgggtg tcgaccacac 3085201 cgtccgcgat caatcgaagt acggtgtcat cgatgatcgg cgcccgccat acctccatga 3085261 ggtcgctcgc caacgttgcg tgccctcgtg aatcctggtg taggaaaccg atatacgcgt 3085321 tcaggctgtg acgctcgatc gcccctatga tgttcttgta cagcagcgaa tagccgaggc 3085381 tgaccatcga gttgaaggcg tccaacggcg gccgagtcga gcggccctgg aatgcgaact 3085441 cctgcgggac gagatgcccc agcgcggtga agtatgcctt tgcggcattt ccctcgaacc 3085501 cgttcaactc cgccagggag cccgatcgat cgacccaggc cagcgagtgc ttcatcgtgc 3085561 ggatgctctc agcaacgtct tgccccgacg tgtgtgcccg aatcaaggcc tgctgattca 3085621 ggatcttcct cgacacgatc cgcttgctta acgacaggca gaacgcagga tcgtcggtgc 3085681 ggtgaacttg ctgacggagc cgcggcgcgt atgacacgtc gggtgttgag atccggccct 3085741 ggtagtggcc gtcggtcgtg aagagctgga tgtcgcgctc acgcttgagc atctcaacga 3085801 tgaagggcgt tgtcatcgtc ggccgcccaa acagcgtgat gccgtccagc gtctcgatcg 3085861 gatactggct ctcgccgagc tcctcgctcc acacgatcac ccggccgtcg gcaaagctga 3085921 tccgcgacac ggagtccgag acatacagct gcaccatctt gcgcacctgt tagcccagcg 3085981 gtgccatatc aatctgccgg atgatctcgt cgttcaaccg gtcatagagc gtcaaatcag 3086041 cgcctgtttc gcgggcgagg atcttcagca gttgttctgg gagaaggccg ccatccttcg 3086101 tgatgcgatc ctcactgatt gagacgatct cgtgtgctgc ggtgttgcgg acccggctct 3086161 cgaaccttcc gagtacttca agagcaccaa ctcgatcggg tgcgaattag cggagcagtg 3086221 cgagccagtc cttggtgtag aggtaccact ccgcgtttgg cgatttcgga gggtgcttga 3086281 gcgcgcaccg tatctccggc tctctttcca gctttcggcg gtcgacgcgg cccatgtcgt 3086341 cgagatagcg gtcctccgga aggtgttttg ccacagccgc cctgagcacg atagtgattg 3086401 ccggggtagc tgatcgtgcg aattcagccc attgctcgcg ctttgccagc agcgcaagag 3086461 cacttatgta ctcagcgacc ttgttcgcgg ggtcatacgt gaacgcggtg tccttaaaga 3086521 actttggcgc tacgaggtgt tccagcctcg agcggtgcat cgcgccgcgg atcagattgc 3086581 tcacttgatc gggcaggcgc gagtctgccg cgatcgtcac tgctgccgag tagtcgtacg 3086641 acacgatcag ctgcttcagg ttggcccgct caagcagcgc gccgagcgca gcggaagtcg 3086701 cctcaaagca acggttgggg gctccaggct gattgtcgtc gtttgcgtcc cacattagtt 3086761 cgaggtcgta agcgtctggg gattcacgat cgccaggctt gctcaatgcc cgggcaggcg 3086821 tgcttacttg cacagcggtg gtcctgggaa tgccaaacac atttatggcc accagcgccg 3086881 cctgcatcgc aggggtgccg gaactggtat tcagcagaat ggttcgatca gggaactcag 3086941 ccgacagttc aaccaggtgg ttgcggaaaa ccggcacgaa aaggtcgaac ctgtgcaccg 3087001 acgggttggt ataggtgact atgcgaacgt cggtctcagg cgcgagccgc gtgattgccg 3087061 cggagtaccg ccggtccgcg ttctcaaagg cagctatctc ggcgctgagg aatagcacga 3087121 caactattgg tcgatagtgg cggacgatgt gtagcatcgg gccgtcgccg agcgcggtga 3087181 tcgggtccgc agttccgata ggcgagaaca ggatcattcg gctctcctga tcgacagctc 3087241 gcactgaccc atctcgtagc atatgttgtc gatcttggtt cgcttcaaga caagtggtga 3087301 gacgcgtagt tcgcgcgtct tgtcgacgtg cttgactacc ttcccgaact gggcgtcgag 3087361 caccttcgcc atgtcgtctt ggtcggtgac aaaggtcttg ctccgatagc cggctccgcc 3087421 gcccagatag acaattgggc caactatcgc gttcacgcca gggtacatgg ctctgtactc 3087481 cgcgtaacgc gcctgattca cggacgcggc tgtctcggcc agcgtttcaa ggaaccgctc 3087541 gccctcacgc cagccgccgc gagcggtggg actggtgtcg accaccacgc ggtgcgagat 3087601 tgaggttccc ggcgccaaac attcccggaa gagcggcagg ccatcaggct tgccgtggac 3087661 attcatgtcc atcttctggc agatcagcag atcgcttgtt ctcagtgcag gtgagtcggt 3087721 gaccctgatc gcctgaaaca ggtcgttgac cgcgtcttgc ggacgggtgt tggggcgccc 3087781 cgatttgcgc aactccttcc gctcaaacct ttcgccgtac tgccggtgct cccgcgtctg 3087841 gtgtcccgga acacgaacag gttgggccgt ccgcttatgc acaagcgact gcaggtagat 3087901 gctgcgaagc attcccttga cagtcgaacc cggcacgtag ggccttccaa gagggtcttt 3087961 gatgaaagcg tgaatctcgt tgagcgtaag cttctttcga gtcatgcgcc cgcctcgacc 3088021 acgagatgca cgtcgcggtt cgatcgaccc gatcttcacc tcgtaacctc gatgcttagc 3088081 aggatccagc ttgaccgcgt ttggctctac ccactctttg agtggcgccg tcgcctgtgc 3088141 cccatcggtg ttcatgacga acgcttcgaa agacttcctc ttgtgagccg gaatgtctgc 3088201 gtaaagaagt tccatgtccg ggaagtagac ccggtcgccc tccacgtggt actccttcga 3088261 ggtccgcttc tcgccggatc cgataaacac cggccccagg caccgcagcg tgagttcgaa 3088321 cggcttcagg taggtgttca tgcggcggac tccgggagtg cgagaaatag cggtcgcgcg 3088381 tagctgtaga ccggatggtt tccgcccagg ctgacgtcga ggatgcctcc ttggaagggg 3088441 cgcgagaaga ccgagccggc ggcgaatttg tagatgtcgc gtttgcgcag gggcatgtca 3088501 gcgtatgtgc tcgacgcgac gaatccactg cgcttgacga ggcggtacgt cgcgccggcg 3088561 agtgcggctt cgagctcgtc gtccgtgggt agggatgtcg tgagcgtcat cagactggcc 3088621 gcgtcgactg tcggcgtgag tgcggcgggt gcttctgact cggtaaggtt aaacgctccg 3088681 aacccgcttg tccgttcgcc gcccagcgcg gagatccctt tcaacagcct ggtgagtagg 3088741 ccgagctcgg actcggatcc ggtcgccagc aaccacagac ccgcgtccag ctcgaaccgg 3088801 aagtagccga cacggtacgg gtcggcgtct ttctttccgt tgtggatcgc tgccttcgct 3088861 gacacggcgt ggacaccgat cttggtctgc cgcgccgcga gttctttcag gtcggccgtg 3088921 ccatcgagga agctgccaag ctgggcagcg ggaagaaagc cgatcttctt cgccagcttc 3088981 ttctgcatac ttgagccgtc ggaccgaacg ctgtgcaggg gcttgggaac caggtaatcg 3089041 ggccccacat agggcagcag atcggtcaac cgcagcgtcg agcacgcaac gagttcgcca 3089101 agcagctgct ggccacccat ccgtagcgct tcaacgcaaa gcgcagagta gagggtgtcc 3089161 gcggggcagc taatcgtgga cgactcgagg ccgtggtcgc cgaagtgtgt gcggtcgaag 3089221 tcgaacctaa acagccgcga gttcatggtt tagcttctcc agcagagaac cgtcgagggc 3089281 gccgactgcg gcgcgggctt tcaggttgct gaacttgacc tgcccgtagc cacgggttcc 3089341 gctgccgccg aggtagtcga gttcgagcaa cttcaggccg cgcgcgatgg cgttgaagtc 3089401 ctcgatgatc tcatcggagg aaggcagaga cgccttctgt tcctcgccgg gggtgccgaa 3089461 ggagacctcg tagacaagtg agaacgcgaa ctcgctgccg gggatcacgc gttccatctg 3089521 gcgaaggttt gcctttgcgg tcacccggtt gatggcgttc tcgaatttca cctcggtgag 3089581 agtcttagcg ccgcgggctt cgaggtcgtc tttgttggtg agcttcgtgt cgcggaagac 3089641 gagtcggccc gtcatgtact cctcggtgtc gccgaaaagc cgacggatat gggcgtggtc 3089701 ctcattcggc ttcctgtaaa acgtttctgt gtcggcgccg tattggcggg acagcaaggt 3089761 gcggaccttg cccttcaggc tggtacccgg aatcatcggc agcctgctca gcggatcacg 3089821 aacgacaggc ttgtcgaccg cgccgatggc ggagaagcca tcgccggccc cgatctgcag 3089881 gcccgtcagg acggtcagtg tcccggttat ctcgatcttg gcgtagctcg tagtcattgg 3089941 gttgtctcac ttgtccttcg gatcgaggta cttcttgtat gcggctaggg cttccatgta 3090001 ccggcagaat cgcagcagcc cgtcgcggct atcgcctatc ccttccagcg cttctaggag 3090061 tttcgcgttt cggacgaatg tcttaaccgc gtcttcacgc ccggactggt agacgaaccg 3090121 gacccgcagg tactggacct tctccttcag ctgacgcggg agcgtggggt tggcgctctg 3090181 ctgcgcctcg tcgaagagct gtgcggtcag gctgagtagc acccgcagct gggttgtggt 3090241 cagctcgaag ccgttctttt tctttggcag gccgcgaatt acttcggcct gtttcacata 3090301 gtcgtcttgg atgacgctca ttcggactcc tccttgcgag tgcgatagat gtagaggtgc 3090361 agcgcggtct tgagttgctt ggcgtctgtc ggatcttgga accattggtg tagccggtta 3090421 gcaaactgct gaaaaggcgc tgtgtcaccg gtggggttac gcatgcgcgt gaggaagtac 3090481 acccatctgg cctttgtgat tcgatcgtcg cgttcggcga gtagttcgag cagcttgtag 3090541 atgaaggcca tgccgcgttc ttcgttgcca ctgaaatagt cggcgatgtg ccggtacttc 3090601 tcctcgatca ccttgctgag cagctcatcc cagccgaagg tgaactcgcg atcgaagagt 3090661 gcaaccccgt tcttgccggg cagcgacttc gccgcgtctt cgagatctcc gacttcgcgg 3090721 gccatcacgg agatggggta cttgtcgggg aacatgccga tgccagccga cacggtgagt 3090781 ttgccctggg tgaattcgtg gaaccgctcc cgaagctcga tcccgaactc gatgacgtcg 3090841 tcccacgcgc ccacgacgaa gacgtcatcg ccaccggagt agatgatcgt ggcctcgcgg 3090901 ggccgcgccg ggtcatcgcc ggtgatcggg cgcagtttcg ggcgtgccaa cacgtagttg 3090961 atgtgctgcc ggaagaacaa cgacagcatc cgggagaacg cggccgtgcg gctaatcgtg 3091021 ttgaacttgc cgttgccttg ctccatgaag ccgtgcgtga atgcctggcc caggttatcg 3091081 acgtcaaggc gcagaacccc gaggcgcgcg attccgctcg cacgcttcac gtagtcaccg 3091141 aactccatct gtgcgacgta gtcgcccacc cagagcccgg tgcccaaaca ctcgccggcg 3091201 aagaacttgt tcttcgcgta ccgccttcgg gtttggggtt gctggagtgc cttatcggcg 3091261 tcggctcggc tacagaacgt gagtgtggcg ccgaacggca ggggcagacc tttggtggcg 3091321 ccgtcagaga tgagtaggaa gcggcgagac tcggattgaa tctgcgaaga cgcagcggtc 3091381 agcgcttggc acaggctgca ctttggctcg tcgtcggcgc tgaccgtgcg gttgaccgtg 3091441 tggcacacgc tgcattcccg gtcacctttc tgaccgtcgt gatcgcgcga gttgagttcc 3091501 cgcagttggt cagcgctgta tcgggcgagc ttcttcgcgg aaagttgctc gctcaactca 3091561 cggtagagcc cgctgtagcg gagggcgcgg ttacttgcct ggctcgcact ctcgttcggc 3091621 cgacgcatca ggtcgttcgc ggcaagcggt acgctgcccg tggcgatgaa gagccgggtt 3091681 gcgaagtttt ccagcagcca gtcgttggcc tcacgctcga actgttcgac ggatttccgc 3091741 gcggactccg tgttgggcag cagcaggtac gcgtgcccgc cgccggagta gttgagattc 3091801 gcgcggctga gacccacccg cgcaagtagc tcgtcgatga gatgctcggt cagcatctcc 3091861 aggtagaagc tgcgggcacg cagcatcttc gcggcacccg aggaatggat cgtgtagatg 3091921 aagtcctgga tgcctgagac gtcgaaagtt gtgagcagga aggctttttc gttgtagaag 3091981 gtgtcctgct tgtcgaacag cgctgacttg aagtcgcttt gtccggtggc ttgtaggtag 3092041 tgccagatgc aggcgccgag cgcacccgtc agcttcaggt ggtcgaagag tgagacgtcg 3092101 acgacctcgg acgcgtcggt cgaggacggc acgaacgaca gcgtcgcctc gaggacgttg 3092161 aggaggctgg cgaggtaggt gtcggaacgt tcgaggtcga ccagaatggc tttaagtttg 3092221 ttgacgatgg cggcgtagcg gtccttgtcg aattcgatcc ggcgtggcga cggtatattg 3092281 atcggcttgc ggtcgtcgag catctccggg gcaaatgcca gattcgctgt gccggagccg 3092341 aatcggttga acatcgaata caggggcgtg tccggatccc aagtgctcgc accatggccg 3092401 tcgtcggagt cggccttgcg gcggtcggtt ccggccgcga tattgtcggc gatgtaggcg 3092461 atgtaggccg gcgcatcggc ggcaaggcgg ccattctcgg ccgccgtacg cagcgcagaa 3092521 ctgtggtgat agctgatcgc gtcgagaatg cggcggtcgg agaccccaat gtcagcctca 3092581 tccacctcgt cggtgaactg cgacggattg cggctgtcgc gcaaccacac cttcttcata 3092641 aaagcgcggc caatcgcact gtgcctgccc gggtagccga gcgccgcgcg ctggaccggt 3092701 ttgccaatgt cgtgcaagag gcagccgatt atggcctcga tgagttgcgg gttcatggct 3092761 tcggtacgca tttttccctc ggtgccagtg gctggacccg gatcgcgccc atccccatgg 3092821 atgcctttat tccgcatccc gagaactccc cgaaccacaa cagcgccgcg atatagctcg 3092881 caaaagtatc cacaccgcgg acggtgaacg tggccgagcc ggtgaagccg ggaacacgcg 3092941 ccgcgcccac cgcgaacggg gccgacgcca cccggaacgc ggagaggcga accgactgac 3093001 cgaattcggc gatgaggcca ggatcgggct cttcgccgtc gacaattgca ccgtacttct 3093061 gcgcgagact ctgaaacacg agccgcggat ccggccagaa cacgtactcg ccggattgct 3093121 tgaatgcggt aggcgtcagg aactcgaccc ggaacttgcg cgtctcgggc cgcgcgtaga 3093181 aaatgcgcgc gaattgactt agcgggttct gctccagcga tcgcgacgtg acctgtgtcg 3093241 ctatcccgct cgcacggagc cgaaaacccg caaacgccgc gtcgttgata ggtccgacga 3093301 tctgctgccg cgcctcgttc gtcagcgtgc tgatcttcca ctccaaagat gtggtcgagc 3093361 gggccagcgc gtactgactg tacgggttca ccggcacggt gtggagggtc tgcacataat 3093421 cggccgggat cgactccatg aggacgccat gaagatgcgg ccccagggtc gccaccctcg 3093481 cgcgttcgag cggggcatca acctctagag tcagcgtcaa tcgcgacaag ggttccgtca 3093541 tccggcgatc tcctcggtga gaaaagccca ccagaataag cgttggtgaa atccaggtca 3093601 agcctgattc cgccgcactc gcgcgatgtc ggcctcgggg ctggccactc cgacgtagca 3093661 gatccgtcct tctgatgccg cctcggcgag cagccaaggg atggcttcct aacgagccgg 3093721 ggttgtagcg gtgggcgccg gccgcgcgga ggatggcgcc gccaatgctg ctgttgcccg 3093781 cgccgtcaat tgacgccagc agggaccgaa ccgacagcag atcactcgca gcgccgactg 3093841 ggcgtacagc tcagacccaa gcgatgccgc ccagtcaacc cacggcctcg cggacccggg 3093901 cggcgacctc ggccagcgcg gcctcgtcgt gcaccggctc cgccagctgc ggcgtcagcg 3093961 gcagctgcac ccagctggtg caaccgccgt actcgggcct acgcgccagc cggaccggct 3094021 cggccagcgg gatcgcgcag accaccaaga cggccagttt gtgcttgggc cgaaagtcga 3094081 gccggtcggc gcgcaccgac tcggcggtcc agatgtgcag atcctcgatg gcgtccagac 3094141 cctctggccg gttaaccggc agtgcggcaa caactttcgc tgcggcccgc agtagcacac 3094201 actcgtcggt gctgtcggcg gccgccgggc ccagcaggtc gcggtgctcg gggcgaaccc 3094261 gctcggcgtg gctgtgcgcg accgtcggga acaacaagaa ctcgtgggcc gccacctcga 3094321 agcgcttctc gccgatcccg cccttacgca gcagcaccgt ctgccggccg tccagcagcg 3094381 cgtgcaccgc cgcgctccac tccttcagcg ctggcgtcac cacgatcccg cgagccggac 3094441 cgatgtccga atgacgccag caccgcaggg ttccgagggg acgccgatca tctccgagac 3094501 gttttgcccc gggcagttcc attggtcctg ctgaatcagg ccggtcatcc agtgcatcca 3094561 atagtgatga cagtactcgt gtcttgctca ccaccacagc cggattcgtg cccaactgct 3094621 ctcatctagt cgattcagcc gcgtccagcc gcaaccgtgc cagcggaacg gcacgatcgc 3094681 cggcaggttg atcaggaccg cagcaccgcc agcgcgttct ccacttcgcg gcggtgccgt 3094741 tcgtcgcagg cggcccaccg ctgctcgtcg gcgtcgaggt cagtgaggaa cgcaaatcgc 3094801 ttccgaacgc gagcttccca ggcagccata gcgacaggac gggtcagcac gccgatcgag 3094861 tcgggctgga agtcgtgctc gctgcgggcg gcgaggacgt cttcgacgcg tagtggccgg 3094921 gtgccgcgcc ggtcatcgac gacatcaccc cacaccttga gcacccacag ccgccgcacg 3094981 agcggttcgt caatcgttcg cgaggcgaag tggttcaggt cgtacaggtc ccgtgccagc 3095041 gcaacgcggc ggtaccgcgc gagtttctct gcgcaggctt ccgcttctgc cacgaccggc 3095101 agtgtcggca gcccaaaacc gtaagcctta tggatcggca actggatgaa tgcgagcagc 3095161 tcagacggca aagccaacgg ccgccgtgcg aactcgacgc tggcgacgat ccggggctcg 3095221 cccaattccg tgtgccgcac ccgcaactgc caatgccggc cgtcgcctcg tgtgctctgc 3095281 acgccgaatt cgaagccgcc gacacgggcg ccgtcgatca gctcgcacac ctccagcacg 3095341 acctcatcgt cgggcgcgct gaagtccaga tcagtggaga accgcccgac gttgcccagc 3095401 cggcacttcc gtaagctggt accgcctttg aacaccaggc ggttatcgcc gaactggacg 3095461 gtctgcgaca gcaggtacag caggtggtcc tgggcgacgt cgagcagagc ggcgtcgtat 3095521 gcctcggccc gaccaagagc gtgacgcgca acgagcgcac gggtcagacc ggccacagtc 3095581 acgccttgcc gatcacgcgc agcagcggta cgacgagctc gtcgacaagc tgatactcgg 3095641 gagcccagac actctcgcca cggtcgcggc tgtgcgcggt ggtgaatcga gtcaccggca 3095701 tcacttcggt gtgccgcttg gccagcagcg cctggcctcg tgccggttca ccgcccgagt 3095761 ccagcaggta gcttgcacgc tgccaggccg atgtcggccg gcccgacagt agacgctcca 3095821 gacgctcgtc actgcagtcg gcgacgaggt cgtcaaggtg ggggacaagg tcggcccacg 3095881 gcccgaacga ggccgggcgc gtggcgattt gcacaagtaa tgcttctggt cctagcgccg 3095941 gtaacccggt cgcccacgcg acgaggtcga gccgccgccg gaccagcaac gcgggacgcg 3096001 gagccagcag tgcggtgtcc gccgcgttcc aggggatgcg cacgacggac acatacgatg 3096061 ctaggccgtc gggcagcctt ttggccggcg gcagccagat cgggatgcgg ccgtcgggtt 3096121 ggcggtccag gtatccgagg tgccacgctg cggatgcacc ggccagcatg aagcccgcgt 3096181 tctggtcacg ggccagccac gagcgcagcg gtagatacgg gtccgagatg gcggcctcgc 3096241 cggggggaat gaatgcccag gtgcctttca ccggcagttg gaccagccac ccaatgcggc 3096301 gcagttcgcg gatggcggag tcggggtcgc gtccacaccc agcctctgta agccgttgcg 3096361 tcagatcctc tttcgtgacg actacgggcc gatcgcgagc gaggccggac accacccgtg 3096421 acgcccacgt ggggatgcgc cgatcggcgc cggctgggct caccaccgaa cttgaattca 3096481 caccggaaac tatactatat ctgtacgcaa caatgttcaa actcaagaaa tcacttgatt 3096541 taggaacggg cttcggtcag tgacagtacg aaacccgttc caaactcaag tgccctgtac 3096601 gggctggcgg cgatgcggtg caacggcgag agacaaaacg cgcttcgcgg acgaccggcc 3096661 gacgcgccgg agagtcgcca agaacgtcac ccctgaaatc aagtgggacc aggatgcact 3096721 gacgcgttgc tcggaccagt cacccaggcg atgcgcctcg gctcaaaaac tcaacccacg 3096781 gcctcgcgga cccgggcggc gacctcggcc agcgcggcct cgtcgtgcac cggcgccgcc 3096841 aacgtcggcg tcaccggcag ctgcacccag ctggtgcagc cgccgtactc gggcgtacgc 3096901 gccagccgga ccggctcggc cagcgggatc gccgagacca ccagcacggc cagccgatgc 3096961 ttgggccgaa agtcgagccg gtcggcgcgc accgactcgg cggtccagat gtgcagatcc 3097021 tcgatggcgt ccagaccctc tggccggtta accggcagtg cggcaacaac tttcgctgcg 3097081 gcccgcagta gcacacactc gtcggtgctg tcggcggccg ccgggcccag caggtcgcgg 3097141 tgcgcggggc gaacccgctc ggcgtggctg tgcgcgaccg tcgggaacaa caagaactcg 3097201 tgggccgcca cctcgaagcg cttctcgccg atcccgccct tacgcagcag caccgtctgc 3097261 cggccgtcca gcagcgcgtg caccgccgcg ctccactcct tcagcgctgg cgtcaccgcg 3097321 atcccgcgag ccgggcagcc acgtcgggtc ggcgcaacgg cgggacggtc ttcggcggct 3097381 gccgccgggg cggcagggcg tccagcaacc gcgtcgtcgt cgcggtcacc tcggcgacgg 3097441 cggcctcaaa cgcctcggcg gtggccgccg acgggtgcgt gatgccactg accttgcgca 3097501 catactggcg cgccgccgcc gcgatctcga cgggcgtggc cgggggttgc agcccgcgca 3097561 gttcggtgat gttgcggcac atgccctcaa cgataggcgc ggctaccaga cggtgaccgg 3097621 tcgtgggtgc cgatgactgc gtagccgccg gtccttggtc accagccgcc agccgtgttc 3097681 gatcgcggtg gcgtagatca accggtcggc cggatcgccg gggaacgacg agggcagcgc 3097741 caccgccgtg gcggcgaccg agggcgtgat accgacggtg cgaacgtgct cggccagctg 3097801 ctgaagccag gacagcaccg gaatcgccag ttggatgcgt tcctgttcgg caagccaagc 3097861 cagctcgaac cacgaaatcg cggcgacggc gagctcgtcg gcgtgttcga tggcctggct 3097921 cgccgccatg ctgagacgct gcggctcggc cgaccaccag taggccacat gcgagtcgag 3097981 cagcaccgtc gtcatgaaac gttccacgaa accccggtgg tgaagagttc gtcgtcatcc 3098041 acggccgcca tcgccacacc cgagaatcga cccttcagcg cgtgcggccc cgtcgctgcc 3098101 accagccggg ccacggtgcg gccgtgtttg gtgatctcga tctcctcgcc ctgggccact 3098161 tcatcaagca aggagaggat cttcgccttc acctccgtag cggtcatttt tctggtcatc 3098221 aggacagtct aacggtcctg ttacggtgat cgaatgaccg acgacatcct gctgatcgac 3098281 accgacgaac gggtgcgaac cctcaccctc aaccggccgc agtcccgcaa cgcgctctcg 3098341 gcggcgctac gggatcggtt tttcgcggcg ttggccgacg ccgaggccga cgacgacatc 3098401 gacgtcgtca tcctcaccgg cgccgatccg gtgttctgcg ccggactgga cctcaaggag 3098461 ctggccgggc agaccgcgct gccggacatc tcaccgcggt ggccggccat gaccaagccg 3098521 gtgatcggcg cgatcaacgg cgccgcggtc accggcgggc tcgaactggc gctgtactgc 3098581 gacatcctga tcgcctccga gcacgcccgc ttcgccgaca cccacgcccg ggtgggcctg 3098641 ctgcccacct ggggactcag cgtgcgcttg ccgcaaaagg tcggcatcgg cctggcccgg 3098701 cggatgagcc tgaccggcga ctacctgtcc gcgaccgacg cgttgcgggc cggcctggtc 3098761 accgaggtgg tggcccacga ccagctgctg cccaccgccc gccgggtggc ggcgtcgatc 3098821 gtcggcaaca accagaacgc ggtgcgggca ttgctggcgt cctaccaccg catcgacgag 3098881 tctcagaccg ccgccgggct gtggctggaa gcctgcgcgg ccaagcaatt ttgcactagc 3098941 ggcgatacca tcgccgccaa ccgcgaagcc gtgctgcagc gcggccgcgc gcaggtgcgt 3099001 tagcggcgat cgcaagcgcg gcgaagccgg gtgctggggg tacctcccgc gtgcggggga 3099061 cgggtcgccg ccatcagccc ttcagcgaag ccgggtctcg gtgcggctgt tgaagaggcg 3099121 cacctcctgc gagtgcggca cgatcgccaa cgactcaccc acccgcaccg cggtacgccg 3099181 gtcggtgcgg aacacgatgc gcggtgcgcg tgacgaccag ccccgctggt cgaccggcgt 3099241 tgcgtagacg aaggattcga agccgagctc ctccaccaac tcgacgtgca cggtcaacga 3099301 tcccggggtg ccgatcgatg ccacgtccca ggactccggc cgcacgccga ccagcacccg 3099361 ctcggccgcc gggtccggaa ccggtatcgc caaatccggt gcccgcacca caccgtgggc 3099421 gacggcggcg tcgatgaggt tcatcgccgg cgcgccgatg aacgtggcga caaacgtgtt 3099481 gaccgggtcg tcatacagcg ccctcggcgt gtcaacctgt tgcagcacac cgtctttgag 3099541 caccgccacc cggtcgccca tcgtcatcgc ctccacctga tcgtgggtga cgtagacggt 3099601 ggtggtgccc aaccgacgct gcaatccgga gatctgtgag cgggtgctca cccgcagctt 3099661 ggcgtccaga ttcgacagcg gctcgtccat gcagaacacc cggggccggc gcacgatcgc 3099721 ccggcccatc gccacccgct gccgctgccc gccggagagc ttggcgggct tgcggtccag 3099781 cagatccgtc agctccagca tgtcggcgac ttccagcacc cgccggcggg tgtccgcgcg 3099841 cgacatcccg gcgtttcgca gcgcgaaccc catgttggcg gccaccgtca tgttcgggta 3099901 cagcgcgtag ttctggaaca ccatcgccac gtcacgcgcc cgcggcggca gatgcgtcac 3099961 atccacgtcg ccgatgctga tgcgcccgct ctcaatgggt tccagcccgg ccagcacgcg 3100021 cagcgtggtg gacttgccgc aaccggacgg accgaccaga accagaaact ccccgtcggc 3100081 gatgtcgagg tccaagttgt cgacggtcgg cgcgtcggcg ccgggatagc gctgggtgac 3100141 agcagagtac tgaacgttag ccatgccccg ccagcttccg catgatctgc cgatccagga 3100201 tgacctgcaa ccgtttttgg atgttcgtga aggtcttggt cacgtcggct ccgcgcagcc 3100261 cgatggattc caggccggcg gagatgatcc ggtcaccacc gggcaggaaa acccgtgcgt 3100321 agtcttgtgt ccgggtgtgt ggcagctggt cgagcgccac ccgcgcacgg ggattgtccg 3100381 ccagatagtg ccgttcgctg gcatcgtcga cggcggactt gcgcaccggc agatagccgg 3100441 tttgctggct gaagtaggcg gtgttcgtcg ggttggtgac gaatgcgatg aacttgagcg 3100501 cgttgacttt tcgctcctcg gagagcttgg ccggtatcgc cagccccgca ccgcccgtcg 3100561 gacaggcggg cgctgcgtcc gggcccgtgg gcagcggtgc ggcgccgaag tcgaatcggg 3100621 cagatgcggt gatgccgggc agcgagccgg tggatgccac ggccgaggcc aggattccgg 3100681 tggcgaactc gttggcaata tcgttggcga ccgccgcata acccttgcca tggatggagt 3100741 tccgatagaa gttgccggcc gcgatcgtgg cgggctcggt caatgtcaat gtccacttgt 3100801 cggagtaggc accgccgaat gcccagttcg gtccctgaaa cgtccacgag atgaggtcgg 3100861 cgttagccca gccgtgcgcc gatcgaccgg cgccgaccac gcgctgtaac tccggacccc 3100921 actcgtcgaa ctctgaccag gattgcagtc cgcggtcggg taggccggcc tgttgccacg 3100981 ccgccttgtt gtagtagaac agcggcgtcg agcgagcata cggcacagcg taatggcggc 3101041 cgttgaactc atagtcggcc agcagcgaat cgacgtaatc cgttgtgtcc accccaactt 3101101 ggccgaacag gtcgtcaagg gcagtgagaa caccgatgag ggcgaaatgg aaccaccatc 3101161 ggtcgtcgag caaaacgacg tcgggcacgt cggttccgat gagcgccgca ttgaatttct 3101221 gtgccacctc gtcgtagtcc ttgccggcgt cgatcagctt gaccgacaga gtggggaatc 3101281 ggtcctggaa acgaccgatc agctcccgtt ccgccgcgct ggattggccg ggatgactgg 3101341 accagaagtc gattgggccg gaaccggact tcaccgaacc gccgccgccc atcccggcgc 3101401 agccggcggt cacgccggcg gcggcagcgg ccagcgcgag gaattgtcgg cggttcagcg 3101461 ggtccatgcc tatcccttga ccgcgcccga ggtgaggccc tttatcatct gccgctgcaa 3101521 ggcgatgaag accagcaaga tcggcagcat cgccaacagc gtcaccgcca tcaccgggcc 3101581 ccagttcgtc acaccctcgg cctgctgcag aaacgtcaga cctatcggca gtggtgccac 3101641 cgattcgtcg tcggacatca ggaacggcca caggtattcg ttccattcgt tgaccacggt 3101701 gatgacaccg acggcgacca tggtgggccg cgacatcggc aacaccaccc gcagcagcag 3101761 ttgccaccac cgcgcgccgt ccatccgggc cgcctcgatg atctcggcgg gcagcgacag 3101821 aaagtggttg cgcatcaaga aggttccaaa cgccaccccc gccagaggca ggatgatgcc 3101881 ggcaaaggtg ttgcgcaggc ccaggtgtga gatcagcgcg tagttggaaa tcacggtgat 3101941 ctggttgggc accatcaacg cggcgatgat caccaaaaac accgccgtgc ggcccgggaa 3102001 ccggacaaac accaagccaa aggcgctgag cacaccgagc gtgaacttca ccaccgccag 3102061 caccgacgtg atgatcagcg agttgcgcag aaacgtccag aacggaatct gctcggtggc 3102121 cgtgcggtag ttctgcgggt accagcgcag cggccaccaa ctggtgggct gcgcatagat 3102181 gtcgggctga tccttgaacg aggtgaagaa cacgaacagc aacggcccgg caatcagcgt 3102241 gaccaccagc aacatggccg cgtagccaac gctgctacgg agccgatccg gcgtcactgc 3102301 cgctgccccc gatccatcac ccgcacctgg tagtacgtca cggccagcag caccaggaac 3102361 atgatcgtgg ccaccgtggc gccataaccg gcccggaaat tgcggaacgt ctccacatac 3102421 acctggtaca ccatggtggt ggtgccggtg ccctccggcc cgccccgggt catcacgttg 3102481 atcacatcga acacctgcag cgagttgatc agcacggtga tcgacaagaa aaacgtggtc 3102541 ggccgcagct gcggcaacag cactcgacgg aacacggccc accggctggc gccgtcgatt 3102601 tcggccgcct ccaacagatc tcggcgtacc ccctgcaacg cggccagata gatcacgaag 3102661 gtatagccga ggttcttcca gacgtaggtg atggtcacca tgaacaacgc ccagcgcgca 3102721 tcctggtaaa agtcgggcac cccgaccccg atccggcgca acaggtcttg aatcagaccg 3102781 aaatgcgggt cgaagacgaa ctgggcggcc aggccgacag cggcaccgga gatcacgaac 3102841 ggcgcgaaaa cagtggagcg caccaggttt cgtccacgca acggtcgatc gagcagcatc 3102901 gccagcgcca accccagcac catcgagccg accaccgcgg caccggtgaa accgccgtgt 3102961 tgaacacgat ctggcgggtg tccgaccggg tgaaccactc ggtgtagttg gataacccca 3103021 caaatcgggc cgacggatcg gagacgttcc agtcgaagaa cgacagccgg atgttgtcgg 3103081 ccaacgggcg atagacgaac agcagcaata gcgccacatt ggggccgacc aacacgacga 3103141 acagcgcata atcgcgcacg cgctctttcg atgaccgaag ccgtgctcgt tgcggcgccg 3103201 ccatcggcgc agtgtagctc cgtattctgt cggcgagttt gccgccacgg tcgatgaacc 3103261 aatcaccgcc gtcacggcaa tgtcggccag ctacgccgcg cccgtcaccg cccaccgacc 3103321 gctgtacgcc cgccatccga cgaatatcag ccgtagcacg ataaacgtgc ccagtcccga 3103381 ccagataccc gccagccccc agccatacgc cagcgacaac cagacaagcg gcaaaaagcc 3103441 caccaacgca ctcgccaccg tcgccgtccg catgaacgcg gcgtcgcccg cgcccagcag 3103501 caccccgtca actgcgaaaa caattcccgc aaaaggcaat tggactacca tgaaccacca 3103561 cagcaccccg atcgcggcga gtaccgatcg atcgtcggtg aatagcccgg gcagcaccga 3103621 ggagcctagc cctaacgccg ctgccaaaat tcccgccgcc aacagcgaaa acgccgtcac 3103681 ccgccatgcc accgccttag cgtgcccggc atcaccggca cccaacgcgg caccgaccag 3103741 cgactgcgcc gcaatcgcta gcgaatcaag aaccagcgca agaagacccc acaactgcaa 3103801 cacgacctgg tgggccgcga gcgcggcagc gccgaacctc gcggccaccg ccgcagccga 3103861 gacataacaa acttggaagg ccagggtccg cacgatcagg tcccgcgcca tcatcagctg 3103921 ggcgcccagc acggcgcggt ccggccgcag cgacacccgc tcggccagta acgcaccggc 3103981 aaacagcagc gccgccagcc actgccccac cagattggcc accgccgagc cggttaaccc 3104041 ccagcggggc aaccccagcc aaccgtaaac cagcagcggg cacagcagag ccgacgaccc 3104101 gaagccggcg accacatacc gcagcggtcg cacggtgtcc tgcacgccgc gcagccagcc 3104161 gttgccggcg agcgagacca ggatcgccgg cgtgcccagg atcgcgatcc gcagccacgg 3104221 caaggccgcc gcggtgatgc catcgccaga agcgatcgcc gacaccagcg gcgtcgcggt 3104281 ggcttccacc acgacgacga ccaacgcgcc cagacccaac gccaaccagg tcgcctgtac 3104341 accttcggtg accgcggcca cccggttgcc ggcaccgtaa cgacgcgccg cgcgcgctgt 3104401 ggtgccgtag gacaaaaacg tcgcctggga accaaccagg ccgagcacca gactgccgat 3104461 agccagaccc gccagcgata tcgcccccag ccggcccacc acggcgatgt cgaacagcag 3104521 gtacagcggc tcggcggcca gcacgcccag cgcgggcaac gccagctgcg cgatctgacg 3104581 gccgcccgcg cggtgcccca cctggctcaa cggcggctaa ccaagcgccg cgcgcaacga 3104641 cgccacagcg tcgtcgatcg agccggtggt cgtatacccc gcggccagcc ggtgaccacc 3104701 gccaccgaac ccagaggcaa ccgcggccaa attcacggtc ttagcccgca tcgacaccga 3104761 ccaccgatgc ggttcgacct ccttgaacac cgccgcgacc tcggcttgtt gcgtggtgcg 3104821 gacgatgtcg acgatgcttt ccacttcctc cgagcgcgca gcgacccact cccggttgtc 3104881 gacgacgacg taaaccagcc cgcggccacc gaccgcctcg gacaccagct gcgccgaacc 3104941 caacacccgc gatagcaacg gcaaccaggt gaagggatgg ctgtccatca aggtcctgct 3105001 gacggtggcg ttgtccacac cgatctctac cagccgcgcc gccagccgat acccccgcac 3105061 actggcccag cgaaacgacc ccgtgtcggt cgccaacccg gcgtagatgc agtgcgcgac 3105121 gcgcgggtct atcggtttcc cccacgcgtc gaggatctcg gcaaccatcg tcgtggtgga 3105181 atccgccgac gggtcaatga aattcgcggt gccgaacagg tcgttggagg cgtgatggtc 3105241 gattaccagg agctcccgcc cggaatcagt tagatcgccc agagcaccga gccgatcaac 3105301 actcggaatg tcaacagtca caaccaaatc gacatcgcgg cgcatcacct cagggcggac 3105361 cagcagatgg cagcccggca gcgaacgcag cgactcgggc agtgtcgccg gcgcggcaaa 3105421 gctgacctct acccgcttgc cgcacccgtc caacaccaat gccaatgcca atccggcgcc 3105481 gatggtgtcg gcatcggggt ggacgtggca gactaccccg accctggcag cggccgacaa 3105541 cagcgcagcg gcaccgacgg cgtccacgcg ggcccccgcg cgacgccgcc cgtcgaccag 3105601 ctcactcctt gggtcgatcg tcgtcaccgg tgtctccccc gcaagtgagc ggtgcctcca 3105661 cagcctcggg tccgtcgctc gttctgatac ccagtccccc gggagccggt gattgcgcca 3105721 ccgacccgtt atcacggtac gggtcggcct cccccgccgg tttggcgccc acccggaccc 3105781 gcgccagatc ggcatccgcg gcgcgagcgc gggccagcaa ctcgtccatc cggtgcacac 3105841 tgtccgagat cgtgtcgagc gtgaacgtca aggtgggagt gaaccgaacg ccggtgcccg 3105901 ccccgacctt ggtgcgcagc acccctttgg cccgttccag cgcggcggcc gcgccggcgc 3105961 agttcggctc gtcgtgtagc gtgcgtccca tcaccgtgta gtacaccgtg gcatcgtgca 3106021 agtcggcggt caccttcgca tcggtgatgg tcaccccggc caatccagga tccttgatct 3106081 cgtactcgat cgccgaggcg acgatcgcgg cgatccgttt ggccagccgc cgcgccctag 3106141 cagcatcagc catcaggcgc gttccttctg gaccagctcg taggactcga tgacgtcgcc 3106201 ctccttgatg tcggcgtaac ccagtgtcag gccacactcg aagccgtcgc gcacctcggt 3106261 cacgtcgtcc ttctcccggc gcagcgaagc gatcgaaagg ttctcggcga ccacgatgtt 3106321 gtcccgcaac agccgcgcct tggcgttgcg ccgcatcaca cccgaggtga ccaggcagcc 3106381 ggcgatgagg ccgaccttcg aagaccggaa caacgcccgg atctcagccc gacccagctg 3106441 gttttcctcg tagatcggct tgagcaggcc acgcagcgcc tgctcgatct cgtcgatcgc 3106501 ctggtagatg accgagtagt agcggatctc cacgccttcg cggctggcca gctcggtcgc 3106561 cttgccttcg gcgcgcacat tgaaaccgat gatcaccgca tcggaagccg acgccaggtt 3106621 gacgttggtt tcggtaatgc cgccgacacc gcggtcgatc acccgcagca ccacctcgtc 3106681 gtccacctgg atacccatca gggcctcttc cagcgcctcg acggtaccgg cgttgtcgcc 3106741 cttgaggatc aggttcagct ggctggtttc cttcagcgcc gagtccaggt cctccaggct 3106801 gatccgcttg cgtgagcgcg ccgccagggc gttgcgcttg cgagcgctac gccggtcggc 3106861 gatttggcgg gcgatacggt cctcgtcgac gacgaggaag ttgtcgccgg cgccgggcac 3106921 cgacgtgaag ccaatgacct gcacaggccg cgacggcagc gcaacctcga cgtcttcgcc 3106981 gtgttcgtcg accatgcggc gaacacggcc ataggcgtcg ccggcgacca ccgagtcacc 3107041 gacccgcagg gtgccgcgct gcaccagcac ggtagccact gggccgcgac cacggtccaa 3107101 gtgcgcctcg atcgccacac cctgggcttc catgtcgggg tttgcccgca ggtccagcgc 3107161 ggcgtcggcg gtcagcaaca cggcctcctc cagcgcctcg atattggtgc cctgcttggc 3107221 cgagatgtcg acgaacatcg tgtcaccgcc gaattcctct ggcactaaac catattcggt 3107281 aagctgcccg cgaatcttgg ccgggtcggc accctccttg tcgatcttgt tgaccgccac 3107341 cacgatcggc acgtcggcgg cctgcgcgtg gttgatggcc tcgaccgtct gcggcatcac 3107401 tccatcgtca gcggcgacca ccaaaatggc gatatcggtc gccttggcgc cacgggcacg 3107461 catggcggtg aacgcctcgt ggcccggggt gtcgataaag gtgatcagcc gctggctgcc 3107521 gtccagatcg acggccacct ggtaggcacc gatgtgctgg gtgatgccgc cggcctcggc 3107581 ctcgcggacg ttggccttgc ggatggtgtc caacagccgg gtcttgccgt ggtcgacgtg 3107641 acccatcacc gtcaccaccg gcgggcgaac ctgaaggtcc tcctcgccgc cctcgtcctc 3107701 accgtagctg aggtcgaagg attccagcag ctcgcggtct tcgtcctccg ggctgacgac 3107761 ctgaacgttg tagttcatct cgctgcccag caactccagc gtctcgtcgc cgaccgactg 3107821 ggtggccgtc accatctcgc cgaggttgaa cagcgcctgc accagcgccg cggggttggc 3107881 gtcgattttg tccgcgaagt cgctgagcga cgcgccgcgt gcgagccgga tcgtctcgcc 3107941 gttgccgtgc ggcaaccgca ccccgccgac gaccggagcc tgcatcgagt cgtactcctg 3108001 gcgcttctgc cgcttggact tgcggccgcg ccggggcgca ccacccgggc ggccgaacgc 3108061 gccggcggca ccgccacgct gcccgggacg gccaccgccg ccgcccccgg gacggccgcg 3108121 gaagcccgtt ccgggagcgg cacccacgcc gccaccccgg tagttgccgc cgcccgcgtc 3108181 ggaacggcca gcgccgggcg caccgggccg gcccccgggt cgtggcgcac caggacgtgg 3108241 tggacgggca cccccgacag ctccaccggg gcgtggcggc atgctgccgg gcgaggcgcc 3108301 cggacgtgga accccgggcc gggcggtacc ggggcgggga gccggcggac gcgggatggg 3108361 ccggtcggcg ggttgcgccg acgagaacgg gttgttgccg acgcgcgggg tgcgaatccc 3108421 cggcttcggc accgggcccg gccgcgcccc gggagccatg ccggggtggg gtgcctgagg 3108481 gctgggcggc actgcggtcg gaggctcggg tgcggcgggg gtagttgggg agacgattgc 3108541 cgccccgccg gaatcggcgg ccttggcggg cgcggcagtc gccttgccgt tgcctgcggc 3108601 catgtcgatc gcggcgtcca gcgccttgtc aagggacttg tcggggcctt tgccggggga 3108661 cttggcggtg cctttcgccg gggcaggttt gctgccaccg aacgattcac gcagccgacg 3108721 ggcaaccggt gcttccaccg tcgacgatgc tgatttgacg aattcgccct gctcgctcag 3108781 ccgggcgaga acttccttgc tggttacacc gagttcctta gccaactcgt gtacgcgggc 3108841 cttacctgct gccactacat ctcctgtcca tgaggcgaca gtcgtgggcc gcgcctcggg 3108901 tttagctatg acgcattgtc atcgggactt cacggtgtgc tcatgttcta ttgctacctg 3108961 ttctgttgcc cggtggttcg agctcgccta gagactccag gtactcgacc actgcggatg 3109021 tgtccggcga accggcgatg cgcagcgctc ttgcgaaagc ccgccgccga atcgcttgtt 3109081 gcgcgcactg ccgtagcgga tgcagccacg caccccgccc cggcaggctg gtcgctgtat 3109141 caacgatcac ggcgtagttg ccgttcccgg tcgacacagc caccactcga agcagttcga 3109201 cggccaaccc tcgctttcgg cacccgacac acgtccgcac cggtccgcgg ggattatccg 3109261 ggcgtcgatg cgccgaggcc gaaggctcgc gctggatcac ggctaagtgt agcgtcaccg 3109321 ggcaagcccg attgcccggc tatctgccgt gggataacgc acggcgctag cggtcgtgcg 3109381 ccataccgcg gctgactccg ggttcgggct gaccgggcgg gggcggcggc gcatcgccgc 3109441 gaatatcgat acgccacccg gtgagccggg cagccagccg ggcgttctgc ccttcctttc 3109501 cgattgccag cgacaattgg aaatcgggca ccaccacgcg ggcggcccgg gcggtctggt 3109561 cgatcaccga caccgacacc accttggccg gcgacaacgc gttggcgaca aaacgcgccg 3109621 gatcgtcgtc atagtcgatg atgtcgatct tctccccgga cagctcgctc atcacgttgc 3109681 ggacccgttg ccccatcgga ccgatgcaag cacccttggc gttcaagccg gcaacgttgg 3109741 accgcacagc gatcttggag cggtggccgg cctcccgggc caccgcgacg atctccaccg 3109801 atccgtcggc gatctcgggg acttccagcg agaacagctt gcgcaccaga ttggggtgcg 3109861 tgcgcgacag cgtaatcagc ggctcgcggg cacctcgggt tacaccaact acgtagcagc 3109921 gcagccggtt gccatgttca tagctctccc ccggtacctg ctcagcggcc gggatcacac 3109981 cctcggaagc cttggtctcg gtgccaatcc ggacgacgac cagaccgcgg gcgttggccc 3110041 ggctatcgcg ctggatcact cccgcaacga tctcgccctc gcgggtggag aactcgccgt 3110101 aggtgcgctc gttctcggcg tcgcggaatc gctgcaacat cacttggcgt gccgtcgtgg 3110161 cggcgatccg gccgaagccc tctggagtgt cgtcccactc gctgatgaga ttgccagcct 3110221 catcggtctc acgggcgatc acccgaacga caccggtttt ccggtcgatc tcgatgcgcg 3110281 catcggtctg gtgaccttgg gtgtgccggt aggcagtcaa cagcgcggac ttgatcgttt 3110341 cgagcagttc attgaccgag ataccccggt ccacctcgat ggcatgcaga gcagccatgt 3110401 cgatgttcat gctccggcct ccgtcccgcg ggccagcccc atctcggaag actgggccag 3110461 ttccaactcc gccggagccg gtggcgaaaa ctcaacctgg acaacagctt tcacaatctc 3110521 agcaagcggg atctcacgga ctgcccagcc ccggtcttcc cggatcacca acgccaccgt 3110581 gccagcacgc atctcgccga cccggccggt cagtcgcgat ccgtctgaca acaccagctc 3110641 aaccttgcgg cctcgagcac ggcggaagtg cttttcgctg gtcagcgggc gttccacacc 3110701 gggagagctg acctcgagca ggtagcggcc ccggatcttg ttcgcaccgt ccaggccgtc 3110761 cagcaaagcc gatgccctgc gcgacaatgc ggctatcgta tccaggtcga gaggggcgtc 3110821 accgtcggcg atcaccgcta tccgcggcgg gcgggcccgc gcatcgatga ccacgtcttc 3110881 gatctcgtag ccggcgcacg cgaaatctgc accgagtagc tcgatcacct gcctctgcga 3110941 aggtagcccg gtggtcacgg cgagctcctc atcttgagtt gtccggtcat ctagcggagg 3111001 cgccgccagg gcggctccca gtgtcccgcc ggcacgcagc agccggcgta gctaccaacg 3111061 atacgccagg aatcacgaat gacgccgtga tcacgccgtt caacgtctcg cctgccttcg 3111121 tagcggcgtc tttctgatgg caggatgttg ctgtgcttag agcagcacca gtcatcaacc 3111181 ggctcacgaa tcgacccatc agcaggcggg gtgtgctggc cggtggcgcc gcgctggccg 3111241 cactgggagt ggtgtccgcc tgcggcgagt ccgcgcccaa ggcacccgcg gtcgaagagc 3111301 tgcgctcgcc gttggaccag gcccgacacg acggtgcgct cgcagctgcc gccgccacag 3111361 ccatcgggat cccgccgcag gttgccgccg cgctgaccgt cgtcgccact cagcgaacct 3111421 cgcatgctcg agcgctggcc accgagatcg cccgggccgc gggcaagctg gtatccgcta 3111481 cgagcgaaac cagcagctcc agtcccagcc caaccgatcc ggcggcaccg ccaccagcgg 3111541 tgtccgacgt gatcgattcg ctgcgcacgt cagcggggga agccagtcga ctagtggcga 3111601 cgacatcggg ctaccgagca gggttgctcg cctccattgc cgcgtcctgc accgcctcct 3111661 atacggttgc gctcgtgcct tcaggcccgt cgatatgacc tcgtccgaac ccgcccacgg 3111721 tgccacaccg aagaggtccc cctccgaggg gagcgccgac aacgcggcgc tgtgcgatgc 3111781 gcttgccgtc gaacacgcca ccatttacgg ctacggcatc gtctccgcgc tctcgccccc 3111841 tggtgtcaac ttcttggtgg cggacgcgtt gaagcagcac cgccaccgcc gagacgacgt 3111901 gatcgtgatg ctgtccgcgc gcggagtcac cgccccgatc gctgccgccg gttaccagct 3111961 gcccatgcag gtcagcagcg cggccgacgc ggcacgacta gcagtgcgga tggagaacga 3112021 cggggcaacg gcctggcggg cggttgtcga gcatgccgag acggccgatg accgggtgtt 3112081 cgcttcgacg gctctgaccg agagcgcggt gatggccacc cgctggaaca gggtgctggg 3112141 cgcctggccc atcaccgcgg cctttccggg cggggacgaa tagctacccg gtgacggccg 3112201 ctgcgatatc ggtggccagc gaggcgccgg caaccagctc gcgagtctga ccgctgaacc 3112261 ggtcgcgcag ctcgaccacg ccgtccgccc agccgcgccc cacgacaacg atccagggca 3112321 tacccaacag ctcggcatct ttgaacttga cgccgggcga tgcctggcgg tcgtccagca 3112381 acacctcaac ccccagccga tccagatcgg cggccagcgc ggtcgccccg gcgcgagcct 3112441 gcgcgtcctt gttcgcgatc accaggtgaa catcgaacgg cgcgaccgtc gacggccagc 3112501 gaaggcccag ctcgtcgtgg tgctgctcgg caacgacggc aaccaaccga gacacaccga 3112561 tgccgtagga acccatggtc aaccgcacag gcttgccatc ctcgccgagc acgtcggcgg 3112621 tgaaggcgtc ggtgtatttg ctccccagct ggaagatgtg cccaatttcg ataccgcgcg 3112681 ccatgaccag cggaccggcg ccgtcgggag atggatcgcc ttcgcgcacc tcggcggcct 3112741 caatggtgcc gtctgcggtg aagtcgcggc cggccaccaa accgacaaca tggcggccgg 3112801 gttggtccgc cccggtgatc cagctggtgc cgtcgactat ccgcgggtcg acgagatagc 3112861 ggacattgtt ctcccgcaac gcctttggcc cgatataacc cttaaccagg aacgggtgct 3112921 tggcgaaatc atcgtcgtcg agcaacgcgt agtcagccgg ttccagcgct gcgcccaacc 3112981 ttttgtcatc gacctcacgg tcgccgggca cgccgattgc cagcagttcg gtgtcccctc 3113041 ccggctgtcg gactttgatt aagacgttct tcagggtgtc cgcggcggtc accgtgcggc 3113101 cgagatcggc ctcgttggcc caggccacca ggctggcgat ggttggggtg tcgccggtgt 3113161 cgtggaccac cgcctcgggc agcccatcga tgggcagggt gtccgggcgg gcggtgacaa 3113221 ccgcctcgac gttggccgta taacccgact cgaggcaccg gacaaatgcg tcctccccgg 3113281 acggactctc agccaagaac tcttcggacg cactgccgcc catcgccccg gacactgccg 3113341 aaacgatgac atagcgcacc tgaagtcggt caaatatgcg ctggtaggcc tcccggtgag 3113401 cgcggtaggc cgccttcagc ccggcggcgt cgatgtcaaa ggagtaggag tccttcatga 3113461 cgaactcccg agcgcgcagg atgccggccc gcggccgcgc ctcgtcgcgg tacttggtct 3113521 ggatttggta cagcgtgagc gggaagtcct tgtaggagct gtactcgccc ttcacggtca 3113581 gggtgaacag ctcttcgtgg gtggggccca gcaggtagtc gttgccgcgg cggtccttga 3113641 gccgaaacac gctgtcgccg tattgggtcc accggttggt cgtctcgtac ggtgcccgcg 3113701 gcagcagggc aggaaatagg atctcctgtc caccgatggc gttcatctcg tcgcggatga 3113761 cccgttctat gttgcgcagc actcgcaggc cgagcggtaa ccagctgtac agcccgggcg 3113821 cgacgggccg gatgtagccg gcccggatca gcagtttgtg gctggccact tcggcgtcgg 3113881 cgggatcgtc gcgcagggtg cgcaagaaca actcggacat ccgggtgatc acaggcggca 3113941 agcctaattc gccgagcaga cgcaaaagcg cccaggtctg cccgaaaagg ggagctttta 3114001 tgactgctcg gcgggaaggg ttacagctcg ccggcgtcga tcgcttcctt gacctcctgc 3114061 gcatgggcaa cctgctgcgg cgtatacccg ataaacagcg ccataccgcc gacgatgatg 3114121 gccgctccgg ccacccacag caggccgtag gtgtaggcgt ggtcaagcgc ggccaactgc 3114181 acgtcgttca tgaacttcac cggaccggtg gtaccgccca ggtacagcgt gcgcgacgtg 3114241 atcacagcct ggatgacggc gagcaccagc ggaccgccca ggctctgcag catcagcgca 3114301 attgccgata ccggaccgat ctggtcgaag ccgacgccag cgatcgccga cagagtcagc 3114361 gggacgacgg ccatgccgat gccaatcccg ccgacgacga tcggcatgac caggttgggg 3114421 aagtagggca caccacggtg catgaaaaat gagccgtaca gcatggcgcc gaatagcaga 3114481 tatccgccgc cgatggtcaa cacccgtggc gaaaaccggg acaccagctg cgaggacaca 3114541 cctaggccga ttcccatcgc gatgacgaac gggatgaaac ctacgcccgc gcgtagcgcg 3114601 ctgtagccca agatgtcctg cacgtacagg ccgatgcaga cggtcaggct gaacatgacg 3114661 ccgccggcca acaggatcgc gctgaacgtg accaaccggt tgcggtcgcg gaacaagtgg 3114721 aacggcacga cggggttctc ggcagtgcgc tccacgatga caaacgcgac agcggccgcc 3114781 aaggccacca ggcccgaacc gatggtaatg cctgacatcc agcccttttc aggaccgatc 3114841 gagaaggcga aaaccgccgc ggtgcatgcc agcgtggcca gtatggcccc ggtggcgtcg 3114901 agcttcatcc gttctttgtt ggtttcccgt agggcggtgc gggccaggta gatcatcacc 3114961 agcccgatcg gcacgttcac caggaacgcc caccgccatg acacctcggt cagtgctccg 3115021 ccgaccacca gccccatcac cgacccgatc gcggtcatcg cggcgaacac cgccgtcgcg 3115081 gcgttgcggg caggtccctt ggggaacgtg gtcgccacca gcgccagacc ggtcggagat 3115141 gcgatggccg accccacacc ctgggacaac cgggcgatca ccaacgtcgc ctcgtcccag 3115201 gcgaccgcgc acagcaccga cgagatggtg aatagcgcaa cgccaacaat gaaggtgcgt 3115261 ttgcgcccga tggtgtcgcc aagccggccg ccgagcagca tcagcccgcc gaaggtcagc 3115321 acgtaggcgg tgatcaccca gctgcggccg gcatcagaca agctcagctc gttttgaatc 3115381 ttaggtagcg cgacgatggc gacggtgctg tccatggtcg ccagcagctg catcccgccg 3115441 atagcaataa ccgcagcgat aaagctgcgc gagggcagcc aagtcgggta gtacctgctg 3115501 gggcgctctg aagcggtctc ctccgagcgc ggcgggcgca tcggggccgg acggtgtggg 3115561 cgtccggccc tccagttacg gaccgcccgc tctgtgtcgt tgagagccgt catagcgggt 3115621 taccttacag tattcttaag aattgtttaa accccgaacg ccgctcaggc cgactacagc 3115681 cccgatcacg atgatcgcgg gaggtcggat ccccgccgcg cggaccttct ccggcgtgtc 3115741 ggcaagggtg gcccgcaacg tctgttgagc ggcggtcgtt ccgtgttgaa ccaccagtac 3115801 cggcgtatcc gcagttcggc caccctttag cagaacgtca acgaaaagct cgatgcgttc 3115861 gaccgccatc agcaaaacga tggtgcccgt caatgcagcc aatgcatccc aattcactaa 3115921 cgattcggga tgaccgggcg caagatggcc actgaccacc acgaattcgt gggtcatggc 3115981 ccggtgagtg actggaacgc ccgccatagc gggcacggct atggcactcg tcacacctgg 3116041 caccacggtg accgggattc cggcgtgggc acatgccagc acttcttcat agccccgggc 3116101 gaacacgaag gggtcgcccc ctttgagacg gaccacaaag ttgccggatc tggcccgttc 3116161 gatcaggaca gcgttgatcg cgtcctgggc catggcccgg ccgtaaggga tcttggccgc 3116221 gtcgatgact tctacgtgcg gcggcagctc ggccagcagt tcgggcgggg cgagccggtc 3116281 ggcgaccacg acatcggcct gggcaagcag ccggcgaccg cgaaccgtga tcagttcggg 3116341 atcgccggga ccgccgccga ccaacgccac tccgccgctg aggacgtcgg aactctgcgc 3116401 agtgatgacg ccctgctgca acgcctcccg gattgccgag cggatcgccg ccgaacggcg 3116461 gtgctcacca ccggcgagca cccccaccga caggcccgca tagctgaatg acgccggggt 3116521 caccgccgtc ccctccaccg cgatatcggc ccggacgcaa aagatccgtc ggcgctccgc 3116581 ctcggcgacg acagccacgt tcacccgcgc gtcatcggtg gccgcgatcg cataccaggc 3116641 gccgtcaagg tcgccgtcgc ggtagtcacg caccgacaag gtgatctggt ccatcgcctc 3116701 gacggcgggg gtgacgctgg gggcgatcac gtgcacgtcc gcgccactgg cgatcagcag 3116761 gggtaaccgg cgctgggcga ccgtgccccc gccaaccacg acgaccttct tgccagccag 3116821 ccgtaacccg accagatagg ggttctcggt cacccgccaa gcctagtggc gatcgcaagc 3116881 gcggggaccg ggcgccgcgg gtcgccacca tcagggccag tggcgatcgc aagcgcgggg 3116941 accgggcgcc gcgggtcgcc accatcaggg ccagtggcga tcgcaagcgc ggggaccggg 3117001 cgccgcgggt cgccacccct ttggccgcga atgtaacgcc actgcgaatt tccggcccgg 3117061 cttttcgcag tgccgttacg ctcgtggagt attgcaggcc gcatgtgcga cgaaacgcgc 3117121 caccgcaccg ggtgttgcgg ccggatgggt atgcaggtag gacgcgtgca cgccgctgtg 3117181 caccgcgccg tctcgcacgt cgtccacgtc ttggccctgg tacacccacg cgggctgata 3117241 gctatcggcg aatgtgactg cggttcggtg gaattcatgt ccaaccacgc gctcgccgac 3117301 ggagtacagc gccgaatcaa caaccgcgac ggcgtcgcga taacccagct tgagatgctg 3117361 ggtgaaccgc gccgatccgg ccaccacacc gcacatcggg tgtccgtcga gttcagaaac 3117421 cagatagagc aggccggcac attcggcatg caccggggcg ccggcagcgg ccagttcgtt 3117481 gatctgccgc cggacggtgt cgttggcgga caactcggcg gtgaactgct cggggaatcc 3117541 gccgggcaac accaccgcgt ccgtaccctc gggcagagtt tcgctgagcg ggtcgaactc 3117601 gaccacttca gccccggcgg cgcgcaacat ctcggcgtgt tcggcgtagc cgaaggtaaa 3117661 cgcccttccg gccgcgatgg caaccgtggc tggctggcgg gcggtgttgc cgacggcaat 3117721 caccgggtcc catggcgggt gggccgcctg gctcccggcg caggcgatca ccgcggccag 3117781 atcgacgtgg cgagcgacca cagcagtcat cgcctgcacg gcgagccgtg cgcgacggcc 3117841 gtactcgacg gcggtaacca gacccagata ccttgtcggc agctctagtt cagctgtgcg 3117901 tggaatggcg cccaagaccg cgacaccggc ctggtcacac gcctgtcgca gcacctgttc 3117961 atgtcgggcc gatccgaccc ggttgaggat gacaccggcg atccgagttg cggtgtcgaa 3118021 cgtggaaaag ccgtgcagca gtgcggcaac gctgtgactc tggccgcggg catcgaccac 3118081 caggatcacc ggggcgccaa gcagagcagc gacgtgcgcg gtggaccccg ctgcgggcgc 3118141 gcccccggca ggcccaatgc gcccgtcgaa cagccccagc accccttcga tcacggcgat 3118201 gtccgcgccc gcaactccat gcgcgtacag ggggccgata agccgctccc ccaccagtac 3118261 cgggtcgaga ttgcggccgg gccgtcccgc ggccagggcg tgatagccgg ggtcgataaa 3118321 atccgggcct accttaaacg gcgcgacggt gtgaccggcc tgccgcagcg ctccgatcaa 3118381 gcccgtcgcg atcgtggtct taccgctgcc cgacgcaggc gcggcgacgg ccaccgcgga 3118441 tacccgcatc accactcgat gcccttctgc cccttgcggc ccgcatccat cgggtgcttc 3118501 accttggtca tctcggtcac cagatcggcg gccgcaacca accgctgggg tgcgtctcgc 3118561 ccggtgatca ccacatgctg atggccaggc cgggctcgca ggacatcgac gacttcgtcg 3118621 acgtcgagcc aaccccactt cagtgggtag gtgaactcgt ccagcagata gaagtcgtga 3118681 cgttgcgtgg ccagccggag cgcgatctcg gcgcaaccgt ccgccgccgc ggccgcacga 3118741 tcgacgtcgg tgccggcctt gcgagacgta cgtgtccagg accagcccgc acccatcttg 3118801 tgccactcca ccgctccgcc gatcccgtgc tggtcgtgca gccggcccag ttgacgaaac 3118861 gccgcctcct cacccacttt ccacttagcg ctcttgacaa actgaaacac cgcgatgtcc 3118921 agaccagcgt tccacgcccg caacgccatt ccgaacgccg cggtcgattt tcctttgcct 3118981 tcaccggtgt gtaccgccag tatcggcatg ttgcgccggg cccgggtggt caggccatcg 3119041 ttgggcactg cgagcggatt gccctgcggc atgtgtggtt acctatccat cgtcaagcca 3119101 cgccacgcac ggcatgcact agataatccg cgtgcaactg ctccaaccga accaccggcg 3119161 cacccagctg acgagccagt tgcgctgcca aacccagccg tacatacgac gtttcgcagt 3119221 ccaccaccac cgcggccgcg ccctcggcga ccagcccggc agccgcggtt cggctgcggc 3119281 ccaacgggtc cggcccggcg gtggcccggc cgtcggtcag cacgaccacc agggggcgtc 3119341 gggcgcggtc gcgtaccttc tcccggatga tcagcgcacg cgcggccagc agtccctcag 3119401 ccagcggggt cttgccgccg gtgctgaatc gggccagtcg ccggccggcg atgtgcgccg 3119461 acgacgtcgg cgacagcaac agcgttgcct cgtgctggcg gaaggtgatc accgccacct 3119521 tgtccctgcg ctggtaggcg tcgcgcagca gcgacagggt ggcgccactg accgcagcca 3119581 tccggtcccg agcagccatc gatccggaag cgtcgacgac gaagatcacc agattgcctt 3119641 cgcgactctc gcggatggcc cggcgcacat cgtccggcca cgggcgcaac ggcccggctc 3119701 cgaacgcacg ctcgccggcg gccagcaggg tagcgaacag gtgcagtcca tgtgcgtcgg 3119761 ggtcgctgac ctcggcggcc gccaccacac tgcccgaggc gttgcgggcc cgagaccgtc 3119821 gccccggcgc gcccgtgccg acccccggga cccgcagcgc gcgggtccgg aatatctttg 3119881 acggcggcgc gctcgggcgc ggcgacgatc gcaagcgcgg cgaagccggg cgcggcgggt 3119941 cgtcgcccat cgagctcggc gcaccaggtt ctgtcgactt cgagcgtgag ttcggttgtg 3120001 aggcaggttc attggctgac tggccgcccc cgggcggatc gggctcgggc tctgggtcga 3120061 cgctcgccag cgccagcgcc tcatccagct ggtcgcggtc gatgccgtga tcgtcgaacg 3120121 ggtcgcgacg acgacgatgc ggcaacgcca gttctgctgc cgcctggata tcctgctcct 3120181 caacggtgcg gacaccacgc caggcggcgt gcgcggcggc ggtccgggcc actaccagat 3120241 cggcccgcat gccgtccacg tcgaacgccg cgcacaacgc agcgatgcgc cgcaactcgt 3120301 tgtcgcccaa caccacatcg tctaccgtgg cccgggccgc ggcaatccgg tgggccagct 3120361 ccgcgtcggc gtcggcatag cgtgcgacga acgcatccgg gtcggcttcg taggccatcc 3120421 gccggcggat gacctgtacc cgcacgtcga tgtcacgtga cgcctgcacg tcgacggtca 3120481 gcccgaaccg gtccagcagc tgcggacgca gttcgccctc ctccggattc atcgtgccga 3120541 tcagcacgaa acgggcctcg tgggaatggg agatgccgtc gcgttcgacg tgtacgcgtc 3120601 ccatggcggc ggcgtcgagc aggatgtcaa ccaggtgatc atgcagcaga ttgacctcgt 3120661 cgacgtagag cacgccgccg tgggcgcgag ccagcagtcc cggagagaac gcgtgctcgc 3120721 cgtcgcgcat cacccgctgc agatccagcg agccaaccac ccggtcttcg gtggccccca 3120781 gcggcagctc cacgaggccg gtctcggtgc tcccggtcgc gaccgacaac aacgcggcca 3120841 gcccgcgcac cgccgtcgat ttcgccgtgc ccttctcgcc acggatgagc gccccaccga 3120901 tctccggtcg cacggcacac aacaacaacg cgagccgcag ccgatcgtgc ccgacgatcg 3120961 cgctgaacgg ataaggcttc acggccgctc cacctgaccg gagccgggcc gcaacatggg 3121021 cacatgcggg atgccgtcgt ccaggaactc gtcaccgtcg cggacgaagc cgtgctgggc 3121081 atacatggcc gtcaggtagg cctgtgcatc aatccgacag gggtagtcgc ccacctcggc 3121141 cagtgccgcg cacagcagcc ggttggagtg gccctgtccg cgggcgtcgc gtttagtgca 3121201 cagccggccg atccggaaga ccttctcacc cccggcgtgc tcttccatca ggcgtagcgt 3121261 gcacgtcacc tctccgtcgg gcgtttccaa ccagaaatgc ctggtctcgg caagcaggtc 3121321 acgcccgtct agctccgggt atgggcaggc ctgttcgaca acgaacacct ccaccctcaa 3121381 cttgagcagc tcgtaaaggg cccgggcgtc aaggtctttg gcccagacgc ggcgcagtgc 3121441 ttcggtcata agcgccgctc tcccccgcaa gcgggcggta cccccactgt atcgtcgccg 3121501 gcgcgggtca tgcggcacct aacttcagcg ccttggtgct ccatgaccac acctcgtcga 3121561 acagcgcggg ttcattcgac agctgcaccc ccagcgacgg caccatttct ttgagcgtgg 3121621 gcagccagga ttgatagcgg ttggcaaagc atttctgcag cacgtccagc atgatcgcca 3121681 ccgcggtcga agcccctggg gagccgccca gtagtccggc aatactaccg tcagcatcgc 3121741 cgatgaccgt cgtgccgaac tcgagcaccc cgccgttgcg ttcatctcgc cggatcacct 3121801 gtacccgctg accggctatc gtcaactccc agtccgaatc gattgcgcta ggggcgaatt 3121861 cgcgcagcgc actgacccgc tcgggttcag agagacgcag ctggctgatc aagtagttca 3121921 gcagtctccg ctcggtgagg cccacgccga gcacggacaa cagattgtcc ggcctgatcg 3121981 accggggcag gtcgctgatc tgcccgtgtt tcaagaactt cggcgaccag ccggcgtatg 3122041 gcccgaacac cagccacgac ttgccgttga caaaccgcag atccagatgc aaggcgccca 3122101 acggcggggc gcccggcgcc gggaagccat atacctttgc ccgatgcgag gcggtgagcg 3122161 ccgggttccc ggcgcgcagg aaccgaccgc caatcgggaa gccggcgaag cctttgacct 3122221 ctttgatccc ggatttctgc agcaccggca aggtgtcacc cccggccccg acaaagacga 3122281 acttggtgtt caacttgcgc ttttcgccgg tccggcggtt gcacatggtg accgtccagc 3122341 tgccgtcgga ttgccgcgag aggttgcgaa cctcgtgccc gaacaacgcg gtagtgccat 3122401 tttgcacgca atagccgatg agttgtttgg cgagggcacc gaagtcgacg tcggtgccgt 3122461 cggcggccca gttgagcgcc accggctcgg agaaggcccg tttagcggcc atgaacggca 3122521 gccggcgggc gaattcgtcg ggactctcga tgaactcggt gccggcgaac agcgggttgc 3122581 cggccaacgc cttttggcgg cgccgtagat actcgacgcc ccgcgatcca tggacgaaac 3122641 tcacgtgcgg cacagggttg aggaagctgc gcacgtcggt gaggatgccg ttttcggccg 3122701 cgtatgccca gaactggcgg gtgacctgga attgctcgtt gacacgcacc gctttggtga 3122761 tgtcgatcga gccgtccggc atttctgggg tgtagttcat ctcgcacagc gcggagtgcc 3122821 cggtgccggc gttgttccag ggaccgctgc tttcggcggc taccgcgtcc agccgttcga 3122881 tcagggtgat tgaccagttc ggttcgagcc gacgcagcag cacccccagc gtggcgctca 3122941 tgatgcccgc accgatcagc acgacgtcgg ttctggctag gtctgacacc ggacggttgg 3123001 ttccttcctt ggctgcgccg ctcccaggtt atcccgacgg gtgttaacac gatgacgtcc 3123061 gcctcctggg ccagtaaccc tgtgcagcgc ggggcagcca acccaagaca attaccccga 3123121 agcccacaat gtgcgtccct ggccgccata gaatccgcac tatccgccca gtccggttct 3123181 tcttgggagg taacgatgtt gtatgtagtt gcgtcacccg acttgatgac cgcggcggct 3123241 accaatctgg cggagattgg ttcggcgatc agcacggcaa atggtgcggc ggcactcccg 3123301 actgttgagg tggtggccgc ggccgccgac gaggtgtcca cgcagatcgc ggctctattc 3123361 ggagcgcatg ccaggagcta ccaaaccctc agcacccagg cagcggcgtt tcatagtcgg 3123421 tttgtgcagg cgttgaccac ggccgcggct tcctacgcca gcgtagaggc cgccaacgcg 3123481 tcgccacttc aggttgcgct agacgtgatt aatgcgcccg cccagacact gctcggacgt 3123541 ccgctaattg gtaacggcgc cgacggatcg acaccggggc aggccggcgg gcccggcggg 3123601 ttgctgtacg gcaacggcgg taatggcgcc gccggtgggc ccaaccaggc cggcggcgcc 3123661 ggcggcaacg ccggcttgat cggcaacggc ggggcgggcg gcgccggggg tgttggcgcg 3123721 gtcggcggta acggcggcac gggcggcctg ctattcggca acggcggggc cggcgggcaa 3123781 ggcgggctcg gcctcgcagg tatcaacggc ggcagcggcg ggcagggagg ccacggtggc 3123841 aacgccatcc tgttcggcca gggcggtgcc ggcgggccag gtggcaccgg cgccatgggc 3123901 gtcgccggca ccaatcccac ccccatcggc accgcagcgc ctggcagcga cggcgtaaat 3123961 cagattggga acggtggtaa cacggacctc accggcggcg ccggtggcga cggcaatgcc 3124021 ggcagcacca ccgtgaacgg cggcaacggc ggtaccggcg gcgcagctag gaactcatct 3124081 ggtggtaccg gtaactcctt tggtggtgcc ggcggcgccg gaggcgacgg cgccaacggc 3124141 ggcgacggtg gcgctggcgg ggaagccctc accgaaggcg gtgccaccgc cgttagtggt 3124201 gctggtggta agggaggtaa cgccgaggct tccggcggcg ccggcggcaa cggcggcaaa 3124261 ggtggctttg ctcaggccac caccagcgtg accgggggta acggcggtaa cggtggcaat 3124321 ggccacgaca gtaacgcgcc gggcggcgct ggcggcagcg gtggcgtcgg cggtgacggc 3124381 ggccgtggcg gcctgctggc cggcaacggc ggcaccggcg gtgccggtgg caacggcggt 3124441 accggtggcg ccggtgcccc cggcggtgcc ggcggcgccg gcggcaaagc cgacatcgcc 3124501 aacagcctcg gcgacaatgc caccgtaacc gggggcaatg gcgggacagg cggagacggc 3124561 ggcagcgcgc tgggcaccgg gggggctggg ggtgccggag gtctaggtgg tcacgggggt 3124621 gcaggcgggc tgctgattgg caacggcggc gccggtggcg ctggcggcct cggcggtgcg 3124681 ggcggcgccg gcggtgcggg cggtgagggc ggtgccggcg gcgccggagg cgaagctatt 3124741 cccggcgggg cgtccaccaa ctccgccggc ggtgacggag gggcgggcgg tactggcggc 3124801 aatggcggtg acggcggtgc cggcggagcc cccggcctcg gtggcgcggg cggggccggc 3124861 ggatggttga tcggccagtc gggcagcacc ggcggcggtg gcgccggcgg tgccggtggt 3124921 gccggaggtg ccggtggcgc gggcggcagc ggcggtgcgg gtggccatgg cgacactacc 3124981 tccggcaaga acggttcgtc tggcaccgcg ggcttcgacg gcaaccccgg gcagcccggc 3125041 tgagcggcac aagatctgaa cgcgctctaa gctgaccccg tgactggctg ggtgcccgat 3125101 gtgctgcccg gctattggca gtgcacaatt ccgctcgggc cggatcccga cgacgagggc 3125161 gacattgtcg caaccctggt cggccgcggt ccgcaaacag ggaaagcccg cggagacacc 3125221 actggggcac accacacggt cctggcggtg cacggctaca ccgactactt cttccatacc 3125281 gagctggccg atcacttcgc caaccgtggc ttcgcgttct atgcacttga cctgcgcaaa 3125341 tgcggccgat cgcgagcgcc cggccagacg ccgcacttca tcaccgacct ggcccgctat 3125401 gacaccgaac tcgagcactc cctgtccatc atcaacgagc agaaccgctc ggcgaaggtc 3125461 ctggtatacg gccactccgc cggcgggctc atcgtgtcgc tgtggctgga ccggttgcgc 3125521 cagcgcggcg agatcacccg cgcgggggtc accggcctgg tgctcaatag cccgttcctg 3125581 gatctgcaag gcccggcaat cctgcgcctg ccgctgacct cggcgttctt cgccgcgatg 3125641 gcgcgaatgc gccccaagtg ggtagcccgg ccaccaaaag aaggcggtta cggttgcacg 3125701 ctgcaccggg actatgatgg agagttcgac tacaacctgc aatggaaacc ggtgggcggt 3125761 ttcccggtca ccttcggctg gattcatgcc agccgtcgtg gccacgcacg gttacatcgc 3125821 gggatcgacg tcggtgtgcc caacctgatc ctgtgttcgg atcacacggt acgggaaaag 3125881 gccgacccgg cgaccctgca ccgcggcgat gcggttctcg acgtcaccca tatcacccgc 3125941 tgggccggct gcatcggcaa ccgcagcacc gtcatcgcgg tggcggacgc caaacacgat 3126001 gtgttcttgt cgctgccgca accgcgccag atggcttatc gccgactgga tctctggttg 3126061 gacgactacc tcggcacaca caacgacacc gacgcttcgg catcgtcggg gaaagggtga 3126121 tggcccctac aaatggaaac gtacgacatc gcgatcatcg gaaccggttc gggcaacagc 3126181 attctcgacg aacgctatgc cagcaagcgg gcggcgatct gcgagcaggg caccttcggc 3126241 ggcacctgcc tcaatgtcgg gtgcatcccc acaaaaatgt tcgtctacgc cgccgaggtg 3126301 gccaagacca tccgaggcgc gtcgcgttac ggtatcgacg cgcacatcga ccgggtgcga 3126361 tgggacgacg tcgtctcgcg cgtcttcggg cgcatcgatc cgatcgcgct gagcggcgag 3126421 gactatcgaa ggtgtgcgcc caacatcgac gtgtaccgca cacacacccg tttcgggccg 3126481 gttcaggccg atggccgcta cctgttgcgc actgacgcgg gtgaagagtt caccgccgag 3126541 caggtggtga tagccgccgg atcgcggccg gtgattccgc cggccatcct cgcgtccggc 3126601 gtcgactatc acaccagcga taccgtcatg cggatcgccg agttgccgga gcacatcgtg 3126661 atcgtcggaa gcggcttcat tgcagcggaa ttcgcacatg tgttttccgc tctgggcgta 3126721 cgggtcaccc tggtgatccg gggcagctgc ttactacggc attgtgacga caccatctgc 3126781 gaacggttca cccgcatcgc atcgaccaaa tgggagctgc gcacccatcg caacgttgtg 3126841 gacggccagc agcgcggctc gggcgtcgcg ctgcggctag acgatggttg caccatcaac 3126901 gccgacctac tgttggtagc gacaggccgg gtgtccaacg ccgacctgct ggatgccgag 3126961 caggccggtg tcgatgtcga ggacggccgg gtgatagtcg acgagtacca acggacttcg 3127021 gcgcgtgggg tttttgcgct gggcgatgtc tcgtcgccgt acttgctcaa gcatgtcgcc 3127081 aaccacgagg cccgcgtcgt gcagcacaat ctgctctgcg actgggagga cacccagtcg 3127141 atgatcgtca ccgaccaccg atacgtaccg gctgcggtat tcaccgatcc tcagatcgct 3127201 gccgtcggac tcactgaaaa ccaagctgtg gcaaagggac tcgatatttc ggtcaagata 3127261 caggactatg gtgacgtcgc gtacggctgg gcgatggagg acaccagtgg aatcgtcaag 3127321 ctcatcaccg agcgcggctc tgggcgctta ctgggcgcac acatcatggg ttaccaggca 3127381 tcctcgctca tccaaccgtt gatccaggcg atgagctttg ggctgaccgc cgccgaaatg 3127441 gcccgcggcc agtactggat tcatccggcg ctgccggagg tggtggaaaa cgcgctgctt 3127501 ggcctgcgtt gaccgcaacg gcgagccgtc gtccggcaag cgatttgcat cccgtcagcg 3127561 ccttacctac agtcgggaca tcgcgttctg ccccgtgctg gaaggaccga catggccagc 3127621 agccagctcg acaggcagag gtcgcggtcg gccaaaatga accgcgctct gacagcagca 3127681 gaatggtggc gtctgggcct gatgttcgcg gtgatcgtcg ccttgcatct ggttggctgg 3127741 ctcaccgtga cgctcttggt ggagcccgcg cggctcagct tgggcggcaa ggcattcggc 3127801 atcggcgtcg ggctgacggc gtacacgctg ggcttacggc acgcgttcga cgccgaccac 3127861 atcgccgcca tcgacaacac cacccgcaag ctgatgagcg acggacaccg accccttgcc 3127921 gtcgggttct tcttttcact gggccactcc acggtggtct tcgggctggc ggtaatgctg 3127981 gtgaccggac tcaaggctat cgtcggaccg gtcgagaacg actcctcgac gctgcatcac 3128041 tacacaggct tgatcggtac cagcatttcc ggcgcgttcc tgtatttgat cggcatcctc 3128101 aacgtcatcg tcctggtcgg catcgtgcgt gtcttcgccc acctgcgccg cggcgactac 3128161 gacgaagccg aactcgaaca gcagttggac aaccgcggac tgctcatccg gttcctcggc 3128221 cgcttcacca agtcactcac caagtcctgg catatgtacc cggtcggatt tttgttcggt 3128281 ctcgggttcg acaccgccac cgagatcgcg ctgttggtgc tggcgggaac cagtgccgcg 3128341 gccggcctgc cctggtatgc catcctgtgc ctgcccgtct tgttcgccgc cggcatgtgt 3128401 ctgctggaca ccatcgacgg ttcgttcatg aatttcgcgt acggctgggc cttctccagc 3128461 cccgtgcgca agatctacta caacatcacc gtcaccggac tgtcggtggc agtcgcactg 3128521 ttgattggca gcgttgagct gctgggcctg atcgccaacc agttgggttg gcagggcccg 3128581 ttctgggact ggcttggcgg cctcgacctc aacaccgtcg gcttcgtcgt cgtcgcgatg 3128641 ttcgcgctca cctgggccat tgccctgctg gtctggcact acggccgcgt tgaagagcgg 3128701 tggaccccgg cgcccgaccg cacaacttga cctcgggcga tcaaccctag ggcggtgccg 3128761 ccggaatcga gacggtagcc aagcgagcgg tcgacgtgtt ggaaaagatc ttcgccgaga 3128821 acgatgtccg cgcgaacgtc aaccgggcgg cgtttgagaa caacgggatc cgcgcgctgg 3128881 acctgatgag ctcaccgggg tcggggaaga cgaccgtgct gggcgccgcg ctcgacgagc 3128941 acgccgacca attcgcaatc ggcgttatcg aaggcgacat caccaccgac ctggacgcgg 3129001 ccaatggccg cggcacccag gtgtcgctgc tgaacaacca gcatggcttt tgcgccgaat 3129061 gccacctcga cgcacctatg gtcaaccgcg ccctagctgg tgcgcccgac ggagttcgac 3129121 gtcggtaagc gccaaggcga tggtctcctc ggtcaccgag ggcaaggaca agccgctgat 3129181 gtacccggcg acgttccgct cgagggatgt agtgctgctc gacaagatcg acttggtgcc 3129241 ctttctggac gccgacgtgg acgcgtatat cgcgcatgtc cgcgaggtca acgcagccgc 3129301 gacgatcctg ccgaccagca cgcgcaccgg agccggcatg gggtcctggt catgagccgc 3129361 cggaaacggc tcgtctcatc ggctttcacg gtgaggccac cgcagccgaa atggacaacg 3129421 ttgatcgtct tccgggcctg acagcaatcc gactgtgaaa tgcactacgc gacacgctaa 3129481 cccgttgcgc agttcacact cggggcgcga tcacagcgga gtgacatagg ccgagctgat 3129541 cccaccgtcg accaggaacg tcgaagcggt gatgaatgat gcgtcgtcgc tggctaaaaa 3129601 cgctaccgca gcagcaattt cgtcgggctc ggcgaaccgg cccagcggca catgcaccat 3129661 gcggcgagcg gcccgttccg ggttcttggc gaaaagctct tgcagcagtg gggtgttcac 3129721 cggccccggg cacaacgcgt tgacccggat gccctgccga gcgaattgca cgcccagttc 3129781 ccgtgacata gccagcactc cacccttgga ggcggtgtag gagatctgcg acgttgccga 3129841 acccatcacc gcaacgaagg acgccgtgtt gacgatggag cctttcccag caagcaccat 3129901 gtggcgcagg gccgcccggc agcacaagta caccgacttc aggttgacgt cttgtacccg 3129961 ttgccacgcc gcgagctcgg tgttttcgat cagattgtcc tcgggtggtg agatgccggc 3130021 gttgttgaac gcaatatcta tgcggccgta ggtttcggct gctccgtcga acagcccgtt 3130081 gacggcgtcc tcatcgcaaa cgtcggttgg cacaaacaag cctgatagtt cgtcagcggc 3130141 cgcaccaccg gcctcgacgt cgacgtcgcc gaccacgatc gtggcgcctt ccgcccgcat 3130201 ccgacggccg gcagccaggc caataccgct gccaccgcct gtgatcaccg ccacccggcc 3130261 ggccagccgt tggctgaggt ccatcacatc tcctccccga cggcgatgaa cacatttttg 3130321 gtttcggtga actgcagcgg agcgtccggc cctagctcgc ggcccacacc ggactgcttg 3130381 aaaccgccaa acggggtgtt gaagcgcacc gacgagtgcg agtttaccga caggttgccg 3130441 gattcgaccg cccgcgccac ccgcagcgcg cgggacaggt catcggtcca gatcgatccg 3130501 gacagcccgt acgcggtgtc gttggccagg ctgatagcgt cggcctcgtc gtcgaacgtc 3130561 agcactacaa ccaccggccc gaagatttcg tcggtgacgg tgcggtcgcc gcgtttgggt 3130621 gtgagaacgg ttggtggaaa ccaaaatccg cgcccagccg gagccgtacc ccgaaacgcc 3130681 accggagcgt cgtcgggcac ataaccggcg accttgtcac ggtgtgcgcg cgataccagc 3130741 ggacccatct cggtggcgcg tgatccgggg tccccgacga caatgctgtg taccgccggc 3130801 tcgagcagct ccataaaccg gtcgtaaacg ctgcgctgca ccaggattcg acttcgggca 3130861 cagcaatcct gcccagcgtt gtcgaagacc ccggccggcg cggtcgtcgc ggcgcgctcc 3130921 aggtcgcagt cgtggaagac gatgttggcg ctcttgccac ccagttccaa cgtcactcgt 3130981 ttgacttgag ccgcggcacc ggccatgacc cgcttgccga cttcggtgga cccggtgaac 3131041 acgatcttgc gaatgtcggg gtgggtgacg aaccgctccc cgaccaccgt gccctttccc 3131101 ggcaacacct gcagcaggtc ttcgtccaga cccgcctcga cggccagctc accgagccgc 3131161 atcgtggtca gcggcgtcag ttcggcgggt ttgaccagca ccgcgttgcc ggcggccagc 3131221 gccggcgcga tggcccagga cgcgatcacc atcgggaaat tccatggcgt gatcacaccg 3131281 accacgccca tcggttcgtt gaaagtgacg tccaccccgc cggcaacggg aatctgcctg 3131341 ccggacaacc gttccgggct ggcggcatag aacgccaaca cgtcacgcac gtggccggct 3131401 tcccactcgg ccgacacgat gggatgtccg gaattggcta cctcgagcgc ggccagttcg 3131461 tcgaggtggg cttgcacggc tgccgcgaat gcgcgcaggc cggccgcccg ctgcgccggt 3131521 gccaaccgtg cccagcgccg ctgcgctgct cgcgcgcgtt gcacggcgtc gtccaccgcg 3131581 ttggcgtcgg tgtggtcaac tgaggccagc acttcctcgg tggcgggatt gatcagttgc 3131641 gtggtactca tcgtggctcc gcttggctct gccggcccgc gtatccgctg gcggcgtcca 3131701 ccaacgcctt aaacagccgc agatcgtcca acgacttctc cggatgccac tgcaccgcta 3131761 gtacgaacgt gtccccaggt agctccagcg cctcgattac cccgtcgaca tccaccgcac 3131821 tgaccaccag gccctcaccg acctggtcga tggcttggtg gtggtagcac ggcacgtcgg 3131881 cggattcgcc gatcagctcg gccaaccggg tgcccgatgc ggtgtggacc ggcaacctgg 3131941 tgaagacccc gttgcccgcc cgatgcccgc tatggccaag gatgtcgggc aggtgctggt 3132001 gcagcgtgcc gccgagcgcg acgttgagca cctgggtgcc gcgacagatg cccaacacgg 3132061 gcatcccccg ctgaagcgcg ccccgcaata gcgcgaactc ccaagcgtcg cggcccgggc 3132121 gagggtgatc ggtggccgga tgcggctcct ggccataagc tgccgggtcc aggtcgtagc 3132181 ccccggtgat caccagagcg tgcaggctgt ccagcacgca gccgacgctc tcggggtcga 3132241 ccggctgcgg cggcagcagt accgcaacac ccccggccat ggtgatgcct tcgaagtaat 3132301 cggcgggcag ataacccgca ggaatatccc aaaccccggt gcgcacctgc tccagataag 3132361 ccgtcaggcc aaccaccggg cgactcgcgc ccagtggcga tcgcaagcgc ggcgaagccg 3132421 ggcgcagcgg gtcgccacca tcggacacag gcgatcgcaa gcgcggcgaa gccgggcgca 3132481 gcgggtcgcc accatcggac acaggcgatc gcaagcgcgg cgaagccggg cgcagcgggt 3132541 cgccaccatc ggacctagag gcgctcaaat ccacgtatcc tctcccaatc ggtgaccgcc 3132601 gcgttgaacg ccgccagctc cacacgcgcg ttgttcaggt agtgcgcgac aacatcctcg 3132661 ccgaacgcct cgcgcaccag cgcagaatcc tcgaacagca ccgcggcgtc ggccagcgta 3132721 accggcagcc gttcgacatc ggcgccttgg taggcgttgc cgacacaggg ctcgggcagc 3132781 tgaaggcccc gctcgatacc gtacaaccct ccagcaatga gagccgccac cgccaggtac 3132841 tggttgacat caccgccggg aacccggcat tcgacccgga tgttttgccc gtggccaacc 3132901 acccgcaggg cgcaggtgcg attgtccagc ccccaagcca gcgccgtcgg cgcgaaactg 3132961 ctatcggcaa atcgcttgta ggagttaatg gtcggcgcat agcacagcgt gaattcgcgc 3133021 aacgtggcca actggccggc gacgaagctg cggaacatcg acgacatgcc gtgcggcccg 3133081 ttactgtcgg caaacaccgc ggagccatcc gtgccacgca gcgagacatg gatgtgacag 3133141 ctattacctt cgcgttcatc gtatttcgcc atgaacgtta ggctcttgcc gtgctggtcg 3133201 gcgatttcct tggcgccgtt cttgtagatc gcatggttgt cgcaggtgac cagcgcctcg 3133261 tcgtaacgaa acccgatctc ctgctggccc atgttgcatt cgcctttgac cgcctcgaat 3133321 cgcagacccg caccggccat acccaaccgg atgtcgcgca gcaacggctc catccgcgag 3133381 gatgccaata tcgcgtagtc gatgttgtag tcgctggccg gggtcagccc gcgatacccg 3133441 ctggcccatg cctggcgata cggctggtcg aacacgatga actccagctc ggtggccaca 3133501 tcggcgacca gtccgcgcgc cttgagccga tcgagctgac ggcgcagaat gctgcgcggc 3133561 gagacggcga cctcgctgcc gtcggcccag accaggtcgg cgatcaccag cgccgttccc 3133621 ggtagccaag gaatcagccg cagagtggac aagtccggcg tcatcaccat atcgccgtag 3133681 ccggtgtccc aactggccat cgcatagccg ggcaccgtgt tcaggtcgac gtccacggcc 3133741 agcagataac tgcagcactc gacgccgcgg gtggctatgt cgtcgacgaa atgccggccc 3133801 gatatccgtt tgccggccag ccggccctgc atgtcggtga acgcgacgat gacggtgtcg 3133861 acgtcaccgg ccgcgaccag tcgctccaac tcggtccacg ccaacggcgg cgaaccgggg 3133921 ccggtcaccg cacttcctcc cacaccatgg ccgctagtca accatctata ggctccgggc 3133981 ccacatgctg gctgtcgcgg gcaccgcgaa ccgccggagc cggcgagtag acgcgaaaga 3134041 acatgatggg cgctggtgcc catcatgttc ttttgcgcct actcgcgcta cagacaggtc 3134101 aggatctcga cgccggtatc ggtaaccagc agggtgtgtt cgaactgtgc ggtccacttg 3134161 cggtccttgg tgaccaccgt ccaaccgtcg tcccagattt cgtagtccag tgcgcccaag 3134221 ttgatcatcg gctcgatggt gaaggtcatc cccggctgca tgatggtctc gacagcgggc 3134281 tggtcgtagt gcaagacgac cagcccgttg tggaacgtcg tgccgatgcc atgaccagtg 3134341 aagtctcgaa ccacgttgta cccgaaccga tttgcatacg actcgatgac acgaccgata 3134401 acggacaacg cccgcccggg cttgacggtg ttgatcgcac gcatggtcgc ttcgcgggtc 3134461 cggtcaacga gcaaccggtg ttcgtctgcg acatcgccgg ccggaaacgt cgcgttggtg 3134521 tcaccgtgca ccccaccgat gtaggcggtg acgtcgatgt tgacgatgtc gccgtcggtg 3134581 atcaccgtcg agtcggggat tccatggcag atgacctcgt tgagggacgt gcagcacgac 3134641 ttcgggaatc ccttgtagcc cagcgttgat gggtaggcgc cgttgtcgac caggtattcg 3134701 tgcgcgatcc ggtcgagttc gtcggtggtt accccgggcg cgaccgcctt gcccgcctcg 3134761 gccaacgcac ctgcggcgat ccggcctgcc acgcgcatct tctcgatgac ctcaggtgtc 3134821 tgcacccacg gctcgctgcc ctcttgggcg gccggtttgc cgacgtattc ggggcgcgcg 3134881 atccagttgg gcaccggccg tgtcggggac agcacgccgg gggagagcgc ggtacgacta 3134941 ggcatcccgc tagcttagcc gggcaaattt tggccgcgcc cggctatcag ccccggtgtc 3135001 ggcgcagcag tgcgcgccgc ggtcccttga tgaccaccga cccgcacacc atccgaccgg 3135061 tcaacacgac atgcggtgtt ccttccgccg gtgcgtcctt gcggcggtcg ctcgcgctac 3135121 ccacatagac ctcgacgtcg tcgatcgacg cactggcgcc gttgggcagc cggacctcaa 3135181 gtgagccgaa catcatatcg agttcgatca ccaccaccgg ccccgcgaaa cgggccttga 3135241 cgaggtcgag ttcgattgac cccagccgac gcaccagcgc cagccgggtg ggcacgatcc 3135301 attcgccgtg gcgtttcagg gagccggccc agccgcgcag ctccacccgg tcggccgcgg 3135361 acgtgacgat cgcgccaggc ctgggcaggt caccgaccag cccatccagc tcgcttcgcg 3135421 tacgcgcgaa ggaaacccgt gacgagcgct gctcgaactc gtcgatgttg ataagcccga 3135481 gcgccacggc gttgtgcagt cgtcgcattg tgccgttggg gtcggcgtcc gagacccgca 3135541 acgccaccat gtccccaccg gtctccgtca tggcccattc ccgagagttc tggcacggct 3135601 tcaacggcga acttcgccta ccccccgcaa cttaccgctg ttgaaaggcc gccgaaaacc 3135661 tagcagttta ggtaatcctt tccgacgaag agcgggaggc gttccggcag caagccgcag 3135721 cccagcagat gtccctcagt aactggctgc gtcaagcggg gctcaggcag ctcgaggcac 3135781 agcgacaacg tcccctgcgc accgcccagg aattgcgcga gttctttgcg tcacggcccg 3135841 acgagacagg ggcagaacct gattggcagg cgcatctgca ggtgatggct gaatcgcgcc 3135901 gtcgcggcct gccggcgcca tgatcttcgt cgataccaac gtcttcatgt atgcggtcgg 3135961 tcgcgatcac ccattgcgga tgcccgcccg tgagttcctc gagcacagcc tcgaacacca 3136021 agaccgcctt gtcacgtcag ccgaggccat gcaggaattg ctgaacgcgt atgtgcccgt 3136081 cgggcggaac tcgacgctgg actcagcatt gaccttggtg cgggcgctga cggaaatctg 3136141 gcccgtcgag gcggccgacg tcgcgcatgc gcgaaccctg caccaccgcc accccggtct 3136201 gggcgcgcgc gatctgctac acctggcatg ctgccagcgt cgcggtgtca cgcggatcaa 3136261 gacgttcgac cacacactgg ccagcgcatt ccgatcatga cgcgtccgtg tgggcgcgag 3136321 cgtccgcagt tgtacggccc taacggcgtg tcgtcgtaca aacgaggagg ggcgagccgc 3136381 gctacgccag gtaccccggc ggcagcgatt cgaacatcac cttggtcatc cgcaccgcgt 3136441 attccgagct accgcccccg acgatcagcg acgcaaatgc cagatcgcca cggtacccgg 3136501 cgaaccagga atgcgatccg cccgggaatt cggcttcgcc ggtcttaccg aacacctcgc 3136561 cacagccagc gatctccttg gcggtgccat tggtcaccac caaccgcatc atgggccgca 3136621 gcgcgtcgat catcttctgg ctgatcggtg tggcatcgcc ttcgacggcc gtcggccggc 3136681 cggcgatcag ctgtggaacc ggggtcttcc cggcggctac cgtcgccgcc accaaggcca 3136741 tgccgaacgg gctggccagc accttgccct ggccgaaacc gtcctcggtg cgttcggcca 3136801 ggtccaccgt cggcggcacc gaaccggtca ccgtggtgat gccgtccacc tggtagtcaa 3136861 gcccgatccc gtaccgccgg gccgcctgag tcagaccgcg gggaggcagc ctgctgctca 3136921 gctcggcgaa ggtggtgttg caggaactgg caaacgcgcg tgacatcggc accacgccca 3136981 gatcaaagcc accgtagttg ggaatggtgc gatgcccgat gtcgatctcc ccggggcaac 3137041 ccagcagcgt ctcaggggta gccaggtcac gctcgacggc cgcaccggcg gtgatcatct 3137101 tgaatgtcga cccgggtgga tatagaccgg tggtcgcgac cggaccgtcc gcatcggccc 3137161 cggcgttctg cgcgatcgcc aggatctcgc cggtcgacgg cttgatcacg acgatcatcg 3137221 ccttgccgcc ccgggtgttc accgcgtgtt gcgcggcgtt ttgcacgacc cgatccaacg 3137281 tgatcgaaac cgacgacgca ggtgatgggg cgacctcgtg cagcaccgag acgtcgacgc 3137341 cattttggtt gacgctcacc acccgccaac ccgccttgcc gtcgagttca tcgacgacgg 3137401 ccttcttgac atcgttgagg accgccggcg cgaagtgctt gtcggtcggg agcagctcgg 3137461 cctgcggtgt aatcaccacg ccaggcagct gcccgatcgc cgcggccacc cggttgctgt 3137521 cgtcggcgtg caacgtgacc aggtccaacg gctgggtcga cgagctggcc tgttcggcca 3137581 gcagctgcgg atcattgagc gtgtcgtcga aggggtgcag cgcgcccacc accgcgtgtg 3137641 ccgtgccgaa gagctcgcgg ccggcctggc cggcgtccag cgagtagtga tacagatagc 3137701 ccggcaccag cacatcggtg ccgccgactt cgttcaccga ggcgcgccgc ggcgggtcgg 3137761 ctcgtagcgc gaacgtttga tgttcgccta gcttgggatg caacccgctg gtggtccagc 3137821 gaacgtgcca acgcccttcg tcgcgggcca tcttcagctg gccgtcatag gtccagattc 3137881 ggtccttggg cagatgccag ctgaagcgat aagcgaccgt accggtgtcc tcggcgtact 3137941 tggcgctgag aacctgcgca tccaggtggg cggcctgcag ccccgcccag gccgcgttca 3138001 gcgcttcgcg cgcctcgttg gggttgtcgc tgagctgggc ggcggaggcg gtgtcaccga 3138061 tggccagcgc ggcgaagaac ttttcggccg ccggaccggg cccttgggga cgcggggtgc 3138121 agcccgacat ggcgacgacc gcaagcagca gcaaacctga ggtggctgag gctaatgttg 3138181 ttttagttac catcgttgct gatgttaaga actgtgacgg agacaccggc cgcgacacac 3138241 cgagaccgaa ccgttacgcc gagactaggt cgcgaatgga acaccaccgc gaaaatcgtg 3138301 gccagaaatc gcaaccacgt tacgctcgcg accgctcaat cgagcaaggc gccgaccgca 3138361 agcaccagca aacctgagac gccgcgcaca aagtgcgaaa ccactggaag gtgagcccta 3138421 atttagggct gagcaggacc tgtataacgg cctagtatgg cggtatgcgg atactgccga 3138481 tttcgacgat caagggcaag ctcaatgagt tcgtcgacgc ggtctcgtcg acacaggacc 3138541 agatcaccat caccaagaac ggtgcacccg cagccgttct ggtcggcgcc gacgagtggg 3138601 aatcgttgca ggagacgctg tactggctgg cgcaacccgg aatcagggag tcgatcgctg 3138661 aagccgacgc cgacattgcc tccggccgca cctacggcga agacgagatc cgcgccgaat 3138721 tcggcgtccc gcgacgcccc cactgagcgg tgccttacac cgtgcggttc accacaaccg 3138781 cgcgtcgaga cctccacaag ctgccaccgc gaatcctcgc ggcagtggtc gaattcgcgt 3138841 tcggcgatct gtcgcgcgag cccctgcggg tgggcaagcc ccttcggcgc gagttggccg 3138901 gcacgttcag cgcgcgtcgc ggaacgtacc gcctgctgta ccggattgac gacgagcaca 3138961 caacggtagt gatcctgcgc gtcgatcacc gcgcggacat ctaccgccga tagcaactca 3139021 ccgacgggcg ctctgccgtc cgacggcagc catgactgag atcggtcggc cgggcggctc 3139081 cgaaaagacc tgaacagaac ctcaggattc ctatgctccc aatgtggcgg caatcacgaa 3139141 gaagctaatc ctcggccaga tccgggaagt ggctgaggcg aacgacggcc gaccgcccgg 3139201 ctgtgagcgc tttgccgccg agaccggaat tccagcaagc gcgtggcgtg gacggtattg 3139261 ctaacccatt ctttcaagac cgacgatcct gttggcatcg agaggtactg gcagcgccga 3139321 cacttgccag accgcatcgc cgtgcacagg acgtcgtcag cgctgatatg cccgcagctc 3139381 ggcgctcagt ccagcaacac cgtcgcgaac gtgccgatct ccttaaagcc cacccgggcg 3139441 taggcggcac gggccaccgt gttgaagctg ttcacataca ggctggcgat gcgcccgctg 3139501 ccgacgatca ctgcggccaa cgttgcggta ccagccgtgc ccagaccgat accgcgccac 3139561 tccggatgaa cccagacccc ctggatctgc ccgacggccg gagattgcga tcccacttcg 3139621 gccttgaaga tcacttgacc gtgctcgaat cgggcccacg cgcgtccggc cgcgatgagg 3139681 ccggccaccc ggcgacgata gccgcgacca ccgtctccga gccgagggtc gacgccgact 3139741 tcgccgatga acatgtcgac ggcggccacc aggtaggagt ccagttcctc gggccgtacc 3139801 tggcgtacgc cggtgtcgat agcgcagctg gggtgagtag ccagggccat cagcggttgg 3139861 ttgtcgcgga catcccgcgc cggaccccac accggctcga gccgctgcca catcggcaac 3139921 accaggtcgg ccctgccgac cagtgacgaa caccgtcgcg gcgtgctcat cgccacgtcg 3139981 gcgaacgcat tcaggtcgat cggtccgccg cgcagcggga tgaggttggc accggcgaaa 3140041 cacagggatt cgtgcgcgcc gcgtcgggtc cacagctccc cgccaatcgc attgggatcg 3140101 atgccatggt ctgcgacccg ggcggcgacc atgcacgatt cgatcgggtc gtcgtcgagt 3140161 acccgccaca cggcggcggc gtcacgcacc acggacactt gccgctcgcc gacaagccga 3140221 gagatgggcg gagccgacat ctgcgaactc cctttggtgg gaactgacgg ccactgaatg 3140281 aaaagctgac ccctatcagc ttacggtcac aataggcgaa ccgctcggtg tcgcgcccgg 3140341 agcttgctcg cccatttcgg cggccagccg catcgcctcc tcgatcagcg tctcgacgat 3140401 ctgtgcttcg ggcacggtct tgatcacttc gccccgtaca aagatctgac ctttgccgtt 3140461 gccggacgcc acgcccaggt cggcctcacg tgcttcaccc ggaccattga cgacacaccc 3140521 catcacggcc acccgcaacg gcacatcgag accatccagg ccggcggtta cctcgttggc 3140581 cagggtgtag acgtcgactt gcgcgcgacc gcacgacggg caagacacga tctcgagcga 3140641 acgcggccgc aggttcaacg actcgagaac ctgattgccc accttgactt cctcgaccgg 3140701 cggggccgac aacgacaccc ggatggtgtc gcctatgccc cgcgacagca acgcgccgaa 3140761 ggcaaccgcg gacttgatgg tgccctggaa agcagggccg gcctcggtga caccgaggtg 3140821 cagtgggtag tcgcaccgtg cagcaagcag ctcgtaggcg gcgaccatca ccaccgggtc 3140881 gttgtgcttg acgctgatct tgatgtcacc gaagccatgc tcctcgaaaa gcgaagcctc 3140941 ccacagcgcc gactcaacca gcgcctcggg cgtggctttg ccatacttct ccatgaaccg 3141001 tttgtccagc gaaccggcgt tgacaccgat tcggatcggg atcccggccg cacccgccgc 3141061 cttggcgacc tcacccaccc ggccgtcaaa ctccttgatg ttgcccgggt tgacccgcac 3141121 cgcggcacat ccagcgtcga tggcggcgaa tatgtagcgc ggctggaaat gtatgtccgc 3141181 gactaccggg atctggctgt gccgggcgat ctcggccagc gcgtcggcgt cctcctggcg 3141241 cgggcaggcc acccgcacga tgtcgcatcc ggccgcggtc agctcggcga tttgttgcaa 3141301 tgtcgagttg acgtcgtggg ttttggtggt gcacatcgat tgcaccgaga ccggatggtc 3141361 actgcccacg ccgacgttgc cgaccatcag ctgacgggtg gcgcgccggg gagcgagcgt 3141421 gggtgccggg ggctgcggca tgcccaagcc tacagtcact gaaaatcctt tctacctact 3141481 ggaaaagcct aatcgggttg accaggtcgg cggtgacggt caagagcatg tacccgacga 3141541 caagaaccaa gaccacatag gtcgccggca agagtttgag gtaattcacc ggtgcggccg 3141601 ccaccttgcc acgagccgac cggaccatgt tgcggatcct ctcgaacacc gcgacggcaa 3141661 tatggccgcc atcgaacggc agcaacggca gcaggttgat cgcagccagg atgaggttca 3141721 gctgggccaa gaagaaccag aacgccaccc acagcccatg gtcgacggtg tcgccgccga 3141781 tgatgctggc gcccaccaca cttatcggcg tctgcgggtc acgctgcccg ccgccgatcg 3141841 cccgcaccag cgcacctacc ttggtcggga gggcggccag cgccttgccc acctccacgg 3141901 tcaggtcgcc ggtgaacgcg aatgtggccg gcatggcgga gaacacgccg tagcgcacag 3141961 gcccgacccg ggcggcgccc accccaatcg caccgaccgt tgccggctgg agctcaccgc 3142021 cctgcccgtt agggatccag cgttgggtgg attcgatgtc cacgtaggta acaatcgcgg 3142081 tgccgtcacg ctcgacaacg atcgggacgc tgccgtgtga cttgcgcacc gcggcggcca 3142141 tctcgtcgaa actggacacc ggggtgtcac cgaccttgac cacgacgtca ccggagcgaa 3142201 ttccggccag cgccgccgga ccgggcccgg tgcactgctc gagcttgccc tggctcactt 3142261 cctgtgcaac gcagccagtt tcgccgatta cggccctggt tggcggatgc aggttaggca 3142321 gcccccagac cagcgcgatg gcatagatca gcaccaggca gatagcgagg ttcattccgg 3142381 gcccggcgaa taacactgcg acccgcttcc aggtggcctg cttgtacatc gcacggtcac 3142441 gttcgtcggg gtcgagttcc tcgaccgggg tcatgccggc gatgtcacag aagccgccca 3142501 gcggaacggc tttgacaccg tattcggtct cgccgcgccg ggtcgaccac aacgtggggc 3142561 caaagccgac gaaatagcga cgtaccttca tcccggtgcg gcgcgcgacc cacatgtgac 3142621 cacattcgtg cagggccacc gaaatcagga tcgcgagcgc gaacagcaca atgccggtaa 3142681 caaacatcat cgaggtgtca ggacctttct aacgtcgatg cgtgtcgacc cgctgcgccc 3142741 ggcttcgccg tgcttgcgat cgccaccgaa gccataccag ataccgcgcg ctgcgctcgc 3142801 tcgcgggccc agcgctgcgc gtcgagtacg tcatccacgg tagcgggttc gacggcccat 3142861 tggtcggcag cgtgcaacac gtcggcgatg atgccgacga tggccgggaa gccgatccgg 3142921 ccagcaagga acgccgctgc tgcttcttcg ttcgccgcat tgtaaaccgc ggtcatgcag 3142981 ccaccggcta cgccggcctg ccgggccaac tcgaccgcgg ggaagacgtc ggtgtccaac 3143041 ggctcgaact cccagctcga cgcggtatgg aaatcacagg cagcagcggc gccgctgacc 3143101 cgacgcggcc agcccagcgc taacgaaatc ggtagcttca tgtccggggg actggcctgg 3143161 gcgatcgtcg aaccgtcgat gaaggtgacc atcgaatgga tgatcgactg ggggtgcacc 3143221 acgacatcga tgcggtcgta ggggatgccg aacagcaggt gggtttcgat gacctcaagt 3143281 cccttgttga ccagcgacgc cgaattcagc gtgttcatcg ggcccatcga ccacgtagga 3143341 tgcgcgccag cctgctcggg ggtgacatgc tcgaggtcgg ccgcggacca gccccgaaac 3143401 ggccctcccg aggccgtcag caccagcttg gcgacctcgt cgggagtgcc gccgcgcagg 3143461 cactgggcca gcgcggagtg ttcggagtcg accggcacga tctgaccggg ccgcgccgcc 3143521 cgcagcacca gcgaaccacc ggcgaccagc gattccttgt tggccagcgc cagccgggca 3143581 cccgtcttga gcgcggccaa cgtcggtcgc aggcccaacg cgccgaccag cgcattgagg 3143641 acgacgtcgg cctcggtctg ctcgaccagc cgggtggcgg cgtcggatcc gtggtagggg 3143701 atgtcgccga cccgctgcgc cgcgtgctcg tcagcgacgg caatattggt caccccggtc 3143761 tgcgcacgtt gtcgcagcaa cgtgtccaga tgggcgccgc cagcggccag cccgactacc 3143821 tcgaaacggt ccggattgtc ggcgatgacc tgaagcgcct gggtgccgat cgagccggta 3143881 ctgcccagca ccaccacccg caaccggccg tcagcgcgcc cgtcggtcga gttggtcacc 3143941 tcatcattgt gcgccaccac ctcgttgtca ccgcgccgcc ggatcacgac gcgtccaccg 3144001 gtagccacac ttccccgtgg aatgcaatcg tcttgatgcc tgcgcttgat gctaagatgc 3144061 catgcgtgcg cacgacgatc cgtatcgatg acgagctgta ccgcgaggtg aaagcaaagg 3144121 ccgctcgttc cgggcgtacc gtggccgcgg ttcttgaaga tgcggtgcgg cgtggtctca 3144181 acccgcctaa gccgcaggcc gccggccgtt atcgagtcca gccgtcgggt aagggcggcc 3144241 tgcggcccgg tgtcgatcta tcgtccaacg ccgcacttgc cgaagcgatg aacgacggcg 3144301 tgtcggtcga tgctgtgcgt tgatgtcaac gtgctcgttt acgcgcatcg ggcagaccta 3144361 cgggagcacg cggactatcg gggtttgctt gagcggctgg ccaacgatga cgagccgctg 3144421 ggtctaccag atagcgtgct cgccggcttc atccgggtgg ttaccaaccg ccgcgtcttc 3144481 accgagccga cgagcccaca ggacgcatgg caggcagtcg acgccctact cgcggcaccc 3144541 gcagccatgc gacttcggcc tggcgagcgc cactggatgg cctttcggca gttagcgtcc 3144601 gatgttgatg cgaacggcaa cgacattgcg gacgcgcacc tggccgccta cgcgctagag 3144661 aacaacgcaa cctggttgag cgccgaccgc ggctttgccc gtttccgtcg actgcgctgg 3144721 cgtcatccgt tggacggtca gacccatcta taaccggccc cactccgaat cactggtgtc 3144781 cacccaggag gacggcgttc aacgccgccg cagaagcaaa ggaatcgaag cgatgatcaa 3144841 cgttcaggcc aaaccggccg cagcagcgag cctcgcagcc atcgcgattg cgttcttagc 3144901 gggttgttcg agcaccaaac ccgtgtcgca agacaccagc ccgaaaccgg cgaccagccc 3144961 ggcggcgccc gttaccacgg cggcaatggc tgaccccgca gcggacctga ttggtcgtgg 3145021 gtgcgcgcaa tacgcggcgc aaaatcccac cggtcccgga tcggtggccg gaatggcgca 3145081 agacccggtc gctaccgcgg cttccaacaa cccgatgctc agtaccctga cctcggctct 3145141 gtcgggcaag ctgaacccgg atgtgaatct ggtcgacacc ctcaacggcg gcgagtacac 3145201 cgttttcgcc cccaccaacg ccgcattcga caagctgccg gcggccacta tcgatcaact 3145261 caagactgac gccaagctgc tcagcagcat cctgacctac cacgtgatag ccggccaggc 3145321 gagtccgagc aggatcgacg gcacccatca gaccctgcaa ggtgccgacc tgacggtgat 3145381 aggcgcccgc gacgacctca tggtcaacaa cgccggtttg gtatgtggcg gagttcacac 3145441 cgccaacgcg acggtgtaca tgatcgatac ggtgctgatg cccccggcac agtaacgttc 3145501 ggcgcggtca aggcgaggca gcccgtgtag gcggtttgcc tcgctcatcc ggcggcttcg 3145561 tgccgataga tcacgtgata tcccaagcgc atgacggtga caccgcgccc agcgcaagcc 3145621 gatccccgca gcatgcctgc tgaagtcgcg tctcgcgaac tgcgcaacaa caccgccggg 3145681 ctgctactgc gcgtgcaggc cggcgaagac atcaccatca ctgccaacgg caaacccgtt 3145741 gcgctgctga ccgcaggcag cccgcacggc gccgatggtt gagtcgagac gagctgctgc 3145801 ggcggcttcg gcatacgcaa gcagatgcgg gattgcaccc gcgacctcgc aacgctcact 3145861 ggcgacacca ccgacgatct cggtcccgtc cggtgagggc cgctgccgtt gccacgtcgc 3145921 aaggggtgcc ggtcgtgacc cacgacggcg acttcgacgc cgtcgatggt gtggccgatg 3145981 tggctatcat tcgcatctga cgggtggcga gttcgacgtg aaccgactct gtcaacagcg 3146041 ctcgcgtgag cggtcctgcc aactcgttgc cgtcccggca gatccaagac ctaaacggca 3146101 acgaataacc gatgtgttga ccctcgcact agtcggcttc ctcggcggcc tcatcaccgg 3146161 aatatcacca tgcattctgc cggtcctgcc agtaatcttc ttctccggcg cgcagagcgt 3146221 cgatgcagcg caggtggcga aacccgaagg cgccgtagca gtccggcgca aacgtgcgct 3146281 atcagcgaca ttgcggccct accgggtgat cggtggtctg gtgctcagtt tcggcatggt 3146341 caccctgctc ggctcggcat tgctgtcagt gctgcatcta ccgcaggacg ccatccgctg 3146401 ggccgcactg gtcgccttgg tggcaatcgg cgccggcctc attttcccgc ggtttgaaca 3146461 acttctggaa aaaccgttct cccgtattcc gcagaagcaa atcgtcactc gcagcaacgg 3146521 tttcgggctg ggtctagccc tgggcgtgtt gtatgtcccc tgcgccggcc cgattctagc 3146581 tgcgatcgtc gtggccgggg ctactgccac catcgggttg ggaaccgtcg tgctcaccgc 3146641 gacattcgca ctcggagccg cgttgccgtt gttgttcttc gccctcgccg gccaacggat 3146701 agctgagcgg gtgggcgctt ttcggcgccg ccagcgtgag atcaggatcg ccaccggttc 3146761 cgtgacgatc ctgctggcgg tggcgttggt gttcgatctg ccggccgcgc tgcagcgggc 3146821 tattcctgac tacaccgcat cgctgcagca gcagatcagc accggcacgg agatacggga 3146881 acaactgaac cttggcggca tcgtcaacgc ccagaacgca cagctgtcga attgcagcga 3146941 cggggccgca caactcgaaa gctgcggcac tgcaccagat ctcaaaggca tcaccggctg 3147001 gctcaacacg cccggcaaca agccgatcga cctgaaatca ttgcgtggca aggtggtgct 3147061 gattgacttt tgggcctact cctgcattaa ctgccaacgg gccatccccc acgtcgtcgg 3147121 ttggtatcag gcctacaaag acagtggttt ggcggtcatc ggcgtgcaca cccccgagta 3147181 cgctttcgag aaggtcccgg gcaacgtcgc caaaggcgcg gccaatctgg gcatcagcta 3147241 tccgattgcg ctcgacaaca actacgccac ttggaccaac taccggaatc gctattggcc 3147301 cgccgagtat ctgatcgacg ctaccgggac ggtgcggcac atcaagttcg gagaaggcga 3147361 ttacaacgtc accgagacgt tggtcaggca gttgctcaac gatgccaagc ccggcgtcaa 3147421 actcccccag cccagcagca ccaccacgcc cgaccttacc ccgcgggccg cacttactcc 3147481 cgagacgtac ttcggagtcg gcaaggtggt caactacggc ggcggcggcg catatgacga 3147541 agggtcggcc gtgtttgact acccgcccag tttggcagcc aacagctttg cactgcgcgg 3147601 ccggtgggcg ctggactatc agggtgccac gtccgacggc aacgacgccg ctatcaaatt 3147661 gaattaccac gccaaagacg tctacatcgt tgtcggtggc accggcaccc tcacggtcgt 3147721 gagggacgga aagccagcca cactaccgat cagcgggccg ccgaccaccc atcaggtggt 3147781 cgccggcgat cggctggcgt ccgaaacact tgaggtgcgg cccagcaagg ggctacaggt 3147841 tttttccttc acctacggat gaatatccat ccaagacccg gacggctccg aagaaatcat 3147901 gtcgggggta gcgagacggc acaagccgcc gtctccggca gcgaaggagt gaacggcatg 3147961 aaggtaaaga acacaattgc ggcaaccagt ttcgcggcgg ccggcctggc ggctctggcg 3148021 gtggctgtct caccgccggc ggccgcaggc gatctggtgg gcccgggctg cgcggaatac 3148081 gcggcagcca atcccactgg gccggcctcg gtgcagggaa tgtcgcagga cccggtcgcg 3148141 gtggcggcct cgaacaatcc ggagttgaca acgctgacgg ctgcactgtc gggccagctc 3148201 aatccgcaag taaacctggt ggacaccctc aacagcggtc agtacacggt gttcgcaccg 3148261 accaacgcgg catttagcaa gctgccggca tccacgatcg acgagctcaa gaccaattcg 3148321 tcactgctga ccagcatcct gacctaccac gtagtggccg gccaaaccag cccggccaac 3148381 gtcgtcggca cccgtcagac cctccagggc gccagcgtga cggtgaccgg tcagggtaac 3148441 agcctcaagg tcggtaacgc cgacgtcgtc tgtggtgggg tgtctaccgc caacgcgacg 3148501 gtgtacatga ttgacagcgt gctaatgcct ccggcgtaat cgtccgcgga ggccgccgac 3148561 ccgcccgaga gcgactgagc atgtgccaga atgttcgggc agtgggagtt cgacgtcagt 3148621 ccaaccggag gaatcgccgt ggcaagtacc gaggtggagc acttcgccgg ctcgcaacat 3148681 gaggtcgaca ccgccgaggt tccatctgca gcgtgggggc ggagccggat cgatcaccgc 3148741 acctggcaca tcgtcggcct gtgcatcttc ggcttcctgc tggcgatgct gcggggcaac 3148801 cacgtcggcc acgtcgagga ctggttcctg atcacgtttg ccgcagtcgt gctgttcgtc 3148861 ttggcgcgcg acttgtgggg ccgacgacgc ggctggatca gatagccagc acaccgttcg 3148921 gtgtgcccga cccggtcagc gccgcacccg ccgaaaccag gtaccggcga aggcaccgac 3148981 caccagcaca accagcaaca ccgcccaagg ccatgcaccg tgctggttaa cccagccagc 3149041 cagggcacct tgcaggcggc cggccgcggc aatcaccgca tcctggggat tcgccccgac 3149101 accggcaatc aggcgcagct cgtagagacc gtagtaaccc acgtacagcc cgaccaccac 3149161 cagcagcgcg ccactgatcc ggttgacgaa cggcaagatt cgccgtaggc ggtcggccag 3149221 cgccgagctc gcggtcgcgg ccgcgacggc aagcacgccg acaacgaggg tcaggcccgc 3149281 gacataagcc agatagatcg ctacgctccc gacgaccgaa ccgccccgca ggcctgcccc 3149341 ggtaaccgcg agaaacggcc cgatggtgca tgacagcgaa gcaaccgcat agctgatgcc 3149401 gtagccatac atggaaccca gccgtaccgt tggagcccaa cgcacgccga gggatcgggg 3149461 cgtcaacgcc gtcagccctc gtcccaacag cagccacccg ccgagggcga tgagcgccag 3149521 accgatcagc accgtggcat agggcaggta tcgctgcacc gccgtggccg cggaaatggt 3149581 cagggctccg aagatgccga acaccgtcaa gaagcccagc gccatcccga ccgtggcggc 3149641 tgccgctcgg cccactgcgc taagcggccc cgtccggccc gccgaatcct gcccatgcac 3149701 caccaacagc aggtaggccg gcaacatggc aaacccgcat gggttcagcg cagccaccaa 3149761 cccggcggcg aacgccaaac cgatcagcgc ctcgttcacc gggtcaggac gtcagcgcag 3149821 ccacccggcc ggacagctcg tcctgagaca tggccgcggt ggggttgttg acgaacgtcg 3149881 atgtgccgtc cgcgcgatag aacacaaatg ccggttgcca aggcacgttg tagcgggccc 3149941 agatcacacc atcggcgtca ttgaggttgg tgaaattcag gttgtacttc gagacaaagc 3150001 tctgcatcgc cccgacgtcg gcgcgggtgg cgattccgac gaaggtgacc gccggattag 3150061 cggccgctac ctggctgagg ctgggggctt ctgcgttgca gaacgggcac cacggcgtcc 3150121 agaaccacaa caccgccggc ttgccttgca ggcttgcgcc atcgaagggg gcaccgctga 3150181 gcgtggttgc ggtgaactgc agacgttcat cggctgccac cgctcgcggt gtattggcca 3150241 gaccgaacat caggacaacc gcgatagcaa cggccacaat gccgtccgca aacgccttga 3150301 tcggggacac caggcgaaga ctcatgacag acctcacttg ttcgtgtttt gacctaatga 3150361 cgtaatacgc tccgtgacgg ttcagtacat cccggcgccc cctgcgctcg cggccagctg 3150421 tccgcagcgc tggctgattc gcctgcgctc cagctacccg cctacggcgg ccagctgtcc 3150481 gcagcgctgg ctgattcgcc tgcgctccag ctacccgcct acggcggcca gctgtccgca 3150541 ggcggcgctg atctcccgcc cacgggtgtc tcgtaccgtg caggaaactc ccttcgcccg 3150601 aacccgtttg acgaattcac gctcaaccgg cttggggctg gcatcccaat cactgcccgg 3150661 agtcgggttc agcgggatca ggttcacgtg cgccaacggc ccgagaacac gatgcagtcg 3150721 ctttcccagc aagtcggccc gccacggttg gtcgttgaca tcacggatca gcgcgtactc 3150781 aatagacacc cgtcgcccgg tcacattggc gtagtaccgg gccgcatcga gcgcttcgct 3150841 gatcctccac cggttgttga ccggaactag tgtatcgcgc aacccgtcgt cgggggcgtg 3150901 cagcgacagc gccagggtca cgccgagccg cgcgtcggca aggttgcgga tagcaggggc 3150961 cagacccacc gtcgacaccg tcaccgcgcg ggccgaaatc ccgaaaccgg acggcggccg 3151021 cgcggtaatg cgctgaactg cggccaacac cctggcgtag ttggccagcg gctcccccat 3151081 acccatgaac accacattcg acaaccgatc gccgaagtcg tcgcgcaacg ccgcggcgcc 3151141 ggcacgcacc tgctcgagga tctccgccgt cgataggttg cgagtcaatc cgccctggcc 3151201 agtggcacag aacgggcaag ccatgccgca gccggcctgc gaggaaatgc agaccgtgtt 3151261 gcgccgcgga tagcgcatca gcaccgattc gaacatggta ccgtcgacgg cccgccacaa 3151321 cgtctttcga gtctggccgg catcgcaggt gatgtcggcg gacgcggtaa gcaagttcgg 3151381 gaacatcgct ccggcgatcc ggtcgcgaac ggccgccgga aggtcggtca tctgacgcgg 3151441 atcggcgatc agccgaccgt agtactggtg tgcaagctgc ttggcccgaa acgccggcag 3151501 ccccagctcc gcgacggcag acgctcggcc cgccgcgtcg agatcggcca ggtgccgcgg 3151561 cggccgaccc ggacgcggct catcgaacat caactcgggg accatgacct gtccagtatc 3151621 gccgttgtca gggcagcagt gtgaggacta tccaggccgc caccgcggaa ggcagtatgc 3151681 cgtcgagccg gtccatcaga ccgccgtggc cgggtagcag gcggcccatg tctttgatgc 3151741 cgaggtcacg tttgacctgc gactccacca ggtcgcccag cgcggtggtg agcacgaaaa 3151801 gcacgccgag cagtgcacca atccacggcg ttttgccgac caggaaagtc gcggtgatga 3151861 tcgttgcggt gatcccgcac accagcgaac cggcaaagcc ctcccacgac ttcttcgggc 3151921 tgatcgtcgg aaccatcgga tgcttgccaa acagcacccc cacggcgtag ccgccgacat 3151981 cggaagcgat gaccgcgatc atcatgcaga acacccatcc cgagccattt tccgggtaga 3152041 ccagcattgc gccgaaagag cagaacaatg ggacccacac ggccaggaag accgtggccg 3152101 agacgtcgga caagtagttt cccggcgacg gtgcaccgcc ggtcgtcggg cgcgtcacgc 3152161 tgtcctgcat gaacagtcgc caaatcatgc agacaacgac catgccacca aagcccgcca 3152221 atgcgccgac cgcgccgaac ggccaggtca gccacaccgc ggcctgcccg ccaatcagca 3152281 acgggataac cgggatgaga tagcccgctt cccgcaacct ccgcaccacc tcatgggtag 3152341 cgaccaaggt ggcgacggcc acgatggcaa cccaaacgcg cggaacgaac accagcaccg 3152401 cgatgaggac taggcctatg gaaaggccca ccacgatcgc tgcgcgcaaa tcacggccgg 3152461 cgcgggacgt ttcggtcgcc ggctgctgtt tagcaccacg cgccggctgc tcggcggggt 3152521 ttccggtgcc ggcatcgttg gttgtcacgg attttgttgc tgagcggccg ctagacctcc 3152581 agcagctcgc cttctttgtg tttaaccagc tcatcaattt gggtgacgta ttggtgcgtg 3152641 gtcttgtcga gatccttttc tgcgcgaccg acctcatcct cgccggcctc gccttcctta 3152701 cggatgcgat ggagttcctc catcgctttg cgacggatat tacgcaccga aaccttggcc 3152761 tcctccccct tatgctttgc ctgtttgacc agctctcgcc gacgttcttc ggtgagctgc 3152821 ggtacggcca cgcgaataag ggcgccgtcg ttggtgggat tcactccaag gtcggagttg 3152881 cgaattgcag tctcgatagc gcgcaactga ttggcttcat acggctttat cacgactagc 3152941 cgcgcctcgg ggacattgat gctggccagt tgcgtgatcg gggtggccgc accgtagtag 3153001 tcgatggtga tccgagagaa catgccaggg ttggcgcggc cggtacggat agttgacagg 3153061 tcgtcacgtg ccaccgccac agccttctcc attttctctt cggcgtcgaa gagagcctca 3153121 tcaatcatct gcgccgctcc tcctcatcgc tgcgctctgc atcgtcgccg gcgccaacca 3153181 tctgcgccgc tcctcctcat cgctgcgctc tgcatcgtcg ccggcgccaa ccatctgcgc 3153241 cgctcctcct catcgctgcg ctctgcatcg tcgccggcgc gaagcagcgc gtagtcccct 3153301 taggtggtga ccagcgttcc gatcttctca ccccgaacag cacgggcgat attgccatcg 3153361 gtcagcaggt tgaacaccag gatcggcatg ccattgtcca tgcaaaggct gaacgcggtg 3153421 gcgtcggcta ctcgcagccc gcggtcgagg acctcacgat gactgacggc ggtgagcagt 3153481 tcggcctcgg ggttcacccg cggatcctca gcaaacacac cgtcgaccgc tttggccatc 3153541 aagaccacgt cggcaccgat ctccagcgca cgctgcgctg cggtggtatc cgtcgaaaag 3153601 tacggcagcc ccatgccggc accgaagatc accacccgtc ccttctccag gtggcggacg 3153661 gcccgcaacg gcaggtacgg ttcggccacc tggcccatgg tgatcgcggt ctggactcgg 3153721 gtaacgatgc cttccttctc caggaagtct tgcagtgcaa ggctgttcat gacagtgccg 3153781 agcattccca tatagtccga cctggtgcgc tccataccga gctgctgcag ctgtgcgccc 3153841 cggaaaaagt tgccgccgcc gatcacgacg gcgatctgga cgccgccgcg caccacatcg 3153901 gcgatctggc gggccacctg cgcgacgaca tcgggatcca gcccgacctg gcctccgccg 3153961 aacatttccc cgccgagctt gagcaacact cgcgagtacc cggacagctg agccgccgac 3154021 gcggcgccag tgctcgcagg ctccggcttc gaagccggcg cgccggcgac atcgggctct 3154081 gtcatctgac tcctcgcacg acagtgccat cccggcacca ccaggacggc atctcacatt 3154141 ctgcctcaat agccgcgctc cggcgtgggc ggggtgcgtt agtcacgcaa caacgagggg 3154201 ccggccgagg ccaggcccgt cgactatctc aaggtgtgag catcgctcga gcaacaaagt 3154261 tggaatagtt ctgttctgaa ccgggtaccc aggggtaccg gcagacatct ccgcgaggga 3154321 tgcctacggg ccccacgacg gggaagtggc accctcatga agtttggaga tatctcttgg 3154381 aagttctact tcttaccgat gaagccgatc ttgaatcggc tctgccggag ctggagtcgt 3154441 tcgcgcagtc ggtgcagcgc gcaccgctgg acgacccggg cgcggccaag ggtgcggacg 3154501 ccgatgtcgc gatcattgac gcgcgcgccg acttggcggc cgctcgccgg gtgtgccgcc 3154561 ggctgacgac tagcgcacca gcccttgccg tggtggctgt tgttgcgccg gccaactttg 3154621 tggcagtgga cggcgattgg atattcgatg acgtgctgtt gaacgcggcc ggcggggccg 3154681 agctgcaggc acggttgcgg ttggcgatca cacgtcgacg gagcacgcta gcgggcacac 3154741 tgcaattcgg ggacctcgtc cttcacccag ccagctacac cgcgtcgctg ggcgaccggg 3154801 acctggggct gacgctcacc gaattcaaac tcatgaattt ccttgtgcag catgccggtc 3154861 gggcgttcac ccggactcgg ctcatgcgtg aggtgtgggg ctatgagtgc catggtcgca 3154921 ttcgtaccgt cgatgttcac gtacgacgac tgcgcgcaaa gctcggagcc gagcacgaat 3154981 cgatgatcga caccgttcgc ggtgtgggtt atatggcggt gacgccaccg cagccgcgct 3155041 ggatcatcag cgaatcgata ctaaaccgtt gcaagtgagt gatctttagt ggtcacttga 3155101 cttgcacccc gtctcggggt tgttcgccgg ccgggtggcc ggttgccttc cgcgcttcac 3155161 ggccacccgg ccgggccagg cccggtctta cggtcggctc cacgcttgac ggcggcccca 3155221 actgggccga cgacgctagg tggttcctcg tagcgtgcga ggttgatcgc ggcgttgtcg 3155281 tcacgttggt gcgtgatcga acagccgtcg cattgccata tttcgtccca gccgatgtct 3155341 tgcacatgcc ggcaggcatg gcaggttttc gacgatggga accagcggtc ggcgaccacc 3155401 agactcgatc cgtaccagcc tgtcttgtag gacaggtgac ggcgcggggt tgccagggct 3155461 gcatcagaca gtgcgcgccg tctggcgcgc gcccccggca gtcccttttg ccgcagcatt 3155521 cccgccgcat ccagaccttc gacaacgata cggccgtggg ttttggccaa tcgtgttgtc 3155581 agcacgtgca ggtggtgggt acggacatcg ttgacccgac ggtgcagccg ggacagttcg 3155641 gtggtgcgct cacagtagcg gcgtgagcct ttcgtgcagc gtgagcgtgc gcggctgacg 3155701 cggcgcaacc cgcgcaacgc agcatcaagc gggcgaggat tcggcacttg ttcaagcacc 3155761 gtgccctcag cgtctgcaac agtggccaaa cgccgcacac caacgtcgac acccacccgt 3155821 gaatcaggaa gcgccacacg ccgctgttgg gggcgttgga cgagcacccg cacgctcgca 3155881 tccaggcggg tgccgttgcg gcgcacggtg atcgccagca cccgcgcccg acctttggcg 3155941 atgagccgct caacccggcg ggtgttctcg tacgtacgga tggtgccgat caccggcaag 3156001 gtgaggtggc ggcggtcggg ctccacacgc atcgcaccgg tcgtgaagca cacgcgatcg 3156061 gcgtcgcgtc ccttcttctt aaaccgggga acgcctactg ttttgccagc ccgtttcccg 3156121 gcacggcagc tctgccagtt ccaatacgca tcgaccgcgc ccgcgatgcc atcggcatag 3156181 gcctctttcg agcattccgg ccaccacacc tgcccggtct gcgcgttgac acacacctgg 3156241 tctttgaccg tgttccaccg tttgcgcaac acccgcagcg acggcttcgc cgactcggtg 3156301 ccatccgcgc gccacgcctt gatgtcggct ttaagcgccg tgacggtcca gttgaatgcc 3156361 ttacggcgag caccaaaatg gcgcgccaag ctggcagctt gcgtctgggt cgggttcagc 3156421 gtgaaccgaa acgcctgcac acaccacccc tcaggcacct ttaagcgcgc catcacctag 3156481 cctcgtgtcc cccggcgcgt gccgccgcgg ccacggcacg cgcagcacgg ttgccagcag 3156541 cgcgtttgcc gtagagccgc gcacatatcg acgtcaagat ctcggtgata tcgcccacaa 3156601 cgtcgtcatc gacatcggcc gaatcgacca caaccagctc acggccgtca gcggccagca 3156661 ccgcctgtac acactcaaag ccgaaccgcc ccaaccggtc ccgacgtttc atcacaatcc 3156721 gcctcaccgt cggatcaccc agcagcgtaa ggaacgtgcg gcggcgcccg tacagcgccg 3156781 acccgacttc agtaacgacc ttgccgacgg gtatctgttc cgccgtggcc cacgcggtca 3156841 cgcctaccac ttgccgatcc agatccacct tctgatcagc cgacgacaac cgcgcacaca 3156901 ccgccgtccg cccccaccgc cccggctgcc cggctggctc gtcgacaaga atcactcgcc 3156961 caaccctgcg ggcgggaacc ggcaaccgcc cgacacgcaa ccagcgatac gcaataaccc 3157021 gcgccacacc gttgccctca gcccacacca ccagattcat acttccgttc ctacaacaca 3157081 ccaccgacaa ccaacgacca cccaaacgca acagctgaca gccccttccg ggcatcggca 3157141 gcaccggccg aagactccac agcgcgttaa tgcgcccagg tgtttgcaac ggcggtgtcg 3157201 aaggctgccg agaacacgcc cactgcggca atgcgatgta ggcttcacgc ccgtggctat 3157261 ggttcccgct caaacgaccg gcggcactgc ccacaagcgc cgggagcgca taggaacgat 3157321 ttaccgttcg gcccggcaca tgtgtcagta tccttgacat gggtctagcc gatgacgccc 3157381 cgctgggcta tctgctctac cgggtgggag ccgtactgcg gccagaggtt tccgctgcgc 3157441 tcagtccact cggcctgacg ctgcctgagt tcgtctgcct gagaatgctt tcgcagtcac 3157501 cgggactatc cagcgccgaa ttggcccggc acgcaagcgt cacaccgcag gcgatgaaca 3157561 cggtgttgcg caagctggaa gatgccggtg cggtggcccg gcccgcatcg gtgtcttccg 3157621 ggcgttcgct accggctaca ttgaccgctc gaggccgagc cctggcgaag cgcgccgagg 3157681 ccgtcgtacg cgccgccgat gcccgcgtcc tggccaggct gaccgcgcct cagcaacgcg 3157741 agttcaaacg aatgctggag aagctcgggt ccgactagat ccggacgcgg gctactcggc 3157801 gatatttggg gcgtggatcc gggcccaggg ccgggcctct tcgagttcgt aagccagctc 3157861 cagcagcagc gcctcgcgcc cggtatcagc cgagagcatc atgcccacgg gcatgccgtc 3157921 cgcggattga gccaacggta gcgaaatcgc cggcaccccc gtgacgttct gcactggcgt 3157981 gaacacgacc cagctgctca gccggtcgag caccgtctga tagtcggtag gcgcaaggta 3158041 tccgacctgc ggagtggcct ccgcgaccgt tggcgtgagc aagacgtcgt aggtaccgaa 3158101 gaaccgcacg ctgcgccgcc gtagcatgcg cagacgcatg atcgccaacg gcagccggtg 3158161 caggttgcgg ccggtatggc gggccagccc caaagtcagt tcgtccagcc gggtagggtc 3158221 gaacgtcctg ccgaatgtgc gccggccgct gcgcacttgc gccagggcca agaaccccca 3158281 atagagcacg aaatcgtcca cgaaactggc cggtgccggt gggtggtcga cgtgttctac 3158341 ccggtgacct agttcctcga gcagccctgc caacttcagc gtcagctgcc gcacttcggg 3158401 gctggcctcg cgcagaaccg agcgggttac tacggcaatc ctcagccgct gcttaacggg 3158461 gcttgtgacg tccccgaccg gcggcagctg gtggttacgc caaaggcgct cggcctcgcg 3158521 gtagaaggct gcggtgtcgc gtaccgtgcg ggtcaggacg ccattggcga cgatgcccac 3158581 cggcaacctg cgatactccg gctccagcgg caaccggccg cgcgacggct tgagcccgac 3158641 caacccgttg caggcggccg gaatacggat cgagccgccg ccgtcgttgg cgtgcgcgat 3158701 cggcaccacg ccggctgcca ccaaggcgcc cgatcccgat gaggaggcac ccgctgtgta 3158761 gtcggtattc cacggattac ggaccggtcc cagccgaggg tgttcggcca cggcgctgaa 3158821 gccgaattcc gacaactgcg tcttgcccag ggacaccagc ccggtgccca gcaccacccg 3158881 ggttatctcg ctgtcggcga cggccgcgta tggttcccac gcgtcggtgc catgcatcga 3158941 cggctgtccg gcaacgtcga cgttgtcctt gatgaaggtc ggcactccac tgaagaacgc 3159001 ttcctggccc gtacccatcg cggccgcgtc tcgcgccacg tcgaaagccg catacgccaa 3159061 cgcgttcagt gccgggttaa cggcttcggc gcgggcgatg gcggcctcga cgacgtctgc 3159121 ccgacccact cgacctgatc ggatggcgtc ggcgagggcg accgcgtcga ggtcaccaag 3159181 ggcatcgtca acgaaagcgt gtacgcgcga catacccggc taagcctggc ccacctcgaa 3159241 gcggacgaac cgtgtcacca tcacgccggc cacgtcgagc agggccttga cggtcttctt 3159301 attgtcggac accgacgcct gctcaagcag caccgcatcc ttgaagaagc cgttcagccg 3159361 gccctcgaca atcttgggca gcgcctgctc cggcttgccc tcggcccttg ccgtctcctc 3159421 ggcgatgcgg cgttcgctgg ccacgatgtc ttcaggcacg tcgtcgcggg acaggtaccg 3159481 cgcccgcagc gcggcgattt gcaacgcaac ggcgtgcgcg gcggccgcgt cgtcgccgcg 3159541 gtactcgacc agtacaccca ccgctggcgg caggtcagcg gaacgtcgat gcaggtaggc 3159601 ttccacggtc ccgtcgaaaa tcgccacacg acgcagctcg agcttctcgc cgatcttggc 3159661 cgacagctcg gcgatcgcct gctcgacggt cttgtcgccg atgctggcac ccttgagcgc 3159721 gtcgacgtcg gcgggcttag ctgctgccgc cgccgcgacc acttggtcgg ccagcgtttg 3159781 gaactccgcg ttcttggcaa caaagtcagt ctcgcagttg agctcgatca gcgcgccgtc 3159841 cttggccgcc accaagccct cggccgtagc ccgctcggca cgcttgccga catccttagc 3159901 gcccttgatc cgcagcgcct cgacggcctt gtcgaagtcc ccgtcggttt cggccagcgc 3159961 gttcttacag gcgagcatgc cggcgccggt cagctccctc agccgcttga cgtcagcggc 3160021 agtgaagttc gccatatcag cctttcctag gatgcatctg tggttggttc ggttgcgcct 3160081 gcgggggcgt cggtgagggc agttgttgac gcggttgctg atggcgttgc cgaagctgtc 3160141 gccgaggcca gcagctcttg ctcccattcg gccagcggct cggcggcttc ggcctccggc 3160201 ttgccgtcgg cgcgccccag tccggcacgg gcctgcaggc cctcggcgac cgcggaagcg 3160261 atcaccctag tcagcagcgc ggccgagcgg atcgcgtcgt cgttgcctgg gattgggtag 3160321 tcgacctcgt cggggtcgca gttcgtgtca aggatcgcga tgaccgggat gcccagtttg 3160381 cgggcctcac cgacggcaat gtgctctttg ttcgtgtcga cgacccagat cgccgacggc 3160441 accttggcca tgtcgcggat gccgccgagg ctgcgctcga gcttgttctt ctcgcgggtc 3160501 aatcccaaga tttccttctt ggtgcggccc tcgaagccac cggtctgctc catcgcctca 3160561 agctccttga ggcgttgcag ccgcttatgc acggtggaga agttggtgag catgcctccc 3160621 agccagcgct ggttcacata cggcatgccg acccgggtgg cttcggcggc caccgactcc 3160681 tgcgcctgct tctttgtgcc gacgaagagc accgacccac cgtgagcgac ggtctctttc 3160741 acgaactcgt acgccttatc gatgaaggtc aacgtctgct gcaggtcgat gatgtagatg 3160801 ccgttgcggt cggtgaagat gaaacgcttc atcttgggat tccagcgacg ggtctgatgc 3160861 ccgaagtggg tgccgctgtc aagcagctgc ttcatggtga ctacggccat acctatgcct 3160921 tactcatgtg tcggttgttc gcccggcatc ggctgaagcc gggccctggc gtctgccgcg 3160981 atgccggacc cgggaggaaa tccccgaagg gaaccgccgc gggaccgccc cggcatgctg 3161041 ttgcggatcc cggaaaggcg ggccgcggtg cagacacgcg aagtcagccc gccgatgcga 3161101 gctgcgccga gtagtttaca ccgacccagc tggtgatttt cccggcagcg gaatccacag 3161161 cgacgacatt gtccacaaaa cgggcggcgg cgattggcca aatcgcccgc gcggcgctgc 3161221 actgcaaagg tgcggagggt tctgagccgc agcgtactga tcctttgctg gtcgctgctt 3161281 ggtgcggcgc cggcccatgc cgacgactcc cggctgggct ggccgctgcg gccgccgccg 3161341 gcggtagtcc ggcagttcga cgccgcatcg cccaattgga atccggggca ccgcggtgtc 3161401 gacctggccg ggcgccccgg tcagccggtt tacgcggccg gcagcgcgac ggtcgtattc 3161461 gccgggctgc tcgcgggacg gccggtggtt tcactggccc acccgggtgg gctacgcacc 3161521 agctacgagc cggtagtcgc ccaggtccgg gtcggtcagc cggtgtcggc gcccaccgtg 3161581 atcggcgcgc tggcggccgg gcaccccggg tgccaggccg ccgcctgtct gcactggggg 3161641 gcgatgtggg gcccggcttc gggcgccaac tatgtcgatc cgctgggcct gctgaagtcc 3161701 acaccgatac ggctcaagcc gctatccagc gaagggcgga cgctgcatta ccgccaagcg 3161761 gaacccgtat ttgtgaacga agccgccgcc ggtgctctgg ccggcgctgg ccatcggaaa 3161821 tccccgaagc agggcgtttt ccgcggtgcc gcgcagggcg gtgacatcgt cgcccggcaa 3161881 ccgccaggcc gctgggtttg cccatcgagc gcgggcggcc caatcgggtg gcaccgacaa 3161941 tgaaccagcc gagctcccct tccccaaagc ggccgatacc gatccgccaa tgctttctcg 3162001 gtctagtgcc cagtaccagt acggctgggg cgtctgaacc ccgccaacag caccgccgcc 3162061 tgccacactt gggctcgccc gcgggccggc gaagatggtt ggaccccagc tgtcaagcac 3162121 cgaggatccc gagtcaccgg cgccgcccgg ggtgcccggg atcgcttgct gggcgagcga 3162181 agcctcgaat tgcagtagcc cttcgtcgaa gtagctgatg cccagcaacc gaacgatcgt 3162241 catcctgttg tcaggcgtga gaccgagaag attctccgcg agccattgct gcaatgccga 3162301 gtaccacggg atcagtgacg tcgacgagag ttgctgcaac agttggggca ccgcggcggt 3162361 ggtggccagc ggtgggaccg tcgacgatac cgtcgccgcc gcctggccgg ccagcgcggc 3162421 agggctggta gtcactggcg ccgcggtgaa cggggtcaac tccgtggcga ttgccgcgga 3162481 gcccgcatag gcgtacatgg cggcagcgtc ttgggcccac atctcggcgt attgggcctc 3162541 ggtggccgcg atcgccgggg tgttctgccc gaaaaagttg gtcgcgacca gcgccaccaa 3162601 cagcgcgcgg ttggcaacga ccaccggcgg gggcaccgtc atcgcaaacg ccagctcata 3162661 ggctgccgcg gccgctctgg cctgcatgcc cgcctgttca gcctgaccgg cggtggcgct 3162721 gagccacgcc acataaggcg tgaccgcggc caccatcgac gccgctgcgg gccccgccca 3162781 gtacgcaccg gtcagctccg agatagccaa ccggtagccg ccggcggcca agcccaattc 3162841 agccgccaaa ctatcccagg ccgccgcggc ggccatcatg ggccccgatc ccggacctgc 3162901 gtacattcga ccggagttga tctcgggcgg caacacccca aagtccaacg cccatccctc 3162961 cctagccggc cgggatcacg gcgtggttac gcgccccacc cgaataggca gtggtacgtg 3163021 atgcggtcac gaactggtct tgaatcgcca gcctcaggtc gctgatcgct gacagcggcc 3163081 ccggtcgtcg aacaagccag tccatcctgt gccctcatcc ctgatagctg gattttggcg 3163141 gcttgacatc ggccgcacca gcgtttctgg gtaagtgctt acaaacgaga cgcatttgct 3163201 gtgaccggag ccgaatgttt gattcccggc cagctaccgt tcacctgaag gaagtcggcg 3163261 cgttacccac agctcgatat tcggggtcct gccggcccga accgccaccg cacaatcgat 3163321 gccggcttcg cggctaccgt cgactccatg accgttgcca gcaccgctca ccatacacgt 3163381 cggctacgtt tcgggttggc ggcaccgttg ccccgcgcgg gcacccagat gcgcgccttc 3163441 gcgcaggctg tcgaggccgc cgggttcgac gtgctggcct tcccggacca cctggtgcct 3163501 tcggtttcgc cgttcgcagg cgcgaccgcc gcggcgatgg ccacgcaacg actgcacacc 3163561 ggcacattgg tgctcaacaa cgactttcgc catcccgtgg acaccgctcg agaggcggcc 3163621 ggtgtggcaa ccctcgccga aggccgcttc gaactgggac tgggcgccgg acaccggagg 3163681 tccgaatacg acgccgccgg cattaccttc gattccgggg caacacgggt ggcgcggctc 3163741 atcgaatcgg cgcacctgat ccgtgcgctg ctggacgcgg agcccgtcga cttcgacggg 3163801 cagcattacc gggtgcacgc cgaagcgggc tcactggtgg caccgccgaa ggtccgggtc 3163861 cccctgctag tgggcggcaa cgggaccgag gtgctgcggc tgggcggacg catcgccgac 3163921 attgtcggcc tggccgggat cagccacaac cgcgacgcca cccaggtccg gttcacccac 3163981 ttcgacgccg acggcctggc cgaccggatc gccgtggtac gtcacgcggc cggcgatcgc 3164041 ttcgaagcca ttgagctcaa cgcgctgatc caggcggtgg tctgcaccaa cgaccgaaac 3164101 gcggcggccg ccgaactggc cgccaccttg ggcgggatca cgcccgagca ggtcctcgag 3164161 tcgccgtttc tgctgctcgg tacccacgag cagatggccg aggctctcgc cgcgcggcag 3164221 cggcggttcg gtgtcagcta ttggacggtg ttcgacgagt gggctggccg cgcgtcggca 3164281 atgcgcgaca tcgccgaggt catcgcgctc ctgcgctacg gctaggcccg cggatgggcc 3164341 cgctcgtgca ccgcccgcaa ccgggcgacc gcgacgtggg tgtacagctg cgtggtcgcc 3164401 aggctggaat gaccgagcag ctcctggacc acccgcaggt cggcgccacc ttccagcagg 3164461 tgggtcgccg cgctgtgccg cagcccgtgc ggccccatat cgggtgcgcc gtccaccgcg 3164521 gccacggtct ggtgcaccgc agtgcgtgct tgccgcacgt caaggcgccg gccccgggca 3164581 cccagcagca gcgcgtgccc ggactccgcg gtgaccagcg cgcgacggcc gtcgaccagc 3164641 caggcgtgca gcgcatcggc ggctggctgc ccgaacggga cggtgcgctg cttgttgccc 3164701 ttgccgagca cccgaaccaa ccgatggccg gtgtcgatgt cgtcgacgtc caggccgcac 3164761 agctcgctga cccggatacc ggtggcgtac aacagctcga cgatcaaccg gtcccgcagc 3164821 gctagcggat caccttgctc tgcaccagat tcggcagccg ccatggcgcg cagcgcctga 3164881 tcctgacgca gcaccgccgg caaggtgcga cgggccttcg gcacctgtag ccgggccgca 3164941 ggatcaccgg ccagtagccc gcgccgcacc gcccaggcgg tgaatgcctt aaccgccgaa 3165001 gtgcgccgcg ccagcgtcgt gcgggcggcg cccgctcccg ccgtcgcggc cagccaagac 3165061 cgcaggaccg aaagggttag tgcgtccaga ctcgatccgc gatcggcgag aaacgcgaag 3165121 agcgatctta gatcgcccag gtaggcacga cgggtgtgca ccgaccgacc gcattgcagg 3165181 gcaaggtatt cgtcgaactc gtcaaggatc gcctgcactc ccccacagtc gcaggcatga 3165241 cgtctcgagc ccgagtcgac gcgccgcacc gtgtccgggg tgagatcttt cggcctggga 3165301 tagccgacct agtgagtccc ggcctccgcc tcagccagtt ctttcttcca tttgcggaac 3165361 atctcctcgg tgcgtccgcg ccgccagtaa ccggagatcg acgacgccca tttggcatcc 3165421 acaccgcgct cgttgcgaac gtatggccgc aagttatgca tgacggcttg cgcctcaccg 3165481 tgaataaaga cgtggacctg tcccggcagc cacgcggtgg tggtgaccgc ctcgatcagc 3165541 ggcgcgtgat caccggcgcg gtcctcggga accagatcgg cgcgcccgcc gcgatagacc 3165601 cagttcacct cgacggcatc cggcgcggtc aggccgatct cgtcgtccgg gccggcaact 3165661 tcgatgaatg ccctaccgat tgcgtcgggg ggcaacgctt ccagcgcggc ggcgatggcg 3165721 gggatcgccg attcgtcacc cgccagcaaa tgccagtcgg cggctgggtc gggggcgtac 3165781 gcgccgccgg ggcccatcag gtagatcggt tgcccacgct gggccccagc cgcccacgga 3165841 ccggctaccc cgtgctcacc gtgcagcacg atgtccacgg cgatctcgcg ggccgcggcg 3165901 tcgacatgac gaacggtcat ggtgcgcacc ggcggccgct tcgcggtggg caggtcggcg 3165961 aagctgtcca gggtcagcgg ccggggcaac cgcccgacat cgacatcgtc gtcgacgaac 3166021 accagcttga tgtaagagtc ggtgaagtcg ctggggacga atgtgtcgaa gccgctgccg 3166081 ccgagcacta cccggaccat gtgcggcgcg aggtgtcggg tagcgacaac ctcaaaggcg 3166141 tgcaatggtc gacccgccac atgtcctcct gtccagaccc gacccgcgtc gactatacga 3166201 gccgggccgc tgcacccttg gccgcggcct gaccggcacc ggcgcgcaat attcgccacc 3166261 gcccgtcgcg acactcggcc aacccggcga cctcgaggat tgccagcgga cctagcacct 3166321 gcgcgggcag cagcccggag ccgacagcga tctcatcaat ggtagcggcg ccgcggcccg 3166381 gcagggcctc gtacacttgg cgttcggctt cgcttagcac gtcgagcgct gcgccgggcc 3166441 gcggttcatc accggccaac tcaccgatgt gaccgacgaa ctcgacgata tcgtcggccc 3166501 gggtgaccaa ctccgcgcca tggcgaagca gcgtatgaca gcccgccgat gccgaggatg 3166561 tcaccgggcc gggcaccgct gccaccaccc ggcccaatgc ccgcgcccag gcagcggtgt 3166621 tggcggcgcc gctgcgcagg cccgcttcca ccactaccgc cgccctcgcg accgcggcca 3166681 ccaaccggtt gcgggttagg aaccggtgcc gggccggacg gacaccgggc gggtattcgg 3166741 tgaacagcac cccatgttgg gcaatgcgat gtagcaacgc cgaatggccc gccggatacg 3166801 ggatgtcaaa tccgccggcc agtacggcca cggtgatgcc ctcggaatcc agcgccgcgc 3166861 ggtgagccgc accgtcgatc ccgtaggcgc caccggagac gaccgcgacg tcgcgctctg 3166921 ccaacccggc ggccagatcg gccgcgacat gctcgccgta ggccgtcgca gcccgggttc 3166981 caacgacggc ggccgcacgt ggtgccactt cgtccaggcg cgcggggccc agggcccaca 3167041 acaccagcgg cgagtggccg cacggccttg cccgggctcc ggcgccactg aaagcggcga 3167101 acgccagcac cggccactcg tcgtcgtcgg gagtgatcag acgcccaccg cggcgcatga 3167161 gtagctcgag atcgtctgcg gcccggtcta ttccgcgtcg ggcaccggtg tgctgcgcca 3167221 gctcgttacc gacctgcccg cggcgcaccc ggtcggcggc ctccacgggg cccacacatc 3167281 gcaccagcgc ggccagctgg gcgcacggcg gttcggccac ccgggacaga taggcccacg 3167341 cccgcgccgt cggatcgatc atcgtcgtgc tccggtttgc cggaagctca gggcggcggc 3167401 gacctcgtcg atgcctggcg atgtgcgacc ggccaagtcg gccaaactcc aggccacccg 3167461 caaggtgcga tccacaccgc ggatgctgag tagcccgcgg tccagcgcgg tgcgcaacgg 3167521 gagcatcgcg gcgctgctgg gccgaaactt gcggcgcaac agcggcccgc tgacttcggc 3167581 gttggtccgg aacccatgtg gccgccatcg ttgcgcggcc gcctcccggg ccagcgccac 3167641 ccgctggcga acctgcgacg tcgactcgcc gtccgcggcc gagaacgccc cggcccgaag 3167701 ccgatgcatc tgcacccgta ggtccacccg atccagcaac ggcccagaca gtttgcccag 3167761 ataccgtcgt ttggtagccg ccgcacagat gcaatcctgt ggatcggcgg gcgcgcacgg 3167821 gcacgggttg gcggctagca cgagctgaaa ccgtgccggg tagcacgcca ccccgtcacg 3167881 gcgcgctagg cggatttcac cgtcctccaa cggtgttcgc aatgcttcca gcgcgctaag 3167941 gctgatctcg gcgcactcgt ccaggaacaa caccccgcga tgcgccctgc tgaccgcccc 3168001 tgggcgagcc atccccgatc ccccgccgac aagcgccgca acgctggaac tgtggtgcgg 3168061 cgccacgaac ggcggccggg taatcaacgg tgtgtccccc gacagcaggc cagccaccga 3168121 gtggatcgcg gtcacctcca acgactcgct gcccgacagc gacggcaaca gccccggtag 3168181 acgttgcgcc agcattgttt tgccgacacc cggtggacca gtcagcatga ggtgatgcgc 3168241 cccggcggcg gccacctcga cggcgaaccg tgcttgggac tggcccacca catcggcgag 3168301 gtccgccgca gactcggggg tggtgtcggc cgtggtgatc cgcccggcca agccggtgga 3168361 cccgcgtagc cagctctgca actgccccag cgtgcgaaca ccccggacgt cgattccgtc 3168421 caccaggctg gcctcgggca ggttgtcggc cggaacgacg acggccggcc aaccgtcacg 3168481 tttggctgcc agcacggcgg gcaacacccc acgcaccgga cgcacccgtc cgtccagcga 3168541 caattcaccc agcagcagcg tgttctccag acgttcccac ggcttctttt gttgcgccga 3168601 caacaccgcc gcggccaggg cgatgtcgta gaccgagccc attttcggca gcgtcgccgg 3168661 cgacagcgcg agcgtgagcc tggccatcgg ccagctgttt ccgcaattgg tgaccgccgc 3168721 gcggacccgg tcgcgggact cctgcaatgc agcatcgggc agacccacca gatgcacacc 3168781 cggcaaccct gaggtgatgt cggcttcgat ttccacgatc tcgccgtcca gcccccgcac 3168841 cgcgaccgag aacgcacgcc ccagcgccat cagccgatcc cctgcaggtg ggtgagctct 3168901 ggggtgcggc ctgaattctt ggggccgact cgcacgccga tcacatcgat gcgcaccgca 3168961 gcccagcgct cttcctggtc ggccagccac agcccggcca ggcgacgcag gcggcgaacc 3169021 ttgcgctcgg tcaccgcgtg cgcgagcccc ccataaccgt cgccggtgcg ggtcttgacc 3169081 tcgacgaaca ccaccgtgcg ggtggcagcg tcgcaggcga tcacgtccag ctcgccgtag 3169141 cggcaacgcc agttgcggtt caagatccgc aaccccatgc tggtcaggta gtccaccgct 3169201 agggcctcgc ccatcgctcc cagctgaacc cgagtcatcg tcttcagggt tgtcatgcgg 3169261 ccaacctgca cgctggcccc gacatcacct gccacgaatc gcgtctcacc gatgccgcga 3169321 cgaccagtta tccccagtcg cggccctgtc cacagcccca gtactgcgcg ggatcacgac 3169381 accgcgtcct tgtcatcatc gtctccgtca catagcaact tctcgggtcc cggctactcg 3169441 caacgcaccg caggcggcac acgccgatcc agcaacatca tgttcggcgc cggaagagtc 3169501 ccgttaggtg attcggtccg ctctggtgta gacgttcatc gagtcccccc gcaggaaagc 3169561 caccagcgtg atcccggacg cgtcggccaa cgaaaccgcc agcgacgacg gcgcggatac 3169621 cgcggccagc accggaatcc cagccatcag cgccttttgg gtcaactcga acgacgcccg 3169681 cccgctgacc aacaacaccg aggcgccaag cggtattcgg tcacgctcga aagcccagcc 3169741 gatgaccttg tcgaccgcat tgtgccggcc gatatcctca cgcacggcaa gcatggcgcc 3169801 gtccaccccg aatagtgccg cagcgtgcag cccaccggtt ctcgcgaaaa ccttttgcgc 3169861 gcgccgaagt tggtccggca tcgccttgag agtgtcggcg gcgacggtag cgggatcgcc 3169921 gcccggtgcg aatcggctga cctggctcac cgcctgaagc gacgccttac cacagactcc 3169981 gcacgacgag gtggtgtaga aggtgcgggt gacatcgaca tcgggcggct tgacgccggg 3170041 cgccagagcc acatccaaaa cgttgtacgt gctggcccct gtggcattgc cctcgacgcg 3170101 cctgccacag tagctaacgg tcagcacgtc ttcgcggtgc gcaaccaccc cttcggcaag 3170161 cagaaagcct tgcaccagtt cgaaatccga tcctggcgtg cgcatggtca cggtaaccgg 3170221 cgtcccattg acgcggatct ccagcggctc ctcgacggcc aaggtttccg gccgggtgat 3170281 cacctgatcg gcgctgagat gcctgacccg ccgatgcgcc gttgcgtacc ccactaggcc 3170341 gttggctcca atcgcacgat gatcgccttc gacaccgggg tgttcgattg ggccgcggta 3170401 tggtcgagcg gaaccagcgg attggtctcc gggtagtagg ccgcagcatt gccgaccggc 3170461 gtcgaatatg ccaccaccag aaagtctttt gcccgccgtt cttgcagacc gccttggccg 3170521 tcggtccact ccgacaccag gtcgacacgg tcacccgccg tcaaaccgaa cgtttcgatg 3170581 tcggccgggt tgatgaacac cacccggcgt ccgcccttca cgccgcgata tcggtcgtcg 3170641 agcccgtaga tcgtggtgtt gtactggtca tggctgcgta gggtctgtag caccagccgg 3170701 ccgggcggca ccggcaccca ctgcaacgga ttgaccgcga agttagcttt gcctgtgctg 3170761 gtacggaatt cgcgcgcatc gcgcggcggg tgcggcaatt ggaatccgtc gggcacacgc 3170821 accttgtggt tgtagtcgtc acagccgggc accaccgcgg cgatggcgtc acggatggtg 3170881 tcgtagtcat ctgcgaaccg ttcccatggc accggatgtc cggggccgaa caaggcgcgg 3170941 gccagctggc agatgatctg cacctcgctg cgcacctgat cgctgggcgg gtgcaggcta 3171001 ccacgcgaca gatgcaccat cgacatcgaa tcctcaaccg acaccaattg tttgcgacca 3171061 ttgcgggtat cgcgatcggt ccgacccagc gtcggcagga tcagcgcggt ggcgccgtgg 3171121 acaaggtggc tgcggttgag cttggtcgag acttgcacag tcagcgcgca cctgcgcaag 3171181 gccgcctcgg tgacggcggt gtcgggggtg gccgacgcga agtttccgcc catgcccatg 3171241 aagacgctga cccgaccgtc gcgcatggcc cggattgcgg ccacggtgtc aaagccgtgc 3171301 gctcgggggc tggtaatgcc gaactcacga tccagcgccg ccaggaactg ctcgggcatc 3171361 ttctcccaga tccccatcgt gcggtcccct tgtacgttgg aatgcccgcg caccgggcac 3171421 acccccgcgc cgggtttgcc gatcatgccc cgcagcagca gcacgttggt gacctcaccg 3171481 atggtggcca cggcgtgggc gtgttgggtc aagcccatag cccagcagat gaccgtgcgc 3171541 tgcgacgcca tcaacatcgc ggcgacccgc tgaagttgcg cgagttcgat gccggtggcg 3171601 tccatcacgg tgtccaagcc gacctgcaga gtccggcggc ggtacccgtc gaatccggca 3171661 caatggttgt cgacgaacga ccggtcgaca acgctgccgg ggaccctctc ctcggcctcc 3171721 aacaacaacc tgcctaaccc ggcgaacaat gccatgtccc cgccgaggcg gatctgcacg 3171781 aactcgtcgg cgatcgggat accatgtccc acaaccccgt tcaccttctg cggatctttg 3171841 aaccgaatca acccggcctc gggcagcggg ttcacggcga tgatcttggc gccgttggcc 3171901 ttcgctttcc ccagcaccga cagcatgcgg ggatgattgg taccggggtt ttgtccggcg 3171961 atcacgatca ggtcggcgtg ctcgacgtca ccgatggtca ccgagccttt tccgattccg 3172021 atcgagtcgg tcagcgccgc acccgaggac tcgtggcaca tgttggagca gtcgggcagg 3172081 ttgttggtgc cgaaagagcg cacgagcagc tggtaacaga acgccgcttc gttgctggtg 3172141 cgccccgatg tgtagaacac ggcccggtcg ggactgtcca acccgttgag ctgctcggcg 3172201 atcagctgat aagcggcatc ccagctgatg ggccggtagt ggtcatcacc ggggcgcaag 3172261 accatcgggt gggcgagccg gccttgctgg gacagccaat attcgggctt cgcggacagc 3172321 tccgccaccg agtgccgagc gaagaactcc gcagtgacgg tacgcttggt ggcctcttcg 3172381 gcgactgcct tggcgccgtt ctcgcagaac tcggccagct tgcgtccgcc gggctcctcc 3172441 ggccacgcgc agcctgggca gtcgaagccg ttacgctgat tcaaccgagc cagcgccgcc 3172501 gcggtgcgca gcgcgcccat ctgctgcatc ccccgctgca gcgataccat caccgcccgc 3172561 acgcccgcgg cctcgcgttt gcgcggcgcc accgttaccg cctgctcgtc atagtcggcg 3172621 aggacgtcgc gagacgccgc cgaccgctgc cacctcaccg cctcaacgta catccacgac 3172681 cgaccgactg ccgcacacag ccgattgacg tgtgacggcg cttggggcag ctattccggc 3172741 aggcgcagct cgggtttttc gacttcctcg atgttgacgt ccttgaacgt gaccacccgc 3172801 acctgtttga cgaaccgtgc cggccggtac atgtcccaca cccaggcgtc agccagccgc 3172861 agctcgaagt acacctcacc gtcggtattc cgcggcacca tctccacact gtttgccaaa 3172921 tagaaacgtc gctcggtttc tacgacgtag ctgaactggc cgacgatgtc cttgtattcg 3172981 cgatacagcg agagctccat ctcggtttca tacttttcga gatcctctgc actcatctgc 3173041 tcagacgtcc ttctccctgc cggttccccg gcttccccgc tcagtgcccc taagtgccct 3173101 gagcgcgacc cgtggcccgc attgtcgctg ggtgggaact cttgctccat cttccctcac 3173161 ccgtctgtgc cgtcccgtcc cgagggtcgg gttggccgtc ggcgacctct gcggtgttcg 3173221 acccactcgc cacccggcga acattgatga acgagtaacg gtgctgcggg cagggtccca 3173281 atcgggccag cgcccggctg tgcgccgggg tgctgtaacc cttgtgctcc gcgaaaccgt 3173341 acccggggtg atcggcgtcc aacgcaacca tcacgcggtc ccggctgacc ttggcgagca 3173401 cgctagccgc ggcgatgcag gcggctgccg cgtcgccacc gatcaccggc aacgacggca 3173461 tcggcagtcc tggcacgcga aagccgtcgc tgagcacata accgggccgc accgccagac 3173521 cggccaccgc gcgccgcata ccttcgatat tggccacgtg cacgccgtgg cggtcgacct 3173581 cggccgacgg gatgaacacc acgtgatagg ccaccgcata ccggcagatc agcgggaaca 3173641 gcttctcccg cgcttgctcg ctgagcttct tcgaatcatc aagggcggca agacttgcta 3173701 tccgcccggg gccaagcacg caggccgcga ccaccaacgg gccagcgcag gcgccgcgac 3173761 ccacttcgtc gaccccggcc accggcccca gaccaccacg atgcagcgcg gactccaggg 3173821 tgcgcattcc ccgcaaaccc ccagatttac ggatcaccgt ccgcggtggc caggtcttgg 3173881 tcatattcca gccatggcta ccgaccttgc tggggattca ccgaacgcac aacaccccaa 3173941 cgcgacggcg gccacacgat caacctggcc ttaccgatga cgttggccac cggcacggtc 3174001 cccggtagcg gatcgttagt acatagcaac gggcagtgag cgcgggaatc cgccgaatgg 3174061 gtgcggttgt cgcccatcac ccagacacgc ccgggcggga cggtgaccgg cccgaactcg 3174121 ctgcccaggc acgggtatat cgacgggtcg gccatcatgg tggccggatc caggtatggc 3174181 tccttcagtg gcctgccgtt gaccgtcagg ccggtgtcgg accggcattg aaccgtctgt 3174241 ccgccgaccg cgatgacacg cttgaccagg tcgttctcgt cgggaggcac gaaaccgatg 3174301 aacgacaacg cgttctgcac ccagcgcacg gcgacgttgt gcgaacggat cgacttgtaa 3174361 ccaacgttcc acgacggcgg tcccctgaag acgatgacgt cgccaggttg cggtgagccg 3174421 aagcggtagc tgagtttgtc caccatgatg cggtcgccga cgcacgtcga acacccgtgc 3174481 aacgtgggtt ccatcgattc cgacggaatc agataagggc gcgcgacaaa cgtcagcatg 3174541 acgtagtaga gcaccacagc aatcaccgcc agcaccgcga actcccgcag cgttgatcgc 3174601 ttcgcgggcc gcggctcgtc cgttttggcc gccttggagt cgccttcgga gtccgcatcc 3174661 ggggctgcgt cgaacggggc tgcgtcgaag acctggccgg caatgtccgg gtcccgggag 3174721 gagagctccg gctctgccgg acccggctgg cgctccgatg gggagtccgt ggtttcggtc 3174781 acgagatcag cgtagccagc gcaggtggcg gctttcgaac atcgccgaga cgttcccggt 3174841 cagcgcttct ccttgatctt ggccttcttt ccgcgcagtt cgcgcaggta gtacagcttg 3174901 gcgcggcgaa catcgccacg ggtcaccacc tcgatatggt cgatgttcgg cgagtgcacg 3174961 gggaaggtcc gttcgacgcc gacgccgtag ctctccttgc gcaccgtgaa cgtctcgcgg 3175021 atgcccccgc cctgccggcg gatcaccacg cccttgaaca cctggagacg ttccttggcg 3175081 ccctcgatca ccttgacatg cacgttgatg gtgtcgcccg ggttgaacgc cgggatgtcg 3175141 tcgcgcaacg acggcttgtc gacgaagtcc agccggttca ttggaaatga ccatccttgg 3175201 ggtcgcggcg tggttacccc ccacacgcag cgtgcggtgg tcaccaagcc ggtgggttcg 3175261 ggcttattgg tgcatctcgc agcaggcggc acgcaacccg gccgaccacc gcgacagaca 3175321 actgctcaat tgtgccagac ggtacgcatg cagtgaaatc acaggaaatc tccggtggtt 3175381 cgcggccgtc gaaaagcgcc cgcaaatggt acacatgact tacatatgac tagggtcaaa 3175441 ccgcgcgtgt ggaaacccga agcttggcgt gacacccaac agagggcact taagagggca 3175501 atgcggccgc ctacctgcac gttttcgcga tgtcagagga tgccgaggga gaacaatgcg 3175561 agcacggccg ctgacgttgc tcaccgcttt ggcggcggtg acattggtgg tggttgcggg 3175621 ctgcgaggcc cgagtcgagg ccgaagcata tagcgcggcc gaccgcattt cgtctcgacc 3175681 gcaagcgcga cctcagccgc agccggtgga gctactgctg cgcgccatca cgccgcctag 3175741 ggctccggcg gcgtcgccga acgtcgggtt tggcgaactg cctacccggg tccggcaggc 3175801 aaccgatgag gccgccgcca tgggcgccac cctctcggtg gcggtgctcg atcgcgctac 3175861 tggccagctg gtctccaacg gcaacacgca gattatcgct accgcgtcgg tggccaagct 3175921 gttcatcgcc gacgatctgc tgctggccga ggccgagggc aaagtcacat tgtccccaga 3175981 ggaccatcat gcgttggacg tcatgctgca gtcatccgac gatggtgcgg ccgagcgatt 3176041 ctggagtcag gacggcggca atgccgtcgt cactcaagtc gcgcgccgat atgggctcag 3176101 gtcgaccgcg cctcccagcg acgggcgctg gtggaacaca atcagctccg cgccagacct 3176161 gatccgctac tacgacatgc tgctcgacgg gtccggcggc ctaccactgg atcgggccgc 3176221 cgtcatcatc gccgacctgg cccagtccac accgaccggg atcgacggct acccgcagcg 3176281 gttcggcatc cccgacggtt tgtacgccga accggtcgca gtcaaacagg gctggatgtg 3176341 ctgtatcggc agcagctgga tgcatctgtc caccggggtg atcggcccgg aacgccgcta 3176401 catcatggtg atcgagtcac tgcagcccgc cgacgacgcc accgctcgag caaccatcac 3176461 gcaagccgtc agaacgatgt ttcccaacgg ccggatctga cgctcgtccg gtcgcctcac 3176521 cggcgcgagc agacgcaaaa gccaccgcac gttcggcgtg tcgggggatt tcgcgtctgc 3176581 tcgccagcgg ggctagtcgg ggtgggacag gtcggggcgt cgttcgcggg tgcgctgcag 3176641 cgagacctct ctgcgccagg cggcaattcg ggcatggtcg ccggagagta ggacctcggg 3176701 tacatcgagg ccacgccagc tcgccggccg ggtgtagctc ggaccctcaa ggagcccgtc 3176761 caggcccgtt gagtgcgaat catcttggtg ggaagcggga ttgccgagaa caccggccaa 3176821 cagtcgcagc acggcttcga ccatcaccac ggccgccgac tccccgccgg gcaatacgta 3176881 gtcgccgatc gagacttctt cgacgcgcat tcgccgggcg gcatcctgca cgacccgctg 3176941 gtcgatgcct tcgtagcggc cgcaggcgaa caccagatgg ctctcggtgg tccagcgctg 3177001 ggcggtggcc tgggtaaaca acacaccggc gggcgtggga acaatcaaca acgtttcgct 3177061 ggaacaaatt tcgtcaagcg cttcacccca caccggcgcc ttcatcacca ttcccgggcc 3177121 gccgccgtag ggtgcgtcgt ccaccgagtg atgcacatcg tgggtccagc gccgcaggtc 3177181 gtgcacgtta aggtcgacca ggcccgattc gatcgccttg cccggcaacg actgtcgcaa 3177241 cgggtccagg caggcgggga agatcgtcac gatatcgatg cgcacgcctt actccagatt 3177301 cagcaagcca tggggcggat caatctcaac gatgccgtcg tccaatgaca ccgacgtgac 3177361 gatggcacgc acaaacggca ccaaaacctc atcggaatca cgcttgaccg ccagcaactc 3177421 accagcggcg gtgtgcacca cttcggtgac gacaccaaca ccctcccccg tcgccgtctg 3177481 gaccataagc cccaccagct ggtgatcgta ataggtgtcc ggctcgtcga tcgggggcaa 3177541 gtcatcggcg tcgatcacga acaagctgcc gcgcaacgca tcggctgcgt ctcgatcggc 3177601 cactccagcg agtcgcacca acaggcggcc gccgtgctgc cgcacacttt cgatgacgta 3177661 actcaccgca ctgccctcgg caccaccgtc aaaaggcccc ttagcgcgca acctggtacc 3177721 cggcgcaaac cggtcagctg ggtcgtcggt gcggatctcg acgacgacct cgccggtgac 3177781 accgtgcgac ttcaccaccc gcccgactac cagctccatg agcggggctc cgctactggt 3177841 cggtgtccac cacgtcgacg cggataccgc ggccaccgat accggctacc agagtgcgca 3177901 atgcggtagc ggtgcgtccc ccacgaccga tcaccttgcc caggtcgtct ggatgaacgt 3177961 ggacttcgac ggtgcgcccc cgccgactgg ttatcaggtc tacccggaca tcgtcaggat 3178021 tgtcgacgat cccacggacc agatgctcaa cagcgtcaac gacgacggcg ctcatttccc 3178081 cgtcagcttt ccgccgtcag ctcggcctgc tcgccaccca gcgccggcgt gtccggctgc 3178141 tcaggctgcg gcgctggctc agcagccttg gcagcctttt tggccggcga cttcttcttc 3178201 ggtttggtgg cctcggtggt aggaccaccg tcggcggcgg ccaacgcggc gttgaacacc 3178261 tcgagcttgc tgggcttggg tgcggcgacc ttcaaccggc cctgagcgcc aggtaggccc 3178321 ttaaacttct gccaatcccc ggtgatcttc agcagcttga ggacgggctc ggtgggctga 3178381 gcacccaccg agagccagta ctgggcacgc tcggagttga tctcgatgag actcggctct 3178441 tctttggggt ggtaccggcc gattacctcg atcgctcggc cgtcgcggcg ggtgcgcgca 3178501 tcggcgacgg cgacgcggta ctgaggattg cggatcttgc caagccgagt gagcttgatc 3178561 ttcacagcca tgattgagcg ctcctattgg tgtcacgctg caattcagcg acccgggcgg 3178621 gatgcccgga tccggttttg cctcgcgtgt atgaccaccg ggcggcaacc ccgaacagga 3178681 caagtcgtcg cgcggtggac agccgccaat tgtgccagaa cgtgatgctg gggcagtaat 3178741 tcgcccagcg ggcttcacat cattttctgg aagcacttgg tttcgacccg cctgatgatc 3178801 cgccagccat cgggggtgcg cacgaaatcg tcgtcgtacc acagtccaca gaacagcact 3178861 tgctgccggt cgccggcgaa caccatcggg ttgaagcaga tcacccgcga cgacgcggta 3178921 tcgccgtcga cacggaccga gaagttgccc aacatgtgcg catataccgg gaagtttccc 3178981 agcacctgcg acagccattg cttgatcttc ggatacctgc cgtcgatgcc acctagcgcg 3179041 cgatagtcga tataggcgtc gggggtgaac acccggtcaa gatcgtcgaa tcggcgctgg 3179101 tcaatcgcgc tggagtagtc caccagcaac tgctggattt ccaaccggtc ggaaatttcg 3179161 gccacgctca acatgctccg atccaacacc gcacacatcg gccggacagc ccccgaccag 3179221 cccgagaata ggcctaccgg agccctggaa gttaaactct gcgcccatgc gaaagctcat 3179281 gaccgcgacc gccgcgctct gtgcctgcgc agtcaccgtc agtgcgggtg ccgcgtgggc 3179341 cgatgccgac gtgcagccgg ccggctccgt gccgatcccc gatggcccgg ctcagacctg 3179401 gatcgtggcc gacctcgata gcggtcaggt gctagccggc cgcgaccaaa acgtggccca 3179461 tccgcccgcg agcaccatca aggtgctgtt ggcgctggtg gcactcgacg agctggacct 3179521 gaactccacg gtcgtcgccg acgtcgccga cacacaggcc gagtgcaact gcgtcggcgt 3179581 caaaccgggg cgcagctaca ccgcgcgcca gctgctcgac ggcctgttgc tggtgtcggg 3179641 caacgacgcc gccaacacgt tggcgcacat gctgggtggc caagacgtca ccgtggccaa 3179701 gatgaacgcc aaagccgcca ccctaggtgc gacgtccacc cacgcgacga cgccgtccgg 3179761 cctagacgga cccggcggct ccggggcgtc caccgcgcac gacctggtgg tcatcttccg 3179821 ggccgcgatg gccaatccgg tgttcgcgca gatcaccgcc gagccctcgg cgatgttccc 3179881 cagcgataac ggcgaacagc tgatcgtcaa ccaggacgag ctgctgcagc ggtacccggg 3179941 cgcgatcggc ggcaagacgg gctacaccaa cgccgctcgc aagacgttcg tgggtgccgc 3180001 cgcccgcggc ggccgccgcc tggtgatcgc catgatgtac gggctggtca aagagggcgg 3180061 accgacgtat tgggatcagg ctgcgaccct gttcgactgg ggtttcgccc tcaacccgca 3180121 ggccagcgtc ggctcgctct agcaccgcga gcagacgtgg gcgctggtgc gcccatcatg 3180181 ttcttttgcg tctgctggcg ctcataggcc ggcggtcagc agcgccgtca gcatgggaat 3180241 ccgctgttcc tcgagttcgg gctgcggcag tacccctcgc acaatggcgg cgccgtcgaa 3180301 gacgttggtc atcaacgcca cgatgaccgg aaatgtctcc tccggaaagc tctcggcacc 3180361 cggcagagca cgcgcggcgt cgtggatctt cgcgctgtac tgccccagca cattctgcag 3180421 cgtctccttg agcttctcgt cggtgcgcgc ggcgaccata agctcgtaga gcaccgcatt 3180481 cgtggagccg gccgtgatgt cccgcaaaat cgtcagcgcc gccggaagcg ccggccgatc 3180541 ggccggtatt tcggcgactt gcttggtgaa cgtttccagc tgacggcgca acacctcgta 3180601 tgccgtggcc gccatgaaat cacccatcgt ttcgaagtgc cggaacaggg cgcctaccga 3180661 caccccagcc cgcttggtga tcacggcagc cgatgcccgc gcgtagccga cctcgatgat 3180721 cgtgtcgatg ctggcctgca gaagccgtgc aacggtttct tcgcggcgct gctgctgggt 3180781 cctggccatg tcaggcagaa cggctcagag cggcgccgag ctcacccgcc cgcaggtagc 3180841 gacccgactt cacgttctgc ccgtagccgt cgcggaactg acctccgaac tggcctccgc 3180901 ggaataccac ggtgccgccc acgcccgtcg caaccacggt cgcatcgttg cggttgacca 3180961 tgcgtcgcag accgccatag tagggcaccg cctcctcgtg gtacccgtcc accgattcat 3181021 ctaggtgggt agggtcaatc accgcgaagt ccgcacggtc accctggcgc aacgtgcccg 3181081 cgcctatacc gaaccactcg gccaactcac cggtgaggcg atacactgcc cgctcgatgg 3181141 acagaaacgg ttgtccggcc cggtcggcgt ctctggctcg tttgagcagc cgaagcccga 3181201 agttgtagaa cgccatattg cgcaggtgcg cgccggcgtc ggagaagccc atgtggacac 3181261 tcggttcggc ggccagcttg ttcagctggt tgggccggtg attggcgacg atggtggtcc 3181321 atcggacatt gcgctccccg ttgtccacca gcacatcgag gaacgcgtcc agcgggtgca 3181381 gcccgcgctc gtcggctatt gccccgaaac tcttaccgat caacgactta tccgggcatt 3181441 cgacgatcac ggcgtcgtgg aagtcccgat gccacaacga aggtccgagc ttgatgcgat 3181501 cgaactcgcg ccggaacgac cggcggtaag acctgtcggc caggagctcg ttgcgctgca 3181561 gttggtcacg cagatgaagg gccgccgttc cggcgccgaa ctcctcgaag accggcaggt 3181621 cgatgccgtc ggagtacagc tcgaacggga ccggcagatg ctggaatcgc acctgagagc 3181681 ctaagagctt gttcagcacg cgggtgccca acccgaacac gtgtaccgcc agcggcatcg 3181741 acttggcgtc ggcggacacc aacatgctca ttcgaacgcc cttgcgccgg ttgaatatcc 3181801 ggctgctggc caagaaaaac agcagcgcgg acaccgggtt gtcgacgtcg ggtgcgctct 3181861 gcagtatccg gccccggtgg cgcagcaccg agatcagctt gcgacgctcc cgccaggtcg 3181921 cgaaggtgga cggcagcgca cgcgagcgga agcggtcgcc gtcgagcttg tcgatagcgg 3181981 cgtccatccc ggacatgccc agcatcccgg cctcgagcgc ctcatcgagc agtttcgcca 3182041 tcttcgccag ctcggcttcg gtgggccgga cggtgtcgtc ggtggcacga tcaaggccca 3182101 gtaccgcggt ccgcagatcc gaatggccaa gcagtgaact cacattcggc ccgaggggca 3182161 gggcgtcgat cgcttcgatg tactccgcgg gcgtcgacca cgtctggttg tcccgcaggg 3182221 cacccaggac aaattcgcgg ggcaccgctt caacacggct gaacaggtcg gcggcatcct 3182281 cggagttggc gtagaccgtc gacaacgagc agtttcccag cagcaccgtg gtgacaccgt 3182341 ggcgcaccga ctcccgcaaa ccaggatcga gcaacacctc ggcgtcatag tgggtgtgca 3182401 cgtcgatgaa gccaggcacg acccacttcc ccgccgcatc aaccacctcc gggcagccgg 3182461 tctcgtccag tgcgccggca gccaccgtgg ccaccacgcc gtcgcgaatg cccagagtgc 3182521 gagtcaatgg cgcattgccg gtgccgtcga accacagtcc gtcgcgaatg atcacgtcgt 3182581 aggtcaccgt ttcctccaga tcgttgagtt gccgccaagc taacatagat agcgatcact 3182641 cgcaatcttt ttggctgacg ccgcttcgct gccgcggcgc tggtcaagtg ggtgtcagcg 3182701 accgggcccc ggcgccgttg tggtcggcgg cgtcagggtg gctgtggacg ttgtatcggg 3182761 ggtatcagga atcgttgctg ggtccggcac ggtgactgcc gggggaagat caccactgcg 3182821 gctggccacc gcgggaatcc gaatgaccgc tccttgttgt ccacattcgt tgctatgcac 3182881 agtgacgacc atttcgccga cgaggtcgcc ctgcggttgc ggtcgcagtg cgagcaactg 3182941 cgtcgtggcc tgtgtactcg gcgagccgtt gggcccgacg cacgggaact gcaccgtctc 3183001 cggccgcgac ttccactggc cctcgccgaa ctgcatgagg aacggcctaa cgggcggagt 3183061 cttggcctgg gtgtggtcgt tgtcgtcgag catcgttgcg gccgcgagac attcggtcgg 3183121 agtgcacgaa gtgcggaacg cccaccaggt gttcacgtcc ggcggttgcg gcgtaggggt 3183181 gtagtcgtag gtctgctttg agcgttggat ctcgatgcgg tatgtgccgt ccagtgggac 3183241 cggcgcggtg accgcgacgg tggtcgtcgg ggcgctgggc acagccgacc cactggtcgg 3183301 cgggcgcgcg acttcggtgg cggtcgtgtt cgtcttgcgc ccaatcacga tgccgaccgc 3183361 gaacaggcca gccaatagca acaccgctac cgcaccgacc aggatccggc gtggccggcg 3183421 cctggtgggg ctcgccggag ccttggtggc ggtggagaag ttgtccaggc ggcgcgccag 3183481 caccccggcc gctgactgca gcatcgagcc gcgccgttgg ggggtcgggg ccgccggcgc 3183541 cggtgcccgg gcggatggtt ctttgcagtc gacggcctcg ggccaaccat aagccgggta 3183601 gtcgacgaca tacgcttcct caccggccgc cgcggtgacc tcagaagcgt cgacaccccc 3183661 cgagctctga tcagcgatcg cgacgccggc ctgttcgttc atcgcgtcgg cgaactcgcg 3183721 gcagctgccg aaccggtccg cgggcgctgt ggcgagcgca cgcgagagga caccgtcgag 3183781 gcgtgccagg tccgggcgga aggcggagag cttcggtggc tgcagcggtc cggtgtgcga 3183841 acgatcaacc ggcggcgcac cggcgaacag gtgtatggcg gtaagcgcca acgcgtactg 3183901 atcggcacgc ccgtcaacgt cggcccccgc cgacagttcg ggcgccggat agctgggttg 3183961 gctggcaatt ccgaagtcgg ccaacaggat ccgttggtcg ccagcactct gactggttag 3184021 cacgacgttg gcggggttga cgtcacgatg cagcaggccg cgctggtggg cgtagtcgag 3184081 agctccggct acggcagtga cgatggcgag tacctcacca accggcaaga ccgccggaaa 3184141 ccggtcggcc atatgctgcg tggcgtcgat gccatcgacg tagtccatcg caatccacag 3184201 ctgcccgtcg aactcaccgc gatcatgaac ttccaggatg tgcgggtgaa atagccgcgc 3184261 ggcaacctcg gtctcccgtt gaaatcggcg gcgaaattcg tcgtccgcag ccatcgccgg 3184321 cgaaagcacc ttcagcgcct gccagccggg gaatccggga tgttgcacga ggtagacctc 3184381 acccatcgcg gaacaaccca gcatccgcac gacggtgtag ccggcaaagg tcacgccgct 3184441 ggccaacgcc attggccgat agtaaccgcg ttcggcacgg cccgcgcggc caaagctagg 3184501 gcccaaaagt cctgccgcgc aaaatcacca gatcgggatg ctgcagcaca cccggaccct 3184561 gccgcggatc ctgggcgtag cacaacaggt cggccgatgc cctatcgtcc agcccgggcc 3184621 ggcccagcca gcgtcgagca tcccagcacg ccgcgcccaa cgcctcgtgc gcggtcatcc 3184681 caatcctctg cagcgccgct acctcgtcag cgatccgtcc gtgctcgatc gtgctgcccg 3184741 catcggtgcc cgcgtatacc ggcacccccg cctcccgcgc cgcagcgacc cgcccatagc 3184801 cgcgggcata caggtcgcgc atgtgcgcgg cataggttgg atagcgccct gccgcatcgg 3184861 caatgcccgg aaagttttcc aggttgatca gcgtggggac caacgcggtg ccgtgctcga 3184921 gcatcaaggc gatggtgtcg tcggtgaggc cggtgccgtg ctcgatgcag tcgatgccgg 3184981 cgttgatcaa gccgggcagc gcgtcctcgc tgaaaacgtg cgcggtgacc cgggcgccct 3185041 gagcgtgtgc cgtgtcgatg gcggctttga gcacgtcatc ggaccacaac ggggcaagat 3185101 cgccgatttg acggtcgatc cagtcaccga ccagcttgac ccagccgtca ccgcggcggg 3185161 cctgctcggc taccgctgcc ggcagctggg attcgtcttc gagctcgacc gcgaagccgg 3185221 cgatgtaacg cttgggtctg gccaggtgcc gtccggcgcg gatgatgcgg ggcaggtctt 3185281 cgtggtcgtc aaggccgcgg gtgtcggtcg gcgagccgca gtcccgcaac agcagcgcgc 3185341 cgacgtcacg ttcggtctcg gcctgagcga tcgcctcgtc gagttcgacg ttgccgtgtt 3185401 tcccaagccc gacatggcag tgcgcgtcga ccagcccggg caggatccag ccgccgtcaa 3185461 agacggtgtc ggctcctgcc accggttcgg tgctaatgcg gccgtcgacg atccacagtt 3185521 ggatcgccgt ctcgtcgggc aggcccaaac ctcgcacgtg caggcgcacg gcgcggctac 3185581 ggggcctgat ggtgtcgacc cgcttcaccc ggctccgccg cactcgcgat cgccactact 3185641 tcttgcctgg gaacttcagc ttggacaggt cgaagtcggc caggccgggc ggcagctcgt 3185701 cgagaccttt gggcatctgt gagagatcag ggagcccccc aggtagccca gccaagcccg 3185761 gcatcccggg cacgccgaac gggctcttga ccttcggcgg cgtcggaccg cgcgtcccct 3185821 tcttactctt cttgccggat ttgccttttg cgcccttgct ctttcgcgtc gcggatttgc 3185881 gccctatgcc cggtatgccc atgcccccga gcatggacga catcatcttg cgggcttcga 3185941 agaagcgctc gaccagctgg ttgacctcgg acaccgtgac gcccgagccg ttggcgatgc 3186001 gcagccgccg cgaggcattg atgatcttgg ggtctgcccg ttcctgcggc gtcatgccgc 3186061 gaatgatggc ctggacacga tcgagttgtt tgtcgtcgac ctcggccaac gcgtccttca 3186121 tctgagccgc gccgggcagc atgcccagca ggttgccgat cgggcccatc ttgcgtaccg 3186181 cgagcatctg ctcgaggaag tcctccaggg tcagctcgcc ggcgccgatc ttggctgcgg 3186241 cctcctcggc ctgttgtgca tcgaagacct gctcggcctg ttcgatcagg ctcagcacat 3186301 cgcccatgcc caagatgcga ctggccatcc ggtccgggtg gaagacgtcg aagtcctcca 3186361 gcttctcccc ggtggaggcg aaaaggattg gaacaccggt cacttcgcgc accgataacg 3186421 cggcaccacc gcgggcgtca ccgtcgagct tggtcaaggc cacaccggtg aacccgacgc 3186481 cctcgccgaa cgccgcagcg gtggtgaccg cgtcctggcc gatcatcgcg tccaggacga 3186541 acagcacctc gtcggggttg atggcgtcgc ggatggccgc ggcctgggcc atcagctcct 3186601 cgtcgatgcc cagtcgtccg gcggtgtcga cgatgacgac gtcgaagtgc ttggcccggg 3186661 cctcggccag cccggccgcc gccaccgcaa ccgggtcacc ggggccggac tccggcgagg 3186721 cacccggatg cggcgcgaac accggcactc cggcacgctc gccgacgacc tgcagctggt 3186781 tcaccgcggc cggccgttgc aggtcacaag cgaccagcag tggcgtgtgt ccttgtccac 3186841 gcaggcgggc ggccaatttg ccggccagtg tcgtcttccc ggagccctgc aggccggcga 3186901 gcatcacgac ggtcggcggg gtcttcgcaa acgccaactc gcgggtttcg ccgccgagga 3186961 tgcttatcag ttcctcgttg acgatcttga cgacctgttg agccgggttg agggcacttg 3187021 acacctcggc cccgcgggcg cgttctttga tccggtggat gaatgcccgg accaccggta 3187081 gcgaaacatc ggcttccagc agcgccaacc gaatttcgcg ggtagtggca tcgatatcgg 3187141 catcggtcag tcggcccttg ccgcgcagcc cctgcagggc ggcggtcaaa cggtcagaca 3187201 gcgattcaaa cacgcccgcc agcctaatgg tgatcgcgag cgccgcgcag cggcaccgtt 3187261 atccgttgac tctgcgtcca ccacgcaaaa gtgcgagtaa cccgcctggt ggacgcagag 3187321 tcaacacgat gcgacgtcgg acctgcgccg aaaagcgttg ccatgctaca tttcaccgcc 3187381 gccacctcac ggttccggct ggggagggag cgggcaaatt cggtccgtag cgacgggggg 3187441 tggggagtct tgcagccggt cagcgcgacc ttcaaccctc cgttgcgggg ttggcagcgc 3187501 cgggcgctgg tgcagtacct gggcacccag ccgcgggatt tcctcgcggt ggccactccc 3187561 ggatctggca agacatcgtt cgcgctgcgg atcgcagccg aactactccg ttaccacact 3187621 gtcgagcagg tcaccgtcgt cgtgcccaca gagcacctca aggtgcagtg ggcgcatgct 3187681 gcggcagcac acggcctttc ccttgaccca aagttcgcca actccaatcc gcagacctca 3187741 ccggagtatc acggcgtaat ggtcacctac gcccaggtcg cttcgcatcc cacgctgcac 3187801 cgagtgcgta ccgaagcgcg caagacgttg gtggtcttcg acgagatcca ccacggcggc 3187861 gacgccaaga cctggggaga cgccatccgg gaagctttcg gtgacgccac ccgccgcctt 3187921 gccctgacgg gtacaccgtt tcgcagcgac gacagcccaa tcccgttcgt cagctaccag 3187981 cccgacgcgg atggcgtgct gcgttctcag gctgaccaca cctacggcta tgcggaagcc 3188041 ctcgctgacg gtgtcgtccg gccggtggtc ttcctcgcct attcggggca ggcgcgctgg 3188101 cgggacagcg ccggcgagga gtacgaggcg cgactgggcg agccgctgtc tgccgagcag 3188161 accgcgcggg cgtggcgcac agcgctcgac ccggaaggcg agtggatgcc ggcggtgatc 3188221 acggcggccg atcgacggct ccgacaactg cgtgcgcacg tacccgacgc gggcggcatg 3188281 atcatcgcct cggatcgcac cacggcccgc gcttatgccc gcctgctcac cacgatgacg 3188341 gccgaagagc ccacggtcgt gctctccgac gaccccggat cgtcggcgcg tatcacggaa 3188401 tttgcccagg gcaccagccg ttggctggtc gcggtccgca tggtctccga aggtgtcgac 3188461 gtgccccggc tttcggtcgg ggtttacgcc accaacgcct ccacgccgct gttcttcgca 3188521 caggccatcg gtcggttcgt gaggtcccgc cgaccgggtg aaaccgcgag catcttcgtg 3188581 ccgtcggtgc ctaacctgct gcagctggcc agtgcgttgg aggtgcagcg taaccacgtg 3188641 ctgggccgac cgcaccgcga atcggcccac gatcccctcg atggtgatcc cgccaccagg 3188701 acgcaaaccg agcggggcgg cgcggagcgg ggctttaccg cgttgggggc cgatgcggaa 3188761 ctcgatcagg tcatcttcga cggttcctcg ttcggcaccg ccaccccaac cgggagcgac 3188821 gaggaggccg actacctagg catccccggg ctgctcgatg ccgagcagat gcgcgccctg 3188881 ctgcaccgcc gccaagacga gcagctgagg aaacgggctc agcttcagaa aggggccacc 3188941 cagccagcaa cgtcgggggc ttcggcatcg gtgcatggcc aactgcgcga cctgcgccgc 3189001 gagctccaca cgctggtgtc gattgcgcac caccgcaccg gcaaaccgca tggctggatc 3189061 cacgacgaac tgcgccgccg ttgtggcggg cctccgatcg ccgctgccac ccgcgctcag 3189121 atcaaggcac gcatcgatgc gttgcgacag ctcaactccg agcggtcatg agcgtgcgat 3189181 cctaatcgcc gacgggttcg tcgaccacaa cgtcgacgct ggcgcccaac acctccagca 3189241 ggtgttgctc gaccgcggct ctcgcgtcca actccgcggg gaccgtcaca cagaacacat 3189301 cggccgcggt cgatccgaac gtattgacct tcgcccagac aatgccggct cccgcgccct 3189361 ccagcgcccc ggccagcaac gcgagcaaac ccgcccgatc catggcccga acttcgagga 3189421 tcagcttggc cggcgcggcg gtgtcgagcc acaggatgcg gggcggagcg gccgtacgag 3189481 tcacgggcac cccggcctgc acgtccccgg cccgagcgga taccaagctg gcggcatcgc 3189541 tgtcccgctt ctgcagcatg cccagcacgt cgacgtcgcc gttgagggca ccgacaaact 3189601 gctgacgcac caactccgcc gcgggcgggg acccaaacag tggtgacacc acaaactcgg 3189661 ttatcgcgac accctggtgg acgttgaccg acgccgaatg tacgcgcagc gagttcagcg 3189721 ccagcaccgc ggcggctttc gacaccagtc cccgctcgtc cggcgccact attacggcgt 3189781 cgatgcgttc accgtcgcgc ggactaatct ccacatgcac cccgtggtcg gccgccagcg 3189841 aaagataatg gggtgcagtc ggttcggctt gaggcagcga ctctccggcc atcaccatcc 3189901 ggcagcgacg caccaggtca tcgaccagtg acgccttcca atcgctccac accccggggc 3189961 cggtggcctt cgagtccgcc tccgacaggg cgtgcaaaac ttcgagcagt tgcggatccc 3190021 cacccagcgc ctcggacacc gcctcgatgg ttttggggtc gtttaagtca cgtcgggttg 3190081 ccgtaatcgg cagcagcagg tggtggcgga ccagcttgga gagcgtccac acgtccggcg 3190141 gcgacaaccc cagcctggtg caaaccggga ttaccaattc ggccccgagc acactgtgat 3190201 cggtgccccg tcccttgccg atgtcgtgca gcagcgcgcc aagcgcaagc aggtcgggac 3190261 gtgccacccg ggtggccagt ggcgccgcat gcaccgcggt ctcgaccacg tgtcggtcaa 3190321 ccgtccactt gtgggcgacg tcgcgcggcg gaaggtcgcg aatgggctcc cattccggca 3190381 acaaccggcc ccagagcccg gttcggtcga gcgcttcgat ggtagccacc gtggtggggc 3190441 cggcggagag cacaactagt aagtcgtcca atgcctcttg cggccaggga gtcggcagat 3190501 ccgggacgct ggcggccaac cggctcaggg tggcggcgcc aatgggcaat ccggtgtcgg 3190561 ccgacgcggc ggccactcgg agcaccaggc cgggatcgtg ttcgggttcg gcgtcgcggg 3190621 cgagcacgat ttcgccggca tactcgacga caccctcgtc gagcggtcgc cgctttggcc 3190681 gccgcaccaa ggccgagatg ccgcgccgcg gcaatgcatt cgccgcagtc cgcagcccgg 3190741 cttcggcgtg gtaaccgatg gtgcggccag cactcgacag tgtgcgcgcc aaatcgaatc 3190801 ggtcaccgaa acccaacgcg gcgctgatct cgtcggcgaa ctgggccagc aggtggtcgc 3190861 gtccgcggcc cgacacccgg tgcagttcgg tgcgcacatc cagcaaggtg cgatacgcac 3190921 cgtccagcga acccgccggc cggtccgtgt ggccgatacc gtgccggtcg atgagctggg 3190981 cgagagccag cgcgtctagc aactggacgt cccgaaggcc gccgcgaccc aatttgagat 3191041 cgggctctgc gcgctgcgcg atccggccac agcgccgcca acgcgcatat gtcatttcga 3191101 cgagttcgcc catgcgggaa cgaattccgt tgcgccactg gcgtcgcacg ccgtcgatca 3191161 acgcgaacga gagctgctga tcgccggcga tgtggcgggc ttccagcatg cctagagcgg 3191221 ccatcagatc ggaattggcg atggtcaatg cctcactaac cgttcgcaca ctgtgatcga 3191281 gccgaatgtt ggcatcccac aacggatacc acaacctgtc ggcgacgggc cgcaagatgt 3191341 cagcaggctt gccatcgtgc aacagcaaca cgtccaggtc cgaatacggc agcagctcgc 3191401 ggcggccgag cccgccgacc ccgacgattg caaaaccact ggcatcggcg atcccgatct 3191461 cgtcggcctt gtcgatcagc caagactcat gcagatccag ccacgtctgc cgcagcccga 3191521 ccggatccag ctcgcgatgg ttgccggaca gcagctcgcg tcgggcgaca gctaaatcgc 3191581 ttgcggcaca aggactttct gcctccatct ccctcgctag cgctaattgg tgcggccggg 3191641 ttggttcagc acagtgcggc tagtttcata acgcgtcgtg tccgcgttca ccggtgcgca 3191701 cccgcacgat ggtgtctacc ggactcaccc acaccttgcc gtcgccgatc ttgccggtgc 3191761 gcgccgcccg gacaatgctg tccacgacct tatcgacaat ggaatcgtca acaacgacct 3191821 cgatccgaac cttcggtacg aaatccaccg agtattcggc cccgcggtaa acctccgtgt 3191881 ggcccttctg ccgtccgtat ccctggattt cactgaccgt catccccagc actcccgcgt 3191941 cctcgaggct cgtcttgacg tcgtcgagcg tgaacggctt cacgatcgca gtgatcagct 3192001 tcatttcggc tccgcctcca ctttctggcc tatacgctcc tgaatgccgt tgcggctatc 3192061 ctccacggtg acccgcgggg ggagaaccga gccgctggcg acggcgaaat cgtagccgct 3192121 ttccgcgtgc tcagcctcgt cgatgccggt gctctcttgc tccgcgtcaa gcctgagccc 3192181 gatggtgaat ttcaggatca atgccaagat cagggtgatg attccagagt agacgagaac 3192241 actgcaggca ccgagcgcct gtcgttccag ctgggcgaag cctccgccgt aaaacaaccc 3192301 cttcgatacc ccggccacac cattaattgc cggagcctcc ggagctgcca gcagacccac 3192361 cagcagtgtg cccaccagac caccaaccag gtgcaccccg accacgtcga gcgaatcatc 3192421 gaagcccagt ttgaatttca gccccaccgc cagcgcgcac agcaccccgg ccgacacgcc 3192481 taccgccaag gcacccagga cattaaccga cgagcaggac ggcgtgatgg cgaccagtcc 3192541 ggcgacgatg cccgacgccg cgcccagcgt cgtagccttg ccatctcgga cgcgctccgt 3192601 gagcagccag ccaagcatgg ccgcggccgt cgcaatcgtg gtggtgacaa acgtcgcccc 3192661 ggcaacaccg ttggcggtcg tcgccgatcc tgcgttgaac ccgtaccagc cgaaccacag 3192721 cagggcggcc ccgagcatca caaacggcag attgtgcggt cgaaacagcg tcgccggcca 3192781 accgcgtctt ttgcccagca cgatcgccag catcaaggcc gccacaccgg cgttgatatg 3192841 aaccgcggtg ccgccggcga agtcgatggc gtgcagcttg ttggcgatcc agccgccgtg 3192901 ctcagcggcg aaaccgtcaa atgcgaagac ccagtgtgcg accgggaaat agacgaacgt 3192961 cgcccacaaa ccggcgaaca acagccaggc gccgaacttc aaccggtcgg ccaccgcccc 3193021 ggagatcagc gcaaccgtga tgatcgcgaa catcagctgg aatgccacaa acacggtcgc 3193081 cggcagggta cccgccagcg gaatattcac cgcggcggtc tgcgtgctcg gatcggcagc 3193141 aacagcattg acgccgatga gacctttgag accccagtat tggctcgggt tgccggcgat 3193201 gttgccaacg tcatcaccga acgcaatcga gtagccgtaa agcgcccaga gcaccgtcac 3193261 gacacccatc gcgctgatgc tcatcatgat catgttcagg acgctcttgg aacgcaccat 3193321 gccgccgtag aaaaatgcca gacccggcgt catcaacagc acgagcgcgg aactcaccag 3193381 catccaggcg gtgtcgccgc catccggaac gcccatgatg gggaattggt ccactcgcta 3193441 tcacctccag tcgagcgttg gcacggcccc agccttacga ctgacgacct gatccagaac 3193501 catgcgcact agttgttgcg gcgatggtgc cgccatgttt catcaggatt aacgtaaaac 3193561 ttgctgtgaa agagctttcc gtggcgatcg caagcgcggc gcagccgcgc gcagcgggtc 3193621 gccaccatca aaccccgtgg cgatcgcaag cgcggcgcag ccgcgcgcag cgggtcgcca 3193681 ccatcaaacc ccgtggcgat cgcaagcgcg gcgcagccgc gcgcagcggg tcgccaccat 3193741 caaaccccgt ggcgatcgca agcgcggcgc agccgcgcgc agcgggtcgc caccatcaaa 3193801 ccccgtggcg atcgcaagcg cggcgcagcc gcgcgcagcg ggtcgccacc atcaaacccc 3193861 gtggcgatcg caagcgcggc gcagccgcgc gcagcgggtc gccacctcgg ctagccgagc 3193921 agggcgtcga cgaatgcggc gggttcgaaa ggcgccaggt catcggggcc ttcaccaagc 3193981 ccgaccagct tcaccggcac cccaagttcc tgttgaacgc ggaacacaat gccgcccttg 3194041 gccgttccgt ccagtttggt gagcaccgcg ccgctgatgt cgacgacctc ggcgaacact 3194101 ctggcctgcg ccaacccgtt ctgtccgatc gtggcatcga gcaccagcaa cacctcgtca 3194161 acggacgctc gccgagtcac cacgcgcttg accttgtcca gctcgtccat caggccaacc 3194221 ttggtgtgca gccgcccggc tgtatcgatg agcacgacgt ctgcgccggc ggcgatgccc 3194281 ttgtcgacgg cgtcgaacgc caccgatgcc gggtcggcgc cttcgggccc gcgaaccacc 3194341 gctgcgccaa cccgcgccgc ccaggtctgt agctgatcgg cggcggccgc acggaaggtg 3194401 tcagccgcac cgagtacgac ccgtcggccg tcggccacta gtacccgcgc caacttgccg 3194461 accgtggtgg tttttccggt gccgttgacg ccgacgacca gcaacaccga aggatggccg 3194521 gcgtgcggta gcgcgcggat cgagcggtcc atgccaggtt gcagttcgtt gatcaggacg 3194581 tcacgcaata ccgcccgggc gtcggcctcg gtacgcacgt tgccgctggc caggcggctg 3194641 cgcagctgcg acaccaccga cgcggtggcc gccggtccca ggtcggcgac cagcagggtg 3194701 tcctcgacgt cttgccagga gtcctcgtcc aggtcgccgc cgccgatcag tcccaacagg 3194761 ccgcgcccga gggcattctg cgatctggcg agccgtccgc gcagtcgttc caatcgacct 3194821 tcgggcggcg cgatggcgtc agcctcgggg acctctggag cctggggttc tggctcaaac 3194881 tcgggaaggt gtacgtcggc gatcgtgcgc ttgggcgcgt cgcgagggac ggtcgcatcg 3194941 tcgcccacgg cgggcagtcc gctcgtatcg atccgctcgg ccggctgggt cgtcggcgtc 3195001 tgactaaacg tgatgccaga cgatgcggtg taaccgcctg agcggtcgac aacgccgcgc 3195061 tcgggccgag gcgacagact gatgcgccgc cgacggtaga gcaccagccc cagggtcagc 3195121 gcagcgatga cgaccagggc ggcgatgacc gccgtggcga tccacaaacc ttcccacacg 3195181 ctgacaatcc ttccaggggt cgcttgcccc gatgcttagg gacgaaccct acgaggaatt 3195241 ggtaaccagc tgatccacct gctgaccgcg catgcgctgc gagatgaccg cggtgatgcc 3195301 gtcgttctgc atggttacgc cgtacagtgc gtccgcgacc tccatcgtcg gcttctggtg 3195361 ggtgatgatg atgatctgcg actgctctcg cagctgttcg aacaggctga gcagtcggcg 3195421 caggttcacg tcgtcgaggg cggcctccac ctcgtccatg atgtagaacg gcgatggacg 3195481 ggcacgaaag atcgcgacca gcatcgccac cgcggtcagc gccttctcgc caccggagag 3195541 caaagacagt cgggtaatct tcttgcccgg cgggcgggct tcgacctcga tgccggtggt 3195601 gagcatgtcg tcgggctcgg tcagccgcag ccgtccttca ccaccgggga acaatgcggt 3195661 gaacacgccg cgaaattcgc gttccacgtc tacgaacgcg tcattgaaca cctgcaggat 3195721 gcgggcgtca acatcggcga cgacgcccag cagatccttg cgggcagcct tgacatcctc 3195781 gagttgggtg gacaggaaat tgtagcgctc ctccaaggca gcaaactctt cgagcgccag 3195841 cgggttgacc ctgcccaact cggcaagcgc acgctcggcg cgtttggccc ggcgctcctg 3195901 ggtaacccgg tcgaacggca tgggggcggg cgcaatcacc tgctcgccgc gttcgcgggc 3195961 ttgctcgaac tcagccatct cgagctcggt cggtggtagc gccacatgtg gaccgtattc 3196021 ggtgatcaag tcggccggcg ccattccgaa ctgctctagc accatctgct caagctgctc 3196081 gatacgcagc gccgcctgcg cgttagccag ctcgtcgcgg tgcagcgaat cggtgagttc 3196141 ccccactcgg gcgctcagcg tgttcacctc gtcgcgcacc gcggccatcg ccgctaaccg 3196201 ctgctgacgt tgcgcggccg acgcgtcgcg cagttgcgac gccccgtcca ccgcccggtg 3196261 caaccgcccg gccagcagcc gtccgcagtc ggcgaccgct gcggccaccg cggccgcatg 3196321 cagtcttgcg gcgcgtgctt gctgagcccg cacccgcgcc tcacgttccg ccgcagccgc 3196381 acggcgcagc gaatcggccc gcccgcgaac cgcgttggcg cgttcctcgg cggtgcgcac 3196441 cgccagccgg gcttccactt cgacaccgcg ggcgcgatcg gcagcggcac tgatcgcctg 3196501 gcggtcgatc ggttgggcca cctgcacccg ttgggtctcc tgggccttac gcagctgggt 3196561 ctcaagttgt atgacgtcgt cgagagtctg tgtgcgcacg gcttcctgtt ccgtacgctg 3196621 ctgcagcaac cggttccact cttcttccgc cgcgcgggcc tcctgcccga ggcggcccag 3196681 ctgctcgtac atcgccgaga tggccgtgtc ggattcgtta agcgcggcca aggcttgctc 3196741 ggccgcgtcc tggccggcgg actgctcggt cagcgcaccg gccagggccg cattcaattg 3196801 cgccgccagc gcctcggcag cggccagctc actcctggcc ttgtcgatct cggaggtgac 3196861 ctccaaggtg gacagcttgc ggtccgatcc gccgctgacc cagccggcgc ccaccagatc 3196921 accgtcaacg gtgaccgcgc gtagctccgg acgaatctcg accaggccca ttgcctcagt 3196981 caggtcgttg accaccgcga cacccgaaag catggcgatc atcgcgccaa ccaactgcgg 3197041 tggagactcg accaggtcta gggcccactg ggcgccgcta ggcagcatct cccccgaggc 3197101 ggattggggg gcttgcgggg ccggccagtc actcagcacg aggaccgcgc gaccgccgtc 3197161 ggcttgtttg agtgcgctga cggcactacc cgcggcagtc aggccgtcca ccgcaagtgc 3197221 gtcggccgcc ggcccgagcg ccgcggccag tgccgcttca tagccggaac gtaccttcac 3197281 caattgggcg atcgaaccga aaagccctgc gccactgcga ttgtgcgcca gccacgccgc 3197341 gccgtccttg cgctgtagcc ccactgcgag cgcatcgatg cgagcccgta gcgatgccac 3197401 ctggcgttcg gcggcgcgtt cggcggattg cagctcggcg acgcgttcgt cggccaaccg 3197461 caacgcggcc acagtacgct cgtggtgctc atccaggccg acctcgcctt gatccagttc 3197521 accgatgcgg ccctgcacgg tttcgaactc ggctcgggtc tgctgggcgc gcattgcggc 3197581 atcctcgatc cgctcggaca accgtgccac gctctcatcg atcgattcga cacgcgcccg 3197641 catggtctcc acctggccag ccagccgcgc cagtccctca cggcggtccg cctcctcccg 3197701 gaccgccgcc aggtgtgccc ggtcggcctc ggcggcgcgg cgctcccggt cggcccgctc 3197761 tgcacgggca gcatcgagtc gggcacgcgc cgcgtccagc tccgctaaca gttgttgctc 3197821 ggcgacggcc acctgctggg cctcggcttc tagctcctcg ggctttctgg ggtcggtgtc 3197881 gctgaccgct accggctcga tatcgagatg atgggcgcgt tcgctggcga tgcgcaccgt 3197941 agcgtccacc cgttcggcca gcgcagacag cccgaaccaa gtgtgctgga tcgactcggc 3198001 ccgcgtcgag agttcggcga ccgcggactc atgcgcggcc agctcctcgg atgccaccgc 3198061 cagccgggcg gcggcctcgt catgctcgcg gcgcatcgca gcctcggcct gaaagaccgc 3198121 ttcccgttcg gctctgcggc ttaccaagtc gtcggccgcc aggcgcagcc gggcgtcgcg 3198181 cagatcggct tggatggccg cggcacgctg ggccgcctcg gcctgccggc ccagcggttt 3198241 gagttgacgc cggagctcgg tggtcagatc ggtgagccgg gccaggttcg ccgccatcgt 3198301 gtcgagtttg cgcagagctt tttccttgcg cttgcgatgc ttgagcacac cggcggcttc 3198361 ctcgatgaac gcccgccgat cctcaggccg cgactgcaag atctcctcga gcttcccttg 3198421 cccaacaatc acatgcatct cacggccgat gccggagtcg ctcagcaact cctgcacatc 3198481 catcaaacgg caactgctgc cgttgatttc gtattcgctg gcaccgtcgc gaaacattct 3198541 tcgggtgatc gacacctcgg tgtattcgat aggcagtgcg ttgtcggagt tgtcgatgct 3198601 aacggtgact tcggcgcggc ccagcggcgc acgcgacgag gtgccggcga agatgacgtc 3198661 ttccatcttg ccgccgcgca gcgtctttgc cccctgctcc cccatcaccc acgccagggc 3198721 atcgaccaca ttggatttgc cggagccgtt gggcccaacg acggccgtaa tgcccggctc 3198781 gaagcgtaaa gtcgtcggcg cggcgaagga cttgaagccc ttcaacgtca gactcttgag 3198841 gtacacgagg ggccagatta ccgctcgctg aacccggtga tctgctccgt cgactgcgac 3198901 cagtcggcga cgactttggc gacgcggccc ggtgtcgtgt cgccctgcag cagctgcagc 3198961 agcttctggc acgcagcgcg cggaccctgg gcgaccacca gcacgcgtcc gtcggcgtgg 3199021 ttggccgcgt aaccggtcag gccgagctcc aacgctcggc agcgggtcca ccagcggaaa 3199081 ccgactccct gcacccaccc gtgcacccag gcggtcagcc gcacgtcagg cgccgacatc 3199141 gacgacctcc aagttgaccg tggtgcccga cttgagggtg cgcccgacgg tgcacgccag 3199201 ctccaccgcg cggttgatga ccaccagcag acgctccttt tcgtcctcgg tgaggcccga 3199261 caagtcgagc tccatggtct cctcgatcag gggatagcgc tcctggtcgc ggtcggccgc 3199321 accggatacc ttgaccaccg cctggtagtc gtcgccgagc cgccgggcca gcggctggtc 3199381 actggccatc ccgctgcatg cggcgagtgc gatcttgagc agctctccgg gggtgaatac 3199441 cccgtcgacg tcctcggagc caaccagcac ctgcgccccc cgcgtgctgc gtccgatgta 3199501 acggcgcgtg ccggtgcgct cgacccacag ttgcgtcatg gcttctttct acccgggggt 3199561 ctttgcgtcg agatcgacgg cagcgccccc gcgagagaga gcatcgcgct gacgtcgatc 3199621 tcgatgcgtc aacacccgcc ctactttcgg ggccgcggct ggcatcgcgg gcagtagaac 3199681 gacgagcggt tcataaacct ctcccggcgt atcaccgcgc cgcagcgccg acagttttcg 3199741 ccttcgcggc cataagcgtc cagcgaccgc tcgaagtagc ccgactcgcc gttgacgttg 3199801 acatacaaag agtcgaacga ggtgccacct ttcgccagcg cttcgcgcat cacgtcggcg 3199861 gcggcatgca ggaccgctcc cagacgccgg caccttagtg tggcggcgac gtgggcgccg 3199921 ttcaccttgg cccgccacag cgcctcatcg gcatagatgt tgccgattcc cgacaccacc 3199981 cgctgatcca gcagctggcg cttgagttcg gaatgcttgc gccgcaacac tttaactaca 3200041 gcgtcacaat cgaaccgcgg gtcaagcggg tcgcgcgcca ggtgggcgac cggcaccggt 3200101 accacgctgc cgtccaccgt caccaggtcg gcaagcagcc accctccgaa ggtccgttgg 3200161 tcagcgaagc tcagcacggt cccgtcgtcg agcagcgcgg aaatccggac gtgagcggca 3200221 cacggcaccg ccccgagcag catctgccca ctcatgccca ggtgcaccac gagtgcggtg 3200281 tccgtcggcc tatggacccc agccgtattg agtgtcaacc acaggtactt gccgcgccga 3200341 tcggttccgt tgatccgcgc tccccgcagc cgcgccgtca gatccgcggg cccggcatcg 3200401 tggcggcgca cagcgcgggg gtggtgcacc cgaacctcgg tgatggtccg gccggtcacg 3200461 tgagcctgca agccgcgccg caccacctcg acttcgggca gctcgggcat ccagtgatga 3200521 tcgcaagcgc ggcgaagccg ggcgcagcgg gtcatcacca tcgaaccagt gatgatcgca 3200581 agcgcggcga agccgggcgc agtcccccgc aagcgggagg tgcccccagg tcatcaccat 3200641 cgaaccagtg atgatcgcaa gcgcggcgaa gccgggcgca gtcccccgca agcgggaggt 3200701 gcccccaggt catcaccatc gaaccagtga tgatcgcaag cgcggcgaag ccgggcgcag 3200761 tcccccgcaa gcgcggcaaa gccggcgccc ccaggtcatc accatcaatc cagttaggcg 3200821 gaggttttgc ccggcatggc gttgtcgagc acttccaggg ctttccaagc ggccgccgcg 3200881 gctttttgct cggcttcttt tttggaccgg cccactcctg aaccgtattc gctgtccatc 3200941 acgacaacca ccgcggtgaa ttccttatcg tggtccgggc cggtggaggt gaccaggtat 3201001 gacggcgcac ccagccctcg cgctgcagtc agctcctgca agctggtctt ccaatccaat 3201061 cccgcaccca gggtcggcgc ggcgtccagc aacgggccaa acagccgcag gatcacctca 3201121 cgggccttct ccataccgtg ttgcaggtag atcgcgccca gcagcgattc cataccgtcg 3201181 gccagaatgc tggacttgtc ggccccgccg gtgttcgcct cgccgcgacc caatagcacg 3201241 tgaacaccga ggccttccgc acagaggcgg cgtgcgacgt cggccagggc ctgggtgttg 3201301 actacgctgg cccgcagttt ggccagatcc ccctccgacc gatcaggatg acgatggaac 3201361 agcgcgtcgg tgatggtcag ccctagcacg gcatcgccga gaaactccaa acgctcgttg 3201421 gtcggcagcc cgccgttctc gtaggcgtag ctgcggtggg tcaacgccag tgagagcagc 3201481 tcgtccggga ggtccacacc gagtgcgtcg agcaggggtt gtcgtgaccg gatcatcgct 3201541 cacctcgtaa tgtgtcggac tccggcccga gcatttcgac caacttcgcc caccgcgggt 3201601 cgatctgttc atggcgatga cctggctcgc tggccagcgg gacaccgcac tgcgggcaaa 3201661 gacccgggca gtccggccgg cacaccggcg aaaacggcaa ttccagaccg accgcatcga 3201721 tgatcggctg ctcgagatcg atggtttcgt cgacgacgcg tccgacctcg tcttcctcgg 3201781 tggtctcgtc ggtggcgcta tccggatagg caaacagttc ggtcagggct acctgaacgc 3201841 gaccccgcac cgggctgagg caacgagcac actcgccgac ggtcggggcg gccacggtcc 3201901 cggtcaccaa cacgccttcg gacaccgact cgacccgcag atccaggtcc agaagggcgc 3201961 cctggtcaat cgcgatcagc tccagcccga tgcgtgcggg gctgtgcacg gtgtcatgca 3202021 gctcgaacat cgctcccggt cgtcgcccca accgtgcgat gtcgaccgtc atcggcgacg 3202081 ccacatgtcg ctgcgcagtg ggaccgtgct gcctggccat aagagaaatc ctacggcgca 3202141 cgccacccag atccacgccg cgttgggcgt tggccggcgc ttgatccttg cgccgggtga 3202201 atggtgttag cgcaccgcgt agtcgtgagt gccggccgct gtgcggagct ggtggcgacc 3202261 gcggccaacg gaccgcaggg tgccgttgag gaattcctcg aattcggcga gcttgttgtc 3202321 gacgtagata tcgcattctc cgcgtagccg gtccgcctcg gcgtgcgccg tgtcgacgag 3202381 gcgggtcgat tccgcgttgg ccgccgcaac cacctcgttc tgcgatacca ggcgctgctg 3202441 ctctttgatg ccctcctgca cggctttctc gtaggagatg ttgccgtttt cgatcagccg 3202501 gtcgcattcg gcctgggcgc gactgacgct ggcctcgtat tcgcgtttcg cggcggtggc 3202561 gatgcgaatc gcctcctcgc gtgcatcggc gaccattcgc tcgctgtgct ggcgtgcctc 3202621 gctgaccatc cgatcagcct gcgccttcgc gtcagacagg atccggtcag cctcggtgcg 3202681 ggcgtggttg agtatcgact ccgcctcagt ggtcgccgag gacaccatag agtcagcgtg 3202741 cgtcttagcg tcctgcaaca tcgaatcacg tgcgtcgagg acgtcctgcg cgtcatccag 3202801 ctcaccgggg atcgcatcct tgatgtcgtc gatcaactcc agcacatccc cacgcgggac 3202861 gacgcaacct gccgtcatcg gcacgcctcg ggcttcttcg actatggcgc tcaattcgtc 3202921 cagcgcttca aagactcggt acacggccac accctcctgg catcttgcaa gatccctgtt 3202981 gttaccagtg tgcctggtgt ttcgtctgtg actgcactgg tggcgccggt gtgtcgggac 3203041 acaatttcat attcgacgag cccgggcgac cactcagatc acgcggcctg ctgggcgcgt 3203101 gtcgtagacc gttcggcggc tgacgggtga gcctacgtcg tctgggcgat cttgcccgag 3203161 cgtgccgaca acgtaggtgt cgatgctggc ccgtcacgga ccacgctatg gtggctcggt 3203221 gaacgggcac tcagacgaca gtagcggcga cgcgaagcaa gccgcaccca cgctgtatat 3203281 tttcccgcat gccggcggca ccgcgaaaga ctatgtcgca ttttcccgag aattttccgc 3203341 cgacgtaaag cggattgctg tccaataccc cggccagcac gatcgttctg gcctgccacc 3203401 gcttgagagt attcccaccc tcgctgacga aatctttgca atgatgaaac cgtcggctcg 3203461 gatcgacgat ccggtggcat tctttgggca cagtatgggc ggaatgctag ccttcgaagt 3203521 agcgttgcga taccaatcgg cgggccatcg agtcctggca ttctttgtgt cggcctgctc 3203581 agcaccgggt catatcagat acaagcagct ccaagattta tcagatcgcg agatgttgga 3203641 cttgttcacc cgaatgacag gaatgaatcc agatttcttt accgacgacg aatttttcgt 3203701 tggagcgcta cccacgttgc gagcggtccg agccatcgcc ggttattcct gcccaccaga 3203761 gacgaagctc tcgtgtccga tttatgcctt tatcggagat aaagattgga tcgcaacgca 3203821 agacgacatg gatccgtggc gcgatcggac gacggaagag ttctctatcc gtgtattccc 3203881 tggggatcac ttctacctca acgacaattt gccagagcta gtcagcgaca tagaagacaa 3203941 aacactccaa tggcatgatc gagcttagct atgctccgga tgtagctggc cgaagatcca 3204001 actggccgaa gggctcgggg gtcaacacct ggacagccat tcgctggaca tttgctgaag 3204061 attcaccgta cgtcggcacc ggtctggagc ggatggcttc agacacacac gggggcggtg 3204121 gcggccgacc ggtcaccccg cccccgcccg gtatgcacca tctcgggtgc agccgaggcg 3204181 tgttgttaat ctcgtcacaa cgggacgccg gtcacaagac gtgcgaccca gccgccggcg 3204241 gcactctgac ctcggttctt acctgactac caattcgtca ccggcatcgc acacgtcaca 3204301 ccaaccacag cggacgcggc acggcacgcg gaagggacgc tagactcggc tagcaccacc 3204361 accgtgccca ggcaacgacg ccggccgtcg ctaagaaatt tggttgactt catgaataag 3204421 gccgcgcccg ccccgacaaa tgattacctt acatttgcgg gctaggcata gcggagcagg 3204481 ggttttagtc tagggggaga tcggctggcg ctgcgcagac atgctgcgga agcagaactg 3204541 cgtaatcgtc aggtggcttg gtcagttcag accggcacgt ttcagagcgg tggggatgtc 3204601 ccgacgtgcg atccgacagg ggttcgcagg gtccgcaaaa aacatagtga acgccagaaa 3204661 gccgaatggg agtacaaggc gatgccggtg accgaccgtt cagtgccctc tttgctgcaa 3204721 gagagggccg accagcagcc tgacagcact gcatatacgt acatcgacta cggatccgac 3204781 cccaagggat ttgctgacag cttgacttgg tcgcaggtct acagtcgtgc atgcatcatt 3204841 gctgaagaac tcaagttatg cgggttaccc ggagatcgag tggcggtttt agcgccacaa 3204901 ggactggaat atgtccttgc attcctgggc gcacttcagg ctggatttat cgcggttccg 3204961 ctgtcaactc cacagtatgg cattcacgat gaccgcgttt ctgcggtgtt gcaggattcc 3205021 aagccggtag ccattctcac gacttcgtcc gtggtaggcg atgtaacgaa atacgcagcc 3205081 agccacgacg ggcagcctgc cccggtcgta gttgaggttg atctgcttga tttggactcg 3205141 ccgcgacaga tgccggcttt ctctcgtcag cacaccgggg cggcttatct ccaatacacg 3205201 tccggatcga cgcgtacgcc ggccggagtc attgtgtcgc acacgaatgt cattgccaat 3205261 gtgacacaaa gtatgtacgg ctatttcggc gatcccgcaa agattccgac cgggactgtg 3205321 gtgtcgtggc tgcctttgta tcacgatatg ggcctgattc tcggaatttg cgcaccgctg 3205381 gtggcccgac gccgcgcggt gttgatgagc ccaatgtcat ttttgcgccg tccggcccgc 3205441 tggatgcaac tgcttgccac cagcggccgg tgcttttctg cggcaccgaa tttcgccttc 3205501 gagctggccg tgcgcagaac atctgaccag gacatggcgg ggctcgacct gcgcgacgtg 3205561 gtcggcatcg tcagtggcag tgagcgaatc catgtggcaa ccgtgcggcg gttcatcgag 3205621 cggttcgcgc cgtacaatct cagccccacc gcgatacggc cgtcgtacgg gctcgcggaa 3205681 gcgaccttat atgtggcagc tcccgaagcc ggcgccgcgc ccaagacggt ccgttttgac 3205741 tacgagcagc tgaccgccgg gcaggctcgg ccctgcggaa ccgatgggtc ggtcggcacc 3205801 gaactgatca gctacggctc ccccgaccca tcgtctgtgc gaatcgtcaa cccggagacc 3205861 atggttgaga atccgcctgg agtggtcggt gagatctggg tgcatggcga ccacgtgact 3205921 atggggtatt ggcagaagcc gaagcagacc gcgcaggtct tcgacgccaa gctggtcgat 3205981 cccgcgccgg cagccccgga ggggccgtgg ctgcgcaccg gcgacctggg cgtcatttcc 3206041 gatggtgagc tgttcatcat gggccgcatc aaagacctgc tcatcgtgga cgggcgcaac 3206101 cactaccccg acgacatcga ggcaacgatc caggagatca ccggtggacg ggccgcggcg 3206161 atcgcagtgc ccgacgacat caccgaacaa ctggtggcga tcatcgaatt caagcgacgc 3206221 ggtagtaccg ccgaagaggt catgctcaag ctccgctcgg tgaagcgtga ggtcacctcc 3206281 gcgatatcga agtcacacag cctgcgggtg gccgatctcg ttctggtgtc acctggttcg 3206341 attcccatca ccaccagcgg caagatccgg cggtcagcct gcgtcgaacg ctatcgcagc 3206401 gacggcttca agcggctgga cgtagccgta tgacgggaag catcagtggt gaagccgacc 3206461 ttcgccactg gctaatcgac tacctagtaa ccaatatcgg ctgcacacct gacgaggtgg 3206521 accccgatct gtcgcttgcc gacctcggcg tcagctcccg cgacgcggtc gtactgtccg 3206581 gcgaactgtc agagctgctg ggcaggaccg tatcgccgat tgacttctgg gagcacccga 3206641 cgatcaacgc gctggccgcg tatctggccg cacccgagcc gagccccgac tccgacgccg 3206701 cagtcaagcg tggtgcccgg aactcactcg acgagccaat cgccgtcgtc ggcatgggat 3206761 gtcgtttccc tggcgggatt tcgtgcccag aagcattgtg ggactttctc tgtgaacgcc 3206821 gttcctcgat cagccaggtg ccgccgcaac gatggcagcc cttcgaaggc gggccacccg 3206881 aggtagccgc ggcgctagcg cgcactacac ggtggggctc atttttgccc gacatcgacg 3206941 ccttcgacgc ggaattcttc gagatctccc ccagcgaagc cgacaagatg gacccccagc 3207001 aacgcctgct gctggaagtg gcctgggaag cgttggagca cgcgggaatc ccgcccggca 3207061 cgctgcgccg ctcggcaaca ggagtgtttg ccggggcatg cctgagcgaa tacggtgcga 3207121 tggcttccgc cgatctgtcg caggtcgatg gttggagcaa tagcggtggc gcgatgagca 3207181 tcatcgccaa ccgcctctcg tatttccttg acctgcgcgg cccgtcggtg gcggtagaca 3207241 ccgcatgctc gtcgtcgttg gtagcgatcc acctggcctg ccagagcctt cggacccagg 3207301 actgtcacct ggcaatcgca gccggcgtga atttgttgtt gtccccggcg gtatttcgcg 3207361 gtttcgacca agtcggcgcc ttgtccccga caggtcagtg ccgtgcgttc gatgcgaccg 3207421 ccgacgggtt tgtccgcggc gagggtgccg gggtagtggt gctcaagcgg ttgaccgatg 3207481 cacagcgcga cggggatcgg gtgcttgcgg tgatctgcgg ttctgcggtc acccaggacg 3207541 gccgatccaa cgggctgatg gcccccaacc cagcggccca gatggcggtg ctgcgtgccg 3207601 cctacaccaa cgcggggatg cagcccagcg aggtcgacta cgtcgaagcg cacggaacag 3207661 ggacgctgtt gggcgacccg atcgaagccc gcgctctcgg aacggtgctg ggtcgcggcc 3207721 ggcccgagga ttctccgttg ctcatcggct ctgtcaagac caacctcggt cacaccgagg 3207781 ctgcggctgg aatcgcgggc ttcatcaaga cggtgctggc tgtgcagcat ggccagattc 3207841 cgccaaatca gcacttcgaa accgcgaacc cgcacattcc ctttaccgac ttgcggatga 3207901 aagtcgttga cacacaaact gaatggccgg caacgggcca tccccgccgt gccggtgtgt 3207961 cgtcgttcgg cttcggtggc acaaacgcgc acgtggtgat cgagcagggc caggaggtgc 3208021 gccccgcgcc tggacaaggc ttaagtccgg cggtgtcgac cctggtagtg gccggcaaga 3208081 ctatgcagcg ggtgtccgcg accgcgggga tgctagccga ttggatggaa gggcccggcg 3208141 ctgacgtggc cttggccgac gtggcccaca ccctcaatca ccaccgatcg cggcaaccca 3208201 agttcggcac ggtggtggcc cgtgaccgta cccaggcgat agccggattg cgtgcgctgg 3208261 ccgccggcca acacgccccc ggcgtggtca accctgccga gggctcgccg gggccgggca 3208321 ccgtgttcgt ctactccggc cgcggttcac agtgggctgg catgggccgt caattgttgg 3208381 ccgacgagcc ggctttcgcg gccgcggtcg ccgaattgga accggtgttt gtcgagcaag 3208441 ccggcttttc gttgcacgac gtgctggcta acggcgagga actggtcggt atcgagcaga 3208501 ttcagctcgg gttgatcggg atgcagctgg ccctgaccga attatggtgt tcctacgggg 3208561 tgcagcccga cctggtgatc ggccactcca tgggcgaggt ggccgccgcc gtggtcgccg 3208621 gggcactgac cccggccgag ggtctgcggg tgaccgccac ccggtcacgg ctgatggcac 3208681 cgttgtccgg ccagggcggc atggcactgc tggaactcga cgcgcccact accgaggcgt 3208741 tgattgccga cttcccacag gtgacgctcg gtatttacaa ctcaccacgg caaacggtga 3208801 tcgccgggcc caccgagcag atcgatgagt tgatcactcg cgtgcgcgct agggaccgat 3208861 tcgccagccg ggtcaatatc gaagtggccc cgcacaatcc ggccatggat gctttgcagc 3208921 cggcgatgcg ttcggagctg gccgatctga ccccacggac ccccaccatc ggaatcatct 3208981 ccaccaccta cgcagacttg cacacccaac cggtcttcga cgccgaacac tgggccacca 3209041 acatgcgcaa ccccgtgcat ttccagcagg ccatcgcttc cgccggtagc ggcgccgacg 3209101 gcgcctacca caccttcatc gaaatcagcg cacacccgct gctgacccag gccatcatcg 3209161 acactctgca cagcgctcaa cccggagcca gatacaccag cctcgggacc ctgcaacgcg 3209221 acaccgacga cgtcgtgacc ttccggacca acctcaacaa ggcccacacc atccacccac 3209281 cgcacacccc ccaccccccc gagccacatc cgcccatccc caccaccccg tggcaacaca 3209341 cccgtcactg gatcaccacc aaatatccgg ccggctctgt tggatcggcc ccccgagcgg 3209401 gcacactgct cggccaacac accaccgtcg ccacggtctc agcgagtccg ccctcccacc 3209461 tctggcaagc aaggctggct ccggacgcca agccgtacca gggcggtcat cgattccacc 3209521 aagtcgaggt ggtcccagct tctgttgtgc tgcacacaat cctttccgct gcaacagaat 3209581 tgggctactc cgcgttgtcc gaggtccgat tcgagcaacc cattttcgcc gaccggccac 3209641 gtctaatcca ggtcgtcgcc gacaaccggg cgatcagcct ggcctcgagt ccggctgccg 3209701 gaacaccctc agaccggtgg acgcggcatg ttaccgcaca actttcctcg tcaccgtcgg 3209761 attcggccag cagcttgaac gagcaccatc gcgccaacgg gcagccgccc gaacgtgctc 3209821 accgcgacct gattcccgac ctggccgagc tgctcgcaat gcgcggcatc gatggcctgc 3209881 ctttctcatg gaccgtcgcg tcgtggacac agcactcgag caacctcacg gttgcgatcg 3209941 atctccccga agctctgccc gaagggtcga ctgggccgct ccttgacgcc gcggtgcacc 3210001 tcgccgcgct atcggacgtc gctgattcgc ggctctacgt gccggcaagc atcgagcaga 3210061 tatcgctcgg cgatgtcgtc accgggccgc gtagctcggt gacgctgaac cgcaccgctc 3210121 acgacgacga cgggatcacc gtcgatgtca ccgttgcagc ccacggcgaa gtgccgtccc 3210181 tgtcgatgag gtcgcttcga taccgggctc tggactttgg cctagacgtt ggtagggcgc 3210241 aaccgcccgc gtcgaccggt ccggtcgagg cctactgtga tgccaccaat ttcgtacaca 3210301 cgatcgactg gcaaccgcag accgttccgg acgcgacgca cccaggggcc gaacaggtaa 3210361 cccatccagg acccgtcgcg ataatcggcg atgacggcgc agcgctgtgt gagaccctcg 3210421 aaggggcggg ctaccagccg gccgtgatgt ccgatggggt gtcgcaggcc cgctacgtcg 3210481 tttacgtcgc ggattctgat ccggctggcg ccgacgagac cgacgtcgac ttcgccgtcc 3210541 ggatctgtac cgaaatcacc ggtctggtgc ggactctcgc ggaacgcgat gcggataagc 3210601 ccgcggcgct atggatcctc acccgcggag ttcacgaatc ggtcgccccg tccgcgctgc 3210661 gccagagttt cctgtggggc cttgccggtg tcatcgccgc cgaacatccc gagctgtggg 3210721 gcggactggt cgatctcgcg atcaacgacg acttaggcga attcgggccg gcacttgccg 3210781 aactgcttgc caaaccaagc aagtcgatct tggtgcgtcg tgacggcgtg gtgctcgccc 3210841 cggccttggc tcccgtccgt ggcgagccgg cgcgcaagtc cttgcagtgc aggcccgacg 3210901 cggcctacct catcaccggc ggcctgggcg cccttggcct gctgatggcc gattggctcg 3210961 ccgaccgcgg cgctcatcga ttggtgttga ccggccgcac gccattgccg ccacggcggg 3211021 actggcaact cgacaccctc gacaccgagc tgcgccggag gatcgacgcg atccgcgccc 3211081 tggaaatgcg cggggtgact gtcgaagccg tcgccgccga cgtcggctgc cgcgaagacg 3211141 tgcaggccct gttggccgcg cgcgaccgtg acggagcggc accgatccgc gggatcatcc 3211201 acgccgcggg cattaccaac gatcaattgg tgacgagcat gaccggcgat gcggtgcgac 3211261 aggttatgtg gccgaagatc ggcggcagcc aggtcctaca cgacgcattt ccgcccggca 3211321 gcgtggactt cttctacttg accgcctcgg ctgccgggat attcggcatt ccagggcagg 3211381 gttcctacgc cgccgccaat tcctacttgg acgcgctggc gcgggcgcgc cggcaacagg 3211441 gctgccacac catgagcctc gactgggtag cctggcgggg gctcggattg gccgcggacg 3211501 cccagctcgt cagcgaagag ctagcgcgaa tgggttcgcg tgacatcacg ccgtcggagg 3211561 cattcaccgc ttgggaattc gtcgatggct acgacgtcgc gcaagcggtc gtggtgccca 3211621 tgcccgctcc ggcgggcgcc gatggatccg gtgcgaacgc ttacctattg ccggcgcgga 3211681 actggtcggt gatggcagcg accgaggtgc gatccgagct cgaacagggg ttacgccgca 3211741 tcattgcagc cgagctgcga gtgcctgaga aagagctgga caccgaccgc ccgttcgccg 3211801 agttgggtct caattccctt atggcaatgg cgattcggcg cgaggccgag cagtttgtcg 3211861 gcatcgagtt gtctgccacc atgttgttca accacccaac ggtcaaatca ctcgccagct 3211921 accttgccaa acgtgtggca ccgcacgatg tgtcacaaga caaccagatt tccgcgctat 3211981 cctcgtcggc cggaagtgtg ttggacagtc tattcgatcg catcgaatcg gcgccgcctg 3212041 aggccgagag gtcggtgtga tgcgaacggc tttcagccgg atttccggta tgaccgcgca 3212101 acagcgcacc tccctagccg acgagttcga cagggtctct cgcatcgccg tggccgagcc 3212161 ggttgcggtg gttggcatcg gctgccgctt tccgggagat gtggatggac cagagagttt 3212221 ctgggacttt ctggtcgcgg gcaggaatgc gatctcgacg gtgccggcag atcgatggga 3212281 cgcagaagcg ttttaccacc ccgacccgct aacaccgggg cggatgacga cgaagtgggg 3212341 cggcttcgtc cctgacgtcg cgggcttcga cgccgaattc ttcggtatca caccgcggga 3212401 agccgcggcg atggacccgc agcagcgaat gctgctggag gttgcctggg aagcactcga 3212461 acatgccggc ataccaccgg attccctcgg cggcacccga accgccgtca tgatgggggt 3212521 ctatttcaac gagtatcagt ccatgttggc cgccagtccg cagaacgtag acgcctacag 3212581 cgggaccgga aatgcacaca gcatcacggt gggtcgcatc tcctacctgt tgggattacg 3212641 gggtccggcg gtcgcggtgg acaccgcctg ctcgtcgtcg ttggtggctg tgcacctggc 3212701 gtgtcagagt ctgaggctgc gcgagaccga tctggctctc gccggtggag tgagtatcac 3212761 ccttcgccca gagacccaaa tcgctatctc tgcctgggga ttgctgtccc cgcagggccg 3212821 gtgtgccgca ttcgatgcgg cggcagacgg atttgtgcgc ggtgagggcg ccggagtggt 3212881 agtgctcaag cggttgacgg acgcggtgcg cgacggcgac caggtgctgg cggtggtgcg 3212941 cggttcggca gtcaaccagg acggcaggtc caatggcgta acggcgccga atacggcagc 3213001 ccagtgcgat gtgatcgccg atgccttgcg atccggcgat gtggcgcctg acagcgtgaa 3213061 ttacgtagag gcccatggaa ccggcacggt gctgggcgac ccgatcgaat tcgaggccct 3213121 ggccgccacg tatggccacg gcggggacgc atgcgcgttg ggtgcggtga aaaccaacat 3213181 cggtcatctg gaggcggccg ccgggatcgc ggggttcatc aaggcgacgc tggcggtaca 3213241 acgcgcgacg atcccgccga atctgcattt ctcgcaatgg aatccagcta tcgatgccgc 3213301 gtcgaccagg tttttcgttc ccacgcagaa ctccccgtgg ccaaccgcgg aggggccgcg 3213361 ccgggcggcg gtgtcgtcgt tcggattggg cgggacgaac gcacacgtga tcatcgagca 3213421 aggtagcgag ctggctccgg tatccgaagg cggcgaggac accggggtgt cgacgttggt 3213481 ggtgacgggt aagacggccc agcggatggc cgcgacggcg caggtgctgg ccgactggat 3213541 ggaaggtccg ggcgccgagg tggccgtagc tgatgtcgcc cacacggtca accatcaccg 3213601 ggcccgccaa gccacgttcg gcaccgtcgt agcccgtgac cgcgcccagg cgatagccgg 3213661 actgcgcgcg ctggccgccg gccaacacgc tcccggagtg gtgagccacc aggacggttc 3213721 gccggggccg ggcaccgtat tcgtctactc cggccgcggc tcgcagtggg ccgggatggg 3213781 tcgccaattg ttggccgacg agccggcttt cgccgccgcg gtcgccgagc tggaaccggt 3213841 gtttgtcgag caagccggct tctcgctgcg cgacgtgatc gccaccggca aggagctagt 3213901 cggtatcgag cagatccagc ttggcctgat cggcatgcaa ctgacattga ctgagctatg 3213961 gcgctcctac ggggtgcagc ccgacctggt gatcggccac tccatgggcg aggtggccgc 3214021 cgccgtggtc gccggagcgc tgactccggc cgagggtctg cgggtgaccg ccacccgcgc 3214081 acggttgatg gcgccattgt ccggccaggg cggcatggca ctgctgggac tcgatgctgc 3214141 ggccaccgaa gcgttaatcg cggactaccc gcaggtgaca gtggggatct acaactcgcc 3214201 gcggcagacc gtgatcgccg ggccgaccga acaaatcgat gagttgatcg cccgggtgcg 3214261 cgcgcaaaac cggtttgcca gtcgggtcaa tatcgaagtc gccccgcaca atccggccat 3214321 ggatgcgctg cagccggcga tgcgttcgga gctggccgat ctgaccccac ggacccccac 3214381 catcggaatc atctccacca cctacgcaga cttgcacacc caaccgatct tcgacgccga 3214441 acactgggcc accaacatgc gcaaccccgt gcgcttccag caggccatcg cttccgccgg 3214501 tagcggcgcc gacggcgcct accacacctt catcgagatc agcgcacacc cgctgctgac 3214561 ccaggcgatt gccgacacct tggaagacgc gcaccgccca accaagtccg cagcgaaata 3214621 cttgagcatt ggcaccttgc agcgtgatgc cgatgacacg gtcaccttcc gcaccaacct 3214681 ctacaccgcc gacatcgccc acccaccgca tacctgtcac ccgcccgagc cgcaccccac 3214741 catccccacc acaccctggc aacacaccca ccactggatc gccaccacgc acccgagcac 3214801 ggcagcgcca gaagatccgg gcagcaataa ggttgtggtg aacggacaat cgacatccga 3214861 gagccgtgcg ctcgaagact ggtgccacca gctggcctgg ccgatccgcc cggcagtcag 3214921 cgccgacccg cccagcaccg ccgcctggct cgtggtggca gacaacgaac tctgccacga 3214981 gctggcccgt gcggccgatt ctcgggtaga cagcctctcg ccgccggcgc tcgcagcagg 3215041 cagcgatccg gccgcactgc tcgacgcgct gcgcggtgtg gacaacgtgc tctacgctcc 3215101 acccgtcccc ggtgaactcc tcgatattga atcggcctac caggttttcc acgcaacgcg 3215161 acggctagcc gccgcgatgg tcgccagcag cgccacggct atttccccgc cgaagttgtt 3215221 catcatgacc cgcaacgccc agcccatctc ggaaggcgac cgagccaacc ctggccacgc 3215281 tgtgctgtgg ggtctcggcc ggtcgctggc actagagcat cctgaaatct ggggcggcat 3215341 aatcgatctc gacgattcga tgcccgcaga gctggccgtg cggcatgtgc tgactgcagc 3215401 ccacggtacc gacggggagg atcaggtcgt ataccggtcg ggcgcacgcc atgtaccccg 3215461 gctgcagagg cgaactcttc cggggaaacc ggtcacgttg aatgccgacg ccagccagct 3215521 cgtcatcggt gcgaccggca acatcggacc gcatctcatc cgacagctcg cgcggatggg 3215581 ggctaagaca atcgtcgcga tggctcgcaa gcccggcgcg ctcgacgagt tgacccaatg 3215641 tctcgctgcg accggaacag atctcatcgc ggtggccgct gatgcgaccg atcccgccgc 3215701 catgcaaacc ctgttcgacc gattcggcac ggagctaccg ccactggagg gaatctatct 3215761 ggcggccttt gcgggccgcc cagcgctgct gagcgagatg accgacgacg acgtgaccac 3215821 catgtttcgt cccaagttgg acgccttggc gttgttgcac cgactgtcac tgaagagccc 3215881 agtgcgccac ttcgttttgt tctcttcggt gtcaggtctg ctgggttctc gatggctcgc 3215941 ccattacacc gcgaccagcg ccttcctgga cagcttcgcc ggcgcgcgtc gcaccatggg 3216001 cctgccggcc accgtcgtcg actggggact gtggaagtcg ctggccgatg tgcaaaaaga 3216061 cgcgactcaa atcagcgcgg aatccgggct gcaacccatg gctgacgagg tggccatcgg 3216121 cgcgctaccg ctggtgatga accccgatgc ggcagtcgcg accgtggtgg ttgccgcgga 3216181 ctggcccttg ttggccgcgg catatcgaac gcggggagcc cttcgcatag tcgacgacct 3216241 gttgccggca ccggaagacg tcgggaaggg cgaaagcgaa ttccgcacat cgttgcgtag 3216301 ctgcccggcg gagaaacgac gggacatgtt gttcgaccat gtgggcgcct tggccgccac 3216361 ggtgatggga atgccgccca cggagccgct cgatccgtcg gccggcttct tccaactcgg 3216421 catggactcg ctaatgagcg tgacacttca gcgggcgttg tcggaaagcc tgggcgagtt 3216481 cttgccggcg tccgtggttt tcgactatcc gaccgtttac agcctcaccg actacctggc 3216541 caccgtcctg cctgagctcc tcgaaattgg ggcaaccgca gtcgcaaccc agcaagccac 3216601 cgactcctac cacgaactga ccgaagccga gttgttggaa caactttcgg aacgactaag 3216661 aggaacacaa tgaccgcagc gacaccagat cgccgagcga tcatcaccga ggcgctgcac 3216721 aagatcgatg atctcacggc gcgcctggaa atcgccgaaa aatccagcag cgaaccgatc 3216781 gcggtgatcg gcatgggttg ccggttcccg ggcggggtca acaaccccga acagttctgg 3216841 gatttgttgt gcgccggccg aagcggcatc gtccgggttc ccgcgcagcg gtgggacgcc 3216901 gacgcctact actgtgatga tcacaccgtg ccggggacca tctgcagcac cgaaggcggt 3216961 tttctcacca gctggcagcc agatgagttc gatgcggagt tcttctcaat ctccccgcgc 3217021 gaagcggcgg cgatggaccc gcagcagcga ttgttgattg aagttgcgtg ggaagcgcta 3217081 gaagacgcgg gcgtcccgca acacaccatt cgcggtacgc aaacctcggt attcgtcggt 3217141 gtcaccgcct acgactacat gctcacgctg gcgggccggc tacgacctgt tgacctcgac 3217201 gcgtacatcc caaccgggaa ctcggcgaac ttcgccgccg gacggctggc ctacatcctc 3217261 ggggcacgcg gacccgcggt ggtcatcgac acggcctgct catcgtcgtt ggtggcggtg 3217321 cacctggcat gccagagcct gcgcgggcgg gaaagcgata tggcgttggt gggtggaacc 3217381 aaccttttgc tgagcccggg acccagcatc gcttgctcgc gatgggggat gctgtcaccg 3217441 gaggggcggt gcaagacctt cgatgcgtcc gccgatgggt acgtgcgcgg cgagggtgcc 3217501 gcggtggtgg tgctcaagcg gctggatgac gcggtgcgcg acggcaaccg cattcttgcc 3217561 gtggtacgcg gttcggcggt caaccaggac ggtgccagca gcggagtgac cgttcccaac 3217621 gggccagcgc aacaggcgtt gctcgccaaa gcattgacgt cgtcgaagtt gacagcggcc 3217681 gatatcgact acgtcgaggc ccatggaact ggtactccgc tgggcgaccc gatcgaactc 3217741 gattcactga gtaaggtttt cagcgatcga gcgggttcgg atcagttggt gattggatcg 3217801 gtgaagacca atctcggtca cctggaagcg gcggccggtg tcgccgggct gatgaaagcc 3217861 gtgctcgcgg tacacaacgg ctacattccg cggcatctta acttccacca gctgacacca 3217921 catgcaagtg aggccgcatc tcggctgagg atcgccgccg atggtattga ctggccaacc 3217981 accggtcgac ctcgccgggc gggggtgtcg tcgttcggcg tcagtgggac gaatgcacac 3218041 gtggtgatcg agcaggcacc cgatccgatg gccgctgcgg gaacggagcc gcagcgcggc 3218101 cccgttcccg cggtgtcgac gctggtggtg ttcggcaaga ccgcaccgcg ggtggctgcg 3218161 acggcatcgg tgctggcaga ttggctggac ggccccggcg cggcggtgcc gctggccgat 3218221 gtcgcgcaca ccctcaacca tcaccgggcc cgtcagacca ggttcggcac ggtagccgct 3218281 gtcgatcggc gccaagcggt gatcgggtta cgcgcgctgg ccgcgggtca atccgccccc 3218341 ggggtggtgg caccccgcga aggctccatc ggaggcggca cggtgttcgt ctactcggga 3218401 cgaggatcgc agtgggccgg aatggggcgc caactgctgg ccgacgagcc ggcattcgcc 3218461 gctgccatcg ccgaactgga gccggaattc gttgctcaag gcgggttttc gctgcgcgac 3218521 gtgatcgccg gcggaaaaga gttggttggc atcgaacaga tccagctggg actgatcggg 3218581 atgcagctgg cgctgaccgc gttgtggcgc tcatacggcg tgacacccga tgcggtgata 3218641 ggtcactcga tgggcgaagt ggccgccgcg gtggtggccg gggcgctgac cccggcccag 3218701 ggattacggg tgaccgcggt ccggtcgagg ctgatggcgc cgctgtccgg gcagggcacg 3218761 atggcgttgc tggaactcga cgccgaagcc actgaggcgc tgattgccga ctaccccgag 3218821 gtgagcctgg ggatctatgc ctccccacgc caaaccgtga tttccgggcc gccgctattg 3218881 atcgacgagc tcatcgacaa ggtgcgccaa cagaacggct tcgctacccg agtcaacatc 3218941 gaggtggccc cccacaaccc ggccatggat gcactgcaac cggcgatgcg ttcggaattg 3219001 gccgatctca ccccgcaacc gccgaccatc ccgatcatct ccaccaccta cgccgacctc 3219061 ggcatttccc tgggttccgg ccccaggttc gacgccgagc actgggcaac caacatgcgc 3219121 aacccggtac ggttccacca ggccatcgct catgccggcg ccgatcacca caccttcatc 3219181 gagatcagcg cccacccgct gctgacccac tcgatcagcg acaccctgcg cgccagctac 3219241 gatgtcgaca actatctgag catcggcacc ttgcaacgcg acgctcacga caccctcgag 3219301 ttccacacga acctcaacac gacccacacc acccatcccc cccagactcc ccaccccccc 3219361 gaaccccacc ccgtgctgcc caccacccca tggcagcaca cccagcactg gatcaccgcc 3219421 acgtcggccg cttaccacag gcccgacacc cacccgttgc ttggcgtcgg tgtcaccgac 3219481 cccactaacg gcacccgggt ttgggaaagc gagctcgacc ctgatctgct gtggctcgcc 3219541 gatcacgtca tcgacgatct cgttgtgctg cccggggcgg cctacgctga gatcgcgctg 3219601 gcggccgcga ccgacacctt cgcagtcgag caagatcagc cctggatgat cagcgagctc 3219661 gaccttcggc agatgctgca tgtgacccca ggcaccgtgt tggtcaccac gctcaccggc 3219721 gacgagcagc gatgccaggt cgaaatacgc acccgcagcg ggtcttcggg atggaccacc 3219781 cacgccaccg ccaccgttgc ccgcgccgag ccgttagcac cgctggatca cgaaggacag 3219841 cggcgcgagg taaccactgc cgacctcgag gaccaactgg atcccgacga cctgtatcag 3219901 cgcctgcgcg gcgccggcca acagcacgga cccgcgtttc aaggcatcgt ggggctggcc 3219961 gtcacgcaag ctggcgtggc ccgtgcgcaa gtacggctac ccgcatcggc cagaacgggt 3220021 tcccgtgagt tcatgctgca cccggtgatg atggatatcg cgttgcagac actgggagcc 3220081 acccggacgg cgaccgatct ggccggcggc caggacgccc ggcagggccc atcttccaac 3220141 tcggccttgg tggtaccggt gcgtttcgcc ggtgtccacg tgtacggcga tatcacccgc 3220201 ggggttcgcg cggtcggctc tctggccgca gccggtgacc ggctggtcgg cgaggtagtc 3220261 ctgaccgacg cgaatggcca accgctgctg gtcgtcgatg aagtcgagat ggcggtgctc 3220321 ggatccggca gtggcgcaac ggaactcacc aaccgcctat tcatgttgga gtgggagccc 3220381 gcaccgctgg aaaagaccgc cgaggctacg ggtgccctgt tgctgatcgg tgaccccgcc 3220441 gcgggtgacc cgctgctgcc cgcgctgcag tcgtcgctgc gcgaccgcat caccgacctc 3220501 gagctggcat ccgcggccga cgaagccacg ctgcgcgcgg cgatcagccg aacctcctgg 3220561 gacgggatcg ttgtggtctg tccgccccga gcgaacgacg aatcgatgcc ggacgaggct 3220621 caactggagt tggcacgcac acgcacgctg ctggtcgcca gcgtggtcga gaccgtgacg 3220681 cgaatgggtg cccgcaagag cccccgactg tggatcgtca cccgtggcgc tgcacagttc 3220741 gacgcaggcg agtcggtcac gttggcgcag accggcctac gtggcatcgc acgggtgctg 3220801 acatttgagc attcggagtt gaataccacc ctcgtagata tcgaaccgga cggcaccggc 3220861 tcgctggccg ccctggccga ggagttgctt gccggttccg aggccgacga ggtcgccttg 3220921 cgcgacggtc aacgctatgt caaccggctg gtgcccgcac ccaccacgac cagtggtgat 3220981 ctcgccgccg aagctcgcca ccaggtggtg aacctggaca gctcgggcgc ttccagggca 3221041 gctgtccgac tgcagatcga tcaacccgga cggctggacg cactaaacgt tcacgaggtg 3221101 aaacggggca gaccgcaagg cgatcaagtc gaggttcgcg tcgtcgccgc cggactcaac 3221161 ttcagcgacg tgctcaaagc gatgggcgtg tatccgggac tcgacggtgc cgcgccggtg 3221221 atcggcggcg aatgtgtcgg ctacgtgacg gccatcggtg acgaggttga cggcgtcgag 3221281 gtcggacagc gagttatcgc attcggccct ggcacattcg ggacccatct ggggaccatc 3221341 gccgatctcg tcgtcccaat tccggacacg ctagccgaca acgaggcggc cacgttcggc 3221401 gtcgcctatc tcaccgcctg gcactcgctg tgcgaggtcg ggcgcctatc ccccggcgaa 3221461 cgcgtgctca tccattccgc caccggcggt gttggaatgg cggcggtctc gatcgcgaag 3221521 atgatcggcg cccgcatcta cacgacggcc ggttcggacg ccaaacggga aatgctttcc 3221581 aggctcggtg tcgagtacgt cggcgactcg cgaagcgtgg atttcgctga cgagatcctc 3221641 gagctgacag acggctacgg tgtggacgtc gttctcaatt cgctggcggg cgaggcgatt 3221701 caacgcggcg tgcagatcct tgcgcccggt ggccggttca tcgaactggg caagaaggac 3221761 gtctacgccg atgccagctt gggcttggcc gcgctagcca agagcgcgtc cttctccgtg 3221821 gtcgacctcg acctgaatct caagctgcag ccggcgcgct accgccaact cctgcaacac 3221881 atcctgcagc acgtggcgga tggcaaactc gaggtacttc ccgtcaccgc atttagcctg 3221941 cacgatgcgg ccgacgcatt ccggcttatg gcatccggta aacacaccgg aaagatcgtc 3222001 atctcgatac cccagcacgg cagcatcgag gcgatcgctg ccccgccacc acttcctctg 3222061 gtcagccgcg acggcggcta cctcatcgtc ggcggtatgg gtggtctcgg attcgtcgtc 3222121 gcgcgctggc tggctgagca aggtgcggga ctgattgtcc tcaacggacg ctcggccccc 3222181 agcgacgagg tggcagccgc tatcgcggag ctgaacgcct ccggtagccg gatcgaggtg 3222241 atcaccggcg acatcaccga gccagacacc gccgagcggc tggtgcgggc ggtcgaagac 3222301 gccgggttcc ggctggccgg ggtggtgcac agcgcgatgg ttctcgccga cgagatcgtg 3222361 ttgaacatga ccgattccgc cgctcggcga gtgttcgccc cgaaggtcac cggcagctgg 3222421 cggcttcatg tggccaccgc cgcgcgcgac gtcgactggt ggctgacctt ctcctcggcc 3222481 gccgcgctgc tgggcactcc cgggcagggc gcgtacgccg ccgccaactc gtgggtcgac 3222541 ggcctggtcg cgcatcggcg ctcggccgga cttcccgctg tcgggatcaa ctggggcccg 3222601 tgggccgacg ttggacgcgc gcagttcttc aaagacctcg gggtggagat gatcaacgcc 3222661 gagcaggggc ttgccgccat gcaggcggta ctcaccgccg atcgcgggcg caccggtgtg 3222721 ttcagcctcg acgcgcggca gtggttccaa tcgttccccg ctgtggcggg gtcctcgctg 3222781 ttcgcgaagc tgcatgactc ggcggcccgc aaaagtgggc agcggcgcgg cgggggcgcg 3222841 attcgcgctc agctagacgc cctcgacgcg gccgaacgcc caggccacct cgcgtccgcg 3222901 atcgccgacg agatccgtgc ggtgctgcgc tcaggcgatc ccatcgatca ccaccgaccg 3222961 ctggaaaccc tgggactcga ctcgctgatg ggcctggaat tgcgcaatcg gctggaagca 3223021 agtctgggca tcacgttgcc ggtcgcgttg gtgtgggcat acccgacgat cagcgatctc 3223081 gcgaccgccc tgtgcgaacg aatggactac gcgacacccg cggctgcgca ggagatttcc 3223141 gatacagaac ccgaactgtc cgacgaggag atggatttgc tcgccgatct ggttgacgcc 3223201 agcgagctgg aagctgcgac gcgaggcgag tcatgacaag tctggcggag cgcgcggcgc 3223261 aactgtcgcc gaacgcgcga gcggccctgg cgcgcgagct cgtccgtgcg ggtacgacct 3223321 tcccgaccga catctgcgag ccggtggcgg tggtgggcat cggctgtcgc tttccgggga 3223381 atgtgactgg gccagagagc ttttggcagc tactggccga cggtgtggac acaatcgagc 3223441 aggtgccgcc tgatcggtgg gatgcggacg cgttctacga tcccgatcct tcggcgtcgg 3223501 gtcggatgac gacgaaatgg ggtggtttcg tttccgatgt cgacgcgttc gacgccgact 3223561 ttttcggaat cactcctcgg gaagccgtgg cgatggaccc gcagcatcgg atactgctcg 3223621 aggttgcctg ggaagcgttg gagcacgcgg gtattccgcc ggattccttg agcggcactc 3223681 gaaccggcgt gatgatgggt ctgtcgtcgt gggactacac gatcgtcaat atcgagcgca 3223741 gagccgacat cgacgcgtac ctgagcaccg gaaccccgca ctgtgccgcg gtggggcgga 3223801 tcgcgtatct gttgggattg cgtggtccgg ccgtcgccgt agataccgct tgttcgtcgt 3223861 cgctggtggc aattcacttg gcgtgtcaga gccttcgcct gcgtgaaacc gacgtggcat 3223921 tggcgggcgg ggtgcagctc accttgtcac cgttcaccgc catcgcgctg tccaagtggt 3223981 cggcgctgtc accgaccggc cgatgcaaca gcttcgacgc caacgcggat ggattcgtgc 3224041 gcggcgaggg ctgcggcgtg gtggtgctca agcggttggc cgacgcggtg cgcgaccagg 3224101 accgggtgct tgcggtggtc cgcggttcgg caactaactc cgatggtcgg tccaacggca 3224161 tgaccgcacc gaacgcgctg gcgcagcgtg acgtgatcac atccgccctc aagcttgcgg 3224221 atgttacccc tgacagcgtg aactatgtcg aaacacacgg caccggaacg gtgttggggg 3224281 accccatcga gttcgagtcg ctggcggcca cttatggcct gggtaaaggc cagggcgaga 3224341 gcccgtgcgc attggggtcg gtcaagacca acatcggcca cctggaggcg gccgccggtg 3224401 tggctggatt catcaaggcg gtgctggcgg tgcaacgtgg gcacattccc cgcaacttgc 3224461 acttcacccg gtggaacccg gccatcgacg cgtcggcgac gcggctgttc gtgccgaccg 3224521 aaagcgcccc gtggccggcg gctgccggtc cacgcagggc tgcggtgtca tcgttcggcc 3224581 tcagcgggac caacgcgcac gtggtggtcg agcaggcacc cgacaccgca gtagccgcag 3224641 ccggcggcat gccgtatgtt tcggcgctga acgtctccgg caagacggcc gcgcgggtgg 3224701 cgtcggcggc ggcggtgctg gccgactgga tgtcggggcc gggcgcggcg gcaccactgg 3224761 ccgacgtggc acacacgttg aaccggcacc gggcccggca cgccaagttc gccaccgtca 3224821 tcgcgcgtga ccgcgccgag gcgatcgcgg ggttgcgagc gctggcggcc ggacaaccac 3224881 gcgttggggt ggtggattgc gaccagcatg ccggtgggcc tggccgggtt tttgtgtatt 3224941 cgggtcaggg ctcgcagtgg gcgtcgatgg gccagcagtt gctggccaac gaaccggcgt 3225001 tcgccaaggc ggtagccgag ctggatccga tattcgttga ccaggttggc ttttcgctgc 3225061 agcaaacgct tatcgacggc gacgaggtgg tgggcatcga ccgcatccag ccggtgctgg 3225121 tcgggatgca gttggcgctg accgagttat ggcggtccta tggggtgatt ccagatgccg 3225181 tgatcgggca ctcgatgggt gaggtgtcgg cggcagtggt ggccggcgcg ttgacgcccg 3225241 agcagggctt gcgggtcatc accacccggt cgcggttgat ggcgcggctg tcggggcagg 3225301 gagcgatggc gctgctcgag ctggatgccg acgccgccga ggcgctgatt gccggctatc 3225361 cgcaggtgac gctggcggtg catgcgtcac cgcgccagac ggtgatcgcc gggccgcccg 3225421 agcaggtgga cacggtgatc gcggcggtag cgacgcaaaa ccggttggcg cgccgcgtcg 3225481 aagtcgacgt ggcctcccat cacccgatca tcgatcccat actgcccgag ttgcgaagcg 3225541 cgttagcgga tttgactccg cagccgccga gcatcccgat catttccact acgtacgaaa 3225601 gcgcgcagcc ggtggcggat gccgactatt ggtcggccaa cctgcgcaac ccggtgcgat 3225661 tccaccaggc cgtcaccgcc gccggtgtcg accacaacac cttcatcgaa atcagccctc 3225721 accccgtgct cacgcacgca ctcaccgaca ccctggatcc ggacggcagc catacagtca 3225781 tgtcgacgat gaaccgcgaa ctggaccaga cgctgtattt ccacgcccaa ctcgccgcgg 3225841 tcggtgtggc tgcgtccgag cacaccaccg gtcgccttgt cgacctgccc cccacaccgt 3225901 ggcaccatca gcgattctgg gtcacggatc gttcggcgat gtccgagctg gccgcgaccc 3225961 acccgctcct gggcgcgcac atcgagatgc cgcgcaacgg agaccatgtc tggcagaccg 3226021 atgtcggcac cgaggtctgt ccctggttgg cagaccacaa ggtgttcggt caacccatca 3226081 tgccggccgc ggggttcgcc gagatcgcct tggcggcggc cagcgaagcc ctcggcacag 3226141 ccgccgacgc cgtcgcaccc aacatcgtga tcaaccagtt cgaggtggag cagatgctgc 3226201 ccctcgacgg ccacacgccg ctaacgacgc agttaattcg cggcggggac agccagattc 3226261 gggtcgagat ctattcccgc acgcgtggcg gagagttctg ccgacacgcc acggccaagg 3226321 ttgaacaatc gccgcgcgaa tgtgcgcacg cgcacccgga agcccaaggt cccgccaccg 3226381 ggacaacagt gtcgccggcc gatttttatg ccctgctccg ccaaaccggc caacaccatg 3226441 gtccggcgtt cgcggcctta agccggatcg tgcgcctggc cgatggttcc gcggaaaccg 3226501 agatcagcat tcccgacgag gcgccgcgcc atcccgggta tcggctgcac cccgtggtat 3226561 tggatgcggc attgcaaagc gtgggtgccg cgatacccga cggcgagatc gcggggtcgg 3226621 cggaagccag ctatctgcca gtgtcgttcg agaccatccg ggtgtaccgc gacatcggtc 3226681 ggcacgtcag gtgtcgtgcc cacctgacaa acctcgacgg cggcaccgga aagatgggca 3226741 ggatcgtcct aatcaacgac gccggccaca tagcggccga agtggacggc atctatctgc 3226801 gtcgtgtcga acgccgtgcg gtacccctgc cactagagca gaagatcttc gatgccgaat 3226861 ggaccgaaag cccgatcgca gccgtgccgg ctccggagcc agctgccgag acgacgcggg 3226921 gaagttggct ggtactcgcc gatgcaacgg tggatgcgcc aggcaaggcc caggccaagt 3226981 cgatggccga cgacttcgtg cagcagtggc gctcgccgat gcggcgggtg cacaccgccg 3227041 atatccacga cgaatcggcg gtgctggccg catttgcaga aacggcaggc gatcccgagc 3227101 acccgccggt tggcgtggtg gtgttcgtcg gcggtgcctc gagtcgactg gacgacgagc 3227161 tggcggcggc gcgcgacacg gtgtggtcga tcaccacggt ggttcgtgcg gtcgtcggca 3227221 cgtggcacgg ccgatcaccg cggctatggc tggtcaccgg gggcggactt tccgttgccg 3227281 acgacgagcc gggaacaccc gcggcggctt ccttgaaagg gctggtgcgg gtgctcgcct 3227341 tcgagcaccc ggacatgcgc accaccctgg tcgatctgga catcacacaa gacccgctga 3227401 ccgcgctgag cgcggaactg cggaatgccg ggagtgggtc gcgccatgat gacgtgatcg 3227461 cgtggcgcgg cgagcgcagg ttcgtcgaac ggctgtcgcg cgccacgatc gatgtatcca 3227521 aagggcatcc ggtggtgcgc cagggagcgt cgtacgtcgt caccggcggc ctcggcggtc 3227581 tcggcctggt cgtcgctcgt tggctggtgg accgcggcgc cggccgggtg gtgctgggtg 3227641 gccgcagcga tcccactgac gagcagtgca acgtcctggc cgaactgcag acccgcgccg 3227701 agatcgtggt tgtccgtggc gacgtggcat cgccgggggt ggcagaaaag ctgattgaga 3227761 cggcccgaca gtctgggggc caattgcgcg gcgtcgtgca cgccgccgcg gtcatcgaag 3227821 acagcctggt gttctctatg agcagggaca acctagaacg ggtgtgggca cccaaggcca 3227881 ccggtgcgct gcgcatgcac gaagccaccg ctgactgcga gctcgactgg tggctcggat 3227941 tctcttccgc cgcttcgcta ttgggttctc ccgggcaagc ggcctacgcg tgcgccagcg 3228001 cgtggctgga cgcgctggtc ggatggcgca gggcatccgg cctgccggcc gcggtgatca 3228061 actggggtcc gtggtcggag gtaggcgtcg cccaggcctt ggtgggcagt gttctcgaca 3228121 cgatcagtgt cgcagaaggc atcgaggctc tcgactcatt gcttgccgcc gaccggatcc 3228181 gcactggagt ggctcggctg cgtgccgatc gggccctggt cgcattcccg gagatccgca 3228241 gcatcagcta cttcacccag gtggtcgagg agctggactc ggcgggtgac ctcggcgact 3228301 ggggcgggcc cgacgcgctt gccgacctcg acccgggcga ggcgcggcgc gcggtgaccg 3228361 agcggatgtg tgcgcgcatc gctgcggtga tgggctacac tgaccagtcg actgtcgaac 3228421 ccgccgtgcc cttggacaag cccctgaccg agctggggct ggattctctg atggcggtac 3228481 gaatacgcaa cggcgcgcgg gcggatttcg gcgtggaacc gccggtagcg ctgatactgc 3228541 aaggcgcgtc cttgcatgac ctgacggcgg acttaatgcg ccaactcggg ctcaatgatc 3228601 ccgatccggc gctcaacaac gctgacacta ttcgcgaccg ggcgcgccag cgcgcggcag 3228661 cgcgacacgg agccgcgatg cggcgccgac ctaaacctgc agtacaggga ggataagacc 3228721 tgtgagcatc cccgagaacg cgatcgcggt ggtcggcatg gccggccgat ttccgggcgc 3228781 caaggatgtt tcggcgttct ggagcaacct tcggcgcggt aaggagtcga tcgtcaccct 3228841 gtccgaacag gagctgcgcg acgccggcgt cagcgacaag acgctggccg atccggcgta 3228901 tgtgcgtcgc gccccgcttc ttgacgggat cgacgagttc gacgccggct tcttcgggtt 3228961 cccgccgctg gccgcgcagg tgctggatcc ccaacaccgg ttgttcctgc agtgtgcatg 3229021 gcatgcgctc gaggacgcgg gcgctgaccc cgcacggttc gacggctcga tcggcgtata 3229081 cggaaccagc tcccccagcg gctatctgct gcacaacctg ctgtcgcatc gcgacccgaa 3229141 cgctgtgttg gccgagggac tcaacttcga ccagttcagc ctgttcttgc agaatgacaa 3229201 ggactttctg gcaacccgga tttcgcacgc gttcaacctg cgcgggccga gcatcgcggt 3229261 gcaaaccgcg tgttcatcgt cgctggtagc ggtgcatctg gcctgcctga gcctgctatc 3229321 cggcgaatgc gacatggcgt tggccggcgg gtcgtcgcta tgcatcccgc accgtgtcgg 3229381 ctacttcacc tcaccgggat cgatggtgtc ggcggtgggc cactgtcggc ccttcgacgt 3229441 gcgggccgac ggcacggtct tcggcagcgg tgtcgggttg gtggtgctca agccgctggc 3229501 ggccgccatc gacgccggag accggattca cgccgtcatc cgcggatcgg cgatcaacaa 3229561 cgacggatcg gcgaagatgg ggtatgcggc gcccaacccg gccgctcaag ccgatgtcat 3229621 cgccgaagcc catgcggtgt ccggcatcga ttcgtcgacc gtgagctatg tcgagtgcca 3229681 cggaaccggc accccgctcg gtgatcctat cgaaatccag ggcctgcgag cggcgttcga 3229741 ggtgtcgcag acgagccgtt cggccccttg tgttctgggg tcggtcaagt cgaacatcgg 3229801 ccacctggaa gttgctgccg gcatcgcggg tctgatcaaa acgattctgt gcctaaagaa 3229861 caaggcacta cccgcgacgc tgcactacac cagcccgaac ccggaactgc gcttggacca 3229921 aagtccgttc gtcgtgcaaa gcaagtacgg cccctgggag tgcgacggcg ttcgtcgtgc 3229981 cggggtgagt tcgttcgggg tcgggggtac caacgcgcac gtcgtcttgg aggaggcgcc 3230041 agcagaagca tcggaggttt cagcgcacgc cgagccggct ggccctcagg taatcctgct 3230101 ctcggcgcaa acggccgcgg cgctcggcga gtcgcggacc gccctggccg cggcgctaga 3230161 aacgcaagac ggcccgcgcc tgtccgacgt ggcctacacg ctcgcccggc gccgcaagca 3230221 caacgtcacg atggccgccg tcgtgcacga ccgcgagcac gcggccaccg tgctgcgggc 3230281 ggccgagcac gacaacgttt tcgttggcga agccgcccac gatggggagc atggcgatcg 3230341 cgccgacgcc gcacccacgt cggatcgcgt cgttttcctg tttcccggac agggcgctca 3230401 gcacgtcgga atggcaaaag ggctctatga caccgagccg gtcttcgccc aacacttcga 3230461 cacctgcgcc gccggattcc gcgacgagac aggcatcgac ttgcatgccg aagtgttcga 3230521 cgggaccgca acagatcttg agcgcattga ccgttcgcaa ccggcgttgt tcacggtgga 3230581 atacgcgctc gcgaagttgg tcgacacttt cggcgtgcgc gccggggcgt acatcggata 3230641 cagcaccggc gaatacatcg cggccaccct ggccggcgta ttcgacctgc agacagcgat 3230701 caaaacggtg tcgctgcgcg cccgccttat gcatgagtcg ccgcccggtg ccatggtcgc 3230761 ggtggctctt ggccccgatg acgtcacgca gtacctgcca ccggaggtcg agctgtccgc 3230821 ggtaaacgat cctggtaact gtgtggtcgc cgggcccaaa gaccagatcc gtgcactgcg 3230881 ccaacgtctt accgaggcag ggattcccgt tcgccgcgtc cgggcaaccc acgcgttcca 3230941 taccagcgcg atggatccca tgctgggcca attccaagaa ttcctgtccc gtcaacagct 3231001 acgtcctccg cgcacaccgc tgctgagcaa cctcaccggt agctggatgt ccgaccagca 3231061 agtagtcgat ccggccagct ggacgcgtca aatcagctcc cccatcaggt tcgccgacga 3231121 gctggacgtg gtgctggcag ctccaagtcg aatcctggtc gaggttggtc cgggcggcag 3231181 cctgaccggt tcggctatgc gccacccgaa gtggtcgacc acgcaccgca ccgttcggct 3231241 tatgcgccac ccactgcaag acgtcgacga ccgcgacact tttctgcgcg cgctgggcga 3231301 actctggtct gccggagtcg aggtcgactg gacgccgcgg cgtccggcgg tgccgcacct 3231361 cgtttccctg ccgggttatc catttgcccg tcaacggcat tgggtcgaac ctaaccacac 3231421 ggtttgggcg caggctcccg gcgcaaacaa cggctcaccg gccggcactg cggatggttc 3231481 cacggccgcc accgtcgatg cagcccgcaa cggagagtcg cagaccgagg ttacgctgca 3231541 acgcatctgg tcacagtgcc tcggcgtcag ctcggtcgat cggaacgcca atttcttcga 3231601 cctcggcggc gattctttga tggcgatcag catcgcgatg gccgccgcca acgagggtct 3231661 gaccatcacg ccgcaggatc tctacgaata cccgaccctg gcctcgctga cggccgccgt 3231721 cgacgcgtcg ttcgcgtcca gcgggttggc gaagcccccg gaggcacagg cgaacccggc 3231781 ggttccaccc aacgtcacgt acttcctcga ccgcggattg cgcgacaccg gccgctgtcg 3231841 tgtcccgctg atcctgcgcc tggatcccaa gatcgggcta ccggatattc gagcggtgct 3231901 gaccgcagtg gtcaaccacc acgacgcatt gcgcctgcac ctggtcggca acgatgggat 3231961 atgggagcag cacatcgcgg cacccgcaga attcaccggg ctttccaacc ggtcggtgcc 3232021 cgacggcgtg gctgcaggca gccccgagga acgggccgcg gtcttgggca tcctggccga 3232081 actccttgag gatcaaacgg atccgaacgc gccgctggct gccgttcata tcgccgccgc 3232141 gcacggcggt ccgcactatc tgtgccttgc catacatgcg atggtcaccg acgactcatc 3232201 gcgccagatc ctggcgaccg acatcgtcac cgcgtttgga caacggctgg caggcgagga 3232261 gatcacgctg gaaccggtca gcacggggtg gcgggaatgg tcactgcgtt gcgcggccct 3232321 cgcgacgcat ccggcggcgc tggacactcg ctcgtactgg atcgagaatt cgaccaaggc 3232381 gactttgtgg ctggccgatg cccttcccaa cgcgcatacc gcccatccgc cccgcgccga 3232441 cgagctcacc aagttgtcga gcacgctaag cgtcgagcag acatccgagc tggacgacgg 3232501 ccggcgcagg ttccgccggt cgattcagac gatcctgctg gccgccctcg gccgcacaat 3232561 agctcagacg gtaggtgagg gtgtggtcgc cgtggagctc gaaggcgagg gccgctcggt 3232621 gctgcggccg gatgtcgacc tgcgcagaac ggtcggctgg ttcacgacgt actacccggt 3232681 accgctggca tgcgcaacag ggctgggcgc gcttgcgcag ctggacgcgg tgcacaacac 3232741 tcttaagtcc gttccgcact acggaattgg atacgggctg ctgcgctacg tttacgcccc 3232801 gaccggacgt gtcctgggcg ctcagcgcac acccgacatt cacttccggt atgcgggcgt 3232861 gatccccgag ctaccgtccg gcgatgctcc agtacagttc gactcggaca tgacgcttcc 3232921 ggtgcgcgaa ccgatcccag ggatgggcca cgccatcgaa cttcgggtgt atcggtttgg 3232981 tggctcactg catctcgatt ggtggtacga cacccgccgg atcccggcgg caacggcaga 3233041 agcgctggag cggaccttcc cgctggccct cagcgcgctg atccaggagg ccatcgcggc 3233101 cgagcacaca gagcacgacg acagcgagat agtcggggaa cccgaggcgg gcgctctggt 3233161 ggacctgtcg agcatggatg ccggctgagg aggatcggat gcgcaacgac gacatggcgg 3233221 tggtggttaa cggggttcgc aagacctacg gcaagggcaa gattgtggcc ctcgatgacg 3233281 tgagtttcaa ggtgcgccgc ggtgaagtga tcgggctgct gggccccaac ggggccggca 3233341 agacgaccat ggtggacatc ttgtcgacgc tgacccgacc ggatgccggc tcggcgatca 3233401 tcgctggcta cgatgttgtt tccgaaccgg ccggtgtacg ccgctcgatc atggtcaccg 3233461 ggcagcaggt ggccgtcgac gacgcgcttt ccggtgagca gaacctggtg ttgtttggtc 3233521 gtctgtgggg actgagcaag tccgcggcgc gcaaacgcgc cgccgaactg ctcgagcaat 3233581 tcagcctcgt acatgccgga aagaggcggg tgggcaccta ctccggcgga atgcgccgac 3233641 gaatagacat cgcgtgcgga ttggtggtcc aaccccaggt ggcgttctta gacgagccca 3233701 ccaccgggct cgatcccagg agccggcaag ctatttggga tctggtggcc agcttcaaga 3233761 agctgggcat tgccacgttg ttgaccacgc agtatctcga ggaggcggat gcgctcagtg 3233821 accgcatcat cctgatcgat cacggcataa tcatcgccga aggcaccgcg aatgaactca 3233881 agcaccgcgc cggcgacacc ttctgcgaaa tagtgccccg cgatctgaag gatctggacg 3233941 ctatcgtcgc ggcgctcggt tcgctgttgc ccgagcacca cagggcgatg ctgacgcccg 3234001 actcagaccg cattacgatg ccggcgcctg acggcatacg tatgctcgtc gaggcagcgc 3234061 gccggatcga cgaggcgagg atcgagctag ccgatattgc gctgcgccga ccgtcactcg 3234121 atgacgtatt cctggccatg acgaccgatc ccaccgagtc tctgacccat ctggtgtcgg 3234181 ggtccgcgcg atgagcggcc cggccataga tgcgagcccc gccctgacct tcaaccagtc 3234241 aagcgcgagc attcagcagc gacgcttatc gaccgggcga cagatgtggg tgctctatcg 3234301 gcgtttcgcc gcgccgagcc tactcaacgg tgaagtactc accacggtgg gcgcgccgat 3234361 aattttcatg gtgggcttct atatcccgtt cgccataccg tggaaccaat ttgtgggtgg 3234421 cgccagctcg ggcgtcgcca gcaacttagg gcaatacatc acgccgttgg tcacactgca 3234481 ggcggtctcg ttcgccgcga tcgggtcggg ctttcgagcc gcgaccgatt cgctgctagg 3234541 cgtcaatcgt cggtttcagt ccatgccgat ggccccgttg acgccactgc ttgcccgcgt 3234601 gtgggtggct gtggaccgat gcttcacggg tttggtgata tcgctagttt gcggctacgt 3234661 catcggattc cgttttcatc gcggggccct ctatatcgtc ggtttttgcc tactggttat 3234721 cgcgatcggg gctgtgctgt cattcgccgc tgacctggtt ggcaccgtta ccaggaaccc 3234781 agacgcgatg ctgccgctgc tgagcttgcc cattttgatc ttcggactgc tgtccattgg 3234841 tcttatgccg ttaaagctgt ttccgcactg gatccatcca tttgttcgca accagccgat 3234901 ctcccagttc gtcgcggcgc tgcgggcatt ggccggagat accaccaaga cagcctcaca 3234961 ggtgagttgg cctgtgatgg ctccgacgtt gacgtggttg ttcgctttcg tggtgatcct 3235021 ggcgctttca tccaccattg ttttggctag gcggccatga tcacgacgac aagtcaggaa 3235081 atcgagcttg cacccacacg tttgccaggc tcgcaaaacg ctgctcggct gttcgttgcg 3235141 cagacccttt tgcagaccaa ccggttgcta actcgatggg cacgtgacta tatcaccgtt 3235201 atcggagcga tcgtgttacc gattctcttc atggtggtgt tgaacattgt gctaggtaac 3235261 ctagcttatg tcgtaaccca cgacagcggg ctctacagca ttgttccgct gatcgcactc 3235321 ggcgccgcga tcactgggtc aacttttgtc gcgatcgacc tgatgcgcga gcgctccttc 3235381 ggactgcttg cccgactgtg ggtgctgccc gtgcaccgag catcgggcct gatctctcga 3235441 atcctggcaa acgcgattcg gactctggtc accactttag tgatgctagg tactggggtg 3235501 gtattgggtt tccggtttcg acaaggcctg atcccgagcc tcatgtggat tagtgtcccg 3235561 gtgatactgg gcatcgcaat cgcggctatg gtcactaccg tcgcgcttta cacagcacaa 3235621 accgttgttg tcgaaggcgt tgagctggtg caagcaatcg cgatcttctt ctccacgggt 3235681 ttggtgccgc tcaactcgta tccaggctgg attcagccgt tcgtcgccca tcagccggtg 3235741 agctacgcca tcgcggcgat gcgcggtttt gcaatgggtg gtccggtcct ctctccgatg 3235801 atcgggatgc tggtgtggac cgcgggtatc tgcgtcgtat gcgccgtacc cttggccatt 3235861 ggctaccgac gggccagcac gcattgacca gcaccgctgg cccgggatgc cgtgacgagt 3235921 tgggagtgtt gagatgtttc ccggatctgt gatccgaaag ctgtcgcaca gcgaggaagt 3235981 cttcgcgcag tacgaggttt ttacttccat gacaatccag ctgcgcggtg ttatcgatgt 3236041 cgatgcgctg tcggatgcct tcgacgccct cttggaaacc cacccagtcc tggccagcca 3236101 ccttgagcaa agctccgacg gcggttggaa tctcgttgcc gacgacctgc tgcactctgg 3236161 aatctgtgtc atcgacggca cggccgccac caacgggtca ccgtcgggaa acgccgaact 3236221 acggctcgac cagagcgtgt ccctattgca tctgcagctg atcctccgcg aaggaggagc 3236281 cgagctgacg ctatacctcc atcactgcat ggccgatggt catcacgggg ccgttctcgt 3236341 cgacgagctg ttctcccgct acaccgacgc ggtcactacc ggtgaccccg gcccgataac 3236401 cccgcagccc acgccgctgt caatggaggc tgtgctggca cagcggggta tcaggaagca 3236461 agggctttcg ggagctgaac gttttatgtc ggtgatgtat gcctatgaga tccctgccac 3236521 cgagacgccg gcggtcctcg cgcatcctgg gctgccccaa gctgttccgg tcacccgact 3236581 ctggctttcc aagcagcaga catcggacct catggcgttc ggccgcgagc atcgcctcag 3236641 ccttaacgcc gtggtcgcgg cagccatcct gctgaccgag tggcagctgc gcaacacccc 3236701 gcacgtcccg attccctacg tttaccccgt cgacctgcga tttgttctag ctcccccagt 3236761 ggccccgaca gaagctacca atctcctcgg ggcggcgtct tacctcgctg agatcgggcc 3236821 gaataccgac atcgtggatc tggcaagcga tatcgttgcc acacttcggg ctgacttggc 3236881 caatggtgtg attcagcagt cggggctcca cttcggcacg gcattcgaag gaactcctcc 3236941 cggcctacca ccacttgtct tctgcactga cgccacttca tttcccacca tgcgcacacc 3237001 gccgggcctg gagatcgaag acattaaggg ccaattctat tgttcgatca gcgtccccct 3237061 cgatctgtac tcgtgtgccg tttacgcagg acaactgatc atcgagcatc atgggcacat 3237121 cgcggaaccg gggaagtccc tcgaggcgat acgttcactg ctgtgcaccg ttccctcgga 3237181 gtatggctgg atcatggagt gacctaacga accagcccgc cgatcgggct tcggccagat 3237241 cacgcactcg cgtcccgaac cgatcatcat atccgcccca gctgcggtcg cggctgacaa 3237301 gccttacccc gcagctcacc tcatgatctc accacgaggc ttgcggcaca acagaattcg 3237361 accgctatga tgccgccggt gccgccgcct gctcctcggc cagcgtgtcc gccaagtact 3237421 gggccaaagc gcgggcggtg ttgtttgtgg cgatgacctt gggggtcagg cgtatcccgg 3237481 tctcggtttc aacgtgggta cgcatctcga gcatgcccag cgaatccagg ccgtactcga 3237541 tgaatgagcg gtcagcgtcg atcgtgcgac gcaggatcac actggcctgc tcaaccagca 3237601 gacgccgtag ccggccggcc cattcatctt gcggcagcga aaggagctcc atgcggaatt 3237661 tgcttgggcc ccttgaccgc tgcccagtgg atgcgaacat ttcaccccac gggctgcgtc 3237721 ggacaaggtc ggccagccat ggcgccccga ggatcggaat gtaaccgctg taggcgcggt 3237781 cgtggcgcac gagcgtctcg aaggcatacg caccttcctc cggggtgatc atgatttcgc 3237841 ccccctcggc caagaacgtg gcgcggccga cctcgcccca cgcaccccac gcaatcgcgc 3237901 tgaccggcag gccctgggcg cggcgccagt gcgcgaagac gtcgacccag ctgttggccg 3237961 ccgcgtaggc gccctgaccc ggcgagccga gcaatgccgc tcccgaggag aacaagcaga 3238021 accagtccag cggctgaccg agggtggcgc ggtgtaggtt ccaggatccg aacaccttgg 3238081 gcgaccagtc gcgatcgatg agctcatcgg tgatgttggt cagcgtggca tcctcgacca 3238141 ccgccgccga gtgcagcaca ccgcgcagcg gaagcccggt agcggtcgcc gcactcacca 3238201 gccggtccgc cgtgtcgggt tcggcgatgt tgccacactc caccacgatg tcggccccag 3238261 ccgcgcgcag gccttcgatg gtctgccgcg ctttggggtt gggctgggaa cgtgcggtca 3238321 gcacgatccg gccacagccc gccgcggcca gcttcgaggc gaagaacagg ccgaggccac 3238381 ccaggccgcc ggtgatgatg taggagccgt cgcggcggta cagcggagct tgctccgggg 3238441 tgaccgccac gcttctacgg ccgctacgcg gtacgtcgag cacgagtttg ccggtgtgct 3238501 cggcgttgct cattgcccgg atggcgtcgg ccgcctcggc caacgggtaa tgagtgcatt 3238561 gcggtgcggt cagcaccccg tctgcggtga gcttgaacac cgtggccagc aactcacgga 3238621 cccggtcggg ctgggtgacc gacatcagcg cgaggtccaa gtagtagaag gtcagtccgc 3238681 gacggaacgg gaacagcccc agccgggtgt tgccgtaaac gtcggccttg ccgatttcga 3238741 cgaagcgtcc gccgaaggcc aacaactcca gccccgcacg ttgggcggcg ccggtcagcg 3238801 agttcagcac gatatccacg ccgtacccgt cggtgtcgcg ccggatctgc tcggcgaact 3238861 cgacgctgcg cgaatcgtag acatgctcga cgcccatgtc gcgcagcatg gctcgcttcg 3238921 cgggattgcc ggcggtcgcg aaaatctccg ctcccttggc gcgggcaatc gatatggccg 3238981 cctgccccac accgccggtg gcggagtgaa tcaacacttt gtcaccggcc ttgatctgag 3239041 ccaggtcgtt gagcccatac caggcggtgg catgcgcggt ggccgccgtg atcgcctgct 3239101 catcggtcaa gccgggcggc agcgtgaccg cgaggttggc gtcacaggtg aggaacgtcc 3239161 gccaacagcc accttcggag aaaccgccaa cacgatcacc gacctggtga ccggtgacac 3239221 cttccccgac cgcagtcacc acaccgacga aatccatacc caactgcggc tcgcggtcat 3239281 cgataatggg gaatcgtcca aacgcgatca aaacgtcggc gaagttgatg ctggacatgc 3239341 tgaccgcgac ttcgatttgc ccggggccgg gcggaactcg gtcactcgca acgaattcca 3239401 acgtttgcaa gtctcccggc ctgcggacct gcacccgcat accgtcgtgg tcgggatcca 3239461 agaccgcggt gcgccgctct tcatggccca gcggactggg ggtcaagcgg gccacatacc 3239521 agtcgccatt ccgccaggcc gtctcgtcct cttccgatcc gctcagcagc tgctgggcca 3239581 cccgctcaac gtccgtgtgt tcgtccacat cgatcaaggt ggtgcgcagc atcggatgtt 3239641 cactgctgat cacccgtagc agaccacgca ggccggcctg ctccaggttg gctctttctc 3239701 ccgagtcgtg cggcttcact atctgggctt gtctggtcac cacgaacaag cgcggcagct 3239761 cgccctcgaa ttcagccagt tcccgggtga tccgaaccag gtgacggacc tgttcacgac 3239821 cggccagcag actgtgctca tcggggtcgc cgacgcgagg cccatacacg atcaccacac 3239881 catcgcggcc acgcagctgg ctgcccagct tttcgaggcc agcttgatcg ttgggcgggg 3239941 tgtcctggac cgaccaggac aggctggcgc attcggtgcc ttgggggccg tgggacttca 3240001 gcgcgtccgt caacgtggaa gccaacatgt cgggggtgtc gacggcgttg gaagtgtcga 3240061 tcaatagcca cgatccagcc tcgccgtcgc caacctcggg cagcgctcgc tgctgccatc 3240121 cgagggtcag tagccgctcg ctgactaggc ggtcacgctc gtcgcgttcg gaggtcccgg 3240181 ttcccatgcg tagcccacgc acggccaaca ggacggtccc gtgctcgtcc agcacgtcga 3240241 ggtcggcctc accacctcgg gtcccgtcgt tgaaggcctt ggtcaaccgc gtgtagcagt 3240301 agcgggcatt gcgggtaggc ccgtaggcac gcaggctgcg cacacccaac ggcaacagca 3240361 ggccaccagt ggccgtaccg gcctggacgc ccgcgccgac cgactggaaa caagcgtcca 3240421 gcagcgccgg gtggattcgg taggcgccct gctggaaccg gatcgacgcg ggcagcgcga 3240481 cctcggccag caccgtcgcg gctcccgcct cggcggtatg cgcggtggtc agaccaccga 3240541 acgcggcgcc caaagtaaca ccacgctcgg cgaacgattc ccgcatggcg gtcccgttca 3240601 cggcgtgcgg atgcgcctgc agcagagcgg tgatgtcgta ccccggcggc gggcagtcat 3240661 cttcggcggc gcgcagcgcc gcggtggcat gccgggtggt ttcaccgtcc cggttggtct 3240721 ccacggtgaa gttgacgaca ccaggcgcgt cgatcgatgc gacggcgtcg atcggggtct 3240781 gctcgtcgag caacaacatc tgctcaaagg tgatgtcgcg aacctcagcc gcttcgccga 3240841 agacctcagc ggccgcagcc aaagccatct cgcagtaggc ggcgccggga agggcggcaa 3240901 cgttatgcac ctgatgatcg ctgagccagg acagcaccga ggtgccaacg tcgccctgcc 3240961 agacgtggcg ctcaggttcc tcagtcagcc gcacatgcga gccaagcaac ggatgcacgg 3241021 tgatggtgca ggcaccttgt gcccgctgtt cttgcccatc atcgtcgatg aataggcggg 3241081 cgtgggtcca cgccggcagc ggcgcatcca ccagccgccc agcgggatac agcgccgaat 3241141 agtccaaagc ggcgcccgcg cggtgcagct ccgtcagcaa gccgcgcaga ccatgcggca 3241201 gaggctgctc tcgccgcatg ccggccaggg cggcgaccga catgtcgagg cttcggcccg 3241261 tctgttcgac ggcgtgggta agcagcgggt ggggcgacag ctccgcgaag acccggtagc 3241321 cgtcctccat cgcagcctgc accgccgcgg cgaactgcac cgtgttgcgc agattgtcca 3241381 cccagtaagc gccatcgcac accggctgct cgcgcgggtc gaacagggtc gccgagtagt 3241441 acggcacctt gggcgtcatc ggagcaatgt ccgccagcgc cgcggccaaa tcgtcgagta 3241501 tcggatcgac ttgaggcgag tgcgacgcca cgtcgacggc cacctcgcgc gccatcacgt 3241561 cccgctgctc ccaacgggcg atgaggtcac gaacggtgtc gctcgtaccg ccgatcaccg 3241621 tggattgcgg ggacgccacc accgagacca caacatcgtc gattccgcgt gccatcagct 3241681 ccgaattcac ttgcttggcg ggcaattcca ccgagcccat ggcaccagca ccggctatgc 3241741 gggtcatcag cttcgagcgg cggcaaatga cgcgcgccgc gtcctcgagc gacagtgccc 3241801 ccgcgacgac ggccgcggcc gactcaccca tcgagtgtcc gacgaccgcg cccggccgca 3241861 ctccgtaggt ttgctccatg gtggcggcca acgcgacctg aacggcgaac actgccggct 3241921 gcactttgtc gattccggtc acggtctgct gcgccgttat cgcctcggtc accgagaatc 3241981 ccgattctgc ggcgatcacc ggctccagct tggcgatggt ggccgcgaac actggttcgc 3242041 tggcgagcaa ttgcgtgccc atcgccgccc actgcgaccc ttgcccggag aagacccaga 3242101 ccggtcctcg atcaccgtgt cccaccgccg cgtcatagag ggcgtcaccg tcggccacct 3242161 cgcgcaaacc ctcgacgagc tccggcaggt tggcggcaac caccgcggtg cgcaccggcc 3242221 ggtgcgcgcg gccacgcgcc agcgtgtagg ccagatccga ggccgccacg cagtcctggt 3242281 gttcttccac ccaggtggct agttggcggg ccgtctggcg cagtgcgtcg ctggacgtgg 3242341 acgacagcat gaatagccgc gggcccacct cagcgtcgcc cggtgaactc tcgggtgcgg 3242401 aagcttctgc tggggcctct tccacgatgg catgcacgtt ggtcccggac atcccgaacg 3242461 aggacaccgc gacccgcttc ggtgtgtgat cattaccgtt gggccacggc gtaaccgctt 3242521 gcggcacaaa gagcccggtc tcgacgtcgg aaagctcatc gggcagccga ttgaaatgca 3242581 gcagcggcgg caccaccccg tgccgcagtg acagaattgc cttgatcagc ccgacggtcc 3242641 ccgccgatgc cgtgctgtgc cccatgttgc tcttggccga tccaagcgcg cagggggtgc 3242701 ccgcgccata cacccgcgcc aggctgcggt actcaatcgg gtcgccgatt ggcgtaccgg 3242761 tgccgtgcgc ctcgaccaca ccgaccgttt cgggctgcac gcccgccgcc gccaacgccg 3242821 cacggtacac ggcaacctgg gcgtcctcgg acggcatggt gagcgtctcc gtgcggccgt 3242881 cctgattggt ggccgtgcca cgcaccacgg cgaagatccg attaccgtcg cgcagcgcat 3242941 ccggcagtcg cttcagcaac accatcgcgc agccctcgga acgcacaaac ccatccgcgt 3243001 cagcatcgaa tgaatggcac cgaccggttg acgacagcat gccctgcgca gacgccgcca 3243061 cacaggcatg cggctccagc agcaccgcac aaccgcccgc caaagcgagg tcagcttcgc 3243121 cgtcatgcag gctgcggcag gccaggtgca ccgccatcag acccgaagaa cacgcggtgt 3243181 caaacgtcat cgccggacca tgtagaccca atgtgtgcgc gatccgccct gacgccacac 3243241 tgttgttgag gccggtaacc acatatggac tggccaaacc gcccgccgtt gtggtgagta 3243301 ccaggtagtc ctcgtgggtc agcccagtaa aaacggccgt cgaggacccg gccaacgacg 3243361 ccggatccag accagcatgc tcgatcgcct cccacgacgt ttccagcagt agccgctgct 3243421 gcggatcgat cgaggtcgct tcccgctcgc taatcccgaa gaactcagca tcgaaaccgg 3243481 cgacgtcgtc aaggaaccca ccccaccggg acaccgaccg cccgggaacc cctggctcag 3243541 ggtcgtaata gtcgtcggcg tcccagcggt cgggcggaat ctcggtgacc aagtcatcac 3243601 cgcgcagcaa cgactcccac agtttgtcgg gcgagttgat ccccccagga agccgacatc 3243661 ccatcccgat caccgcaacg ggagtgacac gtgattccat actcttccaa cctcgtctca 3243721 gctcaaccgg tgttacccga cgacatcagc gaattttcac accgggaatg aaacggccgc 3243781 ggtgccgctc tcccagctct taagtaatcc gagccaaccc ggatcccgac accaaagaca 3243841 agtgttacac gacgccaaga ccccccgcgg gtagcgctgg aatactaaca cgagcacatg 3243901 tgctcgcgac cgagtctcac ctcggacctg ggcaaatgac cccatgtcgc aggtgcatgg 3243961 agttgttcgg gcagtctcgg cgaggttgca gggctgttcg accagcggat ttcgacactc 3244021 ggtaacgcaa gccagttagg ggcggtcatc ggtgatgctg cgccacgaag cactacatcc 3244081 gttgcaccgc aattattttc ggtgcccgca tgacgggcgc aatgccttaa ttgcgttagc 3244141 cggcgacccg ccgcgggggc ggcgccacat cacatccgac cgtgtccgat ggtggaccca 3244201 tggcgagccg gcaaacccct gctgagctgg ccagatgcga cttggctaag accgcggagc 3244261 gcgagcacac cccgacggcg actgcgacaa ctccaagcgt ggccggtaac gtgatgccca 3244321 tgattgtgcg ttcccttccc gctgcgttgc gcgcgtgtgc gcgtctgcaa ccccatgacc 3244381 cggccttcac gtttatggat tacgaacagg actgggacgg cgttgcgata accctgacgt 3244441 ggtcgcagct gtatcggcga acgctgaatg tggcacggga gctgagccgt tgtggttcca 3244501 cgggtgaccg cgtggtgatc tctgctccgc agggactcga gtacgtcgtc gcctttctcg 3244561 gcgcgttgca ggccggccgc atcgccgtgc cgctttcggt tccacaaggc ggcgttaccg 3244621 atgaacgttc cgattcggta ctgagtgatt cgtcgccggt ggccattctc actacatcgt 3244681 ctgccgtgga cgacgtcgtg caacatgttg cgcggcggcc cggggaatcc ccgccatcaa 3244741 ttatcgaagt tgatttgctc gatctggacg ctccgaatgg gtataccttc aaagaagacg 3244801 agtatccatc taccgcgtat ttgcaataca cctccgggtc cacccgcacg cccgctggcg 3244861 tggtgatgtc ccatcagaac gttcgggtta atttcgaaca gctgatgtct ggctactttg 3244921 cggataccga cgggattcca ccgccaaatt ccgcactcgt atcctggcta cccttctacc 3244981 acgacatggg tttggtaata ggaatttgcg caccaattct gggtggatac cccgcggtgc 3245041 tcaccagccc ggtgtcgttc ctgcagcgcc cggcccggtg gatgcacttg atggccagcg 3245101 attttcacgc cttttcggca gcaccgaatt tcgcctttga actagcggca cgaagaacaa 3245161 ccgacgacga catggccggg cgtgacctcg gcaacatact gaccatcctc agcggtagcg 3245221 agcgggtaca ggccgcgacg atcaagcgct tcgccgaccg ctttgctcgc ttcaatctgc 3245281 aggagagggt gatccggcct tcatacgggc tcgcagaagc aacggtgtac gtggcgacga 3245341 gcaaaccggg tcaaccaccg gagaccgtcg acttcgatac tgaaagttta tccgccggcc 3245401 atgcgaagcc gtgcgcaggc ggcggcgcta catcgttgat cagctacatg ttgccgcggt 3245461 caccgatcgt gcggatcgtc gactcggaca cctgcatcga atgtccggac ggaaccgtcg 3245521 gcgagatctg ggtgcacggc gacaacgtcg ctaatggcta ttggcaaaaa cccgacgaga 3245581 gtgagcgcac gttcggcgga aagattgtca ccccttcgcc gggcacaccc gaaggtcctt 3245641 ggctaagaac gggcgactca ggtttcgtca ccgatggcaa aatgttcatc atcggtcgga 3245701 tcaaagatct cctaattgtg tacggacgca accactcccc cgacgacatc gaggcaacga 3245761 tccaggagat cacccgcggg cgctgcgcgg cgatctcggt tcccggtgac cgcagcaccg 3245821 aaaagctggt cgccattatc gaactcaaga agcgtggcga ctcagatcag gacgcgatgg 3245881 ctagactggg cgctattaaa cgcgaagtca cgtcggcttt atcgagttcg cacggtctca 3245941 gcgtcgcgga tctggttctg gttgcgcctg gctcgatccc cattaccacc agcgggaagg 3246001 tcaggagagg ggcgtgtgtc gagcaatatc gacaggatca attcgcccgc ttggatgcct 3246061 agtccggctg gccgtctaca cagaattcgg tatatccgtt tgaaaaagtc ctccccggac 3246121 tgccgcgcca ccatcaccag cgggtcagcc gacggtcagc gaaggtcacc ccggctcacc 3246181 aacctgctcg tcgtcgccgc ctgggttgcc gcggcggtga tcgcaaatct gcttctcacg 3246241 ttcacgcaag cagaaccgca cgacaccagc ccggcgctgc tgccacaaga tgccaagaca 3246301 gccgccgcca ccagccggat tgcgcaggct ttccccggca ccggtagcaa cgctatcgcc 3246361 tatctcgtcg tggaaggcgg cagcacgctt gagccgcagg accagcctta ctacgacgcc 3246421 gccgtcggtg ccctgcgcgc cgacacccgc cacgtgggat ccgtcctcga ctggtggtca 3246481 gatcccgtca ccgccccgct gggaaccagc cccgacggcc gctccgctac ggccatggtg 3246541 tggctgcggg gcgaggcggg caccacccaa gctgccgaat ccctcgatgc cgtccgatcg 3246601 gtgctgcgcc agttaccgcc cagtgagggg cttcgcgcca gcatcgtggt cccggcaatc 3246661 accaacgaca tgccgatgca gataaccgcc tggcagagcg cgacgatcgt gaccgttgcg 3246721 gcggtgatcg ccgtcctact gctgctgcgg gcgcgcctgt cggtgcgggc cgcggcgatc 3246781 gtgctgctga ccgcggactt gtcgcttgcg gtggcctggc cgctggccgc ggtggtgcgg 3246841 ggacacgatt ggggaaccga ttcggtattt tcttggacgc tggccgcggt cctgacgatc 3246901 ggaaccatca ccgcagccac catgctggcc gcgcggctcg ggtccgacgc aggtcattcg 3246961 gccgcgccca cataccgcga cagcctgccc gcgttcgccc tgcccggggc gtgtgtcgcc 3247021 atattcaccg gcccgctgct gctggcccga accccagcgc tgcacggagt tggcactgcc 3247081 gggctaggtg tatttgtggc acttgcggct tcgttgacgg tgctgcctgc cctgatcgcg 3247141 cttgccggag cgtcacggca gttaccggca ccaaccacgg gtgccggctg gacaggccgg 3247201 ttgtcgctac ccgtctcttc tgcttcggcc ctgggcacag cggcagtgct ggcgatctgc 3247261 atgctaccca tcatcgggat gcggtggggt gtggccgaga acccgacaag gcaaggcggc 3247321 gcacaagtcc ttccggggaa tgcgcttccc gatgtggtgg tgatcaaatc cgctcgggac 3247381 ctgagggacc cagccgcgct catcgccatc aaccaggtca gccaccgtct ggtggaggtt 3247441 cccggtgtgc gcaaggtgga gtcggcggca tggccggccg gtgtcccgtg gaccgacgcc 3247501 tcgctcagtt ccgcggccgg caggctcgcc gaccagctgg gtcagcaggc tggatcgttc 3247561 gtgccggcgg tgactgcgat caaatcgatg aagtccataa tcgaacagat gagcggcgcg 3247621 gtcgaccaac tggacagcac cgtgaacgtg actctcgccg gggcaaggca agcacagcaa 3247681 tacctcgatc ccatgctcgc cgccgcgcgg aacctcaaaa acaaaaccac cgaactgtcg 3247741 gaatacctgg aaacgatcca cacctggatt gtcggcttca caaactgccc cgacgacgtc 3247801 ctgtgcacgg ccatgcgcaa ggtcattgaa ccctacgaca tcgtggtcac cggcatgaac 3247861 gagctgtcca ctggcgccga ccgcatctcc gcgatatcga cacagacaat gagcgcgttg 3247921 tcctcggcac cgcggatggt ggcgcagatg cggtcggcgc tagcacaggt gcgctcgttc 3247981 gtacccaagc tggaaacaac catccaggac gccatgccgc aaatagcgca ggcgtcggcg 3248041 atgctgaaga atctcagcgc cgatttcgcc gataccggtg agggcggctt ccacctgtcc 3248101 aggaaggacc tggcggaccc gtcgtaccgg cacgtacggg aatcgatgtt ctcgtcagac 3248161 ggaaccgcca cccggctgtt cctctattct gacggacaac tggaccttgc tgcggcagca 3248221 cgcgcgcagc agctcgagat cgccgcgggc aaggcgatga aatacggaag cctggtcgac 3248281 agccaggtca cggtgggtgg ggccgcgcaa atagccgcgg ctgtccgcga tgccctcatc 3248341 cacgatgctg tgctactggc cgttatcttg ctcacggtag tggctctggc cagcatgtgg 3248401 cgcggtgccg tccacggtgc tgcggttggc gtgggtgtgc tggcctctta cctcgccgcc 3248461 ctgggggtct cgattgcact gtggcaacac ctactggatc gcgagctcaa cgccttggtc 3248521 ccgctggtgt cgttcgccgt cctcgcttcg tgcggcgtcc cgtatctcgt tgccggcatc 3248581 aaagccggtc gtatcgccga cgaggcaacg ggtgcgcggt ccaagggggc ggtatccggg 3248641 cggggagcgg ttgcgccgct tgcggcgctc ggtggcgtat tcggcgctgg cctggtgctg 3248701 gtgtcgggag gttccttcag cgtgctcagt cagattggca cggttgttgt gctcggtctg 3248761 ggcgtgctga tcacggtgca gcgagcgtgg cttccgacca cgccagggcg gcgttgaccg 3248821 cctgttcgag accccatgcc acgctcggct ggccgacgac gatcacccat cgcagacacc 3248881 acacttggta ggggttgcca gttgttggcc gggtgagtgg tcggcgcgcc gttgcccggg 3248941 gtagggttcg aggtctttgg atgatgggcg tttccacgct gcccaaagga tgacctcgac 3249001 gtgtccgagt tcacgttgac cgcgtgaagt taaaccggtg ccgagcgtgc actgagggcg 3249061 aaatccggcg ccgattttcc gccctgagtt cacgttgggc gacggcgccc atgaacgacg 3249121 ccacatcgca catggcgctc aggccaagca ccagcccatc tccgtcgccg gccaccgtca 3249181 ccgatcgaac gacctcgacc cccgccctgg caacaacacg ccgctgccct ctacacctcc 3249241 gcgctgtcga aaattgtcac ggagccttgc gggggctggt gcgactgata tgacgcacct 3249301 tccgccagag gctagcccga cgtttactga cgttactgct gcttaccgtt tgtcgacggc 3249361 acgtgaaaac tgaccccggc gcggcacccg aattttgacc ccctggtcgg gtggactggc 3249421 tctacccgag ccaggaggac cgaagggaat gttgactgtg gaagattggg ctgagattcg 3249481 ccgattgcat cgcgcggagg gtttgccgat caagatgatc gcccgggtgc tggggatttc 3249541 caagaacacg gtgaagtcag cgttggaatc aaaccagcag ccgaaatatg aacgggcacc 3249601 gcagggttcg atcgttgatg cggttgagcc gcggatccgg gagttgttgc aggcctatcc 3249661 gacgatgccg gcgacggtga tcgccgagcg gatcggctgg gagcgctcga ttcgggtgct 3249721 ctcggcgcgg gtggccgagc tgcgcccggt gtatctgccg ccggacccgg cgtcgcgcac 3249781 cacgtatgtg gcaggcgaaa ttgcccagtg cgacttctgg tttccgccga tcgagttgcc 3249841 ggtagggttc gggcagaccc gcacggccaa acagttgccg gtgctgacca tggtgtgcgc 3249901 ctattcgcgc tggctgttgg cgatgctgct gcccagcagg tgtgccgagg acctgttcgc 3249961 cggctggtgg cggctgatcg aggcgttggg ggcggtgccg cgggtgttgg tgtgggatgg 3250021 cgagggcgcg atcgggcgct ggcgcggcgg gcggtcggag ttgaccactg agtgtcaggc 3250081 gttccgcggc acgctggcgg ccaaggtgct catctgccgg ccggccgacc cggaggccaa 3250141 gggcctcatt gaacgggccc acgactacct ggagcgctcg tttttgcccg ggcgggtgtt 3250201 tgcctcgccg gccgatttca acgcccaact gggcgcctgg ctggcgctgg tgaacacccg 3250261 cacccgccgg gcgctgggtt gtgcgcccac cgatcgcatc ggcgcggatc gggccgcgat 3250321 gctgagcttg ccgccggtgg cgccggccac cgggtggtgc acctcgctgc ggctgccccg 3250381 ggatcactat gtgcgctgcg attccaacga ctactcggtg cacccgggtg tgatcgggca 3250441 tcgggtgctg gtgcgcgccg acctggagcg ggtgcatgtg ttctgcgacg gtgagctggt 3250501 cgccgaccac gagcggatct gggcggtcca tcagacggtc tccgatcccg cacatgtgga 3250561 ggcggcgaag gtgttgcgcc gccggcactt cagtgcagca tcaccggtag ttgagccgca 3250621 ggtgcaggtc cgctcactga gcgactacga tgacgcgctg ggagtcgaca tcgatggcgg 3250681 ggtggcctga tgcccaccac caaagccacc cagcgccgtg atgtttccac cgagatcgct 3250741 tacctgacaa gagcattgaa agctcccacc ctgcgtgagt cagtgtcccg gctggccgat 3250801 cgcgcccgcg ccgagaactg gagccacgaa gaatacctgg ccgcctgcct gcagcgggaa 3250861 gtgtcagccc gggagtccca tggtggtgag ggccgcatcc gcgccgcccg cttcccggct 3250921 cggaagtcgt tggaagagtt cgactttgag catgctcgtg gcctcaaacg cgacaccatc 3250981 gcacatctgg gcaccctgga tttcatcacc gcccgcgata acgtcgtgtt tttgggcccc 3251041 gcctggcacc gggaagactc atcttgcggt cggcctggcg atacgcgcgt gtcaggccgg 3251101 tcatcgggtg ctgttcgcca ccgccgccga atgggtagca cggctcgccg aggctcacca 3251161 cgccgggcgc atctacgccg aactcacccg gctttgccgc tatccgctcc tggtggttga 3251221 cgaagtcggc tacattccgt ttgagcccga ggccgccaac ctcttcttcc agctggtgtc 3251281 ctcccggtat gagcgggcca gcttgatcgt cacgtccaat aaggccttcg gccggtgggg 3251341 cgaggttttc ggcggcgacg acgtcgttgc tgccgccatg atcgaccgcc tcgtccacca 3251401 tgctgaagtc gtcgccctca aaggcgacag ctaccggctc aaagaccgcg acctcggccg 3251461 cgtcccacca gccggaacca ccgaagaata accaccaacc gcccggtcta gggggtcaat 3251521 tttcagatgc cgtcaggggg tcagttttcg ggtgccgttg acaccgttca caagggcgtt 3251581 tcgagcaacg cgtcgacgca acttcggcct agtcgacgtt gacgggttcg ttccatttcg 3251641 actgcgtgag ctgaatcgac ccggatccga ggtcgatgct cgctcggacg aggtggtgcg 3251701 agccgtcctg ggcaatccac acggtcgccg gccttgcact cttggcgcca ggatcaagca 3251761 tcttgacaga gctcgcgggg atggtcccgg tgattttggt ggtcgaaatt ccgtctatca 3251821 cttcggtacc ttgcgcttgg aggttcgtga caccggacag cagctgcgtc accccagcgg 3251881 caggatcgag cacgcgtgaa gttgacagtt cagaaatcga gccgagattg ctccagtcgt 3251941 cgaacagttt caccgagatg ttgtcgcctt gtacccgaaa cgggacaccc tgctcgtcgt 3252001 tgtaggtgca tacgcccttt gccgcgagcg gattggcccg gacgtcgaca tcggcactgg 3252061 taatacccag caagctgtcg actttcccgg ttgttcggac cgctacgtgc acgctggtca 3252121 acccttttgt cgcatcaagc gactgcctga tctcggcgag gagcgcgggg tcggacgccg 3252181 tcgggctcac gggaacaccc tgttcctcgg catcaggttt cggcgaagaa catcctgata 3252241 gccacaacgc caggcaggca cctagcacca ccagaacagc ggacgtcacc gcccgttttc 3252301 catcattcat ttgcgctcac tacctcgatt gtcaaatggg cccgcaggcc gaatgcaggt 3252361 tgattggatc acgctgggca tgactgcccg cctcctcact cgcgccattc cggcgctcgc 3252421 cgtcgccggg tcccgccaaa ttgcccgcct cctcactcgc gccattccgg cgctcgccgt 3252481 cgccgggtcc cgccaaattg cccgcctcct cactcgcgcc attccggcgc tcgccgtcgc 3252541 cgggccccgc caaattgccc gcctcctcac tcgcgccatt ccggcgctcg ccgtcgccgg 3252601 gctaggcatg gaccgatact tccgcggcgg cgggttcgac aacctgcgac gtcggatcac 3252661 cggattccgt tgggcggctg ccagacattt gctgggcgac atactcggcg accgcagtgg 3252721 gagtgggatg atcgaaaatc acggtaggtg gcagcgtcag tccggtggcg gttttgaggc 3252781 ggttgcgtaa ctccacagcc gttaatgagt cgaaaccgag gtcgccgaat tcggtgtcgg 3252841 ggtcgacgtc ctcggcggag ggcctaccca gcactgccgc tgcctgcaga cacaccagcc 3252901 ccactagcag ctcgagttgt tcgtccgcgg ccagcccgtg taggcgttga gccagcgccg 3252961 acttcgacga ggtggcgtca ccggtgtcgt cgatttggcg tcggcgtggg cggcgcgcga 3253021 gcccgctgaa cagcgccggc aacgcaccgg cctgggcccg ggcgtctagt gcagcccggt 3253081 ccaagagcgt ggccaccgcc agagggtgat cgatggccag cgcagcgtca aacaattcca 3253141 ccgcttcggc agggctcatc ggagccagcc cgctgcggct catgcgggcc agatctcggc 3253201 tgctcaaatg cgcggtcatg ccgccaggct gttcccacaa accccacgcc agtgatatcc 3253261 cggccaaccc tgcggcctgc cggtgagcgg ccaacccgtc cagaaacgcg tttgccgccg 3253321 agtagttgcc ctgccccggc gagccgaccg tggccgcgat cgatgagcac agcacaaaca 3253381 tcgacaaatc caggtcactg gtggcctggt gcaggttcca cgccgcgtcc accttggccc 3253441 gcaacaccgt atcgatgcgg tccggtgtca acgaggtgat cactgcgtca tcgagcacgc 3253501 cggcggcatg aatcaccccg cgcaccggcg ggtactcccg cgacagctgg gcaaacaacc 3253561 ccgctaccgc agcgcgatcg gccacgtcac aggccaccac ctgccccttg gcgccggcct 3253621 ccgtcaagtc ggcggccaat tcggccgctc cctccgcgcg atcgccccgc cgactggcca 3253681 acaccagatg acgcacccca taggcgccaa ccaggtggcg ggccaacacc ccaccaaccg 3253741 ccccggtggc accggtgatc accaccgtgc cgtcggcaag ccggtcggcc aacgccgagg 3253801 gcatggttaa gacaaccttg ccgatatggc gggcctggct catgaaccgg aaggccgccg 3253861 gggcgcagcg cacatcccac gtggtgaccg gtagccggtg cagctcccgg gtgtcgaaca 3253921 gctcccgcac ctcggccaac atctcctgca tgcgtgccgg gccggcctcc gacaggtcga 3253981 acgcccgata ctgcacgccg ggataattag cggcgatctc ctgcgcatcg cggatatccg 3254041 tcttgcccat ctcgaggaaa cgcccaccgc ggaccagtaa gcgcagcgac gcatccacga 3254101 actcaccggc cagcgagtcg agcaccacat caaccccgcg gccctcggtg accgccagga 3254161 acttctcctc gaactcgcat gtgcgggaat cgccgatatg gtcgtcgtca aaccccatgg 3254221 cgcgcagcgt gtcccacttg ccacggctgg cggtgacgaa aacctccacg ccccactggc 3254281 gagccagctg cacagccgcc atgcccacac cgccggtacc ggcatggatc agcaccgatt 3254341 cgcccgcctt gatctcggct aaatcggcca acccgtacca ggccgtcaag aacaccaccg 3254401 gcacagcggc tgcctgagca aacgaccagc cttgcggcac ccgggtaacc agttgctgat 3254461 ccaccaccgc cagcggaccg gccccgccca ggaatcccat cacggcgtca ccgacggcaa 3254521 gatcggtcac ttcgggaccg gtctcaagca ccaccccggc gccttcggca cccagcggtg 3254581 gagcctggcc gggatacatc cctagggcgg ccaccacatc gcggaagttg accccgacgg 3254641 ccgccaccgc cacgcgcacc tgccccgcct gtagcggtgc ctgtacctcc gggcagggct 3254701 ggatcaccaa atcctccagg gtcccgccac caccggcggc caatcgccac gccgactctg 3254761 ccgccggtaa cgctagcaac gccggggccg gggacagccg gggggcgtgc acagtgccgc 3254821 cgcgcaccag cagctggggt tccccgacgc cggctagcac cgaggcatcc accgccgcat 3254881 cggtgtcgat caacacgatc cggccgggat tttcggcctg cgcggaacgc gccatgcccc 3254941 acaccgcggc ggcggccagg tcgctgatgt cctcgccagc cagccccacg ccaccatggg 3255001 tcaacaccac caacgtggcc gcccgatccg cgccgagcca ggactgcaac acctccaggg 3255061 cggtgtgggt ggccgcatac accgagccca ccaccgagga tgcttggcca ccggcagact 3255121 cgagttccca caccacgaca ctggcgtcac catcactgcc ggcgcaaaag tccgcccaag 3255181 acaccggggc aggtggggcg gacccgttag cgccgccgct gaccaccgag atcggcgacc 3255241 acaccacttc cagcggcccc tgatcggacg caccgccggc cgcggtcacg gcggcgcgca 3255301 gctgttctgc ggttatcggg cgagtaacca gcgagcgcac cgtcaacacc ggcagcccag 3255361 tggcgtcgca gacgtccacg gaaatcgcat ccgcgcccgc ggacgcgaag cgggcccgca 3255421 cccgtccagc gccgccggca tgcagcgaca ccccacgcca gcaaaacggc agtctcgtct 3255481 cggtgctcgc ctgggtcttc tcgacggcca gcccgagggc atgcagcacc gcgtccaaca 3255541 ccgccggatg catccccatt cggtcgacgg ccacgccggc ctcgccgggg gctacaactt 3255601 cggcgaacag ctccgacccc cgccgccaga tcgccaccag accctgaaac gcggggccgt 3255661 aggcataacc gcgctcggcc aactgcgcat agccgtccga gatatccaca ctctccgcgc 3255721 cctcgggcgg ccacacggac aaatccatcg gcgtctcagc ggcagccacc cccagcatgc 3255781 cttcggcgtt cagcaaccaa ccctgggatt gatcaccgcg ggaatacacc gacaccgcac 3255841 ggtgcccgga ttcatcggca gccccgacga ccacctgcac ctgaaccccg acacccgggt 3255901 gcatcaccaa cggtgcggcc agcaccaact cttcgatgag cgcgcacccg acctcatcac 3255961 cggcgcggat caccaactcc acaaaacccg ccccggggaa cagcaccacc ccgttcacca 3256021 cgtggtcggc cagccacggc tgatccgcaa gcgacaaccg gccggtcagc accacctcgt 3256081 cagaatcggg ccgctcgacc accgcaccca acaaggcatg ctcggtcgcg cccagaccca 3256141 acccggccgc atcggcgggc ccatccgcgc ccggcgtctc ccaaaaccgc cgtcgctgaa 3256201 acgcatacgt gggcagctgc acccgccgtc cacccgagcc ggcgaacacc gccgaccact 3256261 gcaccggcac accggtggtg aacacctgac cggcagcacc gagcgccgag gccagctcgg 3256321 gccggtcttt gcccagcatc gacaccacca tcgcctcagc cggggccaag gactgctcga 3256381 tcgagccagt caaaccactt cccgggccgg cctcgatgaa gtgggtcgcc ccaagggtct 3256441 gcaaatgacg cgcactgtcc gcgaagcgca ccggccgacg aacgtggtcc acccagtact 3256501 gcgccgaccc gaaatcaggg ccggccaact cgcccgtcac gttcgacacc agcccaagct 3256561 ggggctcgcg tgcctgcacc cgggccgcga cacgcgcgaa ctcctcgagc atcggctcca 3256621 tcaacggcga atgaaacgca tgcgagaccg ccaactggtg cacccgccga ccctgcgcgg 3256681 cgaaccgatc cgcaatcgca tttgccgcgg cctgcgcacc ggagatcacc accgattcgg 3256741 gcgcgttgat cgcagcgatc cccacaccct cacccagcag cggctccacc tcgtcctcac 3256801 tggcagccac cgccaccatc gcaccgcctg ccggcagcgc ctgcatcaac cggccccgcg 3256861 ccaccaccag catcgccgcg tccgccaacg tcaacacacc ggccgcgtgc gccgccgcca 3256921 gctctccaac ggagtgaccc atgacgaagt ccggaagcac accccaatcc cgcaacaccg 3256981 cgaacgatgc cacctccacc gcgaacaacg cgggctgagc aaattcggtg ctgtcaagca 3257041 aatccgcatc ggcaccccaa ataacgtcgc gcagcggcaa ccgcagatgc cggtccaact 3257101 cgtcggccac cgcatcgaat gcctgcgcaa acacgggcaa ctcgccgtac aactcgcggc 3257161 ccatcccgat gcgctgcgcg ccctgcccag gaaacacgac caccgtcttg cccaccgacc 3257221 ctggctgacc gaccgccacg ccggcacccg gctcgcccgc cgcgagccca gccagcccgg 3257281 caatcagttg ctcacggctt gcgccgacca ccaccgctcg gtgctcaaac accgagcgac 3257341 tggccaacga gcaccccaca tcgatcggat ccagccctgg gttggcctgc acgtgggcca 3257401 taagtcgacc cgcctgcgcc gtcaacgcct cagccgatct cgccgaaatc acccacggca 3257461 ccatcgacgg ccgcggcccc ccggtgcttt cgctcgcctc aaccggcgcc tctgcggggg 3257521 ctggtacggg ggcctcttcc aagatcagat gcgcgttggt gccgctgatc ccaaaggagg 3257581 acaccgccgc ccggcgcgga cgcccgtcaa ccgaccactc cctggcctcg gtcaacaccg 3257641 acaccgcgcc gctggtccaa tccacccgcg gggaaggctc atccacatgc aacgtcgccg 3257701 gcatcacccc atgacgcatc gcctgcacca tcttgatcac cccggcgacc cccgcggcgg 3257761 cctgggtgtg gcccatgttc gacttgattg agcccaccca cagcggctgc tccgctggac 3257821 gtccctgccc gtaggtggac agcaatgcct gcgcttcgat gggatcaccc aacgtggtgg 3257881 cggtcccgtg tgcctccacc acgtctacgt ctgcggcgga caacccggcg ttggccaacg 3257941 ccgcctggat cactcgctgc tgggcgagcc cattgggcgc ggtcagccca ttggacgcac 3258001 catcctggtt gaccgcgctc ccccgcacca ccgccagcac cgaatgcccc aaccgccggg 3258061 cgtccgatag ccgctccagc acaaccaccc cggcgccctc gccccacccg gtgccgtcgg 3258121 ccgcggccgc aaacgcctta catcgcccat cggcagccaa cccccgctgc cgggaaaacc 3258181 ccacaaaaat cgacggcagc cccatcaccg tcaccccacc ggccaacgcc aaatcacact 3258241 ccccggagcg caatgacgac atcgcccaat ggatcgccac caacgacgac gaacaagcgg 3258301 tatccactga caccgccggg ccctgcagcc ccaatacgta cgacacacgt cccgaggcca 3258361 cgctgattga cgtgccggtc aacccgtacc cttgcagccc cccggtatcc ctattgccgt 3258421 aactcgccgc gaaaatgccg gtgtacaccc cggtcgccga accacgcaac gacaacgggt 3258481 caatccccgc gtgctccaac gcctcccacg aaacctccag catcaaccgc tgctgaggat 3258541 ccatcgccaa cacttcacta ggagcgatgc cgaagaaccc ggcgtcaaag ccggtggcgt 3258601 cgtctagaaa tgccccccat cgcgtgtagg ttttgccctc agcgtcggga tccggatcgt 3258661 atagcccctc aacatcccag ccccgatcgg tcggaaactc cgacaccacg tcgcgccccg 3258721 ccgaaacgac atcccagagt ccgtccgggc catccacgcc gcccggaaat cggcagccga 3258781 ttcccaccac cgccaccggt tctgtcgcgc gttgctcata ttcacgcagc cgagcgcgtg 3258841 tctcatcgag ctcgacagca accttcttta ggtagtgaaa aagcttttcg ctctgctggt 3258901 cggcaccttc aacgctcatc gtccgttgct cctctatcac ttcccaagtt cggaatcgat 3258961 tagctggaaa atttcgtcag gagtcgaagc agcctggatc agcttgccca ggcccgcctc 3259021 gctgccggcg atggtgccca gcagggcacg caaacggtcg gccacccgct gcttctcgcc 3259081 gtcggcgatg acggccacca gctcttcgac cttgttcaac tgctcttcga ttgcccaaag 3259141 acccgtcgcg ccgctattca ccggaccggc tgatttcaat cgaccatgcc cgccggccag 3259201 ttcggcctcc aaatactggg ctaatcccga tatcgacccg tagtcccacc caaccgtctc 3259261 gggtaaccgc aggccggtaa ctgccgccaa tcgcttgcac agcgtgactg tcatttgcga 3259321 gtcaaaaccc agctccgaga aggcgagatc ctgatcgacc gaccaaggat ctggctcacc 3259381 taacatcttc gcggcctcgg cgcatacggc atccaccacc agccgctgac gttcttgccg 3259441 caaagcgacc aaccgctcgc gaagagtcgc cccgccgtcg tttcctccgg cgatcgtcat 3259501 gttggacgcc gacaggtcat cacgctgcgc ccgcacgccg gacccaggtt cagtcaacga 3259561 gagttcccaa atcggtttgg ttggactctg cttgcgcagg gcgccacgca ccaatttccc 3259621 gttcggggtt cgagggagtc gatcaacaac ggcaaaccta tgcggcacct tgaacgcaga 3259681 caatcggttg agcaatccgc ggtgaaggtc tcgcatgacc gacccatcga tggtggcacc 3259741 gctggtcgca accagaaaag cctgcagtgt cgacgcgccc gtggactccc ttaccgcgac 3259801 aaccgcggcc tcagccacgg cttcgtcctc gatgatgagt cgctcgacct cacgcggatc 3259861 aacgttgacc cctccgataa cctcggtgtc gtcggcgcgg cagcggtagg taacccaccc 3259921 gtcgctgtcg atacacaccc tgtcccgcgt gtcgagccaa ccctcattcg cgacggggga 3259981 atcaggccga ttccaatagc ccttagcgat cgccggtccg cggacccata ggtcgccctc 3260041 aaccccaggc ccggcagttg ttccatccgg cgctacaaca cgaatctcgt agggcggcag 3260101 cacccttccc agcgtcccca ggcgccattc gtcaacccga ttcgatacga acgtctgccc 3260161 gacctccgta gatccaatac cgtccagaat ggggatgccg ccaaagaatt ccatgagccg 3260221 ctcggcaaga cccagctcaa gggcctcccc ggctgacacc acacatcgaa gcgaacggaa 3260281 ggaatcagga gaacatgagt cgatgactct ggcaaagaaa tttggcacac cgtagagcac 3260341 cgatggccca aatcgcgcgc ttagaatggc cgctgcttct ggagttaccg gcgccgaatt 3260401 gatgaccgcg gaaccacctg tcgcgagtgg aaaccagacc gaatttccta ggccgtaagc 3260461 aaaatacatg cgtgcactac atagcccagt atcttcagga gtgagccgca aggctttacg 3260521 acacatagcg tccacgaacg tcaacgggtc ggcgtgccga tgaatcgccg ccttcggcgg 3260581 acccgtggta ccagacgtat acgtagcgta tgcgagtgcg tcaccaccca tcggttcgta 3260641 gcctccaggc gcgactcgag ccgcctcgga catgagttcc gcggcttcgg ccacccgcga 3260701 cggctgaaac cgatcgcgca gcgcatccga ggtgacgaca agcgccggtt ccgtgttgcg 3260761 tgcggccaac gcgtggtcgt cgcgatgcag ctccggattc gctagaaacg ccataacccc 3260821 acgagccagg cacgccagca atagctgcac caggtcgggc gaatccggca ggcacaacag 3260881 aacccgatca ccactggata gtccgcggtt tctcagcact tccccaagac gtgcggcacc 3260941 gtcgtggatt tgaccatgag tcaccacatc ggccgcatag aaggccggcc ggtcgtacca 3261001 tcccgcctcc gatgcctgct cagccaggag ccccgctaga ttcccattcc gcatttattg 3261061 gatgaccgcc ctagcgcgcc agagtgatgg catttgaaaa ctgccagcga tcaggttctt 3261121 catgcggagc atctcgaaat acgcttcgta gaaagtgctc cgtcaccacc atgatcggct 3261181 ggcctccgga aataacgcga taacggcggg caacggctct ctttcgggaa ttctgatacc 3261241 cgtggagcgc gagccaacct ggcaaatctc cgacccagac tttagcctct tccttgaagg 3261301 tctcgatgtg gctggctgcc ataacctcgc cgagaggatc gtttgtctgc gtcaaccttg 3261361 ttataattgc agccggcaac cgatcaatcg caatcaacga ctccgcggct acaaataagt 3261421 gttccgagtt ccgaccttta agtatgatgt accgctgcag aacacggccg acccccacct 3261481 gccctagctg ctcgaattcg gatagcttcg gtgaaacatc gtgaatccgc tgcttgacga 3261541 tttgcacgat cacctcatcg tcagcgacaa tgttgagaac cctagtgagg gtgccattag 3261601 ctgctatcag tattcgaagg tcacgattga gctttcggat ctcttgatca gatagaaaac 3261661 actcggtcat attcttcccc cacatacatg cgctgttatg ccatagcatc taggcggctg 3261721 aattcgtgat gtaggtaacg ctcaacgctg gccgaacgcc gaaccttacc gctggtggtg 3261781 accggaatag aacccggcgc caccataacg acatccgcga cgcgcagacg atgtgacctg 3261841 gatatcgcgg aggcgacttc acgtttgacg gtgcggagtc gattcttttc ctcctcatct 3261901 gtgcgacccc gcttcatgag ttcgataatg gttaccagct tttcagtacg gtcatcgggc 3261961 accgcaatcg ccacaacccg gccgccggtg atttcctgga tcgtcgcctc gatgtcttcc 3262021 ggatagtggt tggccccatc caccaccaac agctccttga tgcgacccgt gatgaacagt 3262081 tcgccctcga aaatgacgcc gaggtctccg gtccgcagcc acggaccttc cgaagtcccg 3262141 ggcgagggag tgacgagccg cgcgcggaac gtcgcctccg tctgctgcgg gttgcgccag 3262201 tagcccaagc cgacgttgtc tccctgcacc cagatttcgc caaccgtccc cgcgggattc 3262261 tccatcctgg tttcggggtc gacgatccgc acggttgacg cccggggagc tccataactc 3262321 accaggttgg ctccctcgct gccgttctcg gtacgcttcg cctgaccgac cgacagctgc 3262381 tggtagtcaa agcaaacact cttcggcgcg cgtcccggtc cggcggtcgc cacgtacacc 3262441 gtcgcctccg cgagcccata tgacggccgg attgccgtct cgctgaggtt gaacggggcg 3262501 aaccgctcgg tgaagcgccg cagcgtcgcg acgtttactc gttcggcgcc ggtgacgatc 3262561 gtccgcacat gcccgaggtc aagtccagcc atatcgtcgt cggatgttct gcgtaccgcc 3262621 aattcgaaac cgaaattcgg tgcgctggaa atctgtgcgc ggtgtttggc taataattgc 3262681 atccaacggg ccggccgctg caagaatgcc atgggactca tcaacaccgc ggtgtcttga 3262741 ttgatcatcg ggagaatgat gcccagcatc aaccccatgt cgtgatagaa cggcagccac 3262801 gatacgggag ttgacggaac cttttccgaa tccccgatgt aatcggacat tagctgtacg 3262861 cagttggtga tgacattctt gtgcgagagg acaacaccgg ccggcgcgcg ggtcgaaccg 3262921 gatgtgtact gtagatatgc tgtgctcgga cgctcgaacc gagtcggatc gagcgctctg 3262981 gatgagctca agtccagagc gtccacagcc acgacgatgg gcgcggactg gccctgtgcg 3263041 gcgcacgcat gtggcgcata tgtcgtgacc tcgtcaataa ccgacgaggt cgtaagaata 3263101 atggacggcg cagagtctcg taatgccgaa gatattcgtt cgtcgtgaat gccgaattgt 3263161 ggcaccggaa gaggaaccgc aatgagacca gcctgcagca cacccataaa ggcgatgatg 3263221 tattcaaggc cctgcggggc caatatcgcg acccgatcac cgcttgacgc gtatatccag 3263281 agctcctctg ccacgatcat cgctcgccgg tggacttgcc accacgtcac ggtttcggtg 3263341 aagccagccg gatccgtgtc atagtcaatg aacttgtacg ccgcgcgatt ggggtactgg 3263401 ctcgccgcct tctgtaggag atcagcgagc gacgactcgc tcatagcgaa tcgcgatgtg 3263461 ctcccgttca gcggttgtgc cgcttgctca ccggtgcccc atgccggttg tgtcgccacc 3263521 tcgcccgcgg catgaaatga cgagttggtt ttcatggtct tccttcagct atggacggca 3263581 gagagcagac ggctgcgctg ccgctttcat acgaatccga gtcggcgcat agcgtctgta 3263641 ccttgcccgg gctcgcgacg cgattcgtta aggtctcacg accatagcag gtacgggcca 3263701 cacccgaggg cctaatggga ttgacggaat cgtcagccgc ggcgtcagcg ctggctgcag 3263761 ccccattcgc gaaacacacc gttgggctgg cttcgctaag cctaatgagc accgtcttcg 3263821 atgtcgacct gctgatcgtg ccggagcgaa ttcgccagcg ccgtgcgatt cgttcgatcc 3263881 ggtcgcgacg ggtggcaccc gcagagcgct gtcagacccc acaggtcaca gttcagagac 3263941 cgcaaccgac tgatcccgcc ggtacaccgt gcccccacca acacgaatca cggcaagccc 3264001 gttgcggcga ggccgaaccg agtaaccgct gatcaatggc ctgacctcaa gtcagctgaa 3264061 cgtgcgcacg gctgacctgt gggcactctg ggaaattcac atcgagttcc aagctcgaca 3264121 cgccgaaatc gcctgccgca cgccgcgatt ggcacgccag ccgctgggcc ggcttacccc 3264181 atcttcgcga gagtggcgca aatcatagct tcttgagccc gcgcaaaacc ttggcgtgcg 3264241 gcaggacagc cgtcaccgtc ttgcgcaggc tggggttaac caaactgccg ttgatcagca 3264301 caacgtagcg cagcccgtga tctcgccatt ccgctacttg atcgatgact tcgtcagggg 3264361 ttccactgaa gacgacttct ttcataagcg cagccgggac cttggccgcg taggacaaaa 3264421 ccgtctgttt gtccatggtt tgcgggatga tgtcctgcac accggagaag tcggctccca 3264481 ttggatgctc gacgccgtga cgcgcccagg cttccccagg taccccgagc gcggtcatct 3264541 tcacaacgac agattccagc gcctcttcca cgtcgtcgcg attccgtcca gtgatgatgc 3264601 cgcgcaccgc cgccggagta atcgacattg ggtcgcgtcc ggcatcggac gccgcgctgc 3264661 gcaccgcttc gagtgcgcga ctgtagtcgc tgggacgaac cacaacaatg ggaatccagg 3264721 catcggcgta acgtccggtg gcccgtaaca tccgcggccc gtgggccgcg acccagattt 3264781 cgggccattt cccacggtat ggcggaaggt cgaacaaggc gttatgtaac ggaaagtatg 3264841 gcgattcacg tgagataagc tccccgtttg aattccacaa cgcgcgaatg gtggccaggg 3264901 cttcttcgaa ccgcgccacc ggtttggtcc actccacacc gtagggctcg ttgccttcac 3264961 gttccccgac accgataccc aatatggctc ggcctcgggt aagcaagtgc aaagtcgcgg 3265021 cagcctgggc tgtgaccgct ggattgcgcc gacctgcatc ggtcacgcac acgcccagtc 3265081 gcagacggct gggcaacccg aaggcgaggt ttccaagcat cgtccacggt tcgtaattgg 3265141 catcgatctt gggcacgaat ttcgccgcaa ttccgagata ttcggaagtc gcaatcgagc 3265201 gcggcaccag cgcattcaga tggtcgccga cccaatacga gtcggcgccc atcacggtgg 3265261 cggccgccat gctagaccgt gccggcaggg tcggcggcaa ccgcgagtgc acgagggcat 3265321 caacaaaacc gaaacgaagt ccgcccacgc ctaccccttg tctactacgc tgttgaccaa 3265381 cgtcatggct aggaacgcta cctcagcgag tcatgtccgc gcggtcgcgc gttgagcaac 3265441 accggggtcg gatgtcatgg catcgaccgc gggctcggtg ttgcgccaac ttctcttctg 3265501 acgcatcgct cgtacatact gtctgccata ctccttgccc atggccttca gtcgaaccca 3265561 cagcctcctc gcccgcgcgg gcagtacctc gacctacaag agagtttggc ggtactggta 3265621 cccgttgatg acgcgcggac tcggtaacga cgaaatcgtg ttcatcaact gggcctatga 3265681 ggaagatccg ccgatggacc tgccactgga ggcatccgac gagcccaacc gagcccacat 3265741 caacctgtac caccgcaccg cgacccaggt cgatctgggc ggcaagcagg tgctggaggt 3265801 cagttgcgga cacggcggcg gagcctctta cctcacacgc acgttgcacc cggcctccta 3265861 caccggcctg gacttgaacc aggcgggaat caagttgtgc aagaaacgac accggctgcc 3265921 tggtttggac ttcgtgcgag gtgacgccga aaacctgccc ttcgacgacg aatccttcga 3265981 tgttgtgctc aatgtcgaag cctcgcactg ttacccgcac tttcggcgtt tcctcgccga 3266041 ggtggttcgc gtgctgcgcc caggagggta cttcccatac gccgacctgc gccccaacaa 3266101 tgagatcgcc gcatgggagg ccgacctcgc tgctaccccg ctgcggcaac tgtcgcagcg 3266161 gcaaatcaac gccgaagtgc tgcgcggcat cggaaacaat tcacagaagt cacgggacct 3266221 ggtcgaccgc catttgccgg ccttcctgcg tttcgcgggc cgcgaattca tcggtgtgca 3266281 gggcacgcag ctgtcccgct acctggaagg cggggaactc tcgtaccgga tgtactgctt 3266341 caccaaggac tgagccagtt tcgggtaatg tcgcccggat gagcccagct gagcgcgagt 3266401 tcgacatcgt tctatatggc gccaccggct tctccggcaa gctgaccgcc gaacacctcg 3266461 ctcacagcgg gtcaacagca cggatcgcat tggccggtcg gtcaagcgaa cggctgcggg 3266521 gcgtgcggat gatgttgggc ccgaacgcag cggactggcc gctgatcctc gccgacgcat 3266581 cccaaccctt gacgctcgag gcgatggccg cgcgggccca ggtggtgctg accacggtcg 3266641 gcccctacac gcgttacggc ctgccgctgg tggcggcctg cgcgaaggcc ggaaccgact 3266701 atgccgacct gactggcgag ttgatgttct gccgaaacag catcgatctg taccacaaac 3266761 aagccgccga cacgggcgcc cggataatcc tggcgtgcgg attcgattcg atcccttcgg 3266821 atttgaacgt gtatcagctg taccgtcggt ccgtcgagga cggcaccggt gaactgtgtg 3266881 acaccgacct cgtgctgcgt tcattctcgc aacgctgggt ctccggcggc tcggtagcaa 3266941 cgtattccga agcaatgcgc acggcatcca gcgaccccga ggcccgtcgg ctcgtcaccg 3267001 acccgtacac gctgaccacg gaccggggcg ccgaacccga acttggtgcg cagccggatt 3267061 ttcttcggcg tccaggacgt gatctggcgc ccgaacttgc cggcttctgg accggcgggt 3267121 ttgtgcaggc tccgtttaac actcgaatcg ttcggcgtag caacgcatta caggagtggg 3267181 cttatggccg gcggttccgc tactcggaaa caatgagtct gggaaagtcg atggcggcgc 3267241 cgattctcgc cgcagccgtc accggcactg tggcgggcac catcgggttg gggaataagt 3267301 atttcgaccg actaccccga cgattagtgg agcgcgtcac gccaaagcca ggcaccggtc 3267361 cgagccggaa aacgcaagag cggggccatt acaccttcga gacgtacacc accacgacga 3267421 ccggtgcccg ctacagggcg actttcgcgc acaacgtcga cgcgtacaag tcgaccgcgg 3267481 tgttgctcgc gcagagtggt ctggcgctgg cgctcgatcg cgatcggctc gccgagctgc 3267541 ggggggtgct cactcccgca gcggcgatgg gcgatgcgtt gttggcgcgc ctcccgggcg 3267601 ccggcgtggt catgggaacg accaggctga gctaacatct ccaccccggc cgccagcaag 3267661 attagctatg ccatgggcac attagcccaa tcctgttctc ccagatctgg gcctttgccg 3267721 ccgagaatca aactcctgac gacaacccac gttcacatgt gggcttcagc accggcgctg 3267781 caccatcgga agctcctcga ccagggtcgg caggttcagc ggagcccgcg aagcgacaaa 3267841 caccgcacga gccaaacctg tggaagctat cggtccgttc gcccgccaat ccagtggaaa 3267901 ctgccggtgc cggggctgcg tagcggtcac atatacgtgc ggcatcttct ccctcagtcg 3267961 gttcatcacc cacacgcgcg aaggccgaca ccccgtgccg gttatcgcct ggctcggcga 3268021 ggaagccctc tcactaacca gaaaaggctc gtcttcgccg gaatagctca cgcatgtctc 3268081 cagcagcaga aggtccactg cgcggtcaca catccacgcc agcgcctcgg caggacggga 3268141 gaggtggtag agcactccat agcagtacac cacgtcgtat tggtgcgctt ctgctgggag 3268201 atcgccgtcg agatctaggt ggtcgactgt gacattggga ttggacccga agcgttggcg 3268261 aatgacatcc agattctccc cccggggctc ggtgcagagc accttgcacc cgcggtcgag 3268321 gaagaactgc gtgtgatcgc cgatcccggc accaacctcc agcacgctct tgttgccgag 3268381 gtcgagcccc agcgtggcca ggtgctcctg acggcgggcg ttgtgccgaa ggtaaaagat 3268441 gctgtgaaaa tgccgttccg cagtcgggcg caacatgccg gggagtcgca tcaggcgagg 3268501 atagcgcacc tcgccgagga gaaacgctcg ccgatggcta tcgagctgct gacacgtctg 3268561 cggccccagc gtatcgacct cctcgagccc acgcccgctt ggccctggac gccaccgcca 3268621 cccagcagcg caaacgcccg gggcaagcac tcgaggtaga gagcggcagc cggcgccagc 3268681 tatcccttcc ggctcggaat aaagaagtag cagtaccggt cgtcgcggtg ccgttggtag 3268741 ggctgcagcc ccgcatcgtc ggcgtagacg aacggttcgt aaccgtacgc gcggatgtcc 3268801 gcaatggtcc gttccgggtc cggattagag gcggcgcccc cgtagatctc caccagcagg 3268861 accgggcgat cgcgccgcag aagctccgcg gcgcccgcga tgaccgcgcg ctcgaggccc 3268921 tcaacgtcga tcttcagcag acccaccggg aggggcagct cggcggcgag cgcgtccagc 3268981 gtggtacacg gcacccgtgt ccgctcgcga atccgaattc gtcccgtgtc gtttagcgaa 3269041 ctgaaggcgc tgtcggccgc cacgaaaaag tcgacctcgc cgaccgcgtc cccggcggcc 3269101 gtccgcagcg tgcggatgcg gtcttgcagg ccgttggcgg ccacgttggc ctccaaccgc 3269161 gaatgggtgc ccggcgccgg ctccagggct accaccgggg ctaacctcgc ccaggccagg 3269221 ctgtgtatgc cgacgttggc tccgacgtcg aggatgcagc ggtctgggta gagcgcggaa 3269281 tagagcgccg ccgcgatgtc gatctcggtc tcctcgaacc cgccggtcaa ccgaacgatc 3269341 cacgcgatgg ccgaccccgg ctcaagggtg acctggaggc cgcgccaata ccagggggcc 3269401 agccgccacc gatggggggg caagccaaac ggccgccagc gttgcaggct tcgaacgagg 3269461 cggtttggca tggcgcactc taacatccgg atcgcccgca tccggtaggt cggccgttga 3269521 gctccgaggt tctcgaaaca accagtggtg cccagatcca aagggtgcca acgccgctgg 3269581 cccttcgccg gcccaagccg tctgcacact accacccgca tcaggcgcac atcttggaac 3269641 tgcaccaggt ccaatcgtca gcagcgcctg gcgttgtgac cgaacctcgg gtccgcagac 3269701 ccactacaat gttgcgcgac ccaaactatc ccccggggcg gagtatttag cgtgttagtg 3269761 ttgcacagtg aaatcgttga aactcgctcg tttcatcgcg cgtagcgccg ccttcgaggt 3269821 ttcgcgccgc tattctgagc gagacctgaa gcaccagttt gtgaagcaac tcaaatcgcg 3269881 tcgggtagat gtcgttttcg atgtcggcgc caactcagga caatacgccg ccggcctccg 3269941 ccgagcagca tataagggcc gcattgtctc gttcgaaccg ctatccggac cgtttacgat 3270001 cttggaaagc aaagcgtcaa cggatccact ttgggattgc cggcagcatg cgttgggcga 3270061 ttctgatgga acggttacga tcaatatcgc aggaaacgcc ggtcagagca gttccgtctt 3270121 gcccatgctg aaaagtcatc agaacgcttt tcccccggca aactatgtcg gtacccaaga 3270181 agcgtccata catcgacttg attccgtggc gccagaattt ctaggcatga acggtgtcgc 3270241 ttttctcaag gtcgacgttc aaggctttga aaagcaggtg ctcgccgggg gcaaatcaac 3270301 catagatgac cattgcgtcg gcatgcaact cgaactgtcc ttcctgccgt tgtacgaagg 3270361 tggcatgctc attcctgaag ccctcgatct cgtgtattcc ttgggcttca cgttgacggg 3270421 attgctgcct tgtttcattg atgcaaataa tggtcgaatg ttgcaggccg acggcacctt 3270481 tttccgcgag gacgattgat tggaatcgct tcgcgaggcc cggcaccaga ccgggcacca 3270541 gaggtccgcg cagatcgcct gggtcgaaga tggtgcagac gaaacgatac gccggcttga 3270601 ccgcagctaa cacaaagaaa gtcgccatgg ccgcaccaat gttttcgatc atcatcccca 3270661 ccttgaacgt ggctgcggta ttgcctgcct gcctcgacag catcgcccgt cagacctgcg 3270721 gtgacttcga gctggtactg gtcgacggcg gctcgacgga cgaaaccctc gacatcgcca 3270781 acattttcgc ccccaacctc ggcgagcggt tgatcattca tcgcgacacc gaccagggcg 3270841 tctacgacgc catgaaccgc ggcgtggacc tggccaccgg aacgtggttg ctctttctgg 3270901 gcgcggacga cagcctgtac gaggctgaca ccctggcgcg ggtggccgcc ttcattggcg 3270961 aacacgagcc cagcgatctg gtatatggcg acgtgatcat gcgctcaacc aatttccgct 3271021 ggggtggcgc cttcgacctc gaccgtctgt tgttcaagcg caacatctgc catcaggcga 3271081 tcttctaccg ccgcggactc ttcggcacca tcggtcccta caacctccgc taccgggtcc 3271141 tggccgactg ggacttcaat attcgctgct tttccaaccc agcgctcgtc acccgctaca 3271201 tgcacgtggt cgttgcaagc tacaacgaat tcggcgggct cagcaatacg atcgtcgaca 3271261 aggagttttt gaagcggctg ccgatgtcca cgagactcgg cataaggctg gtcatagttc 3271321 tggtgcgcag gtggccaaag gtgatcagca gggccatggt aatgcgcacc gtcatttctt 3271381 ggcggcgccg acgttagcgc gataccaccg caacgttgac tcgatgccct tgggcggcgt 3271441 gatcttgggt ggccaacccg cctcttgcaa gaccgacacg tctaacagct tgcgtggtgc 3271501 ggcgcctgtc aagctctttg cgccagtgtc tcattatgtg gacgctattt cggatctggg 3271561 gtgggcgggt tgatccatgc cgcggtcgcc ggtttcgggg gttgcggtga gacgccgaat 3271621 ggattcgggt tggccgagta ggcgttggcc atggccgcgc ccatgtggcc tggccaggtg 3271681 cgggcgtgtt cgatcgaatc gaaccgttca ggagagtcgt tgcggtactt cagcgttttg 3271741 aactgcgaca cgctcccgaa tcaggtgctc gaccggacat ccgttggcta gccggcgata 3271801 tcgtgggcac cctttagcag acgagccgca gcgcactttc gatgtgctgc gggaatccgg 3271861 caaagtctgg tccgaaggct tcggcaagcc gccgggcggc ttgtcggaac tcggccccac 3271921 tgagcacctg ctttacggct gccgccacgc cttcagtgtt gagccgctcg gttcacagga 3271981 gaacgccggc gccggcccgc tcaagggcct ccatgttcaa gtgctggtcc atgttgctgg 3272041 ggagcccgat caccggcacc ccggccgcca acgcctgctg cgtcgtcggg ctgccgccgt 3272101 tgcagagcac cacggcggag cgcgctgcag ccgcttcgcc cggcaggtag tccgcgacga 3272161 aggcgttggc cggcacgttc ttcaggtggt tccggccagc ggtggccgcg atcaccgtca 3272221 cgggtaaatc ggcccagggc gttcaaaacc acctgcaaca ggttctttcc gccggaactg 3272281 ccgagggtcg cataaataat cggccggtct gtcggcagcg agtgccacca agtcggcggt 3272341 tttacgtcgg gcgaccacag gacgggtccg agatatcgat ggttggccgg caggttgtat 3272401 gtcggcacca gctcgggtac gtcggcatac agggtgtagt caccgtcggt gaaaatgcgg 3272461 cacaaatccc agcccagact cgacagcccg tgcttccggc ggagccagtt gagcgggaga 3272521 caatagaggg caaagatcaa cggacggtac aggcggtaca ggatgctgac cggcctgacc 3272581 ccgaagaagc gggtccacgg cacgtctggc agcggaaacc gacggcgggc ctgaggactc 3272641 cagtaggcgt tcgcgatggc gatgtacgga atgccggcta gtcgggcgct gaccgagagc 3272701 gaaagacggt tgtcaccgac gactacgtcc ggtgcgatct cgttcaggat cttcctgtca 3272761 gccgcgatgt atttgcgcaa cgtccgcgtg ttgtagaaga ggcggccctg agcgatttta 3272821 aggagaacct cctcgctggg gacggtgtga atcgggtgat gtgggaacgg gagcgggccc 3272881 aaaagcttat tgaaccgcgg gtcgcaggca aagtggacct cataacgact cgggtccagc 3272941 gaccgcgcca acacgaacgg ccggacgacg tgggccaggg tcgcggcctc ccctacaaac 3273001 aggatccgtt gcctgcgagc gacaggctcc ggtgcggcgt tgggcgccgt gctcgtccca 3273061 gcgtccggtc ccgggtcgcc ggcgacgctt gtttcctcca tactcgcccc ctaatctcga 3273121 ggcagcccgt acccgcaggc aacctcccaa aaatgcaatc ccccaaaatg caatgcgtcg 3273181 agctatttct cacaccgacc gctagttgcg gatcagaaat ccgttgggcg cggaagtcca 3273241 gccgaatttg ttctcccgct ccgcatcatg cttgtaatcg tttggaaatt catcctcata 3273301 tgcctcgatc gcttcatagg gtccaggccc aaacccgggc aggactgggt ggccgttgat 3273361 gttggaatcc tcgactacta ggtagtcacc ggcggagagt agcggccgta gtaatttcat 3273421 ctcggccagc acatgattca tcgagtggtc gctatctaag atggcgaaga tcttgccagg 3273481 gtattcgttt ttgaggcgtt gaatttgttc ggcaatcgcc gggtcggtgg atgacgattc 3273541 aacgaacaaa acatctggtt cgcgccgggc tcttggatcg agggctttgt gtgagttgtc 3273601 cacggtaagt accttgaatg gctggccgat ctgcctcatg atgttggcaa aatacaccgc 3273661 cgagccgccg tagcgggtgc cgaactcgat gacgagggat ggttgcaact cgctcaggat 3273721 ctcctggtaa ttccacatat cgctgacgga tttccagcaa ttgatcccca tataagtggt 3273781 cttcgtccac actaagttgc cgtagtacca cttgtggtat tcttccgcta ctgcgtcgct 3273841 cggccggtag aataactggg ccgcaaaact cgccactaac ctgactagtc cgatcagttg 3273901 ccctacaaga ctagtccgac tgcgccacac tagccccatt ccatcatctc ctcactgcga 3273961 aaccgtagtc agtcgaatgt tggtcattta gcaagcctct ttaagagaac tgatgaggtc 3274021 gaagcggact caatacatgg ctgcggcaat tcgttagacc gcgttcgcgc ccacgttgtg 3274081 agctccgcgc gccgcatcct tggggctcgg tgccgggcat acgcgaccca gcttgcggct 3274141 gagcatcttc tggacaccgc caccgcacgg cggatggtag caacagattg gggttaccct 3274201 caaaccgcgg gttatggact gccaaaggta gccagcttgt cctgctcgcg gtgacagcgc 3274261 aaccacgggt agtgacacta ccgccgtggc gttcctcccc acggcagaag gccggggccg 3274321 gtcgagttcg ggcacaagcc ccagatcgtc gacaacgacg atggcatcgt cctggatcac 3274381 accgtggagc acggcaatcc gcatgacgcg ccgcagctag cgcccgcggt cgaacggatc 3274441 accacacgcg ccggacgccc gcccggcacc gtcaccgccg accgcggcta cggcgagaaa 3274501 cgcgtcgaag atgacctgca cgacctcggt gtacgtacgg tcgcgatacc gcgtaaaggc 3274561 agaccctccc aggcccggcg cgccgaagaa caacggccat cgttccgacg aacagtcaag 3274621 tggcgcaccg gcagcgaagg ccgcatcagc accctcaaac gaaactacgg ttggaaccgc 3274681 tcctgcatcg acggcaccga aggaacccgg atctggacca ggcacggcat cctcacccac 3274741 aacctcatca agatcagcag cctcgcagca tgacccggct cccagagcac gaagctctgc 3274801 cccaccaaca gtccggcggc attcgcccac aaacgactca cttagtcgcc gtcacttttt 3274861 caggtcgaag taactagctg gccaaccatg tccggggccg gttctccggc atgaggcgca 3274921 gagcattctc cacatgctgc gggaatccaa cgcggtctcg tccgaaggca tcggcgagtc 3274981 gcgcggcggc ttgtcggtac tcggaccgac tgatcacctg catcacggcc cctgccaccc 3275041 gctgactctt cagccgctca gttcgcagca gcacgcccgc cccggcccgc tcaacggcct 3275101 ccatattcaa gtgctgatcg agattgcccg cgaccccgat caccggcacc ccggccacca 3275161 aggcctgctg ggtcgtcaaa ctcccgccat tgcagaccac cacggccgag cgagccgcag 3275221 cggcctcacc cggcaggtag tccgccacga aggcgttggc cggcacggtc ttcaggtcac 3275281 tgcggcccgc ggtggccgcg atcaccgtca ccggcaactc agccaacgcg ttcaacacca 3275341 gttgcaacag atttctcccg ccggacgtgc ccagggttgc gtacacgatc ggccggtcgg 3275401 ttggcagcga atcccaccat gtcggcggct tcccggcggg cgaccacagg accgggccaa 3275461 ggtactcgtg gttggccggc aagtcgtagg tgggcatcag ctcgggcacg tcggcataca 3275521 gggtgtggtc cccgtcggtg aaaatgcggc acaggttcca ccccagactc gacagcccgt 3275581 gcctgcggcg gacccagttg agcggcatgc actgcagggc gaagagcaaa gggcgttcca 3275641 ggcggtagag gagcttgacc aacctgacgc cgaacaagcg ggtccatatc acgtcgggca 3275701 gcggaaaacg ccgctgcgcg tacggactcc agtaggcatt cgcgatcgcg atgtaaggaa 3275761 tgccggccag tcgggcgctg accgacagtg aaatgcgaag gtcaccgacg acgaggtccg 3275821 gcgcgatctc atccaggacc cgcaggtccg cctcaacgta cttccgcagc gtccgcatgg 3275881 catagaaacg accctgagtc agattgccga aaaaccgctc gctggggatg gtgtgaatcg 3275941 catggtgacg gaaagggagc ggacctagaa gctggttgta gcgcgggtcg caggcgaagt 3276001 gcacttcata acgactaggg tccagcgact gcgcaagcgc gaatggccgg acgacgtgag 3276061 ccagggtcac tgcttccgcg acgaaaagga tccggcgcct gcgtgcggca agcccaggtg 3276121 cggcgtccgg tgtcgtgctg atggccgcgt cccctctcac ctcgctagca accggtggcc 3276181 cgccccacct cgacgccgta gcgtacacgc acgacacgcg cactcgggga aaacctcggc 3276241 aagagtgggg cggcgatacg tttagcggca ccactgcgcg gtcgttgccc accccggtga 3276301 ctataccccc gggtggtata tggtggaggg cagagcgtga cctcaaccaa agtggaggac 3276361 cgagtgacgg cagcagtgct gggagcgatc gggcacgcac tggcgctgac cgcgtcgatg 3276421 acctgggaaa tcctgtgggc gctgatcctg ggcttcgcgc tgtcggcggt ggttcaagcc 3276481 gtggtgcgcc gctccacgat cgtcacgctg ctcggcgacg atcggccgcg caccctggta 3276541 atcgccaccg gcctgggcgc ggcctcgtcg tcgtgctcgt atgccgcggt ggctttggct 3276601 cggtcactat tccgcaaagg ggccaacttc actgccgcta tggcgttcga gatcggttcc 3276661 accaacctcg tggtggagtt gggcatcatc ctggccctgc tgatgggctg gcagttcacc 3276721 gccgccgagt tcgttggcgg tccaataatg atccttgtcc tggccgtgtt gttccggttg 3276781 ttcgtcggcg cccggctcat cgacgccgcc cgggaacagg ccgaacgggg actcgcaggc 3276841 tcgatggaag gccatgccgc catggacatg tccatcaagc gggaaggctc attttggcga 3276901 cgactccttt ccccaccggg atttacctcc atcgcccatg tgttcgtgat ggagtggttg 3276961 gcgatcctgc gcgacctcat tctcgggctg ctgatcgccg gtgctatcgc ggcatgggta 3277021 cccgaatcgt tctggcagag cttcttttta gccaatcatc cggcctggtc ggcggtctgg 3277081 ggtccgatca taggacccat cgtggccatc gtttcgtttg tttgctcgat cggcaacgtg 3277141 ccacttgccg cggtgctgtg gaacggaggc atcagcttcg gcggggtcat cgcgttcatc 3277201 ttcgccgacc tactgatact gccgatcctg aatatctacc gtaaatacta tggcgccagg 3277261 atgatgctgg tgctgctcgg caccttctac gcatcgatgg tcgtcgctgg ctatctcatc 3277321 gaacttctct tcggtacaac gaatctcatc ccgagccagc gcagcgctac ggtcatgacc 3277381 gcagaaatat cgtggaacta caccacctgg ctcaacgtca tctttctggt gatcgcggcg 3277441 gccttggtgg tccgattcat cacatcgggc ggtctcccga tgctacgcat gatgggcggc 3277501 tcaccggatg ccccgcatga ccaccatgac cgccacgacg atcacctcgg ccactagcgc 3277561 caccacgccg atcagtcggc gccgaaaagg ccaccggcgg cggtatcctg gcctgcgggt 3277621 attccaccca tgggcaaagg gagcatgacc gcgcacgcaa cgccgaacga gccggattat 3277681 ccgccaccgc ctggcggtcc accgccgccg gccgatattg gccggttact gcttcggtgc 3277741 cacgaccgcc ctggaatcat cgccgcggtg agcaccttcc tggcccgggc cggcgccaac 3277801 atcatttctc tggaccagca ctccaccgcg ccggagggcg gaacgttctt gcagcgcgca 3277861 atctttcacc tgcccggtct cacggccgcc gtcgacgaac tgcagcgcga cttcggcagc 3277921 actgtggcgg acaagttcgg catcgactac cgatttgccg aagcagccaa gcctaagcgg 3277981 gtcgcaatca tggcatcgac agaggaccac tgcttgctgg acttgttgtg gcgcaaccgt 3278041 cgcggcgagc tagaaatgtc ggttgtcatg gtgattgcca atcatcctga cctggccgcg 3278101 cacgtacgcc cgttcggtgt gccattcata catattcccg ccactcgcga cactcgtacg 3278161 gaagccgaac agcgtcagct tcagttgcta agcggcaatg tggatttagt agtgctggca 3278221 cgctacatgc agatactcag cccggggttc ttggaggcga tcggctgccc gctgatcaac 3278281 attcaccatt cgttccttcc agccttcacc ggcgcggccc cgtaccagcg cgcacgagaa 3278341 cgcggcgtca aactgatcgg cgcgaccgcc cactacgtga ccgaagttct cgacgagggg 3278401 cccatcatcg aacaagacgt cgttcgtgtc gaccacaccc acaccgtcga tgatctggtg 3278461 cgtgtcggcg ccgacgtcga acgcgcagtg ctttcccgcg ccgtgctctg gcactgccaa 3278521 gaccgcgtca tcgtgcatca caaccagacc atcgtcttct gacatgggtg actgcgcgcg 3278581 ttgcggtcaa cttcttggtg cccatgatgg tcacggcgtc gactggccgt ttcggcgccg 3278641 tcgcccagcg tgaactgagg gcggaaaatc ggctggcccg aatctcgccc ccagtgcacg 3278701 ctcggcgccg tttggcctca cccggtcaac gtgaactgtc cgggtgggcg ctgtcacgta 3278761 gcgagcccac gtggggccgg ggtcggcccg ccaaaaacgc cccggcgcgg ccagctcatg 3278821 agcgagtacg caagctcaag ggacacccgc tttgcactgt ggaagaaccc cgaagacctg 3278881 gcctgcggca ggtgcggtca aaggagcgga gtgtagacag gaccggtggg tctgctcagc 3278941 gcggccccga attaggacaa ttttcgcacc tagcgcatcc aatatcgctt tcgaagaacg 3279001 ttcacgccag tcccactggg ccggtgcgaa tggtgcaacg cgcctttcgt cgaaggaaac 3279061 gccgtccgcc accgagcccg cgctaggcaa gtcggtccca agaacgtcgc aaggatacgc 3279121 caagcggccg cggtcaatct tgacttgtcg gccaccgccg gcaaaccaac attcagccac 3279181 aacgcgacag agaggtaccc aatgttcact gcccgtatcc gcgccctcgc cggcatgtct 3279241 ctgctagcct cggcgatcgg actggcggcc ttcggagccg ctaccggcac cgccaatgcc 3279301 gccccgaccc accaacccga gtggggcacc tacacctgct acgactacgc aacccagacg 3279361 ttctacgagt gctttgaccc cagctagtcg gcgaaggcct cacacgatcg gacctagtcc 3279421 cgcaaaggag ctaggtccgt tcggtgttga gcctgtcccg cagccggcga ttcaccggtt 3279481 cgggcagcaa ctcggacacg tcaccgccca gcatcgcgac ttctttggcc agtgaggacg 3279541 acacgaacga ataccgtggc gcggtcgcga cgaaaaaggt gtccacaccg gcaatgtgtt 3279601 tgttcatttg cgccatctgc agctcgtatt cgaagtcggt gccggtgcgc agccccttca 3279661 cgatcgcggt catcccgcaa gacctgacaa agtcgaccac caagccatgc ccgacctgca 3279721 cgcgcagatt gggcaggtgc gttgtcgact ccttgaccat cgcgatccgc tcgtcgaggt 3279781 cgaacatgcc cgtctttgca gggttgacca ggatggcaac caccacctcg tcgaattggg 3279841 ctgcggcgcg ttcgaaaatg tcgacgtggc ctaacgtcac cgggtcaaat gaccctgggc 3279901 ataccgcgcc cgtcatctgc gccgctcctc ctcatcgctg cgtcccccgc aagcgggcac 3279961 ggcccccacc gcatcgtcgc cggcggtcat gaccgatgac gctacacgtt ggcaaaaagc 3280021 cgttcggcca gttccaaacg ggtgtcgccg taaacacgct ggggccatcg gcgccagccc 3280081 tccggccacg tcaacggcgc gcacgtggtc gcacgctcca ccaccgctac ggttccctcg 3280141 cgcgtccagc cgttggtgcc cagtgcggcc aggatggcgt caacgtcggc ggagtcgacg 3280201 ttgtagggcg ggtcggccaa caccagatcc accggggacg tggtcccggc cgccacgacg 3280261 gccgccaccg cgccccggcg cagcgtcgca ccggagagac ctagggcctc gatgttgcgc 3280321 gcaatgacgg ccgcgctgcg ctggtcggac tccacgaaca gcacggacgc cgctccccgc 3280381 gacaacgcct ccagccccag ggcgccggaa cccgcataga ggtccaacac cgccagaccg 3280441 gtcagatccc gccgcgcagt cacgatgttg aatagcgact cgcgcacccg atcggtggta 3280501 ggtctggttc cgcgtggtgg gacggcaatg cgccggcctc cggcgacacc gccgatgatc 3280561 cgggtcaagt gcgccgctct ccctcgcaag cgggcggtac ccccacctca tcgcttcgtc 3280621 ccccgcaagc gggcggtacc cccactgcat cgtcgccggc ggtgctcatc tgcgccgctc 3280681 ctccgcaagc gggcggtacc cccacctcat cgcttcgtcc cccgcaagca ggcggtaccc 3280741 ccactgcatc gtcgccgggg cggtcagctc accaccacca acaggtctcc gccctccacc 3280801 tgggcggtgt ccgacaccgc cacccgctcc acggtgccgg caaccggggc ggtgatcggg 3280861 gcttccatct tcatcgcctc gatggtggcg atggtttggc cggcgccgac ccgctcgccg 3280921 acgcacaccc cgaccgtgac gactccggca aatggcgcgg cgatgtgtcc gggattgccg 3280981 cggtcggcct tctcggcggc cggaacggca ctggcaatgc tgcggtcgcg cactagcacc 3281041 ggccgcagct gcccgttgag gatgcacatc accgttcgca tgccgcgttc gtcgggttcg 3281101 gaaatggcct ccagcccgat caacagctcc accccacgct ccagcttcac ccgatgctct 3281161 tcaccttggc gcagaccata gaagaactgg ttggccgaca attgcgacgt gtcgccgtag 3281221 gcttcccggt gctcattgaa ttcctttgtt ggactgggaa ataacagcct gttcagggtg 3281281 gcctgacgct tggctccgac cgacgatagg gcaatctcgt cgtccgccgc caattgcgca 3281341 gtgggcctgg ccgccccgcg accggccagc gccgcagtgc gcagcggttc gggccacccg 3281401 ccgggcggat cacccagctc gccccgcaga aatccgagta ccgattccgg gatgccaaat 3281461 cgcgctggat cggaggcgaa ttcgtctgca ctgacaccgg cgccgaccag tgccagcgcc 3281521 agatcgccga ccaccttgga cgttggcgtg accttaacca gcctgcccaa cactcggtcg 3281581 gcgcccgcgt aggcctcttc gatctcttcg aatcgatctc ccagaccaag agcaattgct 3281641 tgctggcgca gattagacag ttggccgccc ggaatctcgt ggtgataaac ccgccccgtc 3281701 ggccccggca acccagactc gaacggcgca tacacttttc gtaacgcctc ccagtacggc 3281761 tccagggcgc acaccgccga aagcgacagg ccggtgtcgt actcggtgtg ggcagcggca 3281821 gcaacgatcg agctcagcgc gggctggctg gtcgttcccg ccagcggcgc ggcggcgccg 3281881 tcgacggcat cggccccggc gtgccaagcg gccacatagc tggcgagctg gccacccggt 3281941 gtgtcgtggg tgtgcaggtg aacgggcagg tcgaagcgac tgcgcagggc gctgaccaac 3282001 ctttgagcgg ccggcgggcg caacagtcca gccatatcct tgatcgccag cacatgggcg 3282061 ccggcgtcca cgatctgctc agccagtttc aggtagtagt ccagcgtgta cagctgttca 3282121 cccggatcgg taaggtcgcc cgtgtagcac atcgcgactt ctgctatcgc agaacctgtt 3282181 tcgcgtactg cgtcgatcgc cggacgcatc gactcgatgt tgttgagcgc gtcgaagata 3282241 cgaaagatgt cgataccggt ggctgttgct tcttgcacaa acgccgacgt cacgatttcc 3282301 gggtacggcg tgtagcccac ggtattgcgg ccccgcaata gcatctgcaa gcagatattg 3282361 ggcattgctg cacgcagtgt ggccagccgt tcccagggat cctccttgag aaagcgcagc 3282421 gccacatcgt aagtcgcacc gccccaacac tccacggaca acagctgcgg catggtccgc 3282481 gcgagatacg gtgccacccg cgacagtccg ctggtgcgta ctcgggtagc cagtaacgac 3282541 tggtgagcat cccggaatgt ggtatcggtg accccgaccg cggccgactc ccgcagccaa 3282601 cgagcaaatc cttccggccc caacttgact agtcgctgct tggacccggc cggtggtgcg 3282661 gcccgcagat caagatcggg cagcttgtcg tccgggtaga tcgttgacgg acgcgagcca 3282721 tacgggttgt tgacggtgac atcggccagg aagttaagga tcttggtgcc gcggtcggcc 3282781 gaggcgcgcg cggtcagcag ctgcggccgc tcatcaatga aggacgtggt gacccggccc 3282841 gctcggaagt ccgggtcatc caggaccgct tgcaggaacg gaatattcgt cgataccccg 3282901 cggatccgga actccgcgat cgcccggcgc gcacggctca ctgcggtagg gaggtcacgg 3282961 ccccgacagg tcagcttgac cagcatggag tcgaagtacg ggctgatttc tgcgcccagg 3283021 ttggtgctgc cgtccaggcg gacaccggca ccgccggcgg tgcgcaacgc gctgatccgg 3283081 cccgtgtccg gccggaagcc gttggccgga tcctcggtgg tgatccggca ctgtagtgcg 3283141 gcgccatgcg gtgcgatgtc ctcctgccgc aggcccaatt gttcgagcgt ctccccggcg 3283201 gcaatgcgca gctggctggc gaccaggtcg acgtcggtaa tctcctcggt caccgtgtgc 3283261 tccacctgaa cccgcggatt catctcgatg aagacatact cccctcgctc gtccagcagg 3283321 aactcgacgg tgcccgcgca gctgtacccg atatggcggg cgaaggcgac cgcatcgacg 3283381 cacatcttgt aacgcaactc ggcgtccagg tgcggcgcgg gcgccagctc gatgaccttc 3283441 tgatggcgac gctgcacact gcagtcacgc tcatagagat ggatcacgtc gccgaggttg 3283501 tccgccagaa tctgcacctc gatgtggcgt ggattgatca ctgcctgctc gagatagacc 3283561 gtcgggtccc cgaacgccga ctcggcttcc cggctggcgg cttcgatcgc ctccggaagc 3283621 gccgcgatat cgccgacacg acgcataccc cggcccccgc caccggcaac tgccttgacg 3283681 aacaacggaa acggcatgcc ggccgcaacc gacagcagtt cgtcgaccga ggccgacggc 3283741 gccgaggaca tcagcacggg caagccggct tcgcgggccg ccgcgatggc gcgagactta 3283801 ttcccagcca gctcaagcac ttcggcgctg ggaccgacga agctgatgcc cgccgccgcg 3283861 catgccgcag ccagatccgg attctccgat agaaacccgt agccagggta gatagcgtcg 3283921 gcacccgccc gacgggccgt cgcgacgatc tcgtcgaccg acaggtatgc atgcaccggg 3283981 tgaccgatgt cgccgatctg gtaagactcg tccgccttga gacggtgctg cgaattgcgg 3284041 tcctcgtacg gataaacggc cacggttccg acgcccagtt cgtaggcggc acgaaaggcc 3284101 cggatcgcga tctccccgcg attggcgacg agcaccttgg aaaacacgtg tggctccctt 3284161 atccggatgt ctcagatcag cgtcgaccaa tagtcccaaa agcggaccat gatcagcagg 3284221 aatactgtcg tgaaccagag cgtggccagc gaccatcgcc attgatagag cagccgtgcc 3284281 ccgacgcgct cctggcttcc ccggttttcc cgcatcggac cgaaaacgat ggacgcgacc 3284341 accaccagca gtgtggcgat gaccgcccag accaccatgc agtatgggca cagggcaccg 3284401 atacggtaca ggctctggaa tatcagccaa tgcacgaacg ccacaccaac caggatcccg 3284461 accgccaggc cgatccaata ccacctgggc aacggcactt tcgccaccgc cagcaccccg 3284521 gtgaccacca ccacggtgaa gcccgcaatg ccgagaagcg ggttgggaaa gcccagcaac 3284581 gacgcctgcg gtgtggtcat caccgagccg cacgacacta tcgggttgac attgcatgac 3284641 ggcacataga tcggatcgag cagaatcctg accttctcca ccgtgagcgt catcgaagcg 3284701 aacagcccga tcacaccgcc gatcagcacc caccacgcgc taggcaccgg cacccgcacc 3284761 gcagccgggt cgccggatcg ctcggcaggt cgagctgcca ccacaatcgt caggatgtcg 3284821 cggtagcagc ggccgagtca atgcccggca catcacccac aatttctttg atcttggcga 3284881 ccagcgccgc cggcgtcgac cactcgtact ctgtgccatt gacccggacc gtcggggtcg 3284941 cgtgcacgtt gaccgccgcc gccagcccgt cgactttttc gatgtacttg ccgctgttga 3285001 tgcagtcggg caccttgccc acgacgccgg cttcgcgggc aagttcgatc aaccgcgcgt 3285061 tgtcggggaa atccttgccg agctcggcag gctggatgtc cttgctgaac aaggcggcgt 3285121 ggaagcggcg gaacgcctcg atcgattcgt cggcaacgca ataagccgca gcagccgctc 3285181 gcgacgaata gtgttgattg ctggcgctat cgagaatggc caccatcgtg taatcggccg 3285241 cgacagcgcc gatgtccacg agcttggaca cggttggccc gaaaccgcgc tcgaatatgc 3285301 cgcacgccgg acacaggaaa tcctcgtaga aggacaccac ggccttgggg ttgctggttc 3285361 cgggctgggt gaccagcttg ctcgacgtca cccgtactgc atcgccgggg cccgcgacgc 3285421 cgtccttctt gtcgtcgcgc gacgtcacga tgtagaagac caggacgacg gcaaaaacga 3285481 cgacgatggt ggtgccacca atctggacga gccggccgaa gctgccgtcg gcggacttca 3285541 gatcgaatcg cggggggcgt ttggatttgt cggccacagt ttcgctgatc ctcacgtgct 3285601 cgatttgtcg gcttgtcgcg gccgcggtca ggcgacggcg cctctagcgt accggcggca 3285661 agccagcctc gactcaaacc cggctaaggt gcgcgcgcag cgcggagatc agctcgttgg 3285721 tcccggcagc actgcctccc cccagctgaa acaggttgag gaagccatgc gtcagcgaac 3285781 ccagataccg caagtccact gcagtcccgg cagcccgcag cgccttcgca tagctttctc 3285841 cttcgtcgcg caatgggtcg aagccggcga ccgcgatgag agcaggcgcc agcccggaca 3285901 gcgattcggc caacaacggc gacaaccgcg gatccgccgg atcgacatcg gaatccctga 3285961 ggtattgcgt gtggaaccaa tcgatgtccc gcttggtcag caggaagcca ttgccgaaca 3286021 ggcccattga gcgagtctgt gcggtgaaat cggtcctggg atacagcagc cactgcagca 3286081 ccggggtggg cccaccctcg tagcgagcct tgtcgcgcgc caactgacac accacggccg 3286141 acaggttgcc gcccgcactg tccccgccca ccgcgacccg cccggggagc gcaccgaact 3286201 catcggaagc gtgctcatgg gcccatacaa aagccgcata ggcatcttca accgcggccg 3286261 gcgccggatg ctcgggagcc aaccggtagt cgatcgacag tacctggatg tcggcgtcgc 3286321 gacaggtcaa ccggcacagc gcgtcatggg tgtccaagtc cccgagcgtc cagccgccac 3286381 cgtggtaaaa gaccagcagc ggcgtggcgc caccgccgct ggggcggtag tgccgcgccg 3286441 ggatctcacc ggctggtccg ggtattgaca ggtcggtcac gtcgacgtgg atctgcggac 3286501 cgggcatcgc ctcgcatatc gcgcgcatgt gcgcgcgaga ggcgacgatg tcgtcgtcta 3286561 cggccaggcc gtcgacaccg aagatccgcg aagtcgacaa catcagctgc agggtggggt 3286621 caagcgtatt gccatcgata atgaccgatc ggccggccga caggatccgt ttggcaggcg 3286681 tcgggatcca cggaaggacc ttgactccga cgttgacgac ggtgccctgc acacgccgtg 3286741 tccacatgcg cgggtggttt gctccgagac ggaggtctgc cacacctggc agactcttgg 3286801 tcatgggctg ctccctacaa aactctgtca cgcgcagcaa cggacactcg atccgcgccg 3286861 tcaggctgga tgtctttcgg gtcctgccgg ccgacaccgg gcaagcggta ggtgccgcga 3286921 gtccggcgtg cccaacggcc aactctacgt ggtgaccaaa gtgttgaatg ccgaccagca 3286981 ctattcgcgg cttacgccgc cgtcgccgaa ggctgtggct cagcacctgc ccaggtgttg 3287041 attaggtggc atatccaact cggtaatatc gtgatcccca agtcggtgaa cccaatgcgg 3287101 attgcgagca acttcgacgc gttcgatttc cctcgctcga tgacggaacc cggcttggtc 3287161 cgaatccgaa aaccttcaat ttcacaggca ggtgagatga cgtgactggc gagtcgggcg 3287221 ccgccgccgc accctcgatt accctcaacg acgagcatac gatgccggtg cttggcctcg 3287281 gcgtcgcgga attgtcggac gacgagaccg aacgtgcggt gtccgcggcg ctggaaattg 3287341 gctgccggct gatcgacacc gcctacgcct atggcaacga ggccgcggtc ggccgcgcaa 3287401 ttgcagcctc cggcgttgcc cgcgaagagc tgttcgtcac caccaagcta gccacccccg 3287461 accagggttt cacccgttcc caggaagcat gtagagccag tttggaccgc ctcggcctcg 3287521 actacgtcga cctttaccta attcactggc cggccccgcc ggtgggcaag tatgtggacg 3287581 cctggggagg catgattcaa tcccgcggag agggccatgc ccgatcgatc ggcgtgtcca 3287641 acttcaccgc ggagcacatc gaaaacctta tcgacctcac attcgtcacg ccggcggtca 3287701 accagatcga gctgcacccg ctgctcaacc aggacgaact gcgcaaagct aacgcccagc 3287761 acaccgtcgt cacacagtcc tactgccccc tggcactcgg caggctgctg gacaacccaa 3287821 ccgtcacatc aatcgccagc gaatacgtca agacgcccgc acaagtgctg ctgcggtgga 3287881 acctgcaatt gggcaatgcg gtggtcgtcc gctcggccag acccgagcgc atcgccagca 3287941 acttcgacgt cttcgacttc gagttggcgg ccgaacacat ggatgcattg ggcgggctca 3288001 atgacggcac ccgggtgcgc gaggatccac tgacctacgc cggcacctga tacgccgccg 3288061 actgtgaacc gcgcgacgtc tcctcggcgt gtcacgtcgt gagattcacc gtcggcgcgt 3288121 ggactagccc gtcgggcagg tggccgcggc ctgacgcagt acgtcggacg atggctgatc 3288181 cactggcagt gaatagccgc gcagcacggc gatgaattgc atcgcgtact gacaggcgaa 3288241 ggccttgttg ggtggcatcc attgggccgg tggcgaatcg cccttgtcct gatttgcctg 3288301 cccctgcacg gccagcaggt tggccggatc gttggcgaag cgcattcgct cggagttcgg 3288361 ccaccgatag gcgcccatgt cccaggcata cgagagcgga acgatgtggt cgatctggac 3288421 cgattggcca acactggcgc cgcgttggaa ggcaacggtg gtgttggtgt acggatcgcg 3288481 cagggtgccg gtggccaccg cattcggaca ccgcttgatc gacacatatg tcttgtcgac 3288541 cagatcccgg tcgaggatgt cgtcgcgggt gtcgcacccg ttgtgccctc ccggcgcgtc 3288601 attgcgatcg tcccaggggt gaccgaatgc ggacctgcgg tagtcgtagc ggtggatccg 3288661 tttgggtagc acggcgatgc cggcgagcac gtcggcaccg ggttgcacgg ttggcacgcc 3288721 agcgcgggcg gcgaactcgt cagcgtgcct gcccgccgat gatcccagcg tctgatacgc 3288781 gaccaccagc gccagcgccg cgatcgccga cagccacagt agcgttctgc ggttcatgac 3288841 ttatctaagt attcgatgcg gtcggtgctg gtgaatcgcg cggccatcag cgccaatgcg 3288901 gggtctgtgg ggttcttgta agcctcgatg cagaagtccc gcgcggccac tatgtattcc 3288961 tcgtgttcgg ccaatgacag caaccgcagc gtgatggcct tgccggattg gttgcggccc 3289021 agcacatctc cctccttgcg ctccttcaga tccagatcgg cgagggcgaa cccgtccatt 3289081 gtcccggcga ccgcacgcag ccgctgaccc gccggcgtat ccggcggcac ccagctggcc 3289141 agcagacaca cgctgggatg ttcgccgcgc ccgatgcggc cgcgcagctg gtgcaattgg 3289201 ctgatgccga accggtcggc gtccatcacc agcatgaccg tagcgttggg gacatcgacg 3289261 ccaacctcaa tgaccgtggt gcacaccagc acatcgacct caccggcccg gaaagccgcc 3289321 atcgcagcgt ccttgtcgtc ggccgacaac cgtccatgca tgagcgccaa ccgcaactct 3289381 gcgagctcgg cggaacgcaa ccgggagaac aggccttcgg cagtggccga tggtcggacg 3289441 ccgccttgaa cgtcggtgtc gtcggactca tcgatgcggg gcgccaccac ataggcctgg 3289501 cggccggcgg cagcctcttc gatgatgcgc cgccaggcgc ggtcgagcca ggcgggcttg 3289561 tccttgacaa agatgacgtt ggtggcaatc ggctggcgcc cgagcggaag ttcgcgcagc 3289621 gtagaggttt ccaggtcgcc atagacggtc agcgcgaccg tgcgcggtat cggcgtcgcg 3289681 gtcatcacca gcaggtgcgg ggtaatgccg gcgggggcct tggcgcgcaa ctgatctcgc 3289741 tgctcgacac caaaccggtg ttgctcgtcg accaccacca tgcccaggtt gtgaaagtcg 3289801 acggcctcct gcagcagcgc gtgcgtgccg atgacgatgc cgacctgacc gctggcgatt 3289861 tcggcgcgaa cttgcttctt ctgccctgcc gtcatcgaac cggtgagcag tgccacccgg 3289921 gtggcgtttt cggcgcctcc cagttggccg cccatggcca gcggccctag gacatcgcgg 3289981 atcgatcgca agtgttgtgc ggcaaggact tccgttggcg ccagcagggc acactggtaa 3290041 cccgcgtcca ccatctgcag catcgccaac accgcaacga tcgttttgcc cgagcccact 3290101 tcgccttgca gcaggcgatt cagcgggcgg ttcgccgcga gcccgtcgga caacacgtcg 3290161 agcacctcac gctgtcccgc cgtcagctca aaaggcaacc gccgcagtag ctcagcggca 3290221 agaccgttag atttccaggc cgccgagggc ccggattccg acagttcacc gtgccgtcgg 3290281 gccaccagcg cccactgcag acccacggcc tcgtcgaagg tcaggcgttc ccgggcgcgc 3290341 tcgcgtaacg actggctttc ggcaaggtga atggcgcgca gtgcctcgtc ctcggggatc 3290401 aggccgtgct tggcgcgtag ttccgcgggc aacggatcat cgacccggtc gagaacatcg 3290461 agcacctgcc gcacgcattt gaagatgtcc cagctctgca cttttgtgct ggccggatag 3290521 atcgggaaga aacgacgctc gaactcctcc acgaccaatt caccgctgat ggccttggag 3290581 gcatcagcga tacttttgag cgacctggtg ccgtggttct tcccgtccgg cgagtcgagg 3290641 atgagaaacg ccggatgcgt gagctgcatc gcgcccttgt agtagccgac ttccccggag 3290701 agcatcacct tcgtgtgctt ggtgaggtcc cgcatgatgt agtccgcgtt gaagaacgtg 3290761 gccgtcacct tgttgcggcc gccgccgacg gtgatgcgca gacatttccg attcggcttc 3290821 tttttcatcg gaaacgaata cgtatcggtg atcacgtcga cgatggtgat gtgctcgcca 3290881 gcttccggtc gcgcgtcacc gatacccacc cgcgccgcgc cctcgacgta gctgcgcggg 3290941 tagtggcgga gcaggtcgtc gacggtccgc atgccgaact gctcgtcgag ggcatcggct 3291001 gccgtggcgc cgaggacgcg atcgagccga tcgcttaacg acgccaccgc tactcgaccc 3291061 cgatcagcag cgcgtcgccg cggtgtccgg tgcggtagga gaccagctcg gtgcctggat 3291121 ggtggtcgtg cacatgccgt tccaggacga cagccacgtc ttcggttacg ccggcgccaa 3291181 ttagcaccgt caccagatcg cctcccgatg ccaacaacag gtcgaccaga ccgatggccg 3291241 ccgcggcgac atcgtcggcg acgatcagca cctcgtcgcc cgcgataccc agaccgtcgc 3291301 ccggcttgca ggtaccggcc caggtcagcg ccttttgggt ggcaatgcgc accgatccgt 3291361 gccgggaagc accggcggca cgggccatgc tgtagccgtc gtcgacggcc tggcgggccg 3291421 cgtcatgcac ggccagcgcg gccaacccct gcaccatcga tccggtcggc acgggtacca 3291481 cgtcgacgcc ccagccgatc gccgcggtac acccggccac cagttcttcg gcggccacat 3291541 agccattggg cagcaccatc acgtgcgcgg cgccggtgtc taccacggcc cgcaccagct 3291601 ggtgggcact gatatcggcg gccggtgtca cggcgtctgg acccggtcgc agcacgcagg 3291661 cgccctcccc ggcgaacagc tcggcggcac cgtcgccgtc gacgaccgcc agcacggcgc 3291721 ggccccgcgt ccagccaccg gccggcaatc cgctggtccc ggaaccgagc gccgagatca 3291781 cgatccggct aactcgcccc accgccaatc cggcttccac ggcggcaccg gcgtcgtcgg 3291841 tgtggacgtg tacggagtag ctgtcgggcg gagcagcggc gatggccacc gactcaccca 3291901 attccttgag tcgatcccgc aactggtccg ccgctgcagc atcacatacc gccaacagat 3291961 acatcacctc gaattgcggg gcggggcgtt gggtagccgt gtcggtcggc aacgcgcgcg 3292021 gcgagggttc gtagaccgcc cgggcaggtg cctgcccgca gatggtggag cgcaacgcgt 3292081 ccagcagaac cagcaggccc cgtccgccgg cgtccaccgc gcccgcatcg gcgagcacgt 3292141 caagctgttc gggggtcttt tccagcgcga tgaccgccgc gtcaccggcg gcggtgaccg 3292201 caccggccaa cccctcgtgc gcgcactggt cgacggctcc ggcggcggcc cgcagcaccg 3292261 agacgatagt tcccggcacc tccacgccac ccatcgacgc gacgaccaac tcgacgccgc 3292321 gccacaacgc ggccccgagg gcgttggcgt cgaccgcccg caataccgcg ccagaggcgg 3292381 cggccgcagt cgcggtcacc tctgcgatcc cgcgcaggat ctgggacagg atcacgccgg 3292441 agttgccgcg agctccgttc aacgcgccgg ccgcgagagc ggccgcaacc cgcgccacgt 3292501 cttcggcgtc agcctgcgaa ttcgcgtgca aatcagcttc tacgaccgcg gcacgcatgg 3292561 tgaacagcat gttgacgccg gtatcggagt cagcgaccgg gaacacattg agccggttga 3292621 tctcgtcgat gtggaggatc agatcgctga cgacggcgtg tgcccagtcc cgcaaggccg 3292681 aggcgtccaa cggccgatcc gccgtcccca ctacaacaca cctcctccgc aacacacctc 3292741 ctccgcgcca gcccgcgccc cgagcctaac cagacgtggt gacagcacgg tcacgacgcc 3292801 gctctcccgg ccaaggcggg tgctgacatg tccgcgaagg gctgatcgtt ttggcgctac 3292861 cgcacaacaa tggctatcct gtgctagccg cgggctacac gtaggcgtcc cggccaggtc 3292921 gccggaccta agagatttga ggagcttgac gaatggccgc tgtgtgcgat atctgcggga 3292981 aaggccccgg cttcggcaag tcggtgtcgc actcccaccg ccgcaccagc cgccggtggg 3293041 atccgaacat ccagactgtg cacgccgtga cccgtcccgg cggcaacaag aagcgactca 3293101 acgtttgcac atcctgcatc aaggcgggca agatcacccg cggctgacgc ccggtaacac 3293161 ctgcacgact cagggcaacc gccaatcgat cggctcggca cccatcccga cgagcagttc 3293221 gttggcgcgg ctgaacggac gcgagccgaa gaacccgcgc gatgccgata gcggtgaagg 3293281 atgcggcgac tcgatcgcaa cgcagttgcc cgcggccagc atcggcttca gagtcgacgc 3293341 gtcacgaccc cacaggatcg ccaccagcgg cgctgcgcgc gccgccaggg cgcgaatcgc 3293401 gcattccgtg accgcttccc agcccttgcc ccggtgcgac gccgggttgc tgggtcgcac 3293461 cgtcagcacc ctgttcaaca gcaacacacc gcgttgcgcc cagggcgtca gatcgccgtt 3293521 cgagggcagc ggatagccca aatccgcggt gtactcgtcg aagatgttgg ccagactgcg 3293581 cggccacgga cgtacatcag gggccaccga gaagctaaga cccacagcat gtcctggagt 3293641 cggataaggg tcttggccaa cgataaggac acggacgttg tcgaacggga aagtgaaggc 3293701 gcgcaacaca ttcgatccgg cgggcaggta tctgcgcccg gccgcgatct cggcccgcaa 3293761 gaactgcccc atgtgggcca cctggtcggc caccggctcg agcgcggcgg cccacccccg 3293821 ctcgacgagc tcactcaacg gccgtgcggt cactgcatcc ctttcgcgta cagacggtca 3293881 ccgcgtcacc ctagcgaacc ttgattgtct ggctccccaa acgattgcca gcccgcgtat 3293941 ccagtccact cctcgccgtc gaccagcacc ctagccggcc cgtcgagaac ccggccaatg 3294001 gtgcgccacc cggccggcac cggaccgacg aaacaggcga ccagggcatg atcttcaccc 3294061 ccgcttagca cccacggcca ggggtcggtg cccagagcgg ttgcggccgc agtcaaagcg 3294121 tcgcggtcag cggccaacgc cgcggcggac aggtcgatgc gcacgccgga tgcctcggcg 3294181 atgtgccgca gatcggcgag cagcccgtcg gagacatcga tcatcgcttg agccccgaca 3294241 gccgcggccg ccgcgccgtg gccgtagggc ggctgcggca ccaaatggcg gcggcgcagt 3294301 tcggcgaagt cttcaatccc gttgcaccac agcgcatagc cagcagccga gcggcccagc 3294361 tcaccgacga cggccagcac cgagccggcc ttcgccccgg agcgcagcac cggggcacga 3294421 ccgtcaaggt caccaatcgc ggtgaccgac accacccact gccggcagct gaccagatcg 3294481 ccgccgacga tgccggcacc aatgcgcccc gcctcctccc acattccgtc gaccaacgcg 3294541 ctcgcctgcg ccgccggcgt ctcagcgggt gctccaaagc cgaccacgaa cgcggtggcc 3294601 cgcgccccca tcgcctcgat gtcggcggca ttctgggcga tcgccttgcg gccgacgtcc 3294661 tgcggtgtcg accagtccag ccggaagtga ctatcttgca ccagcatgtc cgtcgacacc 3294721 acagtgcgac catcgccggc agacaccagc gcggcatcgt cgccgggccc gagcagtacc 3294781 gtggcgggtt gtcggcgccc ccgcaccagc cggtcgatca cggcgaactc gccgagctgc 3294841 tgcagcgtcg gggactccgt tgcaagtgag tgatctttag tggtcacgcg acttgcaccc 3294901 cgtctcgggg ttgttcggca gccttggggc tgcttccctt ccgcgcttca cagccacctg 3294961 ccgggcgagg cccggtctta cggtcggctc cacgcttgac ggcggcccca actgggccga 3295021 cgatgctgga tgtttcctcg tagcgtgcga ggttgatggc agcgcagtca tcacgctgat 3295081 ggaccactga gcatcggtcg cattgccatt gttcgtccca gccgatgtct tgcacatgcc 3295141 ggcaggcgtg gcaggttttc gacgacggga accagcggtc ggcgaccacc agcgccgacc 3295201 cgtaccagac tgtcttgtag gacaagtgcc gacgcggagt gcccagggcc gcatccgaca 3295261 gtccgcgccg acgagcgcgg gcacccggca accctttttg ccgcaacatc tctgtcgcgt 3295321 ccaagccttc gacaacaatg cggccgtggg tttgagccaa ccgtgtcgtc aggacgtgca 3295381 ggtgatgggt gcggacatcg ttgacccggc gatgcaaccg ggaaatctga gtggtgcgct 3295441 cacggtagcg ccgtgaacct ttcgtgcaac gcgaacgggc ccggcacacg tggcgtagct 3295501 cgcgcagcgc ggcgccgagc ggtcgtgggt tctcaacctg ctcgatcgcc gtgccgtcag 3295561 cggtggcgac cgtcgccagg cgccggaccc cgacatcgac accaacccgc gaaccggggt 3295621 gcaccacctt cggctgctgc ggacgctgga caagcacccg cacactggca tccagacgag 3295681 tgccgttgcg gcgcaccgag atcgccaata ctcgcgcccg accggccttg atcaggcgtt 3295741 cgatacggcg ggtgttctcg tgcgtgcgga cggtcccgat gaccggcagg gtgaggtgac 3295801 ggcggtcggg ttccacacgc atcgctccgg tcgtgaacga cactcgatcc tggtcgcggc 3295861 ctttgcgttt gaaacgggga aacccgaccc gtttaccggc gcgtttgccg gcgcgggagg 3295921 tctgccagtt ccagtacgcc tcgaccgcac ccgcgatgcc atcggcgtag gcctcttttg 3295981 agcattcagg ccaccacgcg acaccggtct cggtgttgac gcacacgtcg tccttgacgg 3296041 tgttccagcg tttgcgcagc acgcgcagcg acggtttcgc tgtcacggtc ccgctggcat 3296101 gccacgcctg gatgtcggct ttcagggtgg ccacggtcca gttgtatgcc ttgcgacgag 3296161 caccgaaatg ccgtgccagc gccttggcct ggtcctcggt cgggtccagc gtgaaccgaa 3296221 acgcttggac cgtccagcca tcgggaacct cgaacttggg catcaggcgg cctcatggtc 3296281 ctcgccagca gcggccgcca atgcgcgctt ggttcgattc tcggcagccc gtttgccata 3296341 cagacgggcg cacatcgacg tcaggatctc ggtcatatcc cgcaccaggt cgtcaccgac 3296401 ctcggcagag tcgactacca ccagttcgcg gccctgcgcc gcaaacgctg cctgcacgta 3296461 cttcgagccc aaccggcaga accgatcccg gtgctccacc acgatccggt ggactgacgg 3296521 gtcgcgcagc agtgaaagga acttacggcg gtgctcgttg aacgcggaac cgacctcggt 3296581 cacgaccttg tcgactggca tctgttgggc cgcggcccac gcggtcaccc gcgcgacctg 3296641 ccgatccaga tcggctttct gatcggccga cgacacccgt gcatacaccg cggtcggtga 3296701 tcgcatgcca gcgtccccag ccggttcgtc gacgagaatc agtcggccca ctcgcctcgc 3296761 catcaccgac aacagaccag cacgaaacca gcggtaggcg gtcccccgag caacaccgtt 3296821 gcgctccgcc cacgtcgcca ggttcatatc tctgttccta ccgcacgcca ctgacaacta 3296881 ccgaccactc aacccgcaac agctggcacc ccccgatgcg tcgtcgccca cgccgcctcc 3296941 ttcggcccgt tctggccctg tggaccttcg aacacctcgc ccgacctgcg gtaagttgag 3297001 tcactgccgg cgcgagcgga ccgcgccagt gtatgagagc aaagaggtgg ccgcgcaggt 3297061 gacaggcgag tccgacgggc cgccgcgcgc cgtgctgatc gccgcggcgg cgctggcggc 3297121 ggcggtgatc ggggtaatcc tggttgtcgc ggcgaaccgc cagccgccgg agcgaccggt 3297181 tgtcattccg gccgtgcccg ctccgcaggc caccggtccc ggctgcaaag cactgctggc 3297241 ggcgctgcct caacgactcg gcgagtatcg gcgcgcgccc gtcgcggagc cgaccactgc 3297301 gggtgccacg gcctggcgaa cggggccaaa cagcacaccg gtgattttgc gctgtggact 3297361 cgaccgcccg gccgagttcg tggtgggttc ggccatccaa gtcgtcgatc gggtgcagtg 3297421 gtttcaggtg gccgcgcaaa acccggacga gccaggccgg tccacctggt acaccgtgga 3297481 ccggccggtg tatgtggcgc tgacactccc ctcgggatcg gggcccaccg cgatccagga 3297541 attgtcagac gttatcgacc acaccatccc cgcggtaccc atcgacccgg cgccggctcg 3297601 ctagtgccga tcgcaagcgc ggcgcttgcg ccgggcgcgg cgggtcggca ccatcgggct 3297661 aagtgccgat cgcaagcgcg gcgcttgcgc cgggcgcggc gggtcggcac catcgggcta 3297721 agtgccgatc gcaagcgcgg cgctagcgcc gggcgcggcg ggtcggcacc atcgggctag 3297781 tgcaggccca cgccgcgggc caatgccgtc tcgatcatcg tcgccagcag ggtcggatag 3297841 tcgacaccgc tggccgccca catccgcggg tacatcgaga tcgtggtgaa tcccggcatc 3297901 gtgttgatct cgttgatcac cggaccgtcg tcggtgagga agaagtccac cctggccaga 3297961 ccccggcagt cgatagccgc gaacgcccgg atcgccagct gacgaatcgc ctctgcgacc 3298021 tggtcatcga ccttggcggg cacgtccaat tcggctgcgt cgtcgagata cttggttgcg 3298081 aagtcgtaga aagagtcctc gcgtccccgc accccggcca cccggatctc ccccagcgtg 3298141 ctggcttcca gtgtgccgtc cggcatttcg agcacaccgc attccagctc gcggccgctg 3298201 atcgcggcct cgacgatgac cttagggtca tgccggcggg cccgcgcgac cgcggcgggc 3298261 agttgatccc aactcgacac ccggctaaca ccgatcgacg agccgcctcg ggcgggtttg 3298321 acgaacaccg gtaagcccag ccgttcgcac tcctggcggt gcagtgtcga ccgcggcgga 3298381 cgcagcaccg cgtacgcacc caccggaagt ccatcggcgg cgagcagctt cttggtgaac 3298441 tccttgtcca tgccgacggc actggccagc acaccggcgc ccacgtaggg caccccggcg 3298501 agttcgagca gtccctggat cgtgccgtcc tcgccgtacg ggccgtgcag taccgggaac 3298561 accacgtcga ccgactccag aacctcgccg gccccgggcg gcagcgacac caactggcca 3298621 ccacgccgcg gatcggccgg cagcgccagc tcggtgcccg atcctgattt gacctgagga 3298681 agctcccggt tggtgatcgt cagggcgtcg gggttggcgt cggtgagcac ccacgaacct 3298741 gccggggtga tacccaccgc gatcacgtcg aaccgccgcg agtccaggtt gcgcaggatg 3298801 ctgccggcgg acacacacga gatggcgtgc tcgttgctgc gcccgccgaa cacgacggca 3298861 acgcggacac gccgatcgtt agcactcaca acctgcagag gctaccgggt caggcagacg 3298921 ggctcccacg agctgcagtt ttcggtcgtg ccggcccgtg cgaggctcat tcgggcttgg 3298981 tgcggcgacc cagcagcagc gttatcgcct cgtccaccga cagcccttta tgacagaccc 3299041 gatgcaccgc gtcggtgagt ggcatttcga cgtcgtagct ggacgccagc gcgagcacgg 3299101 attcgcacga cgtcacgcct tcgacgacat gacaagcctt gcccgccgac tgcaacgttt 3299161 cgccccggcc caggcgttcg ccaaacgatc ggttgcgcga acgcggtgag gtgcaggtgg 3299221 ccaccagatc accgacccct gccagaccgg ccaacgtcgc gccgttggcg ccgagcgccg 3299281 tcccgagccg gatgatctcc gccaggcccc gggtgatgat cgcggccgcg gtgttttcgc 3299341 ccagcccgat gcccaccgcc attccgcacg caagcgcgat gatgttcttg cacgccccgc 3299401 cgatctcggt gccgacgaca tcggcgttgg tgtaggggcg gaagtacccg ctgttcagcg 3299461 cgcgctgcaa ggcaaccgcg cggccggagt cgctgcacgc gacgacggta gcggcgggct 3299521 ggcattcggc gatctcgctg gccaggttgg gtccagagat caccgcgacc tgcgccggct 3299581 cggcaccggt caccgagatg atgacctggc tcatccgcat cagggtgccc aactcgatgc 3299641 ccttggccag actgaccaag gtcgcaccct cgggcaacag gggagcccac cgctcgagat 3299701 tggcccgcat ggtctgcgcg ggcactccca acagcaccgt ggatgcgccc ccaagtgcct 3299761 cctcggcatc tgcggtggca tgaatgctcg gtggtaacag cgcaccgggc agatagtcgg 3299821 ggttatatcg ggtggtattg atctgatcgg ccacctcagc tcgccgcgcc cacagcgtga 3299881 cctctccgcc cgcgtcggcc agcaccttag ccagggccgt gccccatgca ccggcgccca 3299941 tcaccgcgac ggtgcttgct attccggcca tccacacaca ctaatctgcg ccgcggttgc 3300001 cgtcgggacc gtgcctgggc cccggccacg accgtggcgg caatgccgtc gaagtgtgcc 3300061 gcgtggatcg acgctggcag gatgacttca tgagcggcac accggacgac ggcgatatcg 3300121 gcttgatcat cgccgtcaag cgcttggccg cggccaaaac caggctggcc ccggtgttct 3300181 cggcgcagac tcgcgagaac gtggtgctgg ccatgctcgt cgacacgttg accgccgcgg 3300241 cgggtgtcgg ttcactgcgc tcgatcactg ttatcacccc cgacgaagcc gcggcggctg 3300301 cggcggccgg gctgggcgcc gatgtactgg ccgacccgac acccgaagac gatcccgacc 3300361 cactgaacac cgccatcacc gctgccgaac gcgtggttgc cgaaggggcc tccaacatcg 3300421 ttgtgctgca aggcgatttg ccggcattac agacacagga actcgccgag gcaatctcgg 3300481 ccgcacgcca ccatcggcgc agcttcgtcg ccgaccggct tgggaccggc accgcggtac 3300541 tgtgtgcgtt cggcaccgcg ctgcacccgc ggttcgggcc ggattcgtcc gcgcggcacc 3300601 gccgttcggg cgctgtcgag ctgacaggag cctggccggg cctgcgctgc gatgtcgaca 3300661 cccccgccga cctgacggcc gcacgccagc tcggggtagg gcccgcgacc gcgcgagcgg 3300721 tcgcacatcg ttgaccggga cggggcaacg ccggcgaggc atccaggggg tgaacggcag 3300781 accaacggcg aacggatgcc tgccgagtgc tggcaacccc acccaatgat gagcaatgat 3300841 cgcaaggtga ccgaaatcga aaacagtccc gtcacagagg tgcggccaga ggagcatgcg 3300901 tggtatccag acgactcggc gctggcggca ccgcccgctg ccacccccgc cgcgattagc 3300961 gaccagctac cctcggatcg ctacctgaac cgggagctga gttggctgga cttcaacgcg 3301021 cgcgtgcttg ccctggccgc cgataagtcg atgccattgc tcgagcgcgc caagtttctg 3301081 gcaatcttcg cgtccaatct cgacgagttc tacatggtcc gggtggccgg cctcaaacgc 3301141 cgcgacgaga tggggttgtc ggtgcgctcc gccgacggtc taacaccgcg cgaacaacta 3301201 ggccggatcg gcgagcagac tcaacagctc gccagccggc atgcccgggt gttcctcgat 3301261 tcggtgctac ccgcgctcgg cgaggaaggc atctacatcg tcacctgggc cgatttggat 3301321 caggctgagc gcgaccgatt gtcgacctat ttcaacgaac aggtcttccc cgtcctgacc 3301381 ccgctggccg tcgatcccgc ccacccgttc ccgtttgtca gcgggttgag cttgaacctg 3301441 gcggtcacgg tacgccaacc tgaagacggc acccagcatt tcgcgagggt caaggtgccc 3301501 gacaacgtcg accgcttcgt cgaactcgct gcacgtgagg ccagcgagga agctgcgggg 3301561 accgaaggcc ggaccgcgct gcggttcctg ccgatggagg agctgatcgc ggccttcctt 3301621 ccggtgcttt tcccgggtat ggaaatcgtc gagcaccacg catttcgcat cactcgcaac 3301681 gctgacttcg aggttgaaga ggatcgcgac gaggacctac tgcaggcgct cgagcgagaa 3301741 ctggcccgcc gccggttcgg ttcaccggtg cgactcgaga tcgcagacga catgaccgag 3301801 agcatgctgg agttgctgct tcgcgaactc gacgtgcatc ccggtgatgt catcgaagtg 3301861 cccgggctgc tcgacctatc gtcgttgtgg cagatctacg ccgtggaccg cccgacgctt 3301921 aaggatcgga cattcgtccc agctacccat cccgccttcg ccgagcggga aacacccaaa 3301981 agcatcttcg cgacgctgcg cgaaggcgat gtgctggttc accatccgta tgactcgttc 3302041 tccaccagcg tgcagcgatt catcgaacag gccgcggccg accccaacgt gctggcgatc 3302101 aaacagacgc tgtaccgcac ctccggcgac tcgccgatcg tccgggcgct gatcgacgcc 3302161 gccgaagccg gaaagcaagt ggtggcactg gtcgagatca aggcacgctt cgacgaacag 3302221 gccaacatcg cctgggcgcg cgcactagaa caagccggcg tgcatgtggc gtacgggctc 3302281 gtcgggctca agacgcactg caagaccgcc ttggtggtgc gccgcgaagg tccgacaatc 3302341 cggcggtact gccatgtcgg caccggcaat tacaacagca agacagcacg actctacgag 3302401 gacgtcggac tgctgaccgc tgcacccgat atcggcgccg acttgaccga cttgttcaat 3302461 tcgctcaccg gctactcacg caagttgtcc taccgcaact tgttggtggc cccgcacgga 3302521 atccgcgccg gcatcattga ccgcgtcgag cgggaggtcg cggcgcaccg tgcagagggt 3302581 gcccacaacg gcaaaggccg catccgactc aagatgaatg cccttgttga tgagcaggtc 3302641 atcgatgcgc tgtaccgcgc gtcgcgagcc ggtgtgcgga tcgaggtggt ggtacgcggc 3302701 atctgcgcgc tgcgtccagg tgcgcagggc atttcggaaa acatcatcgt gcgctcgatt 3302761 ctcggccgct tcctcgagca ctcgcggatc ctccatttcc gtgccatcga cgagttctgg 3302821 atcggcagcg ccgacatgat gcaccgcaac ctcgaccggc gagtcgaggt tatggctcaa 3302881 gtcaaaaacc cgaggctgac cgcgcagctg gacgaattgt tcgaatccgc actggacccg 3302941 tgcacccggt gctgggagct cgggcccgac gggcagtgga ccgcgtcgcc gcaagaaggc 3303001 catagcgtgc gcgaccatca ggaatcgctg atggaacggc accgcagccc ctgacactgc 3303061 gtggtgattc ccgctgctgc accgaccaca tccacgaccg cgagcagcct ggccgaattg 3303121 acctgcagga gttgaggtgt cgatccagaa ctcgtccgcc cgccggcgct cggcgggccg 3303181 gattgtgtac gccgccggtg cggtgctctg gcgacccggc agtgccgatt cggaagggcc 3303241 ggtcgagatc gctgtcattc accgcccccg ttacgacgac tggtcgctgc ccaagggcaa 3303301 agtggatccg ggcgagaccg caccggtggg ggcggtgcgg gagatactcg aggagaccgg 3303361 tcaccgcgcc aacctgggta ggcggctcct gacggtgacc tacccgaccg actccccttt 3303421 tcgaggcgtc aagaaggtgc actactgggc agcgcgcagc accggtgggg aattcacccc 3303481 cggcagtgag gtcgacgagc tgatctggtt accggttccc gacgcgatga acaagcttga 3303541 ctacgcccag gatcgaaaag tcctgtgccg gttcgctaaa cacccggcgg acactcagac 3303601 ggtgctggtg gtgcggcatg gcaccgcggg cagcaaagcg cacttctccg gggacgacag 3303661 caagcgaccg ctagacaaga ggggtcgtgc gcaggcagaa gcgttggtac cacagctgct 3303721 ggcgttcggc gccaccgatg tttatgccgc cgaccgggtg cgctgccacc agacgatgga 3303781 gccactcgcc gcggaactga acgtgaccat acacaacgag cccaccctga ccgaagagtc 3303841 ctacgccaac aaccccaaac gcggccgaca ccgagtgctg cagatcgtcg agcaagtagg 3303901 cacacccgtg atctgcacgc agggcaaggt cattcccgat ctgatcacgt ggtggtgcga 3303961 gcgcgacggt gtgcaccccg acaagtcccg caatcgcaaa ggcagcacgt gggtgttgtc 3304021 gttgtcagcc ggcaggcttg tgacagccga ccacatcggc ggtgcgctgg ccgccaacgt 3304081 gcgggcctaa cacacggata cccttcgtca cattgccacc gtgcaaaggg tatccgtgtg 3304141 tcttgaccta tttgcgaccc cgccgagcgg ttgccttctt ggcgggagcc ttggtagccg 3304201 gccgcttggc cgctgccttc tttgccggcg ccttggtcgc cgccttacgc accgatgcct 3304261 tgaccgcggt cttcttcacc gccttggtca ccttcttggc gggtgacttc gtggccttga 3304321 cagctttctt ggcgggcgcc ttggtcgccg ctttcttggc gggcgccttg gtcgccgcct 3304381 tcctggcggg cgccttggtc gccgccttct tggcggcctt tgtcgccttc ttggcaggtg 3304441 ccttcttcgc taccttcttg gctgcactgg cccccacacc acgcttaaca gcgggtcctt 3304501 ctgccgggag acgctgcgcg ccagacacaa ccgctttgaa ttgcgcgccc gggcggaacg 3304561 ccggcaccga cgtcggcttc acctttactg tctcgccggt acgcggattg cgggccactc 3304621 gagccgcgcg gcgacgctgt tcgaacacac cgaacccggt aatggtgacg ctgtcgcctt 3304681 tgtgtaccgc acgcacaatc gtgtcaacga cattctcgac ggcggcggtc gcctgccgac 3304741 ggtccgagcc caatttctgt gtgagcacgt caatgagctc tgctttgttc atcccaaccc 3304801 tccgaaacca gtggtcctcg tttggaaccg actagtggac acggtaaacc cttacccggc 3304861 tgatttccaa gagccacgcg caatttcact gagccaacga ccggtttttc gcaatccggt 3304921 tgccgccctt gaccggtggc gcggccccaa aatggctcag gttctgccgg cgggtcacgc 3304981 tgaaatttcg cccggttcta cgcctcaggg ggcgggtaga gtgcgcggtt tccagtacgc 3305041 gcacgcaccc tcaaaggcct cgatctcgtc gagtttccgc agcgtaaggg ctatatcgtc 3305101 gagaccttca agcagccgcc acgccgagtg gtcgtcaatc ttgaacggca gcaccactgt 3305161 tgctgcggtg ataattcgat cttgaagatt ggcagtgatt tccaggcccg gactctgctc 3305221 aatgagcttc cacaggagtt ccacatcgtc ttgggcaacc tcggccgcca gcagcccggc 3305281 cttgcccgcg ttgccgcgga aaatgtcacc aaatcgggat gagataacca cccggaatcc 3305341 gtagtccatg agcgcccaga ccgcatgctc tcgcgaggat ccggtgccga aatcgggccc 3305401 ggcaaccagg accgaacccc ggtcaaaggg actgaggttt agcacgaatg caggatccga 3305461 ccgccaaccc gcgaacaagc cgtcctcgaa accggttcgg gtgacccgct tcagaaagac 3305521 cgcgggaatg atctgatcgg tgtcgacatt ggaccgccgc aacggcacgc caataccaga 3305581 gtgggtgtga aaggcttcca tgctgatccc ctagctgttc tcagttcaat tcaaatcggc 3305641 cgggctggac agtgtgccgc gaaccgcggt ggcggccgcc actgctgggg acaccaaatg 3305701 tgtgcggccg cccgcgccct gccgcccttc gaagttgcgg ttggacgtcg cggcgcagcg 3305761 ctccccggac gccagctgat cgggattcat gcccagacac atcgagcatc ccgcctgccg 3305821 ccattgcgcg cccgcgtcgg tgaagatctc accgagccct tcggcctcgg cctgcgcgcg 3305881 tacccgcatt gagcccggaa cgatcagcat ccgcacgccg tcggccacct tgcggccacg 3305941 cagcacttcg gcgaccaccc gcagatcttc aatgcgaccg ttggtacacg acccgacgaa 3306001 cacggcgtcg accgcgattt cgcgcatcgc ggttccgggt cgaaggtcca tgtacgccaa 3306061 tgctttctcg gcggcctgcc gctcggcgtc gtcggtcatc agttgcggat ctggcaccgc 3306121 ggccgccagc ggtacccctt ggcctgggtt ggtgccccag gtgacaaacg ggctcaacga 3306181 cgcggcgtcg agatacacct cggtgtcgaa aacggcgccg acgtcggtgc gaagccgttg 3306241 ccagtagacg agtgcggtgt cccactgggc accggtgggt gcgtgcggac gaccacgcaa 3306301 gaacgcgtag gtggtttcgt ccggagccac catgcccgca cgagcgccgg cttcgatgct 3306361 catgttgcag atcgtcatcc ggccttccat ggacagcgat tcgatggcgc tgccccggta 3306421 ttcgatgaca tgcccctggc cgccgccggt gccgatcttg gcgatcaacg ccaggatgat 3306481 gtccttggcc gacacaccgt cgggcagccg cccatcgacg ttgaccgcca tggtcttgaa 3306541 cggccgcagc ggcagcgtct gggtggccag cacgtgctcg acctccgacg taccgatgcc 3306601 catcgccaac gcgccgaatg cgccgtgggt tgaggtgtgg ctatcgccac agacgatcgt 3306661 cattcccggc tgggtgagac ccaattgcgg tccgacgacg tgcacgatgc cctgctcgat 3306721 atcgcccatt gaatgcagcc ggattccgaa ttcggcgcag tttcggcgca acgtctccac 3306781 ctgggtgcgt gacaccgggt cggcgatcgg ctggtcgatg tcgacggtgg gcacgttgtg 3306841 atcctcggtg gcgagggtga gctcgggccg ccgcacccgg cgcccggcca ggcgcaggcc 3306901 gtcgaacgcc tgcgggctgg tgacctcatg caccagatgc agatcgatgt agatcaagtc 3306961 gggcgcacag cccccgcctg ataccacaat gtggtcgtcc caaatcttct cggccagtgt 3307021 gcgtggctcg ccggtctgca aggccatctc gaagtgcctc tattcattcg ttcgcgactc 3307081 gctggtcatc tcaaaatacg agacgctatg atctctttgt gagacagcat agcggtatcg 3307141 gtgtcctcga caaagccgtt ggcgtgctgc acgcggtcgc ggaatctccc tgcggactgg 3307201 ccgaactctg cgatcgaacc gacctgccca gggccaccgc ataccggctg gcggccgcgc 3307261 tggaggtgca tcgcctgctg gggcgcggcc aggatggcca ctggcggctc ggtccggcca 3307321 tcaccgaact cgcgacccat gtcgacgatc cactgctggt ggcgtgcgcg gcggtactgc 3307381 ctcagctgcg cgacgccacc ggcgaaagcg tgcaggtata tcgccgcgag ggaacgtcgc 3307441 gggtctgcgt ggccgcattg gaaccagctg cgggccttcg cgatacggtc ccggtcgggg 3307501 cacggttgcc gatgaccgcg ggctcgggcg ccaaagtgtt gctggcccac accgacgccg 3307561 ccacccaagc ggccgtattg ccaaaggcgg tgttcagcgc ccgagcgctg gccgaggtgt 3307621 gccggcgcgg ctgggcgcaa agcgtggccg aacgcgagcc tggcgtggcg agcgtgtcgg 3307681 cgccggtgcg cgacggccgg ggcgtcgtga tcgctgccat ctcggtgtcc ggcccgatcg 3307741 accggatggg ccgccgcccg ggggtccgat gggccgccga cctgctgtcc gcggcggacg 3307801 cgctcacccg acggctctag ccgcgttgtg ctacatcggt tcgaccgcga tcacatagtc 3307861 attgccgtgc cacagaccgt cttgccgctc gttgagctgc aatgcccgag cgcgcagttc 3307921 ttccacatag gcacgcattg ccatgccaag cccgttggag gagaatcgct cgattcgggc 3307981 caagcacatg ttgagttggc cgttcacgta gcgagctcgg tagcggatgg ggaagcggcg 3308041 ctcttcaaga atgcgaaagc ccgcaaggcc cagtcgcccc agcatccagt ccagcgggaa 3308101 ctctcggtac ggtcgttcgc cggcaagcaa caggcaggcg tcgcgcacgc gaccgatttc 3308161 ccagatgatt ttgccacttt cggtttccgg ctcgaattgc acgtagggct ccaagccgac 3308221 taggtaaaga cgaccatgat cggcgagatg cgggcgcaac cgctcgaaca cgcggtcctg 3308281 ccagtacggg gcgaagcctt cgatggcccc gaccaggtag tcgaccaaga tggtgtcgaa 3308341 cgtctcgccg gcaagaaggc tgtcgtctac ccagttgccg acgagcaggc ggtcctgcgg 3308401 gcgcatggcg ctacccaacg cggcgcgggt cttgtccgcc aggctgcggg cggccgtgac 3308461 cgccgtccag cgctcggtcg gcaaagtctg tatccactga agcgatttca caccggtacc 3308521 ggcatccaag acagtgcccc agggtctttc gccgtgcacg ccttcgatgt agcggaacaa 3308581 ggatgagatc ccggccctca gtatgtacga gcgaccgtgg cgggcgtgta ggtcttcgat 3308641 gtggcggatc agggctgcga tcttgggcat ttcggcccag gtcacacaca tcgcagacgt 3308701 ccatgcggcc ggttcggccg agcgcggtat cgcggcgccg gcttcagacc ctgccaaccg 3308761 agcgatcgtc gtgggtgctt cctcggagta accactgtga tgtcttcctc acggctgaag 3308821 ctggcggact accgatgaac cgacccaccg aaactctata gcaaacgata ttcattttca 3308881 aactaggcac cgcgagcgtc actggggtgg cgacgacgcg ctaccggcgg agccttgctg 3308941 acacactgac gccatgggaa ccaaacagcg cgccgacatc gtcatgtccg aggctgaaat 3309001 cgccgacttc gtcaactcga gccgtaccgg aacgctggcc accatcggac ccgacggcca 3309061 gccgcacttg acggcgatgt ggtatgccgt gatcgacggc gaaatctggc tggagaccaa 3309121 ggccaagtcg cagaaggccg tcaacctccg acgggatccg cgggtgagct tcctgcttga 3309181 agacggcgac acctacgaca cgctgcgcgg cgtgtcgttc gagggcgtta ccgagatcgt 3309241 cgaggagccc gaggcgctgc accgcgtcgg ggtcagcgtg tgggaacgct acaccggccc 3309301 ctacaccgac gagtgcaaac cgatggtcga ccagatgatg aacaagcggg tcggtgtgcg 3309361 catcgtggcc cgtcggaccc gctcgtggga tcaccgcaag ctggggctgc cacacatgtc 3309421 ggtgggtggc tcgaccgccc cgtagctgcc cggcgagcag acgcaaaatc gcccatttcg 3309481 agacgaaatt gggcgatttt gcgtctgctc ggcagttgta gccccgatgg gattcgaacc 3309541 cacgctaccg ccgtgagagg gcggcgtcct aggccgctag acgacggggc cggaaccgat 3309601 ccgagctgcc agcatagctc acgccttgtg ctggggtacc aggactcgaa cctagaatgg 3309661 ctgaaccaga atcagctgtg ttgccaatta caccataccc catgggctgc ctaaaaccgc 3309721 tgccgccagc tgttatgggc cgacgtgcag actaccaaag attcgccaca caaggctcac 3309781 gcgtgcccga ccagctggcg cgccgcgcgc agccgctgca tgctgcggtc acgaccgagc 3309841 agctccagcg attcaaacaa cggcgggctg acggtcgtgc cggtggcggc cacccggatg 3309901 gggctgaacg ccttgcgggg tttgagcgcc aaaccttcga tcaaggcgtc cttaagggcc 3309961 gcctcgatca ggggtgccgt ccagtccgtc acacttgtca gcgcggccag ggccgcgtcg 3310021 agcaccgcgg ccccgtctgg gcctagctcc ttggccgcgg ccttgggatc gatcacatac 3310081 tgatcgtcgt tgaagaactt caacagctcc cacgcgtcac cgagcaccac gatgcgggtc 3310141 tgcaccaact cggcggcggc ggcgaatgcc gcctcatcca acgcgatgtg atggccgtgg 3310201 gtatccagat ggtcgcgcag cctgaccgtg aagtcgccca cgtcgagcat ccggatgtgc 3310261 tcggcgttca gcgcgtcggc cttcttctgg tcgaaccggg ccgggctgga gttgacgtcg 3310321 gcaacgtcga acgcggccac catctcgtcg agaccgaaca ggtcgtggtc gtcggctatg 3310381 gaccagccga gcaacgcgag gtagttcagc aggccttcgg ggatgaaccc gcggtcgcgg 3310441 tgggcaaaca ggttcgactg cggatcgcgc ttcgagagct tcttggtgcc ctcccccaag 3310501 accgttggga ggtgcgcgaa tttcggaatc cgctcagcta ccccgatcct gatcaacgcc 3310561 tgatgtagcg ccagctggcg cggcgtcgac ggcagcaggt cctcgccacg caacacatgg 3310621 gtgatcttca tcagcgcgtc gtcgcacggg ttgaccaagg tgtataacgg atcaccgctg 3310681 gctcgggtca acgcgaagtc gggtacggag ccagccgcga acgtcacggg cccgcgcacc 3310741 aggtcattcc aagcgaggtc gtcatcgggc atccgcagcc gcaccaccgg ctggcggccc 3310801 tccgccaggt acgccgcacg ctgcgcgtcg gtcaagtgac gatcgaaatt gtcgtaaccc 3310861 agcttgggat tgcgcccggc cgcgacatga cgggcctcca cttcctcggg tgtggagaaa 3310921 gcgtggtagg cctcgcccgc ggcgagcagt cgggcgagca cgtcacggta gatttcggcg 3310981 cgctgcgact gccggtacgg cccgtacggc ccacccacct cgggcccctc atcccaatcc 3311041 aggccaagcc agcgcagcgc gtccagcagc gccagatagc tttcctcgct gtcgcgttgg 3311101 gcgtcggtgt cctcgatgcg gaacacgaag gtgccaccgg tgtgccgggc gtaggcccag 3311161 ttgaacagcg cggtgcggac cagaccgacg tgcggagttc cggtgggtga agggcagaat 3311221 cggacccgga ctgtttccgt ggcggtcacg gctttccttt gcggactacg ggattggtga 3311281 gggtgccgat tccctcgatg gtgatcgaga cggtgtcgcc gtcctcgatg ggaccgactc 3311341 ccgcgggtgt gccggtgagg atgagatcac ctggcagcaa ggtcattatc gccgagatcc 3311401 attccacgat ggcgccgatg tcatggatca tcagcgaggt gcgggcgtgc tgtttgacgt 3311461 cgccgttgac gacggtgcgc agctcgagat cggccgggtc aaagggagcg aggtcggtga 3311521 cgatccacgg cccgaccggg cagaaggtgt cgtgcccctt ggctcgcgtc cactgaccgt 3311581 cggattgctg ctgatcgcgg gccgacacgt cattgccgat ggtgtagccg aggatattgt 3311641 cgacggcctg ggcggccggg acatccttgc acgcccggcc gatcacgatc gccagctcac 3311701 cctcgaagtg caccggtgat gcgttggcgg gcaatcgaat tggcgtattc ggaccgatga 3311761 tcgcggtgtt gggcttgagg aatatcaccg ggtctgccgg cggccggcca cccatttcgg 3311821 cgatgtgatc ggcatagttc ttcccgacac agaccacctt gctcgccagt atcggagcca 3311881 gcaggcgaac gtcggccagc ggccaggagc gtccggtgaa ggtcggcgta ccgaacgggt 3311941 gctcggcgat ctcgcgggcc gtcatctcac tcggctcgcc cagctcgccg tcgatgctgg 3312001 caaaagcgac accgtccggg ctggcgattc gaccgatacg catttggatg agcttagccg 3312061 ggccctgccg ggcgacgatt cgggccggca cggcccgatg aggagcccgg caatcagacc 3312121 ctgccgggcg acgattcggg ccggcacggc ccgatgagga gcccggcaat cagaccctgc 3312181 cgggcgacga ttcgggccgg cacggcccga tgaggagccc ggcaatcaga ccctgccggg 3312241 cgctgcgggc cctcaccatc gggccccgtg ccgggtgact gtgccagcat gggtggatgt 3312301 cgcgagatcc gactggggtg ggtgcgcgct gggcgatcat gatcgtctcg ctgggggtga 3312361 ccgcaagctc gtttctcttc atcaacggtg tcgcgttctt gatcccccgg ctggaaaatg 3312421 cgcgcggaac cccgctatct cacgcgggtc tgttggcgtc gatgcccagc tggggcctgg 3312481 tggtcacgat gttcgcctgg ggctatctgc tcgatcacgt cggcgaacgg atggtgatgg 3312541 ccgtgggctc ggcgctgacc gccgcggccg cctacgccgc ggcatcggtt cattcgctgc 3312601 tgtggatcgg tgtcttcctg tttctcggcg gcatggccgc cggtggttgc aacagcgccg 3312661 gcgggcggct ggtctcgggt tggttcccgc cccagcaacg cggtctggcc atgggaatcc 3312721 gccagaccgc acaacctttg ggcatcgcct ccggcgcgtt ggtgataccc gaactggccg 3312781 aacgcggggt gcacgcaggg ctgatgtttc ccgccgtcgt gtgcacgttg gccgcggtgg 3312841 ccagcgtgct cggtatcgtc gacccaccgc gaaaatcccg cacgaaagcc tccgaacagg 3312901 agctggccag cccttatcgg ggatcgtcga tcctgtggcg gatacacgcg gcgtcggcgt 3312961 tgctgatgat gccgcagacg gtgaccgtga cgttcatgtt ggtctggctg atcaaccacc 3313021 acggctggtc ggtcgcgcag gccggtgtct tggtgaccat atcgcagctg ctgggggcgc 3313081 tgggccgggt cgcggtcggc cgctggtcgg accatgtcgg gtcacgcatg cgtcccgtcc 3313141 gcctgatcgc cgctgccgcc gcggcgacgt tgtttctgct cgcggcggtc gataacgagg 3313201 gctcgagata tgacgtgctg ctcatgatcg ccatctcggt gatcgccgtt ctggacaacg 3313261 ggctagaagc caccgcgatc accgagtacg ccggaccgta ctggagtggc cgggcgctgg 3313321 gtatccagaa cactacgcag cggctgatgg cggccgccgg acccccactg ttcggtagtt 3313381 tgatcaccac ggcggcctac ccgacggcat gggccttatg cggtgtgttc ccgctggccg 3313441 cggtgccgct ggtgccggtt cggctgctcc cacccggctt ggagactaga gcgcggcggc 3313501 aatccgttcg ccgacatcgc tggtggcaag ccgttcgctg ccacgcgtgg ccaaatgggc 3313561 ctcgacggcc cggtccaccc gggcagccgc gtcgtgttcg ccaaggtggg acagcaataa 3313621 cgccaccgac atgatcgccg ccgtcgggtc ggcgatgccc tgaccggcga tgtccggcgc 3313681 gctgccatgc accggctcga acatcgacgg gttggcccgg gtcgcgtcga tattcccact 3313741 ggccgccaag ccgataccgc cacataccgc cgcggccaga tcggtgatga tgtcgccgaa 3313801 caggttgtcg gtgacgatca cgtcgaagcg acccgggtcg gtgatcatgt ggatggtggc 3313861 ggcgtcgacg tgctggtagg ccacctcgac gtccgggtag cattcgccga cctcgtcgac 3313921 ggtccgcaac cacaatcccc cggcgagggt caacacgttg gttttgtgca ccaatgtcag 3313981 atgcttgcga cgccgtcgag cccgctcgaa cgcgtcggca accacacgcc gcacaccgaa 3314041 cgcggtgttc acgctgactt cggtggccac ctcgttgggc gtgccgacgc gaatcgcccc 3314101 gccgttgccg gtgtagggtc cctcggtgcc ctcgcgcacc accacgaagt cgatgccggg 3314161 attgccggac agcgggctgg ccacccccgg atacagccgg gccggacgca ggttgatgtg 3314221 gtgatccagc tcgaagcgca gtcgcagcaa cagaccgcgc tccaagacgc cgcttggcac 3314281 cgacgggtca ccgatcgccc cgagcaggat cgcgtcgtgg ttgcgcagct cggccaccac 3314341 cgagtccggc agcacctcgc cggtggcatg aaagcgccgc gcacccaggt catagctggt 3314401 tttctggacg cccggcacaa ccgcgtcgag cactttgacc gcctcggcgg ttacctcggg 3314461 cccgatcccg tcaccggcaa tgatcgcgag tttcatcggc gtggaagggc tcacgacaga 3314521 tcgacaacct cgagcttgta ggcgtccacc gccgccgcga tcgccgtccg cacgtcgtcg 3314581 ggcacgtctt ggtccagccg cagcagaatc gtcgcgcccg ggccttcggc gtcttcggag 3314641 agctgcgcgg cctggatatt caccccggcc gtccccagca acgtgccgat cttgcccagc 3314701 gctcccggcc ggtcgacgta gtggatgatc aggttgatcc cctgggcgcg cagatcaaag 3314761 tggcggccgt tgatctgcac gatcttctgc gacagctgtg ggccatacag cgtgcccgag 3314821 acggtcacca ccgaaccgtc cgcgccgacc gcgcgaacgt cgacgacgct gcggtggttg 3314881 gggctttccg aggccttaca gatctcggcg gtgacgccac gttcggcggc caatgccggt 3314941 gcgttgacaa atgtcaccgc atcctcgatc accgccgaga acaggccgcg cagcgccgaa 3315001 aggcgcagca cctcaacctc ttcggcggcc agctcaccgc gcacctgcac cgacaacgac 3315061 accggcagtt cgtcggacaa cacacccgcc agcacgccga gcttacgcac cagatccagc 3315121 cagggcgcca cctcctcgtt gaccactccg ccgccgacgt tgaccgcgtc gggcacgaat 3315181 tcccctgcca gggccagccg cacgctctcg gcgacgtcgg tgcccgcccg gtcctgcgcc 3315241 tccgcggtgg acgcacccag atgcggtgtg accaccacct gtgccagctc gaacagcggg 3315301 ctgtcggtgc acggttcggt ggcgaacacg tccagaccgg ccgcccgcac gtggccgccg 3315361 gtgatcgcgt cggccagtgc cgcctcgtcc accaggccgc cgcgcgcggc gttgacgatg 3315421 atgacgcccg gcttggtctt cgccagcgcc tccttgtcga tcagtcccgc cgtctccggt 3315481 gttttcggta ggtgcaccga gatgaaatcg gcgcgggcca gcaggtcgtc cagggacagc 3315541 agttcgatgc ccagctgcgc cgcacgggcc ggcgaaacgt acgggtcata ggcgacgacg 3315601 taagcgccga acgcagcgat ccgctgggcg accaactgcc cgatgcggcc cagacccacc 3315661 acgccgacgg ttttgccgaa gatctcggta ccggaaaacg acgaacgctt ccaggtgtgc 3315721 tcgcgcagcg acgcgtcggc cgccggaatc tggcgtgagg cggccagcag cagcgccagc 3315781 gcatgctccg cggcgctgtg gatgttcgac gtcggggcgt tgaccaccag cacgccgcgg 3315841 gccgtcgcgg cgtccacgtc gacgttgtcc agcccgacgc cggcgcgcgc gacgatcttg 3315901 agcttggggg cggcggccag cacctcggcg tcaaccgtgg tggccgatcg caccagcagc 3315961 gcgtccgctt cgggcaccgc ggccagcagc ttgtctcggt ccggaccgtc aacccagcgc 3316021 acctcgacct gatctcccaa ggcggcaacc gttgatgggg caagtttgtc ggcgatcaac 3316081 acaacaggca ggctcacgcc gatagcgtat cggctgtaat tgacgagtgg acgtcaccgt 3316141 cgtcggcagc ggacccaacg ggctcgccac ggccgtcatc tgcgcccgcg cgggcctgaa 3316201 cgtgcaggtc gtcgaggccc aggcgacctt cggcggcggc gcccgcagcg cggccgactt 3316261 cgaatttccc gaagttttac acgacgtgtg ctccgcggtg catccgcttg ctttggcgtc 3316321 gccgtttttc gccgaattcg acctacccgc gcgcggagtg acgctgaccg tgcccgacat 3316381 cgcctacgcc aacccgctac ccgggcggcc cgcggcgatc gcctatcacg atctggcgca 3316441 caccagcgcc aagctggacg acggcgcgtc ctggcggcgc ctgctgggcc cgttggtggc 3316501 gcactcggag acggtcgtgg agttcatgct ctccgacaag cggtctttgc ctactgcact 3316561 gggctcggtc ctgcgtctcg ggctgcggat gctggcccag ggcacccctg cctggcggtc 3316621 gctggcgggc gaggatgccc gcgcgttgtt caccggcgtt gccgcccacg cgatttcacc 3316681 gttgccgtca ctggtgtcgg ccggcgccgg actgatgctg gcaacgctgg cccattcggt 3316741 cggctggccg attccggtgg gcggcaccca ggcgatagcc gacgcgctga tcgccgatct 3316801 acgcgcgcat ggtggtcggc tcgcggccgg tgtcgagatc accgaaccgc aaagaagtgt 3316861 ggtcgtcttc gacaccgcac ccaccgccct gctgcgggtt taccgcgaca agcttccaca 3316921 tcggtatgcc aaagcattgc gccgctatcg atttcgcgct ggcatcgcca aggtggactt 3316981 cgtgctcagc gacgagatcc cgtggtcgga tccgcggctg cggcgggctg cgaccctgca 3317041 tctcggcggc acccgtgacc agatggcgcg cgccgaggca gacgtcgcgg cgggacgcca 3317101 cgccgactgg ccgatggtgc tggccgcgtg tccgcacgtc gccgaccccg gccgcatcga 3317161 cgaaaccggc cgccgtccgt tctggaccta tgcccacgtg ccgtcggggt ccacgctcga 3317221 cgcgaccgag accgtaacca gcgtcctcga gcggttcgcc cccggcttcc gtgacatcgt 3317281 ggtggcggcc cgcgccgtgc ccgccgcgcg gatggccgac cacaacgcca actacgtcgg 3317341 cggtgacatc acggtcggcg ccaactcgac ctggcgcgcg atcgccggcc ccaccccgcg 3317401 gttgaatccc tggcgcacac cgattcccaa ggtgtacctg tgttctgcgg cgactccgcc 3317461 cggcgccggc gtgcacggca tgtgcggctg gtatgccgct cgaacgctgt tgcgcaccga 3317521 gttcggcatc acccgcatgc cccccttggg ccatgagctg aggccataac gaagcttgcg 3317581 atcatcgact attcggaggc gcgccaggcg gcagcggcga caaccggaac gtcggcacgg 3317641 tgctcaatca cgggtgcacg gtgtgcatca gaatggcggg ggttcgttgt cgcggtgagg 3317701 cgttcggcga ggaggtagtg tctacccctt gcccgcgggt tcgtgcggac tgaagggatt 3317761 tcattgggac ccacggctgc gtatcgcagg gcctcggtga cgtctgcttc ctcaagctca 3317821 ggaagttcgg cgagaatctc ggtggatgtc atttggtccg cgaccatcgc gaccacagtc 3317881 gccactggga tgcgcaagcc ccggatgcat ggcatgcctc ccatcacgtc ggggtcgatg 3317941 gtgacgcggg tgactcgcat gtctataagg ctagccggtg acagcacgct ggggcggttc 3318001 tccaccagcc gtcttggctt aagctcagcc aagagcaagc cggagggaga tttcggcacc 3318061 gcctgcggcg cggtttcggg cggtgacgct ggcgtggttg cgttggctga gggtgtcgac 3318121 gatggccagt ccaagcccgg tgccgccggt ggtgcgcgcg gtgtcggtgg tttccgcgag 3318181 agccgagcgg attgcggcga gcagttcggg gttgcttcgt ggacgccgca gggcgagttc 3318241 gagttcggtg gtcaggaggc taagggggtg cgaagttcgt ggcccgcatc gctgacgaat 3318301 tgacgttctc gctcgagcgc gtcttgcagc cgctgcagaa ggtcgttgaa tgttgttcca 3318361 agataccgga tttcgtctcg agccagtggc aatggcagac gggcatgtgg gtcggtggcg 3318421 ctgatgcccg cggcgcggat gcgcatgcgt tccacgggtc gaagagccgc ggcggccagc 3318481 agatacgcgc ccagggcgtc gatgaggacg tctggccgtg cccggatcgg tgcgatcgag 3318541 cttcatcgtg gtttcataat cacccgatag accatggcta accgaactgc catccacgcc 3318601 gtggatgcaa cggatggagg gaacgcgcat ggcgggcgcc aaacatgctg ggagaatcgt 3318661 cgcgatcacc accgcggcgg cggtgatact ggcggcgtgc agttcgggct ccaagggtgg 3318721 agcgggcagc ggccacgccg gcaaagctcg ttcggcggtg accaccaccg atgccgactg 3318781 gaagccggtg gccgacgcgc tgggacgtag cggcaagctc ggagacaaca acaccgcgta 3318841 tcggatcaac ctgccgcgca atgaccttca catcacgtcc tacggtgtgg acatcaaacc 3318901 ggggctgtcg ttgggcgggt acgcggcatt cgcccgatac gacaacaacg aaacgctgct 3318961 gatgggcgac ctcgtgatca ccgaggagga gttgcccaag gtcaccgatg cgttgcaggc 3319021 gcatggtatc gcccagaccg cactgcacaa gcatctgctg cagcaagacc cgccggtgtg 3319081 gtggacccac attcacggca tgggtgatgc cgcccgactg gcccaaggac tcaaggcggc 3319141 gttggatgcc acaacgatcg gcccgcctac cccaccgccg gcacggcaac caccggtcga 3319201 catcgacgtc gccggcgtcg accaggcgtt gggccgcaag ggaacccaag atggtgggct 3319261 gttgaagtac agcatccccc gcaaagacac catcatcgag gacgggcacg tgctgcccgc 3319321 agtgtcgctg aacctgacga cggtgatcaa ttttcagccg gtgggccgcg gtcgcgcagc 3319381 gatcaacggc gatttcatcc tgatcgcccc cgaggttcag gaggtcatcc gggcaatgcg 3319441 tgccggcaac atcacgatcg tggaactgca caaccatggg ctgaccgaag agccccgcct 3319501 gttctacatg cattactggg ccgtcgacga cgcggtcacc ctggcgcggg cgctgcgccc 3319561 ggcgatggat gccaccaacc tgcagtcgtc ataatcccga tgcaaccgca taagggctgg 3319621 tgtggctgat gcatcctgat ggcggtgcat ggtttcctgc tcgaacgggt cagcgtggtg 3319681 cgcgacgagg cgacggtgct gcggcaggtc agcgcgcatt ttcccgctgg ccgctgcagt 3319741 gcggtgcggg gcgccagtgg atcgggaaag accacgctgc tgcggttgct gaaccggctc 3319801 atcgatccga cgtccggaaa agtctggctt gacggtgtgc cgctcaccga tctggatgtg 3319861 ctcgtgttac gtcggcgggt cggcctggtt gcgcaggctc ccgtggtgct taccgatgcg 3319921 gtgctcaatg aggttcgcgt cggacgcccg gacctgccag aaggtcgagt gaccgagctg 3319981 ctggcgcggc tgtgtctcgg ccagtccgca cgcgaagcgt tcttgccgca ccaacgatcc 3320041 gccttgcgca ctgcgctgat acccgcgatc gactccacga aagtcgttgg gctgattagc 3320101 cttccgggtg cgatgtccgg acttatcctg gccggggtcg acccgctgac cgcgatccgc 3320161 taccaaatcg tggtgatgta cctgctgctc gccgccaccg cggtggcagc gctgacctgt 3320221 gcacgcctgg ctgaacgtgc cttattcgac cgcgcgcacc ggctcgtttc gctgcccgcg 3320281 gcgactcgtc gggcatgagt tcgcgactcg atcacagcca atcgccgctg tggcatgtgg 3320341 ccgttgtcag tcgttatcca cggtctccgt gcccaggaag cgaaagccct gatccaggtg 3320401 cacttccacc tgagtaccat cgggtttggt gacgagcacc ctcatcgctt catcccttct 3320461 tgtcgtcgtc gtggttacga aggcgacgct aacggcgcca gatgaagccc cgatgaaggc 3320521 agcgacgccg gtgacacaac ggggcggacc tgccccgtgg cacacggcgg ttgccggtca 3320581 cgatcactgc agtgtcgaga cggcctagga gctaggccgt ctcggtgatc gggcggtcca 3320641 cccagctcat caggtcgcgg agtttcttgc cgacgacctc gatggggtgc tcggcgtttt 3320701 gccggcgcaa ctcttcgagc tgtttgttgc cgccctcgac gtcggcgacc agcttgtgga 3320761 caaagctacc gtcctggatc tcccgcagga tgtcgcgcat ccgctccttg gtgccggcat 3320821 cgatgacgcg cgggcctgag aggtagccgc cgaattccgc ggtgtccgac accgagtagt 3320881 acatccgcgc caggccaccc tcgtacatca agtcgacgat cagcttcagc tcgtgcagca 3320941 cctcgaagta ggccaattcc gcggggtagc cggcttcgac catgacctcg aacccggcct 3321001 tgaccaattc ctcggtgccg ccgcacaaca ccgtttgctc accgaacagg tcggtttcgg 3321061 tctcgtcttt gaacgtcgtc ttgatgacgc cggcccgggt gccgccgatc gctttggcat 3321121 acgacagcgc cagcgccaag ccgtcgcctc gcggatcctg ctctaccgca accaaacacg 3321181 gcacaccctt gccgtcgacg aactggcggc gcaccaaatg acccggtccc ttcggggcga 3321241 ccatcgcgac ggcgacgtcg gcgggcggct tgatcaagcc gaagtgaacg ttgagtccgt 3321301 gaccgaagaa cagcgcgtca ccgggcttga ggttgggttc gatgtctcct gcgaagatct 3321361 cggcctgggc ggtgtcgggg gccaacacca tgaccacatc ggcccatttg gcgacctcgg 3321421 cgggagtgtc gacgtccagg ccctgctctt ctaccttggg ccgcgaccgc gaaccctgct 3321481 tcagcccgac gcgcacctgc acacccgagt cgcgcaggct tagcgagtgc gcgtgcccct 3321541 ggctgccgta gccgatcaca ccaaccttgc ggccctgaat gatcgacagg tctgcgtcgt 3321601 cgtcgtagaa catctctagt gccaccgctg aatctctcct tacctgctag ctacttggcg 3321661 gtgccgatgc cgcgcggacc gcgggacagc gacaccattc cggattgggc gatttcgcga 3321721 ataccgaacg gctccaacac ccgcagcagg gcctctaact tgccgcggtt accggtggcc 3321781 tcgacggtca atgactccgg ggatacgtca atcacgttgg cgcgaaacag attcaccgct 3321841 tcgatcactt ggctgcggct gccggcgtcg gcttggacct tgatgagcgc caattcccgt 3321901 gacaccgagt gctcgtcgtc ctgctcgacg atcttgatga cgttgatcag cttgttgagc 3321961 tgcttggtga tctgctcgag cggagtgtcc tcggcggaga ccacgatggt catccgtgac 3322021 ctgtccttgc actcggtggc acccaccgcc aacgactcga tgttgaaacc gcgccgggag 3322081 aacagcgccg ccacccgcgc cagcacgccg ggcttgtctt cgaccaacac cgacaacgtg 3322141 tgcgtcttcg ggctcatcag gcgtggcctt cggtgatgtc gtcgaacagg gggcgaatgc 3322201 cgcgggcggc ctggatctcg tcattgctgg tgcccgcggc caccatcggc cacacttgcg 3322261 cgtcggcacc gacgatgaag tcgatcacca ccgggcagtc gttgatcgcc cgcgcctggt 3322321 tgatgacgtc gacgacgtcc tcttcccgct cgcaccgcaa ccccacacac cccaaggcct 3322381 cggccagttt cacgaagtcg gggatgcggt gcgaatgagt ggccaggtcg gtctgcgagt 3322441 accgctcggc atagaacagg ctctgccact gccgcaccat gcccaggttg ccgttgttga 3322501 tcagcgccac cttgaccggt atgccctcga ccgcgcaggt ggccagctcc tggttggtca 3322561 tctggaagca accgtcgccg tcgatcgccc agacctcggt gccggggagg gcgatcttgg 3322621 cgcccatggc cgccgggatg gcaaacccca tggtgcccag accgccggag ttcagccagc 3322681 tgcgcggctt ttcgtatctg atgaactgcg cggcccacat ctggtgctgg ccgacgccgg 3322741 cgacgaagac ggcgtccggc ccggcgatct cgccgagctt ttcgatcacg tattccgggc 3322801 tcaggctgcc gtcgctctgc ggcccatagc tcagcggata ggtcttgcgc acaccgttca 3322861 ggtatgccca ccagtcggcc atctcgatgg tgccgggaat gtggtggtgg cgcagcatcg 3322921 cgatcagttc ggtgatgacg gccttgacgt caccgacgat gggcacgtcg gcgtggcggt 3322981 tcttgccgat ctcggccggg tcgatgtcgg cgtggatgac cttggcttcc ggcgcgaacg 3323041 agtcgagctt gccggtcacc cggtcgtcga agcgggtacc cagcgcgatc agcaggtcgc 3323101 tgcgctgcag cgccgccacg gcggccaccg tgccgtgcat gccgggcatg ccgaggtttt 3323161 gccggtggct gtcgggaaac gcgccgcggg ccatcagcgt ggtgaccacc gggatgccgg 3323221 tcagctcggc cagctcccgg agctgctcgg tggcctcacc gcggatgacg ccgccgccga 3323281 catacagcac cggcttgcgc gcggccgcga tcagcttggc ggcctcgcgg acctgccggc 3323341 tgtgcggttt ggtgttgggc ttgtagccgg gcagctccat ccgcggcggc cagctgaacg 3323401 tgcactggcc ctgcagcacg tccttgggga tgtcgaccag caccgcgccc ggacggccgg 3323461 aggccgcgat gtggaaggcc tcggccagca cccgcggaat gtcgtcaccg gagcggacca 3323521 gaaagttgtg cttggtgatc ggcatcgtga tgcccgagat gtcggcctcc tggaaggcgt 3323581 cggtgccgat cagcccccgc ccgacctgac cggtgatagc gaccaccggg atcgagtcca 3323641 tctgcgcgtc ggccagcggg gtcaccaggt tggtcgctcc gggacccgac gttgccatgc 3323701 acacgcccac ccggccggtg acgtgcgcgt agccgctggc ggcatgcccg gcgccctgtt 3323761 cgtggcggac cagcacgtgg cgcagctttt tcgagtcgaa cagcgggtca tacaccggca 3323821 gcaccgcacc gcccggaatc ccgaaaatga cgtcgacgcc gagttcctcc agcgaccgga 3323881 tgaccgcctg tgcaccggta agctgctgca gtgcaacatg tttcggacga gccgccgggt 3323941 gctttggctc attcgccgcg ctgtgtggct ctggcttgaa tgtcggtgag tgtggcttgg 3324001 ttggtgcgct cactgttgtg tgatcctcta ttgctctgga agtctcgttg gtggacaaga 3324061 aaaaaccctc gccagctcag ctgctgcacg agggtcgcgt tggtgctcgc ttgggctagt 3324121 caggcaccaa cgcgccgacc aattactacg agcatcccgg gctttccggc cttgtccata 3324181 gtgtccgacg gtagccttca cacagctcag cagtcaaatc cgcggtgtca gtcttgatcc 3324241 gcgagcgtga cggcactgcg aaatcccatg cgaattttcg cggtggcgtt acgctcgcga 3324301 actcgacgcc caccaagcgg tgagatgatg ctggggtggc caccacatcg ccggtcgtga 3324361 tcaaggtgtc gccgatggcg cacttcgccg tgggattcct gaccctgggt ctgctggtgc 3324421 cggtactgac ctggccggtg agcgccccgc tgttagtcat tccggtggcg ttgtcggcat 3324481 cgatcattcg gctccgcacg ctcgccgacg agcggggcgt gaccgtgcgg acgctggtcg 3324541 gcagccgcgc ggtgcgctgg gacgacatcg acgggctgcg gttccaccgc gggtcctggg 3324601 cgcgcgcaac gctcaaggac ggtaccgagc tgcgattgcc cgcggtgacc tttgcgacgc 3324661 tgccgcacct gaccgaagcc agctcgggac gggtccccaa cccgtaccga tgacagcgtt 3324721 caggccagcg gatttgcccc gttgagcagc acccatacgg caatacccgc cgcgatgcca 3324781 cccaacagcg ctacaaacga tccgataaat ggccgatgag cccagccgcg ggccgcgtcc 3324841 agaccgtagc ggccgggacc actcaagata acggcgaccg ccatcacgac cagggtgatc 3324901 tggtattcat gcccgtcctg caggaagtac gcgacgggcc gcgaatgctg tgccgagatg 3324961 ccggcgagca ggccgttgat caagaaggcc agcgcgcccg cggccgccag cggagtaaac 3325021 aaacccaaca ccagcagcac tccggcgacg atctcgccgc cagcgctcac ataagcgagg 3325081 atctcggcgt gctggtaacc aatgtcggac agcgagttct ggaatccggc cagaccctgg 3325141 ccgtcccacc agccgaacaa tttctgcagc ccatgggcga taaggaccgc gcccagaccg 3325201 acccgcaata tcagcagccc gagattctgg gtgccgcgcc gacctgcggc gcgtacccgc 3325261 tcgtcgtcgt ccatgtcgat tccggcagat cccgccggta cctgccggcc cggctgcggc 3325321 tggacgtagg gcaacggctc cgcagcttcg atcaggctgt acccggagtt gccgacacca 3325381 gagctagcgg catcataggg cgggataacg gtggtggttc cactgccaaa gtccccggca 3325441 tatctggccg gcgtcaggtc atcctcgggg tcgaccaggc ttgccgagac aggccgtcca 3325501 ggcattggcc caggcgaatc atccggccgc tgccaatgtg agtcattcga actggtcact 3325561 cgtgtcaggg taaggccatt tagtgccgaa ttggggattt gagcggcgct ttcgccagac 3325621 aatccgcaca ttgaccctga ccagcccacc aaaaggcccc aattgggccg ccatgccgac 3325681 agtgcgcacc ccggcaggtg gcggcgatgc ccacaatgtc cgtagcctgt cggtcatgtg 3325741 gacaacgcgg ttggttcgat ccggactcgc cgcgctgtgc gcggcagtgc tggtatcgag 3325801 cggctgcgca cggttcaacg acgctcaatc tcagccgttc accaccgaac cggagctgcg 3325861 gccccaaccc agctcgacac ctcccccccc gccgccgctg ccgccggttc cctttcccaa 3325921 ggaatgtccg gcgccgggcg tgatgcaagg ctgccttgag agcaccagcg gcttgatcat 3325981 gggcatcgac agcaagaccg cactggtcgc cgagcgcatc accggtgccg tcgaggagat 3326041 ctctatcagc gccgagccga aggtaaagac ggtcatcccc gtggatcctg ccggtgacgg 3326101 tggcttgatg gacattgtgc tgtcgcccac ctactcgcaa gaccggctga tgtacgccta 3326161 catcagcacg cccaccgaca accgggtggt gcgagtggcc gacggcgaca tccccaagga 3326221 catcctgacc ggcatcccca aaggtgctgc cggtaacacc ggggcgctga tcttcaccag 3326281 tcccaccacg ctggtcgtga tgaccgggga tgctggcgac ccggcgttgg ccgccgatcc 3326341 ccaatcgttg gccggtaagg tcctgcgtat cgaacagccc accaccatcg accagacgcc 3326401 gccgacgacg gcgctgtctg gcatcggctc cggcggcggc ttgtgcatcg atccggtcga 3326461 cggctcgcta tatgtcgccg accgcacgcc aacggcggac cgattgcagc gcatcaccaa 3326521 gaactcggag gtctctacgg tatggacctg gccggacaag cccggcgtgg ccgggtgtgc 3326581 cgcgatggac ggcaccgtgc tggtcaacct gattaatacc aaactgacgg tggcggtccg 3326641 gctcgcgccg tcgaccggtg cggtcaccgg agaacccgac gttgtccgca aagacactca 3326701 tgcgcatgcg tgggcattac ggatgtcgcc ggacggcaac gtctggggag ccaccgtcaa 3326761 caagaccgcc ggcgacgccg agaagctcga cgatgtggtg ttcccgctgt tcccgcaggg 3326821 tggcggcttc ccgcgcaaca acgacgacaa gacctgaccc ggttagggca cgtcgagcgt 3326881 gaaccttacg acgccgtatc ggcgtgtctc gtcgccccgt tcacgctcgt agaaccgggg 3326941 tgaggcttcc ttgccagggt cgatgtcgtc gacatcaaag tcgaggtcgg agaggtagag 3327001 cagatcttcc gagcactccg gagcccacac gctcacgggc tccaacaggt aagccacata 3327061 atccccgaca tcgctgcgac tggccgtcct accgatgaac caggcggccg cgtcgtcgag 3327121 aatcggcatt ccacaggggc cagcgcgcca cgagcaacgg gcgaacttgt tgacctcctc 3327181 ctccgtttgg ctgccgaaca gttcggcgag cacatgctgc cgctgcgaaa gcacgtgcac 3327241 ggcgaggtgc tcggatcggc tcgccacctc ggaggtgccg gtgctcctcg gcaggccgac 3327301 cataaaactc gggggctgca cgctcgtttg ggtagcgaag ctgaccagac aacccgcggg 3327361 gtgaccatcg gcctgggttg tcaccacaaa caccgggtgg tccagcatcc ccatcaactc 3327421 gtcgaacgac tcatcgatca catcaccatc atgaatccgc gcaacgtctt ctgacactct 3327481 ttccgagcgt tcagtcggcg aatcgccgct accgccatca cgtcgaccgg tgaggccgcc 3327541 gtcacggccc caaatcggcg acgatctggg cacggaatca gaacctgatt gggtcccggc 3327601 cagcctcgct ggcgtgggaa gtcaccacgg tcgcgccgcg gcttctcaac ccggccgaca 3327661 cgcgctcccg atgctcactg tcgtcgctgt cataggcata ctcgaatgtg gattagtgct 3327721 acacatgcct gacaacgatt tgtggtactg cgggccatgg acactatggg tgatggccgg 3327781 taggggtgtt gcgtcgggcg cgggagtgtg gcgaggtgat cgcgttgcga cgccccttgc 3327841 ggtggcgatt accgcagccg gattggtatc aggggcccgg ataggacccg gtgcggctgc 3327901 gaaacgcgac ccgcagctcg cacagtggaa cgagattcgc agtcactacc aagagatcgc 3327961 cgagtggatc gaccacgaca cagcaaccgc acaccccgct gttgccgcaa cgcagatcag 3328021 tgccgctggc tctttcggcc gcgccaatat ggtcgactac ctggggctcc tggattccag 3328081 ggccgacgaa acggtccgac gcgacgaatt ttcgcggtgg ctgtcggcca aacccgacta 3328141 cttggtcacc accgagcaat ctgtcgacgc cgccacgata gcccttcctg aattccgcca 3328201 tgcgtacgac cgcgcggcca ccatcgggac actcaacgtg tatcgtcgca actcccctga 3328261 cggtgatgaa ccgctacccg cggacggcaa ctaaccctgc ccgcaggcct ctagaacgag 3328321 ttcgcgcact cgggccgcgt cggcctgtcc gcgggtcgcc ttcatcaccg caccgacaat 3328381 cgcgccggcc gcggccacct tgccgccgcg aatcttgtcc gccacatcag gatttgcggc 3328441 cagggcctcg tcgaccgcgg cctgggtcaa cgagtcgtcg cggaccaacg ccaaccctct 3328501 cgcagtcatc acctgttcgg gctcaccttc accggccagc acaccctcca cgacttggcg 3328561 ggccaagctg ttggacagct tgccctcatc gaccaatgcc accacggctg cgacctgggc 3328621 aggagtgatg gccagttcgt ccagcccgat gccggcctcg ttggcctttt gcgccaggaa 3328681 gtttccccac caggcgcgcg ccgcctcgct ggacgcgccg tgctcgacgg tggcagcaac 3328741 caattcgacg gcgccggcgt tgaccagatc gcgcatcacc tcgtcggaaa caccccactc 3328801 ctgctgaatc ctcctgcggc tcaaccacgg caattcgggg atcgtctggc gtagtcgctc 3328861 gaccagctcg cgactgggcg cgacaggctc caaatccggc tccgggaagt accgatagtc 3328921 ctcggcggtc tccttggtgc ggcccgcgct ggtgtaaccg gcctcgtgaa agtgtctggt 3328981 ttcctgggtg atccgaccac cagacgccaa aatagcgccc tggcgctgca tttcgtagcg 3329041 gacggcgact tcgacgctct tcagcgagtt gacgttcttg gtctcggtcc gggtgccgaa 3329101 ttcggtcgtc ccggccggct tcagcgacac gttggcgtca cagcgcatcg aaccctggtc 3329161 catccggaca tcagatacat ctaatgcgcg cagcagatcc cgcaacgccg tcacatagga 3329221 ccgggcgatc tgcggcgccc gggcaccggc gcccacgatg ggtttggtga cgatctcgat 3329281 gagcggcacg ccggcacggt tgtagtcgat cagcgaaccg gtggcaccgt ggatccggcc 3329341 cgtctcgctg ccgatgtggg tgagcttgcc ggtgtcttct tccatgtgag ctcgctcaat 3329401 ctccacccgc caagtggtgc cgtcttccaa aggcgcgtcc aggtagccgt tgatggcgat 3329461 cggctcgtcg tactgtgaga tctggtagtt cttgggcatg tcggggtaga agtagttctt 3329521 ccgggcgaag cgacaccagg gtacgatctc gcagttcagc gccagcccga tgcggatcgc 3329581 cgactccacg gcggcccggt tgagcaccgg cagcgaaccg ggcaagccca gacacaccgg 3329641 acacacctgg gtgtttggct cgccgccgaa tgtggtggtg cagccacaga acatcttggt 3329701 cgcagtggac agctcgacgt gcacctcgag gccgagtacc ggctggaagc gcgcgacgac 3329761 ctcgtcgtaa tcgagcagtt cagcccctgc ggccttggct gccccggcag caacagtcat 3329821 agccgcgatc ctagtttgag cacccgacgt caaccgaaga aggcggcggc gtcgtcgtaa 3329881 cggctctgcg gcaccagttt gagtttgcga accgcatccg ccagcggaac ccgaccgatg 3329941 tcctggccgc gcaacgtcac catctggccg tactcgcccg catgcgcggc gtcggcggcg 3330001 ttcaccccga atcgggtggc cagcactcgg tcgtaggcgg tcggagtacc accccgctgg 3330061 atgtggccca acaccgtcac ccggacatcc ttgttgatgc gcttctcgac ctcgaccgcc 3330121 agctgcgccg ctacacctgt gaaacgctcg tgcccgaact cgtcgagacc accctcgcgc 3330181 agcatgatcg tccccggagc cggtttggcg ccttcggcga ccacgcagat gaaatgcgag 3330241 tccccgcgct ggaaacggcc tttgaccagt cggcacacct cttcgatgtc gaacggctgc 3330301 tcaggaatca gggtcatgtg agcaccggag gccagcccgg cgttcagcgc gatccagccg 3330361 gcatgcctac ccatcacctc caccagcatc acccgctcgt gggattcggc ggtgctgtgc 3330421 agccggtcga tggcctcggt ggccacggtc aacgcggtgt cgtggccgaa ggtcacatcg 3330481 gtgcagtcga tgtcgttgtc gatcgtcttt ggcaccccga ccaccggcac attctcttcg 3330541 gagagccaac tcgcggcggt cagcgtaccc tcaccgccga tcgggatcag gacgtcgatc 3330601 ccgttgtcgt ccaaggtctg catgatttgg ggcagccccg cccgcagttt gtcggggtgc 3330661 acccgggccg tgcccagcat cgtgccgccc ttggccagca gccggtcatt gcggtcgtcg 3330721 ttgtgcagtt gaacacggcg gttctccagc agcccgcgaa agccgttctg aaatccgacc 3330781 accgacgagc cgtatcgggc gtggcaggta cgcaccaccg cacggatgac ggcgttaagg 3330841 ccgggacagt cgccgcctcc ggtaagaact ccaatccgca taccctcatc ttgcctcgcg 3330901 gccgccgacc tggcgcgagc agacacagaa tcgcacgggc gaggggcgcc ggatgcgagt 3330961 ctgtgtctgc tcgccgctaa atggcgctca gtagcgggcc gcgggcggcc tcataagccg 3331021 cccccacccg gtagagccgg tcgtcggcca atgccggcgc catgatctgt aggccaaccg 3331081 gcaacccgtc gtccggggag agccccgacg gcacagacat gccgcagtgg ccggccaagt 3331141 tcagcggcag cgtgcacagg tcgaacaagt acatcgccag cggatcgtcc accttctcac 3331201 ccagccggaa cgcggtggtc ggggtcgtgg gcgacaccag cacgtcgacg gaccgatacg 3331261 ccgcgtcgag gtcgcgggcg atcagcgtgc gcaccttctg cgcctggttg taataggcgt 3331321 cgtagtagcc ggccgacaac gcgtaggtgc cgatcatgat gcgccgcttg acctcgggcc 3331381 cgaaaccggc ggcccgggtc atcgccatca cctcctcggc gctgcgggtg ccgtcgtcgc 3331441 cgacccgcag cccgtagcgc atcgcgtcga agcgcgccag attgctcgac acctccgagg 3331501 gcagaatcag gtaataggcg gccagggcat ggtcgaagtg cgggcagtcg acctcgctga 3331561 cctcagcgcc cagcgcggtt agctgctcca cggcagcctc gaaggaggcc agcacgcccg 3331621 gctggtagcc ctcgccgccg tgcagctgtc gaaccacgcc gacccgcacg ccacgcagat 3331681 ccccgaccgc gccggcccta gcggcgccca ccacgtcggg cacctcggcg tcgaccgacg 3331741 tggagtcgcg cgggtcgtgg ccggcgatca cctgatgcaa cagcgcggtg tccaagacgg 3331801 tgcgcgcaca cgggccgccc tgatccagcg aggacgcgca ggccaccagc ccatagcgcg 3331861 acaccgtgcc gtaggtgggt ttgacgccga cggtcgcggt cagcgcggcc ggctggcgga 3331921 tcgacccccc ggtgtcggat ccgatggcca gcggcgcctg gaacgcggcc agcgccgccg 3331981 cgctgccgcc accggaaccg ccgggtaccc ggtcgagatt ccacgggttg cgggtgggac 3332041 cgtaagcgga gttctccgtc gacgagccca tcgcgaactc gtccatgttg gtcttgccca 3332101 ggatcgggat ccccgcggcg cgcaaccgcg cggtcagcgt ggcgtcgtag ggagatcgcc 3332161 atccctccag gatttttgac ccgcaggtgg tgggcatgtc gctggtggtg aagacgtcct 3332221 tgagcgccag cggcaccccg gccagcgccg acggcaaggg ttctccagcg gccacctgct 3332281 tgtcgacggc ggccgccgcc gccagcgcct catcggccgc cacatgcagg aaggcgtggt 3332341 acgtctcgtc ggtcgcctcg atctgatcca ggcaggcccg ggtgatctcg gtcgacgaca 3332401 cctccttgat ggcgatcttg gcggccaacg tcgcggcgtc ggatcggatg atgtccgtca 3332461 ctgttcatcc cccaggatct gcgggacggc gaagcggccg tcgacggcat cgggcgcctg 3332521 gtcgagcacc tgacgctggg tcaggcacgg cacggtctcg tccgggcggg tgacgttgac 3332581 gtccttgagc ggattgtcgg tggcctgcac accggtgacg tcgacggcct ggatctggct 3332641 gacgtgggtc aggatggcgt cgagttggcc ggcgaaactg tccagctcgg tttcggtcaa 3332701 tgccagccgg gcaagcctgg cgaggtgggc aacctcgtcg cgggagatct gggacacgac 3332761 cgcaaagcct aatgggtggc cggacggccg acgccggctg ccgaaacgcc gtggatacat 3332821 cgttgtgcca cagtgttggc cgtgcgttcg tatctattgc gtatcgagct ggccgaccgg 3332881 ccgggcagcc ttgggtcgct ggcggtcgcg ctcggctcgg tgggcgccga catcctctcg 3332941 ctcgacgtgg tcgagcgcgg caacggctat gcgatcgacg acctggtggt cgaactgccc 3333001 ccgggagcga tgcccgacac gctgatcact gctgccgagg cgctgaacgg cgtccgggta 3333061 gacagcgtcc gcccgcacac cggcctgttg gaagcccacc gcgagctgga actgctcgat 3333121 catgtggccg cggctgaggg cgcgaccgca cggctccagg ttctggtcaa cgaggccccc 3333181 cgggtgctcc gggtgagctg gtgcacggtg ttgcgcagtt ccggcgggga gctgcaccgt 3333241 ctggccggca gcccaggtgc gccggagacc cgggccaatt cggcgccctg gctgccgatc 3333301 gagcgggccg cggcgctgga cggcggcgcc gactgggtgc cgcaagcctg gcgcgacatg 3333361 gataccacca tggtcgcggc tccattgggt gacacgcaca ccgcggtggt gctgggcagg 3333421 ccaggcccgg aatttcgccc gtcggaggtg gcgcggttgg gttatctagc cggcatcgtg 3333481 gcgacgatgc tgcgctgagc ggttcgttgg caaccaaggt tcgccgagcg taacgccact 3333541 gcgaaaaacc gcgcggagat tcgcagtgcc gttacgttcg tgacgcgggt ccgtcggcca 3333601 gcagtctccg gaacccatcc tcgtccagaa tcggcacccc caactccacc gccttgtcgt 3333661 atttggatcc cggcgagtct ccggcgacga catagttggt cttcttcgac accgagccgg 3333721 cggccttgcc gccgcgggcc acgatcgcct ccttggcgtc gtcgcgggag aaaccggtca 3333781 gcgagccggt gaccacgatg gtcagcccgg ccagcgtgcg tggcacactc tcgtcacgct 3333841 cgtcgaccat tcgcaccccg gcggcccgcc acttgtcgac gatctcgcgg tgccagtcga 3333901 cggcgaacca ctcggtgacc gcggcggcaa tggtcggccc caccccctcg acggcggcca 3333961 gctggtcggt ggacgccgcg gcgatggcgt caaggctgcc gaactcggtg gccagggcgc 3334021 gggccgccgt cggcccgaca tggcggatgg acagcgccac cagcacccgc cacagcggtg 3334081 ccgccttggc cttgtcgagg ttgaccagca gccgtttgcc gttggccgac agttcgcctg 3334141 ccttggttcg gaacaggtcg gtgcgcagca agtcccgctc ggtcagcgcg aacagctcgc 3334201 cctcgtcggc gatcaccttc gcctgcaaga gcgccacacc cgcctcgtaa ccgagcacct 3334261 cgatgtctag gccgttgcgg ctggcgacgt ggaaaacccg ctcccgcagt tgccccgggc 3334321 agccgcgggc gttggggcaa cggatgtcgg cgtcgccttc cttctccggc gccaacggcg 3334381 aaccgcactc cgggcaggtg gtgggcatga tgaattcgcg ttcggagcca tcgcgcagtt 3334441 cgacgacggg tcccagcacc tcggggatca cgtcgccggc cttgcggatc accacggtgt 3334501 cgccgatcag cacgcccttg cgcttgatct ccgaggcgtt gtgcagggtg gcctgtccca 3334561 ccgtcgaccc ggccaccttc accggcgtca tgaacgcaaa cggcgtgatc cgcccggtgc 3334621 ggccgacgtt cacccggatg tcgagcagct tggtctgcgc ttcctcgggc gggtacttgt 3334681 aggcgatggc ccagcgcggc gcccgcgacg tggaacccag cctgcgctgc aacgccacct 3334741 cgtcgacttt gaccaccacg ccgtcgattt cgtggtccac ctcgtggcgg tgctcgcccc 3334801 agtagtcgat gcgctcgcgc acaccggcca ggtcggttgc cagggtggtg tgttcggaaa 3334861 ccggcagtcc ccatgcccgc aacgccaggt atgcctgatg cagggtggcc gggcgaaagc 3334921 cctccacgtg gcccagcccg tggcagatca tccgcagccg gcggcgcgcg gtgaccgccg 3334981 ggtctttctg gcgcagcgat cccgccgcgc tgttgcgggg gttggcgaac ggcgccttgc 3335041 cctcctcgac gaggctggcg ttgagcgcct ggaagtcgtc cagccggaag aagacctcgc 3335101 cgcggacctc gaggacctcg ggcaccgggt agtcgtcgcc gggggtgagc cgttcgggaa 3335161 cgtcggcgat ggtccgggcg ttcagggtga cgtcctcgcc ggtgcgcccg tcgccgcggg 3335221 tggaggcccg ggtcagccgt ccctcgcggt agaccaaaga cagcgcgacg ccgtcgatct 3335281 tgagctcaca caggtaatgt gcggcgtctc cgacctcggc atggatgcgg ccggcccagg 3335341 cggcgagttc gtcggcggtg aacgcgttgt cgaggctgag cattcgttcg agatggtcga 3335401 cgggctcgaa atccgtggcg aagccggcac cgccgaccag ctgggtcggc gaatcgggcg 3335461 tgcgcagctc gggatgctgc tcctcgaggg cttccagacg gcgcagcagc tcgtcgaatt 3335521 ccgcgtcgct gatgatcggc gcgtcccgca cgtaataacg gaactggtgc tcacgcacct 3335581 cctcggccag tgcctgccac tgccgcaaca cctcgggagc ggtctgatcg gcgtctgggg 3335641 agctcactct ggcaggctag ccgagggggc tcttccctca gatggcctct gggtcccgcg 3335701 cgaacgcctc agcgacatca cgggcaagcc cgaccgcggt gcgggcccac tgccccgtcg 3335761 cattggccag accacacgcc gggctgacgc cgagtcgatc gcgtagcgcc gagcgaggaa 3335821 cgccgagccg atcggtgacc gcgaccgccg cagcagcgac ctcttccatc gaaggtgctc 3335881 gctccggggc ggtcaccggg accaggccca gcacgacggt tcggcccgac tcgacaaatg 3335941 ccgcgacagc atccaaatcc gcagcctgca gtgtgctcgc atccaccgat accgcactaa 3336001 ttctgctgcg ctgcagcaga tcccacggca aatccggact gcagctgtgt agcgctacgt 3336061 ccgcgtcgac agccgcgatg caagtgtcga gcagcgcttc ggccaccgtc tcgtcgagcg 3336121 gggcaaccgg gctcaacgcg gtcaccccgg tcagccggcc gcccaacgcc gccggcaacg 3336181 acggctcgtc gaactgcacc accaccggtg tgtcaagtcg acgcgccagc gccgcgcgat 3336241 gcgcggcaac gccttcggcc agcgaggcgg ccaggtcacg cacggctccg gggtcggtga 3336301 tcgcccggtg accgttggcc agctccaacc ccgcgaccaa tgtgactggc ccgggcgcct 3336361 gcaccttcac cgcccgccca cagccacgca ggcccgcggt ctcccaggcc tcttctaagg 3336421 catccatatc ctcgtcgagg aggctcgcgg cccgccgtgt caccgcgccg ggtcgagcag 3336481 cgatgcggta gccacgaggc acggtgtcaa tcgccacgtc gaccagcagt ccgccggctc 3336541 gccccagcat gtcggcgccg acgcccctgg cgggcagctc ggtgagatag gccaatgcac 3336601 ccgccaactc cccgaccacg acctgcgcgg cctctcgcgc ggcggtgccc ggccacgatc 3336661 cgatcccggt ggccgttgcg aaaacactca cccggcaacc gtattcgacc tcacatcgtc 3336721 ggctggccgc caggggtgtc tgctgcaggt tcgcccgggt accttcgaag cagaagggtg 3336781 gcagatggtg ggattgacgc ggccgctgct gttatgtggc gcgacactac tgattgcggc 3336841 gtgcacccgg gtggtgggcg gcacggcttc ggcgactttt ggcggtgacc gacagggcat 3336901 gcttgacgtc gctacgatcc tgttggatca gtcacggatg caagcaatca ccggctccgg 3336961 cgatgacctg acgatcatcc ccacgatgga cacgacgtat cccgtcgacg tcgacgattt 3337021 cgcccaaccc ataccacgag aatgccggtt catctatgcc gagacggcag tctttggctc 3337081 tgagatcgaa gcgtttcaca agaccacctt ccaggaccgg ccagatggca gtctgatctc 3337141 cgaggcggcc gccgcctatc gggatgccgg caccgcccgg cgtgccttcg acaccctggc 3337201 ggtcaccgtc cacgactgcg cggcaagtcc ggcaggctgg ctgttcgtca gtaggtggac 3337261 cgccggcggc aattccctac acatccgggc cggcgattgc ggtcgcgact accgggtcct 3337321 atcggcggcc ctgttggaag tgaccttctg cggcttcccg gaatcggtct ccgacatcgt 3337381 gatgacgaac atcgccgcca acgtgccggg ttagcacctc gagcccgcgt tcaggatgcc 3337441 aggacggatg tcaacgtggt cagttgtgcg ttgcgctgcg cgacgacatt ggtgctgaca 3337501 tttccaccac gcgcgtttag tctccggcgt cggcggccgt ggctggaccc gcatggcgcg 3337561 ggtccagcca ccgaccccgg aacgacccca ccctaatcgt tccgcagtct gacgaatcgc 3337621 ctaccggcct ttccagcacc ccgatctggc gtagtgctcg ccggcaccga cggtaggccc 3337681 gcgcgagagc ctccatggcc tgattccact gggcctgcca gtcctgatga ctcatcccga 3337741 gatcaccttg gcaagcacgc gacggcgcgg tgcgctcact ggcgatatcg gcccccaagc 3337801 tctgcgtcgt gcccgtataa ccggccatgt ctccgacatt ggccgtcatc gccgggtagc 3337861 tgtacatact ctgcgacacc acgaatccct ttcaaatatt ccgggcaatg atttttagac 3337921 actctttcga tcgaaaattt ggtcgagttc acggccgtca gatcgtcaaa ctgacaccaa 3337981 ccccccatca ccggccacac cgaccaaatc cggcccccag ctgcccggca gcatcggcac 3338041 cggcgcacca tcaccaaact catcggccaa caccgtcaac cccgccggct gcccaaccga 3338101 ctccttgcca gccgtcccaa caaaccccaa cgccccagca ccccgatcag aagccagcac 3338161 cgacaccgac gtgctcgcgg gagccggctc caccgccgac accaaccgcg cctgcggcga 3338221 caccacacca cccgacaccg gcgccacact gcccaccaac gccgccggcg cgccagcagc 3338281 cgcacccacc gccggcacag cggccaatcc cgccaccccg gccaacccag ctacacccgg 3338341 aaccacagca gcggccaacg cccccaacaa cggccccccc aaaaggggaa caacaccaaa 3338401 cagattccca ataacccact cgacaactat gtctataacg cccgcaatgg catacaataa 3338461 tccgaccgca acataagaag cattgatagc gaactctgtc aatagttgag cattggatgc 3338521 caacgtaata atgaacccaa taatgttgaa acccaggata tccacaaaga gctggaacca 3338581 aacccaagcc accgcaggga gttcggaaag caaagcggac agatactggt catatgctgc 3338641 gaacgtttct tctaaaaact gtacaatttc gtgccatggg aatggggtta tggttgcggc 3338701 ggccacggcg ttgctggctt cattggcgcc gggtttgacg atgaccggtg ccgggccggt 3338761 gtgtggtgtg gccaccagcg cggcacccac caccgcctca taggcgctca tcacggtggc 3338821 cgcctggacc cacatccgca catagtcggc ctcgttgagc gcgatcggga tcgtgttgat 3338881 cccaaagaaa ttcgtcgcca ccaacaccgc atgcgtgagg tggttggccg ccaactccgg 3338941 caacgtcggc atctccgcca acgcacaaac atagccagcc gccgcggcct catgctcacc 3339001 ggccgccgcc gcgctatccg cactggcctg caccaaccac gccacatacg gcacataggc 3339061 ggccacaaac aactcagcac tgggaccctg ccacaccccg gcccccaccg cggccaccac 3339121 cgcgctcaac tcttgcgcca cagcggcgta ctcggcgctt aacgcgctcc accccgccgc 3339181 ggccgcctgc aacgaacccg gccccggacc agcacttagc agcgccgaat gcacctccgg 3339241 cggcgacgcc aaccacaccg gcgtgcggtg cgaggacacc ggatcacccg ctagcggaat 3339301 caatgtgcgg cggccagccg tgcggtcaac gcctccaccg ccgcactggc ggccgccaaa 3339361 ccctcaggaa ccacgctcag cgtcaccacg cacactcctt ccttaggcgc ctcccacacc 3339421 catctcccgg atttttgctc tatcaactgt tgtaaatagc tacgattacc caggcgtaga 3339481 cgacgacgcc gcagattcct cacacccgcg cctgcgcaat tggccacgca ccaccgccgg 3339541 cagcgaggcc gccagccaca caccaagctc ctcgccgacc acatcggcta ccggatccac 3339601 caacagcgca acggcattcg gatgggggcc catcaaaccc accgatcatc ccggcgtcgc 3339661 cgaccacgcc cgcgccgtgt gctagccgcc ccacttggcg gcttcggccc catctcgagc 3339721 caacatcgcc atggtgttgg actcatgggt gccagacatc gactgatagg cccgcaccag 3339781 atcctctagg gcctggttcc actgggtctg ccagccctga tacgtgatcc cggtatcacc 3339841 ctgccaagca ctggacagca cggcctgctc actggcgata tcggccccca agctctgcag 3339901 cgtgcccgca taaccggcca tgtccccggc atgagccatc atcgccggat agttgtacat 3339961 aatctgcgac atcacaaacc ccttttcatt ccgagcagcg acttttttaa aacccggtgt 3340021 agctggacgc ggcggcggca tcggcggcca catacgtgcc cgcggcctca cccaaattgg 3340081 cttgcgcgat atccagcaag gtattgacct tggcggccgc ggccacaaac cgggcatgcg 3340141 caccctgaaa cgccgccgcg gactctccct gatgaaacgc ctgcgccgac atcgcctgct 3340201 gctcggcctg accgatcgta tgccgcatca accccgcctt agcggcaaac gccgtatgcg 3340261 aagcgatcaa ctgcggaata tgggcatcca acaaactcat cacaattcct tccaattcga 3340321 atcaccaatt actcgccgtc agatcgtcaa actgacacca accccccatc accggccaca 3340381 ccgaccaaat ccggccccca gctgcccggc agcatcggca ccggcgcacc atcaccaaac 3340441 tcatcggcca acaccgtcaa ccccgccggc tgcccaaccg actccttgcc agccgtccca 3340501 acaaacccca acgccccagc accccgatca gaagccagca ccgacaccga cgtgctcgcg 3340561 ggagccggct ccaccgccga caccaaccgc gcctgcggcg acaccacacc acccgacacc 3340621 ggcgccacac tgcccaccaa cgccgccggc gcgccagcag ccgcacccac cgccggcaca 3340681 gccgccaacc ccgccaaccc cgtcacacca gccacacccg gcaccaccgc agcagccaac 3340741 gcccccaaca acggacccgc caaaaccgga atagcgccaa aaatattcga aataatccaa 3340801 ccaatactga gtatcgccag ttccaagaca accgcgaatt ctaatagcgg aaccagaaca 3340861 aatacgccta acgcaaaacc taccgtggcg aaaaacatat cgatcatccc ggtaaggacg 3340921 agccaaggct cgaaatttac cagacccgtg atcagctcga cgaatcccac agcccaggct 3340981 tcggcgctct tcattatcaa ttcgccgacc tctgtaaatg cttgggcagc catttccaaa 3341041 aacttcgcta attctccgaa tgggaatggg gttatggttg cggcggccac ggcgttgctg 3341101 gcttcattgg cgccgggttt gacgatgacc ggtgccgggc cggtgtgtgg tgtggccacc 3341161 agcgcggcac ccaccaccgc ctcataggcg ctcatcacgg tggccgcctg gacccacatc 3341221 cgcacatagt cggcctcgtt gagcgcgatc gggatcgtgt tgatcccaaa gaaattcgtc 3341281 gccaccaaca ccgcatgcgt gaggtggttg gccgccaact ccggcaacgt cggcatctcc 3341341 gccaacgcac aaacatagcc agccgccgcg gcctcatgct caccggccgc cgccgcgcta 3341401 tccgcactgg cctgcaccaa ccacgccaca tacggcacat aggcggccac aaacaactca 3341461 gcactgggac cctgccacac cccggccccc accgcggcca ccaccgcgct caactcttgc 3341521 gccacagcgg cgtactcggc gcttaacgcg ctccaccccg ccgcggccgc ctgcaacgaa 3341581 cccggccccg gaccagcact tagcagcgcc gaatgcacct ccggcggcga cgccaaccac 3341641 accggcgccg tcacaacgac ccacccgaaa ccagatacgt cgccgccgcc accgcatcac 3341701 cggcggcata accgatccca gactcaccca cagcgacccc ggaacgaccc agctcctcga 3341761 ccccttcgcc cgcgatcgcc gcatgctcgc tacctaaggc gctaaacccc accgcactct 3341821 gcaacgacac cggatccgcc gccggcgcca ccaccgccgt aatcgccggc gccgcgccag 3341881 cgtgtgcggc ggccagccgt gcggtcaacg cctccaccgc cgcactggcg gccgccaaac 3341941 cctcaggaac cactctcagc gtcaccaccc acactccttc cttaggcgtc acacacccgc 3342001 acgaccggtt accgtcacca gcggagcgaa ttattgacac ctgtcttgac gcctgtcttg 3342061 acatgcgtca ggcaatattg atctcacaga tcgttgcgta tgtcaactgt tattgatagc 3342121 tactattacg taggcgtagg tgacggctcc gtaggattcg gggactagcc cgttgcttgg 3342181 gctgcccgac ccccgccccg tcccacgcaa cccggctgcc cgtcgtcggg cgacatcccg 3342241 gtctctatcg gcggacccga gcagccgccc ggctagccag tcgcggccaa ggccagggac 3342301 gtggtgtacg agtgaaggtt cctcgcgtga tccttcgggt ggcagtctag gtggtcagtg 3342361 ctggggtgtt ggtggtttgc tgcttggcgg gttcttcggt gctggtcagt gctgctcggg 3342421 ctcgggtgag gacctcgagg cccaggtagc gccgtccttc gatccattcg tcgtgttgtt 3342481 cggcgaggac ggctccgacg aggcggatga tcgaggcgcg gtcggggaag atgcccacga 3342541 cgtcggttcg gcgtcgtacc tctcggttga ggcgttcctg ggggttgttg gaccagattt 3342601 ggcgccagat ctgcttgggg aaggcggtga acgccagcag gtcggtgcgg gcggtgtcga 3342661 ggtgctcggc caccgcgggg agtttgtcgg tcagagcgtc gagtacccga tcatattggg 3342721 caacaactga ttcggcgtcg ggctggtcgt agatggagtg cagcagggtg cgcacccacg 3342781 gccaggaggg cttcggggtg gctgccatca gattggctgc gtagtgggtt ctgcagcgct 3342841 gccaggccgc tgcgggcagg gtggcgccga tcgcggccac caggccggcg tgggcgtcgc 3342901 tggtgaccag cgcgaccccg gacaggccgc gggcgaccag gtcgcggaag aacgccagcc 3342961 agccggcccc gtcctcggcg gaggtgacct ggatgcccag gatctctcgg tagccctcgg 3343021 cgttgacgcc ggtggcgatc aaggtgtgca ctccgacgac gcggcctgcc tcgcgcacct 3343081 tgagcaccag ggcgtcggcg gcgaggaagg tatacgggcc ggcatcgagc gggcgggtcc 3343141 gaaacgcctc tacggcttcg tcgagctctt tggccatgat cgacacttgc gacttggaaa 3343201 gctttgtcac accaagtgtt tcgaccaggc gctccatccg gcgagtggat actcccagca 3343261 ggtagcaggt cgccaccacg ctggtcagtg cgcgttcagc tcgcttgcgg cgctgcagca 3343321 gccagtccgg gaaatagctg ccctggcgca gcttggggat cgcgacgtcg atggttgcgg 3343381 cacgggtgtc gaaatcacgg tggcggtagc cgttgcgctg attggaccgc tcatcgctgc 3343441 gttcgcggta gcccgccccg cacagggcgt cggcttcagc ccccatcaag gcggcgatga 3343501 acgtcgagag cagcccgcgc agcagatccg ggctcgcctg tgcgagttgg tcagccagaa 3343561 gctgctcggt gtcgataaga tgagaagagg tcattgcgtc atttccttcg attgactttt 3343621 gctggtcgtt tcgaaggatc acgcgatgac cgcccactac tgggctacga cacgcccacc 3343681 ggccttacct gcccgtacac cacacccctg gacgtaactc cagtcgccgg gtttctacga 3343741 gtgatttggc gccgagtcaa gccccggggt tgccgccagt cgacaaccct gaagcgccgg 3343801 cgatggtcgc gctgccgagc acctcgtcac cggctgggtc aggtcggtag agcaccagcg 3343861 tctggccgcg cgccacgccg cgcagcgggg catgcaactg cacgaaaagc gcatcgccga 3343921 tcaattccgc taccgcactg acggtttcac cgtgcgcacg cacttggacc acgcagtcaa 3343981 cgggtcctga cggcgcggct ccggcggtga agacgggagc gcgcccagtc agcgtttgca 3344041 catcaaggtc ggtcacgtca cctacgtgaa cggtggcggt gtcggcgtcg atcgccgtga 3344101 catagcgcgg acgaccattc gggcccggcc cggcgatgcc caggcctcta cgctgcccga 3344161 tggtgaaccc gtgcacccca tcatgggaag ccagcaccac accatccgcg tcaaccacca 3344221 caccacggcg aaccccgatg cgctcaccca aaaaagcctt ggtgttcccg gacggtatga 3344281 agcagatgtc gtggctatcc ggcttgttgg cgaccgccag gccgcggcgg gccgcctcgg 3344341 cacggatctg ccgcttcggc gtgtcgccga tcgggaacgc ggcgtggcgc agctgctgcg 3344401 cagtgagcac ggcaagcaca taagactgat ccttgtcccg gtcgacggcg cggcgcagcc 3344461 gcccacccga cagccgggcg tagtggccgg tggccaccgt atcgaaaccc aacgccacag 3344521 ccctggcgga cagagcagcg aacttgatct gctgattgca ccgcacgcaa gggttcggag 3344581 tttccccgcg ggcatacgac gacacgaagt cgttgatcac gtcctctttg aacttctctg 3344641 cgaaatccca aacataaaac gggattccga gcacatcggc gacgcggcgc gcgtctgcag 3344701 cgtcctcttt ggaacaacag ccccgcgagc cggtgcgcag cgtgccgggc gcggtcgata 3344761 gcgccatgtg cactccgacc acctcgtgtc cggcatcgac catgcgggcg gcagcaacag 3344821 acgagtcgac gccaccgctc atcgcggcga gaactttcat cgggatgctc ccgcggcggc 3344881 tagggcggcc cgccgtgcac gtgccaccgc cccgggaagc acctccaacg cggcatcgac 3344941 atcagcctca acactggtgt gccccagcga gagacgcaat gatccgcggg cgctggccgc 3345001 gtcgacgccc attgcaatca acacatgcga gggctgcgct acacctgccg tgcaggccga 3345061 tccggttgag cactcgattc cgttagcgtc caacaacatc aacagcgcat cgccttcgca 3345121 gccacggaaa gtgaagtgcg cgttacccgc tagccgcatc gggtcatcgg cgccgttaag 3345181 gcaaacatcg tcaatctcag ccagcacacc ctcgaccaga cgatcccgca gcagccgtaa 3345241 ccgcgcgctg ttttcctcga gtccgtccac cgcgatctgc gcggccgtcg ccattccaac 3345301 tgcactggcg acatcgggtg tgccggaacg aatatcgcgc tcctgcccac cgccgtgcat 3345361 aaggggcacg caggtgacgt cgcggcgcag cagcaacgca cccactcctg gcgggccacc 3345421 gaatttgtgc ccggccacgc tcatcgccga cagcccgctg gccccgaagt caagcgggag 3345481 ctgtcccacc gcctgaatgg catcactgtg catcggcacg ccgaattcca tggcgacaac 3345541 tgacatttcg gcgatcggta gaatagttcc gacctcgttg ttggcccaca tcaccgatac 3345601 cagcgcgacg tcgtcgtggc tctgcagtgc ctcgcgcagc gcagttgccg acaccgagcc 3345661 gtcggcggcg gtcggcagcc aggtcacatg ggcgccttcg tgttccacga gccagttcac 3345721 cgagtccagt acggcgtggt gttccacctc ggtggtgacg atgcgacggc ggtgcggctc 3345781 cgcatcgcgg cgtgcccaat agataccttt gacagccagg ttgtcgcttt cggtgccgcc 3345841 cgcggtgaag atcacctcgg acggacgagc gcctagcttg tccgcgatca gctcacgggc 3345901 ctcctcgatc cgccggcgcg ccgagcgccc gctggtgtgc agcgacgacg cattgccgat 3345961 ggtgcgctgc acggccgcca tcgcctcgat ggcggcgggg tgcatcgggg tggtggcagc 3346021 gtgatccagg taggccatga cgcacctaga atactggccc gggcggcgac gcagaacgtg 3346081 cgcgcaggcc acggccgcag cagcggctgg gcaatctggc tggggccaga ccacttaggt 3346141 cgccggcacg tgccggcggc ctgggcgttg ccccgactgc cccaaggctc ccgcaagcac 3346201 cgctgactgg caacggcgcg cgagattccg acgatcggta cctggcagct gcaaagactc 3346261 gacgcgcacc caggccagcg tgcggcgcac ggtcagcagc cggcaaaccg accgcaccaa 3346321 ggtgtcgtcg ccgacaaagg ccggagcggt cgagacggtg ccgtcgacgt ggtgatatgt 3346381 caaccggagt ggctgcaccg ggcggccggc atcgattgcg gcctggaaca tcgccggata 3346441 gaaagcccca caaccgcgat gcgagcaccc ggctcctgct cgggccgctg gacggccggc 3346501 atcgtcgccc ggccgaccgc accaggtggt gccctcgggg aaggccacca ccgtctgacc 3346561 ggcgcgcagc cgacgcgcga tggtatcgac aaccccggga agccgccgca ggctggctcg 3346621 ctcgatcgga atgatcttca gaatgcgcgc cacgatccct atagtccgtc cggtgaacat 3346681 gtcggcgcgc gcgacgaacg acccgggcaa caccgaaccg atgcagaaga cgtccaacca 3346741 ggacacgtgc ccgctgacca ccaggactcc gcgcaggttc cgaactggac tacccgacac 3346801 cgtgatccgg acaccgaaaa ggcgcagcac caaccggcag tagatgcgtt gcacccgcgt 3346861 tcggcccggc agtggcatca ccaccagcgg cactcccggt accaggagca gagccaacat 3346921 gacgcgaagc gctacccgca gcaccaccag cggccgccgc acctgcgcag cgtcgccgac 3346981 actcacgcag ctgacgccgc acgttgcgcg gggcaaccag gagtgttcgg tgactgcggg 3347041 agcgctcatc gcgcgtcgtt caccatttcc gaggccgccg caaccgaccg cagtcgtcgc 3347101 agatatcgcg tatcggcgtg gtccttatcc agtagcaggc agaagtcgcc cacgccaaag 3347161 tccgggtcgt gcgccggctc cccgcaggcc cgcgcgccca gtctcaggta accgcgcatc 3347221 agcgggggaa ctgctggccg tggcggaggg agaatgtcgt cgagggacct cccgtccacg 3347281 cgcaccggcc ggtaggggta cacctggcac tgcggcggcg cggcatgccg gttgaggatg 3347341 aagtcgcgca ccccacgcag ccggctgccc ggcgtttcac cgtctccccc gattggtact 3347401 gacacacatc cggtcacata gtcatagccg tatcggtcca ggtaggccag gatgcccgcc 3347461 cacatcaaca acaccacccc accgttgcgg tgaccctcgc gcaccacggc gcggcccatc 3347521 tccaccaacg acggccgcag cggatcgaac gcgcaaacgt cgaattccgt tgcggtgtag 3347581 agtcctccgg cggcgatggc acccgccggt gccagcatcc ggtagcaacc caccagctca 3347641 ccggtgtcgt cgtcgcggac cagcaggtga tcgcagtact cgtcgaaccg gtcgccatcc 3347701 cggcgcgtat ccgcggccgc cggcagtgcg aagcctggcg tagtgctgaa cacgtcatag 3347761 cggagccgct gcgccgcctc gaccatgctg ggatcggtgg atagcaacag ggaatagcgc 3347821 ggtccggttg acgatcctgt cgcgacgcca tgcggtttgt cactgggtac cagcgcagaa 3347881 gcgatgctca tagcaccaac gtggcgcagc cgatcagcta atcggcatca acgttgtgac 3347941 gtgtcggtgc acgtcagatg acgaactgtt gggctaggtg agcaggcgcc aaggcccccc 3348001 acgcctcggc gtgtcggggt cttttgcgac tgctcgcgca gggaacctag cccttgcggg 3348061 ccttgatgac ctcggtcagc tgcggagcga ccttgaacag gtctcccacc accccgtagt 3348121 cggcgatctc aaagatcggc gcctcttcgt ccttgttgac cgcgacgatg gtcttggacg 3348181 tctgcatgcc agcgcggtgc tggatcgccc cggagatgcc cagggcaatg tagagctggg 3348241 gcgacaccgt cttgccggtc tggccgacct ggaactggcc cgggtagtag ccggagtcga 3348301 ctgcggcacg cgaggccccg accgcggcgc ccagcgagtc ggccagcgcc tcgaccacgc 3348361 tgaagttctc cgcgctgccg acaccacggc caccggccac cacaatggtc gcctcggtca 3348421 gctccggccg gtcgccggcg accgccggtt cgcgcgcggt gatcctggcg gcgttctccg 3348481 ccgcagccgg cacttccacg ctgacctgct caccggcgcc ggcggccggc tccgcctcca 3348541 cggctcctgc gcgcacggtg atcaccgggg tgtcgccgtt ggcctgcgct tcgacggtga 3348601 acgccccacc gaagatgctg tggacaccca ctccaccttc tctcacgtcg accacgtcga 3348661 ccagcagacc cgagccgatc cgagccgcaa gtcggccggc gatctccttg ccgtccgcgg 3348721 tggcggcgat tagtacgccg gcaggggccg aggactcggc cagcccggcc agcacgtcga 3348781 ccgccggggt gatcaggtat ttgtcgacaa ggtcggactc ggcgacgtag atcttggcgg 3348841 caccagccgc cttaagcccg tccaccagcg gcgcggccgt ccccggcaca ccgacgacga 3348901 cggcggctgg ttcgcccaag gcgcgggcgg cggtgatcaa ttcggcgctg accttcttta 3348961 acgcgccttc agcgtgctca acgagcacca gtacttcagc catgggttat atcgctctcg 3349021 tctttgggag gtgcgtatgt cttagatgat tttctgggca accaggtact gcacgatctg 3349081 gttgccgcct tcaccctcgt cggtgacctt ctccccggca gtcttggccg gtttgggcgt 3349141 cgacgccagc acggtggatc cggcgttggc cagccccacc tcgtcgctct cgacaccgat 3349201 ctcggccagg gtcagcacgg taacttcctt cttcttggcg gccatgatgc ctttgaagga 3349261 cgggaagcgc ggctcgttga tcttctcgtt cacgctgatc accgcgggca gcgtggcctc 3349321 gagggtgaat acgccctcat cggtctcacg ctcgccggtg atcttgccgc cctcgatcga 3349381 cactttgcgc aggtgggtga gctgcggcag gcccaggtac tcggcgatga tggccggcac 3349441 cgcaccgccc accccgtcgg tcgattcgtt gcctgcgatc accagctcgg tgccctcgat 3349501 ggtgcccaac gcgcgcgcca aagcccaccc ggtttggatg acgtccgagc cgtgcatgcc 3349561 gtcgtccttt aggtggacgg ccttgtcggc acccatcgac agcgccttgc ggatcgcctc 3349621 ggtggcgcgc tcggggcccg ccgtcagcac ggtcaccgac ccttcgatgc cgtcggcggc 3349681 ctctttctcc cgaatctgta gcgcttcctc cacggcgcgc tcgttgatct cgtccagcac 3349741 cgcgtcggcg gcctcgcggt ccagcgtgaa atcgccgtcg gtcagcttgc gctccgacca 3349801 ggtatctggg acctgcttga tcaggaccac gatgttcgtc atgactgtgg ttcgtcctcc 3349861 tcgaaggcgg cccgcagcgc tcgactgcgg aacctcggtc acacgttttg caaccgcaca 3349921 gcgatattac tattcggtaa gttcgcgtgg tgcgccctca caccatagcg ggtggtagag 3349981 caggttccca cgcctgtgcc tcgcccacga ccggcggata ctcccggtgc ccggttcgcg 3350041 aatccgatgc cacgggttag cctgccttaa caatgtgcgc attcgttccc cacgttcccc 3350101 gccatagccg aggcgacaac ccgccgtcgg cctccacggc tagccctgcg gtgttgacgc 3350161 tgaccggcga gcgcaccatc cccgatctgg acatcgagaa ctactggttt cgccgccacc 3350221 aggtcgtcta ccagcggctg gcaccccgct gcacggcccg cgacgtgctg gaagccggct 3350281 gcggcgaggg atatggcgcc gacctgatcg cctgcgtcgc tcgccaggtc atcgcggtgg 3350341 actacgacga gactgcggtg gcccatgtcc ggagccgcta tccccgagtg gaggtgatgc 3350401 aagcaaacct ggccgagctg ccattgcccg acgcgtcggt agacgtcgtg gtcaacttcc 3350461 aggtcatcga gcatctgtgg gatcaagccc gattcgttcg cgagtgcgcc cgggtactgc 3350521 ggggctcggg actgttgatg gtgtccaccc ccaaccggat caccttttcc cccggccgcg 3350581 ataccccgat caacccattc cacacccgcg agctcaatgc cgacgagctc acttcgctgt 3350641 tgatcgacgc gggattcgtc gatgtggcca tgtgcgggtt gtttcatggc ccacgcctgc 3350701 gcgacatgga cgcccgccac ggcggctcca tcatcgacgc acagatcatg cgggcggtgg 3350761 ccggcgcacc gtggccaccc gagctagccg cagacgtcgc ggcggtcacc accgccgact 3350821 tcgagatggt ggcagcgggt cacgaccgtg acatcgatga cagcctggat ctgatcgcga 3350881 tcgcggtgcg gccttgaaca cgtccgcaag cccggtgccc ggcctgttca cgcttgttct 3350941 gcacactcac ctgccctggc tggcccacca cgggcgctgg ccggtcggcg aggaatggct 3351001 ctatcagtcg tgggcggcgg cctacctgcc gctgctgcag gtgctggccg cgctggccga 3351061 cgagaaccgg caccggttga tcaccctcgg gatgacgccg gtggtcaacg cccagctcga 3351121 cgacccatac tgcctcaacg gtgtgcatca ctggctagcc aactggcagc tgcgcgccga 3351181 agaggccgcc agcgtgcggt atgcccgtca gtcgaagtcg gctgactatc cgtcatgcac 3351241 accggaggcg ttgcgggcct ttgggattcg cgaatgtgcc gatgcagctc gcgcgctcga 3351301 caacttcgcc acgcggtggc ggcacggcgg cagcccactg ctgcgcggcc tgatcgacgc 3351361 cggcacggtg gagctgctcg gtggcccact tgcccacccg ttccagccgc tgctggcacc 3351421 gcggctgcgc gagttcgcgc tgcgcgaagg cctcgccgat gctcagctgc ggctggcgca 3351481 ccgcccgaaa gggatctggg cacccgaatg cgcatacgcc ccggggatgg aggtcgacta 3351541 cgccaccgcg ggggtcagtc acttcatggt cgacggcccg tcgctgcacg gcgacaccgc 3351601 gctgggccgg ccggtgggga aaaccgatgt ggtcgccttc ggtcgcgact tgcaggtcag 3351661 ctaccgggtg tggtcaccga aatccggcta ccccgggcac gccgcctacc gcgacttcca 3351721 cacctacgac cacctgaccg gactcaaacc ggccagggtc accgggcgta acgtgccgtc 3351781 ggagcaaaag gcaccctacg atcccgagcg cgctgaccgc gccgtcgacg tccatgttgc 3351841 cgatttcgtc gacgtggtgc gcaatcggct gctctccgag tccgagcgca tcggccggcc 3351901 cgcccacgtg atcgccgcct tcgacaccga gttgttcggc cactggtggt acgagggccc 3351961 aacctggctg caacgggtat tgcgggcttt acccgccgcc ggtgtccggg tgggcaccct 3352021 gagcgatgcg atcgccgacg gattcgtcgg cgacccggtc gaattgccac ccagctcttg 3352081 gggttccggc aaggactggc aggtgtggag cggtgccaag gtggccgatc tggtccagct 3352141 caacagcgaa gtggtcgata ccgcgttgac caccatcgac aaggcgctgg cccagacagc 3352201 gtccctggac ggaccgctgc ctcgcgatca cgttgctgat cagatcctgc gcgagaccct 3352261 gctcaccgtg tccagcgact ggccgttcat ggtgagcaag gactccgccg ccgactacgc 3352321 ccgctatcgt gctcacctgc acgcacacgc cacccgggag atcgccggcg cgctggccgc 3352381 gggccgacgc gacaccgcac ggcggctcgc cgaagggtgg aaccgcgccg acggtctgtt 3352441 cggcgccctg gacgctcgga ggctgcccaa gtgaacgcct cgcacaggcg gaaccggccg 3352501 cgcgcatgag gatcctcatg gtgtcgtggg agtacccgcc ggtggtgatc ggcggactcg 3352561 gccgccacgt gcatcatctg tcgaccgcgc tagccgcagc cggtcacgat gtcgtcgtgt 3352621 tgtcccggtg tccgtcgggc accgatccca gcacacaccc atcctccgat gaggtgaccg 3352681 aaggggtccg ggtgattgcg gccgcgcagg acccgcacga gttcacgttt ggcaacgaca 3352741 tgatggcctg gaccctggcg atgggccacg ccatgatccg cgccgggctg cgcttgaaga 3352801 aacttggcac cgaccgctcg tggcgtcctg acgtcgtgca cgcacacgac tggctggtgg 3352861 cccatccggc catcgccctt gcccagttct atgacgtgcc aatggtttcc acgattcatg 3352921 caacggaggc cggtcgacat tccggctggg tctccggagc tctcagccgt caggtgcacg 3352981 cggtcgagtc gtggctggtg cgtgaatccg attcgctgat cacatgctcg gcgtcgatga 3353041 acgacgagat caccgagctg ttcgggcccg ggctggccga gatcaccgtg atccgtaacg 3353101 gcattgacgc ggcgcgctgg ccgttcgcgg cccgccgccc gcgcaccggg ccagccgaat 3353161 tgctctatgt ggggcggctg gagtacgaga agggcgtgca cgacgccatc gccgcgctgc 3353221 cgcggctcag gcgcactcac ccaggcacca cactgaccat cgccggcgaa ggcacccagc 3353281 aggattggtt gatcgatcag gcccgcaaac accgggtgct cagagcaacc aggttcgtcg 3353341 gacacctcga ccacaccgag ctgctggcgt tgctgcaccg agccgacgcc gcggtgctgc 3353401 ccagccacta cgaaccgttt gggctggtgg cactggaggc cgccgcggcc ggcaccccgc 3353461 tggtgacgtc caacatcggc ggtctgggtg aagcggtcat caatggacag accggggtgt 3353521 cgtgtgcacc ccgcgacgta gcggggctgg ccgccgcggt gcgtagcgtg ctcgacgatc 3353581 cggccgccgc gcagcggcgc gcacgagccg cccggcaacg gctcacctcc gacttcgact 3353641 ggcagacggt ggccaccgcg accgcgcagg tgtacctggc ggcgaagcgc ggtgaacggc 3353701 agccgcagcc ccggttgccc atcgtcgagc acgctcttcc cgatcggtag ccgtggcagg 3353761 gacgtgatga tcggagcacc gcagtgaaac cgcaggacca ggggctccac ttcccctatc 3353821 gctacgacct tcgactggcg cctatgtggc taccgtttcg atggccgggc agccaaggcg 3353881 tgaccgtgac cgaggatggc cgcttcgtcg cacgctacgg gccgtttcgc gtcgaggcgc 3353941 cactgtctag cgtccgcgat gcgcacatca ccggcccata ccgatggtgg acagcggtgg 3354001 gcccccgact gtcgatggtc gacgacggac tcacgttcgg aaccaacgca gctgccggtg 3354061 tctgcatcca cttcgagccg cggatccacc gcgtgattgg actgcgggac cattcggcgc 3354121 tgacagtgac cgttgcggac cccgaagggc tggtcgccgc gctcagcagc tagttcgccg 3354181 agcgccccgt gctgggcaca acccgactcg gcctggaggc gcctgcatcc aagccgcacc 3354241 ggcgcacaat tatctgccgg aggtcaaccc cctttatcga ttcggtatcg aagacgccgt 3354301 ttgacatgcc atgatcggcg aattcgcagt ttcagatgcc agggaggcga catggctcac 3354361 tcgatcgttc gcacgctgct ggcctcaggt gccgccacgg ccctgatcgc cattcccaca 3354421 gcctgctcgt tttcgatcgg aacgtcgcac tcgcactcgg tgagcaaggc cgaggtcgcc 3354481 cggcagatca ccgccaagat gacagacgcc gccggcaaca agcccgaatc ggtgacgtgc 3354541 ccaagcgatc tcccggcaga ggtcggggcc gagctgaatt gcgaaatgaa gatcaaggac 3354601 cgcacgttca acgtcaacgt caccgtgacc agtgtcgacg gtagcgacgt caagttcgac 3354661 atggtggaga ccgtcgacaa gaaccaggtt gccaacatca tcagcgacaa actgttccag 3354721 cgggtgggcg ccaggcccga ttcggtgacc tgccccgaca atctaaaggg cgtcgaggga 3354781 gccaaactgc ggtgtcgact gaccgacggc agcaaaacgt atggcatctc ggtgattgtc 3354841 accagcgttg acgccggcga tgtcaacttc gatttcaagg tcgatgacca ccccgagtag 3354901 gctcaccgtg gaatcggctg cccggcagcc aatttcgcgt acccgatgtg gatggtcgcc 3354961 ggagcaccat cggctttagg gtgctcgggg ctagcgggcc gccttcttgc gttcgatgtc 3355021 ggccaaggca gcggctagct cagcgcgctg cgcggccgat gcctcccagg acagctggcg 3355081 gttcttgacc accttggccg gcgccccgac cgcgatcgaa tagtcgggaa ttgcgccgcg 3355141 gaccaccgcg tgcgagccga gcacgcagcc ccgtccgatg gtggtgccgc gcagcacgct 3355201 caccttcacg ccgatccagg tgtcgggccc gatccgcacc ggactcttga tgatgccctg 3355261 gtctttgatc ggcagcgtga tgtcgtccat ccggtggtcg aaatcgcaga tatagcacca 3355321 gtcggccatt agcaccgagt ccccgatctc gatgtcgaga taggtgttga tgacgttgtc 3355381 ccggcccagc accaccttgt cgccgaaccg cagcgagccc tcgtgggcac ggatcgtgtt 3355441 cttgtccccg atgtgcaccc agcggccgat ctccagttgc gctagttccg gtgtcgcgtg 3355501 gatctccaca cccttgccga gaaacaccat gccgcgggtg atgatgtgcg ggttggccag 3355561 cttgaacctc aacagccgcc agtagcgcac caggtaccac ggagtgtagg cgcggttggc 3355621 aagcacccat ttcagcgatg ccagcgtgag gaacttggcc tgacgtgggt cgcgcagccg 3355681 cgatcctcgc cacctgcggt gaagcggagc accccacatg gttgtcattg gcgcagagct 3355741 tagcttagct gtcggacctg tttgggcgta tcggcgcatc tgagaatgcg catcggcgcg 3355801 cgaggtgacg ccggtggccg cccccgcggg ggcggtgatc ggcacccggc cccacaccac 3355861 acccgatgac gagcccaaac tgaggacgtt cacgacactt acaccacgta cacgacacgc 3355921 ccacggacaa ccgggaaccg ccaccggcca aggacgcgag gaaccgaatc tcgcccgcct 3355981 tgccagaatg tacgtggtga cccgagccgg gcaaccatta cacagttggc cagcactgac 3356041 caacttcatt tgtagcggtt accctcacct gtactcattc ggccgggccc gccgatgagc 3356101 gacccacgta gcggaaggat ctgggaacct gcgaaaggat aaggcgcttg cgcgacgcct 3356161 tccggcggcc gttgcggccg ccgtaatcgc ggtcgagctg ggcggttgcg gaagtgccga 3356221 ctcgtgggta gaagcggccc ccgcacaagg ctggcccgca caatacggcg acgccgccaa 3356281 cagcagctac accacgacga atggcgccac caatctcacg ctgcggtgga cgcgttcggt 3356341 caaaggaagc ttggctgccg gaccagccct gagcgcacgc gggtacctcg cgttaaacgg 3356401 gcagaccccg gccgggtgtt cgctgatgga gtggcagaac gacaacaacg gccggcagcg 3356461 ctggtgtgtg cggctggtcc agggcggcgg cttcgccggc ccgttgttcg acggcttcga 3356521 caacctctac gtcggccagc cgggagcgat aatctccttt ccgccgaccc agtggacgcg 3356581 ctggcgccag cccgtgatcg ggatgccgtc caccccgcgg tttctggggc atggccgcct 3356641 gctcgtgagt acacacctgg ggcagctgct ggtattcgat acccgccgcg gcatggtggt 3356701 cggcagtccg gtggacctgg tggacggcat cgatcccacc gatgcgacac gcggactggc 3356761 cgactgcgcg ccagcccggc cgggctgccc ggtcgcggcc gcccctgcgt tctcgtcggt 3356821 caacggcacg gtggtggtca gcgtctggca gccgggcgaa ccggccgcga agctggtcgg 3356881 gctgaaatac cacgctgagc aactcgtccg cgagtggacc agtgacgctg tcagcgcggg 3356941 cgtgctggcc agcccagtgc tctccgccga cggatcgacg gtctacgtca atgggcgcga 3357001 ccaccggcta tgggcactca acgccgccga cgggaaagcg aagtggtcag ctcccctggg 3357061 ctttctggcg cagacgccgc ccgcactgac cccacatgga ctgatcgtgt ccggcggggg 3357121 ccccgacacc gcgctggcgg cgttccggga tgccggtgat cacgccgagg gggcctggcg 3357181 acgcgacgac gttactgcgc tgtcgaccgc gagtctggcc ggcaccggcg tcggctatac 3357241 ggtcatcagc ggtccaaacc acgatggcac gcccggtttg tcgttgctgg tcttcgatcc 3357301 ggccaacggc cacacggtca acagctatcc gctacccgga gcgaccggat atcccgtcgg 3357361 tgtatcggtc ggcaacgacc gccgcgtggt gaccgccacc agcgacggcc aggtctacag 3357421 cttcgcacct tagattgcca gcggcggaat ggcgctgcgc ggcacctggg cttggcaagc 3357481 gccgacaaac gacggcagca gctcaccctg ggcgaagtag aaaatcagac tgtcgtcggt 3357541 gatagcaaag ttctggtagt gagccgggtc gaggccggtc gaaggcaata tcgcggcacc 3357601 gaaaccggtc tgacgtgcca gctcgcgctg aacgatgggg tagatgctgt ccagtggcgt 3357661 ggtgccgggc acgaacaacg tgtcgaaggt gatgggctgc gaggtcgcga ggttgtagtt 3357721 gaaggccttg taccaggtgg acggatgtgc cccaccgagg tcctggaaga atttgagcac 3357781 tacgctgcgg gtggcctgcg gcggctggcc ggagctgtgc tgttcgctgg tggcgtccat 3357841 ttggtagggc tggtctcgca gcggggaccc ctgcgcgacg ttgacgaacc cgtcgcggtt 3357901 ttgcgtgatg tagtcggtca gcgcccgctg gtcgggatag tcgacaggaa atgtcatatc 3357961 cagcatgtac ttagggcccg aggcgtgcac atggcagatc tggccggcct gcacagtgcc 3358021 gcccaggccg gcgcatgacg gcggcgcacc agccgccggc cagcccacca ggaccacagc 3358081 aacgagcact gcggtcgcta tcagataacg catcgtctaa tcgtcctcgc agaccaaagg 3358141 cgttggcgca gcttaggggt gaccgccgcc agcctacccg tgccgctacc gggacggccg 3358201 acacacatag gcggtcacat ggctcaagga cccggcacca atgcgggtga tgaccaccgc 3358261 cagcggtctg ctgccccgca gccggagccg tcgccgcaga gcgtcgggat cgatcgcaac 3358321 gccgcgcacc aggatttcgg ctgccccgca atccagcgct gacagcacct gacgcagccg 3358381 acgctcgtcg aaggccagct gctcgagcac ctcgaacccg cgcaacgcag gcggcagccg 3358441 gtcaccggac aggtaagcga tttggggatc gagctgccac agcccatgcc gggcgccgta 3358501 gttgcgtacc aggccggcac ggacgacggc gccgtcgggg tcgacgatcc atttcccggc 3358561 gggccgcaca ccgcagtcgt cgggctcgtc gtcaccgatt tgttcaccgg aatcgaggat 3358621 gctggctcga cggcggatac ccgatccggc caacccggcc gaccaaagac atgcttctcg 3358681 aaccccaccg cggtatgaga tcacctcgat ctcgccctcg aaaccgagcc ggcccacctc 3358741 ctcgaaatct attccgggag cgcacttgac gaccacatca cggccgcggt agcggtccag 3358801 tagggggccc aggccgggct ggtagtcggc gaggtggaag cgtcgccgcc cgttgctgcg 3358861 acgcgccggg tcgatgacga cgaccgcgtc gcgggtcacc ggatgcagca catcggcgcg 3358921 gcacaggtca gcttccattc ccagggcggc caggttgtgg cgcgccatgg ccagccgcac 3358981 cgggtcgata tcgctgccga ccgcccggac agctagctcg cgcagcgcgg ccagctcggt 3359041 gccgatggag caggtcgcgt cgtgcactac ccgaccggcc agtcgcctgg cccggtgccg 3359101 ggccacgggt gctgcggtag cctgctgcag cgcctcatcg gtgaatagcc attgcgacac 3359161 cccaacgttc ggacacagct cgcccagttt gccggcggcg cggcggcgca gcagcgtggt 3359221 ctccaccagc cacggcgccc gagcgccaaa ccgggcgcgc accgcggcgg tgtcggcaat 3359281 gcgagtggca gcggtcagct cgagctctgc gaccgcggcc agcgcaaccg cacccgattc 3359341 tgaccgcaga tagctgacgt cggcggtggt gaacgtgaga ccagtgaagc cgctggtcag 3359401 gacggcttaa ccccggtaat catcacgttg tagaaccagc ccttcggcac cacatgccgc 3359461 cagacgttgg cgtccaccca gcccagagtc ttccagctgg tgaaggcgaa gcgcgcccaa 3359521 ccccagccca gccgccctgg cggcaccgtg cactcaaacg tgcgcaacgg ccaacccagc 3359581 atcgccgcgg tgaactcctc ggtggcggtt tggacctcga ctgcacccgc gttgtgcgcg 3359641 atccgctgca ggtcctgggg cgtgaatgtg tgcaggtcga ccagggcctc cagcgccgcg 3359701 gcgcgcgagg actcatcgag ctcgccttgt ggtcggcgcc agcctctcag gccgggcagc 3359761 ttggtagcgt tggtgacgac acgccaggtc agcgtggaca gtgtgcgagc gtagccgtcg 3359821 ccgacggtgg tcggctcgcc ggcgaacacg aagcgcccgc ccggcttgag tacccgaacc 3359881 acctcccgca acgacagctc gacgtcggga atgtggtgca gcaccgcatg cccgaccacg 3359941 aggtcgaaag cgtcgtcgtc gtacgggatg ccctcggcgt cggcgacccg gccgtcgatg 3360001 tctagcccca gcgcttgccc attgcgggtg gcgaccttga ccatgccggg tgagaggtcg 3360061 gtgaccgatc cacgccgggc aacgccagcc tggatcaagt tgagcaggaa gaatccggtt 3360121 ccacagccca gttccagtgc gcggtcgtag ggcagctgcg cgatgacctc atcaggcacg 3360181 atcgcgtcga accggccgcg ggcgtagtcg acgcaacgct ggtcataaga gatcgaccac 3360241 ttctcgtcgt agttctcggc ttcccagtcg tggtagagca cctgggcgag cttgctgtcg 3360301 tgccgagctg cggccacctg ctcggccgtg gcatgtggat tgggagtggc gtcggcgggg 3360361 atgtttgaac tcctcgtcat ataggcgagc ctaacggccg ccccggtcac ccttgctgcc 3360421 accaccttga ccagcggcga acacctcgac atagcgccga cgctcagcgg cgatccgctc 3360481 ggccggcgcc agctcgtaga cgtcgctgat cccggctttg gccgcggcca gcgcgtgcgg 3360541 cgggccgtca agaaagcgcc tcgcccaggc cgccgcggcg tcgtaaacgt cgtcgggggc 3360601 caccatgtcg tcgatcaggc ccagcgccaa ggcctcctcg gcgtcgaaga agcgcccgct 3360661 gaacaccagc tccttggctc tgctcggacc ggccgcacgg gtcagccggg ccattccgtc 3360721 gccgctgggg atcaggccgg ccaggatctc ggtcgcgccg aatttcacgt tgtcaccgct 3360781 gactcgccaa tcggcggcta gggccagcgt aaggccggca cccaacgcgt atccggtgat 3360841 ggcggccacg gtcggcttgg ggatcgccgc aacggcgtcg acggcctgct gccgaatccg 3360901 ggcggcggtg tcggcctcct gcgcgctcaa tgtccgcagt tcgggcatgt cgtcgccggc 3360961 ggagaagatt tcgtggccgc catacaggat cactgcggcc acgtcgtcgc gtcgccccag 3361021 ctcgttggcc gcggcgacca cttcccggta gacctggcgg gtcatcgcgt tggtaggcgg 3361081 tcgcgatagg agcaacatgg ccaggccggc atcctgggag ccgtcactga ccacgacgtt 3361141 gacgaactcg ggcaccgttg gcgtcacagc ggccacccgg tcgatccgcc cgatctccgg 3361201 gcctggttat agcggtcgga atcgaagaac tcgatctccc agttgtcgcc gttgcgggcc 3361261 agctgtggct gcaccgggac gatttggcgt tcgacggcca gcacgtcggc aacggtatga 3361321 ccggccagcg agtccagttg cgtccaggtc ggcggcagca agaagttgcg gccggcggcg 3361381 aagtcggcga tagcgtcggc tggcaacacc caaccagccc ggtcggattc ggtgttctcg 3361441 ccgtcggcgc gctgaccttc aggtagggca cccacaaaga agtaggtgtc gtagcgccgg 3361501 gtcagttcgg cctccggggt gacccagttg gcccagggcc gtagcaggtc ggatcgcagc 3361561 accagctttt cccgctgcag gaagtccgcg aaggacagcg tccggtcggc cagtgcgcga 3361621 cgcgcgtcgc cgtacaccga ggcatccgag acgatgctgt tcggtgccga atggtcctga 3361681 tcgaccggcc cggcgaatag cacccccgac tcctcgaacg tctcgcgggc cgccgcgcag 3361741 accaaggctt cggcgagatc aggctcgatg ccgaaccgct gcgcccacca ctgcggcggc 3361801 ggaccggccc atgcccccag ccggcccaag tcggcgtcgc ggtcgcggtc gtcgactccc 3361861 ccgccgggaa acaccattac cccggcggcg aaatccatcg cagcgtgccg ccgcatcaag 3361921 aagacggcca gaccggacgc tgatccggcg tccgggtcgc ggaccaacat cacggtcgcc 3361981 gccggcctcg gtgtaggcgg gggtaccagt ggctcgcgag gtgaattcat gactgtctcc 3362041 gatgggctgc tcggctgcga cgccgtcggg cgaaatatcg cccgtcggcc acctccagcg 3362101 tgatctcctg gccaaacgcg gtggacaggt tctcggcggt cagcgcgtcg ggaagcaagc 3362161 ccgcggcaac cacccgggcc tccgacagca gcaggcaatg gctgaagccg ggcggaatct 3362221 cctcgacgtg gtgggtgacc agaaccagcg cgggcgcgtc agggtcggct gccaggtcgg 3362281 ccagccgggc gaccaattcc tctcggccac ctaagtccag gccggcggcg ggttcgtcga 3362341 gcagcagcag ctctggatct gtcatcaaag cccgcgcaat cagcactcgc ttgcgctcgc 3362401 cctccgacag tgttccgtat gtgcggttgg ccaaatgctc agcgcccagg ctctccagca 3362461 tgtcgatcgc gcggtggtag tcgacggcct cgtagcgctc gcgccaccgg cccaacactg 3362521 catagccggc ggagacgaca agatcgcgga cgcgttcgtc gccgggcacc cgctccgcca 3362581 gcgccgagga actgagcccg acccgagcac gcagttccga gacgtcaacc cggcctagcc 3362641 gctcaccgag cacaaaggcc acccccgacg acggatgctc agccgcggcg gcaatgcgca 3362701 gcaatgacgt cttgccggcc ccgttggggc cgacgatcac ccagcgttcg tcgagttcga 3362761 ccgcccaatc cagcgggccg accagcgtgc gcccattacg gcgcagggac acgtttcgga 3362821 agtcgatcag caggtcgggg tcggccgcat cagggccgcc gttgtcgagc acccgactat 3362881 cgtgccgcat gctccgcgag caacctagtc ggccgggatt tcgacgcgac gcaccccaca 3362941 gtcgccggcg tcggcggcct cgatctcacc gcgagtcacg cccaacaaga acaacaccgt 3363001 gtccaggtac ggatggctca acgacgcatc ggcgacctca cgcaacgccg gcttggcgtt 3363061 gaatgctatt cccagcccgg ccgcgcccag catgtcgata tcgttggcgc cgtcgccgac 3363121 cgcgacggtc tgctccatcg gcaccccata ctggctcgcg aagtcccgaa gcgccttggc 3363181 cttgccgggc cggtcaacaa tcggccccac gacccggccg gtaagaatgc catcgacgat 3363241 ttccagctcg ttggacgcaa cgaaatccaa catcaactcg cgtgcgagcg gctcgatgat 3363301 ccgccgaaag ccgccggaaa ccacaccgca gcgaaaaccc agacgccgca aggtccggat 3363361 cgtggtccga gcaccgggca tcagttcgag ctgctcggcg acgtcgtcta tcaccgtcgc 3363421 gggcagcccc gccaaggtgg caacacgacg ctgcagcgac tcggcgaagt ccagctcacc 3363481 acgcatcgcg gcctcggtga tcgcggcgac ctgtccctgg gcacccgcac gggctgccag 3363541 catctcgatg acctcgcctt ggaccagagt ggagtcgacg tcgaagacga tcaggcgttt 3363601 ggtgcgccaa gccaagccgt agtcctcgac ggccacatcg acatgctctt cggcggccac 3363661 cttggtcagg gcgatctgca gcggacccac gcatccaggc ggcaccgaga cccgcaactc 3363721 caggccggtg accgggtagt cggaaatgcc gcggatgaag tcgatgttga cgccgagtgc 3363781 ggccacttcc ctggccaccg cgctgaacgc tccggcggta atcgggcgtc ccagcacgaa 3363841 aatggtgtgg gtggacggtt gccgaatgat tggcagatcg tcgctgcgct cgatggcgac 3363901 gtctagaccc accccgtgga tggcggacgc gacgtcgtcg cgcagcgcgg taccgtcggc 3363961 aacgtccagc gggcacgaca ccagcacacc cagcgtgagc cggccccgga tcaccacttg 3364021 ttcgacgttg agcagctcga ctccgtgctg cgcgagcacc tcgaagagcg cggatgtcac 3364081 gcctggctga tccatgccgg tgaccgtgat cagcaccgac accttggctg gcatgctcac 3364141 cttcagatgg ggcccaaccg gtacggcccc atcagctcga agtcaccatg gcgggctcgt 3364201 catgatgacg cccgacgtgg gcctcggcgc gaaggcgttc caccatatgc gggtagtgca 3364261 gttcgaacgc gggacgctcg gagcggatgc gaggcagctc ggtgaagttg tgccgcggcg 3364321 gcggacaact tgtcgcccac tccagcgaat tcccgtaccc ccacgggtcg tcgacggtga 3364381 ccacctcgcc gtagcgccag ctcttgaaga cgttccacac gaacgggaac atcgacgcac 3364441 ccaggatgaa ggccccgatc gtcgagacga cgttgagacc ctggaagccg tcggtgggta 3364501 ggtagtcggc gtagcgacgc ggcataccct cgtcgcccaa ccagtgctgc accaggaacg 3364561 tggtgtgaaa accgatgaac gtcaaccaga agtgcagttt gcccaaccgc tcgtcgagca 3364621 gccggccggt catcttgggg aaccagaaat agatgccggc gaaggtggcg aacacgatcg 3364681 tgccgaacag cacgtagtga aagtgtgcaa ccacgaaata gctatcggtg acgtggaagt 3364741 ccagcggcgg gctggccagc agcacaccgg tgagtccgcc cagcaagaag gtgaccatga 3364801 agcccaccga aaacaacatc ggggtttcaa aggtcaattg ccccttccac atggtgccga 3364861 tccagttgaa aaacttgatc ccggtcggca ccgcgatcag atacgtcatg aaagagaaga 3364921 agggaagcag gacggctccg gtcgcgaaca tgtggtgcgc ccataccgcg accgacaacg 3364981 cggcgatcga cagcgtcgca taaaccagcg tggtgtaacc gaagatcggc ttgcgggaaa 3365041 acaccgggaa gatctccgag acgatcccga aaaacggcag cgcgatgatg tagacctcgg 3365101 ggtggccgaa gaaccaaaac aggtgctgcc acagcaggac tccgccattg gcggcgtcat 3365161 agatgtgagc tcccagatgc cggtcggcgg ccagcccgaa caatgccgcc gtgagcagcg 3365221 ggaacgcaat caatatcagg atggacgtca ccatgatgtt ccaggtgaag atcggcatcc 3365281 ggaacatcgt catcccgggt gcgcgcatgc acaccacggt ggtgatcatg ttgaccgcgc 3365341 ccaggatggt gcccagaccc gcaacgatca aacccatgat ccacaggtcg cccccggcgc 3365401 cgggcgagtg aatggcgtcg gtcagcggcg tgtaggcggt ccacccgaag tccgcggccc 3365461 cgcccggagt gatgaagccg gctgccccga tggtggcgcc aaatacgaac agccagaacg 3365521 aaaaggcgtt cagccggggg aaggccacgt cgggtgcgcc gatctgcagc ggcagcacca 3365581 ggttggcgaa accaaacaca atcggcgtgg catagaacag cagcatgatc gtgccgtgca 3365641 tggtgaacaa ctggttgaac tgctcattcg acaagaactg cagaccgggt gcggccagct 3365701 cggtccgcat caacaacgcc agcaggccac cgatgaagaa aaagcttatg cacgcgacgc 3365761 agtacatgat gccgatcatc ttgtgatcgg tggtggtgat cagcttgtag accaggctcc 3365821 ccttgggacc ggtgcgggcc gggtaaggac gaatggcttc gagttctccc agcgggggcg 3365881 cttcggctgt caacgcactc ctccaaacat ccagcccgga ccgggccaaa acccagtatt 3365941 gagaggcatc ttagccctcg atcaggctgg cggcaggcct ggtcctacaa accgtcgtaa 3366001 atgccagact ccgccggcgg gccgttgcag accaacgctt tccgcccgcg cgaatcgggg 3366061 tcgacggctg gccgagtgct accgtcgaac gcgtgctgtc cggcgggatg cgatccactg 3366121 ttgctgtcgc cgtagcggca gccgtgatcg cagcgtccag tggttgcggc tccgatcaac 3366181 cggcccataa ggcgtcacaa tcgatgatca cgcccaccac ccagatcgcc ggcgccgggg 3366241 tgctgggaaa cgacagaaag ccggatgagt cgtgcgcgcg tgcggcggcc gcggccgatc 3366301 cggggccacc gacccgacca gcgcacaatg cggcgggagt cagcccggag atggtgcagg 3366361 tgccggcgga ggcgcagcgc atcgtggtgc tctccggtga ccagctcgac gcgctgtgcg 3366421 cgctgggcct gcaatcgcgg atcgtcgccg ccgcgttgcc gaacagctcc tcaagtcaac 3366481 cttcctatct gggcacgacc gtgcatgatc tgcccggtgt cggtactcgc agcgcccccg 3366541 acctgcgcgc cattgcggcg gctcacccgg atctgatcct gggttcgcag ggtttgacgc 3366601 cgcagttgta tccgcagctg gcggcgatcg ccccgacggt gtttaccgcg gcaccgggcg 3366661 cggactggga aaataacctg cgtggtgtcg gtgccgccac ggcccgtatc gccgcggtgg 3366721 acgcgctgat caccgggttc gccgaacacg ccacccaggt cgggaccaag catgacgcga 3366781 cccacttcca agcgtcgatc gtgcagctga ccgccaacac catgcgggta tacggcgcca 3366841 acaacttccc ggccagcgtg ctgagcgcgg tcggcgtcga ccgaccgccg tctcaacggt 3366901 tcaccgacaa ggcctacatc gagatcggca ccacggccgc cgacctggcg aaatcaccgg 3366961 acttctcggc ggccgacgcc gatatcgtct acctgtcgtg cgcgtcggaa gcagccgcgg 3367021 aacgcgcggc cgtcatcctg gatagcgacc catggcgcaa gctgtccgcc aaccgtgaca 3367081 accgggtctt cgtcgtcaac gaccaggtat ggcagaccgg cgagggtatg gtcgctgccc 3367141 gcggcattgt cgatgatctg cgctgggtcg acgcgccgat caactagtga ggcgcagcgc 3367201 taggctttgg gatacccaca gctaaaaaat taatcaaaga aacgaagagg gttgccatga 3367261 gcactgttgc cgcctacgcc gccatgtcgg cgaccgaacc cctgaccaag accacgatca 3367321 cccgtcgcga cccgggcccg cacgacgtgg cgatcgacat caagttcgcc ggaatctgtc 3367381 actcggacat ccataccgtc aaagccgagt ggggccaacc gaattaccct gtggtccctg 3367441 gccacgagat cgccggcgtg gtgaccgccg tgggctcgga ggtgaccaag taccggcagg 3367501 gcgaccgcgt tggggttggc tgtttcgtgg actcgtgccg cgagtgcaac agttgcacgc 3367561 gcggcatcga acagtactgc aagccgggcg caaacttcac ctacaactcg atcggcaaag 3367621 acggccagcc aacccagggc ggctacagcg aagcgatcgt cgtcgacgaa aactacgtgt 3367681 tgcgcatacc cgacgtgctg cccctggatg tggcggcgcc gctgttgtgc gcgggcatca 3367741 cgctgtactc gccactgcgc cactggaatg ccggggcgaa cacgcgggtg gcgatcatcg 3367801 gcctaggcgg actgggtcac atgggcgtca agctgggcgc cgcgatgggc gccgacgtga 3367861 cggtgctgtc ccaatcgctg aagaaaatgg aggacggtct gcgcttgggg gccaagagct 3367921 actacgcgac cgccgacccg gacaccttcc gcaagctgcg cggcggcttc gacctgatcc 3367981 tgaacaccgt ctcggctaac ttggacctcg gccagtacct gaacctgctg gacgtcgacg 3368041 gcacactcgt ggaactgggt atccccgagc accccatggc cgtgccggcg ttcgcgctag 3368101 cgctcatgcg acgcagcctg gccgggtcca acatcggcgg gatcgccgag acccaggaga 3368161 tgctcaattt ctgtgccgag cacggcgtga cacccgaaat cgagctgatt gaaccggact 3368221 acatcaacga cgcctacgag cgcgtgctgg ccagcgacgt gcgctaccgc ttcgtcatcg 3368281 acatctcagc cctgtgaggc cggtgcgcga tcacttccgg attcggactc gccgacgtcg 3368341 acgccggcca gcggccatcc ggcggcggcc aggatgcctg ccacccgttg gatgttttcc 3368401 ggtccggcgt cgtgatgggt cacctcggag atgaactcgg cgatctcgtc gcggtcgatc 3368461 acacggtccg ccacggcggg cgacccgttc tcggtgaagt ggcgcaccac ctccccgatc 3368521 tgctcctcgg tcagcggggt gctgcgcaat agcgacagca gcgccacccg gtccggcccg 3368581 gggacgccct cggggtagcc aacctgaagc cagcgcagca ccgaacggaa gaaatgcggg 3368641 tgcgagaacg ttttcgtcac cccaccagtc tcaaggtttc gacatcactc gcgccagtgt 3368701 ggtgcggcgc gattcagaca attcacgagg cgttcaccac gatcgcgagc ccatggaccc 3368761 atgagcccgt gacattctgc agcgtcgtct agcgggacgg caacgacgaa ctgggttttc 3368821 accccgctcg atttttcacc ccgctcgatt aggtggcgtt tggcaagctg gctcgcgcgc 3368881 tgcggggcaa ggccatctgg cgttgcgctg tcacgcgctg gagtgccctc gtgagaaatg 3368941 accggccccg ggcagcggac gtcgacggtg tcaggccggc cagtcgccgc gggtcaaaga 3369001 gcttggcgtg acgctccacc ggtagctatc ggattccaga agcttgggca gccaattgtc 3369061 ccaggtgcca gtcgcgccgc cagcggtatg caccgcggta cgcgcggcaa caaacgcctt 3369121 gtgacgagcg cgtccgagcg gtcatcggcc tccaccgtca tgcacagctc cttctccagg 3369181 tctacgccga cgtcgcggtc cacattggtg agcttggcga atgcctcggc aacctcgtcg 3369241 aaatgcgcct ccgcgtccgc atcgaacggt ccgcccatgt caaagatcaa ctcgacgtag 3369301 tagctagtta ccgcatcagg tcagtgtttg ctggcctcgg agtccggccg aacaatggcc 3369361 cattttcccg cgactctaga agtcccagtc atcgtcctcg gtgacgaccg ccttgccgat 3369421 cacatagctc gacccggatc cggagaagaa gtcatggttc tcgtcggcgt tgggtgacag 3369481 cgccgacagg atggccgggt tcacgtcggt ctcatcgcgg gggaacagcg cctcatagcc 3369541 gaggttcatc agcgccttgt tggcgttgta gcgcaagaac ttcttgacgt cctcggttag 3369601 cccgacctcg tcgtagaggt cctgggtgta ttccacctcg ttgtcgtaga gctcgaacag 3369661 tagctcgtag gtgtagtcct tgagctcggc gcgcgtgacg tcgtcaacca acgccagacc 3369721 acgctggaac ttatagccga tgtagtaacc gtgcacggcc tcgtcgcgga tgatcagccg 3369781 gatcatgtcg gcggtgttgg tcaacttggc ccgactcgac cagtacatcg gcaggtagaa 3369841 cccagagtag aacaggaagc tctccagcag ggtggaggcc accttgcgct tgagcggctc 3369901 gtcgccgcgg tagtactgca gcacgatctc ggccttgcgc tgcagattgc gattttcctc 3369961 cgaccagcgg aaggcgtcgt cgatctcggc ggtggaacac agcgtggaga agatctggct 3370021 gtagctcttg gcgtgcaccg actccataaa cgcgatgttg gtcaacaccg cctcctcatg 3370081 cggagtcagc gcgtcgggaa tcaggctgac cgcaccaacg gtgccctgga tggtgtccag 3370141 catggtcagg ccggtgaaga cccgcatggt tagttgcttc tcgccggcgg tcagggtgcc 3370201 ccacgacggg atgtcattgg acaccggcac cttctcgggc agccagaagt ttccggtcag 3370261 ccgatcccag acctcggcgt ccttctcatc ttgcagtcgg ttccagttga tcgctgagac 3370321 tcgatcaatt agctttgcgt ttccagtcac cagaacccca cttcaccagg acaacaagct 3370381 gccttgctag gcctcaaaca ctacccctgg ggtccgacaa ggtactgcaa cacaagaagt 3370441 tgtgtttgcg tgtcgcgaat ccgctcgcct gtttcgccgg ctagttcgcc gcagcgaccg 3370501 tcgcgcggtc gctcgacaaa ccgttgccga tcccgaagaa ccggtactcg gcggggttga 3370561 ccgagcgggt ggtcagccag tattgccagg tgtagccgca ccagagcacg gtgttcttgc 3370621 cgtgctcgtc gagataccag ctgcggcagc cgccactgtt ccacaccgac ccagccagcc 3370681 tgcgctgcag ctcctggttg aaccggtctt gcgcctcgcg ggtgggggcc agcgcttgca 3370741 cgcccatccg gtcgcatttc gcgatcgcat cggccacgta atggatctgc gattcgatca 3370801 tgaacaccac ggagttgtgt cccagcccag tgttcggccc cagcaggaag aacaggttgg 3370861 gcatgttggc gacggtgatc ccgcggtgtg caccgatgcc ctcacggttc cagcggtcga 3370921 ccaggtcctc gccgtgacgc cccttgatct gcacataggt ataggagtcg gtgacgtgga 3370981 agccggtggc gtacacgatc acatcggctt cccggaagac ctcacggcca gtgccgtcgg 3371041 cggtgacgat cccgtcgtgc gtgatccggt cgatgcggtc ggtgatcagt tcggtcttcg 3371101 ggtccgccac cgcggggtaa taggtagagg agttcaggat ccgtttgcag ccgatacgat 3371161 accgcggcgt cagcttgcgc cgcagctcgc gatccttcac cgatcgacga atattgtatt 3371221 tggcataggc ctcgatgatc ttcaacgtgt tgggccgctt ggtcatgccg taggccagcg 3371281 cctcctgggc ccagtagatg ccgaggcgca acagtgcccg tagcccgggg acggttcgca 3371341 acgcccggcg cagcgacacc ggcagctctt cgttggtgcg cgggaccacc cacggcgggg 3371401 tgcgctgata gagctgaagt tcggcgacct ggccgacgat ctcgggcacg atctggatcg 3371461 cgctggcacc ggtcccgacg atcgccaccc gcttgccggt caggtcgata ctgtggtccc 3371521 actgggcgga atggaaagcg gggccggcga attcgtcgcg acctgcgatc tcggggaagg 3371581 acgggatgtg caacgcaccg gccccggaga tcaggaactg cgcgacgtat tcacgcccgt 3371641 cggcggtgaa cacgtgccag cggcattcgt cgtcgtccca gtagccgcga tcgacgagcg 3371701 aattgaactc gatgtagcgg cgcaggccgt acttgtcggt gacccctttg aggtagccca 3371761 agatttcgtc ccagtaggaa aacaggtgtt tccagtccgc cttgggctcg aacgagaagg 3371821 agtacaggtg cgacgggatg tcgcacgcgc agccggggta ggtgttgtcg cgccaggtgc 3371881 cgccgacgtc gtcggctttc tccaatatga cgaagtccac tccttgcttt tgcagtgcga 3371941 tggccatgcc caaaccggag aatccggttc cgatgatgac ggcgcgggta cgtaccggcg 3372001 gctggttggc cgggcttggc gtggacggct tggcagccgt atcggcaatg ctcacaatgg 3372061 atcggtcttc ctgttcagcg gcgagtttgg cgcttcagtc accctcgccg gcgcaaatac 3372121 ccagtaccca gggtatcgga tatgacgagt gttgttgatt gccgcagcga tgtcaacggc 3372181 cgccgcgctc aacgcacggc gggattgttg ggtaccgcgt cgtggatcgg ttggtcaggg 3372241 tcgaccgcga tgcccagcgc ttcggcggtg ccgacgatca cgcccatcat gatggtggtc 3372301 agatgcgcca cgaactgctc acgcggcatg cggcgcgggc tgtcgggttc ggggcccaac 3372361 caccactcgg ttgccgatgc ggccgatccg aacgccgcga atgcggcgag ttcgagcgcg 3372421 gctcgattca gctccatctc gcgcagctcg ttgttgaaca tctcggccat ggccagcgtg 3372481 atctcccggc cttcgttgag ggtgcgtacc gtcgcctcgg actgctttgc cgagcggccc 3372541 tgaatgaaca cccgcagcac gttggggtgc tggtcgacga ggttgacgta ctcctcgacg 3372601 ctgcgccgga taacttcgcg ggcagagtcg gtggctaagt cgagcgacgg gaagatcgcc 3372661 gcccacagca tgtcacgcag tcgcatcccg atagcctcga gcaaatcgga cttgtcggtg 3372721 aaatgccgat agatcttggg cttggcggtg ccggcctctt cggcgatttg gcgcacactc 3372781 agctcggggc ccagccggtc gatagcgcgg aacgccgcgt cgacgatttc gttgcgcacc 3372841 ttcttgcggt gctcacgcca ccgttcactg cgggcgtcga ctttcacccc cggctttgca 3372901 ctggggtggg gtcgggggat tctgaccaca tcaagcacct taccgcgttg caagcgctga 3372961 cctgggcaga ctggccacgc caggcttggt tgaatgtgag gttcacgacg cgacacgccg 3373021 cgaagccgtc gccactttca ctctggcgcg ccggtgctac agcatgcagg acacgcaacc 3373081 ctcgacctcg gtgccctcca acgccatctg ccgcagccgg atgtagtaca gcgtcttgat 3373141 ccccttgcgc caggcgtaaa tctgcgcctt gttcacgtcg cgggtggtgg cggtgtcttt 3373201 gaagaacaac gtcagcgaaa gcccttgatc cacatgctgg gtggccgccg cgtaggtgtc 3373261 gatgatcttc tcgtaaccga tctcgtaggc gtcttcgtag tactccaggt tgtcgttggt 3373321 catatacggc gccgggtagt agacccgccc gatcttgcct tccttgcgga tctcgacctt 3373381 cgacacgatc gggtgaatcg acgacgtcga atggttgatg taggaaatcg acccggtcgg 3373441 cggcaccgcc tgcaggttct ggttgtagat gccgtgcgct tgcaccgact ccttgagccg 3373501 acgccagtcg tcctgcgttg ggatgcggat gccggcgtcg gcgaacagct ggcgtacctt 3373561 ctgggtcttc ggctcccaaa tctggtcggt gtacttgtcg aagaattccc cggacgcgta 3373621 cttggaccgc tcgaaaccct tgaagtgcgt gccgcgttcg atcgcgatgc ggttggatgc 3373681 ccgcaacgcg tgatacagca ccgtatagaa gtagatgttg gtgaagtcga tgccttcgtc 3373741 ggatccgtag aagatgcgtt cccgggccag gtagccgtgc aggttcatct gtcctagccc 3373801 gatcgcgtgg gagtcgttgt tgccctgctc gattgagggc accgacttga tatgggtttg 3373861 gtcgctcacc gcggtcaacg cgcggatcgc cacctcgatc gtctgcgcga agtccggcga 3373921 gtccatcgtc ttggcgatgt tcagcgaccc caggttgcac gaaatgtctt tgcccacttt 3373981 ggcatacgac aagtcctcgt tgaacaatga cggcgtagac acttgcagga tctccgagca 3374041 caggttgctg tgcgtgatct tgccatcaat tggattagcg cgattgacgg tgtcttcgaa 3374101 catgatatag gggtagccgg actcgaactg cagctcggcc agcgtctgga agaactcccg 3374161 tgccttgatc ttggtcttgc ggatgcgcgc gtcatcgacc atttcgtagt acttctcggt 3374221 gaccgagatg tcagcgaacg gcacaccgta gacccgctcg acatcgtagg gcgagaacag 3374281 gtacatgtca tcgttgcgct tggccaactc gaaggtgatg tcggggatca ccacccccag 3374341 actcagcgtc ttgatccgga tcttctcgtc ggcgttctca cgcttggtgt ccaggaatcg 3374401 gtagatgtcg gggtgatggg cgtgcaggta caccgcgccg gcaccttgac gagcgcccag 3374461 ctggttggcg taggagaacg catcctccag caacttcatg atggggatga cgcccgagga 3374521 ctggttctcg atgttcttga tcggcgcgcc gtgctcgcga atgttggtca gcagcaacgc 3374581 cactcccccg ccacgcttgg atagctgcag cgcggagttg atcgaccgtc cgatcgactc 3374641 catgttgtct tcgacgcgaa gcaaaaaaca gctcacgggc tccccgcgct gcttcttgcc 3374701 agaattcaaa aacgtcggtg tggcgggctg gaagcggccg tcgatgatct cgtcgaccag 3374761 cagctcggca agtgcggtat cgccggcggc caacgttagc gccaccatga ccacgcggtc 3374821 ctcgaagcgc tccagatagc gcttcccgtc aaaggttttc agcgtgtagg aggtgtagta 3374881 cttgaacgca cccaaaaacg tcggaaaccg gaactttttg gcgtaggcgc ggtctagcag 3374941 cgtcttgacg aagttgcgcg agtactggtc gagaacctca cgctcgtagt aattctcgcg 3375001 gatcaggtag tcgagcttct cgtcctgatt atggaagaag accgtgttct gattgacatg 3375061 ctgcaaaaag tactggtggg ctgcttcccg atccttgtcg aactggatct tgccgtccgc 3375121 gtcgtacagg ttcagcatcg cgttcagcgc gtgatagtcc gtttcgcccg gccccccaga 3375181 gtaagaggcg tgcgcgccgg aggctacagg ctctgcaatg acggttggtg gcacgtctgt 3375241 tccttccaga attcagcgag accggtgcgg acggcggcga cgtcgtcctc ggtgcccatc 3375301 agttcgaagc ggtataggta gggaacgcta cattttcggg agacgacgtc gccggcgtag 3375361 cagaactcgg caccgaagtt ggtattgccg gcagcgatga ccccgcgcag ctgcgctcga 3375421 ttgtggtcgt tgttcaagaa ggcaatgacc tgtttgggga cgtatccgcc ggcatcgaga 3375481 cccgggttgg cccggccgcc accgtaggtg ggcagtatca gcacgtacgg ctcgtcgacc 3375541 tcgatccggc catgcagcgg tatccgcgtg gcgggaatac ccagtttctg cacaaagcgg 3375601 tgggtgttct ccgacacgct ggagaaatag accaggctgc gccccgcgat atccatggca 3375661 ccgcaatctt ccttatctat gtctgccgcg ctaggcggtc agcgctgccc cggcgagcgc 3375721 cttgatgcga tcggggcgga aacccgacca gtggtcgttt ccggcgacca cgacgggtgc 3375781 ttgtaggtaa cccagcgcca tcacgtagtc ccgcgcttcg gaatccaggc tgatatcaac 3375841 cttctggtag gcgatgccct gcttgtccag cgccttggag gtggcactgc actgtacgca 3375901 cgcgggctta gtgtaaacgg tcacggtcat gggcgtaccg ctcctttgcg gaaatcggga 3375961 atctgacagg atctggcaac gactcaagta gtgcatcttc gatatgttga gcggcccgac 3376021 aaggctccag attcccgtca tagcgcgacc acgtccgtcg acctggcgat gccgatgccg 3376081 ggaagttcat cgcgccccgt ggatctggtg agacctggtg aacctgggat ctgccggtac 3376141 tcgaaaacac tacacctagg gggtggcacg gagaagagat acaagatgtt ctgaataaca 3376201 ttttcgaaat tccctggtcg taagcctggc tcgagcaccg cggcggcgtg tcgcagatca 3376261 cctcagccgc cgccataccc gtctgacccg atatatcacc ggccaccgac aacgtcgccg 3376321 ctagcttccc gccgaaccgc taccagcaaa gccgatggag ccgctattgg ctgacccgcc 3376381 tcggcccagg gatcagccga cctcagcggc caagttgccg acggcgtcgc gcacattcgc 3376441 cgccagcccg gcgtcgtccg cgaccgactt gcccagagtt tggaacggca ccgacagttt 3376501 gatcgcatcg accacccgcg tgccagcgat gctgaacgac ttgcgagtct cgtcgtgcgc 3376561 ccataccccg ccgtagcggc ccatggagcc gccgatcacg gccaacggct tgtccttcaa 3376621 cgcgccatcg ccgaatggcc tggacagcca gtcgatcgcg ttcttgatca cggccggaat 3376681 gctgccgttg tattccggcg tgaccaccaa ggcagcgtgc gcgtcagacg cggcctcccg 3376741 caacgcgctc accggcgccg gcacctccgt cgctgtgtcg atgtcttcgt tgtagaacgg 3376801 caggtccccc agcccctcga acatggtgac ggtgacgccg tccggagcga ccttggcagc 3376861 cagctcggcg atctggcggt tgaacgacgc cgcgcgcagg cttcccacta aggccaagat 3376921 tttgatgtcg gacttggtat ctgacactgc tacgttcctt tccgcttgtt ggtccacgtc 3376981 cttgcacgag ccaaccggac catggtccga tttattccga tcgcgttaca gtgcaaaggt 3377041 gagcggcgcc gagcggttgg gtgacttgcc tgtgttcgcg aggcaagagc ccgtaccaga 3377101 gcggggcgac gcggcacgca atcgtgcact cctgttggag gcggcgcgcc gcctgatcgc 3377161 ccgaagcggt gcggacgcaa tcaccatgga cgacgttgcc gcggccgctg gcgtcggcaa 3377221 aggcaccttg tttcgccgct tcggcagccg tgccggcctg atgatggtgt tgctcgatga 3377281 agacgagcga gccagtcagc aggccttcct gttcggcccg ccaccgctgg gcccggatgc 3377341 tccgccgctg gaccgcctga tcgcattcgg tcgggagcga atgcgcttcg tccatgccca 3377401 tcaccagctg ctgtcggaag ccaaccggga tccacaaacc cgccacagcg cggcgctatc 3377461 ggtactgcgc acccatttgc gggtactgct ggcctcggcg ccgaccaccg gcgacctgga 3377521 tgcccagacc gatgccctgc tagcgctgct cgacgtcgac tatgtcgagc accaactcaa 3377581 cgccggcggc cataccctgc aaaccctggg cgacgcatgg gagagcctgg cgcgaaaact 3377641 gtgcggacga tgatcgatca ctatgccgac agcagcaccg cgatggatcc tgcacgtaga 3377701 cctcgaccag tttttggcgt cggtcgaact gctccgccac cccgaactgg caggtttgcc 3377761 ggtcatcgtc ggcggcaacg gtgatccgac cgaaccgcga aaggtcgtca cctgtgcgtc 3377821 gtatgaggcc cgcgcctacg gtgtgcgcgc cggcatgccg ttgcggaccg ccgcccgacg 3377881 atgccctgag gccaccttct tgccgtcgaa cccagccgcc tacaacgcgg cgtccgagga 3377941 ggtggtggcg ttattgcgcg acctgggata cccggtcgag gtatggggct gggacgaggc 3378001 ttacctcgcg gtggcgcccg ggactcccga cgaccccatc gaagtcgccg aagagatccg 3378061 aaaagtcatc ttgtcgcaaa ccgggctgtc ttgctcgata ggtatcagtg acaacaagca 3378121 gcgcgccaag atcgctaccg ggttggcgaa accagctggc atctatcagc tcaccgatgc 3378181 caactggatg gccatcatgg gtgaccgtac cgtcgaagca ctgtggggtg tggggcccaa 3378241 gactacgaaa aggctggcaa agcttgggat caacaccgtt taccaacttg cacacaccga 3378301 ttccgggcta ttgatgtcca cgttcggtcc gcgaaccgcg ctgtggctgc tgctggccaa 3378361 aggcggaggc gataccgaag tcagtgccca agcttgggtt ccacgctcgc gcagccacgc 3378421 cgtcaccttt ccacgagacc tcacctgccg atccgaaatg gaatcggccg tgacggaatt 3378481 ggcgcagcga acactcaacg aggtggtggc ttcgtcgcga accgtcaccc gagtcgcggt 3378541 caccgtgcgc acggcgacgt tctacacccg caccaagatc cgaaagctgc aagctcccag 3378601 caccgatccc gacgtcatca ccgctgccgc ccggcacgtt cttgacctat tcgagctgga 3378661 tcggcccgtc cggttgctgg gagtgcggtt agaactggcc tagaaccggc gggcacaccg 3378721 cacctgggcg gcgcgaagtc ttgaccgcac cggccgctat ggcccgggcc gaagcgcgcg 3378781 cgtgaagaac acgttgactc gtcgcatcac cagggtgtat ggccaccacg catatcgctt 3378841 gaacgcatac agcgcccgga tgtccgccga cgtatagacc aggtatctgt tccttgtgac 3378901 cccggccaaa attttgtcgg ccgccttctc cggcgtcacg gcgtgaccac tgaaccgttc 3378961 gacccagcgg ttgaccctcg ggtcgtcgcg atccactccg gcgatctcga ccgtattgac 3379021 cagcggggtc ttgacggcgc caggcaccac gaccgacacc ccgatgccgt gccgggccag 3379081 atcgaagcgc agcacctcag aaagtccccg caacccgtac ttgctagcgc tataggccgc 3379141 atgccacggc aagccaacca gcccggccgc cgaggacaca ttgaccaggt gcccgccccg 3379201 accggcggcg accatcggtg ggaccaaggt ctcgatgacg tggattgggc ccatgagatt 3379261 gatcgcgacc atcctgctcc actgatcgtg cgtgagctgg tcaacggtgc cccaggccga 3379321 cacaccggcg atgtttagta ccacgtccat gctgggatga cgggcgtgga tatcggccgc 3379381 gaatgccgcc acgtcctggt agtcggagac gtccagaact cgatgctcgg gcacctgagc 3379441 gccgagtgca cgggcgtcac acacggtttg cgccaagcca tcacggtcgc ggtcggtcag 3379501 atacagctcg gcaccttgcg ccgcgaggcg caacgcggtc gcgcgaccga tgccactggc 3379561 cgcgccggtg acaaagcacc gcttacccgc gaaatactgt ccggctcccc tctgcaacat 3379621 ggtcgtgacg ataccggcgg taccgacacc ccctccggta ggacgatcga tgcgccccga 3379681 tagctatggg gccttgccgc caccccaaag cgcgttgagc cacatctgtt caagcacccg 3379741 aacccggcgc gctgcgtcac tgtcggggcc gacgagcaaa gcgtcaccgg tcagcatcag 3379801 cgcggtggta gctgccaggg tgcggacgag cgtcgggagg tcttcgctga tcggatgcgc 3379861 agtgccggcc ttcacctcag cctcgaagac gccgatggtt tcacgcaaca gcacttggaa 3379921 ctgccgctcg agaatatcgc ggatctccat gtcgctctgg cgtgccgcat tacaggcccg 3379981 cagcaccggg tcgttgttcg cgtaaacggc ggcgacgctg ccgatcatcc ggttgacgaa 3380041 ctgctcgggt gactcccctg gctgacgggc ggagaaatgc tggctggctt cttcgagttc 3380101 ttcggtggcc tcggccaaga tctgggcgag caccgagtat ttggaatcga agtagaagta 3380161 gaaaccggag cgggctaccc ctgcgcgaag gctgatagcg cgcaccgaca attccgcgaa 3380221 cggtgtctcc tccagcagtt cgcgtgcggc ccgcagaatc gcctggcgat gcctgtcacc 3380281 acgccgtcgc atcggcggcg cagcctgctt ctcgtctgcg gcatgactgg tcaccttttg 3380341 atcaccccct tgaccttgca ccatggcgtc tgaaaacgga acatcggtag ccgtcaaatt 3380401 gaccagaagg atagatttca gttacagcca ccaccggtaa ggagcgccaa tggcgacgat 3380461 ccaccccccg gcatacctcc ttgaccaagc caagcgtcgc ttcacgccgt cgttcaacaa 3380521 ctttcccggc atgagtcttg tcgaacacat gctgctgaac accaaattcc cggagaagaa 3380581 actcgccgaa ccgccgccag gcagcgggct caagccggtc gtcggtgacg cggggctgcc 3380641 gatccttggg cacatgatcg agatgttgcg cggcggaccg gactatctga tgttcctgta 3380701 caagacgaag ggtccggtcg tattcggcga ctcagctgtg ctgccgggtg tcgcagcact 3380761 gggccctgac gcggcgcagg tcatctactc caaccgcaac aaggactact cgcagcaggg 3380821 ctgggtgccc gtgatcgggc ccttcttcca ccgcggcctg atgctgctcg acttcgaaga 3380881 gcacatgttc caccgacgga tcatgcagga ggcgttcgtc cggtccaggc tcgccggcta 3380941 cctcgagcag atggacaggg tcgtctcgcg ggtggtcgcc gacgactggg tcgtcaacga 3381001 cgcacgcttc cttgtctatc cggccatgaa ggcgctcacg cttgacatcg cctcgatggt 3381061 cttcatgggc cacgaacccg gcaccgatca cgaactggtc accaaggtga acaaggcgtt 3381121 cacgattacc acccgtgccg gcaacgcggt gatccgcacc agcgtgccac cgttcacctg 3381181 gtggcgagga ctgcgagcac gcgagctgct ggaaaactac ttcaccgccc gagtcaaaga 3381241 gcgccgcgaa gcgtcgggca acgacctgct gacggtgttg tgccagaccg aagacgacga 3381301 cggcaaccgg ttctccgacg ccgacatcgt caaccacatg atcttcttga tgatggccgc 3381361 ccacgatacc tcgacgtcaa cggccacgac gatggcctat cagctggccg cccacccgga 3381421 atggcagcag cgctgccgcg acgaatcgga ccggcatggc gatgggccgc tcgacatcga 3381481 atccctagag cagctggaat cgctcgacct ggtgatgaac gagtcgatcc ggttggtgac 3381541 gccggtccag tgggcgatgc ggcagacggt gcgcgatacc gaactgctgg gctactacct 3381601 acccaagggc accaacgtga tcgcataccc agggatgaat catcgcctgc cggaaatctg 3381661 gacagacccg ctgacattcg acccggaacg gtttaccgag ccgcgcaacg agcacaagcg 3381721 gcaccgctat gcgttcacgc cgttcggcgg cggcgtgcac aagtgcatcg ggatggtgtt 3381781 cggccaattg gagataaaga cgatcctgca ccggctgctg cgccgctacc ggctggagct 3381841 gtcccgtccc gactaccagc cccgctggga ctacagtgcc atgccgatcc cgatggacgg 3381901 gatgccgatc gtgctgcgtc ccaggtaggc cctcttcggc ggattccgcc aatccaccgg 3381961 tgccgcagat gaaagtgcca gtgcgcagcc cgcacccact ttcgacccgc ggcgggagtc 3382021 ggtctggatc agatcccgcc gcgggtcgcg cgaatggtca gcgtcgctat cgtgcgccga 3382081 cggtgcaagc cctttcgact tctatgacga ccgtttgaat ttggacgtcc cctgttgcag 3382141 aaaaccctcg ctgcggtgga acctggcgat agcatctgat gacggtgtgg aaaccgcgga 3382201 atatgggtgt gctccagcga cgaaaggctc aatcgatgag cgcgactaaa agcaagggtt 3382261 tgcgggcgtt tcagacactg gtcgcggcgc tggctgcggt agttgcagta ctagcagcgg 3382321 gctgcgctac ccagcgcgtt cccacggttc tgccggaatc ggagttaatt cctcaaagcc 3382381 tcggttagct gctctgcgac ctcgccggac gggtgcagcg caaccacgca catgcaggag 3382441 caagaagggc gcgaccgatg atcgcaaaag gcaacaggcg gatccgggta gggcaattgc 3382501 tgggcgcagc actggtcgct gcttttgccc tgacagcggt gggatgcaca atccagatgc 3382561 ctcagccacc tctcccgcag caggagttaa ggcggtaggt ccggcctcag ggtagctgct 3382621 aactacccga tggggcagtc acgtcgccgt cgggcacggt gcggacgagc gcggacacac 3382681 tcatccgtac tttggtgctt agagccacca ggaagcggca gcgtccagat ggcggcgggt 3382741 gcggcaacgg gccaggctgt cgtcaccaga gccgatcgct tcgagaatcc gtacgtgggc 3382801 atgtcccaca gcgaccacgt cgctccatgt cggcagcgcc tgctctgtgc tcgacaagtg 3382861 ccggcggaac agctcgacca gtatcagcag gaacagatcc agcatcgtat tgccggccgc 3382921 tcgcgccagt ccgacgtgga accgaaactc ctcgacggcg gccgcgcgta catcgtcggt 3382981 ggggttatcc aacctcggcc gccccagcgt atcgaggaag gacgcaacct caggttcgct 3383041 gcggcgcttg acaactttcg cgacattgtc gatctcgatg gcatcccgga cgcaccgtag 3383101 gtcttcgcgg ctcggcttgc ggtactgcag atagagcgcg atggtgtcga tgctggcttg 3383161 tggctggggg gtggtgacga ccaacccgcc gccgggtccg cggcgcatgt gcgcgatcgc 3383221 gtgatattcc agcagccgca ccgcttcccg aagcaccgcg cggctcacct ggtagcgttc 3383281 caacagcgct gtctcggtcc cgaagaccga cccgacctgc cagccgctgg cggcgatatc 3383341 gtcgccaatg gtggccgcca acacctcggc cagcttgccg cggggcgcgc ccaggatcag 3383401 ctgctgggcc cggcgcggct cacgcgcccg gcccccgttg cggacggccg catcgttgcc 3383461 gcgctggtgc tgctgcagcc agccggctac cgcctcaacg tgtcgttcgc ttaaggtttt 3383521 ggcccacgcc gaatcacccg ccgtgacggc cgcgacgata tcggaatgtt cgttgtgcac 3383581 ttgaccggcc gcctcgacgg cctcacctgc ggattgggta cctgacttct ggacgtatcg 3383641 cttggtcagc cgcatcaaga tgtcgataaa cagctgtagg acagggtttt tcgattgctc 3383701 cgcgagcacg cggtaaaact gctcaggcgg cgggggcaaa ccgggccgcc accgttcctc 3383761 tgcgcgcaag accgctcgca gcctttcgat gccgggttcg tcgatatgct ctgcggcaag 3383821 agaggccgcc aagggttcga gcaccagacg cgcgccgagc aagtcaccga tggtggtgcc 3383881 caggtattcg agatagatga ccacggcgcg ggtagcgggc ccggcatttg gctcgcagat 3383941 gaacagcccg ccgttcggtc cacgacgcat tcttgccacc tgatggtgct caaccagacg 3384001 cacagcttcg cgcagcaccg atcgactcac gcaaaagcgt tgctgcaaag cgctttccga 3384061 acccaaggat gctccgatcg gccagccgcg gcggacgatg tccgcctcga tgcggcgggc 3384121 gatcttcgac gctcgcttgt ccgtccagac cgcgtccggc tcggtgctca tttcaataga 3384181 gtgtactgta ttggctgagt caagggcgcg agctgggccc tagctaatca ggggatcacg 3384241 cggcatgccc aggatccgct cggcaatctg attgcgggtc acctccgacg tgccgccggc 3384301 gatcgccatg ccacgggcgc ccatcaccgt tcggccaatc accctgccgg ggccgtccag 3384361 caacgcaatc tcgggccccc atagcgcggc cgcgatggcg gcgccctcga tcatgtgctc 3384421 tgccactttg agcttggtga tgttgccctc cggaccaggg ccggctcctt cgacgctgcg 3384481 agcggcacgg cgcaggttca gcagccgcag tgcgtgatcc tctgcgagga aagcgccgac 3384541 tcgaattggg gcgcccgcaa acgcatctga ccgccgctgg accaattgca ccagcttcgc 3384601 cgccattgct tcgtagtacg agccactgcc gccgatgctg acccgctcgt tgcccagcgt 3384661 tgcccgcgcc accgtccacc cggagttcgg cgccccgaca acgtcctcat cggggacgaa 3384721 gacatcgttg aagaacacct cgttgaattc cgagtcgccg gtgatctgcc gcagcggccg 3384781 cacctcgaca ccgggggcca acatgtcgat gatcaccgtg gtgatgccag cgtgtttggg 3384841 ggcatccgga tcggtacgca cggtagccag gccacgcgcg cagtactgcg ctccgctggt 3384901 ccacaccttt tgcccgttga tcttccagcc gccctccacc cgagttgcgc gggtcttgac 3384961 cgaggccgcg tcagaccccg cgtcaggttc ggagaacagt tggcaccata tctcctgctg 3385021 gcgcagcgct ttctcgacga atctttcaat ctgccaaggc gttccgtgct gaatcagcgt 3385081 caagatcacc cacccggtga tcgagtaatc cgggcgctcg atgcccgccg cgctgaactc 3385141 ttcctcgatc accaactgct ccaccgcgcc cgcggcacga ccccacggcc tgggccaatg 3385201 cggcatcaca tagcccgtct cgatcagctt gtcgcgctgt gcatcctttt ccagagcagc 3385261 gatttcagcg gcgtccgaac ggatgcgggc gcgcagctcc tcggcctgtg ccggcaggtc 3385321 caagctgatc gcccgggtaa cgccagccgc ggtgcgctcg aaaacgtctc ggacgggcgc 3385381 atcaccgccg aacaatccca cggtcaccaa cgcccggcgc agatgcagat gcgcgtcatg 3385441 ctcccaggta aagccaatac cgccgtgcac ctggatgttg agctcggcat tgcgtgcata 3385501 ggccggaaac gccagggccg cagcgaccgc ggcggccagc cgaaactgct cctcatcctc 3385561 tgctgccgca cgcgcggcat cccagaccgc ggcgatcgcc gactcggcgg ccaccagcat 3385621 gttcgcgcag tgatgcttca ccgcttgaaa cgtggcgatg gtacggccga attgctgtcg 3385681 caccttggca taggccacgg cgctgtccac gcagtcggcc gccccaccga cggcctcggc 3385741 ggccagcaat gtgcgcgcgc gggccaaagc cgattcatac gcaccaagca ggatgtcgtc 3385801 ggtcgtgacg cgcacgttgt ccaggcgcac gcggccactc cgccgggtcg gatcaaagtt 3385861 ttccggcaca tcaaccgaga cgcccttgcg gccgcgttcc aacaccagca cgtcgtcacc 3385921 ggcggcaacc aacagcagct cggcaagccc ggcgcccaac acgattcccg cctcaccgtc 3385981 ggcaacaccg tcggtaacct gcacctgact atccagtccc acacccgccg tcagggttcc 3386041 gtcaatcagc gccgggcaac agccgtgccc gttggtcatc agtaccttct ttggcgacca 3386101 ccgctgaggc gatcacggtc ggcacaaaca gccccggtgc caccgcacga ccgagctctt 3386161 cgatcaccac cacaagctcg gacaggccat agccagagcc accgtgtcgc tcgtcgatat 3386221 gcaggccgag ccagcccagc tcggcgaggt tctgccagaa cggcgggcgg gcgtcccccg 3386281 ccgcgtccag tgatgcacgc gccgcccagc gcaccttctg cgaagtcaag aacgcgcgag 3386341 ccaccccgga gagctcgcga tggtcgtcgg tcaatgcaat acccatcaag gcctcctagc 3386401 ggcactaccg gacccacata gcccccaggc ggtattggta aagagtatac taattgtctg 3386461 tcgcggccgc gagacacggc ttgctcgggc acgccagcct tgccctcgcc aacgatgtcg 3386521 gcgagacatg ccaagctgaa ccgtgctcct tcacgacgtg gccatcacct caatggacgt 3386581 ggccgccacc tcgtcgcggc tgaccaaggt cgcgcgcatc gccgccctgt tgcaccgcgc 3386641 cgcgccagac acacagctgg tcacgatcat cgtgtcgtgg ctctccggcg agctgccgca 3386701 acgccatatc ggtgtcgggt gggcggcatt gcggtcccta ccgccgcccg cgccgcaacc 3386761 ggcgttgacc gtcaccggtg tcgacgccac cctctctaag atcggcactc tatcgggcaa 3386821 agggtctcag gcgcagcgcg cggcactcgt tgcggaattg ttctccgccg caaccgaagc 3386881 tgagcaaacc tttttgttgc gactgctcgg cggtgaactg cgccagggcg caaagggcgg 3386941 gatcatggcc gatgcggtcg cccaggccgc cgggctcccg gccgcgacgg tccaacgcgc 3387001 cgcgatgcta ggcggcgacc tggcggcagc ggcggcggcc ggcctgtccg gcgcggcgct 3387061 ggacaccttc accctgcgag tgggccgacc gataggcccg atgctggcac agaccgcgac 3387121 cagcgtccat gatgcactcg aacgtcacgg cggcacaacc attttcgagg ctaaactaga 3387181 cggcgcgcga gtgcagatcc accgggcaaa cgaccaggtc aggatctaca cccgaagcct 3387241 ggacgacgtc actgcccggc tgcccgaggt ggtggaggca acactggcac tgccggtccg 3387301 ggatctagtg gccgacggcg aggcgatcgc gctgtgcccg gacaaccggc cgcagcgttt 3387361 ccaggtcacc gcatcacggt tcggccgatc ggtcgatgtt gcggctgccc gcgcgacgca 3387421 gccactttcg gtgttcttct tcgacatcct gcatcgggat ggtaccgact tgctcgaagc 3387481 gccgaccacc gagcggctgg ccgccctgga cgcactggtg ccggctcggc accgcgtgga 3387541 ccggctgatc acgtccgatc caacggacgc ggccaacttc ctggatgcga cgctggccgc 3387601 cggccacgag ggggtgatgg ccaaggcacc ggccgctcgt taccttgcgg gtcgccgcgg 3387661 agcgggctgg ctgaaggtca agccggtgca cacactcgac ttggtggtgc tcgcggtgga 3387721 atggggctcg ggacgccggc gcggcaagct ctccaatatt cacctgggcg cacgcgatcc 3387781 ggctaccggt ggattcgtga tggtgggcaa gaccttcaaa ggaatgaccg acgccatgct 3387841 ggactggcag accaccaggt ttcacgagat cgcggtgggt ccgacagacg gctacgtcgt 3387901 ccaacttagg cccgagcagg tggtcgaggt agccctcgac ggcgtgcaaa ggtcgtcgcg 3387961 ctacccgggc gggctggcat tgcggtttgc ccgcgtggtg cgctaccgcg ccgacaagga 3388021 cccggccgag gccgacacca tcgatgccgt gcgcgcgctc tactgatcgc acggcgagag 3388081 tgactcctgc gacgggacac gccggctggg cgtcgccaga ttcacgctcg tcgaccaagc 3388141 gggcgggaca agcagctgca aggatcaacg gagatcgcac ccgtgattga gggaggtgac 3388201 ggtggcagcg ccgaccccgt cgaatcggat cgaagaacgc tccggacacg ccagctgcgt 3388261 ccgcgccgat gccgacctgc cacccgtggc catcctcggt cgctccccca tcacgcttcg 3388321 gcacaagatc ttcttcgtgg ccgttgccgt gatcggcgct ctcgcctgga ccgtcgtcgc 3388381 gttcttccgc aacgagccgg tcaacgcggt ctggatcgtg gtcgcagcgg gctgcaccta 3388441 catcatcggg ttccggtttt atgcgcggct gatcgaaatg aaagtcgtcc gtccccgcga 3388501 cgatcacgcc accccggccg aaatcctcga cgacggcacc gactacgtgc ccaccgaccg 3388561 gcgggtggta ttcggacacc acttcgccgc catcgccggt gccgggccgc ttgtcggacc 3388621 agtactggcc acccagatgg gttacttacc cagcagcatc tggattgtcg tcggcgcggt 3388681 gctggccgga tgtgtccagg actacctggt gttgtggatc tccgtgcggc ggcgtggccg 3388741 ctccctgggt cagatggttc gcgacgaact cggcgccacc gccggagtgg ccgccctcgt 3388801 tggaatcccg gtcattatca ccattgtgat cgcggtgctg gcgctggtgg tcgtgcgggc 3388861 cctggccaag agcccatggg gcgtcttctc gatcgccatg accatcccca tcgccatctt 3388921 catgggctgc tacttgcggt tcctacgtcc cgggcgggtg tcggaagttt cattgatcgg 3388981 gatcggactg ctgctgctcg ccgttgtctc cggtgattgg gttgcccata cctcctgggg 3389041 cgcagcgtgg ttcagcttgt caccggtgac actgtgttgg cttctcatca gctatggctt 3389101 cgcagcttcg gtgctgccgg tgtggctgct gctcgcgcca cgcgactacc tgtcaacgtt 3389161 catgaaggtc ggcaccatcg cgcttctcgc gatcggtgtt tgtgcggctc acccgatcat 3389221 cgaggcccca gcggtgtcga aattcgccgg tagcggcaac ggcccggtgt tcgccggctc 3389281 actgtttcca ttcctgttca tcaccatcgc gtgcggggcg ctgtctggat tccacgcgct 3389341 catctgctcg ggcacgacgc cgaagatgct ggagaaggaa ggccagatgc gcgtgatcgg 3389401 ctactgcggc atgatgaccg agtccttcgt cgccgtcata gcactactca ccgcggcgat 3389461 cctcgaccag cacctatact tcaccctcaa cgcgccgtcc ctgcataccc acgacagcgc 3389521 agccaccgcc gccaagtacg tcaacgggct cggtttgacg ggctcaccgg tgaccccaga 3389581 ccacatcagc caggccgccg ccagcgtcgg cgaacagacg atcgtgtcgc gcaccggcgg 3389641 tgcgccgacg ctggcgttcg gcatggcgga gatgctgcat cgagtggtcg gcggtgtggg 3389701 cctcaaggcg ttctggtatc acttcgcgat catgtttgag gctctgttca tcctcaccac 3389761 cgtcgacgcc ggcaccaggg ccgcgcgctt catgatctcc gatgcgctgg gcaactttgg 3389821 cggtgtgctg cgcaaactgc agaatccgag ctggcgtccc ggtgcgtggg cttgcagttt 3389881 ggtggtcgtc gcggcgtggg gcagcatcct gctgctcggt gtgaccgatc cgctgggcgg 3389941 catcaacacg ctgttcccgc tgttcggcat tgccaaccag ttgcttgccg gaattgcgcc 3390001 gaccgtcatc accgtcgtcg tcatcaagaa ggggcgactg aagtgggctt ggataccggg 3390061 tattccactg ctgtgggatc tggcggtcac cctgaccgca tcgtggcaga agatcttctc 3390121 cgctgatcct tctgtcggct actggactca gcatgctcac tacgcggcag cccagcacgc 3390181 aggcgagacc gcgttcggct cggccaccaa cgccgatgag atcaacgacg tcgtccggaa 3390241 cacattcgtc cagggcaccc tgtcgatcgt cttcgtggtg gtcgtcgtgc tggttgttgt 3390301 cgccggagtc atagtggcgc tgaagacaat tcgcggccgc ggcataccgt tggccgagga 3390361 cgatccggcg ccgtcgacgt tgttcgcgcc cgctggcctg attcctacag ccgcagagcg 3390421 aaagttgcaa cgacgtttgg gcgcgccggc ctcggcttcc gtcgcggcgc ccgactagcc 3390481 ctcccgctgc agtggtaccg gcgccgcaat cagacggcga gtaggcgtgg gtccaacccg 3390541 cgattcgcgg cagccggcgg agaaggcgac caagagacgt tatcggttcg ctcggggact 3390601 catggccggt ctgctgggca cgatggctct cacgagcggc ggtggtgtcg ctcgcgagga 3390661 tccattggaa cctgatccgc tagccccgat catcgacgat tccaggtaaa cggattcgaa 3390721 ggcacctata gggacgtgcc ctgacgcccc gccacaatgg acgcttgggt agcctgacca 3390781 gccttatgca gtgacagtgc gtcgagcatc aattgagtag atcccaccac cggtgaacac 3390841 cagcaggaag aagccgaagc agaacagtat cgccggagtt ccgccattgc cgtccggtgg 3390901 accgccgatc ggccacagtg catacggttg atgcatccag aagtaggcga ccgccatttc 3390961 gcccgaggca acgaacgcca cagcgcgggt aaacagcccg gttgcgatca gcagacctgc 3391021 caccaactcg atgaccccgg cataccagcc gggccaggat ccaaattcga cgggttgagc 3391081 cgaggtgacg ggccagccga aaaggatcat cgatccgtag ccggcgaaca gcagcccgta 3391141 taccaaccga aagaggctca gcacagccgg caaacagccg gcgagccgac ggtcgagatc 3391201 tttcaccatg acacgacgtt acggggatcg accgcgcgaa cgctgggcgg attttgtctc 3391261 ccaccggtgt gcctactcac gtgtggacgc acgagcctcc tttgtgtaca tttgtacatg 3391321 tacaaatgta cacaaaggag gggtcttgat ctacctatac ctcttgtgcg cgatcttcgc 3391381 ggaagtggtg gcaaccagcc tgctcaaaag cacggaaggg ttcactcggt tgtggcccac 3391441 ggtgggctgt ctagtgggtt atggcatcgc tttcgcgctg ctggccttgt cgatctcgca 3391501 cggcatgcag acggacgtcg cctatgcgct gtggtcggca atcggtacgg ccgccattgt 3391561 gctggtcgcc gtactgtttc tcggctcgcc gatatctgtg atgaaggtgg ttggcgtcgg 3391621 cctgattgtc gtcggcgtgg tcacgttgaa cctggcgggt gcccattgac cgcaggctcc 3391681 gaccgccgtc cacgcgaccc agccggtcgc cggcaggcga tcgtcgaggc ggccgagcgc 3391741 gtgatcgctc gccagggcct tggcgggctg agccaccgca gggttgccgc ggaggccaat 3391801 gtaccggtcg ggtcgacgac ctactacttc aatgacctcg acgcgctgcg ggaagccgcg 3391861 ctcgcgcacg ccgcaaacgc ctcggccgac ctgttggcgc agtggcgcag cgacctcgac 3391921 aaggaccgcg acctggccgc gaccctggcc cggctcacca ccgtctacct ggccgaccag 3391981 gaccgctatc gcacgctcaa cgagttgtac atggcggcag ctcatcgacc ggaactgcag 3392041 cgcttggccc ggctgtggcc agatggtcta ctcgcgctgc tcgaaccgcg catcggtcga 3392101 cgagccgcca acgcggtcac cgtgtttttc gacggcgcta cgctgcacgc gcttatcacc 3392161 ggtaccccgc tgagcaccga tgagctcacc gatgccatcg ccaggctggt tgcggacggc 3392221 ccggaacagc gcgaagtggg acaatctgcc catgcgggac gaacccccga ctgacaccgc 3392281 agcggctccc accaccggtg cggcacctga gattgacacc gcccgcgaat acgaagtaac 3392341 cgccgaatac cagtcctggc gggtcgtctg gggaagcgcc gcagcattgc tgacggtcgg 3392401 cgtcgggata ggcgcggcca tcctcctcgg gtggttcacg ttagcgcacc ggcacccgga 3392461 ccagcctggg gcggccgcga caccaccccc tgcggggcta acaacacggt ccgcgcccac 3392521 cgccgccccg ccgtcaacgc tgcaaagccc agacctggac agcgtctttc ttggcaacct 3392581 gcacgatcgc ggcatctcgt tcaccaaccc cgatgccgcc gtctacaacg gcaagatggt 3392641 ctgcaccaat ctcggcggcg gcatgaccgt gcagcaggtg gtcgaggcat tgcagagtag 3392701 cagccctgca cttggcgacc ggacaaccgc ttacgtggcc gtctcgattc gcacgtattg 3392761 tccgaagtac gacgctgtgc tgccaccggg atcctgagtg gagctaaggg gactcgaacc 3392821 cctgaccccc acactgccag tgtggtgcgc taccagctgc gccatagccc catgaagtga 3392881 tgcccatcga agctacacca ccgccggaaa gcgttcaaag ccccaggtca gcgagcctca 3392941 cccgatgacc cgatcgacca cttcgcgggc ggtctgctgc acctcgacca gatgttgcgg 3393001 tccacggaag gactccgcgt agatcttgta gacgtcctcg gtgcccgacg gacgcgcggc 3393061 aaaccacgca ttggccgtcg tcaccttcaa tccgcccagc gcagcaccgt tgccgggcgc 3393121 ggtcgtcagc tttgcggtga tcggctcacc ggccaactcg gtggcgctca cctggtcggc 3393181 cgacagcctg gccaggcggg ctttctgctc ccgatcggcg ggcgcgtcga tccgcgcata 3393241 gcacggccca ccgtactcgc cggccagcgc gtgatatcgc tgcgacggcg tagccccggt 3393301 gaccgccagg atctcggcgg ccagcagcgc catgatgatg ccgtccttgt cggtggtcca 3393361 taccgatccg tcccgtcgca gaaatgatgc ccccgccgat tcctcgccgc cgaagcccaa 3393421 ggtggcgccg atcagaccgt cgacgaacca tttgaatccg accggtacct caacgagttg 3393481 acggccgatc ccggcgacca cccggtcgat gatcgacgag ctgaccaccg tcttgcccac 3393541 ggcgatgccg gccggccagg acgggcggtg ggtgtagaga tattcgatgg ccacggccag 3393601 atagtggtta ggattcagca gcccttcgtc aggggtgact atgccgtgtc ggtcggcgtc 3393661 ggcgtcgttg ccggtggcga tctggtagcg ctcccggttg ccgaacatcg ttcggatgag 3393721 cccagccatc gcatccggtg aactgcagtc catccggatc ttcccgtcgg tgtccagggt 3393781 catgaaccgc caggttgcgt cgaccagcgg attgaccacg gtcaggtcta ggccatgccg 3393841 gtgggcgatc tcaccccagt aatccacgct ggccccgccg agcgggtcgg cgccgatccg 3393901 caccccggcc tcgcgaatgg cggcgatatc gaccacgttc ggcaggtcat cgacatagtg 3393961 gcccaggtag tcgtgtcgct gggcggtgcg taacgcgcgg gccagcggca accgcttcac 3394021 catcgaccga gcgagcagaa tctcgttggc acgcttggct attgcggtgg tcgcagcggt 3394081 gtccgccggg ccaccgttgg gtgggttgta cttgatgccg ccgtcggacg gcgggttgtg 3394141 cgacggcgtc acaacgatcc cgtcggccag cgcttcggtc cggccgcggt tgtaggtcaa 3394201 gatggcgtgg ctgattgccg gcgtcggcgt gtagcggtcg cgggagtcga cgacggccac 3394261 cacctgattg gcggcgagta cctccagcgc cgatacccat gccggttccg aaaggccatg 3394321 ggtgtcacgg ccgatgaaca gcggcccggt ggtcccctgg gcggcgcggt attcgacgat 3394381 agcctgggtg atggccagaa tatgtagttc gttgaacgtt ccggtcaggg ctgagccccg 3394441 gtgccctgag gtgccgaaag cgacctgttg agcgaggtcg tcgggatcgg gttcgatcga 3394501 gtagtacgca gtcaccagat ggggcaggtc gacgaggtct tcgggctggg ccggttgacc 3394561 ggctcgtggg ttggccacca tggctaccaa ttctgcccac aggccctaca gtgcgaagcg 3394621 cagcattagc acaccgagag ggatcgacca gtgccaaacc acgattatcg cgagttggct 3394681 gcggttttcg ccggcggagc gttgggtgcg ctggcccgag cagcgctgag cgcactcgcc 3394741 atccccgacc cagcccggtg gccatggccg acgttcacgg tcaacgtcgt cggcgccttc 3394801 ctggtgggtt atttcaccac ccggctgctg gagcgattgc ccctgtcgag ttatcgacgc 3394861 ccattgctcg gcaccggatt gtgcggcgga ctgaccactt tctcgacgat gcaggtcgag 3394921 acgatcagca tgatcgaaca cggtcattgg ggtttggccg ctgcctactc cgtcgtcagc 3394981 atcaccctcg gattgctggc ggtgcacctg gccacggtct tggtacgccg agtgcggata 3395041 cgccgatgac ggcctcgacg gccctgacgg tggcaatctg gatcggcgtg atgctcatcg 3395101 gcggtattgg gtccgtgttg cgttttctgg tcgatcgctc ggtggcccgc cggctggccc 3395161 ggacttttcc ctacggcaca ctgacggtga acatcaccgg agccgcgctg ctggggtttc 3395221 tggccggcct ggcgttgccg aaagacgcag ccttactggc cggcacgggg ttcgtcggcg 3395281 cctacaccac cttttccacc tggatgctag aaacccaacg gttgggagag gaccgccaga 3395341 tggtttcggc attggccaat atcgtcgtca gcgttgtgct cggtctagcc gcggcgctac 3395401 tcggtcagtg gatcgcccag atatgaacga gcaatgcctg aagctgaccg cgtatttcgg 3395461 cgagcggcaa cgcgctgtcg gcggggcggg gaggtttctg gccgatgcga tgctggatct 3395521 gttcggctcc cataacgtcg cgaccagcgt gatgctgcgc ggtaccacca gtttcgggcc 3395581 aaagcacgag tttcgctgcg atcaatcgct gagcctgtcc gaggacccgc cggtgaccgt 3395641 cgccgccgtc gacatcgaat cgaaaatccg ctccctggtc gacgacgtca cagcgatgac 3395701 cgaccgcggc ctggtgaccc tggaacgggc gcgactggtc acccggcaca gcggcgccga 3395761 ggaattcggc gacatcgaca gccgaaacgg agatgccgcc aagctcacca tctacgccgg 3395821 ccgccaggtg cgggttgccg gggcgccggc ctactacacc atctgcgagc ttttgcatcg 3395881 acatggattc gcaggtgcca cagtgctgct cggcgtcgac ggcacggcac acggtcggcg 3395941 ccgccgggcc cggttcttcg gccgcaacgt caatgttcca ctgatgatca ttgccgtcgg 3396001 aacgcctgca caggttgccg tggccgcaat ggaactcacc gcagcactgc ctaacccgct 3396061 gctgaccatc gaacgggtgc ggctgtgcaa gcgcgacggc gagttgttcg cccgccccca 3396121 acagctgccg cagaccgatg accagggacg caccctgtgg caaaagctca tggttcacac 3396181 cgccgaagca acccatcatg aggggctgcc gatccaccga gcgcttgtcc atcgactgat 3396241 gcagtccgaa acggcgcggg gcgctaccgc gctgcgcggc atctggggct tttacggcga 3396301 ccataaaccc catggggaca agctatttca gctggtgcgt agggtgccgg tgaccacgat 3396361 catcgtcgac acaccccagg ctatcgcgcg cagcttcgac atcgtcgatg agctgacgaa 3396421 ctggcacggg ctggtaacca gtgagatggt ccctgcggcc gtgtcactca ccgggtcacg 3396481 ggatggcacg caaaagaccg gtgaaacccc actggcgcgc tacgactact gagtgccagc 3396541 cgccagattg gtcagatccc acgtcgggga cgcttaccca acccgcgatg cgaacatcca 3396601 tttgtcggcc agcgccgata gccagcccgc tacgacctca ggattatccg gtggcacctc 3396661 gaccaacacc aactcgtcaa cgcccagttc gcgagcacat cgacatcacc aacctgagga 3396721 ttagccaggg ccacggccag ccgcagttcg ccacgatcac gacccgactg ttgccgtcga 3396781 acgatgcgac gtcgtcgcgc cataatgtgc gcattgcagc gacgtattcg gcggtgcgct 3396841 ctgcgcgccg ctcgaatggc actccgagcg cgtcgaactc ctccttggac catccgacgc 3396901 cacgcctagt gtcagccgcc taccactcaa ccgatccagg ctcgccgctt ctttggccac 3396961 tatcaccggg ttgtgctcag gcagcagtag cacgcccgtc gcgacgtcca cccgcgacga 3397021 ggcggcagcg gcgaaactca acgcgatcat cgggtcaagc caatccgcct gtgccggaac 3397081 cgcgatgacg ccgtcgcggg agtagggata acgcgacgcg ggccggtcca ccatcacgac 3397141 atgttcgccg acccacaagg tggcgaagcc acagtcgtcc gccgcaaccg cgacggcatc 3397201 gacgaccgcc gggtcggcgc cggcacctat tccagcgcgt gcagtcccag tcacatcgca 3397261 cgagcgtctc acacaggcca attggcatta gcggccgttg agcaactgcg ccaagacggc 3397321 cgcatggctg cgtgccacgt ggcgcgtcgc ggtcaccggg gtcacaacgc tgcgcccggt 3397381 cagcttgcgc agctcggcta gtgccgcgct gtcgtgcagc tcctcctggt agcgcgacgc 3397441 gaattcgtca aaccgctccg gctggtggtg gtaccactcg cgcagctctt tggatggtgc 3397501 gacgtctttg caccagatgc ccacccgctg gtcatccttg cggattccgt gcggccagat 3397561 gcgatcgacc aggacacgct ggccgtcgtc gggatcgatg tcttcataga cgcgggccac 3397621 ccgcacccgt gtctcgcgca ccattgtgcc agcgtatagc cgttaccgcg ggggcttatc 3397681 cacagccacc ggcgccacca gttgtcccgg tctgtgcagg ctgctattct cgaacacatg 3397741 ttcgagacat tgaccgcgat cgacccggat gccgaggaag cggcgttgat cgagcgaatc 3397801 gccgagctgg agcggcttaa gtcggcagcc gcggctggcc aggcgcgggc ggcggccgct 3397861 gtggacgccg cccgcagagc cgccgaagga gctgccgggg tgccggctgc gcgccgtgga 3397921 cgtgggctgg ccagtgagat tgccctggct cgacgagatt caccagcccg gggcagccgg 3397981 catctggggt ttgccaaggc cttggtttac gagatgccac acacgctggc cgccctggac 3398041 tgcggcgccc tctcggagtg gcgggccacc ctgatcgtgc gcgaaagcgc atgtctggat 3398101 gtcgcggacc ggcgcgcatt agatgccgag ttatgtggcg accccggcga cttggagggg 3398161 atgggcgatg cgcgggtggt cgcggccgcc agggcgatcg cctatcggct ggacccgcag 3398221 gccgtcgtcg accgggcggc caacgccgaa aatgaccgta cggtcaccat tcggccggca 3398281 ccggacacca tgacgtatct gaccgccctg ttgccagtcg cccaaggcgt gtcggtgtat 3398341 gcggcgctga cccgagcggc agacacccgc tgcgacgggc gctcccgcgg ccaagtcatg 3398401 gccgacaccc tggtcgaacg ggtcaccggc cgcgacgcgg cggtcccgac cccgatcgcg 3398461 gtcaacctgg tcatgtcgga tgaaacgctg ctgggtgcgg ccaacacacc ggcgcagctg 3398521 tgcggctacg gtcccattcc tgcggccgtg gcacggacca tggtcgctag cgccgtcacc 3398581 gaccagagat cgcgggccac cctgcgcagg ctctacgctc atcctcaggc cggggcgctg 3398641 gtgtcgatgg aatcacgggc gcggctgttt ccccgcggtc tggccgcctt catcgagctg 3398701 cgcgatcagc gttgccgcac cccctactgt gacgcgccga tccgacaccg cgaccatgcc 3398761 cacccctggg ccgacggcgg cccgaccagc gcgcacaacg ggcttgggac ctgcgaacgc 3398821 tgcaactacg ccaaacaagc ccccggctgg cgggtcagca caagtgtcga cgaaaatcac 3398881 acgcacacag ccgaattcat taccccgaca ggcagtcgac accggtccgg cgccccgccg 3398941 cacctgcctg cggtcaccgt cagcgaactc gaggtccgaa tcggcatcgc gctcgctcga 3399001 tacgccgcct agtagtggta ggtgtcagtc ggagccggca tgtgaaccgg ttcgtcctcg 3399061 aagtcggaca cttcgatgcc gtaggcgcgg gccagatcga ggatcttggt ggcccgggca 3399121 atgcgcggca ggtcagaccc gttgcggatc tcgcccccgt cgcgggcgaa ctcagcgaaa 3399181 aattccttcg cccagacgat ttcgtcctgc gacggggata gcccctcatt caccaccgga 3399241 cattggtccg gcgaaaggca gatcttgccg gtcatgccaa actcggcgga gacggccgtg 3399301 gcctcgatca gcttgagcgc gttggagccg atggtcggcc cgtcaatcgc gctgggcaga 3399361 ccggcggccc gggccgcgat ggtaaagcgc gaccgcgcgt aggccaatgt tgccgggtct 3399421 tcgccaaagc cggtgtcccg gcgaaagtcg ccgataccga aggcgagccg gaaggtgccc 3399481 ttggccgcag caatctcgtt gatgcgctcc agaccccgcg ccgtttcgac cagtgcaacg 3399541 atcggcacgt taggtagtcg tttcgcggtc tcggtgacat ggtccaccga ttcgaccatc 3399601 gccagcatca ctccgccaac ggggctatcg gccaacatcg ctagatcgtc cgcccaccaa 3399661 ggtgtgccga agccgttgat gcgcacccag tcagcgtttc cgtcaccaaa ccaacgcacg 3399721 gcgttgtccc gggcggcatg cttgtctttg ggagcgaccg cgtcctcgat atcgagcacg 3399781 acgatgtcgg cgcgtgagtg cgcggcggac tcgaaccggt cgccgtgcgc gccgttgacc 3399841 agtaaccaac tccgcgcgag aaccggatcg atacgagacc cggccaccgg atccgccgtg 3399901 ttggtatcga cctgttcata cattgaggtc atctagtgtc tcttcgctca gtcgatgtcg 3399961 acattgttct ccttaaaccg tagcgacgtc gcaaatcgga ttggcaggat gccccgcaaa 3400021 acccacgtcc atggtgttgg atggcgtggt gtccgacact cgccgcagcc ggacgatagc 3400081 ggcccggcag caaaccatct gggacgtcct ggccgacttt ggttccttga gttcatgggt 3400141 cgagggcgtc gaccactcct gcgtcttgaa ccacggtccc gacggcggag ctctaggcag 3400201 cacccgccgc gtgcaggtcg gccgcaacac gctggtggag cgtgtcatcg agttcgaccc 3400261 acccacgaca ctggcctacc gcatcgaggg cctgcccgcc cggctgcgca aagtcaccaa 3400321 ccgctggaca ctacggccgg ccgatcctgt aggcgcggtg acggtggtca ccttgaccag 3400381 cacgatcgaa atcggcggca acccgctggc gcgtctggcc gaacttgtcg tcggccgcgc 3400441 catggccaag cggtccaaca cgatgctcgc cgggctggca caacgattgg aggacaaaca 3400501 tggctaaccg tcccgacatc atcatcgtga tgaccgacga ggaacgtgcg gtgccgccgt 3400561 acgagtcggc cgaggtgctc gcctggcgtc aacgcagctt gaccggccgc cgttggttcg 3400621 acgagcacgg gatcagtttc actcggcact acaccggttc gctggcgtgc gtgcccagcc 3400681 gcccgacgat tttcaccggc caatatccgg atctgcacgg cgtcacccag accgacggca 3400741 tcggcaagcg attcgatgat tcgcggctgc gctggctacg ggccggcgag gtgccgacgt 3400801 tgggtaactg gtttcgcgcg gccgggtatg acactcacta cgacggcaag tggcacatct 3400861 cgcacgccga tctggaagac cccgcgaccg gtgcaccact ggccaccaac gacaacgagg 3400921 gcgtcgtcga ctcggccgcg gtgcggcgtt acctcgacgc cgacccgctc gggccatacg 3400981 gcttctccgg gtgggtgggc cccgagcccc atggggcggg gttggccaac agcggttttc 3401041 gtcgcgaccc gctggtcgcc gatcgtgtcg tcgcgtggct gaccgagcgc tacgcccggc 3401101 ggcgcgccgg tgacaccgcc gcgatgcgcc cgttcttgct ggtggccagc ttcgtcaacc 3401161 cgcacgacat cgtgctgttc ccggcatggg tgtggcgcag cccgctaaag ccctccccac 3401221 tggacccgcc acacgtaccg gcggcgccga ccgccgacga ggacctgtcg accaagccgg 3401281 ccgcgcaggt cgcctaccgg gaggcgtact actccggata cggcctaacg cgtatggtca 3401341 gccgcaacta tgcccgcaac gcgcagcgct accgggacct ctactaccgc ctgcacgccg 3401401 aggtcgacgg gccgatcgac cgggtgcgcc gcgcggtcac cgagggcgga tccgaggatg 3401461 ccatgctggt gcgcacctcc gaccatggcg atctgctcgg ggcgcatggc ggactgcacc 3401521 agaagtggtt caacctctat gacgaggcaa ccagggtgcc gttcgtcatt gcccgcatcg 3401581 gcgagaaggc aacccaaccg cgcacggtct cggcgcccac ctcgcatgtc gacttggtgc 3401641 cgacgctgct tagcgcggcc ggcgtggacg tagacgtggt ggccgcggcc ctggccgaat 3401701 cgttctccga ggtgcatccg ctgcccggtc gtgacctgat gccggtcgtg gacggggctt 3401761 cggccgacga gggtcgggcc atctacctga tgacgcgtga caacgtgctc gaaggcgaca 3401821 ccggcgcgtc cctgctgtcg cggcaactgg gccgtatcgt gaatccgcct gcaccgctgc 3401881 gcatcaaggt gcccgcccac gtcgccgcca acttcgaggg attagtcgta cgggtcgatg 3401941 acaccgacgc cgccggtggt gccgggcacc tgtggaaact ggtgcgtacc ttcgacgacc 3402001 cggccacctg gaccgaaccc ggtgtgcgtc acctggccac caacggcatg ggcggcgacg 3402061 cctatcgcac cgatccactg gacgaccagt gggagctcta cgacctgacc gccgatccca 3402121 tcgaggcata caaccggtgg accgacccac aactgcacga gctgcgacag catctgcgga 3402181 tgctgctcaa acagcaacgt gcggtatcgg taccggaacg caaccaaccg tggccgtatg 3402241 ctcatcgact gccgccgagc ggggcatcca acggtttggt gcggcgagtg ttgggaaggt 3402301 tcgtgcgcta attgcagaag ctgctattca ccatcgggtt ggccctgttc ctgatcggcc 3402361 tgcttaccgg attggtcatc ccggcactga agaacccgcg catggcgctg tcgagccacc 3402421 tcgagggggt cctcaacggg atgttcctcg tcgtgctcgg cctgctctgg ccgcacatcg 3402481 atctgcccga ggcatggcag gttatcgcgg tggcgctgat cgtttactcc gcctacgcca 3402541 actggctggc gaccctgctc gcggcggcct ggggagcggg ccgtaaattc gcgcccatcg 3402601 cgaccggcga ccacaaagcc ccggccgcca aggagggatt cgtcagcttt ctgttgttgt 3402661 ccctctcggt ggccatcgtg atcggcgtgg tcatcgtcat cattggcctc tgacggcgac 3402721 ccgtccaact acgccagccg cgctagctcg gcctgaagct tgtccagata tcgaagcgtc 3402781 gggtcgcgag gctcggtcgg cagctccagc aaaacccgct ccacccctag atgccggtat 3402841 ccctcaaggt ctttagccgc cgcttcaccc cactggcaca cggtcaccgg cacgtcgccc 3402901 ccggccatgg cgcgcaaccg ctgaagcgga cccgacagcc gctgcggtga tggactgatc 3402961 gcgatccacc cggcattgag ccgggctatc cgcgggaagt tcgccggtcc cccgcccaca 3403021 tacagcggag gatagggctt tgtcaccggc ttcggccagc agtagatcgg atcgaagtcc 3403081 acatatgtcc catggaattc cgcctgctcc tgcgtccaga tctcgattat cgcgcgcaac 3403141 cgctcatcga tcacacgtcc gcgcaccgca gggtccacac catggttggc gacttcttcg 3403201 cgcaaccagc ccacacccac gccgaagcga aaccgtccct gcgacaccag atccagcgag 3403261 gcgacctcct tggccgtgac gatcggatcg cgttccggga tcagcgcgat gccggtgcct 3403321 aacaccagtg actgggtggt agctgccgcg gccgccaacg ccacaaaggg atccagggtg 3403381 cggtaatact tctccggaat tgggccaccg cccgggtagg ggctctgcgt gttgacggga 3403441 atatgggtgt gctcggcgag gaacagcgac tcaaacccgc ggtgctcgag tgccgcaccc 3403501 agctccgccg ggccgattcc ctcgtcggtg acgaacgtca ggacaccgaa ttgcatgctt 3403561 gctcccatcg tcttgtggct gcaagatctg cacgacgata cggccggccg cgagttaggc 3403621 cagtcccgca tcgaccagca gacgtgacag cccgagttcg gcgcacttcg tggctaccgg 3403681 cgccagttcg tttcgcgcat cggattcccg tccggtggcg gcaagcgttt cgatatgaag 3403741 tatttgcgcc tgcagcgccg ccagcggtct gcgcgtaccg tcgatggcgg cggcgagagc 3403801 accggcccgt tggcaggctt ggtcacgatc ggcggagtcg ccggcggaca acaggcgcac 3403861 cgcggagtcc tcgtcgagtt cggctgtcat ggtggcgatt ccattgtcgc gggggatggt 3403921 gcggggtgcc agcaaatcgg cggccaccgc cgcaggtagc gcgatgccca gccggatccg 3403981 ctcgttgttg attcgggcag ccaggcgcgg cagccccagc tggacggcag tatcgcctcc 3404041 ggtggacagg cgatcagccg caccctcatg atccccctgg gccgccttga cccgcgcgcc 3404101 gatcacgtac ctggcggcca ggtagtccac tgcacccccc tcggaaccca gcagatagct 3404161 ctcgtccatg agacgaccag ccccggccag atcgccggtc tcgtagagca attcggcgag 3404221 cagcgaaccc gcaagccgcg ccgcgtgcga gtgggccccc actgccgtgc cgacctcgaa 3404281 cgccgttcgg aagttctgta gcgcagcgac aatgtcgagc cgattcctgg ccgccatgcc 3404341 gcgcaagcac tgcgcataaa cggtgccgaa cggtcccatc atttcctggt agggcgcggc 3404401 ccagtccagc agtggatata cctcggcgaa ctcgaagcgg cagatcgcgg ccaacgccgc 3404461 ggtgttgccg gcggtcccgg ggactcgcgg gggcagggtg tccggtctcg acattgcctc 3404521 ggcgagaagg tcatccacgc gctcgacccg gtctgcgaac acctcggcga ccgcccgcaa 3404581 cacgtctgcc tcggcccgca gatccgcctg cgtcgcctcg ggaagctcgg cccggccaag 3404641 ggccgtttcg aaacgattca gggcaccggt ggccggcgcc ggccgttgca gcagaatgtt 3404701 cgcccacgcg atggcgagtt ggagccgggc ccgtgaaacc accatcgacg tcggcagttt 3404761 ctgcacgatt gccagaagtg tggtcatctt tgactgctcc ggcaggttcg tttcatcctg 3404821 ctcgacaaga tcgacggcgc gcgcgggatc gcccgcggcc agtgcatggt cgacggcttc 3404881 gtgcaggtag ccgttctcgg cgaaccaggc cgatgccctg cggtgcagtt ccgccacccg 3404941 gtgcgacccg ccacgttcga ggcgacggtg gagaaagtcg gcgaacattt ggtggaagcg 3405001 aaaccaattc gggtcgtctt cggtccgttg caggaacaag ccgcggtgct cggcctcttc 3405061 cagcatcgcc cgcccattgg tgatcccggc cagcgccgag gccagcccgc cgcacgtgcg 3405121 ttcggtgacc gatgccacca gtaggaattc gcgcagttcg ggttccaggg tgtccagcac 3405181 gttttcgctc aggaattcgt ggatcacgtc actggcgccg gaaagtccgc gcaggagttg 3405241 ggtcgcgtcg cccccgccgc gcagcgacag cgcggccagc cgcagcgccg cggcccaccc 3405301 gtcggtagag gtagtcagcg cctgcacgtc tgcgcgcggc aatcgcagac caccagcatc 3405361 gttcagcagc gcggcggcct cgtcggtatc gaagcgcaaa gcagccgaat cgatctcggc 3405421 tagttcgtcg ccgatccgca acctgcccac cggcaaaccg gcgcgagacc agctggtcac 3405481 gatgagctgc aggtggtgac atccgttgtc cagcaggaaa cccaaggcag cttgggtgcg 3405541 gctgtcggac acccgatgcc agtcgtcgat caccaccgcg atccggtcgt cgttttcgtg 3405601 gatttcgtcg atcagcgaag tcaacacgta gcggccggcg tcatccccat gctcttcgag 3405661 cacgtgcccc aacgactcgg ccagcgtggg ccggacccgc cggatcgact cgagcaggtg 3405721 cgacaagaac cacacctcgt tgttgtcgtc gttgtcgatt gtcagccagg cgaccgcggc 3405781 gccgtcgcgc gagagctctt cccgccattg cgccgccagg gtgcttttgc cgaatcccga 3405841 gggcgcgtgg atgaggatca gccggcgccg tccgccggcg cgcaggatgt cggtgagccg 3405901 gctgcgggtg accagcgagc cggtgggcac cgacggccgg tacttggtcg cgggtgtcgg 3405961 aggcgtcggg accgtcgggg tgccgccgcc ggtatgccga tgcgccgcgt gcgcctcggg 3406021 cgagcgtcgg cgttccacgc ccagctcgac ggggaggggc atctcgtcga cgctgacgcc 3406081 gttgcggcgc tgaacgtcgc gaagctcctc gccaacgtct gccgcggtcg cgggacgatc 3406141 cgccggatgg cgggccatcg cccgttcgat ggcggcggcc acgtccgcgg gcagtccctg 3406201 cttccgcagg tcggggatcg gctgcgaggt gatccgcagg aactgggcga tcacccgctc 3406261 accgctgcgg cgctcgtagg cggcatggcc ggtcagcaca cagaacaacg tcgcgcccag 3406321 ggagtacacg tcagaggcgg gcgtcggcga tgctccttcg agaacttccg gcgcggtgaa 3406381 agccggggaa ccggcaatca ccccggtcgc cgtctcgaaa cccccggcga ttctggcgat 3406441 tccgaaatcg gtcagctgcg gttccccgta gtcggtcagc aggatattcc ccggcttcac 3406501 gtcacggtgc agggtgccga cgcgatgcgc ggcttccagc gctcccgcga gcttgacgcc 3406561 gatcgacagc gtctcgcgcc agtccagcgg cccgtgccgg cgaatcagcg tctccaacga 3406621 attcttggcg tggtagggca tcacgatgaa gggccgccca cccgccaaca cgcccacctg 3406681 caagacggtc acgatgtgcg ggtgcccgga aaggcggccc atggcccgct gctcgcgcag 3406741 gaagcgctcg agattgtccc gatccaggtc ggtgctcaat accttgacgg cgacggcgcg 3406801 gtccagcgag ggctggacgc agcggtagac gacgccgaat ccgccgcgcc cgatctcctc 3406861 gacattgtcg aatccagcct caagcagttc cgcgggaata ttcgggacca ggtcccgccg 3406921 cgtcgcgtgc ggatcaacgt cggtcatcga cggtcactat cctcggccgg gagggtatca 3406981 ccaccagttt catcgccggt gaccccacac tatcgccaag ccgcggcgtc gcggctcgat 3407041 acccaccgca cgcaaaagct ccgttcccag accaacggag ggaaggaccg gcaccagttg 3407101 acatacgagc agttcgctcg tatgttgacg ctgatggggc cgagcgatct gtggacggtg 3407161 gaacgcgcgg cgcgccattg gggcgtgagc gcgtcgcgcg ctcgcgctat cctgtcgagc 3407221 cgccacattc accgggtcag cggctacccc gcgcaggcga tcaaggcggt caccctgcgc 3407281 cagggtgcgc gcaccgacct caaaaccgcc aaccatctcg tgccggccgc acaagcgttc 3407341 accatggccg agacgggtgc cgcgatcgga gagaccgaag atgagcgggc acgactgcgc 3407401 attttcttcg agttcctccg cggcgccgat gagaccggga catccgcgct cgatctcatc 3407461 gttgacgagc ccgcgctgat cggtgagcac cggttcgatg ctttgttggc cgcggctgcg 3407521 gaatacattt cggcgcgctg gggccggcct ggacccttgt ggtcggtgag tatcgaacgg 3407581 tttctggaca cggcctggtg ggtcagcgac ctcccgtcgg cacgagcgtt tgccgccgtg 3407641 tggacgccgg cgccgttccg gcgccgcggc atttacctag atcgccacga cctcacgagc 3407701 gatggagtgt gtgtcatgcc cgaaccggtg ttcaaccgaa ccgagctcca gcgggcgttc 3407761 actgccctgg cggccaagct ggaacgcaga ggcgttgtcg gtcaggtgca cgttgtcggc 3407821 ggggcggcga tgctactcgc ctacaactcc cgtgtcacca ctcgcgatat cgacgcgttg 3407881 ttctcaactg acgggcctat gctcgaagcg attcgtgagg tcgctgacga aatgggttgg 3407941 ccgcgaacgt ggctcaacaa tcaggccagc ggttacgtct cccgcacacc aggtgaaggc 3408001 gcccccgttt tcgatcaccc attcctgcat gtcgtagcca cacccgcgca gcaccttctc 3408061 gcgatgaaag tcgttgcggc acgcggcgtg cgtgacggcg aagacattcg cctcctgctc 3408121 gatcggctgc gaatcaccag cgcggccggc gtatgggaga ttgtcgcacg ctactttccc 3408181 gccgaaacca tcaccgaccg gtcgaggctc ctcgtcgagg acctcctcaa ccaatagcag 3408241 accactagca gtgaagccgc ggccgccgcg cgcagcaccc cagtgtcatg gattatccat 3408301 gattcgggcg tccccaatgc gaaccgcttc tgtcagtcgg ggctggggtt tcaccacccg 3408361 tttcaccgac cgctgacccc accataggct cgatactgcc ggggtgtcat cccaaaccag 3408421 caccggcacg accggttgag cgcgctctgc tcggaatagc cgagcagcac cgcgatttgg 3408481 ctcagataca accccggttg ggcgaggtac cttgccgctt gcgcacggcg ttcgcgctcg 3408541 atgaggtcat ggcaccggag gccctcggca gccaagcgcc gctgcagcgt tcgtgggtgc 3408601 atgtcgagtt ggtcggcgat ggcctcggcg ctgcattggc cggtcggcag caggcggcgg 3408661 gccaacccga cgacccgctc ggagagcgtg gcatcgctcg gaaggtattg ggattccaaa 3408721 tatttcgtgg cgatgcgctt ggtttccgga tccgcatggt cgatgggcct accggcgagc 3408781 cggtggtcca cctcgaaccc gcaccatgtc cggccgaacc gaacggtaca acccaacgct 3408841 tcgcggtagg cggcgtcggt gcccagttgc gcatgtcgga acgagaaaac gcgcgcccgc 3408901 gcctgcggtc cgcccagcag gcggatcatc cgggcggcgt tggccatgct cagctcgtat 3408961 ccctgcagcg gatagggaat ccccggttcg gtcacctcat agccgaaccg gacgttggac 3409021 cgtgcggtag ttgatgaaac cgtcagcgtc agggcgggcg aatggacgta gaggtagcga 3409081 ccgatcgcct ccagcccgcc gaacaaggtg gcagcgttgc gcgcgatcac cgctaccggg 3409141 ccgagaatgc ccaggccctg ccagcgtgca aggcgtagtc cgaagtccgg gcaatcgagc 3409201 tcggcggcgc tggcctccag catgcgcacg aacccggcca gcgacatgaa cgcgtcctct 3409261 tggtgttcga tgcccggcgg gatgtcgaag cgccgcagaa acggcagcgg gtccgcgccg 3409321 agctcgcgca tcaggtcggt gtacccccac aggttggtgg cgcggatgag gctgcccagc 3409381 tccatcacct cctgtcggaa aatgataaaa ggctgtcgca aagtgtcaat acgtggcggg 3409441 ggtcctccac catgctggag ccatgaacca gcatttcgac gtcctgatca tcggcgccgg 3409501 cctatccggc atcgggacgg cctgtcacgt gacggccgag ttccccgaca agacaatcgc 3409561 cctcctggaa cgacgggagc gcctgggcgg cacctgggac ttgttccgct acccgggagt 3409621 tcgttcggac tccgacatgt tcaccttcgg ctacaagttc cgcccgtggc gcgacgtgaa 3409681 ggtgctcgcc gacggcgcgt cgatccggca gtacatcgcc gacaccgcca cggagttcgg 3409741 catcgacgag aagattcact acggcctgaa ggtcaacacc gccgagtggt cgagccggca 3409801 gtgccgttgg accgtcgcgg gcgtgcacga ggcgaccggc gaaacccgga cctacacctg 3409861 cgattacctc atcagctgca ccggctacta caactacgac gcgggttatc tgccggactt 3409921 ccccggcgtg caccggttcg gcggccggtg cgtgcacccg cagcactggc ccgaagacct 3409981 cgattattcc ggcaagaagg tcgtcgtcat cggcagcggc gcaacggcgg tcactttggt 3410041 tccggcgatg gccggctcca accccggcag tgccgcgcac gtgacgatgc tgcagcgatc 3410101 cccgtcgtac atcttctcgc tgccggcggt cgacaagatc tccgaagtcc tgggccgctt 3410161 cctgccggat cgctgggtct acgagtttgg ccgcaggcgc aacatcgcca tccagcgaaa 3410221 gctctaccag gcctgccggc gctggcccaa gctgatgcgg cgattgctgc tgtgggaggt 3410281 acgacgccgc ctcggccgct ccgtggacat gagcaacttc accccgaact acctgccgtg 3410341 ggacgagcgg ttgtgcgccg tgcccaacgg cgatctgttt aagacgctgg cctcgggcgc 3410401 ggcgtcggtg gtgaccgatc agatcgagac cttcaccgag aagggcatcc tgtgcaagtc 3410461 cggccgggag atcgaggccg acatcatcgt caccgcgacc ggtctgaaca tccagatgct 3410521 gggcgggatg cgactcatcg tggacggcgc cgaataccag ctgccggaga agatgaccta 3410581 taagggtgtg ctgctggaaa acgcccccaa tctggcctgg atcatcggct acaccaacgc 3410641 gtcatggacc ctgaagtccg acatcgccgg cgcctacctg tgccggctgc tgcggcacat 3410701 ggccgacaac ggctacacgg tggcaacgcc gcgcgatgcg caggactgcg cgctggacgt 3410761 tggcatgttc gaccagctga actccggcta tgtgaagcgc ggccaggaca tcatgccgcg 3410821 ccagggctcc aagcatccgt ggagggtgct catgcactac gagaaggacg ccaagatcct 3410881 gctcgaagac cccatcgatg acggcgtgct gcacttcgcc gcagcggccc aagaccacgc 3410941 ggcggcctga gcatcatgaa cctgcgcaaa aacgtcatcc ggtccgtatt acgtggtgcc 3411001 cggccactgt tcgcttcccg ccggctgggt attgccggcc gtcgagtcct gctggcgacg 3411061 ctgacggccg gcgcgcgcgc ccccaagggc acccgctttc agcgcgtcag catcgccggt 3411121 gtcccggtcc agcgggtgca acccccccat gcggcaacca gcgggacgct gatctacctg 3411181 cacggcggtg cctacgccct gggcagcgcc cggggctacc gcggcctggc cgcccagctc 3411241 gcggcggcgg ccggaatgac ggcgctggtc cccgactaca cccgcgcacc gcacgcccac 3411301 tatccagtgg ccctcgaaga gatggctgcg gtgtacaccc gcttgctcga cgacgggctc 3411361 gacccgaaaa cgaccgtcat cgccggtgat tcggctggcg gagggttgac cctggcgctg 3411421 gccatggcgc tgcgcgatcg cggcatccag gccccggccg cactcggcct gatctgcccg 3411481 tgggccgatc tcgccgtcga catcgaagcg acgcgaccgg cgctgcgcga tccgctcatt 3411541 cttccgtcga tgtgcaccga atgggcgccg cgctacgtag ggtcctccga tccgcggctg 3411601 cccggtatct ccccggtcta cggcgacatg agcggcctgc cgcccatcgt catgcagacc 3411661 gcgggcgacg atccgatctg cgtcgacgcg gacaagatcg aaaccgcctg cgccgcttcg 3411721 aaaacaagca tcgagcatcg ccggttcgcg ggcatgtggc acgacttcca tctgcaggtc 3411781 agtctgctcc ccgaagcccg cgacgcgatc gccgacctcg gggcaaggct gcgcggccac 3411841 ctccaccaat cgcagggaca accacgggga gtagtcaaat gagctcattc gaaggcaagg 3411901 tcgccgtcat caccggggcc ggctcgggca tcggcagagc gttggcactc aacctctccg 3411961 agaagcgcgc aaagcttgcc ctttccgatg tcgacaccga cgggctggcc aaaaccgtgc 3412021 gcctggctca agcgctcggc gcgcaggtga agtcggaccg gctcgacgtc gccgaacgcg 3412081 aggcggtgct ggcccacgcc gacgccgtcg tcgcacattt cggcaccgtg caccaggtct 3412141 acaacaacgc cggcatcgcg tacaacggca acgtcgacaa gtcggagttc aaggacatcg 3412201 agcgcatcat cgacgtcgac ttctggggcg tcgtcaacgg caccaaagcc tttctgccgc 3412261 acgtgattgc ctccggcgac ggacacatcg tcaacatctc cagcctgttc gggctgatcg 3412321 cggtgcccgg gcaaagcgcc tacaacgcgg ccaagttcgc ggtgcgcggc ttcaccgagg 3412381 cgctgcgcca ggagatgctg gtcgccaggc atccggtcaa ggtgacgtgc gtgcatcccg 3412441 gcggcatcaa aaccgccgtc gcgcgcaacg ccaccgtggc cgacggcgag gaccagcaga 3412501 cgttcgcgga gttcttcgac cgccggctgg cgctgcattc gccggagatg gccgccaaaa 3412561 ccatcgtcaa cggagtcgcc aagggccagg cccgcgtcgt ggtcggcctg gaggccaaag 3412621 ccgtcgatgt gctcgcgcgc atcatgggct cgtcgtatca gcggctggtt gccgccggcg 3412681 tcgccaagtt cttcccctgg gccaagtagg cccatagagt tctagaaagg gacaccacga 3412741 tgaaaaccac cgcggcggta ctgttcgagg cgggcaaacc gttcgagctg atggagctcg 3412801 atctcgacgg gccgggtccg ggcgaggtgt tggtcaaata caccgccgcc gggctgtgcc 3412861 attccgacct gcacctcacc gatggtgatt taccaccgcg gttcccgatc gtgggcggcc 3412921 acgaagggtc cggggtcatc gaggaggtgg gtgccggcgt caccagggtc aagcccggag 3412981 accacgtggt gtgcagcttc atcccgaact gcgggacttg ccgctactgc tgcaccggcc 3413041 ggcagaacct gtgcgacatg ggggccacca tcctggaggg ctgcatgccg gacggcagtt 3413101 tccgattcca ttcccaggga acagatttcg gcgccatgtg catgctgggc acgttcgccg 3413161 agcgggccac cgtctcgcag cattcggtgg tgaaggtgga cgactggctg ccactggaaa 3413221 ccgcggtgct ggtgggctgc ggcgtgccgt ccggttgggg caccgcggtc aatgccggaa 3413281 acctgcgggc cggcgacacc gccgtcatct acggcgtcgg cggcctgggc atcaacgcgg 3413341 tccagggcgc gaccgccgcc ggctgtaagt acgtcgtggt ggtggacccg gtggctttca 3413401 agcgcgagac cgcgctcaag ttcggcgcca cccatgcctt cgccgacgcc gccagcgcgg 3413461 cggccaaggt cgacgaactc acctgggggc agggcgccga cgcggcgctg atcctggtgg 3413521 gcaccgtcga cgacgaggtg gtctcggccg cgaccgcggt gatcggcaag ggcggcaccg 3413581 tcgtcatcac cgggctggcg gacccggcca aactcaccgt gcacgtctcc ggaaccgatt 3413641 tgacgctgca cgagaaaacg atcaagggct cgctgttcgg ttcctgcaat ccgcaatacg 3413701 acatcgtgcg gctgctgcgc ctctacgacg ccggccagct gatgctggac gaactcgtga 3413761 ccaccaccta caacctcgaa caggtgaacc agggctacca ggatctgcgg gacggcaaga 3413821 acattcgggg cgtgatcgtg cactgaccag cttccaccaa ccacgaatcc agagaggacg 3413881 atgatgcgca ggctcaacgg cgttgacgcg ctgatgctgt atctcgacgg cggcagcgcc 3413941 tacaaccaca ccctcaagat cagcgtgctc gacccgtcga ccgacccgga cggctggtcg 3414001 tggccgaagg cgcggcagat gttcgaggag cgcgcccacc tgcttccggt cttccggctg 3414061 cggtacctgc ccacaccgct gggcctgcat cacccgatct gggtcgagga tcccgaattc 3414121 gacctcgacg cgcacgtgcg ccgggtcgtc tgtcccgccc cgggcgggat ggcggaattc 3414181 tgcgcgctcg tcgagcagat ctacgcccac ccgctggatc gcgaccgccc gctgtggcag 3414241 acctgggtgg tcgagggcct cgacggcggc cgcgtcgccc tggtcacgct gctgcaccac 3414301 gcctactccg acggcgtcgg cgtgctggac atgctcgccg cgttctacaa cgacgcgcct 3414361 gacgaggccc ccgtggttgc gcccccgtgg gagccgccgc cgctgccgtc cacccggcaa 3414421 cgcctcggtt gggccctgcg ggacctgccc tccaggctcg gcaagatcgc gccgaccgtg 3414481 cgggccgttc gtgatcgggt gcgcatcgaa cgggagttcg ccaaagacgg cgaccggcgc 3414541 gtcccgccca cgttcgaccg ctccgcaccg ccgggcccgt ttcagcgcgg gctgtcgcgc 3414601 agccggcggt tctcctgcga atcgttcccg ctcgccgagg ttcgcgaggt gagcaagacg 3414661 ctgggcgtca ccatcaacga cgtctttttg gcgtgtgtgg ccggtgccgt tcgtcgctat 3414721 ctggagcgtt gcggctcccc tcccaccgac gcgatggtgg ccacgatgcc gctcgcggtc 3414781 accccggcgg ccgagcgcgc ccaccccggc aactactcgt cggtcgacta cgtctggcta 3414841 cgcgccgaca tcgccgaccc gctcgagcgg ctacacgcga cccacctcgc cgccgaggcc 3414901 accaagcagc acttcgccca gaccaaggac gccgacgtcg gcgcggtggt cgagctgctg 3414961 ccggaacgcc tcatctcggg cctggcgcgt gccaacgcgc gcaccaaggg ccgcttcgac 3415021 accttcaaga acgtggtcgt gtccaacgtg ccggggccgc gtgagccgcg gtatctcggc 3415081 cgctggcgcg tcgaccagtg gttttccacc gggcagatct cccacggcgc cacgctcaac 3415141 atgaccgtct ggagctattg cgaccagttc aacctgtgcg taatggccga cgcagtcgcg 3415201 gttcggaaca cctgggaatt ggtcggcggc ttccgcgcct cgcacgagga gctgctcgcg 3415261 gcggcccgtg cccaagccac gcccaaggag atggccacat gacccgcatc aatccgatcg 3415321 atctgtcctt cctgctgctg gagcgggcca accggcccaa ccacatggcc gcctacacga 3415381 tcttcgaaaa gccgaaagga cagaaatcgt cgttcgggcc gcgcctgttc gatgcctacc 3415441 ggcacagcca ggcggccaag cccttcaatc acaagctgaa atggctgggc acagatgttg 3415501 cggcgtggga aaccgtcgag cccgacatgg gctatcacat tcgacacctc gccctgcccg 3415561 caccgggttc catgcagcag ttccacgaaa cggtctcgtt cctcaacacc ggcctgctcg 3415621 ataggggcca cccgatgtgg gagtgctaca tcatcgacgg catcgagcgc ggccggatcg 3415681 cgatcctgct caaggtgcac cacgcgctca tcgacggtga aggcggcctg cgcgcgatgc 3415741 gcaacttcct ctccgattca ccggacgaca cgacgctggc cggtccctgg atgtcggcgc 3415801 agggcgccga ccggccacgg cgcacccccg ccacggtgtc gcgcagggcg caactgcaag 3415861 gacaactgca aggaatgatc aaggggctga ccaagctgcc gagcggcctg ttcggcgtca 3415921 gcgcggacgc ggcggacctt ggtgcgcagg cactgagcct caaggcgcgc aaggcgtccc 3415981 tgcccttcac ggcgcgacgc actctgttca acaacacggc gaaatcggcg gcgcgcgcgt 3416041 acgggaacgt cgagttgccg ctcgccgacg tcaaggccct ggccaaggcg accggcacct 3416101 cggtcaacga cgtggtgatg acggtcatcg acgacgcgct gcaccactac ctcgccgaac 3416161 accaggcgtc caccgaccgg ccgctggtgg cgttcatgcc gatgtcgctg cgtgagaagt 3416221 cgggcgaggg cggtggcaac cgggtgagcg ccgaactggt cccgatgggt gcacccaagg 3416281 cgagtcccgt tgagcgcctt aaggaaatca acgcggcgac cacacgcgcg aaggacaaag 3416341 ggcgcggcat gcaaacgacg tcccgccagg cctacgcgct gctactgctc ggcagcctga 3416401 cggtggcgga cgccctgccc ctgctcggca agttgccgag cgcgaatgtg gtgatatcaa 3416461 acatgaaggg gcccaccgag cagctctacc ttgccggtgc gccgctggtg gcgttcagtg 3416521 gcctgcccat cgtgccgccg ggcgccgggc ttaacgtcac cttcgccagc atcaacaccg 3416581 cgctgtgcat cgccatcggc gcggcaccgg aagccgtgca cgaaccctcc cggctggccg 3416641 aactcatgca acgggcattc accgagctcc aaaccgaagc cggcacaacg agtcccacaa 3416701 catcgaagtc gagaacccca tgaagaacat tggctggatg ctcagacaac gcgcgaccgt 3416761 ctcgccgcgg ctgcaagcct acgtcgagcc gtccaccgac gtccggatga cctacgcgca 3416821 gatgaacgcg ctggcgaacc ggtgcgccga cgtgctcacc gcgctgggga tcgccaaggg 3416881 cgaccgcgtg gcattgctga tgcccaacag cgtcgagttc tgttgcctgt tctatggcgc 3416941 ggccaagctc ggcgcggtag cggtccctat caacacccgc ctcgccgcac ccgaggtgag 3417001 tttcatcctg tccgacagcg gcagcaaggt ggtgatctac ggtgcgccgt cggcgccggt 3417061 gatcgacgcc atcagggcgc aggccgaccc tccgggcacg gtcaccgact ggataggcgc 3417121 cgactcgttg gccgaacgcc tgaggtcggc ggccgcagac gagccggcgg tcgaatgcgg 3417181 cggcgatgac aacttgttca tcatgtacac ctcgggcacc accggacatc ccaagggagt 3417241 ggtgcatacc cacgaatcgg tgcattcggc ggccagttcc tgggcctcga cgatcgacgt 3417301 gcgctaccgc gaccgcctgc tgctaccgct gccgatgttc cacgtggcgg cgttgacgac 3417361 ggtcatcttc agcgccatgc gcggcgtcac gctgatctcg atgccgcagt tcgatgcgac 3417421 gaaggtgtgg tcactgatcg tcgaggagcg ggtctgtatc ggtggcgccg tgccggcgat 3417481 cctcaacttc atgcgccagg tgcccgagtt cgccgaactc gacgcgcccg acttccgcta 3417541 cttcatcacc ggtggcgcgc ccatgccgga ggccctgatc aagatctatg ccgccaagaa 3417601 catcgaggtc gtgcagggtt acgcgctcac cgaatcctgt ggcggcggca ccctgctgct 3417661 cagcgaagac gcgctgcgca aagccggctc ggccggacgc gccaccatgt tcaccgacgt 3417721 ggccgtgcgc ggtgacgacg gcgtgatccg cgagcacggc gaaggcgaag tcgtgatcaa 3417781 gtccgacatc ctgctcaagg aatactggaa tcgcccggag gccacccgcg acgctttcga 3417841 caacggttgg ttccggaccg gcgacatcgg cgaaatcgat gatgagggct atctttacat 3417901 caaggaccgg ctgaaggaca tgatcatttc cggcggcgag aacgtctacc cggccgagat 3417961 cgaaagtgtg atcatcggcg ttcccggggt cagcgaggtg gcggtcatcg gcttgcccga 3418021 cgagaagtgg ggcgagatcg ccgccgccat cgtcgttgcc gaccagaacg aggtcagcga 3418081 gcagcagatc gtcgagtact gcggaaccag gctcgcacgc tacaagctgc ccaagaaggt 3418141 gatcttcgcc gaggccatcc cccgcaaccc gaccggcaag atcctcaaaa cggtgctgcg 3418201 cgaacagtat tcggcgacgg tgccgaagtg atgcacggcc cgagccgcta ggacggcgcg 3418261 agccgcacga tgccgggaac gaggtagcgc gcaacgtacg cacgcagccc ctcgtcgtca 3418321 tcgagcggga tcgggccctc cggcgcagcg accggtaatc cgttgccgtt gagtgtgttt 3418381 gagttgcccg ttcatgcggc ggcgctcgtc gatctcctct tgcaccaggg cctcgaccgc 3418441 cagccgcgag ccggtgatcc ggtcgtagct tccggtccag cggccgccgc ggcttgtcgc 3418501 cccatgcggt ttggatcacc cgccgcgtgc tgcggctggt gtccagccag gcgattgccc 3418561 gcgctcgcac ccgctcggcg cgcgggttgt cggcgacccc gaagatgacc tcggcgacga 3418621 tgtcgagcgc gatcgggccg gcgcggtctc ggaaacggac ctcctcgccc attggccagg 3418681 tggcgagcgc ctttcggtca cgcgttccat ggcccgctcg taggacctca gcgcctttcc 3418741 gcggaagggc gggctggcgt agcgccggtc ggcgcggtgc cttccatgct gaccagcgtg 3418801 tgctcgccga agatcgcgtg gtgagccggt ccatcgccgg ggtcagctga aggaccgagt 3418861 tgctcgccgt gaagaccctc ttcacgtctt cgggattggg tcacgcacag cgcgtcgacg 3418921 gctccaggca cgttgaacag gaagcgatca accgagtttt gtggtgcgcg cgtaaaaccg 3418981 ctgggggcca gccagtattc ggctccaaat gcgatcgagg ataggcgcac ccggggcctc 3419041 catcgggact cttcgaacta ccaccgctca ccttgcagtg cgactaccaa gcccgccgac 3419101 gtgtctgcgg cgcagtattc ttcacgcacc tggcccgcgt actccccgac ccagcaaagg 3419161 agtccaggaa tgacatggca gatcgtgttc gtcgtgatat gcgtgatcgt cgccggcgtc 3419221 gcggcattgt tctggcgact cccctccgat gacacgacgc gcagccgggc caaaacagtg 3419281 acaatagccg ccgtggcagc ggcggccgtg ttcttcttct tgggctgttt caccatcgtt 3419341 ggcacccgcc agttcgcgat tatgaccacc ttcggccgtc ccaccggcgt aagcctgaac 3419401 aacggcttcc acggcaagtg gccctggcag atgacccatc ccatggatgg tgcggtgcag 3419461 atcgacaagt acgtcaagga aggcaacacc gatcagcgca tcacggtgcg gctgggcaat 3419521 caatccaccg cgctggcaga cgtcagcatc cgctggcaac tcaagcaggc cgctgccccg 3419581 gaactgttcc agcagtacaa gaccttcgac aacgtgcgcg tcaacctgat cgagcgcaac 3419641 ctctcggtgg cgctcaacga ggtgttcgcc ggcttcaacc cgctggaccc gcgaaacctc 3419701 gacgtgtccc cgctgccttc gctggccaag cgcgccgccg acatcctgcg ccaggacgtg 3419761 ggcgggcagg tcgacatttt cgatgtcaat gtgcccacca tccagtacga ccagagcacc 3419821 gaggacaaga tcaaccagct caaccagcag cgcgcgcaga cctcgatcgc cctggaagca 3419881 cagcgaactg ccgaggccca ggccaaggcc aacgagatcc tgtcccgctc gatcagcgac 3419941 gaccccaacg tggtggtgca gaactgcatt acggccgcga tcaacaaggg aatcagcccg 3420001 ctgggttgct ggccgggaag ctcagcgcta cccaccatcg cagtgccggg acggtaaccg 3420061 cgaagattga ccccatgccg atcccctttg ccgatgggat gctcagccgg ctgggtcgcc 3420121 gcggggcagc gctcgacctg atcgaggagt tcgaggacga gtccggggag ccccccgcat 3420181 ccctgagccc cgccgacctg ctggccgccg aaccggccct gctgctgcag aagatggaga 3420241 accgcctcgt ccggcaccac ctagccaatc cggacgtgtt gagcggcgaa cagctgcgca 3420301 agctgcgcta catcctcaat ttcgccaggc tggccgactt cgaaccgggg gccgcggggc 3420361 cgggcggaag ccgcggtcgc ggggacatct cggtgggcgg ccaagtcgcg ccttggcggt 3420421 cccgggtcgt cgacgcgttg tacgcaccgc tgcgcgagga gcccgatccg gtcacggcgc 3420481 tggagggcgc gaaagacgtg ctggcgacgc tggtcgacga ccaggacgat cagcgtcgag 3420541 tgctcatcga gcgccacggc agcgacttct ccgcgacgga actcgacgcc gaggtcggct 3420601 acaagaagct ggtgaccgtc ctcggcggcg gcgggggcgc gggcttcgtc tacatcggcg 3420661 gcatgcaacg gctgctggcg gccggccagg tgcccgacta catgatcggc tcgtcgttcg 3420721 ggtcgatcat cggcagcctg gtggcccgtg aactgccggt gccgatcgac gagtacgccg 3420781 agtgggccaa aacggtgtcc taccgcgcca tcctgggccc ggagcggcgg cgcagccgcc 3420841 acgggttggc cggaatgttc accctgcgct tcgaccagtt cgcccatacc ctgctcagcc 3420901 gtgcggacgg cgaacggatg cgcatgtcgg atctggcaat cccgttcgat gtcgtcgtcg 3420961 ccggtgtgcg caggcagcct tatgcggcgc tgccgtccag gttccgccat cgcgagcggt 3421021 ctacactgac gttgcggtcg ctgccgtttc tgccgatcgg tatcggcccg tgggtggcgg 3421081 cacgcatgtg gcaagtcgcg gccttcatcg acttgcgggt ggtcaagccg atcgtcatca 3421141 gcgccgacgg cgcgacacgc gacgtcaacg tcgttgacgc ggcgtctttc tcgtcggcca 3421201 tccccggtgt gctgcaccac gaaaccagcg acccgcggat gctgccaatc ctcgacgagt 3421261 tgtgcgccga ccaggacgtc gcggcgatgg tcgacggcgg cgcggccagc aacgtcccgg 3421321 tcgaattggc gtgggagcgg gtccgcgacg ggcggctcgg cacccgcaac gcgtgttatc 3421381 tggcgttcga ctgcttccat ccgcactggg acccccgaca tctgtggctg gtaccgatca 3421441 cccaggcggt ccagctgcag atggtgcgca acctgcccta cgccgaccac ctcgtccgat 3421501 tcgagccgac gctgtcgccg gtgaacctgg cgccgtccgc ggcggccatc gaccgggctt 3421561 gccggtgggg gcgcgacagc gtcgaaccgg cgattgcggt gacatcggcg ctgctggagc 3421621 cgacgtggtg ggaaggcgac aggccccccg ccgccgaacc caaggaacgc acaaagtcgg 3421681 cggcctcgtc gatgagcgcc gtgatggccg cgattcaggc gccgacgggc cggtttcggc 3421741 gatggcgaag ccgccacctg acctagcgac ggctacaggg aacgcgacct cggcggtcga 3421801 aagcaaacca ggtgcacaag tgcaacaaca acgattccga tcaccaaccc agtcgccgcg 3421861 cacgccgcgg tgctgaccaa ccaggtcagc gcgccaccag cagaccccac caggtggtca 3421921 tctaggtggt gaaccaggcg gtacggggcg tgccagccga ggtggtcgct gcccacaagc 3421981 accatgtggc cgcccaccca gagcatggct cccatcccga ccgctgacag cgccgatagc 3422041 agtttgggca tccccgcgac caggcccccg ccgatccgct gcccgaatcg ggacgcggtc 3422101 tgggtgaggc gcaggccgac gtcgtccatt tggacgatga cggcgacgac accgtacacc 3422161 gcggcggtga tgacgagggc gacgatgacg aggacgatga ggcgcggcac gaatggctgg 3422221 tcggccacct cgttgagggc gatcaccatg atctcggcgg ataggatgaa gtcggtccgg 3422281 atcgccccgg ccaccagctc gcgttcggcg acctgcggcg cggcgtcgtg gccacggccg 3422341 ccgatgacgc cgcacacctt ttcggcgccc tcgtagcaca gatacgtggc gcccaacatc 3422401 agcagcgggg tcaacagcca cggcacgagc tggctgagca gcaatgcacc gggaaggatg 3422461 agcagcagct tgttgcgcac cgacccgatc gcgatgcgtt tgatgatcgg cagctcacgc 3422521 tcagcggtga tccggtggac gtattgcggc gtcaccgccg tgtcgtcaat gaccactccc 3422581 gcagcctttg ccgtcgcacg accggcggcg gcgccgatgt cgtcaatcga ggcggcggcc 3422641 agccgtgcaa gaaccgcgac atggtccagc agtccgaaca gaccgccgct catcgcgact 3422701 ccgccatcac gatcgaggtt accgtctgcc gtcgttgtcg ccagcggtgc cgtagagccc 3422761 gccgggtcgc agcgctcgca gagccacccg gccccccggg tcttcagcgg tggcgggcac 3422821 gaccgcgacg caatcggcac cggcatccgc gtaggcgcgg agccgggccg ccactcgatc 3422881 ggggctaccc aacgcacaca cccggtcgag cagttcgctg gggacagcga ccgccagttc 3422941 gcggcgagta gcccgggacc gcgcgctacg gaccaggccg tcgaaaccca gcgcgctgaa 3423001 catttcgcca tagccgggcg gggcgaggta caccgccagc tgagctgcca gctgggagtg 3423061 cgcggccgca ccggggttga cggcgaccgg cacccacacc gtgaggcgcg gcgcggcacg 3423121 gccggccgcg gcggctgcgc tgtcgatcgc cgcacgaacc cgcccgacac ggaacggcga 3423181 tgccaggttg agcacgacct catcggcgtg ctgcgcggcc aggcgaatca tgccaggtcc 3423241 aaacgccccc aacgcaattc gcgtatcggg cgccgcaccg cgcagccgga atccgcggct 3423301 gttgacgtga cggccgctgt attcgacccg cgcaccggta aatatcgacc gcaggcattc 3423361 gatggtttcg cgcatgaccg gcacgtggtg cgcccaaggt cggccatgcc agccggccac 3423421 gatcgccgga ctggaagctc ccagcgcgag gtcaacccga cagccggtga gagaagcgac 3423481 cgaactgacc cctagcgcca gccccaccgg accgcgaacg ccgacggcta gcggtccgac 3423541 cttcagcgtc atgtttggcg tgcggagccc gatcgaggtc gcgagcgcga acgcatcgta 3423601 ggtcgccatt tcgccgatcc acagcgcagc gaaacccgtg tcagcggccg cgagcgcgac 3423661 atcggttgcc tcgtggtcgg ggcggtcaag ccagaacggt agggcgactt cgatatcggt 3423721 catagcatcg acacgtcggc cggctggtcg agcaggacac gcccgggcag ttcgcgtgat 3423781 gcctcgttga cctggaaatg ggcggtagcg gtgaatgcgt cgcggaaccg gcgctgcagc 3423841 ggtgcgttgt cgtagatggc ggtgccgccc gccagatcat acatgctgcg caccacgtcg 3423901 gccgaggtcc gtaccgcgtg cgtggccgcc aaccgcagcc ggttgcgcat cgtcaccggt 3423961 accgcctcgg catcgtggct gacctgccag gccgcctcga ttacctcgta gaacagggcg 3424021 cgggcggcgc ccagcgccga ctcggcggtt gccgccgcgg cttgggtcgc cgaacgttcc 3424081 gccaaggtcc gagtggaccc aagccctttc ttgccgccgg ccagctcgac cagatcgtca 3424141 atcgcggcgc gcgcattgcc caacgcagcc gcgccaatcg acaacgcgaa aaatccaaac 3424201 accggaaagc gatacagcgg ccggtccacg attggtccgt caaacaccga gaacacgcga 3424261 tcagcgggca cgaagacgtc gtcggcaacg cagtcgtggc tgccggtgcc acgcaaaccc 3424321 aatgtgtgcc aagtgtcgag gacctgcagc tcgtccttgt tcagcgcgac gaccgacggc 3424381 acttgccggt cgtcgacgaa gcagccggcg aacatgatgt ccgcgtggtt gatcccgctg 3424441 caaaacggcc agcgtccgga caccacgaca ccgccgtcga cggaccgggc cgtgccacgt 3424501 ggcgcccaca cccccgccgc gacaccccgc cccccgccga acatttcctc gcggctgcgc 3424561 gccggcaggt aggcgaccag cagggcactg gtaatcgcga tcgacacaca ccatcccgct 3424621 gacgcgtcac cacgcgccac cgcctcggcg caccgcagcg cccgcccggg tgccagctcc 3424681 ggcgccgcaa cctcacgcgg catggtggcg cgcagcaagc cggcctcgcg cagccgggtc 3424741 accagctcgt ctggcagccg acgatcgcgc tcgatttccg cggatcgcgc tcgggcccac 3424801 cgcgcgatct tctcggcgag gatctcgatc tcggtttcgc tttggttcac gggcggctcc 3424861 tgatgacggt ggcggttcaa tgaagttacc acccttggtt cagtcattga accaggtaca 3424921 gttggtggac catggccgtt tccgatctat cccaccgctt cgaaggggag tcggtcggcc 3424981 gggcgctcga gctagtcggt gaacgctgga cgctgcttat cctgcgtgag gcgttcttcg 3425041 gggtgcggcg gttcggtcag ctcgcgcgga accttggcat tccgcggccc acgctgtcct 3425101 cgcggctgcg gatgctcgtc gaggtgggtc tttttgaccg ggtgccatat tcctccgacc 3425161 ccgagcgaca cgagtaccgg ctcaccgaag cgggccgcga tctgttcgcc gcgatcgtcg 3425221 tcctcatgca gtggggggat gagtacttgc cacgcccaga aggaccaccg atcaagctgc 3425281 gccaccacac ctgcggcgag cacgccgacc cacgcctgat ctgtacccac tgcggcgagg 3425341 agatcaccgc gcgcaatgtg acacctgaac cggggccggg ctttaaagcc aagctggcgt 3425401 cctcataacg attcccaacc tcaaattgtt gcgaatcgat aatgcaagcc gaaccacgtc 3425461 gccgaacaag gccgtacacc ttggccggga aactatcgtc attttgtgca ccgtcgaacg 3425521 gccctgaagc tcccgctgct gctggcggca ggcacggtgc tgggccaagc gccgcgggcc 3425581 gccgccggag aaccaggccg gtggtcggcc gaccgcgcac atcgctggta tcaagcgcac 3425641 ggctggctcg tcggtgcaaa ctacatcacc tcgaacgcca tcaaccagct cgagatgttc 3425701 cagccaggca catacgatcc ccggcgcatc gacaacgagc tgggccttgc gcggtttcac 3425761 gggttcaaca ccgtgcgagt cttcctccac gacctgctgt gggcccaaga cgcgcccggt 3425821 ttccaaaccc ggctcgcgca gttcgtcgcc atcgcggcgc gataccacat caaaccgctc 3425881 tttgtcctgt tcgactcctg ctgggacccg ctccccagac cgggtcggca gcgggcgcca 3425941 agggctgggg tgcacaactc cgggtgggtg caaagtccgg gtgctgaacg cctcgatgac 3426001 cgccgctatg ccagcacgct gtacaactac gtcacgggtg tgttgggcca attccgcaac 3426061 gacgatcgcg tgttgggttg ggacctgtgg aatgaacccg acaatcccgc gcgcgtgtat 3426121 cgcaaggtgg aaaggaaaga caagctcgag cgcgtcgcgg agctcctccc ccaagtgttc 3426181 cgatgggccc gcacggtcga tccggttcaa ccgctgacca gtggtgtctg gcaagggaat 3426241 tggggagatc ccggacgccg cagcaccatc agcgccattc aactcgacaa cgccgacgtg 3426301 atcaccttcc acagttacgc cgcgccggcc gaattcgagg gccgcatcgc tgagctcgct 3426361 ccgttgcagc ggccaatcct gtgcaccgag tacctggcgc ggtcccaagg cagcactgtc 3426421 gagggaatcc tgccgattgc taagcggcac aacgttggtg cgttcaattg gggtttggtg 3426481 gcgggaaaga ctcagaccta tttgccgtgg gattcgtggg atcaccccta ccgcgcgccc 3426541 ccgaaggtgt ggtttcacga cctgctacac cccaacggcc ggccgtatcg ggacggcgaa 3426601 gttcaaacga ttcggaagct gaacgggatg ccgagccagg actaggcttt ccccagcccg 3426661 cattgggcgc ggctcgccga atgcgagccc gacacctact gaaaaccatg tgcgcggtcg 3426721 gcctggcgga accggatcag gcggcgatac cgagttgctg gttaatctgc ggccaggaca 3426781 gcaaacccca gggggtgagc agtatccagt cgtggatttg ccagggggcc agtacgaagc 3426841 tgaacggcgc tccttggact acggctgtgt gctcgaggac aaccgcttgt tgtgcgagcg 3426901 gatcaagcga gcccgaatag acatacgtcg gcggaagacc gttcagcgac ccatacagcg 3426961 gactgaccag cgggtcgttg accgcaagat tgcctgccca cgcctggctg atctgccagg 3427021 tccccacatc gagccacggg gacagcaaca ccatggacga cggtactggg ttgccctggc 3427081 tcaccatgta ttgggcggcc gccagtgcga ggttgccgcc cgcggagtcc ccgaccacgc 3427141 tgacgttgga gaccccgtgt tgcgcgattt gcgtggagat gagcccggcc atcgccggta 3427201 ctaccgtccc ggcagtgcct ccttcctgca ccaacgggta aatcggcact tgcacggtcg 3427261 cgccggtctg gtaagccgtc accgagtagt tgagccagtg gaagattgac ggcggcagga 3427321 taaacgcgcc gccgtgaatg gcaaccacgt attcgccggt tggatgagcc ggcgtgatct 3427381 gcacgacgct catcccgtca taggtggtgt actggaccgt ctgtcccagc agcgagttca 3427441 gcaacggcgg tggggagttg ccaagaaacc acgacagcgg cggtatgtcg ctggcaatga 3427501 gcgctaaaag tggattgttt gggattgcaa agtgagtttc gagcgcagac aaactcagca 3427561 ggggtttcac cggccacagc gaagcgatgt cgaatccggc agcccctgac ggcgtgccgg 3427621 tgaagatccc tgcctgcgcc gccgcgaaag gcggaacctg cgtgaatccg gcggccagcg 3427681 ccgtgggggc ccgctgaatt tcctggtgaa tcgtggcaaa accgttcccg ataccgctgg 3427741 cgaattcact ctgcaacagc gaagcgttgg ccagctcggc ggcggcatac cccttggccg 3427801 cgcctgtcaa tgcctgcaca aaccgctcat gaaacaccgc aagctgtgcg ctaagagctt 3427861 gatagtcctg accatgggcg gaaaacaaag ccgcgatcgc ggctgacacc tcgtcctcgg 3427921 cagcggctaa taccgtcgtg gtggcacccg cgacaccctg gctcgccgtc gcgaccaccg 3427981 aaccaatcga agccacgtct gtggccgcgg cggacatcac ctccggcaac gcaacaacat 3428041 aagacaccac gccgctcccg ccacctcacg gcaacttccc cagttgccca gccactaccg 3428101 atcgccgagt agccggagct tatgcccacg ccgagtagtc acgtgccagt ttgcgcgaat 3428161 tcccaaagtt agaccggcaa acgtgacggc accgatccgt gtggtgcagc cgccgggaat 3428221 cgaacactct ccgacgcaaa acgacctgcg attacgcgcg gggcgttgat ggcctcaaga 3428281 aggaatgagg cggcgaacgc gggcgttggg gtgccgctat gcgttgaaca attgctatac 3428341 gattgtgcaa catcagctat cgtcgtactc atgaccgcga ccatcggctt ccgacctact 3428401 gaaaaagacg agcagatcat caaggccgca atgcgcagcg gcgagcgcaa gagcgacgtc 3428461 atccggcggg cactgcagct gctcgaacgg gaagtgtgga tcaagcaagc tcgcaccgac 3428521 gctgagcgac ttcgagacga ggatgtctcc actgaaccgg acgcgtggtg attcggggag 3428581 cggtctacag ggtcgacttc ggcgatgcga agcgaggcca cgagcaacgc gggcggcgct 3428641 acgccgtggt catcagcccc ggctcgatgc cgtggagtgt agtaaccgtg gtgccgacgt 3428701 cgacaagcgc ccaacctgcg gttttccgac cagagctgga agtcatggga acaaagacac 3428761 ggttcctggt ggatcagatc cggacgatcg gcatcgtcta tgtgcacggc gatccggtcg 3428821 actatctgga ccgtgaccaa atggccaagg tggaacacgc cgtggcacga taccttggtc 3428881 tgtgatggcc gtcgcatctg caaatgggcc accgacctgg cccttcggtg gagctgccgg 3428941 gaatcgaacc cgggtcctac ggcattccct caaggcttct ccgtgcgcag ttcgctatgc 3429001 ctctgctcgg atctcccggt cacgcgaact agccgagatg acgatcccag tcgctgtggt 3429061 tgtcccgagg agtcccgcga ccggactcat cggtggatcc ctctagctga tgccagggtc 3429121 cgggccgagg gcgttcccgg tctgacagac tagccgtcgc ttaggcagcg agagcgtagt 3429181 cgcgctgatg tgaatcggcg cttatttggt cgcaacgacg cttacggtgg tctcttgcct 3429241 gcaccggcac gcttcccttg attcgatgcg cgaagtcgaa accgttcagc ccctcgcatc 3429301 cctgccgacc ttcggcagga ccatcaatcc tacgccgctc tcaacaaccg gcaacgccat 3429361 taacttcccg gtcagatcac gaagttcagg cgctcgagga tgtgaccggc cagctccttg 3429421 tcgccgccga gttccacatc ctggctgcgc gccgggctca tcgggcgccc gccggcgagc 3429481 ctggtgaact gcagtccgtc caggcggatc gtcgccgtcg gcgccggccc accgaagtcg 3429541 tcgaccaccc gcgctcgacc gtccacggaa acgcggatgc tgcgagacag cgggccggtc 3429601 agctccaaca gcacgcggga gccgtcgggc gctttggcca gcttgccgac gacgaacccc 3429661 atggtggccg ctatctcatc gaggaccagc ggtgacgccg gcccgccgag ttcgtcgtcg 3429721 gacgacgggc gctgcaccgc cgcgcggatg tcctgttcgt gcatccagca gtcgaagatg 3429781 cgtatccgca tgaaccgccc gtagctgtcg gggcccgagg gggtggtcgt cggcgcattc 3429841 cattcgtcat cggaaaggct cgctaagacc ttgcggcgct ggctagtcac tgcgcgaaac 3429901 cgctccagca agcccacacc cgattctgtg cccagatgac gcacccagca ctcgttcatc 3429961 acgccgatgg ggttgcggac atgcgcaagc gcagagacgt ctgtgtctgg ttctggtgcg 3430021 gcgatgccga gcagaaatga ctcggtgccg atgatgtgcg acaccacggc cttgacgtcc 3430081 caaccgggca gcggactcgt tgcctgccag tccgtctcga gcagtccatc gagcagcgca 3430141 tccagggagt gccaaacggc gaacagcccg gccagcacgt cggacttgtc cagtgtggta 3430201 aggggacggc ccggtgtggt cacaaagtga tgctaaacct cacattgccc agttctcgat 3430261 caggtcatgc ccttagcgcg ccgacccaac tcgcggagca cttcacgctg ggcatcacga 3430321 cgggccatgt cctggcgttt gtcgcgggct tgcttgcctc gggccagcgc aagctcaacc 3430381 ttgaccttgc cttcggcgaa atacagcgac aacggcacca gggcgaagtt gccttcgcgg 3430441 atcttgccga ccaaggtgtc gatctggcgg cgatgcaaca gcagtttgcg gttgcgtcgc 3430501 ggctcgtggt tggtccagct gccgtgccgg tattccggga tgtgcgcgtt gcgcagccac 3430561 acttcgccgt cgtcgatggt ggcgaacgaa tcggccagcg acgcctgccc ttcccgcagg 3430621 ctcttcacct ccgtgccttg cagcgcaacc ccggcctcga acacctcgat gatcgaatag 3430681 ttgtgccggg ctttgcgatt gctggcaacg atctgccggc cgccacgcga cgacttggac 3430741 acagctatcg ccgcacgtag aggcgcagcg ttaagtaagc cgtcaacccc gacatcgcca 3430801 cgcccaacag cagcagccac ggcgtgatga agaggatgtc cgcatagtca accttggcaa 3430861 tgagattggc ttgataaaac tggttgagcg cattctccag gaacaaagcc cgcaccacca 3430921 tcaagcccgc tacggcgatg ccgacaccca tcgtcgcggc cagcatcgcc tccactagga 3430981 acggcagctg ggtgtaccag cggctggcac cgaccaagcg catgatgccg atttcggtgc 3431041 gccgcgtata ggcagccact tggaccatgt tggcgatcaa cagaatcgcc ccgatggcct 3431101 gaaccagcgc gaccgcgaac gcggcactgc tcaaaccatc aaggaccgcg aacagccggt 3431161 caatcagctc cttttgattc agcacgtcca agacgccggg ctgccccttc atagcggtgt 3431221 caaagtcctt gtgctgctcg gggttctcca gcttgacaat gaacgacgcc gggaacgaat 3431281 ccttgcccgc cacgtccttg aactggggaa acttgcggat ggcatcgtca taggcctgct 3431341 ggcggttaag gaaacgcacc gctttgacgt cggatcgcgt ttcgatcttc tcccgtaacg 3431401 ctttgcacgc agtggtatcg caggacgagt cgttggcgga aacgtcttcg gtgagaaaga 3431461 cctgagattc cacccggtcg agatagatgg cccgggagct gtcggccaac cggaccacca 3431521 acataccgcc gccgaacaat ccgaccgaga tcgcggtcgt caggatcatc gcgatcgtca 3431581 tggtgacatt gcgacgaaag ccggtcagga cctcatttag caggaaaccg aaacgcactt 3431641 agcgatccat cccgtagacg ccacgctgtt cgtcgcgtac cagcctgccc agggacaact 3431701 caaccacccg ttggcgcatc gagtcgacga tgtggtggtc gtgcgtggcc atcagcaccg 3431761 tcgtgccggt gcggttgatc cgctccaata agtccatgat gtccctactg gtctccgggt 3431821 cgaggtttcc ggtgggctcg tcggccagca gtaccagcgg ccggttgaca aaggcgcggg 3431881 cgatcgcaac gcgctgttgc tcgccgcccg acagctcgtc tggcagccga ttggccttgc 3431941 cggacagacc gaccgtctcg agcacttcgg ggaccacccg gttgatcgcg tcggtgcgtt 3432001 tgccgatgac ctccaatgcg aaggcgacgt tgtcgtacac cgtcttctgc tgcagcaacc 3432061 gaaagtcctg gaagacgcag ccgatcacct gacgcagctt cggtacgtgg cgaccgcgga 3432121 gtttgttgac atgaaacttc gagacccgga catcaccact ggtcggcgtc tccgctgcca 3432181 gcagcagccg catgaaggtt gacttgcccg aacccgacgg gccgatcagg aagacgaact 3432241 cacccttgtc gatcttgacg ttgatgtcat ccaacgccgg acgcgccgac gatttgtact 3432301 gcttggtgac atggtccagg gtgatcatca cggcacgcca gtgtagcggt gagattagcg 3432361 ggcaggcgaa atcaacgggt cggtggctcg gatttggggt aggtgccggc cgtcggaccc 3432421 ggcccgggct gcggtagcgg tgccggtggt gttggggtcg tggtgcccgg gccgaacggc 3432481 ggcggcaact caaacggcgg cgggacagcc gaatcggtcg tggtttcggg cgggctgacc 3432541 ggcggtgtgc tcgacgtggt ggtcggcgtc gccttgacgg tgggtggttg cactctggtt 3432601 cgcggcaccc aggtgtagtc aggatcgggc acgaagcccg gcggcaccac ctgggtcggc 3432661 ggagagtcac caggacctgg tgcctgtggc ctataggtct cgtaaatcca ccacaccgcc 3432721 aggaacgcgg cgatcaacac cagggtcgac gtgcggatcc ggccgaacag atagcccggc 3432781 cagtgccgtt tctggttgct gagcttcacg ctactgctcc ggactttctg ccaccgcggc 3432841 ccgcgcatcg gccgcggtga ctatcccggc gcgggtgagc gcgcggatca ccagcacccg 3432901 caactgccgg cccgcctcga actgcttgcc gggtagggtg cgggccacca gtcgcagggt 3432961 gacggtgtcc acttcgatgc gctccacgcc catgaccgtg ggctcatcca acaacagctc 3433021 tcccagcagc gagtcgtggc gcgcgtgctc acactcctga tgcaagacct cgttcacgcg 3433081 gccgagatcg gcgctggtcg ggacggggat gtccacgacc gcgcgggccc agtccttgga 3433141 caggttgacc gacttgacga tgttcccgtt gggaacggtg aacacctcac cctcgctgga 3433201 acgcagcttg gtcacccgca gcgtgacgtc ctccaccgtg ccggccgcgt tctccggtga 3433261 ccccaccatg ctgagttcga ccaaatcgcc gaacccgtac tgcttctcca cgatgatgaa 3433321 gaacccggcg agtaggtcct gcaccaggcg ttgggcaccg aagcccagcg cggcgccgag 3433381 caccgccgcc ggccccacca acgcaccgac cggaaccggc aacacatcga tgacctcgta 3433441 cacaacgacg acatagatga ggacgatcga cacccacgag atcaccgacg ctacggcctg 3433501 gcggtgcttg gttgcctccg agcgcaccaa cgcgtcgctt tcggtaaacc ccaggtcgag 3433561 gcgccgggtc acccggttgg caagccaagt cacgaagcgg gccgccagca ccgctgcgat 3433621 cagcagcatg acgatgcgca ggccccggtt gaggatccag tcgccgattt caccgcgcca 3433681 gaagttatgc cagtgctgtg ctatcgaggt ggccagaact gtgccgctag tcgtcattac 3433741 gtcgattgcg ccaccggatc ccggcttcca ggaatccgtc gaggtctcca tccagaacgg 3433801 ccgccggatt gccgacctcg tactcggtgc gcagatcctt gaccatctga tatgggtgca 3433861 gcacatagga acgcatctgg ttaccccagg agctgccgcc gtcggccttc aacgcgtcga 3433921 gctcggcgcg ttcttctaag cgcttgcgtt ccaacaactt tgcttgcaga acccgcatcg 3433981 ccgcgatctt gttctgcagt tgggacttct cgttctggca ggtgaccacg ataccgctgg 3434041 gaatgtgggt gagccgcacc gctgagtctg tcgtgttcac cgattgcccg ccgggcccgc 3434101 tggagcgata gacgtcgacg cggacatcgc cctcggggat gtcaatgtgg tcggtggtct 3434161 ccaccaccgg cagcacttcg acttcggcga acgacgtctg tcgccggctc tggttgtcga 3434221 acgggctgat ccgcaccagc cggtgggtgc cctgttcgac cgacaacgtg ccgtaggcga 3434281 acggtgcgtg cacggcgaac gtggcgcttt tgatgccggc ttcttcggca taggaggtgt 3434341 cgaacacctc gacggggtat ttgtgctgct cggcccagcg gatatacatc cgcatcagca 3434401 tctcggccca gtctgcggcg tccaccccac ccgcgccgga ccggatggtg accagcgcct 3434461 cacgctcgtc gtattccccc gacagcaggg tgcgcacctc ggtggcctcg atgtcggcgc 3434521 gcaacgactt gagctccgcg tcggcctcgg cgacggcatc ggcggcggcc gcgcccgctt 3434581 cctcggcggc cagctcgtag agcaccggca ggtcgtccag gcggcgcctt agctcctcga 3434641 cgcgccgcag ctctccctgg gtgtgcgaca actcgctggt cacccgctgc gcccgggtct 3434701 ggtcgtccca caagtgcgga tcagatgcct catgctcgag cttctcgatg cggctgcgca 3434761 gaccctcgac gtcgagcacc cgctccaccg tggtcagggt gcagtccaag gcggcgatgt 3434821 cggcttgacg gtcggggtcc acagcagcca aggttaccgg catcagcgtc tagcatcaga 3434881 tgaccgtcat gtgcaccgca cgactgcggc ccagcccatt cgcagcccct tgcgccgcag 3434941 ccgggcacaa cacagaaggc tcgagtatgc gtccctatta catcgccatc gtgggctccg 3435001 ggccgtcggc gttcttcgcc gcggcatcct tgctgaaggc cgccgacacg accgaggacc 3435061 tcgacatggc cgtcgacatg ctggagatgt tgccgactcc ctgggggctg gtgcgctccg 3435121 gggtcgcgcc ggatcacccc aagatcaagt cgatcagcaa gcaattcgaa aagacggccg 3435181 aggacccccg cttccgcttc ttcggcaatg tggtcgtcgg cgaacacgtc cagcccggcg 3435241 agctctccga gcgctacgac gccgtgatct acgccgtcgg cgcgcagtcc gatcgcatgt 3435301 tgaacatccc cggtgaggac ctgccgggca gtatcgccgc cgtcgatttc gtcggctggt 3435361 acaacgcaca tccacacttc gagcagatat cacccgatct gtcgggcgcc cgggccgtag 3435421 ttatcggcaa tggaaacgtc gcgctagacg tggcacggat tctgctcacc gatcccgacg 3435481 tgttggcacg caccgatatc gccgatcacg ctttggaatc gctacgccca cgcggtatcc 3435541 aggaggtggt gatcgtcggg cgccgaggtc cgctgcaggc cgcgttcacc acgttggagt 3435601 tgcgcgagct ggccgacctc gacggggttg acgtggtgat cgatccggcg gagctggacg 3435661 gcattaccga cgaggacgcg gccgcggtgg gcaaggtctg caagcagaac atcaaggtgc 3435721 tgcgtggcta tgcggaccgc gaaccccgcc cgggacaccg ccgcatggtg ttccggttct 3435781 tgacctctcc gatcgagatc aagggcaagc gcaaagtgga gcggatcgtg ctgggccgca 3435841 acgagctggt ctccgacggc agcgggcgag tggcggccaa ggacaccggc gagcgcgagg 3435901 agctgccagc tcagctggtc gtgcggtcgg tcggctaccg cggggtgccc acgcccgggc 3435961 tgccgttcga cgaccagagc gggaccatcc ccaacgtcgg cggccgaatc aacggcagcc 3436021 ccaacgaata cgtcgtcggg tggatcaagc gcgggccgac cggggtgatc gggaccaaca 3436081 agaaggacgc ccaagacacc gtcgacacct tgatcaagga tcttggcaac gccaaggagg 3436141 gcgccgagtg caagagcttt ccggaagatc atgccgacca ggtggccgac tggctagcag 3436201 cacgccagcc gaagctggtc acgtcggccc actggcaggt gatcgacgct ttcgagcggg 3436261 ccgccggcga gccgcacggg cgtccccggg tcaagttggc cagcctggcc gagctgttgc 3436321 ggattgggct cggctgatca gcgaccgagc aacacccctg ggttgaggat cccggccggg 3436381 tcgagtgcgg acttcgccgc ccgcagggcc gccgcgaacg ggtcgggacg ctgccggtca 3436441 taccaagcgc ggtggtcgcg accgaccgca tggtggtggg tgatggtacc gccactggcg 3436501 ctgatcgcct cggacacggc agccttgatc tcgtcccact gcgcgtcgag cgacccccag 3436561 cgcccgccgg catagatgcc gtagtaagga gccgggccgt ccgggtagac atgggtgaat 3436621 cgacaggtca ctactccggt cccgcatacc ttccagatcg cggtccgagc ggcatcggtc 3436681 accgcggcat gtagagtatc gaatccgtcc caggtgcaag cggtttcgaa tgtttcggcg 3436741 ataactccgc ggcgaaccag cgcgtctcgt tgatacggca tgcgcagaaa cgccgagcgc 3436801 cagttcgcgg ctgcgttgtg ttccgttgcg tcgcttgtag ttccgcggct acgttgcgcg 3436861 gtcaccgtgc cgccgtgttc ggcggtgatc gccaccgccc ggtgcagcca cgggtctatc 3436921 gggtggtcgg cagactcgaa cgccaacacc aacagcccgc caccaacgga cgtgccggca 3436981 ttcagcaacg cctcggccgg atccaacagc cggcagttgg ccgggtacag ccccgcctga 3437041 gcgatcgtcc gggtcgcggc gaccgcggcg gcccagtcgt caaacaccac ggacaccgtg 3437101 acctgccatc gcggacggtg ttgcagccgc atccacgcct cggtgatgat gccaagcgtc 3437161 ccctcggacc cgaggaacaa ccggtccggg gatggtccgg caccgcttcc gggcagccgc 3437221 cgggactcgc tgatccccac cggggtgaca atccgcagcg attcggtcaa gtcgtcgata 3437281 tgggtataga gcgtggcgaa gtgtccgccg gagcgggtgg ccaaccagcc accgagagtc 3437341 gagaagccga aggactgcgg gaaatggcgc agtgtcaaat cgtgtgggcg aagctgatgc 3437401 tcgatcgagg ggccgaacgc acccgcctgg atgcgcgcgg cacggctgac acggtcaatc 3437461 tcaagcaccg cgctcatggc agtgacgtcg accgtgacca ccggctcatc gaagcgcggc 3437521 tcgacaccgc caaccaccga gctgccacca ccgtatggga tgaccgcaat cccctcgcgc 3437581 gcacaccaat ccagcacgtc gatcacgtcc tgctcgctgc ggggtcgggc gatgaggtcg 3437641 ggcaggtggt cgagctggcc ctgcaggttg cgtgcgatgt cgcgatacgc tttgccgcgc 3437701 gcgtgtccgg cccgatcgac gagatcgctt gagcagagcg cggccagcga tgccggcggg 3437761 ctgacccgtg gggccgccaa accgagcgcg gtcaggtccg gcggcgggtg gtcgctcagg 3437821 tcatggccgg acaccagtgc cgcgactcgc gactgtagcg cttgcgtctc ctgatcggag 3437881 agcgcgtcct cgactgtgcc ccaaccccac cacgaacgca tgctgatggt gtcagcgttt 3437941 gaggacgatc atggctccgc cgacgaccac cagcaccagg gccgcgacga tagcccatcc 3438001 agcaccggct agccaccaca tgacacccaa tgcggcgagt accggcgaca gcgcgaagaa 3438061 caccattacc gggtgctgcc taatcactgc gagggcactg gtcgcccgga ctcgatcgat 3438121 ttccttgcct ggcatgccct tcaggatgcc agctgactac cacaatgcaa gcagcgatga 3438181 gccgacgaac cgtcatcctt ggcctgctcc cgctcgctgt tgtcgtcacg aatggcgcac 3438241 gatgcggcgc accaatgcct gtgaccgaag gcggttcggg ctgtcattga caattcatga 3438301 agatgcctgc cgcatcatat ccgttgtgcc cgttgttcta gaagtccgac gtgctgagcc 3438361 tgcccacccg gcgaccccat atccggaacc cctcgcgcgc tgcagccgct cacctggtct 3438421 gaacgaaagc tcgcacatga gtggtcgggt tccgccctaa caacgcgcca taaacgcagg 3438481 ctcatgcgct gcgccacgat gcgccgatgc atttcggtaa cgattgttag ttaacccttg 3438541 tacgaaactc tcttgaggcg ctttaaccga ctgcgtccaa agtggaggat cgaaaagatg 3438601 ataggaaaat gagtacgcct acgctgcctg atatggtagc tccatccccg agagtgcgag 3438661 taaaagaccg ttgtcgccgg atgatggggg acctacgcct ttccgttatc gatcagtgca 3438721 atttgcgatg ccgttattgt atgcccgaag agcactacac atggttgccg cggcaagatt 3438781 tgctatccgt caaagaaatc agcgccattg tagatgtttt cctttccgtt ggggtaagta 3438841 aagttcgaat caccggtggc gaaccgctga tccgcccaga tttgccggaa atagtgagga 3438901 cattgagcgc aaaggtcggc gaagattcag gtctgagaga cttagcgatc acgacgaacg 3438961 gcgtccttct cgccgaccgc gttgacggcc tgaaggctgc gggtatgaaa cgcatcactg 3439021 tcagtcttga tacgttgcaa cccgagcgct tcaaggcgat aagtcagcgt aatagccacg 3439081 ataaggtcat cgcgggtatc aaggctgtcg cagccgcggg atttacggac acaaaaatag 3439141 acacaacggt gatgcgtggt gccaatcacg atgagctggc tgatctgatc gaattcgctc 3439201 ggactgttaa cgcggaagtc aggttcattg agtacatgga cgtcggcggc gcaactcact 3439261 gggcatggga gaaggtcttt accaaagcga acatgctcga gtcccttgag aaacggtatg 3439321 gacgtattga gcctttgccc aaacatgata cggcgcccgc caatcgatat gcgcttccgg 3439381 acggaactac cttcggaatt atcgcgtcga caacggagcc attctgcgca acctgtgacc 3439441 gttcacggtt gaccgccgat ggcttatggc tgcattgctt gtacgcaata tcgggtatca 3439501 acctaaggga gccgctgcgt gcaggcgcga ctcacgatga cttggtggaa accgtgacaa 3439561 ccggatggcg gcgacgagcg gatcgcggag cagagcagcg tcttgcccaa cgcgagcgcg 3439621 gagtgttcct gccattaagc acgttaaagg ccgacccgca tctggagatg cacaccaggg 3439681 gcgggtaagc cgaacgaaca gtcgattgat caacgactcc acagttgagg aaggaaccat 3439741 gacggtcagc acccctgagc aacacgagca acgagcatcc cacgatgcat ccgagggaaa 3439801 gcacaacgta tgtcagggga ggctggccgc acttgccgac gcggccgtgt cagagaaact 3439861 cggagcacta cctggctggc agcttctcga catgcgactc agccgcgctt ttcagtgcac 3439921 aaatttcgac caatccattg acttcatgaa tagggtcgca tcaatagcaa acgatatcaa 3439981 tcaccatccc gatatcgctg tactggacaa gcgttcggtg cgcgtgacgg cgtggacgcg 3440041 caagctgggc tatctgaccg acatcgactt cgatcttgcg gcgtccgtcg aggcgatgta 3440101 tgcgacagaa ttcgctgaca ggccagcacg atgatcgacc atgcactcgc gctgacacat 3440161 atcgatgagc gtggtgcggc acgaatggtc gatgtgtccg agaaacccgt gactttgagg 3440221 gttgccaaag cgtcagggct cgtgatcatg aagccgtcta ccttgaggat gatttccgac 3440281 ggtgccgctg ctaagggtga cgtcatggcg gcggcccgga tagctggcat cgcggcggcg 3440341 aaacgtacgg gtgatcttat tccgctatgc cacccgttag ggctcgacgc tgtcagcgtc 3440401 actatcacgc cgtgcgagcc tgaccgggtg aagattctgg cgacaaccac cacgctgggg 3440461 cgtaccggcg tggaaatgga agcgttgacc gcagtttcag tcgccgcctt gactatctac 3440521 gacatgtgca aagccgtcga tcgagccatg gagatttctc agatcgtgct ccaagagaaa 3440581 agcggcggcc ggtccggagt ttatcgccga agtgcttctg atttggcctg tcagtcccga 3440641 taagtaggtg agtgtctgaa tgattaaagt gaatgttctt tacttcggtg ccgttcgtga 3440701 ggcgtgtgac gaaacgcctc gggaggaagt agaggttcag aacggtaccg atgtcggaaa 3440761 tcttgttgat caactccagc aaaaataccc tcgccttcgc gatcattgtc agcgagtaca 3440821 gatggcggtc aaccaattca tcgcgccgct gtcgaccgtt ctcggcgatg gtgatgaggt 3440881 cgccttcatc ccgcaggtag ccggaggctg aacaagggga tgacggccgt gaatgcgctc 3440941 tcatcgtcgc cgctgttcgg caacgtggga gttccagtgc cggcgtgcag aacgaccgaa 3441001 attcgccgca cccgaatagt cgggtcgcat agatgaccag cagggatgga ttcaccatcg 3441061 tttgcgattg gaacgggacg ctgtgcgacg accggacaat tcttctcgac gcggttgggc 3441121 agacgctggt caacgaggga ttcgagcctc tttcgcaaca gcagctgatc caacggttcg 3441181 cacgcccact acgaacgttt ttcgagaatg cgtgcggtcg agatctcttg acgtccgagt 3441241 gggaacgcgt ccaatccacc tttcgccgaa tctatcgatc gcgagaagct gaagtcacac 3441301 tcgtcgaaga tgcgtacgac gttctggcgc agggaaaccg cagcgccgct gggcagttct 3441361 tattatcgct ggcgcctcac gacgagctta tgcacttcgt ccaaaaatac gggattgcca 3441421 agtggttcaa cgaaatccgt ggccggactc ggcccgacca agaaaaaccc atgatgctgg 3441481 cagaactgat catgcagcgc tctctgaatc ccactcgcgt ggtgcacatc ggcgattcgc 3441541 ttgaggacgc cgctgctgcc agcgcggtcg gagccatttc cgtcttggtc accggagctt 3441601 cacggcagcc acccgaccga gtcatgctca aacagttgca gcccttcgtt gcgagttcgc 3441661 tgaagcaagc actgcagtac gcgggtggcg acggtgattg acgacgaagg tacgcaggtg 3441721 gtggcggcgc gcctgccgtt cggatggcca gccgacagtg gggtgacagc cgacatcatc 3441781 gaggcagcga tggaacttgc gatcgacaca gcgcgacatg ccacggcacc gtttggcgct 3441841 gcgctgcttg atgttacgac actccgagca ttctcgggtg gcaacaccta ttttgaatcg 3441901 ggggatcgct tcgctcacgc cgaaaccaac gttctacggg ccgcaatgag cacattgccg 3441961 gagctttcaa atcacgtgct gatatccacc gccgagccat gcccgatgtg cgcggcggcc 3442021 agcgtgctca gcggagtgag agccatcatc ttcggcacat caatcgagac ccttatccag 3442081 tgcggttggt tccaaatccg catcagcgct tcggatgtgg tggcggcctc cactcgtccc 3442141 acgcgtccat cggtgtatag cggtttcctc agccacaaga cggacttgtt gtaccggaac 3442201 tccgaaaacc gacgagcaat gaacccctgg accgatccat cgcattgact cggcttgccg 3442261 actacctcac tgacccagga ggagagttac gtccaggggt gtggtgtacg ggcaggtaag 3442321 gccggtgggc gtgtcgtagc ccagtagtgg gcggtcatcg cgtgatcctt cgaaacgacc 3442381 agcaaaagtc aatcgaagga aatgacgcaa tgacctcttc tcatcttatc gacaccgagc 3442441 agcttctggc tgaccaactc gcacaggcga gcccggatct gctgcgcggg ctgctctcga 3442501 cgttcatcgc cgccttgatg ggggctgaag ccgacgccct gtgcggggcg ggctaccgcg 3442561 aacgcagcga tgagcggtcc aatcagcgca acggctaccg ccaccgtgat ttcgacaccc 3442621 gtgccgcaac catcgacgtc gcgatcccca agctgcgcca gggcagctat ttcccggact 3442681 ggctgctgca gcgccgcaag cgagctgaac gcgcactgac cagcgtggtg gcgacctgct 3442741 acctgctggg agtatccact cgccggatgg agcgcctggt cgaaacactt ggtgtgacaa 3442801 agctttccaa gtcgcaagtg tcgatcatgg ccaaagagct cgacgaagcc gtagaggcgt 3442861 ttcggacccg cccgctcgat gccggcccgt ataccttcct cgccgccgac gccctggtgc 3442921 tcaaggtgcg cgaggcaggc cgcgtcgtcg gagtgcacac cttgatcgcc accggcgtca 3442981 acgccgaggg ctaccgagag atcctgggca tccaggtcac ctccgccgag gacggggccg 3443041 gctggctggc gttcttccgc gacctggtcg cccgcggcct gtccggggtc gcgctggtca 3443101 ccagcgacgc ccacgccggc ctggtggccg cgatcggcgc caccctgccc gcagcggcct 3443161 ggcagcgctg cagaacccac tacgcagcca atctgatggc agccaccccg aagccctcct 3443221 ggccgtgggt gcgcaccctg ctgcactcca tctacgacca gcccgacgcc gaatcagttg 3443281 ttgcccaata tgatcgggta ctcgacgctc tgaccgacaa actccccgcg gtggccgagc 3443341 acctcgacac cgcccgcacc gacctgctgg cgttcaccgc cttccccaag cagatctggc 3443401 gccaaatctg gtccaacaac ccccaggaac gcctcaaccg agaggtacga cgccgaaccg 3443461 acgtcgtggg catcttcccc gaccgcgcct cgatcatccg cctcgtcgga gccgtcctcg 3443521 ccgaacaaca cgacgaatgg atcgaaggac ggcgctacct gggcctcgag gtcctcaccc 3443581 gagcccgagc agcactgacc agcaccgaag aacccgccaa gcagcaaacc accaacaccc 3443641 cagcactgac cacctagact gccacccgaa ggatcacgcg aggaaccttc actcgtacac 3443701 cacgtccctg gccttggcca ggaggagagc aatcatgact gaagccttga tcccggcacc 3443761 gtcgcagata tcgctgaccc gcgatgaggt gcgcaggtac agcaggcacc tcatcatccc 3443821 ggatatcggc gtcaacggcc aacagcggct gaaggatgcg cgcgtattgt gtatcggcgc 3443881 cggaggattg ggttcgcctg ctctcctgta tcttgcggcc gccggagtcg gtaccatcgg 3443941 catcatcgat ggagaccacg tggatgagtc gaatctgcaa cgccaaatca ttcatggcac 3444001 atccgacgtg ggtaggccga aagtagaatc agcagccgag gcggtggcgg aaatcaaccc 3444061 gcacgtccgg gtgacgcaat atcgcgaaat gctcacccac gacaacgcac tggaaatttt 3444121 tggcgatcac gacctcattg ttgacggcac agacaacttc acgacgcgct acctgatcaa 3444181 tgatgccgcg gtcttggccg gcaaaccata tgtttggggg tcgatctacc gattcaacgg 3444241 ccagaccagt gtgttttggc ccggccgggg gccgtgttat cgatgccttc atccagctcc 3444301 gcccccgccc ggattggtgc cgtcgtgcgc tgaaggcggt gtactcggtg ccatctgcgc 3444361 cacgattgcg tcgatccagg taactgaagt gctgaagctc cttaccggag tcggaactcc 3444421 cctcgtcggt cgcctgctca tgtatgaagc tctcgacgcg acataccatc aaatccggat 3444481 cgcgaagaat cctgactgcg ccatttgcgg cgatgcgccc acgatcaccg aattggtaga 3444541 tgacagcgtc agctgcgcat cgacacaatc ggtggatccc gaactagtga tcagttgtga 3444601 tgagttgcga accaaacagc agtcggacca gaacttcctc ttggtcgacg tgcgagagcc 3444661 cgccgagttc gacatcgcgc acattccggg cagcatcttg atacccaaag gcgaaatcgg 3444721 ctcggcggcg ggcctagccc agctaccgct ggacaaggaa attgtcctgt actgcaagag 3444781 tggaatccga tcggcccagg cgctaaccac gttgaaagca gccggactgc acaacgtgaa 3444841 gcatctcgac ggcggtatcg cggagtggac acgaaccatc gactcctcct tgttggtgta 3444901 ctagcaccga actatgcgaa aggattcccg ccatggcacg ctgcgatgtc ctggtctccg 3444961 ccgactgggc tgagagcaat ctgcacgcgc cgaaggtcgt tttcgtcgaa gtggacgagg 3445021 acaccagtgc atatgaccgt gaccatattg ccggcgcgat caagttggac tggcgcaccg 3445081 acctgcagga tccggtcaaa cgtgacttcg tcgacgccca gcaattctcc aagctgctgt 3445141 ccgagcgtgg catcgccaac gaggacacgg tgatcctgta cggcggcaac aacaattggt 3445201 tcgccgccta cgcgtactgg tatttcaagc tctacggcca tgagaaggtc aagttgctcg 3445261 acggcggccg caagaagtgg gagctcgacg gacgcccgct gtccagcgac ccggtcagcc 3445321 ggccggtgac ctcctacacc gcctccccgc cggataacac gattcgggca ttccgcgacg 3445381 aggtcctggc ggccatcaac gtcaagaacc tcatcgacgt gcgctctccc gacgagttct 3445441 ccggcaagat cctggccccc gcgcacctgc cgcaggaaca aagccagcgg cccggacaca 3445501 ttcctggtgc catcaacgtg ccgtggagca gggccgccaa cgaggacggc accttcaagt 3445561 ccgatgagga gttggccaag ctttacgccg acgccggcct agacaacagc aaggaaacga 3445621 ttgcctactg ccgaatcggg gaacggtcct cgcacacctg gttcgtgttg cgggaattac 3445681 tcggacacca aaacgtcaac atagcatttg gatatggtcc acatgcttgc ccggcctcag 3445741 cgtattcacg catgtgcttg acgacgttct tcacctcgct tacccagcga tttccgcaac 3445801 ttcaactcgc aagaccgttt gaggatttgg aacgacgggg taagggccta cattcggtgg 3445861 ggatcaagga actccttgtt acctggccga cgtgaccccg cgtgccagca agggactgtt 3445921 gacttctccg acggatgaaa gccgccctgg aatatccaac cgctcctgct cctcggtcaa 3445981 ctcaagccga aaccgccaac ggtggccaca aaatacgagt tcgtccacaa cgtcggcagc 3446041 cgggaccgca accacgcaaa ctcctcacgc actacccgca accgacggcc cctaattggg 3446101 gttgggccca tgatcggttg gcggctcatc aggcggtgca ggatcttggt gtgcccgcct 3446161 cggcgcggcg gagccggggt cgagcatctc tttgcgagtg atgaaggcac agccccggcg 3446221 cggggtgggt gtgcaacacg aatgtaggta gcgggagttg aggctgggcg cggtgtattc 3446281 tggttgttgg ataaacaacc agaatgggga gacgcgggtg ggcgaggact cgctggagga 3446341 tctggagcag cggcgagcgc gactgtatga ccagttggcc gcgaccggcg atttccggcg 3446401 cggctcgatc agtgagaact atcgccgctg cggcaagccc aattgtgtgt gcgcgcaaga 3446461 gggtcacccc gggcatgggc cgcgatattt gtggacgcgc acggtggccg ggcggggtac 3446521 caaggggcgg cagctctcgg tcgaggaggt ggacaaggtg cgcgccgagt tggccaacta 3446581 tcaccgtttc gcgcaggtca gtgagcagat cgtggcggtc aacgaggcga tctgcgaggc 3446641 ccgcccaccg aacccggcgg ccacggcgcc cccggccggc acaacggggc acaaaaaagg 3446701 gggctctgcg accagatcgc ggcggagttc accgccgagt tagagcggct ggttgcgctc 3446761 gcggtcggtg cgctgggatc ctcggtgccg acctggtcgc agtggagttg gcgatccgca 3446821 ctgcgatgac ccggctgggc tcctcgctgc tggagcagct gctgggcgcc aacaccgggc 3446881 accggggcca gcgcatcgat tgcgggcaag ggcattgcgc gtggttcgtc ggttaccgcg 3446941 acaagaacct cgataccgtg ctggaccggg tccggttgct ccgcgcctgc taccactgcc 3447001 gcacctgcgg gcgtgggatg gcgccccctg gatctggaac ctggccaccg cgatcctgcc 3447061 cgaagccacc ccgatcgtgg acctctacca cgctcgccag cacgtccacg acctcgccgg 3447121 ccagctcgca cccgccctcg gcgaacacca cagtgactgg ctgaccgccc ggctggtcga 3447181 cctcgactcc ggcgacatcg aaacgctggt tcaacaaccg atcgggcagc acaccggtca 3447241 cacgtaacga agtgtgcatg aaacccggag tggttcaggg gtccgccgcg ctcgtccgcg 3447301 ctgtgagggt ctcggcacta ccacgagatg agatcgaggc accaggtgca ttgtgcacca 3447361 cattctggcg atgttggtga ggtttgttcc tgcgcccgtc cgtggcgcgt tcgggatcgt 3447421 tggggttggc cggttgccca cctcggcgga agcggacggt gagcgcggcc gagtcgtcga 3447481 catttggcgg taggaggttt cgatgctgtt tgtcagcgtg gccgcggagt cggtaggggt 3447541 ggcggcggcg actcttgttg ggcccccgtt gatcggcaac ggcgccgatc ggcccccggc 3447601 accggacaag ccggcgggat cttgtggggc aacggccgtt ttcgcccaat cacaggagtg 3447661 gagttttgaa cgcaacgacg gcaggtgctg tgcaattcaa cgtcttagga ccactggaac 3447721 taaacctccg gggcaccaaa ctgccattgg gaacgccgaa acaacgtgcc gtgctcgcca 3447781 tgctgttgct atcccggaac caagtcgtag cggccgacgc actggtccag gcaatctggg 3447841 agaagtcgcc acctgcacga gcccgacgca ccgtccacac gtacatttgc aaccttcgcc 3447901 ggaccctgag cgatgcaggc gttgattcgc gcaacatctt ggttagtgag ccgccgggct 3447961 atcgccttct cattggagat cgacagcaat gcgatctcga ccgtttcgtg gcagcgaaag 3448021 aatcgggact gcgcgcttct gccaaaggat attttagcga ggcgatccgt tatctagatt 3448081 cggccttgca gaattggcgc ggtccagtac tgggggacct acgcagcttt atgtttgtcc 3448141 aaatgttcag cagggcgttg accgaagatg agctcctcgt ccatacgaag ctggccgaag 3448201 ctgcaatcgc ctgcggacgc gccgacgtcg ttatccctaa attggaaaga ctcgttgcga 3448261 tgcatcctta tcgcgagtcg ttatggaagc agttaatgct cggctactac gtgaacgaat 3448321 accagtccgc ggcaatcgac gcatatcata gactcaagtc cacgctcgca gaggaactcg 3448381 gtgttgagcc ggcacccacg atacgtgcgc tctaccacaa aattcttcgc caattgccca 3448441 tggacgatct cgtcggccga gtcacgcgtg gcagggttga cttgcgtggc ggcaacggcg 3448501 ctaaggtaga ggaactgacc gagagcgata aggatctcct tcccatcggt ttggcataac 3448561 tacgcccctc aatgcaagcg agctgattcg atgttgtcga gccggagccc gctccgacct 3448621 ccgtcacaca gaccggacta cgaatactga cccgcgctgc tagccaaccc cggttcgtgg 3448681 aatcacagtg agacgtgcct gcgtgacatg ccaacccgca ccatcacgat ccatcagccc 3448741 accgggcata ccagcgccgg caccgctaat actcattggc atcagcatca tcggcatacc 3448801 accaccggcg gccccggccg cctgcgtcag cgcgactggg ttaggcggca cacccaaccc 3448861 ggacatcgct gaagaagcca tcgaaatggg tatcgacccc tgccacgtcg gcggcaccga 3448921 catcgacccc accaactgag cctgccctaa gccagcggac atccccgcac ccaattcccg 3448981 accgcctagc ggcttgaacg ccggaatagc cgcaccactc gggttcggcg tattcgcggc 3449041 agccaacccg ccagccggcg gacctaacct ggtcgcgctt tcacgcgcca tactcgccaa 3449101 cgtcgttatc ggcgacgtca ggatgcggac cgggagaaac aacatcgacg catgttgcaa 3449161 cggcagctgc gacaccagcg actgcatccc gctcatcacg gtcggcaccg acgccatcgc 3449221 accctcgacc accggcgtca cggcagccga cgccgtcgtc gccatcccgg caacctgggt 3449281 acccacctgc gcggccaacc cggctaagct caccggcggc agactaaatg gcgccaacgt 3449341 cgccgccacg gactttgccc cagcgtgata gcccaccatc gcagccacat cctgagccca 3449401 catctccagg taatcgaact ccgtggctgc gatcgccggg gtgttctgac ccaaaacgtt 3449461 cgccgctatc aacgacgcca gcgacacccg attcgccgtc accgccgtcg gatgcaccgt 3449521 ggccgccaac gcggcctcaa acgccgtcgc tgccgcgcga gcctgaatgg ccgccagctg 3449581 cgcctgcgac gccaccgtgc tcaaccaccc gacatacgga gacgcggcag cagccatcga 3449641 catcgacgcc ggaccggtcc acggcccagt cgtcaacgcg gcgagcaccg actcaaacga 3449701 cgatgccgat gcccacaaat ccacggctag cccctcccac gccgaggccg ccgcaaacaa 3449761 cggccccgag ccggctccgg caaacatgcg cgccgagttg atctccggcg gcagccacga 3449821 aaagcccaaa accatcgcaa ccccagccca atcagccgcc cagaagggtc tcgtacaagg 3449881 gttaactaaa caatcgttac cgaatgaatc gacacatcgt gacgcaccga tggctcagca 3449941 cgccggactt ctagaacaac gagcacaacg gatatgatgc ggcaggcatc ttcatggatt 3450001 gtcaatgaca gcccaaaccg ccttcggcca ctggcattgg tgcactgcac cgtgcgccat 3450061 tcgtggcgac aactgcgagc gggagcggga ccaaggatga tggtcccggt cgcgacgggc 3450121 gcgatcccgc tccggagtgg tcaacgcatc aaacgacaaa gcgctcagct catcgaccgc 3450181 agcatcgagc cggtccagcg ccgcgaccaa actagaattc tcgcgcagac accgctgaaa 3450241 cgacagtgac gcaagggatt tcattgagag gaccaatgac cctatttgat caaaccggat 3450301 gaccataccg tcaacgttgt ggacatacag gtgctcaaga acgcagtctt gctggcatgc 3450361 cgggcgccgt cggtgcacaa cagccagccc tggcgttggg tggccgaaag cggctccgag 3450421 cacactactg tgcacctgtt cgtcaaccgc caccgaacgg tgccggccac cgaccattcc 3450481 ggccggcaag cgatcatcag ttgcggtgcc gtactcgatc accttcgcat cgccatgacg 3450541 gccgcgcact ggcaggcgaa tatcactcgc tttccccagc cgaaccaacc tgaccagttg 3450601 gccaccgtcg aatgcagtcc catcgatcac gtcacggcgg gacagcgaaa ccgcgcccag 3450661 gcgattctgc agcgccgaac cgatcggctt ccgtttgaca gcccgatgta ctggcacctg 3450721 tttgagcccg cgctgcgcga cgccgtcgac aaagacgttg cgatgcttga tgtggtatcc 3450781 gacgaccagc gaacacgact ggtggtagcg tcacaactca gcgaagtcct gcggcgggac 3450841 gatccgtact atcacgccga actcgaatgg tggacttcac cgttcgtgct ggcccatggt 3450901 gtgccgccgg atacgctggc atcagacgcc gaacgcttgc gggttgacct gggccgtgac 3450961 ttcccggtcc ggagctacca gaatcgccgt gccgagctag ctgatgaccg atcgaaagtc 3451021 cttgtgctgt cgacccctag cgacacgcga gccgacgcac tgaggtgtgg cgaagtgctg 3451081 tcgaccatcc tactcgagtg caccatggcc ggcatggcta cctgcacgtt gacccatctg 3451141 atcgaatcca gtgacagtcg tgacatcgtg cggggcctga cgaggcagcg aggcgagccg 3451201 caagccttga tccgggtagg gatagccccg ccgttggcag cagttcccgc ccccacacca 3451261 cggcggccgc tggacagcgt cttgcagatt cgccagacgc ccgagaaagg gcgtaatgcc 3451321 tcagatagaa atgcccgtga aacgggttgg ttcagcccgc cttgatcagg atgcctttgt 3451381 ggatgtcggg tagggcggtg gggatgttag cgaggtagag ctgctcggtt ttctccttgg 3451441 ccaagatgag gagtcggttc tgcaggtcgg cgattttgcg gccgatctgg gcggggttga 3451501 ggctgtctcg gtaggtgatc aggtcggcct gctgggccgc ggagagcacc cttgcggcca 3451561 gtggccggtc cagcggcgtc tgtggggcat cgtagaggcg tcggcggcgg ccgtcggcgc 3451621 tgctggcata cccgatcggt ttgatggtcg gggtgaggta gttgaggcgg tcgttgacca 3451681 gcttccacat ccggttgagc acggcgcgtt cctcggcggt gtcatagcgg tagtagaacg 3451741 cgtacttgcg gaccaggtgg ttgttcttgg actcgatggt ggcctagtgg tttttcttgt 3451801 acgggcgaaa gcgggtgaag tagataccgt tgtcgccggc ccagctgatg accggcttgt 3451861 tgagaaaccc ggtgccgttg tcgaaatcta aacccgttat cccatgcggg atctcggtga 3451921 cagaagcttt gagcccggcg aggatgtggg tacgggcgtt gttgcggacg gtgcgggtga 3451981 acacccatcc gatgtgcacg tcggtcaagt tcagggtgtg ggcgaactcg cctttgagcg 3452041 tcggaccgca atgggcgacg gtgtcgccct cgaagaaccc cggctccgcc tcgacctcat 3452101 cgccggccct gcgaaccttg atcgaattac gcagcagtgg tgagggtttc gtcgtcgaca 3452161 cacccgatat ctggtctttg gccttcgcgg tcttcagata acgatcgatg ctggccgcac 3452221 tcatcgccaa cagctcctca cgcacctcgg ggccatagcg gtcacgccca aactccaaca 3452281 caccgtgacg ttccaaccca tcaagctgca gcaccatcga ggcggcaaga tacttcccgc 3452341 actgcccacc cgaggcggac cacaccctct gcaacacctt cagcgcgtca taggagtact 3452401 tcagcgaacg cggtttgcgc cgccgcttgg caacactgcg gcccagcccc ggcgatagct 3452461 tggccgctgc gacaagccgg cgccgcgcgt tatcacgtga ctagcccgtc aggtcaacca 3452521 cctggtcgaa aatccggccc cggctcttct tcaaagcctg cacatacgcc ttggcgtacc 3452581 tgctggtgac ctccgcgcga gatctcatcg acaacccact tcccatgcct cacgacggtc 3452641 accatgtcgc gggcatattt acgtgaggca ccgagggtgt ttcgcgggca ttcttggtga 3452701 gtcaagtcga acggttgagc catgatcgac gattccgtta ccgtgctgtc agaagacgaa 3452761 agttggcacc ggctgggcag cgttgcactc ggtcggctag ttaccacctt tgctgatgag 3452821 cctgggatct tccagtcaat ttcgtggtgc aaggccgcac cgtgctgttt cgtaccgcgg 3452881 agggcgccaa attattttca gccgtcgcga agtgcgcggt ggctttcgag gcggacgacc 3452941 acaacgttgc cgagggctgg agcgtgatcg tcaaggttcg cgcccaggtg ctgacgaccg 3453001 acgcgggggt ccgcgaagcc gaacgcgccc agttactacc gtggaccgcg acgctgaaac 3453061 gtcactgtgt gcgggtgatc ccgtgggaga tcaccggccg ccacttcagg ttcggtccgg 3453121 aaccggaccg cagccagacc tttgcctgcg aggcctcgtc acacaaccag cgatagcgct 3453181 ccgcgcctgc gagtcacctt gcgccgctta ctgatcgcca ccagccgtgc gacggcgtct 3453241 tcaattcctc gcgccagctg gccggcatct gctaccacgt cgtagtcggc caggatcccg 3453301 aagtacaggt cgtcggcgta gctgagcatc gcgacactgg tgcgcagttg catcgcgatc 3453361 ggcgaaaccg ggtataggtc aagcacccgt ctgcccataa tctgcagcgg ccgtcgtgga 3453421 cccggcacat ttgtcgccac ggtgacaaca ccacgctgcg gcagccgcat caacagcccg 3453481 accgcccatg cggtcatggg gaacggaagg cggttggcaa tcgccatcaa agtatttccg 3453541 aattgtctct gtccccccgc cttggcccga gtcagccgcg agtgcacgat ccgcagccgc 3453601 tgcagcgggt tctcttgatc caccggcagg ttgggcagca ttaacgaaac acggttatcg 3453661 gtcttgctca aagcgctgtt ggaacgcgtc gagaccggca ctagcgtacg cagcgaatca 3453721 aacctaggcc gctcaccccg ctggatgagg acgttgcggt agctttccgt aatcgcggca 3453781 agcgcaacat cattgatggt gacgtcgaat ttccggcaca cctgttcgac gtcggcgaga 3453841 gggacctttg ctgcgctgta gcgacgcaaa tcactgatcg gcccgttcaa cgacgacgcg 3453901 gcgggactta gcacgccggc cgcgatctca ctggcaccct tggccgcgcg aacgatgcct 3453961 gccatcacgg cggtcgacgc ggtcaacgcc tcgcttggat tgacacggaa tccaccccgc 3454021 cgcacagatg cggattgcga ctgcatggtc gtgtggatgt tgctcgcgaa gctgtcgctc 3454081 atactttcat cggagagccc agctagcagg tgagtcgccg cgattccgtc ggccatgcag 3454141 tggtgcagtt tggtcaggat cgcccacttg ctgtccgcca ggccttcgat gacccagacc 3454201 tcccacagcg gtcgaccccg gtccaaacga cgcgccatca gatcggcgat cagctcgaat 3454261 aactggtctt cgttgccagg ccgcggcaag gcgatgcgcc acacatgacg gccaagatcg 3454321 aagtcgggat cgtccaccca tttgggtgca ccgaggtcga acgggcgcag gcgtaaccgc 3454381 tgcccgaacc gggtacaggg acgtaggcgt tgagcgagcg acgataagaa ggcttcctga 3454441 tcgggagccg gcccctcgat gaccgccaga gcgccgattg ccagactcac gtgccgatcc 3454501 acgtcttctg ccttgagaaa cccggcgtca agtgtcgtta ggtgattcat ggtcagcgcc 3454561 ttccccggtg atccggatta tctgcaaccg tcagtaccac tctccgctgc gaggagccgt 3454621 tgaggcaggg ccaaaggtcc tccgctggcg agccttcgtg ctctgccacc gcggctgtcg 3454681 acgcgcgatc cttaatagat gaccgcagcc gttgatggga aaggcccggc agccatgaac 3454741 acccatttcc cggacgccga aaccgtgcga acggttctca ccctggccgt ccgggccccc 3454801 tccatccaca acacgcagcc gtggcggtgg cgggtatgcc cgacgagtct ggagctgttc 3454861 tctagacccg atatgcagct gcgtagcacc gatccggacg ggcgtgagtt gatcctcagc 3454921 tgtggtgtgg cattgcacca ctgcgtcgtc gctttggcgt cgctgggctg gcaggccaag 3454981 gtaaaccgtt tccccgatcc caaggaccgc tgccatctgg ccaccatcgg ggtacaaccg 3455041 cttgttcccg atcaggccga tgtcgccttg gcggcggcca taccgcggcg acgcaccgat 3455101 cggcgcgcct acagttgctg gccggtgcca ggaggtgaca tcgcgttgat ggccgcaaga 3455161 gcagcccgtg gcggggtcat gctgcggcag gtcagtgccc tagaccgaat gaaagccatt 3455221 gtggcgcagg ctgtcttgga ccacgtgacc gacgaggaat atctgcgcga gctcaccatt 3455281 tggagtgggc gctacggttc agtggccggg gttcccgccc gcaacgagcc gccatcagac 3455341 cccagtgccc cgatccccgg tcgcctgttc gccgggcccg gtctgtctca gccgtccgac 3455401 gtcttacccg ctgacgacgg cgccgcgatc ctggcactag gcaccgagac agacgaccgg 3455461 ttggcccggc tgcgcgccgg cgaggccgcc agcatcgtct tgttgaccgc gacggcaatg 3455521 gggctggcgt gctgcccgat caccgaaccg ctggagatcg ccaagacccg cgacgcggtc 3455581 cgtgccgagg tgttcggcgc cggcggctac ccccagatgc tgctgcgagt gggttgggca 3455641 ccgatcaatg ccgacccgtt gccaccgacg ccacggcgcg aactgtccca ggtcgttgag 3455701 tggccggaag agctactgcg acaacggtgc tgaccatcgc agcactgttc cgctcgcgcc 3455761 cggtacgctc gcgagggtga attcgccgcc ggcctgctct gcccgctgcc gcaggttcgt 3455821 taagccgctt ccggtgaact cgtcgggcag cccgcggccg ttgtcggtca cctcgatgca 3455881 caagtcgtcg tcgactttga cccggacggt caacgtgctg gccttcgcat ggcgaaccgc 3455941 gttgctgacc gcttcccgaa ccaccgcctc ggcctgatcg gcgagcgcgc tgtcgaccac 3456001 cgacaatgga cccacgaatt gaacgctggt gcgcaacccc gagtcggcaa attgggctac 3456061 ggccgcatcg attcgctgcc ggagccgagt gataccctgc gatgctccgt gcaggtcata 3456121 aatggtggtc cggatttcct gtataacgtc ttgcagatcg tctaccacgt ccgagagtcg 3456181 ttgctgcact tcaggattac gttcgtgcgg gacagcaccc tgcaaagcca ggccaatcgc 3456241 gaagagccgc tggatgacat ggtcatggag gtcacgggcg atacgatccc ggtcggtcag 3456301 tacgtcgagt tcgcgcatcc gacgttgcga agtggccaat tgccaagcca gcgcggcctg 3456361 gtcggcgaac gcggccatca tctcgagttg ttcgtcggtg aaagcccctg gaccgccttg 3456421 actcagcaca acaacgacac ccgctacggt acctctggcc cgcagcggca acagcagcgc 3456481 cggacctgcg tcggccagtt cgtccaggcc ttccaaatcg acccggtcga cccgtcgcgg 3456541 aatgccgttg acgaagacct cccgcagcac cgcgcccgcc accggaatcg ttcgcccaac 3456601 agtggaagcc acagcgctgc cgactgtttc aatcaccagc agctccccca cgtcagcggc 3456661 aggcatgtcc tcgtcgacgg gaacggctac cagggcagcg tcagccgccg tcagcttgag 3456721 cgcctccgcg gcgacaagcc ggaacaccgt cgcgggttcg gtgccggaca acaactcggt 3456781 ggcgatgtca cgggtggcct cgatccacga ctgacgcgcc ttagcctgct ggtagagccg 3456841 ggcattcgcg actgcgatac ccgcggcggc cgccagcgcc tggaccagaa cctcgtcgtc 3456901 gtcgctgaac ggttgcccgt tggtcttgtc agtcaggtac agagtgccga acgattcatc 3456961 gcgcacccga accggtaccc cgaggaaggt acgcatcggc ggatgatacg gcggaaaacc 3457021 aatcgaggcc gggtgcgcag aaacatcgtc cagccgtaac ggtttgggat cttcgatgag 3457081 cagcccgatg acgcctaggc ctttcggtag gtggccgatc cgccgaacgg tctcctcgtc 3457141 gatgccttca tagacaaagt gcaatacccg atgctgccgg tcgtgcacct ccatagcgcc 3457201 atagcgcgca tcgacaaggc tggtcgctga atgcacgata gcgcgtaggg ttgcctccag 3457261 gtccaggccc gctgtgacca cgagcatggc ctccaccaga ccatcgaggc ggtcccggcc 3457321 ctcgacgatc tgctcgaccc ggtcctgcac ctcgaccagc agctcgtgca ggcgtagttg 3457381 ggagagcgtg tgacgcagtg gacgcattgc ggcgccgtcg ttttcgtcga cgaggccccc 3457441 tgttgtcatg gtccatcacc gggtggccgc gagcgcttca actccgtcgc gaataccgcg 3457501 gcttgcgtcc gacgttccat gcccagcttg gccagcaacc gcgacacgta gttcttcacc 3457561 gtcttttcgg ctaggaacat tcggtcggcg atctgcttgt tggtcaggcc ctcgctaagc 3457621 aggcccagta gcgtccgctc ctggtcggta aggcctgata gcgggtcctg cttctcggcg 3457681 gcaccgcgca gcttggccat cagcgcggcc gcggcccgat tgtccagcag cgaccgtcca 3457741 gcgcccacat ctttgacggc gcgcgccaac tccattccct tgatgtcttt gacgacatat 3457801 ccgctggcac cggcgagaat cgcatctagc atggcctcgt cagaggtgta ggacgtgagg 3457861 atcagacagc gcagatcggg catgcgggac aacagatcgc ggcacagttc aatgccgttg 3457921 ccatcgggca accggacatc cagcaccgcg acatctgggc gcgcggcagg aaccctggcc 3457981 atcgcctcgg cgaccgaacc cgcctcacct acgacgtcaa gctcgggatc ggccccaagc 3458041 aagtcaacca gaccacgacg caccacctcg tggtcatcga ccaagaagac ctttaccacc 3458101 agggcaccac tcccaagatc cgctccctac aagttggcac tgcgtaccgt aagtacggcg 3458161 catccgggct ggtatgcacc gcacaattcg tgcgcggagt gtgagtccgc gacgaacagc 3458221 tgacccggct ttgcgttggc ggccagatga cggcacgcac tgccgccggc gatggcccga 3458281 tccacccgca cctcggggta gagccgggtc cagtgggcga gccgacggct caggtgtaca 3458341 tgcgccaacc ggctgccctg ttcgacgtca tcgggtgttt cagcagcgtg gacagccacg 3458401 gcccgcagcg gaactccgcg cagcctggcc tcctcgaatg cgtgccgcag caccacacca 3458461 ttgtccacct ccgcgacaac cgcgctgacc tgggaggttg tcgctggctc ggccggcgac 3458521 gggtgaatca ccgccacggg gcataaggcc gacccagcca gggtcgccgc gaccgaaccc 3458581 cggcgaccgc ggacatgatc aagccccacc gaaccgacgc acagcatcgc cgcggacctg 3458641 gactcctgca tcagcttggt gagcggcctg ccgcacagaa cctccgtttc gatcttgacc 3458701 ggttgcccgg tggcctcgac cttccgagag gcgtcgtgca gcgccgctcg ggccgctgat 3458761 tgcccaccgc cctcgccggc ggcggacagt tgggacggat cgatgacgta caccagtcgc 3458821 agcggaatgt ctcggttcac cgcctcatcg accgcccaca acgccgcatg cgttgccgcc 3458881 cttgacccgt cgataccaac gaccactgcc cgagctggcc gaggatcgct catcgccgtc 3458941 tccttcgctg gggcggatac atcccgtcgg ttcagcggta cgttactggc ggggaccgct 3459001 atctcccagg ggcgttggtc cccacctgag ggccgttagt ccttatcgac cgatgacaga 3459061 cgcaacccgt cagggcgaga atgaatctca cctatcgcac gggtggctcg tccaggtcca 3459121 caaccatcgc ccagcttttc acagcaaagt cccagaaatg gcttacagtt gccgacagct 3459181 gccgaaccag cggccgtcca tcggctgcat atcgcttgac ccacagaata tttgggcata 3459241 gccgcgctgt gagagcgcat ctcgatgcgg ccggcacggc gtcgatcaat ctccgatccg 3459301 ccgtcagtcg actgccatac aacctgcccg cccagttgta cactggccgc ggcacgagtt 3459361 gccgcactgg tcaacaagta tcagccggcc tgcgcccgag cggagcccac tcggagccgc 3459421 tcgtgaccat ggggggagcc actgccgtct cccgcatgcc cacaccgagg tccgaattgg 3459481 gctgggtgcg caatcgacgt taggggcctg cggagtaatg gactacgcgt tcttaccacc 3459541 ggagatcaac tccgcgcgta tgtacagcgg tcccggaccg aattcaatgt tggttgccgc 3459601 ggccagctgg gatgcgctgg ccgcggagtt agcatccgca gcagagaact acggctcggt 3459661 gattgcgcgt ctgaccggta tgcactggtg gggcccggcg tccacgtcga tgctggccat 3459721 gtcggctcca tacgtggaat ggctggagcg gaccgccgcg cagaccaagc agaccgctac 3459781 ccaagccaga gcggcggcgg cggcattcga gcaggctcat gcgatgacgg tgcccccagc 3459841 gttggtcacc gccaaccgag ccgagctgaa agcactgatc gcgtcgaacc tcctgggcca 3459901 gaacaccgca gcgatcgcag ccatcgaggc acagtacgcc gagatgtggg cacaagacgc 3459961 ggccgcgatg tacggctacg ccaccacctc agcggcggcg agacagttga cgccgttctc 3460021 ctcgccacaa cagaccacca acccggccgg gctagccgcc cagaacgccg cggtcaccca 3460081 ggccgccacc aactccgccg ggaacacgcc gaccgcattg tcgcaactgt cctctttcct 3460141 gtcgcaggca gtagaggcgc cgacgggatg gcccaacatt ctcccggatg acttcaccat 3460201 ccttgacggc atattggctg cgtacgcaac ggtcggcgtg acgcaggaca tcgagtcgat 3460261 atgtgcgggc atcatcgggg ccgagaacaa cttgggcctt ttaggcgccg ccagtgagaa 3460321 tccggcggag ttggcgccgg gcgcgttcgg gatcgatgcg gcactcagtt cggcagaaaa 3460381 aggcgctgct gcgagcatgc acgatgcggt actcgcgagt gccggccggg cgggttcgat 3460441 cgggccgatg tccgtaccac cgtcctgggc cacgccctcg agcaccccgg tctcggcgtt 3460501 gtcgggcgcc ggcttaacca cgctcgacgg gaccgacgta gccgagcacg gtacgcccgg 3460561 cttgccaggg gtgccagcag ggacagacaa gcgagcctcg ggtgtcatcc cccgatacgg 3460621 ggtccggctc acggtgatgt cgcgcccgcc ggcggctggg tgacgcggct gccgctgtca 3460681 ctgtcatcct gtaggcgagc caaccccggt caccactagc gtgccgcaaa gatctgcgcg 3460741 cggcccgcgc actgagcccc aggaggggtc agtgaattca gttgtgtgac tgcgtctttc 3460801 gtcgacaccg cgtcaggtgc cggcgtttct ttctcgcggc tacgattcac cgcgccccag 3460861 gaattcactc gatagccgcc tggaattcac gcagatctgc ccacgccggc cggagacctt 3460921 tgcgagtttt atgacacgct gtcaccgaag gtagcccgag atgccctttg cggatgcctg 3460981 gataccaagt ccgatagcgt ctgcaaagcg ttcctgccat cacacggcac gccacggtag 3461041 cgcgaacccc cgccatacgc cccagctgcg ccccggccac gacaccggac cccgacacgc 3461101 gtcgggtcgt actactcgcg ttcaactgcc acgagcggcc atatgacttg gattacaaca 3461161 gttttgactc ggcagccgga taggtcaggc atccggggtg ccatcgttgt cgaaacggcc 3461221 agtgccagca acaccgctgg cactccacct tgacccattc agttctcgac cagcacgaca 3461281 ccgtatccgc acaaatgtaa ggagctgaga cacaatggat ttcgcactgt taccaccgga 3461341 agtcaactcc gcccggatgt acaccggccc tggggcagga tcgctgttgg ctgccgcggg 3461401 cggctgggat tcgctggccg ccgagttggc caccacagcc gaggcatatg gatcggtgct 3461461 gtccggactg gccgccttgc attggcgtgg accggcagcg gaatcgatgg cggtgacggc 3461521 cgctccctat atcggttggc tgtacacgac cgccgaaaag acacagcaaa cagcgatcca 3461581 agccagggcg gcagcgctgg ccttcgagca agcatacgca atgaccctgc cgccaccggt 3461641 ggtagcggcc aaccggatac agctgctagc actgatcgcg acgaacttct tcggccagaa 3461701 cactgcggcg atcgcggcca ccgaggcaca gtacgccgag atgtgggccc aggacgccgc 3461761 cgcgatgtac ggttacgcca ccgcctcagc ggctgcggcc ctgctgacac cgttctcccc 3461821 gccgcggcag accaccaacc cggccggcct gaccgctcag gccgccgcgg tcagccaggc 3461881 caccgaccca ctgtcgctgc tgattgagac ggtgacccaa gcgctgcaag cgctgacgat 3461941 tccgagcttc atccctgagg acttcacctt ccttgacgcc atattcgctg gatatgccac 3462001 ggtaggtgtg acgcaggatg tcgagtcctt tgttgccggg accatcgggg ccgagagcaa 3462061 cctaggcctt ttgaacgtcg gcgacgagaa tcccgcggag gtgacaccgg gcgactttgg 3462121 gatcggcgag ttggtttccg cgaccagtcc cggcggtggg gtgtctgcgt cgggtgccgg 3462181 cggtgcggcg agcgtcggca acacggtgct cgcgagtgtc ggccgggcaa actcgattgg 3462241 gcaactatcg gtcccaccga gctgggccgc gccctcgacg cgccctgtct cggcattgtc 3462301 gcccgccggc ctgaccacac tcccggggac cgacgtggcc gagcacggga tgccaggtgt 3462361 accgggggtg ccagtggcag cagggcgagc ctccggcgtc ctacctcgat acggggttcg 3462421 gctcacggtg atggcccacc cacccgcggc agggtaaccc ggcgcctaac cgacaggcgg 3462481 cccgttgggc gtaaacgtcc aattgtcagg attcttcggc gagtacacca ccggaagtat 3462541 ttgaccgacg gtcggccact ggtcgacgtc gacggccatg cgctgataca cggcgtactc 3462601 attgaccgtg ggcccagtga tgatcccggc gatggtgaca tactgctggc cgcctgcgtc 3462661 cggtcgcggg ctgactccgg tcaccaggag cgtgccgctg gccagatctc cccgcgggcc 3462721 gcgcgggata agccgcggag caagaaatac cgctaggacc gcgatcagta tgagtagcac 3462781 gccaaactcc catcccaccc ggccatggta ggactgctgg catgagccgt tattacgccg 3462841 agcgtgaact cagtgcaaga acgcacgcga aaaatcgcac tgggtacacg ctcggcgaaa 3462901 ggatggtgca ccagtgagcc acgacgatct aatgcttgcg ctggctctgg ccgaccgtgc 3462961 ggacgaattg acgcgggtcc ggttcggggc gctcgatctg cgcatcgaca ccaaaccgga 3463021 tttgacgccg gtgaccgacg ccgatcgggc ggtcgaatcc gacgtgcgcc agacgctggg 3463081 ccgcgaccgg cccggcgacg gcgtcttggg cgaggagttc ggcggatcaa cgaccttcac 3463141 cggacggcag tggatcgtag acccgatcga cggcaccaaa aactttgtgc gcggggtgcc 3463201 ggtgtgggcc agtttgatcg cgctgcttga agatggcgtc ccgtcggtcg gtgtggtgag 3463261 tgcgccggcg ctgcaacggc ggtggtgggc ggcacgcggc cggggcgcgt tcgcatccgt 3463321 cgatggtgcg cgtccacacc ggctgtcggt ttcctctgtg gcagagctgc attcggcgag 3463381 cttgtcgttt tccagtctgt ccgggtgggc gcggctgggt ctacgtgaac gcttcatcgg 3463441 gttgaccgat accgtgtggc gcgtgcgtgc ttacggcgac tttctgtctt actgcctggt 3463501 ggccgagggc gccgtcgata ttgccgccga accgcaagtg tcggtatggg atctggcggc 3463561 actggacatc gtggtgcgtg aggcgggcgg gcggctcacc agcctggacg gcgtcgccgg 3463621 cccacacggg ggcagcgccg ttgcaaccaa cggtctgttg cacgacgagg tgctgacacg 3463681 gctcaacgcc gggtaacctg gcgctcgaga gcgccatgag cgacccgttc accatcgcaa 3463741 ccaaacactg gcaccgactg cacgacagcc ggatccagtg cgatgtatgt ccacgcgcat 3463801 gcaaacttca cgagggacag cgtggcctgt gtttcgtccg cggccgattt gacgatcaag 3463861 tgaagctcac cagctacgga cgctctagcg gattctgtgt cgatccgatc gagaaaaagc 3463921 cgctcaacca cttcttgcca ggttcggcga cgctgtcttt cggcaccgcc gggtgcaacc 3463981 tggcgtgcaa gttctgccag aactgggata tctccaagtc ccgcgagatc gacgtcctgg 3464041 ccaatcgggc ggccccggcc gacatcgccc ggaccgcaca cgaattgggt tgccgcagcg 3464101 tggcattcac ctacaacgac ccaacgatct tctgggagta tgccgccgat gtagccgacg 3464161 cctgccacga ccagggaatc aaagccgtcg cggtgacggc cgggtacatg tgtcctgagc 3464221 cccgcgcgga attctaccgg cgtgtcgacg ccgccaacgt cgacctaaag gcattcaccg 3464281 aagactttta tcgcaaggtt tgcgtcagtc acctgcgcaa cgtcctggac accctggcct 3464341 acctgcggca ccagacgaat gtgtggttgg agatcaccac cctgctgatt cccggacgta 3464401 acgacagcga cgcggaagtc gctgccgaat gcagatggat ccgcgaaaac ctgggcgtcg 3464461 acgtgccggt gcatttcacc gcgtcccatc ccgactacaa gatgatggac accccggcta 3464521 cactacccgc cacattgacc cgagcccgcg agatcggcat tggcgaaggc ctgcgcttcg 3464581 tctacaccgg aaacgttcac gatgccgtgg gtggcagcac ctcgtgccca ggctgccggg 3464641 caacggtgat cgttcgcgac tggtattcga tacgacatta cgccctcacc gaggacggcc 3464701 gctgccaagc atgcggctat cagatgcctg gcgtgtacga cggaccggcc ggacactggg 3464761 gccagcgccg gctgcccttg ctgaccagct tgtcccggat gtgaacaact taacaagcac 3464821 ccctatctta ctccggagta agatagggtg gtccgctatc accccgatga ccgaggctgc 3464881 cgtatgacca acaccacctc tgctgcaaat gctgcaaaac cctccggcgc acgcaccgat 3464941 agacgcggcc gcacgaccgg tgtcggcctg gcgccccaca aacggaccgg catcgacgtc 3465001 gcactggcgc tgctaacccc gattgtcggc caggagttcc tggacaaata ccgcctgcgc 3465061 gatccgctga accgatcact gcgctacggc gtgaagacga tgtttgccac tgccggcgcc 3465121 gccacccgtc agttccagcg ggtgcaaggc ctgcggggcg gaccgacccg gctgaagtcc 3465181 agcggccgag actacttcga tctgacgccc gatgacgacc agaagctgat catcgagacc 3465241 gtcgacgaat tcgccgaaga ggtactgcga cccgccgcgc acgacgccga cgacgccgcg 3465301 acctacccgt ccgacttgac cgccaaggcc gccgagctgg gcattaccgc gatcaacatc 3465361 cccgaggact tcgacggtat cgccgaacac cgctccagcg tcaccaacgt gctggtggct 3465421 gaggcactgg cgtatggcga catgggcctg gcactgccga tcctggcgcc tggcggggtg 3465481 gcgtccgcgc tcacccattg gggcagcgcc gatcagcagg ccacctatct caaagagttc 3465541 gccggcgaga acgttccgca ggcctgcgtg gccatcaccg aaccgcagcc actattcgat 3465601 cccacccggc tgaagaccac cgcggtgcgc accccgtccg gttaccggct cgacggcgtg 3465661 aagtcgttga tcccggccgc cgccgacgcc gagctgttta ttgtcggcgc gcagctgggc 3465721 ggcaagcccg cactgttcat tgtcgagtcc gcggccagcg gcctgaccgt caaggcggat 3465781 ccgagcatgg ggattcgcgg cgcggcgttg ggccaggtcg aactctgcgg ggtgtcggtc 3465841 ccgcttaacg cccggctggg cgaggacgaa gccagcgaca acgactattc cgaggcgctt 3465901 gcgctggccc ggttgggttg ggcggcgctg gcggtcggta cctctcacgc cgtgctcgac 3465961 tacgtcgtcc cgtatgtgaa acaacgccag gctttcggcg agccgatcgc tcatcgccaa 3466021 gcggtggcgt tcatgtgcgc caacatcgcg atcgagctcg acggcctgcg cctgatcacc 3466081 tggcgcgggg cgtcccgtgc cgagcagggt ctgccgttcg caagggaagc ggcgctagcc 3466141 aagcggcttg gctccgacaa gggcatgcag atcggcctgg acggggtgca actgctgggc 3466201 ggccacggct acaccaagga gcatccggtt gagcgctggt accgcgacct gcgagccatc 3466261 ggcgtcgccg agggcgttgt tgtcatctag aacgagctga aagatcaatc atggcaataa 3466321 atctggaact gccgcgcaag ctgcaggcga tcatcgtcaa gacccatcag ggcgctgcgg 3466381 agatgatgcg gccgatagcc cgcaagtacg acctgaagga acatgcctac ccggtcgaac 3466441 tcgacaccct gatcaatttg ttcgagggcg ccgccgaatc gttcaacttt gccggagccc 3466501 attcgcttcg cgacgaggac gaaggcaagg acgaaaacca caacggtgcc aacatggccg 3466561 ccgtggtaca gacgatggag gccagctggg gcgacgtcgc gatgatgctg tcgctgccct 3466621 atcaggggct gggtaacgca gccatctccg cggtagccac cgacgagcag ctggagcggc 3466681 tgggcaaagt gtgggcagcg atggccatca ccgaaccgga attcggatcg gactcggcgg 3466741 cagtgtcgac gaccgccacc ctcgacggcg acgagtacgt gatcaacggc gagaagatct 3466801 ttgtcaccgc cggttcccgc gccacccaca tcgtggtctg ggccacgctg gacaaatcct 3466861 tgggccgccc ggcgattaag tcgttcatcg tgccccgtga gcatcccggc gtgaccgtcg 3466921 aacgacttga acacaaactc ggcatcaagg gttctgatac tgcggtgatc cggttcgaca 3466981 acgcccgtat ccccaagggc aacctacttg ggaacccgga aatcgaggtc ggcaagggct 3467041 ttgccggggt gatggagacc ttcgacaaca cccggccgat tgtggccgcc atggccgtcg 3467101 ggatcggccg tgccgcactg gaggaaatcc gtagtgtcct caccggggcc ggcgtggaga 3467161 tctcctacga caagccctca cacacccaga gcgccgcggc cgccgagttc ctgcggatgg 3467221 aggccgactg ggaggccagc tacctactgt ccctgcgcgc agcctggcag gccgacaaca 3467281 acatccccaa ctccaaagaa gcctcgatga gcaaggccaa ggcgggccgg atggccagcg 3467341 acgtcacctg caaaaccgtc gaattggcag gaactaccgg gtattccgag caatcactgc 3467401 tggagaagtg ggcccgcgac tccaagatcc tggacatctt cgagggcacc cagcagatcc 3467461 agcagctggt ggtcgcacgc cgactgttgg gcctgtcgtc gtccgagctc aaatagcctc 3467521 ggcgagcaga cgtcaaagcc cccgaatttc agtgaaatcg ggggcttttg cgtctgctgg 3467581 cgcccgtctg cacccccgcc agtaggctgg tcggcatgcg cgcggtacgg gtgactcggc 3467641 tggagggacc agatgcggtc gaggtggccg aggtcgagga acccacgagc gccggtgtgg 3467701 tcatcgaggt gcacgctgcc ggcgtggcct tcccggacgc actgctaacc cgtggccgtt 3467761 accagtaccg cccggagccg ccattcgtgc tcggcgccga gatcgccgga gtggttcgat 3467821 cggcgccgga taacagccaa gtgcgttccg gagacagggt tgtcggcctc acgatgctca 3467881 ccggcggcat ggccgaagtc gcggtattgt cgcccgagcg cgtgttcaag ctgccggaca 3467941 acatgacttt cgaggcgggc gcgggcgtgc tgttcaacga cctgacggtg tacttcgcgc 3468001 tggcggtccg gggccggctg caggccggtg agacggtgct ggtgcacggg gcggcaggcg 3468061 ggatcggcac atcgacgttg cgactagcgc cggcgctcgg ggcgtctcgc accgtcgcgg 3468121 tggtcagcac gcaggagaag gccgagcttg cgacagtggc cggggcgaca gatgtggtgt 3468181 tggccgaggg gttcaaggac gcggtacagg agctgacgaa cggccgtggt gtcgacatcg 3468241 tcgtagaccc ggtcggcggc gaccggttca ccgattcgct gcgctcgctt gctgcgggag 3468301 gacggctgtt ggtcatcggc ttcactggcg gcgagattcc caccgtgaag gtaaaccgcc 3468361 ttctgctcaa caacattgac gttgtcgggg taggctgggg cgcctggtcg ctgacccacc 3468421 ccgatgcgct ggcccagcag tggtcacaac tcgagcggct gctacgctcg ggcaagctgc 3468481 ctcctcccga accagtggtc tacccactgg accaagccgc tgcggcgatt gcatcgctgg 3468541 agaatcgcac cgccaagggg aaggtcgtac tacgcgtgcg cgactaacgc ccctcccggg 3468601 acgcgtcgcc gccgtgctct ggccaatttg ccgcttcctc actggtcgcc gttggcgtcg 3468661 gctacgtcct gccgcacaac tcgcagcttg cctggcgcca ggcacgcggc gtatccgtgg 3468721 tatttgccat acagttccca tgcggtgacg cgatcatcgg ggtgcacgtc gatctgatga 3468781 ccgtcggaga actcaagatg gagatctccg gtgtcatacc agacgaaagc tgtgcaggtt 3468841 gccccggcga aatcgaagag cggacgctcg tggtcggctg ggtcgtttgg gtcgatggcg 3468901 accacttctg cgggcgaggt ttcgatggcc ggcagagtca gctgtagtgg taccgagatg 3468961 accagctcgt tgtaatcgtc gaagttcagc accagaccgt cgcggaacat aatccgctga 3469021 accgcacagc cctctaacca ctgctcggtc atttcctgtt cggtcatata ttcactctgg 3469081 ccttgttgtg cccatatgtc acgtacacaa ccgccgaaat ctcgtgcggg attacaccct 3469141 aggcgtccga tggacaccag taccatctga caccgtgccc gactccagca ccgcattgcg 3469201 gatcctcgtc tacagcgaca acgtccagac ccgcgaacgg gtgatgcggg ccctgggcaa 3469261 acggttgcac ccggatctgc ccgatttgac ctacgtcgaa gtggctaccg gtccgatggt 3469321 gatacgccag atggatcggg ggggcatcga cttggccatc ctcgacggtg aggcgacacc 3469381 gaccggaggc atgggaatcg ccaaacagct caaagacgaa cttgccagtt gcccgcccat 3469441 cctggtgctc accggccgtc cggacgacac ctggctggcc agctggtcgc gggccgaggc 3469501 cgcagtgccg catcccgtcg accccatcgt gctgggccgc acggtgctct cactgttgcg 3469561 cgcacccgcc cactaaccgg acgcggccgg cattcgcggc gcgaacgttc agccgccccg 3469621 catttgaatc ttcgggtcct gtcttacccg aggtcgtaat tggcccgctg ccgcttccgg 3469681 ccgcaacgac ggcgctgtct cctccgccgc tgaagtctct gaagcctgct gaccttgcgc 3469741 ggtgcgtagt gtcgattccg gaattccaga acccgcggat tggcctaccc gcgttgtcga 3469801 cagcggagcg gccttggccg caactttcgg atccacagtt ggcagcaccc ccattgctgg 3469861 aacttcaagt tctggaactt ccacaacggc ttccggtggc gcggaagccg ccggctctgg 3469921 cgctcgagct gactcggtgg cagttcccgg ggcagagtta gtgccgccac gtgccatctg 3469981 acccagagcg gcgagcgcga gcggggcacc gatgaagccg gggctcacaa cgccggcatg 3470041 ggcactggta gcgctcgacg cgacgacgtc tccggcgcca acatcgccac cgccgaaatt 3470101 cccttggcct acactgccat gggccggatc accggccgct aacccggcgc tagccacgcc 3470161 gcctccgacg tagccgacgc ccccgccgct agcggttgca cctccggtgc cgacgctctc 3470221 accgccgccg gtgccgacgc tctcgccgcc agtagcgccg gtgccgacgc tctcaccgcc 3470281 agtagcgccg gtggcgccgg cacccaaacc cggaaatcgc tgcagcaaac tcgcccacgg 3470341 caccacttgc gcggcaatcg ccgatgcccc ggagtagtac cccgacatgg cggccacatc 3470401 cgcggcccac atctcctcgt acacaccctc ggcggcagca atcaacggcg cgttctgccc 3470461 gaacaaattc gtcatcacca gctgcacgaa tgcgtcgcgg ttggcggcca ccgccgccgg 3470521 aagcaccgtc gccgcctgcg ccgcctcgaa gatgctggcc actgcgcgcg cctgtcccgc 3470581 cgccccggcc gactgagccg ctgccgcggt caaccacccc gcgtagggag ccgccgctgc 3470641 cgccatcgcc aaggctgccg gaccctgcca cgcctgaccc gccagcccgg ccgtgaccga 3470701 cgcaaatgat tgcgccgcgg tccccaactc ttcggcaagc ccgtcccagg ctgccgccgc 3470761 cgccagcatc ggcgcagtgc ctgcaccgat gaacattcgc aaggaattga tctccggcgg 3470821 cagcacgacg aaactcacag ctcccgtcct tccgcttcgc tgctcgatgc cacgccgacc 3470881 tcaatacggc caacgattaa ccggcaaatg ccgagattaa caacaaatgc tgcgcttatc 3470941 agggggttag accaacattc atacaattcg ccgggacgcg caatccccag ttttgcttcg 3471001 cagcgaccga cgccggaccc agccacgggt tctgcttcga ctcgcacagg tatgcaccag 3471061 cctgaccccg ggaatgtggg gtggccgttg cgcgactatg ttgaagggca ctgtgacggc 3471121 ccgaagcccc ggttcgtcac ggcagcccgg tcaccgcccg gccgccgcgc tggcggcccc 3471181 gtacgacgga tcatggagcg agttgaacgt ctacataccc atcctggtac tggcggcgct 3471241 ggccgccgcc ttcgccgtgg tgtcggtggt gatcgcgagc ctggtcggcc cgtcgcggtt 3471301 caaccggtca aagcaggccg cctacgaatg cgggatcgag cccgctagca ctggagccag 3471361 aacctccatt ggccccggcg cggcgagcgg gcagcggttc cccatcaagt actacctgac 3471421 cgcgatgttg ttcatcgtct tcgacatcga aattgtgttc ctctacccgt gggcggtcag 3471481 ctacgactcg ctgggcacgt tcgcgctggt cgagatggcg atattcatgc tcacggtgtt 3471541 cgtggcctac gcgtatgtgt ggcgccgcgg gggcctgacg tgggattgag gtagggcgtg 3471601 ggcctggaag aacagctgcc cggcgggatc ctgctgtcga ccgtcgagaa ggtggcgggc 3471661 tatgtccgca aaaactccct gtggccggca acattcggat tggcgtgctg tgcgatcgag 3471721 atgatggcga ccgcgggacc aaggtttgac attgcgcggt tcgggatgga acggttctcg 3471781 gccacgccgc ggcaggcaga tctgatgatc gtggcgggcc gggtcagcca gaagatggcg 3471841 ccggtactgc gccagatcta tgaccagatg gcggagccga aatgggttct ggccatgggt 3471901 gtgtgcgcct cgtcaggtgg gatgttcaac aactatgcga tcgtgcaggg cgtggatcat 3471961 gttgttccgg tcgacatcta cctacccggc tgcccgccgc gcccggagat gctgctgcac 3472021 gcaatcctga agctgcacga aaagattcag cagatgccat taggtatcaa ccgggaacgc 3472081 gctatcgccg aggccgaaga ggcggcgttg ttggcccggc ccaccatcga gatgcgcgga 3472141 ctgctgcgat gagcccgccg aaccaagacg cccaggaagg ccgcccggac tcccccaccg 3472201 cggaggtggt cgacgttcgc cgcggcatgt tcggcgtctc gggcaccggt gacacctccg 3472261 gttacggacg gttggtgcgc caagtcgtcc tccctggcag cagcccccgg ccctacggcg 3472321 gctacttcga cgatatcgtc gaccggctgg ccgaggcact gcggcacgag cgcgtcgaat 3472381 tcgaggacgc cgtcgagaaa gtcgtggtct accgcgatga actgaccctg cacgtccgcc 3472441 gggatctact gccgcgggtc gcccagcggc tgcgcgacga acccgaattg cgattcgagc 3472501 tgtgtcttgg ggtgagcggg gtgcactacc cgcacgagac gggtcgggag ctgcatgccg 3472561 tctacccgct gcagtcgatc acccacaacc gtcgcctccg gttggaagtg tctgcgccgg 3472621 acagtgatcc gcacatccct tccctgttcg cgatctatcc gaccaacgac tggcacgagc 3472681 gggaaaccta cgacttcttc gggatcatct tcgacggcca tccggccctg acccggatcg 3472741 agatgcccga tgactggcag gggcatccgc aacgcaagga ctaccctctc ggcggcatcc 3472801 cggtcgaata caagggcgcg cagatacccc cgcccgacga gcggaggggc tacaactgat 3472861 gacggcaatc gccgactcgg ctggcggcgc cggcgagacc gtcctggtcg ctggcgggca 3472921 ggactggcag caggtcgtgg acgccgcgcg cagcgcggat cccggtgaac gcatcgtcgt 3472981 caacatgggg ccccagcacc cgtctaccca cggggtgttg cggttaatcc tggagatcga 3473041 gggcgaaaca gtcgtcgaag cccggtgcgg aatcggctac ctgcacaccg gaatcgagaa 3473101 gaacctcgaa taccggtact ggacccaggg cgtcaccttc gtgacccgaa tggattacct 3473161 gtcaccgttt ttcaacgaaa ccgcctactg cctcggcgtg gagaagctgc tcggcatcac 3473221 cgatgagata cccgagcggg tcaatgtcat ccgcgtgctg atgatggagc tcaaccggat 3473281 ctcgtcgcat ttggtcgcat tggcgaccgg gggcatggaa ttgggcgcca tgactccgat 3473341 gttcgtcggc ttccgggcac gcgagatcgt gctcacgctg ttcgaaaaga tcaccggttt 3473401 gcggatgaac agcgcctaca tccgacccgg cggcgtggcg caggacttac cgcccaacgc 3473461 ggccaccgaa atcgcggaag cactcaagca gttgcgccaa ccactgcgcg aaatgggcga 3473521 gctgctcaac gaaaacgcca tctggaaggc ccgcacccag ggcgtcggat acctggatct 3473581 gaccggatgc atggcactgg gcatcaccgg cccgatactg cgttccactg ggttgcccca 3473641 cgacctgcgg aaaagcgagc cctactgcgg ataccagcac tatgaattcg atgtgatcac 3473701 cgacgacagc tgtgatgcct acgggcgcta catgattcgc gtcaaagaga tgtgggagtc 3473761 gatgaagatc gtggagcagt gtctggacaa gttacgaccc ggcccgacca tgatctccga 3473821 tcgcaagctc gcctggccgg ccgacctgca ggtggggccc gacggcctgg gcaactcacc 3473881 caagcacatc gccaaaatca tgggctcctc gatggaagcg ctgatccacc acttcaaact 3473941 ggtcaccgag ggcatccggg tgccggcggg ccaggtctac gtcgcggtgg agtccccccg 3474001 tggtgagctc ggcgtacaca tggtcagcga cggtggcacc cgcccctacc gggtgcacta 3474061 ccgggatccc tccttcacca acctgcagtc cgtcgccgcg atgtgcgaag gcgggatggt 3474121 cgccgatttg atcgcggcgg tcgccagcat tgacccggtc atgggcgggg tggaccggtg 3474181 acacagccac ccggtcagcc ggtgttcatc cggctcggac cgccaccgga cgaacccaac 3474241 cagtttgtcg tcgagggcgc tccgcggtcg tatccgccgg acgtactggc gcggctggag 3474301 gtcgacgcca aggagatcat cggccgctat cccgacaggc gctcggcgct gttgccgttg 3474361 ctgcacctgg tgcagggcga ggattcctac ctgacgccgg cgggtttgcg gttctgcgcc 3474421 gatcaactcg ggctgaccgg ggccgaggtg tcggcggtgg ccagcttcta caccatgtac 3474481 cgccggcgcc ccaccggcga gtacctggtg ggtgtgtgca cgaacacgct gtgcgccgtc 3474541 atgggtggcg acgccatctt cgaccgcctc aaagagcatc tcggcgtcgg ccacgacgaa 3474601 accacctccg acggtgtggt caccttgcaa cacatcgaat gcaacgccgc ctgcgattac 3474661 gcaccggtgg tgatggtcaa ctgggaattc ttcgacaacc agacgccgga gtccgcgcgc 3474721 gaactcgtcg actcgctgcg ctccgacaca ccgaaggcgc ccacccgcgg cgcgccgctg 3474781 tgcggcttcc ggcaaacatc gcgcatcctg gcgggtctac ccgaccagcg tcccgacgaa 3474841 ggccagggcg gtcccggcgc gcccaccctg gccgggctgc aggtggcaag gaagaacgac 3474901 atgcaggcgc caccaacccc cggagcggac gaatgaccac gcaggccacc ccgttgaccc 3474961 cggtgatcag ccgccactgg gacgacccgg agtcgtggac cctggccact tatcaacgcc 3475021 acgatcgcta tcggggctat caggcgttgc agaaagccct gacgatgccg cccgacgacg 3475081 tgatcagcat cgtcaaggat tccgggttac gcggacgcgg cggcgcgggc tttgccaccg 3475141 ggaccaagtg gtcgttcatc ccgcagggcg acaccggcgc cgcggccaag ccgcactacc 3475201 tggtggtcaa cgccgacgag tccgaacccg gtacgtgcaa agacattccg ttgatgctgg 3475261 cgacgccaca tgtgctcatc gaaggcgtca tcatcgccgc ctacgcgatc cgcgcccatc 3475321 acgcgttcgt ctacgtacgc ggtgaggtgg tgccggtatt gcgccggctg cacaacgcgg 3475381 tggccgaggc ctatgccgcc ggcttcctag gccgcaacat cggaggttcc ggattcgatc 3475441 tggagctggt ggtacacgcc ggcgcgggcg cctacatctg cggcgaggag accgccctgc 3475501 tcgactcgct ggaaggccgg cgcggccagc cgcggctgcg gccccccttc cccgcggtgg 3475561 ccggtctgta tggctgcccg accgtgatca acaacgtcga aacgatcgcc agtgtcccat 3475621 cgatcatcct gggcggcatc gactggttcc ggtcgatggg cagcgagaaa tcgcctggct 3475681 tcaccctgta ttcgctgtcc ggccacgtca cccgccccgg ccagtacgag gcgccgctgg 3475741 gcattacgct gcgcgagttg ctcgactacg caggcggggt gcgcgccggg caccggctga 3475801 agttctggac accgggcggc tcgtcgaccc cgctgctcac cgacgagcat ctggatgtgc 3475861 cgctggacta cgagggtgtg ggtgcggccg gctcgatgct ggggaccaag gcgctggaga 3475921 tcttcgacga gaccacctgc gtggtgcgcg cggtgcgccg ctggaccgag ttctacaagc 3475981 acgaatcgtg tgggaaatgc acgccgtgcc gggagggcac cttctggctg gataagatct 3476041 acgagcggct ggaaaccggc cggggtagcc atgaagacat tgacaaactg ttggacattt 3476101 ccgattccat cttgggaaag tcgttctgcg cgttgggcga cggtgccgcg agtccggtga 3476161 tgtcgtcgat caagcacttc cgcgacgagt acctggccca cgtcgaagga ggcggttgcc 3476221 cattcgaccc ccgagactcc atgctcgtcg cgaacggagt ggacgcgtga cccaggcggc 3476281 cgacactgac atccgggtag gccaaccgga gatggtgaca ctgaccatcg acggcgtcga 3476341 aatcagcgtc cccaagggca cgttggtgat tcgcgccgcc gaactgatgg gaatccagat 3476401 cccgcgattc tgcgaccacc cgctgctgga gcccgtcggc gcctgccggc aatgcctggt 3476461 cgaggtcgaa gggcaacgca agccgctggc gtcgtgcacc accgtggcca ccgacgacat 3476521 ggtggtgcgc acccaactca cctccgagat tgccgacaag gcccagcacg gtgtgatgga 3476581 actgctgctg atcaaccatc cgctggattg cccgatgtgc gacaagggcg gtgaatgccc 3476641 gctgcaaaac caggcaatgt ctaacggccg cacggattct cgcttcaccg aggccaaacg 3476701 taccttcgcc aaaccgatca acatctccgc gcaggtgctg ctggaccgcg aacgttgcat 3476761 cctgtgcgcc cgctgcaccc ggttctccga ccagatcgcc ggcgatccgt tcatcgatat 3476821 gcaggagcgc ggcgccctgc agcaggtcgg tatctacgcc gatgaaccgt tcgagtcgta 3476881 cttctccggc aacacggtgc agatctgccc ggtgggggcg ctaacgggga ccgcctaccg 3476941 gttccgcgcg cgtccgttcg atttggtctc cagccccagc gtctgcgagc actgcgcgtc 3477001 gggctgcgcg caacgcaccg accatcgccg cggcaaggtg ctgcggcggc tggccggtga 3477061 cgacccggaa gtcaacgagg agtggaattg cgacaagggc cggtgggcct tcacgtacgc 3477121 gacccagccg gacgtgatca ccactcccct gatccgcgac ggtggggacc ccaagggcgc 3477181 gctggtgccc acctcgtggt cgcacgcaat ggcggtggcc gcccagggac tggcggcagc 3477241 gcggggccgc accggggtgc tggtcggcgg ccgagtgacc tgggaggacg cctacgcgta 3477301 cgccaagttc gcgcggatca cgttgggcac caacgacatc gacttccgcg cccggccgca 3477361 ctcggccgag gaggccgact tcctggcggc ccgcatcgcc gggcggcata tggcggtcag 3477421 ctatgccgat ttggaatcgg ctccggtggt gctgctggtg ggattcgagc ccgaagacga 3477481 gtcgccgatc gtgtttctgc ggttacgcaa ggccgctcgc agacaccgcg tcccggtgta 3477541 cacgatcgcc ccctttgcca ctggtggcct gcacaaaatg tcgggccggc tgatcaaaac 3477601 cgttcctggt ggcgaacccg cggcgctgga cgatctggcc accggtgcag tgggcgacct 3477661 gctggccacc ccgggcgcgg tcatcatggt cggggagcgc ttggccacgg taccgggcgg 3477721 attgtcggcg gccgctcggc tggccgatac gaccggcgcc cgtttggcgt gggtgccgcg 3477781 gcgggcgggg gaacgcggag cgctggaagc cggagcgttg cccacgctgt tacccggtgg 3477841 ccgcccgctg gccgacgagg tcgcccgcgc gcaggtgtgt gcggcgtggc atatcgccga 3477901 attgcctgcc gcggctggac gggacgccga cggcatcctg gccgccgctg ccgacgagac 3477961 gttggctgcg ctgctggtcg ggggtatcga acccgcggac ttcgccgacc cggacgccgt 3478021 gctggccgcg ttggacgcca ccggtttcgt ggtcagcctg gagctgcgac acagtgcggt 3478081 caccgaacgc gccgacgtgg tgttcccggt cgcgccgacg acccagaaag ccggcgcgtt 3478141 cgtcaactgg gagggtcgct accgtacatt cgaacccgcg ctgcgcggca gcacactgca 3478201 agctggccag tcggatcacc gggtgctgga cgcgttggcc gacgacatgg gtgtccatct 3478261 gggcgtgccc accgtggagg cggcccgcga ggagctggcc gcgctcggta tctgggacgg 3478321 caaacacgct gccggtcccc acatcgcggc caccgggccg acccaacccg aagctggtga 3478381 ggcgatcttg accgggtggc ggatgctcct cgacgagggc cgcctgcagg acggcgaacc 3478441 atatctggcc ggtaccgcgc gcacacccgt ggtacggctg tcgccggata cggcagccga 3478501 gatcggcgcc gccgatggcg aggcggtcac ggtcagcacg tcacgcggct caatcacctt 3478561 gccgtgcagt gtcaccgaca tgcccgaccg cgtcgtgtgg cttccgctga actcggcggg 3478621 ctcgacggtg caccgacagc tgagggtgac aatcggcagc atcgtgaaaa tcggagcggg 3478681 ctcatgagcg tctccccttg ccgcgagcgc gcgtgttccc ccgcaagcgg gaggtgcccc 3478741 cagtacgccg acacaccgat tttgatgtac cagtgcggac cctcgcgcaa ggagtggcgg 3478801 ccatgaccac gttcggccac gacacctggt ggctggtggc ggccaaagcg atcgcggtat 3478861 tcgtgttcct catgctgacg gtgctggtgg cgatcctggc cgaacgcaag ctgctgggcc 3478921 ggatgcagtt gcggcccggc cccaaccggg ttggcccaaa aggagccctg cagagcctgg 3478981 ctgacggcat caagctggcg ctcaaagaga gcatcacacc cggtggcatc gatcgattcg 3479041 tatattttgt ggcgccgatc atttcggtga ttccggcatt caccgctttc gcgttcatcc 3479101 cgtttggtcc cgaggtgtcg gtgtttggcc accggacacc gttgcagata accgaccttc 3479161 ccgtcgccgt gctgttcatc ctgggactgt cggcgatcgg ggtatacggc atcgtgctgg 3479221 gcggttgggc gtccgggtcc acctacccgc tgctgggcgg ggtgcgctcc accgcgcagg 3479281 tcatctccta cgaggtcgcg atgggcctgt cgttcgcgac ggtgttcctt atggccggca 3479341 ccatgtcgac gtcgcagatc gtggccgcac aagacggtgt ctggtatgcc ttcctgttgt 3479401 tgccgtcatt cgtcatctat ctcatttcta tggtgggtga aaccaaccgg gcgccgttcg 3479461 atttgcccga agccgagggc gagctggtcg cgggattcca caccgagtac tcgtcgttga 3479521 agttcgcgat gttcatgctc gccgagtacg tcaatatgac tacggtttcg gcactggccg 3479581 cgaccctatt cttcggtggc tggcatgctc cctggccgct gaacatgtgg gcgagcgcca 3479641 acaccggctg gtggccactg atctggttca ccgctaaagt gtggggcttt ctgttcatct 3479701 atttctggct gcgggctacg ctgccgcggc tgcgctacga ccagttcatg gcgctgggct 3479761 ggaagttatt gatccccgtc tcgctggtgt gggtgatggt cgccgcgatc atccgctcac 3479821 tacgcaacca gggctaccag tactggaccc cgactctggt gtttagcagc attgtcgttg 3479881 ccgctgccat ggtgctgttg ttgcgaaagc cgttgagcgc tcccggcgct cgcgcatcgg 3479941 cacggcaacg cggggacgaa ggcaccagcc ctgaaccggc atttccgaca ccaccgctgc 3480001 tagccggtgc aaccaaggag aatgcaggtg gctaacactg atcgtccggc tctcccccac 3480061 aagcgggcgg tacccccatc tcgggctgac tccggcccgc gtcgtcgccg gactaagtta 3480121 ctggacgccg tagccggatt cggggtaacg cttggttcga tgttcaaaaa gacggtcacc 3480181 gaggagtatc cggaaaggcc cggtccggta gcagcgcgct accacggccg tcatcagctc 3480241 aaccggtatc cggacggcct ggagaaatgc atcggctgcg agttgtgcgc ctgggcctgc 3480301 ccggccgacg caatctatgt cgagggcgcg gacaataccg aagaggagcg gttttcgccg 3480361 ggcgaacgct acggccgggt gtaccagatt aactatttgc gttgcatcgg ttgcggtttg 3480421 tgcatcgagg cgtgcccgac gcgggcgctg acgatgacct atgattacga actggccgac 3480481 gacaaccgcg ccgacctgat ctacgagaag gaccggctgc tggccccgct gctgcccgag 3480541 atggccgcgc cgccgcatcc gcgggcgccc ggtgccaccg ataaggacta ctacctaggc 3480601 aatgtgaccg ccgagggctt gcggggcgtg cgtgagagcc agaccaccgg agattcccga 3480661 tgaccgcggt gctggcttca gatgtcatcg tccgcacctc caccggggaa gcggtgatgt 3480721 tctgggtgct cagtgcgttg gcgctgctgg gcgcggtcgg ggttgtgctg gccgtcaacg 3480781 ccgtgtactc agcgatgttt ctggcgatga ccatgatcat cctggcggtg ttctacatgg 3480841 cccaggacgc gctgtttttg ggtgtcgtcc aggtggttgt ctacaccggc gcggtgatga 3480901 tgctgttcct gttcgtgctg atgctgatcg gtgtggactc cgcggaatca ctgaaggaga 3480961 cgctgcgcgg gcagcgggtc gccgcggtgc tgaccggtgt cgggttcggc gttctcctga 3481021 tcagcaccat cggccaggtg gcgacccgag gttttgccgg actaaccgtc gccaacgcca 3481081 acggcaacgt cgaaggcttg gccgcgctga ttttttcccg ttacctgtgg gcgttcgagt 3481141 tgaccagtgc gctgttgatt accgccgccg tcggggcgat ggtgctagcg caccgggagc 3481201 gtttcgagcg ccgcaagacc cagcgcgaac tctcccagga acgcttccgt cccggcgggc 3481261 accccacccc gctgcccaac ccgggtgtct acgcgcgcca caacgcggtc gacgttgccg 3481321 ccctgctccc cgacggttcc tattccgaat tgtcggtccc ccggatgctg cgcacccgcg 3481381 gggccgacgg cctgcaaaca ccctcgcccg gagccgtctc cggctcttta gaaggcggtg 3481441 catcatgaat ccggccaact acctttatct ttcggtgctg ctattcacca tcggagcctc 3481501 cggtgtgctg ctgcgacgca acgcgatcgt gatgttcatg tgcgtcgagc tcatgctcaa 3481561 tgccgttaac ctggcgttcg tcaccttcgc gcgcatgcat ggccatctcg acgcccagat 3481621 gatcgcgttc ttcaccatgg tggtggccgc ctgcgaagtg gtcgtcggcc tggccatcat 3481681 catgacgatt ttccgtaccc gcaaatcggc gtcggtcgac gacgcgaatc tactcaaagg 3481741 ctgacgacgc caccgtgaca acttccttgg ggactcacta cacctggctg ctggtggcac 3481801 tgccactggc gggtgccgca atcttgctgt tcggcggcag acgcaccgat gcgtggggcc 3481861 acctgctggg ctgtgccgca gcgctggcgg cattcggggt gggcgcgatg ctgctggccg 3481921 acatgctcgg tcgcgatggg ctcgagcgcg cgatccatca gcaggtgttc acctggatac 3481981 ccgccggcgg actccaagtc gacttcgggc tgcagatcga tcagttgtcc atgtgcttcg 3482041 tgctgctgat ctccggggtc ggatcgctga ttcacatcta ttcggtcggc tacatggccg 3482101 aggacccgga ccggcgcagg tttttcggct atctcaacct gtttctggcc tcgatgctgc 3482161 tgctggtggt cgccgacaac tatgtgttgc tgtacgtcgg ctgggagggt gtgggcctgg 3482221 cgtcgtatct gttgatcggt ttctggtacc acaagccgtc ggcggccacc gcggccaaaa 3482281 aggcattcgt gatgaaccgg gttggggacg ccggcctagc ggtgggtatg ttcttgacgt 3482341 ttagcacttt cggcaccctg tcgtatgccg gcgtgttcgc cggcgtaccc gccgcaagtc 3482401 gcgcagtgct gaccgcgatc gggttgttga tgctgttggg ggcgtgcgcc aagtccgcgc 3482461 aggttccgct gcaagcctgg cttggcgacg cgatggaggg ccccaccccg gtgtccgcgc 3482521 tgatccacgc cgccaccatg gtgaccgccg gagtgtattt gattgtgcgg tcgggcccgc 3482581 tgtacaacct ggcgcccacc gcccaactgg cggtcgtcat cgtcggcgcg gtgacgctgc 3482641 tgtatggggc gatcatcggc tgcgccaagg acgacatcaa acgtgcgctg gcagcctcga 3482701 ccattagcca gatcggctac atggtgctgg ccgcgggcct gggtccggcc ggctacgcgt 3482761 ttgcgatcat gcatctgctc actcacggtt tcttcaaggc cggcctattc cttgggtccg 3482821 gcgcggtgat tcacgcgatg cacgaagagc aggacatgcg ccgttacggt ggtctgcgcg 3482881 ccgccctgcc ggtcacgttc gcaaccttcg gcctggcgta tctggcgatt atcggggtac 3482941 cgccgttcgc gggcttcttc tccaaggatg cgatcatcga ggccgcattg ggcgccggcg 3483001 gcatccgggg ctcgctgctg ggcggtgccg cgctgctggg tgcgggcgtc accgcgttct 3483061 acatgacgcg agtgatgctg atgaccttct tcggcgaaaa gcgttggacg ccaggcgccc 3483121 atccgcacga ggcaccggcc gtgatgacct ggccgatgat cttgctcgcc gtcggctcgg 3483181 tgttctccgg tggcctgctc gcggtgggtg gcacgttgcg gcattggctg cagccagttg 3483241 tcggatctca tgaagaggcc acccatgcgc tgccgacctg ggtcgccacc accctggcgc 3483301 tcggtgtggt cgccgtcggt atcgcggtgg cctaccggat gtacggcacc gcgccgatcc 3483361 cgagggttgc cccggttcgg gtgtcggcgc tgaccgcggc cgcacgtgcg gacctgtacg 3483421 gcgatgcctt caacgaggag gtgttcatgc gccctggtgc gcaattgacc aacgcggtgg 3483481 tcgcggtgga cgacgcgggt gtggacggct cggttaacgc gctggcgacg ctcgtgagcc 3483541 agacttcgaa tcgcctgcgg caaatgcaaa ccggcttcgc ccgtaactac gcgttatcga 3483601 tgctggtagg agcggtgtta gtggcggcgg cgctgctggt ggtgcagctg tggtgaataa 3483661 cgtgccgtgg ctgagcgtgc tctggctggt gccgctggca ggtgcggtgc tgatcatcct 3483721 gctaccaccc ggtcggcgcc gactcgccaa gtgggccggt atggttgtca gcgtcctgac 3483781 gttggcggtg tcgatcgtcg tcgcggccga attcaagccc agcgccgagc cgtatcagtt 3483841 cgtcgaaaag cattcctgga taccggcgtt cggcgccggc tatacccttg gtgtggacgg 3483901 catcgcagtg gtgctggtgt tgttgaccac agtgctgatt ccgttgctgc tggtggccgg 3483961 ctggaacgac gcaaccgatg ctgacgacct gtcccccgca agcgggaggt acccccagcg 3484021 cccggctccg ccgcgcttgc gatcgtcagg tggcgaacgc acccgaggcg tgcacgccta 3484081 cgtggcattg acgctggcca tcgagtcgat ggtgctgatg tcggtgatcg cgctggacgt 3484141 gctgctgttc tacgtgttct tcgaggccat gctgatcccg atgtacttcc tcatcggcgg 3484201 cttcggccag ggggccggac gctcgcgtgc cgcggtgaag ttcttgctgt acaacctgtt 3484261 tggcgggttg atcatgctgg cggcggtgat cgggctgtat gtggtgaccg cacagtacga 3484321 ttcgggcacc ttcgacttcc gtgagatcgt ggccggcgtg gcggcgggcc gctacggagc 3484381 ggacccggcg gtgttcaagg cgctgttctt gggcttcatg ttcgcgttcg cgatcaaggc 3484441 tccgctgtgg ccgttccatc gctggctgcc ggacgccgcc gtcgagtcca ccccagcgac 3484501 cgcggtgctg atgatggcgg tgatggacaa ggtcggcacc ttcggcatgc tgcgctactg 3484561 cctgcagctg tttcctgacc cgtcaacgta tttccgtccg ctgatcgtga cgctggccat 3484621 catcggggtg atctacggcg cgatcgtggc gatcggccaa accgacatga tgcggctgat 3484681 cgcctacacc tcgatctcgc acttcgggtt catcatcgca ggcatcttcg tcatgaccac 3484741 ccagggccag agcgggtcga cgctgtacat gctcaaccac ggcctgtcca cggcggcggt 3484801 gttcctgatc gccggtttct tgatagcgcg gcgcgacagc cgatcgatcg ccgactacgg 3484861 cggtgtccag aaggtggcgc ccatcctggc cggcacgttc atggtctcgg ccatggccac 3484921 cgtatcgctg cccggcctag ccccgtttat cagcgaattc ctggttctgc tgggcacttt 3484981 cagccgctac tggctggcgg cggcgttcgg cgttaccgca ctggtcctct cggccgttta 3485041 catgctgtgg ctctaccagc gggtgatgac cggtccgata gccgaaggca acgaacgcat 3485101 aggggatctg gtgggccgcg agatgatcgt ggtggcaccg ttgatcgcgc tgttactcgt 3485161 gcttggggtc taccccaaac ctgtgctcga catcatcaat ccggcggtcg agaacaccat 3485221 gaccaccatc ggccagcatg atcccgcgcc cagcgtggca cacccggttc cggccgtggg 3485281 cgcctcccgg acagccgaag gaccgcaccc atgatcctgc ccgccccgca cgtcgagtac 3485341 ttcctgctcg ctccgatgct catcgtcttt tcggttgcgg tcgccggtgt gctggccgag 3485401 gctttcctgc cgcgccggtg gcgctatggc gcccaagtga cgctcgccct tggcgggtcg 3485461 gcagtggcac tcatcgcggt catcgtggtg gccaggtcga ttcacgggtc gggtcacgcc 3485521 gcggtgctgg gggccatagc cgtggatcga gcgaccctgt ttctgcaagg caccgtacta 3485581 ctggtcacga tcatggcagt cgtcttcatg gccgaacgca gcgcccgggt gagtccgcaa 3485641 cgccagaaca ccctcgctgt ggcgcggctc cctggactcg attcgtttac cccgcaggct 3485701 tccgccgtgc ccggcagcga tgctgagcgc caagcggaac gggcgggagc cacccagacg 3485761 gaacttttcc cgctggcgat gctgtccgtc ggcggcatga tggtgtttcc cgcgtccaac 3485821 gacctgttga cgatgttcgt tgcgctggag gtgctatcgc tgccgctgta cctgatgtgt 3485881 gggctggccc ggaatcgccg cctgctgtcg caggaagccg cgatgaagta cttcctgctg 3485941 ggcgccttct cgtcggcgtt cttcctctac ggcgtcgcgt tgctatacgg cgcgaccggc 3486001 acgctgacct tgccgggtat tcgggatgcg ttggcagcgc gcaccgacga ctcaatggcg 3486061 ttggccggcg tcgcgctgct cgcggtcggc ctactattca aggtcggcgc ggtgccattc 3486121 cactcctgga ttcccgatgt gtaccagggc gcacccaccc cgatcaccgg gttcatggcg 3486181 gccgccacca aggtcgcggc gttcggtgcg ctgctccggg tggtctatgt cgcgctgccg 3486241 ccgctgcacg atcagtggcg cccggtgctg tgggcgattg ccatcctcac catgacggtg 3486301 ggcaccgtca ccgcggtaaa ccagaccaac gtcaagcgta tgctggccta ttcatcggtc 3486361 gcgcacgtcg gtttcatact taccggcgtg atcgccgata atccggcggg tctttccgcg 3486421 acgttgttct atctggtcgc ctacagcttc agcacgatgg gtgcgtttgc catcgtgggt 3486481 ctggtccgag gcgccgacgg ctcagctggt tcagaggatg ccgacctgtc ccactgggcc 3486541 gggctgggac agcgttcacc tatcgtgggc gtgatgctgt cgatgtttct gctggccttc 3486601 gccggcatcc cgttgaccag tggattcgtc agcaagttcg cggtgtttag ggccgccgct 3486661 tccgccggcg cggtgccgct ggtaatcgtc ggcgtgatct ccagcggcgt cgccgcctac 3486721 ttctacgtgc gggtgatcgt gagcatgttc ttcaccgaag aatccggtga cacaccacac 3486781 gtggcggcac ccggcgtgct gagcaaggcc gccattgcgg tatgcacggt agtcaccgtg 3486841 gtgctgggga tcgccccgca gccggtgctc gacctggccg accaggccgc ccagttgctg 3486901 cgctgaatcc gttagggctg accgaagaag cccgactggt cactgccctg attgaagccc 3486961 cccgagctgt ggtcacccgt gttcgccaca cccgtgttga gggtgcccga gttcgcaatg 3487021 cctgtggtct gcaggccaga gtttgcgatg cccacggtgc cggcacccga gttatagaag 3487081 ccgacgttga agccgccgga gttggtgtta ttgatgcccg actgaacgtc accgttgttc 3487141 ccatagccag ccgaaacatt gcccgtgtta aagaagcctg aggaattcat gccggtgttg 3487201 ccgaagcccg agctcgaaac ggattggtcg accgagcttc caaacccggt gttccggtcg 3487261 cccgagtcga aaccgcccgt attgatgctg cccgagttcg cgaatcccgt attgatactg 3487321 cccgcgtttg cgaagcccac gtttagggtg cccgcgttgc caaagcccac actttggttg 3487381 cccgcattgc caacgcccac gttaaaggaa ccgccgttcc cgacgcccat gtcttcgttg 3487441 cccgcgttcc cgatgcccat attgaagaag ccggcgtttc cgaagcccgt gttggtgtcg 3487501 ccggcgtttc cgaagcccgt gttgatgtcg cccgcgtttc caaagccgaa gttgttgttg 3487561 cccgaattga agaagcccac gttgttgttg ccagagttga agaaaccgat gttgttgtta 3487621 cccgagttcc cgaaacctag attcccgatg cccgagttca gcgcgccaat gcccaccaag 3487681 ttgtcgccgg tgagcccaaa accgatgttg ttgttgccat tgttcccgag gccgaggtta 3487741 ttgtcgccgt tgtttccgaa accgatgttg gaggagccga tgtttccact gcccaagttg 3487801 aaggaaccga gatttccgcc gccgaagttg gtacttccgg tgtttccact gcccaggttc 3487861 ccactgccaa agtttccgtt gccgaggttt ccaaagcctc ggtttccgct gcccagattg 3487921 acattgccaa cgtttccgct gccgagattg gtgttgccga tatttccgct gcccaaattc 3487981 gtggcaccgt catttccgct gcccacattg gcgttgccgg agtttccgct acctacgttg 3488041 gcgttgccgg aatttccgct gcccagattg tattcaccgg tgttcccgcc gcccaggttc 3488101 ccgacgccga tgttgccgag gccgatcgcg gcggccagcg ccgatggcgc agctggcaac 3488161 gcctgctgca gaccaattga ccacgacgac agctgcgccg cggccgccga tgccccgccg 3488221 tgatagccca ccatcgcggc cacatcggcg gcccacatct gttcatacat cgcctcagcg 3488281 gccgcgatcg ccggcgcatt ctgcccaaac agattcgaca acaccaactg cacaaacgca 3488341 ttacggttgg ccgccaccag catcggatgc accgtcgccg cccgcgccgc ctcaaacgca 3488401 ctggccaccg ccttggcctg agccgacgcg ccagcggccc gcgccgccgc agcagccaac 3488461 caccccgcat acggcgccgc cgccgcggcc atcgccgccg ccgcaccctg ccaggactga 3488521 cccgccaacc ccgaagtcac cgacccaaac gaggacgccg ccaccgccaa ctccgcggcc 3488581 aaacgatccc aagccaccga tgccgcaagc atcggcgcag accccgcacc ggtaaacatc 3488641 cgcaacgaat taatctccgg cggcaacacc gaataattca tcagcccagc cccttcccca 3488701 gcgcgcgacg ccgatgacac aggcgttcgc gcacgctact ccacccgtaa gaacaaactg 3488761 tagggaaatc acggcaccaa tatcggcgat ttgtcaaaac acttgtacat tgcgaaaaat 3488821 tcgggcccac cgatcgccac cctggtcacc gcgactcctg ccacattgcc gccgcggtac 3488881 ctcatcgtgc cggctaatcg cccccagcaa cgtcgggctc ggtaggcgtt acggctggcc 3488941 gaagaacccg gcctggttat cgccctggtt gaaggcaccc gaggagttgt cacctgagtt 3489001 ggcgacaccg gagctggcat taccggagtt tgcgatgccg gtgctggcca cgcccgagtt 3489061 gtagatgccg ttgttaaaca tgcccgtgtt gaagaagccc tcgttggagt tgcccgagtt 3489121 aaagaagccg acattcgcgt cgccaatatt ctggaagcct gaggtggagt cgaagacgtt 3489181 cgagttcatg ttcccaaagc ccgaggtaag gttacccgaa ttattgaagc ccgagacgct 3489241 ggtgccggtg ttaccgaagc ccgagaccga accaggctga tcgaccgtgc tgccgatgcc 3489301 ggtgttgacg tcacccgagt tgaataggcc tgtgttgaga tcgccggcat ttccgaagcc 3489361 ggtgttcaga ttgcccgagt tgaacgagcc agtgtttcca ccgatcccgc tgaccacacc 3489421 gcccgagttg aaatcgccgg tgttggccaa gcccgcgttt ccggagccca cgttgacacc 3489481 acctgcgttt ccatggccca tgttctggaa gcccgcgtta ccaaaaccga agtttgtgtt 3489541 accaccgttc ccaaaaccgg tgttgaaggg tcccccgttc cagaagccgg tgttgacatc 3489601 gcccgcgttg ccgaagccgg tgttgccgtc cccggagtta aagaagccca cgttgccatt 3489661 gccggagttg aagaagccaa tgttgttgtt cccagagttc ccgaaaccca tgttcccgat 3489721 tcccgagttc aacgcaccaa tgcccaccag attgtcaccg gtgagcccaa aaccgatgtt 3489781 gttattgccg ttgttcccaa agcccaggtt gttgttaccg aggttgccga acccgatatt 3489841 actcatcccc atgttgccgc tacccacgtt gaacgaacca aagttcccgc taccgaggtt 3489901 tgtactaccg acgttgccgc tacccaggtt tccgtcaccg aaggcgtttc cgaaaccgaa 3489961 gtttccgttg ccgaggaggt tcccgctgcc aaagttgaga ttgccttggt tgccgttgcc 3490021 gaagttggtg ttaccaatgt tgccgctccc caggttggca aagccccagt ttcctccgcc 3490081 caggttggca ttgccggtgt tgccgccacc caggttcagg ctgccaaagt tcccgctgcc 3490141 caagttcaac aaaccgacgt ttccgccacc caggttccag ctgccaatat tccccagacc 3490201 cgtgttcagc gccggcgggg tgacagccgc tgtcgcagcc gctggggcgg caaacatgct 3490261 agccgcaccg ccggaaatcg cgctggccac ctgacccaac cccggcaggc cccgcaatgc 3490321 ctgctgccac gatggcaacg ccgccgcggc cgccgatgcc ccgccgtgat agcccaccat 3490381 cgcggccaca tcggcggccc acatctgttc atacatcgcc tcagcggccg cgatcgccgg 3490441 cgcattctgc ccaaacagat tcgacaacac caactgcaca aacgcattac ggttggccgc 3490501 caccagcatc ggatgcaccg tcgccgcccg cgccgcctca aacgcactgg ccaccgcctt 3490561 ggcctgagcc gacgcgccag cggcccgcgc cgccgcagca gccaaccacc ccgcatacgg 3490621 cgccgccgcc gcggccatcg ccgccgccgc accctgccag gactgacccg ccaaccccga 3490681 agtcaccgac ccaaacgagg acgccgccac cgccaactcc gcggccaaac catcccaagc 3490741 caccgatgcc gcaagcatcg gcgcagaccc cgcaccggta aacatccgca acgaattaat 3490801 ctccggcggc aacaccgaat aattcatcag cccagcccct tcccctacag gacgtcccgg 3490861 ccaatgactc aggcaacggt gcacgtctct gtactcgtag aacaaactgt aggaaaacgg 3490921 cgcgacgaat aacggcgatt tcgtgaaaat tctggttccc gtcagaagca cgccaccctc 3490981 ggccacctcg tttgcgcacg cctagagccc gcggtcgggg ggtgcggtct ggatctccaa 3491041 agcatctgct gctgcccgga tctcggctag ccgatcaggg tccgacaaca gcgccgtcat 3491101 cgcgaactcg acgatctcgt ccgggctgac cgcgaattcc ggcagcgcca gagcgtcaaa 3491161 caacgcctgc accagcctgg cagccgataa cggatgcatt gcgcgcacat caccttcgcc 3491221 ttggccggtc tcgatcaggc caaccagcgc gcgttccatc tccgcgacta gctcccgctc 3491281 cgcgacgaag gattcctgat gcaggtccgg ggtgatgagg atggaaacca gcacataggg 3491341 cgaagcatgc aggtggtcca gggattccgt cagccagcgg tgcagcttga ccaccgccgg 3491401 aaccggcatc gcggtgatgt gaccgaacag ctcaagcggc cactccacgg cgagccgcac 3491461 cagggccgca aggatatcgc gtttggccga gaagtgtttg tagatggccg gctgctccac 3491521 cccgacggct gcggcaatgt ctcgcgtcga ggtggagctg taaccccgca gcgcgatgag 3491581 ctcggcagcg gcccccagga tgcggagtgc cgttgggctc cagcggccgg cctgcctcgg 3491641 catgccggca aggctagctg gcacctgggt ggtcgccaac cagcgccatg gcgaggttcc 3491701 ggtagaacgc gagcatgccg ggccattctt tcgagctaag gtgaccccgt tcggcgaatc 3491761 gcgagcccgc cccaacctgc acggcctcca gtccgagccg gtcctcgtca ttgatcatcg 3491821 ccatcacgaa ctgcgacgtc tgagctgttg ctgccgcatc ggcggctaac tcgggggtgg 3491881 tgagcacgcc gccgagcacc tgcacccggt cgatgctttg cggaataaag ccgaaccaca 3491941 ccacccgctc gcccgctatg gccagcgcgc tgttcggaaa cgtccacaac acgaccagat 3492001 tacttttctg aacctcgttg agctgcaacg acttcgcttc tactggaacg gtgaagggaa 3492061 ccctgaggcg caacgcccac cgcgaatact gccgaacgtc cagatcgccc ccaccaggaa 3492121 cgaacggctc cagggtttgg cgatgcaggc cgagcacgtg gtagttctca tgaccatttt 3492181 ccgccgccac cttccaatta gctcgccact catgcgacca cgactcgacc tgcaccatct 3492241 caccgagccg atagccggcg aattcgtcgt cagtcaggtc cagatgcgcc gcgattggtt 3492301 cggcatcggc atccaggttg atccacacca atccattcca ggtggccacg gcgaactgcg 3492361 gaagccggca ctccctacgg ttgaagtcta agttggcggc catatggggc gctccgcgca 3492421 accggccatc cagcccatag cgccacaggt ggtattggca ggtcaacgtg tcgatgcgcc 3492481 ccgcaccggg ttccaccatc agcatcaacc ggtgccggca gatcggcgaa agagcgtgca 3492541 gctgcccgtc gacgtcccgc accaccatga ccggctcccc tgcgacggac acggtgacgt 3492601 agtcaccggt cttggcgagt tggtcgacat gcgcgacaag catccaggac cggttgaaga 3492661 tccgttcccg ctccagctgc cacagctccg atgaggtgta ggcggccggc ggcaggctta 3492721 gcgccggtgg attgtcgtcg aggtaatccc cgatgtcggt aaggatgtct ccgagctcgg 3492781 ctcggttatc agttgataac ataccctcca tgttatcgac tgataaccga ttgtcaacag 3492841 cgcgcaccgg cccgaccggc cagccggcgg ttcacctcga gaacggacgg gtggccagca 3492901 cgtaggtagc caacacggcc aacggtgccg ccaacggcag ccatggcact tgcagcggga 3492961 acgacgtcgc agccaaccca gcgaacgtga aaccaacggc ggcaacggtc gtcggccagc 3493021 tcccggcgac aacaccggcc ccgtatcggc acaccaggta gacggcggcg caaaccccga 3493081 caatgcggca agcacatgcg tcgggccgga caccacgatc atcaccaccg acaacaccac 3493141 ggcaagcgtt gccgccaggc gaaacaccgc cgccacccct accgcaatca ccgcggcaag 3493201 ccccacgaca acagccagcc cgtgcgatcc cacagcggcc gaccccacca tcatcagtcc 3493261 gaacaccgtg gagagcccac gagtacccgg gtgcgcaaac gaggtcatgg cagcctcgcc 3493321 cggctagctc tgccccgtcc gcgacgacgg cgattgggca acgcacccat cgactgctga 3493381 agcgagtgat ccgccggcca ggacagcacg tcgaccccga tggtggccat gtcgcgatac 3493441 atcgcggagc gctgcagcgc ccacatccgg accaccaggg gatccagttg gtcctggagc 3493501 ggacagctat caagaacgtc gacagcaacc acgacgtggc cgcgtttacg cagttcgatc 3493561 aacgccagcg cgaactcggt atccagcagc gtggaaaacg caatgacaac cgctcctgcg 3493621 ggaacagctg cgcgcggagc cagcgtcccg gtggtgtttt cgaacccttc cccggcgccg 3493681 agcacggtgt cgagcacccg atagaactgg cgctgcccga tgtcggcgcc cagccatcgc 3493741 ggccgattgc cgcccagcgc aacgatccca gcacggtcac cgtttcgcag cgcggtttgc 3493801 accacctgag cagcaccccg cacgactcgt tcggtggcct cggtcgccgg acccgccggc 3493861 tgtcgataca tgtcgatcaa caccaccacg tcagcggccc ggtcggtcaa ccgccttgtc 3493921 acgtgcagtc ggccacggcg cgcgcttacc acccagttca cggcacgtag ctggtcgccc 3493981 gggacatatg ggcgaatgtc ggcgtattcg acacccggcc cgacgtgccg ggtgagatga 3494041 gctcccaggc ggtcgagcaa ttcggtctgc ggcagtggcg tcgactgcgg cggtgtcagc 3494101 ggaaacacga cgatttcggc ggcgtcgacg gttccggctc ccatcaacaa cccaccgcgt 3494161 gcgacgacgg cgacccgggc ccggatagga tagcgccccc agcgttgcgc caccgcggaa 3494221 accgttgtcg tccggcgtga cacggattcc agagcttcga actgcattcc cgccaacgcc 3494281 gataccgtga gttcgaccgc ggcgtccacg gattccgttg tgacccacac ggtcactcgc 3494341 acatgttcgt tctcgaaaca tcgctgcgaa tccgggtcac cgtgcacctg gatcaccggg 3494401 accggacgct gccagctgat cgagcacaac acgccgagca gcggcgccgc gaacgcaatc 3494461 agctgccaac gaccagcgac gaccgctgcg gctagcgcaa ctccggcaca ggtggcaatc 3494521 gccagcgtca gttgtgatgc acgccagcgc aactcgactt cacacgtttg gatcacatcg 3494581 cgccgtagtt catccagcca acccgctacg ttccactaat tcggggaaca ggcagacgcc 3494641 gcaacagctc tgagaccaca tcagcgcccg caatcttgcg cacccacatc tccgggcgca 3494701 atgtgatccg atgcgcgacg gccgcggtcg caagttcctt gacatcttcg ggtatgacgt 3494761 agtcccggcc gagcaacaga gcgcgggcac gggagagctg gaccaggtcg agttcggctc 3494821 gcgggctggc gccgacggcc acctgcggat ggtgccgggt agcgttggcc aacgacacca 3494881 catagtgcaa gacgtcctcg tgcacggtga cctgctcgac cgattcacgc atggccaaca 3494941 gatcgtggca gtccaccacc tgattcaccg tcggatccgc agaaccgcgt tccaggcgac 3495001 ggcgcagcat cgaggtctcg tctcgctcgg agaggtagcg cagttccaac cggatcgcga 3495061 accgatccag ttgcgcctcc ggcagtggat atgtgccctc gtattcgatc ggattgtcgg 3495121 tcgccagaac gatgaatggc attgccagtt tatgggtttg gccatcgatg ctcacctggc 3495181 cctcggccat tgcctccaac agtgccgctt gcgtcttcgg cggcgtccgg ttgatctcgt 3495241 cggcgagcaa caggttggtg aaaataggcc cggcccggaa ttcgaaacga ccggactgca 3495301 tgtcatagat ggtcgagccg agcagatcgg ccggcagcaa atcaggcgtg aattgcactc 3495361 gggtgaaatc gagccccaac gcggcggcga aggatcgcgc gatcagcgtc ttgccgaggc 3495421 cggggagatc ttcgatgagc acgtggccac gggcgagcac ggcggtgagg atgagtgtca 3495481 gtgcagagcg cttccccacc accacacgtc cgatttcgtc gagcaccgcc tcgcagtggg 3495541 cggtggtcgt cgcggccggc ataatcatcg ttgagtcata cctgttctaa cttctgcaga 3495601 atttcttcca gtgccgcacg gccggggcct ggttgacggt cgccggtgtg cgtcacattg 3495661 ttcgggttga cccattccca caattcgtcg ccgaaaagca ttcggccggt ggcagcaaag 3495721 gcaaccgggt ctttggcctg tctatggccg gtggcgattt cgaaccgtcg tgcgagcatc 3495781 ggacgcaaat gccggtccca gtcggctcga gtggactccg accaccggat cgtcgtctcg 3495841 gtgttggaga gccaccggcg caacccctcc cccagatcgt cggagtccgg cgcagccgtg 3495901 agttcgtccc ggttgcccag catccggcgg acgttgagca gcaccagagc cagggcgagc 3495961 cccgacccgg cgagcacgag ccgacggtcg tgcagtatca gcgccagcag ctcaatcccc 3496021 acgatgagga aaatccccag ggcgataagc cttttcatat agcggtccga gtgctcagtt 3496081 cgtcaagaac cagtcgaagc aaacgcatcg ccacctcacg gtgctcctcg ttcatcacgt 3496141 gcgggctaaa acgcgcctcg gcgaacaggc tcaccaacgc ggcggcacta gcaccatgga 3496201 gcgcacggtg ttcgacggct cgggccagca cctcggtcgg ggtgtcgaag tcctgagggg 3496261 caacaccggg aacatgcgac agttcacgct ccatcgccac gtaacacgca attatcgcct 3496321 cccgtggttc gcggcggagg tcggccatct cggccagtcc gatctcggcg gcacgcgcca 3496381 gtgattccga acgcgccgag ggcgccggag actcgatgcg atcgccactg atacgagccg 3496441 gtgccgactt gcgctgtcgt cgcgaggtaa tcagcgaccc cgcgacgacc atcaagaaca 3496501 ggccgattgt gctggcaaag agaatgccga gcacgtcgtc attgttgtct tgcggcggtt 3496561 gcgggcgcga cggcgtggtg ctggaagcat ccggcgtagc ggttgaatcc ggtatgggcg 3496621 cagcaggacc gacatcatcg gggacgaaca accgtgccag cagtatcgca atcagcagcc 3496681 aggccaggat tgtcccgagt ccgagcaaca gcacacgcca gttcggacgc cctgctgcac 3496741 cgccaagcat tgccgagagc tcccccgcgc tgggcgccac cgggagcgga tgtcgcaacc 3496801 gggtgatgat ggcgagcgct atcagcgcga gcgtcgcggc aagtgcggcg acaatgaaca 3496861 tcagcgccgc ccggctgccg ccggccgccg cgagcggtgc accgtcgtcg gccggcaggt 3496921 ggccgcgcag ggcagcgcca gcaagcatca agagcacgat cacgacgacg acgcgccctg 3496981 tcggtttgtc actaccgggc ttagtaccgg gcatacgcac accactcgac cggttgcctg 3497041 ccgccgttgc ggcctggggg ttggttcaac ctggcttggt tcatactggc acgtcagacg 3497101 acactgccgc caggagcggc gcggtggacc cctcgcacga cgatcgcggt ggtttggtcc 3497161 acccacgcgt cgtccagcat gtcgtccggg tacagcagca tccgcagcat ggtggcgccc 3497221 ccgatcagct cgatcaaccg gtccgggtcc acgtcgggat gcgcctcgcc gcggtcgacg 3497281 gcctcgcgca ggcgcatgcg caccgcggcg aataagtcgg caaaacgcgc cagcacccgg 3497341 gcgttgagtt cagcgtctgc ggtcatatcg gctaccagac cgggtaacgc ggcccgcacc 3497401 accggggtgg tgaacacatc gcgggtggcc gcgatcatca ttcggatgtc ggcggcgata 3497461 tcaccggccg cagcctgcag cgcggtgggc gcggcgggaa acgcggcctc gtgcactagt 3497521 tcggccttgc tcgaccaccg ccggtacaac gccgatttgg tggtgccggc gcgttcggcg 3497581 accgcggcca agctgaggtt cgaatacccg atctgcacaa gcagttccgc cgtcgccgac 3497641 aggatcgccg agtcgatgcg cggatcacgc ggccgcccgg cgccgggggc cttgtcaagg 3497701 gagggcaggt ctgctttcat aacgctacct aaagtagcgt aattgccgca ccagggaggc 3497761 gcttgtggcc aacgaaccgg caatcggagc catcgaccga ctccagcgct cgagccgcga 3497821 cgtgaccacc ctgccggcgg tgatatcgcg ctggctgtcg agcgtgttgc ccggtggggc 3497881 ggcacccgag gtgaccgtgg aaagtggcgt ggactccacc ggcatgtcgt cggaaaccat 3497941 catcttgacc gcgcggcggc aacaagacgg gcgatcgatc cagcagaagc tggtggcgcg 3498001 ggtggcgccg gccgccgagg acgtgccggt gttcccgacg tatcggcttg accaccaatt 3498061 cgaagtgatc cggctggtcg gagagctgac cgacgttccc gtcccgcggg tgcgctggat 3498121 cgagaccacc ggcgacgtgc tgggaactcc gttctttctg atggactacg tcgagggcgt 3498181 ggtgccgccc gacgtcatgc cgtacacgtt cggtgacaac tggttcgccg acgcgcccgc 3498241 cgagcgccag cgccaactgc aggacgccac cgtcgcagcg ttggccacac tacattcaat 3498301 ccctaacgcc cagaacacgt ttagcttcct cacccagggc cgcaccagcg ataccacgct 3498361 gcaccggcac ttcaactggg tacggtcctg gtacgacttc gcggtggaag gcatcggtcg 3498421 atccccacta ctggaacgga ctttcgagtg gctgcaaagc cactggccgg acgacgctgc 3498481 cgcgcgcgag ccggtgttgc tgtgggggga cgcgcgggtg ggcaacgtct tgtaccgaga 3498541 ctttcagccg gtggcggtgc tggactggga aatggtggcg ctgggtccac gggaactcga 3498601 cgtcgcgtgg atgatatttg cgcacagggt atttcaggag cttgccggtt tggcgacgct 3498661 gccgggtttg ccggaggtga tgcgtgagga cgatgtgcgc gccacctacc aggcgcttac 3498721 cggcgtggaa cttggtgacc tgcactggtt ttacgtgtac tccggggtca tgtgggcatg 3498781 cgtgttcatg cgcaccggtg cgcggcgagt gcacttcggc gagatcgaga agcccgacga 3498841 tgtggagtcg ctgttctatc acgccggctt gatgaagcat cttcttggag aggagcacta 3498901 atgccgcaaa tgctaggccc actcgacgag tacccgctac atcagcttcc ccagccgatc 3498961 gcctggccgg gctcctccga ccgcaacttc tacgaccgct cctacttcaa cgcccacgac 3499021 cgcaccggga acatctttct gatcaccggt atcggctact accctaacct gggcgtgaaa 3499081 gacgcgttcg tgctgatcag gcgtgcggac atacagaccg cggtgcatct ttcggatgcc 3499141 atcgactccg accggctaca ccagcacgtc aacggttacc gggtggaggt cgtcgagccg 3499201 ctgcgaaaac tgcgtatcgt gctcgacgaa accgaaggtg tggcggccga tctcacctgg 3499261 gagggcctgt tcgacgtcgt ccaggaacag ccgcacgtct tgcgctccgg caaccgggtg 3499321 accctggatg cgcagcgctt cgcgcagctg ggcacctgga gcggccgcat cgtcgtcgac 3499381 ggcgaacgga tcgccgtcga tccggcgacc tggctcggca gccgggaccg gtcctggggc 3499441 atccggccgg tgggggaacc agaaccggcg ggccggcccg ccgacccacc cttcgagggc 3499501 atgtggtggc tgtatgtgcc gttggccttc gacgacttcg ccgtcgtgct gatcatccag 3499561 gaagaacccg acgggttccg ctcgctcaac gactgcaccc ggatctggcg tgacggccac 3499621 gtcgagcagc tgggctggcc gcgggtgcgg atccactacc gctccggcac ccgcatcccg 3499681 accggggcga cgatcgaggc aagcaccccc gacggcgcgc cggtgcactt cgacgtggag 3499741 tccaaactgg cggtgccgac ccatgtcggt ggcggctacg ggggtgactc ggactggtca 3499801 catggcatgt ggaagggcga gaagttcgtc gagcgaagaa cctacgacat gaccgatccg 3499861 acgatcatcg cgcgggccgg cttcggcgtc atcgaccacg tcggtcgcgc gctatgccgc 3499921 gacggcgacg ggaatccagt gcagggctgg ggtctgtttg aacacggggc gctgggccgc 3499981 cacgacccat cggggttcgc cgactggtct acgctggcgc cctaggcgct tcaggcttac 3500041 ttcggcaccg gtgaggctat ccgcattcgc gagtccaggg ttcctgggcg ccggccggga 3500101 aacggcccga aaacgacggc agccggaata gccgaccgga accgccgaaa tgcggttgac 3500161 tagagcggtg acaaacccac cgtggactgt cgatgttgtc gtggtgggcg cgggcttcgc 3500221 cgggctggcc gcggcccgcg agctgacgcg acagggtcac gaggtgctgg tgttcgaagg 3500281 ccgcgatcgg gtgggcggcc gctcgttaac cggtcgcgtg gcaggggtgc ccgcggatat 3500341 gggcggctcg ttcatcggcc cgacccaaga cgccgtgctg gcgttggcca ccgagctggg 3500401 gatcccgaca accccgaccc accgcgacgg ccgaaacgtc atccagtggc ggggatcggc 3500461 acgcagctat cgtggcacca tccccaagct gtcgctgacc gggctcatcg acatcggccg 3500521 gttgcgttgg caattcgagc gaattgcccg cggcgttccg gtggccgccc cctgggatgc 3500581 gcggcgcgcg cgtgaactcg acgacgtgtc gctcggggag tggttgcgct tggtgcgcgc 3500641 cacatcgtcc tcgcggaacc tgatggccat catgacccgg gtgacctggg gttgtgagcc 3500701 cgacgatgtc tcgatgctgc acgccgcccg ctacgtacgc gcggccggcg gcctggaccg 3500761 gctgctcgac gtcaaaaatg gtgcccagca ggaccgtgtg ccggggggga cacagcagat 3500821 cgcccaggcg gccgccgccc aactcggcgc acgcgtcctg ctcaacgccg cggtgcgtcg 3500881 catcgaccgg cacggagcgg gtgtgacggt cacgtccgat cagggtcagg ccgaggccgg 3500941 gttcgtcatc gtcgccattc caccggccca tcgcgtggcc atcgagttcg atcccccgct 3501001 gccgccggaa tatcagcagc tcgcccacca ttggccgcag ggccggctga gcaaggccta 3501061 cgcggcctat tcgacgccgt tctggcgggc cagcgggtat tccggccagg cgctgtccga 3501121 tgaggcgccg gtgttcatca ccttcgacgt cagtccgcac gccgacgggc caggcattct 3501181 gatggggttc gtcgatgctc gcgggttcga ctcgctaccc atcgaagagc gccgccgcga 3501241 tgcattgcgc tgctttgcgt cgctgttcgg cgacgaagcg ctcgaccccc ttgattatgt 3501301 tgactatcgt tggggtacag aggaattcgc gccgggtggt ccgaccgcgg cggtaccgcc 3501361 ggggtcgtgg acgaaatacg gtcactggtt acgtgagccg gtcggtccga ttcactgggc 3501421 gagcactgag accgcggacg aatggaccgg gtatttcgac ggcgccgtca gatccggtca 3501481 gcgtgccgcc gccgaggtcg ccgccctgct atgagctgat ccgccggtcc cggacgtgcc 3501541 gggtcaccga ttcggccagc gcccgcaggt ggctgttcac ctcttggtgc cgttccagca 3501601 tcgagcagtg gccgccgggc agttcaacga ggccgacgac attgggcgcg gtgcgcgcaa 3501661 tcctgcggga ctggctgatc ggcgttagtc gatcacgtac gccgccgatc accagggttg 3501721 gcaccgtcag accatccagg ttgaggtgtg ccgaccctac ttcctcgacg agcatcttcg 3501781 cgcagccgcc gcgccccgcg gcagacatct gggtgaacaa ctcatagacc agtctcgtgg 3501841 cgctggggtc cgcgtcggcg gcgaccgcca gcgtggagat cacgtgccgg cttaaggccc 3501901 tggccgcgcc ggggagtgga aacccgccga acgtgttgac caggctccgg ccggccagca 3501961 cccgaaccgg ggacaactcg cgtggcaccg acagcagttt caccttgcgc accaggtcgc 3502021 cggtggtggt gttgatcagc gcgacggcgt ccgtgcggcg gcggactttg tggcggtagc 3502081 ggtccgacca ggcggcaatg gtaatgccgc ccatcgagtg cccagcgacc accgcacgct 3502141 cgcgcggggc caacgtagcg tccaacaccg aatcgaggtc ggccgcaagg tgattgaggc 3502201 tgtaggcgcc acgccgtggg acaccgcttc gaccgtggcc gcgatggtcg aaggcgatca 3502261 cccggtagtc gccggccagg tcggcgattt ggtatgccca ggcccggatg gcgcagacga 3502321 aaccgtgcgt cagcacaatc ggatagccgt gaggcggccc gaacacctgg gtgtgtaacg 3502381 gggtgccgtc cgccgcacgg acggtcaagg tgcggctagg cggtaggacg tctggaatct 3502441 gggtagcccc gctgcttcga gtgggtctcc gagcactcat cgccgctccc ccttcgacgc 3502501 ggccccgttg ccgccttccg gatgtcgccc actctagcgt gcagttactt acgggtagct 3502561 ggaaatcgct gaagcatagg atcacagaat aataacgtcg cggcccctgc tctcagctgg 3502621 tttcgcatcg ccagccgatc agtagtcgtc tcagtaatcg tcgagggcgg ccacgttgcg 3502681 ccaactcggc cacgtcgtct cccagatccg gtgaattcgg ccgttgcggt aggcggcaat 3502741 gagtaccacc tcgatgcggg tcggctcctc gccaggtcgc gacgtggtga tccacacccg 3502801 cccggcaacc ttgtctgggc ctctacccat gcgtgctcgt cgtattcgac cgcgtagctg 3502861 atcgccgtgg cgtagagctt gcggtggcta tcgcggaatt ttgcgaagct ctggctcagc 3502921 ccgtcggagt acatcaggaa gtctgggtcg tagtagtgct cgatcagctc cgcgtttttg 3502981 gcgacgacca tccgatcgaa catttcccga agcagcgcaa cggacattcg gcgatcctaa 3503041 accctggccg ccggccatct cacaacgtga gcgtggacga atccccatcc attgcgatga 3503101 cgagttcaga ccggacgggc cgttgcctga tcaatcagga cctccgctgc cgctcgggcg 3503161 tgcgcccagg ggccggcatc gtcgagggag gtggacagtg cggccgcgcc ctcgaacagc 3503221 acggcgagtt gattgcccag gctgcgcgga tgcgctgcgc cggcttctcg ggccagccgg 3503281 gcgaggcctt tgatgtagtc gcgtttgtgc gagtggacga tccgctcgac tccgggcatc 3503341 tccccggccg cctcgaccgc cgcgttgtgg aatggacaac ctcgcatccg cccatcgccc 3503401 ctgtttggac gatcgaacaa tgcgagcagc cgctcgcgtg gtgtcgcgtt ggatgccttg 3503461 ggcatcttgt cggcctcgcc ggcggcttgc cggagcccgc gcaggtactc ctccaccaac 3503521 gcggacttac tcggaaagtg ttggtagaga gtccgcttgg ataccgaagc cttgttcgca 3503581 atcagttcga ccccggtggc gttgatgccc tcgcagtaga acagctctgc agccgccttc 3503641 aagatacgct gacgagcgcc gcggcccccg cgcctggggg gttccgttgt tctggtgacc 3503701 ggcggcatag tgctgagtat accgacctgt ttacaacacc ccttagcgcg tgtaccgtca 3503761 aagcacaaag tacaccaatc ggtttactgt aggaggtctc atgacttcac tagccgagcg 3503821 gaccgcgctc gtcaccggcg ccaaccgcgg catgggccgc gaatacgtcg ctcagcttct 3503881 cggtcgcaaa gtggcaaagg tctatgccgc tacccgcaac ccgctggcaa tcgacgttag 3503941 cgatccgcgc gtgattccgc tccaactcga cgtcaccgac gcggtgtcgg tcgccgaggc 3504001 agccgactta gcaaccgatg tcggcattct gatcaacaat gccggcatct cccgggcgtc 3504061 ctcggtgctc gacaaggaca catccgcgct tcgcggcgag ctggagacga acctgttcgg 3504121 accgctcgcg ctggcctccg cgttcgccga ccgcatcgcc gagagatccg gtgccatcgt 3504181 caacgtttcc tcggtactcg cctggcttcc ccttggcatg agctatggag tgtccaaggc 3504241 ggcgatgtgg agcgcgacgg agtcgatgcg tatcgagctg gcgccgcgcg gtgtgcaggt 3504301 ggtgggcgtc tacgtggggc tggtcgacac cgacatgggt cgattcgccg acgcgccgaa 3504361 gtccgatcct gccgatgtgg tccgccaggt gctcgacgga atagaggctg gcaaggagga 3504421 cgtgctcgcc gacgagatga gccgtcaggt gcgcgcgtcg ctgaatgtcc ctgcgcggga 3504481 acgtatcgcg cggttgatgg gtaactgagt ccgaaagtcg atatggccat gtccgccaag 3504541 gcctcagacg atattgcctg gctaccggcg accgctcaac tcgcggtgct cgccgccaag 3504601 aaggtgtcca gcgcggagtt agtcgagctg tatctttccc gaatcgacac gtacaacgcg 3504661 tcgctcaacg cgatcgtcac cgttgacccc gacgccgccc gacgcgtcgc caagcggtcc 3504721 gatgcggcac gagcccgcgg cgacgaactc ggcccgttgc atgggttgcc gatcaccgtc 3504781 aaggacagct atgagacggc cggcatgcgc acgacctgcg gtcgccgcga ccttgccgac 3504841 tatgtaccca cccaggacgc cgaggcggtc gcccggttgc gccgggccgg cgcgatcatc 3504901 atgggcaaga caaacatgcc caccggcaac caggacgtcc aggccagcaa tccggtcttc 3504961 ggccgcacca acaacccatg ggacgccgcg cgcacgtccg gcggctcggc cggcggcggg 3505021 gcggccgcca ccgcggccgg gctgaccagc ttcgactacg gctcggagat cggcggctct 3505081 accaggatcc cggctcatta ctgcggtctg tacggccaca aatcgacctg gcgctcggtt 3505141 cctctggtcg ggcacattcc cagcgcacca ggtaatcccg ggcgatgggg gcaagccgac 3505201 atggcctgcg cgggcgtgca ggtgcgcggt gcccgcgaca tcatccccgc actggaggcg 3505261 accgtcgggc cgatgcgggc ggacggagga ttctcgtatg cgctcgctcc gccacgagcc 3505321 ggcgcgctca aagacttccg ggtcgcggtc tgggccgagg acccgcattg cccaattgac 3505381 gccgacgtgc gtcgggccat ggatgatgct gtcgccgcgc tgcgcgccgc gggcgcacac 3505441 gtcgttgagc agcccgccac catcccggtc gatatggcgg tgtcgcacaa catcttccag 3505501 agtctggtgt tcggcgcctt cgctgtcgac cggtccaccc tcagcccagc ctccgccgcc 3505561 gcgctcggat tacgcgcggt tcggcatcct cggggcgaag ccgccaacgc cctgggtgcg 3505621 acgctacaga gccaccgtgc gtggttgttc gccgatgcgg cgcgccacga aatgcgcgac 3505681 cggtgggccg gattcttcaa cgagttcgac gtgctgctcc tgcccgtcac gcccaccccc 3505741 gcgccgctcc accacaacaa ggaccacgac cggttgggcc gcaccatcga cgtcgacggc 3505801 gtctcacgat cgtactggga ccaactcaaa tggaacgcgc tggccaacat cgccggcacc 3505861 ccggccacca ccatgcccat caccaccaca gctaccggac tcccgatcgg catccaggcg 3505921 atggggcccg cgggcggaga ccgcaccacc gtagagttcg ccgccctgct caccgaagtc 3505981 ctaggcggct tccgcgttcc ccctctttag gaacgctcgg gcagggccgc aataacctcg 3506041 gcgagccgat cgggctgctc cgctgtcgtc aggtggccgc ccgcaagctc ggtgatttcc 3506101 accgaatccg caagccgctc tcgggcgagc cgcagttgct ctccctcgaa tggatcctcg 3506161 gcgctgccca ccacaccaaa ggcgacctca tcgcccagcg ccgaaatgat ccgcgccagg 3506221 tcccagcgcg ctgcgtgctc gcgatgctcg tccacgaagc ccgccgtggc gggcagcacg 3506281 cgcacgccgt cgcgccggct gatcgcgtcg tggagctcct tcatctccgc tgcgcttaat 3506341 gggtatccgc gcgagaagac gggcgcaaga atggggcgaa catgcgccat gagcgctggc 3506401 cgatcggcgt gatcgccgcg ccgagcggcg atgtgagcag cggcgtcgta taccaggcgt 3506461 gggtgtggcc gtcggcaaag atgccgccgt tggcgagcag gcaagccgtg attcgggtcc 3506521 gctgatcgtt tcccgcccgc tcgcgatcga tccgccgcgc cagcagctca aggctgacga 3506581 tgcaggagta gtcgaaggca acgacgacgg tctgcgctat cccctcggcg tgccagaggg 3506641 cttcgacgag atccgcgcgc tcgaaggtcg agtacgggta atcccggggt ttgtcggagt 3506701 cgccgtggcc gatgtagtcc aggtagatgc gggggaagtg gaatcgcgag ctcaagaaag 3506761 cttccacctt cgcccaaccg taggaaccat ccggccagcc aggcaggaac gttcgcgtga 3506821 cccccgtccc agcagcgcgc cgtatgaacg cgcgcagcgg cgaacgtggg ttgatgcccg 3506881 gccgctcagc gtcgtagccc accctctccc cagcggagaa ccactcctgt gcgctgatga 3506941 gcgcgctcgc ccggtgcgtc atcgcgcgct cgctagccgt tggcggaggt tgtcgaggtc 3507001 catgtcggtg catctccgca accaaagtac accgataagt ttacgtgtcg cattaaccga 3507061 tgtacagtgt cggttataag tacaccgatc agtatacaag gagtcggcgt gccccagaga 3507121 caggccggcg acatcggcgc gacataccag gacgcgccca cgaagagcat caatgtgggc 3507181 ggaacgcgtt ttgtctaccg gcggctcggt gctgatgccg gcgtgccggt gatctttctg 3507241 caccacttgg gcgcggtctt agacaactgg gatccacggg tcgtcgacgg catcgccgcc 3507301 aagcatccag tggtcacttt cgacaaccgc ggtgtcggcg cttcggaagg ccagacgccg 3507361 gacaccgtga ccaccatggc cgacgatgcg atcgcctttg tccgtgccct ggggttcgat 3507421 caggttgatc tccttggatt ctcgttgggc ggcttcgtcg cgcaggtgat cgcgcagcaa 3507481 gaaccgcagc tcgttcgcaa gatcatcctc gcgggtaccg gaccggccgg tggtgtcggc 3507541 atcggcaagg ttactttcgg gacgatccgc gagagcatca aggccacact gactttcagg 3507601 gatcccaagg agttgcggtt cttcacgcga accgacagcg gcaaatcggc ggcgcgacag 3507661 ttcgtgaagc ggctcaagga acggaaggac aatcgcgaca aatcgattac agtgcgcgcg 3507721 ttccgctccc agctcaaggc catccatgca tggggcacgc aaaagccttc ggacttgacg 3507781 agcatcggcc atccggtcct gatcgcaaac ggtgacgacg acacgatggt gcccaccagc 3507841 aactcgttgg acctcgctga ccggctgccc gacgccacgc tgcgcatcta tcccgacgcc 3507901 ggccacggcg ggatattcca gcaccacgca cagtttgtgg acgatgccct gcagtttctc 3507961 gagtcgtgaa gcgatttcgc atgaccacca aagccacgcc cagaccagtt ggattcgccg 3508021 ctcctcccca ccgtttcgcg gtatcggcag agcgcaccca tggatctatc accgcaccgg 3508081 cggacgagtc ggctgcaagt tgcgactcgg cgccggattc cgcaaaccgg tgccgacact 3508141 gctactcgaa caccggagcc gcaagtccgg caagaacttc gtcgcaccac tgctttacat 3508201 caccgaccgt aacaatgtca tcgtcgttgc ctctgccctt gggcaggcag aaaacccgca 3508261 gtggtatcgc aacctgccgc ccaatcccga cacccacatt cagatcggat ccgatcgccg 3508321 cccggtgaga gccgtcgtgg ccagctcgga cgagcgggcg cgcctatggc cgcgcccagt 3508381 agacgcctac gccgacttcg attcttgcca aagctggacc gagcgtggga ttccggtgat 3508441 catcttgcgg ccacgctaat aggcgtcggc ctgctccgcg tggtcgagcg atcccggtgc 3508501 ggttacccgc tacggggtgc tttcggcacc gcgatcggct aggccaccga gggagcagac 3508561 atcgaataca gcggccgaat caagtcgctg gacccggcaa ctcccacggg tgtcgtcacc 3508621 gtcgccgcga tgactggcgg ccggaagacc tttggccagg cgacgttgaa cgtccgcttc 3508681 cgctgacccg gcggcctggt gacggcggcc gaggacaaag aagagcggct tcggctgtcc 3508741 ggaacccgga tcgaactcga ggagctactt cagcttccgg tcgatgttgc gtacgagggc 3508801 ctgttgacgg acgacgtttc cgaatccgtt cgcaaaaagc tcattacgct acgagccggt 3508861 ccctcaagaa ccgcctgctc gaatctgcgc aaccccgctg gcgttggggc ggacgacggt 3508921 gctcggcgtg atgtggtgca ccaaagggac attgccgacg gaactggcgt tgagccagca 3508981 acacaccgtt gatcgcatga gtgatgtcca cccaaccgcg gtcaccgaca acggggatcc 3509041 agtcgggatc atcgctggca taaggatatc ggcctgcacc ggcattgtgt gctcacggcc 3509101 atcgctgcct gggaccaatc accagcccct ggaaggtcga ctacagccac aagcccgacg 3509161 atggtcgaca gatcaagata cgtctttcga caaaacaaga tccaatggtc gacaaaacag 3509221 gacaaactat tcgacaaatc gggatcagat gtacgacaaa acaggagtac tttgacgttg 3509281 tggtgcatga tgaggctggt cacgagctga tcgagcggca catgctcgaa cagttgcgcg 3509341 aggttgcgga gtacacccgt gtcgtgctga tcaatggtcc acggcaggct ggtaagacga 3509401 cgctgctcca acaattgcac gccgagctag gcggatggct gcgttcgttg gatgttgacg 3509461 tcgaacgcgc gtcggcgcga gccgatcccg aggggtacat catgtccgcg ccgcgcccga 3509521 cgttcttgga cgaggtccag tgcgccgggg atccgttgat cctggcgatc aagacggcaa 3509581 ccgatcgtga ccgccggccc agacagttct tcctgtcggg gtcgacccga ttcctgacgg 3509641 tgccgacgct gtcggaatca ctggccggac gggttgcgat cctcgacctc tggccgctgt 3509701 ctgtcgctga acgatcgggt gtccggccgg agatcattgc gcaactgttc actgaacccc 3509761 aagtggtcct gggcacggag cccgccccgg tcacgcgaca tgagtatctg cagctggcct 3509821 gcgcgggtgg ctttccggaa gttgtgcagc gcccggcggg tcgcgcccgc agccggtggt 3509881 tctcggacta tctgcgcacg gtgacgcagc gcgacgtgcg cgagctgaag cggatcgagc 3509941 agacggatcg cctgccgcgg ttcatgcgct acctggccgc tatcaccgcg caggagctga 3510001 acgtggccga agcggcgcgg gtcatcgggg tcgacgcggg gacgatccgt tcggatctgg 3510061 cgttgttcga gacggtctat ctggtacatc gcctgcccgc ctggtcgcgg aatctgaccg 3510121 cgaagatcaa gaagcggtca aagatccacg tcgtcgacag tggcttcgcg gcctggttgc 3510181 gcgggcaaag cgccgactcc ctggccaggc caaccgcgga gggcgcgggc ccgatcatgg 3510241 aaacgttcgt gatcaacgag ctgatgaagc tacgtgcggc gaccgaactc gaggttgacc 3510301 tgtatcactt tcgcgatcga gacggacggg agatcgactg cattcttcag accccagaca 3510361 gtcgcgtcgt cggtgtcgag gtcaaagcct cggcgacagt gaacgtccat gatttccgac 3510421 acttgtcatt cgcgcgtgac cgactcggcg acgaattcat caccggagtt ctcttctaca 3510481 ctggtgcccg ggctttgccg ttcggcgacc ggttgatggc tctacccatc aatctcctct 3510541 ggaacggaca atccgtctcc agcctgtagg cgcataccga tcgccatatt tcaagagcag 3510601 gttggagctt ctgcccccaa tcatcgtgcg gcaacgatgg gcggctctag cgctagtcga 3510661 cgcgctattc aaccagctca caccgagctc ccgcgcggcc acatacccgc gaccgtgtga 3510721 tgcaagcacc ccaccagctc cgcgcatcac gcaacgaacc ggtcaaatcg taggcttcca 3510781 aaatctccat gatctcctcg gcagacttca cgtcacccct tttcgggagc tgaacaaccg 3510841 acgcggagcc gtcggccgcg gatgccctgg ggcggcggtc cccaaacccg atatggctaa 3510901 cgtcaagcgg tcggatcacg ggtcgagttg ggcgggggcg actcggcacc cggcggcatg 3510961 ggctccggtg tgcaggcgtc ggtcccaaac ggcgactacc aggccggggt cgccgactgc 3511021 caatgcgctg gccagatgaa cggcgtcggc tccgcgtaag gcatgtgctc gggcgaggtg 3511081 gccggcgtgc tgttcaaccg tcgcggtgag ttcgactggg cgggtggcgg cccagaagtc 3511141 ctcccagtca cgctcggcgt cggcgagctc ggattcggtt aggtcgtgat tgcgggccgc 3511201 tgcagcgagt gcggcgcgga cttcggggta ggccaggcgg ctggacaatg cggcgtcgca 3511261 gccgtcccat agagcggacg ccagcgagct ccctgtctcg gtggtgagaa gtttgacgaa 3511321 ggcgctggcg tcgaagtaga cgagcggcac ggtcagcgcc gctggtcgct gacccggtca 3511381 gacaccggcc gctgcggtcg gggcctgggc cgtcccgcgg ctacgggccg ctgcgcggtc 3511441 gccttgccaa tcacgccttc ggccgtgaga cgctccaagg tgtctgtgct gtccagcgca 3511501 gcgagtcgtg cgatcggaat cccacgttcg gtgatgacga cctcgccacc ggcccgagct 3511561 cgatcgagcc aatcgctgag gtgcgcgcgc aactcggtca cggatacatc cacactttga 3511621 actgtacact cactgaaccg tgatttgtac atatcactct gcgtgcggca acgacgacgt 3511681 gagagattga cctgcgcaag ccggaggcga ggtggcaacg gccggtacac cgattcgtcc 3511741 gcggtgctgg cgacgccgaa acggtcgatg tcgtggtgac tggtcacctt ccgtccaagc 3511801 tgcatccgaa ggtgttgcaa cggaaggtgt ttgccgtccg cgctgggcct tcggcgcagc 3511861 tggcatttgt ggtcagctgc atggcgacgg cagcgcctcg gtggtgaacg ccgggtttag 3511921 cttgcagcgg ccgagcaggc tgcctcgttc ctgctcggtg acagttggcc cgacgatgac 3511981 cgcgcaccgc cgccaccacg agatataacc tagaggttat actggtgcgg aagcgttggc 3512041 cgtgatcctg ctcccgcagg tcgaacggtg gttcttcgcg ctcaacaggg atgcgatggc 3512101 ctcggtcacc ggcgccatcg acctgctcga aatggagggg ccgacgttgg gccgcccggt 3512161 ggtcgacaaa gtgaacgact caacgtttca caacatgaag gagctgcgcc ccgccggcac 3512221 cagcatccgg atcctgttcg ccttcgaccc ggcccggcag gcgatcctgc tgctgggcgg 3512281 tgacaaggca ggcaactgga aacgctggta cgacaacaac attccaatcg ctgaccagcg 3512341 ctccgagaac tggctggcga gcgagcacgg aggtggatga ccatggcccg caactggcgt 3512401 gacattcgcg ccgatgccgt cgcgcagggc cgcgtggatc tgcagcgggc cgccgtggca 3512461 cgcgaggaga tgcgcgatgc cgtcctggcg caccgcctgg ccgagatccg caaggcgcta 3512521 ggccacgcac gtcaggccga cgtcgcggcg ctgatggggg tctctcaggc ccgtgtctcc 3512581 aagctggaga gcggcgacct gtcccacacc gaactcggca ccctgcaggc ctacgttgcc 3512641 gccctgggcg ggcacctgcg catcgtcgct gagttcggcg aaaatactgt cgagctgacc 3512701 gcctgagcta actcacgccc acacttccgg ccggtctcga tctcccaagc cccagcacag 3512761 ctcgtgttcc caatctgttc ccaaccagat ccttagctat gcgcatgttc ccaaaagtgt 3512821 tcccgcccat gaaaacggcc cccggagtct cctccgaggg ccatttcgcc ggtagcgggg 3512881 acaggattcg aacctgcgac ctctgggtta tgagctaacc agtcgcaatc tctcccatcg 3512941 cggtcggtct catacgtcca gatcagcctc tattccgccg tccagcctgt tccgccgcgt 3513001 cgcggttgta cggattcgtt tcggcctgtt ctgttcccaa atccgttccc aacacagcaa 3513061 tcagcagcaa tcccaggccg aaatcggtca gactcttggt ggacctacag cacctcgcct 3513121 ccatgtggtc gcggagctag tgagggtcca tcggcagcac cacttagggc gcctccgttg 3513181 tcatcatggt cgataagcgg tagcgtttac ggtagtagaa ccggaagttg cggaggaacc 3513241 acgatggcgg tcaccctgga ccgggcggtc gaggccagcg agatcgtcga tgccctgaaa 3513301 cccttcggcg tcacccaggt cgacgtcgcc gcggtcatac aggtgtccga tcgggcggta 3513361 cgcgggtggc ggaccggcga catccgccct gagcggtacg accggctggc gcagcttcgt 3513421 gacctcgtcc tcctgctctc ggattcgctt accccccgag gtgtcggcca gtggctgcac 3513481 gccaaaaacc ggctcctcga cgggcagcgc ccggttgacc tgctcgccaa ggatcgctac 3513541 gaggatgtgc gaagcgcggc ggagtcattt atcgacggcg cctacgtgtg aagcttgccg 3513601 acgcgatcgc caccgcaccg cggcgaacgc tcaaaggcac ctactggcac caaggcccca 3513661 cacgtcaccc tgtgacctcc tgcgccgacc ccgcccgagg tcctggccgt taccaccgaa 3513721 cgggcgagcc gggagtctgg tacgcatcga acaaagagca aggtgcatgg gcggagttgt 3513781 tccgccactt cgtcgatgac ggggtcgatc cattcgaggt ccgtcgccgc gtcggtcgag 3513841 tggcggtcac actccaggta ctcgacctca cagacgagag gactcgatcc catctaggtg 3513901 tggacgaaac agatcttctg tccgacgact acaccaccac ccaggccatc gccgccgccc 3513961 gcgatgccaa cttcgacgcc gtactggccc cggcggcggc gctccccggt tgtcaaacac 3514021 ttgccgtgtt cgttcacgca ctgcccaaca tcgagcccga gcgatccgag gtccgtcaac 3514081 cgcctccgcg gctcgccaac ctactcccgc tgatccgtcc gcacgaacac atgcccgact 3514141 ccgtgcgcag attgcttgca acgctgacac gtgcaggagc cgaagcaatc cggcgccgac 3514201 gacgttaaag gcttcgagac cggacgggct gtaggttcct caactgtgtg gcggatggtc 3514261 tgagcactta acgcttcgtt gaccaaagcc ccacttgatg cgaggacgcg atcagacaac 3514321 ggaatggcct agccgccgtc gcggtggctt tgcgcgactg gggcggctca cggaatggtc 3514381 gtcgttggca cctctgctgt cgggcgtaat gcaaagggaa tcaatgtcag gtgaatctcg 3514441 cgttcgggat caccgtcggc gtgcatggtg aactcgtact ggtctgcacc ggcccgatgt 3514501 gcggggcagc gcttatgatt cgggtgctct ttgatcttgg cgatggcgtt atcgatgacc 3514561 gcggtcacgt ctttgttgcg gataaagagc aagatcgcgg ccttggtgtc gcgccacaca 3514621 aggtagccga atagctgctt cagcgcatcg tccatggttc ttgggcccga ccacactttg 3514681 cattcgccaa tgaagatgtt gcggtcgtcg acgcgaatga gaatgtcggt cttgcctgcg 3514741 ccgttgaaga gttcgccccc ggcatcgcct tcaaactgtg cgttgaggcc gacgagcagc 3514801 atgtctcgga tttcttcccc gtcgagcttg gcggcgacag atggggtgcg ctccaacgcg 3514861 ttccgctggt tacggagcac ccgaagtgcg gactggtagt cctcatcctg cattgcaggc 3514921 tccggcttga atgctgccct cgcgcccgct gggcggtgtg gccgcggacg cacgcttttc 3514981 cgactgatcg gagctgcgta tgtgtcggcg tccttcctgc ggcgtacagg gaagccgatc 3515041 tcggcctgga ggtttcgggt cgctaagagc tgctcacggc gcctcgccac catgcccggt 3515101 agctcgttgc gcagtccttg gttgtgcaag tcgatctgcc ggcgcgacca accgaggtac 3515161 ttctcaatat tcgcgatctg cttatgaaac gccgcgttga tcgccgcggc gtcattcgac 3515221 ggattgtcga tcgccaggtg gatttcgtga ccttgtagcc gcagtacctg cggcggcatg 3515281 gtcgtgaact ggtccgggcg aaggttaaag atgtccttat gcccctcgaa gggcaccacg 3515341 agaacgagcc tcgtcacgcg tcgggtgcgc tgttcgcccc aatcccggta ctgctggtcg 3515401 acctcggtgg ctggcagcat gaaagcgtcg tcgacgcgca gatcggggca ttcgaccgaa 3515461 cccaattcga cgagctgttc gacgacgtca tcaacgggcg tgttcagcag gtcgtcggcg 3515521 tcccagctct gaagacgctg cgccgtggct tggctcgcct ttccgagaaa tccggctaag 3515581 gagccagcga gatcgttgag gcgccccttg gaaaacagct gaacatactc cacttacccg 3515641 aagatagtgc tcatccccga cgcggctacg gaggcgtttc ggcggcgtgc cgcgatgcaa 3515701 tgcagccagc ggagccaccg ggccgtagcc gacgtcgcgt cgtgggtggc gacggggttc 3515761 tccggggtgc cggaatcctt cgacgagctt gtcgggggtc atgattactg ttctcgatat 3515821 gaacggattc aaggatgcga ggcccgatcg tcttccgctt tcggcatcgg tttgggatat 3515881 cgcccagcga tacaacaagg gcggacctac cgtcactgag gcgctatacg aggcgctgaa 3515941 ggaactcgag gcccaagtca tcgctctgca gcgaagcgag ggtaagggcc tgctcagccg 3516001 cctgagctga acgactagag gattggggaa ggggcccccg gggaatggat catcctactg 3516061 agcgggaatg ggccagcatc gccgaacata cacgcgcctc caacttcacc ggcgacctgt 3516121 tacgaatgcc gccttacccg ctgatcctca ccctccgaac gctggtgggg tctgccgagg 3516181 tggtcactgc atcacatacc ctcttcctgt cggcggcaac tgaatactga ccagagcgcg 3516241 gcaaggtggg ttctagtcaa cgtcgcaaca attgatggtc tggtgaggtt agcagcgcgg 3516301 tgaaaagttc agcgggactg cggtgcccga ggacttggcg gggtcggtta ttgatctcgt 3516361 attcgacagc ccgcagatgg tcgggcgtgt aggtgctgag gctggtgccc tttgggaagt 3516421 attgccgtag cagaccgttg gagttctcgt tgctggctcg ctgccacggt gagcgggagt 3516481 cgcaaaagta gaccggcgcg cccaggtcgg cggtgatgtc gatgtgccgg gccatttcga 3516541 tgccctgatc ccacgtgatg gaccggacca gcgtcaccgg caagtcgctc atggtctcgg 3516601 tgatcgcgat gcgcaggcag taagcgtcgt gggtcggcag gtgcagcagc cgaatcagac 3516661 gtgtctgtcg ctcgacgagg gtgccaatcg ccgagccctg gttcttacca acgatgagat 3516721 ctccttccca gtggccaggc tcggagcggt cggcgggatc gaacggccgc tggtgaatcg 3516781 acaacatcgg ctgggcgaag cgcgggcggc gacggccagg acgcagatgg gcgcggcgat 3516841 gagttcgtcc cgtgcgcaga gggccacggt gtggcgactt gacctgcggc ggccggatca 3516901 atcgtgattg aggctgatag acggcctgat agatgctttc gtggcacaac cacatcgacc 3516961 ggtcatcggg gtatttccgt cgcagatgcc gggcgatctg ttgcgggctc caccgctggg 3517021 ccagcagctc ggcgatcagc tcacaaaggt cggggttttt gtcgatccga cgccggtgac 3517081 ggcggactcg gcgttgaacc gcccagcgat gcgcttcgaa cggccggtac tggccatcgc 3517141 ggcgactgtt gcggcgtagc tcccgcgaca ccgtcgaggg tgcccgtccg agctggtcgg 3517201 cgatcttgcg gatacttagg cccgagcggc gcagatcggc gatgttgatc cgctcctcct 3517261 cggacagata gcgactacta atttggcgca cagccaaacg atcgagcgcg ggcacgaatc 3517321 cgacggcttc gccacgccga taggtcttgt atccccgcgc ccaattgttt gctgcagtcc 3517381 gggatactcc aacttcacga cccgctgccg agatggacca gccccgagcc cgcagctcca 3517441 taaaccgttg acgcttggcc gactgtgggc gccggcccgg accctttttc acgcgacgag 3517501 acgatgacaa cacaacctcc agaacctaga gatgtgttgc gacaccgcct agaaaccacc 3517561 ttgccgacac ctgatcagtt ttcggttgcc gctgacacaa tgaacatggc ccgcttcacc 3517621 cgttcagcgt cacgtggata agcggcccgt agcgcgtccc agtcggtttc ggagtagtcg 3517681 ggccgttgta caggggcatc cggcgcggcc ggtggcggca tcttgatgcc gccaccggcc 3517741 gcgtcacggt tcgcggttgg cgctcgcctg acgacggtgc tgctcccgtt cctgagcacg 3517801 ctgctttcta gccttgcggt ctccctgctt tcccatctcc cggtcctccc ggcgggtcac 3517861 gatagccgcg cactccgaca tacctggcgc ggcgcggggc gctgcgaacc ggatgggcgc 3517921 caccaccgat aaccattgcg cgttgcggca gccttcgcat tagcaatgct ggcgcgccgc 3517981 tcgacgcctc ggctatcacc tcacctgacc accgcgcgca tcaccgacga gacctcatca 3518041 tcgcgcccgc tctcgcaaac accacgcccg ccaaacgggg ctggcccgag acgatttcag 3518101 aggcccctac agaccgatcc gcacgcccga aacccgggtt accgctaagc agcccaggac 3518161 agcagccgca gtcctgatcg gcgaagactg acgttcagac cgcaagcaag ctaaatagca 3518221 agccaagcaa ttagcaagac taatgttccc aaatccgttc ccatcgggca tgaaaatgac 3518281 cccagaggtc gcacctctgg ggtcatttcc gctggtagcg gggacaggat tcgaacctgc 3518341 gacctctggg ttatgagccc agcgagctac cgagctgctc caccccgcgt cggtaaatgc 3518401 caggctaccg aacacgcacg aagctcgcca aatcgcgggt gccggagtac gaccgcccag 3518461 atcagcggag ctcgggcata cagctgcgcc gtacgcgtcg atgcgatgat gattccgcag 3518521 ccgctcagcc agctcggtga cctggcgcgt cgcccaggcc gcagggttct ctgttccccg 3518581 aaaacggccg caccgtcgat ctcaaacgca actgtcgcct cgccggccgc gcccggcctt 3518641 gagctgtcca ccgggatcgc gttggcgttc ccgcgcggtc ccttcgtccc ggcagccgcg 3518701 gcgtgggagc tccaggaagc taccagcggg aagttccagc tcggtctggg cacgcaggtt 3518761 cgcaagaatg tggtgcaccg atacggtatg gccttccacc gtcccggtcc gcggctgcgc 3518821 tacctgctgg ccgtgaaggc gtgcttcgcc gttttccaaa ccgggacacc ggatcaccac 3518881 ggcgagttcg acaatcccga cttcatcact gcccaatgga gcccggcgcg cattgacccc 3518941 cccggtccca gccccgctgg gccgcggtga atccgtggat gcggcgaggt ggccgacggg 3519001 gtgtggggcg aggccgggtt cgaggggacg accacgcgga tccgggagcc gacgagcacc 3519061 cgtgagcaga cgcagaagtc cccgatttcc ggtgaaatcg gcgacttctg cgtctgctcg 3519121 ccgcgagcgc cccgactgac tacccggcgt cgttgaactt ggtgatggcc tcatcaagtc 3519181 gctgcagcgc cgacccgtag gcggcgaagt cgcccttctt ctgcgcatcc cgcgccgcgc 3519241 cgatggcagc ctggatctcc tgcagcgcag caactttggc cggcgataag gtgaccgccc 3519301 cgacgggaac cgggggcgcc gcagtcaccg gcggcggttg gggtccactg gcaggcggcg 3519361 gtggattcgc agcgggactc ggtggtaccg ctgcctccgt gggcgcgatc ccggtagccg 3519421 tcgcaccggc cccgggcccg aacaagccgg tgagcgcatc ccgcaccgtg gggccgtatc 3519481 ccaccttgtc gttgtacatc atcgccaccc ggatcagccg cgggtaggac gaagcagcgt 3519541 cgctggctcc cggggatgca tagaccggtt cgacgtagag cagtccgccc tgggccaccg 3519601 ggagcgtgag caagttgccc cagcggatgc ggttttggtt gtcgcgtccg atgacaccga 3519661 ggtcctggga caccgccgga tcggtggtga tcgcgttgtt ggccaacttg ggcccgttga 3519721 cctggcctgg gatggtcaac accgtgagat tgccgtaggt cgcgggatcg gaactggcgc 3519781 tgatgtaggc ggccagatag tcacgcttga atctgttcat cgcgctgatc aactgatatg 3519841 aggctgaatt atcgtcctta gcaatgtttt tcgcgacgat gtaatacggc ggctgataac 3519901 tgctggcggt cggattcggg tccagcggca cgtcccagaa atccgatgtg gagaagaacg 3519961 tcaccggatc attgacgtgg tatttggcca acaacatgcg ctgcaccttg aacaggtcct 3520021 cgggataccg caggtgctcg gcaagctccg gcgcaatgtc gctcttaggc tttaccgtgc 3520081 cggggaagac ctgcatccag gccttgagca ccggatcctt ttcgtcctgt tggtacagcg 3520141 tgaccgttcc gtcgtaggca tccacagtgg ccttcaccga attgcggatg taggaaacct 3520201 tcttgtccgg gaccaaccgg ttgaacgcca cctcgttgga gtccgcggtc gccgaggaca 3520261 gcgaggtgag ctcggagtac gggtaattgt ccaacgtggt gtagccgtcg acgatccaca 3520321 ccagtcgctt gttgacgatc gcgggataca cagcgctgtc tgtcgtcagc cacggcgcga 3520381 ccgcctccac ccgctgcgcc ggatcgcggt tgaacaagat cttgctgttg gagccaatca 3520441 cattggagaa caaaaagttt cgctccgcga acttcgcagc gaacacgcta cgggctaacc 3520501 aaccaccgag cgggactcca ccgcttccgg tgtaggtgta tctcttggtg tcgatgttag 3520561 tttcgtagtc gtattcgcgg tcgtcgccat tgcgtccaac gatcgcatag tccgcggacg 3520621 tgttagagat caccggaccg aagtagatcc gcggctgatc cagtggcgcc ggcccatcag 3520681 acaccacggt gccattggcc ccgacgacgt tgaccaagaa ttcggggtaa ccgccatttt 3520741 gattcgggtc gttggcgata ccgcgcacgg tgttggccgg tgaggcgatg aacccgttcc 3520801 cgtgggtgta cacggtatgc cggttgatcc agtcccgttg gttgtcgatc aaccggtccg 3520861 ggttgagttc gcgggccgcg acgacgtagt cgcgcaggtt accgttgcgg tcgaggtagc 3520921 ggtcgatcga cagctggtcc gggaaatagt agaagttctt gccctgctgg aactgggtga 3520981 acgccgggct aacgattgtc gggtcgagta gccggatgtt cgaggtagtc gcgcggtcgg 3521041 cagcgacctg ttgcgcggta gccgggctat caccgctgta attgcgatag gtcaccacat 3521101 cagacgtcag gccataggct tgccgagttg cggtgatact tcggctgata tattcgctct 3521161 ctttttgcgc agcgttgggt ttgacgctga tttgctcgac gatcaacggc cagccggcgc 3521221 cgacaatcag cgacgacagc agcaacaaca ccaggccgat cgccggaatc cgcaagtccc 3521281 gcagggcgat cgccgagaac actgcggccg cgcaaatcaa cgcaatcgcc atcagaatca 3521341 gcttcgccgg caggacggcg ttgatatcgg tgtacccggc accggtgaac ggcttgccgc 3521401 cacgcgtgtg cgacagcagc tcataccgat ccagccaata agcaacggct ttaagtaaca 3521461 ccagtacccc gaccaggcta accaactgga cgcgcgccga gcggctcagc gcaccggtgc 3521521 gtccggatag ccgaatgcca ccgaagatat agtgcgccac cagattcgcc acgaatgcca 3521581 gaaataccga aacgagcatg tagctgagca tcagccggta gaacggcaac tcgaacgcgt 3521641 agaagccgag gtcccgcccg aactgcggat ccctaacccc aaagtcaccg ccgtgcagga 3521701 acagctggat ccgagcccag tagctttggg cgacgatgcc ggccagcaag ccgatcgccg 3521761 cggggattcc gatgccgact agccgcaggc gtgccagcac gacggcgcga taccgtgcaa 3521821 ccggatcgtt gtcggcatcc gggacgaaca ccgggcgagt gcggtaggcc aaggcgagcc 3521881 cgccgaacac gatgccgccg accaccaccc cggcaaccaa gcacaccacg atgcgggtag 3521941 ccagcatggt ggtgaacact gagcggtagc caagctcacc aaaccacagc cagtcgacgt 3522001 aagcgtcgat caaacgcggg ccagcgagca gcagcacgat cacacccagt gcgatcatga 3522061 tcagaatccg gctgcgccgt gtcagtttcg gcatccttgc ggcggaccgc attcccacta 3522121 gctacgctcc ctgatcgttc tggctggttg agactttctc gacggtcata actctacgca 3522181 ccgcaaccat ccgcagcagc cggcgcgagc tagcagctcg gcgtcggcga gcccgacgtc 3522241 atcgcgtgca gcgcgtccac cgcctggcta agcgtctcga ccttcaccaa cttcaaaccg 3522301 ggcgggctgt cggaacttgc ctcgtagcag ttcttcgcgg gcaccagaaa caccgtcgcg 3522361 ccggccgctc gagcagcggc catcttgtgg gtgatgccac cgatctggcc caccttgcca 3522421 tcgacggcga tcgtgccggt gcctgcgacg aacgtcgacc caaccaggtg gccactggtg 3522481 agcttgtcga cgacggccag actgaacatc agtccggccg aagggccgcc gacgttggcg 3522541 aggtggaagt ccacggcaaa cggcgcccac ggcgcgtcca ccacctctat gcccaggacg 3522601 ccttggtcgc gatccttatt cttgcccagc gtgatctgcg cgatgccggg cggctcgttc 3522661 ttgcggcgga agtcgatcgt cacctcctgg cccggtttcg tgttcttcaa cagcgcggtg 3522721 aactggtcga ggttgcccac cggagtgccg tcgacggcgt cgatggcgtc accggcctgc 3522781 agcttgtcca ccgatggccc tggatccatg accgaggcga cggtgactgc tttcggatac 3522841 ttcaggtacc ccagagcggc gtactcagcg gcggcctcgg agcgcttgaa atcagcggcg 3522901 ttgtcatttt cgatctcttc ccgcgacttg cccggagggt agacgaggtc gcgtggcatc 3522961 aactgttctt gacccgaaag ccacagggcc agggcttcac ccagggttag accgtcgcgc 3523021 tgggagaccg tcgtcatgtt gaggtgacct gacgtcgggt aggtctgggt gcccacgatc 3523081 tggaccacct gcttgccgtc tatctcgccg agcgtgtcga acgttgggcc gggtcccagc 3523141 gccacaaacg gcacggttac cacggcgagc aacacgccga ataccacgat cggcaccagc 3523201 gcgaccatca aggtcaatat ccgcctattc acgccgcata cactagacgg acctggccgg 3523261 gctggttcag ctgcgagcgt gaccgctgat cgcaccttct gttcccgcgg tgagtaccgg 3523321 tgaggtcatg ggtgacctgc ctttcggctt ctcttccgga gacgaccccc cggaagatcc 3523381 gtctgggcgc gataagcgcg ggaaggacgg tgccgattcc ggatcgggcg ccaatccgtt 3523441 gggcgcgttc ggcatcggtg gagaattcaa catggccgac ctggggcaaa tcttcacccg 3523501 cctaggagag atgttcggcg gcgtcggcac cgcgatggcc gcgggcaaaa cctcaggacc 3523561 ggtcaactac gacttggccc ggcaggtcgc gtcgagctcg atcgggttca tcgcgcccat 3523621 cccggcggcc acgaactcgg cgatcgccga cgcggtgcat ctggccgaca cctggcttga 3523681 cggggcaacc tcgctacccg ctggcgccac caaggcggtg ggttggagcc ccaccgactg 3523741 ggtcgacaac accttggcta cctggaaacg gctgtgcgat cccatggccc agcagatctc 3523801 cacggtctgg gcgtcgtcgc tgccggaaga ggccaagagc atggccggcc cgctgctgtc 3523861 gatcatgtcg cagatgggcg gcatagcgtt tggttcgcaa ctgggccaag cgctgggccg 3523921 gctgtcccgt gaggtgctga cgtctaccga catcggtcta ccgctggggc ccaagggggt 3523981 ggccgcaata ctgcccggcg ccgtcgaatc gtttgccgcc ggactcgagc aaccgcgcag 3524041 cgagattctg acgttcctgg ccacccgtga ggccgcacat caccgcctgt tcagccacgt 3524101 tccctggctg gccagtcaac tgctcggcgc cgtcgaggcc tacgccatgg gcatgaagat 3524161 cgatatgacc ggaatcgagg agctggcccg cgatatcaat ccgacgtcgc tggccgatcc 3524221 cgccgccatg gaacagctgc tgagccaggg agtattcgag cccaaggcaa cgccggccca 3524281 gacgcaggca ttggaacgac tcgaaacact gctcgccctg atcgaaggct gggtgcagac 3524341 cgtggtgact gcggcgctgg gcgagcgaat tccgggtgag gcagcgctca gcgaaacgct 3524401 gcgccgacgc cgagccagtg gcggccccgc cgaacagacc tttgcgacgt tggtcgggct 3524461 ggagctgcgg ccacgcaaac tgcgggaggc cggagcgctg tgggagcgcc tcacccgggc 3524521 cgtcggcatg gacgcccgcg acgccgtctg gcagcacccg gacctgctgc ccgccactga 3524581 cgatctcgac gacccggccg cctttatcga ccgtgtcatc ggcggcgaca ccagcggtat 3524641 cgacgaagcg atcgccgaac tcgagcggga ccagcaggcc cgcggcgccg acgactccgg 3524701 ccacgatggc ggtcctgtgg ataactgagc ggtgtgtctg ctcgcagtgt ggcaccgtct 3524761 caggtcatgc ggcgggctgc gtctgctctg tattcgttga atcctgcgat gccggtgctg 3524821 ctaagacccg acggtgccgt gcaagtgggc tgggatcctc gtcgggctgt gctcgtccgt 3524881 ccaccgcgtg gattaaccgc gacaggtttg gccgcgctgc tgcggtccat gcgatcaccg 3524941 ataccaatca ccgagttgca gcgccaagcc gccgagcgtg gattggttga cggtgacgcc 3525001 atggcgaacc ttgtcgcgca actggttggc gcgggtgtag cgacccccct agccaacccc 3525061 ggaaacctgg attcccggcg tcgcgccgcg tccatccggg tccacggtcg cgggccgttg 3525121 tcagacctgc tcgtccaggc gctgcgctgc tccggtgccc ggatcaggca cagcagccaa 3525181 ccacatgcgg cggtgactcc cgcgggcgtg gatctggtgg tgttgtcgga ctatctggtg 3525241 gccgatccgc acatggtgcg cgatctgcac accgagagag ttccgcatct tcccgttcgg 3525301 gttcgtgacg gcaccgggat ggtcgggccc ctggtggtcc ccggcgtgac cagctgtctc 3525361 ggttgcgctg acctgcatcg cagcgaccgc gacgccgcgt ggccggccat cgccgcccaa 3525421 ttgcgggaca ccgtcggggt ggccgaccgg gccacgttgt tagcgacggc ggcgctggcg 3525481 ctcagccaag tgaaccgggt gatcgccgcc gtgcgtggac aggaggcgac ccctgagccc 3525541 ccgtcggcgc tgaacaccac cttggagttc gatctcaacg ctggctctat cgtggcgcga 3525601 caatggacca ggcatccgcg gtgtttttgt tgacgttacg tctaacccag tcgtccctgc 3525661 tccggcacgt tggtcgagat tgacgcatag gctctggcca aggtgtcgag cacgtcctct 3525721 gtcagggtgc gctcgttgcg gtgcttgtcc agcgtttcga tgatcgctct gaacagggcg 3525781 tcggcagcgt cgtgctgcgt tgatcttgct gacatggttt cttgcggtcc accctcctgc 3525841 acatttcact gatgcggcca acaccacaac gcttgtcggc gcttgtcggc gcttgtcgac 3525901 gcttgtcgac tcggggcaag ctcaaccgtc cgcacccagg cagttgttac cagatcaaca 3525961 ccccgaccgg ataaccgtca tggatgatgg gagtgtgtca gatatcaaac ggggccgcgc 3526021 cgcgcgcaat gcgaagctgg ccagcatccc ggtcggcttc gccggtcggg cggcgctcgg 3526081 gctcggcaag cgactgaccg gtaagtcaaa agacgaggtt accgccgagc tgatggagaa 3526141 ggccgccaat cagttgttta ccgtcctcgg cgaactcaag ggtggcgcga tgaaggtcgg 3526201 ccaggcgctg tcggtgatgg aggccgccat tcccgacgag ttcggcgaac cctaccggga 3526261 agcactgacc aagctgcaga aggacgcccc accgctgccc gccagtaagg tgcaccgggt 3526321 actcgacgga cagctgggca ccaaatggcg ggagcggttc agctcgttca acgacacccc 3526381 agtggcatct gccagcatcg gccaggtgca caaagcaatc tggtcggacg gccgagaagt 3526441 ggccgtcaag atccagtatc ccggcgccga cgaggcgctg cgcgcggacc tcaagaccat 3526501 gcagcgcatg gtcggcgtgc tcaaacagct ctcacccggc gccgacgtcc aaggggtggt 3526561 cgacgaactg gttgaacgca ccgaaatgga actcgactac cggctggagg ccgccaacca 3526621 gcgcgccttc gccaaggcgt accacgacca cccgcgcttc caggtgcctc acgtcgtggc 3526681 aagcgcaccg aaggtggtga tccaggagtg gatcgaaggt gtgccgatgg cagagatcat 3526741 ccgtcacggg accaccgagc agcgtgatct gatcggtacg ctgctcgccg agctcacctt 3526801 cgacgcacca cggcggctgg ggttgatgca cggcgacgcc caccccggta atttcatgct 3526861 gctgcccgac ggccggatgg gcatcatcga cttcggtgcc gtggcaccga tgcccggcgg 3526921 cttcccgata gagctcggga tgacgattcg actggcccga gagaagaact acgacctcct 3526981 gttgccgacg atggagaagg ccgggttgat ccagcgagga cgacaggtgt cggttcgcga 3527041 gatcgacgag atgctgcgcc aatacgtcga gcccatccag gtcgaggtct tccactacac 3527101 ccgcaagtgg ttacagaaaa tgaccgtcag tcagatcgac cgctcggttg cgcagatcag 3527161 aacggcgcgc cagatggacc tgccggccaa gctcgcgatt ccgatgcggg ttatcgcatc 3527221 ggtgggcgcg atcctatgcc agctggacgc gcatgtgccg atcaaggccc tgtcggagga 3527281 gctgatcccg ggtttcgccg agcccgacgc gatcgtcgtc tgagccggct cgcgccggcg 3527341 ggcgcaccat cgcgggctat gcaacagcat ccttgcgcgg acgtccgcgc ggacgcttgt 3527401 gactcacgat cgagccttgg tcgaatatct caccacccca aacgccccag ggttcagccc 3527461 gctgaagcgc cgcggccaag cactgccgcc tgatcgggca gctcacacac agtgtcttgg 3527521 ctacctcgag accggccggg gtatcggcga accacagatc gggatcaccg acgtggcacg 3527581 gcaaaaccgg caatctttgt ctgggggtct gtctggggac tgtcagtacc gacacgtcct 3527641 gtttcacctg cttcctggtc tggtggcggt tcttcgaaag tgatccggac cagggatgct 3527701 gcggtgggca gatgtcccga aagtttggcc acggatcctg tgacttcggg tccgtggcca 3527761 tctggcgaaa cggggctgat tacgtagcgc ttacgtagag ccccgctcca cggactcgtc 3527821 agtcgcggcg gcgacacggt tcttgctatg gggggttccc gcggttggca ccgcggcagc 3527881 cgcgccgaca ccaaatgcgt tgttgtcaat caccgcggcc gccctcctct cgtgtcgcgc 3527941 gcggttgcca gccccccaat gccatctcca ggctggcagc agaatgcgac ctggaggtta 3528001 accggtggca gcagctgacc acaaccgatt ttctgacctg cgcgtttgcc ggtacaggcc 3528061 cggttcaggt ccgaccgcga accagctgca gcacgtccga tccgtattgt tccagcttgc 3528121 gggcaccgat gccggggatc gcgatcagcg ccgcgtcgtc ggtaggtagc agctcggcga 3528181 tcgcgatcag ggtgttgtcg gtgaaaacga cataggcggg gacgttctgt tccttggcgg 3528241 tgctcagacg ccaggacttg agctgcaaca acaactcctc gtcgacgtcg gctgcacacg 3528301 tctcacaccg ccgcagcatg acggccgccg aagtgttcag ctcgttgtta cagatccggc 3528361 agcgcgctgc ggcgccccgg ttgcgtcggg atgtgcccgg caccggatcg gcgcgcgtct 3528421 gcggcgcaat gccgttgagg aaccgcgagg gcttgcggct ctggcgcccg cccggggacc 3528481 gtgatagcgc ccagctgagc gccaaatgga ctcgggcccg tgtgattccg acgtagagca 3528541 gccgacgctc ttcctctacg ggctcgctat tggggccgtg tgccagcgca tgtgagatgg 3528601 gcagcgtgcc gtcagccaat ccgaccagga acaccgcgtc ccattccagt cccttggcgg 3528661 cgtgcagtga ggccagcgtg acgccctgca ccaccggtgg gtgccgcgcc tccgcccgcc 3528721 ggcgtagctc ggcaagcagg cctggcagct gcagtgcggg acgctgcgcc agctcgtcgt 3528781 cgaccagctc ggccagcgcg gtgagcgctt cccagcgttc cctggcgcgg gtgccgaccg 3528841 gcggttgtgc cgtcagcccc agtggtgcga gcaccgcgcg aaccacgtcg gacaacgcgg 3528901 catcggtatc acgttcggac acacgctgta aggcaagcaa cgcctgcttg atttcctgac 3528961 ggttgaaaaa cccctcgcca ccgcgaacct gataggcgat acccgcctgg gtcaacgcct 3529021 cttcataaac ctctgactgc gcattgactc ggtagagaat ggctacctcg gatggcggag 3529081 tgcccgatgc gattaaccgg gcgattgacg ccgccaccgt ggcagcctcg gcgggctcgt 3529141 cggaatgctc atggaacgac gggaccggac ccggctcacg ctggccggac aaccgtagct 3529201 tgctgccggc aacacggccc cgggcggcgg cgatcacccg gttagccaat gacaccacct 3529261 gcggagttga ccggtaatca cgctccagcc gcaccaccgc ggcgtccggg aaccgccgcg 3529321 agaagtcgag taggaaacga ggcgaagccc cggtaaacga gtagatggtc tggttggcgt 3529381 cgccgacgac ggtcaggtcg tcccgatcac ccaaccaggc cgagagcacc cgctgctgca 3529441 ggggggtgac gtcctggtac tcgtccacga cgaaacaccg gtaccggtcc tggaactcct 3529501 cggccaccgc ggcgtcgttt tcaatcgcgg ccgcggtgtg cagcaacagg tcgtcgaagt 3529561 caagtaaggt gacgccgtcg ccgcgggcct tgagcgcctc gtattcggag tagacagccg 3529621 cgatttgcgc ggcgtccaac ggggggtctc ggcgtgcggc cgccactgcg gtcacatact 3529681 cctcggggcc gatcagggac gccttggccc actcgatctc gccggccagg tcacgcacat 3529741 catcggtgct ggcgtgcagc ctggtgcggc tggcggcgcg ggccaccacg gcgaacttgc 3529801 tgtccagcag ctgccagccg gtgtcagcga ttacgcgcga ccagaagtac cgcagctggc 3529861 gatacgcggc cgcgtgaaag gtcagcgcct gcacagcgcc gacgcccgaa ccggtccgtg 3529921 ccgcggcgtc gagtgcgcgc aaccggctgc gcatttcgcc cgccgcgcgc tgggtgaatg 3529981 tcacagccag cacctgcccg gcggcgacgt gaccgctcgc gaccagcgaa gcgatccggt 3530041 gagtgatggt gcgggtcttg ccggttccgg caccggccag cacgcacacc ggtccacgcg 3530101 gagccagtac ggcttcgcgc tgctggtcgt ccagcccggc aatcaatggg tcgctggcta 3530161 tcgacatgac gtccatcttg gcagcggtag atgacagacc gggcgtgtcg ccacgccgtg 3530221 gggcgtgcga catgaacaac tgccgagccg ccacaccgcc cgggtcgtcg ccgcgctagg 3530281 ttagcgtgtc atgatcaccg ctgcgctcac catctatacg acatcatggt gtggctattg 3530341 ccttcgactc aaaacagcgc tcacggccaa ccgaatcgct tacgacgagg tcgacatcga 3530401 acacaaccgt gcggccgcgg agttcgtcgg ctcggtcaat ggcggcaaca gaactgttcc 3530461 cacggtgaag ttcgccgacg ggtcgacgct gactaacccg agcgcggacg aggtcaaagc 3530521 gaagctggta aagatcgcgg gttaacgacg tggactttca ttcgcacgct gcccacgatt 3530581 cgatgatcac gcgggcgatc gagatcgacc cgggcagtag cagtttcgac tccgacgcac 3530641 tggaccaatc gccggcggca agcgctgcgc gcacctcatc gcgggtgaac cacgcggctt 3530701 cggcgatttc gccgtcgctg aacgagaact cctcatccgg gtcacccaag gcatgaaagc 3530761 caaccattaa cgaccgcggg aacggccacg gctggctgcc cagatagcgc acatcgcgaa 3530821 cggtcaggcc gatttcctcg cggatctccc gggcgacgca gacttcgaac gactctccgg 3530881 cctcgacaaa gccagccaac agcgagaaca tccgttccgg ccacgccgcc tggcgagcca 3530941 acacggcacg atcagcgccg tcgtgaacca ggcagatcac cgccgggtcg atacggggga 3531001 actcctcatg accggtgatc gggttgaccc gtgaccagcc ggccctggcc ggtttcgtcg 3531061 gcgcgccgtc tagggcgctg aatcgtgcgt tgtcatgcca gttcaacagc gccgatgccg 3531121 acgacaccag ttggctgctg gtgtcgtcca tgattcggcc gagcccacga aggtccaccg 3531181 cctcggctgg tatgtcggga tcagcgatcg gctgcagcgc tgcccgcacc gcccagacgt 3531241 ggcggccgcc ctcgacgcga cccaggaata ccgcctctgg cggtggcttg tcggccagct 3531301 cgatggccgc gccaagcaac acccggccgt tggcgaccag cacgcgattg cgggaatcca 3531361 cccgcagcaa tgccgcgcct ggccatcccg cggcggccgc ctccatgtcg gtcctcagcc 3531421 ggtcggcccg gtcggcgccg acgcgcgaaa gcaacggaac gcttctcagc tgaaaatcca 3531481 cgccgcttac gttcgtcact ggcgccccac ctggtggcga cccgccgcgc ccggctccgc 3531541 cgcgcttgcg atcgccacta gcgccccacc tggcgaatat agagcagccg gtcgctggcc 3531601 tcgatggcgt ccacctcggg cgccccaatg cgcagcagct ggccgtcacg taccacgccg 3531661 agcacgatgt cgcgcaggtg ccgcggagac ccgcccacct cggcctgctc cacctcacgt 3531721 tcggcaacgg ccaggccggc ttccggggtc agcagatcct cgatcatctc cacgacgctg 3531781 ggcgtcgtgg tagcgatgcc gagcagccgc ccggcggtct cggaggagac caccaccgtg 3531841 tccgcacccg actgccgcaa caagtgctgg ttttcggcct cccggatgga cgccacgatc 3531901 ttggctttgg gcgcaatctc gcgcgccgtc aacgtgacga gcacagcggt gtcgtcgcga 3531961 ctggtggcga cgatgatcga agacgcatgc tgagtgccgg ccaacctcag cacgtcggac 3532021 ttggtggcat caccatgcac ggtgaccaga ccggctgccg cggcacgttc gaggacaccc 3532081 gaatcggtgt cgacgaccac aatttcaccc ggaactaact cgtcactgac catcgcggcc 3532141 accgccgttt tgcccttggt gccgtagccg atgacgacgg tatggttgcg cactctgctc 3532201 ctccaacgct ggatcttgta cgcctgacgg gatgtttccg tgaggacttc gagagtcgtg 3532261 ccgaccaaca agatcaagaa cgcaatccgc agcggtgtga tgacgaagat gttgatcgct 3532321 cgcgcgaatt cggaaatggg cgtgatgtcg ccgtagccgg tcgtcgacag cgtcaccgca 3532381 gcgtagtaga ggcaatccag aaacgtcagc cgatcgccct gggcgtcgag gtagccgtcg 3532441 cggtcgacgt agacgatccc ggcggtgagc agcaacgcca ccacagcgac gaccacccgg 3532501 cgtgaaataa cgcgagctgg actggcccgc ctttggggaa tgcgcagcac gccgacaagc 3532561 gcgtaaccag gctgcgcggt cagcttctcg tcgagccccc gcaaccgccg ccagctaccg 3532621 gccaccgaaa tccgtcaccg gttagcccca atgcacgcca aacgcacgac acaaatggta 3532681 accacgtcag gtgtccgacc gccgaccggc gcagtcggtc agtagcatgg ccaactcgcc 3532741 gggagcgggt aactcgtcgg ggacgaccgt gatgccgctg cgcacgtaat agaaggcggt 3532801 acgcaccgag gatgtcggac atccccgcaa tgcggcccag gccagtcgat agacagcgag 3532861 ctggacagcg gcctgccgca tggctgccgg cccgtgcggc ggcttgccgg tcttccagtc 3532921 caccacggtg gcaccgccgt cggggtcgac gaacaccgcg tcgatgcggc cgcgcaccac 3532981 ggtatcgccg atcggcattt cgaacggcac ttcgaccgcc gccggggtgc gagccgccca 3533041 cgatgatgcg gtgaacgccc tctgcaacgc ggccaactcc tcaggatcgc ccacctcgcg 3533101 gtccgctgca cctggcaggt cacccaggtc aaacagcagt tcagcaccgt aaaattgctg 3533161 aacccaggcg tgaaatgcat cgcccaacca cgcgtgcggg tccgggcgtt ttggcagccg 3533221 acacatcagc cgctgccgcg caccgaccgg gtcgccgacc agctccacca aactgctgac 3533281 cgacaaatgg ttcggcagac cacgggcagg tgctccccgc gccgcgtgcg cacgttcagc 3533341 caacagtgca tcgacgtcag tggaccaggg ggcatcgccc gggcgcgggg gatgatcgat 3533401 gtcggtggtg cttccgggca agtcggccga catggccgcc gccaccagcg ccgcgccccg 3533461 ctccacatcg ccgcgacgtg cggccaacgg atcagcgggc caaaccgcct cgatagcgtt 3533521 gtcacacaat gggtttcgct catcgccggc gggcgccgac gcccactgct cgacgactcc 3533581 gcaaggatca ccggcagcgg ccgaacggtc aatgatgtcc ttgagttcgc acaggaattc 3533641 cgatggcccg cgcggctttg tcccggtggg cccccaatgg tggccggaca ccagcagagt 3533701 gtcctcagcc cgggtaacgg ccacgtacaa cagtcgacgc tcctcgtcaa cgcgccgccg 3533761 atcgagcagg cgacgatgtt cggagatctt gtccgacaac tgttttcggt cagcgacagc 3533821 tgacgtgtcc agtacgggga tgccgtgcgc gccggccgag gcgcgatccc cacgcagcag 3533881 cggcggtagt tcggcggggt cggtaagcca gctgctgcgc gacaccgtcg acggaaacac 3533941 tccgcgcgac aggtgtgcca ccgccaccac ctgccattcc aagcccttgg cggcgtgcac 3534001 ggtcagcacc tggacccggt cgcaggcgac ggtcaactcg gcaggcggca aaccgttctc 3534061 gaccacctcg gcgacgtcca aataagccag caggcccgca accgacgcct cgctggacct 3534121 agcgctggcc cgttcggcgt accccgcgac cacgtcggcg aacgcatcaa ggtgctcggg 3534181 tccggcccag ccacctgaga ccggggccga ggcccgcacc tcgcaatcga cgccaagcac 3534241 gcggcgcacc tcggctacta ggtcgggcag ggaatgaccg aggcgaccgc gcagcgcgct 3534301 cagttcaccg gccaaggcgc cgatgcgccc atatcccgcc accgaatacc cctcggcgga 3534361 acctggatcg ctgatggcgt cggccagaca cggattgtcg gcgtccgcgc tggccgccat 3534421 cgcgatcgat tcgggcgacg ccgttgacgg tgattcgcca ctcagcgtca gcgcacgccg 3534481 ccacagcgcg gcgaggtccc gggcgccgag ccgccaccgt gggccagtca gcacccgcat 3534541 cgcggccgcc ccggccgttg ggtcggcaac caggcgcagc atggccacca cctcggcgac 3534601 ctcggggatg gacagtaggc cggccagccc gacaacttca gccgggattc cgcgggcccg 3534661 cagggtatca gcgatagcgg cggcgtcggc gttgcggcgt accagcaccg ccgcggtggg 3534721 cggcttgaca ccgtccgctt ctgcccgctg gtaacgcatc cgcaagtggt cggcgatcca 3534781 ttcgcgttcg gcctgcacgt cgggaagcaa cgcgcagcgg acggctccag gcggggcatc 3534841 cggacgcggc cgcaacgcgc gcaccgcaac cgagcgccgc cgcgcctccg ccgatatgcc 3534901 attggccacg cgcagcgctt gcggcgggtt gcgccagctg gtcagcagct ccagcaccgg 3534961 cgcgggggtg ccgtccgata aggggaagtc ggtggtgaac cggggcaggt tcgtcgccga 3535021 agcgccgcgc cacccgtaga tcgactgaat cgggtcaccg acagccgtca gcgccaaccc 3535081 gtcatcaacg ccgccgccaa acagcgacga caacacaacg cgctgcgcgt gccccgtgtc 3535141 ctggtattcg tccagtaaca ccacccggta gcgcctccgc agatcctggc caacttgggg 3535201 agaggtcgcc gccaaccgtg cggccgaggc catctgcatg gcgaaatcca tcactttgcc 3535261 ggcgtgcatc cgctcaccca acgcgtcaag caacggcacc aactccgcgc gctgggtctg 3535321 ggtggccagc atccgcagca gccactggct ggggccgcgg tcacgctgat agcggcccgc 3535381 cggcagagcg tggaccagcc gttccagctc gacgtgggtg tcgcgaagcg cgcgggtgtc 3535441 gaccagatgc tcgccaagct ggccccataa ccgcaccacg atcgaggtga ccgccgccgg 3535501 gctcttgtcg gtgcacagca cgccgtcgta cccgctgacc acatcgaatg ccagctgcca 3535561 cagctcggtc tcgctcagca acctggtatc gggttccagc ggtagcagca ggccgtagtc 3535621 gcgtagtagc gagccggcaa aggcgtggta ggtgctgact accggagcgc aggccgccgg 3535681 gtcgccgcag ccgaggccga taccggccaa cctggccaga cgggaccgaa cgcggcgcaa 3535741 cagctggccc gcggccttgc gggtgaacgt caatcccagc acctggccgg gttccgcgta 3535801 gccgttggca accagccaca ccacccgggc agccatcgtt tcggtttttc cggcgccggc 3535861 tcccgcgatg acgaccagcg ggccgggagg tgcggcgatt accgcggcct gctcagcggt 3535921 gggcgggaaa agtcctagcg cgcaggctag ttcagctgga ctgtagcgtg ccggtgccgc 3535981 ggtttgggtc atggcgccga ccctcggacg tgggccggac agcccggccg cagcgggcag 3536041 tgggtgcacc cgtcgttgcg ccgagcgatg aactggggac cggctgtcgc cgcggccagc 3536101 tgccggacga ggttgcgcca ttcgtcgcgc gcggccggtg tgagtggatc ctgtttgcgt 3536161 tcggcgacgc cagcggcccc gcttttgccg acatagacca gccgggcacc gccgggctcg 3536221 tccccggcgc gcaccaagcc ttcggccacc gccagctgat acatcgccag ctgggcgtgc 3536281 tgctgggcat cgtccttgct gaccggtgtc ttgccggttt tgatgtcgac gatcaccagg 3536341 cggccggccg ggtcgcgttc cagccgatcc gcccggccac gcaaccgaat ttttctggct 3536401 tgaccgctac cgtcctcgag ggccccatcg atgtcgacct ccacgccaac ttcggtcagc 3536461 tcggatcgac tctgagctcg ccactgtacg aacgcctgga tcatcgcgcg gtgccgggca 3536521 agctcgttgg ccgaatacca ctgagcgccg aacggcagat ggccccacac ccggtccagt 3536581 tcagccagca gttgggattc gctcctgccc ggctcggcaa acagtgcgtg caacaccgat 3536641 ccgacggcag acggcagctc gcgggtgttt gttccgccgt gccgctcggc cagccagcgc 3536701 agtgggcagt cgttgagtgc ctgcaaagtc gacggcgtca acgtgacgag atcgtcgcta 3536761 tcgcacaacg gatcactcgt gctgaccggg gccaggccat gccactcgga cgggtcggca 3536821 cctggcacac cggctttggc caaccgggcc aattgcgttg ccgcacaatc gcgatcggcg 3536881 tcatctaccg cgcaggcagg cgcgcacacc acagcgcgta accggcctac caccgccgca 3536941 gccgacaaca cgcgcggcgc cgagaccggc tgcatcgcga cgggttcgcc atcgccgtcg 3537001 gcccactggg caatctcgaa aaagaacgcc gatggcagca ccgcctcgtg cccgcccccg 3537061 cccgcgtcgc tatctacggc ggtcaccagc aaccgccgcc gggcccgccc catcgcggtc 3537121 accagcagcc ggcgctcctc ggccagcaac ggcgcgcgca tcgaggcatc cttcgtgaca 3537181 ccgtcgagtt cgtccagcag ccgctgggtg ccaagcacac cgccacgtgg aaccgtgttg 3537241 ggccacaagc cgtcctgtag gccggcgata actaccagat cccattcgtg tcccagcgcg 3537301 gcatgtgcgc taaggaccat gacctgctct gtcggggctg ccggttcggg tcgcacaacc 3537361 ggcagctgca gcgcggtgac gtgctcgacg agtccgcgca gggacgcacc cgaggtgcgg 3537421 gacacgtaat ggtcggtgat gtcgaacaag gcggtcaccg tttccaggtc ccgggtggcc 3537481 tggacagccg ccgcaccacc atgctcgctg gccgccagcc agcggcgttg cagacccgac 3537541 cgttgccagg cagcccatag cgtgtggcgc ggatcctggc cacccagact tcctgagcgg 3537601 tggcagcgcg cggccgcggt cagcacggca cgcacgcgcc gcagtgcccg cgaccctggc 3537661 cccgatggcg gcgcgtcgcc gccgagcact tccaccagca ggtcgccgaa cttcctcgaa 3537721 gtctggccgg gacgtgcgcg ttgcagagtc cggcgcagct ggcgaagtga taccgggtcc 3537781 acaccaccaa tcggcccggt gagcaggagc agcgcctggt cgccgtcgag cccgtcagcc 3537841 gtcgcctcga gcaccgtgag cagcgcccgt accgccggct ccgcggacaa cggcccgcca 3537901 actgcaggtg gggccaccgg caccccggcg gcggccagag cgcgcggcaa ccgcacagcg 3537961 cgcggcaccg acctgacgat caccgccatc tgcgaccaag gcaccccatc gatcaggtgc 3538021 gcgcgtcgca gcgcgtcggc aatcatcgct gcctcagcgt gcgccgaacc ggccaggcgc 3538081 accgtcaccg atccgacctc ggtcccggtg ccctcgattc gccgaccgac gcttcgaccc 3538141 ggtagccgtc gtgcgatgcc ggtgacggcc cgcgccacgg cgggtgcaca ccgatgagag 3538201 accgtcaacg tcaccgacgg aatgggggca ccacctgctg gcggcggatc gtcggccagc 3538261 aggccggtgg gctcgccgcc gcggaacccg aacaccgctt ggttcggatc accggcgatc 3538321 agggccagct cggtgcccgc cgccagcatc cggaccaggc gtgccgcctg cggatcaagt 3538381 tgttgggcgt cgtcgaccaa aagggtccgg acccgggcgc gttcggcggc cagtaactca 3538441 ggatcgaccg cgaaggcctc caaagctgcc cccaccagtt cggcggcact cagcgccggc 3538501 gccgtggcct gcggcgccgc cagccccacc gcaccccgca acaacatcac ctgctcgtac 3538561 cgctgggcga attgaccggc ggcgatccat tccggacggc cgcggcgacg gcccagttgc 3538621 tgcaactcca gcgggtccag gccgcgttcg gcgcaacgtg ccaacaggtt tcgcagctcg 3538681 gtggcgaagc cggcggtagt cagcgcgggc cgcagatgcg caggccaggt ggtggtggcg 3538741 gccggtccgt cttcggcgtc cccggccagc agttcccgaa tgatggcgtc ctgctcggcg 3538801 ctggtaagca gccgcggcaa ggcgtcaccg gcgcgctgtg cggccttgcg caagaccgca 3538861 taggcgtagc tgtgcacggt gcgtaccacc ggttcgcgga tcgccgcccg gcaagggccg 3538921 ttggtgcgcg accgcagcag cgccgtcgtc agcgcactgc gggcccgcat gcccattcgg 3538981 ccggaaccgg tcagcagcag aaccgactcc gggtcggtgc cggcgccgat gtgagcgacc 3539041 gcggcctcaa ccaacagtgt gctcttaccg gtgcccgggc cgcccagcac aagcaccgga 3539101 ccgcgcaaac ccggcgcgag ggccgcaccc gcctcgacac cccagatatg tgacatagcc 3539161 gcatgacatc acgagggtct gacaagctcg gatactggag ctggcaagaa aaccgaaaac 3539221 gcgatgtgag gggtggctac catggcggcg gtcgtaggcg gcggtccaca ggacgaaata 3539281 cccgaagccg atgcggtgga gcaagggcgt gctgtcgatt tcgacgacga agccgggttg 3539341 gacaccgcct acctcagcgg cggcgccggc gaccgagacg ccagcgaagc cgacgtcgtc 3539401 gaccaagcct tcgtcgttcc ggtcgccgac gacgaagaaa tcgaccggta gcaggcgtcg 3539461 ccgggctggc atcatcgacg cgtgatcatc gaccttcacg tacagcgcta cggcccgtca 3539521 gggcccgcgc gggtgctgac catccacgga gtgaccgagc acgggcgcat ctggcaccgg 3539581 ttagcccatc acttgcccga aatccccatc gccgcacccg atctgctggg ccacggtagg 3539641 tcaccatggg ccgcgccgtg gaccatcgac gccaacgtgt ccgccctggc agcactcctc 3539701 gacaatcagg gcgacggtcc ggtagtggtg gtcggacact ccttcggcgg cgctgtcgct 3539761 atgcacctgg ccgcggcccg cccagaccag gtcgcggcgc tggtgttgct cgacccggcg 3539821 gtcgctctgg acgggtcccg ggtacgcgag gtggtcgacg ccatgctggc ctctcccgac 3539881 tacctggacc ccgccgaggc ccgggccgag aaggcgaccg gtgcctgggc ggacgtggac 3539941 cccccagtgc tcgacgccga actcgacgag cacctcgtcg cattgcccaa cggtcggtac 3540001 ggttggcgta tcagcctgcc ggcgatggtg tgctactgga gcgaactggc ccgcgacatc 3540061 gtgctgccgc cggtgggaac ggcaaccacg ctggttcggg cggtccgtgc gtcaccggcg 3540121 tacgtcagcg accagctgct cgcggccctg gacaaacggc taggagccga ttttgagcta 3540181 ctagacttcg actgcgggca catggtgccc caagccaagc ccactgaggt cgcggcggtg 3540241 atccgcagtc gactgggacc gcgctagcca tggcgccggt gaccgacgaa caggtggagc 3540301 tggtgcgctc actggtcgcg gccatcccac tcggccgggt gtccacctac ggcgacatcg 3540361 cagctctcgc agggctttcc agtccgcgta ttgtcggctg gattatgcgg accgattcct 3540421 cggatctgcc ctggcaccgg gtgatcagag cctccgggcg cccagcacag cacctggcca 3540481 cccggcagtt ggagttgttg cgcgcagagg gcgttctcag tgttgacggc cgggtggcgc 3540541 tgagcgagat ccgctatgag tttccgccgg gctgagtagg tttagagcac tagccgcact 3540601 agggccgcgg tgtgggccag gccgggaaac gcttcggcgg tggatcgtgg gtgcagcgcg 3540661 tacactgcta ggcggaacat caacgcgcgc aacaacatct ggggccactc cggcagcgcg 3540721 ttccaccgct cgatgagccc gtcgtcggcc gcaccccagg acagcgcgtc gacgacggcc 3540781 accccggccg cccaggatgc gggccgccag tagggcgtga tgtcggtgat ccctggaggg 3540841 gcggtgcccg cgaaaagcac tgtaccgtaa agatctccgt gcaccagctg gttcgggctc 3540901 ttggtcggct tacgcaaccc ggcaagctga ttgatcagat cgatcgatcg ctgggggtcc 3540961 gctgccgggg gggcggtcgg cacgcccggt gggaccgact gtaatggccg ctcctcccac 3541021 ccagctcggt ctgcggcgac gaacacatcg atctcggccc agggcgccgc gggtccctgg 3541081 gtcaagaatc gggggcgttc cagttttccg gtggcctcat gcagccgcac cgccgccgag 3541141 acgacctcat catgcctagg ctccggcgcg ccggcgacga acgtgtctgc ccgccaacca 3541201 gacaccacgt accggccgtc ggtcgatcgg acgggccgag ccaggcgtac gccgtcgacg 3541261 aacaacgtct cgcgcacccg ggccgaccag gccgcgcggg cgttgtcggc caccatcgac 3541321 aacaccacct cgccgcatcg ccagccacct tcccaaccgg cacccaacag gatgggttgc 3541381 gcacctgcca aaccgaacgc caccaacacg tgctcgggcg gcggctcgac attcacaccg 3541441 gtcagcctag tagagcccat cggggtgtat tgggcctgta tcggtcctag tacatcacca 3541501 tgtcgggctg catctgcttg gcccacgcga cgatcccacc ctgcaggtgt accgcgtcgg 3541561 agaaaccggc tttcttgacc gcagccaatg cctcggccga gcgcacgccc gtcttgcagt 3541621 acagcacggc ggtgcggtcc tgggggagct tggccagacc ctcacccgag ttgatcaacg 3541681 atttcggaat cagttgggct ccgtcgatat gcacgatgtc ccactccacg ggatcgcgaa 3541741 cgtcgatcag tgccagctta cggccggagt ccagccagtc gcgcagctcg cgcggcgtga 3541801 tggtggaacc tttggccgcc tgggcggcat cgtcagcaac cacgccgcag aactgttcgt 3541861 agtcgaccag ctcggtgatc ttcggtgtcg atgggtcctt gcggatggtg atcgtgcgat 3541921 agctcatctc cagcgcgtcg tacaccagca accggccaag cagtgtttca cctatcccgg 3541981 tgatcagctt gatcgcctca gtgcccatca ccgatgcgac cgaggcacag ataatgccca 3542041 gcaccccgcc ttcagcacag gacggcacca tgcccggcgg cggctcggga tacaggtcgc 3542101 ggtagttgac acccaacccg tcgggggcgt cctcccaaaa caccgatgcc tggccctcga 3542161 agcggtaaat cgacccccac acgtacggct tgccagccag caccgcggcg tcgttgacca 3542221 gataccgggt ggcgaagttg tcggtgccat ccaagatcag gtcgtactgc ttgaacaggt 3542281 cgacggcgtt gctcggcgca agccgcagct cgtgtagtcg cacccggatc agcgggttga 3542341 tcgcgacaat cgaatcgcgc gccgactgag ccttggagcg cccgacgtca gctaccccat 3542401 ggatgacctg gcgctgcagg ttcgactcgt caaccacatc gaagtcgacg atgccgatgg 3542461 tgccgacgcc ggcggcggcc agatacaata acgtgggcgc tccgagcccg ccggcgccga 3542521 tcaccagtac tcgcgcgttc ttgagcctct tctgcccgtc aacacccagg tcaggaatga 3542581 tgagatggcg gctgtagcga gctacctctt cacggctgag cgcggatgct ggctcaacta 3542641 gtggcggcaa ggatgtcgac accgaatatc tcctcggtta tatccgaaac gtctgctgcg 3542701 cgtcgtcctg caaatacctc aacgcccagc ttgccacctt tgcttccccg ggttagggaa 3542761 tcgggtaggg ccagggattg aatcggcagg tctttccatc cgccttaacg aagtcggggt 3542821 caaacttggc cgcgtcgtca ttggaggtgg aaaacgtctg ctgcatcatt accggagcca 3542881 gaccgccttg ttggtcgcac ggctcgtggc gcaggtaacc gatggcatga ccgacctcgt 3542941 ggttgatcac atattgccga taggaaccta cgtcaccttc gaatggaacg gctccgcgta 3543001 cccagcgcgc ctcgttgatg aacacccgcg attggcgatc catgccgccg aacgacgggt 3543061 tgtagcagga cgtctcgagc cggaattcgt agccacaccc cccgcgcact gtcgtcggcg 3543121 acaccagcga aatccggaag ttgggttttc cgctgtcgat ccgcacgaac gcgaattgcg 3543181 gattgtgggt ccagcccttg ggattggtca acgtctggtc gaccatctgg gcgaatgcgt 3543241 tgtcaccgcc gtacattgtg ggatcaagac cgttctcgat ctcgacggta tacctgaaca 3543301 ctttgacggt gccttgaccg acctggggag tagtgcccgg aacgacacgc caggtcttgt 3543361 caccagcctc ggtgaacggg ccgccatccg gcagcgtccc ggccggcaga ttggcatcga 3543421 acactgcaag accgcgaggc ggtgcgtcga ggatcgcggt ccccaccaca ccaatggccg 3543481 gcgagtcccg gacggtctgg gccgccgcgg gccttggcgt gctcgtcccg gtcaccgtct 3543541 ggtacaccac caccgtggtc agcaccatca gaaccggcag ggcgtaggcg cgccagccgt 3543601 acgtggacac gaaccgcccc aaccaggttt gtttgcgcca ttgacgcttc cggtcgcggc 3543661 gggcccggac ccgtctgtca gtcgcggcga gcgggtcgcg cagggcccgc agcggctcac 3543721 gccactcgtc acgcagcacg ggtactcgac tcgtgcttcc ggcgggccac ggagacgtca 3543781 tttcctcagg atgacacagc tggcccgggt cgcgaccctg gcgcgcccga atgcaacacc 3543841 caacaaacta tcccgccgct accgatgccg caggtagtaa tgtcattccg acagacgcgc 3543901 ggcggtgggg gttggcacag tggccctcga attagtgtga tcagattgag gactgatgag 3543961 cgatctcgcc aagacagcgc agcgacgtgc cctcagatcg tccggcagcg ctcggccaga 3544021 cgaagacgtt ccggccccga accggcgcgg caaccgactg cctcgcgacg agcgccgcgg 3544081 ccaattgctt gtcgttgcca gtgacgtctt cgtcgatcgg ggttaccacg cggccggtat 3544141 ggacgagatc gcggatcggg cgggagtcag taaacccgtt ctgtatcaac atttttcgag 3544201 caagttagaa ctttacctgg ctgtgcttca tcggcacgtg gaaaacctgg tgtccggcgt 3544261 gcatcaggcg ctgagcacga ctaccgacaa ccggcagcgg ttgcacgtgg ccgtccaggc 3544321 gttcttcgac ttcatcgagc acgacagcca gggttaccgg ctgatcttcg agaacgactt 3544381 cgtcaccgag cccgaggtcg ccgcacaggt gcgggtggcc accgaatcgt gcatcgacgc 3544441 agtgttcgcg ctgatcagcg ccgattccgg actggacccg caccgcgccc ggatgatcgc 3544501 ggtgggcttg gtcggaatga gcgtcgactg cgccagatac tggctggacg ccgacaagcc 3544561 gatttccaag tccgacgccg tcgagggcac cgtgcagttc gcctggggcg ggttgtccca 3544621 cgtcccgctt acccgctcgt agcaaccttt ccggcggacc cagctgcggc gtccaccccg 3544681 acgccgaagc ccacccggcg ggcgtctgcg acaccgatct cgacataggc gatcctggcg 3544741 gtgtgaatta ggaagcgacg gccccgctcg tcggtcaggg tcagcaaacc agagtcgtcg 3544801 cgcagcgcgt tgctgacgag ttcttctacc tcactgggcg tctgcgcact ggagaacacc 3544861 agctcgcgcg gactgtccgt gataccgatc ttgacctcca cggtggcccc ttccattggc 3544921 attccgtcac aggcgtgtca ccagcaggct agtagacgcc cctggccccc ataacggtta 3544981 ggtctaggcc agcccgacac gccgccagac accccatccg ccggcagggg ctcgataaca 3545041 tcagcaccat cggtaacaca gttaacgacc tctacgagtg cgttcggaac gtccgggaag 3545101 tccaggacta cccggacgac gagagctcga gcggcttcgg ggctggccgg acctgttcgg 3545161 aaggcgagtt tgcctgggcg gctgaccgcc gatggcgccc ggtagctgcg atcctcggca 3545221 gtgtggtggc gcttggcgcg gtcgcgaccg cagtcattat caacagcgga gatagcacgt 3545281 cgaccaaggc cattgtcggg gcaccagccc cgcgcacggt gatatccacc tcgccacgac 3545341 caacggcccc gaccagcacg tcaccccacc cttcgcccag caccttgcgg ccgcagctcc 3545401 cgccggagac ggtcaccacg gtggcaccgc cgggcaccgg gcctactacc gtgccgacgc 3545461 gaacccccac cgccgcgcca cctcagactg ctgtgccacc gccggcgccg ctgaatccgc 3545521 gcaccgtcgt ctaccgcgtg accggcacca agcagctgtt cgacctggtg aacgtcgtct 3545581 acaccgatgc gcggggcttc ccggtgaccg acttcaacgt gtcgctgccg tggacgaaga 3545641 tggtcgttct gaaccccggc gtgcaaaccg aatcggtcgt cgcgaccagc ctttacagtc 3545701 gtctcaactg ctcgatcgtc aataccggcg ctcagacggt ggtggcgtca accaacaatg 3545761 cgatcatcgc gacatgcact cgctagatct gggatctagc tgagacccag ttcccgcatg 3545821 cgttggtcgt gggtctgctg caaccggtcg aagaaggcac caagctggct cagtccacca 3545881 ctaccggaca ccaccaggtc gaccagctcg tcgtggtcgg ccaacaccag ctgggcctgc 3545941 gttatcgcct cgccgagcag acgacgcgac cacagcgcca gtcggctgcg ctgtttgccg 3546001 ctggccgtca ccgctgcgcg cacttcggcg acgacgaact gagagtgccc ggtctccgac 3546061 aacgccgccc gcaccacgtc agcaacctcg tcaggcagcc cgtcggcgat ctccagatac 3546121 aaatcggcgg ccaacgcatc ggcaacatag gtcttcacca gggcttccag ccatgtgctc 3546181 ggcgtcgtca gccggtggta gttttctaac gctgaggtgt acttcgacat cgccgacacc 3546241 acgtcgacgc cgcgacgttc caacgcattg cgcagcagct cgtagtgccc catctcggcg 3546301 gcggccatgg atgccatcga gatccttccc cgcagatccg gggccatgcg cgcctcatcg 3546361 gtcaatcggt agaaggcggc aacttcgccg taggccagca acgcgaacaa ttcgttgacg 3546421 ccgggatgat ccgccggcag ccgtggcctg ggtgaatcgg ccacctgatc ggcggatgag 3546481 ggcgatggca tggcaacact ctagtaggca ggctcagcgg caaatgggaa cctgctggcc 3546541 gaccagctat catgctcgtt aggtggcggc attggttcga ctgccgctac cggcgaaatg 3546601 tgcgtgcatg gagtctgccc cgcctggact gtgctagggg ccggcgactc ggcgacgtaa 3546661 tcggagtcgg aactcatgcg cgcgtgaacc gcgacagaga aacaccgaca cacgaccgac 3546721 accgtcaccg aaaggccgct taccctcgta tgaccgcagt gaaacacaca actgaatcaa 3546781 catttgccaa acttggagcc cgcgacgaaa tagtccgcgc attaggggaa gagggcatca 3546841 aacggccctt tgctatccag gaactcaccc tgccactcgc gctcgacggc gaggacgtga 3546901 tcggccaggc ccgcaccggc atgggcaaaa cgttcgcttt tggcgtgccg ctgctgcagc 3546961 gcatcacctc cggcgacggc acgagaccgc tcactggcgc tccgcgggcc ctggtcgtag 3547021 tccccacccg cgagctgtgt ctacaggtca ccgatgacct ggccacggcg ggcaagtacc 3547081 tgaccgccgg ccccgacaca gacgacgctg ccgcggtacg gcgccggctg tcggtggtgt 3547141 ccatctacgg gggacggccc tacgagccgc agatcgaggc gctacgcgcc ggcgccgacg 3547201 tcgtggtcgg caccccgggt cggctgctcg acctgtgcca gcagggccac ctgcagctgg 3547261 gcgggctatc cgtgttggtg ctcgacgagg ccgacgagat gctcgacctg ggcttcctgc 3547321 ccgatatcga gcgaatcctg cggcaaattc ccgccgaccg acagtcgatg ttgttttcgg 3547381 cgaccatgcc ggacccgatc atcacgctgg cccgaacgtt catggtccgg cccacgcata 3547441 tccgggctga ggcaccacat tcctcagcgg ttcacgacgc gaccgagcag ttcgtctacc 3547501 gcgcccatgc gttggacaaa gtggagttag tcagccgggt gctgcaggct cgtgaccgcg 3547561 gcgcgacgat gatcttcacc cgcaccaagc ggaccgccca gaaggtcgcc gacgagttga 3547621 ccgagcgcgg tttcgcagtc ggcgccgtgc acggtgatct cggacagctg gcacgcgaga 3547681 aggcgctcaa ggcgtttcgc actggcggca tcgacgtatt ggtggccacc gacgtggccg 3547741 cccgcggcat cgacatcgac gacgttaccc acgtgatcaa ctatcagtgc cccgaagacg 3547801 agaagatgta cgtccaccgc atcggtcgca ccggccgtgc cggccgaacc ggggtcgcgg 3547861 tcaccctggt ggactgggac gagctgcccc gttggagcat gatcgaccaa gcactgggcc 3547921 tgggctcccc cgatccggcc gagacatact ccaactcgcc gcatctgtat gccgagctgg 3547981 ccatcccggc cacggccggc ggtaccgtcg gcccggcgcg caaatcgcag ggcaggcgac 3548041 gtgacaccga ctgcgacggc cagaaaacgg cacagcacgc ccgcaatacc cccaggcgtc 3548101 ggcgcacccg cggcggcaaa cccgtcaccg gacaccccgg caccaaccca atcagcagcc 3548161 caatcgtggg cggcgacgcc acctcggagc cgggctccgg caccgcatca gattccgggt 3548221 ccgatgttgt gtccggctcc cggtccggca acggcgaagc tgcgcgacgc cgtcgtcgcc 3548281 gccgccgacg cccgacgcac gcccaggacg gcttcgccgc gcgggctaac tgacccgccc 3548341 accgcatggt taaaccggag cgccgcacca agaccgatat cgcggccgcc gcgacgatcg 3548401 cggtcgtggt ggccgtggcc gcgtcgttga tctggtggac cagcgacgcc cgcgccacca 3548461 tcagccggcc ggcggcggtt gcggtgccca ccccggcccc ggctcgcgag gtcccgacct 3548521 cgctgaagca gctgtggacc gccgccagcc cagccacccg cgttcccgtg gtggtgggcg 3548581 gaacagtggc tactggcgac ggacgccagg tggacgggcg cgacccagcc accggtgagt 3548641 cgctctggag ttacgcccga gacaccgatc tgtgtggggt gacctgggtc taccactacg 3548701 ccgtcgcggt ctatcggtac gaccggggtt gcggtcaggt cagcaccatc gatggatcca 3548761 ccggtcgccg gggagccgcc cgcagcggct acgcggatcc gcgggtgcgt cttttttccg 3548821 acggcaccac ggtgttgtcg gccggggaca cgcgcctgga actgtggcgt tcagacatgg 3548881 tccggatgct ggcctacggc gagatcgatg cccgggtgaa accgtcgaac cgcggcctgc 3548941 agtccgggtg cacgctggag tcggcggcgg ccagctcggc ggccgtatcg gtgcttgaag 3549001 cgtgtacgaa ccaggctgac ctgcggcttg tgctgttacg cccgggcaag gaggacgacg 3549061 agcccatcca gcgcattgtc ccggaaccgg gggcccggcc gggttcgggc gcccgggtat 3549121 tggtggtatc gcagaacaac accgccgtgt acctgcctgc aagatcaggc gcgcaaccga 3549181 gagtcgacgt gatcgacgag accggcgcca cagtttcgag cacgctgctg gccaagccac 3549241 cgtcaacttc ggccgtggcg tcgcggaccg gcaacctggt gacctggtgg acgggcgacg 3549301 cgttgttggt cttcgacgcg ggcaacctga cccagcgcta caccattgcc gctggcgaga 3549361 cgactgcgcc ggtggggcca ggggtgatga tggcaggtca actcctggtg ccggtcaccg 3549421 gcgggatcgg tgtctatgac ccggtcagcg gtgccaacaa ccgttatatc ccggtgaccc 3549481 ggccgccaag cacgtcagca gtgatcccgg cagtttctgg atccagggtc attgagcaac 3549541 gtggcgacac actagtcgct ctgggttgat cgcctatgtt ggcgcgagca gacgcaaaat 3549601 cgcccgaaac cgatggcttt cgggcgattt tgcgtctgct cgcgctacag gtccaccgtg 3549661 aaggtgggca gcggcctacc tgtcttccag tgtttgagca gcgcctgcgc cagctcgcgg 3549721 taggccaccg cgcctttgtt cttgcgccca gccatcaccg acgagcccga ggcgctggcc 3549781 tcagcgaagc gcacagtacg ggggatgggc ggagccagca cctgtaggtc gtagcggtcg 3549841 gcgacatcga gcaacacgtc acgggtgtgg gtggttcgag agtcgtacag cgtcggcagt 3549901 gcacccaaca accgcagatt cggattggtg atctgctgga catcggcgac cgtccgcaga 3549961 aactggccga caccccggtg cgccagcatc tcgcactgca gcggcacgat ggcctcgtcg 3550021 gcggccgtca gcccgttgag ggtgagcaca cccagcgacg gcggacagtc gatgatgacc 3550081 acgtcgaacc ggtcggagaa tttggccaac gcgcgtttga gcgcgtactc acggcctgcc 3550141 cgcatcagca gcattgcctc ggcgcccgcc aagtcaatgt tggccggcag caacgtcatt 3550201 ccctccatgg tggtgaccag cacggcgttg ggctcgactt caccgagcaa cacctcgtgc 3550261 acagacaccg gtagtttgtc gggatcttga ccaagggaga aggtcagaca accttgcgga 3550321 tccagatcga cgagcagcac gcgccgtccc ttttccacca tcgccgcacc gagcgaggcg 3550381 accgtagtcg tcttggccac cccgcccttc tggttggcca ccgctagcac ccgggtatca 3550441 gtcataggcg ccgctctccc ccgcaagcgg cagggacccc cacctcatcg tgctctccct 3550501 tcgtcgtcgc ccgcgcagtc acagtgtcat cctggcatgc tgctcgcaca gtggttcggg 3550561 cgacaggcct aggatgtcgt cgggcacaat ctgtcggtat gggcgtgcgc aaccaccgat 3550621 tgctactgct ccgccacggc gagaccgctt ggtcgacgct gggccggcac accggcggta 3550681 ccgaggtcga gctgaccgat accgggcgaa cgcaggcaga gctggctggt cagctgctgg 3550741 gtgaactcga acttgacgac ccgattgtca tctgtagccc gcgtcgacgg acgttggata 3550801 ctgccaagtt ggccggcctg acggtgaatg aggtaactgg gctgctcgcc gaatgggatt 3550861 acggttccta tgagggcctt acgacgccgc agatccggga atccgaaccc gattggctgg 3550921 tgtggacgca cggctgccca gctggagaaa gcgtcgcaca ggtaaacgat cgcgctgaca 3550981 gcgccgtcgc gctggccctg gagcacatgt cctcacgcga cgtgttgttt gtcagccatg 3551041 gccacttctc ccgcgcggtg atcacgcgct gggtccagct accgctcgcc gaaggcagcc 3551101 gtttcgcgat gcccaccgcc tcgatcggga tctgcgggtt cgagcacggc gtgcgtcagc 3551161 tcgccgtgct cgggttgacc ggtcatccgc agccgatcgc agccgggtga gcgcacacgt 3551221 ggcaaccttg cacccagaac caccgttcgc actgtgcgga ccaagaggca ccctgattgc 3551281 ccgcggggtg cggacacgat actgcgacgt gcgggccgcg caagcggcac ttcgctcagg 3551341 tacagcacca atactgttgg gcgcgttgcc tttcgacgtg agcagacccg ccgcattgat 3551401 ggtgccggat ggcgtgctgc gggcccggaa gctgcctgac tggccgaccg gcccgctgcc 3551461 caaggtacgc gtcgccgccg cccttccgcc acctgccgac tacctgaccc ggatcggccg 3551521 cgcacgggat ctgctggccg ccttcgacgg cccgttgcac aaagtggtgc tcgcgcgcgc 3551581 cgtgcaactg accgccgatg ctccgctgga cgcgcgggta ctgttgcgca ggttggtcgt 3551641 cgccgacccg accgcttacg gctatctcgt cgacctcacc tctgcgggca acgacgacac 3551701 cggggcagcc ctggtcggcg ccagcccaga gcttctggtc gcacgatccg gcaatcgcgt 3551761 catgtgcaag ccatttgccg gctcagcccc acgcgccgcc gaccccaaac tcgacgccgc 3551821 caacgcggcc gcactagcca gttcggccaa gaaccgacac gaacaccaat tggtcgtcga 3551881 cacgatgcgg gtagccctag agccactatg cgaggacctg acaatcccag cccagcccca 3551941 gttgaaccgc accgcagccg tttggcatct gtgcaccgcg atcaccggcc ggctgcgcaa 3552001 catctcgacg acggcaatcg atctggcttt ggcgctacat cccaccccgg cggttggtgg 3552061 ggtcccgaca aaagctgcca ccgagctcat cgccgaactc gagggcgacc gtggcttcta 3552121 cgccggcgcg gttggttggt gcgacggccg gggcgacggc cattgggtgg tgtctatccg 3552181 gtgcgcgcaa ctttcggctg atcgacgcgc agcccttgcg cacgctggcg gtggcatcgt 3552241 cgccgaatca gaccccgatg acgaacttga agaaaccaca acgaagttcg ccacgatatt 3552301 gaccgcactg ggagttgagc agtgaccgat accatccgcc gcgctacacc ggcggatacc 3552361 gccgacatcg tggccatgat tcacgcgctg ggcggaattc gagtatgccg ccgatcaatg 3552421 cactgtcacc gaaacacaaa tacatacagc acttttcgga gatttcccga cgatgcgagg 3552481 ccacgtcgct gaggttaatg gcggagttgc cgcgatggcg ctgtggtttc tgaacttttc 3552541 cacctgggac ggcgtcgcgg gcatctatgt ggaggacttg ttcgtctggc cgaggtttcg 3552601 ccgccgcggc ttggcccgtg gcctgctgtc gacgctggcc agagaatgcg tcgacaaccg 3552661 ctacacgcgg ttggcctggt cggtgctgaa ctggaattcc gatgcaatcg cactgtatga 3552721 ccgcatcggc gggcaaccgc agcacgagtg gactatctat cgactgtcag gaccgcggtt 3552781 ggctgcgctg gccgcaccac gctgatcacg cccggcggcc cagcggatcg aaggcggact 3552841 gaacagcaat accagcacgc caagcgcgat gattcccacc gggatcccga tcgccggctg 3552901 atgcgaaccc acaatcagat accacgccac cggcagcagc agcagctggg cgaacaccgc 3552961 cagcccgcga ccccaaagct tgccaaccgc cagcctgcat ccggcggcga gcactgctcc 3553021 gccgaccagt acgaaccaac ctgcggtgcc caggccattg acgatgtgct ggtcggcgcc 3553081 cgcgagtccg cgcaccagca acgccgcggc caccaccagg gcggccccac cctgcacggc 3553141 gacgatcagt ccggcgccgc gcacggcggc cggggctcga acaggcacag catcagcgta 3553201 gtcacccggc cgtgaccggc ccgcatcgtc acaccaccca ggcccattgc cgtcctcctc 3553261 aacgggccga cccggcccgc atcgtcacac ggcctaggcc cattgccgtc ctcctcaacg 3553321 ggccgacccg gcccgcatcg tcacacggcc taagcccatt gccgtcctcc tcaacgggcc 3553381 gacccggccc gcatcgtcac acggcctaag cccattgccg tcctcctcaa cgggccgacc 3553441 cggcccgcat cgtcacacgg cctaagctcg tgcgtcatgc gtgcagtgct gatcgtcaac 3553501 cccactgcga ccgccaccac accagccggc cgcgacctgc tggcgcacgc cctcgaaagc 3553561 cgccttcagc tcacggttga gcacaccaac caccgcggtc acgggaccga actcggacag 3553621 gcggcggtag ccgacggggt ggacctggtc gtggtgcatg gcggcgatgg cacggtaagc 3553681 gccgtagtca acggcatgct ggggcgcccc ggcacgacgc cggtccgacc ggtgccagcc 3553741 gttgcggttg tgcccggcgg ctcggccaac gtactagctc gcgcgctagg gatttccgcg 3553801 gacccgatcg ctgccaccaa ccaactcatc cagctgctcg acgactacgg ccgccaccaa 3553861 cagtggcgcc gcatcgggct gatcgactgc ggtgagcggt gggcggtgtt caacgccggc 3553921 atgggcgtcg acgccgaggt cgtggccgcg gtagaggccg aacgcgacaa aggcggcaag 3553981 gttacggcgt ggcgctatat tcgcgctgcg gtgcgcgcgg tgctcgcctg cactcgtcgc 3554041 gaaccggctc ttacgctgca acttcccaac cgcgatccaa ttaccggagt gcactttgtg 3554101 ttcgtgtcca actccagtcc gtggacttac gcaaacaacc ggccggtatg gaccaatccc 3554161 gactgcaggt tcgagtcggg gctgggagtg ttcgccacca ccagcatgaa ggtggtcccg 3554221 accctgaggg tggttcggca gatgttcgca aaacagccca agttcgagtt caaccacgtc 3554281 atcaacaacg acgacgtcgc gtgtctacgc gtcacctcca tggggccccc gatcgccagc 3554341 caattcgacg gggactacct cggcgtgcgc gagacgatga cgttccgagc tgttcccgac 3554401 gccctcgccg tagttgcccc gcccgcaaga aagcgaatct gagctgcaga aacaaagatg 3554461 tgatgggtgt gcgacacaaa cgttgggcga aactggcagc gtagtgtagt acaactgggt 3554521 aagggctgtg gaacgagatc gccagagtga gatagcccac gcgcttacgt aacactattg 3554581 acatctgttg agcctgtgaa acgatcaaaa ggttgcatgt agagaaatgt aggggtacag 3554641 aagcctttct tgtgcacccg ttaccagcca agaagaaacg cctgtgcgta ccgctgcgca 3554701 catagtgagg agtaacgact aatggattgg cgccacaagg cggtctgtcg tgacgaggat 3554761 ccggaactgt tcttcccggt aggaaacagt ggtccggcac ttgcgcagat cgctgacgcg 3554821 aaactggtct gtaatcggtg cccggtcacc acagagtgcc tcagctgggc actgaatacc 3554881 ggccaggact cgggcgtctg gggaggcatg agcgaagacg agcggcgcgc gctgaagcgt 3554941 cgcaacgccc gcacgaaagc ccgtaccggg gtctgacgac tcagttctgc acagtgcggc 3555001 cccgacatac gtcggggccg cactgttgcg tagcgcgcta cagcatcaac cgtccccggc 3555061 gtccgaccgg tacccgtagc accacatcgg tgccacgttc gcgggcgtcc cgcataccta 3555121 acgagccgtc caattccgca gagaccaagg tccgcacgat ctgcaggccc aggctgtccg 3555181 acttctccag gctgaaacct tgcggcagac caagcccgtc gtcgtgcacg acgacatcga 3555241 gccaacgcgc agagcgttcc gctcgaatcg tcacggaccc ttccgccgcc gccgggtcga 3555301 acgcatgctc gatcgcgttc tgcaccagct cggtgatcac catgatcagc gccgtggcgc 3555361 ggtcggagtc gagcacaccg aggtcgccaa cccgatttat ccggatcggc ctgtccaccg 3555421 atgccacatc gttcatgatc ggcagaatcc ggtcgatgac ctcgtcaagg ttcacctgct 3555481 cgtccaccga catcgacaac gcatcgtgga ccaaggcaat cgacgacact cggcgcaccg 3555541 actcgatcag cgcttcccgc ccctcggcgt tggacgtccg gcgagcctgc agccgcaaca 3555601 gcgcggccac cgtctgcagg ttgttcttaa cccgatgatg gatttcccgg atcgtggcgt 3555661 ccttggatat cagggctcgg tcgcgccgct tcacctcggt cacgtcgcgg atcaatatcg 3555721 cggcgccgac attgcgacca gctaccacca gcggcagagt ccgcagcagc accgtggcgc 3555781 cgccggcgtc gacctccatc cgcataccct ttccatcccc ggccagcaag tcctgcacat 3555841 gctcgtctac ctcgtgcgcc tcgaacgggt ccgagatcag cgggcgcgtc gcgtcaatga 3555901 gattgacgcc ctccaactcg gtggtcaaac ccattcggtg gtaagccgat agggcattgg 3555961 ggctggcgta agagaccaca ccgtcgacat cgagacggat gaagccgtca cccgcgcgcg 3556021 ggctagatcg cgacatcgcc acgtcccctg cgtcgggaaa ggtgccctcc gccagcatcc 3556081 ggagaagatc tgtggcgcac aaccgatagg cggtctccag gtggccggat ctacgtcgcg 3556141 ccgccagttc gggttgatgc cgtgtcagca ccgccaccac ctgatcgcca aagcgcaccg 3556201 gggagacttc gacactgtgg ccgtcgtgtt gacatgaatt ctgttggccg acagcgcctt 3556261 cccgtcccgg gacaccaccg gagaaggtcg cggcgaccag cggcatgcta ttggcggcga 3556321 cgacggtgcc taccgcgtcg gtatgcacca ccgtcggccc ggtgttcggc cggcattgcg 3556381 caacgcacac caggacaccg tcgttgcggc gaacccacat caggtaatcg gcaaacgaca 3556441 agtcggcaag gagctgccac tccccgacca ccgcatgcag gtggtccacc gcgctgcccg 3556501 gcagcaccgt gtgttcggcg agcagatcac cgagtgtgga catgagtgac tatcaacgac 3556561 tagctgatca ccgcgataag gtcgccggcc tgaatgacat cgcccaccga taccgccacc 3556621 ttgctgaccg ttccggcagc ttcggccagg acggggatct ccatcttcat cgactccagc 3556681 agcaccacga cgtcgccctt gtcgatctga tcgccttcgt tgacaacgac ttcgagaacg 3556741 ctggccacga tctcggcgcg aacatcctcg gccatcatca ccccactctt ttcggccatg 3556801 ccgtatgctg actgctggtc atcggacttc catcaaactc aggtatatcg aaccataaga 3556861 accctgggga gcgcggcacg cgggctattg gggtcgcgcg cgacgccgca tgagaaactg 3556921 ggcaatgacc gggcggccgc tgcctgcccg cacctgagca atgacggagg ttccgatggc 3556981 caagcgtggc cgtaagaagc gtgaccgcaa gtacagcaag gccaaccacg gcaagcggcc 3557041 caattcctaa cgcactgcgc tagggccctc cacggatgat ggtggtccgg cggatctcta 3557101 gccgaagacg ctcccgcaag ccctcggggg ccctgtcgcc tcggcacttg gtcccgatca 3557161 acgccttgat ccgttcctcg agcccgtaat gcctcaggca ccccgggcag gcctcgaggt 3557221 gtcgccgcag cctctcgcgg gtttccgggg tgcattcacc gtcaagcagg gtccacacct 3557281 cggcgatcac ttccgcgcaa cccatgccgc cgtgggaatc gtcgtggtcc gcgtgcgcat 3557341 cggtcggacc gcaattttcg ctcactggtg caccatcctt gtgtcggtga tctcggatgg 3557401 attgccgatg tagaggcgcc gctgggttag cgccccgcgc gcttgacagc cgtgatgtcc 3557461 atcatgagtt ttgcggagtc cggcggttgc cccggacgcg ccgaccgtcg acagggccaa 3557521 gcgccgacga gcgccgaacg actcgccccg cacgccgacg cccagcccga attgctggcc 3557581 tgcttggccg gcgtcgcccg ctccaccggc tagtccgaca aagtcaccca cgtcgggttc 3557641 ggttgggcgg cagacaaaca actccgcaac ggtgtctgcg acttcgccgg cgacagccgc 3557701 cgagccaacc tctaggccgc cgacctgcac aaccgcaccc gagcccgcgg ccacgaccac 3557761 cggcacgcgg tacgcatcct cgcccgcgcc tggctttacg tcatctgaca ccgctggcaa 3557821 gacggcatcg cttacgaccc cacccaacac cgagccctgc aggctctcct tgaccaagtt 3557881 cgccaaacgg cggcttgaca ccgggctgct catgacgaca ccccctcgtg cgcctgctcg 3557941 cccctggcaa acccccgatc cctggccaca tcggctaaaa gaccgcgcaa ctgacgtcgg 3558001 ccgcgatgaa gcctcgacat cacggtgccg atcggagtat ccatgatctc ggcgatctcc 3558061 ttgtagggga aaccttcgac atcggcgtag tagaccgcca tccggaactc ttccggcaat 3558121 gcctgcagcg cctctttgat ctcggtgtcc ggcaacgctt ctaacgcttc gacttcagcc 3558181 gagcgcagcc cggtcgagga atgctcggcg ttggacgcca gttgccaatc ggtgatctgc 3558241 tcggtcggat actccgccgg ttgccgctgt ttcttgcgat agctgttgat gtaggtgttg 3558301 gtcagtatcc ggtagagcca ggccttgaga ttggtaccgt gccggaacga acgaaatccc 3558361 gcataggcct tcaccatcgt ctcctggagc aagtcctcgg cgtcggccgg attgcgcgtc 3558421 atccgcagcg caccgccgta cagctggtcc aacaggggaa tcgcgtcgcg ctcgaaacgc 3558481 gcggtcaact cctcgtctgt ctcctcagac ggcccaggct gcagacccgc cgaaccggtt 3558541 acaccatcga tgtcggccat cttgattaac tgggtccctt cgtttgcggt gtcgccggac 3558601 agcaccggcg cggacaccgg acgtgcgagc atgcgagcca accgcttctc acccaacagg 3558661 ctcgtcgccg ttgacaccag actcccctcg tcccaatgta gaggccgcga ccgacactgt 3558721 ctgcaccggt ctggccagcc acgtggctgc aggaaccgaa ccaatcaacc gtgttcgcca 3558781 gcgggttatt tccagcgctg aatcgcatgc ggcctgtccc gcagtccggt ggaatcgagc 3558841 agggcgttag ggtgacgcca tgtcactcaa cggcaagacc atgttcatct ctggcgccag 3558901 tcgcggtatc ggccttgcga tcgccaagcg ggccgcgcgc gacggcgcca acattgcctt 3558961 gatcgccaag accgccgagc cgcatccaaa gctgccaggc acggtgttca cggccgccaa 3559021 ggaactcgag gaagccggcg gccaggcact gccgatcgtc ggggatatcc gcgacccgga 3559081 tgcggtcgcg tccgcggtgg ccaccaccgt ggagcagttc gggggcatcg atatctgcgt 3559141 caacaatgcc tcggcgatca acttagggtc catcaccgag gtgccaatga agcgtttcga 3559201 cctgatgaac ggcatccagg tgcgtggcac ctacgcagta tcccaagcgt gcattcccca 3559261 tatgaaaggc cgtgagaacc cgcacatcct gacgctgtcc ccgccgatcc tgctggagaa 3559321 gaagtggctg cggccgacgg cctacatgat ggccaagtac ggcatgacgc tgtgcgcgct 3559381 gggaatcgcc gaggagatgc gcgccgacgg catcgcgtcg aacacgttgt ggccacgcac 3559441 gatggtggcc accgcggcgg tacagaacct gctgggcggc gacgaggcga tggcgcggtc 3559501 ccgcaagccc gaggtatacg ccgacgcggc ctacgtcatc gtcaacaagc ccgccaccga 3559561 atacaccggc aagacgctgc tgtgcgagga cgtgctcgtc gaatccggcg tcaccgactt 3559621 gtcggtctac gactgcgtcc caggtgcgac gctcggcgtc gacctgtggg tggaagacgc 3559681 caacccgccg gggtacctcc cggcctagcg acagcaaaac cctgatcctc gagttgcccg 3559741 acgagcgggc cgtcgcgatc gtgccggtgc cgtcgaagtt gtcgctgaag gcggccggcg 3559801 gccctagggg tgcccaaagc ggccatggct aaacccgctg ccgccgaaca agccaccggc 3559861 tacgtggtcg gcggcatctc cccgttcggt cagcgcaagc ggctgcggac cgtggtcgat 3559921 gtgtcggcct tgagctggga ccgggtactg cggtgccggc aaacggcatt gggccgtcac 3559981 ggtggccccg ccggacctga tcaccttgat cagcgcgatc atcgctaaca tccgggccta 3560041 gcgccgtacc ggaaatcggc gaggacttca ccgatggcgt agcgcgcgct ggccgccagc 3560101 ggcgggttgg tgtcttggta gtacgggagc gcgatcaagg cgatggccag agctctgccg 3560161 cgcccgcgca tccagtcgtc gtcggcggcg ccgaccgcga cgcggaactg agcacgggcg 3560221 ggcgccgaca ggaggttcca cgcgatgatc aagtcgacgc tggggtcacc gacgcccatc 3560281 agaccgaagt caatgacgcc cgtcaagcgt ccttgcgctg tcaggatgtt gaaccgggac 3560341 aggtcaccgt ggaaccacat cggcggcccc gcatacggag gaacgcgtag ggctgattcc 3560401 cacgcggcag ttgccgcgtg gacgtcgatg atcccgtcga gggccgccag cgctgcgcgt 3560461 acctcggcat cctgctcccc cagcggcgca ccccgcttgg cgggcggccc gcccatgggg 3560521 tcggtggccc gtaaggtggt gatgaagtca gccaggtcct cgacggcccg attgggctcg 3560581 acgaactcgg ctgccgacgg gttctcaccc gcaacccagc ggcacactga ccacggccaa 3560641 ccgaacccct cagccgggct ccccaacccc accggaactg ggctggcaac gcctagatgc 3560701 gcagcgatcc gcggcagcca ctgttgctcg gtccgaaggc tctcgatggc ccagccaatg 3560761 cgcgggatgc gcacggccag gtcctcgcct agccggtaca ttgcgttgtc cgtgcccgcc 3560821 gagcgcaccg gtgcaatggg tagatccgcc cactgtggga attgtgcacg cagcagacgc 3560881 cgcaccagat cctcgtcgat atccacctca tcggcgtgca tctttgccct taggacacgt 3560941 tcgtaccggt cgaagacggt tccgtcctgc tcacagatcc gccgcacgaa agcaaagccc 3561001 gcccgcaacg ccaccctcgc cgatgcggag ttctccggct ccaccttgat caccgcttcg 3561061 gtcgcgccgt gttcggccgc atactggcac accagatcga ctgcgcgagt ggcgagtcca 3561121 cgccctcgcc agctggggta gagcccatag gcaacgttga cctgcccgct agccagcccc 3561181 tcgccgtcga aacgcagatc aatcgtaccc actattgttt cggcaaccgt cctgatgccg 3561241 aaagagcgca gcggcccgcc ggtcacccat tgctcgcggc agtgccggat gtacgcttcg 3561301 acgcttgctc gagtcgaggg cataccgcta agccaacgca ctagccgttc gtccccccca 3561361 gccagatgcg catcgacatc gtccaggcac agtggcgata gagtgacgat cccgtctgat 3561421 agcccgtcgg acagcttcgc aaagcgcacc ccgcgattgt cggactcaca ctggcttcag 3561481 gcaaacctgc cgcgagcgcc cggcgagcgt aatggcgcgg caagaaatcg cgcttggatt 3561541 cgccgcagcg tcacacgcgt gggcacagac cctcacagca gctggatctg ctcgggctgc 3561601 gacctggccg gctccaacag ctcaggcccg ttgttgcgca cgttgttgac caacgtggac 3561661 acttggcgca gcgcgatgtc gcgcacatcc ggcgggcggg ccagcagctc aggatccggc 3561721 ggggcgtctg gattcagcca gtcgtcccag tcctcttcgg ccagcagcag cggcatccgg 3561781 tcatggatct cggccagctc gcccacggca tcggtggtga tcaccgtgca gctcagcagc 3561841 ggtggggcgg acctgtaaga cttccaaacc gaccacagcc cggccgtgaa caacagggcg 3561901 ccgtcgtggc ggtgcaggaa gaacggcgtc ttggcgttcg gcctccccgg ggtggcgtcg 3561961 gggtcgacgc gccattcgta ccagccgtcc atcggcacca ggcaacgctt acttctgacc 3562021 gcactccgga acgccggcga cgtggcgacc ttatcggcgc gggcgttgat cagcggtggg 3562081 cctttggcat cgggtgcgcc gccgggcccg gccttgatcc acgacggaat cagtccccag 3562141 cgcatgagcc gcacccggcg ggtgggctcg tcgtcgggct cgctgtggcg ggacaccact 3562201 gtcgcgatcg tgtcggtggg tgccacgttg tagctcgtct tcccgccacc gcacccggtg 3562261 gcctcgtcta tggccgtgat tttctcggcc agctgggccg gatcagtggt gaccgcaaac 3562321 cgtccgcaca tgcttcctat ggtgcctggt acccacgaca cccgccgaca cggcaggatg 3562381 aagcggtgaa gacatggcca gccccaacgg cgccgacgcc ggtgcgcgct accgtgaccg 3562441 ttccaggctc gaagtcgcag accaaccggg cgctggtgct agcggcgctg gcggccgcac 3562501 aaggccgggg cgcatcgacc atctccggcg cgctgcgcag ccgcgacacc gaactgatgc 3562561 tggacgcgct gcagaccctg ggcctgcgcg tcgacggtgt gggttcggaa ctgacggtca 3562621 gcggccgaat cgaaccgggg cccggcgctc gggtggactg tggcttggcg ggcacggtgt 3562681 tgcggtttgt tccgccgctg gcggcgctgg gctccgtccc ggtcaccttc gacggcgatc 3562741 agcaagcccg gggacggccc atcgcaccgc tgctggatgc gctgcgcgag ctcggcgtcg 3562801 ccgtcgacgg caccggtcta ccgtttcggg ttcacggcaa cgggtcgctc gccggcggca 3562861 ccgtggccat cgacgcgtcg gcgtcctcac agttcgtgtc cgggctgctg ctgtccgcgg 3562921 catcgttcac cgatggcctg accgtccaac acaccggttc gtcgctgccg tctgcgccgc 3562981 acatcgcgat gacggcggcg atgctgcggc aagccggagt cgacatcgac gactcgacac 3563041 cgaaccgttg gcaggtgcgc cccggtccgg tggcggcgcg gcgctgggac atcgaaccgg 3563101 acctgaccaa cgcggtggct ttcctgtcag cggccgtggt cagcggcggc accgtgcgca 3563161 tcaccggctg gcctagagtc agcgtgcaac ccgccgacca catcttggca attttgcggc 3563221 agctcaatgc cgttgtcatt catgctgatt catccctcga ggtgcgcggt ccaacgggat 3563281 acgacgggtt tgacgtcgac ttgcgcgccg tcggcgagct gacgccatcg gtcgcggcgc 3563341 tggcggcgct ggcatccccg ggatcggtgt ccagactaag cggcattgcc catctgcggg 3563401 gccacgaaac cgaccggctc gccgcgctga gcaccgagat caaccggttg gggggcacct 3563461 gccgggaaac acccgacggt ctggtgatca ccgcgacgcc gttgcggccc ggcatctggc 3563521 gggcatacgc ggaccatcga atggcgatgg ccggcgcgat cattgggctg cgggtggccg 3563581 gagtcgaggt cgacgacatc gccgccacca ccaagacgct gccggagttt ccgcggctgt 3563641 gggccgagat ggtcggaccc ggccaggggt gggggtaccc ccagccgcgc agcggccagc 3563701 gggcgaggcg ggcaaccggg caggggtccg gcggttgagg cccggcgact acgacgagtc 3563761 cgacgtcaag gtgcgctccg gcaggagttc gcggccgcgg accaagaccc gtcccgagca 3563821 cgccgacgcc gaggccgcca tggtggtcag cgtcgaccgc ggccgctggg ggtgtgtgct 3563881 gggcggccgc cccgatcgcc gaatcacggc gatgcgcgcc cgcgagctcg gccgcacccc 3563941 gatcgtggtc ggcgacgacg tggacgtggt cggtgacctg tccgggcggc ccgacaccct 3564001 ggcccgcatc gtgcggcgag caccgcgacg aaccgtgttg cgacgcaccg ccgatgacac 3564061 cgaccccacc gagcgggtgg tggtcgccaa cgccgaccaa ctgctgatcg tggtcgcgct 3564121 ggcagacccg ccgccacgca ccggcctggt cgaccgggcg ctgatcgccg cctacgccgg 3564181 cgggctgacc ccgattctct gcctgaccaa gaccgacctc gccccggcgg aaccgttcgg 3564241 caagcagttc gccgacctgg aattgaccgt aaccgccgca ggcgtcgatg atcctctgct 3564301 cgcggtggcg gacctgctgg ccggcaagat caccgtcctg ctcgggcatt ccggggtcgg 3564361 caagtcgaca ttggtgaatc gtcttgtacc cgaagctgat cgggcggttg gtgaggtcac 3564421 cgagatcggc cggggacggc acacgtcgac tcggtcggtg gcgctgccgt tgggagatac 3564481 gctgtccggt tccggctggg tgattgacac cccaggaatc cgctcattcg ggttggctca 3564541 tatccagccc gacaacgtgc tattggcttt ctctgacctc gccgaggcaa cccgcgagtg 3564601 tccgcgcggg tgcgggcaca tgggaccgcc ggccgatccc gaatgcgcgt tggatacctt 3564661 gtccgggccc gctgcccgcc gcgccgcggc cgcccggcga ctactggcag tgctcagcca 3564721 gacttgacta gccgcatgct cgtcgcgcgc cgagcaatct taggctgcca gatcgtcggg 3564781 ttcggtgacc gacttagcca tacgcttgct gcgccgccga ccccgcacgg cggcaatcgc 3564841 ggtctttaac ccccgacgac gtccggtcac cggatcggcg cccgcgaaac ccggccccag 3564901 accagcgaac atccgctcac tgcgggtctc gggtgcatcg tcagcgttgt cacgtaagta 3564961 cttatccggc aacgacagct tggcaagggt gcgccaggtc ttgccgtact gcaccaagaa 3565021 cgagcccgtg gtgtatggca agtcgtatct gtcgcagacc tcacgcaccc gcaccgaaat 3565081 ctcgtgaagc cggttgctcg gcaggtccgg atagaggtga tgctcgattt ggtggcacag 3565141 attgccgctc atgaaccgca gcgccggccc agcgttgaag tttgcgctgc ccagcatctg 3565201 ccgtaggtac cactggccct tcggctcacc gatcatgtcc gtcttggtga atttctctgc 3565261 gccatccggg aaatggccgc agaagatcac cgcgttggac cacacgttgc ggatcacgtt 3565321 ggccaccacg ttggcggtca aagtggaccg atacgtcgcc cccggggaca acgaggtcag 3565381 cgccgggaac gcgacatagt ccttgaacac ctggcggccc gctttggctg agaattcacg 3565441 caaccgggtt ttagcggcct cgcggtcggc ccgacccttg aagatcttgc cgatctccaa 3565501 gtgctgcagc gcaactcccc actcgaagcc gatcgcaagg atggtgttcc acaccacgtt 3565561 gaagatgttg tagcgcttcc agcgctggtc acgggtgacg cgcagcatgc cgtatccgac 3565621 gtcgtcatcc ataccaagga tgttggtgta tttgtggtgc acgaagttgt gggtgtagcg 3565681 ccagtgcttg gacgatccgc tcatgtccca ctcccacgtc gaggagtgaa tctccgggtc 3565741 gttcatccag tcccactggc cgtgcatgac gttgtggccg atctccatgt tttcgatgat 3565801 cttggccacg ccaagggtca gggcacctgt ccaccaggcg aggcgtcgtg agctgccagc 3565861 cagcagtagc cgaccggaca cctcgagcgc ccgctgtgcg gcgatggtgc ggcggatgta 3565921 gcgggcatcg cgttcgccgc gcgattcttc aacgtctcgg cggatggcat ctagctcggc 3565981 ggccaggttt tcaatgtcgg cgtccgtcag atgcgcgaat acgtcgacgt cagtgatcgc 3566041 catcgtcttc tccctgcgtc atacggccga tgacctacgc tatcgtaact tacgattccg 3566101 taggttacct atgagtaaca ctagatgtcc agcacgcaat cacccgaggc ggccgacacg 3566161 caggtctgga cccgggttcc gggctcatgc cgctggcccg tgcgcagatc ccgaacatgg 3566221 ccttccacca ggtcgaccac acacgactgg cagatgccca tccggcagcc gaagggtagc 3566281 tgcacgccgg cgccctcacc ggcgtccatc aacgacgtgg cagcatcggc ggctacgctc 3566341 ttgccacttc gggcgaacgt gacggtcccg cccgctccag cgggcgccgt tttggacact 3566401 gcgaaccgct ccaggtgcag tcggtcgctg gcacccgccg atgaccagac cttgtcggcc 3566461 tggttgagca cgccctccgg cccgcacgcc caggtctggc gttcacgcca gtccggcacc 3566521 tgctgaccga tccgggtcag gtccagccgg ccctgggcgc gcgtctcgcg caccgacaac 3566581 cgataaccgg gatggtcggc cgccagggca gccagctcgg caccgaacat cacgtcagct 3566641 gcggtgggcg ccgaatgcag gtgcactacg tcggtgattt ggttgcggcg caccaacgtt 3566701 cgaagcatcg acattaccgg cgtaatcccc gacccggcag tcaaaaacag aatcaacggg 3566761 ggcgccggat ccggtaatac gaaattgccc tggggcgcag ccagccgcac aatggtccct 3566821 ggctttaccc cggccaccaa gtgggtggac aggaagccct cgggcatcgc cttcaccgtg 3566881 acggtcacca tgcgcgcgga cccggatgcc gccggactcg acgtcagcga atacgaccgc 3566941 cagcgccagc acccgtcgac cagcagcccg atcccgatgt attggcccgg ctggtagtcg 3567001 aaactgaagc cccagcccgg tttgatgaac agggtcgcgg agtcttccgt ctctcggcgg 3567061 acccctagga tgcgcccccg caattcccgc gcggaccaca gcggatttgc caggtgaagg 3567121 tagtcgtcgg gcaacaatgg cgtcgtgatg cgcgcggcaa tcttgcgcag cgcatgccag 3567181 cccggatgcc ggtcggctcc ggcgacggtg gggcgcctgg tgtcgatgat gctggcgtta 3567241 agcgtcgtgt gtttcttgct cataggaagc tcctgctcgg ccttagcttc cgcccaacaa 3567301 agctacggta ccgtaaccta cggttccgta tctaggcccg gacgcgcaga ctgcgtcaca 3567361 cccacggcat cgtcagagca ggtccagcag aaatggcagc tcttggttgg cgtaccaggc 3567421 gagatcgtgg tcctgggcgt caccgaccac cagctcagcg tcctcgtcgc ccaggtcagc 3567481 ggcatcgatg accgcaatcg ccgccatcac ggccggctca gcgccggcat tgtcaacata 3567541 tgcggcgaca acctgatcga tcgtgattgg ccccgccagc ctgacgaccg cgtcatcaag 3567601 atcgggacgg tacgtggcat cgtcgacctc ggcggccagc accgcgcgtc tgggcggcag 3567661 ggcgtctgcg gtggcgccga tgtccgccgc tagcagacgc aacgacgcca acgccgcttc 3567721 gcgcagcgcc acctcggcaa gctcctcgtc gtcaccctcg gcgtacgact cacgcaacgt 3567781 cggcgtcact gcaaaagcag tgccgttgac cggccacaac gcgccatcgg caacgagtcg 3567841 ctgcaacatg gccagggtgg ccgggatgta gacctgcgtc accgggcgat caacgtggcc 3567901 acatagtcgt cgacgtatgt cgacaactcg cggggcggac gcctgtagtt gccactcaca 3567961 agcggccgtg gcggcagctt gacctttggc ttttccacat ctgcgtagtc aatcgtggac 3568021 agcaagtggg ccatcatgtt cagccgcgcg tgctttttga tatcagactc caccacgtac 3568081 caggggctga cgggggtgtc ggtatgcacc atcatctcgt cctttgcgcg cgaatagtcc 3568141 tcccaccgat acaccgattc caggtccatt gggctgagct tccattgccg gaccgggtca 3568201 ttccgtcgag ccttgaatcg gcgcaactgt tcggcgtctg agactgaaaa ccagtatttg 3568261 cgaagcagaa tcccgtcatc gatcagcatc tgctcgaaaa tcggggtctg ccgcaaaaac 3568321 aacacatact cctgcggcgt acagaaaccc atgaccttct ccacaccggc gcggttgtac 3568381 caggaccgat cgaagagcac tatctcacct ttggcgggaa gatgggcaat ataacgctgg 3568441 tagtaccact gaccccgctc gcgatccgtc ggcgcgggca atgccgcgat acgagccact 3568501 cgcgggttga ggtactcggt gatccgtttg atggcgccac ccttaccagc tccgtcacgg 3568561 ccttcgaaga tgaccaccag acgcgcaccc gaatgccggg cccactcttg cagcttcacg 3568621 aattctgttt gcagccgaaa caattcggct tggtagacgg catcggagat cttgcgccgg 3568681 cccggcgcag ctgatctgtg tcccttcgct ctcgacgacg cgccgtcgtt ggtcgcggtg 3568741 ctcacatcaa cggatggtat atccacacat caccatcgac ccctaacaac taccgcgaag 3568801 cctccagaag ctcgtccagt gcttggctca acagccccgg cagcagatcg acatcgctca 3568861 tcgcgtcgcg gtcggcattg atgccgaaat acaacatccc gttatacgac gtcacgctga 3568921 tggccagcgc ctggttgtgc agtagcggcg gcacggagta ggtctccagc agcttggtac 3568981 ccgcaatgta catctgcgac tgggttccgg gggcattggt gatcaacaga ttgaacaacc 3569041 gtgccgaaaa gctagtggcg acccgcaccc ccatggcgtg caaagtggcc ggtgctaacc 3569101 ccgacaacgt gacgatagtc ctggcatcga ccaggctggc ggcggtcggg ttggattcgg 3569161 tggcgtgcgc gatctgcgac aaccgcacta cggcattgcc ctcccccacc gggaggtcaa 3569221 ccaagaacgg tgtcacctgg ctgatcgcct gaccagggcc ggttgagtcg agttggtcgt 3569281 cggcatagac cgacagcggc gccatcgccc gaacagtcgc ggtcggtgcc acagcttcac 3569341 cgcgtgacat cagccagttg cccaaggcac cggcaatcac cgtcagcacc acgtcgtgga 3569401 cgtcacagtc gtagcgagcc cgcaccgtgc gatagtcatc aagacttgca cgggcaaccg 3569461 taaatcgccg attacgcgac acggtggcat tgagcgggct actgggcgcg gtgccccgtg 3569521 ccaccgtgcg ggcgatatcg agaaccttgc ggcccgtctc gacgagttgg ccggaattcg 3569581 ttaccaaccc ggcgaccgcg gatccgacgg cctgtagttg tgcgcccggc cgcaccagcc 3569641 agtccccgac cgcgcgcagc agcaaccgcg tggtgccggg gtcccgttcc gggacccaga 3569701 tgtcttccgg aaacgccggt ggacgccgcg tccggtcggc gatcacgtgg cctatcgcca 3569761 gcgcggtcac cccgttgatc agggcttggt gcgacttggt gtagagggca atgcgattct 3569821 tttccagacc ctcgacgaga tacatctccc acaatggccg cgatttgtcc agcggccgag 3569881 cggccagccg tgcgatcagc tcgtgcagtt gctcgtcact acccggcgac ggcagggccg 3569941 accgccggac gtggtaggtg atgtcgaagt cgcgatcgtc gatccacacc ggcctggcca 3570001 ggcccaattt cacttcctgg actttctgac gatagcgcgg tatctgcggc agccgctgtt 3570061 cgacggtttc cagcagtgcc tcgtagctca atccggcacg cggacggcgc aggatcaaca 3570121 gcaacccgac atacattggg gtggctgtgt tctccagctg atagaaggag gcgtccgatg 3570181 cagacaaccg ggtgaccact acggccctgt cctccttgtc aattcgtcgc gacgagtcac 3570241 gtcgtcgccc acgctaacgg ttagcccgac cacttcacgg cgcgggtaca cgcaagcccg 3570301 cattgtgcga tgatggccag caaccaaacc gctgcgcaac actcgtctgc cactctccag 3570361 caggctcctc gttcgatcga tgatgctgga gggtgcccct tgaccatcag tcctatcgcg 3570421 aactcaccgg gcgacacctt cgccgtcaca cccgtcgtcg agtacgagcc gccgccgcga 3570481 aacatcccgc cgtgcggcca atcatcgcac gcagcccggc ggccgcacac cccgcagcta 3570541 gctcgccgac aaccaatcag gccgagcggc cgggcaccgg cagcggtcac ctccacggcc 3570601 aagtcaccgc ggctgcgtca agcggggacc ttcgccgatg ccgcgctacg ccgagtgctg 3570661 gaggtcatcg accgccgccg cccggtgggc cagctgcgcc ccctgctggc acccggcctc 3570721 gtcgactccg tgctcgcggt gagccgcacg gcggccggac accaacaagg cgcggccatg 3570781 ctgcgccgca tccggctgac accggccgga cccgacaccg cggacaccgc cgccgaggtc 3570841 ttcggcacct acagtcgcgg ggaccggatc catgcgatcg cctgccgggt ggaacaacgg 3570901 cccgccggta acgaaacccg atggctgatg gtcgccctgc acatcgggtg agatcgccgg 3570961 cccacaccct agttcgaagc tactgcggcg gccggcagcc caccgccggt gtagcgggcc 3571021 agtatcggac cgacgatcgc catgacgaac acatacgccg tggccaaggc ggcaaccccc 3571081 gggatcgagg caccggccag cccgatgatg atcaaagaaa actccccccg ggcaacgagc 3571141 gcggtgccag cacgcagctg cccacgccgt gccactccct cccgccgggc agcgaacatc 3571201 ccggtggcca ccttggtcgc tgcggtgaca gcggccaggg ccagcgctac cggaagcatt 3571261 gaaacgagct ttcccgggtc aaccgacagg ccgattccca ggaagaagat cgtggcgaac 3571321 aagtcacgca gcggagtcag caccatgcgt gcccggtctg cggtctcccc ggtaagcgtg 3571381 aggcctacca gaaacgcacc cacagccgcc gacgcgtgca gcgactcggc caccgccgcc 3571441 acgatcaagg tgatgcccag cacccgcaac aacaattgtt cggaatcagg atgagtcacc 3571501 aaccggccga catgatgacc ccaacgatac gacgccgcga acgccccaag caaagcggcg 3571561 atcgccaccg tcatgcccac gaccgcctcg agccagctgc cgtctgtcgc gagaaccgcg 3571621 aacagcggca agtaggccgc catcgcgaag tcttcgagca ccagcaccga cagcacagcc 3571681 ggcgtttccc ggttgccgag ccgacgcagg tcctccaaca gccgcgcgat cacacccgag 3571741 gaggaaatgt aggtgacccc ggccagaccg aggatggcaa caccgtccaa ccccaaaagc 3571801 cagcccgcca ccgcaccggg cgtggcgttg aggacgatat cgacacccgc cgacggcagg 3571861 tggtggcgca gactgctggc gaactcggtc gcagaaaact ccagacccag ggccaaaagc 3571921 aacaacacga caccgatggg cgcaccggta gcgatgaact caccggcggc ggccaccccc 3571981 aagatgccgc cattgcctaa cgacaaaccc gccaacaaat acaccggaat cggcgacaac 3572041 gcgaatcgtc gtgccactgc acccagcacc gcaagcaccg ccaacaggac gccgagctca 3572101 aacaacagcg ccctcgaaac ctccaccggt tcagcccttt tcgacgatct gttcgacccc 3572161 ggcgatcccg tcctcggtgc cgatcacgat gaggacatct ccggctcgca gcacatcagt 3572221 cgggcccggc gaggccaaca catcctcgtc acgcacgatc gccacaatcg acgcgccggt 3572281 acgggtgcgc gcacgggtat cacccagcgg ccggtccaca aacaagctac ccgcccggat 3572341 gtgaatctga ccggccttaa gcccgggcac ctcacgcgtc agctcggtaa atcgctcggc 3572401 gatcctcggc gcacccagaa tctgagccac cgcctcggcc tcttcatcgg tgagccgcaa 3572461 aaccggtcgg gcttcgtccg gatcatcgcg gccatacagg acgacgtcga aaccgccact 3572521 gcgcctggca acgatgccga tccggtcacc gcgatagctg gtgaactcgt atcgcaggcc 3572581 cacccccggc agcagcacct ccttgacgtc cataggagtc aatccttgac gaaatgcggc 3572641 caagatagaa gcggtacggg caatctcgtt gactcaggta tgccggtgcg gccacggcaa 3572701 caacatcgac acctcgcggc ggtaatcgcg gtattggtcg cccagcgccg cgagtaggtc 3572761 gcgctcttcg aactgcaacg cgaccaagat gtagcccgtc gcgccgatcg cgaaaagcaa 3572821 gtgccccgcc gtcatcatgg gcgtcgccca gaacgcgacg acgaatccga gcatgatcgg 3572881 gtggcgtacc caccggtaga gcagatgagc ctgaaaaccg atctcggtgt acggctttcc 3572941 gcgccaagcc aaatacacct gccgtaggcc gaacaattcg aaatgattga tcatgaaagt 3573001 cgacgtcaac accgtggccc acccgagcca gaacaacgcc cacaacgcca cccggccagc 3573061 cggctgccgc acgtcccaga tgaccgccgg catcgttcgc cattgccagt acagcaacaa 3573121 cagcgcaacg ctggccagca gtacataggt gctgcgctcg atcgagggcg gcacgaatcg 3573181 agtccaccag cgtttgaaac cctgtcgtgc catcacgcta tgttggacgg cgaacacgcc 3573241 cagcagcacc aagttgacca cgaccgcctg gccgatcggc gccgcgatcg cgtgatctac 3573301 ggttcgtggc accactacgt cgccgacgaa accgatcgca tacccgaagg caaccaggaa 3573361 taccagatag ctcgcggccc cgtaaatgat cgtcaaataa cgcttcataa cctgattctg 3573421 ctccgcagga gtgtgcagct ggggcgttcg gcccgattgg cgccaatcag cgattcaaca 3573481 gtgccatgat gtgcggcatg gcctcgcggg ccgcaacgcg tcccgcctcg cgggcggcgt 3573541 cgatctggtg aaactccagc agcccaacag caccggtgtc gggtctgata acgacctgcg 3573601 caagactgag tgcggcatcc gccccacgct ggctgccgat tgtcatcgtg cgcatcaagg 3573661 tgtcgccgat tcctggcact tttggcgagc cgtcctgtcg agccgagccc ggcccgccac 3573721 cacctaagcc gatgctcacc gcgatcaatg ggccatcagg acttgcccgg gtcgagaccg 3573781 gaaggttgtc taacacaccg ccatccacat gcagtcgacc gttgtagacc tggggcggat 3573841 agatgcccgg cagccgaagg gaacacccaa tgacatcgac gagtcggcct cggcggtgta 3573901 cgaccggtcg gcgggcaagc aaatcgacgc taacgcaacg gaactccttt ggcagctcct 3573961 cgaccagtcg gtccccgaac gctgcttcta acagggtcag cgtccgtcga ccatggacta 3574021 gccccctgac cggaaacgcg tagtcactga gcggattgtg ccgaatgaag tactcgtatg 3574081 cgtaggcgtc cgctgttgcc gcgtccatac cgcacgctcc gaacaccgca ataaccgccc 3574141 ccatgctggt gccggcgaac cggtcgatgg tgaccccgac ccgctctagc tcgtcaagaa 3574201 ccccgaggtg cgcaaagccg cgcgcgccac cgccgccgag gactagaccg atcgagcggc 3574261 cggcgatgcg tgcggcgagc gggcgtacgt tttccaagat gcgtcggtaa tgaaccacat 3574321 gaaccgatcg cggcgtgatc aattcctccc actgacgccg gtgctcccgg ctggcggccg 3574381 gaccggccag cacgaggtcg gcaccccgcg cacgcgccgg cagccgcgcg gcttgtgggt 3574441 tgggatctcc cgcgaccagc actatccggt cggcgacgcg caggcagaag tcccgccagc 3574501 cggcatcctc gaccgcggca tgtagcacta ccttgtcggc gactcgctcc gcgcgatcaa 3574561 ggccgtcgcg gtcgacccgg ccggggtcaa cggcacgcaa ccgcgccgac agcgcggtaa 3574621 gcaggccagc ggccactgcc ggcacgggcg cgtcgccgct cactccgatc accgaaacga 3574681 ccacctcagg cgacgtcgag tcagtcgccg gtggcggtgc ctcccgcagc cgcgttgcca 3574741 gcacctttac caacgccgcc agcgcaccat ggtcggcgat ctcgtcgaac tgtgccttgg 3574801 tgagccgcac tagcttggtg tcgcgcaacg cccggaccgt cgcggaccgg ggcgcgtcaa 3574861 taagtagccc aagctccccg agaacctccc cgcgacccag ttctttgaga acgatgctgt 3574921 cctgcagcac ctgcacgcga cccgtgcgga tcacgtaaag cgaatcggac gggtcacctt 3574981 cgtggaagag atagcaaccc gcctccaact cgacgtcctc aacgtgctcc ccgagctgtg 3575041 ccaaggtggc cgcgtccagg ccggcaaata gcggcagatt ccccagcgga tcggcgtcac 3575101 cggccgccca atgctcaatc ggcgcggccg ccggctgggg aatcggtggc tccaaccgcg 3575161 gcgcgatcgc gggctccggc gccggcatct ggacggggtt gcggttggtt ctacccagca 3575221 ccgcggccgc gacagccacc gcgatgaaac agatggcagc catagcccat ccgcgccgca 3575281 acgcctcctc ggcagtaccg tgctccggct taccgatcaa gatcaccatc accgcgacac 3575341 cgagcaccgc accgagctgg cgagtggtgc taacgaccgc cgacgaggtg gcatagctgc 3575401 cgcccttggc gacctcggcc agcgctgcac tgctcaacac cggcaacgtc gcgccgacac 3575461 cgatgccctg cagcagttgg cccggcagcc acacgcggag gaaatccggc tcggacccga 3575521 cacgctgcaa ataccacacc aggctgccgg cccagaccag cgcaccaacg aggacgatga 3575581 cgcgatgccc atgccgaccg gcaacccgac ccagcgccgc cgccaccacg gcagccacca 3575641 ccgcagcggg cgcgatcgcg aaacccgcct tcagcagcga gtagtgccac acatagttga 3575701 ggtaaagcac atgggtaagg ccatagcagt aaaaacccgc tgcggcgacc agcgtgagca 3575761 ggttgcccgc cacgaacgac cggctacgca acagcgccgg ctcgaccagc ggcgcggggt 3575821 gcgaccgcga gctgtgcacg aacccaaccg aggtcaggac gctggccagg aacgaaccga 3575881 cggtggccac gctcaaccaa ccccagtccg gccccttgac caaaccgagg gtaaccaacc 3575941 cgagcgttac cgcaagcagc agcgcaccgc gcaagtcagg catgcggcgc cggcccgagg 3576001 cgcggctctc gacgagcatg cgcttggtgg cgatcgccgc gacgatgccc agcggaacat 3576061 tgaccagtaa cacccaccgc cagccggccc actccacgag gagcccgccg atcggcgggc 3576121 ccaggccagc cgcgatcgct gccgccgcac cccacaggcc gatagcgtgc gcgcggcgcg 3576181 ccgcgtcgaa gccctcaacg accagtgcga gcgaagcagg cacgagtatc gcagccccga 3576241 tgccctgcag cacccggaac gccaccaact gctcgacact gccggcgacg gcgcacagcc 3576301 cggacgcaat ggtgaacacc agcacaccgg acaggaatgt ccgtctgcgg cccagcaaat 3576361 cggccaacct gccggccgca accatgaagg cggcgaagac gatgttatag ccgttcagaa 3576421 tccaggacag gctcccgatg tcgtaggacg ggaaggaacg ctggatatcc gggaacgcga 3576481 tgttgacgat tgtcgagtcg agaaacgcca ggaaagcgcc gaaccccgct accagcagaa 3576541 ccgacgccga cgaaggtcgg cgacgacggg tgagattagc gaaccccttg ccgccgtgca 3576601 acgaaatgtg catgcgcgcc ggggcgcggg gtgtgccggg aagtgacttc tgggaactga 3576661 gaaaccgata cacccatctg caacctacgc gctaacgctt cttgaccgat ttcggcggct 3576721 tggcgccgcg gccttgtcgg cgggcggctt cgcgccgctc gcgccggcta gcaccggccg 3576781 gcactccggc cggcgtcttg tgggctccac cgccgttgcg ctgcacctga gccgagccat 3576841 cctccgcggg accggaatag gtcaaagcgg gcgactcgct ggcaacaccc ttggcgcgta 3576901 atgcacttgg agctctttcg cgcgcgccac catcgaccgc gctgcgttgc tgcgcggcgg 3576961 ctgcggccgc ggcggcgaat tcggcaagct ctgcgggttc ggcagccggg gcaaccggcg 3577021 gggcggggac cgcctccacg gtgacgttga acaggaagcc gaccgattcc tctttcatgc 3577081 cgtcgagcat ggccatgaac atgtcgtagc cctcacgctg gtactcgacc aacggatcgc 3577141 gctgcgccat cgcgcgcagc ccgataccct ccttgaggta gtccatctcg tagaggtgtt 3577201 cacgccactt acggtctatg acgttgagca gcacgttgcg ttccagctgg cgcatcgcac 3577261 cctcgccggc gatttcctcg agttcggctt cccgtgcggc ataggcacgt tcggcgtcct 3577321 tgagtagtgc ctccagcaac tcctcgcggg tgagatcgtc gcgctcgaat tcgtggtcct 3577381 tgcgggtcag cgagtcggcg gtgatcccca ccggatagag ggttttgagt gccgtccaca 3577441 acgcgtccag atcccaatct tcggcatagc cttcgccggt cgcgccgtcg acgtaggcgg 3577501 tgatgacatc gcggaccatg tccagcgcct ggtccttgag gttttcgcct tcgaggatgc 3577561 gccggcgctc ggcgtagatg accttgcgct gctggttcat cacctcgtcg tatttgagga 3577621 cgttcttgcg gacctcaaag ttctgctgct cgacctgggt ctgggcgctc ttgatggccc 3577681 gggtgaccat cttggcttcg atcggcacgt cgtcgggcag gttcagcctg gtcaacaagg 3577741 tctccaaggc cgcgccattg aagcggcgca tcagctcgtc acccagcgac aaatagaagc 3577801 gcgactcccc ggggtccccc tggcggccgg accggccacg caactggttg tcgatccgcc 3577861 gcgactcgtg gcgctcggtg cccagcacgt acaggccgcc ggcctcgatt acttccttgg 3577921 cctccttgct ggcttcctct ttgacgatgg gcagttcgga gtgccaggcc gcctcgtact 3577981 cctcgggcgt ctccaccgga tccaggccgc gttcgcgcag ccgctgatcg gtgagaaagt 3578041 cgacgttgcc gcccagcaca atgtcggtgc cgcgaccggc catgttggtg gcgacggtga 3578101 cgccgccgcg gcggcccgcc accgcgatga tggtcgcctc ttgctcgtgg tacttggcgt 3578161 tgagcacatt gtgcgggatg cgccgcttgg tgaactgccg cgacagatac tccgagcgct 3578221 ccacgctggt ggtgccgatc agcaccggct gtcccttcgc gtagcgctcg gcgacgtcgt 3578281 cgaccaccgc gatgtacttg gcctcctcgg tcttgtagat caggtcggac tggtcttcac 3578341 ggatcatcgg catgttggtc gggatgctga ccacgcccag cttgtagatc tcgtgcagct 3578401 cggccgcctc cgtctgggcg gtgccggtca tgccggcgag cttgtcgtag agccggaagt 3578461 agttctgcag cgtgatggtg gccagcgtct ggttctcggc cttgatctcg acgtgctcct 3578521 tggcctcgat ggcctggtgc atgccctcgt tgtagcggcg gccgatcagc acccggccgg 3578581 tgaactcgtc gacgatgagc acctcaccat cgcggacgat gtagtccttg tcgcggctga 3578641 acagctcttt ggccttcaga gcgttgttga gatagctgac caacggcgag ttggcggcct 3578701 cgtacaggtt gtcgatgccg agctggtctt cgacgaattc cacacccttc tcgtgcacgc 3578761 cgacggtgcg tttgcgtaga tcgacctcgt agtggacgtc cttttccatc agcggcgcca 3578821 accgggcgaa ctcggtgtac cagttggagg cgccgtcggc gggaccggag atgatcagcg 3578881 gggtgcgggc ctcgtcgatc aggatggaat cgacctcgtc gacaatggcg taatggtgcc 3578941 cgcgctgcac cagatcatcc agtgagtgcg ccatgttgtc gcgcaggtag tcgaacccaa 3579001 actcgttatt ggtgccgtag gtgatgtcgg cgttataggc cacccggcgt tcatcgggtg 3579061 tcatggtggc caaaatcacc ccgacctgaa gcccgaggaa gcggtgcacg cggcccatcc 3579121 actcactgtc gcgtttagcc aggtagtcgt tgacggtgac gatgtgcacg ccgttgccgg 3579181 ccagcgcatt gaggtaagcg ggcaacacac aggtcagggt cttgccttca ccggtcttca 3579241 tctcggcaac gttgcccagg tgcagggcgg ccgcacccat cacctgcacg tcgaacggcc 3579301 gctggtccag cacccgccag gcggcctcgc gggccacggc gaaggcctcg ggcaacaggt 3579361 cgtcgagggt ttctgggttt ttctggtcgg ccagccgccg cttgaactcg tcggttttcg 3579421 ccctcagctc ggcgtcggtg agtttctcga catcgtcgga caaagtgccg acatagtcgg 3579481 ccaccttctt gaggcgcttg accatgcgac cttcgccaag gcgcagcaac ttcgacagca 3579541 cagctatgtc cccgcatgtg taggagtctt tagataaggc gactcccatg gtaggtgacg 3579601 acgcggcgcg cgccgccgat cacgccagac ggatcaagcc gtagtcgtag gcgtgccggc 3579661 ggtagaccac cgacggccgt tcggtgtcct tgtcgtagaa caagaagaag tcgtgtccaa 3579721 ccagctccat ctggtagagc gcgtcatcga ccgacatcgg cttggccggg tgttctttgg 3579781 tgcgaacgat ccgcccaggc tcccgctcga cgacggcacc gtcgtgatcg tgtgcctcgg 3579841 ctggtctggt gttgaagccg ttctccggcg ctggcaccac cgcggtcgcc tcggccagcg 3579901 aaaccggggt tttgtcgccg tagtgcacct tgcggcgatc cttaccgcgg cgcagccggc 3579961 tctccagttt gacgaccgct gattcaagcg cggcatagaa gctgtcggcg caggcctcac 3580021 ctcgcaccac cggccctcgc ccacgcgcgg tgatctccac gcgctgacag gacttgcgct 3580081 ggcggcgatt acgttcgtgg tcgagttcga cgtcgaacag gtagatggtc cggtcgaacc 3580141 gctccaagcg ggcgagtttc tgcgaaacgt agatgcggaa gtggtcgggg atctcgacat 3580201 tacggccctt gaacacgatc tcagcgtttg atttcggttc ggccagaacc tgacctgaat 3580261 ccacggctag ccttgacata cgtgacaact cgtttctctt tccacgtcac acgcgccctg 3580321 cgtgcctggc cttcggggag acgcgccgac ggggtgggag cggttggaga agttaccgcc 3580381 gcaggctgcc cgccggagca agatgtcgat tgctcacctc ctatcgcggg atgctgattc 3580441 aacctgggaa gcgcgagcgt gagtcgttaa aggttgatct cgacgttagc ccgtgttcgg 3580501 ctcaccgtgc caccaaattg accgacctgt ttcgagttct tcacgttgtc ttggcaactg 3580561 caccggctca ggcagatcct cacgcggccg cgaccgccaa cacggcaccc acccgcacac 3580621 cggcggcctg caagacccgg accgactcgc gcgccgtcgc cccggtggtg atgatgtcgt 3580681 cgacgagcac gacttcgttg cgcggccgct ggccccgcaa cagcacccga cccgtgatgt 3580741 tgcgctcgcg cgcggacgcc ccaagaccta ccgagtcccg ggctagcgct cgcatccgca 3580801 gcgccgggac gacggtgacg tcatggtggc gcccaagggt ggcacccgca atccgcgcca 3580861 tccggctgac ggggtcaccc ccacgccgtc gcgccgccca ccgtctcgtc ggcgcaggca 3580921 ccatcgtcag cgggttttcg agcatgcccc aggacaacag gtggtcgaca ccgacaatca 3580981 gcgcgcacgc cagtggcgcg acgaggtcgc gacggccgtg ctctttcata gcgaggatcg 3581041 cctgacgacg cacgcccgcg tagcggccga gcgcgaacac cggcacctgt gggtcaacac 3581101 gaggactcac cacgtgcggt tcaccggcag ccaccgacag ctcggcggca caggcggcac 3581161 accagcgggt cgccggcgca ccgcagccac cgcattccag cggcaggacg aggtcaagca 3581221 cacaccaagt gtcgcggtca ccggtgacag cagtgctgtc aatcggcgcc gctgcgcagc 3581281 ggcggccaga caaagctgag cgcaccctga ctcaattggg taatcacgct ttccagataa 3581341 cgcagcggca gctccgttcg ccactccgga agcaacgcag agagctgcaa acaatcggct 3581401 gcgaggctga cgtcggtgat gcgcagccca tgcggcagct caggcagtgg cactcggtag 3581461 gccggtgtcc gtgccggcag tgtccaccgc cgttgtccgg tgatcacggt gcgcggtcgc 3581521 agccacagtg tcgtctggga ggttgtgccc gccacgtcga cgtcgacctc aagacctccc 3581581 cagtcgggcc ggcgcgccca gcgcagccgg gccgctccgc tctcggacaa ctcaccgcgc 3581641 agctgcggcg tcgcttgacg gagcacatca tcgaaaatct cggtcggcag ggcggacgac 3581701 aactcgacgg gagcggcgat caccagcggc ggcacacccg ggcggatgtg cacgttgcgc 3581761 aggacggcaa cggcgctgtg caaatgatgc tgatcccagc tgatgccgcg agcggccacc 3581821 cgaacctcgc ccagctggcc gacggccagc ccctgcggtt ccagtgccga gtccagctcg 3581881 gtgacggtca gcaccacgtc atggtcccca atccgaaccg tgacttcctt gccgatgagc 3581941 agctgctgca aggtggtgaa caacgtccgg tagggcgccg caactgcctg ggcggctccc 3582001 gcgctgacca gcgacatccc ggtcgacgac cacagcgagg ccagcatgtc caaggcacgg 3582061 aagggatcat cccaacgcag ccggggaact cttggcgaca tcaacaagcg cctcctcact 3582121 gcgagggtag ccggtgtgct caggtcgcga aaaacgcagg cacagcactc atccgggcaa 3582181 taccggcgcc gcccccggca ccatcagccc cggtacgtcc gcccagcctg gtcggctttc 3582241 gacagacgcc gagtacatca acaccccttg cgggccggcg acatacacag tcgacgggtt 3582301 ggccgcgatc gccgtcagtg gagtttgcaa cccgcgggac ggcgcgtcgg agttcacccc 3582361 gtcgaggttt acataagaca cgggatgggc ggcgtcggtg cgtgtcacca cgatgtcgtc 3582421 accggttcgc caggacaacg acaccaccga ggaacccagc ccgaaaccca gccgccgagg 3582481 gtaggtcagg gcgaactggc cagcctgggt ctgctcgacg ccggcgagga tcacctgccc 3582541 accgatcacc atcgcggcgc gcgtcccgtc acgggacagt tgaagatcgt tgatcgcccc 3582601 cgggaagcgg ctggccaccg cggtcgaatc caccggaatc cgcgcgggtt gccccgatgc 3582661 cgggtcctgt atcgctcgca gcacgacgtt ggtatcgacc accacccaga ccgcgtcgtc 3582721 cagcgaccag ctgggccgca acaggctgtg cccgtcggcg gactgcaccg cctcgccgcc 3582781 gaggtcgccg acccacaaag acgccgcctc atccggagcc ccgcgcccca gcgtcaccac 3582841 cgaggccacc tgacgcccgc tgcgtgatac ggcggccgcc gtctgctccg gcatccgtcc 3582901 gaaggccccg ggcacggggg tgactcgctg tgcgtccatc gccaccagtg atccgttcac 3582961 caaggcgtgc aaccccgcgg cggcaccgtc ggccaccccc gggtcggtgg ccgcgacatc 3583021 ggaagtggtc cacccctcgg caaacctgtc ttccagcggg gcgccgtcgg cgttgatcac 3583081 gtacggcccc ctgatgtcgg ccctggccaa ggtccagatg atctgtgcgg caagtaattg 3583141 cctgctgtgc ggatcggtgg tggacagctt ctccatgtcg actcgcgcgc cgccgtaccc 3583201 gcggccgatt ccgctctttc cgccgtcggc ccgagtcacc ggcccgcgca gtcgtagcgg 3583261 cggagcgagc agattacgca ccgtgcgcgc catctccggg cgtggacccg ccagcagttt 3583321 ggagacgagc tccgtggcca gctggtcgcg gtcggacaca gcgacgtagc gcggatcggg 3583381 aaccacggtc ttgccggtgg ggtcggcgaa gtacagggtg ttgcgcttgt acgtttcttg 3583441 gaactgctgc cagtccagga aaaccccgtt gggtaggcga tcgatgcgcc aaccaccgga 3583501 cgtcttgacc aactcgatcg ggcccggatc cggcagttga ccctcggcgg tctcaaacac 3583561 ccccacatcc gagagcgagc cgagaatgtc tgcccgcatg gttaccgaaa ccttctcggc 3583621 gcttcgggtt tcgacgaaca ccacgtggtc gatcaacaac gcgctgccgg cgtcgtccca 3583681 ggcgttggaa gccgattcgg tgaggaactg acgcgccgct aggtgccggt tggccgggtc 3583741 ggctgtggcc ttgaggaact cgcgtaacag cacgtcggga tccatacccg ggctcggttt 3583801 gggcagattc gacggcaccg gacgttcgac ggttccgatg gcttgcgggg ccgacgtgct 3583861 gggcacactg gcacagccgg ccagcactgc accaaggaac aacaaaattg tcagccgcat 3583921 caaccgctcc actccgcgtg ctcacgtggg cgctgacgtt ccttgtattc cggtggcatc 3583981 ggttgcggat tcggttgcgc gaccggttgc agaactggct gcgggatcgg tttcatgggc 3584041 agcgggctgg tggtgacctt gtggccgcgc accagcggaa gcgtcagccg gaagcaggcg 3584101 ccctcgccgg gttcgcccca cgcctcaagc cgaccctggt gcaatcgggc atcctcgacg 3584161 ctgatcgcca aacccagccc ggtgccgccg gaccgacgta cccgtgaggg atccgagcgc 3584221 cagaaccggc taaacaccag cttctcctca ccaggccgca gcccaacccc gtagtcacgc 3584281 acggtgacgg cgaccgtgtc ttcgtcggcg gccatccgga tccgcaccgg tttgtgttcg 3584341 gcgtggtcga tggcattggc aatcagattg cgcaggatcc gttctacccg acgcgcatcg 3584401 acctccgcga tcacctgctc ggcgggcaga tccaccagca actcgatacc ggcctcctcg 3584461 gccaggtggc ccacattgcc gagcgcgttg ttgaccgttg tgcgcaagtc gaccgcctca 3584521 accgacaact cggccacccc ggcgtcatgc cgcgagatct ccagcaggtc gttgagcaac 3584581 gtctcgaatc ggtccagctc gctaaccatc aactcggtgg accgccgcag cgtggggtcg 3584641 aggtcggcgc tgtggtcata gatcaagtcg gccgccatcc gcaccgtggt cagcggcgta 3584701 cgcagttcgt ggctgacgtc ggaggtgaac cggcgctgta ggttgccgaa ctcctccagc 3584761 tgggcgatct gtcgggacag gctctcggcc atgtcgttga acgacaccgc cagcctggcc 3584821 atgtcgtcct cgccgcgcac cggcatgcgt tcggacagat gtccctcggc gaaacgttcg 3584881 gcgatccgcg acgccgaccg caccggcacc accacctgac gcgacaccag cagcgcaatg 3584941 ccggcgagca ggactagcag taccaggccg ccggtggcca tcgtgccacg caccagcgtg 3585001 atcgtggctt gctcgctcgc cagcggaaag atcaggtata gctccaggtt ggccacccgc 3585061 gacaacgtcg gagtcccgat gatcagggcc ggcccggaga aaccttcggt ctgcaccgtg 3585121 gcgtactggt aggcggcctg cccggccttg acgaagccgc gcagcgcgtt gggcacctga 3585181 tcgacgggtc cggcagtaga ggcagcgcgc ggcccatcac ccggcaccat cagcaccgca 3585241 tcgaacgcac cggcgaggcc agcccccgaa gcggggtcgg ttttcgacgt cagagtgttg 3585301 cgcgcaagct gcaggctact gtccagtgag cgcgtctcct caccgttgac gatcccgctg 3585361 acggtggtgc gtgcccgctc gatctggtcg atcgccgccc tgaccttgat gtcgaggaca 3585421 cgattggtga cctggctggt cagcacaaag ccaagcgcca ggatgacggc tagcgacagt 3585481 ccaagggtca gcgccacgac ccgcagctgc agcgatcggc gccacgcgac agctacggct 3585541 cgactcaacg cactgaggcc ccgtgtcatc gggccagagc gaccccggcg accccgaatg 3585601 cgtcggcgcg agccgaagat catcggcgcc gctccttagc atcgctgcgc tctgcatcgt 3585661 cgccggcgcg gatcacggag gtccggcctt gtaccccact cctcgaacgg tcagcaccac 3585721 agtcgggttc tcgggatcct tttcgacctt ggcccgcaga cgctggacat gcacgttcac 3585781 cagcctggta tcggctgggt gccggtaacc ccatacctgt tcgagcagca catcacgagt 3585841 aaacacctgg cgcggcttgc gcgccaatgc gaccaacagg tcgaattcca gcggtgtcaa 3585901 cgagatctgc tcaccgttgc gagtgacctt gtgcgccggt acgtcgattt ctacgtcggc 3585961 gatggacagc atctcggcgg gttcgtcgtc gttgcggcgc agccgcgccc gcacccgcgc 3586021 aaccagctcc ttgggcttga acggcttcat gatgtagtcg tcggcgcccg actccagacc 3586081 cagcaccaca tccacggtgt cggtctttgc ggtgagcatc acgatcggaa caccggaatc 3586141 ggcgcgcaac acccggcaca cgtcgatgcc gttcataccg ggcagcatca aatccaataa 3586201 caccagatcg gggcgcagct cgcgcaccgc ggtcagagcc tgagtaccgt cgccgatgac 3586261 cgcggtgtcg aagccttccc cccgcagcac gatggtgagc atctcagcca acgaagcgtc 3586321 gtcgtcaacg accaaaatcc tttgcctcat ggtgtccatg gtgtcaccac atcgggacaa 3586381 aactggcgca ccacacgggc gtttcttgct tgattagggc aaataccctc aacttggcac 3586441 gtctggaggc gccaaagtcg ccgctagtcg gcccggatca acatcggcgc cgacaaccag 3586501 ccaccggccg ccccaccctt gggccgccaa ctcggcgtag accgcaccgg tgcgctgctg 3586561 aagttcagcg tcgcgttcgt aattgtcgcg cgcccgaccg gggtcacgct gggcacggcc 3586621 gcgggatcgt tccccggcga gctcggcaga gaccgcaagg agcacctgcc agtcgggctt 3586681 gggcaacccg agtcttgcaa attcgatccg ctgaacccag gccgctgcct tcccggccgc 3586741 gttttcatgt aggcgcgccg cgctgtaggc cgcgttggag gcgacgtagc gatccaggat 3586801 caccacgtcg tagccgcgac acagcccctg gatcgtgtgg accgcgccag cgcggtcgag 3586861 cgcgaacagc gtcgccatcg catacaccga cgatgcgagg tcaccgtgct cgccgtgcag 3586921 cgcctccgct gcgatgtcgg cggccaccga ctgtccgtag cgcgggaacg ccagtgtggc 3586981 caccgatctc ccggctgctc gaaaggcccc ggacagcttt tccaccaacg tccgcttgcc 3587041 agcgccgtca acgccctcaa tcgcgattag cacggcgcgg ccctgtcggt ggcggcgcga 3587101 gcagacgcaa aatcgccctt ttcgtcatga aaatgggcga ttttgcgtct gctcgcgggt 3587161 gggaggcact cagtagcggt agtggtccgg cttgtaggga ccctcgacgt cgacgccgag 3587221 gtattcggcc tgctccttgg tcagcttggt caggtgaccg ccaagggcct cgacatggat 3587281 tcgagccacc ttctcgtcga ggtgcttggg cagccggtac acctcgttgt cgtactcgtc 3587341 gttcttggtc cacagctcga tctgggcgat cgtctggtta gcgaagctgt tgctcatcac 3587401 gaacgagggg tgcccggtgg cattgcccag gttcagcagc cgcccctcgg acagcacgat 3587461 gatcgagcgg cccgtgtcgc caaaggtcca caggtcgacc tgaggcttga cgttgacccg 3587521 tgtcgccccg gagcgctcca gcccggccat gtcgatctcg ttgtcgaagt ggccgatatt 3587581 tcccaggatc gcgtggtcct tcatcgcctt aatgtgctcg agcatgatga tgtctttgtt 3587641 gccggtcgcg gttacgacga tgtcggcgtc cccgatggcc tcctcgacgg tgaccacgtc 3587701 gaagccctcc atcatggcct gcagcgcgtt gatcgggtcg atctcggtga cggagacccg 3587761 cgctccctgg cccttcatcg cctccgcaca gcccttaccg acgtcgccgt agccgcagat 3587821 gaggaccttc ttaccgccga tcagcgcgtc ggtgccgcgg ttgatgccgt cgatcaggga 3587881 gtgccgagtg ccgtacttgt tgtcgaattt ggacttggtc accgagtcgt tgacgttgat 3587941 cgccgggaag gccagatccc cggccgcggc gaattggtag agccgcagca cgccggtggt 3588001 ggtctcctcg gtgacgccct tgaccgactc ggctatcttg gtccacttgt ccttgtcggt 3588061 ctcgaagcgg gtccgtagca ggttcaggaa gatcttccac tcggcggggt cgtcctcctc 3588121 ggcgggcggc accacgccgg ccttctcata ctgcatgccg cgcagcacca acatggtggc 3588181 gtcaccgccg tcatcgagga tcatgttggc cggcttgtcg gggtccggcc aggtgagcat 3588241 ctgctcggcg gcccaccagt actcttcgag cgtctcgccc ttccacgcga acaccgggac 3588301 acccttgggc tcgtcggggg tgccgtgcgg gccgaccacg acggcggcgg cggcgtgatc 3588361 ctgggtggag aagatgttgc acgaggccca gcggacttcg gcgcccagcg cggtgagggt 3588421 ttcgatcaac accgcggtct gcaccgtcat gtgcagcgaa cccgagatcc gggccccctt 3588481 caggggttgc acctcggcat actcgcgccg cagcgacatc aggccgggca tctcgtgctc 3588541 ggcgatccgg agttctttgc ggccgaaatc cgctagtgac aggtcggcga tcttaaagtc 3588601 gatgccgtta cgaacgtcag gggtcagcga atttttggtc accaaatttc cggtcatagg 3588661 ggctttcatc cttctttggg ggctcacagg gatccgagcg ggctacttag cctaggtacg 3588721 ctcttgcagt cactgtagcc gccgtcggtc agccccgcag gtcaggggac attgatcaca 3588781 ccgtgacgct ccgcgaacgg cgttattagc cgtgctaggt ccgctgcgac atcatggtcg 3588841 gcctcgggcg gcatcgacac gtagctcaag cacagccgca cgatcgcacg cgagagcaca 3588901 ttggcgtcgt tatcggtggt ggccacccag gtatcggtga aggccggcgc cagccgggcc 3588961 gacgcgcggg tgatgatcgg cgcgctgtcg gtggtgatca gttgcagcag atcgggcttg 3589021 gcgacaccgg tcaacagcga gatgaccaac ggatctgccg ccgactcggc gaagaacgac 3589081 cgaaagccct gcaggaacgc ttcgtaaaag ttgccgacgt tggcgtccaa cgatgcatgg 3589141 acgttgtcca ctaatcggtc ggccaggcgc agcgcgtatc cctgcgccag gccttgccgg 3589201 gaaccgaatt cgttgtagat ggtctgccgg ctgatgcccg ccgcgcgggc cacgtcggac 3589261 agcgtgatgg cggaccagtc gcgggtcagc agcagatccc gcatcgcatc cagcaccgaa 3589321 tcccgcaaca gggcccgcga ggcctcggca tagggtatcc gcttcacagg cgcgacagta 3589381 gcgcttggag tgctcacgag cgagccacct ccaccatctc gaaatccgac tttgccgcac 3589441 cgcaatccgg gcaactccag tcatcgggga tgtcgtccca gcgggtgccg gccgcgatgc 3589501 cgtcctccgg ccaacccagc gcctcatcgt actcaaagcc gcattggata cagcggaaca 3589561 gtttgtagtc gttcacttag ttaccctcct atcttttcga aatcgacctt ctcgcgcacc 3589621 gcgcagtccg ggcagcacca gtcgtcggga atttgatccc agcctgtgcc ggctgggaag 3589681 ccttccctgg catcaccgtt ggcctcgtcg tagacgtagt cgcagaccgg gcaccggtag 3589741 gcggccatca tgccgaggct ccgtaacggg cgagtgcctt ctcccgcacg cgcgggtgca 3589801 ggttaacccg agtgatatcg ccgccgtagt gctccagcac ccggtgatcc attaccttgc 3589861 gccacaacgg cgggaagtag gtcagcgaga tcatcgatgc atacccactg ggcaggttgg 3589921 gcgcacccgc catgctccgc agtgtctgat agcggcgagt ggggttggcg tggtgatcgc 3589981 tgtgtcgctg caggtggtag aggaacaggt tggtgacgat gtggtcggag ttccagctgt 3590041 gcaccggggc gcagcgctcg tagcggccgt tggcgctctt ctgccgtagc agtccgtagt 3590101 gttcgaggta gttgacggcc tctaacaggc tgaagccgaa gactgcctgg atgatgacga 3590161 acgggatcag cgccgggccg aagaccgcga tcagcccacc ccacaacacc accgacatca 3590221 gccacgcgtt gagcacgtcg ttgcgcagat acgtcatggg attccagggg ctgacgccga 3590281 gccgacgcag ccgttgggcc tccaaatgaa cggccgagcg caagccgccg ataacactgc 3590341 ggggcaggaa ctcccacaac gtctcgccga accgcgccga cgccgggtcc tccggtgtgg 3590401 acacccggac gtgatggcca cggttgtgct cgatgtagaa gtgcccgtag caggtctggg 3590461 cgagggtgat cttggacagc caccgctcca gcgaatcctt cttgtgcccc atttcgtggg 3590521 cggtgttgat accgacgccg ccaagcacac cgaccgacag cgccacccca agcttgcccg 3590581 cccagctcaa ggcgccgtca aagccgagcc aactgaggtt tgcggcggtg aacaggtatg 3590641 cgcccagcac cacgctgagg tactggaacg ggatgtagat gtaggtgcag tagcggtagt 3590701 acttgtcatt ctccagccgg tcggtcacct cgtcgggcgg gttctgcccg tcgggcccga 3590761 agcgtaggtc aagaagcggc aacaagacgt agagcaggat cggtccgatc cacagcggca 3590821 cctgcgcggc ggcgtgccag ccgagctggt tcatccccca gatcagcggc agcatcacca 3590881 ccaaggccgt cggggcgatg aggcccataa gccacaggta acgcttcttg tcccgccact 3590941 cctcgacttc gggcggccgg ggggcttcgg gtccaccaga gccgatttgc gtggtcatat 3591001 gccaaacctc ctcatgagcc acaccacgtt gggatttgac aatagagcag tttgcgtctt 3591061 atgtctagac atataacgca atttgtaaat acgcggcgaa gctagttcaa cacctccggg 3591121 tcgcgctctc tcgagcttgc cgaaggccct gcgccgagtg ccggcgcccg tagccgacat 3591181 aaatcgcggt tccggccacc agccagatcc cgaaccggat ccaagtcaac gcggtgaggt 3591241 tcagcatcag ccacaggcac gcgcacactg cggcgatcgg aagtaacggc acccacggag 3591301 ctgtgaaccc ccgctgaagg tcgggtcggg tccggcgcag cacgaccact ccggccgaga 3591361 cgaggatgaa cgcgaacagt gtcccgacgt tgaccatctc ctcaagcttg gtgatcggaa 3591421 acaccgacgc cgtcgtggcc accaacaccg cgaccagcac cgtgacccgg accggggtgc 3591481 cgcgcgaacc ggtcttggcc aattgccgcg gcaccaagcc gtcgcgcgcc atggcgaaca 3591541 gcacgcggca ttgcccgagc atcaacacca tcaccaccgt ggtaagcccg gccagcgcgc 3591601 cgacggagat gatgccgctg gcccagtaca ccccgttggc ctggaacgcg gtggccagat 3591661 ttgccggccc gcggcccggt acggtccgca gttgggtgta tggaaccatg cccgacagca 3591721 ccaccgatac cgcgacgtag agaagggtca cgacccccag cgacgcgaga atccctcgag 3591781 ggacgtctcg ttgaggacgc ttggtctcct cggccatggt ggccacgatg tcaaacccga 3591841 taaacgcgaa gaacacgatc gatgccccgg ccagcacgcc gtaccatccg tagtggctgc 3591901 cttgggctcc ggtcagcaac gagaagacgg attgatcgag cccgccgccg tggtgctgga 3591961 cttcgggctc gggaatgaac ggcgagtagt tggcggccct gatgtagaag gcaccgacga 3592021 ccaccaccaa gacgaccacc gacaccttga ttgcggtgac caccgcggaa aatctcgacg 3592081 acaatttggt gcccaacgcg atcagggtcg ccaccaacgt gacgatcacg agcgcacccc 3592141 agtcgagctg cagcgatccg agatggcctg tgccattacc gaatccgaac acggtgccca 3592201 agtagctgga ccagcctttg gcgaccacgg ccgcacccat cgccagttcc agcaccagat 3592261 tccagccgat cacccaggcc aagaactccc cgaaggtggc ataagagaag gtataggcgc 3592321 tgccggccac cggcagcgtc gaggcgaact cggcgtagca cagcgcggcc agcgcacagg 3592381 tcgccgccgc gatcagaaac gatatccaga tggccgggcc ggtgatatcg ccagcggtcg 3592441 acgcggtaac cgtgaatatt ccggcgccaa tcaccaccga gacgccgaaa acaaccaggt 3592501 cccaccaggt gaggtccttg cgcagccgag tggtgggctc gtcggtgtcg gcgattgact 3592561 gttctaccga cttcatgcgc cgtcgaccgg ccatgcaccc gtcctctcgc actcgttgtg 3592621 accgcacagt actgggtact ctgcgaggat gacgggtcgc gtagggaacc cgaaggacca 3592681 cgccgtggtg atcggagcta gcatcgccgg gttgtgcgcc gcgcgggtgc tctcggactt 3592741 ctactccacg gtgacggttt tcgagcgcga cgagttgccg gaagcgccgg cgaaccgggc 3592801 cacggtccct caagaccgac acctgcacat gttgatggcc cgcggggcgc aggaattcga 3592861 cagcctgttc cccggcctgt tgcacgacat ggtggccgcg ggcgtgccca tgcttgagaa 3592921 ccggccggac tgtatctact tgggcgccgc cggccatgtc ctcgggacgg ggcataccct 3592981 gcgcaaggag ttcaccgcct acgtgcccag ccggccgcac ctggaatggc agctgcggcg 3593041 acgggtcctg cagctctcca acgtccagat tgtgcggcgc ctggtcaccg agccacagtt 3593101 cgagcgcagg cagcagcgag tggtcggcgt gctgctggat tcccctggta gcggccaaga 3593161 tcgggaacgc gaagagttca tagctgccga ccttgtcgtc gacgcagccg gccggggtac 3593221 ccgactgccg gtttggttga cgcagtgggg atatcggcgg ccggccgaag acaccgtgga 3593281 catcggcatc agctatgcca gccaccaatt tcgcattccc gacgggctga tcgccgagaa 3593341 ggtggtggtc gccggcgcct cacacgatca gtcgctgggg ctaggcatgc tgtgctacga 3593401 ggacggcacc tgggtcctca ccaccttcgg ggtggccgat gccaaaccgc cgccgacttt 3593461 cgacgagatg cgtgcactcg cggacaaact gctgccggcc cgcttcaccg ccgcgctggc 3593521 gcaagcccaa ccgatcggct gtccggcgtt tcatgctttc ccagccagca gatggcgtcg 3593581 ctacgacaag ctggaacgtt tcccgcgcgg aatcgtcccg ttcggcgatg cggtggccag 3593641 cttcaatccc accttcgggc agggcatgac gatgacctca ctgcaagccg gccacctacg 3593701 acgggcgctc aaagcccgca actcagctat gaaaggcgac ctggccgccg aactcaatcg 3593761 ggccaccgcc aagaccacct atccggtgtg gatgatgaac gcaatcggcg acatcagttt 3593821 ccaccacgcc accgctgagc cccttccccg atggtggcgc ccagccggtt cgctgttcga 3593881 ccaattcctc ggggccgcag aaaccgatcc tgttctcgcc gaatggtttc tgcgacggtt 3593941 ttcgctgctg gacagcctgt acatggtgcc gtcggtaccg atcatcggtc gcgccattgc 3594001 tcacaatctg cgattgtggc taaaagagca gcgtgagcgt cggcaacccg tcacaacccg 3594061 acggtcgccc tgaacagctt ggcgggttgg ccggcggtca gccggatcgg gccgtcgtcg 3594121 gccgccaccc aggcggccgt gccgcgctgt agcgtgagcg acccgcactt cccgtgcacc 3594181 gtcgccgaac cctcggtgca taacaagatc tgtggaccgt catggccgga cgacgcgtcg 3594241 acctcgtggc cgaggtgatc gccgtcgagc accagtagcg tggccgcgaa ctcatcggtg 3594301 ggcgtctcaa agaccagccc cagcccctcg cgccggatcg ggggccgcag ccgagccttc 3594361 ggcgtggggg cgaagtccag cacccgcaac aactcgggca catcgacgtg cttaggggta 3594421 agtccaccgc gtaacacgtt gtcggagttg gccatcactt ccacaccgaa accacgcaca 3594481 taggcgtgca ggttgccggc cggcaggaag atcgcctccc caggagccaa gctgatgcgg 3594541 ttgagcaaca acgccgccag cacaccggcg tcgccgggat aacgttcgcc gagttccagc 3594601 actgtcttgg cttcggcgcc aaattccgtt gcgccggagc tgacgtactg gatagcgccg 3594661 tccagcacgg caggcaccag cacgtcgatg tcgggctggg gtgcggtaat ccaggtggtg 3594721 aacagcgcac gcaaaccatc ggcatcggac ccctcgctca gcaagtcgat gaacgggtcg 3594781 aggtcggata cggccagcgc ccgcagcagc tcggtggtgc gagccgcctc ccggaatccg 3594841 gccagcgcct cgaacggctg cagcgccacc aataactctg gcttgtgact ggtgtcgcgg 3594901 tagttgcgga cgggtgagga caccggaatg cccattcgct cttcccgcag gtagccctca 3594961 accgcctgct cggcgctcgg atgggcctgc aacgatagtg gctcgtcggc cgccaacacc 3595021 ttgaccaaga acggcaacac atcgccgaat cgcgcgcgcg acgcggagcc gagctgcccc 3595081 tccggatccg cgaccaacgc ttcgagcaac gaggtttggc catgcggcgt ctgcagccaa 3595141 gccggatcac ccgggtgtgc accgaaccat agttcggcct cggggtgagc ggccggcacc 3595201 ggacgcccgg tgaattcggc gatagcggtg cgcgatcccc aagcgtaggt gcgtaacgcg 3595261 ccacgtagca gttccaccgg cgatctatcc tcgcaccagt cgcagataca cggcggccat 3595321 ctccagccga acggccaata ccgccccccc ggatcccacg ggggcgtcga gcagctccgg 3595381 cacatcctca gccgcgacca gataggcgtc atcgagcccg gcaacccgag cggccaccac 3595441 cgtccgctcg ccggccagcg ccagcgccaa cacccgcagc cgctgcggtg ccggcccatc 3595501 gatttcctcg tcatggaaca gcgcatccgg cggcgtcccg gcacgtagcg ccacaaccgc 3595561 atccgaaagc ctggtagcgg ccacaacctg gtttgcgatc cgcagcatga ccgaactccc 3595621 atgccgggcc agcgccagcg tcgcggcatt gtctccagcc agggccagct ggcaaccgga 3595681 aacgcgagcg gcaagtgcct tggccgggtt ggtgaacacc tctcggccgg cgctgttgcg 3595741 gagcgcctca gcatccagct cgtctgccag cgacgccaga tcgatgcgca gcttgggatc 3595801 cacggtttgc aaggccgcca gacccgcggc caggtaccgg gacaacccga actcgtcagg 3595861 aacccgcagc cgcggttcca gcaccgcgac gcgaccggcc gtgctgtccc gcagcggacc 3595921 ctcatacggt gccaccacga caacccgcgc gcccctgcgc accccgatcg cggcggcccc 3595981 gaccagcgcc gggtcgccgg ggtcgtcgcc ggcaacgatc agcacgtcaa gcggcccgac 3596041 ccagggcggc gccgcactgg cgagcacgat cggctcggcg gccccggcac ctagcgtcga 3596101 ggccaggatg gtcccggcgg tctcagcggt cccccggccg gtcacccaga tcaccgagcg 3596161 gggacggtca ctaccgcgca gcaagtccag ttcgccctcg tcggccgcgg cagcgatggc 3596221 acgcacctgt gcgccggcca tcgatgcggc ccgcagcagg gcaccccggt cggcagcgat 3596281 caggccttcg gtgtcctcga gatcgatcgc ccgggcgacg ttcacggtcc ggccttcgca 3596341 tgtgcgctct gggcagcgat ttcagcgctg acctgacgta ccaccgcgtc aacgtccccg 3596401 acgctgcggc cctccacatt gagccgcagc aacggctcgg tgtttgagct gcgcaggttg 3596461 aaccagctgt cgtcgcctaa gtcaacggtc acgccatcga ggtgatcaat actgacaatc 3596521 cggttgccga acgatttcaa cacggcctcc acacaggccg aagagtcgac cacggtgaag 3596581 ttgatctcgc cggaggattc atagcgttgg tagtccgcgg tcaactccga cagcggtctg 3596641 ctctgctcac cgagggcggc cagcacatgc agtgcggcca gcattccgga atcggcaccc 3596701 cagaagtcac ggaagtaata gtgcgccgaa tgttcaccac cgaaaatcgc cccggtctcg 3596761 gccatcagtg ccttgatata ggagtgccca acccgcgaac gcagcggcgt accgccgcgc 3596821 tcggcgacca gctcgggcac cgcgcgggag gtgatcacgt tgtggatgat ggtggcgccg 3596881 atctcccggt tgagttcccg cgcggccacc aatgcggtaa ccgtcgacgg cgagaccggc 3596941 tggccgcgtt cgtcgaccac gaagcagcgg tcggcgtcgc cgtcgaaagc aagcccgata 3597001 tcggcgccgg tgtcacgcac ataggcctgc agatccacca ggttcgccgg gtccagcgga 3597061 ttggcctcgt gattgggaaa cgatccgtcg agctcaaaat acaagggcaa caaggtgatc 3597121 gagtcgatca ccccaaggac cgccggcgcg gtgtgaccgg ccatgccgtt gccggcgtcc 3597181 acggccaccc gcaacggacg tagccccgag gtgtccacca gcgatcgcag gaacgccccg 3597241 tagtcgacca gcacgtcctg gtcggcaatg gttccgggcg tcccgtcgta tcgtgcgacg 3597301 ccggcgatca ggtcgtcacg gatggcggtc agcccggtat cggctccgac tggtttggcg 3597361 gcggcccgac acatcttgat gccgttgtat gccgccgggt tgtggctcgc ggtgaacatc 3597421 gctcccgggc agtccaacag ccccgaggcg aaataaagct gatcggtgga cgccaaacca 3597481 actcgcacca cgtcgaggcc ctgcccggtc accccggccg cgaacgcgtc ggccagcgac 3597541 ggcgaactgt cccgcatgtc gtgaccgatc accactggtc gcgcatcctc ggtccgcatc 3597601 aaccgcgcga atgcggcgcc gagatcggta accagcgact cgtcgatctc ttcgccgacc 3597661 agcccgcgta cgtcgtaagc cttgataacg cggtccacag ccgcggcggg ccaagacatg 3597721 cgcgggctcc tgacaaccta gattttctgc gactcttggc cgccagccta tcggcccgcg 3597781 aacgacgcgg gccgaatcgg tctcgaacag catgggaaga ctagtcggcg gggtcgggca 3597841 acacccgtag atgtccgcgc cggcgcccgg ccccaggctc gggcggcgca agcacgccac 3597901 ccccggtggg cgctccggtc gccgcagcgg gaaaatcgtc gaaaccatgc agtggcgcgc 3597961 catttccgcc tggatgatgc cggcgccccg cgctcgggcc accctcgcgc accgcgtccg 3598021 ccagggccac caggtcgtcc tcgtcggggt ggctgggcag cggcccggcg tgacgcacga 3598081 gttcccaccc gcgcggtgca gtgatgcgac cggcatggcc gacacacaga tcccacgaat 3598141 ggggctcccg cgcagtggca agcggaccga tcaccgccgt cgagtccgag tagacgaacg 3598201 tcaacgtcgc cactgcatag tgcggacacc cgggccggca gcagcgacgg ggtacgttca 3598261 cgaccgaaag gctatcgtgc accaacgccg ccgaagcgcc ggacacgcgc atccgtccac 3598321 gccgcgatgt ttaaccgtta ccatcggcgc gtgagcgatt cccgcagctc ctcgtggagc 3598381 cgtcggtcgc ggggcgggtc ggtagcgcgg cgagcaatcc ggcggggccg cgagatgcgc 3598441 gggccactgc tgccgccgac agtcccgggg tggcgcagcc gggccgagcg gttcgacatg 3598501 gcagtgctgg aagcctacga acccatcgag cgacgctggc aggagcgggt gtcgcagctg 3598561 gacatcgcgg tcgacgagat cccgaggatc gcagccaaag atcccgaaag tgtgcagtgg 3598621 ccgccggaag tcatcgccga cggaccgatc gcgctggccc ggctcatccc ggccggcgtg 3598681 gacgtccgcg gaaatgcgac gcgcgcgcga atcgtcttgt ttcgcaaacc aattgaacga 3598741 cgggccaagg acaccgagga acttggtgaa ttgctgcacg aaatcctggt ggcccaggtg 3598801 gccatctacc tggacgtcga cccatccgtc atcgacccga cgatcgacga ctagttcgcg 3598861 ccgccgactc cggcggccgg gtcagatgat cccgcgtttg aggcggcggc gctcgcgttc 3598921 ggaaagacca ccccagatgc cgaaccgctc gtcatgagcc agggcgtact ccagacactc 3598981 gtgccgcacc tcgcagccca tgcaaatctt cttggcctca cgcgtggagc cgcccttctc 3599041 cgggaagaac gcttcgggat ccgtttgcgc acatagcgca cggtcctgcc attggtcggt 3599101 ggcttccggc ggcagaggtt cctcgaatgg cgccggcgcc tcgggaacca aactcagatg 3599161 cggtcgcaaa actgccgttg ctgatgcggt agccgatccg gtagtggtat gcggtgtgcc 3599221 tcccattaca ccccgaaggt gttcatagga catgcctccg cctcctcact cgatagatag 3599281 tgaaatggtt tcccactgtt ttgatgtaca gttaacccaa ttcgaacaag tgatcgaatc 3599341 tcggtctgcg acaccgaaac cggccggcca accgcgaaat gacactgatg tgattagaca 3599401 caagttgggg acgcgggtca agtgtgccgg cgcatttcca tatcatctcg taataaaatt 3599461 tccgcggttc tgttgtggtt gggtcccggc gtgtcgagcg tgactcgtaa ccaacgtttg 3599521 gtgatgggcg ccgggaggta ctgtcctgcg atgtgaaggt caccgttctg gccggtggag 3599581 tcggcggcgc ccgcttcctg ctcggggtcc agcagctgct cggcctgggc cagtttgctg 3599641 ccaattctgc ccactcggac gccgaccacc aactgagcgc tgtcgtcaac gtcggcgacg 3599701 acgcctggat ccacgggctg cgtgtctgcc cggatctgga cacctgcatg tataccctgg 3599761 gcggcggggt ggacccccag cgcggctggg gccagcgtga cgaaacttgg cacgccatgc 3599821 aggaactggt gcgctatggc gtgcagcccg actggttcga gctcggggac cgcgatctgg 3599881 ccacccatct ggtgcgcacc cagatgctgc aggccggcta ccccctgtca cagatcaccg 3599941 aggccctatg cgatcgctgg caaccgggcg cccgcttgct gcctgccacc gacgaccgtt 3600001 gcgaaaccca tgtagtgatc accgacccgg tcgacgaaag ccgcaaggcg atccattttc 3600061 aggagtggtg ggtgcgctac cgtgcccagg tgccgacgca cagctttgct tttgtcggcg 3600121 ctgaaaagtc cagcgctgca accgaagcga tcgccgccct ggccgacgcc gacatcatca 3600181 tgctggcgcc gtctaatccg gtggtcagca tcggcgccat cctggccgtc cccgggattc 3600241 gcgcggcgtt gcgggaagca accgcaccga tcgtcggcta ctcgccgatc atcggcgaaa 3600301 agccgttgcg cggcatggcc gatacgtgcc tttcggttat cggggtggat tccaccgcgg 3600361 ccgctgtggg ccggcactac ggcgcgcggt gcgccaccgg gatactggac tgctggctgg 3600421 tgcacgacgg cgaccacgct gagattgacg gggtgacggt gcggtcggtg ccgctgctga 3600481 tgaccgaccc gaacgcgacg gctgagatgg ttcgcgccgg gtgcgacctt gcgggagtgg 3600541 tagcttgacc ggccccgaac atggctccgc ctcgaccatc gagatcctgc ccgtcatcgg 3600601 gctgcccgaa ttccgtcccg gcgacgatct gagcgccgcc gtcgccgcgg cggcaccgtg 3600661 gctacgcgac ggtgacgtcg tggtggttac cagcaaggtg gtgtccaaat gcgagggccg 3600721 gctggttccg gctcccgaag accccgagca aagagaccga ttgcgccgca agctgatcga 3600781 ggatgaggca gtgcgcgtgt tggcgcgcaa ggaccgcacg ttgatcaccg agaatcgact 3600841 cgggctggtt caggcggccg ccggcgtgga cggatccaac gtcggccggt ccgagttagc 3600901 gctgctgccg gtcgatcctg acgccagtgc cgcaaccttg cgcgccgggc tgcgcgagcg 3600961 gctcggcgtc accgtcgccg tggtcatcac cgacaccatg ggacgcgcct ggcgcaacgg 3601021 ccagaccgat gccgcagtcg gcgctgccgg tctggcggtg ctgcgcaact atgccggtgt 3601081 ccgcgaccca tacggcaatg agttggtggt caccgaggtc gcagtcgccg acgagatcgc 3601141 cgcggccgcc gacttggtca aaggcaaact gaccgcgacg ccggtggcgg tggtgcgtgg 3601201 gttcggcgtg tccgacgacg gctcgacagc ccggcaactg ctgcggccgg gcgccaacga 3601261 cctgttctgg ctcgggaccg ccgaagcgct cgagctgggt cgccagcaag cccaactgtt 3601321 gcgcaggtcc gttcgccggt ttagcaccga tccggtgccg ggcgacctcg tcgaggctgc 3601381 ggtcgccgag gccctcaccg cgccagcccc acatcacacc cggccgaccc gattcgtgtg 3601441 gctgcagaca ccggccatcc gcgcgcggct gctagatcgg atgaaagaca agtggcggtc 3601501 tgatctcacc agtgacggct tgcccgccga cgcgatagaa cgccgggtgg cacgcggcca 3601561 gatcctctat gacgcacccg aagtcgtcat accgatgctg gtgcccgacg gagcacacag 3601621 ctaccccgat gccgcccgca ccgacgccga gcacaccatg ttcacggtcg ccgtcggagc 3601681 ggccgtacaa gccttgctgg tcgcgctggc cgtgcgcggg ctgggcagtt gctggatcgg 3601741 ctcgacgatc tttgccgctg acctggtccg cgacgagctg gacctgccag tcgactggga 3601801 gccgttgggc gccatcgcga tcggatatgc cgacgagccg tccgggttgc gcgacccggt 3601861 gcctgccgcc gatttgctga tcctgaagtg acattcgctc tagcgacgat aggctaccca 3601921 gacatggcgg tcctgcagcc gatgccaacc atcaacctcc cgacggatca attcaccgcg 3601981 ttcggtcaaa agtggctcct cggctcgaaa ttctccaaga aggacgacag gacttaggcg 3602041 ccgtgataga tgccgctgtg ggcggcgcac tgtcggtgat gctcggcaac atcccattgg 3602101 tggttccgaa cgccaaccag ctgtaacctt cccaagcgcc gacgtgtacc gctgctatcc 3602161 ggcccgattc cagggacagc caccccatgc aacctagtca tccgacgcgc cctggtgcgg 3602221 tcatcagata tgtcggtagc tcccttgata cttgtcccat gacgacgttc gccggcaaaa 3602281 cggctgcgtc cgctgacaag gtgcgcgggg gctactacac gccgccggcg gtggcccgat 3602341 tccttgccca ctgggttcac caggcggggc cgaagatcct cgaaccatcc tgcggcgatg 3602401 gccgaatcct gcgcgaactc tccgccatca cagaccacgc gcacggtgtg gaactcgttg 3602461 cgcgcgaggc gaaaaagtcg cgggacttcg cgtccgtcga cactgagaac ctttttacct 3602521 ggctgcacaa gacccaactc ggcagctggg atggcgttgc cggcaacccg ccctacatcc 3602581 gcttcggaaa ctgggcatcc gaacaacggg atccggcact cgaattgatg cggcgtgtgg 3602641 gcctacgacc gaccaaactg accaatgcct gggtcccgtt tgtcgtggcg agcacgacgc 3602701 tagcgcgtga cggcggccga gtgggcctgg tggtcccggc ggaattgctt caagtcacct 3602761 acgcggcgca gctacgcgaa ttcctgctga gccgctatcg ggagatcacc ctggttacct 3602821 tcgagcggct ggtgttcgac ggaatcctgc aggaagttgt gctgttctgc ggcgtcgtcg 3602881 gtcccggtcc tgcacacata cgcaccgtca ggctcggcga tgcgaacgat ctgaacgcgc 3602941 tgggggacaa ggacttcacc aatgagtcag cgccggcgct tctccacgaa aaggagaagt 3603001 ggaccaagta cttcctcgac cccgctcaaa tccggctact gcgaggactc aaacagtccg 3603061 ccactatgat caggctcggc gaactggccg acgtggatgt gggcatcgtg accggccgca 3603121 acagcttctt cacgttcacc gatgccaagg cacaagcgct gggattgcga gcgcactgcg 3603181 ttcccctggt ctctcgcagc gcccaactca gcgggctgat ctatgacgag gattgccggg 3603241 catgcgatgt cgccggcaac caccgaacgt ggctactcga cgccgcggac tatccaaccg 3603301 atccagctct cgtcgctcac atcaccgcgg gtgaagcggc cggcgtccac ctcggctaca 3603361 agtgctcgat ccgcaagcca tggtggagca caccatcgct gtggatgccc gacctcttta 3603421 tgctgcgcca gatccacttc gccccgcggc tgaccgtcaa cgctgccgcg gcgaccagca 3603481 ccgataccgt gcaccgggtc cggctcgacc cgaacgtcga tccggcaact cttgccgcgg 3603541 tgttccacaa cagcgcgaca ttcgcgttcg ccgagatcat gggccgcagt tatgggggcg 3603601 gcatcttgga gttggagcct agggaagccg agcaactacc tatgccaccg ccggcgtacg 3603661 ggagcgcaga acttgcccag gatgttgatc tcctgctgaa agcaaacgag atcgacaagg 3603721 cgctcgacgt cgtggaccgt cacgttctga tcgacgggct cggcttgtcg ccgcgcctgg 3603781 tcgcaggttg ccgagcggca tggctcacgc tccgcgaccg caggaccaag cgcggatctc 3603841 ggcgataacc gcggcgggtg agcgcctcgc gtgcccggcc aacgatgtcg atctcggcgc 3603901 aagaagctca aacgtcggac gagtaacgga tcccgccgtc gggaagaaag acaccgggcc 3603961 atacccgggc accacttaac aactcgcagc gcgcgccgat gtcggccccg tcaccgatca 3604021 caccgtcgcg gatcaacgcc cgcggtccga tgcgagcacc gaagccgatg atcgaacgct 3604081 cgatcacgca cccggcctcc acccggacac catcgaagat gaccgcgccg tccaatctgg 3604141 tgccggggcc gatttcggca ccacgcccca cgacggtgcc gccaatcagc aacgcaccgg 3604201 gagataccgc cgcaccgtcg tgcaccaact gctcaccgcg gtgaccacgc aaggccggag 3604261 acggggcgat gccgcgcacc agatccgccg atccgcgaac gaagtcttcc ggtgtgccca 3604321 tgtcccgcca atagctggca tcgacatagc cgtagatctt gcagtcgccg tcggcgagca 3604381 aggccgggaa cacctcgcgt tccaccgaaa cctcccggcc ctgcggaatc cggtcgatga 3604441 cgttgcgttc gaagacatag cagccggcat tgatctggtc ggtcggcgga tcctccgtct 3604501 tctccagaaa ggcgactacg cggtcctcct cgtcggtggg tacgcagccg aatgcccgcg 3604561 ggtcgcccac ccgcaccagt tgcagcgtga catcggctcg attgcttcgg tggaagtcca 3604621 gcagttgggc cagatccgcg cccgagagca catcgccgtt aaacaccatc gcggtgtcgt 3604681 tgcgcagctt gccggcaacg ttggcgatgc cgccgccagt ccccaaggga tgctcctcgg 3604741 tcacgtattc gatctgtagg cccagtgcgg acccgtcgcc gaactccgct tcgaagactg 3604801 cgggtttgta ggacgtaccc aggatcacgt gctcgatgcc cgctgcggcg atccgcgaca 3604861 gcagatgggt gaggaacggc agtccggcgg taggcagcat tggcttgggc gccgacagcg 3604921 tcaacggccg cagtcgggta cccttgccac cgaccaggac caccgcatcg acttggtgag 3604981 ttgccaactc agtgccgccc ttctaccagc ttcagtttcc gtctgcggga cctgcgcacc 3605041 atgaggtggg aacgcagcgc cagtgatccc cgcagggtcc agcgcagcgg agcccgccac 3605101 caaccagaat gtcggtcggc taagaagata taggtgcttt tgtgatgggc ggccagatgg 3605161 cttgccgggt cgcgacccgt cgaatgcgcc ttgtggtgca gaacctcggc tgacggcaca 3605221 tacaccgaca gccaaccggc tttgccaagc cggtcgccaa ggtcgacgtc ctccatgtac 3605281 atgaagtaac gttcgtcgaa tccgccgacc tggccaaacg ccgaccggcg caccagtagg 3605341 caagaccccg acaaccaacc caccggccgt tcactgggct ccagccgctc ctgccggtag 3605401 gccgtcgtcc acggattgcg cggccagaac ggcccgagca ctgcgtgcat gccgccgcgg 3605461 atcaggctgg gcatctgccg cgccgacggg tacaccgacc cgtcggggtc ccgaatcagc 3605521 gggcccagcg cgcccgcgcg gggccagcgg gaggcggcgt ccagtagtgc atcgatactg 3605581 cccgggcccc attgcacgtc cgggttggcc acgatcaccc agtcatcgac ccagggttcg 3605641 ccggcatcgc ccgccatttc accgagctgg gcgatcgtcc gattcaccgc ggttccgtac 3605701 ccgaggttgg cccctgtggg cagcagccgc acgttggggt agcgctgcac cgcggcctgc 3605761 ggggtgccgt cggtggagcc gttgtctgcc aacagcacgc tgaccggccg ctcggtggcc 3605821 agcgacaacg acgccaggaa ccgctctaga tggggccccg gcgagtaggt caccgctacc 3605881 accggcagga cgtcagtcac gcgttgaggg taaccgtcga tcgatcgaag ttgagttcgc 3605941 aggtgctgcc agcgccgtgg ccagtgcgct gcgccagtgc cgtagcggcg tcaagcccgc 3606001 cagcgcccac tgcctgctcg acagcgcgga atagctcgga cgcggcgcgg gccgcggaaa 3606061 ctgcgcgctg ctgaccggac gcacccgctg tgggtcggca ccgcattctt cgaacaccgc 3606121 gcgggcttga ccgaaccggg agaccacgcc ctcgttagcg gcgtgcaaca cgcgtccgcg 3606181 cacgcccgcg tcggccaacg ccagcagcgc ctcggccagg tcggcgacgt aggtcggcga 3606241 cccggtctgg tcgtcgacca catccacccg accgtgtccg gcggccagcc ggcgcatgac 3606301 ggcgacgaaa tccttgccgg tcccgccggt gtagacccag gcggtccgta ccacggcagc 3606361 ctccgggaac gctgccagca cagcctgctc gccggcgagt ttgctgcggg catacacgcc 3606421 ctgcggcgcg gtttcatcgg tgggctcgta gggccggggc tcggcgccgc cgaagtcgcc 3606481 atcgaatacg tagtcggtgg agacgtggat taaccgagca cccacacgag cgcacgcacg 3606541 ggcgaggtgt tgcgggccag tggcattgac cgcataggcg actgcctcat tgctctcggc 3606601 gccgtcgacg tcggtgtagg cggcgcaatt gatcaccacg tcaccgtgtc ggatgatccg 3606661 ctcggccgca gcggggtcgg tgatatccca ctgcgaggaa gtcagcgcca gcatatcgcg 3606721 gccttcccgg gcggcctgtg ccgtcagatg gctgcccagc tgcccgcccg caccggtgat 3606781 gactagcctt tctgacctgc ccgccatgtg tttgagtctg gcacgcctcg ggcacgccgg 3606841 ggttggctac ccgacagggc gccgttacac aagtagtcta gtgtgatgtc tgcgcaacgt 3606901 gtggttcgta cggttcgtac cgctcgggct atttccacgg cactggccgt cgcgatcgtc 3606961 cttggcaccg gggtggcgtg gagcagtgtc cggtcgttcg aagacggcat cttccacatg 3607021 tcggcgccct cgctggggca cggcggcgac gacggcgcga tcgacatttt gctggtcggc 3607081 ctggacagcc gtaccgacgc gcacggcaac ccgttgagcg ccgaggaatt ggcgacattg 3607141 cacgccggcg acgaggaagc caccaacacc gacaccatca tcctgatccg ggtacccaac 3607201 aacggaaagt cggcgaccgc aatctctata ccgcgggact cctacgtcgc ggctcccggt 3607261 ctgggtaaga ccaagatcaa cggcgtctac gggcaaacca gagagaccaa gcgggccggc 3607321 ctggtccaag ccggtgcctc gccgaccgaa gcggccgccg ccggcaccga ggccgggcgt 3607381 gaggcgttga tcaagacggt cgccgatctg accggcgtca ccgtcgacca ctacgccgag 3607441 atcgggctgc tcggtttcgc gttgatcgcc gacgcactcg gcggcgtcga cgtctgcctc 3607501 aaagagcctg tatacgaacc actttcgggt gccgattttc cagccgggcg gcaaaagctc 3607561 aacggtccgc aagcgctcag cttcgttcgc cagcggcatg atctgccccg cggcgacctg 3607621 gaccgggtgg tacgtcagca ggcggtgatg gcggcgttgg cccaccgggt catctccgga 3607681 cagacgctat ccagccccgc cacgctgaag cggttggagc aggccgtgca gcgctcggtg 3607741 gtgctgtcct ccgggtggga catcatggat ttcgtccgcc aattgcagaa gctggccggc 3607801 ggtaacgttg ccttcgccac catcccggtg ctcgacggcg ccggctggag cgacgacggc 3607861 atgcaaagcg tggtgcgggt ggatccgcgt caggtgcagg actgggtcgt cggcctgctg 3607921 cacgagcagg accagggcaa gaccgacgag ctggcctaca cacccgccaa gaccacggcc 3607981 aacgtggtca acgacaccga tatcaacggg cttgcggcag cggtgtcaaa ggtgttgagc 3608041 tccaaggggt ttaccaccgg atccgtcggc aacaacgacg gcgaccacgt gcctggcagc 3608101 caggtgcggg ccgcaaaggc cgacgacctg ggcgcacagc aggtcgccaa ggaactgggc 3608161 gggttgccgg tggtcgccga tgcgtcaatc gcgcctgggt cggtgcgggt ggtgctggcc 3608221 aacgactaca gcggtccggg ctccgggctg gggggtagtg atccgaacgg cgtcgtatcg 3608281 ccggcccgcg cgttcaacct cgggtccgcc gacgacacga ctcccccgcc gtcgccaatc 3608341 cttaccgccg gctccgacgc gccggagtgc atcaactgac cacaccgacc accctgagcg 3608401 gggcgatcct ggatccgatg ctgcgcgccg acccggtcgg cccgcgcatc acctactatg 3608461 acgatgccac cggtgagcgc atcgagctat ccgcggtgac actggctaac tgggccgcca 3608521 agaccggcaa cctgttgcgc gacgagctgg cggccggacc cgccagccga gtcgcgatcc 3608581 tattaccggc ccattggcag accgcggcgg tgttgttcgg cgtgtggtgg atcggtgcgc 3608641 aagcgatact cgacgattct cccgccgatg tggcactgtg caccgccgac cgtctggccg 3608701 aagccgacgc cgtcgtcaac agcgcggcgg tagccggcga ggtagccgtg ctgtcgctgg 3608761 atccattcgg tcgaccggca accggcctgc cggtcggcgt caccgactat gcgaccgcgg 3608821 tgcgggtaca cggcgaccag atagttcccg aacacaaccc cggtccggtg cttgccggta 3608881 gatccgtcga gcagatcctg cgcgactgcg cggcgtccgc ggccgccagg ggtttgacgg 3608941 cggcggatcg ggtgctgtcc accgcttcct gggccggacc cgatgagttg gtggacggcc 3609001 tgctggcgat cctggccgcc ggtgcgtcgt tggtgcaggt ggccaatccc gatccggcga 3609061 tgctgcagcg caggattgcg accgaaaagg tcacccgcgt cctgtgacgc aggccgcgtc 3609121 cagcaggcga aggcatcaga gcaatacata ttgatatcgc gatatataga tgttaatgtc 3609181 actgcaacga gctgccgctg caattacaga cccggaagaa aggtacaggc aatggcgata 3609241 caagtgttct tggcgaaggc gacaacgacg gtgatcaccg gcttggccgg cgtgaccgcc 3609301 tacgagatct taaaaaaggc cgcggccaaa gcgccgcttc gtcagaccgc ggtatcggca 3609361 gcagcgctgg gtctgcgcgg aacccgcaag gccgaggaag ccgcggaatc ggcccgccta 3609421 aaggtggccg acgtgatggc cgaggctcgt gagcgcatcg gcgaggaatc gcccactcca 3609481 gcgatcagcg acctgcacga ccacgaccac tgagcgcctc gccatgaccc tggaagtggt 3609541 atcggacgcg gccggacgca tgcgggtcaa agtcgactgg gtccgttgcg attcccggcg 3609601 cgcggtcgcg gtcgaagagg ccgttgccaa gcagaacggt gtgcgcgtcg tgcacgccta 3609661 cccgcgcacc gggtccgtgg tcgtgtggta ttcacccaga cgcgccgacc gcgcggcggt 3609721 gctggcggcg atcaagggcg ccgcgcacgt cgccgccgaa ctgatccccg cgcgtgcgcc 3609781 gcactcggcc gagatccgca acaccgacgt gctccggatg gtcatcggcg gggtggcact 3609841 ggccttgctc ggggtgcgcc gctacgtgtt cgcgcggcca ccgctgctcg gaaccaccgg 3609901 gcggacggtg gccaccggtg tcaccatttt caccgggtat ccgttcctgc gtggcgcgct 3609961 gcgctcgctg cgctccggaa aggccggcac cgatgccctg gtctccgcgg cgacggtggc 3610021 aagcctcatc ctgcgcgaga acgtggtcgc actcaccgtc ctgtggttgc tcaacatcgg 3610081 tgagtacctg caggatctga cgctgcggcg gacccggcgg gccatctcgg agctgctgcg 3610141 cggcaaccag gacacggcct gggtgcgcct caccgatcct tctgcaggct ccgacgcggc 3610201 caccgaaatc caggtcccga tcgacaccgt gcagatcggt gacgaggtgg tggtccacga 3610261 gcacgtcgcg ataccggtcg acggtgaggt ggtcgacggc gaagcgatcg tcaatcagtc 3610321 cgcgatcacc ggggaaaacc tgccggtcag cgtcgtggtc ggaacgcgcg tgcacgccgg 3610381 ttcggtcgtg gtgcgcggac gcgtggtggt gcgcgcccac gcggtaggca accaaaccac 3610441 catcggtcgc atcattagca gggtcgaaga ggctcagctc gaccgggcac ccatccagac 3610501 ggtgggcgag aacttctccc gccgcttcgt tcccacctcg ttcatcgtct cggccatcgc 3610561 gttgctgatc accggcgacg tgcggcgcgc gatgaccatg ttgttgatcg catgcccgtg 3610621 cgcggtggga ctgtccaccc cgaccgcgat cagcgcagcg atcggcaacg gcgcgcgccg 3610681 tggcatcctg atcaagggcg gatcccacct cgagcaggcg ggccgcgtcg acgccatcgt 3610741 gttcgacaag accgggacgt tgaccgtggg ccgccccgtg gtcaccaata tcgttgccat 3610801 gcataaagat tgggagcccg agcaagtgct ggcctatgcc gccagctcgg agatccactc 3610861 acgtcatccg ctggccgagg cggtgatccg ctcgacggag gaacgccgca tcagcatccc 3610921 accacacgag gagtgcgagg tgctggtcgg cctgggcatg cggacctggg ccgacggtcg 3610981 gaccctgctg ctgggcagtc cgtcgttgct gcgcgccgaa aaagttcggg tgtccaagaa 3611041 ggcgtcggag tgggtcgaca agctgcgccg ccaggcggag accccgctgc tgctcgcggt 3611101 ggacggcacg ctggtcggcc tgatcagcct gcgcgacgag gtgcgtccgg aggcggccca 3611161 ggtgctgacg aagctgcggg ccaatgggat tcgccggatc gtcatgctca ccggcgacca 3611221 cccggagatc gcccaggttg tcgccgacga actggggatt gatgagtggc gcgccgaggt 3611281 catgccggag gacaagctcg cggcggtgcg cgagctgcag gacgacggct acgtcgtcgg 3611341 gatggtcggc gacggcatca acgacgcccc ggcgctggcc gccgccgata tcgggatcgc 3611401 catgggcctt gccggaaccg acgtcgccgt cgagaccgcc gatgtcgcgc tggccaacga 3611461 cgacctgcac cgcctgctcg acgttgggga cctgggcgag cgggcagtgg atgtaatccg 3611521 gcagaactac ggcatgtcca tcgccgtcaa cgcggccggg ctgctgatcg gcgcgggcgg 3611581 tgcgctctcg ccggtgctgg cggcgatcct gcacaacgcg tcgtcggtgg cggtggtggc 3611641 caacagttcc cggttgatcc gctaccgcct ggaccgctag cagccgcagc cgtgaccacg 3611701 ccaggtgcgg atgccctgcc agaccgcgat accggcgatg gccagcccga tcgcggggtc 3611761 aatccaccag ccgttcgacc acacggcagt gatcgccagc ccaagcagaa ccgcggcggc 3611821 ctgagcagca cacaggtagt tctgggtgcc ctcgcccgcg gtggcccccg atcccagccg 3611881 ctcacccact cggtggttgg cccagcccag gaccggcatc agcagcaggg cgatggccgt 3611941 cagtccgatg ccgatcaccg aggtctcggc acgatgctcg ccggctaggt ggcggatgga 3612001 ttcggcaacg aggtaggggg ccgtcagcca aaaagacacc gcaactccac gctgtgcgcg 3612061 gtgctccgcg gtcgcggacc aagtgcggtc gccggtgaac cgccagagca ccatcgcgct 3612121 ggccaggccc tcggatccgc cacccagcgc ccacccggtc aacgcgacgg atccgaccgc 3612181 aataccctgc cacagcccca cggcaccttc ggtgagcaat accgccaggc tgacccacgc 3612241 cagccagcgg gcccaccgaa cgttccgctg ccattcggcc tctcgcgcca ccgacacggg 3612301 cgaatccagc gtggattcat cgcggtgttc cgtcgtcgtc tccatcccga cgatggtaga 3612361 ggcaagacat gccgggcggt cgccgcggcg tcgcgaaccc gtatggttca gggaggatgc 3612421 cgcacgccag ggaaggtcac caccgatgcc gaccagcaac cccgccaaac cacttgacgg 3612481 gtttcgggta ttggatttca cccagaacgt ggccgggccg ctggccgggc aggtgctggt 3612541 cgacctgggg gctgaagtca tcaaggtgga ggcgcccggc ggtgaagcgg cccgtcagat 3612601 cacctcggtg ttacccggac gcccgcccct ggccacctac tttctgccca acaatcgtgg 3612661 caagaagtcg gtgacggtgg acctaaccac cgagcaggcc aagcagcaga tgctgcggct 3612721 cgcggacacc gccgacgttg tcttggaggc gtttcggccc ggcaccatgg aaaagctggg 3612781 cctaggccct gatgacttgc gctctcgtaa ccccaacctg atctacgcgc gcctaaccgc 3612841 ttacggcggc aacggcccgc acggcagccg gccgggaatc gacctggtgg tggccgccga 3612901 ggccggcatg accaccggaa tgcccacgcc tgagggcaag ccacagatca tcccatttca 3612961 gctcgtcgac aacgccagcg gtcacgtgct ggcccaggcc gtgctggccg cgctgctgca 3613021 ccgcgagcgg aacggggtgg ccgacgtcgt ccaggtcgcg atgtacgacg tcgcggtggg 3613081 actacaagcc aaccagctga tgatgcatct caatcgggcc gctagcgacc agccgaagcc 3613141 tgaaccggca ccgaaggcca agcggcgcaa gggagtcggc ttcgctaccc agccatcgga 3613201 cgcgtttcgc accgccgatg ggtacatcgt catcagcgca tatgtgccca aacactggca 3613261 gaagctgtgc tacctcatcg gccggcctga cctcgttgaa gatcaacgat ttgccgaaca 3613321 acgctcccgg tcgatcaact acgccgagtt gaccgccgag ttggaattgg cactggccag 3613381 caagaccgcc accgaatggg tccagttgct gcaggcaaac ggcctcatgg cctgcctcgc 3613441 ccatacctgg aaacaggtcg tcgacacccc ccttttcgcc gagagcgacc tcaccctgga 3613501 agtcggtcgc ggggcggaca ccatcacggt gatccgcaca ccggcgcgct acgccagctt 3613561 ccgcgcggtc gtcaccgatc ccccgcccac cgccggcgaa cacaatgccg tgtttctggc 3613621 ccggccctga cgctgtgacc attccgagga gtcaacacat gagcaccgca gtcaacagct 3613681 gcaccgaggc gcccgcatcg cgatcacagt ggatgctggc taatctgcgg cacgatgttc 3613741 ccgcatcact tgtcgtcttc cttgttgcgt tgccactttc gctggggatc gcgatcgcct 3613801 ccggggcccc gataatcgcc ggtgtgatcg ccgccgtcgt aggcggcatt gtcgccgggg 3613861 cggtcggtgg gtcgccggtt caggtcagcg gcccggccgc gggtctgacc gtggtggtcg 3613921 ccgagctgat cgatgagctc ggttggccga tgctgtgtct gatgacgatc gccgcgggtg 3613981 cactgcagat cgtgttcggc ctaagtcgga tggcgcgcgc cgcgctggcc atcgccccgg 3614041 tcgtggtgca cgccatgctg gccggcatcg gtatcaccat cgcgctgcag caaattcatg 3614101 ttctgctcgg tggtacgtcg cacagctcgg cgtggcggaa catcgtagcg ttgccggacg 3614161 gcatcctcca tcacgaactg cacgaagtga tcgtcggcgg gacggttatc gcgatcctgt 3614221 tgatgtggtc aaagctgccc gccaaggtgc gtatcattcc cggcccactg gtagccatcg 3614281 cgggcgcgac cgtgcttgcg ttgctacccg tgctacaaac cgaacgaatc gacctgcagg 3614341 gcaacttctt tgacgcgatt ggcttgccca aacttgccga aatgtccccg ggaggacagc 3614401 cgtggtctca tgagatcagc gccatcgcgc tcggtgtcct caccattgcg ctgatcgcaa 3614461 gcgtcgaatc gctgctgtcg gcggtcggtg tcgacaagct gcatcacggc ccgcgcaccg 3614521 acttcaaccg ggagatggtc gggcagggca gcgcgaacgt ggtgtccgga ttgctcggcg 3614581 ggctgcccat caccggtgtc atcgtgcgca gctcggccaa cgtggccgcc ggcgcccgaa 3614641 cccggatgtc gacgatcctg cacggagtgt ggatcctgct gtttgcgtca ctgttcacca 3614701 acctggtgga actgattccc aaggcggcgc tggccggcct gctcatcgtg atcggtgccc 3614761 agctggtcaa gctggcgcac atcaaactag cttggcgcac aggaaatttc gtaatctacg 3614821 ccatcaccat cgtgtgtgtg gtgttcctca atctgctgga aggcgtggcc atcgggctgg 3614881 tcgtggcgat cgtattcctg ttggtgcggg tggtacgcgc gcccgtcgag gtcaagccgg 3614941 tcggcggcga gcagtccaag cgatggcggg tcgatatcga cggcacgttg agcttcctgc 3615001 tgctgccccg cctgaccacg gtgctctcga agctgccgga agggtcggag gtgacgttaa 3615061 acctgaacgc agactacatc gacgactccg tttccgaggc catctccgat tggcggcgcg 3615121 cccacgagac gaggggcgga gtggtagcga tcgtggaaac gtcgccggcc aaactgcacc 3615181 acgcacacgc ccgaccaccg aagcgccact tcgcgtctga tccgattgga ctggttccgt 3615241 ggcgatcagc gcgcggcaaa gaccgcggca gcgcttcggt tctcgaccgc atcgacgagt 3615301 atcaccgcaa tggcgcggcc gtgctgcacc cgcatatcgc cgggctgacc gattcacagg 3615361 acccgtatga gctgttcctc acctgtgccg actcgcggat tctgccgaac gtcatcaccg 3615421 ccagcggccc cggcgacctg tacaccgtcc gcaacctcgg caacctggtg ccgaccgatc 3615481 cggacgaccg atcggttgac gcggcactcg acttcgccgt caaccagctc ggcgtcagct 3615541 cggttgtcgt ctgcggacat tcgtcgtgtg ctgcgatgac ggcgctcctg gaagacgacc 3615601 cggccaacac gacgactccc atgatgcgtt ggctcgagaa tgcccacgac agcctggtgg 3615661 tgttccgcaa tcaccacccg gcacgccgca gcgccgaatc cgccggttac cccgaagccg 3615721 accagctgag catcgtaaac gttgccgttc aggtggaaag gctgacccgc cacccgatct 3615781 tggcgaccgc ggtcgccgct gctgatctac aggtcatcgg catattcttc gacatctcga 3615841 ccgcccgggt atacgaggtg ggtccgaacg gcatcatctg cccggacgag ccggccgacc 3615901 gccccgtcga ccacgaatca gcgcagtagc gcccgcgaca tcactacccg ctgaatctga 3615961 ttggtgccct catagatctg ggtgatcttg gcgtcgcgca taaaccgctc gaccgggaag 3616021 tcggtggtgt agccggcgcc gccgaacagt tgtacggcat cggtggtgac ctccatcgcg 3616081 acgtcggagg cgaagcactt cgaggccgcc gaaatgaagc ccagatccgg ctcaccgcgt 3616141 tcggcgcggg cggcggcgga gtaaaccatc agccgagccg cctccacctt catcgccatg 3616201 tcggccagca tgaactgcac ggcctgaaac gtactgatcg actcaccgaa ctgcttgcgg 3616261 tccttggtgt aggcgatggc agcatccagc gcgccctggg cgatacccac ggcctgcgcg 3616321 ccaatcgtgg gacgggtgtg gtccaacgtg gccagcgcgg tcttgaaacc ggtaccgggc 3616381 tcaccgatga tgcgatcgcc ggggatgcgg cagttctcga agtacagctc ggtggtcggt 3616441 gaccccttga tcccgagctt gcgttctttc ggaccgacgg tgaacccctc gtcgtccttg 3616501 tgcaccatga acgccgagat gccgttggcg ccccggtcgg gatcggtcac cgccatcacc 3616561 gtgtaccagg tcgacttgcc gccgttggtg atccagcact tggcgccgtt gagaatccag 3616621 tgatccccat cggccttggc ccgcgtccgc atggacgccg cgtcactgcc ggcctcgcgt 3616681 tcactcaatg cataggaagc catcgcccct tcggcggcca acgccggcag cacctgcttc 3616741 ttcagctcct cggagccccg caggatcagg cccatggtgc ccagcttgtt gaccgcgggg 3616801 atcaacgacg cggacgcgtc gacgcgggcc acctcttcga tcacgatgca ggtagctacc 3616861 gagtcggcac cctgaccgcc gtactcctcc ggaatgtgga cggcgttgaa accggaggaa 3616921 ttgagcgcca ctagcgcttc ttcggggaac cgcgccttct cgtccacctc ggcggcatgc 3616981 ggagcgatct ccttttccgc caaagcccgt atcgccgatc gcatttcgtc gtgttcctcg 3617041 ggcagcttga acagatcgaa cgacgggttt ccggcccatc caaccatctt ggagccctcc 3617101 taatctccgt gctagtcgcg ggttaactta cccgcaagcc gctgcagttc cgcatccttg 3617161 gccgccacga cgtcggccag ccggtcctgg aatgcgacga tccgggccct cagctggggg 3617221 ttggcggctc ccagcatccg caccgccagc agtccggcat tacgggcgcc cccgatggac 3617281 accgtggcca ccggaacccc ggccggcatt tgcacgatcg acagcaggga gtcaaggccg 3617341 tccagcctgc ccagcggtac cggcaccccg atcaccggca gcggcgtcgc ggcggcgacc 3617401 ataccgggca agtgcgcggc cccgcccgct ccggcgatga tcacctcgag accgcgctcg 3617461 gccgcgccgc gcgcataact gaacatcgcc tcaggggtgc gatgggccga aacaacccga 3617521 acctcggccg gaatgtcgaa ctcggccagc gccgccgcag cgtcggccat caccggccag 3617581 tcgctgtcgc tgcccatgat caccccgacc cggggccgct cgccggcagg agtcataggc 3617641 gccgctcctc ctcatcgctt cgtcccccgc acgcgggtgg tacccccact gcatcgtcgc 3617701 tggcgcggtg tgggtcccat ccgtcagtcc accgcccatg ggacaaccag tgtgccgcca 3617761 gctcagcgcg ttcacacaac tgggcgacat cggagccaag gaagttgata tgccccacct 3617821 tgcgaccggg tcgctcggcc ttgccgtaga ggtgaacccg ggcgtcgggc attcgcgcaa 3617881 acagatggtg cagccgctcg tcgacgctca tggccggcgg ctgcgcggcg ccgagcacat 3617941 tggccatcac cgtcacgggc accacggcgt cgctgtcgcc gagcgggtag tccaagaccg 3618001 cgcgcagatg ctgctcgaac tggctggtgc gcgccccgtc gatggtccag tgcccggaat 3618061 tatgtggccg catcgcgagc tcgttgacca gcaacgcccc gtcggtcgtc tcgaacagct 3618121 cgacggcgag cacgccgacc acaccgagtt cgtcggccag ctgcaacgcc aaccgttgcg 3618181 ccgcggtggc caggtcgtcg ggcagcgccg gcgccggcgc gatcaccagc acacacgtgc 3618241 cgtcacgttg caccgtctgg accaccggcc acgccgcacc ctggccgaac ggcgaacgcg 3618301 ccaccagtgc cgacagctcg cggcgcaggt ccacccgttc ctcgaccagc accgccacgc 3618361 cgtcagccag gcattcgcga gcgaaatcac gggcatccgc cacatcacgt gccatccgaa 3618421 cgccccggcc gtcgtaaccc ccgcgcactg ccttgaccac gatcggggcg tcgacacgtg 3618481 cggcgaagac gtcgatttcg tcggggtctt tgatgcccgc gtagcggggc acggcgacgc 3618541 ctgctgcagc cagacgctgc cgcatgacga gtttgtcctg ggcgtgcacc agcgcctgcg 3618601 gcgacggtgc gacattgacg ccatcggcga ctagcttctc caacagctcg ttcgggacgt 3618661 gctcgtggtc aaaggtcagc acgtcggcgc cggccgcaac gcggcgcaag gcggcaagat 3618721 cggtgtgcga gccgatcacc acgttggggg tgacctgcgc ggcagggtca tctgccgagg 3618781 tgaccaatac acggaggttc tgccccagcg cgatggcagc ctgatgggtc atccgggcca 3618841 gctgaccgcc accgaccatc gcaacgaggg gggcaatgaa cgaggtgacc gccggggtgc 3618901 gtgagctcgc cacggccatc atggtgtcac ggcatctgac cggcgtactt gccggccacg 3618961 gcagccaaac cgttacgtat cattttgcgt cgattttgtg ttcgtccgta cactcacttg 3619021 ttgtgtcctt tgccgatgcc accatcgcgc gccttcccgg ggtggtccag ccctatgcgc 3619081 agcgccacca tgagctgatc aaatttgcca tcgtcggcgg caccacattc atcatcgaca 3619141 cagcaatttt ctacaccctc aagctgacgg ttctcgaacc caagccggtg accgcgaagg 3619201 tgatcgccgg catcgtcgcc gtcatcgcgt cctacgtgtt gaacagggag tggagcttcc 3619261 gcgaccgcgg cggtcgcgag cgccaccatg aggcgctgct gttctttgcg ttcagcggcg 3619321 tgggagtgct gctgagcatg gcgccgttgt ggttttccag ctacatcctg cagctacggg 3619381 tgccaacggt gtcactgacc atggaaaaca tcgccgactt catctcggcc tacattattg 3619441 gcaacttgct gcaaatggcg ttccgcttct gggcgtttcg gcgctgggtg ttccccgacg 3619501 agttcgcccg caaccccgac aaggccctgg aatccgccct taccgcgggc ggcatcgccg 3619561 aagtcttcga ggacgtcttg gagggcggct tcgaggacgg caacgtcacc ctgctgcggg 3619621 cctggcgtaa ccgggccaac cggttcgctc agctgggcga ctcgtcggag cccagggtgt 3619681 cgaaaacctt gtgatacagc aacgcatgca cctcccgcag gcgcggaatg ttgtagaact 3619741 cgagcggatc ttgtgacgcg gactcgataa tcaacgtccc ggtgcgaaaa atccgctcga 3619801 agatccggtc ccggaactcc acgctgttga tccgtgctag cggtatgtcg atcccgctgc 3619861 gggtcagcac accatgccgg aacatcaccc gccggttggt caccacgaaa tgtgtggtca 3619921 gccagctcag gaatggccac agcgtgagcc agccgacgat caccaaccag atcccccaga 3619981 tgaccgcgtg aatcacgttc ttagcgatct gctgccaagg tgtcgagttg acgaatccgg 3620041 acccgaacgc cgccaacccg gtcagcaaga ccagcaccac gacgggccag attaagcgat 3620101 tccagtgcgg atggcggtgc agaacgacct gctcgccagc ggccaggaca ttctccggat 3620161 agctcatgcc cgcgacctta atcttttggg gacgccagct ccgcgcgagt taacgcaaat 3620221 gcaccacgtc gcccgctgaa acaactaccg ttcgaccgcc gacgtccaga cacagccgac 3620281 cctggtcatc gatgtcacgc gcgatcccga cgacgtcctg gccaccgggg agctcgacgc 3620341 gcacgcgcga cccaatggtc aggctgcgag cacggtagtt ggccgccagt tgtgggttgg 3620401 cgttgcgcca ctggatgatc cgagcttcga gctcgcgcaa cagcctgctg gctatgcggt 3620461 tgcggtccgg tgccgccact ccgaggtcca gcaatgaggt cgcgtcggga tcaacctctt 3620521 cgggggcctg ggtgacgttg agtcccacac cgagtaccac aaacggctgc gcgacctcgg 3620581 ccaggatgcc ggctaacttg ccaccccggg ccagcacgtc attgggccac ttgaggcccg 3620641 tttcggccgg cgggactgca atcagggggg ccaccgaatc gagcaccgcc agacccgcgg 3620701 ccagtgacag ccagccccac gcttgcaccg ggacgtcgac cacacgcaca ccgaccgaca 3620761 ggatgatctg cgctcgggca gtggccgccc agccgcggcc atgacgcccc cgcccagcgg 3620821 tctgatgctc ggcgatcaac accaccccgt cgatatcggc cccggatgcc gcccgggcca 3620881 gcaagtcggc gttggtggaa ccggtttggg ccacgacgtc aagttggcgc cacccggatc 3620941 cagcaccgat cagctggtcg cgcagtgagc gttcgtccaa aggcggcctg agccgatcgc 3621001 ggtcggtcac cgccccagcc taaggaagta gtgtgcggca gccgataaca tcgactccca 3621061 tgacaagcgt taccgaccgc tcggctcatt ccgcagagcg gtccaccgag cacaccatcg 3621121 acatccacac caccgcgggc aagctggcgg agctgcacaa acgcagggaa gagtcgctgc 3621181 accccgtcgg tgaggatgcc gtcgaaaaag tacacgccaa gggcaagctg acggctcgcg 3621241 agcgtatcta cgcgttgctg gatgaggatt cgttcgtcga gctggacgcg ctggccaaac 3621301 accgcagcac caacttcaat ctcggtgaaa aacgcccgct cggcgacggc gtggtcaccg 3621361 gctacggcac catcgacggg cgcgacgtgt gcatcttcag ccaggacgcc acggtgtttg 3621421 gcggcagcct tggcgaggtg tacggcgaga aaatcgtcaa ggtccaggaa ctggcgatca 3621481 agaccggccg tccgctcatc ggcatcaacg acggtgctgg cgcgcgcatc caggaaggtg 3621541 tcgtctcgct gggcctgtac agccgtatct ttcgcaacaa catcctggcc tccggcgtca 3621601 tcccgcaaat ctcgttgatc atgggagccg ccgccggtgg gcacgtctac tcccccgccc 3621661 tgaccgactt cgtgatcatg gtcgatcaga ccagccagat gttcatcacc gggcccgacg 3621721 tcatcaagac cgtcaccggc gaggaagtca ccatggaaga actcggcggc gcccacaccc 3621781 acatggccaa gtcgggtacg gcacactacg ccgcatcggg cgaacaggac gccttcgact 3621841 acgttcgcga gctgctgagc tacctgccgc ccaacaactc caccgacgcg ccccgatacc 3621901 aagccgcagc cccgacaggg cccatcgagg agaacctcac cgacgaggac ctcgaattgg 3621961 atacgctgat cccggactcg cccaaccagc cctatgacat gcacgaggtg atcacccggc 3622021 tcctcgacga cgaattcctg gagatacagg ccggttacgc ccaaaacatc gtggtggggt 3622081 tcgggcgcat cgacggccgg ccagtcggca ttgtcgccaa ccagccgaca cacttcgccg 3622141 gctgcctgga tatcaacgcc tcggagaaag cggcccggtt tgtgcggacc tgcgactgct 3622201 tcaatatccc catcgtcatg ctggtggacg tcccgggctt cctgccgggc accgaccagg 3622261 aatacaacgg catcatccgg cgcggcgcca agctgctcta cgcctacggc gaggccaccg 3622321 tgccaaagat cacggtcatc acccgcaagg cctacggcgg tgcgtactgc gttatgggct 3622381 ccaaagacat gggctgcgac gccaacctgg cgtggccgac cgcgcagatc gcggtgatgg 3622441 gcgcctccgg cgcagtgggc ttcgtgtacc gccagcagct ggccgaggcc gccgccaacg 3622501 gcgaggacat cgacaagctg cggctgcggc tccagcagga gtacgaggac acactggtca 3622561 acccgtacgt ggccgccgaa cgcggatacg tcgacgcggt gatcccgccg tcgcatactc 3622621 gcggctacat cgggaccgcg ctgcggctgc tggaacgcaa gatcgcgcag ctgccgccca 3622681 aaaagcatgg gaacgtgccc ctgtgagtcg agtgagcgga acgaacctgt gagtcgagtg 3622741 agcggaacga acgaagtgag tgacgggaac gagacgaaca atccggcaga agtgagtgac 3622801 gggaacgaga cgaacaatcc ggcagaagtg agtgacggga acgagacgaa caatccggcc 3622861 cctgtgagtc gagtgagcgg aacgaacgaa gtgagtgacg ggaacgagac gaacaatccg 3622921 gcccctgtga ccgagaagcc gctgcatccg cacgagcccc acatcgagat actgcgggga 3622981 caacccaccg atcaggagct ggccgcgttg atcgcggtgc tgggcagtat cagcggttca 3623041 accccgcccg cgcaacccga gcccacccgg tgggggctgc cggtcgacca gttgcggtac 3623101 cccgtcttca gttggcagcg catcacactg caagaaatga cgcacatgcg ccgatgaccc 3623161 ggctggtgct cgggtccgcc tcccctggcc ggctcaaagt ccttcgtgat gccggcattg 3623221 agccgctggt catcgcctcg cacgtcgacg aggatgtcgt catcgcggcg ctggggccgg 3623281 acgcggtccc gagcgatgtg gtgtgcgtac tggccgcggc aaaggccgcg caggtcgcga 3623341 ccacgctgac cggaacgcaa cgcattgtgg ccgcggattg cgttgtcgtt gcctgtgatt 3623401 cgatgctcta catcgaaggc aggctactcg gcaagccagc gtcaatcgac gaggcgcgcg 3623461 agcagtggcg gtcgatggcg ggccgggccg gccaactcta tacgggccac ggtgttatcc 3623521 ggttgcagga caacaaaacc gtgtaccgtt ctgctgaaac agcaataacc acagtatatt 3623581 tcggaacacc ttcggcctcc gatctggagg cttacctggc cagtggggag tcgctgcggg 3623641 tcgcgggtgg attcaccctg gacggtctgg gcggctggtt catcgacggc gtgcagggca 3623701 atccgtcgaa tgtgatcggc ttgagcctgc cgttgctgcg gtcgctcgtg cagcgatgcg 3623761 ggctgtccgt cgccgcactg tgggcaggaa atgcgggcgg cccagcgcac aagcagcagt 3623821 agcttcggac tgggccaggt cgccagcggt aggctcgatg atgtgccgct tcccgcagac 3623881 cctagcccca ccttgtcggc ctacgcccat cccgaacggc tcgtgaccgc cgactggttg 3623941 tcggcacaca tgggcgcgcc gggcctggcg atcgtcgaat ccgacgagga cgtcttgctc 3624001 tacgacgtcg gccatattcc cggcgccgtc aagatcgact ggcacaccga cctcaacgac 3624061 ccacgggtgc gcgactacat caacggcgag cagttcgccg aattgatgga ccgcaagggc 3624121 atcgcccgcg atgacaccgt ggtgatctat ggcgacaaga gcaattggtg ggccgcctat 3624181 gcgttgtggg tgttcacgct gttcggtcac gccgacgtgc gactcctcaa cggcggccgt 3624241 gacctctggc tcgccgagcg ccgggaaacc accttggacg tcccgaccaa gacctgcacc 3624301 ggttatcccg tcgtgcagcg caacgatgca cccatccgcg cattcagaga cgacgtgctg 3624361 gccatcctgg acgctcagcc gctgatcgac gtacgctctc ccgaggagta caccggcaag 3624421 cgcacccata tgcccgatta ccccgaggaa ggggcgctgc gggccggtca catccccacg 3624481 gcggtgcaca ttccgtgggg gaaggccgcc gacgaaagtg gacggtttcg cagccgcgag 3624541 gaattggaac ggctctatga cttcataaac ccggacgacc aaaccgtcgt ctattgccgc 3624601 atcggtgaac gctccagcca tacctggttc gtgctcacac acctgctggg caaggcagat 3624661 gtacggaact acgacggctc gtggaccgag tggggcaacg ccgtgcgagt gccgatcgtc 3624721 gcgggcgaag aaccaggagt ggtacccgtc gtatgaccgc gcccgcgagc ctgcccgcgc 3624781 cgctagcaga ggtggtatcc gacttcgccg aagtccaggg tcaagacaag ctgaggctgt 3624841 tgctggaatt cgccaacgag ctgccggcgc ttccgtcgca cctggccgag tccgctatgg 3624901 agccggtccc cgagtgccag tctccgctgt ttttgcacgt cgacgcgagt gaccccaacc 3624961 gggtgcgcct gcatttcagc gcgccggccg aagcgccaac cacgcgcggg ttcgcctcga 3625021 tcctggccgc cggcctagac gagcaaccgg ccgccgacat cttggcggtg cccgaggatt 3625081 tctacaccga gctgggtctg gctgccttga tcagcccact gcggttgcgg ggaatgtcgg 3625141 cgatgctggc ccggatcaag cgccggctgc gcgaagcgga ctgaatcgag gaaccgcgtg 3625201 agcgggtcag cggcgcgacg cttaaacttc ccccgacaag acttgtaaga aaatctctta 3625261 gagacgaaga atcagcccga caggaggcgc agtggctagt cacgccggct cgaggatcgc 3625321 tcggatctct aaggttctcg tcgccaatcg cggcgagatc gcagtgcggg tgatccgggc 3625381 ggcccgcgac gccggcctgc ccagcgtggc ggtgtacgcc gaacccgacg ccgagtcccc 3625441 gcatgttcgg ctggccgacg aggcgttcgc gctgggcggc cagacctcgg cggagtccta 3625501 tctggacttc gccaagatcc tcgacgcggc agccaagtcc ggggccaacg ccatccaccc 3625561 cggctacggc ttcctagcgg aaaatgccga cttcgcccag gcggtgatcg acgccggcct 3625621 gatctggatc ggccccagcc cgcagtcgat ccgcgacctg ggcgacaagg tcacggcccg 3625681 tcacatcgcg gcccgcgctc aggcgcccct ggtgccgggt acccccgatc cggtcaaagg 3625741 cgccgacgag gtggtggcat tcgccgagga gtacggcctg ccgatcgcga tcaaggccgc 3625801 ccacggcggc ggcggcaagg gcatgaaggt ggcccgcacc atcgacgaga ttccggagct 3625861 gtacgagtcg gcggtgcgcg aggccacggc cgcgttcggc cgcggtgagt gctacgtgga 3625921 gcgctatctc gacaagccgc gccacgtcga agcacaggtg atcgccgacc agcacggcaa 3625981 cgtcgtcgtc gccggcaccc gggactgctc gctgcagcgc cgctaccaga agctggtcga 3626041 ggaggcgccc gcaccgttcc tgaccgactt tcaacgcaaa gagatccacg actcggccaa 3626101 acggatttgc aaagaggccc attaccatgg cgccggcacc gtcgaatacc tggtcggtca 3626161 ggacggcttg atctcgttct tggaggtcaa cacgcgcctt caggtagaac acccggtcac 3626221 cgaggaaacc gcgggcatcg acttggtgct gcagcaattc cggatcgcca acggcgaaaa 3626281 gctggacatc accgaggatc ccaccccgcg cgggcacgcc atcgaattcc ggatcaacgg 3626341 cgaggacgcg gggcgtaact tcctaccggc gcccgggccg gtgacaaagt tccacccgcc 3626401 gtccggcccc ggtgtgcggg tggactccgg tgtcgagacc ggctcggtga tcggcggcca 3626461 gttcgactcg atgctggcca agctgatcgt gcacggcgcc gaccgcgccg aggcgctggc 3626521 gcgggcccgg cgcgcgctga acgagttcgg tgtcgaaggc ctggcgacgg tcatcccgtt 3626581 tcaccgcgcc gtggtgtccg acccggcatt catcggcgac gcgaacggct tttcggtaca 3626641 tacccgctgg atcgagaccg agtggaataa caccatcgag ccctttaccg acggcgaacc 3626701 tctcgacgag gacgcccggc cgcgtcagaa ggtggtcgtc gaaatcgacg gtcgccgcgt 3626761 cgaagtctcg ctgccggctg atctcgcgct gtccaatggc ggcggttgcg acccggtcgg 3626821 tgtcatccgg cgcaagccca agccgcgcaa gcggggtgcg cacaccggcg cggcggcctc 3626881 cggtgacgcg gtgaccgcgc ctatgcaggg caccgtagtt aagttcgcgg tcgaagaagg 3626941 gcaagaggtc gtggccggcg acctagtggt ggtcctcgag gcgatgaaga tggaaaaccc 3627001 ggtcaccgcg cataaggatg gcaccatcac cgggctggcg gtcgaggcgg gcgcggccat 3627061 cacccagggc acggtgctcg ccgagatcaa gtaagcccgg cggctactcc aactgatccc 3627121 gtagccgtgc caatgacttg gccagcagcc gcgacacgtg catctgtgag ataccgacgc 3627181 gctcggcgat ctgcgtttgg gtcatcgagt cgaagaacct gagcaccaag accgttcgtt 3627241 cccgctcggg caacgcctcg agcaacggac gaagcacctc ccgattctcg atctggtcaa 3627301 gacccgcatc cacgtcgccc agggtgtctg tgattgcgcg ggcatcgtcg tcgctgccgc 3627361 caccgctgtc gatggacaag gtgtggtagg aactacccgc cagcaaacct tcgataacct 3627421 cagcgcggtc catcccgagc tccgcggcga gctccgatgc cgacggcgcc cgcccgagcc 3627481 gctgcgacaa atcggcggtg gcggtaccta gccgcagatg cagttccttg agacgccggg 3627541 gaaccttgac cgaccagctg ttgtcgcgga agtgtcgtcg gacctcgccc atgatggtag 3627601 gaaccgcgaa ggagacgaag tccgacccgg tcttcacgtc gaagcgaacc gcggcgttga 3627661 ccagcccgac ccgcgcgacc tgaataaggt cgtcacgcgg ttcgccgcga ccctcgaacc 3627721 gccgcgcgat gtgatcggcc agcggcaagc accgctgaac gatcttgtcc cggtgccgct 3627781 ggaattccgg tgagccggca ggcaaaccaa ccagctcgcg aaacatctcc ggaacgtcgg 3627841 cgtattcgtt agctcgcgat gcagaaccgc cggcagcgcg cgccgtcacc tgctggatgc 3627901 cgcccgtcgg gcggtcaacg tgatgccgaa gacactgccg gctacatcgg gctggcgacc 3627961 gtcgtggaag gtctggacgt cgtcggccag cgcggtcagg acatgccagc taaagctgcc 3628021 cggtgccacc acgtcgtggg tgtcgcaggc agcagaagcc tccaccacaa cttcgtcttt 3628081 tcgcggatcg accaccaggc gcagggtggc atccggcaag gccgagcgaa tcaaccgggt 3628141 gcacacctcg tccaccgcca acctcaggtc ggccacggcg tcgaaatcca ggtcctcgaa 3628201 ggtgccgatg gcgccgacca gggtgcgcag cagcgccagg ttctccaggc gggcagcaac 3628261 gttcagctcg acggcgcgga caccgcgttg gcgccccttg gtgggtaaat ccgagtcggc 3628321 catgcaccct cccggcaagc ttcgatcgac agtactcccg ccttgggtct ggtcttcgag 3628381 ctggtcggtc atggtcggac ctgctggtag tggggatcta acgcaacatg gtcgggattc 3628441 atcatggtgt acccgtgata cccattcgca gctgccggtg aaaccccgcg atgccgggat 3628501 ttccagccgc actaggatgt ctagccggcc agccgctgcc gccggacttc gggatgttcg 3628561 gtataccagc gatcggcaat cttgcgtatc cgccgatgct cgaacgctag ccacgccaaa 3628621 ccaaccactg tgacgacaat cgccaccaca ccaaaggtca tgccctcggc gtgatgtccg 3628681 gtgccgaaag ccgcaagagc tccgacgccg ccgacgacac cggccacaat caacagatac 3628741 ccaggccaat gcaccacgtc gatcagcgac tcgccggcaa gcggccgcgt cgtccgcaag 3628801 tggtcgacgg ggtcacgata ggtgtcgccc atggcctcct ccgtttccgt cctattccgc 3628861 catttctgcc cattaccagg cactaccatc aacggtagaa ctcgtcgaac gggttgtgga 3628921 gggatctgac ccatttattt gttgaccgcg gccgacctgg ccgacggctc acggtgccat 3628981 gaccgggccg gcgatcggtg ggacgcctat gcagagcgtc agcaccatca gcgtcaacaa 3629041 aaaccagccg gcgccgtgcc atccccacca ggtaccttcc gcacgccata cccggtaggt 3629101 gcgcagaaac gcccacagcc cggccgcaca caggatcagc gggcccccca gcgccagcag 3629161 gatccgctgg ggcgggccgc aggccgcggt gtcgacgccg ctgcacgtgc tgaccaacaa 3629221 cgctcccata atgaggaaac cgaccccgac gacagcggcc acaacagcaa accgaatcgc 3629281 cgagtgcacc tcgctgtcat cccggcctag ccgatcgccg cgtgacggcc cacctacttc 3629341 gtgcatcggc gaatctccat cccgctcttg gcggctgcct tacgtcacca ccggtaacgc 3629401 gctgcgcacc gcggctatcg cggcgtcgat ctcggcggtt gaaaccgtca gcggtggacg 3629461 gaatcgcacg gtgtctgcac cggccggcaa cacaatcacc gcacgttgcc acagctggcg 3629521 gatcaactcg tcacggtcgg cggtggtcgg caggctaaac gcacacatca gcccgcggcc 3629581 gcgcggatcg agaaccactg ccgggaagtc cgcggcgagt tcgtcaagcc gggcgcgcag 3629641 atacttaccg tgctgcaccg cccgctcgaa caggccctcg gcttcgatga cctccaagat 3629701 gcggcgggcg cgcaccatgt cggtaagatt gccaccccat gtcgagttga gccgtgatgg 3629761 gaccgcgaac acattgtcgg cgacctcgtc cacccgccga ccggccatca ctccgcatac 3629821 ctgcgtcttc ttgccgaacg ccacgatgtc gggtgcgaca tccaactgct ggtatgccca 3629881 ggcggttccg gtcaacccgc agccggtctg tacttcgtcg aagatcagca gtgcatcaaa 3629941 ctcgtcgcac agctcgcgca tcgcagcgaa aaactccggg cggaaatggc ggtcgccacc 3630001 ctcgccctgg atgggttcgg ccacaaaaca cgcgatgtcg tgcgggcggg tctcgaatgc 3630061 cgcgcgggcc tggcgtagcg cctcggcctc tagcgcggcc atagcgggct catccaggcc 3630121 gggccgcatg tacggcgcat cgatgcgtgg ccagtcgaat ttcgggaacc gggcggtaat 3630181 ggtcggcttg gtgttggtca gcgacagggt atagccgctg cggccgtgaa atgccccgcg 3630241 caggtggagc acttgagtgc ccagcgccgg gtcgatccca tgggcttggt tgtgccgact 3630301 cttccagtcg aacgcggctt tgagcgcgtt ctccaccgcc agggcgcccc cttcgacgaa 3630361 gaacagatgc ggcagcgccg ggtcgcccaa gacacgggcg aaggtctcga cgaagcgggc 3630421 catcgccacc gagtacacgt cggaattgct gggcttgttc agcgcggcct gcatgagttc 3630481 ggcatggaac tcccggtcgt ccaccagcgc cgggggattc atacccagtg ccgaggaggc 3630541 aacgaatgtg aacatgtcca ggtagcgccg acccgttata gcgtcgacca gatatgaacc 3630601 gcccgaacgg gtcagatcga gcactatgtc cagaccgtcg accagcatgc tgcgccctag 3630661 cacctcatga acccggtctg gtgttgttgg tctaccggca agagcgacgg acttcacgac 3630721 ggcggccatg acgctatgat agcaggattt acggaatatt gatatttatg ctggaaaaat 3630781 tatggtatat gctgcctatc gctgtaaaaa gtgttcagaa tgatcgtgct tcgcgtccgc 3630841 acgttcgccg ttgtccggat ccgttgcaac aggtcctcga gcgcccgtgc ggacgcgacg 3630901 cgcaccagca agacgtagct ctcttcgccg gccaccgagt aacaggactc gacctcctcg 3630961 atatgttcta ggcgcgcggg ggcatcatct ggttgagacg gatcaagagg agtgatagcc 3631021 acgaacgccg acaacaaatg cccaaccgcc tcgggattga ttcgcgccga atatccctgg 3631081 accacaccac gagactccag ccggcgcact cgcgattgga ccgccgagac cgacagcccg 3631141 gctcgcgtgg ccaactctga cagcgtcaca cgtccgtcgg cggccagttc gcgcaccagg 3631201 atccgatcga tatcgtcgag cgcctcgttc atggccggag actatcgcaa cggcagtgcc 3631261 gcatgagccg ctcgaaaaga ctgcagactg gccagctgcg cgcgcgcttc gccgccgggt 3631321 tgtcagccat gtacgccgct gaggtgcccg cctacggcac gctggtcgag gtatgcgcac 3631381 aagtcaactc cgattacctg acccggcatc ggcgagccga gcggctgggg tcgcttcagc 3631441 gcgtcaccgc cgagcgccac ggcgccatcc gagtgggcaa cccggccgaa ctcgctgcgg 3631501 tcgccgacct gttcgccgcg ttcgggatgc tgccggtcgg ctactacgat ctgcgcaccg 3631561 ctgagtcacc aattccagtg gtgtccaccg catttcgccc aatcgatgcg aacgagctgg 3631621 cacacaaccc gtttcgggtg ttcacctcga tgctggccat cgaggatcgg cggtacttcg 3631681 atgccgacct acgcacccga gtgcagacct tcctcgcgcg ccggcaactc tttgaccccg 3631741 cgttgctcgc ccaggcgcgg gcaatcgcgg ctgacggcgg ctgcgatgcc gacgacgcac 3631801 cggctttcgt cgccgcggcg gtggccgcgt ttgcgctgtc gcgggaaccg gtcgagaaat 3631861 cctggtacga cgagttgtcc agggtgtcgg cggtggccgc tgatatcgct ggagtcggct 3631921 ccacacacat caaccatctg acgcctcggg tgctcgacat agacgatctg taccgtcgga 3631981 tgaccgagcg cggcatcacc atgatcgaca ccatccaagg ccctccccgc accgacggac 3632041 ccgatgtgtt gttgcggcaa acctcatttc gcgcgctggc cgaaccacgc atgtttcgcg 3632101 acgaggacgg taccgtgacg ccgggaatcc tgcgggtgcg gttcggtgag gtcgaggcgc 3632161 gcggtgtcgc gctgaccccg cgagggcgcg aacgctacga agccgcgatg gcggccgcag 3632221 atccggccgc ggtctgggcc actcactttc cctcgacgga tgcggagatg gccgctcaag 3632281 gcttggccta ctaccgaggt ggtgacccgt cagcgccgat cgtctacgaa gacttcctgc 3632341 ccgcttcggc cgcgggcatc ttccgctcca acctggatcg cgactcgcaa accggtgacg 3632401 gacccgacga tgccggctac aacgtcgatt ggttggccgg ggcaatcggc cgacacattc 3632461 acgacccgta tgcgctctat gacgcgctcg cccaggagga gcggcgctga taaccactga 3632521 cgcgttacga gcccaggtgc tcgaagcctg ccaagcgatc ggcgtaaccg ccgcccttgg 3632581 cgagccgggc gaacacagcc tgcccgcgag cacaccgatc accggcgacg tgctgttcag 3632641 catcgcaccg accaccccgg agcaggccga ccacgcgatc gccgcggcgg ccgcaacatt 3632701 tacggcatgg cgaagcacgc cggccccggt gcgcggcgcg ctcgtggccc ggctcggcga 3632761 gctgctcacc gcacaccagc aggacctcgc gacactggtc acagtcgaag taggcaagat 3632821 caccgccgag gcgcgcggcg aagtgcagga aatgatcgac gtctgccagt tctcggtggg 3632881 tctgtcacgc cagctctacg gccgcaccat cgcgtcagag cgcgctgggc accggctcct 3632941 ggaaacctgg catccgctgg gagtggtggg cgtgatcacc gcgttcaact tcccggtcgc 3633001 ggtctgggcg tggaacaccg cggtggcact ggtctgcggc gacacggtgg tgtggaaacc 3633061 ctcggagctg acgccgttga cggcgctggc ctgccaggcg ctgctcagtc gggccgccgc 3633121 tgatgtcggc gcgccggccg cggtgggcgg cctgctgttg ggcggcgccg agcgtggtgc 3633181 gcaactcgtc gacgacccgc gggttgcgtt gttgtcggcg acgggttcgg tgcggatggg 3633241 ccagcaggtc ggtccacgcg tcgcccggcg cttcgggcgg gtgctgctgg agttgggcgg 3633301 caacaacgcg gccattgtgg cgccgtcggc cgacctggag ctggcggtgc gctgcatcgt 3633361 gttcgccgcg gccggcaccg caggtcagcg ctgcaccagc ctgcgccggc tgatcgtgca 3633421 ccgctcggtg gctgacgatg tggtggcacg cgtcgtcggc gcctatcgcc agctggcgat 3633481 cggtgacccg tcggccccgg acacgctggt aggcccactc atccacgagg ccgcctaccg 3633541 cgacatggtg gcagcgctcg agcgggcacg caccgacggc ggcgaggtca tcggcggtga 3633601 tcgtcgcgag gtgggctcac cgggcgccta ctatgtcgcg cccgctgtgg tccgaatgcc 3633661 gtcccagacc gccatcgtgg cgaccgaaac gttcgcacca atcctgtacg tgctcaccta 3633721 cgacgacctc gacgaggcga tagccctcaa caacgcggta ccacaagggc tttcgtcgtc 3633781 gatcttcacg accgacctgc gtgaggccga gcacttcctc gaccagtccg actgcggtat 3633841 cgccaacgtc aacatcggga cgtcgggagc ggagatcggt ggtgccttcg gcggcgagaa 3633901 gcagaccggc ggcggccgcg agtccgggtc cgacgcgtgg aaggcctaca tgcgccgggc 3633961 caccaacacc gtcaactact cgagcgagct gccgctggcg cagggcgtga agttcgggta 3634021 accatgcccg tgggtgcgtc tgggcatcat cgacgcgcgc ttggggttgg gcggggtgga 3634081 attcatccat ttcattcagt gcccgttgcg aatccccaag ctaccccgac ggcgaccaga 3634141 ggatgtcgat ggggacggcg gcgaggcggt cgccgaatgg ctgggcttgt gggccggtgt 3634201 gcaggatcac gccgccggcg aagcgtgcgc cgactttgtc gcggagtctg ctgatcgagc 3634261 gggtgtctct accacggagg gttgccgccg acttgatttc gatcgcggca atgaggccgt 3634321 ctgcggtttc cagtatgagg tctacttcgg cgccgtctcg atcgcggtag tggaacagtc 3634381 gaggtgcctg ttgcgaccat ccgagttgtc gccggagttc tgcgatcacg aaagtttcga 3634441 tgatggctcc ggccgcgttg gggttggcat gtggaccggc tccggtaggc gagacattga 3634501 cgaggcgagc ggccagtccg gagtcgagaa ggaggacttt cggtctatcg acgacccgct 3634561 tggaaaggtt ggtcgaccac gcgggtatgc ggtcgatgag atacagggtc tcgaggaggt 3634621 cgaggtacgg cggcagggta cgtacgggga tttcggcgtc ggtagctagg gagctcaggt 3634681 taagttcgga cgcgctgcgt gcggctagaa gtcggatgag gcgcggcagg tcggcgatgc 3634741 gttggagatt ggagacgtcg gccgcgtcac gtttgacgac gcggtcgacg ttcctagctt 3634801 tcgccgattc gcgacaaagc cgtcgccgat acgcggcact atcttcgcca attcgcggat 3634861 atctcctcac cgattcgcga tatctggcgg agccggtggt gtcgcagcag ggacgtcggg 3634921 gcagacccac cccaccgaaa gaaccaccac cacctgctcg cctagccgaa cgtgtggtct 3634981 acgtgagtaa tatctgtcac atggcgacag ccagaaggcg gttatccccg caggaccgcc 3635041 gcgctgaact gctcgctctg ggggcggagg tctttgggaa gcggccttac gacgaggttc 3635101 gcatcgatga gatcgccgag cgcgctgggg tgtcgcgggc actgatgtat cactacttcc 3635161 cggacaagcg ggcgttcttc gccgcggtcg tcaaggacga ggccgaccgg ctgtacgcgg 3635221 cgaccaacaa ggcgcccgcc cctgggatga cgatgttcga agagatacga accggcgtgc 3635281 tggcctatat ggcctaccac caacaaaacc ccgaggcggc gtgggccgcc tacgtcggcc 3635341 tcggccgatc ggacccggtt ctgctcggta tcgacgacga agccaagaac cgccagatgg 3635401 aacacatcat gtcccgcatc gccgaggtcg tgagcgggat tgaccgcgat aacaccctgg 3635461 acccagaggt cgagcgcgac ctgcgggtga tcatccacgg ctggctggcg ttcaccttcg 3635521 agctgtgtcg tcagcggatc atggacccgt cgaccgacgc tgaacggctc gccgatgctt 3635581 gcgcacacgc gctgctggac gccatctccc ggctgccgca gatccctgcc gaactggctg 3635641 acgcgatggc aaccgcgcga atgtgagcgg taggcggttt ttgtcggtgc ctgttggcac 3635701 gatggctagg tgaggttcgc gcagccttca gcactgagcc gattcagcgc gctcacccga 3635761 gactggttca ccagcacttt cgccgcgccc accgccgccc aggccagcgc ctgggcggcc 3635821 atcgcagacg gcgacaacac gctggtcatc gctcccaccg gatccgggaa gaccctggcg 3635881 gcgttcctgt gggccctgga tagcttggcc ggttcggaac ctatgtccga gcggccggcg 3635941 gccacccgcg tgctgtatgt gtcgccgctc aaagcgttgg ccgtcgacgt cgagcgcaac 3636001 ctgcgcactc cgctggccgg actgacccga ctcgccgaac gccagggtct gcccgcgccc 3636061 cagatcaggg tgggcgtccg ttcgggcgac accccgcccg cacttcgccg ccagctcgtc 3636121 agccagccgc ccgacgtgct gatcaccacc ccggagtcat tgtttttgat gctcacttcg 3636181 gccgcacgcc aaactctgac cggtgtgcag accgtcatca tcgacgaaat tcatgccatc 3636241 gccgccacca agcgcggcgc acacctggca ctatccctag aacggctcga cgacctgtct 3636301 agccggcgac gggcgcagcg catcgggctg tcggcgaccg tacgtcctcc cgaggaactc 3636361 gcaaggttcc tgtccggaca gtccccgacg accattgtgg cgcccccggc cgccaagacc 3636421 gttgagctgt ccgtgcaggt gccggtgccc gacatggcca acttgaccga caacaccatc 3636481 tggccggatg tggaggctcg gctggtcgac ctgatcgaat cacacaactc gaccatcgtg 3636541 ttcgccaatt cgcgacgatt ggccgagcga cttaccgcac ggctcaacga aattcacgcc 3636601 gcgcgctgcg ggattgagct cgcgccagac accaaccagc aggttgccgg cggcgccccg 3636661 gcgcacatca tgggctcggg ccagacgttc ggagcgccgc cggtgctggc ccgcgcccac 3636721 catggctcga tcagcaagga gcagcgcgcc gttgtcgaag aggacctcaa acgcgggcaa 3636781 ctcaaagcgg tggtggcgac gtccagcctg gagctgggca tcgacatggg cgcggtcgat 3636841 ctggtgatcc aagtacaggc accaccatcg gtggccagcg ggctgcagcg cattggccgg 3636901 gccggtcatc aggtcggcga gatttcgcgg ggggtgctgt ttcccaagca tcgcaccgac 3636961 ctactcggct gcgcggtcag cgtgcagcgc atgcttgccg gtgagatcga gaccatgcgg 3637021 gtgccggcca acccactcga cattctggcc cagcacacgg tggcggcggc tgcgctggaa 3637081 ccgttggatg ccgacgcgtg gttcgacacc gtgcggcggg ccgccccgtt cgcgaccctg 3637141 ccgcgtagcc tgttcgaggc caccctggac ctgctgtccg gcaagtaccc atccaccgag 3637201 ttcgctgagc tgcggccgcg gctggtgtat gaccgcgata ccggcacgct gaccgcgcga 3637261 cccggagccc agcgactggc cgtcacctcc ggcggcgcca ttcccgatcg cgggttgttc 3637321 gccgtctacc tcgctaccga gcggccgtcg cgggtaggcg aactcgacga ggaaatggtt 3637381 tacgagtccc gccccggtga cgtgatctcg ctgggtgcca ccagctggcg aatcaccgag 3637441 atcacccacg accgggtgct ggtgatcccc gcgccgggcc agccggcccg attgccgttc 3637501 tggcgcggag acgatgccgg ccgccccgcc gagctcggcg ccgcactcgg cgccctcacc 3637561 ggcgagctgg ccgccctgga ccgtacggca ttcggcacac gttgtgcggg tttgggtttc 3637621 gacgactatg ccaccgacaa cctgtggcga ctgctggacg accaacgcac cgctaccgca 3637681 gtggtaccca ccgacagcac attgttggtc gagcggtttc gtgacgagct gggcgattgg 3637741 cgggtgatct tgcattcgcc gtatgggctg cgggtgcacg gaccgctcgc gctcgcagtc 3637801 ggccggcggc tgcgcgaccg ctatggcatc gacgagaagc cgaccgcctc cgacaacggc 3637861 ataatggtgc gcctaccgga caccgtgtcc gctggcgaag acagcccgcc gggtgccgaa 3637921 ctgttcgttt tcgacgccga cgagatcgac ccgatcgtca ccaccgaagt ggccggttcg 3637981 gcgctgttcg cgtcacggtt ccgggaatcg gcggcccgcg ctctgctgct gccccgccgg 3638041 caccccggcc gccgctcgcc gctgtggcag cagcggcagc gcgccgcccg gctgttggaa 3638101 gtggcccgca aataccccga cttcccgatt gtgctggaga cggtccgcga gtgcctgcag 3638161 aacgtctatg acgtcccgat cttggtcgag ctgatggcgc ggatcgccca gcggcgggtg 3638221 cgtgtcgccg aagccgagac cgccaaacct tcgccatttg cggcatcgct gttgttcggc 3638281 tacgtcggcg ccttcatgta cgagggcgat acgccgctgg ccgaacggcg cgccgccgcg 3638341 ctcgcgctgg acggcacgtt gctggccgag ctgctaggcc gggtggagct gcgcgagctg 3638401 ctcgatcctg acgtcatcgc cgctaccagc cgccagctcc agcatctggc ggccgaccgg 3638461 gtagcccgtg acgccgaagg ggttgccgat ctgctgcggc tgctgggtcc gctcaccgaa 3638521 gacgagatcg ctgcccgggc gggcgcgccc gaggtcagcg gctggctgga cggcttacgc 3638581 gccgccaaac gcgcgctcgt ggtgtccttc gccggccgca gctggtgggt tgccgtcgag 3638641 gacatgggcc ggctgcgcga cggcgttggc gcggcggttc cggtggggct gccggccagc 3638701 ttcaccgagg cggtagccga cccgctgggc gaactactgg gccgctacgc acgcacccac 3638761 acaccgttca ccaccgctgc ggccgcagcc cggttcggtc ttgggctgcg ggtgaccgcc 3638821 gacgtgctgg gccggctggc cagcgatggc cggctggtgc gcggcgaatt cgtggccgcg 3638881 gccgaaggat ccgccggcgg cgagcagtgg tgtgacgccg aggtgttgcg aattctgcgg 3638941 cgccgctcgc tggccgcact gagggcgcag gcagagccgg tcagcaccgc cgcctacgga 3639001 cgcttcctgc cggcctggca gcacgtttcc gcgggcaact cgggcatcga cgggctggcc 3639061 gcggtcatcg atcagctcgc cggcgtccgg ataccggcct cggcgatcga accgctggtg 3639121 cttgccccac ggatccgcga ttactcgccg gcgatgctcg acgagctgct cgcgagcggg 3639181 gacgtcacct ggtcgggcgc cgggtcgatc tcaggcagtg acggctggat cgccctgcac 3639241 cccgccgact cggcgcccat gacgctggcg gagccggccg agatcgactt caccgacgcc 3639301 caccgggcga tcttagccag cctgggcact ggcggcgcgt acttcttccg ccagttgacc 3639361 cacgacggcc tgaccgaggc ggaactcaaa gccgctctgt gggaattgat ttgggccgga 3639421 cgagtgaccg gcgacacgtt cgcaccggta cgcgcggtac tcggcggggc gggcacccgg 3639481 aagcgtgctg ctcccgcaca cggcgggcat cgaccgccgc gcctgagccg ataccgcctc 3639541 acgcacgccc aggcccgcaa cgctgacccg accgtcgccg ggcggtggtc cgcgctgccg 3639601 cttcccgaac cggactccac gctgcgcgcc cattaccaag ccgagctgct gttgaaccgc 3639661 cacggcgtgt tgaccaaaga cgcagttgct gccgagggtg tggcgggcgg gttcgcgacg 3639721 ctctacaagg tgctcagtgc gttcgaggat gccggcaggt gccagcgtgg ctacttcatc 3639781 gagtcgttgg ggggcgctca gttcgccgtc gcctcgaccg tagaccggct gcgtagctac 3639841 ctcgacggtg tcgaccccga acagccggac taccacgcgg tggtgctggc cgctgccgac 3639901 ccggccaacc cgtatggggc ggcgttgccc tggccagcgt cgagcgctga cggtaccgcc 3639961 cggccgggcc gcaaagccgg cgcactggtc gttctggtgg acggcgagtt ggcctggttc 3640021 ctcgagcgcg gcgggcggtc gttgctgacg ttcaccgatg atcccgaggc caaccacgcg 3640081 gcggccatcg ggctggccga cctggtcacc gccgggcgcg tcgcgtcgat tctggtcgag 3640141 cgggccgacg gcatgccggt gctgcagccc ggcgggcggg cgtcggcggc actgacggcg 3640201 ctgctggcag ccggcttcgt ccgcacacct cgcggtctgc ggcggcggta agccatgccc 3640261 gagggcgaca ccgtctggca caccgcggcc acgttgcggc ggcatctggc cggtcgcacg 3640321 ttgacacgtt gcgacatccg agtgccacgg tttgccgccg tcgacctcac cggcgaggta 3640381 gtggacgagg tgatcagtcg gggcaagcac ctgttcatcc gaaccgggac agccagcatt 3640441 cattcgcatc tgcagatgga cggcagctgg cgggtcggca accggccggt gcgggtggat 3640501 catcgggcgc gaatcatttt ggaagccaac cagcaagaac aggccatccg ggtggtcggc 3640561 gtcgacctag gcctgttgga ggtcatcgac cggcacaacg acggcgccgt cgtcgcacac 3640621 ctaggacctg atctgctggc cgacgattgg gacccgcagc gtgcagccgc caacctgatc 3640681 gttgccccgg accggcccat cgccgaggca ctgctcgacc agcgggtgct cgccgggatc 3640741 ggcaacgtgt attgcaacga actgtgcttc gtcagcggag tattgccgac ggccccggtg 3640801 agcgcggtcg ccgacccgcg ccgcctggtc acccgcgccc gagacatgct gtgggtcaac 3640861 cgcttccgct ggaatcggtg caccactggc gatacccggg ccggccggcg actgtgggtc 3640921 tacgggcggg ccgggcaggg ttgccgccgc tgcggcacgc tcatcgccta cgacactacc 3640981 gacgagcggg tgcggtattg gtgcccggcc tgccagcgct gaaccgggcg atcaaagcca 3641041 gcacctagtc gcggccgtgg gtagcgaaga actgggcaat gacttgcgac ccgtcgaacg 3641101 cgcgcgtggt cgccccgatg accgccttgg gcagatattg cctgccaccc ggccaggtat 3641161 gtccgccatt gtcgatctgg taggagatca cctcggtgcc ggccgcacat gagctggaat 3641221 cgaaaaggtg caccattgtt ccgtccccga cgtcaggcag ctccgccgcc gacggatcgc 3641281 cctgacaccc atcgaccgcc cgccagcgat ccaccaagct cgcaaccgag atggaatggc 3641341 tgagcccgcc gcgaccacgc accgccccgc cgttgaacgg caccagcggg tcggcggtgc 3641401 cgtgtgcttc gagcaccgac accggccgcg acggattaca tgtcacaccc acacccagcg 3641461 tgcccgccac cggcgcgacc gcggcgaaga tatcggcacg gtcacacgcc agccggttgg 3641521 acatgaagcc accgttggac atgccggtgg cgaagacgtg cccgggagcg atgtcgaagt 3641581 cgtgcaccag ctttgcggcc agcgcgacca agaacccaac gtcgtcgaga tgacggcgat 3641641 ccgccggcga cgcccccctc ccgtcggccc agcttttgtc gtagccgtca ggatagacaa 3641701 ccaacaagtc ggcggcgtcg gcaacagcgt cgaaatcggt gagagcctcc tgtccggctc 3641761 cggtgccgcc accaccgtgc aggctgatca ccaacccgga gggctcagcg ggcggcacgt 3641821 gcaagcgata actgcgggtc aagcccccga actggaacgt cgctaccgaa ctggcatgcc 3641881 tggccagtag ctgatcaccg ccacacccgg ccaggcaaac catgagaacg ataagcgaca 3641941 gcattcgcgc ccacggcatc tcgtcaaggt accgatcgcg agcgctcagc ccgcggcgcc 3642001 ctgtcccacc gcttggaccg atgcgtgctc gtgcaacgcc ctggcggctt cgggatgtac 3642061 gggcttgagg tcgaagatga cctcggtgac ggtcccggtg aacgcatagg gcgccttgtc 3642121 ctcatagccg cggtcaacga ccaggccgtt gtcgcggccg atgtccatgc cggcatagga 3642181 ggtaaaggcc agcggcaccg tctggggcag ctcaccctct ccgatcaacc gatcgtcggc 3642241 ccagagcgtc acccgaccac cggaggcggc gacgggttga tgggaatcga acagcatccg 3642301 caccgtgaca tccccggtgg ggagcggctc gctggacacc tgccggtagg tttcgacgcc 3642361 caggaaggag taggtgtggt gcaggtgccg ctgttcgtcg acccatagcg cgaaccctcc 3642421 catgaagtcg gcgttggcga cgatcacacc ctgcgcgccg ccgtcgggga tgtgcagccg 3642481 tgcctcgatc gcgtaagaac gaccgcagat acgggggacc atgccgcgct gaatgttctg 3642541 cacgtcacct ttgaaactga accgtgcggt ggtgggcagg ggcggcaggt cgccgaacat 3642601 taccgcgagc ccgcccagca gcggcagcac ccggtttcgt tcggcctcct gccaccacag 3642661 ctgggtgagc tcggcgacct tgtcgggatg ctcggctgcc aggtttttcg cctgggagaa 3642721 gtcatctggt aggtagtaca gctcccagac gtcctggtcc gggtcgtagg tccccggcgc 3642781 gaaccgtcgc atcgtctccg gtgacagatc ccagggcgcc ttgtccaagc gagcgcacgc 3642841 ccaccagccg tctttgtaga tggcacggct gccgaagttt tcgaagtact gcacggtgtg 3642901 gcggtcttcg gcttcagcgt cgtcgaaggt ccgcacgaaa ctggttccgt ccatcggttc 3642961 ctgctcgaag ccgtcgacat gggtcggctc cggtaaaccg atggccgcca acacggtcgg 3643021 cgcgatgtcg atgcagtggg tgaactggct acgaacacgg ccgtctggcc ggatccgggc 3643081 cggccaagcg accaccaatg gatcgcgcgt gccgcccagg tggctggcca tctgcttgcc 3643141 ccactgcaac ggggtgttgc tcgcatgcgc ccacgcgctg gcgaaatgcg gtgcggtgaa 3643201 ctcgtcgccg agtgcggcga tgccgccgta ttgttcgatc agctccaatt gccgctcggc 3643261 atccagatcc aggccgttaa ggaacgtcat ctcattgaac gaaccggtgt tggtgccctc 3643321 catgctggcg ccattgtcgc cccagatgta gaacaccaac gtgttgtcgg actcgccgag 3643381 atcctcgatc gcgtccagca gccggccaac attccagtcc gcattttccg agaacccggc 3643441 gaacacctcc atctggcggg caaagagccg tttttgcgcc tccgacatac tgtcccacgc 3643501 ggggaatagg tcgggccgct cggtgagttc ggcgtcgggt ggaatgatcc cgagtcgctt 3643561 ttgccgttcg aatgtcttct gccggtacac atcccagcca tcatcgaact cacctcggta 3643621 cttgtcggcc cattccttga atacgtggtg tggcgcgtgg gtggcgccgg tcgcgtagta 3643681 cagcatccac ggcttggtgg cattctgggc ccgcacggtg tgcagccact cgatagcctt 3643741 gtcggtgagg tcgtcgggga aatagtaggg acggccgtct tccccagaac cctcgggtat 3643801 gcctatgacg gagttgtcct gactgatgat cgggtcgtac tgacccgcgg cgccgctcgg 3643861 gaagccccag aaatggtcga atccccaacc cagcggccag ttgtcgaacg gccccgcggc 3643921 tccctggaca ttgtccgggg tcagatgcca cttgccgaaa gcgccagtca cataaccgtt 3643981 gtcgcgcaga atacgcggca gcgctgcgca actgcgtggc ctgaccgccg aataccccgg 3644041 gtacgggccg gggaactcgc agaccgaccc gaagcccacc cggtgatggt tacgcccggt 3644101 caacagcgcc gcacgggtcg gcgagcacac cgcggtcaca tgaaaacggt tgtagatcaa 3644161 cccattctgg gctagccggg acagcgtcgg ggttcggatc gcgccgccga atgtatccgg 3644221 tccgccgaac ccagcgtcat cgatcaacac gatcagcaca ttcggtgcgt cgtcgggcgg 3644281 aaagggactg gggacaatcg accagtcgcc gaccgactct gccatggtgc ggccaaccac 3644341 gccaccaaag cggcgctgcg gtagcggcag ccgggtgcgg tctgggttga acttgcccat 3644401 cgcctctcgc aacgccgcac ccaggcttcg caacgtcgaa cgactcagct ccgcaaccga 3644461 tttcattgga gagctagcca acgcctgccc cgcttccagt cggccttgtg cctccgtcac 3644521 ggcgatgacc actgctcggc ccgccgccag cgcttggccg atcttgtcgg ccagcccggt 3644581 cttgatccga tggtgggcga aggtgccggc caatgctccg gtcgcggcgc cgagcgccgc 3644641 cgaggccaac agtgccggcg agaacaggcc gatcgccagg cccaccccgg cgccccacgc 3644701 ggcgccgcgc cggccgagcc gatttccggt gtcgaccaaa accggactgc cctcggcgtc 3644761 cttgccgatc agcaccgcac cctgcagcgg aatgcttttg tccttggcgg catcgacgag 3644821 ggtttgaaaa tcgtgacgag ccgaatcgag gtcctgatag ccggcgacga gcaccagcgc 3644881 gttgtcttca ctcatcacga aactcccgat atgtgtgtca cggccggcaa tcggccgcgg 3644941 ctgaccatgt tggcaacgta gcaccggtca acgtgcgcgt gctggcgaac tcgcggtgcg 3645001 acccggtcag cggatcgtcg aactcgatgc gctgcgcgag caactgcagc ggtgtgctga 3645061 agtcgtgggc ggccacggat atcacgttgg ggtacaacgg gtcacccatg atcggtatcc 3645121 ccagcgccgc catgtgcact cgcagctggt gggtgcgccc ggtggtcggt gtcagccgat 3645181 acagaccgtc gcgcgctatc cgctccacca gcgtctccgc gttgggaacg ccgggctcac 3645241 agaccgcctg cagatggccc cggcgcttga cgatgcgact gcggaccagg cgcggcaggg 3645301 ccagacccgg ggcaacgggt gcgcgagcca gataggtctt gcgcaccaaa ccgcgggcga 3645361 acatcgtctg gtagctgccg cgcacctcgc gtcgggtggt gaacaacaac accccggcgg 3645421 tcagccggtc cagccggtgg gccgggctca gctcgggcaa tcccagttcg cgacgcagcc 3645481 gcaccagcgc ggtctgcgcg acgtgtcgcc cccgaggcat ggtcgccaag aaatgtggct 3645541 tgtcgacgac gacgatgtcg gcgtcttgat gcagcactgg gacatcgaag ggcaccggca 3645601 cctcgtcggg caggtcgcga tacaggtgca caaccgaacc gggcggcagc accgtgccac 3645661 tgtcgaccac cgcaccgtcg ttgtcgacca cctccccggc cagcaccttc gcacgggccg 3645721 ccacgccaaa ccgtgcggtc agctcggcta acaccgaccc gccaagcagt cgcacccgca 3645781 ccggccccag cacgtcgtgc acgctaagca aacgatcctc tggccgcaac gccacacgag 3645841 accctctcag taagtggaaa tctcgtcctc ggtcggtagc accccggtga ccatgaagat 3645901 gacgcggcgg cccacttcca cagcgtggtc ggcgaagcgc tcaaagaaac gacccagcaa 3645961 cgccgtttcc acaccgacgc gaacgccgtg ccgccattct cgatctatca gcacgctcag 3646021 caaatgccta tgcaggtcat ccatcgcgtc gtcacgatcg tgcagttgcg cggcttcctg 3646081 cgggtcacgg ttcaccagca cttgtcttgc actgtcaccc aacgcgattg ccaccttcgc 3646141 catgtcggcg aagcagttgc gaacttcctc aggaagcacc tggttcggat actcgcgtcg 3646201 ggtgatcttg gcaatatgca cagccaacgc acccatgcgc tcggtgtcgg cgatgatctg 3646261 caccgcactg aagatttccc gcagctcgcc ggccaccgga tgttgcaacg ccagcagcgc 3646321 gaacgcttcc ttttcgactt gggctcgcat cgccacgatc cgctcatggt cacggattac 3646381 ttgttcagcg gcgccaatgt cggcctcgag cagagcctgc gttgcgcgtt tcatcgctat 3646441 cccggccagg ctgcacatct ctcccaatcg tccggccaac tcggttagcc gctggtgata 3646501 gaccgtccgc atggtgtcac gcctctctga ccctgagtcg tcgtgtggtg ctgccgcgga 3646561 tccacaccgc catcatcgac catggcggca ccgcgcgaca tacccgcttg gcgtagcctt 3646621 caatccaaag gcaccggctc gaggatctcg gcacgcgcct cgggtgcgct ggcccgcaac 3646681 atgtccgccg aaacgtcgtc gggctgggcc tgggagagca cctcggcctc cacgcgcgcc 3646741 atatagttcg cgacctcgcg gtcgatgtct gcggcggtcc acccgagcac gggcgcgacc 3646801 acctcggcca cctcccgggc gcagtcgacg ccccggtgcg ggtattcgat ggaaatccgc 3646861 atccgacggg ccaggatgtc ctcgagatgc agggcgccct cggcggcggc ggcgtaagcg 3646921 gcttccacct tcaaatagcc cggtgcctcc gttatcgggc tcaacaggct gggatcggag 3646981 gccgccatcg ctagaacgtc gctgatcagc gaaccatagc ggtccagcag atggcgcacc 3647041 cggtacgggt gcaggccctg cagcgcgccg acgtgttcgg cctgattgac cagtgcaaag 3647101 taaccgtcgg cgcccagcag gctgaccttc tcggtgatcg acggcgcaac gcgggcgggg 3647161 atgaactgca cagcagcgtc gatcgcgtcg gccgccatta ctcggtaggt ggtgtacttg 3647221 ccaccggcga tggccaccag gcccgccgcc ggcacagcca cggcgtgttc ccgggacagc 3647281 ttggaggtgt cgtcgctttc cccggcaagc agcggccgca gcccggcgta cactccgtca 3647341 atgtcggcgt gcgtcaacgg ggtcgccaac acggcgttga cagtgcccag gatgtagtcg 3647401 atgtcggcct tggtggccgc ggggtgcgcc aggtcgaggt tccagtcggt atcggtggtt 3647461 ccgatgatcc agtgacttcc ccacggaatg acaaacatca ccgacttctc cgtgcgcagg 3647521 atcatcgcga cgtcactgac aatccggtcc cgcggcacca ccacatgcac gcccttggat 3647581 gcgcgcacct ggaagcgccc gcgctgtttg gacaacgctt gaatctcatc ggtccagacc 3647641 ccggtcgcgt tgaccacgac gtggccgcga acctcggcaa ccgcgccgtt ctcggagtcg 3647701 cggacgccca cgccgatcac ccggtcaccc tctcgcaaca aggccactac ctgggtggag 3647761 cagcggacaa ccgcgccgta atgcgccgcg gtgcgcgcga ccgtcatggt gtgccgggcg 3647821 tcgtcgacga cggtgtcgta gtaacggata ccaccgatca gcgagctgcg cttcaagccg 3647881 gggctcagtc gcagcgcacc ggcgcgagta aaatgccgtt gcgccggaac cgatttcgcg 3647941 ccacccagcc ggtcgtaaag aaagataccc gcggcgatgt agggacgctc ccaccagcgt 3648001 ttggtcagcg ggaacaaaaa cggcagcggc ttgaccaaat gcggtgccag cgtggtcagc 3648061 gacagttcac gttcatagag cgcctcacgc accagcccga actccagttg ctcgaggtag 3648121 cgcagcccgc cgtggaacat cttcgaggag cggctcgacg tgccggaggc caagtcccgc 3648181 gcctcgacca acgccacctt gagcccacgg gtggcagcat ccaaagcgca tccggagccc 3648241 actactccgc cgccgatcac cacgacgtcg aattgctcgg ttccgagtcg cttccaggcg 3648301 accgcgcgct gtgcaggtcc cagcgccgcg gcgggccacc cctgcccgcc gtccggtgcc 3648361 tggattgggt tgctcacgaa accggctcct gtcagttact cgtcggtagg tggtgtggca 3648421 ccaaggctag ttgttcagcc gcgtcttgag ctgccgttca gtccagatcg tcgtgcgcca 3648481 tcagccggcg ggccgcctcg gttatcgaac ccgacaacga tgggtaaacg gccagtgtct 3648541 gggccagctc gttgacggtg atgcggttct gaacggctac ggcgatgggc aggatcagct 3648601 ccgatgcgat cggcgccacc accacgccgc cgatcacaac gccggtggac cgccggcaga 3648661 agatcttgac gaacccgtga cgcatctccg acatcttggc gcgcgcgttg gttcgtaacg 3648721 gcagcatgat ggtccgggcg gccaccgaac cggcgtcgat gaccgattgc ggcaccccga 3648781 ccgcggcgat ctcgggcctg gtgaaaaccg tcgcggccac cgtgcgtaac cggatcgggc 3648841 tgacgccctc ccccagcgcg tggtacatcg cgatgcggcc ctgcattgcg gcgaccgacg 3648901 ccaggggcag caaacccgtg cagtcgcccg cggcgtagat gccggtcgcc gacgtccgcg 3648961 acacccggtc cacggtcagg taattgcccc ggccaagctg gatgccgacc cgttccaggc 3649021 ccaggccgct ggtgttgggc accgacccga tggtcatcag ggcgtggctg ccctcgacgg 3649081 tgcgaccgtc ggtcatcgtg acgagcaccc cggccccggt gcgggtgacc gatgctgccc 3649141 gggcattttt gaacagccgg actccccgtt cggcgaacga ctcttccagg accagcgcag 3649201 cgtcagcgtc ctcatacggc agcacgtggt cctggctggc caccaccgtg accggcaccc 3649261 ccaattcggt ataggcgtcc acgaactcag caccggtaac cccggagccc accacgatga 3649321 ggtggtcggg caacgcgtcc aagtcgtaga gctgccgcca ggtcagaatg cgctcaccgt 3649381 ccggctgggc cgacggcagg atccgcgggc tggcgccggt ggcgaccagc acgacgtcgg 3649441 cctcatgctc actggtggag ccgtcggcgg cggtcgcctt aatgcgatgg cgcgccagac 3649501 ccggtgtgga gtcgatcaac tcgccccggc cggcgatcac ctgaaccccc atgctgagca 3649561 gctgggcggt gatgtcggcc gactgtgcgg cggccagcgt cttgacccgg gcatggattt 3649621 gcggcaacga gatcttggcg tcgtcgaagt cgatatgaaa gcccaggtgc ggcgctcggc 3649681 gcagttcggt acgcagcccg gtggaggcga tgaacgtctt cgacggcaca cagtcgtcca 3649741 gtacggcagc cccgccgatg ccgtcgcagt caatcacggt aacttgggct gtttccgggt 3649801 gtgaggtggc ggccaccagt gcggcctcgt aaccggccgg gccgccaccg aggatcacga 3649861 tgcgggtcac cacagcccat aacctagctc ggcgacgatg cacgccgcgc agcggcgtga 3649921 ggaggagccg agcagtccaa cacagctcgg cgacgatgca cgccgcgcag cggcgtgagg 3649981 aggagccgag cagtcaagca cagcttgacg atgacccgca ccgcagcgcg gcgcgatggg 3650041 taccacccga gcccccgccg tctaagcttt cccccgtgcc gctctacgcc gcctacgggt 3650101 cgaacatgca tcccgagcag atgctcgagc gcgcacccca ctcgccgatg gccggaaccg 3650161 gctggttacc cgggtggcgg ctgacgttcg gcggcgagga catcggctgg gaaggggcgc 3650221 ttgccaccgt cgtcgaagac ccagattcga aggtgttcgt cgtgctctac gacatgaccc 3650281 cggcggacga gaagaacctt gaccggtggg aaggctccga gttcggcatc caccagaaga 3650341 tccgatgccg cgtggagcgc atttcctcgg acaccacaac ggatcccgtc ctcgcgtggt 3650401 tgtacgtttt ggacgcctgg gagggtggcc tgccgtcggc ccgctatcta ggtgtgatgg 3650461 ccgatgccgc tgagatcgcg ggcgcgccaa gtgattacgt acatgacttg cgtactcgcc 3650521 cggcccgcaa catcggcccg ggaactattg cctaattatc gcgagcgccc aggctaatgc 3650581 gcggcggcct gctcgatgat gttgaccatc acccgcagcc cgatcgccag ggctcgctcg 3650641 tcgatgtcga acgtcggctg atgcaggtcc aactgcagtc cgtcaccgga ccacacgccc 3650701 agtcgagcca tcgcgccggg aacctcctcc aaataccagg agaagtcctc accaccgccg 3650761 gactgccggg tatcggccag cacacctggg ccaatagcct caatagcgtg ggcgagaatg 3650821 cgtgtcgaga tttcctcgtt gaccaccggc ggcacccccc gacggtattg cagcgtgtgc 3650881 tcgatcgcca acggtaatag caacgccgaa atggcttggc ggacaagctc ctcaaggtca 3650941 acccaggtct gccggctggc cgtgcgaaca gtgccggaca gaactccggt ttgcggaatg 3651001 gcgttggcgg ccatacccgc gttgaccgcg ccccacacca gcacggtgct gttacgtggg 3651061 tcgatgcgac gcgacagcac cccgggcagc ccggtgacca gcgtgccgag cccgtagacg 3651121 aggtcggcgg tcaagtgtgg acgcgacgtg tgcccgcccg gcgaatacag cgtgatttct 3651181 atcgagtcgg ccgccgacgt gatggggcct tgccgaacgg cgaccttgcc gacttcaagc 3651241 cggggatcgc agtgcagggc gaagatccgc gacaccccgg ccaacgcgcc ggccgcgatc 3651301 gcgtcgatgg caccaccggg catcagttcc tcggccgcct ggaagatcaa ccgcaccccc 3651361 accggcagct ccggtaccga agccaatgcc aatgcggcac ccagcaggat cgcggtgtgc 3651421 gcatcatggc cacaagcatg cgcgacgttg ggcatggtcg aggcgtaggg cgcgccggtc 3651481 cgctcggcca tcggcagcgc atccatatcg gcgcgcagcg cgatccgcgg ctgatgctga 3651541 ggaccgaagt cgcaggtgag tcccgttcca ccgggcagca ccttggggtt cagccccgcg 3651601 tcggctaacc gctcggcgac gaactgggta gtggcgtatt cctgacggcc caactccgga 3651661 tagcggtgga tgtgccggcg ccagccgacc aggtcgtcgt ggtgggcggc tagccatgat 3651721 tcggcggcgt cggcgaggct catcgcgccg ccctgcgctg ctgcgcggcc agcacccggt 3651781 cacgctcatc aggagtctgc gcgagacgga caaccgtgcg tgccaacatg atcgcgccgt 3651841 caaccaccgc gcggtcggcg ctggcaccag cggaagcgac ggtgaaggcc cgttggtgca 3651901 ccgtcgccgc gccggcgtcc aggccgatca ccggatggat cccgggcagc acctgcgtca 3651961 cgttgcccat gtcggtgcta cccagcggca gctctgcctc caaggctggc agcaacggct 3652021 cgcgccccag ccgctgcatc tcctcccggc acacgtcagc cagccacggg tcgggtttga 3652081 gctccgcgta tgccggtgca gcctcgtcga tttcgtattc gcacccggcg gccagcgcgc 3652141 cggccgcaaa gcaggcgaac attctggtct gcagctcgcg cagcgaatcc gattcgaccg 3652201 cacgcatcgc atactgcagc ctcgcctgcc cggggatgac attgaccgcc tgcccgccgt 3652261 cggtcacaat gccgtgcacc atttgcccgg gcgccaattg ctgtcgaagt accccaatag 3652321 cgacctgcgc cacggtcacg gcgtcggcgg cgttaacccc taggtgcggc gcgacggccg 3652381 cgtgcgattc cttaccccga tagcgcacgg tgacctcgga cagggccagt gatcgtgcgc 3652441 cggcgatatc ggtcggcccg ggatggacca tcacggccac cgcaacgtca tcgaacgtcc 3652501 cggcctgcag catcagcgcc ttaccgccgc cggactcctc ggcaggggtc cccagcagag 3652561 ccacggtcaa gcccaggtcg tccgccacct cagccagtgc cagcgcggtg cccaccgcgg 3652621 aggccgcaat aatgttgtgc ccgcaggcgt gtccgatccc gggaagcgcg tcgtactcgg 3652681 cgcacactcc gacaaccaac ggtccgctgc cgtagtcggc gcgaaacgcc gtgtccaacc 3652741 caccggcggc cgtggtgatc tcgaaaccgc gttcggcgac cagcgcctga gccttggcgc 3652801 agctgcgatg ctcggcgaac gccagctcgg gctcggcgtg gatggcatgg gacagctcga 3652861 ccagctcgcc accacggcgc cgcaccaatt cctcgacgcg gtcggatgcg ctggctgctg 3652921 gcatgctcgc agtatctcat cgacgagcac ccgctccccg gcgagcggct cagttaagct 3652981 cgcccagtgt ggctgacccg cgccccgatc ccgacgaact ggcccggcgg gcggcgcagg 3653041 tcatcgctga ccgcaccggg atcggcgaac atgacgtcgc ggtcgtgctc gggtcgggat 3653101 ggttaccggc cgttgcggcg ttgggctccc cgaccaccgt gctgccgcag gccgaactgc 3653161 ccgggtttgt gccgccaacc gcagccgggc atgcgggcga gctactgtcc gtgcccatcg 3653221 gtgcgcaccg ggtgctggtg ctggccggtc gcatccacgc ctacgaggga cacgacctgc 3653281 gctacgtcgt gcatccggtt cgggcggccc gtgcggcagg ggcgcagatt atggtgctca 3653341 ccaacgccgc cggtgggctg cgggcggacc ttcaggtcgg ccagccggtg ctgatcagcg 3653401 atcacctgaa cctgaccgca cgttcgccac tggttggcgg ggagttcgtc gacctgaccg 3653461 acgcctactc accgcgactg cgggaactcg cccgccaatc cgacccgcag ctggccgaag 3653521 gcgtctacgc cggcctgccg gggccgcact acgagacacc ggcggagatc cggatgttgc 3653581 agacactggg cgccgacctg gtcggcatgt ccacggtgca cgagaccatc gcggcccggg 3653641 cggcgggcgc tgaggtactg ggcgtatccc tggtgacaaa tctggcggcc gggatcaccg 3653701 gcgagccgct gagccacgcc gaggtgctcg ccgccggagc cgcatcggcg actcggatgg 3653761 gcgcgctgct agccgacgtg atcgcccggt tctaagccgt gacgccagag aattggatcg 3653821 cccacgaccc ggacccgcag acggccgccg agctcgccgc ctgcggcccc gacgagctga 3653881 aagcgcggtt cagccgccca ctggcgttcg gcaccgcggg gttgcgcggg cacctgcggg 3653941 gcgggccgga cgcgatgaac ctggcggtgg tgttgcgcgc cacctgggcg gtggcacggg 3654001 tgctcacgga tcgaggtctg gctggttcgc cggtgatcgt ggggcgcgac gctcggcacg 3654061 gctcaccggc gtttgccgct gcggccgccg aagtgcttgc cgccgcaggt ttttccgtgc 3654121 tgcttctgcc cgatcccgca cccaccccgg tggtggcgtt cgcggtgcgg cacaccggcg 3654181 ccgccgctgg gatacagatc acggcgtcac acaacccggc gaccgacaac ggctacaagg 3654241 tctatgtcga cggcggcctt cagctcctcg cccctaccga ccggcagatc gaagccgcga 3654301 tggccaccgc gcccccggcc gatcagatcg ccaggaagac cgtcaacccc agtgaaaacc 3654361 gcgcctccga tctgatcgac cgttatatcc agcgtgcggc cggggtccga aggtgcgccg 3654421 gttcggtccg ggtggccctg acgccgctgc acggggttgg cggggcgatg gccgtcgaga 3654481 cccttcggcg agccggtttc accgaggtgc ataccgtggc gacgcaattc gcgccgaatc 3654541 ccgacttccc caccgtgaca ttgccgaacc ccgaggagcc cggagccacc gacgcactgc 3654601 tcaccctggc taccgacgtg gacgccgacg tcgcgatcgc gctggatccc gatgcggatc 3654661 gctgcgcggt cgggataccc acggtgtcgg gatggcggat gctgtccggt gacgaaaccg 3654721 gttggctact aggtgattac atcttgtcgc aaaccgacga ccgggcgtcg ccgccggaaa 3654781 ccagggtggt ggccagcacc gtggtgtcgt cgcggatgct ggcggcgatc gccgcgcatc 3654841 acgctgccgt gcacgtggag accctcaccg gctttaagtg gctggcgcgc gccgatgcga 3654901 acctgcccgg caccctggtg tacgcctacg aggaagcgat cgggcactgc gtcgacccca 3654961 ccgcggtgcg tgacaaagac ggcatcagcg ccgcggtgtt ggtgtgcgat ctggtggccg 3655021 cgctcaaagg ccagggtcgt tcggtgaccg acgcgctcga cgagctcgcc cgatgctacg 3655081 gcgtgcatga ggttgccgcc ctgtcacgcc ccgtgggcgg cgccgtcgag accaccgacc 3655141 tgatgcgacg gctccgcgag gacccgccgc gtcggctggc cggtttcccc gccacggtca 3655201 ccgatatcgg cgacacgctg atcctcaccg gcggcgacga caacatgttg gtcagggtgg 3655261 cggtgcggcc ttctggaaca gaaccgaagc tgaagtgcta cttggagatt cgctgcgcgg 3655321 tgaccggtga cctaccagct gcccgacagc tggtgcgggc gaggatcgat gagctgtcgg 3655381 ctagcgtgcg gcggtggtgg tgactcagcg cgggccgaac tggcgatcgc cggcatcgcc 3655441 gagaccgggc acaatgtagg cgacctcgtt aagcccttcg tcgatggccg cagtgaacaa 3655501 ccgcacgttt ggcgcagcct tctgcagcgc cgcgattcct tctggcgccg caaccacaca 3655561 cagcaccgtg atatccgctg caccgcgcga gatcagcaga ccgagggtgt gcgtcatcga 3655621 cccgccggtg gccaccatcg ggtcaagcac catgaccggt acatccgtca ggtcgtcggg 3655681 cagcgagtcc agatacggca ccggctggtg ggtttgctcg tcgcgggcga caccgacaaa 3655741 gccaacgtgc gcctccggca aggcggcatg cgcctcgtcg accatcccca accccgcccg 3655801 caacacagga accagcaggg gtggcttggt tagccgcgac ccgaccgtct cggccagcgg 3655861 cgtacggatc gggactggct cgcagggcgc atcgcgggtg gcctcataga tcaacagcag 3655921 cgtgagctcg cgcagcgctg cccggaagcc ggcgttgtcg gtgcgttcgt cacgcagcgt 3655981 ggtcagtcgg gccgcggcca gtgggtggtc aacgacatgg acctgcacgg cgttgaaccc 3656041 tatataacaa tcgtggctcg gtcccctaaa agggggctga tacgggtgcg tccatccgcg 3656101 cgaccggtca accccgtcca tatactcccg gcatgctccg cggaatccag gctctcagcc 3656161 ggcccctgac cagggtatac cgtgccttgg cggtgatcgg tgtcctggca gcatcgttgc 3656221 tggcctcatg ggtcggcgct gtcccacaag tgggtctggc agcgagtgcc ctgccgacct 3656281 tcgcgcacgt ggtcatcgtg gtggaggaga accgctcgca ggccgccatc atcggtaaca 3656341 agtcggctcc cttcatcaat tcgctggccg ccaacggcgc gatgatggcc caggcgttcg 3656401 ccgaaacaca cccgagcgaa ccgaactacc tggcactgtt cgctggcaac acattcgggt 3656461 tgacgaagaa cacctgcccc gtcaacggcg gcgcgctgcc caacctgggt tctgagttgc 3656521 tcagcgccgg ttacacattc atggggttcg ccgaagactt gcctgcggtc ggctccacgg 3656581 tgtgcagtgc gggcaaatac gcacgcaaac acgtgccgtg ggtcaacttc agtaacgtgc 3656641 cggcgacact gtcggtgccg ttttcggcat ttccgaagcc gcagaattac cccggcctgc 3656701 cgacggtgtc gtttgtcatc cctaacgccg acaacgacat gcacgacggc tcgatcgccc 3656761 aaggcgacgc ctggctgaac cgccacctgt cggcatatgc caactgggcc aagacaaaca 3656821 acagcctgct cgttgtgacc tgggacgaag acgacggcag cagccgcaat cagatcccga 3656881 cggtgttcta cggcgcgcac gtgcggcccg gaacttacaa cgagaccatc agccactaca 3656941 acgtgctgtc cacattggag cagatctacg gactgcccaa gacgggttat gcgaccaatg 3657001 ctccgccaat aaccgatatt tggggcgact agccgccgtc gctattctgt gccgcatggt 3657061 tgctgacctc gtacccatcc gcttgagcct gtccgctggt gaccgctaca cgctgtgggc 3657121 tcctcgctgg cgggatgccg gcgacgagtg ggaggcgttc ctgggcaaag acgacgacct 3657181 gtatggcttc gagagcgtct ctgacctggt cgcgttcgtg cgcaccgaca ccgagaacga 3657241 cctggtcgac cacccggcat ggcaagacct gaccggagcc cacgcgcaca acctcaatcc 3657301 ggccgaagac aatcagttcg acctggtcgt cgtcgaggaa ctgctggctg agaagccgac 3657361 ggcggagtca gtggccgcgc tggccgcctc attggcgatc gtatccgcca tcggatcggt 3657421 gtgcgaactg gcggcagtgt cgaagttctt caacggcaat cccatcctgg gcacggtttc 3657481 cggcgggctc gaacacttca ccggaaaagc cggcaataaa cgctggaatt cgattgccga 3657541 ggtcatcgga cgcagctggg acgacgtgct cgcggccatc gacgagatca tcagcacccc 3657601 cgaggtcgac gctgagctgt cggaaaaggt cgccgaggag ttggcggagg agcccgaggg 3657661 cgccgaggaa gtggcggcgg aggtggaggc cacgcaggac acgcaggagg cggccgagtc 3657721 cgacgacgag gaagccgacg cacccggtga cagtgtcgta ctgggcggcg atcgggactt 3657781 ctggttgcag gtgggcatcg acccgatcca gatcatgacg ggcaccgcca ccttctacac 3657841 gcttcgctgt tacctggatg atcgaccgat cttcctgggc cgcaatggtc ggatcagtgt 3657901 gtttggctcc gagcgggcat tggcccgcta tcttgccgat gagcacgacc acgacttgtc 3657961 ggacctgagc acctacgacg acatccgcac ggccgccacc gacggctcgc tggcggttgc 3658021 cgttaccgac gacaacgtct atgtgctcag tgggctggtc gacgattttg ccgacgggcc 3658081 ggacgcggtg gaccgtgagc agctcgacct ggccgtcgag ctgctccgcg atatcggcga 3658141 ctactccgag gacagcgcag tcgacaaggc actcgagaca acccgcccgc tgggccagct 3658201 ggtggcctat gtgttggacc cccactcggt cggcaaaccc acggccccgt atgcggcggc 3658261 tgtccgtgaa tgggagaaat tggaaaggtt cgtggagtcg cggctcaggc gcgaataggc 3658321 accgtcagcc ggcgaaggct agccgccgcg gcgcttgccg atgtccaggg cacacgcggc 3658381 gaggatcgca tcccagtctt cgatgttgaa atggcccttg ccgtgcgccc agtgcaaatc 3658441 aacgtgcgga atcgcgcgct gcaggtattc gcccatggcg cgtggcacga aggagtcacg 3658501 atcacccagc cagatatggg taggcacggc cacctcggcg aggtcgaaac cccacggccg 3658561 aaattgcaga aatgattcat aggctgcgcc gcggctgccc tgtcggaacg cttcgagctg 3658621 gatggcgcgc aggtggcggc cgaagcgttc gtcgctcagc aggtgcttgt cggccgcggg 3658681 gaccgcagcc gccaacaacg tagaaaacag cccgggcgtg tatttcgcgc accagccgag 3658741 cggggcaaac aacgcaccga atagccgcgg cccgcttcgc gccaaccgcg cgtagcaccg 3658801 atcggccgcg ttgaggctgc gcatgatatc cggcgtcgcc agtggacccc atggtccgag 3658861 cgcgccgacg aacgctagtc gggtccgcgg gatgacggca ccgcaggcga ataggtgcgg 3658921 tcccgcgccc gaatgcccga ccaccccgaa ctcctccagc tcgaacgcgt cagccagggc 3658981 acacacgtcc gcgggccaat cgcgaaaatt gcgtcccgct tgaaaggtgg agcgcccgta 3659041 cccgggccga tcaatcgcta tcagtcggaa gccggtgcgc cgcgcggcac catcggcgaa 3659101 ggccccctcg agccgcgaac ttggcgtgcc gtggaagtag aacgctgggt agccggtgct 3659161 atcaccccat tccaggtagg caagcgcccg cccgtcgggc agcatgagca catccgcctc 3659221 gtcggtgcga atgcgctcgg gcagcgatgg cggtggcccg gtcaagagca caccagcgat 3659281 ggtatgccgg tcagagtcga ttcagcgcgc gtgccatgca cgagtcctcg aggaaccgat 3659341 agcgcctagg ctgggactgc cgcaaccaca gccgatccag cgccgaacgc acgatccggc 3659401 gaacgggtgt gcgggtaaca gccttgtcga tgtcgatggt ggaggcgctg tcgccgttca 3659461 tgacaggttc ccttcaagcg tcctgcaagc ggttgccaaa gccgtcgcct attttctgtc 3659521 atcggacggc gcgatccatc ggcacgggag cgtaaatctg ccccgccggg ggtcgtagct 3659581 tgccgggggc acgcccgggt ttatacgcgt attcgctgat gcggcccggt caacgagcgc 3659641 tatgcgccgc caccggcagc cgggggcggc ggcgcagcac cgggatcgtc aagcacggga 3659701 ccttcgagga tgggtccggg gtagtcgcgg ctgtggtcgg ggccgtcgct gtcgcggtgg 3659761 aagtcgtcat ggcaggtgta gggatcccag ttgggccccc atgcggggtc gaaaggctgc 3659821 cccgggcacc agtagtagtc gggcaccggc gcggtttggg ctgcggactg cgcgccgacc 3659881 ccgagacccg ccacacccgt ggccaggatg cacgccgcca gcatgagcgt gcggcacgcg 3659941 aaccggtaca tgcgatgacg gtacgaaagc gatctggcaa gcaactggac gctaggtgcg 3660001 atataccaga gaacttgctg attactcgct gtgacccatg agcgccgcga accgcggctt 3660061 gatcacttcg tcgattatcg ccagccgctg gtcgaacgga atgaacgcgg atttcatcgc 3660121 attgacggtg aagcgcgcca ggtcgctcca gccataaccg aaagcctcta ccaaacgatg 3660181 catttcgagg ctcatcgagg tgtcgctcat cagccggttg tcggtattga cggtcacccg 3660241 gaaccgggcc cgagccagta ggtcgaacgg atgctcggcg atgcttgcga ccgcgccggt 3660301 ctgcacgttg gagctggggc acagctccag cggaattcgc ttgtcccgca ggatagctgc 3660361 cagccgaccc aactggaaac cgccgtcggc atccacgtcg atgtcgtcga cgatccgcac 3660421 cccgtgaccc agccggtcgg caccgcagaa ggcgatcgcc tcgtggatgg acggcaaccc 3660481 gaacgcctca ccggcatgaa tcgtgaagcg cgcgttgtga tcacgcatgt actcgaatgc 3660541 atccaagtgc cgggttggcg ggtggccggc ctccgcgccg gcgatgtcga atccgacaac 3660601 tcccttgtcc cggaaccgga tcgccaactc tgcgatctcc cgggacattg cggcgtgccg 3660661 catcgcggtg accagacagc ggacggtgat gggttgacca tcggcggcac acgccttctc 3660721 gccggcggcg aagcccgtca gaacggtgtc gacgacgtcg tcgaacgaca gcccgcagct 3660781 gatgtgcagc tccggcgcga accgcacctc ggcatagacc accgaatcgg cggccaggtc 3660841 ttgcgcgcat tcgaaggcga cccgatacaa ggcctcggga gtctgcatca ccgccaccgt 3660901 gtgcgaaaac ggttccaggt agcgctccag cgagccgctg tgcgactggg tgcgaaacca 3660961 acttgccagc gcgtcgacgt cagttgccgg caggtcgtcg tatccgacct gcccggcaat 3661021 gtccagcacg gtggccggcc gcagcccgcc gtcgaggtga tcgtgcagca acgccttggg 3661081 ggctagcctg atcgtctgca gggtcggcgc agcggtcatc agacgatccg atcgacgatt 3661141 agcggccgca cctgcggcgg actgtcccgg atactccaac cgccggccag ctcggctcgc 3661201 gccgcaccaa agcgctcggg agcattcgtg tagagggtga acaacggctc accgaccaca 3661261 accggctccc ccgggcggcg atgaatccgc acccccgcac cgtgctgtac gcgtgcgccc 3661321 gggcgggacc tgcccgcacc gagtcgccat gccgctaacc ccactgccat cgcatcgatg 3661381 tcgcccattg tgccgctcgc gcccgccgtg acggtttccg aatgcgaacc gatcggcaac 3661441 ggtttcgaca agtcacctcc ctgcgcggca accaaccagc gaaaccggtc cattgcggtg 3661501 ccgtcccgca gcgtctgggc cgggtcccgg ccgtggatcc cggcaagctc gagcatctcg 3661561 ccggccagcc gcaacgtcag ctccaccacg tcgggcggtc cgccgccggc cagcacctcc 3661621 agcgcctcgg ccacctcgag cgcattgccg acggttcgac ccagcgggca gttcatctcc 3661681 gtcagcaggg cacgggtggg cacgccatgc gccgcgccca gttcgaccat ggtgtgcgca 3661741 agttcgcgcg cctgcactgg cgacctcatg aaggccccgg aaccaacctt gacgtcgagc 3661801 accagtgcac ccgcaccctc agccagcttc ttgctcataa tcgaactggc gatcaacggc 3661861 agcgattcga cggtgccggt aatgtcgcgc agcgcataca gcttggcatc ggctggcgcc 3661921 agctggccgg cggcgaagat cgcggcgccg acgtcgcaaa gctgctcgcg cacccgctgg 3661981 ttggacagat tcgcggtgaa cccggtgatg gattccagct tgtccagggt gccgccggtg 3662041 tggccgagtc cgcggcccga cgcctggggc actgcgccac cgcaggcggc gacgacgggc 3662101 accaatggca gcgtgatttt gtcacctacc ccgccggtgg aatgcttgtc cacggtcgct 3662161 agtggcagat cggtgaaatc cagccgggca cccgaggcca gcatggccgc cgtccatctg 3662221 gcgatctcgc cgcggtccat gccccgccaa acgatcgcca tcagcagcgc cgacatctgt 3662281 tcgtcggcga cccggccgtc ggtataggcc ttgacgaccc agtcgatggc ggcgtcggac 3662341 aaccggccgc cgtcacgttt ggtgcggatg acggtcgggg cgtcgaatgc gaagtcggtc 3662401 accggcgttc ccgggggagg tcgtcgaggc cgaaggcgtc gggcagcagg tcgccgagcc 3662461 ggcggggtcg caccggatgg tcgatcagta gctcggaacc cccgtgttcg agcagcacct 3662521 gacggcatcg cccgcacggc atcagcacgg atccatggcc gtcgacgcag gccagcgcga 3662581 gcagccggcc gccgccggtc gaatgcaggg cgcacaccac cgcacattcg gcgcacaaag 3662641 tcaagccata cgagacgttt tccacgttgc atccggtcac cacgcgacca tcgtcgacca 3662701 gtgcggccgc acccaccgca aaccgcgaat acggcacata ggctccggct gctgcctggg 3662761 ttgcattgcc ccgcagcata ttccaatcga catcaggcat tcggcaaccc cgctcgtcga 3662821 tgggccgact aagaaaagcc agcctaaccc cggatccaca cacgatcccg atcggactgt 3662881 tcgacaccgc gggcaacctg gccaagttaa gctcgattgc ccggctctag ctgttcgata 3662941 gtgcttttaa ggggtttgcc agcggtgaat acaacggcga caaccgtctc gcgcgggcgg 3663001 cggccacctc ggaccctgta tcggggagat cccggtatgt ggtcgtgggt atgccatcgc 3663061 atcagcggcg cgacgatttt cttcttcctg tttgtccatg tcctggacgc cgccatgctg 3663121 cgggtgagcc cgcagaccta caacgcggtg ctggcgacct acaagacccc gatcgtcggc 3663181 ctgatggagt acggcctagt cgccgcggtc ctttttcacg cactgaacgg gattcgggtc 3663241 atcttgatcg atttctggtc ggaaggcccg cgctatcagc ggctgatgtt gtggatcatc 3663301 ggcagcgtct tcctcttgct gatggttccg gcaggcgtgg tggtgggcat ccacatgtgg 3663361 gagcacttcc gatgagcgcc ccggtcagac agcgcagcca tgaccgtcca gccagcctgg 3663421 acaacccacg atcaccacgg cggcgtgccg gcatgcccaa cttcgagaaa ttcgcctggc 3663481 tgttcatgcg gttttccggt gttgtgttgg tgttcctggc gatcgggcac ctgttcatca 3663541 tgctgatgtg ggacaacggc gtgtatcgcc tggacttcaa cttcgttgcc caacgctggg 3663601 cgtcgccgtt ctggcagacc tgggatctgc tgttgttgtg gctggcgcag ctgcacggcg 3663661 gcaacggtct gcgcaccatc attgacgact acagccgcaa agacaccacc cgattctggc 3663721 tgaactcgtt gctggtgttg tccatgctgt tcaccctgat gctgggaacc tacgtgatag 3663781 tgacattcga cccgaacatc tcctgaaagg cccggaagga gcacatgatc acgccacctc 3663841 tcccccgcaa gcgggcggta cccccacctc atcgctgcgg ccccctcgtc gcttcgcggc 3663901 tgggggtgcc cccactgcat cgtcggcggc ggcgttgatc tgccaacacc gatacgacgt 3663961 ggtgatcgtc ggcgcgggcg gtgccgggat gcgcgccgcg gtcgaggcgg gtccgcgggt 3664021 gcgtaccgcg gtactgacca agctgtatcc cacccgcagc cacaccggcg cggcccaggg 3664081 cggcatgtgc gccgcgctgg ccaacgtcga ggacgacaac tgggagtggc acacgttcga 3664141 caccgtcaag ggcggcgact atctcgccga ccaggacgcc gtggagatca tgtgcaagga 3664201 agccatcgac gcggtgctcg acctggagaa gatggggatg ccgttcaacc gcacccccga 3664261 gggccgcatc gaccagcgcc gcttcggcgg gcacacccgc gaccacggca aggccccggt 3664321 gcgccgggcc tgctacgcgg ccgatcgcac cggccacatg attctgcaga cgctgtatca 3664381 gaactgcgtc aagcacgacg tcgagttctt caacgaattt tacgcgctgg atttggcttt 3664441 gactcaaacg ccgtcgggcc cggtggccac cggggtgatc gcctacgagc tagcgaccgg 3664501 tgacatccat gtctttcacg ccaaggccgt cgtgatcgcg accggcggct cgggccgcat 3664561 gtataagacc acgtccaacg cacacaccct gaccggcgac ggcatcggca tcgtgttccg 3664621 caagggactt cccttggagg acatggagtt tcaccagttt caccctaccg gcctggccgg 3664681 tctgggcatc ttaatctccg aagcggtgcg cggcgaaggc ggccggctgc tcaacgggga 3664741 aggtgagcgt ttcatggagc gctacgcccc gacgatcgtc gacctagcgc cccgcgacat 3664801 cgtcgcccgc tcgatggtgc tggaagtgct ggagggacgc ggcgccggac cgctcaagga 3664861 ctacgtctac atcgacgtcc gccacctggg cgaggaagtg ctcgaggcca agctgcccga 3664921 catcaccgag ttcgcccgca cctacctggg cgtggatccg gtcaccgagc tggtgccggt 3664981 ctacccgacg tgccactacc tgatgggcgg catcccgacc acagtcaccg ggcaggtgct 3665041 gcgggacaac accagcgttg tcccgggcct gtatgcggcc ggcgagtgcg cgtgcgtgtc 3665101 ggtgcatggc gccaaccggc tgggcaccaa ctcgctgttg gatatcaacg tcttcggtcg 3665161 tcgggccggc atcgccgccg ccagttatgc gcagggtcac gactttgtcg acatgccgcc 3665221 caacccggag gccatggtgg tgggctgggt cagcgacatc ctgtccgaac acggaaacga 3665281 gcgggtcgcc gacattcgcg gggcgctgca gcagtcgatg gacaacaacg ccgcggtgtt 3665341 ccgcaccgag gagaccctga agcaggcgct caccgacatc cacgcgctca aggagcgcta 3665401 ctcccgaatc acggtgcacg acaaggggaa acgcttcaac accgacctgc tggaagccat 3665461 cgagctggga tttttactgg agctggccga ggtcacggtg gtcggcgctt tgaatcgcaa 3665521 ggagtcccgc ggcggtcacg cccgcgagga ctatcccaac cgcgacgacg tcaactacat 3665581 gcgacacacc atggcctaca aggaaattgg ggccgataag gagggccccg agctgcgcag 3665641 cgatgtccgc cttgatttca aacccgtcgt gcagacccgt tacgaaccca aggaacggaa 3665701 gtactaatga gcgtcgagcc ggacgtcgaa actttggatc cgcccctacc gccggtaccg 3665761 gacggcgcgg tgatggtgac cgtcaagatc gcccggttca accccgacga gcccgacgcg 3665821 ttcgcggcca ccggcggctg gcagagcttc cgggtgccct gtttgcccag cgatcggctg 3665881 ctcaacctgc tcatctacat caagggctac ctcgacggca cgctcacctt ccggcgatcc 3665941 tgcgcccatg gggtgtgcgg ctctgatgcc atgcgcatca acggggtgaa ccggctggcc 3666001 tgcaaggtgc tgatgcgtga cctgctgccg aagaagaagg gcaaatcgtt gaccgtcacg 3666061 gtcgagccga tccgcgggct gccggtggaa aaggacctgg tggtcgacat ggagccgttc 3666121 ttcgacgcct accgggcgat caaaccgtac ctgatcacca gcggcaaccc gcccacccgc 3666181 gaacggatcc agagcccgac cgaccgcgcc cgctacgacg acaccaccaa gtgcatcctg 3666241 tgcgcgtgct gcaccaccag ctgcccggtg ttctggcacg agggcagcta cttcggcccg 3666301 gcggcgatcg tcaacgcgca ccgcttcatc ttcgacagcc gcgacgaggc cgccgccgag 3666361 cgcctcgaca tcctcaacga ggtcgacggg gtgtggcgct gccgcaccac gttcaactgc 3666421 accgaatcct gcccacgggg cattgaggtg accaaggcga tccaggaggt caagcgcgcg 3666481 ctgatgttca cccgctgagg gcttgcgcga gcagacgcaa aatcgcccga aaaccagtgg 3666541 ttttgggcga ttttgcgtct gctcgcgcag ccgggtctac agcgttgcca ggtgctgttt 3666601 ggttgcgcca ggaaccgcag tcaacgcaat cgactgatcg aaggtgacaa atcggccatc 3666661 atgagcgacc gcgagggcca gcaagtacgc gtcggtgacc tgtttggggc tgtgcaggcg 3666721 ggaacgatcg atgacctttg agtcgagaat gctgacggtg caggaccaga actcgtgata 3666781 gcgcgtgtgc gtcgcacgag ccaacaagtc gatggcatgg gctaccgaga ttgggctggg 3666841 atagcgcggt tggctgatga cgcggacgaa cccgttttgg gtgatcgcac aggaagccca 3666901 tccccgctcg atctgcccgg tgatccacgc tcgggcgcgc tcgtggtcga cgtgatcgcg 3666961 gtccaacagc gccagtagca cgttgacgtc caacagcgct cgcatcgatc acacggcctc 3667021 ctcgtcacga agccgatcga tcagcgcgtt cgataccgct ccaccgcgat gaggcagggg 3667081 ttcgaagcca tgaaaggcgt cctcctggct cgccgcaggc tggggattct ggttggttaa 3667141 cgcttgccgg gccagatccg acaggatttc acccgcggtg cgcttctccc tgcgtgcccg 3667201 ttccttcacg gccagcaata catcgtcgtc gatggacaac gtggtgcgca tgcatcagat 3667261 gctatcgcac caatctgggc gcaacgcgtc tacaggatgg ccagcgctcg cggcattgag 3667321 aatctccttc gtgggtgcac tcccacgcga ggtaggggcc gacgaccacc atctatgccc 3667381 ctggcaacgg tgagcgccgc gcgatcatga tccgcgacgg cgccgaatcg cagttaccct 3667441 gcccctcgtg tacaacggtg aagtcggcag gaagcagaca cgctggctct cccggcttga 3667501 cacgtcgctt cgcgctggct gtgcccgcct cggcgccact gagagccagc gactcccatg 3667561 ccaatacgcc gcctggcatc accgcctcac aggcgcggtg aaatatcgcc gcatcccaaa 3667621 agagcctgct gagcaccagc gcgaaacgcg tctcgccggg ttcccagcag cccaagtcgg 3667681 cctgcacgag gttgagccga tcggccacgc ctcgacgcac ggcctcgctg tccagctgca 3667741 gcagcgcgac atcggacaca tcgattgcgg tgacctggcg gccgtgggcg gccaacgcca 3667801 gtgcggtacc cgatcgaccg ctagctaact ccagaacggg accgtccgga acgcctgctc 3667861 tgaggacatc ggcgagccaa ggcaccgggg caaacggcgc gtgcgccgaa cccgcgcgtt 3667921 cgtatcgcgc gttccagtcg acgcggttgg ggtgctcccg cagcgccgga tccgtctgca 3667981 cgctcatggc cgattggcca cccactcaac accgtcgagt gcgaactcct tcttccatat 3668041 cggcacatcc tgtttgagcc gctcgatgca catgcgagcg gcgtcgaacg cggccgcgcg 3668101 gtgaggagcc gaagcaccga tgacaaccgc cgcatcaccg atgcgcaatt caccggtccg 3668161 gtgtgccacg gcaactcgca caccgtcggc ctgtcgttca cactcttcga tgatgtccat 3668221 cagcgtgcgg tgcaccatgg ccggataggc ctcgtagtac aacttggtca cttcgtggcc 3668281 gttgttgttg ttacgcacgg tacccacgaa gatgacggcg ccgccctggg aaggtccaga 3668341 tatcgcgttg agcacttcat cgacgctcag cggctcatcg gtgagccggc agtagacatc 3668401 ggagcccccg gcaacctgcg gtatgaacgc caccgtgtcg ccatcgtcga gaatcgttga 3668461 tgctggcgct atggattcgt taacggccat ccgcactcgc ttgcgaaaat cagcaagtgg 3668521 cggatagtcg atttgcaatt ggtcgactaa gccgtcgacg gtggtgccgc tttcgagtga 3668581 gatcttctcg tgagcgacct tgcacgcttc gcgaaccgcg ccaaagtaga gcacattgac 3668641 agtaatcatt caacatccat cctcggtgga gccaccatcg ctgggtttga cgtccgcgtc 3668701 gtgccgccgg taatgacccg atcggccacc gcttttttcg tccaatctga tatccgtgat 3668761 cgtcatggca cggtcgactg ctttgcacat gtcgtaaacc gtgagcgctg tcaccgtaac 3668821 ggcggtcaac gcctccatct ccacacccgt acgtgccacc gtggtcaccg tcgccgcaat 3668881 cgagagccgg tccgcgccct gcggctcgag cgtgacggtg accgcctcga tccccagcgg 3668941 gtgacacagc gggataagct caccggtccg tttggccgcc ataatgccgg ctatccgtgc 3669001 ggtcgctatg acatcgccct ttgccgcggt gccgtgacag atcatgtcca gggtcgacgg 3669061 tttcatcagg acggccccgg atgcccgcgc tcgccgcaag gtcaccgcct tcgccgacac 3669121 atcgaccatt cgggcggcgc cttgttcatc aaggtgggta agcaccccat cgtggtcgtt 3669181 caccgtgcca cctgctggct gcattgctca tcgtgcactg cgctgaaagc ctcggcgagg 3669241 tcgaagtcga cgcgagtcaa acagtgcatc tggcgcgtcc aacaagtcaa ccgcaccgac 3669301 cgcttgttat ggacacagat atcggggtga tgattgaatt tgtcggcgat ggcggcgatc 3669361 ttcgccacaa atttcatgga ttgatcgaag ctcccaaacc caaatgtgtg gcggagcttt 3669421 ccatcgacga gctcccaccc gggcagtgcc gtcaacctct cggacagctc cctgtcggtg 3669481 agagccgaca gcctgccgca gcacacgtcg tgaaggcttg actcctcaca tcgactgacg 3669541 ttgccgtgtg ccatggttcc ctcctggact tggttggatt gatggtggat gtcgcacctg 3669601 gtcgcccagc gagtgcgcgc atgcgggcat gggcaccaga ccggccggaa ttatccgcct 3669661 ctggtgtgca tctccagatg tgggtcgcgt ttcaagcggg acaccggaag aaagacctgt 3669721 cgcgttcgtt gggccaagcg ttgttctgcc ccacggttcg cgcggtcccg ccagccacgc 3669781 tgcagaatct gcacaacgtc attggcggac gctcccgcac gcacggattc ccgcaaattg 3669841 atgccggaca gggcgtaaag gcaatgtagc cagatgccat cggcggtcag tcgcgacctg 3669901 tcacaggttg cacagaatgg ctcggtggtc gaagcaatga tgccgaatgt tgttccgtcc 3669961 ggcaatctgt agcggttcgc gggcgcggaa tcgtacttcg gcaaggccgc gatgggaccg 3670021 tatttcttcc cgagagttga cagcatctgc gctttcgtga agaccttgtc catcgaccat 3670081 tgcgtcgcac cgccgacgtc catgtattca atgaatctca cctcggcatt gacgttgcga 3670141 gcgaattcga tcagatcgga caactcatca tcgttgaagc cacgtatcac cacggaatca 3670201 agcttcgtat cggtgaagcc cgccgccgca accgcctcaa tgccttcgat aactttatag 3670261 tgggtcccgc gctggctgat tgctttgaac cggtccggcc gcaacgtatc aagactaatg 3670321 gtgatgcgcc gcatgcccgc cgattttaac tttctggcct ggtccgcgag cagcacgcca 3670381 ttagttgtga tcgccaaatc ctgtaaaccg ctgccatcgc caactttcgc gcttataacc 3670441 tcaatgattg ccgccagatc cgagcggatc agcggttctc caccggtgag ccgaatttta 3670501 tcgacaccga cagcgataaa tgcatcgacg atcaggctga tttcgtccac agacagcaga 3670561 tccgcccgcg gcaaccaggc gtactcggcc tcgggcatgc agtagcggca gcggaggttg 3670621 cactgatcga tcacagaaag ccgcagatca cccatggtgc gaccacagcg gtccctgatg 3670681 ggggattcat tgatgcacaa gccggaccgg ctcaccgagg cgacatccgt gatctgatag 3670741 cgcgactgtt ccggtccgga caatgctgga tgaaaaaccc gagtcattgg atagctccgc 3670801 cgacaaggcc accgacggcg atgggcgacg ccaggcatcc ccgcagggat ctaggacatc 3670861 gggcgttatc gggggctggc cgcggcgatg gcgcccgcga agaccatcgc aacacgtttc 3670921 gcgtgcacac tggcaccaca tgcctccttc tcattcgcgc gctttccggg gttcacgttc 3670981 attactgttc aacaccggcc tcaatggaat ctcaatgcgc acggcgcacg accggattcg 3671041 gcggcgcatg tgggtccggc aaggcaacgg ccacccgttg gccgttcatc cacgacggtg 3671101 acccgaattc ccgacagtga tgtcgagaag ccaggtagct ccggttcggt gtcgagcacc 3671161 gggccggagg tcgccgcacg gcgtcatggt aggcgttgcg ggtgcgaccg caatcggtca 3671221 ccgccacgtt ccaaatcggt cacacttgac acccgtcatg tcggcggagc cggtattctt 3671281 cgaattcgag ctttccccat caggccctgg cgcccggtga atcgctcatc aaacggaccc 3671341 gcgaccgacg aggattgtga tggcccgaaa cgaactccgg ttcggtattc tcggaccgct 3671401 agagataagc gcaggtttcc gcagtctacc gttgggcaca ccgaagcaac gtgcggtctt 3671461 ggcgacgctg atcattcatc gtaatcgccc ggttgggatc gactcgttga tcgacgcggc 3671521 ttgggagcag gaccggccgg agggatcacg agcgaccgtg tatacgtatg tctcgaattt 3671581 gcgtcggttg gtaagcacca cgggggcgga ttcgcacagc atcttggcta gcgcaccgcc 3671641 agggtatcga ctcgccgttg ccgataacca atatgatgtg gcacgtttta tcagccaaag 3671701 gtcggccggg ctgcgcgccg ccgctgccgg ttctttcgaa caggccagtg accatttgtc 3671761 ggccgcgctg gccgagtggc gcggcccggt cctggacgat ctgcgtgaat tcagctttgt 3671821 cacccgcttg gccaactcat tggttgaaga caaaatcatc gcccacacag ctctcgcgga 3671881 ggctgaaatc gcctgcgggc gtgccgattc agtgatcagc gagctcgagg agctgatcct 3671941 ggagcatcct tatcacgagg ccctgtggcg gcaactcatc gccgcatact atgtctcgga 3672001 acgtcaatcc gatgccctgg acgcctaccg gcgattaaag accagcctgg ccgaagacct 3672061 cggcgtcgac ccgggaccca aggtacgcac gctatacgag caagtgctgc gccaacaagc 3672121 actggacacg cgagtcgtcg tccaggctgc cgcaggagat atcatcaggg ccctcgaaca 3672181 ctctcccggc atgaccgacc gttcgccgcg cgccgcaata cgcgacgccg cggggcaccg 3672241 gtctccactt ggccggttgc cccttcgtat cgggcgtagc aaaagtaacg acatggtgct 3672301 gccggacggc aaagtcagtc cctaccatgc cgttatcgtc aacaccggtg aaagcttcat 3672361 gatcaccgac ctgcgatcgg tcaacggcgt ctacgtgcgt gggcggcgca tcgcgaccac 3672421 agccaccctc aacgacggcg accacattcg catcggcgac catgaactca cgttcgaggt 3672481 cataccgcac gaatcgggcc gttagccggc gggtttgccc atccccggtt ctgcgcccac 3672541 ccgcgcctac ccttcggtag gcaggccggc tcgccgaccc gccgcggaca ccaggatgcc 3672601 taccgtgctg gcaccagcac actctcgtcg tattgcctca ccaggtcttc gaccacggtc 3672661 tgccacgagg catcgtccac gcagaggcgg ctggcggtaa gtactagccg gtcgggactt 3672721 ccggggagca gagaaaccgc gaacaacctg ccggtgtaca tatcgaagct cgcgctgtgc 3672781 cgtgcgatca cctccgcgac ggcggctccc gggggttcca ctccccagcc ccacccgccg 3672841 ccgcccgggc gagaagtcaa ggtgtcaacg cgcggctcga atacggtgcc aagggccgga 3672901 tgggcgtcga agaccgcctc caccgccgcg cggatacgcc cctggtccac ccgtgagttc 3672961 gcaatcagaa cgtcggtgtg cgcgccaggc tcgatgtcct cgagcctggg aagctcaacg 3673021 aaatacgaat acgacatgac cacagactcc ggcacacccc tcagcgaatt cttggcaact 3673081 tcttggacct gggccgagca gacgcaaaag caccccattt cggcacgaaa tgggggccct 3673141 ttgcgtctca gtgtcggtgt tgtgtgtgcc gcgaggtggg tgtgtcggtg tgacagacgc 3673201 cgtgtcgcgg tggtttgttc cggatcacct ggtgtctggc tcactttgcg tctgccgtcc 3673261 tcttggggtt ggcgttgagc agtattgccg gcactaggtg agaaggaccg gccggcgtga 3673321 cttgatagga gcgtggcttt cgccccgact gagatgtgtc cgccgaccgg cccaacctca 3673381 acaccccctc aagtgaagga ggcaaccacc atggttgttg ttggaaccga tgcgcacaag 3673441 tacagccaca cctttgtggc caccgacgaa gtgggtcgcc aactcggtga gaagaccgtc 3673501 aaggccacca cggccgggca cgccacagcc atcatgtggg cccgtgaaca gttcggcctc 3673561 gagctgatct ggggcatcga ggactgccgc aacatgtcgg cgcgtctgga gcgtgaccta 3673621 ctggcggccg gccagcaggt ggtgcgggta cccaccaagc tgatggccca gacccgcaag 3673681 tcggcgcgca gtcggggcaa gtcggatccg atcgatgcgc tggcggtggc gcgggcggtg 3673741 ctgcgtgaaa ccgacctacc cctggccacc cacgacgaga cgtcgcggga gttgaagttg 3673801 ttgactgacc gtcgagatgt ccttgtggcc caacgcacgt cggcgatcaa ccggttgcgc 3673861 tggctcgtcc atgaactcga tcccgagcgg gcaccggcag cacgctcgct cgatgccgcc 3673921 aagcaccagc aggccctgcg gacctggctg gacacccagc caggattggt cgccgaactc 3673981 gcgcgcgccg agctgaccga catcatccgg ctcaccggcg agatcaacac cctggcccag 3674041 cgcatcagcg cccgagtcca ccaggtcgcc cccgcactgc tggaaatccc tggctgcgcg 3674101 gagctgactg cagccaaaat cgtcggcgaa gccgccggag tgacccggtt caaaagcgaa 3674161 gccgccttcg cctgccatgc cgcagtggct cccatcccgg tgtggtcggg caacaccgcc 3674221 ggccagatgc ggctcagccg ctcgggcaac cgccagctca acgccgccct acaccgcatc 3674281 gcactgaccc aaatccggat gaccgacagc cggggccagg cctactacca aaggctgcaa 3674341 gacgccggga aaaccaaacg cgcagcacta cgctgcctca aacgccgcct agcccgcacc 3674401 gtcttccagg ccctgcgcac cgtccatcag cccagctccg aacacaccca acccgcggcc 3674461 gcttgccata ggagctattg ctcgtcacac ctcggcgagc cacctcgtct aacggatatg 3674521 acacagaaaa cccgcatcca gcccctacct cccaagcgag ccggcctgtt gatccgcgca 3674581 ctgtatcgga tcgccaagcg gcgcttcggc gaagttcccg agccgttcac ggtcaccgca 3674641 catcatcggc ggctgctgat cgccaatgtg gtgcacgaag ccctgctgca gcgagcgtcg 3674701 cggaagctac cgcccagcgt ccgtgagctg gcggtgtttt ggaccgcccg cagcatcggc 3674761 tgctcgtggt gcgtggactt cggagccatg ctgcagcgcc tggacgggct ggacgtggac 3674821 aggctcacgg acatcgacaa ttacgccacc tcatcgaaat tcagcgacga cgaacgcgcc 3674881 gccatcgcct acgccgaggc gatgaccgca gacccgcatt cggtgaccga cgagcaggtg 3674941 gccgacctgc gggcccgctt cggcgaggcc ggcgtgatcg agctgactta ccagatcggc 3675001 gtggagaaca tgcgagcccg gatgaattcg gcgctgggca tcaccgagca aggcttcaat 3675061 tccggtgatg cctgccgcgt cccgtgggct gcgcccgacg ttccttcagc ggagagccgg 3675121 tgaacttgtc gggattggcg atatcccaca gcgcgcacac ctttccgtcg cgcacggtta 3675181 tcgcggtgat ccgcggcgcc atcgcccgat acccgtcgac cccgggtaag cccgccgtgt 3675241 aggcgccgag ctctccgttg accagcgcca gctgattcgc gccgaagagc cccgggccgt 3675301 aacgctggac cagcccgagt atgaaccgga ccaccttgtc ggatccgcgg acggcccgta 3675361 ccgctgtggg cgccttgcca ttcgaatcgc cggtaaacgt cacgtcggga tgcagcagcg 3675421 acaccaccgt gtccaggtca ccagcggcca tggcggccat cagccggccg accacctcgt 3675481 tgtgggccgg atccggatcc cccgatatca gggcgggctg cgccgtgacg gccttgcggg 3675541 cccgcgacgc cagctggcgc gcggcggcct cgctggttcc cagcacctcg gccacttcgg 3675601 caaacggcac ggcgaacccg tcgtgcagca cgaacgcgac ccgctgatcg gggcgcagcc 3675661 gctccagcac caccatggcc gcgaacctgg cgtcctcggc ggccaccacg gcggccaacg 3675721 gatcggtcgc gtccaagccg gtgaccaccg gttcgggcag ccaggtgccg gtgtaggtct 3675781 cccgccggtg cgccgccgac ctcaacttgt ccagacccag ccggctcacc acggtggtca 3675841 gccaggcccg cgggtcggcg atcacggtgt cctgtgagtc ccagcgcagc caggcctcct 3675901 gcacgatgtc ctcagcatcg gcgaccgtgc cggtcagcct gtaggcgacc gacatgagat 3675961 gctgtcgcag tgcctcgaat tcggaaacct ccatcgaggt cattgcccga gcctagcgct 3676021 gcgctcgcca acacgacgac acgaaacctt tggttgcact tcgcccggca cggtgccggc 3676081 atccaacacc cggtcatcgt ccgcggcgac ggcgtcacca tcttcgacga ccgcggcaag 3676141 agctatctgg acgccttgtc cgggctgttc gtggtgcagg tcggttacgg ccgggccgaa 3676201 ctcgccgagg cggccgcgcg gcaagccggc acgttggggt atttcccgct ctgggggtat 3676261 gccaccccgc cggcgatcga gctcgccgag cgcctggccc gctacgcgcc cggggaccta 3676321 aaccgggtgt ttttcaccag cggcggcacc gaggccgtcg aaaccgcctg gaaggtggcc 3676381 aagcagtact tcaagctcac cggcaaaccg ggcaaacaca aggtcatttc acgctcgatc 3676441 gcctaccacg gcaccaccca gggcgcgctg gcgatcaccg gcctgccatt gttcaaggcg 3676501 ccattcgaac cgctgacgcc gggcggcttc cgggtgccca acaccaattt ctaccgagca 3676561 ccgttgcaca ccgacctcaa agagttcggg cgatgggctg ctgaccggat cgccgaggcc 3676621 atcgagttcg aaggccccga caccgtggcc gcggtgtttt tggagccggt gcagaacgcg 3676681 ggcggctgca tcccggcgcc gccgggttat ttcgaacggg tccgcgagat ctgtgaccgc 3676741 tacgacgtgc tgctggtctc cgacgaggtg atctgtgcgt tcggccggat cgggtcgatg 3676801 ttcgcctgtg aagacctcgg ctacgtgccc gacatgatca cctgcgccaa gggcctgacg 3676861 tcgggctact cgccgctggg cgcgatgatc gccagcgacc ggttgttcga accgttcaac 3676921 gacggcgaga cgatgttcgc acacggctac acgtttggcg gtcatccggt gtcggcggcc 3676981 gtcggcctgg ccaacctcga catcttcgag cgcgagggtc tcagcgatca cgtcaagcgg 3677041 aattcccccg cgctgcgggc caccctggag aaactgtacg acctgcccat cgtcggcgac 3677101 atccgcggcg aggggtattt cttcggcatc gaactggtca aagaccaggc gaccaagcaa 3677161 accttcaccg atgacgaacg cgcacgactg ctaggccagg tatccgcggc gctctttgag 3677221 gccgggctgt actgccgcac cgacgaccgc ggggaccccg tcgtccaggt ggctcccccg 3677281 ctgattagcg gacagcccga gttcgacacc atcgaaacca tcctgcgcag cgtgctcacc 3677341 gataccggac gcaaatatct tcatctgtaa ctttcgtccc gccagtcaca gcgcggctcc 3677401 tcgcggtcgg gccgccgatc acctactctg cacagacgat ggccttctta cgttcggtat 3677461 cgtgcctggc agcagccgtg tttgcggtag gcaccggaat tggtctacct accgcggccg 3677521 gcgaacccaa tgccgcaccg gcggcgtgcc cgtacaaggt atccacccca cccgccgtgg 3677581 actcgtcgga ggttcccgcg gccggtgaac ccccactgcc gctggtggta ccccccaccc 3677641 cggtcggcgg caacgcgctg ggcggctgcg gcatcatcac cgcccctggc agcgcgccag 3677701 cgcccggcga cgtctcagcc gaggcctggc tggtggcgga cctggacagc ggcgcggtga 3677761 tcgccgcccg ggatccgcac ggccggcacc gcccggccag cgtcatcaag gtgctggtgg 3677821 cgatggcgtc catcaacacg ctcaccctca acaagtcggt cgccggaacc gccgacgacg 3677881 cggcggtcga gggcaccaaa gtcggggtga acaccggtgg cacctacacc gtcaaccagc 3677941 tgctgcacgg gctgctgatg cactccggca acgacgctgc gtacgcgctg gccaggcagc 3678001 tcggcggcat gccggccgcg ctggagaaaa tcaatctgct ggccgccaag ctgggcggcc 3678061 gggacacccg agtggccacg ccgtccggac tggacgggcc cggcatgagc acgtcggcct 3678121 atgacatcgg cctgttctac cggtacgcgt ggcagaaccc ggtcttcgcc gacatcgtcg 3678181 cgacccgcac cttcgacttc ccggggcacg gcgaccatcc aggctacgag ttggagaacg 3678241 acaaccagct gctctacaac tatccgggcg cgctcggcgg caagaccggc tataccgacg 3678301 acgcggggca gaccttcgtg ggcgcggcca accgcgacgg ccggcggctg atgacggtgc 3678361 tgctgcacgg gacccggcag ccgatcccgc cgtgggagca ggcggcgcac ctgctcgact 3678421 acgggttcaa caccccggca ggcacccaga tcgggacact gatcgaaccc gacccgtcgc 3678481 tgatgtccac cgaccgcaat cccgccgacc ggcaacgagt cgacccccag gccgcggcgc 3678541 ggatatcggc cgccgacgcc cttccggtgc gggttggcgt ggccgtcatc ggcgccctga 3678601 tcgtgttcgg gttgatcatg gtcgcgcggg cgatgaaccg ccggccgcag cactagctgc 3678661 ttaccccgat accttcggcg tcgtttgcgg gcgggcatcc tagccggcct tggtcggcac 3678721 cgaaatcggg gcttgaccag cggttgaccg cgtgacgacg ctgtggcagc ctcatcgaaa 3678781 tgactacagc cctataccag gacgcggggt tcacgcccgc cggggcgccc gacgaccccg 3678841 accgcgtggt ggacgtgctg agcgccccgg taccggtcaa ctgaccagat cggggcgccg 3678901 ggcgctcctc gtcgggctca ccgccgccag cgtcggcgtc ctctacgggt acgacctttc 3678961 cgccatcgcg ggtgcgttgc tgtctctcag cgaggaattc gaactcacca ctcgagaaca 3679021 ggagttgctg accaccacgg cggtgctcgg ccagatcgcc ggggcgcttg gcggcggcat 3679081 cctcgccaac gcgatcggac gcaagaaatc ggtggtgctc atcgtcgccg gctacgcagt 3679141 gttcgccctg ctcggcgcga cctcggtgtc cgtaccgatg ctggtggtgg cgcgtctgct 3679201 gctgggtgtg acaatcggcc tgtcggtggt ggtggtgccg gtgtatgtgg ccgagtcggc 3679261 gccggcggcg gtgcgtgggt cgttggtgac cgcgtatcag ctggcgacgc ttagcggcat 3679321 cgtcgtcggt tacctggtcg gctacctgtt ggccggatcg cacggctggc gcgcgatgtt 3679381 cgggctggcc gccgcgccgg ccacgctgct gttgccgttg ttgtggcgca tgcccgatac 3679441 cgcccgctgg tatctgctca agggccggat cgccgacgcg cgtagcgcgc tgcggcggat 3679501 ccagccggag gccgacatcg atgccgagct ggccgatatg gcggccgcgg tcgacgaacg 3679561 cggcggcggt atcggcgaaa tggtgcggcg gccgtatctg cgggccacgc tgttcgtcat 3679621 cgcgctcggc ttcctcgtcc agatcaccgg gatcaacgcg atcatctact acagtccgcg 3679681 acttttcgcc gccatgggct tcgcgggcta tttcgcgatg cttgccctgc ccgcgatggt 3679741 gcaagtcgcc ggcttggcgg cggtgtgtgc ctcgctgttt ctggtcgatc ggctgggccg 3679801 tcgcccgatc ctgttgtccg gcatcgcgac gatgatcacc gcagatgccg tgctgatcac 3679861 cgtattcgcc aacgactccg atggtggcac ggggctggtg ttggggttcg ccggcgtgct 3679921 gctgttcatc atcgggttca acttcggatt cggctcgctg gtctgggtgt acgccgcgga 3679981 gagcttcccg tcccggctgc ggtcgatggg atcgagcctg atgctcacct cgacactgac 3680041 ggccaacgcg atcgttgccg ccttctcgct caccatgctg cgtgtgctcg gcggcgcagg 3680101 cgttttcgcg gtcttcggca cgttcgccgt cgtcgcgttc gtggtcgtgt accgctttgc 3680161 gccggagacc aagggccgca aactcgagga gatccggcac ttctgggaga acggcggccg 3680221 ctggcccgcc gagcggtcac cggcggcgga cgaaccgtga ccgtgctcgg cgccgacgcc 3680281 gtcgtcatcg acggccggat atgccggcca gggtgggtgc acaccgccga tggtcggatt 3680341 ctctccggtg gcgctggggc accgcccatg ccggccgacg cggaattccc cgatgcgatc 3680401 gtggtgcccg gctttgtcga tatgcatgtg cacggcgggg gcggcgcgtc gttcgccgac 3680461 ggcaacgccg cagacatcgc ccgtgcggcc gagtttcacc tgcggcacgg caccactacc 3680521 acgctggcca gtctggtcac cgcgggcccc gccgagttgc tctccgccgt gggcgctttg 3680581 gccgaggcaa ctcgggacgg cgtcgtcgcg ggcatccatc tggaggggcc gtggctgagc 3680641 ccagcgcggt gcggagcgca cgaccacacc cggatgcgtg ccccggatcc cgccgagatc 3680701 gagtcggtgc tcgccgccgc cgacggcgcc gtccggatgg tcacgttggc acccgagttg 3680761 cccggaagcg atgcggcgat ccggcgcttc cgtgacgccg aagtggttgt cgccgtgggg 3680821 catacggatg cgacctacac acagacccga cacgccatcg acctgggcgc gacagtcggc 3680881 acccacctgt tcaacgcgat gccgccgctg gaccatcggg cgcccggacc cgtgctggcg 3680941 ttgctgtgcg acccgcgggt gaccgtcgaa atcatcgccg acggcgtgca cgtgcacccc 3681001 gcggtggtgc acgcggtgat cgaagccgtc ggtcccgatc gggtcgccgt ggtcaccgac 3681061 gcgatcgccg cggccggatg cggcgatggc gcgttccggc tcggcacaat gccgatcgag 3681121 gtcgagtcga gcgtggcacg ggtggctggt gcgtcgacgc tggcgggcag caccaccacc 3681181 atggatcagc tcttccggac ggtggctggg ctcggctcga agtcggactc agccggcgat 3681241 gtggcgctgg ccgccgcggt gcaggtgacc tcggcgacgc cggcccgcgc tctcgggctc 3681301 accggggtgg gccggctggc ggcgggctat gccgccaatc ttgttgtgct ggaccgtgat 3681361 ctgcgggtga cggccgtcat ggtcaacgat gactggcggg tgggctgagc gtccgtggag 3681421 gcccgtcaca atgcccaggc tcgcaccgtg agtactcggt caacgttgac ggttgccccg 3681481 gcgacccggt cactctggcg agggctaccg gcgccgcgcg gcttgtaccg caatcatccg 3681541 atcgccgcga agcgctcggc agccggcttg ggcggtagcc gacgacacgg gtacggtctc 3681601 acggcgcgag cctgataaag cccggcggca tgggtcgtgc aggcgacggc tctaccggtc 3681661 cgtcaccacc gccgccacca ccgctgccgg cgccgccact gccggcagcg cccccggact 3681721 gcggaacacc agcaggcggc tcaacctctg gcggcggggg cggcggctgt tgcggcggcg 3681781 ctggtcgcgg tggcggcggt gccacgatcg gcgggggtgg aatcagggtc tgcgccgccg 3681841 gcggcggtac cggaatcggc ggcggattcg gtatcagggg atcccccgcg cgaaccgctc 3681901 cgagcaccga ggcaagcatc gcacccgtcg gttcccgcca tcccggcgac atgatggtca 3681961 tgtccgacac cgacgcccgc aggtcgcttc ccgagttgac cgcgctgcgc gtggacgccg 3682021 caacgcgatg cgtcggttca ttcgatcccg gctcgaaatt ggccatggcg aacgccatct 3682081 tgctgtgatg gttcgggcag tagatctcca ctgccgcact gataaatcgg gtcatggtcg 3682141 tcgtgaggcg gacagggtag aggcgcatga ccgggtctat gttgtaggca tcgttgcgta 3682201 acccgtccac aatgtcgttc accggcatgc cgccatcgag tttgcgacac actttgtggg 3682261 ccgcgtcgat gacgcgaggc acattcgcga cggcggggat ttcctttttc tcgagcagcg 3682321 ccagaaaccg atcgtcttgg tttgggtcgg ccgctgctgg gccgtcgtgc agaattgcgg 3682381 cgccgatcag caccactaag gcggcaccca gggcgccggc atggctagcg atgccggtga 3682441 acatgatggg gtttccgttc tgctaaaagc cgttacctgg cgggctttgg atcgcgatcc 3682501 acgccatagg tgtggctgtc tggtcaggtt tgaccggcgc catgatgtcg tttcacagcg 3682561 ccgatgcagt ctgggagggg accagggcat gggtgcattg aggagccaga tccagagaac 3682621 cacaccggag ccgctggccg aggctcatcc acaagccttc gatcccgctc ccgttgtcgg 3682681 catgggcgcc tgccgacgga atcagcggat ggtcatagtg gcgtcgggcg ccaggcctgc 3682741 gcgggcacac gcggtgcggt gtcgatggtt gttctcatct ggtaactcct ttccgcaggc 3682801 cgcaattcag cggtatgggc tcaccgagat caggctcgtc acgatcgccc gcactgctgg 3682861 cggctcacat gtacccagtg ttaaccttct agtgcactag aaggtcaagg ggagtcgcat 3682921 gaagatcagc gaggtagccg cgctcaccaa caccagcacc aagaccctcc gcttctacga 3682981 gaactcgggg ctgctgccgc cgcctgcacg cacagcatcg gggtatcgca actatggacc 3683041 cgagatcgtg gatcggctgc ggtttatcca tcggggccaa gcggccgggc tggcattaca 3683101 ggaagtacgc caaatcctgg ccatccacga ccgcggcgag gcgccgtgcg cacacgtccg 3683161 ccaactactg agcacccgca tcgacgaagt ccgcgcgcag atcgccgaac tgattgccct 3683221 cgaaggccac ttgcagaccc tgcttgacca cgcttcatat ggcccgccca ccgaacacga 3683281 ccactccacg gtgtgttgga tcctggaaag cgacctcgat gagcccaccg ccatcgaggt 3683341 cagcgacatt cacgcctaga ggtcgctggg tacgcgggct ggcccacggg ttttacgccg 3683401 aagccgtcgc cgcccacgcg gtggcgaaca ggatcagcca cgcggtgacg aacgcgaaca 3683461 ccatcaaccc cagcaccggc ccgaacaccg cgcccgccgg gctgcgcaac actatctgca 3683521 ggtagatcgc ccccacctgc ttgaacagct cgaagccgac cgccgccatc aacccggccc 3683581 gcgccgcggt gaccaaaccg accggctccc gcggcagccg gccaatcatc caggtgaaca 3683641 gcacccacga caccagcacc gataccagca ccgagatgcc ccgaaagatc tcgtcgaaca 3683701 ctgaaaactg gggtatttca agccatctca gtaccgcagc catcggcctg gcatggccga 3683761 gcacggtgag cgcgatggtg gccacgatca ccacgaacgt ccccaccatg gccgctagat 3683821 ccgacagttt ggtgcgcaag tagcccgccg gagcgactgg atgtgcccac atctggctca 3683881 acgcttcccg caggtgccac atccagccca ggcccaccca ggccgcggtc gccagaccga 3683941 tcaccccgac cgacgcgcgt gcatcgatcg ccgaattcat caggtcgacc agctgctgtc 3684001 ccaccgcacc ggagaccgag gtgcggatgc gctcctcgag cgtggtcagc agctccggac 3684061 gacgcgacaa cgcgaatcca cccaccccga aaccgaccat cagcaaagga aatatcgcaa 3684121 agatcgtgta gtaggtgagt ccggccgcaa aaagactgcc gttgcgatcg ttaaagcgcg 3684181 tgaacgcacg cacgacatgg tccaaccacc cgaaccgggc ccgcagccgg tcaagcaccc 3684241 ctggctcggc gagctcgccc atgatcgact gccctacccc cgttatagaa ggaacccgag 3684301 ccgatcgtag actcgctgaa ccgttttgct ggccacatcg tgggcgcgct gcgccccggc 3684361 ggcgagcacg gcctccagct ccgcgggatc tgcggtcaat tcgtcaactc tggcttggat 3684421 cgggttgacg aattcgacga cggcctcggc ggtgtctttc ttcaaatcgc cgtagccgtg 3684481 tccggcatag ccgtcgacga gaacgtcgat gtcggtcccg gtgaccgccg actggatgtt 3684541 caacaggtta gacacccctg gcttgacgtc cgggtcatag cggatgtcac gttcgctgtc 3684601 ggtcacggcg gagcgaatct tcttggcgga caatgccgga tcgtcgagca ggttgatcaa 3684661 accggcatcg gtgcccgccg atttgctcat ctttgacgtc gggtcttgta gatcgtagat 3684721 tttggcggtc atcttgggga tgagcacgtc gggaaccacc agggtgccgg ggaatcggct 3684781 gttgaaccgt tgcgcgacgt cgcgggccag ctcgaggtgc tgccgctgat cctccccgac 3684841 gggcaccagc tcggtgtcgt aggccaacac gtccgcggcc tgcagtaccg ggtaggtgaa 3684901 caggccgacg gtggtggcct cgctgccctg acgcgccgac ttgtctttga actgggtcat 3684961 ccgcgacgcc tggccaaagc cggtgaaaca acccagcacc cacgccagct gggtgtgagc 3685021 cggcacctga ctttgcacga agatggtggc gcggccggga tcgattccca acgccaggta 3685081 ttgcgcggcg gtaatcaggg tccggcgccg cagtgcctcg ggatcctgag ggatggtgat 3685141 cgcatgcagg tcgaccacgc agaagaacgc atcgtggtca tcctgcaagc caacccattg 3685201 ggcgacggcg cccaaggcat taccgaggtg aagcgagtca gacgtgggct gcacgccgga 3685261 gaagatccgg cgggacccgg taggggtgct catgatgccc cgatcctttc acgcggggtg 3685321 ccctccccgt cgaccaccgg tcaccacgct gcttgcggta ccggcggtac cggctttagt 3685381 gtcggctcta tgcgcagtcc gatacgcgtg ggttcgggag agccggtcct actgctacac 3685441 ccgttcttga tgtcccaaac ggtgtgggag aaggtcgccc agcagctggc cgacaccggc 3685501 cgcttcgagg tatttgcccc cacgatggcc ggccacaacg gcggaccggc ctcgggcacc 3685561 cggtttttgt cctcggcggt gctggccgac cacgtcgaac gccagctcga cgaactgggc 3685621 tgggaaacca gccatatcgt cggcaactcg ttgggcggct gggtcgcgtt cgaactcgaa 3685681 cgacgtggcc gggcacgcag cgtgaccggt atcgccccgg cgggcggttg gacccgctgg 3685741 agtccggtca agttcgaagt gatcgctaag ttcatcgcag gggcgccgat cttggccgtc 3685801 gcccacattc ttggccaacg ggcgcttcgg ctgccgttca gccgcctgct ggccaccctg 3685861 ccgatcagcg ccacaccgga cggcgtgagc gagcgcgagc tgtccggcat catcgacgac 3685921 gccgcgcact gcccggccta ttttcagctg ctggtcaagg cgctggtgct gcccgggctg 3685981 caggagttgg aacacaccgc cgtgccctcg cacgtggtgc tgtgcgagca ggaccgggtg 3686041 gtccctccca gcaggttcag ccgtcatttc accgactcac tgccggcggg ccaccggctc 3686101 accgtgctcg acggcgtcgg tcacgttccg atgttcgagg ctccggggcg catcactgag 3686161 ctgatcacca gcttcatcga agagtgctgc ccgcatgtcc gggccagtta gcgggcgcga 3686221 gcagacgcaa aatcgcccat ttcggcacga aattgggcga ttttgcgtct gctcgcccta 3686281 attggccagc tccttttcca ggttgtcggc gatcgcatcg aggaattcct cgctattcag 3686341 ccagtcctgc tccggaccga tgaggatcgc gaggtccttg gtcatcttcc cgctctccac 3686401 cgtggcgatg acgacggact ccagcttgtg ggcgaagtcg atgacttcgg gagtgccatc 3686461 cagcttgccg cgatgctgta atccgcgggt ccaggcaaag atcgacgcga tcgggtttgt 3686521 tgaggtcggt ttaccggcct gatactgccg gtaatgccgg gtgacggtgc cgtgggcggc 3686581 ttcggcctcg actgtcttgc cgtcggccgt catcagcacc gacgtcatca ggcccagcga 3686641 gccgtagccc tgtgcgacgg tgtccgactg cacgtcgccg tcgtagttct tgcacgccca 3686701 gacgtaaccg ccttcccatt tcaggcaggc ggcgaccatg tcgtcgatca accgatgctc 3686761 gtaggtcagc cccgccgctt cgaactgcgc cttgaattcc tcttcgtaga cgcgctcgaa 3686821 ctcgtctttg aacatcccgt cgtaggcctt gaggatggtg ttcttggtgg acagatatac 3686881 cggccatttc gcgttgaggc cgtaggagaa cgacgcgcgc gcgaaatccc ggatggattc 3686941 cttgaagttg tacatcccca gcacgacgcc gccgtcctcg gggatggaca ccatttcgtg 3687001 cacgatcggc gcgctgccgt cggcgggcgt gaaagtcagt gtgacggtgc ccggttggtc 3687061 gaccttgaag ttcgtcgccc gatattggtc accaaaagcg tgccggccga tgacgatcgg 3687121 cttggtccac cccggaacca gtcgcggcac attagaaatc acgataggtt cgcgaaagat 3687181 tgtgccgccc aagatgttcc ggattgtccc attgggcgac agccacatct tcttcaggtt 3687241 gaattcctcg acacgggcct cgtcgggggt gatcgtcgcg cactttacgc ccacaccgtg 3687301 tttcttgatc gcatacgccg cgtcgatcgt cacctggtcg tcggtggcgt cgcggtgctc 3687361 gatgcccaag tcgtaatagt ccaagcggat gtcgagatag ggaaggataa gcatgtcctt 3687421 gatgagcttc cagatgacac gggtcatctc gtcaccgtcg agctctacga ccggaccgct 3687481 gacttttatc ttgggtgcgt tggacatggg agtccacatc agattactag cagcccgcgc 3687541 gggcccctag cggccggtaa agggccagtt gagaccgccg gagttgtgct ttgagttggc 3687601 actgagtagc tgccatgcgc taggcttcga gtcggtcatg agcgccagcg tcaagccccg 3687661 gcttgctggc cggcaaccct ccaaccgcgg tggggtgccc cgggtgatga ccaggttgag 3687721 tagccatcgc cggctgcgcg gcaagcgcgg gtccgccatg acgggcccct gaccagacgg 3687781 ggaaagctca tgagcgccga cagcaatagc accgacgccg atccgaccgc gcattggtcg 3687841 ttcgaaacca aacagataca cgctggtcag caccctgatc cgaccaccaa cgcccgggct 3687901 ctgccgatct atgcgaccac gtcgtacacc ttcgacgaca ccgcgcacgc cgccgccctg 3687961 ttcggactgg aaattccggg caatatctac acccggatcg gcaaccccac caccgacgtc 3688021 gtcgagcagc gcatcgccgc gctcgagggc ggtgtggccg cgctgttcct gtcgtcgggg 3688081 caggccgcgg agacgttcgc catcttgaac ctggccggcg cgggcgatca catcgtgtcc 3688141 agcccgcgcc tgtacggcgg cacctacaac ctgttccact attcgctggc caagctcggc 3688201 atcgaggtca gcttcgtcga cgatccggac gatctggaca cctggcaggc ggcggtacgg 3688261 cccaacacca aggcgttctt cgccgagacc atctccaacc cgcagatcga cctgctggac 3688321 accccggcgg tttccgaggt cgcccatcgc aacggggtgc cgttgatcgt cgacaacacc 3688381 atcgccacgc catacctgat ccaaccgttg gcccagggcg ccgacatcgt cgtgcattcg 3688441 gccaccaagt acctgggcgg gcacggtgcc gccatcgcgg gtgtgatcgt cgacggcggc 3688501 aacttcgatt ggacccaggg ccgcttcccc ggcttcacca cccccgaccc cagctaccac 3688561 ggcgtggtgt tcgccgagct gggtccaccg gcgtttgcgc tcaaagctcg agtgcagctg 3688621 ctccgtgact acggctcggc ggcttcgccg ttcaacgcgt tcttggtggc gcagggtctg 3688681 gaaacgctga gcctgcggat cgagcggcac gtcgccaacg cgcagcgcgt cgccgagttc 3688741 ctggccgccc gcgacgacgt gctttcggtc aactatgcgg ggctgccctc ctcgccctgg 3688801 catgagcggg ccaagaggct ggcgcccaag ggaaccgggg ccgtgctgtc cttcgagttg 3688861 gccggcggca tcgaggccgg caaggcattc gtgaacgcgt tgaagctgca cagccacgtc 3688921 gccaacatcg gtgacgtgcg ctcgctggtg atccacccgg catcgaccac tcatgcccag 3688981 ctgagcccgg ccgagcagct ggcgaccggg gtcagcccgg gcctggtgcg tttggctgtg 3689041 ggcatcgaag gtatcgacga tatcctggcc gacctggagc ttggctttgc cgcggcccgc 3689101 agattcagcg ccgacccgca gtccgtggcg gcgttctgag gaattctgac atgacgatct 3689161 ccgatgtacc cacccagacg ctgcccgccg aaggcgaaat cggcctgata gacgtcggct 3689221 cgctgcaact ggaaagcggg gcggtgatcg acgatgtctg tatcgccgtg caacgctggg 3689281 gcaaattgtc gcccgcacgg gacaacgtgg tggtggtctt gcacgcgctc accggcgact 3689341 cgcacatcac tggacccgcc ggacccggcc accccacccc cggctggtgg gacggggtgg 3689401 ccgggccggg tgcgccgatt gacaccaccc gctggtgcgc ggtagctacc aatgtgctcg 3689461 gcggctgccg cggctccacc gggcccagct cgcttgcccg cgacggaaag ccttggggct 3689521 caagatttcc gctgatctcg atacgtgacc aggtgcaggc ggacgtcgcg gcgctggccg 3689581 cgctgggcat caccgaggtc gccgccgtcg tcggcggctc catgggcggc gcccgggccc 3689641 tggaatgggt ggtcggctac ccggatcggg tccgagccgg attgctgctg gcggtcggtg 3689701 cgcgtgccac cgcagaccag atcggcacgc agacaacgca aatcgcggcc atcaaagccg 3689761 acccggactg gcagagcggc gactaccacg agacggggag ggcaccagac gccgggctgc 3689821 gactcgcccg ccgcttcgcg cacctcacct accgcggcga gatcgagctc gacacccggt 3689881 tcgccaacca caaccagggc aacgaggatc cgacggccgg cgggcgctac gcggtgcaaa 3689941 gttatctgga acaccaagga gacaaactgt tatcccggtt cgacgccggc agctacgtga 3690001 ttctcaccga ggcgctcaac agccacgacg tcggccgcgg ccgcggcggg gtctccgcgg 3690061 ctctgcgcgc ctgcccggtg ccggtggtgg tgggcggcat cacctccgac cggctctacc 3690121 cgctgcgcct gcagcaggag ctggccgacc tgctgccggg ctgcgccggg ctgcgagtcg 3690181 tcgagtcggt ctacggacac gacggcttcc tggtggaaac cgaggccgtg ggcgaattga 3690241 tccgccagac actgggattg gctgatcgtg aaggcgcgtg tcggcggtga cgtgctcccg 3690301 acgcgacatg tccctgtcgt ttggctccgc ggtcggcgcc tacgagcgcg ggcgcccctc 3690361 gtatccaccg gaagccatcg actggctgct gccggccgcc gcccgccgcg tgctcgacct 3690421 gggagcgggc accggcaagc tgaccacccg gctagtcgag cgcggcctgg acgtggttgc 3690481 cgtcgacccg atcccggaga tgctggacgt gctgcgtgct gcgctgccgc aaaccgtcgc 3690541 gctgctgggc accgccgaag agattccgtt ggacgacaac agcgttgacg cggtgttggt 3690601 ggctcaggcg tggcactggg tggatcccgc ccgggcgatt ccggaggtcg cccgggtgtt 3690661 gcgtccgggc gggcggctcg gcctggtgtg gaacacccgc gacgaacggc tgggctgggt 3690721 gcgcgagctg ggtgagatca tcggtcgcga cggcgatccg gtgcgcgaca gggtgacgct 3690781 gcccgagccg ttcactacgg tgcagcgcca tcaggtcgag tggacgaatt acctgacacc 3690841 acaagccctt atcgacctgg tggcttcgcg cagctattgc atcacctcac cggcgcaggt 3690901 ccgcaccaaa acgctcgacc gggtgcggca gttgctggcc acccatccgg cgctggcgaa 3690961 tagcaacggc ctggcgctgc cctacgtcac ggtctgtgtg cgggcgactc tggcctgacg 3691021 ccgcctttag ggcccggtgc cggtgtaaat caggcccgcc agttgctggc cgacgttgcc 3691081 gaagccggag accagggccg aggtgatcag gcccagcgcg ccggtgttgt acacacccga 3691141 gatgtccgcg ccgcggttga ggatgccgga gagttgggtg ccgaagttgg cgaagcccga 3691201 cgccgatccg agcagcggat ccgagatcgc gttgagcacg cccgacatgc ccgcgccgag 3691261 gttgtggaag cccgacaacc cgccgccacc gccgatgttg aagaaccccg acgacgggac 3691321 cgcggtggtg ttgccgaatc ccgggacggg cgggatgacc aacccggcgt tgatggggcc 3691381 gagcagcgcg ttgacgtcga gaaccactgg gattcggtcg atggtgatct ccagagggaa 3691441 ggcgaaggcg ggggtggcgc cggacaacgc gaggcccagc gggagttggg gaatggtgat 3691501 ttccgggctc acgaagggtc cgatggtgac ggacaggggc agctcgacat ggattggatc 3691561 gacgggtatg tggaatcccg ggatggtgat ttccggtgtt agatgggtca cgccaagcga 3691621 actcagcagc acggtgaatg gcagaatctc gctgggcgcc gtttggatgg cggggacatt 3691681 aacgttgatg aaccccagca gcgtaaggct gaatggatcg atgatggagc ctgagctgaa 3691741 tatcgggccc acggtgacac cggttgcggg gtcgagtccc agggcgggaa tcgtgatgtc 3691801 ctggacggtg atggggccga ggtcgaagac tgggtcgatg cgaaccgtga tcggggaaat 3691861 ggacaccggc gggatggtga agccgccgat gtggccggtt gcgctgaggt ccaagggaat 3691921 tgccggaaat tggatcgacg gaacgatgat gggtccggcg ccgccggacg cgtggatgtt 3691981 cgcgacagtg aattcgggaa tgatggtgct ggtgtaggag aagccgagca ggccctggta 3692041 gtcgccccgc cagaaggcgc cgttgctgta gttgccggag atgaaggcgc cggtgttgac 3692101 gtcgccggag ttggccaccc cggtgttggt gtcaccggtg ttcaaccaac ccgtgttgac 3692161 actgcccggg ttgaaaccgc ccgtattggc ctgccccgcg ttgaaactgc cggtgttgta 3692221 gctacccgca ttgaccacac ccgtattgaa cccacccgcg ttgaacaacc ccgtgctggc 3692281 aatccccgaa ttaccgatcc cggtattata actccccgaa ttgaacaccc cccagttccc 3692341 ggtgccagag ttaaagaacc ccacattacc ggtccccgaa ttaaacaacc ccacattccc 3692401 gctaccggta ttgaaaccac cgaacccggt cagattatca ccggtcaacc caataccgaa 3692461 attcccactg ccggtgttag cgaacccaat attgcccaca cccatattcg ccaaaccgaa 3692521 attgtagctg ccggcattac caaacccgat attacccaaa cccatcagac ccggcgttaa 3692581 ccccgaattc ccgagcccaa agttgcccca cccgacattg cccaacccga cattgttgcc 3692641 gccgatattg ccgccaccca cattgaaccc accgacgttg cccgcaccca ggttaaagtc 3692701 cccgacattg cccaacccga cattgcccaa ccccacatcg gccaacccga aattgaggac 3692761 cagaccctga tgcagcgccg tcccgctcgc caacaatccc gacaactgct gaccgacact 3692821 acccaaaccc gacaccaacg ccggcgcacc caaccccaac acgctggtgt tgaacagccc 3692881 cgacatgcca gagccgaaat tcagcacacc cgaatgcagc gtgccggcgt tgaaaacacc 3692941 cgaaccccca cccagcaacg ccgacggagc ctgattccag ccacccgaca ccatcgcgcc 3693001 gacattccca aaccccgaca ccccacccgc accggagttg aagaaacccg acgacggagc 3693061 accggtcgta ttcccgaacc ccggcacggc gggaaggtcg atgaggatgt gaacggggcc 3693121 gagcgtgctg tgggccacga ggtcaaaggg gatttcgccg atggtgattg ccggaatggt 3693181 gacggcgccg gtgccaccgg acaggttgat gctcagcggg ttcatcgcgg ggatcgtgag 3693241 gccgcccggg aagatgtcga cgggctcgct gtggccggta atgctggcca gcagcgggat 3693301 ctcgtcaatg gtgacgacgg gggtgctgaa cggcaggttg gccaggaaag ccgtgatggt 3693361 cccttgcgac gagctagcac cgatgactat ctggcttaac gccagggggg taaggccgat 3693421 gggggtgttg aagagtcccg taatcggacc gattttcagg ggcccgccgg gttgtgagcc 3693481 aaacaagtaa ttcagcgtga cgggcacccg tggaatatcg aggtgcggga cggtgatggg 3693541 gccgaggccg acgctgaccg tggtggcggc caggtcgatc tggggaatcg ggatgctcgg 3693601 cacagtgaag ctgtcgatgg cgacgttggc gctgaactcg gggcggatcg cgggaatgtc 3693661 gatggcgggg ataacgacgg agcccagtcc gccggtgagg gtgaggtcca ggaacggcgt 3693721 ttggggaagc acggcggggc ggtaggagaa gccgagcagg ccctggtagt cgccccgcca 3693781 caagacgccg ttgctgtagt taccggagat gaaggcgccg gtgttgacgt cgccggagtt 3693841 ggccaccccg gtgttggtgt caccggtgtt caaccaaccc gtgttgacac tgcccgggtt 3693901 gaaaccgccg gtattggcct gccccgcgtt gaaactgccg gtgttgtagc tacccgcatt 3693961 gaccacaccc gtattgaacc cacccgcgtt gaacaacccc gtgctggcaa tccccgaatt 3694021 accgatcccg gtgttatagc tccccgaatt gaacaccccc cagttcccgg tgccggagtt 3694081 aaagaacccc acattaccgg tccccgaatt aaacaacccc acattcccgc taccggtatt 3694141 gaaaccaccg aacccggtca gattatcacc ggtcaaccca ataccgaaat tcccactgcc 3694201 ggtgttagcg aacccaatat tgcccacacc catattcgcc aaaccgaaat tgtagctgcc 3694261 ggcattacca aacccgatat tacccagacc catcagaccc ggcgttaacc ccgaattcgc 3694321 caacccgaca ttgccaaacc cgacattgcc caacccgaca ttgttgccac cgatattgcc 3694381 gccgcccacg ttgtagctcc cgacgttgcc ggcccccacg ttgtagctgc cgacgttgcc 3694441 gcttcccgcg ttgaagaggc caacgttggc caaacccaga ttgacggcga gcgacttggc 3694501 cggctcggcg gcggccgcca ggcttgccag cggcgagcca aacggcgcca acgcctcggc 3694561 cgccgccgag gcgccggtgt ggtaccccag catcgcggcc acgtcctggg cccacatcag 3694621 ctcgtagtcg aactccgcgg ccgcgatcgc cggcgtgttc tggccgaaca gattcgataa 3694681 cgccagcgac actaacctcg accgattggc cgcgatgacg aaggggtcca ccgtcgcggc 3694741 caacgccgcc tcgaacacac ccaccaccgc ccgggcctgc ccggccgccg actcggcgga 3694801 ggccgccgcc gcgctcaacc accccgcata cggggcggcc gccgccgcca tcgcgaccga 3694861 ggacggcccc tgccagatac caccgaccag ccccgaggtc accgacccga aagccgccgc 3694921 cgccgagccc agctcggcgg ccagctcatc ccaggccgcg gccgccgcca acagggggcc 3694981 cggacccgcc ccggtatata tcagcaggga gttgatctct ggcggcatta cgacaaaact 3695041 catgccgcca gccctttccc gtgcgttccc aacatcgctg tcaaccggtg atcagggtgt 3695101 tgcgccggcg ccgccgaggc cgccgtcgcc gccgaaccct ggctccgtgc ctgagttggg 3695161 ctggccggcc tgccctttgc cgccggcgcc gccggccttg gcgccgctgt tgccgccgtt 3695221 gccgccgtca ccgccgtcac cgccgtcacc gccgaggccg gtcgcgctct gagtgccgcc 3695281 gccaatgccg ccctggccac ccttaccgcc gttgccaccg aagccgccgt ccggggcgtt 3695341 gcctccgcca ccgcccgcgc cgccaaggcc gccgttgccg ccggtggagc cgccgccatt 3695401 gccgccctgc ccaccgaggc cgccctggcc gccggcaccg gcaaagacgc cgtcgccgcc 3695461 ccggccgccg acaccgccgt tgccgccgcc accggccacg gtgccgacgg taccgccgcc 3695521 gttggggccg ccctgaccgc cgtcgccgcc gaagccgccc ttgccgccga aaaagccgct 3695581 gccgccggcg ccgccggcgc cgccgccacc gccgctgccg ccttgggtga cggagctgtt 3695641 gccgccgacg ccgtcaccgc cgtggccacc gtcgccgccc ttgccgccct cgccggagct 3695701 aaggctgccg tttccgccgg cgccgccagc gccaccggcc ccaccggaac cgccgacgat 3695761 gccgctgttg gcgccgatcg agcccccgtt gccgccggca ccgccgttgc cgcccttgcc 3695821 gccgtcgcca cctgagccgt tggggttgct gccaccggcg ccgcccttgc cgccgttgcc 3695881 gccgggggcg cccgtgaccc cgatggaggc ggggccgctg gtagcgccga agctcccatc 3695941 accgccattg ccaccggcgc cgcccttgcc gcctgagccg gtggcgttac ccccggcgcc 3696001 accgttgccg ccggagccgc cggcgccgcc gcggctgccg ctgcccgggt tggtggcagg 3696061 cccaccgtgg tcaccgttgc ccccgtcgcc gcccttgccg ccaagcacga cgccggtgcc 3696121 gccggcgccg ccgttgccgc cgttgccgcc ggcgccgccg ccaatgccgc tgccgctgcc 3696181 cccggtgcca ccgaacccac cctggccacc tgcgccgccg gcgccgcccg tgtcgccgct 3696241 gccgccggcg ccgccgtggc cgccgttacc ggcgttgcca ccgcgagcgt tgccgttgct 3696301 ggaaccgccg ttggcgccag cgccgccctt gccgcccgcg ccgccggtgg agccagggcc 3696361 gacaccgtcg ccgcccttgc cgccattgcc gcctgagccg gcgttgccgg catcgccacc 3696421 gccaccgttg ccgccggcac cgccgttgcc accggcacca ccggcgccgc cgttgccggc 3696481 cgagccagcg ccgccgttgc caccggcacc accgctgccg ccgtggccgc cggactggcc 3696541 tgtgctcagg ctgcccccgc cagcaccggc gccgccgttg ccgccggccg cgccggcgcc 3696601 gcccgtggtg ccgctgccac cgctgccgcc gctgccgccg ttgccgccgc tgccgccgtg 3696661 gccggcggcg ctggaagtgc cgccgccgtt gccgccggcg ccggcggcac caccggccaa 3696721 gcccgcgacg ccggtgctgt tgccggagtt gccgccgttg ccgccgttgc cgccgtggcc 3696781 ggcggtgacg ttgacgacgc ctgagccgct ggcggcaccg ctgctgccgt tgccgccctt 3696841 gccgccggcg ccgcccgtcg tgccgtcgcc gccgtggccg ccgttgccgc cgttgccgcc 3696901 gtcgccgccc acagcgttgc cgaaggacac gccggcgaca cccgcgttgc cgccggcccc 3696961 gccagcaccg cccgcgccgt tgaggccagt gcccccatta ccgccggcac caccggagcc 3697021 ggcgttgccg gtggtcgtgc ttttgctgct accgccgtta ccgccagcgc caccggcccc 3697081 tccggcaccg cccgcgtcgg tgccgatacc gccattgccg cccgcgccgc cgaagccgcc 3697141 gttgccgccc tggccgccta aggaaatgcc gccaccgccg tcgccgccgc taccgccgtt 3697201 gccgcctgtg cgcccttccc cgccgatgcc gccctggccg ccgaagccgc cgaccccgcc 3697261 ggcaccgccg tccccgccgg cgccgccgac accgccaaca ccgctagcaa agtcgcccgc 3697321 gccgccggga ccgccggcgc cgcctgggcc acccaacccg gtgctagcga agccgccggc 3697381 accgccattg ccgccagcgc cgcccgttgt cgcggcgacg tcaacggcgc cgccaccgcc 3697441 ggcgccgccg aagccgccga ggccgccgtt gatcatgccg gcaccgccat tgccgccgtt 3697501 accgcctttg ccgcccgtgc cgaagaagcc ggcctggttc agcgccccac cgccgttgcc 3697561 gccgttgccg gcgtcaccgc cgttgaggcc ggagccgccg ttgccgccgt tgccgccggc 3697621 cgcgccgctc ccgttgccgg cggtgccgcc cttgccgccg ttgccgccat tgccgccgtt 3697681 accgccgttg ggggtgatgc cgtcggtgcc gtccaagccc gtcaaggagc cggtgccggc 3697741 cttgcctccg gtgccgccga cgccggcgtt gccgccgttg ccgccgttgc cgccggtacc 3697801 ggggtttcct acggtgccgc cgcccggcag catggccccg ctgtttaggc cgtttgcgcc 3697861 ggccccgccg tcaccggctt tgccgccatc gccgccgttg ccgccgtcgc cgccggtgcc 3697921 cgtggcgccg tcggtgtacc cggccgcctg cgccttgccg cccgcgccgc cattgccgcc 3697981 ggcgccgccg tcgccaccgt taccaccgct accgccgttc tcgccgtttg cgccgttagc 3698041 attggggccg gcgccgtcgg cgcctctctc gccggcgccg ccgatgccac cctggccgcc 3698101 gttaacaccc ttaccaccgt tgccgccgtg gccggccagt gttccgccgg cgccgcccgc 3698161 cccgccgttg ccgccagccc caccgtcggt gcccgaggtg ccggaatcac cgctggtagg 3698221 gcccggcgta ccggcttggc cggccgcgcc gttgccgccg gccccgccat tgccgccatt 3698281 gccgacattc ccgccgctgc cgcccttgcc gccgtcaccg ccgttgccgc ccgcgacggt 3698341 ggggctggcg ccgttgccgc cgttgccgcc gtcaccgccg ctggtgggtg cggtgccatc 3698401 ggcgccggtc gcacccttca tggctggaat ggcgcccttg ccgccggccc caccctggcc 3698461 ggcaacgccc acattgccgc cgttgccgcc ggcaccgccg gcgccctggt tgatggccaa 3698521 ggtcacgtca ccggcggcgc cgccgccgcc attaccggcg gcgccgccgt tgccgccggc 3698581 accgccgttg ccggccttag cgaacgtggc gaaggcgtca ccacccttgc cgccgatgcc 3698641 gccgttgccg ccgttgccgc cctgtccgcc attcgcgcca ttggcggacg cggagaagtc 3698701 ttggccgttg gctccggcgc ccccgttgcc gcccttgccg ccgtccccgc ccgtgccggc 3698761 cgccgatccg ccgttgccgc cgatgccgcc gttgccgccg ttgagggcaa ggccggtgcc 3698821 ggcgacgcca tttccgccgg caccacccgc accgccgtta ccgaccgacc cgccatggcc 3698881 gccgttacca ccggcgccgc cgttttctcc cgcgacggtg ggggtggcgc cggcacctcc 3698941 gttgccaccg ttgccgccgc tggtgggcgc ggtgccgttc gccccggccg aaccgttcag 3699001 ggccgggttc gcgctaacac cgccggcccc acccttgccg ccaacgccca cttcaccgcc 3699061 gttgccgccg tcaccgccgg caccctggtt gacggccaag gtcacatcac cggcggcacc 3699121 ggctccgcca tcaccggcct tgccgccgtc accgccggcg ccgccgtccc cggccttgcc 3699181 ggcggtgccg gcgaccgcgg tgccgccggc gccgccacca ccgccgtcac cgcccttgcc 3699241 gccgttgccg cccataccgc catcggcacc gggcgaaccc aaggtggcgg cgtcgaatcc 3699301 gtttccgccg gcgccgccgc taccgccggc accgcccttg ccgccgacgc cgccgtcgcc 3699361 gtgctgggcg ccgccatttc cgccattacc gccgtggccc ccggcgccgc cattggtgcc 3699421 gttaccgccc gtcggttgta aggcggtacc ggtagcgccg gtggaacccg catgaccggc 3699481 accgccggcg ccgccggtgc cgccgttgcc gaccaacccg ccatgaccgc cattaccgcc 3699541 ggccccgccg gcttgtaggg gtgagttggc ggtggcgccg atgccgccat cgccgccgtt 3699601 gccgccgctg gtgggggtgg cgccggcggc accgtgcgca cccgccagca ggccgccggc 3699661 cccaccggcc ccgcccacgc cggggttgcc gccgtgaccg ccgttaccgc cggcaccgtt 3699721 gttgacggcg aaactcggat cgccagcgcc gcccttacca ccgtcgccgc cgacgccgcc 3699781 ggccccgccg gccccgccgt tgccaaccaa taacccgccg cgcccgccgt tgccgccggt 3699841 tccgccgttg ccgccgtcgc tgccgtcgcc gccgttgagg ccggcggcac ccggcaggcc 3699901 cgcggccccg gccccccggc gccgccgttc ccgaacagcc cggcgtcgcc accgttgccg 3699961 cctatacctc cgatgccgcc gatcccgccg gcgccgccgt tgccgtagac aaatccgccg 3700021 gacccgccga cgccaccatt ggtgccggcg ccgccggacc cgccggcccc gaacaaccag 3700081 gcgttgccgc cggcaccacc gttagcgccg gtcccgccgg ccccgccggc cccgccgttg 3700141 ccgttcaacc acccgccgga tccgccgaca ccgccggcag cgccggcccc gccggacccg 3700201 ccggacccgc cgttgccgaa caacccggcc gcgccgccgg gcccaccgac ttgaccggcc 3700261 gcccccgaac cgccgttacc gccattaccc cacaacaacc ccccggcccc accgggctgc 3700321 ccggtccccg gcgccccgtg aacgccatca ccgatcagcg ggcgccccaa ccacagctgt 3700381 gtgggcgcgt tgatcgcacc caacacttgc tgctccagcg cctgcagcgg tgatgcattc 3700441 gccgcctcgg cagtcgcata cgcgctgcca gccgcagtca gcgagcgcac aaactgctca 3700501 tgaaacgtcg ccacccgggc gctcaacgcc tggtactcct gcgcgtgggt accaaacaac 3700561 gccgcgatcg ccgccgacac ctcatcaccg gcggccgcca acacctgcgt cgtcgggccc 3700621 gctgccgccg cattcgccgc gctgatggcc tgcccaatcc cggtcaagtc cgccgcggcc 3700681 gccgccacca gctccggcgc caccatcagc gacatgacca ttcctccaac accaatggcg 3700741 cgtacagccg gctcgcgcga gccttgaccg ccggcggcaa cccgagcgat cccatggccc 3700801 taggcggttc tcgggcgaac gccacgttta gcggatcgat tcacccggtc gttgcgttgc 3700861 ggcgcagcaa tagacatctc gaagcactcc ggctgccaat ctcgtcgcgt ttattctgct 3700921 cgtgaccagc gcaggaaagg gggggattac gaaagtcttc gggatctcag tgcacagtgc 3700981 acacatgttt aaccaatcac cgtggcataa cgcacaccaa aggccgagag cgcggaaaac 3701041 gcagaacatc aattggatcg gttgctagct ttgccgcacc gtggtcagcc gcgccaggat 3701101 cggtcggcaa tggcaccacc ggagcaggcg aaaggtaccc ggttctagcc cgtccccaac 3701161 gggtcaatgg tggatgcgat atagaccatg gccgccgcga ccgtcacggt cgtcacgaaa 3701221 tcgatcccct tgctgcgcac caccaacagg ccggcccgtt cctcggacaa caccaaccgc 3701281 agcaccgccg ccaccccaac gccgataccg atcagcagcg caccacggcg ccagaagttg 3701341 acccccgcca ggatcggcca ctgggcgcca acagtgcgcc gcaaaacggc cctcacggtc 3701401 atcgccgctc agccagctcc acgacacttg tcagcaagga cgcccggggc gaagggcgtt 3701461 cgccaagtct gtagatgagc tgcgggagat ggccggcggc gagggttgag aagcgtcaac 3701521 ttcgatcgtg atgcctggga ggacttctta tttcatacgc gatcggtgat gccgccctga 3701581 agccgaggtc gacggcagcg cggagacgtt cgagaagacg tcgcggtgag gtcaatcccg 3701641 gtgtgaccaa cggccggtta cggcccggtg cccgcgaaca gcaggcccga cagctgctgg 3701701 ccgacgttca taaagcccga gacgaaggcc gatgtgacca ggccaagcgt gcccgtgttg 3701761 tacacgcccg agatgcccgc gccacggttg aggatgccgg agagctgggt gccgaaattg 3701821 gcgaagcccg acgccgaccc gagcagcgga tccgagatcg cgttgagcac ccccgacatg 3701881 cccgacccgg agttggagaa gccggacccg ccaccaccgc cggtgttgaa gaagcccgac 3701941 gacggcgcgg tggtgtcgtt gccaaagccc ggtgctccgc cgaacccgaa aatcgggagg 3702001 ctgacggggc cgatggtggt gctggcgtgt aactccaccg ggatccggtc gataacgacc 3702061 gtcgggagat caaagggtgg ggtgccgccg gacaaaccga ggcccagcgg gagttgggga 3702121 atcagggtgc cgcccgggat ggtgaagccc ggaatggtca gcgacagcgg caggccgatg 3702181 tggatgggtc cggtgggaat ggtgaatccg gggaagtgca gtgtcgtcgg gttcaagttg 3702241 atgggtgcca cggtgaatgg ttgaagtatg gagacctcgc ccccgggcat gccgtcgggt 3702301 ccgaccgcga agaatgaaaa gctgggtctg accttgaatc cggagctgct tccggacgtc 3702361 atcctgatct ccgagacggc agcatccaaa cttaggccag ggatggtgag ggtgatgggg 3702421 tccacggtga tagggccgac gtcgaaggtg ggatcgatgc ccaggtggat cgaggggatg 3702481 gcgatgttcg ggatgctgat cggcccgatg tggccgatcg cggcgaagcc caacgggatg 3702541 gacgggatgt ggatgggcgg aatgatggtg gcggggccga tgtcgccggt gacgtcggcg 3702601 cccaccgcgg ggaacagcgg aatggggtac ccgaaggaga agccggccaa gccctcgtaa 3702661 ttgccccgcc ataagatgcc gttgctaaag ttgcccgtga tgagggcgcc ggtgttgaca 3702721 ttgcccgcgt tggcgacgcc ggtgttggcg ttaccggtgt tgaaccagcc ggtgttggtg 3702781 ctgcctgggt tgaagccacc ggtgttggtg tcaccagcat tgaagctgcc cgtgttgtac 3702841 gacccggcgt ttgccacacc ggtgttgaag ccgccggcgt tgaccaaccc ggtgctggcc 3702901 accccggagt tgccgatacc ggtgttgtag ctgccggagt tgaacaaccc gaagttggca 3702961 gtcccggagt tgaagaagcc gatattgcct gtgccggagt tgaacaggcc aatgttgcca 3703021 gtgccggagt tcaagccgcc gatgccggac tggttgtcgc cggtgagccc gatcccgagg 3703081 ttgttggtgc cggtgttgcc aaacccgatg ttgcccaggc ccatgttggc ccagccgacg 3703141 ttgccgctgc cggcgttgcc cagcccgata ttgcccatgc cggccaggcc cgccgccaga 3703201 cccgaattcc cgaacccgaa gttggcatcg ccgatattgc cgaacccgac gttgccgccg 3703261 ccgatgttgc cgaagcccag gttcacgtcg ccaatgttgc cgaatcccag gttcacgtcg 3703321 ccaatgttgg ccgcacccag gttgaggttg ccgatgttgc cgaggccgac gttgccgttg 3703381 ccgacgttag ccaacccgat gttgacgatg gtgatggggt tttgccccac gttggaggcc 3703441 aacaagcccg acaggtgatc accgacgttg cccaggcccg acaccaacgc cggcgtccca 3703501 agcggcagcg tgctggtgtt gtagatcccc gacagccccg aaccgaggtt gagcacgccg 3703561 gagtgcagtg tgccgacgtt ggcaataccc gaacccgcgc ctgccaaagc ggtgtgcgcc 3703621 tggttccacc accccgacat gttcgcgccg aagttgccga aacccgagcc cccgcccgcc 3703681 ccggtgttga agaagcccga cgacggaacg gtggtggtgt tcccaatgcc cggggtgggc 3703741 gggatgttga tcagcgggat gttgccggcg atgacgtaga gttcgccgtc ggcgttcgcc 3703801 gggatctccg ggaacgtgat cgccggaatg gtggcgccgg gggtgccgac gaacacatcc 3703861 aggttcagca gcgagttcgc cgggaacgtc agaccaccgg ggaacagggt gatcgcgtcg 3703921 atgctgcccg gcacctggaa acccaacggg atctggtgaa tattgagcgc cggggtgttg 3703981 aacgcctgag atgccgcatt gaagacggca tgcaccgggc cggtcgtgct gagcgtcggg 3704041 attcccgaga tgatattgcc gccgacgaac aggtcaccgg cgttgtagat tctgccgacc 3704101 gagtaccacg ttgggccgat cgcaccggat gacgtccaga cgataaacgg ctctatttcg 3704161 ctggtcgccc cgaccgacgc ggccatatcg aggaccgctc gtgcggcggt cagggcggga 3704221 atggtgaccg aggggaccgc gatggggccg aagccgacgc ttccggtgac gttcggattg 3704281 agggcgggaa tatcgatttg cgggatggtg aaggcgccca tcgccgcgtt gccggtcagg 3704341 tgcgcgttga tcgccggaac cgggatgggc gggacgacca ccgggccgaa ggccccggtg 3704401 aaatgcgcgt ccaggatggt gatccgggga acgtcgaggc tgtaggaata gctgaatagg 3704461 ccttcgtagt tgccccgcca caggatgccg ttgctgaagt tgcccgacat gagggcgccg 3704521 gtgtcgacat tgcccgagtt cgcgatgccg gtgttggcgt taccggtgtt gaaccagccg 3704581 gtgttgatgc tgcccgggtt gaagccaccg gtgttggtgt caccgacatt gaagctgccc 3704641 gtgttgtacg acccggcgtt ggccagaccc gtagtgaaac caccggcatt gaaaagccca 3704701 gtactgcccg ttccgctatt accgatgccg gtgttgaagc tgcccgagtt gaacaacccc 3704761 cagtttccgg tcccggagtt gaagaacccg atgttgccgg tgccggagtt gaacaggcca 3704821 atgttgccgg caccggagtt caagccgccg atgccggtct ggttgtcgcc ggccagccca 3704881 atcccgaggt tgttggtgcc ggtgttggcg aacccgatgt tgcccacacc catgttggcc 3704941 aggccaacgt tggtgctgcc cgcattgccc aacccgatat tgccgatgcc gagcgccgcc 3705001 ccaggcccga attgccaaac ccgacgttgc cgtggccgat attgccgaag ccgacgttgg 3705061 cgttcccgat attgcccaac cctaggttga ggtcgccgag gttggccgcg cccaggttga 3705121 agtccccaac gttgcccaac ccgaggttgt agttgccgac atcggccaac ccgaggttga 3705181 tgatggggct ttgggtcaac gccgtcccgg ccgccaacac ccccgacagc tgctggccca 3705241 cgttgccggc acccgacacc agcgccggcg tccccaaacc cacgatagcg gtgttgtaca 3705301 gccccgatat ccccgagccg acgttcagca cacccgagtt cagcgtgcca acgttgagaa 3705361 cgcccgagcc cgcgcccgcc aacgcggcat gcgcctggtt ccaccagcct gagctgccgg 3705421 ccccgaagtt gccgaaaccc gacaccccgc ccgcgccgga gttgaagaaa cccgacgacg 3705481 gggtggcggt cgcgttcccg aagccgggcg tcggcggaac gatgatgatc ggaacgctgc 3705541 tgtccggcac gctgatgttg agggccaggc tcagtggcag cggatcgatc gtgaaaccac 3705601 ccgggaatat cgtgatcgga tccagcacgc cggacgcatc gatggtcaac gggatcgcat 3705661 tttgcgggat gttgaggcca ccggggaaca gcgtgaaggc cggaagaccg cccgacacat 3705721 cgatcttgag cgggataggc gatgtcgtga tcgttgggat ggtgacggtt gggagggtta 3705781 gtgcgaggct accggtggtt gcgctgctgg gaccggtatg gatcaggatg ccctgagtgg 3705841 gtgcggtgac aaagccacca ctcattccgg ttgagttgga cgccccaacg atccagttgt 3705901 cgccgagcgc attcacgaac agcaacggaa gtctgaaggg cggcggggcg ggggccgggg 3705961 gcgtgtcgag cggaatcgtg taggtctgac cgccgatcgt catgctcggc aggaagacga 3706021 tgggcgggat gaccatcgtt tcgtggatgt ccagcaccac tgcggggaca tcgatgggct 3706081 cgatcctgaa gggcccgatg ttgacgagtt cgtggatgtc gaacagcgac atgccgggaa 3706141 tatcgatctg atcgatgtgg acgggaccga ggttgagggt ttcgttgatg tccaccaggg 3706201 tgctgccggt gatttcgatg ctgtaggaga agccgaccag cccgtggtga tcaccggtcc 3706261 acagcgcgcc gttgttgaag ctgccggagt tgaacgcggc ggtgttgaca ttgcccgtgt 3706321 tgaagccgcc ggtgttggtg tggccggcgt tgaaccagcc ggtgttgaca ttgccagggt 3706381 tgaagccgcc ggtgttggtg ttgcccgcgt tgaggctgcc ggtgttgtaa ctaccggcat 3706441 tggccagacc cgtgttgaaa ctcccggcat tgaaaagccc ggtactgccc gttccgctgt 3706501 taccgatgcc ggtgttgtag ctgcccgagt tgaacaaccc ccagtttccg gtcccggtgt 3706561 tgaagaaccc gatgttgccg gtgccggagt tgaacaaccc caggttgccg gcaccggagt 3706621 tcaggccgcc gaacccggtc tggttgtcgc cggtcagccc gatcccgagg ttgttggtgc 3706681 cggtattgcc gaacccgatg ttgcccaggc ccatgttgcc gaagccgacg ttgttgctgc 3706741 cggcgttgcc caacccgatg ttgccgatcc ccggcagcgc ccccaggccc gagttgccga 3706801 acccgacatt gccgtggccg aggttgccga acccgacgtt gccgtccccg aggttgccca 3706861 accccaggtt ctgcccgccg aggttgccac cgccgaggtt gaggttgccg aggttgcccg 3706921 cgcccaggtt gacgtcgccg acgttggcga agccgaggtt gtagctgccg acgttgccca 3706981 ggttgacgat gttcagcgga ttcaggtgcc gcagctcggc gatcgccgcg tcgatgatgc 3707041 tcggctgccc ggagccgccc gacccgccgc tggtcagcat cgccagcagg ccatcgatgg 3707101 acacccccga cacgtggttg cccaggttgc cgaaacccga gatcaccgcc ggcgcggagc 3707161 ccagcgtgct cacgttgaac atgcccgaga tgtcgacgcc ggagttcagc acaccggatg 3707221 ccaggctgcc ggcattgccc aggccggaga gcgtccccac catcggactc gaggcctggt 3707281 tcagcaagcc ggacaccccc gcgccgaagt tggcgatgcc cgagccgcca ccgccgccgg 3707341 tgttgaagaa gcccgacgac ggcagctcgg tcgagttgcc aaagcccggc agcgccggaa 3707401 tgtcgatgat cgagatgttg atgggtccgg cgctgctgag aacgtcgaag ttcagcggaa 3707461 tcgggtcgat cctggtgccg gtgatggtga ccgccggaat gtcgacggac acatcgatcg 3707521 gcacgacctc cgacatcgaa attccgttga tagtggaggc cgggatgtcg atcggcggaa 3707581 tgtcgatggg tatggattgg ctgaacgaga ttgccggcaa ttcgatggcg tcgatggtct 3707641 gctgcagcgg cagggccaat ccgcccagcg ttgccgaagt aaggggtatg gcgacctgta 3707701 tctgaaccga gattgtggga tcgggaaatt catttgggaa cgcgtcgtgg aggaactgaa 3707761 gcttgaggtt aacgttgaac ggattgagct ggacgtttga gacggtgatc gggccgaacc 3707821 tgaattgtcc ggtaatgccc agcgcagaaa gcagggtggt ggccgaggcg gtgaagccgg 3707881 cgtcggcggc accgtcgaag tcgatgtgga ttgccggaat ggggatgtcc ggcacggcga 3707941 agccgtagtt cgcttgtccc gtgaggccca ggtggatggg gggaaggatc gtggtgtccg 3708001 ggatgataat ggggccgatg ccgccggttg aagtccagtg gatcgggaat tcgggaatcg 3708061 tgatgccgac gttcaggccg aacaggccct cgaagttgcc tcgccacaag atgccgttgc 3708121 tgtcgttgcc ggtgatgaag gcgccggtgt tgacggtgcc cgaattggcg acgccggtgt 3708181 tggcgttgcc ggtgttgaac cagccggtgt tgatgctgcc cgggttgaaa ccaccggtgt 3708241 tggtgtcacc cacattgaag ctgcccgtgt tgtacgaccc agggttggcc acaccggtat 3708301 tgaaattacc ggcattgaaa agcccagtac tgcccgttcc gccattgccg atcccggtgt 3708361 tgtagccgcc cgagttgaac aacccgaagt tcccggtccc ggagttgaac aacccgacgt 3708421 tgccggtgcc ggagttgaac aacccgatgt tgccggcacc ggagttcaag ccgccgatcc 3708481 cagtctggtt gtccccggtc agcccaatcc cgaggttgtt ggtgccggtg ttaccgaacc 3708541 cgatgttgcc cacacccatg ttgccgaagc cgacgttgcc gctgccggca ttgcccaacc 3708601 cgatgttgcc caccccggcc aggcccgccg ccagacccgc attgcccaac ccgaagttgg 3708661 catcgccgat attgccgaac ccgacgttgc cgccgccgac attgcccaaa cccacgttca 3708721 agtcgccgat attggccgca cccaggttga agtccccgac gttgccgaaa ccgacgttta 3708781 cgctgcccac atcggccagc ccgagattga tgatgaggct ctggttgagt gccgtccccg 3708841 ccgaggacaa ccccgacagc tgctcaccaa cattgccgat gcccgagacc accgccgggg 3708901 tccccggcgg caacccgccg gtgttgtaca gccccgacac acccgagccg aagttcagca 3708961 cacccgatcc cagcgagccg aagttggcga aacccgaacc cgccccagcc acctcggtct 3709021 gcgcctggtt ccaccaaccc gagctgcccg caccgaaatt cccgaagccc gacaccccac 3709081 cgtcgccgga gttgaagaaa cccgacgacg gagcggtggt cgtgttgcca aagcccgggg 3709141 tcgccgggat attaacgccg ttgatcagga tagggccgac agtgacgctg gcgccgaggt 3709201 tcagcgggat gcggtcgatc gtgatcggcg gggtgctgaa gccgtcaatc tggccgtcta 3709261 tgtcgatcgt cagcggcagc ggcgcagcgg gaatggtgaa gcccgggatc gtgaatccca 3709321 gcgtgccgat cgacgcgctg gccagcagcg ccagtggatt gttgggaata ctgatgccat 3709381 tcgggaagat cgttactgcc ggggtactcc agttgacggt caccgggaat gactggttaa 3709441 ttctggtgtc gatattaagg ttacctaatt ggagggtgac gttgccggca agatctttga 3709501 tttcgattcc tgaaatgttg acgaccccca agccaaagaa ggggccgacg gggaaagtcg 3709561 tgttgaagtt ctgagccggg aacagggtga tgggcgagat ggtgatgggg ccgacgctga 3709621 taggtatggc cgtaccgcca ccaaaagcgg ggatcacgat gtccggaacg accagcgggc 3709681 cgaggctgaa ggtttggtga atgttgagcg ggatggtggg caaaatctgg atcggcaaca 3709741 cggtgatggg gccgacgccg ccgttgagct cgagaccaat ggggatcgcc ggaatggtcg 3709801 atccaccgga gagcccccac aggccctcgt agtcaccccg ccacagcaca ccgttgctga 3709861 agttgcccga gatgaacgcg ccggtgttga cattgcccga gttggcgatg ccggtgttgg 3709921 tgttgccggt gttcagccag ccggtgttga cgttgcccgg gttgaagcca cccgtattgg 3709981 tgttgcccgc gttgaagctg cccgtgttat agctacccac gttggccaca cccgtgttga 3710041 acccaccaac gttgaacaac ccggtactgg ccgtccccgc attaccgaca ccggtgttgt 3710101 agctgcccga gttgaatacc ccgaagttgc cggtccccga attgaagaac ccgatgttgc 3710161 cggtgccgga gttgaacaac cccagattac cggttcctga attcaggccc ccaatgccag 3710221 tcaggttgtc cccggtcaac ccgatcccga tgttgttgct acccgtgttg gcaaaaccga 3710281 tgttgcccac acccaggttt gcgaggccgt agttgctgct gcccgcattg cccaacccaa 3710341 tattgcccat gcccggcggc aacccaagac ccgagttgcc gaacccgaag ttggcgttgc 3710401 cgatattgcc gaaaccgaaa ttcccgctac cggcgttggc agcacccaaa ttctgcgcac 3710461 cgacattggc tgcgcccagg ttgaatatcc cgacattgcc caacccgacg ttgtaattac 3710521 cgacattgcc caagcccgcg ttaagcctca acatcttcgc gggtccggca aatatagcat 3710581 tgaggaacgc gccgacacca ccccccaacg cctgcgccgg tgggctgaac gccggcaacg 3710641 ccgcggcagc agccgacgcg ccggaatggt agccggccat cgccgccaca tcggcggccc 3710701 acatcaactc gtactcggcc tcgacggccg cgatcgctgc ggcgttctgc cccagcaggt 3710761 tcgacatcgc caacgcccgc atcgccatcc ggttgaccgc caccgccgcc ggatccaccg 3710821 tcgccgccaa cgccgcctca aacgccgcca ccgcggcccg cgcctgcccg gccaccgcca 3710881 cggcctgcgc cgccaccgaa cccaaccacc ccgcatacgg ggccgccgcg gccgccatcg 3710941 ccgccgccgc cgcaccctgc cacacccccg ccgtcaggcc cgacgtcacc tgcccaaacg 3711001 acaccgccgc cgaccccaac tcctcagcca gcccatccca cgccgcggcc gccgccagca 3711061 atgggctcga ccccgcaccc gaatacatca gcacggagtt gatttccggg ggcagaactg 3711121 gaaaattcaa ccgcccctac ctctgccgct cacgatgcgt tcacacctca tcgtctcacc 3711181 acgacgtggt gagcgcgggc acttcgacaa actaatctgc aatatcccga tcgcgtacaa 3711241 acgtgccgac atttgcggcg cattaatgcc catatcggct tgtatctctt gtagtgccgc 3711301 tttgacgggg tggtggtcag gtacggtggc ctcgggagag gctggagggc tcgacgtttg 3711361 ggctgagtgt ctgggcccgt gaaagagatc gtctgctcca gctttgtctc ctgaactgac 3711421 ccggtttagg gaattggtgg ccaggttgcg gaagtgcgca gcatcgacgt gtacctgggt 3711481 gaggcatcga atcatcgaca agcaccggag ccgcgcgtga actcccgccg cgttgtggtc 3711541 ggggatgatg tgggagaccg gccggcagtg ctgtgtacga aggttctccc accgcaacga 3711601 gttcacgcac gacggtcggc tgggtgggcc ctggaatgcg tgaactcttc atcaacacaa 3711661 catgattgac gatgaagggg agaacctcca tgcacaacaa cgctaacccg tgactgccga 3711721 gaatccagga cggagcaggc ggacgctggt cggaatcgac gcggcgatca cggcctgtca 3711781 ccacatcgcg atccgcgatg atgtcggtgc gaggtcgatt cgattcagtg tcgaacccac 3711841 gctggccgga ctgcgcaccc tcaccgacaa gctcagcggt tacgacgata tcgacgccac 3711901 cgtggaaccg acctcgatga cgtggctgcc gctcacgatc gctgtcgaga atgccggtga 3711961 caccatgcac atggccggcg cgcggcattg cgcccggctg cggggtgcga tcgtgggcaa 3712021 gagcaagtcc gacgtcatcg acgccgaggt tctcacccgc gccagcgagg tgttcgacct 3712081 gacgccgctg acactgccga cgcccgcgca gttggcgtta cgtcgatcgg tgatccgacg 3712141 tgccggcgca gtgattgacg cgaaccggtc ctggcgtcgg ttgatgtcgt tggcgcggta 3712201 ggcgttcccc gatgtgtgga ccgcgttcgc cgggtcgtta ccgaccgcga cagcggtgct 3712261 ggggcgttgg cccgacatcc gcttgctggc cggcgcaccg acccgccacg ttgaccgccg 3712321 tcatcgccgc gcacacccgc ggtgtcgccg acaccccggc ccggccgagg ccatcaagac 3712381 cgccgcaacc ggctgggccg cgttctggga cgggcacctc gacctggacg cactggccgt 3712441 cgatgtcacc gagcatctca gcgacctcac cgacgaccga tgcgcgcgtt ggtgatgccg 3712501 gtgaccaaga aggtgttgat cttgggtgac tagtcaatgg tggtggccag ggtgagcagt 3712561 tcggggatct gcgagtcgat gcgccaggca ggaagcggtg taggtgatgg cgcgccaggt 3712621 gggggtcccc gccggtgcgc acggtcgaca gcagggtgcg cagctcctct ttggcgatcc 3712681 aggccgagag aatctgcgcg cgggggtcga cggcgttgat ccgattccgc attttggcga 3712741 agcttttgtc cgacaagcgt tcccgggcgg tcagcaagcg acgtcggttg gcccactgcg 3712801 ggtcgatctt gcggccgcgc cggtcgtgga acgcccaggt cacccggcgg cgcaccgcgg 3712861 tcagcgcgtc gttggccagc gtggtcacat ggaagtggtc gacgacgagc ttggcgttgg 3712921 gcagcagccc gggcgtgcgg atcgccgagg cgtaggcagc ggcggggtcg atggccaccg 3712981 tactggatgc tctcccggaa ctgcggtgtg cgcgcttgca gccatgccag caccgccgcg 3713041 ccgccgcggc cttcatgctg ccccataaac ccctgatcac cggccaggtc gacgaacccg 3713101 gtatcccacg ggtcgacccg tacccaccgg ccagtcttgg cgcagcgctc tgggttttcc 3713161 tcgccgtgtc tggtcaacgc ccagcaccgg ggtgggcaac ggctcggtca atacccgtct 3713221 cggcgtaggc aacaaacgcc cgatgtgccg tcggccacga cacggcgtca gcctgggcga 3713281 cctcggccca ccgagcgggc cgcatccccg atcgccttgg ccatctgccg acgcagccgc 3713341 agcgtgctgc ggacgcgggc aggtacctgg gtgatggcct cgatgaacgg ccccagcttg 3713401 cagtagtctt ctcggcatcg ccagcgaatt ttgttccagc gcaccatgat gcggtcttcg 3713461 ccataaggta gatctttcgg tgaggtaacc gcgtattcct tcactgatat cgagaccacc 3713521 cccgcacgac gggcacgccg ccgccgtcgg ctcatcggtg atcacatcga ccacccgggt 3713581 cccgtcactg cggcgctcga cacgctcaac ccgtgctcct ggcagcccga acaacactgt 3713641 cgtagcgtca gacacagccc ttggctcctt cctcggcctg aatgcttcgc aacacttaga 3713701 cttcagaagg ccaagggccc tcagccgcta aacacgccga ccaagatcaa cgagctacct 3713761 gcccggtcaa ggttgaagag cccccatatc agcaagggcc cggtgtcggc gcaaaattta 3713821 gcgtcgttgc gcccacacca gagttaccgc cgcacacacg gcgtgaccac cggcgtgcat 3713881 ttaagaatcc gttagggccc gacgccggtg aagagcaagc ccgacagttg ctggccgacg 3713941 ttgccgaaac ccgagacgac ggccgcggtg acaacaccca gcgcgccggt gttgtacacg 3714001 cccgagatgc cggcgccgcg gttgaggatg ccggagagct gggtgccgaa gttggcgaag 3714061 cccgacgccg acccgagcag cggatccgac atcgcgttga gcaatcccga catgcccgcg 3714121 ccggtgttgc taaagcccga accgccgcca gctccggtgt tgaagaagcc cgacgacggc 3714181 agcgtggtcg agttcccgaa acccggcgcc ccgccgaacc cggcgatcgg gacgttgatc 3714241 gggccgatag tggtgtcggc gtgcaggtcc agcaagatcc ggtcgagaac gatggccggg 3714301 atgtcgacgg gcgggatgcc attggacaac gcgaggccca gcgggagggt ggggatcagg 3714361 gtgccgcccg ggatggtgaa ccccgggatg gtcagcgaca ccggcaggcc gatgtcgatc 3714421 gggtcgaggg ggatggtgaa tcccgggaag gtcaccgtgc cggaggggat ggagatgggc 3714481 cccacaaagt atgccccttg cgtggacgtt gcacccccgc cgctagaggg cgcgatccgg 3714541 attccgggga agaagctggg cttgacccaa atctctgagg ttggtccgga cgtgctggtg 3714601 acggctcctt gggagtaact gacgagcacg ggcggggtcc tgacggtaat ggggttgacg 3714661 gtgatggagc cgacatggac ggcggggtcg aggcccaagt gaatggatgg aacagagatg 3714721 tccgggatgg cgatcgggcc gatgccaccg accgcggcga agccgaccgg aatgggcggg 3714781 atgtggatgg gcggcagcac ggtaatcggg ccgatcccgc cgctgacgtc ggcgcccacc 3714841 gcggggaaca gcgggagggt gtagcccacg gcgaagccgg ccaggccctg gtagtcgccg 3714901 cgccacagga tgccgttgct gaagttgccg gtgacgaagg cgccggtgtt gacattgccc 3714961 gcgttggcca ccccggtgtt ggcgttgccg gcgttgagcc agccggtgtt gatgctgccc 3715021 gggttgaagc ccccggtgtt ggtgtcaccg acattgaagc tgcccgtgtt gtagctgccg 3715081 gcgttggcca caccggtgtt gaaactgccg gcattgaaga gcccagtgct gcccgttccg 3715141 ctattgccga cgccggtgtt gaagctgccg gagttgaaca acccgaagtt gccggtcccg 3715201 gtgttgaaga acccgacgtt gccggcgccg gagttgaaca accccaggtt gccggcaccg 3715261 gaatttaggc cgccgatgcc ggtctggtag tcgccggtca gcccgatccc aatgttgccg 3715321 gtgccggtgt tggccaaccc gatattgccc acgcccacgt tggccaaccc ccagttgttg 3715381 ccgccggcat tgcccaaccc cacattgccc aggcccggca cgcccgcggt cagacccgag 3715441 ttgccgactc cgacattgcc gtggccaata ttgccgaacc ccaggttgcc ggcgccgata 3715501 tttccgaagc ccaggttgtg cgcgccgagg ttggccgcgc ccaggttgac ctccccgaca 3715561 ttgccgaaac cggcgttgtg gctgccgacg ttggccaacc cgatattcag aacggtcacc 3715621 gggttcaccg cggacccgcc ggaaagcagc cccgacagtt ggtggccgac gttgcccagg 3715681 cctgagacca gcgccggggt ccccaccccc agcgtgctgg tgttgtagat ccccgagaca 3715741 cccgagccca ggttgagcac accggaatgc agcgtgccaa cgttggcaaa acccgagccc 3715801 gcccccgcca gcgcggtgtg cgcctggttc caccagcccg aggtgcccgc gccgaagttg 3715861 ccgaagcccg atcccccgcc cgcgccggcg ttgaagaagc ccgacgacgg ggtgatggtg 3715921 ctgttcccaa tgcccggggt gggcgggatg ttgatcagcg ggatgctgct ggcgaggaca 3715981 tacactgagc cgtcggcgct cgccgcgatc tcgggccagg tgatggccgg gatgtccacg 3716041 ccgccggcgc cggcggtcac gtccaggttc agcagcgagg tcgccgggaa cgtcaaacca 3716101 ccggggaaga gggtgatcgc gttgacgctg ccgggcacct ggaagcccaa cgtgatcggg 3716161 ccagtttcga gctgcggagt ggtaaacgcc ccgctggacg cggaaatggt gagatggctt 3716221 ccgtcgctcg tgccggcgcc gaaaacgagt gggccggtgg cgtagggcga accgtcggcc 3716281 gatccgaatg aatagaaggt tataccaagg ccattagtgc cttgagtcca catttcgaag 3716341 ggatctatcc tcatctccgc cccaaccgag gcgttgatta tttgctccac aatgacactc 3716401 accggcggaa tgcgcacgga ccccacaacg atgcggaagg cggcgcttcc ggtgatgttt 3716461 ggggtgagtg cggggatgtc gatctgcgga atggtgaatg cgcccatcgc gacgtttccg 3716521 gtcaggtgcg cattaacggc cggcaccggg atgggcggga ggaccacggg tccgaagccg 3716581 ccgtcgaggt gggcgtccac gatggtgatc cggggcacgt cgaggctgta gaacaggctg 3716641 aacaggccct cgtgatcacc ccgccacaac aggccgttgc tgaagttgcc cgacatgaac 3716701 gcgccggtgc cgacgttgcc cgagttggcg atgccggtat tggtgtggcc ggtgttgaac 3716761 cagccggtgt tgatggtgcc cgggttgaac cccccggtgt tggtgtcgcc ggcattgaag 3716821 ctgccggtgt tgtagctgcc ggcgttggcc acgccggtgt tgaagctgcc ggcattgaag 3716881 agcccagtgc tgcccgttcc gctattgccg atgccggtgt tgaagctgcc cgagttaccg 3716941 atgccgaagt tgccggtccc ggagttgaag aacccgatgt tgccggtgcc ggagttgaac 3717001 aagccgatat tgccggcacc ggagttcagg cccccgatgc cggtgaggtt gtcccccacc 3717061 agcccgatcc cgatgttgcc ggtgccggtg ttggccaacc cgatgttgcg cacgccctgg 3717121 ttggcgaaac catagttggc gctgccggca ttgccgaacc ccgtgttgcc caggccggcc 3717181 gcgccggcgg tcagacccga attgccgaaa ccgatattgc cgtggccgac gttggcgaag 3717241 ccgaggttgc cggtgccgac gttgcccagc cccaggtttt gcgcaccgag gttggccgcg 3717301 cccaggttaa cgtccccgac gttgccgaac ccgacgttga agttgcccac atccgccaac 3717361 ccgatgttga ggatggggat ctggttcaac gcggtcccgg ccgcagacac gcccgacagc 3717421 tgatggccga cgttgccgag gcccgacagc accgccggcg tcccgagcgg caacacgctg 3717481 gtgttgtaga tccccgagac acccgagccg acgttgagca cacccgagcc cagcgtgccg 3717541 acattcaaca cccccgatcc cgaccccgcc agcgcgctcg ccgcctggtt ccaccagccc 3717601 gacaggttcg acccgacgtt tccgaacccc gacaccccac ggcgccggag ttgaagaacc 3717661 ccgacgacgg ggtcgccgtg gtgttgccga accctggcgt cggcgggaca tcgatgatcg 3717721 ggatgctgct gtcgggcacg gtgagattca gcgccaggtg cagcggcagc gggtcgatcg 3717781 tgtacccacc cgggaaaatc gtgatcggat ccagcgcgcc ggacgcatcg atcgttaacg 3717841 ggatggcgtt cgtggggatc gtcaggccac ccgcgaacaa ggtgaaggcc ggcagaccac 3717901 cgctgatgtt cacgtccaac aggaatctcg tggtagcgat ttgcggaatc tcgaaacccg 3717961 gaatagatat cttgagctcg ccggtcgttc cggggccagg gccggtgtga atggtgatgc 3718021 cctgggtggg cgccgggaag gggtctccga aattgggaat cgccgcggtc gacccgagga 3718081 tccagtcctc gccttcgaag cgcatgctga tgagcggaag cgtcatggtt gacccgggtg 3718141 aggcggggat gtccagcgga atggttctcg tctgtgcggg aattgtggtg gcgggcacca 3718201 ggacgatggg atccatgtgg atcgattcgt ggatctctag cggtatcgcg ggaacatcga 3718261 cctgcgggat ggtgaagggt ccgatctcga cgatttcgtg gacgtcgaac agcgacatgc 3718321 cggggatgtc gatctgctcg atgtggatgg ggcccaggtt gagggtttcg ttgaggtcca 3718381 gcagggtgct gccggcgatg tcgatgctga aggagaagcc gaccagcccg tggtagtcac 3718441 cggtccacag cgccccgttg ttgaagctgc cggagttgaa cgcgccggtg ttgacgttgc 3718501 cggtgttgaa caggccggtg ttggtgtggc cggtgttgaa ccagccggtg ttgacggtgc 3718561 ccgggttgac gccgccggtg ttgaagctgc ccacgttgag gctgccggtg ttgtaggagc 3718621 cggcattggc cagaccggtg ttgaagttcc ccgcgttgaa caacccggtg ctggccgtgc 3718681 ccgcattccc cacaccggtg ctgtaactgc ccgagttgaa cagcccgaag ttcccggtcc 3718741 cggtgttgaa gaacccgatg ttgccggtgc ccgagttgaa caacccgagg ttcccggtgc 3718801 ccgagttcag gccgccgatc ccggtccgat agtccccggt cagcccgatc ccgatgttgc 3718861 cggtgccggt gttggccaac ccgatattgc ccacacccac gttggccaac ccccagctgc 3718921 cgctgccggc gttacccaac cccacattgc ccaggccccc cgcgcccgcg gtcaggcccg 3718981 cgttgccgaa tccgaaattg ccggcaccga tgttgccgaa cccgaggttg ccggtcccga 3719041 cgttgcccaa ccccaagttg ctgccgccga ggttgccggc gccgacgttg atgttgccga 3719101 cgttgcccgc acccaggttg aactcaccga cgttagccaa accgaggttc accccgccga 3719161 cattgcccaa ggccaaagcg ttgccgatgt cgaggtgctg cagctcggcg atggccgcgt 3719221 cgatgatctg atcgaacacg gactcggcag gtgggaaggt gaggatcgcg atcaggccat 3719281 cgatggacac ccccgacata tggtcgccga ggttgctgaa ccccgagatc accgccgggg 3719341 tggtggcgtc cagcgtgctc acgttgaaca gcccggagat ggcggtgccg gagttcagca 3719401 cacccgaggc cagggtgccg gcattgccca gccccgagag tgtccccacc agtgaccccg 3719461 cgccggcctg gttgagcagg cccgacacgc ccgcacccaa gttgccgatg cccgatccgc 3719521 cgccggcacc ggtgttgaag aaccccgacg acggcatctg ggtcgagttc ccgaagcccg 3719581 gcgccgccgg gatgtcgatg atcgggatgt tgaggggtcc ggcactggtg cgaatgtcga 3719641 agcccagcgg gatcgcggaa atggtggtgc ctgtgatcgt gaccgccggg atgtccacgg 3719701 acgcatcgat cggcaccact tccgacattg aaatcccatc gatgaccgag gccggaatat 3719761 caacaggtat gcggatagga atcgactcac tcaacgaaat cacatccagg gggatgggct 3719821 cgatctccag gggcacaccg atcccggcca ccacgattgg ctcaagatga attggtccga 3719881 gttggcccgt gataggacca agaacgggca ggcctaacgt gaaatccatg ggcggaatat 3719941 cgatattcga gagcgtgatg gggccgaagc tgatgaagct accgttattc ttcagggcgg 3720001 acagcagggt ggcttccggg gcggtgaagc cgacggtgac gacgccattg atgccgatgt 3720061 ggatggcggg gatggggatg tcgggcacgg tgaagctgta gtccgcgtcg ccggtgatct 3720121 gcaggtgcag cggcggaagg atcgtggtgt ccgggatgac gatggggccg ataccgccag 3720181 tcgtggtgat gcggatcggg aattgcggga tcgtgatgcc ataggacagg ccgaacaggc 3720241 cctcgtggtc gccgcgccac agcatgccgt tgctgtcggt ccccgacatg agggcgccgg 3720301 tgttgcgggt gcccgtattc ataatgccgg tgttgaacca gccggtgttg atgtcgcccg 3720361 ggttgaaacc accggtgttg gtatcaccga cattgaagct gcccgtgttg tacgacccgg 3720421 ggttggcgat gccggtgttg aaattgccgg cattgaagag cccagtactg ccggttccgc 3720481 tattaccgat gccggtgttg aaactaccgg agttgaacag tccgaagttg ccggtgccgg 3720541 tgttgaagaa cccgacattg ccggtgccgg aattgaacaa tccgatattg ccactacccg 3720601 agttgaggcc gccgatgccg gtctggtagt cgccgaccag cccgatcccg atgttgcccg 3720661 tgccggtgtt ggccaacccg atgttgccca cacccaggtt ggccagcccc cagttgttgc 3720721 tgccggcatt gctcaacccc acgttgccca ggccggccag gcccaccgcc ggacccgagt 3720781 tggcgaaccc gacgttgccg gcaccgatgt tgccgaaccc gacgttcccg ctgccgagat 3720841 tgcccaggcc caggttctgc gcgccgatgt tggccgcacc ccagttgagg tcccccacat 3720901 tgcccaaccc ggtgttgaac gcgcccacat cggcccaccc gatattgaca atggggctcc 3720961 ggttgagcac ggtcccattt gccaagaacc ccgacagctg ctggccgagg ttgccgatgc 3721021 ccgagaccac cgccggggtg cccgctccca gggtgctggt gttgtaccac ccggagatcc 3721081 ccgagccgac gttcagcacg cccgagctca gcgtgccggc attggcaact cccgagcccg 3721141 cccctgctaa cacgtcgtgc ccctggttcc accagcccga cgtgcccgcg ccgacgttgg 3721201 cgaaccccga tccaccgccg ccgccggtgt tgaagaaccc cgacgatggg gccgtggtgg 3721261 tgttgccgaa ccccggcacc gccggcacat cgatgatcgg gatcgggata tcgccgatga 3721321 ggatggtgcc gtcgaaggtc gccggcacgg tgtcgagggt gaacccgtcg ggcaacagcg 3721381 tgaacgcgtc cagccccacg gacagtccgg tgaccccggc ggaggcccgc ggaaaggtca 3721441 gcccacccgg gaagaaggtg aacccgtcgt tggcgacctc catacccacc gtcacggggg 3721501 tttgcgcggg aatggtgaaa ccattcggga aaagcgtcca cggggtggtg tccaagttga 3721561 gggttagggg aattggtgtc ggggtgacca atatctgacc gctaaccgtg aggccgggca 3721621 caatgatgtt ctctaggaac aagacaccgg caacaacttg gaacgcatca atggtgataa 3721681 atgggtcact gaggcggaac ggctcgagaa aaagccctat cgaaccggcg agcgggtcaa 3721741 gagcgcgaat cggcgagatg gtgtttgcgg ccaggtccac gcttccggtg atgctggcga 3721801 tgggaagtga gggaatgctg atcggtggga cggtgaacgg acccaggccg acggtggcgt 3721861 cggtgatctc gacgtgcacg gcgggtaccg ggacgggcgc cacatgcagc gggcccaccc 3721921 cgccgatcgc gtgcacggtg accgggaatt gggagatcgt gggcccgacg cggacgccga 3721981 ccaggccctc gtagccgccc cgccacaaca ggccgttgct gtagtcgccc gtcatgaagg 3722041 cgccggtgcc gaaggtgccc gcgttggcca acccggtgtt ggcatgcccg gtgttgaacc 3722101 agccggtgtt gatgccgccc gggttgaagc caccggtgtt ggtgtcgccg gcgttgaagc 3722161 tgcccgtgtt gtagtcacca gtgttggcga tgccggtgct gaagctgccg gcattgaaga 3722221 gcccggtgct ggccgttccg ctattaccga tcccggtgtt gaagcggccg gagttcccga 3722281 tgccgaagtt gccggtcccg gaattgaaga acccgacgtc gccggtgccg gagttgaaca 3722341 acccgatatt gccgatgccg gagttcaagc ccccgatccc ggtccgatgg tccccgacca 3722401 gcccgatccc gatgttgccc gtgccggtgt tggcaaaccc aatattgccc acacccatgt 3722461 tcgccaagcc atagttgttg atgccggcat tgccaaaacc aacattgccc acccccgccg 3722521 cgccggcggt caggcccaag ttggcaaacc ccaggttgcc atggccgatg ttgcccaacc 3722581 ccaggttgcc gtccccgaca ttgcccaggc ccaggttgtg cccaccgatg ttggccgcac 3722641 ccaggttgac gtccccgaca tttccgaacc cggtgttgaa gttgcccaca ttggccaacc 3722701 cgaggttgcc ggcgagcatc gagcgcagcg tggttcccgc cgccgacacc cccgacagct 3722761 gctggcccag gttgccgatg cccgacaccg ccgccggtgt cccgaaaggc aacacgctgg 3722821 tgttgtagaa ccccgagatc cctgagccca ggttgagcac acccgagccc agggtgccca 3722881 cgttgccaac acccgaaccg gcccccaaca gcgcgctcgg cgcctggttc caccagcccg 3722941 agctgcccgc gccgacgttg ccgaaacccg acaccccacc cgcaccggag ttgaagaatc 3723001 ccgacgacgg ggccgtggtg gtgttcccca ctcccggcgc cgccgggata tgaaggccct 3723061 ggatcgtgat ggggccgatc gtgaccccgc cccccacggt cagggggatg cgatcgatcg 3723121 tgatcggcgg ggtgctgaac ccgtcgatct ggccctcgat atcgatcgac aacggcaacg 3723181 gctgcgcggg aacactaaat cccgggatgg taaagcccgg gttactgatc gacacactca 3723241 ccagcaaccc caaaggatta tcgggagcac tgatgccatt cgggaacagc gtgatcggag 3723301 gggtatccca tctgatcgtt aaatcaatct gtggattggt gggtccggga atggtggtgt 3723361 cgataacgat agggccgata aagctgacaa gctgaccgtt agaatcaaag gtttggattt 3723421 gtggaattgt gattttccct aaactgaagg tgggaaaggg caattggttg acaaatgtct 3723481 gttgggcaaa cagggtgatg ggtgtgatgg tcagcgggcc gatgttgatg ggtatgccga 3723541 taccgccgcc gaacgcgggg atcacgatgt cgggaaccac cagcgggccc aagttgacgg 3723601 tttggtgaat gctgagcggg atggtgggca ggatcgggat gggctggatg gtgatcgggc 3723661 cgatgtcgcc gttgagcacc aggccgatgg gaattgcggg gatcgacgag ccggcggaga 3723721 cgccgaacag gccctgttag tcacccaccc acagcacgcc gttgttgaag ttgcccgaga 3723781 tgaacgcgcc ggtgttgacg ttgcccgagt tggcgatgcc ggtgttggtg ttgccggtgt 3723841 tcagccagcc ggtgttcaca ccgccggggt tgaagccacc ggtgttggtg tcgccggcgt 3723901 tgaaactgcc ggtgttgtaa ctgcccacgt tcaccacgcc ggtgttgaaa ttgccggcat 3723961 tgaacaaccc cgtgctggcc gtccccgcat taccgacacc ggtgttgtaa ttacccgagt 3724021 tgaacacccc gaagttcccg gtccccgaat tgaagaaccc cacattcccg gtgccggagt 3724081 tgaacaaccc gatattcccg gtgcccgaat tcaggccccc gatacccgtc aggtggttgc 3724141 cggtgagccc gacgccgacg ttgttggtgc cggtgttgcc gaaaccgatg ttgcccacac 3724201 ccaggtttgc gaaaccatag ttgctgctgc ccgcattgcc caacccgata ttgcccaagc 3724261 cggccaggcc cgcccccaga ccggagttgc cgaacccgac attcccgtta ccgaggttgc 3724321 cgaacccgac attggtgcca ccggcattgc cgaaacccag attctgccca cccacattgc 3724381 ccgcgcccag gttgaacacc ccgacattgc ccaacccgac gttgtaattg ccgacattgc 3724441 ccaaacccgc attcaggctc agcgccttcg cagggctggc gaacagggcg gtaaggaacg 3724501 cgccgacacc tccccccagc gcctgcgccg gtgggctgaa cgccggcaac gccgcggcag 3724561 cagccgacgc gccggaatgg tagccggcca tcgccgccac atcggcggcc cacatcagct 3724621 cgtactcggc ctcggcggcc gcgatcgccg gcgtgttctg ccccaacaga ttcgacaccg 3724681 ccaacgccac cagccgcgcc cggttggccg ccaccagcgc cggatccacc gtcgccgcca 3724741 acgccgcctc aaagaccccc accaccaccc gcgcctgccc ggccaccgcc tcggccgcgg 3724801 ccgccaccga acccaaccac cccgcatacg gcgccgccgc ggccgccatc gccgccgccg 3724861 ccgcaccctg ccacaccccc gccgtcaggc ccgacgtcac ctgcccaaac gacaccgccg 3724921 ccgaccccaa ctcctcagcc agcccatccc acgccgcggc cgccgccagc aacgggctcg 3724981 accccgcacc cgaatacatc agcacggagt tgatttccgg tggcaacacc ggaaactcca 3725041 tcacccattc cccttcccag cccgacacca atccccaccg acacccccca catgacgtgt 3725101 cgacgccccg ataattttgc tcgcattgcc aacggcccaa gaacgattcc ccgataatcg 3725161 cgggtactgg gtgcactttg cacagacgcc gcagcaaaat gcgcatatgc cctgtccaga 3725221 ccggcgagcg gcagggcgtc atctgccctg acacttcgac tgctggcgga gtccgcgagc 3725281 atgctcaccg ccgcggcgtg cgccgaaccg gcagcgccgg caaatccatg accccagcct 3725341 gttcttgggt cactgcgacg ttcactttta agcgcgacca cgtaaggttg ggcaaagttc 3725401 ccaagcgttt cacagtgtca gtgcacagtg cgcacctgat taccaaaacc ccgaacctca 3725461 ctcgaaagcc gagagcgggt aaaagtcgtt cagcgacctg tctggtagag aaatccagac 3725521 ccgagtacat gatccggtcg ggatcgtact tgcgccgcac tgtggtcagc cgcgacaggt 3725581 tcgcgccgaa gtattgtgac gccgcggcgt tggcctccag gtagttgaca tagccgccga 3725641 ccgaaaagtg ttgcaccgcg tggtgtgcgt cgctcagcca tttgttggcc gtcgccacct 3725701 ggccgtcgct gggggtgttg acataccact gcaccacagc ggactggcgg caccagggaa 3725761 atgccgagcc ctccgggtcc atgtcgccca ccgcgccgcc cagcgaatcg atcagagccg 3725821 acgcgcggcc cgcagcgggt ggccatgttc cgatggcggc gacgatggct tgggccgcgg 3725881 ccggattcgt cgtcccgatg acatcggatc cagccacgaa gccctccggc ggataggtcg 3725941 tatggccgcc ggccagatac ctcaccaggt ccatacggcg cagcgtcttg tgctcaactc 3726001 cactgggttg cactccaacc gcggacttga tcgcatccgc gacagccgcg ccggaccgcg 3726061 ccgggcagct cgccagcaca tgacaattgc ctccggatga gctgaccgcg gggtcaacca 3726121 gaccccacgt ggtgcggtcg gccccggcca gccacgtctg tcagccgacc agcacctgcg 3726181 cggccgcaga cggcgcgaaa tcgacacgga cgacatcgca gtccgcggtg gggaacctcg 3726241 cgaacgtcat cgatgtcgtc accccgaagt tgccgccccc gccgccacga agcgcccaga 3726301 acagctccgc gtggtcgtcg gcagacgcgc tcaccgcatc accgccgggc aacaccaccg 3726361 tcgccgactt gagcgcatcg caggtcaacc ccgcatggcg agaatcggcg cctaacccgc 3726421 cgcccagggt caaacccgcc acacccacgg tcgggcagct gccggtcgga atcgcccggc 3726481 tctcaccggc caacgcttga tggaccgcat agagatcggt cgcggccgac accgtacgtt 3726541 tctcgtggcg ctgtcgaaat gcaccccgcc cggtaggccc agcagatcga gcaccatggc 3726601 gccattggcc gacgaggcgc cgatgtagga atgtccgccg ccgcgcacag cgatcttgag 3726661 cttgctggcc gccgctacga aaccgccttc cggacgtctg cctgcgaggc gaccgtcacc 3726721 accgcggccg gattcaagcc gctgtagttc gaattgaaga tctgctttcc gctcgtgaac 3726781 gccctgccgt tggccggcag cagcacctgc ccgcctatcg atgaggccag actggcccac 3726841 ccatcacccg gtgttgcgcg cgccaatatc gtcgggaaga ccgccgacgt cgccggcgct 3726901 ccgacggcgc cgcgaagaaa cgtctggcga gacatcacga ccgcgatcgt gtcgtatcga 3726961 gaaccccggc cggtatcaga acgcgccaga gcgcaaacct ttataacttc gtgtcccaaa 3727021 tgtgacgacc atggaccaag gttcctgaga tgaacctacg gcgccatcag accctgacgc 3727081 tgcgactgct ggcggcatcc gcgggcattc tcagcgccgc ggccttcgcc gcgccagcac 3727141 aggcaaaccc cgtcgacgac gcgttcatcg ccgcgctgaa caatgccggc gtcaactacg 3727201 gcgatccggt cgacgccaaa gcgctgggtc agtccgtctg cccgatcctg gccgagcccg 3727261 gcgggtcgtt taacaccgcg gtagccagcg ttgtggcgcg cgcccaaggc atgtcccagg 3727321 acatggcgca aaccttcacc agtatcgcga tttcgatgta ctgcccctcg gtgatggcag 3727381 acgccgccag cggcaacctg ccggccctgc cagacatgcc ggggctgccc gggtcctagg 3727441 cgtgcgcggc tcctagccgg tccctaacgg atcgatcgtg gatgcgatgt agaccatggc 3727501 cgccgcgacc gtcacggtcg tcacgaaatc gatccccttg ctgcgcacca ccaacaggcc 3727561 ggcccgttcc tcggacaaca ccaaccgcag caccgccgcc accccaacgc cgataccgat 3727621 cagcagcgca ccacggcgcc agaagttagc ccccgccagc acgaacccca ccgcgaagat 3727681 cgacccaacc agcaggatcg gccactgggc gccaacagtg cgccggaaaa cggccctcac 3727741 ggtcatcgcc gctcagccag ctccacgaca ttggtcaaca agaacgcccg ggtcaacggg 3727801 cccacgccgc ccggattggg tgacacgtgg ccggcgagct cccacacatc gggatgcacg 3727861 tcgccgacca gtccgtcatc agtgcggctg acgccgacgt cgattaccgc ggcacccggg 3727921 cgcaccatgt cagccgtcaa caggtgcgcc accccgaccg cggccacgac gatgtcggcc 3727981 tgccgggtca acgcgggcag gtcgcgggta ccggtgtggc acaacgtcac cgtggcattc 3728041 tccgagcgcc gggtcagcaa cagccccagc ggccggccca ccgtcacacc acgaccgata 3728101 acgaccacat gcgcgccggc gatcgagatg tcgtagcgcc gcagcaggtg cacaatgccg 3728161 cgcggagtac acggcagcgg cgccggggtg cccagcacca gccggcccag gttggtcggg 3728221 tgcaacccat cggcgtcctt ggccgggtcg acgcgctcca acgccgcgtt ctcgtcgaga 3728281 tgcttgggca acggcaactg cacgatgtag ccggtgcagt cggggttggc gttcagttcg 3728341 tcgatggtct cattcagcgt ggcggtgctg atgtcggcgg gcaggtcgcg gcgaatcgac 3728401 gtgatgccca ccttggcgca atcagcgtgc ttaccgcgca cgtaggcctg cgaccccggg 3728461 tcgtcaccga ccaggatggt gcccaagccg ggcgtgcggc ccgccgcgtc caatgcggcc 3728521 acccgcggct tgaggtcacc gaagatctcg tcgcgggtag ccttgccgtc cagcatgatc 3728581 gcgcccacgc cagccagtct ggcatgcgtg tccgcggtgc cgatggtgac gacccgctca 3728641 cgcgcccacc gtacggacaa cttgtaccat tgtggtacag attatccgta catctttcta 3728701 agagaggacg catgagcatc agtgcgagcg aggcgaggca gcgcctgttt ccactcatcg 3728761 aacaggtcaa taccgatcac cagccggtgc ggatcacctc ccgggccggc gatgcggtgc 3728821 tgatgtccgc cgacgactac gacgcgtggc aggaaacggt ctatctgctg cgctcaccgg 3728881 agaacgccag gcggttgatg gaagcggttg cccgggataa ggctgggcac tcggctttca 3728941 ccaagtctgt agatgagctg cgggagatgg ccggcggcga ggagtgagaa gcgtcaactt 3729001 cgatcccgat gcctgggagg acttcttgtt ctggctggcc gctgatcgca aaacggcccg 3729061 tcggatcacc cggttgatcg gagaaattca gcgtgatccg ttcagcggga tcggcaaacc 3729121 cgagccgctc caaggtgagt tgtcgggata ctggtcgcgc cggatcgacg acgaacaccg 3729181 gctagtgtat cgagcgggcg acgacgaagt cacgatgctg aaggcccgat accactactg 3729241 atttgggggc tggtggtatt ccggcgggct taagctcccc atgtggctcc cggcagctgc 3729301 gaagccccgg acgtgttcaa cccggccaaa ctcggtccgc tcacgctgcg gaaccgggtc 3729361 atcaaggccg ccaccttcga ggcccgcaca cctgacgcgt tggtgaccga tgacctgatc 3729421 gagtaccacc ggctgccggc cgcgggcggg gtcgccatga ccaccgtcgc ctattgcgcg 3729481 gtctcccccg gcggacgcac cggcggcaac cagatctgga tgcgcccgca tgcggtgccg 3729541 ggactgcgcc ggctcaccga ggcgatccac gccgaggggg cggcgatcag cgcccagatc 3729601 ggccacgccg gcccggtggc cgacgcccgc tccaaccagg cgaccgcgct ggctccggtg 3729661 cggttcttca atccgatcgc tatgcggttc gcccagaagg cgacccgcga ggacatcgac 3729721 gatgtgctgg ccgcgcacgc ccatgccgcc cggctggccg tcgacgccgg cttcgacgcc 3729781 gtcgaaatcc atttggggca taactatctg gcgagcgcgt ttctgtctcc gctgctcaac 3729841 cggcgtgatg acgagttcgg cggttcgttg cagaaccggg cgaaggtagc tcgcggattg 3729901 gtgatggccg tgcgccgcgc cgtccggcag caggtcgcgg tgaccgccaa gctcaacatg 3729961 accgatggca tccgcggcgg catcacagtc gacgaggcac tgaccaccgc caggtggctg 3730021 caggacgacg gcgggctaga cgcgatcgag ctcaccgcgg gcagctcgct ggtcaacccg 3730081 atgtatttgt tccgcggcga cgcgccggtt aaggagttcg ccgccgcgtt caaaccaccg 3730141 ctgcgctggg gcatccggat gaccggccat aggtttttcc gcgaataccc ctaccgcgat 3730201 gcctatctgt tacgcgaggc tcggttgttt cgcgccgagc tgacaatccc gctgattctg 3730261 ctgggcggca tcaccaaccg aacgaccatg gacctggcga tggccgaagg gttcgagttc 3730321 gtcgcgatgg ctcgggcgct gctcgccgag cccgacctgg tcaatcggat cgcggccgaa 3730381 ggcagccagg tgcggtcggc gtgcacacac tgtaatcagt gcatggccac gatttatcgc 3730441 cgcactcact gtgtggtcac cggggctcca tagcgtccag attgacgcca ccgtgaagaa 3730501 gtgcaaccca ttgtgccgga aatccggttg acttccccgc gcgaatccgg ctcaagcact 3730561 attgaccgcg cgcagcataa tttgaaccga tgagtcgacc ccatccaccg gtgctgacag 3730621 ttcggtccga tcggtcgcag caatgcttcg ccgcgggccg cgacgtggtt gtcgggagtg 3730681 atcttcgtgc cgacatgcgc gtggcgcacc cactgatcgc ccgtgcgcac ctgttgctgc 3730741 gcttcgatcg gggcaattgg atcgcgatcg acaacgattc gcagagcggg atgttcgtcg 3730801 acggccagcg ggtgtcggaa gtcgacattt atgacggcct gactatcaac atcgggaagc 3730861 ccaccgggcc gtggatcacc ttcgaggtcg gccatcacca gggcatcatc ggacggctgt 3730921 cacgcacccc gtcgtcgcgt cccggctcac cgatctagcc ccctgccaag cacagcccgt 3730981 gcgccgccgc aaaggccacg gcttggtcga cgtcgacacg cgcacccacc aacgacgcgg 3731041 tccgccacaa taccgggtcc acggtcgcgc cccgcaagtc ggcgtcatcc agccgggcgc 3731101 ccgtggtacg ggcaccactg aggtcggcgc cgcgcagcac gcacttgcgc aagtcggtat 3731161 ccaccaggct ggtctctcgc aaccggcagc cggtcaagtt gagaccacgc agatcatttc 3731221 cgccgagcac ggcgagcgtg aaatccacgt cgtccaacgt cagcggccgc agccggcaag 3731281 ccacgaagac cgagcccaac atgctgcact gggcaaatgt gctgtgccac agtgtcgtcc 3731341 gttcgaaggt gcaattacga aacgccgacc ctcggtgttg tgactcggcc agattcacgc 3731401 cgctgaaatc gcattcgctg aacatcgccc gttcggtgtg caggcggcta aggtcctcgt 3731461 cgcggaagtc tcgaccggtg aattcgcaat caacccactg ctgcaacgct tttcaaccgc 3731521 ccgcaggaga cagggtggcc agcgcgtatt cgctcaccgc gatcagtgca tcggtcgccg 3731581 acctgcgatt gcgggcgtca acattgatca ccggaatgtg tgcgggcagc gtcaacgcgt 3731641 cgcgcaccgc gctaaccgga taccttggcg cgctgtcgaa ctcgttgatg gcgatcaaga 3731701 acggcaggtt gcggtgttcg aagaagtcga ccgccgcaaa gctgtcctgc agacgccggc 3731761 agtcgaccaa gacgatcgcc ccgatggcac cacgcaccag gtcgtcccac atgaaccaga 3731821 accggcgctg gcccggggta ccgaatagat aaagcaccag atcctcgccc aaggtgatgc 3731881 ggccgaagtc catcgccacc gtggtgctcc gcttgtcggg agtggcctcc agcatgtcga 3731941 cgccggcgga ggcatcggtg accatcgctt cggtgcgcaa cggcatgatc tccgaaacag 3732001 cgccgacgaa tgtggtcttg ccggacccga atccgcccgc gatgacgatc ttcgtcgacg 3732061 cggtgccgga tgcctcagag tgctttaagg ccacgcaggg tccttcctat gagttcgtgg 3732121 cgttcgtcgc gggtcgatcg gtcggtcaag gtcgcgtgca cccgaaggta accggacgtg 3732181 accagatcac cgaccagcac acgcgccaca cccaccggca aatccagccg agccgagatt 3732241 tccgcgaccg acggactgcc aatgcacaat tgcaagatcc tgcgtcgcat gtcgtaggcc 3732301 ggccagcggc cagccggtcc cgccggcagg gtctgcaccg gcgcctgaag cggaaggtcg 3732361 acgtcggtac cggtacgtcc ggcggtcagc gtgtaggggc ggaccaggcc cgccttcggt 3732421 ctatcgccgg caggattgaa caacgccgcc cacccgctcg acaaggatgg ccatctcata 3732481 accgatctgg ccgatatcgc atccggtcgc ggccagcgcc gccagcgccg acccgtctcc 3732541 cacctgcatc aacagcaggt agccgttctg catctcaacc accgactgca gcacctgccc 3732601 gccgtcgaac agttgcgcgg cgccgccggc caggctggcc agcccggacg tcaccgcggc 3732661 caactgatcg gcgcgttcgc gtggtagatg ttcgctggcc gccacgggaa gcccgtcgac 3732721 cgacaccagc aatgcatggg ccaccccggg aacctcgcgg gcgaacttcg acaccagcca 3732781 gtcaagcggg ctgtccggca agcgggcttt cattgctgat tgggtccctg actgctctcg 3732841 cgggcatgcg accgcccggt gcgcacgccg ccgaaatggc tgctgatgga ggcacgaacc 3732901 gcgtcggggt cgcgtaccgc agccgcgtgc cgcggcgctc ggccgggatg aagtccgccg 3732961 ttggatgcta gcgctgcacc cggatgctcc cgatcgggtc cctcaggcac cgccgccccc 3733021 ggcactaacc gggccccggg ttcgcgcacc ggcaggccgt agtccgtgcg ggactgcacg 3733081 ggcttgtccg cggcctcggc ggccgccgac cagccgtggt cccacaccga cttccagtcc 3733141 agatcggggc tgtgggccag ctcgtgcggg tcacccacca tctcggagag catccgccgg 3733201 tagatgacgt cgtcatcaac cgggcccgcc ggtggcgcgg gtttggcggg cggcggcgcc 3733261 ggtcgcggtt ctggtgcggg cggttgtttg ggctcctgtt gaaacctatc ctcccaccag 3733321 ggtgttttca gctcgcgccg ccgctgctgc atcggctggg ccgggacgtc ggcgatgcca 3733381 ctggaccccg gggtacggcg cgggagcaac gtgaccggtg gtagcggccc gatggcggcg 3733441 ggaacgtccg tcggatcggc cgccgcgggt tcaggacacg gcggcttgat cgcaaatacc 3733501 cgcggctttg gcggctgcgc tggggccgtc ccctcgagca cggctagcgg caggtagacc 3733561 tcggcggtgg tgccggtgcc ctgttcaccg gtcaccggac cgcgcagccc gactcggatg 3733621 ccgtgccgac cggccagccg gccgactacg aacagaccca tgtgccgggc actatccggg 3733681 gtgacctcac cgccggcccg cagccgcata ttggccatcc gccgatcggc atcggtcatg 3733741 cccaggccgg aatccgagat tcgcagcaga acactgcctt cgctgccgat tgcggcggca 3733801 acccgaacgg gtgtggtcgg tgacgagtag cgcaacgcgt tgtcgatcag ctcggcaagc 3733861 agatgaatga cgccaccagc cgctgcgccg actaccgcac agtcgggtac cctcgcgatg 3733921 tcgacgcggc gatagtcctc gacctctgac acggcggcgc tgatcacggt tgacagcggc 3733981 accggctcgc ggtggtcacg ggtaatctgc gcaccggcca gcaccagcag gttggcgctg 3734041 ttgcggcgca gccgggcggc caggtgatcg agccggaaaa ggctgtcgag tcgggcggga 3734101 tcctcctcgt tgcgctccag ttggtcgatg accgacagct gctggtcgac cagggaacgg 3734161 ctacgccgcg acatggtctc aaacatctcg ttgaccagca gtctcaaccg cgtttcctcg 3734221 ccggccagca acagggcccg ggtgtgcagc tcgtcgaccg catgcgcgac ctgaccgatt 3734281 tcctcggtgg tgtacaccgc cagtggctcg gggatcggct cgtcgccggc gcggaccgcc 3734341 gcgatctcgc cgtcgagatc ggtatgagca accttgagcg ccccatcacg cagtacccgc 3734401 atcggcccga ccagcgtgcg cgccaccacc aacacgacga cgatcgcggt cgcgatggcg 3734461 gccaacacca gcacggcgtc gcgaatcgcg gcatcccgcc ggtcggtggc ctggctttgc 3734521 accgacttcg tcaccgcctc ggtggtgtcg gtgatcacct gctcggcaat gtcgcgggtg 3734581 atctgtatcg agtgcagcag ctctgggttg ttgaccagtg caacggccgg atcggacatg 3734641 atcgccatcc tggtcaccat ttgctgctgc aggttcttgg tgtccggcga gcctgcaccg 3734701 agcgccgcgc tcatcccgaa cagcgtcgag ggttcggtgc cggccagggt aaccatcgcg 3734761 ctgcgcagtt gcggctcggc aaggtcggcg ccgcgagtca ccaggatctc ctgcatcgtc 3734821 atctgcccgc gggcgccaac ggctcggctc aaaccctgca cctgggttcg gatttgctcg 3734881 ctgtcaaccc gcaccgacgc gtcaatcacg ttctgggccg tcaacagcag cggcgcgtag 3734941 gcggtgaccc gatcccgcaa gccgatgctg tcggccagca ccttatccag cagcgcctga 3735001 ccgccgttga gcagcgtgtt cactcccgac cgcacgtctg cgatgacgtc ggtgtcggcc 3735061 agtcgcgtct gcagctcgta cttgcgggcg gtgaagtttt tctgcgcccc ctccacatcg 3735121 tgtccggtcg agctggccag cacggcgacg tccagcgccg acatgtattt cgtgatcgcg 3735181 ggtatcattt cggcgcgcgc ggcgaccagc cgcaggccgc tggtgctggc catcgcagcc 3735241 tcgacccgca atcctgctaa caccatcgcc actaccagcg gcagaagcgc gatcgtgaac 3735301 actttccatc ggaccggcca gttgcgcggc gaccaggacg gcgggcgttg ctgaggtttg 3735361 ccgcgggccg gttgagccgg ggcggaaata tcagaagcgg ccgccgcgac cgggatggtc 3735421 gggcgggcga acatggtcac gtggccgcgg ccgtgccacc ggccgcaccc ttatgcagcg 3735481 ctcgaaaaac ggagagactc atagacttcc tgctcatgcc ttgatgccgt ccgccccagc 3735541 cggccgggcg cggacgtaaa caactggcaa tccgacgagt atgacagccc acggccgagg 3735601 tctccaccgc tgtcaccgag catgtcaccg gacaggccgg caaacgggca ccgggcgctt 3735661 tgccatgatc ggcggatgtt ccggctgctg ttcgtatctc cgcgtatcgc ccccaacacc 3735721 ggcaacgcca tccggacgtg cgccgcaacc ggctgtgaac tgcatctggt cgagccgctc 3735781 ggcttcgacc tgtccgaacc caagctgcga cgggccgggc tggactacca cgacctggcc 3735841 tcggtcaccg ttcatgcctc gctcgcgcac gcctgggagg cgctgtcgcc agcgcgggtg 3735901 ttcgccttca cggcgcaggc gacgacgttg ttcaccaacg tcggctaccg ggccggtgac 3735961 gtgttgatgt tcgggcccga acccaccggc ctggacgagg ccaccctggc tgatacgcac 3736021 atcaccgggc aggtgcgcat tccgatgctg gcgggccggc gctcgttgaa cctgtccaac 3736081 gccgcagccg tcgcggtcta cgaggcctgg cgtcagcacg gctttgccgg ggcggtctag 3736141 tcgcgaccaa ggtgacaccg aaccagccgg tatgcgcaca acgaagctca tcggcgtcgg 3736201 gcgccggaca ggagcaccca accggtgaca gcacaccgaa cgcaacccgg gcgatcacat 3736261 cggaccacga catcccggga aaatcgatgc cggtgagctt gcgcgtccag ctaccaccac 3736321 cgtcagcggt gacaccttca ccggcaacaa cggcagcgca ggcgcagctg tcagcggcgg 3736381 cgcgcagcga aggcgttgcg gtcaatgaat ctgccgcaaa ccccacgccc gttggcccat 3736441 attgcgctag catccgggtg ttgtgatctc gcaggttgcg tgctggcagc ctgggggtgg 3736501 gttgtgatgt cgtttgtcgt agcagtcccg gaggcattgg cggcggccgc gtcggatgtg 3736561 gcgaacatcg gttctgcgct aagtgccgcg aatgcagcgg cagccgccgg cacaacgggg 3736621 ctactggcag ccggtgccga cgaggtctcg gccgccctgg cgtcgctgtt ttccgggcac 3736681 gctgtgagct accaacaggt cgcggcccag gcgacggcgt tacacgatca gtttgtccag 3736741 gccttgaccg gtgccggcgg atcgtacgcc ctcaccgagg ccgccaacgt ccagcagaat 3736801 ctgctgaacg caattaacgc gcccactcag gcgctgttgg ggcgcccgtt aattggcgac 3736861 ggggctgtcg gcaccgccag cagccccgac gggcaagatg gcggtctgct gttcggcaac 3736921 gggggcgccg gctacaacag cgccgccacg cccggaatgg ccggcggcaa cggcggcaac 3736981 gccggattga tcggcaacgg cggtactggc gggtcgggcg gtgccggcgc ggccggtggc 3737041 gccggcggca gcggcggctg gttgtacggc aacggcggaa acggcggcat cggcgggaat 3737101 gcgatcgtcg cgggcggtgc cggcggcaat gggggcgctg gcggcgccgc cggattgtgg 3737161 ggcagtggcg gcagcggcgg ccaaggcggc aacggtctga ccggcaacga cggcgtgaat 3737221 ccggcccccg tcacaaaccc cgcgctaaat ggcgccgccg gcgacagcaa tatcgagccg 3737281 caaaccagcg tcctgatcgg cacccaaggc ggtgacggca cgcccggggg tgctggcgtc 3737341 aacggcggca acggtggcgc gggcggagac gccaatggca accccgcaaa cacctcgatc 3737401 gccaacgcag gcgccggcgg gaacggcgcc gccggcggtg acggcggtgc caatggcggt 3737461 gcgggcggcg ccggcgggca ggccgcgtcc gccggtagtt ccgtcggcgg tgacggcggc 3737521 aacggcggtg ccggcggtac gggcacgaac gggcacgccg gcggtgcggg cggcgccggc 3737581 gggcaggccg cgtccgccgg tagttccgtc ggcggtgacg gcggcaacgg cggtgccggc 3737641 ggtacgggca cgaacgggca cgccggcggt gcgggcggcg ccggcggtgc cggtggtcgc 3737701 ggcgggtggc tggtcggcag cggtggcaac ggtggcaacg gtggcaacgg tgccgccggc 3737761 ggcaacggcg ccatcggcgg taccggtggt gccggcggcg tccccgccaa ccagggcggt 3737821 aacagcgccc taggcaccca gccggtcagc ggcgacggcg gcgacggcgg caacgggggc 3737881 accggaggca ccggcgggcg tggcggcgac ggcggatccg gcggcgcggg cggcgcgagc 3737941 ggttggttga tgggcaacgg cggcaacggc ggcaacggcg gcaccggcgg ctcaggcggt 3738001 gtcggcggca atggcggcat cggcggtgac ggcgccggcg gcggaaacgc cacgagcacg 3738061 tcgagcatcc ccttcgacgc ccacgggggt aacggcggcg ctggtggcga cgctggtcac 3738121 ggcggaacgg gcggcgacgg cggtgacggg gggcatgccg gcaccggtgg acgtggcggg 3738181 ttactggccg gccagcacgc caactccggc aatggcggtg gcggcggtac cggcggtgcc 3738241 gggggcaccc atggcacccc cggcagcggc aacgcaggcg gcaccggcac cggtaacgct 3738301 gacagcacaa acggcgggcc aggcagcgac ggcctcggcg gggacgcgtt taacggcagt 3738361 cgcggcaccg acggcaaccc cggctaatta ccagccgttc cagtgcgtca cgctctcggc 3738421 cggcagccgc ttggccggcc ggaagtcgat gccttgtgtg taggcgatcg gaagcagccc 3738481 gccttggctg tattcgtcgt agggaatgcc gagcacgtcg gccaccttgt gctcgccgtt 3738541 gtcgagcagg tgcagcgtcg tccagcacga acccagcccg cgggagcgca gcgccaggca 3738601 gaagctccac accgccggga acagtgaggc ccaaaacgac acgccaccca ccgccgactc 3738661 gtcttcccgg cctttcaggc aggggatcag cagcaccggc gcccggtgca tgtgttcggc 3738721 gagataggtc gccgaatcgc ggacccgccc catccgctcg ccgcgggtgt cgccgtcggg 3738781 gtactcgggc gccggcccgc tgaggtagcc ccgggcgttg gccaggtaga cgtcggcgat 3738841 cgcctttttc ttggcggcgt cctcgacgaa cacccactgc cagccttggg aattggaacc 3738901 ggtgggcgcc tgcagcgcca gctcgaggca ttccatcagc acgtcgcgtg gcaccggctt 3738961 gtcgaaatcg agacgcttgc gcaccgagcg ggtagtggtc aggacctcgt cgacggacag 3739021 gttgagggtc atgtgggcag gctaccgttg ggccatgagc gtcgaactga cacaagaggt 3739081 ttctgccagg ctcacgtccg acctttacgg gtggttgacc accgtcgccc gatcggggca 3739141 gccggttccg cggctggtgt ggttctactt cgacgggacc gacctgacgg tgtactccat 3739201 gcctcaggcg gccaaggtcg cccacatcac cgcccatccg caggtcagcc tgaacctgga 3739261 ctccgacggc aacggcgccg ggatcatcgt ggtgggcggg acggcggcgg tggtggccac 3739321 cgatgtcgac tgccgcgacg acgcgccgta ttgggccaag taccgcgagg atgccgcgaa 3739381 gttcgggctg accgaggcga tcgccgccta cagcacccgg ctgaagatca ccccgacccg 3739441 ggtgtggacg acgcccacgg gctgagcggg ctggcccccg ctcgccgcca gagtgaaatc 3739501 cacgacgcgt ttgcggcgtg tcgcgtcgcc cgtttcactg tcggcgcaga ggttcaccgg 3739561 aagtcgcgcg agcgcgcgcc gaccgccagg gtgaggcggc ccatccgttc ggcgacgacg 3739621 gtgattgcgc cgctggcgtt ttggacctgg ccgcggatca gcagcgccgg cgccgtgtgc 3739681 gcgagcttgc ggtgtcgcgc ccacaccccg ggcgtgcaga gcacgttgac catcccggtc 3739741 tcgtcttcga ggttgatgaa cgtcaccccc tgggccgtgg cgggtcgctg ccgatgagtc 3739801 accgcgccgg cgatcagcac gcggtcgccg tcggacaccg atcccagcct ctcggcgggc 3739861 agcaccccca tcgcgtccag gtccgcccgc aggaactggg tcggatagct gtccggggag 3739921 acgccggtgg cccacacgtc ggcggcggcc agctccagct cgctcatccc cggcagcgcc 3739981 gggatgtgcg acgacgagcc caccccgggt aaccggtccg gccggcccgt ggccgcggcc 3740041 ccggccgccc acagcgcctc ccgccgagac atgccgaagc agcccagcgc cccggccgtc 3740101 gccagcgctt cgacctgcgg cacggaaagc tgcacccgcg acgtcaagtc cggcagggag 3740161 gtgaacgggc cgttggctgt tcgctccgcg accagcttct cggccagctc ggcgccgagg 3740221 tagcggacgg cgcccaagcc caaacgcacc tccgttccgg cgttctcaca cgtggcgtgc 3740281 gccaggctgg cattgacaca cgggccgtgc accgccacgc cgtgccggcg ggcgtcggcc 3740341 accagcgact gcggcgaata gaaacccatc ggctgggcgc gcagcagcgc cgcacagaac 3740401 gccgccgggt ggtgcagctt gaaccacgcc gagtagaaca ccagcgacgc gaaactcagt 3740461 gcgtggctct cggggaagcc gaaattggca aacgcctcca gcttttcgta gatccggtcg 3740521 atcacctcgt cgggggcgcc gtgcagcgcg cgcatgccgt cgtagaaccg gccgcgcagc 3740581 cggcgcatgc gttcggtgga gcgtttagac cccatggcgc ggcgcagctg gtcggcctcg 3740641 gcggcggaaa agccggcgca gtcgaccgcc aactgcatca gctgctcctg aaacagcggc 3740701 actcccagcg tctttcgcaa tgccggcgcc atcgacgggt gctcgtagat gaccgggtcg 3740761 acgccgttgc gccgccggat gtaggggtgc accgatccgc cctggatggg cccggggcgg 3740821 atcagcgcca cctccaccac caggtcgtag aacactctcg gcttaaggcg cggcagggtg 3740881 gccatctgcg cacgtgactc cacctggaac acgccgacgg aatcggcgcg ggccagcatc 3740941 tcatacaccg ccggctcgga gaggtcgagg cgggccaggt ccacctcgat gcccttgtgc 3741001 tcggccacca ggtctttcgc atagtgcagc gccgagagca tgcccagccc gagtaggtcg 3741061 aatttcacca agccgattgc cgcgcagtcg tctttgtccc attgcaggac gctgcggttg 3741121 gccatgcgcg cccattccac cgggcacacg tcggcgatcg ggcggtcgca gatgaccatg 3741181 ccgccggagt ggatgcctag gtgccgcggc aggttgcgga tctgggtggc caggtcgatc 3741241 acctgctcgg ggatgccgtc aacgtcgtcg gcctgcccgg tccagtggct gacctgcttg 3741301 ctccacgcgt cctgctggcc cggcgagaag cccagggcgc gggccatgtc acgcaccgcg 3741361 ctgcgccccc ggtaggtgat gacgttggcg acctgggcgg cgtagtcgcg gccgtatttg 3741421 tggtagacgt actggatgac cttttcgcgc tgatccgact cgatgtcgat gtcgatgtcg 3741481 ggtggcccgt cgcgggcggg cgataagaag cgctcgaaca acagctcgtt ggccaccggg 3741541 tcgacggcgg tgacgcccag ggcatagcag accgcggagt tggccgccga tcccctgccc 3741601 tgacacagga tgtcgttgtc ccggcaaaac cgggtgatgt cgtgcaccac caggaagtag 3741661 cccggaaatc tcagttgggc aatgactttc agctcatgct cgatctggga gtacgcccgg 3741721 ggcgcgctct tgggcggccc gtaacgctcg cgggcgcccg ccatgaccaa cgaccgcagc 3741781 cagctgtcct cggtgtgccc gtcgggaaca tcgaacggcg gcagccgcgg cgcgatgagc 3741841 tgtaggccaa aggcgcaccg ctcgccgagc tcggcggccg cggtcaccgc ctcggggcac 3741901 cacgcgaaca accgggccat ctcctccccg gaccgcaggt gcgccccacc cagcggagcc 3741961 agccacccgg ccgcggagtc cagcgaccgc cgggcccgga tggccgccat cgccatcgcc 3742021 agccgcccac gtgacggatc cgcgaagtgc gccccggtgg tggcgacgat gccgacaccg 3742081 aagcgcggcg ccagtccggc cagcgcggcg ttgcgttcgt cgtcgagcgg gtgaccatga 3742141 tgggtcagct cgatgctgac ccggctgggg gtgaaccggt ccaccagatc ggccagcgcc 3742201 cgctgcgccg cggccgggcc accctgggaa agcgcttggc gcacatggcc tttgcggcag 3742261 ccagtcagga tgtgccagtg cccgccggcg gcctcggtta gcgcgtcgaa gtcgtagcgc 3742321 ggcttaccct tttcgccgcc ggccagatgc gccgccgcca gttgccgcga caaccgccgg 3742381 tagccttccg ggccgcgggc caacaccagc aggtgcgggc cgggcggatc cggccgctcg 3742441 gtgcgagccg tggcgcccag tgacagctcg gcgccgaaga ccgtgcgcac gtcgagttcc 3742501 gcggccgctt cggcgaaccg caccgccccg tacaggccgt cgtggtcggt cagcgccagg 3742561 gcacacaggc ccagccgggc ggcctcctcg accaactcct cgggcgtgct ggccccgtcg 3742621 aggaagctgt acgccgaatg cgcatgcagc tcggcatacg cgacggacga tccgacccgt 3742681 tcccggcccg gcggctggta cgccccgcgc ttgcgggacc gtgggacgtc cccatccgcg 3742741 tcgaacgccg gcaccccggc atggcgcggc ttgccgttaa gcacccgttc catttccgcc 3742801 cagctcggcg gcccgttgct ccaccccaca ttccacagta tatcgaacaa ttgttcgata 3742861 cagcgcagtt gttcagcaca tcttcacctg cgaaacatgt tcttaaccgt ttgggccttc 3742921 tgcttccggt gcggtccggc ggacacttat acctggggtc gcaaaacgac ggtggggact 3742981 tgtcatggca caactgacgg cactggatgc gggttttctc aagtcccgcg atccggagcg 3743041 gcacccgggc ctggcgatcg gcgcagttgc cgtcgtcaac ggtgccgccc ccagctacga 3743101 ccagctcaaa acggttctca cagaacggat taagtcgata cctcgatgta cccaggtgtt 3743161 ggcgaccgag tggatcgact atccgggatt cgacctcacc cagcacgtgc gacgggtggc 3743221 gcttccccgg cccggcgacg aagccgagct gttccgggcc atcgcgctgg cactggagcg 3743281 tcccctcgac ccggaccgcc cgctgtggga atgctggatc atcgaaggcc tcaacggcaa 3743341 ccgctgggcg atcttgataa aaatccacca ttgcatggcc ggcgccatgt cggcggccca 3743401 cctgctggcc aggctctgcg acgatgccga cggcagtgcc ttcgctaaca atgttgatat 3743461 caaacagatt ccgccgtatg gcgatgcgcg gagctgggcc gaaacgctgt ggcgaatgtc 3743521 cgtcagcatc gctggcgccg tctgcacggc cgcggcacgc gccgtcagct ggccggcagt 3743581 gacgtcaccg gccggcccgg tcaccaccag gcggcggtac caagcggtgc gcgttccccg 3743641 cgacgccgtc gacgccgtgt gccacaagtt cggggtgacc gccaacgacg tcgcgctcgc 3743701 ggccatcacc gagggcttcc gaacggttct gctgcaccgc ggccagcaac cgcgcgccga 3743761 ctcactgcgt accctggaga aaaccgatgg cagctcggcc atgctgccct atctccccgt 3743821 cgagtacgac gacccggtgc ggcgattgcg caccgtgcac aaccggtcac agcagagcgg 3743881 ccgtcgtcaa cccgacagtc tgtcggacta tacgcctctc atgttgtgcg ccaagatgat 3743941 tcacgcgcta gctcggttac cgcaacaagg catcgtcacc ctggcgacca gtgcacccgg 3744001 gccacgccac cagttacggc tgatgggcca gaagatggac caggtgctgc ccatcccgcc 3744061 caccgcactg cagctgagca ccggggtcgc ggtcctcagc tacggcgatg agctggtgtt 3744121 cggcatcacc gctgactatg acgccgcgtc cgaaatgcag cagctggtca acggtatcga 3744181 actgggtgtg gcgcgtctgg tggcgctcag cgacgattcc gtgctgctgt ttaccaagga 3744241 tcggcgtaag cgttcatccc gcgcactccc cagcgccgcg cggcgggggc ggccctctgt 3744301 gccgaccgcc cgagcgcgtc actgacgcca tctccgtcgg cgttgacccc cgtgagaggg 3744361 tgggtcgtgc gcaagttggg cccggtcacc atcgatccgc gccgccatga cgcggtgctg 3744421 ttcgacacca cgttggacgc cacccaggaa ctggtccggc aactccagga agtcggtgtg 3744481 ggcaccggcg tcttcggtag tggcctagac gttccgatcg tagcggccgg ccgtctggcg 3744541 gtgcggccgg gccggtgcgt ggtcgtctcg gcccactcgg cgggcgtcac ggccgcacgc 3744601 gaaagcggat ttgcgctgat catcggtgtc gaccgcaccg ggtgtcggga cgcattgcgt 3744661 cgcgacggcg ccgacacggt ggtcaccgac ctaagcgagg tcagcgtgcg caccggggac 3744721 cgacgcatgt cgcagctgcc cgacgcgtta caggcactcg gcctggccga cggcctggtc 3744781 gcccggcagc ccgcggtgtt cttcgacttc gacggcacgc tgtccgacat tgtcgaggat 3744841 cccgacgcgg cctggctcgc ccccggtgcc ttggaggcac tgcagaagtt ggccgcgcgc 3744901 tgtccgatcg cggtgctcag tggccgcgac ctggccgacg tgacacagcg ggtgggtctg 3744961 cccggcatct ggtatgccgg cagccatggt ttcgaattga ccgcacccga cggaacgcac 3745021 caccagaacg acgccgcggc ggcagccata ccggtgctga aacaggcggc tgccgagctg 3745081 cgccagcaac ttggaccctt cccgggtgtt gtggtggagc acaagcggtt tggcgtcgcc 3745141 gtgcactacc gcaacgcggc ccgggaccgg gtcggcgaag tcgccgcggc ggtgcgcacg 3745201 gccgagcagc gtcatgcgct gcgggtgacg acgggccgcg aagtcatcga gttgcgtccc 3745261 gatgtcgact gggacaaggg gaaaacgctg ctgtgggttc ttgaccatct gccgcattcg 3745321 ggctcggctc ccctggtgcc gatctacctc ggcgacgaca tcaccgacga ggacgctttc 3745381 gatgtggtcg gcccccatgg tgttccaatt gtggtgcgcc acaccgacga cggtgaccgc 3745441 gccaccgccg cactgtttgc gctggacagt cccgcacggg tcgcggagtt caccgatcgg 3745501 ctggcgcgtc agctccgtga ggctcccctg cgggcaacgt gagacgcggt gccgccgcgg 3745561 gcgatacgct ccgaccgtca acgaggagga cggccatgtg gtttgcattg gtgaacccgg 3745621 agatgctggc cgcggcggcg acagacttgg gcggcatcag gtcagggatc agcgccgcct 3745681 atgcgcgtcc tctgcggtga cctggctggt agcttaggca cgtctttatc gacaccgggt 3745741 gctgccagag aactcgagac gcggcacagg tcggcaccat gaggcggcgt gcaatgacga 3745801 agatggacga ggctagcaat ccgtgcggcg gggacatcga agctgagatg tgccagttga 3745861 tgcgcgagca accacccgcc gaaggcgtcg tcgatcgtgt cgcgctgcaa cgccatcgaa 3745921 acgttgcgtt gatcacgctg agccatccgc aggcgcagaa cgcactcaac ctggcgagct 3745981 ggcgtcggct gaagcggctg ctggacgatc tcgccggcga atcggggctg cgggcggtgg 3746041 tgctgcgggg cgccggtgac aaggcgttcg ccgcgggtgc cgacatcaag gagtttccga 3746101 acacccgcat gagcgccgcg gacgccgcgg agtacaacga gagcctggcc gtctgcctga 3746161 gggcgttgac cacgatgccg atcccagtca tcgcggcggt ccgggggctc gccgtcggtg 3746221 gcggctgtga gctggcgacg gcctgcgatg tgtgcatcgc gaccgacgac gcgcgcttcg 3746281 gcatcccgct gggcaagctc ggcgtcacga cgggcttcac cgaggcggac accgtcgcgc 3746341 gcctcatcgg tccggcggcg ctgaagtatc tgttgttcag cggagaactg atcggcattg 3746401 aggaagccgc ccgctgggga ttggtgcaaa aggtcgtcgc accacaggat ttggcggccg 3746461 cgacggccaa actggtcggc caggtctgtc ggcaatccgc ggtgaccatg cgtgcggcga 3746521 aggtggtcgc caacatgcac ggccgagcgc tgaccggcgc cgacaccgat gcgctgatcc 3746581 ggttcggtgt cgaagcctac gagggggcgg acctacgcga aggggtggcg gccttcagcc 3746641 agggacgccc acccaaattt gatgattagc gccatgaccg atgctgacag tgcggtccct 3746701 ccccgactcg acgaggacgc gatctcgaaa ctcgagctga ccgaggtcgc cgacctgatc 3746761 cgcacccggc aactgacgtc ggcagaagtg accgagtcga cgctgcggcg tatcgaaagg 3746821 cttgaccccc agctgaagag ctacgccttc gtcatgccgg aaactgcgct agcggcggca 3746881 cgtgccgccg acgccgacat cgcgcgcggc cactacgagg gtgtcctgca cggcgtaccg 3746941 atcggcgtga aggatctctg ctacacggtc gacgccccga ccgcggccgg caccaccatc 3747001 tttcgtgact ttcgcccggc atacgacgcg acggttgtcg cgaggttgcg cgcggccggc 3747061 gcggtgatca tcggcaagct ggccatgacg gagggggcct atctcggcta tcaccccagt 3747121 ctgccgaccc cggtcaatcc ctgggacccg acagcgtggg cgggcgtgtc ctcgagcggc 3747181 tgcggcgtgg ccaccgcggc gggattgtgc ttcggctcga tcgggtcgga caccgggggg 3747241 tcgattcgct ttccgacgag catgtgcggc gtcaccggga tcaaaccgac gtggggccgg 3747301 gtcagccgtc acggcgtcgt cgaacttgcg gcaagctacg accacgtcgg gccgatcacc 3747361 cgtagcgctc acgatgcggc ggtattgctc agtgtcatag cgggatccga tatccacgat 3747421 ccctcgtgct cggcggagcc cgttccggac tatgccgccg acctcgcctt gacacggatt 3747481 ccgcgtgtcg gggtggactg gtcgcagacg acgtcgtttg acgaggacac cacggcgatg 3747541 ctggccgatg tcgtcaaaac gctcgacgac atcggatggc ccgtcatcga cgtcaagctg 3747601 cccgcgcttg cgccgatggt ggcagcgttc ggaaaaatgc gcgcggtcga aacggcgatc 3747661 gcgcatgccg acacctaccc ggcgcgcgcc gacgagtacg ggccgatcat gcgcgcaatg 3747721 atcgacgccg gacacaggct ggctgcggtg gaatatcaga cgctgaccga gcggcgtctg 3747781 gaattcacgc gatcgctgcg tcgcgtgttc cacgacgtgg acatcctgct gatgcccagc 3747841 gccggaattg cctcgcccac actggaaacc atgcgcgggc tcggacaaga cccggagctg 3747901 accgccagac tggcgatgcc gacagcaccg ttcaacgtca gcggtaatcc cgcgatatgc 3747961 ctaccggcgg gaacgacggc gcgcggaacg ccgctcggcg tccagttcat cggccgtgaa 3748021 ttcgacgagc acttgctcgt ccgagccggc cacgcatttc agcaagtcac cgggtatcat 3748081 cgccgacgcc cgccggtgtg aaaaaccctc ggccgcaaaa ggcttgcgaa tgtcgcaccg 3748141 aaggtcgcgg cgaatcgcct tactggtatg tttacgaaca caatctgtgg ccatcaaggg 3748201 aggacgcgtt gagcattagc gcggttgttt tcgaccgtga cggtgtgctc accagctttg 3748261 actggacacg tgccgaggag gatgtgcggc gaatcacggg cctaccattg gaggagatcg 3748321 aacgccgctg gggtgggtgg ctcaacggat tgactatcga cgacgcgttc gttgaaaccc 3748381 agccaattag cgagttcctc tcgagcctgg cgcgcgagct cgagctcggt tcgaaggcaa 3748441 gagacgagct agtgcgcctc gactacatgg cgttcgccca gggatatcca gacgcgcgtc 3748501 cagcccttga agaagcccgg cgccgtggcc tcaaggtcgg tgttctcaca aacaacagcc 3748561 tgttggtcag cgcccgcagc ctccttcagt gcgccgctct gcacgacctc gtcgacgtcg 3748621 tgctgagttc gcagatgatc ggagctgcca agcctgaccc gcgggcctat caagcgatcg 3748681 cggaagccct cggcgtctcg acaacgtcat gcctgttctt cgacgacatc gccgactggg 3748741 ttgagggcgc acggtgcgcg ggcatgcgcg cgtacctcgt ggaccgttcc ggacaaactc 3748801 gcgacggcgt cgttcgcgat ttgtccagcc ttggagcgat cctggacggc gcgggaccat 3748861 gaccgaacgt gacgagccgg acatcgccga cagggacgcc tcattggtta ctctcatcga 3748921 ccagccgcag tgcacttagg atggcagcct taactaccgt cgccgagcag taaagtgtct 3748981 tggcaatcca caacggcgcg tatggcggtt cgcagtgttg cgatagccac ccacccgcgc 3749041 gactgatctg cgccgacaag gatgtgccgc tgtgcctctg ccaatgcgcc agagcttgaa 3749101 tgcaatatgc tgtctcttcc gcagtcgctt ggccgtcgaa aaatccccac gagccatcgg 3749161 gcctctgcgt attaagaatc caccaaccgc atctgagcat agggcatcat catagttact 3749221 ggcagcacat atcagatgcg cagtcgtata atatgccgat cggtgccact tatcccgcca 3749281 gcagaaccgt ccaggctcct tgcttgatcg gatgaattcc agaacctttc gtactcgtgg 3749341 atgacatttg tcgtagcccg cctgcttcaa cgcaccgagc acgtggacgt tcgtcgatat 3749401 cgaggggccg acttcgtgaa agtaggtacg gaaccaatcg gcgtcttcga attgtaatac 3749461 ggctccgata tccggcgacc gtccaaactt cgacaaaaca tcgtaggcca cacttgtggt 3749521 gtcacaatct tccaaggtgg aatttcctgt ccaccccaca cctcgaccac ggacccaatg 3749581 ttgttcgaca tggtcaagat agggtaggta cgtacgaacg atctcaggat cggacaaatc 3749641 aatatccgta cgcgagagat tccatagaga ccaaacaatt tcaaaaatct cggcttgata 3749701 gaaggccggc gcaccgccat cgccggcttg aattatcgat gagatgtacg ccaaggcccg 3749761 cttgtctcct ggtttaacat gtaacgcgaa gtaggctgac gctgatggcg aatacttgac 3749821 cgatccattt gtctcctgca agttatcgac atccaacata ccgacaccgt cttggccggc 3749881 cagttctacg gagaaagctg cggtgatatg tttattgatt ttgcttccgc cgagttttct 3749941 caacttctgc tcacgcactc cgacaagctc gccgaggatg gattcctcgt ggcaaatggc 3750001 aaggccaagt cgcgccgcct cagccatcag cgtaggtgcg attaactcaa acccgacggt 3750061 tgcgtctttt atatcaagtt gagggccttc gaaagcaccc gaggtaaggt tcttcagggc 3750121 tagcaagcct ttttcaactt gcgctgcgcg cctccgacga tgcttattcg acgtgaggct 3750181 gatcatggcc gccaaagtgg agagcagtcg atcttcgtag cagaaaggga actcggctcc 3750241 ccatgagccg tcaggaagct ggcgctcgca aagccagttg agggcgaggt cgcttagctc 3750301 atcatcgagc tggcccagct tcgcgaccca cgcggtgtca taggctgtgc tcgagatgcc 3750361 gttgcctagt gccgctttcg ctagcagagt cctgaaagtc tccataccat cagccctccg 3750421 cgaaccagat tccatcatga acacaaccca caccgaaaac tctgtcaggc tgggctcgat 3750481 atctgttgcg cagtacattg agctgatcgg ccgacatcgc tgaatagtca ggtttgggcc 3750541 ggaaatgacg gagatagata tgatcgtaca agatcctccg gagcgttgtt tcggtcatat 3750601 agtatgaggg agcaacggtg aagtatagac tcgtttttcc tgagctgagc agcggaaaat 3750661 cgaaagtgct gaaacgccca aatccgatga acatgtcagc cttgtcaaca tactcgccgt 3750721 aataaccttc gataatctct ctccgcgtgg gaggtttacc atgtgtttcg ttccacgaga 3750781 tagaaaactg cgcaacagac tcagccgcat cgttgccaaa aacaccgaag catagtctgt 3750841 gttcagtgtt ggatgacgta gagattgtta ggtcatcgaa tgacttgaca acggctgccc 3750901 cttgcgcggt cgaaggcaat cttttcttat aatcaccata aaaaagaacg tggacctcgt 3750961 gttctttata gaacgacaga atttcctcgt cgttggcaag caaggccatc ccttcgagcg 3751021 cctgtacgat atatcgatca ccacgatcca gaagatcgtc gctaaagatt ggcgagatta 3751081 ctgtttcgat gccgtgctcg aagagcatct tcagaatacg aattgattga cgcaaggcgg 3751141 cctgctgata atcgtcgtac tgcggattac attcgaggtg aaaccagcgg cgtgtgccat 3751201 cgaagggaaa gacggatacc ttcggtccac ggcaacgtac aatctctgct acggatacta 3751261 gaggaagatc caagaattct ttttcgctaa ccaagttcat gcttcctctt aataactatc 3751321 gccggaatca ggatggtctt cgggtccagg gacttcatgt agtgcgttaa gtagtgattt 3751381 gcatcttatg cggattgcgg ggccggtgag tccgtggctg gaaaggatgt ggtcgcggct 3751441 ggcgtgggga atgtaggccg gtggcagtcc cagtgtgtag gtgcgcgtcc gcgggtgtgt 3751501 ccgcccgatg tggtggctaa ggtgcgcgcc gattcccacg tcggcaatcg catcttcgac 3751561 acacacggtg atccgatggc ggccagccag ctcggtcagt gccgggctga ttggccagac 3751621 ccattgtgga tcaacgactg tcaccccgat ctgctcctcg ctgaggcacc gggcggcgtc 3751681 catgcatggt cgactcatgg cacccactgc gaccaagagc acgtcgggtc gccaatgcgg 3751741 tggtggtgta tgcaagacgt cgaggccacc gatggtgtgt tcggccgtga tcggttcgcc 3751801 cggcgcccct ttggggaaac gcacggcggt gggagccgcg gtcgcgatcg cggtacgcaa 3751861 ctgttgtcgt agccgaggcg cgtcgcgcgg acaggcgatc tgaaacccgg gcacgcaggc 3751921 cagcagcgcc agatcccaca aaccgtgatg gctgggtccg tcgggcccgg ttaccccagc 3751981 ccggtccagc accagcgtca cgggtaaccg gtgcagcccg atgtcgaaca gaagttggtc 3752041 aaaggcgcgg tgcagaaacg tcgagtacac cgcgacaacg ggatgggttc ccgcggcagc 3752101 tagcccggcc gcgctggcca acaggtgttg ttcggcgatg cccgaatcga acacccgatg 3752161 cgggtatcgc ctcgacagcg cgcctagacc agtgggcaga cgcatcgccg cggtcagccc 3752221 gacgacgtcg gatcggtcgt cagcaatgcg cgcgatttcg tcctcgaaca cgtcggtcca 3752281 gctccgctga ctgggtgtgc tagcgaggcc ggtggcaatg tcgaccaccc cgtaggcgtg 3752341 catatggtcc ctctcgtcag cttcggctgg aggataaccc cggcccttac tagtcactgc 3752401 gtgaacaaca acgggcctag ctgccgcggc cgcttttcgt agaaccgcgc acgtgtcggg 3752461 gatgttgtgc ccatcgaccg gaccgatgta ggtaaatccc atgttctcaa agaggttcgg 3752521 ccctcggggt gtgccgacgc gaagttcttc taggtgtgcc gcaagagccc cagcggtggg 3752581 gtcgtaggag cggccattgt cattgagcac gacgatcacg ggccgggtag cggcaccgag 3752641 gttgttcagg ccctcccatg ccacgccccc ggtgagggcg ccatcaccga tcaccgcgat 3752701 gacacgtcgg tcgcattgcc cctgcagggc caatgctttg gcgatgccgt ccacccaggc 3752761 gaggctgacc gaggcatggg agttctcgac ccagtcatgt ggcgattcat ggcggttggg 3752821 ataccccgat agaccatcgg cctggcgcag cgtggcgaag tctttaccgc ggccggtgag 3752881 cagcttgtgc ggataggttt ggtgcccggt gtcgaacacc acgatgtcgt gtggcgaggt 3752941 gaacacccga tgcaatgcga tggtcagctc taccatgcca agtcccgcgc cgagatggcc 3753001 accggtagcc gtcactgttt ctatgagccg ccgacgcatc tgcacggcca gctctggcag 3753061 ctggctttcg ggcaatgcct gcacatcgca aggtccgccg atcgcggtaa tcaagtcccc 3753121 gcgtccgttg cgaatcgtgg ttgtcattgc gcgcgaacct gtttgggaag gccgaatcgc 3753181 accgtctcgg tcgctatcga gcgttccacc acggtgatcg aggcgtatcc gcgaagtgca 3753241 tcaatcacct gccccaccag tcgtggcggc gcggaggctc ccgcggtgac accgatcgtc 3753301 gagaccgacg acagccattc gggctcaatg tcatcaggcc cgtcaatcaa gtaggccggc 3753361 gtcccacttc gctgcgccaa ctcgaccaga cgccgcgaat tcgacgaatt gcacgagcca 3753421 atcaccaaca caacgtcaca ttcaccgacc atcgattgca gcgcacgctg tctgttcgtg 3753481 gtggcatagc agatgtcttc agaggggggt tggcccaacg tcggaaacct cgcgcgcagc 3753541 gcatcaatga catcggcagt ttcatcaagt gccagggttg tctgggtcag atacgatagc 3753601 tgggtaccct cgggcaggtt caacgctgcc acatcagcgg gtgtctgcac caataatgtt 3753661 gaccgcggag cgacgccaag cgtgccttcg gtctcctcat gtccggcgtg cccgatgaag 3753721 accaccgtgt caccgcgcgc ggcaaaccgt gcggcttcag cgtggacttt cgccaccagt 3753781 gggcaggtcg cgtcgacgac ctgcagtccc cgctcatcag cgcccgcgcg caccgccggg 3753841 gaaaccccat gcgcggagaa caccacgacc gcccccggcg gcggcggatc gggaatctcg 3753901 tcgagatcct cgacgaacac tgctccccgg tcccgcaact cggcaaccac aacagtgttg 3753961 tgcacgattt gcttgcgcac atacaccggg ccttcggcca cgtcaagcac tcgcttgacc 3754021 gtctcgatag cacgctctac accggcgcaa aacgaccgcg gcgacgccaa cagcaccgtg 3754081 acttcacccg aagcgtatcc ctgtgcgacc ggtcccacga acacctcagc catcagcact 3754141 cccggcgaca tatcagttgc gacaacgcga tcaggtctgg ggatcgcacc gcatcgggca 3754201 gtgccgcaat agcagcctgg atgcgttcat cggcgcatcg ctgcgccaca tgaccacccc 3754261 cggccacctt gacaagcgcg gtagcccgct cgacatcgct tgctgtcatt gcggcaggtg 3754321 cttgatagag ggccgccaat tcggtcgccg cttcggatcg cgagttcagg gcggcaacaa 3754381 ctggcagtgt cgccttacgt cgggcaaggt cgttgccgac cggctttccc gtcacaccag 3754441 ggtcacccca gatgccgatc agatcgtcga cgcattgaaa cgcaagaccc aactcatggc 3754501 caaaacgctc caacgcagca atcgtcgcgt cgtctgcatt ggccactaaa gctcccagag 3754561 cgcaacaaca accggtcagg gcggccgtct tgcccgcggc catccgcaga tagtcatcga 3754621 ctgtaacttc gggctgtccc tccaataaac aatcctcaaa ctggccgata cacaagtcca 3754681 ggcacgacat ctgcaatcgc cttatcgccc tgaccgccac acactcgtcg gtcaggccgg 3754741 tcagtatccg aacggccgtg gcgtgcaacg catctcccaa caggatcgcg ccgcccacac 3754801 cccacacact ccataccgtc ggccgtcccc tgcgagtcgc atccccatcc atcacatcgt 3754861 catgcaacaa cgtgaagttg tgcaccaact ccacagccgc cgacaccgga gtagcatcac 3754921 cgacatcacc accgcaagcc gcggccgccg cgtagacaag ggcggcgcga aaatacttgc 3754981 ccgacgatcc tgccgctgtg gatcgatcgg cgttccacca gccaaggtga tatcccgcca 3755041 tcgtcgccaa cggctcgcgc atcgactcaa tggcccgatg cagcacaggg ccacaatccg 3755101 ctcgagcccg ttctaacaat gctttcccaa ggtcagcagg gacactcccc agaaaagccg 3755161 catccagagt caatacgcct cccattctta acctcaccgg agcaacagtg agtcgctatt 3755221 ttcagcgaac gagcaatcgg cgatattgct tcacttcgga gatacccaaa tatttcaaat 3755281 atcaacgcaa catgtaccta tgcccgtcga ccaacacgac catcagggtt gttagcaatg 3755341 atctcggaat tcgagttgtc cagacgcccc gggtcatcca ctacagaaag acacgcatac 3755401 cctgcggcga cctatacttc ccatcacggc gggtaggttg ccttcgacaa tactgcaaca 3755461 ttcaattgcc tggcctttct cggagtatct tgcggacttg aagctcacac atcggccggc 3755521 gtcgaacgcc tcacgctgca gagcagttta gtggatttca tcagcatcgg atatgcataa 3755581 ttgaaaccac agcactttca taaacagtgt ccagatgatt tacacctaat ttgggcggcg 3755641 aatgctacgc aatggtggtg cgcttcccaa gggagcacaa cgcgaagcta aagcagttgc 3755701 acgccgagac cgagccgaaa ggtcgccctg cggggaaggc ggccacggga gaattgtgag 3755761 ctcggcggtc gaccacgacg tacccgccac gccgtagtaa tgggcatttg tacatgtaca 3755821 ttcgcacaca aggagaggtc ttgacgtatc tattccctct ctgcgcgatc gcggcggagg 3755881 cggcggcaac cagcctgttc aagggcagtt tcggggactt tcgcgtctgc tcgccgggtc 3755941 acgacggggc gatcacggcc atgccgagcg tcttggcggc gtcgcgcatc cggtcgtcgt 3756001 aggtgcacaa ccggcccaga tcgacgccga gccgctgcgc cgtcgccaag tggatggcat 3756061 cgagcgtgcg cagctcgaat ggcagcagcc caccagcgag atcgaggacg cgcttgtcga 3756121 cgcgcagcag atcgagatga gccagcgccc ggcggccggc tttccgcgct gattcaccct 3756181 tgtcaagcag ggcccgcatg acctccgcgc gcgcaagggc actcgacact cgcgggtggc 3756241 gggtgcgaag gtagcggcgc agcgcgtccg actctggctc gcgaaccgcg agcttgacga 3756301 tcgcggacga gtcgagatag atggccgcca tcaacgctcg tgctcacgca ggcgcgcaag 3756361 cgtcaccgac ggcagctcga cgcccgcgtc gaggtcgagc ggttcgggca gatcaacgac 3756421 gtcgagcgtg gcacgctcga tctcgccgct tgccagcagc tgctcgtatg gaccgccctg 3756481 cggcagcggc gagagcaggg cgacgggccg gccgcggtcg gtgatctcga tcgtctcgcc 3756541 ggcctcgact cggcgcagca gctcgctggc ccgctgccgc agcgcacgca cccccaccga 3756601 ggtcattgtg ctaactgtag cacaagcggt cggcgtcatg ggccgacgtt cggctcgcgc 3756661 aggctttaag taacgtcggt gttaattact aggacctgaa aaagtcggcg cgttgttcct 3756721 cggttggttg gcgctgagct gggaggatgg cctcaatgcc cttgttgcgg aagggattga 3756781 ggccatcgtg tttcgtactg taggcgatca ggcatcgttg tgggaatccg tgctgcccga 3756841 ggagttgcgg cggctgcccg aagagctggc ccgggtggat gcgctgctcg atgattcggc 3756901 gttcttctgc ccgtttgtgc cgttcttcga cccgcggatg ggtcggccgt ccataccgat 3756961 ggagacctat ttgcggttga tgttcttgaa gttccgttac cggttgggct atgagtcgct 3757021 gtgtcgggag gtcaccgatt cgatcacctg gcggcggttc tgccgtattc cgttggaggg 3757081 atcggtgccg cacccaacca cgttgatgaa gctgaccacg cgctgcggtg aggatgcggt 3757141 ggccgggctc aatgaggcgc tgctggccaa ggcggccagc gaaaagctgt tgcgcaccaa 3757201 caaggtccgt gccgacacca ccgtggtgga gggcgatgtg ggctatccca ccgacactgg 3757261 actgctcgcc aaggcggtcg gctcgatggc gcgcaccgtg gcgcggatca aagccgcgga 3757321 cgcgggatcg gcgccgctcg gtgggtcgtc gggcccgcgc gatcgcctcc aagctgcggt 3757381 tacgcggcgc gcagcaacgc gatcaggcgc aggccttcgt gcgccggatc accggggagc 3757441 tagccgggat cgccgagcag gcgctgaccg aggctgccgc ggtggtacgt aacgcccaac 3757501 gtgcggtgcg ccgcgccagt gggcggcgca aagcctggct acgccaggcc atcaaccatc 3757561 tcgagaagct gatcggacgc accgagcggg tggtggacca ggcccgtagc cggctggccg 3757621 gggtaatgcc cgactcaagc agccgcctgg tcagtctcca cgatgccgac gctcgcccga 3757681 tccgcaaggg acgattgggc aagccggtcg agttcggcta caaggcccag gtcgtcgaca 3757741 acgccgacgg tgtcatcctg gaccacagcg tcgagctcgg aaaccccgca gatgcaccgc 3757801 aattggcacc cgccatcgaa cggatcagcc gccgcaccgg acgcccacca cgggcagtga 3757861 ccgctgatcg gggctgcgga gacgcatcgg tcgaagatga tctccaccag ctcggggtgc 3757921 gcaacgtggc catcccacgc aagagcaaac ccagcgccac ccgccgcgca ttcgaacacc 3757981 gacgggcatt ccgcgacaag atcaaatggc gaaccggatc cgaaggacgc atcaaccacc 3758041 tcaagcgcag ctacggctgg aaccgcaccg aactcaccgg catcaccggc gcccgaacct 3758101 ggtgcggaca cggcgtcttc gcccacaacc tcgtcaagat cagcaccctg gcagcgtgac 3758161 agacacccgc gcccaccccg accacgccac gcaggtcgcc cagcccgccg ccgtcaatgc 3758221 aaccgcgact ttttcaggtc ttagtaatta gtggccgccg ctttgggtcc accggggccc 3758281 tgcggcgaaa caccagacgt gatgccgtga tcggcgatac ccttcgaccc attgaaggga 3758341 gaacagccat gtcgtttgtg atcgcgaacc ccgagatgct ggcagcggcg gcgaccgatt 3758401 tggccggcat ccggtcggcg atcagcgccg cgaccgcggc ggccgcggcc ccgacgatcc 3758461 aggttgccgc ggccggcgcc gacgaggtgt cgctggccat ctcggcgctg tttggccagc 3758521 acgcccaggc ctatcaggcg ctcagcgccc aggcgacgat ctttcacgac cagttcgtgc 3758581 aggccctgac ctccggcggc aacctgtatg cggccgccga gagccacacc gtcgagcaga 3758641 tggtgctcaa cgcgatcaac gcgcccaccc agacactgtt cggccgcccg ctgatcggcg 3758701 acggcgccaa cgggaccgcg gagaacccgg acggccaaaa cggcggcctg ctgttcggca 3758761 acggcggcaa cggctttacc cagacgaccg ccggggtggc cggcggcaac ggcggcagcg 3758821 cggggttgat cggcaacggc ggggccggcg gcatcggggg cgcgggcacc ggaaccggtg 3758881 gtcacggcgg ggccggcggg gccggcggcc gggcctggct gtggggcacc ggcggggccg 3758941 gcggagccgg cgccgccgcc atcggcaacg ccgtcacccc cggcggggcc ggcggcgccg 3759001 gcggagccgg cggtgacggc ggctggttgt tcggcgacgg cggggccggc ggcaccggcg 3759061 gcaacggcgg cagcggcttt aacagcttga cctcttcggt cggcggcgcc ggcggggccg 3759121 gtgggcacgc cgggctgttc ggcgccggcg ggaccggcgg gaccggcggc atcggcgggc 3759181 aaaacaccga gaccggcccg gccgccagca acggcggcgc gggcggcgcc ggtggcggcg 3759241 gcgggtacct ggtcggcgat ggcggcgccg gcgggaccgg cggggccggc gggaagaatt 3759301 ccagcggtgg cgccaccctc accgggggca ccggagggac cggcggggcc ggcggggcgg 3759361 ccgggtggct ctacggcagc ggcggtgccg gcggcgccgg cgggctcaac aacgccggtg 3759421 gtgccaccgg cggcaccggc ggtaccggcg gagccggcgg ctctggagcg tggctgtacg 3759481 gcaacggcgg ggccgccggg gccggcggca acggcggcaa caataccagc gccggcaccg 3759541 gtggtgtcgg ggctagcggc gggaccggcg gaaacgccgg gctgatcggc gccggcggcc 3759601 acggcggggc cggcggcgcc ggcggaaacc aaaccggtgg cgtgggcaac ggcggggccg 3759661 gcgggaacgg cggcgccggc ggggccggtg gtcagctgta cggcaacggc ggggacggcg 3759721 gcaacggcgg ggccggcggg gccaacatcg ccggcggcaa tggcagcgac ggcggcgccg 3759781 ccggccacgg cggggccggc gggagcgccc ggctgatcgg agccggcggc cacggcgggg 3759841 acggcggcgc cggcgggaac accgccggca gaagggccga cgcgatcgcc ggcaccggcg 3759901 gggacggcgg caacggcggg aatggcggct tgctaagcgg caacgccggg gccggcggcc 3759961 acggcggggc gggcgggagc agcaccgcga ccaccaccac cggaacaccc ccaacgggtg 3760021 caacgggcgg caatggcggc aacggcgggg ccggcggcac ggccgggttt accggcagcg 3760081 gcggcatcgg cggcaacggc ggggccggcg gcaccggcgg taacgccggt gtcgccttgt 3760141 cggttggcag cacgggcgga ctgggcggta acggcggcag cgggggcctc ggcggcggcg 3760201 gcgggtcgct cttcggcaat ggcggggccg gcggtgtcgg cgcaaccggc ggaaacgccg 3760261 gaagcggtat cgggcccgcc agcgtgggtg gcaacggcgg caagggcggc gttggtgcgg 3760321 ccggcgggct tgccgggcag atcggcaacg gcggtagtgg tgggtccggc ggtgccgggg 3760381 gcaacggcgg gaccggcgat accgccggca acggtggcaa tggtggtgcc ggcgcggtcg 3760441 gcggcaacgc ccagctcatc ggcaacggcg gcaacggcgg tggcggcggg aacggcggaa 3760501 ccggcgccac ccccggcacc ggcggcgccg gcgccgccgg cggcaccggc ggcacgctgt 3760561 tcggcgcccc cggaaccacc ggcgccgacg gcacctaagg cccgcgagca gacgcaaaat 3760621 cgcccaattt cgtgccgaat tgggcgattt tgcgtctgct cggcgcagct aacccgccac 3760681 gtactccacc gcgccgtcgt cgagcaccac ccgggcctcg gcgccgtcgg agccggccac 3760741 ctcggtgcgg aacaccgccc ggcccggctc ggtgcgccag atcaccgtcg acagcgtctc 3760801 gccgggaaac accggcttgg tgaaccgcgc ggcgatcgag gtgatgttgg ccgccacacc 3760861 gccgccaagc tcggccacca gcgcccggcc cgccaccccg taggtgcaca acccgtgcag 3760921 gatcggcttg ggaaacccgg ccagctgcgt ggcgaaccag gggtcgctgt gcagcgggtt 3760981 gcggtcaccg gagagccggt agatcagcgc ctggtcctca cgggtcagca tatcgattcg 3761041 ggcgtcgggg tggcggtccg gaaattccgg cgcggccggc cgctcacccc gcgctcctcc 3761101 gaaacccccc tgaccccgaa gcaccaacgt ggtaagcgtt tcggcaacca acgaacccga 3761161 ttccgggtcg caaccgcggc cgcgcagcac aacgatggcg ttcttgccct cccccttgtc 3761221 ctggatgtcg gcgacctcgg tgaccaccga cagttttccc gccgccggca gcggcgcatg 3761281 cagccggatg ccctgggagc cgtgtagcag cgccgccggg ttgaatgttc ccacctttgc 3761341 ggccgcacca aacgccggac agcaaatcac cgcatacgtc ggcaacactt gctggtcgat 3761401 gccgtggctg ttctccgtgg tgaacgccag atctccggtc ccggcgccca ccccgatcgc 3761461 gtaaagcagc gtgtcccggt cggtccactc gaacaacatc ggctcggtca ctgcacctat 3761521 ggagttcgga tcaatcgcca tgcaactctc ctcccggttg gaaaatcatc gcaagccctt 3761581 cccccggacg gtatcgacag ggcaggctat cgccatggcg aagcgcaccc cggtccggaa 3761641 ggcctgcaca gttctagccg tgctcgccgc gacgctactc ctcggcgcct gcggcggtcc 3761701 cacgcagcca cgcagcatca ccttgacctt tatccgcaac gcgcaatccc aggccaacgc 3761761 cgacgggatc atcgacaccg acatgcccgg ttccggcctc agcgccgacg gcaaagcaga 3761821 ggcgcagcag gtcgcgcacc aggtttcccg cagagatgtc gacagcatct attcctcccc 3761881 catggcggcc gaccagcaga ccgccgggcc gttggccggc gaacttggca agcaagtcga 3761941 gattcttccg ggcctgcaag cgatcaacgc cggctggttc aacggcaaac ccgaatcaat 3762001 ggccaactca acatatatgc tggcaccggc agactggctg gccggcgatg ttcacaacac 3762061 tattccgggg tcgatcagcg gcaccgaatt caattcccag ttcagcgccg ccgtccgcaa 3762121 gatctacgac agcggccaca atacgccggt cgtgttctcg cagggggtag cgatcatgat 3762181 ctggacgctg atgaacgcac gaaactctag ggacagcctg ctgaccaccc atccactgcc 3762241 caacatcggc cgcgtggtga tcaccggcaa cccagtgacc ggctggaggc tggtggaatg 3762301 ggacggcatc cgtaacttca cctgaccgcg cggttgacgc ttaccgccgc tgaccgccac 3762361 gattgaccgc atgcggtacg tcgttaccgg cggtaccggg tttatcgggc gccacgtggt 3762421 atcccgtctc ctggacggcc gacccgaggc acggctgtgg gcgctggttc gccgccagtc 3762481 gttaagccgc ttcgagcgcc tcgccggcca gtggggtgac cgggtaagac cgctggtcgg 3762541 tgatctcacg gagctcgaac tgtccgagcg gaccatcgcc gagctaggcg atatcgacca 3762601 tgtgctgcac tgtgcggcgg tacacgacac cacctgggcc gacgccaccc gcgccgtcat 3762661 cgagctggcg gcacgccttg acgccacgtt tcatcacgtg tcgtcgatcg cggtggccgg 3762721 agacttcgcc ggccactaca ccgaggccga cttcgacgtc ggccagcgcc taccgacccc 3762781 gtatcatcgg atgacattcg aggccgaacg gctggtgcgc tccacgcccg gcctgcgcta 3762841 tcgcatctac cgcccggcgg tggtggtggg tgattcgcgc accggcgaga tggacacgat 3762901 cgacggaccc tactacttgt tcggggtgct ggccaagctg gcggtgttgc cgtcgttcac 3762961 cccgatgctg ctgccggaca ttgggcgcac caacatcgtg ccggtcgact atgtggccga 3763021 cgcgctggtg gcgctcatgc acgccgacgg ccgggatggg cagacgtttc atttgaccgc 3763081 gccgacagca atcggactgc gcggcatcta ccgcgggatc gccggcgcgg ccggactgcc 3763141 cccgctactc gggacgctgc ccggctttgt ggccgcaccg gtgctcaacg cgcgcggccg 3763201 cgccaaggtg ctgcgcaaca tggcggccac ccaactggga attcccgccg agattttcga 3763261 cgtcgtcggc tgcgcgccca cgttcacgtc cgacacaacc cgggaagcgt tgcgcggcac 3763321 cggcattcac gtccccgaat tcgccaccta cgcgcccggg ctgtggcggt attgggccga 3763381 gcacctcgac cccgaccgcg cgcgtcgcaa cgatccgctg ctgggccgcc acgtcatcat 3763441 caccggtgcg tccagcggca tcgggagggc atcggcgatc gccgtcgcca aacggggtgc 3763501 gacggtattc gcgctggccc gcaacggcaa cgcgctagat gagctggtca ccgagatccg 3763561 cgcccatggc ggtcaggcgc acgcattcac ctgcgacgtc accgattccg cgtcggtgga 3763621 gcacaccgtc aaggacatcc tgggccgttt cgaccacgtg gactacctgg tgaacaacgc 3763681 cggccggtcg atacgccgct cggtggtcaa ctccaccgac cggctgcacg actacgagcg 3763741 ggtgatggcg gtcaactact tcggcgcggt gcgcatggtg ctggcgctgc tgccgcattg 3763801 gcgcgagcgc cggttcggcc acgtcgtcaa cgtctccagc gccggcgtgc aggcccgcaa 3763861 tcccaagtac agctcgtatc tgcccaccaa ggccgcgctg gacgcgttcg ccgacgtggt 3763921 cgcctccgag acgctgtccg accacatcac gttcaccaac atccatatgc cgctggtggc 3763981 caccccgatg atcgtgccgt cgcggcggct caacccggtg cgcgcgatca gcgccgaacg 3764041 cgcggcggcg atggtgatcc gcggactcgt ggaaaagccg gcgcgcatcg acactccgtt 3764101 gggtacgctc gccgaagccg gcaactacgt cgcgccacgg ctgtcgcgcc gaattctgca 3764161 ccagctctat ctgggctatc ccgattcagc tgcagcgcag gggatttcgc gtccagacgc 3764221 ggaccgccca ccggcgccgc ggcgtccccg gcgatccgcc cgcgcgggag tcccgaggcc 3764281 gctcaggcgc ttggggcgac tggtgcccgg tgtgcattgg tagtcacttc tggcaggtga 3764341 actggttgac gtcgatgtat ccgatgcgaa acatctcggc gcagccggtg aggtacttca 3764401 tataccgctc gtagacttcc tcggattgca gcgcgatggc ctggcccttg ttggcctgca 3764461 acgccgcgga ccagaggtcg agggttttcg catagtgcgg ctgcaacgat tgaactctgg 3764521 tgacggtgaa gccgtttgcg ctggcacact cctgcaccat cggtatcgag ggcagccgcc 3764581 cacccggaaa gatctcggtc acaatgaatt tcaggaaacg agcgaaggtg aacgacatgg 3764641 gcaggccgcg ttcgtggatc tctttcggat gcaacccggt gatggtgtgc agcagcatga 3764701 ccccgtcagc gggcagcagg cgatgcgcca ggctgaagaa cgcgtcgtag cgctcgtgac 3764761 cgaaatgttc gaaagcaccg atgctgacga tgcggtcgac gggctcgtca aactgttccc 3764821 agccggccag cagaacgcgt ttggagcgta gactttcgga gttggcgacc agctgctgaa 3764881 cgtggttggc ctggtttttg ctcagggtca gaccgacgac gttgacgtcg tatttttcca 3764941 ccgcgcgcat catggtggcg ccccagccgc agccgacgtc caacagtgtc atgcccggct 3765001 gcaatccgag tttgcccagc gcgagatcga tcttggcgat ctgcgcctct tgcagcgtca 3765061 tgtcgtcgcg ctcgaagtag gcgcagctgt aggtctgagt gggatcgagg aacagccgga 3765121 agaagtcgtc ggacaggtcg tagtgcgcct gcacgttggc gaagtgcggc ttcagctcgt 3765181 cgggcattgg gatagcgtat cgtcgtcgcg gtgagcgtcg tattcgccga cgtcgacacc 3765241 ggcatcgacg acgcgctggc cgtgatctat ctgctggcca gtcccgacgc cgatctggtc 3765301 ggcatcgcct cgaccggcgg aaacatcgcg gtaggtcaag tgtgcgcgaa caacctgagc 3765361 ttgctcgaat tgtgcggtgc cgcagacatc cccgtgtcca aaggcgccga tgagccgctc 3765421 ggcggccggt ggcccgatca cccaaagttt cacggcccca aggggatagg ctatgccgag 3765481 ctgccggcca gcaatcgccg gctcaccgat tatgacgcca cgacggcctg gatcgcggcg 3765541 gcgcactccc acgccggcga cctgatcggt ctggtcaccg gcccgctgac caacctggcg 3765601 ctggcgctgc gcgccgaacc cgcgctgccg aggctgctgc gccggctggt gatcatgggc 3765661 ggcatgttcg acggccagcc gatcaccgaa tggaacatcc gggtggatcc cgaggcggcc 3765721 agcgaggtgt tcaccgcgtg ggccggacaa cgacaactgc cgatcgtgtg cggtttggat 3765781 ctcacccggc gggtcgcgat gacaccggac attctcgccc ggctggcgtc cgtctgcggc 3765841 tcgtctccgg tgatgcgggt gatcgaggac gcgctgcggt tctacttcga gtctcatgag 3765901 gcgcgcggac atgggtacct ggcatatatg cacgacccgc tggccgccgc ggtcgcaatg 3765961 gacccggaac tcctgacgac ccggaccgcg acggtggatg tcgacccgac gggggcgacg 3766021 gtcaccgact ggtccgggaa gcgaaatccc aacgcgcgga tcggcatgag cgtcgatccg 3766081 gcggtgttct tcgaccggtt cgtcgaacgg atcggacgat tcgcgcgccg aacgtgaact 3766141 gacggcggga ttttcccgaa attctcgccc tgacgtcacg ttcggcgcaa gtcattcgta 3766201 gcttccctcc agataccacc gccgctgccg gtagcacagc agcaacgcgg tgccgggatc 3766261 gccgtccagc aatacctgag cgcgcgcggt gcggccactc gcccgatccg gatcccacca 3766321 ccgctcgtcg tccggccacg gtccggccca ccagcgcagc cgatcgtctc ggccacgaac 3766381 cctcagccgc gccgggtccg cggagaacat cccccggctg gtcacccgta tcgggtttcc 3766441 ttgggcgtca agcaagtcca ccggatcgtc gaacagcacc gccggcgacg ggtcgggcaa 3766501 cctgccgggc cacggctgac cggggtcggc ctgcggcacc ggctcagggg ctactaggcc 3766561 cagcacggtc aacgtgatgc gttcggccgg gccgtgtccg ccggatagca ccggcacccg 3766621 cacggcctcc ggaccgagca agccctgcac ccgcaccagc gcccgacggg cccgaagcct 3766681 gtcctgttca ccgagcccgc cccatagcgg caactgcaag ccttccgatg cggacaccgt 3766741 ctccaccgcc tgcagccgca gcagagtcac cgccgcggtg ggccggtcac gagcattccg 3766801 gttgttcaac cacccgtcca gttgccagcg cacccggtcg gcggtggcgt cctcggtcag 3766861 cggctcggcg caccgccaca cccggctgcg ctcttcgccg ttggcggtga cggcatgaat 3766921 ggccagccgg gtgcagccca ctccggcggc catcagcgcc cgatgcagct cggcggccag 3766981 cgagcgcccg gcgaacgccg cggcgtcgac ccggtcgatc ggcggatcgc atgccagctc 3767041 ggcggccaga tccggcggcg gctcccgccc gcagggcgcc cgttccggtt cgccgcgggc 3767101 gaaccggtgc gcggccaccg cgtcggcacc gaacctggac gccacgtcgg tacgagacag 3767161 cgcggcgaac tgtccgatgg tgcgaatccc catcctccac aacagatccg tcaggtcgtc 3767221 ccggcccggc ccggacaggc tcggctcggt ggcaagttgg cggatcgaca gcagcgacag 3767281 aaaccgcgca tcgcctcccg gctccacgat gcggccagca cgcgcggcga aaaccgcggt 3767341 agacaaccgg tcggcgattc cgacctgaca ctccgcgccg gccgcggcca ccgcgtcgat 3767401 cagccgctcg gccgccatct gctcggaccc gaaaaaacgg gccggcccgc gcaccggcaa 3767461 caccaggagc ccgggccgca gcagctcggc gcggggcacc agatcgtcta ccgccgcgat 3767521 caccccttcg aagagccggg cgtcgcggtc ggcgtcggca gtcgctataa acagttgcgg 3767581 acaccgcgcc gccgcctccc gacgccgcaa ccctcggcgc accccggccg cccgcgcggt 3767641 cgccgagcag gcgatcaccc ggtttgccaa cgtgaccgcg accggggccg tcgcggatag 3767701 gcccgcggcc gcggccgccg cgaccgcggg ccagtccata caccagatcg ccagcacgcg 3767761 agcggaggcc atcaccgtcc acgcccgttg atctgcagcc gcaccccact gatccgcccc 3767821 aaccccgggg tgggcacgcc cctgagggcc ggggtgatct catagccgca gacccgggcc 3767881 gcaagccgcg tcgacacgcc ttgccagtcg ccgtcggtga ccagcagggt gcagcctttt 3767941 tgacgggcac gggccaccac tgcccgcgcc cgcgcccgcg tcacccggcg ccctcccaga 3768001 ccgagcacca ccagatccat gccgtcgatc agcacagcgg ccacctcaac cggatcggtc 3768061 ccgggatctg gtatcatcgc gagccggctc agatccgccc ccatctccac cgcggccagc 3768121 aacccgatat ccggctggcc aacgatggcc gcgtttcccc cggccgccgt caccgatgcc 3768181 accatgctca gcagcagtga ccgcgcaccc gacagcactc ccaccgtccc cgggggcaac 3768241 gacaccggtc ccgccggcac caggtcgccc gaacggctgg gccccccgga caccttctcg 3768301 gacagcaaag ccatctgccg tcgtagtgat tcgagctgct cagcaccatt ttcaaggcgt 3768361 tggtcggagg cgaaggccac agtcatgacc agcctcctgt tcgaaaatat gttcgaagtc 3768421 agtaaacacc cgtccttgga gtccgtcaag gtcatgagag gctgccttgt gcaatcgcgt 3768481 aaaaccacct cggtactggc ggctgccctg ctgttttgcg gcctgttagg cccagggacg 3768541 gccccaccgg ccaccggtgg cgggcctgcc tgccggccgg cagagctctt cgccaccgac 3768601 aacaccaccg atgggttcga gctaccggcc gttgcgacta tcgcactaac cggcacggtg 3768661 gtgaccggat cgaccctggt cgacggcgtg ttctggtcga atgagcgcca gcagatcggc 3768721 tacgagcgct cccgtgaatt tcatctgtgc gttgtcgacg cgcccacatt gcacaacgcc 3768781 gccgaggcac tgcaccgcca gttcaaccaa gaagcggtgc tgaccttcga ctacttgccg 3768841 cagaatgcac ccgaggcgga cgcgatcctc atcaccgtgc ccgacatcgg catcgcccgc 3768901 ttccgcgatg ccttcgcatc tgatttggct gcacaccacc gattacgggg cggatctgtc 3768961 accacagccg accacacctt aatcctggtc gccggcaacg gcgatctcga tgtcgcccgc 3769021 cgactcgtcg aggaggccgg cggggactgg aacgcaacca ccattgccca tggcaggcgt 3769081 gaattcgtga actagctgat caagggcgct ccgctggcca cccgagccgg gttggtcaca 3769141 ttagttagtc acagcaatct ctgggccggc gggcacaacg cgtattcatc ccgacagata 3769201 ccaatgtgtc gcctgtgaca aaagccgggc ctggctaatg ctggccgccg ctactcccac 3769261 tcgatggtgg cgggcggctt gctggtgatg tccagcacca cgcggttgac ctcggcgacc 3769321 tcgttggtga tccgggtcga gatgcgctcg agcacctcgt agggcacccg ggtccagtcg 3769381 gcggtcatcg cgtcttcact cgacaccgga cgcagcacaa tcgggtggcc ataggtgcga 3769441 ccgtcaccct gcacacccac cgagcggaca tcggccaaca gcaccaccgg acactgccag 3769501 atctggttgt ccaggcccgc cgcggtcagc tcctcacgca cgatcgaatc ggcgtgccgc 3769561 agcgtatcca accgcttggc ggtgacctcc ccgacgatcc gaatacccaa ccccggtccc 3769621 ggaaacggct ggcgcgccac gatctcctcc ggcagaccca actcccgccc gaccgcgcgc 3769681 acctcgtctt tgaacagcag ccgcagcggc tcaacgaggg tgaacttcag gtcgtcgggc 3769741 aggccgccga cattgtggtg gctcttgatg ttcgcggtgc cgctgccccc gccggactcc 3769801 accacatccg gatacagcgt gccctgcacc aggaactcag cagtcttacc gtccagcaca 3769861 tcccgcaccg cgccctcgaa cgcgcggatg aactgacggc cgatgatctt gcgtttgccc 3769921 tcgggggcgc tcacgcccga cagcgcctcg aggaaggtct cggccgcgtc gacggtgacc 3769981 aggttggcgc cggtggcggc cacgaaatcg cgttgcacct gcgcccgctc accggcgcgc 3770041 aacagcccgt ggtcgacgaa gacacaggtc aaccggtcgc cgatggcccg ctgcaccagg 3770101 gccgcggcca ccgcggaatc cacgccgccg gatagcccgc agatggcgtg gccgtcgccg 3770161 atctgggtgc gcacctgctc gatcagcgcg ttggcgatgt tggcgggcgt ccactgggcg 3770221 ccgagcccgg cgaagtcgtg caaaaaccgg ctgagcacct gttgcccgtg tggggtgtgc 3770281 atcacctccg ggtgatactg caccccggcc aggcgccggt cgaaggcctc gaaggcggcc 3770341 accggggcac cggcgctgct agccaccacg tcgaatccgt ccggcgcggc cgtgaccgcg 3770401 tcaccgtgac tcatccatac cggctgaacc tcgggaagat ccgaatgcag tttgccacca 3770461 aggactttca gttcagtccg accgtattcg cgagtgccgg tgtgggcgac gatccccccg 3770521 agcgcctgcg ccatggcctg aaacccgtag cagatgccaa gaaccggtac accgaggtcc 3770581 agtagcgccg gatcgagttt cggagcgccg tcggcgtaga cactggccgg tccaccggaa 3770641 agcacgagcg ccaccggctg acgggccctg atctcctcga tcgaggcggt gtgcggaatc 3770701 acctcggaga aaacccgtgc ttctcgaacc cgacgggcaa tcaactgggc atattgggca 3770761 ccgaagtcga ccaccaacac cggtcgagcc ggtgtctcag gcacgtcaat gtcagcaggc 3770821 tgcaccacgg ccagtcagtc tagtggctgg ggtgactccc gaggtcggcc ggtagcggtc 3770881 catgggccgg tccgcaggtt accgaagagg ccagtgctgc cgccgccact tgggccttct 3770941 tcagtcccga cagagagatt cgccgatcgt agacgaccgc cggcgatgct ctgatcaagg 3771001 cgagctgacg gcggtagatg ccagacatgg ccgcacagca ggcagcgctg cggcggtcga 3771061 ggtgtggaat cagccgcagt cccagcgaat accagtctgc ggcgcggtcg gcactgaacc 3771121 gcagcagtgc cgcgagccgt ccgtcggggt catcgagtgc cccggtgtcg tccaggcgga 3771181 ggcgtacgcc taatcggtcc agctcgtcgc gcggcaggta gatccgtcca ttcaaaaagt 3771241 cctctcgaac gtcgcgcaga atattggttt gctgcagagc gattcccaac tgctcggcgt 3771301 atcgcgacgt cgccgtgctg acgggtccaa agatggaaag acaaagcttt ccgatcgtgc 3771361 cggccccccg gcggcagtag acgatcagct cgtcgaaatc gcggcaacca gtccagtcga 3771421 tttccatacg ggcgccgtca atcaactctg cgaacatcgc gatcggcacc ggaaaccggc 3771481 gagccgcgtc agccagcgca accagcaccg gatcggatga atcatcaata ttatcaagtg 3771541 atttcctgat ggcatcgagc tcggtgatct tggtctcggg ggccagctcg ccgtcggcga 3771601 cgtcgtcgat ccggcggccg agcgcataga ccgcagatag tgccgctcgc ttttcgcgcg 3771661 gcaagagtcg gatgccgtag tagaagtttc tggcggccgt gcgcgtgatc gactcggtga 3771721 ttcgatacgc ctgttcgatc tcggtcatgc cgtcctccaa ctacggtgtt ggtcagtcac 3771781 gcctgacgat cgacgatgta gtgagccaaa tcctgaagct cagcggccgg gcgatcggga 3771841 atgccgatgc gcgccaccat gtcgatgcct tgcgttacgt gtcggcgggc ctccgcgctt 3771901 gcccacctgc gccccccacc gcactcgatc agttctgcga ccgctgcgag ctcatcatcg 3771961 gacgctgtct ggctgcccgt ctcgtccacc agccacgctg cgaggcggcg gccggccgaa 3772021 ccgccgtgcg ccacggtcca ggtaacgggc agagttttct tgcgggagcg aaggtccgag 3772081 tacaccggct tgccggtgat ctcaggacgg ccccaaatgc cgagcaggtc gtcgaccaat 3772141 tggaaggcaa gtccaatgtg acgaccgtag gcaaccaacg cttctcgcac cgaacgcggt 3772201 gcgccagcga gtaacgcgcc gacctctgcg ctggctgcca tcagtgctgc ggtcttgcct 3772261 tcagccatct tgagacactc atcgagtgcg acgtcggttc ggctttcgaa cgcggtgtcg 3772321 gcggcctgcc cacggatcaa ctcacgggtg gcttccgaaa tcgcgcgcag cgccgcaccg 3772381 acgtgtggtg aatcgcaatc cagcaggacc tcgtgcgcca gcgacagcat cgcatcaccg 3772441 gccaatagcg ccatcgcatc gccccacagt gcccacaccg tcggccggtg ccgacggtgc 3772501 tcgtcgcggt ccatgaggtc gtcatggacg agcgagaagt tgtgcaccag ttcaaccgag 3772561 acggctccgg gaatcgccga gtgggggtcg gcgccggcgg cttcggcggc gacaaacacc 3772621 aaagcaggac ggattgcctt gccgcagttg ttgttcactg gacggccgcg ttcatcagac 3772681 cagccgaggt ggtaggacac gacgggccgc atgtggggat cgaggcggtc agccatctgg 3772741 cgcagcgtcg gtgtgatgag ttcgtgtgcg agtcccaaaa cgggaagcgt gcgacgggtc 3772801 atacggtcgc tgtcgggttg cggtggcagt ccgtactttt cgtcggtacc gcgcattgcg 3772861 tgaatctagc attcgctcat ggcacggccc atgggcaagt tgcccagcaa tacgcgaaaa 3772921 tgtgcacaat gtgcaatggc ggaggcacta ttggagatcg ctggtcagac tattaatcaa 3772981 aaggaccttg gcaggagcgg acggatgacg cgtaccgaca atgacacttg ggatctggcc 3773041 tccagcgtgg gggcgaccgc cacaatgatc gccaccgccc gggcgttggc tagcagggcc 3773101 gaaaaccctt tgatcaatga tccattcgcc gagccgctgg tgcgcgccgt cggcatcgac 3773161 ctgtttaccc ggctggccag cggcgagttg aggcttgagg acatcggcga ccacgccacc 3773221 gggggtcggt ggatgatcga caacatcgcg attcggacca agttctacga tgactttttc 3773281 ggtgacgcaa ccacggcggg tattcggcag gtagtgattc tggcggctgg gctcgacacc 3773341 cgcgcgtacc gactgccctg gcccccgggc acggtggtct acgagatcga ccagcccgca 3773401 gtcatcaagt tcaagacacg ggccctcgcc aatctgaacg ccgaacccaa cgcagaacgg 3773461 cacgccgtgg ccgtcgatct gcgaaacgat tggccgacgg cgctgaagaa cgccggcttc 3773521 gacccggcca gaccgacagc cttcagcgcc gaggggttgc tgagctacct gcccccacag 3773581 gggcaggacc gcctgctcga tgcgattacc gcgctcagcg cccctgacag ccggttggcc 3773641 acccagagcc cactggtgct cgacctggcc gaggaagatg agaagaagat gcgcatgaaa 3773701 tccgcggccg aggcatggcg ggaacgcggc tttgatctgg acttgaccga gctgatctac 3773761 ttcgatcaac gcaacgacgt ggccgactac ctcgccggct ccggctggca ggtcaccacc 3773821 agcaccggca aggaactctt tgcggcccaa gggctgccgc ccttcgagga cgaccacata 3773881 actcggttcg ccgaccgccg ctacatcagc gcggtgctga agtaggtggc cccggcacta 3773941 tagccgggcc taactcgtag gcttggtacg cgggcagagc cgccaggcat ggcgaactgg 3774001 tatcgcccga actatccgga agtgaggtcc cgcgtgctgg gtctgcccga gaaggtgcgt 3774061 gcttgcctgt tcgacctcga cggtgtgctc accgataccg cgagcctgca taccaaggcg 3774121 tggaaggcca tgtttgacgc ctacctagcc gagcgagccg agcgcaccgg cgaaaaattc 3774181 gttcccttcg accctgccgc ggactatcac acgtatgtgg acggcaagaa acgcgaagac 3774241 ggcgttcgat cgtttctgag cagccgcgcc atcgaaatac ccgacggttc cccggatgac 3774301 ccgggcgccg ccgagacggt gtatggcctg ggcaaccgca agaacgacat gttgcacaag 3774361 ctgctgcgcg acgatggggc ccaggtgttc gacgggtcgc ggcgctacct ggaggcggtc 3774421 acggccgcgg gtctcggtgt ggccgtggtg tcttcgagcg ccaacacccg cgacgtgctc 3774481 gcgaccaccg gtctggaccg gttcgtccag cagcgggtgg acggcgtgac gttgcgcgaa 3774541 gagcacatcg ccggcaagcc ggcccccgac tccttcctgc gcgcggcaga actgttgggg 3774601 gttacccccg acgcggcggc ggtgttcgag gacgccctgt ccggggtggc ggccggccgc 3774661 gccggcaact tcgccgtagt ggtgggcatc aaccgaacgg gccgggcggc tcaggccgcc 3774721 cagttgcgcc gccatggcgc cgacgtggtg gtaaccgatc tcgccgagct gctgtagggc 3774781 atgatcgggc gatgatcacc gaggacgcct tccccgtcga accgtggcag gtccgcgaga 3774841 ccaagctcaa cctgaacctg ctggcccagt ccgaatccct attcgccttg tccaacgggc 3774901 acattggatt acgcggcaac ctcgacgagg gcgaaccctt cggactgccg ggcacctacc 3774961 tgaactcttt ctacgaaatc cggccgctgc cgtacgccga ggccggttat ggatatccgg 3775021 aggccggcca gaccgttgtc gacgtcacca acggcaagat ctttcgcctg ttggtcggcg 3775081 acgagccgtt cgacgtccgg tatggcgaat tgatctccca cgaacggatc ctcgacctgc 3775141 gcgccgggac gctgacccgc cgcgcgcact ggcgctcacc ggcgggcaag caagtcaaag 3775201 tgacgtccac ccggctggtg tcgctggccc accgcagcgt cgcggcgatc gagtacgtcg 3775261 tcgaggcaat cgaggaattc gttcgcgtga ccgtgcagtc cgaactcgtc accaacgagg 3775321 acgtaccgga gacctcggcc gacccgcggg tgtcggccat cctggacagg ccgctacagg 3775381 ccgtcgagca cgaacgcacc gagcggggtg cacttctcat gcaccgcacc cgagccagcg 3775441 cgctgatgat ggccgcaggg atggaacacg aggtcgaggt tcccgggcgg gtcgagatca 3775501 ccaccgacgc ccgcccggac ctggcccgaa ccaccgtgat ctgcgggctg cgcccgggac 3775561 agaagctgcg catcgtcaaa tacctggcct atggctggtc cagcctgcgc tcccgcccgg 3775621 cgctgcgcga ccaggccgcc ggcgcgctgc acggtgcccg ctacagcggc tggcaggggc 3775681 tgctggacgc gcaacgcgcc tacctcgacg acttctggga cagcgcggac gtggaggtcg 3775741 agggcgaccc ggaatgtcag caagcggtgc gtttcgggtt atttcacctg ttgcaggcca 3775801 gcgcgcgcgc cgaacgccgc gcgatcccca gcaaggggct caccggaacc gggtatgacg 3775861 gccacgcctt ttgggacacc gaaggtttcg tgctaccggt gctcacctac accgcaccgc 3775921 atgcggtcgc cgacgcgctg cggtggcggg cgtcgacgtt ggacctggcc aaggagcggg 3775981 cggccgagct cggcctggaa ggtgccgcct ttccctggcg gaccatccgc ggacaggagt 3776041 cctcggccta ctggccggcc ggcacggcgg cctggcacat caacgccgac atcgcgatgg 3776101 cgttcgagcg gtaccgcatc gtcaccggcg acggttcgct ggaggaggaa tgcggccttg 3776161 cggtgctgat cgagaccgcc cggctgtggc tctcgctcgg gcaccacgac cgccacggcg 3776221 tctggcacct cgacggggtc accggtcccg acgagtacac ggcggtcgtc cgcgacaacg 3776281 tgttcacgaa tctgatggcg gcgcacaatc tgcacaccgc cgccgatgct tgcttgcgcc 3776341 accccgaggc ggcggaggcc atgggtgtca ccaccgagga gatggccgcc tggcgcgacg 3776401 cggccgacgc cgccaacatt ccctacgacg aggaactcgg tgtccaccag cagtgtgaag 3776461 ggttcaccac ccttgcggag tgggatttcg aagccaacac cacttatccg ttgctactgc 3776521 acgaggccta cgtgcgcttg tatcccgcac aggtgatcaa gcaggccgac ctggtgctgg 3776581 cgatgcagtg gcagagtcac gcgttcacgc ccgagcagaa ggcgcgcaac gtcgactact 3776641 acgaacggcg catggtgcgc gactcgtcgt tgtcggcctg cactcaggcg gtgatgtgcg 3776701 ccgaggtcgg ccatctcgag ttggcccacg actatgccta cgaagccgcc ctgatcgacc 3776761 tgcacgacct gcaccgcaac acccgtgacg gcctacacat ggcttcgctg gccggagcct 3776821 ggacggcgct ggtcgtaggc ttcggcggcc tacgcgacga cgagggcatc ctgtccatcg 3776881 atccgcagct gcccgacggc atctcgcggc tgcggttccg gctgcgatgg cgcggcttcc 3776941 ggctgatcgt cgacgccaac cacaccgacg tcaccttcat ccttggcgac ggtcccggca 3777001 cccagctgac catgcgccac gccggccaag atctgacgct gcacacggac acaccgtcca 3777061 ccatcgccgt gcgcacccgt aagccgctgc tgccgccacc accgcagccg ccaggccgcg 3777121 agccagtgca ccgccgggct ttagcccggt gacgatacgg gccgcgtagc ggcccgagga 3777181 ggagccgggc aatcggctta gcccggtgac gatgcgggcc gcgtagcggc ccgaggagga 3777241 gccgggcaat cggcttagcc cggtgacgat acgggccgcg tagcggcccg aggaggagcc 3777301 gggcaatcgg cttagcccgg tgacgatgcg ggccgcgtag cggcccgagg aggagccggg 3777361 caatccagcc tgagcccggt gacgatgcgg gccgcgtagc ggcccgagga ggagccgggc 3777421 aatccagcct gagcccggtg acgatgcggg ccgcgtagcg gcccgaggag gagccgggca 3777481 atccagcctg agcccggtga cgatgcgggc cgcgtagcgg cccgaggagg agccgggcaa 3777541 tccagcctga gcccggtgac gatgcgggcc gcgtagcggc ccgagaagga gccgggcaat 3777601 cggcttagcc cggtgacgat gcgggccgcg ctgggggcac catccgcttg cggggacgcg 3777661 tctgcgtcta cctgggcggc accggtgaac gtctcattca ccgcgcacct ccgcttcctg 3777721 cacggcggcg acgacccggg caacgtcatc cggggccatg tggtcgtgga ctggcagcga 3777781 cacgattcgc gagcaaatgt ccgccgtgac ggctagatcg gtcgactcga ctaactcggc 3777841 attcgtcaca aagtacggat gtcggtgctg cggtgggttc tagtagtcgc gcgcctcgat 3777901 cgcgtgccta cgcaggctac ccagaaccgc ggccttgtgg tcggcggacg tgcagcaagc 3777961 gctcgcgaaa cagagcgacg caacattggc gttgtcctgg aaacgcacac ccgcgtcggc 3778021 cataccggtg cgatagcact cgaggacctt gcggcgactt gccaggcggc gatcaagccc 3778081 gactagttgg cgtaggccaa tagcggcgct gatctccgac agcttgccgt tcattccgag 3778141 ctggatggac tcgcgtgttt gcaccaagcc gaagttctgg aacttgtatg cgtgctcgac 3778201 gagccgtgga tcgcgagaaa ccagagcgcc gccctcacca accgcgaacg gcttggtcgc 3778261 atggaaggag aagatctcgc atgcaccgcg tccaccgagg cgctcgccgt cggcgtacgt 3778321 ggagccgaag ccggccgccg agtcgagcac aatcggtagc tcccattcgg cggcgagctc 3778381 ctcccagacg ctgatctggg gattgccgac gccgaacaca ttggccagca ggatgccggc 3778441 gatccggtcg cggaagcgtt cgatgacggc gcaggcggag tggacgcatg gctgccatgt 3778501 gttggcgtcg atgtcgatga accagggacg gtacccagtc catagcgcag cctgagccac 3778561 gccgacgaac gtgaacgacg gcatcagcag gtagcggtcc cgcgtaccgg cgccgaaact 3778621 gacgtggagc gccgcgagga gtgccagggt gccgttggcg agggtagcaa cgtgcagatg 3778681 aggtcccaga tagtcgcgca gggcgcgggc aaaccgccgc tcgttcggac cgaagttcgt 3778741 gtaccagtta gcctgggcga tctgtacgaa gtcctcggcg agctcggctg gcccgggaaa 3778801 gctcgggcgg atgaagggga tcttggggat cgtcgagcca cccggctcga gttttaacat 3778861 ggacgtgcct ggggtcgcgc gtactgcgga cggcggctcc agcaccgagc cggataacgt 3778921 tcggatcttc atatatgcag agctcaaggt ccgtttgcag cgcgtcggga cagtttccgc 3778981 agcgcacttc gtcgcaccat cgttggcatc ggcgcctgaa gcagttaccg cgagaaccgc 3779041 atcatgtcga acttgaggtt agccttacct cttaagaatg tcacccccgg ggagtgaccc 3779101 cggctgtcca cgcgtgggcg acccggcacg cacggtgcac gttccctggt gtctcacccg 3779161 cctcattcgt cccggcgcca cagggctagc gatatggccg cctcgcgtag tcggtccggg 3779221 tcggtacgcg tggggcagat gaggaaggtg cggcgcattg aagcagcctg tcaagcccgg 3779281 ctcggtgacc ggcacggcgc gttcagcatg gggccagtgc gacgggctgg agcgaagcaa 3779341 gcaggctggc cgccagcgat ttcgccagcg accggacgcg cggtgcgctc tcgacgtgcc 3779401 aaaaacggat cttgggagtg aaattgccgc cgactagggg cccgatgacg caaaaacctg 3779461 ggctggcctc gaagtcgtcg ttaaccagaa ggccacggtt ggtgcggttc gggcggcaca 3779521 gcccgttctg catcgcgctg accaggaacg gcgaggaaca cgtgtccagc tcctcgaaac 3779581 cgccacaatt caccaccgca gcgaagggga cggggtgggt atgctcggct cccgcggctc 3779641 ggtaggtcat ggtggcgaac ggctggccgg acgcgcaggc atccacgcgc agtacttcgc 3779701 cggcgagcag gctcagcgtg ccgtccgcgg ctagctcctc ggatgcctgg cggcaatcgc 3779761 gtcccgcacg ccgcaccaac ttggtgaagt tcatgccgtg cacgcagaag aactcttcct 3779821 gctgcacgag atccatcttg tgcagcgcct gcccaaacag ggcggcaacg gcgtcgtaca 3779881 aatcggccag gttcaacgag cgttcttcgg ccgtcgcgag atcgtcgcgg atcgcggaca 3779941 tgagatccgc cgcggcgatc gcttccgtac agagcagcgt gcgcagccgc gggaagtcaa 3780001 actccggcgg ctgattgcag atcatgtagg gcagcacgcc ggagcgcgag atgacggtga 3780061 tggaccggac gcgtgcgcgg atgcgcgcgt cgtgacgcat taggtagagc gcttccagcg 3780121 aggtggcgtt ggaacccacg accagtacgt tgcgcttctc ccacgactcg acgcggtcga 3780181 gcgaatcgcg cagtcgcgct acgttgctct ccccgccggg ggagtagaaa tcgttgatat 3780241 aggtgaatgc gggttcggaa tcgctcgcaa ggatggcttt ggtcgggggg ctgccaatgg 3780301 ccacaaccac tttgcctgca gcaattgccg ttggaccgtt tccagacggg cggaggccga 3780361 ttcggtagtg gccgtctgcg gagtgggcgc tcatggcctc agcgcggatg gtgacgattt 3780421 cggccaggtc acgctcgccg agcgcggcga tggcggcaat catctgctcc gacagaaata 3780481 caccgaagag aaaccgcggc aggtagagct ccccccactg gttgccgtcc aatgcgtcgc 3780541 ggttgtcgca gatccagcgg gccgcggccg caccgccctc tgcctggaag aacgccagcc 3780601 agcgctgctt gttctgctcc agccagatcc ggtaggcggc cttttccggc tcgtcggcga 3780661 aatcgtcgag cttctgaatg gccagcgatc cgatgctgga gcgttggcca taggggattc 3780721 cgcaccagaa ctgctcgtct cgctccacca ccgcgatgcg caacttgggc gatgccgagg 3780781 ggctgctcag cagggcatcg gccatttcca gcagagtcat agagcacgcg gccccgctgc 3780841 cgatgaacgc aacgtcgaag gtaggtggag tgatcatagt catcaaataa gggaaggcta 3780901 acataacctc gaggcggtgg ttaggcttcc gcgggcttct ccggttcgag cacgacgcgg 3780961 acaaacacct tgcggcctga cgcatcgacg aaccaagcgt tgcggaaatc atcatgggtc 3781021 aacgcgcgca ggcgattcag gaaatgccca aacgttccgc gctcgttcag gtctagccgc 3781081 cggagttgtt cgaaatcctt tttcaggttg aggttgccct cggtggccgg cgatttagcc 3781141 gtgtagctgc cgtcccggat ggcgtcgaaa tgttccagca ccaactcacg ctcgatgtcc 3781201 atcagccggg cgtagacact tcccgaggaa tcccacgact cgatcgcgca ttcccgctgg 3781261 gcgatgatcg gaccatggtc caactgatcg tcgatctcgt ggatcgtcac gccgactttt 3781321 tgcccgtcga tgatcgagaa gacctgggga aaccagccgc ggttgtaggg gttgaaaccc 3781381 ggatgaacat tcacacacct gaccccatcg atcaaagcgg cgggaaacct ctgtttacag 3781441 tggaaggaaa ggacgaggtc ataccgctcc acgatttccg cgacgcgctc tgcgacatca 3781501 catcgcggga cacccggcag ctggccgatg ggggactgat agacgtccat atcgccatgc 3781561 ctggcctgca gatcgaccgc cagagcatgg gcgtggacgt tgtcggtcag gatcaatatc 3781621 gtcacgactc gcccccgcca gcctgcccag cgcccatcag cggagccccc aacaccagct 3781681 caccctactt cagggccgac gcataccgga cggccacgct ggggccagcg cagggacatc 3781741 agtcagtgcg gttccaggat ccgggctacc gcatcgttca cggacaaccg tagttcgtca 3781801 ccggtcagct cgtcgatacc cgtcgccgag cgcagcatgg gcgcaaagag ccgccaaccg 3781861 aattgcagcg caagggcgtg cgcgaccgcc agccgcgcgc ccaagtcgct gtcgtagcga 3781921 ggccgtaccg cgtcgagcag ctccgcaaca ttgggaaatc gctgttgcag ctggcccacg 3781981 ggatatccgt ccagcagtgc ccgggctaag acccgcccat gtcggtcgag agcccgttcg 3782041 atgatgtcag cgggcgcctc ggagtgcaac agtctggtca gcttcgtgcc caggtgatcg 3782101 agcacggccc caaccagttg gtccttggtg ccgaagtgac gaaacaccag cccgtggttg 3782161 accttggatc gagcggcgat gtcgcgaatc gacgtcgcgg ctggcccacg ctcggcgaac 3782221 aggtcggtgg cggcctgcag gattgcggcc gctacctctt cccgcccagt gggcatcttg 3782281 cggcggtcgg ttgccggacg cgtagtcatc cggctacagt aaccgatgta gtcatctgac 3782341 tacactaacc attcattgag gacgccagca atgacagatc tgattaccgt gaagaagctg 3782401 ggcagccgta tcggcgccca aatcgacggg gtgcgcctcg gaggcgatct ggaccccgcc 3782461 gcagtcaacg agattcgcgc ggcactactg gcccacaagg tggtcttctt ccgcggtcag 3782521 caccaactcg atgacgccga gcagctggcg tttgccgggt tactgggcac cccgatcggc 3782581 cacccggccg cgatcgccct cgccgacgat gcaccgatca tcacgccgat caactccgag 3782641 ttcggcaagg cgaaccgctg gcacaccgac gtcacgttcg ccgccaacta tccggccgcc 3782701 tcggtactgc gcgcggtctc cctgcccagc tatggcgggt cgacgttgtg ggccaacacc 3782761 gccgcggcct acgcggagct gcccgagccg ctcaagtgcc tcaccgaaaa cctgtgggcg 3782821 ctgcacacca accgctatga ctacgtcacg accaaaccgc tgaccgcggc gcagcgggcc 3782881 ttccgtcagg tgttcgagaa gccggacttc cgcaccgagc atcccgtggt gcgggtacac 3782941 ccggagaccg gtgagcgcac gctgctagcg ggcgacttcg tgcgcagctt cgtcgggttg 3783001 gacagccacg aatcaagggt gttattcgaa gtgctgcaac ggcgaatcac catgcccgaa 3783061 aacaccatcc gctggaactg ggcgccgggc gacgtagcca tctgggacaa ccgggccacc 3783121 caacaccggg cgatcgacga ctacgacgac cagcaccggc tgatgcaccg ggtcaccttg 3783181 atgggcgacg tgcccgtcga cgtgtacggg caggctagcc gggtgatcag cggggcgccg 3783241 atggagatcg ctggctgatc aaccagtaag cgcaacgcaa ttatgtagca ccatgcgtgc 3783301 taccgttggg cttgtggagg caatcggaat ccgagaacta agacagcacg catcgcgata 3783361 cctcgcccgg gttgaagccg gcgaggaact tggcgtcacc aacaaaggaa gacttgtggc 3783421 ccgactcatc ccggtgcagg ccgcggagcg ttctcgcgaa gccctgattg aatcaggtgt 3783481 cctgattccg gctcgtcgtc cacaaaacct tctcgacgtc accgccgaac cggcgcgcgg 3783541 ccgcaagcgc accctgtccg atgttctcaa cgaaatgcgc gacgagcagt gatctatatg 3783601 gacacctcgg ccctgactaa gctgctcatc tccgagcccg agacgaccga actgcggaca 3783661 tggctgaccg cgcaaagcgg ccagggcgag gacgcggcga caagcaccct tggccgggtc 3783721 gagttgatga gagtcgttgc ccgatacgga caaccaggcc aaactgagcg tgcgcgttac 3783781 ctactcgacg ggctcgacat cctcccgctc accgaaccgg tgatcggtct agctgaaacg 3783841 atcggaccgg ccaccctacg ttctctcgac gcgattcacc tcgcggccgc agcccagatc 3783901 aagcgggaac tgacagcctt cgtcacctac gaccaccgat tgttgagcgg atgccgtgag 3783961 gtcggcttcg tcaccgcctc acccggcgca gtccggtgac catatccaac gaccgcacgc 3784021 ttcctgatgc ctcagcccgc gttgctgacc ggatcgatcg gcaaccaccg cagcgcgccc 3784081 ggggcgtcgg caggcaccac cgggtgcgcc ggttggatcg gcgccagccg acgatagggc 3784141 tcaccctgcg gtggccgccg gtcggtttcg cccttgttcg gccacagcga ggcggcccgc 3784201 tcggcttgag cggcgatgga cagcgacggg ttgacaccca ggttcgccga gatcgccgca 3784261 ccgtcaacca cgtacagcgt cggatagcca tagacccggt gataggggtc gatgacgccg 3784321 tgctcggggt cgtcgccgat caccgcgccg ccgagaaagt gcgcggtgag cgggatgttg 3784381 aacagctcac cccaggtgcc gccggccacg ccgtcgattt tggcggcgat gcgacgggtg 3784441 acctggttgc cgatcgggat ccatgtaggg ttcggctcgc cgtgtccctg cttgctcgag 3784501 taccagcgga tacccagctt cccgcgcttg gtgaacgtgg tgatcgagtt gtccaggtgc 3784561 tgcatgacca gcgcgatcac ggtgcgctcg ctccattgcc ggggattgag catccggatg 3784621 gtgccgcgcg gatcctgact ggcggtctgc agcaactgcc tccagcgcgg cacatcggtg 3784681 ccctgcggac cggagccgtc ggtcatcaag gtctgcagca gccccatcgc gttggagcct 3784741 ttgccgtagc gcacgggttc gatgtgggtg tcggccgtcg ggtgaatcga cgacgtgatc 3784801 gccacgccgt gggtcaggtc caggtccgga ttgaccttca aggtggcggc cccgacgatc 3784861 gattctgagt tggtgcgggt aaggacaccc aatcgcttcg agagaccagg gagccgaccc 3784921 ctatcccgca tcttgaacag cagatgctgg gtcccccagg tgcccgcggc cagcaccagc 3784981 tgcgttgcgg tgaaggtgcg ccgatcccgg cgcagccaac tgccggttcg cactgtgcgg 3785041 acctcccaca acccgtcgga ccgccgctca aaccccttca ccgtggtcat cggaatcact 3785101 tgcgcgccag ctgattccgc gaggccaagg tagtttttca ccagggtgtt cttggcaccg 3785161 tggcgacagc ccgtcataca gcagccgcat tccaggcagc cggtgcgcgc cggcccggca 3785221 ccgccgaagt agggatcggg cacggtcttg ccgggcgtct tggtgccgtc ggggccgaag 3785281 aacactccaa ccggggtcgg cacccaggtg tcgccaaacc ccatctcgtc ggcgacctcc 3785341 ttgacgatgc ggtcggcgtc ggtgaaggtc gggttttgca ccaccccgag catccgctgc 3785401 gcctgctggt agtgcggcat cagctcgcca cgccagtcgg tgatgtgtga ccactgctgg 3785461 tcggcgaaga acggctccgg cggcacgtac aacgtgttgg cgtagttgag ggagcccccg 3785521 cccaccccgg cgccggccag gatcatcacg ttgcgcagcg ggtggatacg ttgaatgcca 3785581 tagcagccca acctcggcgc ccagagaaac ttgcgcaggt cccacgacgt cttggcgaac 3785641 tcctcgtcgg agaaccggcg gccggcctcc agcacgccga cccggtagcc cttttccgtc 3785701 agccgcagcg cggtgacgct gcccccgaaa cccgatccaa taatcaggac gtcgtaatcc 3785761 ggcttcatcg ctgcagtatg acccccttta catcgggcca gttaatcagt ctctcaggtg 3785821 gcgtcagccc ccaacggtca ggccgacctt ctggaactcc ttgaggtcgc aatacccggc 3785881 cttggccatc gatcggcgta gcccaccgac cagattcagg ccgccgaacg ggtcgtccga 3785941 cggcccgccc agcacccgcg ccagcggcgg ccgctcgccg accgcgatct gcagcaacgc 3786001 cccccgcggc aacgacgggt gcgccgccgc ggccggccag aaccatccct cgccgagcgc 3786061 ctcggccgat tcggctaacg gggtacccag caccaccgcg tcggcgccgc aggcgatggc 3786121 cttggccaac tcgccggaag tgtggatgtc gccgtcggcc aacacgtgca cgtagcggcc 3786181 gcccgtctcg tcgaggtagt cgcgccgcgc ggcggcagcg tcggcgatcg cggtcgccat 3786241 cggcacgctg atgcccagca cctcgtcggt cgtcgtcacc ccctgggtgg agccgtagcc 3786301 aacgatgacg ccggcggcgc cggtgcgcat cagatgcagc gcggtgcggt ggtcgagcac 3786361 cccgccggcg acgaccggta tgtcgagctc ggagatgaag gtcttcaggt tgagcggctc 3786421 gccgtcgctg gcgacgcgct cggcggagac gatggtcccc tggatgacca gcaagtcaat 3786481 accggccgca accagtaccg gtgtcagcca ctgggcgttt tgcgggctca cccgcaccgc 3786541 ggtggtcacc ccggcctcgc ggatgcgagc caccgcggca cccaacaggt cgggatttag 3786601 cggtgccgcg tgcagctcct gcagcaaccg gatcgccgtc gacggttcgg ggtcggccgc 3786661 tgcagcttcc aagagttggg cgatttttgc ctcgacatcg aggtggcggc cgatcagccc 3786721 ctcgccgttg agcacgccca gcccgcccag ccggccgagc tcgatcgcga actccgggga 3786781 caccagggca tcggtggggt gtgccaccac cgggatctcg aaccggtagg cgtccagctg 3786841 ccaggccgtg gagacgtcct tcgacgagcg ggtgcgccgc gacggcacga tgctaatctc 3786901 gctgagttca taggtgcggc gggcggtgcg gcccatgccg atctcgacca tctagatatc 3786961 caggtcgccg ttagcgcgcg tagtagttgg gcgcctcgac ggtcatcgcg acgtcgtggg 3787021 gatgactctc cttgaggccc gcgggtgtga tccggacgaa ttgcgcctgc tgtagcacct 3787081 cgatggtggg cgacccggtg tagcccatcg cggcgcgcag gccaccggtc aactggtgga 3787141 tcaccgacga cagcggacca cggaacggca cccgcccctc gatcccttcc ggcaccagtt 3787201 tgtcttccga cagcgcgtcg tcggcgaagt agcgatcctt ggaatacgac gtcgcccccc 3787261 cacgccctcg catggcaccc agtgatccca tgccgcgata actcttgtac tgcttgccgt 3787321 tcacgaagat cagctcaccg ggcgcctcgg ctgtgccggc cagcagcgag cccagcatgg 3787381 ccgtcgacgc accggcggcc agcgccttgg cgatgtcgcc ggagtactgc agtccgccgt 3787441 cggcgatcac cggcacgcca gcaggacgac aagccgctac agcttccaag atcgccgtga 3787501 tctgcggcgc gcccaccccg gccaccaccc tcgtcgtgca gatcgacccc ggccccacgc 3787561 cgactttcac cgcgtcggct ccggcgtcga ccagggccgc ggccgcggac ctggtggcga 3787621 cgttgccgcc taccacctca acccggtcgc cgacttcgga cttgagtttg cccaccatgt 3787681 cgagcaccaa ccggttgtgc gcgtgcgcgg tgtccacgac cagcacgtcg accccggcgt 3787741 cgaccaacat catggcgcgc acccaggcat cgccgccgac gccgacggcc gcccccacca 3787801 gcagccggcc gtcgctgtcc ttggtggcca gcgggtgttg ctcggtcttg acgaagtcct 3787861 tgacggtgat cagcccggtc agccggccgc ggccgtcgac cacgggcagc ttctcgatct 3787921 tgttgcggcg caacaggccc agcgccgcgg acgcactgac accctcttga gcggtgatca 3787981 gcggggcttt ggtcatcacc tcggcgacct gcttggactg gtcgacctca aaccgcatgt 3788041 cacggttggt gatgatgccc accagcgcac cgtcgtcgtc gaccaccggc aacccggaga 3788101 tccggaaccg ggcgcacagc gcatcgacct gggccaaggt gttgtccggc cggcaggtga 3788161 cgggatcggt gaccatgccg gcctcggatc gcttcaccat ctcgacctgg ccggcttgct 3788221 cggcaacggg caggttgcgg tgcaacaccc ccatgccacc cgcccgtgcc atcgcgatgg 3788281 ccatacgcga ctcggtgacg gtgtccatcg ccgagctgac cagtggcacc ttgagcctga 3788341 tcttcttggt gagctggctg gaggtatccg cggtggcggg caccacgtcg gaagccgccg 3788401 gcaacaacaa gacgtcgtcg aatgtcagcc ccagcatcgc caccttgtgc gggtcgtcgc 3788461 cgccggtggg caccgggtca gtagtcaggc cgcccatgcg aacgtacggg ctgaccacca 3788521 ggtcggagct gtcttccagg ccggacatgc cacgggacat cggtggggcc ctccatacgc 3788581 atgttttcag tgagaagccc atcctatcgg ctcgtaaccg cccggtgacg atgcgcgccg 3788641 cagcgctggc cgagaagaac cggacaatca caccgcgacg aggctgcgcc agcgtgtggt 3788701 cagcccgaca cgaagcgaga actcaatttc tggcgttatc accgcgtgct tgcgtagtgt 3788761 agaggggtgc gcgaccacct gccgccgggt ttgccgcccg atccgtttgc cgacgacccc 3788821 tgtgacccgt cggccgcact ggaggcagtc gagcctggcc agcccctcga tcaacaagag 3788881 cggatggccg tcgaggccga cttggccgat ctggccgtat acgaagctct gttggcgcac 3788941 aagggaattc gtggacttgt agtgtgctgc gacgagtgcc agcaagacca ctatcacgac 3789001 tgggacatgc tgcgttccaa tctgttgcaa ctgcttatcg acggcaccgt ccgcccgcac 3789061 gagccggcct acgatcccga accggactcc tacgtcacct gggattactg ccggggatat 3789121 gccgatgctt cgctcaacga ggcagcacca gacgcggaca ggttccgccg ccgctgatcg 3789181 cgctcgctag tgcgtcggac tcaccggcgt ttccggtgct ggctgccccg ccggattcgt 3789241 cgcctcgtcg gcgggttcca acgaggggtc aattgagccc ggctcgggtt ttgatgacgg 3789301 cgtgctgggg ctggctgcca ccgtggacgt ggagttcggc atggggctct ccgagacgcc 3789361 tgccgacatc gacggttcag ctgccgatgc aggcgtcggg ggggtcggcg gctcgacgac 3789421 tggagccagc ggagtccacg agtttcccac cgaacccgga gccgcagggt tggacggcga 3789481 gccgggccgc agcgtggcgt tcgggtcgcg cgtctccacc ttggtattca gcaggttcac 3789541 ctcgttgatc aggtcctgcc ggcggctacc gtcagtcacg gcctgcacgg tgctgctgac 3789601 ctcagccagc tcatcctgcg cctcggccca ttggccttgg gcaatcattt gctcgacctt 3789661 cgccagattg gccttggccg acagcacgat ctgatcgtcg ctgacccgcg atcggttgaa 3789721 catcatcgcg tgcaggccgt acaacaggtc cccggggcga gcatcggcca ccacggcgcc 3789781 gaacccgctc agcaccaaca gcgccgcggc caccgacccg acggccgcca ggctgcgacg 3789841 agcccgtcgc cgttgcgcta ccccggcgcg caacgcggcg acggcctcgt cctgtgaaac 3789901 cagggcactg gccggcggcc acctcaagtc gtcgcgccac tgtccgagca gggcggccaa 3789961 cgcgtcatcg cgaggatccg cgaagtcaac ctcctcccgt tcggcgagtg cgtcgagcag 3790021 cagatcggtg cgggccagct catccaatgg cggccgatcg ccaaggggat taccaaattc 3790081 acgcatagtc acctgccgca acaatctcgt ccttcagccg ctgaagtgca cggtgttggg 3790141 ccacccggac cgcccccgtg gtgctgccga cggcggcggc ggtctcttcc gcggacaggc 3790201 cgacgacaac acgcagaatg aggatctcgc gttgcttggc cggcaagatc tcaagcaatt 3790261 cgttcatccg ggtgaccgaa tcggcctcga tggccatctg ctccgggccg gcgtcggctg 3790321 accagcgctc aggaagcgtt tcggcgggat aggcccggtc acggccggct gcccgatggg 3790381 cgtcggcaac cttgtgcgcc gcgatgccgt acagaaacgc caggaatggc cggccgcggt 3790441 cccgatagcg cggcagcgcc gttatggtgg ccaagcacac ctcctgtgcc acgtcatctg 3790501 ctgacaggcc gctccgctcg accgtgccga ctcgcgctcg gcaatatcgc acgacgatcg 3790561 ggcggatggt ctccagcacc tcccgaagcg cgttccggtc tcctgccacg gcctccgcaa 3790621 ccacagcgtc gagacgttcc ccttgcattg tcatcgacgg cgatatctcc aacgttacga 3790681 agcggacaca tcccgggcta actcccggat cgaccataac ggcccaaccg cgttttaagc 3790741 ggtacgccag catccaccgg cgcgccgcac ctggcctgcg caaatattgc gtattttggt 3790801 gagttcgcgc agctgttgtg ctgaaaacgt gacggtgccg atatcgatca gcaagcatgc 3790861 cagcgcccac cgcagcggca agagcccaaa ccgggcggtg gcatcgagag cctcttcacc 3790921 gacggcgcgt gctcgcgcga cggcgccagc actgcacagc gcggcggcca acaccacgtc 3790981 gcttttgacg cggtggcgcg ccgacgcgac ggccatggcc tgcgtcagct cgaccgcttc 3791041 ctcggcatgg cggacagcag ttgcgccgtc gccggtggcc atcgccaact cggcggccac 3791101 ccaccgccga cgcaccgcca ggcggtccgc cacgagcggg gacaccacca acggatccgc 3791161 gcgatctaac aatgcccccg cggcggcgaa gcggccgacg ccaagcgcat cggccgccag 3791221 cccgatcagt gcatcggcac cagcttcccg atcggcgccg gccaacgcca aggcacgacc 3791281 atcccagccg cgcgccagcg tgtgccaacc aagctgccgc aacaacgatc cctgcgtact 3791341 atgcgccagc gatgccaacg ggcccgccgg caccaggcgt cgcagcaccg acaggtcgcc 3791401 ataggcgtgg gcgtaacgac cctgcccacc agcggccacg gcgcgcaacc acaagtggtg 3791461 cggcgtgatc gccgtcggta gcggccagct gcccggctgg tttccgaagg cggcagcgac 3791521 caacacttgc tcaaccaccg gagcgtgagg agtttcattc accgtgatag ccgtgccttc 3791581 atcagtaaaa agttggtggt ttcttcgtta acggcatatt actcacagct ttctttgcgc 3791641 taatttaggc gtactcacag catgggatga cctgggcaaa tacctcatct atccgcccgg 3791701 gatagcatgc ggcgcaggcg gcgaatgcgg cgcagatgaa cgcagagtta attctcacgc 3791761 aacggtccga tattgcacgc caacggacgc ctattgacgg aaattcggca gcgcccctag 3791821 cgtctatcct tgacggtagt catcggtgac gccactccac ttcagttgca caactcgcgc 3791881 gtccgcgaac ccaacctcca cttgggcgtg tcgtgcagag agggaatcag caatgccaca 3791941 gccggagcag ctaccgggac ccaacgcaga catctggaac tggcaattgc aaggcctgtg 3792001 tcgcggcatg gactcatcga tgttcttcca tcccgacggc gagcgtggcc gtgcccgaac 3792061 gcagcgcgaa caacgcgcca aggaaatgtg tcggcgctgc cccgtgatcg aggcgtgccg 3792121 atcccatgcg ttagaggtcg gtgagcccta tggcgtttgg ggtggcctgt ccgaatccga 3792181 gcgcgaccta ctcctcaagg gcaccatggg acgcacccgc ggcatccgcc gcacagctta 3792241 agccgcgcga gcagacgcta aagcccccgc acgctcggcg tgtcgggggc ttttgcgtct 3792301 gctgaccgga gttcagtgcg cgtgcccgtg gtgatggtcg tgatcttctg ccttggccgg 3792361 cttgtcgacc acgaccgtct cggtggtgag taccatccgg gcaaccgatg acgcgttcaa 3792421 caccgccgac ctagtcacct tgaccgggtc gatgacgccg tcagcggcca agtcaccata 3792481 gctcagggtg ttcacgttca gcccatgccc ggcgggtagc tcgctgacct tgttgaccac 3792541 caccgagccg tccaagccag cgttggcggc gatccagaac aacggcgcgg caagggcttc 3792601 ggagaacacg tcgacaccga ggacctcgtc accggtcagc gacgcacgca gttcggtcag 3792661 cgccttgcgg gcctggtgga tgagcgaggc tcccccacca gggacgatgc cctcctcgac 3792721 cgcggccttg gcggccgcga ccgcatcctc gacgctttcc ttgcgctcct tgagtgcggt 3792781 ctcggtggcg gcacccacct tgatgacagc aaccccgccg gccagtttgg ccagccgctc 3792841 gccaagcttt tcccgatccc aatccgaatc gctcttgtcg atctcggcac gcaagtgctt 3792901 cgcccggttg gccaccgctt ctgcggtgcc gccgccgtcg acaatgaccg tgtcgtcctt 3792961 gctgaccacc acgcgtcggg ccgagcccag cacctccaag cccacctcgc gcagcaccat 3793021 gccggcgtcg gggttgacca cctggccacc cgtcaccacc gccaggtcct caaggaacgc 3793081 cttacggcgg tcaccgaagt acggcccctt gaccgcgacc gctttcaacg tcttgcgaat 3793141 cgcgttgacg accagcgtcg ccaacgcttc gccctccacg tcttcagcca cgatcagtag 3793201 tggcttaccc gttcctgcaa ccttttccag caatggcaac agatcgggaa gcgagctgat 3793261 cttgtcttgg tgcagcagga tcaacgcgtc ctcgagcacc gcctgctggt tatcgaagtc 3793321 ggtaacgaag tatgccgaca agaagccctt gtcgaagccg ataccctcgg tgaactccaa 3793381 ctcggtgccc agcgtcgagg attcttcgac gctgaccacg ccgtcgtggc cgaccttgct 3793441 catcgcttcg ccaaccaggt caccgatctg ctcgtcgcgc gaggacaccg tcgccacctg 3793501 cgcgatgccg gtcttgccgg acaccggcgt ggccgatgcc agcagcgcct cggataccgc 3793561 gtcggcggcc ttgccgattc ccacgccgag cgcgatcggg ttgacgccgg cggccactag 3793621 cctcaggccg cccttgatca gtgcctgcgc caagatggtt gcggtggtgg tgccgtcacc 3793681 ggccacatcg ttggtcttgg tggccaccga cttcaccagc tgggcgccca agtcttcaaa 3793741 cggatcttcc agctcgatct cacgtgccac cgtgacgccg tcgttggtaa ccgtgggtcc 3793801 gccaaacgcc ttggccagca ccacatgccg gccgcgcggc cccagcgtca cccgcacggt 3793861 gtcggccagc ttgtccatgc cgacctccat ggcgcgacgc gcggtttcgt cgtattcgat 3793921 cagcttgctc atcaggctcc tctacgcagg gctagtccgc taacgcatgc cgccccggaa 3793981 atcacccgtg gtgagcacgg ggatcgccgg ggcggaacac gctctactac ttggaaacga 3794041 cggccagcac gtcgcgtgcc gacaggatca ggtattcctc gccgttgtac ttgatctcgg 3794101 tgccgccgta cttgctgtag atgacggtgt caccctccgc aacgtccagc gggatccgct 3794161 tctcgccgtc ctcgtcccac cggccagggc cgacggcaac gacggtgccc tcctgcggct 3794221 tctccttggc ggtgtcagga atgaccagac cggacgcggt cgtggtctcg gcctcgttgg 3794281 cctgcacgag aatcttgtcc tcgagtggct tgatgttcac cttcgccacg attggagccc 3794341 tccactattt ggatcagagc ccgggacgct cgcccggacc ggagttggcg gtcggtccgg 3794401 ggcgtgcccc ggaaccgtcc gaattaccag gtgattcggc attcgtccgc gccctcgcgc 3794461 cgtcgtcgcg ggtgccgacg caggggttag ccgattgcca tctagcactc tatacatgag 3794521 agtgctagca ctcaagggcg cccccttgct tcctggttgc cagcgtgtcc gggtacgcca 3794581 ggtgcaatgt ccgggtcacc gcacctgccc ctgcatcacg ggcagacccg ggtcactggg 3794641 cacgtccagc ggcgacggcg gcgctcccgc ggccaccagc tgcgcggcga acgccgcgat 3794701 catcgccccg ttgtcggtgc atagccgggg actggggatc cgcaacgtcc ggcccgcctc 3794761 gccgcagcgc tgtgtggcca gctctcgcag ccgggagttc gccgccactc cccccgcgat 3794821 cagcagcgtt gagacgccta gcgcagtggc ggcccgtacc gccttcatgg tcaacacgtc 3794881 cgcgacggcc tcctggaatc cggcggcaat gtcggcggta cggaagcccg ggtcagccgc 3794941 gtggctttcc acataccgcg cgacggccgt cttgagcccg gagaagctga acgcatagcg 3795001 gtcatcggcc gggccactca tgccgcgcgg gaaaacgatg gcgtcccgat caccggtgcg 3795061 cgccaggtcg tcgagcgcct tgccacccgg atagcccaat cccagcaacc gggccacctt 3795121 gtcgtaggcc tcgccggcgg cgtcgtcgac ggtgctgccc agctcgatga tcggctcacc 3795181 gagcgagcga acgtgcaaca ggtgggtatg tcctccggac accaacaacg ccacacactc 3795241 gggcagcggc ccgtgttcgt agacgtcggc ggccaagtgc ccgcccagat gattcaccgc 3795301 atagaacggc accccccaag cagccgaata tgccttggcc gcagccactc ccaccaacag 3795361 ggcgcccgcg agcccgggac cgatggtggc cgcgacaatg tctggctgtt tcaagccggc 3795421 ggccgccagc gcgcggcgca tcgcgggacc cagtgcctcc aggtgcgcac gggaggcaat 3795481 ctcggggacc acgccgccga accgaacatg ctcgtcgaca ctggaagcca cctcgtcggc 3795541 caacaatgtc acggtgccat cgggatcgag ccgcgcgatg ccgacaccgg tttcatcgca 3795601 ggaggtttcg atgcccaaga ctgtcgtcat gacgggtccc ccgaatccct acgcatcgtg 3795661 tacgcgtcgg cgccgctgac ccggtaatat cgccggcgca agccgacccg ctggaatccc 3795721 acgctgcgat acagcgcaag agcggcgtca ttatcggtgc ggacctccag gtagaccaca 3795781 ccacccctgg caaagtccag cagttcgcgc agcaaccgac ggccgatgcc ccgcccctgg 3795841 taggccgggt ccacgccgat ggtgtgcacc tcgtactcga acggcggtgt tcggcccaac 3795901 cgcgagattc cggcgtaacc gaccagcgtg ccaccgctgc gcgcacccac atagtggttg 3795961 tgcgggctgg ccagttcgcg gttgaacgcc gccggcggcc agggatcgtc accgacgaac 3796021 agctgggcct ccagctcggc gcaccgctgg gcgtccgcgc gcgtcagcgc gccgatggtg 3796081 acgggctcgg tgtcggccgt cacgtgcaaa ccgccagcgg cttggcatcc ggccggcgaa 3796141 gatacagcgg cactaacggc gccggcttgt cggcccagtt caccgcggct accagacccg 3796201 ccggcgacgg gcggctgggc tcaacgcagg ggagcgcgaa cagcgccgcg tgctccggcg 3796261 caccggcgac cgccaatgcc gggccgggat cgacgtcggc cgcggcatta acggctggtc 3796321 cgaccgtacg aatcccgtcg cagtagcgtg cccagtagac ctcacgccgg cgtgcatcgg 3796381 tgaccaccag cgtgtcaccg atggtttgcc cgccgatggc gtccaggctg cacacgccat 3796441 acaccgggat gcccagtgcg tgcccgtacg cggcggcgga ggccatgcct gcgcgcagcc 3796501 cggtgaacgg gcccggaccg cagcccacca cgacggcgtc caggtcggcc attgtgagcg 3796561 cggcatcggc aagcgcagcc agcacgttgg gagtcagccg ttccgcgtgc gctcgggcgt 3796621 cgacggtgac cctctcgccc agcacaacca gatcatgacg ccgcacgata cccgccgtga 3796681 ccgccggtgt agcggtgtcg atggccaaga cggtgcttat ttgcacgcgg ctcatgaccg 3796741 gccccacgac caagtcgcga tcctggtgtc ggagtggcta acccgctcca ggcggacgtc 3796801 gaggtggcgc tgcgagagcc gctcggccag gccctcgccc cactccacca cgacgacggc 3796861 gtcttcaaga tcggtgtcga ggtccagtga gtccagctca ctcagcaggt cggcgctgtt 3796921 gtggtccagc agtcggtaga cgtcgacgtg gaccatcgcc ggcgtgcccg gccgccgcgg 3796981 ccggtgcatt cgcgccagca cgaacgtcgg cgatgtgatc ggcccctcga catccatcgc 3797041 catggcaata cccttggcca gcaccgtctt tcccgcaccg agcggaccgg agagcaccac 3797101 cacgtcgcca gcgcacagct gctcacccag ccgggacccc agcgttaggg tgtcctcgac 3797161 gcgcggcagc gtcgccgtgc cgccgcccgt aagcccagcc ctggctttcg gtcgtctgcg 3797221 gataccctca cggctcaacg gttttcagcc tcgcgatagg tcctggtgat acgtcctcgc 3797281 gggctggtga ccacttcgta gtggatggtg ccgacaagat cggcccagtc ctgagccgtg 3797341 ggctcacccc ggatgcccgg cccgaacaaa atcgcctcgt cgccttcggc cacatcaagc 3797401 ggcccggggc ccaggtcgac catgaactgg tccatgcaga tccgccccac accggggcat 3797461 cgtctgccgt tgatcagcac ctccagccgc ccgcccagcg accggaacac gccgtctgcg 3797521 taaccgatcg gcagcagcgc cagattggtg tcgcgtggcg cgatccatgt gtgcccatac 3797581 gacacgccct cccccgcacg aatcgatttc accagcgcaa cagcacattt cacggtcatc 3797641 gccggcacca gccccatgtc accgagggcg ggtaccgggc ttagcccata caccgcgatg 3797701 cccggccgca ccaggtcgaa cgtcaggtcg gggcgcgcca tagttgctga tgagttcgat 3797761 agatgcgcca cctcgaaccg caccccttgt tcgcgggcct gcgccagaaa ggcggtaaac 3797821 cgttgggcct gaacatcgtt gatggaatcg tcaggcttgt cggcgtaaac catatgcgac 3797881 atcagccccc gcagccggac ggcgtcctcg gccatggctt ggcgtaacgc ggtcagcatg 3797941 gccgggaatt gtgccggtcc cacgccattg cggttcagcc cggtatccac cttgacggtc 3798001 accgtcgccg tccggccggt ccggcgcacc gcgtgcaaca gttcgtcgag ttggcgcagc 3798061 gaggacaccg cgacctgcac gtcggccagc agcgcgggcc cgaagtcgat gccgggcgga 3798121 tgcagccagg ccagcaccgg tgcggtaatg ccatcagcgc gcagcgctag cgcctcgtcg 3798181 acggtggcga cgccgagttc ggccgcaccg gctcccaggg cggtttgggc gacgcgcgta 3798241 gcaccgtgac cgtagccgtc ggccttgacc accgccatca gctgcgcgtg gccggcgtgc 3798301 tcacgcagca cccgcacgtt gtgttcaata gcgcccagat ccaccatggc ctcggcgagg 3798361 aggccaggtg tctgggatat cggtgtcatg gccaacgaag tcgtgccccg cccatctgtc 3798421 gtgtcgtttg gctttccgac attctcccag aaccgtttca ctgagcagta ttccggcctg 3798481 tgcccgattg ccccgggtcg cggtgctggg ctgcagccgt gtcggcgtga ctgtcctgtg 3798541 gctcggtggt tggttgccga tcacccggtg tttggctcag attgccggtg ccgcatgatg 3798601 gttggcgtca atagagtgcg gatcggccgg catgaattga cgggagcgta gcttgaccgc 3798661 ggcccatcac ccgtggcagg aaacagttgc agtgtgtact attcgcccta gactgccgca 3798721 gttccggggg aagtgaacct attgcgcccg tgcatcactg cacgggtatg ggctttggcg 3798781 gtcgcttcgc accatcaacg ccgacagtgc ggacagcgca aaccgacggc acaccccttg 3798841 cacggatgtg gggtgttttt gagatggagc gaaagtaggc gtgtctttta ttttcacaac 3798901 cccccaggca ttggacaacg cggctaagtc cgtgtcgggg attcacgatt tgtggcgcaa 3798961 aggacgctaa ggcatcgatc ccggtggtca acgctatttg agccccccgc ttccgacccg 3799021 gtgtcgaata gggatgaggc cgctcctccg ccagcacatg aggcagtatc accagatcag 3799081 ctttccggcc atagagcatc gtcaccgggt taggcatggt ttaggcagcg cttagctgag 3799141 aacgccgagg cgtgtcggct cgccgaggcc caaaacagca caaccttgca ctgatctagc 3799201 tgaagaccaa accggcacag cagacattgc catacgcgac aacagccgtc atcaaccgaa 3799261 aggagcaaag aacaaacaga tgcatccaat gataccagcg gagtatatct ccaacataat 3799321 atatgaaggc ccgggcgctg actcattgtt tttcgcctcc gggcaattgc gagaattggc 3799381 ttactcagtt gaaacgacgg ctgagtcgct cgaggacgag ctcgacgagc tggatgagaa 3799441 ctggaaaggt agttcgtcgg acttgttggc cgacgcggtt gagcggtatc tccaatggct 3799501 gtctaaacac tccagtcagc ttaagcatgc cgcctgggtg atcaacggcc tcgcgaacgc 3799561 ctataacgac acacgtcgga aggtggtacc cccggaggag atcgccgcca accgcgagga 3799621 ggtgcacagg ctgatcgcga gcaacgtggc cggggtaaac actccagcaa tcgcaggact 3799681 cgatgcacaa tatcagcagt accgggccca aaatatcgct gtcatgaacg actatcaaag 3799741 taccgcccgg tttatcctag cgtatctgcc ccgatggcag gagccgccgc agatctacgg 3799801 gggcgggggc gggtaggtcc agaaggccgg ggcggaacct gtcaacattt ctgagacacg 3799861 attttcgggg atttattgag tcggctggtc ctccttcggt ggtgggttga tcgcgctgaa 3799921 ggccggtagc gcgggtggct cgggtggttt gcgaacgaat ccgctcgagg tggtctcggt 3799981 aggcggtgtc cagaacggtg gcgcggtgcc ggcggatctg atcggcgcgg ccgtagtgca 3800041 cgtcggcggg cgtgtgcagt ccgatgccgg aatgcttgtg ttcgtggttg taccagccga 3800101 agaaccggtc gcagtgcacc cgggccgcct cgatcgactc gaaccgtttc gggaagtcgg 3800161 gccggtactt gagggtcttg aactgggcct cagacaacgg gttgtcgttg ctggtgtgcg 3800221 ggtgtgagtg cgacttggtg acaccgaggt cggccagcag cagtgccacc ggtttggagc 3800281 tcatcgacga gccgcggtcg gcgtgcaggg tcagctggtc ggcgctgatg tgctgggcgg 3800341 caagggtttg cgcgatcagc cgctcggcca agaccttcga ctcacgcgag gccaccatcc 3800401 acccgaccac gtagcgggag aagatgtcga ggatcacata caggtagtaa tagctccact 3800461 ttgctgggcc acgcagcttg gtgatatccc acgaccacac cgaattcggc tgatgagcaa 3800521 ccaactctgg cttcaccgca gccgggtggg tggcctggcg gcggcgatca ccggtctggc 3800581 cgcgctcacg cagcagccga tacatcgtgg actcgctgca caggtagatg ccctcgtcga 3800641 gcagcgtggc atataccacc gccggcgcca tgtcagcgaa gcgctgcgag ttcagcaccg 3800701 ccagtacgtg ctcacgttcg gccgcactca gcgcccgcgg ctgcgcgctc tcccgcggtc 3800761 ccgacgggtc ggtcaccgcc gtgctggtga acgtatccga ttgtgccgac aaccgtttcg 3800821 agtgggcccg gtagtaggag gccggcgcac gaccggtcgc cgcacacgcg gcccgaaccc 3800881 cgatcaacgg gatcatctcc tcgatggccg tgtcgatcac gctcagcgct cactctcgca 3800941 catcgccgcg ctgtcggctc agagcctctc caagagcgcg gacagttccc cctgcacacg 3801001 gatcacctcg cgtgcggtgt cgagctcggc gcgcagccac gcgatctcgg cgtcagcggc 3801061 attggcgccg gccttgcccg gcttggggcc ccgccgcgcc gacagcgccg ccaacgcccc 3801121 ccgatcacgc tgatggcgct attcggtcag caacgacgaa tacaggttct cccgccgcaa 3801181 gatcgcaccc ctttccgtgc gatcggcgcg gtcatactca tcaaggatcg ccagcttgta 3801241 cttcacggtg aacgtacgcc gctgcgcccg ctcaggcacc tgaggatcag gcacctcgtc 3801301 cacggtgacc gacgaacccc gtcggccagt accagcccta ttagtcaacc tcgttctctt 3801361 cgtactcgcc ctcaggctca gtaaacatct ccactcgcag tgtctcactc aaggttgaca 3801421 gagagggtcg gcgacgcggt cccactgagc gccgacctcc tcagggtcgg tgtgggcgaa 3801481 aatcgtcttg accgccacgg tcaccgccgg ggcgtgtttg gccgctaccg cggtgtacag 3801541 gtttcgcatg aaatgcaccc ggcaacgctg ccacgacgcc ccactgaact gttgtgccac 3801601 agcggctttc agcccagcat gggcatcgga gatcaccaga tgcaccccgg tcagcccacg 3801661 cgctttcagt gaggccaaaa actcacgcca gaactcgtaa gactcgctgt cacccacagc 3801721 ggtgcccaac acttcgcggg tgccgtcgat ggacaccccg gtggccacca ccagagcctg 3801781 agacaccacg tgcgccccga cacgcacctt gcagaaggtc gcatcgcaga acacatacgg 3801841 gaactcggtg tgggtcaagc tgcgggtccg aaacgcctcg atctcggtgt ccagaccggc 3801901 gcagatgcgt gagacctcgg atttagacac cccggcctgc acgcccatcg cggccaccag 3801961 atcatcgaca ctgcgcgtcg acaccccgtg cacgtaggcc tccatgatca ccgcgtgcaa 3802021 cgctttatcg atgcggcggc gccgctccaa aagcgacggg aagaacgaac cggcccgcag 3802081 cttggggatc tgcacctcga tatcgccggc cgtggtcgac actgtcttgg gccggtgccc 3802141 attgcggtgc acgatgcgcc catcggagcg ctcgtagcgg cctgcaccga tcgcctcggt 3802201 ggcttcggcc tcgatcaacg cctgcaaccc ggcacggatc agctcggcaa acaccgccga 3802261 ggcatcagca gcttcactcg cgttacggac cgctcagttg cttcccccaa cggggctttc 3802321 gacgctgggc ttcgaccctg cccgtttcca aaccaagcgg ccagcctgct accgggcctc 3802381 ctgacagcta cccggaccgg actcccaccg gcaggcgacg acgagctttg atcaggtcat 3802441 gacctaagac atcacctcct gatcactggg cgcaccggct gcagtactag tgcgcgaaat 3802501 gctgtgcgtc gaagtggcca cccggcttga ccttgtccag ggcagccaac gcggtgaccg 3802561 cgtcgtcgtg cagggcccgc gccaggtcgg cggagagtcc ttcccgaacc acgatccgca 3802621 gcaccgccac gtcggtggcg ttgtccggca tggtgtaggc gggcacctgc cacccgaagg 3802681 tccgcagctc atgggagacg tcgaactccg tgtacccgcg gtcgccggcg agccggaagc 3802741 tgaccaccgg gatcgccgaa ccatccgaga tcacctcgca atgatccacc tcgcgcagct 3802801 ggtcacccag ccaccgggcg gtgtgcgaca gcgcctgcat caccttggta tagccgtcgc 3802861 gccccagccg caggaagttg tagtactggc ccaccacctg gttaccggga cgggagaagt 3802921 tcagggtgaa ggtcggcatg tcgccgccga ggtagttgac ccggaaaacc agatcctccg 3802981 gcaggtgctc gggcccgcgc cacacgacaa acccgacgcc gggataggtc agcccatact 3803041 tgtggccgct gacgttgatc gacaccacgc ggggcagccg aaaatcccat accaggtccg 3803101 gatgcaaaaa cggcaccaca aagcccccac tggccgcgtc gacgtgtacc gggacgtcca 3803161 cacccccgcc agccgccagt ttgtccagcg cggcgcagat ctcggcgatg ggttcgagtt 3803221 caccggtata ggtggtgccc aagatcgcca ccacgccgat ggtgttctcg tcgacggcgg 3803281 cgagcacctg ctcgggggtg atgacgtagc ggccccgctc catcggcagg taacggggtt 3803341 cgacgtcgaa gtagcggcag aacttctccc acaccacctg gacgttcgaa cccatcacca 3803401 gattgggcat gcgccccttc caagacccca cccgttgccg ccaacgccat ttcagggcca 3803461 gcccacccag catcaccgcc tcgctggagc cgatggtgga caccccggtg gcgctggtgg 3803521 ggtcgtggtc gcgcagaccc tcggcgtgaa acaggtcggc gaccatggac acacagcgcg 3803581 cctcgatggc cgcggtcgcc gggtattcgt ccttatcgat catgttcttg tcgaacgtct 3803641 cggccatcag cttttcggcc tccgggtcca tccaggtggt cacgaaggtg gccagattca 3803701 gccgcgagct accgtcgagc atcagctcgt cgtggatgaa gcgataggcc gcctcgggat 3803761 ccatcgactc atcgggcatc cgcagcgccg gcaccggtgc ggtgaacatc cgaccggtgt 3803821 aggccggagc gatcgaatgc gcgggcacgg acgggtgact gcgagacacg gcggatcctt 3803881 tccgggcttg ttgcggactg gcaggactac agggcagcca gagcggcccg aatgtggccg 3803941 ctgatgcgcg acgccgacgt gggcgcatcg ccggggccgg gatcggcggc cgcggccgcc 3804001 gccgcccggg cgtgcacgaa cgccgcggcc gcggccgcct ccccagacgg caatcccgac 3804061 gccagcagcg caccgatcat cccggacagc acgtcaccgg acccggcggt ggccgcccag 3804121 gactggccgg ccggattgag atagaccggg ccgccgggat cggcgatgac ggtgacattg 3804181 cccttgagca gcacggtggc gcccagcgcg tcggccagct ggcggcaggc ccccacgcgg 3804241 tcgtcaccgg gcggcgcccc ggccagccgg gcgaactcac cggcgtgcgg cgtcaagacc 3804301 gtcggggcgt tgcggcccgc caccagatcg gggtggtccg ccagcatggt cagcccgtcg 3804361 gcgtcgacca acaccggcag gtcggtgtcc agcgcgaacc acaacgcggc ggccccggct 3804421 tcgtcggtgc ccaggcccgg cccgacgacc caggcctgca cccgcccggc cgccgccggg 3804481 gtgggcgagg cgatgacctc cggccagtgc gcgaggactt ccgcatgggc ggtcccggcg 3804541 tagcggacca tgccggaggt ggcggcgacg gccgccccgg tgcacagcac ggccgcaccc 3804601 ggatacgtcg acgacccggc cagcacgccg gtcacgccct gggtgtattt gtcgtcgcgg 3804661 ggaccgggca ccggccagcg cgcggccacg tcggtagcct cgaaacccaa cacgtcggtg 3804721 tgcgccaggt ccagcccgat atcgacaagg acgacgcggc cgcagtcggc cagcgcgtgc 3804781 accggtttga gcccgccaaa ggtgacggtc agcgcggcgt gcacggcggg gccggtgatc 3804841 gccccggtcg ccacatcgat gccgctgggg atgtcgacgg cgaccaccgg tatggcggcg 3804901 gcctgaaccg cggcgaacac ctgcgcggcc gccggtcgca gcggccccga gccggagatg 3804961 ccgaccaccc cgtcgatgac gagatcggtc gccgccgaga cactctcgac gaggcgaccc 3805021 ccggatttgg tgaacgccgc cagcgccttg cgatgcgtgc ggtccgggtt gagcagcacc 3805081 gcgtcggcgg cggcgccgcg gcgtcgcagg aacgtcgccg cccacagcgc gtcgccaccg 3805141 ttgtcgccgg atccgacgac cgcgcacacc cggcggccga ccaccccacc cgtgcgagcg 3805201 gtcaactcac ggccgatctc ggtggccagc ccgaaggccg cgcgtcgcat cagcgcaccg 3805261 tcgggcaggc tggccaacag gggcgcctca gccgcgcgga tggtgtcgac agagtagtag 3805321 tggcgcatct caggcccgcc gtcctcgggt gccgcgcctg tgcagcagac ttttgattct 3805381 ggccggattc cacagccgac cgtcgcgttc ccgggccatc ggatagaaca gcagaccaat 3805441 caggatggac gcgaagtgac cgaccgtggt gaagtccagc tcggctttgt ccatcgcgat 3805501 cagcggaaaa ccaaagatga ccagcagcac cccgagatag ccccagcgcc acggtttggc 3805561 gatgtgatag gtcaataccg ccatcacacc gaccaggaag tagctgaccc cgatatcacg 3805621 agcgtgcacc atcctttcgg aggcgtctcg gtgctggatc gccagataga gcaggccttc 3805681 gctcaaatag gtggcaccga tgtgagcggt caatcccacg gtgagccaac gcaagtggcc 3805741 gagccaatgc tcggcgggcg ctaggaacag ggtgaacagc agcaggtacg gttccaaatt 3805801 ccggccgtcg atccacaaca ggctggaaaa cagcacctcg agcggatcgc gccccaactc 3805861 ggcgatgttg gtggaccggt gcaggagcac gaaatgcagc tggctcccgg tgagattgtt 3805921 ctggatgatc gtggtgatca ccaacacgac cagccaggca taggtcaacg gggcgttgct 3805981 gacgaagtgc cacaccgcga gcgcccacga tcgcagccgt gccaccaccg atgcgtccgc 3806041 cacgggtcaa cacttacatg gtttcgtcga cgtcaggctt caggtgccac caccagcaga 3806101 ggatatacag caccatcgag gtcaccaatg cgacgaccac ccagagtacg accgtgacgt 3806161 cgatccaaaa tccgattggg ggcgcgtcgg gaagcgcatt gcgcagcggt atcaccgcaa 3806221 aaagcattgc cgcataccac gttgtcatcg gcggctggaa ttgccgccgg ccacgtgcgg 3806281 tttgaaccgc aacgaacagg cccaccccgg ccagcgcgat caacacaccg acgatgacgg 3806341 tgccgaatgc cacgctgctc ggcgatcggt gcaaccccac tcggtacggg gcaggcacat 3806401 tggcgtcgcc gacaccggaa atgtcgacgt tccagcccgg aagccggtcg acgaatgtca 3806461 ccgacacacg ttccggcgcg tgcgcggctc cgcggtagag ctggaccgtg atcggccccg 3806521 aacggtagtg gtcgaacggc caattcgcgg gatccccgga gatggtcagc gggacgggaa 3806581 agacgccggg cagcgaacca ctcgaccagg tgcgcttggt aggcgttacc acggatgtga 3806641 ccgtgacggt gaggtcgtcc ttgaggccct gggtttgcga atccagcagc tcagtcccag 3806701 gtgacacggc gaggttggca accagcacgc ccttgatcgt ctgaagctgc tcgacgtgca 3806761 gggtcaccgt ggtcccgtcg gccgtcggcc gaccgtgggc gacttcatga ggtcggccga 3806821 ggccggtgct gtgatacaac gcgatcacgg tgacgtaggc cgcaatcacg agcaccaaac 3806881 cgacaacgac tctcaggatg cgtcccaact cgctacccgc ccacttgtgc gttccggccc 3806941 ggaaattgta accgcgggac ccctccgtca gcggatgcca ccgccaggcc acgtgattgt 3807001 gcgacagccg ccatcttcct gtggtaggtg atcatcgccg tcaactccgc acccaacgtc 3807061 tccgcggtcg ccacgtggat cgcgtcgagt gtcttgggcc gccaggtccc gtaccgtgag 3807121 ccacctcgac tattcgacgg tgacggactt ggccagattc cgcggcttgt cgacatcgta 3807181 gccgcgggcg cgcgccaccg aagccgcgaa cacctgaagc ggaatggttg atagcagcgg 3807241 ctgcaatagc gttgacaccg ctgggatttc gatcaggtga tcggcgtagg ggcgcaccgt 3807301 ttcgtcgccc tcctcggcga tcacgatggt caccgcaccg cgggtctgga tttcacggat 3807361 gttggacagc agcttggcgt gcagcgtggc cgaccccttg ggtgagggca tgacgacgat 3807421 gaccggtagg ccgtcttcga tcagcgcgat cgggccgtgc ttgagctcgc cggccgcgaa 3807481 accctcggcg tgcatgtagg ccaactcctt gagtttgagt gcaccctcca gcgccaccgg 3807541 atagccgaca tggcgaccca ggaacagcac ggtcgacgac tgggcgaacc ggtgggccag 3807601 ctcggccacc ggtccggtcg ccgcgatcac ccgggccacc aggtccggca tcgcttccag 3807661 ttcgtggtac tcgcgctcga cctcgtcggg gtatttggtg ccgcgggcct gcgccaaggc 3807721 aaggccgagc agatagttgg cagcaatctg cgccagaaac gtttttgtgg acgccacacc 3807781 gatctccggg ccggcgcggg tgtagagcac cgcgtcgcac tcgcgcggga tctgcgagcc 3807841 gttggtgttg cagatcgcca gcaccttggc tttctgctcc ttggcgtgtc ggaccgcttc 3807901 cagcgtgtcg gcggtttccc cggactgcga gatcgccacc accaaggtgc tacggtccaa 3807961 caccggatcc cgataccgaa actcgctggc gagttccact tccacgggca gccgcgtcca 3808021 gtgctcgatc gcgtacttgg ccagcagccc ggagtgatat gcggtaccgc aggccaccac 3808081 gaacaccttg tcgatctcgc gcagttcctg gtcgctcaac cgctgctcgt cgagcacgat 3808141 ccggccaccc acgaagtgtc cgagcaaggt gtcggccacc gcggcgggct gctcggcgat 3808201 ctccttgagc atgaagtact cgtagccgcc cttttcggcg gcagccagat cccagtcgat 3808261 gtggaagggg cggaaatcgc gcccagcttg taggccatcg ttgccgtcga aatcgctgat 3808321 ccggtagccg tcggcggtga tcaccaccgc ctggtcctgg ccgagctcga ccgcttcccg 3808381 ggtgtgctcg ataaacgcgg ccacgtcgga accgacgaac atctcgttgt cgccgatgcc 3808441 cagcaccagg ggcgtggaac ggcgggccgc cacgagggtg ccggggtcgt cggcattggc 3808501 gaacacgagc gtgaaatgcc cctcaagccg gcgcagcacg gcaagtacgg agccgacgaa 3808561 gtcatcggcc gtctcgccgt gccgatacgc ccgcgccacc aggtgcgccg cgacctcggt 3808621 atcggtgtcg ctggcaaact cgacaccggc agtctccagc tcccggcgca agacggcgaa 3808681 gttctcgatg atgccgttgt ggacgacggc gatcttgccg gcagcgtcgc ggtgcgggtg 3808741 cgcgttgcgg tcggtgggac gaccgtgggt ggcccagcgg gtgtggccca ggccggtagt 3808801 accggacagc gccgtggacg gcatttccgc cacggcttcc tcgaggttgg ccagccggcc 3808861 cgcacgccgg cgcacggtga gtgtgccacc gtcgaccagc gcgatgcccg acgagtcgta 3808921 gccgcggtac tccatccggc gcagcgcgtc catgacgacg acgtaggcgg ggcgccgccc 3808981 gacgtaaccg acaattccgc acacagcaga ccagggtagt gcagcatggt cggtagggca 3809041 gtcccgtcgc ccaaccgacg ctatcgtcga gtttggccac cgcgcacgaa aggccaacac 3809101 ttgtccaacc catatgccca gcaccagctg aagctcatca ggcacacggg tgcgctgatc 3809161 ctgtggcagc aacgcaccta cgtggtctcc gggacgcgcg agcaatgcga agcggcgtac 3809221 aagtcggcgc agacctacaa cctgctcgtt ggttggtgga gtttggtgtc gctccccgcg 3809281 atgaactgga tcgcgctgat ttccaacttc aatgcgattc ggcgggtgcg agccgccgcc 3809341 gacggggcgt ccgttcccca cggcccgcac gccatcgccc atccagccgt tccccgggga 3809401 cccataccgg cgggctggta tccagacccg tccggggcgg gactgcgtta ctgggacggt 3809461 gcgacgtgga cccactggac ccatccgcca cgtcaccgct aacgtcgacg ggtgccccgg 3809521 atccgcaagc tcgtcgccgc cctgcaccgc cggggaccac accgtgtttt gcgcggtgac 3809581 ctggcttttg ccggcctacc cggggtggtg tacacccccg aggcggggct gcaccttccc 3809641 ggtgtcgcct tcggccacga ctggctcacc ggcacctctc gctattcggg tctattggag 3809701 catttggcgt catggggcat cgtggccgcc gcccccgaca gcgagcgcgg actggcccca 3809761 tcggtcctga atctggcctt cgatctgggc gttgccctcg acatcgtggc cggtgtccgc 3809821 cttgggcctg gaaaaatcag cgtgcacccc gccaagctcg ggctggtggg ccatggtttc 3809881 ggtggctcgg ccgccgtgtt cgccgccgcc ggcttgaccg gcacgcacgt caagtccgtg 3809941 gcggcgatat tcccgacggt gaccaatccg gccgcggagc agccagccgc gaccctagac 3810001 gttccgggac tgattctgac cgcacctggc gatccgaaga cgctgacctc caacgccctc 3810061 gggctatccc gggcttggga taaggccacc ctacgcatcg tcagcaaagc ccgagccggt 3810121 ggtctggttg agggcagacg actgacgaag gtgttggggc tcccaggccc acaccgccgg 3810181 acgcagcgtt cggtccgggc gctgctgacc gggtacctgt tgtacacgct cggcggcgac 3810241 aagacttatc gcaggttcgc cgatccagac ctgcagctgc ccaagacgga cccgatcgac 3810301 cctgaagcgc cgccgatcac cccgggggag aagatcgtga cgctgttgaa gtagcgcggg 3810361 ataccccgac ccgtcacggc cccgcctgcg gaagctcgtc ggcggcgatc tcacaggggg 3810421 tggctccctc ggacagcgct tccggcgaag gcccattcgc cggttccggc gcacccggcg 3810481 gcgccggcgc agcgaccgga ggcggtggtt cggcgaccgg aggcgcggca ggcggcggcg 3810541 gttcggcgac cggcggcagc gtggcctcct cggcgacatc ctgggcacgt tcagcgggca 3810601 ccgattcgtc ggcgtcgtcg acctcgtctg cttcgtcggg ctctgttgct tccttcggct 3810661 ccgctgcctc gtcggcctct tccgggtggg catcgtcgcc gtcatcagcg ttgtcggccg 3810721 cgtcttcagc gaacggatcg acggcacccg gcggattgtc agctgccaac ggatccccca 3810781 gctgttcggc caccgaaccc agcaggctat ccaccgcatc gacgatccgg ttggcaagcc 3810841 cggcaagccc ggcaaacccg cccagaccgc cggtgccgcc ggcatcgccg aaaccaccgg 3810901 cgctgccaac acccgccggc gtcgcggaac catcacctgg cgccgatcca aaatccgacg 3810961 gcgtcactgg ccgcgaggtc acggccggca ccggatccgg cgggggaaga gcggccgcgg 3811021 gcgtaatcgc tgccgtcgcg ctcggttgag ccggcaccga tgccggagaa ggttggcgac 3811081 cgggcccgag atcgtccgga atctcgaagt gcgcgcgcgg cgcgctggcc agctgatcgg 3811141 tgaccgcatc atacgacgcc gccacaccgg ccgttgtcga tcgcatcgtg gtcagccagt 3811201 cgttgcgaac atcgtcgtcc acgtagggct gtatctgttg gcgaaccact tcgacggccg 3811261 tcggccgatc tgccccctcc gtcgtgagcg cttcggccgc agccaaccat gccggccgct 3811321 gcgccagggc acgctcgtcg atcgcaatgg ccgtcgcgac tttggagtcc accagctgcc 3811381 agaggttgtc gcgcagcgat tcgcagcgtt gggccgcggc acggacttcg gtgaccaccg 3811441 aatttccagt ctcacagtga cgctgcacaa agtgcaccgc cgcgtcggcc cccgatcccg 3811501 tccatgccgc tgccaagacg gcgacctggc tacgctccat ccgcagcgcc tccatgagca 3811561 cactggcggc agcccgcagc tgcgcgcagt cagcgtcgag cgcgtgcagg tcaagtccgt 3811621 cttcgctgcc gtaccagtcg tggatctggg cagggtaggc ggtcaggtcg ggatgttggt 3811681 agcccaccag gtggcaagcc cgcacgtagc tttgcgtgtg ctcggctgcg ggcctgccct 3811741 cggcgagacg ctcagcgacg ttcaaccggt cagccaccct cacccgatcc gcgccgccgc 3811801 gcacaggtcg gcctcggcgt agcggttggc gcccgcccgc aacgcaaacg cgatctgcac 3811861 agccgcccgg gaccacactg acaactcgcc ggccaaccgg tctagcctgc agcgcaacgc 3811921 atcgccacgc gaggcgtgcc cccggcccgc gcaggctccg ccaaaagcca gcctcgtcag 3811981 gtgattgccg atggcgtcat cgatgagttc ggcggcggcg ctgaaccggt cggcaaccgc 3812041 gtataccgct gctatgtcta tgccggcgct gtttacgcta tcgggtctca tgcctattcg 3812101 gacgccccgc gccgcgtcgg ggttccagca tttccggttc agcgcgcggt gctcaccgcg 3812161 tcggcgaccg tggccgccag ccgctgggcg acgccctcgt cggctgcctc caccatcacc 3812221 cgaatcatcg gctcagttcc ggacgggcgc aagaggattc gacccgtgtc acccagctcg 3812281 gccgcggcct gctcgaccgc cgttcggacc gagggcgccg cggcggcggt ggccttgtcg 3812341 acaacctcga cgttgatcag cacctgcggc aacgtccgca tcgccgacgc caggtcggac 3812401 aacgacgagc cggtctgcac catgcgggtc atcaaccgca gcccggtgac gatgccgtca 3812461 ccggtggagc ccagcgccgg catgacgatg tggccggatt gttcgcctcc gaggctgtag 3812521 tcaccggccc gcagctcttc gaggacgtag cggtcaccga cggcggttgt acgcacggtg 3812581 acgccggccg agcgcatggc taggtgcagc ccgaggttac tcatcacggt ggccaccaat 3812641 gtgttgcagg ccaactcacc ggcctctttc attgccagcg ccagcaccac catgatggcg 3812701 tcaccgtcga cgaggtcacc gttggcgtcg acggccaggc accgatcggc gtctccatca 3812761 tgggccaggc ccaggtcggc ccgatgggcg agcaccgctg cccgcagcgg gtcaaggtga 3812821 gtcgatccac agccgtcgtt gatgttgcgt ccgttgggtt cggcgttgat cgcgataacc 3812881 cgggcaccgg ccgctcggta ggcgcgcgga gccgccgacg acgcggcccc atgagcgcag 3812941 tcgaccacca cggccaggtc atcgagccgg gcggtggcgg ccttggccac gtggcgcagg 3813001 tagcgttcgg tcgcatcctc ggcgtcgata acgcggccaa tccccgcgcc ggccggccgc 3813061 aacccgggtc cgcgggagac gccgaggacc agatcctcga tctgatcctc ggtgtcgtca 3813121 tctaatttgt ggccgccggg cccgaagatt ttgatgccgt tatcgggcat cgggttatgc 3813181 gacgccgaga tcatcacccc gaagtcggcg tcgtaggcgc cggtcagata ggccaccgcg 3813241 ggggtcggca acaccccgac ccgcagcgcg tcgacgccct cactggtcag gccggcgatc 3813301 acggcggcct ccagcatctc gccgctggcc cgcggatcgc ggccaagcac cgcgactcgc 3813361 cgacccggtg cgcccgacct cgacaatcgt cgcgccgccg cggcgcccag tgccagggcc 3813421 agttccgcgg tcaactcgcg attggcgaca ccgcgcacac catcggtgcc aaacagtcga 3813481 cccatacgga caacctttca cagttgacgg ctgcgcacat atccactctt ggcagcgaat 3813541 atgcctgttg gttcaccgac acgccgacga gcgcacacaa acatgcacgc ttgtcgcccg 3813601 aaagtgatgt cagcgcttgc tgtactgggg cgccttgcgg gccttcttca ggccgtactt 3813661 cttgcgctcg gtggcgcgtg gatcacgggt caagaagccg gccttcttca gcgcgggccg 3813721 gtcctccggc gataccagaa tcaatgcccg ggcgataccc aggcgcagcg cgccggcctg 3813781 acccgacggg ccgccgccgc ccaggtgggc aaagatgtcg aaactttcca cccgatccac 3813841 ggtgaccagg ggtgccttga tcaactgctg gtgcaccttg tttgggaagt agtcctccaa 3813901 gctgcggccg ttgaggtcga acttgccggt gccgggcacc agccgcactc gtaccacggc 3813961 ctccttacgg cgcccaacgg tctggatggg ccgctccaac acgaacgatt gtgcgggccc 3814021 ggccggggcc gccggggttt gcggggctgg ggtggtttcg gtcattgcgc cacctgcttg 3814081 agctcgtacg gaaccggctg ctgagcgctg tgcggatgct ccgggccggc gtagacgcga 3814141 agcttgcgct ggatctggcg gctgagcctg ttcttgggca acatgccgag gatcgccttt 3814201 tccaccacgc ggtcggggtg gcgttgcatt agctcaccga tggtgcgctt gtgcaggccg 3814261 ccgggatacc ccgagtgccg gtaaaccatc ttgtgctgca gtttgtcgcc gctgatggcg 3814321 accttgtcgg cgttgatcac gatgacgaag tcaccgccat cgacattggg ggcgaacgtc 3814381 ggcttgtgct tgccgcgcag caggttggcc gccgcgacgg caaggcggcc aagcaccacg 3814441 tccgtggcgt cgatgacgta ccacgatcgc gtggtgtcac ccgccttggg cgcgtacgtg 3814501 ggcacagcgc ttaccttctt ttctctcggg tggatcccgg ggtgccccgg gcgccggtca 3814561 ggcgtgaacg gcgggttggt ctcggcgacc gacattgacc cgaggtcccg gcgtaccgca 3814621 cgccaaccga gcagcttacc gacgagcatc cacgcaggtc aaaatgactg tgtggtcccg 3814681 acggctctcc cccgtcggga ccacacaggg gtctgttgcg cgctccgggg cccggaacta 3814741 gcgtgcccaa gctccagccg cccgccggtc ggcatgcgcc acgtcgtcgg caccgtggcg 3814801 aaccgcgttt cccaagtcga tcaggatctc gttgagcgcg ctggccgcct ggtgccactt 3814861 gagttgctcc gcgtggtagg cggcggccgc ttcccgtgtc cagagctgct gcaacggcgc 3814921 gatctgcgac ctcagctctt gcagcgcagc gttgaaacgg gccgcggtgg tgtggatctc 3814981 ctgacgaacg gagtattcga tggcgtcaaa gttgtacgac aacacggggt ctgcgttcat 3815041 agtcgaggct gatcctcggt ctataggtcg ccgccggcgg cggcgatgtg gcgggcatgg 3815101 atttggccgg cttcccgcag cgcggcctcg ttgtggcgga tggtgtcggc gatcgcgtgc 3815161 aggacgtggt agagccgcgt cgactcggcg ttccagcgat ccaccacatc ctggaaccga 3815221 gcggccgcga gcccacccca caccgacggc ggcacaccgc tcatgcggcc gatgaatgcc 3815281 tgcagcatcg cacggatttc ctcattgcgg gcgtccgtga tacccgcaac cgaacgcatc 3815341 aggtcaaagt cggcgttcag cgtgttcggt gtgctcacat caagtaggac cgccgccaac 3815401 ctcgtctggt tccctccgat ccttcccggt tcaaccaacg gcgtggacgg accgtacggc 3815461 ttgcgcacac acctccctga ggaggtcttc atggccgggc ccgctctggc agccgacgct 3815521 gatccggacc gctccgtcga gcagaatcgt ccaccgcacc tgatgcccgg cgcggacctc 3815581 tcgataggtc accgcgggcc ggccggctct gatatcggag gggttgaagt cgacgaatac 3815641 cccggccggt gacgcgtcga tcgcccgctt caaccgctgc gcggtgccag gcagcgtctc 3815701 accgggaacc ggtgattgtg tgacgtgcaa cgccacctcg ggatcggccg gtgaagtgac 3815761 ctgtacccgc gccgaaccgg gaccggagac cacccgctgc gtggaccagt ccgccggaat 3815821 cgtcagcgcc acccggccct ctaccagaag cgtcgtcggt ggtctttgca gggttgtcgc 3815881 accgtggcgg accacggcag ccggcgccag taacgccaag gcgacaccgg cggccgcaac 3815941 ccgggcaagt gtcgggaccc gagagcgggt ggcaggccgc gccgccggat cggcgggctc 3816001 gtcggaaggc ggcagggcgg ccctggccaa ccgcgccagc cgcacgccgt cgatctcgac 3816061 cacgctgcta ccggtacccc gcaccgcacc ggcgattgcc gccgcgagcg ctgccgcccc 3816121 ggcgaccgta ctgggcacgt cgatcagcac caccgcggta ataccccgcg tcatccgcgc 3816181 aatgacactg cctacctggc cggcaacgga ctcggcgtcc gtgcggcggg ccaccgcggc 3816241 gacctcggcg ccggccacca acaccagtcg ctccgcgatc tccaccacca ccgttgcggc 3816301 cgaaaccccc gaggacgcct gcctcagcag ccacgaccgc gggtgcacga cgacatcgcg 3816361 ggtcagcgtg cgtgcggctg cggtgaccac ctcgacccga gccgccgacc accacgacgg 3816421 gtgcacgacg accgggccgt cacggtggtc gacggccacc gatcgcaggg cgtcgaacca 3816481 cagcgaatcc acggcgactg gccgttcgtc cagcagcgct acctggtcgt cgatcgccgc 3816541 cagcgcggcg gcagacactg cggtgtccgc gactacgtct gcgccacaac acaatcggcg 3816601 gatggcaccc ggacccgcct cgatcaccgc gcgatgtggg ctcacggggg tgggctccag 3816661 gcgacttgaa ccagttgctc gtcaccggca ccggtgacca ggatgccccg gcccggtggc 3816721 agcggcatcg ggcggctcga cccgaacagt gcgccttcat ccggacgtcc gctcatcagc 3816781 agtgcccggc agcccaggtc acgcaggctg gcaagcaccg gctcgaacag cgcccgagca 3816841 gcacccccgc tgcgccgcgc caccaccagg tgtaaactga gatctcttgc gtgcggcaaa 3816901 tattcgagca agaccatcag cgggttgccc gatgagaccg caaccaggtc gtagtcgtcg 3816961 accacgacat agatatccgg acccgaccac caggacctgg ctcgcagctg cgcctggctc 3817021 acatccgggg cgggcatccg cgcctggagc aggtcgacca gactcgacag cttggcaccc 3817081 agcgccgccg gcgagctgac gtagccgccc atatgttccg actcgatgac gtcgagcagg 3817141 gtgtgccgga agtcgacgat gagaagttgg gctcgcgcgg cggtatgggt ccggacgatc 3817201 tcgcggcaca gggtccgcaa cgcggccgtc tttccgcact cgttgtcgcc cagcaccagc 3817261 aggtgcgggt ggcgtccgaa atcgacggcc accggctggc ctcgacgttc ctcgaggccg 3817321 agcaagatgt gcgcaccgag ttcgtcgccg gctcgggcca cgacgctgtc gtagtccacg 3817381 cgcgcgggca gtagcggtat cgggggcgcc accggatcac cacttcggcg tcgtagcgca 3817441 actccatcca ggtcgggcag ggcgatcacc atgtgcatcc cgtcgcggga gaggccacgg 3817501 cccggtctgt cgaccggcac ccgttgcgcc tgcctacggt ccaattcgga atccgcggga 3817561 tccgccagcc gtaactcgat tcgactgccg atctgatccc gcagcgacgg cctgatctcc 3817621 gcccaccgtg ctgccgatag cgccacatgt acgccgaatg aaagcccttg agctgccagg 3817681 gcaacgatcg actcctcaag ggccgcgaac tcctggcgta agcttgccca gccgtcgatg 3817741 acaagaaata tgtccgcaaa agactcagcg gccgactttg ctcgcagctg gcggtaccgc 3817801 gccaccgagt cgatgccgtg gtcgcggaag aatgcctccc gaaatcgcac ggccgactcc 3817861 agttcggcga gcatccgcga tgccagctgc ggctgcgccc tgccggccac ggcacccaca 3817921 tgcggcagtt cgtccacctg ggccagcgcc ccgccgccga agtccaaaca atagaactgc 3817981 acccggcccg catcgtgggt agcagccaac gccatgatca gcgtccgcag cgcggttgac 3818041 ttgcccgttt gcggtgcacc tacgaccgcg acattgcctg cggccccgga caagtcgatc 3818101 gtcagcggca cccgtgactg ctcgaacggc cgatcgacaa tgccgatggg tacggccagc 3818161 tcggcctgcg ccggctcagc gtcacgcagt agggcgccca gcatcggtgg ctcgtccagc 3818221 ggcggtagcc agacttgatg cgcagccggt ccatgaccga ccagccggtc gagcaccgca 3818281 tgcaagacgg taggcgtggg cacctcggct gtcccgccga cgggaccggc tgtgaccggc 3818341 gccgcagcgt gcgtggtgaa cggtcgcacc gacggcgggg ctaccgggtg gaccgctgag 3818401 ggactcgccc gtcgaagcgg cccggaaacg aacgcggtct gaaatcggat cagctctccg 3818461 gttcccgttt gcagcaagcc cgcaccgggg gtgttgggca gttgatatgc gtcctgcgtc 3818521 ccgagcacgt tgcgtgattc actggcggac cacgttttca ggcacattcg ataggacaga 3818581 tgggtttcca gtccacgcag tcggccctcg tcgagccgct gactggccag cagcaaatgc 3818641 atgcccagcg accggcccac ccgaccgatc gcgaggaaca cgtcgacgaa ttcgggatgt 3818701 tggctcagca attcggaaaa ctcgtcgacg acgatgaaca ggatcggcag gcagggaagt 3818761 tgcgcacccg tttggcgtgc ccgctgatat gccgtgacac tgaccaagtg gcctgccatc 3818821 cgcagcagct gttgccggcg gctcatctcg ccggccaatg cgtcttgcat ccgtgcgacc 3818881 agcggtgctt cctcggcaag gttggtgatg accgcggcta catgtggggc tcccgcgagg 3818941 tcgagaaatg ttgcaccacc cttgaagtcg accagaagga ggttgaggac ttcgggcgaa 3819001 ttgcgtgcca tcatccccag cgcgatggta cgcagcagct ccgatttgcc tgatccggtg 3819061 gcgccgacgc acagcccgtg tggacccatg ccctgttccg cggcttcctt gatgtctagc 3819121 tgcacggcgg taccgtcggg cgtgactccg atcgggacac ggagccgatc atgttggttt 3819181 acgttgcgcc acaacgtgct cggatcgaaa gcggccacat cgccgatgcc gaccagttcc 3819241 gcccaacccg agccacggat gaacgtgcga cccgagtgcc cgacccggtg agcggccagc 3819301 cgacgggcgc ataccagcgc gtcttgaggc tccagctggt ccgggcacgc tagcgctgtc 3819361 acttcgccgg cacatctgac caccggcggt gcaccgtctc gtctggcgcc cacctcgatc 3819421 gtgatcacgc cggtgatcgc gccgttgcca cgttcggccg tgtcgacgat cgcaacaacg 3819481 tgggccaata ccgttgcggc tagcgcattt tgcatctctg ccagggtcga gtacaccatc 3819541 ggggctggcc ccaaggcatc acaggcattc ggatgttggt tgtgcggcag ccatttcagc 3819601 caatcccagt gcgcgcggtt gcggtcactg accacgccgg cgatcagcaa ctcctccggt 3819661 gagtgccata cggccagctg gcagatcatc gcccgcagca gcccgcggac cttggtcggg 3819721 tcaccgtcga tggcgatcgg accgccgacc cgcaagggga tcgcgatggg cgcatccgca 3819781 atggtcgcgt gtgcggcaag gaaacagcgc agcgcggcgc gggtgaccgg atccgcacgc 3819841 tgcgccggcg gaagctgccc gaccaccaag cgggtggcca gcggtgcaga tccaactccg 3819901 acacggatgc gacagaagtc ggcagcaccc ggtcgacgct cccacattcg cggaccaccg 3819961 atcaatgtcc acaaggtggc aggatcggga tgcgtccagt tcagtgatac gtgttgtgct 3820021 gcagccgttt gggtgacaga tgtgcgcaag acactcaggt acccgaggta gtcgacacgg 3820081 tcgttgtgga taccggagac atgccgccgg ccgcgtccgg ttaccgcagt caccaccaac 3820141 gagaccagca tcatcattgg gaaggccaga aacgtggggt ggcgcgtggc cggcgagccc 3820201 ggcaagaaca ccgtcaccat gacacccacg gtcgccaccg acatgacgac cgggagcagg 3820261 cgaatcagca ggctggacgg ttccgaccgc cgcaactcgg gcggcggggc aaccaggatg 3820321 tccgcagtcg cgcacgccgg ccctgaattc atgctgggcg acggtatgca gcgcgagaat 3820381 ccgccgcaag tcgcttgtgg acaaccgaat accgggcgat cgagaaccgg ctaccgttcc 3820441 ggtgatccga gaataaaggg ggagaatgcc tacgtctgat ccgggactgc gccgggtcac 3820501 cgtacatgcc ggcgcccagg ccgtcgacct gaccttgccc gccgcggtgc ccgtcgcgac 3820561 tctgatcccg tcgatcgtcg acatcctggg tgaccgtggc gccagcccgg cgacggcggc 3820621 gcgctaccag ctgtctgccc tgggggcgcc agctctgcca aacgcaacga cattggcgca 3820681 atgcggtatc cgcgacggcg ccgtcctggt cttgcataag tccagcgccc agccgcccac 3820741 cccccgctgt gacgatgtgg ccgaagcggt ggcggcggcg cttgacacca cagcccggcc 3820801 ccaatgccag cgcacgaccc ggctcagcgg tgcgctggcg gcaagctgca tcaccgccgg 3820861 cggcggcctg atgctggttc gaaacgccct cggcaccaac gtaacccgct actccgacgc 3820921 cacggccgga gttgtagcgg cggccggctt ggctgccttg ctgtttgcgg tgattgcatg 3820981 ccggacatat cgggacccga tcgccggcct cacgttgagc gttatcgcca ccatattcgg 3821041 tgctgttgcc ggcctactgg cggtgcccgg ggtccccggt gtccatagcg tgctagttgc 3821101 cgcgatggcg gcggccgcca cgtcggtgct ggcaatgcgc ataacgggtt gtgggggtat 3821161 cacgttgacc gcggtggcgt gctgcgcggt agtcgtcgcg gccgctacgc tggtcggcgc 3821221 gatcactgcg gccccggtgc ctgccatcgg ttcgctggac acgctggcat cctttggtct 3821281 gttagaggta tccgcgcgga tggcagtcct gttggcgggg ttgtcgccac gattgccgcc 3821341 cgcgctgaac cccgacgacg ccgatgccct gcccaccacg gatcggctga ccacccgagc 3821401 gaaccgtgca gatgcttggt tgacgagcct gctggcggcc ttcgcggcct cggcgaccat 3821461 cggtgccatc ggaaccgccg tcgcaaccca cggcatccac aggtccagca tgggcggtat 3821521 cgcgttggcc gccgtcaccg gtgcgctgct gctgctacga gcacgttcag cagacaccag 3821581 aaggtcactg gtgtttgcca tctgtggaat caccaccgtt gcaacggcat ttaccgtcgc 3821641 cgcggatcgg gctctggaac acgggccgtg gattgccgcg ctgaccgcca tgctggccgc 3821701 cgtggcaatg tttttgggct tcgtcgctcc cgcgttgtcg ctctcgcccg tcacgtaccg 3821761 caccatcgaa ttgctggagt gtctggcgct gatcgcaatg gttccattga ccgcttggct 3821821 atgcggcgcc tacagcgccg ttcgccacct cgacctgaca tggacatgac cacgtcccgt 3821881 accctgcgcc tgctggtggt atcagcgctc gcgacgctgt ctgggttggg aacgccggtt 3821941 gcccacgcgg tttcgccgcc gccgatcgac gaaagatggc tacccgaatc tgcgctgccg 3822001 gcgccgccgc ggccgaccgt acaacgtgag gtatgcaccg aggtcaccgc cgaatcggga 3822061 cgggctttcg gccgggctga gcggtccgct caactcgccg acctcgacca ggtctggcga 3822121 ctcacccgcg gcgccggcca acgggtcgcg gtcatcgaca ccggcgttgc gcgccatcga 3822181 cggttgccca aggtggttgc cggcggtgac tatgtcttca ccggggacgg caccgcggat 3822241 tgcgatgcac acggcacgct ggtggccgga attatcgcgg ccgcaccgga tgcgcaaagc 3822301 gacaatttca gcggggtggc acccgatgtc accttgatta gcattcgcca gtccagcagc 3822361 aagttcgcac cggtcggcga cccgtccagc acaggtgttg gtgacgtcga caccatggcg 3822421 aaggccgtgc ggacggccgc cgacctcggc gcgtcggtga tcaacatctc gtcgattgcc 3822481 tgcgttccgg ccgcggctgc gccggacgac cgcgcgctag gtgccgcttt ggcctatgcg 3822541 gtcgatgtca agaacgccgt catcgtggcc gcggccggca ataccggcgg cgccgcgcag 3822601 tgtccgccgc aggcccccgg ggtaacccgg gacagcgtca cggttgcggt gagtccggcc 3822661 tggtacgacg actacgtgct gaccgtaggt tcggtgaacg cccaaggcga accctcggca 3822721 ttcactctcg ccggcccctg ggtggatgtc gccgccaccg gcgaggcggt gacctcgctc 3822781 agcccgttcg gtgacgggac cgtgaacagg cttggcggac agcatggttc gattccgata 3822841 tccggaacca gttatgcggc gccggtcgtc agcggcctgg ccgccctgat ccgggcccgc 3822901 tttccgacgt tgaccgcacg gcaggtgatg cagcgcatcg aatctaccgc gcatcaccca 3822961 cccgccggat gggatccgct cgtcggcaac ggcacggtcg atgccctggc tgcggtcagc 3823021 agcgactcga ttccgcaggc cggcaccgca acgagcgacc ccgctccggt ggcggtgccg 3823081 gtccctaggc ggtcaacgcc cggcccatcg gatcgccgcg ccctacacac cgcctttgct 3823141 ggtgccgcga tctgcctgct cgcgctgatg gcaaccctgg ccaccgccag ccgccggcta 3823201 cggcccgggc gcaacggtat cgcgggcgac tgacgcgttg gctctactca gctccggtcc 3823261 ggacggcagt gtcgccaaca ccggccacgg cgccgggatg gcagccgtcg gcagaccgag 3823321 gtcgtgtgcc acgtcgtcgt cgtggatcgc gaaccgcacc ccggtgtcgg tgaccaggta 3823381 gcgcgtgccg gtgccgccgc cggacaggct gcgcgcggct acgtaggcgc tgcgtcccgg 3823441 cggcaggtac accgcgtcca gtgcggggcc gcgaccgtcg gcttgtgcca gtgtcaccgg 3823501 aacccctccg aggggcaccg gcgggccgct gcccgccaag aacgcgacgc gagcagcacc 3823561 cggctgcgcg ggcgtccagg tcacgcacaa cgtggtgacc gcccttcccg gcgagccgtc 3823621 caccggtgtt ggcggccggt cgggaaaggc cgacaccggc aaggtgttca cgatcggagc 3823681 gacgcgaatc acatcggggg ccaccgtcgg gacgttgacg ctgccctgcg aatcgccgaa 3823741 ccgcaacaaa tccgcggcga cctggccgat gcgctgcacg ccgtcctcca gcaccacgta 3823801 atactcatca ccgctcgcgc gagtgatgcg caccacaccg ccgaccagaa acccgggcag 3823861 cccgaccgag gcccgcccgc cgccacgaat ccggggagcc gtgatgcgcg gtgcctccgg 3823921 gacggcgttg agcaacgatt gcgcgaccac gtgcgggacc cggccctgca gccgcagcgc 3823981 ccacaccacc gccgggtcgg ccagatccac cacggcccgc cgaccgccgt agagcaggta 3824041 ggtgggcgaa cctgattcgg tcgccaccag gatcatctgt tcggcggtca gcacctgcgc 3824101 cgacgagtct tcggcgggcc cgacgacgac agtcgttgat ccgccattgt cgctatcgca 3824161 gatcgcccac gccgattcgg cgccggctag cggctggtca agcagctgcg gcgcacctgg 3824221 aataccgagc agtggaccgc gtttggtgtg gcccaattcg gactcggaca ccggttgcgg 3824281 gttggcgttc gtcgccgcga tcaaccgcgc cgaagccagg ttcaacaccg gatgccagac 3824341 atcgtccact cgcacgtaga gtgccccgga ttcccgaccc atcacgatcg gcgcctgacc 3824401 gagcgccgac tgtggccgca gcagcgcaac gaatgcgcat cccatcgcgg cgacgatcgc 3824461 cagcacgcac ccgagggcca gcgatgttgt gcgcgcgcgc agtgctccgg tcgctgcgca 3824521 gacatccccg aacagcaacg cgcactcgat gcgccgcagc agaaatcggt acccgctgac 3824581 gtgcagccag gtcgtcgctg ggctcggcac tggctctccc acggtggcgc gctgatttct 3824641 ccccacggta ggcgttgcga cgcatgttct tcaccgtcta tccacagcta ccgacatttg 3824701 ctccggctgg atcgcgggta aaattccgtc gtgaacaatc gacccatccg cctgctgaca 3824761 tccggcaggg ctggtttggg tgcgggcgca ttgatcaccg ccgtcgtcct gctcatcgcc 3824821 ttgggcgctg tttggacccc ggttgccttc gccgatggat gcccggacgc cgaagtcacg 3824881 ttcgcccgcg gcaccggcga gccgcccgga atcgggcgcg ttggccaggc gttcgtcgac 3824941 tcgctgcgcc agcagactgg catggagatc ggagtatacc cggtgaatta cgccgccagc 3825001 cgcctacagc tgcacggggg agacggcgcc aacgacgcca tatcgcacat taagtccatg 3825061 gcctcgtcat gcccgaacac caagctggtc ttgggcggct attcgcaggg cgcaaccgtg 3825121 atcgatatcg tggccggggt tccgttgggc agcatcagct ttggcagtcc gctacctgcg 3825181 gcatacgcag acaacgtcgc agcggtcgcg gtcttcggca atccgtccaa ccgcgccggc 3825241 ggatcgctgt cgagcctgag cccgctattc ggttccaagg cgattgacct gtgcaatccc 3825301 accgatccga tctgccatgt gggccccggc aacgaattca gcggacacat cgacggctac 3825361 atacccacct acaccaccca ggcggctagt ttcgtcgtgc agaggctccg cgccgggtcg 3825421 gtgccacatc tgcctggatc cgtcccgcag ctgcccgggt ctgtccttca gatgcccggc 3825481 actgccgcac cggctcccga atcgctgcac ggtcgctgac gctttgtcag taagcccata 3825541 aaatcgcgtc atgaggttca tcggggtgat cccacgcccg cagccgcatt cgggccgctg 3825601 gcgagccggt gccgcacgcc gcctcaccag cctggtggcc gccgcctttg cggcggccac 3825661 actgttgctt acccccgcgc tggcaccacc ggcatcggcg ggctgcccgg atgccgaggt 3825721 ggtgttcgcc cgcggaaccg gcgaaccacc tggcctcggt cgggtaggcc aagctttcgt 3825781 cagttcattg cgccagcaga ccaacaagag catcgggaca tacggagtca actacccggc 3825841 caacggtgat ttcttggccg ccgctgacgg cgcgaacgac gccagcgacc acattcagca 3825901 gatggccagc gcgtgccggg ccacgaggtt ggtgctcggc ggctactccc agggtgcggc 3825961 cgtgatcgac atcgtcaccg ccgcaccact gcccggcctc gggttcacgc agccgttgcc 3826021 gcccgcagcg gacgatcaca tcgccgcgat cgccctgttc gggaatccct cgggccgcgc 3826081 tggcgggctg atgagcgccc tgacccctca attcgggtcc aagaccatca acctctgcaa 3826141 caacggcgac ccgatttgtt cggacggcaa ccggtggcga gcgcacctag gctacgtgcc 3826201 cgggatgacc aaccaggcgg cgcgtttcgt cgcgagcagg atctaacgcg agccgcccca 3826261 tagattccgg ctaagcaacg gctgcgccgc cgcccggcca cgagtgaccg ccgccgactg 3826321 gcacaccgct taccacggcc ttatgctggc gccggacccc gcccgccagg cgcgccgccc 3826381 gtcaacgcag ccgaatgcgc atttgtccgc cgaatgcgcc gcgatgaacc gcaatcattt 3826441 caccggaagg gaagtgtgcg gacacgctaa ccggacgctc gggctaactt cgaccgctat 3826501 tgcgctgagg agggttgatg ccgggcgtca taacaaacag tgaaagccca accgcagccg 3826561 accacgacag aattacggcc accagagaga cgctggagga ttacacactg cggttggcgc 3826621 cgcgcagcta tcgcaggtgg cccccggcgg tggtgggcat ctccgctctc ggcggcatcg 3826681 cctacctggc ggacttcgcg atcggcgcca atgtcggtat cacgtggggt accgcgaacg 3826741 cgctgtgcgg aatcgcaatc ttcgcactgg tggtcttcgt caccggcttg ccgctggcct 3826801 actacgcggc gcggtacaac atcgacctgg atctgattac ccgcggtagc ggtttcggct 3826861 actacggctc ggtggtcacc aacgtcatct ttgccacgtt cacgttcatc ttctttgccc 3826921 tggagggctc gatcatggct cagggcctta agctaggcct gcacattccg ctgtgggcgg 3826981 gttacgcgtg ctcgaccctg atcatcttcc cgctggtggt ctacgggatg aaagttttgt 3827041 cacagctgca actttggacc accccgctct ggctgatcct gatggcggcc ccatttggct 3827101 acctggtagt cagccatccc gattcgattg gacagttttt ctcctacgcc ggcaaggatg 3827161 gtcatggcgg ccttagcttc ggttctgtcc tgttggcagc gggagtgtgc ctgtcactca 3827221 tcgctcagat cgccgagcag atcgactacc tgcgcttcat gccgccacgg acgccggaga 3827281 acgcgaacag gtggtggacg tggacgctgc tggccggtcc cggctgggtt gcatttgggg 3827341 cgaccaaaca gatcatcggc ctgttcctgg cggtctatct gatggccaac atccccggct 3827401 cgtcgacaat cgccaaccag ccggtgcacc aattcatgca gatataccgc accttcgtac 3827461 cgggctggct ggcgttgaca ctcgccgtca tcctggtgat cttgagccag atcaagatca 3827521 acgtcacgaa cgcgtattcg ggctcgctgg cgtggaccaa ttcattcaca cggctcacca 3827581 agcactatcc cgggcgggtc gtgtttcttg gggttaacct cgcgattgcg ttgattctca 3827641 tggaagccaa catgtttgac ttcctgaaca caatcctggg ttgctacgcc aattgcggta 3827701 tggcctgggt ggtggcggtg gcgtcggaca tcggcttcaa caagtatctg ctcggcctgt 3827761 cgccgaagac tcccgaattc cgccgcggca tgctatacgc catcaacccg gtcggcttcg 3827821 ggtcgttgct gctggccgcg gggctgtcga tcgtcacctt cttcggcggt ctgggtgcgg 3827881 cactgcagcc ttattcacca ttggtggcaa tcgtcaccgc gttggtaatg ccgcccattc 3827941 tggcagccgc gaccaaaggc aagtactacc ttcgccgcac gcacgacggt atcgatctgc 3828001 ccatgtacga cgagcacggc aatccctcgg ccgcggtgtt gacttgccat gtctgccacc 3828061 aggatttcga gcggcccgac atgctggcct gccagaccca tggtgcgcat gtctgttcgc 3828121 tgtgcttgtc cacggacaag caggccgagc atgtgcttcc tgggttagcc cgagcgcaca 3828181 tcccgggtga ccaagttccg tgacgcgagc tggtcatcgg gcggatagtc cacctggatc 3828241 aacgtcaacc cgtgcgccgg cgcgaccgcg aagtcgctgg atcgtcctgt cgcggtgagc 3828301 agctcacgac accaagttgt cgcgcgacgg tgctcgccga ccgccagtag cgcccccacc 3828361 aacgaccgca ccatcgacca acagaacgcg tcggcggtga cgtgcgcggt gaccagggtg 3828421 ccggcacgcg accagtccag ccgctgcaga tcacgaatcg tggtggcgcc ctcgcgatga 3828481 cggcagaacg ccgcgaagtc gtgcagcccc atcaaatctc gcgacgcggc cgtcatcgca 3828541 tccagatcaa gctcgcgtgg ccaagcggtg atgtagcgcg cctgctgcgg ctcgacaccg 3828601 tagggtgctg tcgacagccg gtacacgtaa tgccgccgca gcgccgagaa tctggcgtcg 3828661 aaacccgctg gtgcgcgcgt gatatcgagg attcgaacgt cggcgggcag aaatcgaccc 3828721 agcctccgca acagcggcag gaattccgga tcaccgacgt ggccggcgcg cgggtaagcg 3828781 ttcggcaagg catcggcggg cacgtcaacg tgggcgacct ggccgctggc gtgcacgccc 3828841 gcatcagtgc gtccggccgc ccgcagccgc accggggtgc ggaagatggt agtcagcgcc 3828901 gcatcgagat cgcccgcgac cgtgcgctgc cccacttgtg cagcccagcc cgcgaaatcg 3828961 gttccgtcgt aggcgatatc gagccgaaga cggacaacgc cgctaattct cgggggcctc 3829021 tgcggggggc tcttcggggg ccttcgcgtc aggctcactg gcgccgacca cgtcaccctc 3829081 ctcagcaggc ttggcctcgg actcctcggt cggcatggcc gccgccttct tggccttcgc 3829141 ctgcgcggca gctacccggc gtgctcgatt ggcctccgag gtcaccgtct tctcccggac 3829201 cagttcgatc acggccatcg gagcgttgtc gcccttacgt gcctcgattt tgatgatacg 3829261 ggtgtagcca ccatcgcggt cggcgaagaa cggtccgatc tcggcgaaca aggtatgcac 3829321 cacatccttg tcacggagct tcttgagcac ctcgcgccgg ttgtgcaatg cgcctttttt 3829381 ggcatgcgtg atcagcttct ccgcgtacgg acgcagcgcc cgggccttcg gctcggtcgt 3829441 cgtgatccgc ccatgctcga acagggacgt ggcgaggttg gccaagatcg ccttctgatg 3829501 tgaagacgac ccgccgaggc gagggccctt ggtgggcttg ggcatagctg acgctcctgt 3829561 ctggattaga ggcagtctaa agctgttcgg tttcggcgta gtcctgctcg tcgtacgcgc 3829621 cctcggtcga ccaggtgccg gtggcgacgt cgtagcccgc gacctccgag gggtcgaagc 3829681 tcggcgggct gtccttgagt gacaggccca gctggtgcag cttgatcttc acctcgtcga 3829741 tggacttctg accgaagttg cggatgtcaa gcaggtcgga ttcggtgcgc gccaccagtt 3829801 cgcccacggt gtgcaccccc tcgcgcttga ggcagttgta ggaccgcacc gtcagatcca 3829861 ggtcgtcgat cggcagggcg aatgacgcaa tgtgatcggc ctcggccggc gacggcccga 3829921 tctcgatgcc ttcggcctcg acgttgagtt cccgtgccag gccgaacaac tcgaccagcg 3829981 tcttgccagc cgacgccagc gcgtcgcgcg ggctgattga attcttggtc tccacgtcca 3830041 ggatcagctt gtcgaagtcg gtgcgctgct cgacccgggt ggcgtccacc ttgtaggtca 3830101 ctttgagcac cggtgagtag atggaatcga ctggaatgcg cccaatttcg gcacccgaag 3830161 cccggttttg caccgccggg acatagccgc ggccacgctc gacgacgagc tcgacttcca 3830221 gcttgccctt atcgttcagc gtggcgatgt gcatgccggg gttgtgcacg gtgacgccgg 3830281 ccggcggcac gatgtcgccg gcggtaacct cacccggacc ctgcttgcgt aggtacatgg 3830341 tgaccggctc gtcctcctcc gaggacacca ccaggctctt gagattcagg atgatctcgg 3830401 tgacatcttc tttgaccccg ggcaccgtgg tgaattcgtg cagtacacca tcgatgcgaa 3830461 tgctggtgac ggccgctccg ggaatcgacg acagcagggt gcgacgcagc gaattgccca 3830521 gggtgtagcc gaatcccggc tccagcggtt cgatcacgaa ctgggatcgg ttgtcggtga 3830581 ggacgtcctc ggacagggtg gggcgctgtg agatcagcat ggtgtttctt cttcctttcg 3830641 acgtccgcca tatgacgtct gtgggggcac tcgggggcgg cgcccccgag ggtgggggta 3830701 ctcggggggc gccccccgag ggttgggttg gggggtactc gggggcggcg ccccccgagg 3830761 gttgggttta ctttgagtag tactcgacga tcagctgctc ggtgagtggg acgtcgatct 3830821 gcgcgcgctc gggtagctgg tggatcagga cgcgttgccg ctcccccacc acttgcagcc 3830881 agctcgggat cggacgctcg cccgccgtct cccgggcaat ctggaacggc accgtgttca 3830941 gggacttgtc ccgcacgtcg acgatgtcgt actgcgacac ccggtaactg gggacgttga 3831001 cgtgcacgcc gttgacgttg aaatgcccgt ggctgaccag ctggcgagcc atccgccggg 3831061 tgcgcgccag cccggcacgg tagatgacgt tgtccagccg gctttcgagg atcttcagca 3831121 gttcttcacc cgtcttgccg ggctgccgca cggcctcttc gtagtagcgg cggaactgct 3831181 tttccattac gccgtatgtg aaacgggcct tctgcttctc ctgcagctga agcagatatt 3831241 cgctttcctt gatccgcgcg cgaccgtgtt ggccgggcgg gtagggacgc ttctcgaagg 3831301 cctggtcgcc accgacgagg tcggtgcgca accgccgtga tttgcgggtg acgggtccgg 3831361 tgtaacgagc catcttctct cctagacgcg ccggcgattg gggggccgga caccgttatg 3831421 cggctggggg gtgacatccg agatcgcgcc cacctccagg ccggcggcct gcagcgaccg 3831481 gatcgcggtc tcgcggcccg agcccgggcc cttgacgaac acgtcgacct tgcgcacccc 3831541 gtggtcttgg gccttgcgag cggcgttctc cgcggccagc tgggccgcaa acggggtcga 3831601 tttccgggaa cccttgaagc cgacgtgccc cgacgatgcc caggcaatga cgttgccttg 3831661 cgggtcggtg atggtcacga tcgtgttgtt gaacgtgctc ttgatgtggg cggcgccgtg 3831721 cgggacgttc ttcttctccc gccggcgggt cttctggccc ttcctagccg acgttgccgg 3831781 cccttttttt gctggtggca tcggttacct agccttcttc ttgcctgcga tggtgcgctt 3831841 ggggcctttg cgggtccgcg cgttggtttt tgtccgctgg ccgcgtaccg gcataccgcg 3831901 gcggtgccgc aacccctgat agcagccaat ctcgatcttg cgacggatgt cggcctgtac 3831961 ctcgcggcgc aggtcaccct ccaccttcag gttcgcttcg atgtagtcgc gcaggtggat 3832021 cagctgttct tcggtgagat ctctggtgcg cagatcccgg tcaatgccgg tggccgccag 3832081 gatttcgttc gagcgggtac ggccgatgcc aaagatgtag gtcagggcga cctccatccg 3832141 cttatcgcgc ggcaggtcga cgccgacgag tcgagccata ggtggcgttt cctcttcctc 3832201 tgcggaggta tggtcccagt ccgttccctg cccaaaaaag atctttgggt gtggggcccg 3832261 gcctccgtcc gggcgtgaat gagctggccc atctccatcg atgccagccg ctcattggtg 3832321 ctgggggtct gcatttagtt gtcgggccgt ccggctcctc ctcggaccac tacgcggccc 3832381 gcatcgtcgc cgaactagcc ctgcctttgt ttgtgacgcg gatcggaaca gatcaccata 3832441 acccgcccgt gccgacggat cagcctgcac ttgtcacaga tcggcttgac gctcgggttt 3832501 accttcacga ctgtctcggt cctgttctat gggtatgtcg ctacttgtac cggtacacga 3832561 tgcggccccg ggacaggtcg tagggcgaca attccaccac cacccggtcc tcgggcagga 3832621 tgcgaatgta gtgctgacgc atcttgccgc tgatgtgggc gagcaccttg tggccgttct 3832681 ccagctcaat gcggaacatg gcattgggca ggggctcgac cacgcgaccc tcgacctcta 3832741 tggcaccgtc cttcttggcc attactttct ggcgatcctt ctcttccttg tcggtgcacc 3832801 cgattccggc gcagcacgtg ctcggactac aaacgtgagc cggtggtgga aattccgcga 3832861 agggctccga gaaattttca aaactgggca cgccaaaccg gcacgggaca ccgcaccgcc 3832921 aacccacatt acccgcatcg ccgtgctctg cgcaaaacgc cgtaggccac gcgctcaccg 3832981 gaatagcacc ggtgagccga gcggttagag caaccatgac caattgtgcc gccggcaaac 3833041 ccagctcagg ccctaacctc ggccgattcg gatcgttcgg acgcggcgtc accccccagc 3833101 aggccacaga aatcgaggcg ctgggctacg gggcggtctg ggtgggaggc tcaccacccg 3833161 ccgcactgtc ctgggtggaa ccgattctgc aagcgaccac cacattgtgt gtggccaccg 3833221 gcattgtcaa tatctggtcg gcaccggccc agcgagtcgc cgaatcgttc caccgcatcg 3833281 aggcggccta cccgggccgc tttctgctgg gtatcggagt cgggcatgcc gagatgatca 3833341 gtgagtaccg caagccctac aacgcgctgg tggaatacct agaccggctc gacgactatg 3833401 gggtgcccgc caaccgccgg gtggtggccg cactgggccc ccgggtcctg ggcctgtccg 3833461 cacgccgcag cgccggggcg cacccgtacc tgaccacacc cgaacacacg gcacgggccc 3833521 gtgagctgat tggtccgtcg gcgttcctgg cgcccgaaca caaggtggtg ctgaccaccg 3833581 actcggcaag ggcccgtacg gtgggacgcc aggcgctcga tatgtacttc aacctggcta 3833641 actaccgcaa caactggaaa cggctgggct tcaccgacga cgaagtctcc cggccgggca 3833701 gcgaccgcct ggttgacgcc gtggtcgcct acggcactcc agacgcgatc gcggcacggc 3833761 tgaacgaaca cctgcttgca ggcgccgacc atgtccctat tcaggtcctc accgaagatg 3833821 acaacctggt gtcggcgctg accgaactcg cgaagccgct ccgactgact tgatcccgaa 3833881 acggagggtt gcgaacccaa ctggtcgcgg ctccactcgg ttaaggctcg gttagggttt 3833941 gatccatgcg gttgctagtc accggtggcg cgggattcat cggcacgaat ttcgtgcaca 3834001 gcgccgtacg tgagcatcca gacgatgcgg ttaccgtact cgacgccctg acctacgccg 3834061 gccggcgcga gtcgctggcc gacgtggagg atgccatccg gctggttcag ggcgatatca 3834121 ccgacgccga gctggtttcg cagctggtgg ccgagtccga cgcggtggtg cattttgccg 3834181 ccgaatccca tgtcgacaat gcactggaca atccggagcc gtttctgcac accaacgtca 3834241 tcgggacctt caccatcctg gaagcggtgc gacgccacgg tgtgcgcctg caccacatct 3834301 ccaccgacga ggtctacggc gacttggagc tcgacgaccg ggcgcggttc accgaatcga 3834361 cgccctataa cccgtccagc ccttactcgg cgaccaaggc gggcgcagac atgttggtcc 3834421 gggcctgggt tcggtcctat ggcgtacgcg cgacgatctc caactgctcc aacaactacg 3834481 ggccgtatca gcacgtcgag aagttcattc cgcgtcagat caccaatgtg ctcaccgggc 3834541 ggcggcccaa gctctacggc gcgggcgcca atgtccgtga ctggatccac gtcgacgacc 3834601 acaacagcgc ggtgcggcga atcctggaca gaggccgcat cggccgaacc tacctgatca 3834661 gctccgaggg cgagcgtgac aacctgaccg tgctgcgcac gctgctgcga ctgatggacc 3834721 gcgatccgga cgacttcgac cacgtcaccg accgcgtcgg ccacgacctg cgctatgcca 3834781 tcgacccgtc cacgctctac gacgaattat gctgggcgcc aaagcatacc gatttcgagg 3834841 agggcctgcg gaccacgatc gactggtacc gcgacaacga atcgtggtgg cgtccactaa 3834901 aagacgccac ggaggcccgc tatcaagaac gcggtcaatg agatgaaagc acgcgaactc 3834961 gacgtccccg gcgcctggga gattaccccg accatccatg tcgattcccg cggactgttc 3835021 ttcgaatggc ttaccgatca tgggttccgc gcattcgcag gtcacagttt ggacgtccgg 3835081 caagtgaact gctcggtgtc atcggccggt gtgctgcgcg gcctgcactt tgcccagttg 3835141 ccgccgagcc aggccaagta tgtgacctgc gtttccggct cggtgttcga tgtcgtcgtc 3835201 gacatccgag agggctcacc gacattcggc cgatgggact cggtgctgct cgacgaccaa 3835261 gaccgtagga cgatctacgt ctccgaaggc ctagcgcacg gcttccttgc actgcaagac 3835321 aattcgacgg tgatgtactt gtgctcggcg gaatacaatc cgcagcgcga gcacaccatc 3835381 tgcgccacag atccgacgtt ggcggtcgat tggccgctgg tcgatggcgc tgcccccagc 3835441 ctgtccgacc gtgatgccgc tgcgcccagc ttcgaggatg tgcgcgcgtc tggcctgctg 3835501 cccaggtggg aacagacgca gcggttcatt ggggagatgc gcggcaccta gctcggtaat 3835561 cccttgtgtt gctttagctt cagcggtcac agcgcggcga ttgttgtcgg tggcccctcg 3835621 tagaatttgg ggtatgggtt cgggtagccg cgaacggatt gtcgaggtct ttgatgcgct 3835681 ggatgccgag ctggaccgct tggacgaggt gtcttttgag gtgttgacca ccccggaacg 3835741 gctgcggtct ctggaacgtc tggaatgctt ggtgcgccgg ctaccggcgg tcgggcacac 3835801 gttgatcaac caactcgaca cccaagccag cgaggaagaa ctgggcggca cgctgtgctg 3835861 cgcgctggcc aaccggttac gcatcaccaa gcccgacgcc gccctacgca tcgccgacgc 3835921 cgccgatctc ggacctcgtc gagcactcac cggcgaaccg ctagccccac agttgaccgc 3835981 caccgccacc gcccaacgcc agggcctgat cggcgaggcg cacatcaaag tgattcgcgc 3836041 cctttttcgc ccacctgccc gccgcggtgg atgtgtccac ccgccaggcc gccgaagccg 3836101 acctggccgg caaagccgct caatatcgtc ccgacgagct ggcccgctac gcccagcggg 3836161 tcatggactg gctacacccc gacggcgacc tcaccgacac cgaacgcgcc cgcaaacgcg 3836221 gcatcaccct gagcaaccag caatacgacg gcatgtcacg gctaagtggc tacctgaccc 3836281 cccaagcgcg ggccaccttt gaagccgtgc tagccaaact ggccgccccc ggcgcgacca 3836341 accccgacga ccacaccccg gtcatcgaca ccacccccga tgcggccgcc atcgaccgcg 3836401 acacccgcag ccaagcccaa cgcaaccacg acgggctgct ggccgggctg cgcgcgctga 3836461 tcgcctccgg ggaactgggc caacacaacg gtcttcccgt ctcgattgtg gtcaccacca 3836521 ccctgaccga cctgcaaacc ggcgccggca agggcttcac cggcggcggc accctgctac 3836581 ccatggccga tgtgatccgc atgaccagcc acgcccacca ctactccccc gcaagcggga 3836641 ggtaccccca ggcgatcttc gaccacggca cacccctggc gctgtatcac accaaacgcc 3836701 tagcctcccc ggcccagcgg atcatgctgt tcgccaacga ccgcggctgc accaaacccg 3836761 gctgtgacgc accggcctac cacagccaag cccaccacgt caccggctgg accagcaccg 3836821 gacgcaccga catcaccgag ctgaccctgg cctgcgaccc cgacaaccga ctcgccgaaa 3836881 aaggctggac cacccgcaaa aacacccacg gccacaccga atggctacca ccaccccacc 3836941 tcgaccacgg ccaaccccgc accaacacct tccaccaccc cgaacgattc ctccacaacc 3837001 aagacgacga cgacgaaccc gattgacccc cagcagtcaa agccacacgc cacaacgccg 3837061 cacaaccata aacaccgagt ccgtcagggc ctggccggag caaacacgcc acggtggtag 3837121 gagctgtggg catatgcctt ggagcccacc agttgtgaca acggcgtgtg caccgactgc 3837181 ccgcgtgcga gtctggcggc gaccgcgttt aggtcgaacc gcggacgcca attcaagtca 3837241 cggcgagcgc gggagttaac gtacacgcgg tcgaggcggt cggggaagcg ccaaccacgc 3837301 tgggtccaca cagccgcggc cagcggtacc cgccgggcga acaccgatgc cgcgtcggtg 3837361 cgcagctgcg tcaggtcatc acgggtaaac ggtgtggtcg ccgacaccag atagcgcccg 3837421 aaccccagct ggggagctcg ctgcgcggcg ttgaggtgcg catctaccgc gtcttcgagc 3837481 gcgacccgcc ggcaggcata ttcgttggct ttgatgttgt cctggctgcg cccgtcatac 3837541 aggtcaggca tgtcatcgcc ctcgacgaag aatcgggcaa cacgcagcac gacgcaggcc 3837601 aaaccgtcgt tgcgatgtgc caactggcag aggtcctcgg agctagcttt ggtcacgccg 3837661 tagatgttct tgggaatggg cgtgacggat tcgtcgatcc acgccgcggg ctggtctgcc 3837721 ggcggtgtca gggcgtcgcc gaaaacggtc gtcgatgatg tcatgacgaa ggcgcggacg 3837781 ttggcggcga ccgcagcatc cagcacggtc tgggtaccga tgatgttcgt gtccagaaac 3837841 gcctgacgcg gcaggaaggc cagttgcggc ttgtgatggg cggccgcgtg gaacaccacc 3837901 tcaacgccgg ccatcacgtc tcgcagcagt gctcgatcac tcacgcagcc aacgatattc 3837961 gtgtaccgcg acggtctgct gtcgaggctg acgacgtcgg cgccccgtgc acgcagagtg 3838021 cgcaccagcg cctcgcccag gtgaccggag ctgccggtaa ccagggtacg catcccgctc 3838081 tcggcggcgg cagccgtcgg aggcgtgccc gcgtgcaaca acagcggact ggaccgcacg 3838141 ccggcgcgga ctctcatggt ggctgcatgt gttcccacga ctcacgccct cattcccacg 3838201 accactcgat cgatgtcttg cggggacaac cactgccccg catgactttt cgcggtctgc 3838261 cgaataacgt gggacacgga aagccccgcc tgttgggccg ctgcgcggag gtgctcgacc 3838321 tgcagcgaat cgagtccgtg gaatccgaac gagatctccc acggatcgat gccgtaactc 3838381 tgcgggctcc ggcccaacat atccgatcgc gctgcgtcga aaatccgatc caggtcgact 3838441 gccgccaggt gcccgcgccg ctgcaacgcg gcgacgaggc actcgatctg gcaattcccg 3838501 gccccgcgac cgaaacccat cagcgttcca tccaggaaat cggccccggc gtcgaatgcc 3838561 tccaaggtgt tggcgacggc catggcgagg ttgttgtgcc cgtggaagcc gacggagaca 3838621 tcgctggcac cgcggagagc ctcgacgtag cggcgcgcgt cctcgggcag gaaggttccc 3838681 gtcgtatcca ccacgtaaac gatccggacg cccacatcgc gggcccgctt cccggcagca 3838741 gcaagcacat cgggctcgaa gagatgcgac ttcaccagct ggatcgaaac ctccagacct 3838801 tttgactgcg cacgctcgac gaacggcatc accaactcaa attcggtggc gatgacacat 3838861 atgcgcagaa agtccagata gtctccggcc aaatcgaccg tctcgatgcg ggccagggcc 3838921 ggcacgatca cggcaccaag tctcgcgttt cgaaccaccg atcgggcggc gcggaaatat 3838981 tcttcgtcgg tgtgagccgc cgggccctgc gccgcggcgg ctccgatggt gacgccgtga 3839041 ccgatttcaa tgtagggaat tcccgctgcg tcgagatccc cgacaatcct gcggacatcg 3839101 tcgtcggtgt actggaagtt caccgcatag ctgccgtcac ggacggtcgt gtccaggaca 3839161 atcggctctc tgtgggtcgc agtcatgagc atcagtgtca gcccgcaccc ttgccggatc 3839221 cttgatgaat tcttggacgc gcggctggtg tcctatcgac ccagtccaat gtcgggtttg 3839281 ttgatctctg gatcgatcgc gatatcgagg acgcagggac cggtggcggc caacgctttt 3839341 tgcacaccgg cgcgcagctc gcagcgcgta tcgacccgaa tcccttccgc tccaagggcg 3839401 cgggccatcg ccgccagatc gttcgcgccg atgcgagcga ccggcgacgg atccatccgc 3839461 ccgctgaccg ggccggcgct ggcactcatt tgtccgtcgt tgaggacagc ccaggtcacc 3839521 ctgatcccgt gcgcaaccgc agtggaaatc tccgtgccat gcatcaagaa agccccgtcc 3839581 ccggcgatgc atatgacgtg ttcttccggt cgagccaggg ccacgccaat ggctccggcg 3839641 atgccgcatt ccatgggcga aaagtcaacg gtggcaaaga atctgccggg ccgccgcacc 3839701 ggtatcccac gaaacgtcca agaaatgcag gtacccacgt cggcgcatat cgtggcgttg 3839761 ggtgcaagct cgcggtccag ttcgtgcatc agctcaagcg ggtgaatcga ttccccccgc 3839821 gcttgcgggg tccccggcaa cgccgctggc gccggcggcc gcacgcccac cctccgacaa 3839881 aagcgtggcg gccgcccgca gttcagggca ttgacgaacg cgcgcccgga cgtggtgatc 3839941 ccgagcgacg tagcgacgaa tcggccaact gccgatggat cgggatcgac atggacgacg 3840001 tcggctttca gcccgcgcca gcggggcgaa aaggagcggg taaccaaccc gccgaaggaa 3840061 acaccgaccg cgatcaacag gtcgcacggt gtgtcgaaga ggtactcgtc ggccctgccg 3840121 tcaccaaata tgccgagcac acccagagac agcggatggg tttccgcgac gatcccccgc 3840181 ccgttcggtg tggtcgcaaa aggaagtccc gccttctcgc aaaacgcgac gatctgctcg 3840241 ccgatgccgt ccagccggca gccattcccc agcacgagca tgggggcacg cgaccgatcc 3840301 agcctaccga tcacctcgtc agcgacatca ggaccgcacg gcgccagggt tcttaggccc 3840361 ccaagaccgg ccgcggcagt tccaagttgg tgagccggca gccgctcgtc cactagatcg 3840421 cgcggcagag caatgtgcac cggtccgcga gggatgctcg ccaaggcccg gaacgccgaa 3840481 tcgatcttgc tgcgcgcatt ggcgatcgat tcgatggaca ccgaacagcg gcagaaccgg 3840541 cggaaggttg cgcccaggcc cagtccgtcg tcgctcgtat cctgctgcga gtgcaggccg 3840601 aattctccga ccgccacctc cccggtcagg ataagcatcg gaacctgatt caccgacgca 3840661 ttggccacgg cgctaatgac gttggtcgcc ccaggtcccg ccacaaacac cgcagcggac 3840721 ttgccggacg cgcgggcgaa cccgtcggcc aggtagccgg cgccgccctc gtgccgggcc 3840781 aacacgatct gaaagccggc atcgcgggac agacgcacca gcaacgaatc gagccgggaa 3840841 gtcggtagcc cgcatacgac cgaaatgccg gctgcgcgca tcctggcgac gagatgatcc 3840901 ccgacggtca cgggagtcac ggccatgccc cgatcacggc ggcctcgccc atgcgctgat 3840961 cgcgttccgg taggtaggcc gggccccagg cgcacaagaa tgtcaacgga actgaaccta 3841021 gggcccggat tttctgcggg acaccagccg gtatccagac cgcatcgccg ggcccgacct 3841081 cgccagattc gtctccgacc gaaaccagcc cgcgccccga gagaacaaaa tagatctcat 3841141 cggtggcttg caatcggtgc catacggtct cggctcccgc cgccacggtc gcatgggcca 3841201 gactgaccga ggcgacgccc acagtggccc gatccaccag gacccgaatc tcggacaagt 3841261 ccggcgccac gaacggctct gcctccctgg cgttgctgac gaacatggca gcagcgtgtg 3841321 cccgcgctct tggcggatcc ttgacgaatc ctcggaacgc gggtttgtga ccggcggaga 3841381 gcgcgacggt tgcctgcagc acagcgtctg tcgacgttga cgctcgctcc cgttcgggcc 3841441 gggttgacat cccccaccac cggccacaca atgcgcccgg tggatgagca gtggatcgag 3841501 atactcagga tccaggcact gtgtgctcgg tactgtttga cgatcgacac ccaggatggc 3841561 gaaggctggg cgggatgctt taccgaggac ggtgccttcg agttcgacgg ctgggtgatc 3841621 cgggggcggc ccgcattacg cgaatacgca gatgcgcatg cccgcgtcgt gcggggccgc 3841681 cacttgacca cggatcttct ctacgaggtc gacggggacg tcgccaccgg gcgcagcgcc 3841741 agcgtggtca ctctggccac tgccgccggc tacaagatcc tcggctcggg cgagtaccag 3841801 gatcgcctca tcaagcagga cggccagtgg cgtatcgcgt accggcgatt gcgcaacgat 3841861 cggctggtgt cggatcccag cgtggcggta aacgtcgccg atgccgacgt cgccgcggtc 3841921 gtcggtcacc ttctcgcggc cgcgcgccgg ctcggaaccc agatgagcga cacgtagggg 3841981 cgacaagcta gggccgacgt cggtgtacgg acacacgcgc tcgcgggttg gctgtgcagg 3842041 accttcccta accccatcat cggacgccga catgccgagc gagaaaatct aggaccgccc 3842101 ctgcgaaagc gtcgttgcga tcgccggcga ccatatgtcc ggcgccgcgc acatcggtga 3842161 actcgacttg cggaaaccgc gagagaaatt ggtcggcgct ttcttggcgg acgatgtcgc 3842221 tgacttggcc gcgcacgaga agcaccggca cttcgtcgcg caggatcgtc gcaacggctg 3842281 cattcatgcg gtcgacgtcg gtgacctcta cgggaggaaa cgccgcgata ccaccgatga 3842341 actgcggatc ccagtgccaa taccagcgat caccgcggcg gcgcaggttg gccaccaagc 3842401 catccggatc cgaaggccgc ggccgatgcg ggttgtagtt ggcgatgacg tcagccacct 3842461 cgtccaacga gccgaacccc gattccaccc gttcggccat gaacgcgtgg atcctgctcg 3842521 ccccggccag gtccatattc ggcacgatgt ccaccagcac cactgcgctg gcaatgcccg 3842581 gcgagagctc ccccgccagc agcatcgcgg caaacccacc caaggaggcg cccaccagcg 3842641 ccggctgccc aggcaggttg cgcagcactt cctggatatc gccggcgaag ctgaccaacc 3842701 gatagtcgcc ttcgctcgac cagtcggatt cgccatgccc gcgcagatcg atcgtgaccg 3842761 cttgccagcc acgttcggcg acagcggctg cggcccgacc ccatgagcgt cgcgtctgtc 3842821 caccgccatg caagaacacc acggcacgcg ctcgcgggtc tcccaagcgg tcggcgacga 3842881 tacgggcacc gcccggcccg tggaccgaga acgattcagc tgccattgat atcgggtcca 3842941 tcaggggatc cagaaccatc cgtttgcatg ccctaccacg atcctgtcct accgagcggc 3843001 ccgcagtcac cccagattcg gcgtcaatcc ggcacccggt tcgtggtcca tccacggaac 3843061 ccaaggcgcc attttcgcag tgattgcacg ctcggcgaaa ggtgttaccc agacgctaca 3843121 gctatgcgtg cccgtagaat gcaaatccct gctcgcggtc gaggtaggta tcggccttgt 3843181 tcttgatgaa aaacacatag acaatcagcg agaccgctat gcacgcggtc acgtaggcga 3843241 tgaacatcgg cacctgatcg cgttccttaa gagcctggta gatcagcggc gcggtgccgc 3843301 cgaagaccga gttcgccagt gcatagccga ctccgacacc aagggcgcgc acgtgcgcgg 3843361 ggaacagttc ggacttgacc agtgcattga tcgagcagta tccggtcaga atcacatagc 3843421 cgacggccac caatagaaac gacattgtcg gcgaacgtgt ttcgggaaga taagtaacaa 3843481 ggacgtaggt atagatgagt ccgccgacgc cgaaccacag cagcagtggc ttgcggccga 3843541 tcttgtcgct gatcatgccc ccgatgggct gcagcatcat caacagaatc agaccaacca 3843601 ggttgatcca agtagcggtc atcgcctgcg aaccgtagac actcttgacg atcgcaggtg 3843661 cattgacgct gtaggtataa aacgcgaccg tgccgcccaa cgtgacgagg aaacagagca 3843721 gcaatggctt ccaatagtgg gtggccagtt cacggagcga cccggagtcg tggtcccgcc 3843781 cggccttgat cgcagtcagg cgttcctgac tgagcgattc atccatcgtg cgccgcaacc 3843841 agaacaccac gatcgcggcg ccaccgccta cggcgaagcc gatgcgccag ccgaattcgt 3843901 gaacctgctc gcgggtgaag accgccagga tgactagcag ggtgaactgg gcaagcacgt 3843961 gcccacccac cagcgtcaca tactgaaacg acgagaagta gccgcgccgc tcccgcgtcg 3844021 cggcctcaga catgtacgtc accgacgtgc cgtactctcc gccggtcgca aatccctgga 3844081 cgagccgaca caaaataagc aggatcggcg cagcgacgcc aatgctcgag cgagacggca 3844141 ccaacgccac gatcagcgaa caggcggcca tcagcgacac actgaacgtc agcgcggccc 3844201 ggcggccgcg gcggtcggca aaccgaccaa agaaccacga tccgacgggc cgggtcacga 3844261 aggtaacagc gaagatcgcg tagacataga ccgtcgagtt gcgatcggcc cgatcaaaga 3844321 attggtcctc gaaatacgta gcgaacacgg tgtagacgta gacgtcatac cactcgacca 3844381 gattgcccga cgatccccgg atcgtgttcc aaatggcccg acgggtctcg gcctgactcg 3844441 ggcgcgatgg aggtgcaatg gaaacggtca tggtgtcctc catgcgattc gcattgtcgc 3844501 gccgtctgac ggtcaccata gtgaccgacg tcagcacccg ccgtgcaggg ctggagcgtg 3844561 gtcggttttg actctgcggt caaggtgacg tccctcggcg tgtcgccggc gtggatgcag 3844621 actcgatgcc gctctttagt gcaactaatt tcgttgaagt gcctgcgagg tataggactt 3844681 cacgattggt taatgtagcg ttcaccccgt gttggggtcg atttggccgg accagtcgtc 3844741 accaacgctt ggcgtgcgcg ccaggcgggc gatcagatcg cttgactacc aatcaatctt 3844801 gagctcccgg gccgatgctc gggctaaatg aggaggagca cgcgtgtctt tcactgcgca 3844861 accggagatg ttggcggccg cggctggcga acttcgttcc ctgggggcaa cgctgaaggt 3844921 tagcaatgcc gccgcagccg tgccgacgac tggggtggtg cccccggctg ccgacgaggt 3844981 gtcgctgctg cttgccacac aattccgtac gcatgcggcg acgtatcaga cggccagcgc 3845041 caaggccgcg gtgatccatg agcagtttgt gaccacgctg gccaccagcg ctagttcata 3845101 tgcggacacc gaggccgcca acgctgtggt caccggctag ctgacctgac ggtattcgag 3845161 cggaaggatt atcgaagtgg tggatttcgg ggcgttacca ccggagatca actccgcgag 3845221 gatgtacgcc ggcccgggtt cggcctcgct ggtggccgcc gcgaagatgt gggacagcgt 3845281 ggcgagtgac ctgttttcgg ccgcgtcggc gtttcagtcg gtggtctggg gtctgacggt 3845341 ggggtcgtgg ataggttcgt cggcgggtct gatggcggcg gcggcctcgc cgtatgtggc 3845401 gtggatgagc gtcaccgcgg ggcaggccca gctgaccgcc gcccaggtcc gggttgctgc 3845461 ggcggcctac gagacagcgt ataggctgac ggtgcccccg ccggtgatcg ccgagaaccg 3845521 taccgaactg atgacgctga ccgcgaccaa cctcttgggg caaaacaccc cggcgatcga 3845581 ggccaatcag gccgcataca gccagatgtg gggccaagac gcggaggcga tgtatggcta 3845641 cgccgccacg gcggcgacgg cgaccgaggc gttgctgccg ttcgaggacg ccccactgat 3845701 caccaacccc ggcgggctcc ttgagcaggc cgtcgcggtc gaggaggcca tcgacaccgc 3845761 cgcggcgaac cagttgatga acaatgtgcc ccaagcgctg caacagctgg cccagccagc 3845821 gcagggcgtc gtaccttctt ccaagctggg tgggctgtgg acggcggtct cgccgcatct 3845881 gtcgccgctc agcaacgtca gttcgatagc caacaaccac atgtcgatga tgggcacggg 3845941 tgtgtcgatg accaacacct tgcactcgat gttgaagggc ttagctccgg cggcggctca 3846001 ggccgtggaa accgcggcgg aaaacggggt ctgggcgatg agctcgctgg gcagccagct 3846061 gggttcgtcg ctgggttctt cgggtctggg cgctggggtg gccgccaact tgggtcgggc 3846121 ggcctcggtc ggttcgttgt cggtgccgcc agcatgggcc gcggccaacc aggcggtcac 3846181 cccggcggcg cgggcgctgc cgctgaccag cctgaccagc gccgcccaaa ccgcccccgg 3846241 acacatgctg ggcgggctac cgctggggca ctcggtcaac gccggcagcg gtatcaacaa 3846301 tgcgctgcgg gtgccggcac gggcctacgc gataccccgc acaccggccg ccggatagca 3846361 cgaccggttt gcgcggatgc gtcggcgttg ttccccgccg cggttggcgt gctctggcaa 3846421 tctggtctaa gggacccgac cccaccgggc ggaccccacg gcatcgaggg gctgtcgatg 3846481 gcattcgaaa agccgtcacc ggtaacggca ttgacgcagg aactacgatt cgcgacgacc 3846541 atgacgggcg gcgtcagcct cgcgatctga atggccggtg ttacgcggga gatcaacctg 3846601 ctcgcgcagg cctcacaatg gcgcaggctg gggggaacct tcccgaccaa cagccaactc 3846661 accaacgagt cagccgcttc cctgcggctc tacgctcaac taatcgacct cctcgacatg 3846721 gtcgtcgacg tcgacatctt gtcgggaaca agtgcgggcg gcatcaacgc ggctttgctt 3846781 gcgtcatccc gagtcaccgg gtctgacctg ggcgggatcc gcgacctctg gctcgatctt 3846841 ggggccttga ccgagcttct ccgagatccg cgggacaaga aaacaccgtc cctcttgtac 3846901 ggcgacgaac gcatattcgc cgctctggcc aagcggcttc ccaagctggc gaccgggccg 3846961 ttcccgccca cgacctttcc ggaggccgcg cgcaccccgt ccaccaccct gtacatcacg 3847021 acgacgctgc tagccgggga aacaagcaga ttcaccgact cattcggcac tctcgtccag 3847081 gatgtcgacc gccgcggtct gttcaccttc accgaaaccg acctggcgcg gccagacacg 3847141 gcgccggcgc tggcactagc agcgcgcagt tccgcctcat tcccacttgc gttcgaaccc 3847201 tcctttctgc cgttcacgaa gggaaccgcc aagaagggag aggtgccggc tcgaccggcg 3847261 atggcgccgt tcaccagcct tacccgtccg cactgggtta gcgatggtgg cttgctggac 3847321 aaccggccaa ttggcgtttt gttcaagcgc atcttcgacc gtccagcccg acggccggtt 3847381 cgccgggtgc tcctgttcgt cgtaccatcg tccggacccg cacccgaccc gatgcatgag 3847441 ccaccaccgg acaacgtcga cgagccactc gggctcatcg acgggctgct gaagggcctg 3847501 gccgcggtca ccacccagtc gatcgcggcc gacctacgcg cgatccgcgc ccatcaggac 3847561 tgcatggaag cgcgcacaga tgccaaactg cggctcgcag agctggcggc aacgctgcgg 3847621 aacggcacac ggttgctcac cccgtccctg ctcacggatt accggacccg cgaggcaacc 3847681 aagcaggccc agaccctcac cagcgctctg ctgcgccggc tttccacctg tccgccggag 3847741 tcgggcccgg caaccgaaag ccttcccaag agctggtcag ccgaactcac cgtcggtggt 3847801 gacgccgaca aggtgtgccg gcagcttgca tcatttcggt gcgttctaca agaggtcatg 3847861 gcgagccaat gactggatgt ggggccgact cgacggagcg ggatggctcg tccacgtgct 3847921 gctagacccg cgccgggtgc gctggatcgt cggggagcgc gccgatacca acgggccgca 3847981 gagcggtgca caatggttcc taggcaaact caaagaactt ggggcacctg actttccgag 3848041 tccgggctac ccgctgccgg cggtcggcgg cgggccggcc caacatctga ccgaggacat 3848101 gctgctcgat gagcttggct tcctggacga cccagcaaag ccgctgccgg ccagcattcc 3848161 gtggaccgcg ctgtggttgt cgcaggcgtg gcaacaacga gtcctcgaag aggaattgga 3848221 cggactggcc aacacggtgc tcgacccaca gcccggaaaa ttgccggact ggagcccgac 3848281 gagttcacga acatgggcga ccaaggtatt ggccgctcac cctggcgacg ccaaatatgc 3848341 tctgctgaac gaaaatccaa tcgcaggcga aacattcgcc agcgacaagg gctcaccact 3848401 gatggcgcac acggtcgcca aagccgccgc gactgcggcc ggagcggccg gctcggtccg 3848461 gcagctgccc agtgtattga agccaccact gatcacgttg cggacactca ccctcagtgg 3848521 ataccgagtg gtctcgttga ccaaaggcat tgccagatcg accattatcg ccggcgcgct 3848581 gctacttgtg ctcggcgtcg cggcggcgat ccagtcggtg accgtgttcg gagtcactgg 3848641 cctgatcgcg gccgggactg ggggcttgct ggtcgtccta ggcacttggc aggtctccgg 3848701 caggctcctt tttgcactgc tgtctttctc ggttgtcggc gcggtactcg cgttggcgac 3848761 gcccgtcgta cgcgaatggc tgttcggcac ccagcagcag cccggctggg taggcactca 3848821 cgcgtattgg cttggcgccc aatggtggca ccccctggtc gtcgtcgggc tcatcgcact 3848881 ggtggccatc atgatcgcag cggccaaccc aggacgacgg tgacgatgcg tgcggtgatc 3848941 cggaattcag gaaccgaggc ccgcggcgcc gtcagccgcc gccaactgat cgagcgcttc 3849001 gccggtgtag acggcgagcc gctgcaggtg tggcagtgtg tcacggcagc cgatgaatcc 3849061 aaagttcagc gtgccggcgt aactctgcaa agtaacgttg agagcctggc tgtgcgccac 3849121 cagggagacc ggataggacg cctccatccg gctgccccgc aggtagagca cgtcctcggg 3849181 ccccggcaca ttgctgacac acaggttgaa cgtgtacggc cagggtggct tcaccccact 3849241 gagcgtgctg gccaactgca ccccgtacgg cgccatcaac gcggcgctat aggccaggat 3849301 cgcgtccttg tccatggacc tcagctgagc cttggccgcg cgggttgacg ccgtgaccgc 3849361 cgccagccgc tgcaccggat cggcaacgtc ggtacccaac gtcgccagga tggtcgcgac 3849421 cgcgttgccg ccgccctcgt cgtccttggg tcgcacgttg accggcaaga ccacgatcag 3849481 cgacttgttg ggcagctcac ccagctcgtc cagaaaacgt cgtaagccgc ctccgatgat 3849541 cgccaacgcg acgtcgttga ttgtggcatc atattgagcc ccaatggctt tcagtcgatc 3849601 cagcggatat tgctgggtgg cgaagcggcg gttgcggctg atgcgggtgt tgagtatgca 3849661 gtgcggcgct tgcaccgagc cgacgaggtt gcggtactcg tgatcactgc gcagctgggc 3849721 gttgaccagc gccttggtga gctcgaacgt cgatcgtccc gcaccggcca ccgaacctaa 3849781 gacgctaccg accccgctga cgcccgccca acccccgcac cacgtcgccc aggccatcga 3849841 gcacgttacc ggccccagct atcaaaccgc cgccgacgga gtcttgagtg tcggcgggtg 3849901 atcggccagg tgtgggaatg ttgaagaaca acgggtgggt ggtgtcgtgc gggtcggtgg 3849961 acaggctgcg ggccagcatt ttctggccgg tatagccgtc tatcaacgag tggtgcatct 3850021 tgatgtagat cgcgaaccgg ccaccttcga ggccttcgat gaaatgcact tcccacggcg 3850081 gacggcgtag gtccagggcg tgactatgca agcgggacac cgggatcccg agttcacgct 3850141 cgtcgccagg gctggccagc gccgaccggc gcacgtggta gtccaggtcg aagttgtcat 3850201 caacgaccca ggactgcgtg ggatggtata gcagctccgg atggctcagt cttaggctcc 3850261 agggttcgac gacctcgctg gccttgcttt cgtcgacgag ttggcgcagc aagtccggcg 3850321 gcgcacccga gggcggcgtg aacggcatca acgcaccaac gtgcatcatc gtggtcgacg 3850381 attcggagta caggaaaaac atgtcctgcg gacccaaccg ccgggccgtc tggctcacgg 3850441 gccactcctt cgttggaggc attctcaggc cgtctagcgc cgccagataa ctaccgtaga 3850501 tgatcgcggc cgtgcgttgt acgccgcatc acatcgcgtg aactccgttg tagagcagca 3850561 gtaaaccgat cacgaccaga atggccgcga ccatcccggc atggttcttc tccatccagt 3850621 ctttaagtcg ttccagcgaa tcgtcgagtc ggtcaccggc agccacgtag gccaatatcg 3850681 ggatcgcgac cgtggatgca gccaacatgg caaagaatgc cgtgtaaatc caggaacccg 3850741 cggcgccgtg gccgccgctg ccgatggcca atccggccgc cgcgcaaatg atcagcacct 3850801 cgggtctcac caccaccagc acggccccta ccaatccggc gcgtgccggg gtgaagctgg 3850861 cgaatgcgcg catccagccc ggcatttcgg tgtggcgatg ccgggtcagc caccgaagca 3850921 cgccgaacac gatcagtgcc gacccgagga ccacccgtag ccaggatgcc caggccggcg 3850981 atgttgtgct caaaccgcca agtgcgccgg aggccgcaac aaagacggcg gtcaccacgg 3851041 ccaagcccaa cagccagccg cccaggaagg ccaggctgct cggccgcggc tgcggcgagt 3851101 gtacgaccag taccgctggg atcaccgaca acggcgagag cgcaatgacc aacgccagcg 3851161 gcacgagccc ggtgagcacg gagacccaat gacctgccac gggcagcaat cctcgcattg 3851221 acaccgcctc ggtgaccacc gagcgcccca attcgacgaa attgcgcgca cccaaccgcc 3851281 gttcgggctg ttgatagcca tcgcgagacg tcgatcgccg aaacgtacgt gcaaagacgg 3851341 cggcttggtg agccgacgat taacgacgct gcccggccaa acgctcgccc tgacagaatt 3851401 gcgacccgaa gtccaccgtc acggtcgacg gcgtgcggaa ctgagcggcg aaggtgagct 3851461 ggggatccac cggcgcctga tcgagcccca gtcgctttgt ctcgaggttg atctccacgg 3851521 tatcgggggc agttctgcgc gcattggtgt tccggtccgg ccgcatgttc ggctcggtgc 3851581 ctctgctttc gccgggcttt ctgatggcaa gttcatcggt gtcctgctgc gggcccagct 3851641 cggcgaactc cttgccattg ttggcgatgg tgtaggtcag caggtaaccg gcaaagcccg 3851701 acgcaaacga gcccgatggc gacggtggca gcggctcggc gaatcgaact accagccgaa 3851761 ggaccgcgcc tcggggatgc gagacgtcca cggaggccac ggtaatggtc gcgggcggtg 3851821 tcggcccggg ccgcaactgg caactgagtc gcttcggcag ctgcgaccag tcgtctggca 3851881 gcggcctgat ccaggcgtag accgcgacac ctaccatcgt cagcactgcc gcgaccggaa 3851941 ctgtcaaccg caggccggcc ggcatgccca gccagcggct ccggagaccg tcgacacgca 3852001 gggccagcgg tgatcgtggt gcagaggggt tgggtcgacg atgtctggtc cagcggtcac 3852061 cgtcccaata tcgttgcccc gctgaaccgt caggatcggt ataccatcct gccggcggcg 3852121 aagtcgccac gtcgtgctcc attcaacagt cggtaaggat cagctgcggt gccgctcctc 3852181 gcggactacg gcggcgcatc gaacaactcc ggtagcgaat cgaggatctg cacgtggtcg 3852241 ccgcgccacg caaaccccac aatcgtcaag atgccatctt ggcagccgtc acagctttga 3852301 cgtgtccgat agctcagcac cacgatgtcg tttgtggatg ccggaccgat caaattggtg 3852361 aacgggtagg ccctcggcgt tgcggttccg acgaacgttc cccgatgaaa catcagcgcc 3852421 tggtccgggg agctgttggt ggcgtcttgc accgtcacca gcaccgcgga caggtccgcg 3852481 cacgggtcgt agttgctgtc ctccggcgta ctattccacg gcctgccggt tttggaatcg 3852541 ggggcaagct gggccagcgc ggcgcgcacg gccgttgcct cgtccggccc acacgggcca 3852601 acctgggatg ccggggaagt ggggcgcgcc ggtgcggacg tcgttgccgg cgccgcctgg 3852661 ttggcacctg gtcggacccg atgcataccg gcgtacgcaa cgaccgcggc agccacacag 3852721 gccaggacca cgagcgccac cagccaggcg gtgggccaag acccgcctgg ggcgggcgga 3852781 gtggtatcga cgtcatcgga cggctggtat gccggcgcag gccagtcggg gtcaatctcg 3852841 tcagacacct aacccgctaa ccctcccggt acccgcccgc tggctgtgcg atacttgccg 3852901 agcttgccga attgtagcca gaacgtgcag gtagcggaaa caagcgggcc gtctcgaggg 3852961 gccccgccgg ccggtgaggc tgaccacatc cagcattctg atagctggct tcacagcaat 3853021 ctggccccat actagacgtc atgcagcaag cgacggcacc gcaaccgctg gcagcgcgcc 3853081 agttggttcg acggcgcctg gccgaggcat atgatggcgc gttctgaggg caatcgccca 3853141 cgccatcgcg ctgtgcctca gccgtcgcgg atccgcaagc ggctgtcgcg gggcgttatg 3853201 acgctcgtgt cggtggttgc cctgctgatg accggcgcag ggtattgggt agcccacggc 3853261 gcgctgggcg gcatcaccat ttcgcaggcc ctaacccccg aggatccccg ttccagcggc 3853321 aacaacatga acatcttgct catcgggctg gactcgcgca aagaccagga aggcaacgac 3853381 ctgccctggt cggtcttgaa gcagctacac gcgggcgatt ccgacgacgg cggctacaac 3853441 acgaacacgc tgatacttgt gcacgtcggt gccgatggca aagtggtggc cttctcgatc 3853501 ccccgcgacg actgggtgcc cttcaccggc gttccgggat acaaccacat caagatcaaa 3853561 gaggcgtacg ggctgaccaa gcaatacgtg gcagaacagc tggccaacca gggtgtgagc 3853621 gaccggaaag agctcgagac ccggggccgt gaagctgccc gggccgcgac cctgcgggcg 3853681 gtgcgaagcc tgaccggcgt cccgatcgac tacttcgccg agatcaattt ggccggtttc 3853741 tacgatttgg cccagaccct cggcggcgtt gatgtgtgcc tgaaccatgc cgtctacgac 3853801 tcgtactccg gagccgactt ccccgccggg cgtcaacggt tgaatgccgc gcaggcgctg 3853861 gcgtttgtcc ggcagcgtca tggcctagac aacggggacc tggaccgcac ccaccgccag 3853921 caagcattcc tgtcgtcggt catgcgcgaa cttcaggatt cgggcacctt caccaacctg 3853981 gacaggctcg acaacctgat ggccgtggca cgcaaagatg tggtgctgtc ggccggctgg 3854041 gacgaggacc tgttccgccg gatgggcgac ctggcgggcg gtaacgtcga attccggacg 3854101 ctgcccgtgg tgcgctacga caacatcgac ggccaggatg tcaacattat cgacccgacc 3854161 gcgatccggg ccgaggtagc ggcggcattt ggcagcgcgc cgccaacgtc gcagaccgcc 3854221 gcggccgcca aacctaaccc atccaccgtc gtcgatgtgg tcaatgccgg cagcatcagc 3854281 ggactggcca gccaggtctc cggtgcgctg ctgaagcgcg gctacaccgc gggtcaggtg 3854341 cgtgaccgcg aatccggcga tccgttcacc accgccatcg agtacggtgc cggcgcggaa 3854401 acggacgccc agaacgtggc agacctgctc ggtatcgacg cccccaacca tcccgatccc 3854461 gccgtcgcgc ccggacacat ccgtgtgacg gtggatacca acttctccct accggcaccc 3854521 gacgaagcca ccgccgccgc gacgtccacc gaaaccagca catatccgct gtacggcggc 3854581 ggcaccacca ccgacccgac accggaccaa ggggcgccca tcgatggcgg cggcgtgccc 3854641 tgcgtgaact aggtaagtta tccgaccact ccacgcagcc cgtcggcgcc gaacaccggc 3854701 tccagcatgg gcgagaagtc cgggcccctt cgcagcatgt ggccgccgtc gacgttgatg 3854761 acctgtccgg tgatccaact ggccgcgtcg ctgagcaaaa acattgctag gttcgcgacg 3854821 tcttcgacct cacccacccg cggtaatggc gtgcagaccc ggtagtccgc gctcagctcc 3854881 ggcgactctg tgacgggcac aaccagatct gtacggatca ggcccgggcg gatgctgttg 3854941 acccgtaccc acgacgggcc gagttcgtca gcggccagtt tcatcatgtg gtcaacggcc 3855001 gacttggtga ccccgtaggc gccgaaccag cgatgggtgt tgctggccgc gatcgaggag 3855061 atgccgacga acgaaccgcc gccgccgcgt accaattccc gcgcggcgtg cttgagcacg 3855121 tacatggtgc cattgacatt gaggtccacg gtgcgccgcc aggcctgcga gtcgatctgg 3855181 gtgattggcc caatggtctg agacccgccc gcgcaatgca ccacaccgtg cagccggcca 3855241 tgccacgcgg ttgccgcgtc caccacacgc agggtctgct cctcgtcggt gatgtcggcc 3855301 ggctcatagc cgatcgctcc ggtcttgagc gcctcgatgt ctttgacagc cgccgccagc 3855361 ttgtctggat ttcgtcccac gatcatgacg gcggctccag ccgcgaccaa cccggcggcc 3855421 acccccttgc cgattccgct gccacctccg gtgaccaggt aggtccggtc ttggaaagaa 3855481 agctgcactt gaggcccctc acgccgaaac tgaaacaggt tctcgccatt ttggaccatg 3855541 cggcccgtca cttgcgccga aggtgaactc acggcgaggt ttcgcggcgc tcgcgaattc 3855601 atgccctcag ttcacgttcg acgttcgtga tcaacggtgc cgccatcgtg gagggattcc 3855661 ataggttgcg gcttgttgcc acattgcggc cagtgtgcgc cgccgggtgc gcgtccacgg 3855721 taggcttcaa ccacgaatta tcgggcaacg atatcggagt cggagttggc aataactggt 3855781 tcggccgcac cgtcatggcc gcgactattg cacgccgagg gccccccttc cgtcatttgt 3855841 atacggctgt tggtggggtt ggtgtttctc agtgagggaa tccagaaatt catgtatcca 3855901 gatcagctgg gtccgggccg cttcgagcgg atcggcatcc ccgccgccac gttcttcgcc 3855961 gatctggacg gggtggtcga gattgtctgc ggcacactgg tcctcctcgg cctgctgacc 3856021 cgggtcgcgg cggtgccgtt gctcatcgac atggtgggag cgatcgtgct gaccaaactc 3856081 cgagcactgc agccgggcgg gtttctcggg gtagagggct tctggggcat ggcccacgct 3856141 gcccggaccg acctgtcgat gctgctcgga ttgatcttcc tgctgtggtc cggccccggc 3856201 cggtggtcac tagataggcg actgtccaaa cgcgccacgg cttgcggcgc gaggtgaacc 3856261 cgcgacgtag cgcgaccgat gcaccggact caacgacgag tcagcggtgg cgtcgcgaat 3856321 gaactgcccg atctgacgca acgaacgggt cgcttcgggc accagcggtg tggcgagttg 3856381 gaaaagatga gcctgaccgg gccaaacccg tacctcggca cagacgcctg ccgccgccag 3856441 cttgccggcg cccagctgcg cgtcgtgcag cagcacttcg gagccggaaa cgtgaataag 3856501 tgtcggcggc aagctggatt cgatatggtc gagcggctca tagaggtctt cgggcctgcc 3856561 gtcgaccatg ttcttggcag cggccgccct gacccatgcc gccaaggcat cgaatgcccg 3856621 cgccggaaac atcgcgtcgg tcccgatgtt gggatggtcc tgcttgggcc ccttggccag 3856681 ctgcagcaac ggagagatgg ccactattgc cgccggtttc tcgtcgtcgc actgcagccg 3856741 ctgcgcaagc gcgagcgcaa ggtaaccacc cgcggaatca ccggccaaca cgatctgttc 3856801 cggccggtat ccgcgcgccc gcaaccattg gtatgcatcg tggcagtcgt cgagcgccat 3856861 ccccagcgaa tgcttaggga tcagccgata gtcgactatc aacacgggtg attcggcaaa 3856921 tcctgacagc gcgttgacga tcctgctgtg cgaattcggc ccgcacatga caaacgcgcc 3856981 gccgtgcaaa tagagcacca cccgcccagc gccgtcggcc gcccgcaccc caggcgcacg 3857041 caccaactgg gcggtagcat tcggcaaatt tatcgttgtt cggaccgtgc cctgcccggg 3857101 gcgccaaacc ctgcatgcga agtcgacgaa ccccaacggc agaggcaggg gcgataggta 3857161 actgcccaca gtcataagtg gcttgatcgt catgcgcgat gccagtgccg ccaaccgacc 3857221 tgcaacacta gggccgcttt cggtgatctc gatgggagcc ccgtcccagc acgaatcgga 3857281 attcgagcat cccgacgatt gcaggggccg gcgtgcgtaa tacgaggaca ttttcagcac 3857341 gtttcgccgg aatgtggccg gtggttggcg ttagctgcac ggaagcgcct gagctggccc 3857401 gccgtcaccg cccgatttat caatcgcaaa tctcgcactt cccgtttacg tagttgctcc 3857461 aaccagacgc agcccaattc gggctcctcc ccccatcaat cattcggtgg cgcgaagttc 3857521 accagagtcc cggacacgct cacgcgaact acctgcattt aggggatcac aggcaccttg 3857581 aaatgcatcg gtgtatgact gggagtttgc tgtacgtcta ttggtaagtg cgaattcgcc 3857641 gccggctacc cgcaccccgt agaatcgcaa gccgatatcg gcttggtcac ctgaggtgtt 3857701 ctatgcggga gtttcagcgg gccgcggtgc gcctgcacat cctgcaccac gctgccgaca 3857761 acgaggtgca cggcgcgtgg ctgacccaag aactgagccg gcacggctac cgggtcagcc 3857821 ccggcacgtt gtacccgacc ctgcaccggc tcgaagccga cggcctgctg gtgtccgagc 3857881 aacgggtcgt cgacggccgc gcgcgccgcg tctaccgggc taccccggct ggccgggcag 3857941 cactgaccga ggatcgccgg gcactggaag agctggcccg cgaagtcctc ggcaggcaat 3858001 cgcacaccgc tggtaacggg acctgaaccg cgtcgacggt acccatcgcc ggggccaaac 3858061 cgtgacgacg tctgcagcgc aatgcgggct tggcttacag ttatgtaatg tctaccaaat 3858121 ctgaccacgg cgaaatcggt gacgtcgaac cgctggcaga cagcaccgcg agccaggcca 3858181 ggcgagtcgt cgccgcatat gcgaacgacg ccgacgagtg tcggatcttc ctgtccatgc 3858241 tcggtattgg accggccaaa ctcgagagct aatggctccc tcgggaggcc aggaggcgca 3858301 gatttgcgat tcggagacct tcggggactc tgacttcgtg gtggtagcca atcgactgcc 3858361 cgtcgatctg gagcgtcttc ccgacggcag cacaacctgg aaacgcagcc ccggaggctt 3858421 ggtcaccgcc ttggagccgg tgctgcggcg tcggcgcggg gcctgggtcg gctggcccgg 3858481 cgttaacgac gacggggccg aacccgacct ccacgtgctg gacggcccca tcatccaaga 3858541 cgagctggaa cttcatccgg tacggctgag caccacggac atagctcagt actacgaggg 3858601 attctccaac gccacactgt ggccgctgta ccacgacgtc atcgtcaagc cgctctacca 3858661 ccgcgaatgg tgggatcgct acgtcgacgt caaccagcgc tttgccgagg ccgcgtcgcg 3858721 cgccgccgcc cacggcgcaa ccgtgtgggt acaggactac cagctgcagc tggtaccgaa 3858781 gatgctgcgc atgctgcggc ccgatctgac catcggtttc tttttgcaca tcccgttccc 3858841 gccggtagag ctgtttatgc agatgccgtg gcgcaccgag atcatccagg gcctactggg 3858901 cgccgacctg gtgggcttcc atcttccggg cggtgcccag aatttcctga tcctgtcccg 3858961 gcgtctggtc ggcaccgaca cttcccgcgg aaccgtcggt gtgcggtcgc ggttcggtgc 3859021 ggcggtgctc gggtcccgca ccatacgagt tggcgccttt cctatctcgg ttgactccgg 3859081 cgcgctcgac cacgctgccc gcgaccgcaa catcaggcgc cgggcccgcg agattcgcac 3859141 cgaactggga aatccgcgca agatcctgct cggtgttgac cggctcgact acaccaaggg 3859201 catcgacgta cggctgaagg ccttttccga gctgctggcc gagggccgcg tcaaacgcga 3859261 cgacaccgtc ctggtccagc tggctacccc gagccgcgag cgggtggaga gctaccagac 3859321 gctgcgcaac gacatcgaac gccaggtcgg ccacattaac ggcgagtacg gtgaggttgg 3859381 ccatccggta gtgcattacc tgcatcgacc ggctccgcgc gacgagctta tcgctttctt 3859441 cgtggccagc gacgtcatgc tggtcacccc actacgcgac gggatgaacc tggtggccaa 3859501 ggagtacgtc gcttgccgca gcgatcttgg cggtgccctg gtgctcagcg aattcaccgg 3859561 ggccgcagcc gaactccggc acgcatacct ggtcaacccg cacgacctgg aaggcgtcaa 3859621 ggacgggata gaggaagcgc tcaaccagac ggaggaggcg ggccggcggc gaatgcggtc 3859681 gctgcgacgc caagtgctcg cccacgacgt ggaccgctgg gcacagtcgt ttctcgacgc 3859741 tctcgccggg gcacacccga ggggccaagg ctaacggtca agccgctccc gctcgcgagc 3859801 agacgcagaa tcgcccattt cggcacgaaa ttgggcgatt ctgcgtctgc tcgcgccctg 3859861 gaagctggtg cggctgccca aaggctgtga tactcgatgg agcgcgaagg cccgaaggag 3859921 ggcatgtgaa catccgttgc ggactggccg ctggggccgt catctgctcg gccgtcgcac 3859981 tgggaattgc gctgcactcc ggtgacccgg cgcgtgcgct cggaccgccg ccggatggca 3860041 gttactcctt caaccaggcc ggagtgtccg gggtgacgtg gacgattacc gcgctgtgcg 3860101 atcagccgtc gggaacccgt aacatgaacg actattctga ccccatcgtt tgggcgttca 3860161 actgcgctct caacgtggtg agtacgacgc cccaacagat cacccgtacg gaccggctgc 3860221 agaacttcag cggcagggct cggatgagta gcatgctgtg gaccttccag gtgaatcagg 3860281 cagacggcgt ggcgtgtccg gacggcagca cggcaccgtc cagcgaaacc tatgcgttca 3860341 gcgacgagac gctgaccggg acgcacacca ccgtgcatgg cgccgtgtgt ggcctgcagc 3860401 caaagttgag caaacaaccg ttttcactgc agctcatcgg cccgccaccc agcccggtcc 3860461 agcgttatcc gttgtactgc aacaacattg cgatgtgcta ttaaatcggc gtgatgtagg 3860521 cgatcagcca tttgccgtca atccgctgga aatccacccg cagtcggctg ccgtcgtaga 3860581 gcggctgtcg cgtcttgtcg gtcacggtgc ggttcaaata gaccatcacc gatgcgcaat 3860641 cgcgtttggc atccatgact cccacaccga cgacattggc ctggaccacc acttcacgct 3860701 tcttcgcctc cgggatgatc tgcgcattgg cgctcttctg gaactcctgg cgatagtccg 3860761 gcgtcagcag cgggtacacc gcggtgaggc tgcgctcgac agtttggtag tcgtaaccga 3860821 agacttgtgg gatttcctgc atggccagct tcggtaacag tgcccgcgcc gacgcttcgc 3860881 ccccggtctg cacccggtcc cagtagaacc agccaccggc cgcggacaaa cccacgatgg 3860941 tggcgaccat cagcgcgtag gcgacggaaa tcaaccgtct catcagttgc ccccgtccgg 3861001 gtacttcagg tcgtagccgg tcatccggcc gttctcgtcc tcatgcacga tgacccgaag 3861061 acgatagggc atggacggct tgttgacgcc gtcgatatcg gcgaccgtca cccgcaccga 3861121 caccaatacc gatgcgttgt cgctgatttc gtcaatgccc tccaacgcgg cgccgttgac 3861181 gacggcctcc gatgtcgcgt tggtggcccg gaatagaccc ttgaggttgt ccacgttgtt 3861241 gttggcgttc agcatgccgc gtagcggccc actggtgccg ttgacgaacc ggttcacgct 3861301 ctcgtcgatg gtgtccggcg tgtagctgaa catgttgacc acggtctggg tggcggcatc 3861361 gacaaaacgc tggttgcggg cttgccgggc gtccgcatcc cggttctgca taaccagtgc 3861421 ggtcacaccc catgccagcg cggcaatcgc caataggcct gccgccagcg aaagccagcc 3861481 gaccaggacg cggtgtgccg gccggcgtgg cggcggtttg accggcctca gggcgggctt 3861541 ggcggctttc gccggcttcg attcggtccg cgccgcggcc ctcaccgtcg ccgcaccctg 3861601 ggcgggacgg ctcgactcgc cttccgccgg acccgcgggg cgggacgcct tacgacgagc 3861661 gcgccgcgtc gtcgactgct gtccgccggc tacaccggta tctgcggcca ctacagctgc 3861721 ctcggatcgc gcatgagatc cacccaattc tcggcgctgg atgcgcccgt catcccgggc 3861781 gcgaagatac cagtgccgcc ggccgggtcc gcgaaggctc cgctgagttg gtcatagatg 3861841 gtgtaggccg gaccgctggc ctgcggttgg gggccaggcg ccggcccggg cggaggcccg 3861901 gtgccttccg gtggtggcgg cggcggaatg gtcgccgggt atggcacctg gggtggctcc 3861961 ggcgggtacc cgggcggcat ccatgacgtg aacggtggag gcggaccgtt gtcgttgggc 3862021 ggcggcgcag gctgcgccgg ctggtgcggc gccggcccgg ggccggccac ctggccgggt 3862081 ggcggcggtc cgacgatggg cacgcccggg tcgggatccg cgcccggcgg gatgtaaggg 3862141 aacttgttgg gcggcagaat gtttcgccca tccgtcacct cggtgccgta cgggatcggc 3862201 ggaccgcgcc acgggttggt tccaactggc acatagccac gcggatcccg acataactgc 3862261 accgtcggtg cccgcttacc ggggaattcc tggcacgggt agttgcgagc gccgcgcacc 3862321 gtgctcgggt cgttctgcgc ggtcttgcag tacatgtccc ggggaatctc gcgtaccgac 3862381 tcgtcggccg gcgaccggac cagcggcggg ggcaagaacc cggtcatgca gggcggcggg 3862441 tcgtgcaggt cgatcttgaa gtccagcttg gcgccctcgt cctggggtac gccgcccgcc 3862501 gaggtgatga tcgcggcgaa cagcgccggg aaaaccacca ggagctgttc gatcgacttg 3862561 tgatagatca cgcccacccg gcccaggttg gccagactgg ccgccagcgc gggaaacgaa 3862621 ggacgaatcc cggagaacgc ggtgttggcc tcgtcgatcg catccggggc gccggccaac 3862681 gtgtcgcgca gccgcgggtc tgccgcacgg agctgccagg tgaaccgcgc cagcccatcg 3862741 gcgagtgact tgatgtcccc gccggcgcgg atctgggctt gcaggaacgg gccggcctga 3862801 tcgatcaact gcgaaacctg tggatagttg gcgttggcct catccaccag caaccgggcc 3862861 gactcgatca gccgggccag ttccggaccg gcgccattgg tcgcgatgaa cgcctcgtgc 3862921 agcagctccc gcagccgggt gtcgccaagg ctgccgagca gcgtctcggc ctgacgcaac 3862981 aggtcggcga cgtcttgccc gattcgggtg ttctgccgct ggatccggaa gccgttgcgc 3863041 aacttggtcg acgacgggtt ctccggcggc actaggtcga tgtactgctc accgatggcc 3863101 gaaacgctgc gtacggtggc ggtgacgttc gacggaatgg cggtgccact gttcagtcgc 3863161 atgtgcgcgg taacgccatt gggatttagc cccaccgact ccacccgccc gaccgcgaca 3863221 ccgcggtagg tgacgttggc gttcttgtac aggccaccgc ccgcgacgaa gtcggcactc 3863281 acgccgtagg ttccgatgcc gaacgtggcg ggcagacgca gataaaagat cgccatcacg 3863341 ctcagggtga tgacggtgat caccgcaaaa atggacaact ggatcttggc gagtcggtcg 3863401 atcatgtccg ggcccctact gtcccgacgc cgtaccgggt ggaatcttaa atgggtcggc 3863461 cgcttgcccg gacaggttgg ccagttcgcc aatcaggaag tcgggcgggt tgaggatctc 3863521 gtccatgtgc gccatgttcg ggtcgaagta cgccgtggtg aagaacgtct caccaatccg 3863581 gcgcagggtg aggtcgaagg tggtgaacac gttaagatag tcgccgcgca ccgcctgctt 3863641 gataccgaag ttgggaaatg ggaacgtcag caacagctgc agcgaggtga cgaaatcctt 3863701 tcggtcgtcg ttgagggcct tgacgatcga gtagaggtct ttgaggtctt caccgaaatc 3863761 caccttggtc tcggccagca cgtgcgacgt gaccatcgtc aaccttttga gcgcggcgaa 3863821 cgcgtcgacg atgtggtccc ggttctggtt gagcacgcga accgcgtcgg gcagcgtgtc 3863881 cagtgctcgg cccaggttgt ccttgtcacg tgccaggatc gcggagactc ggttcagccc 3863941 atccaacgca tcgatgatgt cgtgaacctg ccggttcagg cccgccgtca actccgcgag 3864001 cctggggacc aggttgacga actgggcctg ccgacccgcc accgcctggt gggtctcgtc 3864061 aatgatctct tccaacgcac cgacgttacc cttgttgacc accaccccca gcgccgagaa 3864121 aacctcctcg gtggtgggga atcggtcggt gttggcctcg gtgattctcg agccgtcaac 3864181 caacctcccg gtcggcgggc ggtccgtcgg tggcgccagc tctacatgta acgaacccag 3864241 cagcgaggtc tgggagacct tcgccacggc gttggccggc agcaacacat tcttgtccag 3864301 gtccagcttc acggcggcat aaaaggatcc gtcgggtcgt tggaccgcga caatgccggc 3864361 cacgctgccg acggtgacgt catcgaccat gaccggtgag ttctgcggca acgtcgccac 3864421 atcagccatt tcgacggtga ccgagtaggc accttcaccg tgcccggcgg tgccaggcag 3864481 cggcagcgag ttcagcccgc caaactgaca gccggcaagc agcgcgctgc tggccgtcaa 3864541 tatgatggcg cgcaaccaga ttcggttcat ccgccgcccc catgctcgcc cggtcctgcc 3864601 cccggcgccg gcggggccgg tgccggaccg ggcgccggtg ggacgagtag gctctgcaga 3864661 tccgcggggt tgcccacagg cgctcctcct cccgcgggta cccaagtcaa ttccggaacc 3864721 ggcgtctccg acttggcctc ggtggccggg gtgtcgtaga tgatctggcc cttgtacgcc 3864781 gtgatcgtgt taagcgggtg gaacatgatc ggcgggtaat tcaccgtgag ccggcgcagc 3864841 accggcccca gccgctcacg gcagatctcg gcgcgccggt agtagtccgg cgccgacggg 3864901 cccgcggcgg tatcgaagga accgccgcag atgaactgca ccgggttagc gaagttgggt 3864961 atcgacaaca gaccgttgag ggtgccttgc gcagggtcat agatgttgta gaagttggtg 3865021 atccccggcc cagccacgtg cagcacttgc tcgatgttct cgctctggtc actcaacgtc 3865081 tgcgcaaagt cgttgagctg attcaccgtt tcgatcagcg tcgagttgtt ctcgcgcaag 3865141 aaccccctga tgtcggacag cgcctggttg agcgtgccca gggtctggtc cagattggcc 3865201 gagctgtcgg cgagcacctg cgacaccgat gccacgtggc cggcgaactg cacaatctgc 3865261 tcgtcgctct ccgatagcgc gtcgaccagt acctgcaggt tcttgacggt gccgaagatg 3865321 tcgccgcgcg aatcccccag ccgcccggcg acctgcgcaa gctcgcgcaa cgcgttgtgt 3865381 aacgagtctc cgttgccgtc aagggtgtcc gcggcctggt tgatcgccgc gcccagcggc 3865441 ccctgcagct cgcccgccgc cggactcagg tcggcggcca accgggtgag cccctctttc 3865501 acctcgtccc attccaccgg caccgcggtg cgatccagat cgatccgacc gttgtcgggc 3865561 agtaccgccc cgccggtata caccggggtg agctgaatga agcgcgccgc caccaaattc 3865621 ggcgacatga tcacggcctg cacgtccacg ggcaccttga cgtccttgga caccgacata 3865681 gtgatcttga cgtcggacga ccgcggctcg atcatgtcga tctcacccac cgggacgccc 3865741 aggacgcgga cctggtcacc gggatagagc ccgacagcag aggtgaagta gcccacgatg 3865801 gtgcgcttat taccggtgga cgagagcacg tacacgccgc ccaccagcgc ggccaccagc 3865861 gcgatcaccg tggcgtagcg caatccccgg ctccccgtca acatggcgac ccggcccatc 3865921 acggcgactt cggtctgatg atccagcgct cctggataaa cccgcgcaag taatcggcga 3865981 ggctatccgg cagcttgccc ggctgaaaga ccaggtcgaa cacggtcgcc accagcggcc 3866041 cgggcagcac gctgtagacg ttgacattga atccgggtcc ggatccgacc acctccccca 3866101 gcgtggtcgc gtacgtgggc agccgcttga gggcctcggt gatatagtcg cggcgctcgt 3866161 tgaggttggc cagcaccagg ttgagcttgc tcaaagccgg gccgaactcc ttacggttgt 3866221 cggcgacaaa gccggaaatc tgcgctgcaa catcgtcgat cccagagatc aacgcgctga 3866281 gcgcggcccg ccgggcatcg agcgccgcaa acaactggtt gccgtcctcg accagcttgt 3866341 tgacctgttc ggcgcgttcg gacaacaccg atgtcaccga cttggcgtgc gccagcaggc 3866401 cttgcagcgc ttcgtcgcga cgattcaggg cgcgcgacag cgacgtcagc ccgtccacgg 3866461 caccacgcac ctgcggggtg gcgtcatgca aggcctgggt gaacacgttc aaggcctgct 3866521 cgaactgcgg cctattcagg tcgttggcgt tgcggcccag atcctgcagc accccgttga 3866581 gcgtgtaggg cgtggtggtc cggctcaacg gaatcgtggt cgacttgccg gagccagccg 3866641 gactgaccgc gatggagcgc tcgccgagga tggtgtcggt gcggatcgcg gccagggact 3866701 ggtcgccgac gacgatgctg cggtccacgc tgaaggtgac ctttgcactg tttccggcca 3866761 gactcacggc cgacaccgcg cccaccttga ggcccgagac ataaaccgag ttaccggggg 3866821 tgatcccacc ggcgtcggtg aaatacgcgt cgtaggtttt gccctgtggc cagaaaggca 3866881 acccgctgta gccgaatgcg atcaggacga cgcagatcac cagcaccagg ccgaagatgc 3866941 cggtgcggag cgggtcgcgt tcgtgtttgc tacttggctt cctatttagc aaaggcgcac 3867001 ctccccttgc tgggatccgg ctggccgccg atcggcagca ggatgtcgct gccggccggt 3867061 ccgttgatct tgatcgtcac cgagcagaag tagatgttga agaatgctcc gtaactgccc 3867121 agcgcggaca ggcgcaggta gtcctcgccg agctgctcga tgtcgttgtt gacctcggcc 3867181 tttcggttgt ccagctcggt agccagcggc cgggcgtttt ccaggatgcc ttgcagcggc 3867241 cggcgcgaat tccgcaacag ttccgtaaga tccgtcgtcg tcgacgccag cggcgaaatg 3867301 gcgcccgcga tcggatcccg gttcttggcc aggccgctga ccagctgctg cagctggtcg 3867361 acactggccg aaaattgcgc gctctttgca tcgacggtcg ccagcaccgc gttgaggttg 3867421 gtgattacct cgccgatcag ctggtcgcgt gcgcccagcg ccgccgagaa ggcaccggtg 3867481 tcggcgagca cgttcgccaa cggaccaccc tggccctgca gcaactcgat gaccgcactg 3867541 gtgatggtgt tgatcttgtc agcgtcaaag cctttcagca ccggccgtag cccacccagc 3867601 aacgcatcga gatccagtgc gggctgggtg tgggccacgt tgatggtgcc acccggcggc 3867661 agcttgcgca gttcacccgg acccgacgtg atctccagga accggtcgcc caccaggttt 3867721 tcgtaccgga tcaccgcacg cgtggacgag tacagcgtgt agctgcggtc gatcgcgaat 3867781 gccacgtcga tgctgtggtc tgggttgagc ttgaccgcct tcactgaacc gaccggcaca 3867841 ccggcgatgc gaaccttctg gcctgccttc agccgcgacg cgtcggtgaa ggtggcgtgg 3867901 tagacggttg tgggaccaaa ccggaagtcc ccgaagacca ccaccagacc ggcggccacc 3867961 agcagcatga ccaccgcgaa gacgctgacc ttgatcacca tcgaccggtg cgagggaacg 3868021 cccgagcccg ccatcagaag tcgtcccgtt ccgcgaacgc accgttgaac aggaactgca 3868081 gcgtcgacgg cgcgtcaacc tgtaactcgg tgaacggctg gtatgggatc aaagcgttgt 3868141 cggtgaccag gaacggcgcg cggtagaacg acccgcccgt ctgcttggtc gggatatcgg 3868201 gcaaccctcg gcagttcgga ccgccggagg cgttgacgat cggcaggctc tccggatagg 3868261 tgtacgacgg cgcacccaac acgaagctcg acgaggtgaa cagcccagcc ttacggacac 3868321 cgattagcgg ggcaaactcc ttgacaccgc gcgcgatgcc cttgaaaagg cagccgaata 3868381 ccggggagta gtcggaggtc actttgagcg gggctcggag ccggttgatg gcgtcgatga 3868441 aattctgttc ggcgggcgcc aacgtctcat aggcgttatt agacagaccg atggtggcta 3868501 gcagcgtgtc gttgaggttg tccttctggt cgacgatcgt cttgttgatc gtcggcaggt 3868561 tatcgaacac ggtgttcagg tccccggcgg cgtcagcata gacgttggcc accaccgccg 3868621 ccttgcggaa atcctcctga agggcgggta actttgggtt cgcttggcgg gtcagcgtgt 3868681 tcagtcccga caacagcgca cccaggtcat cgccgtggcc gcgcaggcct tcggacagcg 3868741 cgctcagcgt cgcgttcgtt tcaagcggat cgatcttgtg tagcaggtcg atgagcgatt 3868801 ggaacaacgt gttgacctca agctgtacct gagacgccgc cacgtgcgca ttcggactta 3868861 gcggcttggg cgacggcgtc tttggcggaa tgaattccac cgatttggcg ccgaagatgg 3868921 tgtttccggc gatgcgcacc gtcgcgttgg aggggataaa acccatctcg ccgctgtcga 3868981 tggccagctt gagccgtgct tggttgccgc tgtagctgat atccgtgacc ttgcctacct 3869041 ggatgccacg gtatttgacc ttggcgccct tctccataac caggccggcc ctcggcgacg 3869101 ataccgtgac ggtgtccgta gacgtgaaag ccgccgtata cgaaagataa gtcagcactg 3869161 cggatcccac catcagcccg gccagcagcg ccgctgccac cctgacactg gtgcgtcgag 3869221 atccgccgcc ggacatgttt cctttctgaa ggtttttacc ccgagaggtt gaagttaccg 3869281 gacgcgccgt agacggcgag cgagatgaac aaggtgatga caacaaccac gatcagcgag 3869341 gtccgtacgg cctgcccgac cgcgaccccg accccaaccg acccgccgct ggcgttgtag 3869401 ccgtagtagg tatgcaccag cattaccgcg atcgacatag cgatggcttg cataaacgac 3869461 cacaacaggt cggaagggat gaggaaggtg ttgaagtaat ggtcataaag gcccgcggac 3869521 tgcccattga cgaacaccgt ggtgaaacga gcggcgaaga acgcggccag caccgacaac 3869581 gaatacaacg gaatgatcgc caccaggccg gcgatcagcc gggttgacac caaataggac 3869641 accgagtgca ccgccatgca ttcgacggcg tcgatctcct cagagacccg catggcaccc 3869701 agctgcgcgg tggctccggc cccgatggtg gccgccagcg cgatacccgc gatcaccggc 3869761 gcgacaacgc ggacgttcaa aaacgccgac aggaacccgg tcaacgcctc gataccgatg 3869821 tcgcccagcg acgaataccc ctgcacggcg atcacgccac cggacgccag ggtcaaaaag 3869881 gccgccaccc cgaccgtgcc gccgatcatg accagcgctc cggcgcccag cgtcatctcg 3869941 gcgaccagcc ggaccgtctc cttccggtag cgggtgatgg cgttgggcac atagcgcatg 3870001 gtttcgccgt agaacagcgc ctgctcaccg aagttgtcga ccggccgctg cagccgcgaa 3870061 aagaaacggc gaaaccggat agtgacgtcg tagctcatcg cttcatcacc atcgctcgct 3870121 caccgtcgtt tgttactgcg ccgagattcg cacacctata gcggtcatga ctacgttgat 3870181 cacgaaaagg cagatgaacg cgtagacgac ggtctcgttg accgcattgc ccaccccctt 3870241 gggcccaccc ttgaccgtca gaccgcggta acacccgacc agcccggcca tgaccccgaa 3870301 cagtagcgcc ttgatctccg ccagtatcaa ttcgcgcagt ccggtgagca cggtcagacc 3870361 gttgataaac gcacccgggt tgacgccctg aagaaagacc gagaacgcgt agccgccgga 3870421 caggccaatg gcgcacacca agccgttgag cagcagcgca accaatgtgg acgccaacac 3870481 cctggggacc acgagccgtt gaattgggtc gatgcccagc acccgcatcg cgtcgatttc 3870541 ctcacggatg gtgcgagcgc ccaggtcggc gcagatcgcc gtggcgccag cacctgccac 3870601 caccagcaca gtcacgaccg ggcccagctg ggtgatggtg ccgaacgccg ttccggcgcc 3870661 ggacaagtcg gcggccccaa tttcacgcaa cagaatgttg agggtgaacg ccaccaggac 3870721 cgtgaacgga atggacacca gcaacgtcgg gactagcgaa acgcgggcca ccatccaggt 3870781 ctggtccaaa aactcgcgga actggaacgg ccgccggaaa gcggcacgcg cggtgtccat 3870841 cgacatttcg aagaacccgc cgacggcccg ggccggaacc gcaagttgtt ggatcaactg 3870901 gggtcccccc gtctactgct cgcggcgaag tctgtgagtc tcctgaacgc gcttagggcc 3870961 cgcacgttgc acggtgtgag ccggcccatc ctaacccaga acgagtttgc ggtgtcaacg 3871021 aaccgcacac cggatcaact gggtcaattt cgctggttaa gccctatgtt ggcgtggtga 3871081 ttcggacacc gattccaata atcggccgcc tatatccacg ggtcactgac gcatcagatc 3871141 ggtcgccgaa aagctctgtt ccggatcccg accagcaaag tagtcccgca gcgtcgcggt 3871201 gagctcggtg ggatcccagg acgtgccgtc cgcgctgaac cggcgctcca tgtgcggcgg 3871261 tgacaccagc gtcacctgcg gaccgtagac gatgaacacc tgaccgttga cttccgcggc 3871321 agccggggac gccagaaact ggaccaggct taccacatgc tgcggcgaca gcgggtcgat 3871381 ctggcccgct tcgacatcgg gtgcggcgcc gaagacatcg gccgtcatcg cggtgcgcgc 3871441 ccgcggacaa atcacattgg cgcaaacgcc gtagcgcccg agcgcccgcg ccgccgacag 3871501 ggttagcgcg gtgatgccag ccttggcggc ggcgtaattg gcctgcccca ccgggcccac 3871561 cagacccgcc tccgacgagg tgttgacgag ccggccgaag accgatcccc cttcggcatc 3871621 cttggctttg tcccgccagt aggcagcggc gttgcgggtg agcagaaaat ggccgcgcag 3871681 gtgcaccgcg atcacggcgt cccactcctc gtcggacatg ttgaacagca tccggtcgcg 3871741 ggtgatgccg gcattgttca ccacgatgtc cagtccgccc agcccgacgg cgctggcgag 3871801 cagttcgtcg gccgtcgcgc gctggctgat atcaccggct accgcgacgg ccttagcacc 3871861 agcatcggca gcggcggcgc cgatctcgtc gacgacgtcg gaagcatcca gggcggaagc 3871921 aacatcgttg acgacgacgg tggcgcccaa ccgggccagg ccgagcgctt cggcccgacc 3871981 caaacccgcg gccgcgccgg tgaccaccgc cacctttccg gacagatcgg tcgtgttcgt 3872041 ggtacgcggc gagcgattgg actcagtcaa tttcaattta tgaatacctc tagttccgtc 3872101 ctactcacca cgcgacaacg ccgcacgcgg gcattccgcg atggcctgct cggccagatc 3872161 ctcctgatca accgggatcg gatcggtctt gaccacggca tagtcctcgt cgtccaggtc 3872221 gaagatatcc ggtgcgattc ccaagcacac cgcgttgcct tcacatcggt ctcggtccac 3872281 gatcacccgc acggcaccct ccttaccctg accatccccc cggtcgctgc tagttccacc 3872341 ataaggccct gctacatccg aggaaacggt cgctggattc agagactaga acgtgttaca 3872401 accgggaaga cggccgggtt gccgttggcg ttggttgtcg acagctagtg gacggctgct 3872461 gacggccagt gataaagacg cgatcattca atcggaggca gctgagatgc gcatcagtta 3872521 caccccgcag caggaggagc tgcgccgcga gctgcgctcg tactttgcca cgttgatgac 3872581 gccggaacgc cgggaggcgc tgagctcggt ccagggtgaa tacggcgtcg gcaatgtcta 3872641 ccgggagacg atcgcgcaaa tgggccgcga cgggtggctt gcgctgggct ggcccaagga 3872701 atacggcggc cagggccgct cggcgatgga ccagctgatc ttcaccgatg aagccgccat 3872761 cgccggtgca ccggtgccgt tcctgaccat caacagcgtg gcgccgacga tcatggccta 3872821 cggaaccgac gagcagaaga ggtttttcct gccccggatc gccgccgggg acctgcactt 3872881 ctcgatcggc tactccgagc ccggcgccgg caccgacctg gccaacctgc gcaccaccgc 3872941 ggttcgcgac ggcgatgact atgtggtcaa cggccagaag atgtggacca gcctgattca 3873001 gtacgccgac tacgtctggt tagcggtacg caccaacccg gagtcttctg gggccaaaaa 3873061 acaccgtggc atatcggtgt taatcgtgcc gacgaccgct gagggcttct cctggactcc 3873121 agtgcacacc atggccggtc cggacaccag cgccacctac tactccgacg tgcgggtacc 3873181 ggtggccaac cgggtcggtg aggaaaacgc cggctggaag ctggtgacca accagctcaa 3873241 ccacgagcgg gtcgccctgg tgtcgccggc accgattttc ggatgcctgc gcgaggtccg 3873301 cgaatgggca caaaacacca aggacgccgg cggcaccagg ctgatcgact cggagtgggt 3873361 gcagctcaac ctggcccggg tacacgccaa ggccgaagtc ctcaagctga tcaactggga 3873421 gctggcttcc tcgcaaagtg ggccgaagga cgctggaccg tcaccggccg atgcgtcggc 3873481 ggccaaggtg ttcggtaccg agctggccac cgaggcctac cggctgctga tggaggtgtt 3873541 gggcactgcg gcgaccctgc gccagaattc gccaggcgcg ttgctgcgcg gccgcgtcga 3873601 acggatgcac cgggcgtgcc tgatcctgac gttcggcggc ggcaccaacg aagtccagcg 3873661 cgacatcatc ggcatggtcg cgctgggact gccgcgagcc aaccgctgag cggacctgag 3873721 aggacaagac gtcatggatt tcacgacaac cgaagccgcc caggatcttg gtggtctggt 3873781 cgacaccatc gtggacgcgg tgtgcacgcc ggagcatcaa cgtgagctgg acaagctcga 3873841 gcagcggttc gaccgcgagc tgtggcgcaa gctgatagac gccggcatcc tgtccagtgc 3873901 ggcgccggag tcgctgggcg gcgatggctt cggcgtgctc gagcaggttg cggtgctggt 3873961 ggcgttgggg catcaactgg ccgcggtgcc gtacctggag tcggtggtgc tcgccgccgg 3874021 cgccctggcc cggttcggct cgccggaact gcagcagggc tggggggtgt cggcggtctc 3874081 cggcgatcgg atcctcaccg tcgccctcga cggtgagatg ggcgagggtc cggtgcaggc 3874141 cgccggcacc ggacatggct accgcctcac cggcacacgc acccaggtcg ggtacggccc 3874201 ggtggccgac gcatttctgg tacccgccga aaccgattcc ggtgcagccg ttttcctggt 3874261 tgccgccggc gacccagggg ttgcggtgac cgcactggcc accaccggac tgggcagcgt 3874321 cggacacctc gagctaaacg gggccaaagt ggacgccgcc cgcagggtcg gcggaaccga 3874381 tgtcgtggtt tggctcggca cgctttccac cctgagccgc accgcttttc aactcggtgt 3874441 gctcgagcgc ggactgcaaa tgacggccga atatgcgcgc acccgtgaac aattcgaccg 3874501 cccgatcggc agcttccagg cggtggggca acggttggct gacggctaca tcgacgtcaa 3874561 gggattgcga ctgacgctta cccaggcggc ctggcgggtg gccgaagatt ccctggcaag 3874621 ccgggagtgc ccccagccag ccgacatcga cgtcgccacc gcggggttct gggccgccga 3874681 agccgggcat cgggtggcgc ataccatcgt gcatgtgcat ggcggcgtcg gcgtcgacac 3874741 cgatcatccc gtacaccggt atttcctggc cgccaagcag accgagttcg cgttgggcgg 3874801 cgccaccggt cagctccgcc gaatcggccg tgaactggcg gaaacccctg cctagccctg 3874861 cctagcccgg cgacgatgcg gtccgcgcag cggaccgaga aggagcgggc gaatcgaacc 3874921 caccgatgac tcccactcac ccgaccgtca ccgaacttct gctgccgcta tccgaaatcg 3874981 acgatcgggg cgtctatttc gaggactcgt tcaccagttg gcgcgaccac atccggcacg 3875041 gtgccgcaat cgccgcagcg ctgcgggaac gcctggaccc ggcgcggccg ccacacgtcg 3875101 gtgtgttact gcagaacacg ccgttcttct cggcgacact ggtggccggc gcgctgtcgg 3875161 ggatcgtccc ggtgggcctc aacccggtgc gccgcggcgc ggcactggcc ggcgacatcg 3875221 ctaaagccga ctgccagttg gtgctcaccg gctcgggatc ggcggaggta ccggccgatg 3875281 tcgagcacat caatgtcgac tcccccgaat ggaccgacga ggtggccgca caccgggata 3875341 ccgaggtgcg ttttcgatcc gcggatctcg cagacctttt catgctgatc ttcacctcgg 3875401 gcaccagcgg cgacccgaag gcggtgaagt gcagccaccg caaggttgcg atcgccggcg 3875461 tgacgatcac gcagcgcttc agtctgggcc gcgacgacgt ctgctacgtc tcgatgccgt 3875521 tgttccattc caacgcggtg ctggtcggct gggcggtggc tgcggcctgc caaggctcaa 3875581 tggcgttgcg acgcaaattt tcggcgtcgc agttcctggc cgacgtccgc cgttatggcg 3875641 ccacttacgc caactacgtg ggcaagcctc tttcgtatgt gcttgcgaca ccggagcttc 3875701 ccgacgacgc ggacaacccg ctgcgggcgg tgtacggcaa cgagggagta cccggtgaca 3875761 tcgaccgttt cgggcgcagg ttcggctgcg ttgtcatgga cggcttcggc tcgactgaag 3875821 gcggggtggc gatcacgcgg acactcgaca ccccggcggg cgccctgggc ccactgccgg 3875881 ggggaatcca aatcgtcgac cccgacaccg gcgaaccgtg cccgacagga gtggtcggcg 3875941 aactggtcaa caccgccggg ccgggcggtt tcgaaggcta ttacaacgac gaggccgccg 3876001 aggccgagcg gatggccggc ggcgtctacc acagtggcga cctcgcctat cgcgacgacg 3876061 ccggctacgc ctatttcgcc ggtcggctcg gcgactggat gcgagtcgac ggtgaaaatc 3876121 taggcaccgc accgatcgag cgggtgctga tgcgctaccc ggacgccacc gaggtcgctg 3876181 tgtatccggt acccgatccg gtggtgggtg atcaggtgat ggccgcgtta gtgttggcgc 3876241 ccggcaccaa attcgatgcc gacaagttcc gggcgtttct gaccgagcag cccgacctgg 3876301 ggcacaagca gtggccgtcg tatgtgcggg tcagcgcggg gctgccgcgc accatgacct 3876361 tcaaggtgat caagcgccag ttgtcggccg aaggtgtcgc ctgcgccgat ccggtgtggc 3876421 cgattcgccg gtagcctcac ggcgcgccac catgctcacc gggatctggc cggatggtgg 3876481 acccgaataa tcgggtagaa ccgccgaatg agctgcccgg atcgcgatac gatccattcc 3876541 tagcaattgc accgatgatg cacggccgcg gccgggttcg gcttgggctg gtgcgaggta 3876601 ccggatgtcg tttgtgttgg tttcgccgga gaccgtggcg gcggtggcca cggatctcaa 3876661 gcgcatcggc gcctcgctgg cccacgaaaa cgcgtcggcg gccgcttcga cgacggcggt 3876721 ggtctccgcg gccgccgacg aggtatcgac ggcggtcgcc gctctgttct cccaacacgc 3876781 ccagggctac caagcggcgg ccgctcaggt agcagcgttt catagccggt ttgtgcaagc 3876841 cctgacggcc ggtgccgggg cgtacgcatt tgccgaggcg gccaacgcgt cgccgctaca 3876901 gtcagccatg ggtgcggtaa gcgcgtctgc gcagacgctg ttgtcgcgcc cgttgatcgg 3876961 caatggcgcc aatgcgacga cgccgggcgg taacggcggc gacggcggat ggctattcgg 3877021 cagcggcggc aacggcgcgc ccggcgcggc gggccagtcc ggcggtaacg gcgggtcagc 3877081 cggactgtgg ggtaacggcg gcgcgggtgg cgccggcggc agcggcggcg ccgccggcgg 3877141 caacggcggt aacggcgggt ggctgttcgg cgccggcggc accggcggta tcggcggcac 3877201 cggtgctccc ggcgccatgg gcggcaccgg cggcaacggc ggcaacggcg cgctgctgat 3877261 cggcggcggc ggcctcggcg gcgccggcgg catgggtggc accggcggcg gcaccggcgg 3877321 caccggcggc aacggcggca acggcgcgct gctgatcggc gctggtggtg tcggaggtgc 3877381 tggcgggatc ggtggccagg gtaccggcgc cggcggtgcc gccggcgccg gcggcaccgg 3877441 gggcaacggc ggcgccgggg ggttgttcat gaacggcggc gacggcggcg ccggcggtca 3877501 aggcggcgac ggtgcggccg gcgacgcggc tgccagcgcc ggcggcaccg gcggcaaagg 3877561 cggccaaggc ggcgacggcg gcaccggagg ggccggcggc gcaggcccag tgctgttcgg 3877621 ccacggcggc gccggcggca tgggcggcca aggcggcacc ggtggaatgg gcggcgccgg 3877681 cggagacggc accaccgtca tcgcggccgg taccgggggg gagggcggca ccggcggcac 3877741 cggcggtacc ggcggcaacg gcgctgacgc cgctgctgtg gtgggcttcg gcgcgaacgg 3877801 cgaccctggc ttcgctggcg gcaaaggcgg taacggcgga ataggtgggg ccgcggtgac 3877861 aggcggggtc gccggcgacg gcggcaccgg cggcaaaggt ggcaccggcg gtgccggcgg 3877921 cgccggcaac gacgccggca gcaccggcaa tcccggcggt aagggcggcg acggcgggat 3877981 cggcggtgcc ggcggggccg gcggcgcggc cggcaccggc aacggcggcc atgccggcaa 3878041 cacaggtgac ggcggcgacg gcgggaccgg cggtaacggc ggcaacggca ccggaggcgt 3878101 gaacggcgcc gacaacaccc tcaaccccga cacccccggc ggcgccgggg agcccggcgg 3878161 ggccggcggg gccggcgggg ccggcggggc cgccggcggc ccgggcggta ccggcggtac 3878221 cggcggcaac ggcggcaatg ccggcaacaa cagcaccaat gccccagtcg gtggcgaagg 3878281 cggcgccggc ggcgacggcg gcgccggcgg cgcaggcggg gccgccaacg gcggcaccgc 3878341 gggcagccag ggcactgggg gcgtcggcgg cgacggcggc gcgggcggca acggcggcgg 3878401 cggcaaggct ggcaccggca acagcggcaa ctttggggtg gacggcgaag ccggcttcag 3878461 cggcggcgcc ggtggcaacg gcggcgtagg cggggccgcc ggcgccaatg gcggaaccgg 3878521 cggcagcggt ggtaatggcg gtgacggcgg tgcgggaggc attggcgggg ccggcggcaa 3878581 cggcataccg ggcactggca cagagcctgc cgggggcacc ggcgccaaag gtggagacgg 3878641 cggcgacggt ggcgccggcg gcgcaggcgg caatgccggc ggggccggcg gtaacggcgg 3878701 ggccggcggc cagggcggca atgccggcca gggtggcgcc ggcggtgcgg gcggcaacgc 3878761 cgtgattccc ggcgacggcg tcgggaaggc gccgcacggc ggcgcgggcg gcagcggcgg 3878821 agacggcggc aaaggcggcc agggcggtag tggcggcacc ggcggatccg gtgccccgat 3878881 cggtggcggc gccggaggca ccggagggtc cggcggacac gccggcaagg gtggcgccgg 3878941 cggcatcggc gcacagggca ccaccatcac cgtgcccggg aacggcggca acgccggcga 3879001 cggcggcaac ggcggtggcg gcggcgcggg cggcaccggc ggcgacggcg ccaccggcac 3879061 gccagccggc aacggcggca acggcggcaa cgccggcgac ggcggcaacg gcggctccgg 3879121 cgacttcggt ggcaatacca ccagcggcgc ctccggcagc ggcggcaacg gcggcaacgc 3879181 cggcaccgcg ggtagcggcg gtgcgggcgg aaccggcggc accggcctta gcggcggcaa 3879241 cggcggcaac ggcggtgacg gcggtaacgg cgcccacggc accgtcggcg cccagttcgt 3879301 cccggccacc agcttgccca cacccaacgg cggggccggt ggcaacggtg gcaccggaag 3879361 caacggcggc gcgcccggcc ccgccggggc gcccggcccc actaccggcg gtaacgctgg 3879421 cagccagggc atcggcggcg acgggggcaa cggcggcgac ggcggtaaag gcggtgacgg 3879481 cgccgacgct gtcaacgtcg tattcatgcc gactgagcca caggccgcga ccggcactgc 3879541 cggcagcgcc ggtgacccca ccggcggtaa cggagggccc ggcactcccg gcagccccat 3879601 ggttgccccg cccccgccaa cgccaatcac tcaagtccaa cagggcggtg acggtggcgc 3879661 cgggggcacc ggatccacca acgccaacga cggcacagcc accggcggaa agggcggaga 3879721 aggcggagtc ggcagcattc tcggcgggcc cggcggcaac ggcggaactg gcggcaacgc 3879781 ctcggcaacc ggcaccaacg gggtggccaa cgccgggaat ggcggcaagg gtggcgacgg 3879841 cggccagttt ggggccggcg gcaacggtgg tgccggcggc agcgtaaccg acggatccgc 3879901 cggcagcacc gcaggcaacg gcggcaacgg cggcaacgca accaacggca ccatcgcagg 3879961 ccaacccgcc ggcggcaacg gctcggccgg cgggaaaggc ggcgacggcg gcaacatcgc 3880021 cgccggtgcc accggcaccg ccggcaacgg cgggaacggc ggcaacggca acgacggcgc 3880081 cgtcaacgcc ggcaccggcg gctccggcgg gaacggcggt aacgccggtg gcggcggcgc 3880141 caatggcggc gacggcggcg ccggcggcgc cggcggggcc ggcgggcgtg gcggcaaggg 3880201 catcgacggc gggttcggcg gtgacggcgg caacggcggc agcaacaacg gcaccggcgc 3880261 cggtggcaac ggcggcaacg gcggcaccgg cggggtcggc tcggttggcg cggctggtgg 3880321 cgatggcggc aacggcggca ccggaggctt cgccggtttc ggcggcaccg caggcaatgg 3880381 cggttccggc ggcacgggcg gggccggcgg cgacggcggc accggcgggg gcggcggcaa 3880441 cggcggcacc ggcgttatcg ccggcggcgg ggggaccggc ggcaacggcg gcgccagcgg 3880501 ggccggcggc gccggcggca cgggcgggtt cgccggcaac ggcaatgccg gcggcaatgg 3880561 cggcaccggc ggcgcgagcg aggacggcga caacggcaac gctggcagcg gcgccaccgg 3880621 cggtaccggc ggcaacggcg gcaccggcgg cgacggcggc gctgccgggc tgggcggcgt 3880681 cgcgtgaggt tgaccggcga tcaccgtagc cagcacggcc cgtgacaccg gtccggcacg 3880741 ccaccctcgt cgttcaggtg gtgtcgccac tcgcgctaca caacgcttca cggcactcgt 3880801 cgagacttat gctcgagttc tgatacgtgg agcaactgtt ttggcgttcg acccgtattg 3880861 cgcaggtggc ggtactggaa aacgtagacg tgttgggcgg gtgacgaata agatcctggc 3880921 ctaactactg cgtcaattat gccgcggtgg ccgcgccgtc cggttgggag ttcgcccatg 3880981 tcgttcgtgt tgatcgcacc ggaattcgtg acagcagccg cgggggatct gacgaatctg 3881041 ggttcgtcga ttagcgcggc caacgcgtcg gcagccagtg cgaccacgca ggtgctggct 3881101 gcgggcgccg atgaggtgtc tgcccgtatt gcggcgctgt tcggcgggtt tggcctggag 3881161 taccaggcga ttagtgcgca ggtggcggcc taccaccagc ggtttgtgca ggccttgagt 3881221 accggcgcgg gcgcatatgc ctcggccgag gccgccgccg ctgagcagat cgtgctgggc 3881281 gtgatcaatg cgcccaccca ggcgctgctg gggcgcccgt tgatcggtga cggcgccaat 3881341 gcgacgactc ccggcggggc cggcggggcc ggcggtctgc tgttcggcaa cggcggggcc 3881401 ggggcagccg gggcgcccgg ccaggccggc gggcctggcg ggcccgccgg attgtggggc 3881461 aacggcgggc ccggcggggc cggcggcagc ggtgggggca ccggcggtgc cggcggcgcc 3881521 ggtgggtggc tgttcggggt tggcggcgcc ggcggtgtcg gtggggccgg tggcggcacc 3881581 ggcggggcgg gcgggcccgg tggtttgatc tggggcggcg gcggggccgg cggtgtcggt 3881641 ggggccggtg gcggcaccgg cggggccggc ggccgcgccg agctgctgtt cggcgccggc 3881701 ggtgcgggcg gcgcgggtgg ggcgggcacc gacggcgggc ccggtgctac cggcgggacc 3881761 ggcggacacg gcggagtcgg cggcgacggc ggatggctgg cacccggcgg ggccggcggg 3881821 gccggcgggc aaggcggggc aggtggtgcc ggcagcgatg gtggcgcgtt gggtggtacc 3881881 ggcgggacgg gcggtaccgg cggcgccggt ggcgccggcg gtcgcggcgc actgctgctg 3881941 ggcgctggcg gacagggcgg cctcggcggc gccggcggac aaggcggcac cggcggggcc 3882001 ggcggagatg gcgttctggg gggtgtcggt ggcactggtg gtaagggcgg tgtcggcggc 3882061 gtggctggcc tcggcggggc cggtggtgcc gcgggccagc tcttcagcgc cggaggcgcg 3882121 gcgggtgccg ttggggttgg cggcaccggc ggccaaggcg gcgccggcgg catgggtggc 3882181 tccggtgctg ataatgccag cgggattggc gccgacggcg gcgcgggtgg gactggcggt 3882241 aacgccggcg ccggcggggc cggcggggcc gccggcaccg gaggaaccgg cggggttgtc 3882301 ggcgccgcgg gcaaggccgg tatcggcggc accggcggcc aaggcggcgc cggcggcgcg 3882361 ggcagcgccg gcacggatgc gaccgctacc ggtgccaccg gcggcaccgg gttttccggt 3882421 ggagccggcg gggccggcgg ggccggcggc aacaccgggg ttggcggcac caacggctcc 3882481 ggcgggcaag gcggcaccgg cggcgcgggc ggcgccggtg gtgctggcgg tgtcggcgcc 3882541 gacaacccca ccggcatcgg cggcgccggc ggcaccggcg gcgccggcgg caccggcggc 3882601 accggcggag cggccggagc cggcggggcc ggcggagcgg ttggcaccgg tggtaccggt 3882661 ggcgttgtag gcgacgttgg taacgcaggg atcggcggca ccggcgggaa aggcggcgcc 3882721 ggcggcaccg ggttcgccgg tggcgccggc ggggccggcg ggcagggcgg tagcagcggt 3882781 gccggcggca ccaacggctc tggtggcgct ggcggcaccg gcggacaagg cggcgccggg 3882841 ggcgctggcg gggccggcgc cgataacccc accggcatcg gcggcgccgg cggcaccggc 3882901 ggcaccggcg gagcggccgg agccggcggg gccggtggcg ccatcggtac cggcggcacc 3882961 ggcggcgcgg tgggcagcgt cggtaacgcc gggatcggcg gtaccggcgg tacgggtggt 3883021 gtcggtggtg ctggtggtgc aggtgcggct gcggccgctg gcagcagcgc taccggtggc 3883081 gccgggttcg ccggcggcgc cggcggagaa ggcggagcgg gcggcaacag cggtgtgggc 3883141 ggcaccaacg gctccggcgg cgccggcggt gcaggcggca agggcggcac cggaggtgcc 3883201 ggcgggtccg gcgcggacaa ccccaccggt gctggtttcg ccggtggcgc cggcggcaca 3883261 ggtggcgcgg ccggcgccgg cggggccggc ggggcgaccg gtaccggcgg caccggcggc 3883321 gttgtcggcg ccaccggtag tgcaggcatc ggcggggccg gcggccgcgg cggtgacggc 3883381 ggcgatgggg ccagcggtct cggcctgggc ctctccggct ttgacggcgg ccaaggcggc 3883441 caaggcggtg acggtggcag cgccggcgcc ggcggcatca acggggccgg cggggccggc 3883501 ggcgacggcg gcgacggcgg ggacggcgca accggtgccg caggtctcgg cgacaacggc 3883561 ggggtcggcg gtgacggtgg ggccggtggc gccgccggca acggcggcaa cgcgggcgtc 3883621 ggcctgacag ccaaggccgg cgacggcggc gccgcgggca atggcggcaa cgggggcgcc 3883681 ggcggtgctg gcggggccgg cgacaacaat ttcaacggcg gccagggtgg tgccggcggc 3883741 caaggcggcc aaggcggcct gggcggggca agcaccacct cgatcaacgc caacggcggc 3883801 gccggcggca acggcggcac cggcggcaaa ggcggcgccg gtggtgcggg aaccctgggc 3883861 gtcggcggct ccggcggcac cggcggggac ggcggcgatg cgggcgctgg tggtggcggc 3883921 ggcttcggcg gggccgcggg taaggccggc ggcggcggaa acggcggtgt tggcggtgac 3883981 ggcggcgagg gagccagcgg tctcggcctg gacctctccg gctttgacgg cggccaaggc 3884041 ggccaaggcg gggccggcgg caacgccggc gccggcggca tcaacggggc cggcggcacc 3884101 ggcggcaccg gcggggccgg tggtgacggc gccccggcga ccctgatcgg cggacccgac 3884161 ggcggtgacg gcggccaagg cggcatcggc ggggacggcg gcaacgccgg attcggcgcc 3884221 ggtgttcccg gcgacggcgg gatcggcggc accggcgggg ccgggggcgc cggcggggcc 3884281 gggggcgccg gcgacgccgg cgccgacggg gaccccagca ttgacggcgg ccaaggtggt 3884341 gccggcggcc acggcggcca aggcggcaaa ggcggcctga acagcaccgg gctagccagc 3884401 gccgccagcg gtgacggcgg caacggcggg gccggcgggg ccggcggcaa cggcggcgac 3884461 ggcgacggct ttatcggcgg gtccggcggc accggcggga ccggcggcga cgccggcgcc 3884521 ggcggcctgg ccaacaccgg cggaaccgcg ggcaacgccg gtatcggcgg ggccggcggc 3884581 cgcggcggcg acggcggggc cggcgacagc ggcgccctct cccaagacgg caacggcttc 3884641 gccggcggcc aaggcggcca aggcggggcc ggcggcaacg ccggcgccgg cggcatcaac 3884701 ggggccggcg gcaccggcgg caccggcggg gccggtggtg acggcgcccc ggcgaccctg 3884761 atcggcggac ccgacggcgg tgacggcggc caaggcggcg gcgcgggatt cggcagcggc 3884821 gtagccggcg ccgccggcgc cggcggcaac ggcggtaagg gaggtgacgg cgggaccggc 3884881 gggaccggcg ggactaactt cgctggcggc caaggcggtg ccggtggccg aggaggtgct 3884941 ggcggcaatg gcgccaacgg cgttggcgac aacgccgccg ggggcgatgg cggcaacggc 3885001 ggagccggcg ggctcggcgg gggcggtggc acaggcggca ccaacggcaa cggcggcctc 3885061 ggcggaggcg gcggcaacgg cggagccggc ggtgccgggg gaacgcccac cggcagtggc 3885121 accgagggga ccggcggcga cggtggagat gccggcgccg gcggcaacgg cggctctgcc 3885181 accggcgtcg gtaacggcgg taacggcggt gatggcggca acggcggcga cggcggcaac 3885241 ggcgcacccg gcggcttcgg tggcggcgct ggcgccggcg gcttgggcgg ctccggcgcc 3885301 ggcggcggca ccgacggcga cgacggcaac ggcggcagcc ccggcaccga cggcagctaa 3885361 gctaacggca gcccaaagcg ccagcagcca cccgacaacg ctgggcggct acccatggcc 3885421 cgttggcagc acaggctggc gatggccgtc cgaccgataa cacccgggcc atcgcatccc 3885481 cagcacaacc agctgtcctc gcgggcttat gcacgacggg ggagcactac cccacaagcg 3885541 atggcaccac tacatcgatc agatgcggcc cgggctcggc gaaggccgcg cgcagggcgt 3885601 cggcgaattc ctcgcaggtg gtgacacgac gtgcaggaac acccatacct tcggcgatct 3885661 tgacgaaatc cattgtggga cgcgatatat caaggagatc cagggccttc gggccaggat 3885721 ccgaccccgc gccgacacgt tgcagctcga tccgcagaat gtcgtaggcg ccgttgttgt 3885781 agatgacggt ggtgacgtcg aggttctccc gcgcttggct ccacaatcct gaaatcgtgt 3885841 acattgccga cccgtcggat tccaggcaca acaccgggcg gtcgggcgcg gcgaccgcgg 3885901 caccgaccgc agccgggatg ccgtaaccga ttgccccgcc ggtcagcgta agccagtcat 3885961 gggccggggc cccggcggtg gcctgcggca gcaggacacc acaagtattc gactcgtcga 3886021 caacaatcgc ccgttccggc agcaacgcac cgaccacatc ggccgccgac accgacgtca 3886081 ggtcacccgt cggcagctgc ggacgtgacg cgcccgccac cggggcaacc gtcccgggcg 3886141 ctacctcgtc ggccaacgcg gccagtgcgt cggccgcacc accgggttcg gcaagcacgt 3886201 gcacctcaca accggccggc accaggtcac tgggcatacc cgggtaggcg aaaaacgaca 3886261 ccggcgacct ggccccggcc agcacgagat gtttgacccc gtccagctgg gccgcggcac 3886321 cttcagcgaa ataggccagc cgttcgacgg cggggatacc ggcgccacgt tccaggcacg 3886381 tcggaaacgt ctcgcataac caacgggccc cggttgcctg cacgatccgc gcagccgcgg 3886441 tcagccccgg cccgcgggtg gcatccccac cgatcagcat catggcgggt tcccctgagc 3886501 gcagcacccc agccaccggc cccacgtcca ctggcgccgc cgccgcctga gccggcacgc 3886561 ccgcggccgc gtgggcaccg tcgctccaac acacatccgc gggcagaatc agcgtcgcga 3886621 tctgtgaacc tgaccggctg gccgcaatgg ccgcttcagc gtcggccccg acgtcggcgg 3886681 cagcctccgt ccggcgcacc catcccgaaa cggtgccagc gaccgcatcg atatcggatt 3886741 ccagcggggc gtcgtacttc ttgtggtaag tcgcgtggtc tccgacaacc accaccatcg 3886801 gcacccgggc acggcgcgcg ttgtgcaggt tggccaggcc gttgcccagt ccggggccca 3886861 gatgcagcag caccgccgcc ggccggccag caatgcgggc ataaccgtcg gcggccccgg 3886921 tagccacgcc ttcgaacagg gtcagcatgc cacgcatgcg cgggacggcg tctagcgccg 3886981 ccacgaaatg catttccgac gtgccagggt tggcgaagca cacatcgaca cccccgtcga 3887041 ccagggtgtt gatcagggcc tgagcaccgt tcacgtctgc acctttcctc gtgggtccag 3887101 cttgaatacc cgcacagcgt tgccgtgcag aaagtcgcga cgagcttcgt cgcttagccc 3887161 cagttcgtca agaccggtca gggcgtgcgt gtgggcgatc atcgggtaat tggtaccaaa 3887221 cagcaccttg cgctgtcccg tgtcggtttt catgaaccgc accagcttcc cgggcagccg 3887281 cttgatggtg taggccgagg tgtcgatgta gacattctcg tgtttgcggg cgaccgcgac 3887341 catctcctcg gtccacggat agccgacatg tccgcacacg atcaccagtt ccggaaagtc 3887401 caacgccacc tggtcgatgt agggaatggg gcgtccggtc tccgacggcc gcagcgggcc 3887461 ggtgtgacca acctgggtgc agaacggcac cgcggactgc acgcattcgg cgaacaacgg 3887521 atagtagcgg cggtcggtcg gcggggcgcc ccatagccaa ggcaccaccc gcaggccgac 3887581 gaacccctca ccgactcggc gcctcaactc ccggacggcc gccatcgggc gatccaggtc 3887641 gaccgccgcc agaccggcaa aacggttggg gtacaaccgg acccattccg caacagcgtc 3887701 attagagatg aggtcctggc cgttggggcc acgccaggcg ctgagcaaac ccagggtgac 3887761 gccgccggcg tccatcgagg agacggtcgc ttcgatcggg atgtcggtct ccgggataga 3887821 cccaccggtc caccggcgca gcgaggcgaa catatcgccg tgtaggaacc gttgcgtcgg 3887881 atgctgcatc cacacatcga tggtcatcgc gtttcagact gtagccgccc gggcggcgac 3887941 tacccgcggc gacgctgcag atcatcgccc ggccagggtg ctaccaggtt gctgccatcc 3888001 ccgaatgttc gcggtcggag ggcgacgcga cgtgttgaaa cgccgtacgt tcgggccttc 3888061 ccgcgagaag ccctagccgc ccgagattgt ccctcccggc gttcgtggcc acgcggtgct 3888121 tcgccttttt gcccatccca aattacacgg gtggtactca cgagaaagct tggacgtatt 3888181 gggcgggtgc tgaattatga tcccgacaca actgcatcaa tttagccgcg tcgtgatgct 3888241 atccgccgac ggtttggagc tggtccgtgt cgttcgtgtt gatctcaccc gaagttgtgt 3888301 ccgccgccgc cggggatcta gcgaacgtgg gatcgacaat cagcgccgcc aacaaggcgg 3888361 cagcggctgc gaccacgcag gtgctggccg cgggcgccga tgaggtgtca gcgcgcatcg 3888421 cggcgctgtt tggtatgtac ggcctggaat atcaggcgat cagtgcgcaa gttgccgcgt 3888481 atcaccagca gttcgtgcag acgttgcgca ccggagcggc ctcgtacatg ttggccgagg 3888541 ccaccaacgt cgagcaaaat ctactgaacc tcatcaacgc gccgacccag acgctgctcg 3888601 ggcgcccgct gatcggagac ggggccaacg cgacgacgcc gggcggggcc ggcggagacg 3888661 gcgggctgct gtttggcagc ggcggcaacg gcgcgcccgg tgcacccggc caggctggcg 3888721 gtgccggtgg gtctgccggg ctactgggca acggcgggag cggcggagcc ggcgggacgg 3888781 gcgcgcccgg cggaaacggc ggcaatgccg gttggctata cggccgcggc ggagtcggcg 3888841 gcgccggggg aatcggcggc ggaacaggcg gggccggcgg gcacgcgtgg ctgttcggcc 3888901 acgggggaac cggcggtatc ggtggcgggc ccggcggcaa cggcgggtgg ctgctcggca 3888961 acggcggaca tggcggcgct ggcggaatcg gtggcggcag cggcggcgct ggcgggaacg 3889021 gcgggtggct gctcggcaac ggcggtatcg gcggagcggg cggaaccggc ggcggagcgg 3889081 gcggcaccgg tggcaacgcc gcgtggctgc tcggcggtgg tggtaccggc ggcgccggcg 3889141 gaatcggtgg tggcaacggc gggcacggcg gcaacggcgg gtggctgctc ggcaacggcg 3889201 gcaacggcgg cctcggcggt gacggtgacg gcggtactgg cggcggccac ggcggcaacg 3889261 gcgggaatcc cgggtggctc ttgggcacag ccgggggtgg cggcaacggt ggcgccggca 3889321 gcaccggtac tgcaggtggc ggctctgggg gcaccggcgg cgacggcggg accggcgggc 3889381 gtggcggcct gttaatgggc gccggcgccg gcgggcacgg tggcactggc ggcgcgggcg 3889441 gtgccggtgt cgacggtggc ggcgccggcg gggccggcgg ggccggcggc aacggcggcg 3889501 ccgggggtca agccgccctg ctgttcgggc gcggcggcac cggcggagcc ggcggctacg 3889561 gcggcgatgg cggtggcggc ggtgacggct tcgacggcac gatggccggc ctgggtggta 3889621 ccggtggcag cggcggcacc ggcggtgacg gcggcgcccc cggcaacggt ggcgccgggg 3889681 gtgccggcca gttgttgagc catagcggcg tggccggtgc tagcggcaaa ggtggtgccg 3889741 gcggcaccgg cggcaacggc ggggccggca gtgccggcgc cgacgccccc gcaggctccg 3889801 gcgcgatggg tagcactggc tttgctggcg gcgccggcgg tgacggcggt aacggcggcg 3889861 ggagcggtgc cagccaaggc aacggcggca acggcggcaa cggcggcacc ggcggcaaag 3889921 gcggcaccgg cggggccggc atgaacagcc tcgacccgct gctagccgcc caagacggcg 3889981 gccaaggcgg caccggcggc accggcggca acgccggcgc cggcggcacc ggcttcaccc 3890041 aaggcgccga cggcaacgcc ggcaacggcg gtgacggcgg ggtcggcggc aacggcggaa 3890101 acggcgcaga caacaccacc accgccgccg ccggcaccac aggcggggcc ggcggggccg 3890161 gcggggccgg cggaaccggc ggaaccggcg gagccgccgg caccggcacc ggcggccaac 3890221 aaggcaacgg cggcaacggc ggcaacggcg gcaccggcgg caaaggcggc accggcgggg 3890281 ccggcatgaa cagcctcgac ccgctgctag ccgcccaaga cggcggccaa ggcggcaccg 3890341 gcggcaccgg cggcaacgcc ggcgccggcg gcaccggctt cacccaaggc gccgacggca 3890401 acgccggcaa cggcggtgac ggcggggtcg gcggcaacgg cggaaacggc gcagacaaca 3890461 ccaccaccgc cgccgccggc accacaggcg gggccggcgg ggccggcggg gccggcggaa 3890521 ccggcggaac cggcggagcc gccggcaccg gcaccggcgg ccaacaaggc aacggcggca 3890581 acggcggcaa cggcggcacc ggcggcaaag gcggcaccgg cggggccggc atgaacagcc 3890641 tcgacccgct gctagccgcc caagacggcg gccaaggcgg caccggcggc accggcggca 3890701 acgccggcgc cggcggcacc ggcttcaccc aaggcgccga cggcaacgcc ggcaacggcg 3890761 gtgacggcgg ggtcggcggc aacggcggaa acggcgcaga caacaccacc accgccgccg 3890821 ccggcaccac aggcggggcc ggcggggccg gcggggccgg cggaaccggc ggaaccggcg 3890881 gagccgccgg caccggcacc ggcggccaac aaggcaacgg cggcaacggc ggcaacggcg 3890941 gcaccggcgg caaaggcggc accggcggcg acggtgcact cgcaggcagc agcggtggtg 3891001 ccggcggtaa aggcggcaac ggcggcgacg ccggcaaggc cggtaccggc tccgctcctg 3891061 gcacggcggg gaccggcggc gatgggggta agggcggcaa cggcggcatt ggcgctgccg 3891121 gcacaaccgg ccccgtaggc accggcgcgt ccggcggcac cggtggtagt ggtggcgccg 3891181 gcggaaccgg cggtgacggc ggcgccgcca acggcggcac cgccggggct ggcggggcgg 3891241 gcggcaatgg cggcaaaggc ggcgacggtg gagcaggcgt caccagcagc accgccggca 3891301 acagcggcgg cgcgggcggc agcggcggaa agggcggaga cgcgggcgcg ggcggcgccg 3891361 gtgccactcc gggcgccaac ggtatcgctg gcaatggcgg cgacggcgga gatggcgcgg 3891421 ctggtgccgt cggcatctcc ggcgcaaccg gcgctggcga cggcgggcat ggcggaaccg 3891481 gcggggccgg cggcaacggt ggaaccggcg gtgctggcgg tagcggcatc gacggcgtcg 3891541 gcggcgggac cggaggtacc ggcggcaacg gcggcaacgg cgccatcggc ggcgctggcg 3891601 gagacgccgg tggtagcgga aatagcggcg gaaacggtgg gactggcgga aagggcggaa 3891661 acgccggtgc cggtggtgcc gcgggcagca acggcggtac cgtcggcgcc aacggtaccg 3891721 gcggcgacgg cggcaacggc ggcgctgccg gggccgccac ggctggcagc aacggtgggg 3891781 ccggcaccgg ctcggccggc ggcaacggcg gcaccggcgg cagaggcggc agtggtggcg 3891841 ccggcggcga cggtatcggt ggcgtcggcg gcggcaaggg cggcaacggc gcggacggcg 3891901 aagtcggcgg tgcgggcggc gccggcggca gcgggcccaa caccagtccc ggcggcaacg 3891961 gcgggcaagg aggtcaaggc ggcagcggtg gtgccggtgg ggcggccggg gctggcggcg 3892021 cgggtggcgg cgctaacggc accgctggca acggcggcca aggcggtgcc ggcggcaccg 3892081 gcggcgccgg cgcagcctcc tcagctacca acggcggcag cggcggcgcc ggcggcaccg 3892141 gaggcgccgg cggcaccggc ggcgctggtg gcgatggcgt cggcggcgcc ggcggcggca 3892201 atggtggcca cggcggtgac gccggagacg gtggcaacgg cgccaacggc aacaaccgca 3892261 gttccggctc cttcctcgca gccggcggca ccggcggggc ggccggcgac ggcggacaag 3892321 gtggccaggg cggcgccggc ggcggtgccg gtggtcaagg tggtgccggc ggtgccggcg 3892381 ggaccggcgg caacggcggc aatatcaccg gcggcaccgc gggcaccgcg ggggccgccg 3892441 gtaacggcgg cgccgccgga aagggtggcg ccggcggcca aggcggcacc ggtggcggga 3892501 ccgggggtca gggtggcgcc ggcggcgacg gcggtgccgg cggcaccggc ggcgaccgca 3892561 ccgtcggcgg tggcacggtc cccgccggct ccggtggaca aggcggtaac gctggcggtg 3892621 gtggggccgg cgggcagggt ggagccgacg gcggcagcgg cggcgacggc ggcgacgccg 3892681 gcacaggtgg caatggcggt aacggcggca accgtaattc cggcaatggc accggcggcg 3892741 ctggcggcaa cggtggtggt ggtgctaacg gtggcgccgg cggcgctggg ggcagcggcg 3892801 gcggcaccgg cggcaacggc ggcgctggcg gcgacgccgg cgacgccggc aacggcggca 3892861 acggcaacgg caccggcaac ggcggcaacg gcggcaacgg cggcatcgcc ggcatgggcg 3892921 gcaacggcgg tgccgggacg ggcagcggca acggcggcaa cggcggcagc ggcggcaacg 3892981 gcggcaacgc cggcatgggc ggcaacagcg gcaccggcag cggcgacggc ggtgccggcg 3893041 ggaacggcgg cgcggcgggc acgggcggca ccggcggcga cggcggcctc accggtactg 3893101 gcggcaccgg cggcagcggt ggcaccggcg gtgacggcgg taacggcggc aacggcggca 3893161 acggagcaga taacaccgca aacatgactg cgcaggcggg cggtgacggt ggcaacggcg 3893221 gcgacggtgg cttcggcggc ggggccgggg ccggcggcgg tggcttgacc gctggcgcca 3893281 acggcaccgg cgggcaaggc ggcgccggcg gcgatggcgg caacggggcc atcggcggcc 3893341 acggcccact cactgacgac cccggcggca acgggggcac cggcggcaac ggcggcaccg 3893401 gcggcaccgg cggcgcgggc atcggcagcc ttggcggcgg cactggcggc gatggcggca 3893461 acggcggtac cggcggcaac ggcggtaccg gcggcgaggg cggcgaggtc ggcggcgccg 3893521 gcggcaccgg cggtgcggcc ggcaatggcg gcgatggcgg caccggcggc accggcggcg 3893581 gggacggggg cgccggcggc accggcggca ccggcggcac cggcggcctc ggcgaccccc 3893641 gggtcggcgg atccggcggc gacggcggca ccggcggcag cggcggtgcg gccggcaatg 3893701 gcggcaacgg cggcaacgcc ggcgcgggag gcaatggcaa cggcggcacc ggtggggccg 3893761 gcggtatcgg cggcaccggc ggcaatggcg gcgacgccga gcccggagtg cccccgggag 3893821 ccggtggtgc tggcggcgcc ggcaccaccg gcggcaaggg tggcaccggc ggcaacggca 3893881 gtggcaccgg ctcgggcggc accggcggcg atggcggcac cggcggtggt ggtgggaacg 3893941 gcggcaccgg ctggaatggc ggcaagggag acaccggcag cggcggtggc gccggagacg 3894001 gtggtaaggc accagccggt ggcaccggcg gcgccggcgg cgacggcgga gcgggcggca 3894061 agggcggcag cggcggcgtc tagtcgcgat gggcccagcg gccgcgatgg tgcgccgggc 3894121 gtccgccggc gagtggtcca gccagatttg acgacaaacg gcgacccagc ggtatccccc 3894181 agccgcggcg ccatagccgc gacccgcgca atcaggaacc gctcgtcacg tgtcccgcat 3894241 gcacgtcatc ggctggccgc gcctcggtct gctccttggc ccagcggtag tccggcttac 3894301 cggcgggcga acgcttcacc tcgtcgacaa accacagact gcgcggcact ttgtagcccg 3894361 cgatctcgga gcgcacgaac gagtccaact cggccaacga cggccgacaa cccggccggg 3894421 cctgcaccac ggcggccacc tgctggccgt aacgcggatc gggcaccccg accaccagag 3894481 cgtcgaacac gtcgggatgc cccttcaaag cggcctcgac ctcttcgggg tagaccttct 3894541 cgccgccgct gttgatcgac accgagccac gacccagcat ggtgaccgtg ccgtcctcct 3894601 cgacttgggc gtagtccccc ggaatggcgt agcgcacacc gttaatcgtc cggaacgtct 3894661 cggccgtctt cttctcgtcc ttgtagtagc cgacgggaat gttgcccttc ttggcgagcg 3894721 tgcgcgcggg tcgcactgct tggcgcctgg tgcaccggtc gccgggcggc tcctccccag 3894781 ggcgctccag gttcgttgcg gcattaccag aaagccggca catattagat gagtggcaac 3894841 taaggttctc acttaaagat gccgccatat cggccgtggt tgcaccggcg caaagatggt 3894901 tgggagttcg cccatgtcgt tcgtgttgat cgcaccggaa ttcgtgacag cagccgcggg 3894961 ggatctgacg aatctgggtt cgtcgattag cgcggccaac gcgtcggcag ccagtgcgac 3895021 cacgcaggtg ctggctgcgg gcgccgatga ggtgtctgcc cgtattgcgg cgctgttcgg 3895081 cgggtttggc ctggagtacc aggcgattag tgcgcaggtg gcggcctacc accagcggtt 3895141 tgtgcaggcc ttgagtaccg gcgcgggcgc atatgcctcg gccgaggccg ccgccgctga 3895201 gcagatcgtg ctgggcgtga tcaatgcgcc cacccaggcg ctgctggggc gcccgttgat 3895261 cggtgacggc gccaatgcga cgactcccgg cggggccggc ggggccggcg gtctgctgtt 3895321 cggcaacggc ggggccgggg cagccggggc gcccggccag gccggcgggc ctggcgggcc 3895381 cgccggcctg tggggcaacg gcgggcccgg cggggccggc ggcagcggtg ggggcaccgg 3895441 cggtgccggc ggcgccggtg ggtggctgtt cggggttggc ggcgccggcg gtgtcggtgg 3895501 ggccggtggc ggcaccggcg gggcgggcgg gcccggtggt ttgatctggg gcggcggcgg 3895561 ggccggcggt gtcggtgggg ccggtggcgg caccggcggg gccggcggcc gcgccgagct 3895621 gctgttcggc gccggcggtg cgggcggcgc gggtggggcg ggcaccgacg gcgggcccgg 3895681 tgctaccggc gggaccggcg gacacggcgg agtcggcggc gacggcggat ggctggcacc 3895741 cggcggggcc ggcggggccg gcgggcaagg cggggcaggt ggtgccggca gcgatggtgg 3895801 cgcgttgggt ggtaccggcg ggacgggcgg taccggcggc gccggtggcg ccggcggtcg 3895861 cggcgcactg ctgctgggcg ctggcggaca gggcggcctc ggcggcgccg gcggacaagg 3895921 cgggatgggg ggtgctggcg gggccggcgc cgataacccc accggcatcg gcggcaccgg 3895981 cggtgacggc ggcaccggcg gtagcgccgg tgagggcggg gccggtggtg ccgcgggcca 3896041 gctcttcagc gccagcggag cggccggtaa cgccggtgtc ggcggggccg gcggccaagg 3896101 cggtgacggc ggagccggcg gggccggcgc cgacgccgac cagcccggcg ccaccggcgg 3896161 caccgggttc gccggtggag ccggcggagc cggcggggcc ggcggtagca gcggtgccgg 3896221 cggcaccaac ggctccggcg gcgccggcgg caccggcgga caaggcggga tggggggtgc 3896281 tggcggggcc ggcgccgata accccaccgg catcggcggc accggcggtg acggcggcac 3896341 cggcggagcg gccggagccg gcggggccgg cggagcggcc ggcaccggag gcaccggcgg 3896401 catgatcggc accacaggca acgccggtgt cggcggggcc ggcggccaag gcggtgacgg 3896461 cggagccggc ggggccggcg ccgacgccga ccagcccggc gccaccggcg gcaccgggtt 3896521 cgccggtgga gccggcggag ccggcggggc cggcggtagc agcggtgccg gcggcaccaa 3896581 cggctccggc ggcgccggcg gcaccggcgg acaaggcggc gccgggggtg ctggcggggc 3896641 cggcgccgat aaccccaccg gcatcggcgg caccggcggt gacggcggca ccggcggagc 3896701 ggccggagcc ggcggggccg gcggagcggc cggcaccgga ggcaccggcg gcatgatcgg 3896761 caccacaggc aacgccggtg tcggcggggc cggcggccaa ggcggtgacg gcggagccgg 3896821 cggggccggc ggggccggcg gtagcagcgg tgccggcggc accaacggct ccggcggcgc 3896881 cggcggcacc ggcggacaag gcggcgccgg gggtgctggc ggggccggcg ccgataaccc 3896941 caccggcatc ggcggcaccg gcggtgacgg cggcaccggc ggagcggccg gagccggcgg 3897001 ggccggcgga gcggccggca ccggaggcac cggcggcatg atcggcacca caggcaacgc 3897061 cggtgtcggc ggggccggcg gccaaggcgg tgacggcgga gccggcgggg ccggcgccga 3897121 cgccgaccag cccggcgcca ccggcggcac cgggttcgcc ggtggagccg gcggggccgg 3897181 cggggccggc ggtagcagcg gtgccggcgg caccaacggc tccggcggcg ccggcggcac 3897241 cggcggacaa ggcggcgccg ggggtgctgg catcagcttc agcaacggca gcaacggcgg 3897301 caccggcggc accgggggcg tgggcggcac cgggggcgac ggcggcaacg caggcaccgg 3897361 cgccggcgac cccggcaaag gcggcaccgg cggcaccggc ggcagcggcg gggccggcgg 3897421 tagcggcggg gccaacttca acggcggcac cggcggcacc ggcggcaccg gcggcaccgg 3897481 cggcaaaggc ggcatgggcg gcatcgctgg cgacggcggg cccggcggtg acggcggcaa 3897541 cgccggggtc ggaggaaaag gcggcaccaa cggcaacggc ggcagcggcg ggaccggcgg 3897601 cacaggcggg cccggcggca gcggcggcgc gcccaccggc agcggcaccg gcggcaaagg 3897661 cggcgccggc ggtgacggcg gcgatggcgc cgacggaggg gcagccaccg gcgtcggcga 3897721 cggcggcgac ggtggtaacg gtggtaacgg tggtaacggc ggcacgggcg tcggctcgcc 3897781 cggcggcctc ggcggggcag gaggcactgg aggcctcggc ggcgccggtg caggcggcgg 3897841 agccgacggc gatgatggcg acgacggcca acccggcaac aacggcagct gaagcaccac 3897901 ctgccaccag acaacgccgt cgatgtggcg ctccggcgtg cgcaaggcaa atcggtgcga 3897961 tcctgaccag ccaggtgatt acctggttcg actcatgccg agcgaccgtc ccagcgccgc 3898021 agtggatgca tacacggtag gttcgacgga caccctgggc tggctgaccg aatggccgcc 3898081 gcagctcccc gaccgaaccg tcagcggcaa catgtcacct gcatcgtcgc caagcccagg 3898141 cgatcgcccg gccccgcaag cggatgtgtt ctcctgccct ccgtgggcag cgcgcccgac 3898201 acccgtaagc ggatgtcccc gacggactcc ggccggccta gccgatggct accccaggga 3898261 gtgccgcacg atggccgtcg atcaagtgcg gtccggcttc ggcgaaccct ccgcagctat 3898321 ttcgcgacgc gcgagaacac ccgtgcctta cttccatcca catcgatgtc ggctcggccc 3898381 ccgagaggca cgacagccga cccatgtcga ccttccgtgc ggggtgtccg gagccggtcg 3898441 cagccgcacc catcacccac cgctcgtcac gtgtcccgca tgcacgtcat cggctggccg 3898501 cgcctcggtc tgctccttgg cccagcggta gtccggctta ccggcgggcg aacgcttcac 3898561 ctcgtcgaca aaccacagac tgcgcggcac tttgtagccc gcgatctcgg agcgcacgaa 3898621 cgagtccaac tcggccaacg acggccgaca acccggccgg gcctgcacca cggcggccac 3898681 ctgctggccg taacgcggat cgggcacccc gaccaccaga gcgtcgaaca cgtcgggatg 3898741 ccccttcaaa gcggcctcga cctcttcggg gtagaccttc tcgccgccgc tgttgatcga 3898801 caccgagcca cgacccagca tggtgaccgt gccgtcctcc tcgacttggg cgtagtcccc 3898861 cggaatggcg tagcgcacac cgttaatcgt ccggaacgtc tcggccgtct tcttctcgtc 3898921 cttgtagtag ccgacgggaa tgttgccctt cttggcgatg acgccccgca tccccgagcc 3898981 gggcttgact tcgttgccgt cgtcatcgag cacgacggtg cgatggtcga tccgcacccg 3899041 gggcccgccg ccatgcgcct gcccggcagc aacgacgctg gtaccgccaa aacccgtctc 3899101 cgacgagcca attgagtccg tgatcacccg attcggcagc agctcaagga gtttctcctt 3899161 gatgctcggc gagaacagcg ccgcggtgct ggccaacagg aacaacgacg acaggtcgta 3899221 gtcgttgccc ttgaccagcg cgtcgaccag cgggcgggcc atcgcatcac cggtgaagaa 3899281 cagcaggttc accttgtgtt tgtggatcgt gcgccacacc tcgtcggcgt tgaattccgg 3899341 tgccagtacc gtggtttggc ccgagaagag cgccatccag gtggccgact gggtggcgcc 3899401 gtggatcatc ggcgggatcg ggtagcggat catcggtgga ttcgccgcgg ccgccttggc 3899461 caggtcgtat tcgtctttga cgaactctcc tgtcgcaaag tcggttccac cgaacagcac 3899521 acgatagatg tcctcgtgac gccacatcac acccttgggg aaaccggtgg tgccgccggt 3899581 gtagagcaga tagatggcgt cggcgctgcg ttcgccgaag tcacgctccg gcgagcccgc 3899641 cgcgatcgcg gaatagaact cgacgccgcc gtagcgccga tagtcctggt ccgagccgtc 3899701 ctcgacgacc aagatcgtcc ttacatgggg cgtgtcgggg agaacgttgg cgacccggtc 3899761 ggcgtagcgg cgttcgtgca ccaacgcgac catgtcggag ttgtcgaaca ggtagcgaag 3899821 ttcgccctcc acgtaacgga agttgacgtt caccaagatg gcgcccgcct tcacgatgcc 3899881 cagcatcgcg atcacgatct cgatgcggtt gcggcagtac aggccgacct tgtcgtcctt 3899941 ttgcacgcct tgatcgatca ggtggtgcgc gaggcggttg gccttatcct ccagctgggc 3900001 gtaggtcaac tgctcatcgc cgcagataac ggcgacacgg tcaggcacgg cgtcgatggc 3900061 gtgctcggcg agatcggcaa tattcagggc cacggccacc aaactagaac gtgttacatt 3900121 tcttgacaag ctcacacccg acgggcagaa agaggtggcg gccgtggcaa ccgtggaatc 3900181 cggacccgac gcgctggtgg agcggcgcgg ccacaccctg atcgtgacca tgaaccggcc 3900241 ggccgcccgc aacgcgctga gcaccgaaat gatgcgaatc atggtgcagg cctgggatcg 3900301 cgtcgacaac gatcccgaca tccgttgctg catcctcacc ggagccggtg gctacttttg 3900361 cgccggcatg gacctcaagg cggcaaccca gaaaccgccg ggcgactctt tcaaggacgg 3900421 cagctacgac ccgtcgcgca tcgatgccct gctcaaaggg cgccgcttga ccaaaccgct 3900481 gatcgccgcc gtcgaggggc ccgcgatcgc cggcggcacc gagatcctgc agggcaccga 3900541 catccgggtc gccggtgaaa gtgcgaagtt cggcatctcc gaggccaagt ggagcctgta 3900601 cccgatgggc ggctcggccg tgcggctggt ccggcagatc ccctacactc tggcctgcga 3900661 cctgctgctg accggacggc acattaccgc cgccgaggcc aaggaaatgg gcctgatcgg 3900721 ccacgtggtg cccgacggcc aggcgctgac caaggctcta gaacttgccg acgccatctc 3900781 ggctaacgga cccctggccg tgcaggccat cctgcggtcc atccgcgaga ccgagtgcat 3900841 gcccgaaaac gaggcgttca agatcgacac ccagatcggc atcaaggtct tcctgtccga 3900901 cgacgccaag gaaggcccgc gcgcgttcgc cgagaagcgc gcacccaact tccagaaccg 3900961 ctaggcgccg agcgtgaact gagggcgaga tttcggccga ttttccgccc tcagttcacg 3901021 ttggacggcg gtgtcggtgc acgacggcac actgcgatcg tgatcgaacc attcctcggc 3901081 agcgaagcga ttgcctccgg cgcgttgacg cggcaccggc tgcgaagcgc atacgccacg 3901141 atccaccccg acgtctatgt ctcccccggc gccgacctga ccgcatggag tcgcgctcag 3901201 gccgcctggc tatggtcgcg gcggcgcggc gtcatcgccg ggcagtcggc ggcggcgatg 3901261 cacggcgcca aatgggtcga cgcgcgacag gcggccgagc tgctctacga ccaccgtcgc 3901321 ccgccggccg gcatccacac ctggtcggac cgtgtcgccg acgacgagat ccagccaatc 3901381 tccggcatga atacgaccac accggcgcgc accgccctcg acctcgcccg ccgctatccg 3901441 gtcggcaagg ccgtcgcggc catcgatgcg ctcgcccgcg cgacggacct caagctggcc 3901501 gatgtcgaga tgctcgccga acgctaccgg ggaagccgcg gcatccgaaa tgctcgtatc 3901561 gcattggatc tggtggatcc aggtgccgag tcacctcgcg agacgtggct gcgtctgcta 3901621 ctcatccgag cgggctttcc aagaccacag acccagatcc cggtttacga cgagtacggc 3901681 cagctggtcg cggttatcga tatgggttgg gcaggaatca aggtcggcgt ggattacgag 3901741 ggcgaccatc accggaccga ccgcagaacg ttcaacaagg acatcaagcg tgccgaagcg 3901801 ttgaccgagc ttgggtggac cgacgtacgc gtgacggtcg aggacaccga gggtggcatc 3901861 atctggcggg tgtcagcggc ctggcagcgc cgaacgtgaa ctcacggcgg agattcggcc 3901921 gatattccgc cctcagttca cgttcggcgt ggctcagccc agcggcgggc tcggcgtgaa 3901981 caccaccggc atggattcca ggccgctgac aaagttcgcc ggccgcagcg gcaacacgga 3902041 gtcatcggcg accaaccgca ggtcgggtag ccgccgcaac acccgttccg tcatcaacga 3902101 cagctccaac cgggccagct gattgcccag gcagaaatgc gtgccgaagc caaacgccaa 3902161 gtggctgttt ggatttcgct gaacatcaaa cttttccggt tcacagaaaa ccgcctcgtc 3902221 gaagttcgcc gactcgaaga gcagcatcat cttctcgccg gcacacaacg ccgtgccgtg 3902281 aaactcggta tccgcggtca acacccggca catgttcttt accggggcgg tccaacgtag 3902341 catctcctcg atggccccgg gcagcaacga cgggtcgcgc tgcagcaggt cccactggtc 3902401 acggttgcgc agcagctgct cggtaccacc gctcaaggta tgccgcgtgg tctcgtcgcc 3902461 gccgatcagg atcagcagcg tctccatgac cagctcgtcg tcgcttagcc gctcgccgtc 3902521 aacttcggaa ctcaccagca cgctgaccag gtcgtcggtg ggtccgctcg ccgtgccgca 3902581 atggtggccc gggtgaagtc gttgtaggcc gcgaaggcgt ccatggtgat ctggaaatcc 3902641 tcttgagaca catgcgaact gaggaatgtc accagatcgt cggaccaccg caagaacatg 3902701 tcccgctgct ctggacgcac cccgagcatg tcgccgatca ccgccatcgg tagcggcgcg 3902761 gccaggtccc gcacgaagtc acactcgccg cgttcgcaca cggcgtcgat cagggtgtca 3902821 cacagcgcgg caatcgacgc ctccttgtcc ttcacccgct tgcgggtgaa gccggcgtta 3902881 accagcttgc gccgcaacag atgtgcggga tcgtccatgt cgatcatcat cggcagggcg 3902941 ggctggtcgg ggcggatgcc gccggcgttg gagaacagct cgggttgacg ttcggcgtcg 3903001 atcaccgcct ggtacgtcga cgcggccgcc aggccgttgc gatcgcggaa caccggttgg 3903061 ttggcccgca tccaccggta cgcggcccgc gcctcgcggc tggcgtagaa gttgccgtcg 3903121 gccagatcca cgtccggagc ttcagtcatc gcgatcctcc gcactacagt gggcgatatg 3903181 cccgtctcgc aacacaccat cgccggcacg gtgctcacca tgccggtgcg cattcgcacc 3903241 gccaacctgc attccgcgat gttctcggtg cccgccgacc cagcgcagcg cctcatcgac 3903301 tacagcgggc tgcgggtgtg cgaatacctg cccggtaagg caatcgtgat gcagatgctg 3903361 gtgcgctacg tcgacgggga tttggggcga taccacgagt acggcaccgc gatcatggtg 3903421 aacccgcccg gcacccaacg ccgcgggccc agagccctca cccgagccgc cgcgttcatc 3903481 catcatctgc cggtagatca ggtgttcacg cttgaggccg ggcgcaccat ctggggcttc 3903541 ccgaagatca tggcggactt caacgtcacc gacggccgga ggttcggctt cgacgtcagc 3903601 gccgacggac ggttgatcgc cgggatcgag ttcagcaccg gcctgccggt gccgaccctc 3903661 gggtggcaaa tgttgaagac ctactcccac catgacggcg taactcgcga gattccctgg 3903721 gaaatgaaag tctcgggcct gcgcgcccgg ctcggcggcg cccgactgcg gttgggagac 3903781 catccctacg ccaaagaact ggcatcgctg ggcctgccga agcgggctct gttgtcccag 3903841 tcggcggcca acgtagaaat gaccttcggc gacggtcacc cgatctgaac cgcaagaaag 3903901 cgaagccatc agcccaatct agaacgcgtt ctagcccgct ggcaaggatc gatcagacca 3903961 gggcggcaag gtcgcggacc tgctctgcgc tgccggcggt caccaccatc atggtgaccc 3904021 cggcggcctc ccagacggcc atctgcttac gcacgtggtc gatgtcaccg acgatcacgg 3904081 cgtcgtcgac gagctcgtcc gggatgatct cggcggcctc gtccttgcgg ccagaccgaa 3904141 ataacttggt gacctcatcg accacttgcg tgtaccccat ccggcgatag acgtcggcgt 3904201 ggaagttggt ctcttcggcg cccatcccgc ccatgtagag cgccaggaac ggcttgattc 3904261 cggcaaacgc ggccgcccga tcgtcggtga tgaccacctg cgccgtcgcg cagatctcga 3904321 agtcctcgcg gctacgccgg gcgccgggcc gggcgaatcc ttcgtcgagc cattcgttgt 3904381 acatgccggc catgcgtggc gaatagaaga tgggcagcca gccatcgcag atctcggcgg 3904441 ccagcgcgac gttcttgggc ccctcggccc ccagcatgat tggtatgtcg gcgcgcagcg 3904501 gatgggtgat gggtttgagc gctttgccca gacctgtcgt gccctccccc gtcagtggca 3904561 gccggtagtg cggcccggcg ctggtcaccg gcgattctcg ggcccacacc tggcgcacga 3904621 tgtcgatgta ttcgcgggtg cgagccagcg gcttgggaaa ccgctgcccg taccaaccct 3904681 cgaccacctg cggaccggac acgccgagcc cgagaatgtg ccggccaccg gacagatggt 3904741 ccagtgtcag cgcggccatc gcacaggccg ttggtgtgcg cgcggacagc tggatcaccg 3904801 acgtacccag ccgcacccgt tgcgtcgacg agccccacca ggccagcggc gtgtaggcgt 3904861 cggaccccca cgcctcggcg gtgaacaccg tgtcaaaacc cgcatcctcg gccgcggcga 3904921 cgagttccgc atggttctgc ggcggctgcg cgccccaata ccccagctgt agtccgagct 3904981 tcatccctgc ctccacgacg cccttcagga gggcaatgtt gaaaccgttg ttagaacctg 3905041 ttctactcga caggcgtgac agccagctcg agcggcccgg cgctgatcga tcactctgag 3905101 ccgccccttt ccgcgcccct cacgttgtcc ttcgactaca cccgttcggt ggggcccacg 3905161 ttaagcaggt ttttcaccgc cttgcgtgca cgccgcattg tcggggtgcg cggatccgac 3905221 ggccgagtcc atgtgccgcc ggtggaatat gacccggtta cctacgaacc cctgagcgaa 3905281 atggtaccgg tgtccagcgt cggcaccgtc gcgtcctgga cctggcaacc cgagccgcta 3905341 gccggccagc ccctggaccg gccgttcgcc tgggcgctga tcaagctcga cggcgccgac 3905401 accttgctga tgcacgccgt tgatgtggga accgccggcc cttccgccat ccacaccggc 3905461 gcccgggtgc acgcgcattg ggccgaccaa ccggtgggcg ccatcaccga tatcgcctgc 3905521 tttgcgctcg gcgagaccgc agaaccggtg gcggctcaca agaccgagga tgcgcgggac 3905581 ccggtcacca tgatcgtcac gccgatccag ctggaaattc agcacaccgc ctcgcacgag 3905641 gagagtgcgt atctgcgcgc catcgcccag ggcaagctcg tgggcgccag aaccggaaag 3905701 accggcaagg tatacttccc gccgcatggc gccgacccgg ccaccgggaa acccacctcc 3905761 gagtttgtcg agctgcccga caagggcacg gtgacgacgt tcgcgatcgt caacatcccg 3905821 ttcctgggcc agcgaatcaa gccgccctat gtggcggcct acgtgttgct cgacggcgcc 3905881 gacatcccgt ttttgcattt ggtttccgac gtcgacgcgc accaggtgcg gatgggcatg 3905941 cgcgtcgagg cggtgtggaa gccgcgggag cggtggggac tgggcatcga caacatcgag 3906001 tacttccgcc ccaccggcga accggatgcc gactacgaca cctacaagca ccacctgtaa 3906061 agggcccacc aaccaatgag cgttcgcgat attgccgttg tcggcttcgc ccacgccccg 3906121 cacgtgcgcc gcaccgacgg cactaccaac ggcgtcgaga tgctgatgcc gtgcttcgcc 3906181 cagctatacg acgagctggg catcaccaag gccgacatcg gattctggtg ttcgggttcg 3906241 tcggattacc tggctggacg agcattttcg ttcatctccg cgatcgactc catcggagcc 3906301 gtaccgccga tcaacgaatc gcacgtcgag atggacgccg cctgggcact gtatgaggcc 3906361 tacatcaaac tgctgaccgg cgaggtcgac accgcgctgg tgtacggctt cgggaagtcc 3906421 tcggccggaa cgctgcgccg tgtgctgtcc cgccagaccg acccgtacac cgtcgcgccg 3906481 ctgtggccgg attcggtatc gatggcggga ctacaggcgc ggttggggct ggactccggc 3906541 aagtggaccc acgagcagat ggcgcgagtg gcgttcgatt ccttcaccaa cgctcgccgg 3906601 gtggattccg tggagccgcc gatcaccgtc ggggaactgc tggcacggcc gttttttgcc 3906661 gatccgctgc ggcgccacga cattgcgccg attaccgacg gtgccgccgc ggtcgtgctc 3906721 gcggccgaca accgcgcccg agaactgcgc gaaaatccgg cgtggatcac cggaatcgaa 3906781 catcgcatcg agtctccggc gctgggggcg cgcgacatca ccgagtctcc gtcgaccaaa 3906841 ctggcggcca agatagccac cggcggacac accggcgaca tcgacgtggc ggagatccat 3906901 gggcccttta cccaccagca cctgatcgtc gcggaggcca tcaggattcc gggtaagacg 3906961 aaagtgaatc cgtccggcgg cccgttggcc gccaacccca tgttcgccgc cggccttgag 3907021 cgtatcggct ttgccgcaca acatatctgg gacggatcgg cgcggcgcgt gctggcgcac 3907081 gccaccagcg gaccggcgct gcagcaaaac ctggtcgcgg tcatggaagg acggggatag 3907141 tggaggggca gcgctgatgg ccggaaagct ggccgccgta ctcggcaccg ggcagaccaa 3907201 gtatgtcgcc aagcgccaag acgtttcgat gaacggtctg gtgcgggagg ccatcgaccg 3907261 agcgctggcg gattccggtt ccaccttcga cgacatcgac gccgtcgtgg tcggcaaggc 3907321 gcccgacttc ttcgaagggg tgatgatgcc ggagctattc atggccgacg ccatgggcgc 3907381 gaccggcaag ccgctgatcc gggtacacac cgccggttcg gttggcggat ccaccggggt 3907441 agtggctgcc agcctggtgc aatccggcaa ataccgccgg gtcctggcat tagcctggga 3907501 aaagcagtcg gaatccaatg ccatgtgggc gttgtcgatt cctgtgccgt tcaccaaacc 3907561 ggtcggtgcc ggtgcggggg gatacttcgc cccgcatgtc cgggcctata tccgccgctc 3907621 gggcgcaccg gcacacatcg gtgctatggt tgcggtcaag gaccggctca acggcagccg 3907681 caacccgttg gcacatctgc agcagcccga catcaccctg gagaaggtga tggcatctca 3907741 gatgctctgg gatccaatac gtttcgatga gacgtgcccg tcgtcagacg gtgcgtgcgc 3907801 ggttgtcgtc ggcgacgagg agatcgccga cgcgcgactg gcgcaagggc atccggtggc 3907861 ctggattcat ggcaccgcat tacgcaccga gccgctggct ttcgccgggc gcgaccaggt 3907921 caacccgcag gccggccgcg acgcggcggc ggcgctgtgg aaggccgcgg gcatcaccag 3907981 ccccatcgac gaaatcgacg ccgccgaaat ttacgtcccg ttctcctggt tcgagccgat 3908041 gtggttggag aatctgggat ttgcccgcga gggcgagggc tggaagctca ccgaggccgg 3908101 cgagactgcg atcggcggtc gactaccggt gaacccttcc ggcggcgtgc tgtccgccaa 3908161 tccgatcggc gcatcgggcc tgatccgctt cgccgaggcc gcgatccaag tcatgggcaa 3908221 ggcggaggcg cgtcaagttc cgggtgcgcg aaaggccttg gggcacgctt acggtggcgg 3908281 ctcgcagtac ttctctatgt gggtggtcgg ctgcgagaaa cccaaacagg cagccgcata 3908341 atcgcccggc gcgatccggg cgacgccgca gaccatccga gcatggtgaa gttcacaccc 3908401 gatagccaga cgtcagttct gcgcgcgggc aagtgctcag gtactctttc tccgtcgcgg 3908461 tcgcgattgc aaagggggag ctggccggtg gattccgaac gccgacgcta cgggtggccg 3908521 cggaatcgac gcaccttagc cattactgga gctgcagtcg ttgtcgtggt gaccctcgca 3908581 gccattggtt acctgatctt tgagccaaaa atttctgggt cgtccacgtc caggcaggcc 3908641 gcatcgccaa ccactccttc cccgcccagc caggtcgtgg tgccgatcga cctttggaat 3908701 cccgacgggg tgacggtgga cctggcggac gccgtttacg tggccgactc cggtcacaag 3908761 cgactgctga aactgccggc cggctccaac accccgacca cgttgccatt caccgacacc 3908821 atcggtccag gcggcgtggc ggtaaacagc aaccgcgacg tctatgtcat cgatgaagac 3908881 agccaccatg tgttgaaact cgcggccggc atcgaacccc cggtcgagct cccgttcggc 3908941 agccttggcg atgcgcatgg tttggcagtg gaccgcagcg acagcgtcta tgtcgtcgac 3909001 tatgacaatg ccaaagtgtt gaaactgccc ccaggcgcag atacccctac cgaactgccg 3909061 ttcgtcgggc tcgaccaccc ctatgatgtg gcggtggacg gtgctggcac cgtctacgtg 3909121 accgacagcg gccacaatcg cgtggtggcg ttgaccgcgg ggtcggccac gccggtgcac 3909181 ctcccattcg ccgatctcag ctttcccgcc ggtgtgacgg tggaccgcga cgatagcgtc 3909241 tatgtggccg atctgaacaa caatcgggtg ctgaagctgg cggccggctc gaatgcgcag 3909301 tcgcagctgc cgttcaccgg actcttctcc ccaactgatg tggcggtgga caacgacggc 3909361 gccgtctacg tgatcgactt ttacaaccgg atgttaaaac tgccgacggc ttaacccgca 3909421 gcgacgccta catgggttcc agtccggcca gatgccgtgc agccaggtca cgataggcct 3909481 gcggattgac gttaacccac atttccgcgc ccgttccttc aatcggtccc ttcaccttgg 3909541 ccggcgctcc cgttaccagc attccggccg gaatctgagt gccggccacc accagcgctc 3909601 ccgcggcgat catgcagcgc gcgccgatta ccgctccgtc gaggaccgtc gcgtggttgg 3909661 cgatcagagc ctcagacccg acgtggacgc cgtggatcac acacaggtgc gccactgtcg 3909721 cccccgggcc gatgtctacc gggatgccgg gcggtgcgtg taataccgcc ccgtcctgca 3909781 cattggcccc ctcgcgcacg acgacgggcg catagtcgcc gcgcagcacg gcattgaacc 3909841 agaccgacgc cccagcctcg atggtgacgt cgccgatcag ggtggctgtc ggggccacaa 3909901 acgcggtggg atcgatccgg ggcgatcggc cctcgaaaga aaacagcggc atcgtttaga 3909961 tatacgcccg tcgtacatat gccgtggcca gactcgctgt cgttgcgctc accggagaga 3910021 aaactgtaac gtgttctagt tagcgatacc gatcgggagg tgacaggtga gtaccgacac 3910081 gagtggggtc ggtgttcggg agatcgatgc cggcgccttg ccgaccaggt atgcgcgtgg 3910141 ctggcattgc ctgggcgtcg cgaaggacta tttggaaggg aagccacacg gggtagaggc 3910201 gttcggcacc aagctggttg tgttcgctga ttcccacggg gacctgaaag tcctcgacgg 3910261 ctactgccgg cacatgggcg gcgacctgtc cgagggcacc gtcaaaggcg acgaggtcgc 3910321 ttgcccgttc cacgactggc gctggggtgg cgacggccgc tgcaaattgg tgccgtatgc 3910381 caggcgcaca cccagaatgg cgcgcactcg gtcgtggacg accgatgtgc gcagcgggct 3910441 gctgtttgtc tggcacgacc atgagggcaa tccacccgac cccgcggtcc ggatccccga 3910501 gattcccgag gcggccagcg acgagtggac cgactggcgg tggaaccgca tcctcatcga 3910561 agggtccaac tgccgcgaca tcatcgacaa cgtcaccgat atggcgcact tcttctacat 3910621 ccacttcggt ttgccgacgt acttcaagaa cgtcttcgag ggccacatcg cctcgcagta 3910681 tctgcacaac gtgggccggc ccgatgtcga cgatctgggg acgtcttacg gtgaggcgca 3910741 tctggattcg gaggcgtctt acttcgggcc gtcgttcatg atcaactggc tgcacaaccg 3910801 ctacggcaac tacaagtccg agtcgatcct gatcaactgc cactacccgg tgacccagaa 3910861 ctcgttcgtc ctgcaatggg gcgtcatcgt cgaaaagccc aagggtatga gcgaagagat 3910921 gaccgacaag ttgtcgcggg tgttcaccga gggcgtcagc aagggcttct tgcaggatgt 3910981 cgagatctgg aagcacaaga cccgcatcga caacccgctg ctggttgaag aggacggcgc 3911041 cgtgtatcag ctgcgccgct ggtatgaaca gttttatgtc gacgtagccg acataaaacc 3911101 agagatggtg gagcgcttcg agatcgaggt cgacaccaag cgcgccaacg agttctggaa 3911161 tgccgaggta gagaagaact tgaaatcgag agaagtttcc gacgacgtgc ccgccgagca 3911221 acactgacgg acatgcctga cgatcagccg gcggttcccg acgtcgatcg gctggcccgg 3911281 tcgatgctac tgctgcacgg tgatcatcac gatcacaacg attcccccga gcaacaccgc 3911341 acatgtggat cctggtcgaa gtcaagggat ttcgctgacg acccgcagcg tgctgccgcg 3911401 gtgcgcgaag ccagccgcgc cgagcgcgac cgttatctga cctcaggcct gcaaccggtg 3911461 gattgccggt tctgccatgt cacggtgacc gtaaagaggc tggggccggg tcataccgct 3911521 gtgcaatgga acaccgaggc gtcgcggcgc tgcgcgtact tcaccgagct gcgggcacgc 3911581 ggcggggatt ccgcacgcac caggtcctgt ccccggctga ccgacagcat cgaacacgca 3911641 gtggccgagg gctacttgga gcaccacgac ccaaaccgat aacgtcgcac acccgcttgc 3911701 cgcgggatac ggtgccgcat ccggcacggt gccaccgagg cgtacggttt gtgacggcgg 3911761 ttccgggact gagcttccta tgaagcctct ccggtgtgcg cgagtcgatc gaggcgcacc 3911821 agagcatcgt gttcgccgcc ctggcggtca gaactggatt gaacaccaaa caggttggag 3911881 catcaagaaa ttcgttcaaa cactacgtcg ctaccgcacc gtgaccccgc gccggcaacc 3911941 acaccctccg tgcgggacct tcagggtccg atgcaagagc accggtctgt tggatgggga 3912001 tgcgactcgt aggcgatgcc ctgtccattc actcgagatc cattcactcg aggtcgactt 3912061 cgcccggccc gccccgcatc tcacaaaacg agggtttact gtgaccttat tgtcgcgcaa 3912121 aaagaaaggc ccgattctga atattgggca gccagccaaa tccgcggcaa tcctccttgt 3912181 agagcagctt gaaaccgagt tcagaagctt ttgattcgag atcagcgtcg gtgatacccc 3912241 attgccagat gtctggtata tcgcgccacg gcttgtcatg atccgggtgc tttttatcga 3912301 gtttttgaaa gagatctcta taagccttgt taagtttgct gtgcgggacg ttacgaaagt 3912361 agtgcttttc acccagatcc aacaatctaa ctgtggttgt tgaacctatc cattgttgat 3912421 tataaatcag caaacaacgt acattctttg cgtacatatc caggatagtg tcccaatcag 3912481 gcgacacttg gtgcagcagc acatcgaata ggaagagggc gtcgacatta ccgactttat 3912541 cggcaatctc ctggtctccg aagttcccct caataacgcg aagttgcgga tatgaatttg 3912601 cacgggccgc gactgttgga gttatgcggc catcgaccaa tactgcctct tttaccgggt 3912661 acttatccag ggcgcgaaat gtataggcgc cttccactcc ccaaacggca ccgagatccg 3912721 cgaacgactc tatgcgacat gatgtgaaag cccgatctat caggttgatt ttgcctctaa 3912781 cgagccaata gccaccctgt cgtaaccgat ccaacatcat ttcacctcaa atacgtgtgt 3912841 tcaactggcg tctcgctgga tggtagatca ccgctagctg gtccagtatg ccgcctgcca 3912901 tgagcttggg accagtgtga tctgcttgtg ccagggggag gacgggacca ccttgattgc 3912961 cagtcacggg accacggcgc gccgcgccgg tggtcttttc gcttattcgt gcgatcgtcg 3913021 tgacagctca agtcacggga ggcggcgggc atggcttttc gggaggtcag tatgaacaag 3913081 atcagggaag tgctgcgggt ctggctgggg gtggccgggt tgccggcccc ggggtgccgc 3913141 acgatcgccg cgcattgcgg tatggaccgc aagacggtgc ggcgctacgt ggaggcccgc 3913201 gcaggcaccc ggtctgcccc gcgacgacga tgtcagcgct atcgatgacg ggttgatcgg 3913261 ggcggtcgcc gacgcggtgc gtccggcccg gccggatggt catggtgcgg cgtgggagca 3913321 actgctgggg gtagcgaact gttcactacc ccctggcaat caggtggtcc cgtctccctg 3913381 gcaagcgaca gctcgggcaa tccacctggt aatcaaacaa tttcgggcgg ctggcgagtt 3913441 gtcgcgctga ggcgggcaaa ttgcgtatct gctcgaccaa tgcgacgcgg ctggccaaac 3913501 agcacaccga atcacagccc ggcgaaccgc tctttgacca tttcgaccgt gagcccgtag 3913561 tcagccaacg aataggaatg ctttggggcc cgggcaccgc tctggctctc ggcgtggacg 3913621 gttgtcattg cctgtcgagc ctcgtcggac agcgtcaacc cgaagtgccg gtagatatct 3913681 gccaccgtac ccagcggatc ggcaatcaag tcgtggtagt ccacgtcgta gaactgggcc 3913741 gaatcatatt tggcccgtgc ggcattgaac cgctccagcc cacgcgacca ggtgtccatc 3913801 gcgtccgcac cgatctgggc gcccacaaac ttcgtcgacc acccttctgt ggtgtgctgc 3913861 gccagcgagc acatcgacgc catgatcgtc tccaccggcc ggtgagtctg caccaccagg 3913921 gcatcgggat aggtcgccat cagcgcatcc agggcaaata gatgactcgg attctttagt 3913981 acccaccgct tttcggcatc gttgagccca atcagctgca ggttgcggcg gtgccggcaa 3914041 tacgacggcg tccagtcctg gcgtgacaac cagtcggcat agctgggtac atgcgccagc 3914101 gcctcgtacg acaccgaatg cagcgactgc cgcaacagct gccaacactc ctccaactcg 3914161 taggccgcca tgaaatgcaa gccggtgtat cccggattct cggcatgatg ctgggtgaac 3914221 tgtgcatcga gctggcgata caacgggttt gactcccagg tctcgcgcgg ggggcgcggc 3914281 tgcgggtact cggccagcca catgtgcagg ccttggtggg ccgggtcggc gcccagcagc 3914341 cggtgcagcg cagtggttcc ggtgcgcacc aacccggtga cgaagatagg ccgtttgatg 3914401 gcaacgtcga cgtgctccgg atactgcttc cacgcggact gggacagtag cctggccacc 3914461 agcgcaccgc gcaggaagaa ccggttcatc ttgctgccca acacggtgag gccggcttcg 3914521 ccctggtaag cgtccagcaa cacacccagc gcctcacggt agttgtcgtc gtcggtgcca 3914581 aaatcgtcga gacccaccag tttggtagcc gatgcgtgca gttcgtcgac ggtggccaca 3914641 tctttccgat cgggacgccg agtcattacg tgtggtactc cccgcaattg acgtccaggg 3914701 tctgcccggt gatgccgctg gccaggtcgc tggccaggaa aagaatcgct gaggccacct 3914761 cgtcttcggt tggcagccgt ttgagatcgg agtttgccgc ggtcgcctga tagatctgat 3914821 ccacggtagt gccgtatttg ccggcctgat ggtcgaaata gcttttcagc gtgtcacccc 3914881 agatatagcc gggtgcaacg gaattgacgc gaattccctg ctcgcccagt tccgtggcca 3914941 gcgaatgcga catagctagc agtacggact tggccatctt gtaggtgccg tatttcggct 3915001 gcgagtgccg gatcaccatg gagttgacgt tgacgatcgc gccgtgagac tgcgccagcg 3915061 cgggcgtaaa cgcctggatg agtcgcagcg tccccagcgc gctgagctct atcgcgtcac 3915121 ggatgtgctc aaatgtggtg ccggccaatg gtttcatcga tggcacccgg aacgcgttgt 3915181 tgatcagcac gtcggccttg ccgtacgccg ccagcgtggc ctgcacaagg ttgcttacgt 3915241 cgtcgtcgtc ggtgatgtcg gtgcgcaccg ccaccgcccg tcgcccggtg tcgatgatct 3915301 gcttggcgac gtcgtcgaga cgctccgcgc tgcgcgcagc cagcaccaga tcggcgccgt 3915361 ctcgcgcaca tcggtgcgcc agcgtcgtgc ccagccccgg tccgacgcca ctgacgacga 3915421 tcaccttgcg cttgagcatc ccggtcatcc cagcatcctg gtcgcgattt gccgttgacg 3915481 caacgcaatt cgcgcccgcc aatcatcctc ggaaatcttg ttgtgctggt aatgcggcag 3915541 tgccgccggg atggcgtcga agtcgaccag ttcgacggtg ggcccatcgg cctcggtgag 3915601 ctcacgggat acccgctgcc agcggaactg cagaaacccg cgccgatggc cgagcgtttc 3915661 cacccagttg gtcacacccg gattctgctc ggcgaccacg atgcgcacct tgccatccgg 3915721 gtccgcttgg gcctggctgg cattcaacga ggtctgatga ttgatatagt ccagcgagat 3915781 gtaccacatg ctgcccaact gaaaccctaa gtagggggcg tcgctcaccg gcaccgtgat 3915841 caccagcgct tgacccggcc gcagctcgaa atgaccggct gacgagtatt gggtggccag 3915901 gccaccggga gtcaaccgag gcgccaccat ggtgttaacc gggatattga ggtagaacca 3915961 ctggggaaac tgtaaccagg ttttcacccg gttcacaagc tgggatcccg ctgtggcata 3916021 acgcttttcc atgagctcgc gagtcagcgg cggcggcgcg gtgccgacgg tgtccagcct 3916081 ggcgatggcc agcgtgccgc gctgttgtga ccaatcgccg tacacctccc ggatcactag 3916141 ttgcccggga gcgctgggcc gcaaccgcca ttcgaagctg ccgtccgcgg cgatgtcgag 3916201 ctcacggtcg tcgaacgcgg cctggctggc cggcacgtta tagtcagtgt actcgccgcc 3916261 gagcagctga aagctcaggt cggtggtggt gccgcgccgt ccgctgacca catagtcgcg 3916321 gttggcctgc agccgggtgc cgaagtagag ggtgtcgggg ttgtccaggc ccatcttcgt 3916381 gaacggccct gttccggact gcaggaacgg gtggtcacgc tcgtagtcga aggccaggtg 3916441 catgcagccc gcgatgcagc cggccaggta ttgcagccct tcgagcaggt cggcttcagt 3916501 ctcgatgtgc ggggcggcgg ctaccagctg ctcggcttct gcgatcgcct cgcgcagcgg 3916561 gtcggagtac acgacttcga cactagaacg tgttcctgtt ttgcgtcaat ggcgaacatc 3916621 tgcccccgtc atttacggca attgaagaca aagcccgctc gcttccagag ccctgcgcac 3916681 gagctaccca ttattgatct agcttattgt tgcgttatac gacagtctga gcagtatatt 3916741 gtccgctata tgtgtattcg tagcggcgtg gattgacgcg aggctggtcg agacccggtc 3916801 cgtgagatgc gccggaaagg gtgtcgcgat gcggaactct actgaccgtc cagcggcggc 3916861 taacgaagtc tgcatccgcg acagccaccc aatgacgcgc ctgccgttgc gatctcagca 3916921 ctgacggcag cccggcctat ccgcgaccag ctagggaaag gcattcgcag atgttcatgg 3916981 atttcgcgat gcttccgccg gaagtcaact cgacacggat gtatagcggg ccgggagcgg 3917041 gctcgttgtg ggccgccgcc gccgcctggg atcaggtgtc ggcggaattg cagtcggcgg 3917101 cggagaccta ccgctcggtg atcgccagcc tcaccggctg gcaatggctg ggtccatcgt 3917161 ctgtgaggat gggtgcggcg gtcaccccgt atgttgagtg gctgaccacc accgccgcgc 3917221 aggcaaggca gacggccacc cagatcaccg cggccgcgac cggatttgag caggcgttcg 3917281 ccatgacggt gccgccaccg gcaatcatgg ccaaccgtgc acaggtgcta tcgctgatag 3917341 cgaccaactt tttcggccag aacaccgcgg cgattgcggc cctggagacc cagtacgccg 3917401 agatgtggga acaggacgcc accgccatgt acgactacgc ggccacctcg gcggcagcgc 3917461 ggactttgac accatttacc tccccgcagc aagacaccaa ctcagccggt ctgccggcgc 3917521 aaagcgccga agtcagccgc gcgaccgcca acgccggcgc cgccgacggc aactggctgg 3917581 gaaacctcct ggaagaaatc ggaatactgc tgctgccgat cgcgcccgag ctgacaccct 3917641 ttttcctgga ggcgggcgaa atcgtcaatg cgataccttt cccgagcatc gtcggggacg 3917701 agttctgttt gctcgacggc ctactggctt ggtacgcaac gatcggctcg atcaacaaca 3917761 tcaattcgat gggtaccggc atcattgggg ccgagaagaa tttggggatc ttgcccgagc 3917821 tagggagcgc ggctgcggcg gccgctcccc caccagccga catcgccccg gcgttcctcg 3917881 cgccgctgac cagcatggcc aagtcactat cggacggagc actacgcggc ccgggcgaag 3917941 tttcggccgc gatgcgcggc gcgggtacca tcgggcaaat gtcggtgccg cccgcctgga 3918001 aggcgcccgc ggtcaccacc gtcagggcgt tcgatgccac cccaatgacc acactgcccg 3918061 gcggcgacgc ccccgccgct ggagtgcctg gactgcccgg gatgccagcc tcgggggccg 3918121 gacgggctgg cgtggtgccc cgatacggcg tacggctgac cgtgatgaca cgtccactct 3918181 cgggcgggtg acatcagtgc gtgatggcgg cgcaccttga ccgtcgcgca ttgcgcttcc 3918241 aacaccaacg aactgggact gcagtagtag cgcaaccgcg cttggagcgg gtccccaccg 3918301 gttatggcat tcgataccgc accaaagcga aatcagttcc cgaaccccga ccgctggttc 3918361 tcgctgttga agccgcccga gttgtggacg cccgagttga agaatccgga gtttaagacg 3918421 cccgagtttg cgatacccac cgtttgggcg ccggtgtttc cgaaacccgc cgcgatgacg 3918481 gtgccgaccg cgttttgcag gcccgagttg ttgctgcccc cgttctggaa ccccgagctg 3918541 ccaacgcccg tgttgaagaa gcccgagttg ttcgtgccgg cgttgccgaa acccgagttc 3918601 gggccggagc cggtgccgac ggcgccgaac ccggtgttga gcgaacccgc gttgaaggcg 3918661 ccggtgttag tgaggcccga gtttgaccag ccggtatttc tgacgccggc gttgaaatca 3918721 ccggtattgc cttgtccgga attgaagttg cccgtgttga tgctgcccgc gttgaagctg 3918781 cccgcattca actggcccga gttcccgaag ccgaggttta ttgcgccgcc gtttccgaaa 3918841 ccggtattca cgcccccggc gtttcccatg cccgtgttga gcgaacccga gttcccgaaa 3918901 ccggtgttat ggcccgccac gaacgggccc acggtggccc ccgagtttcc gatgcccaca 3918961 ttcgcgtcgc tggagttgcc gatacccagg ttgccgttgc cggagttgaa gaaaccgacg 3919021 ttgttggtgc ccgagttgaa caagccgatg tttccgctgc ccgagttcag cccaccgatg 3919081 cccacctggt tgttgccggt gagcccgaag ccgatgttgc cgttgccggt gttgccgaaa 3919141 ccgatgttgc caatgccgtt gttgccgaag ccccagttgc tgctgccggt gttcccggcg 3919201 ccgatgttgc cgctgccggc gtttccggtg ccgacgttgt tgttgccggt attcccgaac 3919261 ccgacgttgg agttgccggt attgccgctg cccaggttgc tgttgccgag gttcccgttg 3919321 ccgacgttcc cactgcctgg aagtcccgcc ctcccgttgc cgctgccgaa gttgccgttg 3919381 ccgacgttcc cgccgcccac gttggagttg ccgatgttgc cgctgcccag gttcgcgtca 3919441 ccccggttcc cgctgcccag gttggtgtta ccgatgttgc cgttgccggt gttcaggtcg 3919501 ccggtgttgc cgccgcccag gttggcgttg ccgatattgc cgatacccag gttcggcagc 3919561 tgctgcagcg cctgctgaaa gggcaccaac tgctcggccg ccgccgaggc cccggagtgg 3919621 tagcccacca tcgcggccac atccgcggcc cacatctgtt cgtaggcgcc ctcaacggcc 3919681 gcaatcagcg gcgcgttcag cccgaaccaa ttcgacatca ccaactgcgc aaacgcattg 3919741 cggttggccg ccaccagcaa agggtgcacc gtcgccgccc gtgccgcctc gaacgcgctg 3919801 gccaccgcct tggcctgggc cgccgccccc gcggcccggg ccgccgcagc gctcaaccat 3919861 cccgcatacg gtgccgccgc cgccgccatc gccgccgccg ccggcccctg ccacgcctga 3919921 cttgccaagt ccgaggtcac cgacccaaac gaggacgccg cggaccccaa ctctgcggcc 3919981 agcccgtccc aggccaccgc cgccgccaac atcggcgccg aacctgcacc cgtgaacatc 3920041 cgcaacgaat taagctccgg cggcaatacc gcatagttca tgaccccgtc ccttcccgac 3920101 ctgacaatca gtcagaaccg taggacaaac cgggtcggac catctgcgtt tccgtgaaat 3920161 ccgcgaacca gcggtgtcgt caatgcgtta cggccgcacc gctatccagc tcgcgtttga 3920221 tttccagcgc gatgtcgatg agctggtctt cctggccacc gatgagcttg cgctgaccgg 3920281 cccggtgcaa cagcgccgac gccggcacgc cgtagcgctc ggcctggcgg accgcatgct 3920341 tgaggaagct ggagtagacc ccggaatacc ccatgatcaa cgcgttgcgg tcgagcagac 3920401 attcggccgg catggccggg cgcaccacgt cctcggcggc gtcggcaatg tcgaagaaat 3920461 caatgccggt cttgacgccg atcttgtcga acaccccgat cagcgcctcg accggcgcgt 3920521 tacccgcccc ggcgccgaaa cgccggcagg acccgtcgat ctgcttggcg cccgcgcgca 3920581 ccgccgccac cgaattggcc accccgagac cgaggttctc gtgcccatga aagcccacct 3920641 gggcgtcttc gccgagctcg gcgaccaggg ccgacacccg gtcggccacg ccgtcgagca 3920701 ccagggcacc ggcggagtcg acgacgtaga cacactggca gccggcgtcg gccatgatgc 3920761 gggcctgggc ggccagtttc tccggcgcaa tggtgtgggc catcatcaaa aacccgacgg 3920821 tttccagacc cagttcgcgg gccagcccga aatgctggat cgacacgtcg gcctcggtgc 3920881 agtgggtggc gatccggcag atcgacccgc cgttgtcccg cgcctctttg atgtcgtcct 3920941 tggtgcccac accgggcaac atcaaaaacg cgatccgggc ctctttcgcg gtcgccgcgg 3921001 ccagcttgat cagctcctgc tcaggggttt tcgagaagcc atagttgaac gatgagccgc 3921061 ccaggccgtc gccgtgggtc acctcgatca ccggcacgcc agcggcgtct agggcggcca 3921121 cgatggcacc gacctcgtcc ttggtgaatt ggtggcgttt gtggtgcgac ccatcccgca 3921181 gcgaggtgtc cgtgatgcgg acgtcccaca tatcggtcat cgcgctcctc ctacaaccag 3921241 cgtctccttg gcgatctcct cgcccacctt ggtggccgcc gcggtcatga tgtccaggtt 3921301 gcccgcatag ggcggcaggt aatccccggc gccctcaacc tcgacgaacg tggtgaccag 3921361 cgcctgcccg cccgagttga tcgacggctc gtcgaactgc ggttcgttga gcagccggta 3921421 tccaggcacg taggtctgca cctctttgac gacgtcgtgg atggaggcgg cgatcgcttc 3921481 gcggtcggcg tcggtgggga tggcgcaaaa gatggtgtcg cgcatgatca tcggcgggtc 3921541 ggcgggattc aagatgatga tcgccttgcc gcgggcggcc ccgccgatgg tctggacccc 3921601 acgggcggtg gtcttggtga actcgtcgat gttggcgcgc gtgcccggtc ccgctgaaac 3921661 ggaagccacc gacgccacga tctcggcgta gggcacctcc acgatccgag acaccgcgta 3921721 cacgatcgga atggtcgcct gtcccccgca ggtgatcatg ttgacgttcg gcgcgtccag 3921781 gtgctcgcgc aggttcgccg gcgggatcac cgccggaccc accgccgccg gcgtcaggtc 3921841 gatggcccgg atcccggcct cggcgtactt gggcgccgcg tcccggtgca cgtaggcact 3921901 ggttgcctcg aacaccaggt cgggtttatc gggctgcgcc agcagccagt ccaccccctc 3921961 gtgggtggtc tccaaaccca gcttggccgc gcgcgccagg ccatcgctct ccgggtcgat 3922021 gcccaccatc cagcgcggct ccagccactc cgatcgcagc agcttgtaca gcagatcggt 3922081 gctgatattt cccgacccga caatcgccac ttttgccttg gacggcatgt tgctcccctt 3922141 attcgaacga caaccggacc aaacccagcc cggtgaagtc ggcgacaaac tcgtcgccgg 3922201 cccgcgcctc gaccgcgaac gtgcatgacc cgggtaacac gatgtcgcct ttgcgcagcc 3922261 gcacgccgaa actctcgacc ttgccggcca gccaagccac cgcggtcgcc gggttaccca 3922321 acaccgcatc actgcggccc tcggccacca cctcgccgtt gcgggtcagc ttcgcatcga 3922381 tcgccctgac gtcaagatcg gccggcggca cccgggccgc gcccaacacg aagcccgccg 3922441 ccgaggcgtt gtcggcgatg gtgtcgcaga tcttgatctg ccaatccttg atcctggtgt 3922501 cgatcagctc gatggcgggc accagggcct cggtggccgc cagcacgtcg tcctcggtgc 3922561 agcccgcacc cggtaggtcg gcggccagga tgaagcccac ctccacctca acccgcggag 3922621 acaggtaccg ggacgcctgg accggcgtgt cttcgaacac ctgcatgtcg tcgagcaggt 3922681 gtccgtagtc tggttcgtca acccccatca tctgctgcat gatcggcgac gacagcccga 3922741 ccttatgacc caccacgcgg gcaccctcgg ccacccgctg ccggatgttg atcaactgga 3922801 tctcgtaggc gtcgacgaca tcgatctcgg gatgggcggc ggtcagttga ccgatcgggt 3922861 cgcggcttcg ctcggcttgt gctaggtcgg cggccagctc atcacgggtg gcatcacgga 3922921 gcattcggcg aagtcccctc gtaggcgtga ccgggccagt agcgcccgac ccgagcaatt 3922981 ctataacgtg ttctacatga ctgtgcagga gttcgacgtc gtggtggtcg gcagcggcgc 3923041 cgccggcatg gttgctgcgc tggtcgccgc tcaccgaggt ctctcgacgg tagtcgtcga 3923101 gaaggccccg cactacggcg gctccaccgc acgctcgggc ggcggcgtct ggatccccaa 3923161 caacgaggtc ctcaagcgcc gcggcgttcg agatacaccg gaggcggcac gcacctatct 3923221 gcacggcatc gtcggcgaaa tcgtcgagcc ggaacgcatc gatgcttacc tcgaccgcgg 3923281 gcccgagatg ctgtcgttcg tgctgaagca cacgccgctg aagatgtgct gggtacccgg 3923341 ctactccgac tactaccccg aggctccggg cggccgcccg ggcggacgtt cgatcgagcc 3923401 gaaaccgttc aacgcgcgca agcttggtgc cgacatggcc gggctggagc ccgcgtatgg 3923461 caaggttccg ctcaatgtgg ttgtgatgca gcaggactac gttcgcctca atcagctcaa 3923521 acgtcacccc cgtggcgtgc tgcgcagcat gaaggtcggc gcccgcacga tgtgggcgaa 3923581 ggcaacaggt aagaacctgg tcggcatggg tcgagccctc attgggccgt tgcggatcgg 3923641 gttgcagcgc gccggagtgc cggtcgaact caacaccgcc ttcaccgatc ttttcgtcga 3923701 aaatggcgtc gtgtccgggg tatacgtccg cgattcccac gaggcggaat ccgctgagcc 3923761 gcagctgatc cgggctcgcc gcggcgtgat cctggcctgt ggtggtttcg agcataacga 3923821 gcagatgcga atcaagtacc agcgggcacc catcaccacc gagtggaccg tgggcgccag 3923881 cgccaatacc ggtgacggca ttctcgccgc cgaaaagctc ggcgcagcac tggatctgat 3923941 ggatgacgct tggtggggcc cgacggtacc gctggtcggc aaaccatggt tcgcgctctc 3924001 ggagcgcaac tctcccggtt cgatcatcgt caacatgtca ggcaagcgat tcatgaacga 3924061 atcgatgcca tacgtcgaag cctgtcatca tatgtacggc ggcgaacacg gccaggggcc 3924121 cggaccgggc gagaacattc cggcgtggct ggtgttcgac cagcgatacc gggaccgcta 3924181 catcttcgcg ggactacaac cagggcaacg cattccgagc aggtggctgg attccggcgt 3924241 catcgtccag gccgataccc ttgcggagct ggccggcaag gccggtctac ccgcggacga 3924301 actcactgcc accgtccagc gtttcaacgc attcgcccgg tccggtgtcg acgaggacta 3924361 ccaccgcggg gaaagtgcct acgatcgcta ctacggcgac ccgagcaaca agcccaatcc 3924421 gaacctcggc gaggtcggcc acccgcccta ttatggcgcc aagatggttc cgggcgacct 3924481 ggggaccaag ggcggtatcc gcaccgatgt caacggacgt gctctgcggg acgacggcag 3924541 catcatcgac ggcctttacg ctgcaggcaa tgtcagtgcc ccagtgatgg gacacaccta 3924601 ccccggtccg ggcggcacga taggcccggc gatgacgttc gggtacctgg cggcgctgca 3924661 cattgccgat caggcgggaa agcgctgata tgcccatcga cttggacgtc gcgctgggtg 3924721 cacagctacc gcccgtcgaa ttctcttgga ccagtaccga tgtgcagctc taccagctgg 3924781 gactgggcgc cggctctgat ccgatgaacc cccgtgagct gagttatctg gcggacgata 3924841 caccgcaggt gttgccgacg ttcggcaacg tcgcggccac cttccacctc accacaccac 3924901 cgaccgtcca gtttccgggc atcgatatcg agctcagcaa ggtgctgcac gccagcgagc 3924961 gagtcgaggt tcccgccccg ctgccgccgt cgggttcggc cagggcggtc acccggttca 3925021 ccgacatctg ggacaagggc aaagccgcgg taatctgcag cgaaacgacg gcgaccacac 3925081 cggacggctt gctgctgtgg acgcagaagc ggtcgatcta tgcccgtggc gaaggcggat 3925141 tcggcggcaa gcgcgggccg tcgggatcag atgtcgcgcc ggagcgggcg cccgatctgc 3925201 aggtcgcgat gccgattctg ccgcagcaag cgctgctcta ccggctctgc ggcgaccgca 3925261 acccgctgca ctcggatccc gaattcgccg ctgccgcagg ctttccccgg cctattctgc 3925321 atggcctgtg cacctatggg atgacctgca aggcgatcgt cgatgcattg ctggactccg 3925381 atgcgacggc cgtggccggc tacggcgcac gctttgctgg cgtggcgtac ccgggcgaga 3925441 cgctcacggt caacgtgtgg aaggacggcc gccgcctggt ggccagtgtc gtcgcaccca 3925501 ctcgtgacaa cgctgtggtg ctcagcggag tggagctagt gccggcatag cggtgcggtc 3925561 ggcgctaaag gtttggtgag actgcggatt tcgcagaagt cgacatgaca ttgctgctat 3925621 ggtctgcggt gacggggccg tcgcagtggt ggcgcggcgg ttgggccgag ccggcgggat 3925681 gttgtcatgg cggatttctt gacgttgtca ccagaggtga attcggcccg gatgtacgcg 3925741 ggtggggggc ccgggtcgct atcggcggcc gcggcggcct gggatgagtt ggccgccgaa 3925801 ctgtggttgg cggcggcctc gttcgagtcg gtgtgctccg gcctggcgga ccgttggtgg 3925861 caagggccgt cgtctcggat gatggcggcg caggccgccc gccatacggg gtggctggcc 3925921 gcggcggcca cccaggcaga gggagcagcc agccaggctc agacgatggc gctggcctat 3925981 gaagcggcgt tcgccgcaac cgtacacccg gcgctggtcg cggcgaaccg cgccctcgtg 3926041 gcctggttgg cggggtcgaa tgtgttcggg cagaacaccc cggcgattgc ggccgccgag 3926101 gccatctacg agcagatgtg ggctcaggat gttgtcgcga tgttgaacta ccatgcggtg 3926161 gcctcggcgg tcggggcgcg gttgcggccg tggcagcagt tgctgcatga gctgcccagg 3926221 cggttgggcg gcgaacactc cgacagcaca aacacggaac tcgctaaccc gagttcaacg 3926281 acgacacgca ttaccgtccc cggcgcatct ccggtgcatg cagcgacgtt actgccgttc 3926341 atcggaaggc tactggcggc gcgttatgcc gagctgaaca ccgcgatcgg cacgaactgg 3926401 tttccgggca ccacgccaga agtggtgagc tatccggcca ccatcggggt ccttagcggc 3926461 tctcttggcg ccgtcgatgc caaccagtcc atcgctatcg gtcagcagat gttgcacaac 3926521 gagatcctgg ccgccacggc ctccggtcag ccggtgacgg tggccggact gtcgatgggc 3926581 agcatggtca tcgaccgcga acttgcctat ctggccatcg accccaacgc gccaccctcg 3926641 agcgcgctca cattcgtcga gctcgccggc ccggaacgcg gtcttgccca gacctacctg 3926701 cccgttggca ccaccattcc aatcgcgggg tacaccgtgg ggaatgcgcc cgagagccag 3926761 tacaacacca gcgtggttta tagccagtac gatatctggg ccgatccgcc cgaccgtccg 3926821 tggaacctgt tggccggcgc caacgcactg atgggcgcgg cttactttca cgatctgacc 3926881 gcctacgccg caccacaaca ggggatagag atcgccgctg tcacgagttc actgggcgga 3926941 accacgacaa cgtacatgat tccgtcgccc acgctgccgt tgctgttgcc actgaagcag 3927001 atcggtgtcc cagactggat cgtcggcggg ctgaacaacg tgctgaagcc gctcgtcgac 3927061 gcgggctact cacagtacgc ccccaccgcc ggcccttatt tcagccacgg caacctggtg 3927121 tggtagttaa cccaggatca gcccggacgt aggcaccccg gtgcccgcgg tgacgagcac 3927181 atgctcgacg cccgccaccg ggttcaccga ggtgccgcgc agctgccgca ccccctccgc 3927241 gatgccgttc atgccatgga tgtaggcttc gccgagttga ccgccgtggg tgttgatggg 3927301 cagccgcccg cccacctcga tcgcgccgtc ggcgatgaag tctttcgctt cgcccttgcc 3927361 gcagaatccc aactcctcca actgaatcag ggtaaacggc gtgaagtggt cgtagaggac 3927421 tgcggtctgg acatcggccg gcgtcagccc cgactgcgcc catagctgcc ggcccaccag 3927481 gcccatctcg ggcaggccgt cgagttccgg ccggtagtag ctgaccatcg tgtactggtc 3927541 tggactgcag ccctgcgcag ccgcctcaat gaccaccggg cgctgcttga ggtcccgtgc 3927601 gcgcgcagct gacgtcacca cgatcgcgac cgcgccgtcg gtctcctggc agcagtccag 3927661 cagccgcagc ggctcggcga tccacctcga attctggtgg tcctcaatgg ttatcggctt 3927721 gccgtagaag tacgccttgg ggttgttggc ggcatgcttg cggtcggcca ccgagacagc 3927781 accgaagtcc cggctggtcg caccagacag gtgcatgtac cggcgagcga tcatcgccac 3927841 ttgcgcggcg ggcgtggaga gcccgtgcgg atacgaaaac gaattgtcca cgccggtgga 3927901 gtcggcattc tcggtcaaac gagtttgcac ctgaccgaac cgcatgccgg atcgttcgtt 3927961 gaatgcccga tacgccacca cgacgtcagc caccccggtg gccactgcca tagcggcgtg 3928021 ctgcacggtc gcacatgcgg cgccaccgcc gtagtggatc ttggagaaga acgtcagctc 3928081 gccgatgccg gccgcacgcg ccacggcgat ttcggtgttg gtgtccatcg tgaacgtggt 3928141 cagcccgtcg acatcggtcg ggctcaggcc cgcatcggcc aacgcatcca acaccgcctc 3928201 ggccgccagc cgcagctcac ttcgaccgga gttcttcgaa aagtcggtgg cgccgatacc 3928261 gacgatggcc gcctgacccg ataacactac gaatccctca tcgaaagttc caccgtcgcg 3928321 gtcacgtggt cgccaagggt attgcggccc accaccttta ccgtgatcaa gccgtcgttc 3928381 accgcggtca cctcaccgga gaacgtcacc gtgtcgtagg cgtaccacgg cacccccagc 3928441 cgcagcccaa tcgacttgat cagcgccgac gggcccgccc agtcggtgac gtagcgttgc 3928501 accagcccgg tgtcggtgag gatgttgacg aaaatgtctt tcgacccctg ggcgacggcc 3928561 ttgtctcgat catgatgcac atcctggaag tccctggtag ccagcgccgt tgagacgatg 3928621 aacgtcgggt ctccgtagag cttcagctca ggcagcacag caccaacaac cgtcattcgt 3928681 caggctccca tgcgtagagg ctccagtcgg ggaaatcgat ataggtcgct cgtaccggca 3928741 taccgatcgc aacacgagca ggatcggccc cccgcagctc gcccagcatg cgtacccctt 3928801 cctcgagctc caccagcgcg atcacgaagg gcaccgtgcg acccggaact ttcggcgcgt 3928861 gatgcaccac gaagctgaac accgtgccgc gaccgctgga gacgacgtag ttgatcggca 3928921 ccgatttgtc ttgccacacc gccggcaccg gtgggtgccg caggctgcca tcggcaagcc 3928981 gctggatccg caattcgtgg gccttgactc catcccagaa aaacgcggtg tcccgcgacg 3929041 acgagggacg catcatagcg tcgggatcca aatcgtcagg caccgagctc ggagaacccg 3929101 cgggcttgaa tttgaggatg cgccaattca tctctgcgac gtcctcgtcc ccgacttgcc 3929161 atacgatgtg ctggttgatg aaccagccct cgccgagcgc ggtttgcttg ggtccgacga 3929221 cgtcaccgag ctcggcgctg atgctgactt gctccccggg caataggtag cggtggtagg 3929281 tctgctcgca gttggtggca accacaccga tgtagccggc gtcgtcgaac agcttgatga 3929341 tgggtcccag cggatcgtcc ttcggacgca ctccgcccag acccatcatg gtccacacct 3929401 gaatcatggc cggtggcgcg acgattccgg ggtggccggc ggcgcgagcc gccgcgtcgt 3929461 ccacatagat ggggttgcgg tcgccgatgg cctccaccca gttgttgatc atcggctggt 3929521 tcaccgggtc acgggccagg cgcggcttgc tgggcccggc cgccttgatc tgggcaaccg 3929581 cttcctgaat gtcgctcacc ccggtcacct gggcaccctt ggcactttga ggccagacgc 3929641 ggcgatcatc tcgcgcatga cttcgttcac acccccgccg aaggtgatca ccaggttgcg 3929701 cttggtctgg gcgtccagcc agcgcagtag ctcggcggtg tcgggttcgg cggggttgcc 3929761 gtacttgcca acgatttcct cggcgagccg gccggcacgc tgaacacgct cggtgccaaa 3929821 gactttcgtg gccgcggcat cggccatgtt gatgtcctca ccggcggacg ctacctgcca 3929881 gttgagcaac tcgttgatcc gccagatcgc acgaatctca ccaagagccc gcttgacgtc 3929941 gtcgtggtcg atcggcgtca cgccgttgcc acccggcacg gacgcccacg cgtgcacccg 3930001 gtcgtagatg ctggcgaacc gcccggccgg gccgagcatt acccgttcgt tgttgagttg 3930061 ggtggtgatc agccgccagc cgtcgttctc ctttccgacc agcatgtcga ccggcacgcg 3930121 cacgtcgttg tagtacgtgg cattggtgtg gtgggcgccg tcggccaaga tgatcggcgt 3930181 ccaggaatag ccgggatcct tggtgtcgac gattagaatg gaaatgcctt tgtgcttagc 3930241 ggcattcggg tcggtgcggc aggccagcca gatgtagtcg gcgtcgtgtg cgccggtggt 3930301 gaagaccttc tggccgttga cgatgtagtg gtcgccgtcg cgaacggcgg tggtgcgcaa 3930361 cgacgccagg tcggtgccgg cttccggctc ggtgtagccg atcgcgaagt gcgcctcacc 3930421 ggccaggatc gccggcagga acttcttctt ctgcagctcg ctgccgtgcg cctgcagcgt 3930481 ggggccgacg gtctgcagcg tcaccgcggg cagcggcacg tcggcgcgat gggcctcgtt 3930541 gacgaagatc tgctgctcga tcggaccaaa acccagaccg ccgaactctt tcggccaccc 3930601 aacaccgagc ctgccgtccc ggcccatgcg ccgtatcacc gcacggtagg ccgggccgtg 3930661 ccggtctttc tccatctccg tgcgctcgtc gggcgagatg agattcgaaa agtattgccg 3930721 tatctcggct tgcagctggc gctgctccgg cgtcaggtca atgaacatcg cgctcccagg 3930781 agctcaaggc gatgcgaggg cccgcccagc agccgggtga ggtccttgat cgtggagtag 3930841 tagcggtgca tcggatacgt gacgtccatc cccatgccgc cgtgcaggtg atggcagatt 3930901 tgcatcgccg gcggcgcctg cgatgtcacc cagtacccga ggacgcccag atcatctccc 3930961 gcatccagat cctcggccag tctccagatc accgacttgg ccaccaggtc aatggtgcgc 3931021 gaggcgatgt aaacctcggc gagctgcgcg gccacggtct ggaaggttga cagcggctta 3931081 ccgaactgct tccggttcgc cacgtagtcg gcggtcagcc gcagcgcccc ggcgaccagc 3931141 ccgtcggcgt atgcacccat gacggccagc gctagctgat tgacccggtg cgcggctaca 3931201 tccgccagga tgtcacagtc ggcaaccgcc acgccgtcca tcgtcatcac atactcgtct 3931261 gaaccattcg atgtgggcgt acgaaccatg cgcacaccgt cggccgtcgg cgacaccacc 3931321 acgacggcgt tgtcggcggt caccaacatc cagtccgcct gttcggcgta gccaacaccg 3931381 actttggtgc ccgacaaccg cccacccaca aagctagtgg caggccgatc cggcagcgcc 3931441 gccccgggct cgttgagcgc ggcggtcagt actcctccct tggccacccc ggccaggaag 3931501 cggtcctgtt gctcggcgga tgccagctcg agcagcggca ccaccccaag acccagcgtt 3931561 gccagcgccg gcgtgacggc gccgtggcga cccacctcgg tgagcagcgc gccgacttcg 3931621 aataggccca cgccgtcgcc gccgagacgt tccggcaccg gcagcgccgt cacaccaccg 3931681 cagaccagcg cctcccacga gatgtcccgc tccaacaccg acgtgaccac gtcggcgacg 3931741 gcttgctgtt ccgcagtggg atcgaaatcc attagtgagc aaccgggcat ctaccggtgt 3931801 agtcgacctg ccagtgctta atgccgttga gccagccgga ccgcagccgc tcgggcgccg 3931861 agatcggctt gaggtcgggc atgtggtcgg ctacggcgtt aaagattagg ttgatcgtca 3931921 tccgggccag attcgcaccg atgcagtaat gagcgccggt gccgccgaag ccgacgtgcg 3931981 ggttggggtt gcgcaggatg ttaaatgtga acggatcctg gaaaacctct tcgtcgaagt 3932041 tagccgaccg gtagaacatc accacccgct gacccttctt aatctgtacg ccggacaact 3932101 cgtagtcccg cagcgcggtg cgctgaaaag cggtgaccgg ggttgcccag cgcacgatct 3932161 catcggccgc ggtctccgga cgcactttct tgtacagctc ccactggtcg gggtgttcag 3932221 cgaacgccat catgccctgg gtgatggagt tgcgggtggt ctcgttaccg gccaccgcca 3932281 gcatcaccac gaagaagccg aactcgtcgt cggagagctt ctcgccgtcg atatcggctt 3932341 ggatcaactg agtcacgatg tcgtcggcgg ggttcttcgc cttctcctcg gccatcttca 3932401 tcgcatagcc gatcagctcc gccgaggacg ccttcggatc gatgtgggcg tattccggat 3932461 cctcgttgcc ggtcatctcg tttgaccagt ggaacagctt gccgcggtcc tcctgcggca 3932521 cgcccagcaa gcccgcgatc gcctgcaatg gcagctcaca ggaaacctgc tcgacaaagt 3932581 ctccagaacc cgcggcggcc gcctccgcgg cgatcttctg ggcgcgctcc tggagctcgt 3932641 catgcaggcg tccgaccgca cgtggcgtga agccgcgaga gatgatcttg cgcagccggg 3932701 tgtggtgcgg cgcgtccatg ttgagcatga cgaagcgctg aacctcgatg tcctcacgcg 3932761 cgatgtcgtt cttgaatcgc gggatcaccc cgttttcgta gctggagaac acgtcgctat 3932821 gccgcgatat ctctttgacg tcgttgagtt tggtgatcgc ccagaaaccg ccgtcgtgaa 3932881 agccgccgcc cttgccagga tcctgcccgt tccaccagat cggcgccgcg gaccgcagct 3932941 cggcgaattc ggcaaccggc agccgttcgg cgtagattgc ggggtcggtg aaatcgaacc 3933001 cgggcggcag attggggctg ggcacggtag ttctccttac tgcaatctcc actgactggt 3933061 gattccacga cactagctgt cctagtgagg accttctgcc agtaaaacat gccttcaccg 3933121 cagacaaaag gcattgaagc aaccttgctt gtcatagtaa tgaaacgtgt tctagcctgg 3933181 ccccatgggt tacccggtca tcgttgaagc cacccgcagc cccatcggca aacgcaacgg 3933241 atggctgtcg gggctgcatg ccaccgagtt gttgggcgcg gtgcaaaagg cggtggtcga 3933301 caaggccggc atccagtccg gccttcacgc cggtgacgtc gaacaggtca tcggcggttg 3933361 cgtgacccag ttcggggagc aatccaacaa catcagccgg gtggcctggc tgacggccgg 3933421 tttgcccgaa cacgtcggcg ccaccaccgt cgactgccag tgcggcagcg gccagcaggc 3933481 caaccatctg attgccgggt tgatcgcggc cggtgccatc gatgtcggca tcgcctgcgg 3933541 catcgaggcg atgagccggg tcgggctggg cgccaacgcc gggccggacc gctcgctgat 3933601 ccgcgcgcag tcatgggata tcgacctgcc gaaccagttc gaggccgccg agcggatcgc 3933661 caagcggcgc ggcatcaccc gcgaggacgt ggatgtcttc gggctcgagt cgcagcgacg 3933721 cgcgcagcgg gcctgggcgg agggccgctt tgaccgcgag atctcgccga tccaggcgcc 3933781 ggtgctcgac gagcagaatc agcccaccgg cgagcggcgc ctggtctttc gcgaccaggg 3933841 cctgcgcgag accacgatgg cggggctagg cgagctgaaa ccggtgctcg agggcggcat 3933901 ccacaccgcg ggcacgtcgt cgcagatctc cgacggcgcg gcagccgtgt tgtggatgga 3933961 cgaagccgtg gcacgtgcgc acggcctgac cccgcgggcc cggatcgtcg cccaggcact 3934021 cgtcggcgcc gagccctact accacctgga cggcccggtg cagtccaccg cgaaggtgct 3934081 ggagaaggcc ggcatgaaga tcggcgacat cgacatcgtc gagatcaacg aggcgttcgc 3934141 gtccgtggtg ctgtcctggg cgcgggtgca cgagcccgac atggaccggg tcaacgtcaa 3934201 cggcggggcg atcgcgctgg ggcatccggt gggctgcacc ggcagccggc tgatcaccac 3934261 cgccctgcac gagctcgagc gcaccgacca gagcctcgcg ctgatcacca tgtgcgccgg 3934321 cggggccctg tccaccggca ccatcatcga gcggatttaa cctagctgcg gcagggcacc 3934381 gtgcggcgtg actgcaacat gaagcgaccg atgattagat agcgaggcgg acgcgcgcct 3934441 ttggcgaccc ttggtcgcta ggatcagcgt catgccgaaa tcaccgccgc ggtttctgaa 3934501 ttcgccgctc agcgacttct ttatcaagtg gatgtcacgg attaatacct ggatgtaccg 3934561 ccgcaacgac ggggagggtc tgggcggcac cttccagaag attccggtcg cgctgctgac 3934621 caccaccggc cgcaagaccg gccagccgcg ggtcaacccg ctctacttcc tgcgcgacgg 3934681 tgggcgggtc attgtcgcgg cctccaaggg cggcgcggag aagaacccga tgtggtacct 3934741 caacctcaag gccaacccca aggttcaggt acagatcaaa aaggaagtgc tggaccttac 3934801 cgcgcgggac gcgaccgacg aggagcgcgc cgaatattgg ccacagttgg tcacgatgta 3934861 cccaagttat caggactacc agtcctggac cgaccgcacg atcccgatcg tggtttgcga 3934921 accctgaccg ttcccaactt cgccgaacgt gaagccaggg cgagaaaacg gccgaaatct 3934981 cgccctgagt tcacgctcgg cgcagataac taggccccat agaccggaac cggcggccgc 3935041 gacttggcca acaggtcgct gacgacgggc cccagctcgg ccggatccca tttcacgccc 3935101 ttgtccacct gcgggccatg cgcccagccc tcggcgaccc ggatgatgcc gccctcgacc 3935161 tcgaatacct tcccagtgac atcgcgggac tccgcactgc ccagccatac caccaagggt 3935221 gagacgttct ccggggccat cgcgtcgaac ccctcctgcg gcttggccat cacctccgcg 3935281 aacacagtct cggtcatgcg ggtgcgcgcc gccggcgcga tcgcgttgac ggtcacgccg 3935341 taccgcctca tttcggcggc gccgacgagc gtcagcgccg cgattccggc cttggcggcg 3935401 ctgtagttgc cctgccccac gctgccctgt aggcccgcgc cagagctggt gttgatgatc 3935461 cgcgcgtcaa tgtctttcgg ggctttgccc gccttggaca gtccccgcca atgggacgcg 3935521 gcgtgccgca tggtggcgaa gtggcccttg aggtgcaccg cgatgacagc gtcgaactcc 3935581 tcttcgctgg tgttggcgat catccggtcc cgcacgatgc cggcattgtt caccaggacg 3935641 tccacaccac cgtacgtctc gacggcggcc tggatcaggt tggccgcctg gtcccagtcc 3935701 gagatgtccg acccgtcggc gacggcttgg ccaccggccg caaggatctc gtcgaccacg 3935761 tcttgggctg cgctgccgcc gcttgccggc gaaccgtcca ggcccacacc gatatcgttg 3935821 accaccacgc gcgcaccctc ggccgcgaag gccaacgcat gtgcgcggcc gatgccgcca 3935881 cccgctccgg tgacgatgac cacccggccg tcgaccaagc ccatgacccc attgctcctt 3935941 tgctcgtcac ttgttggcac tcgaggcgcc caggtacggc ggcggctcac cgccgccgtg 3936001 cacctcgagc gtcgccccgc tgatatatga cgccgcatcg gacgccaaaa acgctgcagc 3936061 ccaaccaatg tcggcaggtc gtgccagccg gcccaacggc accgtggcgg cgacgcgagc 3936121 gatcgactcg gcatcaccgt agaacagttc ggaccgttcg gtttccacca tgccgaccac 3936181 cacggcgttg acccgaacct tgggtgccca ttccaccgcc agcgtggtgg tcaggttttc 3936241 caggcctgcc ttggccgcgc cataggccgc cgtgccggga gtgggacggc gaccgctgac 3936301 gctacagatg tttacgatcg acccaccgtt gggctgcgct tgcatcagca cgttggcgtg 3936361 ctgggaaacc agcagcggtg caagcacatt gagctcgacg atctttcggt ggaagttgtg 3936421 tgtcgcctcg gcggccagcg cgtatggcga gccgcccgcg ttgttgacca gcatgtcgag 3936481 tcggccgtgc cgctccccga tctcaccgac caggcgcttg accgagtcct cgtcccggat 3936541 gtcgcagcgg tggaactcat acggttggcc gtcgaccgct cgtcgcgcgc aggtgatcac 3936601 ggtcgcgccc tgttcggcga ataccgagct gatgcccgcg cctaccccgc ggacaccgcc 3936661 ggtgaccaaa accacccgcc cggccagccc gaaattgatg gcgtcggctg cctcggcgag 3936721 agtcactgtg ctagcgtacc aagcaagtgc ttgcttaggt agcgaacccg caggagtgca 3936781 atgccgatca cctccaccac gcccgaaccg ggcatcgtcg cggtcaccgt cgactacccg 3936841 ccggtcaacg ccatcccgtc gaaagcgtgg ttcgacctgg ccgacgcggt gacggccgcg 3936901 ggcgccaact ccgacacccg cgcggtgatc ctgcgggccg aggggcgcgg cttcaacgcc 3936961 ggggtggaca tcaaagagat gcaacgaacc gaaggtttca cggcgctgat cgacgccaac 3937021 cgcggctgct tcgccgcatt ccgcgccgtc tacgagtgcg cggtgccggt gatcgccgcc 3937081 gtgaacggat tctgcgtggg cggcggcatc ggcctggtcg gcaactccga cgtcatcgtg 3937141 gcctccgagg acgccacctt cggcctgccc gaggtggaac ggggcgcgct gggcgcggcc 3937201 acgcacctct cgcggctggt gccccagcac ctgatgcgac ggctgttctt tacggcggcc 3937261 accgtggacg cggccacctt gcagcacttc ggctcggtgc acgaggtggt gtcccgcgat 3937321 cagctggacg aggccgcttt gcgggtggcc cgcgacatcg ccgccaaaga cacccgggtc 3937381 atccgcgccg ccaaggaggc gctgaacttc atcgacgtgc aacgggtcaa tgcgagttac 3937441 cggatggagc aaggttttac cttcgagctc aacctcgccg gagtcgccga cgagcaccgc 3937501 gacgcctttg tgaagaagtc atagtgcccg ataaacgaac ctctcttgac gacgccgtcg 3937561 cgcaattgcg cagcggcatg accatcggca tcgccggctg gggctcgcgg cgcaagccca 3937621 tggcgttcgt gcgggccatc ctgcgctcgg atgtcaccga tttgacggtg gtcacctacg 3937681 gcgggccgga cctggggctg ctgtgctcgg cgggcaaggt caagcgggtc tactacgggt 3937741 tcgtctcgct ggactcgccg ccgttctacg acccgtggtt cgcgcacgcc cgcaccagcg 3937801 gcgcgatcga ggcccgggag atggacgagg gcatgctgcg ctgcggtttg caggccgcgg 3937861 cacaacggct gccgttcctg cctattcgcg ccgggctggg cagctcggta ccacagttct 3937921 gggcaggcga gctgcagacg gtcacgtcgc cgtatccggc gcctggcggc gggtacgaga 3937981 cactgatcgc catgccggca ctgcgcctgg atgccgcctt cgcccacttg aatctcggtg 3938041 acagccacgg caatgcggcc tacaccggca tcgaccccta cttcgacgat ctcttcttga 3938101 tggccgccga gcggcgcttt ctgtcggtgg agcgcatcgt cgccaccgag gaactggtca 3938161 aatcggtgcc gccgcaggcg ctgttggtca accggatgat ggtcgacgcc atcgtggaag 3938221 cacccggcgg cgcccacttc accaccgccg caccggacta cgggcgcgac gagcagttcc 3938281 agcggcacta cgccgaagcg gcgtcgacac aggtgggttg gcagcagttc gtgcacacct 3938341 acctatccgg caccgaagcg gactaccagg ccgcggtgca caactttgga gcatcacggt 3938401 gagcacccga gccgaagtgt gtgccgtcgc ctgcgccgag ttgttccgcg atgcaggcga 3938461 aatcatgatc agccccatga ccaacatggc ctcggtaggg gcgcggctgg cgcggctcac 3938521 cttcgcgccg gacattctgc tgaccgacgg cgaggctcag ctgctcgcgg acacaccggc 3938581 attgggcaag acgggcgccc caaacaggat tgaggggtgg atgccgttcg gccgggtttt 3938641 cgaaaccctg gcctgggggc gccggcacgt ggtgatgggc gccaatcagg tcgaccgcta 3938701 tggcaatcag aacatctcgg cgttcgggcc gctgcagcgg ccgacccggc agatgttcgg 3938761 cgtccgcggc tcgccgggca acaccatcaa ccacgccacc agttactggg tgggcaacca 3938821 ctgcaagcgg gtctttgtcg aggccgtcga tgtggtctcc ggcatcggct acgacaaggt 3938881 ggatccggac aatccggcct tccggttcgt caacgtctac cgggtggtgt ccaacctagg 3938941 cgtgttcgac ttcggcggcc ccgaccactc catgcgggcg gtatccctac accccggggt 3939001 gacgcccggc gacgtccgcg acgccacctc gttcgaggtg catgacctcg acgcggccga 3939061 gcagaccagg ctgcccaccg acgacgaact gcacctgatc cgcgcggtaa tcgatccgaa 3939121 gtcgttgcgg gacagggaga tacgatcatg attgttccgc ctcctctccc ccgcaagcgg 3939181 gaggtgcgcc cacatcgctt cgtcccctgc aagcgggtgg tacccccact gcattgtcgg 3939241 cggtggctat gaggctgcgt acgccgctga ccgagctcat cggcatcgag cacccggtgg 3939301 tgcagaccgg gatgggctgg gtggccggtg cccggctggt gtcggccacc gccaacgcgg 3939361 gcgggctggg catcttggcc tcggccacca tgacgctgga cgagctggcg gcggcgatca 3939421 caaaggtcaa ggccgtcacc gacaagccat tcggggtgaa catccgcgcc gacgcagccg 3939481 acgcgggcga ccgcgtcgag ttgatgatcc gcgagggggt gcgggtggcc tcgttcgcgt 3939541 tggcacccaa acagcagctg atcgcccggc tcaaagaagc cggcgcggtg gtcataccgt 3939601 cgatcggcgc ggccaaacat gcgcgcaagg tggcggcctg gggcgccgac gcgatgatcg 3939661 tgcagggcgg cgagggcggc ggccacaccg ggccggtcgc caccacgctg ctgttgccgt 3939721 cggtgctgga cgccgtggcg ggcaccggca tcccggtgat cgccgccggc ggcttcttcg 3939781 acgggcgcgg gctagccgcg gcgttgtgct acggcgccgc cggggtggcc atgggcaccc 3939841 ggtttctgct cacctcggat tccaccgtgc ccgacgcggt caaacggcgt tacctgcagg 3939901 ccggcttgga cggcaccgtg gtcaccaccc gcgtcgacgg gatgccgcac cgggtgctgc 3939961 gcaccgagct ggtcgagaag ctggaaagcg gctcgcgggc acgaggtttc gcggccgcgc 3940021 tgcgcaatgc cggcaagttt agacggatgt cgcagatgac ctggcggtcg atgatccgag 3940081 acggcctgac catgcgccac ggcaaggaat tgacctggtc acaggtgctg atggcggcaa 3940141 acaccccgat gctgctcaaa gccggcctgg tcgacggcaa caccgaggcc ggggtgctgg 3940201 catcgggcca ggtagcgggc attcttgacg acctaccgtc gtgcaaagag ctgatcgagt 3940261 cgatcgtgct tgacgccatc acacatttac aaaccgcatc tgcgctggtg gagtgactga 3940321 cgcgtgtcaa gcagagtacg ctatcgcagc tatgtcgacc gtcgagatgg accaggcggc 3940381 tccagagtcc gccgcgcacc accctctgcc ggaccccggt gagtcggtcc ccagactcgc 3940441 gctgcccacg atcgggatct tcctggccac gctcaccgcg ttcgtcggtt ctacgaccgc 3940501 ttacatcagc ggatggatcc cgttctgggt gacgatcccc gtcaacgccg cggtcacgtt 3940561 cgtgatgttc accgtcgtgc atgacgcatc gcattacgcg atcagctcca tccggtgggt 3940621 gaacgggctg ttcgggcggc tggcgtggct tttcgtcggg ccggtggtcg cgttcccggc 3940681 cttcgggtac atccacatcc agcaccaccg ccattccaac gacgacgagc aagacccgga 3940741 caccttcgcc tcacacggct cgctgtgggt gctgccgttg cgctggtcga tggtcgagta 3940801 cttctacatc aagtactacc tgcctcgcgg ccgcagccgg ccggtcatcg aggtcgccga 3940861 gacgctggtg atgatgaccc tgttcctgac cggcctgatc gtcgccatcg tcaccggcaa 3940921 cttctggacg ctggcgatcg tcttcctgat cccgcaacgt atcggcctta ccgtgctggc 3940981 ctggtggttc gactggctgc cccaccacgg tctggaggac acccagcgca gcaaccgcta 3941041 ccgcgcgacc cgcaaccggg ttggcgccga gtggctgttc accccggtgc tgctgtcgca 3941101 gaactaccac ttggtgcacc acctgcaccc gtcggtgccg ttctaccggt acctgcgcac 3941161 ctggcggcgc aacgaggagg cgtatctgga acgcaacgcc gcgatctcca cggtctttgg 3941221 ccagcaactg aatccggacg agtaccggca gtggaaggag ctcaacggcc ggctcgcgcg 3941281 actgctgccg gtgcggatgc cggcccgctc cagctcgccg cacgcggtgc tgcaccgcat 3941341 cccggtcgcg tcggtggatc ccatcaccgc cgatgccacc ctggtgactt tcgcggtgcc 3941401 ggaagcattg cgggacgcgt tccgattcga gccgggccag cacgtgacgg tgcgcaccga 3941461 cctgggcggc caaggcatcc ggcgcaacta ctcgatctgc gccccggcca cccgcgccca 3941521 gctgcgcatc gccgtcaaac acattcccgg cggggcgttt tcgacgttcg tggccaacga 3941581 actgaaggcc ggcgacgtgc tcgagctgat gacaccgacc ggccggttcg gcaccccgct 3941641 ggatccgttg caccgcaagc actatgtggg cctggtggcc ggcagcggga tcaccccggt 3941701 gctgtccatc ctggcgacca cgctggagat cgagaccgaa agccgattca cgctgatcta 3941761 cggcaaccgc accaaggaat cgacgatgtt tcgggccgag ctggatcgtc tggagtcgcg 3941821 ctatgccgac cggctggaaa tcctgcacgt gctctccagc gagccgctgc acaccccgga 3941881 gctgcgcggg cgcatcgacc gagacaaact caccaggtgg ctgacgagta ccctgcggcc 3941941 ggccggtgtg gacgaatggt tcatctgcgg cccgctcgcc atggccaccg cggtgcgcga 3942001 gaccctgatc gagcacggcg tggactccga gcgcattcac ctggagttgt tctacgggtt 3942061 cgacacgccc ccggcgaccc gtccctccta tgcgggagcc accgtcacct tcacgctgtc 3942121 cgggcagcgg gcgatattcg atctggtgcc cggcgactcg attctggaag gggcgctggg 3942181 gctgcgcagc gatgcgccgt atgcgtgcat gggcggcgca tgcggcacct gccgagccaa 3942241 actgatcgag ggcaacgtcg agatggacca caacttcgcc ctccggaagg cggagctgga 3942301 tgccggctac atcctgacct gccagtcaca cccgacgaca ccattcgtcg ccgtcgacta 3942361 cgacgcctag gttcgtggcg ccgccccata cttgcgccga ctgtgaatct gacgacgcga 3942421 cacgccgatt cgccgtcgtg tggttcactc tcggcgctca tgggcgccat cccgccgccc 3942481 gcatcgcggc atcgacgcgg ccaacgaacg tgccccggcg gtaccggagc agctcactgg 3942541 tgaccctgat gatcgtccag cccagatcca gcaacgcggt ggaccgctcg atgtcccgag 3942601 cccgctgcgc cgggtctgtc caatgctgtg gcccgtcata ctcgacaccg actcgcaatt 3942661 gctcgtagcc caggtcgatg cgggcgacga agtccccgta gtcgtcaaac actctgatct 3942721 gtgtttgcgg cttcggcaga ccggcatcga tcaacaccaa tcgggtccac gtctcctgtg 3942781 gggattccgc acccccgtcg atcagcggca gcaccgcacg gaggcggacc aggccgcgcg 3942841 caccggtatg ttcggcaatg acggcctgca cgtcggcgac cttgacatcg gtcgaattcg 3942901 ccaacgcgtc cagccgttga acggcctgca gccgcgaggg tgtgcgccgc ccgatatcga 3942961 aggcggtgcg cgccggggtg gttaccgcga caccgtcaac cgcaaccgtc tcgtgcggcg 3943021 ccaatcgatc cgtgtgcacg acgatgcgcg gcggaggctt tcgattggcg tgcactaact 3943081 ctgcgtcaag cgctgggttt acccacttcg cgccaagcag cgccgccgcc gaattgccgg 3943141 ccacgacggc gcggcgccgc gaccacagcc acgccgcgtg ggcgcgctgg cgcgccgtca 3943201 gctccacacc ggccggggcg tagacgcccg ggtagactgg ctcgtagagc tgtctcatgg 3943261 cccgctccgg aatggccttt gcggccaaca cttccgagcc caggacgggc catggaagtt 3943321 cgtccatggc cacatcctgg catcacccac cgacaccccg ccgacagtga atcgcacgac 3943381 gcgacacgcc gacgacccgt cgtgagattc accctcggcg ccaacgaagg cctacagccg 3943441 ctcgataatg gtgacgttgg cggtgccgcc gccctcgcac atggtctgca gcccgtagcg 3943501 gccaccgatg cgctccagct cgcccagcat ggtggtgaac agtttggcgc cggtggcgcc 3943561 tagcggatgc cccagcgcga tcgcgccgcc gttggggttg accttcgccg ggtcggcctt 3943621 gatttccttg agccaggcca taactaccgg cgcgaacgcc tcgttgatct cgacggtgtc 3943681 gatgtcgtcg atggcaagcc cggtcttgtc cagcgcgtac cgggtggcgg ggatgggtcc 3943741 ggtcagcatg aataccgggt cggcggcgcg cgcactgatg tggtggatgc gggcacgggg 3943801 cctaagtcca tggtctttga cggcccgctc ggaggccagc aacactgcac tggcgccgtc 3943861 ggagatctga ctggccatcg ccgccgtcag ccggccgccc tcgaccagcg gctgcaagcc 3943921 ggccatcttc tccagcgacg actcccgcgg gccctcatca acccggaacg gcccggattc 3943981 ggtttccaca gtgatgattt cgttttcgaa gtggccggcg cggatcgccg cgaacgcgcg 3944041 ttcgtggctg gtcagcgagt accgctccat ctcttcacgg gacaggttcc acttctcggc 3944101 gatcagctcc gagccacgga actgtgaaat ctcctggtcg ccataccggt gtaaccattg 3944161 cttggattcg ttggtcggcg aggtgaaccc gaactgttcg cccacggtca tcgccgacga 3944221 gatcgggatc tggctcatgt tctgcacgcc gccggccacg atgacatccg ccgtcccgga 3944281 catgatcgcc tgcgcgccaa aggaaatcgc ctgctggctg gatccgcact ggcggtccac 3944341 ggtgacaccg gggacctctt cgggatagcc ggcggccagc cacgacagtc gggcgatgtt 3944401 gcccgcctgt ccgccgatgg cgtcgacaca tccggcgatc acgtcgtcga cggcggcggg 3944461 gtcgatgtcg gtccggtcca gcagtccgcg ccaggccagg gcacccaggt cgacgggatg 3944521 gataccggcc agtgcgccgc cccgcttgcc gaccgcggtc cgtacggcgt cgatgacgta 3944581 cgcctctgtc ataaccgctc ctctcccgtt gccagtgagt ggtaccccca ccgcatcgtc 3944641 gtcgacacgg ggcatttcag actccctctt tggtgatccc gccaagcacg atggctaggt 3944701 attgctggcc cacctgctgg gcggtgagcg gcccaccggg tcgataccag cgcaccgaca 3944761 cccaggtggt gtcacggatg aatcggtaga ccaggtcgac gtctaggtcg ggccggaagt 3944821 agccctcttc gatgccctgg ttgagcacgt ccacccacat cttgcgctgc tgcttgttac 3944881 ggtcctcgat gtaggaaaac ctgggttgcg acgccagccg ttgcgcttca tcctggtaga 3944941 tcaccacttg cgcgtgatga tgctcgatcg cctcaaacga cgccatgaac aggccctgca 3945001 gccgctccag cggattggcc gtgctatcca cgatgtcgcg gtaacgggcg aagagccaat 3945061 cgaggaaacc gcgtaacagc tcatcgacca tctcctcttt ggaggcgaaa tggtgataca 3945121 ggctgccgga taggatgccg gcgccgtcgg cgatatcgcg cacggtggtg gcgcgcagtc 3945181 cgcgctcggc gaacatcgcc gccgcgagct ccagcaactc gcctcgccgg ctattgacct 3945241 gaccggccac tcgatccatc cgaccagact atcaaccaag cgcttgctcg gccagctgcg 3945301 acctcgatgg ggtgggaatc cgggaattcg gtacgaggga tgcgcccttc gctcaccggg 3945361 gcattagatg cgacgttgct ggcgctggat ggacgccttg cccgcacagc ccggcccagg 3945421 tgcaggatcg aggggcttgg tacctgatca cgggagacat ctggggtatc ggcggagagt 3945481 gcctagcgtt ctgggcattc tggcggattg cgcatattct tccgcgcgtc gtcatagcct 3945541 aatcggacta cgcggatcgt gccgatcacc ctggtgcggc ggcggcgcca gtaacgagga 3945601 ggtcaacatg gctcattttt cggtgttgcc gccggagatc aactcgttgc ggatgtacct 3945661 gggtgccggt tcggcgccga tgcttcaggc ggcggcggcc tgggacgggc tggccgcgga 3945721 gttgggaacc gccgcgtcgt cgttctcctc ggtgaccacg gggttaaccg ggcaggcgtg 3945781 gcagggcccg gcgtcggcgg cgatggccgc cgcggcggcg ccgtatgcgg gctttttgac 3945841 cacagcctcg gctcaagccc agctggctgc cgggcaggct aaggcggtgg ccagcgtgtt 3945901 cgaggccgcc aaggccgcga tcgtgcctcc ggccgcggtg gcggccaacc gtgaggcgtt 3945961 cttggcgttg attcggtcga attggctggg gctcaacgcg ccgtggatcg ccgccgttga 3946021 aagcctttac gaggaatact gggccgctga tgtggcggcg atgaccggct atcacgccgg 3946081 ggcctcgcag gccgccgcgc agttgccgtt gccggccggc ctgcaacagt tcctcaacac 3946141 cctgcccaat ctgggcatcg gcaaccaggg caacgccaac ctcggcggcg gcaacaccgg 3946201 cagcggcaac atcggcaacg gaaacaaagg cagctccaac ctcggcggcg gcaacatcgg 3946261 caataacaac atcggcagcg gcaaccgagg cagcgacaac ttcggcgccg gcaacgtcgg 3946321 caccggaaac atcggcttcg gcaaccaggg ccccatagac gttaacctct tggcgacgcc 3946381 gggccagaac aacgtgggcc tgggcaacat cggcaacaac aacatgggct tcggcaacac 3946441 cggcgacgcc aacaccggcg gcggcaacac cggcaacggc aacatcggtg gcggcaacac 3946501 cggcaacaac aacttcggct tcggcaacac cggcaacaac aacatcggaa tcgggctcac 3946561 cggcaacaat cagatgggca tcaacctggc cgggctgctg aactccggca gcggcaatat 3946621 cggcatcggc aactccggca ccaacaacat cggcttgttc aactccggca gcggcaacat 3946681 cggcgtcttc aacaccggag ccaataccct ggtgcctggc gacctcaaca acctgggcgt 3946741 cgggaattcc ggcaacgcca acatcggctt cgggaacgcg ggcgttctca acaccggctt 3946801 cgggaacgcg agcatcctca acaccggctt ggggaacgcg ggtgaattaa acaccggctt 3946861 cggaaacgcg ggcttcgtca acacggggtt tgacaactcc ggcaacgtca acaccggcaa 3946921 tgggaactcg ggcaacatca acaccggctc gtggaatgcg ggcaatgtga acaccggttt 3946981 cgggatcatt accgacagcg gcctgaccaa ctcgggcttc ggcaacaccg gcaccgacgt 3947041 ctcgggcttc ttcaacaccc ccaccggccc cttagccgtc gacgtctccg ggttcttcaa 3947101 cacggccagc gggggcactg tcatcaacgg ccagacctcg ggcattggca acatcggcgt 3947161 cccgggcacc ctctttggct ccgtccggag cggcttgaac acgggcctgt ttaacatggg 3947221 caccgccata tcggggttgt tcaacctgcg ccagctgttg gggtagcgcg acactcacgg 3947281 gtgctggcag gataccgaaa tcacctcacc agtcaggtaa ctcgagtagt cgctggccag 3947341 aaacgcgatg gtggccgcca cctcccaggg ctcggcggcc cggccgaacg cctcgccggc 3947401 caccagccgg tccagcagct cggccgaggc ggtcttgtcc aggaacttgt gccgggcgat 3947461 gctgggcgag acggcgttga tccgcacccc atactcggcg gcttcgattg cgctgcaccg 3947521 ggtcaacgcc atcaccccgg ccttggcggc ggcatagtgc gactgcgaat gctgggcccg 3947581 ccagcccagc acgctggcgt tgttgacgat caccccgcca tgcggcgcgt cgcggaagta 3947641 gcgcaatgcg gcccgggtgg cccggaacac cgacgtcagg ctcacgtcta acacgcggtc 3947701 ccactcgtcg tcggtcatgt cggccaccgg cgtctgcccg cccagcccgg cgttgttgac 3947761 cagcacgtcg agccggccca tccgggcggt ggtcgagtcg atcagcgcgt cgacctgggc 3947821 ggtggacgtc acgtcgcaca ccacatgctc cacccggccc agccccagcg cagacaactc 3947881 ggcggccgtc tcccccagcc gtcgttcatg gtggtccgag atcaccacgt cggcgccctc 3947941 cgccaaggct cgccgcgcgg tggccgaacc gatgccggtg cccgcagccg ccgtcacgac 3948001 gaccaccttg ccatccagaa gtccatgtcc ggcaatctct ttcggcgcta cggacaggtt 3948061 catcccttgg cctcccgggg cagaccgagc acccgctcgg cgatgatgtt gcgctggatc 3948121 tcgttggatc ctccgtagat ggtgtcggcg cgggtgaata gatatagccg ctgccactcg 3948181 tcgaactcgc cgtcgggcat ggtcattccg ggtttaccga tcacgtccat ggccagctca 3948241 cccaggttgc gatgccagtt ggcccacaac aactttgaca cattgtcctg gccgggctgc 3948301 tcaacggctg gcccttccat ggtggccaaa gcataggagc gcatggcgcg cagcccggtc 3948361 cacgcccggg tcagccgctc ccggatcagc gggtcatccg cggcggcggt gcgccgcgcc 3948421 agctcgacca gattggaaag ctcacgggcg tagacgatct gctgacccag cgtcgagacg 3948481 ccgcgctcga aggtcagcgt cgccatcgcg acccgccagc cgtcgcccgg tgcgccgacc 3948541 accaggtcgg cgtcggtgcg ggcgtcgtcg aagaacacct cgttgaactc cgcggtgccg 3948601 gtgatctgca cgatcggccg gatctgcacg ccgggctggt ccagcggcac cagcagatac 3948661 gacaggccgg cgtggcgctg cgagcccttc tcggtgcgtg cgagcacaaa gcaccattgc 3948721 gacaggtgcg ccagcgacgt ccacaccttc tggccgttta tcacccactg gtcgccgtcg 3948781 agttctgcgg tggtcgcaac gctggccagg tcgctgccag cgccgggctc cgaatatccc 3948841 tgacaccaca gctcggtgac gtcgcggatg cgcggcagga agcgccgctg ctgctgcggc 3948901 gttccgaacg cgatcagcgt cggacccagc agttcctcgc cgaagtggtt gaccttgtcc 3948961 ggcgcgtcgg cgcgggcgta ttcctcgtag aacgccaccc ggtgcgcggt cgagagcccc 3949021 cgcccgccgt gttcttccgg ccagcccagg caggtcagcc ccgcggcggc caggcgctga 3949081 ttccacgccc ggcgttcctc gaacgcttcg tgctcgcgcc ccggcccgcc gaggccctta 3949141 agtgccgcga attcgccggc cagattgtcg gcgagccaac cgcggacctg cgcccggaac 3949201 tcctcgacgt cctgcatgcc ctgtaggcta acctaccaag cacttgcttt gttaggagcg 3949261 tccgttgata aacgatctgc gcaccgtgcc cgcggcgctg gatcgtctcg tgcgccagct 3949321 acccgaccac acggcgttga tcgccgagga ccggcgtttc acgtcgaccg agctgcgcga 3949381 cgcggtctac ggcgccgcgg cggcgctgat cgccctcggt gtcgaacccg cagaccgggt 3949441 ggccatctgg tcgccgaaca cctggcactg ggtggtggcc tgcctggcga tccaccacgc 3949501 cggcgccgcg gtggtgccgt tgaacacccg ctacaccgcc acagaagcca ccgacatctt 3949561 ggaccgagcc ggcgcgccgg tgctgttcgc ggcgggcctc ttcctgggcg ccgaccgggc 3949621 ggccggcctg gaccgggccg cgctgcccgc gttgcggcac gtcgtgcggg tgccggtcga 3949681 agccgacgac gggacctggg acgagttcat cgccacgggt gccggggccc tggatgccgt 3949741 cgcagcccgt gccgccgccg tcgcacccca ggacgtcagc gacatcctgt tcacctccgg 3949801 caccaccggc cgcagcaaag gcgtgctgtg cgcgcaccgg cagtcgctgt cggcctcggc 3949861 atcctgggcc gccaacggga agatcaccag cgacgaccgc tacctgtgca tcaacccgtt 3949921 cttccacaac ttcggctaca aggccggcat cctggcctgc ctgcagaccg gtgccacgct 3949981 gatcccgcac gtgacgttcg atccgctgca cgcgctgcgg gccatcgagc gccaccgcat 3950041 caccgtgttg ccgggccctc cgaccatcta ccagagcctg ctggatcacc cggcccgcaa 3950101 agacttcgac ctgagctcgc tgcggttcgc ggtcaccggt gcggccaccg tgccggtggt 3950161 gctggtggag cgcatgcagt ccgaacttga catcgacatc gtgctgaccg cctacgggtt 3950221 gaccgaggcc aacgggatgg ggacgatgtg ccgccccgag gacgacgcgg tgaccgttgc 3950281 gacgacgtgc gggcggccgt tcgccgactt tgagttgcgc attgcggacg acggggaagt 3950341 gttgctgcgc gggccgaacg tcatggtggg ctatctggac gacacggagg cgaccgcggc 3950401 cgccatcgac gccgacggct ggctgcacac cggcgacatc ggtgccgtcg accaggcggg 3950461 caacctgcgc atcaacgacc gcctgaagga catgtacatc tgcggcggat tcaacgtcta 3950521 tcccgccgag gtcgagcagg tgctggcccg gatggacggc gtcgcggacg ccgcggtgat 3950581 cggcgttccc gaccagcggc tgggcgaggt cggccgggcg ttcgtggtgg cgcgccccgg 3950641 cacgggcctc gacgaggcat cggtgatcgc ttacacccgt gaacatttgg cgaacttcaa 3950701 gacaccccgg tcggtgcggt tcgtcgacgt actgccgcgc aacgccgccg gtaaggtgag 3950761 caaaccacaa ctgcgagagc tgggctagat ggacctgaat ttcgacgacg agaccctggc 3950821 ctttcaggcc gaggtgcgcg agttcctcgc cgccaatgcc gcatcgatcc cgacgaagtc 3950881 ctacgacaat gcggaaggct ttgcgcaaca ccgttattgg gaccgagtac tgttcgacgc 3950941 gggcctgtcg gtgatcacct ggccggctaa gtatggtggc cgggacgcgc cgctgctgca 3951001 ctggatcgtg ttcgaggagg agtactttcg cgccggcgcc ccgggccggg ccagcgccaa 3951061 cggcacctcg atgctggcgc cgacgctgtt cgcgcacggc acagccgaac agcttgaccg 3951121 gatcctgccg aaaatggcta gcggcgaaca gatctgggcg caggcctggt cggagccgga 3951181 atccggcagc gacctggcgt cgctgcgctc caccgcgagc aaggtcgacg gcggctggct 3951241 actcaacggg cagaagatct ggagctcgcg ggcgccgttc gccgacatgg gttttgggct 3951301 gttccgctcc gatcccgcgg tcgaacggca ccgcgggctc acgtatttca tgttcgacct 3951361 gaaagccaag ggtgttaccg tgcgcccaat cgcccaactg ggcggcgaca ccggtttcgg 3951421 tgagatcttt ctcgacgacg tgttcgtccc cgaccgggat gtgattgggg caccgaacga 3951481 cggatggcgc gcggccatga gcacgtcaag caacgagcgc ggcatgtcgc tgcgcagccc 3951541 agcccgcttc ctggcctccg ccgaacggct ggtccagctg tggaaggacc gcggctcgcc 3951601 cccggagttc gccgaccggg tcgccgacgc ctggatcaag gcgcaggcct accggctgca 3951661 gaccttcggc acggtgacca ggctggccgc cggtggcgaa ctgggggcgg aatcgtcggt 3951721 gaccaaggtg ttctggtccg agctggacgt gcacttgcat cagaccgcgc tcgacctgcg 3951781 cggcgccgat ggggagctgg ccggcccgtg gaccgagggg ttgctgttcg ccctgggcgg 3951841 cccgatctat gccgggacca acgaaatcca gcgcaacatc attgccgaac ggctgctggg 3951901 cctgccacgc gagaagacgt gaccatggaa ttcgcactca acgaacagca gcgcgacttc 3951961 gcggccagca tcgacgcggc gctcggcgcc gccgacctgc ccggcgtcgt ccgtgcttgg 3952021 gctgccggtg atgtggcgcc cggccgcaag gtgtggcagc agttggccaa cctgggcgtc 3952081 accgcgttgg gcgtagcgga gaagttcgac ggactgggtg ccagtccggt cgatctggtt 3952141 gtcgcgctcg aacgtctcgg gcgctggtgc gtgcccggcc cggtcaccga atccattgcc 3952201 gtggcaccga ttctgctggc tcatgatgat cgggctgaac gcagccatgg gctagcttcc 3952261 ggtgagctca tcgccaccgt ggccatgccg ccgcgggttc cgcgcgccgt cgacgccgac 3952321 accgccgggc tggtactgct cgcgggcgat ggcagcgtca ccgaagggac gccgggtgat 3952381 tgccaccggt ccgtcgaccc cagccggcgg ctgtatgagg tggcggcatc cggccaggcc 3952441 tggcgggccc cgaaagacgt agtggcgcgc gcctatgagt tcggggcgct ggccaccgcc 3952501 gcacaactgg tcggcgccgg gcaggcgctg ctggaggccg ccgtcaacta cgccaaacag 3952561 cgcacgcagt tcggccgggc gatcggctcg tatcaggcca tcaagcacaa actcgccgac 3952621 gtgcacattg cgatcgagct ggcctgcccc ctggtttacg gcgcggccgt gtcactcgag 3952681 ccgcgcgatg tcagcgccgc caaagccgcc gcgagcgagg cggctctgct ggcggcacgc 3952741 tcggcgttgc agacccacgg cgccatcggg ttcacctgcg agcatgacct gtcgctgtgg 3952801 ttgttgcggg tgcaggcgtt gcactcggcc tggggtacgc cgcaggagca tcggcggcgt 3952861 gtgctggagg cgctatgacc ccccctgaag aacggcagat gctacgggaa accgtcgcct 3952921 ccctggtggc taagcatgcc ggcccggcgg cggtgcgcgc agcgatggcc tccgaccgcg 3952981 gctacgacga atcgctgtgg cggctgctat gtgagcaggt cggtgccgcc gcgctggtca 3953041 ttccggagga gctgggcggc gcgggcggtg aactcgccga tgccgcgatc gtcgtgcagg 3953101 agctgggccg ggcgctggtg ccttctccgc tgctgggcac cacgctggcg gagctggcgc 3953161 tgctggccgc agctaagccg gatgcgcaag cactcacgga gcttgcccaa ggcagcgcga 3953221 tcggcgcgct ggtgctggac cccgactacg tggtcaacgg cgacatcgcc gatatcgtcg 3953281 tcgccgccac cagcgggcag ctgaccaggt ggactcgctt tagcgcgcag cccgtcgcca 3953341 ccatggaccc cactcgccgg ctggcccgcc tgcaatccga agagaccgag ccgctgtgcc 3953401 ccgatcccgg aatcgccgac accgcagcaa tcctgttggc ggccgagcag atcggcgccg 3953461 ccgaacgctg cctgcagctg accgtcgaat acgccaagag ccgagtgcaa ttcggccgcc 3953521 cgatcggcag tttccaggcc ctcaagcatc ggatggccga cctgtatgtg accatcgccg 3953581 cggcccgggc cgtcgtcgcc gacgcctgcc acgcgcccac acccaccaac gccgccaccg 3953641 cgcggctggc cgccagcgag gcgttgagca ccgcggcggc cgagggcatc caactgcacg 3953701 gcggcatcgc gatcacctgg gaacacgaca tgcacctgta tttcaaacga gcgcacggca 3953761 gtgcacaatt gctcgagtcg ccacgagagg tgctgcgccg tttggaatct gaggtgtggg 3953821 agtcgccgtg acggatcgtg tcgccctgcg tgccggcgtt cccccgttct acgtgatgga 3953881 cgtctggttg gcggccgcgg agcgccagcg cacccatggg gatctggtga atctttcggc 3953941 gggccaaccc agtgcgggcg ctccggaacc ggtgcgtgcg gccgcggccg ccgccctgca 3954001 tctcaaccag ttgggatact cggtggcgct gggtattccg gagctgcgcg acgctatcgc 3954061 cgcggattac caacgccggc atggcatcac cgtcgaaccc gatgcggtgg tgatcaccac 3954121 gggctcctcg ggcggctttc tgctcgcgtt tctggcgtgc ttcgacgccg gtgatcgggt 3954181 cgcgatggcc agtcccggct acccgtgcta ccggaatatc ctgtcagcgc tgggatgtga 3954241 ggtcgtggag atcccgtgcg gaccgcagac ccgattccaa ccgaccgcgc agatgctggc 3954301 cgagatcgac ccaccgctgc gcggtgtcgt cgtcgccagc ccggccaacc cgaccggaac 3954361 cgtcatcccg cccgaagaac tggcggccat cgcgtcgtgg tgtgacgcat cggatgtccg 3954421 gttgatcagt gatgaggtct accacggcct ggtgtaccag ggggcaccgc aaaccagctg 3954481 cgcctggcag acgtcgcgaa acgcggtggt agtcaacagc ttttccaagt attacgcgat 3954541 gacgggctgg cggctgggct ggctgctggt gccgacggtg ctgcgccgcg cggtggactg 3954601 cctgaccggc aacttcacca tctgcccgcc ggtcttgtcg cagatcgccg cggtgtccgc 3954661 gttcaccccg gaggcgaccg ccgaggccga cggcaacctg gccagctacg cgatcaaccg 3954721 ctcgctgttg ctggacggtc tgcgtcgcat cggcatcgac cggctggcac ccaccgacgg 3954781 cgcattctac gtctacgccg acgtctcgga cttcaccagc gattcgctgg ccttctgctc 3954841 aaagttgctg gccgacaccg gtgttgcgat cgcacccgga atcgatttcg acaccgcacg 3954901 ggggggttcg tttgttcgga tatcgtttgc cgggccaagc ggcgacatcg aagaagcctt 3954961 acggcgcatc ggctcctggc tgccgagcca atagctcgtc gatgcgcgtc tcgagcgcgc 3955021 cgcgctcgcc gatatctgcc acgttgatcc cgaaccgttc gctcagggtg tcgacaaccg 3955081 ctgccgcatc ggcaaggcgg atcttctcgg taccaccggc acggtgaacg gcaaggtcgc 3955141 ggccagatag gttccaccgg gcgtcgtcgg tgatcaccgc ggcggtcagt cccgtgacga 3955201 acttcgatgc cgggtgtgtt gaggcgtacc agctggccac tttcagatcg atctgcgggc 3955261 gggtctgggt ggtgaattcg tacagtgtct gccatgtgtc ccggaccatc gcctgcaaga 3955321 caaagccgtc gacgcggtcc tcgagccgat aaggttcgtg cgttgtcggc tggacggcgc 3955381 cggtttcgag gcgaagcggt gaggtcggtg tttggccgcc gaatccgacg tcgacgagat 3955441 agcatccgcc cgagccgggg aacgtgaccc ccagcagggt gtgcgtctgc ggcggcaggg 3955501 gcgcgtccgg cgcgagcttc cagacgacgc gggcggcgaa tcggcgcacc cgatagccga 3955561 gttcggccag cacataaccc atcagcccgt tgtgctcaaa gcagtacccg cctcggcgcc 3955621 gaagtaccag cttgtcggcc agcgcctgtg gactgaggtc gtcgaccggc acccccagca 3955681 gcgggtcgag gttctcgaac ggaatcgttc gactgtgcac ggtcaccaga tcctgcagaa 3955741 catccagggt tggatcggta gcgccgcgat agttgatgcg atcgaagtac gcggtcagat 3955801 ccagtgccat gttgccattc tgacctcgtc gccgcgtcgg accgaccgca gggtattcgg 3955861 gcgttcgtcg cgcagccggc caactatgtc gcaccgattg tggtttgcca catgagtttc 3955921 tgggtcgacg gcaaacaaag tgccctcgca gcgacacgtg tcggcggcta cggcaaactg 3955981 cccgctgaca ctcacccatc cggtggctgc tcgcgccatc tggccgaatg cccggcgggt 3956041 cggaggatcg gcgccggaca caacaacgca tcatgatgcc tgttactgat gctattgccg 3956101 acccacggca ccggaggctt gcaggccggt gtcgacttgg acgacaagga agccctcgcc 3956161 gaactgatcg gcgacaatgc tgctccttga cgtaagcgtc tgcatattcg ccatccgcga 3956221 ggacagctgc cccaaccacg cgacataccg gacgtggctc accagactgc ttaccggcga 3956281 cggcgagcag acgcaaaatc gcccaacacg cccgcaaaat gggcgatttt gcgtctgctc 3956341 gcgccactag agccaggtgt cctgggtggt ggtggtgagg aaagcctcca ggtcgtcgcg 3956401 ccagtgcgcc ggcgtggtct tttccggctc gatgccggtg tagtcgccgc gatagaacag 3956461 cagcggccgc ggcttgaccg ccgggacctc tgacagtgac tcgacggcac cgaacaccac 3956521 gaagtgatcg ccgccgtcgt gcaccgacgc caccgtgcag tcaatgtagg ccagcgatcc 3956581 ctcgatgatc ggtgagccta gttccgaagg gcgccaatcg ataccggcga acttgtccgg 3956641 ctccttcgag ccgaatcgcg ccgagacgtc tttctgcttt tcggtcagta cgttgacgca 3956701 gaaccggccg ctggcctcga tggcctgcca ggaccgcgac accttagtgg ggcagaacag 3956761 caccaacggc ggttccaacg acagcgccgc gaacgactgg cacgcaaacc cgacgggcac 3956821 gtcgtcgtgc acagtggtga tgacagtgat ccccgtacag aactgaccga gcacggagcg 3956881 gaacgtgcgt ggatcgatct gagccgacat cgtttgcttt cgagctagcc gcgagcgcct 3956941 acggtgaaat cgtgacccca taggctgacc gcagtactct cccgggcgat ccagtcccga 3957001 tcgtcgactt gcctgccctc acaaccgaat tcgatgtcga agccaccggg cgtcttcatg 3957061 tagaacgaca gcatcaggtc gttgacatgc cggcccaggg tggccgacat cggcaccttg 3957121 cgccgcaacg cccggtccag gcacaggccc acgtcgtcgg cctgctcgac ctcgaccatc 3957181 aggtgcacga tgccgctgga cgtcggcatc ggcaggaagg ccaacgagtg gtgacgcggg 3957241 ttacagccga agaaacgcag ccaggctggc ggcccgtcgg cgggccgccc taccatccgt 3957301 ggcggtagcc gcatcgagtc acgcagccga aagccgagca cgtctcggta gaaatgcaac 3957361 gcctcagcat cgtcgcgggt ggacagcacc acatggccca taccctgctc accggtgacg 3957421 aacctgtgcc catacgggct gaccactcgg cggtgttcca gcgcggtacc gtggaagacc 3957481 tccaggcaat tgccggaagg gtcggcaaac cggatcatct cgtccacccg gcgatcggcc 3957541 agctcggcgg cggtggcctc tttgtacggc gtgccctcca aatccaggcg gttccggatt 3957601 tcctgcaggc cttcggcatt cgcgcattcc caaccggcct ccaacagcct gtcgtgctca 3957661 ccgggcacga ccaccagccg ggccggaaag tcatccatcc gcagatacag ggccccttct 3957721 ggggcccctt tgccctcgac catgcccagg accttcagtc catactcccg ccaggcagcc 3957781 atgtcagtgg cctcgatgcg cagatagccc agcgaccgga tgctcatctg ccacctccca 3957841 gaaattcaat cgtcagcttg ttgaactcgt cgaacttctc cacctgcacc caatgcccac 3957901 actgcccgaa tacgtgcagc tgcgcacgcg gaatcgtttt caacgcaacc agcgcgccgt 3957961 ccagcgggtt gacccggtcc tcacgacccc agatcagcaa caccggctgg cgcagccgat 3958021 acacctcgcg ccacatcatg ccggcctcga agtcggctcc ggcgaacgac tttcccatcg 3958081 cccgtgttgc cgtcaacgac tccggggtgc tggccagcgc aaaccgctga tccaccaact 3958141 cgggggtgat caggttcttg tcgtagacca tgacccgcag gaacgcctcg aggttctccc 3958201 gggtgggcgc aacggagaac ttcgacagcc gtttgactcc ctcggtcggg tcgggcgcaa 3958261 acaggttgat actcaggccc cccgggccca tcagcactaa ccgtcctgcc cgggccgggt 3958321 agtccagcgc aaaccggacc gcggttcccc cgcccaacga gttgcccacc agcggtaccc 3958381 gccccagccc cagctgatcg aagagcccct tcagcgccat cgcggcatag cgattgaact 3958441 ggccgtgctc ggcccgcttg tcggaatggc cgtaaccggg ctggtcgacg gccagcacat 3958501 gaaagtgccg cgccagcacc gcgatattac gcgagaagtt cgtccagctc gccgcgccgg 3958561 gcccaccgcc gtgcagtagc accaccgtct ggtcgttgcc cacgccggcc tcgtggtagt 3958621 gcagtttcag cggcccgtcg acgtccactt ccgcaaagcg cgaggtggat tcgaacgtca 3958681 attcctcggt agctgtcatt tcgcctagac cagctagacc atggtgtcgc cgggcggcaa 3958741 cccgaactcg tggtttccaa agatcacgta tgcccgctcg gggtcgttgg cggcgtgcac 3958801 ccgaccggcg tgcgcgtcgc gccagaaccg ttgaatcgga gcctcattgg acaacgcggt 3958861 ggcaccggac gcctcgaaca gccggtcgat cgaggcgatt gagcgaccgg tggcgcgcac 3958921 ctggtcgcgg cgcgcacggg cgcgcagttc gaacggaatc tccttgccgg cagccagcag 3958981 cgcgtattcg tcgctcacat taccgatcag ttggcgccac gcggcgtcga tgtcgctggc 3959041 cgcctcggcg atacggacct tggcaaacgg gtcgtctttg gccttttccc cggcgaacgc 3959101 cgcgcgcacc cgcttgccct ggtgctcgac gtgcgcggcg taggcaccgt aggccatgcc 3959161 gacaatcggc gccgaaatcg tagtgggatg cattgtgccc catggcattt tatagacagg 3959221 tgcgctgttg gtcgccagcc ctcccgcggt gtggtcgttc atcgccttgt acgacaagaa 3959281 ccggtgccgg ggcacaaaga catccttgac caccagggtg ttgctgccgg taccacgtaa 3959341 gccgaccacg taccacacgt ccttgatctc gtattcgctg cgcgggatca ggaaactgcc 3959401 gaagtccacc ggccggccgt ccttgatgac cgggccgccg acgaacgtcc agctggcatg 3959461 gtcgcagccc gaggaccagt tccacgaccc gttgaccagg tagccaccgt cgaccaccac 3959521 gcccgccccc atcggtgcgt acgaggacga gatccgcgta ctcgggtcct cgccccagac 3959581 ctcctcttgg gcccgttggt cgaacagcgc cagatgccag ttgtgcacgc cgacgattga 3959641 gctcacccac ccggtggaac cacacacgct cgccagtcga cgcgtcgcct cgaagaacag 3959701 cgcagggtcg cactgcagtc cgccccactg ctgcggctgc aacagggtga agaagccgac 3959761 gtcgtcgagc gccttgacgg tctcgtcggg cagccgccgc agatcctccg tggcctgggc 3959821 gcgatcccga atctccggca gcagatcatc gatggcagcc aagacagact gagcatcacg 3959881 ctgttgaatg gacgtcactt acttttgcct ctccgggttg cgaacttaga gaaagactag 3959941 aacacgttcc gatttgtgtc gagctaggta ttcctgcggc aggtagcgat accaaatggg 3960001 ttttctgtaa catgttctag ttatgacgga agagaggacg ggtcttgacc gaggcaattg 3960061 gagacgagcc actcggcgac cacgtccttg aactgcagat cgccgaggtc gtcgacgaaa 3960121 ccgacgaggc gcgatcgctg gtcttcgcgg tgcccgacgg atcggacgac ccggagatcc 3960181 cccctcggcg cctgcgttac gcccccggcc aattcttgac gctgcgcgtg cccagcgagc 3960241 gtaccggttc ggtggcgcgc tgctactcgt tgtgcagttc gccctacacc gacgacgcct 3960301 tggcggtcac ggtcaaacga accgccgacg ggtacgcctc caactggttg tgcgatcacg 3960361 cgcaggtggg catgcgcatc cacgtgctgg ccccgtcggg caacttcgtc cccacaaccc 3960421 tcgacgccga tttcctcctg ctggcagcgg gtagcggcat caccccgatc atgtcgatct 3960481 gcaaatcggc gcttgccgag ggcggtggac aggtgacgct gctctacgcc aaccgcgacg 3960541 accgctcggt catcttcgga gacgcgctgc gcgagttggc ggcgaagtat cccgaccggc 3960601 tcacggtgct gcactggcta gagtcgctgc aggggctgcc gagcgcgagc gcgctggcca 3960661 agctcgtcgc gccctacacc gaccggccgg tgttcatctg tgggcccggc ccgttcatgc 3960721 aggcggcccg ggacgccctg gcggcgctga aagtgcccgc ccaacaggtg cacatcgagg 3960781 tgttcaagtc gctggaatcg gatccgttcg cggccgtcaa ggtcgacgac agcggtgacg 3960841 aggcgccggc gaccgcggtg gtggaactcg acggccaaac ccacaccgtc tcctggccgc 3960901 gcaccgccaa gctgctcgac gtgctgctgg ccgcgggcct ggacgcgccg ttctcctgcc 3960961 gggaaggcca ctgcggtgcg tgtgcgtgca ccctgcgcgc cggcaaagtg aatatgggag 3961021 tcaacgacgt gctcgagcag caggatctcg atgagggact gattttggcc tgtcaatctc 3961081 gcccggaatc tgattcggtg gaagtgacct acgacgagta gtcccggaag ggagcgagat 3961141 gacgcggctg ataccgggtt gcacgctcgt cgggctgatg ctgacgttac tgcccgcgcc 3961201 cacctcggcg gccgggagca acaccgccac caccctgttc ccggtcgacg aggtcaccca 3961261 gctggagacg cacaccttcc tcgattgcca ccccaacggc agctgcgact tcgtcgctgg 3961321 agcaaatctg cgcacacccg acggcccgac gggctttccg cccgggctgt gggcgcgcca 3961381 aaccaccgag atccgttcga cgaaccggtt ggcctatctg gacgcgcacg ccaccagcca 3961441 gttcgaacgg gtaatgaagg cgggcggatc cgacgtgatc accaccgtct acttcggcga 3961501 gggtccgccg gacaaatacc agaccaccgg ggtcatcgac tcgaccaatt ggtcgaccgg 3961561 tcaaccgatg accgacgtca acgtcatcgt gtgtacacac atgcaggtgg tctacccggg 3961621 ggtcaacctc acctcgccca gcacctgcgc gcaagccaac ttttcctagc taggactcgt 3961681 cctggtactc gctgagccgg taaatcaacg cggcagaccc agcagccgtt cggcggccac 3961741 cgtcaacagg atctgctcgg taccgccggc tatcgtcagg caccgggtgt tgaggaagtc 3961801 gtacaccgcg cggttctcga cgagcccgcc cccgtcggac acctccatca ggtattcggc 3961861 cagcgcctgt cggtagcgca cgccgatcag tttgcggacg ctggattgcg cccccggatc 3961921 ctggccgccg acggccaact cggcgatccg ccggtccaac agcgcaccgg cctgagccag 3961981 caggatcagc ctgcccagcc gatcttgctg cgcgacatcg agttccatgt cacccaagac 3962041 cttgagcagc tcttccatcg ggttgcccag cgcggtcccg gtggccatcg cgacccgctc 3962101 gttggctagc gtggtgcgcg ccagccgcca gccgtcgttc acggcgccga cgaccatctc 3962161 gtcggggacg aacacattgt ccaggaagac ctcgttgaac agcgagtcgc cggtgatctc 3962221 gcgcagcggt cggatctcaa ttcccggtgt ggtcatgtcc accaggaagt aggtaatgcc 3962281 cttgtgcttc ggagcatccg ggtcggtgcg cgccaggcac accccccacc gagccttgtg 3962341 agccgccgac gtccacacct tctgtccggt gagcagccag ccgccgtcag cccgcaccgc 3962401 cttggtacgc agcgacgcca ggtccgaacc ggcccccggc tcggaaaata gctgacacca 3962461 aaggaattca ccgcgcatgg tggccgggac gaaacgttcg atctgttccg gcgtgccgtg 3962521 ttcaaggatg gtcggcgccg cccaccagcc gatcaccagg tccgggcgct caaccttggc 3962581 cgcggccagt tcctgatcga tcagcagttg ctcggccggg gacgcgccgc gcccgtacgg 3962641 cgccggccag tgcggcgcca gcaggccggt gtccgccagc gccacctgac gtttctcctc 3962701 gggcaacgcg gccacctcgg cgaccgccgc cgcgatctcc ggtcgcaggc cggccacctc 3962761 ggccaggtcg acgcccaagc gacgacggac accggcctgg gtcagcgccg taacccgacg 3962821 cagccagcgc ccggatccac cgaggaaccc accgattccg tgggcccggc gcagatacaa 3962881 atgcgcgtcg tgctcccagg tgcagccgat accaccgagc acctggatac agtccttggc 3962941 gttggctttg gcggcgtcga tgccgatgct cgcggccacc gccgcggcga tcgaaagttg 3963001 ggtgccatcg gaatcggctg cggcgcgggc cgcatcggcg gcggccacat cggcctgctc 3963061 ggcacggcac aacatctgag cacacaggtg cttgacagcc tggaagctgc cgatcggctt 3963121 gccgaattgc tcccgcacct tggcgtaggc aaccgcggta tcgagcgtcc atcgagccac 3963181 cccggccgcc tcggccgcca gcacggtagc ggccaggtct tccacccgct cccccgacac 3963241 ctccagaacg gtgaccggtg ccgatgtcag caccatccgg gccagcggca gcgaaaagtc 3963301 ggtggcccgc agcggctcca ctacgacctc gtcgcaagca gtgtccacca gcagccaatt 3963361 cccgtcggcc ggcaacagca cgacgccgcc gggcgcgcca ccaagcactc ggccgacggt 3963421 gcccgacgcg gtcgacgtct tcgggtcgac ctgcacgcca ccgtcgatag ccaccccggc 3963481 gaaccgttca cccgacgcta gcgcgctgcg cagcttggga tcggagacaa ccaaagtggc 3963541 caccgcggtg gtcgcgaccg gccccggtac caacgccctg gccgcctcgt cgaccatcgc 3963601 acacaggtcc tcgatgctgc cgccagctcc gccacaatcc tctgggacgg cgacaccgaa 3963661 gaggcccagg cccgccagcc cggcgaacac cggccgccat gcgtccgcat ttccttcttc 3963721 gaagccgtat tccatgtcgc ggaccgccgc agtcgcggcc gcacctgagg ccgcggtgcg 3963781 ggcccagccg cgcaccaact cacgagccgc ggattgttcg tcggtgacgg tcgctaccac 3963841 ctgcagacct ccgcgtcgac aatttcacat agcaatggag cgttcttgcc cactagaacg 3963901 tgttctaata gtgctaacga tcaaccgtca agtcgaaggc aataactcca gcacatgtcg 3963961 tcgtctcggc tgtcgggagg tgggaaatct acacacagca tgcgtatcgt ttgcaaacga 3964021 accgcccgga agaggagctg cccgctacat gtcgtcagcg aacacgaaca ccagtagcgc 3964081 tcccgacgca ccacctcgcg cggtcatgaa agtggcggta cttgccgagt ccgagctcgg 3964141 atcggaggca cagcgggagc gccgcaagcg catcttggac gccaccatgg ctatcgcgtc 3964201 caaaggcggc tatgaggcgg tgcagatgcg cgccgtcgcc gaccgcgccg acgtcgcggt 3964261 tggcacgctg taccggtact tcccgtcgaa ggtgcatctg ctggtgtcgg cgctgggtcg 3964321 ggaattcagc cgcatcgacg cgaaaaccga ccgctccgcg gtcgccgggg ccaccccctt 3964381 ccagcggctg aactttatgg tcggcaagct caaccgcgcg atgcaacgca atccgctact 3964441 caccgaggcc atgacacgtg cctacgtgtt cgccgacgcc tcggcggcca gcgaggtcga 3964501 ccaggtcgaa aagctcatcg acagtatgtt cgcgcgtgca atggccaacg gcgaaccaac 3964561 cgaggaccag taccacatag cgcgggtgat ctcggacgtg tggttgtcga acctgctcgc 3964621 gtggcttacc cgacgagcct cggctaccga cgtcagcaag cggctggacc tggccgtgcg 3964681 gctgctgatc ggcgatcaag acagcgccta gaagacttac gccggcggac ccgcggtgcg 3964741 gccccggacc agctcggtat cgagcacctc gatgacgggc agtccggacc gcggcggctt 3964801 cagcaatagc tcgcccgccc ggtgcccctt gtgcagactc ggctgcgcga ccgtggtcag 3964861 cccccggctc agcgcctctg gcactccgtc aaaccctgtg acggtcatct gcccgggcac 3964921 gtaaatcccg tgcgcccgaa ggtaatccat agctgagagc gccaagatgt ccgctgtgca 3964981 catcagcgcg gtcagccgcg gattggcctg cagagccacc ttggcggcag tgccgccgga 3965041 cgtcggcaaa tgctcgtagc tttccaccac ggtcagcgag tccgggtcga cgccggcggc 3965101 cgtcatcgcc tcccatacgc cgacgatgcg ttcgcgctgt acgtcgaagg tcggcgaccg 3965161 cagccgctcg gcgtccacca agtcttgccg ccgatcccgt cccagccgca tggtcagcag 3965221 gccgagctcg cgatgcccca acccgagtac gtagccggca agctcacgca tcgccgcccg 3965281 gtcgtcgatg ccgacccggg acactccgga gaggtctttg ggctggtcga ccaccaccac 3965341 cggcagccgc cgctgcagca cgacctgcag gtagggatcg tcgtcgccta ccgaatacac 3965401 cacgaagccg tccaccccag cgccgagcac ggcagctgtg ccgtccgcaa ggctccgact 3965461 ggagccgacg gaaaccagct gcaggccctg ccccagctct tcgcacgact gcgccactcc 3965521 cgcaacaaaa tcccgcgcgg ccgggtcgct gaagaaatag gtcagcggtt cggccatcac 3965581 caaaccgacc gcaccggctt tgcgggtccg caacgatcgc gccaccggat ccggtccggc 3965641 atagcccagt cgcttggccg tggcaagcac tcgttcacgt agatcggcgg agagctgatc 3965701 cggtcggtta aaagcattcg agacagtggt gcgggacacc ttgagctcgg ctgctaacga 3965761 cgccagagtc gcccgcctcc gcggtgtggg actcacgttc ggtgagggta cagcggaccc 3965821 tcgagcacgc aatatcgtgg gccggctggc aaccgtcggt ttcgacgttg gtgacgaccc 3965881 ctcgttcatg aatcgttctt gagctccccg ttttgctgga tgcccaggca ccgccggtac 3965941 tgctgcgctt aagcttgtcg cacatggtgc cggcagggag gaacagtggg caagcagcta 3966001 gccgcgctcg ccgcgctggt cggtgcgtgc atgctcgcag ccggatgcac caacgtggtc 3966061 gacgggaccg ccgtggctgc cgacaaatcc ggaccactgc atcaggatcc gataccggtt 3966121 tcagcgcttg aagggctgct tctcgacttg agccagatca atgccgcgct gggtgcgaca 3966181 tcgatgaagg tgtggttcaa cgccaaggca atgtgggact ggagcaagag cgtggccgac 3966241 aagaattgcc tggctatcga cggtccagca caggaaaagg tctatgccgg caccgggtgg 3966301 accgctatgc gcggccaacg gctggatgac agcatcaatg actccaagaa acgcgaccac 3966361 tacgccattc aagcggtcgt cggcttcccg accgcacatg atgccgagga gttctacagc 3966421 tcctcggtgc aaagctggag cagctgctcg aaccgccggt ttgtcgaagt cacccccgga 3966481 caggacgacg ccgcctggac tgtggctgac gttgtcaacg acaacggcat gctcagtagc 3966541 tcgcaggttc aggaaggcgg cgacggatgg acctgccagc gtgccctgac tgcgcgcaac 3966601 aacgtcacta tcgacattgt cacgtgcgcc tatagccaac cggatttggt ggcgattggc 3966661 atcgctaacc aaatcgcggc caaggttgct aagcagtagg catggccgac ggtccccttg 3966721 ccatcacggc gaaatcggtt tacatacatg gctattcggt agatacggca gagattccaa 3966781 cagctgtgcg tggccacccg aatgccgcgg gaaccgcgat caaggaccgc cgctgatgcg 3966841 gccgaaactt gggcgtccca atatcgcgcg gtattccaac aggtttagcg tgcctaccgc 3966901 cagatccgat gctccgttgt cggtgacctg gatgggcgtt gcgacgctgc tggtcgacga 3966961 cggatcgtcg gccctgatga ctgatggcta cttttcccgg cccggcctgg cacgggtggc 3967021 ggcgggtaaa gtgtcgccgt cagcggagcg ggtcgacggt tgccttgccc gggccaatgt 3967081 ctcccggctg acggccgtta tcccggtgca cactcacatc gaccacgcga tggattccgc 3967141 gctggtcgcc gaccgtaccg gagcccagct ggtcgggggg gagtcggcgg ccaatgtcgg 3967201 gcgcggatac gggttgcctg aggagtctct tgtcgtcgcc gtcccaggtg aaccaatcca 3967261 gttgggcgcc ttcgacgtga cgttggtgga gtcgcatcac tgcccacccg accggtttcc 3967321 cggtgtgatc agcgcaccac tgacaccgcc ggtgaaggcg tcggcctacc gctgcggtga 3967381 ggcgtggtcg acgctggtgc accaccggcc atcggggcgc cggctgttaa tccaggacag 3967441 cgccggtttc gtcagcggcg cactggccgg ttaccgcgcc gatgccgcct acctcagtgt 3967501 cggccagctc ggcctgcaac cgccgtcata cctgctcgaa tactggaccg agaccgtgcg 3967561 cacggtgggc gtccgccgcg tgattctcat ccactgggac gacttttttc ggccgctgtc 3967621 aaagccgttg cgggccttgc catatgcggc cgacgaccta gacctgtcga tccgcatcct 3967681 cgacgagctg gccgcccagg acggcgtcgc gctgcagatg ccgacggtgt ggcgccgcga 3967741 ggatccctgg atgtgaagcg ctctagccct tgacacttgc tgttgcgctg atactgcttg 3967801 ccgtggtcct ggggttcgcg gttgcccgcc cacgcggctg gccggaggca gcggcggcgg 3967861 ttccggcagc ggtcatcctg ttagcgatcg gggcgatctc gccccagcag gcgatggcgc 3967921 aggtgtccgg gctggcgcgc gtggtcgcgt ttctgggtgc ggttctggtg ctggctaagc 3967981 tgtgcgacga cgaaggcctg ttcgaggcag ccggcgcggc catggctcga gcgagcgcgg 3968041 agtcgcaccg actgctacgg caggtgttcg ccgtctcggc cgccatcacc gcggcgctct 3968101 gcctggacgc caccgtggtg ctgttgaccc cggtggtgct ggcgacggtc cgccggctgc 3968161 ggaccccggt gcgcccctat gcctacgcca ccgcccacct agccaacgcc gcttcgctgc 3968221 tgcttccggt gtcgaatctg accaacctgc tcgcctacca cggtgccggc atctcgttca 3968281 ccaagttcac gctgctgatg gcattgcctt ggctgtccgc cgtggccgcg gtctatgtgg 3968341 tcttccgctg gtttttcgcc cgggatctac gcgtggtgcc ggaccggcag caactcaagc 3968401 cggcgccgcg cctgccaatg ttcgtgctgg tggtggtggc gctgacactc gggggcttcg 3968461 ccgtcgccga gtcggtggga ctggccccaa cgtgggcggc gctggctggc gccgcagtgt 3968521 tggcgctgcg aagtctgcgg cgtggacaca cttcggtgct gcggatcgcg cgcgccgtca 3968581 acgtgtcgtt cctggtcttt gtgttggccc tgggtgtcgt ggtgcacgcg gtcatgctca 3968641 acggcatggc cgccaggatg tccgccgtgc tgccgaccgg gtccgggttg cccgcgctgc 3968701 tcggcatcgc cgcgctggcc tccgtgctgg ccaacgtggt caacaacctg cccgcgactc 3968761 tggtgttagt gccgctggtg gcggccggcg ggccggcggc cgtgctggcc gtgctactcg 3968821 gggtcaacat cggacccaac ctgacctatg ccggttcgct gtctaacctg ctgtggcggg 3968881 gcgtgctgcg ccggcacaac gtcgacgcca gcgtcggcga gtacacccga ctgggactgt 3968941 gcaccgtgcc tgcggccctg gcgatggcgg tgctcgcgct gtgggccagc gcccaggttc 3969001 tggggatcta gccgcaaggg cgcgagcaga cgcagaatcg catgatttga gctcaaatca 3969061 tgcgattctg cgtctgctcg cgaggctcgc gtggccgccg gcgctggcgg gcgatcgcgg 3969121 cgagcaccac cccagcggcc accgaggcgt tcagtgattc ggcctgagcg gccatcggga 3969181 tggacaccac ctcgtcacag ttctgcctta ccaaccggga caaccccttg ccttccgacc 3969241 caacgaccac caccaacgag tcagtgccat ctacatcgtc gagcgcggtg ccgccaccgg 3969301 cgtccagtcc gatcacccgc actccacgat cggcccagcc cttcagcgtc ctggtgagat 3969361 tggtggcccg ggccaccgga atcctggccg ccgccccggc gctggtgcgc cacgccaccg 3969421 cggtcaccga cgcagaacgg cgttgcggaa tcagcacccc atggccaccg aacgcggcca 3969481 ccgaccgcat gatcgcaccg aggttgcgcg ggtcggaaag gttgtccaaa gcgaccagca 3969541 gcgcaggcgg ttggtcgagg gcggcggcca gcaggtcatc gggatgggcg tagttgtacg 3969601 gtggcacctg tagcgcgatg ccttgatgga ggtggttggc ggtcatccga tccaggtcgg 3969661 cacgtagcag ctcgacgatc gcaatccctg aatcagccgc ccgcgcaacg cattcagtca 3969721 gtcgctcgtc ggcctcggta ccaagggcga cgtatagcgc ggtggccgga acacccgcgc 3969781 gcaggcattc cagcactggg ttgcgaccca acaccgtctc ggtctcgtcc gcgcgcttga 3969841 ccgggcggcg tggctgtgca cgtgcccgct tggcggcggg atggtgggga cgcaggtgcg 3969901 ccggcggggt aggcccgcgc ccttccagcc cacggcgtcg ctgaccgccc gagccgacgc 3969961 ctgcgccttt cttggtaccg gatttgcgga ccgcaccccg ccgccgagag ttaccgggca 3970021 tctacttggt gtcaccaccc agcagcgacc actgtggccc gtcggcggtg tcggtgacct 3970081 cgatgccggc tctcttcagc cgaccccgga tctcgtcggc gagcgcccag ttgcgctgct 3970141 cgcgggcctt ttcccgattc tgtagttcag cctggaccag cacatcgacg gcggccagcg 3970201 ctgccgaggt ttcgtctcgg gattcccagc gctggtcgag cgggtcacag cccaggatgc 3970261 ccatcatcgc ccgaatcgcg ctagcgcttc gcaaggcccc gtcgtggtcg ccggcatcga 3970321 gtgcccggtt gccttccgcc cgcacgtggt gaatctcggc gagcgcgatc ggaacggaca 3970381 ggtcgtcgtc gagcgcttcg gcgaaccgtg gggtcggatc gccggggcag acggcgccca 3970441 cccgggtgcg aacgcggtgc aggaagtcct ctagcccgac ataggctttc accgcatcct 3970501 gcatagcggt ctcggagaac tcgagcatcg accggtagtg cgcgctgccc aggtaataac 3970561 gcagctcagc cggccgcacc cgctgcaaca tcgccggcat ggacaacacg ttgcccagcg 3970621 acttgctcat cttctccccg cccatcgtca cccagccatt gtgcagccag tagcgggcga 3970681 acccatcacc ggcggcgcgg ctctgggcga tttcgttctc atgatgcggg aagactaaat 3970741 ccattccacc gcaatggata tcgaattccg gcccgagata gctgcgagcc attgccgagc 3970801 attccagatg ccagcccgga cgcccgcggc cccacggcgt cggccacgac ggttcacccg 3970861 gcttttcgcc cttccacaaa gtgaagtcgc gctggtcccg cttgccggca gccacacctt 3970921 cgccctgatg gacgtcatcg atcttgtgac cggataactg gccgtactcc gggtagctca 3970981 gaacgtcgaa gtaaacgtca ccgccaccgg tatacgcgtg gccggcctgg atcaggcgct 3971041 cgatcatctc gatcatctgg gtgatatgcc cggtggcgcg cggctccgcg gacggcggca 3971101 agacgtccag agcgtcgtag gccgcggtga aggcacgctc gtgggtagcc gcccactccc 3971161 accacggccg gcccgccgcg gcggccttgg ccaggatctt gtcttcgatg tcggtcacgt 3971221 tgcggataaa cgcgacgtcg tagccacgcg cgagcaacca tcggcgcagg atgtcgaagg 3971281 cgaccccgct gcggacatgc ccgatatgcg gtaggccctg caccgtggca ccgcacaggt 3971341 agatcgagac gtgtccaggt cgcaacggga cgaaatcccg cacgacaccg gcggcagtgt 3971401 cgtgtagccg caagcgagcc cgatcggtca cgacgtgcca gcttacctgc ccaattgctg 3971461 caacctgcgg cgcgcgcgtc cggaccagga gtgcgctacc gcaacgaaac caccaatgcc 3971521 gtagcgattg cggccaagcc ctcgccgcgg ccagtgaggc ccagcccgtc ggtggtggta 3971581 gccgacaccg acaccggcgc gttgagcaga cgtgacagca ccgcctgcgc ctcgagccgg 3971641 cgccaaccga tcttcggtcg gttgccgatc acctgcacca cagcgttgcc gacccgatag 3971701 ccatgctggg tgatcaggac gacgacatgg cgcaacatgt cggcaccact gacaccctgc 3971761 caacggggat cgtcgacgcc gaacacctcg ccaatgtcgc ctagccccgc ggccgacagc 3971821 accgcgtcgc acagcgcatg aacggccacg tcaccgtcgg agtggcccgc gcaaccgtcg 3971881 gcgctcggga acaacaaccc caccagccag cacggacgtc cgggttcgat cggatgcaca 3971941 tcggtcccca aaccaacgcg gggcagctga ttcacccgcg cactatagct tgggccagca 3972001 acagatccag tttggtggtg atcttgaacg ccagcggatc gccgtcgacc acctgcacct 3972061 ggccgccgat atgctcgacc agcgacgcgt catcggtgta ctcggcggct ggaaggtcta 3972121 gggagccgcg ctgatatgac cgcagcagca ggtcggtagt gaacccttgt ggggtctgca 3972181 cggcccgcag cccggctcgt tccggcgtgc ccaggaccac cccgttggca tccacggcct 3972241 tgatggtgtc agaaagcggc agtacgggaa cgacggcggc ataaccgtcc cgcaacgcct 3972301 cgaccacccg ggcgaccagg gccggtggtg tcagtgcccg cgcggcatca tgcacaagca 3972361 caaactccgg ctccgcggtc ccggacagca ctgccagcgc caggttcacg gtgtcagtgc 3972421 gattcgaccc acccgccaca atcatcgccc tgtggccgag gatctgcctc gcctcgtccg 3972481 tacggtcggc gggcacggcc acaacaacgg tgtcaactac ccccgaatcc agcaggccat 3972541 cgacggcccg ctcaatgaga gtctgcccgt cgagctggta aaacgccttg ggcacaccga 3972601 cggccaaccg ctcccccgac cccgcagccg ggacgatcgc aactacttcg cccgcttccc 3972661 tgaccactag agcctcaggg cggtcaagac gcggcggcta aaacctcgtc aaggatggtc 3972721 tcggctttgg cgtcatcggt gctctcagcc aacgccaact cgccgaccag aatctgccgg 3972781 gccttggcca gcatgcgctt ctcaccggcc gacaagccac gctcctggtc gcgacgccac 3972841 aaatcgcgca ctacctcggc caccttgttc acatcgccgg atgcgagttt ctcgaggttc 3972901 gccttgtaac gacgtgacca gttcgtcggc tcctcggtgt gcggggcacg caacacctgg 3972961 aaaaccttgt ccaggccttc ctgcccgacg acatcgcgaa caccgacgta ttcggcgttt 3973021 tcagcgggaa ctcgtactgt caggtcgccc tgcgcaactt tcaagacgag atactctttt 3973081 tgttcccctt tgatggtccg ggtttcgatc gcctcgacta acgcagcacc gtggtgtgga 3973141 tagacaacgg tgtctccgac cttgaaaatc atctgatttg agcccctttc gttactccat 3973201 gctaacacgg gccctaacgg gcgccgaaca acggtgcagg tcaggggcat agcgcgggaa 3973261 gattgggggt tgacagacgg gcctagaagt gcatcgccga atctgggacg cccctgagaa 3973321 cggggtgccc gggctaccgc gccggtccgg tcgacgccgc ggtccccacc gctaccgtcg 3973381 gcggcaccta actactactg tgcatagtcg agccgcaggc accatgccgc gccaaggccg 3973441 agcaggaggc atccgagtga accgctgcaa catccgcctg cgtcttgccg ggatgaccac 3973501 ctgggtggcg agcatcgccc tgctggccgc cgcactgagc ggttgcgggg ccggtcagat 3973561 ctcccagaca gcgaaccaga agccggccgt caacggcaat cggctcacca tcaacaacgt 3973621 gttgctgcgc gacatccgca tccaggccgt ccaaaccagc gatttcatcc agccaggcaa 3973681 agcggtggat ctggtgctgg tagccgtcaa ccaatcaccc gacgtttcgg accggctggt 3973741 gggcatcacc agtgatatcg gctcggtgac ggtggccggc gacgctcgac tgcccgcatc 3973801 cgggatgctt tttgtcggga cgccggacgg ccagatcgtg gcgccggggc ccttgccatc 3973861 caatcaagcg gccaaggcga ccgttaactt gaccaagccg atcgcaaacg gcctcaccta 3973921 caacttcacc ttcaagttcg agaaggccgg tcagggcagc gtaatggtgc cgatctcggc 3973981 cggattggct acgccgcacg aataggcgcc gcatcgtcgc cagacgagcg actcgctcgg 3974041 gttgtcacac ccccccgata cggtcacggc gtggccaacg ctcgttcgca gtaccgctgt 3974101 tcggaatgcc gccatgtcag cgcgaagtgg gtgggacgct gcctggagtg cggccgctgg 3974161 ggcaccgtag acgaggtggc ggtgctcagt gccgtcggtg gcaccaggcg ccgttcggtg 3974221 gcgccggcgt cgggcgccgt tccgatcagt gccgtcgacg cgcatcggac ccgaccctgc 3974281 ccaaccggca tcgacgaact ggaccgggtg ctaggtggcg gtatcgttcc cggttcggtg 3974341 acactgctgg ccggcgatcc cggagtgggt aagtcgacgc tgttgctcga ggtcgcgcac 3974401 cgctgggccc agtccggacg gcgcgcgctc tatgtctctg gtgaggaatc cgccggtcag 3974461 atccggctgc gtgccgaccg gatcggctgc ggcacggagg tcgaggagat ctacctcgcc 3974521 gcacaatccg acgtgcacac cgtgctcgac cagatcgaga cggtgcagcc ggcactggtc 3974581 atcgtcgact cggtgcagac catgtccacc agcgaggccg acggcgtcac cggcggggtc 3974641 acgcaggtcc gtgcggttac ggctgccctg accgctgccg ccaaggccaa cgaggtcgca 3974701 ttgattctcg tcggccacgt cacgaaggac ggggccatag ccggaccgcg ttcgctagag 3974761 cacctcgtcg acgttgtgct gcattttgaa ggggaccgca acggtgcgct gcggatggtc 3974821 cgcggggtca agaaccgatt cggcgccgcc gatgaagtcg gatgtttcct cctgcacgac 3974881 aacggaattg acggtatcgt cgacccgtcg aacctgttcc tggaccagcg gccgacaccc 3974941 gtcgccggta ccgcgatcac cgtgacgctg gacggaaaac ggccgctcgt cggggaagtc 3975001 caggcattgc tggccacacc gtgcggcggc tcgccgaggc gggccgtcag cgggatccac 3975061 caggcccgcg ctgcgatgat cgctgctgtg ctggaaaagc acgcacggct ggcgatcgcc 3975121 gttaacgaca tctacctgtc caccgtgggc ggcatgcggt tgaccgagcc gtcggcggat 3975181 ctggcggtcg ccatcgcgct cgcctcggcc tatgcaaatc tgccgctgcc caccactgcc 3975241 gtcatgatcg gcgaggtagg tctggccggc gacatccggc gggtcaacgg gatggcgcgg 3975301 cgccttagcg aagccgcccg ccaagggttc accatcgcct tggtcccgcc cagtgacgat 3975361 ccggtgccgc ccggtatgca cgcgctgcgc gcatccacca tcgtcgcggc gctgcagtac 3975421 atggtcgaca ttgccgacca ccgcggcacc accctcgcaa ccccgccctc acattccggg 3975481 actggacacg tcccactagg gcgcggtaca tagcagaatg cacgctgtga ctcgtccgac 3975541 cctgcgtgag gctgtcgccc gcctagcccc gggcactggg ctgcgggacg gcctggagcg 3975601 tatcctgcgc ggccgcactg gtgccctgat cgtgctgggc catgacgaga atgtcgaggc 3975661 catctgcgat ggtggcttct ccctcgatgt ccgctatgca gcaacccggc tacgcgagct 3975721 gtgcaagatg gacggcgccg tggtgctgtc caccgacggc agccgcatcg tgcgggccaa 3975781 cgtgcaactg gtaccggatc cgtcgatccc caccgacgaa tcggggaccc ggcaccgctc 3975841 ggccgagcgg gccgcgatcc agaccggtta cccggtgatc tcagtgagcc actcgatgaa 3975901 catcgtgacc gtctacgtcc gcggggaacg tcacgtattg accgactcgg caaccatcct 3975961 gtcgcgggcc aaccaggcca tcgcaaccct ggagcggtac aaaaccaggc tcgacgaggt 3976021 cagccggcaa ctgtccaggg cagaaatcga ggacttcgtc acgctgcgcg atgtgatgac 3976081 ggtggtgcaa cgcctcgagc tggtccggcg aatcgggctg gtgatcgact acgacgtggt 3976141 cgaactcggc actgatggtc gtcagctgcg gctgcagctc gacgagttgc tcggcggcaa 3976201 cgacaccgcc cgggaattga tcgtgcgcga ttaccacgcc aacccggaac caccgtccac 3976261 ggggcaaatc aatgccaccc tggacgaact ggacgccctg tcggacggcg acctcctcga 3976321 tttcaccgcg ctggcaaagg ttttcggata tccgacgacc acggaagcgc aggattcggc 3976381 gctgagcccg cgtggctacc gcgcgatggc cggtatcccc cggctccagt tcgcccatgc 3976441 cgacctgctg gtccgggcgt tcggaacgtt gcagggtctg ctggcggcca gcgccggcga 3976501 tctgcaatca gtggacggca tcggcgccat gtgggcccgt catgtgcgcg atgggttgtc 3976561 acagctggcg gaatcgacca tcagcgatca ataattatcc gccttgcgcg ggagactccg 3976621 gcggaggcgc ctgcgctgga cccggagcgg gtaccggccc gggcggcggc ggcggctgat 3976681 tcaggatgaa cggaaccggc agcgagcgca gattgcccag ttgtaccacg agattgtagg 3976741 tgcccggccc gatcgccggc cgcggcaatg ggcagcgcgg cgccgatccc atcccggtcc 3976801 aggtcaccgc ggtcgttacc tgctcaccgg gggaaaacgt cttgaccagc gtctcattcg 3976861 agggcgcgca gtccaggttg gaccacaacc gcttgttgtc cagcgagtaa acgtaggcgg 3976921 ccaacaccgc ggccccaacg tcgcgtttac aggacaccag gccgatgttg gtgaccacca 3976981 tggtgaactt cggctggtcg ccgacgtagt actgcggcgc gttggtcaaa cctttgacgg 3977041 ccagcgtcga atcggggcaa tcgtcccctt ccttgagcac cggcggcggc tgcaccgcgg 3977101 cggtgggcgt gggtgtctcg gggttttggc cctgcggcgg ggccgcggcg gcgttacctt 3977161 cggtttgccc ggccggctgg ggtgcttggg gtgccggcga gcccggatgg ctctgggcgg 3977221 aggccggctt gtcggcgctg accggtttgg caccggcgct gctgtcgacg aaggcgatga 3977281 cgatggccac cgcgatcccg actacgacga ccgcgatgcc cagggccagc cccctgcgcc 3977341 gccagtagat ctcggtaggt agcgggccac gcggttccag atccagcacg attacaccgt 3977401 agggccaggt cacgcaaacg cgcttgaccc gcctcggcgt gtcgccggct tcgctggccg 3977461 acgccgtgtt aacggtggcc tgttatcggg cggtaactca gacctcctcg ccgatgttgc 3977521 cgatgtggtc gcgcagtaca gcccgcccat cgtcgagttg ataggtgacg cccacgatcg 3977581 ccaggctgcc ccctgcgatt cgttctgaga tggccgatga acgcgccatg aggatcgcca 3977641 ccgtctcgtg tacatgtcgt tgctcgaact cgtcgacacg actcagaccg tcacggcggc 3977701 cgagcaggac cgacggcgca accctttcca cgacgtctcg cacgtagccg cctggcaggg 3977761 tgccgtcgtt gatcgcggcc aaagcggcgt tcacggcgcc gcagctgtcg tggccgagga 3977821 cgacgatgag cggcacattg agcacggtca ccgcgtactc tatggagccc agcacggccg 3977881 agtcggtgac atgcccggcg gtgcggacca cgaacatgtc gcccaggcct tggtcgaaga 3977941 tgatctcagc ggccactcgg ctgtccgcgc agccgaagat caccgccgtg ggcttctgcc 3978001 cggcggccaa gccggctcgg tggtcgacgc tctgactggg atgctggggc cggccggcga 3978061 cgaatcgctc gttaccctct ttgagtgctt tccacgcggc taccggattg gtgttgggca 3978121 tgcctcacat actgccggaa ccgtcggtga ccggcccgcg acacatatca gataccaatc 3978181 ttctcgcttg gtatcagcga tcgcaccggg atctgccctg gcgagagccc ggtgtcagcc 3978241 cgtggcagat cctggtcagc gagttcatgc tgcagcagac gccggccgcc cgggtgctgg 3978301 cgatctggcc ggactgggtg cggcggtggc ccacgccgtc ggccaccgcc acggccagca 3978361 ccgccgatgt gttacgcgcc tggggcaagc tgggctatcc caggcgagcc aagcgcttac 3978421 acgagtgcgc caccgtcatc gcccgcgacc acaatgacgt ggtgcccgac gatatcgaga 3978481 tcctggtcac cctgccgggc gtcgggagct acaccgcgcg cgcggtggcg tgtttcgctt 3978541 accgccagcg ggtgccggtg gtggacacca atgtgcggcg cgtggtggcc cgcgccgttc 3978601 acggccgcgc cgacgccggt gcgccatcgg tgccgcgcga ccacgccgac gtcttggcgc 3978661 tgttgccgca ccgcgagacg gcgcctgaat tttcggtcgc gctgatggag ttgggtgcga 3978721 cggtgtgcac cgcccgcaca ccccggtgcg ggttatgccc gctggactgg tgcgcatggc 3978781 ggcatgccgg ttatccgccg tcggacggtc cgccgcgccg ggggcaggcc tacaccggaa 3978841 ccgaccgcca agtccgcgga cggttactgg atgtgttgcg cgccgcggag tttcccgtca 3978901 cccgggccga gttggacgtg gcgtggctga ccgataccgc acagcgtgac cgggcgctgg 3978961 agtcgctgct ggccgatgcg ctggtgaccc ggacggtcga tggccggttc gcgttgcccg 3979021 gcgaagggtt ttagccgggt aggccgtccg caccggcggc gccgaaaccg ccgggatcac 3979081 cggggttgcc cgcgacgact gtcccagctc ccgcggcgcc acccgcgccg ccagcgccgc 3979141 cggcacctcc ctggcccccg gtaccgcccg caccgtggac acctggctgg ctgaacattc 3979201 cggcacctcc gccggcacct ccggcaccgc ccttgccgcc gttgccgccg gcgccgccgg 3979261 caccaccgtt gccgccgtca ccgccgacca ggccagagcc gcccttgcct ccggcgccgc 3979321 ccgaggcacc cgtgccgccg atgccgccgg caccgccggc gcccccgtta ccgccgtcgc 3979381 cgaacagcag cccgccctga ccgcccgcac ccccgacacc gccgacaccc ccggtgccgg 3979441 cggtgttggc gccagccccg ccggggccgc cgtcgcctcc gctaccaaaa aaggtcagcg 3979501 tgccggtggc gccgccgcca atgccaccat tgccgcccgc agccccggtg ccgccccggc 3979561 ccccggcgcc accgttgccg ccgatcccgt tgccaccgtt tagcgctagg ccgttgcccc 3979621 cgttgccccc gtcgccgccc cgggcgccgg cgccaccgtc accgccattg cccccgtttc 3979681 ccccgtaggc ccagccagta ccggtattga caccgatgcc gccgggtgcg ccgttgccgc 3979741 cgggcgcgcc gggaccgccg tcgccgccat tgcctccgtt gccccccgtc acagggcctt 3979801 cactcgtatc gctgccgctg ccgcctaaac cgccagcgcc gccagcccgc cctggtacgg 3979861 cacccgggtt gccgggcagc cctgcgccac cgctaccggc gccgttgttg gcgccggggc 3979921 ttccgtttgc cgcctggctg gtctggttcg gcggcgggtt catcccgttg gttccggggg 3979981 cacccacccc gccgacgccg ccgtcgccgc cggcgccgat cagccccgcg ttgccgccgg 3980041 cacccccatt gccgggcaac ccgccgatga ccgcggcccc gcccgccccg cccacaccgc 3980101 cattgccgaa cgcgccggcg gcgccgccgg ctcctccgtt gccactgacg gtcgttccca 3980161 ccccgccgaa cccgccggca ccgccgttgc cgaccagcca gcccgcccca ccggctccgc 3980221 ccacaccgcc ggcgccggcg gcgttgctcc cgccggcccc gccattgccg ccgtgcccga 3980281 acagcccggc ggcgccgccg ctaccaccgg gcccgcccac accggccaca cccgacccgc 3980341 cgttgccgcc attaccccac agcagcccgc cgggcccgcc cggctgcccg gttcccggcg 3980401 ccccgtcggc accgttgccg atcaacgggc gccccagtag cgtctgcgcg ggcccgttga 3980461 tcaagccgag cacctgctgc tccacggctt gcagcggcga cgcgttggcg gcctcggcgc 3980521 tggcatagga gttcactccc gcgtttaacg cctgcacgaa ctgggcatga aaccccgccg 3980581 cctgggcgct gagcctctgg tactcctggg cgtgcgcgcc gaacagcgcc gccatcgccg 3980641 ccgacacctc gtccgcggcg gccgctagca cacccgtggt cggggcggcc gcggcggcgt 3980701 tggcggcatt gagcgccgaa ccgataccgg ccacctctga agccaccgac atcagcgctt 3980761 ccggcgccac gatcacaaac gacatctgac acccctttcc gcggcgcggc ctgacggccc 3980821 gatcgtagcg cgatcacggg ccgacaaaac ccgttatggc caggcttttc gccacattgc 3980881 ccgcgccgcg tgggctcacg gggtaagccc cgccaggaac gactccaccg cccgccggta 3980941 aacctgtgga gcctcgtcat gaaccagatg accggcgtcg ggaacacgca aatacgctgt 3981001 cggataatct ctttcagcca tcgcgcgcat ctggcccggg ggagttaccc catcgccggc 3981061 ctcgatgagc agcgccggcg accgtacggc ccgccactgc gcccagtagt cacgggtgcc 3981121 ccattcggcg gcgatctcga tccatcgtgc ggtgcgcccg tgtagccgcc acccggtggc 3981181 cgtgcggtcg aatgcgtcca ggaagtaccg gccggcgacg ggcccgaact cggcgaatac 3981241 ctgttcggca gagtcgaatt cgaccggaag ggcgcgcagc cacggctccc atgggccggt 3981301 ggtcctacca cggaagtccg gcgccatgtc ctcgaccacc agcgccgaaa ccagttccgg 3981361 gcgctcggca gccagacacc acgaatgcaa ggctcccatc gaatgtccga ccatcctggt 3981421 cggcgcgccc agcgccgaaa ccgcgtcgcc cagatcggcc acgaagcgtt cggtgctgat 3981481 cgggtgtgga tcggcgacgt cacgcccgcg gtgccagggc gcgtcgtagg tgtacacggc 3981541 gcctaacagc gtcagccacg gaagctgacg ggcccaggtg gaacccctac ccatcaagcc 3981601 gtgcaccagg accaacggct cgccccgtcc gccgcgatgg gttaacagat tcgctggcat 3981661 gcggggcacg gtagcctagc ggcatgccag tggtgaagat caacgcaatc gaggtgcccg 3981721 ccggcgctgg ccccgagctg gagaagcggt tcgctcaccg cgcgcacgcg gtcgagaact 3981781 ccccgggttt cctcggcttt cagctgttac gtccggtcaa gggtgaagaa cgctacttcg 3981841 tggtgacaca ctgggagtcc gatgaagcat tccaggcgtg ggcaaacggg cccgccatcg 3981901 cagcccatgc cggacaccgg gccaaccccg tggcgaccgg tgcttcgctg ctggaattcg 3981961 aggtcgtgct tgacgtcggt gggaccggca agactgcata accggcgcgc ggggcgccgg 3982021 atgctggcgt taagcgccgc ggcggcattg attgtggcgc tggcgtcggg ttgctcctca 3982081 gctccgacgc cgtccgcgaa cgcggcaaat cacgggcacc ggatcgacac cagaactccg 3982141 cctggtctgc gggcgcaaca gaccatggac atgctcaact cggactggcc gatcggcgag 3982201 atcggcgttg gcactctcgc cgcgcccggg caggtcgaca cggtcaagac caccatggaa 3982261 gcgctctggt gggatcgccc gttcgcgctg gccggcgtcg atatcggcgc cagtgtggcc 3982321 gcgttgcacc tcatctcctc ttacggcgcg caacaagaca tccgcattca taccgacgac 3982381 gacggctggg ttgaccgatt cgacgtcgaa acgcaggcgc cgtcgatcgc ttcgtggcgc 3982441 gacgtcgacg cggtgctgag caagaccggc gcccgctact catttcaggt ggcaaaggtc 3982501 gacaacggtc gctgcgaccc ggtggcgggc accagcaccg gcgaatccct gccgctggca 3982561 tcgatcttca agttgtacgt gttacatgcg ctggccggtg cggtccaaca caacacggtg 3982621 tcctgggatg atctgctgac ggtcaccgcc aaaagcaaag ccgtgggctc ttccggcctg 3982681 gaactgcctg tgggggcacg tgtttcggtt cgcacagccg ccgagaagat gatcgccacc 3982741 agtgacaaca tggccaccga cttgctgatc gaaaggctgg gcacccgcgc catcgaggaa 3982801 gcgctggcca gcgccggcca tcacgatccg gccagcatga cccccttccc cacgatgtac 3982861 gagctgttct ccgtcggctg gggcaagcca gatctgcgtg accagtggaa gcatgcgacc 3982921 caacaggtcc gtgcccagat actgcggcaa accaattcca cgccctacca acccgaccca 3982981 acgcgcgctc acactccggc gtcaaactac ggtgcggaat ggtacggcag cgccgaagat 3983041 atctgccgtg tgcacgcggc actgcgagcc gacgcggtcg gcccggcctc gcccgtccga 3983101 cagatcatgt ccgccgtccc gggtatccag ctggaccgca gcgtgtggcc ctatatcggc 3983161 gcgaaagcag gtggcctgcc aggcgatctg acgttcagct ggtacgccgt cgacaagacc 3983221 ggccaaccat gggtggtgag ctttcagctg aactggcccc gcgatcacgg accgacggtg 3983281 accggctgga tgctgcaggt cgccaggcaa gtctttgcgt tgatagcgcc acaatagatc 3983341 gctacagccc aggcatccgg aggtatccgc ggctcgcttc cgtaacgacc ggccggtcgt 3983401 gctcgacgtg aacaacgaga cacttcccgc gccggtgcgt tcgacggccg attcgctccg 3983461 gctcaccgat aggaggcgcc accgtgggat ggatcggcga tccgatttgg ctcgaggagg 3983521 tgctacggcc ggcactcggc gagcgcctgc gggtgctcga cggctggcgg gaacgcggac 3983581 acggcgactt tcgcgatatc cgcggtgtga tgtggcacca caccggcaac tcacgtgaga 3983641 ccgccaaaag cattgcccgc ggccggcccg acttacccgg cccgctggcc aatctgcaca 3983701 tcgcgcacag cggggtcgta acgatcgtcg cggtaggcgt gtgctggcac gccggccgcg 3983761 gcagctaccc gtggctgcca accgacaacg ccaactggca catgattggc gtcgagtgcg 3983821 cgtggccgac catccggcgt gacggctcct acgacgccgg tgagcgctgg cctgacgcgc 3983881 agatcgtgag catgcgagac gtcgccgcgg cgctcacgct caagctcggc tacgggcccg 3983941 aacgcaatat tgggcacaaa gagtatgccg gggcggctca aggcaaatgg gacccgggaa 3984001 acctgtcgat ggactggttt cgcgccgagg tggcaaagga cacgcggggc gagttcgacc 3984061 accccctcac cccgccgccg gcggtgattg cccgcccacc gattctgccc aagccgcgca 3984121 acccgcgtga cgatcgcatc ctgctcgagg aggtgtggga ccagctacgc ggcatcgagg 3984181 gccgcggctg gccggtactc ggcgacaaga cgatcgtcga ctacctagcc gagctcggca 3984241 ataaggtcga cgccctggcc gcaaaactcg acgcgcgcga gggcctcgac cggcccagtg 3984301 acactcggta gctgctccag caggcggcgg ggtgctgacg gacccgctgc aacgatgtca 3984361 accgggctgg cccggctggc cgggctggcc gggtgcacct tcagggccga actggccgag 3984421 gttgccgtcg ccgccacggc ccgcaggccc aacgcccggg gagccggcca caccgggctc 3984481 accaccgccc ccgccattac ccccgctacc ggcggcaccg cccagcccgg agctaccgga 3984541 gacgccgaac aggccgcccg cgccgcccgc gccgcccgcg ccgccgtctc cgccggcgcc 3984601 gccgtcaccg ccgatcccgc cgttaccccc gtcaccagcg tcacccccaa cgcctggttg 3984661 cccggcgccg cccattccgc cctgaccgcc ggtgccgccc cgggcgccga tgccgccttc 3984721 gccccctgtg cccccgatgc cacctgcgcc gccgtcgccg accaggagcc caccccgacc 3984781 gccagagccg ccggccccgc cggtcccccc ggtgccgccg ggcgcgccgg ggccaccgtt 3984841 tccgccggtc ccgccggtcc cgccaacgga caggccagta ccgccagtgc caccggtgcc 3984901 accggtttgc ccgaactcgg tgcctggctg gccgggccgc ccggttcacc gttgttaccc 3984961 atgctgccct ggcccgccgg gacggtcagg gggttgaccc cggcggcacc cgccgcgccg 3985021 gccgcgccga gccccccggc cccaccgttg ccgaatagcc acgcgttgcc gccggcacca 3985081 ccggcgccgc cgttgccgcc ggggatagtc gccgccccgc cgttgccgcc cgcgccgccg 3985141 ttgccgtaca gcagcccgcc ttgtccgccg gacccgccgg ccgcaccggc tccgccggca 3985201 ccaccggatc cgccgttgcc gatcaacccg gccgccccgc cgctgccgcc ggcaatcccc 3985261 gcgttcaccc cggccgcgcc attaccgcca ttgccataca gcagcccgcc cgccccgccg 3985321 ttttgcccgg gcgccgtccc gtcggcacca tcaccgatca acggacgtcc caacagtgcc 3985381 tgggtgggcg cattgatcgc gttcagcaaa ctctgctgaa cgttggcggc ctcggcgttg 3985441 gcatacgccc ccgcacccgc actcaaggtc tgcacgaact gggcatgaaa cgtcgccgcc 3985501 tgcgcgctga tcgtctgata cgcctgggcg tgcgcgccga agagcgccgc aatcgccgcc 3985561 gatacgtcat ctgcgcccgc ggccagcacg ccggtggtcg ggatgctggc agccgcgtta 3985621 gccgcgctga tcgtcgaacc gagattggcc aaatccgtgg ccgccgccga caggaactcc 3985681 ggaacggcaa tcacaaacga cattggccac ctccgaacag cttccggaca aaccgacgtc 3985741 agcagagtct attgtcacag cggatcggcg gtcgcggttt tcgcctaata cggccgatgg 3985801 acctagaccg ctaccgcgcg gccggctccg ggccgcccgc gctgtgcgct ccagccttgg 3985861 ccagatccgg ctcggccggc ggcttgcggg taccggtgaa ggtgaacacc gcgtcctcgc 3985921 cgggaccttc accgtcccag ttgtccacgt cgacggtgac cacctgaccc ggcccgacct 3985981 cctcgaagag gatcttctcc gagagctgat cttcgatctc acgctggatg gtgcgccgca 3986041 acgggcgggc ccccagcacc gggtcgaagc cacgcttggc cagcagcgcc ttggccgcat 3986101 cggtcagcac cagcgccatg tccttgctct tgagctggcc ggcgacccgg ctgatcatca 3986161 ggtcgaccat ccggatgatc tcctcgcggg tcagctggtg gaagacgatg atgtcgtcga 3986221 tgcggttgag gaactccggg cggaagtgtt tcttcagctc gtcgttgacc ttctgtttca 3986281 tccgctcgta gtcgttctca ccgccgccct tggaaaagcc cagaccgacc ggcttagaga 3986341 tgtcggaggt gcccagattg gacgtaaaga tcagcacggt gttcttgaag tccaccgtgc 3986401 ggccctgccc gtcggtgagc cggccatcct cgagcacctg cagcaggctg ttgtagatct 3986461 cctgatgcgc cttctcgatc tcgtcgaaca gcaccaccga gaacggcttg cgccgcacct 3986521 tctcggtgag ttggccgccc tcctcgtagc cgacgtatcc gggcggcgcg ccgaatagcc 3986581 gcgacgcggt gaaccggtcg tggaattcac ccatgtcaat ctgaataagc gcgtcgtcgt 3986641 caccgaacaa gaagttggcc agcgccttgg acagttcggt cttaccgaca ccggacgggc 3986701 cggcgaagat gaacgagccc gacgggcgct tggggtcttt cagcccggcc cgggtacgcc 3986761 ggatggcctt ggaaacggcc ttgacggcgt cctcttgccc gatgatccgc ttgtgcagct 3986821 cttcttccat ccgcaacagc cgggtggtct cggcctcggt gagcttgaac accgggatac 3986881 cggtccagtt gcccagcacc tcggcgatct gctcgtcgtc gacctccgcg accacgtcaa 3986941 gatcgcctga acgccactgc ttttcgcgct cagcacgctg tgcgaccagt gtcttctccc 3987001 ggtcgcgcag gctggcggcc ttctcgaagt cctgggcgtc gatagccgat tccttctccc 3987061 gacgagcctc ggcgatcttc tcatcgaact cgcgtaggtc tggcggtgcg gtcatgcgac 3987121 gaatccgcat ccgagcaccc gcctcgtcga tcaggtcgat cgccttgtcg ggcaggaacc 3987181 ggtcgttgat gtagcggtcg gccagggtcg cggcggccac catcgccgca tcggtgatcg 3987241 acacccggtg gtgcgcctcg taccggtccc gcaggccctt gaggatctcg atggtgtgct 3987301 ccaccgtcgg ctcacccacc tgcaccggct ggaagcggcg ctccagcgcg gcgtccttct 3987361 cgatgtactt gcggtattcg tcgagcgtgg tggcgccgat cgtttgcagt tcaccgcgag 3987421 cgagcttcgg tttcaggatc gaggcggcgt cgatcgcgcc ctcggcggct ccagcaccga 3987481 ccaaggtgtg cagctcgtcg ataaacagga tgatgtcacc gcgggtgttg atctccttga 3987541 gcaccttctt gaggcgttcc tcgaagtcac cgcggtagcg gctacccgcc accagcgatc 3987601 ccagatccag cgtgtagagc tgcttgtcct tgagcgtctc gggcacctcg ccgtgcacga 3987661 tggcctgcgc cagtccttcg acgaccgcgg tcttgccgac gccgggctcg ccgatcagca 3987721 ccgggttgtt cttggtgcgc cgagagagca cctgcatgac ccgctcgatt tccttctcgc 3987781 ggccgatgac cgggtccagt ttgccttcca tcgccgccgc cgtgaggttg cggccgaact 3987841 ggtcgagcac caaggacgta gacggagagc cggactctcc cccgcggccg ccggtgccgg 3987901 cttcggcggc ctccttgcct tggtaaccgg agagcagctg gatcacctgc tggcgcaccc 3987961 gggtcagctc ggcgcccagc ttgaccagca cctgggcggc cacgccttca ccctctcgga 3988021 tgaggcccag caaaatgtgt tcggtcccga tgtagttgtg gccaagctgc agcgcttcac 3988081 gcaagctcag ctcgaggacc tttttggcgc ggggggtaaa cggaatgtgc ccagacggcg 3988141 cctgctggcc ctggccgatg atctcctcga cctgactgcg cacaccttcc agcgagatcc 3988201 ccaacgactc cagtgacttg gcggcaacgc cttccccttc atggatcagg cctaaaagaa 3988261 tgtgctcggt gccgatgtag ttgtggttga gcatcctggc ctcttcctga gccaggacga 3988321 cgaccctgcg ggcacggtcg gtaaatcgtt cgaacatcgg tggctacctg ctctccctca 3988381 ccatcggata cagcggtcga caccgcgtac ctgccgtcca ctgtaatggt cggcctgcca 3988441 gggttcctaa ccttgcggtg cctggtcggt tccggggcgc agcgccccaa gtcgccgttg 3988501 aacagaaccg cataggagat aaacgagaaa accacccaag cgtttccggc gccgagcggc 3988561 catcggttcg ccgccagcga acgcggcaaa gtaccggcgc ccaggctttc gcctgggcgc 3988621 cggtagccaa atgtcaggtc gccgcgtggt atgcgtcgat gacgtcggcc gggatccggc 3988681 ctcgcgtcga cacattgtgc ccgttacgac gagcccattc gcggatcgcc gcgctctgct 3988741 cgcggtcgat cgcgccacgt ccacggccgg atccggaacg gccgcgccgg cgcccaccga 3988801 cgcgacggcc cgccgccacc cattgcttca ggtcgccacg cagtttcgtg gcattcttag 3988861 tggaaaggtc gatctcatag gtcaccccgt caagcccgaa ttcgaccgtt tcgtcggcgg 3988921 cgcccgaacc gtcgaaatcg tcgaccaagg tgacggttac tttcttcgcc attggcttac 3988981 cctcgcgttt cttcctgtgc agtacggata gactccccgg tcaccaatct gccataagaa 3989041 cgcagaatac tcaatccaga cacaacaccc acagttcagt tggagtgtgg tcgaacaatc 3989101 gggaacaaaa ctgtctccct aattgacaac ccagtcaaag acatcaacaa ccgatcgata 3989161 cccattccgg ttccggtgca cggtggcatg ccgtactcca gagcggccag aaaatcctcg 3989221 tcaagcacca tcgcttcgtc atcgccagcg gccgcggcac gggcctggtc ggcgaatctc 3989281 tcccgctgga ctaccgggtc gcttaattcc gagtagccgg tggcaagttc gattccgcgc 3989341 agatagaggt cccacttctc ggttacgccg gggatactgc ggtgctgacg ggtcaaaggc 3989401 gttgtctgaa ccggaaaatc cttgacaaat gtgggtgcgc tcaagctctt gcccactgtg 3989461 cgctcccaga gttcctcgat gagtttgccg tggccgaagc cacggttgtc atgaatcgct 3989521 gggtctttct ccaggccaag gctatcggcg atcccacgta agcgatcgac cgtcgtctgc 3989581 ggtgtgatct cttcaccgag cgccacagac agcgacgggt acatttgtat agtcgcccat 3989641 tctccgtcga tgtcatagac actgccgtcg ggcaacggca gttgtctggt tccgatcgcc 3989701 tcatcggcca cctcttgaat aagctcccgg gtgacgactg ccgaatcgtc ataggttccg 3989761 taggtctggt aggtctccag catggagaat tccggagaat gcgtggaatc ggctccttcg 3989821 tttcggaaca ctcgattaag ttcgaagacc ttgtcgaaac cacccacgat gcagcgcttg 3989881 aggaacagtt ccggcgcgat ccgcaggtac agatcgatgt ctagggcatt ggaatgagtg 3989941 gcgaacggac gggccgccgc accaccggct aacgtctgca agacgggcgt ctcgacttcc 3990001 aggaacccac gacgttgaag cgccgtccgg atcgcgcgga cgacggcgat ccgtagtcga 3990061 gccaccgcgc gcgcttccgg tcgaactatg aggtcaacat agcgctgacg aacccgcgac 3990121 tcttcactca tctctttgtg cgcgacggga agcggccgca gcgacttggc ggcgatccgc 3990181 cagcaatccg ccaggacgga cagctcgccg cggcgcgaac tgatcaccgc gccatgcacg 3990241 tagacgatgt cgcccaggtc gacatcggct ttccatgcgt cgagagcagc ctggccgacc 3990301 ttgtcgaggc tgatcatcac ttgcagctgg gtaccatcgc cgtcctgaag tgtcgcaaag 3990361 catagctttc ccgagttgcg cgcaaagatc actcggcccg cgacgccgac gatgtcttcg 3990421 gtcgcggtat cgatcggcaa gtcagggtgg gcggcgcgaa cctcggccaa cgtgtgagtg 3990481 cgcggcaccg cgacgggata gggatcgcgc ccctgggcca gcaagcgagc gcgcttgtcc 3990541 cggcgaatcc ggaactgctc aggaaggtct tctgctgtgt cagcggcact cacgacgtgc 3990601 cagcttaaat gacctcacgc cgacgctcgt gggtggcgtc gagcctgtcg gcggcgggcg 3990661 acccggtacc cagactcgat gccggcatcg acgtcagcgc gccgtcttga gccggccgcg 3990721 ctggacttcg aggttacgct cgaacaccag ccgcagaccc tgcaaggtca ggtgctggtc 3990781 gtaatggtcg acggtgtgca attccggcag cagcaggggc gcggtatgcc cggtagccac 3990841 gatcgcgaca tcgtggtcga cggagaaacc ggacacgtcc tcgcggatgc ggcctaccaa 3990901 cccgtctacc agcccggcga agccgaacac cgcaccggct tgcatgcatt cgacggtgtt 3990961 cttgccaacc accgaacgtg ggcgggcaag ttcaacgcgg cgcaatgccg ccgagcgggc 3991021 cgccgcggca tcggaagaca cctgcacccc gggcgcgatg gcgccgccaa gaaattcacc 3991081 cttggccgat acaacatcaa cacagatcga ggatccaaag tcaacgacga tggcggcctt 3991141 ccggaaccgg tcataggcgg ccaaacagtt cacgatgcgg tctgcgccca cttccttcgg 3991201 gttgtcgacg agcaaaggga tcccggtgcg tactccgggc tcgatcagca cgtgcggcac 3991261 cgacggccag tactggtcga gcattatccg cacctcgtgc agcacggacg ggaccgtgga 3991321 caaggcggcg gtaccggtga gccgctcgga atcctcgccg atcagcccgt cgatcgtcag 3991381 tgccagttcg tcggcggtga cttcggattc ggtgcgtatc cgccactgct gcacgacctt 3991441 tgcgtgctct ttcattccgg acagcaggcc cacaacggtg tgggtgttgc ggacgtcaat 3991501 cgccagcagc acggctatcc cacaccgagc cgggggtcta gcagctcgcc cgcgttttcg 3991561 ggcacaaatg ccggatcgtg gcccatgtcg atcggtttgt tgtaagcgtc gacaaacacg 3991621 atccgcggct ggtatgtgcg ggcccgggcg tcgtccatcg tcgcgtacgc aatcagaatc 3991681 accagatccc ccggatgcac caagtgcgcg gcggcaccgt tgatgccaat cacaccactg 3991741 ccgcgttcgc cggtgatcgc gtaggtgacc agtcgagcac cgttgtcgat atcgacgatg 3991801 gttacctgtt cgccttccag caggtcggcg gcgtccatca agtcggcatc gatggtcacc 3991861 gagccgacgt agtgcaggtc ggcgcaggtc accgtggcgc ggtggatctt cgacttcagc 3991921 atcgtccgta acatcagttt ctccaatgtg attcgaggat tgcccggtat ccgtccgggc 3991981 ggtcggtgcc ggcgaaagtt ccgatttcaa tcgcaatgtt gtccagcagc ctggtggtgc 3992041 caagccgggc agcaaccagc agccgaccgg aaccgttgag cggcatcggg ccaagcccga 3992101 tatcgcgcag ctccaggtag tcgaccgcca cgccgggtgc agcgtcgagc accgcacggg 3992161 cggcatccag cgcggcctgc gcgccagccg ttgccgcatg cgctgcggcc gttagcgccg 3992221 ccgagagcgc gacggccgcc gcacgctggg ccgggtccag gtagcggttg cgcgacgaca 3992281 tcgccagccc gtcggcttcg cgcacggtcg gcacgccgac caccgcgaca tcgaggttga 3992341 agtccgcgac cagctgccgg atcagcacca gctgctggta gtccttctca ccgaagaaca 3992401 cccgatccgg gcgcacgatc tgcagcagct ttagcacgac cgtcagcacg ccggcgaaat 3992461 gggttggccg cgggccgccc tcgagttcgg cggccaacgg accgggttgc acggtggtgc 3992521 gcaggccgtc gggatacatc gccgcggtag ttggcgtgaa agcgatttcc acgccttcgg 3992581 cccgcagttg cgccaggtcg tcgtccgggg tgcggggata ggcgtcgaga tcttccccgg 3992641 caccgaattg catcgggttg acgaagatcg acacgacgac gaccgatccg ggcacccgct 3992701 tggccgcacg caccaacgcg aggtggcctt cgtgcagcgc acccatagta ggcaccaaca 3992761 tcactcgccg gccggtgagt cgcagtgcgc gactgacatc ggcgacatcc cccggtgccg 3992821 agtacacatt gagttcaccg ggatggaacg caggaatcgt catgccgtca aaacctcgac 3992881 gacatccgcg ggggcgtgtg cgcgctgcgc ggtccgcagc gcgtttatcc ggtatgcctg 3992941 ggccagcgct gcgtcgacgt ccgcgagggc cgccagatga tccgcgaccg ctgccgcatc 3993001 gccgcgggcg accggtccgg tgagcgcggc ctgtccccgc tgcagcgtgt tctccagcgc 3993061 cgctctggcc agcggcccga cgatgcgctc cacgatcccg cccggctggt cgtcgacggt 3993121 ttgttggccg agcagttccc ccccgctcag ggcggcccgc aacgcctcga gcgcatcggc 3993181 cagcacggtg acgatgtggt tgctcgcatg ggccagcgcc gcgtggtaga ggatgcgggc 3993241 gtcttcgcgc acacaaaacg gctccccgcc catctcaaga accagtgact gtccgatcgc 3993301 atacccgacg tcgtcggccg cggtgatccc gaagcaggta tccggcagcc ggctgatgtc 3993361 ctcgtcggag ccggtgaagg tcatcgccgg gtgaatcgcc aatggtatgc agccctgttg 3993421 ggctagcggc gccagaatgc caatcccgtt agctccggag gtgtgcgcca caatcgtttg 3993481 tggccgcacc gccgaggtgg ctgccaggcc ggataccagg ccggcgagtt cgctgtcggt 3993541 gaccgccaat agcagcagct cagcgctggc cgcgacgtcc agcggtggca gcaccggggt 3993601 atcaggcagc cggcgctgcg cgcgccgccg ggacgcatga gagatggcgc tgcacgccac 3993661 cacaacatgg tcggcgcgct gcagcgcgac ccctagcgcg gtgccgaccc ggccagccga 3993721 gatgatcccc accttgagcc tggccggacg caaaccgtcg aaccgctcca tagcagacgg 3993781 cctcacaggt ttcttggttc gttccagtcc catgcccggg taccggacgg tcaccaagac 3993841 tgtagtcgat ttgcacgtca agacccaccc agggcactgc tgatttggtc actacaccaa 3993901 cagtgtcggt tgccggcggc aatcgggcgg gtacaccctg gcacaagcgg cgccgctatt 3993961 caccgcggcg gcgacgccgg ccgcctccgg tcgactcgac ctgaagcctc gccataaggt 3994021 cggcgaccga ctggccgccg gttagcgggt cccgcgcatg caaaccaccg gagtcatccg 3994081 gcggcgtgtc cgcggtgcgg tgccggcgtg tgggctcagc aggcggcggc ggggccattg 3994141 gaggtgacgg cggacgctca cccgtcccgg caccggagcc gccgatgtca tggtcgcggt 3994201 gctccgccga atgacgcgac cggcgaccgg attcaccgta ttgtgccgcg agctcgacgt 3994261 aggcgggcgg attgtaggcc tggtctgctg ggctggcatg ccgggcgcgg cgccggcgcc 3994321 ccggcggcgg ggccgccggc gtggtttcgg gttcgacgga tgcccactgg ctgccaggtg 3994381 tttcggcagg cagccactgc ccgtggctgg tgaccggctg ccagactggt cgctcctgtt 3994441 gcggcgggag cggcgggggc cggtggcgcg gctcgaataa cggctcaggt tgcggcggtg 3994501 gcggagcctc ataatgccgc ggtcctccgc tcaccggtgg tacccccacc tccggcacat 3994561 cgatgatcga ggcctcgtcg gtgcggctgg cgccatcacc cccgcggacc gccataaccc 3994621 gatcgctgga tacccagtcc gcgggagggc tctcgccatc gagggcacgc gcggccctgg 3994681 cctctttctc cccggtcccc agcgccggac ggtgctcgag gtcggcgtcg aacaaaatct 3994741 ccaggctggt tcgcagcgcg gccagttcgg cccgcagggc tgctacctcg tcggcggccg 3994801 gagcgcgcaa ctccgaggcc agctcgcggc gcagctgaga ttccagggtc agctcgtact 3994861 cccggcgcgc cgaaatctcg cgatccaact gaaggtcata gaccagcttc aggtcacgca 3994921 cccgggcctg atccacgtcg ctttgccggc ggtaaagcac cgacacaaac gcacccgcga 3994981 ccgccgccca cagcgccagc agaacagcga gcttgagaag ttccacgcga tcggtgaaaa 3995041 ccaatgcgga actggcccca atcgccagga ccagcaacgc cgtcaaaagc acccaccccg 3995101 gcctgcggcc gccgcgccgg acccgggcgc cgcgggacag aacggtcatg gcctgactgt 3995161 acccgggcga ggtcaatccg cgtgtcgcgc cggtccggcg attcccgcat ggcttagccg 3995221 ggtaggcagt tcggccaaat tcgccgcgta gacaaccccg cattccgggt cggccgcccg 3995281 gccagcaacg tcgacgaccg acgcccatcg ccggttcggc catggccgac gcagcgcgcg 3995341 gcaccgtcgg gtgtgggcta gctttccgcg ccgtcggcgt gctcggtcgg atcctgcgga 3995401 gacttgcagc aatgttgcag ccaaagcgcg gcaaccacca acgctagcgc gctgcccgcc 3995461 gccaccaccg tgccagtggt gtcctcggcg gccgcccgca gccatgaccg ccgcggcagg 3995521 aagtacgcca gcaccccgat ccaccacccc gtcaccagcg cacccaccca ggccgaggcc 3995581 ttggctacca tcaagctgcg cgccaccaca agcgggtgca gccagccggg cccgtctccg 3995641 atctcaccat cgctgatctt gacccgcacg tagcgagccc acaacgcctc ggcgaccgcg 3995701 accgcgagca aggacaagcc cgtccacacc gtgatcggcg gaaaccaccg gtaaagcacc 3995761 gccaccaaca gatatcccac cgccgcggcg ccgaccaccg cggcggtcag atcacgtttt 3995821 cgggtcggtc ccatcagctt tccggtgccc gactgacggg gtgtctgcta ttcagatcga 3995881 acgacggcct aaacaaccgc acactgtcgc ggtcggcggg ctccagctcg gccagcagtc 3995941 gcgtgacggg ccgcgggcac ccggcaaccg tcagctgcgc cgttgggtcg acggcaatcc 3996001 acgggatcaa cacaaaggcc cgcagatgcg ccagtgggtg cggcagcgtg aggtggttct 3996061 cccgcgcggt cacttcgacc agagcctcgg tggccgaggt ctggtagcag gcgatcaggt 3996121 cgacgtcgag atttcgtgga ccccagcgct ggccacgcac cctgcccgca gcgcgctcga 3996181 actcctgcgc ccgccgcagc cactcccgcg gttcgcaggt aggatcgtcg gcgatcagca 3996241 ccgcattgag gaactgcccc tgctccaccc caccccaggg gtcggcctca tatatcgggg 3996301 aagccgcaat caacgcatcg ccgagaccgt cggcgaccga ccgcaatcgt gccaggcggt 3996361 cacccaggtt ggagccaacc gagagcacta cccgcgtcat accgcgccgc ccgccgggac 3996421 tacccaaccg cggccgccgc gccgtgagcg tcggatcacc accgccacat cgtcgaacgt 3996481 ctgcggaatg ggcgcctgcg gcttgtgtac cgccacctca acggcatgca ctcgctggtc 3996541 gtccatcacg tgatcagcga tctcggcccc gaccgtttcg atcagcttcc gcgggggtcc 3996601 ggcgacgatc tcggccgccc gcgaagccag ccgcacgtag tcataggtgt cggccaagtc 3996661 gtcgctgttg gcggcctcgg ccaggtctat ccacacggtg acatcgatga caaaccgctg 3996721 cccggccact cgctcgtggt cgtagacccc gtgccgacca tgcacggtca ggccgcgcag 3996781 ttcgattcgg tcagccatcg cgttctatcc tttccgctcc catccacgct tcgaccacct 3996841 tgatggcatc gaccgaggcc cgcacatcat gcacccgcac accccaggcc ccgtgcagtg 3996901 cggccagcgc ggaaatcacc gccgtcgcgg tgtcacgccc atcggttggc cgcatcacgc 3996961 cgtcgggccc ggccaacaac gcaccgagga agcgcttgcg cgaagcaccc accagcactg 3997021 ggattccggt cgcgaccagt tccggaaggg catgcaagat cgcccaatta tgttgcgccg 3997081 tcttggcgaa tccaagcccg ggatcgagca ccagccttgc cgggtcgacg cctgcggcca 3997141 ccgcgtcggc gacgctggcc agcaggtcgg cacggacctc ggccaccacg ttgccgtagc 3997201 gcacaggcac atgcggggta tcggccgata ccgcccgcca gtgcatcaac acccacggca 3997261 catcggcctc ggccaacagc ggccccatcg ccggatcggc ccgcccaccc gacacgtcgt 3997321 tgaccatctg ggcaccgttc tgcaacgccg cccgagcgac atccgcgcgc atggtatcga 3997381 tgctgacggt gatgccttgt gctgcaagct ctttgacgac gggtatgaca cgagacgtct 3997441 ccaccgccgg gtcaacccga gtggcaccgg gccggctcga ctcaccaccg acgtcgacga 3997501 tgcccgcacc tgcggctgcc atcgccagac cgtgcttcac cgcatcgtcg agatcgagat 3997561 aacacccgcc gtccgagaaa gagtcgtccg tgacgtttag aacccccatc acctgcacgg 3997621 gcgccggact cacttccgca aaatgaggtc gagcgcttcg gctcgagaag cggcattggt 3997681 tttgaacagt ccgcgcaccg ccgacgtagt ggtgaccgag ccgggcttgc gaaccccgcg 3997741 catcgccatg cacagatgct cagcctcgat caccacgatt accccgcgtg gatcgagttt 3997801 tttcatcagg gcatcggcga tctgactggt gagccgctcc tggacctgag gtcgcttggc 3997861 gtacagatcg accagtcgcg cgatctttga caagccggtc accctgccgt cgtcgcccgg 3997921 gatgtagccg acgtgggcca caccgtggaa cgccaccagg tggtgttcgc aggtggagta 3997981 catagggatt tccttgacca acaccagctc gtcgtggtct tcgtcgaaca tggtgttcaa 3998041 caccgagtcg gggtcggtgt agagcccggc gaacatttcg cggtatgacc gggcaacccg 3998101 ggacggggtg gctaccaagc cgtccctatc cggatcctcg ccgatcgcgt acagcaattc 3998161 gcgcaccgcg gcctcggcac gttgctggtc gaacacacgg atacgagcag atgcgctgcg 3998221 cgaatccagc tgcgacatcg aatgctccgt tcgtcagccg tgggccggct tggtccgact 3998281 gacctcgtca tcctgctccg ccgaggactc atcggaaccc ggatcggctt gaccggtcgg 3998341 gtagggctga cccggatacg tcggtgccgg ttcaccgcta tagctgggcc gatgagatga 3998401 ccttgggggc catcccggcg catgccagcc cgccggggca ccgtagtcag gctgggtgga 3998461 gccgtactgg cggtcaccgg accggtgggt gccggcgggc gaaccgttgg cgccgtgccc 3998521 ggtttggccg gcgtcggacc gggcggcctc agcggcttgg gtagcctgcg caatcgccgc 3998581 cttgaacgcc ggctcgggga ccggctgggg ccaaggttcg ccgcgttcga tcgcgagctc 3998641 gccgggtgtc ttgatgggcg gtttgtccga cgggatccgg ccaccgaagt cgtcgaacat 3998701 ggtgagccgc ggccgctttt cgacgtcagc gaagatgctt tccagctcgg gtcggtgcag 3998761 ggtctccttt tccagcagct cgccggccaa agtgtccagc acgtcgcggt attcggtcag 3998821 gatttcccac gcttcggtat gcgccgcctc gataagcttg cggacctctt cgtcgatctc 3998881 gcgggcgacc tcgtgggagt agtccggctg ggtgcccatg gtacgtccga ggaacgggtc 3998941 gccgtgttcg gagccgtatt tgaccgcgcc cagcttggag ctcattccaa attcggtgac 3999001 cattgagcgc gctatcttgg tggcctgctc gatgtcggac accgcgccgg tggtcggctc 3999061 acgaaacacc agttcttcgg cggcgcgccc acccatcgcg aacaccagtt gcgcgatcat 3999121 ttccgagcgg gtccgcaggc ccttgtcttc ttccggcacc gccaccgcgt gcccgccggt 3999181 acgcccgcgc gccaggatcg tcaccttata aatcggctcg atatcgggca tcgcccaagc 3999241 ggccagggtg tgcccgccct cgtgataggc ggtgatcttc ttctcctgct cgctgatgat 3999301 ccggcctttg cggcgcgggc cgccgatcac ccggtccacc gcttcctcga gggcgggacc 3999361 ggtgatgacg gtgccgttct cccgggcggt cagcagcgcc gcctcgttga tgacgttggc 3999421 caggtcggct ccggtcatgc cgacggtccg cttggccagt ccgtcgaggt cggcgtccgc 3999481 ggccatcggc ttgcccttgg agtgcacgcg cagcaccgcc cgccgacccg ccagatcggg 3999541 gttggatacc gggatctggc ggtcgaagcg gcccggccgc aacagcgccg ggtccaggat 3999601 gtcgggccgg ttggtggccg cgatcaggat gacgccggcg cgatcgccaa aaccgtccat 3999661 ttcgactagc aactggttga gggtctgctc acgctcgtcg tgaccgccgc ccagcccggc 3999721 gcctctttgt cggccgacgg cgtcgatctc gtcgacgaag atgatgcacg ggctgttctg 3999781 cttggcctgc tcgaacaggt ctctgacacg ggatgcgccg acgccgacga acatttcgac 3999841 gaagtcggag ccggagatgg tgaagaacgg cactccggct tcgccggcca ccgcacgagc 3999901 cagcaacgtc ttaccggttc ccggcggccc gtagagcagc acgcctttgg ggatcttggc 3999961 gcccagcgct tggtacctgc tggggttctg caggaagtcc ttgatctcgt agagctcctc 4000021 gaccgcctcg tcgacacctg cgacgtcggc gaaggtggtc ttgggcatgt ccttgctcag 4000081 ttgcttggcg cgtgacttgc cgaacccgaa gcccatccgg gcgccgcctt gcatgcggga 4000141 gaacatcacg aacagcccca ccagcaacag cagcggcagc acgtagacca gcagctcgcc 4000201 caggatgctg ccctggttga cgaccgtgct gaccttcgcg tttttggcgc tgagcgcgtt 4000261 gaacaggtcg acggcgtacc cggtggggta cttggtgatg accttctcgg acccgtcggt 4000321 ctcgttgtta cccttcttca ggatcagccg cagctgttgc tcgcgatcgt cgatctgtgc 4000381 gctcttgacg ttgtcgccgt tgatctgtgt tatcgccacc gaggtatcaa cgggcttgta 4000441 gccgcgggtg tcgtcgctga agtaaaagaa cgaccagccg agcagcacca cgacggcgat 4000501 cgctgttatg gtgcgagtca cgtttttccg gttcatcgat catcggccgt gccggccagg 4000561 tccttcccga tacacgcagc tggaaagtcc aggttaccgc tcgtggcgat cgcaaacccg 4000621 gcggagccgg gtgcagcggg tcgccaccat cagccccgtg gcgatcgcaa accccgcgcc 4000681 tggcgacaat gcggcccgca aaacgggccg aggaggagcc aggcaatcac cccagagccg 4000741 ggtgcagcgg gtcgccacca tcagccccgt ggcgatcgca aaccccgcgc ctggcgacaa 4000801 tgcggcccgc aaaacgggcc gaggaggagc caggcaatca ccccagagcc gggtgcagcg 4000861 ggtcgccacc atcagccccg tggcgatcgc aaaccccgcg cctggcgaca atgcggcccg 4000921 caaaacgggc cgaggaggag ccaggcaatc accccagagc cgggtgcagc gggtcgccac 4000981 catcagcccc gtggcgatcg caaaccccgc gcctggcgac aatgcggccc gcaaaacggg 4001041 ccgaggagga gccaggcaat caccccagag ccgggtgcag cgggtcgcca ctggctagac 4001101 caacgaccgg tagttcccga cggcgtcgga aaatccgaca gctgagcgtt cgggtcaaac 4001161 acgcggtgca ccggacctga tttggctcga attggtgcgc accgagggtc gggcacatcg 4001221 ctccggtcgc atgtgtcact gcaccgggcg acacccgatc tgcccagctc tcagcgacag 4001281 ctgcctgacc tgcggttttg ttcacaagtt ggttgcggct gtgcgggatt gtaggcggcg 4001341 ttgaccggca gaaaccgagt tgtcgcgcat aggtgagcac agcgaccatc gcccccggtg 4001401 gagtccagtg ttgcggacgt gactaaagag cagcacgggc agcgggagca gaactcgggt 4001461 caattgagtc atccagcgcg cgaacgtggt tcggcgcagc cccggttggc tgtctgggcg 4001521 tgaaggtgct cccgagcggc cggcccgcca tgaaggcgcg ccaaagcttt ggcattgtgc 4001581 acattttcca cccgtgctct attaatgctg agccgcgaat tgtgagccca gtcgggaaac 4001641 acgcggagca ccagagtcac cgcagcggcc ggggcggttc aactcaccat ggatcgctct 4001701 cgtcgtctgg tgctggacaa tcgtcgctgt agcgcgtcgc gaacacctca gcttctgctg 4001761 ccgcggcttc ttccggcgat ggtaaccccc aggtttcgcc cacggtctta cgtagcagtg 4001821 cgacgcggtg ttcatctgca tcgacctgtt gactcatcct gtcaaggatg aaggcgtact 4001881 gggccgactg cgccttctgc cgcgccaggt cggcaatcac caggatctca gaagcgagct 4001941 gcgactcact catccaggcc accctggccg acagctcgac atggtcaatc cggccgtcca 4002001 tcagcgtcga taccgacacc gtgcgtgggg gattcgtcac ggtaaaaagc gcgatctctt 4002061 gttcggtgtc cgtctccgcc tgaccgtggg cattgtccag gtcgggtccg gtgtccgggg 4002121 tcgccgccga cccgacgcca ataatcggat ccgcagtcca gccctccgcg ccgtcggcgc 4002181 cccagagatc cacggcgtcg aaatcgttgc tgtcaaagtc atttccgggc aagtccaccg 4002241 tcccttcgga attcattgcc acccgggaag ggtcggcctg ggcagctggc gtggtcagtc 4002301 cgaacaggtc gttgggaaga cgctgtggcc tgcactgcgg gcagcaaacg tggtcaggta 4002361 aacaacccgt cgatagcctt gcgccacgct tcgtcggcct cgctatatat tttcgccgca 4002421 attcgaagac ttttggcgag atcgacaccg gccgtatgca aggacgagcc cagggcattg 4002481 tgggcagtca agtacacatt taacgtgtcg ttgaactgtg agcagtacgg accgtgagtg 4002541 atcgccacag attcgcctag gccagcggca gcttcgacgc ccgaggaggc atcgaccgcc 4002601 gcgttgtcat ggtgcgacgc cagtacaccg agacgctcgg gctggacggt caagttttcc 4002661 gtcattgatc gtgtcccttc cgtttagcat tgcgcgttgt taggcgctgg ctagcaatgg 4002721 atttggctcg ccatgccgtt agacgacgtt tcgtaccagc accttttgcc caccgcccgc 4002781 gtcagcttcg actggcgcgc gctcggcgtc ttcagtgccc gccgccgcgc cttccgagta 4002841 cttcttcgtc gtcgtccctt tcgacgcccc cgaagagggg tgcatgccgc ccatgcctac 4002901 gggtccgccc ataccttggg aaccctgcgc ggagaccagc tgcgactgcc cgccgacctg 4002961 ctcggcagcg gcgccgaccg ggccatcagc tcggggccgt agcgcctgcc gagttgaggc 4003021 ggcatggacc tgagccaggc tcggcaagcc cccaaaaccg gacccgcccc caatgccggc 4003081 cagggcgggc aagctggctg agctcgccag gctatccgcg tgagccaagc ccgacgatgc 4003141 ggacagaccg gccgcaccga acaagccagt cacttgcgac aagccgctgg tcgcgccggt 4003201 caagccgggg acgcccgcaa agaaggactc caggttcgac caccctcgag agaacagtcc 4003261 ggtcacccac cccgtgagct tgtcccaaag ctctttcagg ccgttgagcg cgtttgtgat 4003321 gaactcccac acttctccga ggatgccctt gatgatgtcc gccacatccg aaatgatgtc 4003381 cgcaatggcg gccgcgacca actccgccaa tttggcaagc aatttgagga gttgagtcgc 4003441 gttgatcagc gttttcacgg ccaagtaggc aagcgcgccg cccactacgg ccatcgcgcc 4003501 cgcgcaaaac ggcgcctgga aggcggccga tagggcgtgc ccgacgaccg ggatgtaggt 4003561 caggtccaca gccaccgggc gcacgaactc gagacctttc ttggcgccct ccaggatgtc 4003621 gcgggtcgtc tggaccgcgt tggcctggtc gtggatcagg ctgatgagct gacgatcgag 4003681 gtctgccagt tcctggaaaa aattcacgtg gttgcggttt ttgccggcgt atttgtccgc 4003741 ggccgaacct aaccagccat cacccggaaa cgctgctgcc agctcctcca gggctttttc 4003801 gaagtactct agtgaggagt aaaggatacc cccttggttg ggtattccaa tccccagaag 4003861 gtcgtacaag ccgtcaatgg cactgatcgt tggatcgatg atgaacgctc tgctcatgcc 4003921 tgccgcctat ctcaacggtc gtcgattcca tgcatagact tggttctgca ttgcgcgcgt 4003981 agggcctaca gtctggctgt catgcttggc cgatgtcaac agttttttcc catgctaagc 4004041 agatcgtcag ttttgagttc gtgaagacgg catgttcact tgttgtcgac tacatcgtct 4004101 gcgcacattt gccctcctgc aactgcgctg cgacaatgcg ccaaccgccg tgtaggcggc 4004161 gcgatcccaa ggcagtgtct ccgacgtcga tgcctgcgct tcgccttcga tcggtatgag 4004221 atctgttgca ggagagtcta tatagtgtgc tcatggggct agccggcggc ggcctcgtgg 4004281 cgggcacaat cacctcgccg gtggcgcaat cagggctgtg ctaacccacc atcactcacc 4004341 cgattcggcg tcgaagcggg gcgctctcat ggttgcgagg caatgaacca gaccaatgat 4004401 cgccagcaag cggtgatcga tgcactggtg ggtgccggcc tggaccgcaa ggacatccgc 4004461 accaccaggg tcaccgtggc accgcagtac agcaatccgg agccggccgg aaccgccacc 4004521 atcaccgggt atcgggcaga caacgacatc gaggtgaaga tccacccgac cgacgccgcg 4004581 tcgcggctgc tggccctcgt cgtcagcacc ggcggtgacg ccacccggat cagctcggtc 4004641 agctactcga ttggcgacga ctcgcagctg gtgaaggatg cccgggcgcg cgccttccaa 4004701 gacgccaaga accgtgcgga ccagtacgca caactgtcgg ggctgcggct aggcaaggtg 4004761 atctcgatct ccgaggcatc tggcgccgcg cccacgcacg aggcgccggc gccgccgcgc 4004821 ggcctatccg cggtgccgct ggaacccggc cagcagacgg tgggcttctc ggtcacggtg 4004881 gtctgggaac tgacctagcc gcctactgat agaccctggg gtccagcgtc ccgatgtatg 4004941 acaggtcacg gtagcgttcg tcgtagtcca ggccgtagcc cacgacgaag tcgttgggaa 4005001 tgtcgaaacc cacgtacgcg atttcgacgt tggcgtgcac cgcatcgggc ttgcgcagca 4005061 gcgtgcacac ccgcaatgac cgcggattcc ggctcgtcag gttccgcgac aaccacgaaa 4005121 gcgtaaggcc ggagtcgacg acgtcctcga cgatcagcac gtcgcggccg tggatgtcgc 4005181 ggtcgaggtc cttgaggatc cgcaccacgc ccgacgagga tgtcgatgac ccatacgaac 4005241 tcaccgccat gaactcgaac tgggtcggca cgggaatcgc tcgcgccagg tcggtgacga 4005301 agagcaccgc gcccttcagc acggtgatca gcagcagatc ctggccggtg gtagcggaca 4005361 gctcgcggta gtcgttgccg atctgctcgc cgagctcggc gatgcgggcc tgaatctgct 4005421 cggccgtgag cagcaccgac ttgatgtccc ccggataaag ctccgccgtc tgcccggggg 4005481 tgatcgccga ggagctctgg gtcacgtgca cagcgtgcca cgccgcggga ccaacgacca 4005541 acgcgggcgt caaacgggct cgcgccgcaa cacaagtacg ccgtcgcgcc gcccggcgac 4005601 cagtcgctga ccgcgcaacg tggacccaac cgctaccccg ccctgaccgc gccacgcggt 4005661 gaccagccgg tccactccgc ggatctgcct gtcggtcagt ccggtcgcgc cgccggccag 4005721 cagccagccc cgaatcaccc ggcgccgcac cgcatccggc agcgcggtca aggcgctggt 4005781 actcaactcc tgtccccgtg agccagcaac agcggctccg ggcagcgcct gcgcagcgat 4005841 cgtgtcgatg aggtcagtgt cctcgcgcaa cgctgtcgcg gtgcgagcca gcgcttcggc 4005901 cacacctccg cccagcacgt cctccagcag tggcagcact tcggtgcgca atcgggttcg 4005961 ggtgaagcgg cggtcggtgt tgtgcggatc ctgccaggcg gtcaggccca gctcccggca 4006021 ggccgcatgt gtcacgctgc ggcgcacccc cagcagcggc cggcaccagg gcggatcgta 4006081 cggacgcatg ccggcgatcg accgggcccc cgaaccacgg ccaagcccca acaacactgt 4006141 ctcggcctga tcatcgagcg tatgggccaa cagcaccggg ccatcgcggt gctcctccaa 4006201 tgccgagtag cgggcgctgc gcgccgccgc ctcccggccg ccggccgcgc ccacctgaac 4006261 gcaaagcacc cgcgcgtcca cacatcccag cgaaatcgct tgtatgcgag ctgtttccgc 4006321 gaccgtggcc gagccgggct gcagaccgtg gtccacgatc agtgcggtgg tgggccacag 4006381 ccgtgcggct acagcggtga gcgccaacga gtccgggccg ccggagagcc ccacgctcca 4006441 acggtcgcag gcgtcgagat ggacccgagc gaactgctcc gcagccgcac gcagctgcgc 4006501 tacagcactc tgtcgatcca tcgctgcggg ttttcgatct cggcaggcaa cggcagcgtc 4006561 tcggggcccg accagatcgt gttgaacagc ttcattcccg cccggtcgac cacatggtcg 4006621 acgaatgcct tgcctcgggt gtactggctg agcttggcgt cgaagcccag cagagctcgc 4006681 accagccgct gcagcggcgg ctgtttgtga tgacgacggt cgtcgaagcg gcggcggatg 4006741 gtggccaccg agggcaccac catcggcccg accgcatcca tcacatgctc ggcatggcct 4006801 tccagcagcg tgccaagtac cagcagctgg tctaaggcct tacgttgcgg ctcggattgc 4006861 acggctcgca ccaggcccag aatgcccgac gggttgacct cggaatcgtc ggtaccgtgt 4006921 ccacggctgc ggatgaagtc cgccagccgg ctcaccaccc gcccgatgtc gtcaacgggt 4006981 tcgaaggtca acaggtttag cgcctgcgac atgtagccgg acagccaggg gttggcggtg 4007041 aactggactc ggtgggtgac ctcgtgcagg cacacccaca accggaaatc ggacggctcg 4007101 acccgcagtt gacgctcgac ggcgatcaca ttgggatata ccagcagcaa gcagccttct 4007161 ccggcggctc cgaacgggtc gtactggccg aggatgcccg aggccacaaa cgccagcacg 4007221 gcaccggtct gcgcaccggt gatccgaccg gtgagaaacc cccgcggttt ggcgcttccg 4007281 tgcgtcatcg cccgcatcga ttcggcggcc gagcgaatcc acgccggccg gtcgacgaca 4007341 cgggccggcg gcaccacacc gtcggcgatc agaccggtga cgtcgcgcac cggcggttcg 4007401 gccttctccg ccgcgacggt cagctcgtcg atcacctggc gacgggtgta ttcggtggac 4007461 ggcggagcgg gccgggccag ccgctccccg acgctggccg caaattccca atcgaccgtg 4007521 ttccccagtg tcagctcgga cgctccggtc acgtcgtgca cccgcagaac cacaacttag 4007581 tggccagagc gtccatcgcg ttgcgaccgt tgggaccggc ttcgttggag atgaacgcga 4007641 aggtgagcac tcggccgcta cggtcggtga gcaccccgac tagcgagttg atcgcggtca 4007701 gcgagccggt cttggcccgc aaccacccgg ccggaccctg gtcggtggcc gcgtcgagga 4007761 agcgctcgcc cagcgtgcca ctgccaccgg cgatcggtag cagatccagc agcggccgca 4007821 acgcgggctg gtcgggtcca gccgcggcct gcatcgttgc atcgagcgtc cgagcggtca 4007881 ggcggttgtc gagcgacaat ccactagaat ccaccagcgc agcgccggcg gtgtcgatgt 4007941 gtgcggtgtt caatcggctg gtcaccgcgt cgaccgcgcc actaaagctc tgcggccggt 4008001 tgatcgcgac cgctacctcg cggccgatgc actcggccat cacattgtcg gaggcgttca 4008061 tcatctgaga cagtcgctgg atcaacggcg ccgactgcac cacggccagc tgccgcgcgc 4008121 cggccggagc cgatgcgatc gtcaccgccg cggggtccag gccaagggct ttggccaact 4008181 cccgaccggc atccagcgcc ggggtgcggg accgtctcga attgacggtg gtcggctgga 4008241 tacgcccggc gtcgatcatc gccgcttcga tcggcgcgat gtcaccgttg tcgatatcgg 4008301 ccggatccca acccggcgcc atcgtcggac cgctaaacgc cgaagcgtcc acctgcacgg 4008361 cggtgggcgt cacaccgctg cggcgaattt gttcgacgag gtcaccgatg cgagccgcgc 4008421 cgtgatacca ggtgtcctga ccgggcggcg ctgccgacag cgtcggatcg cccgcgccca 4008481 ccaacacgac aggtccctgg gggttctggc cgccggccac cacccgcgtg ctgatccggg 4008541 cctgtcggtc cagtgtcagc agagccgccg ccgccgtcag gattttgttg gtcgaagccg 4008601 gcaccaaggg cacgtcgtct agccgctgcc aaagttcttg tccggtcagg gcatcggtga 4008661 tccgacctgc taacttgccc agatcaggat cgaccgccac caccgcaagc gccgcggtca 4008721 cgccagcggc actcggtgtc gcagcggtgt ccgccacagg gaccactccc gccttgactg 4008781 tgggtggccg cggtggaggc acaggtgcgc gcacgccagc ccggtgacca ccagtagtga 4008841 ccagcgctgc ggccgccacc acaacggcga caaacgccag cacggccgcg ccgacgacca 4008901 cgtgcgtgga tttccgccag cgtgtgggac ccatgagctc tcctgccttt ccggtcccat 4008961 tctgccgaac cggccgggcg acgctgccac ggtaccggct cgactagggt gtccacggac 4009021 gcattggacc tgcccgttgt cccatgcact ctgatctgaa ggagccgacg cgtgcaattc 4009081 gacgtgacca tcgaaattcc caagggccag cgcaacaaat acgaggtcga ccatgagacg 4009141 gggcgggttc gtctggaccg gtacctgtac accccgatgg cctacccgac cgactacggc 4009201 ttcatcgagg acaccctagg tgacgatggc gacccgctgg acgcgctggt gctgctaccg 4009261 cagccggtct tccccggggt gctggtggcg gcgcggccgg tggggatgtt ccggatggtc 4009321 gacgagcacg gcggcgacga caaagtgctg tgcgtcccag ccggtgaccc ccggtgggac 4009381 cacgtccaag acatcgggga cgttccggct ttcgagctgg atgcgatcaa gcatttcttt 4009441 gtgcactaca aggacctgga accaggtaag ttcgtcaagg cggccgactg ggtcgaccgc 4009501 gccgaagccg aggcagaggt gcagcgttca gtggagcgct tcaaggccgg tacacactga 4009561 tttgggctta gggcgcccgc cccgcgcctt ggcaccctcc gccggtcatg atccgaactt 4009621 cgtgggggac ctgactgtta ggcgattgcg ccgcacactc tcggtgaacg ccgccccgat 4009681 aaaaaccacc cccaccgaag cggtgaccca ctcggggacg gcgaatcggt ggtcgatgga 4009741 caacagcaag attatggcga gcgcgccaat cgcccagtgt gcgccgtgtt ccaggtacac 4009801 gtaccggtcc agtgtgtcct gtcgcaccag atagatcgtg atcgaccgga caaacatcgc 4009861 acccaccaca ccaaggccga gcgcgatgat gatcgggtcc gtagtgatcg caaaggcccc 4009921 ggtgacgccg tcgaaagaga aggcggcgtc gagcacctcc agatacagga acaacgcgca 4009981 accagccttt ccggccgcct gcctcgcctg cacgcccggc gtggcttcac ccaaccccgc 4010041 cggccggaac gcccggctga tcccgttgac gacaagatag gtcaccatgc ccaaaaggcc 4010101 ggcgatcagc accgtacccc gctgatcgct ggagtgtgtc aacagcgcgc cggcaaggac 4010161 caacccaaca ctggccacta tcaccgggac ctgaccgagt cgaccgatgc gggcaaaggg 4010221 gacctcaatc cacttcagcc atttgatatc gcggtcgtga acgacgaagt ccaggaaaag 4010281 catcagcagg aacatgccgc cgaacgccgc gatctgcgga tgcgcagcgg tgatcagttt 4010341 ttcatagctg ggcgatccgt ccgcaaattc cagcgcgcca tgggccggtg gacgaagcgc 4010401 cagctccatt gcgcggacgg ggtccaggcc cgcggtggtc cagatgatgg ccagcgggaa 4010461 caccagccgc atcccgaaca ccgcaataag aatcccgatg gtcaggaaca tccgctgcca 4010521 aaacgggctc atccgctgca gaatcgcggc gttgatgatg gcgttgtcga acgacagcga 4010581 tacctcaagg agcgccagaa ccgccagcaa gaacagggcg gtcggcccgc cgtgcaaata 4010641 tccggtaacc aacgccacca ccgtcatcag cagcgagaag ccgaagatgc ggaacgttga 4010701 catggatcct tccgaggaaa aaccccacaa tagcgacgaa ccgacatcaa ttggtcaggt 4010761 tcgcgccgcg cagcgcggcc aaccggcccg cctactattt tcagtcgtga cgatccatgt 4010821 cggttggccg ttggcgccgc cgcggtgacc gaagtcggcg atacggcatc tcctgttggc 4010881 tcctcgggcg cctctggcgg agctatcgca agcggcagcg tagcccgggt cggcacggcg 4010941 accgcggtta ccgcgctgtg cggctacgcg gtgatttatc tggcggcccg caacctggct 4011001 cccaacggct tctcggtatt cggggtgttc tggggcgcat tcggactggt caccggggcc 4011061 gccaacggcc tgctgcaaga aaccacccgc gaggtccgct cgctggggta cttggacgtc 4011121 tctgcagacg gccgccgtac ccatccgctg cgggtctccg ggatggtcgg cctcggctcg 4011181 ttggtcgtga tcgccggtag ctcaccgttg tggagcgggc gggtattcgc cgaggcgcgc 4011241 tggctatcgg tcgcattgct cagcatcggg ctggctgggt tttgcctaca cgccaccctg 4011301 ctgggcatgc tggccggcac caaccggtgg acccagtacg gcgcgctgat ggtggccgac 4011361 gcggtcatcc gggtggtggt cgccgcggcc acgttcgtga tcggatggca gctggtcggg 4011421 ttcatctggg caaccgtggc gggttcggtt gcctggctga tcatgttgat gacctcaccc 4011481 ccgacacgcg cggccgcccg cttgatgacg cccggcgcta ctgcgacatt cctgaggggc 4011541 gccgcccatt cgatcatcgc ggccggtgcc agcgcgatat tggtgatggg gtttccggtc 4011601 ttgctgaagc taacctccaa tgaactgggc gcgcagggag gcgttgtcat ccttgcggtg 4011661 acgttaaccc gggcgccact gctggtgcca ctgaccgcca tgcaaggcaa cctcatcgcg 4011721 catttcgtcg atgaacgcac cgagcggatt cgggcgctaa tcgcgccggc ggcgctcatc 4011781 ggcggcgttg gcgcagtcgg gatgctggcg gccggcgtcg taggtccatg gattatgcgc 4011841 gtcgcgttcg ggtcggaata ccagtccagc agcgcattgc tggcctggtt gacggcggcc 4011901 gcggtggcga tcgcaatgct gacactcacc ggtgccgccg cggtcgcggc cgcactgcac 4011961 cgggcgtatt cgctgggctg ggttggtgcg acggttgggt cgggcttgtt gctgctgctg 4012021 ccgctgtcct tggagacccg caccgtggtc gcgttgttat gcggtccgct ggtgggaatc 4012081 ggcgtccatt tggtggcgct ggcgcggacg gacgagtaag cggccgatca gcccctgacc 4012141 aacgtgtaac ttgtgggctt aaatggcctc gaaaatggac actgaaacgc actactcgga 4012201 cgtctgggtc gtcattcccg ccttcaacga agccgccgtg atcggcaagg tcgtcaccga 4012261 tgtgcggtca gtcttcgacc acgtcgtctg cgtggacgac ggcagcaccg acggcaccgg 4012321 cgacatcgcc cggcggtccg gtgctcacct cgtacgccat ccgatcaacc tgggccaggg 4012381 ggcggccatt cagaccggaa tcgagtacgc ccgcaagcag ccgggcgccc aggtctttgc 4012441 cacctttgac ggcgacggcc agcaccgcgt caaagacgtg gccgcaatgg tcgaccggct 4012501 cggcgcaggt gacgtcgatg tggtgatcgg aacgcggttc ggccggcccg tgggcaaagc 4012561 ttcggccagc cgaccgccac tgatgaagcg gatcgtgctg cagacaggag cgcggttgag 4012621 ccgtcgaggc cgccgacttg gcttgaccga caccaacaat ggcctgaggg tgttcaacaa 4012681 gaccgtggcc gacgggctga acatcaccat gagcggcatg agccacgcca ccgagttcat 4012741 catgttgatc gccgaaaacc attggcgggt agcggaagaa ccggtcgagg tgctctacac 4012801 cgagtattcg aagtcgaaag gccaaccgct gctcaacggc gtcaacatca ttttcgacgg 4012861 gtttctgcga gggaggatgc cacgatgaac tggatccagg tgctgttgat cgcgtcgatc 4012921 atcgggttgc tgttctacct gttgcggtcg cgccgaagcg cgcggtcgcg tgcctgggtc 4012981 aaggtgggct atgtcttgtt cgtgctcgcc ggcatctatg ccgtgctgag accggacgac 4013041 accacagtgg tcgcaaactg gtttggggtg cgccgcggca ccgacctgat gctctacgca 4013101 ctggtgatgg cgttcagttt caccacactg agcacctaca tgcggttcaa ggacctcgag 4013161 ttacgctacg cgcgcatcgc ccgggctctg gcacttgagg gcgcacaggc gcccgaacag 4013221 tgccggtaag acccagccac ttgagggcgc acaggcgccc gaattaagcc gcgattcgat 4013281 ctgcgcagac cgtagccagg aaggacccgg cggcctacag ttcttagagt tactgcatct 4013341 ctgaccagca ggaggcgata tgtccgaccc tgacgacgtc accacatcat ctgacgaccg 4013401 cgacgagggc gaaccggaaa tagacctgct gccggcctga tgactcagag ctcatcggtc 4013461 gaacgcctgg tcggcgagat cgacgagttc ggttacaccg tagtcgagga tgtcctcgac 4013521 gccgattcgg ttgccgcata cctagcggat acccgtcggc tggaacggga gctaccgacc 4013581 gtcatcgcca actccacaac cgtcgtcaag ggcctggcgc ggcccggcca tgtcccggtc 4013641 gaccgggtcg accacgactg ggtgcgcatc gacaacttgt tgctgcacgg cacccgctac 4013701 gaggcgctgc cggtacaccc caagctgctg ccggtcatcg agggtgtgct tggccgcgac 4013761 tgcctgttgt cgtggtgtat gacgagcaac cagctgccgg gcgcggtggc tcagcgcttg 4013821 cactgcgacg acgaaatgta tccgctgccg cggccgcatc aaccgctgct gtgcaacgcg 4013881 ttgatcgcgc tgtgcgattt caccgccgac aacggcgcca cccaagtggt gcccggttca 4013941 catcgctggc ccgagcggcc gtcgccgcca tacccggagg gcaagccggt cgagatcaat 4014001 gcgggcgacg cgttgatctg gaatggcagc ctgtggcata ccgccgcagc gaaccgcacc 4014061 gatgccccgc ggccggcatt gaccatcaac ttctgcgtgg ggttcgtgcg ccagcaggtc 4014121 aatcaacagc tgtccatccc gcgagagttg gtgcgctgct ttgaacctcg gctacaggaa 4014181 ctgatcggct acgggctata cgccggaaag atgggccgaa tcgactggcg accgccggcc 4014241 gactatctcg acgccgaccg gcatccgttc ttggacgccg tagcggaccg tctgcagact 4014301 tcggtcaggc tctgatcaat cagtgtgctt gtgccggaag tactcgaccg tgcgacgcac 4014361 gccgtcggcc aactcgatct gcggacgcca gcccaaaacc cgttcggcta agccgatgtc 4014421 aaggcaggac cgcttaagat cgcctagccg cggcgggtgg aactcagggt cgtcgggccc 4014481 gccgacagcc gcggccaccg ccgaatgcag ttggcggtcc gacgtttcct taccggtgcc 4014541 gatgttgaag cgcagcccac cgccgacgtc cgcggacacc cggacaaacg cgtcgaccac 4014601 gtcgtcgaca aacacatagt cgcgcgtatt ggtgccgtcg ccgaacaccc tggtgggttt 4014661 gcccgagagc agcgcctgcg cgaagatcgc taccacaccc gcttcaccgt gtgggtcctg 4014721 gcgaggaccg tagacgttag ccggtgcgat atgcgagcag tccaggccgt agagatgtcg 4014781 aaaggtgttc aggtagattt cgccggccac tttgcccgcg gcatacggcg aggccggatc 4014841 ggtgggcgct gtctcagggg ttggatactc cggcggggtg ccatagatcg atcctcccga 4014901 ggaggtgtgc acgatcttgc ggacaccggt ctgccgcgcg gcctcggcta ggcgcaccgt 4014961 gccgatgaca ttgaccgcgg cgtcgaattg cgggtcagcc accgaacggc ggacatcgat 4015021 ctgggccgcc aggtgaaata ccacctcggg ccggtgctgc tcgaggatgg cgtgtagatc 4015081 ggcggtcaca atgtcggctt cgacgaagac gtgtgcggag ttgtcggcca gatgctcgag 4015141 gttggtcgcc cggccggtcg cgaagttgtc caatcccacc accgaatgac catctgccag 4015201 caaccggtcg actaacgtcg agccgatgaa tccggccgcc ccagtgacca gtgcgcgcac 4015261 cggcccacca taccggcggc ccatgccagc gccccgtatg cctcgggtcg ccctggtcgc 4015321 cgtattgctg atcacggtgc agctggtggt tcgcgtggtg ctggcatttg ggggctattt 4015381 ctattgggac gacttgatcc tcgtcggcag ggccggcact gggggcctgt tgtcgccgtc 4015441 gtacctgttc gacgaccacg acggccacgt gatgcccggt gccttcctgg ttgcgggcgc 4015501 cattatccgg gtggcacccc tggtgtggac cggaccagcg atcagcctgg tggtgctgca 4015561 gctgctggag tcgctggcgt tgctgcgcgc gttgtatgtg atatcgagct ggcggccggt 4015621 actcctgatc ccattgacgt tcgcgctgtt cacaccgcta gcggtgccgg ggttcgcgtg 4015681 gtgggcggct gcgctcaact cgctgccgat gctggccgcg ctggcgtggg tgtgcgccga 4015741 tgccatcctg ctggtgcgga ccggcaacca ccgctacgcc gtcaccggtg tcctggttta 4015801 cctcggtggc ctgctgttct tcgagaaggc cgcggtgatc ccgttcgtct ccttcgcggt 4015861 ggccgcgctg cagtgccatg tgcgcggcga ccggtcagct ttggcgacgg tgtggcgggc 4015921 cggtgtccgg ttgtggacgc cgtcgctggc actgaccgtc ggctgggtag ccctttatct 4015981 ggcggtggtg gatcaacggc gatggagttc cgatctgtcg atgacgtggg atctgctgtg 4016041 ccgttcggtc acccacggca tagtgccggc actggccggc gggccgtggg actgggcgcg 4016101 ctgggctccg gcatccccgt gggccactcc cccggcggtg gtgatggtgc tcggctggct 4016161 ggtgttgatc gcagtgcttg cgctgtcact ggtccgcaag cgacgcatcg gcccggtgtg 4016221 gctgaccgcg gccggctacg cggtggcctg ccaggtgccg atctttctga tgcgctcgtc 4016281 gccgttcacc gcgctcgagt tggcccagac cctccggtac ttcccggatc ttgtcgtcgt 4016341 gctggcgctg ctagccgccg tcgcgctgca ggcacccaat cgcgccggca cccgctggct 4016401 ggacgcctcg ccggcccgag ccgttgcgac agtcgcttcg gccgtgttgt ttttgaccag 4016461 cagcctgtat tcgaccgcga cgtttctggc cagttggcgt gacaacccca ccgagggata 4016521 cctgaagaac gcccaggcaa gtctggccgc ggccgcgtca ggtgcgccgc tactggatca 4016581 ggaagtcgat ccgctggtgt tgcaacgagt ggcctggccg gagaacttgg ccagccacat 4016641 gttcgccctg ctgcgcgtcc gaccggaatt cgctacgaca acaacacaat tgagaatgtt 4016701 caccagcaca ggtcggctgg tcgacgcgaa agtgacctgg gtccggacga tcatcgcggg 4016761 gccggtgccg cagtgcggct acttcgtcca gccggaccgg ccggaacgtc tgatcctcga 4016821 cggccccttg ctgcccggcg actggaccgt cgaactcaac tacctggcca acagcgacgg 4016881 ctcgatggcg ctggcacttt ctgacggacc tgagcggaag gttccggtgc atccgggtct 4016941 caatcgggtg tacgcccggc taccaggggc cggcgacgca atcacggtgc gagccaacac 4017001 caccgcgctt tcgctgtgca tcggagcggc gccggtggga tttctggcac cggcctgacc 4017061 tcaacgccgg tcgccacagc cgctcaaacg tggcggccgc gcgtattcga ccgtccgtag 4017121 tggttcgtta aagcgttgca gtacaacgca tacaacaatc aatcggccat tgagttcgca 4017181 cgctcatgca gttgcgaatg gtcggtggat gctcgaagcc aatgcagaaa gcgaccggct 4017241 cgatgagctg caccagcagt atcaccgaga tgatcttggc ggtaatcagg cttgtatctc 4017301 ttgtagtgtg gcggcggcaa ctgaatactg accagagcgc ggcaactgaa aattgaccag 4017361 cttcctggag agccttggct atgggccaag gaggaagcga gtgttgagcg tggaggattg 4017421 ggccgagatc cggcggttgc gccggtcgga gcggttgccg atttcggaga tcgcgcgggt 4017481 gttgaagatt tcgcggaaca cggtgaagtc ggcgttggcc tccgatgggc cgccgaagta 4017541 ccagcgtgcg gcgaagggct cggttgcaga tgaggccgag ccgcggatcc gggagttgtt 4017601 ggcagcctat ccgcggatgc ctgcgacggt gatcgccgag cggatcggtt ggtggtattc 4017661 gatccggacg ctcagcgggc gagtacgcga gttgcggccg ctgtatctgc cgccggatcc 4017721 ggcgtcgcgc gacatatgtg gccggtgaga tcgggcagtg cgacttctgg ttccccgatg 4017781 tcgttgtgcc ggtggggtac ggccaggtcc gcaccgccac ggcgttacct gtgctgacca 4017841 tggtgtgtgg gtattcgcgg tgggcctcgg cgctgttgat cccgacacgc accgccgaag 4017901 acttgtatgc cgggtggtgg cagcatcttt cgacgttggg cgccgttcca agggtgttgg 4017961 tgtgggacgg cgagggcgcg gtcgggcggt ggtgggcgcg ccaacctgaa ctgactgcgg 4018021 catgccatgc cttccgcggc accctggccg ccaaagtgtg gatctgtaaa ccggtgatcc 4018081 cgaagccaag gggctggtcg aacgtttcca cgactacctg gagcgggcgt tcttgccggg 4018141 tcgggtcttt gcctctccgg cggatttcaa tacccagttg caggcctggc tggtgcgggc 4018201 caatcaccgc cagcaccgag tgctgggatg tcgaccggca gatcgcatcg aggccgatac 4018261 cgcagcgatg ctgacattgc cgccggtcgg gcccagcatt gggtggcgaa cctcgacacg 4018321 gctgccgcgc gatcattacg tgcgcctcga cggcaacgac tactcggtgc atccggtcgc 4018381 gatcggccgg cgcatcgaga tcaccgcaga tctgagccgg gtccgggtct ggtgtggcgg 4018441 caccctggtc gccgatcatg accgcatctg ggccaaacac cagacgatca gcgatcccga 4018501 gcatgtcgtg gccgccaaac tgctgcgacg caaacggttc gacatcgtcg gtccacccca 4018561 ccacgttgag gtcgaacaac gtctcctgac cacctacgac accgtgttgg gccttgacgg 4018621 gccggtggcc tgatggcagc caagaccgct accaacagcc gcgatgtggc cgccgagctg 4018681 gcgtatctga cccgggcgct gaaagccccc accctgcgcg gggccatcga gcagctcgct 4018741 gaccgcgccc gcaccaagac ttggagctat gaggagttcc tcgcagcgtg tctgcaacgc 4018801 gaggtgtcgg cccgcgaatc ccacggcggc gaaggacgca tcagggccgc ccgcttccca 4018861 tcgcgcaagt cgttggagga gttcgacttc gaccacgccc gcggtctcaa acgcgacacc 4018921 atagcgcatc tgggcaccct ggacttcgtc accctagcaa tcgggatcgc gatccgcgcc 4018981 tgccaggccg gccaccgcgt cctattcgcc accgcctcgc aatgggttga tcgtctggcc 4019041 gccgcccacc acagcggcac cctgcaatct gaactgattc ggctggcccg atacccgctg 4019101 ctggtcgtcg acgaagtggg ctacatcccc ttcgaacccg aagccgccaa cctgttcttc 4019161 caattggtgt cgtcccgcta cgaacgggcc agcctcatcg tcacgtcaaa taagcccttc 4019221 gggcgctggg gcgaagtatt cggcgacgac gtcgtagccg cggccatgat cgaccgactc 4019281 gtgcaccacg ccgaagtcat cgcactcaaa ggagacagct accgcatcaa agaccgagac 4019341 ctcggccgcg tccccaccgt cacggccgac gaccaatgaa accaagctgg tcaattttcg 4019401 attgccgaca cctgatcagt tttcggttgc cgttgacata gtgcccaaaa cacgcaccca 4019461 catcagatgc agaacccctt gacaaccaat agggaatctc ttcgcatgat ggaggttgct 4019521 ggcaccaatc catcaggaag gcccttgttg accggcactg ggttgggggt ccaccgcgat 4019581 gggtgagtat ggcaagtgcg gcacgtatgc acccgtcttg gtgcacgcgg ccaagggcag 4019641 cccgttagcg ccgtcgccca gcgtgaactg agggcggaga atcggccgga atctcgccct 4019701 cagtgcacgc tcggcgccgt ttggcctcac ccggtcaacg tgaactgtcc ggggcgggca 4019761 ctgtcgcgta gcgagcccac gtggggccgg ggtcggcccg ccaaaaacgc cccggcgcgg 4019821 ccagctcatg agcgggtacg caagctcaag cagatctccg tagccgtgac ggagtgcttc 4019881 atcgatgtcc gcagcgatgg cagcggccag tgcgtgccta aacccgtctt gcgcagagtc 4019941 gttcgcagcg ggcgggtagt tgcacgtcgt cgccgaagtg ctgacgatcc cgttgcggtc 4020001 ggagaccgcg agtagccagc gcgcgtccgg ggcagcatct cgcgcagcac gctgaagtgt 4020061 cgcggcaccg gaagccgggg gcgtgaagag acccgccatg acaccggctg gacggcgcgg 4020121 ggcagagtcc cgcggagtgg tgggcttcga cgttgagttc gtcggtgcct actggccgcc 4020181 gctgattgcg gcgaccacag cattatcgct atcggggtag agcagcgcca tagaggcctc 4020241 ggagaggtag cggcgctcgc tggcctgcca ttcgtcgtgc atgtcggcca ggacggcgcc 4020301 cacaaggcgg atcacggctg caggattcgg gaagatcccc acgacgcggg agcgtcgctt 4020361 gatctcctta ttgatgcgct ccaatggatt ggtcgaccag atcttttgcc agtgcgcctt 4020421 gggaaatgcg gtgaacgcca atacttctgc cctggcgtcg tccatcagcg ggccgatctt 4020481 gggaaacgac gcggcgaggc gatcacggac ctcctcccag gtcgcgtgca ccgcctcggc 4020541 gtcgggtgcc gagaaaatca ttcgaaacat gctggcgacc atgtcggcct tgtccttggg 4020601 cacgtgggcg agcagattgc gcgcgaagtg cacccgacag cgctgatgcc cagcgccctg 4020661 gaaacagcgc ttcaacgcct tcaccagccc ggcgtgctgg tcactgatca ccagccggac 4020721 accaccgagg ccgcgcccct tgagcgaggt caggaacccg cgccagaagg tctcatcctc 4020781 gctgtcgccg acgtcgaggc cgaggatctc gcgtgacccg tcggcggcga tgccgctggc 4020841 aacgatgacg gccatcgaca ccacctggcc agtaccgttg cgcacgttga gataggtggc 4020901 gtcgaggtag acgtagggga actcgatgtg cccgagcgtg cgggtgcgga acgcgccgac 4020961 gatctcgtcg agtccggcac agatccgcga cacctcggat ttggagatgc cggtctccac 4021021 acccatcgcc tcgaccaggt cgtcgaccgc acgggtagag ataccgtgca cgtaggcctc 4021081 catcaccacc gcgtacaagg cctgatcgat ccgccggcgc ggctcgagga tcgccgggaa 4021141 gaaagagccc ttgcgcagct tagggattcg cagttccacg tcaccggcct gcgtggacag 4021201 cacccgcgat cgggcaccgt tgcgatcggt cacccgagtg tcgctgcgtt cataacgggc 4021261 agcgccgatc cgttcagtgg cttcgagctc gctgagttcc tgcaacacca gacggacggc 4021321 atcacggatc aagtcgacgc catcaccagt gcggaacgcg tcgagcaact cggacagggc 4021381 agactgtggc aaggccatcg gcgggatctc cttcggtgcg tgcttggcgg tacacaccga 4021441 cgatctcgcc gacggcccct acctcatcgg agccactccg caacaacccc taaacccacc 4021501 acgctgcggg aggcttaccg gcggcgtggc acaacgttcg gtatcgctga tcggcatcag 4021561 gaggttagtg cgatcagaag tcgtaagtgg gctcggcgtc gaggatcccc ttgaacatcg 4021621 cgaccaggcc cgtgagatca gagttggcgc gcgccacgtg acaagcgccg tgcaactctt 4021681 ccaggtcggt cttcccccag tcgaggccag aaccgcgttc ggacaacagg agatcgaaga 4021741 actcgcgggt cgagcggccg ttgccctcgc ggaacgggtg ggcatagttc acgtagtcgt 4021801 accggtatgc gacctggcca gcgagatcac cttcgccgac cgctctgagc cggtcgagct 4021861 ggtagatctc cgcagccaca tgctccatgg gccgactgat gccgcccggc gcgcagaaag 4021921 actcgtcctc cttctcgatg ccgactgtcc gcagatctcc cgcccagacg taaatgtcct 4021981 ggaacagctg gcggtgaatc gcccgcaggt atgcgagatc tgtgcggtcg cccagcagat 4022041 tgggatcctc gcggagttcg atcacccggg cctcaacgag gtcgttctcg gcatcacgca 4022101 gttcggcatg cgttcgagcg ccgacccggt tcctcaagac ggacatagcg gggatgaagt 4022161 agccctgcca attccgttcg tgatcgccgg tgtcccatgg atgcggcact ccaccccggt 4022221 tactggatgt tgtaccggcg gcggacgcgc tcacccaact cggctgccgt gatcttgccg 4022281 cgggcgtagt cgttctgatc ggcacgggtg gcggcggtgc tgcgggtgcc ctccagctcg 4022341 gtgttgcggc gagttgccct gacattcctg aagcgccgct tcaccttctg caactcggtc 4022401 gcctggacaa acacttcatc tcatttggtg gtcctgacca ggatagtcga cagcgctgac 4022461 attgcaggaa gttgaccgtc aagcacagca cggttctcca ccgctgatgt acgaccatca 4022521 tgtctcgttg gtcctgtaat cgacggcgtc ccaccggctc gacaagaaat cccaccaggt 4022581 gactggacgc aaggccggtg gggcccccta caccgtcacc atcccggagt tcggagccgc 4022641 agctttgcgc gagcagcggg cactggtcat cccgttcgac ccggtgtttc cggcccggcg 4022701 cggcacccgc tagtccgagg ccaacgtcgc acccactggc gggcgatccg cggagaggac 4022761 ttcaaatggg ttgtcccgca ctcgatccgc aagtccgtcg tcaccgcggt ggaacgctcg 4022821 atagggctgg aagccgcggc ccagcaggcc gggcacagcg gcagcgagat cacccggcgg 4022881 cactacgtcg agcggtccgt gacggtgccc gactacaccg ccgccctgga cgagtattcg 4022941 cgccctatcc gcgccttcag gccattaaag agcaacaggc cgggtgatat accgacctga 4023001 cctgcaaaga tggagccgcc taggagaatc gaactcctga cctattcatt acgagtgaat 4023061 cgctctaccg actgagctaa ggcggctttt cccctgggtg cccgcttgcc gggcggcacg 4023121 agtctacggc aggcgggccg gcccgcccaa gtttgcggcg gtcgctaccg cagttcctgg 4023181 ccgatggtgg cgaccatggc atcgacggcg aacttcggtt tgacgttgac cgctagcgct 4023241 tccctgcacg ccaacaccgc ttcgatgcag cgcagcagcc gctccggcgg ggcgtgggcg 4023301 gccagcgcag caacccggtc ggccatatcc gggtggttgg cccgcacccc acccgcgtgg 4023361 gctgcgacca acagtgcatc ccggaagtag gtcgccagat cgatcagtgc ccggtccagc 4023421 gcatcgcgcg aggcccgcgt ctgccgggat ttctgccgtc gttcaagatc cttcatcgcg 4023481 ccggtggcac cacgcaacgc cgcgccggtg cctttaccgg tacctccggc tcccagcgcc 4023541 gtccgcagtt cttcggtctc ggcctcgata cgctgcgcgg tcaacgctaa ggcctcggcc 4023601 tcggcgccgg ccaccaactc ctcggcggct gcgtaggcac gcgagggtgt cgcggcgtca 4023661 cgtgccagcc ccaaagcccg ctcgcgtcgc tgccgggcct gcggatcggt ggccagccgg 4023721 cgcgctcgtc cgacatggcc accactgacc gacgccgccc aattggccgt gtcggggtcc 4023781 aacccgtcgc cgtcgctcag cacctgcgcg atcgcgtggg tcgacggagt caccaacgcg 4023841 acatgcctac accgggatcg cagcgtgacc gcaatgtcct cgggatccac cgacggcgcg 4023901 cacagcagga acaccgtcga cggcggcggc tcctcgacaa ccttgagcaa cgcgttggcg 4023961 gcgccttcgg tcaaccgatc ggcgtcctca atcaccacga tctgccagtg cccggtagtc 4024021 ggccggcgcg cggcgatttg cacgatggcc cgcatttcgt ccacaccgat cgacagacct 4024081 tcgggaatca cccggcgtac gtcggcgtgg gtgcccgcca gcgtggtcgt acacgcccgg 4024141 cagcgcccgc acccgggctc cccgcccgac gtacattgca aagccgccgc gaagcacagc 4024201 gcggcaaccg agcgcccaga accgggcgga ccggtgagca gccacgcgtg tgtcatagtc 4024261 ccgccgccac ccgcgctgtg agccgaatca cgacgggccg ccttggccgt ggcaagcagc 4024321 tcggcttcca ccgcttgctg gcctaccagc cgcgtaaaca ccccggacat catcggcaac 4024381 agtagctatc cgcgccgaca gataccgatc agcgttcgtt tcgcgacaat tccgtgatct 4024441 ttcgtcgcca tttggatgga tgccgaggcg ttcgtcggtt tccggcaagt ccccgccgcc 4024501 cgatacggtg ggctaatggc aaccacggcg gcgctaccca gacggatcca tgcattcgtc 4024561 cggtgggtag tgcgcactcc gtggccgctg ttctcgctga gcatgctgca gtccgacatc 4024621 atcggcgcat tgttcgtgct cggattcctg cgctacggcc tgccgcctca ggacaatatc 4024681 caactgcagg atctgccacc ggtcaaccta ctgatcttcg tcagcacggt aatcatcttg 4024741 ttcctcgccg gggccgtggt gaacctgaag ctgctgatgc cggtctttcg atggcagcgc 4024801 cgcgacaacc tgctcaccga gcctgatccg gccgccaccg agctggcccg cagccgcgca 4024861 ttgcgcatgc cgttgtaccg cactctgatc agcctggcgg tctgggctac cggcggcggg 4024921 gtgttcatcc tcgccagctg gtcggtggcc aagcatgcgg cccccgtcgt ggcggtggcc 4024981 accgcgctgg gtgccaccgc caccgccatc atcggctacc tgcagtctga acgggtgtta 4025041 cggccggtgg ccgtcgcggc gctgcgcagc ggtgtgccgg aaaacgtcaa cgcacccggc 4025101 gtcatactgc gactgatgct ggcgtggatt ccgtccaccg gcgtaccact cctggcgatt 4025161 gtgctggccg tagcggcgga caagattgcc ttgctgcacg ccacaccaga ggcgctgttc 4025221 aatcccatcc tgatgatggc actggccgcg ctgggcatcg gatccgtcag caccctgttg 4025281 gtggccatgt cgatcgccga cccgttacgc cagttgcgct gggcgctaag cgaggtgcag 4025341 cgcggcaact acaacgccca catgcagatt tacgacgcca gcgaactggg cctgctacaa 4025401 gccggcttca acgacatggt ccgcgagctg tccgagcggc agcggttgcg tgacttgttc 4025461 ggtcgctacg tcggcgaaga cgtggcccgg cgggccctgg agcgcggcac cgagttgggc 4025521 ggtcaggaac gcgacgtcgc ggtgctgttc gtggatctgg tcggctccac gcaactggcc 4025581 gcgacacgac cgcccgccga ggtggtccag ctgctcaacg agttcttccg ggtggtggtc 4025641 gaaaccgtcg cccggcacgg tgggttcgtc aacaagttcc aaggcgacgc cgcgctggcc 4025701 atcttcggtg cacccatcga acaccccgac ggtgctggtg ccgcgctatc ggcagcacgt 4025761 gagctccacg acgaactcat cccagtgctg ggttccgcgg agttcggcat cggcgtgtcg 4025821 gccggaaggg ccatcgccgg ccacatcggc gctcaagccc gcttcgagta caccgtcatc 4025881 ggcgacccgg tcaacgaggc cgcccggctc accgaactgg ccaaactcga ggatggccac 4025941 gttctggcgt cggcgatcgc ggtcagtggc gccctggacg ccgaagcatt gtgttgggat 4026001 gttggcgagg tggttgagct ccgcggacgt gctgcaccca cccaactagc caggccaatg 4026061 aatctggctg cacccgaaga ggtttccagc gaagtacgcg gctagtcgcg cttggctgcc 4026121 ttcttcgccg gcaccttccg ggcagctttc ctggctggcc gttttgccgg accccgggct 4026181 cggcgatcgg ccaacagctc ggcggcgcgc tcgtcggtta tggaagccac gtcgtcgccc 4026241 ttacgcaggc tggcattggt ctcaccgtcg gtgacgtacg gcccgaatcg gccgtccttg 4026301 atgaccattg gcttgcccga cgccggatct gttcccagct cgcgcagcgg cggagccgaa 4026361 gcgctttgcc ggccacgacg tttcggctct gcgtagatct tcagggcttc gtcgagcgtg 4026421 atggtgaata tctggtcttc ggtgaccagt gatcgagaat cgttgccgcg ctttagatac 4026481 ggtccgtagc gcccgttctg cgcggtgatc tcctcacccg aggcggggtc cactccgacc 4026541 acgcgcggca gtgacagcag cctcagcgcg tcttcgaggg tgaccgtctg taggtccatg 4026601 ctccgcagca acgaaccggt gcgcggtttg ggcccggcgg ccttctggcg tttcttgact 4026661 ccctgagcgg ccgcggccgc atcagccgca ggctccggca ggatctcggt cacatacggc 4026721 ccaaaccggc cttccctggc cacgatctcg tggccggttt ctgggtccaa gcccaaagtc 4026781 cgtccctgtt gcggtgtggc aaagagctct tcggccacct gtagagtcag ctcgtccggg 4026841 gtaatcgagt cgctgaggtt ggcccgctgc ggcgtgggct caccggtgtc gccggccacc 4026901 aaacgttcca ggtagggacc gttcttgccc acccgaacat atatggggcg tccgtgggtg 4026961 tcgtcaaaaa gcttgataga gtttacttct cgtgcgtcga tgccctcgag attgatcccg 4027021 acaagcttct tgaggccacc cgatcgggct accgaatcgg gcacaccgtg atcgccacca 4027081 aagtagaagt tgttgagcca gttggtgcgg cgctcgttgc cggcggcgat ctcgtcgagc 4027141 tcgtcttcca tcgccgcggt gaagtcgtag tcgacgagcc gaccgaaatg ctgctcgagc 4027201 agaccggtta ccgcgaacgc cacccatgac ggcaccagtg cactgccctt cttgtgcacg 4027261 tagccgcgat cctggatggt cttgatgatc gacgagtagg tcgacgggcg gccgatgccc 4027321 agctcctcga gcgctttgac cagcgacgcc tcggtgtagc gggccggcgg gttggtggca 4027381 tggccgtctg gggtcaactc gacgatgtcc aaccgttgac ccggggtcag atggggcagt 4027441 cgccgctcgg catcgtcagc ctcgccgccg accagctcgt ccacggtctc cacgtaggcc 4027501 ttgaggaagc ccgggaacgt caaggtgcgt ccggtcgcgg agaacaccac ctcctggtgc 4027561 cccgacatgc cagtgatccg caggctcagc gtcatgcccc gcgcatcggc catctgcgag 4027621 gctacggtgc gttgccaaat cagctcatag agccggaaat catcaatgtt gggaccgtcg 4027681 agttcgcgac gcaccgcgtc cggggtggca aacgtttcac cggcgggccg gatagcctcg 4027741 tgcgcttcct gggcgttctt caccttgcgg gtgtattggc gcggcgccgg cgcgacgtac 4027801 tcgtcgccgt agagctggcg cgcctgggta cgtgcggcgt tgatcgccga ctccgacagc 4027861 gtggtggagt cggtacgcat ataggtgatg tagccgtttt cgtacagccg ctgggcgatg 4027921 ctcatcgtcc gctcggcgga gaaccgcagc ttgcggctgg cctcttgctg cagcgtggag 4027981 gtcatgaacg gcgggtacgg gcgccgggcg tagggcttct cctcggccga ggccacggtc 4028041 agctgcgtgc catccaggcc cgcggccaac gcggtcgcgc tcccctcgtc gagcacaatg 4028101 acttcgtcgc ctttgcgcag cgtgcccagc gagtcgaaat cgcggccagt ggccacccgc 4028161 cggccagcca cggccgtcag ccgggcgctg aaggtgggcg gcgcggcgtc cgggtcggac 4028221 acgctggcat ccagcttggc aaggatgtcc cagtaggccg cgctgcggaa cgccatgcgg 4028281 tcgcgttcgc gcgccacgat gatgcgggtg gccaccgact gcacccggcc cgccgacaac 4028341 ttgggggcga ccttcttcca cagcactggg ctgacttcgt agccgtacag ccggtccagg 4028401 atgcgccggg tctcctgcgc gtcgaccagg tcgatgtcta ggtcgcgggg gtgctcggcg 4028461 gcggcgcgga tcgccggttc ggtgatctcg tggaagacca tccgctttac cggtatgcgc 4028521 ggtttgaggg tttccagcag atgccaggca atagcttcgc cctcacggtc cccatccgtg 4028581 gccagataca gctcgtccac gtctttgagc aggcccctga gctcgctgac ggtgctccgt 4028641 ttctccgggc tgatgatgta gagcggttcg aagtcggcgt cgacgttgac cccgagccgc 4028701 gcccacggct gcgacttgta ctttgcgggt acatccgacg cggcccgcgg caagtcacgg 4028761 atgtgccccc gggaggactc gacgatgtag ccagagccca ggtaggaggc cagcttgcgc 4028821 gccttggtgg gcgactcgac gatgaccagt cgccggccgc tgccattgcc gccgctgcca 4028881 cggcccttcg ttttcgggtc agccaactgc gcccacgctc catctcttat cccggcccct 4028941 atcgagaccg ccccggtagg tagaggacgc ggccgactgc cgaatcccag gtgaattccg 4029001 gtacgccggc gttccctcgc ctgtgggcaa ctgacaatct cgcactctag ggcgggcctg 4029061 cgcaaaccgg ctgcaaacag attacccaca ccaaaggctc aaacgggccg ctcaggacgc 4029121 tcggagatcc gcatcgtcgc cgaactaggt ccgactgccc ggctcctcag cggaccccag 4029181 cgggaccgca tcgtcgccga gctaggtccg actgcccggc tcctcagcgg accccagcgg 4029241 gaccgcatcg tcgccgagct aggtccgagg ccactgtacc catgcctcgg ccccgtctgg 4029301 gggttccccc acgttctcta ccaggcgcga taacctgcgt cgcccgctaa tccgcagcgc 4029361 tggtcgggtc ccccgggtac cgatcagggt gggcgcaatc ccgaccctca tcagcgccga 4029421 cgccagcgga gaatgggtgt ccggcgcgtg tggatccagg cccagcaaat agcggtcggc 4029481 ctccgggctg cctgccgcca gagtccaagc cctcagctcg cgcggcccgg gcagccatcg 4029541 cgggggcacg gtcttgaccg caccgcgcgt ccactcggcg gcgatgccgc acaacagcgg 4029601 gtcgacggcc gtccgcacca gcggggtgtt ttcgtcggta cgggcgacct cgggtaccaa 4029661 accagcctcc tggatcatct cggccagggc cgatgcgcgc caggactcgg cgacgactac 4029721 cgacagccga gcgccgcaac caaccagcac gatctggccc gggcccgcca gcaccccgga 4029781 aagatccgcg accgcgggag gtactgactc cgcggcgaag aaggaaagct ggctcacctc 4029841 accgacagta agccagcgag cgggtcgctg gctttaggca tccggcgcgg cggcagcgcg 4029901 ccatgtggcg agcagacgta aagcccccaa aacggaaccg ttttgggggc tttttgcgtc 4029961 tgctcgcggg ggtaactcag agcgagcgga ctccggtggc ctgggggccc ttagggctgt 4030021 ggccgatctc gaactcgacc ttctggtttt cttcaagggt gcggaagccc gttccctgga 4030081 tctccgtgta gtggacaaat acatccgcgg aaccgtcttc gggggcgata aagccgaacc 4030141 ccttctccgc gttgaaccac ttcacagttc cctgtggcat ttctcgatct ttccttttct 4030201 tctgggtgcg gtgcaccgcc tttcggtgcc ccgggccagc tgcggccgcc atacctcgcc 4030261 gagtcgccgg aacttcaccc gaccgataac ctcgcaggaa ccgcggccgc aacgtcgatc 4030321 ctgcgaaagt ttgacacgaa cacagaagct gcgaccgcca atcagtcaat catgttcatc 4030381 gcgtcggcaa cagcctctgg gtgtggacgg agctacgaag ggtccgcaaa tggcgagttt 4030441 cggcagccac ctgctggccg cagcggtcgc cgggaccccg ccgggcgagc gtccgctgcg 4030501 ccacgtcgcc gagctgccac cgcaggccgg ccggccgcgc ggttggccgg agtgggccga 4030561 gcccgacgtg gtggatgcgt ttgccgaccg cggcatcagc tcgccgtggt cacaccaggc 4030621 tgaggccgcc gagttggcgt acgccggccg ccacgtggtg ataggcaccg gcccggcgtc 4030681 tggaaagtcg ttggcctatc aacttcccgt gctcaacgcg ctggcaaccg actcccgggc 4030741 gcgtgcgctg tatctgtcgc cgacgaaggc gctcggccac gaccagttgc gcgccgcaca 4030801 tgcgctggcg gccgcggtgc cacggctggc tgacgtcgcg ccgacggcct atgacggcga 4030861 cagtcccgac gaggtgcgcc gctttgcccg cgagcgctcc cggtggctgt tctccaaccc 4030921 ggagatgaca cacctatcgg tgcttcgaaa ccatgcgcgc tgggctgtgc tgttgcggaa 4030981 tctccgcttt gtgatcgtcg acgaatgcca ttactaccgt ggtgttttcg gctcgaatgt 4031041 ggcgatggta ctgcgccgtt tactacggct gtgcgcgcgc tactctgcgc acccgacggt 4031101 gatcttcgcc agcgcgacaa cggcctcgcc gggcgcgacg gctgccgacc tgatcggcca 4031161 gccggtcgtg gaggtcaccg aggacggctc accccggggg gctcgcacgg tggcattgtg 4031221 ggagcccgcg ctgcggtcgg atgtgatcgg cgagcacggc gccccggtgc gacgctccgc 4031281 cggtgccgag gcggcccggg tgatggccga cctgatcgtc gagggagcgc agaccttgac 4031341 gttcgtccga tcgcggcgcg cggcggaact gactgcactg ggtgcccggg cgcgactggt 4031401 cgacattgcc ccggaactgt cggacacggt ggcgtcgtat cgggccggtt atcttgccga 4031461 ggaccgtagc gcgctgcacc aggccctggc cgagggccag ctgcgcgggc tggctaccac 4031521 caacgctttg gagttgggcg ttgatatcgc cggactggat gcggtggtgc tggctggttt 4031581 tcccgggacg gtggcctcgt tctggcagca ggcgggccgg tcgggccggc gcggccaggg 4031641 cgcgctggtg gtgttgattg cccgtgacga tccgctggac acgtatttgg tccaccatcc 4031701 cgcagcattg ttggacaaac cggtcgagcg cgtggtgatc gatccggtta acccgcacct 4031761 gctgggtccc caattgcttt gtgcagcaac agaactgcct ttagacgacg ccgaggtccg 4031821 gtcctggggc gccgttgagg tggcggagag tctggttgac gacgggctgt tgcggcgccg 4031881 gaacggcagg tactttccgg cgcccggggt gaaaccgcat gccgccgtgg atgtccgggg 4031941 ggctatcggt ggccagatcg tcatcgtgga ggccggaacc gggcggctct tgggcagcgt 4032001 gggcgtcggt caggccccgg ccgcagcgca cccaggcgcg gtgtacctgc accagggcga 4032061 gacctacgtc gttgactcgc tggatttcca ggacggaatc gccttcgtgc acgccgagga 4032121 tcccggctat gccacgttcg cgcgagaggt caccgacatc gcggtcaccg gcaccggcga 4032181 gcggttggtc ttcgggcccg ttgctttggg tttggtgccg gtgactgtca ccaatcacgt 4032241 cgtcggctac ctgcgccgcc agctgtccgg ggaggtgctg gacttcgtgg agctggacat 4032301 gccggaacat accttgccca caaccgcggt catgtacaca atcacttcgg atgcattggt 4032361 ccgcagcggt attgaggcca cacggattcc cgggtcgttg cacgccgccg aacacgcggc 4032421 catcgggctg ctgccgctgg tggccagctg cgaccgcggc gatatcggcg gcatgtccac 4032481 agcgaccggg cccgaggggc tgcccagtgt ctttgtctac gacggctatc cgggtggagc 4032541 cggattcgcc gaacgcggct ttcgccgggc ccgcacctgg ctgggcgcca ccgcggaggc 4032601 catcgaagcc tgcgaatgcc ccagtgggtg tccatcgtgt gtgcaatccc ccaagtgcgg 4032661 caatggcaac gacccgttag acaaggcggg cgcggtgcgg gtgctgcggc tggtgctcgc 4032721 cgagttaagt gaggaatcac cgtgagcagc ccagcgttcc ggcgttgtcg ggcaaagcgg 4032781 ggtcgtcgtc ttagccgatg tgatgcactt gacatcagtg tcttcggcct atcacgtagt 4032841 ggtcgtgggc gccggccgaa gatccgggcg ggaggtgaca cgtgtcgttt gtgatcgcgg 4032901 cgccggaggc gttggactcg gcagcaacgg acctcgtggt cctgggctcg acgttaggcg 4032961 cggccactgc ggccgcggcg gcccagacga cgggtatcgt ggccgcggcc cacgacgagg 4033021 tgtcggcggc gatcgcagcc ctgttttccg cccacggcca ggcctatcag gccgccagcg 4033081 cgcaggccgc ggcgtttcac acccggttca tccgtgcgcg ctcccgacat ccgcagcagg 4033141 aaacgacctg tcgccgtgtg cgataggcaa atcaccaggc aacacgccgg cagctccggt 4033201 aaggccaaca tcgaccacct acccagggca ttcccatgca cgtcaccgcc gcatagcaag 4033261 ttgcggatgc tgagtggtcc gctaccaccc ggtatggcaa cgccggtggt catggcacca 4033321 cctcgggtct gatctgcctc ggaggccggc cgctggcacg aaggcaacga cggttcgggc 4033381 gggttggcct agcgatacca cacgcatgcg ctgtcctgca agggaattcc ctcggcgacc 4033441 accggtaccc caccgagtca acggcgcacc gcgtccgtag actgctcgca tgacccacga 4033501 ctggctgctc gtggagacgc tgggggacga accggccgtg gtagcacggg ggcgtgagct 4033561 gaagaagctc gtcccgatca ccacgttcct gcgtcgcagt ccctatttgg cggcggtccg 4033621 cacagctatc gccgagacgc tgcagaccgg ccaaagcctg accagcatca ctcccaagca 4033681 cgatcgcgtc atccgcaccg aacctgtaat aatgaccgac ggccgcatgc acggcgtgca 4033741 ggtgtggagt ggccccacag acgccgaacc gcccgaccgg ccgatcccag gcccgctgaa 4033801 gtgggacctg acccgtggtg tggccaccga caccccggag tcactgacca acagcggcaa 4033861 gaatcccgag gtcgagatca cctacggccg agccttcgcc gaagacctgc cggcgcgcga 4033921 gctcaatccg aacgaaaccc aggtgcttgc catggcagtt aaagccaagc ccggcaaaac 4033981 actatgcagc atttgggatc tcactgattg gcaaggaaca cccatccgga tcggcttcgt 4034041 ggcgcgaagc gctctggagc cgggaccaaa cggccgcgat cacctggtcg cccgggcaat 4034101 gaattggcgt gctgagacca aggcccctgc agtgcccgtc gacgacttgg ctcagcggat 4034161 ccttatcgga ctggcgcagg ccggagtcca ccgggcactg gtcgatctca aaacctggac 4034221 cctgctgaaa tggctcgacc aaccctgctc tttctacgac tggcggcgta gcgcggccga 4034281 tgggcctcgt ctacatcccg acgaccagca cgtgatcgac gccatgacaa gagacctcgc 4034341 caacggatcg gccagtcatg tgctgcgctt gcctgggcac gacgtcgatt gggtgccggt 4034401 ccatgtcacc gtcaaccgga tagagctcga accggatacc ttcgctggac tggtcgctct 4034461 gcgactgccc accgacgaag aacttgccga cgccggactg ccgaaagcca ccgacgtcac 4034521 cacctgacaa ccagtccttt cgactcagca acggcagctg ccgatccgcg gctaccgttg 4034581 cttgtcgtga acggtttgac ggtgatccgg actgcgcgct cgctgagcgg cctacgccca 4034641 cgctgtcggt cagattgcgt cgatgaatcc tatgcgctct gaactgaact gggctgaatg 4034701 cgcgagccgc cgacgtaggg aatcggcaac gcccgtcgga cgaccccgcc gatctcgtcg 4034761 tcgacatcca gtggcgccgg catcagcagg gtggtgacga ttgcccgttc agacagtcgc 4034821 cgcaaggccc cgggcctgct aggaggtcgg gttccccggg acgtcgacca caccctggtc 4034881 gcaatgtcca acgtaagcaa caggtttgag tatgaggtgc cggtagcgag gatgaattcg 4034941 ccagtcctgg tacacgcgca cggacatcgc aggtgccgcg atgcggccgg cctctggcca 4035001 ccgccgaatc ggcgtagccg tcgggcactt tcaagatcgg gtcagcgcgc ctgatgcgca 4035061 ccgggccgcc acctcagcgc catggtgttt cggacatcct ccaatcgccg ccgatccccg 4035121 aggaacacca ggtcgcccgc gtgcgggcga aaggcagcga ggacttttgg gaaacccacg 4035181 cacatgcttc ccggatagcg ataagctgcg ctccagcaga ttgtccgccg gtgaccgggc 4035241 ggcccttcga tcggcatcgc gcggtggtcg gaggtgtccg atgtcatatg tgatcgcggc 4035301 gccggaggcg ctggtggcgg cggccacgga tttggctact ctcggctcga cgatcggcgc 4035361 cgccaacgcg gccgctgcgg gctcgacaac ggcgttgctg accgccggcg ccgacgaagt 4035421 gtcggcggcg atagcggcct attcggaatg cacggccaga cctatcaggc actcagtgcg 4035481 cgggcggcgg cgttccatga gcggttcgtg caggccttgg ccacaggtgg gggcgcctat 4035541 gcggccgccg aggccgccag cgtctcgccg ctgcagagcg cgctcgattt gctgaatgcg 4035601 cccactcagg cgctgttggg gcgtccgttg gtgggcaatg gcgccaatgg ggccccgggg 4035661 actggggcaa acggcggcga tggcgggatt ttgttcgggt ccgggggggc cggcgggtcc 4035721 ggagcggccg gcatggcggg tggcaacggc ggggccgccg ggctgttcgg caacggcgga 4035781 gccggcggag ccggcggcag cgcgacggcc ggtgcggccg gggcgggcgg gaacggcggg 4035841 gccggcgggc tgctgttcgg taccgccggg gccggcggca acggcgggtt aagcctcggt 4035901 ttgggcgtcg ccggcggcgc cggcggcgcc ggcgggtcgg gcggtagtga caccgccgga 4035961 cacgggggga ccggtggtgc cggcggcctg ctattcggcg ccggcggcgc cggcggcgcc 4036021 ggcgggctgg gcggattccg cggtgccggc ggcaccggcg gtgccggcgg ggacggcggc 4036081 aacgccgggc tgttcggcga cggcggcgcc ggcggcgccg gcggcgccgg cgaggacggc 4036141 acaacgcccg gtggcaacgg tggggcgggc ggtgtcgccg ggctgttcgg cgacggcggc 4036201 aacggtggta acgccggagt tggcacgccc gcgggcaacg tcggcgccgg cggcaccggc 4036261 ggcctgctgc tcggccagga cggcatgacc gggttgacgt agccgcgtgg cggggccgcg 4036321 ccttgcttcc gggactacca cccgcaggtc gctggccgta gttggttctc cccgctagcc 4036381 caccactagc ttcgcttgcc gatagaacta gatcgtcgtc aacccggtgt cgtgggcacc 4036441 ttggccggcc ccgcccgcgc ggtggcggtc gccacacccg cgaacgcgac agccacctcg 4036501 acggtgacga ccacgtcgag gtccaccacc ctgcactgcg cgtgctcgac gcgcatcgca 4036561 cgggccacca gcgtcgcacg cgcgcaggcc gccgccagtc cggacggcag ccgggcggca 4036621 gcggctaacg aagccagatc agccgccgcc tgtgcgcggt gacgagccac caccgccgac 4036681 cctagatatg cacccgcacc ggtgacgcac agcagcaccg cgaccatcgc gacggcaagc 4036741 acggtggccg agccgcggtc accccggctc ggccaccgaa attgccctag cagcaatgtc 4036801 caacgtaggc aacaggtttg agtgtgctgt gacagtggcg accacaaact cgccgtcccg 4036861 gtgcacctgg accagcgccg cacgcggggc gatgctgcgg gcgacgtcgg tcgccgagcg 4036921 tacgtcaccg cgcgcggcca atcgagcggc ctcgcgggcc gcgtcgatac agcgcacctg 4036981 cattgatacc gcggtgacgc ccgccaggca cagcaccagc accagcacca gggtggcgat 4037041 cgccaacgcc gcctccacgg tgctcgcacc cgcacacgac gctaaacctt ggtgctgagc 4037101 gcgcgaccga tgatgcggtt gagcgccgac acaatggaat ccccggtgac gaccgtgtag 4037161 aggatcgcac cgaaggcagc cgccgcgatg gtaccgatgg cgtattccac ggtggacatg 4037221 cccgactcgt cgaccgccag cgccgtcatc cgcgccacga gtacacgaaa catggtgatc 4037281 accaacatat tcctttctca taccaggcca aactgcaaga catcaccggc cagcccgact 4037341 actagcggga caatgcccac acacagaaac gccggtaaga agcacagtcc cagcgggccg 4037401 gcgatcagca caccggcccg ctcggcggcc gccgcggccg cctgtgcggc gtcgtgccga 4037461 acctggacgg ccagttcgac aatgccatcg gcgagcgccg cgcccgaagc cgccgaacgc 4037521 cgtgccaacc gcagtaccgc atcggtctgc gcatcgtggg tgcccggcgg caaatccggc 4037581 ggcctcgacc aggcgatgtt ggggtcggca cccaatgcca gcaggtcggc ggcccggcgc 4037641 aacacgcgcg ccagccgcgg cggcgcgacc gcagcggtgg cggccgcggc cgtcgacacc 4037701 gccatccccg cagccagaca cacggccagc acgtcaaggc tggctgcgac ggctagcggg 4037761 tccgcgacat ccgtccgccc tagcagcagc ccctggtgtg gccgatgcgc gcggggcggc 4037821 ctcccggctc gcgcccgtac caccgacggg ccggcaccga gccacaacgc catggccagc 4037881 aacaccgccg ccgcactcac aacactggcc gatcggtgat ccggtccgac cacagcagcc 4037941 cggcgcaggc cagtgtcagc ccgaccacca gcagccatcc gcccacgcgt cccgtcagca 4038001 gaaagctcag cggccgggcg ccgatcagtt gaccaagcag caccccgagc agcggcagga 4038061 ttgccaatat ggccgcactg gcccgggcac cggccatccc cgctgacacc cgcgcggaga 4038121 accgttgccg ctcagcgaca tcacgttggg cggcacgcat caaactggct atcgccaagc 4038181 cgtgatcact gcccagttgc cagcagaccg cgagccgctc ccagtacgcg ggcagcgccg 4038241 aggatcgggc cgcagcgagc aggccagccg tgacgtcggc acccaatcgt gcccgcgccg 4038301 cgaccgcgcg caaggcaacg gcaaccgggc cgccggtctc gtcagccgcg atgctgaatg 4038361 cgcggactgg atgggcgccc gcgcgcagtt cacccaccac cagctcaagc gcggcctcca 4038421 gcgcctgccc ctcgcggctg cggcgcaggt agcggcgacg ccggcggtag cgcaggccga 4038481 gtgttgcgcc cagcaccgcg acagccacaa cggtcggtaa cggtagcaag gctgccacac 4038541 caaccgcgac acagccaaca ccccaggcaa cccgccgggc gccgaccaga agcacccgcc 4038601 ggccggtgtc gtctggagta aggcggcacc gcggcgaccc gggcaacacc acgagcgcaa 4038661 gcgacaaaat cagggcagcg gacgctatac cgctcatgcc gatgcccggc ttctcagcaa 4038721 atcgtgcagg gcggccgcgt cgtcactcat cccacggtcc gcgtgccaca ccgtcaccgc 4038781 ctggacccgc ccttcagctt ggcgcagcac ggcgatctcg gcgagccggc gacggcctgc 4038841 ccgatcgcgc gcgacgtgca gcaggacttg gactgccgcg gcgagctggc tgtgcagagc 4038901 agcgcggtca aggccgccga gcgcccccaa cgcttccatg cgtgcaggga cctcacccgg 4038961 gttgttggcg tgtacggtgc ccgcgccgcc ctcgtgaccg gtattgagcg ccgccaacag 4039021 atccaccacc tcggctcccc taacctcacc gaccacgatg cggtcgggcc gcatccgcag 4039081 cgcctgtcgg acgagttgac gcacggttac ctcaccgatt ccttcgacgt tcgcacgccg 4039141 cgcaaccagc ttgaccagat gtggatgccg aggggccagc tcggcggcat cctcgacgca 4039201 cacgatccgc tcatcgggcg acacggcgcc caacatcgct gccagcaacg ttgtcttccc 4039261 ggcaccggtt ccgccgcaca cgaggaatgc cagccgggcg gtgacgatgt cggcgaccag 4039321 cgcggcggcc gcggggtcga tcgcgcccgc cgcagccaac gcggccagat cctgagtcgc 4039381 gggacgcaac acccgcaacg acaagcaagt gccctgggtc gccacgggcg gcaacaccgc 4039441 atgcagccgc accgcgaacc ctccgacgcc gatcccggtt agttgaccgt ccacccaggg 4039501 ttgcgcgtcg tcgagccgac ggccggccgc caaagccagc cgttgtgcca accttcgcac 4039561 cgctgactcg tcagcaaacc gaatctggct gcgtcgcaat ccgtttccgt cgtccaccca 4039621 caccgagtcg ggcgcggtga ccagaacgtc ggtggtgccg tctgcggata gcagcggttc 4039681 gaggatgcca gcgccggtca gttctgtctg cagcacacga agattcgcca gcacttcggt 4039741 gtcgccgagc atccccccgg actcggcccg gatcgcggcg gccaccacac tgggccgcag 4039801 cgggccggat tcggatgcca gccgttcgcg gacgcgttcg atcagggagc cggtcatgcc 4039861 gccctaccgt gtcgccctga cccagcacgt ggcagcacac caagtacccg tcgggcagcc 4039921 gatgccagca ccgatcgccg tcgcagtcga agacccccgt gttccagctg ttcggctagc 4039981 cgcggctggg ccctcatgga tgccagtagc ggcaccccgg cgacgtccgc gacctctgcc 4040041 gcccgcaatc cccccgggga gggcccccgc accaccagac ccaggttggg gttgatcgcg 4040101 gtcagcacag gcgccatcgt cgcggcggcc gcacatgccc gcacatcgca tgggctgacc 4040161 aggacgacga gatcggcggc atccagcgct gcttgggtgg catcggtcag acgacgtgga 4040221 agatcgcaga ccacggtgac tcccccacgt cggccggcgt cgatcacggc gtccaccggc 4040281 ccggcgtcta actcgtagcc gcgccgagtt cccgagagca cgctgatccc ccgcggtcgc 4040341 ggcaatgccg cacgcaccgc cgaccaattc agccgtccac cctgtagcgc caggtcgggc 4040401 caacgcagac cgggggcggt ttcgccgccc accagaagat cgatgccgcc ggcccacgga 4040461 tcgagatcga ccaacagcgc atcagcggcg gcctgcgcca gggcaaccgc aaacaacgat 4040521 gccccagcgc caccgcgacc cccgatgacc gcgaccaccg ccccgcagat cccgtcatcg 4040581 cgtgccgatt cagcagcttc ggcgagctcg cggaccagtt caccctcctg ctcgggcatc 4040641 ctcagcacgt gctgggcccc gacggttatg gcagccgccc aggtcgccgt cgcggcttcg 4040701 gttccggtca acacgctgac gtgggtgcgc cggggtagcg cgagccgccc acaccggtcc 4040761 gccgccgcgt ggtcgagcac cacagccgcc gccgccgacc acgtctttct gctcaccaga 4040821 tggcggccgc cgagatgaac aacgcgaacc ccgacggctg cggcgactcg gtccagctcg 4040881 tcgcgcaacc ccggatcggt cagcatcgcc aacacgcccg agcccaccgg gtggctacca 4040941 gacgggccac cagggcctga gaagactgtc acccacccac cgtgcggggt ccatggtgtg 4041001 ggacaccagt cccaaaggcg caattgggga cagacgtgca actgtgcaca aacgcccctg 4041061 agggggtccg ggcaacacga ttcccgcaac gcccagaaag ctgggctaag caccgggctg 4041121 acgacgtttg cgtggctgcc aaaagggacg acccccgcca ggggggggag gaggcgaggg 4041181 tcgtcgtgca tcagccccgg ggggtcggac tgatacaccc tcggctatgg ccgagtaatg 4041241 cttactatac acatgacagt gcgcagtcac gcaagtaccg gacgcaatgg aaagcacagc 4041301 ttgagccgtg taaatgctct tgacttctcg acaacatcgg tagtcaattg acctgttcgg 4041361 gaacaaggtc gccggccggt ccaactgccg acctatgctg ggtcggtgac cgtctccgac 4041421 tcgcccgccc agcggcaaac cccaccgcaa acaccgggag gcaccgctcc gcgagcccgc 4041481 accgcggcct ttttcgacct ggacaagacc atcattgcca agcccagcac actggcgttc 4041541 agcaaacctt tcttcgctca gggactgctc aaccgccgcg ccgtgctgaa gtccagctac 4041601 gcgcagttca tctttctgct gtccggtgct gaccatgacc agatggaccg gatgcgcacc 4041661 cacctgacca acatgtgcgc cggttgggac gtagcccagg tgcggtcgat agtcaacgaa 4041721 accctgcacg acatcgtgac cccactggtg ttcgccgagg ccgcggacct catcgccgcc 4041781 cacaagctgt gcggccgcga cgtcgtggtg gtctcggctt cgggcgagga gatcgtcggc 4041841 ccgatcgccc gcgcgctggg cgcgacccat gcgatggcga cccggatgat cgtcgaggac 4041901 ggcaagtaca caggcgaggt cgcgttctac tgctacggcg aaggtaaggc gcaagccatc 4041961 cgtgagctgg ctgccagtga gggctacccg ctggaacact gctacgcgta ctccgactcg 4042021 atcaccgatc tgccgatgct tgaggcggtt gggcatgcct cggtggtcaa ccctgatcgc 4042081 ggcttacgaa aggaagccag cgtgcgcggt tggcccgtgt tgtcgttctc tcggccggtg 4042141 tcgctgcgcg accggatccc ggcaccgtca gccgcggcga tcgccacgac tgcggcggtg 4042201 ggtatcagcg ccctagccgc cggcgcggtc acctacgcgc tactacgccg cttcgcgttt 4042261 cagccctagc gacgatgcgg gccacacagt ggcccgagga ggaacggggc cacgaagcag 4042321 gccgccggat cgcgcccgag cgggcgggca gcaaacgtct agcccacgca atccaaagcc 4042381 gcttcgtaac tttcgcagaa ttgggccttg ctgtgttaaa ggtctagtag tacaaaggaa 4042441 ccacggaagc ccggtgaggc caaggctcga tccagaagag aaggttcggt ctcccgaccc 4042501 gggcgcccag catggttccc ggcacccacg cggagtcata gccacgataa cggcagaagt 4042561 gttgcgggtc tgcgtaattg cgaacagcag atggcatcga cggccctttg ggtggggcta 4042621 cagctagaag cgtcgcaaga tcgccgaggc cacccacgca accccaggag tgcacgcttg 4042681 gtaaccgaga accgtgttgg tgggcggcga ttcgagttct tcgggtcgcc gcccgctttt 4042741 tgttttctgg atcaagtatt acggccattc gaggcccgcc ggttagccgc tcggctatct 4042801 aggcgcgtaa ttcagtgacc gtttggccgg gctgtctcgc ggctgtgcca gatcacagcg 4042861 gcgaagtgcc gcagccgtga cccgctcggg gtagccgggc tgtttgagca accagacacg 4042921 ccgaacgtgc aaccacggcg gctccacccg gcggggcgtg tccccgccac caatgcacgt 4042981 tcggcgcagc cggcgcaccc tcggcgcgga gtttaggaac tactcatcca ggtgacaacg 4043041 actcggcaat cgacaaagcc tcccgcgcgc cgtcgagcat cgcgccgcaa cacagcaaca 4043101 gccagcccgc caccccatca ggtgtgcccc cggcgaacct gcgggcagcg tcgtggtatt 4043161 cggcgggttg gcgcatccaa atcacttcgg gaacacccag cccgtgcgga tccagtccgg 4043221 tggcgattgt caccagccgc gacaccgcgc gggccaccac accgtcggca cagccaaacg 4043281 gcctcagcgt caagagctcc ccgtgtgcga ccgcagcaac caccggcgcc gatgccaggg 4043341 tggggtgggt taccacatcc gcgagcaact ccaaacgcgg gccaacgtcg gcatcggacc 4043401 gcggacgccc aagccgatcg tcatcgacct ggtcggcggc cgccagcatg tgtaggcggg 4043461 ccagcgcctg caacggtgcc cgccgccaca ccccgaccac cggacccgcg ccgccttcca 4043521 gcgcctgccc cacccgaagc gctcccgcga acaccggatc gctgagcgcc ggcttgcccg 4043581 aggtgggcgc ccccgcgtcg tgcagccgcg caggaccacc gtcgagcacc gaggaggccc 4043641 gcgccgcccg caacgaggcc tcggcggcgg ccaccggcca gccccgcagg ttggtccggt 4043701 gccggtgcac gcggctcagc gcgtcgcgca cccggtcgct ggccgcagca acgcccggga 4043761 gctccattag cggagccagc gggtcgaccg tcacaggttg ccaacctttc ggggagctga 4043821 gggggcaccg ggaatggcct gaagcaactg gcgggtgtac tcgtggcggg gccggctgaa 4043881 cacctcctcg gtagaggcgt gctccaccac ccggccggcc cgcatgacca ggacgtcgtc 4043941 ggcaatctgc cggatcaccg ccagatcatg gctgatgaac aaatacgtca aacccaggtc 4044001 ggcctgcaga tcggccagca gatccaggat ctgtgcctgc accaatacgt cgagcgccga 4044061 caccgcttcg tcgcacacca atacctccgg gcgcagcgcc agcgcacgcg cgatcgctac 4044121 ccgctgccgc tgaccacccg acagctcacg gggccgccgg cccagtatcg acgacggcag 4044181 cgccacctga tcgaccagct cacgcaccgc cctttgccgc tgccggcggt caccgacgtg 4044241 atggacgcgt aacggttcct cgatggcgcg aaacaccgag tacatgggat ccaggctgct 4044301 gtatgggttt tggaacaccg gctggacccg gcggcgaaag gccagcacct ggtcccgggc 4044361 cagcgcgccg acgtcgtagg tgccgtcgaa aacgaccgtg cccgaggtag gttggagcag 4044421 cccaagcacc atccgcgcta gcgtcgactt gcctgacccg gattcgccga cgattgccag 4044481 ggtgctcgcc cgcggtagcc ggaatgacac tccgtcgacg gcgcgagact ccacccgccg 4044541 ccacggtgcg ccgcgggact cccggtaaat cttggtcagc tccgagacga cgagaatgtc 4044601 gccggcctgc gtggttgccc gtgaccggga ttccggcgga cgtctgctgc gcgccgtcag 4044661 cgatggagcc gcggccacca ggcgccgggt gtactcgtgc tgagggcttt gcaggattga 4044721 ctgcgccgca ccggattcca ccaccactcc acgacggacg acgacgacag cctcggcccg 4044781 ctgcgcggcc aacgccagat cgtgggtgat cagtagcagc gcggtgccta gttcgtcggt 4044841 gagtccctga agatgatcga gcacctgccg ctgcacggtg acatccaacg cggacgtcgg 4044901 ctcatcggcg atcagcagcc gcggcctgcc cgccaagccg atcgcaatca acgcccgctg 4044961 gcacatgccg ccggacagct gatgcgggta gcgtccggct tgcttcgccg gatccggcag 4045021 gcccgcctca gcgagtagct ccaccgcccg tcgtcgtgct gcgcgaccgt cggtattggc 4045081 ccgcaacgct tctgtgacct gaaagccgac cttccaaacc ggattgaggt tggtcatcgg 4045141 atcctgggga acatagccga tctcccgtcc ccttatcgac cgtagccgct tggcatcggc 4045201 cccggtgatg tcgcgcccgt cgaacacaac gcgtccagcg gtgatccgtc caccagccgg 4045261 aagcaaccca agaatcgccg cggccgtcgt ggatttgccc gacccggact cacccaccac 4045321 ggcgacggtt tgaccgctcc ggacggccag atccacccca cacacggcgg gagcatcggt 4045381 gccgaacgta acttccaggc cctccaccga caacagcggc gctgctggga cgctcatgcc 4045441 cgccatgccc gcgaagccgg atccagcgcg tcgcgcaaag cgtcgcccat catcatgaac 4045501 gccagcaccg taatcgccag cgcgcccgca ggatagaaca aaattggcga gcccgaccgt 4045561 agccgggtct gcgcgacatt gatgtcgcca ccccaggaca ccaccgacgt cggcaatccg 4045621 accccgaggt aggacagcgt ggcctcggtg acgatgaaga tccccagagc gacggtagcc 4045681 accgcgatca ccgggcccac ggcgttgggc agcgcgtgcc gaagcagaat ctgaaaccta 4045741 ttcaacccca atgccttagc tgcaaggacg taatcgctgg cacgcacctc gagcaccgca 4045801 ccgcgcgcga tcctggccac ttgcggccag ccgaacaatg ccaagatggc gatcaccgtc 4045861 cacaccgtgc ggtgatgcat gacttgcatg agcacgatgg cggccaacag caacggcaag 4045921 ccgagaaaca catcggtgac ccgcgaaacc accgcatcga tccagctccc gtaaaaaccg 4045981 gccaatgcgc ctaacgcccc gcccacgacg aacacggcca gcgttgcccc caacccgacc 4046041 gtgaccgaag cccgcgcacc atacaccgtg cgcgaataga tgtcgtggcc ctgcaggtcg 4046101 gtgccgaacc agtgcgcggc cgatggcgca agcatgcttt ggctgggatc ggcataggtg 4046161 ggatcggctg cggtaaacaa cgacggaaac gccgccacga caagaatcag caggatcagc 4046221 gccgcggcga tcacgaattt aggacgccgg cgcaacccgc gccaggcatc gagccagaac 4046281 cccgtgtgct cagccatagc ggatccgcgg gtccagggcc gcatacagca gatccaccaa 4046341 cagattggtg atcaggtaga tcagcaccag caccgtcacg atcgacacca ccgtcggcgt 4046401 ctcctgacgc gtgaccgctt gatacagcac gcccccgacg ccgtggatgt tgaagattcc 4046461 ttcggtcaca atcgctccgc ccatcagcgc gcccagatcc gcgcccagga aggtcaccac 4046521 cggaatcagc gaattgcgca gaatgtgcac cgtcaccacc cggggccgcg acaacccctt 4046581 ggcggtggcg gtgcggacat agtcagcgtg tgcgttggcc gccaccgccg agcgggtcaa 4046641 tcgcaccacg taggcgaatg acatggcgcc cagcacgatc ccgggtagca gcaggcggcc 4046701 gacgctcgcc cgttcgccca ccgtgaccgg cgcgatttcg agctggaccc cgaataagaa 4046761 ctgcgccaga aagcccagca cgaagatggg gatcgcaata atgacaagtc cggtaaccag 4046821 caccgcggaa tcgaagattc caccctgacg taggccggcg atcacgccga atccgattcc 4046881 gagcactgcc tccaccgcca gggcgatcaa ggccagcctg atggtgaccg gaaacgcatg 4046941 cgccagaacg gcactgaccg gcagcccaga atacgcacga cccaagtcac cgtgcagaat 4047001 tccgcccaga tagcgcaagt attgcacgag gaacggatcg tcgaggtggt aatgcgaacg 4047061 cagctgcgcg gccaccgcgg gagtcaacgg acggtcgccc gccagcgcgg caactgggtc 4047121 accgggcagc agaaagacca tgccgtagat cagcagtgtc gcgcccagga aaaccggcac 4047181 catcacggcg actcggcgcg caacatacca gcccatgtca ggccttgacg atgttctcgt 4047241 agtcgggcag accattccag gtgacggtga cgttgctgac ttgcgacgac catccgacga 4047301 cactgatgta atcccagagc ggcacaactg gcatgtcgtg aaacaggatt cgctgcgcgt 4047361 cgttgaccag ctcgtgggat tcggttaacg tgggggcggc ttcggcggcg gccagcgccg 4047421 cgtcgaattc cgggttgatg tagccgacgt cgttggatcc ggcgccggcg gtgaacagcg 4047481 gagcgagaaa cccgatcatc gacgggtagt cgccctgcca tccagcgcga aatgcactgt 4047541 cgatggcgcg gttggtgatc tgggtgcgaa atccggcgaa ggtgggctgc ggcgcggcca 4047601 ccgcatcgat gcccaacacg ttcttgatgc tgttggccac cgcgtccacc caatcccgat 4047661 ggccagcgtc agcgttatag gcgatcgcgt accggccgct ccacggtgag atcgcatcgg 4047721 cctgcgccca gagccgccga gcccgctgcg ggtcgtagtc cagcacctcg ttgcccggca 4047781 ggttgggatc gaagcccggc aacgaccggg cggtgaaatc gcgggccgga ctgcgggttc 4047841 cggcgaagat ctgctggcag atttgcggcc ggttgatggc ggccgacagc gccaaccggc 4047901 gcagccgccc ctcctcgcca ccgaaatgcg gcagccgcaa cggagtgtcg agggtctgat 4047961 tgatcgctgc gggcccgctg gtagcgtggt cgcccaggtc gcgctggtag accgtcaacg 4048021 cgctcggcgg aatcgtgtcc aggacatcga gattgccgga cagcaagtcg gcataggcgg 4048081 tgtccagatt ggcgtagaac tcgaatcgca aacctttgtt acggggcttg cggttgccgt 4048141 ggtagtcggg gttgggcacc aggtcgattc tgacgttgtg ttcccaggcc ggcccggctg 4048201 ggccgtcggc gagtttgtac gggccgttgc cgatcgggtt gcggccgaac gcggccatgt 4048261 cccgaaatgc ggagtccggc agcggataaa acgagctgtg gccaaggcgc aacgtgaagt 4048321 cgatggtcgg cgccttaagc cgcacggtga actccaggtc gttgaccacg cgcaacccgg 4048381 acatggtggt ccggctctta tcccctggcg cgccggccac gtcatcgaac ccttcgatcg 4048441 ggctgaaaaa gtgctgctgc agttgggcat tggtgctcag ggctccgtag ttccacgcgt 4048501 cgacgaacga gtgggccgtc accggcgagc cgtcggtgaa cttccagccg ggtttgacag 4048561 tgatccggta gttgacgtta tcggcgctct cgattgactg cgcgacctcc agcgacggct 4048621 tgccaacggc gtcataggac atcaggccgg cgaacaaccg atcgatgatg cgcccaccgt 4048681 tgctgtcgtt ggtgccggtc gggatcagcg ggttgggcgg ttccccgccg ttgaccagca 4048741 ccacgtcagg gctcaggaca ccgccgccac aaccggccac tggcgcaagc accagcaatc 4048801 cggtggcaag ggctgccagg gccgcccgca tccgacgcac catgacagcg accctaaagc 4048861 cttcttgtgc agtccggctc cccagccggt gaagtgcggc ctggccagcg cagccgacac 4048921 actcgccggt gaccgttagc taccacgcca cccagagtgc cggcgaaccg gtgggacgat 4048981 gttttgggaa cgctcacacc gtcgttcgcg atccggtgtt ggctacccac cgcgactgcg 4049041 cttcccaagg gaagacctcg cccgaccggg cgctgttggc gtgcggcatc ctcgaggagg 4049101 accggtggtg tcggcgctgt ggcgaggaag gcagcccgcg cgacaccgtg accaggaggt 4049161 tgactcactg gtgtgggctg cacccgggtg tgagcgtaga tcactcatgt cttagccgat 4049221 gctgccgctt ggattgccgc cgtcgtggcc cagcggtgcc ccaacgcgat ccgccgcgcc 4049281 gataaagcta accggtgcca acgaacgacg ccacatcgca catgtcgctc acgccagccg 4049341 atctccgttg ccggccaccg taaccgtcag cacgactcgg cacaatgcca gccgcacgct 4049401 gcaaggccga ccaacgtgtg atgtgtagcc tgcaagacac cggctttctt ggctatgact 4049461 gcatcctggt cagcgattgc actgtgacga ctttgcccag ctcaacctct gccatgccgg 4049521 ctgtatcgtc gcgcggttag gctcacatcc gtgagtgagt ccacccccga agtctcctcg 4049581 tcatacccgc cgccagcgca cttcgccgag cacgcgaacg cccgcgccga gctttaccgc 4049641 gaggccgagg aagaccggct ggctttttgg gccaagcagg ccaaccgact gtcctggacg 4049701 acgccgttca ccgaggtgtt ggactggtcg gaggcgccgt tcgccaagtg gttcgtgggc 4049761 ggcgagctca acgtcgccta caactgtgtg gatcgtcacg tcgaggccgg ccatggagat 4049821 cgggtcgcca tccactggga aggcgagccg gtcggcgacc ggcgcacgct gacctattcc 4049881 gatctgcttg ccgaggtatc caaagccgcg aacgcgctca ccgacctcgg tctggtggcc 4049941 ggtgaccgcg tcgccatcta cctgccgttg atccctgagg ccgtgatcgc catgctggcc 4050001 tgtgcccggc taggcatcat gcatagcgtt gttttcggcg ggttcaccgc tgcggccttg 4050061 caggcccgga tcgtcgacgc ccaagccaag ctgctgatca ccgcggacgg gcagtttcgg 4050121 cgcggcaagc catcgcccct caaggcggcc gctgacgagg cccttgcagc gatccccgac 4050181 tgctcggtcg agcacgttct ggtggtgcgg cgcacgggaa ttgagatggc ctggagcgag 4050241 ggccgcgacc tgtggtggca ccatgtcgtc ggctcagctt caccggcaca caccccggag 4050301 cctttcgatt ccgagcaccc gctgttcctg ctgtacacgt caggcaccac cggcaagccc 4050361 aaaggcatta tgcacaccag cggcggctat ctcactcagt gttgctacac gatgcgcacc 4050421 attttcgatg tcaagccgga cagcgacgtg ttctggtgca ccgccgacat cggctgggtc 4050481 accggccaca cctacggcgt ctacggcccg ctgtgcaacg gagtcaccga ggttctctac 4050541 gagggcacgc cggatacccc cgaccgacac cggcatttcc agatcatcga aaaatacggc 4050601 gtgacaatct attacaccgc ccccaccctc atccggatgt ttatgaagtg gggccgtgag 4050661 atccccgaca gccacgacct gtccagcctg cggctgctgg ggtcggtcgg cgaaccgatc 4050721 aaccccgagg cttggcgttg gtaccgcgat gtcatcggcg gcggacgcac cccgctggta 4050781 gacacctggt ggcagaccga gaccggctcc gcgatgatct ccccgctgcc cggaatcgct 4050841 gcggccaaac cgggttcagc gatgacgccg ctgcccggga tctcggccaa gatcgtcgac 4050901 gatcacggtg atccgttgcc accgcacacc gagggcgccc agcatgttac cgggtacctc 4050961 gtcctagacc agccgtggcc gtcgatgttg cgcggcatct ggggcgaccc cgcgcggtat 4051021 tggcactctt actggtccaa attttccgac aagggctact acttcgccgg ggacggcgct 4051081 cgcatagacc ccgacggcgc gatctgggta ctaggccgca tcgacgacgt gatgaacgtg 4051141 tccgggcacc ggatctcgac cgccgaggtg gaatcggcgc tggtcgctca ctctggcgtg 4051201 gccgaggcgg cggtggtcgg ggttaccgac gagaccacga cccaggccat ctgtgcgttc 4051261 gtcgtgctac gcgccaacta cgccccccat gaccgcacag ccgaagagtt gcgcaccgaa 4051321 gtggctcgag tgatctcgcc catcgcacgg ccacgcgacg tccacgtagt gcccgaacta 4051381 cccaagactc gtagcggcaa aatcatgcgt cgactgctgc gcgacgtcgc ggaaaaccgt 4051441 gagcttggcg acacgtcgac gctgctcgat cccaccgtat tcgacgcgat ccgggccgcc 4051501 aagtaggtcg cggcacgatc aaccgggtca gcccagccaa ctcaggccgg taccgggacg 4051561 aatcccgcgc ccggccggtt cttggcgttg atgtcggcca ggtcggcgtt gatcgacatc 4051621 accaccgccg gggtgtgcag cgggatgtat ttggtgatgc aactcggcag attgtcgctg 4051681 aatgcgccgt ggatcatccc gaccagcaga ttgtcgacgg tcaccggcgc accggagtcg 4051741 cccggtccgc cgcagacctg catcacaagg gtgcccggac tctcccctgg cccccaggta 4051801 accccgcacg agttaccggt ggtgcggccc tgcttgcagg cgatctggcc gaacgacggg 4051861 tccgggccaa tgccgttgat cgcaaacccg ttgaagacgg ccaccggggt caccttggcc 4051921 gggtcgaact tgatcaccgc gtagtccagg ccgtcgttgc cggcgaccat gatgcctacc 4051981 gggcccgcgt tctcggcacc ctcagcggcg atctgcgcgc ccgggccccc acagtgggcg 4052041 gaagtgaagc cgatgaggtc accgttcttg tcatggccga tggtggttag ggtgcacatg 4052101 gtgtccccgt tgacgacgat gcccgcacca ccgcccagcg gtagcttgtc gtcggctgcc 4052161 gcggtgttcg caggtaggca cacaacggcc aaaagcacgg ccgcgaatgc cgcggcaaag 4052221 cgcctgtgcg ccgtctgcaa cgcaatgctc ccgtcatatc gtcagacact tgagaacaga 4052281 tccgccagtt tagacgatcg caccgcaaca tcggcctctg ttcaaacggc cgcacacgtc 4052341 aagacgtggc taactctgtc ccgccgccct tggtgttggc tggcctcgta tggcaccgca 4052401 ccgcatggca acatgaaccg cgatgccagc cgaaccgctc ggcgacgatg cgggccggat 4052461 gacggcccga ggaggagccg agcaatcgaa ccgagctcgg cgacgatgcg ggccggatga 4052521 cggcctaggg tggggtaccg ccgctggcga gggcgagccg agcaatcgaa tcgagaggac 4052581 cgtctgtgag caagatcgat cgcaagaacg gtgtgcccag cacgctgacc acgattccgt 4052641 tggccgaccc gcacgccgga cctgctgagc cgtcgatcgg tgacctgatc aaagacgcga 4052701 caacgcagat gtcgacgctg gtccgagccg aggtcgagct ggcccgcgcc gagatcaccc 4052761 gggacgtcaa gaagggactg accggcagtg ttttcttcat ctcctcgctg gtggtcgggt 4052821 tctactccac cttctttttc ttctttttcg tcgccgaact gctcgatacc tggatctggc 4052881 gctgggtggc tttcttgctc gtgttcgcca taatggtcgt ggtcaccgcc gtgttggccc 4052941 tcttgggttt cctgaaagtc cggcgcatcc ggggaccgcg gcagaccatt gcgtcggtca 4053001 aagagacgcg caccgcactt accccgggcc atgacaaaac ccctgtgaca ccaaaacccg 4053061 tcacatctga tcgcgcgacg ccggttgacc cctcgggttg gtagatggcg gcaccagatc 4053121 cgtcgatgac ccgcatcgcc gggccatggc gtcatctgga cgtgcacgcc aacggcatcc 4053181 gattccacgt cgtcgaggct gtgccgtccg gccagccgga gggcccggat gcggctacgc 4053241 cccccatgca gccggccctg gcgaggccgc tggtcatact gctccatggt ttcggctcgt 4053301 tctggtggtc ctggcgtcat cagttgtgcg gcctgaccgg ggcgcgggtg gtcgcggtcg 4053361 atctgcgcgg ctacggcggc agcgacaaac cgccccgcgg gtacgacggc tggacgctgg 4053421 ccggcgatac ggccggtctc atccgtgcgc tcgggcaccc atcggcgacg ctggtcggcc 4053481 acgccgatgg cggactggcc tgctggacca ccgcgctgct gcattcgcgg ctggtgcgcg 4053541 ccatagcgct gatcagctca ccgcaccccg ccgcgctacg gcgatccacg ctgacccggc 4053601 gtgatcagcg gcacgcactg ttaccgacat tgctgcgtta ccagctgccg atctggccgg 4053661 agcgcttgct gacccgcaac aacgcagcgg agatcgagcg cctcgtgcgc gcccgtggct 4053721 gcgccaaatg gcttgcatcc gaggacttct cgcaagcaat cgaccacctt cgacaggcga 4053781 tccagatccc ggcggcggcg cattgcgcac tcgagtacca gcgctgggcg gtgcgcagcc 4053841 agctgcgcag cgaagggcgg cgattcatca gggcgatgac acagcaactg gggatgccgc 4053901 tgctgcactt acgaggcgac gccgaccctt acgtgctggc cgacccggta gagcgcaccc 4053961 agcgctacgc accacacggg cggtacatat ccattgccgg cgcaggacat ttcagtcacg 4054021 aagaggcgcc ggaggaagtc aaccgacatc tgatgcgttt cctcgagcag gtgcaccagc 4054081 tcagctgacg caggccccgg tgccgaccgg ttgggtagca ccgattttgg caagctgccc 4054141 cgccacctcg ccggccgtca gcacaaaccc agtttcggcg tcgtcgacgg ctgcgccgaa 4054201 caccacaccg agcacctgac cgttgaggtc gatcaggggc ccacccgaat caccttgctc 4054261 cacatcggct ctgatggtgt acacgtcgcg ggtaaccggc tccgggtccc cgtaaatatc 4054321 ggggccactg agtctgatgg cctcgcgaat cctggcgggt gtggcagtga aattgccgcc 4054381 gccgggataa cccagcacca caacgtcggc accggttttc gccggctccg cagcgaagac 4054441 cagcggcggc ggcggcaagt gcggaacggc caggatcgct acgtcgaccg acgggtcgta 4054501 ggacaccacc gtggcctcga agggcttgtc gccggcatac accgtgacgt tgttggatcc 4054561 ggccaccacg tgcgcgttgg tcatcacccg atcgggtgag atcacgaagc cggtgccctc 4054621 caacactttc tggcatctgg gtgccaggct gcggattttg acgacacttg gctcggtggc 4054681 cgccaccacc ggattgttga ccagcgctgg gtcgggtgag gccactggaa tgaccggcgt 4054741 gcggctgaac ggctccaaaa ccgcgggcag gccggaggtg ttcagcaggg ccgacagccg 4054801 cttgggcacc gtcttcagcc aggtgggtgc cgcctcgttg acccgggcga gcacccgcga 4054861 acccttcacc gcggcagcca gctcgggctg ctctttcgac tgtgtcagcg gcatcgccaa 4054921 caaccacgcc gcggtgagca ccacgaccag ctgcacccct accccaatga ccgagtcgat 4054981 caaccggatc ggccggttac ggatcgcccc gcggacggcg cggcccagca ccacaccagc 4055041 gacctcgccg actacgacca gtgccaggat caggaacagc gcggcaaaca gtttggcccg 4055101 cggagcgctg atttgactga cgatatgcgg cgccagcagc acgccggctg tcgcgcccag 4055161 cagcaccccg ccaaacgaca gcattgagcc cagcgcaccg gcacgccagc cggagatggc 4055221 tgcaataaat gcgaccgcca agacggcgat atccagccac tgcgacgggg tcatcgaatt 4055281 catcgcgggt cactctcgtc gtcgatcagc accattgccg cgtccaactc gcggatgtca 4055341 ccggtgtccc agggttgtgc ccagcccgcg acatcgagca ccgcggaaat cacctggcca 4055401 gtgaagcccc ataccagcat ctggtttaac aggaacgccg gcccggccca gcgacgagtg 4055461 tgcgggcggc ggtacaccat gagccgattg gccggattga tgaaggcgcg caccggtacc 4055521 cgcgcgacga tcgccgtttc ggcctcgttg acgacggcca ccggcccggg atccggcgag 4055581 tacgccagca ccgggacaac atggaaccgc gacggcgcaa tgaacgtccg ctccatggtg 4055641 gccagcggat gcagcctgga cgggtcaatc ccggtttctt cgttcgcctc acgcaaggcg 4055701 gtggccaccg gcccgtcgtc ggcggggtcg accacaccgc cgggaaaagc cgcctggccg 4055761 gcatggtggc gcaatgtcga ggcccgcacg gtcagcagta ggtcggcgtc gtctgggaca 4055821 ccaccgtcgc ctggcccggc ctccgggcca gaaaacagca ccagaacggc cgcctcgcgg 4055881 tgatcccggc gcgacgatgt cattgccgac acagccccgg cggccgtcac catcgctagc 4055941 acatcggcgg gcaaccgacg ccggtaggcg tcgggtatct ggccaacgtt gtcgaccagt 4056001 ggacgcagcc aggacgggcc ggcatcaggc cgcagggcaa ccgtcccccg tgaaccggtg 4056061 ggggtcgctc ccgcttgcag gggggtaccc ccagcactca tcggcgcctc ctttgggtcc 4056121 aaagttgccc agctcctctt caagccgcta acccggccca catcaccgcc gagtggagcc 4056181 cacctgctca gagcaggccg gaccggctac gcggcccgca ccgcaaccgt actcatcccg 4056241 cgtcgttccc gaccgcagcc acgatctcgt cggcactgcc gaaagcccgc ggcagggtct 4056301 gggcaacgct accgtccggc cgcagaacca ccgtcgcggg catcacattt gcgacccgca 4056361 gcgcggccgc caccctgcgg cggtcatcct gcagcgtcgg caaccggacg ccgagatcgg 4056421 ccagccgcga cagcgcggcc gcctcgttct ggccctgatg caccgtcacg accagcacgg 4056481 cgggcccgac ccgtcgttga tattcggcca tcacgggcag ctcggtcatg cacggcgcgc 4056541 accaatgcgc ccacagattg atgaccaccc gacgtccggc cagcgcgcgg gcgacgtcga 4056601 cggccgaacc gtcgcccgca cacaccacca caacaccgcg tagtgccgcc gcgcccggac 4056661 cgttacctgc cgcgggacag ggcggcaggt ttgcgcgctg ccgggaccaa gccaatgctt 4056721 ccggggtatc gccgtcgcga tgttcgcgcg gggcgggccg ctggctgatc gtgctcgagg 4056781 cggaatagtc atgcagttgg gcaaccagcg ccgccatcag cgctgccacc accgccagga 4056841 tcgcgatggt ccagcgggtc tttccggtta acgtcgtcat tgcggtctca gcgggggttg 4056901 ttggcaggct tggcattaca gtccagccag ggccagcagg tgatcggtct cggggccctg 4056961 gaccaggggc gccgcgagca gcggttcagt ggggccaagc ccgaaggagg ggcagtcttt 4057021 ggcaagcaca caaacaccac acgccggtct gcgggcgtgg cacacccgcc gtccgtgaaa 4057081 gatcactcgg tggctgagca aggtccactc cttgcgttcg atcagctcac cgaccgcctg 4057141 ctccaccttg accgggtcct ctgcggtggt ccagcgccac cggcgcacca atcgtccgaa 4057201 atgagtatcc accgtgattc cggggatacc gaatgcgtta cccaggatga cattggcggt 4057261 tttgcgcccc accccgggca gcgtcaccaa cttgtccatg gtggccggca cctcaccgcc 4057321 aaaccgctca actagggcct gccccaggcc gatgagagag gccgctttgt tgcggtagaa 4057381 gccggtgggg cggatgaggc tctcgagctc ggtgcgatcc gcctgggcgt agtcccgtgc 4057441 cgtccgatac cgcgcgaaca aggctggcgt cgtcaaattc acccgtttgt cggtgctctg 4057501 cgccgaaagt atggttgcca cggctagctc gagcggcgtg gtgaagtcca gctcgcagta 4057561 tacgtgcgga aatgcctgtg ccaaagcgcg attcattcgc cgcgcccgtc gcaccaaggc 4057621 gagccgggtt tctgcagacc agcgcccggg cacgtcggcg gcacgcgccg ctggcttcga 4057681 tctggatgac ttcgccgctg tcacctacga cagagtactg atttcgtgat ctcactgaga 4057741 cctcgtgttg attcgaagcc atgtttactc tccttgtgtc atggttgctc gtggcctgcg 4057801 ttcctgggtt gttgatgctg gcgaccctcg ggttgggacg gctggaaagg tttctggccc 4057861 gagacacggt cacggcgacc gacgtcgcgg agtttctcga gcaggccgag gccgtggatg 4057921 tgcatacgct cgctcggaat ggaatgccgg aggcgctgga ttacctgcat cgacgtcaag 4057981 cccggcgaat caccgattca ccgccgcttg ggtctggcgc tgggccacgg tatgccgggc 4058041 cgctgtttgt caccgatctc gatagccccg tcgagccacc ccggcatggc cagcccaatc 4058101 cgcagtttag aacggctcga cacgcaaatc acgtgtagcg ttggcacggc gaaccggttg 4058161 gcctacctct agactcttct cgttggcaaa cggttagtgt gcccgtatca cttcgtcgga 4058221 aagttgaaga ggcaacgtgg acgagatcct ggccagggca ggaatcttcc aaggcgtgga 4058281 gcccagcgca atcgccgcac tgacgaaaca gctgcagccc gtcgacttcc cccgtggaca 4058341 cacggtcttc gcggaagggg agccgggcga tcggctgtac atcatcatct cggggaaggt 4058401 caagatcggt cgccgggcac cagacggccg agaaaacctg ctaaccatca tgggcccgtc 4058461 ggacatgttc ggcgagttgt cgatcttcga cccgggtccg cgcacgtcca gcgcgaccac 4058521 gatcaccgag gtgcgggcgg tgtcgatgga ccgcgacgcg ctgcggtcat ggatcgccga 4058581 tcgtcccgaa atctccgaac agctgctgcg ggtgctggcc cgccggctgc gccgcaccaa 4058641 caacaacctg gccgacctca tcttcaccga tgtgcccggt cgggtggcca agcagctgtt 4058701 gcagctcgcc cagcgtttcg gcacccagga aggtggcgca ttgcgggtca cccacgacct 4058761 gacacaggaa gaaatcgccc agctggtcgg ggcctcacgc gagacggtga acaaggcact 4058821 ggctgatttc gctcaccgcg gctggatccg ccttgagggc aagagtgtgc tgatctctga 4058881 ctccgaaaga ctggcccgcc gagcgaggta agcgcgcgca gcgcgggcgc aaccgagcga 4058941 gctagcttcc tcacgcccag cagacacaga gtcgcacgca aacgacggat tttgtgcgat 4059001 tgtgcggctg ctcgcgctac cgagtccgca gatagtccag ttgtgcctgc accgaccatt 4059061 cggccgcatt ccaaagcttt tcgtcaacgt cgaggtagac gtgttcgacg acctcgcgga 4059121 ccgtggcgtc gtcaccgaga tcccgcaacg cggcgcgtat ctgctccaga cgttcgtgcc 4059181 ggtgcagcag gtatcccgat gcaatcgctt ccaggtcgag caagtccggc ccgtgccccg 4059241 gcagcacggt ccgccggccc aggccacgca gccggtgcag cgattccaag tagtcggcta 4059301 ggctgccgtc ttccttgtcg atgacggtgg tcccgcaacc caacacggtg tcggcggtca 4059361 acacggcgtc gtcgaggaca aatgacagcg aatctgcggt gtggccaggg gtggccaaca 4059421 cggtaatggt taacccggca acgtcgatca cttccccgtc ggtcagcgtc tccccatcac 4059481 gtcgcaagaa ctgcggatcc gcggcccgta ccggcgcccc ggtcagcgcg accagtttgt 4059541 cgatgccgct ggtgtggtcg ccatgacgat gactgatcag taccaacgcg atgcggccaa 4059601 gcgcggcaac ccgtgccagg tgctcgtcgt cgtccgggcc tggatcgaca acgaccagct 4059661 cgtcactgag cgggccgcgc agcacccagg tgttggtgcc gtccaacgtc agcaaaccgg 4059721 ggttgtcggc caacaggacc gacgcggtgt cggtgaccgc gcgcagctgg ccgtaggcgg 4059781 gatgggtcag cgactcagct gtcttcgaca tcggccgcta gccgacctcc acgatcaact 4059841 cgacttccac cggcgcatcc aacggtagct cggatacgcc gaccgccgaa cgcgcatgcg 4059901 cgccgctatc gccgaacacc tcggccagca gatcggaggc cccgttgatc acgctcggct 4059961 ggccgtgaaa ccccggtgcc gaagcgacaa acccgacgac tttgaccacc cgggtcaccg 4060021 cgtcgagatc caccagcgaa tcaacggctg ccagcgcatt gagcgcgcag atccgcgcga 4060081 gcgtcttgcc ctcctccggg ttgacgtcgg cgccgagctt gccggtccgc accagcttgc 4060141 ctgcctccaa cggcagctgg cccgcggtgt agaccaggtt gccggtgcgc acagctggaa 4060201 cgtaggccgc cagcggcgcc gccacttgcg gtagcgtgac accgagttgc cctaatcggg 4060261 ctttagcgct cattaacccc gatacctcct acttcgggcg cttcaggtaa gcgacgtgct 4060321 gctcaccggt gggcccgggc agcaccgcca ccagctccca gccatcggct ccccactggt 4060381 cgaggatctg tttggtggcg tgcgtcaaca gcgggaccgt ggcgtactcc catgcggtgg 4060441 gttgggtcat gacgcgagct tatcggtcgg actggacccg ctccgctcag cccggtagcc 4060501 cggaaagatc gccaggccat cgggctagca tgccatggtg gcaaccacat ctagcggcgg 4060561 tagttccgtc ggctggccgt cacgcttgtc gggggtccga ctgcaccttg tcaccggcaa 4060621 aggcggtacc gggaagtcga cgatcgcggc cgcgctcgcg ctgacgctgg cagcgggcgg 4060681 ccgcaaagtc ctactcgtcg aagtcgaggg gcgccagggg attgcgcaac tcttcgacgt 4060741 cccgccactg ccctaccagg aacttaagat cgcgaccgcc gagcgcggcg gccaggtcaa 4060801 cgccttggca atcgacatcg aggccgcctt cctggaatac ctcgacatgt tttacaacct 4060861 cggtatcgca ggccgggcca tgcgccgtat cggcgcggtc gagttcgcga cgacgatcgc 4060921 gcccggtctg cgcgacgtgc tgctcaccgg caagatcaag gagacggtgg tgcgcctcga 4060981 caagaacaag ctgccggtct atgacgcaat cgtcgtcgat gcgcctccga ccgggcgcat 4061041 cgcgcgcttc ctggatgtca ccaaggcggt gtccgatctg gccaagggcg gaccggtgca 4061101 tgcgcaaagc gaaggcgtgg tgaagttact gcactccaac cagaccgcca tccatttggt 4061161 cactctgtta gaagcgctgc cggtgcagga gacactggaa gccatcgagg agcttgcgca 4061221 gatggaactg ccgatcggca gtgtgatcgt gaaccgcaac atccccgccc atttggagcc 4061281 tcaggacttg gcgaaggccg ccgagggcga ggtcgatgca gactcggtgc gggccgggtt 4061341 gttgacggcc ggggtcaagc ttcccgacgc cgatttcgcc ggcctgctta ccgagaccat 4061401 ccagcatgcc acccgaatca ccgcacgcgc cgaaatcgca caacagcttg acgccttgca 4061461 ggttccgcga ttggaattgc cgacggtctc tgacggcgtc gaccttggca gcctctacga 4061521 gctctcggaa tcacttgccc agcagggggt tcgatgagtg tcacaccgaa gaccctcgat 4061581 atgggcgcaa tcctggccga cacatccaac cgggtggttg tgtgctgcgg cgccggtggg 4061641 gtcggcaaga ccactaccgc ggccgcgctg gcgttgcgcg cggccgagta tggccgcact 4061701 gtggtcgttt tgacgattga cccagccaag cgattggcac aagcactggg gatcaacgat 4061761 cttggcaaca caccacaacg cgtgccattg gcacccgagg ttcccggcga gctacacgcg 4061821 atgatgctcg acatgcgccg cacgtttgac gaaatggtta tgcaatactc tggacccgaa 4061881 cgggcgcaat cgattctgga caaccagttc tatcagaccg tcgccacatc gcttgccggc 4061941 acccaagagt acatggctat ggagaagctg ggccaactgc taagccagga ccgctgggac 4062001 ctgattgtgg tagacactcc gccgtcgcgt aacgcgctgg acttcttaga cgcgccaaag 4062061 cgactgggca gcttcatgga tagtcggctg tggaggctgt tactcgctcc cggccggggc 4062121 atcgggcggc tgatcaccgg cgtgatggga ttggccatga aggcgttgtc caccgtgctc 4062181 ggttcccaga tgctggccga cgcagcagcg ttcgttcaat cgctggacgc cacgttcggt 4062241 ggtttccgcg agaaggcaga ccgcacttac gcgttgttga aacggcgcgg cacccagttc 4062301 gtggtggtgt cggcggccga acccgacgca ctgcgcgagg cgtccttctt cgtcgaccgg 4062361 ctatcgcagg agagcatgcc gctagcgggg ctggtcttca accgcacgca cccgatgctg 4062421 tgcgcattgc cgatcgagcg ggcaatcgac gccgccgaaa cgttggatgc cgagaccacc 4062481 gactccgacg ccacatcgct ggccgcagcg gtgctgcgta tccatgccga gcgcgggcag 4062541 acagccaaac gggagatccg gctgctgtcc cggttcaccg gagccaaccc caccgtgccg 4062601 gtcgttgggg taccgtcgct cccgtttgac gtctctgacc tggaagcgct gcgggcgctc 4062661 gccgaccagc tcaccacggt cggcaacgat gcgggccgcg cagcgggccg ctgaggaacc 4062721 ggcccatcag tgacggtcgg cgacgatgcg ggccgcgcag cgggccgctg aggaaccggc 4062781 ccatcagtga cggtcggcga cgatgcgggc cgtacaacat ctgaccggga tccggctatt 4062841 gggcacaagc cagttcctat tgggcacaag ccaattagaa atgaatggct tttgctgtaa 4062901 ccaaaccgta atcagaagcg acgggaccgc ggcacctatc cgcagtccct gagtggctat 4062961 ccggcggtgc cggtgcggcg cttgcgcttc tcaaggtagt ccgaccacga aaccacctcg 4063021 ggatgttgct tgagcagagc cctgcgctgg cgctcggtca tgccacccca aacaccgaac 4063081 tcgaccttgt tgtccagcgc atctgccgca cactcttgca ttaccggaca gtgacggcag 4063141 atcaccgcgg ccttgcgttg tgcggctcct cgaacaaaga gttcgtcagg gtcggtagtc 4063201 cggcacagcg ccttggatac ccacgcgatc cgctcttccg cgtctacgct gcgtaccacg 4063261 ttctgtgcag ccgtgaggtt agtccttcga gcggctggac gggttcctga cacgagctga 4063321 tcccttcctc ccggccgccg tgtgcgaccg ccctcctcgg aaacagccga tgctgcgagc 4063381 gacgccacac catgcacatc ggtgttacct gtatctcact gatctgtata agttaggtgg 4063441 tcgtgtgcca attgcgcaac agtacgataa cgcttttttg ggacgagcgt gccgtcttgt 4063501 ctggatcggc cgggggaaat gccgccgctt cggtcccgtt tacggggtct gaccagtgac 4063561 gcagccgcaa atatcgcgcc cgccccgatc ccgcagtgac tcacccgccc gcggaaagat 4063621 tctattggac cgagcggcac ggtggagtga caggaggtcg ctactgtagt acgcatgccc 4063681 gagcgcctcc cggccgcgat caccgttctg aagctggctg ggtgctgtct gttggccagt 4063741 gtcgtcgcca ctgcgctgac gttcccgttc gcaggcgggc tagggctgat gtccaatcgt 4063801 gcctctgagg tcgttgccaa cggctcggcc cagctgctcg aggggcaagt gcctgcggta 4063861 tcgacgatgg tcgacgcgaa gggcaacacg atcgcgtggc tgtactcgca gcgccggttc 4063921 gaggtgccct cggacaagat cgccaacacg atgaagctgg cgatcgtctc gattgaagat 4063981 aagcggttcg ccgaccacag cggcgtggac tggaagggca ccctgaccgg cctggcgggc 4064041 tacgcgtccg gcgacctcga cacgcgcggc ggctcgacgc tcgaacaaca gtacgtgaag 4064101 aactaccaac tgctggtgac agcccaaacc gatgccgaga agcgagcggc cgtcgaaacc 4064161 actccggccc gcaagcttcg cgagatccgg atggcactca cgctggacaa gaccttcaca 4064221 aaatctgaaa tcctgacccg atacttgaac ctggtctcgt tcggcaataa ctcgttcggc 4064281 gtgcaggacg cggcgcaaac gtacttcggc atcaacgcgt ccgacctgaa ttggcagcaa 4064341 gcggcgctgc tggccggcat ggtgcaatcg accagcacgc tcaacccgta caccaacccc 4064401 gacggcgcgc tggcccggcg gaacgtggtc ctcgacacca tgatcgagaa ccttcccggg 4064461 gaggcggagg cgttgcgtgc cgccaaggcc gagccgctgg gggtactgcc gcagcccaat 4064521 gagttgccgc gcggctgcat cgcggccggc gaccgcgcat tcttctgcga ctacgtccag 4064581 gagtacctgt ctcgggccgg gatcagcaag gagcaggtcg ccacgggcgg gtacctgatc 4064641 cgcaccaccc tggacccaga ggtgcaggca ccggtcaagg ccgccatcga caagtacgcc 4064701 agcccgaacc tggccggtat ttccagcgtg atgagcgtga tcaaaccggg taaggatgcg 4064761 cacaaggtgt tggccatggc cagtaaccgc aaatacgggc tggatctaga agccggcgaa 4064821 accatgcggc cgcagccatt ctccctggtt ggcgacggcg ccgggtctat cttcaagatc 4064881 ttcaccacgg ccgctgctct ggacatgggc atgggtatta acgcccaact cgacgtgccg 4064941 ccccgattcc aggccaaagg tctgggaagt ggcggggcaa aggggtgccc caaagagacc 4065001 tggtgtgtgg tgaacgccgg caactaccgc ggctcgatga atgtcaccga cgcgctggca 4065061 acctcgccaa acaccgcgtt cgccaagctg atctcgcagg tcggggtggg gcgtgcggtc 4065121 gatatggcca tcaaactcgg gctgaggtct tatgcgaatc ccggcaccgc acgcgactac 4065181 aaccccgaca gcaatgagag cttggctgac ttcgtcaaac gacagaacct gggttcgttc 4065241 accctcggcc ccatcgagtt aaacgcgctg gagctgtcca acgtggcggc cacgttggca 4065301 tccggcggcg tgtggtgccc ccccaaccca atcgaccagc tcatcgaccg caacggcaac 4065361 gaagtcgcgg tcaccaccga gacgtgcgac caggtggtgc ccgcagggct ggcgaacacc 4065421 ctcgccaacg cgatgagcaa ggacgccgtg ggcagcggca cggcggccgg ttcggccggc 4065481 gcggcgggct gggatctgcc gatgtccggc aaaaccggca ccaccgaggc gcaccggtcg 4065541 gccggcttcg tgggcttcac caaccgctac gcggcggcga actacatcta cgacgactcc 4065601 agctcgccga cagatctgtg ttccggcccg ctgcgccatt gcggcagcgg cgacttgtac 4065661 ggcggcaacg agccatcccg cacctggttc gccgcgatga agccgatcgc caacaacttc 4065721 ggcgaagtgc agctaccacc gaccgatcca cgctatgtcg acggcgcacc aggctcacgg 4065781 gtaccaagcg tggccggtct ggatgtcgac gccgcacgcc agcgcctcaa ggacgcgggc 4065841 ttccaggtcg ccgaccaaac caactcggtc aacagctccg ccaagtatgg tgaggtggtc 4065901 ggaacgtcgc ccagcggtca aacaattccg ggttcgatcg tcacgatcca gatcagcaac 4065961 ggcatcccgc cggctccgcc tccgccaccg ctgcctgagg atggtgggcc gccaccgccg 4066021 gtcggatcgc aggtggtgga gattccgggg ctgccgccga tcaccattcc gctgctggcg 4066081 ccaccacccc cagcgcctcc cccgtaggcc ctcccaatcg gcctcgtgcc gctgcagacg 4066141 cgcgatcaga cctcgaccgg cagtaggctg cgtgcatggc tgctgtcttg cccaccttga 4066201 tccgcaccgg cgccgtggcg ttgggctcgg ccatcgccgg gattggttac gctgcgctgg 4066261 tcgagcgcaa tgcattcgtc ctgcgcgagg tgaccatgcc agtcttgact ccgggctcca 4066321 caccgctgcg ggtgctgcac atcagcgatc tgcatatgct gcccaaccag caccgcaaac 4066381 aggcctggct gcgcgagctc gccagctggg agccggatct ggtcgtcaac accggtgaca 4066441 acctggctca ccccaaggcg gtgcccgccg tcgtccaaac cctgagcgat ctgctgtccc 4066501 ggccgggtgt cttcgtgttc ggcagcaacg actactttgg gccgcgcctg aagaacccaa 4066561 tgaactatct gaccagcccg gatcaccgcg tccgcggagc agcgctgccc tggcaggatc 4066621 tgcgggcggc gttcaccgaa cgtgggtggc tcgacctaac ccatacccgc cgcgagttcg 4066681 aagttgccgg tctgcacatc gccgctgcgg gcgtcgacga cccgcatatc gaccgagacc 4066741 gctacgacac catcgccggc ccggccagcc cggccgccaa cctgcggctg gggctcaccc 4066801 attcaccgga gccgcgggtg ttggaccgct tcgccgccga tggttaccag ttggtgctgg 4066861 ccggccacac ccacggcggg cagctgtgcc tgccgttgta cggggcgctg gtcactaact 4066921 gcggtctgga ccgctcccgg gccaaaggag cgtcacactg gggtgcaaac atgcggctgc 4066981 acgtctccgc cgggatcggc acttcgccgt ttgcgccggt gagattctgc tgccggcccg 4067041 aagcaaccct gctgacgttg atcgcgaccc caatgggcgg gcgcgattcg agcagcaacc 4067101 tgggccgctc acagccgaca gtgtcggtgc gttgagcggc ggggcctgta tcgcggtccg 4067161 cagcctatcc cggagctgga cggacaacgc gatccggttg atcgaggcgg acgcccgccg 4067221 tagcgccgac acccacctgc tgcgctaccc actgcccgct gcctggtgca cggatgtcga 4067281 cgtcgagctg tacctcaagg acgagacgac ccatatcacc ggcagtctca aacaccggtt 4067341 ggcacgttcg ttgttcctct atgcgctatg caacggctgg atcaacgaga acaccacggt 4067401 ggtggaggca tcgtcgggtt caacggcggt gtccgaggcc tatttcgcgg cgctgctggg 4067461 tctgccgttc atcgccgtga tgccggccgc gaccagcgct tccaaaatcg cgttgatcga 4067521 atcacaaggt ggccgttgtc atttcgtcca gaattcaagt caagtgtacg ccgaggcgga 4067581 gcgcgtcgcc aaggaaaccg gcggccacta tctggaccag ttcaccaacg cggagcgcgc 4067641 aaccgactgg cgcggcaaca acaacatcgc cgagtcgatc tacgtgcaaa tgcgcgaaga 4067701 gaagcacccc accccggaat ggatcgtcgt gggtgcgggc accggcggaa ccagcgcgac 4067761 gatcggccgc tacatccgct accgacggca cgcgacccgg ctgtgcgtcg tcgatccgga 4067821 gaattccgcg ttcttccccg cgtactccga aggccggtac gacatcgtca tgcccacatc 4067881 gtcccgtatc gagggcatcg gccggccgcg ggtcgagccg tcgtttctgc ccggtgtggt 4067941 cgaccgcatg gtggcggtcc ccgacgcggc gtcgatcgct gccgcccggc atgtcagcgc 4068001 cgttctgggg cgccgagtgg gaccgtctac cggcaccaac ctctggggcg cgttcggact 4068061 gctcgccgag atggtcaagc agggccgcag cggctcggtg gtcacactgc tcgccgacag 4068121 cggcgatcgc tacgccgaca cctacttttc cgacgagtgg gtcagtgccc aggggctcga 4068181 tccggccggg ccggctgcgg cgctggtgga attcgagcgc tcctgtcgat ggacgtgacg 4068241 gtcggacctg cggtttggct agtcaacggt ccggtgcgat aggctgtcgt ggcttcaagc 4068301 ggggtgtggc gcagcttggt agcgcgcttc gttcgggacg aagaggccgt gggttcaaat 4068361 cccgccaccc cgaccgagag atcgctgacg acagccttac ccggcgcagc gtggtagctt 4068421 gctgcagtct gctcgggcgg cagcgccacc ctgacggtgc tggttgacca tgccggacag 4068481 cacgtcaacg cacaggcatt tccaacggaa gttgtaggtt accggccgcc ctaaaacacg 4068541 gtgcactttt cgttaaaggt tgtgggtgtg gatccaacga aattcgttgc cccggcgtgg 4068601 gcagcgccgt gtccacaggg ggacccgccg cgcattacgc ctatgggccc acccccgtac 4068661 cgcgggagtt ggctctgcac cccgagccaa tcatgcttct ctcggagtcc gacgcgggac 4068721 tgggacgact cgcatgagcc ggacgcctcc tgcctgaccc ccacctgcta ggaacgtaaa 4068781 ccgggagagt ttcgtcggag ccagaattgg atttcctccc cgagcaatcg gcccgaaacc 4068841 gcggggttgt ttccgccgac cgtcgacaac atgtggcgtg cgttggatga ctgggaaatg 4068901 tatctccacg acgcagcgcc acaactgccg ctcttgatcc gttgcgccct ggtgcattac 4068961 caattcgagg cgatcgggcc atttctcgac ggcaacgcac gactcgggcg tctgttcatc 4069021 atcctttgcc ttgttgcatt gggacggttg ccgctaacgg gcggggcgaa accgcacccg 4069081 agtgccgcgg cggggcacaa gcatgatgga gcgccgcacg atccgctctg gctcgtcgtc 4069141 gacggcggtg aactcgccct cgcgcagcag cacgtgcaat acggtgatca actctcgcat 4069201 cgaaaagttc gcacccagac agcgtttcac gccgccgccg aacggaaccc aggcataggt 4069261 ttgcggccgc gtaccgagga accgctcggg gcggaactcg tgtgggtgct catacacctc 4069321 ggcgctgcgg ttgatcgcga tgatgtggac cacgattcgt gtgccagcct ccacacggta 4069381 accgccgatg gttagtggtt gcgcggcgac acgagccgtc aacggcgcgg gcggacgcac 4069441 ccgcaacgtc tcgttgatca ccgccgtcgt gaaggcttcc ccaccgccaa cggcctccgc 4069501 tcgcacgcgc cgcaacgcgt ccggatggtg cagcagcaag tcgaacgccc acgccaacgt 4069561 ggtcgccgtg gtttcatgcc ccgccagcac gagggtgatc agatcgtcgc ggatctcgct 4069621 gtctgacaac tgttccccgg actctccgcg cgcgctcacg agcaacgaca ggacgtcgtg 4069681 tcgctcgccc aggcgtggat cggcgcgccg ctgcgcaatg agcgccatga cgacgtcgtc 4069741 gatctcggtg ttggcgcggg cgcgtgcagg ccagactcgt agtgcgccca accgacgcag 4069801 tgcgtagcgc acggtcaact gctctgaaac accaagattc aacagccgct cgaacggccg 4069861 gcccaagcgc cggacctcct cggggtcgtc gaccccgaat atgaccttga cgatcacatc 4069921 cagcatcagc gaccgcgcca ccgtcaacat cgcaaacgga cggtcaaccg gccatgtatg 4069981 catcgccgcg cgagtggagt tctcgataat cggaacgtaa cgatccagcg cagcgccatg 4070041 taatggcggc gtcaagagtt ttcgacgtcg aagatgctgc ggctcctcct ggacaaacat 4070101 cgaccccgac ccatagatcg ccgctgccgg ccccaccccc tcgcccccga gcaggacgtc 4070161 ggtgggagcg gtgaaaacct ccttggccag cgccgagtcg gacacgatcg caacgtcacc 4070221 caggctgaga atgggcatcg tcatgatcgg tccgtaccga cggatcagtc gcagcatccg 4070281 gcgctcgcca cccgccaggt aggcaaccgc gtaggcggcc gcgaaggccg cgcgaaatcc 4070341 acggggcgcc ggcaagcccg gtgggccgcc caaagcatcc ggcgcgtgct cccggcgaac 4070401 cgcaaacgct gccacgccta cgacagaagc acagcgtttc gggtcggtca acgcagcagg 4070461 gctagcaagc gacctcagca ccatcggttc ccgaaggtgc ggtccggcgc taccgcgtcg 4070521 aaaatcgcag accgcgccag ccggttggga atgaggccgt ttcaccggcg ggcgtcccgc 4070581 gcagcgtttc gccgcagacc ctatgttggc catgcgcgat ataggccacc cggcaccaag 4070641 gtgccatgac cgccacaacc agggccgcgg cggcaaccgc caggtgtccg atcgtcagcg 4070701 caactaaacc cgcaaccagc ccgacagccc acacggcagc caccaccagc cccggcacat 4070761 tgacactcgc ggggacgctg ccgctaccgc ttggctgcgg cgcggatgca tgatcaccgg 4070821 cgtcactccc ggtgtagacc atgaccactc ccagcgataa aaggttgccg atcaaggtaa 4070881 cccatacagg ccgtcggcag ccaccggcga acagctcttc gaggatgccg tcaggacatt 4070941 gacagctacc agacaccatt tccacaccgt caaaatgtgg cgcgtgacac gcacggcggc 4071001 acgctggcaa cgtggcgtgc gccgcaggcc tcgactatct ggtgccgatc acagcatgca 4071061 tgctcgtcgg tttgtacggc gttaccggtc gatcctgccg ccccgtaccc cagtcaaggc 4071121 atcgtggagc gtggaaaaca accgaaaggt cttgtccaga cccatcagat gaatcggcct 4071181 tctggttacc gaaccccgtg caaccacccc gaacttgacg gattggccta tcttttcgga 4071241 tgttgccgcc aaaatcttca gccccaccga ccccagaaac tccaccgcgg aaaggtcgat 4071301 gactagcgcc gtcggattgt cggccacaac ttcgccgatg gcctcttaaa gtgccgcagc 4071361 ggtgatcaaa tcaatctcac caccgatgct gagcacggcg accccgttat ggtcggcaac 4071421 cgtgacggtg atcgagtcgg gagctgacaa tggcgatcct cttgtccgag ccgtccgtgt 4071481 ggtgaaagcc tagcccgcct gcgaactgcg gcggcggccc atcagcgtag gatttgccgg 4071541 ctgcacacga cctgtgtgcg ggccgcaatc gcggcgaagg cgctggggcg tgggtgaatt 4071601 gcctaacaac cctcgagtgc ggacacgcat atagcctccg tcgaaattgg cctataggcg 4071661 ttccttgacc gccgccgaca agcgtgcgcc gtcggctttc ccggcggcga tcacggtggc 4071721 cgccttcatt accagaccca tctgtttcat gctgggccga tgcccgagtt cttcggccac 4071781 ctcggctatg gcggtgtcgg cgacatcggc cagttccccc tcggtgagcg gcgtcgggag 4071841 gtactcgtca atgatccgtg cctcggcatg ctcggtggcg gcgagctcac cgcggccgtt 4071901 ttgggtgtag atctccgccg cctcaccacg cttgcgcgat tccctggcca acaccttgat 4071961 cacctcgtcg tcggagagct ctcttgcctg cttgccagag acctcctcgg tctggatcgc 4072021 ggccagcagc atgcgtatgg tcgcggtccg cagcttgtcc tgcgtcttca tcgcttgggt 4072081 caggtctgac cgaagctggg atttaagttc cgccattgca caaacgctac gcgccgcaac 4072141 gcccgaaacc cgacactgag acctacattg agaaatgcac cgaccgccga caggatggag 4072201 gccatgacga acgacgacag ctgctgcgtc cggtgagcat gctgccgcct ggctacccgg 4072261 ttgaaccacc gcccgtggcg ccgggatatg cgccggccgg atatccgccc taccccgcta 4072321 caccacccgg gtacggcccg ccgggttatg gtgcgccgcc cagctatggc cccccgcctg 4072381 gctatggtcc acccctcggc taccccgccg caccgcccgg ctgcggccca ccgcccggct 4072441 atggcccacc gctcggctat ggcccaccgc tcgccccggg cgcggtcaaa ccaggaataa 4072501 tcccgctgcg gccgttgacc ttgagcgata tcttcaacgg cgcggtcggc tacatccgcg 4072561 ctaacccgaa ggcgacgctg ggattgaccg ccatggtcgt ggtgaccctg caaatcatct 4072621 cactggtggc cctatttggc cccatgaccg ccttcggtga catcgtgacc ggggagcccg 4072681 acgagctgac cggcgcggtg gtgggcggtt ggtcagcgtc attcggcgcc agtctcctgg 4072741 tcagctggct agcgggtgtg ctgctcagcg gcatgctcac cgtcatcgtc gggcgggccg 4072801 tgttcggttc gccgatcacc gtcggcgagg cgtgggccaa ggttcgcggt cgcctgctcg 4072861 cgttgttcgg cctggcactg ctggaagcag ccggcgtggt ggcggtgctc gggctggcgg 4072921 tcgtcatact ttccggggtc gcggcggcgg ccaacgaggc agcggcggcc ctcctcggct 4072981 tcccgctgct gctcgtggtt ggggtgtcgc tggcctattt gtatgtcgtc ctgctgttcg 4073041 cacccgtgct gatcgtgctg gagaggctgc ccatcgtcga ggcgatcacc agatcctttg 4073101 cgctcgtgcg tcatggcttc tggcgggtcc tgggcatccg cctgctgacg gtgctggtgg 4073161 tgggcgtagt tggtaatgcg atcgcggctc ctttcatgat cgtcggcgag atagtgacgg 4073221 ccgtcacagc gtccgacggg tcagtcacca tgcggctcgt cggcgctacg ctctcggcca 4073281 tcggagtgac gatcggccag attgtcaccg cgccgttcag cgccggagtt gtcgtgctgt 4073341 tatacaccga ccgccgtatc cgtgccgagg ccttcgacct ggtattgcag accggcttag 4073401 aagccggccc cgccggcggg cccgccccgg tggagtccac cgacaaccta tggctcacgc 4073461 ggcctttcta aagggagtta gtgaggacag gctgacagtg ccctccatcg acatcgaccg 4073521 cgaagccgca caccaagccg cacaacgcga gctcgacaaa ccgatctacc ccaaagactc 4073581 cctgaccaag gaactcaccg actggatcga cgagcagctg taccggattt tggagaaggg 4073641 atcctcgata cctggcggtt ggttcaccat caccgtgctg ctcatcttgc tgatgatcgc 4073701 ggtgaccgcc gccgtccaga tcgcacggcg caccatgcgc accaaccgcg gcggtgacta 4073761 ccagttgttc gacgccggcc aattgaccgc agcccagcat cgctccacgg ctgaaagcta 4073821 tgccgccgag ggtaattggg ctgcggcgat ccgccaccgg ctacaagccg tggctcgcga 4073881 gttggaggag accggcatgc tcaacccggc tgccgggcgc accgccaacg agctggccag 4073941 cgatgcgggc gaggttttac cgcatctggc aggggaattg acgcaggcgg caaccgcttt 4074001 caacgacgtc acctacggcg agcggcccgg aacccaaggc gcctaccaaa tgatcgccga 4074061 cctcgatgac catctgcggt cccgttcacc ggccgtcgta tctgcagtgc agcacccggc 4074121 cgtgttcgac tcgtgggcgc aggtccggtg attcccacac gtctcgcaac cgtgcgccgc 4074181 cgacggccgt ggcgcggggt gttgctcacg ctggccgcag tcgccgtcgt ggcctcgatc 4074241 ggcacctatt tgacggcgcc acggcctgga ggcgccatgg cccccgcgtc caccagctcg 4074301 acggggggcc acgcgctggc gacgctgctt ggcaaccacg gcgtcgaggt tgtcgtggcc 4074361 gactccatcg ccgatgtcga agccgcggca cgccccgact cgctgctgtt ggtggcgcag 4074421 acgcagtatc tagtcgacaa cgcactgctg gatcggctgg cgaaagcccc cggtgacctg 4074481 ttgctggtgg cacccacctc acgaactcgt acggcgctga cgccgcaact gcgcatcgcg 4074541 gccgccagcc cattcaacag tcagccgaat tgtacgctgc gggaagctaa tcgggcagga 4074601 tcggtgcagt gggggcccag tgacacctac caggccaccg gcgacctggt gttgaccagc 4074661 tgttacggcg gggcattggt ccgctttcgt gctgagggcc gaaccatcac ggtggttggc 4074721 agcagcaact tcatgaccaa cggcggcctg ctgccggccg gcaatgccgc actggccatg 4074781 aacctcgcgg gcaaccggcc tcgtctcgtc tggtacgcgc ccgaccacat tgagggggaa 4074841 atgtcttctc cgtcatctct ttccgacctg attccggaga acgtgcactg gaccatctgg 4074901 caattgtggc tggtggtgct cttggtggca ctctggaaag gccggcggat cggtccactg 4074961 gtggccgagg agttacccgt tgtgatccgc gcgtcggaga ctgtcgaggg tcgcggtcgg 4075021 ttgtaccgat cccgtcgggc gcgtgatcgc gccgcggacg cactacgcac cgcgacgctg 4075081 caacgcctgc ggccccgact tggggtgggc gcaggcgcgc cggcgccagc agtggtgaca 4075141 accatagcgc agcgcagcaa agctgacccg ccgtttgttg cctaccattt attcggcccg 4075201 gcaccggcca ccgacaatga cctgttacaa cttgcccgtg cgctcgacga catcgaaagg 4075261 caggtcaccc actcgtgaca cagtccgcgt ccaacccgca agctcctccc acccaaaccc 4075321 ctggcgctga attgcccggc tatcccccgc aagcgggtgg tgcccctaca gcggcccctt 4075381 ccgggccgca tcctcaccgg gctgaagcag aatcggcacg tgatgcattg ctggcattac 4075441 gcgccgaggt cgccaaggcc gtcgtcggac aggacggggt gatcagcggc ctggtgatcg 4075501 ctctgttgtg ccgtgggcac gtgctcctgg aaggtgttcc aggagtggcg aagacgctga 4075561 ttgtccgcgc tatgtccgcc gctttgcaac tggagttcaa gcgggtgcag ttcacccctg 4075621 acctgatgcc aggcgacgtc accggttcac tggtctacga tgcccgcacc gccgagttcg 4075681 tgttccggcc gggcccggtg ttcaccaatt tgctgctggc cgatgagatc aaccgcaccc 4075741 cacccaagac gcaggccgcg ctgctcgagg cgatggaaga gcgtcaagtc agtgtggagg 4075801 gtgagcctaa gccgctgccc aacccgttca tcgtcgccgc gacgcagaac ccgatcgaat 4075861 acgagggcac ctatcagttg cccgaagccc aactggatcg tttcctgctg aaactgaatg 4075921 tgacactgcc ggcacgcgat tccgagatcg ccatccttga ccggcacgcg cacgggttcg 4075981 acccgcgcga tctatccgcg atcaatccgg tggccgggcc ggccgagctg gcggctggcc 4076041 gcgaggcggt gcgccacgtg ctggtcgcta atgaggtgct gggctacatc gtcgacatcg 4076101 tcggggccac ccgctcctcg cccgcactac agctcggtgt gtcgccgcgt ggggcaaccg 4076161 ccctgctggg caccgcccgg tcctgggcgt ggctgtccgg gcgcgattac gtcacccccg 4076221 acgacgtgaa ggcgatggcc cgaccgacgc tacgccaccg ggtgatgcta cgcccggaag 4076281 ccgagctgga aggcgccaca cccgacggcg ttctcgacgg aattctggcc tcggttccgg 4076341 tgccccgcta gtgatccgtg tgatcggcgc cggcgacgat gcagtggggg caccacccgc 4076401 ttgcggggga cgaagcgatg gggtgggggt acgcccccac aagtgggagg tacccccacc 4076461 cgcttgcggg ggagagcggc gcagatgatc ctaaccggac gcaccggctt gctggccctg 4076521 atctgcgtcc tgccgatagc gctgtcccct tggccggcaa gggctttcgt gatgttgctg 4076581 gtggcgcttg cggtagcggt gaccgtggac accctgctag cggccagcac ccgtaagttg 4076641 cgctttaccc gctcgccgta tacctccgcc cggctcgggc agcccgtgga cgcgagcctg 4076701 ctgctctgca atgggggccg ccgccggttc cgcggccagg ttcgtgacgc ctggccgccc 4076761 agtgcccgtg cgcagccgca cacccacgat gtcgacgtgg ctgccgggca gcgccagcag 4076821 gtgcacaccg cactgcggcc agttcggcgt ggggaccagc gcgcagcaat ggtcacggcc 4076881 cgttcgatcg gaccactggg gttggcggga cggcagagtt cacagtcggt gcccggcttg 4076941 gtccgggtgc tgccgccgtt cctgtctcgc aagcacctgc cgtcgaggct ggccaagctg 4077001 cgggagatcg acgggctgtt acccacgttg atacgcggcc aaggcaccga attcgattcg 4077061 ctgcgcgagt atgtcgtcgg cgacgacgtc cgctcgatcg attggcgcgc gagcgcacgc 4077121 cgcgccgatg tcatggtccg cacctggcgg cccgaacggg accgccgagt cgtcatcgtg 4077181 ctcgacactg gacgcatggc ggcggggcgg gtcggtgtcg acccgaccgc cgccgatccc 4077241 gccgggtggc cgcggctgga ctggtccatg gatgccgcac tgctgttggc ggcactggcg 4077301 tcacgagccg gcgaccatgt cgacttcctg gcccacgacc ggatcagccg cgccggcgtg 4077361 tttggcgcct cgcgtagcga actgcttgcc caactggtcg atgccatggc cccgctgcga 4077421 ccggcgctta tcgaatccga ctggcatgca atgattgcca ccatcttgcg gcgcacccgg 4077481 aggcgatcgc tggtggtgct gctgaccgac ctcaacgcga ccgctctcga cgagggcctg 4077541 ttgccggtgc tgccgcagtt gtcggcccga caccatgtgc tggtcgccgc ggttgccgac 4077601 ccgcgcgtcg atcaactggc cgccgggcgg tccgacgcgg cagcggtgta cgacgctgcg 4077661 gctgcggagc gcgcccgcaa cgaccggcgt gcgatcgcgt cacaactgcg ccgaggcggg 4077721 gtagatgtca tcgacgctcc tcccgccgaa atcgcacccg gacttgcgga tcgctacctg 4077781 gcgatgaaag cgaccggccg cctctaattt ccgacctcca ttgtgaaatg tgcgacgcca 4077841 gcgcggcgtg tcgtgtcgcg agtttcactc tcgggggagt tcagccggtc gggaccacgt 4077901 cgggcgcgtc ctccatgtcg ccggtctccc cggcttgcgc ggcacgacga ccgaagtagc 4077961 cgatgtagga cagaaacacc gcctcggcga tgatcccgac ggcgatccga acaaacgtcg 4078021 gcaacggcga cggtgtcacc accgcctcga tcagacctgc gaccagaaac acacccacca 4078081 agcccaccgc gaccgacacg acaccacgtc cttgctcggc gaggacctgt ccgcgcgggc 4078141 ggttgcctgc agatatcacc gaccacccca gccgcatccc aatcgccgcg gcgagaaaga 4078201 cggccgtcag ctccagcagc ccgtgcggaa gaagcaggcc cagcaggaaa tcgcccttcc 4078261 ccgcctggaa catcagcccg gcgatcagtc cgacgttggc ggcgttatcg aaaagcacca 4078321 gcggtatcgg cagccccagc acaacagaca tcgcgatgca cgtggtagcc acccaggagt 4078381 tgttcaccca gacctgcaga gcgaacgacg cggccgggtg ctcgctgtaa taggactgga 4078441 cgtcatggct gaccaattcg tctatctcag tgggcgtccc gatcgcggac tgcacctcgt 4078501 gactgccggc cacccagaac ccgatcagca ccacgacggc gaaaaacgcc accgcagtcg 4078561 ccagccacca ccgccaggta cggtaggcca cgaccgggaa cgacactgtc cagaaccgaa 4078621 tgaacgtacg ggtcagcggt gcgtgcgcgc ctgtgaccgc ggaccgagcc cgcgcgacta 4078681 gactcgacag ccgaccggtc atcaactggt ccgacgaagc cgatctgagc atcgacagat 4078741 gcgtggacac acgctgatat agctcgacga gttcgtcgat ttcggctccg ctcagtgaat 4078801 ggcgcttctt gatcaagtgg tcgagccggt cccacgtgcc gcggttggtc agcaagaacg 4078861 cgtcgacgtc caccctgcgc agcctaccta agccgccgag cgtgagcggt ggccaatgcc 4078921 gagtgcagca gagcaccgca ccaaagcctg tagcgtttgt tggtatgtcg gaggtggtga 4078981 ccggcgacgc cgtggtgctc gacgtacaga tcgcccagtt gccggtgcgc gcggtcagcg 4079041 cggtcatcga tatcaccata atattcatcg gctacatcct cggtctgatg ctgtgggcga 4079101 ccgccctgac ccagttcgac gaagccttga ccaccgcatt cctgatcatc ttcacggtgc 4079161 tggcgctggt cggctatccc ctggtctggg aaaccgcaac gcggggccga tcagtgggga 4079221 agatcgtgat gggtctgcgg gtggtgtcag acgacggtgg cccggagcgg ttccggcagg 4079281 cgctgtttcg cgcgttagcg tcggtggtgg agatctggat gctgctcggg agccccgccg 4079341 tgatctgcag catgttgtcg ccaaaagcca agcgagtcgg cgacgtcttc gcgggcacgg 4079401 tcgttgtcag cgaacgtggt ccgcggttgg ggccgccgcc ggtgatgcca ccgtcgctgg 4079461 cctggtgggc gtcgtcgctg caattgtctg ggcttaccgc cggccaagcc gaggttgcac 4079521 gtcaatttct ggtgcgggca ccgcaactcg atcctgcgct acgcgagcag atggcctacc 4079581 ggatcgccgg tgatgtggtt gcccgcatcg ctccgccgcc gccacccgga gttccaccac 4079641 agttggtcct ggccgccgtc ctcgccgaac gacaccggcg tgaactgttg cgactgcgtc 4079701 ccacgctgcc tcccgcagga caggcgccat gggcccaaat ggcgcctcat cggggttggc 4079761 cgcccggttt gtccggcgcc acgccgtggt ctcctcagca gccggtgatc ccctggccgg 4079821 agccagatcc gccaccgcaa gccgctccct ggccgcagca ggcgccggac ggcccgggat 4079881 tctcgccgcc gggctagcag ctagtcttcg ctgcgccgga tcccccgagc gtgcggacat 4079941 gttcaggcgc acagcgaaag ctaggacacg tcaacccaat ccagggtccg ctgcaccgcc 4080001 ttgcgccagc cggcataacc cgcggcacgc tcgtcgtcgt cccacgtcgg tgtccaccgc 4080061 ttgtcctctc gccagttggc ccgcagatcg gacggagccg cccagaaccc gaccgccaag 4080121 cccgccgcgt aggccgcacc tagtgcggtg gtctcggcga ccaccggccg caccacatcc 4080181 acacccaaca cgtcggcctg gatctgcata cacaggtcgt tgccggtgat cccgccatcc 4080241 accttcaaca cctgcaggcg aacaccggag tctgcttcca tggcgtccac cacatcgcgg 4080301 ctctggtagc agatcgcctc cagcgttgcg cgcgccaggt gcgcgttggt gttgaaccgc 4080361 gacaacccga cgatcgcgcc gcgcgcatcg gaccgccagt atggcgcgaa cagcccggaa 4080421 aacgccggca cgaaatacat gccgccgttg tcggggacct ggcgggccag cgcctcactc 4080481 tgtgcggcgc cgctgatgat gcccagctga tcgcgtagcc actgcaccgc cgagccggtc 4080541 accgcgatcg aaccttcaag cgcgtacacg ggtttagcgt tcccgaattg gtagcacacc 4080601 gtggttagca ggccgttatt cgatcgcacg atcgtttcac cggtgttcag cagcagaaaa 4080661 ttgccggtcc cataggtgtt tttcgcctcc cctggggcca gacagacttg accgaccatg 4080721 gccgcatgct gatcagcgag aactccggtg atcggcacct caccgccgac aggcccggtc 4080781 gccagcgtga caccgtaagg ctccgacggc gccgacgatg cgatctcggg cagcatggcc 4080841 cgaggtatcg aaaacaacga caacagctcg tcgtcccagt ccagcgtctc tagatccatc 4080901 aacatggtcc ggctggcgtt ggttacatcg gtgacatgca cacccccccc gcggcccgcc 4080961 ggtcagattc cacaacaccc aggtgtccgg tgtgccgaac aatgcgtcgc cgttctcggc 4081021 ggccgcgcgg actccatcga cattttccag gatccactgc agcttgccgc cagagaaata 4081081 agttgccggc ggcaggcccg ccttgcggcg gatcaggttt ccacgaccgt ctcgatccag 4081141 cgccgacgcg atgcggtcgg tgcgggtatc ctgccataca atcgcgttgt agtagggccg 4081201 tccggtgtgc cgattccata ccagcgtcgt ctcacgttgg ttggtaatcc ccaacgcggc 4081261 aatatctttc ggcgataggt tggtggcgtt gagcaccgag atcaacaccg acgcggtgcg 4081321 ctcccagatc tcgaccgggt tgtgctccac ccagccggcc cggggcagga tctgctcgtg 4081381 ctcgagctgg tggcgggcca cctcggcacc gtggtgatcg aagatcatgc agcgggtgct 4081441 ggtggtgccc tggtcgatgg cggctatgaa atccgaggac tcggccaatt gctctcctag 4081501 gatggcgtcg gacactgcat gtaatcgtcc atgatggtcc accgcagcgg cgggtccgac 4081561 gccgtcagcc ggagaagggg tcgcgaattc taatgccctc gaacttgcgg aagtcgcggt 4081621 cgtgactcca gatcgtggcg atgccgtgat ggcgcatgag cgcgacgagg tgggcgtcgg 4081681 gaaccagatt gcctcgcggc ttgaccgggt cggctactcg ccgatagacg ggccagaatc 4081741 cgttggcctc gccgacctgc cgcacgtgcg gtcgtgaggt gaattgctcg atgttttcga 4081801 cggcgacctc aggcgccagc ggcgcaccca acaacgtcgg atgggtgaca acccgtagat 4081861 aacccagcgc gacgggccac aatagatata ccagccctgg cctagccagg aatcgctcaa 4081921 cgagcgtctt cgccttatcg tgaaacgggc tggctcggtg cgtcgcatgg accagaacat 4081981 cgacgtcaaa ggtttcgctc acccacggtc caaaatcgcc caaacagcgt ccttgtcgtc 4082041 aagatccaca cggggccgca agtcggcagt cgaccagcgg atgtcaacgt ttggaggagg 4082101 ctcggccgcc agagcttgcg caagcaattc ggaggcgagc tgccctaacg ttttgcgctc 4082161 ctcgcgctgg cgtcgtttca acgcccgcag tatgtcgtca tcgaggtcga tcgtagtgcg 4082221 catacatcag atgctaactc gatatgcatc tgatgcgcac gatctcaccc ttcttgcgct 4082281 gccggcacga aacctgttgc atcagcaatg tgggcgaaga ggtaacgcgc accacatata 4082341 gccgcgaaca tcagcgcgag taccggcgca aggtgcggct gtgcttggac gtcttcgaga 4082401 ccatgcttgc gcagaccagg ttcgaggccg accggccact caccggcatc gagatcgaat 4082461 gcaacctcgt cgacgccgac taccagccgg ccatgtcgaa ccgctatgtg ctggatgcca 4082521 tcgccgaccc ggcgtaccag accgaattag gcgcttacaa catcgaattc aatgttccgc 4082581 ctcgcccgct accgggacgc acttgcctag agctggagga cgaagtccgc gccagcctca 4082641 acgatgccga gaccaaggcc agctgcagcg gagctcacat cgtgatgatc ggcatcttgc 4082701 ccacactgat gccagagcat ctgaccgacg gctggatgag cgcatcagcg cgttatgcgg 4082761 ctctcaacga gtcgattttc aaggcccgcg gcgaggatat ccccatcaac atcgccggcc 4082821 cggaaccgct gagctgccat gccggatcca tcgcacccga atccgcttgc accagtgtgc 4082881 aattacattt gcagctagca ccggcggatt ttccggctaa ctggaatgcg gctcaggtac 4082941 tggccggacg gcagttagca ctaggtgcca actcgcccta tttcttcggc caccagctgt 4083001 ggtcggaaac ccgcatcgag ctgttcacac agtccactga tgcccgtccc gaggagctga 4083061 aatcgcgagg ggtgcgcccc cgggtatggt ttggcgaacg ctggatcacc tccgtcctcg 4083121 acttgtttca ggaaaacatc cgctacttcc ccaccctgct acccgaggtg tccgacgagg 4083181 accccctcgc agagctttcg gctggacgca tcccacacct gtccgaattg cggctgcata 4083241 acggcacggt gtaccggtgg aaccggccgg tgtacgacgt ggtcgacggg cgcccgcatc 4083301 tgcggctgga gaaccgggtg ctacccgccg ggccgacggt cgttgacatg ctggcgaatc 4083361 atgccttcta ctacggcgca ctacgcggtc tgtccgaggc cgacccccca ttgtggacgc 4083421 agatgaattt cgctgcggca caagcgaatt tcctggcagc cgccaggtac ggcatggacg 4083481 cccagttgga ttggccgggc ttgggcgagg tgacgacgcg ggagttggtg ttgggcacgt 4083541 tgttgccaat ggcacacgag ggactgcggc ggtggggtgt cgacgcggag gtacgcgacc 4083601 ggttcctggg tgtcatcggc ggtcgcgccc agaccggccg caacggcgcg cgctggcagg 4083661 tcgccaccgt ggcggcccta caagacggcg ggctgacccg gcccgcggca ctggctgaga 4083721 tgctgcgccg gtactgcgag cacatgcaca gcaacgaacc cgtgcatacc tgggacacgt 4083781 agtccacgag taggttggga gccatgaccg acgaggtaat ggactgggac agcgcctacc 4083841 gtgagcaagg cgccttcgag gggccgccgc cgtggaacat cggtgaaccc cagcctgagc 4083901 tggcaacgct gatcgcggcc ggcaaggtcc gcagtgacgt gctagacgcc ggatgcggat 4083961 acgccgaact gtcattggcc cttgccgccg acggctacac cgtggtcggc atcgacctca 4084021 cgcccaccgc cgtcgcggct gccaccaagg ccgctgagga gcgcggtttg accacggcca 4084081 gcttcgtgca ggccgacatc acggagttcg cggcttatcc agccggctcc gccggccgct 4084141 tttccacggt gatcgacagc accctgtttc attcgctgcc ggtggacagc cgcgaccgct 4084201 atctgagctc ggtgcaccgc gcggcggccc cgggcgccag ctattacgtg ctggtcttcg 4084261 ccaagggcgc cttccccgcc gagctggaag tcaagccaaa cgaagtcgac gaggacgagt 4084321 tgcgtgccgc ggtgagcaaa tactggaaga tcgacgaaat ccggcccgcc ttcattcatg 4084381 tcaatccggt cacgattccg ccccagctgg ccggagcgcc agtcgaattc ccgccatacg 4084441 atcacgacga gaagggtcgg gtgaagttcc ccgcctatct actcaccgcc cacaaggccg 4084501 gctgaggcta acgttcgccg ctggtcgccg cggtcgccgc gaccaacgcc tcggcgaagg 4084561 cgtccaggtc atcggcggtg ttgtccacgt gcggcgagat ccgcagcacc ggcgccggca 4084621 gttccagcgg tgcccgctcc actccggcgt aggtggtcac gatccgccgc tgcgagagca 4084681 accaggcccg caccgctgcc gggtcggcgc cgtcgatcgg cgccagggtg gtgatcgcgc 4084741 taggctcgtc gaccgcttcg accacccgcc aaccggacac atcggcgagt acggtcctgg 4084801 cgatgtcgcc cagctcagcc aagcgtgccc gaatagcctg cggcccgcac gccagatgct 4084861 caccgagtgc gaccgaaaac cccactcgcg cagctacatt ggcttcgcca aatccgagtt 4084921 gttgggccac tgtcagcggc ggcatccagt ctggcgcggg cagcctcgca cgtaaccgct 4084981 ccatcagctc aggacgaacc gccagcaccc caactccgcg gggcccggcg atccacttgc 4085041 gcgacgaggc atacgtgacg tcggcaccca ccgcacaatc cacgtggccc aggccctgcg 4085101 cggcatccac gaccagcggc agtttcagct cggtgcacag ttgcgccacc atcgccagcg 4085161 gctgtgcgac gccacggtgg ctggccacca cggtcaggtg cactaggtcg ggcgggtcgt 4085221 cggccaacat gaaggccgcg tcgtcgagcg ctaccctgcc gtcctgcaga gttggtaacg 4085281 gacgcacgtc gaagccatgg gcggccatca cagccaggtt cggcccgtat tcgccgggca 4085341 agcaagccag cgtccggttc tccccaggcc agctgcccag cagcagatcc aacgcgtgca 4085401 gcgagccggt ggtgaacacc acctcggcgt cgggcaggcc gctcagtgcg gcgaccgccg 4085461 cacgtccggc gtcgagcacg gcggcggcgg cctcagccgc gacataaccg ccaacctcgg 4085521 cctcgtgccg cgcgtgctgg gctgcggcgt cgagtgcggc gaaactctgg cgcgaacagg 4085581 ccgcgctgtc caggtgtagc cccgcgacgg gcgggcgcgc tgcccgccat cggtcggcca 4085641 gcgaatcgcc ggcggggctg tttgcgccgc ttctcctcat cgcttcgtcc tgcatcgtcg 4085701 ccggcgcggc tcacttggcg gccagcgaca ggccaaagtc accggcttca tcggtccacc 4085761 atcggatgcg atgcagtccg gccgcggcca actcggcacc gaccgcttgc ggccggaact 4085821 tgcacgagac ctcggtcaac atctcctccc cggcgtcgaa gtcgacggtc aggtccagtg 4085881 caccgacccg tacccgctgg cgaccgtcgg cacgcaacca catctcaatc cgctcttctg 4085941 cgctgttcca acgggcgacg tgctggaagg catcgacgtc gaaatccgct tcgagttccc 4086001 ggttgatcac ggcaagcacg ttgcgattga actgagccgt caccccgcca ggatcgtcgt 4086061 aggcgcgcac cagccgggcc gcgtccttga ccaggtcggt gcccagcagc aggctatcgc 4086121 ccggccgcat taccccggcc agggccgtca ggaactgcgc gcgcggcccg ggcgtgaggt 4086181 tgccgatcgt ggaccccaag aacacaaaca ggcgccgtcc tcccctggga atctcggtta 4086241 aatgctcctc gaaatcccca caaacagcgt tgatttcgac accactgtat tcacgctgaa 4086301 ttgcggtcgc agttgccgac agcacgctgg cgtcgacgtc gaacgggacg aatctgcgca 4086361 gcgatccccg gtggcgcaac gcatccagca gcatccgggt cttctccgag gtgccgctac 4086421 ccaactcgac caaagtatcg gcccggcagg cggaagccac ttcggccgat ctggcccgta 4086481 ggatttcggc ctcggctcgg gtcgggtagt actccggcaa ccgggtgatc tgatcgaaca 4086541 gttcactacc caccgtgtcg taaaaccact tgggcggtaa cgatttcggt gtcttctgca 4086601 ggccagagta cacatcgcgg cgcaacgcca gatgccccgc atcctcgccc agatggttgg 4086661 caaccgacac tctcatcgag gtcctttcgc gcggtccaat gcggtcagcg tgacaccctt 4086721 ttgggttacc tccaccaggt ggcggtccgg cacgtcgccc caaccggagt cgtcgtcgta 4086781 tggttcgctg gccagcacca ccccgtcggc gcgccgcagg atggacagcg tgtctcccca 4086841 ggtggtcgcg atgagccggg aaccgttggc cgccaagatg tttagtcggg catttgggtc 4086901 ggccgcgccg accttgacaa tggtgtctcc cagagcgtcc agaccgtgag cgaagatggt 4086961 ggccgcgagt atcgcgctgt cacagaccga ttcggccgcc gggcccgccg gcaacacggc 4087021 acgatcaacc acaccgttgt gcgctagcaa ccagtgccca tcggtgaacg gcggggtcgc 4087081 gctgacttcg atcggcatac cgacagtcgc cgagcgcacc gcggcgagga tgcagtgact 4087141 acgcagcgcc ggcgccaccg agtgaaacga cgtgtccccc cacagcggag ccgggctgcg 4087201 ccaacgccgg ggaatggcac cgtcgaagaa gccgacaccc caaccgtcgg cgttcatcag 4087261 cccgtgcttt tgccgacgcg gcgcatatga ctgcacccgc agaccctgcg gcgggtccag 4087321 caccaacgaa gaaaccgcga cctgtgcccc gagccacccc aggtgacgac acatcagatg 4087381 tcccacgcca accggacacc ggcaaagatc tggcggcgat acgggtgatc ccagttgcgg 4087441 aagctgggcc gcaggatggc cggctccacc gcccacgagc cgccgcgtag cacgcggtag 4087501 tcgccgccga agaacggctg tgagtaccgc tcatagacca tcgggacgaa ccccggccag 4087561 ggccgcaacg gcgaggtggt ccactcccag acatcgccca gcatctgctc ggccccgcac 4087621 gccgatgccc cggccgggta ggcacccacc ggcgcggggc gcagcgtttg accgcccagg 4087681 ttggcatagg tgtctgtggg ctcctcggtt ccccacgggt agcggcggcg ggaaccagtc 4087741 gccggatccc acgcgcaagc cttctcccac tccacctcgg tgggcaaccg cgcgcctgcc 4087801 caggcggcgt acgcctcggc ctcaaagtag ctgacatgct gcaccggctc atcggcggga 4087861 atgtcctcga cgtgcccgaa ccgggtccgc gtccgcccgc ccgacctcca gaattgcgga 4087921 gcggtcagcc ccgcgcgctg gcggtgctgc cagccacgtt ccgaccacca ccgcgactgg 4087981 gtgtaaccgc cgtcgtcgat gaagtcttgc cattcaccgt tggtgaccgg aacccggccg 4088041 atccggaatg cgggcacgtc gacgacgtga gccggacgtt cgttgtccaa tgagcacggt 4088101 tcgtccgcgg cgtccacgcc cagcacgaac gggccgccgg ctaccagcac cgacgttccg 4088161 gccatcctcg gccgtccggc gggcagggcg gaagtcgcgg ccaacagtgg cgagccggtc 4088221 cgtaggttca aggcctgcag catggtttcg tcgtgctggt tttcgtggct gatcaccatc 4088281 gcgaacacga agctgtcgcc gtcttcaggt agagcggcaa gggcatccag cgcagcggag 4088341 cgcaccgttg cgcagtagga ccgcgcccgc gccggggaca gcaacggcag ttccacgcga 4088401 ctggcgcggg aatgctcgaa ggcgtcgtag agaccctcga ccgccggcgg caaaagcccg 4088461 ggctggcctg ggtcgccgcc gcgtagcagc cacaactcct cctgctgacc gatgtgtgcc 4088521 aggtcccaca ccagcgggct catcaacggg tcatactggc agcaaagctc ggcatcgtcg 4088581 aagtcgacca gccgcaacgt ccgcgcccgc gcccgcgcca gatgacaagc cagctgctcg 4088641 ggtgaagtca cgacgccccg tgcatcatcc cggtgacggc tgacgcgatg ccgcccgcga 4088701 tcacccggtc ggagaaatcg tctgccgggc aaacacccct gtcgacgtgg tccaccaacc 4088761 gctgcatcgc gccgatgagt tcagtcggta cccgccgcgc ggcgatggcc aggcatctgt 4088821 tggctgccag gtagagccgc cggtcggcca ggccgatccg ggccgcggtg tcccaggccg 4088881 tggccaccgg ttcgaccgcg tcgaccgcca aatctgccgc caccgggtcg tcgagcagcg 4088941 tcaccaaggt gaacaccacc gcgggccaca cctcgtcggg cacgctgtcg aggtagcgaa 4089001 tttccagcca ttgccgagga cgcaccggcg ggaacaacgt tgtcaggtgg taaaccaggt 4089061 cggcgacggt agcgcggcga ccgtccagca gcacccgacc gtcaacccag tcggtgaagg 4089121 gcacgtagtc cgtcaccgca cgggtgtctt gagtgtccgg gcttcgcacc atcatcaccg 4089181 gcgccttcaa ggcatactta gcccagtcga tgccggggtg gtcgccactg gcaccaagaa 4089241 tggggccgca gcgcgcggag tccatctggc cccacacccg ctgccgggtg gactgccagc 4089301 cggaaaaccg gccgcccagc atcggggagt tggcggcaat cgcgatcatc gtcggcccca 4089361 aggcgtgcgc caggcggact cgctcagccc atccttcctg cggtccggca tccagattga 4089421 cctggatcgc ggctgtcgag gtcatcatcg ccgcacccgg cactccgcta tggctggcgg 4089481 cgaaaaactg ctccatggcc cgatagcgtg cgcccggatt gacccgcacc ggcgaccgca 4089541 gcgggtctgc acccaggaag accaaaccca gcccggcatt ggcaagcgcc gaccgtagca 4089601 ccgcctgatc gcgcgtcatg gcaccgatgg ctgccagcac gccgtcggcg ggcggtccgg 4089661 acagttcgac ggcaccaccg ggctccacgc tgaccacgct gccgcccggc agcggactga 4089721 gccattcgag aacctcggtg atctcttccc agctgggccg gcgaaacgga tcggccgggt 4089781 cgaagcagtg cgcctccatc tccagaccga cgcgtcccaa cggaccatcg acgaggcagc 4089841 cgtccgcgat gtattccgcg gcggccgatg aatcggtgat ctcgacgtcg tccggggcag 4089901 cgttatccag ctgcgaggcc gcggcggtca tggcggcaag cgtcatatca cgatccctcc 4089961 gggcccggcg catgcctaaa acatgccctg cggaccgttg gttgcgcagc taccagaacg 4090021 atagccgcca ccggtttatc ctgccgaccg ccccgccgcg cgaatgaact ggccaactca 4090081 gcccagtgtg ttctgcattg ccccggccag cacgttgacc gcgggaccgg cgttgccgga 4090141 ctgacagacc ttggcctgca gcaatacgtt ttcgcgcagc ctggtttgaa caaagcatcg 4090201 tcgatcggtg ccggcctctt gtttggtcca ggcctcgtcg gtgccggtcg acggcccacc 4090261 ggcgaacgac cacacctgcg tcgtcccgtc gtccaggtgg atcgcggtgg tctgccccga 4090321 gcagcccacg gttcggtcga caacgcggtg aaacgcccgg tctgcggcat cgttgctggc 4090381 gaatacccca accgcctgct tgaccaggtg ggtttggtcg gtggcggacg tctgcgtggt 4090441 agcgccgttg aacgacgcca ggtcgggatc gtcgtacacc tcgggcagcc cgatgtccac 4090501 ccagttgttg cacgccggta gttcgaccca aaacgcctgg aacggcctgg tgaacaccgc 4090561 ctcccacccc attggggcgc cgacgatgtt gccgaccgac ccctttccga gcaccgcgta 4090621 ggacacaacc ccgggctccg acgggtgtgc gtcggcaaca ggtaccgcga accctgctat 4090681 gacggctaga ccgatgctca ccaccgcggc ggcgattcgc atggagctag accttggccg 4090741 agatgtcgac gtcgatgttg cccatgggtc acgatcatgc caacatggcc gaccaaacag 4090801 aaggcccttt tcttgaacga gtcaagaaaa ggggctggtg cgcccggccg ctaggggcgc 4090861 gccggcgcgg tggtgggacc cggaccagtc ggaccaacag ccgggccacc cgggcctcca 4090921 gggaaaccgg ggcccatgcc tgggggaggg ggtccgccgg ggaacccgaa cacccatccc 4090981 atccctggac cgggggcaac gggaccgacg gggcggaaca tgccgtggtg gtaatagcgg 4091041 tggtaagggc atttcccctg cccgagaacc aacgcgccag agaagaagat gactgcgacg 4091101 gtgaaaacaa ttccggccac gatcaccacc cacgctgcgg cccggtagag cctgggcggc 4091161 ttttcctgct gcggcgatgg cggcggtgaa gttgtggcgg cggacggcgg tggcgctgcg 4091221 ggttggggtg tctcggtcat cttttgaata tcgctcggcc ggcaaccacc gtaaagggtt 4091281 gcggttacaa acgtgccatg aactggtgca gcccggtgcc ttgggtggac accgggctgc 4091341 tggttgtcgg ttacggagcg ggtgtcgcgg gaggactgac ggacgacggc acctggccgg 4091401 gcccgcccgg gcctgggccg ggacgcactg ccgcgggccc accgtgcgga ctgcccggtc 4091461 gcagcatcat cgccgggtgc tggtggtgtt gccggtggtg gaagccgccg tgaccggcat 4091521 gcttgccgag gatatagccg gtgaagaaga tgaccgccac gatgaacacg gttccggcgg 4091581 caatggctac ccacgccgcg gctttgaaca ccttgggggt ctggtgaggc ggtggtgttg 4091641 gggtttcaga tgtttcactc atgtgtcgca tgatgccttg gcaaacagta acgcgactat 4091701 gcgtccctta tgtagcagct gtgagcgcgc gggctgggta tcggcccggg acaccaccat 4091761 ggctgcgtct cggtgtcaga gcaccagagc tacgggtctg accagggctt gaacgggttg 4091821 accgcgaact gaatcacccg gtacggccca ttctgccggg cccgagtgtc ccactggctg 4091881 acgaagatcc gcagttcgtc gatggtggac ccgggcgaga tgtagccgcc atacggttgt 4091941 gcgagtcgat tgtcgtaggg cggtggcagg ctttccgccg gctccggcca ctcgtcgtgg 4092001 cgcaccaccg tggtcaccgg ggcggcgccc agcgacgtcg ggtggtgtgc cacccgaacc 4092061 tccatgttgc cggtgctggc gttgaaatac gacagcaccg tctggccgtc gatctgacgg 4092121 atgctcatct cgccgagctg gtcgggccag agcggagtcg gcggcttgtt ccaaccgccg 4092181 tcggggccgc ccgcccagcc ctgccagcgg gaccggtcgg tgaacgattc cggggtggcc 4092241 cgatacagca ccgccggctc cccacgggtg aagctgtcgg ccacgatgta gacccaccca 4092301 gttggcgaat cgggcgtggg aaccgggtcg tagtatccgc tgatctgtgt ctgccggccg 4092361 tcctggtagg cggcgttgcg cctggacccc gacacggtct gccagccgcc gcgcgccgcc 4092421 tcggcccgca ccaggcggga attctgcggc tgcaggtcct tggtggtggt caccatcagg 4092481 tagttgcggc ggttgatctg caccacaccg gcgggcagct gtgagtctcc aggcggcgtg 4092541 ggatcggcca gcagcggcgt gccgacgccg gtgacaccgg tgtagcgcac cccggccgga 4092601 tcgtcgatcg actcggtgtc gacgtgcagc gcgaccggcg cataccagcc accgaacccg 4092661 acaccctgac cggcgaagct gtccccgcac acctgcagca gttgactggg gaattccacg 4092721 aactcgcaca ggtcggtggc accgatgccg tagtccccgg tgggggttcc ggtaccggcc 4092781 gtcggaccga ttcgcagcac ttgaccgggc gccagcggcg gcaggatggg ccggggcgcc 4092841 ggcgccggcg gatcggcgcg tgcataccaa acacattgcg ggacaaggaa agacactacc 4092901 agcgagcacc gcacgaccca ggcggagcac acccgcatat cacaagtcgg cggtcagcag 4092961 ctcggcgatc tggatggtgt tcagcgccgc ccccttgcgc aggttatccc ccgacacgaa 4093021 cagcgccaga ccacgcccgt cgggcacccc cgggtcgcgc cggatccggc cgaccagaga 4093081 ttcgtcgaca ccggcggcgg ccagcggcgt cggcacgtcg accagctgca cgcccgtagc 4093141 accgtcgagc agctcgcgcg cccgctccgg cgagagcggc tgcgcgaact cggcgttgat 4093201 cgacaaagag tgtccggtga acaccggaac ccgcacacag gtgccgctga ccaacaggtc 4093261 ggggatgcca aggatcttgc ggctctcgaa gcgcaacttt tgatcctcgt ctgtctcgcc 4093321 ggagccgtcg tccaccaggg atccggccag cggcaccacg ttgaacgcga tcggggcgac 4093381 gtaggtgttc ggcggcggga actcgagcgc gccgccgtca tacaccagct gctcggcccc 4093441 accgatgacc gcacgcgcct gctcggccag ctcggccacc ccggccaggc cgctaccgga 4093501 caccgcctga tacgacgaga ccaccaaccg caccagtcgg gcttcgtcgt gcagcacctt 4093561 gagcaccggc atcgcggcca tggtggtgca gttcgggttg gcgatgatgc ccttgggccg 4093621 gcggtgcgcg tcgcgttcaa agttcacctc ggacaccacc aacggcacgt cggggtcctt 4093681 acgccacgcc gacgagttgt cgatcaccgt gactccggcc gccgcaaagc ggggcgcctg 4093741 caccttcgac atggccgagc cggcggagaa caacgcgata tccagcccgc tcgggtcggc 4093801 cgtctcggcg tcttccactt cgatctcctg gccgcggaag gccagcttgc ggccctgcga 4093861 tcgggccgac gcgaagaacc gcaccgcgct cgccgggaaa tcccgctcgt cgagcaacgt 4093921 gcgcatgacc tgacccacct gaccggtggc ccccacgatc cctattgaca ggcccatcta 4093981 ccgtcccgtc cccgcgtaca ccgtggcctc ctcgtcgccg ccgagcccga acgcttcatg 4094041 cagcgcgacc acggccttgt ccagttcggt gtcgcggcac aacaccgaga tcctgatctc 4094101 cgaggtggag atcagctcga tgttgacccc caccgccgcc agcgcctcac agaacgtcgc 4094161 ggtgaccccg gggtggctgc gcatgccggc accgatcagc gataccttgc cgatgtggtc 4094221 gtcgtacagc agctgtgaga agccgatctc gtttctgagc gagtccagtt tttccacggc 4094281 ggcgggcccg acgtcgcggg agcaggtgaa ggtgatgtcg gtcttgccgt cctcgacctt 4094341 ggagacgttc tgcagcacca tgtcgatgtt gacgtcggcg tcggccaccg ccctaaacac 4094401 cttggccgca tacccgggga tgtcgggcag cccgacgatg gtcaccttgg cctcgctgcg 4094461 gtcgtgcgcg actccggtca ggatggggtc ttccatgggt acgtccttga tcgatccgac 4094521 aacgacggtg cccggtctgt ccgagtacga cgaccggacg tgcaccggaa tattatggcg 4094581 gcgagcgtat tccacgcagc gcagcatcag caccttggcg ccgcaggccg ccatctcgag 4094641 catttcctcg aaggtcacgg tgtcgagctt tcgggcgttg cgcacgatgc gcgggtcggc 4094701 gctgaagatg ccgtccacgt cggtgtagat ctcacagaca tcggcaccca gcgcggcggc 4094761 catggcgacg gcggtggtgt ccgagccgcc gcggcccaac gtcgtgacat ccttggtgtc 4094821 ctggctgacc ccttggaatc cggccaccaa aacgacccgc ccctcctcaa gggcggtttg 4094881 cagccgcccc ggcgtgacgt cgatgatctt ggcgttgccg tgggtgccgg tggtgatcac 4094941 cccggcctgc gaaccggtga acgaccgggc atgcgcgccg agcgactcga tggccatggc 4095001 caccaacgca ttcgagatgc gttcaccggc ggtaagcagc atgtccagct cccgaggcgg 4095061 cggcgccggg cacacctgct gagccagatc cagcaggtcg tcggtggtat cccccatggc 4095121 agagacgacg acgacgacgt cattgccttg cttcttggtg gcgacgatgc gttcggcgac 4095181 gcggcgaatc cgttcggcgt cggccaccga ggatccgccg tacttctgca cgacgagcgc 4095241 cactgtttcc ctttccgggg aagattggag acaggtccag aatagggggc gcgccggcct 4095301 gcgctgactc tgcgtccacc acgggaatgt gcgagtagcc cacacggtgg acgcagagtc 4095361 aacgtgtaaa gtgcttcatg tgcagcgggt gctcctcctc ggacgccgcg acggggtctg 4095421 atccagaccg gcttcccgtc gcgggacgtt cgcgatgcgc cggtctgagg ttccttctca 4095481 ccatcccgga gcaactaccg tgacaacttc tgaatcgccc gacgcctata ccgagtcgtt 4095541 tggggcccac accatcgtga aacccgccgg cccacctcgc gtcggtcagc cctcgtggaa 4095601 tccgcagcga gcctcgtcga tgccggtcaa ccgctaccgg ccgttcgccg aggaggtcga 4095661 gcccatccgg ctgagaaacc gcacgtggcc tgatcgcgtc atcgatcgtg cgccgctgtg 4095721 gtgcgcggtc gacttacgcg atggcaacca ggcgctgatc gacccgatga gcccggcccg 4095781 caagcgccgc atgttcgacc tgctggtccg gatgggctac aaggagattg aggtggggtt 4095841 cccctcggcc agccagaccg acttcgactt cgtcagagag atcatcgagc agggcgccat 4095901 tcccgacgac gtcaccatcc aggtgctcac ccaatgccgt cccgagctga tcgagcgcac 4095961 cttccaggcg tgttcgggcg cacaccgggc catcgtgcac ttctacaact cgacgtcaat 4096021 cctgcagcgc cgcgtggtct ttcgcgccaa ccgggctgag gtgcaggcca tcgcgacaga 4096081 tggggcgcgc aagtgcgtcg agcaggccgc caaatacccg ggcacgcagt ggcgattcga 4096141 gtactccccg gagtcctaca ccggcaccga actggaatac gccaaacagg tgtgcgacgc 4096201 cgtcggcgag gtcattgcgc cgacgccgga gcgcccgatc atcttcaacc tgcccgccac 4096261 ggtggagatg acgacgccca atgtctacgc cgactcgatc gagtggatga gccgcaacct 4096321 agccaaccgg gagtcggtca tcctgagcct gcacccgcac aatgaccgcg gaaccgccgt 4096381 cgccgcagcg gaattgggtt tcgcggccgg ggctgatcgg atcgagggct gcctgttcgg 4096441 caacggcgag cgcaccggca acgtgtgcct ggtcacgctg ggactcaacc tgttctcccg 4096501 aggtgtggac ccgcagatcg acttctccaa tattgacgag atccggcgca cggtggagta 4096561 ctgcaaccag ctgccggtgc acgaacgtca cccctatggc ggcgacctgg tttacaccgc 4096621 gttctccggt agccaccagg acgccatcaa caagggccta gacgcgatga agctggatgc 4096681 ggatgccgcc gactgtgacg tcgacgacat gctgtggcag gtgccgtatc tgcccatcga 4096741 cccgcgcgat gtcgggcgca cctacgaggc ggtgatccgg gtcaactcgc agtccggcaa 4096801 gggcggcgtg gcctacatca tgaagaccga ccacggcctt tccctgccgc ggcggctgca 4096861 gatcgagttt tcccaggtaa tccagaagat cgcagagggt acagcaggcg agggtggcga 4096921 ggtctcgccc aaggagatgt gggatgcgtt cgccgaggag tatctggccc cggtgcggcc 4096981 tttggagcgg ataaggcaac atgtggacgc tgccgacgac gacggcggca cgaccagcat 4097041 cacggcgacc gtcaagatca acggcgtgga gaccgagatc agcgggtccg gtaacggtcc 4097101 gttggccgcg ttcgtccatg cgctggccga tgtcgggttt gacgtggccg tgctggacta 4097161 ctacgagcac gcgatgagcg ccggcgacga cgctcaggcc gccgcgtatg tggaggcctc 4097221 cgtgacgatc gcgagcccgg cgcagccggg cgaagcgggt cggcacgcat cggaccccgt 4097281 gacgatcgcg agcccggcgc agccgggcga agcgggtcgg cacgcatcgg accccgtgac 4097341 gatcgcgagc ccggcgcagc cgggcgaagc gggtcggcac gcatcggacc ccgtgacgat 4097401 cgcgagcccg gcgcagccgg gcgaagcggg tcggcacgca tcggaccccg tgacgatcgc 4097461 gagcccggcg cagccgggcg aagcgggtcg gcacgcatcg gaccccgtga cgagtaagac 4097521 ggtgtggggt gtcggtatcg caccgtcaat caccaccgcg tcgctgcgcg ccgtggtgtc 4097581 ggcggtcaac cgggcggcac gctaggacgg cgctgaacta gggtcggggt ccgcggcatg 4097641 atttttcgca gtgacgttcc gctcgccgtt tcagaacaac gctaactgct tttcgacggg 4097701 agcgacgtcg gtgaagtcct ccacgctggc gcccccgacg acggcaccga tgcactccat 4097761 gaatcgcgct tcaggcatca ccggaacccc cagctgcagg gcgtgatagc ccttgccgtg 4097821 ttcgggggcg gtcgcgttgc agaccaccag tgaggtatcc cggtctacga cgtcgctgta 4097881 ggccagcccg gcgtgcagaa tccgttcgac gagttcctcg tgggtccgtt ttacctcggc 4097941 cgccagcccc acccgcatgc cctggaccag cgggcggccc tggacatacc ggcccgggtt 4098001 gaggtagggg caggccatcc gggctgccaa ggccttcagc ggtcgcagct cgtcgtgagt 4098061 cacccggccg ttgggccacc ggcgccgtgt caccgggtgc accggcagcc agacgtcgag 4098121 ttcgcgcgca ctctctaggg cagctgccag tatcccggtc aatacccggg cgtcgtcgaa 4098181 tgcatcgtgc ggccgttgct ggggcacacc ccaatgcgcg gcaagtgtct ccagccgcag 4098241 attgtcgacg ccaagctgca gccggcgggc cagctcgacc gtgcacatga cgaagtcaac 4098301 cgggagttcg gcctcggcga tctcggcctc cgcagcgaga aacgcatagt cgaacgcgac 4098361 attgtgcgcg accagagtgc gcccgcgcag cacgtcgaca acctcaccgg cgatatcggc 4098421 gaactgtggc tggtcatcga gcatggcggc ggtcaggccg tgcacgtggg tggggcccgg 4098481 gtccaccttg ggatttagca ggctgaccac ggattgctct agtcggccgg cggcgtccag 4098541 gccgagcacc gcaaggctga tgatccgggc ctggcccggc cgaaagcccg aggtctcgac 4098601 gtcgatgacg gcccaacccc gatcctggtg gctggctggc cgtccccagg tgtggctcac 4098661 aagacgagga tgacacgtcc gagcgacatc acctggtcgc tacgcatcgt gtcggcccgt 4098721 aaaacccgga cgcgggcgac ccgccgcacc cggcgacaag cgccgagctt gcgatcgccc 4098781 tgaatccaac gcgggcgacc cgccgcaccc ggcgacaagc gccgagcttg cgatcgcccg 4098841 taaactgccc gggtggtaac cacccgggca cgcctggccc tagccgccgg cgcgggcgca 4098901 cgctgggcgt cgcgggtcac cggtcgcggc gccggagcga tgatcggcgg tctggtcgcc 4098961 atgaccctgg accgctcgat cctgcgccaa ctcgggatgg gccggcgcac cgtcgtcgtc 4099021 accggcacca acggcaagtc gaccaccaca cggatgaccg cggccgcgct gggcacgttg 4099081 ggagccgtgg ccaccaacgc cgagggcgcc aacatggacg ccggcctggt ggccgcgctc 4099141 gccgctcacc gcgacgccga gctggcggtg ctggaagtcg acgagatgca cgtaccgcac 4099201 atctccgatg ccgtcgatcc cgccgtcgtc gtcttgctca acctctcccg agaccagctg 4099261 gaccgggtcg gcgagatcaa cgtcatcgaa cgcacactgc gggccgggct ggcccggcac 4099321 cccgacgctg tcgtggtcgc caactgcgac gacgtgctga tgacctcggc cgcctacgac 4099381 agccccaacg tcgtttgggt ggctgccggc ggcgcgtggt caaacgattc ggtcagctgc 4099441 ccgcgcagca gcgaggtcat cgttcgcaag gccccctctc aggaagacca ctggtactcc 4099501 accggcgccg acttcaagcg gcccgccccg cactggtggt tcgacgacgc cacgctgtat 4099561 gggcccgacg ggctggcgct gccgatgcgg ctggcactgc caggctcggt gaatcgcggc 4099621 aacgccgccc aagccgtggc cgccgcagtc gccctcggcg ccgatccggc tgtggccgtc 4099681 gccgccgtct gccaggtcga cgaggtcgcc ggacgctacc ggaccgttcg tatcggcgcg 4099741 caccaagccc ggatcctgct ggccaaaaac ccggccggct ggcaggaagc gctggcgatg 4099801 gtcgacaagc atgcagacgg ggtggtcatc gcggtcaacg ggcgggttcc tgacggcgag 4099861 gacctgtcct ggttgtggga cgtgcgcttc gagcacttcg agacgacccg agtggtagcc 4099921 gctggggagc gcggcaccga tttggcggtt cgcctcggat atgcaggcgt cgagcacacc 4099981 ctggtgcacg acaccgtggc cgccatcgcc tcatgcccac ccgggcgggt ggaggtcgtc 4100041 gccaactaca ccgcgttcct gcagctgcaa cgagcattgg cgcgtcgtgg ctgattctgt 4100101 ggtgcggatc gggctcgtgc tgcccgacgt gatgggcacc tacggcgacg gcggcaacgc 4100161 cgtggtgcta cgacagcggc tgctgctgcg cggcatcgcc gccgagatcg tcgagatcac 4100221 gctggccgat ccggtgccgg attcgctgga cctctacacg ctgggcggag cggaggacta 4100281 cgcgcagcgg ctggccaccc ggcacctacg tcgatatccg ggcctgcaac gcgcggcggg 4100341 ccggggtgct ccagtattgg cgatctgcgc ggccatccag gtgcttgggc actggtacga 4100401 gacgtcgtcg ggagaccggg tcgacggcgt ggggttgctg gatgtgacca cgtcaccgca 4100461 ggatgcgcgc accatcggcg agttggtcag caagccgttg ctggccggtt tgacccaacc 4100521 cttgaccggt tttgagaacc accgcggcgg caccgtcctc gggcccggaa cgtcgccctt 4100581 gggcgcggtg gtcaagggag ccggcaaccg ggccggcgac ggttttgatg gcgcggttgc 4100641 gggcagcgtg gtcgcgacct acatgcacgg gccgtgcctg gcccgcaacc cggagcttgc 4100701 cgacctgctg ctgagcaagg tggttggtga gctggcgccg ctggatttgc ccgaggtgga 4100761 cctgctgcgc cgcgaacggc tatccgcgcg ttaggtgggg cgttagggcc gccatcccct 4100821 ggccagcaga gcggcacgca cgcggttcac cacgtcgtcg gggttgtcct cggcgatcac 4100881 gcgaatgacg atccagccca actcggccag cttgcggagc cgccgctggt ctttcacgta 4100941 gcgaccgcgg tcgctgcgat gctgatcacc gtcgtactcg gcggccacca tgtatttctc 4101001 ccagcccatg tcgagcacgc caacgttgcg ccagcggtgg accaccggaa tttgcgtcgt 4101061 ggggaccggc aggccggcgt cgatcaacaa cagccgcagc caggtctcct tgggcgacgc 4101121 ggcgccgcca tcaacaaggg gcagcacgtc acgcaaccgg cggacacctc gggcgcccgc 4101181 gtgacgcttg gccaatagaa gcacgtcgtc gcgggaaaac ggggtggcac gcatgagggc 4101241 atcgagacga gccacggctt cgccgcggga cagatggcgg ccgaggtcgt atgccgtccg 4101301 cgccagtgtg gtgaccggca ggcccaccac cctggtgatc tcgtcgtcgc acaaggtctc 4101361 acgacgtatg acaagaccgt gctgcgggcg ggtagtggga gaaatcagct cgatggccac 4101421 gtcgacgtcc acccactgag caccatgcag cgcagaggcc gcattaccag ctatgacgcc 4101481 atggcgcctc gtggctagcc aggcgccaac cgtgcgatcc caaagtgtgg gcactgagcg 4101541 cctcgagacg tacacaccgc ggaacatcgg ctgataccaa cgttgcagct cgtgcctggt 4101601 caggcgacca gcggtgatgg cctcgctgcc gatgaagacg tcacccatga cggacatgct 4101661 ggcactccgc accgacatcc gtgagatcaa cattttgcag gcaaggtgcg agtagcggcc 4101721 tgcagaacgt tgatctcggc gaaagtcgga tgtcggcgaa tcaggcgagc acgcggcggc 4101781 cggcgagcgc tcggcccagg gtgagctcgt cggcgaattc caggtcaccg cccatcggca 4101841 gcccggacgc gatccgtgtg acggtcaggc cggggatgtc gcgcagcatt cgcaccaggt 4101901 aggtggccgt cgcctcgccc tcggtgttgg ggtcggtggc gatgatgacc tcggtgacgt 4101961 cgacgtcgtc gacccgttcc ccgatgcggc tcagcagttc gcggatccgc agctgatccg 4102021 gcccaattcc ggacagcggg tcaagcgccc cgcccaggac gtgatagcga ccccggaact 4102081 cgcgggtgcg ctcgacggcc tggatgtctt tgggttcctc gacaatgcac accacggacg 4102141 catcgcgacg gatatcagag cagattctgc aacgctcgtt gtcagagaca ttcccacaca 4102201 ccgcgcagaa tcgcacgccg tcccgaacct tcgccagcac accggtcagc cggtcgatgt 4102261 ccgacggttc taccgacaac aggtggaagg cgattcgctg cgcactcttg ggtccgatcc 4102321 ccggcaactt gccgagttcg tcaatcaggt cctggacggg tccctcaaac atgtcggtgc 4102381 aggtcagatc cctggtacag gtggtgcgcc cggcgcaccc ggcatacccg gcatacccgg 4102441 catacctggc gctcccggcg gcgcagccgg tggtgccggc gggcgcatcg cgccggccaa 4102501 tgcacccagc cgttcctgcg ccatcttcgt cacctgctgg gacgcgtcgc gcatcgcacc 4102561 gacgatcagg tcctgcaagg tctcgatgtc gtcgggatcg acgaccttgg ggtcgatcgt 4102621 cacgccgatc acctccccgc tgcctttgac gacgaccttg accaggcccc caccggcttg 4102681 accgtgcacc tcagagttcg ccagctgttg ctgggcctcc aggagctttt gctgcatctg 4102741 ctgcgcctga gcgagcagcg ccgacatgtc gcctccgggt tgcatgacag tcccctagca 4102801 tcttggtctc gagttggttt cgcctgtggt tgtcgggcga ttcggaacat tcagcctaga 4102861 ccgcgccgcg ttacctttgc gccgtggacc tacgagttgg cccgcgtgtc gggttcgcca 4102921 tgatagtcgg ggtactcgtc gcagcagcga cgccgatcat ctcgtccgcg agcgcaaccc 4102981 ccgccaacat cgccggcatg gtcgtcttca tcgaccccgg acacaacgga gccaacgacg 4103041 catcgatcgg ccgccaggta cccaccggtc gcggcggcac caagaactgc caggccagcg 4103101 gaacgtcaac caacagcggc tacccggagc acaccttcac ctgggaaacc gggctgcggc 4103161 tgcgggccgc gttgaacgca ttgggggttc ggaccgccct gtcacgtggc aacgacaacg 4103221 cgctcggacc gtgtgtcgat gagcgcgcca atatggccaa cgcgttgcgc cccaacgcga 4103281 tcgtgagcct gcacgccgac ggcggaccgg cgtctggccg cggattccac gtcaactact 4103341 cggccccgcc gctcaacgcg atacaggccg gtccctcggt tcagttcgct cgaatcatgc 4103401 gcgaccagct gcaggcctcg ggcattccga aggcgaacta catcggccag gacggcctgt 4103461 acggacgttc ggacttggcc ggcctgaacc tagcccaata tccgtcgatc ctggtcgagt 4103521 tgggcaacat gaagaacccc gcggactcgg cgctgatgga gtccgccgag ggcaggcaaa 4103581 aatacgccaa cgccctggtt cgcggcgtcg ccggcttcct ggccacccag ggccaggcgc 4103641 gttagccccg cacacaggcg gcacccccac cgcgcccgca tcgtcgtcag gcgtcaccct 4103701 cgagttcggt cttgaggttg gacagcacct cggcctggat cttcttcagc cctagcggcg 4103761 caaaggtctt ctcgaagaaa cccttgaccc cgcccgcgcc ggtccaggtg gtcttcaccg 4103821 tgacgctgga accgggtccg gcgggagcga ccgtccagtt ggtgaccatg gacgaattca 4103881 tgtccttctc gatgacggtg tgcccggcaa cgtccacgtt cacctgcaca tcgcgaacac 4103941 gcgactgcgt cgcctgcagc cgccacttgg cgactgtgcc ccgccccttg ccgccctcga 4104001 gcacctggta ctcgctgtag tgcggggaca ggattttagg acggacggtc tcatagtcgg 4104061 ccagcgcgtc gagtgtggcc gtgggctcag cattgatcaa gatcgtgctg gctgcgctca 4104121 cctgtcccat cagggccgga ctccttcgtt tgtgattgct gcaccgcccg cacccggatg 4104181 caggggcagt tgtcgaggac tagggtatat gcggtgcctg tccctggatc tgcacagtcg 4104241 gcttacgcct gcggcgtcga gcggttgctg gcgagctatc gatccatccc cgcgactgca 4104301 tccatccggc ttgccaagcc cacctcaaat ctgttccgcg cccgcgtcaa acacgatgca 4104361 cgcggcctgg acgcatcggg actgaccggt gtcatcggta tcgatcccga ggcccgcacc 4104421 gccgacgtgg ccggcatgtg cacatacgag gacctaatcg ccgcgacact gcactacggt 4104481 ctgtcaccat tggtggttcc gcagctgagg acgatcacat tgggcggagc ggtcaccggc 4104541 ttgggtatcg agtcggcgtc gttccgcaac ggcctgcccc acgagtcggt gctggagatg 4104601 gatatcctca ccggcgcagg agaacttctc accgtctcgc ccggacagca ctccgacttg 4104661 taccgtgcat tccctaactc gtatgggaca ctgggctatt caacccggct tcgaatccag 4104721 ctggagccgg tccggccgtt tgtcgcgctg cggcacatcc gatttagctc gttgacggcg 4104781 atggtggccg caatggagcg catcatcgac accggcggac tggacggcga atcggtggac 4104841 tatctcgacg gggtggtttt cagcgctgac gaaagctacc tgtgcatcgg catgcagacg 4104901 agcgtaccgg gcccggtcag cgactacacc ggacaagaca tctactaccg gtcgatccaa 4104961 cacgaggcgg ggatcaagga agaccggttg accatccacg attacttctg gcgctgggac 4105021 accgattggt tctggtgctc acgatcgttt ggtgcccaaa acccgcggct gcgccgctgg 4105081 tggccgcggc gctaccggcg tagcagtgtc tactggaggt tgatggcgct cgatcagcgc 4105141 ttcgggatcg ccgaccggtt cgagaacagc aggggtcgtc ccgcgcgtga acgggtggtg 4105201 caggatatcg aagtgccgat cgaacggacc tgcgagtttc tggagtggtt cggggaaaac 4105261 gtgcccattt cgccaatctg gttgtgcccg ttgcggctac gcgatcacgc cggctggccg 4105321 ctgtacccga tccggcctga ccgtagctat gtcaacatcg ggttctggtc gtcggtgccg 4105381 gttggcgcca ccgagggcgc caccaaccgc aagatcgaga acaaggtgag tgcgctcgac 4105441 gggcacaagt cgctctactc cgactccttc tatacccgcg aggagttcga cgagctctac 4105501 ggcggcgaga cttacaacac tgtgaagaaa gcctacgatc ccgattcgcg tctcctcgat 4105561 ctttacgcaa aggcggtgca acgacgatga caacgggcag actcagcatg gccgagatcc 4105621 tggagatctt caccgcgacc gggcaacacc cgctgaagtt caccgcgtat gacggcagca 4105681 ccgcgggaca agacgacgcc acactgggcc tggatcttcg gacgccccgc ggcgccacct 4105741 acttagctac cgctcccggc gaactcggcc tggcccgcgc ttatgtgtcg ggtgacctac 4105801 aggcacacgg agtacatccc ggcgatccgt acgaactgct caaaacgctg accgaaaggg 4105861 tcgacttcaa acggccgtcg gcgcgggtgc tggctaatgt ggtgcgctcg atcggcgttg 4105921 agcacatact gcccatcgcg ccgccacccc aagaggcgcg accccggtgg cgtcgaatgg 4105981 ctaatggctt gctgcacagc aagacccgtg acgccgaggc tatccatcac cactacgacg 4106041 tctccaacaa cttctacgag tgggtgctcg ggccatcgat gacctacacg tgcgcggtgt 4106101 ttccgaacgc tgaggcttcg ctggagcagg cccaagagaa caaataccga ctcattttcg 4106161 aaaagctacg gctagagccg ggtgaccggc tactcgacgt cggctgcggc tggggcggca 4106221 tggtgcgcta cgccgcccga cgcggtgtcc gggtgatcgg cgccacgctc tcggccgagc 4106281 aggccaagtg gggccagaaa gcagtcgagg acgagggatt gagcgacctc gcgcaggtgc 4106341 ggcattccga ctaccgcgac gtagccgaga ccggtttcga cgccgtttct tcgatcgggc 4106401 taaccgagca catcggcgtc aagaattacc cgttctactt cgggtttctc aagtcgaagt 4106461 tgcgcaccgg cggcttgctg ctcaatcact gcatcacccg ccacgacaac aggtcgacgt 4106521 cctttgccgg cgggttcacc gaccgttacg ttttccccga cggggagctg acgggctcgg 4106581 gacgtattac caccgagatc cagcaggtcg gcttggaagt gctgcacgag gagaacttcc 4106641 gccatcacta cgcgatgacg ctgcgcgact ggtgcggcaa cctcgtcgaa cactgggacg 4106701 acgcggtcgc cgaggtcggt ctgccgaccg ccaaggtgtg gggcctgtac atggcggctt 4106761 cgcgggtggc cttcgaacga aacaacctgc agctacatca cgtattggcg accaaggtgg 4106821 acccccgggg cgacgacagc ttgccactgc ggccctggtg gcagccctag gcgttgtcta 4106881 tccggcgcgc gcccagctcg ttctgcagca gctcgagtgc aacctcttcc gggtcgcgac 4106941 gcggcgacgg gtcgccacgg ccggcttcgg cgagcatgtg ctcctcttcg tcgcgctgag 4107001 tggaattcgc tgtgggggca gggtttacgg ccttggcggt cgccacgttc gctcccccgc 4107061 cgacgggtga tgccgccgca gccggttcac cggtctcaca ccgcacccgc cagttgactc 4107121 ccagcgcgtc tttaagcgcc tcggcgagga catcggcgtt gcgctgttcg gacagccgcc 4107181 gcgccagcgg cgccgattcg tgggtcagca ccagcgtgtt gtcctctagc gcacggacgg 4107241 tggcacccgc cagcatcacc tcggtggtac ggctgcgcag gcgcaccttg tcgcgcaccg 4107301 tcggccacat ggaccgaacc gcggccacgg tgggttcgct cgaggccggt gtgggggcca 4107361 gcaccggtct cggttcacgc gcgggctggt gtttcggctc ggcagccgca gccgacgggc 4107421 gtggtacggc ttgcggcgcc gggatcgaca tgtccaaccg ggtctcgatc cgttcgaccc 4107481 gctgcaacag tgccgattcg gcgtcgctcg ccgagggcag cagcagtcgc gcgcaaacca 4107541 cttccagcag cagacgcggc gcggtcgcac cgcgcatctc gcctagcccg gcctgcacca 4107601 cctcggcata tcgggtcagg gtcgcccgcc cgatccgggc ggcttgctcg cgcatccgat 4107661 ccagcgcgtc ttcgggcgca tccaccaccc cgcgagatgc cgcgtcggga accgattgca 4107721 gcacaatcag gtcgcggaat cgctccagca gatcggtagc gaaacgccga gggtcatgtc 4107781 cgccatcgat caccgattcg atcgccccga acaatgcggc cgcatcgcaa gcggccagtg 4107841 cgtcgaccgc gtcgtcgatc agggcgacgt cggtgacacc cagcagcccc agcgcccggg 4107901 tgtaggtcac gtgggtgtcc gcggccccag ccagcaattg gtccagcacc gagagcgtat 4107961 cccgtgggga acctccgccg gcccggatca ccaacgggta caccgcatcg tcgacgacga 4108021 cgccctcctg ctcgcagatc cgcgcgagca acgcccgcat agtgcgcggc ggcagcagcc 4108081 ggaacgggta gtgatgagtg cgcgaccgaa tcgtcggcag taccttctcc ggttcggtgg 4108141 tggcgaatat gaagatcagg tgttcgggcg gttcctccac gatcttgagc agcgcgttga 4108201 atcccgcggt ggtcaccatg tgcgcctcgt cgacgataaa tacccggtac cgtgactgga 4108261 ccggcgcata gaacgcgcgg tcccgcagct cgcgggtgtc gtccacgccg ccgtggctgg 4108321 cggcatccag ctctaccacg tcgatgctgc cgggggcgtt gggcgccaac gaaacgcagg 4108381 attcgcagac cccgcacggg ttggcggtag ggccctgcgc acagttcaac gaccgcgcca 4108441 ggatacgcgc tgacgacgtc tttccgcagc cacgcggccc agagaacagg tacgcgtggt 4108501 tgatccggcc ggcatccagc gccaccgaca gcggcgcggt gacgtgctcc tgccccacca 4108561 cctccgcgaa gcttgccggt cggtacttgc ggtagagagc cacgtcagca ggctaccgac 4108621 cctaggcgac gagtgtgttc gcagcgtcga atgtgaacgt tcggcgtgat ttcggcgcgc 4108681 gggttcccgc tctcagcgca cgttcggcgc cgaggaggct agtccctggt taagcaatgt 4108741 ctcggtcgcc gccagcagcg cgcaggtcgc caacccgtca accgcgttgc gcaggtccgg 4108801 taccgacgga aacgacggcg cgatccggat gttcttgtcg tccggatcct ttcgatacgg 4108861 gaacgacgcc cccgcctcgg tcaccgcgat accaacgtcc ttagccaagg ctacggtccg 4108921 gcgcgcggtc ccgggcaaca cgtcgaggct gatgaagtag ccacccttgg gctcggtcca 4108981 ggaggcgatc ttggactcgc ttagccgctg atccagaact tcggccacca acgcgaattt 4109041 cggcgccagt atctgctggt gacgcaacat gtgtagacgt accccatcgg cgtcgccgaa 4109101 gaagcgtaga tgccgcagct ggttgacctt gtccgggccg atcgacttct tcccggcgta 4109161 ctgcagatac caggcgatgt tgcctaacga tccaccgaag aagctgacac cgccgccggc 4109221 gaaggtgatc ttcgaggtgg acgcgaagac gtaggggcgg ttggggttgc cggccttggc 4109281 ggccagcccg agcacgtcga cctggcgcgg gaaatccagc gtcagggtat gcaccgcata 4109341 cgcgttgtcc cagaacaagc ggaagtcagg tgccgccgtc cgcatctgga cgagtcggcg 4109401 aaccgtttcc caggaatagg tgacgcccga agggttgccg aagaccggta ccgtccacat 4109461 ccccttgatg gctgggtcga cggcaaccag ttcttcgatc agatcgacgt cgggcccatc 4109521 ctgcagcatg ggtatcggga tcatctcgat gcccatggtc tcggtgatgg caaagtgccg 4109581 gtcatagccg gggaccgggc acaggaattt gatgccgtcc tgctcctgaa tccaaggccg 4109641 cggcgagtcc acgccgccat acaacatgga gaaggcgacg atgtcgtgca tcaattccag 4109701 gctggagttg ttgcccgcga tcaggttggg cactgcgatg ccgagcagtt cggcgaagat 4109761 agcccgcagg cccggcaggc cgtgctggcc accatagttg cgggtgtcgg tgccctccgg 4109821 gtcgcggtag tcgtctccgg gcaagctcag cagctggttc gacaggtcga gctgctctgc 4109881 ggatggtttg ccgcgggtga gatccagagc cagcttcatg ccctgaagcg ccgcataatc 4109941 ctgctgatgg cgtgcgtgta gtgccgctag ctcttggggg ctaagagagt cgaacgacac 4110001 cgtgggccct ttcgccgagt cgaaaaccgt gggtataccg aggtccagtc agtgccccgg 4110061 ctgaagggga ccccgcgcac ccgacagagc ccgttgaccc ttgctgcctt ccagccctgg 4110121 gggagttcac aggatagacg ccgcgcgggg tccaccgtga gtctaatacc tgggctggaa 4110181 cgcccgggac ggactcagcg ggctaccata tgctgcggag gattcgccta gtggcctatg 4110241 gcgctcgcct ggaacgcggg ttgggttaac agccctcgcg ggttcaaatc ccgcatcctc 4110301 cgccaggtgg tccgcagcgc ggacgggaac gcggacggga acgcggacgg gaacaatgtg 4110361 ggctggtcgg cttctcaccg gctcggttca ccagcctaag gaggggtatg gggcgcaagg 4110421 tcgccgtgct gtggcacgcg tcgttttcga ttggcgccgg cgtcctctac ttctatttcg 4110481 tattgccccg ttggcctgag ctgatgggtg acaccggaca ctcgctgggg actgggctcc 4110541 ggattgccac gggcgcgttg gtcggtctgg ccgcactgcc ggtggtattc actttgctgc 4110601 gcacccgcaa gccggagctg ggcaccccgc agctggcgct gtcaatgcga atctggtcga 4110661 tcatggctca cgtgctggcc ggcgcgctga tcgtcggcac cgcgattagc gaggtctggc 4110721 tcagcctgga tgccgccggg cagtggttgt tcgggatcta cggagctgcc gccgcgatcg 4110781 cggtgctcgg gttcttcggg ttctacctgt cgtttgtcgc cgagctgccg ccgccaccgc 4110841 cgaagccgct caagccgaag aaacccaagc agcgacgcct tcgccgcaag aagacggcca 4110901 agggcgacga ggctgagccg gaagccgccg aagaagccga gaacacggag ctggcggcgc 4110961 aggaggacga ggaggccgtc gaagctcccc cggaaagcat agaaagcccg ggaggtgaac 4111021 ccgagtcggc gacccgggaa gctccggcag cagagaccgc caccgccgag gagccccggg 4111081 gcgggttacg gaatcgccgc cccaccggca aaacctcaca tcgacgccgg cgcactcgca 4111141 gcggtgtcca ggtcgccaag gtcgacgaat agccgcggtc aggtgctgta gcggcggctg 4111201 tgaaccctgc gacgcaatgt cggcgtgtca cgttgtcgga ttcactgtcg ccggctagcg 4111261 ctttcccgtc agaagacgag aagcctcccc gatctccaac tagcatcgag atcgggcttg 4111321 cgaaggttgg gttgcaaaat ggatgtcatc agatgggctc gccggcttgc ggtggtggcg 4111381 ggcacagcag cggcagtgac cactcctggg ctactgagtg cgcacgttcc gatggtctcc 4111441 gccgaaccgt gtcccgacgt cgaggtggtg tttgcccgtg gcaccgggga gccacctggt 4111501 attggcagcg tcggaggact gttcgtcgac gcactgcgtt cccaggttgg cgccaagtca 4111561 ctcggggtct acgccgttaa ctaccccgcc agtaacgact ttgccagcag cgacttccct 4111621 aagacggtca tcgacggaat tcgcgacgcg ggctctcata tccagtcaat ggcgatgagc 4111681 tgtccccaga ccaggcaagt actcggtgga tactcccaag gtgcggccgt ggccggttat 4111741 gtcacctcgg ctgtggtacc gccggctgta cccgtgcagg cggtaccggc accgatggcc 4111801 ccggaggtag caaaccacgt cgccgcggtc actctgttcg gcgcaccgtc ggctcaattc 4111861 ctgggccagt acggcgcgcc gccgatagcc atcggtcccc tgtaccagcc gaaaacgctt 4111921 cagttgtgtg ccgatggcga ctcgatttgt ggcgacggca acagcccggt cgcgcatggc 4111981 ctgtacgcgg tgaacggcat ggtaggccag ggcgcgaatt tcgccgccag ccgcctgtag 4112041 ccagaactgc gctgccaccc cagcgagagc tgggcggtga tccaatgcag aatgccacca 4112101 tgcgcgttct ggtcaccggc ggtacgggat ttgtgggcgg gtggactgcc aaagccatcg 4112161 ctgacgcggg ccactccgtc cggttcctgg tgcgaaatcc cgcacggctg aagacgtctg 4112221 tcgcgaaact gggcgtcgac gtgtcggact ttgcggttgc agacatatcc gaccgcgatt 4112281 cggtacggga ggcgttgaac ggatgcgacg ccgtcgtgca cagcgccgcg ctggtggcaa 4112341 ccgacccgcg tgagacttcg cggatgctga gtacgaacat ggcgggcgcc caaaatgttc 4112401 tcggtcaagc cgtcgagctc ggaatggatc cgatcgtgca tgtgtcgagc ttcacggcgc 4112461 tgtttcgtcc caacttggcg acgctgagcg ctgatctgcc ggttgccggt gggacggatg 4112521 gatacggaca atccaaagcg cagatcgaaa tctatgcgcg cggtcttcag gacgccggcg 4112581 caccggtgaa catcacttat cctggcatgg tcctcggccc gccggtgggc gatcaattcg 4112641 gtgaagccgg ggagggtgtc cggtccgcat tgtggatgca tgtcattccc gggcgcggcg 4112701 cggcgtggtt gatcgtcgac gtccgagatg tggcggcact gcacgcggcg ttgttggaat 4112761 ccgggcgtgg gccgcgccgc tacactgcgg gaggtcatcg gattccggtg cccgagctcg 4112821 cgaaaattct gggcgaggtc gccggcacca cgatgctggc cgtcccggtg cccgattccg 4112881 cgctgcgtgt cgcgggatcg gtgctggatc aagccgggcc ctatctgcct ttcaatactc 4112941 cgttcaccgc ggcaggtatg cagtactaca cacagatgcc ggagcccgac gattcgccga 4113001 gcgaaaaaga actaggcatc acctaccgcg atccgcgcga caccgtggcc gacaccgtca 4113061 cggccctgcg cggcctgggc agctaactgc cgtcgggagg ttccgccggt tccgcgtcgg 4113121 ggcgcgaatt cttcaaccac tgcttcagcc ggagcagttc gttgacgacg atgccgacgc 4113181 ccaggaggat gaccagcgtc accacaatag cggtggccac gtagtccatg gtgacagccc 4113241 cgccacggcg cacgttcagg ccgcttgctg tcggatcgag aggacctacg cgatgaaggc 4113301 ggtgacctgc accaacgcaa agctcgaggt agtcgaccgg ccgtccccgg cgccggccaa 4113361 gggtcaactg ttgctcgatg tgctgcggtg cggtatctgc ggatcggacc tgcatgcccg 4113421 cttgcactgt gatgaactgg ccgacgtgat ggccgaatct ggctaccacg ccttcatgcg 4113481 atcgaatcag caggtggtgt tcggacacga gttctgtggc gaggtggtcg attacggtcc 4113541 cggcacccgc aggaccccta ggcgcggcac cccggtcgtc gccatgccgc tgctgcggcg 4113601 tggcaacaaa gaggtgcacg ggatcgggct ttcgacaatg gcgccgggcg cctacgccga 4113661 gcggctcgtc gtcgagcagt cgctgacgtt tcctgtcccg aacgggctgg cgcccgagat 4113721 agccgcgctg accgagccca tggccgtcgg atggcacgcc gtccggcgcg gcgaggtggg 4113781 caagggcgac gtcgcgatcg tgatcgggtg cggtccgatc ggcctcgcgg tgatctgcat 4113841 gctgaagtcg cgcggggtac acacggtgat cgcaagcgac ttttcacccg gccgtcgtgc 4113901 cctcgcaacc gcctgtggcg ctgattccgt agtcgatccc gtacaggact caccgtatgc 4113961 ggtagccgcc ggccttggac agggaaacag acacctgcaa agcatcctcg acgcgttcga 4114021 cctcgcagtc ggcacggtcg aaagcctgca gcggctgcgg ctgccgtggt ggcacctttg 4114081 gcgggctgcc gaagcagctg gcgccgcaac gccaaagcgt ccagtcatct tcgaatgtgt 4114141 tggcgttccg ggaattatcg atggcatcat cgccagcgca ccgctgttct cgcgcgtcgt 4114201 cgtggtcggc gtctgcatgg gctcagacca catccggccg gcgatggcga tcaacaaaga 4114261 gatcaacctg cggttcgtcc tcggctacac accgttagag ttccgcgaca cgttgcacat 4114321 gctggccgac ggcaaggtca acgccgcgcc gctgatcacc gggacggtcg gtttacccgg 4114381 cgtggcggca gcattcgatg cgctcggcga tcccgaggcg cacgcaaaaa tcatgatcga 4114441 ccccaagagc aacgccgcga gtccccaacc attccgcgtg gagtgaatga tgcgggatag 4114501 ccgcacggcg ttggatccac ccgggacgac agcttgaatt caggcggcct ctgctttaaa 4114561 gcgcacacta ccgcgcctgc tgcggcatgg atccaaatat ccgccaaagt acgtatggac 4114621 atccgatagc ccggcgcacc tacgacccgc cgcgcagaca catttacgcg ttcgcaccga 4114681 tggctgcgga cccagcaaat ggcagagtta gagcgtcggc cgtgtcttga gtcaatgctt 4114741 ccaggccggc accttttctc ccgtggaccg catgtgccca cggtcgcgtc agtaccgccc 4114801 gaatcattcc ttgaggccta ttgcagatga aaccgtcgcc tgccgatacc cacgtcgtga 4114861 ttgccggtgc tggcatcgcg ggattggctg ccgccatgat cctggccgaa gccggggtgc 4114921 gagtcacatt gtgcgaagct gcatccgaag ctgggggcaa ggccaagagt ttacgtctcg 4114981 cggacggcca cccgaccgag cacagtttgc gggtttacac cgatacttac caaaccctgc 4115041 tgacgctgtt ctcgcgtata cccaccgaac atgacaggac catgctagac aacctggtcg 4115101 gcgtcagcat ggtttcggct accgcgcaag gcgtgattgg ccgaatcgct gcgccagttg 4115161 ccttgcaacg ccggcggcca accttcgcgc ggatcatagg caaggtagtc gaaccgccgc 4115221 ggcaacttgt ccggatcttg ttgcgcggcc caatggtaat cgttggtctg gcccaacgag 4115281 gtgtgccggc caccgacgtc ctccattacc tctacgccca tctacggctg ctgtggatgt 4115341 gccgagagcg actcttggcg gagctgggcg atatctcgta tgcggattat ctgcagctcg 4115401 gctgcaagtc tgcccaggcg caggaattct tttctgctgt gccgcgcatt tacgtcgcgg 4115461 cgcgcaccag tgccgaagcg gcggccattg cgcccatcgt tctcaagggg ctgtttcgcc 4115521 tgaaaagtaa ttgtccatca gccctcaacg acgcaaagct gcccgcgatc atgatgatgg 4115581 atggaccgac cagcgagcgc atggtcgatc cctggattcg ccacctgaca aggctcggcg 4115641 tggacatcca cttcaacacg cgtgtcggcg atctcgagtt cgacgacggt cgcgtcaccg 4115701 cattgatatc gtccgatggc cgccggtttg cctgcgacta tgccctgctc gcggtgccct 4115761 atctgacgct gcgagagctg gccaaatcag ctcatgtcaa gcgatatctc cctcagctca 4115821 cacagcagca cgcccttgcg cttgaggcat cgaacggaat ccagtgtttt ctgcgcgacc 4115881 tccctgcgac gtggcctccg ttcatccgcc ctggagtcgt cactacgcat ctgcaaagcc 4115941 agtggtcgct ggtctgcgtt ctgcagggag aaggtttctg gaaaaacgtc cgcctgccgg 4116001 aaggaacccg ctacgttctg tcaataacct ggagtgatgt ggaaacgccc ggacctgttt 4116061 ttgatcggcc attgagtgaa tgtacgccag atgagatctt gaccgagtgc ctgacgcagt 4116121 gcggcctcga taaatcgaac gtcttgggct ggcggatcga tcacgagctg aagcacttag 4116181 acgaggccga atacgaaaag gtggcgagcg agctgcctcc tcatcttgtc tcggcgcctg 4116241 cgcgcgggca gcgcatggtg aatttctcgc cgcttaccgt attgatgccg ggcgcgcgcc 4116301 accgctcccc gggtatttgc acctcagtgc ctaacctttt gctagccggt gaggtgatct 4116361 attcacccga cctgaccttg tttgttccga ccatggagaa ggcggcatgc tccggctatc 4116421 tggccgcccg ccaaatcatg aacatggttg cttcgcacgc cgcaccgctg cggatcgact 4116481 tccgggatcc cgccccattt gcggttctgc ggcgggtgga ccgatggttt tggagccgcc 4116541 gccgacgacc gccagaccgg tcgacatttg caaccccacc aaccgccatg ccggcgccga 4116601 gccacctgac cgacgtggat cgctctgcaa gttagccgcc ggtaacccac caagcctcgt 4116661 cacgctacaa gtccaccgtt gaaccgacgg cgttgacgcg tcacatatcc ctgatccttc 4116721 aagaacgtgg agtttccctt gactgtgcac accgtcgcca ccaacaatgc tgcgcccgtc 4116781 atagccgccg gtcccgtcgg ccctagcaga cgacgccgtc gcgtgcacgc cccacttacg 4116841 cgacgccgcc aaccctcctc ctcggcggtg ctgctggtgg cggctttcgg cgccttcctc 4116901 gctttccttg actccacgat cgtcaacgtc gcgttccccg atatccagcg gcacttccac 4116961 agcgacatca gtgacctgtc ctggatgctc aacgcctaca acattgtttt cgcggcgttc 4117021 ctggtggccg ccggcaggct ggccgacctg atggggcgca agcgggtgtt catcttgggg 4117081 gtggcgttgt tcaccgtcgc gtccgggctg tgcgcgatcg ccgaaagcgt cggggaactg 4117141 gttgcgttcc gtgtgctgca aggcatcggc gcagcggttc tggtaccggc ttcgctgggg 4117201 ctggtcgtcg aggccttccc ggccgagcgg cgcgcgcacg gggtcaacct gtggggtgcg 4117261 gcgggggcca tcgccgcggg cctcggcccg ccgatcggtg gcgccctcat cgaggcggat 4117321 ggctggcggt gggtgttcct ggtgaacctt ccgctggggg tattcgctgt gctggccgct 4117381 cggcgggcac tggtggagaa ccgggccgcc ggacgtcggc gtgtgcccga cgtgcgcggc 4117441 gcggtgctgc tggctttcgc gctgggcctt ttgacgctgg gattgatcaa gggcccggat 4117501 tggggttggg ccagcctgcc gaccagcggg tcattgctgg ccgcggcggt cgcgatggtt 4117561 gggtttgtga tgagctcacg acaccacccg gcaccgatgg tcgagcccac gctgttgcgc 4117621 atccagtcgt tcgtggccgg caccgggctg accgccgtgg ccagcgccgg cttctacgcc 4117681 tatctgctga cgcacgtgct gttcctcaac tacgtctggg gttacacgct gctggaggct 4117741 ggcatggccg tcgcccccgc cgcgctggtc gccgccgtcg tcgcggcggt gcttggccgc 4117801 gtcgccgacc ggcacggtta ccgcttcatc gtcggcatcg gcgcgttgat ctgggctgcc 4117861 agcctgctgt ggtatctcaa ggttgtcggg tcccagcccg atttcctcgg tgaatggctg 4117921 cccggccaga tactgcaggg aatcggggtg ggcgctacct tcccgctgct cggcagtgcc 4117981 gccttggccc ggctggccaa gggcggcagc tacgccaccg cttcggcggt gaccggcacc 4118041 atccgccagg ttggcgccgt catcggcgtc gcggtgctgg tgatcctggt cggcacaccg 4118101 gcaccgggcg cagccgaaga ggcgttgcgt cacaggtggg cgttggccgc gatctgtttc 4118161 gtggcggtgg ggatcggggc gctgtcgctg ggtcgcatcc gcccagtccc agctgcggtt 4118221 gaacccccgc cggggccgcc ggtggctccg ttgggagcgc ggcggccgcc gagacccgca 4118281 ccggtggcct cacccgccgc ggcagtggcc ccgaccccca agacttcccg cgaagtcaac 4118341 ctgctggagg ctctgcggtt tgccaggccg gacacgcaac agattgagct gcaagcaggc 4118401 tcgtatttgt tccacgcggg cgatgtgtcc gatgcgctct acgtggtgcg cagcggccgc 4118461 ctgcaagtcc tcgccggcga cggcgcaaag gacgaagtgg tggccgagct gggccgtggt 4118521 caggtggtcg gggagctcgg ggtgctgctc gatgcgccgc ggtccgcgtc ggttcgtgcg 4118581 gtacgcgact cgtccctgat gcgagtgacc aaggccgaat tcgcgaagat cgccgatgcc 4118641 ggggtgcttg gggcgctggc gggggtactg gccaaacgac agcaccagac acgcgtggcc 4118701 tctcagcgga caacgccgga ggtcgttgtc gcggtcgtcg gtgtcgacgc caatgcaccg 4118761 gtcgcaatgg tggccaccga attgtgcagg gcactgtcga cacggctacg tgctgtcgcc 4118821 cccggccggg tcgactgcga cgggttggaa cgtgccgagc agaccgccga ccgggtggtg 4118881 ctgcatgcgg ccgtcggcga cgcgcggtgg cgggaattct gtttgcgtgt cgccgatcgc 4118941 gtggtgctgg tggccagcaa cccggccgtg cctgtggccc cgctgccgac ccgagcgacc 4119001 ggcgccgacc tggtgctggc cggacggccc gccggccggg agcaccgacg tgcctgggag 4119061 cagttgatca cgccgcggtc gatgcatgtg gtccgacgcg aatttgtcgc cgacgacctg 4119121 cgggtgctcg ccacgcgtat cgcgggccgt tccgtggggc tagtcctcag cggtggggca 4119181 gcgagggcgt gtgcccactt gggcgtgctg gaggaactgg aggccgccgg ggtcaccgtc 4119241 gaccgctttg ccggcaccag catgggcgca atcatcgcgg ctctggcggc cagcggtttg 4119301 gatgctgccg gggtggatgc gcaaatctac gagcacttcg tgcgcaagag ccacggcgac 4119361 tacaccctgc cgagcaaggg gctgatccgc gggaaacgca cccagtccac gctacgcacg 4119421 atcttcggag accatttggt ggaggagctg ccgaaacatt tccgctgcgt cagtgtcgac 4119481 ctattggccc ggcgtcccgt cgtgcaccgc caaggcccgc tcgccgacgt cgtcggctgc 4119541 tcgatgcggc tgccttttct gtatgcgcca ctgccctacg gcggcaccct gcacgtcgac 4119601 ggcggtgtgc tggacaacgt gcccgtcacc acgctggtgg gcaaggacgg cccactgatt 4119661 gcggtaaacg tggcctctgg cggaaatcca agccccgcgt ccggcggcca tcgccgcggc 4119721 aaaccacggg tgcccggcct aaccgacacc ctgctgcgca ccatgacaat cagcagcgcg 4119781 atggcatcgg aaaaagtgtt ggcccaggcc gacctggtga tcaagcccaa cccgatcggc 4119841 gtcggactca tggagtacca ccagatcgac cgcgcccgtg aagcgggccg gatcgcggcc 4119901 cgtgaagcgt tgccacaaat catggagctg gtgcacggct gaacctgggc agggccgcta 4119961 agatactgtg accacggcca cgctatcggc ggcctggcca gctttccggg ccgctacccg 4120021 atgggagtcc tcacccacgc cgccggcgga cccaaccccg attgttcgac cgcagacact 4120081 gatctatcgc gcaggcgttg ccgcatggtg gactagccca atgacgcggg ctgacggcaa 4120141 gcgcgaccgt gacgagatgt tcgtcgaata caccaagagc atctgccccg tctgcaaggt 4120201 cgtggtcgac gcccaggtca atatccgcca cgacaaggtg tatttgcgta agcgctgccg 4120261 cgagcacgga agtttcgagg ccctggtgta cggggatgcc cagatgtatt tggaatcagc 4120321 acgattcaac aaaccgggca cctttccgct gcggtttcag accgaggtgc gcgacggctg 4120381 tcccagtgac tgcgggctgt gcccggacca caagcaacac gcctgcctgg ggttgatcga 4120441 ggtcaacaca cactgcaacc tggactgccc gatctgtttc gccgactctg gccaccaacc 4120501 cgacggctac gccatcaccg cggcgcagtg tgaacggatg ctcgacacgc tcgttgccgc 4120561 cgagggtgaa cccgaagtgg tgatgttctc cggtggcgaa ccgaccatcc acaaacaact 4120621 cctcgagttc gtcgacgccg cccaggcccg cccggtcaag accatcatca tcaacaccaa 4120681 cggcatccgg ctggcctccg accggcgatt cgtcgaccag ctcgccaccc gcaaccgtcc 4120741 cggccacccc gtgcacatct acctgcagtt cgacggcctg gacgaggcaa cacatcgtcg 4120801 aatccggggc cacgatctgc gggacgtaaa gcagcgggcc ctggacaact gcgccgcggc 4120861 gggcctgacc gtcagcctgg tggccgcggt ggaacgcggc ctcaacgagc acgagctcgg 4120921 cgcggtcatc cgccacggca tggcgcagcc cggagtgcaa tcggtggtat ttcagccggt 4120981 cacccacgcc ggccggcatg tgcagttcga cccgctgacc cgactgacca actccgacat 4121041 catcgcctgc atcaccgcgc aactgcccga atggttcagg cccggtgact tctttccggt 4121101 gccatgctgc ttccccagct gccgatcgat cacctacctg ctcaccgacg gggagcatgt 4121161 ggtcccgatt ccgcggctgc tcaatgtcga ggactacctc gactacgtct ccaaccgggt 4121221 gatccctgac ctggcgatcc gcgaagcctt ggagaacttg tggtcggcgt cggcggtgcc 4121281 aggcaccgac accatgaccg cacagctaca gcgggctacc gccgccctga actgcgccga 4121341 gggctgcggg atcaacctgc ccgaggccct cacgcacctc accgaccggg tcttcgccat 4121401 cgtcatccaa gacttccagg atccctacac cctcaacgtc aaacagctga tgaaatgctg 4121461 cgtgcaacag atcaccccgg acggacggct gatcccgttc tgcgcctaca actcggtcgg 4121521 ctatcgagag caggtgcgtg aacagctcac cggggtaccg gtacccgaca ttgtgcccaa 4121581 tgccatccca ctcgccgggt tgctggcgga cgcaccacac ggatcaaaac aggccaatac 4121641 cggtgggagt atcgccaggc tcgcggggcc aacccgaggt gcgccgatgg cactgccacc 4121701 acagcagatc aaagcgtgtt gcgccgacgc ctattcccgc gacatcgtcg ccttgctact 4121761 cggtgactcc tttcacccgg gcggcgcgac attgacccgt aggttggctg accaactcgg 4121821 gctgaggtcg acaggcgacc cgcggcgggt cgccgacatc gccgccgggc ccggcgcctc 4121881 cgcacggctg ctggccagcg actacggtgt ggctgtcgac ggggtcgaca tcagcgagat 4121941 caacgtgaag cgcgcccaag ccgccgtcgc gcaaaccggc ctgaccgagc gggtgcgctt 4122001 ccacctgggc gacgccgaat cagtcccgtt gcccgacgac acattcgacg cgctggtgtg 4122061 cgagtgcgcg ttctgcacat tcccggacaa gaacgccgcc gcccagcagt tcgctcggat 4122121 tctgcgtcct ggtggcctgg ccggcatcac cgatgtcact gtcggggacg gcggcctgcc 4122181 ggcggagctg accccattgg ccgcgtgggt cgcctgcatc gccgacgccc gaaccgtcac 4122241 cgactacacc gacatcctcg aaggggccgg attgcgcacc cgccacatcg agtctcatga 4122301 cgagagcctg ctggacatga tcgaccgcat cgacgcgcgg atcaccgcct tgcacgtcgc 4122361 cgcaccggag atcctcgccg acaacggcat tcgccacgac tcggtgcgcg atttcacagc 4122421 gctcgcacgc gccgcggtac aaaccggacg aatcggatac acgttgatga tcgcggaaaa 4122481 gccgtgataa tccaggaaat gtgggacaga ccaatcgcat ttcccgcatc tgaggagcga 4122541 gccgcaccgc gttacttcga cgtgtttccc cccttcaagt cggtatcccg gctcggctgc 4122601 acccgcttgg gttcgcccgg catcttcgga tagttcggcg gatacggcat gtcaccgagc 4122661 ccgcgctcct cgtcggcggc ggccaagtcc agcaatggtg caatcgactg ggccacgtcg 4122721 tccatcccgg cccaggggtc gtcgcggatc ttcaccagct cgggcaccgt ggtcatggtg 4122781 tagtcgtcgg gatccgcgcc ggccagctct tcccaggtca acggcatcga taccgtcgcg 4122841 atcggggtag gacgcaccga ataggccgac gccatggtgc ggtcgcgggc gttttggttg 4122901 aagtcgatga agatacgcgc gccccgttct tccttccacc acgacgtcgt caccgcatcc 4122961 ggtgcgcggc gctcgacttc ccgggccaac gcaatgcccg cccgacgcac ctcgacgaag 4123021 tcccagtcgg tggcgatgcg caggaatacg tgaatccctc tacccccgga tgtcttcgga 4123081 taaccgacca gaccgaggtc gtccagcacg gaccggagca catcgacggc gaccgtacgc 4123141 gcctccacaa agccggtgcc cggttgcgga tctagatcga tgcgcaattc gtcggggtgc 4123201 tcggtgtcgg ggcagcgcac ttgccacggg tgcagggtga ttgtgcccat ctgcgccgcc 4123261 catacgatcg ccgccgggtg ggtcaccttc agcgcgtcag ccatccgccc cgacggaaac 4123321 gtcacccggc acgtctgcag gtagtcaggg cggtgccgcg ggatccgctt ttggtagatc 4123381 tgctcgccgt cgacgccgtc cgggaagcgc tgcaagtgcg tcggccggtc acgcagcgcc 4123441 gtcagcatcg gacccccggc cacggcgaag tagtactcaa cgaggcggcg cttggtgccg 4123501 tgcgacccca gcttcgggaa atacatcctg tccgggctag tcaaccgcac cgcgatgccg 4123561 tcgacgtcga gttcctcagc tgccgccgcc atatcggaat tccagcatgc cgcacgcaag 4123621 aatgagcaca tgcagttacc cgtcatgccg ccggtgtcgc cgatgctggc caaatcggtc 4123681 accgcaatcc cgccggacgc gtcgtatgaa cccaaatggg acggattccg ctccatctgc 4123741 tttcgcgacg gtgatcaggt cgaactgggt agccgcaacg agcggccgat gacccgctac 4123801 ttccccgagc tggtcgccgc gatcagggcc gagctgccgc atcgctgtgt gatcgacggg 4123861 gagatcatca tcgccaccga ccacggcttg gacttcgagg cgctgcaaca gcgcatccat 4123921 cctgccgagt cgagggtgcg aatgcttgcc gaccgcacac cagcctcctt catcgcattc 4123981 gacctgctgg ccctcggcga cgacgactac accgggcgac cgttcagcga aagacgagcc 4124041 gctctggtcg atgccgtaac tggttcgggg gccgacgctg acctgtcgat ccacgtcacc 4124101 ccggcaacca ccgacatggc gaccgcacaa cgatggttct ccgagttcga gggggccggt 4124161 ctagacggtg tcatcgccaa accgccgcac atcacctatc aaccggacaa acgcgttatg 4124221 ttcaagatca aacacctgcg gaccgccgat tgcgtggtgg ccggctaccg ggtgcacaag 4124281 tccggcagtg acgcgatcgg ctcactgctg ctagggcttt accaggagga cggccaactc 4124341 gcgtcggtcg gcgtgatcgg cgcgttcccc atggccgaac gacgccggct attaaccgag 4124401 ctgcagccgc tggtcaccag cttcgacgac cacccatgga actgggccgc ccacgttgcc 4124461 ggccagcgca ccccacgtaa gaacgagttc tcccgctgga atgtcggcaa agacctgtcg 4124521 ttcgtgccgc tgcgacccga gcgggtggtc gaggtccgct acgaccacat ggaaggcgcg 4124581 cggttccgcc acaccgcaca gttcaaccgg tggcgccccg accgcgaccc acgctcatgc 4124641 agctatgccc agctcgaacg cccgctcacc gtcagcctct ccgacattgt gccgggccta 4124701 cgctaaggtg cgaccctctt cggtcagttg atccccggtg ggccgatcgg ctcgggcgcc 4124761 acatccgggt cggttcgttg cgttcggccg cgtaacatct gcggcatggc ggtgctgccc 4124821 gcgtgccggt tgggacttgt cgtctgtgtg gcgaccgcag tgatcacagc aaccatggtg 4124881 ttggctacgc cgagctatgc atgcgcctgc ggtgccgcgg tcacagcaca tggctcccaa 4124941 gcaactttga atcatgaagt cgcgctgctt cattgggacg ggacgaccga gacgatcgtc 4125001 atgcagctgg caatgaacgc cgataccgac aacgttgcct tggtagtgcc caccccgacg 4125061 ccggcgatag ttacaaccgc ggaccagtcc acgttcggcg agctggacac gctcagtgcg 4125121 ccgttgatcg agcatcagcg acattggagc ttaaggcgcg gtgtcggtgc ctccggtccc 4125181 caggaggccg ccgcccgggc cccgcatgtg ctcaaccagg ttcgccttgg cccgctggag 4125241 gccaccacct tgaccggcgg ggatctgagc ggcctgcaga cttggttgtc tgacaacggc 4125301 tatgcgattc gaccggcggt gtcagcggcg ctggatccct acgtgcgtga cggatgggcg 4125361 ttcgtggcga tccggctgac cagcaccgac ctgatagtgg gcgggctcga tccggtgcgg 4125421 atgaccttcc gatcgtcgcg gttggtgtat cccatgcggc tatcggtcgc cgctcaggag 4125481 ccgcaacatg tcaccatctt caccctgtcc gatcaccggc agcagcgcac cgacgccgac 4125541 gctgccacac agacaaccca cgtccggttc gcgggcgaca tgtccactgc ggttcgtgac 4125601 cctctgttgc gcgagctgat cggcaaccac ggctcatatc tgaccaaggt cgaggtggac 4125661 atctatcaga catcgcgaat ctcttcggat ttcacgttcg gcaacgcacc aaacgacgat 4125721 ccgtaccggc aggtggtcac cgtttacgac gatgtcgcac tccccccgct gctgctggtg 4125781 gtcgtgtcgg cgatcgcggt gggcgcggcg ggcggggccg ttgtggtggt tctgcggcga 4125841 cggcggcgcg cccacactgg gtagtccgcc acggtgaggg cgctcagcga ggcagggatt 4125901 ctggtccttc agacaaaccc gccacggccg ggtgcgccat caaccggtcg agaaaacccc 4125961 gctgcccctt gagcagtttg gtgcgtgccc gcgctaccgg aaaccagctc acccggtcga 4126021 cctcggggaa cttacgcatc ttgcccgagc ccttcggcca gtccaattcg aaggtgctgc 4126081 ttcgtgcgtc ggtgatgtcc agatccgccc ggacaccgaa cacggtcacc accttgccgc 4126141 cggactgttt cagcgacccg aagtcgattc gcggcccgtc aggcacgcac aacccgatct 4126201 cctcggagaa ctcgcgccgg gcggccagcc acggatcttc gccgccggtg tattcgccct 4126261 tcgggatcga ccaagcgccg tcgtcctttc ccgcccaaaa cgggccgccc ggatgcgcca 4126321 gaaggacgtc gacgacaccg gcgcgcgccc gatacagcag cacacccgcg ctgagcttgg 4126381 gcatgagtac gggttcttta gatcccgacg gcctgttcca gatccttcag cgacgattcc 4126441 aggtgcgcaa gcagccgctg caagtggggc acactgcgac ggcacccgac cagtccgaag 4126501 tcgagattcc cagcattgtt caccagggtg atgttcaacg cttgaccgtc cgggatgttc 4126561 gacaatgggt aactaccgtc aagccgggcc gtgccgtagt agagcgggtc taccggcccc 4126621 ggcacattcg agatgacgat gttgaacggt ggcggcactg ccgacaagaa acccggtaca 4126681 cccgccaacg tcagcggcgc catattcaat gccgacaatg caagcacctg cagctgcggc 4126741 aattcggaga gcactttctt gttgccgtcc atggacgcgc tgatggtctg aatccgttgc 4126801 gctgggtcgt cgacatgggt ggcgagattg cacaggacgc tgccgaccaa gttgccgccg 4126861 gcgtcagcgt cctctttgga gcgtaggctc accggaacca tcgcgatcag cggtctgtcc 4126921 ggcagcgcat tccgctctat caggtagtag cgcaacgcac cggcacacat cgccaggacg 4126981 gcgtcgttga cggtcacacc ggcggcctgc ttgacgctct tgatccggtc cagcgaccag 4127041 gactgcgcag cgcaccggcg ggctcccccg accttgacgt tgaacatgct gtgtggcgcc 4127101 gcgaacggca gcgtcaactg ctgctcgagt agcgccgcac gagccagctt cagcgtcgac 4127161 ggtgcaagtc cgacaacgga tcccgccatc ttgaacagcg catccaacag tgacgagccg 4127221 tccgatggcg ggcgcgtacg tgggcgcgga ggcaggttcc agatggcgcg cacctcggcg 4127281 tcgtccgggt cagccgacag cgtgcgctgc gccagcttca tcgccgaaac accgtcgatc 4127341 agggcgtggt gcattttggt gtacatagca aaccggccgt cgttcagccc ctccaccacg 4127401 tgcagctccc acagcgggcg gtggcgatcg agcaggctgg tatgcagcct tgaggtcagc 4127461 tcgagcagat cgcggactcg tcctggcgag ggcagcgccg agcggcgaac gtggtaatcg 4127521 atgtcgatgt cgtcgtcata agcccatgcc acacgggcga ttccaccccc gatcgtcgca 4127581 gggtgctttc ggaacatggg ctggaattcg tcgttggcaa ccaaacgctc ggtgaactca 4127641 cggacgaact caggaccagc tccctgcggt ggctcgaaca acgacaagcc acccacatgc 4127701 atggggtgtt cacgagattc aatgaaaaga aacatcgagt cgttgggcat catcagatcc 4127761 atgcacccat tacacccatt accgagtgat ccgggaaggc ttctgtggtg cccgaggttc 4127821 ggcaagtcgc aagaacatcg ccgcccagct gacttcggga tgacaacgca tgtagtccgg 4127881 agcggcttga ggttgcaacg tcgggtgggc gaagtagtcc ggctgagagg tattggtggc 4127941 agcatgggtt tgtgacctca atgtcgttgg cctgggatgt ggtgtcggtc gacaagccgg 4128001 acgatgtcaa cgtcgtgatc ggccaggcgc acttcatcaa agcggtcgaa gacctgcacg 4128061 aggccatggt cggcgtgagc ccatcgctac ggttcgggct cgccttttgc gaggcttccg 4128121 ggccccggtt ggttcgacat accggcaacg atggcgattt ggtcgaactc gcgacccgca 4128181 ctgcgctggc catcgcggcc gggcatagct tcgtgatctt cttacgtgag gggtttccca 4128241 tcaacatcct caacccggtg caggcggtgc ccgaggtctg cacgatctac tgcgccacag 4128301 ccaatccggt cgacgttgtc gtcgcggtga ccccgcatgg tcgcggcatc gtgggtgttg 4128361 tcgacgggca gacccctctg ggagtggaga ccgatcgcga cattgcgcag cggcgtgacc 4128421 tgttgcgcgc catcggttac aagctctgat acgggccgcc ggtccgccct tgacagcggg 4128481 acgtccgccg cagagggtcg acggcatgtc cgtggtgcgc gggaccgctc tggctaacta 4128541 cccgagcctg gttgccgggt tgggcggtga cccggccact ctgctacggg ccgcgggtgt 4128601 tcgggatcag gatgtcggca actatgacgc gttcatttcg atccgggcag cgattcgggc 4128661 aatcgaatcg gccgcagcgg tcaccgccac aatggatttc gggagacgat tggcacagcg 4128721 gcaagggatt gagatcctgg gaccggtcgg tgtggcggcc cgcacggccg ccacggtcgg 4128781 tgacgctctg gcgatcttca acaccttcat ggcggcctac agcccagtta tcgccatccg 4128841 gatcacgccg ctggccggac agcggtcatt tattgcactc gagttcctgc tcgacgagcc 4128901 ggcgtcgtat ccgcagacca tggagctggc gctcggggtg gcgctcgggg tgatccggtt 4128961 gttgttgggc gctgactacg ccccactggc cgtgcactta ccccacgacc cactcacacc 4129021 cgaagccttc tacctgcagt acttcggctg ccggccttac ttcgccgaac gtgttggtgg 4129081 tttcaccatg cgcaccgcgg acctgagccg tcccctcaac cgcgacgatg tcgcccaccg 4129141 ggtggtcgtc gactacctga gcagcatcac gccgctgggc gaggggatcg tggaatcggt 4129201 gcgcaccatc gtgcgccagc tgctgcccac cggagcggcg acgctcaacg tggtcgccga 4129261 gcagttccac ctgcacccga aaacgctgca acgtcgactt gcggaggaga acaccacatt 4129321 cgttattctg gtcgatcggg tccgcaagga tgtcgccgat cgctacctaa ggaccaccgg 4129381 gatcggcctt acccatttgg cacgtgaact gggctacgcc gaacaaagcg tgttgacccg 4129441 ctcgtgcaaa cgctggttcg gaaccggacc ggccgcctac cgcaaccagg ccaggttaca 4129501 gacaaccgtg agcgcacctg gcagcgggcg tggtccgaat ccaggtaacg tctcagtatc 4129561 ctgctgaccg atggatcaag atcgatcgga caacacggca ttgcgccgtg gtctgcgaat 4129621 tgccctgcgc gggcgccgcg atccgctgcc cgtggcgggc cggcggagcc ggacctccgg 4129681 cggaatcggt gacctgcaca cccggaaggt gcttgacctg accatccggc tcgccgaggt 4129741 gatgttgtcg tccggctctg gcaccgcgga tgtcgtcgcc acagcccagg acgtggctca 4129801 ggcctaccag ctcaccgatt gcgttgtcga catcaccgtt accaccatca tcgtgtccgc 4129861 gctagcgacc acagacactc cgccggtcac catcatgcgg tcggtccgga cccggtccac 4129921 tgactacagc cggctggccg aactcgatcg actcgttcag cggataacct ccggtggcgt 4129981 cgcagtcgac caggctcacg aggctatgga cgagttgacc gaacggcccc acccctaccc 4130041 gcgctggctc gcgaccgcgg gggcggcggg cttcgcactc ggcgtcgcca tgttgctcgg 4130101 cggaacctgg ctgacctgcg tcttggctgc cgtgacgtct ggcgtgatcg accgactggg 4130161 ccggctgctg aaccggatcg ggaccccgtt gttcttccag cgcgtgttcg gcgcggggat 4130221 cgcgaccctg gtcgcggtgg cggcttacct gatcgccggc caggatccga ccgcgctggt 4130281 ggccaccgga atcgttgtgc tgctgtctgg gatgaccttg gtgggttcga tgcaggacgc 4130341 ggtcaccggg tacatgctca ccgcactcgc ccggcttggc gacgccctgt tcctgaccgc 4130401 agggatcgtc gtcggcatcc tcatctcgtt gcggggcgtc accaatgccg gcatccagat 4130461 cgaactgcat gtcgacgcaa ccacgacgct cgccaccccg ggcatgccgc taccgattct 4130521 cgtcgcggta agcggtgcgg cgctgtccgg cgtgtgcctg acgatcgcga gctatgcgcc 4130581 gctacgttct gtggccaccg ccggactctc ggccggactc gccgaactgg tgctcatcgg 4130641 actcggcgcg gccgggttcg gccgagtggt cgccacctgg accgccgcga tcggcgtcgg 4130701 cttcttggcc accctgatct caatccgtcg gcaggctccc gccttggtga cggccaccgc 4130761 cggcatcatg ccgatgctgc cgggccttgc ggtcttccgt gccgtgttcg cgttcgccgt 4130821 caatgacaca cccgacggcg gtctgaccca gctgctggaa gcggccgcga ctgcactcgc 4130881 gcttggcagc ggggtggtgt tgggcgagtt cctcgcctca ccattgaggt acggcgccgg 4130941 ccggatcggc gacctctttc ggatcgaggg tccacccggg ctccggcggg cggtcggccg 4131001 tgtggtgcgc ctacagccgg ccaagagcca gcagccgacc ggcaccggtg gccaacggtg 4131061 gcgaagcgtc gcgctggagc cgacgacggc cgacgacgtg gacgccggct atcgcggcga 4131121 ttggcccgct acctgcacca gcgcgaccga ggtgcgctag ccagcctcgc cagcgccgac 4131181 caactgctcc cagctagcgg gcaccatcgg caccgacgga ctacccccga actcaccacc 4131241 ggccaacgtg gtcaaccccg caggacgccc aacggactcc ttgccagcgg tcccggcaaa 4131301 ccccaacacg ccggcacccc gatccgaagc cagcaccgac gccgacgtgc ccgcgggagc 4131361 tggcgccacc gccgacacca gcctggcctc cggtcgcacc gcggcagact cacgcaacgg 4131421 cggcgccgct gcaggtcgta cggcaacggg cgccacgtcc gccgttatac gtccctttac 4131481 accaggcaga taaactgggc tggctttggt caaacccagc gcgacccgca gcacttcctc 4131541 atagcccgac cgcgcgctcc agttccttca acgaggtctc cagatggctg agtacccgct 4131601 gcacgtgtgg aacgctgcgg cagcaaccca cgactccgaa gtcgagacta tcggcggtgc 4131661 tggtcagggt gatgttgagc gcttgtccgt cgagcaccaa cgacattgga tagttgccga 4131721 ccatcctggc gccgttgaag tacagcggtt cgcgcgcacc gggcacgttc gagatgcaca 4131781 cattaaacgg cggtggcgtt gccttggcca agcccggcag ggtgttcagc gcagctgggc 4131841 tcaacagcag cagtgacacc gccaacgcct gggcgcgggg cagctgcgat agtacgttct 4131901 tattaccgcg catcgaagcg tggatggcgt tcagccggtc ggctggatca tcaaggtggg 4131961 tggccagatt acacaacacc gccccgacca tgttgccgcc gaccgagtcg cggtcggtgc 4132021 gcaggctcac cggaaccatc gcaaccagcg gcgtgtccgg cagcgcgtcg ttgtcgtcca 4132081 gatattcgcg aagtgcgccg gcgcacatcg ccagcaccac gtcgttgagg ctgaccccgg 4132141 ccgcgtcttt caccgccttg acccggtcca acggccagga ctgcgcggcg cagcgccgcg 4132201 ctcccccgac ggcgacattg agcatggtgt gcggggcccc gaagggcagt gtcaactgtt 4132261 gttcgatcaa cgcggaacgc gccagtcgca acgttgaggg agcgagcccg gcaaccgatc 4132321 ccagcatgcc ccccagctgt tgcaggcggc cgcgccgtcg cttgatggcg gtgtgctgcg 4132381 tcgccggtga ccaggcggtg cgcaacttgc cctcgatggg gtcggtggtc atcggctggc 4132441 gcatcagcgt aagtccggac accccgtcga ccagggcgtg gtgcatcttc gaatagatcg 4132501 caaagcgtcc atcccggagg ccctcgatca cgtgtgtttc ccagagcggg cggtgccggt 4132561 cgagcagatt ggagtgtaac cgtgacgtca gttccagcag ctcacgcacc cggcccggcg 4132621 ccggcagggc agaccgccgc gcgtggtagc cgaggtcgac gtcagcgtcg gtcgaccagc 4132681 cgaggttgat gagtgcaccg tgaagcgacg tggggcgctt gcgaaatagc ggtgctatct 4132741 cgcggcactg aagcatcgcc tgataggttt cccgcacaaa cccacgtccc gcccccgcgg 4132801 gtggctcgaa cagttgcagc gcgccgacat gcagcggatg ctctcgcgac tcggctgata 4132861 agaacagcgc atcgatcggt gacatcagtt ccatggcgtg ctcctggtga tgcgcttcac 4132921 cgtcagccgg ctcgccgaag ccgacgtcgt aaagcgcagg tgatcgtcgt cgaccgggcc 4132981 ctcgcgcaac accttgaggt ccgccagggg gcttcggcgc cctgcagcgg ccgggtcggc 4133041 atggctgcgg gcagctgcgg acagcagatg gtgtgcccgg gtgcgtccat gtgggccggc 4133101 gaagttggac acgtcgctga gcaggaaacc cttgaatgcc accttctcgg caacgtccac 4133161 ggcgactccg tcgaccgaga ggttgatgcc accgaaggcc agcagattga ggccggtggc 4133221 agtgatgctg atgtcggccg ccaattcgcg tccggattgc agccggattc cgttttcggt 4133281 aaaagtatcg atcgcctcgg tgaccaccga ggcccggccg tcgcggatgg ccttgaacat 4133341 gtcggcatct ggcaccgcgc acaggcgttg gtcccatggg ttgtagaccg gcttgaagtg 4133401 ctcgtcggcc ggatatccgg cggccagctg cttggcgttg agatgacgga tcagtcgccg 4133461 ggcggctctc ggataccgtt ggcataaccg ccacaccaac cgttgcttgg cgatgtcttt 4133521 gcgccgggtg acggcgtagg cccgatcgcg gcctatcatt tgggcatggt taccgcgccg 4133581 gcggtctggg ccatggccgg caccagcgtg accgcggtcg cgccgctgcc gatgatcacc 4133641 atccgcagct catggtgacg tgttcgccgg tgtcgaagcg ttcgatctcc accagccagc 4133701 gagcgtcctc ggtggaccat gatggcgtcc gcgctggcgg tcgccttctc gtgctgccac 4133761 ggcttgaact catagctgaa cgtgtgcagg tcggagtcgg atcgaattgc tggataccgg 4133821 gcttcaacga tcgcgaatgt cttggccggc tgcattgtct ttaggtagta ggcggcgcca 4133881 gtgccggaga tgccggcgcc aacgatcagc acgtcgacgt gttcgatgct ggctgactgc 4133941 tcggagtgca cggcgtactt cctgttcggg cgaaggctga cccgcgactt cgttgtcaac 4134001 cgggggtggt gtgcgtcacc gaactcactg tgcaccagca ctcggccttg agtcttgaca 4134061 ctagaagaca acaatttgac ttttcaagac acagcgtcac ctgtgcgcgg tgccagcggc 4134121 gcggcgccag gccgtgtggc gcagtaggcg cagcccattg agtccgacga tgatggtgga 4134181 accttcgtgt cgggcgacgc ccagtggcaa tggcaacgtg aaggccaggt cccacacaac 4134241 gagcccggcg atgaatgtca cggccacgat gaggttggcg accacgatgc ggcgggctcg 4134301 ccgcgacatg gcgataacgg tgggaatggt ggtcaggtca tcgcggacga cgacggcgtc 4134361 ggcggtctgc agggtgagtt ccgatcgggc gctgcccatg gcgatgccga catgcgcggc 4134421 cgctaaggcc ggagcgtcgt tgataccgtc accgaccacg gtcaatctgg cacctccagc 4134481 ttgcagctgc cgcacggctg cgaccttgtc gtcgggcagt agcccggccc gtacgtcgtc 4134541 gatgccaacc tgtacaccga gccgatcggc ggtggcccgg ttgtcgccgg taagcaatac 4134601 cggtttggcc ccggtcagtt tggtcgcagc ggaaatcgcc gcggcggctt cggggcgaag 4134661 ctgatcggtg atggcgagta gcccgacggg atggctatcg cataccacga cgacgacggt 4134721 gtagccctcg ccttgcagaa agtcgaccgc cgtgatcatg gaagcttcga gcgcggcggc 4134781 gccggcagtg cccagcagtg ccgtcgccga tccgaccgca atgacgtggc catcgacgcg 4134841 ggcggtgaca cggcaacctg ggtgtgcggt gaactcgccg acggtcggca gccggatgcg 4134901 gcgagactgg gcggctttca cgatggccgc acccagtggg tgctcactgg gatactccgc 4134961 tgcagccgca agccgcagca gttcatcatc ggtgaatcgt cgttcgtaca cccagatgcc 4135021 ggcgagttcg ggggtaccgc gggtaagggt gccggtcttg tcgaacgcga tccgtgtggt 4135081 ggttccaagt tgttccatca cgatcgcgga cttggcgagc accccgtggc ggccggcgtt 4135141 ggcgattgcg gccaatagtg gcggcatggt ggccagcacg accgcacacg gcgacgcgac 4135201 gatcatgaac gtcatggctc gcagcaacgc ccgctgcagg gtctcccccc atagcggggg 4135261 caccgcgaat acggcgaggg tcacggcgac catgccgatc gagtagcgtt gttcgacttt 4135321 ctcgatgaac agctgggtgc gcgccttggt ctggctggcc tgttcaacca gggtggcaat 4135381 gcgagcgacg acggaatccc gcgcgagccg gtcgacccgg atccgcaggg cgccggtgcc 4135441 gttgacagtg ccggcgaaca cctgatcgcc gattgacttg tcgacgggca gcggctctcc 4135501 ggtgacggtg gcctgatcga cttcgctgcc gccggcaagc acggttgcgt ccgccgagat 4135561 gcgctcaccg ggccgtacca gcacgatgtc cccaatcctt aggtcggcgg cgttgaccgt 4135621 ttcctcacca ccgccggcgc ccacgcgggt cgcggtgccc ggcgcgaggc ccattagccc 4135681 acgcaccgag tccgcggtgc gggccgttac cagtgcttcc agagcaccgg aggttgcgaa 4135741 gatgacaatg agcagagcgc cctcggcgat ctgcccgatg gcggccgcgc cgatcgccgc 4135801 gaccaccatc agcagatcga catctagggt ccttcgctgt agcgcctgta gcccggccag 4135861 ccctggctcc caaccgccgg tcgcgtaaca cgccagaaac agcgcccacc gcacccattg 4135921 cggtgctccg cacagctgtg tcagtagtcc cgctgaaaac aggcccaacg ccagcgcggc 4135981 ccaacgcatc tccgacaacg cgaacagctt ggttcggcgc gctaggacca acggcgacgc 4136041 tgaggtgcac cgggcgggag agagttcacg aacagccacc cggccaacat atcagaatat 4136101 atgatcatat gttcatttat ttctttgggg ataggctgcc taaccatggg gcacggggtc 4136161 gaaggcagga atcgtccgtc agcgccgttg gattcccagg ccgccgcgca ggtcgcgtcc 4136221 acactgcagg cgttggcgac tccgagccgg ctgatgatcc tcacccagct acggaacggc 4136281 ccgcttccgg taaccgacct cgccgaggct attggaatgg aacagtccgc cgtctcgcat 4136341 caacttcgag tgttgcggaa tctcggcttg gtcgtgggcg accgggcagg ccgtagcatc 4136401 gtctacagcc tctacgacac gcatgtggcg cagcttcttg acgaagctat ttaccacagc 4136461 gagcacttgc accttggtct ctccgaccgg caccccagcg cgggctaagc ggtcaggctc 4136521 ataagctcgc gggtcacttt cacccatgac cggcgagctt tacagacccc agcgcctcaa 4136581 ggggcaccac ctcaagggcg cagccaccgt ggcgggcgcg caatcgacag gtcgttgccg 4136641 accgagcgct ggtgtgccag gaattcggtg gtcatgacgg cgcagatggt gtgccaaccg 4136701 aggtcctcgg gtccggtcgc acagcagccg tcacgataga agccggtaag cggatcggtg 4136761 ccaccctgtt ccagggcgcc gcccagcaca ttgcaatcgg acatggacct aagtgtctaa 4136821 gctgcgccag ccacgccgtc ggacctatca gctaattcgg cgcgcgtcgc ggcgcactat 4136881 tcccgcgcga gggtctggcc gggtcgcgga attgcttcga gcaagcaggc ggccgccctg 4136941 acgtcggcgt ccgaatacat ccgggcgatc gcggtaaaca cctcgcccgc ctttctcagc 4137001 tcttcctgcg ccgcttgatt cagcgccagc agcccggtgg ccgccgtcgt gaacgccgtt 4137061 accgcccacg ccgacacctc ctcggccccg gcgggcaata gcgagctcag cgagacccac 4137121 gccaccgcac cggcctgtag cccttggaat gcgttgttga cgacctgcga tccgatgtcg 4137181 gcaacggccg gatcgaatga catggactgc atgtgtctct ccctagattg cgcgggctcg 4137241 ggccccaacg acgagatcta agcgaggaat tcagttgtcg gtagcgatag tagtaatagg 4137301 atatagtccg cgctgacgaa atagaagacg agatatgccg tcgcactgaa taatttgtca 4137361 ccaagggcgc tgccgccccg tgctacccct gggcatgttg tccacctgcg gcgcggtagg 4137421 ttcagcggcg tgatacttac gggtgcgttc ttggccgatg ccgccgcagc ggtggacaac 4137481 aaactcaatg tgcaaggcgg cgtgctgtcc agatttgcgg tcggtcctga ccggctggcc 4137541 cgatttgtgt tggtggtgtt gacgcaggcg gagcctgaca gttcggaccg cgacattacg 4137601 gtcgagatga ggccgccgac cgatgacgaa ccgatacgcc tgaatttcga ggcgcccgaa 4137661 gcggccgttg ccgagttccc cggattcgca ttcttcgaaa tccaactgcg cctgccggtt 4137721 aacggccgtt gggtgctggt ggtgactggc ggcaccggag cgatatcgct tccggtgctg 4137781 gtgagcgaca tgcctgcgac gataggtttt tgacgcgccg gtcttgagcg acgacccccg 4137841 gggctttgca gaaaggttgt cccgtgcacc agcagcatcc ctacaacgca gctgggttcg 4137901 gctgaccgtg ctgacaccca acccagcggt aggttcggca gcgtgatagt cggggccttc 4137961 ctcgccgaag cggcctcggt ggtggacaac aagctcaatg tctccggcgg cgtgctgtac 4138021 cgatttgcgg tggatccgga ccggtcggcc cagtttctgc tggtggtgtt gacccaggcc 4138081 gagaccgatg atccggatcg gcgggtcgac gtagaggttt ggcctccgac gggcgacgac 4138141 gcgcaccaca tcgagttcga gctacccgag gccgccgtcg ccgccgaggt cggattcgcc 4138201 atcttccgga tcgaggtaaa cctgcccgtc gacggccgtt gggtgctggt ggtaaccggc 4138261 gacgccggaa cgatctcgct gccgctgatc gtgacggggt gaggcgtagg cccctgccga 4138321 cggagctgcc agccctattg atcgaatggg agcaggacgc cgaggccgaa tggcgatccg 4138381 gacgggaaca gacgccgtgc cttagcggcg aactgtggga cctgctcgcc cagcgcatct 4138441 agcaggctgt gcggcgtgaa cggcgggtcg atgtaggcgt cgaccatccc gaggatcact 4138501 gctttcgttg cctcttcgta caaatccaac tgatcgagga gaaagtcgtc gggatgcaaa 4138561 gctttgatct gatagggctt tagcgcgtca tcagggaagt gcttgaggtt tgtcgtgact 4138621 atcacctccg cgcgctctcg gaccgctgca gctagcacat gtcgatcttt gtaatggttg 4138681 ttcatggcgg cgatgaggtc gttgtacccg aaagcgaatg cggtagtcag cccgttcggt 4138741 gctgatgttg aggcggtcga ccatggttcg ccgagtctcg gccaggatgt cctccgacca 4138801 cagaggccga taggtgccct cgtcagcgaa ccgcaacagg gcatcaacca gcgggtgtgg 4138861 cacgagcacg cacgcgtcca gtactacggg gaacggcatg ctcggcctcc tctacttctt 4138921 ttctgcaagc gccgcctgga gctcaccaag ggcgtcgcgg ctcaactcgc ctagtgctgc 4138981 acggcgattc gaccgggttt cttgctgata ttcgagcagc gcgtcaaggc tcactcggcg 4139041 gtggcggccc ggcttctcaa atgggattcg accatcctcc aagagccgaa cgagggtcgg 4139101 gcgtgagatg ttcaataggt cggcggcttc ttgggtggtt agtttgaggt ggcgtggcac 4139161 caatgaaatg cctttgcctt gcgacaaggc cagcacgacg ttgtacagcg catctctgac 4139221 tggttcagga agcgtcatcg gttgtccggc gttgccacac acggaaactt caggcgcgcc 4139281 aagcacctcc agcaaggagg tcatgtcctg cgggtcgcgg ggtggaagta ctgtccgttc 4139341 ctggactgcg gctgtcatgc agcttagcgt aattcgaaca aaacgaaacg tcgagtctct 4139401 gaccaggcat ttacgcaagc tactgcgccg ctaaccgcgc cgggtcgcgc acttggccgc 4139461 ctcaaacgcc gcctagcacg gtgacgtcga gcccggcgga gcgcaccagc tgatcagagc 4139521 tggaaaccgg cgcgcgtctg ccgcggccga agcctacacg cgggcggatc tgcggcgcgg 4139581 tgaagcgcgc aaaggtccag cagatcactc cgcacgattt gcggcacacc gcggccagct 4139641 tggcggtgtc ggccggcgtc aacgttttgg cgctgcaacg gattctcggg cacaagtccg 4139701 cgaaggtcac cctggacacg tatgcggatc tcttcgatgc cgatcttgat gcagtcgccg 4139761 tcactctcgg gaaagatgcc gaccagcaaa cctgaaaata ccctgctgaa ctgcactaac 4139821 agtcaaaggg atttggcggt ggcggaggga tttgaaccct cggacggtgt tagccgtcac 4139881 acgctttcga ggcgtgctcc ttaggccgct cggacacgcc accgcggtga agcttaccga 4139941 atcggcgcac cctcacccca atcgctggcg ggcgaagaag gcctccagcg gcgcggcgca 4140001 ctcccgcgcg agcacaccgc cgcgtacctc cgggcggtga ttgagccgac gatcacggac 4140061 cacgtcccac aacgagccga ccgccccggt cttgggctcc caggcaccga agaccagccg 4140121 cgcgacgcgg gccagcacca gggcaccggc acacatagtg cacggttcga cggtgaccgc 4140181 caaggtggtc ccctccagcc gccacccgtc gccgagcaca ccggccgcca accgcatcgc 4140241 caggatttcc gcgtgcgcgg tgggatcgcc gagcgcctcg cgggcattca ccgcccgggc 4140301 gagttcggtt ccgtcggcgc cgacgaccac cgcgcccacc ggcacgtcgc gcggacccgc 4140361 cgtcgccgcg accgccaacg ccgcacggat cagatcttcg tcagtggtca ccgcccgcgc 4140421 ttgcggtcac cgacctaggc ggtcgatcac cgccgacagc tggtcagcga agcccatttc 4140481 gcgggcgatg cggcccagct gttcgtcggc gtaaaggtcg gtctcgtcga ggatgactcc 4140541 cagaaccgcc tcgggcaggc cgatgtcgga cagcaggccc aggtcgcctt cctcgaacgg 4140601 atcggcatcc tcgaggtctt cgggatcgat ctcggcgtcc agattgtcca ggacctccgc 4140661 ggcgatgtcg tagtccagcg cggcggtggc gtcggacagc aacagccgag ttcccgaggg 4140721 cgccgggcgc acaatgacga aaaattcgtc gtcgacgtcg agtagcccga agacggctcc 4140781 cgcgctacgc agctcacgca gttccgtctc ggcagcccgc agactggtca acgctttggg 4140841 gcccatcgga gagcagcgcc agcggccctc ttcacgcaca accgcaacac cgaaaccgtc 4140901 cggtgtgtcc gcggccggtc tttgcgtgga ggcccgttgt gctcccatgg gcgcctacgg 4140961 tagtcgctga ccaggcctcc tgaccagatg gtgctcagac agcggagatc tggtcgcccc 4141021 tcagggcgcc gccacgggct acctatgcca accttggact gtgactcgga ctgtcgcggc 4141081 gccaccggtg tgcgtgcttg ggctgggact catcggcggt tccatcatgc gggccgccgc 4141141 agcggcgggc cgtgaagtct ttggctacaa ccggtcggtg gagggtgccc acggcgcccg 4141201 ctccgacggg tttgatgcca taaccgatct caaccaaacg ctaacccggg ccgccgctac 4141261 cgaggcgttg atcgtgctgg ccgttccgat gccggccttg ccaggcatgc tcgcccatat 4141321 tcgcaaatcg gcacctggct gtccgttgac cgacgtcacc agcgtcaaat gcgcggttct 4141381 cgacgaggtc acggcggctg gtctgcaggc gcgctacgtc ggcggtcacc cgatgacggg 4141441 caccgcgcac tcgggttgga ccgccggtca cggcggcttg ttcaacagag ccccctgggt 4141501 ggtcagcgtc gatgaccatg tcgaccccac ggtgtggtcg atggtgatga cgctggcgct 4141561 ggactgcggg gcgatggtgg tgcccgccaa atccgacgag cacgacgccg ccgctgctgc 4141621 cgtctcgcac ctgccacacc tgctcgctga ggcgctcgcc gtcactgcgg ccgaggtacc 4141681 acttgccttc gcgttggctg cagggtcttt ccgcgatgcc acccgggtgg cagccaccgc 4141741 tcctgaccta gtgcgggcaa tgtgtgaagc taacaccggc caactggcgc cggccgcgga 4141801 ccggatcatc gacctgctga gccgtgcgcg tgattcgctg caatcccacg gttcgatagc 4141861 cgacctcgcc gacgcgggcc acgccgcacg cacacgctat gacagcttcc cgcgctccga 4141921 catcgtcacc gtcgttattg gcgcggacaa atggcgcgag caactggccg ccgcggggcg 4141981 ggcgggcggg gtgattacat ccgctctgcc aagcctggat agtccacaat gaacccgtcg 4142041 gagtcgacgg tcacggtggt gtcagccacc ggtgagcgca gcttgatccc atccagacgt 4142101 ccttcgctgg tatagctcac ggtggccgca tcgacgctca tctcgggcac gtttacatag 4142161 accaccggca gcgcgatcga ttccgctcgt tcgtgcagcc caaggcgacg aatcggcaac 4142221 gcattgaaga atggactgaa caccaaatcg atgtccaatg caccgttgta tgctgcgcgc 4142281 cgttcaccct ggtggtcagt caccaaccac atgttctcct cgtcgcgggc gatggcgagc 4142341 tggcgttccc gctcggctag tgtgaccgtc agcccgaacc gtttggtggc accggtttcg 4142401 tcggtctgca gatcgtagtg cgcgccaaac gccggattat tcgcggtagc cgcggccaca 4142461 atgcggccgt tcgccctaat ccgcttgccg gacaactgga ctcgtaccga ttccatgcgc 4142521 gagatgtcct gcgcacgcca ggtcaacatg gccggccaga cgcgcggagt cagatcagag 4142581 gggactgcgt tcacactgtc taccgtaggg cgtgtccacc gcctgcggca ggtttgtcga 4142641 caaccgcggc gagcttgcgc atcctcccgg tgcccggcac cgatacccac cccgccaacg 4142701 ccagcagtcc gtcgagggtc aacgccaacg cggccaccat catcgcaccg accagagcga 4142761 tgtggaatcg acgctccttg atcccgtcga tcaagtagcc acccagcccc ccgagactgg 4142821 cgtaggcggc caccgtcgcg gtggcgacca cttgcagcgt cgcgctgcgt agtccgccga 4142881 gcatcagcgg tagtgcattg ggtacctcga cgcgcagcag cacctgggac tcggtcatgc 4142941 ccatcgcccg ggcggcatcg accaccagcg gatcaacact ggcaatgccg gcgtacgtgc 4143001 tggccagcaa agacgggata cccaacagca tcagcgccac cagcggcggc cccaatccca 4143061 gcccgaatag cagcacccct agcagcagca cacccaacgt gggcaaagcg cgcaaaccat 4143121 tgaccgcacc caccaccagc agcgtcccgc gaccggtgtg cccgataagc agcccgactg 4143181 gcacggcgat cagtgctgaa gcggccaccg ccaccgcggt gtattccagg tgctcacacg 4143241 tgcggactgc caagccgact ggaccggtcc agttactggc ggttagcagg taggacagcg 4143301 cctgctgcag gaaattcatc gcgctccgcc cgtgatcggg gccgcgacct ggcggcgccg 4143361 acgggctgcc cgcggcgccc gttcccatgg cgtggccagc cgaccggcga ggttgatcac 4143421 cacgtcgacg acaatcgcca gcaggaacat cgctacgacg ccggcaacga tctggtcact 4143481 cttgttggtc tgataccccg cggtgaacca ggttcccagg cccccgattc ctatcaccga 4143541 acccacggac accatcgcga tgttggtaac cgcgaccacc cgcagcccgg ctaccagcac 4143601 ggggatagac agcggcagtt cgactttcaa catctgagcg atccgcgaat agccgatggc 4143661 ggtggccgcg tcatgcacct gcgccggcac cgcgtccagc gcttcgagca ccgcccgcac 4143721 cagcagggcc gtggtgtagg ccgccaacgc cacaatgaca ttggcctcgt cgaggatccg 4143781 ggttccgatg atcagcggca acaccacgaa tagcgctagc gacgggatgg tgaatataac 4143841 gctggcggtc gccgtcgtca gccggcgaag cagcggcgcg cgctgcacca gcaggcccaa 4143901 cggcaccgcg ctcatcagcc cgatcagcac cggcagcaac gagaggcgca gatggacgac 4143961 ggtcagcgcc caggccgctc ccgggtgggt catcaggtag tgcatggctt agctccgccg 4144021 ccggccttct tgcctttttg gaactcggcc agcacgtcgg cggccagtat cccgccgatg 4144081 accttgccac cgccgtcaac ggcgacaccg acccccgacg gcgaggacaa ggcggcgtcc 4144141 agcgcctggc tgaggttacc gttcgggcgg aacaccgaac cgccgacggt catggcatcc 4144201 gacaatgccg cgccgccgcg gtgacgccgc cggccatcgg cgtcgatcca gcccaacggc 4144261 gcacccgcac cgtcgaccac cagcacccag ccgtcacgaa cttgcctgtc ccgggcatcg 4144321 gaaaggccgt tcaccgagac ttgctcgatg tcgcgcacag gtagtccggc cgcgtcgaac 4144381 agctgcagcc accgatagcc gcgaccgaga ccgatgaact tcgacacgaa gtcattcgcc 4144441 ggactggata acagccgggc agtttcgtcg tactgcgcaa gcgcgccgcc cggggcgaac 4144501 accgccacca gatcggcgag cttcaacgcc tcgtcgatgt cgtgcgtcac gaagacaatg 4144561 gtcttgtgca actcggcttg cagacgaagt atttcgttct gtagctcgtg gcgaaccacc 4144621 gggtcgacgg ccgagaacgg ctcgtccatc aacaagatcg gcggatcggc cgcgagtgcc 4144681 cgtgccacgc cgacccgttg ctgttcgccg cccgagagct gggccgggta gcgggtggcg 4144741 accttggggt ccagcccgac acgctcaagc acctcataac cggctttgcg ggctgcccgg 4144801 cgcggctgac ccttcagcac cggcaccgtt gcgacgttgt cgatgacccg ttgatgaggc 4144861 atcagccccg cgttctggat gacatagcca attcccaggc gcagcttcac cgcattgacc 4144921 gtcgacacgt cggtaccgtc gacagtgatg gtgcccgagg tcggatccac cattcggttg 4144981 atcattcgca gcgccgtcgt cttgccgcag ccggaggggc cgacgaagac ggtcagcatg 4145041 ccgttaggga cttccagcgt cagccggtct acggcggtgg caccgtgtgc gtacaccttg 4145101 ctgacatcgt caaagcagat caacgtggtg cctactgccg cactgggtga tcgaaaccgt 4145161 tgtcccgcac ccatttccgc gcggcctggt cggggtccac cccggagttg ccggacaccg 4145221 ctgcattgag ctcggccagg ccggcagtgg tcagctttgc cgacaccgcg tccagcacat 4145281 ctttgaggtg atccgacttc tttcgcgaat tcacaagcgg cacaatgttt ccggctagga 4145341 agttatgttc gggatcttcc agcaccacca ggtggttttg cgggatagcc gcagaggtgc 4145401 tgaagaggtt ggcggctgtg gccgttccct ccaccagtgc tcgcacggtc accgcaccgc 4145461 cgccgtcgtt gatggtcacg aagttgcccg gcgcgatgtc gagtgagtat ttgtgccgca 4145521 gcccgggcaa cccggacggc cgggtctgaa agaccgacgg cgccgcgaac ttcacatccg 4145581 cggaatgcgg ggccaggtcg gcgatcgttt tcaggttcca ccgggcggcg gtagcggcgg 4145641 tgacggtgac ggtgtcagtg tcagaggccg gcgacggcgt caggatcgac agatcgccgg 4145701 gaagtcgctt gtagagctcc aactcaacgg catcgagcat ggtcaccgtg gcgtcgggtt 4145761 gaaagtacag cagcaagttg ccgatatact ccggcaccag gtcgatggaa tgatctttga 4145821 gcgccgggat atacgtctct cgactgccaa ttcccaaccg ccgccccacg tcgaaaccgt 4145881 tggcctgcaa cacttgtgcg tagatttcgg cgatcacctg cgattccgga aaatcaccgg 4145941 acccgacgac gatggacttc acactgccgg tcgctgaccc gagcggatca gcattggcgc 4146001 aggacgcaac caggcacacc gtcgcgagcc acacagccgc agcgacagtt gcgcgacgta 4146061 ggcgtcgcag catcctcatg cagttgacac tatcgtcagc ggcggcgccg tgcttccaca 4146121 actcggcatg tactgggatt tttccggcgt ggtttggttt cattctgtgt gggataggac 4146181 aaaaatggtg tcatgaccag caatccctct tcctcggctg atcaaccact cagcggtaca 4146241 acggtgcctg gctcggtgcc cggtaaggca ccggaagagc cacccgtcaa gttcacccgc 4146301 gccgccgccg tatggtcggc gctgatcgtc ggctttctga tcctcatcct gttgctgata 4146361 ttcatcgccc agaacaccgc ctcggcccaa tttgcgttct tcggctggcg ctggagcctg 4146421 ccactagggg tggctatctt gctggcggcc gtgggcggcg ggctgatcac cgtcttcgcc 4146481 ggcaccgcgc ggatccttca gttgcgacgt gcggccaaaa agacccacgc ggccgccctt 4146541 cgctaactgg gcatccccga cgcgggatta cccgctcttc ttggcaatct ctgccagacc 4146601 gcgagcgatc agcggcgcaa caacgtcagg caccgactca gccgcggtgt ccttcccctc 4146661 ggcctgctcg gacatgcgtc ggcggtagtc gatgccggcg gcgatgatgg cgagcttgaa 4146721 ataggccaag gccatgtaga actcccagtg gcctagcggc tgcccggaga cgagtgaata 4146781 ccgatcggcc agctcgtcgg ctgctggcag cagcggcgaa gtccacgctg cctgcgcatg 4146841 cacaattaag tccagcgcgg ggtcgcggta tacgcacatc agggccgcgt cggacagcgg 4146901 atcccccagg gtggagagct cccagtccac caccgcgcga acatggcatg ggtcatcggt 4146961 gtccaagatc gtgttgtcga tccggtagtc gccgtgcacg atcgatgtgc ggctctgttg 4147021 tggaatggct tgctgcaggg ctaaatgcag tcgcgaaatg tcggcgtcgc ggtggtcgtc 4147081 gggcagccgc accagctccc attgtgaccc ccaccggcgc acctgccgtt ccagatagcc 4147141 gtcgggtttg ccgaaatcgc tcagtccgac ggccttcggg tcgatgctat gcaagtcgac 4147201 gagtacccgg atcaaggcgt cgacacagcc ctcgatgacc gaacggctgc cgagcgcttc 4147261 gagttcggcg cgccggcgca ccacttgccc ggcaacgaat tcgacaacct ggaacggcgc 4147321 gcccagcacc gagtcgtcct ggcacagcga gatcgtgcgc gccaccggaa ccggtgtgtc 4147381 tcccagcgcg gcgaccaccc tgtactcgcg ggccatgtcg tgcgccgacg gtgtcagccc 4147441 gtgcaggggc ggacggcgca ccaaccagct cgacgcgtca tcatagaccc ggaaggtcag 4147501 attggagcgt ccaccggaga tcagctcgcc acgcaactcg ccgtcgcgcc cgatccccag 4147561 cgaacgcaga taccggtcca gcgcgcccag atcgagcccg tcgagtcggt caaccgaagt 4147621 caccgaactt gtttaccact cgcgcaatgc ccggctttag ctcaggccgc cttcgactcg 4147681 gcgccgagcg gtaccgccga actacggcgt cacgatgttg aaggccgaat cgggccggtc 4147741 gaggacgctc aagaatgtct gcagcaccgt ccggtcgccg aacacctcga aaccgggtga 4147801 gctgatatcg cccagcgccg cggcgaccaa ccgaaccttg tcgcccaccg tcaccgtcgc 4147861 gttcgccgtc gccggatcgg cgggaagctt gcgatgtatc aacacgccgt tgcgcagcgt 4147921 gagccgatag ttgacatccg gctcggtgaa ggtgaaatcg atggccaggt cgaggtccca 4147981 tgcgcgtggg ccattgatgc tgatcgccag gacgtcaaag atttggtccg gcgtcagctg 4148041 ggcgaaaaac gtgggcgccg ggacttgccc ggagctgccc gggttcccgt cgcgcagctc 4148101 ggcggccccg gtcagaaaga aattgcgcca ggtcgcacac tccgcgccgt aggccagctg 4148161 ctccagggtg tcggcataga gcccgcgggc cgcagcgtgc tcgctgtcgg cgaacaccgc 4148221 atggtcgaga agcgttgccg cccaacggaa atcacctgcg tcgaaggctt cgcgggccag 4148281 ctccagcact cggtcgatgc cacccaacgc gtcgacataa cgcggcgcca gcgcctcggg 4148341 cggatgcggc cacaaccagc ccgggttacc gtcaaaccag cccatgtaac gctgatagat 4148401 cgccttcacg ttatggctga ccgacccgta gtagccgtgg gtgtgccatg cccgctgcag 4148461 cgccggtggc agctggaaca tctcggcgat ctccacaccg gtgtagccct ggttcagcag 4148521 ccgcagcgtc tgatcgtgca gatatgaatg catgtcgcgc tgttgcgaca agaactcgac 4148581 gatcttctcg cgtccccacg tcggccagtg gtgcgaggcg aacaccacgt cggttcggtc 4148641 ggcaaaggtg tcaatcgcct cggtgagata gcccgaccag gcgcgcggat cgcgcaccaa 4148701 ggcgccgcgc agggtcagca ggttgtgcag gttatgcgtg gcgttttcgg ccatgcacaa 4148761 cgcgcggaag cgcgggaaat agaagtgcat ctccgcaggg gcctcggtgc ccggggccat 4148821 ctggaactcg atctccaccc cgtcgatggt gtgggtctcc ccggtctcgg tgatgtcgac 4148881 cgtcggcacg acgagcgaaa cctcaccggt cgacagtgtc tgcccgaggc cgcagccgac 4148941 gtgcccccgg agaccgcgcg ccaacacggt gccgtacatg tagcccgcac ggcgcatcat 4149001 cgccgagccg gcgtagatgt tttcctgcac ggcgtgcgcg gtgaacccct ccggcgccag 4149061 caccgccacc tttcccgcgt ccacgtcggc ctgggtggtg ccgccgagca ccccaccgaa 4149121 atgatcgaca tggctgtggg tgtagatgac cgcgaccacg gggcggtcgg ctccgcggtg 4149181 ggcgcgatac aagtccagcg cggcggcggc cacctcggtg gacaccaacg ggtcgatgac 4149241 gatcagccca gtgtcaccct caacgaagct gatattggag atatcgaatc cgcggacctg 4149301 atagatgccc ggcaccacct ggtagaggcc ctgtttcgcg gtcagctggg attgccgcca 4149361 caggctggga tgcaccgatg tcggcgcggc accgtcgaga aacgagtacg cgtcgttgtc 4149421 ccacaccacg cgaccatcgg cagccttgat cacacacggg gacagcgcgg caatgaatcc 4149481 gcgatcggcg tcgtcgaaat ccgttgtgtc atgcaacggt aacgagtgtt caccgtgtgc 4149541 cgcctggatg acggcagtgg gaggtttgtg ttccatcggc actacattgc cactactacg 4149601 gtgcacgccg gtagatgccg ttggcgaacc acgctaccga ccagaaagag agaattttcc 4149661 gccgcaccta gacctcgggc cctgctaacg cgcatactgc cgaagcggtc ctcaatgccg 4149721 atggaccgct acgacaggca aaggagcaca gggtgaagcg tggactgacg gtcgcggtag 4149781 ccggagccgc cattctggtc gcaggtcttt ccggatgttc aagcaacaag tcgactacag 4149841 gaagcggtga gaccacgacc gcggcaggca cgacggcaag ccccggcgcc gcctccgggc 4149901 cgaaggtcgt catcgacggt aaggaccaga acgtcaccgg ctccgtggtg tgcacaaccg 4149961 cggccggcaa tgtcaacatc gcgatcggcg gggcggcgac cggcattgcc gccgtgctca 4150021 ccgacggcaa ccctccggag gtgaagtccg ttgggctcgg taacgtcaac ggcgtcacgc 4150081 tgggatacac gtcgggcacc ggacagggta acgcctcggc aaccaaggac ggcagccact 4150141 acaagatcac tgggaccgct accggggtcg acatggccaa cccgatgtca ccggtgaaca 4150201 agtcgttcga aatcgaggtg acctgttcct aacctaaagc gtgtcgatgc gggctgtgaa 4150261 cagcgcgtcg gagccgggca gtcaggccta gcgcggcgac gattcgagcg gttgccatcc 4150321 gtcaagtggc aaccgcaccg caaactcggt atatccgggt gagctactca cggtgatcgt 4150381 tccgttgtgc gccttgacca cagcggagac gatcgccagg ccgagcccgg tgctaccggc 4150441 ttggcgggac cgtgacgtat cgccgcgggc gaaccgctcg aaaacctcgg actgcagcgc 4150501 ggccggaata cccggcccat tgtcgatcac ctgcagcacg acgtgcgtcg gcccggtgct 4150561 caagcgcgtc gtcacgatcg tgccgggacc ggtgtgcacg cgggcgttgg ccagcaggtt 4150621 ggtcaccacc tggtgcaacc gtgccgcatc acccgggatg accaccggtt cggggggcag 4150681 gtcgagcgcc cactggtgat ctggtccggc aacatgagcg tcgctgaccg cgtcaaccgc 4150741 aagccgcgac atgtccaccg gtccgcgttc cagcggccgc cccgagtcca gacgcgccag 4150801 cagcagcagg tcctcgacga gacgtgttat ccgctcggtc tccgatgcca cccggctcat 4150861 cgcgtgtgcg acggcctcgg gatcgtcccc tatccgctgc gtcaattccg tgtaaccacg 4150921 gatcgccgca aggggagttc gcagttcatg actggcatcg gcaacgaact ggcgcacacg 4150981 ggtttcactg gcctgccgcg ccgacagtgc ggcagcgatg tggtcgagca tccggttgag 4151041 cgccgacccg agttgcccca cctcggtgga ggggtttgcg tcaggttcgg gcacccggac 4151101 cggtagcttg acctcgccgc gatccaacgg taggtcgacg acttcgctcg cggtttgcgc 4151161 gacgcgccgc aacggcgcca gcgcccgctt gatgatgacg attccggcgg tcgtcgcggc 4151221 gaccaacgca atcaccgtga cgattccgaa aatgatcagc atctgcaaca tcgtggcgtc 4151281 gacgttgccc atcgacaggc cggtgacgat gacgtcgtgc ccgtttcggc tcggagcggc 4151341 cagcacacgg taccggccca gaccgtcgag atccagggtc agcggtgtgc ggctgccggc 4151401 gatccgttcc agctgggacc ggccggttga cgtcaacgcc gcccgcgaac cactgccggt 4151461 cagatatccg gcggcgaccg tcgtgccgtc gctgaccacc gccgccacca tcccggccgg 4151521 ctggcccgga gcatcgagaa acctcggacc ggggcccgac cggatgtagt tgtgcgtctc 4151581 gcgccgccag ggcggacggg gcattttctc cggatacatc aacaccgagc ggtacgacgt 4151641 tccgccgagt tggttgtcaa gttgtgccac cagatgacga cgcagcgcca tttcggttgc 4151701 cgcggtgatt cccacacaca ccacggcgag gacgacaacc tgtccgacca ggagccgcag 4151761 ccgaagcgac caaattcgcg gactgctagc gggccggctt gagcacatag ccggcgccgc 4151821 gcagcgtgtg aatcatgggt tcgcgaccgt tgtcgatctt tttgcgcagg tacgagatgt 4151881 acagctccac gatattggac cggccgccga agtcgtaact ccagacgcgg tccagaatct 4151941 gggctttgct cagcacccgc ttggagttgt gcatcatgaa ccgcagcagc tcgaactcgg 4152001 tggacgtcaa cgacaccggt tcgccggcgc gcatcacctc gtggctgtct tcgtccagca 4152061 ccaagtctcc gaccactagc tgggcaccgc tgtcgactgt cgtcaccccc gtgcgacgca 4152121 gtaacgcccg cagccgaagc acgacctcct cgatgctaaa cggcttggtg acgtagtcgt 4152181 cgccccccgc ggtcaaccca gctatacgat cttccaccgc gtccttggcc gtcagcagta 4152241 gaaccggcag gcctggattc tcgctgcgca acttgtgcag cacgtcaaga ccgctcatgt 4152301 caggcaacat cacgtcgagc acaaccacat cgggccgctg gcggcgggcc gccgcaatcg 4152361 ccgacgatcc gtcaccggcg gtggtgatgt tccaaccttc ataccgcaat gccatggaca 4152421 ccatctcggc cagaacgggt tcgtcgtcga ccaccagcac agtgaccggt tggccatcgg 4152481 cgcgccgcat tacgacacgc tcaaccgaga tgcggtgctg cgtcacagcg tcaagtatcc 4152541 gcacacggct gagcagacgc catgcggatc ctatgtgcgc gctatgaaac ccgatttggg 4152601 gcacgttcgg agcctgccag cgggccggat ccgggcggta ccccactcac gtcggcgcgc 4152661 atgttggtac cagtagcggc tgctggcgac cgggctgctg aagcaaatcc cgctgccacg 4152721 cttgaggcag cgtcccggac caacgccaat tggtcgctct ccgtcgccgt tgtggaagtc 4152781 gccgacccgg acagttcgat cagacatagc caaggatcgg tagcatgacg atacgcattc 4152841 cgatagcggg gaattgaggt gccgtgacag acactttgtc cgcagatgtc tccgaatatc 4152901 aagtgcccgt gaataactcg tatccctacc gagtgctgtc gatccgcgtc tgcgacggca 4152961 cctatcggga tcgtaatttc gcgcacaact accgatggat gcgctcggca ttcgacagcg 4153021 ggcgactcac attcggaatc gtctacacct acgcccgtcc gaattggtgg gccaatgcca 4153081 acaccgtgcg ctcgatgatc gacgcagcgg gcggcttgca tccccgggtc gcgctgatgc 4153141 tggatgtcga atcaggcggg aacccgcccg gtgacgggtc gagctggatc aaccggctgt 4153201 actggaacct ggcagactac gccggctcgc ccgtgcgaat catcggttat gccaacgcct 4153261 acgacttctt caacatgtgg cgtgttcgcc cggcgggcct gcgcgtcatt ggcgcgggtt 4153321 atggttccaa tccgaacctt cccggacaag tggcgcacca gtacaccgac ggcagtgggt 4153381 atagccccaa tcttccacag ggcgctccac cgttcggtcg atgcgatatg aactctgcca 4153441 acggactaac accgcaacag tttgccgccg catgcggcgt cacaacgacc ggaggaccgc 4153501 tgatggcact caccgacgaa gaacaaaccg aactactgac caaagtccgc gagatatggg 4153561 accaactgcg cgggcccaac ggcgccgggt ggcctcagct cggacagaac gaacagggcc 4153621 aggacctcac tccggttgac gcgatagcgg tgatcaagaa cgacgtggcg gccatgctcg 4153681 cggaatagcc cgcgatctcc gtcagctcgt ggcccgctgc gcggatacga aaaggtttgg 4153741 cgggattgag tcttcgccac tgtgagggat gctgcggcca taccgagcca gcagctcggg 4153801 caacgttgcc gtcgacacgt cccagccacg ttcacgcagc cactcggcga ccgcggtgcg 4153861 ctgctctgca taccagaggt catcgacatc tgatatctca gtttcgacca gcttggctgc 4153921 cgcggcccgc atccgccgca tgtccgcacg ctggcgtcgc attcgctcag ggtcgagaaa 4153981 accggcgccg gggacgttgg acgccaacca actgcccggc ctgctgagcg catcgatacg 4154041 ctcgaacaac agatcctgag cccgcgccgg caggtaccgc accaaccctt cggctaacca 4154101 cgcacacggc ttcgatgggt caaatccggc tttctgcagt gcctttggcc agtcctgacg 4154161 aaggtctatg ggaacgttca ccagctgcga agccggctgc gcgccatgct ggcgcaacgt 4154221 ggctgatttg aattccagca ccttgggctg gtccagctcg tacaccacgg tgccgtccgg 4154281 ccagggcagc cgccaggcac gcgagtccag gcccgaggcg aggatcacta cttgcctcac 4154341 cccagcgtcg gcggtagcca ggaaatactc gtcgaaaaac gcggtccggg cggccatgaa 4154401 atcgatcatc tgctgtatcg gcgcccgcag gtccgggtcg aggtcggtcg caccggccag 4154461 caacgtgcga ttcgtgtaca tgctccatat cccgtcgccg gccgcgtcca caaagatccg 4154521 cgcgaacgga tcgttgatca atgggttgtc gctctcggtc tcggccgcac gcgccgccgc 4154581 cacacccagt gcggtggcgc ccacgctctc ggtaatggcc caggaatcgt tgtcggtccg 4154641 cggcacagtt aatcctcccc caggccggaa acgtcagttt tgcaaactat tcttccagcc 4154701 gccgaggggc ccgcgcgctc gtcaagagtg tcctacgctt tctcccagat ggtctacagg 4154761 ttgcagagga gcgcgatggg gtccacgccg ccacgtacgc cgcaggaggt attcgcccac 4154821 cacggccagg cgctcgccgc gggcgacctc gatgagatcg tcgccgacta cgccgacgac 4154881 tcctttgtca tcactccggc cggtatcgcg cgcggcaagg aaggtattcg ccaactgttc 4154941 gtcaagttgc tcgacgacat accaaacgca ctgtgggact taaagaccca aatcttcgag 4155001 ggcgacatac tgttcctgga gtggaccgcg aattccgcgg tcagccgagt cgacgacgga 4155061 gtcgatactt tcgtattccg agacggcacg atctgggcgc ataccgtccg gtacaccccg 4155121 caccccaaga cctgacgttt cgagcaggtg gcggatgtgg acctcgaggc ggtcgcctat 4155181 taccgatcag accgaggcac tgttgtctga cgcgggcgga tacccccagg gggcgcgttc 4155241 ctcgccgcgc acgaagtcgg taggttgcag ccgcactttg cggaggaacc gcctgctgat 4155301 ctgccggata ggatgagccc gtgacgacgc tgaaggagct tggagcacgg gtcgccgctc 4155361 tggaagcgaa ccaggccgac tatcgagccg tcctcgcggc cgtcaacccg ccgggcgcca 4155421 accagcgaga aatcgcgacg accgtccggg aacacaccgg acgactggac cgcgtgacga 4155481 ccaaagtcgg ccagctcgcg gccaagtccg acgacaccaa tgcgcgggtg cggtctctgg 4155541 aagagggaca ggccgagatc aaggaccttc tgctccgcgc cctcgacaag tgattctccg 4155601 aatggctgcg cgattttttg agcccggcat cgaacggtga tctgtggtcg gtgaatccgc 4155661 gacacgccgt ggtttcgggt cgtgccggat ggcgtcaaat ggccagatca gaacaccttt 4155721 cgagaccacg attttcgaga ccacgatcag gtgctgttgc aggctctcct aaagccgtag 4155781 ggcgtgtttg aaccgcacca tgatggggtg cgcggacatc ggttggcgat acgggctcga 4155841 ggttgcagat cctgtccgcg ctcgtggccg gcacccgagc gaccctgtcg aagaccgcgc 4155901 cttgattacc tggcggtgag cgcgagccgc ctgaccaggg ccgcgaagca cagcggcgcc 4155961 agcagccagg tcagctgcat ggccgctgtc accgggtgac cacgaaacca gaacatcacc 4156021 gggttgaacc acaggtcgtc gacgtaggac caggtcccgg cgaggatcgc cgacaactcc 4156081 acggccaggg ccgtgagctg tccccagagc acctggacgg tcaggccgag ggcgcccgcg 4156141 ggctcgcccg tgagcgctcg cgccagcagc cagcccatgg tcaggagggc accatcccac 4156201 acggagtggg cgagcaggaa caccacggtg gggagcggga gcggcgtggc ccactcgatg 4156261 atcggggtgt tggtccaagc gctgagcccg aacaccggca gctcccagac cagaccgatg 4156321 agcgtgccga gcaacagcat ccgcgcgagc tcgggtcgag tccttcgagc gcgcagcatg 4156381 agcaccacga ccgcgagcgc gacgagcagg tcggctacgt agtagccatg ggcgagggga 4156441 tcgttatcca taagcgtgtt ctgttgtatg ccactaagca tcgtatttgc ctccgcgaac 4156501 cttggtgagc aacagtgacg aacagtgacg gcgagccgcc agttgacccg cacgtgggca 4156561 caacggcgag cttcccgcac cgatggctac gaaccccggc cacgcaacgc tatgcggtcg 4156621 ccagccagct gggcgcgcag gatccgttgg atcgccccag cggtacggtc cgggttctcg 4156681 gggcgcagcg ccaccgccaa ctccagcacc tcgaccgggg tgcgcagcgt gcactggcac 4156741 gggtttgggc accaaggcgt cgaacccacc ggcgcgccag tccctaattc ctaatccagc 4156801 ggtcgatggt atgccggctg atccgaacct tccgcccgaa cgggtcggtg tgctcacggg 4156861 aggccagctc gcgcaccatc tttccccgct ccttggtgga atgcgctgca tcggcggcct 4156921 cccggatcaa ctgataccga aacaatccga tcgccctcgc gcgctccgcg cgcacctcgc 4156981 cttatcatcg ccgaccgcca ccggccgctc ctttccgttt ggtgtcccgt gaacacacga 4157041 cagcgcacag gattacggcc caatcggcgg ttagggcagg gtcgactcgt gttgcaccca 4157101 ctcgccgggt caccccggcg ccaccagccg cccacccgat accgccaccg ccgtctcggc 4157161 cagcgacacc gtggacagcg cgaactggca ctcgatcacg gtcacgaccg cggcgatcac 4157221 cgtcaccgca taggcgaaca ccccgacggc cgcatccggc atcaccggat ccggatccac 4157281 cgcgcgcaac atgacggtga acaccgaccg aaccgcctcg gcacgctcgg caaagcgacg 4157341 cagccaaccc cgcaccgtct cggccgggcg agccaaatcc gcggcgatgc ggcggaaccc 4157401 gacctggctc aaggccttct ccgccggcgc gggcacagct ccaccacgta tgtgcgctcg 4157461 cagcccgaca tgctggccga cggcgcgcag agttgggcac gagttgtgac aatccgtgac 4157521 agctttcccg gcgcctgata ccaacggaac gcgtttgcgc tagtaaagag cgcgcccgaa 4157581 gagattcgaa ctcccaacct tctgatccgt agtcagatgc tctatccgtt gagctacggg 4157641 cgcttgtctt cagttgtgtc ccctaaagga ctgcggaggc gagaggattt gaacctccgg 4157701 tccccttgaa gggggacaac tcattagcag tgagccccat tcggccgctc tggcacgcct 4157761 ccatggactt cccgagagta cccggactcc ccgagccgcc ggaggcctag cgtacacagc 4157821 cgccacatat gctgtcgacg tgaccgcccg cctgcgaccc gagctggctg ggctgccggt 4157881 ttatgtgccc ggcaaaacgg tgccgggcgc catcaagctg gccagcaacg aaaccgtgtt 4157941 cggcccgctg cccagcgtcc gtgccgccat cgaccgggct accgacacgg tcaaccgcta 4158001 ccccgacaac ggctgcgtgc agctcaaggc cgcgctggcc cggcatcttg gcccggactt 4158061 cgctcccgag cacgtcgccg tcggttgcgg ctcggtcagc ctctgccagc aactcgttca 4158121 ggtcaccgcc tcggttggtg acgaagtggt cttcggctgg cgcagctttg agctctatcc 4158181 accacaggtc cgggtcgccg gcgctatccc catccaggtg ccgttgaccg accacacgtt 4158241 cgacctctac gccatgctcg ccgcggtcac cgaccgcacc cggctgatct tcgtgtgcaa 4158301 ccccaacaat ccgacctcca ccgtcgtcgg tccggacgcg ctggcccgct tcgtcgaggc 4158361 ggttccggcg cacatcctga tcgccatcga cgaggcgtat gtggagtaca tccgggacgg 4158421 catgcggccc gacagcttag gcctggttcg cgcacacaac aatgtcgttg tgctgcgtac 4158481 gttttcgaaa gcgtacggcc tggcggggtt gcggatcggc tacgcgatcg gccaccccga 4158541 cgtcataacc gcgctggaca aggtctacgt gccatttacc gtgtcgagta tcgggcaggc 4158601 cgcggccatc gcgtccctgg acgccgccga cgagctgctg gcccgtaccg acaccgtggt 4158661 tgccgagcgc gcccgcgtca gcgccgagtt gcgtgctgcc gggttcacgc tgccgccatc 4158721 gcaggccaac tttgtctggc ttccgctggg atcccgcacc caagacttcg tggagcaggc 4158781 cgccgatgca cgcatcgtgg tccgcccgta cggcacggat ggcgttcggg tcaccgtcgc 4158841 cgcaccagag gagaacgacg cgttcctgcg gttcgcccgc cgctggcgga gcgaccaatg 4158901 agcgtggccc gtaagaaaat tcgacgccca cgctcgagcg tcacggctat ctggccgggt 4158961 tgcggccggt gaacgcgatc agccgctcca gcgccccgcc atcttccggc acgtcgaccg 4159021 gttcattgaa accggccaca ctacgttcct ccggcttgat gagctttcgt gccagctcta 4159081 ggacgtattc ggccaacgaa tcggcagcct tcagctcact cccgacggcg accgcgtaat 4159141 cccaggcgtg caccagaaat tcgaccgaga agaccgagac ggcaaccttg gccgacatcg 4159201 agccgggacc cagcgatacg tctccttcca gaccgtgacg gtgccaggcg tccagggccg 4159261 aacgggcggc gccgctcacc aggcgctcca cagagtcaat gtccgcacgc agtgagaatt 4159321 ccgcgccgac catgccgccg aggaccatga ttgagttgag caaatgctcg gttagttttt 4159381 ttcacgtcgt accccgggca cggtgtctgc ttggccttgt cctggcggcc gatggtgtgc 4159441 agcacttgct gcagcacctg cagcgcggct tccgcgcacg ccagctcgtc ggtcggtggg 4159501 gaatctggtc cgggtcgcga ttcaggcggc atactggcca cgctacggtc tgggcatggg 4159561 cgaaacctac gaatccgtca ccgtcgaaac caaggaccag gtcgcgcagg tgacgctgat 4159621 cgggccgggc aagggcaacg cgatggggcc cgcattctgg tcggagatgc ccgaggtgtt 4159681 ccatgccctg gacgccgacc gtgaggtgcg ggccatcgtc atcaccggat cgggcaagaa 4159741 cttcagctac ggcctggacg taccggccat gggcggaatg ttcgccccgt tgatcgccga 4159801 cggcgcgctg gcccgcccac gcacggactt ccacaccgaa atactgcgca tgcagaaggc 4159861 gatcaacgcc gtcgccgact gccgcacccc cacgatcgcg gccgtccagg gttggtgcat 4159921 cggcggcgcc gtcgacctga tctccgcggt cgacatccgg tatgccagcg ccgacgcgaa 4159981 gttctcggtg cgcgaggtca agctagcgat tgttgccgac atgggcagcc tggcgcgcct 4160041 tccactaatc ctgagcgacg gccatctacg agaactcgcg ctgaccggca aaaatatcga 4160101 cgcggcccgc gccgagaaga tcggcctggt caacgacgtc tacgatgacg ccgaccagac 4160161 gctggccgcg gcccacgcga ctgccgccga gatcgccgcc aacccacctt tggcggtcta 4160221 cggcatcaag gacgttctcg accaacaacg cacgtccgcc gtctcggaga acctgcgcta 4160281 tgtcgccgcc tggaacgccg cgtttctgcc gtccaaggac ctcaccgaag gtatttccgc 4160341 gacgttcgcc aagcgcccgc cccagttcac cggcgagtag acccggcgac catgcgcgct 4160401 ggcgacggca agatccgtgt cccggccgac ctagacgccg tcacggcaac cggcgaagag 4160461 gaccactccg aaatcgacgg tgcggccgtc gaccggatct ggcgggccgc acgccattgg 4160521 tatcgggccg gtatgcatcc cgcgatccag ttgtgcattc ggcaccatgg gcgggtcgtg 4160581 ctcaaccgcg cgatcgggca cggctggggc aacgccccca ccgatgaggc cgatgccgag 4160641 aagatcccgg tgacgactga caccccgttc tgcgtgtact cggcggccaa ggcgatcacg 4160701 gcgaccgttg tacacatgct cgtcgagcgc ggacacttcg cgctcgacga ccgcgtctgc 4160761 gagtacctgc cctcctacac cagtcatggc aagcaccgca ccacgatccg gcacgtgctg 4160821 acccacagcg caggcgtccc gtttcccacc gggccccgac ccgacgtcag acgcgcggac 4160881 gaccatgaat acgcggtgga aaggctcggc gaactacggc cgctatatcg gcccggactg 4160941 gtacacatct accacgcgct gacctggggt ccgttgatgc gtgagatcgt ctacgcggcc 4161001 accggcaagg aaatccgcga gatcctggcc accgagatcc tcgacccgct gggctttcgg 4161061 tggaccaact tcggcgtcgc cgagcgcgat gtgccgctgg tcgcgcccag tcacgccacc 4161121 gggcggcagc tgccgccggt gatcgccgcg gtgttccgca aggcgatcgg cggaaccgtg 4161181 cacgagatca tcccctatac gaacaccccg ttcttcctca gcaccatcct cccgtcgtcc 4161241 aacactgtgt caacggccaa cgagctgtcc cgctttatgg aaatcctgcg ccgcggtggc 4161301 gaactcgacg gtgttcgtgt actgagtccc gagacgctgc gcggcgcggt gacggaatgc 4161361 cggcgcttgc gaccggactt cgccaccggg ctgatgccgc ttcgctgggg caccgggttc 4161421 atgctggggt ccgccaagta cgggccgttc gggcgcaacg cgccggcggc attcggccat 4161481 ctcggtctgg tcaacattgc ggtttgggcc gaccccgaac gagctctgtc gggcggtttg 4161541 atcagtagcg gcaaacccgg tagggacccc gaggctgggc gctacggcgc cctgctgaac 4161601 gccattaccg ccgaaatacc acgggcatcg tcgggctgat ctgcccacga gcacgccacg 4161661 ccgccctaac cgagccggac ggctttgtcg tgccggtcac atgtcggcct gttgccttat 4161721 gtcaagatgc gccgccgtac gcgcgcatta tcaacgagtc aacgtggtcg gtgcagacct 4161781 gctatactcg aacgtatgtt cgagatatcg ttgtcggacc cggtggagct gcgcgatgcc 4161841 gacgatgccg cgctgcttgc cgcaatcgag gactgcgcgc gtgccgaggt ggccgccggc 4161901 gcccgccgcc tgtcagcgat cgccgaactc accagccggc gcaccggcaa tgaccagcgg 4161961 gccgactggg cgtgcgacgg ctgggactgc gcggccgccg aggtggccgc cgcactgacc 4162021 gtaagccacc gtaaggcctc cgggcagatg catctgagcc tcaccctaaa ccgactgccc 4162081 caggtggcgg cgttgttttt ggccgggcag ctcagcgcgc ggctggtgtc gatcatcgcc 4162141 tggcgcacct acctggttcg cgaccccgaa gcgctgagtc tgctcgatgc cgccctggcc 4162201 aaacacgcca cagcgtgggg tccgctgtcg gcccccaaac tggaaaaggc tatcgactcc 4162261 tggattgatc ggtacgatcc cgccgcactg cgacgcaccc gtatctcggc ccgcagccgc 4162321 gacctgtgca tcggtgatcc cgacgaagat gccggcaccg ccgcactatg gggccggttg 4162381 tttgccaccg acgccgccat gctggataag cgcctcaccc agctggccca cggcgtctgc 4162441 gacgacgatc cccgaaccat cgcccagcgg cgcgccgatg cgctgggcgc gctggccgcc 4162501 ggcgctgatc ggcttacctg cggctgcggt aattccgact gcccatccag tgctggcaac 4162561 caccggcagg caaccggtgt ggtcatccac gtcgtcgccg acgcggcagc actaggcgct 4162621 gcacctgacc cacgcctatc cggcccggaa cccgcgttgg cacccgaagc acccgccacc 4162681 ccggcggtca agccgccggc cgcgctgatc agcggcgggg gtgtggtgcc cgcgccactg 4162741 ctggccgagc tgatccgcgg tggggccgcc ctcagccgcg tgcgccatcc cggcgatctg 4162801 cgatcggagc cgcactaccg gccgtcggcc aagctggccg aattcgtccg gatccgagac 4162861 atgacctgcc gattccccgg ctgcgaccag cccaccgaat tctgcgacat cgaccacaca 4162921 ctgccctacc cactcgggcc cacccacccg tccaacctga aatgcctctg ccgcaaacac 4162981 caccttctca agaccttctg gaccggctgg cgtgatgtgc aactgcccga cggcaccatc 4163041 atctggaccg cgcccaacgg ccacacctac accactcatc ccgacagccg aatcttctta 4163101 cctagctggc acaccaccac cgccgcacta cccccagcac catccccgcc agccattggt 4163161 cccactcaca ccctgctgat gccacgacgg cgccggaccc gagcggccga gctggcccac 4163221 cgcattaaac gcgaacgcgc ccacgtcacc caacgcaaca agccaccccc aagcggcggg 4163281 gatacagcgg tggcggaggg atttgaaccc ccggacggtg ttagccgtct ctcgctttca 4163341 aggcgagtgc attaggccgc tctgccacgc caccgctgat aagggtaacg agccggtagc 4163401 gtgaccatca tgcgtgccgt cgtcgccgaa tcctcagatc gactggtatg gcaggaagtc 4163461 cccgacgtgt cggctgggcc gggcgaagtg ctcatcaagg ttgccgcttc cggtgtcaac 4163521 cgcgccgacg tgctacaggc cgccggcaaa tatccgccgc ccccgggagt aagcgacatc 4163581 atcggcctgg aggtgagcgg catcgtcgct gcggtcggtc ccggggttac cgaatggtct 4163641 gccggacaag aggtttgcgc cttgcttgcc ggcggcggct atgccgaata cgttgccgtt 4163701 ccggccgacc aggtgctgcc gattccgccg agcgtcaacc tggtcgactc agccgccctg 4163761 cccgaagtgg cgtgcacggt gtggtcgaac ctggtgatga ccgctcatct gcggccgggt 4163821 cagctggtgc tgattcacgg cggggccagc ggcatcggca gccacgcgat ccaggtggcc 4163881 cgcgccctgg cagcacgggt ggcgatcacc gccggctcac cggagaaact ggagctctgt 4163941 cgcgacctgg gcgcccaaat caccatcaac taccgcgacg aggatttcgt cgcgcggctg 4164001 aagcaagaga ccgatggtag cggcgctgac atcatcctcg acatcatggg agcgtcctac 4164061 ctggaccgca atatcgacgc gctggccacc gacggccagc tgatagtcat tggcatgcag 4164121 ggcggggtga aggccgagct caacctgggc aagctgctca ccaagcgggc gcgcgtcatc 4164181 ggtaccacgc tgcgggcccg gccggtcagc ggcccgcacg gcaaggcggc catcgcccag 4164241 gcggtggcgg cctcggtctg gccgatgatc gccgcgaacc gggtccggcc cgtcatcggc 4164301 acccggctgc ccatccaaca ggcggcacaa gcgcatgaac tgatgttgtc gggcaagacg 4164361 ttcggaaaga ttctgctgac ggtataggcg aacctcgcgg ccggatcaac ctagcgacgc 4164421 cagcgcgcgc accagctggt cgacttcggc catcgtcgag taatgcgcca gcccgacggt 4164481 gaccgcgccg ccgacgtcgt tgacgcccag cacgtcgagc acgcgtgagc cggtgttggc 4164541 gatcgcgaga attccgttgt ccgccagccg ctgcaccacg cggtcagccg gcaccttgtg 4164601 gaccgcgaag ctgaccaccg gtatctgtgc ttccgggcga ccgatcagca tcaccaatgg 4164661 cagcgagcgc aacgacacca tcagatagtc gaagacccgg ttcaggtacg cgtcagcaga 4164721 ttgcatcgac accgctagtc gttcgcgtct gctgccgcga gccgactcgt cgagcgccgc 4164781 caggtactca atgctggcga ccacaccagc cagcagacca aactggtgca cgccgatctc 4164841 caggcgcgcc ggcccggtgg catacggatt ggtcgaaacc gatccgaagg aattcatcac 4164901 tgacgggtca cggaaaacca tcgccccaat cggcggacca ccccaggcat gcgcattcac 4164961 cgtcaccacg tcggcgtcgg tttctctgat atcgagcaac cgatacggcg cggccgcgga 4165021 atggtcgacc accaccagtg cccccacgtc gtgcaccagt ttggtcatcg cccgcagatc 4165081 ggtgaccccg cccagcgttc cggatgcgga gttgacggcg accagcctgg ttgacttgct 4165141 gatcaggctc tcccactgcc acgtcggcag ctcgccggtc tcgatgtcga cctcggccca 4165201 cttaaccttg gcgccgtagc ggtgcgccgc ccgcagccac ggagcgatgt tggcctcgtc 4165261 gtcaagacga ctgacgatca cttcgtatcc cagcccggcg cgtgaggacg acgcttcggc 4165321 cagcaacgac agcagcaccg cccggtcggc gcccagcacc acgccgcccg ggtcagcgtt 4165381 gaccagatcg gccaccgctt cacgggcggc gtcgagtacc gccgcgctac gccgcgccga 4165441 cgggtgagca cccactgtgc tagcgcccga ccggcggaag gccgtcgaca cggtggtcgc 4165501 gacggaatcg ggaatcagca ttccggccgg tgcatcgaag tgcacccatc cgtcacccag 4165561 cgatgggtgc aatccgcgca cccgggcgac gtcgtatgcc atgccagcca ccttagaact 4165621 cgggtgtcct agacgtccca gcccgcccgg gcttccctga gccatgtcac ccggccagcc 4165681 atactaatcg agtgggcctg tggttcggta cgctaatcgc tttgattttg ctgatagcgc 4165741 cgggggcaat ggttgctcgc atcgcccagc tgaggtggcc ggtcgccatc gcggttggcc 4165801 cggcgctgac atacggcgtg gtggcactcg cgatcatccc ctatggcgcg ctcggaattc 4165861 cctggaacgg ttggaccgcg ctggccgcct tggcggtgac gtgcgctgta gcgaccggtt 4165921 tgcagctact gcttgcccgt tttcgggacc tcgacgccga ggcacttgcg gttagccgct 4165981 ggcccgcggt tacggtcgcc gccggggtgc tgctgggcgc cctgttgatc ggatgggccg 4166041 catatcgcgg cataccgcac tggcagtcca tccccagcac ctgggacgcg gtctggcacg 4166101 ccaacaccgt acgtttcatc ctggacaccg gccaggcgtc ctcgactcac atgggggagc 4166161 ttcgcaacgt cgagacccat gccccgttgt actacccgtc ggtgttccac gggctggtcg 4166221 cggtgttctg ccagttaacc ggcgcggcac ccaccaccgg ctacacactg agttcgctgg 4166281 ccgcctcggt ctggctgttt ccggtcagtg cagccgttct cacctggcgc gcggtgcgct 4166341 cacacccggg cgcgctgtgg tcggcctcct gcgcctcggc agagtggcgc gccgccggag 4166401 cggcgggcac cgccgcggca ctctcggcgt cgttcaccgc ggtgccctac gtcgagttcg 4166461 ataccgccgc tatgcccaac ctggcggcct acggcatcgc ggtgccgacg atggtgctga 4166521 tcacctcgac attgcggcac cgcgaccgca tcccggtggc cgtgctagcg ctggtcggcg 4166581 tcttctcact gcacattacc ggcggtatcg tcgtagcgct gttggtgtcg gcctggtggc 4166641 ttttcgaggc actgcggcat cctgtgcgat caaggctggc cgacctgttg acgctggccg 4166701 gcgtggcagc gatggccggg ttggtcatgt tgccgcagtt cttgagcgtc aggcagcagg 4166761 aagacatcat cgccggacac gcttttccca cctatctcag caagaagcgt gggctgttcg 4166821 acgctgtttt ccagcactcc cgccatctca acgacttccc ggtccagtac gcgctcattg 4166881 tgttggccgc catcggcggg ctcattctgc tggtcaagaa gatctggtgg ccgctggcgg 4166941 tttggctgct gttgattgtg atgaacgtcg acgcgggaac accgttgggc ggacctatcg 4167001 gaggggtggc cggcgcactc ggcgagttct tctatcacga tccgcgccgc atcgcggcgg 4167061 ccacaaccct gctgttgatg ctgatggcag gtgtggcgct gttcgcgaca gtcatgttgc 4167121 tagtggccgc ggcgaaacga ctgaccgacc gtttcagacc ccagccggtg tctgtctggg 4167181 catcggcgac cgcgacacta ctgatcggag ccactctggt cagtgcgtgg cattactttc 4167241 cccggcaccg atttctgttc ggcgacaagt acgactcggt gatgatcgac cagaaagatc 4167301 tcgacgccat ggcatacctg gcgagtttgc ccggcgcacg cgacacgttg attggcaacg 4167361 ccaacacgga cggcaccgcg tggatgtatg ccgtggccgg cctacacccg ctgtggaccc 4167421 actacgacta cccgctgcaa cagggcccgg gctatcaccg gttcatcttc tgggcctatg 4167481 gccgcaacgg ggagagcgat cctcgggtac tcgaggccat ccaagtcctc cgtatccgct 4167541 atatcctgac cagcactccg acggtgcggg ggtttgccgt gccggacgga ctagtgtcgt 4167601 tagagacatc gaggtcgtgg gcgaagatct acgacaacgg cgaggcccga atctacgaat 4167661 ggcgcggcac tgccgcagca acacactcct agaaggtgcg taagaggatg gtgattggat 4167721 tgagtaccgg cagcgacgac gacgacgtcg aggtcatcgg cggcgtcgac ccgcggctga 4167781 tagcggtgca ggagaacgac tccgacgagt cgtcgctgac cgacctggtc gagcagcccg 4167841 ccaaggtgat gcgcatcggc accatgatca agcaactgct cgaggaggtt cgcgccgccc 4167901 cactcgacga agccagccgc aatcggctac gcgatatcca cgccaccagc atccgcgaac 4167961 tcgaagatgg tctggccccg gaactgcgcg aggagctcga ccggcttacc ctgccgttca 4168021 acgaggacgc cgtgccctcg gacgccgagt tgcgcattgc ccaggcacag ctggtcggct 4168081 ggctggaagg gctgttccac ggcatccaaa ccgcgctatt tgctcagcaa atggcggcgc 4168141 gcgcgcagct gcaacaaatg cgccagggtg cgctgccgcc cggggtcggc aagtcgggcc 4168201 agcacggcca cggcaccgga caatacctgt aagccgtgtc ggatccgcac catccccata 4168261 tccagacgca caacgcgtgg gtggagttcc ctatcttcga cgccaagtca cgttcgctga 4168321 agaaggcggt cctgggtaaa gcgggcggca ccatcgggcg caacaactcc aacgtcgtcg 4168381 tcatcgaagc gttgcgcgac atcaccatgg agctgaacct gggtgaccgg gtcggtctgg 4168441 tcggacacaa cggagccggc aaatcgacgc tgctacgcct gctttcgggc atctacgagc 4168501 ccacccgcgg ctgggcgaag gtcaccggaa gggtggcgcc ggtcttcgat ctgggcatcg 4168561 gcatggaccc cgagatctcc ggctacgaga acatcatcat tcgtgggctg tttctgggac 4168621 agacccgcaa acagatgcag gcgaaagtgg atgagatcgc cgaattcacc gaattgggcg 4168681 agtacctttc gatgccgctg cgcacctatt ccaccgggat gcgagtccgc ctggcgatgg 4168741 gcgtggtcac cagcatcgac ccagagatcc tgttgctcga cgaaggcatc ggcgccgtgg 4168801 acgccgactt cctgaggaag gcccagtccc ggctgcagaa tttggtcgaa cgttccggga 4168861 tcctggtttt cgcaagccat tccaacgagt ttttggctcg actatgcaag accgcgatat 4168921 ggattgacca tggcgtcatc aggctcgccg gtggtatcga agaggtggta cgggcctacg 4168981 agggtgagga cgccgcccgg cacgtgcgcg aagtactggc cgagacccag gccgacagac 4169041 agaacgtcca gggatgactg aatcggtctt cgccgttgtg gtaacccacc ggcgccccga 4169101 cgagctggcc aagtcgctgg atgtgctgac cgcccagacc cggttaccgg accacctgat 4169161 cgtggtcgat aacgacggtt gcggcgacag cccggtccgc gagcttgtcg cgggacaacc 4169221 gatcgccacc acgtatttgg ggtcacgccg aaacctgggc ggtgccggcg gtttcgcgct 4169281 gggcatgctg cacgcgctgg cacagggcgc cgattgggtg tggctggccg acgacgacgg 4169341 gcacgcgcaa gatgctaggg tactggcaac cctgctggcg tgcgccgaga agtacagcct 4169401 cgccgaggtg tcaccgatgg tgtgcaacat agacgacccg acgcggctgg cgtttccgtt 4169461 gcggcgtggc ctggtatggc gcaggcgcgc aagtgaattg cgcaccgagg cgggccaaga 4169521 gctgctgcct gggatcgcat cactgttcaa cggcgcactg tttcgggcat ccaccctagc 4169581 ggcgatcggc gtgcctgacc tgcggctgtt catccgcggc gacgaggtgg agatgcaccg 4169641 ccggctgatc cggtccggtc taccgttcgg aacctgtctg gacgcggcct acctgcaccc 4169701 ctgcggatca gacgaattca agccgatcct ttgtggccgc atgcacgccc aatatcccga 4169761 cgatcccggg aagcggtttt tcacctaccg caaccgtggc tatgtattgt cgcaacccgg 4169821 cctgcgcaaa ctattggccc aggaatggct gcggttcggc tggttcttcc tggtgacccg 4169881 ccgcgaccct aaaggcctgt gggagtggat tcggttgcgc cgcctgggcc gtcgggagaa 4169941 gtttggcaag cctggaggat ctgcatgaca ttcatggatg ctcaagctag cttccagaca 4170001 cagtcgcgga cactggcccg cgtccgaggc gatctggtcg acgggttccg ccgccacgag 4170061 ctgtggctgc acctgggctg gcaggacatc aagcagcggt accgccgctc ggtgctgggg 4170121 ccgttctgga tcaccatcgc caccggaacg accgccgtcg cgatgggcgg cctgtactcc 4170181 aagctgtttc ggctcgagct gtctgagcac ctgccctacg tcacgctcgg gctgatcgtc 4170241 tggaacctga tcaacgccgc catcctggac ggcgcagagg ttttcgtcgc caacgaaggt 4170301 ctgatcaaac agctgccggc accgttgagc gtgcacgtct atcggttggt gtggcggcag 4170361 atgatcttct tcgcccacaa catcgtcatc tacttcgtca tcgcgatcat ctttcctaag 4170421 ccgtggtcgt gggcggatct gtcgtttctt ccggcgctgg cgctcatttt cctcaattgc 4170481 gtttgggtgt cactgtgttt cggcatcctg gcgacccgct accgcgacat cggcccgctg 4170541 ctgttttccg ttgtgcagtt gttgttcttc atgacgccga tcatctggaa cgacgagacc 4170601 ctgcgtcggc agggcgcggg ccgctggtcg agcatcgtcg agctcaaccc gctgctgcac 4170661 tatctggaca tcgtgcgggc gccactgttg ggcgctcacc aggagctgcg gcactggctg 4170721 gtggtgctgg tgttgaccgt cgtcggctgg atgctggcgg cgttcgcgat gcggcagtat 4170781 cgcgcgcggg tgccctactg ggtgtaggga ctattccggc ggctatagcc gaccggcttc 4170841 tttcacgcgg cttgcgcgtg acgggccgcc gttgatctca agatcggctg gcaacggccg 4170901 cgtaccagcg gcagcatgga ttaggttcac cgtttgccga tgaggctcag agggcgggac 4170961 ggatggaaat acttgtcacc gggggcgcgg gcttccaggg aagccatctg accgagtcac 4171021 tgctggccaa tgggcattgg gtcactgtcc tcgacaagtc ttcgaggaat gcggttcgta 4171081 acatgcaggg atttcgttcg catgaccgcg ccgcgttcat atccggttcg gtaaccgacg 4171141 gccagacgat cgaccgcgcg gtgcgggacc atcacgtcgt atttcacctg gccgcgcatg 4171201 tcaacgtgga ccagtccttg ggcgacccgg agagctttct cgaaaccaat gtcatgggaa 4171261 cctaccgcgt cctggaagcc gtccggcgct acaggaaccg cttgatatac gtatcgacgt 4171321 gcgaagtcta cggcgacgga cacaatctca aggaaggcga acgacttgac gaacacgcgg 4171381 agctgaagcc gaacagtcca tatggcgctt ccaaggcggc ggccgaccgc ttgtgctact 4171441 cgtactttcg ctcctacgga ctcgacgtca cgatcgtccg tccgttcaac atcttcggcg 4171501 tccgccaaaa ggctgggcga ttcggcgcgc tgattccgcg gctggtccgc cagggcatca 4171561 acggtgaagg cctgacaatc ttcggcgcag gtagcgcaac ccgggattac ctgtatgtca 4171621 gtgacatcgt gggcgcgtac aacctggtat tacgaactcc aaccctgcgt ggtcaggcca 4171681 tcaattttgc cagcgggaaa gatacccggg tgagggacat cgtcgagtat gttgcggaca 4171741 agttcggtgc caggatcgag caccgcgacg ctcgccccgg agaggtccag cgctttcccg 4171801 ctgacatttc gcttgccaaa agcatcgggt tccagccgca agtcgaaatt tgggacggca 4171861 tcgatcgcta tatcaattgg gccaaggatc agccccaata cccatatgag caggacgggt 4171921 ttagcggttc cagcgttctc taatacaccc gtcgccgcca tcgtctgccg gtaaagtggg 4171981 ccgaaatggc gcggaactac cagctggaag gattacctcc cattcgatgg tgaccgtagc 4172041 acgccgaccg gtgtgcccgg tgacgctgac accgggtgac ccggcgctag cgtcggtgcg 4172101 cgacctggtc gacgcgtgga gcgcgcatga tgcgctggcg gagctggtca cgatgttcgg 4172161 cggcgcgttt ccgcagacgg accatctgga agcgcggctg gcgagcctgg acaagttcag 4172221 cacggcatgg gactaccggg cgcgcgcacg tgcagcacga gcgctccacg gcgaaccggt 4172281 gcggtgccag gactccggcg gtggggcgcg atggctgatc ccccgcctgg acttgccggc 4172341 caagaagcgg gacgcgatcg tcgggttggc gcagcagctg gggctcacct tggaatcgac 4172401 cccgcaggga acaaccttcg accacgttct agtcatcggc accggacgtc attccaacct 4172461 gatccgggcc cgctgggccc gggaattggc aaagggtcgc caggttggtc acatcgtgct 4172521 cgccgccgca tcgcgtcgat tgctgccctc cgaggatgac gcggtcgcgg tctgtgcgcc 4172581 gggcgcacgc accgaattcg agctattagc ggccgcggca agggacgcat tcggcctgga 4172641 cgtccaccca gcggtgcggt atgtgcgcca gcgggacgac aacccgcacc gggacagcat 4172701 ggtgtggcgc ttcgccgccg acaccaatga cctaggcgtt ccgatcaccc tgctggaggc 4172761 gccatcgccg gagcccgaca gcagccgcgc cacctcggcc gacaccttca cgtttaccgc 4172821 acacacgctg ggtatgcagg actcaacgtg tctgttggtg accgggcaac cgttcgtgcc 4172881 ctaccagaac ttcgacgcac tgcgaactct ggcgctgccc ttcgggatac aggtggagac 4172941 agtgggcttc ggcatcgacc gctacgacgg gctgggtgag ttggaccaac aacaccctgc 4173001 caagctgctg caggaggtcc gctcgacgat ccgggcggcc cgagccctgc tggaacggat 4173061 cgaggccggc gagcgcatgg ctaccgatcc tcggcggtga tggtgcatgg cgtggccggc 4173121 gggtagctgc ccgatacggc tcgcaaccgt cccggtggcg gccacggccg tagtcccatg 4173181 ttggctaggt accgcaccgg attgacatgc ccgtcctgcg tgcggacctc gaaatgcaga 4173241 taaccatctg ccgattcgcc ttgcgcaccg atggtgccca gttgcgctcc cgcggcgatt 4173301 cgatcaccaa ggacaaggcg gccctcgtcc ccgggccgaa atacatagac aacgtcgagc 4173361 tcgcagcgtg cgatcgtcag cgacaccagg ccatcgacct cgtcgatcgc gctgacggcc 4173421 ccggaagcga ccgcgtagac gggtgttccc ggatcggtgg cgaagtcgac accgggatgg 4173481 aaaccacccg cgtgcggacc gtacccgcgg ccgatcgcgc gcggctcccg gtcgatcggc 4173541 agccgcccgc ccggctcgag cggatcgaag tcgccgcgga tgcgccgccg gtagtcggct 4173601 ttgagcaggt cgacctcgtc gagtgcgtag ccaaacaaca aagatcggtc gtaggccacc 4173661 ccgaagttga accgatagtc cggatcgagc cgcagatagt gctccaccct caagatccgt 4173721 tccgccagcg tcgaccagcc gctatgcacc agccgaaccc ccggaagctg tccgatgcga 4173781 ccgtgatcgg tgatgttggc cggccaatgc gggttgtgca tcagcttgcc gcctgcccgc 4173841 agacccgggt accagcgcca caacggtcca cgtagcgctt cggcggttcc catcaccgga 4173901 atcaggtcgg gatactccgg atcatcccag cgtgacacca tcggacacat cagcgccacg 4173961 atgtcgtccg gtgtgcgggc taacaccgcc cgaagatcga tgtcggtctc gaccaaccaa 4174021 tcggcatcga ccatcatcac ccagtccggg cggcagaagt ccgccatccg atacagcagt 4174081 tccagcccgg cggactcagg aatcagccat ggcgtgggcg gcagatctgg tcgggcccgc 4174141 accacgttcg tcaccgcagg atggttcgcc aggatctcgg cggtgtcatc ggtgctgcgg 4174201 tcgtcgatca cgtagatgtc gtcgctgaac acggccaacg agtccaacgt tgcggctagt 4174261 gtccgcccgg cgttgtgcgc acgcgtcatc gccagaatcc gcatgccgcc tctctatcac 4174321 cccagaacac aggtccagta attgggtctg tccgccaatc cagcgggaag ggcgggcgcc 4174381 gcgggcagat cgtgggcggc cagcagctgc gcggtggtgg tccccaccga gcgccatccg 4174441 tggttgtcca ggtagccgga cacctcgtgg cgggggccgg catagttgag tgcccagatg 4174501 tccagatgaa agccatgctc tcgccagcct cgggtcgcgg tgcggatcat ctcttccacc 4174561 cgagcggaat cccgatccgc agaaccgagg aaggcctcga gggccagccg gctcccgggc 4174621 gcgctcaagt cggtgacgtg gtccagcaga cgattctgcg cgtccggggg aaggtatccg 4174681 aacaaaccct cggcgatcca cgcggccggc tcggccgcat cgaagccgcc gcggcgcagc 4174741 gcatcgggcc aatcgtgacg caggtcggcc ggcaccatcc gcagatccgc ggtcggctgg 4174801 gcacccaagc cggcgagcgt ttgagccttg aactcgagca cccgaggctg atcgacctcg 4174861 aacaccgtcg tatccgccgg ccatggcagc cggtacccgc gtgcgtcgag ccccgacgcc 4174921 aggatcaccg cttgccgaac gccggcggcg gccgcgtcca agaagaactg atcgaagtag 4174981 cgggtgcgca ccaccaactc ggtcgtcatt cgctgcaagc cccaggccgc gtcggggtcg 4175041 tccacatcgg cagcatccag ttctccggtt gcccatcggg tgaggaactc gacacccacg 4175101 gcacgaacca acggttcggc gaacgggtcg tcgatgaggg gctgggccgc cctggccgcc 4175161 ctggcccttc ccgcggcgac cagcgtggcg gtcgcgccga caccggtggc taggtcccag 4175221 ctatcgtcgt cggtacgcgc cacggatcca tcttcggccc ggtccggccg ccaacgctcc 4175281 gctgtcgacc cgaacaaccg gttacaactg cgtgacgaat atcgatgacg gctgcacctt 4175341 aagggtgtaa cactgaagcg ccacgaatcc gatttatcgt cctgtggtga tcggtgaaac 4175401 ggcacccaca gcacgctatt aggtaaacag ctatccgggc gcaggcgaca acgcagtcac 4175461 cgaagcgccg cgaaaggtcg gcggacgtga gcgagaaagt cgagtcaaag gggctagcgg 4175521 atgcggcacg cgatcacctc gcggctgagt tggcccggct gcggcagcga cgcgatcggc 4175581 tggaggtcga ggtcaagaac gaccggggca tgatcggcga tcacggcgac gcggccgagg 4175641 cgatacaacg tgccgacgaa ctggccatcc tcggtgaccg gatcaatgaa ctggaccggc 4175701 ggctgcgcac cgggcccacc ccctggagcg ggtcggaaac gctgcccggc ggcaccgagg 4175761 tgaccttgcg gttccctgac ggtgaagtcg tcacgatgca tgtaatctcc gtcgtcgaag 4175821 agacgccggt gggccgagaa gccgaaaccc tgacggcgcg cagcccacta ggtcaggccc 4175881 tggccggtca ccaacccggc gacacggtga cctactcgac cccgcagggt cctaatcagg 4175941 tccagctgct tgctgtcaag ctgccctcat aattcgcaca ccgcaccagg ctcgccgccc 4176001 ccattagact tcccccgatg atccgatcgg agtctggtgc cgcgccgcca cgccaacacc 4176061 tgcacctgtc ggcacaggta atgcggttcg ttgtcaccgg cggcctcgct gggatagttg 4176121 actttggcct ctacgtcgtg ctgtacaagg tggcgggcct acaggtcgac ctgtccaagg 4176181 ccatcagctt catcgtcggc accatcaccg cgtacctgat caaccgccgg tggacattcc 4176241 aggccgagcc cagcacggcc cgattcgtcg cggtcatgct cctctacgga atcaccttcg 4176301 ccgtgcaggt cggactcaac cacctctgcc tcgcactctt gcactaccgg gcgtgggcca 4176361 tccccgtcgc gtttgtgatc gcgcagggca ccgccacggt aatcaacttc atcgtgcagc 4176421 gagccgtgat cttccggatc cgctgagccg gtcagggtcg aatcgggcgg gtaccctctt 4176481 tgacgatgtt gagcgtggga gctaccacta ccgccacccg gctgaccggg tggggccgca 4176541 cagcgccgtc ggtggcgaat gtgcttcgca ccccagatgc cgagatgatc gtcaaggcgg 4176601 tggctcgggt cgccgagtcg gggggcggcc ggggtgctat cgcgcgcggg ctgggccgct 4176661 cctatgggga caacgcccaa aacggcggtg ggttggtgat cgacatgacg ccgctgaaca 4176721 ctatccactc cattgacgcc gacaccaagc tggtcgacat cgacgccggg gtcaacctcg 4176781 accaactgat gaaagccgcc ctgccgttcg ggctgtgggt cccggtgctg ccgggaaccc 4176841 ggcaggtcac cgtcggtggg gcgatcgcct gcgatatcca cggcaagaac catcacagcg 4176901 ctggcagctt cggtaaccac gtgcgcagca tggacctgct gaccgccgac ggcgagatcc 4176961 gtcatctcac tccgaccggc gaggacgccg aactgttctg ggccaccgtc gggggcaacg 4177021 gtctcaccgg catcatcatg cgggccacca tcgagatgac gcccacttcg acggcgtact 4177081 tcatcgccga cggcgacgtc accgccagcc tcgacgagac catcgccctg cacagcgacg 4177141 gcagcgaagc gcgctacacc tattccagtg cctggttcga cgcgatcagc gctcccccga 4177201 agctgggccg cgcggcggta tcgcgtggcc gcctggccac cgtcgagcaa ttgcctgcga 4177261 aactgcggag cgaacctttg aaattcgatg cgccacagct acttacgttg cccgacgtgt 4177321 ttcccaacgg gctggccaac aaatatacct tcggcccgat cggcgaactg tggtaccgca 4177381 aatccggcac ctatcgcggc aaggtccaga acctcacgca gttctaccat ccgctggaca 4177441 tgttcggcga atggaaccgc gcctacggcc cagcgggctt cctgcaatat cagttcgtga 4177501 tccccacaga ggcggttgat gagttcaaga agatcatcgg cgttattcaa gcctcgggtc 4177561 actactcgtt tctcaacgtg ttcaagctgt tcggcccccg caaccaggcg ccgctcagct 4177621 tccccatccc gggctggaac atctgcgtcg acttccccat caaggacggg ctggggaagt 4177681 tcgtcagcga actcgaccgc cgggtactgg aattcggcgg ccggctctac accgccaaag 4177741 actcccgtac caccgccgaa acctttcatg ccatgtatcc gcgcgtcgac gaatggatct 4177801 ccgtgcgccg caaggtcgat ccgctgcgcg tattcgcctc cgacatggcc cgacgcttgg 4177861 agctgctgta gatggttctt gatgccgtag gaaaccccca gacggtgctg ctgctcggtg 4177921 gcacctccga gatcgggctc gccatctgcg agcgctacct gcacaattcg gcggcccgca 4177981 tcgtgctggc ctgcctgccc gacgacccac ggcgggagga cgcggccgct gcgatgaagc 4178041 aggccggcgc gcggtcggtg gagctgatcg actttgacgc cctggatacc gacagccacc 4178101 cgaagatgat cgaggcggcc ttctccggcg gtgatgtgga cgtggctatc gtcgcgttcg 4178161 gcttgctcgg cgacgccgaa gagctgtggc agaaccagcg caaggcggtg cagatcgccg 4178221 aaatcaacta caccgcagcg gtttcggtgg gcgtgctgct ggctgagaag atgcgcgctc 4178281 agggcttcgg tcagatcatc gcgatgagct cggccgccgg tgagcgggtg cgacgggcga 4178341 acttcgtcta cggctccacc aaggccggtc tggacgggtt ttacctgggg ttgtcagaag 4178401 cgctgcgcga gtacggtgtt cgtgtgctgg tgatccggcc cggccaggtg cgtacccgga 4178461 tgagcgcgca cctcaaggaa gctccattga ccgtcgacaa ggagtacgtc gccaacctcg 4178521 cggtgaccgc gtccgcaaaa ggtaaggaat tggtttgggc gccagcagcg ttccgctacg 4178581 tcatgatggt gttgcgtcac atcccgcgga gcatcttccg caagctgccc atctgagtat 4178641 gccgagcaga cgcaaaagcc cccaattcgg gcacgaaatg ggggctttta cgtctgctcg 4178701 cgcccgggag gtgctggtcg ctcttggcca gctggcagcg gcggtggtag tggccgtcgg 4178761 tgtcgcggtg gtgtccctgc tcgccattgc gcgggtggag tggcccgcct tcccgtcgtc 4178821 caaccagctg catgcgctga ccaccgtcgg ccaggtcggc tgcctggccg ggctggtcgg 4178881 catcggctgg ttgtggcggc acggtcgatt ccggcgactg gcccggctgg gcgggctggt 4178941 tttggtatcc gcgtttaccg tcgtgacgct gggcatgccg ctgggcgcca ccaagctgta 4179001 tctgttcggc atctctgtcg accagcagtt ccgcaccgaa tacctcaccc ggctcaccga 4179061 caccgccgcc ctgcgcgaca tgacctacat cggactgcca ccgttttacc caccgggctg 4179121 gttctggatc ggcggacgcg cggcggcgct gaccgggacg ccggcctggg agatgttcaa 4179181 gccgtgggcg atcacctcga tggccattgc ggtggccgtc gcgctggtgc tgtggtggcg 4179241 gatgatccgc ttcgaatacg ccttgctggt caccgtcgcc acagcggcgg tgatgctggc 4179301 ctacagctcg ccggagccct acgccgcgat gatcacggtg ttgttgccgc cgatgctcgt 4179361 actgacctgg tcgggcctgg gcgcgcgcga ccgtcagggc tgggccgcgg tggtcggtgc 4179421 cggcgtcttc ctgggcttcg cggccacctg gtacaccctg ttggtcgcct acggcgcgtt 4179481 cacggtggtg ctgatggcgc tgctgctggc cgggtcgcgg ctgcaatccg gaatcaaggc 4179541 ggcggtagac ccgctgtgcc ggcttgccgt cgtcggcgcg atcgcggccg ccatcggatc 4179601 caccacctgg ctgccctacc tgctgcgggc ggcccgcgac ccggtcagcg acaccggcag 4179661 cgcccagcac tacctacccg cagacggcgc cgcactgacc ttccccatgc tgcagttctc 4179721 cctgctgggc gcgatctgtc tgctgggcac gctgtggctg gtgatgcgcg cgcgatcatc 4179781 ggcgccagcc ggcgccctgg ccatcggcgt gctggccgtc tacctgtggt ccctgctgtc 4179841 gatgctggcc acattggcgc gcaccacact gctgtcgttt cgcctgcagc cgacgctgag 4179901 cgtgctgctg gtggcggccg gtgcgttcgg cttcgtcgaa gcggtccaag cccttggcaa 4179961 acggggtcgc ggtgtcattc cgatggccgc cgccatcggg ttggccggcg cgatcgcgtt 4180021 cagccaggac atccccgacg tgttgcggcc ggacctgacc atcgcctaca ccgacaccga 4180081 cggctacggc cagcgcggcg accggcgacc gcccggctcc gagaagtact acccagccat 4180141 cgatgccgcc atccggcgcg tcaccggcaa gcgccgcgat cggaccgtcg tgttgaccgc 4180201 cgactacagc ttcctgtcgt actaccccta ctggggcttt caggggttga cgccgcacta 4180261 cgccaacccg ctggcacagt tcgacaagcg cgccacacag atcgacagct ggtcgggact 4180321 ctccaccgcc gacgagttca tcgccgcgct ggacaagctg ccctggcagc cgccgaccgt 4180381 cttcctcatg cgccacggcg cacataacag ctacaccctg cggctggccc aggacgtcta 4180441 ccccaaccag cccaatgttc gccgctacac ggtggaccta cggaccgccc tcttcgccga 4180501 cccgcgtttc gtcgtcgagg acattggccc gttcgtgctg gccatccgca agccgcagga 4180561 gagcgcgtga tggctaccga agccgcccca ccccgtatcg ccgtccggct accatctacc 4180621 tccgtgcgcg acgcgggagc aaactaccgg atcgcccggt acgtcgctgt ggtggcgggt 4180681 ctgctaggcg ctgtgctggc catcgccacc ccactgctgc cggtcaacca gaccaccgcg 4180741 caattgaact ggccccaaaa cggcacgttc gccagtgtcg aggcaccgct gattggctac 4180801 gtggccaccg acttgaacat caccgtcccc tgccaggccg ccgccggact ggccggatcg 4180861 cagaacaccg gcaagacggt gttgttgtca acggtgccca agcaggcgcc taaggccgtc 4180921 gatcgcgggc tgctgctgca acgggccaac gacgacctgg tgcttgtggt gcgtaatgtc 4180981 ccgttagtca ccgccccgct gagtcaggtg ctcggcccga cctgtcagcg gttgacattc 4181041 accgcgcacg ccgatcgggt cgccgccgaa ttcgtcggac tggtgcaggg acccaatgct 4181101 gagcaccccg gtgcaccgct gcgcggtgag cgcagcggct acgacttccg cccgcagatc 4181161 gtcggggtgt tcaccgacct ggccgggccg gcgccaccgg gtctgagctt ctcggcgagc 4181221 gtggataccc gctacagcag cagccccacg ccgctgaaga tggccgccat gatcctcggg 4181281 gtagcgctca ccggcgccgc cctggtggcg ctgcacatcc tggacaccgc cgacggcatg 4181341 cggcaccggc ggttcctgcc cgcgcgctgg tggtcgatcg gcggtctgga caccctggtt 4181401 atcgccgtgc tggtgtggtg gcatttcgtc ggggccaaca cctccgacga cggctacatc 4181461 ctgaccatgg cccgggtgtc cgagcatgcg ggctatatgg ccaactacta ccgctggttc 4181521 ggcacacccg aggcgccttt cggctggtac tacgacctgc tggcgctgtg ggctcatgtc 4181581 agcacggcca gtatctggat gcgcctaccc accctggcga tggcgctcac ctgctggtgg 4181641 gtaatcagcc gtgaggtcat tccccggctg gggcacgccg tcaagacgag ccgggcagcg 4181701 gcgtggacgg cggcgggcat gtttctggct gtctggctgc cgctggacaa cggccttcgg 4181761 cccgagccga tcatcgccct gggcatcctg ctgacctggt gctcggtgga gcgggcggtg 4181821 gccaccagcc ggctgctgcc ggtggcaatc gcctgcatca tcggtgcctt gaccctgttc 4181881 tccgggccga cgggcatcgc ctcgatcggt gcgctgctgg tcgcgatcgg gccgctacgg 4181941 accatcctgc accggcgttc caggcggttc ggcgtgctac cactggtggc gccgatcctg 4182001 gccgcggcca ccgtcaccgc gatcccgatc tttcgtgatc agaccttcgc gggcgagatc 4182061 caggccaacc tcctcaagcg tgccgtaggg cccagcctga agtggttcga cgaacacatc 4182121 cgctacgagc ggctgttcat ggccagcccc gacggctcga tcgcccgccg cttcgccgtg 4182181 ctggccttgg tgctggcgct cgcggtatcg gtggcaatgt cgttacgtaa gggccgcatt 4182241 ccaggtaccg ctgctggacc gagccgccgc atcatcggca tcacgatcat ttccttcctc 4182301 gcgatgatgt tcaccccgac aaagtggacc catcacttcg gggtgttcgc ggggttggcc 4182361 gggtcgctgg gggcgcttgc cgcggtcgcg gtgacgggcg ctgcgatgcg ctcgcggcgg 4182421 aaccggaccg tgttcgccgc cgtggtggtc ttcgtgttgg ccctgtcgtt cgccagtgtc 4182481 aacggctggt ggtacgtgtc caacttcggt gtgccatggt cgaactcgtt tccgaagtgg 4182541 cgatggtcgc ttaccaccgc actcctcgag ctgacggtgc tggtgctgct gctagcggca 4182601 tggttccact tcgtcgccaa cggtgacggg cgccgaacag ccaggccaac ccggtttagg 4182661 gcacgactag ccggaattgt ccagtccccg ttggcaattg ccacgtggtt gctggtgctt 4182721 ttcgaggtgg tatcgctgac ccaggcgatg atttcccagt acccggcgtg gtcggttggc 4182781 cggtctaacc tacaggcttt ggccggcaag acctgcgggc tggccgaaga cgtgctggtg 4182841 gagctggatc ccaacgcagg catgctggcg ccggtgaccg cgccgttggc cgacgccctg 4182901 ggagccggcc tgtctgaagc cttcacaccc aacggcattc ccgccgacgt caccgccgac 4182961 ccggtgatgg aacgtccagg ggatcgcagt ttcctcaacg acgacgggct gatcaccggc 4183021 agcgaacccg gcaccgaagg gggcaccacg gccgcaccgg gaatcaacgg ctcccgcgcc 4183081 cggctgccct acaacctgga cccggcccgt acaccggtgc tgggcagctg gcgagccggc 4183141 gtgcaggtgc ccgccatgct gcggtcgggc tggtaccggc tgcccaccaa cgagcagcgg 4183201 gacagggcgc cgctgctggt ggtgacggcg gccgggcgat tcgactcccg cgaggtccgg 4183261 ttgcagtggg ccaccgacga gcaagcggcc gccggacacc acggtgggtc gatggaattc 4183321 gccgacgtcg gtgccgcgcc ggcctggcgt aacctgcgcg caccactgtc cgccatcccg 4183381 agcaccgcca cccaggtccg gttggtcgcc gacgaccagg atctggcgcc gcagcactgg 4183441 atcgccctca caccaccgcg gattccgcgg gtgcgcacgc tgcagaacgt ggtgggcgca 4183501 gcggatccgg tgttcctgga ctggctggtg gggctggcat tcccctgcca acgcccgttc 4183561 ggccaccaat acggcgtcga cgagacaccc aagtggcgga tcctgccgga ccggttcggc 4183621 gccgaagcca actcaccggt gatggatcac aatggcggtg gcccgctggg catcactgag 4183681 ctgctgatgc gcgcaaccac ggtggccagc tacctcaaag acgactggtt tagggactgg 4183741 ggcgcgttac agcggttgac gccttactac cccgacgccc agcccgctga tctgaaccta 4183801 ggaacggtga ctcgcagcgg gctgtggagt ccggcgccgt tgcgccgcgg ctagaagtgc 4183861 cgtggccacc gactcggcga caacctccgc ggccccgcat cctcaccgcc cttaaccgcg 4183921 tcgcctacca tcgagcctcg tgccccacga cggtaatgag cgatctcacc ggatcgcacg 4183981 cctagcagcc gtcgtctcgg gaatcgcggg tctgctgctg tgcggcatcg ttccgctgct 4184041 tccggtgaac caaaccaccg cgaccatctt ctggccgcag ggcagcaccg ccgacggcaa 4184101 catcacccag atcaccgccc ctctggtatc cggggcgcca cgcgcgctgg acatctcgat 4184161 cccctgctcg gccatcgcca cgctgcccgc caacggcggc ctggtgctgt ccacactgcc 4184221 ggccggtggc gtggataccg gtaaggccgg gctgttcgtc cgcgccaacc aggacacggt 4184281 cgtcgtggcg ttccgcgact cggtggccgc ggtggcggcc cgctccacga tcgcagcggg 4184341 aggctgtagc gcgctgcata tctgggccga taccggcggc gcgggcgctg attttatggg 4184401 tatacccggc ggcgccggga ccctgccgcc ggagaagaag ccacaggttg gcggcatctt 4184461 caccgacctg aaggtcggag cgcagcccgg gctgtcggcc cgcgtcgaca tcgacactcg 4184521 gtttatcacg acgcccggcg cgctcaagaa ggccgtgatg ctcctcggcg tgctggcggt 4184581 cctggtagcc atggtggggc tggccgcgct ggaccggctc agcaggggcc gcaccctgcg 4184641 cgactggctg acccgatatc gcccgcgggt gcgggtcgga ttcgccagcc ggctcgctga 4184701 cgcagcggtg atcgcgacct tgttgctctg gcatgtcatc ggcgccacct cgtccgatga 4184761 cggctacctt ctgaccgtcg cccgggtcgc cccgaaggcc ggctatgtag ccaactacta 4184821 ccggtatttc ggcacgacgg aggcgccgtt cgactggtat acatcggtgc ttgcccagct 4184881 ggcggcggtg agcaccgccg gcgtctggat gcgcctgccc gccaccttgg ccggaatcgc 4184941 ctgctggctg atcgtcagcc gtttcgtgct gcggcggctg ggaccgggcc cgggcgggct 4185001 ggcgtccaac cgggtcgctg tgttcaccgc tggtgcggtg ttcctgtccg cctggctgcc 4185061 gttcaacaac ggcctgcgtc ccgagccgct gatcgcgctg ggtgtgctgg tcacgtgggt 4185121 gttggtggaa cggtcgatcg cgctcggacg gctggccccg gccgcggtag ccatcatcgt 4185181 ggcgacgctt accgcgacgc tggcaccgca ggggttgatc gcgctggccc cgctgctgac 4185241 tggtgcgcgc gccatcgccc agaggatccg gcgccgccgg gcgaccgatg gactgctggc 4185301 gccgctggcg gtgctggccg cggcgttgtc gctgatcacc gtggtggtgt ttcgggacca 4185361 gacgctggcc acggtggccg aatcggcacg catcaagtac aaggtcggcc cgaccatcgc 4185421 ctggtaccag gacttcctgc gctactactt ccttaccgtg gagagcaacg ttgaggggtc 4185481 gatgtcccgc cggttcgcgg tgctggtgtt gctgttctgc ctgttcgggg tgctgttcgt 4185541 gctgctgcgg cgcggccggg tggcggggct ggccagcggc ccggcctggc gactgatcgg 4185601 cactacggcg gtcggcctgc tgctgctcac gttcacgcca accaagtggg ccgtgcagtt 4185661 cggcgcattc gccgggctgg ccggggtgtt gggtgcggtc accgcgttca cctttgcccg 4185721 catcggtcta catagtcgac gcaacctcac gctgtacgtg accgcgttgc tgttcgtgct 4185781 ggcgtgggca acctcgggca tcaacgggtg gttctacgtc ggcaactacg gggtgccgtg 4185841 gtatgacatc cagcccgtca tcgccagcca cccggtgacg tcgatgtttc tgacgctgtc 4185901 gatcctcacc ggattgctgg cagcctggta tcacttccgg atggactacg ccgggcacac 4185961 cgaagtcaaa gacaaccggc gcaaccgcat cttggcctct acgccactgc tggtggtcgc 4186021 ggtgatcatg gtcgcaggcg aagtcggctc gatggccaag gccgcggtgt tccgttaccc 4186081 gctttacacc accgccaagg ccaacctgac cgcgctcagc accgggctgt ccagctgtgc 4186141 gatggccgac gacgtgctgg ccgagcccga ccccaatgcc ggcatgctgc aaccggttcc 4186201 gggccaggcg ttcggaccgg acggaccgct gggcggtatc agtcccgtcg gcttcaaacc 4186261 cgagggcgtg ggcgaggacc tcaagtccga cccggtggtc tccaaacccg ggctggtcaa 4186321 ctccgatgcg tcgcccaaca aacccaacgc cgccatcacc gactccgcgg gcaccgccgg 4186381 agggaagggc ccggtcggga tcaacgggtc gcacgcggcg ctgccgttcg gattggaccc 4186441 ggcacgtacc ccggtgatgg gcagctacgg ggagaacaac ctggccgcca cggccacctc 4186501 ggcctggtac cagttaccgc cccgcagccc ggaccggccg ctggtggtgg tttccgcggc 4186561 cggcgccatc tggtcctaca aggaggacgg cgatttcatc tacggccagt ccctgaaact 4186621 gcagtggggc gtcaccggcc cggacggccg catccagcca ctggggcagg tatttccgat 4186681 cgacatcgga ccgcaacccg cgtggcgcaa tctgcggttt ccgctggcct gggcgccgcc 4186741 ggaggccgac gtggcgcgca ttgtcgccta tgacccgaac ctgagccctg agcaatggtt 4186801 cgccttcacc ccgccccggg ttccggtgct ggaatctctg cagcggttga tcgggtcagc 4186861 gacaccggtg ttgatggaca tcgcgaccgc agccaacttc ccctgccagc gaccgttttc 4186921 cgagcatctc ggcattgccg agcttccgca gtaccggatc ctgccggacc acaagcagac 4186981 ggcggcgtcg tcgaacctat ggcagtccag ctcgaccggc ggtccgttcc tgttcaccca 4187041 ggcgctgctg cgcacctcga cgatcgccac gtacctgcgt ggggactggt atcgcgactg 4187101 gggatcggtg gagcagtacc accggctggt gccggccgat caggctccag acgccgttgt 4187161 cgaggagggc gtgatcactg tgcccggctg gggtcggcca ggaccgatca gggcgctgcc 4187221 atgacacagt gcgcgagcag acgcaaaagc accccaagtc gggcgatttt gggggctttt 4187281 gcgtctgctc gcgggacgcg ctgggtggcc accatcgccg ggctgattgg ctttgtgttg 4187341 tcggtggcga cgccgctgct gcccgtcgtg cagaccaccg cgatgctcga ctggccacag 4187401 cgggggcaac tgggcagcgt gaccgccccg ctgatctcgc tgacgccggt cgactttacc 4187461 gccaccgtgc cgtgcgacgt ggtgcgcgcc atgccacccg cgggcggggt ggtgctgggc 4187521 accgcaccca agcaaggcaa ggacgccaat ttgcaggcgt tgttcgtcgt tgtcagcgcc 4187581 cagcgcgtgg acgtcaccga ccgcaacgtg gtgatcttgt ccgtgccgcg cgagcaggtg 4187641 acgtccccgc agtgtcaacg catcgaggtc acctctaccc acgccggcac cttcgccaac 4187701 ttcgtcgggc tcaaggaccc gtcgggcgcg ccgctgcgca gcggcttccc cgaccccaac 4187761 ctgcgcccgc agattgtcgg ggtgttcacc gacctgaccg ggcccgcgcc gcccgggctg 4187821 gcggtctcgg cgaccatcga cacccggttc tccacccggc cgaccacgct gaaactgctg 4187881 gcgatcatcg gggcgatcgt ggccaccgtc gtcgcactga tcgcgttgtg gcgcctggac 4187941 cagttggacg ggcggggctc aattgcccag ctcctcctca ggccgttccg gcctgcatcg 4188001 tcgccgggcg gcatgcgccg gctgattccg gcaagctggc gcaccttcac cctgaccgac 4188061 gccgtggtga tattcggctt cctgctctgg catgtcatcg gcgcgaattc gtcggacgac 4188121 ggctacatcc tgggcatggc ccgagtcgcc gaccacgccg gctacatgtc caactatttc 4188181 cgctggttcg gcagcccgga ggatcccttc ggctggtatt acaacctgct ggcgctgatg 4188241 acccatgtca gcgacgccag tctgtggatg cgcctgccag acctggccgc cgggctagtg 4188301 tgctggctgc tgctgtcgcg tgaggtgctg ccccgcctcg ggccggcggt ggcggccagc 4188361 aaacccgcct actgggcggc ggccatggtc ttgctgaccg cgtggatgcc gttcaacaac 4188421 ggcctgcggc cggagggcat catcgcgctc ggctcgctgg tcacctatgt gctgatcgag 4188481 cggtccatgc ggtacagccg gctcacaccg gcggcgctgg ccgtcgttac cgccgcattc 4188541 acactgggtg tgcagcccac cggcctgatc gcggtggccg cgctggtggc cggcggccgc 4188601 ccgatgctgc ggatcttggt gcgccgtcat cgcctggtcg gcacgttgcc gttggtgtcg 4188661 ccgatgctgg ccgccggcac cgtcatcctg accgtggtgt tcgccgacca gaccctgtca 4188721 acggtgttgg aagccaccag ggttcgcgcc aaaatcgggc cgagccaggc gtggtatacc 4188781 gagaacctgc gttactacta cctcatcctg cccaccgtcg acggttcgct gtcgcggcgc 4188841 ttcggctttt tgatcaccgc gctatgcctg ttcaccgcgg tgttcatcat gttgcggcgc 4188901 aagcgaattc ccagcgtggc ccgcggaccg gcgtggcggc tgatgggcgt catcttcggc 4188961 accatgttct tcctgatgtt cacgcccacc aagtgggtgc accacttcgg gctgttcgcc 4189021 gccgtagggg cggcgatggc cgcgctgacg acggtgttgg tatccccatc ggtgctgcgc 4189081 tggtcgcgca accggatggc gttcctggcg gcgttattct tcctgctggc gttgtgttgg 4189141 gccaccacca acggctggtg gtatgtctcc agctacggtg tgccgttcaa cagcgcgatg 4189201 ccgaagatcg acgggatcac agtcagcaca atctttttcg ccctgtttgc gatcgccgcc 4189261 ggctatgcgg cctggctgca cttcgcgccc cgcggcgccg gcgaagggcg gctgatccgc 4189321 gcgctgacga cagccccggt accgatcgtg gccggtttca tggcggcggt gttcgtcgcg 4189381 tccatggtgg ccgggatcgt gcgacagtac ccgacctact ccaacggctg gtccaacgtg 4189441 cgggcgtttg tcggcggctg cggactggcc gacgacgtac tcgtcgagcc tgataccaat 4189501 gcgggtttca tgaagccgct ggacggcgat tcgggttctt ggggcccctt gggcccgctg 4189561 ggtggagtca acccggtcgg cttcacgccc aacggcgtac cggaacacac ggtggccgag 4189621 gcgatcgtga tgaaacccaa ccagcccggc accgactacg actgggatgc gccgaccaag 4189681 ctgacgagtc ctggcatcaa tggttctacg gtgccgctgc cctatgggct cgatcccgcc 4189741 cgggtaccgt tggcaggcac ctacaccacc ggcgcacagc aacagagcac actcgtctcg 4189801 gcgtggtatc tcctgcctaa gccggacgac gggcatccgc tggtcgtggt gaccgccgcg 4189861 ggcaagatcg ccggcaacag cgtgctgcac gggtacaccc ccgggcagac tgtggtgctc 4189921 gaatacgcca tgccgggacc cggagcgctg gtacccgccg ggcggatggt gcccgacgac 4189981 ctatacggag agcagcccaa ggcgtggcgc aacctgcgct tcgcccgagc aaagatgccc 4190041 gccgatgccg tcgcggtccg ggtggtggcc gaggatctgt cgctgacacc ggaggactgg 4190101 atcgcggtga ccccgccgcg ggtaccggac ctgcgctcac tgcaggaata tgtgggctcg 4190161 acgcagccgg tgctgctgga ctgggcggtc ggtttggcct tcccgtgcca gcagccgatg 4190221 ctgcacgcca atggcatcgc cgaaatcccg aagttccgca tcacaccgga ctactcggct 4190281 aagaagctgg acaccgacac gtgggaagac ggcactaacg gcggcctgct cgggatcacc 4190341 gacctgttgc tgcgggccca cgtcatggcc acctacctgt cccgcgactg ggcccgcgat 4190401 tggggttccc tgcgcaagtt cgacaccctg gtcgatgccc ctcccgccca gctcgagttg 4190461 ggcaccgcga cccgcagcgg cctgtggtca ccgggcaaga tccgaattgg tccatagcgt 4190521 caggctccgc agtcgatagc ggcacgatgt tcgtcattag acggccccat cagttaggcc 4190581 tcctatgctg ctcggtatgc accaggccgg ccatgttggc acacacgaac ggcgcgcagc 4190641 cgcaacgagg cggtccgccc tgactgcggc agggttagcc gtcgtcggcg caggggtgtt 4190701 gggcgcgtcg gcgtgcagtc cacaaaagtc tcctcagcca tcatcacccc ggttgcccga 4190761 caatgcgctg atcacgctcg gggtggccgc cggcccgccg cctacgccca gcagagtagg 4190821 aatctcgtcg gtgctgaaaa ttggccgcga tctgtacgtg atcgattgcg gcctgggctc 4190881 gctgaacgca ttcaccaacg cgggcctgca attcgacgat ctcaaagcca tgtttatcac 4190941 ccacttgcac accgaccaca tcgtcgacta ctacaacttc tttctctccg gtggcttcct 4191001 tgccccaccc ggtcgagcgc cggtcctggt ctatggtccg ggcccagctg ggggtttgcc 4191061 gccaagtgaa gtcggcaacc cgaatccagc caccgtcaac cccgccaacc cgacaccggg 4191121 ccttgccgcg gccaccgaag cgctgcatcg agcgttcgct tacaccagca acatcttcat 4191181 ccgcgactac ggcattgaca acgttgcgga cctggttaaa gtcacggaga tcgggctacc 4191241 accaggatcg gactaccgca acagagcgcc aaagatgagc ccgttctcgg tcgcatcgga 4191301 cgacaacgtt tccgtcaccg caacgctggt ctcccactac gacgtctacc cagcgttcgg 4191361 attccgcttc gatctgaaga aatcgggtgt gtccgttacc ttctcgggtg acaccactaa 4191421 gtccgacaac ctgattaccc tcgctcaagg cactgacatt ctggtccacg aggcggtgtt 4191481 cagcctcgat acggcttact ttggcaacgc tttccccccg aactatctgg tgaactcaca 4191541 catctccgca gagcaggtgg gggaggtggc cgcagcggcc aagcccaaac aattgatcct 4191601 gagccactac gcccctgacg acctacccga ctcgcagtgg ctcgacaaga tcaagaagaa 4191661 ttactcgggc atgaccacca tcgcgcggga cggccaggtc ttcgccctct gatccgttag 4191721 cggtagcgcc ccgttcgacg atcgctgcct agagctagac atatataaaa cctatgcaat 4191781 agggtcgcgg catgcccgag tacgacctag aggccgtgga caagctgccc ttctcgaccc 4191841 ctgaaaaggc gcagcgctac caaacggaaa actatcgcgg ggccatgggc ctcaactggt 4191901 acctcacgga tccgaccctg cagttcatca tggcctatta cctacgaccc gatgaattgg 4191961 cgttcgcaga accccatctg acccgcattg gtgagctgac gggcgggcca gtgacgcgtt 4192021 gggccgagga aaccgaccgc aaccccccgc ggctcgaacg ctacgaccgg tgggggcatg 4192081 acatcagccg ggtagtgctg ccggaatcgt tcatccaatc caagcgcgcc gtcatcgagg 4192141 cgcgacaagc cgtgcgcgac gacgcggcac gggccggcgt caagccgtcg ctggcactct 4192201 tcgccgccga ctatctgctc aaccaggccg atatcggtat ggcttgcgcg ctcgccactg 4192261 gcggcaacat ggtccggtcg ctggtgactg cctacgcgcc acccgatgtg cgcgaattcg 4192321 tcctaggcaa actcaattcc ggcgagtggg acggcgaggc cgcgcagctg ctgacggagc 4192381 gtgcgggcgg ctccgatctg ggagctctgg agacgacggc cacccgcagc ggcgacgtgt 4192441 ggctgctgaa cggcttcaag tggtttgcgt ccaactgcgc cggggaggcg ttcgtggtgt 4192501 tggccaagcc cgagggggcg cctgactcga ctcgaggtgt ggccaccttc ctcgtgctac 4192561 ggacgcgccg tgacggttcc cgcaacggcg tgcgtatccg tcggctgaag gacaagctcg 4192621 gcacccgctc tgtcgcctcc ggtgaaatcg agttcgtcga cgccgaagcc tttctgttgt 4192681 ccggcgaacc gagcgctgac gcgggcccgt ccgacggcaa gggactcacc cgcatgatgg 4192741 agctgaccaa cagattgcgg ttgggcaccg cctcgttcgc cctcggcaac gcgcgccgcg 4192801 cgctggtcga atcgctgtgc tacgccgggc agcggcgggc attcggtggg gcgctcatcg 4192861 acaagccgct gatgcgccgc aagctggccg aaatggtcgt tgatgtggaa gccgcgctgg 4192921 cgatggtgtt cgacggcttc ggagcggcga accaccgcca gcccagatgc ctgccgcaac 4192981 gtatcgcggt gccggtcacc aagcttaaga cttgccggct cgggatcacc gtggcatcgg 4193041 atgcgatcga gatccacggc ggcaatggct acatcgagac ctggccggtg gcccggttgc 4193101 tgcgtgacgc gcaagtcaac acgatctggg agggccccga caacatcctg tgtctggatg 4193161 tgcggcgcgg gatcgagcag acgcgcgctc acgagacact gttggcgcgg ctgcgcgatg 4193221 cggtgtcggt gtccgacgat gacgacacca cgcggctggt ctcgcgccgc attgaggacc 4193281 tcgacgcggc gatcaccgct tggaccaaac tcgacaggca gctggccgag gcgcggctgt 4193341 tcccgctggc ccaattcatg ggcgacgtct acgccggcgc gttgctcacc gagcaggccg 4193401 cctgggaacg ggcaacccgc ggcaccgacc gcaaggcact cgtcgcccgc ctgtacgcgc 4193461 gccggtatct cgccgaccaa ggcccgctgc gcggtatcga cgcagattgc gatgaggcgc 4193521 tgcagcgttt cgacgaactc gtggcgggcg cgttcactgc cgagcagacg taaaagcccc 4193581 caattcgtgg ctcttctgac acttccgtgg gtgagtttgt gtcctgagta ggcgcacgtc 4193641 gttgtggctt aaggtttctg gcttgtcaag gatcagaaac acaaggagcc gacaacgacg 4193701 tgcgcaatgt gaggctattt cgtgcgctgc tgggtgtcga caagcgcacc gtgattgagg 4193761 acatcgaatt cgaggaggat gacgccggag acggtgcgcg ggtgatcgcc cgggtgcggc 4193821 cacgaagtgc agtgttgcgc cgctgtggtc gctgcggtcg caaggcgtcc tggtatgacc 4193881 gcggtgcggg cctgcgccaa tggcgcagtc tggattgggg caccgtcgag gtgttcttgg 4193941 aggccgaggc gccgcgggtg aactgcccca cccatgggcc gacggtggtg gcggtgccgt 4194001 gggcgcgtca tcatgccggg cacacgtatg ctttcgatga cacggtggcc tggctggcgg 4194061 tggcgtgttc gaagaccgcg gtgtgcgagt tgatgcggat cgcctggcgc accgtcgggg 4194121 cgatcgtggc ccgggtctgg gccgacaccg aaaagcgcat tgaccggttc gcgaacttgc 4194181 gccgcatcgg tatcgatgag atctcctaca agcgccacca ccggtacctg acggtggtcg 4194241 tcgatcacga cagcggccgg ttggtgtggg ccgccccgag ccaccctggg cttgttcttc 4194301 gatgccctgg gcgctgagcg ggccgcccag attactcacg tttcggccga tgccgcggac 4194361 tggatcgctg acgtggtcac cgagcgctgc ccggatgcga ttcaatgcgc cgatccgttt 4194421 catgtggtgg cctgggccac cgaggcgctc gacgtcgagc ggcgccgagc ctggaacgac 4194481 gcacgggcga tcgcgcgcac cgaacccaag tggggccggg gccggcccgg taagaacgcc 4194541 gcaccacgtc cgggccgcga gcgggcacgg cggctcaagg gcgcccgcta cgcgctgtgg 4194601 aagaaccccg aggacctcac cgaacgccaa agcgccaaac tggcctggat cgccaagacc 4194661 gatccccgtc tgtatcgcgc ctacctgctc aaagagagcc tgcggcatgt gttttcggtc 4194721 aagggcgagg aaggtaaaca ggccctggac cggtggatct cctgggccca gcgctgtcgc 4194781 atcccggtat tcgtcgagct tgccgcccgc atcaaacgcc accgggtggc catcgacgcc 4194841 gccctcgacc acggcctatc ccaaggcctg atcgaatcca ccaacaccaa gatccgccta 4194901 ctgacccgga tcgcgttcgg attccgctca ccacaagccc tcatcgccct agccatgctc 4194961 accctcgccg gccaccgccc caccctgcca ggccgacaca accacccaca gatcagtcag 4195021 tagagcccga aatgggggct tttacgtctg ctcgcgctac ccagctagac cgggatcagg 4195081 ccgtgcttgc ggcccacccg ccaccacagc tgcttgtccc gcagcaggtg catcgacttg 4195141 cgcaacagca gccgggtctc atgcgggtcg atgacggcat cgatgaaccc gcgctcggcg 4195201 gcgatccacg ggatcgccat gttgaggttg taattctcga cgaagctctt ccggatcgct 4195261 tgcgcctccg gcgcattcgg gtccgggaaa cgcttcatca gcaactgcgc ggccccgtcg 4195321 gcgccgatca ccgcgatgcg cgcggtgggc caggcgaagt tcaggtcggc ggtcagctgc 4195381 ttggacccca tcaccgcgta ggcaccgccg taggacttgc ggatggtgat cgtcaccttc 4195441 ggcacatcag cctcgaccac cgcgtacaag aacctcccac cgcgcttgat gatcccgttc 4195501 ttttcctgtt ccaccccggg caaaaacccc ggtgtgtcca cgacgaacac cagcgggatg 4195561 tcgaacgcgt cgctaaaccg gatgaaccgt gcggccttgt cggacgcctc gttgtcgatc 4195621 gcccccgaca tgtgcatggg ctggttggcc accacaccaa cggtccgccc gtccacccgc 4195681 gcgtagccgg tgatgatcgc ctgcccggcc tgggcagcga cgtcgaggaa gtcgccgtcg 4195741 tcgaagatcc gcagcaggac ctcgtgcatg tcgtaggcca tgttgtccga gtccggcacg 4195801 atcgagtcga gttccagatc gtggccggtg atttcgggtt ccagcccggg gttgacgacc 4195861 ggcggtttgt cgaagcagtt ggacggcaga aacgacagaa agtcccgcac gtactggtat 4195921 gcggcggcct cggactccac cacctgatgg atgttgccgt agctcgcctg gtggtcggcg 4195981 ccccccagct cgtcgaggct gacgtcctca ccggtgacgt ccttgatgac gtcggggccg 4196041 gtgacgaaca tgtaaccctg gtcgcgcacc gccaccacca gatcggtctg gatcggcgaa 4196101 tacaccgctc ccccagcgca tttgcccaaa atgatggaga tctgcggcac cagcccactg 4196161 agcagttcgt ggcggcgccc cagctcggcg taccaggcca gcgaggtgac ggcgtcttgg 4196221 atgcgggcgc cgccggagtc gttgatgccg acgatcgggc agccgaccat cgcgcaccac 4196281 tccatcagcc gggccacctt gcggccaaac atctccccga cggtgccgcc gaacacggtt 4196341 tggtcgtgcg agaacacgcc gaccggccgg ccgttgatga ggccatgtcc ggtgaccacg 4196401 ccgtccccgt agagcgcgtt ggggtcaccg ggggtgcggc acagcgctcc gatctccatg 4196461 aagctacccg gatcgaccag ctcgtagatg cgggcgcggg cactcgggat gcccttcttg 4196521 tcgcgcttgg cggcggcctt ctcaccgccg ggttccttgg ccaactccag gcgttcgcgc 4196581 agctccgcca gcttctcggc ggtggtgtgc agaaccggct cggtgacggt cactgcttgc 4196641 ctacctcact tgttcgatcg gcctcgatct gccccaacgc gcggctcatg tgttcgccca 4196701 ccttggcgat gatcggctcg tcgatggcct gaatgtgctc gccaccgatc ggcaccacct 4196761 cgaggtcgga aacgtactcg ccccacccgc cgtccggctg gcgcacggcg tagcggggct 4196821 cgaacatgat cgcgtcgtca tggtagcgat cggccatgta gagggtgaca tgcccgtcgt 4196881 acggctggat ctgggcggtg tcgatcgccc ggttgtccag atacgacgtg cgttggtgtt 4196941 cgatgatccc ggccgggatc tgcacaccgg actggctgac ggcgtccagc acgaaccgga 4197001 cctggccctc gtcgtcgagc tcctcgagct gctcgtacgg gatcgccggg atggtcacgt 4197061 tgaacgtctt ctcggcgaag gcggcgtagc ggtcccagcg cttgcggatc tcctccttgg 4197121 tctgcgggat ctcctcaccg gcgcgcaccg cgtcgatcag cccgacgaac cgcacgtcct 4197181 tgcccagccg ccgcaaaccg atcgcgcacg cgtaggccag cacaccgccc agcgaccaac 4197241 ccaccaggac atagggcccg tcgccctgca tctcgatcag cttcggcacg tactgctgtg 4197301 cacgctcttc gatcgacccc tcgacccgtt cgaagccata cattggggtg tccgccggca 4197361 gccggcccag cagcggctcg tacaccaccg tcgagccgcc ggccggatga aacacgaaca 4197421 ccggcacctt cccgcctgct tcgggccgcg cccgcagggt gcggacgaac ccatcgatct 4197481 gcccggcctc caaatacgtg cgcaccttgt cggccagcgc ctcgatgttc gacgacgtca 4197541 gcacgtcctc ggcggtgatc gggccttcgg cgcgctcgga aagccgctgc gcaatcttgg 4197601 ccgcggcctc gtcgtccagc ctgggcagct cgttgaagat gccgcccggg gacttgccgg 4197661 tgacgatcgc ccaggtggcg aaggtgaccc gctcggcagc gtcccgcggc ggcacgtcga 4197721 cgttgagcgc gggccctgtc gggtttggct gctcgccgtt ttgcggcgac gggagcgcaa 4197781 ccccggcttc cgagtcgacc ggctcggtct tgcccacctt gccatgcagc aattcggcct 4197841 gggcccgcgc gatctcctca gcggtctggg ttttctggtg ctcgtgcagc tgctgcacct 4197901 cgtcgcggtg ctcgaccgcg tattcgatca gcttctccac gttgtagagg ttggcgtcgc 4197961 gcaccgcggt cagctggatc ggtggcaggt cgaagtcgta ctcgacgcgg tttttgatgc 4198021 gcaccgccat cagcgagtcc aggccaagct cgatcagcgg cacctcccac ggcaggtcct 4198081 cgggctcata gcccatcgca gacccgacaa tcaggcccag ccgctcggcg atggtctcac 4198141 cggaatcagg cgaccatcgg gtcatgccgg acggcatgta acgggtggtc aggctgtccg 4198201 aaagcgtctc ggcgtccgcg tcttcggcgg gcgtttccgg cgcgacaggc gccccgtccg 4198261 caaccgcgat cgccgtcgcc gcacccaccg cggtgggcaa caccgattcg gaccccgctc 4198321 gggacaccag ggcgtcgtag accagcgtga aggactcgtc gatgcgggcg tgcacctgca 4198381 ccgaggcgcc gccggggtga cgggtcatcg tcgtcaccag ccgggcgccg tcgccgggca 4198441 ccgcgcgctg ctcggcggcg gtcagttgcg cgtccggaag cacgtgggcg gcggcggccc 4198501 tgaccaacgc ggccaagtcc acattgccgt cccgcggcgc gtactcccag acgtgccgcc 4198561 catccggcag ggcgacatgg gtgcccggca tgtacgtcga gccgtcgccg gagaagtgcg 4198621 cgggcagcca gtgctccttg cgcttgaacc gggtcggcgg aatgttcgcg taatcctgcg 4198681 gcccactggc gcggctaaac agcgtgcgta tgtccaggtc gtggccgtac acatacagct 4198741 gcgccatggt cgagaccatc gaggagacct cgtcttgctt gcgggccagc gtcgggatca 4198801 actgggcgtc atgcagcccg gcatcggcgg tggtcagggc gacctgcatc agcgccaccg 4198861 gattgggtgc cagctccagg aaggtggtgt gcccgctgtc gacggcgttg cggatgccgt 4198921 gggtgaagta gacggaatgc cgcagcccct tcttccagta ttcgacgtcg tggatgggtt 4198981 cgccgccggg tttgatgtag cggccctcgt gcaccgtcga gaagatccca cacgtcgggc 4199041 tcgtcggctt gatgccttgc agctccgcgg tgagctcgcc cagcagcggg tccatctgcg 4199101 aggtgtggct ggcgcccttg gtcgcgaatt tgcgggcgaa cttgccctcg gcctcggcgc 4199161 gggcaaggat cgcgtccacc tgctcggggg ggccgccgat gaccgtctgg gtgggcgcgg 4199221 cgtagacaca cacctccaga tcggggaagt cggagaacac ttctctgatt tcgtcggcgg 4199281 agtattccac cagcgccatc aaccggatgt actcgccgaa cagcatcgcc tcaccctcgc 4199341 ccatcaggtg cgagcgcgag cagatcgccc gggtggcatc ccgcagcgac agcccgccgg 4199401 cgaagtaggc cgacgcggcc tcacccagcg actggccgat gaccgcggcc ggtttggcgc 4199461 cgtgatggcg cagcagctca cccagcgcga tctggatcgc gaagatggtg acctgggtgg 4199521 tctcgatgcc gtagtcctgc gcgtcgtcca ggatcagctc cagcaccgag tagcccagct 4199581 cgtcttggac cagggcgtcg accttctcga tccacgccgc gaacacctcg ttgcgcaggt 4199641 acaggctctt gcccatcttg cgatgctggg cgccgaatcc ggcgagcacc cagaccgggc 4199701 cggtggtcac cggcccgtcg acgctgaaca cgttcggcgc ctgcttgccc gcggcgaccg 4199761 cgcgcaggcc cttgatggcc tcgtcgtggt cgtgggccaa caccaccgcg cgggaacggc 4199821 cgtggttgcg ccgcgacaac gacctgccga tcgattccag cgaggaggcc tggccttccg 4199881 ggctttgcat ccagtccgcc aactcggcgg ccgccgcctt cttgcgggac gtcagaaacg 4199941 ccgacaccgc caacgggacc aatggtgccg taacctcttg ggccgcaagc tcttccaacg 4200001 cggcttcctt gagccgcagc gcctcctcgg tgactccggg cagttcgggc tccggctctt 4200061 cggcgaccgc cgagtcggtg atgatgttgc cgaactcgtc gaaccgcagc gcgtggcctg 4200121 ccaacgtggg cgcctcggcg ggttcggcgg ccgccttggg ttccggctcg ggttccggtt 4200181 ccttttccac cacgtcacgc ggcaggacct cgcgcaccac cacgtgcgcg ttggcgccgc 4200241 cgaagccgaa gctggacacc ccgcccagcg cgtagccgcc gtatcgcggc cagtcggtgg 4200301 gcgtggtgat catcttcaac cgcatcgcgt cgaagtcgat gtaggggctg gggccggcga 4200361 agttgatcga cggcggcagt ttgtcgtgct gcagcgccag caccaccttg gccatgctgg 4200421 ccgcgccggc cgccgattcc aggtgcccga cgttggtttt caccgcaccc agcagcgccg 4200481 gccgatcggc cggacggccc ctaccgacca cccggcccag cgcctcggcc tcgattgggt 4200541 cgccgaggat ggtgccggtg ccgtgcgcct cgatgtagtc gacggtgcgc ggatcgatgc 4200601 cggcgtcctt gtaggcccgg cgcagcacgt cggcctgcgc gtcctggttg ggtgcgatca 4200661 ggccgttgga ccggccgtcg tggttgaccg cgctgccggc gatcacggcc aggatcgcgt 4200721 cgccgtcgcg gcgggcgtcg tcgacccgct tgagcaccag catgccgccg ccttcggagc 4200781 gggtgtagcc gtcggcgtcg gctgagaacg acttgatccg gccgtcgggc gccagcaccg 4200841 caccgatctc gtcgaaaccc agggtgacca tcggtgtgat caacgcgttc accccgccgg 4200901 cgaccactac gtcggcctcg ccgttgcgca gcgcctgcac cccctggtgg atggccacca 4200961 gcgaactcga gcacgcggtg tcaatggtga ccgacggtcc gtggaagtcg tagaagtagg 4201021 acacccggtt ggcgatgatc gagctgctgg tgccggtgat cgcatacggg tgcgcgaccg 4201081 tcgggtccga caccgccagg aagctgtagt cgttggtgga gctgccgatg tacacaccga 4201141 cggcctggcc gcgcaggctc gacgccggga tgcgggcgtg ctcgagcgcc tcccaggtca 4201201 gctccagcgc catccgctgc tgcgggtcga tgttgtcggc ttcggtcttg gccaccgcga 4201261 agaactccga atcgaagccc ttgatgtcct tcaggtagcc gccccgggtg cgggccccgg 4201321 cgacccgcgc ggccagccgc ggctcttcga ggaattccga ccagcgcccg tcgggcaggt 4201381 cggtgatccc gtcgcggcct tccagcagcg cctgccaggt ctgctcgggg gtgttcatct 4201441 cgcccgggaa gcgggtggac aagcccacga tcgcgatgtc gacgcgctcg gccgggccgg 4201501 tgcgcgacca gtcttcggcg tcatcgcccg ctaggtcggt ctccggctcg ccctcgatga 4201561 tccgggtggc cagcgattcg atggtcggat gcgcgaacgc caccgcgacc gacagcgtga 4201621 ccccggtcag gtcttctatg tcggcggcca tcgcgacggc atcgcgcgac gacagaccca 4201681 gctccaccat gggcaccgat tcgtcgatcg agtccggtgc ctttccgacg gccttaccca 4201741 cccagttgcg cagccactgg cgcatctcgg ggaccgttag ctcggccctt tcggcggggg 4201801 cgttctcctg ggattccgct acgtcagcca tgggtcctca gtccgaagtg gcgaagaccg 4201861 tcggggaacc cacgccactg cgcaggctgc cgtcgaggta ggccgcacgg caggcgcggc 4201921 ggccgatctt gccgctggag gttcgcggaa tcgtgccggc cgacaccagc aggacgtcac 4201981 gcacggtcac cccatgcccg acggcgatgg ccgcccggat gtcatcgacg atgggctggt 4202041 ggtcgagctt atgcgtgccg gccgcccgtt cgccgacgat caccagctgc tcggaggtgt 4202101 cctcggggtc gaatttcagc ccggcgtgcg agtcgtcgaa cactgtctga ggaagctggt 4202161 tggccggaac cgagaaggcc gccacgtagc caacccgcaa cgccttggtc gactcctgcg 4202221 ccgtgcactc gagatcctgt gggtagtgat tgcggccgtc gatgatgacg aggtccttga 4202281 tccggccggc tatgtagagg tggtccttga agtaggtgcc gtagtcgccg gtacgcaccc 4202341 acagcgcgtc gtctggggcg ccctcggcgc gcgactcgct gatccgcgat ttgaggatgt 4202401 tcttgaaggt ctgggcggac tcttcttctt tgccccaata accggtaccc aagttgttgc 4202461 cgtgcagcca gatctcaccg atctgtccgt ccggcagttc gctggccgtg tcggcgtcga 4202521 cgatgaccgc ccattcgctg accccgacct tgcccgcaga gacctgggcg acggcgttgg 4202581 gtgcatcggc ggccacctca acgaaccgct ggttgttcag ctcgtcgcgg tccacgtgga 4202641 tcacggtggg cacctcgtcc atcggcgtgg tcgagacgaa cagcgtggcc tccgctagcc 4202701 cataggacgg cttgacggcg gtctgcttca aaccgtacgg cgcaaatgct tcgaagaact 4202761 tgcgcatcga cgccggcgac accggctcgc tgccgttgag gatgcccttg acgttgctca 4202821 ggtccagcgg cggctcgtcg tctcgaggca caccgcgcac cgcggcgtgt tcgaatgcga 4202881 agttcggcgc cgcagagaag gtgccaccgg tttctccggg cttgcgggcg agctcgcgga 4202941 tccagcgacc gggccgccgc acgaacgccg cgggcgtcat aaaggtgaag ctgtggccta 4203001 gcaccgacgc cagcagcacc gtgatcagac ccatgtcgtg gaagaacggg agccagctga 4203061 ccccgcggtc gccttcctgt ccttccaggg cattgagcac ctgcaccaca ttggtgggca 4203121 ggttcagatg ggtgatctgc acgccgctcg gtatgcgggt ggaacccgac gtgtactgca 4203181 agtacgcgac ggtttcctcg ttggcctcgg gctgctgcca ggtggcggcg acttcggtgg 4203241 gcaccgcgtc gacggcaatg acgcgcgggc gctccttggc cgatcgggcc cggatgaact 4203301 tgcggacccc ttcggcggag tcggtggtgg tcaggatcgt cgacggggca cagtcgtcga 4203361 gcaccgcgtg taaccgaccg acgtgccccg gctcggccgg gtcgaacaac ggcaccgcaa 4203421 tgcggccgga gtagagggcg ccgaagaagg agatgaggta gtccaggttc tgcgggcaca 4203481 ggatggcgac gcggtcaccc ggctgggtga cttgctgcag gcgggctccc accgcacggt 4203541 tgcgcgcgct gaagtcagac cacaagatgt cgcgcgcgac accgtctcgt tcggtggaaa 4203601 agtccaggaa ccggtaggcc agcttgtcgc cacgaacctt cgcccacttt tcgacgtgac 4203661 gaaccaggtt ggtgttggct gggaacctga tctttccatt cacgatgaac gggttgtggt 4203721 acgccatccc actctctcct gtcacaaaca tctcggccgg ctctgccggc ggccaccggg 4203781 tgtcggctcc gccaacgggt tacccgcgca catcaacccc taccgcgctc acgtcggcga 4203841 acgcagtttg cagccagctt tgacccgact gggtcctgca catgctctta gttttctctt 4203901 aatgttaagg gccggtgcct gacagaccaa atcacaaggt accgctgttc gaggccgcca 4203961 tcagcgtacg cggggcggtg tcgagtcgcc cggttcatcg gtggaccgcc gcctagcgtc 4204021 catcgtcctc ggggaaatat cacctatgtt tggggtgggg cgcattttcg ataagttgat 4204081 gcgcccagtt caacgtccac tcggtcgccg gttctccatc ggaattccag aattcgggtg 4204141 tcgcatacat agcatggacc ggctggccgg cgccgccggc cagggtgttc agcgtagtcg 4204201 gcaagttggc gggactgaac gcctgtgccg gggccgcaca gatcaggtcg ccctgggcgc 4204261 agatctcgtt ggtccggccg tcgagcgcac caaaaccgcc cggccgcggg ccggtcatag 4204321 tcaaaccaag cccggacaac actgggactt cgtgcagggt gatctcggcg ccttcgccgc 4204381 gcgggctagg cgggacctga ttacccaccc cctgctgacg acgaccgtcg gcgatcagcg 4204441 tcacgcctag tactaggtcc tcgtccacgg gtccccggcc gttgccgata tcgctagcca 4204501 cgtcgcccgc gatcaccgcg ccctgcgaaa acccgatcag cacatagctg gtcaacgggc 4204561 acctgttgtt catatcggtc atcgctgcca ccatcgcgcg ggtgccctct gcccggctgt 4204621 cgttgtacga catctgatta tccgtggtca gcggattgtg gaattgggcc gtgtaggcaa 4204681 ctgtgtaggt ctgcacccgg gcgggtgcga attgctgggc gatcggccca gttaccttga 4204741 gcagcaacgc cttcggaaac tgcaccggat tcagtgggtt ctgctgcggc gatgactccc 4204801 aggttccggg aaccgagatc atctgcacgt cggggcagga cgcatcctgg aaggccggtc 4204861 ggggtttgtg cggatgtgct ggggtgggcc ccggtggtaa aactcctggc ggcaccgcgc 4204921 tgggcggcga ttcggcgccg cgcagcatga tcaccacggc cacgatgacc agcgctacga 4204981 cggacgccat cgcgcccgcc gctatccagg caaggattcg gtggcgctta cgccgagagt 4205041 tcttggccat gttctcctgc taacagagtc ggtagcgcac gcgaaagggg tgcacccgcg 4205101 ccgcgcgata gcgcggccat cccgcccgtt gccgcactcc ctctacggta ccggcccgct 4205161 acgcggcttc gcccgagtcg cgatgtcgtg cacgtctgcc gcaaggatca tccgatagcg 4205221 gccaggcagc tcgcatcggc acctggctta gcggatcgca ccgacgatat cgcccgacat 4205281 agcgcccagc tggggcgccc acgagcccca gccgttgtca ccgctggctg ggaagtcgaa 4205341 gtgtccgttg tgcccgccga cgctgcgata ctggttgtag aacatgcggc tgttacccat 4205401 cgcctcggcg gcttggccga tcatggcggc gggatcgctg gctcccgggt tggtcgggct 4205461 ccacacccac acccgggtgt tgttttgcgc cagcaggctg gcatgcaccc acgggtcgtg 4205521 ccacttccac cgacccagct gtggtgctcc ccacattccg ttggtgtcca caccgccgaa 4205581 ttgctgcatg cccgccgcga tcgcaccgtt ggtggtggtg ttcgacgggt acaaaaagcc 4205641 cgacatcgag ccagcgaagc cgaagcggtc ggggtggaag gccgccagcg ccatcgcccc 4205701 gtaaccgccc tgagcggcgc caacggccgc atggccaccg ggggccaagc cccggttagc 4205761 ggccagccag tcgggcagct cagcggacaa gaaggtgtcc cactgcttgc tgccatcctg 4205821 ctcccagttg gtgtacatgc tgtacgcacc accggccggt gccaccaccg aaatcccctt 4205881 gcccgccaac gtgttcatcg cgttacccgc ggtgacccag ttactgacat ccgggccggc 4205941 gttgaaggcg tccagcagat acaccgcgtg cggcccaccg gctaggaagg ccaccgggat 4206001 gtcccggccc atcgagggcg acggcaccat caggttctcg tatggggcgg ccttggcggt 4206061 gggttccgcg gctaccgcga caccgcccaa cccgaatgac agtgcggcaa tccagagcgc 4206121 ccgcagcagc gccgaccgac ccttcatgtg tccacctccg tcgtgtaagg ctgtgtgcac 4206181 ccggcgtcag accgccccgg ccaaccccta gcccgtcagg tagctaacca cacggcccgc 4206241 ggcgggagct agggacggga tttaggaaac atctagcggc ggcgaccaca agggtcaccg 4206301 ccgctagatg ttgtgtctgt tcggagctag gcgccctggg gcgcgggccc ggtgttgggc 4206361 gtggcaccca gtgcccgttg caggtcgggc ttcatagcgt tgagctgcgc cccccagtac 4206421 tcccagctgt gcgtaccgct gtccgggaag tcgaacacgc cgttgtggcc gccaccggcg 4206481 ttgtaggcgt cttggaactt gatgttgctg gtccgcacga agccctcgag gaacttggcc 4206541 ggcaggttgt tgccacccag atccgacggc ttgccgttgc cgcagtacac ccagacgcgg 4206601 gtgttgttgg cgatcagctt cccgacgttc aacagcgggt cgttgcgctg ccacgccggg 4206661 tcctccttcg ggccccacat gtcggaggcc ttgtagccgc cagcgtcacc catcgccagg 4206721 ccgatcaggg tgggacccat cgcctgggag gggtccaaca ggcccgacat cgctcccgcg 4206781 tagacgaact gctgggggtg atagatcgcc agcgtcagcg ccgaagaagc agccatcgaa 4206841 agaccgacga cggcgcttcc ggtgggcttg acgtgcctgt tggcctgcag ccaccccggc 4206901 agctcgctgg tcaggaaggt ctcccacttg taagtctggc aaccggcctt gccgcaggcg 4206961 ggctggtacc agtcggagta gaagcttgac tggccaccca ccggcatgac caccgacagg 4207021 cccgactggt cgtaccactc gaacgccggg gtgttgatgt cccagccgct gaagtcgtcc 4207081 tgcgcgcgca ggccgtcgag caggtacagg gcgggcgagt tggcaccacc actttggaat 4207141 tggaccttga tgtcacggcc catcgacggc gacggcacct gcaggtactc caccggcaag 4207201 cccggccggg aaaatgcccc cgcggtcgcc gtgccaccga cggcgccgac cagacccgac 4207261 actagggccg cgccgacggc cccgaccacg agtcgacgcg acatacccgt gacggcgcca 4207321 cgaaccctgt caacaagctg cattcttgct tccctcatcc tcatctcaac gcatccatgc 4207381 atgtttgggc gcatcctgaa ttaggtcaga ctgcaggcgc tgggcccggc agtgctcgtg 4207441 tagtcaacca caacttcggg cgtccacccg catcaagcgc accgccgaaa cccttatccg 4207501 gcggtcgttc acggccaatt cgggaccgac gcgacggcct gaaggtggca tttccgcagt 4207561 gtctgggcat gtgtcgaccg ctagtgccgg ctcaattgtg atcttgctgt cagtattgcc 4207621 cccgcgctca ttgcccctca ctcccgcggt ggcgggccgg gcccgtcggg aacatcgagc 4207681 ccacaccgga ccaattcata gcgcggaacg cggtcgatgc ggtaacgggt gaactcgtag 4207741 gaatgcaaca cgttggagag gaatcggtgc agggtgatcg gggcgcgcac agaattaagc 4207801 accgcccgag tcgcggggca ttgcagggcc gcttcggctt gcgtgaccca ctgctggtcg 4207861 atgtagccgg gaatacccgg gtaccacttc acccagggtc cgtcggcgat cacccagtcc 4207921 gggaacagat tcttgtcatg gccgatacgg gcatgcttca gccgctcggt gtgcgcggcc 4207981 aatgggttta ccagcccgat ttggtcgatc acccggacat cgagcccgac gttcatgcct 4208041 agcatgccca tgttggtgaa aaacactgcg tgctgcggtt tcggcgccgg cttgccaccc 4208101 ggcgcggtcc ccgacgaggg ccggatcatc ggcaccaggt cccactggtt gtagttgccc 4208161 gacggcaata gcaacgcccc ttccggggtg ttgttgagcg ctgtaagcac ggcagccatt 4208221 cgcgggtaat cgaggtagtc cgcggcggtc agcggatgcg cgtgcccggt ggcctgggcg 4208281 tagaagcggc gctcgtcgac gatgcccgaa taggtgaccc gggtggcgtc gtcacccatg 4208341 cccggcgagt ttgccgccca cagcgaccaa cccgcgatcc ccagccagag cccgctgagc 4208401 gcgccgacta gccagcgacc ggtctcccgc gaaaagtcct taccgtcggg cagcaaaata 4208461 ggaatgaccc ccaccggggc cagcaaacaa aacagcggcg ccagcaacac ccggccgtgc 4208521 ataaagtcgc cgccttgccg aacccagtac agcgcctgca gcacgccgct gccgacgatg 4208581 aaagccacca cggccggcgg actttgcacc gcccgggcca cccgaccgta gtcgggtgcc 4208641 agcacgggac gcaggaacga cggccggcgg cgcgccgtca tcaacagcaa tcccagcggc 4208701 accgacagca ccaacggcac ccacagtgcg tacggccggt tgaagttcga cacgtagatc 4208761 atgccttgcg accacttgtc gcccgcggca tccttggcca gcgcggtact cggaaccagc 4208821 agtccgtaat agcccatccg gaagatctgg taggccaccg gcaagaatcc gccggccagc 4208881 acgatcagca cgcggcgacg ccaggtccgc gcggcgatca acatcatgat cagcgccagc 4208941 ccgccgatca gcgcgaattc cggccgcact agcacgctgc atccggcgac gaaggccaac 4209001 gcgccgagga acatctggct gtccgggcgg gcccgcagcg gctgtgacca gcagaccatc 4209061 atccaccaca acagccccag ataggccaac accagcccgc tctccaggcc ggaggtggcg 4209121 aagtcgcggg ccggtggcac cgcgatatat accagcgccc cggccggaag catgatcgcc 4209181 cgacggcccc gcaggctggg tgcgtacaac cggccggtcc ccagcatgag cagcaccatt 4209241 cccagcagcg aaagcaccat ggccagggcc aacgccacgt actccaggcg catcggcccg 4209301 cccacccagc cgcccacata cagcagatac gtccacgctg tcgaggtgtt cgcttcgact 4209361 cgctcgccct ggttgaagac cggtccgttg ccggccaata ggttgcgtac cgtccgcagg 4209421 acgatcagtc cgtcgtcagc gatccagcga cgttgccagc tcccccagcc gaacagcacg 4209481 gcgaccgccg tcaccgacag ccacaagctg acccggacca tgggctcata cggaaacacc 4209541 ggccgaccga cccgcccgac caccggccgg cggggcagca ccccgactgg gaggacgttg 4209601 agcttgaggc tagccgaagg caacagcggc cccaaccgtt gctatccacg ccagcgccag 4209661 cagctgcaat acccggtcac gcagcgcgat atcttccggc tccccggcca ggccgccatc 4209721 gacgtccacc gcgtagcgca ggatcgcgat ggtgaacgga atcatcgaca ccgcgaacca 4209781 ggacccgctg tagccgtcgc gctcgaaagc ccacagcccg tagcacaaga ccaccgcggt 4209841 ggccgacaac gtccagacga accgcagata ggtgctggtg tagctttcca gcgacttgcg 4209901 gatcgcagcg ccggtgcgtt cggccagatg cagctcggcg tagcgcttgc cggccaccat 4209961 gaacagcgaa ccgaatgcca tgatcagcaa aaaccacttg gacagcggga ttttggtggc 4210021 cacgcccccg gcgatagcgc ggatcaaata cgccgacgac acgacgcaga tgtccaccac 4210081 cgcttgatgc ttgagaccaa agcaatacgc caactgcatg gcgaggtaga cgaccattac 4210141 cagcgccagg ttcggggtca gcatccaggc accggccagc gatgtcactc ccagtaccac 4210201 cgccacggtg tacgccagcc actcgggcac cacgccggcg gcgatcggcc ggaacctttt 4210261 ggtggggtgc tcccggtctg cctcgacgtc gcgcacatcg ttgacgaggt acaccgccga 4210321 ggcggccagg ctgaacacca cgaaggccat cgacaccttg ctgagcacct cgacgtagtc 4210381 gtagcggaca ccgccgccca acgcggccag cggcgcggcc agcaccagca cgtttttcac 4210441 ccactggcgc gggcggatcg ccttgaccac cccggcgacc aggtttgccg gaggttgagt 4210501 caccacatct tcactcatcc gagctcatct cttccgggcc ctttgccggc ccccgccgac 4210561 gctgtccacg atggccccga cggtggcgcc cagagcaaca cccacggcca catcactggg 4210621 gtagtggacc cccagcagta ttcgcgacag cgccatcggc ggcaccagca caaccggtag 4210681 cggcagcccg gtggctctgc ccatgagcag ggccgcggcc gtggtcgagg tggcgtgtgc 4210741 cgacggaaag ctcagttgac ttggcgtgtc cacgttgacc gcgatggccg gatgatccgg 4210801 ccgctgacgc cgcaccagcc gcttgatcag cacggcgatg gcatgggcga cgaacgcgcc 4210861 cgcccccgcc acaagccatt cccggcggcg ccgtggcagg gctatcgcgc ccagcagcgc 4210921 caggatcagc caaccgatgc agtgctcgcc gaagtgggag agtccgcgcg cagtggccag 4210981 catccccgga cggtcgacca gcgccgactg cacggccacc atcacggcga cttcgccgcg 4211041 tggcgcccgt tcagccatgc tcgggctctt ggtttgccgc cggcagcagc gccgtctccc 4211101 acttctgctt gctggacagc gtcggcaacg cgtcgcgata aatccggcgc atctcctcga 4211161 accgtttcag caactggcgc tgacggcgca acgactgcca cagcaacgcg aacatcttgg 4211221 cccggtcgcg ctgccggtag accacgccgc atccgtcggc cgtggtgacg gtggccccgt 4211281 cgacagtgca cagcaggaac cagcgcgcat cctgggtcgg aacgttgaac tccgggcgac 4211341 ggtggtgttg ggggttggcg gcggtcaggt tgtgcatgat cccgcgggcc agccggtagc 4211401 cgatgaccaa cgggttcacc ggcggcttca ttgccttgtt cttgtgcaac ggcggcggca 4211461 actcactggc cgccggcagc accaccgcgt ccggatagct cttgcggatg cggtgcactt 4211521 gcggcagcgc cgattccagg atcgaaaaga tgtgctcggg gccggcgaga aagtcgtcga 4211581 tggccttgtt ctggattgcc accgtcgaat attccaggca ggcaaggtgt ttcagggttg 4211641 ccttgagatg gctgcggacc aggccgatga cttgcgcctt tgggccgtcc cagtgcatgg 4211701 cggccaccac cagccggttg cgcagatgga aataggcctg ccagtcgatg gcgtcatcct 4211761 tatcgctcca ggccatgtgc cagatcgccg caccgggcag cgtgacggtc ggatacccgt 4211821 gctcggcggc ccgcaggccg taatcggcgt cgtcccattt gatgaacaac ggcagcggct 4211881 gtcctagctc ttcggcgacc tggcgtggga tcatgcacgt ccaccagccg ttgtagtcga 4211941 catcgatacg ccggtgcagc aacttgctac gggagttgtt gtcgttcaac gggtattcgg 4212001 cgaagtcgtg gtcatactcg gcatgcggcg cggcggtcca catgaatatc gaccggtcta 4212061 cgacttcgcc catgatgtgc aggtgcgacg gctcctgcag gttgagcatc tgaccaccca 4212121 ccagcatcgg cgccttggcg aaccggtgca tggccagcac ccgcagaatc gagtccggct 4212181 cgaggcggat gtcgtcgtcc atgaatagga tctgctgaca gtcggtgttt ttcagtgcct 4212241 catacatcac ccggctgtag ccgccggaac cgcccaggtt gggctggtcg tggatggaga 4212301 gccgactacc caatctcgca gccgcggcgg ggaaatccgg gtggtcgcgc accttgcgct 4212361 caccctgatc aggcacgatc accgccccga tcacctggtc caccagcgga tcggcggtga 4212421 gttctcgcag cgcgttgacg cagtctgcgg ggcggttgaa cgtcgggatg ccgaccgcga 4212481 tgttggccgt ccccggagcg gggctggtgg cataccagcc accactgtgc agggtgaccg 4212541 cggtgtcggt ggtgatgtcg aaccagaccc acccgccgtc ttcgaaaggc tgcagcacca 4212601 cttcggtctc cacggcggct ggctgatcct cggtgccggt gaagtcgtgg ccctcaacga 4212661 agatccgggc accggtggcc ttggtccggt agacgtctac ccgcccggcg ccggtcacct 4212721 gcacgcgcaa caccaccgat ttgcacgtcg tccaacgtcg ccaatagcta gccgggaaag 4212781 cgttgaagta ggtggcgaac gacacctcgg actccgcgcc aatctgtagc gaggtccggg 4212841 ttggcgcatg cgcgcgccgg gcgttggtcg ttgactcctc gaggtacagc ttgcgcacgt 4212901 caaggggttc acctgggcgc ggcaggatga cccgagacag caggctcgcg gcgagttcac 4212961 tcatgcgccg tcctgaagca gtgggacgcc gtcgcgcaga tgcggcgcga ggacgttgtc 4213021 gtacatgttc aaggcgctgg caatggccat atgcatatcc agatattggt aggtgcccaa 4213081 ccggccgccg aacagtacct tcgatgacgc ggtctcggac ttcgccctgg cccgataggt 4213141 ggccaacagg gcgcggtcag cctcggtgtt gatcggatag tatggctcgt cgtcgtcctc 4213201 ggcgaaccgg gagtattccc gcatgatcac cgttttgtcc gttgggtagt cacgctcggg 4213261 gtggaagtgg cggaactcgt ggatgcgcgt gtaggggacg tcgagatcgt tgtagttcat 4213321 caccgcggtg ccctgaaagt ccccgatcgg tagcacttcc acctcgaagt ccaaggtgcg 4213381 ccagcccaat cggccttcgg cgtagtcgaa gtagcggtcc agcgggccgg tgtaaacgac 4213441 cggggccgcc gggctgccgg ggcgcagctg gccgcgcacg tcgaaccagt cggtgttcag 4213501 cctgacctcg atgcggtggt cagcggccat gttttgcaac cacgccgtgt acccgtcggt 4213561 cggcaaaccc tcgtaagtat cgctgaaata ccggttgtcg aaggtgtagc gcacgggaag 4213621 ccgcgtgatg ttggcggccg gaagttcttt ggggtcagtc tgccattgct tggccgtgta 4213681 ccccttgacg aacgcttcgt agagcggccg gccgatcagc gagatggcct tctcctcgag 4213741 gttctgcgcg tcggcggtgt cgatctcggc ggcctgctcg gcgatcagct ggcgggcttg 4213801 ctcgggcgtg aagtacttgc cgaagaactg cgataccagg ccgagcccca tcggaaactg 4213861 atatgcctgc ccgttgtgca tcgcgaagac ccggtgccgg tagtcggtga agtcggtgaa 4213921 ctgccgcacg tagtcccaca ctctcttatt agaggtgtga aacaggtgcg caccgtactt 4213981 gtggacctcg atgccggtct gtggctcggc ttcggaatag gcattgcccc cgatgtgcgg 4214041 gcgccgctcg aggacgagca cgcgcttgtc gagttgggtg gccacgcgct cggcaatcgt 4214101 caggccgaag aatcctgagc cgacgacgaa aaggtcaaaa cgagcggtca tcggttgcat 4214161 agggtaaccg accttgctgg caaaacccga tttggcagct cgtggcggtc atggcccgaa 4214221 cgggtttcac cgcaggtgcg catggccgac cagtgtgggt ggccggaggt cgtttggtcg 4214281 cgattgcctc acgattcgat ataaccactc tagtcacatc aaccacactc gtaccatcga 4214341 gcgtgtgggt tcatgccatg cactcgcgac cgcgggagcc ggcgaacccg gcgccacaca 4214401 taatccagat tgaggagact tccgtgccga accgacgccg acgcaagctc tcgacagcca 4214461 tgagcgcggt cgccgccctg gcagttgcaa gtccttgtgc atattttctt gtctacgaat 4214521 caaccgaaac gaccgagcgg cccgagcacc atgaattcaa gcaggcggcg gtgttgaccg 4214581 acctgcccgg cgagctgatg tccgcgctat cgcaggggtt gtcccagttc gggatcaaca 4214641 taccgccggt gcccagcctg accgggagcg gcgatgccag cacgggtcta accggtcctg 4214701 gcctgactag tccgggattg accagcccgg gattgaccag cccgggcctc accgaccctg 4214761 cccttaccag tccgggcctg acgccaaccc tgcccggatc actcgccgcg cccggcacca 4214821 ccctggcgcc aacgcccggc gtgggggcca atccggcgct caccaacccc gcgctgacca 4214881 gcccgaccgg ggcgacgccg ggattgacca gcccgacggg tttggatccc gcgctgggcg 4214941 gcgccaacga aatcccgatt acgacgccgg tcggattgga tcccggggct gacggcacct 4215001 atccgatcct cggtgatcca acactgggga ccataccgag cagccccgcc accacctcca 4215061 ccggcggcgg cggtctcgtc aacgacgtga tgcaggtggc caacgagttg ggcgccagtc 4215121 aggctatcga cctgctaaaa ggtgtgctaa tgccgtcgat catgcaggcc gtccagaatg 4215181 gcggcgcggc cgcgccggca gccagcccgc cggtcccgcc catccccgcg gccgcggcgg 4215241 tgccaccgac ggacccaatc accgtgccgg tcgcctaagc cccgggtcgg ccgaaaacgc 4215301 acccgcggcc aaggcgtcgg tcattgcttc ggcccgtcac aattactcgc ctaagggtcg 4215361 ctaggtgttc tcgagagttt tatcgcaccg attccgtgtc gtctcattaa taccaataga 4215421 aacacacgta acatcagctg gtgccgtccc gcacccgcgc gccgacgacg ctgctcaccg 4215481 cgatggcagc gaccgtcgtc atcgtcgcgt ggatagcgaa tcgtccaccc gccagctccc 4215541 atgaaccatc gccgacgccc aacacccagc tcgccgagca gccactgatc gggctcggcg 4215601 gcggcgtcac ggtacgcgaa ctcacccagg acacaccgtt ttcattggtg gcgttgactg 4215661 gcgacctggc cggtacctcc gctcgtgtgc gcgccaagcg cccggacggt gactgggggc 4215721 cgtggtatca gaccgagtat gaaaccgaac cacgcgatcc ggcgggcacc gacgggtccg 4215781 tggaacttgg aggactcaat ccgggtcccc gtagcaccga tccggtgttc gtgggcacca 4215841 ccaccaccgt gcaggtcgcg gtgactcgcc cgatcgacgc accgataact caaccgccgg 4215901 cggggcggcc gcccaacgac ttgctcgaca gcggtttggg ataccgtcca gccaccaagg 4215961 aacagccatt cgggcagaac atctccgcga tcctgatctc gccgccgcaa gcgccgcccg 4216021 gaacgcagtg gacgccacca accgcagtca ccatggcagg ccagccgccg gccatcatca 4216081 gccgggcgga atggggcgca gacgagtcac tgcgatgcga aacaccggag tacgacaggg 4216141 gggttcgtgc cgcggtggtc caccacaccg cggggagcaa cgactactct ccgctggagt 4216201 ccgccggcat agtcaaagcc atctacactt accacagcaa gaccctgggc tggtgtgaca 4216261 tcgcgtacaa cgccctcgtc gacaagtacg gccaggtgtt cgagggtagc gccggcggcc 4216321 tcaccaagcc ggtcgaaggg ttccacaccg gcggattcaa ccgcaacacc tggggggttg 4216381 ccatgatcgg caacttcgac gatgtggccc ccacgccgat ccagatccga accgtcggcc 4216441 ggctgctcgg ctggcggctg ggcatggacg acgtcgatcc caggagcatg gtggatctgc 4216501 agtcagcggg tagctcgtac accacgtttc cgggtggcgc catagcgcga ttgcccgcca 4216561 tcttcaccca tcgcgacgtc ggcaacaccg actgtccggg caacgccgcc tacgctgtga 4216621 tggacgagat ccgggacatc gcagcacatt tcaacgaccc gccggaggag ctgatcaagg 4216681 cgctggaagg cggcgcgatc tatcagcgct ggcaggcgtt gggcggcatg aacagcgcgc 4216741 tgggtgcacc gacctcgccg gaggccgacg ccgcggatgg ggcgcggtat gcaaccttcg 4216801 ctaagggcgc catgtattgg tcgccggtga ccgacgctca gccgatcacg ggggcaatct 4216861 atgaggcctg ggcttcgcag agctacgaac gcggcccgct gggactgccg accagcgcgg 4216921 agatccagga gccgctgcag atcacgcaga actttcaaca cggaaccttg aacttcgagc 4216981 gcctcaccgg caatgtcacc gaagtcgtcg acgggatcac gacgccactg gcgacgcggc 4217041 ccccgagcgg cccgacggtg ccgcccgaac acttcacgct gccaacgcat ccgatcacct 4217101 gagtcgcggg tgtgcactat tcacattatg tgtgtgcact tttcacattc tggcttttgc 4217161 ggcgcggaat cgccggcgca tagacaccct gtgccattag gctccatttg ccgggctgat 4217221 caccgggtcg ccgcaggcca gtcgagagga acaacgtgtc gttcgtggtc acagtgccgg 4217281 aggccgtggc ggctgcggcg ggggatttgg cggccatcgg ctcgacgctt cgggaagcga 4217341 ccgctgcggc ggcgggcccc acgaccgggc tggcggccgc ggccgccgac gacgtgtcga 4217401 tcgctgtctc gcagctgttc ggcaggtacg gccaggaatt tcaaaccgtg agcaaccaac 4217461 tggccgcgtt tcataccgag ttcgtacgca cgttgaaccg cggcgcggcg gcgtatctca 4217521 acaccgaaag cgctaacggc gggcagctgt tcggtcagat cgaggcggga cagcgcgccg 4217581 tttccgcggc cgcggccgcc gctccgggcg gcgcatacgg ccaactcgtt gccaacacgg 4217641 ccaccaacct ggaatccctc tacggcgcat ggtcggccaa cccgttccca ttcctccgcc 4217701 agatcatcgc caaccagcag gtttactggc agcagatcgc cgcggcgctc gccaacgccg 4217761 tccagaactt ccccgccctg gtggcgaatt tgccagcggc catcgacgcg gccgtccagc 4217821 aattcctggc cttcaacgcg gcgtactaca tccaacagat tattagctcg cagatcggct 4217881 tcgcccagct attcgccacg acggtcggtc agggggtcac cagcgtcatt gccgggtggc 4217941 ccaaccttgc ggcggagctt cagctagcgt ttcaacagct tctggtgggt gactacaacg 4218001 ccgcggtggc gaacctgggt aaggccatga caaaccttct ggtcaccggg ttcgacacca 4218061 gcgacgtgac gatcggcaca atgggcacca ccattagtgt caccgcgaaa cccaagctgc 4218121 tgggcccgct gggagatctg ttcaccatca tgaccatccc ggcacaagag gcgcagtact 4218181 tcaccaacct gatgcccccc tccatcctgc gagacatgtc gcagaacttc accaacgtgc 4218241 tcacgacgct ctccaacccg aacatccagg cggtcgcttc gttcgatatc gcaaccaccg 4218301 ccgggacttt gagcaccttc ttcggggtgc cattggtgct cacttacgcc acattgggtg 4218361 cgccgttcgc gtcactgaac gcgattgcga cgagcgcgga aaccatcgag caggccctgt 4218421 tggccggcaa ctacctaggg gcggtgggtg cgcttatcga cgccccggcc cacgcgttag 4218481 acggcttcct caacagcgca accgtgttgg atacgccgat cctggtgccc acggggctcc 4218541 cgtcccctct gcccccgacg gtcgggatca cgctgcactt gcctttcgac gggattctcg 4218601 tgccgccgca tcccgtcacc gcgacgatca gcttcccggg tgctccggtt cctattcccg 4218661 gtttcccaac caccgtaacc gttttcggca cacccttcat gggaatggct ccgctgctga 4218721 tcaactacat tccccaacag ctcgccctgg caatcaaacc ggcggcttag cgcggcgtgg 4218781 cccgttggtt agtgtcgtag gttgccatgc caagctccaa ccatgcggtt agcagccgct 4218841 gatctgccgc cgcggccaca acctcgtcgt catcgagttg ctcggccgat gcgcagtgca 4218901 ccgcgtcgta gccacgcatg ggccaggtca gccgcgcggg tcacgacctg ctcatccacc 4218961 tcgatggcgt ccatctcgga ccacatctgg tcacggttcg cccgtcgcca atctgcgcga 4219021 ggggccggct cagtcacgca ctcccgagcc acaaaggcgc cgggtcacgt gggccatgct 4219081 aggaccacca gcgctccagc acccgcgcga cgccgtcctc gctattgggt gcagtgacct 4219141 cgtcggcgac ggccagcgcg tcgggatgcg cgttacccat cgccacaccc aaaccggccc 4219201 gcagcagcat cggcacgtcg ttgggcatgt cgccgaacgc caccacctcc gcgtcggaaa 4219261 ttccaagcgg ccgggcaatc tcgtcgacac cggtggcctt gctgataccg agcggcacga 4219321 tctccaccag cccgttattg gtcgagtagg tgatatcgcc ctcgaaaccg acatgcttag 4219381 ccagttcggc ggccatgtcg gcactggcag caccggcttt acggatcagc agtttgatcg 4219441 ccggcgcgct gagcaggtgg tcgatcgaca cttcggtgtt gtccggattc agccacgcat 4219501 gctcgtagcc cggcgagctg acgaactggg gggtcgccgt gtcgtgtgcg cgctcgccga 4219561 tccgctcgac cgccagtccc gcacccggta tgacgcgggt cgcaacttcg gccaacgttg 4219621 ccagggcgtc gacgggcagg gtgcgcaccg acgtcacccg atcggtcccg gggtcgtaga 4219681 tgacggcgcc gttggcgcac accgccatcg gcgcgaagcc gagggcatcg acgatgggtc 4219741 gcacccagcg cggcggccgg ccggtggcca ggatgaagtg cgtgccggcg tctaccgcgg 4219801 catgcaccgc gtcgcgagtg cgtttggtga cggtttctcc gtcatcgagc agggttccgt 4219861 cgacgtcaca cgcgacgagc gccggcacag tcggtttcaa agttggctgg cttgtcagtg 4219921 cgggccgact tggctgcgcc gtgatgaggt cacgccgtcg tatccgcgct tttgccgccg 4219981 cttcgccaat tcagcgattc tgagctgcct ggactcctcc accgtcggcg cgccgccgcc 4220041 cagccgccgc ggcacccagt gctccccctt gggatgtgga tactcctcct gtacgcggta 4220101 gagaatcgca ttcatcgctt ggcgcagcac ggcattgagc tgctcggcat tgccctccgg 4220161 ccgcaccggc gatccgatcg ccgcgacgat cggaatcttg ttgcggaaca ggttctttgg 4220221 atgatccttg ggccagatcc ggtgcgcgcc ccagacgatc atgggaataa tcggcacctg 4220281 cgcctccagc gccatccggg ccgctccggt cttgaactcg cgcagttcga ggctgcggct 4220341 gatagtcgcc tccgggtgta acccaacgag ttccccggcc cgcaaccgct gcactgccac 4220401 cgcgtacgca tcggccccca cactgcgatc caccgggatg agctgggcat gcttgatcac 4220461 gtagttgacc gcccgtacgt cttgcatctc ggccttgatc atgaaccgca gccgccgccg 4220521 ccgatggtgg gcggcgatcg atgccggaac ccagtccacg tagctcgtgt gattgagtgc 4220581 gatcaacgcg ccgccacgtt cggggatgtt ctccaggcct tcgaatgtga tcttgtttcc 4220641 gttggccgcg acgatcgacg gaacaagaat ctccatcatc cggaagaacg gctcagccat 4220701 gtattctcct tcacctctta ccgcgattca tgcggtgtcc ggctagcggc ccttgccgcc 4220761 gcctcgtcag cctccatccg tgccgcctcg gccagcgttg gcgcgccgcc gccgagtcgg 4220821 cggggcaccc agtacgcccc agccggatgc ggataccgct cctgcgcttg ccacagcagc 4220881 gcggtcatcg actcacgcag cgccgcgttg gtctgttcga tgcctgccgc ggcccgcagc 4220941 ggccgaccca cctgtaccgt gaccggcacc ttggcgcgtc ctatctgcct gggatggtcc 4221001 ttggtccaga tccgctgagc accccagaca acgacgggca caatcgggac atccgcttcc 4221061 gcggccattc gggcggcccc cgtcttgaac cctttgagct cgaagctacg gctgatggtg 4221121 gcctccgggt agaccccgac cagttcccct tcgcgcagcc gctgcaccgc caccgcatag 4221181 gcgctaccgc cggcgccccg gtccaccgga atggtccggg tgtgcctgat caggaagttg 4221241 accaaccgca cccgttgcat ctcggccttg atcatgaacc tcatccggcg acgccgacga 4221301 tgcatggcca acgcggccgg cagccaatcg acatagctgg tgtgattgat agcgaccacg 4221361 gcgccgcctt ggtcgggcac attctcctcg ccgacgtagg tgatccgggt tccggtggcc 4221421 agcaccagca actgggccag gatctctaag acgcgatagg tcggctccgc catcggtcac 4221481 tgctccggcg ccccggcggg atgggctcgc tgagcgcggc gcgcagccct aaccgccgcc 4221541 tcctgcgcgt ccaaccgggc cgcctcggca agcgacgggg cgccgccgcc cagccggtgc 4221601 ggcacccaga actcgccggc cggatgcggt ccgtacagtt cttgggcccg ctccagcaaa 4221661 tgttgcatcc gggagtgcag caggccgttc agttcagcgg tgggcagcgt cggttcgatc 4221721 cgttcaccga cgacaatcgt gaccggcacc ttcgggcgaa acagcttttt gggacggtcc 4221781 ttagtccaga tccgctgcgc accccaaaca atatgcggaa cgatcggcac cccggcctcg 4221841 atcgccattc gggccgcccc cgtcttgaat tccttgatct cgaagctgcg gctgatggtc 4221901 gcctcggggt acacgccgac gagttcgccg gccttcagca tcctgacggc ggcgtcgtag 4221961 gacgcggacc cgtcctgccg atccaccggg atgtggcgca ggctgcgcat aatgggaccg 4222021 gtgatcttgt gatcgaacac ctcctgcttg gccatgaacc gcaccttgcg cccgaggccc 4222081 tgttggtagg cgggcaaacc cgcaaaggtg aagtcgaggt agctggtgtg gttgatcgcg 4222141 acgacggcgc cgccgctggt cggtaggtta tccacacccg tgacggtgat cttcaaaccc 4222201 tgtatgcgcc aggacaagcg agcaagccga atgacggtgc cgtataccgg ttccacagca 4222261 gttcagccta gtggtcccgg ctgcaagccg cccaaagtgg cgaaaaccca aattgacgaa 4222321 agaggtgagc cgtgtccttc ccctcatcgc cacccgcgct gcccgcgatc gttgcccggt 4222381 ttgccgtcgg caggccggtg cgcgcggtgt gggtcaacga actgggcggc gtcaccttcc 4222441 gggtggactc cggcatgggc gccggctgcg agttcatcaa ggtcgccagg aggggtaccg 4222501 ccgacttcgc taatgaggcg cggcggctgc gctgggccgc gccgtacctg gcggtgccgc 4222561 gggtactggg tgtcggggtc gacggcgatt gggcctggtt gcacaccgat gcgctgcccg 4222621 gcttgtccgc ggtgcacccg cgctggcggg cgtccccgca ggtcgcggtc ccggcgctgg 4222681 gtgcggggct gcgcaccctg cacgacagct tgccggtgca ctcatgtccg ttcgactggt 4222741 cgacggccag ccggctggcc aagctggccc cggcgcgacg cgcggaactg ggtgactcac 4222801 cgccggttga tcggttggtc gtctgtcacg gcgacgcgtg ctcacccaac accatcctcg 4222861 atgacaccgg ccgctgttgc ggacacgtcg acttcggcaa tctcggtgtg gccgatcggt 4222921 gggccgacct cgcggtcgcg acgctgtcgt tgcaatggaa ctttcccgac tacccgggcc 4222981 aggtcagaga tgacgagttc ttcgccgcct acggtgtggc gccggacccg gctcgcatcg 4223041 actactaccg ccggctgtgg caggccgaag acgacagctc acgctaagct cgaggctgcg 4223101 ctttgcgctc gtaagctctt ccgaaaggta gctgtgcagg tcacaagcgt tggtcacgcc 4223161 ggctttctga tccagaccca ggccggcagc atcctgtgcg acccttgggt caatccggcc 4223221 tactttgcgt cttggtttcc gttccccgac aacagcgggc tggactgggg cgctttgggt 4223281 gagtgcgatt atctgtatgt ctcgcaccta cataaggacc acttcgacgc ggaaaatcta 4223341 cgagcgcacg tcaacaagga cgccgtcgtg ctgctgcccg actttccggt acccgacctg 4223401 cgaaatgagt tgcagaagtt aggatttcat cggttcttcg aaaccaccga ctcggtcaaa 4223461 caccgcctga ggggacccaa cggcgatctc gacgtgatga tcatcgcact gcgggccccc 4223521 gccgacggtc cgatcggcga ctcggcgcta gtcgttgccg acggcgaaac aacggctttc 4223581 aacatgaacg acgcccgccc ggtcgatttg gacgtgctgg catcggagtt cggtcacatc 4223641 gacgtgcata tgctgcagta ctcgggcgcg atctggtacc cgatggtcta cgacatgccg 4223701 gcgcgcgcga aggatgcgtt cggcgcccaa aagcggcaac ggcagatgga ccgtgctcgc 4223761 cagtacatcg cgcaggtggg agcgacgtgg gtggtgccgt cggcggggcc gccatgcttt 4223821 ttagcccccg agctgcgcca cctcaacgac gacggtagcg atccggccaa tatcttcccc 4223881 gaccagatgg tgttcctgga tcagatgcgg gcgcacggcc aggacggcgg gctgctgatg 4223941 atccccggct cgactgcgga tttcactggt acaaccctga attcattgcg ccatccactg 4224001 cccgccgaac aggtcgaggc catctttacc accgacaaag ccgcatacat cgctgactat 4224061 gccgaccgga tggcgccggt gctcgccgcg caaaaggctg gctgggccgc cgccgccggc 4224121 gagccactgc tgcagccgct gcgcaccctg ttcgagccga tcatgctgca aagcaacgag 4224181 atctgcgacg gcatcggata cccggtcgag ctcgccatcg gtcccgaaac cattgttttg 4224241 gactttccga aaagagctgt acgagaaccg attcccgacg agaggttccg ctacgggttc 4224301 gcgatcgcgc cggagctggt gcgcacggtg ctgcgcgaca acgaacccga ctgggtcaac 4224361 accatcttct tatccacccg atttcgggca tggcgggttg gtggctacaa cgaatacctt 4224421 tacacgttct tcaagtgtct gaccgacgaa cgcatcgcct acgccgacgg ctggttcgcc 4224481 gaggcccacg atgactcctc atcgatcacc ctgaacggtt gggagatcca gcgccgctgc 4224541 ccccatctca aagccgacct atcgaaattc ggtgtggtgg aaggcaacac gctcacttgt 4224601 aacctgcacg gctggcagtg gcgtctggac gacggtcgct gcctcaccgc ccggggccat 4224661 caactacgca gttcacggcc atgatgcagt tctacgacga cggcgttgta cagctggatc 4224721 gtgctgcact cacgctgcgc cgctatcatt ttccttcggg cacggccaag gtcatcccac 4224781 tggaccagat ccgcggatat caggctgaat cgctgggctt tttaatggcc cggttcaata 4224841 tctggggcag gccagacctt cgccgctggc tgccactgga cgtgtaccgg ccgctgaagt 4224901 cgacgttggt caccctcgac gtaccgggga tgcggccgaa accagcctgc acgcccacgc 4224961 gccccaaaga attcatcgca ctgctggacg agttgctcgc cctccaccga acgtgaaccc 4225021 acggtttcgc gcgcgatttt cgcactgccc tggggcacag cctcactcca gacttaagcc 4225081 acagcgacga tccaagcgac gtgtcatgtg cctggtttaa gtatcgcgag cgtgccgtcg 4225141 gcggtgcgga tatagatgga tttcatggcc gcgatgtaat tggcgacgga ttcgcttgcg 4225201 atcgggttgt ccgggaataa taccgtcact gtggtctgat gctgataccg attgacccac 4225261 atcgagacct gatgagaaac cctaccttcg tcgtaaatcc taaaattcag atcggaatta 4225321 gcgaccgtag aaagaggcgc aatgctggca tccagaaagg acatcacgaa attgcccggc 4225381 cggggcggcc tcagccccgt ttcggggcgt gccagctcca atacgcggtc gaatggtacg 4225441 gtcgccaggt ccttacccga atcgaaggag atctgcgcga cacgggcggc gctatcgaaa 4225501 agtcctgagg cgaccggcac ggtgatcggc accaacccgg taaaccagcc cgtcgttctg 4225561 agttctgtcg gcgtcctacg tgtatcagtc gtcgttacca cgtcaaacgt ttcacagttg 4225621 gtcaactcgc gctcagcgag ggcggcgcag gcgaaaacgc caccgctaaa acgggcgccc 4225681 gcagcgacgc aggcggcttc gaatcgctcg ccctgttgct cgtccatcag cgtttcggta 4225741 agcagctttc cggtatgggg caccgataga tcgccgagcg gcaacgggaa gtgcggcagg 4225801 gttccgtcgt tgttggcagc gaattcgacc caacggcgca cccgggcgga gtccaacgtc 4225861 aaggcggccg tgtcggcgta ctgtcggaca cagtggtcgt cgtagcggcc cgccggcggg 4225921 agctcgatcg gcgggtcgcc tcccaccaat gcggagtaca tcatatggat ctcgatgaaa 4225981 aggacgccca caatcatcgg atcgacacag agatgagcga tactcgcata gaaggtgaag 4226041 tgatcgtcac tctgaataat cccgaacaag aagcagtccc actgcaacgg ctgcggcgtt 4226101 gcaatgtggt ggcgcagctc cgccgacgtc atgttctgat gctcagcttg gacgacttcg 4226161 atatctgcag ggtcagcgat ggtatgccga acgatgtgtt cggcattgtc gaactcaaac 4226221 caactgtggt aggtgtcgtg gcggcgaagg tgtgcgttga tcgcataatt catggcgcgg 4226281 atgttgcacc ggccaggtag atcccaggtg aagatcatca ggcgcgacat atcgagaccg 4226341 cgcgctacat gatcgcgata acgtcgaagg tgttgagctt gttgatagct gggcggcacc 4226401 tcacttatcg gcgcttgccg ggctttcgcc ttcgccgtcg gtgatgcgtg ccaacagata 4226461 atcgaacctg ggtccggcgt ccagtcgcgg agcgttgtaa tgctaaacac tcattcctcc 4226521 tgcactcgga ccgagccccg ccagggcacg caagtaagct acggccagac ggtgtgacac 4226581 tcaaaccggc gggcgtaatt tcctccgacg acgctccgca gaccacaatc gtcagcggcg 4226641 gagtacggtt gctcaccatg tggtccaccg tgctggtctt ggcgctctcg gtgatctgcg 4226701 agccggtacg gatcggtttg gtggtcctca tgctcaacag gcgccgcccg ctgctccatt 4226761 tgctcacatt cttgtgcggt ggttacacga tggctggtgg cgtggccatg gtgacgcttg 4226821 tggtcctcgg ggccactccg ttggccggac atttcagtgt ggccgaggta cagatcggga 4226881 ccgggctgat tgccttgctt atcgcgtttg cgctgaccac aaatgtcata ggcaagcatg 4226941 tccggcgagc tacccacgcc cgcgtcggag acaacggtgg cagggtccta cgggagtcgg 4227001 taccgccaag tggtacgcat aagctggctg tgcgtgcacg ttgttttctg cagggcgatt 4227061 cgctgtatgt cgccggggtg agtggcctag gagccgcact gccttcggcc aactacatgg 4227121 gcgcgatggc cgccattctt gcctccggcg ctacgccggc aacacaggca ctggctgtcg 4227181 ttacgttcaa cgtggtggca ttcacagtgg ccgaagtccc cctcgtcagc tacctggcag 4227241 caccgcgtaa gacccgcgcg ttcatggctg cgctgcaatc atggctgcgg tcccgtagcc 4227301 gccgcgacgc cgcgttgctg gtggccgccg gaggttgcct gatgctcacg ctaggcctga 4227361 gcaacctgta ggcggcggcg ggcttgccta acgcagagct ctcacatgaa atgtccaggc 4227421 gtctccgact gcgttgcgac cgtaaggcac gataacgtgt ttgctattgc tgctggtttg 4227481 cgttggtcgg ccgctgtacc gccgctacac aaaggggacg ctgtgaccaa actgctcgtc 4227541 ggggccatcg cgggcggaat gctagcttac gcagctatat tgggcgacgg aatcgcttcg 4227601 gccgatactg cgttgatagt acccggtacc gcaccgtccc cgtacgggcc actcaggtcg 4227661 ctctatcatt tcaatcccgc gatgcagcct cagatcggcg cgaattacta cacccccacc 4227721 gctacccgcc acgtcgtttc atatccaggc agcttttggc ctgtcacagg cttgaattcg 4227781 cccaccgtcg gcagttctgt cagtgccggg acgaacaatc tcgatgcggc gatccgcagc 4227841 actgacgggc caatcttcgt ggccgggtta tcacagggca cgctcgtgct tgaccgcgag 4227901 caggcacggt tagcgaatga cccgacggct cctccccctg ggcaactcac attcatcaag 4227961 gccggcgacc ctaacaatct tctttggcgg gcgtttaggc cgggaaccca cgtgccgatc 4228021 atcgactaca ccgttccggc cccagtggaa agccagtacg acacaatcaa tatcgtgggc 4228081 cagtacgaca ttttttctga cccgcctaat cgtccgggca acctactcgc tgacctcaat 4228141 gcgattgccg cgggcggata ctacggccac agcgccaccg cattctcgga cccagctcgc 4228201 gttgcgccta gggacattac gacgacaacg aacagtttgg gtgcgacgac cacgacctac 4228261 ttcatccgga ccgatcagct acctctggtg cgggcgctgg tggacatggc gggcctgccc 4228321 ccgcaggcgg cgggaacagt tgatgccgca ctgcggccca taattgacag ggcttatcag 4228381 cccggaccag cacccgctgt gaacccgcgt gatttggtcc agggcatccg cggtatcccc 4228441 gccatcgccc ctgccatcgc catccctatc ggcagcacca ccggggccag tgccgccacc 4228501 agcaccgctg ccgccacggc agcagcaaca aatgcgctcc gcggggccaa cgtgggcccg 4228561 ggcgccaaca aggcgttgtc gatggtccgg ggtttgctac ccaaagggaa gaagcactag 4228621 ccataaagtc cacgacctac ggtggcgttt cgcagttggg ggtgtaaagg gggttgaggt 4228681 cttcgacgat ggcggttgct gctggcccac caatccgttg ctgctgacgc caatccatcg 4228741 ggaaggccct gggtggcgtc ttggtgcgcc cggaggggca gcccgttggc gcccgtcgtc 4228801 gagcgtgaac tgagggcgga cctcgggcag acacgccgag gtcttccttt tgggcagcgt 4228861 ggaaccgccc atcatcgaaa gacctcgacc cctaccccgg caacgacgcg ccgactacct 4228921 cacaccctca actgcgaaga gatcctaaag cctgagcccg tcgtgtaacc aaagaccgat 4228981 cagatcgtcg tcgtcgggcg gtgattgctc ttcttcttcc ttgggcaaca acggcttacg 4229041 tttggttcgt tgggcacggc cacggcgtcg gccaaggggc caccaggttg ccggccgcca 4229101 gcttgacggc aaccaccagt tcgcctgccc aaccaacacg gcaatggcgg gcacggtaac 4229161 ggtacgcacc aagaaggtat ccagcaaaag cccggtccct aggacgaacg caccttgaac 4229221 cacgctaccc aagctggcga ataccagacc gtacatcgag gcagccatga tcaaacccgc 4229281 cgcagtgatc acaccacctg ttgaggccac ggtccggatg acaccggaac gcacccccaa 4229341 gacggcctct tcacgcagcc tagaaataag cagcatattg taatctgcgc ccaccgcgac 4229401 caatataacg aaggtcaatc ccggaatgct ccaatgcatt tcctgaccga gtaaaaattg 4229461 gaacacgata acgccaatac cgagcgccgc caggtacgat acgataaccg agccgatcag 4229521 atacagcggt gccacaatcg cacgcagcaa aacgatcaat atgagcagaa cgatgcagac 4229581 ggtcatggcg atgatcaatc ggaggtcgtg atcggagtag tcgcgcgtgt ccttgagaac 4229641 gacgggcaat ccgacgacag acaccttggc atcggccagt gcggtatttg gttgcgcccc 4229701 tcgagcggcc gccgtgatcg cgtcaatttg gtccatggca gcagtgctga atggattcag 4229761 gtcggtttgt atcaaatacc gtattgagtg gccgtcgggt gaaatgaagg ccgccgcgac 4229821 ttttttgagt tggtctacat tcagcccgcc tagcagatcc cgatactctg acggcatcgt 4229881 ctcggctttg acgctctcac cggtggcata cgacaacaac tccgggggaa tatagaaccc 4229941 cgccatcgcc ggcgtggtcg cggtgtcctt cattgccaat aggaacgccg aggcctcgcc 4230001 caacccgaaa cccatcttct tcacctggtc gaccaacaac tgcacgccct cagccagttg 4230061 ccggctcccg tcggcgagat cattgacccc cttgttcacc aggttgatct tggatcgcac 4230121 accaccagga ctgctcatcc ccagtgaacc catcgccctg atgacggtgg ccagcgcccc 4230181 gcgtaatccg gacacggtgg ctgccagggt ctgcactgcg cgcgtggcct gcagctgtcg 4230241 agccaactca gatatctttg ccagcgttcc gtcgtcgcgc gctgtgacca aacgctgcag 4230301 ttcggtgcgc gcactggcac aagccggatc ggcagtgcac atcgggctgc tatccagcgc 4230361 ccccagcacc gggcttgccc actcggtgtt gttcgctaca aagctcgcat ccgcgtcaat 4230421 ggtgtcaccg agtgcccgca tgctgccgat cagcttctcc gcgccttcca gttcgccgag 4230481 aaccctgttg cccccgagca ggtcctgaag gtacgccagc gcgtcgatga ggccgccgac 4230541 cgtggatatg gcccggttaa cttgggcccg tacgtcgccg agtttgctcg ccatcaggtt 4230601 ggctccaccg gccagtttgt cgatgtcgcc ggtgtgcgca gcgatctgct tggaaccctc 4230661 atccagcttg ctgccgactt cgccagcctg ccaggacgtc cgggcctgct ccagcgaccg 4230721 tccagcgggt cgggtaatgc ccctgaccat cgcgacaccc ggcacttggc tcacccgctg 4230781 caccatctgc tctaggtcgg cgagagcctt cggcgtgcgc agatccgtcg aggattggat 4230841 gaacaggtac tcgggaatga tcaggttaga cgggaaatgc ttgtccaacg cggcataccc 4230901 gatcgaactc tcgacggaag ccggaagcgt cttgcgatcg tcgtagttgt accgggccag 4230961 tcctgcgcag ccggccagaa taaccagcac cagcgcgctg gcgagcagat gagtcttggg 4231021 ccgacgcacg atgtgcaccc ccgaactccg ccaaaagcgc cgggtgaggt cacggcgcgg 4231081 cgcgatccaa ccgcgacgcc cggtcagcac catcagggcg ggtagcagtg tgacagctgc 4231141 gaagaagacc acggctaccg agattcccaa catcggacca accgttttga gaattcccag 4231201 ttgggtaaac accatcccga gaaaggtgat tgctacggta gccgcggatg cggcgatcac 4231261 cttaccgatg gatgtcaatg ccttcttgac ggcttgatcc gaatccgcgc cctgccgtaa 4231321 atagtcgtga tatcgactaa tcagaaatac cgcgtaatcc gttcccgcac cgaccatcat 4231381 cccgctcata aaaataatgc tctggttagc aataccgagg cccgccaagc cggctattgc 4231441 aacgaggcgc tgtgcaacca ccacggacat gccaattgtt atcaatggca acaccatggt 4231501 gatcggattc cggtagatga tcagcaaaat gaccaacaac aggatcgtga tcgcaaactc 4231561 gatgcgactg cggtcccgtt gcccggtgag gttcagatcg gcgacggtgg ccgcgggccc 4231621 ggtcaggtta gccgtcagtg tcgagcctgc gacctggtgt tcgacgatgt cagcgacgcg 4231681 ggcgtacgcc tgcttggact gggtcgaacc caggtcgccg ggaaggccga ccggcaggat 4231741 ccaggcctga ttgtctttgc tggtcatgag ctcccgcagg ggcggtgtgg tgacgaagtc 4231801 ctggagcatc acgacgtctc gagtatcgcg tcgcagggcg tcaaccagct ctttgtagct 4231861 gcgttcatcg gccgcgccga gccctttggc atcgctgagc accaccaccg caacgctctg 4231921 caacccggct tcacgaaatg ccgcggtcat ctgccgggtc gagaccaaca ccggggcgtc 4231981 cgatggcaga atcgccactg gatgccgctg ggagatcgcg tccagggacg gcaccgtcgg 4232041 cgcaagcaga cccgcaagcg cgacccagaa ggcgatcacc acccacggcc ttcggacgat 4232101 aaggcgccct agccgcggaa agacaccccc gtcaccggtt ggcctaagcg gtttcgatcg 4232161 taagttcgtc gagggtctcg gtgttctgac aggctgcatc aagacgtcgc acattcctca 4232221 tctgctccgc acgtgcccgc cttgagcgcc agccgtggtg gtcgctgtga ggcgagtgag 4232281 acagcagggg atcggtcacc tgacgaattt acgtgcgcaa ccactaagct tctctatcta 4232341 ccgtcacatt cgcaaccttt agattgcaga tatcgataaa atcacccgcg cgacaagacc 4232401 gccatgtcat cctttcgatg ttatttcgcc ggcctgggga aagcgcaacg acgttgccta 4232461 cacgttccgc cgtcccaccg ttggcaatgc gcatacacac cgatctaatt gccctcagat 4232521 atgcggtaac ggattcgcga gcgaccggat tatctgggaa tagcacgctc gccgcggtct 4232581 cgtcgaaacg accgaccatc gtacttagcg gataggtgac cctcccgtcg ctgtaggtac 4232641 caacgttgag gccctcgaac agtttcgtca ccgccgagag cggtcccact tgtgcgtcga 4232701 aaaagttcac cagggaaaaa agcggttggg gcctgcgcag cgacggcgac aattcgacga 4232761 cccgttcgaa cggcacttta gccagatccg caccagtatc gaaggaggtc tgcgcgattc 4232821 gtgcaatctc gttaaaggac aatccggcga ctggaacggt caccgggatc tgcccggtga 4232881 accacccctg cgtcataagg tcggctggtg tgcggatatc tttgggagta attccaaaat 4232941 aggtatcggc gccggtcaac tcgtgtatcg cgatggcgat gcaagccagc atgccaccaa 4233001 tgaaacgagc gttcgccgcc atgcaggcgg attcgaatcg ctgtgtttgc tgctcgtcca 4233061 ttagcatcat gctgagcagg tcgccgccgc agcgtacaga cggatctccg aggggcagcg 4233121 gaaattccgg gaaagttccg ttattgattt cggcgaagtc gatccacgcg cgcacctccg 4233181 gggaatcgac ggtcaacgcc gaggtgtact cgtgctgcct gacgcagaag tccacatagc 4233241 tgccagcctc cgataaccca atcggtggct cacccattat cagcgcggtg tacatcgact 4233301 ggaactccat gagtccgact cctacgaact gaccgtccgc atgcagatga tcgatgctgg 4233361 catagaacgt gaaggagtct gctcgctgaa tgactccgaa gctgaagcag tcccaatgaa 4233421 gcgaatccgg tgtcgccacg atgtgctgtc gcaggtccgc gctcgtcatc tcgccatgtg 4233481 tggtcggaac aaattcgata tccgccggat cggcgatgct gtgccgaacg atgtggtcgg 4233541 tatctcgaag ctcgaaccag ctgcggtatg tatcgtgccg acgaaggtgc gcattgatga 4233601 cataggtcat ggcgcgcaga tcgcagtgac caaacacctc aacggacgca atgagcagcc 4233661 gcgagtgatc gagcccccgg gcagcctgct cagaaaagct ccgaatttgt ctggcttgta 4233721 cataactggg aggcacagca ctcaccggcg ctgcaagggc tttcgcgcac gaggcaggtg 4233781 ttgggtgcca cgaaactaac acgccgggcg ctgggtccca gtctttgacc gctgacaact 4233841 ctactggtcc tattcgcact aatagctcct atttcagcgc gtgcggaata cgtatgcggc 4233901 gaaacgttct tactgtgacg acagcgcggc agcaggagcg tcgtcgggcg ccagctgttc 4233961 atacaagtga tccgctaagc cccgcaccgt ggcgctgacg ttcttgggtg ccaaccggat 4234021 tccggtctcg gtctcgatcc gagtgcgcag ctctagtgcg cccaacgaat caagtccata 4234081 ctcgggtagc gggcggtcag ggtcgacggt gcgccgcaga atcaggctga cctgctcggc 4234141 gaccagctgc cgaagccgcg ccggccactc gtcgcgtggc agctcgttca gctcgacgcg 4234201 gaatttgctt gtgcccgaac cgttgctgct ggagaacact tcgaaaaacc ggctgcgctc 4234261 tgcgaaggcg accagccacg gggctccgat gaccggggca tagccggtat agacgcggtt 4234321 gtggcgcaat agcgcctcga acgcgtaagc accttcgtcg ggagtgatcg ccgtgtagtt 4234381 gctttcctcc aatgccgaag cccgcgcggg cgatgccgac caccacccca actggccgat 4234441 atccgaccag gctccccacg cgatcgcggt agccggcagg ccctgagctt gccgccaatg 4234501 cgcgaaggcg tccagccagc tgttggccgc tgagtaggca ctctgtcccg gcgagccggt 4234561 gagagctgcc gccgacgaaa acaagcagaa ccagtcaagc ggctgtccgc tggttgcttc 4234621 atgcaactcc caggcaccgt gaacctttgg cgcccagtcg cgcgccagca actcgtcggt 4234681 gatattggcc aaggtggcgt cctcgaccac cgcggccgcg tgtagcacgc ctcgtaccgg 4234741 aagcccggtg gccacagcgg tcgccaccaa ccgctccgcg gtacccggtt gggcgatgtc 4234801 accgcattcc accacgactt cagagcccat cgccgcgatg gcctcgatcg tttccctcat 4234861 cttttgcgtc ggctgggtgc gggaattcag cacgatccgg ccgcaaccgg ccgcggccat 4234921 cttctcggcc aggaacagcc ctagcccacc gaggccgccg gtgatgatgt aggagccgtc 4234981 gggacggaac acctgagctt gttccggagg cagggtaacg aggctttttc cggtctgtgg 4235041 gatgtggagg acgagtttgc cggtgtgctc ggcgttgccc atcacacgga tggcggtggc 4235101 cgcctcgacg agggggtaat gggtgctctg cggcatcggc aactcgccgg ctgcggtcaa 4235161 gcgatagacc gtgccgagca ggtcgcgcag ctcttctggg tgtgtcgcag acagcaaccc 4235221 caggtctacg gcgtagaagg acaggttgcg ccggaaggga aagagcccca gcttggtgtc 4235281 accatagatg tcgcgcttgc caatctcgac gaaccgtccc cggaaggcga gcagtttcag 4235341 cccggcaagt tgcgcggcgc cggtcaccga gttgagcacg acatcgacac cccggccgtt 4235401 agtgtcccgc cgaatctgct cggcgaactc gatgctgcgc gagtcataga catgctcaat 4235461 acccatgttg cgcaatagct ctcgacgctg tggggtaccg gcggtggcga agatctcagc 4235521 gcccgccgcg cgggctatag cgatcgccgc ttgtccgacc ccgccggtgc cggagtgaat 4235581 tagcaccgtg tcacccgccc taatccgggc gagctcatgc agtccgtacc aggcggtggc 4235641 gtgcgcggtg gtcaccgcag cggcctgtgc gtcacccagg cccggtggca gcgtcgcggc 4235701 cagccgagcg tcacacgtga cgaatgtgcc ccagcagccg ttaggcgaca tgccaccaac 4235761 atggtcacca accttgtggt cagtgacgcc tggtccgacc gcggtcacca cgccggcgaa 4235821 atccgtgccc agctggggca ggtgtccctc gaagctgggg tagcgaccga aagcgatgag 4235881 tacatcggca aagttgacgc tggacgcacg gaccgcaacc tcgatctgtc ctggtcctgg 4235941 tggaacgcgg tgaaacgcgg ccagctctat cgtttgcata tcgccggggg tacggatctg 4236001 caggcgcatg ccgctctgct gatgatccgc gacgatggtg cgccgctcct gaggacgcaa 4236061 cggggtcgga cacaagcgcg ccacgtacca ctcgttgtct cgccaggcgg tctcgtcttc 4236121 ttccgacgtg gccagcaatt ggcgtgccag ctgctcgaca ccggtctgtt cgtccacgtc 4236181 gatctgggtg gcacgcaggt gagggtgctc ggcgccgatc gtccgcagta gaccacgcag 4236241 cccgccctgc tcaagattga cgcagtcgtc ggccagcacc cgctgggcac cccgcgtcac 4236301 gacgtacatg cgcggcaccg ccccgggaag gtctgacaat tcgcgagcga tacccaccag 4236361 ccggcgaacg tactcagcgc cgcgatccgc gctcccctga tgcggcgtac cggtgttcga 4236421 cccggtgagc acgaccacgc cgctaaactc gtcgctacca acttgatcgc gtagctggtc 4236481 ggcggcggcc aactggtcgt cgtgcagtgg ccaccgcatc gtcgtgcacg ccgcgctgtg 4236541 ttccctaaac gcgtccgcta gccgggtagc ggtcacatca gaggcagcgc agtcactgat 4236601 cagcagccat tttccagcgc cagaggggtc catctcgggc agctcacgct ggtgccattc 4236661 gatggtgagt aagcgctcat tcagcacccg attgtgttta tcgcgctcgg acactcccgt 4236721 accgattcgc agtccgcaca cggccagcaa caccgtgccg tgcgcgtcca gcacgtcgat 4236781 atcggcctcg acgccgacca actcgacttt ggtcacccgc gtgtagcaat agcgagcggt 4236841 acgcaccgga gcataggcac ggactcggcg cacccccaac ggcaccaata ggccgctacc 4236901 taccgactgg ctatcgggat gcgcgccgac cgactggaaa caggcatcca ggagggccgg 4236961 gtggattgcg tacaggccct gctgcgaacg aatcgagccg ggcagcgcga cttcggccag 4237021 cattgtggcg gtcgcatcct ccgcgacata ggccacggcc aggccggtga aggccggacc 4237081 atattgcaca ccgtgcttgt cgaattgccg gcgcagatcc tcaccgtcca cgcggcaagg 4237141 gtgggcttcc aataaggagg ccatgtcgta cgccggcggc tcgcattcgc cggatacctg 4237201 ctgcagcacc gccgacgcac gccgcaagtg atgcccaacg ccttcctgca aggcctcgac 4237261 ggcgaagtcg acgacaccgg gcgaggtcac cgttgccacg gtggacaccg gggtctggtc 4237321 atccagcagc agcatcgcct caaagcgcat gtcgcgtact tcggactgct cgccgaggac 4237381 ggcacgggcc gcagacaacg ccatctcgca gtaggcggcc cctggaagag cagccacgtt 4237441 gtgtatccgg tgatcgccca accagggcaa ggttgcggta ccaacatcgg cctgccaggc 4237501 gtggcgttcc ggctcttcgg gcaatcgcac gtgtgcgccc aacaacgggt gcacggctac 4237561 cgtggagcca cccggcgacc gattgtcaac gccttcgcgg tcatagaaca ggaaccggtg 4237621 cgaccacgcc ggcagcggag catcgaccaa gcggccttgg ggacagagca ccgagaagtc 4237681 cactgccgca ccagcgttgt gcagatccgt cagcaggcga cggagcccca gcggcaatgg 4237741 ctgctcccgc cgcataccgg ccagcgcggc aaccggcatg cctacactgc cggcaatctg 4237801 atcgaccgcg tgggtcagca gcgggtgcgg cgaaagctcg gcgaagactc ggtacccgtc 4237861 gtcgagcgcc gagcgcaccg cagcggagaa ccgcacggtg tggcgcaaat tgtcggccca 4237921 gtaacgcgcg tcgcacgccg gcgcttcgcg cgggtcgaaa agcgtcgccg aatagtaggg 4237981 aatctcagga gctttcggat tcaggtcggc cagcgcagct atcaactcgt cgaggatcgg 4238041 atccacctgc ggcgaatgcg aagccacgtc gacggccacc gcccgcgcca gcacgtctcg 4238101 ccgctcccat atgtcgacca gcttgcgcac cgactcggtg cctccggcga tcacggtgga 4238161 ctgcggcgcg gtcaccacgg cgaccaccac atcgtcgatg cctagagcgg tcaattccga 4238221 ctgcacagct aaggcaggca actccaccga cgccatcgcc gcggaaccgg cgatcgtcgc 4238281 catcagtttt gatcgtcggc agatgacgcg taccccatct tcggctgaca gcactcctgc 4238341 gaccacagcc gcggccgact cacccattga gtggccgatc acggcgcccg ggcgcactcc 4238401 gtatgccgcc atcgtggctg ccaacgcgac ctgcatcgcg aagatggtcg gctgaactct 4238461 gtcgatgcca gtcacggtct cgggcgccgt catcgcctcg gtgaccgaga acccggactc 4238521 cgcggcgatc aatggctcta gctccgcaac ggtcgcggcg aacaccgatt cgttcgtcag 4238581 cagatcggcg cccatcgctg cccactgcga cccttgcccg gagaataacc agaccggccc 4238641 gcggtcatcc tgccccaccg cgggctggta aacggtgtca ccgtcggcga cctcgcccaa 4238701 gccggcaatc agctcgtcga cgctgctcgc gatgaccgcc gtgcgcaccg accggtgcgt 4238761 acgccgccgc gccagcgtgt acgcaagatc cgagagcacc agggagtcgg cgtgctgctg 4238821 tatccagtcg gtcaaccgct gagcagtctg ccgcagcgcg tcggccgagg aagcggacag 4238881 cgtgaacaag gcaggggtgc cggtcggggg ggtgctcgcc gcgtggggct gggcttcggt 4238941 ttgcggagct tgctccacaa cagcgtgcac gttcgttccc gagaacccat aagacgacac 4239001 tgccgcccgc cggggcacct gacgaccgtt ggtgggccac ggtgtggtca cctcgggcac 4239061 gaagaggttg gtggtgatgc cagcaatctc atcgggcagc cgagtgaagt gcagattacg 4239121 tggaaccaca ccatgtttca gagcgagaac caccttgatt agccctagca ccccggcggt 4239181 cgactgggtg tgtccgaagt tggtcttcac cgatgcgagt gcgcacgggc cgtcgacccc 4239241 atacacctcg gagacacttg catattcaat ggggtcaccg atcggggtgc cggggccgtg 4239301 cgcttcgacc atgccgaccg tcgcggcgtc cacgccaccg gcagccaacg ccgctcgata 4239361 agccgcaacc tgtgcgggct gcgaaggcgt cgcgatattg accgtgtggc catcctgatt 4239421 tgcggacgtg ccacgaatta ccgccaggat ccggtcaccg tcggccaatg catccggcaa 4239481 ccgcttgagc accaccacgg cacaaccctc gcctgacacg aacccgtcag ccgcgacatc 4239541 gaacgcgcga caacgtccgg tcggggacaa catgcccaaa gcggatccag cagcggcctt 4239601 gcgtggctcc agcatcaagg cgacaccccc cgccaaggca acgtcgcttt caccctcgtg 4239661 caggctgcga cacgccatgt gcacggccgt caggccggac gagcatgcgg tatcaacggt 4239721 tattgccgga ccgtgcagtc gcatcgcgta ggcgacccgg cccgacgcca tgctgaagct 4239781 gttgcccaga tatccgtacg gctcctccaa ttgtttggcg tcggccgcca ccatcgtgta 4239841 gtcaccatgg gtgacacccg cgaacacgcc ggtcgccgag cctgccagcg tttgctgagt 4239901 aagaccggcg tgctccatgg cctcccagga cgtctccagc aacagacgtt gctgcggatc 4239961 gatcgcaatc gcctcccgct cgccgatgcc aaagaactcg caatcgaaat ccgcggggtt 4240021 atccaggaaa ccgccccact tgcacaccgt ccgaccgggc acgcccggct gcgggtcgta 4240081 gaactcgtcg caatcccacc ggtccggcgg cacctcggtg atcaggtcgt cgcctcgtaa 4240141 caacgccttc cacaacaact cgggggaatc gatcccgccg ggcagccggc aagccatgcc 4240201 gataacagca accggagtca cacgtggttc agccaacgtc catgcacccc tatctgcacc 4240261 agtgcctgac gccgccgacc ccaagcccaa tgccggaggc gatacgtagc ctaactagca 4240321 atccttcgat gtagctgtgt ctttggtggc tctttagttc taagcggctg tgctactggg 4240381 gcactgggcc ctacttcggt ttgtcgtggc atgggcagcc cgcggtctgc cgcagtctga 4240441 agttcgcggc ctgagcgcgc gctatcttcc acgccgggcc ggtagtctga cgcttcatgg 4240501 tttcgctttc catcccctcg atgttgcgcc agtgcgtcaa cctgcacccg gacggcacgg 4240561 cattcactta catcgattac gaacgggatt cggagggcat aagtgaaagc ctgacgtggt 4240621 cgcaggtgta tcggcgaacc ctaaacgttg cagcagaagt ccgccgccat gccgcaattg 4240681 gtgaccgtgc agtgatattg gccccacaag gactcgatta tattgttgct tttctgggcg 4240741 ctttacaggc cggtcttatt gcggttccac tttcggctcc gctcggcggc gccagcgatg 4240801 aacgtgttga cgcggtagtg cgtgacgcga aacccaatgt cgttctgaca acatccgcga 4240861 taatgggcga tgtcgtcccg cgcgttacgc caccgcccgg tattgccagc ccgccaacgg 4240921 ttgcggtcga tcaactagat ctggactcgc cgatacgatc taatattgtg gacgattctc 4240981 tccaaacaac cgcatatttg cagtatacgt cgggatcgac ccgcacacct gccggtgtaa 4241041 tgattaccta caagaatata ttggcaaatt tccagcagat gatttccgcc tatttcgccg 4241101 acaccggagc cgtaccgcca ttggaccttt tcattatgtc gtggctaccg ttctatcatg 4241161 acatgggttt ggttctggga gtttgtgcgc cgattatcgt aggatgcggc gctgtgctca 4241221 caagcccggt ggcgtttctg cagcgaccag cccggtggct gcaattgatg gcacgcgagg 4241281 gccaggcgtt ttcggcggca ccgaacttcg ccttcgaact gacggcagca aaagcaatag 4241341 atgacgactt ggccgggctc gaccttggac ggatcaaaac catcctctgc ggcagtgaaa 4241401 gggtgcatcc ggcgaccctc aagcgctttg tcgaccggtt tagccgtttc aatcttcgag 4241461 aattcgcaat tcggcccgcg tacggactcg cggaagccac ggtgtatgtg gcgaccagcc 4241521 aagccggcca acccccagaa atccgttact tcgaacccca cgaactttcc gctgggcagg 4241581 ccaagccgtg cgcaaccggg gcgggcacag ctctggtcag ttacccgctg ccgcaatcac 4241641 ccattgttcg gatcgtcgat cccaacacca ataccgagtg cccacccgga acaatcggtg 4241701 agatctgggt acacggcgac aatgtcgccg gcggctattg ggaaaagcct gacgagactg 4241761 aacgcacctt cggaggagca ctggtcgctc cctcggccgg cacacccgta gggccttggc 4241821 tacgaactgg cgactcgggc ttcgtgtctg aggacaagtt tttcatcatc ggcagaataa 4241881 aggatctgtt gattgtttac ggccgcaatc attctcccga cgacatcgag gcaacgatcc 4241941 aggagatcac tcggggccgc tgtgcggcga tagcggttcc gagcaatggc gtggagaagc 4242001 tcgttgccat cgtcgaactc aacaaccgcg gcaacttgga cacagagagg ctgagcttcg 4242061 tcacgcgtga agtcacctcg gcgatatcca cctcgcatgg attgagcgtg tcggatctgg 4242121 ttctggtggc gcccggctcg attccgatca ccacgagcgg caaggtcaga cgtgccgagt 4242181 gtgtgaagct gtatcgacac aacgagttca cccggttgga cgctaagccg ttgcaagcga 4242241 gcgatcttta gtggtcacgc gacttgcacc ccgtctcggg gttgttcggc agccatgcgg 4242301 ctgcctccct tccgcgcttc acagccacca gccgggcaag gcccggtctt acggtcggct 4242361 ccacgcttaa cgacgggaac cagcggtcgg cgaccaccag cgccgacccg taccagcccg 4242421 tcttgtagga caagtgccgg cgcggagtgc ccagggccga gtccgacagt ccgcgccggc 4242481 gggcgcgggc gccgggaagc cccttttgcc gcagcatccc cgcagcgtcc aaaccttcaa 4242541 caacgatgtg gccgtgggtt tgagccaatc gtgttgtcag gacatgcagg tgatgagtgc 4242601 ggacatcgtt gacccggcga tgcagccggg aaatctcggt ggtgcgctcg cggtagcgcc 4242661 gtgagccttt cgtgcaccgc gaccgcgcac ggctggcgta ccgtagctct ttgagtgccg 4242721 cgtcgagtgg ccgtggattg ggcacttctt cgagcactgc gcccgcctcg ttggcgaccg 4242781 tggccagccg gcgcaccccg acgtcaacgc caacccgtga accgggctgt gccacgttgg 4242841 gctgctgcgg gcgttgcacg aggacccgca cactggcgtc gagccgggtg ccgttacggc 4242901 gcaccgagat tgccagcacc cgcgcccggc ctgtggcgat gagccgttca atccggcgtg 4242961 tgttctcgtg cgtacggacg gtcccgacga ccggcagtgt gagatgacgg cgatcaggtt 4243021 cgacgcgcat cgctccggtc gtgaatgtca cgcggtcctg atcgcggcct ttcttcttga 4243081 accgggggaa gcccattgtc ttgccctcac gtttaccgga tcgggagttc tgccagttcc 4243141 agtacgcatc gacagcgccg ccaatgccgt cggcgtaagc ctctttcgag cactccggcc 4243201 accacaccgc cccggtctcg gcgttgacac acacctcgtc cttgacggtg ttccaccgtt 4243261 tacgaagcac ccgcagcgac ggcttgacag tcccgatacc agtaacgcgc cacgcctcga 4243321 tatcggcttt caaagtagcg accgcccagt tgtaggcctt gcggcgagcg ccgaaatgcc 4243381 gcgccagcgc gcgggcctgg tcctcggttg ggtccagcgt gaaccggaac gcctgcacac 4243441 accagccttc tggcacctcg aatctggcca tcaagctgcc tccgcgtccc cgaccgcagc 4243501 agcaagggca cgcttggccc cgttctgtgc agcgcgttca ccatagagcc gagcacacat 4243561 cgaggtcagg atctcggtca tatcgcccac caggtcgtca tcaacctcag ccaagtcgac 4243621 caccaccaat tcccggccct gggcgacaag agcggcctcg acgtactcag agccaaacca 4243681 gcagaaccga tcccggtgct ccaccacgat ccgcgtcacc accggatcac ccagcagcgc 4243741 aaaaaactta cggcgatgtc cattcaacgc ccaaccaccc tcggccacca ccttgtcgac 4243801 agagagatgt tgcgatgtgg cccacgcggt cacccgcgcg acccgccgat ccagatcgga 4243861 cctctgatcc gctgacgata cccgcgcgta caccaacgtc cgcccgcgcc cagactcctc 4243921 gactgccgga tcgttcacca gaatgagccg acccactcgc tgcgccggaa ccggcaacag 4243981 cccggctcga aaccagcgat acgcgatcac ccacgcaaca ccgttgcgct ccgcccacac 4244041 cgccaaattc atccatctgt tcctacagca caccaccgac aactaccgac cactcaaaac 4244101 gcaacagttg gcagccctac gatcggccag cgcctgacgg gcggcgttat atccagggat 4244161 gaacgtgatt cccggcccac cgtgacaacc ggcactgccc aggtacaacc cggctatcgg 4244221 gatcggctgg ccgataaagc ctttcgggcc aggcctgttg gggccgatct ggtccgagtg 4244281 cagcagggca tggcagtagt ccccacccgg ggcaccgaac atcacaccca tgtgtttggg 4244341 ggtaaaggtg gtgtaccgga gaatgctgcc tttgaagttc ggtgccaacc tagtgatctt 4244401 gtcgatcacg ttctgcccca tttcgacctt tgcccggccg taccctccgt attttgagcc 4244461 accctcgatc gggaaccaca ttgcgaacgc cgacgcggcc tgcttacccg ccggggccag 4244521 gctgggatca tgcagcgacg ggatctgcaa caccacggtc ggatcggccg ggacgatccc 4244581 acgccggcaa tcctcccact gctgctgaac ctgctccggt gtacagaaaa tgcccatcga 4244641 tgcctgcatg ctcggatcgt tgagtgcctg gtagggcgcc gcgaaggccg gtggctgcgc 4244701 gagcgcaaaa tgcatctgca gatagctgcc gcggtggtcg atgcgcaaat agcgatcgcg 4244761 gatttccgac ggcaacactg ccggatcgat cagctcgttg atggtgacgt cgggtgctat 4244821 ggcggagacc acgatcgggg aggtcaaggt gtcccccgcc gcggtgcgca cgccccgcac 4244881 gcgggctgac gaccgactat tgtcaaccac gatctcggtc accttggaac gtaaccggac 4244941 ctcgccgccg gtgcgttcca gcaattgcga cagatgggtg gtaagcgcgc cgatgccacc 4245001 gcgcaatttc ttccaccgca cgaagtcgcc ctccgggaca cccaatccga aggcgagcgc 4245061 ggcagcgctg cccggtgtgg ccggcccgcg atagagcgtg ttcacggcca gcacggtcat 4245121 cgacccgcgc agggcgccgt gcttctcgcg gtccgggaaa tggcggtcca acacgtcggt 4245181 gaccgatccg aacagcatgt catcgatcgc tgaccgttcg aattcatttg tggcacaggc 4245241 atacatctcg tcgaagctct tgggcagagt tccggcttcg aaacgcccca gcgcccgggt 4245301 cggcgcctgg ctccacgcca gcaggcccgc catcccggtg acggcgtctg ccccgtgcac 4245361 ccgatggagg tgggtaagca tcttcgtcgg gtcggtgaat tggaccaccg gatcgtcccc 4245421 gacaccgcgc aacgctaccg acatcacctc cagatcgacc gtcggcaagc tgtccaggcc 4245481 taactcgctg ctgaccgccg aggaggtcgg gaactgcacc gatccggcga tatcgaaccg 4245541 gtacccgtcg aacagctcca ccgtggaggc catcccgccg gcgtagcgct tagcgtccag 4245601 acacgcggtc cgcagtccgg ctcgctgcag cagcactgcc gcggtcagcc cgttgtgccc 4245661 ggcgccgata actatcgcgt cataaccagt catacgcgtc tccagcaatg caggctcgca 4245721 cgcgctcgat gttttgtcaa ttatgacgaa actgtgaggg tagtccaggt gtcggagatg 4245781 ccgacgcgca gcgactccag tgcgacgtgg cagacccgcg ccagctcccc gagcgaccgg 4245841 tcactcccaa gcatccaggc ttccatcgcg ccgaacaccg ccgcggcgac gcatcgtgcg 4245901 gtgacggcga tgtgcaatcg ggcatcgggt gcacccgcga tatcgcagtt acgtcgccgc 4245961 aattgggcct ggatggcatc ggcgaagtcg gcttccacct cgcgcatatg gcggacgatc 4246021 cggctcggct ccaactcgcc gcgccgcaac gacgcaatct tcgtcactgc gtcaacgtca 4246081 taaggaaacg agaagatagc cgcttgcacg gaatcgatga tcgattcgtc ggccggtcta 4246141 gcatccagcg ccgcgcgaaa ccagtgcagt ccggcgtcgt agtcggcaaa cagcaaatcg 4246201 tgcttggatc tgaagtggcg atagaaagta cgcagcgaca ccccggcgtc ctccgcaatc 4246261 tgctcggctg aggtagcctc gacgccctgg gccagaaatc gcaccagggc ggcctggcgc 4246321 agtgcctcgc gagtgcgttc gctgcgcgcc gtctgcgggg gccggaccat gactgcaagc 4246381 tatcgtcaat tttcgttctg tcaacattga caaaactgtt ggccacggcg agactgcgcg 4246441 catggtgtcg cttcttgttc acgctgcgct gggagtagtc gtcatcggct ggatcgtctc 4246501 gtcgaacccg aaggttttca ccaggccggc cggcggatcg tggttctcgc tgccggagtg 4246561 tgtgtactac gtcgtcggta ttgcctcgat cgcgctgggg tggtacttca acattcgttt 4246621 tgtgcagcag tacgcgcacg gagccgccaa ccctctctgg ggtcccggca gctgggcgga 4246681 gtacgtccgg ctgatgttca ccaacccggc ggccagttcg gccggccagg actacaccat 4246741 tgccaacgtg atcctgctgc cgctgttttc caccaccgac ggctaccgac gtggtctgcg 4246801 gcggccctgg ctgtatttcg tgagcagcct gttcaccagt tttgcattcg cgttcgcgtt 4246861 ctacttcgcc accatcgaac gtcagcaccg acacgaacgt tcccgtgcga cggtcggcgc 4246921 ctaggcggcg actggcttgg tggcccgcca cctcaggcga gcgcccgcga catcgacgtg 4246981 gatatcagtg aatcccacag ctcgcagccg accggggagg tccgccgggg cgatcggagt 4247041 gtaggtgtcg gcgatgtgta ttaggcgaaa cggcagcgac ggcacaccgt cgctgccggc 4247101 aaagacgcca cctggttgca gcacccggta cgcctcagcg aatagctggt cctgcagttg 4247161 ggcgctggca acatggtgca gcatcgtgaa acacaccacg gacgtgaagt gatcatcggg 4247221 cagcccggtc tgggtgccat cgccgcggat gatgcgcgcc cgctggccgt agcggcggtt 4247281 caggcgctcg accatcgagt tgtcgacttc aacggcggtg agcgaggcgg tcaggccaag 4247341 gagcgcttgc agtgtcgccc cataaccggg gccgatctcc agcgtccggg ggccgagttc 4247401 gacgtgctgc aacgcccagg gcaggagctg attggccacc gctttttccc agcctgccga 4247461 gctgcaatga cgccgatgta gaagattcat ggccatggcc cagaacacta gttagccacc 4247521 ggccggcagt cttccgatat tctgccttaa tatgtcggaa aacagccacc acaggctggc 4247581 cacaacctcg ttgacgctcc cgccgggagc gcggatcgaa cgccaccgcc atccgtcaca 4247641 ccagatcgtc tatccgtccg caggggcggt ctcggtcacc actcacgcgg gaacctggat 4247701 tacgccggta aatcgggcaa tctggatacc ggcgggctgt tggcaccaac acaagttcca 4247761 cggccacacg caatttcacg gcgtagcgct ggatccgcag cgctatcgcg gcggcccggc 4247821 aaccccgacg gtgctcgcgg tcaatccgtt gatgcgcgaa ctcatcatcg cgtgttcgca 4247881 ggccgaccga accgacaccg acgagcacca ccggatgttg gccgtactgc aggatcaact 4247941 gccaacaacg agcatccgcg agccactgtg ggttccctca ccaaccgatc gccggttgcg 4248001 gcacgcgtgc gcgttgatcg ccgacaacct gacccagccc ttgacgctgc agcagatcgg 4248061 cggccggatc ggtgtcagcc agcgcacgct gagccgtctg ttcagcgacg agctgggtat 4248121 gacgttcccg caatggcgca cccagctgcg cctgcaacat gcgctcgtgt tgctcgccga 4248181 gcgccacgac gtcacgtccg tggcgtccga atgcggttgg gccacaccaa gcgcgttcat 4248241 tgacacctac cgacaagcct tcggacacac tcccggccaa gccgctaagc caatggcggc 4248301 gacccgcctc acccggctcc gccgcgctcg cgatcgccgc taagcgaccg gctccagcac 4248361 ttcgacaccc acgaacggaa ccagtgcgtc cgggactcta acgctgccgt cgggccgctg 4248421 gtggttctcc aggatcgcaa ccagccaccg ggtggtggcc agcgttccgt tgagggtggc 4248481 cgcgatctgc ggcttgccgc tggcatcccg gtagcgggtc gccaaccggc gcgcctgaaa 4248541 ggtggtgcag ttcgacgtcg acgtcagctc gcgataggcc ccctgcgtcg gaatccacgc 4248601 ctcgcagtcg aacttgcggg cggccgacga gccgagatca cccgcggcca cgtcgatgac 4248661 ccgatacggc acctcgatgc gtgccagcat ctggcgctgc cagcccagca gccgctcatg 4248721 ttcgtgctcc gcgtcggccg gtgtgcagta gacgaagccc tcgactttgt cgaactggtg 4248781 cacccggatg atgccgcgcg tgtccttgcc atggctgccg gcctcacgtc ggaaacacga 4248841 cgaccagccc gcataccgca gcggcccgcg ggaaaggtcc agaatctcgc cggagtgata 4248901 ccccgccagc ggtacctcgg aggtgcccac aaggtagagg ccgtcgccct ctacccggta 4248961 cacctcctcg gcgtgggcgc ctagaaatcc cgtgcctacc atcacttccg ggcgcaccag 4249021 caccggcggg atcgtaggga caaagccgtt gtcgacggct agcttcagcg ccagctgcag 4249081 caatccaagc tgcagtaggg caccccgacc ggtcaggaag tagaaccgtg aacccgacac 4249141 cttggcgccg cgctgcatgt cgatcaggcc cagcgactcg ccgagctcca ggtggtcctt 4249201 ggggttctcg aggtagctgg gctcgccgac gacgtcgagc accgcgtagt cgtcctcccc 4249261 gccggcgggt accccgtcca cgatgacatt cgagatcgcc aggtgcgccg cggtgaacgc 4249321 cgcctccgct tcgacctcgt cggcctcagc ggctttgacc tgctcggcga gttccttcgc 4249381 gcgccgcagc agcggcgggc gctcttcggg agacgcgcca cccacgcttt tgctggcggc 4249441 tttctgctcg gcccgtaacg aatcggcggt cgagatcacg gcccggcggg cggcgtcggc 4249501 cgtcagcagg gcatctacca gcgccgggtc ctcgccgcgg ctgagttgtg agcggcgtac 4249561 cgcgtcgggg ttttcacgaa gcagcttcag gtcgatcacg gccgcaagac tacttttgac 4249621 gcccagtcag ggtggcggca gaggaccatc cacccgcgat gaagcgatcc cgcaagctga 4249681 caactgcaac attggtcatg cggccccgcc gaccctgtca gaatggagcg gatgttggac 4249741 gcgcccgagc aggaccccgt cgatcccggc gacccggcca gccccccgca cggggaggcg 4249801 gaacagccgc tgcccgggcc tcggtggcca cgcgccctgc gcgcgtcggc gacccggcga 4249861 gcgctactcc tcaccgcttt gggtggcctg ctgattgccg ggctggtcac cgcgattccc 4249921 gccgtcggcc gcgcgccgga gcggctggcc ggctacatcg ccagcaatcc ggtgcccagc 4249981 actggcgcca agatcaacgc ttcgttcaac cgcgtcgcca gtggtgactg cttgatgtgg 4250041 ccggacggca cgccggagtc tgccgccatc gtcagctgtg ccgacgagca ccggttcgaa 4250101 gtcgccgagt ccattgacat gcggacattc cccggcatgg agtacgggca aaacgctgct 4250161 cccccgtcgc ccgcccgcat tcagcagatc agcgaggagc agtgcgaagc tgctgtgcgc 4250221 cgctacctcg gcacgaagtt cgatcccaac agcaagttca ccatcagcat gctgtggccc 4250281 ggcgaccggg cgtggcggca ggccggtgag cgccgcatgc tctgtggctt gcagtcgccc 4250341 ggtccgaaca accagcagct cgccttcaag ggcaaggtcg ccgacatcga ccagtccaag 4250401 gtctggccgg ccggtacctg cctgggcatc gatgccacca ccaaccagcc gatcgacgtg 4250461 ccggtggact gcgcggcacc gcacgcgatg gaggtatccg gcacggtcaa cctggccgag 4250521 aggtttcccg acgcgctgcc gagcgaaccc gagcaggacg ggttcatcaa ggacgcgtgc 4250581 acccggatga cggacgccta cctcgcaccc ctcaagttgc gtaccaccac cctgacgctg 4250641 atctacccca cgctgacgct gcccagctgg tcggcgggta gccgcgtggt cgcatgcagt 4250701 atcggcgcga ccctgggcaa cggggggtgg gcaaccctgg tgaacagcgc taagggggcg 4250761 ctgctgatca acggccagcc gccggtaccc ccacccgaca ttcccgagga gcggctcaac 4250821 ctgccgccga ttccgcttca gctgccaacg cctcggcccg cccccccggc tcagcagctg 4250881 ccaagtaccc caccaggcac tcagcacctc cctgcccaac agccagtggt tacgcccacc 4250941 cggccacccg aatcgcatgc gccagcgtcg gcagcaccgg ccgagaccca gccaccgcca 4251001 ccagacgccg gagcgccgcc ggcgacccaa tcaccagagg ccacaccgcc tggccccgcc 4251061 gagcccgcac cggcaggcta gccgggtgac agtacggatg gacccgcagc ggttcgacga 4251121 actggtgtcc gacgcactcg acctcattcc gcccgaactg gcggacgcca tggacaacgt 4251181 cgtcgtgtta gtcgccaatc gccaccccca gcacgaaaat ctgctcggcc agtacgaagg 4251241 ggtcgcgtta accgagcgcg gctccgacta cgccggatcg ctgcctgatg ccatcacgat 4251301 ctaccgcgag gcgctgctgg acgcctgcga ctctgaggat gaggtcgtcg accaggtcgc 4251361 catcacggtg atccatgagg tcgcccatca cttcggcatc gacgacgagc gcttggacca 4251421 actgggctgg gcgtgacgaa ccagcgcccg ggcgcggcaa cccggatttg tcggcacccg 4251481 atgctatgaa cggcccatga gcacggactg ccgcgactgc cgggcgggct tggatcactg 4251541 ccacggcacc gtcattcgtc atcccttggc acggccggaa tgcaccgagc cggactgtgt 4251601 cagccccgag ctgcaacccc atatcttcgt cctagactgc aataccgtca gctgcgaatg 4251661 cactgaatcg gccacggcgc ccgggtcctt cagatcagcc catcgggtcg gtgcttgacg 4251721 tcaccgcgtg tgtgaccggg ctggctgcgg cttcagcggg gtccggacaa aacggcggct 4251781 tccggaggcc ccactgcaca caactccatc gcccatcggt tatcggggcc agcaccaccg 4251841 actcgacgtt ttccaagtgg ttgtccaaca caaagttgcc gtccaccccg gcaagcaccg 4251901 cggcggccag ccggatcgcc gcgctgtggc tcacgacgac gatgtcgccg tcccagtcac 4251961 cgtcgtcgag gtaacgcatg cgcaggtcgg cgagcaccgg cagataacga tccaggacgt 4252021 cgttggcggt ctcgccaccg ggcagcggca catccaactc cccgcgatgc cagcggctgt 4252081 aggtggcgtt gaactcggcg accgcctcgt cgtcgttgcg gttttccagc tcccctacct 4252141 gtacctcgtg aatgccggca acctcgtggg ccaccatgtc gagttcggca gcgaccaccg 4252201 cggccgtctg gtaggcccgg atagccaccg agtgtgcgag cagtgccggc cggcgacaac 4252261 cgctgcgcgc gaacgccctg gcctgatcac gacccagcgg tgtcagcgcc gttcccggcg 4252321 gcagggtatc caacctgcgc tcgacgttgc cataggactg gccgtgccgc agcagcacca 4252381 aacgaccgct catgcttgcg ccccctggtc gtccgggcga accagggtct gctcgggttt 4252441 gcccgcgcgg agccgcgcta accagcgtga tgcttcgtct accaggggcg gctgcgcccc 4252501 tgccgctggc cccgtcggcc aggaccccag gtatcgcaca tcagcacaac gtcggtgcac 4252561 cgccttgagt gcctcggcga cggcctcgtc gtcgatgtgg ccgacgcaat ccacgaagaa 4252621 cagataggtg ccaagttcgg tacgggtggg ccgggattca atccgagtga gatcgatgcc 4252681 gcggatgccg aactcggcca gcgcagctac cagcgcaccg ggctggttgt cgatgcgcag 4252741 cactgcagac gtgcgatcgg ctccggtgcg cgccggaggc ggcccgggcc gaccaaccag 4252801 gacgaagcgg gtgcgggcat tggattcgtc aacgacaccg tcggccaggg ccgccaatcc 4252861 ccaacgagcg gccgccagcg gcgaggtcac cgcggcgtca accaagccgt cagccacctg 4252921 ccgggccgcg tccgcgttgg aataagccgg ccgcaggtcg gcggcgggaa gatgggccgc 4252981 caaccactgc cgcacctgtg cagccgccac cggaaaggcc gccagggtcc gcacgtccgc 4253041 ggcgttgcgc ccgggtttga ccacgatgct gaacgtcacg tccagcgttg tctcggcgaa 4253101 cacctgcagg cgcacaccga tggccaggct atccaaagta ggcagcacgg aaccgtcgat 4253161 cgagttctcg atcggcacgc acgcataatc cgcaccgccg tcgcggaccg cagccagtgc 4253221 tgcgggcgcg ctctcgaccg gcatccgctg cagtgcatcg ggcccggtct cgggaactag 4253281 gccggcggcc accatccgga ccagggctgc ctcggtgaat gtcccttccg gaccgaggta 4253341 agcgatacgc accacgctca caaccctaac gacgcaaagc cgaccgccaa ctcttgcgac 4253401 cagaccgtgc attagttaac ttaggcttac ctaaacacag gaggtcgtgg atgccgccgc 4253461 tcaccagtct cgcgccgact actgccgagc gaattcgcag cgcctgcgcg cgggccgggg 4253521 gcgccttgct ggtggttgag cgggaggatc cggtccccgt gcccatacac catttgttgt 4253581 acgacgggtc cttcgccgtg gcggttccgg tcgatcgtgg cgaggtgtcc ggttcgcaag 4253641 cgctgctgga gttgactgac tatgcgccgc tgccggtgcg tgaacccgtc cgttcgctgg 4253701 tgtggatccg cggctgcctc caccagatcc cgcccgcaga gctggttgag accctggacc 4253761 tgatcgccac cgataatccg aatccggccc tgctacaagt cgagaccccg aggtccgggc 4253821 cggccgatgc ggcggagacc cggtatacca tgcagcggct ggagatcgaa tccgtagtgg 4253881 tgaccgacgc caccggcgcc gaacccgtta ccgtggcgga cctgctcgcg gcccgacccg 4253941 atccgttttg tgaaatcgaa tcaaccttgc tctggcacct agccaccgcc catgacgatg 4254001 tggtcgcgcg gctggtatcc aggctgccgg caccgctacg acgcggacag atccgccccc 4254061 tcggtctcga tcggtacggc gtccggtttc gcattgaagc tcgcgacgga gaccgcgaca 4254121 tccgactgcc gttccataag ccggtggacg acatgaccgg gctaagccag gccatccggg 4254181 tgctcatggg ttgcccgttc cgcaacgggc tgcgcgcccg caggtagcag gcacagccgc 4254241 cgctcggccg cgttggccgg ctgcatccaa aggttcagcc acgtacgttg tctaggtccg 4254301 gggttggcat ccgacaaccc gacgacactg atatcgatcc cgcgtgactc ttatgtaccg 4254361 atccctggcc acggccggga caaaatcaac gccgcgttcg cgctgggcgg ggggcggctg 4254421 ctgacccaaa cggtcgagtt ggctactggc ctgcacctgg atcactatgc cgaggtcgga 4254481 ttcagcgagt tcgccgacct cgtcgacgcc ttcgatccgt tggccggcgt cgatctaccg 4254541 gcaggctgcc aaacacttga cggacgtgca gcgctgggct acgtccggac tcgggccaca 4254601 ccacgggccg atctagaggg ctccgacgtg ccggtgccag ccgccgcgtt cgaaacacag 4254661 ccctaacgac acgctgccga atatgacccg tgtcggaaat tagggcgaca agagtaatgc 4254721 ggctcaacat agccttgctt tacttaggca aacctgcctt caaccaggag gttattatca 4254781 tcctgtggta actaggaaag cctttcctga gtaagtattg ccttcgttgc ataccgccct 4254841 ttacctgcgt taatctgcat tttatgacag aatacgaagg gcctaagaca aaattccacg 4254901 cgttaatgca ggaacagatt cataacgaat tcacagcggc acaacaatat gtcgcgatcg 4254961 cggtttattt cgacagcgaa gacctgccgc agttggcgaa gcatttttac agccaagcgg 4255021 tcgaggaacg aaaccatgca atgatgctcg tgcaacacct gctcgaccgc gaccttcgtg 4255081 tcgaaattcc cggcgtagac acggtgcgaa accagttcga cagaccccgc gaggcactgg 4255141 cgctggcgct cgatcaggaa cgcacagtca ccgaccaggt cggtcggctg acagcggtgg 4255201 cccgcgacga gggcgatttc ctcggcgagc agttcatgca gtggttcttg caggaacaga 4255261 tcgaagaggt ggccttgatg gcaaccctgg tgcgggttgc cgatcgggcc ggggccaacc 4255321 tgttcgagct ggagaacttc gtcgcacgtg aagtggatgt ggcgccggcc gcatcaggcg 4255381 ccccgcacgc tgccgggggc cgcctctaga tccctggcgg ggatcagcga gtggtcccgt 4255441 tcgcccgccc gtcttccagc caggccttgg tgcggccggg gtggtgagta ccaatccagg 4255501 ccaccccgac ctcccggcaa aagtcgatgt cctcgtactc atcgacgttc cagcagtaca 4255561 ccgcccggcc ctgagctgcc gagcggtcaa cgagttgcgg atattccttt aacgcaggca 4255621 gtgagggtcc cacggcggtt gccccgaccg ccgtggccgc actgctggtc aggtatcggg 4255681 gggtcttgcc gagcaacacc gtcggcagca gcggtgcagc ccgccggatc cgccagaccg 4255741 cggcggccga aaacgacatc accaccgcac gggatcgatc tgcggaggcg ggtgcggcaa 4255801 taccgaaccg gtgtagcagc gccagcagct tgttttccac cagcgagccg tatcggacgg 4255861 gatgcttggt ctcgacgaag atcttcaccg gccggtgcca gtccaaaacc agcgaaacaa 4255921 gcgcgtccag ggtcagcaga ctggtgtcgc cgtgcgaacc gtcggggcgc cagctgtcgt 4255981 gccacgcgcc gtactccagc tcgcgtagct gggccagcgt catcgtgctg accaagccgg 4256041 ctcccgtcga ggttcggtcc aggcggcggt catgcacaca gaccagatgc ccgtcccggg 4256101 tcaaccgcac atcacattcc acgccgtcgg cgccctcttt gagcgccagg tcgtaggcgg 4256161 caagggtatg ctccggccga gccgccgacg caccacggtg agcaaccaca aagggatgtc 4256221 cggcgagcac ctcgtcggcc catgtcatgt ccactatgct gccggttcct gcccgtccaa 4256281 ctcaaccgca acagaagatg ccggcgcgga acgcccgtct gtgttcacca ccacccagcg 4256341 atgcgctggg cgttcgaccg gcttttgctc gaacccctcg aagacccgcg cagcagcggc 4256401 caccgcggcc gccgcacaca gatacgccag caccatcatg gtggtgttgt tggcgatgcc 4256461 ctgagcgtcg gtgacccagc tggtggcgaa cgcgaacatc gatatcgcgt tgctgacgat 4256521 ccacacgatc caccacacca cgatcggcct gcgcagccgc gtgtagcggt cctcgaccag 4256581 cgccaactcg atgacgtaca gcggagccca cagcagattg accatcggca ataggcagcc 4256641 ggcccataac tcacgggcgg aacgccgctc cggcaagcct tgatgcataa acgcggcggc 4256701 ccgacgggcg accagccacc ggaccaacag gacaatggta gtgccggccg ccgcaatcgc 4256761 cgccaagctg accaaaaccc ccagccagac cgaggcgctg gccaccaccg agttcaacaa 4256821 tgtgtttcgg ttgatgacca gcaacacata ccgcaccaca aacaccacga ccgcgatgct 4256881 gaacaccagc aggctcacca acagcgtggt gcgcaccgcc gccggcgatg gccctgcttt 4256941 cgccgaggcc ggcacgggag cctggtcgac atggtcggtt agcccccacc gcggtatccc 4257001 ggcgtagtgg ggagtaggcc cacgtaaccg tgggccgtgc cgtggcggcg gtgccgcccc 4257061 gggtcgcacc gctatccacc gaaaacctgg gggaagccgc ggcggtgtgc gccgcgtgtc 4257121 ggaggccgtc ggcacctgcg ggcgcgccgg tgtacgccag cgcgcctcgg ccggcatatc 4257181 cgccaacggc gccagcaaca tcccccgaca gcgtggacac cacacgcgtt gccgctcacg 4257241 gacgttccag cgagttccgc actgggagca cacttggatc accagaccag cctagtgact 4257301 tctccgcccc gcaccggtac ggcattgtcc gcgccgtcaa caggcgttga ggcaggcttc 4257361 cgcgctggat tgggcgcgcc cggtcgcggc acgtccagca cgacacagct acctacgact 4257421 atccacagtt tccacagctt tatccacagc ggtaagaatc cgacgaatgg cgttaacacc 4257481 ggctccatcc gtcagccagg cccacaactg tggataacag cgcccgtcaa tgcgttctca 4257541 tcgacagcct ggcaggtacc cgagcgaaat ggattgtcgc actaagcatc cacatctgcc 4257601 ccggctgcac ctagcagcct gcccgcccgg gcccggcctg ctcctgcgat cgtcaaacca 4257661 cacatttcgc ggcgctgccg gcgcagtatc cggacgtctt gtggcgctgc gaggtaccaa 4257721 tttttcccca ccattcacca ggagttatta tcgcgtgcac gacacttcgt tgtgacttac 4257781 ctcaccgtcg tgaggtgagc atgcaggtga aaggcgactg atggccacac actcgtacgg 4257841 cccagggtct acaacgccgc cgaactctgg ctcgccggag tggaaccacg cctacggcat 4257901 tgccgctttg cgggccgccc tgatcgctct ggcgttactg gcgattctgg ccgtcatcgc 4257961 tttggtttga gtccccggcc actcgggtgg caccgagtcg gtccggacgc cctggtcaga 4258021 accggttctc ggatttgggt aacccccctt gtgtcactgc cgtttcggtg gtcacagcac 4258081 ggcaattgtt gtgggtggcc tttcatagaa ctgcgacatg gattaccgcg gtcgtgagga 4258141 aatcgtcgag gctggttgca cccccacgga gccagccaga aattctgtag atcagagttg 4258201 gcttgattat gaatcatgct ctagcacagg gcaactcgtg agtgtgttga acactaccgt 4258261 cctgttctgc gttccggcac tcgaataacc tcccgtccca ctcgaaatat tgcgcagcct 4258321 aagataaatc agcttcatag ccgaatcctt gcctggcaaa aggaccgcgg ttattgatta 4258381 acttgcgcag ctcgatcgga tagtccagga atggcacgaa ttccggctac gcatgcgccc 4258441 agacaaaatt ccttgagcgc gagctcggcg gcctccacgg tgacggcgcc atgaatcccg 4258501 gcatcgacga gacgccccgg gctgtcttgg atcggccggc ccgaggcgtc tttgcgcccg 4258561 tcaaggtcca ccctgatagc caaatgcgcc agctggcggc aaccaccccg ttgtcttcga 4258621 tccgcagccg taaaccgtcg ttcgtcggcg cccgtcgccc aacgtgaact gagggcggag 4258681 aatcggccgg aatctcgccc tcagttcacg ctcggcgccg tttggcctca cccagtcaat 4258741 gtgatctgtg cgggcgggcg ttggcgcgta gcgaacccca gtggcgccgg cccgccaagc 4258801 acgccccggc gcggccagct catcagcggc tacgcaagcg caacggcgcc cgcgatgggc 4258861 tgtggaagaa cccggaggat ctcaccgaac accagaatgc caagctgtcg cgctcatcta 4258921 ctcaaagaag gcctacggca cctgttttcg gtcaaaggcg aagagagtaa gcaggcactg 4258981 gaccggttga tcttctaggc gcggccccga gtgagcatac tttggtggct tgtatctctt 4259041 gtagtgccgc tttgacgggg tggtggtcag gtacggtggc ctcgggagag gctggagggc 4259101 tcgacgtttg ggctgagtgt ctgggcccgt gaaagagatc gtctgctcca gctttgtctc 4259161 ctgaactgac ccggtttagg gaattggtgg ccaggttgcg gaagtgcgca gcatcgacgt 4259221 gtacctgggt gaggcatcga atcatcgaca agcaccggag ccgcgcgtga actcccgccg 4259281 tgttgtggtc ggggatgatg tgggagaccg gccggcagtg ctgtgtacga aggttctccc 4259341 accgcaacga gttcacgcac gacggtcggc tgggtgggcc ctggaatgcg tgaactcttc 4259401 atcaacacaa catgattgac gatgaagggg agaacctcca tgcacaacaa cgctaacccg 4259461 tgactgccga gaatccagga cggagcaggc ggacgctggt cggaatcgac gcggcgatca 4259521 cggcctgtca ccacatcgcg atccgcgatg atgtcggtgc gaggtcgatt cgattcagtg 4259581 tcgaacccac gctggccgga ctgcgcaccc tcaccgacaa gctcagcggt tacgacgata 4259641 tcgacgccac cgtggaaccg acctcgatga cgtggctgcc gctcacgatc gctgtcgaga 4259701 atgccggtga caccatgcac atggccggcg cgcggcattg cgcccggctg cggggtgcga 4259761 tcgtgggcaa gagcaagtcc gacgtcatcg acgccgaggt tctcacccgc gccagcgagg 4259821 tgttcgacct gacgccgctg acactgccga cgcccgcgca gttggcgtta cgtcgatcgg 4259881 tgatccgacg tgccggcgca gtgattgacg cgaaccggtc ctggcgtcgg ttgatgtcgt 4259941 tggcgcggta ggcgttcccc gatgtgtgga ccgcgttcgc cgggtcgtta ccgaccgcga 4260001 cagcggtgct ggggcgttgg cccgacatcc gcttgctggc cggcgcaccg acccgcaact 4260061 ggcggcgttc taccaccggc tgatgaccac ccagaggcat tgccacaccc aggccaccat 4260121 cgccgtagcc cgcaagctgg ccgaacgcac ccgggtgacg atcaccaccg gccgccccta 4260181 ccagctgcgc gacaccaacg gcgaccctgt caccgcccgc ggcgcgaaag aactgatcga 4260241 cgcccactac cacgtcgaca ccaggaccca cccacacaac cgcgcccaca ctgacaccat 4260301 gcagaactcg aaaccggcac gctgaacacc actgtcggca ggggatccgg ttgcacacgc 4260361 aacggtcact tgaggcgatc gtctccattc ctggctcctt gccgcccatt gttgtcggcg 4260421 agcaaggagt cacagtggag tccccgcagc gtagcgagga aaaccgacct tgacgcccga 4260481 cgagcggcaa cgagaaccgg caacgaggaa tggtcttcga caagcccacc gtgagttgtc 4260541 tatcggtttc tcattttcag cgtcttttca gagtcgcgca acacaatccg atgcccgtcg 4260601 agatccgtcg cgactacaca cacacccagc atctcgacca tcgcgactcc ggccgacgac 4260661 ggctaacgag cagcttcgcc ccacccgccc ccgcagcaac aacacaacgg cacggcagca 4260721 gctgatcact gcccaaaaca cgcacccaca tcagatgcag aaccccttga caaccaatag 4260781 ggaatctctt cacgaatgag ggggcagttg gggtttgaat ccgccggttt ccagtaggta 4260841 tctgtcggct tagttggtga gattgcgaaa gccgagggtc gatccccgga ggtgctcgac 4260901 gcggccgctg atcgcttcgg tcggcgggtt gaccgtggtc actgttttgg gcgtcgatcc 4260961 actgcgggaa ttcccactac cacgtccggc cggatcaccg gcgactcgcg gtgcacggcc 4261021 cgctccagca cctccttggt caattcgtta gccgtccccg ccaactgccc agccgtcgac 4261081 ttcttcttgc ccacccaccc catagacctt cgccacacag cgccttccgt ccacccaaca 4261141 gcggtccgat gacggacccc cgacggggac ttcagcgacc aggaacgcgc ccatagacgt 4261201 ggtatcagcc tgggggcgtc ctggtagcct atgccgtccg ccctggggca tcgaccccaa 4261261 ggtcgttgtt gcgacgcgag cggtcatgga gcagggttga cttgtcaagc tagagccagc 4261321 ccatcgcgtg ggaggcaccc gcgcgaaaag aaacatcgga cgatcatttc atcgaaggaa 4261381 ggaatgccgt ggccgaatac accttgccag acctggactg ggactacgga gcactggaac 4261441 cgcacatctc gggtcagatc aacgagcttc accacagcaa gcaccacgcc acctacgtaa 4261501 agggcgccaa tgacgccgtc gccaaactcg aagaggcgcg cgccaaggaa gatcactcag 4261561 cgatcttgct gaacgaaaag aatctagctt tcaacctcgc cggccacgtc aatcacacca 4261621 tctggtggaa gaacctgtcg cctaacggtg gtgacaagcc caccggcgaa ctcgccgcag 4261681 ccatcgccga cgcgttcggt tcgttcgaca agttccgtgc gcagttccac gcggccgcta 4261741 ccaccgtgca ggggtcgggc tgggcggcac tgggctggga cacactcggc aacaagctgc 4261801 tgatattcca ggtttacgac caccagacga acttcccgct aggcattgtt ccgctgctgc 4261861 tgctcgacat gtgggaacac gccttctacc tgcagtacaa gaacgtcaaa gtcgactttg 4261921 ccaaggcgtt ttggaacgtc gtgaactggg ccgatgtgca gtcacggtat gcggccgcga 4261981 cctcgcagac caaggggttg acattcggct gaccccgctg ccgcaagcgt cgggctcagt 4262041 attccggagt cgcgcatcac catcgccctt atcctggcct tatattgcag ctttgtgaac 4262101 acggccgcgg tggccgtgtc gagttgcagg gcgcgtaaac cacgcgcatg cttggttact 4262161 cgagctacca tttatttcga gctaccagcg tggttaggac ggaggcgtcg cggaggggcg 4262221 agatgggtac cgggtcaggt gggcctattg gggtttctcc cttccattcg cgtggtgccc 4262281 tgaaagggtt cgtgatctct ggacgttggc ctgattcgac caaagagtgg gcccagctgc 4262341 tgatggtcgc agttcgggtc gcgtcgttgc ccggcttgct ctccaccaca acggtgtttg 4262401 gtgcccgcga agagttgccc gacgaacccg agccggggac cgtcggtctg gtgctggccg 4262461 agggcaccgt cttcggtgaa tcagcaattc agccaggata tttcgctgat catcaacccc 4262521 ctgcattgct gatgctgcat ccaccctcgg agaccacgcc gtcgctgccg gaatgcaccg 4262581 gggcggcgtc agggtgcgtg ctgctgccgg gattaccgta tctgggattg gaacatcgtg 4262641 cggcttgggt ggaggctgaa gccgacggca ccatcacatc tatggtgagc cgggtgggcg 4262701 tcgacccgat aagccatccc gacaccgcaa ttctggcaat gctgcttgca gcataaggaa 4262761 attcgaagga gtctgttcgg gcggcgaatc gccaaatacg ggtggccgaa cttgtccgac 4262821 atcctggtgc acaccaaata tgaccgctag cctggggacg ttagcgaagg ggagtagtcc 4262881 cgaatcgtcg agtcgacata ctggcgaaaa gcccggctgg cgaaccgttt gataccaacg 4262941 gtgggcgaga ccttcgaccg atgttcgatg accgactggt cgtcgacaac gcgtcgaaag 4263001 gtcgcctgcc atgctcgccg ccacactgct aagtctggga gccgttttcc ttgctgagct 4263061 cggcgacaga tcccagctca tcacgatgac ctacacactt cgctaccgct ggtgggtggt 4263121 gctgaccggg gtggcgatcg cagcgttcac ggtgcacggg gtagcggtgg cgatcggcca 4263181 ctttttgggc tcgaccgtgc cggcccggcc ggccgcctgc gtatcggcga tcgcattcct 4263241 gatctttgcc gtgtgggtct ggcgggagga cacggccagc gacagcgaaa cctcgccaac 4263301 cgctgccgaa ccccgactcg cgctgttcac cgtggtctcg tcgttcgcac tggctgagct 4263361 gggtgacaag acaacgttgg cgacggtgac cttggccagc gatcaccact gggccggcgt 4263421 atggatcggc accaccctgg gcatgatcct ggccgacggc ctggcgatcg gcgcagggct 4263481 gctgctgcac cggcgccttc cggagcggtt gctgcaggtc ctgactggcc tgctgttcct 4263541 gctgttcgga ctgtggttgc tgttcgacga cgcgttgggc ttcagatcga ttgccatcgc 4263601 cgtgacagcg gcggtggtgc tggccgcggc aactacggcg gtatcggtgc gggtggcgca 4263661 aactcgtcgg cggcggccaa ccgctgctgc aacaccagaa gatgactcga cacgccccga 4263721 gcggtcgtcg gtcgcgccgg gccatcccgg gagcatcttg ctaccgcttc cggaagtgtc 4263781 tttgcggggg cgccgaccgc cctcagggtc gcctgacgag cgctgtgcgg acccaggcag 4263841 caaaggaggc tctcggcgaa tctccgttgg ctgctggttg cccggagtcg gccgcatccg 4263901 cccgacacgg tcatcctgat ctgctcgccg aacacgtggg cgacggacca acgcgcgtgt 4263961 tttcatcgga tattctgcgg ataacctgtg aaatccgttc gtcgtgtgga cacatcaccg 4264021 aatcggttgg accctcatcg ggggggtctt cgttgacccc tcacaacgtc agcacccaat 4264081 ccgctcaggt ttgcacttgg ttgtggacac aactgtcgct accatgatca gcaaatacat 4264141 acagataacc gtttgctctt ggagcccggt ggaggtcaca tcgatgagca cgacgttcgc 4264201 tgcccgcctg aaccgcctgt tcgacacggt ttatccgccc ggacgcgggc cacatacctc 4264261 cgcggaggtg atcgcggcgc tcaaggcaga gggcatcacg atgtcggctc cctacctatc 4264321 acagctacgc tcaggaaacc gtacgaaccc atcgggggcg accatggccg ccctggccaa 4264381 cttcttccgc atcaaggcgg cctacttcac cgacgacgag tactacgaaa agctcgacaa 4264441 ggaattgcag tggctgtgca cgatgcgcga cgacggcgtg cgccggatcg cgcagcgggc 4264501 ccacgggttg ccctccgcgg cgcagcagaa ggtgttggac cggatcgacg agctgcggcg 4264561 tgccgaaggg atcgacgctt agtccctgat accgaccgcc cgctccaccc gacctggcgg 4264621 gttggggttg gtctgccccg attagggttg ccccagcgat caccgcgata gtccacgaga 4264681 taccgggagg cggccgggaa tgggcctgtt cggcaagcga aagagccgcg cgacccgtcg 4264741 cgcggaagcc cgcgcgatca aagcccgcgc caagctcgag gccaagctgt cggccaagaa 4264801 cgaggcgcgc cgcatcaagg ccgcccagcg cgcggaatca aaggcgctca aggcgcagct 4264861 gaaggcccgg cgggacagcg accgggcggc gctcaaggtc gccgaagccg agctcaaggt 4264921 agcacgcgaa ggcaagttgc tgtcaccgac gcggattcgc cggttgctga cggtttctcg 4264981 gctcctggcc ccgatactga cgccggtgat ataccgggcc gcgatggctg cccgcgggtt 4265041 gatcgaccag cggcgcgccg atcagctcgg ggtcccgctg gcacagatcg gccggttctc 4265101 cggtcatggc gcccggttgt cggcgcgggt tgggggagcc gagcgatcgt tgcggatggt 4265161 gcaggaaaag aagccgaagg acgtagaaac caaacagttc gtgtcggcgg tgaccaatcg 4265221 gctcaccgat ctgtcggcgg ccgtcgcggc cgcggagcac atgcccgcaa agcggcgccg 4265281 gacggcccac tcggcgatct cgtcgcagct ggatggcatc gaggcggacc tgatggcccg 4265341 gctcgggttg acctaaccgg cggcccgatg accgcaattg gcatgtcaca tccgcctcgc 4265401 gtgcatcggc gggtcggcgg gcagcgcact gcactgaccg cgggcatcgg cctcttgctg 4265461 gccgccttgg tgctgaccac catcgcgaac ccacctgcgg cgtttgcgca caccgcgcag 4265521 ctgtccaccg ctacgcccgc acccgcagtc gccgccaccg acgcgaacga cgtcccgacg 4265581 tggccattcg tcgtagggac cgtggcggcg gttgccgtgg ctgcattgtg ggccgttcgg 4265641 cgcgggcgct aaccaatcaa ccccggtagc ccggaaggtg cggcaccgtg tcctggcatg 4265701 atgggaccga gcgtttgcga tctagtgagc gacgacaatg ctgcaaagga gcggccacat 4265761 gccagacccg caggatcgac ccgacagcga gccgagcgac gcatcgacgc cgccagctaa 4265821 gaagctgccg gccaagaagg ccgccaagaa agcaccagca agaaagacgc cggcgaagaa 4265881 ggcacccgcc aaaaaaacac ccgccaaggg tgctaagtcc gcgccaccaa agcctgccga 4265941 ggcgcccgtc agtttgcagc agcggatcga aaccaacggc cagcttgcag ctgctgctaa 4266001 ggatgcagcg gcacaagcaa agtcgacagt ggaaggcgcc aacgacgccc tggcgcgcaa 4266061 cgcatcagtg ccggcgccga gtcactcgcc cgtgccgctg atcgttgccg tcacgcttag 4266121 cctgctggcg ctgctgctga tccggcaact gcgccgccgc tgaacgcgct ggcaccatag 4266181 tggccatctc atttcgccca accgctgacc tcgtcgacga catcgggccc gacgtgcgca 4266241 gctgtgacct acagttccgc caattcggcg gccgatcgca gttcgccgga ccgatcagca 4266301 ccgtgcggtg ttttcaggac aatgcgttgc tgaagtcggt gctctcgcag ccaagtgcgg 4266361 gcggtgtgct ggtcatcgac ggcgccgggt ccctgcacac cgcgttggtc ggtgatgtca 4266421 tcgccgagtt ggcccgctct accggctgga ccgggttgat cgtccacggc gcggtgcgag 4266481 atgccgccgc gctgcgcggc atcgacatcg gcatcaaagc gctgggcacc aatccccgca 4266541 agagcaccaa gaccggtgcc ggagaacgcg acgttgaaat cacgctgggc ggggtgacat 4266601 tcgttccggg cgatatcgcc tacagcgacg acgacggcat catcgtcgtc tgactatggc 4266661 ctaaaccggc gctaaaccgt cgctaaagct aaacccccac cggggcaggc cttttggcga 4266721 accgcagacc ctcgtcgtcg atcttgccgc gccggatgag ccggatgtca cgtaggtagt 4266781 tctgattcag gcgccacggt gtacgcgaac cctgcttggg cagctcgtcc agcgagcgca 4266841 gcacgtaacc tggggtgaac tccatgaagg gccgctcttc gacatctgag cccggtcgct 4266901 cgacgaccac ggtgtcaaaa ccgttgtcgt ccatgtaatt caacaagcga cagacaaact 4266961 ccgacaccag gtcggccttc agcgtccagg aggcattggt gtagccaacc gtgtaggcca 4267021 tgttggggat gccggaaagc atcatgccct tgtaggccat cgtcgtggtg atgtccactt 4267081 gttgtccgtc gatagtcgcc gtcgccccac caaaaagctg caggttcaac cccgttgcgg 4267141 taatgatgat gtcagccggc agttcgcgac ctgagttcag ccggattccg gtcgcggtga 4267201 accgttcaat ggtgtcggtc accacctcga ccttcccgtg acgaatggcc cggaacaggt 4267261 cgccgttggg caccaagcac aatcgctggt cccaggggtt gtagtgcggg ccgaagtgct 4267321 ttcgcacgtc gtacccctcg ggtagctggc gctggatcag gctcaggaac atcttccgca 4267381 tgcgccgtgg ccacttctgg caggcgctgt acacggccgc ctggcgcagc acgttcttcc 4267441 accgtaccgc ggtgtaggcc atggtctccg gcagccagcg gttgagcttc tcggcgatgc 4267501 cgtcccggtc tggctgcgac acgatgtagg tgggtgagcg ctgcagcatc gtgacgtgct 4267561 tggcgcccga gtccgccagc gccggcacga gcgtgaccgc cgttgcgcca ctgccgatca 4267621 cgacgatgtt cttagcgtcg tagtcgaggt cctcgggcca gtgctgcgga tggatgatcg 4267681 gcccgacgaa atcctccgag ccggcgaatc tcggcgagta gccctcgtcg tagttgtagt 4267741 agccgctgca cagaaagagg aattcgcagg tgagggcgct gagcgtgccg tggctttgga 4267801 tgtgaacggt ccagcggttt tccgcggtcg accaatcggc actgatcacc ttgtggtgga 4267861 accggatatg cctgtcgatt ccatacatgg ccgcggtgct cttgacgtac tcgaggatgg 4267921 gcttgccgtc ggcgatcgcc tgccgtccgg tccagggacg gaatcggaaa cctagcgtgt 4267981 acatgtcgga gtcggagcga attccgggat aacggaacaa atcccaggtg ccgcccatgg 4268041 attcccgctt ttccaggatg gcgtagctct tggtcgggca acggtcctgc aggtgccagg 4268101 ccgcgctgac accggagatt ccagcgccca cgatgacaac gtcgaggtgc tcggtcatgg 4268161 atccacgcta tcaacgtaat gtcgaggccg tcaacgagat gtcgacacta tcgacacgta 4268221 gtaagctgcc agggtgacca cctccgcggc cagtcaggct tcgctgccta ggggccggcg 4268281 caccgcgcgg ccgtccggcg acgatcgtga actggcgatc ctcgccaccg ccgagaacct 4268341 tctcgaggac cgtccgctgg ccgatatctc ggtcgacgat ctggccaagg gcgccggtat 4268401 ctcgaggccg acgttctact tctatttccc atccaaggaa gcggtgctgc tgaccctgct 4268461 ggaccgggtg gtcaatcaag ccgacatggc cctacagacc cttgccgaga atcccgccga 4268521 caccgaccgc gagaacatgt ggcgcaccgg gatcaacgtg ttcttcgaga cattcgggtc 4268581 gcacaaggcg gtaacccgag ccggtcaggc cgccagggca accagtgtcg aagtcgccga 4268641 actgtggtcg acgtttatgc agaagtggat cgcctacacg gccgccgtga tcgacgccga 4268701 acgcgaccga ggcgcggcgc cgcgcaccct gccggcccat gaactggcca cagcgctcaa 4268761 cctgatgaac gagcggacgc tgttcgcgtc attcgccggc gaacagccct cggtgccgga 4268821 agcccgcgtg ctggatacgc tggtgcacat ctgggtgacc agcatttacg gcgagaaccg 4268881 ctaagccgca ctcggtcggg ggtgctcggt cgatgctcag tgccaaagcg gcatgcagat 4268941 ctcacggagg tccggtggac gatctggcag ccgaagtggc gccttgggta ggcaatggcg 4269001 tgcggtcata taggagcggg tgcattcgca tgtcggacac gtggcgttgc cgcctggtac 4269061 cgcggtgttc gtggccgaca gcgggctaat gcgacccggt ccacgccagg agcgtgtcgg 4269121 ccggccaggt gttgacgatc cggtcggcgg gcacctccgc gtccaaggcg cgctgggcgc 4269181 cgtagccgag gaagtccagc tggccgggtg cgtgcgcgtc ggtgtcgatg ctgaacacgc 4269241 agccgatgtc gcgcgctagg tgcaacaggc gcgtcggtgg gtctcggcgt tccggacggg 4269301 agttgatctc cacggcggtg ccgtgctcac ggcaggcggt gaacaccgcc tctgcatcga 4269361 acttcgattc tggccggatg ccacgattgc cggcgatcag ccggccggtg cagtggccca 4269421 gcacgtcggt gtgaccgttg gccacggcgc gcaccatccg tcgcgtcatc gctgccgaat 4269481 ccatcgacag cttggagtgc acgctggcca ccacgatgtc gaggcggtcc agcatctcgg 4269541 gttcctggtc caagctcccg tcttcgagga tgtcgacctc gatcccggtc aggatgcgca 4269601 gcggcgcgaa cttctcgcgc agctcgtcga tcacgtccag ctgcttgcgc aaccggtccg 4269661 gagacaggcc gttggcgatc gtcaaccgcg gtgagtgatc ggtcaatgcg cagtactggt 4269721 gacctagcgc cgccgcggtg gccatcatct cctcgatcgg cgcggacccg tccgaccagt 4269781 tcgaatgcag atgcagatcc ccgcgcaatg cggcacggat cgcccctcca ccgagatcct 4269841 cagcgtcagc gcgtaattca gccagcaggt ccggctcgcg gccagaccag gcctgggcga 4269901 tgactttcgc ggttttggga ccgatacccg ccagcgactg ccagctgttg gcctggccgt 4269961 gccgctgccg cgccgcgtcg tcaaggccct cgataatgtc ggcggcattg cgataggcca 4270021 tcacccgcct cgggtcgtgg cggttccggt ccttgtaata ggcgatctgc cgcagcgctg 4270081 ttaccgggtc cattatcggg ctcacaccag ttgcccgaag acgaccccgg tgacaaccac 4270141 cgcgaagccg gccatttcgc cgaggatgag caacgccatt aacacccccg caccctttgc 4270201 gggacgctcg aattggttcg cggtggcacg gcgcgcgcca tgggtgacat aactcgccaa 4270261 caggatgggt ttcgtatcaa atccgagggc acagttcatc gcttcactga gtttagttgg 4270321 gacctaggcc cagatgccgt cgcggcctgg ggcgccattg ccctagataa caatctgata 4270381 aagcggagca aacaagctgt ggtgcacact cgggcacgta tcaggttggc tacacagcga 4270441 agcgcaacag ctcttcagtg gttatcaggc gctcgttctt ggcggggaac tcgtggcttt 4270501 tgaccgggtg gcgaaaccat gaccaggcga ttcgccccat ccgtgaccgg ggtactgggt 4270561 tggtacgcac agcgacactc ctgcgatcgg acaactcgac tggcacctca cattaaacct 4270621 ctatgtgacg aagcccacat cgactcatta gacacctcgg agctggcaaa cagtgaacgg 4270681 cgcgccgagc aattatcaaa tgtttctgat gtgactctag tgattattga agcggtgcag 4270741 cggtcggctt aacaggcgcc ggcagggcac tggaacccat caagtaccgg tctacggccg 4270801 cggcagcggc ccggccctcg gcaatcgccc agacgatcaa tgactggccc cggcccatgt 4270861 caccggctac gaacacacca ggaaccgagg tgtcgaagtc gtcgccacgg gccacgttcc 4270921 catgctcggt gaacttcact ccgaggtcgg tcaacaggcc cgcccgttcc gggccgacga 4270981 aacccatcgc cagcaacacc aggtcggctt cgagctcgaa gtcggagccc tcaaccttga 4271041 cgaacttgcc atccagcatg gtcacttcgt gtgcccgcag cgcgctcacg cgcccgtccg 4271101 tgccgacgaa cgcctcggtg ttgaccgaga acacccgctc gccaccctcc tcatgcgcgg 4271161 ccgatacccg atacatcagc gggtaagtcg gccatggggt ggattcggcg cgggcgtccg 4271221 gtggacgcgg catgatctcg aactggtgca cggcgatcgc gccctggcgg tgcacggtac 4271281 ccaggcagtc cgccccggtg tcgccgccac cgatgatgac gaccttcttg ccctttgcgg 4271341 tgatcggcgg ctgcccgtcc tcatcgagga cgtcatctcc ttcttgcacc cggttggccc 4271401 acggcagaaa ctccatcgcc tgatggacgc cctccagctc gcggccggga atcggcagct 4271461 cgcgccaagc ggttgcgcca ccggccaata cgaccgcatc gaaatcagcg cgcagctttt 4271521 cggcgctaat gtcgaccccg acgttgacgc ccggccggaa ttcggttcct tcggagcgca 4271581 tttggtccaa acgccgatca agatgccgct tttccatctt gaattccggg atgccgtaac 4271641 gcagcagccc gccgatgcgg tcttcgcgct cgaaaacggt gacggtgtga cccgcccggg 4271701 tgagttgctg ggcggcggcc aaacccgccg gccccgaacc caccacagca accgtttgcc 4271761 cggtcagctt ccgcggcgga cgtggttgca cccatccttc gtcgaaggcc ttgtcgatga 4271821 tctccagctc gatctgcttg atcgtcaccg gatcctggtt gatgcccagc acacacgccg 4271881 gctcgcacgg agccgggcac aaccggccgg tgaagtcggg gaagttgttg gtggcgtgca 4271941 gccgttcgat tgcgtcgcgc cagcggcccc ggcggaccag atcgttccat tccgggatca 4272001 agttacccag cggacatccg ttgtgacaga acggaatgcc gcaatccatg cagcgggtcg 4272061 cctgttggcg caggctctcg ttgtcgaatt cctcgtagac ttcccgccag tctcgcagcc 4272121 gcagcgggac cggccgtcgc ttcggcaatt tccggtgggt gtatttgagg aagccgcccg 4272181 gatcagccat gcgcagccgc catgatcgcc ttgtcgacat caacgccgtc acgttcagcc 4272241 agggcgatcg cctgcaggac ccgtttgtag tcacgcggca tcaccttgac gaagtggcgc 4272301 tgctgtcccg accagtcgga cagaatccgc tggccgacag cggaatcggt agcgtcgacg 4272361 tgcacttgta tggtgccgtg cagccagtcc gcgtcatcct cgtcgagggt ctcgagttcg 4272421 accatctccg agttgaggtt ggccggcagt tcaccgtcgg gatcgtaaac ataggccaca 4272481 ccgccggaca tacccgccgc aaagttacgg ccggtgcggc ccagaatgac aaccctgccg 4272541 ccggtcatgt actcgcagcc gtgatcgccg acaccctcta ccacggcgtg ggccccggaa 4272601 ttgcgcaccg cgaaccgttc gcctaccaca ccgcgcaggt aaacctcgcc actggttgcg 4272661 ccgaacagaa tcacattgcc cccgatgatg ttgtcctcgg cgacataatc ctgcggcgcg 4272721 tcatccgacg gccgcaccac aatccggcca ccggatagcc ctttgccgac gtagtcattg 4272781 gcgtcgccat acacccgcaa ggtaattccc ttgggcacga aggctccgaa gctgtttccc 4272841 gcggatccgt cgaacgtgat atcgatggtt ccgtccggca agccttggcc gccataggcc 4272901 ttcgtcagct cgtggccgag catggtgccc accgtgcggt tgacattgcc tatggtggtg 4272961 gagaagcgga ccggcttgcc ggaatccagt gcttccctgc tcatcacgat cagctgctga 4273021 tcgagcgcct tgtctagacc gtgatcctgg cgcgaactgc agtacagatc ctgattcatg 4273081 aaggccgact ccggctcgtg gagcaccggc gccagatcca gcttatgcgc cttccagtgc 4273141 gcgcgtgcca gcgtggtgtc cagcgcacct gcctgtccaa ccgcctcgtt cacagtgcgg 4273201 aagcccaact gcgccaaata ttcccggact tcctcggcga tgaacatgaa gaagttctcc 4273261 acgaactcgg gcttcccggt gaaccgctcc cggagcaacg gattctgggt ggccacacca 4273321 accgggcacg tgtccaggtg gcacacccgc atcatgatgc agccggccac taccaacggc 4273381 gcggtcgcga atccgaactc ttctgccccg agcagcgtag cgatcatcac atcgcgaccc 4273441 gtcttgagct gaccgtccac ctggaccaca attcgatcac gtaacccgtt gagcagcaac 4273501 gtctgctgtg tctcagccag acccaactcc cagggtgctc cggcgtgctt catcgatgtc 4273561 agcggggtcg cgccggtgcc accatcgtgc cctgagatca agaccacgtc ggcgtgggct 4273621 ttggaaacgc cagccgcaac cgtccctacc ccgttttcgg agaccagctt gacgtgtacc 4273681 cgcgcggatg gattggcgtt ctttaggtcg tggatcagct gcgccagatc ctcaatggag 4273741 tagatgtcgt ggtggggcgg cggtgagatc agaccgacac cgggcgtgga gtgccggacc 4273801 tcggccaccc aagggtacac cttgtgcccc ggaagctgac ctccctcacc aggtttcgcg 4273861 ccctgcgcca tcttgatctg gaggtcggtg cagttggtca ggtaatgcga ggtgacgcca 4273921 aaccgggcgg aggctacctg cttaatggcg cttcggcgcc aatccccgtt ggggtcgcgg 4273981 tcaaatcgct tgacgtcctc gccgccttca ccacagtttg accgggcacc aagccggttc 4274041 attgcgatgg ccagcgtctc gtgcgcttca gcggaaatcg agccgtagct catcgccccc 4274101 gttgagaagc gcttgacgat ttcgctggcc ggctcgacct cgtccagcgg gactggagga 4274161 cgaaccccgg tacggaactt gagcagacca cgcagcgatg ccatccgctc gctctggtcg 4274221 tcgaccagac gggtgtactc cttgaagatc ttgtactggc cggttcgcgt ggagtgctgc 4274281 agcttgaaca cagtctccgg gttgaacagg tggtactcgc cctcgcggcg ccactggtat 4274341 tccccaccca cctcgagttc gcggtgagcg cgttcgtccg gccggtccag ataggccagc 4274401 cggtgccggg ctgcgacatc ggccgcgatg tcatccaggg tgatcccgcc ggtggggcag 4274461 gtaagcccgg tgaagtattc gtcgagcact tgctcggaga tgccgacagc ctggaacagt 4274521 tgcgcaccgg tgtaggaggc cagcgtcgag atgcccatct tcgacatcac tttcagcaca 4274581 cccttacctg cggctttgat gtagttgttc agcgccgccg tacggtcgat gccctcgata 4274641 acaccgcggt cgagcatgtc ctcgatcgac tcgaacacca ggtaggggtt gatcgcggcc 4274701 gcgccgaatc cgaccagcgc ggccatgtgg tgcacctcgc gggcatcacc ggactcgacc 4274761 accagaccca cttgggtgcg ggtccgttcc cgaaccaggt ggtggtgcac tcccgcaacg 4274821 gcgagcagcg acggtatcgg agccatttcc tcgtcggact cgcggtcgga caagatgatg 4274881 atccgagcgc cgtcggcgat tgccgccgcc gccgcgccac gtacctcttc cagcgcggca 4274941 gccagcccag cacctccctc ggagacccgg tacagacagc gaatcacctt ggaccgcaat 4275001 ccgtgtgggc gcccattgac cttgtcgttg ggatcgaggc tgaccagctt ggcgagctcg 4275061 tggttacgca gaatcggctg gggcagcacg atctggtggc aggagttctc gtccgggttg 4275121 agcaagtcac gttcgccgcc ggtggtgccc tgcaggctgg tcaccacctc ctcgcggatg 4275181 gcgtccaacg gcgggttggt cacctgggcg aacagctgat ggaagtagtc gtagagcatg 4275241 cgcggacgct gcgacaacac cgcaactgga gtgtcggtgc ccatcgaccc gattggctcg 4275301 gcaccgagcc gagccatcgg cgctaccagc aggttgagct cctcgtaggt atagccgaat 4275361 gccaactgcc gcatgacgat tcgatggtgg ggcatccgca cgtctttgcc ctccggcaat 4275421 tcgtcgagcg gaactagtcc gttgtcaagc cactcctgat acggatgctc ggccgccagg 4275481 tcggccttga tctcctcatc ggagacgatg cggccctgcg cggtgtccac caagaacatc 4275541 cggcccggct gcagccgcat ccggcgcacc accgtcgacg gatgcaggtc caacacaccg 4275601 gcctcggaag ccatcaccac caaaccgtcg tcggtgaccc agattcgcga cgggcgtagg 4275661 ccattgcggt ccagcacggc gcccacgacg gtgccgtcgg tgaacgtcat cgacgccggg 4275721 ccgtcccacg gctccatcaa cgaggcgtga tactggtaaa acgcccgccg cgcggggtcc 4275781 atcgactcgt ggcgctccca ggcctcaggg atcatcatca gcaccgcgtg ggccaggctg 4275841 cgtccgccca ggtgcagcag ttcgagcacc tcgtcgaagc gcgcggtgtc cgaggcaccc 4275901 ggggtacaga tcgggaacag cttttcgaca tcggccgccg acccaaagat gtcggtcttg 4275961 atcagcgcct cgcgggcccg catccagttc tcgttaccgg tgacggtgtt gatctccccg 4276021 ttgtgcgcga tccgccggaa tggatgcgcc agcggccagg acgggaaagt gttcgtggag 4276081 aaccgcgagt gcacgatgcc tagcgcgctg gtcagtcgct cgtcctgcaa atcgaggtag 4276141 aaggccttga gctgcggggt ggtcagcatg cccttgtaga cgagcgtctg gccggacagg 4276201 ctcgggaagt acacggtttc ccggcccggc ccgtcttgac ccggaccctt ggtgccgagt 4276261 tcatgctcgg cccgcttgcg gaccacatag cagcgccgct ccaacgccat gccggacgcg 4276321 ccagccaaga acacctgccg gaaggtgggc atggcatcac gggacagcgc gcccagcgat 4276381 gagtcgtcgg tggggacgct gcgccaaccc aggacttgca gcccctcggc ctcggcgatt 4276441 ttctgtacgg cggcgcaggc cgcggcggcg tctttagatg actgcggcaa gaacgcgata 4276501 cccgtggcat agctgcctgg ggcaggcaac tcgaaatcca cggcttcgcg aaggaattcg 4276561 tccggaacct gaatcaggat gcccgcgccg tcaccgctgc ggggttcggc gccttgcgcg 4276621 ccccgatgct cgaggttgag cagggcggtg atcgccttgt ccacgatgtc gcggctacga 4276681 cggccgtgca tgtccacaac catggcaacc ccgcacgaat cgtgttcgaa cgcggggtta 4276741 tacaacccga cgcgcttagg cgtcataccc acctaaccct tcagcagact ttctgcgcgg 4276801 ccgcctttgc ggattcgacg gggccgcacc cggaggtagc gggcaagacc ccttcggtct 4276861 tgtcgatagg ctgtccgtca agcgggcgtg atccggtcgg ggcttcgtcc gtgcagcagt 4276921 gaacgcttgg ccctggaatc ggactcgaca agtcgtaaaa cgatatgaca aaacccgctt 4276981 gacatgccaa ctttcccaat actaactcgt cagccggcgg caccgtagct gccgcgtggc 4277041 cagcaaccga ccgtatcgtc acatgcattt ttcctcgtcc aaatccggct gcgctagctg 4277101 cgtggcggtc tgatcgccag ccacaggaaa tgcttagata cgtttgctgt gaaatccgga 4277161 gcaccgctgt ttcgccactt gcgccggtgg gaacaaccgc cggaacggcg ggtatctgtg 4277221 ttgttgcatg gcgatgccgc cgcgacgact acccagcgca accccccaga gtttgcgcga 4277281 tactaaaagg ggtctaaaaa gggcgtctag acagccagca gtcagtccag ggagctagcc 4277341 gatacgggac gatattggtc ggcgtccggc atgggcgatc ttaccgtggg gctcatcagc 4277401 cgcgagctcg cctcagccgg ccaccggcgc gacaatcgat cgcctgtcac ctgaggagct 4277461 tatgtacgag cgtgacgaat tcctgcgcga tcggatccga ccacaccagc ccggcacccc 4277521 gcggggatac tcgccccgtc cgccgtccgg agatcgctgc cccgcgccac cgcctggccg 4277581 gcacgctgct gccgctacgc caccagggcc gccgcgcctg ccttcagctc cactgcgtcc 4277641 attgccggac ccggcttggc cacgccagcc ggaggccccg ccaccgagca cctgggccga 4277701 ccccgccctg gcgccgatac gcagtcggac gcgacccggc gagcgtggtt ggcgacgcat 4277761 ggtgcggctg gtcacctttg gccttgtcgg cctgggccgg tcgggcatgc agcgccagga 4277821 ggcccaattc gaagcaacga tacgaaccgt cctgcatggc aaccacaagg tcgccgtgct 4277881 gggcaaagga ggtgtgggaa agacgtcggt tgcggcgtgc gtcggatcga tccttgccga 4277941 actgcgccag caggaccgta tcgtcgggat cgacgccgac accgccttcg gcaggctgag 4278001 cagccgaatc gatcctcgag cagctggttc gttctgggag ctgaccaccg acacgaatct 4278061 gcggtccttc accgatatca ccgcgcgcct gggccgaaat tccgcgggac tgtacgtcct 4278121 ggcaggccag ccggcatccg gtccgcgccg ggcgctcgat ccggccatct accgcgaagc 4278181 cgccctaagg ttggatcacc atttcgcaat ctcggtgatc gactgcggtt cctccatgga 4278241 ggcggcggtc acccaggaag tattgcgcga tgtggatgct ctgatcgtgg tgtcctcgcc 4278301 ctgggcggat ggtgcctccg ctgccgccaa caccatcgaa tggctgtcgg attatggcct 4278361 gacaggtttg ttgcgacgca gcatcgtggt gctcaacgat tcggacggac acgccgacaa 4278421 gcgcaccaag tcattgctgg cccaggaatt catcgaccac gggcagcctg tggtcgaggt 4278481 gcccttcgat ccccatttgc ggcccggggg ggtcatcgat atgagccacg aaatggcccc 4278541 gacgacgcgg ctgaaaatcc tgcaggtcgc cgcgacggtg acggcgtact tcgcgtcgcg 4278601 acccgccgac gcacacggca gcccgccccg gtgacctggc tggctgaccc ggtcggcaac 4278661 agcaggatcg cccgagcgca ggcctgcaaa acgtcaatct cggcgcccat cgtcgaatcc 4278721 tggcgggcgc aacgcggcgc gcaatgtgga cagcgcgaga aatcttgtcg atgttctcgc 4278781 gctgtccaca tccagggcat ctcaccgcca ctgttccgca gacccctcga accagcggtc 4278841 caggcggcgg ttgcgtcatg ccgattgggc agacacccgg tggtcgcgca ccgggtaacc 4278901 gttgcgctcg gccagggatc gcagctggcc caacgcgaat gcccgcgccc ggcctgattc 4278961 gggaattacg acccctgccc acagcccttc cgcacccgcg gactcgacgg cgtcgcgtgc 4279021 acacagccac cggcgcgggc aagcccggca cagggtcttg gcctcgtcgt cgggagtcgt 4279081 cgtccaacga tcgggatctt gcgtgcaaac gccgagcggg acctcataca gggcggttac 4279141 tgtcatgtct acgttcctcc agaaagcgtt gcaggttgta gcctctgccg cgaaagcgta 4279201 tcgcattaac catagcgatg caacagtttc ctcctctgcc tgcctagcgg tgctgcggct 4279261 ccggttcggc gagctccgag tctagtgcgc gcaccgccga gtaccagggc atagatcctg 4279321 ttaatcagct gtgtatctgg cctcgccggc gcgtatccga ccccttcggg cagatcttcc 4279381 aggaaaagtg ttctgacatg cgacagttca ggtgtaaagt gaactgtagc ggcagttcgg 4279441 tttggctagg aaactatttc catagcgggc cgtcgcgtcg ctagatccaa aatgtagcga 4279501 agtcatagca gtagaagggt gcaacggtta ggatggcggg cgagcggaaa gtctgcccac 4279561 cgtcccggct agtacccgcg aataagggat caacgcagat gtctaaagca gggtcgactg 4279621 tcggaccggc gccgctggtc gcgtgcagcg gcggcacatc agacgtgatt gagccccgtc 4279681 gcggtgtcgc gatcattggc cactcgtgcc gagtcggcac ccagatcgac gattctcgaa 4279741 tctctcagac acatctgcga gcggtatccg atgatggacg gtggcggatc gtcggcaaca 4279801 tcccgagagg tatgttcgtc ggcggacgac gcggcagctc ggtgaccgtc agcgataaga 4279861 ccctaatccg attcggcgat ccccctggag gcaaggcgtt gacgttcgaa gtcgtcaggc 4279921 cgtcggattc cgctgcacag cacggccgcg tacaaccatc agcggacctg tcggacgacc 4279981 cggcgcacaa cgctgcgccg gtcgcaccgg accccggcgt ggttcgcgca ggggcggccg 4280041 cggctgcgcg ccgtcgtgaa cttgacatca gccaacgcag cttggcggcc gacgggatca 4280101 tcaacgcggg cgcgctcatc gcgttcgaga aaggccgtag ttggccccgg gaacggaccc 4280161 gggcaaaact cgaagaagtg ctgcagtggc ccgctggaac catcgcgcga atccgtcggg 4280221 gcgagcccac cgagcccgca acaaaccccg acgcgtcccc cggactccgg cctgccgacg 4280281 gcccggcgtc cttgatcgcg caggctgtca ccgccgccgt agacggctgc agtctggcta 4280341 tcgcagcgtt gccggcgacc gaggaccccg agttcaccga acgtgccgcg ccgatccttg 4280401 ctgatttgcg ccagctcgag gcgattgccg tccaagcaac ccgcatcagc cggattaccc 4280461 cggaattgat caaggcgttg ggcgcggtac gtcgccacca cgacgaatta atgaggctgg 4280521 gagcaaccgc ccctggtgcc acactggcgc agcgcttata tgccgcacgg cggcgcgcga 4280581 acctttccac cctggagact gcccaagcgg ccggcgtcgc agaagaaatg atcgtcggcg 4280641 ccgaagccga ggaagagttg ccagccgagg ccaccgaagc gatcgaagca ctgatccgtc 4280701 agatcaattg aggtcggctc cgagcgtccc acaagtacag gcacgccgta acgctcaagt 4280761 tcaacggtcc ggggaacgcg cgcgttctcc ggcgtttgac ggtgcgttcc atcgtgccgc 4280821 gaacttgaaa acgccagcgt caccaaaaaa ttcgtgcacc aacccccctc cgagcgctgc 4280881 taagctcaat gtgcagtgca aaggtgcaga taatgatggc gcaccggaac ggcgagcgta 4280941 aggaaacaca taaatggcat cgggtagcgg tctttgcaag acgacgagta actttatttg 4281001 gggccagtta ctcttgcttg gagagggaat ccccgaccca ggcgacattt tcaacaccgg 4281061 ttcgtcgctg ttcaaacaaa tcagcgacaa aatgggactc gccattccgg gcaccaactg 4281121 gatcggccaa gcggcggaag cttacctaaa ccagaacatc gcgcaacaac ttcgcgcaca 4281181 ggtgatgggc gatctcgaca aattaaccgg caacatgatc tcgaatcagg ccaaatacgt 4281241 ctccgatacg cgcgacgtcc tgcgggccat gaagaagatg attgacggtg tctacaaggt 4281301 ttgtaagggc ctcgaaaaga ttccgctgct cggccacttg tggtcgtggg agctcgcaat 4281361 ccctatgtcc ggcatcgcga tggccgttgt cggcggcgca ttgctctatc taacgattat 4281421 gacgctgatg aatgcgacca acctgagggg aattctcggc aggctgatcg agatgttgac 4281481 gaccttgcca aagttccccg gcctgcccgg gttgcccagc ctgcccgaca tcatcgacgg 4281541 cctctggccg ccgaagttgc ccgacattcc gatccccggc ctgcccgaca tcccgggcct 4281601 acccgacttc aaatggccgc ccacccccgg cagcccgttg ttccccgacc tcccgtcgtt 4281661 cccagggttc cccgggttcc cctccctacc cggtttcccc gggctccccg ggttcccgga 4281721 gttccccgcc atccccgggt tccccgcact gcccgggttg cccagcattc ccaacttgtt 4281781 ccccggcttg ccgggtctgg gcgacctgct gcccggcgta ggcgatttgg gcaagttacc 4281841 cacctggact gagctggccg ctttgcctga cttcttgggc ggcttcgccg gcctgcccag 4281901 cttgggtttt ggcaatctgc tcagctttgc cagtttgccc accgtgggtc aggtgaccgc 4281961 caccatgggt cagctgcaac agctcgtggc ggccggcggt ggccccagcc aactggccag 4282021 catgggcagc caacaagcgc aactgatctc gtcgcaggcc cagcaaggag gccagcagca 4282081 cgccaccctc gtgagcgaca agaaggaaga cgaggaaggc gcggccgcag gcgtggccga 4282141 ggcggagcgt gcacccatcg acgctggcac cgcggccagc caacgggggc aggaggggac 4282201 cgtcctttga tcggacaccg agtcgccagc aggtctgtgc catagcgagt cgaagccata 4282261 gcgagtagaa agttaaacgt agaggagggt tcaacccatg accggatttc tcggtgtcgt 4282321 gccttcgttc ctgaaggtgc tggcgggcat gcacaacgag atcgtgggtg atatcaaaag 4282381 ggcgaccgat acggtcgccg ggattagcgg acgagttcag cttacccatg gttcgttcac 4282441 gtcgaaattc aatgacacgc tgcaagagtt tgagaccacc cgtagcagca cgggcacggg 4282501 tttgcaggga gtcaccagcg gactggccaa taatctgctc gcagccgccg gcgcctacct 4282561 caaggccgac gatggcctag ccggtgttat cgacaagatt ttcggttgat catgacgggt 4282621 ccgtccgctg caggccgcgc gggcaccgcc gacaacgtgg tcggcgtcga ggtaaccatc 4282681 gacggcatgt tggtgatcgc cgatcggtta cacctggttg atttccctgt cacgcttggg 4282741 attcggccga atatcccgca agaggatctg cgagacatcg tctgggaaca ggtgcagcgt 4282801 gacctcacag cgcaaggggt gctcgacctc cacggggagc cccaaccgac ggtcgcggag 4282861 atggtcgaaa ccctgggcag gccagatcgg accttggagg gtcgctggtg gcggcgcgac 4282921 attggcggcg tcatggtgcg cttcgtcgtg tgccgcaggg gcgaccgcca tgtgatcgcg 4282981 gcgcgcgacg gcgacatgct ggtgctgcag ttggtggcgc cgcaggtcgg cttggcgggc 4283041 atggtgacag cggtgctggg gcccgccgaa cccgccaacg tcgaacccct gacgggtgtg 4283101 gcaaccgagc tagccgaatg cacaaccgcg tcccaattga cgcaatacgg tatcgcaccg 4283161 gcctcggccc gcgtctatgc cgagatcgtg ggtaacccga ccggctgggt ggagatcgtt 4283221 gccagccaac gccaccccgg cggcaccacg acgcagaccg acgccgccgc tggcgtcctg 4283281 gactccaagc tcggtaggct ggtgtcgctt ccccgccgtg ttggaggcga cctgtacgga 4283341 agcttcctgc ccggcactca gcagaacttg gagcgtgcgc tggacggctt gctagagctg 4283401 ctccctgcgg gcgcttggct agatcacacc tcagatcacg cacaagcctc ctcccgaggc 4283461 tgacccctca catctccgct acgacttcag aaagggacgc catggtggac ccgccgggca 4283521 acgacgacga ccacggtgat ctcgacgccc tcgatttctc cgccgcccac accaacgagg 4283581 cgtcgccgct ggacgcctta gacgactatg cgccggtgca gaccgatgac gccgaaggcg 4283641 acctggacgc cctccatgcg ctcaccgaac gcgacgagga gccggagctg gagttgttca 4283701 cggtgaccaa ccctcaaggg tcggtgtcgg tctcaaccct gatggacggc agaatccagc 4283761 acgtcgagct gacggacaag gcgaccagca tgtccgaagc gcagctggcc gacgagatct 4283821 tcgttattgc cgatctggcc cgccaaaagg cgcgggcgtc gcagtacacg ttcatggtgg 4283881 agaacatcgg tgaactgacc gacgaagacg cagaaggcag cgccctgctg cgggaattcg 4283941 tggggatgac cctgaatctg ccgacgccgg aagaggctgc cgcagccgaa gccgaagtgt 4284001 tcgccacccg ctacgatgtc gactacacct cccggtacaa ggccgatgac tgatcgcttg 4284061 gccagtctgt tcgaaagcgc cgtcagcatg ttgccgatgt cggaggcgcg gtcgctagat 4284121 ctgttcaccg agatcaccaa ctacgacgaa tccgcttgcg acgcatggat cggccggatc 4284181 cggtgtgggg acaccgaccg ggtgacgctg tttcgcgcct ggtattcgcg ccgcaatttc 4284241 ggacagttgt cgggatcggt ccagatctcg atgagcacgt taaacgccag gattgccatc 4284301 ggggggctgt acggcgatat cacctacccg gtcacctcgc cgctagcgat caccatgggc 4284361 tttgccgcat gcgaggcagc gcaaggcaat tacgccgacg ccatggaggc cttagaggcc 4284421 gccccggtcg cgggttccga gcacctggtg gcgtggatga aggcggttgt ctacggcgcg 4284481 gccgaacgct ggaccgacgt gatcgaccag gtcaagagtg ctgggaaatg gccggacaag 4284541 tttttggccg gcgcggccgg tgtggcgcac ggggttgccg cggcaaacct ggccttgttc 4284601 accgaagccg aacgccgact caccgaggcc aacgactcgc ccgccggtga ggcgtgtgcg 4284661 cgcgccatcg cctggtatct ggcgatggca cggcgcagcc agggcaacga aagcgccgcg 4284721 gtggcgctgc tggaatggtt acagaccact caccccgagc ccaaagtggc tgtggcgctg 4284781 aaggatccct cctaccggct gaagacgacc accgccgaac agatcgcatc ccgcgccgat 4284841 ccctgggatc cgggcagtgt cgtgaccgac aactccggcc gggagcggct gctcgccgag 4284901 gcccaagccg aactcgaccg ccaaattggg ctcacccggg ttaaaaatca gattgaacgc 4284961 taccgcgcgg cgacgctgat ggcccgggtc cgcgccgcca agggtatgaa ggtcgcccag 4285021 cccagcaagc acatgatctt caccggaccg cccggtaccg gcaagaccac gatcgcgcgg 4285081 gtggtggcca atatcctggc cggcttaggc gtcattgccg aacccaaact cgtcgagacg 4285141 tcgcgcaagg acttcgtcgc cgagtacgag gggcaatcgg cggtcaagac cgctaagacg 4285201 atcgatcagg cgctgggcgg ggtgcttttc atcgacgagg cttatgcgct ggtgcaggaa 4285261 agagacggcc gcaccgatcc gttcggtcaa gaggcgctgg acacgctgct ggcgcggatg 4285321 gagaacgacc gggaccggct ggtggtgatc atcgccgggt acagctccga catagatcgg 4285381 ctgctggaaa ccaacgaggg tctgcggtcg cggttcgcca ctcgcatcga gttcgacacc 4285441 tattcccccg aggaactcct cgagatcgcc aacgtcattg ccgctgctga tgattcggcg 4285501 ttgaccgcag aggcggccga gaactttctt caggccgcca agcagttgga gcagcgcatg 4285561 ttgcgcggcc ggcgcgccct ggacgtcgcc ggcaacggtc ggtatgcgcg ccagctggtg 4285621 gaggccagcg agcaatgccg ggacatgcgt ctagcccagg tcctcgatat cgacaccctc 4285681 gacgaagacc ggcttcgcga gatcaacggc tcagatatgg cggaggctat cgccgcggtg 4285741 cacgcacacc tcaacatgag agaatgaact atggggcttc gcctcaccac caaggttcag 4285801 gttagcggct ggcgttttct gctgcgccgg ctcgaacacg ccatcgtgcg ccgggacacc 4285861 cggatgtttg acgacccgct gcagttctac agccgctcga tcgctcttgg catcgtcgtc 4285921 gcggtcctga ttctggcggg tgccgcgctg ctggcgtact tcaaaccaca aggcaaactc 4285981 ggcggcacca gcctgttcac cgaccgcgcg accaaccagc tttacgtgct gctgtccgga 4286041 cagttgcatc cggtctacaa cctgacttcg gcgcggctgg tgctgggcaa tccggccaac 4286101 ccggccaccg tgaagtcctc cgaactgagc aagctgccga tgggccagac cgttggaatc 4286161 cccggcgccc cctacgccac gcctgtttcg gcgggcagca cctcgatctg gaccctatgc 4286221 gacaccgtcg cccgagccga ctccacttcc ccggtagtgc agaccgcggt catcgcgatg 4286281 ccgttggaga tcgatgcttc gatcgatccg ctccagtcac acgaagcggt gctggtgtcc 4286341 taccagggcg aaacctggat cgtcacaact aagggacgcc acgccataga tctgaccgac 4286401 cgcgccctca cctcgtcgat ggggataccg gtgacggcca ggccaacccc gatctcggag 4286461 ggcatgttca acgcgctgcc tgatatgggg ccctggcagc tgccgccgat accggcggcg 4286521 ggcgcgccca attcgcttgg cctacctgat gatctagtga tcggatcggt cttccagatc 4286581 cacaccgaca agggcccgca atactatgtg gtgctgcccg acggcatcgc gcaggtcaac 4286641 gcgacaaccg ctgcggcgct gcgcgccacc caggcgcacg ggctggtcgc gccaccggca 4286701 atggtgccca gtctggtcgt cagaatcgcc gaacgggtat acccctcacc gctacccgat 4286761 gaaccgctca agatcgtgtc ccggccgcag gatcccgcgc tgtgctggtc atggcaacgc 4286821 agcgccggcg accagtcgcc gcagtcaacg gtgctgtccg gccggcatct gccgatatcg 4286881 ccctcagcga tgaacatggg gatcaagcag atccacggga cggcgaccgt ttacctcgac 4286941 ggcggaaaat tcgtggcact gcaatccccc gatcctcgat acaccgaatc gatgtactac 4287001 atcgatccac agggcgtgcg ttatggggtg cctaacgcgg agacagccaa gtcgctgggc 4287061 ctgagttcac cccaaaacgc gccctgggag atcgttcgtc tcctggtcga cggtccggtg 4287121 ctgtcgaaag atgccgcact gctcgagcac gacacgctgc ccgctgaccc tagcccccga 4287181 aaagttcccg ccggagcctc cggagccccc tgatgacgac caagaagttc actcccacca 4287241 ttacccgtgg cccccggttg accccgggcg agatcagcct cacgccgccc gatgacctgg 4287301 gcatcgacat cccaccgtcg ggcgtccaaa agatccttcc ctacgtgatg ggtggcgcca 4287361 tgctcggcat gatcgccatc atggtggccg gcggcaccag gcagctgtcg ccgtacatgt 4287421 tgatgatgcc gctgatgatg atcgtgatga tggtcggcgg tctggccggt agcaccggtg 4287481 gtggcggcaa gaaggtgccc gaaatcaacg ccgaccgcaa ggagtacctg cggtatttgg 4287541 caggactacg cacccgagtg acgtcctcgg ccacctctca ggtggcgttc ttctcctacc 4287601 acgcaccgca tcccgaggat ctgttgtcga tcgtcggcac ccaacggcag tggtcccggc 4287661 cggccaacgc cgacttctat gcggccaccc gaatcggtat cggtgaccag ccggcggtgg 4287721 atcgattatt gaagccggcc gtcggcgggg agttggccgc cgccagcgca gcacctcagc 4287781 cgttcctgga gccggtcagt catatgtggg tggtcaagtt tctacgaacc catggattga 4287841 tccatgactg cccgaaactg ctgcaactcc gtacctttcc gactatcgcg atcggcgggg 4287901 acttggcggg ggcagccggc ctgatgacgg cgatgatctg tcacctagcc gtgttccacc 4287961 caccggacct gctgcagatc cgggtgctca ccgaggaacc cgacgacccc gactggtcct 4288021 ggctcaaatg gcttccgcac gtacagcacc agaccgaaac cgatgcggcc gggtccaccc 4288081 ggctgatctt cacgcgccag gaaggtctgt cggacctggc cgcgcgcggg ccacacgcac 4288141 ccgattcgct tcccggcggc ccctacgtag tcgtcgtcga cctgaccggc ggcaaggctg 4288201 gattcccgcc cgacggtagg gccggtgtca cggtgatcac gttgggcaac catcgcggct 4288261 cggcctaccg catcagggtg cacgaggatg ggacggctga tgaccggctc cctaaccaat 4288321 cgtttcgcca ggtgacatcg gtcaccgatc ggatgtcgcc gcagcaagcc agccgtatcg 4288381 cgcgaaagtt ggccggatgg tccatcacgg gcaccatcct cgacaagacg tcgcgggtcc 4288441 agaagaaggt ggccaccgac tggcaccagc tggtcggtgc gcaaagtgtc gaggagataa 4288501 caccttcccg ctggaggatg tacaccgaca ccgaccgtga ccggctaaag atcccgtttg 4288561 gtcatgaact aaagaccggc aacgtcatgt acctggacat caaagagggc gcggaattcg 4288621 gcgccggacc gcacggcatg ctcatcggga ccacggggtc tgggaagtcc gaattcctgc 4288681 gcaccctgat cctgtcgctg gtggcaatga ctcatccaga tcaggtgaat ctcctgctca 4288741 ccgacttcaa aggtggttca accttcctgg gaatggaaaa gcttccgcac actgccgctg 4288801 tcgtcaccaa catggccgag gaagccgagc tcgtcagccg gatgggcgag gtgttgaccg 4288861 gagaactcga tcggcgccag tcgatcctcc gacaggccgg gatgaaagtc ggcgcggccg 4288921 gagccctgtc cggcgtggcc gaatacgaga agtaccgcga acgcggtgcc gacctacccc 4288981 cgctgccaac gcttttcgtc gtcgtcgacg agttcgccga gctgttgcag agtcacccgg 4289041 acttcatcgg gctgttcgac cggatctgcc gcgtcgggcg gtcgctgagg gtccatctgc 4289101 tgctggctac ccagtcgctg cagaccggcg gtgttcgcat cgacaaactg gagccaaacc 4289161 tgacatatcg aatcgcattg cgcaccacca gctctcatga atccaaggcg gtaatcggca 4289221 caccggaggc gcagtacatc accaacaagg agagcggtgt cgggtttctc cgggtcggca 4289281 tggaagaccc ggtcaagttc agcaccttct acatcagtgg gccatacatg ccgccggcgg 4289341 caggcgtcga aaccaatggt gaagccggag ggcccggtca acagaccact agacaagccg 4289401 cgcgcattca caggttcacc gcggcaccgg ttctcgagga ggcgccgaca ccgtgacccg 4289461 cgccggcgac gatgcaaagc gcagcgatga ggaggagcgg cgccaacggc ccgcgccggc 4289521 gacgatgcaa agcgcagcga tgaggaggag cggcgcgcat gactgctgaa ccggaagtac 4289581 ggacgctgcg cgaggttgtg ctggaccagc tcggcactgc tgaatcgcgt gcgtacaaga 4289641 tgtggctgcc gccgttgacc aatccggtcc cgctcaacga gctcatcgcc cgtgatcggc 4289701 gacaacccct gcgatttgcc ctggggatca tggatgaacc gcgccgccat ctacaggatg 4289761 tgtggggcgt agacgtttcc ggggccggcg gcaacatcgg tattgggggc gcacctcaaa 4289821 ccgggaagtc gacgctactg cagacgatgg tgatgtcggc cgccgccaca cactcaccgc 4289881 gcaacgttca gttctattgc atcgacctag gtggcggcgg gctgatctat ctcgaaaacc 4289941 ttccacacgt cggtggggta gccaatcggt ccgagcccga caaggtcaac cgggtggtcg 4290001 cagagatgca agccgtcatg cggcaacggg aaaccacctt caaggaacac cgagtgggct 4290061 cgatcgggat gtaccggcag ctgcgtgacg atccaagtca acccgttgcg tccgatccat 4290121 acggcgacgt ctttctgatc atcgacggat ggcccggttt tgtcggcgag ttccccgacc 4290181 ttgaggggca ggttcaagat ctggccgccc aggggctggc gttcggcgtc cacgtcatca 4290241 tctccacgcc acgctggaca gagctgaagt cgcgtgttcg cgactacctc ggcaccaaga 4290301 tcgagttccg gcttggtgac gtcaatgaaa cccagatcga ccggattacc cgcgagatcc 4290361 cggcgaatcg tccgggtcgg gcagtgtcga tggaaaagca ccatctgatg atcggcgtgc 4290421 ccaggttcga cggcgtgcac agcgccgata acctggtgga ggcgatcacc gcgggggtga 4290481 cgcagatcgc ttcccagcac accgaacagg cacctccggt gcgggtcctg ccggagcgta 4290541 tccacctgca cgaactcgac ccgaacccgc cgggaccaga gtccgactac cgcactcgct 4290601 gggagattcc gatcggcttg cgcgagacgg acctgacgcc ggctcactgc cacatgcaca 4290661 cgaacccgca cctactgatc ttcggtgcgg ccaaatcggg caagacgacc attgcccacg 4290721 cgatcgcgcg cgccatttgt gcccgaaaca gtccccagca ggtgcggttc atgctcgcgg 4290781 actaccgctc gggcctgctg gacgcggtgc cggacaccca tctgctgggc gccggcgcga 4290841 tcaaccgcaa cagcgcgtcg ctagacgagg ccgttcaagc actggcggtc aacctgaaga 4290901 agcggttgcc gccgaccgac ctgacgacgg cgcagctacg ctcgcgttcg tggtggagcg 4290961 gatttgacgt cgtgcttctg gtcgacgatt ggcacatgat cgtgggtgcc gccgggggga 4291021 tgccgccgat ggcaccgctg gccccgttat tgccggcggc ggcagatatc gggttgcaca 4291081 tcattgtcac ctgtcagatg agccaggctt acaaggcaac catggacaag ttcgtcggcg 4291141 ccgcattcgg gtcgggcgct ccgacaatgt tcctttcggg cgagaagcag gaattcccat 4291201 ccagtgagtt caaggtcaag cggcgccccc ctggccaggc atttctcgtc tcgccagacg 4291261 gcaaagaggt catccaggcc ccctacatcg agcctccaga agaagtgttc gcagcacccc 4291321 caagcgccgg ttaagattat ttcattgccg gtgtagcagg acccgagctc agcccggtaa 4291381 tcgagttcgg gcaatgctga ccatcgggtt tgtttccggc tataaccgaa cggtttgtgt 4291441 acgggataca aatacaggga gggaagaagt aggcaaatgg aaaaaatgtc acatgatccg 4291501 atcgctgccg acattggcac gcaagtgagc gacaacgctc tgcacggcgt gacggccggc 4291561 tcgacggcgc tgacgtcggt gaccgggctg gttcccgcgg gggccgatga ggtctccgcc 4291621 caagcggcga cggcgttcac atcggagggc atccaattgc tggcttccaa tgcatcggcc 4291681 caagaccagc tccaccgtgc gggcgaagcg gtccaggacg tcgcccgcac ctattcgcaa 4291741 atcgacgacg gcgccgccgg cgtcttcgcc taataggccc ccaacacatc ggagggagtg 4291801 atcaccatgc tgtggcacgc aatgccaccg gagctaaata ccgcacggct gatggccggc 4291861 gcgggtccgg ctccaatgct tgcggcggcc gcgggatggc agacgctttc ggcggctctg 4291921 gacgctcagg ccgtcgagtt gaccgcgcgc ctgaactctc tgggagaagc ctggactgga 4291981 ggtggcagcg acaaggcgct tgcggctgca acgccgatgg tggtctggct acaaaccgcg 4292041 tcaacacagg ccaagacccg tgcgatgcag gcgacggcgc aagccgcggc atacacccag 4292101 gccatggcca cgacgccgtc gctgccggag atcgccgcca accacatcac ccaggccgtc 4292161 cttacggcca ccaacttctt cggtatcaac acgatcccga tcgcgttgac cgagatggat 4292221 tatttcatcc gtatgtggaa ccaggcagcc ctggcaatgg aggtctacca ggccgagacc 4292281 gcggttaaca cgcttttcga gaagctcgag ccgatggcgt cgatccttga tcccggcgcg 4292341 agccagagca cgacgaaccc gatcttcgga atgccctccc ctggcagctc aacaccggtt 4292401 ggccagttgc cgccggcggc tacccagacc ctcggccaac tgggtgagat gagcggcccg 4292461 atgcagcagc tgacccagcc gctgcagcag gtgacgtcgt tgttcagcca ggtgggcggc 4292521 accggcggcg gcaacccagc cgacgaggaa gccgcgcaga tgggcctgct cggcaccagt 4292581 ccgctgtcga accatccgct ggctggtgga tcaggcccca gcgcgggcgc gggcctgctg 4292641 cgcgcggagt cgctacctgg cgcaggtggg tcgttgaccc gcacgccgct gatgtctcag 4292701 ctgatcgaaa agccggttgc cccctcggtg atgccggcgg ctgctgccgg atcgtcggcg 4292761 acgggtggcg ccgctccggt gggtgcggga gcgatgggcc agggtgcgca atccggcggc 4292821 tccaccaggc cgggtctggt cgcgccggca ccgctcgcgc aggagcgtga agaagacgac 4292881 gaggacgact gggacgaaga ggacgactgg tgagctcccg taatgacaac agacttcccg 4292941 gccacccggg ccggaagact tgccaacatt ttggcgagga aggtaaagag agaaagtagt 4293001 ccagcatggc agagatgaag accgatgccg ctaccctcgc gcaggaggca ggtaatttcg 4293061 agcggatctc cggcgacctg aaaacccaga tcgaccaggt ggagtcgacg gcaggttcgt 4293121 tgcagggcca gtggcgcggc gcggcgggga cggccgccca ggccgcggtg gtgcgcttcc 4293181 aagaagcagc caataagcag aagcaggaac tcgacgagat ctcgacgaat attcgtcagg 4293241 ccggcgtcca atactcgagg gccgacgagg agcagcagca ggcgctgtcc tcgcaaatgg 4293301 gcttctgacc cgctaatacg aaaagaaacg gagcaaaaac atgacagagc agcagtggaa 4293361 tttcgcgggt atcgaggccg cggcaagcgc aatccaggga aatgtcacgt ccattcattc 4293421 cctccttgac gaggggaagc agtccctgac caagctcgca gcggcctggg gcggtagcgg 4293481 ttcggaggcg taccagggtg tccagcaaaa atgggacgcc acggctaccg agctgaacaa 4293541 cgcgctgcag aacctggcgc ggacgatcag cgaagccggt caggcaatgg cttcgaccga 4293601 aggcaacgtc actgggatgt tcgcataggg caacgccgag ttcgcgtaga atagcgaaac 4293661 acgggatcgg gcgagttcga ccttccgtcg gtctcgccct ttctcgtgtt tatacgtttg 4293721 agcgcactct gagaggttgt catggcggcc gactacgaca agctcttccg gccgcacgaa 4293781 ggtatggaag ctccggacga tatggcagcg cagccgttct tcgaccccag tgcttcgttt 4293841 ccgccggcgc ccgcatcggc aaacctaccg aagcccaacg gccagactcc gcccccgacg 4293901 tccgacgacc tgtcggagcg gttcgtgtcg gccccgccgc cgccaccccc acccccacct 4293961 ccgcctccgc caactccgat gccgatcgcc gcaggagagc cgccctcgcc ggaaccggcc 4294021 gcatctaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc cccacccaaa 4294081 ccacccacac cccccatgcc catcgccgga cccgaaccgg ccccacccaa accacccaca 4294141 cctccgatgc ccatcgccgg acctgcaccc accccaaccg aatcccagtt ggcgcccccc 4294201 agaccaccga caccacaaac gccaaccgga gcgccgcagc aaccggaatc accggcgccc 4294261 cacgtaccct cgcacgggcc acatcaaccc cggcgcaccg caccagcacc gccctgggca 4294321 aagatgccaa tcggcgaacc cccgcccgct ccgtccagac cgtctgcgtc cccggccgaa 4294381 ccaccgaccc ggcctgcccc ccaacactcc cgacgtgcgc gccggggtca ccgctatcgc 4294441 acagacaccg aacgaaacgt cgggaaggta gcaactggtc catccatcca ggcgcggctg 4294501 cgggcagagg aagcatccgg cgcgcagctc gcccccggaa cggagccctc gccagcgccg 4294561 ttgggccaac cgagatcgta tctggctccg cccacccgcc ccgcgccgac agaacctccc 4294621 cccagcccct cgccgcagcg caactccggt cggcgtgccg agcgacgcgt ccaccccgat 4294681 ttagccgccc aacatgccgc ggcgcaacct gattcaatta cggccgcaac cactggcggt 4294741 cgtcgccgca agcgtgcagc gccggatctc gacgcgacac agaaatcctt aaggccggcg 4294801 gccaaggggc cgaaggtgaa gaaggtgaag ccccagaaac cgaaggccac gaagccgccc 4294861 aaagtggtgt cgcagcgcgg ctggcgacat tgggtgcatg cgttgacgcg aatcaacctg 4294921 ggcctgtcac ccgacgagaa gtacgagctg gacctgcacg ctcgagtccg ccgcaatccc 4294981 cgcgggtcgt atcagatcgc cgtcgtcggt ctcaaaggtg gggctggcaa aaccacgctg 4295041 acagcagcgt tggggtcgac gttggctcag gtgcgggccg accggatcct ggctctagac 4295101 gcggatccag gcgccggaaa cctcgccgat cgggtagggc gacaatcggg cgcgaccatc 4295161 gctgatgtgc ttgcagaaaa agagctgtcg cactacaacg acatccgcgc acacactagc 4295221 gtcaatgcgg tcaatctgga agtgctgccg gcaccggaat acagctcggc gcagcgcgcg 4295281 ctcagcgacg ccgactggca tttcatcgcc gatcctgcgt cgaggtttta caacctcgtc 4295341 ttggctgatt gtggggccgg cttcttcgac ccgctgaccc gcggcgtgct gtccacggtg 4295401 tccggtgtcg tggtcgtggc aagtgtctca atcgacggcg cacaacaggc gtcggtcgcg 4295461 ttggactggt tgcgcaacaa cggttaccaa gatttggcga gccgcgcatg cgtggtcatc 4295521 aatcacatca tgccgggaga acccaatgtc gcagttaaag acctggtgcg gcatttcgaa 4295581 cagcaagttc aacccggccg ggtcgtggtc atgccgtggg acaggcacat tgcggccgga 4295641 accgagattt cactcgactt gctcgaccct atctacaagc gcaaggtcct cgaattggcc 4295701 gcagcgctat ccgacgattt cgagagggct ggacgtcgtt gagcgcacct gctgttgctg 4295761 ctggtcctac cgccgcgggg gcaaccgctg cgcggcctgc caccacccgg gtgacgatcc 4295821 tgaccggcag acggatgacc gatttggtac tgccagcggc ggtgccgatg gaaacttata 4295881 ttgacgacac cgtcgcggtg ctttccgagg tgttggaaga cacgccggct gatgtactcg 4295941 gcggcttcga ctttaccgcg caaggcgtgt gggcgttcgc tcgtcccgga tcgccgccgc 4296001 tgaagctcga ccagtcactc gatgacgccg gggtggtcga cgggtcactg ctgactctgg 4296061 tgtcagtcag tcgcaccgag cgctaccgac cgttggtcga ggatgtcatc gacgcgatcg 4296121 ccgtgcttga cgagtcacct gagttcgacc gcacggcatt gaatcgcttt gtgggggcgg 4296181 cgatcccgct tttgaccgcg cccgtcatcg ggatggcgat gcgggcgtgg tgggaaactg 4296241 ggcgtagctt gtggtggccg ttggcgattg gcatcctggg gatcgctgtg ctggtaggca 4296301 gcttcgtcgc gaacaggttc taccagagcg gccacctggc cgagtgccta ctggtcacga 4296361 cgtatctgct gatcgcaacc gccgcagcgc tggccgtgcc gttgccgcgc ggggtcaact 4296421 cgttgggggc gccacaagtt gccggcgccg ctacggccgt gctgtttttg accttgatga 4296481 cgcggggcgg ccctcggaag cgtcatgagt tggcgtcgtt tgccgtgatc accgctatcg 4296541 cggtcatcgc ggccgccgct gccttcggct atggatacca ggactgggtc cccgcggggg 4296601 ggatcgcatt cgggctgttc attgtgacga atgcggccaa gctgaccgtc gcggtcgcgc 4296661 ggatcgcgct gccgccgatt ccggtacccg gcgaaaccgt ggacaacgag gagttgctcg 4296721 atcccgtcgc gaccccggag gctaccagcg aagaaacccc gacctggcag gccatcatcg 4296781 cgtcggtgcc cgcgtccgcg gtccggctca ccgagcgcag caaactggcc aagcaacttc 4296841 tcatcggata cgtcacgtcg ggcaccctga ttctggctgc cggtgccatc gcggtcgtgg 4296901 tgcgcgggca cttctttgta cacagcctgg tggtcgcggg tttgatcacg accgtctgcg 4296961 gatttcgctc gcggctttac gccgagcgct ggtgtgcgtg ggcgttgctg gcggcgacgg 4297021 tcgcgattcc gacgggtctg acggccaaac tcatcatctg gtacccgcac tatgcctggc 4297081 tgttgttgag cgtctacctc acggtagccc tggttgcgct cgtggtggtc gggtcgatgg 4297141 ctcacgtccg gcgcgtttca ccggtcgtaa aacgaactct ggaattgatc gacggcgcca 4297201 tgatcgctgc catcattccc atgctgctgt ggatcaccgg ggtgtacgac acggtccgca 4297261 atatccggtt ctgagccgga tcggctgatt ggcggttcct gacagaacat cgaggacacg 4297321 gcgcaggttt gcataccttc ggcgcccgac aaattgctgc gattgagcgt gtggcgcgtc 4297381 cggtaaaatt tgctcgatgg ggaacacgta taggagatcc ggcaatggct gaaccgttgg 4297441 ccgtcgatcc caccggcttg agcgcagcgg ccgcgaaatt ggccggcctc gtttttccgc 4297501 agcctccggc gccgatcgcg gtcagcggaa cggattcggt ggtagcagca atcaacaaga 4297561 ccatgccaag catcgaatcg ctggtcagtg acgggctgcc cggcgtgaaa gccgccctga 4297621 ctcgaacagc atccaacatg aacgcggcgg cggacgtcta tgcgaagacc gatcagtcac 4297681 tgggaaccag tttgagccag tatgcattcg gctcgtcggg cgaaggcctg gctggcgtcg 4297741 cctcggtcgg tggtcagcca agtcaggcta cccagctgct gagcacaccc gtgtcacagg 4297801 tcacgaccca gctcggcgag acggccgctg agctggcacc ccgtgttgtt gcgacggtgc 4297861 cgcaactcgt tcagctggct ccgcacgccg ttcagatgtc gcaaaacgca tcccccatcg 4297921 ctcagacgat cagtcaaacc gcccaacagg ccgcccagag cgcgcagggc ggcagcggcc 4297981 caatgcccgc acagcttgcc agcgctgaaa aaccggccac cgagcaagcg gagccggtcc 4298041 acgaagtgac aaacgacgat cagggcgacc agggcgacgt gcagccggcc gaggtcgttg 4298101 ccgcggcacg tgacgaaggc gccggcgcat caccgggcca gcagcccggc ggaggcgttc 4298161 ccgcgcaagc catggatacc ggagccggtg cccgcccagc ggcgagtccg ctggcggccc 4298221 ccgtcgatcc gtcgactccg gcaccctcaa caaccacaac gttgtagacc gggcctgcca 4298281 gcggctccgt ctcgcacgca gcgcctgttg ctgtcctggc ctcgtcagga tgcggcggcc 4298341 agggcccggt cgagcaaccc ggtgacgtat tgccagtaca gccagtccgc gacggccaca 4298401 cgctggacgg ccgcgtcagt cgcagtgtgc gcttggtgca gggcaatctc ctgtgagtgg 4298461 gcagcgtagg cccggaacgc ccgcagatga gcggcctcgc ggccggtagc ggtgctggtc 4298521 atgggcttca tcagctcgaa ccacagcatg tgccgctcat cgcccggtgg attgacatcc 4298581 accggcgccg gcggcaacaa gtcgagcaaa cgctgatcgg tagtgtcggc cagctgagcc 4298641 gccgccgagg ggtcgacgac ctccagccgc gaccggcccg tcattttgcc gctctccgga 4298701 atgtcatctg gctccagcac aatcttggcc acaccgggat ccgaactggc caactgctcc 4298761 gcggtaccga tcaccgcccg cagcgtcatg tcgtggaaag ccgcccaggc ttgcacggcc 4298821 aaaaccgggt aggtggcaca gcgtgcaatt tcgtcaaccg ggattgcgtg atccgcgctg 4298881 gccaagtaca ccttattcgg caattccatc ccgtcgggta tgtaggccag cccatagctg 4298941 ttggccacga cgatggaacc gtcggtggtc accgcggtga tccagaagaa cccgtagtcg 4299001 cccgcgttgt tgtcggacgc gttgagcgcc gccgcgatgc gtcgcgccaa ccgcagcgca 4299061 tcaccgcggc cacgctggcg ggcgctggca gctgcagtgg cggcgtcgcg tgccgcccga 4299121 gccgccgaca ccgggatcat cgacaccggc gtaccgtcat ctgcagactc gctgcgatcg 4299181 ggtttgtcga tgtgatcggt cgacggaggg cgggcaggag gtgccgtccg cgccgaggcc 4299241 gcccgcgtgc tcggtgccgc cgccttgtcc gaggtagcca ccggcgcccg cccagtggca 4299301 gcatgcgacc ccgcgcccga ggccgcggcc gtacccacgc tcgaacgcgc gcccgctccc 4299361 acggcggtac cgctcggcgc ggcggccgcc gcccgtgcgc ccgggacacc ggacgccgca 4299421 gccggcgtca ccgacgcggc ggattcgtcc gcatgggcag gccccgactg cgtccccccg 4299481 cccgcatgct ggcccggcac accaggttgc tccgccaacg ccgcgggttt gacgtgcggc 4299541 gccggctcgc cccctggggt gcccggtgtt gctggaccag acggaccggg agtggccggt 4299601 gtaaccggct ggggcccagg cgatggcgcc ggtgccggag ccggctgcgg gtgtggagcg 4299661 ggagctgggg taacgggcgt ggccggggtt gccggtgtgg ccggggcgac cgggggggtg 4299721 accggcgtga tcggggttgg ctcgcctggt gtgcccggtt tgaccggggt caccggggtg 4299781 accggcttgc ccggggtcac cggcgtgacg ggagtgccgg gcgttggtgt gatcggagtt 4299841 accggcgctc ccgggatggg tgtgattggg gttcccgggg tgatcggggt tcccggcgtg 4299901 atcggggttc ccggggtgat cggggttccc ggcgtgatcg gggttcccgg tgtgcccggt 4299961 gtgcccggtg tgcccgggga tggcacgacc agggtaggca cgtctggggg tggcggcgac 4300021 ttctgctgaa gcaaatcctc gagtgcgttc ttcggaggtt tccaattctt ggattccagc 4300081 acccgctcag cggtctcggc gaccagactg acattggccc catgcgtcgc cgtgaccaat 4300141 gaattgatgg cggtatggcg ctcatcagca tccaggctag ggtcattctc caggatatcg 4300201 atctcccgtt gagcgccatc cacattattg ccgatatcgg atttagcttg ctcaatcaac 4300261 ccggcaatat gcctgtgcca ggtaatcacc gtggcgagat aatcctgcag cgtcatcaat 4300321 tgattgatgt ttgcacccag ggcgccgttg gcagcattgg cggcgccgcc ggaccatagg 4300381 ccgccttcga agacgtggcc tttctgctgg cggcaggtgt ccaatacatc ggtgaccctt 4300441 tgcaaaacct ggctatattc ctgggcccgg tcatagaaag tgtcttcatc ggcttccacc 4300501 cagccgcccg gatccagcat ctgtctggca tagctgcccg tcggcctggt aatactcatc 4300561 ccctactgcc ctccccaaac cgccagatcg cctcgcggat caccgtccgg ttggcctccg 4300621 gcatttcacg ccggctcggc cgctggatcc accccgcgcc ggtattcgca gtaacccgtt 4300681 gaatccgcgc gcatgatgca ccgcttgggc gatcagccgg gtggtcacct cgcttgcgct 4300741 ggccgcgctg tcgcacgggg cgctcggtgg taacggacgt cataattaac cagcgtaacc 4300801 gaacctaaga ccagctagct gcggcaatat tggcgaccag gactatggcg ccctccgaac 4300861 ccggccgatc catgtcaaaa cattgacaat gcgtactcac gccgtgtcgg gcgcgctgaa 4300921 tgaccgcatt gcggcgctca ttcggtgcgt agtcgctacc accgcaacaa tgggcttagg 4300981 ccattccttc gttcatcgcg cgggacatgg ccgataacgc agcggtcagc tgctcgcccg 4301041 ccgcgtcgtt atacgcggac gccgcggcct gcgcattgtg cagcgcctcg ttgacccgct 4301101 gagccaccgc ctcggcaccc agcttcttca gcaaaccatc ttcgatgcgc aggccggtga 4301161 gccactggtg cccattgatc gtcacttcga cggtctcggc ttcgtcggtg gcgcggaagg 4301221 atccgttgtt catctgattg agcgtcccgt ctagggccga ctgaaaccgc gccgccagcg 4301281 tcaacgcccg ggcgacatgc gggtccaatt cgtccatgct cacttcgact ccttactgtc 4301341 ctggcgccga cggttaccaa tgacggcctc ggtccatgcc cgatcctcgg tgtagagcgc 4301401 ctcgtcttcc tgctgagaac ccttggactt ggcgccccct tgtccctgat gcgcggcacc 4301461 catcggcatt cccatgccac cgccgcccag cgcggcgccg ccgccggccc ttccctggcc 4301521 taagccggca atgtcaccag cgccagcggg ccgcaccgat tcggcgcccc cgatcgcgga 4301581 tcccaacggc gccgacggca ccccgccgcc tccaccgcca ccgagcgatg ccgctttgac 4301641 cgccacgtcg cccgacagcg ctgcggcttc ccgcccagcc gacgtcagct gcgccgccgt 4301701 gtcagccggg aggccaccac ccggcgatcc ggtaggcgga accatcggtg cggctggcat 4301761 cccggtaccg ggagtcacac cggagccgtc agacggcggc atcaggaagc cagggatcaa 4301821 tccctgctct tgcggaggcg gggcgggtcg atcttgatgg cgggggggag gcttcggcgg 4301881 gtttaccggt tccagggctg ccttgttgtt gtattcggtc agcaccttct ccgacctctg 4301941 ctgatactcc gcgtacaccg ggagaatttg gtcgcgggcc gaagggtttt ccgcgtaaag 4302001 ccgttcgagc ccgactatgt cttcataagt cggatgttcc cgcctagccc acacgtgcag 4302061 ctgcgcgaca tattgagcct gcttggccat cgcagcgctc aatttggcca tgtggagtat 4302121 ccattgccgt tgttgatcga gcgaagcctc gcaagcggta gccgcatcgc cttcccagtt 4302181 gtcaaacccc cggaaccgct tgacgtcgcc ttgcagcgtc aggttgaaag tgttccaccc 4302241 atccgcaaag tgcgcgagcg atgcgccttg gtcgcccgtt tcgagcttcc ttgccgcttc 4302301 tttgagatcc atgaagttgg gttcaccggc cgtggccacc ctcggcgtat cggttagttc 4302361 ggccgaactg tcccctccga cggccccggc cgattctgcc tgcacagttc cttcgccgtc 4302421 gttgtccagc gcggtcgcag cctcctcatc aacctcgcca tacgccttgg ccgcgttgcg 4302481 cagcgaggtc gccagacgct gccgctcttt ggcaccggcc gccaggtatt cccgcatgtt 4302541 gtcggcggac aataccagct gttgggcggc gtttttagcc gccgtgagtt cgcacggtgt 4302601 gatggggaca tcagtcggtg ggtccgccat cggggcctcc acctcgttgg ccctgttcaa 4302661 aatctcttgc tgatccaccg tcacggtctg cgactgcgtc atatcggatc atcctcctta 4302721 gtgctatagc cattatcgtc gctaaactga aaggttcctg cactaatttg atgccgcccg 4302781 ttcatgccgg catcgcgaac ggatcgccct acttcggcag cgccatctgg tagcggcttt 4302841 cctcgggtgg ggaaacccgg cgaatcggca gctgccgatg ccgcggggta ccgatcacat 4302901 tgtgccgcag aatcacccgg tcaataccgg gatgcgggcc gagataggtc gtcgcattcg 4302961 gccacgccac ctttacctcc tgcccgatgt gtgcgccgat caaccgggca aattcctcga 4303021 actgtggccc gactgtgacc atcgcacctg ccgccgccgc acgcaccacg aactgggtga 4303081 atgtctgagc gtcacccagg ttgagggcga tgtcgacatc gtcgaagggc atgtagaccg 4303141 ggcatcggtt caccgtctcg ccgaccagta ccccagctga cccgatcggc agctggcagt 4303201 ggcggttggc caccagatgc tggccttgca gcgcgggccg ctgcccgcca aataggcggg 4303261 cgaagcccct gggtgtcttg ggcttgtccg ccgtggtcag caacaccgtg gactgcgggg 4303321 ccatccccgg cgcgacccgg actctggtga tggtgtggtc cgcgcgcgcc gaccaccata 4303381 catccggacc tccgggcgcc gcgtaggcgg cagtgtaggc atcgcgcccc ttgatcatcg 4303441 accatttctc ccgcacaaag ccgatgtcgg tggcgtggtc gtagtcatcg aagctgcggc 4303501 cacacaccgc gtcgacacca tggctagcca gtcgatcggc aatgcgcgtc gcggacgcca 4303561 ccaaataccg ggccagtcct gcgacgcctt catcgcggcg ctgcgccgat ttgcgggtgc 4303621 gttccgggtc ggcgcgcagc acgatccagg tccggcggtt cgccggcgcc gggtctgtcc 4303681 cgatcacctg ctgatacaga ctcaccacgt ccggcgctgc ggtattgccg acgcggtagc 4303741 cggctgagac gatatcggcc tccaagtcgg gacagtgcac cgacaggagc tcctccacca 4303801 gtccggtgtc cagcatgtcg tcggtgtggg cttgcccgtc gacgatgacc gtcggcgtga 4303861 atggtcgggg aatgagctcg attacggcga ccagaaactc gccttgccag cgcaccgcaa 4303921 cgtgatctcc tggcttcacg gtggccccga ccacaggttc tgacgaggaa tccgggggcc 4303981 gtcggcgccg ccgcaaccac gcgtacaccg ccgccaccca gccggtgatc cggcggccgt 4304041 agaaagtgac cgtggccacg atgacgccca acgaggccag cgcaatcccc gcccaccagt 4304101 agcgcgtctc caagaatgcg atgatgcatg gcggggccaa cgcggaggca agcaaggcgt 4304161 gcccggtgct gaaccgcagc cctaaaggat ttctcatcgg cggctcagcg cccgtctagc 4304221 cagcgcgccc aggcccaggg ccaacgtaag gccgacggcc accaacgcca cagccgtaat 4304281 cgggcgacga tcgggacccg gctccaccac cgggggtgga agtcgtctga cgttgtatgg 4304341 cgccgaagca gggccgggcg gaatgtccca cgtcagcgcg gccaccgcat cgatgacgcc 4304401 ggcgccgacc aggtcgtcga ccccgccccc ggggtgtctc gcggtggcgg tgatccggtg 4304461 gatgatctgc gccggcgtca ggtcggggaa ccgctgccga agcagggccg ccagacccga 4304521 cacatatgcc gcggcaaacg aggtgccggc gatgggtacc ggcccctccc ggccttgcaa 4304581 cgcattcacc ggttcaccgg tgtcgccgag cgcgacgatg ttttctgcgg gcgcggccac 4304641 gtccacccac ggtccgtgca tcgagaacga gctgggcatc ccggtctggc cgataccgcc 4304701 gacgcttaac accagcggtg cgtaccacgc cggggtgaca acggtctgca cattgttcca 4304761 gccgcgtggg tcgccgggtg tggacgggtc cggcgccgga ttctgtacgc aatcgccacc 4304821 ggtgttgccg gccgcgacca ccaccaccac gcctttgacg ttgaccgcat agtcgatgga 4304881 tgcacccagt gaggtttcat cgatcggcct gctcaccttg tagcaggcgg cttcactgat 4304941 gttgatcaca cccacgccga ggttggcggc gtgcaccacg gcgcgggcaa gactgcggat 4305001 ggaaccggcg gccggggtgg cgttggggtc attcgggttg gcttgtgagc cgaccggttc 4305061 gaaggcctca gacgtctgac gtagcgagag cagtcgagcg tcgggcgcga cgccgacgaa 4305121 cccgtcggtg ggcgcgggcc ggcccgcgat gatggatgct gtgagagtcc catgggcatc 4305181 acagtcagac aggccgttac cggcctggtc gacgaaatcg ccgccaggtt ccgccgggac 4305241 ccgtggcgaa gcgtcgacac cggtgtcgat caccgccacc gtcaccccgg ccccggtcgc 4305301 gaacttgtgg gcatcggcca cgcccagata cgtgttgctc cacggcggat cgtggaaccc 4305361 ggaccccggc agcgtggtgg gcgacgcgca caaaacgcgc tgttcggtag gctgatccgg 4305421 gcccgccacg tcgggcggca acgcgcccgg atcgatcggc ggtggcgtga tggccgatgc 4305481 gggcgacgcg gtgagcaacg ccagcgccac cgtgatcaga aagatacggt gcactcccag 4305541 aacactccat tcgttgagat tcattgcgat tcattgagct gcgttgctac cttgggccac 4305601 ttgacggacc tgtgtgcatt ttagacgtaa cggctgggca aacaacgctg tcacgcctgg 4305661 gctggtccgc cgcgccgacc agggcgcgta ggcgctgtac ctggaccacg ccgggactca 4305721 acggttttgc taccgcacta gccgatatgc ggctgctacc aaacgatcgc ggccatgtct 4305781 cggttgtctg agcacacgct gcgtatcgcg gcatcgatgt cggtggcggt gatgatctgc 4305841 agatcctgaa ccgataccgg ttggcccgca cgtttttgcg caaccacccg ggtgtcccgg 4305901 aacccttcgg cgcgttcgat cacgttgcgg gcgaaccgac cgttttgcat agcgtcgata 4305961 ccgtgctgcc cactaggggt ggtgtagtta cggatggtgg tgaccgcgtc gaggaatacc 4306021 tcccgtgcgg cgtcatcgag ctggctggcg cgcggtgtag cgtagcggtg tccaatctcg 4306081 acgatctcca ccggcgaata agactcgaac cgcagctttc ggttgaaccg gccagccaaa 4306141 cccgggttca cggtgaggaa ttcatccacc tgatcctcat agccggcccc gatgaaacag 4306201 aagtcgaatc ggtgtgtttc caattgaacc aggagttgat tgaccgcctc catgccgatc 4306261 atgtccggtg ttccgtcttg atgacgttcg atcagcgagt agaactcgtc catgaaaatg 4306321 attcgcccga gtgacttttc gatcagctcg ttcgtcttgg gtcctgactc cccgatgtag 4306381 tgcccacaga agtccgatcg gcgaacttct cgaatttcgg ggtgacgcac gatccccatg 4306441 ccggcgtaga tcttgccgag cgcttcagcg gtggttgtct tacctgtgcc tggtggcccc 4306501 accagcaaca tgtggttggt ctgcccctcc accggtaggc cgtgctctag gcgcatcatg 4306561 cgcacctcga gttggtcttc cagcgccgat accgcttgct tgaccgccgc caggcccacc 4306621 tgtttggcca gcagttcccg gccctcggct agcagctcgc cgcgccgctg cgctgcattg 4306681 tcgtcatcga gctggtcgcg gcttttcgcc gtcgaagcat cccaacggtc ggagcggctg 4306741 gcgatggttc gttcatcggt aacaatcaag cgcaggttcg ggtccgccag ggcttctttg 4306801 gcggcgtcgg tgagcacccc gttgatggtg gccttcgaca gccagatctg ggccttgtcc 4306861 tcctcatgca gttgccggta caccatcccc cgcacatacg ccaagtcggc gaccagcagc 4306921 ggaatatcgg ccggtccgat cgccgcggtg agcacgtcgg cgccgaaccg ccccgatgac 4306981 ctgctgtgtc cgatcacgtc cacccggtcc agccagtcca gggccactcg cccctgcccg 4307041 agatgggccg cggcgtgggc tgccagcgca caaatcgacg cggtcaccgc cggcatgacg 4307101 atcgcctgtg gcggcagatc ctcggcggcc gtcgacaaca cgtcgggcca tcgctgcgtg 4307161 acgtacatca ggaacgcccg agccagctga tgccactggt agttgcgcca cgaatccaat 4307221 agctcgcggt ttgctaacag ggcatcggcc ttcgcatact cccccgcgat cgtcaacgcc 4307281 gacgacagcg ccagccccac ctgagatgcg tcggtcaccg tgatcccgat ggatggtccc 4307341 agctggacct cagcggccaa cgtccggccg atccgcgtgg tctcgcggtg cagccactcg 4307401 ctatgggcgt tgagctgctt aagcgaggcc agatcgcggt caccgcaggc gatacgaccc 4307461 agccacgcgt cggccatcga cggatcggcc tcggtggcag ccacaaactc aggcaacgcc 4307521 gccacgcatc cctggccatt cttgatcgtc atcgcccgat cgaaatgccg gcgcgcagtg 4307581 agtaaatcac ccatcgtgtc caccattctc gacatcgccg ccgctgtcac cgcggttgca 4307641 acgtgtgtct gtcactctgt gcctcaaatt ccgttggcaa cgttctaccg gcctatcgac 4307701 atcgtgaccg gctcaaggct gacatagcgg ttctccgcac ggaacatttc catctcaacc 4307761 agccagtttt gtcctgccgc accgactttc accgttgccc gatcgatttg ttcgatggtc 4307821 acctcgaagc catgccgatc gctctcggac agcgaggtac cgggtcgggc aatggtgatg 4307881 acactggctg gccgtggcgt gggcgaaatc gcgacatcga caccgctgcc ttcagatttg 4307941 ccgtcatcgc cgttcttgcg ccgccgcacg tactccacga cgccgacagt ggtgcgcggc 4308001 gcgggccgtg gtgtgccgac gatgctcaac tgcggcatgc gtacgctggc ccaacgctct 4308061 tggtcgcgag tgtgcacaca cacccgctca ccggcaccga cgacgcgaat cacgatcctc 4308121 ttggcgatcg tgtcgtccgc ggccacgaag acgcgcgaca gctcaccggc gtcggtaacg 4308181 ggaatcatca gccggtcccc gttgctcagc ttgccaatca acacccccga cggtccgatc 4308241 tcggtgacta gctgcgccgg caacgggcag cgccgctgtc cgcgtaggtg tggacgtggc 4308301 ccgcacatgt tggccgcagc cgcggcggct tgctcaccat tgagccgacg caagatcaca 4308361 ctgggcgggg taggcgccgg cgtcggtgtg cgcacggtga tggtcgcggt gcacgtcgcg 4308421 tccgggtaca ccgttacgtt ctggatgacc tcatcggcac gcagcgtcca ggcttgcgag 4308481 agaacccgcg acgaaatcgc ctcagccggg tacgcatacg tcgtcatcca cccggcttca 4308541 ccgcggatag ctttccagcg ctgcgcactc ccggctaccg cgtccgaccc cagccggcga 4308601 tcaagctcag ccaagtctgt tgcggtggcc agtttggcgc gcaagccctg acagcgcagg 4308661 gagctggcaa cgcgttgggc gaccgaaatg gcagcggccc caacgctggt acgccagcgt 4308721 aaagcttggg tgttgccgat caccggaagc cgcatgatca gccacgtttc gcgccgcccg 4308781 gcatacggcg gcgtaccgat ctccgcgtca tacacccgcg ggtaatcgcc gacggtgccg 4308841 gttcgcgagc cgaaggtgac gacgctgatt gaatcgagtt ccaggtccag cgggtggcgc 4308901 ggcaacggcg cgagctcaac gacgtcaatc acgttgtcgc tttctacggt caccgacccg 4308961 gtgaccgtag tcgcccggtg cgctcggccg agaagttgca ccgccaccac cgcgacaccg 4309021 tcttgcacgc ggacgccacc cccggatcgg ttgttggcca aggtaattgg gtcattccat 4309081 ttgacgggac gccgaccccg cagccccagt accgcccacg accacgccgg ctgaccccac 4309141 cactgtacga acaccaaggc gacgccgacc acgacagcca tgaccgcacc tagctggccg 4309201 cccagcgccc agcccgccga cgcgagcacg aacactgtcc acaccccggc gacccgcctc 4309261 gcactgcgcg ggctgaaccc ggtcagcttg gacgtcaacg cgccctccgt agccgagccc 4309321 cgattgccat tgccagcaca ccggtggcca ctgcgccgac gaacccgata gcgatattgc 4309381 gcgcccggtg atcgggcgga gggggtggcg cggcgggcgt gatcacccgg ctctgtgcac 4309441 ccggggccat ccgatcaccg gatgggatgt taaacgtcaa tgcggcgacc ggatccacca 4309501 gcccgtaccc cagtttgttg tccacgcccg caggcggatt gtgcgccgac tgcacgatcc 4309561 ggttgatcac ttggtaggca gtcaactcgg ggaatttggc ccgcaccagt gccgcgacgc 4309621 cgctgacgta ggccgccgaa aagctggtgc cccagaacgg catattcttc tcgcctggcc 4309681 gcgacggcgg gtaggcattg accggtccgc cgccttgtgg cgatagaccc atgatgtggg 4309741 ttcccggtgc cgcgacaccg acccacggac ccgacatgct cttgtccagt gcggcgccgt 4309801 aggcatcgac ggcacctacc gacaggacgt aatcagagaa ccatgacggt gacgacacaa 4309861 ccgtgacctg atgccagtcc cggggatctg acgggtccag cgggtcatac atcgggttgt 4309921 tgccgcagcc ggcctccccg tcgttgccgg ctgctgccac gatcaccgca tccttgacgg 4309981 tggccgcata ccacagcgcg gcgcccagca cccgctggtc gcccggagcc gccgcaggca 4310041 gacatgcggt caccgaaatg ttgatcactt tcgcccccat gttcgccgcg tgtaccacgg 4310101 cacgcgccac cgagtcgagg gtgcccgctt tgactttctc atcggagttg ggacccgccg 4310161 acgacgggtt gaccggctcg aaggcccgcg aggactgccg aatcgagatg atggtcgcat 4310221 gcggggccac ccccaccacc ccgtccgggg cgcccggcgg gggtgggggc accgcgggtt 4310281 cgtcttcggt ttgcggatcc ggcggtccat tggacggcgc catggcgccc gcatcctcgg 4310341 gtggtggtgg cggtggcgca acggtttggg tgatcgtcac cggcggcggc gggggcatcg 4310401 ggggcgggac ttctaccggc ggcgccggcg cggcggtgac cggcggcggc ccggccggcg 4310461 gtgggaacgc cgcggtggcc ggcatggccc ttggcatcgg taaaatccca agcggtgcag 4310521 cggcaatgat cgaactcacc accgtgccgt gcgcgtcgca atccgatagg ccgtcctccc 4310581 ccatgatgta gtcgccaccg ggcaccaccg gcagccgcgg gttgggactg acgccggtgt 4310641 cgatgactgc cacgggcaca ccgttgccgg tgctgtactg ccacgccttg ctgatgttga 4310701 ccaggttgaa gcccggtgct agctgcgcca cgtcgggatt tcttacggtg atcggtgtgg 4310761 agcagctgtt ggagcggcgc atgggctgat caggtccagg ccgcgcgtct gcaggcacca 4310821 tcgccggatc taccgacggc ggtgggatag cctgtgccgc aggaacatta gctgacaaag 4310881 caacgagggt gagggcggcg ctcgcggccg cggcccgcag gccaggtcgg tttagtggcg 4310941 aagccatgca aacagccccc ctagggccgc agccgctggt aacaacgcga tcatggccag 4311001 cacttctagc cattccacgg tcaaccggat gatcggccta aaccgcgtcg ccggtaccac 4311061 gagggccacg gccaaaccca atgcggcgaa agccgcgacg aagatcgcag gccaaagcag 4311121 cccggtctga acacctttcg gggtgtcgag ggcgtactta agcaccccgg cacacaccgc 4311181 ggcggacgcc ccgcacacca atgcgaccgc ttggtatttg gcggcgaacc cgcggccctg 4311241 ggtgatgaag aggcccaccg tcaagccggc aaccaacaac gccaaccagg cccacggttg 4311301 acgtggcgtc agcacccccc ataccgcggc gggcagtacg agcgacaccc cgacgcacat 4311361 acccacctgt accgcgttaa ccagccgcgc cgacgcggcg atcgcggtgc cgcgggcggt 4311421 gatgccggtc agttcattgt cctcatcgtc ggcgtcggct tcgctgaccg gagccaccgt 4311481 atcgacgggc attcccgcac ggcgcgcgaa cagatcccgg ccggtgatcg atccgaagtg 4311541 cgggggtcgt acccgtgcca cccacaacgc aacggtcgga gtcatcctga tcaggacaag 4311601 cagccctacc agcacgcaaa tcgccagcac ctgcatcgaa accggcctaa acattcggac 4311661 ggcggcgaca gcggcaagga tcccgcacac cgttaccacc gcggtgacca ctgcggtctg 4311721 ccaccgcttg cgggtcgcca cgccgatcgt gatcgcaccc agaaccacca ccacgagccc 4311781 gatcagcgca tgagccgcca ttcacggacc atggggtcgg cattaccggc ctggtccaga 4311841 gcccccaccg ccatcaactc ggcggccacc gggtggcgaa gtgcccgctc ggcggtgtcc 4311901 aaccgcggca acaatggccg taatcccagt tccggacagg tttgttccac cccggttacc 4311961 gcttgtagta cccacaggcc atcgaccgtc gtcgtcagca tgtcacgttc atctcaacca 4312021 gctagcagca agcagaaggt ggggcagacg cgcggtccgc gcatgtaccc caccttcact 4312081 cggcccacgg ccggctttag aacaagcccg cgatggcctg gtcggttccg atcgcgttgt 4312141 ccagcacgtg gccggtggta gtcccatgct gacccaccgt ctcaatgagc ccctgcagcc 4312201 ccgacagcat ctgcgcctgg cgtcgaaaaa cccttgcgcg ccgtggcccg cgaaaaactc 4312261 ttgcagcgca tttgttttgc tggcggtgtc ttcgtaaatc atgtggagct ggccggcgcg 4312321 cgagcccacg tcggaagcga agtcggatac ggctcccggg ttatacgtga tttgatctga 4312381 catgtgaaat tcctttccga ggcgtgaaac gagttgggtc aggatccgtg gctagcgccg 4312441 aacagcgcct gaaacgctgt ctgcgagtcc gcctcgtgtc cctccatcag ggctgcggcc 4312501 tgcacgaggc cctcggccag gcgcgtgccc ccggtaagga ccttgttcaa ttcattggtg 4312561 atctcggtgg ctgtcatatg cgaagcaacg acgccggcac cagaccaggt ggcggggttc 4312621 atgacgtttt cctggttggc taggtagccc ttggcgattc ccatggcttg ctccatattc 4312681 gcctggatat cgttggcggt gctgcgcagc atctgcggtg ttacctgaat tgtgtctgcc 4312741 acggcccttc tcctttactg ccgttagccg ttccccctca aatatcgggg catgacgcga 4312801 agtgtatggc tgctctgcgg acctgtcgat tcaccctgtg cccgagctag atctaccgcc 4312861 ggtcatcgac aacacgcacc gtcgcggcct gctcggactt gccatgggat ccgcggtgac 4312921 cccccgccgc atgtccgacc ggcatgcctc cgataggggt gccacccacc gtggtcgtcg 4312981 gcgcgcgcac gacgtcggcg cccaaagccc cgctgggccg caacccgacc ggcctgccgc 4313041 tggtgcccga ttcgaaagcg ctgacgggac gtgtgaagct cgtcgctggc ataccgccgc 4313101 cgcccagggc ggcgcctccg ccgcccgcac cgacctcggt ggccgcggcc gatatcccac 4313161 cggccgccga agccgccgag gctcccggcg ccgccccgcc catgcccagt gcgcccgggt 4313221 tggcgaacat gcccaccatc gattgcagcg gctgcatcgc gctcatcggt gcctgcatca 4313281 gtcccgacgg cgcctgcagc gcctgcgggg ccgcttgcat caccgcctgc atcggctgca 4313341 tgaacgtgct gagctgattg ccgaagttct cacccgccga cgttgactga cccgcgccgg 4313401 ttgaccctgc ctgcacgccc tggtaggcgg aacgcatccc gtcaccggcc gcggcctcgg 4313461 cggccgcctg gccaaccgcc gccgcagcct gtgccggagc ggccggagat gcacccatgg 4313521 tcgcgaccgg cggcggaatt gccagactct cggccagcgc ggcgagaacc cctccgtagg 4313581 tggcgcccac cgcggcgtta ttcggccaca tcaccccgaa atactcgacg tccaaagaga 4313641 cgattcgagg cgttagtgtc cacagcacgc tggggttgat ggcgttgtcg acgccccatt 4313701 cgtcgcggtt ctccatgcac tcgggggcag ggcgcatggc cgcgttggcg gtctcaaacg 4313761 ccgcgatcgc ggtcgatacc acggccggct tcacgtcgac ccagccggcc agtccgtgca 4313821 gcgtggcgtt gagcatggtg acgttaagcg ccgaggccgc cgacccgaca cccaaccagc 4313881 tcgccgcggt ggcggcggtg ttgatcgccg acgcgacacc cgaggcgtgg tggctggcgc 4313941 ccagtgtggt ccacgccgtt tgattggcca gatgggtgcc cacgccggtg cccgccttga 4314001 gcagcaggtc gttggcctca ggtgtccgcg cagcccatcc tggatcgggc atgcctactt 4314061 acccttgcag cgccgatgcc gccgcgcgcg ccgcttctgt ggtcacgtac acgcccgacg 4314121 cgaggccctg ctaacccgcg aacaggccgc gctgactggc gtgttcggca acgacaccca 4314181 ggtagctggc accgcacgcg ttgagcgctg cggagaacat cgcggagtcg ggatcaccac 4314241 ccatcggcgt ggtgctaagc agggctggcg ccgctccggc ggccgctgcc tcggtttccg 4314301 cactgatcgc cgactcggca gctgccgacg ccagcactgc ttctggttgc acagaccaaa 4314361 ccatgttcgc ccctccgatt gcttctgcaa tgcgtgatgg tcgctgagtg taatgcgagt 4314421 cggccgatcg cgtatgcgca aatcagtcgt ctgcaccgat gccgacgtcg aactggtcca 4314481 cgccgcccca tgtgttccag catgtcagcg gtacgtgtgg ggcggatgtg aaatctgcga 4314541 cgcctggagg atacgcgcgg gtgtcactga acccgtacag cgacatggtc aggcagaaag 4314601 tagccatgcg cgctatcttg cgatccggcc ctactgctcg ccgggcaccg acggataccc 4314661 caccaaaatc ccctcgacgt cgccgtcggc accgaccaac agacctcgtc caggcggcaa 4314721 cgtttgggct cgcaccgatc gattgattcg gttttgcgga tcgttatcca tatacaactg 4314781 ggccactttc gccgaggtct gggatttcac ccaggggtcc atcggcatcg tggcccagtt 4314841 cgcgctgttg cgcgtgctga atacgtgcaa accgacctgg cgggcgcgtt ccatcaactt 4314901 ccacagcgcc gcacccaccg gcggcttctg tgggtagctc tgagccggcc gcaggtcctg 4314961 cacgtcgtcg atgagcacaa agtgccgcgg tccttcccac ggcttgagtg cgcgcaactc 4315021 ctcctggctc aaacccttgg gcggcaaccg cggcagcaag atctgctggg ccaactcggt 4315081 gatcacctcg tcgatttcat cttggtcgta ggcatacgcg cgcacatacc caggggcgtg 4315141 cagatctcgc agaccgtgcg gagccgtttt agggtcgatc agcgtgagct gcgcctgctg 4315201 cgggctgaac cggttcatca ccgcctcgcc gatggccacc agcgccgtgg tcttgccgca 4315261 gccttgccga cctaagatca tcaaccctgg gctctcgcgc agcttgatcg gcaccggacc 4315321 cagctcgtgg cgctctccga tcgcaaacgc gatcgacaga tcgtcaccgc cctggtggac 4315381 ggcctcgtgc tcgacaatcg ccgacagttc cacccgctgt ggcagccgct gcagacttgc 4315441 gtgcttggtc accccggcca cgtcggcgat tcgcgccccg acatcggtga tgcccaccag 4315501 ctcgccggta ccggggtcgg ccagggccgg aacaccgatt cgcagctcgt gcaggctttc 4315561 cgtcaaacca aatcctgggc ggttcaacgt ccgccgcgcc gcctcccgcg attcgatcga 4315621 caaatgcccc atctggctct caccgggatc ggccagccgc aactgaattc gcgccgtgac 4315681 attctgcagc aggctctgcc gctgcccatg aatccagccg ccggcactgc acatcaggtg 4315741 caccccgtat tcgggaccgc ggctgctcaa cgagatgatg cggtccccca acagggtgtc 4315801 cttggcgtac aggtcgtcgt agtcgtcgag caccacaaag acatcgccga acgcgtcggt 4315861 gggatcggtg ccacccaccc cgtcgccgcc gatcccgaac cggcgctcgc ggaacccgtc 4315921 catgtcgatc ttggctcgcc gaaacgcctc ttcccgcgca tcgatcagcg catccatggt 4315981 gctcaagatg cgttcgatgc cctcggcatc cttgggcgac acgatatcgg taacgtgtgg 4316041 aagcgaccca atctgggcca tggtcgcccc gccgatgcaa aagaacgtca ctcgctccgg 4316101 ggtgtacatc gttgccgccg aacacatcag cgccatcaag gttgtggtct tgccgcgctg 4316161 cttggcgccc accacgatga tgttgctgcg tagcgcgtcg acggcgtgta ccacttgctg 4316221 ggattcttcg gggatgtcca tcactcccac cgggaacatc agtcccgggt tttgaccgta 4316281 gtcgacatgc cagggtttgc cacgatacgc agccaccagc ctatcgaccg gctcggggtc 4316341 ttccagcggc gccaaccacg gccggcgcgg cgatcggtgc ggcacgttgt atagcgactc 4316401 ccgcagcacg tcgacgatct tcttcttctt gaaaccgtcg tcgtaataga ggaattcgtc 4316461 gggttccgca tcggcggccg cggcggtcgc caatgcctcg gcgtcggcgg gcatccagcg 4316521 gttggtactg ccagtcgtac agccggggtt gggtcaacgt catgtcgatg gttcgggcca 4316581 cctctttctt cttcggcacc acaaacggcg cagagaggta aaagcagcgg aacggttcca 4316641 gatcccgcgg ccccaccttg agcagcgcga aaccgttctc cttcgacggc agatggtagg 4316701 cggcgtcgct gccgatcact tcgcggctgt catcaccgga ttcagcgcgc agcgcaatcc 4316761 gaaacgcgat gttggacttg accttttgca gcgacgacag gtccagccgt tgaccgccta 4316821 gcatgaagaa gacgttggcg ccgcgaccct cctgaccgat gtggatgatc agatcaatcc 4316881 actttttgtg gttggcgaac agctccaggt attcgtcgac gatcaccagc agcaccggca 4316941 ccggcggcag atcgcgtccg gcgaggcgaa tctcttcgta gtcgttggcg tcgcgcgcac 4317001 ctaccgattt gaacagttcg tagcgctgtt tgatctcgcc gtcgataact ctgcgcatcc 4317061 gctcggccag atgccgctcg tctttgccga ggttggatag cgcggccacc acgtgcggga 4317121 tgcccaggat gtcctgggca gccgattcga atttcatgtc gacgaagatg acgttgaatg 4317181 tttccggtga gtgcgtcagc gcgatcccat agaccaacga caagaagagc tccgacttgc 4317241 ccgagccgct ggttccgatg accactgagt gaaacccgaa gccgccaaag tccttggcgc 4317301 gcaggatgat gttctgcagc tcgccgttcg gtttggcgcc caccggaatc tcacaccacc 4317361 gatcgtcgcc gcgaccgcgc cgctcggccc acaaccgatc gacatccaat tcccgggggt 4317421 cgctaatgcc gagcgaacgc agcagctcgg ccgcgccgct ggtggaatcg gtgacctcgc 4317481 tgcgactggt cggtgaccac cgcgccatcg cccgcgcata tcggtaggcc cggtggatgg 4317541 acagctggtc ggcatgcgcg aagaacgtgc cgcgcgctcg caacagcggc gccgggcgct 4317601 ggtcgtcatc ggcgtctgcg ccatcgcgac cggccttgac cgcggttgcc gccccatgtc 4317661 gttgggccat ctcgaagacc tggtcctcgg cgaaccccac accggtgccc acccgggacg 4317721 cgatgcgcag caccgtaagc ccggccttgc cgacctgccc gaccacgctc tcccacgcat 4317781 ccgggctgcc ggtgttgtcg tcgacgatca ccaggtgcgg ccccaaatcc acgccgacct 4317841 gcccggtttc cagcgccgag cccatcgcgg ttgggctggc caccgtcggc ggggtccatg 4317901 cgcctcgctt gcccttcata tgcagctcgg ctcccagcgc cgcctccagt tcctcgggtg 4317961 tggcaaagat cagccgccgc cagccgcagg catcgaacag ctcgtcgtgc aggttgtggg 4318021 ggagccacac catccacgcc cacacctcgg ggttgcgcgt caccaccatc agcttgacgt 4318081 cacgcgggtt gtgaaacacc gccagcgagc acaacaccga ccgcatcagc gaccgcaccc 4318141 ggtccaggtc ctcgctcacg aagctgaagc ctggtgccga ccgtaggttc accaccttgg 4318201 cgatatcgcg aatcttgcgc tgctccaaga tgaaatcgcg cagcgcctgc ccggtcacgg 4318261 gctctagctc ctcatcggag gaaatgtccg gccaggtcac cgacaacacc gaatctggtg 4318321 cgtgctgcac acccgtgccc acccgcacct ctaagaagtc gacgtcgccg cggccacgct 4318381 cccacatccg cggaccgcca atgatggcgc ccagtccggg tgggtccgaa tgcacggcgt 4318441 tctgccattc acgttgcgca cacaccgccg tctggatttc gtcgcggttg gtgtccaggt 4318501 cacgaagata tcgacgacgc cccttctcca actcacccca ggtgatcttg cgggctcgac 4318561 cgaatcgtcc ggagaacgcc agcatgctga acgcgccgat gcccatcagc gggaagaacc 4318621 ccgtggccaa gctgcgcacg cccgacacgt acagcatgac gatggtgccg atcagcgcca 4318681 cgatcaacgc gggaacgccg atcatcaccc agatgttgcg cggctcgcgc tccggcagag 4318741 ctatcggcgg attcggagcc acccgaacgg gtttcggcgg gtcgatgttg acgcggttga 4318801 tgggaaacgc tttcttggac atctaggcgc ccgccttcgc cgttgtcgtc acaatggcca 4318861 cctggccgag ggtgggcaca gtatcgcgag ccaagagtgc cgcatcccgc gacagagccg 4318921 gtcccgcagc aaaagtccgc agcaacggcc acggcgcctg cacggccgca cccggatcca 4318981 ggcccagcgc ccgcagcgtc gcctcgtcgt tggcgatccc gaatcgcacc ccattgccgg 4319041 acacccagaa caacgattcg cgcgactcgg cggtgatcac accgctggtc gatgtcacga 4319101 agttggccgc gccgggcaac accagcacct gggtggccac caccgacgcc ggggcgcggt 4319161 catcgcgtac cagccgcacg atccggctgt ccatcgacgg gggcaccgga agcccccgcc 4319221 cgttgtagac cgcgacccgg gcctgtggac ccgtcgacgc cttctcccac gacacgcagg 4319281 tggtcggatc cgccgcggtg tcaacgaaat tcagccgccc ggccgggtag tactccaccg 4319341 gcagcgaggt cacctgcggt gtgtggacca gcacatcggg ggtcaccacc cgcggcgccg 4319401 ccgccccgta ggagttcgcg ctgcgcagca gatcggccac gaagctgctg atcttttgca 4319461 ccccgtcggg cagcagcaca tagaactggc tgcccccgcc ggcggtttgg gcctgcaaca 4319521 ccgatcccac ccgagcgccc ggcacccacg tcgacggggt gcccgcctcg ggcaccgctg 4319581 gcacccgcag cggctcggtc gcgggcagcc cgtcgaagag cgcccgtgag atctgtattg 4319641 gtgatgtcac gccggggtcg agccccaagc tcaaggtgac cgccctgttg gtcggatcga 4319701 tctgtgagcg tttgccaccc cagatcacgt aggtgctgcc gtcgaaagtc accagcagcc 4319761 cggcgtcgtc gcgcaggtgt gtggcgcggc caccgccggt gatcgggccc gcgatcgagg 4319821 tgaccaccgg cttgtccgcg ctgcgcgggc gtcccgccgt gtcgcacacc gcccacgccg 4319881 agaccgcgcc ccggttcacc ggcatggccg cgggtgcgcc cgggatgccg accagcggcc 4319941 cggtcggata cttggcgatc tcggcgggct tgacccatgt cggctgcccc gccgtgccgg 4320001 tggccagccg cgcggacgtc aagttcagcg ccggatacaa ccggccgtcg atgcgcgcgt 4320061 agagtgcccc ggagtcgcgg tccccgatga tcgccgagtc acccacaatg ccggtgggct 4320121 tgagcacgtt gagcagcatc atccatccgg cggcaatggc caccaacacc atcgacaacg 4320181 ccagcgcggc ggtctgcttg cggtcgtcgt gtttcatgcg caccgagaac cgggtggtcg 4320241 ccgcccgcag ccgccggttg tagaacagat gaccggaatt ttggtcgcgg ttggacaaac 4320301 tcagcggcat tctcaatacc ccctggggct gcggcgagga tcggcctgct ggatcagatc 4320361 ggccagattc gatgcgtcgg cggccacccc gtagtggcct ctcgcgtagt tgatgaacgc 4320421 gcacgcctgg gcgaccaaat cgtggatgtt ggtcgacgtg cccggctcgt gatggccgcg 4320481 aacgtcggag cgatgaactg ccagacgccc ctgctggggg tgccccgcgc ggcgttggaa 4320541 tcccagtggt ttatggcgtt ggcgttgtag ttcgattcgc gacgggccac caggtccatg 4320601 ccgcgagtcc agcgtgcccg cgcggctgga tcgtgaacgc ctttgatatc caacgctttt 4320661 tggaccgccg ccaacacttg ggcgcgtcca ccgggtgtgg ttacctgggg tcgtcttgcc 4320721 gcggcggtgc gcaggtagcg cagccggcgc agccgcagcc ccaatagccg cgcccgcgac 4320781 ctacatcgcg cgatgtgccg atgctgagcc cgcagccggg cggccatccg ggccatcgcc 4320841 tcccgccggc ccagcggtgt gtcggtcaag gccatggcat cggtcttggc ggcttccagg 4320901 agtgcacgcg tcgcggtcct ggcgtgcgca tgatcgatct gggctgccgc catgatctgg 4320961 gccagcgcct caccggtgtt ggccaatcga cgcagcgctc ttgcggcgcc gcgccagcgg 4321021 tacgccgctg cggtgggaac ggcgttggcc acccaggata tcgcgttcgc atactgctgg 4321081 atctgcggcg cgtcgatgtc ggcgccggaa acgccgcccg cgaacaggcc gtggccccgg 4321141 gacagcgccg ctatcgcctg cgtggtcagg ggatcagtca agggttcgcc ttcggtgcca 4321201 atcctgtgcc atgtgctcac atccgttgcc gggtgccatc acctcggccg taccgaccag 4321261 accacgacct tgtcgattgc ggccggtgcc cctgcgccac gaccatactg ccgttgacgc 4321321 gacccgacta ctcagttgtg gcgcgaaggc cgacctcgcc ccagggcgac tattccttaa 4321381 ccttgtcgtc gttcggcaca acaaggatcc gtcgcgtcga cttggtgacc accggcttgt 4321441 cgtcggcaga tttcaccgga acactcggcg gtactgttaa gcggcccttg accggttgac 4321501 cattcggcac agccggggcc gtcacccgct tctcgaccgg cttgtcctta ttggctcctt 4321561 ccgcgcccgc acccaacgcg cccggcggca ccatcggcat gccggtcatg ccggccggcc 4321621 ccgacgcccg cggggtgcca ctaaccgggt ccggcgtcac cgacttggcc ggcgccccgg 4321681 ctggagtcgt cggtggcgac gacgtcggca cgggtggggg acccagatag cccgtcgggg 4321741 tggtgccacc acccccgccg ccggcgccga cgtcaccagc gcccggctcg ccgccgaggc 4321801 cgggctcacc ttcgatgctg tccaccagcc gcgccccgtc cgcgacgtcc agtccctccg 4321861 cgccataggt ctgttgaagc gcactcatca gcggctgcat tgctccctgc cccgcttgca 4321921 tggcctgctg gggaagctgc gtgagtgggc ccatgacgcc gccgaccgcg ccgccgagcg 4321981 cgccggtaat gcccgacacc gcttgttgca tcatctgcgt cgccccctgg gcctccgcct 4322041 gagcgcccac cccctggaat tgttgggccg catcggcctc attcgccgag aacttttgca 4322101 cggcatcggc cgcatgcgcc cgccgatcta ggtcctccag gccgctggac tcgacgtcgc 4322161 cgggcacacc ggaattaccg gcggcgaaaa gtgccccatt agcgatgtcg gcaggtgcgg 4322221 gtaggtctac ggggacagcg ggaaacggcg ccggtccgga cgctggcggc gtcgtcaaaa 4322281 cctgcaataa gatttccggc gtcaccttga tcggaactcc cggagccggg ccgggtgccg 4322341 gattctgatc tccggtcatg atcacacctc gaacttcatc ggtagcgccc cttcggacgc 4322401 tctttcgtgt gcttgtcgac attggccgca gcatcgccat tttgtcacgc cgcgcgtcga 4322461 ccggtattca gctcacggtg tcgggcctcg tatggtgatc agggagtttc gggcagcggt 4322521 tcaggcagcg aaccctcgtg agccgccacg cctggtggtt acggcatacc aggccaggtg 4322581 atagttggcg aggtagtcct gctcgtcgat cagtgcctcg atcgccgcca gcagcatcca 4322641 atccccgaca gcggtgagct cgtgactggg ataggccttg agcaccgact ccttgaccgc 4322701 ggtgatgcag ccgtgcagca gctcggcttc gttttccagc acgccggttt tgcgtaccgc 4322761 cggcagcgcg atcgcctgcg cgatccgcgg caggctatcg cggcggcgca cagcttcgac 4322821 caaggtcggc ccgaactcgt ccaccttggg tatcgccgag cgcgctgacc gatcaccggt 4322881 cagcgcaggc gcatctggcc ccggctcggc aacgtaggtg ttggactcgt gggctgccac 4322941 ggcgacgacg gcgcccagca agtcgatcac gtcggcatca cggcgtcgcg cggttggctc 4323001 cagcagcgtc acgttcgcgg gcagccggac gtggggcgga atccacccgc cggccaaatc 4323061 ggtgaccagc agggtggtgg tgccgtcgtc gcgcagcccg gccgcccatg agattcgcgg 4323121 ctcctggcgc gccacggcat ccacgattcg ctgtaggcgt tgctgctcag ccgcccgagc 4323181 cgataccgcg cccgccgtcg cgccggcggt ggccgacagt gccgaggcgc cggccattgt 4323241 cgacgagctc gcaccagcct gtccagccac agctttcgag gctgcgcgct ccaccggaga 4323301 aaccagcgcc ccacccgccg atggggccga tgacgccgag ggcgccaccg gcgcgccgga 4323361 tacgggcgcc gtaggaaccg agggcacggc gggggctgcc acgacggggg ccgtagatca 4323421 gagccgtaag ccggcagcgg tcccgcgggg acagccgagc caccaaccac cggcgcagcg 4323481 ggtgccgcca ccggcccggc ggtcaccacg gtcggcgcga ccggcccggt agtaccggtc 4323541 gacgccggtg gagcgcccgg ggtgttcgcc ggcgtgtcaa ctggcccgtg tgtggcttcg 4323601 atgcccgcag ccatggtcgg cgcagacacc acgggcggtg tcgttatcgg aggagtggcc 4323661 tgcggcgggg gaaccgaccc cgactgcatc gccgtcattg ccccttccga cagcgaatgc 4323721 gcgccagccg cggccggttg ccccgtcacc atcccggtcg caaacgattg cccaatcgac 4323781 gtaggcgata cgccctgtcc gagggccgcc ggcgacagcg acccgccggg catcgccacc 4323841 gggggccagt ggcgatgact ggcggtgtag cagcggcggg tgttgtgacc accggggcag 4323901 gtggtacagg tcctgcgctg gcgtgccgac tcgaagcgcc tattgctcgc ggcgccggtt 4323961 gtgagcaggc ggcctggaca ccactgccag aaaagccgcc agcacccacc gattgtggcg 4324021 atgccaggtc tccggcgccc tcaacactcc caaagctacc gccacgcgcg ccgggaccgg 4324081 tcagcgctgc cagatcgttc tccctgatca ggcggggtgg tggtgcgtcg tcgacgttga 4324141 aaccattggc ccgtgcccac gtccggggat cgtcaccgat atcttcggct tcgaggatct 4324201 cctgcatggc cgtcatgacc ttgtcgacgg cgtcccggga tgcgttcgcc gcatcggcat 4324261 tgcacctggt ttggatcgcc tggatttccg ccaactgctc cggcaacggc tttttcgacg 4324321 caagaacgtc gtcgatttcc ttatttcctt cccctgcaat gccggtcaac cggctccgca 4324381 aatagtcgat ggcgtcagcg gcggtattga aggcgccctt ctttatttcg tacttctctg 4324441 ccttagtgac ctcggatttc gctccccgaa ggtaccggcc aatcaggtct tcggccgttc 4324501 taccctgatt ccgcaacaaa agatcatgtt ggctgatcag attccttgcg agctcttgct 4324561 tttgcatggc ccaagtggcc cagtgttgcg cggcggcacg cagggccgcc gacggggccg 4324621 gccaccacgc ccccaccagc accgcgctcc acctaccggg cggaagatca gccgccacca 4324681 catacctgct tcatagcagc atctttcacg ttgccgtcgt caagtgcagc ctgccactca 4324741 gcttgagtgc caccactcgc cattacgatc gttgtgcggt aagcggtcgc tagcgcgcgc 4324801 ggcgtcgcgg tgtttggcat ctagggcggg atcggctgct gcattgtcga ggattgccgc 4324861 agcatttgtg agcgtgatgc gggccagtgc cttatcgctc ccgttcgtgt caactggtac 4324921 agcatgggcc accagcttgt atgtgtcgca caattgccgc tgagccgcgg cagtctgggc 4324981 agcggtgtag gtaggcaccg aggtcgtagc cggtgtagcc gcgggcctgg cgtttgtcag 4325041 ggccacgatc agcgcagcga ccgccaccac agcagcgatc gcggccacca cgatggcggg 4325101 ccaactacgt gtgcgtggta tgggccaggg tgctggcgcg gtcacgccgc agatggtatc 4325161 cgctgaccgc ctgtttgccg cttgcaccag accacaccaa cccggacacg ccgcggcgga 4325221 tgcgttacgt caccggtgac cacgcggtgc aggtgttcca actgaccagc accgttatcg 4325281 atctcaccac caagcgcaaa cacaccacgg tcgtgtacgc ggccacctcc atgtcgggaa 4325341 cgccacccct gcacaggtag cctgctggtt gctgggtcat tgcgccatgc cttcgagaac 4325401 aaattgcatc ggatgcgcga cgtcacctac gcaaaacccc tcccaagtcc gcgctggtca 4325461 gggccccaag gttagggcac ccgcgcaaca gcgccgccgg cccgctccgt atcgacggcc 4325521 acgacaacat cgcgtccacg ctacgccgca atgacgcggc cctcgccagc ccgttaaacc 4325581 atcacagtcc tgttgaaacg ccattttgcc gaggccttgg gcgcatcgcc ggtaagcgct 4325641 gctgaccgcc cggctgaaat cgatgagcat cactatctta tctactgttt tagtatgcgg 4325701 attgtcgcga caatggcatc gcacgagaaa cgtcaaccta acccttatag tccttccaaa 4325761 gggtgaataa gggcttacct tcgctatcca ggaaagaatc ctttatcacg ttgacagata 4325821 cgtctaggta atgtgacaat tcaaccagtc gatcagccgc ggtaattgca acaaccgttc 4325881 cactggaatc gataagggta tcccgttgcc gaccggcaaa cgtcatcgtc ccgatgctat 4325941 attccggcat cagctcttcg ggctgaaaag gtgcccggat cgccggcagt tcacgctcgc 4326001 ttcggacgga gcctccgaaa tacccgtaaa gatatttttc gatcacggac attgatgcag 4326061 cggcgaattc atagccttcc cggctcatgc gatcggatga cgtaataaca taccatccgg 4326121 cgagccggtc gataaagtag cggacttcac cgcccttgtt ccaaaggata gtccggccgt 4326181 cattcgtttc cgacccttgg atcatgttca tgccagataa gcggatccag tcctgcaaat 4326241 ccgttgacag gtccacacct attgtcactg tcgcaacacc ccgcgcctta ttaactcttc 4326301 cactttgcgc atctcgttct gatgatcgaa tatccgcact tggatggatc cgcccggctg 4326361 gccgcacccc ggcgcgacct cagatacttc gatgaaccat ccctcaggca accaatcaat 4326421 ggtatacgcg tggtaggggt cgcgtaacga cgtcacgtgc agggcacgtt gttcccatga 4326481 tgccgggcgc ccatgttcca tgatcgccag gtacttgccc tgatcgccgc ctatacgatc 4326541 tagctggggg ccgtagtcac taagaaattt ttcgagatta gtgtaggcga tccttgtccc 4326601 tggaaccgca ccattgttag gcggaaaatt agagtactgc tggccccatg ggcctacact 4326661 attaaatcgc tcttgatacc gttcttgggt atagggctgt ccctgaggat cgcggccgaa 4326721 tggggcgttg gggtcctcca tgagctgggc cacaaccggg tttatccgac tgcgatcggc 4326781 cggattgtct gtaaagtccc agtggcgcga taatggctcg ccatactgcg ggtcaaccgc 4326841 ttcgtcggat aaccgatgcc aaccctctcc agctggttcg ttcgaatgca ttgcaagctg 4326901 ctgctcgcga tgcggcgcat gcagcccagg tggctcgcta ccgctaccgt ggcctgaccc 4326961 gtgagacgca ccgtcgtggg tcggctcgga tccgtgtgcg ctcagtgatc ggccgtgagg 4327021 ccccgattcg gttgagtgaa cgcctccggg cgtggaatgc ggcgccgccg cgggtgctgc 4327081 tggtgtggtc gcccactggg gttggtgcgc ggtagcgggc gctgactcga caggaggtcc 4327141 gccaagcaac gtcgtcgccg gtggtgcttg cgccgggaca tgttcacccg gttgcggcag 4327201 gccatgcggc acatgtgtgc cgggcgtggt ggctgcggat acccggggct ggcctgccga 4327261 cgccgacgac ggcgccaccg gttcagccgg tctgtcgacg ggcggcggtt tggattcggt 4327321 ggggctgtgc ggcagtggac cgttggcggg cacgggcgcc ggtttcgccg cgggcgcggg 4327381 tgccgggtgg cccgattctg gtggttcgat ccgtggtggt tgcggtcctg gccgcggcgg 4327441 cgttgctggg ggctcaaggt gcggtgtcgt cggctcaagc cgctccttga ggcctcgcac 4327501 gcccgcgaga atgtcgcggc ccttgctgcc aagtttcgac agcggcccgc ccggcaaagc 4327561 tagcgtcgcg gcgtcgaata cggtcttgcc tagcgcctca ttagggttgg tcgtccactc 4327621 atcccaatgg atgaggcttt tgccgaactg cttccacgac tccacaacgc cgggagcgtt 4327681 ctcgccgccc aggcccgcca gcggcgccat cccagtcagc atctcctccc aggagcgata 4327741 ccacccgaac gggtctatcg aggcgcgcag tggccctagg tcccaggagt ccttggccat 4327801 cccgaaggcc tcctcgccga agcctttgag ctgctgcccg gtgccatcga tgaccacacc 4327861 caccgggttg ctgtgcaaga accgatccca ttgtttgcct gcgtggtctg ccatcgcggt 4327921 gatcaccgcc tcggcgtgcg acaccaccgc ggtgatctcc gcagccaacg cgtccacttc 4327981 cccgctgaac tggtcgacca ccaccgcgat gtcatgggcg atgcgctgga tctcgtcttc 4328041 gtcctggtcg gtcagaaact cccacacctc tttgatcccg gtcagcggat cgcagatgcg 4328101 ggccaacaaa tccaggaccg ccgcatgcac cgcgtcgatg cgggcggcat aggcgtctag 4328161 ctgggccgcc agctggtggc attggcccac gacagcggtg gtgctggcgt acgcgtcagc 4328221 aaacgccgac tcgatcagcc ccgcctccgg gagctgctgg gcgcgaataa cgcccatcgg 4328281 ccccgccgtc gactgaatct cagtcagcgc gaactgcgtg cccgcgctgc gccacgccac 4328341 agccgccgca cgtagctttg tcgaatcccc gttcggccag atcatcccga tatacggggc 4328401 cacccacccc cagcccttcg gggcgccacc gccgccaccg accgccgacg gcggcgcacc 4328461 cacgccgaca cagccgctcg gcggcggcgc cggcaacggc gccgcccgcc cagcgacatc 4328521 cgacatcgcc tcggccaacg agtagttgtg cgcgctcatg cgcaccccat cgccgaggtt 4328581 gcacaatccg ttgcgcgcca ccgacatcgc ctgcaccagc gcggccgccg aaccgtcata 4328641 ggagcgcccg aacaccgccc cagccggatc atcaccggcc atccccgcac acccggccag 4328701 cgccgcggtc agcgacgaga tcaccgcacc caaacccgca cccgcagcca ccaccgcgcc 4328761 gcccgcgcta tcaagggccg cgggatcgac cgccaacggc gccatcagct cacgaccaca 4328821 tacccaaatt cgtggccacc gcgccggtgt agttggcgtg cgcgctctgc cccgcggccg 4328881 tgagctgggc caacgcctgg cgcatcatcg cctcaccggc agcccaatgt cgttgcgcct 4328941 cagcatgagc cgccgcgccc tcccccgtcc acgtcacatg cagccgggta accaaggact 4329001 caatctcggc gaccagctcc tcgacgtggc gaccgaattc ggccatccgc gccaccgcat 4329061 cagccaacac ggtcggatcc acccgaaacg gctcagccac cgcccacctc acgaagcacc 4329121 tgcgccgacg cggtctcgtt gtgttgataa cccgcaccgg cgtgagctat cgccgccgcc 4329181 agcatcgaca atcccagctg cacctcaccg gccccgcgat gccatagctc ctacgccgag 4329241 ccatacgcac tgcccgacgc cccgcgccac ccgcccaaca tctgcccgac ctgagcgtcc 4329301 agctcggcca gttgaaccgc gagatgctcg gccgctccat ccaacgacgc ggcgaaaccc 4329361 tgcatcaccg caggctctac gcgcagcgtg tcgtcggcac ccatggccgc aacctaacaa 4329421 tgcccaggca ccgccacaat tcagccgccc gggcgcaccc gccgcagccc taaaggctgc 4329481 tggcgccgtc ggcggtgccg tcgccgtcgg tgtcggtcag ccgtacgtcc cagcggccgt 4329541 cgccatcggt gtcgacgtat ccggtcacac gctgctcacc agcacacagc acccgatcgg 4329601 ccagcccgtc accgtcggta tcgagtagcc ggtcgtctaa accaccgaac ccgtcgaagt 4329661 caaccagtgg accaccggtg tgctcgacgc cgtcgagccc ataccagcgc agttgtccgc 4329721 cgcggtcgac ggcgaccgcc caggtccccg atccgtcgtc gatgaagtag ctttccgggg 4329781 tgccgtcgtt gtcgacgtcg aatacggcgt ggtcggcaac gtcgtcgccg tcgaagtcgg 4329841 ccagcgcgtc atcgcgcaga ccgtcgccgt cgagatccag gccaatcgcg tccagccggc 4329901 cgtcaccgtc gaggtcgacg tcgaacgggc ggttccagat cccggcgctg ccgtcgtcgc 4329961 cggctatgca gtactccaca accgttctga cgcgactccc aagctagcgg ttcccccgtg 4330021 atttccacca ggacagcagc tcggttgtcg cctcctcggt ggacaacggg ccgcgctcta 4330081 gccgcagctc cttcaagtag cgccacgcct cgccgacttg cgggcccgcc ggaatgtcga 4330141 gcaccgccat gatctggttg ccgtccaggt cggggcgcac ccgatccaga tcctcctggg 4330201 cggccagctc cgcgatccgc tcttccagcc ggtcgtaact ggcctgcaac cgcgcggccc 4330261 ggcgcttgtt gcgggtcgtg cagtcggcgc gcaccagctt gtgcagccgt ggcagtaggg 4330321 ccccggcgtc ggtgacatag cggcgcaccg cagagtcggt ccatttccca tcgccgtagc 4330381 cgtgaaaccg cagatgcagg tagaccagct gcgagatgtc gtcgatcatc tgcttggaat 4330441 acttcagcgc ccgcatccgc ttgcgcacca tcttggcgcc gaccacttcg tggtgatgga 4330501 agctcacccc accgtcgggt tcgtgacggc gggtggcggg cttgccgatg tcgtgcagca 4330561 gcgccgccca gcgcaacacc agatccgggc cgtcgtcctc cagcgcgatc gcctgccgca 4330621 gcacggtcaa ggaatgctga tagacgtcct tgtgctggtg atgttcgtcg atcgccatcc 4330681 gcatcccacc gatttcaggc aagaccacag cacccatacc gctctgcacc atcaggtcga 4330741 tacccgcggc cggatcctca ccgaccagca gcttgtccag ctcggcggcc acccgttcgg 4330801 cgctgattcg ggccaactgc ggcgccatct cttcgatcgc cgcgcgcacc cgcggcgcca 4330861 ccgcgaatcc aagttgcgag acgaaccgcg cggcgcgcag catccgcaac ggatcgtcgc 4330921 caaaggaccc cgacggcgcc gccggggtgt ctaacacctt ggcccgcagc gccgccaagc 4330981 caccaagcgg atccaggaat tcgcccggcc cagtggcggt gacgcgcaca gccattgcgt 4331041 tcgtggtgaa gtcgcggcgg accagatcgc cctcgaggca atcgccgaaa cgtacctctg 4331101 gatgacgcga aacccggtcg tagctgtcgg cacggaatgt ggtgatctcc atgcggtggt 4331161 cgctcttacc cacgccgacg gtgccgaatt cgattccggt atcccacacc gcatcggccc 4331221 acggccgcac gatctcctgc acccgctcgg gacgggcgtc ggtggtgaag tccaggtcgg 4331281 ggctcaaccg gcccaacagt gcatctcgca ccgaaccgcc gaccagatac aactcgtgtc 4331341 ccgcggcggc gaacaccgac ccgagttccc gcaataaggc agcatgcctg ttcaaggcaa 4331401 ccgcagcggc ggttagcaga tcggcttcct ggacggcttc cggcacgttc gatcagccta 4331461 atggcagtcg aagtgggccg ggacggtcgg tggaggaacc ggcaaccctc gttgccgcac 4331521 ccgtcgcatt ggccggtgtc gggacgaggt gtcgtcgtgc ccatctccgc gcgacaaaca 4331581 gccggcgaca atattaagaa tccttgggtg cggtcgcgtc ttgtcgctcg aaggtgggca 4331641 aatcgtgcgc ccccgacaca gcgacttctg tgatagatgt gactggcgcg actcaattgg 4331701 tcagcgcggg tcgcctgcac cgccccgctc cctcgcccaa cgaataagtc ctggccgacg 4331761 atgggcgctc agacggcgag tacatcggga acacccgccc gtaccagcta ctatcgctgg 4331821 ggtgtccgac ggcgaacaag ccaaatcacg tcgacgccgg gggcggcgcc gcgggcggcg 4331881 cgctgcggct acagccgaga atcacatgga cgcccaaccg gccggcgacg ccaccccgac 4331941 cccggcaacg gcgaagcggt cccggtcccg ctcacctcgt cgcgggtcga ctcggatgcg 4332001 caccgtgcac gaaacatcgg ctggagggtt ggtcattgac ggtatcgacg gtccacgaga 4332061 cgcgcaggtc gcggctctga tcggccgcgt cgaccggcgc ggccggctgc tgtggtcgct 4332121 acccaagggg cacatcgagt tgggcgagac cgccgagcag accgccatcc gcgaggtcgc 4332181 cgaggagacc ggcatccgcg gcagtgtgct tgccgcgctg gggcgcatcg actactggtt 4332241 cgtcaccgac ggccggcggg tgcacaagac cgtccaccat tatttgatgc ggtttttagg 4332301 cggagagctg tccgacgaag acctcgaggt agccgaggta gcctgggtgc cgatccggga 4332361 actgccgtct cgactggcct acgccgacga acgtcgacta gccgaggtgg ccgacgaact 4332421 gatcgacaag ctgcagagcg acggccccgc cgcgcttccg ccgctaccac ccagctcgcc 4332481 tcgtcgacgg ccgcaaacgc attcacgcgc tcgtcatgcc gatgactcag caccgggtca 4332541 gcacaacggt cccgggccgg ggccgtgacc gcactgcaac tccgctgggc cgctttggcg 4332601 cgcgtcacct cagcgatcgg cgtcgtggcc ggcctcgcga tggcgctcac ggtaccgtcg 4332661 gcggcaccgc acgcgctcgc aggcgagccc agcccgacgc cttttgtcca ggtccgcatc 4332721 gatcaggtga ccccggacgt ggtgaccact tccagcgaac cccatgtcac cgtcagcgga 4332781 acggtgacca ataccggtga ccgcccagtc cgcgatgtga tggtccggct tgagcacgcc 4332841 gccgcggtca cgtcgtcaac ggcgttacgc acctcgctcg acggcggcac cgaccagtac 4332901 cagccggccg cggacttcct cacggtcgcc cccgaactag accgcgggca agaggccggc 4332961 tttaccctct cggccccgct gcgctcgctg accaggccgt cgttggccgt caaccagccc 4333021 gggatctacc cggtcctggt caacgtcaat gggacacccg actacggtgc gcctgcgcgg 4333081 ctcgacaatg cgcggttcct gttgcccgtg gtcggagtgc cacccgacca ggccaccgac 4333141 ttcggctccg ctgttgcacc agaaacgacg gcgccggtct ggatcaccat gctgtggccg 4333201 ctggccgacc ggccccggtt ggcccccggg gcacccggtg gcaccgttcc cgtccggctg 4333261 gtcgacgacg acctggcaaa ctcgctggcc aacggcggcc ggctggacat cctcctgtcg 4333321 gcggccgagt tcgccaccaa ccgggaagtc gaccccgacg gcgccgtcgg ccgagcgctg 4333381 tgcctggcca tcgacccaga tctactcatc accgtcaatg cgatgaccgg cggctacgtc 4333441 gtgtccgact cgcccgacgg ggccgctcaa ctaccgggca ccccgaccca cccgggcacc 4333501 ggccaggccg ccgcatccag ctggctggat cgattgcgga cgctagtcca ccggacatgc 4333561 gtgacgccgc tgccttttgc ccaagccgac ctggatgctt tgcagcgggt taatgatccg 4333621 aggctgagcg cgatcgcaac catcagcccc gccgacatcg tcgaccgcat cctggatgtc 4333681 agctccaccc gcggcgcaac cgtgctgccc gacggcccgt tgaccggccg ggcgatcaac 4333741 ttgctcagca cccacggcag cacggttgcc gtcgcggccg ccgattttag ccccgaggaa 4333801 cagcagggtt cgtcccagat cggctccgcg ctcttacccg ctaccgcgcc ccggcggttg 4333861 tccccgcggg tggtagcggc gccgtttgat cccgcggtcg gggccgcgct ggccgccgcg 4333921 ggaacaaacc cgaccgttcc tacctatcta gatccctcgt tgttcgttcg gatcgcgcat 4333981 gaatcgatca ccgcgcgccg ccaggacgcc ttgggcgcaa tgctgtggcg cagcttggag 4334041 ccgaatgccg cgccccgtac ccaaatcctg gtgccgccgg cgtcgtggag cctggccagc 4334101 gacgacgcgc aggtcatcct gaccgcgctg gccaccgcca tccggtctgg cctggccgtg 4334161 ccgcgaccac taccggtggt gatcgctgac gccgcggccc gcaccgagcc accggaaccc 4334221 ccgggcgctt acagcgccgc tcgcggccgg ttcaatgacg acatcaccac gcagatcggc 4334281 gggcaggttg cccggctatg gaagctgacc tcggcgttga ccatcgatga ccgcaccggg 4334341 ctgaccggcg tgcagtacac cgcaccacta cgcgaggaca tgttgcgcgc gctgagccaa 4334401 tcgctaccac ccgatacccg caacgggctg gcccagcagc ggctggccgt cgttggaaag 4334461 acgatcgacg atcttttcgg cgcggtgacc atcgtcaacc cgggcggctc ctacactctg 4334521 gccaccgagc acagtccgct gccgttggcg ctgcataatg gcctcgccgt gccaatccgg 4334581 gtccggctac aggtcgatgc tccgcccggg atgacggtgg ccgatgtcgg tcagatcgag 4334641 ctaccgcccg ggtacctgcc gctacgagta ccaatcgagg tgaacttcac acagcgggtt 4334701 gccgtcgacg tgtcgctgcg gacccccgac ggcgtcgcgc tgggtgaacc ggtgcggttg 4334761 tcggtgcact ccaacgccta cggcaaggtg ttgttcgcga tcacgctatc cgctgcggcc 4334821 gtgctggtaa cgctggcggg ccggcgcctt tggcaccggt tccgtggcca gcctgatcgc 4334881 gccgacctgg atcgccccga cctgcctacc ggcaaacacg ccccgcagcg ccgtgccgta 4334941 gccagtcggg atgacgaaaa gcaccgggta tgagaccctc ccctggagag gtgcccacgg 4335001 catcgcagag gcagcccgag ctgtccgacg cggcgctggt atcgcactcc tgggcaatgg 4335061 cattcgcgac gctgatcagc cggatcaccg gctttgcccg gatcgtgctg ctggccgcga 4335121 tcttaggtgc ggcgctggcc agctcgttct cggtggccaa ccagctgccg aacctggtcg 4335181 ccgcactcgt gctggaggcc accttcaccg ccatcttcgt accggtgctg gcccgcgccg 4335241 agcaggacga cccggacggc ggcgcggcgt tcgtgcgccg tttggtcacg ttggcaacca 4335301 ccctgctgct gggcgccacc acgctgtcgg tgctggccgc gccactgctt gtgcggttga 4335361 tgctgggcac aaacccacag gttaacgagc cgctgaccac ggcgttcgct tacctgctgc 4335421 taccgcaagt cctcgtctac ggcctctcgt cggtattcat ggcgatcctg aacacccgca 4335481 atgtgttcgg gccgccggcc tgggcgcccg tcgtcaacaa tgtcgtcgcc atcgcgaccc 4335541 tagcggtgta tctggcggtc cccggcgagc tttcagtcga tccggttcgg atgggcaacg 4335601 ccaagctgct ggtgctcggc atcggcacca ccgcaggcgt gtttgcacag accgcggtgc 4335661 tgctggtggc catccggcgc gagcacatca gcctgcgccc cctgtgggga atcgatcagc 4335721 ggctcaagcg ctttggcgcg atggccgccg cgatggtgct ctatgtgctg atcagccagc 4335781 tcggcctggt ggtcggtaac cggatcgcca gcacggcagc ggcttccggc cccgcgatct 4335841 acaactacac ctggctagtg ctgatgttgc cattcggcat gatcggcgtg acggtgctga 4335901 ccgtggtgat gccgcggctg agccgcaatg ccgcggccga cgataccccg gccgtgctcg 4335961 ccgacctgtc gctagccacc aggctgacca tgatcacgct gatcccaacg gtggcgttca 4336021 tgacggtcgg cggtccggcg atcggtagcg cgctttttgc atacggcaac ttcggcgacg 4336081 ttgatgccgg gtacctgggg gcggcgatcg cattgtcggc gttcacgttg atcccctatg 4336141 cgttagtgct gttgcagcta cgcgtgttct acgcccgcga gcagccgtgg acaccaatca 4336201 cgatcatcgt ggtcatcacc ggcgtcaaga tcctcggctc gctgctggcg ccgcatatta 4336261 ccggtgatcc ccagctggtc gcggcctatc tcgggctggc taacggactc ggatttctcg 4336321 ccggcacgat cgtcggctac tacatactgc gtcgggccct gcggcccgac ggcggccagc 4336381 tgatcggcgt cggcgaggcg cgaaccgccc tggtgaccgt cgccgcgtcg ttgcttgccg 4336441 gactgctggc acacgtggcc gatcggttac tagggctaag cgagctgacg gcccacgcgg 4336501 gcagcgtcgg ttcgctgctg cggctgtcgg tgctggctct catcatgctg ccaattctgg 4336561 ctgcggtcac cctctgcgca cgggtgcccg aggcgcgggc ggcgctggat gccgtgcgag 4336621 cccgaatcag gagccggcgc ttgaagaccg ggcctcagac ccagaatgtc ttggatcaat 4336681 cgtctcgccc cggaccggtc acgtaccctg agcggaggcg tttggccccg ccgcggggga 4336741 aaagtgtggt ccacgagccg atccggcgca ggcctccgga gcaggtagcc agagccggga 4336801 gagcgaaagg accggaggtg atcgaccgcc catcggagaa cgcctcgttt ggtgccgcgt 4336861 cgggtgccga gctgccgcgg cccgtcgccg acgagcttca gctcgacgcg ccagccggcc 4336921 gtgaccccgg ccccgtttcc cggccgcacc catccgacct gcaaaacggc gatctgcccg 4336981 ccgatgcggc ccgtgggccg attgcgttcg acgcgctccg cgaaccggac cgagaatcgt 4337041 cggccccccc agatgatgtg cagctggttc ccggcgcccg catcgctaac ggccgctacc 4337101 gcctgctgat cttccacggg ggtgtaccac ccctgcagtt ctggcaggcg cttgacacag 4337161 cgctggaccg ccaggtggcg ctgaccttcg tcgacccgca gggcgtcctg cccgacgacg 4337221 tcctccagga gaccttgtcc cgtacgttgc ggctcagccg gatcgacaag cccggtgtcg 4337281 cccgagtgct tgacgtcgtg cacacccggg ccggtggtct ggtagtcgcg gagtggatcc 4337341 gcggcggttc gttacaggaa gtcgccgaca cctcaccgtc gccggttggc gccatccggg 4337401 cgatgcagtc cctggccgcg gccgcagatg ctgcccaccg cgccggtgtt gcgctgtcga 4337461 tcgaccatcc cagccgggtg cgcgtgagca tcgacggcga cgtcgtgctg gcctacccgg 4337521 cgaccatgcc ggacgccaac ccgcaagacg acatccgcgg catcggcgcc tccctgtacg 4337581 ccctgctggt caaccggtgg ccgctgccgg aggccggcgt gcgcagcggg ttggcacccg 4337641 ccgagcgcga caccgctggc cagcccatcg aacccgccga catcgaccgt gacatcccct 4337701 tccagatttc cgcggtggcg gcccggtcgg ttcaaggaga cggcgggata cgcagcgcgt 4337761 caacgctgtt gaatctaatg cagcaggcga ccgcggtggc cgatcgcacc gaggtgctgg 4337821 gaccgatcga cgaagcaccg gtctccgcgg ccccgcgcac atccgcgccc aacagcgaaa 4337881 cctacacccg ccgccgtcgc aacctgctga tcggcatcgg cgcgggtgct gccgtcctca 4337941 tggtggccct gctggtcttg gcttcggtgt tgagccggat attcggcgat gtcagcggcg 4338001 gcctcaacaa ggacgaactg ggcctcaacg cacccaccgc gtcgacctcg gcggccagtt 4338061 cggcgccgcc cggcagcgtc gtcaaaccca ccaaggtcac ggtcttctcc cccgacggcg 4338121 gcgccgacaa ccccggggag gctgatttgg ccatcgacgg caatccggcc acttcctgga 4338181 agaccgacat ctataccgac cccgtcccgt tccctagctt caagaacgga gtcggtttga 4338241 tgttgcagct gccccaggcc acggtggtcg gcaccgtcgc catcgacgtg gccagcaccg 4338301 gcaccaaggt ggagatccgc tcggcatcca cgccgacgcc ggcaacgctg gaggataccg 4338361 ccgtgttgac ttcggccacc gcgctgcggc ccggccacaa caccatctcg gtcgaggcgg 4338421 ccgcgcccac ctcgaatctg ctggtgtgga tctctacctt gggaaccacc gacggaaaga 4338481 gtcaagccga catctcggag atcacgattt acgccgcgtc ctgaccgggc cgggcacggc 4338541 cagccagggt gaagtgctat gccgccaccg attggttact gtccggccgt gggtttcggg 4338601 ggccgtcacg agcgcagcga cgccgagctg ctggccgccc atgtcgccgg cgaccggtac 4338661 gccttcgatc agttgttccg ccgtcatcac cgccagctac accggctcgc gcggctcacc 4338721 agccggacct ccgaggacgc cgacgatgcg ctgcaagacg cgatgctgtc agcgcaccgc 4338781 ggcgccggct cgttccggta cgatgccgcc gtcagcagtt ggttgcaccg catcgtggtc 4338841 aacgcttgcc tggaccggct gcgtcgggcc aaagcccatc cgaccgcccc tctagaagat 4338901 gtctatccgg tcgcggaccg gaccgcgcag gtcgagaccg cgatcgcggt gcagcgggca 4338961 ctgatgcggc tgcccgtcga gcagcgggcc gcggtggtcg ccgtggacat gcagggctat 4339021 tcgatcgccg acacctcccg gatgctgggc gtggccgagg gcaccgtcaa gagccgctgc 4339081 gcccgggcgc gggcccgcct agcgcggctg ctgggctatc tcaacaccgg ggtgaacatc 4339141 cggcgctgac cccgttgccg gtccgtcgta gcatcgatcc acgggctcgc cgctacccca 4339201 catctggcta ttgccaccgg gcatgacgga cactggggcc gatgagtgca gccgacaagg 4339261 atccagacaa acatagcgcc gatgcggacc cgccgctgac cgttgagctg ctggccgacc 4339321 tgcaagcagg tctgctggac gacgcaaccg ccgcccgcat ccgcagccgg gtccgctcag 4339381 acccgcaggc tcagcaaatc ctgcgcgcgt tgaaccgggt acgccgcgat gtcgccgcga 4339441 tgggtgccga ccccgcttgg gggccagctg ctcgcccagc ggtcgtcgac agcatttcgg 4339501 cggccttacg gtcggcgcgc ccgaacagct cacccggcgc cgctcacgcc gcccgtccgc 4339561 acgtccaccc cgtccgaatg atcgccggcg cggccggatt gtgcgccgtg gccacagcga 4339621 tcggtgtcgg cgccgtggtc gatgcaccgc cacccgcacc gagtgcaccg acaaccgcgc 4339681 agcacatcac ggtgtcaaaa cctgccccgg tgattccgct gtctcggccg caggttctcg 4339741 acctgcttca ccacaccccg gactatggcc cacccggagg cccgctgggc gatccgtccc 4339801 ggcgtacgtc ctgcctgagc ggcctcggct atccggcgtc cacgccggtg ctgggcgcgc 4339861 agccgatcga tatcgacgct cggcccgccg tactgctggt gatacccgcg gacacgcccg 4339921 acaaactggc cgtttttgcg gtcgcgccgc actgcagcgc cgccgatacc gggttgttgg 4339981 ctagcaccgt ggtcccccgc gcatgatggg tctgggtgct gtcgctcgcc tgcgggaaca 4340041 gcagtgccta cgctggcgtt cgttgtctca agatctgccc tcgcactcga aaggctcgca 4340101 tgaccgcccc gcctgtccat gaccgcgcac accaccccgt tcgcgacgtg atcgttatcg 4340161 gctccggtcc cgcggggtac actgcggcgc tctacgccgc ccgtgcccag ctggcgccgc 4340221 tggtcttcga gggcacgtct ttcggcggcg cgctgatgac caccaccggc gtggagaact 4340281 acccgggatt tcgcaacggc atcaccggtc cagagttgat ggatgagatg cgggaacagg 4340341 cgctgcgatt cggcgcggac ctgcgtatgg aagacgtcga gtcggtatca cttcacgggc 4340401 cgctgaaatc ggtcgtcacc gccgacggac agacccaccg ggcccgagcc gtgatcctgg 4340461 caatgggcgc agcggcacgc tatctgcagg tgcccggcga acaggaattg ctcgggcgcg 4340521 gggtgagctc gtgcgccacc tgcgacggat tcttcttccg cgatcaggac atcgccgtca 4340581 tcggcggcgg tgactcggca atggaggaag ctaccttcct gacccgattc gctcgcagtg 4340641 tgacgctggt gcatcgccgc gacgagttcc gggcttccaa aatcatgctc gatcgcgccc 4340701 gcaacaacga caagatacgg ttcctcacca accacaccgt ggtcgcggtg gacggggaca 4340761 ccacagtgac cggcttgcgg gtacgcgaca ccaacaccgg tgccgaaacc accctgccgg 4340821 taaccggtgt tttcgtcgcg atcggccacg agccgcggtc gggcttggtg cgcgaggcca 4340881 tcgacgtcga cccggacggc tacgtgttgg tgcaggggcg taccaccagc acctcactgc 4340941 cgggcgtgtt cgctgccggc gacctggtgg atcgcaccta tcgccaggcg gttaccgcag 4341001 cgggcagtgg ctgcgccgcg gctatcgacg ccgagcgctg gctcgccgag cacgcagcaa 4341061 ccggagaagc tgacagtacc gacgcattga taggagcaca acgatgaccg attccgagaa 4341121 gtccgccacc atcaaagtta ccgacgcatc ctttgccacc gacgtgctat ccagcaacaa 4341181 gcctgtgctg gttgactttt gggcgacatg gtgtggacct tgcaagatgg tagcgcccgt 4341241 tctcgaggaa atcgccaccg agcgcgcaac agacctcacc gtcgccaagc tcgacgtgga 4341301 caccaacccg gagaccgccc gcaacttcca ggtcgtctcg atccctaccc tgatcttgtt 4341361 caaggacggc cagccggtga aacgaatcgt tggcgccaag ggtaaggctg cgttgctgcg 4341421 cgagctctca gacgtggttc ccaacctcaa ctagcccccg cggttagcct ggggttttcc 4341481 cgaaatcggc aaggatctgc gacaataccg gttggctggt ccgcattgtc aacgatgtga 4341541 gctaatcccg gagggccctt ggtatgccga gtccgcgccg cgaagacggc gatgcgctgc 4341601 gctgtggcga ccgcagtgcg gccgtcaccg agatccgggc tgcgctgacc gcgttaggga 4341661 tgctggatca tcaggaagaa gacctgacga cgggccgtaa cgtcgccctt gagttgttcg 4341721 acgcgcagct cgaccaggcg gtccgtgcct tccaacagca tcgcggcctg ctggtggacg 4341781 gcatcgtcgg tgaggccacc taccgcgcgt tgaaagaagc ctcctaccgg ctcggggccc 4341841 gcacgctgta ccaccaattc ggcgccccgc tctacgggga cgacgtcgct acactgcagg 4341901 cccggctgca ggatcttggt ttctacaccg ggctggtcga cggtcatttc gggttgcaga 4341961 cccacaatgc gttgatgtcc tatcagcgtg agtacggact tgccgcagac ggtatctgcg 4342021 gcccagaaac gttgcgctcc ttgtactttc taagttcgcg agtcagcggt ggctcgccac 4342081 atgcgattcg cgaagaagag ctggtccgca gctcggggcc gaagctgtct ggcaaacgga 4342141 tcatcattga tcccggtcgc ggcggcgtgg accacggact tatcgcgcaa ggtccggctg 4342201 ggcccatcag cgaagcagac ttgttgtggg acttggcaag tcggctcgaa ggacggatgg 4342261 cagctatcgg tatggagacc cacctgtccc gtccgaccaa ccgtagtccg tccgacgcag 4342321 agcgtgccgc caccgccaac gccgttggcg cagacctgat gatcagcctg cgctgcgaga 4342381 cccagaccag tctcgcggcc aacggcgtgg cttcctttca cttcggcaac tcgcacggct 4342441 cggtgtctac catcggccgc aatcttgccg atttcattca acgagaagtg gtggcgcgca 4342501 ccggtttacg ggattgccgt gtgcatggtc gaacgtggga tctgttgcgg ctgaccagga 4342561 tgccgaccgt tcaggtcgat atcggctaca tcaccaaccc ccacgatcgt gggatgctgg 4342621 tctcaacgca gacgcgcgat gccatcgccg aaggcattct cgccgcggtc aaacggctgt 4342681 atctgttagg caagaacgat cggcccaccg gcacattcac tttcgccgag ttgctggccc 4342741 acgaactgtc tgtcgagcga gcgggtagac tcggcggttc ttaagcccag tggccgcgtg 4342801 gggtttacga cgtgttgccg gccgtcgacc ccgctgctat cggctcttgc agtcgagcat 4342861 tctccagcaa gcgttcaaga gcggcctcga cttcggcttt ccaccccagc cctttgtcca 4342921 gttcgaggcg tagcctcgga aagtacgggt gcggtgccac cacgacgaaa cccacgtcca 4342981 tcaagaagtt cgcgtcgatg atgcagtgtt cgacacagca gtcgccgagg gcctccaaca 4343041 ccggccgcac atcaggtgtt accgcgcccg ggttttgcaa atcggtggcc gctggtgtcc 4343101 ggccgaaagc ttccagcgcc cggacgccgc gccgaaccaa ctcttcaatc acccgggcaa 4343161 tcagactgtg cggtaagtcg tcatctgctt gcccgcgctc gatgcccatc gacgtaagca 4343221 gcaccgcgtc cgccgacacc ggcgcggtag gaaaccgctg ggctcgcggc accgcactgg 4343281 gcggagcgta gagcacatac ccgaggcagg gtggttcggc gtggctgcgc tcatccggga 4343341 ctgccgttgc gacctgaccg cacgaacccc actccagcat caccatcgac aaccaggctt 4343401 ccttttcgaa ttcggggtcg gcgaggtggt cgtctttgcc gagaatcgcg gggtcgacct 4343461 cccagaaaac gcagcgccgc gcatgcttgg ggagctgctc gaaggcttcg agtcgtaacg 4343521 ctgtgatacg agcggacact agtctcctgg cctccgtgcg gcattgcaac cgatggccct 4343581 acacctccgc gggccaatgt gcaccagcaa cccttctaga ataagagagt cgatcgctat 4343641 cgggccagta ttcgcgatgc cactccagcc gacttgcacc gcatcgtgtc cggccggtga 4343701 caattgtccg gtccattgcc ccgtccaatc tcgaatccgc ttgccgcaca ccgcgtctcc 4343761 gttgattccc gctccccgca gcgggttggc ttaggcgccg gaaccggcgc gttgtcacag 4343821 tgacgtaatt acagagcgtc cctgtgcagg cctttatctc ggccatcagt ggtcatcaaa 4343881 ccgactatgc gcgctaaatc atcgaccgag ccgaactcca ccacaatctt acccttgcgt 4343941 ttgcccagac tgacggtcac ccgcgtgtca aaggtggtcg atagacgctc agcaacatct 4344001 tggaggccag gcatctgaat cggcttacgc cgcggcggcg cgggtgtagt cgcgtcgctg 4344061 tgatgggctt ggcgattggc ctcgtgattg gccagcgtga ccgtctcctc ggtggctcgc 4344121 accgacaggc cctccgcgac gatccggctc gccagctcct cttgcgcctc cggtccggcc 4344181 tcgagcgaca gcagggcgcg agcatgcccg gccgacagca cgccggcggc cactcgccgc 4344241 tgtaccggga tggggagttt gagcaatcgg atcatgttgg tgatcaaggg ccgcgagcgg 4344301 ccgatgcgcg ccgccagttc atcgtgggtg accccgaatt cgtcgagcaa ttgctggtat 4344361 gccgccgctt cttctaacgg attcagctgt actcgatgaa tattttccag gagggcgtcg 4344421 cgcagcagat tatcgtcgcc ggtctcacgc acgatggccg ggatggtggc caagcccgcc 4344481 tcttgggcag cccgccagcg ccgctccccc atcactatct ggtagcgcac gccggtttgg 4344541 gatccagcca atgaccgcac cacgatcggc tgcaggagac cgaattcgcg gatggagtgc 4344601 accaactcgg ccagtgcctc ttcgtcgaac acctgacgcg gctgacgggg attagcctcg 4344661 atggcgctcg gtgggatttc ccgatagatg gcgcccatca cggaagtgtc cgggaccggt 4344721 ccgccgatta cgacatctgc cgtggcagat cccatccggg gacccaaggt cggtggcccc 4344781 gattctccgt ctgccgggcc agtcgggatc agcgcagcca ggccacggcc gaggccaccc 4344841 tttctgcgtg acggctgggt catggtcgtc ccttcgcgga tggtggtcgg tcacgctcgg 4344901 caagttcgcg gctcgcgtcg aggtaactca tcgcgccgcg cgaaccggga tcgtaatcga 4344961 tgatggtcat gctgtagccc ggcgcttcgg aaaccttgac gctgcgtgga atcaccgtcc 4345021 gcaacacttt gcttccgaaa tactgacgga cctcgtcggc tacttgatcg gcgagctttg 4345081 tccggccgtc atacatggta aggatcacgg tggtgacctc gagttggggg ttgaggtggg 4345141 ccttcaccat ctcgatgttg cgcataagct gcgacacacc ctccaacgcg tagtactcgc 4345201 attggatcgg gatcatcacc tccggtgccg cgacgagtgc gttgatggtc agcagcccca 4345261 gcgagggcgg gcaatcgacg aaaacgtagt cgaagtcgaa gttgtcgagt gcggccaggg 4345321 cggtgcgcaa ccggttctcg cgcgccacca tgctcaccaa ttcgatttcg gcgccggcca 4345381 gatcgatcgt cgccgggatg cagaacagcc gctcgctgtg cgggctgcgc cgtagcgccg 4345441 tgtgcaacga aacctcgccg ataagcatct cgtaggacga gggtgtgccg gattgccggt 4345501 cggtgatacc caatgcggtg ctcgcgttgc cctggggatc gagatcgatc acgagtgtct 4345561 tgaggccctg cacagcaagc gcggcagcga tattgacggc ggtggtcgtc ttaccgaccc 4345621 cgcccttctg attcgcgatg gtgagcaccc ggcgtcgacc cggccgctgc agcggctcgt 4345681 gggtggtgtg caggacccgc atcgcacgtt ctgctgcagc gccgatgggg gtgtcgaatt 4345741 ctgtcgatgt ttcacgtgaa acattcatcg tcggattgtg cgcggcctca ggcgtcggtg 4345801 tcggtggtgt catttcccgc tggaatggtt cgatagttga agcctggccc gaccttacga 4345861 gcgcggacgg tccagcggcc accgggcccc acggagcact cacgccgtcc ctccactcgc 4345921 catccgtgcc gaccctcggg cgatctgctt tccacgtcgc gcgaacacca cggtcgcggg 4345981 cggacgcaaa tagttcgcgc cacatgtcac caccctgaca tcaaccgcgc ccgatgcgat 4346041 catcacacgc cggtgctccc gtacttcgtc gtgagcccgc tcgcctttga tggcgagcat 4346101 tcgcccgttc ggccgtatca acggcatgct ccatttcgtc aacttgtcca acgcggccac 4346161 cgcccgtgac accgcagcgt cgctgccgcc caattggtcc tgcacccagg actcctcggc 4346221 gcgcccccgc acgatctcaa cggccacgcc cagatctgtc accatctctc gaagaaactc 4346281 ggtgcggcgc agtagcggtt ctaggagaac tacctggagg tccggccgcg ctatcgccaa 4346341 tggcacgccc ggcaacccgg ctccgctacc gatatccacg acccggtcac cgcgttcgag 4346401 gagctcaccg atcacggcgc agttcagtag atgccggtcc catagcctac cgacttcgcg 4346461 gggtcccacc agcccccgct ccacaccggg tcccgccaac gcttcggcgt accgccgagc 4346521 aaggccaagc cgcggtccga agatcgcaga cgccgcgggc tcgatcggag acattacgca 4346581 ctccgccggc tcgtgaggtc tgtgtcatgt ttcacgtgaa acattctccg ctctcgagac 4346641 gctggcccag ccgctcggcc acgcatcgct tactgcggcg tcggtcggag ccgctggctc 4346701 gcgagctagt cgcggagcac aacgactcgg cgttctggct ccacgccttc gctttcgctg 4346761 tgcacacctg gcaccgctgc aaccgcatcg tggacgatct tccgttcgaa cggcgtcatt 4346821 ggaacgagtt cctcgcggtc accggtttcg gccactcgcc gcgccacctc gtcggccagc 4346881 gccgccaatt cctcccggcg ccgccgtcgc cacctcgcga tgtctagcat caaccggctc 4346941 cacacaccgg tcttctgatg caccgccaac cgggtgagtt cctgcagagc gtcgagcacc 4347001 tcgcccccgc gcccgaccaa cttgttcagg tcgtcactgc cgtcgatgct caccaccgca 4347061 cgattgcctt cgacatcgag gtcgatgtcg ccatcgaagt ccaacacgtc caataactct 4347121 tccaggtagt cgcctgcaat ctcgccctcg gcgaccaatc tctcttcttg atcgtcggcc 4347181 tcgtcagcat ccgtcgccgt gtcctcccgg acgcctccac ccggtgcttc tgcgtcgacg 4347241 tcgaagtcgg tggtgtcagc gtcggccatg gcttgctctc ccctcgtctg cgggtgggtt 4347301 gtgtttgtgg gaccgcctgc ccggctgccc ggaaggattg tcaacgtttg cgttttttcg 4347361 gtcgcacccc gggccgcggc gtacgggcgc tcgggccgct gttgcgtctg gccggattgg 4347421 acgtgtcggc tggtcgctca gtgctggcgt ccgactctgc cccgtcatcg gtgtccccgg 4347481 cttctgttgg ggctgccgca ttggtcgctg gagcggtctt cgggctccgc ttgggcttag 4347541 ctcccggggc cggcgcgttg gccgcccggc gccggaccgc ctcctgcttt ttggcctcct 4347601 cctccttttc gatcatgccg aagacgtaat gctgctgccc gaacgtccag atattgttcg 4347661 agaaccaata caagatgatc gccagtggca ggaacggtcc gccgacgact acgccgagcg 4347721 gaaatacgta cagcgccagc ttgttcatca tcgcggtctg tggattcgca gccgcctcgg 4347781 cgctctgccg cgcgatagac gcgcgactgt tgaagtacgt cgcgatgccg gccaagatca 4347841 tcaccggcac acccaccgcg atcaacgcag ggcgactgaa atcgacgaac gcatccaacc 4347901 cggaccgttg cgtcatgtac gccccgatcg gagcgccgaa caagttggca tctaggaagt 4347961 ggccgacgtc gaccgggcta aagacgtagt tcccagtcag tcggttctcg atcaccgaca 4348021 agtgtggttg accaaagccc ccggtcgtac ggttaaacga gcgcaacaca tgatagagcc 4348081 caagaaacac cggaatctgc gccagcatcg gcaaacatcc gagaatgggg ttgaagccgt 4348141 gctcgcgttg cagcttttgc atttcgagcg ccatccgctg acgatccttg ccgtatttct 4348201 tttgcaaggc cttgatctgt ggttgcagtt cctgcatctg cctggtggtg cgaatctggc 4348261 gcacgaacgg cttgtacagc agcgcacgca gcgtgaagac caggaacatc accgacaacg 4348321 cccaggcgaa gaagttggat ggtcctagca caaacgcgaa cagccggtac caaacccaca 4348381 tgatccacga caccgggtag tagatgaagt cgagactgaa gaaatcaaac aaaagactca 4348441 cgctcccctc gctttgacgc agggttccag tcgtcgttcg cgccgtcgac gtctgtctgg 4348501 cagctccggc ctgtcgttaa gccttccggt atcggatccc atcctccccg atgccatggt 4348561 ccgcactttg cgagcctgat catggtcaac cagcttcccc gcaacaggcc atactcggtg 4348621 agcgcatcga cggcgtactg actacaggta gggacaaagc ggcacgacgc cggtcgtagc 4348681 ggcgaaagca tgtgccgata gacctggata acgaaaatca acccccgcgc tgatgctcta 4348741 ccggtaaccc ggaccacgcg cccacagctt tgcctagaca gactcaccga tcactacctg 4348801 ccagttcgac agccctccgc aagccgcatc gcagttgctg ctccaaccga gccgaggaga 4348861 catgccggct gctcggcagc gcgcggatca ccacatgatc ggacgggtgg agttctttga 4348921 cgatcgaccc agccacgtgc cgcagccgac gtgccacgcg gtggcgttcc acggccgacc 4348981 ccaccgactt ggcgataatc agtccgacgc gcggcccacc gccactccca cgccaccaat 4349041 aaacgaccat gtcagaccgc acggtacgca tcccgtgctt caccgttgtt tcaaaatccg 4349101 ctgaccgcct catgcggttg cgtgcacgaa gcaccgcaaa taagcccggt gttgcaatca 4349161 agcactgagc gtgcgccgac ccttgcgtcg ccggctggac acaattgacc tcccggcgcg 4349221 ggtacgcatc cgtaagcgga aaccgtgaac acgagctcgc cgccggttgt tcggctggaa 4349281 ggtccttttg cccttggtca cgggcgtctc ctcgctatgt ctggcaacat caccatccgg 4349341 ccactcactg ccttccaact cgattggccc gcgggacagt cggaggtggt tttcgctgct 4349401 ggccggcgcg gtccctggac taatccaggt cgcagccgca tcgccgactt tcgggcgact 4349461 gttcgagggt acttacgcgc cttcgcctgg tcaaacctcg cccacccggc aaccgcttca 4349521 gggcatcctg cccgctaagc tgctcaccat ccgtacaccc gagaccgcca cactcacaaa 4349581 gaacccacca caacgcaaaa caacggttgg cagccgtacg gaaaactgtt agcttcgggc 4349641 ggtgtagtta tcacgccgtt tcagcgtgga aacggcactc gacaatcaag cgaggatggc 4349701 ggatcgacta gcggcccgga caacttgaac cgggtgtttt caacacgagg atcgcgagcc 4349761 gttgccggta ggttgcggct ggttatcgac ggtactgtcc acatttgtgg atagccatgt 4349821 ggacagttca cctgcccaca acaacggttg tagctcgacc cggaaccaag acccggaact 4349881 aacgagaacc agggagatac gtcg //